BLASTX nr result

ID: Cephaelis21_contig00019423 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00019423
         (1870 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2...   566   e-159
ref|XP_002528927.1| pepsin A, putative [Ricinus communis] gi|223...   545   e-152
ref|XP_002334311.1| predicted protein [Populus trichocarpa] gi|2...   540   e-151
ref|XP_002326638.1| predicted protein [Populus trichocarpa] gi|2...   540   e-151
ref|XP_002304273.1| predicted protein [Populus trichocarpa] gi|2...   530   e-148

>ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 485

 Score =  566 bits (1459), Expect = e-159
 Identities = 289/465 (62%), Positives = 338/465 (72%), Gaps = 4/465 (0%)
 Frame = -1

Query: 1663 PLTHSLSKSRSAKATPSHLLKSTSTHSAARFRQR-RQVSLPLNPGSDYTMSFTLGS---Q 1496
            PLTHSLSKS+   +TP HLLK TS  SA RF  R RQ+SLPL+PGSDYT+SF LGS   Q
Sbjct: 28   PLTHSLSKSQF-NSTP-HLLKFTSARSATRFHHRHRQISLPLSPGSDYTLSFNLGSHPPQ 85

Query: 1495 TISLYMDTGSDVVWLPCHPFECILCDGKYEPSTIPDTSPLNLTSATHVTCKSNACXXXXX 1316
             ISLYMDTGSD+VW PC PFECILC+GKY+ +     SP N+TS+  V+CKS AC     
Sbjct: 86   PISLYMDTGSDLVWFPCAPFECILCEGKYDTAATGGLSPPNITSSASVSCKSPACSAAHT 145

Query: 1315 XXXXSDLCAMANCPLDQIETSDCKKFSCPPFYYAYGDGSFIAKLYQDSLSFSLASPFFVL 1136
                SDLCAMA CPL+ IETSDC  FSCPPFYYAYGDGS +A+LY+DSLS   +SP  VL
Sbjct: 146  SLSSSDLCAMARCPLELIETSDCSSFSCPPFYYAYGDGSLVARLYRDSLSMPASSP-LVL 204

Query: 1135 KDFTFGCAHSALGEPIGVAGFGRGALSMPAQLASSSPDIGNYFSYCLVSHSFDTNRVRMP 956
             +FTFGCAH+ALGEP+GVAGFGRG LS+PAQLAS SP +GN FSYCLVSHSFD +RVR P
Sbjct: 205  HNFTFGCAHTALGEPVGVAGFGRGVLSLPAQLASFSPHLGNQFSYCLVSHSFDADRVRRP 264

Query: 955  SPLILGRYNSAGENKEKQSDQIGDGFVYTPMLKNNKHPYFYYIGLAGISLGKKRIAAPES 776
            SPLILGRY+   E K++     G+ FVYT ML N KHPYFY +GL GI++G ++I  PE 
Sbjct: 265  SPLILGRYSLDDEKKKRVGHDRGE-FVYTAMLDNPKHPYFYCVGLEGITVGNRKIPVPEI 323

Query: 775  LRTIDGKGNGGMVVDSGTTFTMLPPGIYNSIVAEFDARVGSVYKRAADVEDRTGLGPCYY 596
            L+ +D +GNGGMVVDSGTTFTMLP G+Y S+V EF+ R+G VYKRA  +E+RTGLGPCYY
Sbjct: 324  LKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNHRMGRVYKRATQIEERTGLGPCYY 383

Query: 595  FEGGGKVAPQAVVVPQMLLHFAGNSTVVMPKRNYFCEFLXXXXXXXXXXXXGCMMLMNXX 416
             +        A  VP + LHF GNSTV++P+ NY+ EF             GC+MLMN  
Sbjct: 384  SD------DSAAKVPAVALHFVGNSTVILPRNNYYYEFFDGRDGQKKKRKVGCLMLMNGG 437

Query: 415  XXXXXXXXXGLLGNYQQQGFEVVYDLEKLRVGFARRKCASLWKTL 281
                       LGNYQQQGFEVVYDLEK RVGFARRKCA LW +L
Sbjct: 438  DEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFARRKCALLWDSL 482


>ref|XP_002528927.1| pepsin A, putative [Ricinus communis] gi|223531629|gb|EEF33456.1|
            pepsin A, putative [Ricinus communis]
          Length = 493

 Score =  545 bits (1404), Expect = e-152
 Identities = 284/504 (56%), Positives = 349/504 (69%), Gaps = 14/504 (2%)
 Frame = -1

Query: 1747 MASSVYSILIIAXXXXXXXXXXXXXXVFPLTHSLSKSRSAKATPSHLLKSTSTHSAARF- 1571
            MA+S Y+ L                   PLTHSLS ++       HLLKSTS+ SA+RF 
Sbjct: 1    MATSCYAFLCFILCFSCISVSISEILYLPLTHSLSNTQFTST--HHLLKSTSSRSASRFQ 58

Query: 1570 --------RQRRQVSLPLNPGSDYTMSFTLGS---QTISLYMDTGSDVVWLPCHPFECIL 1424
                    R R QVSLPL+PGSDYT+SFTL S   Q +SLY+DTGSD+VW PC PFECIL
Sbjct: 59   HQHQKRHLRNRHQVSLPLSPGSDYTLSFTLNSNPPQHVSLYLDTGSDLVWFPCKPFECIL 118

Query: 1423 CDGKYEPSTIPDTSPLNLTSATHVTCKSNACXXXXXXXXXSDLCAMANCPLDQIETSDCK 1244
            C+GK E +T     P   ++A  V CKS+AC         SDLCA+A+CPL+ IETSDC 
Sbjct: 119  CEGKAENTTASTPPPRLSSTARSVHCKSSACSAAHSNLPTSDLCAIADCPLESIETSDCH 178

Query: 1243 KFSCPPFYYAYGDGSFIAKLYQDSLSFSLASPFFVLKDFTFGCAHSALGEPIGVAGFGRG 1064
             FSCP FYYAYGDGS +A+LY DS+   LA+P   L +FTFGCAH+AL EP+GVAGFGRG
Sbjct: 179  SFSCPSFYYAYGDGSLVARLYHDSIKLPLATPSLSLHNFTFGCAHTALAEPVGVAGFGRG 238

Query: 1063 ALSMPAQLASSSPDIGNYFSYCLVSHSFDTNRVRMPSPLILGRYNSAGENKEKQSDQIGD 884
             LS+PAQLAS +P +GN FSYCLVSHSF+++R+R+PSPLILG      ++KEK+ ++   
Sbjct: 239  VLSLPAQLASFAPQLGNRFSYCLVSHSFNSDRLRLPSPLILGH----SDDKEKRVNKDDV 294

Query: 883  GFVYTPMLKNNKHPYFYYIGLAGISLGKKRIAAPESLRTIDGKGNGGMVVDSGTTFTMLP 704
             FVYT ML N KHPYFY +GL GIS+GKK+I APE L+ +D +G+GG+VVDSGTTFTMLP
Sbjct: 295  QFVYTSMLDNPKHPYFYCVGLEGISIGKKKIPAPEFLKRVDREGSGGVVVDSGTTFTMLP 354

Query: 703  PGIYNSIVAEFDARVGSVYKRAADVEDRTGLGPCYYFEGGGKVAPQAVVVPQMLLHFAGN 524
              +YNS+VAEFD RVG VY+RA +VED+TGLGPCYY++         V +P ++LHF GN
Sbjct: 355  ASLYNSVVAEFDNRVGRVYERAKEVEDKTGLGPCYYYD-------TVVNIPSLVLHFVGN 407

Query: 523  -STVVMPKRNYFCEFLXXXXXXXXXXXXGCMMLMN-XXXXXXXXXXXGLLGNYQQQGFEV 350
             S+VV+PK+NYF +FL            GC+MLMN              LGNYQQ GFEV
Sbjct: 408  ESSVVLPKKNYFYDFLDGGDGVRRKRRVGCLMLMNGGEEAELTGGPGATLGNYQQHGFEV 467

Query: 349  VYDLEKLRVGFARRKCASLWKTLN 278
            VYDLE+ RVGFARRKCASLW++LN
Sbjct: 468  VYDLEQRRVGFARRKCASLWESLN 491


>ref|XP_002334311.1| predicted protein [Populus trichocarpa] gi|222871031|gb|EEF08162.1|
            predicted protein [Populus trichocarpa]
          Length = 496

 Score =  540 bits (1391), Expect = e-151
 Identities = 281/476 (59%), Positives = 346/476 (72%), Gaps = 14/476 (2%)
 Frame = -1

Query: 1663 PLTHSLSKSRSAKATPSHLLKSTSTHSAARFRQR---------RQVSLPLNPGSDYTMSF 1511
            PLTHSLSK++       HL+KSTST S  RFR+          RQVSLPL+PGSDYT+SF
Sbjct: 29   PLTHSLSKTQFTST--HHLIKSTSTSSITRFRRHHHQKNTHNHRQVSLPLSPGSDYTLSF 86

Query: 1510 TLGSQTISLYMDTGSDVVWLPCHPFECILCDGKYEPSTIPDTSPLNLT-SATHVTCKSNA 1334
            TL SQ I LY+DTGSD+VW PC PFECILC+GK E +++  T P  L+ +AT V+CKS+A
Sbjct: 87   TLDSQPIFLYLDTGSDLVWFPCQPFECILCEGKAENTSLASTPPPKLSKTATPVSCKSSA 146

Query: 1333 CXXXXXXXXXSDLCAMANCPLDQIETSDCKKFSCPPFYYAYGDGSFIAKLYQDSLSFSLA 1154
            C         SDLCA++NCPL+ IETSDC+K SCP FYYAYGDGS IA+LY+DS+S  L+
Sbjct: 147  CSAAHSNLPSSDLCAISNCPLESIETSDCQKHSCPQFYYAYGDGSLIARLYRDSISLPLS 206

Query: 1153 SPF-FVLKDFTFGCAHSALGEPIGVAGFGRGALSMPAQLASSSPDIGNYFSYCLVSHSFD 977
            +P   ++ +FTFGCAH+AL EPIGVAGFGRG LS+PAQLA+ SP +GN FSYCLVSHSFD
Sbjct: 207  NPTNLIVNNFTFGCAHTALAEPIGVAGFGRGVLSLPAQLATLSPQLGNQFSYCLVSHSFD 266

Query: 976  TNRVRMPSPLILGRYNSAGENKEKQSDQIGDG-FVYTPMLKNNKHPYFYYIGLAGISLGK 800
            ++R+R PSPLILGRY+   + KE++ + +    FVYT ML N +HPYFY +GL GIS+G+
Sbjct: 267  SDRLRRPSPLILGRYDH--DEKERRVNGVNKPRFVYTSMLDNLEHPYFYCVGLEGISIGR 324

Query: 799  KRIAAPESLRTIDGKGNGGMVVDSGTTFTMLPPGIYNSIVAEFDARVGSVYKRAADVEDR 620
            K+I AP  LR +DG+G+GG+VVDSGTTFTMLP  +Y S+VAEF+ RVG V +RA  +E+ 
Sbjct: 325  KKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNERARVIEED 384

Query: 619  TGLGPCYYFEGGGKVAPQAVVVPQMLLHFAGN-STVVMPKRNYFCEFLXXXXXXXXXXXX 443
            TGL PCYYF+         V VP ++LHF GN S+VV+P+RNYF EFL            
Sbjct: 385  TGLSPCYYFDN------NVVNVPSVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKRKV 438

Query: 442  GCMMLMN-XXXXXXXXXXXGLLGNYQQQGFEVVYDLEKLRVGFARRKCASLWKTLN 278
            GC+MLMN              LGNYQQQGFEVVYDLE  RVGFARR+CASLW+TLN
Sbjct: 439  GCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQCASLWETLN 494


>ref|XP_002326638.1| predicted protein [Populus trichocarpa] gi|222833960|gb|EEE72437.1|
            predicted protein [Populus trichocarpa]
          Length = 496

 Score =  540 bits (1391), Expect = e-151
 Identities = 281/476 (59%), Positives = 346/476 (72%), Gaps = 14/476 (2%)
 Frame = -1

Query: 1663 PLTHSLSKSRSAKATPSHLLKSTSTHSAARFRQR---------RQVSLPLNPGSDYTMSF 1511
            PLTHSLSK++       HL+KSTST S  RFR+          RQVSLPL+PGSDYT+SF
Sbjct: 29   PLTHSLSKTQFTST--HHLIKSTSTSSITRFRRHHHQKNTHNHRQVSLPLSPGSDYTLSF 86

Query: 1510 TLGSQTISLYMDTGSDVVWLPCHPFECILCDGKYEPSTIPDTSPLNLT-SATHVTCKSNA 1334
            TL SQ I LY+DTGSD+VW PC PFECILC+GK E +++  T P  L+ +AT V+CKS+A
Sbjct: 87   TLDSQPIFLYLDTGSDLVWFPCQPFECILCEGKAENTSLASTPPPKLSKTATPVSCKSSA 146

Query: 1333 CXXXXXXXXXSDLCAMANCPLDQIETSDCKKFSCPPFYYAYGDGSFIAKLYQDSLSFSLA 1154
            C         SDLCA++NCPL+ IETSDC+K SCP FYYAYGDGS IA+LY+DS+S  L+
Sbjct: 147  CSAAHSNLPSSDLCAISNCPLESIETSDCQKHSCPQFYYAYGDGSLIARLYRDSISLPLS 206

Query: 1153 SPF-FVLKDFTFGCAHSALGEPIGVAGFGRGALSMPAQLASSSPDIGNYFSYCLVSHSFD 977
            +P   ++ +FTFGCAH+AL EPIGVAGFGRG LS+PAQLA+ SP +GN FSYCLVSHSFD
Sbjct: 207  NPTNLIVNNFTFGCAHTALAEPIGVAGFGRGVLSLPAQLATLSPQLGNQFSYCLVSHSFD 266

Query: 976  TNRVRMPSPLILGRYNSAGENKEKQSDQIGDG-FVYTPMLKNNKHPYFYYIGLAGISLGK 800
            ++R+R PSPLILGRY+   + KE++ + +    FVYT ML N +HPYFY +GL GIS+G+
Sbjct: 267  SDRLRRPSPLILGRYDH--DEKERRVNGVNKPRFVYTSMLDNLEHPYFYCVGLEGISIGR 324

Query: 799  KRIAAPESLRTIDGKGNGGMVVDSGTTFTMLPPGIYNSIVAEFDARVGSVYKRAADVEDR 620
            K+I AP  LR +DG+G+GG+VVDSGTTFTMLP  +Y S+VAEF+ RVG V +RA  +E+ 
Sbjct: 325  KKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNERARVIEED 384

Query: 619  TGLGPCYYFEGGGKVAPQAVVVPQMLLHFAGN-STVVMPKRNYFCEFLXXXXXXXXXXXX 443
            TGL PCYYF+         V VP ++LHF GN S+VV+P+RNYF EFL            
Sbjct: 385  TGLSPCYYFDN------NVVNVPSVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKRKV 438

Query: 442  GCMMLMN-XXXXXXXXXXXGLLGNYQQQGFEVVYDLEKLRVGFARRKCASLWKTLN 278
            GC+MLMN              LGNYQQQGFEVVYDLE  RVGFARR+CASLW+TLN
Sbjct: 439  GCLMLMNGGEEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQCASLWETLN 494


>ref|XP_002304273.1| predicted protein [Populus trichocarpa] gi|222841705|gb|EEE79252.1|
            predicted protein [Populus trichocarpa]
          Length = 496

 Score =  530 bits (1364), Expect = e-148
 Identities = 275/476 (57%), Positives = 340/476 (71%), Gaps = 14/476 (2%)
 Frame = -1

Query: 1663 PLTHSLSKSRSAKATPSHLLKSTSTHSAARFRQR---------RQVSLPLNPGSDYTMSF 1511
            PL HSLSK++       HLLKSTST S  RF            RQVSLPL+PGSDYT+SF
Sbjct: 29   PLIHSLSKTQFTST--HHLLKSTSTRSTTRFHHHHHNKNSHNHRQVSLPLSPGSDYTLSF 86

Query: 1510 TLGSQTISLYMDTGSDVVWLPCHPFECILCDGKYEPSTIPDTSPLNLT-SATHVTCKSNA 1334
            T+ SQ ISLY+DTGSD+VW PC PFECILC+GK E +++  T P  L+ +AT V+CKS+A
Sbjct: 87   TINSQPISLYLDTGSDLVWFPCQPFECILCEGKAENASLASTPPPKLSKTATPVSCKSSA 146

Query: 1333 CXXXXXXXXXSDLCAMANCPLDQIETSDCKKFSCPPFYYAYGDGSFIAKLYQDSLSFSLA 1154
            C         SDLCA++NCPL+ IE SDC+K SCP FYYAYGDGS IA+LY+DS+   L+
Sbjct: 147  CSAVHSNLPSSDLCAISNCPLESIEISDCRKHSCPQFYYAYGDGSLIARLYRDSIRLPLS 206

Query: 1153 SPF-FVLKDFTFGCAHSALGEPIGVAGFGRGALSMPAQLASSSPDIGNYFSYCLVSHSFD 977
            +    +  +FTFGCAH+ L EPIGVAGFGRG LS+PAQLA+ SP +GN FSYCLVSHSFD
Sbjct: 207  NQTNLIFNNFTFGCAHTTLAEPIGVAGFGRGVLSLPAQLATLSPQLGNQFSYCLVSHSFD 266

Query: 976  TNRVRMPSPLILGRYNSAGENKEKQSDQIGD-GFVYTPMLKNNKHPYFYYIGLAGISLGK 800
            ++RVR PSPLILGRY+   + KE++ + +    FVYT ML N +HPYFY +GL GIS+G+
Sbjct: 267  SDRVRRPSPLILGRYDH--DEKERRVNGVKKPSFVYTSMLDNPRHPYFYCVGLEGISIGR 324

Query: 799  KRIAAPESLRTIDGKGNGGMVVDSGTTFTMLPPGIYNSIVAEFDARVGSVYKRAADVEDR 620
            K+I AP+ LR +D KG+GG+VVDSGTTFTMLP  +Y+ +VAEF+ RVG V +RA+ +E+ 
Sbjct: 325  KKIPAPDFLRKVDRKGSGGVVVDSGTTFTMLPASLYDFVVAEFENRVGRVNERASVIEEN 384

Query: 619  TGLGPCYYFEGGGKVAPQAVVVPQMLLHFAGN-STVVMPKRNYFCEFLXXXXXXXXXXXX 443
            TGL PCYYF+         V VP+++LHF GN S+VV+P+RNYF EFL            
Sbjct: 385  TGLSPCYYFDN------NVVNVPRVVLHFVGNGSSVVLPRRNYFYEFLDGGHGKGKKRKV 438

Query: 442  GCMMLMN-XXXXXXXXXXXGLLGNYQQQGFEVVYDLEKLRVGFARRKCASLWKTLN 278
            GC+MLMN              LGNYQQQGFEVVYDLE  RVGFARR+CASLW+ LN
Sbjct: 439  GCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENRRVGFARRQCASLWEALN 494


Top