BLASTX nr result

ID: Cnidium21_contig00002000 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cnidium21_contig00002000
         (1798 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2...   610   e-172
ref|XP_002315953.1| predicted protein [Populus trichocarpa] gi|2...   601   e-169
ref|XP_002890686.1| aspartyl protease family protein [Arabidopsi...   589   e-166
ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis tha...   585   e-164
ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor,...   585   e-164

>ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  610 bits (1572), Expect = e-172
 Identities = 309/482 (64%), Positives = 368/482 (76%), Gaps = 6/482 (1%)
 Frame = +2

Query: 170  FFCILLTVSFTSVLSRNLPLRSTTSVLDVSASIHKTL---SFNSQSINTLNKIEEXXXXX 340
            FFC   T   + V +R L L  TT VLDVS SI ++L   S N Q      + +E     
Sbjct: 14   FFC---TWGVSLVNARRLSLPRTT-VLDVSGSIRESLNVLSLNPQYEQMEFQHQERSFPS 69

Query: 341  XXXXXXXX---HPRSSLHKNAHTDYKGLTLARLGRDSVRVNSLQARLDLAINGVLKSDLK 511
                       H R+S+HK++H DYK L LARL RDS RV SL  R+DLAI G+ KSDLK
Sbjct: 70   SSSSSSLTLSLHSRTSIHKSSHKDYKSLVLARLERDSDRVRSLATRMDLAIAGITKSDLK 129

Query: 512  PVYTELAAEELEVPVISGTSQGSGEYFTRLGIGHPPSQLYMILDTGSDVNWLQCAPCADC 691
            PV  EL AE LE P++SG SQGSGEYF+R+GIG PP  +YM++DTGSDVNW+QCAPCADC
Sbjct: 130  PVEKELEAEALETPLVSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADC 189

Query: 692  YQQTDPIFEPALSSSYSPLTCNTQQCKSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTETV 871
            YQQ DPIFEP+ SSSY+PLTC T QCKSLDV +CRND+CLYEVSYGDGSYTVGDF TET+
Sbjct: 190  YQQADPIFEPSFSSSYAPLTCETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETI 249

Query: 872  TFGGSASVNNVAIGCGHSNEXXXXXXXXXXXXXXXXXXXPSQINATSFSYCLVDRDSDSA 1051
            T  GSAS+NNVAIGCGH NE                   PSQINA+SFSYCLV+RD+DSA
Sbjct: 250  TLDGSASLNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCLVNRDTDSA 309

Query: 1052 STLEFDSVIPPNAVTAPLVRNDKLGTYYYIGLTGISVAGEMLKISESTFQLNSNGEGGVI 1231
            STLEF+S IP ++VTAPL+RN++L T+YY+G+TGI V G+ML I  S+F+++ +G GG+I
Sbjct: 310  STLEFNSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGII 369

Query: 1232 IDSGTAVTRLQNGAYYSLRDAFKKGTKDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHFS 1411
            +DSGTAVTRLQ+  Y SLRD+F +GT+ LPST GVALFDTCYDLS++ SVEVPTVSFHF 
Sbjct: 370  VDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFP 429

Query: 1412 NGKEWSLPAKNYLIPVDSAGTFCLAFAPTSSALSIIGNVQQQGTRVSYDMGHSLIGFTPN 1591
            +GK  +LPAKNYLIPVDSAGTFC AFAPT+SALSIIGNVQQQGTRVSYD+ +SL+GF+PN
Sbjct: 430  DGKYLALPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFSPN 489

Query: 1592 KC 1597
             C
Sbjct: 490  GC 491


>ref|XP_002315953.1| predicted protein [Populus trichocarpa] gi|222864993|gb|EEF02124.1|
            predicted protein [Populus trichocarpa]
          Length = 484

 Score =  601 bits (1549), Expect = e-169
 Identities = 300/460 (65%), Positives = 355/460 (77%), Gaps = 2/460 (0%)
 Frame = +2

Query: 224  PLRSTTSVLDVSASIHKTLSFNSQSINTLNKIEEXXXXXXXXXXXXXHPRSSLHKNAHTD 403
            P  S T+VLDV+ASI +T +  S         ++               R+S+ K  HT 
Sbjct: 26   PHPSETTVLDVAASIQRTKNIFSSGPKMSPFNQQEKETTSSELTVELLSRTSIQKTTHTG 85

Query: 404  YKGLTLARLGRDSVRVNSLQARLDLAINGVLKSDLKPVYT--ELAAEELEVPVISGTSQG 577
            YK LTL+RL RDS RV SL  RLDLAIN +  SDLKP+ T  E   E+L+ P+ISGTSQG
Sbjct: 86   YKSLTLSRLQRDSARVKSLVTRLDLAINSISSSDLKPLETDSEFKPEDLQSPIISGTSQG 145

Query: 578  SGEYFTRLGIGHPPSQLYMILDTGSDVNWLQCAPCADCYQQTDPIFEPALSSSYSPLTCN 757
            SGEYF+R+GIG PPSQ Y+ILDTGSDVNW+QCAPCADCYQQ DPIFEPA S+S+S L+CN
Sbjct: 146  SGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPASSASFSTLSCN 205

Query: 758  TQQCKSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTETVTFGGSASVNNVAIGCGHSNEXX 937
            T+QC+SLDV +CRNDTCLYEVSYGDGSYTVGDFVTET+T G SA V+NVAIGCGH+NE  
Sbjct: 206  TRQCRSLDVSECRNDTCLYEVSYGDGSYTVGDFVTETITLG-SAPVDNVAIGCGHNNEGL 264

Query: 938  XXXXXXXXXXXXXXXXXPSQINATSFSYCLVDRDSDSASTLEFDSVIPPNAVTAPLVRND 1117
                             PSQINATSFSYCLVDRDS+SASTLEF+S +PPNAV+APL+RN 
Sbjct: 265  FVGAAGLLGLGGGSLSFPSQINATSFSYCLVDRDSESASTLEFNSTLPPNAVSAPLLRNH 324

Query: 1118 KLGTYYYIGLTGISVAGEMLKISESTFQLNSNGEGGVIIDSGTAVTRLQNGAYYSLRDAF 1297
             L T+YY+GLTG+SV GE++ I ES FQ++ +G GGVI+DSGTA+TRLQ   Y SLRDAF
Sbjct: 325  HLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDSGTAITRLQTDVYNSLRDAF 384

Query: 1298 KKGTKDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHFSNGKEWSLPAKNYLIPVDSAGTF 1477
             K T+DLPST+G+ALFDTCYDLS+K +VEVPTVSFHF +GKE  LPAKNYL+P+DS GTF
Sbjct: 385  VKRTRDLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPAKNYLVPLDSEGTF 444

Query: 1478 CLAFAPTSSALSIIGNVQQQGTRVSYDMGHSLIGFTPNKC 1597
            C AFAPT+S+LSIIGNVQQQGTRV YD+ + L+GF PNKC
Sbjct: 445  CFAFAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484


>ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297336528|gb|EFH66945.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  589 bits (1519), Expect = e-166
 Identities = 299/482 (62%), Positives = 363/482 (75%), Gaps = 5/482 (1%)
 Frame = +2

Query: 167  FFFCILLTVSFTSVLSRNLPLRS--TTSVLDVSASIHKTLSFNSQSINTLNKIEEXXXXX 340
            FFF   LT S + V SR LP  S  TTS+L+V+ SIH+T   +S     LN+ EE     
Sbjct: 10   FFFVFFLT-SHSFVFSRILPKTSVTTTSILNVADSIHRTKYTSSFR---LNQQEEQTHSR 65

Query: 341  XXXXXXXXHPRSSLHKNAHTDYKGLTLARLGRDSVRVNSLQARLDLAINGVLKSDLKPV- 517
                    H R S+    H+DYK LTLARL RD+ RV SL  RLDLAIN + K+DLKPV 
Sbjct: 66   SSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPVT 125

Query: 518  --YTELAAEELEVPVISGTSQGSGEYFTRLGIGHPPSQLYMILDTGSDVNWLQCAPCADC 691
              YT    E++E P+ISGT+QGSGEYFTR+GIG+P  ++YM+LDTGSDVNWLQC PCADC
Sbjct: 126  TMYTTTEEEDIEAPLISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADC 185

Query: 692  YQQTDPIFEPALSSSYSPLTCNTQQCKSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTETV 871
            Y QT+PIFEP+ SSSY PL+C+T QC +L+V +CRN TCLYEVSYGDGSYTVGDF TET+
Sbjct: 186  YHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETL 245

Query: 872  TFGGSASVNNVAIGCGHSNEXXXXXXXXXXXXXXXXXXXPSQINATSFSYCLVDRDSDSA 1051
            T G S  V NVA+GCGHSNE                   PSQ+N TSFSYCLVDRDSDSA
Sbjct: 246  TIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSA 304

Query: 1052 STLEFDSVIPPNAVTAPLVRNDKLGTYYYIGLTGISVAGEMLKISESTFQLNSNGEGGVI 1231
            ST+EF + +PP+AV APL+RN +L T+YY+GLTGISV GE+L+I +S+F+++ +G GG+I
Sbjct: 305  STVEFGTSLPPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGII 364

Query: 1232 IDSGTAVTRLQNGAYYSLRDAFKKGTKDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHFS 1411
            IDSGTAVTRLQ G Y SLRD+F KGT DL    GVA+FDTCY+LSAK ++EVPTV+FHF 
Sbjct: 365  IDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVAFHFP 424

Query: 1412 NGKEWSLPAKNYLIPVDSAGTFCLAFAPTSSALSIIGNVQQQGTRVSYDMGHSLIGFTPN 1591
             GK  +LPAKNY+IPVDS GTFCLAFAPT+S+L+IIGNVQQQGTRV++D+ +SLIGF+ N
Sbjct: 425  GGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSN 484

Query: 1592 KC 1597
            KC
Sbjct: 485  KC 486


>ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
            gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical
            protein [Arabidopsis thaliana] gi|20466516|gb|AAM20575.1|
            unknown protein [Arabidopsis thaliana]
            gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis
            thaliana] gi|110736960|dbj|BAF00436.1| hypothetical
            protein [Arabidopsis thaliana]
            gi|332192515|gb|AEE30636.1| aspartyl protease-like
            protein [Arabidopsis thaliana]
          Length = 483

 Score =  585 bits (1507), Expect = e-164
 Identities = 297/487 (60%), Positives = 362/487 (74%), Gaps = 4/487 (0%)
 Frame = +2

Query: 149  MAQRVTFFFCILLTVSFTSVLSRNLPLRSTT--SVLDVSASIHKTLSFNSQSINTLNKIE 322
            M+   +FFF I    S +SV SR LP  STT  S+L+V+ SIH+T   +S     LN+ E
Sbjct: 1    MSPNYSFFFFIFFLTSHSSVFSRILPETSTTTTSILNVADSIHRTKYTSSFR---LNQQE 57

Query: 323  EXXXXXXXXXXXXXHPRSSLHKNAHTDYKGLTLARLGRDSVRVNSLQARLDLAINGVLKS 502
            E             H R S+    H+DYK LTLARL RD+ RV SL  RLDLAIN + K+
Sbjct: 58   EQTHSASSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKA 117

Query: 503  DLKPVYTELAAEE--LEVPVISGTSQGSGEYFTRLGIGHPPSQLYMILDTGSDVNWLQCA 676
            DLKP+ T    EE  +E P+ISGT+QGSGEYFTR+GIG P  ++YM+LDTGSDVNWLQC 
Sbjct: 118  DLKPISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCT 177

Query: 677  PCADCYQQTDPIFEPALSSSYSPLTCNTQQCKSLDVFQCRNDTCLYEVSYGDGSYTVGDF 856
            PCADCY QT+PIFEP+ SSSY PL+C+T QC +L+V +CRN TCLYEVSYGDGSYTVGDF
Sbjct: 178  PCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDF 237

Query: 857  VTETVTFGGSASVNNVAIGCGHSNEXXXXXXXXXXXXXXXXXXXPSQINATSFSYCLVDR 1036
             TET+T G S  V NVA+GCGHSNE                   PSQ+N TSFSYCLVDR
Sbjct: 238  ATETLTIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDR 296

Query: 1037 DSDSASTLEFDSVIPPNAVTAPLVRNDKLGTYYYIGLTGISVAGEMLKISESTFQLNSNG 1216
            DSDSAST++F + + P+AV APL+RN +L T+YY+GLTGISV GE+L+I +S+F+++ +G
Sbjct: 297  DSDSASTVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESG 356

Query: 1217 EGGVIIDSGTAVTRLQNGAYYSLRDAFKKGTKDLPSTDGVALFDTCYDLSAKKSVEVPTV 1396
             GG+IIDSGTAVTRLQ   Y SLRD+F KGT DL    GVA+FDTCY+LSAK +VEVPTV
Sbjct: 357  SGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTV 416

Query: 1397 SFHFSNGKEWSLPAKNYLIPVDSAGTFCLAFAPTSSALSIIGNVQQQGTRVSYDMGHSLI 1576
            +FHF  GK  +LPAKNY+IPVDS GTFCLAFAPT+S+L+IIGNVQQQGTRV++D+ +SLI
Sbjct: 417  AFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLI 476

Query: 1577 GFTPNKC 1597
            GF+ NKC
Sbjct: 477  GFSSNKC 483


>ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223531426|gb|EEF33260.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 479

 Score =  585 bits (1507), Expect = e-164
 Identities = 292/456 (64%), Positives = 351/456 (76%), Gaps = 3/456 (0%)
 Frame = +2

Query: 239  TSVLDVSASIHKTLSFNSQSINTLNKI-EEXXXXXXXXXXXXXHPRSSLHKNAHTDYKGL 415
            T++LDV ASI K  +  + S   +    ++             H R+S+ K  H DY+ L
Sbjct: 25   TTLLDVEASIQKAEAIFTSSATKMTPFNQQEIVTSSSQLTMELHSRTSVQKTKHPDYRSL 84

Query: 416  TLARLGRDSVRVNSLQARLDLAINGVLKSDLKPVYTE--LAAEELEVPVISGTSQGSGEY 589
            TL+RL RDS RV S+  RLDLAI+G+  SDLKP+ T+    AE+L+ P+ISGTSQGSGEY
Sbjct: 85   TLSRLERDSARVKSINTRLDLAIHGLSTSDLKPLDTDSQFRAEDLQGPIISGTSQGSGEY 144

Query: 590  FTRLGIGHPPSQLYMILDTGSDVNWLQCAPCADCYQQTDPIFEPALSSSYSPLTCNTQQC 769
            F+R+GIG P S +YM+LDTGSDVNW+QCAPCADCY Q DPIFEPA S+SYSPL+C+T+QC
Sbjct: 145  FSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPASSTSYSPLSCDTKQC 204

Query: 770  KSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTETVTFGGSASVNNVAIGCGHSNEXXXXXX 949
            +SLDV +CRN+TCLYEVSYGDGSYTVGDFVTET+T G SASV+NVAIGCGH+NE      
Sbjct: 205  QSLDVSECRNNTCLYEVSYGDGSYTVGDFVTETITLG-SASVDNVAIGCGHNNEGLFIGA 263

Query: 950  XXXXXXXXXXXXXPSQINATSFSYCLVDRDSDSASTLEFDSVIPPNAVTAPLVRNDKLGT 1129
                         PSQINA+SFSYCLVDRDSDSASTLEF+S + P+A+TAPL+RN +L T
Sbjct: 264  AGLLGLGGGKLSFPSQINASSFSYCLVDRDSDSASTLEFNSALLPHAITAPLLRNRELDT 323

Query: 1130 YYYIGLTGISVAGEMLKISESTFQLNSNGEGGVIIDSGTAVTRLQNGAYYSLRDAFKKGT 1309
            +YY+G+TG+SV GE+L I ES F+++ +G GG+IIDSGTAVTRLQ  AY +LRDAF KGT
Sbjct: 324  FYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVTRLQTAAYNALRDAFVKGT 383

Query: 1310 KDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHFSNGKEWSLPAKNYLIPVDSAGTFCLAF 1489
            KDLP T  VALFDTCYDLS K SVEVPTV+FH + GK   LPA NYLIPVDS GTFC AF
Sbjct: 384  KDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHLAGGKVLPLPATNYLIPVDSDGTFCFAF 443

Query: 1490 APTSSALSIIGNVQQQGTRVSYDMGHSLIGFTPNKC 1597
            APTSSALSIIGNVQQQGTRV +D+ +SL+GF P +C
Sbjct: 444  APTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479


Top