BLASTX nr result

ID: Angelica23_contig00013114 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00013114
         (1686 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2...   607   e-171
ref|XP_002315953.1| predicted protein [Populus trichocarpa] gi|2...   598   e-168
ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor,...   589   e-166
ref|XP_002890686.1| aspartyl protease family protein [Arabidopsi...   580   e-163
ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis tha...   574   e-161

>ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  607 bits (1564), Expect = e-171
 Identities = 307/481 (63%), Positives = 369/481 (76%), Gaps = 5/481 (1%)
 Frame = -1

Query: 1677 FSILLTVSFTPVLSRNLQLRDTTSVLDVSASIHKTLSFDS-----QSINTLNQIEAXXXX 1513
            F+   T   + V +R L L  TT VLDVS SI ++L+  S     + +   +Q  +    
Sbjct: 12   FAFFCTWGVSLVNARRLSLPRTT-VLDVSGSIRESLNVLSLNPQYEQMEFQHQERSFPSS 70

Query: 1512 XXXXXXXXXLHPRSSIHKTTHTDYKELTLARLGRDSVRVNSLQARLDLAIHGVLKSDLKP 1333
                     LH R+SIHK++H DYK L LARL RDS RV SL  R+DLAI G+ KSDLKP
Sbjct: 71   SSSSSLTLSLHSRTSIHKSSHKDYKSLVLARLERDSDRVRSLATRMDLAIAGITKSDLKP 130

Query: 1332 VYTELAAEELEVPVISGTSQGSGEYFTRLGIGHPPSQLYMVLDTGSDVNWLQCAPCADCY 1153
            V  EL AE LE P++SG SQGSGEYF+R+GIG PP  +YMV+DTGSDVNW+QCAPCADCY
Sbjct: 131  VEKELEAEALETPLVSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCY 190

Query: 1152 QQTDPIFEPALSSSYSPLTCNTQQCKSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTETVT 973
            QQ DPIFEP+ SSSY+PLTC T QCKSLDV +CRND+CLYEVSYGDGSYTVGDF TET+T
Sbjct: 191  QQADPIFEPSFSSSYAPLTCETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETIT 250

Query: 972  FGDSASVNDVAIGCGHSNEXXXXXXXXXXXXXXXXXXFPSQINATSFSYCLVDRDSDSAS 793
               SAS+N+VAIGCGH NE                  FPSQINA+SFSYCLV+RD+DSAS
Sbjct: 251  LDGSASLNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCLVNRDTDSAS 310

Query: 792  TLEFDSVIPPNAVTAPLVRNDKLNTYYYIGLTGISVAGEMLKISESTFQLNNNGEGGVII 613
            TLEF+S IP ++VTAPL+RN++L+T+YY+G+TGI V G+ML I  S+F+++ +G GG+I+
Sbjct: 311  TLEFNSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIV 370

Query: 612  DSGTAVTRLQNGAYYSLRDAFKKGTKDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHFSN 433
            DSGTAVTRLQ+  Y SLRD+F +GT+ LPST GVALFDTCYDLS++ SVEVPTVSFHF +
Sbjct: 371  DSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFPD 430

Query: 432  GKEWSLPAKNYLIPVDSAGTFCLAFAPTSSALSIIGNVQQQGTRVSYDLGHSLIGFTANK 253
            GK  +LPAKNYLIPVDSAGTFC AFAPT+SALSIIGNVQQQGTRVSYDL +SL+GF+ N 
Sbjct: 431  GKYLALPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFSPNG 490

Query: 252  C 250
            C
Sbjct: 491  C 491


>ref|XP_002315953.1| predicted protein [Populus trichocarpa] gi|222864993|gb|EEF02124.1|
            predicted protein [Populus trichocarpa]
          Length = 484

 Score =  598 bits (1541), Expect = e-168
 Identities = 302/458 (65%), Positives = 359/458 (78%), Gaps = 4/458 (0%)
 Frame = -1

Query: 1611 TSVLDVSASIHKTLSFDSQS--INTLNQIEAXXXXXXXXXXXXXLHPRSSIHKTTHTDYK 1438
            T+VLDV+ASI +T +  S    ++  NQ E                 R+SI KTTHT YK
Sbjct: 31   TTVLDVAASIQRTKNIFSSGPKMSPFNQQEKETTSSELTVELLS---RTSIQKTTHTGYK 87

Query: 1437 ELTLARLGRDSVRVNSLQARLDLAIHGVLKSDLKPVYT--ELAAEELEVPVISGTSQGSG 1264
             LTL+RL RDS RV SL  RLDLAI+ +  SDLKP+ T  E   E+L+ P+ISGTSQGSG
Sbjct: 88   SLTLSRLQRDSARVKSLVTRLDLAINSISSSDLKPLETDSEFKPEDLQSPIISGTSQGSG 147

Query: 1263 EYFTRLGIGHPPSQLYMVLDTGSDVNWLQCAPCADCYQQTDPIFEPALSSSYSPLTCNTQ 1084
            EYF+R+GIG PPSQ Y++LDTGSDVNW+QCAPCADCYQQ DPIFEPA S+S+S L+CNT+
Sbjct: 148  EYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPASSASFSTLSCNTR 207

Query: 1083 QCKSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTETVTFGDSASVNDVAIGCGHSNEXXXX 904
            QC+SLDV +CRNDTCLYEVSYGDGSYTVGDFVTET+T G SA V++VAIGCGH+NE    
Sbjct: 208  QCRSLDVSECRNDTCLYEVSYGDGSYTVGDFVTETITLG-SAPVDNVAIGCGHNNEGLFV 266

Query: 903  XXXXXXXXXXXXXXFPSQINATSFSYCLVDRDSDSASTLEFDSVIPPNAVTAPLVRNDKL 724
                          FPSQINATSFSYCLVDRDS+SASTLEF+S +PPNAV+APL+RN  L
Sbjct: 267  GAAGLLGLGGGSLSFPSQINATSFSYCLVDRDSESASTLEFNSTLPPNAVSAPLLRNHHL 326

Query: 723  NTYYYIGLTGISVAGEMLKISESTFQLNNNGEGGVIIDSGTAVTRLQNGAYYSLRDAFKK 544
            +T+YY+GLTG+SV GE++ I ES FQ++ +G GGVI+DSGTA+TRLQ   Y SLRDAF K
Sbjct: 327  DTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDSGTAITRLQTDVYNSLRDAFVK 386

Query: 543  GTKDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHFSNGKEWSLPAKNYLIPVDSAGTFCL 364
             T+DLPST+G+ALFDTCYDLS+K +VEVPTVSFHF +GKE  LPAKNYL+P+DS GTFC 
Sbjct: 387  RTRDLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPAKNYLVPLDSEGTFCF 446

Query: 363  AFAPTSSALSIIGNVQQQGTRVSYDLGHSLIGFTANKC 250
            AFAPT+S+LSIIGNVQQQGTRV YDL + L+GF  NKC
Sbjct: 447  AFAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484


>ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223531426|gb|EEF33260.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 479

 Score =  589 bits (1519), Expect = e-166
 Identities = 296/456 (64%), Positives = 352/456 (77%), Gaps = 2/456 (0%)
 Frame = -1

Query: 1611 TSVLDVSASIHKTLSFDSQSINTLNQIEAXXXXXXXXXXXXXLHPRSSIHKTTHTDYKEL 1432
            T++LDV ASI K  +  + S   +                  LH R+S+ KT H DY+ L
Sbjct: 25   TTLLDVEASIQKAEAIFTSSATKMTPFNQQEIVTSSSQLTMELHSRTSVQKTKHPDYRSL 84

Query: 1431 TLARLGRDSVRVNSLQARLDLAIHGVLKSDLKPVYTE--LAAEELEVPVISGTSQGSGEY 1258
            TL+RL RDS RV S+  RLDLAIHG+  SDLKP+ T+    AE+L+ P+ISGTSQGSGEY
Sbjct: 85   TLSRLERDSARVKSINTRLDLAIHGLSTSDLKPLDTDSQFRAEDLQGPIISGTSQGSGEY 144

Query: 1257 FTRLGIGHPPSQLYMVLDTGSDVNWLQCAPCADCYQQTDPIFEPALSSSYSPLTCNTQQC 1078
            F+R+GIG P S +YMVLDTGSDVNW+QCAPCADCY Q DPIFEPA S+SYSPL+C+T+QC
Sbjct: 145  FSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPASSTSYSPLSCDTKQC 204

Query: 1077 KSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTETVTFGDSASVNDVAIGCGHSNEXXXXXX 898
            +SLDV +CRN+TCLYEVSYGDGSYTVGDFVTET+T G SASV++VAIGCGH+NE      
Sbjct: 205  QSLDVSECRNNTCLYEVSYGDGSYTVGDFVTETITLG-SASVDNVAIGCGHNNEGLFIGA 263

Query: 897  XXXXXXXXXXXXFPSQINATSFSYCLVDRDSDSASTLEFDSVIPPNAVTAPLVRNDKLNT 718
                        FPSQINA+SFSYCLVDRDSDSASTLEF+S + P+A+TAPL+RN +L+T
Sbjct: 264  AGLLGLGGGKLSFPSQINASSFSYCLVDRDSDSASTLEFNSALLPHAITAPLLRNRELDT 323

Query: 717  YYYIGLTGISVAGEMLKISESTFQLNNNGEGGVIIDSGTAVTRLQNGAYYSLRDAFKKGT 538
            +YY+G+TG+SV GE+L I ES F+++ +G GG+IIDSGTAVTRLQ  AY +LRDAF KGT
Sbjct: 324  FYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVTRLQTAAYNALRDAFVKGT 383

Query: 537  KDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHFSNGKEWSLPAKNYLIPVDSAGTFCLAF 358
            KDLP T  VALFDTCYDLS K SVEVPTV+FH + GK   LPA NYLIPVDS GTFC AF
Sbjct: 384  KDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHLAGGKVLPLPATNYLIPVDSDGTFCFAF 443

Query: 357  APTSSALSIIGNVQQQGTRVSYDLGHSLIGFTANKC 250
            APTSSALSIIGNVQQQGTRV +DL +SL+GF   +C
Sbjct: 444  APTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479


>ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297336528|gb|EFH66945.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  580 bits (1495), Expect = e-163
 Identities = 296/483 (61%), Positives = 362/483 (74%), Gaps = 5/483 (1%)
 Frame = -1

Query: 1683 FFFSILLTVSFTPVLSRNLQLRD--TTSVLDVSASIHKTLSFDSQSINTLNQIEAXXXXX 1510
            FFF   LT S + V SR L      TTS+L+V+ SIH+T    S  +N   +        
Sbjct: 10   FFFVFFLT-SHSFVFSRILPKTSVTTTSILNVADSIHRTKYTSSFRLNQQEE----QTHS 64

Query: 1509 XXXXXXXXLHPRSSIHKTTHTDYKELTLARLGRDSVRVNSLQARLDLAIHGVLKSDLKPV 1330
                    LH R S+  T H+DYK LTLARL RD+ RV SL  RLDLAI+ + K+DLKPV
Sbjct: 65   RSSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPV 124

Query: 1329 ---YTELAAEELEVPVISGTSQGSGEYFTRLGIGHPPSQLYMVLDTGSDVNWLQCAPCAD 1159
               YT    E++E P+ISGT+QGSGEYFTR+GIG+P  ++YMVLDTGSDVNWLQC PCAD
Sbjct: 125  TTMYTTTEEEDIEAPLISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCAD 184

Query: 1158 CYQQTDPIFEPALSSSYSPLTCNTQQCKSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTET 979
            CY QT+PIFEP+ SSSY PL+C+T QC +L+V +CRN TCLYEVSYGDGSYTVGDF TET
Sbjct: 185  CYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATET 244

Query: 978  VTFGDSASVNDVAIGCGHSNEXXXXXXXXXXXXXXXXXXFPSQINATSFSYCLVDRDSDS 799
            +T G S  V +VA+GCGHSNE                   PSQ+N TSFSYCLVDRDSDS
Sbjct: 245  LTIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDS 303

Query: 798  ASTLEFDSVIPPNAVTAPLVRNDKLNTYYYIGLTGISVAGEMLKISESTFQLNNNGEGGV 619
            AST+EF + +PP+AV APL+RN +L+T+YY+GLTGISV GE+L+I +S+F+++ +G GG+
Sbjct: 304  ASTVEFGTSLPPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGI 363

Query: 618  IIDSGTAVTRLQNGAYYSLRDAFKKGTKDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHF 439
            IIDSGTAVTRLQ G Y SLRD+F KGT DL    GVA+FDTCY+LSAK ++EVPTV+FHF
Sbjct: 364  IIDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVAFHF 423

Query: 438  SNGKEWSLPAKNYLIPVDSAGTFCLAFAPTSSALSIIGNVQQQGTRVSYDLGHSLIGFTA 259
              GK  +LPAKNY+IPVDS GTFCLAFAPT+S+L+IIGNVQQQGTRV++DL +SLIGF++
Sbjct: 424  PGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSS 483

Query: 258  NKC 250
            NKC
Sbjct: 484  NKC 486


>ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
            gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical
            protein [Arabidopsis thaliana] gi|20466516|gb|AAM20575.1|
            unknown protein [Arabidopsis thaliana]
            gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis
            thaliana] gi|110736960|dbj|BAF00436.1| hypothetical
            protein [Arabidopsis thaliana]
            gi|332192515|gb|AEE30636.1| aspartyl protease-like
            protein [Arabidopsis thaliana]
          Length = 483

 Score =  574 bits (1480), Expect = e-161
 Identities = 293/483 (60%), Positives = 358/483 (74%), Gaps = 4/483 (0%)
 Frame = -1

Query: 1686 SFFFSILLTVSFTPVLSRNLQLRDTT--SVLDVSASIHKTLSFDSQSINTLNQIEAXXXX 1513
            SFFF I    S + V SR L    TT  S+L+V+ SIH+T    S  +N   +       
Sbjct: 6    SFFFFIFFLTSHSSVFSRILPETSTTTTSILNVADSIHRTKYTSSFRLNQQEE----QTH 61

Query: 1512 XXXXXXXXXLHPRSSIHKTTHTDYKELTLARLGRDSVRVNSLQARLDLAIHGVLKSDLKP 1333
                     LH R S+  T H+DYK LTLARL RD+ RV SL  RLDLAI+ + K+DLKP
Sbjct: 62   SASSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKP 121

Query: 1332 VYTELAAEE--LEVPVISGTSQGSGEYFTRLGIGHPPSQLYMVLDTGSDVNWLQCAPCAD 1159
            + T    EE  +E P+ISGT+QGSGEYFTR+GIG P  ++YMVLDTGSDVNWLQC PCAD
Sbjct: 122  ISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCAD 181

Query: 1158 CYQQTDPIFEPALSSSYSPLTCNTQQCKSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTET 979
            CY QT+PIFEP+ SSSY PL+C+T QC +L+V +CRN TCLYEVSYGDGSYTVGDF TET
Sbjct: 182  CYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATET 241

Query: 978  VTFGDSASVNDVAIGCGHSNEXXXXXXXXXXXXXXXXXXFPSQINATSFSYCLVDRDSDS 799
            +T G S  V +VA+GCGHSNE                   PSQ+N TSFSYCLVDRDSDS
Sbjct: 242  LTIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDS 300

Query: 798  ASTLEFDSVIPPNAVTAPLVRNDKLNTYYYIGLTGISVAGEMLKISESTFQLNNNGEGGV 619
            AST++F + + P+AV APL+RN +L+T+YY+GLTGISV GE+L+I +S+F+++ +G GG+
Sbjct: 301  ASTVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGI 360

Query: 618  IIDSGTAVTRLQNGAYYSLRDAFKKGTKDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHF 439
            IIDSGTAVTRLQ   Y SLRD+F KGT DL    GVA+FDTCY+LSAK +VEVPTV+FHF
Sbjct: 361  IIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHF 420

Query: 438  SNGKEWSLPAKNYLIPVDSAGTFCLAFAPTSSALSIIGNVQQQGTRVSYDLGHSLIGFTA 259
              GK  +LPAKNY+IPVDS GTFCLAFAPT+S+L+IIGNVQQQGTRV++DL +SLIGF++
Sbjct: 421  PGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSS 480

Query: 258  NKC 250
            NKC
Sbjct: 481  NKC 483


Top