BLASTX nr result

ID: Angelica22_contig00009722 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00009722
         (1937 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2...   607   e-171
ref|XP_002315953.1| predicted protein [Populus trichocarpa] gi|2...   598   e-168
ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor,...   589   e-166
ref|XP_002890686.1| aspartyl protease family protein [Arabidopsi...   580   e-163
ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis tha...   576   e-162

>ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  607 bits (1564), Expect = e-171
 Identities = 307/481 (63%), Positives = 369/481 (76%), Gaps = 5/481 (1%)
 Frame = -2

Query: 1741 FSILLTVSFTPVLSRNLQLRDTTSVLDVSASIHKTLSFDS-----QSINTLNQIEAXXXX 1577
            F+   T   + V +R L L  TT VLDVS SI ++L+  S     + +   +Q  +    
Sbjct: 12   FAFFCTWGVSLVNARRLSLPRTT-VLDVSGSIRESLNVLSLNPQYEQMEFQHQERSFPSS 70

Query: 1576 XXXXXXXXXLHPRSSIHKTTHTDYKELTLARLGRDSVRVNSLQARLDLAIHGVLKSDLKP 1397
                     LH R+SIHK++H DYK L LARL RDS RV SL  R+DLAI G+ KSDLKP
Sbjct: 71   SSSSSLTLSLHSRTSIHKSSHKDYKSLVLARLERDSDRVRSLATRMDLAIAGITKSDLKP 130

Query: 1396 VYTELAAEELEVPVISGTSQGSGEYFTRLGIGHPPSQLYMVLDTGSDVNWLQCAPCADCY 1217
            V  EL AE LE P++SG SQGSGEYF+R+GIG PP  +YMV+DTGSDVNW+QCAPCADCY
Sbjct: 131  VEKELEAEALETPLVSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCY 190

Query: 1216 QQTDPIFEPALSSSYSPLTCNTQQCKSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTETVT 1037
            QQ DPIFEP+ SSSY+PLTC T QCKSLDV +CRND+CLYEVSYGDGSYTVGDF TET+T
Sbjct: 191  QQADPIFEPSFSSSYAPLTCETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETIT 250

Query: 1036 FGDSASVNDVAIGCGHSNEXXXXXXXXXXXXXXXXXXFPSQINATSFSYCLVDRDSDSAS 857
               SAS+N+VAIGCGH NE                  FPSQINA+SFSYCLV+RD+DSAS
Sbjct: 251  LDGSASLNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCLVNRDTDSAS 310

Query: 856  TLEFDSVIPPNAVTAPLVRNDKLNTYYYIGLTGISVAGEMLKISESTFQLNNNGEGGVII 677
            TLEF+S IP ++VTAPL+RN++L+T+YY+G+TGI V G+ML I  S+F+++ +G GG+I+
Sbjct: 311  TLEFNSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIV 370

Query: 676  DSGTAVTRLQNGAYYSLRDAFKKGTKDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHFSN 497
            DSGTAVTRLQ+  Y SLRD+F +GT+ LPST GVALFDTCYDLS++ SVEVPTVSFHF +
Sbjct: 371  DSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFPD 430

Query: 496  GKEWSLPAKNYLIPVDSAGTFCLAFAPTSSALSIIGNVQQQGTRVSYDLGHSLIGFTANK 317
            GK  +LPAKNYLIPVDSAGTFC AFAPT+SALSIIGNVQQQGTRVSYDL +SL+GF+ N 
Sbjct: 431  GKYLALPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFSPNG 490

Query: 316  C 314
            C
Sbjct: 491  C 491


>ref|XP_002315953.1| predicted protein [Populus trichocarpa] gi|222864993|gb|EEF02124.1|
            predicted protein [Populus trichocarpa]
          Length = 484

 Score =  598 bits (1541), Expect = e-168
 Identities = 302/458 (65%), Positives = 359/458 (78%), Gaps = 4/458 (0%)
 Frame = -2

Query: 1675 TSVLDVSASIHKTLSFDSQS--INTLNQIEAXXXXXXXXXXXXXLHPRSSIHKTTHTDYK 1502
            T+VLDV+ASI +T +  S    ++  NQ E                 R+SI KTTHT YK
Sbjct: 31   TTVLDVAASIQRTKNIFSSGPKMSPFNQQEKETTSSELTVELLS---RTSIQKTTHTGYK 87

Query: 1501 ELTLARLGRDSVRVNSLQARLDLAIHGVLKSDLKPVYT--ELAAEELEVPVISGTSQGSG 1328
             LTL+RL RDS RV SL  RLDLAI+ +  SDLKP+ T  E   E+L+ P+ISGTSQGSG
Sbjct: 88   SLTLSRLQRDSARVKSLVTRLDLAINSISSSDLKPLETDSEFKPEDLQSPIISGTSQGSG 147

Query: 1327 EYFTRLGIGHPPSQLYMVLDTGSDVNWLQCAPCADCYQQTDPIFEPALSSSYSPLTCNTQ 1148
            EYF+R+GIG PPSQ Y++LDTGSDVNW+QCAPCADCYQQ DPIFEPA S+S+S L+CNT+
Sbjct: 148  EYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPASSASFSTLSCNTR 207

Query: 1147 QCKSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTETVTFGDSASVNDVAIGCGHSNEXXXX 968
            QC+SLDV +CRNDTCLYEVSYGDGSYTVGDFVTET+T G SA V++VAIGCGH+NE    
Sbjct: 208  QCRSLDVSECRNDTCLYEVSYGDGSYTVGDFVTETITLG-SAPVDNVAIGCGHNNEGLFV 266

Query: 967  XXXXXXXXXXXXXXFPSQINATSFSYCLVDRDSDSASTLEFDSVIPPNAVTAPLVRNDKL 788
                          FPSQINATSFSYCLVDRDS+SASTLEF+S +PPNAV+APL+RN  L
Sbjct: 267  GAAGLLGLGGGSLSFPSQINATSFSYCLVDRDSESASTLEFNSTLPPNAVSAPLLRNHHL 326

Query: 787  NTYYYIGLTGISVAGEMLKISESTFQLNNNGEGGVIIDSGTAVTRLQNGAYYSLRDAFKK 608
            +T+YY+GLTG+SV GE++ I ES FQ++ +G GGVI+DSGTA+TRLQ   Y SLRDAF K
Sbjct: 327  DTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDSGTAITRLQTDVYNSLRDAFVK 386

Query: 607  GTKDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHFSNGKEWSLPAKNYLIPVDSAGTFCL 428
             T+DLPST+G+ALFDTCYDLS+K +VEVPTVSFHF +GKE  LPAKNYL+P+DS GTFC 
Sbjct: 387  RTRDLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPAKNYLVPLDSEGTFCF 446

Query: 427  AFAPTSSALSIIGNVQQQGTRVSYDLGHSLIGFTANKC 314
            AFAPT+S+LSIIGNVQQQGTRV YDL + L+GF  NKC
Sbjct: 447  AFAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484


>ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223531426|gb|EEF33260.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 479

 Score =  589 bits (1519), Expect = e-166
 Identities = 296/456 (64%), Positives = 352/456 (77%), Gaps = 2/456 (0%)
 Frame = -2

Query: 1675 TSVLDVSASIHKTLSFDSQSINTLNQIEAXXXXXXXXXXXXXLHPRSSIHKTTHTDYKEL 1496
            T++LDV ASI K  +  + S   +                  LH R+S+ KT H DY+ L
Sbjct: 25   TTLLDVEASIQKAEAIFTSSATKMTPFNQQEIVTSSSQLTMELHSRTSVQKTKHPDYRSL 84

Query: 1495 TLARLGRDSVRVNSLQARLDLAIHGVLKSDLKPVYTE--LAAEELEVPVISGTSQGSGEY 1322
            TL+RL RDS RV S+  RLDLAIHG+  SDLKP+ T+    AE+L+ P+ISGTSQGSGEY
Sbjct: 85   TLSRLERDSARVKSINTRLDLAIHGLSTSDLKPLDTDSQFRAEDLQGPIISGTSQGSGEY 144

Query: 1321 FTRLGIGHPPSQLYMVLDTGSDVNWLQCAPCADCYQQTDPIFEPALSSSYSPLTCNTQQC 1142
            F+R+GIG P S +YMVLDTGSDVNW+QCAPCADCY Q DPIFEPA S+SYSPL+C+T+QC
Sbjct: 145  FSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPASSTSYSPLSCDTKQC 204

Query: 1141 KSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTETVTFGDSASVNDVAIGCGHSNEXXXXXX 962
            +SLDV +CRN+TCLYEVSYGDGSYTVGDFVTET+T G SASV++VAIGCGH+NE      
Sbjct: 205  QSLDVSECRNNTCLYEVSYGDGSYTVGDFVTETITLG-SASVDNVAIGCGHNNEGLFIGA 263

Query: 961  XXXXXXXXXXXXFPSQINATSFSYCLVDRDSDSASTLEFDSVIPPNAVTAPLVRNDKLNT 782
                        FPSQINA+SFSYCLVDRDSDSASTLEF+S + P+A+TAPL+RN +L+T
Sbjct: 264  AGLLGLGGGKLSFPSQINASSFSYCLVDRDSDSASTLEFNSALLPHAITAPLLRNRELDT 323

Query: 781  YYYIGLTGISVAGEMLKISESTFQLNNNGEGGVIIDSGTAVTRLQNGAYYSLRDAFKKGT 602
            +YY+G+TG+SV GE+L I ES F+++ +G GG+IIDSGTAVTRLQ  AY +LRDAF KGT
Sbjct: 324  FYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVTRLQTAAYNALRDAFVKGT 383

Query: 601  KDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHFSNGKEWSLPAKNYLIPVDSAGTFCLAF 422
            KDLP T  VALFDTCYDLS K SVEVPTV+FH + GK   LPA NYLIPVDS GTFC AF
Sbjct: 384  KDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHLAGGKVLPLPATNYLIPVDSDGTFCFAF 443

Query: 421  APTSSALSIIGNVQQQGTRVSYDLGHSLIGFTANKC 314
            APTSSALSIIGNVQQQGTRV +DL +SL+GF   +C
Sbjct: 444  APTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479


>ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297336528|gb|EFH66945.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  580 bits (1495), Expect = e-163
 Identities = 296/483 (61%), Positives = 362/483 (74%), Gaps = 5/483 (1%)
 Frame = -2

Query: 1747 FFFSILLTVSFTPVLSRNLQLRD--TTSVLDVSASIHKTLSFDSQSINTLNQIEAXXXXX 1574
            FFF   LT S + V SR L      TTS+L+V+ SIH+T    S  +N   +        
Sbjct: 10   FFFVFFLT-SHSFVFSRILPKTSVTTTSILNVADSIHRTKYTSSFRLNQQEE----QTHS 64

Query: 1573 XXXXXXXXLHPRSSIHKTTHTDYKELTLARLGRDSVRVNSLQARLDLAIHGVLKSDLKPV 1394
                    LH R S+  T H+DYK LTLARL RD+ RV SL  RLDLAI+ + K+DLKPV
Sbjct: 65   RSSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPV 124

Query: 1393 ---YTELAAEELEVPVISGTSQGSGEYFTRLGIGHPPSQLYMVLDTGSDVNWLQCAPCAD 1223
               YT    E++E P+ISGT+QGSGEYFTR+GIG+P  ++YMVLDTGSDVNWLQC PCAD
Sbjct: 125  TTMYTTTEEEDIEAPLISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCAD 184

Query: 1222 CYQQTDPIFEPALSSSYSPLTCNTQQCKSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTET 1043
            CY QT+PIFEP+ SSSY PL+C+T QC +L+V +CRN TCLYEVSYGDGSYTVGDF TET
Sbjct: 185  CYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATET 244

Query: 1042 VTFGDSASVNDVAIGCGHSNEXXXXXXXXXXXXXXXXXXFPSQINATSFSYCLVDRDSDS 863
            +T G S  V +VA+GCGHSNE                   PSQ+N TSFSYCLVDRDSDS
Sbjct: 245  LTIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDS 303

Query: 862  ASTLEFDSVIPPNAVTAPLVRNDKLNTYYYIGLTGISVAGEMLKISESTFQLNNNGEGGV 683
            AST+EF + +PP+AV APL+RN +L+T+YY+GLTGISV GE+L+I +S+F+++ +G GG+
Sbjct: 304  ASTVEFGTSLPPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGI 363

Query: 682  IIDSGTAVTRLQNGAYYSLRDAFKKGTKDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHF 503
            IIDSGTAVTRLQ G Y SLRD+F KGT DL    GVA+FDTCY+LSAK ++EVPTV+FHF
Sbjct: 364  IIDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVAFHF 423

Query: 502  SNGKEWSLPAKNYLIPVDSAGTFCLAFAPTSSALSIIGNVQQQGTRVSYDLGHSLIGFTA 323
              GK  +LPAKNY+IPVDS GTFCLAFAPT+S+L+IIGNVQQQGTRV++DL +SLIGF++
Sbjct: 424  PGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSS 483

Query: 322  NKC 314
            NKC
Sbjct: 484  NKC 486


>ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
            gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical
            protein [Arabidopsis thaliana] gi|20466516|gb|AAM20575.1|
            unknown protein [Arabidopsis thaliana]
            gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis
            thaliana] gi|110736960|dbj|BAF00436.1| hypothetical
            protein [Arabidopsis thaliana]
            gi|332192515|gb|AEE30636.1| aspartyl protease-like
            protein [Arabidopsis thaliana]
          Length = 483

 Score =  576 bits (1484), Expect = e-162
 Identities = 294/488 (60%), Positives = 360/488 (73%), Gaps = 4/488 (0%)
 Frame = -2

Query: 1765 MAQRVSFFFSILLTVSFTPVLSRNLQLRDTT--SVLDVSASIHKTLSFDSQSINTLNQIE 1592
            M+   SFFF I    S + V SR L    TT  S+L+V+ SIH+T    S  +N   +  
Sbjct: 1    MSPNYSFFFFIFFLTSHSSVFSRILPETSTTTTSILNVADSIHRTKYTSSFRLNQQEE-- 58

Query: 1591 AXXXXXXXXXXXXXLHPRSSIHKTTHTDYKELTLARLGRDSVRVNSLQARLDLAIHGVLK 1412
                          LH R S+  T H+DYK LTLARL RD+ RV SL  RLDLAI+ + K
Sbjct: 59   --QTHSASSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISK 116

Query: 1411 SDLKPVYTELAAEE--LEVPVISGTSQGSGEYFTRLGIGHPPSQLYMVLDTGSDVNWLQC 1238
            +DLKP+ T    EE  +E P+ISGT+QGSGEYFTR+GIG P  ++YMVLDTGSDVNWLQC
Sbjct: 117  ADLKPISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQC 176

Query: 1237 APCADCYQQTDPIFEPALSSSYSPLTCNTQQCKSLDVFQCRNDTCLYEVSYGDGSYTVGD 1058
             PCADCY QT+PIFEP+ SSSY PL+C+T QC +L+V +CRN TCLYEVSYGDGSYTVGD
Sbjct: 177  TPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGD 236

Query: 1057 FVTETVTFGDSASVNDVAIGCGHSNEXXXXXXXXXXXXXXXXXXFPSQINATSFSYCLVD 878
            F TET+T G S  V +VA+GCGHSNE                   PSQ+N TSFSYCLVD
Sbjct: 237  FATETLTIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVD 295

Query: 877  RDSDSASTLEFDSVIPPNAVTAPLVRNDKLNTYYYIGLTGISVAGEMLKISESTFQLNNN 698
            RDSDSAST++F + + P+AV APL+RN +L+T+YY+GLTGISV GE+L+I +S+F+++ +
Sbjct: 296  RDSDSASTVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDES 355

Query: 697  GEGGVIIDSGTAVTRLQNGAYYSLRDAFKKGTKDLPSTDGVALFDTCYDLSAKKSVEVPT 518
            G GG+IIDSGTAVTRLQ   Y SLRD+F KGT DL    GVA+FDTCY+LSAK +VEVPT
Sbjct: 356  GSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPT 415

Query: 517  VSFHFSNGKEWSLPAKNYLIPVDSAGTFCLAFAPTSSALSIIGNVQQQGTRVSYDLGHSL 338
            V+FHF  GK  +LPAKNY+IPVDS GTFCLAFAPT+S+L+IIGNVQQQGTRV++DL +SL
Sbjct: 416  VAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSL 475

Query: 337  IGFTANKC 314
            IGF++NKC
Sbjct: 476  IGFSSNKC 483


Top