BLASTX nr result

ID: Angelica22_contig00002442 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00002442
         (1935 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002302634.1| predicted protein [Populus trichocarpa] gi|2...   580   e-163
ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis tha...   566   e-159
ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1...   565   e-158
ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor,...   564   e-158
ref|XP_002892074.1| aspartyl protease family protein [Arabidopsi...   563   e-158

>ref|XP_002302634.1| predicted protein [Populus trichocarpa] gi|222844360|gb|EEE81907.1|
            predicted protein [Populus trichocarpa]
          Length = 490

 Score =  580 bits (1494), Expect = e-163
 Identities = 298/469 (63%), Positives = 349/469 (74%), Gaps = 1/469 (0%)
 Frame = -1

Query: 1731 QYQTLIPKSLPLLHPLTWDQQSDTLTHLAGDELTVAXXXXXXXXXENTLSLQLHHLDYLN 1552
            Q+QTL    LP    L+W   +DT      +  T+            +LS+QLHHLD L+
Sbjct: 33   QFQTLTVNPLPNKPTLSW---ADTEPESEPETQTLTDSTSTEASTTTSLSVQLHHLDALS 89

Query: 1551 LDHTNANTTSDALFINRITRDAGRVDALSTIAALKTNAIRTRHRGKATSDFSSSIISGLA 1372
             D T  +     LF +R+ RDA RV +L+++AA   +  RTR RG     FSSS+ SGLA
Sbjct: 90   SDETPQD-----LFNSRLARDASRVKSLTSLAAAVGSTNRTRARGPG---FSSSVTSGLA 141

Query: 1371 HGSGEYFTRLGVGTPARYSYMVLDTGSDVVWIQCSPCRKCYTQSDPVFDPTKSSTFGGVA 1192
             GSGEYFTRLGVGTPARY +MVLDTGSDVVWIQC+PC+KCY+Q+DPVF+PTKS +F  + 
Sbjct: 142  QGSGEYFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANIP 201

Query: 1191 CGSPLCRRLESPGCNSGKK-CMYQVSYGDGSFTVGEFSTETMTFRKNRVKNIALGCGHDN 1015
            CGSPLCRRL+SPGC++ K  C+YQVSYGDGSFT GEFSTET+TFR  RV  +ALGCGHDN
Sbjct: 202  CGSPLCRRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGTRVGRVALGCGHDN 261

Query: 1014 EGLFVXXXXXXXXXXXXLSFPSQAGPRFGRAFSYCLVDRSASSKPSSIIFGSAAVPRKAV 835
            EGLF+            LSFPSQ G RF R FSYCLVDRSASSKPS ++FG +A+ R A 
Sbjct: 262  EGLFIGAAGLLGLGRGRLSFPSQIGRRFSRKFSYCLVDRSASSKPSYMVFGDSAISRTAR 321

Query: 834  FTPLISNPKLDTFYYIGLTXXXXXXXXXXXXXXSHFKIDAAGNGGVIIDSGTSVTRLTRP 655
            FTPL+SNPKLDTFYY+ L               S FK+D+ GNGGVIIDSGTSVTRLTRP
Sbjct: 322  FTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGTSVTRLTRP 381

Query: 654  AYIAMRDAFRVGARNLKRAPSFSLFDTCFDLSGVSEVKVPTVLFHFKGANVALPASNYLI 475
            AY+A+RDAFRVGA NLKRAP FSLFDTCFDLSG +EVKVPTV+ HF+GA+V+LPASNYLI
Sbjct: 382  AYVALRDAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPASNYLI 441

Query: 474  PVDSKGTFCFAFAGTSNGLSIIGNIQQQGFRVVYDLGKSRVGFAKNGCA 328
            PVD+ G+FCFAFAGT +GLSI+GNIQQQGFRVVYDL  SRVGFA  GCA
Sbjct: 442  PVDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGFAPRGCA 490


>ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
            gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein
            [Arabidopsis thaliana] gi|22135930|gb|AAM91547.1|
            chloroplast nucleoid DNA binding protein, putative
            [Arabidopsis thaliana] gi|30387595|gb|AAP31963.1|
            At1g01300 [Arabidopsis thaliana]
            gi|332189147|gb|AEE27268.1| aspartyl protease-like
            protein [Arabidopsis thaliana]
          Length = 485

 Score =  566 bits (1459), Expect = e-159
 Identities = 290/470 (61%), Positives = 346/470 (73%), Gaps = 3/470 (0%)
 Frame = -1

Query: 1728 YQTLIPKS--LPLLHPLTWDQQSDTLTHLAGDELTVAXXXXXXXXXENTLSLQLHHLDYL 1555
            +QTL P S  LP   P+++   SD+ + L  +               ++++L L H+D L
Sbjct: 28   FQTLFPNSHSLPCASPVSFQPDSDSESLLESE-----FESGSDSESSSSITLNLDHIDAL 82

Query: 1554 NLDHTNANTTSDALFINRITRDAGRVDALSTIAALKTNAIRTRHRGKATSDFSSSIISGL 1375
            +     +N T D LF +R+ RD+ RV +++T+AA      R          FSSS++SGL
Sbjct: 83   S-----SNKTPDELFSSRLQRDSRRVKSIATLAAQIPG--RNVTHAPRPGGFSSSVVSGL 135

Query: 1374 AHGSGEYFTRLGVGTPARYSYMVLDTGSDVVWIQCSPCRKCYTQSDPVFDPTKSSTFGGV 1195
            + GSGEYFTRLGVGTPARY YMVLDTGSD+VW+QC+PCR+CY+QSDP+FDP KS T+  +
Sbjct: 136  SQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATI 195

Query: 1194 ACGSPLCRRLESPGCNSGKK-CMYQVSYGDGSFTVGEFSTETMTFRKNRVKNIALGCGHD 1018
             C SP CRRL+S GCN+ +K C+YQVSYGDGSFTVG+FSTET+TFR+NRVK +ALGCGHD
Sbjct: 196  PCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHD 255

Query: 1017 NEGLFVXXXXXXXXXXXXLSFPSQAGPRFGRAFSYCLVDRSASSKPSSIIFGSAAVPRKA 838
            NEGLFV            LSFP Q G RF + FSYCLVDRSASSKPSS++FG+AAV R A
Sbjct: 256  NEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIA 315

Query: 837  VFTPLISNPKLDTFYYIGLTXXXXXXXXXXXXXXSHFKIDAAGNGGVIIDSGTSVTRLTR 658
             FTPL+SNPKLDTFYY+GL               S FK+D  GNGGVIIDSGTSVTRL R
Sbjct: 316  RFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIR 375

Query: 657  PAYIAMRDAFRVGARNLKRAPSFSLFDTCFDLSGVSEVKVPTVLFHFKGANVALPASNYL 478
            PAYIAMRDAFRVGA+ LKRAP FSLFDTCFDLS ++EVKVPTV+ HF+GA+V+LPA+NYL
Sbjct: 376  PAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYL 435

Query: 477  IPVDSKGTFCFAFAGTSNGLSIIGNIQQQGFRVVYDLGKSRVGFAKNGCA 328
            IPVD+ G FCFAFAGT  GLSIIGNIQQQGFRVVYDL  SRVGFA  GCA
Sbjct: 436  IPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485


>ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score =  565 bits (1455), Expect = e-158
 Identities = 283/420 (67%), Positives = 329/420 (78%), Gaps = 1/420 (0%)
 Frame = -1

Query: 1587 LSLQLHHLDYLNLDHTNANTTSDALFINRITRDAGRVDALSTIAALKTNAIRTRHRGKAT 1408
            LSL LHH+D L+     +N T + LF  R+ RDA RV+ +  +AAL  +     H  ++ 
Sbjct: 62   LSLHLHHIDALS-----SNKTPEQLFQLRLQRDAKRVEGVVALAALNQS-----HARRSG 111

Query: 1407 SDFSSSIISGLAHGSGEYFTRLGVGTPARYSYMVLDTGSDVVWIQCSPCRKCYTQSDPVF 1228
            S FSSSIISGLA GSGEYFTR+GVGTPARY YMVLDTGSDVVW+QC+PCRKCYTQ+DPVF
Sbjct: 112  SSFSSSIISGLAQGSGEYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVF 171

Query: 1227 DPTKSSTFGGVACGSPLCRRLESPGCNSGKK-CMYQVSYGDGSFTVGEFSTETMTFRKNR 1051
            DPTKS T+ G+ CG+PLCRRL+SPGCN+  K C YQVSYGDGSFT G+FSTET+TFR+ R
Sbjct: 172  DPTKSRTYAGIPCGAPLCRRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRTR 231

Query: 1050 VKNIALGCGHDNEGLFVXXXXXXXXXXXXLSFPSQAGPRFGRAFSYCLVDRSASSKPSSI 871
            V  +ALGCGHDNEGLF+            LSFP Q G RF + FSYCLVDRSAS+KPSS+
Sbjct: 232  VTRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSYCLVDRSASAKPSSV 291

Query: 870  IFGSAAVPRKAVFTPLISNPKLDTFYYIGLTXXXXXXXXXXXXXXSHFKIDAAGNGGVII 691
            +FG +AV R A FTPLI NPKLDTFYY+ L               S F++DAAGNGGVII
Sbjct: 292  VFGDSAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVII 351

Query: 690  DSGTSVTRLTRPAYIAMRDAFRVGARNLKRAPSFSLFDTCFDLSGVSEVKVPTVLFHFKG 511
            DSGTSVTRLTRPAYIA+RDAFRVGA +LKRA  FSLFDTCFDLSG++EVKVPTV+ HF+G
Sbjct: 352  DSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVLHFRG 411

Query: 510  ANVALPASNYLIPVDSKGTFCFAFAGTSNGLSIIGNIQQQGFRVVYDLGKSRVGFAKNGC 331
            A+V+LPA+NYLIPVD+ G+FCFAFAGT +GLSIIGNIQQQGFRV +DL  SRVGFA  GC
Sbjct: 412  ADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAPRGC 471


>ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223537425|gb|EEF39053.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 469

 Score =  564 bits (1453), Expect = e-158
 Identities = 296/470 (62%), Positives = 338/470 (71%), Gaps = 3/470 (0%)
 Frame = -1

Query: 1728 YQTLIPKSLPLLHPLTW-DQQSDTLTHLAGDELTVAXXXXXXXXXENTLSLQLHHLDYLN 1552
            YQTL+   L     L+W D +S T T  +                  T S+QLHH+D L+
Sbjct: 28   YQTLVANPLRSQPTLSWTDSESPTDTAESSA----------------TFSVQLHHVDALS 71

Query: 1551 LDHTNANTTSDALFINRITRDAGRVDALSTIAALKTNAIRTRHRGKAT-SDFSSSIISGL 1375
                  N+T + LF  R+ RDA RV+A+S +A        T   GK   + FSSS+ISGL
Sbjct: 72   F-----NSTPETLFTTRLQRDAARVEAISYLA-------ETAGTGKRVGTGFSSSVISGL 119

Query: 1374 AHGSGEYFTRLGVGTPARYSYMVLDTGSDVVWIQCSPCRKCYTQSDPVFDPTKSSTFGGV 1195
            A GSGEYFTR+GVGTP RY YMVLDTGSD+VWIQC+PC++CY QSDPVFDP KS +F  +
Sbjct: 120  AQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASI 179

Query: 1194 ACGSPLCRRLESPGCNSGKK-CMYQVSYGDGSFTVGEFSTETMTFRKNRVKNIALGCGHD 1018
            AC SPLC RL+SPGCN+ K+ CMYQVSYGDGSFT G+FSTET+TFR+ RV  +ALGCGHD
Sbjct: 180  ACRSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTRVARVALGCGHD 239

Query: 1017 NEGLFVXXXXXXXXXXXXLSFPSQAGPRFGRAFSYCLVDRSASSKPSSIIFGSAAVPRKA 838
            NEGLFV            LSFPSQ G RF   FSYCLVDRSASSKPSS++FG +AV R A
Sbjct: 240  NEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASSKPSSMVFGDSAVSRTA 299

Query: 837  VFTPLISNPKLDTFYYIGLTXXXXXXXXXXXXXXSHFKIDAAGNGGVIIDSGTSVTRLTR 658
             FTPL+SNPKLDTFYY+ L               S FK+D  GNGGVIIDSGTSVTRLTR
Sbjct: 300  RFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIIDSGTSVTRLTR 359

Query: 657  PAYIAMRDAFRVGARNLKRAPSFSLFDTCFDLSGVSEVKVPTVLFHFKGANVALPASNYL 478
            PAYIA RDAFR GA NLKRAP FSLFDTCFDLSG +EVKVPTV+ HF+GA+V+LPASNYL
Sbjct: 360  PAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPASNYL 419

Query: 477  IPVDSKGTFCFAFAGTSNGLSIIGNIQQQGFRVVYDLGKSRVGFAKNGCA 328
            IPVD+ G FC AFAGT  GLSIIGNIQQQGFRVVYDL  SRVGFA +GCA
Sbjct: 420  IPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSRVGFAPHGCA 469


>ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297337916|gb|EFH68333.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score =  563 bits (1451), Expect = e-158
 Identities = 290/470 (61%), Positives = 346/470 (73%), Gaps = 3/470 (0%)
 Frame = -1

Query: 1728 YQTLIPKS--LPLLHPLTWDQQSDTLTHLAGDELTVAXXXXXXXXXENTLSLQLHHLDYL 1555
            +QTLIP S  LP   P+++  +S+        E  +          E++++L L H+D L
Sbjct: 28   FQTLIPNSHSLPSASPISFQPESEP-----DSESLLGSEFESGSDSESSITLNLDHIDAL 82

Query: 1554 NLDHTNANTTSDALFINRITRDAGRVDALSTIAALKTNAIRTRHRGKATSDFSSSIISGL 1375
            +     +N T   LF +R+ RD+ RV +++T+AA      R       T  FSSS++SGL
Sbjct: 83   S-----SNKTPQELFSSRLQRDSRRVKSIATLAAQIPG--RNVTHAPRTGGFSSSVVSGL 135

Query: 1374 AHGSGEYFTRLGVGTPARYSYMVLDTGSDVVWIQCSPCRKCYTQSDPVFDPTKSSTFGGV 1195
            + GSGEYFTRLGVGTPARY YMVLDTGSD+VW+QC+PCR+CY+QSDP+FDP KS T+  +
Sbjct: 136  SQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATI 195

Query: 1194 ACGSPLCRRLESPGCNSGKK-CMYQVSYGDGSFTVGEFSTETMTFRKNRVKNIALGCGHD 1018
             C SP CRRL+S GCN+ +K C+YQVSYGDGSFTVG+FSTET+TFR+NRVK +ALGCGHD
Sbjct: 196  PCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHD 255

Query: 1017 NEGLFVXXXXXXXXXXXXLSFPSQAGPRFGRAFSYCLVDRSASSKPSSIIFGSAAVPRKA 838
            NEGLFV            LSFP Q G RF + FSYCLVDRSASSKPSS++FG+AAV R A
Sbjct: 256  NEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIA 315

Query: 837  VFTPLISNPKLDTFYYIGLTXXXXXXXXXXXXXXSHFKIDAAGNGGVIIDSGTSVTRLTR 658
             FTPL+SNPKLDTFYY+ L               S FK+D  GNGGVIIDSGTSVTRL R
Sbjct: 316  RFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQIGNGGVIIDSGTSVTRLIR 375

Query: 657  PAYIAMRDAFRVGARNLKRAPSFSLFDTCFDLSGVSEVKVPTVLFHFKGANVALPASNYL 478
            PAYIAMRDAFRVGA+ LKRAP FSLFDTCFDLS ++EVKVPTV+ HF+GA+V+LPA+NYL
Sbjct: 376  PAYIAMRDAFRVGAKALKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYL 435

Query: 477  IPVDSKGTFCFAFAGTSNGLSIIGNIQQQGFRVVYDLGKSRVGFAKNGCA 328
            IPVD+ G FCFAFAGT  GLSIIGNIQQQGFRVVYDL  SRVGFA  GCA
Sbjct: 436  IPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485


Top