BLASTX nr result
ID: Angelica22_contig00002442
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica22_contig00002442 (1935 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002302634.1| predicted protein [Populus trichocarpa] gi|2... 580 e-163 ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis tha... 566 e-159 ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1... 565 e-158 ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor,... 564 e-158 ref|XP_002892074.1| aspartyl protease family protein [Arabidopsi... 563 e-158 >ref|XP_002302634.1| predicted protein [Populus trichocarpa] gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa] Length = 490 Score = 580 bits (1494), Expect = e-163 Identities = 298/469 (63%), Positives = 349/469 (74%), Gaps = 1/469 (0%) Frame = -1 Query: 1731 QYQTLIPKSLPLLHPLTWDQQSDTLTHLAGDELTVAXXXXXXXXXENTLSLQLHHLDYLN 1552 Q+QTL LP L+W +DT + T+ +LS+QLHHLD L+ Sbjct: 33 QFQTLTVNPLPNKPTLSW---ADTEPESEPETQTLTDSTSTEASTTTSLSVQLHHLDALS 89 Query: 1551 LDHTNANTTSDALFINRITRDAGRVDALSTIAALKTNAIRTRHRGKATSDFSSSIISGLA 1372 D T + LF +R+ RDA RV +L+++AA + RTR RG FSSS+ SGLA Sbjct: 90 SDETPQD-----LFNSRLARDASRVKSLTSLAAAVGSTNRTRARGPG---FSSSVTSGLA 141 Query: 1371 HGSGEYFTRLGVGTPARYSYMVLDTGSDVVWIQCSPCRKCYTQSDPVFDPTKSSTFGGVA 1192 GSGEYFTRLGVGTPARY +MVLDTGSDVVWIQC+PC+KCY+Q+DPVF+PTKS +F + Sbjct: 142 QGSGEYFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANIP 201 Query: 1191 CGSPLCRRLESPGCNSGKK-CMYQVSYGDGSFTVGEFSTETMTFRKNRVKNIALGCGHDN 1015 CGSPLCRRL+SPGC++ K C+YQVSYGDGSFT GEFSTET+TFR RV +ALGCGHDN Sbjct: 202 CGSPLCRRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGTRVGRVALGCGHDN 261 Query: 1014 EGLFVXXXXXXXXXXXXLSFPSQAGPRFGRAFSYCLVDRSASSKPSSIIFGSAAVPRKAV 835 EGLF+ LSFPSQ G RF R FSYCLVDRSASSKPS ++FG +A+ R A Sbjct: 262 EGLFIGAAGLLGLGRGRLSFPSQIGRRFSRKFSYCLVDRSASSKPSYMVFGDSAISRTAR 321 Query: 834 FTPLISNPKLDTFYYIGLTXXXXXXXXXXXXXXSHFKIDAAGNGGVIIDSGTSVTRLTRP 655 FTPL+SNPKLDTFYY+ L S FK+D+ GNGGVIIDSGTSVTRLTRP Sbjct: 322 FTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGTSVTRLTRP 381 Query: 654 AYIAMRDAFRVGARNLKRAPSFSLFDTCFDLSGVSEVKVPTVLFHFKGANVALPASNYLI 475 AY+A+RDAFRVGA NLKRAP FSLFDTCFDLSG +EVKVPTV+ HF+GA+V+LPASNYLI Sbjct: 382 AYVALRDAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPASNYLI 441 Query: 474 PVDSKGTFCFAFAGTSNGLSIIGNIQQQGFRVVYDLGKSRVGFAKNGCA 328 PVD+ G+FCFAFAGT +GLSI+GNIQQQGFRVVYDL SRVGFA GCA Sbjct: 442 PVDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGFAPRGCA 490 >ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana] gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana] gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis thaliana] gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana] gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana] Length = 485 Score = 566 bits (1459), Expect = e-159 Identities = 290/470 (61%), Positives = 346/470 (73%), Gaps = 3/470 (0%) Frame = -1 Query: 1728 YQTLIPKS--LPLLHPLTWDQQSDTLTHLAGDELTVAXXXXXXXXXENTLSLQLHHLDYL 1555 +QTL P S LP P+++ SD+ + L + ++++L L H+D L Sbjct: 28 FQTLFPNSHSLPCASPVSFQPDSDSESLLESE-----FESGSDSESSSSITLNLDHIDAL 82 Query: 1554 NLDHTNANTTSDALFINRITRDAGRVDALSTIAALKTNAIRTRHRGKATSDFSSSIISGL 1375 + +N T D LF +R+ RD+ RV +++T+AA R FSSS++SGL Sbjct: 83 S-----SNKTPDELFSSRLQRDSRRVKSIATLAAQIPG--RNVTHAPRPGGFSSSVVSGL 135 Query: 1374 AHGSGEYFTRLGVGTPARYSYMVLDTGSDVVWIQCSPCRKCYTQSDPVFDPTKSSTFGGV 1195 + GSGEYFTRLGVGTPARY YMVLDTGSD+VW+QC+PCR+CY+QSDP+FDP KS T+ + Sbjct: 136 SQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATI 195 Query: 1194 ACGSPLCRRLESPGCNSGKK-CMYQVSYGDGSFTVGEFSTETMTFRKNRVKNIALGCGHD 1018 C SP CRRL+S GCN+ +K C+YQVSYGDGSFTVG+FSTET+TFR+NRVK +ALGCGHD Sbjct: 196 PCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHD 255 Query: 1017 NEGLFVXXXXXXXXXXXXLSFPSQAGPRFGRAFSYCLVDRSASSKPSSIIFGSAAVPRKA 838 NEGLFV LSFP Q G RF + FSYCLVDRSASSKPSS++FG+AAV R A Sbjct: 256 NEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIA 315 Query: 837 VFTPLISNPKLDTFYYIGLTXXXXXXXXXXXXXXSHFKIDAAGNGGVIIDSGTSVTRLTR 658 FTPL+SNPKLDTFYY+GL S FK+D GNGGVIIDSGTSVTRL R Sbjct: 316 RFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIR 375 Query: 657 PAYIAMRDAFRVGARNLKRAPSFSLFDTCFDLSGVSEVKVPTVLFHFKGANVALPASNYL 478 PAYIAMRDAFRVGA+ LKRAP FSLFDTCFDLS ++EVKVPTV+ HF+GA+V+LPA+NYL Sbjct: 376 PAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYL 435 Query: 477 IPVDSKGTFCFAFAGTSNGLSIIGNIQQQGFRVVYDLGKSRVGFAKNGCA 328 IPVD+ G FCFAFAGT GLSIIGNIQQQGFRVVYDL SRVGFA GCA Sbjct: 436 IPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485 >ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max] Length = 472 Score = 565 bits (1455), Expect = e-158 Identities = 283/420 (67%), Positives = 329/420 (78%), Gaps = 1/420 (0%) Frame = -1 Query: 1587 LSLQLHHLDYLNLDHTNANTTSDALFINRITRDAGRVDALSTIAALKTNAIRTRHRGKAT 1408 LSL LHH+D L+ +N T + LF R+ RDA RV+ + +AAL + H ++ Sbjct: 62 LSLHLHHIDALS-----SNKTPEQLFQLRLQRDAKRVEGVVALAALNQS-----HARRSG 111 Query: 1407 SDFSSSIISGLAHGSGEYFTRLGVGTPARYSYMVLDTGSDVVWIQCSPCRKCYTQSDPVF 1228 S FSSSIISGLA GSGEYFTR+GVGTPARY YMVLDTGSDVVW+QC+PCRKCYTQ+DPVF Sbjct: 112 SSFSSSIISGLAQGSGEYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVF 171 Query: 1227 DPTKSSTFGGVACGSPLCRRLESPGCNSGKK-CMYQVSYGDGSFTVGEFSTETMTFRKNR 1051 DPTKS T+ G+ CG+PLCRRL+SPGCN+ K C YQVSYGDGSFT G+FSTET+TFR+ R Sbjct: 172 DPTKSRTYAGIPCGAPLCRRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRTR 231 Query: 1050 VKNIALGCGHDNEGLFVXXXXXXXXXXXXLSFPSQAGPRFGRAFSYCLVDRSASSKPSSI 871 V +ALGCGHDNEGLF+ LSFP Q G RF + FSYCLVDRSAS+KPSS+ Sbjct: 232 VTRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSYCLVDRSASAKPSSV 291 Query: 870 IFGSAAVPRKAVFTPLISNPKLDTFYYIGLTXXXXXXXXXXXXXXSHFKIDAAGNGGVII 691 +FG +AV R A FTPLI NPKLDTFYY+ L S F++DAAGNGGVII Sbjct: 292 VFGDSAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVII 351 Query: 690 DSGTSVTRLTRPAYIAMRDAFRVGARNLKRAPSFSLFDTCFDLSGVSEVKVPTVLFHFKG 511 DSGTSVTRLTRPAYIA+RDAFRVGA +LKRA FSLFDTCFDLSG++EVKVPTV+ HF+G Sbjct: 352 DSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVLHFRG 411 Query: 510 ANVALPASNYLIPVDSKGTFCFAFAGTSNGLSIIGNIQQQGFRVVYDLGKSRVGFAKNGC 331 A+V+LPA+NYLIPVD+ G+FCFAFAGT +GLSIIGNIQQQGFRV +DL SRVGFA GC Sbjct: 412 ADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAPRGC 471 >ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis] gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis] Length = 469 Score = 564 bits (1453), Expect = e-158 Identities = 296/470 (62%), Positives = 338/470 (71%), Gaps = 3/470 (0%) Frame = -1 Query: 1728 YQTLIPKSLPLLHPLTW-DQQSDTLTHLAGDELTVAXXXXXXXXXENTLSLQLHHLDYLN 1552 YQTL+ L L+W D +S T T + T S+QLHH+D L+ Sbjct: 28 YQTLVANPLRSQPTLSWTDSESPTDTAESSA----------------TFSVQLHHVDALS 71 Query: 1551 LDHTNANTTSDALFINRITRDAGRVDALSTIAALKTNAIRTRHRGKAT-SDFSSSIISGL 1375 N+T + LF R+ RDA RV+A+S +A T GK + FSSS+ISGL Sbjct: 72 F-----NSTPETLFTTRLQRDAARVEAISYLA-------ETAGTGKRVGTGFSSSVISGL 119 Query: 1374 AHGSGEYFTRLGVGTPARYSYMVLDTGSDVVWIQCSPCRKCYTQSDPVFDPTKSSTFGGV 1195 A GSGEYFTR+GVGTP RY YMVLDTGSD+VWIQC+PC++CY QSDPVFDP KS +F + Sbjct: 120 AQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASI 179 Query: 1194 ACGSPLCRRLESPGCNSGKK-CMYQVSYGDGSFTVGEFSTETMTFRKNRVKNIALGCGHD 1018 AC SPLC RL+SPGCN+ K+ CMYQVSYGDGSFT G+FSTET+TFR+ RV +ALGCGHD Sbjct: 180 ACRSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTRVARVALGCGHD 239 Query: 1017 NEGLFVXXXXXXXXXXXXLSFPSQAGPRFGRAFSYCLVDRSASSKPSSIIFGSAAVPRKA 838 NEGLFV LSFPSQ G RF FSYCLVDRSASSKPSS++FG +AV R A Sbjct: 240 NEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASSKPSSMVFGDSAVSRTA 299 Query: 837 VFTPLISNPKLDTFYYIGLTXXXXXXXXXXXXXXSHFKIDAAGNGGVIIDSGTSVTRLTR 658 FTPL+SNPKLDTFYY+ L S FK+D GNGGVIIDSGTSVTRLTR Sbjct: 300 RFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIIDSGTSVTRLTR 359 Query: 657 PAYIAMRDAFRVGARNLKRAPSFSLFDTCFDLSGVSEVKVPTVLFHFKGANVALPASNYL 478 PAYIA RDAFR GA NLKRAP FSLFDTCFDLSG +EVKVPTV+ HF+GA+V+LPASNYL Sbjct: 360 PAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPASNYL 419 Query: 477 IPVDSKGTFCFAFAGTSNGLSIIGNIQQQGFRVVYDLGKSRVGFAKNGCA 328 IPVD+ G FC AFAGT GLSIIGNIQQQGFRVVYDL SRVGFA +GCA Sbjct: 420 IPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSRVGFAPHGCA 469 >ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata] gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata] Length = 485 Score = 563 bits (1451), Expect = e-158 Identities = 290/470 (61%), Positives = 346/470 (73%), Gaps = 3/470 (0%) Frame = -1 Query: 1728 YQTLIPKS--LPLLHPLTWDQQSDTLTHLAGDELTVAXXXXXXXXXENTLSLQLHHLDYL 1555 +QTLIP S LP P+++ +S+ E + E++++L L H+D L Sbjct: 28 FQTLIPNSHSLPSASPISFQPESEP-----DSESLLGSEFESGSDSESSITLNLDHIDAL 82 Query: 1554 NLDHTNANTTSDALFINRITRDAGRVDALSTIAALKTNAIRTRHRGKATSDFSSSIISGL 1375 + +N T LF +R+ RD+ RV +++T+AA R T FSSS++SGL Sbjct: 83 S-----SNKTPQELFSSRLQRDSRRVKSIATLAAQIPG--RNVTHAPRTGGFSSSVVSGL 135 Query: 1374 AHGSGEYFTRLGVGTPARYSYMVLDTGSDVVWIQCSPCRKCYTQSDPVFDPTKSSTFGGV 1195 + GSGEYFTRLGVGTPARY YMVLDTGSD+VW+QC+PCR+CY+QSDP+FDP KS T+ + Sbjct: 136 SQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATI 195 Query: 1194 ACGSPLCRRLESPGCNSGKK-CMYQVSYGDGSFTVGEFSTETMTFRKNRVKNIALGCGHD 1018 C SP CRRL+S GCN+ +K C+YQVSYGDGSFTVG+FSTET+TFR+NRVK +ALGCGHD Sbjct: 196 PCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHD 255 Query: 1017 NEGLFVXXXXXXXXXXXXLSFPSQAGPRFGRAFSYCLVDRSASSKPSSIIFGSAAVPRKA 838 NEGLFV LSFP Q G RF + FSYCLVDRSASSKPSS++FG+AAV R A Sbjct: 256 NEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIA 315 Query: 837 VFTPLISNPKLDTFYYIGLTXXXXXXXXXXXXXXSHFKIDAAGNGGVIIDSGTSVTRLTR 658 FTPL+SNPKLDTFYY+ L S FK+D GNGGVIIDSGTSVTRL R Sbjct: 316 RFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQIGNGGVIIDSGTSVTRLIR 375 Query: 657 PAYIAMRDAFRVGARNLKRAPSFSLFDTCFDLSGVSEVKVPTVLFHFKGANVALPASNYL 478 PAYIAMRDAFRVGA+ LKRAP FSLFDTCFDLS ++EVKVPTV+ HF+GA+V+LPA+NYL Sbjct: 376 PAYIAMRDAFRVGAKALKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYL 435 Query: 477 IPVDSKGTFCFAFAGTSNGLSIIGNIQQQGFRVVYDLGKSRVGFAKNGCA 328 IPVD+ G FCFAFAGT GLSIIGNIQQQGFRVVYDL SRVGFA GCA Sbjct: 436 IPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485