BLASTX nr result
ID: Angelica23_contig00013114
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica23_contig00013114 (1686 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2... 607 e-171 ref|XP_002315953.1| predicted protein [Populus trichocarpa] gi|2... 598 e-168 ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor,... 589 e-166 ref|XP_002890686.1| aspartyl protease family protein [Arabidopsi... 580 e-163 ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis tha... 574 e-161 >ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera] Length = 491 Score = 607 bits (1564), Expect = e-171 Identities = 307/481 (63%), Positives = 369/481 (76%), Gaps = 5/481 (1%) Frame = -1 Query: 1677 FSILLTVSFTPVLSRNLQLRDTTSVLDVSASIHKTLSFDS-----QSINTLNQIEAXXXX 1513 F+ T + V +R L L TT VLDVS SI ++L+ S + + +Q + Sbjct: 12 FAFFCTWGVSLVNARRLSLPRTT-VLDVSGSIRESLNVLSLNPQYEQMEFQHQERSFPSS 70 Query: 1512 XXXXXXXXXLHPRSSIHKTTHTDYKELTLARLGRDSVRVNSLQARLDLAIHGVLKSDLKP 1333 LH R+SIHK++H DYK L LARL RDS RV SL R+DLAI G+ KSDLKP Sbjct: 71 SSSSSLTLSLHSRTSIHKSSHKDYKSLVLARLERDSDRVRSLATRMDLAIAGITKSDLKP 130 Query: 1332 VYTELAAEELEVPVISGTSQGSGEYFTRLGIGHPPSQLYMVLDTGSDVNWLQCAPCADCY 1153 V EL AE LE P++SG SQGSGEYF+R+GIG PP +YMV+DTGSDVNW+QCAPCADCY Sbjct: 131 VEKELEAEALETPLVSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCY 190 Query: 1152 QQTDPIFEPALSSSYSPLTCNTQQCKSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTETVT 973 QQ DPIFEP+ SSSY+PLTC T QCKSLDV +CRND+CLYEVSYGDGSYTVGDF TET+T Sbjct: 191 QQADPIFEPSFSSSYAPLTCETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETIT 250 Query: 972 FGDSASVNDVAIGCGHSNEXXXXXXXXXXXXXXXXXXFPSQINATSFSYCLVDRDSDSAS 793 SAS+N+VAIGCGH NE FPSQINA+SFSYCLV+RD+DSAS Sbjct: 251 LDGSASLNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCLVNRDTDSAS 310 Query: 792 TLEFDSVIPPNAVTAPLVRNDKLNTYYYIGLTGISVAGEMLKISESTFQLNNNGEGGVII 613 TLEF+S IP ++VTAPL+RN++L+T+YY+G+TGI V G+ML I S+F+++ +G GG+I+ Sbjct: 311 TLEFNSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIV 370 Query: 612 DSGTAVTRLQNGAYYSLRDAFKKGTKDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHFSN 433 DSGTAVTRLQ+ Y SLRD+F +GT+ LPST GVALFDTCYDLS++ SVEVPTVSFHF + Sbjct: 371 DSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFPD 430 Query: 432 GKEWSLPAKNYLIPVDSAGTFCLAFAPTSSALSIIGNVQQQGTRVSYDLGHSLIGFTANK 253 GK +LPAKNYLIPVDSAGTFC AFAPT+SALSIIGNVQQQGTRVSYDL +SL+GF+ N Sbjct: 431 GKYLALPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFSPNG 490 Query: 252 C 250 C Sbjct: 491 C 491 >ref|XP_002315953.1| predicted protein [Populus trichocarpa] gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa] Length = 484 Score = 598 bits (1541), Expect = e-168 Identities = 302/458 (65%), Positives = 359/458 (78%), Gaps = 4/458 (0%) Frame = -1 Query: 1611 TSVLDVSASIHKTLSFDSQS--INTLNQIEAXXXXXXXXXXXXXLHPRSSIHKTTHTDYK 1438 T+VLDV+ASI +T + S ++ NQ E R+SI KTTHT YK Sbjct: 31 TTVLDVAASIQRTKNIFSSGPKMSPFNQQEKETTSSELTVELLS---RTSIQKTTHTGYK 87 Query: 1437 ELTLARLGRDSVRVNSLQARLDLAIHGVLKSDLKPVYT--ELAAEELEVPVISGTSQGSG 1264 LTL+RL RDS RV SL RLDLAI+ + SDLKP+ T E E+L+ P+ISGTSQGSG Sbjct: 88 SLTLSRLQRDSARVKSLVTRLDLAINSISSSDLKPLETDSEFKPEDLQSPIISGTSQGSG 147 Query: 1263 EYFTRLGIGHPPSQLYMVLDTGSDVNWLQCAPCADCYQQTDPIFEPALSSSYSPLTCNTQ 1084 EYF+R+GIG PPSQ Y++LDTGSDVNW+QCAPCADCYQQ DPIFEPA S+S+S L+CNT+ Sbjct: 148 EYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPASSASFSTLSCNTR 207 Query: 1083 QCKSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTETVTFGDSASVNDVAIGCGHSNEXXXX 904 QC+SLDV +CRNDTCLYEVSYGDGSYTVGDFVTET+T G SA V++VAIGCGH+NE Sbjct: 208 QCRSLDVSECRNDTCLYEVSYGDGSYTVGDFVTETITLG-SAPVDNVAIGCGHNNEGLFV 266 Query: 903 XXXXXXXXXXXXXXFPSQINATSFSYCLVDRDSDSASTLEFDSVIPPNAVTAPLVRNDKL 724 FPSQINATSFSYCLVDRDS+SASTLEF+S +PPNAV+APL+RN L Sbjct: 267 GAAGLLGLGGGSLSFPSQINATSFSYCLVDRDSESASTLEFNSTLPPNAVSAPLLRNHHL 326 Query: 723 NTYYYIGLTGISVAGEMLKISESTFQLNNNGEGGVIIDSGTAVTRLQNGAYYSLRDAFKK 544 +T+YY+GLTG+SV GE++ I ES FQ++ +G GGVI+DSGTA+TRLQ Y SLRDAF K Sbjct: 327 DTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDSGTAITRLQTDVYNSLRDAFVK 386 Query: 543 GTKDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHFSNGKEWSLPAKNYLIPVDSAGTFCL 364 T+DLPST+G+ALFDTCYDLS+K +VEVPTVSFHF +GKE LPAKNYL+P+DS GTFC Sbjct: 387 RTRDLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPAKNYLVPLDSEGTFCF 446 Query: 363 AFAPTSSALSIIGNVQQQGTRVSYDLGHSLIGFTANKC 250 AFAPT+S+LSIIGNVQQQGTRV YDL + L+GF NKC Sbjct: 447 AFAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484 >ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis] gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis] Length = 479 Score = 589 bits (1519), Expect = e-166 Identities = 296/456 (64%), Positives = 352/456 (77%), Gaps = 2/456 (0%) Frame = -1 Query: 1611 TSVLDVSASIHKTLSFDSQSINTLNQIEAXXXXXXXXXXXXXLHPRSSIHKTTHTDYKEL 1432 T++LDV ASI K + + S + LH R+S+ KT H DY+ L Sbjct: 25 TTLLDVEASIQKAEAIFTSSATKMTPFNQQEIVTSSSQLTMELHSRTSVQKTKHPDYRSL 84 Query: 1431 TLARLGRDSVRVNSLQARLDLAIHGVLKSDLKPVYTE--LAAEELEVPVISGTSQGSGEY 1258 TL+RL RDS RV S+ RLDLAIHG+ SDLKP+ T+ AE+L+ P+ISGTSQGSGEY Sbjct: 85 TLSRLERDSARVKSINTRLDLAIHGLSTSDLKPLDTDSQFRAEDLQGPIISGTSQGSGEY 144 Query: 1257 FTRLGIGHPPSQLYMVLDTGSDVNWLQCAPCADCYQQTDPIFEPALSSSYSPLTCNTQQC 1078 F+R+GIG P S +YMVLDTGSDVNW+QCAPCADCY Q DPIFEPA S+SYSPL+C+T+QC Sbjct: 145 FSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPASSTSYSPLSCDTKQC 204 Query: 1077 KSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTETVTFGDSASVNDVAIGCGHSNEXXXXXX 898 +SLDV +CRN+TCLYEVSYGDGSYTVGDFVTET+T G SASV++VAIGCGH+NE Sbjct: 205 QSLDVSECRNNTCLYEVSYGDGSYTVGDFVTETITLG-SASVDNVAIGCGHNNEGLFIGA 263 Query: 897 XXXXXXXXXXXXFPSQINATSFSYCLVDRDSDSASTLEFDSVIPPNAVTAPLVRNDKLNT 718 FPSQINA+SFSYCLVDRDSDSASTLEF+S + P+A+TAPL+RN +L+T Sbjct: 264 AGLLGLGGGKLSFPSQINASSFSYCLVDRDSDSASTLEFNSALLPHAITAPLLRNRELDT 323 Query: 717 YYYIGLTGISVAGEMLKISESTFQLNNNGEGGVIIDSGTAVTRLQNGAYYSLRDAFKKGT 538 +YY+G+TG+SV GE+L I ES F+++ +G GG+IIDSGTAVTRLQ AY +LRDAF KGT Sbjct: 324 FYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVTRLQTAAYNALRDAFVKGT 383 Query: 537 KDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHFSNGKEWSLPAKNYLIPVDSAGTFCLAF 358 KDLP T VALFDTCYDLS K SVEVPTV+FH + GK LPA NYLIPVDS GTFC AF Sbjct: 384 KDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHLAGGKVLPLPATNYLIPVDSDGTFCFAF 443 Query: 357 APTSSALSIIGNVQQQGTRVSYDLGHSLIGFTANKC 250 APTSSALSIIGNVQQQGTRV +DL +SL+GF +C Sbjct: 444 APTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479 >ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata] gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata] Length = 486 Score = 580 bits (1495), Expect = e-163 Identities = 296/483 (61%), Positives = 362/483 (74%), Gaps = 5/483 (1%) Frame = -1 Query: 1683 FFFSILLTVSFTPVLSRNLQLRD--TTSVLDVSASIHKTLSFDSQSINTLNQIEAXXXXX 1510 FFF LT S + V SR L TTS+L+V+ SIH+T S +N + Sbjct: 10 FFFVFFLT-SHSFVFSRILPKTSVTTTSILNVADSIHRTKYTSSFRLNQQEE----QTHS 64 Query: 1509 XXXXXXXXLHPRSSIHKTTHTDYKELTLARLGRDSVRVNSLQARLDLAIHGVLKSDLKPV 1330 LH R S+ T H+DYK LTLARL RD+ RV SL RLDLAI+ + K+DLKPV Sbjct: 65 RSSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPV 124 Query: 1329 ---YTELAAEELEVPVISGTSQGSGEYFTRLGIGHPPSQLYMVLDTGSDVNWLQCAPCAD 1159 YT E++E P+ISGT+QGSGEYFTR+GIG+P ++YMVLDTGSDVNWLQC PCAD Sbjct: 125 TTMYTTTEEEDIEAPLISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCAD 184 Query: 1158 CYQQTDPIFEPALSSSYSPLTCNTQQCKSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTET 979 CY QT+PIFEP+ SSSY PL+C+T QC +L+V +CRN TCLYEVSYGDGSYTVGDF TET Sbjct: 185 CYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATET 244 Query: 978 VTFGDSASVNDVAIGCGHSNEXXXXXXXXXXXXXXXXXXFPSQINATSFSYCLVDRDSDS 799 +T G S V +VA+GCGHSNE PSQ+N TSFSYCLVDRDSDS Sbjct: 245 LTIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDS 303 Query: 798 ASTLEFDSVIPPNAVTAPLVRNDKLNTYYYIGLTGISVAGEMLKISESTFQLNNNGEGGV 619 AST+EF + +PP+AV APL+RN +L+T+YY+GLTGISV GE+L+I +S+F+++ +G GG+ Sbjct: 304 ASTVEFGTSLPPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGI 363 Query: 618 IIDSGTAVTRLQNGAYYSLRDAFKKGTKDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHF 439 IIDSGTAVTRLQ G Y SLRD+F KGT DL GVA+FDTCY+LSAK ++EVPTV+FHF Sbjct: 364 IIDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVAFHF 423 Query: 438 SNGKEWSLPAKNYLIPVDSAGTFCLAFAPTSSALSIIGNVQQQGTRVSYDLGHSLIGFTA 259 GK +LPAKNY+IPVDS GTFCLAFAPT+S+L+IIGNVQQQGTRV++DL +SLIGF++ Sbjct: 424 PGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSS 483 Query: 258 NKC 250 NKC Sbjct: 484 NKC 486 >ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana] gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana] gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana] gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana] gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana] gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana] Length = 483 Score = 574 bits (1480), Expect = e-161 Identities = 293/483 (60%), Positives = 358/483 (74%), Gaps = 4/483 (0%) Frame = -1 Query: 1686 SFFFSILLTVSFTPVLSRNLQLRDTT--SVLDVSASIHKTLSFDSQSINTLNQIEAXXXX 1513 SFFF I S + V SR L TT S+L+V+ SIH+T S +N + Sbjct: 6 SFFFFIFFLTSHSSVFSRILPETSTTTTSILNVADSIHRTKYTSSFRLNQQEE----QTH 61 Query: 1512 XXXXXXXXXLHPRSSIHKTTHTDYKELTLARLGRDSVRVNSLQARLDLAIHGVLKSDLKP 1333 LH R S+ T H+DYK LTLARL RD+ RV SL RLDLAI+ + K+DLKP Sbjct: 62 SASSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKP 121 Query: 1332 VYTELAAEE--LEVPVISGTSQGSGEYFTRLGIGHPPSQLYMVLDTGSDVNWLQCAPCAD 1159 + T EE +E P+ISGT+QGSGEYFTR+GIG P ++YMVLDTGSDVNWLQC PCAD Sbjct: 122 ISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCAD 181 Query: 1158 CYQQTDPIFEPALSSSYSPLTCNTQQCKSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTET 979 CY QT+PIFEP+ SSSY PL+C+T QC +L+V +CRN TCLYEVSYGDGSYTVGDF TET Sbjct: 182 CYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATET 241 Query: 978 VTFGDSASVNDVAIGCGHSNEXXXXXXXXXXXXXXXXXXFPSQINATSFSYCLVDRDSDS 799 +T G S V +VA+GCGHSNE PSQ+N TSFSYCLVDRDSDS Sbjct: 242 LTIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDS 300 Query: 798 ASTLEFDSVIPPNAVTAPLVRNDKLNTYYYIGLTGISVAGEMLKISESTFQLNNNGEGGV 619 AST++F + + P+AV APL+RN +L+T+YY+GLTGISV GE+L+I +S+F+++ +G GG+ Sbjct: 301 ASTVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGI 360 Query: 618 IIDSGTAVTRLQNGAYYSLRDAFKKGTKDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHF 439 IIDSGTAVTRLQ Y SLRD+F KGT DL GVA+FDTCY+LSAK +VEVPTV+FHF Sbjct: 361 IIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHF 420 Query: 438 SNGKEWSLPAKNYLIPVDSAGTFCLAFAPTSSALSIIGNVQQQGTRVSYDLGHSLIGFTA 259 GK +LPAKNY+IPVDS GTFCLAFAPT+S+L+IIGNVQQQGTRV++DL +SLIGF++ Sbjct: 421 PGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSS 480 Query: 258 NKC 250 NKC Sbjct: 481 NKC 483