BLASTX nr result
ID: Angelica22_contig00009722
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica22_contig00009722 (1937 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2... 607 e-171 ref|XP_002315953.1| predicted protein [Populus trichocarpa] gi|2... 598 e-168 ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor,... 589 e-166 ref|XP_002890686.1| aspartyl protease family protein [Arabidopsi... 580 e-163 ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis tha... 576 e-162 >ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera] Length = 491 Score = 607 bits (1564), Expect = e-171 Identities = 307/481 (63%), Positives = 369/481 (76%), Gaps = 5/481 (1%) Frame = -2 Query: 1741 FSILLTVSFTPVLSRNLQLRDTTSVLDVSASIHKTLSFDS-----QSINTLNQIEAXXXX 1577 F+ T + V +R L L TT VLDVS SI ++L+ S + + +Q + Sbjct: 12 FAFFCTWGVSLVNARRLSLPRTT-VLDVSGSIRESLNVLSLNPQYEQMEFQHQERSFPSS 70 Query: 1576 XXXXXXXXXLHPRSSIHKTTHTDYKELTLARLGRDSVRVNSLQARLDLAIHGVLKSDLKP 1397 LH R+SIHK++H DYK L LARL RDS RV SL R+DLAI G+ KSDLKP Sbjct: 71 SSSSSLTLSLHSRTSIHKSSHKDYKSLVLARLERDSDRVRSLATRMDLAIAGITKSDLKP 130 Query: 1396 VYTELAAEELEVPVISGTSQGSGEYFTRLGIGHPPSQLYMVLDTGSDVNWLQCAPCADCY 1217 V EL AE LE P++SG SQGSGEYF+R+GIG PP +YMV+DTGSDVNW+QCAPCADCY Sbjct: 131 VEKELEAEALETPLVSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCY 190 Query: 1216 QQTDPIFEPALSSSYSPLTCNTQQCKSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTETVT 1037 QQ DPIFEP+ SSSY+PLTC T QCKSLDV +CRND+CLYEVSYGDGSYTVGDF TET+T Sbjct: 191 QQADPIFEPSFSSSYAPLTCETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETIT 250 Query: 1036 FGDSASVNDVAIGCGHSNEXXXXXXXXXXXXXXXXXXFPSQINATSFSYCLVDRDSDSAS 857 SAS+N+VAIGCGH NE FPSQINA+SFSYCLV+RD+DSAS Sbjct: 251 LDGSASLNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCLVNRDTDSAS 310 Query: 856 TLEFDSVIPPNAVTAPLVRNDKLNTYYYIGLTGISVAGEMLKISESTFQLNNNGEGGVII 677 TLEF+S IP ++VTAPL+RN++L+T+YY+G+TGI V G+ML I S+F+++ +G GG+I+ Sbjct: 311 TLEFNSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIV 370 Query: 676 DSGTAVTRLQNGAYYSLRDAFKKGTKDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHFSN 497 DSGTAVTRLQ+ Y SLRD+F +GT+ LPST GVALFDTCYDLS++ SVEVPTVSFHF + Sbjct: 371 DSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFPD 430 Query: 496 GKEWSLPAKNYLIPVDSAGTFCLAFAPTSSALSIIGNVQQQGTRVSYDLGHSLIGFTANK 317 GK +LPAKNYLIPVDSAGTFC AFAPT+SALSIIGNVQQQGTRVSYDL +SL+GF+ N Sbjct: 431 GKYLALPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFSPNG 490 Query: 316 C 314 C Sbjct: 491 C 491 >ref|XP_002315953.1| predicted protein [Populus trichocarpa] gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa] Length = 484 Score = 598 bits (1541), Expect = e-168 Identities = 302/458 (65%), Positives = 359/458 (78%), Gaps = 4/458 (0%) Frame = -2 Query: 1675 TSVLDVSASIHKTLSFDSQS--INTLNQIEAXXXXXXXXXXXXXLHPRSSIHKTTHTDYK 1502 T+VLDV+ASI +T + S ++ NQ E R+SI KTTHT YK Sbjct: 31 TTVLDVAASIQRTKNIFSSGPKMSPFNQQEKETTSSELTVELLS---RTSIQKTTHTGYK 87 Query: 1501 ELTLARLGRDSVRVNSLQARLDLAIHGVLKSDLKPVYT--ELAAEELEVPVISGTSQGSG 1328 LTL+RL RDS RV SL RLDLAI+ + SDLKP+ T E E+L+ P+ISGTSQGSG Sbjct: 88 SLTLSRLQRDSARVKSLVTRLDLAINSISSSDLKPLETDSEFKPEDLQSPIISGTSQGSG 147 Query: 1327 EYFTRLGIGHPPSQLYMVLDTGSDVNWLQCAPCADCYQQTDPIFEPALSSSYSPLTCNTQ 1148 EYF+R+GIG PPSQ Y++LDTGSDVNW+QCAPCADCYQQ DPIFEPA S+S+S L+CNT+ Sbjct: 148 EYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPASSASFSTLSCNTR 207 Query: 1147 QCKSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTETVTFGDSASVNDVAIGCGHSNEXXXX 968 QC+SLDV +CRNDTCLYEVSYGDGSYTVGDFVTET+T G SA V++VAIGCGH+NE Sbjct: 208 QCRSLDVSECRNDTCLYEVSYGDGSYTVGDFVTETITLG-SAPVDNVAIGCGHNNEGLFV 266 Query: 967 XXXXXXXXXXXXXXFPSQINATSFSYCLVDRDSDSASTLEFDSVIPPNAVTAPLVRNDKL 788 FPSQINATSFSYCLVDRDS+SASTLEF+S +PPNAV+APL+RN L Sbjct: 267 GAAGLLGLGGGSLSFPSQINATSFSYCLVDRDSESASTLEFNSTLPPNAVSAPLLRNHHL 326 Query: 787 NTYYYIGLTGISVAGEMLKISESTFQLNNNGEGGVIIDSGTAVTRLQNGAYYSLRDAFKK 608 +T+YY+GLTG+SV GE++ I ES FQ++ +G GGVI+DSGTA+TRLQ Y SLRDAF K Sbjct: 327 DTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDSGTAITRLQTDVYNSLRDAFVK 386 Query: 607 GTKDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHFSNGKEWSLPAKNYLIPVDSAGTFCL 428 T+DLPST+G+ALFDTCYDLS+K +VEVPTVSFHF +GKE LPAKNYL+P+DS GTFC Sbjct: 387 RTRDLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPAKNYLVPLDSEGTFCF 446 Query: 427 AFAPTSSALSIIGNVQQQGTRVSYDLGHSLIGFTANKC 314 AFAPT+S+LSIIGNVQQQGTRV YDL + L+GF NKC Sbjct: 447 AFAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484 >ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis] gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis] Length = 479 Score = 589 bits (1519), Expect = e-166 Identities = 296/456 (64%), Positives = 352/456 (77%), Gaps = 2/456 (0%) Frame = -2 Query: 1675 TSVLDVSASIHKTLSFDSQSINTLNQIEAXXXXXXXXXXXXXLHPRSSIHKTTHTDYKEL 1496 T++LDV ASI K + + S + LH R+S+ KT H DY+ L Sbjct: 25 TTLLDVEASIQKAEAIFTSSATKMTPFNQQEIVTSSSQLTMELHSRTSVQKTKHPDYRSL 84 Query: 1495 TLARLGRDSVRVNSLQARLDLAIHGVLKSDLKPVYTE--LAAEELEVPVISGTSQGSGEY 1322 TL+RL RDS RV S+ RLDLAIHG+ SDLKP+ T+ AE+L+ P+ISGTSQGSGEY Sbjct: 85 TLSRLERDSARVKSINTRLDLAIHGLSTSDLKPLDTDSQFRAEDLQGPIISGTSQGSGEY 144 Query: 1321 FTRLGIGHPPSQLYMVLDTGSDVNWLQCAPCADCYQQTDPIFEPALSSSYSPLTCNTQQC 1142 F+R+GIG P S +YMVLDTGSDVNW+QCAPCADCY Q DPIFEPA S+SYSPL+C+T+QC Sbjct: 145 FSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPASSTSYSPLSCDTKQC 204 Query: 1141 KSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTETVTFGDSASVNDVAIGCGHSNEXXXXXX 962 +SLDV +CRN+TCLYEVSYGDGSYTVGDFVTET+T G SASV++VAIGCGH+NE Sbjct: 205 QSLDVSECRNNTCLYEVSYGDGSYTVGDFVTETITLG-SASVDNVAIGCGHNNEGLFIGA 263 Query: 961 XXXXXXXXXXXXFPSQINATSFSYCLVDRDSDSASTLEFDSVIPPNAVTAPLVRNDKLNT 782 FPSQINA+SFSYCLVDRDSDSASTLEF+S + P+A+TAPL+RN +L+T Sbjct: 264 AGLLGLGGGKLSFPSQINASSFSYCLVDRDSDSASTLEFNSALLPHAITAPLLRNRELDT 323 Query: 781 YYYIGLTGISVAGEMLKISESTFQLNNNGEGGVIIDSGTAVTRLQNGAYYSLRDAFKKGT 602 +YY+G+TG+SV GE+L I ES F+++ +G GG+IIDSGTAVTRLQ AY +LRDAF KGT Sbjct: 324 FYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVTRLQTAAYNALRDAFVKGT 383 Query: 601 KDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHFSNGKEWSLPAKNYLIPVDSAGTFCLAF 422 KDLP T VALFDTCYDLS K SVEVPTV+FH + GK LPA NYLIPVDS GTFC AF Sbjct: 384 KDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHLAGGKVLPLPATNYLIPVDSDGTFCFAF 443 Query: 421 APTSSALSIIGNVQQQGTRVSYDLGHSLIGFTANKC 314 APTSSALSIIGNVQQQGTRV +DL +SL+GF +C Sbjct: 444 APTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479 >ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata] gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata] Length = 486 Score = 580 bits (1495), Expect = e-163 Identities = 296/483 (61%), Positives = 362/483 (74%), Gaps = 5/483 (1%) Frame = -2 Query: 1747 FFFSILLTVSFTPVLSRNLQLRD--TTSVLDVSASIHKTLSFDSQSINTLNQIEAXXXXX 1574 FFF LT S + V SR L TTS+L+V+ SIH+T S +N + Sbjct: 10 FFFVFFLT-SHSFVFSRILPKTSVTTTSILNVADSIHRTKYTSSFRLNQQEE----QTHS 64 Query: 1573 XXXXXXXXLHPRSSIHKTTHTDYKELTLARLGRDSVRVNSLQARLDLAIHGVLKSDLKPV 1394 LH R S+ T H+DYK LTLARL RD+ RV SL RLDLAI+ + K+DLKPV Sbjct: 65 RSSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPV 124 Query: 1393 ---YTELAAEELEVPVISGTSQGSGEYFTRLGIGHPPSQLYMVLDTGSDVNWLQCAPCAD 1223 YT E++E P+ISGT+QGSGEYFTR+GIG+P ++YMVLDTGSDVNWLQC PCAD Sbjct: 125 TTMYTTTEEEDIEAPLISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCAD 184 Query: 1222 CYQQTDPIFEPALSSSYSPLTCNTQQCKSLDVFQCRNDTCLYEVSYGDGSYTVGDFVTET 1043 CY QT+PIFEP+ SSSY PL+C+T QC +L+V +CRN TCLYEVSYGDGSYTVGDF TET Sbjct: 185 CYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATET 244 Query: 1042 VTFGDSASVNDVAIGCGHSNEXXXXXXXXXXXXXXXXXXFPSQINATSFSYCLVDRDSDS 863 +T G S V +VA+GCGHSNE PSQ+N TSFSYCLVDRDSDS Sbjct: 245 LTIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDS 303 Query: 862 ASTLEFDSVIPPNAVTAPLVRNDKLNTYYYIGLTGISVAGEMLKISESTFQLNNNGEGGV 683 AST+EF + +PP+AV APL+RN +L+T+YY+GLTGISV GE+L+I +S+F+++ +G GG+ Sbjct: 304 ASTVEFGTSLPPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGI 363 Query: 682 IIDSGTAVTRLQNGAYYSLRDAFKKGTKDLPSTDGVALFDTCYDLSAKKSVEVPTVSFHF 503 IIDSGTAVTRLQ G Y SLRD+F KGT DL GVA+FDTCY+LSAK ++EVPTV+FHF Sbjct: 364 IIDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVAFHF 423 Query: 502 SNGKEWSLPAKNYLIPVDSAGTFCLAFAPTSSALSIIGNVQQQGTRVSYDLGHSLIGFTA 323 GK +LPAKNY+IPVDS GTFCLAFAPT+S+L+IIGNVQQQGTRV++DL +SLIGF++ Sbjct: 424 PGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSS 483 Query: 322 NKC 314 NKC Sbjct: 484 NKC 486 >ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana] gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana] gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana] gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana] gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana] gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana] Length = 483 Score = 576 bits (1484), Expect = e-162 Identities = 294/488 (60%), Positives = 360/488 (73%), Gaps = 4/488 (0%) Frame = -2 Query: 1765 MAQRVSFFFSILLTVSFTPVLSRNLQLRDTT--SVLDVSASIHKTLSFDSQSINTLNQIE 1592 M+ SFFF I S + V SR L TT S+L+V+ SIH+T S +N + Sbjct: 1 MSPNYSFFFFIFFLTSHSSVFSRILPETSTTTTSILNVADSIHRTKYTSSFRLNQQEE-- 58 Query: 1591 AXXXXXXXXXXXXXLHPRSSIHKTTHTDYKELTLARLGRDSVRVNSLQARLDLAIHGVLK 1412 LH R S+ T H+DYK LTLARL RD+ RV SL RLDLAI+ + K Sbjct: 59 --QTHSASSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISK 116 Query: 1411 SDLKPVYTELAAEE--LEVPVISGTSQGSGEYFTRLGIGHPPSQLYMVLDTGSDVNWLQC 1238 +DLKP+ T EE +E P+ISGT+QGSGEYFTR+GIG P ++YMVLDTGSDVNWLQC Sbjct: 117 ADLKPISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQC 176 Query: 1237 APCADCYQQTDPIFEPALSSSYSPLTCNTQQCKSLDVFQCRNDTCLYEVSYGDGSYTVGD 1058 PCADCY QT+PIFEP+ SSSY PL+C+T QC +L+V +CRN TCLYEVSYGDGSYTVGD Sbjct: 177 TPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGD 236 Query: 1057 FVTETVTFGDSASVNDVAIGCGHSNEXXXXXXXXXXXXXXXXXXFPSQINATSFSYCLVD 878 F TET+T G S V +VA+GCGHSNE PSQ+N TSFSYCLVD Sbjct: 237 FATETLTIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVD 295 Query: 877 RDSDSASTLEFDSVIPPNAVTAPLVRNDKLNTYYYIGLTGISVAGEMLKISESTFQLNNN 698 RDSDSAST++F + + P+AV APL+RN +L+T+YY+GLTGISV GE+L+I +S+F+++ + Sbjct: 296 RDSDSASTVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDES 355 Query: 697 GEGGVIIDSGTAVTRLQNGAYYSLRDAFKKGTKDLPSTDGVALFDTCYDLSAKKSVEVPT 518 G GG+IIDSGTAVTRLQ Y SLRD+F KGT DL GVA+FDTCY+LSAK +VEVPT Sbjct: 356 GSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPT 415 Query: 517 VSFHFSNGKEWSLPAKNYLIPVDSAGTFCLAFAPTSSALSIIGNVQQQGTRVSYDLGHSL 338 V+FHF GK +LPAKNY+IPVDS GTFCLAFAPT+S+L+IIGNVQQQGTRV++DL +SL Sbjct: 416 VAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSL 475 Query: 337 IGFTANKC 314 IGF++NKC Sbjct: 476 IGFSSNKC 483