BLASTX nr result
ID: Angelica23_contig00014038
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica23_contig00014038 (1660 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] 596 e-168 ref|XP_002510459.1| cysteine protease, putative [Ricinus communi... 576 e-162 ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|2... 572 e-160 ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C... 567 e-159 ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab... 566 e-159 >dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] Length = 441 Score = 596 bits (1536), Expect = e-168 Identities = 282/440 (64%), Positives = 343/440 (77%), Gaps = 6/440 (1%) Frame = +1 Query: 223 LFVLLLHTPCCIYSSTTYD---LFQTWCASYGKTYNSDQEKLSRFKIFEENYAYITQHNN 393 LFV L + ++SS++ + LF+TWC +GKTY S +EKL R K+F++NY ++T+HN+ Sbjct: 7 LFVAFLLSYLFLFSSSSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNS 66 Query: 394 KLAANSSLSYTLSLNAFADLSHQEFKASRLGLSAGGTNKLIRMNRGSSIIKGSSGVRNIP 573 + + SYTLSLNAFADL+H EFKASRLGLS+ + L ++R + I V ++P Sbjct: 67 Q----GNSSYTLSLNAFADLTHHEFKASRLGLSSAASASL-NVDRSNRQIPDF--VADVP 119 Query: 574 SSLDWRDKGAVTNVKDQGSCGACWSFSATGAIEGINQIVTGSLTSLSEQELVDCDRSYNS 753 +S+DWR GAVT VKDQG+CGACWSFSATGAIEGIN+IVTGSL SLSEQELVDCD+SYN+ Sbjct: 120 ASVDWRKNGAVTQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNN 179 Query: 754 GCEGGLMDYAYQFVIDNNGIDTEDDYPYQSRETKCNKNKLNRHVVTIDGYVDVRENDEER 933 GCEGG+MDYA+QFVIDN+GIDTE+DYPYQ R+ CNK KL RHVVTIDGYVDV +N+E+ Sbjct: 180 GCEGGIMDYAFQFVIDNHGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKE 239 Query: 934 LLEAVAAQPVSVGICGSERNFQLYSKGIFSGPCSTSLDHAVLIVGYGTENGVDYWILKNS 1113 LL+AVA QPVSVGICGSER FQLYSKGIF+GPCSTSLDHAVLIVGYG+ENGVDYWI+KNS Sbjct: 240 LLKAVANQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNS 299 Query: 1114 WGTQWGMNGYMHMQRNNGNSQGICGINMMXXXXXXXXXXXXXXXXXXXXKCSLLFSCSEG 1293 WG+ WGM+GYMHMQRN+G+S+G+CGINM+ +C L C EG Sbjct: 300 WGSYWGMDGYMHMQRNSGSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEG 359 Query: 1294 ETCCCSWSLFGLCLSWKCCELDSAVCCDDHRHCCPQDYPICDTKRNLCLKKTGNYTLVKE 1473 ETCCC +FG+CLSWKCCELDSAVCC D RHCCP+DYP+CDT RN+CLK GN T +++ Sbjct: 360 ETCCCVHHIFGICLSWKCCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEK 419 Query: 1474 FGNKRSSG---NWSSLLRDW 1524 F SSG +WSSLL W Sbjct: 420 FAKNSSSGKFRSWSSLLEGW 439 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 576 bits (1485), Expect = e-162 Identities = 280/419 (66%), Positives = 322/419 (76%), Gaps = 4/419 (0%) Frame = +1 Query: 199 MNQLRSWLLFVLLLHTPCCIYSSTTYD---LFQTWCASYGKTYNSDQEKLSRFKIFEENY 369 MN L + L LL S++ D LF++W +GKTY S ++KL RFKIFEENY Sbjct: 1 MNFLSALFLITLLFFNLSISSFSSSSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENY 60 Query: 370 AYITQHNNKLAANSSLSYTLSLNAFADLSHQEFKASRLGLSAGGTN-KLIRMNRGSSIIK 546 ++ +HN++ + SYTLSLNAFADL+H EFKASRLGLSA T+ KL R N Sbjct: 61 EFVKKHNSQ----GNSSYTLSLNAFADLTHHEFKASRLGLSAFSTSGKLSRRNFPLHDFV 116 Query: 547 GSSGVRNIPSSLDWRDKGAVTNVKDQGSCGACWSFSATGAIEGINQIVTGSLTSLSEQEL 726 G ++P S+DWR KGAV+ VKDQG+CGACWSFSATGAIEGIN+IVTGSL SLSEQEL Sbjct: 117 G-----DVPISIDWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQEL 171 Query: 727 VDCDRSYNSGCEGGLMDYAYQFVIDNNGIDTEDDYPYQSRETKCNKNKLNRHVVTIDGYV 906 VDCDRSYN+GCEGGLMDYAYQFVI+NNGIDTE+DYPYQ+RE CNK KL RHVVTIDGY Sbjct: 172 VDCDRSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYT 231 Query: 907 DVRENDEERLLEAVAAQPVSVGICGSERNFQLYSKGIFSGPCSTSLDHAVLIVGYGTENG 1086 DV +N+E+ LL+AVAAQPVSVGICGSER FQLYSKGIF+GPCSTSLDHAVLIVGYG+ENG Sbjct: 232 DVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENG 291 Query: 1087 VDYWILKNSWGTQWGMNGYMHMQRNNGNSQGICGINMMXXXXXXXXXXXXXXXXXXXXKC 1266 VDYWI+KNSWGT WG+NGYM+M RN+GNSQG+CGINM+ KC Sbjct: 292 VDYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKC 351 Query: 1267 SLLFSCSEGETCCCSWSLFGLCLSWKCCELDSAVCCDDHRHCCPQDYPICDTKRNLCLK 1443 L C EGETCCC+ +FGLC SWKCCELDSAVCC D HCCP DYP+CDTKRN+CLK Sbjct: 352 DLFTRCGEGETCCCTRRIFGLCFSWKCCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410 >ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa] Length = 436 Score = 572 bits (1474), Expect = e-160 Identities = 271/444 (61%), Positives = 332/444 (74%), Gaps = 2/444 (0%) Frame = +1 Query: 199 MNQLRSWLLFVLLLHTPCCIYSSTTYDLFQTWCASYGKTYNSDQEKLSRFKIFEENYAYI 378 MN L + L +L+ SS LF+TWC +GK+Y S +E+ R K+FE+NY ++ Sbjct: 1 MNFLYIFALTLLISVLSPSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFV 60 Query: 379 TQHNNKLAANSSLSYTLSLNAFADLSHQEFKASRLGLSAGGTNKLIRMNRGSSIIKGSSG 558 T+HN+K + SY+L+LNAFADL+H EFK SRLGLSA N R + + Sbjct: 61 TKHNSK----GNSSYSLALNAFADLTHHEFKTSRLGLSAAPLNLAHRNLEITGV------ 110 Query: 559 VRNIPSSLDWRDKGAVTNVKDQGSCGACWSFSATGAIEGINQIVTGSLTSLSEQELVDCD 738 V +IP+S+DWR+KG VTNVKDQGSCGACWSFSATGAIEGIN+IVTGSL SLSEQEL++CD Sbjct: 111 VGDIPASIDWRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECD 170 Query: 739 RSYNSGCEGGLMDYAYQFVIDNNGIDTEDDYPYQSRETKCNKNKLNRHVVTIDGYVDVRE 918 +SYN GC GGLMDYA+QFVI+N+GIDTE+DYPY++R+ CNK+++ R VVTID YVDV E Sbjct: 171 KSYNDGCGGGLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPE 230 Query: 919 NDEERLLEAVAAQPVSVGICGSERNFQLYSKGIFSGPCSTSLDHAVLIVGYGTENGVDYW 1098 N+E++LL+AVAAQPVSVGICGSER FQ+YSKGIF+GPCSTSLDHAVLIVGYG+ENGVDYW Sbjct: 231 NNEKQLLQAVAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYW 290 Query: 1099 ILKNSWGTQWGMNGYMHMQRNNGNSQGICGINMMXXXXXXXXXXXXXXXXXXXXKCSLLF 1278 I+KNSWGT WGM GYMHMQRN+GNSQG+CGINM+ KC+LL Sbjct: 291 IVKNSWGTGWGMRGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLT 350 Query: 1279 SCSEGETCCCSWSLFGLCLSWKCCELDSAVCCDDHRHCCPQDYPICDTKRNLCLKKTGNY 1458 C+ GETCCC+ FG+C+SWKCC LDSAVCC D HCCP DYP+CDT +N+C K+ GN Sbjct: 351 YCAAGETCCCARKFFGICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNA 410 Query: 1459 TLVKEFGNKRSS--GNWSSLLRDW 1524 T ++ K S G+W SL W Sbjct: 411 TRMEAIEGKTSGKFGSWISLPEAW 434 >ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 431 Score = 567 bits (1462), Expect = e-159 Identities = 276/434 (63%), Positives = 324/434 (74%), Gaps = 3/434 (0%) Frame = +1 Query: 217 WLLFVLLLHTPCCIYSSTTYDLFQTWCASYGKTYNSDQEKLSRFKIFEENYAYITQHNNK 396 +L LLL P S+ + +LF+ WC +GK+Y+S +EKL R +F +NY ++T HNN Sbjct: 8 FLTLFLLLFRPLSATSNVS-ELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNL 66 Query: 397 LAANSSLSYTLSLNAFADLSHQEFKASRLGLSAGGTNKLIRMNRGSSIIKGSSGVRNIPS 576 + SYTLSLN++ADL+H EFK SRLG S N + + S+ R++P Sbjct: 67 ----DNSSYTLSLNSYADLTHHEFKVSRLGFSPALRNFRPVLPQEPSL------PRDVPD 116 Query: 577 SLDWRDKGAVTNVKDQGSCGACWSFSATGAIEGINQIVTGSLTSLSEQELVDCDRSYNSG 756 SLDWR KGAVT VKDQGSCGACWSFSATGA+EGINQI+TGSL SLSEQEL+DCDRSYNSG Sbjct: 117 SLDWRKKGAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSG 176 Query: 757 CEGGLMDYAYQFVIDNNGIDTEDDYPYQSRETKCNKNKLNRHVVTIDGYVDVRENDEERL 936 C GGLMDYAYQFVI N+GIDTE+DYPYQ+R+ C K+KL R+VVTIDGY D+ NDE +L Sbjct: 177 CGGGLMDYAYQFVISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKL 236 Query: 937 LEAVAAQPVSVGICGSERNFQLYSKGIFSGPCSTSLDHAVLIVGYGTENGVDYWILKNSW 1116 L+AVAAQPVSVGICGSER FQLYSKGIFSGPCSTSLDHAVLIVGYG+ENGVDYWI+KNSW Sbjct: 237 LQAVAAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSW 296 Query: 1117 GTQWGMNGYMHMQRNNGNSQGICGINMMXXXXXXXXXXXXXXXXXXXXKCSLLFSCSEGE 1296 G WGM+GYMHMQRN+GNS+G+CGIN + KCS+L SC+ GE Sbjct: 297 GKSWGMDGYMHMQRNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGE 356 Query: 1297 TCCCSWSLFGLCLSWKCCELDSAVCCDDHRHCCPQDYPICDTKRNLCLKKTGNYTLVKEF 1476 TCCC+ GLCLSWKCC L SAVCC D RHCCP DYPICDT RNLCLK+T N T + Sbjct: 357 TCCCAKKFLGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEIL 416 Query: 1477 GNKRSSGN---WSS 1509 N+ SSG+ WSS Sbjct: 417 ENRSSSGSSGTWSS 430 >ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] Length = 439 Score = 567 bits (1460), Expect = e-159 Identities = 271/425 (63%), Positives = 325/425 (76%), Gaps = 2/425 (0%) Frame = +1 Query: 226 FVLLLHTPCCIYSSTTYDLFQTWCASYGKTYNSDQEKLSRFKIFEENYAYITQHNNKLAA 405 F+LL+ +P S +LF WC +GKTY S++E+ R +IF++N+ ++TQHN L Sbjct: 15 FLLLVSSPSS--SDDISELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHN--LIT 70 Query: 406 NSSLSYTLSLNAFADLSHQEFKASRLGLSAGGTNKLIRMNRGSSIIKGSSGVRNIPSSLD 585 N++ Y+LSLNAFADL+H EFKASRLGLS ++ LI ++G S+ G +P S+D Sbjct: 71 NAT--YSLSLNAFADLTHHEFKASRLGLSVSASS-LIMASKGQSL----GGNAKVPDSVD 123 Query: 586 WRDKGAVTNVKDQGSCGACWSFSATGAIEGINQIVTGSLTSLSEQELVDCDRSYNSGCEG 765 WR KGAVTNVKDQGSCGACWSFSATGA+EGINQIVTG L SLSEQEL+DCD+SYN+GC G Sbjct: 124 WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNG 183 Query: 766 GLMDYAYQFVIDNNGIDTEDDYPYQSRETKCNKNKLNRHVVTIDGYVDVRENDEERLLEA 945 GLMDYA++FVI N+GIDTE DYPYQ R+ C K+KL + VVTID Y V+ NDE+ L EA Sbjct: 184 GLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREA 243 Query: 946 VAAQPVSVGICGSERNFQLYSK--GIFSGPCSTSLDHAVLIVGYGTENGVDYWILKNSWG 1119 VAAQPVSVGICGSER FQLYS+ GIFSGPCSTSLDHAVLIVGYG++NGVDYWI+KNSWG Sbjct: 244 VAAQPVSVGICGSERAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWG 303 Query: 1120 TQWGMNGYMHMQRNNGNSQGICGINMMXXXXXXXXXXXXXXXXXXXXKCSLLFSCSEGET 1299 WGM+G+MHMQRN GNS+GICGINM+ KC+L CS GET Sbjct: 304 KSWGMDGFMHMQRNTGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGET 363 Query: 1300 CCCSWSLFGLCLSWKCCELDSAVCCDDHRHCCPQDYPICDTKRNLCLKKTGNYTLVKEFG 1479 CCC+ +LFGLC SWKCCE++SAVCC D RHCCP DYP+CDT R+LCLKKTGN+T +K F Sbjct: 364 CCCARNLFGLCFSWKCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFW 423 Query: 1480 NKRSS 1494 K SS Sbjct: 424 KKDSS 428