BLASTX nr result
ID: Angelica22_contig00022046
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica22_contig00022046 (1526 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] 612 e-173 ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|2... 602 e-170 ref|XP_002510459.1| cysteine protease, putative [Ricinus communi... 581 e-163 ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C... 579 e-163 ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine... 571 e-160 >dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] Length = 441 Score = 612 bits (1578), Expect = e-173 Identities = 293/454 (64%), Positives = 349/454 (76%), Gaps = 4/454 (0%) Frame = -3 Query: 1518 NTHLSLTMTWLWSFLVPTLLLCIPCIYSSTTSD---LFEAWCITHGKTYSSQQEKLHRFK 1348 N++ +L + +L S+L ++SS++S+ LFE WC HGKTY+SQ+EKL R K Sbjct: 2 NSNCALFVAFLLSYLF---------LFSSSSSEIAHLFETWCQQHGKTYASQEEKLFRLK 52 Query: 1347 IFEENYMYVTKHNMNNDMKANSSLSYTLSIDNAFADLTHQEFKASRLGLSSNGIIRMNLG 1168 +F++NY +VT+HN + + SYTLS+ NAFADLTH EFKASRLGLSS +N+ Sbjct: 53 VFQDNYDFVTEHN------SQGNSSYTLSL-NAFADLTHHEFKASRLGLSSAASASLNVD 105 Query: 1167 GSSKDSDD-VTSVPASLDWRDKGAVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLTSL 991 S++ D V VPAS+DWR GAVT VKDQG+CGACWSFSATGAIEGINKIVTGSL SL Sbjct: 106 RSNRQIPDFVADVPASVDWRKNGAVTQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSL 165 Query: 990 SEQELVDCDRSYNDGCEGGLMDYAYQFVVKNKGIDTEDDYPYQSRDTTCNKNKMNRHVVT 811 SEQELVDCD+SYN+GCEGG+MDYA+QFV+ N GIDTE+DYPYQ RD +CNK K+ RHVVT Sbjct: 166 SEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEEDYPYQGRDRSCNKEKLKRHVVT 225 Query: 810 IDGYIDVRENDEKQLLAAVAAQPVSVGICGSERNFQLYSKGIFNGPCSTLLNHAVLIVGY 631 IDGY+DV +N+EK+LL AVA QPVSVGICGSER FQLYSKGIF GPCST L+HAVLIVGY Sbjct: 226 IDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGY 285 Query: 630 GSENGVDYWIVKNSWGKEWGMNGYMHMQRNSGNAQGICGINMMASYXXXXXXXXXXXXXX 451 GSENGVDYWIVKNSWG WGM+GYMHMQRNSG+++G+CGINM+ASY Sbjct: 286 GSENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRGLCGINMLASYPKKTSPNPPPPAPP 345 Query: 450 XXTKCSLLTSCSEGETCCCARSLFGICLSWKCCELNSAVCCDDHRHCCPQDYPICDTKRN 271 T+C L T C EGETCCC +FGICLSWKCCEL+SAVCC D RHCCP+DYP+CDT RN Sbjct: 346 GPTRCDLFTHCGEGETCCCVHHIFGICLSWKCCELDSAVCCKDGRHCCPRDYPVCDTTRN 405 Query: 270 MCLKQTGNYTLVKEFKNKKSFGKLGGWTSLLGEW 169 +CLK GN T +++F S GK W+SLL W Sbjct: 406 ICLKHYGNATRIEKFAKNSSSGKFRSWSSLLEGW 439 >ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa] Length = 436 Score = 602 bits (1553), Expect = e-170 Identities = 294/446 (65%), Positives = 340/446 (76%), Gaps = 3/446 (0%) Frame = -3 Query: 1497 MTWLWSFLVPTLLLCI--PCIYSSTTSDLFEAWCITHGKTYSSQQEKLHRFKIFEENYMY 1324 M +L+ F + TLL+ + P SS S LFE WC HGK+Y+SQ+E+ HR K+FE+NY + Sbjct: 1 MNFLYIFAL-TLLISVLSPSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDF 59 Query: 1323 VTKHNMNNDMKANSSLSYTLSIDNAFADLTHQEFKASRLGLSSNGIIRMNLGGSSKDSDD 1144 VTKHN K NSS S L NAFADLTH EFK SRLGLS+ +NL + + Sbjct: 60 VTKHNS----KGNSSYSLAL---NAFADLTHHEFKTSRLGLSA---APLNLAHRNLEITG 109 Query: 1143 VTS-VPASLDWRDKGAVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLTSLSEQELVDC 967 V +PAS+DWR+KG VTNVKDQGSCGACWSFSATGAIEGINKIVTGSL SLSEQEL++C Sbjct: 110 VVGDIPASIDWRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIEC 169 Query: 966 DRSYNDGCEGGLMDYAYQFVVKNKGIDTEDDYPYQSRDTTCNKNKMNRHVVTIDGYIDVR 787 D+SYNDGC GGLMDYA+QFV+ N GIDTE+DYPY++RD TCNK++M R VVTID Y+DV Sbjct: 170 DKSYNDGCGGGLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVP 229 Query: 786 ENDEKQLLAAVAAQPVSVGICGSERNFQLYSKGIFNGPCSTLLNHAVLIVGYGSENGVDY 607 EN+EKQLL AVAAQPVSVGICGSER FQ+YSKGIF GPCST L+HAVLIVGYGSENGVDY Sbjct: 230 ENNEKQLLQAVAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDY 289 Query: 606 WIVKNSWGKEWGMNGYMHMQRNSGNAQGICGINMMASYXXXXXXXXXXXXXXXXTKCSLL 427 WIVKNSWG WGM GYMHMQRNSGN+QG+CGINM+ASY TKC+LL Sbjct: 290 WIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLL 349 Query: 426 TSCSEGETCCCARSLFGICLSWKCCELNSAVCCDDHRHCCPQDYPICDTKRNMCLKQTGN 247 T C+ GETCCCAR FGIC+SWKCC L+SAVCC D HCCP DYP+CDT +NMC K+ GN Sbjct: 350 TYCAAGETCCCARKFFGICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGN 409 Query: 246 YTLVKEFKNKKSFGKLGGWTSLLGEW 169 T ++ + K S GK G W SL W Sbjct: 410 ATRMEAIEGKTS-GKFGSWISLPEAW 434 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 581 bits (1497), Expect = e-163 Identities = 277/393 (70%), Positives = 313/393 (79%) Frame = -3 Query: 1437 SSTTSDLFEAWCITHGKTYSSQQEKLHRFKIFEENYMYVTKHNMNNDMKANSSLSYTLSI 1258 SS S LFE+W HGKTY+S+++KL+RFKIFEENY +V KHN + + SYTLS+ Sbjct: 25 SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHN------SQGNSSYTLSL 78 Query: 1257 DNAFADLTHQEFKASRLGLSSNGIIRMNLGGSSKDSDDVTSVPASLDWRDKGAVTNVKDQ 1078 NAFADLTH EFKASRLGLS+ + D V VP S+DWR KGAV+ VKDQ Sbjct: 79 -NAFADLTHHEFKASRLGLSAFSTSGKLSRRNFPLHDFVGDVPISIDWRKKGAVSQVKDQ 137 Query: 1077 GSCGACWSFSATGAIEGINKIVTGSLTSLSEQELVDCDRSYNDGCEGGLMDYAYQFVVKN 898 G+CGACWSFSATGAIEGINKIVTGSL SLSEQELVDCDRSYN+GCEGGLMDYAYQFV++N Sbjct: 138 GNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIEN 197 Query: 897 KGIDTEDDYPYQSRDTTCNKNKMNRHVVTIDGYIDVRENDEKQLLAAVAAQPVSVGICGS 718 GIDTE+DYPYQ+R+ TCNK K+ RHVVTIDGY DV +N+EK+LL AVAAQPVSVGICGS Sbjct: 198 NGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGS 257 Query: 717 ERNFQLYSKGIFNGPCSTLLNHAVLIVGYGSENGVDYWIVKNSWGKEWGMNGYMHMQRNS 538 ER FQLYSKGIF GPCST L+HAVLIVGYGSENGVDYWIVKNSWG WG+NGYM+M RNS Sbjct: 258 ERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNS 317 Query: 537 GNAQGICGINMMASYXXXXXXXXXXXXXXXXTKCSLLTSCSEGETCCCARSLFGICLSWK 358 GN+QG+CGINM+AS+ TKC L T C EGETCCC R +FG+C SWK Sbjct: 318 GNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWK 377 Query: 357 CCELNSAVCCDDHRHCCPQDYPICDTKRNMCLK 259 CCEL+SAVCC D HCCP DYP+CDTKRNMCLK Sbjct: 378 CCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410 >ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 431 Score = 579 bits (1493), Expect = e-163 Identities = 283/436 (64%), Positives = 330/436 (75%) Frame = -3 Query: 1491 WLWSFLVPTLLLCIPCIYSSTTSDLFEAWCITHGKTYSSQQEKLHRFKIFEENYMYVTKH 1312 + + FL LLL P +S S+LFE WC HGK+YSS +EKL+R +F +NY +VT H Sbjct: 4 YAFHFLTLFLLLFRPLSATSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHH 63 Query: 1311 NMNNDMKANSSLSYTLSIDNAFADLTHQEFKASRLGLSSNGIIRMNLGGSSKDSDDVTSV 1132 N N D + SYTLS+ N++ADLTH EFK SRLG S +R ++ V Sbjct: 64 N-NLD-----NSSYTLSL-NSYADLTHHEFKVSRLGFSP--ALRNFRPVLPQEPSLPRDV 114 Query: 1131 PASLDWRDKGAVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLTSLSEQELVDCDRSYN 952 P SLDWR KGAVT VKDQGSCGACWSFSATGA+EGIN+I+TGSL SLSEQEL+DCDRSYN Sbjct: 115 PDSLDWRKKGAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYN 174 Query: 951 DGCEGGLMDYAYQFVVKNKGIDTEDDYPYQSRDTTCNKNKMNRHVVTIDGYIDVRENDEK 772 GC GGLMDYAYQFV+ N GIDTE+DYPYQ+RD +C K+K+ R+VVTIDGY D+ NDE Sbjct: 175 SGCGGGLMDYAYQFVISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEG 234 Query: 771 QLLAAVAAQPVSVGICGSERNFQLYSKGIFNGPCSTLLNHAVLIVGYGSENGVDYWIVKN 592 +LL AVAAQPVSVGICGSER FQLYSKGIF+GPCST L+HAVLIVGYGSENGVDYWIVKN Sbjct: 235 KLLQAVAAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKN 294 Query: 591 SWGKEWGMNGYMHMQRNSGNAQGICGINMMASYXXXXXXXXXXXXXXXXTKCSLLTSCSE 412 SWGK WGM+GYMHMQRNSGN++G+CGIN +ASY TKCS+LTSC+ Sbjct: 295 SWGKSWGMDGYMHMQRNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAA 354 Query: 411 GETCCCARSLFGICLSWKCCELNSAVCCDDHRHCCPQDYPICDTKRNMCLKQTGNYTLVK 232 GETCCCA+ G+CLSWKCC L+SAVCC D RHCCP DYPICDT RN+CLKQT N T + Sbjct: 355 GETCCCAKKFLGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTE 414 Query: 231 EFKNKKSFGKLGGWTS 184 +N+ S G G W+S Sbjct: 415 ILENRSSSGSSGTWSS 430 >ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max] Length = 439 Score = 571 bits (1472), Expect = e-160 Identities = 275/435 (63%), Positives = 326/435 (74%) Frame = -3 Query: 1488 LWSFLVPTLLLCIPCIYSSTTSDLFEAWCITHGKTYSSQQEKLHRFKIFEENYMYVTKHN 1309 L FL LL + + +S TS+LFE WC H KTYSS++EKL+R K+FE+NY +V +HN Sbjct: 9 LLQFLSLILLFTLFFLSASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHN 68 Query: 1308 MNNDMKANSSLSYTLSIDNAFADLTHQEFKASRLGLSSNGIIRMNLGGSSKDSDDVTSVP 1129 N + N+S SYTLS+ NAFADLTH EFK +RLGL ++R ++ S D+ +P Sbjct: 69 QNANNNNNNS-SYTLSL-NAFADLTHHEFKTTRLGLPLT-LLRFKRP-QNQQSRDLLHIP 124 Query: 1128 ASLDWRDKGAVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLTSLSEQELVDCDRSYND 949 + +DWR GAVT VKDQ SCGACW+FSATGAIEGINKIVTGSL SLSEQEL+DCD SYN Sbjct: 125 SQIDWRQSGAVTPVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNS 184 Query: 948 GCEGGLMDYAYQFVVKNKGIDTEDDYPYQSRDTTCNKNKMNRHVVTIDGYIDVRENDEKQ 769 GC GGLMD+AYQFV+ NKGIDTEDDYPYQ+R +C+K+K+ R VTI+ Y+DV ++E + Sbjct: 185 GCGGGLMDFAYQFVIDNKGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEE-E 243 Query: 768 LLAAVAAQPVSVGICGSERNFQLYSKGIFNGPCSTLLNHAVLIVGYGSENGVDYWIVKNS 589 +L AVA+QPVSVGICGSER FQLYSKGIF GPCST L+HAVLIVGYGSENGVDYWIVKNS Sbjct: 244 ILKAVASQPVSVGICGSEREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNS 303 Query: 588 WGKEWGMNGYMHMQRNSGNAQGICGINMMASYXXXXXXXXXXXXXXXXTKCSLLTSCSEG 409 WGK WGMNGY+HM RNSGN++GICGIN +ASY +C+L T CSEG Sbjct: 304 WGKYWGMNGYIHMIRNSGNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEG 363 Query: 408 ETCCCARSLFGICLSWKCCELNSAVCCDDHRHCCPQDYPICDTKRNMCLKQTGNYTLVKE 229 ETCCCA+S GIC SWKCC L SAVCC D RHCCPQDYPICDT+R CLK+T N T Sbjct: 364 ETCCCAKSFLGICFSWKCCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTIT 423 Query: 228 FKNKKSFGKLGGWTS 184 +N+ K GW S Sbjct: 424 SENQDFSHKSRGWKS 438