BLASTX nr result
ID: Mentha29_contig00016975
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00016975 (1693 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

gb|EYU44157.1| hypothetical protein MIMGU_mgv1a006046mg [Mimulus...   536   e-149
gb|EYU44158.1| hypothetical protein MIMGU_mgv1a020049mg [Mimulus...   513   e-142
gb|EPS71762.1| hypothetical protein M569_02997, partial [Genlise...   447   e-123
ref|XP_007216617.1| hypothetical protein PRUPE_ppa027121mg [Prun...   405   e-110
ref|XP_007216183.1| hypothetical protein PRUPE_ppa015845mg, part...   394   e-107
ref|XP_007032647.1| UDP-glucosyl transferase 88A1, putative [The...   392   e-106
gb|EXB38050.1| UDP-glycosyltransferase [Morus notabilis]              379   e-102
gb|EXB38047.1| UDP-glycosyltransferase [Morus notabilis]              379   e-102
ref|XP_006338402.1| PREDICTED: anthocyanidin 5,3-O-glucosyltrans...   369   2e-99
ref|XP_004232188.1| PREDICTED: anthocyanidin 5,3-O-glucosyltrans...   368   5e-99
ref|XP_002324085.2| hypothetical protein POPTR_0017s12490g [Popu...   355   3e-95
ref|XP_006438793.1| hypothetical protein CICLE_v10031419mg [Citr...   353   1e-94
ref|XP_004135442.1| PREDICTED: anthocyanidin 5,3-O-glucosyltrans...   350   8e-94
gb|EXB38045.1| Anthocyanidin 5,3-O-glucosyltransferase [Morus no...   348   3e-93
ref|XP_007045939.1| UDP-glucosyl transferase 88A1, putative isof...   348   4e-93
ref|XP_003563944.1| PREDICTED: anthocyanidin 5,3-O-glucosyltrans...   347   7e-93
gb|ACU64894.1| UDP-T1 [Oryza officinalis]                             346   2e-92
ref|XP_002532899.1| UDP-glucosyltransferase, putative [Ricinus c...   345   3e-92
gb|ACU64887.1| UDP-T1 [Oryza minuta]                                  343   1e-91
ref|XP_002306046.2| hypothetical protein POPTR_0004s12460g [Popu...
340 1e-90 >gb|EYU44157.1| hypothetical protein MIMGU_mgv1a006046mg [Mimulus guttatus] Length = 459 Score = 536 bits (1380), Expect = e-149 Identities = 275/446 (61%), Positives = 337/446 (75%), Gaps = 9/446 (2%) Frame = -1 Query: 1453 PHVALFPCAGMGHLIPFLRLAGQLHSRGCAVTVITVEPTVSAAESNHLSSFFSTFPRIKR 1274 PH+ALFPCAGMGHL+PFLRL+ L SRGC+VT+ITV PTVSAAES+HLS+FF+ P+I+R Sbjct: 12 PHIALFPCAGMGHLLPFLRLSAMLSSRGCSVTLITVNPTVSAAESDHLSAFFAAHPQIQR 71 Query: 1273 LEFRLLPHRKSELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLPVLSRLAST 1094 L F+L+P++KS TN+DPFFIQME I NS+H SA V D P+ +A+T Sbjct: 72 LHFQLIPYKKSNFTNEDPFFIQMESISNSVHLLPPLLSTLSPPLSAVVADFPIAHGVATT 131 Query: 1093 LS-----LPIYTLITTSARFFSLMTSLSHL--QQNADSVEIPNFGPIPLSSVPPPMLNPN 935 LS +PIYTL+TTSARFF+LMT L HL Q++ +EIP+ G IPLS++PPPML+PN Sbjct: 132 LSPEQPPIPIYTLVTTSARFFTLMTHLPHLITQKDNSCIEIPSLGKIPLSNIPPPMLDPN 191 Query: 934 HFFAASITTNTSSLSKSTGVIINTFTSLESQAIEALRRNGV-DQILPIGPLPPF-SETSA 761 FF+A+I +N SSLSK GV+INTF S E +AIEAL +N V +IL +GP +E A Sbjct: 192 TFFSANIISNVSSLSKLNGVLINTFDSFEPEAIEALSQNAVLPEILHVGPFESLETEARA 251 Query: 760 LDLPWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKFLWVLKGGKVDREDKEE 581 LPWLDEQAP SV+++SFGSRTALSK QI EL + L +G KFLWVLKGGKVD++DKEE Sbjct: 252 HTLPWLDEQAPKSVVFVSFGSRTALSKPQIRELGNGLLKTGSKFLWVLKGGKVDKDDKEE 311 Query: 580 VGEMVGAEFVERTKGKGKVIKGWVEQEQILSHAAIGGFVSHCGWNSVTEAAALGVPVLAW 401 VGE++G F+ER KGKG V+KGWV+QE IL HAAIGGFVSHCGWNSVTEAA LGVPV W Sbjct: 312 VGEILGESFLERVKGKGLVVKGWVDQELILGHAAIGGFVSHCGWNSVTEAARLGVPVFGW 371 Query: 400 PLHGDQRVNAAVVEEVGLGIWVREWGWGGERLIGRDEIAKQLRMVMGDEMXXXXXXXXXX 221 PLHGDQRVNA VVE+VGLG WVREWG GE+L+G +EIA++++ +MG+E Sbjct: 372 PLHGDQRVNAEVVEKVGLGFWVREWGL-GEKLVGENEIAEKIKDLMGNENLRGRAMEVKE 430 Query: 220 XXXXXXEINGSSETLIRGLMESFKRK 143 EI+GSSE LIRGL+ES K K Sbjct: 431 KARLAREIDGSSEMLIRGLIESLKNK 456 >gb|EYU44158.1| hypothetical protein MIMGU_mgv1a020049mg [Mimulus guttatus] Length = 465 Score = 513 bits (1320), Expect = e-142 Identities = 266/456 (58%), Positives = 326/456 (71%), Gaps = 19/456 (4%) Frame = -1 Query: 1453 
PHVALFPCAGMGHLIPFLRLAGQLHSRGCAVTVITVEPTVSAAESNHLSSFFSTFPRIKR 1274 PH+ALFPCAGMGHL+PFLRL+ L SRGC+VT+ITV PTV+AAES+HLS+FF+ P+I+R Sbjct: 7 PHIALFPCAGMGHLLPFLRLSAMLSSRGCSVTLITVNPTVTAAESDHLSAFFAAHPQIQR 66 Query: 1273 LEFRLLPHRKSELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLPV----LSR 1106 L F+LLP++KS TN+DPFFIQME I NS+H SA + D V + Sbjct: 67 LHFQLLPYKKSNFTNEDPFFIQMESISNSVHLLPPLLSTLSPPLSAVIADFSVANAVFTH 126 Query: 1105 LASTLSLPIYTLITTSARFFSLMTSLSHLQQNADS--------VEIPNFGPIPLSSVPPP 950 LA L +PIYTL TTSARFF+LMT+L HL + V +P+ G PLS++PPP Sbjct: 127 LAPELPIPIYTLTTTSARFFTLMTNLPHLTTHTQGEDNNGYVYVTVPSLGRTPLSNIPPP 186 Query: 949 MLNPNHFFAASITTNTSSLSKSTGVIINTFTSLESQAIEALRRNG----VDQILPIGPLP 782 ML NH+FAA+I +N SSLSK GVIINTF S E +AIEAL V +ILP+GP Sbjct: 187 MLEANHYFAANIISNLSSLSKLNGVIINTFDSFEPEAIEALISKEKLALVPKILPLGPFE 246 Query: 781 PFSETSALD--LPWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKFLWVLKGG 608 + D LPWLDEQAP SV+++SFG+RTALSKEQI EL + L SG KFLWVLKGG Sbjct: 247 SLETDAREDNNLPWLDEQAPESVVFVSFGNRTALSKEQIRELGNGLLRSGSKFLWVLKGG 306 Query: 607 KVDREDKEEVGEMVGAEFVERTKGKGKVIKGWVEQEQILSHAAIGGFVSHCGWNSVTEAA 428 KVD++DKEEVGE++G F+ER K KG V+KGWV QE IL H A+GGFVSHCGWNSVTEAA Sbjct: 307 KVDKDDKEEVGEILGESFLERVKSKGLVVKGWVNQELILGHVAVGGFVSHCGWNSVTEAA 366 Query: 427 ALGVPVLAWPLHGDQRVNAAVVEEVGLGIWVREWG-WGGERLIGRDEIAKQLRMVMGDEM 251 LGVP+LAWPLHGDQ VNA VVE+VGLG+WVR WG GGE+L+G +EIA++++ +MG++ Sbjct: 367 RLGVPILAWPLHGDQGVNAEVVEKVGLGLWVRGWGLGGGEKLVGENEIAEKIKDLMGNQK 426 Query: 250 XXXXXXXXXXXXXXXXEINGSSETLIRGLMESFKRK 143 E NGSSE LIRG++ES K K Sbjct: 427 LRSIAMEVKEKARLVREANGSSEMLIRGVIESLKNK 462 >gb|EPS71762.1| hypothetical protein M569_02997, partial [Genlisea aurea] Length = 431 Score = 447 bits (1149), Expect = e-123 Identities = 224/406 (55%), Positives = 286/406 (70%), Gaps = 8/406 (1%) Frame = -1 Query: 1450 HVALFPCAGMGHLIPFLRLAGQLHSRGCAVTVITVEPTVSAAESNHLSSFFSTFPRIKRL 1271 H+AL P AGMGHL+PFLRL L +RG VT+IT PTV+ AES+HLS FFS FP I RL Sbjct: 1 HLALLPVAGMGHLMPFLRLGAMLAARGATVTIITAHPTVTTAESDHLSRFFSQFPAINRL 60 Query: 1270 
EFRLLPHRK--SELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLPV---LSR 1106 EF L+P + SEL NDDPFFIQ E IG S H SA V D PV LS Sbjct: 61 EFHLIPREEYNSELKNDDPFFIQFESIGKSAHLLVPQLSSLSPPLSALVADFPVNAALSE 120 Query: 1105 LASTLSLPIYTLITTSARFFSLMTSLSHLQQN--ADSVEIPNFGPIPLSSVPPPMLNPNH 932 ++ LS+P+YTLITTSARFF++M L + ++ +++EIP G IP SS+PP ML+ H Sbjct: 121 ISDALSIPLYTLITTSARFFTIMFHLPRILEDNKKEAIEIPKLGKIPSSSIPPIMLDQAH 180 Query: 931 FFAASITTNTSSLSKSTGVIINTFTSLESQAIEALRRNGVDQILPIGPLPPFSETSALDL 752 FF++ IT+N +L KS G++INTF S E +AI+ L ILPIGPL + + +L Sbjct: 181 FFSSFITSNALTLHKSKGILINTFHSFEPEAIQCLTNPLPCPILPIGPLDVYDQHQPFNL 240 Query: 751 -PWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKFLWVLKGGKVDREDKEEVG 575 PWLD Q+P SV+Y+SFG+RT+LSK+Q+ EL LE S CKFLWV+K KVD ED E + Sbjct: 241 LPWLDNQSPGSVVYVSFGNRTSLSKQQLQELGHGLEKSRCKFLWVVKSKKVDTEDTEGID 300 Query: 574 EMVGAEFVERTKGKGKVIKGWVEQEQILSHAAIGGFVSHCGWNSVTEAAALGVPVLAWPL 395 E++G FVER K +G ++KGWV+QE+IL H ++GGF+SHCGWNSV EAA LGVP+LAWP Sbjct: 301 EILGGPFVERNKERGMILKGWVDQEKILGHPSVGGFMSHCGWNSVMEAARLGVPILAWPQ 360 Query: 394 HGDQRVNAAVVEEVGLGIWVREWGWGGERLIGRDEIAKQLRMVMGD 257 HGDQR+NA VVE+ GLGIW EWGW G++L+ RDEI+ + +MG+ Sbjct: 361 HGDQRINADVVEKGGLGIWPEEWGWLGQKLVKRDEISNMISKLMGE 406 >ref|XP_007216617.1| hypothetical protein PRUPE_ppa027121mg [Prunus persica] gi|462412767|gb|EMJ17816.1| hypothetical protein PRUPE_ppa027121mg [Prunus persica] Length = 465 Score = 405 bits (1042), Expect = e-110 Identities = 216/462 (46%), Positives = 279/462 (60%), Gaps = 18/462 (3%) Frame = -1 Query: 1474 SKSENEAPHVALFPCAGMGHLIPFLRLAGQLHSRGCAVTVITVEPTVSAAESNHLSSFFS 1295 SK+ PH+AL P AGMGHL PFLRLA L SR C VT+IT P+VSAAES+H+S F S Sbjct: 2 SKTTASPPHIALLPSAGMGHLTPFLRLASMLSSRSCTVTLITASPSVSAAESSHVSFFLS 61 Query: 1294 TFPRIKRLEFRLLPHRK-SELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLP 1118 P +K +EF+++P + S T DDPFF+Q E S+H SA D Sbjct: 62 QHPLVKHIEFKVIPSKPYSNPTTDDPFFLQFEATNRSVHLLYPSLASASPPLSAIFSDFA 121 Query: 1117 VLSR---LASTLSLPIYTLITTSARFFSLMTSLSHLQQNADS-------VEIPNFGPIPL 968 V S +A+ L +P Y + TTS +FF LM L L + S V IP P 
PL Sbjct: 122 VASSFAPVAADLGIPNYIISTTSCKFFCLMAYLPVLLSDPSSFSSGLSEVNIPGITPFPL 181 Query: 967 SSVPPPMLNPNHFFAASITTNTSSLSKSTGVIINTFTSLESQAIEALRRNGV----DQIL 800 S+PP NPNH F + I T+ +LSK+ G+++NTF E + + A+ + V IL Sbjct: 182 PSIPPQFKNPNHLFTSLIATSAQALSKAKGILMNTFDDFEPETLAAVNSSRVLDNLPPIL 241 Query: 799 PIGPLPPFSETSALD---LPWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKF 629 PIGPL F D LPWLD Q SV+Y+SFGSRTALS QI EL+ LE SG +F Sbjct: 242 PIGPLETFEPKKEQDQSYLPWLDSQPAESVVYVSFGSRTALSSAQIRELSKGLERSGYRF 301 Query: 628 LWVLKGGKVDREDKEEVGEMVGAEFVERTKGKGKVIKGWVEQEQILSHAAIGGFVSHCGW 449 LWVLK KVD++DKEE+ +++ F++RTK KG+V+KGWV Q+ IL H A GGF+SHCGW Sbjct: 302 LWVLKTSKVDKDDKEELKDLLEESFLDRTKNKGRVVKGWVSQQDILEHPATGGFISHCGW 361 Query: 448 NSVTEAAALGVPVLAWPLHGDQRVNAAVVEEVGLGIWVREWGWGGERLIGRDEIAKQLRM 269 NSV EAA G+P+LAWP HGDQ VNA VVE+ GLGIW R+W WG E L+ +EI K++ Sbjct: 362 NSVMEAARKGIPMLAWPQHGDQSVNAEVVEKAGLGIWERKWDWGLEGLVSGEEIGKKIVE 421 Query: 268 VMGDEMXXXXXXXXXXXXXXXXEINGSSETLIRGLMESFKRK 143 +M DE I G SE ++ ++E ++K Sbjct: 422 LMEDEKLRGLARKVGENAGKATGIGGKSEKVLTEVLEYLEQK 463 >ref|XP_007216183.1| hypothetical protein PRUPE_ppa015845mg, partial [Prunus persica] gi|462412333|gb|EMJ17382.1| hypothetical protein PRUPE_ppa015845mg, partial [Prunus persica] Length = 433 Score = 394 bits (1013), Expect = e-107 Identities = 206/425 (48%), Positives = 265/425 (62%), Gaps = 18/425 (4%) Frame = -1 Query: 1474 SKSENEAPHVALFPCAGMGHLIPFLRLAGQLHSRGCAVTVITVEPTVSAAESNHLSSFFS 1295 SK+ PH+AL P AGMGHL PFLRLA L SR C VT+IT P+VSAAES+H+S F S Sbjct: 2 SKTTASPPHIALLPSAGMGHLTPFLRLASMLSSRSCTVTLITASPSVSAAESSHVSFFLS 61 Query: 1294 TFPRIKRLEFRLLPHR-KSELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLP 1118 P +K +EF+++P + S T DDPFF+Q E S+H SA D Sbjct: 62 QHPLVKHIEFQVIPSKPSSNPTTDDPFFLQFEATNRSVHLLYPSLASASPPISAIFSDFA 121 Query: 1117 VLSRLA---STLSLPIYTLITTSARFFSLMTSLSHLQQNADS-------VEIPNFGPIPL 968 V S +A + L +P Y + TTS +FF LM L L + S V IP P PL Sbjct: 122 VASSIAPVAADLGIPNYIISTTSCKFFCLMAYLPVLLSDPSSFSSGLSEVNIPGITPFPL 181 Query: 967 
SSVPPPMLNPNHFFAASITTNTSSLSKSTGVIINTFTSLESQAIEALRRNGV----DQIL 800 S+PPP NP+H + I T+ +LSK+ G+++NTF E + + ++ V IL Sbjct: 182 PSIPPPFKNPSHLLTSLIATDAQALSKAKGILMNTFDDFERETLAPIKSGRVLDNLPPIL 241 Query: 799 PIGPLPPFSETSALD---LPWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKF 629 PIGPL + D LPWLD Q SV+Y+SFGSRTALS QI EL+ LE SG +F Sbjct: 242 PIGPLETYEPKKEQDQSYLPWLDSQPAESVVYVSFGSRTALSSAQIRELSKGLERSGYRF 301 Query: 628 LWVLKGGKVDREDKEEVGEMVGAEFVERTKGKGKVIKGWVEQEQILSHAAIGGFVSHCGW 449 LWV K KVD++DKEE+ +++ F++RTK KG+V+KGWV Q+ IL H AIGGF+SHCGW Sbjct: 302 LWVPKTSKVDKDDKEELKDLLEESFLDRTKNKGRVVKGWVSQQDILEHPAIGGFISHCGW 361 Query: 448 NSVTEAAALGVPVLAWPLHGDQRVNAAVVEEVGLGIWVREWGWGGERLIGRDEIAKQLRM 269 NSV EA G+P+LAWP H DQ VNA VVE+ GLGIW R+WGWG E L+ +EI K++ Sbjct: 362 NSVMEAVRKGIPMLAWPQHMDQSVNAEVVEKAGLGIWERKWGWGLEGLVSGEEIGKKIVE 421 Query: 268 VMGDE 254 +M DE Sbjct: 422 LMEDE 426 >ref|XP_007032647.1| UDP-glucosyl transferase 88A1, putative [Theobroma cacao] gi|508711676|gb|EOY03573.1| UDP-glucosyl transferase 88A1, putative [Theobroma cacao] Length = 467 Score = 392 bits (1007), Expect = e-106 Identities = 206/457 (45%), Positives = 277/457 (60%), Gaps = 16/457 (3%) Frame = -1 Query: 1468 SENEAPHVALFPCAGMGHLIPFLRLAGQLHSRGCAVTVITVEPTVSAAESNHLSSFFSTF 1289 SE PH+ALFP AGMGHL PFLRLA L S C VT++T + TVSAAES ++S F ST Sbjct: 7 SEASQPHIALFPSAGMGHLTPFLRLASMLLSHNCMVTLLTTKSTVSAAESTYISFFLSTN 66 Query: 1288 PRIKRLEFRLLPHRKSELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLPV-- 1115 P IK +EF++ P + S T DDPFFIQ + S H SA DL V Sbjct: 67 PEIKHIEFQVPPMQPSNTTADDPFFIQFKATSRSAHLIYPLISSLSPPLSAIFSDLVVAS 126 Query: 1114 -LSRLASTLSLPIYTLITTSARFFSLMTSL-------SHLQQNADSVEIPNFGPIPLSSV 959 +S++A L +P Y + TTSA+F SL+ L + L + +EIP P+P+SS+ Sbjct: 127 GVSKVAVYLGIPNYAVSTTSAKFLSLLAYLPILTSDAAKLSNRSTDIEIPGLTPLPISSI 186 Query: 958 PPPMLNPNHFFAASITTNTSSLSKSTGVIINTFTSLESQAIEALRRN----GVDQILPIG 791 PPP NP+H F A++ +N +L G+++NTF E + + A+ + ILPIG Sbjct: 187 PPPFFNPDHLFTATLVSNAIALPDCKGILMNTFDCFEPETLSAINNKRALRNLPPILPIG 246 Query: 790 
PLPPFSETSALD--LPWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKFLWVL 617 PL + L LPWL+ Q SV+++SFGSRTA++K+QI EL LE S +FLW+L Sbjct: 247 PLETYELKKDLGQYLPWLNSQPAESVVFVSFGSRTAMTKDQIKELRHGLEKSEYRFLWIL 306 Query: 616 KGGKVDREDKEEVGEMVGAEFVERTKGKGKVIKGWVEQEQILSHAAIGGFVSHCGWNSVT 437 K VD++D E++ +++ F+ERTK KG V+K WV Q+ IL+H A+GGFV+HCGWNSV Sbjct: 307 KTKTVDKDDTEDLEDLLSCSFLERTKNKGMVLKEWVNQQDILAHPAVGGFVNHCGWNSVM 366 Query: 436 EAAALGVPVLAWPLHGDQRVNAAVVEEVGLGIWVREWGWGGERLIGRDEIAKQLRMVMGD 257 EAA G+P+LAWP HGDQR NA V+E+ GLGIW R WGWGG+RL+ DEI K++ +M D Sbjct: 367 EAAQRGIPMLAWPQHGDQRANAEVLEKAGLGIWDRTWGWGGQRLVKTDEIQKRISELMTD 426 Query: 256 EMXXXXXXXXXXXXXXXXEINGSSETLIRGLMESFKR 146 GSS I ++ES K+ Sbjct: 427 VKLKSRAKKVGEEARKATGNGGSSIKTIMEVIESLKQ 463 >gb|EXB38050.1| UDP-glycosyltransferase [Morus notabilis] Length = 463 Score = 379 bits (972), Expect = e-102 Identities = 196/450 (43%), Positives = 274/450 (60%), Gaps = 18/450 (4%) Frame = -1 Query: 1453 PHVALFPCAGMGHLIPFLRLAGQLHSRGCAVTVITVEPTVSAAESNHLSSFFSTFPRIKR 1274 PH+AL P AGMGHL+PFLR+A L SR C VT+IT +P VSAAES+H+S+F S P++K Sbjct: 13 PHIALIPSAGMGHLLPFLRIASTLLSRNCTVTLITAKPIVSAAESSHISAFLSQHPQVKH 72 Query: 1273 LEFRLLPHRKSELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLPVLSRL--- 1103 ++F+ + T DDPF++Q E I S H SA D V S + Sbjct: 73 VDFQTIQSHNP--TADDPFYLQYESITRSAHLLYPLLSSSSLPFSAIFADFIVASSITPM 130 Query: 1102 ASTLSLPIYTLITTSARFFSLMTSL-------SHLQQNADSVEIPNFGPIPLSSVPPPML 944 A+ L +P Y + TTS +FF L+ L + L ++ + IP P P+SS+PPP Sbjct: 131 AAELGIPSYIICTTSIKFFCLIAYLPVLVTDPAKLGNSSTELIIPGLTPFPVSSIPPPFK 190 Query: 943 NPNHFFAASITTNTSSLSKSTGVIINTFTSLESQAIEALRR-----NGVDQILPIGPLPP 779 NPNH F + N +LSK+ G+I+N+ E + +E ++ N + LPIGPL Sbjct: 191 NPNHLFTRCLALNAKALSKAEGIIVNSVDFFEKETLEEIKNGRVLENSLPSFLPIGPLAS 250 Query: 778 FS--ETSALDLPWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKFLWVLKGGK 605 F + + WLD Q SV+Y+SFGSRTA+S++QI E++ LE SG +FLWV+K Sbjct: 251 FEIKKDKGEYMSWLDNQPEESVVYVSFGSRTAISRDQIREVSKGLERSGHRFLWVVKSTT 310 Query: 604 VDREDKEEVGEMVGAEFVERTKGKGKVIKGWVEQEQILSHAAIGGFVSHCGWNSVTEAAA 425 
+D+EDK+E+ +++G F+ERT KG +K WV QE+IL+H +IG FVSHCGWNSV EAA Sbjct: 311 IDKEDKDELKDLLGRSFLERTMNKGMAVKEWVSQEEILAHTSIGAFVSHCGWNSVIEAAR 370 Query: 424 LGVPVLAWPLHGDQRVNAAVVEEVGLGIWVREWGWGGE-RLIGRDEIAKQLRMVMGDEMX 248 GVP++AWP HGDQ+VNA +VE+ GLGIW R WGW + L+ +EI +++R VM DE Sbjct: 371 QGVPMVAWPQHGDQKVNAEIVEKAGLGIWERNWGWADQAELVCGEEIGEKIREVMEDEKL 430 Query: 247 XXXXXXXXXXXXXXXEINGSSETLIRGLME 158 +I G SE +++ L+E Sbjct: 431 REKAKKVGEEARKATKIGGKSEKVLKELLE 460 >gb|EXB38047.1| UDP-glycosyltransferase [Morus notabilis] Length = 463 Score = 379 bits (972), Expect = e-102 Identities = 196/450 (43%), Positives = 273/450 (60%), Gaps = 18/450 (4%) Frame = -1 Query: 1453 PHVALFPCAGMGHLIPFLRLAGQLHSRGCAVTVITVEPTVSAAESNHLSSFFSTFPRIKR 1274 PH+AL P AGMGHL+PFLR+A L SR C VT+IT +P VSAAES+H+S+F S P++K Sbjct: 13 PHIALIPSAGMGHLLPFLRIASTLLSRNCTVTLITAKPIVSAAESSHISAFLSQHPQVKH 72 Query: 1273 LEFRLLPHRKSELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLPVLSRL--- 1103 ++F+ + T DDPF++Q E I S H SA D V S + Sbjct: 73 VDFQTIQSHNP--TADDPFYLQYESITRSAHLLYPLLSSSSPPFSAIFADFFVASSITPM 130 Query: 1102 ASTLSLPIYTLITTSARFFSLMTSL-------SHLQQNADSVEIPNFGPIPLSSVPPPML 944 A+ L +P Y + TTS +FF L+ L + L ++ + IP P P+SS+P P Sbjct: 131 AAELGIPSYIICTTSIKFFCLIAYLPVLVTDPAKLGNSSTELIIPGLTPFPVSSIPSPFK 190 Query: 943 NPNHFFAASITTNTSSLSKSTGVIINTFTSLESQAIEALRR-----NGVDQILPIGPLPP 779 NPNH F + N SK+ G+I+N+ E + +E ++ N + LPIGPL Sbjct: 191 NPNHLFTRCLVLNAKEFSKAKGIIVNSVDFFEKETLEEIKNGRVLENSLPSFLPIGPLAS 250 Query: 778 FS--ETSALDLPWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKFLWVLKGGK 605 F + + WLD Q SV+Y+SFGSRTA+S++QI E++ LE SG +FLWV+K K Sbjct: 251 FEIKKDKGEYMSWLDNQPEESVVYVSFGSRTAISRDQIREVSKGLERSGHRFLWVVKSTK 310 Query: 604 VDREDKEEVGEMVGAEFVERTKGKGKVIKGWVEQEQILSHAAIGGFVSHCGWNSVTEAAA 425 +D+EDK+E+ +++G F+ERT KG +KGWV QE+IL+H +IG FVSHCGWNSV EAA Sbjct: 311 IDKEDKDELKDLLGGSFLERTMNKGMAVKGWVSQEEILAHPSIGAFVSHCGWNSVIEAAR 370 Query: 424 LGVPVLAWPLHGDQRVNAAVVEEVGLGIWVREWGWGGE-RLIGRDEIAKQLRMVMGDEMX 248 GVP++AWP HGDQ+VNA +VE+ GLGIW R WGW + L+ +EI +++R VM DE Sbjct: 371 
QGVPMVAWPQHGDQKVNAEIVEKAGLGIWERNWGWADQAELVCGEEIGEKIREVMEDEKL 430 Query: 247 XXXXXXXXXXXXXXXEINGSSETLIRGLME 158 +I G SE +++ L+E Sbjct: 431 REKAKKVGEEARKATKIGGKSEKVLKELLE 460 >ref|XP_006338402.1| PREDICTED: anthocyanidin 5,3-O-glucosyltransferase-like [Solanum tuberosum] Length = 453 Score = 369 bits (948), Expect = 2e-99 Identities = 201/451 (44%), Positives = 277/451 (61%), Gaps = 14/451 (3%) Frame = -1 Query: 1453 PHVALFPCAGMGHLIPFLRLAGQLHSRGCAVTVITVEPTVSAAESNHLSSFFSTFPRIKR 1274 PH+AL P AGMGHL+PFLRLA L SR C VT++ +PTVSAAESNHL+SFFS P I+R Sbjct: 3 PHIALLPSAGMGHLMPFLRLAAMLASRNCKVTLLPAQPTVSAAESNHLNSFFSAHPHIQR 62 Query: 1273 LEFRLLPHRKSELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLPV---LSRL 1103 L+F ++P S + DPFF+Q E I S+H SA +D+ + +L Sbjct: 63 LDFHVVPLHTSN-PHGDPFFLQFEAIIRSVHLLPPLLSSLSPPISALFLDIAAATCVDQL 121 Query: 1102 AS--TLSLPIYTLITTSARFFSLMTSLSHL--QQNADSVEIPNFGPIPLSSVPPPMLNPN 935 A +LS+ Y L TTSARFFSL++ L HL + + +++++ +S++PPP+ NP Sbjct: 122 ADHPSLSISYYILSTTSARFFSLLSHLPHLTLESSCENLKLHGLPSFSISNIPPPLFNPQ 181 Query: 934 HFFAASITTNTSSLSKSTGVIINTFTSLESQAIEALRRNGVD----QILPIGPLPPFSET 767 + F + +N ++S+ GV+ NTF E++ IEAL Q LPIGP P+ + Sbjct: 182 NLFTTQLISNARAISRVKGVVSNTFHWFEAETIEALNSGKTSITLPQFLPIGPFKPYEDP 241 Query: 766 S-ALDLPWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKFLWVLKGGKVDRED 590 L WLD Q SV+Y+SFGSRT +SK+QI E+ L S KFLWVLK VD+ + Sbjct: 242 GKCASLSWLDGQPAKSVVYVSFGSRTTMSKDQIKEIGEGLLKSKQKFLWVLKSVIVDKVE 301 Query: 589 KEEVGEMVGAEFVERTKGK--GKVIKGWVEQEQILSHAAIGGFVSHCGWNSVTEAAALGV 416 + E+ E+VG +E+ + K G V+K WV+QE+IL+H AIGGF SHCGWNS EAA GV Sbjct: 302 ETELQELVGRSLLEKIEEKKQGIVVKEWVKQEEILTHHAIGGFFSHCGWNSAMEAAQRGV 361 Query: 415 PVLAWPLHGDQRVNAAVVEEVGLGIWVREWGWGGERLIGRDEIAKQLRMVMGDEMXXXXX 236 P+LAW L+GDQR NA VVE+ GLG+W + WGW GERL+ +EI +++ +M D Sbjct: 362 PMLAWTLNGDQRFNAEVVEKAGLGLWPKHWGWLGERLVKSEEIEEKIEELMQDHKFRSMA 421 Query: 235 XXXXXXXXXXXEINGSSETLIRGLMESFKRK 143 EI G+SE ++ ++E K K Sbjct: 422 QKVGEEAKRAWEIGGTSEKVVGQIIEMLKLK 452 >ref|XP_004232188.1| PREDICTED: anthocyanidin 
5,3-O-glucosyltransferase-like [Solanum lycopersicum] Length = 461 Score = 368 bits (944), Expect = 5e-99 Identities = 204/452 (45%), Positives = 278/452 (61%), Gaps = 15/452 (3%) Frame = -1 Query: 1453 PHVALFPCAGMGHLIPFLRLAGQLHSRGCAVTVITVEPTVSAAESNHLSSFFSTFPRIKR 1274 PH+AL P AGMGHL+PFLRLA L SR C VT++T +PTVSAAES HL+SFFS P I+R Sbjct: 3 PHIALLPSAGMGHLMPFLRLAAMLASRNCKVTLLTAQPTVSAAESKHLNSFFSAHPHIQR 62 Query: 1273 LEFRLLPHRKSELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLPV---LSRL 1103 L+F+++P + S + DPFF+Q E I S+H SA +D+ + +L Sbjct: 63 LDFQVVPLQSSN-PHGDPFFLQFEAIIRSVHLLPPLLSSLSPPISALFLDIAAATCVDQL 121 Query: 1102 AS--TLSLPIYTLITTSARFFSLMTSLSHLQQNADSVEIPNFG--PIPLSSVPPPMLNPN 935 A +LS+ Y L TTSARFFSL+T L HL + V + G +S++PPP+ NP Sbjct: 122 ADHPSLSISYYILSTTSARFFSLITHLPHLTLESSCVNLKLHGLPSFSISNIPPPIFNPQ 181 Query: 934 HFFAASITTNTSSLSKSTGVIINTFTSLESQAIEALRRNGVD----QILPIGPLPPFSET 767 + F + +N ++S+ GV+ NTF E++ IE L Q LPIGP + + Sbjct: 182 NLFTTQMISNARAISRVKGVVSNTFHWFEAETIEPLNSGKTSITLPQFLPIGPFKHYEDP 241 Query: 766 SALD-LPWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKFLWVLKGGKVDRED 590 L WLDEQ SV+Y+SFGSRTA+SK+QI E+ L S KFLWVLK KVD+ + Sbjct: 242 GKCSSLSWLDEQPAKSVVYVSFGSRTAMSKDQIKEIGEGLLKSKQKFLWVLKSVKVDKAE 301 Query: 589 KEEVGEMVGAEFVERTKGK--GKVIKGWVEQEQILSHAAIGGFVSHCGWNSVTEAAALGV 416 + E+ E+VG +E+ + K G V+K WV+QE+IL+H AIGGF SHCGWNS EAA GV Sbjct: 302 ETELKELVGHSLLEKIEEKKQGIVVKEWVKQEEILTHHAIGGFFSHCGWNSTMEAAQRGV 361 Query: 415 PVLAWPLHGDQRVNAAVVEEVGLGIWVREWGWGGERLIGRDEIAKQLRMVMGD-EMXXXX 239 P+LAW L+GDQR NA VVE+ GLG+W + WGW GERL+ +EI +++ +M D ++ Sbjct: 362 PMLAWTLNGDQRFNAEVVEKAGLGLWPKHWGWLGERLVKSEEIEEKIEELMQDHKLRSMV 421 Query: 238 XXXXXXXXXXXXEINGSSETLIRGLMESFKRK 143 + G+SE ++ L+E K K Sbjct: 422 PKVGGRGQNGLGKFGGTSEKVVGQLIEMLKLK 453 >ref|XP_002324085.2| hypothetical protein POPTR_0017s12490g [Populus trichocarpa] gi|550320130|gb|EEF04218.2| hypothetical protein POPTR_0017s12490g [Populus trichocarpa] Length = 460 Score = 355 bits (912), Expect = 3e-95 Identities = 195/459 (42%), Positives = 272/459 (59%), 
Gaps = 17/459 (3%) Frame = -1 Query: 1474 SKSENEAPHVALFPCAGMGHLIPFLRLAGQLHSRGCAVTVITVEPTVSAAESNHLSSFFS 1295 S S+ + HVAL P AGMGHL PFLRLA L +R VT IT PTVS ES LS FF+ Sbjct: 3 SSSDQKLAHVALLPSAGMGHLTPFLRLAALLTARNVQVTFITPHPTVSLTESQALSGFFA 62 Query: 1294 TFPRIKRLEFRLLPHRKSELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLPV 1115 +FP++K+ +F LLP ++ + DPFF QM+ I +S H S + D+ + Sbjct: 63 SFPQVKQKQFHLLPLEENSV---DPFFYQMQLIKSSCHLLSPLLSALTPSLSVFITDMTL 119 Query: 1114 LSR---LASTLSLPIYTLITTSARFFSLMTSLSHLQ--------QNADSVEIPNFGPIPL 968 S + +SLP Y L T+SA+ +L S L D ++I N +P Sbjct: 120 ASTVIPITQAISLPNYVLFTSSAKMMTLFLSYPTLAGSKALDDLDETDVIKIRNVELMPK 179 Query: 967 SSVPPPMLNP-NHFFAASITTNTSSLSKSTGVIINTFTSLESQAIEALRRNGV----DQI 803 S +PPP+L N+FF S + +++S G+++NTF S E +++ + V + Sbjct: 180 SLLPPPLLQKSNNFFKNSFIEDGRKVTESCGILLNTFVSFELESLRKINDGQVLERPPSV 239 Query: 802 LPIGPLPPF-SETSALDLPWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKFL 626 + IGP PP SE S L L WLD+Q SVLY+SFGSRTAL+++QI EL L SG +F+ Sbjct: 240 VAIGPFPPCNSEKSQLQLTWLDDQPAGSVLYVSFGSRTALARDQIRELGEGLIKSGSRFV 299 Query: 625 WVLKGGKVDREDKEEVGEMVGAEFVERTKGKGKVIKGWVEQEQILSHAAIGGFVSHCGWN 446 W++K KVD+ED EE+ E++G E +ER K KG ++K W+ Q+ ILSH A+GGF+SHCGWN Sbjct: 300 WMVKDKKVDKEDSEELEEVIGYELMERVKEKGLIVKDWLNQDGILSHRAVGGFLSHCGWN 359 Query: 445 SVTEAAALGVPVLAWPLHGDQRVNAAVVEEVGLGIWVREWGWGGERLIGRDEIAKQLRMV 266 SV EAA GV +LAWP +GDQ++NA +VE +GLG WV+ WGW GE L+ EIA+++R Sbjct: 360 SVMEAAWHGVRILAWPQNGDQKINADIVERIGLGTWVKSWGWSGEMLVKGAEIAERIRES 419 Query: 265 MGDEMXXXXXXXXXXXXXXXXEINGSSETLIRGLMESFK 149 MG+E GSS+ + L+ +K Sbjct: 420 MGNESLRIQALGIKEDARKAVGFGGSSDKGLTELISMWK 458 >ref|XP_006438793.1| hypothetical protein CICLE_v10031419mg [Citrus clementina] gi|568859072|ref|XP_006483066.1| PREDICTED: anthocyanidin 5,3-O-glucosyltransferase-like [Citrus sinensis] gi|557540989|gb|ESR52033.1| hypothetical protein CICLE_v10031419mg [Citrus clementina] Length = 472 Score = 353 bits (906), Expect = 1e-94 Identities = 191/456 (41%), Positives = 257/456 (56%), Gaps = 21/456 (4%) Frame = -1 Query: 1453 
PHVALFPCAGMGHLIPFLRLAGQLHSRGCAVTVITVEPTVSAAESNHLSSFFSTFPRIKR 1274 PHVAL P AGMGHL PFLRLA L C VT+IT PTVS AE+ H+S F S +P++ Sbjct: 11 PHVALIPSAGMGHLTPFLRLAASLVQHHCRVTLITTYPTVSLAETQHVSHFLSAYPQVTE 70 Query: 1273 LEFRLLPHRKSELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLPVLSRLAST 1094 F LLP + DPF ++ E I S H V + + + Sbjct: 71 KRFHLLPFDPNSANATDPFLLRWEAIRRSAHLLAPLLSPPLSALITDVTLISAVLPVTIN 130 Query: 1093 LSLPIYTLITTSARFFSLM-----------TSLSHLQQNADSVEIPNFGPIPLSSVPPPM 947 L LP Y L T SA+ FSL TS ++ + D +EIP PIPLSSVPP + Sbjct: 131 LHLPNYVLFTASAKMFSLTASFPAIVASKSTSSGSVEFDDDFIEIPGLPPIPLSSVPPAV 190 Query: 946 LNPNHFFAASITTNTSSLSKSTGVIINTFTSLESQAIEALRRN----GVDQILPIGPLPP 779 ++ FA S N +S KS GV+IN+F +LE+ + AL G+ + +GPL P Sbjct: 191 MDSKSLFATSFLENGNSFVKSNGVLINSFDALEADTLVALNGRRVVAGLPPVYAVGPLLP 250 Query: 778 FS------ETSALDLPWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKFLWVL 617 +++L L WLD+Q SV+Y+SFGSR ALS EQ EL L SGC+FLWV+ Sbjct: 251 CEFEKRDDPSTSLILKWLDDQPEGSVVYVSFGSRLALSMEQTKELGDGLLSSGCRFLWVV 310 Query: 616 KGGKVDREDKEEVGEMVGAEFVERTKGKGKVIKGWVEQEQILSHAAIGGFVSHCGWNSVT 437 KG VD+ED+E + ++G E E+ K +G V+K WV+Q+++LSH A+GGFVSH GWNS+ Sbjct: 311 KGKIVDKEDEESLKNVLGHELTEKIKDQGLVVKNWVDQDKVLSHRAVGGFVSHGGWNSLV 370 Query: 436 EAAALGVPVLAWPLHGDQRVNAAVVEEVGLGIWVREWGWGGERLIGRDEIAKQLRMVMGD 257 EAA GVP+L WP GDQ++NA VE GLG+WVR WGWG E DEI +++ +M + Sbjct: 371 EAARHGVPLLVWPHFGDQKINAEAVERAGLGMWVRSWGWGTELRAKGDEIGLKIKDLMAN 430 Query: 256 EMXXXXXXXXXXXXXXXXEINGSSETLIRGLMESFK 149 + + GSSE + L++ +K Sbjct: 431 DFLREQAKRIEEEARKAIGVGGSSERTFKELIDKWK 466 >ref|XP_004135442.1| PREDICTED: anthocyanidin 5,3-O-glucosyltransferase-like [Cucumis sativus] gi|449530181|ref|XP_004172074.1| PREDICTED: anthocyanidin 5,3-O-glucosyltransferase-like [Cucumis sativus] Length = 458 Score = 350 bits (899), Expect = 8e-94 Identities = 186/454 (40%), Positives = 266/454 (58%), Gaps = 13/454 (2%) Frame = -1 Query: 1468 SENEAPHVALFPCAGMGHLIPFLRLAGQLHSRGCAVTVITVEPTVSAAESNHLSSFFSTF 1289 S + HVALFP AGMGHL+PFLRLA L S C +T+IT P VS+AES+ +S F S F Sbjct: 3 
SSDHQTHVALFPSAGMGHLVPFLRLANTLLSHNCKLTLITSHPPVSSAESHLISRFLSAF 62 Query: 1288 PRIKRLEFRLLPHRKSELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLPVLS 1109 P++ L+F +LP S +DDPFF+Q E I S+H SA V D+ ++S Sbjct: 63 PQVNELKFHILPLDPSIANSDDPFFLQFEAIRRSVHVLNSPISALSPPLSALVCDVTLIS 122 Query: 1108 R---LASTLSLPIYTLITTSARFFSLMTSLSHLQQN---ADSVEIPNFGPIPLSSVPPPM 947 L +TL++PIY L T+SA+ SL + + +D + IP G IP +S+PPP+ Sbjct: 123 SGLLLNTTLNIPIYALFTSSAKMLSLFAYYPFAKMSDPSSDFIRIPAIGSIPKTSLPPPL 182 Query: 946 LNPNHFFAASITTNTSSLSKSTGVIINTFTSLESQAIEALRR----NGVDQILPIGPLPP 779 L N F + + + G++IN +E + AL NGV ++PIGP P Sbjct: 183 LINNSIFGKIFAQDGQRIKELNGILINAMDGIEGDTLTALNTGKVLNGVPPVIPIGPFLP 242 Query: 778 --FSETSALD-LPWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKFLWVLKGG 608 F A + WLD P SV++ SFGSRTA S++QI E+ S L SG +F+WV+K Sbjct: 243 CDFENPDAKSPIKWLDNLPPRSVVFASFGSRTATSRDQIKEIGSGLVSSGYRFVWVVKDK 302 Query: 607 KVDREDKEEVGEMVGAEFVERTKGKGKVIKGWVEQEQILSHAAIGGFVSHCGWNSVTEAA 428 VD+EDKE + +++G E +++ K KG V+K WV Q++IL H A+GGF+ HCGWNSV EAA Sbjct: 303 VVDKEDKEGLEDIMGEELMKKLKEKGMVLKEWVNQQEILGHRAVGGFICHCGWNSVMEAA 362 Query: 427 ALGVPVLAWPLHGDQRVNAAVVEEVGLGIWVREWGWGGERLIGRDEIAKQLRMVMGDEMX 248 GVP+L WP GDQ +NA ++ + GLG+WV EWGWG + L+ +E+ +++ +M E Sbjct: 363 LNGVPILGWPQIGDQMINAELIAKKGLGMWVEEWGWGQKCLVKGEEVGGRIKEMMESEAL 422 Query: 247 XXXXXXXXXXXXXXXEINGSSETLIRGLMESFKR 146 E+ GS + I+GL+ + + Sbjct: 423 RKQAAKFRDEAIKAVEVGGSCDRAIQGLIRMWSK 456 >gb|EXB38045.1| Anthocyanidin 5,3-O-glucosyltransferase [Morus notabilis] Length = 469 Score = 348 bits (894), Expect = 3e-93 Identities = 189/453 (41%), Positives = 271/453 (59%), Gaps = 18/453 (3%) Frame = -1 Query: 1453 PHVALFPCAGMGHLIPFLRLAGQLHSRGCAVTVITVEPTVSAAESNHLSSFFSTFPRIKR 1274 PHVAL P AGMGHL PF+RLA L + VT IT PTVS +ES LS FSTFPRI R Sbjct: 12 PHVALLPSAGMGHLTPFIRLAVLLTTSNVRVTFITPYPTVSLSESQSLSHLFSTFPRITR 71 Query: 1273 LEFRLLPHRKSELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLPVLSR---L 1103 + LLP ++DPF+ E I +S H SA + D+ + S + Sbjct: 72 KQLHLLPLEDPSAKSEDPFYYHFEVIRHSSHLLSPLLSSLSPPLSALITDMSLASTVIPI 131 Query: 
1102 ASTLSLPIYTLITTSARFFSLMTSLSHL---------QQNADSVEIPNFGPIPLSSVPPP 950 L LP Y T+SA+ +L S + + D ++I PIP S +PPP Sbjct: 132 TDALQLPNYIFFTSSAKMLTLFLSFHIMVDPRDRCETSEMKDFIKIAGLEPIPRSWIPPP 191 Query: 949 ML-NPNHFFAASITTNTSSLSKSTGVIINTFTSLESQAIEALRRNGVDQILP----IGPL 785 +L + + + N +++S+G+++NT +++ +++EAL + V + LP IGPL Sbjct: 192 LLQDTKNLLKSYFIENGKKMTESSGILVNTNETVDGESLEALSKGKVLRGLPPVHAIGPL 251 Query: 784 PPFSETSALDLPWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKFLWVLKGGK 605 PPF+ + L WLD+Q P SVLY+SFGSRTA+S+EQI EL L SG +FLWV+K K Sbjct: 252 PPFNLEQSQPLAWLDDQPPGSVLYVSFGSRTAISREQIRELGDGLVRSGKRFLWVVKDKK 311 Query: 604 VDREDKEEVGEMVGAEFVERTKGKGKVIKGWVEQEQILSHAAIGGFVSHCGWNSVTEAAA 425 VD+ED E+ +M+G + +ER K KG V+K W+ QE++LSHAA+GGF+SH GWNS+TEA Sbjct: 312 VDKEDSLELMDMMGQQLMERMKEKGFVVKNWLNQEEVLSHAAVGGFLSHSGWNSITEALW 371 Query: 424 LGVPVLAWPLHGDQRVNAAVVEEVGLGIWVREWGWGGERLIGR-DEIAKQLRMVMGDEMX 248 GVP+L WP HGDQ++NA +VE +G+G+WV+ WGW GE ++ + +EIA+ + ++G++ Sbjct: 372 HGVPMLLWPQHGDQKINAELVERIGVGMWVKSWGWCGEAMVVKGEEIAETVGELLGNQFM 431 Query: 247 XXXXXXXXXXXXXXXEINGSSETLIRGLMESFK 149 + GSS + L+ES+K Sbjct: 432 RSRAAKVRNEVRMAVDEGGSSYKRLADLIESWK 464 >ref|XP_007045939.1| UDP-glucosyl transferase 88A1, putative isoform 1 [Theobroma cacao] gi|590699499|ref|XP_007045940.1| UDP-glucosyl transferase 88A1, putative isoform 1 [Theobroma cacao] gi|590699502|ref|XP_007045941.1| UDP-glucosyl transferase 88A1, putative isoform 1 [Theobroma cacao] gi|590699505|ref|XP_007045942.1| UDP-glucosyl transferase 88A1, putative isoform 1 [Theobroma cacao] gi|590699508|ref|XP_007045943.1| UDP-glucosyl transferase 88A1, putative isoform 1 [Theobroma cacao] gi|590699511|ref|XP_007045944.1| UDP-glucosyl transferase 88A1, putative isoform 1 [Theobroma cacao] gi|508709874|gb|EOY01771.1| UDP-glucosyl transferase 88A1, putative isoform 1 [Theobroma cacao] gi|508709875|gb|EOY01772.1| UDP-glucosyl transferase 88A1, putative isoform 1 [Theobroma cacao] gi|508709876|gb|EOY01773.1| UDP-glucosyl transferase 88A1, putative isoform 1 [Theobroma 
cacao] gi|508709877|gb|EOY01774.1| UDP-glucosyl transferase 88A1, putative isoform 1 [Theobroma cacao] gi|508709878|gb|EOY01775.1| UDP-glucosyl transferase 88A1, putative isoform 1 [Theobroma cacao] gi|508709879|gb|EOY01776.1| UDP-glucosyl transferase 88A1, putative isoform 1 [Theobroma cacao] Length = 474 Score = 348 bits (893), Expect = 4e-93 Identities = 188/455 (41%), Positives = 264/455 (58%), Gaps = 20/455 (4%) Frame = -1 Query: 1450 HVALFPCAGMGHLIPFLRLAGQLHSRGCAVTVITVEPTVSAAESNHLSSFFSTFPRIKRL 1271 HVAL P +GMGHL+PFLRLAG L S+ C VT+IT P VS AES +S+F S FP++ Sbjct: 12 HVALLPSSGMGHLLPFLRLAGSLISQRCQVTLITTHPIVSLAESQLISAFLSAFPQVSEK 71 Query: 1270 EFRLLPHRKSELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLPVLSRLAST- 1094 +F LLP +DPF +Q E I S H S + D+ ++S + S Sbjct: 72 KFTLLPLDPLTANCNDPFKLQWETIRRSAHLLSPLLSSLSPPLSFIITDMTLMSSVVSVT 131 Query: 1093 --LSLPIYTLITTSARFFSLMTSLSHLQQN---------ADSVEIPNFG-PIPLSSVPPP 950 L LP Y L TTSAR FSL + ++ D + +P G PIP+SS+P Sbjct: 132 ANLCLPNYILFTTSARMFSLFAYFPSIAESKTDGGSSRFGDEIRVPGLGSPIPVSSLPST 191 Query: 949 MLNPNHFFAASITTNTSSLSKSTGVIINTFTSLESQAIEALR----RNGVDQILPIGPLP 782 +L+ N FF + + N+ S+ GV+IN+F LE Q++E L G+ + P+GPL Sbjct: 192 LLDLNSFFTKNFSDNSRSIKNVNGVLINSFEGLEKQSLEMLTVGKAMEGLPPVFPVGPLL 251 Query: 781 PFS---ETSALDLPWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKFLWVLKG 611 P ++S L WL+ Q SV+Y+SFGSRT +SKEQI EL + L +SG KF+WV+K Sbjct: 252 PLEFEGQSSFSPLKWLEGQKERSVVYVSFGSRTPMSKEQIRELGTGLVLSGYKFVWVVKS 311 Query: 610 GKVDREDKEEVGEMVGAEFVERTKGKGKVIKGWVEQEQILSHAAIGGFVSHCGWNSVTEA 431 VD+E+ E + E++G E E+ G V+K WV Q +ILSH A+GGF+SHCGWNSV EA Sbjct: 312 KVVDKEEDESLDEILGQELKEKVMNNGLVVKEWVNQWKILSHKAVGGFISHCGWNSVVEA 371 Query: 430 AALGVPVLAWPLHGDQRVNAAVVEEVGLGIWVREWGWGGERLIGRDEIAKQLRMVMGDEM 251 A GVPVL WP HGDQ +NA V+E G G+ ++ WGW + ++ +EI +++ +MG E Sbjct: 372 AWHGVPVLGWPQHGDQMINAEVIEGGGWGLCMKSWGWVSDIVVKGEEIGDRIKELMGSET 431 Query: 250 XXXXXXXXXXXXXXXXEINGSSETLIRGLMESFKR 146 + GS E +++ L +S+K+ Sbjct: 432 LKSTAARISEEARQAVGVGGSCENMLKELFQSWKK 466 >ref|XP_003563944.1| PREDICTED: 
anthocyanidin 5,3-O-glucosyltransferase-like [Brachypodium distachyon] Length = 472 Score = 347 bits (891), Expect = 7e-93 Identities = 191/422 (45%), Positives = 256/422 (60%), Gaps = 21/422 (4%) Frame = -1 Query: 1453 PHVALFPCAGMGHLIPFLRLAGQLHS-RGCAVTVITVEPTVSAAESNHLSSFFSTFPRIK 1277 PHV L P AGMGHL+PF RLA L S GC V+++TV PTVS+AES+HL + F FP ++ Sbjct: 12 PHVVLLPSAGMGHLVPFSRLAVALSSAHGCDVSLVTVLPTVSSAESSHLEALFGAFPAVR 71 Query: 1276 RLEFRLLPHRKSELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLPVLS---R 1106 RLEF L SE N DPFF++ E + S +A V D+ + S Sbjct: 72 RLEFHLADFDASEFPNADPFFLRFEAMRRSA-PLLLGPLLARASATALVTDIALSSVVIP 130 Query: 1105 LASTLSLPIYTLITTSARFFSLMTSL-SHLQQNADS----VEIPNFGPIPLSSVPPPMLN 941 +A L LP Y L T SA SL ++L N + V+IP IP +SVP + + Sbjct: 131 VAKQLRLPCYVLFTASAAMLSLCVHFPAYLDANGNGLVGDVDIPGVYQIPKASVPQALHD 190 Query: 940 PNHFFAASITTNTSSLSKSTGVIINTFTSLESQAIEALRRNGVDQ------ILPIGPLPP 779 P H F N L+KS GV++N+F + E +AI ALR V + +GPL P Sbjct: 191 PKHLFTRQFVANGRELAKSDGVLVNSFDAFEPEAIAALREGAVSAAGFFPPVFSVGPLAP 250 Query: 778 FS-----ETSALDLPWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKFLWVLK 614 S A + WL+ Q SV+Y+SFGSR A++++Q+ ELA+ LE SG +FLWV+K Sbjct: 251 VSFPAGNNNRADYIQWLEAQPARSVVYVSFGSRKAVARDQLRELAAGLEASGHRFLWVVK 310 Query: 613 GGKVDREDKEEVGEMVGAEFVERTKGKGKVIKGWVEQEQILSHAAIGGFVSHCGWNSVTE 434 VDR+D ++GE++G F+ER +G+G V KGWVEQE +L ++G F+SHCGWNSVTE Sbjct: 311 STVVDRDDDADLGELLGEGFLERVQGRGMVTKGWVEQEDVLKQESVGLFISHCGWNSVTE 370 Query: 433 AAALGVPVLAWPLHGDQRVNAAVVEEVGLGIWVREWGWGGER-LIGRDEIAKQLRMVMGD 257 AAA G+PVLAWP GDQRVNA VV GLG+WV W W GE ++ + IA++++ VMGD Sbjct: 371 AAAGGLPVLAWPRFGDQRVNAGVVARSGLGVWVDSWSWEGEEGVVSGESIAEKVKAVMGD 430 Query: 256 EM 251 E+ Sbjct: 431 EI 432 >gb|ACU64894.1| UDP-T1 [Oryza officinalis] Length = 461 Score = 346 bits (887), Expect = 2e-92 Identities = 190/416 (45%), Positives = 254/416 (61%), Gaps = 16/416 (3%) Frame = -1 Query: 1453 PHVALFPCAGMGHLIPFLRLAGQLHS-RGCAVTVITVEPTVSAAESNHLSSFFSTFPRIK 1277 PHV L P AGMGHL+PF RLA L S GC V+++TV PTVS AES HL + F FP ++ Sbjct: 12 
PHVVLIPSAGMGHLVPFGRLAVALSSGHGCDVSLVTVLPTVSTAESKHLEALFDAFPAVR 71 Query: 1276 RLEFRLLPHRKSELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLP-VLSRLA 1100 RL+F L P SE DPFF++ E + S A + L V+ +A Sbjct: 72 RLDFELAPFDASEFPGADPFFLRFEAMRRSAPLLGPLLTDAGASALATDIALTSVVIPVA 131 Query: 1099 STLSLPIYTLITTSARFFSLMTSL-SHLQQNAD-----SVEIPNFGPIPLSSVPPPMLNP 938 LP + L T SA SL ++L NA V+IP IP +S+P + +P Sbjct: 132 KEQGLPCHILFTASAAMLSLCAYFPTYLDANAGRGSVGDVDIPGVYRIPKASIPQALHDP 191 Query: 937 NHFFAASITTNTSSLSKSTGVIINTFTSLESQAIEALRR----NGVDQILPIGPLPPFSE 770 NH F N SL+ + G+++NTF +LE +A+ AL++ +G + +GPL P S Sbjct: 192 NHLFTRQFVANGRSLTSAAGILVNTFDALEPEAVTALQQGKVASGFPPVFAVGPLLPASN 251 Query: 769 TS---ALDLPWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKFLWVLKGGKVD 599 + A + WLD Q SV+Y+SFGSR A+S EQ+ ELA+ LE SG +FLWV+K VD Sbjct: 252 QAKDPANYMEWLDAQPARSVVYVSFGSRKAVSGEQLRELAAGLEASGHRFLWVVKSTVVD 311 Query: 598 REDKEEVGEMVGAEFVERTKGKGKVIKGWVEQEQILSHAAIGGFVSHCGWNSVTEAAALG 419 R+D E+GE++G F+ER + +G V K WVEQE++L H A+G FVSHCGWNSVTEAAA G Sbjct: 312 RDDAAELGELLGEGFLERVEKRGLVTKAWVEQEEVLKHEAVGLFVSHCGWNSVTEAAASG 371 Query: 418 VPVLAWPLHGDQRVNAAVVEEVGLGIWVREWGWGGER-LIGRDEIAKQLRMVMGDE 254 +PVLA P GDQRVN++VV GLG+WV W W GE +IG EI+++++ MGDE Sbjct: 372 IPVLALPRFGDQRVNSSVVARAGLGVWVDSWSWEGEEGVIGAGEISEKVKAAMGDE 427 >ref|XP_002532899.1| UDP-glucosyltransferase, putative [Ricinus communis] gi|223527333|gb|EEF29479.1| UDP-glucosyltransferase, putative [Ricinus communis] Length = 462 Score = 345 bits (886), Expect = 3e-92 Identities = 180/459 (39%), Positives = 269/459 (58%), Gaps = 16/459 (3%) Frame = -1 Query: 1474 SKSENEAPHVALFPCAGMGHLIPFLRLAGQLHSRGCAVTVITVEPTVSAAESNHLSSFFS 1295 S S + H+ L P AGMGHL PFLRLA L VT+IT PTVS +ES L FF+ Sbjct: 3 SCSHQKLAHIVLLPSAGMGHLTPFLRLAALLAIHNVKVTLITPNPTVSLSESQALIHFFT 62 Query: 1294 TFPRIKRLEFRLLPHRKSELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLPV 1115 +FP I + + LL + +++DPF+ MERI S H SA + D+ + Sbjct: 63 SFPHINQKQLHLLSIERFPTSSEDPFYDHMERICQSSHLLLPLLSSLSPPLSAVITDMTL 122 Query: 1114 
---LSRLASTLSLPIYTLITTSARFFSLMTSLSHL--------QQNADSVEIPNFGPIPL 968 + + L+LP Y L T+SA+ +L S + + D ++IP+ PIP Sbjct: 123 AFAVIPITQALNLPNYVLFTSSAKMLALYLSFHAMIGSEPTIDLGDTDGIKIPSLEPIPR 182 Query: 967 SSVPPPML-NPNHFFAASITTNTSSLSKSTGVIINTFTSLESQAIEALRRNGVDQILP-- 797 S +PPP+L + N+ N +++S+G+++NTF S+E + +E L V + LP Sbjct: 183 SWIPPPLLQDTNNLLKTYFIKNGKKMAESSGILVNTFDSIEHEVLEQLNAGKVIENLPPV 242 Query: 796 --IGPLPPFSETSALDLPWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKFLW 623 IG L + L WLD Q SVL++SFGSRTA+S+ Q+TEL L SG +FLW Sbjct: 243 IAIGSLASCESETKQALAWLDSQQNGSVLFVSFGSRTAISRAQLTELGEGLVRSGIRFLW 302 Query: 622 VLKGGKVDREDKEEVGEMVGAEFVERTKGKGKVIKGWVEQEQILSHAAIGGFVSHCGWNS 443 ++K KVD+ED+E++ +++G +ER K +G V+K W+ QE +L H+AIGGF+SHCGWNS Sbjct: 303 IVKDKKVDKEDEEDLSQVIGNRLIERLKERGLVVKSWLNQEDVLRHSAIGGFLSHCGWNS 362 Query: 442 VTEAAALGVPVLAWPLHGDQRVNAAVVEEVGLGIWVREWGWGGERLIGRDEIAKQLRMVM 263 VTEA G+P+LAWP HGDQ++NA +VE + LG W + WGWGGE ++ ++IA+ ++ +M Sbjct: 363 VTEAVQHGIPILAWPQHGDQKINADIVERIVLGTWEKSWGWGGEVVVKGNDIAEMIKEMM 422 Query: 262 GDEMXXXXXXXXXXXXXXXXEINGSSETLIRGLMESFKR 146 G+++ G+S + GL+E++K+ Sbjct: 423 GNDLLRAHAVQIREEARRAIADTGNSTKGLMGLIETWKK 461 >gb|ACU64887.1| UDP-T1 [Oryza minuta] Length = 461 Score = 343 bits (880), Expect = 1e-91 Identities = 190/416 (45%), Positives = 253/416 (60%), Gaps = 16/416 (3%) Frame = -1 Query: 1453 PHVALFPCAGMGHLIPFLRLAGQLHS-RGCAVTVITVEPTVSAAESNHLSSFFSTFPRIK 1277 PHV L P AGMGHL+PF RLA L S GC V+++TV PTVS AES HL + F FP ++ Sbjct: 12 PHVVLIPSAGMGHLVPFGRLAVALSSGHGCDVSLVTVLPTVSTAESKHLEALFDAFPAVR 71 Query: 1276 RLEFRLLPHRKSELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLP-VLSRLA 1100 RL+F L P SE DPFF++ E + S A + L V+ +A Sbjct: 72 RLDFELAPFDASEFPGADPFFLRFEAMRRSAPLLGPLLTDAGASALATDIALTSVVIPVA 131 Query: 1099 STLSLPIYTLITTSARFFSLMTSL-SHLQQNAD-----SVEIPNFGPIPLSSVPPPMLNP 938 LP + L T SA SL ++L NA V+IP IP +S+P + +P Sbjct: 132 KEQGLPCHILFTASAAMLSLCAYFPTYLDANAGRGGVGDVDIPGVYRIPKASIPQALHDP 191 Query: 937 NHFFAASITTNTSSLSKSTGVIINTFTSLESQAIEALRR----NGVDQILPIGPLPPFSE 770 NH F N SL+ + G+++NTF +LE 
+A+ AL++ +G + +GPL S Sbjct: 192 NHLFTRQFVANGRSLTSAAGILVNTFDALEPEAVTALQQGKVASGFPPVFAVGPLLLASN 251 Query: 769 TS---ALDLPWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKFLWVLKGGKVD 599 + A + WLD Q SV+Y+SFGSR A+S EQ+ ELA+ LE SG +FLWV+K VD Sbjct: 252 QAKDPANYMEWLDAQPARSVVYVSFGSRKAVSGEQLRELAAGLEASGHRFLWVVKSTVVD 311 Query: 598 REDKEEVGEMVGAEFVERTKGKGKVIKGWVEQEQILSHAAIGGFVSHCGWNSVTEAAALG 419 R+D E+GE++G F+ER + +G V K WVEQE++L H A+G FVSHCGWNSVTEAA G Sbjct: 312 RDDAAELGELLGEGFLERVEKRGLVTKAWVEQEEVLKHEAVGLFVSHCGWNSVTEAATSG 371 Query: 418 VPVLAWPLHGDQRVNAAVVEEVGLGIWVREWGWGGER-LIGRDEIAKQLRMVMGDE 254 VPVLA P GDQRVN+ VV GLG+WV W W GE +IG +EI+++++ VMGDE Sbjct: 372 VPVLALPRFGDQRVNSGVVARAGLGVWVDSWSWEGEEGVIGAEEISEKVKAVMGDE 427 >ref|XP_002306046.2| hypothetical protein POPTR_0004s12460g [Populus trichocarpa] gi|550340898|gb|EEE86557.2| hypothetical protein POPTR_0004s12460g [Populus trichocarpa] Length = 461 Score = 340 bits (872), Expect = 1e-90 Identities = 185/425 (43%), Positives = 256/425 (60%), Gaps = 18/425 (4%) Frame = -1 Query: 1474 SKSENEAPHVALFPCAGMGHLIPFLRLAGQLHSRGCAVTVITVEPTVSAAESNHLSSFFS 1295 S S+ + HVAL P AGMGHL PFLRLA L + VT I PTVS +ES LS F+ Sbjct: 3 SSSDRKLAHVALLPSAGMGHLTPFLRLAASLTLQNVQVTFIIPHPTVSLSESQALSQLFA 62 Query: 1294 TFPRIKRLEFRLLPHRKSELTNDDPFFIQMERIGNSIHXXXXXXXXXXXXXSAAVVDLPV 1115 +FP+IK +F LLP + +DDPFF + I NS S + D+ + Sbjct: 63 SFPQIKHQQFHLLP---LDNPSDDPFFEHFQLIKNSSRLLSPLLSALNPPLSVFITDMSL 119 Query: 1114 LSR---LASTLSLPIYTLITTSAR---FFSLMTSLSHLQ-----QNADSVEIPNFGPIPL 968 S + +SLP Y L T+SA+ FF +L+ + D ++I +P Sbjct: 120 ASTVTPITEAISLPNYVLFTSSAKMLTFFLCYPTLADSKAMDELDEMDVIKIRGLELMPK 179 Query: 967 SSVPPPMLNP-NHFFAASITTNTSSLSKSTGVIINTFTSLESQAIEALRR-----NGVDQ 806 S +PPP+L N+ S ++ +++S+G+++NTF S E +++ L + Sbjct: 180 SWIPPPLLKKGNNILKTSFIEDSRKVAESSGILVNTFESFEQESLRKLNDCQLLLERLPS 239 Query: 805 ILPIGPLPPFS-ETSALDLPWLDEQAPSSVLYISFGSRTALSKEQITELASALEISGCKF 629 ++ IGPLPP E S L L WLD+Q SV+Y+SFGSRTALS++Q+ EL L SG +F Sbjct: 240 VVAIGPLPPCDFEKSQLQLTWLDDQPAGSVVYVSFGSRTALSRDQVRELGEGLVRSGSRF 
299

Query: 628 LWVLKGGKVDREDKEEVGEMVGAEFVERTKGKGKVIKGWVEQEQILSHAAIGGFVSHCGW 449
           +WV+K  KVDRED E +  ++G E +ER K KG V++ WV QE +LSH A+GGF SHCGW
Sbjct: 300 IWVVKDKKVDREDNEGLEGVIGDELMERMKEKGLVVRNWVNQEDVLSHPAVGGFFSHCGW 359

Query: 448 NSVTEAAALGVPVLAWPLHGDQRVNAAVVEEVGLGIWVREWGWGGERLIGRDEIAKQLRM 269
           NSV EAA  GV +LAWP HGDQ+VNA +VE +GLG WV+ WGWG E ++ R EIA+++
Sbjct: 360 NSVMEAAWHGVKILAWPQHGDQKVNADIVERIGLGTWVKSWGWGEEMIVNRAEIAEKIGE 419

Query: 268 VMGDE 254
           +MG+E
Sbjct: 420 IMGNE 424