BLASTX nr result
ID: Rheum21_contig00014716
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00014716
         (1194 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EOY03573.1| UDP-glucosyl transferase 88A1, putative [Theobrom...  275   3e-71
gb|EXB38050.1| UDP-glycosyltransferase [Morus notabilis]             271   5e-70
gb|EMJ17816.1| hypothetical protein PRUPE_ppa027121mg [Prunus pe...  270   7e-70
gb|EMJ17382.1| hypothetical protein PRUPE_ppa015845mg, partial [...  270   9e-70
gb|EXB38047.1| UDP-glycosyltransferase [Morus notabilis]             270   1e-69
ref|XP_006338402.1| PREDICTED: anthocyanidin 5,3-O-glucosyltrans...  258   3e-66
ref|XP_004232188.1| PREDICTED: anthocyanidin 5,3-O-glucosyltrans...  258   3e-66
gb|EPS71762.1| hypothetical protein M569_02997, partial [Genlise...  253   1e-64
ref|XP_006438793.1| hypothetical protein CICLE_v10031419mg [Citr...  243   1e-61
gb|EOY01768.1| UDP-glucosyl transferase 88A1, putative [Theobrom...  234   4e-59
ref|XP_004505380.1| PREDICTED: anthocyanidin 5,3-O-glucosyltrans...  234   5e-59
ref|XP_003607777.1| Anthocyanidin 5 3-O-glucosyltransferase [Med...  231   3e-58
ref|XP_004135442.1| PREDICTED: anthocyanidin 5,3-O-glucosyltrans...  230   1e-57
gb|EXB38054.1| UDP-glycosyltransferase [Morus notabilis]             228   3e-57
gb|EOY01771.1| UDP-glucosyl transferase 88A1, putative isoform 1...  228   5e-57
ref|XP_002306046.2| hypothetical protein POPTR_0004s12460g [Popu...  224   4e-56
gb|EOY01770.1| UDP-glucosyl transferase 88A1, putative [Theobrom...  223   1e-55
ref|XP_002324085.2| hypothetical protein POPTR_0017s12490g [Popu...  214   4e-53
ref|XP_002444986.1| hypothetical protein SORBIDRAFT_07g002370 [S...
214 6e-53 gb|EMJ16549.1| hypothetical protein PRUPE_ppa005427mg [Prunus pe... 213 9e-53 >gb|EOY03573.1| UDP-glucosyl transferase 88A1, putative [Theobroma cacao] Length = 467 Score = 275 bits (702), Expect = 3e-71 Identities = 165/413 (39%), Positives = 233/413 (56%), Gaps = 16/413 (3%) Frame = +2 Query: 2 HLLPCLNLASALSAH-CTVTLVIAVPIISAAETGEISSFTASHPHIRRLDFRMDSRQ--N 172 HL P L LAS L +H C VTL+ +SAAE+ IS F +++P I+ ++F++ Q N Sbjct: 24 HLTPFLRLASMLLSHNCMVTLLTTKSTVSAAESTYISFFLSTNPEIKHIEFQVPPMQPSN 83 Query: 173 TQAEDPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSEFSLTRGLN-TSLPSSLPNYTIV 349 T A+DP+ +++ A SR+AHL+ + +LSPPLSA+FS+ + G++ ++ +PNY + Sbjct: 84 TTADDPFFIQFKATSRSAHLIYPLISSLSPPLSAIFSDLVVASGVSKVAVYLGIPNYAVS 143 Query: 350 PISATFFSIMARLPRLHDSLDXXXXXXXXXXEIPGIGSVPRENIPPPFFSPGHTFAELVT 529 SA F S++A LP L EIPG+ +P +IPPPFF+P H F + Sbjct: 144 TTSAKFLSLLAYLPILTSDA-AKLSNRSTDIEIPGLTPLPISSIPPPFFNPDHLFTATLV 202 Query: 530 SNARHLHLSKGIVLNTXXXXXXXXXXXX---RVAPELPELFPFAVPLHAAKKSSTPTTAC 700 SNA L KGI++NT R LP + P PL + Sbjct: 203 SNAIALPDCKGILMNTFDCFEPETLSAINNKRALRNLPPILPIG-PLETYELKKDLGQYL 261 Query: 701 TWLDGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMSGWRFLWXXXXXXXXXXXXXXXXX 880 WL+ QP +SVV+V+FGSRTAM +DQ++EL GL+ S +RFLW Sbjct: 262 PWLNSQPAESVVFVSFGSRTAMTKDQIKELRHGLEKSEYRFLW----------ILKTKTV 311 Query: 881 XXXXXRELYEALVVS------GKGMIERKWVNQDEILGHRAVGGFLSHAGWSSVLEAARH 1042 +L + L S KGM+ ++WVNQ +IL H AVGGF++H GW+SV+EAA+ Sbjct: 312 DKDDTEDLEDLLSCSFLERTKNKGMVLKEWVNQQDILAHPAVGGFVNHCGWNSVMEAAQR 371 Query: 1043 GVRVLAWPLGGDQRMNAWSVNKAGLGIWD---GWGPDKLVPAAEIGRKVEVLM 1192 G+ +LAWP GDQR NA + KAGLGIWD GWG +LV EI +++ LM Sbjct: 372 GIPMLAWPQHGDQRANAEVLEKAGLGIWDRTWGWGGQRLVKTDEIQKRISELM 424 >gb|EXB38050.1| UDP-glycosyltransferase [Morus notabilis] Length = 463 Score = 271 bits (692), Expect = 5e-70 Identities = 158/407 (38%), Positives = 228/407 (56%), Gaps = 10/407 (2%) Frame = +2 Query: 2 HLLPCLNLASAL-SAHCTVTLVIAVPIISAAETGEISSFTASHPHIRRLDFRMDSRQNTQ 178 HLLP L +AS L S +CTVTL+ A PI+SAAE+ IS+F + HP ++ +DF+ N Sbjct: 25 
HLLPFLRIASTLLSRNCTVTLITAKPIVSAAESSHISAFLSQHPQVKHVDFQTIQSHNPT 84 Query: 179 AEDPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSEFSLTRGLNTSLPS-SLPNYTIVPI 355 A+DP+ ++Y++++R+AHLL L + S P SA+F++F + + +P+Y I Sbjct: 85 ADDPFYLQYESITRSAHLLYPLLSSSSLPFSAIFADFIVASSITPMAAELGIPSYIICTT 144 Query: 356 SATFFSIMARLPRLHDSLDXXXXXXXXXXEIPGIGSVPRENIPPPFFSPGHTFAELVTSN 535 S FF ++A LP L IPG+ P +IPPPF +P H F + N Sbjct: 145 SIKFFCLIAYLPVLVTD-PAKLGNSSTELIIPGLTPFPVSSIPPPFKNPNHLFTRCLALN 203 Query: 536 ARHLHLSKGIVLNTXXXXXXXXXXXXR----VAPELPELFPFAVPLHAAKKSSTPTTACT 703 A+ L ++GI++N+ + + LP P PL + + + Sbjct: 204 AKALSKAEGIIVNSVDFFEKETLEEIKNGRVLENSLPSFLPIG-PLASFEIKKDKGEYMS 262 Query: 704 WLDGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMSGWRFLWXXXXXXXXXXXXXXXXXX 883 WLD QP++SVVYV+FGSRTA+ RDQ+RE+ GL+ SG RFLW Sbjct: 263 WLDNQPEESVVYVSFGSRTAISRDQIREVSKGLERSGHRFLWVVKSTTIDKEDKDELKDL 322 Query: 884 XXXXRELYEALVVSGKGMIERKWVNQDEILGHRAVGGFLSHAGWSSVLEAARHGVRVLAW 1063 R E + KGM ++WV+Q+EIL H ++G F+SH GW+SV+EAAR GV ++AW Sbjct: 323 LG--RSFLERTM--NKGMAVKEWVSQEEILAHTSIGAFVSHCGWNSVIEAARQGVPMVAW 378 Query: 1064 PLGGDQRMNAWSVNKAGLGIWD---GWGPD-KLVPAAEIGRKVEVLM 1192 P GDQ++NA V KAGLGIW+ GW +LV EIG K+ +M Sbjct: 379 PQHGDQKVNAEIVEKAGLGIWERNWGWADQAELVCGEEIGEKIREVM 425 >gb|EMJ17816.1| hypothetical protein PRUPE_ppa027121mg [Prunus persica] Length = 465 Score = 270 bits (691), Expect = 7e-70 Identities = 161/411 (39%), Positives = 224/411 (54%), Gaps = 14/411 (3%) Frame = +2 Query: 2 HLLPCLNLASALSAH-CTVTLVIAVPIISAAETGEISSFTASHPHIRRLDFRMDSRQ--- 169 HL P L LAS LS+ CTVTL+ A P +SAAE+ +S F + HP ++ ++F++ + Sbjct: 21 HLTPFLRLASMLSSRSCTVTLITASPSVSAAESSHVSFFLSQHPLVKHIEFKVIPSKPYS 80 Query: 170 NTQAEDPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSEFSLTRGLN-TSLPSSLPNYTI 346 N +DP+ ++++A +R+ HLL L + SPPLSA+FS+F++ + +PNY I Sbjct: 81 NPTTDDPFFLQFEATNRSVHLLYPSLASASPPLSAIFSDFAVASSFAPVAADLGIPNYII 140 Query: 347 VPISATFFSIMARLPRLHDSLDXXXXXXXXXXEIPGIGSVPRENIPPPFFSPGHTFAELV 526 S FF +MA LP L IPGI P +IPP F +P H F L+ Sbjct: 141 STTSCKFFCLMAYLPVLLSD-PSSFSSGLSEVNIPGITPFPLPSIPPQFKNPNHLFTSLI 
199 Query: 527 TSNARHLHLSKGIVLNTXXXXXXXXXXXX---RVAPELPELFPFAVPLHAAKKSSTPTTA 697 ++A+ L +KGI++NT RV LP + P K + Sbjct: 200 ATSAQALSKAKGILMNTFDDFEPETLAAVNSSRVLDNLPPILPIGPLETFEPKKEQDQSY 259 Query: 698 CTWLDGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMSGWRFLWXXXXXXXXXXXXXXXX 877 WLD QP +SVVYV+FGSRTA+ Q+REL GL+ SG+RFLW Sbjct: 260 LPWLDSQPAESVVYVSFGSRTALSSAQIRELSKGLERSGYRFLWVLKTSKVDKDDKEEL- 318 Query: 878 XXXXXXRELYEALVVS---GKGMIERKWVNQDEILGHRAVGGFLSHAGWSSVLEAARHGV 1048 ++L E + KG + + WV+Q +IL H A GGF+SH GW+SV+EAAR G+ Sbjct: 319 ------KDLLEESFLDRTKNKGRVVKGWVSQQDILEHPATGGFISHCGWNSVMEAARKGI 372 Query: 1049 RVLAWPLGGDQRMNAWSVNKAGLGIWD---GWGPDKLVPAAEIGRKVEVLM 1192 +LAWP GDQ +NA V KAGLGIW+ WG + LV EIG+K+ LM Sbjct: 373 PMLAWPQHGDQSVNAEVVEKAGLGIWERKWDWGLEGLVSGEEIGKKIVELM 423 >gb|EMJ17382.1| hypothetical protein PRUPE_ppa015845mg, partial [Prunus persica] Length = 433 Score = 270 bits (690), Expect = 9e-70 Identities = 159/411 (38%), Positives = 224/411 (54%), Gaps = 14/411 (3%) Frame = +2 Query: 2 HLLPCLNLASALSAH-CTVTLVIAVPIISAAETGEISSFTASHPHIRRLDFRM---DSRQ 169 HL P L LAS LS+ CTVTL+ A P +SAAE+ +S F + HP ++ ++F++ Sbjct: 21 HLTPFLRLASMLSSRSCTVTLITASPSVSAAESSHVSFFLSQHPLVKHIEFQVIPSKPSS 80 Query: 170 NTQAEDPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSEFSLTRGLN-TSLPSSLPNYTI 346 N +DP+ ++++A +R+ HLL L + SPP+SA+FS+F++ + + +PNY I Sbjct: 81 NPTTDDPFFLQFEATNRSVHLLYPSLASASPPISAIFSDFAVASSIAPVAADLGIPNYII 140 Query: 347 VPISATFFSIMARLPRLHDSLDXXXXXXXXXXEIPGIGSVPRENIPPPFFSPGHTFAELV 526 S FF +MA LP L IPGI P +IPPPF +P H L+ Sbjct: 141 STTSCKFFCLMAYLPVLLSD-PSSFSSGLSEVNIPGITPFPLPSIPPPFKNPSHLLTSLI 199 Query: 527 TSNARHLHLSKGIVLNTXXXXXXXXXXXX---RVAPELPELFPFAVPLHAAKKSSTPTTA 697 ++A+ L +KGI++NT RV LP + P K + Sbjct: 200 ATDAQALSKAKGILMNTFDDFERETLAPIKSGRVLDNLPPILPIGPLETYEPKKEQDQSY 259 Query: 698 CTWLDGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMSGWRFLWXXXXXXXXXXXXXXXX 877 WLD QP +SVVYV+FGSRTA+ Q+REL GL+ SG+RFLW Sbjct: 260 LPWLDSQPAESVVYVSFGSRTALSSAQIRELSKGLERSGYRFLWVPKTSKVDKDDKEEL- 318 Query: 878 
XXXXXXRELYEALVVS---GKGMIERKWVNQDEILGHRAVGGFLSHAGWSSVLEAARHGV 1048 ++L E + KG + + WV+Q +IL H A+GGF+SH GW+SV+EA R G+ Sbjct: 319 ------KDLLEESFLDRTKNKGRVVKGWVSQQDILEHPAIGGFISHCGWNSVMEAVRKGI 372 Query: 1049 RVLAWPLGGDQRMNAWSVNKAGLGIWD---GWGPDKLVPAAEIGRKVEVLM 1192 +LAWP DQ +NA V KAGLGIW+ GWG + LV EIG+K+ LM Sbjct: 373 PMLAWPQHMDQSVNAEVVEKAGLGIWERKWGWGLEGLVSGEEIGKKIVELM 423 >gb|EXB38047.1| UDP-glycosyltransferase [Morus notabilis] Length = 463 Score = 270 bits (689), Expect = 1e-69 Identities = 160/413 (38%), Positives = 229/413 (55%), Gaps = 16/413 (3%) Frame = +2 Query: 2 HLLPCLNLASAL-SAHCTVTLVIAVPIISAAETGEISSFTASHPHIRRLDFRMDSRQNTQ 178 HLLP L +AS L S +CTVTL+ A PI+SAAE+ IS+F + HP ++ +DF+ N Sbjct: 25 HLLPFLRIASTLLSRNCTVTLITAKPIVSAAESSHISAFLSQHPQVKHVDFQTIQSHNPT 84 Query: 179 AEDPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSEFSLTRGLN-TSLPSSLPNYTIVPI 355 A+DP+ ++Y++++R+AHLL L + SPP SA+F++F + + + +P+Y I Sbjct: 85 ADDPFYLQYESITRSAHLLYPLLSSSSPPFSAIFADFFVASSITPMAAELGIPSYIICTT 144 Query: 356 SATFFSIMARLPRLHDSLDXXXXXXXXXXEIPGIGSVPRENIPPPFFSPGHTFAELVTSN 535 S FF ++A LP L IPG+ P +IP PF +P H F + N Sbjct: 145 SIKFFCLIAYLPVLVTD-PAKLGNSSTELIIPGLTPFPVSSIPSPFKNPNHLFTRCLVLN 203 Query: 536 ARHLHLSKGIVLNTXXXXXXXXXXXXR----VAPELPELFPFAVPLHAAKKSSTPTTACT 703 A+ +KGI++N+ + + LP P PL + + + Sbjct: 204 AKEFSKAKGIIVNSVDFFEKETLEEIKNGRVLENSLPSFLPIG-PLASFEIKKDKGEYMS 262 Query: 704 WLDGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMSGWRFLWXXXXXXXXXXXXXXXXXX 883 WLD QP++SVVYV+FGSRTA+ RDQ+RE+ GL+ SG RFLW Sbjct: 263 WLDNQPEESVVYVSFGSRTAISRDQIREVSKGLERSGHRFLWVVKSTKIDKEDKD----- 317 Query: 884 XXXXRELYEALVVS------GKGMIERKWVNQDEILGHRAVGGFLSHAGWSSVLEAARHG 1045 EL + L S KGM + WV+Q+EIL H ++G F+SH GW+SV+EAAR G Sbjct: 318 -----ELKDLLGGSFLERTMNKGMAVKGWVSQEEILAHPSIGAFVSHCGWNSVIEAARQG 372 Query: 1046 VRVLAWPLGGDQRMNAWSVNKAGLGIWD---GWGPD-KLVPAAEIGRKVEVLM 1192 V ++AWP GDQ++NA V KAGLGIW+ GW +LV EIG K+ +M Sbjct: 373 VPMVAWPQHGDQKVNAEIVEKAGLGIWERNWGWADQAELVCGEEIGEKIREVM 425 >ref|XP_006338402.1| PREDICTED: anthocyanidin 5,3-O-glucosyltransferase-like 
[Solanum tuberosum] Length = 453 Score = 258 bits (660), Expect = 3e-66 Identities = 158/407 (38%), Positives = 225/407 (55%), Gaps = 10/407 (2%) Frame = +2 Query: 2 HLLPCLNLASAL-SAHCTVTLVIAVPIISAAETGEISSFTASHPHIRRLDFRMDSRQNTQ 178 HL+P L LA+ L S +C VTL+ A P +SAAE+ ++SF ++HPHI+RLDF + + Sbjct: 15 HLMPFLRLAAMLASRNCKVTLLPAQPTVSAAESNHLNSFFSAHPHIQRLDFHVVPLHTSN 74 Query: 179 AE-DPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSEFSLTRGLN--TSLPS-SLPNYTI 346 DP+ ++++A+ R+ HLL L +LSPP+SA+F + + ++ PS S+ Y + Sbjct: 75 PHGDPFFLQFEAIIRSVHLLPPLLSSLSPPISALFLDIAAATCVDQLADHPSLSISYYIL 134 Query: 347 VPISATFFSIMARLPRLHDSLDXXXXXXXXXXEIPGIGSVPRENIPPPFFSPGHTFAELV 526 SA FFS+++ LP L ++ G+ S NIPPP F+P + F + Sbjct: 135 STTSARFFSLLSHLPHL------TLESSCENLKLHGLPSFSISNIPPPLFNPQNLFTTQL 188 Query: 527 TSNARHLHLSKGIVLNTXXXXXXXXXXXXRVAPELPELFPFAVPLHAAKKSSTP--TTAC 700 SNAR + KG+V NT L P +P+ K P + Sbjct: 189 ISNARAISRVKGVVSNTFHWFEAETIEALNSGKTSITL-PQFLPIGPFKPYEDPGKCASL 247 Query: 701 TWLDGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMSGWRFLWXXXXXXXXXXXXXXXXX 880 +WLDGQP SVVYV+FGSRT M +DQ++E+G GL S +FLW Sbjct: 248 SWLDGQPAKSVVYVSFGSRTTMSKDQIKEIGEGLLKSKQKFLWVLKSVIVDKVEETELQE 307 Query: 881 XXXXXRELYEALVVSGKGMIERKWVNQDEILGHRAVGGFLSHAGWSSVLEAARHGVRVLA 1060 R L E + +G++ ++WV Q+EIL H A+GGF SH GW+S +EAA+ GV +LA Sbjct: 308 LVG--RSLLEKIEEKKQGIVVKEWVKQEEILTHHAIGGFFSHCGWNSAMEAAQRGVPMLA 365 Query: 1061 WPLGGDQRMNAWSVNKAGLGIWD---GWGPDKLVPAAEIGRKVEVLM 1192 W L GDQR NA V KAGLG+W GW ++LV + EI K+E LM Sbjct: 366 WTLNGDQRFNAEVVEKAGLGLWPKHWGWLGERLVKSEEIEEKIEELM 412 >ref|XP_004232188.1| PREDICTED: anthocyanidin 5,3-O-glucosyltransferase-like [Solanum lycopersicum] Length = 461 Score = 258 bits (660), Expect = 3e-66 Identities = 157/408 (38%), Positives = 227/408 (55%), Gaps = 11/408 (2%) Frame = +2 Query: 2 HLLPCLNLASAL-SAHCTVTLVIAVPIISAAETGEISSFTASHPHIRRLDFRMDSRQNTQ 178 HL+P L LA+ L S +C VTL+ A P +SAAE+ ++SF ++HPHI+RLDF++ Q++ Sbjct: 15 HLMPFLRLAAMLASRNCKVTLLTAQPTVSAAESKHLNSFFSAHPHIQRLDFQVVPLQSSN 74 Query: 179 
AE-DPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSEFSLTRGLN--TSLPS-SLPNYTI 346 DP+ ++++A+ R+ HLL L +LSPP+SA+F + + ++ PS S+ Y + Sbjct: 75 PHGDPFFLQFEAIIRSVHLLPPLLSSLSPPISALFLDIAAATCVDQLADHPSLSISYYIL 134 Query: 347 VPISATFFSIMARLPRLHDSLDXXXXXXXXXXEIPGIGSVPRENIPPPFFSPGHTFAELV 526 SA FFS++ LP L ++ G+ S NIPPP F+P + F + Sbjct: 135 STTSARFFSLITHLPHL------TLESSCVNLKLHGLPSFSISNIPPPIFNPQNLFTTQM 188 Query: 527 TSNARHLHLSKGIVLNTXXXXXXXXXXXX---RVAPELPELFPFAVPLHAAKKSSTPTTA 697 SNAR + KG+V NT + + LP+ P H ++ Sbjct: 189 ISNARAISRVKGVVSNTFHWFEAETIEPLNSGKTSITLPQFLPIGPFKHYEDPGKC--SS 246 Query: 698 CTWLDGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMSGWRFLWXXXXXXXXXXXXXXXX 877 +WLD QP SVVYV+FGSRTAM +DQ++E+G GL S +FLW Sbjct: 247 LSWLDEQPAKSVVYVSFGSRTAMSKDQIKEIGEGLLKSKQKFLWVLKSVKVDKAEETELK 306 Query: 878 XXXXXXRELYEALVVSGKGMIERKWVNQDEILGHRAVGGFLSHAGWSSVLEAARHGVRVL 1057 L E + +G++ ++WV Q+EIL H A+GGF SH GW+S +EAA+ GV +L Sbjct: 307 ELVG--HSLLEKIEEKKQGIVVKEWVKQEEILTHHAIGGFFSHCGWNSTMEAAQRGVPML 364 Query: 1058 AWPLGGDQRMNAWSVNKAGLGIWD---GWGPDKLVPAAEIGRKVEVLM 1192 AW L GDQR NA V KAGLG+W GW ++LV + EI K+E LM Sbjct: 365 AWTLNGDQRFNAEVVEKAGLGLWPKHWGWLGERLVKSEEIEEKIEELM 412 >gb|EPS71762.1| hypothetical protein M569_02997, partial [Genlisea aurea] Length = 431 Score = 253 bits (646), Expect = 1e-64 Identities = 157/409 (38%), Positives = 223/409 (54%), Gaps = 12/409 (2%) Frame = +2 Query: 2 HLLPCLNLASALSAH-CTVTLVIAVPIISAAETGEISSFTASHPHIRRLDFRMDSRQNTQ 178 HL+P L L + L+A TVT++ A P ++ AE+ +S F + P I RL+F + R+ Sbjct: 12 HLMPFLRLGAMLAARGATVTIITAHPTVTTAESDHLSRFFSQFPAINRLEFHLIPREEYN 71 Query: 179 AE----DPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSEFSLTRGLNT-SLPSSLPNYT 343 +E DP+ ++++++ ++AHLL L +LSPPLSA+ ++F + L+ S S+P YT Sbjct: 72 SELKNDDPFFIQFESIGKSAHLLVPQLSSLSPPLSALVADFPVNAALSEISDALSIPLYT 131 Query: 344 IVPISATFFSIMARLPRLHDSLDXXXXXXXXXXEIPGIGSVPRENIPPPFFSPGHTFAEL 523 ++ SA FF+IM LPR+ + EIP +G +P +IPP H F+ Sbjct: 132 LITTSARFFTIMFHLPRILED------NKKEAIEIPKLGKIPSSSIPPIMLDQAHFFSSF 185 Query: 524 
VTSNARHLHLSKGIVLNTXXXXXXXXXXXXRVAPELPELFPFAVPLHAAKKSSTPTTACT 703 +TSNA LH SKGI++NT P + P PL + P Sbjct: 186 ITSNALTLHKSKGILINTFHSFEPEAIQCLTNPLPCP-ILPIG-PLDVYDQHQ-PFNLLP 242 Query: 704 WLDGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMSGWRFLWXXXXXXXXXXXXXXXXXX 883 WLD Q SVVYV+FG+RT++ + Q++ELG GL+ S +FLW Sbjct: 243 WLDNQSPGSVVYVSFGNRTSLSKQQLQELGHGLEKSRCKFLWVVKSKKVDTEDTEGID-- 300 Query: 884 XXXXRELYEALVV---SGKGMIERKWVNQDEILGHRAVGGFLSHAGWSSVLEAARHGVRV 1054 E+ V +GMI + WV+Q++ILGH +VGGF+SH GW+SV+EAAR GV + Sbjct: 301 -----EILGGPFVERNKERGMILKGWVDQEKILGHPSVGGFMSHCGWNSVMEAARLGVPI 355 Query: 1055 LAWPLGGDQRMNAWSVNKAGLGIWD---GWGPDKLVPAAEIGRKVEVLM 1192 LAWP GDQR+NA V K GLGIW GW KLV EI + LM Sbjct: 356 LAWPQHGDQRINADVVEKGGLGIWPEEWGWLGQKLVKRDEISNMISKLM 404 >ref|XP_006438793.1| hypothetical protein CICLE_v10031419mg [Citrus clementina] gi|568859072|ref|XP_006483066.1| PREDICTED: anthocyanidin 5,3-O-glucosyltransferase-like [Citrus sinensis] gi|557540989|gb|ESR52033.1| hypothetical protein CICLE_v10031419mg [Citrus clementina] Length = 472 Score = 243 bits (620), Expect = 1e-61 Identities = 151/413 (36%), Positives = 217/413 (52%), Gaps = 16/413 (3%) Frame = +2 Query: 2 HLLPCLNLASAL-SAHCTVTLVIAVPIISAAETGEISSFTASHPHIRRLDFRM--DSRQN 172 HL P L LA++L HC VTL+ P +S AET +S F +++P + F + + Sbjct: 23 HLTPFLRLAASLVQHHCRVTLITTYPTVSLAETQHVSHFLSAYPQVTEKRFHLLPFDPNS 82 Query: 173 TQAEDPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSEFSLTRG-LNTSLPSSLPNYTIV 349 A DP+L+R++A+ R+AHLL P LSPPLSA+ ++ +L L ++ LPNY + Sbjct: 83 ANATDPFLLRWEAIRRSAHLLA---PLLSPPLSALITDVTLISAVLPVTINLHLPNYVLF 139 Query: 350 PISATFFSIMARLPRL---HDSLDXXXXXXXXXXEIPGIGSVPRENIPPPFFSPGHTFAE 520 SA FS+ A P + + EIPG+ +P ++PP FA Sbjct: 140 TASAKMFSLTASFPAIVASKSTSSGSVEFDDDFIEIPGLPPIPLSSVPPAVMDSKSLFAT 199 Query: 521 LVTSNARHLHLSKGIVLNTXXXXXXXXXXXX---RVAPELPELFPFAVPLHAA-KKSSTP 688 N S G+++N+ RV LP ++ L +K P Sbjct: 200 SFLENGNSFVKSNGVLINSFDALEADTLVALNGRRVVAGLPPVYAVGPLLPCEFEKRDDP 259 Query: 689 TTACT--WLDGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMSGWRFLWXXXXXXXXXXX 862 +T+ WLD QP+ 
SVVYV+FGSR A+ +Q +ELG GL SG RFLW Sbjct: 260 STSLILKWLDDQPEGSVVYVSFGSRLALSMEQTKELGDGLLSSGCRFLWVVKGKIVDKED 319 Query: 863 XXXXXXXXXXXRELYEALVVSGKGMIERKWVNQDEILGHRAVGGFLSHAGWSSVLEAARH 1042 EL E + +G++ + WV+QD++L HRAVGGF+SH GW+S++EAARH Sbjct: 320 EESLKNVLG--HELTEK--IKDQGLVVKNWVDQDKVLSHRAVGGFVSHGGWNSLVEAARH 375 Query: 1043 GVRVLAWPLGGDQRMNAWSVNKAGLGIWD---GWGPDKLVPAAEIGRKVEVLM 1192 GV +L WP GDQ++NA +V +AGLG+W GWG + EIG K++ LM Sbjct: 376 GVPLLVWPHFGDQKINAEAVERAGLGMWVRSWGWGTELRAKGDEIGLKIKDLM 428 >gb|EOY01768.1| UDP-glucosyl transferase 88A1, putative [Theobroma cacao] Length = 465 Score = 234 bits (598), Expect = 4e-59 Identities = 143/409 (34%), Positives = 217/409 (53%), Gaps = 12/409 (2%) Frame = +2 Query: 2 HLLPCLNLASA-LSAHCTVTLVIAVPIISAAETGEISSFTASHPHIRRLDFRMDSRQNTQ 178 HL+P L LA++ L HC +TL+ P++S AE+ IS F ++ P + F + Sbjct: 23 HLIPFLRLAASFLRCHCQLTLITTDPVVSLAESQLISRFLSAFPPVTEKKFTLLPLDPAT 82 Query: 179 AE--DPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSEFSLTRGLNTSLPSS----LPNY 340 A DP+ ++++ + R+AHLL + +LSPPLS + ++ +L +++ +P S LPNY Sbjct: 83 ANSTDPFTLQWETIRRSAHLLSPLISSLSPPLSFIVTDITL---MSSVIPISANLCLPNY 139 Query: 341 TIVPISATFFSIMARLPRLHDSLDXXXXXXXXXXEIPGIGSVPRENIPPPFFSPGHTFAE 520 + SA FS++A P + EIPGI +PR ++PP + FA+ Sbjct: 140 MLFTSSARMFSLLAYFPSTKTA--DGSFQFGNVIEIPGIPPIPRSSLPPVLLNSNSLFAK 197 Query: 521 LVTSNARHLHLSKGIVLNTXXXXXXXXXXXXRVAPELPELFPFAVPLHAAKKSSTPTTAC 700 + + N++ + G+++NT A LP +FP L + + Sbjct: 198 IFSENSQTITKLNGVLINTFEGLEKQALDMLNSAKGLPPVFPIGPLLRCEFEGAESLATL 257 Query: 701 TWLDGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMSGWRFLWXXXXXXXXXXXXXXXXX 880 WLD Q + SV+YV FGSRT ++Q++E+GMGL +SG +FLW Sbjct: 258 KWLDDQKEGSVLYVGFGSRTTTSKEQIKEIGMGLLLSGCKFLWVVRTKILDKEEEEGLDE 317 Query: 881 XXXXXRELYEALVVSGKGMIERKWVNQDEILGHRAVGGFLSHAGWSSVLEAARHGVRVLA 1060 EL + + S G++ ++WVNQ EIL H+AVGGFLSH GW+SV+EAA +GV +LA Sbjct: 318 ILGY--ELMQRIKSSNNGLVVKEWVNQCEILSHKAVGGFLSHCGWNSVVEAALNGVPMLA 375 Query: 1061 WPLG--GDQRMNAWSVNKAGLGIW---DGWGPDKLVPAAEIGRKVEVLM 1192 P GDQR+N V AG + GWG D L+ EIG K++ LM Sbjct: 376 
CPQRQFGDQRINLEVVEAAGWVLCVKSSGWGEDVLLKGEEIGEKIKELM 424 >ref|XP_004505380.1| PREDICTED: anthocyanidin 5,3-O-glucosyltransferase-like [Cicer arietinum] Length = 465 Score = 234 bits (597), Expect = 5e-59 Identities = 155/415 (37%), Positives = 216/415 (52%), Gaps = 18/415 (4%) Frame = +2 Query: 2 HLLPCLNLASAL-SAHCTVTLVIAVPIISAAETGEISSFTASHPHIRRLDFRM---DSRQ 169 HL P L LA+ L + HC VTL+ +P +S AE+ +S F +S PH+ L F + S Sbjct: 18 HLTPFLRLAALLLNNHCKVTLINPLPTVSHAESNLLSHFHSSFPHLNILPFHLPLPSSSP 77 Query: 170 NTQAEDPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSEFSLTRGL-NTSLPSSLPNYTI 346 + + DP+ R L + HLL L +LSPPLSA S+ L L + +L S+PNYT+ Sbjct: 78 PSNSIDPFFFRVQTLRDSIHLLPPLLSSLSPPLSAFISDIMLISPLLSITLKLSIPNYTL 137 Query: 347 VPISATFFSIMARLPRLHDSLDXXXXXXXXXX--EIPGI--GSVPRENIPPPFFSPGHTF 514 SA+ FS + P L SL E+PGI +P +IPP P T Sbjct: 138 FTSSASMFSFFSHFPTLSQSLSSQPISDSDAVAVEVPGIPFSPLPYSSIPPFLIFPT-TI 196 Query: 515 AELVTSNARHLHLSKGIVLNTXXXXXXXXXXXX---RVAPELPELF---PFAVPLHAAKK 676 + ++ +L G+ NT +V LP ++ PF +K Sbjct: 197 RNFIMEDSPNLTNLDGVFANTFEALESYSLETLNSGKVVKNLPPVYAVGPFVS--FEFEK 254 Query: 677 SSTPTTACTWLDGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMSGWRFLWXXXXXXXXX 856 S T WLDGQP SVVYV FGSRTA+ RDQMRE+G GL SG++FLW Sbjct: 255 ESQQTALTKWLDGQPIGSVVYVCFGSRTALGRDQMREIGNGLIRSGYKFLWVVKDKIVDK 314 Query: 857 XXXXXXXXXXXXXRELYEALVVSGKGMIERKWVNQDEILGHRAVGGFLSHAGWSSVLEAA 1036 +L E + KG++ ++WV+Q EILGH++VGGF+SH GW+S++EA Sbjct: 315 EEEIGLDEILGV--DLVEKM--KEKGLVIKEWVDQSEILGHKSVGGFVSHCGWNSLVEAV 370 Query: 1037 RHGVRVLAWPLGGDQRMNAWSVNKAGLGIWD---GWGPDKLVPAAEIGRKVEVLM 1192 +GV +LAWP GDQ++NA V G GIW+ GW + +V EIG ++ +M Sbjct: 371 WNGVPILAWPQHGDQKINAKLVEIGGWGIWNKNWGWSGELVVKGEEIGDAIQEMM 425 >ref|XP_003607777.1| Anthocyanidin 5 3-O-glucosyltransferase [Medicago truncatula] gi|355508832|gb|AES89974.1| Anthocyanidin 5 3-O-glucosyltransferase [Medicago truncatula] Length = 469 Score = 231 bits (590), Expect = 3e-58 Identities = 150/415 (36%), Positives = 219/415 (52%), Gaps = 18/415 (4%) Frame = +2 Query: 2 
HLLPCLNLASA-LSAHCTVTLVIAVPIISAAETGEISSFTASHPHIRRLDFRMDSRQNTQ 178 HL P L LAS L+ +C VTL+ +P +S AE+ + F +S P + + F + Sbjct: 18 HLTPFLRLASLFLNNNCKVTLITPLPTVSLAESQLLDHFHSSFPQVNFIPFHLQPSSPDS 77 Query: 179 AEDPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSE-FSLTRGLNTSLPSSLPNYTIVPI 355 DP+ R L + +LL + +LSPP++ S+ F L+ ++ + SLPNYT+ Sbjct: 78 VVDPFFHRVQTLRDSTNLLPPLISSLSPPITVFISDIFLLSPLISITQQLSLPNYTLFTS 137 Query: 356 SATFFSIMARLPRLHDSLDXXXXXXXXXXEIPGIG--SVPRENIPPPFFSPGHTFAELVT 529 SA+ FS + P L S+ +PGI +P +IPP F P F L+ Sbjct: 138 SASMFSFFSHFPTLAQSISDASAEISEIP-VPGIAFSPLPYSSIPPILFKPT-IFRNLMM 195 Query: 530 SNARHLHLSKGIVLNTXXXXXXXXXXXXR---VAPELPELF---PFAVPLHAAKKSSTPT 691 ++ +L +G+ LNT V +P ++ PF VPL K+S T Sbjct: 196 EDSPNLTKLQGVFLNTFKALESHSLQALNNGEVVKGMPPVYAVGPF-VPLEFEKESQKET 254 Query: 692 TA-----CTWLDGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMSGWRFLWXXXXXXXXX 856 ++ WLD QP SVVYV FGSRTA+ RDQMRE+G GL SG+ FLW Sbjct: 255 SSESPPLTKWLDEQPIGSVVYVCFGSRTALGRDQMREIGDGLMRSGYNFLWVVKDKIVDK 314 Query: 857 XXXXXXXXXXXXXRELYEALVVSGKGMIERKWVNQDEILGHRAVGGFLSHAGWSSVLEAA 1036 EL E + KG++ ++WV+Q EIL H+++GGF+SH GW+S++EAA Sbjct: 315 EDKEVGLDEVLGV-ELVERM--KKKGLVVKEWVDQSEILSHKSIGGFVSHCGWNSIMEAA 371 Query: 1037 RHGVRVLAWPLGGDQRMNAWSVNKAGLGIWD---GWGPDKLVPAAEIGRKVEVLM 1192 +GV +LAWP GDQR+NA V +G GIW+ GWG +++V EIG ++ +M Sbjct: 372 LNGVPILAWPQHGDQRINAGLVEISGWGIWNKNWGWGGERVVKGEEIGDAIKEMM 426 >ref|XP_004135442.1| PREDICTED: anthocyanidin 5,3-O-glucosyltransferase-like [Cucumis sativus] gi|449530181|ref|XP_004172074.1| PREDICTED: anthocyanidin 5,3-O-glucosyltransferase-like [Cucumis sativus] Length = 458 Score = 230 bits (586), Expect = 1e-57 Identities = 142/409 (34%), Positives = 215/409 (52%), Gaps = 12/409 (2%) Frame = +2 Query: 2 HLLPCLNLASALSAH-CTVTLVIAVPIISAAETGEISSFTASHPHIRRLDFRMDSRQNTQ 178 HL+P L LA+ L +H C +TL+ + P +S+AE+ IS F ++ P + L F + + Sbjct: 20 HLVPFLRLANTLLSHNCKLTLITSHPPVSSAESHLISRFLSAFPQVNELKFHILPLDPSI 79 Query: 179 A--EDPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSEFSLTRG---LNTSLPSSLPNYT 343 A +DP+ ++++A+ R+ H+L + LSPPLSA+ + +L 
LNT+L ++P Y Sbjct: 80 ANSDDPFFLQFEAIRRSVHVLNSPISALSPPLSALVCDVTLISSGLLLNTTL--NIPIYA 137 Query: 344 IVPISATFFSIMARLPRLHDSLDXXXXXXXXXXEIPGIGSVPRENIPPPFFSPGHTFAEL 523 + SA S+ A P S IP IGS+P+ ++PPP F ++ Sbjct: 138 LFTSSAKMLSLFAYYPFAKMS-----DPSSDFIRIPAIGSIPKTSLPPPLLINNSIFGKI 192 Query: 524 VTSNARHLHLSKGIVLNTXXXXXXXXXXXX---RVAPELPELFPFAVPLHAAKKSSTPTT 694 + + + GI++N +V +P + P L ++ + Sbjct: 193 FAQDGQRIKELNGILINAMDGIEGDTLTALNTGKVLNGVPPVIPIGPFLPCDFENPDAKS 252 Query: 695 ACTWLDGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMSGWRFLWXXXXXXXXXXXXXXX 874 WLD P SVV+ +FGSRTA RDQ++E+G GL SG+RF+W Sbjct: 253 PIKWLDNLPPRSVVFASFGSRTATSRDQIKEIGSGLVSSGYRFVWVVKDKVVDKEDKEGL 312 Query: 875 XXXXXXXRELYEALVVSGKGMIERKWVNQDEILGHRAVGGFLSHAGWSSVLEAARHGVRV 1054 EL + L KGM+ ++WVNQ EILGHRAVGGF+ H GW+SV+EAA +GV + Sbjct: 313 EDIMG--EELMKKL--KEKGMVLKEWVNQQEILGHRAVGGFICHCGWNSVMEAALNGVPI 368 Query: 1055 LAWPLGGDQRMNAWSVNKAGLGIWD---GWGPDKLVPAAEIGRKVEVLM 1192 L WP GDQ +NA + K GLG+W GWG LV E+G +++ +M Sbjct: 369 LGWPQIGDQMINAELIAKKGLGMWVEEWGWGQKCLVKGEEVGGRIKEMM 417 >gb|EXB38054.1| UDP-glycosyltransferase [Morus notabilis] Length = 465 Score = 228 bits (582), Expect = 3e-57 Identities = 148/410 (36%), Positives = 216/410 (52%), Gaps = 13/410 (3%) Frame = +2 Query: 2 HLLPCLNLASAL-SAHCTVTLVIAVPIISAAETGEISSFTASHPHIRRLDFR-MDSRQNT 175 HLLP L + S L S +CTVTL+ A +SA E+ ISSF + HP ++ +D + + N Sbjct: 24 HLLPFLRITSMLLSRNCTVTLITAESTVSAVESSYISSFLSQHPQVKHVDIQPIQLHSNP 83 Query: 176 QAEDPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSEFSLTRGLNTSLPSSL--PNYTIV 349 + DP ++++++SR+ LL L + SPPLSA+F+ S+ + T + + L P+Y + Sbjct: 84 TSNDPLFLQFESISRSFQLLSPALSSSSPPLSAIFTHLSMA-SIITPIAAELGVPSYLVS 142 Query: 350 PISATFFSIMARLPRLHDSLDXXXXXXXXXXEIPGIGSVPRENIPPPFFSPGHTFAE-LV 526 S F +MA P L D IPG+ P +IP PF +P + F ++ Sbjct: 143 STSTKFLCLMAYHPVLIADPDKLGNSSTELT-IPGLTPFPISSIPSPFKNPDNIFTRSIL 201 Query: 527 TSNARHLHLSKGIVLNTXXXXXXXXXXXXR----VAPELPELFPFAVPLHAAKKSSTPTT 694 NAR L +KGI++N+ + LP + P PL + + + Sbjct: 202 VPNARALSKAKGIIVNSFDCFEPETLEAINNGRVLEHSLPPVLPIG-PLESYEIKKEKSH 260 
Query: 695 ACTWLDGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMSGWRFLWXXXXXXXXXXXXXXX 874 TWLD QP++SVVYV FG RT M Q+REL GL+ SG+RFL Sbjct: 261 YMTWLDNQPEESVVYVNFGGRTTMSNHQIRELSKGLERSGYRFL--LVLKCSEVDEEDKD 318 Query: 875 XXXXXXXRELYEALVVSGKGMIERKWVNQDEILGHRAVGGFLSHAGWSSVLEAARHGVRV 1054 E KGM+ + WV+Q EIL H ++G F++H GW+SV+EAAR G+ + Sbjct: 319 DLKDLVGDSFLER--TRNKGMVVKGWVSQQEILEHPSIGAFVNHCGWNSVMEAARRGIPM 376 Query: 1055 LAWPLGGDQRMNAWSVNKAGLGIWDG-WG---PDKLVPAAEIGRKVEVLM 1192 +AWP GDQR+NA V AGLGIW+ WG +LV EI +K++ +M Sbjct: 377 VAWPQIGDQRVNAEIVKNAGLGIWESKWGLGLQAELVCGEEIEKKIKEVM 426 >gb|EOY01771.1| UDP-glucosyl transferase 88A1, putative isoform 1 [Theobroma cacao] gi|508709875|gb|EOY01772.1| UDP-glucosyl transferase 88A1, putative isoform 1 [Theobroma cacao] gi|508709876|gb|EOY01773.1| UDP-glucosyl transferase 88A1, putative isoform 1 [Theobroma cacao] gi|508709877|gb|EOY01774.1| UDP-glucosyl transferase 88A1, putative isoform 1 [Theobroma cacao] gi|508709878|gb|EOY01775.1| UDP-glucosyl transferase 88A1, putative isoform 1 [Theobroma cacao] gi|508709879|gb|EOY01776.1| UDP-glucosyl transferase 88A1, putative isoform 1 [Theobroma cacao] Length = 474 Score = 228 bits (580), Expect = 5e-57 Identities = 144/410 (35%), Positives = 216/410 (52%), Gaps = 13/410 (3%) Frame = +2 Query: 2 HLLPCLNLASAL-SAHCTVTLVIAVPIISAAETGEISSFTASHPHIRRLDFRMDSRQNTQ 178 HLLP L LA +L S C VTL+ PI+S AE+ IS+F ++ P + F + Sbjct: 23 HLLPFLRLAGSLISQRCQVTLITTHPIVSLAESQLISAFLSAFPQVSEKKFTLLPLDPLT 82 Query: 179 AE--DPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSEFSLTRGL-NTSLPSSLPNYTIV 349 A DP+ ++++ + R+AHLL L +LSPPLS + ++ +L + + + LPNY + Sbjct: 83 ANCNDPFKLQWETIRRSAHLLSPLLSSLSPPLSFIITDMTLMSSVVSVTANLCLPNYILF 142 Query: 350 PISATFFSIMARLPRLHDS-LDXXXXXXXXXXEIPGIGS-VPRENIPPPFFSPGHTFAEL 523 SA FS+ A P + +S D +PG+GS +P ++P F + Sbjct: 143 TTSARMFSLFAYFPSIAESKTDGGSSRFGDEIRVPGLGSPIPVSSLPSTLLDLNSFFTKN 202 Query: 524 VTSNARHLHLSKGIVLNTXXXXXXXXXXXXRVAPE---LPELFPFAVPLHAAKKSSTPTT 694 + N+R + G+++N+ V LP +FP L + + + Sbjct: 203 
FSDNSRSIKNVNGVLINSFEGLEKQSLEMLTVGKAMEGLPPVFPVGPLLPLEFEGQSSFS 262 Query: 695 ACTWLDGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMSGWRFLWXXXXXXXXXXXXXXX 874 WL+GQ + SVVYV+FGSRT M ++Q+RELG GL +SG++F+W Sbjct: 263 PLKWLEGQKERSVVYVSFGSRTPMSKEQIRELGTGLVLSGYKFVWVVKSKVVDKEEDESL 322 Query: 875 XXXXXXXRELYEALVVSGKGMIERKWVNQDEILGHRAVGGFLSHAGWSSVLEAARHGVRV 1054 +EL E V G++ ++WVNQ +IL H+AVGGF+SH GW+SV+EAA HGV V Sbjct: 323 DEILG--QELKEK--VMNNGLVVKEWVNQWKILSHKAVGGFISHCGWNSVVEAAWHGVPV 378 Query: 1055 LAWPLGGDQRMNAWSVNKAGLGI----WDGWGPDKLVPAAEIGRKVEVLM 1192 L WP GDQ +NA + G G+ W GW D +V EIG +++ LM Sbjct: 379 LGWPQHGDQMINAEVIEGGGWGLCMKSW-GWVSDIVVKGEEIGDRIKELM 427 >ref|XP_002306046.2| hypothetical protein POPTR_0004s12460g [Populus trichocarpa] gi|550340898|gb|EEE86557.2| hypothetical protein POPTR_0004s12460g [Populus trichocarpa] Length = 461 Score = 224 bits (572), Expect = 4e-56 Identities = 144/405 (35%), Positives = 203/405 (50%), Gaps = 8/405 (1%) Frame = +2 Query: 2 HLLPCLNLASALSA-HCTVTLVIAVPIISAAETGEISSFTASHPHIRRLDFRMDSRQNTQ 178 HL P L LA++L+ + VT +I P +S +E+ +S AS P I+ F + N Sbjct: 22 HLTPFLRLAASLTLQNVQVTFIIPHPTVSLSESQALSQLFASFPQIKHQQFHLLPLDNP- 80 Query: 179 AEDPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSEFSLTRGLNTSLPS-SLPNYTIVPI 355 ++DP+ + + ++ LL L L+PPLS ++ SL + + SLPNY + Sbjct: 81 SDDPFFEHFQLIKNSSRLLSPLLSALNPPLSVFITDMSLASTVTPITEAISLPNYVLFTS 140 Query: 356 SATFFSIMARLPRLHDSLDXXXXXXXXXXEIPGIGSVPRENIPPPFFSPGHTFAEL-VTS 532 SA + P L DS +I G+ +P+ IPPP G+ + Sbjct: 141 SAKMLTFFLCYPTLADSKAMDELDEMDVIKIRGLELMPKSWIPPPLLKKGNNILKTSFIE 200 Query: 533 NARHLHLSKGIVLNTXXXXXXXXXXXXRVAPELPELFPFAVPLHAAKKSSTPTTAC--TW 706 ++R + S GI++NT L E P V + + TW Sbjct: 201 DSRKVAESSGILVNTFESFEQESLRKLNDCQLLLERLPSVVAIGPLPPCDFEKSQLQLTW 260 Query: 707 LDGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMSGWRFLWXXXXXXXXXXXXXXXXXXX 886 LD QP SVVYV+FGSRTA+ RDQ+RELG GL SG RF+W Sbjct: 261 LDDQPAGSVVYVSFGSRTALSRDQVRELGEGLVRSGSRFIWVVKDKKVDREDNEGLEGVI 320 Query: 887 XXXRELYEALVVSGKGMIERKWVNQDEILGHRAVGGFLSHAGWSSVLEAARHGVRVLAWP 1066 EL E + KG++ R WVNQ+++L H 
AVGGF SH GW+SV+EAA HGV++LAWP Sbjct: 321 GD--ELMERM--KEKGLVVRNWVNQEDVLSHPAVGGFFSHCGWNSVMEAAWHGVKILAWP 376 Query: 1067 LGGDQRMNAWSVNKAGLGIWD---GWGPDKLVPAAEIGRKVEVLM 1192 GDQ++NA V + GLG W GWG + +V AEI K+ +M Sbjct: 377 QHGDQKVNADIVERIGLGTWVKSWGWGEEMIVNRAEIAEKIGEIM 421 >gb|EOY01770.1| UDP-glucosyl transferase 88A1, putative [Theobroma cacao] Length = 465 Score = 223 bits (568), Expect = 1e-55 Identities = 138/406 (33%), Positives = 208/406 (51%), Gaps = 9/406 (2%) Frame = +2 Query: 2 HLLPCLNLASAL-SAHCTVTLVIAVPIISAAETGEISSFTASHPHI--RRLDFRMDSRQN 172 HL P L A+AL HC +TL+ P++S AE+ IS F ++ P + +++ Sbjct: 23 HLTPFLRFAAALLRCHCQLTLITTDPVVSLAESQLISRFLSAFPQVTEKKITLLPLDPAT 82 Query: 173 TQAEDPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSEFSLTRGLNTSLPS-SLPNYTIV 349 + DP+ ++++ + R+AHLL + +LSPPLS + ++ SL + + LPNY + Sbjct: 83 INSADPFTLQWETIRRSAHLLSPLISSLSPPLSFIVTDISLQSSIIPITANLRLPNYILF 142 Query: 350 PISATFFSIMARLPRLHDSLDXXXXXXXXXXEIPGIGSVPRENIPPPFFSPGHTFAELVT 529 SA FS++A P D IPGI +PR ++PP + FA+ + Sbjct: 143 ISSARMFSLLAYFPST--KTDDGSFQFGNVIIIPGIPPIPRSSLPPVLLNSNSPFAKNFS 200 Query: 530 SNARHLHLSKGIVLNTXXXXXXXXXXXXRVAPELPELFPFAVPLHAAKKSSTPTTACTWL 709 ++ + G+++NT LP +FP L + WL Sbjct: 201 EGSQTITKVNGVLINTFDGLEKQALDMLNTVKGLPPVFPVGPLLPCEFEGPESLATLKWL 260 Query: 710 DGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMSGWRFLWXXXXXXXXXXXXXXXXXXXX 889 + Q + SV++V FGSRTA ++Q+RE+GMGL +SG +FLW Sbjct: 261 EDQKEGSVLFVCFGSRTATSKEQIREIGMGLLLSGCKFLWVVRIKIFDKEEEEGLDEILG 320 Query: 890 XXRELYEALVVSGKGMIERKWVNQDEILGHRAVGGFLSHAGWSSVLEAARHGVRVLAWPL 1069 EL + + S G++ ++WVNQ EIL H+AVGGFLSH GW+SV+EAA +GV +LA P Sbjct: 321 Y--ELMQRIKSSNNGLVVKEWVNQCEILSHKAVGGFLSHCGWNSVVEAALNGVPMLACPQ 378 Query: 1070 G--GDQRMNAWSVNKAGLGIW---DGWGPDKLVPAAEIGRKVEVLM 1192 GDQR+N V AG + GWG D L+ EIG K++ LM Sbjct: 379 RQFGDQRINLEVVEAAGWVLCVKSSGWGEDVLLKGEEIGEKIKELM 424 >ref|XP_002324085.2| hypothetical protein POPTR_0017s12490g [Populus trichocarpa] gi|550320130|gb|EEF04218.2| hypothetical protein POPTR_0017s12490g [Populus trichocarpa] Length = 460 Score = 214 bits 
(546), Expect = 4e-53 Identities = 146/412 (35%), Positives = 202/412 (49%), Gaps = 15/412 (3%) Frame = +2 Query: 2 HLLPCLNLASALSA-HCTVTLVIAVPIISAAETGEISSFTASHPHIRRLDFRMDSRQNTQ 178 HL P L LA+ L+A + VT + P +S E+ +S F AS P +++ F + + Sbjct: 22 HLTPFLRLAALLTARNVQVTFITPHPTVSLTESQALSGFFASFPQVKQKQFHLLPLEENS 81 Query: 179 AEDPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSEFSLTRGLNTSLPS----SLPNYTI 346 DP+ + + + HLL L L+P LS ++ +L +T +P SLPNY + Sbjct: 82 V-DPFFYQMQLIKSSCHLLSPLLSALTPSLSVFITDMTLA---STVIPITQAISLPNYVL 137 Query: 347 VPISATFFSIMARLPRLHDSLDXXXXXXXXXXEIPGIGSVPRENIPPPFFSPGHTFAE-L 523 SA ++ P L S +I + +P+ +PPP + F + Sbjct: 138 FTSSAKMMTLFLSYPTLAGSKALDDLDETDVIKIRNVELMPKSLLPPPLLQKSNNFFKNS 197 Query: 524 VTSNARHLHLSKGIVLNTXXXXXXXXXXXXRVA------PELPELFPFAVPLHAAKKSST 685 + R + S GI+LNT P + + PF P ++ K Sbjct: 198 FIEDGRKVTESCGILLNTFVSFELESLRKINDGQVLERPPSVVAIGPFP-PCNSEKSQ-- 254 Query: 686 PTTACTWLDGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMSGWRFLWXXXXXXXXXXXX 865 TWLD QP SV+YV+FGSRTA+ RDQ+RELG GL SG RF+W Sbjct: 255 --LQLTWLDDQPAGSVLYVSFGSRTALARDQIRELGEGLIKSGSRFVWMVKDKKVDKEDS 312 Query: 866 XXXXXXXXXXRELYEALVVSGKGMIERKWVNQDEILGHRAVGGFLSHAGWSSVLEAARHG 1045 EL E V KG+I + W+NQD IL HRAVGGFLSH GW+SV+EAA HG Sbjct: 313 EELEEVIGY--ELMER--VKEKGLIVKDWLNQDGILSHRAVGGFLSHCGWNSVMEAAWHG 368 Query: 1046 VRVLAWPLGGDQRMNAWSVNKAGLGIWD---GWGPDKLVPAAEIGRKVEVLM 1192 VR+LAWP GDQ++NA V + GLG W GW + LV AEI ++ M Sbjct: 369 VRILAWPQNGDQKINADIVERIGLGTWVKSWGWSGEMLVKGAEIAERIRESM 420 >ref|XP_002444986.1| hypothetical protein SORBIDRAFT_07g002370 [Sorghum bicolor] gi|241941336|gb|EES14481.1| hypothetical protein SORBIDRAFT_07g002370 [Sorghum bicolor] Length = 499 Score = 214 bits (545), Expect = 6e-53 Identities = 145/431 (33%), Positives = 214/431 (49%), Gaps = 34/431 (7%) Frame = +2 Query: 2 HLLPCLNLASALSAH---CTVTLVIAVPIISAAETGEISSFTASHPHIRRLDFRMDSRQN 172 HL+P +ALS+H C+V V+ P +S AE ++ A+ P I+R+DF + Sbjct: 35 HLVPFFRFITALSSHGVRCSVMTVL--PTVSDAEADHFAALFAALPSIQRVDFNLLPLDA 92 Query: 173 
TQ--AEDPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSEFSL-TRGLNTSLPSSLPNYT 343 + DP+L+R++AL R+AHLL + P +SAV ++ +L + + + LP + Sbjct: 93 SAFPGTDPFLLRWEALRRSAHLLDRLIAGAYPRVSAVVTDVTLASHVIPVAKQLQLPCHV 152 Query: 344 IVPISATFFSIMARLP----RLHDSLDXXXXXXXXXXEIPGIGSVPRENIPPPFFSPGHT 511 + SAT S++A P + D D +IPG+ + + ++P P H Sbjct: 153 LYISSATMLSLVAYFPIHLDKKQDDDDAGAGGGVGDVDIPGVRRIRQSSLPQPLHDLNHL 212 Query: 512 FAELVTSNARHLHLSKGIVLNTXXXXXXXXXXXXRVAPELPELFPFAVPLHAAKKSSTPT 691 F N R L + GI++NT R +P FP + K SS+P+ Sbjct: 213 FTRQFIDNGRALSQADGILVNTFDALEPMALAALRDGKVVPG-FPPVYAIGLLKSSSSPS 271 Query: 692 TACT--------------------WLDGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMS 811 ++ + WL QP SVVY+AFGSR A+ +Q+RE+G GL+ S Sbjct: 272 SSSSSIFTEAGEKQAAAAASPVIAWLGEQPARSVVYIAFGSRIAVSHEQIREMGAGLEAS 331 Query: 812 GWRFLWXXXXXXXXXXXXXXXXXXXXXXRELYEALVVSGKGMIERKWVNQDEILGHRAVG 991 G RFLW E E V G+GM+ + WV Q+ +L H AVG Sbjct: 332 GCRFLWVLKTTVVDREDTAEPRDVLGD--EFLER--VKGRGMVTKGWVEQEAVLRHAAVG 387 Query: 992 GFLSHAGWSSVLEAARHGVRVLAWPLGGDQRMNAWSVNKAGLGIW-DGW---GPDKLVPA 1159 FLSH+GW+SV EAA GV +LAWP GGDQR+NA ++ G+G+W + W G D +V Sbjct: 388 LFLSHSGWNSVTEAAACGVPLLAWPRGGDQRVNAMALESGGVGVWMERWSWDGEDGIVSG 447 Query: 1160 AEIGRKVEVLM 1192 EIG KV+ M Sbjct: 448 REIGEKVKAAM 458 >gb|EMJ16549.1| hypothetical protein PRUPE_ppa005427mg [Prunus persica] Length = 462 Score = 213 bits (543), Expect = 9e-53 Identities = 153/422 (36%), Positives = 213/422 (50%), Gaps = 25/422 (5%) Frame = +2 Query: 2 HLLPCLNLASALSAHCT-VTLVIAVPIISAAETGEISSFTASHPHI--RRLDFRMDSRQN 172 HL P L LA+ L+AH VT + P +S AE+ +S + P I + L + + Sbjct: 22 HLTPFLRLAALLTAHNVHVTFITPSPTVSLAESLSLSHLFTTFPQITQKHLHLLPLDQPS 81 Query: 173 TQAEDPYLMRYDALSRNAHLLGHHLPTLSPPLSAVFSEFSLTRGLNTSLPS-SLPNYTIV 349 +EDP+ ++ + R++HLL L +L PPLSA+ ++ SLT +N S LPNY Sbjct: 82 ANSEDPFYYHFELIRRSSHLLPPLLSSLCPPLSAIITDMSLTSTVNPLTDSLGLPNYIFF 141 Query: 350 PISA---TFF-SIMARLPRLHDSLDXXXXXXXXXXEIPGIGSVPRENIPPPFFSPGHTFA 517 SA TF+ S L H+ D ++ G+ +P+ IPPP G+ Sbjct: 142 TSSAKMLTFYVSFHTMLGPNHEIEDHT--------KVSGLEQIPKAWIPPPLLRGGNNLL 
193 Query: 518 E-LVTSNARHLHLSKGIVLNTXXXXXXXXXXXX---RVAPELPELFPFAVPLHAAKKSST 685 + N + + S GI++NT +V +LP + PL + Sbjct: 194 KTFFLENGKKMTESSGILVNTYESIERETLAALNEGKVLRKLPSVIAIG-PLAPCIFEES 252 Query: 686 PTTACTWLDGQPDDSVVYVAFGSRTAMPRDQMRELGMGLKMSGWRFLWXXXXXXXXXXXX 865 A WLD QP SV+YV+FGSRTAM RDQ+RELG GL SG RFLW Sbjct: 253 QQLA--WLDDQPTGSVLYVSFGSRTAMSRDQIRELGDGLVRSGCRFLWVVKDKKVDVEDD 310 Query: 866 XXXXXXXXXXRELYEALVVSGKGMIER---------KWVNQDEILGHRAVGGFLSHAGWS 1018 ++L E L G+G++ER W+NQ EIL H A+GGFLSH GW+ Sbjct: 311 ----------KKLIEVL---GQGLLERVKKNGFAVKNWLNQQEILSHPAIGGFLSHCGWN 357 Query: 1019 SVLEAARHGVRVLAWPLGGDQRMNAWSVNKAGLGIWD---GWGP-DKLVPAAEIGRKVEV 1186 S+ EA +GVR+LAWP GDQ++NA V + GLG WD GWG + LV A +I +V Sbjct: 358 SLTEALWNGVRILAWPQHGDQKINADLVERIGLGTWDKSWGWGEGEMLVKAQDIAERVRE 417 Query: 1187 LM 1192 +M Sbjct: 418 IM 419