BLASTX nr result

ID: Glycyrrhiza35_contig00002114 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza35_contig00002114
         (1595 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_004502258.1 PREDICTED: crocetin glucosyltransferase, chloropl...   600   0.0  
GAU18595.1 hypothetical protein TSUD_124260 [Trifolium subterran...   593   0.0  
XP_003552552.1 PREDICTED: crocetin glucosyltransferase, chloropl...   592   0.0  
XP_019415874.1 PREDICTED: crocetin glucosyltransferase, chloropl...   586   0.0  
XP_007163802.1 hypothetical protein PHAVU_001G265400g [Phaseolus...   585   0.0  
XP_017405475.1 PREDICTED: crocetin glucosyltransferase, chloropl...   573   0.0  
XP_014521681.1 PREDICTED: crocetin glucosyltransferase, chloropl...   569   0.0  
XP_002263700.1 PREDICTED: crocetin glucosyltransferase, chloropl...   546   0.0  
XP_003531212.1 PREDICTED: crocetin glucosyltransferase, chloropl...   538   0.0  
AAY27090.1 UDP-glucose:flavonoid 7-O-glucosyltransferase [Pyrus ...   533   0.0  
XP_018844047.1 PREDICTED: crocetin glucosyltransferase, chloropl...   531   0.0  
KYP56999.1 Anthocyanin 5-O-glucosyltransferase [Cajanus cajan]        529   0.0  
XP_007221288.1 hypothetical protein PRUPE_ppa016890mg [Prunus pe...   529   0.0  
NP_001315912.1 crocetin glucosyltransferase, chloroplastic-like ...   529   0.0  
XP_018848545.1 PREDICTED: crocetin glucosyltransferase, chloropl...   526   0.0  
XP_003524180.1 PREDICTED: crocetin glucosyltransferase, chloropl...   526   0.0  
XP_018809219.1 PREDICTED: crocetin glucosyltransferase, chloropl...   523   e-179
XP_008348418.1 PREDICTED: crocetin glucosyltransferase, chloropl...   522   e-179
AMO27404.1 anthocyanidin 3-o-glucoside 5-o-glucosyltransferase 1...   520   e-179
XP_014634355.1 PREDICTED: crocetin glucosyltransferase, chloropl...   520   e-178

>XP_004502258.1 PREDICTED: crocetin glucosyltransferase, chloroplastic [Cicer
            arietinum] AGU14117.1 UDP-glycosyltransferase [Cicer
            arietinum]
          Length = 471

 Score =  600 bits (1546), Expect = 0.0
 Identities = 312/472 (66%), Positives = 350/472 (74%), Gaps = 3/472 (0%)
 Frame = +2

Query: 155  HRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVFXXX 334
            H FL++TYP+QGHINPALQFAKRL TMG  VTF TT++LHRRL+NKPT   LSFA F   
Sbjct: 5    HNFLIVTYPLQGHINPALQFAKRLVTMGAHVTFTTTIYLHRRLINKPTIPNLSFAAFSDG 64

Query: 335  XXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATVARE 514
                       ++++Y+ EL  RGSE LR+ ILSA     PFT LAYTLLLPW A VARE
Sbjct: 65   YDDGYNSNAIVDLSTYMLELSSRGSEFLRNIILSAKHGNHPFTCLAYTLLLPWAANVARE 124

Query: 515  LHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPSFLL 694
            L LP ALLWIQAATVFDIYYYY HEHGDYIT K++D T +IELPGL FSL SRDLPSFL 
Sbjct: 125  LQLPYALLWIQAATVFDIYYYYLHEHGDYITNKSKDATCNIELPGLSFSLKSRDLPSFLQ 184

Query: 695  ASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPIGPLIP 874
            ASN YTF L S KEQ  IL+EETNPIVLVNTV+E E ES+RA+D+   K++MIPIGPLIP
Sbjct: 185  ASNIYTFILSSMKEQFRILDEETNPIVLVNTVDEFELESVRAIDD---KIKMIPIGPLIP 241

Query: 875  SAFLDGNDPADTSFGGDTISV-SSDYLEWLDSKTESSVVYVSFGSLAVLPKRQMEEIARA 1051
            SA+LDG D  DTSFGGD I V S DY+EWLDSK ESSVVYVSFGS +VLPK+QMEE ARA
Sbjct: 242  SAYLDGKDLTDTSFGGDVIRVDSEDYIEWLDSKDESSVVYVSFGSFSVLPKKQMEEFARA 301

Query: 1052 LLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSLGCFV 1231
            LLD  + FLWVI                          GKIVKWCSQ+EVLSH S+GCFV
Sbjct: 302  LLDSGLNFLWVIREKKVDEKKEDDELSCKEELEKNVN-GKIVKWCSQVEVLSHSSVGCFV 360

Query: 1232 THCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVR--XXXXXXXXXXXXXXR 1405
            THCGWNST ESL  GVPMVAFPQWTDQ+TNAKLIEDVWK GVR                R
Sbjct: 361  THCGWNSTTESLVSGVPMVAFPQWTDQSTNAKLIEDVWKCGVRMDNNRDEEGIVKADEIR 420

Query: 1406 RCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDVGRI 1561
            RCLE+V+G GGEKGEE++RNA +WKSL REAVKEGGSSDKNL++FL  +G I
Sbjct: 421  RCLELVIG-GGEKGEELKRNAEKWKSLGREAVKEGGSSDKNLKSFLHHIGSI 471


>GAU18595.1 hypothetical protein TSUD_124260 [Trifolium subterraneum]
          Length = 468

 Score =  593 bits (1529), Expect = 0.0
 Identities = 312/470 (66%), Positives = 346/470 (73%), Gaps = 2/470 (0%)
 Frame = +2

Query: 146  MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVF 325
            M QH FL+I YPIQGHINPALQFAKR+  +G  VTF TT++ +RRLVNKPT   LSFA F
Sbjct: 1    MAQHNFLIIAYPIQGHINPALQFAKRVINLGAHVTFTTTIYAYRRLVNKPTIPSLSFAAF 60

Query: 326  XXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATV 505
                         A +  YISE +RRGSE L++ ILSA ++  PFT L YTL LPW + V
Sbjct: 61   SDGYDDGYILKDDASILFYISEHQRRGSEFLKNIILSAKQKIHPFTCLIYTLTLPWASKV 120

Query: 506  ARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPS 685
            ARE  LPSALLWIQAATVFDIYYYYFH HGDYIT K ED   SI+LPGL FSL SRDLPS
Sbjct: 121  AREFDLPSALLWIQAATVFDIYYYYFHNHGDYITNKLEDAECSIDLPGLSFSLKSRDLPS 180

Query: 686  FLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPIGP 865
            FLLASN YT+ALPSFKE L+IL+EETNP VLVNTVEE E ++++ VD    K++MIPIGP
Sbjct: 181  FLLASNIYTWALPSFKEHLQILDEETNPRVLVNTVEEFELDAIKDVD--IGKIKMIPIGP 238

Query: 866  LIPSAFLDGNDPADTSFGGDTISVSSD--YLEWLDSKTESSVVYVSFGSLAVLPKRQMEE 1039
            LIPSAFLDG DP+D+S GGD I   S+  YLEWLD K ESSVVYV+FG+LAVL KRQM+E
Sbjct: 239  LIPSAFLDGKDPSDSSSGGDIIRGDSEDNYLEWLDLKGESSVVYVAFGTLAVLSKRQMDE 298

Query: 1040 IARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSL 1219
            IA ALLD    FLWVI                          GKIVKWCSQLEVLSH SL
Sbjct: 299  IACALLDSGFSFLWVIRDNKLRKQRDDDDELSYREEIEKNVNGKIVKWCSQLEVLSHSSL 358

Query: 1220 GCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVRXXXXXXXXXXXXX 1399
            GCFVTHCGWNSTLE LS GVPMVAFPQW DQTTNAKLIEDVWKTGVR             
Sbjct: 359  GCFVTHCGWNSTLEGLSSGVPMVAFPQWIDQTTNAKLIEDVWKTGVRMDRDEEGIVKADE 418

Query: 1400 XRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDD 1549
             +RCLEVVMG  GEKGEE+RRNA++WKS AREAVKEGGSSDKNLR FL+D
Sbjct: 419  IKRCLEVVMGK-GEKGEELRRNAKKWKSFAREAVKEGGSSDKNLRNFLND 467


>XP_003552552.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Glycine
            max] KRG97360.1 hypothetical protein GLYMA_18G003100
            [Glycine max]
          Length = 465

 Score =  592 bits (1525), Expect = 0.0
 Identities = 308/475 (64%), Positives = 358/475 (75%), Gaps = 4/475 (0%)
 Frame = +2

Query: 146  MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVF 325
            MVQHRFLLITYPIQGHINP++QFAKRL +MGV VTFAT+++LHRR++ KPT  GLSFA F
Sbjct: 1    MVQHRFLLITYPIQGHINPSIQFAKRLVSMGVHVTFATSLYLHRRMLKKPTIPGLSFATF 60

Query: 326  XXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATV 505
                         + ++SY+SELKRRGSE LR+ I +A +EGQPFT LAYT+LLPW A V
Sbjct: 61   SDGYDDGYKATDDSSLSSYMSELKRRGSEFLRNIITAAKQEGQPFTCLAYTILLPWAAKV 120

Query: 506  ARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPS 685
            ARELH+P ALLWIQAATVFDIYYYYFHE+GD    K+ DPT  IELPGLPFSLT+RD+PS
Sbjct: 121  ARELHIPGALLWIQAATVFDIYYYYFHEYGDSFNYKS-DPT--IELPGLPFSLTARDVPS 177

Query: 686  FLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPIGP 865
            FLL SN Y FALP+ +EQ + L++ETNPI+LVNT ++LEP++LRAVD    K  MIPIGP
Sbjct: 178  FLLPSNIYRFALPTLQEQFQDLDDETNPIILVNTFQDLEPDALRAVD----KFTMIPIGP 233

Query: 866  L-IPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQMEEI 1042
            L IPSAFLDG DPADTS+GGD    S+DY+EWLDS+ E SVVYVSFG+LAVL  RQM+E+
Sbjct: 234  LNIPSAFLDGKDPADTSYGGDLFDASNDYVEWLDSQPELSVVYVSFGTLAVLADRQMKEL 293

Query: 1043 ARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSLG 1222
            ARALLD    FLWVI                        + GKIVKWCSQ+EVLSH SLG
Sbjct: 294  ARALLDSGYLFLWVI---------RDMQGIEDNCREELEQRGKIVKWCSQVEVLSHGSLG 344

Query: 1223 CFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVR---XXXXXXXXXXX 1393
            CFVTHCGWNST+ESL  GVPMVAFPQWTDQ TNAK+++DVWKTGVR              
Sbjct: 345  CFVTHCGWNSTMESLGSGVPMVAFPQWTDQGTNAKMVQDVWKTGVRVDDKVNVEEGIVEA 404

Query: 1394 XXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDVGR 1558
               R+CL+VVMGSGG KG+E RRNA +WK LAREAV EGGSSD N+R FL DV +
Sbjct: 405  EEIRKCLDVVMGSGG-KGQEFRRNADKWKCLAREAVTEGGSSDSNMRTFLHDVAK 458


>XP_019415874.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Lupinus
            angustifolius] OIV97469.1 hypothetical protein
            TanjilG_10993 [Lupinus angustifolius]
          Length = 463

 Score =  586 bits (1510), Expect = 0.0
 Identities = 307/473 (64%), Positives = 347/473 (73%), Gaps = 3/473 (0%)
 Frame = +2

Query: 146  MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRL-VNKPTTKGLSFAV 322
            M   RFLLITYP QGHINP+LQFAKRL T+GV VTFATT+H+ R +  NK    GLS   
Sbjct: 1    MAHQRFLLITYPAQGHINPSLQFAKRLITLGVHVTFATTIHMQRCINKNKTIIPGLSITA 60

Query: 323  FXXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVAT 502
            F              +V SYISELK RGSE L + I  A ++G PFT + YTLLLPWVAT
Sbjct: 61   FSDGYDDGFNSAADVDVLSYISELKHRGSECLTNVIAYAIQQGNPFTCITYTLLLPWVAT 120

Query: 503  VARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLP 682
            VARE  LPSALLWIQ ATVFD+YY+YFH + +Y+ Q  ++PT S+ELPGLPF    RDLP
Sbjct: 121  VAREFQLPSALLWIQPATVFDMYYFYFHGYEEYMIQNVKEPTCSLELPGLPFIFKPRDLP 180

Query: 683  SFLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPIG 862
            SF   SN Y+FALPSFKEQLE+L+ ETNPIVLVNT EELE E+LRA+++    +RMIPIG
Sbjct: 181  SFFWPSNMYSFALPSFKEQLEVLDLETNPIVLVNTFEELEHEALRAIED----IRMIPIG 236

Query: 863  PLIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQMEEI 1042
            PLIPSAFLDG DP DTSFGGD I  ++DY+ WLDSK + SVVYVSFGSLAVLPKRQMEEI
Sbjct: 237  PLIPSAFLDGKDPNDTSFGGDIIHGTNDYVTWLDSKPKLSVVYVSFGSLAVLPKRQMEEI 296

Query: 1043 ARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSLG 1222
            A ALLD K PFLWVI                          GKIVKWCSQ+EVLSH SLG
Sbjct: 297  AIALLDSKHPFLWVIRENNAKEVLKYRDELEQG--------GKIVKWCSQVEVLSHHSLG 348

Query: 1223 CFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVR--XXXXXXXXXXXX 1396
            CFVTHCGWNST+ESL CGVP+VAFPQWTDQTTNAKLIEDVWK+GVR              
Sbjct: 349  CFVTHCGWNSTMESLVCGVPVVAFPQWTDQTTNAKLIEDVWKSGVRVDHELNEDGIVERD 408

Query: 1397 XXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDVG 1555
              R+CLEVVMGS GEKG+E+RRN+ +WK LA+EAVKEGGSSDKNLR FLD VG
Sbjct: 409  EIRKCLEVVMGS-GEKGQELRRNSYKWKDLAKEAVKEGGSSDKNLRTFLDVVG 460


>XP_007163802.1 hypothetical protein PHAVU_001G265400g [Phaseolus vulgaris]
            ESW35796.1 hypothetical protein PHAVU_001G265400g
            [Phaseolus vulgaris]
          Length = 531

 Score =  585 bits (1509), Expect = 0.0
 Identities = 306/497 (61%), Positives = 357/497 (71%), Gaps = 5/497 (1%)
 Frame = +2

Query: 95   SQRPQEHYHAP-SRASAAMVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHL 271
            S  P  + H+  SRA+  MV HRFLL+TYP+QGHINPA+QFAKRL  +GV VTFAT+  L
Sbjct: 43   SNNPLHNIHSAFSRATVTMVHHRFLLVTYPVQGHINPAIQFAKRLTAIGVHVTFATSTFL 102

Query: 272  HRRLVNKPTTKGLSFAVFXXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEG 451
            HRR++NK T  GL+F  F             + V SY++ELK RGSE  R+ I SA +EG
Sbjct: 103  HRRMINKLTVPGLTFVTFSDGYDDGYEGTNDSNVISYMAELKLRGSEFFRNIITSAKQEG 162

Query: 452  QPFTYLAYTLLLPWVATVARELHLPSALLWIQAATVFDIYYYYFHEHGDYIT--QKTEDP 625
            +PFT +AYTL+LPWVA VARE  +P  LLWIQAATV DIYY+YFHE+ DYI    K EDP
Sbjct: 163  KPFTCVAYTLMLPWVAKVAREFRIPGVLLWIQAATVLDIYYHYFHEYRDYINIIHKKEDP 222

Query: 626  TYSIELPGLPFSLTSRDLPSFLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEP 805
            T  IE PGLPFS  +RD+PSFLL SN Y+FA+P  +EQ ++ +EE+NP V+VNT E+LEP
Sbjct: 223  T--IEFPGLPFSFAARDIPSFLLPSNIYSFAIPPLEEQFQLFDEESNPRVIVNTFEDLEP 280

Query: 806  ESLRAVDEYDKKLRMIPIGPLIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSV 985
            ESLRA+D    KL MI IGPLIPSAFLDG DPADTSFGGD    S DY+EWLDSK   SV
Sbjct: 281  ESLRAID----KLTMISIGPLIPSAFLDGKDPADTSFGGDIFHGSKDYVEWLDSKPALSV 336

Query: 986  VYVSFGSLAVLPKRQMEEIARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKI 1165
            VYVSFGSLAVL +RQMEEIARALLD   PFLWVI                        + 
Sbjct: 337  VYVSFGSLAVLSQRQMEEIARALLDSGYPFLWVI----RESNGKEGTLQELSCRKELEQR 392

Query: 1166 GKIVKWCSQLEVLSHRSLGCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVW 1345
            GKIVKWC+Q+EVLSH S+GCFVTHCGWNST+ESLS GVPMV FPQWTDQ TNAKL+EDVW
Sbjct: 393  GKIVKWCTQVEVLSHGSVGCFVTHCGWNSTMESLSSGVPMVGFPQWTDQGTNAKLVEDVW 452

Query: 1346 KTGVR--XXXXXXXXXXXXXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSS 1519
            KTGVR                R+CLEVVMG+GG K EE+RRNA++WK LAREAVKEGGSS
Sbjct: 453  KTGVRVDDKVNEEGIVEAMEIRKCLEVVMGNGG-KAEELRRNAQKWKCLAREAVKEGGSS 511

Query: 1520 DKNLRAFLDDVGRIVQN 1570
            D+N+  FLD V +  Q+
Sbjct: 512  DRNMNTFLDYVSKFEQD 528


>XP_017405475.1 PREDICTED: crocetin glucosyltransferase, chloroplastic [Vigna
            angularis] KOM25418.1 hypothetical protein
            LR48_Vigan102s007600 [Vigna angularis] BAT86657.1
            hypothetical protein VIGAN_04433100 [Vigna angularis var.
            angularis]
          Length = 472

 Score =  573 bits (1478), Expect = 0.0
 Identities = 295/471 (62%), Positives = 347/471 (73%), Gaps = 2/471 (0%)
 Frame = +2

Query: 146  MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVF 325
            MVQ RFLLITYPIQGH+NP+LQFAKRLA +GV VTFAT+V+LHRR++ KPT  GLSF  F
Sbjct: 1    MVQQRFLLITYPIQGHVNPSLQFAKRLAAIGVHVTFATSVYLHRRMLKKPTVPGLSFVTF 60

Query: 326  XXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATV 505
                         ++V SY++ELKRRGSE L + I SA +EG+PFT + YTL+LPWVA V
Sbjct: 61   SDGYDDGYKTTDDSDVNSYMAELKRRGSEFLGNIITSAKEEGKPFTCVTYTLMLPWVAKV 120

Query: 506  ARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPS 685
            ARE  +P  LLWIQAATV DIYY+YFHE+ DYI    +     IE PGLPFSL +RD+PS
Sbjct: 121  AREFLIPGVLLWIQAATVLDIYYHYFHEYKDYINTIQKSENSIIEFPGLPFSLATRDVPS 180

Query: 686  FLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPIGP 865
            FLL SN Y+FALPS +EQ ++ +EETNP VLVNT ++LEPE+L+A+D    KL MI IGP
Sbjct: 181  FLLPSNAYSFALPSLEEQFKLFDEETNPRVLVNTFQDLEPEALKAID----KLTMISIGP 236

Query: 866  LIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQMEEIA 1045
            LIPSAFLDG DP DTSFGGD    S+DY++WLDSK   SVVYVSFG+LAVL +RQMEE+A
Sbjct: 237  LIPSAFLDGKDPTDTSFGGDIYHGSNDYIKWLDSKPALSVVYVSFGTLAVLGQRQMEEVA 296

Query: 1046 RALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSLGC 1225
            RALLD   PFLWVI                        + GKIV WCSQ+EVLSH SLGC
Sbjct: 297  RALLDSGYPFLWVIREPNGKQEKEEKLQELSCRKELEQR-GKIVNWCSQVEVLSHGSLGC 355

Query: 1226 FVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVR--XXXXXXXXXXXXX 1399
            F+THCGWNST+ESLS G+PMV+FPQW+DQ TNAKL+EDVWKTGVR               
Sbjct: 356  FLTHCGWNSTMESLSSGIPMVSFPQWSDQGTNAKLVEDVWKTGVRVDGKANAEGIVEGEE 415

Query: 1400 XRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDV 1552
             R+CLEVVMGS GEK +E+RRNA +WK LAREAVKEGGSSD+N+R FLD V
Sbjct: 416  IRKCLEVVMGS-GEKRDELRRNAEKWKCLAREAVKEGGSSDRNIRTFLDYV 465


>XP_014521681.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Vigna
            radiata var. radiata]
          Length = 469

 Score =  569 bits (1466), Expect = 0.0
 Identities = 293/471 (62%), Positives = 345/471 (73%), Gaps = 2/471 (0%)
 Frame = +2

Query: 146  MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVF 325
            MVQ RFLLITYPIQGH+NP+LQFAKRLA +GV VTFAT+V+LHRR++ KPT  GLSF  F
Sbjct: 1    MVQPRFLLITYPIQGHVNPSLQFAKRLAAIGVHVTFATSVYLHRRMLKKPTVPGLSFVTF 60

Query: 326  XXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATV 505
                         + V SY++ELKRRGSE L + I SA +EG+PFT L YTL+LPWVA V
Sbjct: 61   SDGYDDGYKTTEDSHVNSYMAELKRRGSEFLGNIITSAKEEGKPFTCLTYTLMLPWVAKV 120

Query: 506  ARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPS 685
            ARE  +P  LLWIQ ATV DIYY+YFHE+ DYI    +     I+ PGLPFSL +RD+PS
Sbjct: 121  AREFRIPGVLLWIQPATVLDIYYHYFHEYRDYINTIHKSENSIIDFPGLPFSLATRDVPS 180

Query: 686  FLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPIGP 865
            FLL SN Y+FA+PS +EQ ++ +EETNP VLVNT ++LEPE+L+A+D    KL MI IGP
Sbjct: 181  FLLPSNAYSFAIPSLEEQFKLFDEETNPRVLVNTFQDLEPEALKAID----KLTMISIGP 236

Query: 866  LIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQMEEIA 1045
            LIPSAFLDG DP DTSFG D  + S+DY+EWLDSK   SVVYVSFG+LAVL +RQMEE+A
Sbjct: 237  LIPSAFLDGKDPTDTSFGADIYNGSNDYIEWLDSKPALSVVYVSFGTLAVLGQRQMEEVA 296

Query: 1046 RALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSLGC 1225
            RALLD   PFLWVI                        + GKIV WCSQ+EVLSH SLGC
Sbjct: 297  RALLDSGYPFLWVI----REPNGKQEKLEELSCRKELEERGKIVNWCSQVEVLSHGSLGC 352

Query: 1226 FVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVR--XXXXXXXXXXXXX 1399
            F+THCGWNST+ESLS G+PMV+FPQW+DQ TNAKL+EDVWKTGVR               
Sbjct: 353  FLTHCGWNSTMESLSSGIPMVSFPQWSDQGTNAKLVEDVWKTGVRVDGKANAEGIVDGEE 412

Query: 1400 XRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDV 1552
             R+CLEVVMGS GEK +E+RRNA +WK LAREAVKEGGSSD+N+R FLD V
Sbjct: 413  IRKCLEVVMGS-GEKRDELRRNAEKWKCLAREAVKEGGSSDRNIRTFLDYV 462


>XP_002263700.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Vitis
            vinifera]
          Length = 469

 Score =  546 bits (1406), Expect = 0.0
 Identities = 280/465 (60%), Positives = 336/465 (72%)
 Frame = +2

Query: 161  FLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVFXXXXX 340
            FLL+T+P QGHINPALQFAKR+   G  V+FAT+V  HRR+  + T +GL+F  F     
Sbjct: 6    FLLVTFPAQGHINPALQFAKRIIRTGAQVSFATSVSAHRRMAKRSTPEGLNFVPFSDGYD 65

Query: 341  XXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATVARELH 520
                     +V  Y+SE+KRRGSE+LR+ ++  A EGQPFT + YTLLLPW A VAR L 
Sbjct: 66   DGFKPTD--DVQHYMSEIKRRGSETLREIVVRNADEGQPFTCIVYTLLLPWAAEVARGLG 123

Query: 521  LPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPSFLLAS 700
            +PSALLWIQ ATV DIYYYYF+ +GD     + +P+ S+ELPGLP  L+SRDLPSFL+ S
Sbjct: 124  VPSALLWIQPATVLDIYYYYFNGYGDVFRNISNEPSCSVELPGLPL-LSSRDLPSFLVKS 182

Query: 701  NTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPIGPLIPSA 880
            N YTF LP+F+EQLE L +ET+P VLVNT + LEPE LRAVD    KL +I IGPL+PSA
Sbjct: 183  NAYTFVLPTFQEQLEALSQETSPKVLVNTFDALEPEPLRAVD----KLHLIGIGPLVPSA 238

Query: 881  FLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQMEEIARALLD 1060
            +LDG DP+DTSFGGD    S DY+EWL+SK +SSVVYVSFGS++VL K Q E+IARALLD
Sbjct: 239  YLDGKDPSDTSFGGDMFQGSDDYMEWLNSKPKSSVVYVSFGSISVLSKTQKEDIARALLD 298

Query: 1061 CKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSLGCFVTHC 1240
            C  PFLWVI                          G IV WCSQ+EVL+H SLGCFV+HC
Sbjct: 299  CGHPFLWVIRAPENGEEVKEQDKLSCREELEQK--GMIVSWCSQIEVLTHPSLGCFVSHC 356

Query: 1241 GWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVRXXXXXXXXXXXXXXRRCLEV 1420
            GWNSTLESL  GVP+VAFPQWTDQ TNAKLIED+WK G+R              +RCLE+
Sbjct: 357  GWNSTLESLVSGVPVVAFPQWTDQGTNAKLIEDMWKIGIRVTVNEEGIVESDEFKRCLEI 416

Query: 1421 VMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDVG 1555
            VMG GGEKGEE+RRNA +WK+LAREAVK+GGSSDKNL+ F+D+VG
Sbjct: 417  VMG-GGEKGEEMRRNAEKWKNLAREAVKDGGSSDKNLKGFVDEVG 460


>XP_003531212.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Glycine
            max] KRH42734.1 hypothetical protein GLYMA_08G107500
            [Glycine max]
          Length = 465

 Score =  538 bits (1387), Expect = 0.0
 Identities = 287/473 (60%), Positives = 328/473 (69%), Gaps = 4/473 (0%)
 Frame = +2

Query: 146  MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVF 325
            M +HRFLLI YP QGHI+PA Q AKRL ++G  VT +TTVH+HRR+ NKPT   LSF  F
Sbjct: 1    MFRHRFLLILYPAQGHIHPAFQLAKRLVSLGAHVTVSTTVHMHRRITNKPTLPHLSFLPF 60

Query: 326  XXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATV 505
                         ++ + + S  KRRGSE + + ILS A+EG PFT L YT LL WVA V
Sbjct: 61   SDGYDDGFTS---SDFSLHASVFKRRGSEFVTNLILSNAQEGHPFTCLVYTTLLSWVAEV 117

Query: 506  ARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPS 685
            ARE HLP+A+LW Q AT+ DI+YYYFHEHG+YI  K +DP+  IELPGLP  L  RDLPS
Sbjct: 118  AREFHLPTAMLWTQPATILDIFYYYFHEHGEYIKDKIKDPSCFIELPGLPLLLAPRDLPS 177

Query: 686  FLLASNTY--TFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPI 859
            FLL SN    +F +P F++    L+ ET P +LVNT E LE E+LRAVD    K  MIPI
Sbjct: 178  FLLGSNPTIDSFIVPMFEKMFYDLDVETKPRILVNTFEALEAEALRAVD----KFNMIPI 233

Query: 860  GPLIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQMEE 1039
            GPLIPSAFLDG D  DTSFGGD   +S+   EWLDSK E SVVYVSFGSL VLPK QMEE
Sbjct: 234  GPLIPSAFLDGKDTNDTSFGGDIFRLSNGCSEWLDSKPEMSVVYVSFGSLCVLPKTQMEE 293

Query: 1040 IARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSL 1219
            +ARALLDC  PFLWVI                        + GKIV WCSQ+EVLSH S+
Sbjct: 294  LARALLDCGSPFLWVI--KEKENKSQVEGKEELSCIEELEQKGKIVNWCSQVEVLSHGSV 351

Query: 1220 GCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVR--XXXXXXXXXXX 1393
            GCFVTHCGWNST+ESL+ GVPMVAFPQW +Q TNAKLIEDVWKTGVR             
Sbjct: 352  GCFVTHCGWNSTMESLASGVPMVAFPQWVEQKTNAKLIEDVWKTGVRVDKQVNEDGIVEN 411

Query: 1394 XXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDV 1552
               RRCLE VMGS GEKG+E+R NA +W+ LAREAVKEGGSSDKNLRAFLDDV
Sbjct: 412  EEIRRCLEEVMGS-GEKGQELRNNAEKWRGLAREAVKEGGSSDKNLRAFLDDV 463


>AAY27090.1 UDP-glucose:flavonoid 7-O-glucosyltransferase [Pyrus communis]
          Length = 481

 Score =  533 bits (1374), Expect = 0.0
 Identities = 276/480 (57%), Positives = 338/480 (70%), Gaps = 11/480 (2%)
 Frame = +2

Query: 146  MVQHRFLLITYPIQGHINPALQFAKRLA-TMGVPVTFATTVHLHRRLVNKPTTKGLSFAV 322
            MVQHRFLL+TYP QGHINP+LQFAKRL  T G  VT+ T++  HRR+ N     GL++A 
Sbjct: 1    MVQHRFLLVTYPAQGHINPSLQFAKRLTNTTGAHVTYVTSLSAHRRIGNGSIPDGLTYAP 60

Query: 323  FXXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVAT 502
            F               +  Y+SEL+ RG++++ D ++++A EG P+T L Y+L++PW A 
Sbjct: 61   FSDGYDDGFKPGD--NIDDYMSELRHRGAQAITDLVVASANEGHPYTCLVYSLIVPWSAG 118

Query: 503  VARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTY-----SIELPGLPFSLT 667
            VA ELHLPS LLWIQ ATVFDIYYYYF+ + D I   T   T      SIELPGLP S T
Sbjct: 119  VAHELHLPSVLLWIQPATVFDIYYYYFNGYKDLIRDNTSSGTNNVLPCSIELPGLPLSFT 178

Query: 668  SRDLPSFLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLR 847
            SRDLPSF++ +N Y FALP F+EQ+E+LE ETNP +LVNT + LEPE+L+A+D+Y+    
Sbjct: 179  SRDLPSFMVDTNPYNFALPLFQEQMELLERETNPTILVNTFDALEPEALKAIDKYN---- 234

Query: 848  MIPIGPLIPSAFLDGNDPADTSFGGDTISVSSD--YLEWLDSKTESSVVYVSFGSLAVLP 1021
            +I +GPLIPSAFLDG DP+D SFGGD +  S D  YLEWL+SK E SV+YVSFGS++VL 
Sbjct: 235  LIGVGPLIPSAFLDGKDPSDKSFGGDLVQKSRDSSYLEWLNSKPEGSVIYVSFGSISVLG 294

Query: 1022 KRQMEEIARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKI---GKIVKWCSQ 1192
            K QMEEIA+ LLDC +PFLWVI                        ++   G+IV WCSQ
Sbjct: 295  KAQMEEIAKGLLDCGLPFLWVIRDKVDKKGDDNEAKQEEAMLSCRVELEELGRIVPWCSQ 354

Query: 1193 LEVLSHRSLGCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVRXXXX 1372
            +EVLS  SLGCFVTHCGWNS+LESL  GVP+VAFPQWTDQ TNAKLIED WKTGVR    
Sbjct: 355  VEVLSSPSLGCFVTHCGWNSSLESLVSGVPVVAFPQWTDQGTNAKLIEDFWKTGVRVTPN 414

Query: 1373 XXXXXXXXXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDV 1552
                      +RCL++V+GS GE GEEVRRNA++WK LAREAV EGGSSDKNL+AFLD +
Sbjct: 415  VEGIVTGEELKRCLDLVLGS-GEIGEEVRRNAKKWKDLAREAVNEGGSSDKNLKAFLDQI 473


>XP_018844047.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Juglans
            regia]
          Length = 463

 Score =  531 bits (1369), Expect = 0.0
 Identities = 273/465 (58%), Positives = 330/465 (70%), Gaps = 1/465 (0%)
 Frame = +2

Query: 164  LLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVFXXXXXX 343
            LL+T+P QGHINP LQFAKRL  +G  VT AT+V  +RR+   P  +GLSFA F      
Sbjct: 8    LLVTFPAQGHINPGLQFAKRLIRLGAHVTLATSVSAYRRMTKTPIPQGLSFATFSDGYDD 67

Query: 344  XXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATVARELHL 523
                    +   Y+S +KR GS++L D I+S+  EG+PF YL YTLLLPW   VA ELHL
Sbjct: 68   GFKPGTD-DAEHYMSAIKRSGSKTLTDLIVSSTNEGRPFQYLVYTLLLPWAGNVAHELHL 126

Query: 524  PSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPSFLLASN 703
            PSALLWIQ ATV DIYYYYF+ +GD I +K  DP+YS++LPGLP  L  RDLPSFLL SN
Sbjct: 127  PSALLWIQPATVLDIYYYYFNGYGDDIRKKGTDPSYSLQLPGLPL-LYGRDLPSFLLDSN 185

Query: 704  TYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPIGPLIPSAF 883
            TYTFALPSF+EQ+E LE+E+NP VLVNT + LEPE+LR ++    K  +  +GPLIPSAF
Sbjct: 186  TYTFALPSFQEQIEALEKESNPTVLVNTFDALEPEALRVIE----KFNLTAVGPLIPSAF 241

Query: 884  LDGNDPADTSFGGDTISVSSD-YLEWLDSKTESSVVYVSFGSLAVLPKRQMEEIARALLD 1060
            LDG DP+D +FGGD    S + Y+EWL+SK  SSV+YVSFGS++ L K+QMEE+AR LLD
Sbjct: 242  LDGKDPSDKAFGGDLFQGSKEYYIEWLNSKPNSSVIYVSFGSISTLAKQQMEEMARGLLD 301

Query: 1061 CKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSLGCFVTHC 1240
            C  PFLWVI                        + G IV WCSQ+EVLSH SL CFVTHC
Sbjct: 302  CGRPFLWVI------RAKENGEEERLSCREELEQKGMIVPWCSQVEVLSHPSLACFVTHC 355

Query: 1241 GWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVRXXXXXXXXXXXXXXRRCLEV 1420
            GWNS+LESL  GVP+VAFPQWTDQ TNAKLIEDVWKTG+R              +RCLE+
Sbjct: 356  GWNSSLESLVSGVPVVAFPQWTDQGTNAKLIEDVWKTGLRVTANKDGIVESDEIKRCLEL 415

Query: 1421 VMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDVG 1555
            V G GGE+GEE+RRNA++WK LAREA KEGGSS KNL+AF++++G
Sbjct: 416  VAG-GGERGEEMRRNAKKWKELAREAAKEGGSSHKNLKAFVEEIG 459


>KYP56999.1 Anthocyanin 5-O-glucosyltransferase [Cajanus cajan]
          Length = 420

 Score =  529 bits (1362), Expect = 0.0
 Identities = 276/428 (64%), Positives = 315/428 (73%), Gaps = 2/428 (0%)
 Frame = +2

Query: 281  LVNKPTTKGLSFAVFXXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPF 460
            ++ KPT  GLSF+ F             ++V SYISELKRRG+E LR+ I +A  +GQPF
Sbjct: 1    MLKKPTVPGLSFSTFSDGYDDGYRTTDDSQVVSYISELKRRGAECLRNIITAAKNDGQPF 60

Query: 461  TYLAYTLLLPWVATVARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIE 640
            T LAYT+LLPW   VARE HLP ALLWIQ ATVFDIYY+YFH++ DYI+ ++      IE
Sbjct: 61   TCLAYTILLPWAGRVAREFHLPGALLWIQPATVFDIYYHYFHQYKDYISHQSH-----IE 115

Query: 641  LPGLPFSLTSRDLPSFLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRA 820
            LPGLPFSLTSRD+PSFLL+SN YTFALP+F+EQ E L+ ETNPIVLVNT EELE ESLRA
Sbjct: 116  LPGLPFSLTSRDIPSFLLSSNIYTFALPTFQEQFEDLDGETNPIVLVNTFEELESESLRA 175

Query: 821  VDEYDKKLRMIPIGPLIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSF 1000
            VD    K  MIPIGPLIPSAFLDG DP+DTSFGGD    S+DY+EWLDSK E SVVYVSF
Sbjct: 176  VD----KFTMIPIGPLIPSAFLDGKDPSDTSFGGDIFDGSNDYVEWLDSKPELSVVYVSF 231

Query: 1001 GSLAVLPKRQMEEIARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVK 1180
            G+LAVL KRQ EE+ARALLD   PFLWVI                          GKIVK
Sbjct: 232  GTLAVLNKRQTEELARALLDSGYPFLWVIRENNGKEEKEKLQEVSCREELEVK--GKIVK 289

Query: 1181 WCSQLEVLSHRSLGCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVR 1360
            WCSQ+EVLSH SLGCFV+HCGWNST ESL+ GVPMVAFPQW+DQ TNAKL++DVWKTGVR
Sbjct: 290  WCSQVEVLSHGSLGCFVSHCGWNSTTESLASGVPMVAFPQWSDQMTNAKLVQDVWKTGVR 349

Query: 1361 --XXXXXXXXXXXXXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLR 1534
                            R+CLEVVMGS GEKG+E+RRNA +WK LAREAVKEGGSSD+N+R
Sbjct: 350  VDDKVNEEGVVEAEEIRKCLEVVMGS-GEKGQEMRRNAEKWKCLAREAVKEGGSSDRNMR 408

Query: 1535 AFLDDVGR 1558
             FLDDV +
Sbjct: 409  TFLDDVAK 416


>XP_007221288.1 hypothetical protein PRUPE_ppa016890mg [Prunus persica] ONI28923.1
            hypothetical protein PRUPE_1G169300 [Prunus persica]
          Length = 474

 Score =  529 bits (1362), Expect = 0.0
 Identities = 276/480 (57%), Positives = 336/480 (70%), Gaps = 10/480 (2%)
 Frame = +2

Query: 146  MVQHRFLLITYPIQGHINPALQFAKRLA-TMGVPVTFATTVHLHRRLVNKPTTKGLSFAV 322
            MVQHRFL +TYP QGHINPALQ AKRL    G  VT+ T+++ +RR+VN  T  GL++A 
Sbjct: 1    MVQHRFLFLTYPAQGHINPALQLAKRLIRNTGAQVTYVTSLYAYRRIVNGSTPNGLTYAP 60

Query: 323  FXXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVAT 502
            +              +V  Y+SEL+R GS+ + D + S+AKEG P+T L YT+LLPW A 
Sbjct: 61   YSDGYDDGFKFSD--DVDHYMSELRRAGSQVITDLVASSAKEGHPYTCLVYTILLPWAAD 118

Query: 503  VARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQK----TEDPTYSIELPGLPFSLTS 670
            +ARELHLPS L WIQAAT+FD+YYYY   + D I +     T DP+ SI+LPGLP  L S
Sbjct: 119  LARELHLPSVLAWIQAATLFDVYYYYLSGYKDLIRESFGTDTNDPSCSIQLPGLPLDLAS 178

Query: 671  RDLPSFLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRM 850
            RDLPSF++A N+Y FALP F++Q E+LE ET PI+LVNT + LEPE+L+A+D+Y+    +
Sbjct: 179  RDLPSFMVAENSYNFALPLFEKQFELLERETKPIILVNTFDALEPEALKAIDKYN----L 234

Query: 851  IPIGPLIPSAFLDGNDPADTSFGGDTI--SVSSDYLEWLDSKTESSVVYVSFGSLAVLPK 1024
            I IGPLIPSAFLDG DP+DTSFGGD    S+ S  +EWL+SK E SV+YVSFGS++ L K
Sbjct: 235  IGIGPLIPSAFLDGKDPSDTSFGGDLFQKSMDSSCIEWLNSKPEGSVIYVSFGSVSALSK 294

Query: 1025 RQMEEIARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKI---GKIVKWCSQL 1195
             QMEEIA+ LLD   PFLWVI                        ++   GKIV WCSQL
Sbjct: 295  DQMEEIAKGLLDYGRPFLWVIREKEERNGQDNETEKEEEKFSCREELKELGKIVLWCSQL 354

Query: 1196 EVLSHRSLGCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVRXXXXX 1375
            EVLS+ SLGCFVTHCGWNS++ESL  GVP+VAFP WTDQ TNAKLIED WKTGVR     
Sbjct: 355  EVLSNPSLGCFVTHCGWNSSMESLVSGVPVVAFPLWTDQRTNAKLIEDTWKTGVRVAPNE 414

Query: 1376 XXXXXXXXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDVG 1555
                     +RCLE+VMGS GE GEE+RRNA++WK LAREAV EGGSSDKNL AFLD +G
Sbjct: 415  EGIVVGEELKRCLELVMGS-GEIGEELRRNAKKWKGLAREAVSEGGSSDKNLMAFLDQIG 473


>NP_001315912.1 crocetin glucosyltransferase, chloroplastic-like [Malus domestica]
            AAX16493.1 UDP-glucose:flavonoid 7-O-glucosyltransferase
            [Malus domestica]
          Length = 481

 Score =  529 bits (1362), Expect = 0.0
 Identities = 277/480 (57%), Positives = 335/480 (69%), Gaps = 11/480 (2%)
 Frame = +2

Query: 146  MVQHRFLLITYPIQGHINPALQFAKRLA-TMGVPVTFATTVHLHRRLVNKPTTKGLSFAV 322
            MVQHRFLL+T+P QGHINP+LQFAKRL  T G  VT+ T++  HRR+ N     GL++A 
Sbjct: 1    MVQHRFLLVTFPAQGHINPSLQFAKRLINTTGAHVTYVTSLSAHRRIGNGSIPDGLTYAP 60

Query: 323  FXXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVAT 502
            F               V  Y+SEL+RRG +++ D ++++A EG P+T L Y+LLLPW A 
Sbjct: 61   FSDGYDDGFKPGD--NVDDYMSELRRRGVQAITDLVVASANEGHPYTCLVYSLLLPWSAG 118

Query: 503  VARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTY-----SIELPGLPFSLT 667
            +A ELHLPS LLWIQ ATVFDIYYYYF+ + D I   T   T      SIELPGLP S T
Sbjct: 119  MAHELHLPSVLLWIQPATVFDIYYYYFNGYKDLIRDNTSSGTNNVLPCSIELPGLPLSFT 178

Query: 668  SRDLPSFLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLR 847
            SRDLPSF++ +N Y FALP F+EQ+E+LE ETNP +LVNT + LEPE+L+A+D+Y+    
Sbjct: 179  SRDLPSFMVDTNPYNFALPLFQEQMELLERETNPTILVNTFDALEPEALKAIDKYN---- 234

Query: 848  MIPIGPLIPSAFLDGNDPADTSFGGDTISVSSD--YLEWLDSKTESSVVYVSFGSLAVLP 1021
            +I +GPLIPSAFLDG DP+D SFGGD    S D  YLEWL+SK E SV+YVSFGS++VL 
Sbjct: 235  LIGVGPLIPSAFLDGKDPSDKSFGGDLFQKSKDSSYLEWLNSKPEGSVIYVSFGSISVLG 294

Query: 1022 KRQMEEIARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKI---GKIVKWCSQ 1192
            K QMEEIA+ LLDC +PFLWVI                        ++   G IV WCSQ
Sbjct: 295  KAQMEEIAKGLLDCGLPFLWVIRDKVGKKGDDNEAKKEEEMLRCREELEELGMIVPWCSQ 354

Query: 1193 LEVLSHRSLGCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVRXXXX 1372
            +EVLS  SLGCFVTHCGWNS+LESL  GVP+VAFPQWTDQ TNAKLIED WKTGVR    
Sbjct: 355  VEVLSSPSLGCFVTHCGWNSSLESLVSGVPVVAFPQWTDQGTNAKLIEDYWKTGVRVTPN 414

Query: 1373 XXXXXXXXXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDV 1552
                      +RCL++V+GS GE GE+VRRNA++WK LAREAV EG SSDKNLRAFLD +
Sbjct: 415  EEGIVTGEELKRCLDLVLGS-GEIGEDVRRNAKKWKDLAREAVSEGDSSDKNLRAFLDQI 473


>XP_018848545.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Juglans
            regia]
          Length = 466

 Score =  526 bits (1356), Expect = 0.0
 Identities = 271/470 (57%), Positives = 331/470 (70%)
 Frame = +2

Query: 146  MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVF 325
            MV H FLL+ +P QGHINPALQFAKRL  +G  VTFATTV  HRR+   PT  GLSFA F
Sbjct: 1    MVNHHFLLVIFPAQGHINPALQFAKRLIRLGAHVTFATTVAAHRRMNKSPTPDGLSFATF 60

Query: 326  XXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATV 505
                          +   Y+SELKRRGS++L D ++S+A +G+ FT L Y++LLPW   V
Sbjct: 61   SDGYDDGGFKHGDHDFVDYMSELKRRGSQTLTDLVVSSANKGRTFTCLVYSILLPWACDV 120

Query: 506  ARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPS 685
            ARELHL SALLWIQ ATV DIYYYYF+ +GD I +   DP++SIELPGLP SLTSRDLPS
Sbjct: 121  ARELHLLSALLWIQPATVLDIYYYYFNGYGDVI-RNIPDPSFSIELPGLP-SLTSRDLPS 178

Query: 686  FLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPIGP 865
            F+   NT+TFALP F+E  E L +E+NP VLVNT + LEPE+LRA++ +        IGP
Sbjct: 179  FMADLNTHTFALPLFQEHFEELGKESNPRVLVNTFDALEPEALRAIERFS----FTGIGP 234

Query: 866  LIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQMEEIA 1045
            LIPSAFLDG DP+DT+FGGD    ++DY+EWL+SK  SSV+YVSFGSL++L K QMEEIA
Sbjct: 235  LIPSAFLDGKDPSDTAFGGDLFQGATDYIEWLNSKPSSSVIYVSFGSLSLLAKNQMEEIA 294

Query: 1046 RALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSLGC 1225
            R LLD   PFLWV                         + GK V+WCSQ+EVLSH S+ C
Sbjct: 295  RGLLDYGCPFLWV---KRANENGEEKEEDRLSCREELEQKGKFVQWCSQVEVLSHPSVAC 351

Query: 1226 FVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVRXXXXXXXXXXXXXXR 1405
            FVTHCGWNSTLESL  GVP+VAFPQWTDQ TNAKL++DVWK G+R              +
Sbjct: 352  FVTHCGWNSTLESLVSGVPLVAFPQWTDQGTNAKLVQDVWKIGLRVTTNKDGIVEGDEIK 411

Query: 1406 RCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDVG 1555
            RCLE+V+G+ GE+GEE+R+NA++WK LAREA  EGG+S  NL+AF+D+ G
Sbjct: 412  RCLELVLGN-GERGEEMRKNAKKWKDLAREAAMEGGTSYNNLKAFVDEFG 460


>XP_003524180.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Glycine
            max] KRH58827.1 hypothetical protein GLYMA_05G150600
            [Glycine max]
          Length = 460

 Score =  526 bits (1354), Expect = 0.0
 Identities = 281/474 (59%), Positives = 324/474 (68%), Gaps = 5/474 (1%)
 Frame = +2

Query: 146  MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVF 325
            M +HRFL++ YP QGHINPA QFAKRL ++G  VT +TTVH+HRR+ NKPT   LSF  F
Sbjct: 1    MFRHRFLIVMYPAQGHINPAFQFAKRLVSLGAHVTVSTTVHMHRRITNKPTLPHLSFLPF 60

Query: 326  XXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATV 505
                          + A   SE KRRGSE + + I S A+EG PFT L +T+LLPW A  
Sbjct: 61   SDGYDDGYTS---TDYALQASEFKRRGSEFVTNLIASKAQEGHPFTCLVHTVLLPWAARA 117

Query: 506  ARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPS 685
            AR  HLP+ALLW Q AT+ DI+Y YFHEHGDYI  K +DP+ SIELPGLP  L  RDLPS
Sbjct: 118  ARGFHLPTALLWTQPATILDIFYCYFHEHGDYIKGKIKDPSSSIELPGLPLLLAPRDLPS 177

Query: 686  FLLASNTY--TFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPI 859
            FLL SN    + A+  F+EQL  L+ +  P +LVNT E LE E+LRAVD ++    MIPI
Sbjct: 178  FLLGSNPTIDSLAVSMFEEQLHDLDMQAKPRILVNTFEALEHEALRAVDNFN----MIPI 233

Query: 860  GPLIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQMEE 1039
            GPLIPSAFLDG DP DTSFGGD    S+D  EWLDSK E SVVYVSFGS  VL K+QMEE
Sbjct: 234  GPLIPSAFLDGKDPTDTSFGGDIFRPSNDCGEWLDSKPEMSVVYVSFGSFCVLSKKQMEE 293

Query: 1040 IARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSL 1219
            +A ALLDC  PFLWV                         + GKIV WCSQ+EVLSHRS+
Sbjct: 294  LALALLDCGSPFLWV---------SREKEEEELSCREELEQKGKIVNWCSQVEVLSHRSV 344

Query: 1220 GCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVRXXXXXXXXXXXXX 1399
            GCFVTHCGWNST+ESL+ GVPM AFPQW +Q TNAKLIEDVWKTGVR             
Sbjct: 345  GCFVTHCGWNSTMESLASGVPMFAFPQWIEQKTNAKLIEDVWKTGVRVDKQVNEEGIVEK 404

Query: 1400 XR--RCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEG-GSSDKNLRAFLDDV 1552
                +CLEV MGS G+KG+E+R NA+ WK LAREAVKEG GSSDKNLRAFLDD+
Sbjct: 405  EEIIKCLEVAMGS-GKKGQELRNNAKNWKGLAREAVKEGSGSSDKNLRAFLDDL 457


>XP_018809219.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Juglans
            regia]
          Length = 502

 Score =  523 bits (1348), Expect = e-179
 Identities = 274/490 (55%), Positives = 337/490 (68%), Gaps = 3/490 (0%)
 Frame = +2

Query: 95   SQRPQEHYHAPSRASAAMVQH--RFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVH 268
            SQ    H  AP   +   ++     LL+T+P QGHINPALQFAK L  +G  VT AT+V 
Sbjct: 18   SQTLTPHPQAPPCTTTTTMEKYPHVLLVTFPAQGHINPALQFAKGLVRLGALVTLATSVS 77

Query: 269  LHRRLVNKPTTKGLSFAVFXXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKE 448
             +RR+   P  +GLSFA +              ++  YI  +K  GS++L D I+S+A E
Sbjct: 78   SYRRMTKTPAPQGLSFATYSDGYDDGFKPTTD-DLEHYIFAIKHSGSKTLTDLIVSSANE 136

Query: 449  GQPFTYLAYTLLLPWVATVARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPT 628
            G+PF YL Y +LLPW   VARELHLPSA+LWIQ ATV DIYYYYF+ +GD I +   DP+
Sbjct: 137  GRPFQYLVYNMLLPWAGNVARELHLPSAVLWIQPATVLDIYYYYFNGYGDDIRKNGTDPS 196

Query: 629  YSIELPGLPFSLTSRDLPSFLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPE 808
            YS++LPGLP  L  RDLPSFLL SNTYTFALPSF+EQ E LE+E+NP VLVNT + LEPE
Sbjct: 197  YSLQLPGLPL-LYGRDLPSFLLGSNTYTFALPSFQEQFEALEKESNPRVLVNTFDGLEPE 255

Query: 809  SLRAVDEYDKKLRMIPIGPLIPSAFLDGNDPADTSFGGDTISVSSDY-LEWLDSKTESSV 985
            +LR ++    KL +  +GPLIPSAFLDG DP+D +FGGD    S +Y +EWL+SK  SSV
Sbjct: 256  ALRVIE----KLNLSAVGPLIPSAFLDGKDPSDKAFGGDLFQGSKEYYIEWLNSKPNSSV 311

Query: 986  VYVSFGSLAVLPKRQMEEIARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKI 1165
            +YVSFGS+AVL K+QMEE+AR LLDC  PFLWVI                          
Sbjct: 312  IYVSFGSMAVLAKQQMEEMARGLLDCGRPFLWVIRAKEKGEEETEEERLSCRKELEQK-- 369

Query: 1166 GKIVKWCSQLEVLSHRSLGCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVW 1345
            G IV WCSQ+EVLSH SL CFVTHCGWNS+LESL  GVP+VAFPQW+DQ TNAKLIEDVW
Sbjct: 370  GMIVPWCSQVEVLSHPSLACFVTHCGWNSSLESLVTGVPVVAFPQWSDQGTNAKLIEDVW 429

Query: 1346 KTGVRXXXXXXXXXXXXXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDK 1525
            KTG+R              +RCLE+V G GG++GEE+RRNA++WK LAREA +EGGSS K
Sbjct: 430  KTGLRVTANKDGIVEGDEIKRCLELVAG-GGDRGEELRRNAKKWKELAREAAREGGSSYK 488

Query: 1526 NLRAFLDDVG 1555
            NL+AF++++G
Sbjct: 489  NLKAFVEEIG 498


>XP_008348418.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Malus
            domestica]
          Length = 474

 Score =  522 bits (1344), Expect = e-179
 Identities = 271/480 (56%), Positives = 331/480 (68%), Gaps = 11/480 (2%)
 Frame = +2

Query: 146  MVQHRFLLITYPIQGHINPALQFAKRLA-TMGVPVTFATTVHLHRRLVNKPTTKGLSFAV 322
            MVQHRFLL+TYP QGHINP+LQFAKRL  T G  VTF T++  H R+ N     GL++A 
Sbjct: 1    MVQHRFLLVTYPAQGHINPSLQFAKRLINTTGAHVTFITSLSAHHRIGNGSIPDGLTYAP 60

Query: 323  FXXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVAT 502
            F               +  Y+SEL+  G++++ D ++S+  EG P+T + YT+LLPW A 
Sbjct: 61   FSDGYDDGFKPGD--NIDHYLSELRHHGAQAITDLVVSSENEGHPYTCMVYTILLPWAAD 118

Query: 503  VARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTY-----SIELPGLPFSLT 667
            VA ELHLP+ LLWIQ ATVFDIYYYYF+   D I   T   T      SIELPGLP SLT
Sbjct: 119  VAHELHLPNVLLWIQPATVFDIYYYYFNGFKDLIRDNTSSGTNDALPCSIELPGLPLSLT 178

Query: 668  SRDLPSFLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLR 847
            SRDLPSF++ +N Y FALP F+EQ+++LE ETNPI+LVNT + LEPE+L+A ++Y+    
Sbjct: 179  SRDLPSFMVDTNPYNFALPLFQEQMDLLERETNPIILVNTFDALEPEALKATEKYN---- 234

Query: 848  MIPIGPLIPSAFLDGNDPADTSFGGDTISVSSD--YLEWLDSKTESSVVYVSFGSLAVLP 1021
            +I +GPLIP+ FLDG DP+D SFGGD +  S D  YLEWL+ K E SV+YVSFGS+ VL 
Sbjct: 235  LIGVGPLIPTTFLDGKDPSDKSFGGDLLKKSKDSPYLEWLNLKPEGSVIYVSFGSICVLE 294

Query: 1022 KRQMEEIARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKI---GKIVKWCSQ 1192
            K QMEEIA+ LLDC  PFLWVI                        ++   G+IV WCSQ
Sbjct: 295  KAQMEEIAKGLLDCGRPFLWVIRDKVNKKGEDNEAKEEEEMLSCREELEELGRIVPWCSQ 354

Query: 1193 LEVLSHRSLGCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVRXXXX 1372
            +EVLS  SLGCFVTHCGWNS+LES + GVP+VAFPQWTDQ TNAKLIED WKTG+R    
Sbjct: 355  VEVLSSPSLGCFVTHCGWNSSLESFASGVPVVAFPQWTDQGTNAKLIEDAWKTGLRVTPN 414

Query: 1373 XXXXXXXXXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDV 1552
                      +RCLE+VMGS GE GEE+RRNA++WK LAREAV EGGSSDKNL+AFLD +
Sbjct: 415  EKGIVTGDELKRCLELVMGS-GEIGEEMRRNAKKWKDLAREAVSEGGSSDKNLKAFLDRI 473


>AMO27404.1 anthocyanidin 3-o-glucoside 5-o-glucosyltransferase 1-like protein
            [Glycine max]
          Length = 478

 Score =  520 bits (1340), Expect = e-179
 Identities = 278/475 (58%), Positives = 331/475 (69%), Gaps = 6/475 (1%)
 Frame = +2

Query: 146  MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVF 325
            MV  RFLL+TYP Q HINPALQ AKRL  MG  VT   T+H++RR+ NKPT  GLSF  F
Sbjct: 1    MVLQRFLLVTYPAQSHINPALQLAKRLIAMGAHVTILLTLHVYRRISNKPTIPGLSFLPF 60

Query: 326  XXXXXXXXXXXXX--AEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVA 499
                           ++   Y S+LK R S+ L + ILS+A EG+PFT L YTLLLPWVA
Sbjct: 61   SDGYDAGFDALHATDSDFFLYESQLKHRTSDLLSNLILSSASEGRPFTCLLYTLLLPWVA 120

Query: 500  TVARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDL 679
             VAR+ +LP+ALLWI+ ATV DI Y++FH + D+I  +T++   +I LPGL FSL+ RD+
Sbjct: 121  DVARQFYLPTALLWIEPATVLDILYHFFHGYADFINDETKE---NIVLPGLSFSLSPRDV 177

Query: 680  PSFLLA--SNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMI 853
            PSFLL    + ++F LPSF+ Q++ L+ ETNP VLVNT E LE E+LRA+D    K+ MI
Sbjct: 178  PSFLLLWKPSVFSFTLPSFENQIKQLDLETNPTVLVNTFEALEEEALRAID----KINMI 233

Query: 854  PIGPLIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQM 1033
            PIGPLIPSAFLDGNDP DTSFGGD   VS+DY+EWLDSK E+SVVYVSFGS   L KRQM
Sbjct: 234  PIGPLIPSAFLDGNDPTDTSFGGDIFQVSNDYVEWLDSKEENSVVYVSFGSYFELSKRQM 293

Query: 1034 EEIARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHR 1213
            EEIAR LLDC  PFLWV+                        K GKIV WCSQ+EVLSH 
Sbjct: 294  EEIARGLLDCGRPFLWVV-REKVINGKKEEEEELCCFREELEKWGKIVTWCSQVEVLSHS 352

Query: 1214 SLGCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVR--XXXXXXXXX 1387
            S+GCF+THCGWNST+ESL  GVPMVAFPQWTDQ TNAKLIEDVWK GVR           
Sbjct: 353  SVGCFLTHCGWNSTMESLVSGVPMVAFPQWTDQMTNAKLIEDVWKIGVRVDHHVNANGIV 412

Query: 1388 XXXXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDV 1552
                   CL+VVMGS G++  E R+NA++WK LAR+A KEGGSS+KNLRAF+DDV
Sbjct: 413  EGKEIEACLDVVMGS-GDRASEFRKNAKKWKVLARDAAKEGGSSEKNLRAFVDDV 466


>XP_014634355.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Glycine
            max] KRH42735.1 hypothetical protein GLYMA_08G107600
            [Glycine max]
          Length = 478

 Score =  520 bits (1339), Expect = e-178
 Identities = 278/475 (58%), Positives = 330/475 (69%), Gaps = 6/475 (1%)
 Frame = +2

Query: 146  MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVF 325
            MV  RFLL+TYP Q HINPALQ AKRL  MG  VT   T+H++RR+ NKPT  GLSF  F
Sbjct: 1    MVLQRFLLVTYPAQSHINPALQLAKRLIAMGAHVTILLTLHVYRRISNKPTIPGLSFLPF 60

Query: 326  XXXXXXXXXXXXX--AEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVA 499
                           ++   Y S+LK R S+ L + ILS+A EG+PFT L YTLLLPWVA
Sbjct: 61   SDGYDAGFDALHATDSDFFLYESQLKHRTSDLLSNLILSSASEGRPFTCLLYTLLLPWVA 120

Query: 500  TVARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDL 679
             VAR+ +LP+ALLWI+ ATV DI Y++FH + D+I  +T++   +I LPGL FSL+ RD+
Sbjct: 121  DVARQFYLPTALLWIEPATVLDILYHFFHGYADFINDETKE---NIVLPGLSFSLSPRDV 177

Query: 680  PSFLLA--SNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMI 853
            PSFLL    + ++F LPSF+ Q++ L+ ETNP VLVNT E LE E+LRA+D    K+ MI
Sbjct: 178  PSFLLLWKPSVFSFTLPSFENQIKQLDLETNPTVLVNTFEALEEEALRAID----KINMI 233

Query: 854  PIGPLIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQM 1033
            PIGPLIPSAFLDGNDP DTSFGGD   VS+DY+EWLDSK E SVVYVSFGS   L KRQM
Sbjct: 234  PIGPLIPSAFLDGNDPTDTSFGGDIFQVSNDYVEWLDSKEEDSVVYVSFGSYFELSKRQM 293

Query: 1034 EEIARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHR 1213
            EEIAR LLDC  PFLWV+                        K GKIV WCSQ+EVLSH 
Sbjct: 294  EEIARGLLDCGRPFLWVV-REKVINGKKEEEEELCCFREELEKWGKIVTWCSQVEVLSHS 352

Query: 1214 SLGCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVR--XXXXXXXXX 1387
            S+GCF+THCGWNST+ESL  GVPMVAFPQWTDQ TNAKLIEDVWK GVR           
Sbjct: 353  SVGCFLTHCGWNSTMESLVSGVPMVAFPQWTDQMTNAKLIEDVWKIGVRVDHHVNANGIV 412

Query: 1388 XXXXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDV 1552
                   CL+VVMGS G++  E R+NA++WK LAR+A KEGGSS+KNLRAF+DDV
Sbjct: 413  EGKEIEACLDVVMGS-GDRASEFRKNAKKWKVLARDAAKEGGSSEKNLRAFVDDV 466

