BLASTX nr result

ID: Glycyrrhiza36_contig00002585 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza36_contig00002585
         (1688 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_004502258.1 PREDICTED: crocetin glucosyltransferase, chloropl...   600   0.0  
GAU18595.1 hypothetical protein TSUD_124260 [Trifolium subterran...   593   0.0  
XP_003552552.1 PREDICTED: crocetin glucosyltransferase, chloropl...   592   0.0  
XP_019415874.1 PREDICTED: crocetin glucosyltransferase, chloropl...   586   0.0  
XP_007163802.1 hypothetical protein PHAVU_001G265400g [Phaseolus...   585   0.0  
XP_017405475.1 PREDICTED: crocetin glucosyltransferase, chloropl...   573   0.0  
XP_014521681.1 PREDICTED: crocetin glucosyltransferase, chloropl...   569   0.0  
XP_002263700.1 PREDICTED: crocetin glucosyltransferase, chloropl...   546   0.0  
XP_003531212.1 PREDICTED: crocetin glucosyltransferase, chloropl...   538   0.0  
AAY27090.1 UDP-glucose:flavonoid 7-O-glucosyltransferase [Pyrus ...   533   0.0  
XP_018844047.1 PREDICTED: crocetin glucosyltransferase, chloropl...   531   0.0  
KYP56999.1 Anthocyanin 5-O-glucosyltransferase [Cajanus cajan]        529   0.0  
XP_007221288.1 hypothetical protein PRUPE_ppa016890mg [Prunus pe...   529   0.0  
NP_001315912.1 crocetin glucosyltransferase, chloroplastic-like ...   529   0.0  
XP_018848545.1 PREDICTED: crocetin glucosyltransferase, chloropl...   526   0.0  
XP_003524180.1 PREDICTED: crocetin glucosyltransferase, chloropl...   526   e-180
XP_018809219.1 PREDICTED: crocetin glucosyltransferase, chloropl...   523   e-179
XP_008348418.1 PREDICTED: crocetin glucosyltransferase, chloropl...   522   e-179
AMO27404.1 anthocyanidin 3-o-glucoside 5-o-glucosyltransferase 1...   520   e-178
XP_014634355.1 PREDICTED: crocetin glucosyltransferase, chloropl...   520   e-178

>XP_004502258.1 PREDICTED: crocetin glucosyltransferase, chloroplastic [Cicer
            arietinum] AGU14117.1 UDP-glycosyltransferase [Cicer
            arietinum]
          Length = 471

 Score =  600 bits (1546), Expect = 0.0
 Identities = 312/472 (66%), Positives = 350/472 (74%), Gaps = 3/472 (0%)
 Frame = +2

Query: 83   HRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVFXXX 262
            H FL++TYP+QGHINPALQFAKRL TMG  VTF TT++LHRRL+NKPT   LSFA F   
Sbjct: 5    HNFLIVTYPLQGHINPALQFAKRLVTMGAHVTFTTTIYLHRRLINKPTIPNLSFAAFSDG 64

Query: 263  XXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATVARE 442
                       ++++Y+ EL  RGSE LR+ ILSA     PFT LAYTLLLPW A VARE
Sbjct: 65   YDDGYNSNAIVDLSTYMLELSSRGSEFLRNIILSAKHGNHPFTCLAYTLLLPWAANVARE 124

Query: 443  LHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPSFLL 622
            L LP ALLWIQAATVFDIYYYY HEHGDYIT K++D T +IELPGL FSL SRDLPSFL 
Sbjct: 125  LQLPYALLWIQAATVFDIYYYYLHEHGDYITNKSKDATCNIELPGLSFSLKSRDLPSFLQ 184

Query: 623  ASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPIGPLIP 802
            ASN YTF L S KEQ  IL+EETNPIVLVNTV+E E ES+RA+D+   K++MIPIGPLIP
Sbjct: 185  ASNIYTFILSSMKEQFRILDEETNPIVLVNTVDEFELESVRAIDD---KIKMIPIGPLIP 241

Query: 803  SAFLDGNDPADTSFGGDTISV-SSDYLEWLDSKTESSVVYVSFGSLAVLPKRQMEEIARA 979
            SA+LDG D  DTSFGGD I V S DY+EWLDSK ESSVVYVSFGS +VLPK+QMEE ARA
Sbjct: 242  SAYLDGKDLTDTSFGGDVIRVDSEDYIEWLDSKDESSVVYVSFGSFSVLPKKQMEEFARA 301

Query: 980  LLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSLGCFV 1159
            LLD  + FLWVI                          GKIVKWCSQ+EVLSH S+GCFV
Sbjct: 302  LLDSGLNFLWVIREKKVDEKKEDDELSCKEELEKNVN-GKIVKWCSQVEVLSHSSVGCFV 360

Query: 1160 THCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVR--XXXXXXXXXXXXXXR 1333
            THCGWNST ESL  GVPMVAFPQWTDQ+TNAKLIEDVWK GVR                R
Sbjct: 361  THCGWNSTTESLVSGVPMVAFPQWTDQSTNAKLIEDVWKCGVRMDNNRDEEGIVKADEIR 420

Query: 1334 RCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDVGRI 1489
            RCLE+V+G GGEKGEE++RNA +WKSL REAVKEGGSSDKNL++FL  +G I
Sbjct: 421  RCLELVIG-GGEKGEELKRNAEKWKSLGREAVKEGGSSDKNLKSFLHHIGSI 471


>GAU18595.1 hypothetical protein TSUD_124260 [Trifolium subterraneum]
          Length = 468

 Score =  593 bits (1529), Expect = 0.0
 Identities = 312/470 (66%), Positives = 346/470 (73%), Gaps = 2/470 (0%)
 Frame = +2

Query: 74   MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVF 253
            M QH FL+I YPIQGHINPALQFAKR+  +G  VTF TT++ +RRLVNKPT   LSFA F
Sbjct: 1    MAQHNFLIIAYPIQGHINPALQFAKRVINLGAHVTFTTTIYAYRRLVNKPTIPSLSFAAF 60

Query: 254  XXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATV 433
                         A +  YISE +RRGSE L++ ILSA ++  PFT L YTL LPW + V
Sbjct: 61   SDGYDDGYILKDDASILFYISEHQRRGSEFLKNIILSAKQKIHPFTCLIYTLTLPWASKV 120

Query: 434  ARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPS 613
            ARE  LPSALLWIQAATVFDIYYYYFH HGDYIT K ED   SI+LPGL FSL SRDLPS
Sbjct: 121  AREFDLPSALLWIQAATVFDIYYYYFHNHGDYITNKLEDAECSIDLPGLSFSLKSRDLPS 180

Query: 614  FLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPIGP 793
            FLLASN YT+ALPSFKE L+IL+EETNP VLVNTVEE E ++++ VD    K++MIPIGP
Sbjct: 181  FLLASNIYTWALPSFKEHLQILDEETNPRVLVNTVEEFELDAIKDVD--IGKIKMIPIGP 238

Query: 794  LIPSAFLDGNDPADTSFGGDTISVSSD--YLEWLDSKTESSVVYVSFGSLAVLPKRQMEE 967
            LIPSAFLDG DP+D+S GGD I   S+  YLEWLD K ESSVVYV+FG+LAVL KRQM+E
Sbjct: 239  LIPSAFLDGKDPSDSSSGGDIIRGDSEDNYLEWLDLKGESSVVYVAFGTLAVLSKRQMDE 298

Query: 968  IARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSL 1147
            IA ALLD    FLWVI                          GKIVKWCSQLEVLSH SL
Sbjct: 299  IACALLDSGFSFLWVIRDNKLRKQRDDDDELSYREEIEKNVNGKIVKWCSQLEVLSHSSL 358

Query: 1148 GCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVRXXXXXXXXXXXXX 1327
            GCFVTHCGWNSTLE LS GVPMVAFPQW DQTTNAKLIEDVWKTGVR             
Sbjct: 359  GCFVTHCGWNSTLEGLSSGVPMVAFPQWIDQTTNAKLIEDVWKTGVRMDRDEEGIVKADE 418

Query: 1328 XRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDD 1477
             +RCLEVVMG  GEKGEE+RRNA++WKS AREAVKEGGSSDKNLR FL+D
Sbjct: 419  IKRCLEVVMGK-GEKGEELRRNAKKWKSFAREAVKEGGSSDKNLRNFLND 467


>XP_003552552.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Glycine
            max] KRG97360.1 hypothetical protein GLYMA_18G003100
            [Glycine max]
          Length = 465

 Score =  592 bits (1525), Expect = 0.0
 Identities = 308/475 (64%), Positives = 358/475 (75%), Gaps = 4/475 (0%)
 Frame = +2

Query: 74   MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVF 253
            MVQHRFLLITYPIQGHINP++QFAKRL +MGV VTFAT+++LHRR++ KPT  GLSFA F
Sbjct: 1    MVQHRFLLITYPIQGHINPSIQFAKRLVSMGVHVTFATSLYLHRRMLKKPTIPGLSFATF 60

Query: 254  XXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATV 433
                         + ++SY+SELKRRGSE LR+ I +A +EGQPFT LAYT+LLPW A V
Sbjct: 61   SDGYDDGYKATDDSSLSSYMSELKRRGSEFLRNIITAAKQEGQPFTCLAYTILLPWAAKV 120

Query: 434  ARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPS 613
            ARELH+P ALLWIQAATVFDIYYYYFHE+GD    K+ DPT  IELPGLPFSLT+RD+PS
Sbjct: 121  ARELHIPGALLWIQAATVFDIYYYYFHEYGDSFNYKS-DPT--IELPGLPFSLTARDVPS 177

Query: 614  FLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPIGP 793
            FLL SN Y FALP+ +EQ + L++ETNPI+LVNT ++LEP++LRAVD    K  MIPIGP
Sbjct: 178  FLLPSNIYRFALPTLQEQFQDLDDETNPIILVNTFQDLEPDALRAVD----KFTMIPIGP 233

Query: 794  L-IPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQMEEI 970
            L IPSAFLDG DPADTS+GGD    S+DY+EWLDS+ E SVVYVSFG+LAVL  RQM+E+
Sbjct: 234  LNIPSAFLDGKDPADTSYGGDLFDASNDYVEWLDSQPELSVVYVSFGTLAVLADRQMKEL 293

Query: 971  ARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSLG 1150
            ARALLD    FLWVI                        + GKIVKWCSQ+EVLSH SLG
Sbjct: 294  ARALLDSGYLFLWVI---------RDMQGIEDNCREELEQRGKIVKWCSQVEVLSHGSLG 344

Query: 1151 CFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVR---XXXXXXXXXXX 1321
            CFVTHCGWNST+ESL  GVPMVAFPQWTDQ TNAK+++DVWKTGVR              
Sbjct: 345  CFVTHCGWNSTMESLGSGVPMVAFPQWTDQGTNAKMVQDVWKTGVRVDDKVNVEEGIVEA 404

Query: 1322 XXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDVGR 1486
               R+CL+VVMGSGG KG+E RRNA +WK LAREAV EGGSSD N+R FL DV +
Sbjct: 405  EEIRKCLDVVMGSGG-KGQEFRRNADKWKCLAREAVTEGGSSDSNMRTFLHDVAK 458


>XP_019415874.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Lupinus
            angustifolius] OIV97469.1 hypothetical protein
            TanjilG_10993 [Lupinus angustifolius]
          Length = 463

 Score =  586 bits (1510), Expect = 0.0
 Identities = 307/473 (64%), Positives = 347/473 (73%), Gaps = 3/473 (0%)
 Frame = +2

Query: 74   MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRL-VNKPTTKGLSFAV 250
            M   RFLLITYP QGHINP+LQFAKRL T+GV VTFATT+H+ R +  NK    GLS   
Sbjct: 1    MAHQRFLLITYPAQGHINPSLQFAKRLITLGVHVTFATTIHMQRCINKNKTIIPGLSITA 60

Query: 251  FXXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVAT 430
            F              +V SYISELK RGSE L + I  A ++G PFT + YTLLLPWVAT
Sbjct: 61   FSDGYDDGFNSAADVDVLSYISELKHRGSECLTNVIAYAIQQGNPFTCITYTLLLPWVAT 120

Query: 431  VARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLP 610
            VARE  LPSALLWIQ ATVFD+YY+YFH + +Y+ Q  ++PT S+ELPGLPF    RDLP
Sbjct: 121  VAREFQLPSALLWIQPATVFDMYYFYFHGYEEYMIQNVKEPTCSLELPGLPFIFKPRDLP 180

Query: 611  SFLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPIG 790
            SF   SN Y+FALPSFKEQLE+L+ ETNPIVLVNT EELE E+LRA+++    +RMIPIG
Sbjct: 181  SFFWPSNMYSFALPSFKEQLEVLDLETNPIVLVNTFEELEHEALRAIED----IRMIPIG 236

Query: 791  PLIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQMEEI 970
            PLIPSAFLDG DP DTSFGGD I  ++DY+ WLDSK + SVVYVSFGSLAVLPKRQMEEI
Sbjct: 237  PLIPSAFLDGKDPNDTSFGGDIIHGTNDYVTWLDSKPKLSVVYVSFGSLAVLPKRQMEEI 296

Query: 971  ARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSLG 1150
            A ALLD K PFLWVI                          GKIVKWCSQ+EVLSH SLG
Sbjct: 297  AIALLDSKHPFLWVIRENNAKEVLKYRDELEQG--------GKIVKWCSQVEVLSHHSLG 348

Query: 1151 CFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVR--XXXXXXXXXXXX 1324
            CFVTHCGWNST+ESL CGVP+VAFPQWTDQTTNAKLIEDVWK+GVR              
Sbjct: 349  CFVTHCGWNSTMESLVCGVPVVAFPQWTDQTTNAKLIEDVWKSGVRVDHELNEDGIVERD 408

Query: 1325 XXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDVG 1483
              R+CLEVVMGS GEKG+E+RRN+ +WK LA+EAVKEGGSSDKNLR FLD VG
Sbjct: 409  EIRKCLEVVMGS-GEKGQELRRNSYKWKDLAKEAVKEGGSSDKNLRTFLDVVG 460


>XP_007163802.1 hypothetical protein PHAVU_001G265400g [Phaseolus vulgaris]
            ESW35796.1 hypothetical protein PHAVU_001G265400g
            [Phaseolus vulgaris]
          Length = 531

 Score =  585 bits (1509), Expect = 0.0
 Identities = 306/497 (61%), Positives = 357/497 (71%), Gaps = 5/497 (1%)
 Frame = +2

Query: 23   SQRPQEHYHAP-SRASAAMVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHL 199
            S  P  + H+  SRA+  MV HRFLL+TYP+QGHINPA+QFAKRL  +GV VTFAT+  L
Sbjct: 43   SNNPLHNIHSAFSRATVTMVHHRFLLVTYPVQGHINPAIQFAKRLTAIGVHVTFATSTFL 102

Query: 200  HRRLVNKPTTKGLSFAVFXXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEG 379
            HRR++NK T  GL+F  F             + V SY++ELK RGSE  R+ I SA +EG
Sbjct: 103  HRRMINKLTVPGLTFVTFSDGYDDGYEGTNDSNVISYMAELKLRGSEFFRNIITSAKQEG 162

Query: 380  QPFTYLAYTLLLPWVATVARELHLPSALLWIQAATVFDIYYYYFHEHGDYIT--QKTEDP 553
            +PFT +AYTL+LPWVA VARE  +P  LLWIQAATV DIYY+YFHE+ DYI    K EDP
Sbjct: 163  KPFTCVAYTLMLPWVAKVAREFRIPGVLLWIQAATVLDIYYHYFHEYRDYINIIHKKEDP 222

Query: 554  TYSIELPGLPFSLTSRDLPSFLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEP 733
            T  IE PGLPFS  +RD+PSFLL SN Y+FA+P  +EQ ++ +EE+NP V+VNT E+LEP
Sbjct: 223  T--IEFPGLPFSFAARDIPSFLLPSNIYSFAIPPLEEQFQLFDEESNPRVIVNTFEDLEP 280

Query: 734  ESLRAVDEYDKKLRMIPIGPLIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSV 913
            ESLRA+D    KL MI IGPLIPSAFLDG DPADTSFGGD    S DY+EWLDSK   SV
Sbjct: 281  ESLRAID----KLTMISIGPLIPSAFLDGKDPADTSFGGDIFHGSKDYVEWLDSKPALSV 336

Query: 914  VYVSFGSLAVLPKRQMEEIARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKI 1093
            VYVSFGSLAVL +RQMEEIARALLD   PFLWVI                        + 
Sbjct: 337  VYVSFGSLAVLSQRQMEEIARALLDSGYPFLWVI----RESNGKEGTLQELSCRKELEQR 392

Query: 1094 GKIVKWCSQLEVLSHRSLGCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVW 1273
            GKIVKWC+Q+EVLSH S+GCFVTHCGWNST+ESLS GVPMV FPQWTDQ TNAKL+EDVW
Sbjct: 393  GKIVKWCTQVEVLSHGSVGCFVTHCGWNSTMESLSSGVPMVGFPQWTDQGTNAKLVEDVW 452

Query: 1274 KTGVR--XXXXXXXXXXXXXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSS 1447
            KTGVR                R+CLEVVMG+GG K EE+RRNA++WK LAREAVKEGGSS
Sbjct: 453  KTGVRVDDKVNEEGIVEAMEIRKCLEVVMGNGG-KAEELRRNAQKWKCLAREAVKEGGSS 511

Query: 1448 DKNLRAFLDDVGRIVQN 1498
            D+N+  FLD V +  Q+
Sbjct: 512  DRNMNTFLDYVSKFEQD 528


>XP_017405475.1 PREDICTED: crocetin glucosyltransferase, chloroplastic [Vigna
            angularis] KOM25418.1 hypothetical protein
            LR48_Vigan102s007600 [Vigna angularis] BAT86657.1
            hypothetical protein VIGAN_04433100 [Vigna angularis var.
            angularis]
          Length = 472

 Score =  573 bits (1478), Expect = 0.0
 Identities = 295/471 (62%), Positives = 347/471 (73%), Gaps = 2/471 (0%)
 Frame = +2

Query: 74   MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVF 253
            MVQ RFLLITYPIQGH+NP+LQFAKRLA +GV VTFAT+V+LHRR++ KPT  GLSF  F
Sbjct: 1    MVQQRFLLITYPIQGHVNPSLQFAKRLAAIGVHVTFATSVYLHRRMLKKPTVPGLSFVTF 60

Query: 254  XXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATV 433
                         ++V SY++ELKRRGSE L + I SA +EG+PFT + YTL+LPWVA V
Sbjct: 61   SDGYDDGYKTTDDSDVNSYMAELKRRGSEFLGNIITSAKEEGKPFTCVTYTLMLPWVAKV 120

Query: 434  ARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPS 613
            ARE  +P  LLWIQAATV DIYY+YFHE+ DYI    +     IE PGLPFSL +RD+PS
Sbjct: 121  AREFLIPGVLLWIQAATVLDIYYHYFHEYKDYINTIQKSENSIIEFPGLPFSLATRDVPS 180

Query: 614  FLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPIGP 793
            FLL SN Y+FALPS +EQ ++ +EETNP VLVNT ++LEPE+L+A+D    KL MI IGP
Sbjct: 181  FLLPSNAYSFALPSLEEQFKLFDEETNPRVLVNTFQDLEPEALKAID----KLTMISIGP 236

Query: 794  LIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQMEEIA 973
            LIPSAFLDG DP DTSFGGD    S+DY++WLDSK   SVVYVSFG+LAVL +RQMEE+A
Sbjct: 237  LIPSAFLDGKDPTDTSFGGDIYHGSNDYIKWLDSKPALSVVYVSFGTLAVLGQRQMEEVA 296

Query: 974  RALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSLGC 1153
            RALLD   PFLWVI                        + GKIV WCSQ+EVLSH SLGC
Sbjct: 297  RALLDSGYPFLWVIREPNGKQEKEEKLQELSCRKELEQR-GKIVNWCSQVEVLSHGSLGC 355

Query: 1154 FVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVR--XXXXXXXXXXXXX 1327
            F+THCGWNST+ESLS G+PMV+FPQW+DQ TNAKL+EDVWKTGVR               
Sbjct: 356  FLTHCGWNSTMESLSSGIPMVSFPQWSDQGTNAKLVEDVWKTGVRVDGKANAEGIVEGEE 415

Query: 1328 XRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDV 1480
             R+CLEVVMGS GEK +E+RRNA +WK LAREAVKEGGSSD+N+R FLD V
Sbjct: 416  IRKCLEVVMGS-GEKRDELRRNAEKWKCLAREAVKEGGSSDRNIRTFLDYV 465


>XP_014521681.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Vigna
            radiata var. radiata]
          Length = 469

 Score =  569 bits (1466), Expect = 0.0
 Identities = 293/471 (62%), Positives = 345/471 (73%), Gaps = 2/471 (0%)
 Frame = +2

Query: 74   MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVF 253
            MVQ RFLLITYPIQGH+NP+LQFAKRLA +GV VTFAT+V+LHRR++ KPT  GLSF  F
Sbjct: 1    MVQPRFLLITYPIQGHVNPSLQFAKRLAAIGVHVTFATSVYLHRRMLKKPTVPGLSFVTF 60

Query: 254  XXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATV 433
                         + V SY++ELKRRGSE L + I SA +EG+PFT L YTL+LPWVA V
Sbjct: 61   SDGYDDGYKTTEDSHVNSYMAELKRRGSEFLGNIITSAKEEGKPFTCLTYTLMLPWVAKV 120

Query: 434  ARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPS 613
            ARE  +P  LLWIQ ATV DIYY+YFHE+ DYI    +     I+ PGLPFSL +RD+PS
Sbjct: 121  AREFRIPGVLLWIQPATVLDIYYHYFHEYRDYINTIHKSENSIIDFPGLPFSLATRDVPS 180

Query: 614  FLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPIGP 793
            FLL SN Y+FA+PS +EQ ++ +EETNP VLVNT ++LEPE+L+A+D    KL MI IGP
Sbjct: 181  FLLPSNAYSFAIPSLEEQFKLFDEETNPRVLVNTFQDLEPEALKAID----KLTMISIGP 236

Query: 794  LIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQMEEIA 973
            LIPSAFLDG DP DTSFG D  + S+DY+EWLDSK   SVVYVSFG+LAVL +RQMEE+A
Sbjct: 237  LIPSAFLDGKDPTDTSFGADIYNGSNDYIEWLDSKPALSVVYVSFGTLAVLGQRQMEEVA 296

Query: 974  RALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSLGC 1153
            RALLD   PFLWVI                        + GKIV WCSQ+EVLSH SLGC
Sbjct: 297  RALLDSGYPFLWVI----REPNGKQEKLEELSCRKELEERGKIVNWCSQVEVLSHGSLGC 352

Query: 1154 FVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVR--XXXXXXXXXXXXX 1327
            F+THCGWNST+ESLS G+PMV+FPQW+DQ TNAKL+EDVWKTGVR               
Sbjct: 353  FLTHCGWNSTMESLSSGIPMVSFPQWSDQGTNAKLVEDVWKTGVRVDGKANAEGIVDGEE 412

Query: 1328 XRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDV 1480
             R+CLEVVMGS GEK +E+RRNA +WK LAREAVKEGGSSD+N+R FLD V
Sbjct: 413  IRKCLEVVMGS-GEKRDELRRNAEKWKCLAREAVKEGGSSDRNIRTFLDYV 462


>XP_002263700.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Vitis
            vinifera]
          Length = 469

 Score =  546 bits (1406), Expect = 0.0
 Identities = 280/465 (60%), Positives = 336/465 (72%)
 Frame = +2

Query: 89   FLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVFXXXXX 268
            FLL+T+P QGHINPALQFAKR+   G  V+FAT+V  HRR+  + T +GL+F  F     
Sbjct: 6    FLLVTFPAQGHINPALQFAKRIIRTGAQVSFATSVSAHRRMAKRSTPEGLNFVPFSDGYD 65

Query: 269  XXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATVARELH 448
                     +V  Y+SE+KRRGSE+LR+ ++  A EGQPFT + YTLLLPW A VAR L 
Sbjct: 66   DGFKPTD--DVQHYMSEIKRRGSETLREIVVRNADEGQPFTCIVYTLLLPWAAEVARGLG 123

Query: 449  LPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPSFLLAS 628
            +PSALLWIQ ATV DIYYYYF+ +GD     + +P+ S+ELPGLP  L+SRDLPSFL+ S
Sbjct: 124  VPSALLWIQPATVLDIYYYYFNGYGDVFRNISNEPSCSVELPGLPL-LSSRDLPSFLVKS 182

Query: 629  NTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPIGPLIPSA 808
            N YTF LP+F+EQLE L +ET+P VLVNT + LEPE LRAVD    KL +I IGPL+PSA
Sbjct: 183  NAYTFVLPTFQEQLEALSQETSPKVLVNTFDALEPEPLRAVD----KLHLIGIGPLVPSA 238

Query: 809  FLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQMEEIARALLD 988
            +LDG DP+DTSFGGD    S DY+EWL+SK +SSVVYVSFGS++VL K Q E+IARALLD
Sbjct: 239  YLDGKDPSDTSFGGDMFQGSDDYMEWLNSKPKSSVVYVSFGSISVLSKTQKEDIARALLD 298

Query: 989  CKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSLGCFVTHC 1168
            C  PFLWVI                          G IV WCSQ+EVL+H SLGCFV+HC
Sbjct: 299  CGHPFLWVIRAPENGEEVKEQDKLSCREELEQK--GMIVSWCSQIEVLTHPSLGCFVSHC 356

Query: 1169 GWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVRXXXXXXXXXXXXXXRRCLEV 1348
            GWNSTLESL  GVP+VAFPQWTDQ TNAKLIED+WK G+R              +RCLE+
Sbjct: 357  GWNSTLESLVSGVPVVAFPQWTDQGTNAKLIEDMWKIGIRVTVNEEGIVESDEFKRCLEI 416

Query: 1349 VMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDVG 1483
            VMG GGEKGEE+RRNA +WK+LAREAVK+GGSSDKNL+ F+D+VG
Sbjct: 417  VMG-GGEKGEEMRRNAEKWKNLAREAVKDGGSSDKNLKGFVDEVG 460


>XP_003531212.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Glycine
            max] KRH42734.1 hypothetical protein GLYMA_08G107500
            [Glycine max]
          Length = 465

 Score =  538 bits (1387), Expect = 0.0
 Identities = 287/473 (60%), Positives = 328/473 (69%), Gaps = 4/473 (0%)
 Frame = +2

Query: 74   MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVF 253
            M +HRFLLI YP QGHI+PA Q AKRL ++G  VT +TTVH+HRR+ NKPT   LSF  F
Sbjct: 1    MFRHRFLLILYPAQGHIHPAFQLAKRLVSLGAHVTVSTTVHMHRRITNKPTLPHLSFLPF 60

Query: 254  XXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATV 433
                         ++ + + S  KRRGSE + + ILS A+EG PFT L YT LL WVA V
Sbjct: 61   SDGYDDGFTS---SDFSLHASVFKRRGSEFVTNLILSNAQEGHPFTCLVYTTLLSWVAEV 117

Query: 434  ARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPS 613
            ARE HLP+A+LW Q AT+ DI+YYYFHEHG+YI  K +DP+  IELPGLP  L  RDLPS
Sbjct: 118  AREFHLPTAMLWTQPATILDIFYYYFHEHGEYIKDKIKDPSCFIELPGLPLLLAPRDLPS 177

Query: 614  FLLASNTY--TFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPI 787
            FLL SN    +F +P F++    L+ ET P +LVNT E LE E+LRAVD    K  MIPI
Sbjct: 178  FLLGSNPTIDSFIVPMFEKMFYDLDVETKPRILVNTFEALEAEALRAVD----KFNMIPI 233

Query: 788  GPLIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQMEE 967
            GPLIPSAFLDG D  DTSFGGD   +S+   EWLDSK E SVVYVSFGSL VLPK QMEE
Sbjct: 234  GPLIPSAFLDGKDTNDTSFGGDIFRLSNGCSEWLDSKPEMSVVYVSFGSLCVLPKTQMEE 293

Query: 968  IARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSL 1147
            +ARALLDC  PFLWVI                        + GKIV WCSQ+EVLSH S+
Sbjct: 294  LARALLDCGSPFLWVI--KEKENKSQVEGKEELSCIEELEQKGKIVNWCSQVEVLSHGSV 351

Query: 1148 GCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVR--XXXXXXXXXXX 1321
            GCFVTHCGWNST+ESL+ GVPMVAFPQW +Q TNAKLIEDVWKTGVR             
Sbjct: 352  GCFVTHCGWNSTMESLASGVPMVAFPQWVEQKTNAKLIEDVWKTGVRVDKQVNEDGIVEN 411

Query: 1322 XXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDV 1480
               RRCLE VMGS GEKG+E+R NA +W+ LAREAVKEGGSSDKNLRAFLDDV
Sbjct: 412  EEIRRCLEEVMGS-GEKGQELRNNAEKWRGLAREAVKEGGSSDKNLRAFLDDV 463


>AAY27090.1 UDP-glucose:flavonoid 7-O-glucosyltransferase [Pyrus communis]
          Length = 481

 Score =  533 bits (1374), Expect = 0.0
 Identities = 276/480 (57%), Positives = 338/480 (70%), Gaps = 11/480 (2%)
 Frame = +2

Query: 74   MVQHRFLLITYPIQGHINPALQFAKRLA-TMGVPVTFATTVHLHRRLVNKPTTKGLSFAV 250
            MVQHRFLL+TYP QGHINP+LQFAKRL  T G  VT+ T++  HRR+ N     GL++A 
Sbjct: 1    MVQHRFLLVTYPAQGHINPSLQFAKRLTNTTGAHVTYVTSLSAHRRIGNGSIPDGLTYAP 60

Query: 251  FXXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVAT 430
            F               +  Y+SEL+ RG++++ D ++++A EG P+T L Y+L++PW A 
Sbjct: 61   FSDGYDDGFKPGD--NIDDYMSELRHRGAQAITDLVVASANEGHPYTCLVYSLIVPWSAG 118

Query: 431  VARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTY-----SIELPGLPFSLT 595
            VA ELHLPS LLWIQ ATVFDIYYYYF+ + D I   T   T      SIELPGLP S T
Sbjct: 119  VAHELHLPSVLLWIQPATVFDIYYYYFNGYKDLIRDNTSSGTNNVLPCSIELPGLPLSFT 178

Query: 596  SRDLPSFLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLR 775
            SRDLPSF++ +N Y FALP F+EQ+E+LE ETNP +LVNT + LEPE+L+A+D+Y+    
Sbjct: 179  SRDLPSFMVDTNPYNFALPLFQEQMELLERETNPTILVNTFDALEPEALKAIDKYN---- 234

Query: 776  MIPIGPLIPSAFLDGNDPADTSFGGDTISVSSD--YLEWLDSKTESSVVYVSFGSLAVLP 949
            +I +GPLIPSAFLDG DP+D SFGGD +  S D  YLEWL+SK E SV+YVSFGS++VL 
Sbjct: 235  LIGVGPLIPSAFLDGKDPSDKSFGGDLVQKSRDSSYLEWLNSKPEGSVIYVSFGSISVLG 294

Query: 950  KRQMEEIARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKI---GKIVKWCSQ 1120
            K QMEEIA+ LLDC +PFLWVI                        ++   G+IV WCSQ
Sbjct: 295  KAQMEEIAKGLLDCGLPFLWVIRDKVDKKGDDNEAKQEEAMLSCRVELEELGRIVPWCSQ 354

Query: 1121 LEVLSHRSLGCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVRXXXX 1300
            +EVLS  SLGCFVTHCGWNS+LESL  GVP+VAFPQWTDQ TNAKLIED WKTGVR    
Sbjct: 355  VEVLSSPSLGCFVTHCGWNSSLESLVSGVPVVAFPQWTDQGTNAKLIEDFWKTGVRVTPN 414

Query: 1301 XXXXXXXXXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDV 1480
                      +RCL++V+GS GE GEEVRRNA++WK LAREAV EGGSSDKNL+AFLD +
Sbjct: 415  VEGIVTGEELKRCLDLVLGS-GEIGEEVRRNAKKWKDLAREAVNEGGSSDKNLKAFLDQI 473


>XP_018844047.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Juglans
            regia]
          Length = 463

 Score =  531 bits (1369), Expect = 0.0
 Identities = 273/465 (58%), Positives = 330/465 (70%), Gaps = 1/465 (0%)
 Frame = +2

Query: 92   LLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVFXXXXXX 271
            LL+T+P QGHINP LQFAKRL  +G  VT AT+V  +RR+   P  +GLSFA F      
Sbjct: 8    LLVTFPAQGHINPGLQFAKRLIRLGAHVTLATSVSAYRRMTKTPIPQGLSFATFSDGYDD 67

Query: 272  XXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATVARELHL 451
                    +   Y+S +KR GS++L D I+S+  EG+PF YL YTLLLPW   VA ELHL
Sbjct: 68   GFKPGTD-DAEHYMSAIKRSGSKTLTDLIVSSTNEGRPFQYLVYTLLLPWAGNVAHELHL 126

Query: 452  PSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPSFLLASN 631
            PSALLWIQ ATV DIYYYYF+ +GD I +K  DP+YS++LPGLP  L  RDLPSFLL SN
Sbjct: 127  PSALLWIQPATVLDIYYYYFNGYGDDIRKKGTDPSYSLQLPGLPL-LYGRDLPSFLLDSN 185

Query: 632  TYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPIGPLIPSAF 811
            TYTFALPSF+EQ+E LE+E+NP VLVNT + LEPE+LR ++    K  +  +GPLIPSAF
Sbjct: 186  TYTFALPSFQEQIEALEKESNPTVLVNTFDALEPEALRVIE----KFNLTAVGPLIPSAF 241

Query: 812  LDGNDPADTSFGGDTISVSSD-YLEWLDSKTESSVVYVSFGSLAVLPKRQMEEIARALLD 988
            LDG DP+D +FGGD    S + Y+EWL+SK  SSV+YVSFGS++ L K+QMEE+AR LLD
Sbjct: 242  LDGKDPSDKAFGGDLFQGSKEYYIEWLNSKPNSSVIYVSFGSISTLAKQQMEEMARGLLD 301

Query: 989  CKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSLGCFVTHC 1168
            C  PFLWVI                        + G IV WCSQ+EVLSH SL CFVTHC
Sbjct: 302  CGRPFLWVI------RAKENGEEERLSCREELEQKGMIVPWCSQVEVLSHPSLACFVTHC 355

Query: 1169 GWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVRXXXXXXXXXXXXXXRRCLEV 1348
            GWNS+LESL  GVP+VAFPQWTDQ TNAKLIEDVWKTG+R              +RCLE+
Sbjct: 356  GWNSSLESLVSGVPVVAFPQWTDQGTNAKLIEDVWKTGLRVTANKDGIVESDEIKRCLEL 415

Query: 1349 VMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDVG 1483
            V G GGE+GEE+RRNA++WK LAREA KEGGSS KNL+AF++++G
Sbjct: 416  VAG-GGERGEEMRRNAKKWKELAREAAKEGGSSHKNLKAFVEEIG 459


>KYP56999.1 Anthocyanin 5-O-glucosyltransferase [Cajanus cajan]
          Length = 420

 Score =  529 bits (1362), Expect = 0.0
 Identities = 276/428 (64%), Positives = 315/428 (73%), Gaps = 2/428 (0%)
 Frame = +2

Query: 209  LVNKPTTKGLSFAVFXXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPF 388
            ++ KPT  GLSF+ F             ++V SYISELKRRG+E LR+ I +A  +GQPF
Sbjct: 1    MLKKPTVPGLSFSTFSDGYDDGYRTTDDSQVVSYISELKRRGAECLRNIITAAKNDGQPF 60

Query: 389  TYLAYTLLLPWVATVARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIE 568
            T LAYT+LLPW   VARE HLP ALLWIQ ATVFDIYY+YFH++ DYI+ ++      IE
Sbjct: 61   TCLAYTILLPWAGRVAREFHLPGALLWIQPATVFDIYYHYFHQYKDYISHQSH-----IE 115

Query: 569  LPGLPFSLTSRDLPSFLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRA 748
            LPGLPFSLTSRD+PSFLL+SN YTFALP+F+EQ E L+ ETNPIVLVNT EELE ESLRA
Sbjct: 116  LPGLPFSLTSRDIPSFLLSSNIYTFALPTFQEQFEDLDGETNPIVLVNTFEELESESLRA 175

Query: 749  VDEYDKKLRMIPIGPLIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSF 928
            VD    K  MIPIGPLIPSAFLDG DP+DTSFGGD    S+DY+EWLDSK E SVVYVSF
Sbjct: 176  VD----KFTMIPIGPLIPSAFLDGKDPSDTSFGGDIFDGSNDYVEWLDSKPELSVVYVSF 231

Query: 929  GSLAVLPKRQMEEIARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVK 1108
            G+LAVL KRQ EE+ARALLD   PFLWVI                          GKIVK
Sbjct: 232  GTLAVLNKRQTEELARALLDSGYPFLWVIRENNGKEEKEKLQEVSCREELEVK--GKIVK 289

Query: 1109 WCSQLEVLSHRSLGCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVR 1288
            WCSQ+EVLSH SLGCFV+HCGWNST ESL+ GVPMVAFPQW+DQ TNAKL++DVWKTGVR
Sbjct: 290  WCSQVEVLSHGSLGCFVSHCGWNSTTESLASGVPMVAFPQWSDQMTNAKLVQDVWKTGVR 349

Query: 1289 --XXXXXXXXXXXXXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLR 1462
                            R+CLEVVMGS GEKG+E+RRNA +WK LAREAVKEGGSSD+N+R
Sbjct: 350  VDDKVNEEGVVEAEEIRKCLEVVMGS-GEKGQEMRRNAEKWKCLAREAVKEGGSSDRNMR 408

Query: 1463 AFLDDVGR 1486
             FLDDV +
Sbjct: 409  TFLDDVAK 416


>XP_007221288.1 hypothetical protein PRUPE_ppa016890mg [Prunus persica] ONI28923.1
            hypothetical protein PRUPE_1G169300 [Prunus persica]
          Length = 474

 Score =  529 bits (1362), Expect = 0.0
 Identities = 276/480 (57%), Positives = 336/480 (70%), Gaps = 10/480 (2%)
 Frame = +2

Query: 74   MVQHRFLLITYPIQGHINPALQFAKRLA-TMGVPVTFATTVHLHRRLVNKPTTKGLSFAV 250
            MVQHRFL +TYP QGHINPALQ AKRL    G  VT+ T+++ +RR+VN  T  GL++A 
Sbjct: 1    MVQHRFLFLTYPAQGHINPALQLAKRLIRNTGAQVTYVTSLYAYRRIVNGSTPNGLTYAP 60

Query: 251  FXXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVAT 430
            +              +V  Y+SEL+R GS+ + D + S+AKEG P+T L YT+LLPW A 
Sbjct: 61   YSDGYDDGFKFSD--DVDHYMSELRRAGSQVITDLVASSAKEGHPYTCLVYTILLPWAAD 118

Query: 431  VARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQK----TEDPTYSIELPGLPFSLTS 598
            +ARELHLPS L WIQAAT+FD+YYYY   + D I +     T DP+ SI+LPGLP  L S
Sbjct: 119  LARELHLPSVLAWIQAATLFDVYYYYLSGYKDLIRESFGTDTNDPSCSIQLPGLPLDLAS 178

Query: 599  RDLPSFLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRM 778
            RDLPSF++A N+Y FALP F++Q E+LE ET PI+LVNT + LEPE+L+A+D+Y+    +
Sbjct: 179  RDLPSFMVAENSYNFALPLFEKQFELLERETKPIILVNTFDALEPEALKAIDKYN----L 234

Query: 779  IPIGPLIPSAFLDGNDPADTSFGGDTI--SVSSDYLEWLDSKTESSVVYVSFGSLAVLPK 952
            I IGPLIPSAFLDG DP+DTSFGGD    S+ S  +EWL+SK E SV+YVSFGS++ L K
Sbjct: 235  IGIGPLIPSAFLDGKDPSDTSFGGDLFQKSMDSSCIEWLNSKPEGSVIYVSFGSVSALSK 294

Query: 953  RQMEEIARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKI---GKIVKWCSQL 1123
             QMEEIA+ LLD   PFLWVI                        ++   GKIV WCSQL
Sbjct: 295  DQMEEIAKGLLDYGRPFLWVIREKEERNGQDNETEKEEEKFSCREELKELGKIVLWCSQL 354

Query: 1124 EVLSHRSLGCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVRXXXXX 1303
            EVLS+ SLGCFVTHCGWNS++ESL  GVP+VAFP WTDQ TNAKLIED WKTGVR     
Sbjct: 355  EVLSNPSLGCFVTHCGWNSSMESLVSGVPVVAFPLWTDQRTNAKLIEDTWKTGVRVAPNE 414

Query: 1304 XXXXXXXXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDVG 1483
                     +RCLE+VMGS GE GEE+RRNA++WK LAREAV EGGSSDKNL AFLD +G
Sbjct: 415  EGIVVGEELKRCLELVMGS-GEIGEELRRNAKKWKGLAREAVSEGGSSDKNLMAFLDQIG 473


>NP_001315912.1 crocetin glucosyltransferase, chloroplastic-like [Malus domestica]
            AAX16493.1 UDP-glucose:flavonoid 7-O-glucosyltransferase
            [Malus domestica]
          Length = 481

 Score =  529 bits (1362), Expect = 0.0
 Identities = 277/480 (57%), Positives = 335/480 (69%), Gaps = 11/480 (2%)
 Frame = +2

Query: 74   MVQHRFLLITYPIQGHINPALQFAKRLA-TMGVPVTFATTVHLHRRLVNKPTTKGLSFAV 250
            MVQHRFLL+T+P QGHINP+LQFAKRL  T G  VT+ T++  HRR+ N     GL++A 
Sbjct: 1    MVQHRFLLVTFPAQGHINPSLQFAKRLINTTGAHVTYVTSLSAHRRIGNGSIPDGLTYAP 60

Query: 251  FXXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVAT 430
            F               V  Y+SEL+RRG +++ D ++++A EG P+T L Y+LLLPW A 
Sbjct: 61   FSDGYDDGFKPGD--NVDDYMSELRRRGVQAITDLVVASANEGHPYTCLVYSLLLPWSAG 118

Query: 431  VARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTY-----SIELPGLPFSLT 595
            +A ELHLPS LLWIQ ATVFDIYYYYF+ + D I   T   T      SIELPGLP S T
Sbjct: 119  MAHELHLPSVLLWIQPATVFDIYYYYFNGYKDLIRDNTSSGTNNVLPCSIELPGLPLSFT 178

Query: 596  SRDLPSFLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLR 775
            SRDLPSF++ +N Y FALP F+EQ+E+LE ETNP +LVNT + LEPE+L+A+D+Y+    
Sbjct: 179  SRDLPSFMVDTNPYNFALPLFQEQMELLERETNPTILVNTFDALEPEALKAIDKYN---- 234

Query: 776  MIPIGPLIPSAFLDGNDPADTSFGGDTISVSSD--YLEWLDSKTESSVVYVSFGSLAVLP 949
            +I +GPLIPSAFLDG DP+D SFGGD    S D  YLEWL+SK E SV+YVSFGS++VL 
Sbjct: 235  LIGVGPLIPSAFLDGKDPSDKSFGGDLFQKSKDSSYLEWLNSKPEGSVIYVSFGSISVLG 294

Query: 950  KRQMEEIARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKI---GKIVKWCSQ 1120
            K QMEEIA+ LLDC +PFLWVI                        ++   G IV WCSQ
Sbjct: 295  KAQMEEIAKGLLDCGLPFLWVIRDKVGKKGDDNEAKKEEEMLRCREELEELGMIVPWCSQ 354

Query: 1121 LEVLSHRSLGCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVRXXXX 1300
            +EVLS  SLGCFVTHCGWNS+LESL  GVP+VAFPQWTDQ TNAKLIED WKTGVR    
Sbjct: 355  VEVLSSPSLGCFVTHCGWNSSLESLVSGVPVVAFPQWTDQGTNAKLIEDYWKTGVRVTPN 414

Query: 1301 XXXXXXXXXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDV 1480
                      +RCL++V+GS GE GE+VRRNA++WK LAREAV EG SSDKNLRAFLD +
Sbjct: 415  EEGIVTGEELKRCLDLVLGS-GEIGEDVRRNAKKWKDLAREAVSEGDSSDKNLRAFLDQI 473


>XP_018848545.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Juglans
            regia]
          Length = 466

 Score =  526 bits (1356), Expect = 0.0
 Identities = 271/470 (57%), Positives = 331/470 (70%)
 Frame = +2

Query: 74   MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVF 253
            MV H FLL+ +P QGHINPALQFAKRL  +G  VTFATTV  HRR+   PT  GLSFA F
Sbjct: 1    MVNHHFLLVIFPAQGHINPALQFAKRLIRLGAHVTFATTVAAHRRMNKSPTPDGLSFATF 60

Query: 254  XXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATV 433
                          +   Y+SELKRRGS++L D ++S+A +G+ FT L Y++LLPW   V
Sbjct: 61   SDGYDDGGFKHGDHDFVDYMSELKRRGSQTLTDLVVSSANKGRTFTCLVYSILLPWACDV 120

Query: 434  ARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPS 613
            ARELHL SALLWIQ ATV DIYYYYF+ +GD I +   DP++SIELPGLP SLTSRDLPS
Sbjct: 121  ARELHLLSALLWIQPATVLDIYYYYFNGYGDVI-RNIPDPSFSIELPGLP-SLTSRDLPS 178

Query: 614  FLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPIGP 793
            F+   NT+TFALP F+E  E L +E+NP VLVNT + LEPE+LRA++ +        IGP
Sbjct: 179  FMADLNTHTFALPLFQEHFEELGKESNPRVLVNTFDALEPEALRAIERFS----FTGIGP 234

Query: 794  LIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQMEEIA 973
            LIPSAFLDG DP+DT+FGGD    ++DY+EWL+SK  SSV+YVSFGSL++L K QMEEIA
Sbjct: 235  LIPSAFLDGKDPSDTAFGGDLFQGATDYIEWLNSKPSSSVIYVSFGSLSLLAKNQMEEIA 294

Query: 974  RALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSLGC 1153
            R LLD   PFLWV                         + GK V+WCSQ+EVLSH S+ C
Sbjct: 295  RGLLDYGCPFLWV---KRANENGEEKEEDRLSCREELEQKGKFVQWCSQVEVLSHPSVAC 351

Query: 1154 FVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVRXXXXXXXXXXXXXXR 1333
            FVTHCGWNSTLESL  GVP+VAFPQWTDQ TNAKL++DVWK G+R              +
Sbjct: 352  FVTHCGWNSTLESLVSGVPLVAFPQWTDQGTNAKLVQDVWKIGLRVTTNKDGIVEGDEIK 411

Query: 1334 RCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDVG 1483
            RCLE+V+G+ GE+GEE+R+NA++WK LAREA  EGG+S  NL+AF+D+ G
Sbjct: 412  RCLELVLGN-GERGEEMRKNAKKWKDLAREAAMEGGTSYNNLKAFVDEFG 460


>XP_003524180.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Glycine
            max] KRH58827.1 hypothetical protein GLYMA_05G150600
            [Glycine max]
          Length = 460

 Score =  526 bits (1354), Expect = e-180
 Identities = 281/474 (59%), Positives = 324/474 (68%), Gaps = 5/474 (1%)
 Frame = +2

Query: 74   MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVF 253
            M +HRFL++ YP QGHINPA QFAKRL ++G  VT +TTVH+HRR+ NKPT   LSF  F
Sbjct: 1    MFRHRFLIVMYPAQGHINPAFQFAKRLVSLGAHVTVSTTVHMHRRITNKPTLPHLSFLPF 60

Query: 254  XXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVATV 433
                          + A   SE KRRGSE + + I S A+EG PFT L +T+LLPW A  
Sbjct: 61   SDGYDDGYTS---TDYALQASEFKRRGSEFVTNLIASKAQEGHPFTCLVHTVLLPWAARA 117

Query: 434  ARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDLPS 613
            AR  HLP+ALLW Q AT+ DI+Y YFHEHGDYI  K +DP+ SIELPGLP  L  RDLPS
Sbjct: 118  ARGFHLPTALLWTQPATILDIFYCYFHEHGDYIKGKIKDPSSSIELPGLPLLLAPRDLPS 177

Query: 614  FLLASNTY--TFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMIPI 787
            FLL SN    + A+  F+EQL  L+ +  P +LVNT E LE E+LRAVD ++    MIPI
Sbjct: 178  FLLGSNPTIDSLAVSMFEEQLHDLDMQAKPRILVNTFEALEHEALRAVDNFN----MIPI 233

Query: 788  GPLIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQMEE 967
            GPLIPSAFLDG DP DTSFGGD    S+D  EWLDSK E SVVYVSFGS  VL K+QMEE
Sbjct: 234  GPLIPSAFLDGKDPTDTSFGGDIFRPSNDCGEWLDSKPEMSVVYVSFGSFCVLSKKQMEE 293

Query: 968  IARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHRSL 1147
            +A ALLDC  PFLWV                         + GKIV WCSQ+EVLSHRS+
Sbjct: 294  LALALLDCGSPFLWV---------SREKEEEELSCREELEQKGKIVNWCSQVEVLSHRSV 344

Query: 1148 GCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVRXXXXXXXXXXXXX 1327
            GCFVTHCGWNST+ESL+ GVPM AFPQW +Q TNAKLIEDVWKTGVR             
Sbjct: 345  GCFVTHCGWNSTMESLASGVPMFAFPQWIEQKTNAKLIEDVWKTGVRVDKQVNEEGIVEK 404

Query: 1328 XR--RCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEG-GSSDKNLRAFLDDV 1480
                +CLEV MGS G+KG+E+R NA+ WK LAREAVKEG GSSDKNLRAFLDD+
Sbjct: 405  EEIIKCLEVAMGS-GKKGQELRNNAKNWKGLAREAVKEGSGSSDKNLRAFLDDL 457


>XP_018809219.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Juglans
            regia]
          Length = 502

 Score =  523 bits (1348), Expect = e-179
 Identities = 274/490 (55%), Positives = 337/490 (68%), Gaps = 3/490 (0%)
 Frame = +2

Query: 23   SQRPQEHYHAPSRASAAMVQH--RFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVH 196
            SQ    H  AP   +   ++     LL+T+P QGHINPALQFAK L  +G  VT AT+V 
Sbjct: 18   SQTLTPHPQAPPCTTTTTMEKYPHVLLVTFPAQGHINPALQFAKGLVRLGALVTLATSVS 77

Query: 197  LHRRLVNKPTTKGLSFAVFXXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKE 376
             +RR+   P  +GLSFA +              ++  YI  +K  GS++L D I+S+A E
Sbjct: 78   SYRRMTKTPAPQGLSFATYSDGYDDGFKPTTD-DLEHYIFAIKHSGSKTLTDLIVSSANE 136

Query: 377  GQPFTYLAYTLLLPWVATVARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPT 556
            G+PF YL Y +LLPW   VARELHLPSA+LWIQ ATV DIYYYYF+ +GD I +   DP+
Sbjct: 137  GRPFQYLVYNMLLPWAGNVARELHLPSAVLWIQPATVLDIYYYYFNGYGDDIRKNGTDPS 196

Query: 557  YSIELPGLPFSLTSRDLPSFLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPE 736
            YS++LPGLP  L  RDLPSFLL SNTYTFALPSF+EQ E LE+E+NP VLVNT + LEPE
Sbjct: 197  YSLQLPGLPL-LYGRDLPSFLLGSNTYTFALPSFQEQFEALEKESNPRVLVNTFDGLEPE 255

Query: 737  SLRAVDEYDKKLRMIPIGPLIPSAFLDGNDPADTSFGGDTISVSSDY-LEWLDSKTESSV 913
            +LR ++    KL +  +GPLIPSAFLDG DP+D +FGGD    S +Y +EWL+SK  SSV
Sbjct: 256  ALRVIE----KLNLSAVGPLIPSAFLDGKDPSDKAFGGDLFQGSKEYYIEWLNSKPNSSV 311

Query: 914  VYVSFGSLAVLPKRQMEEIARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKI 1093
            +YVSFGS+AVL K+QMEE+AR LLDC  PFLWVI                          
Sbjct: 312  IYVSFGSMAVLAKQQMEEMARGLLDCGRPFLWVIRAKEKGEEETEEERLSCRKELEQK-- 369

Query: 1094 GKIVKWCSQLEVLSHRSLGCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVW 1273
            G IV WCSQ+EVLSH SL CFVTHCGWNS+LESL  GVP+VAFPQW+DQ TNAKLIEDVW
Sbjct: 370  GMIVPWCSQVEVLSHPSLACFVTHCGWNSSLESLVTGVPVVAFPQWSDQGTNAKLIEDVW 429

Query: 1274 KTGVRXXXXXXXXXXXXXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDK 1453
            KTG+R              +RCLE+V G GG++GEE+RRNA++WK LAREA +EGGSS K
Sbjct: 430  KTGLRVTANKDGIVEGDEIKRCLELVAG-GGDRGEELRRNAKKWKELAREAAREGGSSYK 488

Query: 1454 NLRAFLDDVG 1483
            NL+AF++++G
Sbjct: 489  NLKAFVEEIG 498


>XP_008348418.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Malus
            domestica]
          Length = 474

 Score =  522 bits (1344), Expect = e-179
 Identities = 271/480 (56%), Positives = 331/480 (68%), Gaps = 11/480 (2%)
 Frame = +2

Query: 74   MVQHRFLLITYPIQGHINPALQFAKRLA-TMGVPVTFATTVHLHRRLVNKPTTKGLSFAV 250
            MVQHRFLL+TYP QGHINP+LQFAKRL  T G  VTF T++  H R+ N     GL++A 
Sbjct: 1    MVQHRFLLVTYPAQGHINPSLQFAKRLINTTGAHVTFITSLSAHHRIGNGSIPDGLTYAP 60

Query: 251  FXXXXXXXXXXXXXAEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVAT 430
            F               +  Y+SEL+  G++++ D ++S+  EG P+T + YT+LLPW A 
Sbjct: 61   FSDGYDDGFKPGD--NIDHYLSELRHHGAQAITDLVVSSENEGHPYTCMVYTILLPWAAD 118

Query: 431  VARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTY-----SIELPGLPFSLT 595
            VA ELHLP+ LLWIQ ATVFDIYYYYF+   D I   T   T      SIELPGLP SLT
Sbjct: 119  VAHELHLPNVLLWIQPATVFDIYYYYFNGFKDLIRDNTSSGTNDALPCSIELPGLPLSLT 178

Query: 596  SRDLPSFLLASNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLR 775
            SRDLPSF++ +N Y FALP F+EQ+++LE ETNPI+LVNT + LEPE+L+A ++Y+    
Sbjct: 179  SRDLPSFMVDTNPYNFALPLFQEQMDLLERETNPIILVNTFDALEPEALKATEKYN---- 234

Query: 776  MIPIGPLIPSAFLDGNDPADTSFGGDTISVSSD--YLEWLDSKTESSVVYVSFGSLAVLP 949
            +I +GPLIP+ FLDG DP+D SFGGD +  S D  YLEWL+ K E SV+YVSFGS+ VL 
Sbjct: 235  LIGVGPLIPTTFLDGKDPSDKSFGGDLLKKSKDSPYLEWLNLKPEGSVIYVSFGSICVLE 294

Query: 950  KRQMEEIARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKI---GKIVKWCSQ 1120
            K QMEEIA+ LLDC  PFLWVI                        ++   G+IV WCSQ
Sbjct: 295  KAQMEEIAKGLLDCGRPFLWVIRDKVNKKGEDNEAKEEEEMLSCREELEELGRIVPWCSQ 354

Query: 1121 LEVLSHRSLGCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVRXXXX 1300
            +EVLS  SLGCFVTHCGWNS+LES + GVP+VAFPQWTDQ TNAKLIED WKTG+R    
Sbjct: 355  VEVLSSPSLGCFVTHCGWNSSLESFASGVPVVAFPQWTDQGTNAKLIEDAWKTGLRVTPN 414

Query: 1301 XXXXXXXXXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDV 1480
                      +RCLE+VMGS GE GEE+RRNA++WK LAREAV EGGSSDKNL+AFLD +
Sbjct: 415  EKGIVTGDELKRCLELVMGS-GEIGEEMRRNAKKWKDLAREAVSEGGSSDKNLKAFLDRI 473


>AMO27404.1 anthocyanidin 3-o-glucoside 5-o-glucosyltransferase 1-like protein
            [Glycine max]
          Length = 478

 Score =  520 bits (1340), Expect = e-178
 Identities = 278/475 (58%), Positives = 331/475 (69%), Gaps = 6/475 (1%)
 Frame = +2

Query: 74   MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVF 253
            MV  RFLL+TYP Q HINPALQ AKRL  MG  VT   T+H++RR+ NKPT  GLSF  F
Sbjct: 1    MVLQRFLLVTYPAQSHINPALQLAKRLIAMGAHVTILLTLHVYRRISNKPTIPGLSFLPF 60

Query: 254  XXXXXXXXXXXXX--AEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVA 427
                           ++   Y S+LK R S+ L + ILS+A EG+PFT L YTLLLPWVA
Sbjct: 61   SDGYDAGFDALHATDSDFFLYESQLKHRTSDLLSNLILSSASEGRPFTCLLYTLLLPWVA 120

Query: 428  TVARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDL 607
             VAR+ +LP+ALLWI+ ATV DI Y++FH + D+I  +T++   +I LPGL FSL+ RD+
Sbjct: 121  DVARQFYLPTALLWIEPATVLDILYHFFHGYADFINDETKE---NIVLPGLSFSLSPRDV 177

Query: 608  PSFLLA--SNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMI 781
            PSFLL    + ++F LPSF+ Q++ L+ ETNP VLVNT E LE E+LRA+D    K+ MI
Sbjct: 178  PSFLLLWKPSVFSFTLPSFENQIKQLDLETNPTVLVNTFEALEEEALRAID----KINMI 233

Query: 782  PIGPLIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQM 961
            PIGPLIPSAFLDGNDP DTSFGGD   VS+DY+EWLDSK E+SVVYVSFGS   L KRQM
Sbjct: 234  PIGPLIPSAFLDGNDPTDTSFGGDIFQVSNDYVEWLDSKEENSVVYVSFGSYFELSKRQM 293

Query: 962  EEIARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHR 1141
            EEIAR LLDC  PFLWV+                        K GKIV WCSQ+EVLSH 
Sbjct: 294  EEIARGLLDCGRPFLWVV-REKVINGKKEEEEELCCFREELEKWGKIVTWCSQVEVLSHS 352

Query: 1142 SLGCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVR--XXXXXXXXX 1315
            S+GCF+THCGWNST+ESL  GVPMVAFPQWTDQ TNAKLIEDVWK GVR           
Sbjct: 353  SVGCFLTHCGWNSTMESLVSGVPMVAFPQWTDQMTNAKLIEDVWKIGVRVDHHVNANGIV 412

Query: 1316 XXXXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDV 1480
                   CL+VVMGS G++  E R+NA++WK LAR+A KEGGSS+KNLRAF+DDV
Sbjct: 413  EGKEIEACLDVVMGS-GDRASEFRKNAKKWKVLARDAAKEGGSSEKNLRAFVDDV 466


>XP_014634355.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Glycine
            max] KRH42735.1 hypothetical protein GLYMA_08G107600
            [Glycine max]
          Length = 478

 Score =  520 bits (1339), Expect = e-178
 Identities = 278/475 (58%), Positives = 330/475 (69%), Gaps = 6/475 (1%)
 Frame = +2

Query: 74   MVQHRFLLITYPIQGHINPALQFAKRLATMGVPVTFATTVHLHRRLVNKPTTKGLSFAVF 253
            MV  RFLL+TYP Q HINPALQ AKRL  MG  VT   T+H++RR+ NKPT  GLSF  F
Sbjct: 1    MVLQRFLLVTYPAQSHINPALQLAKRLIAMGAHVTILLTLHVYRRISNKPTIPGLSFLPF 60

Query: 254  XXXXXXXXXXXXX--AEVASYISELKRRGSESLRDTILSAAKEGQPFTYLAYTLLLPWVA 427
                           ++   Y S+LK R S+ L + ILS+A EG+PFT L YTLLLPWVA
Sbjct: 61   SDGYDAGFDALHATDSDFFLYESQLKHRTSDLLSNLILSSASEGRPFTCLLYTLLLPWVA 120

Query: 428  TVARELHLPSALLWIQAATVFDIYYYYFHEHGDYITQKTEDPTYSIELPGLPFSLTSRDL 607
             VAR+ +LP+ALLWI+ ATV DI Y++FH + D+I  +T++   +I LPGL FSL+ RD+
Sbjct: 121  DVARQFYLPTALLWIEPATVLDILYHFFHGYADFINDETKE---NIVLPGLSFSLSPRDV 177

Query: 608  PSFLLA--SNTYTFALPSFKEQLEILEEETNPIVLVNTVEELEPESLRAVDEYDKKLRMI 781
            PSFLL    + ++F LPSF+ Q++ L+ ETNP VLVNT E LE E+LRA+D    K+ MI
Sbjct: 178  PSFLLLWKPSVFSFTLPSFENQIKQLDLETNPTVLVNTFEALEEEALRAID----KINMI 233

Query: 782  PIGPLIPSAFLDGNDPADTSFGGDTISVSSDYLEWLDSKTESSVVYVSFGSLAVLPKRQM 961
            PIGPLIPSAFLDGNDP DTSFGGD   VS+DY+EWLDSK E SVVYVSFGS   L KRQM
Sbjct: 234  PIGPLIPSAFLDGNDPTDTSFGGDIFQVSNDYVEWLDSKEEDSVVYVSFGSYFELSKRQM 293

Query: 962  EEIARALLDCKIPFLWVIXXXXXXXXXXXXXXXXXXXXXXXXKIGKIVKWCSQLEVLSHR 1141
            EEIAR LLDC  PFLWV+                        K GKIV WCSQ+EVLSH 
Sbjct: 294  EEIARGLLDCGRPFLWVV-REKVINGKKEEEEELCCFREELEKWGKIVTWCSQVEVLSHS 352

Query: 1142 SLGCFVTHCGWNSTLESLSCGVPMVAFPQWTDQTTNAKLIEDVWKTGVR--XXXXXXXXX 1315
            S+GCF+THCGWNST+ESL  GVPMVAFPQWTDQ TNAKLIEDVWK GVR           
Sbjct: 353  SVGCFLTHCGWNSTMESLVSGVPMVAFPQWTDQMTNAKLIEDVWKIGVRVDHHVNANGIV 412

Query: 1316 XXXXXRRCLEVVMGSGGEKGEEVRRNARRWKSLAREAVKEGGSSDKNLRAFLDDV 1480
                   CL+VVMGS G++  E R+NA++WK LAR+A KEGGSS+KNLRAF+DDV
Sbjct: 413  EGKEIEACLDVVMGS-GDRASEFRKNAKKWKVLARDAAKEGGSSEKNLRAFVDDV 466


Top