BLASTX nr result

ID: Achyranthes23_contig00004318 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00004318
         (1792 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAF75887.1| tetrahydroxychalcone glucosyltransferase [Dianth...   657   0.0  
ref|XP_006369755.1| hypothetical protein POPTR_0001s31040g [Popu...   570   e-160
ref|XP_002298734.2| hypothetical protein POPTR_0001s31120g [Popu...   564   e-158
gb|EOX98312.1| UDP-glycosyltransferase 73B4, putative [Theobroma...   561   e-157
ref|XP_002334266.1| predicted protein [Populus trichocarpa] gi|5...   558   e-156
dbj|BAG71127.1| glucosyltransferase [Phytolacca americana] gi|21...   556   e-156
ref|XP_002518725.1| UDP-glucosyltransferase, putative [Ricinus c...   552   e-154
gb|EMJ01477.1| hypothetical protein PRUPE_ppa014816mg [Prunus pe...   545   e-152
gb|EMJ02187.1| hypothetical protein PRUPE_ppa004924mg [Prunus pe...   545   e-152
ref|XP_004304714.1| PREDICTED: UDP-glucose flavonoid 3-O-glucosy...   543   e-151
gb|EMJ02188.1| hypothetical protein PRUPE_ppa004972mg [Prunus pe...   540   e-151
gb|EOX98313.1| UDP-glucosyl transferase 73B3, putative [Theobrom...   539   e-150
ref|XP_002298733.1| hypothetical protein POPTR_0001s31130g [Popu...   538   e-150
ref|XP_006422969.1| hypothetical protein CICLE_v10028305mg [Citr...   537   e-150
ref|XP_002518724.1| UDP-glucosyltransferase, putative [Ricinus c...   537   e-150
dbj|BAF75890.1| tetrahydroxychalcone glucosyltransferase [Dianth...   536   e-150
ref|XP_006369760.1| hypothetical protein POPTR_0001s31090g [Popu...   536   e-149
gb|AAB36653.1| immediate-early salicylate-induced glucosyltransf...   535   e-149
sp|Q9AT54.1|SCGT_TOBAC RecName: Full=Scopoletin glucosyltransfer...   534   e-149
gb|EOY09906.1| Anthocyanin 3'-O-beta-glucosyltransferase, putati...   533   e-148

>dbj|BAF75887.1| tetrahydroxychalcone glucosyltransferase [Dianthus caryophyllus]
          Length = 489

 Score =  657 bits (1695), Expect = 0.0
 Identities = 323/473 (68%), Positives = 387/473 (81%), Gaps = 3/473 (0%)
 Frame = -1

Query: 1777 MGAEPQRLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNNN 1598
            MG EPQRLHV FFP MAAGHMIPTLDIAKLFA+H+VK TI+TTP NAP F KPLQ+Y N 
Sbjct: 1    MGTEPQRLHVVFFPLMAAGHMIPTLDIAKLFAAHHVKTTIVTTPLNAPTFLKPLQSYTN- 59

Query: 1597 NIIQTLIDVEIIPFPSQEAGLPDGVENFDQFNTLDMSFKFCKATYILEESLENVIKKCKP 1418
              I   IDV++IPFP++EAGLP+GVENF+ F + +MS KF KA  +LEE L  V+++C P
Sbjct: 60   --IGPPIDVQVIPFPAKEAGLPEGVENFEHFTSDEMSLKFLKAAELLEEPLIQVLERCNP 117

Query: 1417 --NCLVADLLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEEFII 1244
              +CLVAD+LLPFAT+VAAKF+IPRLVFHG+ CF+  V  A IKY+PHK + +DDEEF+I
Sbjct: 118  KADCLVADMLLPFATEVAAKFDIPRLVFHGSCCFALSVMDAFIKYQPHKDVSNDDEEFVI 177

Query: 1243 PNLPHQIKITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELEPDY 1064
            P+LPH+IKIT+ QL E ++ ++ D       M+VL RA E+E KSYG+IVN+FYELEP+Y
Sbjct: 178  PHLPHEIKITRMQLNEGVKQNKQDTMW----MDVLGRALESEIKSYGVIVNSFYELEPEY 233

Query: 1063 VDYYKEVMGRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYVCFG 884
             D+Y++VMGRK+WQIGPVSLCNRENEAKFQRGKDSSIDE+ CLKWLDSKKP SV+YVCFG
Sbjct: 234  ADFYRKVMGRKTWQIGPVSLCNRENEAKFQRGKDSSIDENACLKWLDSKKPNSVIYVCFG 293

Query: 883  SLAEVSDSQLHEIALGLEASEQDFIWVVRRSSDEEKDNEDWLPFGFEKRIEGKGLIIRGW 704
            SL EVS  QLHEIA GLEASEQ+F+WV+RRS+   ++ ED  P GFE+R +GKGLIIRGW
Sbjct: 294  SLTEVSLLQLHEIAKGLEASEQNFVWVIRRSNTNGEETEDIFPKGFEERTKGKGLIIRGW 353

Query: 703  APQVLILDHEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSGISI 524
            APQVLILDHEA+G FVTHCGWNSTLEG+SCGVPMVTWP FAEQFY  KLVTEILK+GI +
Sbjct: 354  APQVLILDHEAVGGFVTHCGWNSTLEGISCGVPMVTWPAFAEQFYIEKLVTEILKTGIPV 413

Query: 523  GANEWNRVVDGKNK-EDIKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
            G+  WNR ++   K EDIK  + R+MV EE +EIRSRA K K +ARKA+D GG
Sbjct: 414  GSKHWNRTIECNVKWEDIKEVVRRLMVEEEGMEIRSRALKLKNMARKAIDEGG 466


>ref|XP_006369755.1| hypothetical protein POPTR_0001s31040g [Populus trichocarpa]
            gi|550348597|gb|ERP66324.1| hypothetical protein
            POPTR_0001s31040g [Populus trichocarpa]
          Length = 485

 Score =  570 bits (1470), Expect = e-160
 Identities = 280/475 (58%), Positives = 350/475 (73%), Gaps = 5/475 (1%)
 Frame = -1

Query: 1777 MGAEPQRLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNNN 1598
            MG+   +LH+ F PF A GHMIP++D+AKLFAS  +K TIITTP NAPFF+K +Q     
Sbjct: 1    MGSLGHQLHIFFLPFFAHGHMIPSVDMAKLFASRGIKTTIITTPLNAPFFSKTIQKTKEL 60

Query: 1597 NIIQTLIDVEIIPFPSQEAGLPDGVENFDQF----NTLDMSFKFCKATYILEESLENVIK 1430
                  I++  I FP+ EAGLP+G EN D F    N  +M+ KF KAT  L+   E V++
Sbjct: 61   GFD---INILTIKFPAAEAGLPEGYENTDAFIFSENAREMTIKFIKATTFLQAPFEKVLQ 117

Query: 1429 KCKPNCLVADLLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEEF 1250
            +C P+C+VAD+  P+ATD AAKF IPRLVFHGT+ F+   S  +  YEPHK + SD E F
Sbjct: 118  ECHPDCIVADVFFPWATDAAAKFGIPRLVFHGTSNFALSASECVRLYEPHKKVSSDSEPF 177

Query: 1249 IIPNLPHQIKITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELEP 1070
            ++P+LP  IK+TKKQLP+ +R     E +  D  + L  + EAE +S+G++VN+FYELEP
Sbjct: 178  VVPDLPGDIKLTKKQLPDDVR-----ENVENDFSKFLKASKEAELRSFGVVVNSFYELEP 232

Query: 1069 DYVDYYKEVMGRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYVC 890
             Y DYYK+V+GR++W +GPVSLCNR+ E K  RGK++SID HECLKWLDSKKP SVVY+C
Sbjct: 233  AYADYYKKVLGRRAWNVGPVSLCNRDTEDKAGRGKETSIDHHECLKWLDSKKPNSVVYIC 292

Query: 889  FGSLAEVSDSQLHEIALGLEASEQDFIWVVRRSSDEEKDNEDWLPFGFEKRIEGKGLIIR 710
            FGS    SDSQL EIA GLEAS Q FIWVVRR+   ++D EDWLP GFE+R+EG GLIIR
Sbjct: 293  FGSTTNFSDSQLKEIAAGLEASGQQFIWVVRRNKKGQEDKEDWLPEGFEERMEGVGLIIR 352

Query: 709  GWAPQVLILDHEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSGI 530
            GWAPQVLILDHEAIGAFVTHCGWNSTLEG++ G PMVTWP+FAEQFYN KLVT++LK+G+
Sbjct: 353  GWAPQVLILDHEAIGAFVTHCGWNSTLEGITAGKPMVTWPIFAEQFYNEKLVTDVLKTGV 412

Query: 529  SIGANEWNRV-VDGKNKEDIKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
             +G  EW RV  D    E ++  I+++MVGEE  E+RSRAKK    ARKAV+ GG
Sbjct: 413  GVGVKEWFRVHGDHVKSEAVEKTITQIMVGEEAEEMRSRAKKLGETARKAVEEGG 467


>ref|XP_002298734.2| hypothetical protein POPTR_0001s31120g [Populus trichocarpa]
            gi|550348606|gb|EEE83539.2| hypothetical protein
            POPTR_0001s31120g [Populus trichocarpa]
          Length = 485

 Score =  564 bits (1454), Expect = e-158
 Identities = 276/475 (58%), Positives = 348/475 (73%), Gaps = 5/475 (1%)
 Frame = -1

Query: 1777 MGAEPQRLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNNN 1598
            MG+   +LH+ FFPF A GHMIP++D+AKLFAS  +K TIITTP NAP F+K +Q     
Sbjct: 1    MGSLGHQLHIFFFPFFAHGHMIPSVDMAKLFASRGIKTTIITTPLNAPLFSKTIQKTKEL 60

Query: 1597 NIIQTLIDVEIIPFPSQEAGLPDGVENFDQF----NTLDMSFKFCKATYILEESLENVIK 1430
                  I++  I FP+ EAG P+G EN D F    N   M+ KF KAT +L+   E  ++
Sbjct: 61   GFD---INILTIKFPAAEAGFPEGYENTDTFIFSENARAMTTKFFKATTLLQAPFEKALQ 117

Query: 1429 KCKPNCLVADLLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEEF 1250
            +C P+C+VAD+  P+ATD AAKF IPRLVFHGT+ F+   +  +  YEPHK + SD E F
Sbjct: 118  ECHPDCIVADMFFPWATDAAAKFGIPRLVFHGTSNFALSAAECVRLYEPHKKVSSDSEPF 177

Query: 1249 IIPNLPHQIKITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELEP 1070
            ++P+LP  IK+TKKQLP+ +R     E +  D  ++L  + EAE +S+G++VN+FYELEP
Sbjct: 178  VVPDLPGDIKLTKKQLPDYVR-----ENVENDFSKILKASKEAELRSFGVVVNSFYELEP 232

Query: 1069 DYVDYYKEVMGRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYVC 890
             Y DYYK+V+GR++W +GPVSLCNR+ E K  RGK++SID HECLKWLDSKKP SVVY+C
Sbjct: 233  AYADYYKKVLGRRAWNVGPVSLCNRDTEDKAGRGKETSIDHHECLKWLDSKKPNSVVYIC 292

Query: 889  FGSLAEVSDSQLHEIALGLEASEQDFIWVVRRSSDEEKDNEDWLPFGFEKRIEGKGLIIR 710
            FGS    SDSQL EIA GLEAS Q FIWVVRR+   ++D EDWLP GF +R+EG GLIIR
Sbjct: 293  FGSTTNFSDSQLKEIAAGLEASGQQFIWVVRRNKKGQEDKEDWLPEGFGERMEGVGLIIR 352

Query: 709  GWAPQVLILDHEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSGI 530
            GWAPQVLILDHEAIGAFVTHCGWNSTLEG++ G PMVTWP+FAEQFYN KLVT++LK+G+
Sbjct: 353  GWAPQVLILDHEAIGAFVTHCGWNSTLEGITAGKPMVTWPIFAEQFYNEKLVTDVLKTGV 412

Query: 529  SIGANEWNRV-VDGKNKEDIKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
             +G  EW RV  D    E ++  I+++MVGEE  E+RSRAKK    ARKAV+ GG
Sbjct: 413  GVGVKEWLRVHGDHVKSEAVEKKITQIMVGEEAEEMRSRAKKLGQTARKAVEEGG 467


>gb|EOX98312.1| UDP-glycosyltransferase 73B4, putative [Theobroma cacao]
          Length = 485

 Score =  561 bits (1447), Expect = e-157
 Identities = 275/476 (57%), Positives = 353/476 (74%), Gaps = 6/476 (1%)
 Frame = -1

Query: 1777 MGAEPQRLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNNN 1598
            MG+E  ++H+ FFP MA GHMIPT+D+AK+FA+  VK TI+TTP NA FFTK ++    +
Sbjct: 1    MGSEIPQVHMFFFPLMAHGHMIPTVDMAKVFATRGVKTTIVTTPLNASFFTKTIERSKES 60

Query: 1597 NIIQTLIDVEIIPFPSQEAGLPDGVENFDQFNTL-----DMSFKFCKATYILEESLENVI 1433
             I    I ++I+ FP+ EAGLP+G EN D   T      DM  KF KAT++L+E LE ++
Sbjct: 61   GID---IGIKILKFPAVEAGLPEGCENADLIPTSQDESGDMLGKFFKATFMLQEPLEQLL 117

Query: 1432 KKCKPNCLVADLLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEE 1253
            ++CKP+CLVAD+  P+ATD A KF IPRLVFHGT+ FS C S +M  YEPHK + SD E 
Sbjct: 118  QECKPDCLVADMFFPWATDAANKFGIPRLVFHGTSFFSLCASESMRLYEPHKKVQSDSEP 177

Query: 1252 FIIPNLPHQIKITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELE 1073
            F++PNLP  IK+TKKQLP+ M+     ++   D  +++  + E+E +SYG++VN+FYELE
Sbjct: 178  FVVPNLPGDIKLTKKQLPDYMK-----QDAETDFTKMVKASKESELRSYGVVVNSFYELE 232

Query: 1072 PDYVDYYKEVMGRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYV 893
              Y D Y+ ++GRK+W IGPVSLCNR  E K +RGK S+IDEHECLKWLDSK+P SVVY+
Sbjct: 233  DTYADCYRNILGRKAWHIGPVSLCNRATEDKAERGKKSAIDEHECLKWLDSKEPNSVVYI 292

Query: 892  CFGSLAEVSDSQLHEIALGLEASEQDFIWVVRRSSDEEKDNEDWLPFGFEKRIEGKGLII 713
            CFGS+A  + +QL EIA+ LEASEQ FIWVVR+  + E++ EDWLP GFEKR+EGKGLII
Sbjct: 293  CFGSMANFTSAQLKEIAMALEASEQQFIWVVRKQKNNEEE-EDWLPEGFEKRMEGKGLII 351

Query: 712  RGWAPQVLILDHEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSG 533
            RGWAPQVLILDHEA+G FVTHCGWNSTLEGVS GV MVTWPVFAEQFYN KLVT++LK G
Sbjct: 352  RGWAPQVLILDHEAVGGFVTHCGWNSTLEGVSAGVSMVTWPVFAEQFYNEKLVTQVLKIG 411

Query: 532  ISIGANEWNRVV-DGKNKEDIKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
            + +GA +W R V D   +E I+ A+  +M G+   E+R+RAK     A+ A+  GG
Sbjct: 412  VGVGAQQWARTVGDFVKREAIEKAVKEIMKGDRAEEMRNRAKALAEAAKGAIAKGG 467


>ref|XP_002334266.1| predicted protein [Populus trichocarpa]
            gi|566187274|ref|XP_006379195.1|
            UDP-glucoronosyl/UDP-glucosyl transferase family protein
            [Populus trichocarpa] gi|550331437|gb|ERP56992.1|
            UDP-glucoronosyl/UDP-glucosyl transferase family protein
            [Populus trichocarpa]
          Length = 486

 Score =  558 bits (1439), Expect = e-156
 Identities = 277/476 (58%), Positives = 353/476 (74%), Gaps = 6/476 (1%)
 Frame = -1

Query: 1777 MGAEPQRLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNNN 1598
            MG+   +LH+ FFPF+A GHMIPT+D+AKLFAS  VK TIITTP NAP F+K +Q   + 
Sbjct: 1    MGSLGHQLHIFFFPFLAHGHMIPTVDMAKLFASRGVKTTIITTPLNAPLFSKTIQKTKDL 60

Query: 1597 NIIQTLIDVEIIPFPSQEAGLPDGVENFDQF-----NTLDMSFKFCKATYILEESLENVI 1433
                  ID++ I FP+ EAGLP+G EN D F     N  +M+ KF  AT  L+E  E V+
Sbjct: 61   GFD---IDIQTIKFPAAEAGLPEGCENTDAFITTNENAGEMTKKFFIATTFLQEPFEKVL 117

Query: 1432 KKCKPNCLVADLLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEE 1253
            ++  P+C+VAD+  P+ATD AAKF IPRLVFHGT+ F+     ++  YEPHK + SD E 
Sbjct: 118  QERHPDCVVADMFFPWATDAAAKFGIPRLVFHGTSNFALSAGESVRLYEPHKKVSSDYEP 177

Query: 1252 FIIPNLPHQIKITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELE 1073
            F++PNLP  IK+T+KQLP+ +R     E +  D  +++  + E+E +S+G+I N+FYELE
Sbjct: 178  FVVPNLPGDIKLTRKQLPDFIR-----ENVQNDFTKLVKASKESELRSFGVIFNSFYELE 232

Query: 1072 PDYVDYYKEVMGRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYV 893
            P Y DYY++V+GR++W +GPVSLCNR+ E K  RGK++SID+HECLKWLDSKKP SVVY+
Sbjct: 233  PAYADYYRKVLGRRAWNVGPVSLCNRDIEDKSGRGKEASIDQHECLKWLDSKKPNSVVYI 292

Query: 892  CFGSLAEVSDSQLHEIALGLEASEQDFIWVVRRSSDEEKDNEDWLPFGFEKRIEGKGLII 713
            CFGS+A    SQL EIA GLEAS Q FIWVVRR+ + E+D EDWLP GFE+R+E KGLII
Sbjct: 293  CFGSMASFPASQLKEIATGLEASGQQFIWVVRRNKNSEEDKEDWLPEGFEERMEDKGLII 352

Query: 712  RGWAPQVLILDHEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSG 533
            RGWAPQVLILDHEAIGAFVTHCGWNSTLEG++ G PM+TWPV AEQFYN KLVT++LK+G
Sbjct: 353  RGWAPQVLILDHEAIGAFVTHCGWNSTLEGITAGKPMITWPVSAEQFYNEKLVTDVLKTG 412

Query: 532  ISIGANEWNRV-VDGKNKEDIKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
            + +G  EW RV  D    E ++ AI+++MVGEE  E RSRA K   +ARKAV+ GG
Sbjct: 413  VGVGVKEWVRVRGDHVKSEAVEKAITQIMVGEEGEEKRSRAIKLGEMARKAVEEGG 468


>dbj|BAG71127.1| glucosyltransferase [Phytolacca americana]
            gi|219566998|dbj|BAH05017.1| glucosyltransferase
            [Phytolacca americana]
          Length = 485

 Score =  556 bits (1434), Expect = e-156
 Identities = 276/474 (58%), Positives = 353/474 (74%), Gaps = 4/474 (0%)
 Frame = -1

Query: 1777 MGAEPQRLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNNN 1598
            MGAEPQ+LHV FFP MA GHMIPTLDIA+LFA+ NV+ TIITTP NA  FTK ++    N
Sbjct: 1    MGAEPQQLHVVFFPIMAHGHMIPTLDIARLFAARNVRATIITTPLNAHTFTKAIEMGKKN 60

Query: 1597 NIIQTLIDVEIIPFPSQEAGLPDGVENFDQFNTLDMSFKFCKATYILEESLENVIKKCKP 1418
                  I +E+  FP+Q+ GLP+G EN +Q     +  KF K   +L E LE  ++K +P
Sbjct: 61   G--SPTIHLELFKFPAQDVGLPEGCENLEQALGSSLIEKFFKGVGLLREQLEAYLEKTRP 118

Query: 1417 NCLVADLLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEEFIIPN 1238
            NCLVAD+  P+ATD AAKFNIPRLVFHGT+ FS C    +  YEPHK++ SD+E F +P 
Sbjct: 119  NCLVADMFFPWATDSAAKFNIPRLVFHGTSFFSLCALEVVRLYEPHKNVSSDEELFSLPL 178

Query: 1237 LPHQIKITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELEPDYVD 1058
             PH IK+ + QLPE +  H+  E   K  ++++    E+E KSYG+IVN+FYELEP+Y +
Sbjct: 179  FPHDIKMMRLQLPEDVWKHEKAEG--KTRLKLIK---ESELKSYGVIVNSFYELEPNYAE 233

Query: 1057 YYKEVMGRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYVCFGSL 878
            ++++ +GR++W IGPVSLCNR  E K QRGK +SIDEHECLKWL+SKK  SV+Y+CFGS 
Sbjct: 234  FFRKELGRRAWNIGPVSLCNRSTEDKAQRGKQTSIDEHECLKWLNSKKKNSVIYICFGST 293

Query: 877  AEVSDSQLHEIALGLEASEQDFIWVVRRSSDEEKDNED-WLPFGFEKRIEGKGLIIRGWA 701
            A     QL+EIA+ LEAS Q+FIWVVR +++ + D++D WLP GFE+R+EGKGLIIRGWA
Sbjct: 294  AHQIAPQLYEIAMALEASGQEFIWVVRNNNNNDDDDDDSWLPRGFEQRVEGKGLIIRGWA 353

Query: 700  PQVLILDHEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSGISIG 521
            PQVLIL+HEAIGAFVTHCGWNSTLEG++ GVPMVTWP+FAEQFYN KLV +ILK G+ +G
Sbjct: 354  PQVLILEHEAIGAFVTHCGWNSTLEGITAGVPMVTWPIFAEQFYNEKLVNQILKIGVPVG 413

Query: 520  ANEWNR---VVDGKNKEDIKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
            AN+W+R   + D   K+ I+ A+  +MVG+E  E RSRAKK K +A KAV+ GG
Sbjct: 414  ANKWSRETSIEDVIKKDAIEKALREIMVGDEAEERRSRAKKLKEMAWKAVEEGG 467


>ref|XP_002518725.1| UDP-glucosyltransferase, putative [Ricinus communis]
            gi|223542106|gb|EEF43650.1| UDP-glucosyltransferase,
            putative [Ricinus communis]
          Length = 483

 Score =  552 bits (1423), Expect = e-154
 Identities = 267/474 (56%), Positives = 351/474 (74%), Gaps = 4/474 (0%)
 Frame = -1

Query: 1777 MGAEPQRLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNNN 1598
            MG+E    H+ FFPFMA GHMIPT+D+AKLFAS  +K TI+TTP N  F +KP+Q   N 
Sbjct: 1    MGSEANVPHIFFFPFMAHGHMIPTVDMAKLFASRGLKTTIVTTPLNESFISKPIQRTKNL 60

Query: 1597 NIIQTLIDVEIIPFPSQEAGLPDGVENFDQFNTLDMSF----KFCKATYILEESLENVIK 1430
             +    I+++I+ FP+ EAGLP+G EN D   + +M      KF KA  +L+E LE ++ 
Sbjct: 61   GLE---INIKILKFPTVEAGLPEGCENLDFITSQNMDMEIVNKFLKAIALLQEPLEKLLS 117

Query: 1429 KCKPNCLVADLLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEEF 1250
             C+P+CLVAD+  P+AT+ ++KF IPRLVFHGT+ FS C + +++ +EPHK + SD E F
Sbjct: 118  ACRPDCLVADMFFPWATEASSKFRIPRLVFHGTSFFSLCATISVVLHEPHKKVASDSEPF 177

Query: 1249 IIPNLPHQIKITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELEP 1070
            I+PNLP  IK++ +QLP  MR    D   +   ME    + ++E  S+G++ N+FYELEP
Sbjct: 178  IVPNLPGDIKLSGQQLPGFMRE---DGSYVAKFMEA---SIKSELTSFGVLANSFYELEP 231

Query: 1069 DYVDYYKEVMGRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYVC 890
             Y D+YK V+GR++W IGPVSLCNR+ E K +RGK++SIDEHECLKWL+SKKP SVVY+C
Sbjct: 232  TYADHYKNVLGRRAWHIGPVSLCNRDMEDKARRGKEASIDEHECLKWLNSKKPNSVVYLC 291

Query: 889  FGSLAEVSDSQLHEIALGLEASEQDFIWVVRRSSDEEKDNEDWLPFGFEKRIEGKGLIIR 710
            FG++A  + SQL EIA+ LE+S Q+FIWVVR++ + E+DN+DWLP GFE+RIEGKGLIIR
Sbjct: 292  FGTIANFTASQLKEIAMALESSGQEFIWVVRKNKNPEEDNQDWLPEGFEERIEGKGLIIR 351

Query: 709  GWAPQVLILDHEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSGI 530
            GWAPQV+ILDHEA+G FVTHCGWNSTLEG++ GVPMVTWPV AEQFYN KLVTE+LK G+
Sbjct: 352  GWAPQVMILDHEALGGFVTHCGWNSTLEGIAAGVPMVTWPVGAEQFYNEKLVTEVLKIGV 411

Query: 529  SIGANEWNRVVDGKNKEDIKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
            S+G   W    D   +E I+ AI R+M G E  E+RS+ KK   +AR+AV+ GG
Sbjct: 412  SVGVQHWTVYGDSIKRECIEKAIIRIMEGAEAEEMRSKTKKLGKMAREAVEDGG 465


>gb|EMJ01477.1| hypothetical protein PRUPE_ppa014816mg [Prunus persica]
          Length = 485

 Score =  545 bits (1404), Expect = e-152
 Identities = 268/473 (56%), Positives = 354/473 (74%), Gaps = 3/473 (0%)
 Frame = -1

Query: 1777 MGAEPQRLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNNN 1598
            M ++ + LHV  FPFMA GHMIP  D+AKLFA+  VK TIIT   NAP F+K +++   N
Sbjct: 1    MASQNRELHVFLFPFMAHGHMIPVSDMAKLFAAQGVKTTIITNTLNAPTFSKAIRSRKTN 60

Query: 1597 NIIQTL-IDVEIIPFPSQEAGLPDGVENFDQFNTLDMSFKFCKATYILEESLENVIKKCK 1421
            +    + I+++ I FPSQEAGLP+G EN D   T +++  F KA  +L+  LE ++ + +
Sbjct: 61   SCGCGIEIEIKTIKFPSQEAGLPEGCENLDSLPTPELAGNFFKAMGLLQAPLEQLLLEDQ 120

Query: 1420 PNCLVADLLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEEFIIP 1241
            P CLVAD+  P+ATD AAKF IPRLVFHGT+ F+   S  + +YEP K+  SD E F+IP
Sbjct: 121  PTCLVADMFFPWATDAAAKFGIPRLVFHGTSFFALAASDCVWRYEPFKNTSSDSEPFVIP 180

Query: 1240 NLPHQIKITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELEPDYV 1061
            NLP  I++T+ Q+P+ ++     + I  D   +L ++ EAE +SYGI+VN+FYELEP Y 
Sbjct: 181  NLPGLIRMTRAQVPDFIK-----DNIENDLSRLLKQSKEAEVRSYGIVVNSFYELEPVYA 235

Query: 1060 DYYKEVMGRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYVCFGS 881
            DYY++V+G+K+W IGP+SLCNR+NE K  RGK++SIDEHECLKWLDSKKP SVVYVCFGS
Sbjct: 236  DYYRKVLGKKAWHIGPLSLCNRDNEEKAYRGKEASIDEHECLKWLDSKKPNSVVYVCFGS 295

Query: 880  LAEVSDSQLHEIALGLEASEQDFIWVVRRSSDEEK-DNEDWLPFGFEKRIEGKGLIIRGW 704
            + + ++SQL +IALGLEAS  +FIWVVR+  D++    EDWLP GFE+R+EGKGLIIRGW
Sbjct: 296  VVKFNNSQLKDIALGLEASGLEFIWVVRKGKDDDDVGKEDWLPEGFEERMEGKGLIIRGW 355

Query: 703  APQVLILDHEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSGISI 524
            APQVLILDH A+G FVTHCGWNSTLEG++ G+PMVTWPV AEQFYN KLVT++LK G+++
Sbjct: 356  APQVLILDHGAVGGFVTHCGWNSTLEGIAAGLPMVTWPVAAEQFYNEKLVTQVLKIGVAV 415

Query: 523  GANEWNRVV-DGKNKEDIKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
            GA +W RVV D   KE I+ A++++MVGEE  E+R+RA+     AR+A + GG
Sbjct: 416  GAQKWVRVVGDSVKKEAIEKAVTQMMVGEEAEEMRNRARVLAEQARRANEKGG 468


>gb|EMJ02187.1| hypothetical protein PRUPE_ppa004924mg [Prunus persica]
          Length = 485

 Score =  545 bits (1403), Expect = e-152
 Identities = 272/473 (57%), Positives = 352/473 (74%), Gaps = 3/473 (0%)
 Frame = -1

Query: 1777 MGAEPQRLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNNN 1598
            M ++ +  HV  FPFMA GHMIP  D+AKLFA+  VK TIITTP NAP F+K  ++   N
Sbjct: 1    MCSQNRDFHVFLFPFMAHGHMIPVSDMAKLFAAQGVKTTIITTPLNAPTFSKATRSSKTN 60

Query: 1597 NIIQTLIDVEIIPFPSQEAGLPDGVENFDQFN-TLDMSFKFCKATYILEESLENVIKKCK 1421
            +     I+++ I FPSQEAGLP+G EN D    T  ++  F KA  +L+E LE ++ + +
Sbjct: 61   SG-GIEIEIKTIKFPSQEAGLPEGCENLDSLPPTPVLADSFFKAAGLLQEPLERLLLEDQ 119

Query: 1420 PNCLVADLLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEEFIIP 1241
            P CLVAD+  P+ATD AAKF IPRLVFHGT+ F+   S  + +YEP K+I SD E F+IP
Sbjct: 120  PTCLVADMFFPWATDAAAKFGIPRLVFHGTSFFALAASDCVRRYEPFKNISSDSEPFVIP 179

Query: 1240 NLPHQIKITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELEPDYV 1061
            +LP +IK+T+ Q+P  ++     + I  D   +L ++ EAE +SYGI+VN+FYELEP Y 
Sbjct: 180  DLPGEIKMTRAQVPGFIK-----DNIENDLTRLLKQSKEAEVRSYGIVVNSFYELEPVYA 234

Query: 1060 DYYKEVMGRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYVCFGS 881
            DYY++V+G+K+W IGP+SLCNRENE K  RGK++SIDEHECLKWLDSKKP SVVYVCFGS
Sbjct: 235  DYYRKVLGKKAWHIGPLSLCNRENEEKAYRGKEASIDEHECLKWLDSKKPNSVVYVCFGS 294

Query: 880  LAEVSDSQLHEIALGLEASEQDFIWVVRRSSDE-EKDNEDWLPFGFEKRIEGKGLIIRGW 704
            +A+ ++SQL EIA+GLEAS  DFIWVVR+  D+ +   EDWLP GFE+ +EGKGLIIRGW
Sbjct: 295  VAKFNNSQLKEIAIGLEASGVDFIWVVRKGKDDVDVGKEDWLPEGFEEMMEGKGLIIRGW 354

Query: 703  APQVLILDHEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSGISI 524
            APQVLILDH A+G FVTHCGWNSTLEG++ G+PMVTWPV AEQFYN KLVT++LK G+ +
Sbjct: 355  APQVLILDHGAVGGFVTHCGWNSTLEGIAAGLPMVTWPVSAEQFYNEKLVTQVLKIGVGV 414

Query: 523  GANEWNRVV-DGKNKEDIKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
            G  +W RVV D    E I+ A++++MVGEE  ++RSRAK     AR+A++ GG
Sbjct: 415  GTQKWIRVVGDSVKNEAIEKAVTQIMVGEEAEKMRSRAKGLAEQARRAIETGG 467


>ref|XP_004304714.1| PREDICTED: UDP-glucose flavonoid 3-O-glucosyltransferase 7-like
            [Fragaria vesca subsp. vesca]
          Length = 477

 Score =  543 bits (1398), Expect = e-151
 Identities = 268/472 (56%), Positives = 346/472 (73%), Gaps = 2/472 (0%)
 Frame = -1

Query: 1777 MGAEPQ-RLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNN 1601
            MG+E    +H+  FPFMA GHMIP  D+AKLFASH VKITI+TTP NA  F +  Q+   
Sbjct: 1    MGSESHDSVHIFLFPFMAHGHMIPVSDMAKLFASHGVKITIVTTPLNAIRFAQTTQSSKF 60

Query: 1600 NNIIQTLIDVEIIPFPSQEAGLPDGVENFDQFNTLDMSFKFCKATYILEESLENVIKKCK 1421
            N      I ++ I FPS+EAGLP G EN D   + ++   F KAT +L+   E ++K+ K
Sbjct: 61   N------IQIKAIEFPSEEAGLPKGCENVDTLPSPNLVNPFFKATRLLQPQFEELLKEVK 114

Query: 1420 PNCLVADLLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEEFIIP 1241
            P C+VAD+  P+AT+ AAKF IPRLVFHGT+ F+ C S  +  YEP+  I SD E F+IP
Sbjct: 115  PTCIVADMFFPWATEAAAKFGIPRLVFHGTSFFAMCASDCVKVYEPYNKISSDTEPFVIP 174

Query: 1240 NLPHQIKITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELEPDYV 1061
             LP +I++T+ QLP+ ++ +     ++ D  ++L  A EAE KS+GII+N+FYELEP Y 
Sbjct: 175  YLPGEIELTRAQLPDFIKNN-----VLNDVTQLLKEAREAELKSFGIIMNSFYELEPVYA 229

Query: 1060 DYYKEVMGRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYVCFGS 881
            D+Y+  +GRK+W IGPVSLCNRE E K QRGK+++IDEHECLKWLDSKKP SVVYVCFGS
Sbjct: 230  DFYRNELGRKAWHIGPVSLCNRETEEKVQRGKEATIDEHECLKWLDSKKPDSVVYVCFGS 289

Query: 880  LAEVSDSQLHEIALGLEASEQDFIWVVRRSSDEEKDNEDWLPFGFEKRIEGKGLIIRGWA 701
            +A+ + +QL EIA+ LEA+ QDFIWVVR+  DE    ++WLP GFE+R+EGKGLIIRGWA
Sbjct: 290  VADFNSTQLKEIAMALEAAGQDFIWVVRKGKDE---MDEWLPEGFEERMEGKGLIIRGWA 346

Query: 700  PQVLILDHEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSGISIG 521
            PQVLILDH ++G FVTHCGWNSTLEG+S G+PMVTWPV AEQFYN KLVT++LK G+ +G
Sbjct: 347  PQVLILDHPSVGGFVTHCGWNSTLEGISAGLPMVTWPVSAEQFYNEKLVTQVLKIGVGVG 406

Query: 520  ANEWNRVV-DGKNKEDIKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
              +W R+  D   KE +  A+S++MVGEE  E RSRA++    AR+AV+ GG
Sbjct: 407  TQKWVRLFGDSVKKEAVVKAVSQIMVGEEAEERRSRARELGKQARRAVEEGG 458


>gb|EMJ02188.1| hypothetical protein PRUPE_ppa004972mg [Prunus persica]
          Length = 483

 Score =  540 bits (1391), Expect = e-151
 Identities = 262/472 (55%), Positives = 350/472 (74%), Gaps = 2/472 (0%)
 Frame = -1

Query: 1777 MGAEPQRLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNNN 1598
            M +E +  H+   PFMA GHMIP  D+AKLFA+  VK TIITTP NAP F+K  ++   N
Sbjct: 1    MSSENREFHIFLLPFMAYGHMIPVSDMAKLFAAQGVKTTIITTPLNAPTFSKATRSSKTN 60

Query: 1597 NIIQTLIDVEIIPFPSQEAGLPDGVENFDQFNTLDMSFKFCKATYILEESLENVIKKCKP 1418
            +  +  + ++ I FPSQEAGLP+G EN D   T + +  F KA  +L+E LE ++ + +P
Sbjct: 61   SG-RIEVQIKTIKFPSQEAGLPEGCENLDSLPTPEFANNFSKALGLLQEPLERLLLEDQP 119

Query: 1417 NCLVADLLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEEFIIPN 1238
            +CLVAD+  P+ATD AAKF IPRL+FHGT+ F+   S  + +Y+P K++ SD E F+IPN
Sbjct: 120  SCLVADMFFPWATDAAAKFGIPRLLFHGTSFFTLAASDCVRRYQPFKNMSSDSEPFVIPN 179

Query: 1237 LPHQIKITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELEPDYVD 1058
            LP +IK+T+ Q+P+ ++     E I  D  +++ +A+++E  SYG +VN+FYELEP Y D
Sbjct: 180  LPGEIKMTRAQVPDFLK-----ENIENDFTQLMKQAHDSEVGSYGTVVNSFYELEPVYAD 234

Query: 1057 YYKEVMGRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYVCFGSL 878
            YY++++GRK+W IGP+SLCNR+NE K  RGK+ SIDEHECLKWL+SKKP SVVYVCFGS+
Sbjct: 235  YYRKLLGRKAWHIGPLSLCNRDNEEKSYRGKEVSIDEHECLKWLNSKKPNSVVYVCFGSM 294

Query: 877  AEVSDSQLHEIALGLEASEQDFIWVVRR-SSDEEKDNEDWLPFGFEKRIEGKGLIIRGWA 701
            A  S+SQL EIA GLEA+  +FIWVVRR  +D++   EDWLP GFE+R+EGKGLIIRGWA
Sbjct: 295  ARFSNSQLKEIAAGLEATRLEFIWVVRRGKNDDDVGKEDWLPEGFEERMEGKGLIIRGWA 354

Query: 700  PQVLILDHEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSGISIG 521
            PQVLILDH A+G FVTHCGWNSTLEG++ G+PMVTWP+ AEQFYN KLVT++LK G+ +G
Sbjct: 355  PQVLILDHGAVGGFVTHCGWNSTLEGIAAGLPMVTWPLSAEQFYNDKLVTQVLKIGVGVG 414

Query: 520  ANEWNRVV-DGKNKEDIKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
              +W RV  D   +E I+ A++++MVGEE  E+RSR+K     AR  ++ GG
Sbjct: 415  DQKWVRVEGDSVKREAIEKAVTQIMVGEEAEEMRSRSKGLAEQARGVIEKGG 466


>gb|EOX98313.1| UDP-glucosyl transferase 73B3, putative [Theobroma cacao]
          Length = 485

 Score =  539 bits (1388), Expect = e-150
 Identities = 264/476 (55%), Positives = 346/476 (72%), Gaps = 6/476 (1%)
 Frame = -1

Query: 1777 MGAEPQRLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNNN 1598
            MG+E  ++H+ FFP MA GH+IPT+D+AKLFA+  VK TIITTP NA FF+K +Q    +
Sbjct: 1    MGSEIPQVHIFFFPIMAQGHVIPTVDMAKLFATRGVKTTIITTPANASFFSKTIQRSKES 60

Query: 1597 NIIQTLIDVEIIPFPSQEAGLPDGVENFDQFNTL-----DMSFKFCKATYILEESLENVI 1433
             +    +DV+I+ + ++EAGLP+G EN D   T      D+  KF KA  +L+E LE ++
Sbjct: 61   GLD---VDVKILKYSTEEAGLPEGCENADLLPTSRDEPKDIVSKFFKAKVMLQEPLERLL 117

Query: 1432 KKCKPNCLVADLLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEE 1253
            ++CKP+CLVA +  P+ATD A KF IPRLVF+G + FS C    M  YEPHK + S+ E+
Sbjct: 118  QECKPDCLVAHMFFPWATDAADKFGIPRLVFYGVSVFSTCAMECMTLYEPHKHVESEFEQ 177

Query: 1252 FIIPNLPHQIKITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELE 1073
            F++PNLP  IK++KKQLP+ ++     E +  D  ++L  + E+E +SYG+I N+FYELE
Sbjct: 178  FVVPNLPGDIKLSKKQLPDYVK-----ESVETDFTKMLKASKESELRSYGVIFNSFYELE 232

Query: 1072 PDYVDYYKEVMGRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYV 893
              Y DYYK V+GRK+W IGPVSLCNR  E K +RGK S++DEHEC KWLDS+KP SVVYV
Sbjct: 233  DMYADYYKNVLGRKAWHIGPVSLCNRAIEDKAERGKKSAVDEHECSKWLDSRKPNSVVYV 292

Query: 892  CFGSLAEVSDSQLHEIALGLEASEQDFIWVVRRSSDEEKDNEDWLPFGFEKRIEGKGLII 713
            CFGS+A  + +QL EIA+GLE S Q FIWVVR+     ++ EDWLP GFEKR+EGKGLII
Sbjct: 293  CFGSMANFNSAQLKEIAMGLETSGQQFIWVVRKEKSNGEE-EDWLPEGFEKRMEGKGLII 351

Query: 712  RGWAPQVLILDHEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSG 533
            RGWAPQ+LILDHEA+G FVTHCGWNSTLEG+S GVPM+TWPV AEQFYN K VTEILK G
Sbjct: 352  RGWAPQILILDHEAVGGFVTHCGWNSTLEGISAGVPMITWPVSAEQFYNEKFVTEILKIG 411

Query: 532  ISIGANEW-NRVVDGKNKEDIKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
            +++G  +W + V D   KE I+ A+ ++M G+   E+R++AK    +A+ AV  GG
Sbjct: 412  VAVGVQQWVSTVGDFVKKEAIEKAVKKIMNGKTAKELRNKAKALAEMAKGAVAKGG 467


>ref|XP_002298733.1| hypothetical protein POPTR_0001s31130g [Populus trichocarpa]
            gi|222845991|gb|EEE83538.1| hypothetical protein
            POPTR_0001s31130g [Populus trichocarpa]
          Length = 483

 Score =  538 bits (1387), Expect = e-150
 Identities = 261/475 (54%), Positives = 345/475 (72%), Gaps = 5/475 (1%)
 Frame = -1

Query: 1777 MGAEPQRLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNNN 1598
            MG E  ++H+ FFPFMA GHMIPT+D+AKLFAS  VK TI+TTP NAP  ++ +Q     
Sbjct: 1    MGGEENQVHIFFFPFMAHGHMIPTIDMAKLFASRGVKATIVTTPLNAPLVSRTIQRSKGL 60

Query: 1597 NIIQTLIDVEIIPFPSQEAGLPDGVENFDQFNTLD----MSFKFCKATYILEESLENVIK 1430
                  I+++ I FP+ E GLP+G EN D   + +    M+ K   AT +L++ LE +++
Sbjct: 61   GFD---INIKTIKFPAVEVGLPEGCENADSITSHETQGEMTKKLFMATAMLQQPLEKLLQ 117

Query: 1429 KCKPNCLVADLLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEEF 1250
            +C P+CL+AD+ LP+ TD AAKF IPRLVFHG +CFS C S  + +Y+P+K + SD E F
Sbjct: 118  ECHPDCLIADMFLPWTTDAAAKFGIPRLVFHGISCFSLCTSDCLNRYKPYKKVSSDSELF 177

Query: 1249 IIPNLPHQIKITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELEP 1070
            ++P LP  IK T KQLP+ M+     + +  D   ++ +  E+  KSYGI+VN+FYELE 
Sbjct: 178  VVPELPGDIKFTSKQLPDYMK-----QNVETDFTRLIQKVRESSLKSYGIVVNSFYELES 232

Query: 1069 DYVDYYKEVMGRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYVC 890
            DY +++KE +GRK+W IGPVSLCNRE E K QRGK++SIDEHECLKWLDSKKP SVVY+C
Sbjct: 233  DYANFFKE-LGRKAWHIGPVSLCNREFEDKAQRGKEASIDEHECLKWLDSKKPNSVVYIC 291

Query: 889  FGSLAEVSDSQLHEIALGLEASEQDFIWVVRRSSDEEKDNEDWLPFGFEKRIEGKGLIIR 710
            FG++A  SDSQL EIA+ LEAS Q FIWVVR+   + KDNE+WLP GFEKR+E KGLIIR
Sbjct: 292  FGTVANFSDSQLKEIAIALEASGQQFIWVVRKDK-KAKDNEEWLPEGFEKRMESKGLIIR 350

Query: 709  GWAPQVLILDHEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSGI 530
            GWAPQV+ILDHEAIG FVTHCGWNST+EG++ G PMVTWPV AEQF+N KLVT++LK G+
Sbjct: 351  GWAPQVVILDHEAIGGFVTHCGWNSTIEGIAAGKPMVTWPVSAEQFFNEKLVTDVLKIGV 410

Query: 529  SIGANEWNRVVDGK-NKEDIKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
            ++G  +W  V   K     ++ A++R+M GEE  E+RSR +    +A++A++  G
Sbjct: 411  AVGVQQWVTVYGDKITSGAVEKAVTRIMTGEEAKEMRSRVEALGGMAKRAIEEDG 465


>ref|XP_006422969.1| hypothetical protein CICLE_v10028305mg [Citrus clementina]
            gi|557524903|gb|ESR36209.1| hypothetical protein
            CICLE_v10028305mg [Citrus clementina]
          Length = 490

 Score =  537 bits (1383), Expect = e-150
 Identities = 264/476 (55%), Positives = 348/476 (73%), Gaps = 6/476 (1%)
 Frame = -1

Query: 1777 MGAEPQRLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNNN 1598
            MG++  +LHV FFPFMA GHMIP +D+AKLFA+  VK ++ITTP NAP+ +K ++  N  
Sbjct: 1    MGSKIPQLHVFFFPFMAHGHMIPIVDMAKLFATRGVKASVITTPANAPYVSKSVERANEL 60

Query: 1597 NIIQTLIDVEIIPFPSQEAGLPDGVENFDQFNT---LDMSFKFCKATYILEESLENVIKK 1427
             I    +DV+ I FPS EAGLPDG EN D        ++  KF  AT  L+E LE +++ 
Sbjct: 61   GIE---LDVKTIKFPSVEAGLPDGCENLDAITNEVNKELIVKFLGATTKLQEPLEQLLRD 117

Query: 1426 CKPNCLVADLLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEEFI 1247
             KP+CLVAD+  P+ATD AAKF IPRLVFHGT+ FS C S+ +  YEPHK + SD E F+
Sbjct: 118  HKPDCLVADIFFPWATDAAAKFGIPRLVFHGTSFFSLCASNCLRLYEPHKKVSSDSEPFV 177

Query: 1246 IPNLPHQIKITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELEPD 1067
            +P+ P +IK+T+ QLP+ ++    D ++ +    +L    E+E +SYG+ VN+FYELEP 
Sbjct: 178  MPHFPGEIKLTRNQLPDFVKQDMGDNDLSR----LLKATNESESRSYGVAVNSFYELEPA 233

Query: 1066 YVDYYKEVMGRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYVCF 887
            Y D+Y++ +GR++W IGPVSLCNR  E K  RGK +SIDE ECLKWL+SK+P SVVY+CF
Sbjct: 234  YADHYRKALGRRAWHIGPVSLCNRNFEDKALRGKQASIDELECLKWLNSKQPNSVVYICF 293

Query: 886  GSLAEVSDSQLHEIALGLEASEQDFIWVVRRSSDE--EKDNEDWLPFGFEKRIEGKGLII 713
            GSLA  + +QL EIA GLEAS ++FIWVVR++ ++  E   EDWLP GFEKR+EGKGLII
Sbjct: 294  GSLANFTSAQLMEIATGLEASGRNFIWVVRKNKNDGGEGGKEDWLPEGFEKRMEGKGLII 353

Query: 712  RGWAPQVLILDHEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSG 533
            RGWAPQVLILDHEA+G FVTHCGWNST+E V+ GVP+VTWPV AEQFYN K+V E+LK G
Sbjct: 354  RGWAPQVLILDHEAVGGFVTHCGWNSTIEAVAAGVPLVTWPVSAEQFYNEKMVNEVLKIG 413

Query: 532  ISIGANEWNRVV-DGKNKEDIKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
            + +G  +W R+V D   +E I+ A++ +MVG+   E+RSRAK    +A++AV+ GG
Sbjct: 414  VGVGIQKWCRIVGDFVKREKIEKAVNEIMVGDRAEEMRSRAKALGKMAKRAVENGG 469


>ref|XP_002518724.1| UDP-glucosyltransferase, putative [Ricinus communis]
            gi|223542105|gb|EEF43649.1| UDP-glucosyltransferase,
            putative [Ricinus communis]
          Length = 486

 Score =  537 bits (1383), Expect = e-150
 Identities = 266/470 (56%), Positives = 343/470 (72%), Gaps = 6/470 (1%)
 Frame = -1

Query: 1759 RLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNNNNIIQTL 1580
            +LH+ FFPFMA GH+IPT+D+AKLFAS  VK T+ITTP NA   +K +Q   N+      
Sbjct: 8    QLHIFFFPFMAHGHIIPTIDMAKLFASRGVKSTVITTPLNAKTISKTIQRTKNSGFD--- 64

Query: 1579 IDVEIIPFPSQEAGLPDGVENFDQF----NTLDMSFKFCKATYILEESLENVIKKCKPNC 1412
            ID+ I+ FP+ EAGLP+G EN D      +  D+  KF +A   L++ LEN++ +CKP+C
Sbjct: 65   IDIRILEFPA-EAGLPEGCENMDVIISHQDGKDLVMKFFRAIARLQQPLENLLGECKPDC 123

Query: 1411 LVADLLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEEFIIPNLP 1232
            LVAD+  P+ TD AAKF IPRLVFHG N FS C    +  YEPHK + SD E F+IP LP
Sbjct: 124  LVADMFFPWTTDAAAKFGIPRLVFHGINFFSLCTGECIKLYEPHKKVSSDSEPFVIPYLP 183

Query: 1231 HQIKITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELEPDYVDYY 1052
             +IK T+KQLP+ +R  + +     D ++++    E+E KSYG+IVN+FYELE  Y D+Y
Sbjct: 184  GEIKYTRKQLPDFLRQQEEN-----DFLKMVKAVKESELKSYGVIVNSFYELESVYADFY 238

Query: 1051 KEVMGRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYVCFGSLAE 872
            ++ +GR++W IGP+SLCN   E K QRG++++IDEHEC KWLDSKKP S++Y+CFGSLA 
Sbjct: 239  RKELGRRAWHIGPLSLCNSGIEDKTQRGREATIDEHECTKWLDSKKPNSIIYICFGSLAN 298

Query: 871  VSDSQLHEIALGLEASEQDFIWVVRRS-SDEEKDNEDWLPFGFEKRIEGKGLIIRGWAPQ 695
             + SQL E+A+GLEAS Q FIWVVRR+   +E+D+E+WLP GFE+R+EGKG+IIRGWAPQ
Sbjct: 299  FTASQLMELAVGLEASGQQFIWVVRRNKKSQEEDDEEWLPKGFEERMEGKGMIIRGWAPQ 358

Query: 694  VLILDHEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSGISIGAN 515
            VLILDHEAIG FVTHCGWNSTLEG++ G PMVTWP+ AEQFYN KLVTEILK G  +G  
Sbjct: 359  VLILDHEAIGGFVTHCGWNSTLEGITAGKPMVTWPISAEQFYNEKLVTEILKIGTGVGVK 418

Query: 514  EWNRV-VDGKNKEDIKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
            EW +   D    E ++ AI+R+M GEE  E+RSRAKK   +A  AV+ GG
Sbjct: 419  EWVKFHGDHVTSEAVEKAINRIMTGEEAEEMRSRAKKLAEMAGHAVEEGG 468


>dbj|BAF75890.1| tetrahydroxychalcone glucosyltransferase [Dianthus caryophyllus]
          Length = 483

 Score =  536 bits (1382), Expect = e-150
 Identities = 262/474 (55%), Positives = 350/474 (73%), Gaps = 4/474 (0%)
 Frame = -1

Query: 1777 MGAEPQRLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNNN 1598
            M AEP RLH+  FPF+A GHMIPTLDIA+LFA+ NV+++IITTP NAP FTK ++T N  
Sbjct: 1    MVAEPHRLHIVMFPFLAHGHMIPTLDIARLFAARNVEVSIITTPVNAPIFTKAIETGN-- 58

Query: 1597 NIIQTLIDVEIIPFPSQEAGLPDGVENFD-QFNTLDMSFKFCKATYILEESLENVIKKCK 1421
                 LI+VE+  FP++EAGLP+G EN +      ++  +F KAT++ ++ LE  + + +
Sbjct: 59   ----PLINVELFKFPAKEAGLPEGCENAEIVIRQPELIPQFFKATHLFQQQLEEYLDRVR 114

Query: 1420 PNCLVADLLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEEFIIP 1241
            P+CLVAD+  P+ATD A KFN+PRLVFHG +CF+ C   ++ +YEP++++ SDDE F +P
Sbjct: 115  PDCLVADMFYPWATDSATKFNLPRLVFHGISCFALCAQESVSRYEPYRNVSSDDEPFALP 174

Query: 1240 NLPHQIKITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELEPDYV 1061
             LPH+IK+ + Q+    RG +  E   K + E++    ++E +S+G+I+N+FYELEP+Y 
Sbjct: 175  GLPHEIKLIRSQISPDSRGDK--ENSSKTTTELIN---DSEVESFGVIMNSFYELEPEYA 229

Query: 1060 DYYKEVMGRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYVCFGS 881
            ++Y + MGRK+W IGPVSLCNR N+ K  RGK +SID+HECL WLDSK+P SVVYVCFGS
Sbjct: 230  EFYAKDMGRKAWHIGPVSLCNRSNDQKALRGKRASIDDHECLAWLDSKEPNSVVYVCFGS 289

Query: 880  LAEVSDSQLHEIALGLEASEQDFIWVVRRSSDEEKDNEDWLPFGFEKRIEGKGLIIRGWA 701
             +     QL EIA+ LE S ++FIW VR   + +  NE+WLP GFE+R +GKGLIIRGWA
Sbjct: 290  TSVSIAPQLREIAMALEQSGKNFIWAVRDGGNGK--NEEWLPLGFEERTKGKGLIIRGWA 347

Query: 700  PQVLILDHEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSGISIG 521
            PQVLILDH+A+GAFVTHCGWNSTLEG+S GVPMVTWP+FAEQF+N KLVT +L++G+SIG
Sbjct: 348  PQVLILDHKAVGAFVTHCGWNSTLEGISAGVPMVTWPLFAEQFFNEKLVTNVLRTGVSIG 407

Query: 520  ANEWNR---VVDGKNKEDIKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
              +WNR   V D   +E I+ AI  +M GE+  E+R RAKK K  AR AV+ GG
Sbjct: 408  VKKWNRTPSVEDLITREAIEAAIREIMEGEKAEEMRLRAKKLKEAARNAVEEGG 461


>ref|XP_006369760.1| hypothetical protein POPTR_0001s31090g [Populus trichocarpa]
            gi|550348602|gb|ERP66329.1| hypothetical protein
            POPTR_0001s31090g [Populus trichocarpa]
          Length = 483

 Score =  536 bits (1380), Expect = e-149
 Identities = 260/475 (54%), Positives = 345/475 (72%), Gaps = 5/475 (1%)
 Frame = -1

Query: 1777 MGAEPQRLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNNN 1598
            MG E  ++H+ FFPFMA GHMIPT+D+AKLFAS  VK TI+TTP NAP  ++ +Q     
Sbjct: 1    MGGEENQVHIFFFPFMAHGHMIPTIDMAKLFASRGVKATIVTTPLNAPLVSRTIQRSKGL 60

Query: 1597 NIIQTLIDVEIIPFPSQEAGLPDGVENFDQFNTLD----MSFKFCKATYILEESLENVIK 1430
                  I+++ I FP+ E GLP+G EN D   + +    M+ K   AT +L++ LE +++
Sbjct: 61   GFD---INIKTIKFPAVEVGLPEGCENADSITSHETQGEMTKKVFMATTMLQQPLEKLLQ 117

Query: 1429 KCKPNCLVADLLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEEF 1250
            +C P+CL+AD+ LP+ TD AAKF IPRLVFHG +CFS C S  + +Y+P+K + SD E F
Sbjct: 118  ECHPDCLIADMFLPWTTDAAAKFGIPRLVFHGISCFSLCASDCLNRYKPYKKVSSDSELF 177

Query: 1249 IIPNLPHQIKITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELEP 1070
            ++P LP  IK T KQLP+ M+     + +  D   ++ +  E+  KSYGI+VN+FYELE 
Sbjct: 178  VVPELPGDIKFTSKQLPDYMK-----QNVETDFTRLIQKVRESSLKSYGIVVNSFYELES 232

Query: 1069 DYVDYYKEVMGRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYVC 890
            DY +++KE +GRK+W IGPVSLCNRE E K QRGK++SIDEHECLKWLDSKKP SVVY+C
Sbjct: 233  DYANFFKE-LGRKAWHIGPVSLCNREFEDKAQRGKEASIDEHECLKWLDSKKPNSVVYIC 291

Query: 889  FGSLAEVSDSQLHEIALGLEASEQDFIWVVRRSSDEEKDNEDWLPFGFEKRIEGKGLIIR 710
            FG++ + SDSQL EIA+ LEAS Q FIWVVR+   + KDNE+WLP GFEKR+E KGLIIR
Sbjct: 292  FGTVDKFSDSQLKEIAIALEASGQQFIWVVRKDK-KAKDNEEWLPEGFEKRMESKGLIIR 350

Query: 709  GWAPQVLILDHEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSGI 530
            GWAPQV+ILDHEAIG FVTHCGWNST+EG++ G PMVTWPV AEQF+N KLVT++LK G+
Sbjct: 351  GWAPQVVILDHEAIGGFVTHCGWNSTIEGIAAGKPMVTWPVSAEQFFNEKLVTDVLKIGV 410

Query: 529  SIGANEWNRVVDGKNKED-IKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
            ++G  +W  V   K     ++ A++R+M GEE  E+RSR +    +A++A++  G
Sbjct: 411  AVGVQQWVTVYGDKIASGAVEKAVTRIMTGEEAKEMRSRVEALGGMAKRAIEEDG 465


>gb|AAB36653.1| immediate-early salicylate-induced glucosyltransferase [Nicotiana
            tabacum]
          Length = 476

 Score =  535 bits (1377), Expect = e-149
 Identities = 264/465 (56%), Positives = 345/465 (74%), Gaps = 1/465 (0%)
 Frame = -1

Query: 1759 RLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNNNNIIQTL 1580
            +LH+ FFP MA GHMIPTLD+AKLFAS  VK TIITTP N   F+K +Q    N  +   
Sbjct: 3    QLHIFFFPVMAHGHMIPTLDMAKLFASRGVKATIITTPLNEFVFSKAIQ---RNKHLGIE 59

Query: 1579 IDVEIIPFPSQEAGLPDGVENFDQFNTLDMSFKFCKATYILEESLENVIKKCKPNCLVAD 1400
            I++ +I FP+ E GLP+  E  DQ  + +    F KA  +++E LE +I++C+P+CL++D
Sbjct: 60   IEIRLIKFPAVENGLPEECERLDQIPSDEKLPNFFKAVAMMQEPLEQLIEECRPDCLISD 119

Query: 1399 LLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEEFIIPNLPHQIK 1220
            + LP+ TD AAKFNIPR+VFHGT+ F+ CV +++   +P K++ SD E F++P+LPH+IK
Sbjct: 120  MFLPWTTDTAAKFNIPRIVFHGTSFFALCVENSVRLNKPFKNVSSDSETFVVPDLPHEIK 179

Query: 1219 ITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELEPDYVDYYKEVM 1040
            +T+ Q+    R     EE     M    R  E++ KSYG++ N+FYELE DYV++Y +V+
Sbjct: 180  LTRTQVSPFERS---GEETAMTRMIKTVR--ESDSKSYGVVFNSFYELETDYVEHYTKVL 234

Query: 1039 GRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYVCFGSLAEVSDS 860
            GR++W IGP+S+CNR+ E K +RGK SSID+HECLKWLDSKKP SVVY+CFGS+A  + S
Sbjct: 235  GRRAWAIGPLSMCNRDIEDKAERGKKSSIDKHECLKWLDSKKPSSVVYICFGSVANFTAS 294

Query: 859  QLHEIALGLEASEQDFIWVVRRSSDEEKDNEDWLPFGFEKRIEGKGLIIRGWAPQVLILD 680
            QLHE+A+G+EAS Q+FIWVVR     E DNEDWLP GFE+R + KGLIIRGWAPQVLILD
Sbjct: 295  QLHELAMGVEASGQEFIWVVR----TELDNEDWLPEGFEERTKEKGLIIRGWAPQVLILD 350

Query: 679  HEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSGISIGANEWNR- 503
            HE++GAFVTHCGWNSTLEGVS GVPMVTWPVFAEQF+N KLVTE+LK+G  +G+ +W R 
Sbjct: 351  HESVGAFVTHCGWNSTLEGVSGGVPMVTWPVFAEQFFNEKLVTEVLKTGAGVGSIQWKRS 410

Query: 502  VVDGKNKEDIKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
              +G  +E I  AI RVMV EE    R+RAK +K +ARKA++ GG
Sbjct: 411  ASEGVKREAIAKAIKRVMVSEEADGFRNRAKAYKEMARKAIEEGG 455


>sp|Q9AT54.1|SCGT_TOBAC RecName: Full=Scopoletin glucosyltransferase; AltName:
            Full=Phenylpropanoid:glucosyltransferase 1
            gi|13492674|gb|AAK28303.1|AF346431_1
            phenylpropanoid:glucosyltransferase 1, partial [Nicotiana
            tabacum]
          Length = 476

 Score =  534 bits (1375), Expect = e-149
 Identities = 265/465 (56%), Positives = 344/465 (73%), Gaps = 1/465 (0%)
 Frame = -1

Query: 1759 RLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNNNNIIQTL 1580
            +LH  FFP MA GHMIPTLD+AKLFAS  VK TIITTP N   F+K +Q    N  +   
Sbjct: 3    QLHFFFFPVMAHGHMIPTLDMAKLFASRGVKATIITTPLNEFVFSKAIQ---RNKHLGIE 59

Query: 1579 IDVEIIPFPSQEAGLPDGVENFDQFNTLDMSFKFCKATYILEESLENVIKKCKPNCLVAD 1400
            I++ +I FP+ E GLP+  E  DQ  + +    F KA  +++E LE +I++C+P+CL++D
Sbjct: 60   IEIRLIKFPAVENGLPEECERLDQIPSDEKLPNFFKAVAMMQEPLEQLIEECRPDCLISD 119

Query: 1399 LLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEEFIIPNLPHQIK 1220
            + LP+ TD AAKFNIPR+VFHGT+ F+ CV +++   +P K++ SD E F++P+LPH+IK
Sbjct: 120  MFLPWTTDTAAKFNIPRIVFHGTSFFALCVENSVRLNKPFKNVSSDSETFVVPDLPHEIK 179

Query: 1219 ITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELEPDYVDYYKEVM 1040
            +T+ Q+    R     EE     M    R  E++ KSYG++ N+FYELE DYV++Y +V+
Sbjct: 180  LTRTQVSPFERS---GEETAMTRMIKTVR--ESDSKSYGVVFNSFYELETDYVEHYTKVL 234

Query: 1039 GRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYVCFGSLAEVSDS 860
            GR++W IGP+S+CNR+ E K +RGK SSID+HECLKWLDSKKP SVVYVCFGS+A  + S
Sbjct: 235  GRRAWAIGPLSMCNRDIEDKAERGKKSSIDKHECLKWLDSKKPSSVVYVCFGSVANFTAS 294

Query: 859  QLHEIALGLEASEQDFIWVVRRSSDEEKDNEDWLPFGFEKRIEGKGLIIRGWAPQVLILD 680
            QLHE+A+G+EAS Q+FIWVVR     E DNEDWLP GFE+R + KGLIIRGWAPQVLILD
Sbjct: 295  QLHELAMGIEASGQEFIWVVR----TELDNEDWLPEGFEERTKEKGLIIRGWAPQVLILD 350

Query: 679  HEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSGISIGANEWNR- 503
            HE++GAFVTHCGWNSTLEGVS GVPMVTWPVFAEQF+N KLVTE+LK+G  +G+ +W R 
Sbjct: 351  HESVGAFVTHCGWNSTLEGVSGGVPMVTWPVFAEQFFNEKLVTEVLKTGAGVGSIQWKRS 410

Query: 502  VVDGKNKEDIKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
              +G  +E I  AI RVMV EE    R+RAK +K +ARKA++ GG
Sbjct: 411  ASEGVKREAIAKAIKRVMVSEEADGFRNRAKAYKEMARKAIEEGG 455


>gb|EOY09906.1| Anthocyanin 3'-O-beta-glucosyltransferase, putative [Theobroma cacao]
          Length = 484

 Score =  533 bits (1372), Expect = e-148
 Identities = 262/477 (54%), Positives = 345/477 (72%), Gaps = 7/477 (1%)
 Frame = -1

Query: 1777 MGAEPQRLHVAFFPFMAAGHMIPTLDIAKLFASHNVKITIITTPKNAPFFTKPLQTYNNN 1598
            M +  ++LH  FFP +A GH+IPT+D+A+LFA H VK+TI+TTP NA  F   +Q     
Sbjct: 1    MASNSRQLHFIFFPQLAHGHLIPTVDMARLFAMHGVKVTIVTTPLNALLFASKIQREKQL 60

Query: 1597 NI-IQTLIDVEIIPFPSQEAGLPDGVENFDQFNTLDMSFKFCKATYILEESLENVIKKCK 1421
               I TL+    I FP+ E GLP+G EN     + +M  KF KA  + ++ LE ++++ +
Sbjct: 61   GFDISTLV----IKFPASEVGLPEGCENVSSITSQEMIPKFLKAINLFQQPLERILEELR 116

Query: 1420 PNCLVADLLLPFATDVAAKFNIPRLVFHGTNCFSQCVSHAMIKYEPHKSILSDDEEFIIP 1241
            P+CLVAD + P+ATD+A KF IPRLVFHGT+CF+ CV   +I++EP K I S+ E F +P
Sbjct: 117  PDCLVADWMFPWATDIAGKFGIPRLVFHGTSCFALCVVDTLIRHEPFKKISSESEPFDVP 176

Query: 1240 NLPHQIKITKKQLPEMMRGHQVDEEIIKDSMEVLTRAYEAEEKSYGIIVNTFYELEPDYV 1061
             LP QIK+T+ QLP+ ++     E       +++  A ++E  SYG+IVN+F+ELEP Y 
Sbjct: 177  GLPDQIKMTRLQLPDYIKDTAETER-----QKLIDEAIKSELTSYGVIVNSFHELEPAYT 231

Query: 1060 DYYKEVMGRKSWQIGPVSLCNRENEAKFQRGKDSSIDEHECLKWLDSKKPKSVVYVCFGS 881
             +Y +VM RK+WQ+GPVSLCN  NE K +RG  +SID HECL+WLDSKKP SV+Y+CFGS
Sbjct: 232  QHYSKVMRRKAWQVGPVSLCNMNNEDKAERGNAASIDRHECLRWLDSKKPNSVLYICFGS 291

Query: 880  LAEVSDSQLHEIALGLEASEQDFIWVVRRSSDEEKDNEDWLPFGFEKRIEGKGLIIRGWA 701
            +   S +QL+EIA GLEAS QDFIWVVR+ +DE+K  E+WLP GFE+R+EGKGLIIRGWA
Sbjct: 292  IFRTSAAQLNEIAKGLEASGQDFIWVVRKVNDEDK--EEWLPEGFEERMEGKGLIIRGWA 349

Query: 700  PQVLILDHEAIGAFVTHCGWNSTLEGVSCGVPMVTWPVFAEQFYNAKLVTEILKSGISIG 521
             QVLILDHEA+G F+THCGWNST+E ++ GVPMVTWP+ AEQF N KLVTE+LK G+ +G
Sbjct: 350  AQVLILDHEAVGGFMTHCGWNSTIESITAGVPMVTWPLCAEQFCNEKLVTEVLKIGVDVG 409

Query: 520  ANEWNRVVDGKN------KEDIKMAISRVMVGEECLEIRSRAKKFKYLARKAVDVGG 368
            A EW R  D  +      KEDI+ A+SRVMVGEE  E+RSRAK+ K +ARKA++ GG
Sbjct: 410  AKEWCRWGDDPSTKFKVMKEDIERAVSRVMVGEEAEEMRSRAKELKNMARKAMEEGG 466

