BLASTX nr result

ID: Cornus23_contig00006041 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00006041
         (1364 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AIZ94008.1| UDP-glucose glycoprotein glucosyltransferase [Cam...   676   0.0  
gb|AJA90807.1| UDP glucose: glycoprotein glucosyltransferase pro...   669   0.0  
ref|XP_010657684.1| PREDICTED: UDP-glucose:glycoprotein glucosyl...   640   e-180
ref|XP_010657683.1| PREDICTED: UDP-glucose:glycoprotein glucosyl...   640   e-180
ref|XP_010101162.1| UDP-glucose:glycoprotein glucosyltransferase...   621   e-175
emb|CBI23772.3| unnamed protein product [Vitis vinifera]              613   e-172
emb|CDO97565.1| unnamed protein product [Coffea canephora]            611   e-172
ref|XP_012071315.1| PREDICTED: UDP-glucose:glycoprotein glucosyl...   611   e-172
gb|KHG12185.1| glycoprotein glucosyltransferase -like protein [G...   608   e-171
ref|XP_009348356.1| PREDICTED: UDP-glucose:glycoprotein glucosyl...   608   e-171
gb|KJB77699.1| hypothetical protein B456_012G151700 [Gossypium r...   607   e-171
gb|KJB77698.1| hypothetical protein B456_012G151700 [Gossypium r...   607   e-171
gb|KJB77697.1| hypothetical protein B456_012G151700 [Gossypium r...   607   e-171
ref|XP_012458584.1| PREDICTED: UDP-glucose:glycoprotein glucosyl...   607   e-171
ref|XP_008339491.1| PREDICTED: UDP-glucose:glycoprotein glucosyl...   607   e-171
ref|XP_007042249.1| UDP-glucose:glycoprotein glucosyltransferase...   605   e-170
ref|XP_007042248.1| UDP-glucose:glycoprotein glucosyltransferase...   605   e-170
ref|XP_007042247.1| UDP-glucose:glycoprotein glucosyltransferase...   605   e-170
ref|XP_011006543.1| PREDICTED: UDP-glucose:glycoprotein glucosyl...   598   e-168
ref|XP_011006542.1| PREDICTED: UDP-glucose:glycoprotein glucosyl...   598   e-168

>gb|AIZ94008.1| UDP-glucose glycoprotein glucosyltransferase [Camellia sinensis]
          Length = 1637

 Score =  676 bits (1744), Expect = 0.0
 Identities = 337/400 (84%), Positives = 361/400 (90%), Gaps = 2/400 (0%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFVAFLSLSGHLVSGESRRPKNVQVAVRAKWSGTPLVLEAGELLSKE 1016
            M THFRSG WVL  FV FLSLSG+LVS ESRRPKNVQVA++AKWSGTPL+LEAGELLSKE
Sbjct: 1    MWTHFRSGCWVLFVFVGFLSLSGNLVSVESRRPKNVQVALQAKWSGTPLLLEAGELLSKE 60

Query: 1015 WKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSASPR 836
            WKD FWEFIE+W H  NEDADS TAK CLKKIVKYGQSLL EPLAS+FEFSLTLRS SPR
Sbjct: 61   WKDYFWEFIEVWHH--NEDADSQTAKDCLKKIVKYGQSLLSEPLASLFEFSLTLRSTSPR 118

Query: 835  LVLYRQLAEESLSSFPLTEDMDSNFVNGGISELNENMG--KNEPLLVGINPRSPRGKCCW 662
            LVLYRQLA ESLSSFPL +D++S  VNGGI E NEN+   K EPLLVG+NPRSP G+CCW
Sbjct: 119  LVLYRQLAVESLSSFPLYDDINSQSVNGGIPETNENVESKKVEPLLVGMNPRSPGGECCW 178

Query: 661  LDTGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALGTE 482
            +DTGGA FFD +E  +WL+SP E A DSFQQPELYEFDHI+F+SSI SPVAILYGALGT+
Sbjct: 179  VDTGGAFFFDVSEFQTWLHSPKESARDSFQQPELYEFDHIHFDSSIGSPVAILYGALGTD 238

Query: 481  CFREFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALKNM 302
            CFREFHV LV AAKEGKVKYV RPVLPSGC+SKSGHC AVGT DP+NLGGYGVELALKNM
Sbjct: 239  CFREFHVALVAAAKEGKVKYVARPVLPSGCQSKSGHCAAVGTNDPVNLGGYGVELALKNM 298

Query: 301  EYKAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSSTIS 122
            EYKAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFS+ILERKPEL SEIMAFRDYLLSST+S
Sbjct: 299  EYKAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSRILERKPELTSEIMAFRDYLLSSTVS 358

Query: 121  DTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            DTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFP+VV
Sbjct: 359  DTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPTVV 398


>gb|AJA90807.1| UDP glucose: glycoprotein glucosyltransferase protein [Camellia
            sinensis] gi|741207321|gb|AJA90808.1| UDP glucose:
            glycoprotein glucosyltransferase protein [Camellia
            sinensis] gi|741207323|gb|AJA90809.1| UDP glucose:
            glycoprotein glucosyltransferase protein [Camellia
            sinensis] gi|741207325|gb|AJA90810.1| UDP glucose:
            glycoprotein glucosyltransferase protein [Camellia
            sinensis] gi|741207327|gb|AJA90811.1| UDP glucose:
            glycoprotein glucosyltransferase protein [Camellia
            sinensis]
          Length = 1638

 Score =  669 bits (1726), Expect = 0.0
 Identities = 335/400 (83%), Positives = 358/400 (89%), Gaps = 2/400 (0%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFVAFLSLSGHLVSGESRRPKNVQVAVRAKWSGTPLVLEAGELLSKE 1016
            M THFRSG WVL  FV FLSLSG+LVS ESRRPKNVQVA++AKWSGTPL+LEAGELLSKE
Sbjct: 1    MWTHFRSGCWVLFVFVGFLSLSGNLVSVESRRPKNVQVALQAKWSGTPLLLEAGELLSKE 60

Query: 1015 WKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSASPR 836
            WKD FWEFIE+W H  NEDADS TAK CLKKIVKYGQSLL EPLAS+FEFSLTLRS SPR
Sbjct: 61   WKDYFWEFIEVWHH--NEDADSQTAKDCLKKIVKYGQSLLSEPLASLFEFSLTLRSTSPR 118

Query: 835  LVLYRQLAEESLSSFPLTEDMDSNFVNGGISELNENMG--KNEPLLVGINPRSPRGKCCW 662
            LVLYRQLA ESLSSFPL +D++S  VNGGI E NEN+   K EPLLVG+NP SP GKCCW
Sbjct: 119  LVLYRQLAVESLSSFPLYDDINSQSVNGGIPETNENVESKKVEPLLVGMNPSSPGGKCCW 178

Query: 661  LDTGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALGTE 482
            +DTGGA FF  +E  +WL+S  E A DSFQQPELYEFDHI+F+SSI SPVAILYGALGT+
Sbjct: 179  VDTGGAFFFAVSEFQTWLHSSKESAQDSFQQPELYEFDHIHFDSSIGSPVAILYGALGTD 238

Query: 481  CFREFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALKNM 302
            CFREFHV LV AAKEGKVKYV RPVLPSGC+SKSGHC AVGT DP+NLGGYGVELALKNM
Sbjct: 239  CFREFHVALVAAAKEGKVKYVARPVLPSGCQSKSGHCAAVGTNDPVNLGGYGVELALKNM 298

Query: 301  EYKAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSSTIS 122
            EYKAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFS+ILERKPEL SEIMAFRDYLLSST+S
Sbjct: 299  EYKAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSRILERKPELTSEIMAFRDYLLSSTVS 358

Query: 121  DTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            DTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFP+VV
Sbjct: 359  DTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPTVV 398


>ref|XP_010657684.1| PREDICTED: UDP-glucose:glycoprotein glucosyltransferase isoform X2
            [Vitis vinifera]
          Length = 1583

 Score =  640 bits (1650), Expect = e-180
 Identities = 321/401 (80%), Positives = 352/401 (87%), Gaps = 3/401 (0%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFV-AFLSLSGHLVSGESRRPKNVQVAVRAKWSGTPLVLEAGELLSK 1019
            M THFRSGFWVL+    A L  +G +V+ ++RRPKNVQVAVRAKWSGTPL+LEAGELL+K
Sbjct: 1    MGTHFRSGFWVLVVLACASLCWNGSVVA-DNRRPKNVQVAVRAKWSGTPLLLEAGELLAK 59

Query: 1018 EWKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSASP 839
            E KDLFW FIE+WL  E +DADS TAK CLKKIVKYG SLL E LAS+FEFSLTLRSASP
Sbjct: 60   ERKDLFWRFIEVWLSAEKDDADSFTAKDCLKKIVKYGHSLLSESLASLFEFSLTLRSASP 119

Query: 838  RLVLYRQLAEESLSSFPLTEDMDSNFVNGGISELNENMG--KNEPLLVGINPRSPRGKCC 665
            RLVLYRQLAEESLSSFPLT++ + N + GG SE+NENM   K +P LVG+NP+SP GKCC
Sbjct: 120  RLVLYRQLAEESLSSFPLTDESNPNNIGGGTSEINENMETKKLDPFLVGVNPKSPGGKCC 179

Query: 664  WLDTGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALGT 485
            W+DTGG+LFFD  EL+ WL SP E    SFQ PEL++FDHI+F SS++SPV ILYGALGT
Sbjct: 180  WVDTGGSLFFDGAELLLWLRSPTE--SGSFQPPELFDFDHIHFGSSVSSPVTILYGALGT 237

Query: 484  ECFREFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALKN 305
            +CFREFHV L EAAKEGKVKYVVRPVLPSGCE+K GHCG VGT+DPLNLGGYGVELALKN
Sbjct: 238  DCFREFHVILAEAAKEGKVKYVVRPVLPSGCETKIGHCGVVGTKDPLNLGGYGVELALKN 297

Query: 304  MEYKAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSSTI 125
            MEYKAMDDS IKKGVTLEDP TEDLSQEVRGFIFSKILERKPEL+SEIMAFRDYLLSSTI
Sbjct: 298  MEYKAMDDSMIKKGVTLEDPRTEDLSQEVRGFIFSKILERKPELSSEIMAFRDYLLSSTI 357

Query: 124  SDTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            SDTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV
Sbjct: 358  SDTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 398


>ref|XP_010657683.1| PREDICTED: UDP-glucose:glycoprotein glucosyltransferase isoform X1
            [Vitis vinifera]
          Length = 1642

 Score =  640 bits (1650), Expect = e-180
 Identities = 321/401 (80%), Positives = 352/401 (87%), Gaps = 3/401 (0%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFV-AFLSLSGHLVSGESRRPKNVQVAVRAKWSGTPLVLEAGELLSK 1019
            M THFRSGFWVL+    A L  +G +V+ ++RRPKNVQVAVRAKWSGTPL+LEAGELL+K
Sbjct: 1    MGTHFRSGFWVLVVLACASLCWNGSVVA-DNRRPKNVQVAVRAKWSGTPLLLEAGELLAK 59

Query: 1018 EWKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSASP 839
            E KDLFW FIE+WL  E +DADS TAK CLKKIVKYG SLL E LAS+FEFSLTLRSASP
Sbjct: 60   ERKDLFWRFIEVWLSAEKDDADSFTAKDCLKKIVKYGHSLLSESLASLFEFSLTLRSASP 119

Query: 838  RLVLYRQLAEESLSSFPLTEDMDSNFVNGGISELNENMG--KNEPLLVGINPRSPRGKCC 665
            RLVLYRQLAEESLSSFPLT++ + N + GG SE+NENM   K +P LVG+NP+SP GKCC
Sbjct: 120  RLVLYRQLAEESLSSFPLTDESNPNNIGGGTSEINENMETKKLDPFLVGVNPKSPGGKCC 179

Query: 664  WLDTGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALGT 485
            W+DTGG+LFFD  EL+ WL SP E    SFQ PEL++FDHI+F SS++SPV ILYGALGT
Sbjct: 180  WVDTGGSLFFDGAELLLWLRSPTE--SGSFQPPELFDFDHIHFGSSVSSPVTILYGALGT 237

Query: 484  ECFREFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALKN 305
            +CFREFHV L EAAKEGKVKYVVRPVLPSGCE+K GHCG VGT+DPLNLGGYGVELALKN
Sbjct: 238  DCFREFHVILAEAAKEGKVKYVVRPVLPSGCETKIGHCGVVGTKDPLNLGGYGVELALKN 297

Query: 304  MEYKAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSSTI 125
            MEYKAMDDS IKKGVTLEDP TEDLSQEVRGFIFSKILERKPEL+SEIMAFRDYLLSSTI
Sbjct: 298  MEYKAMDDSMIKKGVTLEDPRTEDLSQEVRGFIFSKILERKPELSSEIMAFRDYLLSSTI 357

Query: 124  SDTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            SDTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV
Sbjct: 358  SDTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 398


>ref|XP_010101162.1| UDP-glucose:glycoprotein glucosyltransferase [Morus notabilis]
            gi|587898963|gb|EXB87380.1| UDP-glucose:glycoprotein
            glucosyltransferase [Morus notabilis]
          Length = 1603

 Score =  621 bits (1602), Expect = e-175
 Identities = 303/401 (75%), Positives = 350/401 (87%), Gaps = 3/401 (0%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFVAFLSLSG-HLVSGESRRPKNVQVAVRAKWSGTPLVLEAGELLSK 1019
            M+T FRSGF VL+  V F+ L G   V  E+RRPKNVQ++V+AKWSGTPL+LEAGELLS 
Sbjct: 1    METRFRSGFCVLIVLV-FVGLCGVRSVCAENRRPKNVQISVQAKWSGTPLLLEAGELLSN 59

Query: 1018 EWKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSASP 839
            EWKD FW+FIE+WLH+EN+DADS++AK CLKKI+++G+SLL EPLAS+FEF+LTLRSASP
Sbjct: 60   EWKDFFWDFIEVWLHSENDDADSYSAKDCLKKILRHGRSLLSEPLASIFEFTLTLRSASP 119

Query: 838  RLVLYRQLAEESLSSFPLTEDMDSNFVNGGISELNENMG--KNEPLLVGINPRSPRGKCC 665
            RLVLYRQLAEESLSSFPLT++   N +  GISE NE +   K++PL VG+NP+SP GKCC
Sbjct: 120  RLVLYRQLAEESLSSFPLTDETTQNSLGEGISETNEQLQTKKSDPLSVGVNPKSPNGKCC 179

Query: 664  WLDTGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALGT 485
            W+D GG LFFD  +L SWL S ++ A DSFQQPEL+EFDHI+ +SS  SPVAILYGALGT
Sbjct: 180  WVDNGGTLFFDVADLRSWLQSSSDPAVDSFQQPELFEFDHIHVHSSAGSPVAILYGALGT 239

Query: 484  ECFREFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALKN 305
            +CFREFH TLVEAAKEGKV+Y VRPVLPSGCE+K GHCG VGTR+ LNLGGYGVELALKN
Sbjct: 240  DCFREFHFTLVEAAKEGKVRYAVRPVLPSGCEAKIGHCGGVGTRNSLNLGGYGVELALKN 299

Query: 304  MEYKAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSSTI 125
            MEYKAMDDS +KKG+TLEDPHTEDLSQEVRGFIFSKILERKPEL SEIMAFRD+LLS+TI
Sbjct: 300  MEYKAMDDSTVKKGITLEDPHTEDLSQEVRGFIFSKILERKPELTSEIMAFRDHLLSTTI 359

Query: 124  SDTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            SD LDVWELKDLGHQ AQRIV ASDPL+SM+EINQNFP++V
Sbjct: 360  SDMLDVWELKDLGHQAAQRIVQASDPLRSMEEINQNFPNIV 400


>emb|CBI23772.3| unnamed protein product [Vitis vinifera]
          Length = 1715

 Score =  613 bits (1581), Expect = e-172
 Identities = 311/399 (77%), Positives = 338/399 (84%), Gaps = 1/399 (0%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFV-AFLSLSGHLVSGESRRPKNVQVAVRAKWSGTPLVLEAGELLSK 1019
            M THFRSGFWVL+    A L  +G +V+ ++RRPKNVQVAVRAKWSGTPL+LEAGELL+K
Sbjct: 1    MGTHFRSGFWVLVVLACASLCWNGSVVA-DNRRPKNVQVAVRAKWSGTPLLLEAGELLAK 59

Query: 1018 EWKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSASP 839
            E KDLFW FIE+WL  E +DADS TAK CLKKIVKYG SLL E LAS+FEFSLTLRSASP
Sbjct: 60   ERKDLFWRFIEVWLSAEKDDADSFTAKDCLKKIVKYGHSLLSESLASLFEFSLTLRSASP 119

Query: 838  RLVLYRQLAEESLSSFPLTEDMDSNFVNGGISELNENMGKNEPLLVGINPRSPRGKCCWL 659
            RLVLYRQLAEESLSSFPLT++                     P LVG+NP+SP GKCCW+
Sbjct: 120  RLVLYRQLAEESLSSFPLTDE--------------------NPFLVGVNPKSPGGKCCWV 159

Query: 658  DTGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALGTEC 479
            DTGG+LFFD  EL+ WL SP E    SFQ PEL++FDHI+F SS++SPV ILYGALGT+C
Sbjct: 160  DTGGSLFFDGAELLLWLRSPTE--SGSFQPPELFDFDHIHFGSSVSSPVTILYGALGTDC 217

Query: 478  FREFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALKNME 299
            FREFHV L EAAKEGKVKYVVRPVLPSGCE+K GHCG VGT+DPLNLGGYGVELALKNME
Sbjct: 218  FREFHVILAEAAKEGKVKYVVRPVLPSGCETKIGHCGVVGTKDPLNLGGYGVELALKNME 277

Query: 298  YKAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSSTISD 119
            YKAMDDS IKKGVTLEDP TEDLSQEVRGFIFSKILERKPEL+SEIMAFRDYLLSSTISD
Sbjct: 278  YKAMDDSMIKKGVTLEDPRTEDLSQEVRGFIFSKILERKPELSSEIMAFRDYLLSSTISD 337

Query: 118  TLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            TLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV
Sbjct: 338  TLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 376


>emb|CDO97565.1| unnamed protein product [Coffea canephora]
          Length = 551

 Score =  611 bits (1576), Expect = e-172
 Identities = 294/400 (73%), Positives = 343/400 (85%), Gaps = 2/400 (0%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFVAFLSLSGHLVSGESRRPKNVQVAVRAKWSGTPLVLEAGELLSKE 1016
            M  HFRSGFW+    + F+ LSG+L+S ++R PKNVQVA+RAKWSGTPL+LEAGELLS +
Sbjct: 1    MKPHFRSGFWLFFVVLLFVGLSGNLISAQTRSPKNVQVALRAKWSGTPLLLEAGELLSSQ 60

Query: 1015 WKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSASPR 836
            WKD +W+F E WL   +ED+ SHTAK CL+ IV YG+SLL +PLASVFEFSLTLRSASPR
Sbjct: 61   WKDFYWDFTEFWLLKGSEDSGSHTAKDCLRTIVNYGKSLLSKPLASVFEFSLTLRSASPR 120

Query: 835  LVLYRQLAEESLSSFPLTEDMDSNFVNGGISELNENMG--KNEPLLVGINPRSPRGKCCW 662
            LVLYRQLAE+SLSSFPL +   ++   GG  E N+N    K EPLL+G+N R+P GKCCW
Sbjct: 121  LVLYRQLAEDSLSSFPLVDYSSASSNEGGF-ETNDNAKSKKVEPLLLGVNSRAPNGKCCW 179

Query: 661  LDTGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALGTE 482
            +DTG AL FDA EL+ WL +P++   D+FQQPEL+EFDH++ +SSI SP+AILYGALGT+
Sbjct: 180  VDTGAALLFDANELLLWLENPDKATTDTFQQPELFEFDHVHPDSSIGSPIAILYGALGTD 239

Query: 481  CFREFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALKNM 302
            CF+EFH  LV  A++GK+ YVVRP+LPSGCESK GHCGA+GTRD +NLGGYGVELALKNM
Sbjct: 240  CFKEFHNVLVGTARQGKITYVVRPILPSGCESKVGHCGAIGTRDAVNLGGYGVELALKNM 299

Query: 301  EYKAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSSTIS 122
            EYKAMDDSA+KKGVTLEDPHTEDLSQ+VRGFIFS+ILERKPEL SE+MAFRDYLLSSTIS
Sbjct: 300  EYKAMDDSAVKKGVTLEDPHTEDLSQDVRGFIFSRILERKPELTSEVMAFRDYLLSSTIS 359

Query: 121  DTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            DTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPS+V
Sbjct: 360  DTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSIV 399


>ref|XP_012071315.1| PREDICTED: UDP-glucose:glycoprotein glucosyltransferase [Jatropha
            curcas] gi|643731599|gb|KDP38843.1| hypothetical protein
            JCGZ_05000 [Jatropha curcas]
          Length = 1644

 Score =  611 bits (1576), Expect = e-172
 Identities = 305/401 (76%), Positives = 346/401 (86%), Gaps = 3/401 (0%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFVAFLSLSGHL-VSGESRRPKNVQVAVRAKWSGTPLVLEAGELLSK 1019
            MD  FRSGF V +  +  +S SG + VSGE+RRPKNVQVAVRAKW GTP++LEA ELLSK
Sbjct: 1    MDIRFRSGFCVFIILIC-VSFSGFVSVSGENRRPKNVQVAVRAKWEGTPVLLEAAELLSK 59

Query: 1018 EWKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSASP 839
            EWKDL+WEFIE+WL  E  +ADSH+AK CLK+I+ +G+SLL + +AS+FEFSL LRSASP
Sbjct: 60   EWKDLYWEFIEVWLRAEEIEADSHSAKDCLKRILNHGKSLLSDQVASLFEFSLILRSASP 119

Query: 838  RLVLYRQLAEESLSSFPLTEDMDSNFVNGGISELNEN--MGKNEPLLVGINPRSPRGKCC 665
            RLVLYRQLAEESLSSFPL +D  S+  +  I+E +E     ++E LLVG+NP+SP GKCC
Sbjct: 120  RLVLYRQLAEESLSSFPLCDDSISSNDSEEIAETSEKNESKRSETLLVGVNPKSPCGKCC 179

Query: 664  WLDTGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALGT 485
            W+DTGGALFFD  EL  WLNSP   AGDSF QPEL++FDH++F S   SPVAILYGALGT
Sbjct: 180  WVDTGGALFFDVAELRLWLNSPVNHAGDSFHQPELFDFDHVHFGSHTRSPVAILYGALGT 239

Query: 484  ECFREFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALKN 305
            +CF+EFHVTLVE+AK+G+VKYVVRPVLP+GCE K GHCGA+G +D LNLGGYGVELALKN
Sbjct: 240  DCFKEFHVTLVESAKQGRVKYVVRPVLPAGCEGKVGHCGAIGAKDSLNLGGYGVELALKN 299

Query: 304  MEYKAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSSTI 125
            MEYKAMDDSAIKKGVTLEDP TEDLSQEVRGFIFSKILERKPEL SEIMAFRDYLLSSTI
Sbjct: 300  MEYKAMDDSAIKKGVTLEDPRTEDLSQEVRGFIFSKILERKPELTSEIMAFRDYLLSSTI 359

Query: 124  SDTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            SDTLDVWELKDLGHQTAQRIVHASDPLQSMQEI+QNFPSVV
Sbjct: 360  SDTLDVWELKDLGHQTAQRIVHASDPLQSMQEISQNFPSVV 400


>gb|KHG12185.1| glycoprotein glucosyltransferase -like protein [Gossypium arboreum]
          Length = 1599

 Score =  608 bits (1569), Expect = e-171
 Identities = 306/398 (76%), Positives = 337/398 (84%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFVAFLSLSGHLVSGESRRPKNVQVAVRAKWSGTPLVLEAGELLSKE 1016
            MDT FRS F +L+     LS     V  ++RRPKNVQVA+RAKWSGTPL+LEAGELLSKE
Sbjct: 1    MDTCFRSRFCILILLTCLLSSGFTFVGAQNRRPKNVQVAIRAKWSGTPLLLEAGELLSKE 60

Query: 1015 WKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSASPR 836
             K+LFWEFI+ WL     D DSH+AK CL KI+K+G SLL E LAS+FEFSLTLRSASPR
Sbjct: 61   SKNLFWEFIDDWLLVGKTDNDSHSAKDCLVKILKHGSSLLSEQLASLFEFSLTLRSASPR 120

Query: 835  LVLYRQLAEESLSSFPLTEDMDSNFVNGGISELNENMGKNEPLLVGINPRSPRGKCCWLD 656
            LVLYRQLAEES+SSFPL++D  S+  +G          K +PLLVG+NP+SPRGKCCW+D
Sbjct: 121  LVLYRQLAEESISSFPLSDDSYSHNASGVDDSEAVGTKKLDPLLVGVNPKSPRGKCCWVD 180

Query: 655  TGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALGTECF 476
             G  LFFD  EL SWL  PNE+ GDSFQQPELY+FDHI+F+S+IASPVAILYGALGTECF
Sbjct: 181  VGEELFFDVAELQSWLLGPNEVNGDSFQQPELYDFDHIHFDSNIASPVAILYGALGTECF 240

Query: 475  REFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALKNMEY 296
            REFHVTLV+AAKEGKVKYVVRPVLPSGCE + G CGAVG RD LNLGGYGVELALKNMEY
Sbjct: 241  REFHVTLVQAAKEGKVKYVVRPVLPSGCEGEVGLCGAVGARDSLNLGGYGVELALKNMEY 300

Query: 295  KAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSSTISDT 116
            KAMDDS +KKGVTLEDP TEDLSQEVRGFIFSKILERKP+L SEIMAFRDYLLSSTISDT
Sbjct: 301  KAMDDSTVKKGVTLEDPRTEDLSQEVRGFIFSKILERKPDLTSEIMAFRDYLLSSTISDT 360

Query: 115  LDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            LDVWELKDLGHQTAQRIV ASDPLQSMQE+NQNFPSVV
Sbjct: 361  LDVWELKDLGHQTAQRIVQASDPLQSMQELNQNFPSVV 398


>ref|XP_009348356.1| PREDICTED: UDP-glucose:glycoprotein glucosyltransferase-like [Pyrus x
            bretschneideri]
          Length = 1633

 Score =  608 bits (1568), Expect = e-171
 Identities = 306/400 (76%), Positives = 346/400 (86%), Gaps = 2/400 (0%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFVAFLSLSGHLVSGESRRPKNVQVAVRAKWSGTPLVLEAGELLSKE 1016
            M   F+S F V++  V   +    LVSG++RRPKNVQ AVRAKWSGTPL+LEAGELLSKE
Sbjct: 1    MRIRFKSAFCVMIVLVCLGASGIGLVSGQNRRPKNVQAAVRAKWSGTPLLLEAGELLSKE 60

Query: 1015 WKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSASPR 836
             KD FW+FI+ W H+E +DA+S+TAKGCLKKIVK+G S+L +PLAS+FEFSL LRS SPR
Sbjct: 61   QKDHFWDFIDAWHHSEKDDAESYTAKGCLKKIVKHGLSILDKPLASLFEFSLMLRSTSPR 120

Query: 835  LVLYRQLAEESLSSFPLTEDMDSNFVNGGISELNENM-GKNEPLL-VGINPRSPRGKCCW 662
            LVLYRQLAEESLSSFPL ++ +S+  +GGISE NE M G+   LL +G NP+SP GKCCW
Sbjct: 121  LVLYRQLAEESLSSFPLVDETNSSN-DGGISETNELMEGQRSDLLNIGRNPKSPNGKCCW 179

Query: 661  LDTGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALGTE 482
            +DTGGALFFD  +L  WL SP + +GDSFQQPEL+EFDHI+F+SSI SPVA+LYGALGT+
Sbjct: 180  VDTGGALFFDPADLKIWLQSPRDFSGDSFQQPELFEFDHIHFDSSIGSPVAVLYGALGTD 239

Query: 481  CFREFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALKNM 302
            CFREFH+TLVEAAKEGK KYVVR VLPSGC++K   CGAVGTRD LNLGGYGVELALKNM
Sbjct: 240  CFREFHLTLVEAAKEGKAKYVVRQVLPSGCDAKIDRCGAVGTRDSLNLGGYGVELALKNM 299

Query: 301  EYKAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSSTIS 122
            EYKAMDDSAIKKGVTLEDP  EDLSQEVRGFIFSKILERKPEL+SEIMAFRDYLLSSTIS
Sbjct: 300  EYKAMDDSAIKKGVTLEDPRIEDLSQEVRGFIFSKILERKPELSSEIMAFRDYLLSSTIS 359

Query: 121  DTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            DTLDVWELKDLGHQTAQRIV ASDPLQ+MQEINQNFPS+V
Sbjct: 360  DTLDVWELKDLGHQTAQRIVQASDPLQAMQEINQNFPSIV 399


>gb|KJB77699.1| hypothetical protein B456_012G151700 [Gossypium raimondii]
          Length = 1673

 Score =  607 bits (1566), Expect = e-171
 Identities = 307/398 (77%), Positives = 336/398 (84%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFVAFLSLSGHLVSGESRRPKNVQVAVRAKWSGTPLVLEAGELLSKE 1016
            MDT FRS F +L+     L      V  ++RRPKNVQVA+RAKWSGTPL+LEAGELLSKE
Sbjct: 1    MDTCFRSRFCILILLTCLLISGFTFVGAQNRRPKNVQVAIRAKWSGTPLLLEAGELLSKE 60

Query: 1015 WKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSASPR 836
             K+LFWEFI+ WL     D DSH+AK CL KI+K+G SLL E LAS+FEFSLTLRSASPR
Sbjct: 61   SKNLFWEFIDDWLLVGKTDDDSHSAKDCLVKILKHGSSLLSEQLASLFEFSLTLRSASPR 120

Query: 835  LVLYRQLAEESLSSFPLTEDMDSNFVNGGISELNENMGKNEPLLVGINPRSPRGKCCWLD 656
            LVLYRQLAEESLSSFPL++D  S+  +G          K +PLLVG+NP+SPRGKCCW+D
Sbjct: 121  LVLYRQLAEESLSSFPLSDDSYSHNASGVDDSEAVVTKKLDPLLVGVNPKSPRGKCCWVD 180

Query: 655  TGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALGTECF 476
             G  LFF+  EL SWL  PNE+ GDSFQQPELYEFDHI+F+S+IASPVAILYGALGTECF
Sbjct: 181  VGEELFFEVAELQSWLLGPNEVNGDSFQQPELYEFDHIHFDSNIASPVAILYGALGTECF 240

Query: 475  REFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALKNMEY 296
            REFHVTLV+AAKEGKVKYVVRPVLPSGCE + G CGAVG RD LNLGGYGVELALKNMEY
Sbjct: 241  REFHVTLVQAAKEGKVKYVVRPVLPSGCEGEVGQCGAVGARDSLNLGGYGVELALKNMEY 300

Query: 295  KAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSSTISDT 116
            KAMDDS +KKGVTLEDP TEDLSQEVRGFIFSKILERKP+L SEIMAFRDYLLSSTISDT
Sbjct: 301  KAMDDSTVKKGVTLEDPRTEDLSQEVRGFIFSKILERKPDLTSEIMAFRDYLLSSTISDT 360

Query: 115  LDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            LDVWELKDLGHQTAQRIV ASDPLQSMQEINQNFPSVV
Sbjct: 361  LDVWELKDLGHQTAQRIVQASDPLQSMQEINQNFPSVV 398


>gb|KJB77698.1| hypothetical protein B456_012G151700 [Gossypium raimondii]
          Length = 1592

 Score =  607 bits (1566), Expect = e-171
 Identities = 307/398 (77%), Positives = 336/398 (84%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFVAFLSLSGHLVSGESRRPKNVQVAVRAKWSGTPLVLEAGELLSKE 1016
            MDT FRS F +L+     L      V  ++RRPKNVQVA+RAKWSGTPL+LEAGELLSKE
Sbjct: 1    MDTCFRSRFCILILLTCLLISGFTFVGAQNRRPKNVQVAIRAKWSGTPLLLEAGELLSKE 60

Query: 1015 WKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSASPR 836
             K+LFWEFI+ WL     D DSH+AK CL KI+K+G SLL E LAS+FEFSLTLRSASPR
Sbjct: 61   SKNLFWEFIDDWLLVGKTDDDSHSAKDCLVKILKHGSSLLSEQLASLFEFSLTLRSASPR 120

Query: 835  LVLYRQLAEESLSSFPLTEDMDSNFVNGGISELNENMGKNEPLLVGINPRSPRGKCCWLD 656
            LVLYRQLAEESLSSFPL++D  S+  +G          K +PLLVG+NP+SPRGKCCW+D
Sbjct: 121  LVLYRQLAEESLSSFPLSDDSYSHNASGVDDSEAVVTKKLDPLLVGVNPKSPRGKCCWVD 180

Query: 655  TGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALGTECF 476
             G  LFF+  EL SWL  PNE+ GDSFQQPELYEFDHI+F+S+IASPVAILYGALGTECF
Sbjct: 181  VGEELFFEVAELQSWLLGPNEVNGDSFQQPELYEFDHIHFDSNIASPVAILYGALGTECF 240

Query: 475  REFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALKNMEY 296
            REFHVTLV+AAKEGKVKYVVRPVLPSGCE + G CGAVG RD LNLGGYGVELALKNMEY
Sbjct: 241  REFHVTLVQAAKEGKVKYVVRPVLPSGCEGEVGQCGAVGARDSLNLGGYGVELALKNMEY 300

Query: 295  KAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSSTISDT 116
            KAMDDS +KKGVTLEDP TEDLSQEVRGFIFSKILERKP+L SEIMAFRDYLLSSTISDT
Sbjct: 301  KAMDDSTVKKGVTLEDPRTEDLSQEVRGFIFSKILERKPDLTSEIMAFRDYLLSSTISDT 360

Query: 115  LDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            LDVWELKDLGHQTAQRIV ASDPLQSMQEINQNFPSVV
Sbjct: 361  LDVWELKDLGHQTAQRIVQASDPLQSMQEINQNFPSVV 398


>gb|KJB77697.1| hypothetical protein B456_012G151700 [Gossypium raimondii]
          Length = 1553

 Score =  607 bits (1566), Expect = e-171
 Identities = 307/398 (77%), Positives = 336/398 (84%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFVAFLSLSGHLVSGESRRPKNVQVAVRAKWSGTPLVLEAGELLSKE 1016
            MDT FRS F +L+     L      V  ++RRPKNVQVA+RAKWSGTPL+LEAGELLSKE
Sbjct: 1    MDTCFRSRFCILILLTCLLISGFTFVGAQNRRPKNVQVAIRAKWSGTPLLLEAGELLSKE 60

Query: 1015 WKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSASPR 836
             K+LFWEFI+ WL     D DSH+AK CL KI+K+G SLL E LAS+FEFSLTLRSASPR
Sbjct: 61   SKNLFWEFIDDWLLVGKTDDDSHSAKDCLVKILKHGSSLLSEQLASLFEFSLTLRSASPR 120

Query: 835  LVLYRQLAEESLSSFPLTEDMDSNFVNGGISELNENMGKNEPLLVGINPRSPRGKCCWLD 656
            LVLYRQLAEESLSSFPL++D  S+  +G          K +PLLVG+NP+SPRGKCCW+D
Sbjct: 121  LVLYRQLAEESLSSFPLSDDSYSHNASGVDDSEAVVTKKLDPLLVGVNPKSPRGKCCWVD 180

Query: 655  TGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALGTECF 476
             G  LFF+  EL SWL  PNE+ GDSFQQPELYEFDHI+F+S+IASPVAILYGALGTECF
Sbjct: 181  VGEELFFEVAELQSWLLGPNEVNGDSFQQPELYEFDHIHFDSNIASPVAILYGALGTECF 240

Query: 475  REFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALKNMEY 296
            REFHVTLV+AAKEGKVKYVVRPVLPSGCE + G CGAVG RD LNLGGYGVELALKNMEY
Sbjct: 241  REFHVTLVQAAKEGKVKYVVRPVLPSGCEGEVGQCGAVGARDSLNLGGYGVELALKNMEY 300

Query: 295  KAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSSTISDT 116
            KAMDDS +KKGVTLEDP TEDLSQEVRGFIFSKILERKP+L SEIMAFRDYLLSSTISDT
Sbjct: 301  KAMDDSTVKKGVTLEDPRTEDLSQEVRGFIFSKILERKPDLTSEIMAFRDYLLSSTISDT 360

Query: 115  LDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            LDVWELKDLGHQTAQRIV ASDPLQSMQEINQNFPSVV
Sbjct: 361  LDVWELKDLGHQTAQRIVQASDPLQSMQEINQNFPSVV 398


>ref|XP_012458584.1| PREDICTED: UDP-glucose:glycoprotein glucosyltransferase-like
            [Gossypium raimondii] gi|763810793|gb|KJB77695.1|
            hypothetical protein B456_012G151700 [Gossypium
            raimondii]
          Length = 1641

 Score =  607 bits (1566), Expect = e-171
 Identities = 307/398 (77%), Positives = 336/398 (84%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFVAFLSLSGHLVSGESRRPKNVQVAVRAKWSGTPLVLEAGELLSKE 1016
            MDT FRS F +L+     L      V  ++RRPKNVQVA+RAKWSGTPL+LEAGELLSKE
Sbjct: 1    MDTCFRSRFCILILLTCLLISGFTFVGAQNRRPKNVQVAIRAKWSGTPLLLEAGELLSKE 60

Query: 1015 WKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSASPR 836
             K+LFWEFI+ WL     D DSH+AK CL KI+K+G SLL E LAS+FEFSLTLRSASPR
Sbjct: 61   SKNLFWEFIDDWLLVGKTDDDSHSAKDCLVKILKHGSSLLSEQLASLFEFSLTLRSASPR 120

Query: 835  LVLYRQLAEESLSSFPLTEDMDSNFVNGGISELNENMGKNEPLLVGINPRSPRGKCCWLD 656
            LVLYRQLAEESLSSFPL++D  S+  +G          K +PLLVG+NP+SPRGKCCW+D
Sbjct: 121  LVLYRQLAEESLSSFPLSDDSYSHNASGVDDSEAVVTKKLDPLLVGVNPKSPRGKCCWVD 180

Query: 655  TGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALGTECF 476
             G  LFF+  EL SWL  PNE+ GDSFQQPELYEFDHI+F+S+IASPVAILYGALGTECF
Sbjct: 181  VGEELFFEVAELQSWLLGPNEVNGDSFQQPELYEFDHIHFDSNIASPVAILYGALGTECF 240

Query: 475  REFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALKNMEY 296
            REFHVTLV+AAKEGKVKYVVRPVLPSGCE + G CGAVG RD LNLGGYGVELALKNMEY
Sbjct: 241  REFHVTLVQAAKEGKVKYVVRPVLPSGCEGEVGQCGAVGARDSLNLGGYGVELALKNMEY 300

Query: 295  KAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSSTISDT 116
            KAMDDS +KKGVTLEDP TEDLSQEVRGFIFSKILERKP+L SEIMAFRDYLLSSTISDT
Sbjct: 301  KAMDDSTVKKGVTLEDPRTEDLSQEVRGFIFSKILERKPDLTSEIMAFRDYLLSSTISDT 360

Query: 115  LDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            LDVWELKDLGHQTAQRIV ASDPLQSMQEINQNFPSVV
Sbjct: 361  LDVWELKDLGHQTAQRIVQASDPLQSMQEINQNFPSVV 398


>ref|XP_008339491.1| PREDICTED: UDP-glucose:glycoprotein glucosyltransferase [Malus
            domestica]
          Length = 1633

 Score =  607 bits (1565), Expect = e-171
 Identities = 305/400 (76%), Positives = 344/400 (86%), Gaps = 2/400 (0%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFVAFLSLSGHLVSGESRRPKNVQVAVRAKWSGTPLVLEAGELLSKE 1016
            M T F+S F  ++  V   +    LVSG++RRPKNVQ AVRAKWSGTPL+LEAGELLSKE
Sbjct: 1    MRTRFKSAFCAVIVLVCLGASGIGLVSGQNRRPKNVQAAVRAKWSGTPLLLEAGELLSKE 60

Query: 1015 WKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSASPR 836
             KD FW+FI+ W H+E +DA+S+TAKGCLKKIVK+G S+L EPLAS+FEFSL LRS SPR
Sbjct: 61   QKDHFWDFIDAWHHSEKDDAESYTAKGCLKKIVKHGLSILNEPLASLFEFSLMLRSTSPR 120

Query: 835  LVLYRQLAEESLSSFPLTEDMDSNFVNGGISELNENM-GKNEPLL-VGINPRSPRGKCCW 662
            LVLYRQLAEE+LSSFPL ++ +S+  + GISE NE M GK   LL +G NP+SP GKCCW
Sbjct: 121  LVLYRQLAEEALSSFPLVDETNSSS-DSGISETNELMEGKRSDLLNIGRNPKSPNGKCCW 179

Query: 661  LDTGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALGTE 482
            +DTGGALFFD  +L  WL SP + +GDSFQQPEL+EFDHI+F+SS+ SPVA+LYGALGT+
Sbjct: 180  VDTGGALFFDPADLKIWLQSPRDSSGDSFQQPELFEFDHIHFDSSVGSPVAVLYGALGTD 239

Query: 481  CFREFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALKNM 302
            CFREFH+TLVEAAKEGK KYVVR VLPSGC+ K   CGAVGTRD LNLGGYGVELALKNM
Sbjct: 240  CFREFHLTLVEAAKEGKAKYVVRQVLPSGCDXKIDRCGAVGTRDSLNLGGYGVELALKNM 299

Query: 301  EYKAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSSTIS 122
            EYKAMDDSAIKKGVTLEDP  EDLSQEVRGFIFSKILERKPEL+SEIMAFRDYLLSSTIS
Sbjct: 300  EYKAMDDSAIKKGVTLEDPRIEDLSQEVRGFIFSKILERKPELSSEIMAFRDYLLSSTIS 359

Query: 121  DTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            DTLDVWELKDLGHQTAQRIV ASDPLQ+MQEINQNFPS+V
Sbjct: 360  DTLDVWELKDLGHQTAQRIVQASDPLQAMQEINQNFPSIV 399


>ref|XP_007042249.1| UDP-glucose:glycoprotein glucosyltransferases,transferases isoform 3
            [Theobroma cacao] gi|508706184|gb|EOX98080.1|
            UDP-glucose:glycoprotein
            glucosyltransferases,transferases isoform 3 [Theobroma
            cacao]
          Length = 1353

 Score =  605 bits (1559), Expect = e-170
 Identities = 305/399 (76%), Positives = 338/399 (84%), Gaps = 1/399 (0%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFVAFLSLSGHLVSGESRRPKNVQVAVRAKWSGTPLVLEAGELLSKE 1016
            M+T FRS   +L+     +      V  ++RRPKNVQ A+RAKWSGTPL+LEAGELLSKE
Sbjct: 1    METRFRSRLCILIVLACVIFCGFTSVGAQNRRPKNVQAAIRAKWSGTPLLLEAGELLSKE 60

Query: 1015 WKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSASPR 836
             K+LFWEF + WLH      DSH+AK CLKKI+K+G SLL E L+S+FEFSLTLRSASPR
Sbjct: 61   SKNLFWEFFDDWLHVAKTGGDSHSAKDCLKKILKHGSSLLSETLSSLFEFSLTLRSASPR 120

Query: 835  LVLYRQLAEESLSSFPLTEDMDSNFVNG-GISELNENMGKNEPLLVGINPRSPRGKCCWL 659
            LVLYRQLAEESLSSFPL +D  SN VNG   SE  E + K +PLLVGINPRSP GKCCW+
Sbjct: 121  LVLYRQLAEESLSSFPLGDDSYSNNVNGLDASETLETI-KLDPLLVGINPRSPGGKCCWV 179

Query: 658  DTGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALGTEC 479
            DTGGALFFD  EL+ WL  PNEL  DSFQQPELY+FDHI+F+S+I SPVAILYGALGT C
Sbjct: 180  DTGGALFFDVAELLLWLQRPNELGVDSFQQPELYDFDHIHFDSNIMSPVAILYGALGTNC 239

Query: 478  FREFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALKNME 299
            F+EFHVTLV+AAKEGKVKYVVRPVLPSGCE++ G CGAVG RD LNLGGYGVELALKNME
Sbjct: 240  FKEFHVTLVQAAKEGKVKYVVRPVLPSGCEAEVGLCGAVGARDSLNLGGYGVELALKNME 299

Query: 298  YKAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSSTISD 119
            YKA+DDS +KKGVTLEDP TEDLSQEVRGFIFSK+LERKPEL SEIMAFRDYL+SSTISD
Sbjct: 300  YKAIDDSTVKKGVTLEDPRTEDLSQEVRGFIFSKMLERKPELTSEIMAFRDYLMSSTISD 359

Query: 118  TLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            TLDVWELKDLGHQTAQRIV ASDPLQSMQEI+QNFPSVV
Sbjct: 360  TLDVWELKDLGHQTAQRIVQASDPLQSMQEISQNFPSVV 398


>ref|XP_007042248.1| UDP-glucose:glycoprotein glucosyltransferase isoform 2 [Theobroma
            cacao] gi|508706183|gb|EOX98079.1|
            UDP-glucose:glycoprotein glucosyltransferase isoform 2
            [Theobroma cacao]
          Length = 1518

 Score =  605 bits (1559), Expect = e-170
 Identities = 305/399 (76%), Positives = 338/399 (84%), Gaps = 1/399 (0%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFVAFLSLSGHLVSGESRRPKNVQVAVRAKWSGTPLVLEAGELLSKE 1016
            M+T FRS   +L+     +      V  ++RRPKNVQ A+RAKWSGTPL+LEAGELLSKE
Sbjct: 1    METRFRSRLCILIVLACVIFCGFTSVGAQNRRPKNVQAAIRAKWSGTPLLLEAGELLSKE 60

Query: 1015 WKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSASPR 836
             K+LFWEF + WLH      DSH+AK CLKKI+K+G SLL E L+S+FEFSLTLRSASPR
Sbjct: 61   SKNLFWEFFDDWLHVAKTGGDSHSAKDCLKKILKHGSSLLSETLSSLFEFSLTLRSASPR 120

Query: 835  LVLYRQLAEESLSSFPLTEDMDSNFVNG-GISELNENMGKNEPLLVGINPRSPRGKCCWL 659
            LVLYRQLAEESLSSFPL +D  SN VNG   SE  E + K +PLLVGINPRSP GKCCW+
Sbjct: 121  LVLYRQLAEESLSSFPLGDDSYSNNVNGLDASETLETI-KLDPLLVGINPRSPGGKCCWV 179

Query: 658  DTGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALGTEC 479
            DTGGALFFD  EL+ WL  PNEL  DSFQQPELY+FDHI+F+S+I SPVAILYGALGT C
Sbjct: 180  DTGGALFFDVAELLLWLQRPNELGVDSFQQPELYDFDHIHFDSNIMSPVAILYGALGTNC 239

Query: 478  FREFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALKNME 299
            F+EFHVTLV+AAKEGKVKYVVRPVLPSGCE++ G CGAVG RD LNLGGYGVELALKNME
Sbjct: 240  FKEFHVTLVQAAKEGKVKYVVRPVLPSGCEAEVGLCGAVGARDSLNLGGYGVELALKNME 299

Query: 298  YKAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSSTISD 119
            YKA+DDS +KKGVTLEDP TEDLSQEVRGFIFSK+LERKPEL SEIMAFRDYL+SSTISD
Sbjct: 300  YKAIDDSTVKKGVTLEDPRTEDLSQEVRGFIFSKMLERKPELTSEIMAFRDYLMSSTISD 359

Query: 118  TLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            TLDVWELKDLGHQTAQRIV ASDPLQSMQEI+QNFPSVV
Sbjct: 360  TLDVWELKDLGHQTAQRIVQASDPLQSMQEISQNFPSVV 398


>ref|XP_007042247.1| UDP-glucose:glycoprotein glucosyltransferase isoform 1 [Theobroma
            cacao] gi|508706182|gb|EOX98078.1|
            UDP-glucose:glycoprotein glucosyltransferase isoform 1
            [Theobroma cacao]
          Length = 1639

 Score =  605 bits (1559), Expect = e-170
 Identities = 305/399 (76%), Positives = 338/399 (84%), Gaps = 1/399 (0%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFVAFLSLSGHLVSGESRRPKNVQVAVRAKWSGTPLVLEAGELLSKE 1016
            M+T FRS   +L+     +      V  ++RRPKNVQ A+RAKWSGTPL+LEAGELLSKE
Sbjct: 1    METRFRSRLCILIVLACVIFCGFTSVGAQNRRPKNVQAAIRAKWSGTPLLLEAGELLSKE 60

Query: 1015 WKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSASPR 836
             K+LFWEF + WLH      DSH+AK CLKKI+K+G SLL E L+S+FEFSLTLRSASPR
Sbjct: 61   SKNLFWEFFDDWLHVAKTGGDSHSAKDCLKKILKHGSSLLSETLSSLFEFSLTLRSASPR 120

Query: 835  LVLYRQLAEESLSSFPLTEDMDSNFVNG-GISELNENMGKNEPLLVGINPRSPRGKCCWL 659
            LVLYRQLAEESLSSFPL +D  SN VNG   SE  E + K +PLLVGINPRSP GKCCW+
Sbjct: 121  LVLYRQLAEESLSSFPLGDDSYSNNVNGLDASETLETI-KLDPLLVGINPRSPGGKCCWV 179

Query: 658  DTGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALGTEC 479
            DTGGALFFD  EL+ WL  PNEL  DSFQQPELY+FDHI+F+S+I SPVAILYGALGT C
Sbjct: 180  DTGGALFFDVAELLLWLQRPNELGVDSFQQPELYDFDHIHFDSNIMSPVAILYGALGTNC 239

Query: 478  FREFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALKNME 299
            F+EFHVTLV+AAKEGKVKYVVRPVLPSGCE++ G CGAVG RD LNLGGYGVELALKNME
Sbjct: 240  FKEFHVTLVQAAKEGKVKYVVRPVLPSGCEAEVGLCGAVGARDSLNLGGYGVELALKNME 299

Query: 298  YKAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSSTISD 119
            YKA+DDS +KKGVTLEDP TEDLSQEVRGFIFSK+LERKPEL SEIMAFRDYL+SSTISD
Sbjct: 300  YKAIDDSTVKKGVTLEDPRTEDLSQEVRGFIFSKMLERKPELTSEIMAFRDYLMSSTISD 359

Query: 118  TLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            TLDVWELKDLGHQTAQRIV ASDPLQSMQEI+QNFPSVV
Sbjct: 360  TLDVWELKDLGHQTAQRIVQASDPLQSMQEISQNFPSVV 398


>ref|XP_011006543.1| PREDICTED: UDP-glucose:glycoprotein glucosyltransferase isoform X2
            [Populus euphratica]
          Length = 1640

 Score =  598 bits (1541), Expect = e-168
 Identities = 296/402 (73%), Positives = 340/402 (84%), Gaps = 4/402 (0%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFVAFLSLSGH--LVSGESRRPKNVQVAVRAKWSGTPLVLEAGELLS 1022
            M+T FRSG  VL+     +   G   +  GE+RRPKNVQVAVRAKW GTP++LEAGELLS
Sbjct: 1    METRFRSGSCVLVILFCVVGFCGFGSVSCGENRRPKNVQVAVRAKWEGTPILLEAGELLS 60

Query: 1021 KEWKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSAS 842
            KE KD++WEFI+ WLH++ ED DS+TAK CLKKI+K+G  LL + LAS+F+FSL LRSAS
Sbjct: 61   KERKDIYWEFIDSWLHSKKEDNDSYTAKDCLKKIMKHGHGLLSDTLASLFDFSLILRSAS 120

Query: 841  PRLVLYRQLAEESLSSFPLTEDMDSNFVNGGISELNEN--MGKNEPLLVGINPRSPRGKC 668
            PRLVLYRQLAEESLSSFPL +D  SN  +GG+++ N+   + +++PLLVG NP  P GKC
Sbjct: 121  PRLVLYRQLAEESLSSFPLLDDSFSNSASGGLAKTNDTNEIKRSDPLLVGRNPEIPGGKC 180

Query: 667  CWLDTGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALG 488
            CW+DTG ALF+D  +L+ WL+SP+ + GDSFQQPEL++FDH++F S   SPV ILYGALG
Sbjct: 181  CWVDTGAALFYDVADLLLWLHSPSGMEGDSFQQPELFDFDHVHFESLSGSPVTILYGALG 240

Query: 487  TECFREFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALK 308
            T+CF+EFH  L+EAAK+GKVKYVVRPVLPSGCESK G C AVG  D LNLGGYGVELA+K
Sbjct: 241  TDCFKEFHSALMEAAKQGKVKYVVRPVLPSGCESKVGRCVAVGASDSLNLGGYGVELAMK 300

Query: 307  NMEYKAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSST 128
            NMEYKAMDDSAIKKGVTLEDP TEDLSQEVRGFIFSKILERKPEL SEIMAFRDYLLSST
Sbjct: 301  NMEYKAMDDSAIKKGVTLEDPRTEDLSQEVRGFIFSKILERKPELTSEIMAFRDYLLSST 360

Query: 127  ISDTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            ISDTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV
Sbjct: 361  ISDTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 402


>ref|XP_011006542.1| PREDICTED: UDP-glucose:glycoprotein glucosyltransferase isoform X1
            [Populus euphratica]
          Length = 1642

 Score =  598 bits (1541), Expect = e-168
 Identities = 296/402 (73%), Positives = 340/402 (84%), Gaps = 4/402 (0%)
 Frame = -2

Query: 1195 MDTHFRSGFWVLLAFVAFLSLSGH--LVSGESRRPKNVQVAVRAKWSGTPLVLEAGELLS 1022
            M+T FRSG  VL+     +   G   +  GE+RRPKNVQVAVRAKW GTP++LEAGELLS
Sbjct: 1    METRFRSGSCVLVILFCVVGFCGFGSVSCGENRRPKNVQVAVRAKWEGTPILLEAGELLS 60

Query: 1021 KEWKDLFWEFIEIWLHTENEDADSHTAKGCLKKIVKYGQSLLGEPLASVFEFSLTLRSAS 842
            KE KD++WEFI+ WLH++ ED DS+TAK CLKKI+K+G  LL + LAS+F+FSL LRSAS
Sbjct: 61   KERKDIYWEFIDSWLHSKKEDNDSYTAKDCLKKIMKHGHGLLSDTLASLFDFSLILRSAS 120

Query: 841  PRLVLYRQLAEESLSSFPLTEDMDSNFVNGGISELNEN--MGKNEPLLVGINPRSPRGKC 668
            PRLVLYRQLAEESLSSFPL +D  SN  +GG+++ N+   + +++PLLVG NP  P GKC
Sbjct: 121  PRLVLYRQLAEESLSSFPLLDDSFSNSASGGLAKTNDTNEIKRSDPLLVGRNPEIPGGKC 180

Query: 667  CWLDTGGALFFDATELVSWLNSPNELAGDSFQQPELYEFDHIYFNSSIASPVAILYGALG 488
            CW+DTG ALF+D  +L+ WL+SP+ + GDSFQQPEL++FDH++F S   SPV ILYGALG
Sbjct: 181  CWVDTGAALFYDVADLLLWLHSPSGMEGDSFQQPELFDFDHVHFESLSGSPVTILYGALG 240

Query: 487  TECFREFHVTLVEAAKEGKVKYVVRPVLPSGCESKSGHCGAVGTRDPLNLGGYGVELALK 308
            T+CF+EFH  L+EAAK+GKVKYVVRPVLPSGCESK G C AVG  D LNLGGYGVELA+K
Sbjct: 241  TDCFKEFHSALMEAAKQGKVKYVVRPVLPSGCESKVGRCVAVGASDSLNLGGYGVELAMK 300

Query: 307  NMEYKAMDDSAIKKGVTLEDPHTEDLSQEVRGFIFSKILERKPELASEIMAFRDYLLSST 128
            NMEYKAMDDSAIKKGVTLEDP TEDLSQEVRGFIFSKILERKPEL SEIMAFRDYLLSST
Sbjct: 301  NMEYKAMDDSAIKKGVTLEDPRTEDLSQEVRGFIFSKILERKPELTSEIMAFRDYLLSST 360

Query: 127  ISDTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 2
            ISDTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV
Sbjct: 361  ISDTLDVWELKDLGHQTAQRIVHASDPLQSMQEINQNFPSVV 402

