BLASTX nr result

ID: Mentha26_contig00025744 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00025744
         (1916 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU21559.1| hypothetical protein MIMGU_mgv1a002407mg [Mimulus...   719   0.0  
ref|XP_006343109.1| PREDICTED: uncharacterized protein LOC102601...   658   0.0  
emb|CAN71826.1| hypothetical protein VITISV_013841 [Vitis vinifera]   650   0.0  
ref|XP_004235700.1| PREDICTED: uncharacterized protein LOC101247...   647   0.0  
ref|XP_002284822.1| PREDICTED: uncharacterized protein LOC100246...   645   0.0  
ref|XP_007217014.1| hypothetical protein PRUPE_ppa002059mg [Prun...   635   e-179
emb|CBI36173.3| unnamed protein product [Vitis vinifera]              626   e-176
ref|XP_004302927.1| PREDICTED: uncharacterized protein LOC101300...   625   e-176
ref|XP_006427083.1| hypothetical protein CICLE_v10024994mg [Citr...   625   e-176
ref|XP_006465456.1| PREDICTED: uncharacterized protein LOC102612...   624   e-176
ref|XP_007024055.1| UDP-Glycosyltransferase superfamily protein ...   622   e-175
ref|XP_007024056.1| UDP-Glycosyltransferase superfamily protein ...   618   e-174
ref|XP_002528176.1| glycosyltransferase, putative [Ricinus commu...   615   e-173
ref|XP_002298139.1| glycosyl transferase family 1 family protein...   612   e-172
ref|XP_007150675.1| hypothetical protein PHAVU_005G172300g [Phas...   603   e-169
gb|EXC25804.1| Putative glycosyltransferase ytcC [Morus notabilis]    600   e-169
ref|XP_004149847.1| PREDICTED: uncharacterized protein LOC101207...   597   e-168
ref|XP_004486717.1| PREDICTED: uncharacterized protein LOC101501...   596   e-167
ref|XP_006597141.1| PREDICTED: uncharacterized protein LOC100793...   595   e-167
ref|XP_007024057.1| UDP-Glycosyltransferase superfamily protein ...   593   e-166

>gb|EYU21559.1| hypothetical protein MIMGU_mgv1a002407mg [Mimulus guttatus]
          Length = 678

 Score =  719 bits (1857), Expect = 0.0
 Identities = 402/607 (66%), Positives = 443/607 (72%), Gaps = 15/607 (2%)
 Frame = -2

Query: 1777 TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRWAH 1598
            TPRGSPSFRRL+SGRTPRR+ RSG F  S+C RSNR          WAYAGFYFQS+WAH
Sbjct: 35   TPRGSPSFRRLNSGRTPRRDARSGVFS-SHCLRSNRIVLWLLLITLWAYAGFYFQSKWAH 93

Query: 1597 GDNKEDLFXXXXXXXXXXXXS------MRRDLSAAVGTGALKLKNETSNSSLEN--VDVV 1442
            GDNKEDLF                    RRDL A V + A++LKN+T+  SL    +DVV
Sbjct: 94   GDNKEDLFSGGYGGESGGDKFEPQIKNRRRDLIAKVDSAAVELKNDTNELSLNKSVMDVV 153

Query: 1441 LAKSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXVAQD-VESEVDLPIEDI-PKKNTTYGF 1268
            LAK+ + D                         +A++ VESEVD+  E+I PKKNTTYGF
Sbjct: 154  LAKNTTLDKNKPSKRRSKRSLRRKKPVSSKPKAMAEEEVESEVDMQTEEIIPKKNTTYGF 213

Query: 1267 LVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELAT 1088
            LVGPFGSVEDSILEWS +KRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAM+ELAT
Sbjct: 214  LVGPFGSVEDSILEWSAEKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMLELAT 273

Query: 1087 EFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWI 908
            EFLSCGATISVIVLNK+GGLMSEL+RRKIKVL DK+DLSFKTAMKA++IIAGSAVCSSWI
Sbjct: 274  EFLSCGATISVIVLNKRGGLMSELSRRKIKVLEDKTDLSFKTAMKADIIIAGSAVCSSWI 333

Query: 907  EQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIH 728
            EQYLSRTVLGSSQIMWWIMENRREYFDRSK VLNRVKKLIFLS+SQSKQWL WCEEE I 
Sbjct: 334  EQYLSRTVLGSSQIMWWIMENRREYFDRSKLVLNRVKKLIFLSKSQSKQWLSWCEEEKIQ 393

Query: 727  LKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLV 548
            LK EPALVPLSVNDELAF AGI CSLNTPSF+TE M+EKR  LR  VREEMGL++DDML 
Sbjct: 394  LKSEPALVPLSVNDELAFVAGIPCSLNTPSFSTEKMMEKRGLLRSAVREEMGLSEDDMLA 453

Query: 547  VSLSSINPGKGQLLLMESARLVIEQGQ----KLNNSGSKDSVLLDHDYYS-RALLQNGKR 383
            VSLSSINPGKGQLLL+E+ R +IEQ +     L  S   DS++ D D    R LL  G  
Sbjct: 454  VSLSSINPGKGQLLLLEAGRFLIEQPRTDQTNLRLSSEFDSMVFDGDSSGLRKLLSEG-- 511

Query: 382  DNESSNIDTPTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGSV 203
                                                        N+GKKG NLK+L+GSV
Sbjct: 512  --------------------------------------------NIGKKGGNLKILVGSV 527

Query: 202  GSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGRV 23
            GSKSNKV YVKTLL +LS HSNLSK V+WTP+TTRVASLYAAADVYVMNSQGIGETFGRV
Sbjct: 528  GSKSNKVPYVKTLLNFLSMHSNLSKVVIWTPSTTRVASLYAAADVYVMNSQGIGETFGRV 587

Query: 22   TIEAMAF 2
            TIEAMAF
Sbjct: 588  TIEAMAF 594


>ref|XP_006343109.1| PREDICTED: uncharacterized protein LOC102601346 [Solanum tuberosum]
          Length = 711

 Score =  658 bits (1697), Expect = 0.0
 Identities = 367/606 (60%), Positives = 435/606 (71%), Gaps = 14/606 (2%)
 Frame = -2

Query: 1777 TPRG-SPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRWA 1601
            TPRG SPSFRRL+SGRTPRR+G+S  F  S  FRSNR          WAY GFY QSRWA
Sbjct: 29   TPRGGSPSFRRLNSGRTPRRDGKSSAFG-SQWFRSNRILLWLLLITLWAYGGFYVQSRWA 87

Query: 1600 HGDNKEDLFXXXXXXXXXXXXSM----RRDLSAAVGTGALKLKNETSNSSLENVDVVLAK 1433
            HGDNKE +F                  +R L A   + A+K  +  +  +  ++DVVLAK
Sbjct: 88   HGDNKEGIFGGTGGDVANGTSQPEEKNQRILVANEESLAVKPPSNKTQGNSMDLDVVLAK 147

Query: 1432 SRSG---DSLXXXXXXXXXXXXXXXXXXXXXXXVAQDVESE-VDLPIEDIPKKNTTYGFL 1265
              +    D +                       V  +V+++ +++  E+IPK+NTTYG L
Sbjct: 148  QGNSVVSDKVSSSKKKSKKSTRASRRKTHGKKKVVAEVKTDDIEVQEEEIPKRNTTYGLL 207

Query: 1264 VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 1085
            VGPFGS+ED ILEWSP+KRSGTCDRK  FARLVWSRKFVLI HELSMTGAPLAM+ELATE
Sbjct: 208  VGPFGSIEDKILEWSPEKRSGTCDRKSQFARLVWSRKFVLILHELSMTGAPLAMLELATE 267

Query: 1084 FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 905
             LSCGAT+ V+ L+K+GGLMSEL+RRKIKVL DKSDLSFKTAMKA+LIIAGSAVC+SWIE
Sbjct: 268  LLSCGATVYVVPLSKRGGLMSELSRRKIKVLEDKSDLSFKTAMKADLIIAGSAVCASWIE 327

Query: 904  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 725
            QY +RTVLGSSQI WWIMENRREYFDR+K   NRVKKLIFLSESQSK+WL WCEEE+I L
Sbjct: 328  QYAARTVLGSSQITWWIMENRREYFDRAKLAFNRVKKLIFLSESQSKRWLAWCEEEHIKL 387

Query: 724  KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVV 545
            K +PALVPLS++DELAF AGI CSL+TP F+ E MLEKRQ LR  VR+EMGL D+DMLV+
Sbjct: 388  KTQPALVPLSISDELAFVAGIPCSLSTPLFSPEKMLEKRQLLRDFVRKEMGLTDNDMLVM 447

Query: 544  SLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSVLLDHDYYSRALLQN----GKRDN 377
            SLSSINPGKGQ LL+E+ RL+IE    LN S  K       +Y  R LL N    G+   
Sbjct: 448  SLSSINPGKGQFLLLETTRLLIEGAPPLNGSAVK-----RREYQKRTLLYNWKQFGEWKK 502

Query: 376  ESSNI-DTPTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGSVG 200
            ESS + + P  + ++  ++F  +G   +A    D   RK+ S   GK+G+ LKVLIGSVG
Sbjct: 503  ESSTLSNNPQTETLQVPQLFI-KGVNYTAGIENDRGTRKLFSLTEGKQGEKLKVLIGSVG 561

Query: 199  SKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGRVT 20
            SKSNKV YVK LL +L+ HSNLS +VLWTP+TTRVA+LYAAAD YVMNSQG+GETFGRVT
Sbjct: 562  SKSNKVPYVKALLNFLNQHSNLSNTVLWTPSTTRVAALYAAADAYVMNSQGLGETFGRVT 621

Query: 19   IEAMAF 2
            IEAMAF
Sbjct: 622  IEAMAF 627


>emb|CAN71826.1| hypothetical protein VITISV_013841 [Vitis vinifera]
          Length = 734

 Score =  650 bits (1676), Expect = 0.0
 Identities = 363/619 (58%), Positives = 427/619 (68%), Gaps = 27/619 (4%)
 Frame = -2

Query: 1777 TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRWAH 1598
            TPR SPSFRR  S RTPRRE RS G V S  FR+NR          WAY GFY QS+WAH
Sbjct: 35   TPRNSPSFRRSHSSRTPRREARSSG-VGSQWFRNNRVVFWLILITLWAYLGFYVQSKWAH 93

Query: 1597 GDNKEDLFXXXXXXXXXXXXS-MRRDLSAAVGTGALKLKNETSNSSL---ENVDVVLAKS 1430
            GDN ED+             S + R          L +KN +  + +   + VDVVLAK 
Sbjct: 94   GDNNEDIIGFGGKPNNGISDSELNRKAPLIANDKLLAVKNGSDKNPVGSGKKVDVVLAKK 153

Query: 1429 RSGDSLXXXXXXXXXXXXXXXXXXXXXXXVAQDVESEVDLPIED-----IPKKNTTYGFL 1265
              G+S+                         Q  ++EV++   D     IPK NT+YG L
Sbjct: 154  --GNSVPSRRSASSKKRSKKSERSLRGKTRKQKTKTEVEVTEMDEQEQEIPKLNTSYGLL 211

Query: 1264 VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 1085
            VGPFGS ED ILEWSP+KRSGTCDR+G  ARLVWSRKFVLIFHELSMTGAPL+MMELATE
Sbjct: 212  VGPFGSTEDRILEWSPEKRSGTCDRRGELARLVWSRKFVLIFHELSMTGAPLSMMELATE 271

Query: 1084 FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 905
             LSCGAT+S +VL+KKGGLM EL RR+IKVL D++DLSFKTAMKA+L+IAGSAVC+SWIE
Sbjct: 272  LLSCGATVSAVVLSKKGGLMPELARRRIKVLEDRADLSFKTAMKADLVIAGSAVCASWIE 331

Query: 904  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 725
            QY++    GSSQI+WWIMENRREYFDRSK V+NRVK LIFLSESQSKQWL WC+EENI L
Sbjct: 332  QYIAHFTAGSSQIVWWIMENRREYFDRSKLVINRVKMLIFLSESQSKQWLTWCKEENIRL 391

Query: 724  KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVV 545
              +PA+VPLSVNDELAF AGI+CSLNTPSFTTE M EKR+ LR  +R+EMGL D DML++
Sbjct: 392  ISQPAVVPLSVNDELAFVAGITCSLNTPSFTTEKMQEKRRLLRDSIRKEMGLTDTDMLLL 451

Query: 544  SLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSVLLDHD-------YYSRALLQNGK 386
            SLSSINPGKGQ  L+ES R +IEQ    ++   KD   +  D       +YSRALLQN  
Sbjct: 452  SLSSINPGKGQFFLLESVRSMIEQEPSQDDPELKDLAKIGQDQSNFSGKHYSRALLQNVN 511

Query: 385  RDNESSN-----------IDTPTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGK 239
              + SS+           ++ P  K +    +F +    ++   G   + RK+LSEN G 
Sbjct: 512  HFSVSSSGLRLSNESFIELNGPKSKNLMLPSLFPSISPSDAVSIGSGYKRRKVLSENEGT 571

Query: 238  KGQNLKVLIGSVGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVM 59
            + Q LKVLIGSVGSKSNKV YVK LL +L  HSNLSKSVLWTPATTRVASLY+AADVYV+
Sbjct: 572  QEQALKVLIGSVGSKSNKVPYVKGLLRFLXRHSNLSKSVLWTPATTRVASLYSAADVYVI 631

Query: 58   NSQGIGETFGRVTIEAMAF 2
            NSQG+GETFGRV+IEAMAF
Sbjct: 632  NSQGMGETFGRVSIEAMAF 650


>ref|XP_004235700.1| PREDICTED: uncharacterized protein LOC101247116 [Solanum
            lycopersicum]
          Length = 711

 Score =  647 bits (1668), Expect = 0.0
 Identities = 361/605 (59%), Positives = 423/605 (69%), Gaps = 13/605 (2%)
 Frame = -2

Query: 1777 TPRG-SPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRWA 1601
            TPRG SPSFRRL+SGRTPRR+G+S  F  S  FRSNR          WAY GFY QSRWA
Sbjct: 29   TPRGGSPSFRRLNSGRTPRRDGKSSVFG-SQWFRSNRIVLWLLLITLWAYGGFYVQSRWA 87

Query: 1600 HGDNKEDLF----XXXXXXXXXXXXSMRRDLSAAVGTGALKLKNETSNSSLENVDVVLAK 1433
            HGDNKE +F                  +R L A   + A+K  +  +  +  ++DVVLAK
Sbjct: 88   HGDNKEGIFGGSGGDVANGTSQPEEKNQRILVANEESLAVKPPSNKTQGNSMDLDVVLAK 147

Query: 1432 SR----SGDSLXXXXXXXXXXXXXXXXXXXXXXXVAQDVESEVDLPIEDIPKKNTTYGFL 1265
                  S                           VA+    ++++  E+IPK+NTTYG L
Sbjct: 148  QGNSVVSDKGASPKKKSKKSTRASRRKTRGKKKVVAEVKSDDIEIQEEEIPKRNTTYGLL 207

Query: 1264 VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 1085
            VGPFGS+ED ILEWSP+KR+GTCDRK  FARLVWSRKFVLI HELSMTGAPLAM+ELATE
Sbjct: 208  VGPFGSIEDKILEWSPEKRTGTCDRKSQFARLVWSRKFVLILHELSMTGAPLAMLELATE 267

Query: 1084 FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 905
             LSCGAT+ V+ L+K+GGLMSEL+RRKIKVL DKSDLSFKTAMKA+LIIAGSAVC+SWIE
Sbjct: 268  LLSCGATVYVVPLSKRGGLMSELSRRKIKVLEDKSDLSFKTAMKADLIIAGSAVCASWIE 327

Query: 904  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 725
            QY +RTVLGS+QI WWIMENRREYFDR+K   NRVKKLIFLSESQSK+WL WCEEE+I L
Sbjct: 328  QYAARTVLGSTQITWWIMENRREYFDRAKLAFNRVKKLIFLSESQSKRWLAWCEEEHIKL 387

Query: 724  KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVV 545
            K +PAL+PLS++DELAF AGI CSL+TP F+ E MLEKRQ LR  VR+EMGL D+DMLV+
Sbjct: 388  KTQPALIPLSISDELAFVAGIPCSLSTPLFSPEKMLEKRQLLRDFVRKEMGLTDNDMLVM 447

Query: 544  SLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSVLLDHDYYSRALLQN----GKRDN 377
            SLSSINPGKGQ LL+E+ RL+IE    L  S  K       +Y  R LL N    G+   
Sbjct: 448  SLSSINPGKGQFLLLETTRLLIEGAPPLYGSAVK-----RREYQKRTLLYNWKQFGEWKK 502

Query: 376  ESSNIDTPTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGSVGS 197
            ESS +    +           +G   +A    D   RK+ S   GK+G+ LKVLIGSVGS
Sbjct: 503  ESSTLSNNQETEALQVPQLFIKGVNYTAGIENDRGTRKLFSLPEGKQGEKLKVLIGSVGS 562

Query: 196  KSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGRVTI 17
            KSNKV YVK LL +L+ HSNLS +VLWTP+TTRVA+LYAAAD YVMNSQG+GETFGRVTI
Sbjct: 563  KSNKVPYVKALLNFLNQHSNLSNTVLWTPSTTRVAALYAAADAYVMNSQGLGETFGRVTI 622

Query: 16   EAMAF 2
            EAMAF
Sbjct: 623  EAMAF 627


>ref|XP_002284822.1| PREDICTED: uncharacterized protein LOC100246448 [Vitis vinifera]
          Length = 691

 Score =  645 bits (1665), Expect = 0.0
 Identities = 362/608 (59%), Positives = 420/608 (69%), Gaps = 16/608 (2%)
 Frame = -2

Query: 1777 TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRWAH 1598
            TPR SPSFRR  S RTPRRE RS G V S  FR+NR          WAY GFY QS+WAH
Sbjct: 24   TPRNSPSFRRSHSSRTPRREARSSG-VGSQWFRNNRVVFWLILITLWAYLGFYVQSKWAH 82

Query: 1597 GDNKEDLFXXXXXXXXXXXXS-MRRDLSAAVGTGALKLKNETSNSSL---ENVDVVLAKS 1430
            GDN ED+             S + R          L +KN +  + +   + VDVVLAK 
Sbjct: 83   GDNNEDIIGFGGKPNNGISDSELNRKAPLIANDKLLAVKNGSDKNPVGSGKKVDVVLAKK 142

Query: 1429 RSGDSLXXXXXXXXXXXXXXXXXXXXXXXVAQDVESEVDLPIED-----IPKKNTTYGFL 1265
              G+S+                         Q  ++EV++   D     IPK NT+YG L
Sbjct: 143  --GNSVPSRRSASSKKRSKKSERSLRGKTRKQKTKTEVEVTEMDEQEQEIPKLNTSYGLL 200

Query: 1264 VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 1085
            VGPFGS ED ILEWSP+KRSGTCDR+G  ARLVWSRKFVLIFHELSMTGAPL+MMELATE
Sbjct: 201  VGPFGSTEDRILEWSPEKRSGTCDRRGELARLVWSRKFVLIFHELSMTGAPLSMMELATE 260

Query: 1084 FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 905
             LSCGAT+S +VL+KKGGLM EL RR+IKVL D++DLSFKTAMKA+L+IAGSAVC+SWIE
Sbjct: 261  LLSCGATVSAVVLSKKGGLMPELARRRIKVLEDRADLSFKTAMKADLVIAGSAVCASWIE 320

Query: 904  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 725
            QY++    GSSQI+WWIMENRREYFDRSK V+NRVK LIFLSESQSKQWL WC+EENI L
Sbjct: 321  QYIAHFTAGSSQIVWWIMENRREYFDRSKLVINRVKMLIFLSESQSKQWLTWCKEENIRL 380

Query: 724  KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVV 545
              +PA+VPLSVNDELAF AGI+CSLNTPSFTTE M EKR+ LR  +R+EMGL D DML++
Sbjct: 381  ISQPAVVPLSVNDELAFVAGITCSLNTPSFTTEKMQEKRRLLRDSIRKEMGLTDTDMLLL 440

Query: 544  SLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSVLLDHD-------YYSRALLQNGK 386
            SLSSINPGKGQ  L+ES R +IEQ    ++   KD V +  D       +YSRALLQN  
Sbjct: 441  SLSSINPGKGQFFLLESVRSMIEQEPSQDDPELKDLVKIGQDQSNFSGKHYSRALLQNVN 500

Query: 385  RDNESSNIDTPTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGS 206
              + SS+                     +    G   + RK+LSEN G + Q LKVLIGS
Sbjct: 501  HFSVSSS---------------------DEVSIGSGYKRRKVLSENEGTQEQALKVLIGS 539

Query: 205  VGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGR 26
            VGSKSNKV YVK LL +L+ HSNLSKSVLWTPATTRVASLY+AADVYV+NSQG+GETFGR
Sbjct: 540  VGSKSNKVPYVKGLLRFLTRHSNLSKSVLWTPATTRVASLYSAADVYVINSQGMGETFGR 599

Query: 25   VTIEAMAF 2
            VTIEAMAF
Sbjct: 600  VTIEAMAF 607


>ref|XP_007217014.1| hypothetical protein PRUPE_ppa002059mg [Prunus persica]
            gi|462413164|gb|EMJ18213.1| hypothetical protein
            PRUPE_ppa002059mg [Prunus persica]
          Length = 723

 Score =  635 bits (1639), Expect = e-179
 Identities = 364/620 (58%), Positives = 429/620 (69%), Gaps = 28/620 (4%)
 Frame = -2

Query: 1777 TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRWAH 1598
            +PR SPSFRRL+S RTPRRE RS G V    FRSNR          WAY GFYFQS WAH
Sbjct: 27   SPRNSPSFRRLNSSRTPRREARSSGGV--QWFRSNRLLFWLLLITLWAYLGFYFQSSWAH 84

Query: 1597 GDNKEDLFXXXXXXXXXXXXS---MRRDLSAAVGTGALKLKNETSNSSLE---NVDVVLA 1436
             +NKE+              +    RRDL A+    ++ +KNET+ + ++   ++DVVL 
Sbjct: 85   -NNKENFLGFGNKASNGNSDTEQNARRDLLAS--DSSMAVKNETNQNQVKAGKSIDVVLT 141

Query: 1435 KSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXVAQ---DVES-EVDLPIEDIPKKNTTYGF 1268
            K  +G S                          +   +VE  E +    DIPK NT+YG 
Sbjct: 142  KKENGVSSRRSASSKKRSKKSARSLRGKVHGKQKKTVEVEGHETEEQELDIPKTNTSYGM 201

Query: 1267 LVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELAT 1088
            LVGPFG VED  LEWSP  RSGTCDRKG FARLVWSR+F+LIFHELSMTGAPL+MMELAT
Sbjct: 202  LVGPFGFVEDRTLEWSPKTRSGTCDRKGDFARLVWSRRFLLIFHELSMTGAPLSMMELAT 261

Query: 1087 EFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWI 908
            E LSCGAT+S +VL+KKGGLM EL RR+IKVL DK + SFKTAMKA+L+IAGSAVC+SWI
Sbjct: 262  ELLSCGATVSAVVLSKKGGLMPELARRRIKVLEDKVEQSFKTAMKADLVIAGSAVCASWI 321

Query: 907  EQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIH 728
            +QY+     G+SQI WWIMENRREYFDR+K VLNRVK L FLSESQSKQWLDWCEEE I 
Sbjct: 322  DQYMDHFPAGASQIAWWIMENRREYFDRAKVVLNRVKMLAFLSESQSKQWLDWCEEEKIK 381

Query: 727  LKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLV 548
            L+ +PA+VPLS+NDELAF AGI CSLNTPS +TE MLEKRQ LR  VR+EMGL D+DMLV
Sbjct: 382  LRSQPAVVPLSINDELAFVAGIGCSLNTPSSSTEKMLEKRQLLRDSVRKEMGLTDNDMLV 441

Query: 547  VSLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSV-------LLDHDYYSRALLQNG 389
            +SLSSINPGKGQLLL+ESARLVIE+  K  NS  K+ V        L   ++ RAL Q  
Sbjct: 442  MSLSSINPGKGQLLLLESARLVIEEPLKY-NSKIKNPVRKRQARSTLARKHHLRALFQEL 500

Query: 388  KRDNESSN-----------IDTPTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVG 242
              D  SSN           ++ P KK++R   ++T+        +      RK+LS+N G
Sbjct: 501  NDDGVSSNELPLSNESDVQLNEPQKKKLRLRSLYTSFDDTGDLTF-NVTHKRKVLSDNGG 559

Query: 241  KKGQNLKVLIGSVGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYV 62
               Q++K LIGSVGSKSNKV YVK LL +LS HSN+SKSVLWTPATTRVA+LY+AADVYV
Sbjct: 560  TLEQSVKFLIGSVGSKSNKVLYVKELLGFLSQHSNMSKSVLWTPATTRVAALYSAADVYV 619

Query: 61   MNSQGIGETFGRVTIEAMAF 2
            MNSQG+GETFGRVTIEAMAF
Sbjct: 620  MNSQGLGETFGRVTIEAMAF 639


>emb|CBI36173.3| unnamed protein product [Vitis vinifera]
          Length = 683

 Score =  626 bits (1614), Expect = e-176
 Identities = 356/608 (58%), Positives = 411/608 (67%), Gaps = 16/608 (2%)
 Frame = -2

Query: 1777 TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRWAH 1598
            TPR SPSFRR  S RTPRRE RS G V S  FR+NR          WAY GFY QS+WAH
Sbjct: 35   TPRNSPSFRRSHSSRTPRREARSSG-VGSQWFRNNRVVFWLILITLWAYLGFYVQSKWAH 93

Query: 1597 GDNKEDLFXXXXXXXXXXXXS-MRRDLSAAVGTGALKLKNETSNSSL---ENVDVVLAKS 1430
            GDN ED+             S + R          L +KN +  + +   + VDVVLAK 
Sbjct: 94   GDNNEDIIGFGGKPNNGISDSELNRKAPLIANDKLLAVKNGSDKNPVGSGKKVDVVLAKK 153

Query: 1429 RSGDSLXXXXXXXXXXXXXXXXXXXXXXXVAQDVESEVDLPIED-----IPKKNTTYGFL 1265
              G+S+                         Q  ++EV++   D     IPK NT+YG L
Sbjct: 154  --GNSVPSRRSASSKKRSKKSERSLRGKTRKQKTKTEVEVTEMDEQEQEIPKLNTSYGLL 211

Query: 1264 VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 1085
            VGPFGS ED ILEWSP+KRSGTCDR+G  ARLVWSRKFVLIFHELSMTGAPL+MMELATE
Sbjct: 212  VGPFGSTEDRILEWSPEKRSGTCDRRGELARLVWSRKFVLIFHELSMTGAPLSMMELATE 271

Query: 1084 FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 905
             LSCGAT+S +VL+KKGGLM EL RR+IKVL D++DLSFKTAMKA+L+IAGSAVC+SWIE
Sbjct: 272  LLSCGATVSAVVLSKKGGLMPELARRRIKVLEDRADLSFKTAMKADLVIAGSAVCASWIE 331

Query: 904  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 725
            QY++    GSSQI+WWIMENRREYFDRSK V+NRVK LIFLSESQSKQWL WC+EENI L
Sbjct: 332  QYIAHFTAGSSQIVWWIMENRREYFDRSKLVINRVKMLIFLSESQSKQWLTWCKEENIRL 391

Query: 724  KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVV 545
              +PA+VPLSVNDELAF AGI+CSLNTPSFTTE M EKR+ LR  +R+EMGL D DML++
Sbjct: 392  ISQPAVVPLSVNDELAFVAGITCSLNTPSFTTEKMQEKRRLLRDSIRKEMGLTDTDMLLL 451

Query: 544  SLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSVLLDHD-------YYSRALLQNGK 386
            SLSSINPGKGQ  L+ES R +IEQ    ++   KD V +  D       +YSRALLQN  
Sbjct: 452  SLSSINPGKGQFFLLESVRSMIEQEPSQDDPELKDLVKIGQDQSNFSGKHYSRALLQN-- 509

Query: 385  RDNESSNIDTPTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGS 206
                                       LN  +           S+N+    Q LKVLIGS
Sbjct: 510  ---------------------------LNGPK-----------SKNLMLPKQALKVLIGS 531

Query: 205  VGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGR 26
            VGSKSNKV YVK LL +L+ HSNLSKSVLWTPATTRVASLY+AADVYV+NSQG+GETFGR
Sbjct: 532  VGSKSNKVPYVKGLLRFLTRHSNLSKSVLWTPATTRVASLYSAADVYVINSQGMGETFGR 591

Query: 25   VTIEAMAF 2
            VTIEAMAF
Sbjct: 592  VTIEAMAF 599


>ref|XP_004302927.1| PREDICTED: uncharacterized protein LOC101300160 [Fragaria vesca
            subsp. vesca]
          Length = 720

 Score =  625 bits (1613), Expect = e-176
 Identities = 356/619 (57%), Positives = 422/619 (68%), Gaps = 27/619 (4%)
 Frame = -2

Query: 1777 TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRWAH 1598
            +PR SPSF+RL S RTPRRE RS G V    FRSNR          WAY GFYFQS WAH
Sbjct: 27   SPRSSPSFKRLHSSRTPRREARSSGGV--QWFRSNRLLFWLLLITLWAYLGFYFQSSWAH 84

Query: 1597 GDNKEDLFXXXXXXXXXXXXS---MRRDLSAAVGTGALKLKNETSNSSLE---NVDVVLA 1436
             +NK +              +    RRDL  +     +KLKNET  +  E    +DVVLA
Sbjct: 85   SNNKVNFLGVGNEASNDKSDAEQNQRRDLLDS----PVKLKNETGQNQPEAGKTIDVVLA 140

Query: 1435 KSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXVAQDVE-SEVDLPIEDIPKKNTTYGFLVG 1259
            K   G +                            +E  E++    DIPK N +YG LVG
Sbjct: 141  KKDDGVASRRSLSSKKKSKKAARGKSHGKPKKTVAIEIHEIEEQEPDIPKTNASYGMLVG 200

Query: 1258 PFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATEFL 1079
            PFGS ED ILEW+P  R+GTCDRKG F+RLVWSR+F+LIFHELSMTGAPL+MMELATE L
Sbjct: 201  PFGSTEDRILEWNPKTRTGTCDRKGDFSRLVWSRRFLLIFHELSMTGAPLSMMELATELL 260

Query: 1078 SCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIEQY 899
            SCGAT+S IVL+KKGGLM EL RR+IKVL DK+D SFKTAMK +L+IAGSAVC+SWI+QY
Sbjct: 261  SCGATVSAIVLSKKGGLMPELTRRRIKVLEDKADHSFKTAMKQDLVIAGSAVCASWIDQY 320

Query: 898  LSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHLKD 719
            + +   G+SQI WWIMENRREYFDR+K VL+RVK L FLSESQSKQWLDWCEEE I L+ 
Sbjct: 321  IDKFPAGASQIAWWIMENRREYFDRAKVVLDRVKMLAFLSESQSKQWLDWCEEEKIKLRS 380

Query: 718  EPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVVSL 539
            +PA+VPLS+NDELAF AGI CSLNTPS + E MLEK + LR  VR+EMGL D+DML +SL
Sbjct: 381  QPAIVPLSINDELAFVAGIGCSLNTPSSSIEKMLEKMKLLRDAVRKEMGLTDNDMLAISL 440

Query: 538  SSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSV-------LLDHDYYSRALLQNGKRD 380
            SSINPGKGQLL++ SARLVIE+  + +NS  K+SV        L   ++ RALLQ G  D
Sbjct: 441  SSINPGKGQLLVLNSARLVIEEEPQPDNSKIKNSVRKGRVRSALARKHHIRALLQ-GSND 499

Query: 379  NESSNIDTPTKKRIRSSRIFTNEGRLNSARYGRDARM-------------RKMLSENVGK 239
            + +S    P      SS  F  + + +   + R A +             RK+L++N G 
Sbjct: 500  HSASLNGFPLS--TESSVHFKEDQKKHLHLHNRFASVDDTDAMNFDVTYKRKVLADNGGT 557

Query: 238  KGQNLKVLIGSVGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVM 59
              Q+ K LIGSVGSKSNKVAYVK LL+YLS HSNLSKSVLWTP+TTRVA+LY+AADVYVM
Sbjct: 558  VKQSAKFLIGSVGSKSNKVAYVKELLSYLSQHSNLSKSVLWTPSTTRVAALYSAADVYVM 617

Query: 58   NSQGIGETFGRVTIEAMAF 2
            NSQG+GETFGRVTIEAMAF
Sbjct: 618  NSQGLGETFGRVTIEAMAF 636


>ref|XP_006427083.1| hypothetical protein CICLE_v10024994mg [Citrus clementina]
            gi|557529073|gb|ESR40323.1| hypothetical protein
            CICLE_v10024994mg [Citrus clementina]
          Length = 732

 Score =  625 bits (1612), Expect = e-176
 Identities = 346/621 (55%), Positives = 422/621 (67%), Gaps = 29/621 (4%)
 Frame = -2

Query: 1777 TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRWAH 1598
            TP+ SPSFRRL++ RTPRRE RS        FRSNR          W Y GFY QSRWAH
Sbjct: 35   TPKNSPSFRRLNASRTPRREVRSASL---QWFRSNRLVYWLLLITLWTYLGFYVQSRWAH 91

Query: 1597 GDNKEDLFXXXXXXXXXXXXS---MRRDLSAA-----VGTGALKLKNETSNSSLENVDVV 1442
            G+N +               S    RRDL A      +  G +K    T  +  + +D+V
Sbjct: 92   GENNDKFLGFGGKRRNEIVDSNQNKRRDLIANHSDLDINNGTIK----TLGADSKKIDMV 147

Query: 1441 LAKSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXVAQDVESE-VDLPIEDIPKKNTTYGFL 1265
            L + R+ D+                           DVES  ++  + +IP  N +YG L
Sbjct: 148  LTQRRNNDASRRSVAKRKKSKRSSRGKGRGKQKAKLDVESNYMEAQLPEIPMTNASYGLL 207

Query: 1264 VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 1085
            VGPFG  ED ILEWSP+KRSGTCDRKG FAR VWSRKF+LIFHELSMTGAPL+MMELATE
Sbjct: 208  VGPFGLTEDRILEWSPEKRSGTCDRKGDFARFVWSRKFILIFHELSMTGAPLSMMELATE 267

Query: 1084 FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 905
             LSCGAT+S +VL+K+GGLM EL RRKIKVL D+ + SFKT+MKA+L+IAGSAVC++WI+
Sbjct: 268  LLSCGATVSAVVLSKRGGLMPELARRKIKVLEDRGEPSFKTSMKADLVIAGSAVCATWID 327

Query: 904  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 725
            QY++R   G SQ++WWIMENRREYFDR+K VL+RVK L+FLSESQ+KQWL WCEEE + L
Sbjct: 328  QYITRFPAGGSQVVWWIMENRREYFDRAKLVLDRVKMLVFLSESQTKQWLTWCEEEKLKL 387

Query: 724  KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVV 545
            + +PA+VPLSVNDELAF AG +CSLNTP+ + E M EKR  LR  VR+EMGL D DMLV+
Sbjct: 388  RSQPAVVPLSVNDELAFVAGFTCSLNTPTSSPEKMCEKRNLLRDSVRKEMGLTDQDMLVL 447

Query: 544  SLSSINPGKGQLLLMESARLVIEQG--------QKLNNSGSKDSVLLD-HDYYSRALLQN 392
            SLSSINPGKGQLLL+ESA+L+IEQ         +K  N G K S L   H    R LLQ 
Sbjct: 448  SLSSINPGKGQLLLVESAQLMIEQEPSMDDSKIRKSRNVGRKKSSLTSRHHLRGRGLLQM 507

Query: 391  ----GKRDNESS-------NIDTPTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENV 245
                G   NE S        ++ P +K + S  +FT+ G  ++  +G     RK+LS++ 
Sbjct: 508  SDDVGLSSNELSVSSESFTQLNEPVRKNLLSPSLFTSIGNTDAVSFGSGHLRRKVLSKSD 567

Query: 244  GKKGQNLKVLIGSVGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVY 65
            GK+ Q LK+LIGSVGSKSNKV YVK +L +LS HSNLSK++LWTPATTRVASLY+AADVY
Sbjct: 568  GKQQQALKILIGSVGSKSNKVPYVKEILEFLSQHSNLSKAMLWTPATTRVASLYSAADVY 627

Query: 64   VMNSQGIGETFGRVTIEAMAF 2
            V+NSQG+GETFGRVTIEAMAF
Sbjct: 628  VINSQGLGETFGRVTIEAMAF 648


>ref|XP_006465456.1| PREDICTED: uncharacterized protein LOC102612096 isoform X1 [Citrus
            sinensis] gi|568822059|ref|XP_006465457.1| PREDICTED:
            uncharacterized protein LOC102612096 isoform X2 [Citrus
            sinensis]
          Length = 732

 Score =  624 bits (1608), Expect = e-176
 Identities = 346/621 (55%), Positives = 422/621 (67%), Gaps = 29/621 (4%)
 Frame = -2

Query: 1777 TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRWAH 1598
            TP+ SPSFRRL++ RTPRRE RS        FRSNR          W Y GFY QSRWAH
Sbjct: 35   TPKNSPSFRRLNASRTPRREVRSASL---QWFRSNRLVYWLLLITLWTYLGFYVQSRWAH 91

Query: 1597 GDNKEDLFXXXXXXXXXXXXS---MRRDLSAA-----VGTGALKLKNETSNSSLENVDVV 1442
            G+N +               S    RRDL A      +  G +K    T  +  + +D+V
Sbjct: 92   GENNDKFLGFGGKRRNEIVDSNQNKRRDLIANHSDLDINNGTIK----TLGADSKKMDMV 147

Query: 1441 LAKSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXVAQDVESE-VDLPIEDIPKKNTTYGFL 1265
            L + R+ D+                           DVES  ++  + +IP  N +YG L
Sbjct: 148  LTQRRNNDASRRSVAKRKKSKRSSRGKGRGKQKAKLDVESNYMEAQLPEIPMTNASYGLL 207

Query: 1264 VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 1085
            VGPFG  ED ILEWSP+KRSGTCDRKG FAR VWSRKF+LIFHELSMTGAPL+MMELATE
Sbjct: 208  VGPFGLTEDRILEWSPEKRSGTCDRKGDFARFVWSRKFILIFHELSMTGAPLSMMELATE 267

Query: 1084 FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 905
             LSCGAT+S +VL+K+GGLM EL RRKIKVL D+ + SFKT+MKA+L+IAGSAVC++WI+
Sbjct: 268  LLSCGATVSAVVLSKRGGLMPELARRKIKVLEDRGEPSFKTSMKADLVIAGSAVCATWID 327

Query: 904  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 725
            QY++R   G SQ++WWIMENRREYFDR+K VL+RVK L+FLSESQ+KQWL WCEEE + L
Sbjct: 328  QYITRFPAGGSQVVWWIMENRREYFDRAKLVLDRVKLLVFLSESQTKQWLTWCEEEKLKL 387

Query: 724  KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVV 545
            + +PA+VPLSVNDELAF AG +CSLNTP+ + E M EKR  LR  VR+EMGL D DMLV+
Sbjct: 388  RSQPAVVPLSVNDELAFVAGFTCSLNTPTSSPEKMREKRNLLRDSVRKEMGLTDQDMLVL 447

Query: 544  SLSSINPGKGQLLLMESARLVIEQG--------QKLNNSGSKDSVLLD-HDYYSRALLQN 392
            SLSSINPGKGQLLL+ESA+L+IEQ         +K  N G K S L   H    R LLQ 
Sbjct: 448  SLSSINPGKGQLLLVESAQLMIEQEPSMDDSKIRKSRNVGRKKSSLTSRHHLRGRGLLQM 507

Query: 391  ----GKRDNESS-------NIDTPTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENV 245
                G   NE S        ++ P +K + S  +FT+ G  ++  +G     RK+LS++ 
Sbjct: 508  SDDVGLSSNELSVSSESFTQLNEPVRKNLLSPSLFTSIGNTDAVSFGSGHLRRKVLSKSD 567

Query: 244  GKKGQNLKVLIGSVGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVY 65
            GK+ Q LK+LIGSVGSKSNKV YVK +L +LS HSNLSK++LWTPATTRVASLY+AADVY
Sbjct: 568  GKQQQALKILIGSVGSKSNKVPYVKEILEFLSQHSNLSKAMLWTPATTRVASLYSAADVY 627

Query: 64   VMNSQGIGETFGRVTIEAMAF 2
            V+NSQG+GETFGRVTIEAMAF
Sbjct: 628  VINSQGLGETFGRVTIEAMAF 648


>ref|XP_007024055.1| UDP-Glycosyltransferase superfamily protein isoform 1 [Theobroma
            cacao] gi|508779421|gb|EOY26677.1|
            UDP-Glycosyltransferase superfamily protein isoform 1
            [Theobroma cacao]
          Length = 702

 Score =  622 bits (1605), Expect = e-175
 Identities = 348/608 (57%), Positives = 420/608 (69%), Gaps = 16/608 (2%)
 Frame = -2

Query: 1777 TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRWAH 1598
            TP+ SP+FRRL+S RTPRRE RSG   + + FRSNR          WAY GFY QSRWAH
Sbjct: 26   TPKSSPTFRRLNSSRTPRREARSGAGGIQW-FRSNRLVYWLLLITLWAYLGFYVQSRWAH 84

Query: 1597 GDNKEDLFXXXXXXXXXXXXSM---RRDLSA-----AVGTGALKLKNETSNSSLENVDVV 1442
            G NKE+              +    RRDL A     AV  G     N+T   S    DV+
Sbjct: 85   GHNKEEFLGFSGNPRNGLIDAEQNPRRDLLADDSLVAVNNGT----NKTQVYSDRKFDVI 140

Query: 1441 LAKSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXVAQDVES-EVDLPIEDIPKKNTTYGFL 1265
            LAK R+  S                           ++E+ E +    +I +KN+TYG L
Sbjct: 141  LAKKRNEVSFNKKRSRRSKRAGRNLSKMRGKRKATINIENGETEGQEHEILQKNSTYGLL 200

Query: 1264 VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 1085
            VGPFGSVED ILEWSP+KRSGTCDRKG FARLVWSR+ VL+FHELSMTGAP++MMELATE
Sbjct: 201  VGPFGSVEDRILEWSPEKRSGTCDRKGDFARLVWSRRLVLVFHELSMTGAPISMMELATE 260

Query: 1084 FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 905
             LSCGAT+S +VL+KKGGLMSEL RR+IKV+ D++DLSFKTAMKA+L+IAGSAVC+SWI+
Sbjct: 261  LLSCGATVSAVVLSKKGGLMSELARRRIKVIEDRADLSFKTAMKADLVIAGSAVCASWID 320

Query: 904  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 725
            QY++    G SQI WWIMENRREYFDRSK VL+RVK LIFLSE QSKQWL WC+EENI L
Sbjct: 321  QYIAHFPAGGSQIAWWIMENRREYFDRSKLVLHRVKMLIFLSELQSKQWLTWCQEENIKL 380

Query: 724  KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVV 545
            + +PALVPL+VNDELAF AGI CSLNTPS + E MLEKRQ LR  VR+EMGL D+DMLV+
Sbjct: 381  RSQPALVPLAVNDELAFVAGIPCSLNTPSASPEKMLEKRQLLRDAVRKEMGLTDNDMLVM 440

Query: 544  SLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSVLLDHD-------YYSRALLQNGK 386
            SLSSIN GKGQLLL+E+A L+I+Q     +S    S+ +  D       ++ R LLQ   
Sbjct: 441  SLSSINTGKGQLLLLEAAGLMIDQDPLQTDSEVTKSLDIRQDQSTLTVKHHLRGLLQ--- 497

Query: 385  RDNESSNIDTPTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGS 206
               +SS++D  +       R+F +    N+       R R ML ++ G + Q LK+LIGS
Sbjct: 498  ---KSSDVDVSS----TDLRLFASVNGTNAVSIDSSHRRRNMLFDSKGTQEQALKILIGS 550

Query: 205  VGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGR 26
            VGSKSNK+ YVK +L +LS H+ LS+SVLWTPATT VASLY+AADVYVMNSQG+GETFGR
Sbjct: 551  VGSKSNKMPYVKEILRFLSQHAKLSESVLWTPATTHVASLYSAADVYVMNSQGLGETFGR 610

Query: 25   VTIEAMAF 2
            VT+EAMAF
Sbjct: 611  VTVEAMAF 618


>ref|XP_007024056.1| UDP-Glycosyltransferase superfamily protein isoform 2 [Theobroma
            cacao] gi|508779422|gb|EOY26678.1|
            UDP-Glycosyltransferase superfamily protein isoform 2
            [Theobroma cacao]
          Length = 703

 Score =  618 bits (1593), Expect = e-174
 Identities = 348/609 (57%), Positives = 420/609 (68%), Gaps = 17/609 (2%)
 Frame = -2

Query: 1777 TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRWAH 1598
            TP+ SP+FRRL+S RTPRRE RSG   + + FRSNR          WAY GFY QSRWAH
Sbjct: 26   TPKSSPTFRRLNSSRTPRREARSGAGGIQW-FRSNRLVYWLLLITLWAYLGFYVQSRWAH 84

Query: 1597 GDNKEDLFXXXXXXXXXXXXSM---RRDLSA-----AVGTGALKLKNETSNSSLENVDVV 1442
            G NKE+              +    RRDL A     AV  G     N+T   S    DV+
Sbjct: 85   GHNKEEFLGFSGNPRNGLIDAEQNPRRDLLADDSLVAVNNGT----NKTQVYSDRKFDVI 140

Query: 1441 LAKSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXVAQDVES-EVDLPIEDIPKKNTTYGFL 1265
            LAK R+  S                           ++E+ E +    +I +KN+TYG L
Sbjct: 141  LAKKRNEVSFNKKRSRRSKRAGRNLSKMRGKRKATINIENGETEGQEHEILQKNSTYGLL 200

Query: 1264 VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 1085
            VGPFGSVED ILEWSP+KRSGTCDRKG FARLVWSR+ VL+FHELSMTGAP++MMELATE
Sbjct: 201  VGPFGSVEDRILEWSPEKRSGTCDRKGDFARLVWSRRLVLVFHELSMTGAPISMMELATE 260

Query: 1084 FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 905
             LSCGAT+S +VL+KKGGLMSEL RR+IKV+ D++DLSFKTAMKA+L+IAGSAVC+SWI+
Sbjct: 261  LLSCGATVSAVVLSKKGGLMSELARRRIKVIEDRADLSFKTAMKADLVIAGSAVCASWID 320

Query: 904  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 725
            QY++    G SQI WWIMENRREYFDRSK VL+RVK LIFLSE QSKQWL WC+EENI L
Sbjct: 321  QYIAHFPAGGSQIAWWIMENRREYFDRSKLVLHRVKMLIFLSELQSKQWLTWCQEENIKL 380

Query: 724  KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVV 545
            + +PALVPL+VNDELAF AGI CSLNTPS + E MLEKRQ LR  VR+EMGL D+DMLV+
Sbjct: 381  RSQPALVPLAVNDELAFVAGIPCSLNTPSASPEKMLEKRQLLRDAVRKEMGLTDNDMLVM 440

Query: 544  SLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSVLLDHD-------YYSRALLQNGK 386
            SLSSIN GKGQLLL+E+A L+I+Q     +S    S+ +  D       ++ R LLQ   
Sbjct: 441  SLSSINTGKGQLLLLEAAGLMIDQDPLQTDSEVTKSLDIRQDQSTLTVKHHLRGLLQ--- 497

Query: 385  RDNESSNIDTPTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGS 206
               +SS++D  +       R+F +    N+       R R ML ++ G + Q LK+LIGS
Sbjct: 498  ---KSSDVDVSS----TDLRLFASVNGTNAVSIDSSHRRRNMLFDSKGTQEQALKILIGS 550

Query: 205  VGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNS-QGIGETFG 29
            VGSKSNK+ YVK +L +LS H+ LS+SVLWTPATT VASLY+AADVYVMNS QG+GETFG
Sbjct: 551  VGSKSNKMPYVKEILRFLSQHAKLSESVLWTPATTHVASLYSAADVYVMNSQQGLGETFG 610

Query: 28   RVTIEAMAF 2
            RVT+EAMAF
Sbjct: 611  RVTVEAMAF 619


>ref|XP_002528176.1| glycosyltransferase, putative [Ricinus communis]
            gi|223532388|gb|EEF34183.1| glycosyltransferase, putative
            [Ricinus communis]
          Length = 686

 Score =  615 bits (1587), Expect = e-173
 Identities = 354/612 (57%), Positives = 424/612 (69%), Gaps = 20/612 (3%)
 Frame = -2

Query: 1777 TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRWAH 1598
            T + SP+FRRL S RTPR E RS G  + + FRS R          WAY GFY QSRWAH
Sbjct: 35   TAKNSPTFRRLHSSRTPRGEARSIGGGVQW-FRSTRLVYWLLLITLWAYLGFYVQSRWAH 93

Query: 1597 GDNKEDLF---XXXXXXXXXXXXSMRRDLSAAVGTGALKLKNETSNSSLEN---VDVVLA 1436
            GDNKED                 + RRDL A     ++ + + T N  +E+   + VVLA
Sbjct: 94   GDNKEDFLGFGGQNRNEISVPEQNTRRDLLA--NDSSVAVNDGTDNVQVEDDRRIGVVLA 151

Query: 1435 KSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXVAQD-------VESE-VDLPIEDIPKKNT 1280
            K   G+++                         +D       VESE V++   DIP+KNT
Sbjct: 152  K--KGNTVSSNQKKNSFSKKRSKRAGRRLRSKTRDKQKATVEVESEDVEVQEPDIPQKNT 209

Query: 1279 TYGFLVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMM 1100
            TYGFLVGPFGS ED ILEWSP+KR+GTCDRKG FARLVWSRKFVLIFHELSMTGAPL+MM
Sbjct: 210  TYGFLVGPFGSTEDRILEWSPEKRTGTCDRKGDFARLVWSRKFVLIFHELSMTGAPLSMM 269

Query: 1099 ELATEFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVC 920
            ELATEFLSCGAT+S +VL+KKGGLMSELNRR+IKVL DK+DLSFKTAMKA+L+IAGSAVC
Sbjct: 270  ELATEFLSCGATVSAVVLSKKGGLMSELNRRRIKVLEDKADLSFKTAMKADLVIAGSAVC 329

Query: 919  SSWIEQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEE 740
            +SWI+QY++R   G SQI+WWIMENRREYFDRSK VLNRVK L+FLSESQ++QWL WC+E
Sbjct: 330  ASWIDQYMTRFPAGGSQIVWWIMENRREYFDRSKIVLNRVKMLVFLSESQTEQWLSWCDE 389

Query: 739  ENIHLKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDD 560
            E I L+  PA+VPLS+NDELAF AGI+CSLNTPS + E MLEKR+ L   VR+EMGL DD
Sbjct: 390  EKIKLRAPPAIVPLSINDELAFVAGIACSLNTPSSSPEKMLEKRRLLADSVRKEMGLTDD 449

Query: 559  DMLVVSLSSINPGKGQLLLMESARLVIEQG--QKLNNS---GSKDS-VLLDHDYYSRALL 398
            D+L+VSLSSINPGKGQLL++ESA+L+IE    QKL +S   G + S + + H  + RALL
Sbjct: 450  DVLLVSLSSINPGKGQLLILESAKLLIEPEPLQKLRSSVGIGEEQSRIAVKH--HLRALL 507

Query: 397  QNGKRDNESSNIDTPTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKV 218
            Q  ++    S++    +K +++                                   LKV
Sbjct: 508  Q--EKSKAVSDLKEGQEKYLKA-----------------------------------LKV 530

Query: 217  LIGSVGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGE 38
            LIGSVGSKSNKV YVK +L+YL+ HSNLSKSVLWTPATTRVASLY+AAD YV+NSQG+GE
Sbjct: 531  LIGSVGSKSNKVPYVKEMLSYLTQHSNLSKSVLWTPATTRVASLYSAADAYVINSQGLGE 590

Query: 37   TFGRVTIEAMAF 2
            TFGRVTIEAMAF
Sbjct: 591  TFGRVTIEAMAF 602


>ref|XP_002298139.1| glycosyl transferase family 1 family protein [Populus trichocarpa]
            gi|222845397|gb|EEE82944.1| glycosyl transferase family 1
            family protein [Populus trichocarpa]
          Length = 681

 Score =  612 bits (1579), Expect = e-172
 Identities = 347/606 (57%), Positives = 408/606 (67%), Gaps = 14/606 (2%)
 Frame = -2

Query: 1777 TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRWAH 1598
            TPR SP+ R L S RTPRREGR  G +    FRSNR          W Y GFY QSRWAH
Sbjct: 36   TPRNSPTHRLLHSSRTPRREGRGSGGI--QWFRSNRLIYWLLLITLWTYLGFYVQSRWAH 93

Query: 1597 GDNKEDLFXXXXXXXXXXXXS---MRRDLSAAVGTGALKLKNETSNSSLEN---VDVVLA 1436
            GDNK++              +    RRDL A      + + N T+   + N   +DVVLA
Sbjct: 94   GDNKDEFLGFGGKSSNGLLDAEQHTRRDLLA--NDSLVVVNNGTNKIQVRNAKKIDVVLA 151

Query: 1435 KSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXVAQD----VESE-VDLPIEDIPKKNTTYG 1271
            K  +G S                          Q     VES+ V++   D+PK N +YG
Sbjct: 152  KKGNGVSSNRRATPKKKKSKRGGRRSRAKAHDKQKATVVVESDDVEVAEPDVPKNNASYG 211

Query: 1270 FLVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELA 1091
             LVGPFG +ED ILEWSP+KRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPL+M+ELA
Sbjct: 212  LLVGPFGPIEDRILEWSPEKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLSMLELA 271

Query: 1090 TEFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSW 911
            TEFLSCGAT+S +VL+KKGGLM EL RR+IKVL D++DLSFKTAMKA+L+IAGSAVC+SW
Sbjct: 272  TEFLSCGATVSAVVLSKKGGLMPELARRRIKVLEDRADLSFKTAMKADLVIAGSAVCTSW 331

Query: 910  IEQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENI 731
            I+QY++R   G SQ++WWIMENRREYFDRSK +LNRVK L+FLSESQ KQW  WCEEENI
Sbjct: 332  IDQYIARFPAGGSQVVWWIMENRREYFDRSKIILNRVKMLVFLSESQMKQWQTWCEEENI 391

Query: 730  HLKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDML 551
             L+  PA+V LSVNDELAF AGI+CSLNTP+ ++E MLEKRQ LR+ VR+EMGL D+DML
Sbjct: 392  RLRSPPAVVQLSVNDELAFVAGIACSLNTPTSSSEKMLEKRQLLRESVRKEMGLTDNDML 451

Query: 550  VVSLSSINPGKGQLLLMESARLVIE--QGQKLNNSGSK-DSVLLDHDYYSRALLQNGKRD 380
            V+SLSSIN GKGQLLL+ESA LVIE     K+ NS  K +   L   ++ RAL       
Sbjct: 452  VMSLSSINAGKGQLLLLESANLVIEPDPSPKITNSVDKGNQSTLAAKHHLRAL------- 504

Query: 379  NESSNIDTPTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGSVG 200
                                               R RK+L+++ G   Q LKVLIGSVG
Sbjct: 505  ---------------------------------SHRKRKLLADSEGTHEQALKVLIGSVG 531

Query: 199  SKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGRVT 20
            SKSNKV YVK +L ++S HSNLSKSVLWT ATTRVASLY+AADVY+ NSQG+GETFGRVT
Sbjct: 532  SKSNKVPYVKEILRFISQHSNLSKSVLWTSATTRVASLYSAADVYITNSQGLGETFGRVT 591

Query: 19   IEAMAF 2
            IEAMAF
Sbjct: 592  IEAMAF 597


>ref|XP_007150675.1| hypothetical protein PHAVU_005G172300g [Phaseolus vulgaris]
            gi|593700475|ref|XP_007150676.1| hypothetical protein
            PHAVU_005G172300g [Phaseolus vulgaris]
            gi|561023939|gb|ESW22669.1| hypothetical protein
            PHAVU_005G172300g [Phaseolus vulgaris]
            gi|561023940|gb|ESW22670.1| hypothetical protein
            PHAVU_005G172300g [Phaseolus vulgaris]
          Length = 701

 Score =  603 bits (1554), Expect = e-169
 Identities = 343/610 (56%), Positives = 410/610 (67%), Gaps = 18/610 (2%)
 Frame = -2

Query: 1777 TPRGSPSFRRLSSGRTPRREGRSG-GFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRWA 1601
            TPR SPSFRR +SGRTPR+EGRSG G  L   FRSNR          WAY GF+ QSRWA
Sbjct: 35   TPRNSPSFRRQNSGRTPRKEGRSGIGGAL--WFRSNRLLFWLLLITLWAYLGFFVQSRWA 92

Query: 1600 HGDNKEDLF---XXXXXXXXXXXXSMRRDLSAAVGTGALKLKNETSNS---SLENVDVVL 1439
            H D KE+                   RRDL A+    +L   NET  +   S + ++VVL
Sbjct: 93   HSDKKEEFSGFGTGPRNTGSDAEQVQRRDLLAS--DHSLSANNETDANIALSSKTINVVL 150

Query: 1438 AKSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXVAQDVESEVDLPIE----DIPKKNTTYG 1271
            AK R  D                           +      D  IE    +IP  N TYG
Sbjct: 151  AK-RGNDVPSHRKTSSKKRSRRRRASKGKSSGKLKPSTDVKDADIEEQKPEIPTANGTYG 209

Query: 1270 FLVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELA 1091
             LVGPFG VED ILEWSP+KRSGTC+RKG FARLVWSR+F+L+FHELSMTGAPL+MMELA
Sbjct: 210  LLVGPFGPVEDRILEWSPEKRSGTCNRKGDFARLVWSRRFILVFHELSMTGAPLSMMELA 269

Query: 1090 TEFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSW 911
            TE LSCGAT+S +VL+KKGGLMSEL RR+IKVL DK+DLSFKTAMKA+L+IAGSAVC+SW
Sbjct: 270  TELLSCGATVSAVVLSKKGGLMSELARRRIKVLEDKADLSFKTAMKADLVIAGSAVCASW 329

Query: 910  IEQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENI 731
            I+QY+ R   G+SQ++WWIMENRREYFD SK  L+RVK L+FLSESQSKQWL WCEEE+I
Sbjct: 330  IDQYIERFPAGASQVVWWIMENRREYFDLSKDALDRVKMLVFLSESQSKQWLKWCEEESI 389

Query: 730  HLKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDML 551
             L+  P ++PLSVNDELAF AGI  +LNTPSF+T+ M+EKRQ LR+ VR+E+GLND DML
Sbjct: 390  KLRSYPEIIPLSVNDELAFVAGIPSTLNTPSFSTDKMVEKRQLLRESVRKEIGLNDSDML 449

Query: 550  VVSLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSVLLDHDYYSRALLQNGKRDNES 371
            V+SLSSINPGKGQLLL+ES   V+EQG                       LQ+ K+  + 
Sbjct: 450  VISLSSINPGKGQLLLLESVSSVLEQG----------------------WLQDDKKMKKV 487

Query: 370  SNI-----DTPTKKRIRSSRIFTNEGRL--NSARYGRDARMRKMLSENVGKKGQNLKVLI 212
            SNI         K RIR        G++  N       +R +++L ++ G   ++LK+LI
Sbjct: 488  SNIKEGISTLARKHRIRKLLPVLKNGKVVSNDISSNSLSRRKQVLPDDKGTIQKSLKLLI 547

Query: 211  GSVGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETF 32
            GSVGSKSNK  YVK+LL +L  H N SKS+ WTPATTRVASLY+AADVYV+NSQG+GETF
Sbjct: 548  GSVGSKSNKADYVKSLLNFLEQHPNTSKSIFWTPATTRVASLYSAADVYVINSQGLGETF 607

Query: 31   GRVTIEAMAF 2
            GRVTIEAMAF
Sbjct: 608  GRVTIEAMAF 617


>gb|EXC25804.1| Putative glycosyltransferase ytcC [Morus notabilis]
          Length = 688

 Score =  600 bits (1548), Expect = e-169
 Identities = 338/604 (55%), Positives = 410/604 (67%), Gaps = 12/604 (1%)
 Frame = -2

Query: 1777 TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRWAH 1598
            TPR SPSFRR  S RTPRREGR     L + FRSNR          WAY GF+ QSRWAH
Sbjct: 28   TPRNSPSFRRSQSSRTPRREGRGSARGLQW-FRSNRLLFWLLLITLWAYLGFFVQSRWAH 86

Query: 1597 GDNKEDLFXXXXXXXXXXXXS---MRRDLSAAVGTGALKLKNETSNSSLEN---VDVVLA 1436
             ++ +++             +   +RRDL A     +L +KN T  + + +   +DVVLA
Sbjct: 87   DNDNDNVMGFGKKPKNWNSETEQNLRRDLIAT--DISLAVKNGTGKNQVSDGKRMDVVLA 144

Query: 1435 KSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXVAQDVESEV-DLPIE----DIPKKNTTYG 1271
                G S                          Q +  EV ++ IE    DIPK N +YG
Sbjct: 145  GRNDGISSHRKLNSKKKKTKRANRSLRSKVHGKQKMTMEVKNVEIEEQEPDIPKTNASYG 204

Query: 1270 FLVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELA 1091
             LVGPFGS+ED ILEWSP+KRSGTCDRKG FAR+VWSR+FVLIFHELSMTG+PL+MMELA
Sbjct: 205  MLVGPFGSLEDRILEWSPEKRSGTCDRKGDFARIVWSRRFVLIFHELSMTGSPLSMMELA 264

Query: 1090 TEFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSW 911
            TE LSCGAT+S + L+KKGGLMSEL RR+IKVL DK+DLSFKTAMKA+L+IAGSAVC+SW
Sbjct: 265  TELLSCGATVSAVALSKKGGLMSELARRRIKVLEDKADLSFKTAMKADLVIAGSAVCASW 324

Query: 910  IEQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENI 731
            I+Q++     G+SQ+ WWIMENRREYFDR+K VLNRVK L+F+SE Q KQWL W EEE I
Sbjct: 325  IDQFIEHFPAGASQVAWWIMENRREYFDRAKVVLNRVKMLVFISELQWKQWLAWAEEEKI 384

Query: 730  HLKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDML 551
            +L+ +P LVPLS+NDE+AF AGI+C+LNTPSFTTE M+EKRQ LR   R+EMGL D+DML
Sbjct: 385  YLRSQPVLVPLSINDEMAFVAGIACTLNTPSFTTEKMIEKRQLLRDSARKEMGLKDNDML 444

Query: 550  VVSLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSVLLDHDYYSRALLQNGKRDNES 371
            V+SLSSINPGKGQ LL+ S RL+IE+      S  K+ V + H                 
Sbjct: 445  VMSLSSINPGKGQHLLLGSGRLMIEKEAFEEKSNIKNPVDIKHH---------------- 488

Query: 370  SNIDTPTKKRIRSSRIFTNEGRLN-SARYGRDARMRKMLSENVGKKGQNLKVLIGSVGSK 194
                    K  R  R+ T   +LN S  +G     RK + ++ G + +++K+LIGSVGSK
Sbjct: 489  ------QSKSTRKHRLKTVFQKLNGSMAFG--GTHRKEMLDSGGMRERSVKILIGSVGSK 540

Query: 193  SNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGRVTIE 14
            SNKV YVK LL YLS H N SKSVLWTPA+TRVA+LYAAADVYV+NSQG+GETFGRVTIE
Sbjct: 541  SNKVVYVKELLNYLSQHPNTSKSVLWTPASTRVAALYAAADVYVINSQGLGETFGRVTIE 600

Query: 13   AMAF 2
            AMAF
Sbjct: 601  AMAF 604


>ref|XP_004149847.1| PREDICTED: uncharacterized protein LOC101207532 [Cucumis sativus]
            gi|449496350|ref|XP_004160111.1| PREDICTED:
            uncharacterized protein LOC101223486 [Cucumis sativus]
          Length = 682

 Score =  597 bits (1538), Expect = e-168
 Identities = 328/596 (55%), Positives = 406/596 (68%), Gaps = 4/596 (0%)
 Frame = -2

Query: 1777 TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRWAH 1598
            TPRGSPSFRRL S RTPRRE RS GF L +  R+N+          WAY GFY QSRWAH
Sbjct: 34   TPRGSPSFRRLHSSRTPRREARSTGFSLHW-IRNNKVLFWLLLITLWAYLGFYVQSRWAH 92

Query: 1597 GDNKED-LFXXXXXXXXXXXXSMRRDLSAAVGTGALKLKNETSNSSLEN---VDVVLAKS 1430
            G+NK++ L                + LS       L ++N +  +   +   V+VVLAK 
Sbjct: 93   GENKDEFLGFGGQQSNQKLDSEQNQSLSLISTNNRLVVENRSGENDRSDGGVVNVVLAKK 152

Query: 1429 RSGDSLXXXXXXXXXXXXXXXXXXXXXXXVAQDVESEVDLPIEDIPKKNTTYGFLVGPFG 1250
             +G S                         A+    +++    +IP KN++YG LVGPFG
Sbjct: 153  ANGVSASKKTKPRKRSKRSKRDKVHKGKIPAEVTNHDIEEQEPEIPLKNSSYGMLVGPFG 212

Query: 1249 SVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATEFLSCG 1070
            S ED ILEWSP+KRSGTCDRKG FARLVWSR+FVLIFHELSMTGAP++MMELATE LSCG
Sbjct: 213  STEDRILEWSPEKRSGTCDRKGDFARLVWSRRFVLIFHELSMTGAPISMMELATELLSCG 272

Query: 1069 ATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIEQYLSR 890
            A++S + L+KKGGLMSEL+RR+IKVL DK+DLSFKTAMKA+L+IAGSAVC+SWI+ Y+  
Sbjct: 273  ASVSAVALSKKGGLMSELSRRRIKVLDDKADLSFKTAMKADLVIAGSAVCASWIDGYIEH 332

Query: 889  TVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHLKDEPA 710
               G+SQ+ WWIMENRREYF+RSK VL+RVK LIF+SE QSKQWL+W +EENI L+ +PA
Sbjct: 333  FPAGASQVAWWIMENRREYFNRSKVVLDRVKMLIFISELQSKQWLNWSQEENIKLRSQPA 392

Query: 709  LVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVVSLSSI 530
            +VPLSVNDELAF AGISCSLNT S + E MLEK+Q LR   R+EMG+ D+D++V++LSSI
Sbjct: 393  IVPLSVNDELAFVAGISCSLNTESSSPEKMLEKKQLLRNTTRKEMGVGDNDVVVMTLSSI 452

Query: 529  NPGKGQLLLMESARLVIEQGQKLNNSGSKDSVLLDHDYYSRALLQNGKRDNESSNIDTPT 350
            NPGKG  LL+ES+ L+I++G K ++                       R+ + S+   P 
Sbjct: 453  NPGKGHFLLLESSNLLIDRGLKRDDPKI--------------------RNPDDSSPSRPK 492

Query: 349  KKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGSVGSKSNKVAYVK 170
              R R  R      +LN          R++L++       + K+LIGSVGSKSNKV YVK
Sbjct: 493  LARRRYMRALLQ--KLND--------RRRLLADGGELPETSFKLLIGSVGSKSNKVVYVK 542

Query: 169  TLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGRVTIEAMAF 2
             LL +LS HSNLS+SVLWTPATTRVASLY+AAD+YV+NSQGIGETFGRVTIEAMAF
Sbjct: 543  RLLRFLSQHSNLSQSVLWTPATTRVASLYSAADIYVINSQGIGETFGRVTIEAMAF 598


>ref|XP_004486717.1| PREDICTED: uncharacterized protein LOC101501726 [Cicer arietinum]
          Length = 709

 Score =  596 bits (1537), Expect = e-167
 Identities = 337/613 (54%), Positives = 411/613 (67%), Gaps = 21/613 (3%)
 Frame = -2

Query: 1777 TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRWAH 1598
            TPR SP+FRRL++ RTPR++GRS G   S  FRSNR          WAY GF+ QSRWAH
Sbjct: 38   TPRNSPTFRRLNTSRTPRKDGRSVG--SSLWFRSNRVLLWLLLITLWAYLGFFVQSRWAH 95

Query: 1597 GDNKEDL----FXXXXXXXXXXXXSMRRDLSAAVGTGALKLKNET---SNSSLENVDVVL 1439
             D KE+                  S+RRDL A+    +L + NET          ++V L
Sbjct: 96   SDKKEEFSGFGTGPRNTGSNDDSTSLRRDLIAS--EDSLSVNNETVINKGGVGRTINVAL 153

Query: 1438 AKSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXVAQDVESEVDLPIED-------IPKKNT 1280
            A   + D                              + +V++   D       IP+ N+
Sbjct: 154  AMKGNDDDDDDVPSRRKASSKKKKSKRSSRGKARGKNKPKVEIKNNDIEEQEPEIPETNS 213

Query: 1279 TYGFLVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMM 1100
            TYG LVGPFGS ED ILEWSP KRSGTC+RKG FARLVWSR+F+LIFHELSMTGAPL+MM
Sbjct: 214  TYGLLVGPFGSTEDRILEWSPQKRSGTCNRKGDFARLVWSRRFILIFHELSMTGAPLSMM 273

Query: 1099 ELATEFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVC 920
            ELATE LSCGAT+S + L++KGGLMSEL RR+IK+L DK+DLSFKTAMKA+L+IAGSAVC
Sbjct: 274  ELATELLSCGATVSAVALSRKGGLMSELARRRIKLLEDKADLSFKTAMKADLVIAGSAVC 333

Query: 919  SSWIEQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEE 740
            +SWIEQY+     G+SQ+ WWIMENRREYF+R+K VL+RVK L+FLSESQSKQW  WCEE
Sbjct: 334  ASWIEQYIEHFPAGASQVAWWIMENRREYFNRTKGVLDRVKMLVFLSESQSKQWQKWCEE 393

Query: 739  ENIHLKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDD 560
            ENI L+  P ++PLSVNDELAF AGI  +LNTPSF T+ M+EK+Q LR+ VR+EMGL D 
Sbjct: 394  ENIKLRSRPEIIPLSVNDELAFVAGIPSTLNTPSFDTDKMIEKKQLLRESVRKEMGLTDH 453

Query: 559  DMLVVSLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSVLLDHDYYSRALLQNGKRD 380
            DMLV+SLSSINPGKGQLLL+ESA  V+E GQ                      LQ+ K+ 
Sbjct: 454  DMLVISLSSINPGKGQLLLLESAISVVEHGQ----------------------LQDDKKM 491

Query: 379  NESSNI----DTPTKK-RIRSSRIFTNEGR--LNSARYGRDARMRKMLSENVGKKGQNLK 221
             +SSNI     T T+K RIR       +G+  L        +R +++L  N     Q+LK
Sbjct: 492  KKSSNIKEGLSTLTRKQRIRKLLPMLKDGKVALKDISINSLSRRKQVLPNNKTTTQQSLK 551

Query: 220  VLIGSVGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIG 41
            VLIGSVGSKSNK  YVK+LL++L+ H N SK+VLWTP+TT+VASLY+AADVYV+NSQG+G
Sbjct: 552  VLIGSVGSKSNKADYVKSLLSFLAQHPNTSKTVLWTPSTTQVASLYSAADVYVINSQGLG 611

Query: 40   ETFGRVTIEAMAF 2
            ETFGRVTIEAMAF
Sbjct: 612  ETFGRVTIEAMAF 624


>ref|XP_006597141.1| PREDICTED: uncharacterized protein LOC100793827 isoform X1 [Glycine
            max] gi|571514725|ref|XP_006597142.1| PREDICTED:
            uncharacterized protein LOC100793827 isoform X2 [Glycine
            max]
          Length = 701

 Score =  595 bits (1535), Expect = e-167
 Identities = 337/609 (55%), Positives = 407/609 (66%), Gaps = 17/609 (2%)
 Frame = -2

Query: 1777 TPRGSPSFRRLSSGRTPRREGRS--GGFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRW 1604
            TPR SPSFRRL+SGRTPR+EGRS  GG   +  FRSNR          WAY GF+ QSRW
Sbjct: 35   TPRNSPSFRRLNSGRTPRKEGRSSVGG---ALWFRSNRLLLWLLLITLWAYLGFFVQSRW 91

Query: 1603 AHGDNKEDLFXXXXXXXXXXXXS---MRRDLSAAVGTGALKLKNETSNSSL---ENVDVV 1442
            AH D KE+              +    RRDL A+    +L   N+T        + ++V 
Sbjct: 92   AHSDKKEEFSGYGTGPRNTNSDAEQIQRRDLLAS--NKSLSANNDTDADIAGISKTINVA 149

Query: 1441 LAKS-------RSGDSLXXXXXXXXXXXXXXXXXXXXXXXVAQDVESEVDLPIEDIPKKN 1283
            LAK+       R   S                           D+E +      +IP  N
Sbjct: 150  LAKNDNDVPSHRKTSSKNRSKGRRSSKGKSRGKLKPTTEIKNTDIEEQEP----EIPTTN 205

Query: 1282 TTYGFLVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAM 1103
            +TYG LVGPFG +ED ILEWSP+KRSGTC+RK  FARLVWSR+F+LIFHELSMTGAPL+M
Sbjct: 206  STYGLLVGPFGPMEDRILEWSPEKRSGTCNRKEDFARLVWSRRFILIFHELSMTGAPLSM 265

Query: 1102 MELATEFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAV 923
            MELATE LSCGAT+S +VL++KGGLMSEL RR+IKVL DK+DLSFKTAMKA+L+IAGSAV
Sbjct: 266  MELATELLSCGATVSAVVLSRKGGLMSELARRRIKVLEDKADLSFKTAMKADLVIAGSAV 325

Query: 922  CSSWIEQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCE 743
            C+SWIEQY+     G+SQ+ WWIMENRREYFDRSK VL+RVK L+FLSESQSKQW  WCE
Sbjct: 326  CASWIEQYIEHFPAGASQVAWWIMENRREYFDRSKDVLHRVKMLVFLSESQSKQWQKWCE 385

Query: 742  EENIHLKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLND 563
            EE+I L+  P +VPLSVNDELAF AGI  +LNTPSF+TE M+EK+Q LR+ VR+EMGL D
Sbjct: 386  EESIKLRSHPEIVPLSVNDELAFVAGIPSTLNTPSFSTEKMVEKKQLLRESVRKEMGLTD 445

Query: 562  DDMLVVSLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSVLLDHDYYSRALLQNGKR 383
            +DMLV+SLSSINPGKGQLLL+ES   V+EQGQ   +   K+   +     S A       
Sbjct: 446  NDMLVISLSSINPGKGQLLLLESVSSVLEQGQSPGDKKMKEVSNIKEGLSSLA------- 498

Query: 382  DNESSNIDTPTKKRIRSSRIFTNEGRL--NSARYGRDARMRKMLSENVGKKGQNLKVLIG 209
                       K RIR      + G++  NS      +R +++L  + G   Q+LK+LIG
Sbjct: 499  ----------RKHRIRKLLPLMSNGKVASNSISSNSLSRRKQVLPNDKGTIQQSLKLLIG 548

Query: 208  SVGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFG 29
            SV SKSNK  YVK+LL++L  H N S S+ WTPATTRVASLY+AADVYV+NSQG+GETFG
Sbjct: 549  SVRSKSNKADYVKSLLSFLEQHPNTSTSIFWTPATTRVASLYSAADVYVINSQGLGETFG 608

Query: 28   RVTIEAMAF 2
            RVTIEAMAF
Sbjct: 609  RVTIEAMAF 617


>ref|XP_007024057.1| UDP-Glycosyltransferase superfamily protein isoform 3 [Theobroma
            cacao] gi|508779423|gb|EOY26679.1|
            UDP-Glycosyltransferase superfamily protein isoform 3
            [Theobroma cacao]
          Length = 608

 Score =  593 bits (1528), Expect = e-166
 Identities = 334/592 (56%), Positives = 404/592 (68%), Gaps = 16/592 (2%)
 Frame = -2

Query: 1777 TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXWAYAGFYFQSRWAH 1598
            TP+ SP+FRRL+S RTPRRE RSG   + + FRSNR          WAY GFY QSRWAH
Sbjct: 26   TPKSSPTFRRLNSSRTPRREARSGAGGIQW-FRSNRLVYWLLLITLWAYLGFYVQSRWAH 84

Query: 1597 GDNKEDLFXXXXXXXXXXXXSM---RRDLSA-----AVGTGALKLKNETSNSSLENVDVV 1442
            G NKE+              +    RRDL A     AV  G     N+T   S    DV+
Sbjct: 85   GHNKEEFLGFSGNPRNGLIDAEQNPRRDLLADDSLVAVNNGT----NKTQVYSDRKFDVI 140

Query: 1441 LAKSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXVAQDVES-EVDLPIEDIPKKNTTYGFL 1265
            LAK R+  S                           ++E+ E +    +I +KN+TYG L
Sbjct: 141  LAKKRNEVSFNKKRSRRSKRAGRNLSKMRGKRKATINIENGETEGQEHEILQKNSTYGLL 200

Query: 1264 VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 1085
            VGPFGSVED ILEWSP+KRSGTCDRKG FARLVWSR+ VL+FHELSMTGAP++MMELATE
Sbjct: 201  VGPFGSVEDRILEWSPEKRSGTCDRKGDFARLVWSRRLVLVFHELSMTGAPISMMELATE 260

Query: 1084 FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 905
             LSCGAT+S +VL+KKGGLMSEL RR+IKV+ D++DLSFKTAMKA+L+IAGSAVC+SWI+
Sbjct: 261  LLSCGATVSAVVLSKKGGLMSELARRRIKVIEDRADLSFKTAMKADLVIAGSAVCASWID 320

Query: 904  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 725
            QY++    G SQI WWIMENRREYFDRSK VL+RVK LIFLSE QSKQWL WC+EENI L
Sbjct: 321  QYIAHFPAGGSQIAWWIMENRREYFDRSKLVLHRVKMLIFLSELQSKQWLTWCQEENIKL 380

Query: 724  KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVV 545
            + +PALVPL+VNDELAF AGI CSLNTPS + E MLEKRQ LR  VR+EMGL D+DMLV+
Sbjct: 381  RSQPALVPLAVNDELAFVAGIPCSLNTPSASPEKMLEKRQLLRDAVRKEMGLTDNDMLVM 440

Query: 544  SLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSVLLDHD-------YYSRALLQNGK 386
            SLSSIN GKGQLLL+E+A L+I+Q     +S    S+ +  D       ++ R LLQ   
Sbjct: 441  SLSSINTGKGQLLLLEAAGLMIDQDPLQTDSEVTKSLDIRQDQSTLTVKHHLRGLLQ--- 497

Query: 385  RDNESSNIDTPTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGS 206
               +SS++D  +       R+F +    N+       R R ML ++ G + Q LK+LIGS
Sbjct: 498  ---KSSDVDVSS----TDLRLFASVNGTNAVSIDSSHRRRNMLFDSKGTQEQALKILIGS 550

Query: 205  VGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQ 50
            VGSKSNK+ YVK +L +LS H+ LS+SVLWTPATT VASLY+AADVYVMNSQ
Sbjct: 551  VGSKSNKMPYVKEILRFLSQHAKLSESVLWTPATTHVASLYSAADVYVMNSQ 602

