BLASTX nr result

ID: Mentha25_contig00025799 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00025799
         (1767 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU21559.1| hypothetical protein MIMGU_mgv1a002407mg [Mimulus...   640   0.0  
ref|XP_006343109.1| PREDICTED: uncharacterized protein LOC102601...   578   e-162
emb|CAN71826.1| hypothetical protein VITISV_013841 [Vitis vinifera]   573   e-160
ref|XP_004235700.1| PREDICTED: uncharacterized protein LOC101247...   567   e-159
ref|XP_002284822.1| PREDICTED: uncharacterized protein LOC100246...   567   e-159
ref|XP_007217014.1| hypothetical protein PRUPE_ppa002059mg [Prun...   554   e-155
ref|XP_004302927.1| PREDICTED: uncharacterized protein LOC101300...   551   e-154
ref|XP_006427083.1| hypothetical protein CICLE_v10024994mg [Citr...   548   e-153
ref|XP_006465456.1| PREDICTED: uncharacterized protein LOC102612...   546   e-153
emb|CBI36173.3| unnamed protein product [Vitis vinifera]              546   e-153
ref|XP_007024057.1| UDP-Glycosyltransferase superfamily protein ...   544   e-152
ref|XP_007024056.1| UDP-Glycosyltransferase superfamily protein ...   544   e-152
ref|XP_007024055.1| UDP-Glycosyltransferase superfamily protein ...   544   e-152
ref|XP_002528176.1| glycosyltransferase, putative [Ricinus commu...   541   e-151
ref|XP_002298139.1| glycosyl transferase family 1 family protein...   539   e-150
ref|XP_007150675.1| hypothetical protein PHAVU_005G172300g [Phas...   524   e-146
gb|EXC25804.1| Putative glycosyltransferase ytcC [Morus notabilis]    524   e-146
ref|XP_004486717.1| PREDICTED: uncharacterized protein LOC101501...   520   e-145
ref|XP_006597141.1| PREDICTED: uncharacterized protein LOC100793...   519   e-144
ref|XP_004149847.1| PREDICTED: uncharacterized protein LOC101207...   518   e-144

>gb|EYU21559.1| hypothetical protein MIMGU_mgv1a002407mg [Mimulus guttatus]
          Length = 678

 Score =  640 bits (1652), Expect = 0.0
 Identities = 361/565 (63%), Positives = 399/565 (70%), Gaps = 15/565 (2%)
 Frame = +2

Query: 113  TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRWAH 292
            TPRGSPSFRRL+SGRTPRR+ RSG F  S+C RSNR           AYAGFYFQS+WAH
Sbjct: 35   TPRGSPSFRRLNSGRTPRRDARSGVFS-SHCLRSNRIVLWLLLITLWAYAGFYFQSKWAH 93

Query: 293  GDNKEDLFXXXXXXXXXXXXX------MRRDLSVAVGTGALKLKTETSNSSLEN--VDVV 448
            GDNKEDLF                    RRDL   V + A++LK +T+  SL    +DVV
Sbjct: 94   GDNKEDLFSGGYGGESGGDKFEPQIKNRRRDLIAKVDSAAVELKNDTNELSLNKSVMDVV 153

Query: 449  LAKSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXXAQD-VESEVDLPIEDI-PKKNTTYGF 622
            LAK+ + D                          A++ VESEVD+  E+I PKKNTTYGF
Sbjct: 154  LAKNTTLDKNKPSKRRSKRSLRRKKPVSSKPKAMAEEEVESEVDMQTEEIIPKKNTTYGF 213

Query: 623  LVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELAT 802
            LVGPFGSVEDSILEWS +KRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAM+ELAT
Sbjct: 214  LVGPFGSVEDSILEWSAEKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMLELAT 273

Query: 803  EFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWI 982
            EFLSCGATISVIVLNK+GGLMSEL+RRKIKVL DK+DLSFKTAMKA++IIAGSAVCSSWI
Sbjct: 274  EFLSCGATISVIVLNKRGGLMSELSRRKIKVLEDKTDLSFKTAMKADIIIAGSAVCSSWI 333

Query: 983  EQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIH 1162
            EQYLSRTVLGSSQIMWWIMENRREYFDRSK VLNRVKKLIFLS+SQSKQWL WCEEE I 
Sbjct: 334  EQYLSRTVLGSSQIMWWIMENRREYFDRSKLVLNRVKKLIFLSKSQSKQWLSWCEEEKIQ 393

Query: 1163 LKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLA 1342
            LK EPALVPLSVNDELAF AGI CSLNTPSF+TE M+EKR  LR  VREEMGL++DDMLA
Sbjct: 394  LKSEPALVPLSVNDELAFVAGIPCSLNTPSFSTEKMMEKRGLLRSAVREEMGLSEDDMLA 453

Query: 1343 VSLSSINPGKGQLLLMESARLVIEQGQ----KLNNSGSKDSILLDHDYYS-RALLQNGKR 1507
            VSLSSINPGKGQLLL+E+ R +IEQ +     L  S   DS++ D D    R LL  G  
Sbjct: 454  VSLSSINPGKGQLLLLEAGRFLIEQPRTDQTNLRLSSEFDSMVFDGDSSGLRKLLSEG-- 511

Query: 1508 DNESSNIDTPTKKRIRSSRIFTNEGRLNSAMYGREARMRKMLSENVGKKGQNLKVLIGSV 1687
                                                        N+GKKG NLK+L+GSV
Sbjct: 512  --------------------------------------------NIGKKGGNLKILVGSV 527

Query: 1688 GSKSNKVAYVKTLLTYLSTHSNLSK 1762
            GSKSNKV YVKTLL +LS HSNLSK
Sbjct: 528  GSKSNKVPYVKTLLNFLSMHSNLSK 552


>ref|XP_006343109.1| PREDICTED: uncharacterized protein LOC102601346 [Solanum tuberosum]
          Length = 711

 Score =  578 bits (1490), Expect = e-162
 Identities = 325/565 (57%), Positives = 391/565 (69%), Gaps = 14/565 (2%)
 Frame = +2

Query: 113  TPRG-SPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRWA 289
            TPRG SPSFRRL+SGRTPRR+G+S  F  S  FRSNR           AY GFY QSRWA
Sbjct: 29   TPRGGSPSFRRLNSGRTPRRDGKSSAFG-SQWFRSNRILLWLLLITLWAYGGFYVQSRWA 87

Query: 290  HGDNKEDLFXXXXXXXXXXXXXM----RRDLSVAVGTGALKLKTETSNSSLENVDVVLAK 457
            HGDNKE +F                  +R L     + A+K  +  +  +  ++DVVLAK
Sbjct: 88   HGDNKEGIFGGTGGDVANGTSQPEEKNQRILVANEESLAVKPPSNKTQGNSMDLDVVLAK 147

Query: 458  SRSG---DSLXXXXXXXXXXXXXXXXXXXXXXXXAQDVESE-VDLPIEDIPKKNTTYGFL 625
              +    D +                          +V+++ +++  E+IPK+NTTYG L
Sbjct: 148  QGNSVVSDKVSSSKKKSKKSTRASRRKTHGKKKVVAEVKTDDIEVQEEEIPKRNTTYGLL 207

Query: 626  VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 805
            VGPFGS+ED ILEWSP+KRSGTCDRK  FARLVWSRKFVLI HELSMTGAPLAM+ELATE
Sbjct: 208  VGPFGSIEDKILEWSPEKRSGTCDRKSQFARLVWSRKFVLILHELSMTGAPLAMLELATE 267

Query: 806  FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 985
             LSCGAT+ V+ L+K+GGLMSEL+RRKIKVL DKSDLSFKTAMKA+LIIAGSAVC+SWIE
Sbjct: 268  LLSCGATVYVVPLSKRGGLMSELSRRKIKVLEDKSDLSFKTAMKADLIIAGSAVCASWIE 327

Query: 986  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 1165
            QY +RTVLGSSQI WWIMENRREYFDR+K   NRVKKLIFLSESQSK+WL WCEEE+I L
Sbjct: 328  QYAARTVLGSSQITWWIMENRREYFDRAKLAFNRVKKLIFLSESQSKRWLAWCEEEHIKL 387

Query: 1166 KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLAV 1345
            K +PALVPLS++DELAF AGI CSL+TP F+ E MLEKRQ LR  VR+EMGL D+DML +
Sbjct: 388  KTQPALVPLSISDELAFVAGIPCSLSTPLFSPEKMLEKRQLLRDFVRKEMGLTDNDMLVM 447

Query: 1346 SLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSILLDHDYYSRALLQN----GKRDN 1513
            SLSSINPGKGQ LL+E+ RL+IE    LN S  K       +Y  R LL N    G+   
Sbjct: 448  SLSSINPGKGQFLLLETTRLLIEGAPPLNGSAVK-----RREYQKRTLLYNWKQFGEWKK 502

Query: 1514 ESSNI-DTPTKKRIRSSRIFTNEGRLNSAMYGREARMRKMLSENVGKKGQNLKVLIGSVG 1690
            ESS + + P  + ++  ++F  +G   +A    +   RK+ S   GK+G+ LKVLIGSVG
Sbjct: 503  ESSTLSNNPQTETLQVPQLFI-KGVNYTAGIENDRGTRKLFSLTEGKQGEKLKVLIGSVG 561

Query: 1691 SKSNKVAYVKTLLTYLSTHSNLSKS 1765
            SKSNKV YVK LL +L+ HSNLS +
Sbjct: 562  SKSNKVPYVKALLNFLNQHSNLSNT 586


>emb|CAN71826.1| hypothetical protein VITISV_013841 [Vitis vinifera]
          Length = 734

 Score =  573 bits (1476), Expect = e-160
 Identities = 323/578 (55%), Positives = 383/578 (66%), Gaps = 27/578 (4%)
 Frame = +2

Query: 113  TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRWAH 292
            TPR SPSFRR  S RTPRRE RS G V S  FR+NR           AY GFY QS+WAH
Sbjct: 35   TPRNSPSFRRSHSSRTPRREARSSG-VGSQWFRNNRVVFWLILITLWAYLGFYVQSKWAH 93

Query: 293  GDNKEDLFXXXXXXXXXXXXX-MRRDLSVAVGTGALKLKTETSNSSL---ENVDVVLAKS 460
            GDN ED+               + R   +      L +K  +  + +   + VDVVLAK 
Sbjct: 94   GDNNEDIIGFGGKPNNGISDSELNRKAPLIANDKLLAVKNGSDKNPVGSGKKVDVVLAKK 153

Query: 461  RSGDSLXXXXXXXXXXXXXXXXXXXXXXXXAQDVESEVDLPIED-----IPKKNTTYGFL 625
              G+S+                         Q  ++EV++   D     IPK NT+YG L
Sbjct: 154  --GNSVPSRRSASSKKRSKKSERSLRGKTRKQKTKTEVEVTEMDEQEQEIPKLNTSYGLL 211

Query: 626  VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 805
            VGPFGS ED ILEWSP+KRSGTCDR+G  ARLVWSRKFVLIFHELSMTGAPL+MMELATE
Sbjct: 212  VGPFGSTEDRILEWSPEKRSGTCDRRGELARLVWSRKFVLIFHELSMTGAPLSMMELATE 271

Query: 806  FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 985
             LSCGAT+S +VL+KKGGLM EL RR+IKVL D++DLSFKTAMKA+L+IAGSAVC+SWIE
Sbjct: 272  LLSCGATVSAVVLSKKGGLMPELARRRIKVLEDRADLSFKTAMKADLVIAGSAVCASWIE 331

Query: 986  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 1165
            QY++    GSSQI+WWIMENRREYFDRSK V+NRVK LIFLSESQSKQWL WC+EENI L
Sbjct: 332  QYIAHFTAGSSQIVWWIMENRREYFDRSKLVINRVKMLIFLSESQSKQWLTWCKEENIRL 391

Query: 1166 KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLAV 1345
              +PA+VPLSVNDELAF AGI+CSLNTPSFTTE M EKR+ LR  +R+EMGL D DML +
Sbjct: 392  ISQPAVVPLSVNDELAFVAGITCSLNTPSFTTEKMQEKRRLLRDSIRKEMGLTDTDMLLL 451

Query: 1346 SLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSILLDHD-------YYSRALLQNGK 1504
            SLSSINPGKGQ  L+ES R +IEQ    ++   KD   +  D       +YSRALLQN  
Sbjct: 452  SLSSINPGKGQFFLLESVRSMIEQEPSQDDPELKDLAKIGQDQSNFSGKHYSRALLQNVN 511

Query: 1505 RDNESSN-----------IDTPTKKRIRSSRIFTNEGRLNSAMYGREARMRKMLSENVGK 1651
              + SS+           ++ P  K +    +F +    ++   G   + RK+LSEN G 
Sbjct: 512  HFSVSSSGLRLSNESFIELNGPKSKNLMLPSLFPSISPSDAVSIGSGYKRRKVLSENEGT 571

Query: 1652 KGQNLKVLIGSVGSKSNKVAYVKTLLTYLSTHSNLSKS 1765
            + Q LKVLIGSVGSKSNKV YVK LL +L  HSNLSKS
Sbjct: 572  QEQALKVLIGSVGSKSNKVPYVKGLLRFLXRHSNLSKS 609


>ref|XP_004235700.1| PREDICTED: uncharacterized protein LOC101247116 [Solanum
            lycopersicum]
          Length = 711

 Score =  567 bits (1461), Expect = e-159
 Identities = 320/564 (56%), Positives = 382/564 (67%), Gaps = 13/564 (2%)
 Frame = +2

Query: 113  TPRG-SPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRWA 289
            TPRG SPSFRRL+SGRTPRR+G+S  F  S  FRSNR           AY GFY QSRWA
Sbjct: 29   TPRGGSPSFRRLNSGRTPRRDGKSSVFG-SQWFRSNRIVLWLLLITLWAYGGFYVQSRWA 87

Query: 290  HGDNKEDLFXXXXXXXXXXXXXM----RRDLSVAVGTGALKLKTETSNSSLENVDVVLAK 457
            HGDNKE +F                  +R L     + A+K  +  +  +  ++DVVLAK
Sbjct: 88   HGDNKEGIFGGSGGDVANGTSQPEEKNQRILVANEESLAVKPPSNKTQGNSMDLDVVLAK 147

Query: 458  SRSG---DSLXXXXXXXXXXXXXXXXXXXXXXXXAQDVESE-VDLPIEDIPKKNTTYGFL 625
              +    D                            +V+S+ +++  E+IPK+NTTYG L
Sbjct: 148  QGNSVVSDKGASPKKKSKKSTRASRRKTRGKKKVVAEVKSDDIEIQEEEIPKRNTTYGLL 207

Query: 626  VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 805
            VGPFGS+ED ILEWSP+KR+GTCDRK  FARLVWSRKFVLI HELSMTGAPLAM+ELATE
Sbjct: 208  VGPFGSIEDKILEWSPEKRTGTCDRKSQFARLVWSRKFVLILHELSMTGAPLAMLELATE 267

Query: 806  FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 985
             LSCGAT+ V+ L+K+GGLMSEL+RRKIKVL DKSDLSFKTAMKA+LIIAGSAVC+SWIE
Sbjct: 268  LLSCGATVYVVPLSKRGGLMSELSRRKIKVLEDKSDLSFKTAMKADLIIAGSAVCASWIE 327

Query: 986  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 1165
            QY +RTVLGS+QI WWIMENRREYFDR+K   NRVKKLIFLSESQSK+WL WCEEE+I L
Sbjct: 328  QYAARTVLGSTQITWWIMENRREYFDRAKLAFNRVKKLIFLSESQSKRWLAWCEEEHIKL 387

Query: 1166 KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLAV 1345
            K +PAL+PLS++DELAF AGI CSL+TP F+ E MLEKRQ LR  VR+EMGL D+DML +
Sbjct: 388  KTQPALIPLSISDELAFVAGIPCSLSTPLFSPEKMLEKRQLLRDFVRKEMGLTDNDMLVM 447

Query: 1346 SLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSILLDHDYYSRALLQN----GKRDN 1513
            SLSSINPGKGQ LL+E+ RL+IE    L  S  K       +Y  R LL N    G+   
Sbjct: 448  SLSSINPGKGQFLLLETTRLLIEGAPPLYGSAVK-----RREYQKRTLLYNWKQFGEWKK 502

Query: 1514 ESSNIDTPTKKRIRSSRIFTNEGRLNSAMYGREARMRKMLSENVGKKGQNLKVLIGSVGS 1693
            ESS +    +           +G   +A    +   RK+ S   GK+G+ LKVLIGSVGS
Sbjct: 503  ESSTLSNNQETEALQVPQLFIKGVNYTAGIENDRGTRKLFSLPEGKQGEKLKVLIGSVGS 562

Query: 1694 KSNKVAYVKTLLTYLSTHSNLSKS 1765
            KSNKV YVK LL +L+ HSNLS +
Sbjct: 563  KSNKVPYVKALLNFLNQHSNLSNT 586


>ref|XP_002284822.1| PREDICTED: uncharacterized protein LOC100246448 [Vitis vinifera]
          Length = 691

 Score =  567 bits (1461), Expect = e-159
 Identities = 320/567 (56%), Positives = 376/567 (66%), Gaps = 16/567 (2%)
 Frame = +2

Query: 113  TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRWAH 292
            TPR SPSFRR  S RTPRRE RS G V S  FR+NR           AY GFY QS+WAH
Sbjct: 24   TPRNSPSFRRSHSSRTPRREARSSG-VGSQWFRNNRVVFWLILITLWAYLGFYVQSKWAH 82

Query: 293  GDNKEDLFXXXXXXXXXXXXX-MRRDLSVAVGTGALKLKTETSNSSL---ENVDVVLAKS 460
            GDN ED+               + R   +      L +K  +  + +   + VDVVLAK 
Sbjct: 83   GDNNEDIIGFGGKPNNGISDSELNRKAPLIANDKLLAVKNGSDKNPVGSGKKVDVVLAKK 142

Query: 461  RSGDSLXXXXXXXXXXXXXXXXXXXXXXXXAQDVESEVDLPIED-----IPKKNTTYGFL 625
              G+S+                         Q  ++EV++   D     IPK NT+YG L
Sbjct: 143  --GNSVPSRRSASSKKRSKKSERSLRGKTRKQKTKTEVEVTEMDEQEQEIPKLNTSYGLL 200

Query: 626  VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 805
            VGPFGS ED ILEWSP+KRSGTCDR+G  ARLVWSRKFVLIFHELSMTGAPL+MMELATE
Sbjct: 201  VGPFGSTEDRILEWSPEKRSGTCDRRGELARLVWSRKFVLIFHELSMTGAPLSMMELATE 260

Query: 806  FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 985
             LSCGAT+S +VL+KKGGLM EL RR+IKVL D++DLSFKTAMKA+L+IAGSAVC+SWIE
Sbjct: 261  LLSCGATVSAVVLSKKGGLMPELARRRIKVLEDRADLSFKTAMKADLVIAGSAVCASWIE 320

Query: 986  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 1165
            QY++    GSSQI+WWIMENRREYFDRSK V+NRVK LIFLSESQSKQWL WC+EENI L
Sbjct: 321  QYIAHFTAGSSQIVWWIMENRREYFDRSKLVINRVKMLIFLSESQSKQWLTWCKEENIRL 380

Query: 1166 KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLAV 1345
              +PA+VPLSVNDELAF AGI+CSLNTPSFTTE M EKR+ LR  +R+EMGL D DML +
Sbjct: 381  ISQPAVVPLSVNDELAFVAGITCSLNTPSFTTEKMQEKRRLLRDSIRKEMGLTDTDMLLL 440

Query: 1346 SLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSILLDHD-------YYSRALLQNGK 1504
            SLSSINPGKGQ  L+ES R +IEQ    ++   KD + +  D       +YSRALLQN  
Sbjct: 441  SLSSINPGKGQFFLLESVRSMIEQEPSQDDPELKDLVKIGQDQSNFSGKHYSRALLQNVN 500

Query: 1505 RDNESSNIDTPTKKRIRSSRIFTNEGRLNSAMYGREARMRKMLSENVGKKGQNLKVLIGS 1684
              + SS+                     +    G   + RK+LSEN G + Q LKVLIGS
Sbjct: 501  HFSVSSS---------------------DEVSIGSGYKRRKVLSENEGTQEQALKVLIGS 539

Query: 1685 VGSKSNKVAYVKTLLTYLSTHSNLSKS 1765
            VGSKSNKV YVK LL +L+ HSNLSKS
Sbjct: 540  VGSKSNKVPYVKGLLRFLTRHSNLSKS 566


>ref|XP_007217014.1| hypothetical protein PRUPE_ppa002059mg [Prunus persica]
            gi|462413164|gb|EMJ18213.1| hypothetical protein
            PRUPE_ppa002059mg [Prunus persica]
          Length = 723

 Score =  554 bits (1427), Expect = e-155
 Identities = 321/577 (55%), Positives = 381/577 (66%), Gaps = 26/577 (4%)
 Frame = +2

Query: 113  TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRWAH 292
            +PR SPSFRRL+S RTPRRE RS G V    FRSNR           AY GFYFQS WAH
Sbjct: 27   SPRNSPSFRRLNSSRTPRREARSSGGV--QWFRSNRLLFWLLLITLWAYLGFYFQSSWAH 84

Query: 293  GDNKEDLFXXXXXXXXXXXXX---MRRDLSVAVGTGALKLKTETSNSSL-ENVDVVLAKS 460
             +NKE+                   RRDL  +  + A+K +T  +     +++DVVL K 
Sbjct: 85   -NNKENFLGFGNKASNGNSDTEQNARRDLLASDSSMAVKNETNQNQVKAGKSIDVVLTKK 143

Query: 461  RSGDSLXXXXXXXXXXXXXXXXXXXXXXXXAQ---DVES-EVDLPIEDIPKKNTTYGFLV 628
             +G S                          +   +VE  E +    DIPK NT+YG LV
Sbjct: 144  ENGVSSRRSASSKKRSKKSARSLRGKVHGKQKKTVEVEGHETEEQELDIPKTNTSYGMLV 203

Query: 629  GPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATEF 808
            GPFG VED  LEWSP  RSGTCDRKG FARLVWSR+F+LIFHELSMTGAPL+MMELATE 
Sbjct: 204  GPFGFVEDRTLEWSPKTRSGTCDRKGDFARLVWSRRFLLIFHELSMTGAPLSMMELATEL 263

Query: 809  LSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIEQ 988
            LSCGAT+S +VL+KKGGLM EL RR+IKVL DK + SFKTAMKA+L+IAGSAVC+SWI+Q
Sbjct: 264  LSCGATVSAVVLSKKGGLMPELARRRIKVLEDKVEQSFKTAMKADLVIAGSAVCASWIDQ 323

Query: 989  YLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHLK 1168
            Y+     G+SQI WWIMENRREYFDR+K VLNRVK L FLSESQSKQWLDWCEEE I L+
Sbjct: 324  YMDHFPAGASQIAWWIMENRREYFDRAKVVLNRVKMLAFLSESQSKQWLDWCEEEKIKLR 383

Query: 1169 DEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLAVS 1348
             +PA+VPLS+NDELAF AGI CSLNTPS +TE MLEKRQ LR  VR+EMGL D+DML +S
Sbjct: 384  SQPAVVPLSINDELAFVAGIGCSLNTPSSSTEKMLEKRQLLRDSVRKEMGLTDNDMLVMS 443

Query: 1349 LSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSI-------LLDHDYYSRALLQNGKR 1507
            LSSINPGKGQLLL+ESARLVIE+  K  NS  K+ +        L   ++ RAL Q    
Sbjct: 444  LSSINPGKGQLLLLESARLVIEEPLKY-NSKIKNPVRKRQARSTLARKHHLRALFQELND 502

Query: 1508 DNESSN-----------IDTPTKKRIRSSRIFTNEGRLNSAMYGREARMRKMLSENVGKK 1654
            D  SSN           ++ P KK++R   ++T+        +      RK+LS+N G  
Sbjct: 503  DGVSSNELPLSNESDVQLNEPQKKKLRLRSLYTSFDDTGDLTF-NVTHKRKVLSDNGGTL 561

Query: 1655 GQNLKVLIGSVGSKSNKVAYVKTLLTYLSTHSNLSKS 1765
             Q++K LIGSVGSKSNKV YVK LL +LS HSN+SKS
Sbjct: 562  EQSVKFLIGSVGSKSNKVLYVKELLGFLSQHSNMSKS 598


>ref|XP_004302927.1| PREDICTED: uncharacterized protein LOC101300160 [Fragaria vesca
            subsp. vesca]
          Length = 720

 Score =  551 bits (1419), Expect = e-154
 Identities = 317/578 (54%), Positives = 379/578 (65%), Gaps = 27/578 (4%)
 Frame = +2

Query: 113  TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRWAH 292
            +PR SPSF+RL S RTPRRE RS G V    FRSNR           AY GFYFQS WAH
Sbjct: 27   SPRSSPSFKRLHSSRTPRREARSSGGV--QWFRSNRLLFWLLLITLWAYLGFYFQSSWAH 84

Query: 293  GDNKEDLFXXXXXXXXXXXXX---MRRDLSVAVGTGALKLKTETSNSSLE---NVDVVLA 454
             +NK +                   RRDL        +KLK ET  +  E    +DVVLA
Sbjct: 85   SNNKVNFLGVGNEASNDKSDAEQNQRRDLL----DSPVKLKNETGQNQPEAGKTIDVVLA 140

Query: 455  KSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXXAQDVE-SEVDLPIEDIPKKNTTYGFLVG 631
            K   G +                            +E  E++    DIPK N +YG LVG
Sbjct: 141  KKDDGVASRRSLSSKKKSKKAARGKSHGKPKKTVAIEIHEIEEQEPDIPKTNASYGMLVG 200

Query: 632  PFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATEFL 811
            PFGS ED ILEW+P  R+GTCDRKG F+RLVWSR+F+LIFHELSMTGAPL+MMELATE L
Sbjct: 201  PFGSTEDRILEWNPKTRTGTCDRKGDFSRLVWSRRFLLIFHELSMTGAPLSMMELATELL 260

Query: 812  SCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIEQY 991
            SCGAT+S IVL+KKGGLM EL RR+IKVL DK+D SFKTAMK +L+IAGSAVC+SWI+QY
Sbjct: 261  SCGATVSAIVLSKKGGLMPELTRRRIKVLEDKADHSFKTAMKQDLVIAGSAVCASWIDQY 320

Query: 992  LSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHLKD 1171
            + +   G+SQI WWIMENRREYFDR+K VL+RVK L FLSESQSKQWLDWCEEE I L+ 
Sbjct: 321  IDKFPAGASQIAWWIMENRREYFDRAKVVLDRVKMLAFLSESQSKQWLDWCEEEKIKLRS 380

Query: 1172 EPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLAVSL 1351
            +PA+VPLS+NDELAF AGI CSLNTPS + E MLEK + LR  VR+EMGL D+DMLA+SL
Sbjct: 381  QPAIVPLSINDELAFVAGIGCSLNTPSSSIEKMLEKMKLLRDAVRKEMGLTDNDMLAISL 440

Query: 1352 SSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSI-------LLDHDYYSRALLQNGKRD 1510
            SSINPGKGQLL++ SARLVIE+  + +NS  K+S+        L   ++ RALLQ G  D
Sbjct: 441  SSINPGKGQLLVLNSARLVIEEEPQPDNSKIKNSVRKGRVRSALARKHHIRALLQ-GSND 499

Query: 1511 NESSNIDTPTKKRIRSSRIFTNEGRLNSAMYGREARM-------------RKMLSENVGK 1651
            + +S    P      SS  F  + + +  ++ R A +             RK+L++N G 
Sbjct: 500  HSASLNGFPLS--TESSVHFKEDQKKHLHLHNRFASVDDTDAMNFDVTYKRKVLADNGGT 557

Query: 1652 KGQNLKVLIGSVGSKSNKVAYVKTLLTYLSTHSNLSKS 1765
              Q+ K LIGSVGSKSNKVAYVK LL+YLS HSNLSKS
Sbjct: 558  VKQSAKFLIGSVGSKSNKVAYVKELLSYLSQHSNLSKS 595


>ref|XP_006427083.1| hypothetical protein CICLE_v10024994mg [Citrus clementina]
            gi|557529073|gb|ESR40323.1| hypothetical protein
            CICLE_v10024994mg [Citrus clementina]
          Length = 732

 Score =  548 bits (1412), Expect = e-153
 Identities = 302/580 (52%), Positives = 378/580 (65%), Gaps = 29/580 (5%)
 Frame = +2

Query: 113  TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRWAH 292
            TP+ SPSFRRL++ RTPRRE RS        FRSNR            Y GFY QSRWAH
Sbjct: 35   TPKNSPSFRRLNASRTPRREVRSASL---QWFRSNRLVYWLLLITLWTYLGFYVQSRWAH 91

Query: 293  GDNKEDLFXXXXXXXXXXXXX---MRRDL-----SVAVGTGALKLKTETSNSSLENVDVV 448
            G+N +                    RRDL      + +  G +K    T  +  + +D+V
Sbjct: 92   GENNDKFLGFGGKRRNEIVDSNQNKRRDLIANHSDLDINNGTIK----TLGADSKKIDMV 147

Query: 449  LAKSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXXAQDVESE-VDLPIEDIPKKNTTYGFL 625
            L + R+ D+                           DVES  ++  + +IP  N +YG L
Sbjct: 148  LTQRRNNDASRRSVAKRKKSKRSSRGKGRGKQKAKLDVESNYMEAQLPEIPMTNASYGLL 207

Query: 626  VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 805
            VGPFG  ED ILEWSP+KRSGTCDRKG FAR VWSRKF+LIFHELSMTGAPL+MMELATE
Sbjct: 208  VGPFGLTEDRILEWSPEKRSGTCDRKGDFARFVWSRKFILIFHELSMTGAPLSMMELATE 267

Query: 806  FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 985
             LSCGAT+S +VL+K+GGLM EL RRKIKVL D+ + SFKT+MKA+L+IAGSAVC++WI+
Sbjct: 268  LLSCGATVSAVVLSKRGGLMPELARRKIKVLEDRGEPSFKTSMKADLVIAGSAVCATWID 327

Query: 986  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 1165
            QY++R   G SQ++WWIMENRREYFDR+K VL+RVK L+FLSESQ+KQWL WCEEE + L
Sbjct: 328  QYITRFPAGGSQVVWWIMENRREYFDRAKLVLDRVKMLVFLSESQTKQWLTWCEEEKLKL 387

Query: 1166 KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLAV 1345
            + +PA+VPLSVNDELAF AG +CSLNTP+ + E M EKR  LR  VR+EMGL D DML +
Sbjct: 388  RSQPAVVPLSVNDELAFVAGFTCSLNTPTSSPEKMCEKRNLLRDSVRKEMGLTDQDMLVL 447

Query: 1346 SLSSINPGKGQLLLMESARLVIEQGQKLNNS---------GSKDSILLDHDYYSRALLQN 1498
            SLSSINPGKGQLLL+ESA+L+IEQ   +++S           K S+   H    R LLQ 
Sbjct: 448  SLSSINPGKGQLLLVESAQLMIEQEPSMDDSKIRKSRNVGRKKSSLTSRHHLRGRGLLQM 507

Query: 1499 ----GKRDNESS-------NIDTPTKKRIRSSRIFTNEGRLNSAMYGREARMRKMLSENV 1645
                G   NE S        ++ P +K + S  +FT+ G  ++  +G     RK+LS++ 
Sbjct: 508  SDDVGLSSNELSVSSESFTQLNEPVRKNLLSPSLFTSIGNTDAVSFGSGHLRRKVLSKSD 567

Query: 1646 GKKGQNLKVLIGSVGSKSNKVAYVKTLLTYLSTHSNLSKS 1765
            GK+ Q LK+LIGSVGSKSNKV YVK +L +LS HSNLSK+
Sbjct: 568  GKQQQALKILIGSVGSKSNKVPYVKEILEFLSQHSNLSKA 607


>ref|XP_006465456.1| PREDICTED: uncharacterized protein LOC102612096 isoform X1 [Citrus
            sinensis] gi|568822059|ref|XP_006465457.1| PREDICTED:
            uncharacterized protein LOC102612096 isoform X2 [Citrus
            sinensis]
          Length = 732

 Score =  546 bits (1408), Expect = e-153
 Identities = 302/580 (52%), Positives = 378/580 (65%), Gaps = 29/580 (5%)
 Frame = +2

Query: 113  TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRWAH 292
            TP+ SPSFRRL++ RTPRRE RS        FRSNR            Y GFY QSRWAH
Sbjct: 35   TPKNSPSFRRLNASRTPRREVRSASL---QWFRSNRLVYWLLLITLWTYLGFYVQSRWAH 91

Query: 293  GDNKEDLFXXXXXXXXXXXXX---MRRDL-----SVAVGTGALKLKTETSNSSLENVDVV 448
            G+N +                    RRDL      + +  G +K    T  +  + +D+V
Sbjct: 92   GENNDKFLGFGGKRRNEIVDSNQNKRRDLIANHSDLDINNGTIK----TLGADSKKMDMV 147

Query: 449  LAKSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXXAQDVESE-VDLPIEDIPKKNTTYGFL 625
            L + R+ D+                           DVES  ++  + +IP  N +YG L
Sbjct: 148  LTQRRNNDASRRSVAKRKKSKRSSRGKGRGKQKAKLDVESNYMEAQLPEIPMTNASYGLL 207

Query: 626  VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 805
            VGPFG  ED ILEWSP+KRSGTCDRKG FAR VWSRKF+LIFHELSMTGAPL+MMELATE
Sbjct: 208  VGPFGLTEDRILEWSPEKRSGTCDRKGDFARFVWSRKFILIFHELSMTGAPLSMMELATE 267

Query: 806  FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 985
             LSCGAT+S +VL+K+GGLM EL RRKIKVL D+ + SFKT+MKA+L+IAGSAVC++WI+
Sbjct: 268  LLSCGATVSAVVLSKRGGLMPELARRKIKVLEDRGEPSFKTSMKADLVIAGSAVCATWID 327

Query: 986  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 1165
            QY++R   G SQ++WWIMENRREYFDR+K VL+RVK L+FLSESQ+KQWL WCEEE + L
Sbjct: 328  QYITRFPAGGSQVVWWIMENRREYFDRAKLVLDRVKLLVFLSESQTKQWLTWCEEEKLKL 387

Query: 1166 KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLAV 1345
            + +PA+VPLSVNDELAF AG +CSLNTP+ + E M EKR  LR  VR+EMGL D DML +
Sbjct: 388  RSQPAVVPLSVNDELAFVAGFTCSLNTPTSSPEKMREKRNLLRDSVRKEMGLTDQDMLVL 447

Query: 1346 SLSSINPGKGQLLLMESARLVIEQGQKLNNS---------GSKDSILLDHDYYSRALLQN 1498
            SLSSINPGKGQLLL+ESA+L+IEQ   +++S           K S+   H    R LLQ 
Sbjct: 448  SLSSINPGKGQLLLVESAQLMIEQEPSMDDSKIRKSRNVGRKKSSLTSRHHLRGRGLLQM 507

Query: 1499 ----GKRDNESS-------NIDTPTKKRIRSSRIFTNEGRLNSAMYGREARMRKMLSENV 1645
                G   NE S        ++ P +K + S  +FT+ G  ++  +G     RK+LS++ 
Sbjct: 508  SDDVGLSSNELSVSSESFTQLNEPVRKNLLSPSLFTSIGNTDAVSFGSGHLRRKVLSKSD 567

Query: 1646 GKKGQNLKVLIGSVGSKSNKVAYVKTLLTYLSTHSNLSKS 1765
            GK+ Q LK+LIGSVGSKSNKV YVK +L +LS HSNLSK+
Sbjct: 568  GKQQQALKILIGSVGSKSNKVPYVKEILEFLSQHSNLSKA 607


>emb|CBI36173.3| unnamed protein product [Vitis vinifera]
          Length = 683

 Score =  546 bits (1408), Expect = e-153
 Identities = 314/567 (55%), Positives = 366/567 (64%), Gaps = 16/567 (2%)
 Frame = +2

Query: 113  TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRWAH 292
            TPR SPSFRR  S RTPRRE RS G V S  FR+NR           AY GFY QS+WAH
Sbjct: 35   TPRNSPSFRRSHSSRTPRREARSSG-VGSQWFRNNRVVFWLILITLWAYLGFYVQSKWAH 93

Query: 293  GDNKEDLFXXXXXXXXXXXXX-MRRDLSVAVGTGALKLKTETSNSSL---ENVDVVLAKS 460
            GDN ED+               + R   +      L +K  +  + +   + VDVVLAK 
Sbjct: 94   GDNNEDIIGFGGKPNNGISDSELNRKAPLIANDKLLAVKNGSDKNPVGSGKKVDVVLAKK 153

Query: 461  RSGDSLXXXXXXXXXXXXXXXXXXXXXXXXAQDVESEVDLPIED-----IPKKNTTYGFL 625
              G+S+                         Q  ++EV++   D     IPK NT+YG L
Sbjct: 154  --GNSVPSRRSASSKKRSKKSERSLRGKTRKQKTKTEVEVTEMDEQEQEIPKLNTSYGLL 211

Query: 626  VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 805
            VGPFGS ED ILEWSP+KRSGTCDR+G  ARLVWSRKFVLIFHELSMTGAPL+MMELATE
Sbjct: 212  VGPFGSTEDRILEWSPEKRSGTCDRRGELARLVWSRKFVLIFHELSMTGAPLSMMELATE 271

Query: 806  FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 985
             LSCGAT+S +VL+KKGGLM EL RR+IKVL D++DLSFKTAMKA+L+IAGSAVC+SWIE
Sbjct: 272  LLSCGATVSAVVLSKKGGLMPELARRRIKVLEDRADLSFKTAMKADLVIAGSAVCASWIE 331

Query: 986  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 1165
            QY++    GSSQI+WWIMENRREYFDRSK V+NRVK LIFLSESQSKQWL WC+EENI L
Sbjct: 332  QYIAHFTAGSSQIVWWIMENRREYFDRSKLVINRVKMLIFLSESQSKQWLTWCKEENIRL 391

Query: 1166 KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLAV 1345
              +PA+VPLSVNDELAF AGI+CSLNTPSFTTE M EKR+ LR  +R+EMGL D DML +
Sbjct: 392  ISQPAVVPLSVNDELAFVAGITCSLNTPSFTTEKMQEKRRLLRDSIRKEMGLTDTDMLLL 451

Query: 1346 SLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSILLDHD-------YYSRALLQNGK 1504
            SLSSINPGKGQ  L+ES R +IEQ    ++   KD + +  D       +YSRALLQN  
Sbjct: 452  SLSSINPGKGQFFLLESVRSMIEQEPSQDDPELKDLVKIGQDQSNFSGKHYSRALLQN-- 509

Query: 1505 RDNESSNIDTPTKKRIRSSRIFTNEGRLNSAMYGREARMRKMLSENVGKKGQNLKVLIGS 1684
                                       LN              S+N+    Q LKVLIGS
Sbjct: 510  ---------------------------LNGPK-----------SKNLMLPKQALKVLIGS 531

Query: 1685 VGSKSNKVAYVKTLLTYLSTHSNLSKS 1765
            VGSKSNKV YVK LL +L+ HSNLSKS
Sbjct: 532  VGSKSNKVPYVKGLLRFLTRHSNLSKS 558


>ref|XP_007024057.1| UDP-Glycosyltransferase superfamily protein isoform 3 [Theobroma
            cacao] gi|508779423|gb|EOY26679.1|
            UDP-Glycosyltransferase superfamily protein isoform 3
            [Theobroma cacao]
          Length = 608

 Score =  544 bits (1402), Expect = e-152
 Identities = 309/567 (54%), Positives = 376/567 (66%), Gaps = 16/567 (2%)
 Frame = +2

Query: 113  TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRWAH 292
            TP+ SP+FRRL+S RTPRRE RSG   + + FRSNR           AY GFY QSRWAH
Sbjct: 26   TPKSSPTFRRLNSSRTPRREARSGAGGIQW-FRSNRLVYWLLLITLWAYLGFYVQSRWAH 84

Query: 293  GDNKEDLFXXXXXXXXXXXXXM---RRDLS-----VAVGTGALKLKTETSNSSLENVDVV 448
            G NKE+                   RRDL      VAV  G  K    T   S    DV+
Sbjct: 85   GHNKEEFLGFSGNPRNGLIDAEQNPRRDLLADDSLVAVNNGTNK----TQVYSDRKFDVI 140

Query: 449  LAKSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXXAQDVES-EVDLPIEDIPKKNTTYGFL 625
            LAK R+  S                           ++E+ E +    +I +KN+TYG L
Sbjct: 141  LAKKRNEVSFNKKRSRRSKRAGRNLSKMRGKRKATINIENGETEGQEHEILQKNSTYGLL 200

Query: 626  VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 805
            VGPFGSVED ILEWSP+KRSGTCDRKG FARLVWSR+ VL+FHELSMTGAP++MMELATE
Sbjct: 201  VGPFGSVEDRILEWSPEKRSGTCDRKGDFARLVWSRRLVLVFHELSMTGAPISMMELATE 260

Query: 806  FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 985
             LSCGAT+S +VL+KKGGLMSEL RR+IKV+ D++DLSFKTAMKA+L+IAGSAVC+SWI+
Sbjct: 261  LLSCGATVSAVVLSKKGGLMSELARRRIKVIEDRADLSFKTAMKADLVIAGSAVCASWID 320

Query: 986  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 1165
            QY++    G SQI WWIMENRREYFDRSK VL+RVK LIFLSE QSKQWL WC+EENI L
Sbjct: 321  QYIAHFPAGGSQIAWWIMENRREYFDRSKLVLHRVKMLIFLSELQSKQWLTWCQEENIKL 380

Query: 1166 KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLAV 1345
            + +PALVPL+VNDELAF AGI CSLNTPS + E MLEKRQ LR  VR+EMGL D+DML +
Sbjct: 381  RSQPALVPLAVNDELAFVAGIPCSLNTPSASPEKMLEKRQLLRDAVRKEMGLTDNDMLVM 440

Query: 1346 SLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSILLDHD-------YYSRALLQNGK 1504
            SLSSIN GKGQLLL+E+A L+I+Q     +S    S+ +  D       ++ R LLQ   
Sbjct: 441  SLSSINTGKGQLLLLEAAGLMIDQDPLQTDSEVTKSLDIRQDQSTLTVKHHLRGLLQ--- 497

Query: 1505 RDNESSNIDTPTKKRIRSSRIFTNEGRLNSAMYGREARMRKMLSENVGKKGQNLKVLIGS 1684
               +SS++D  +       R+F +    N+       R R ML ++ G + Q LK+LIGS
Sbjct: 498  ---KSSDVDVSS----TDLRLFASVNGTNAVSIDSSHRRRNMLFDSKGTQEQALKILIGS 550

Query: 1685 VGSKSNKVAYVKTLLTYLSTHSNLSKS 1765
            VGSKSNK+ YVK +L +LS H+ LS+S
Sbjct: 551  VGSKSNKMPYVKEILRFLSQHAKLSES 577


>ref|XP_007024056.1| UDP-Glycosyltransferase superfamily protein isoform 2 [Theobroma
            cacao] gi|508779422|gb|EOY26678.1|
            UDP-Glycosyltransferase superfamily protein isoform 2
            [Theobroma cacao]
          Length = 703

 Score =  544 bits (1402), Expect = e-152
 Identities = 309/567 (54%), Positives = 376/567 (66%), Gaps = 16/567 (2%)
 Frame = +2

Query: 113  TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRWAH 292
            TP+ SP+FRRL+S RTPRRE RSG   + + FRSNR           AY GFY QSRWAH
Sbjct: 26   TPKSSPTFRRLNSSRTPRREARSGAGGIQW-FRSNRLVYWLLLITLWAYLGFYVQSRWAH 84

Query: 293  GDNKEDLFXXXXXXXXXXXXXM---RRDLS-----VAVGTGALKLKTETSNSSLENVDVV 448
            G NKE+                   RRDL      VAV  G  K    T   S    DV+
Sbjct: 85   GHNKEEFLGFSGNPRNGLIDAEQNPRRDLLADDSLVAVNNGTNK----TQVYSDRKFDVI 140

Query: 449  LAKSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXXAQDVES-EVDLPIEDIPKKNTTYGFL 625
            LAK R+  S                           ++E+ E +    +I +KN+TYG L
Sbjct: 141  LAKKRNEVSFNKKRSRRSKRAGRNLSKMRGKRKATINIENGETEGQEHEILQKNSTYGLL 200

Query: 626  VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 805
            VGPFGSVED ILEWSP+KRSGTCDRKG FARLVWSR+ VL+FHELSMTGAP++MMELATE
Sbjct: 201  VGPFGSVEDRILEWSPEKRSGTCDRKGDFARLVWSRRLVLVFHELSMTGAPISMMELATE 260

Query: 806  FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 985
             LSCGAT+S +VL+KKGGLMSEL RR+IKV+ D++DLSFKTAMKA+L+IAGSAVC+SWI+
Sbjct: 261  LLSCGATVSAVVLSKKGGLMSELARRRIKVIEDRADLSFKTAMKADLVIAGSAVCASWID 320

Query: 986  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 1165
            QY++    G SQI WWIMENRREYFDRSK VL+RVK LIFLSE QSKQWL WC+EENI L
Sbjct: 321  QYIAHFPAGGSQIAWWIMENRREYFDRSKLVLHRVKMLIFLSELQSKQWLTWCQEENIKL 380

Query: 1166 KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLAV 1345
            + +PALVPL+VNDELAF AGI CSLNTPS + E MLEKRQ LR  VR+EMGL D+DML +
Sbjct: 381  RSQPALVPLAVNDELAFVAGIPCSLNTPSASPEKMLEKRQLLRDAVRKEMGLTDNDMLVM 440

Query: 1346 SLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSILLDHD-------YYSRALLQNGK 1504
            SLSSIN GKGQLLL+E+A L+I+Q     +S    S+ +  D       ++ R LLQ   
Sbjct: 441  SLSSINTGKGQLLLLEAAGLMIDQDPLQTDSEVTKSLDIRQDQSTLTVKHHLRGLLQ--- 497

Query: 1505 RDNESSNIDTPTKKRIRSSRIFTNEGRLNSAMYGREARMRKMLSENVGKKGQNLKVLIGS 1684
               +SS++D  +       R+F +    N+       R R ML ++ G + Q LK+LIGS
Sbjct: 498  ---KSSDVDVSS----TDLRLFASVNGTNAVSIDSSHRRRNMLFDSKGTQEQALKILIGS 550

Query: 1685 VGSKSNKVAYVKTLLTYLSTHSNLSKS 1765
            VGSKSNK+ YVK +L +LS H+ LS+S
Sbjct: 551  VGSKSNKMPYVKEILRFLSQHAKLSES 577


>ref|XP_007024055.1| UDP-Glycosyltransferase superfamily protein isoform 1 [Theobroma
            cacao] gi|508779421|gb|EOY26677.1|
            UDP-Glycosyltransferase superfamily protein isoform 1
            [Theobroma cacao]
          Length = 702

 Score =  544 bits (1402), Expect = e-152
 Identities = 309/567 (54%), Positives = 376/567 (66%), Gaps = 16/567 (2%)
 Frame = +2

Query: 113  TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRWAH 292
            TP+ SP+FRRL+S RTPRRE RSG   + + FRSNR           AY GFY QSRWAH
Sbjct: 26   TPKSSPTFRRLNSSRTPRREARSGAGGIQW-FRSNRLVYWLLLITLWAYLGFYVQSRWAH 84

Query: 293  GDNKEDLFXXXXXXXXXXXXXM---RRDLS-----VAVGTGALKLKTETSNSSLENVDVV 448
            G NKE+                   RRDL      VAV  G  K    T   S    DV+
Sbjct: 85   GHNKEEFLGFSGNPRNGLIDAEQNPRRDLLADDSLVAVNNGTNK----TQVYSDRKFDVI 140

Query: 449  LAKSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXXAQDVES-EVDLPIEDIPKKNTTYGFL 625
            LAK R+  S                           ++E+ E +    +I +KN+TYG L
Sbjct: 141  LAKKRNEVSFNKKRSRRSKRAGRNLSKMRGKRKATINIENGETEGQEHEILQKNSTYGLL 200

Query: 626  VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 805
            VGPFGSVED ILEWSP+KRSGTCDRKG FARLVWSR+ VL+FHELSMTGAP++MMELATE
Sbjct: 201  VGPFGSVEDRILEWSPEKRSGTCDRKGDFARLVWSRRLVLVFHELSMTGAPISMMELATE 260

Query: 806  FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 985
             LSCGAT+S +VL+KKGGLMSEL RR+IKV+ D++DLSFKTAMKA+L+IAGSAVC+SWI+
Sbjct: 261  LLSCGATVSAVVLSKKGGLMSELARRRIKVIEDRADLSFKTAMKADLVIAGSAVCASWID 320

Query: 986  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 1165
            QY++    G SQI WWIMENRREYFDRSK VL+RVK LIFLSE QSKQWL WC+EENI L
Sbjct: 321  QYIAHFPAGGSQIAWWIMENRREYFDRSKLVLHRVKMLIFLSELQSKQWLTWCQEENIKL 380

Query: 1166 KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLAV 1345
            + +PALVPL+VNDELAF AGI CSLNTPS + E MLEKRQ LR  VR+EMGL D+DML +
Sbjct: 381  RSQPALVPLAVNDELAFVAGIPCSLNTPSASPEKMLEKRQLLRDAVRKEMGLTDNDMLVM 440

Query: 1346 SLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSILLDHD-------YYSRALLQNGK 1504
            SLSSIN GKGQLLL+E+A L+I+Q     +S    S+ +  D       ++ R LLQ   
Sbjct: 441  SLSSINTGKGQLLLLEAAGLMIDQDPLQTDSEVTKSLDIRQDQSTLTVKHHLRGLLQ--- 497

Query: 1505 RDNESSNIDTPTKKRIRSSRIFTNEGRLNSAMYGREARMRKMLSENVGKKGQNLKVLIGS 1684
               +SS++D  +       R+F +    N+       R R ML ++ G + Q LK+LIGS
Sbjct: 498  ---KSSDVDVSS----TDLRLFASVNGTNAVSIDSSHRRRNMLFDSKGTQEQALKILIGS 550

Query: 1685 VGSKSNKVAYVKTLLTYLSTHSNLSKS 1765
            VGSKSNK+ YVK +L +LS H+ LS+S
Sbjct: 551  VGSKSNKMPYVKEILRFLSQHAKLSES 577


>ref|XP_002528176.1| glycosyltransferase, putative [Ricinus communis]
            gi|223532388|gb|EEF34183.1| glycosyltransferase, putative
            [Ricinus communis]
          Length = 686

 Score =  541 bits (1394), Expect = e-151
 Identities = 320/573 (55%), Positives = 376/573 (65%), Gaps = 22/573 (3%)
 Frame = +2

Query: 113  TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRWAH 292
            T + SP+FRRL S RTPR E RS G  + + FRS R           AY GFY QSRWAH
Sbjct: 35   TAKNSPTFRRLHSSRTPRGEARSIGGGVQW-FRSTRLVYWLLLITLWAYLGFYVQSRWAH 93

Query: 293  GDNKEDLF---XXXXXXXXXXXXXMRRDL-----SVAVGTGALKLKTETSNSSLENVDVV 448
            GDNKED                   RRDL     SVAV  G   ++ E        + VV
Sbjct: 94   GDNKEDFLGFGGQNRNEISVPEQNTRRDLLANDSSVAVNDGTDNVQVEDD----RRIGVV 149

Query: 449  LAKSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXXAQD-------VESE-VDLPIEDIPKK 604
            LAK   G+++                         +D       VESE V++   DIP+K
Sbjct: 150  LAK--KGNTVSSNQKKNSFSKKRSKRAGRRLRSKTRDKQKATVEVESEDVEVQEPDIPQK 207

Query: 605  NTTYGFLVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLA 784
            NTTYGFLVGPFGS ED ILEWSP+KR+GTCDRKG FARLVWSRKFVLIFHELSMTGAPL+
Sbjct: 208  NTTYGFLVGPFGSTEDRILEWSPEKRTGTCDRKGDFARLVWSRKFVLIFHELSMTGAPLS 267

Query: 785  MMELATEFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSA 964
            MMELATEFLSCGAT+S +VL+KKGGLMSELNRR+IKVL DK+DLSFKTAMKA+L+IAGSA
Sbjct: 268  MMELATEFLSCGATVSAVVLSKKGGLMSELNRRRIKVLEDKADLSFKTAMKADLVIAGSA 327

Query: 965  VCSSWIEQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWC 1144
            VC+SWI+QY++R   G SQI+WWIMENRREYFDRSK VLNRVK L+FLSESQ++QWL WC
Sbjct: 328  VCASWIDQYMTRFPAGGSQIVWWIMENRREYFDRSKIVLNRVKMLVFLSESQTEQWLSWC 387

Query: 1145 EEENIHLKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLN 1324
            +EE I L+  PA+VPLS+NDELAF AGI+CSLNTPS + E MLEKR+ L   VR+EMGL 
Sbjct: 388  DEEKIKLRAPPAIVPLSINDELAFVAGIACSLNTPSSSPEKMLEKRRLLADSVRKEMGLT 447

Query: 1325 DDDMLAVSLSSINPGKGQLLLMESARLVIEQG--QKLNNS---GSKDS-ILLDHDYYSRA 1486
            DDD+L VSLSSINPGKGQLL++ESA+L+IE    QKL +S   G + S I + H  + RA
Sbjct: 448  DDDVLLVSLSSINPGKGQLLILESAKLLIEPEPLQKLRSSVGIGEEQSRIAVKH--HLRA 505

Query: 1487 LLQNGKRDNESSNIDTPTKKRIRSSRIFTNEGRLNSAMYGREARMRKMLSENVGKKGQNL 1666
            LLQ                                      +++    L E   K  + L
Sbjct: 506  LLQ-------------------------------------EKSKAVSDLKEGQEKYLKAL 528

Query: 1667 KVLIGSVGSKSNKVAYVKTLLTYLSTHSNLSKS 1765
            KVLIGSVGSKSNKV YVK +L+YL+ HSNLSKS
Sbjct: 529  KVLIGSVGSKSNKVPYVKEMLSYLTQHSNLSKS 561


>ref|XP_002298139.1| glycosyl transferase family 1 family protein [Populus trichocarpa]
            gi|222845397|gb|EEE82944.1| glycosyl transferase family 1
            family protein [Populus trichocarpa]
          Length = 681

 Score =  539 bits (1389), Expect = e-150
 Identities = 309/567 (54%), Positives = 366/567 (64%), Gaps = 16/567 (2%)
 Frame = +2

Query: 113  TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRWAH 292
            TPR SP+ R L S RTPRREGR  G +    FRSNR            Y GFY QSRWAH
Sbjct: 36   TPRNSPTHRLLHSSRTPRREGRGSGGI--QWFRSNRLIYWLLLITLWTYLGFYVQSRWAH 93

Query: 293  GDNKEDLFXXXXXXXXXXXXX---MRRDLS-----VAVGTGALKLKTETSNSSLENVDVV 448
            GDNK++                   RRDL      V V  G  K++   +    + +DVV
Sbjct: 94   GDNKDEFLGFGGKSSNGLLDAEQHTRRDLLANDSLVVVNNGTNKIQVRNA----KKIDVV 149

Query: 449  LAKSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXXAQD----VESE-VDLPIEDIPKKNTT 613
            LAK  +G S                          Q     VES+ V++   D+PK N +
Sbjct: 150  LAKKGNGVSSNRRATPKKKKSKRGGRRSRAKAHDKQKATVVVESDDVEVAEPDVPKNNAS 209

Query: 614  YGFLVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMME 793
            YG LVGPFG +ED ILEWSP+KRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPL+M+E
Sbjct: 210  YGLLVGPFGPIEDRILEWSPEKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLSMLE 269

Query: 794  LATEFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCS 973
            LATEFLSCGAT+S +VL+KKGGLM EL RR+IKVL D++DLSFKTAMKA+L+IAGSAVC+
Sbjct: 270  LATEFLSCGATVSAVVLSKKGGLMPELARRRIKVLEDRADLSFKTAMKADLVIAGSAVCT 329

Query: 974  SWIEQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEE 1153
            SWI+QY++R   G SQ++WWIMENRREYFDRSK +LNRVK L+FLSESQ KQW  WCEEE
Sbjct: 330  SWIDQYIARFPAGGSQVVWWIMENRREYFDRSKIILNRVKMLVFLSESQMKQWQTWCEEE 389

Query: 1154 NIHLKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDD 1333
            NI L+  PA+V LSVNDELAF AGI+CSLNTP+ ++E MLEKRQ LR+ VR+EMGL D+D
Sbjct: 390  NIRLRSPPAVVQLSVNDELAFVAGIACSLNTPTSSSEKMLEKRQLLRESVRKEMGLTDND 449

Query: 1334 MLAVSLSSINPGKGQLLLMESARLVIE--QGQKLNNSGSK-DSILLDHDYYSRALLQNGK 1504
            ML +SLSSIN GKGQLLL+ESA LVIE     K+ NS  K +   L   ++ RAL     
Sbjct: 450  MLVMSLSSINAGKGQLLLLESANLVIEPDPSPKITNSVDKGNQSTLAAKHHLRAL----- 504

Query: 1505 RDNESSNIDTPTKKRIRSSRIFTNEGRLNSAMYGREARMRKMLSENVGKKGQNLKVLIGS 1684
                                                 R RK+L+++ G   Q LKVLIGS
Sbjct: 505  -----------------------------------SHRKRKLLADSEGTHEQALKVLIGS 529

Query: 1685 VGSKSNKVAYVKTLLTYLSTHSNLSKS 1765
            VGSKSNKV YVK +L ++S HSNLSKS
Sbjct: 530  VGSKSNKVPYVKEILRFISQHSNLSKS 556


>ref|XP_007150675.1| hypothetical protein PHAVU_005G172300g [Phaseolus vulgaris]
            gi|593700475|ref|XP_007150676.1| hypothetical protein
            PHAVU_005G172300g [Phaseolus vulgaris]
            gi|561023939|gb|ESW22669.1| hypothetical protein
            PHAVU_005G172300g [Phaseolus vulgaris]
            gi|561023940|gb|ESW22670.1| hypothetical protein
            PHAVU_005G172300g [Phaseolus vulgaris]
          Length = 701

 Score =  524 bits (1350), Expect = e-146
 Identities = 301/567 (53%), Positives = 368/567 (64%), Gaps = 16/567 (2%)
 Frame = +2

Query: 113  TPRGSPSFRRLSSGRTPRREGRSG-GFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRWA 289
            TPR SPSFRR +SGRTPR+EGRSG G  L   FRSNR           AY GF+ QSRWA
Sbjct: 35   TPRNSPSFRRQNSGRTPRKEGRSGIGGAL--WFRSNRLLFWLLLITLWAYLGFFVQSRWA 92

Query: 290  HGDNKEDLF---XXXXXXXXXXXXXMRRDLSVAVGTGALKLKTETSNS-SLENVDVVLAK 457
            H D KE+                   RRDL  +  + +   +T+ + + S + ++VVLAK
Sbjct: 93   HSDKKEEFSGFGTGPRNTGSDAEQVQRRDLLASDHSLSANNETDANIALSSKTINVVLAK 152

Query: 458  SRSGDSLXXXXXXXXXXXXXXXXXXXXXXXXAQDVESEVDLPIE----DIPKKNTTYGFL 625
             R  D                           +      D  IE    +IP  N TYG L
Sbjct: 153  -RGNDVPSHRKTSSKKRSRRRRASKGKSSGKLKPSTDVKDADIEEQKPEIPTANGTYGLL 211

Query: 626  VGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATE 805
            VGPFG VED ILEWSP+KRSGTC+RKG FARLVWSR+F+L+FHELSMTGAPL+MMELATE
Sbjct: 212  VGPFGPVEDRILEWSPEKRSGTCNRKGDFARLVWSRRFILVFHELSMTGAPLSMMELATE 271

Query: 806  FLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIE 985
             LSCGAT+S +VL+KKGGLMSEL RR+IKVL DK+DLSFKTAMKA+L+IAGSAVC+SWI+
Sbjct: 272  LLSCGATVSAVVLSKKGGLMSELARRRIKVLEDKADLSFKTAMKADLVIAGSAVCASWID 331

Query: 986  QYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHL 1165
            QY+ R   G+SQ++WWIMENRREYFD SK  L+RVK L+FLSESQSKQWL WCEEE+I L
Sbjct: 332  QYIERFPAGASQVVWWIMENRREYFDLSKDALDRVKMLVFLSESQSKQWLKWCEEESIKL 391

Query: 1166 KDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLAV 1345
            +  P ++PLSVNDELAF AGI  +LNTPSF+T+ M+EKRQ LR+ VR+E+GLND DML +
Sbjct: 392  RSYPEIIPLSVNDELAFVAGIPSTLNTPSFSTDKMVEKRQLLRESVRKEIGLNDSDMLVI 451

Query: 1346 SLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSILLDHDYYSRALLQNGKRDNESSN 1525
            SLSSINPGKGQLLL+ES   V+EQG                       LQ+ K+  + SN
Sbjct: 452  SLSSINPGKGQLLLLESVSSVLEQG----------------------WLQDDKKMKKVSN 489

Query: 1526 I-----DTPTKKRIRSSRIFTNEGRL--NSAMYGREARMRKMLSENVGKKGQNLKVLIGS 1684
            I         K RIR        G++  N       +R +++L ++ G   ++LK+LIGS
Sbjct: 490  IKEGISTLARKHRIRKLLPVLKNGKVVSNDISSNSLSRRKQVLPDDKGTIQKSLKLLIGS 549

Query: 1685 VGSKSNKVAYVKTLLTYLSTHSNLSKS 1765
            VGSKSNK  YVK+LL +L  H N SKS
Sbjct: 550  VGSKSNKADYVKSLLNFLEQHPNTSKS 576


>gb|EXC25804.1| Putative glycosyltransferase ytcC [Morus notabilis]
          Length = 688

 Score =  524 bits (1349), Expect = e-146
 Identities = 298/564 (52%), Positives = 365/564 (64%), Gaps = 13/564 (2%)
 Frame = +2

Query: 113  TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRWAH 292
            TPR SPSFRR  S RTPRREGR     L + FRSNR           AY GF+ QSRWAH
Sbjct: 28   TPRNSPSFRRSQSSRTPRREGRGSARGLQW-FRSNRLLFWLLLITLWAYLGFFVQSRWAH 86

Query: 293  GDNKEDLFXXXXXXXXXXXXX---MRRDL-----SVAVGTGALKLKTETSNSSLENVDVV 448
             ++ +++                 +RRDL     S+AV  G  K +     S  + +DVV
Sbjct: 87   DNDNDNVMGFGKKPKNWNSETEQNLRRDLIATDISLAVKNGTGKNQV----SDGKRMDVV 142

Query: 449  LAKSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXXAQDVESEV-DLPIE----DIPKKNTT 613
            LA    G S                          Q +  EV ++ IE    DIPK N +
Sbjct: 143  LAGRNDGISSHRKLNSKKKKTKRANRSLRSKVHGKQKMTMEVKNVEIEEQEPDIPKTNAS 202

Query: 614  YGFLVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMME 793
            YG LVGPFGS+ED ILEWSP+KRSGTCDRKG FAR+VWSR+FVLIFHELSMTG+PL+MME
Sbjct: 203  YGMLVGPFGSLEDRILEWSPEKRSGTCDRKGDFARIVWSRRFVLIFHELSMTGSPLSMME 262

Query: 794  LATEFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCS 973
            LATE LSCGAT+S + L+KKGGLMSEL RR+IKVL DK+DLSFKTAMKA+L+IAGSAVC+
Sbjct: 263  LATELLSCGATVSAVALSKKGGLMSELARRRIKVLEDKADLSFKTAMKADLVIAGSAVCA 322

Query: 974  SWIEQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEE 1153
            SWI+Q++     G+SQ+ WWIMENRREYFDR+K VLNRVK L+F+SE Q KQWL W EEE
Sbjct: 323  SWIDQFIEHFPAGASQVAWWIMENRREYFDRAKVVLNRVKMLVFISELQWKQWLAWAEEE 382

Query: 1154 NIHLKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDD 1333
             I+L+ +P LVPLS+NDE+AF AGI+C+LNTPSFTTE M+EKRQ LR   R+EMGL D+D
Sbjct: 383  KIYLRSQPVLVPLSINDEMAFVAGIACTLNTPSFTTEKMIEKRQLLRDSARKEMGLKDND 442

Query: 1334 MLAVSLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSILLDHDYYSRALLQNGKRDN 1513
            ML +SLSSINPGKGQ LL+ S RL+IE+      S  K+ + + H               
Sbjct: 443  MLVMSLSSINPGKGQHLLLGSGRLMIEKEAFEEKSNIKNPVDIKHH-------------- 488

Query: 1514 ESSNIDTPTKKRIRSSRIFTNEGRLNSAMYGREARMRKMLSENVGKKGQNLKVLIGSVGS 1693
                      K  R  R+ T   +LN +M       ++ML ++ G + +++K+LIGSVGS
Sbjct: 489  --------QSKSTRKHRLKTVFQKLNGSMAFGGTHRKEML-DSGGMRERSVKILIGSVGS 539

Query: 1694 KSNKVAYVKTLLTYLSTHSNLSKS 1765
            KSNKV YVK LL YLS H N SKS
Sbjct: 540  KSNKVVYVKELLNYLSQHPNTSKS 563


>ref|XP_004486717.1| PREDICTED: uncharacterized protein LOC101501726 [Cicer arietinum]
          Length = 709

 Score =  520 bits (1339), Expect = e-145
 Identities = 294/570 (51%), Positives = 367/570 (64%), Gaps = 19/570 (3%)
 Frame = +2

Query: 113  TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRWAH 292
            TPR SP+FRRL++ RTPR++GRS G   S  FRSNR           AY GF+ QSRWAH
Sbjct: 38   TPRNSPTFRRLNTSRTPRKDGRSVGS--SLWFRSNRVLLWLLLITLWAYLGFFVQSRWAH 95

Query: 293  GDNKEDLFXXXXXXXXXXXXX----MRRDLSVAVGTGALKLKTETSNSSL-ENVDVVLAK 457
             D KE+                   +RRDL  +  + ++  +T  +   +   ++V LA 
Sbjct: 96   SDKKEEFSGFGTGPRNTGSNDDSTSLRRDLIASEDSLSVNNETVINKGGVGRTINVALAM 155

Query: 458  SRSGDSLXXXXXXXXXXXXXXXXXXXXXXXXAQDVESEVDLPIEDI-------PKKNTTY 616
              + D                              + +V++   DI       P+ N+TY
Sbjct: 156  KGNDDDDDDVPSRRKASSKKKKSKRSSRGKARGKNKPKVEIKNNDIEEQEPEIPETNSTY 215

Query: 617  GFLVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMEL 796
            G LVGPFGS ED ILEWSP KRSGTC+RKG FARLVWSR+F+LIFHELSMTGAPL+MMEL
Sbjct: 216  GLLVGPFGSTEDRILEWSPQKRSGTCNRKGDFARLVWSRRFILIFHELSMTGAPLSMMEL 275

Query: 797  ATEFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSS 976
            ATE LSCGAT+S + L++KGGLMSEL RR+IK+L DK+DLSFKTAMKA+L+IAGSAVC+S
Sbjct: 276  ATELLSCGATVSAVALSRKGGLMSELARRRIKLLEDKADLSFKTAMKADLVIAGSAVCAS 335

Query: 977  WIEQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEEN 1156
            WIEQY+     G+SQ+ WWIMENRREYF+R+K VL+RVK L+FLSESQSKQW  WCEEEN
Sbjct: 336  WIEQYIEHFPAGASQVAWWIMENRREYFNRTKGVLDRVKMLVFLSESQSKQWQKWCEEEN 395

Query: 1157 IHLKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDM 1336
            I L+  P ++PLSVNDELAF AGI  +LNTPSF T+ M+EK+Q LR+ VR+EMGL D DM
Sbjct: 396  IKLRSRPEIIPLSVNDELAFVAGIPSTLNTPSFDTDKMIEKKQLLRESVRKEMGLTDHDM 455

Query: 1337 LAVSLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSILLDHDYYSRALLQNGKRDNE 1516
            L +SLSSINPGKGQLLL+ESA  V+E GQ                      LQ+ K+  +
Sbjct: 456  LVISLSSINPGKGQLLLLESAISVVEHGQ----------------------LQDDKKMKK 493

Query: 1517 SSNI----DTPTKK-RIRSSRIFTNEGR--LNSAMYGREARMRKMLSENVGKKGQNLKVL 1675
            SSNI     T T+K RIR       +G+  L        +R +++L  N     Q+LKVL
Sbjct: 494  SSNIKEGLSTLTRKQRIRKLLPMLKDGKVALKDISINSLSRRKQVLPNNKTTTQQSLKVL 553

Query: 1676 IGSVGSKSNKVAYVKTLLTYLSTHSNLSKS 1765
            IGSVGSKSNK  YVK+LL++L+ H N SK+
Sbjct: 554  IGSVGSKSNKADYVKSLLSFLAQHPNTSKT 583


>ref|XP_006597141.1| PREDICTED: uncharacterized protein LOC100793827 isoform X1 [Glycine
            max] gi|571514725|ref|XP_006597142.1| PREDICTED:
            uncharacterized protein LOC100793827 isoform X2 [Glycine
            max]
          Length = 701

 Score =  519 bits (1336), Expect = e-144
 Identities = 296/566 (52%), Positives = 364/566 (64%), Gaps = 15/566 (2%)
 Frame = +2

Query: 113  TPRGSPSFRRLSSGRTPRREGRS--GGFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRW 286
            TPR SPSFRRL+SGRTPR+EGRS  GG   +  FRSNR           AY GF+ QSRW
Sbjct: 35   TPRNSPSFRRLNSGRTPRKEGRSSVGG---ALWFRSNRLLLWLLLITLWAYLGFFVQSRW 91

Query: 287  AHGDNKEDLFXXXXXXXXXXXXX---MRRDLSVAVGTGALKLKTETSNSSL-ENVDVVLA 454
            AH D KE+                   RRDL  +  + +    T+   + + + ++V LA
Sbjct: 92   AHSDKKEEFSGYGTGPRNTNSDAEQIQRRDLLASNKSLSANNDTDADIAGISKTINVALA 151

Query: 455  KS-------RSGDSLXXXXXXXXXXXXXXXXXXXXXXXXAQDVESEVDLPIEDIPKKNTT 613
            K+       R   S                           D+E +      +IP  N+T
Sbjct: 152  KNDNDVPSHRKTSSKNRSKGRRSSKGKSRGKLKPTTEIKNTDIEEQEP----EIPTTNST 207

Query: 614  YGFLVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMME 793
            YG LVGPFG +ED ILEWSP+KRSGTC+RK  FARLVWSR+F+LIFHELSMTGAPL+MME
Sbjct: 208  YGLLVGPFGPMEDRILEWSPEKRSGTCNRKEDFARLVWSRRFILIFHELSMTGAPLSMME 267

Query: 794  LATEFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCS 973
            LATE LSCGAT+S +VL++KGGLMSEL RR+IKVL DK+DLSFKTAMKA+L+IAGSAVC+
Sbjct: 268  LATELLSCGATVSAVVLSRKGGLMSELARRRIKVLEDKADLSFKTAMKADLVIAGSAVCA 327

Query: 974  SWIEQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEE 1153
            SWIEQY+     G+SQ+ WWIMENRREYFDRSK VL+RVK L+FLSESQSKQW  WCEEE
Sbjct: 328  SWIEQYIEHFPAGASQVAWWIMENRREYFDRSKDVLHRVKMLVFLSESQSKQWQKWCEEE 387

Query: 1154 NIHLKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDD 1333
            +I L+  P +VPLSVNDELAF AGI  +LNTPSF+TE M+EK+Q LR+ VR+EMGL D+D
Sbjct: 388  SIKLRSHPEIVPLSVNDELAFVAGIPSTLNTPSFSTEKMVEKKQLLRESVRKEMGLTDND 447

Query: 1334 MLAVSLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSILLDHDYYSRALLQNGKRDN 1513
            ML +SLSSINPGKGQLLL+ES   V+EQGQ   +   K+   +     S A         
Sbjct: 448  MLVISLSSINPGKGQLLLLESVSSVLEQGQSPGDKKMKEVSNIKEGLSSLA--------- 498

Query: 1514 ESSNIDTPTKKRIRSSRIFTNEGRL--NSAMYGREARMRKMLSENVGKKGQNLKVLIGSV 1687
                     K RIR      + G++  NS      +R +++L  + G   Q+LK+LIGSV
Sbjct: 499  --------RKHRIRKLLPLMSNGKVASNSISSNSLSRRKQVLPNDKGTIQQSLKLLIGSV 550

Query: 1688 GSKSNKVAYVKTLLTYLSTHSNLSKS 1765
             SKSNK  YVK+LL++L  H N S S
Sbjct: 551  RSKSNKADYVKSLLSFLEQHPNTSTS 576


>ref|XP_004149847.1| PREDICTED: uncharacterized protein LOC101207532 [Cucumis sativus]
            gi|449496350|ref|XP_004160111.1| PREDICTED:
            uncharacterized protein LOC101223486 [Cucumis sativus]
          Length = 682

 Score =  518 bits (1333), Expect = e-144
 Identities = 290/558 (51%), Positives = 367/558 (65%), Gaps = 7/558 (1%)
 Frame = +2

Query: 113  TPRGSPSFRRLSSGRTPRREGRSGGFVLSYCFRSNRXXXXXXXXXXXAYAGFYFQSRWAH 292
            TPRGSPSFRRL S RTPRRE RS GF L +  R+N+           AY GFY QSRWAH
Sbjct: 34   TPRGSPSFRRLHSSRTPRREARSTGFSLHW-IRNNKVLFWLLLITLWAYLGFYVQSRWAH 92

Query: 293  GDNKEDLFXXXXXXXXXXXXXMRRDLSVAVGTGALKLKTETSNSSLEN-------VDVVL 451
            G+NK++ F               ++ S+++ +   +L  E  N S EN       V+VVL
Sbjct: 93   GENKDE-FLGFGGQQSNQKLDSEQNQSLSLISTNNRLVVE--NRSGENDRSDGGVVNVVL 149

Query: 452  AKSRSGDSLXXXXXXXXXXXXXXXXXXXXXXXXAQDVESEVDLPIEDIPKKNTTYGFLVG 631
            AK  +G S                         A+    +++    +IP KN++YG LVG
Sbjct: 150  AKKANGVSASKKTKPRKRSKRSKRDKVHKGKIPAEVTNHDIEEQEPEIPLKNSSYGMLVG 209

Query: 632  PFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATEFL 811
            PFGS ED ILEWSP+KRSGTCDRKG FARLVWSR+FVLIFHELSMTGAP++MMELATE L
Sbjct: 210  PFGSTEDRILEWSPEKRSGTCDRKGDFARLVWSRRFVLIFHELSMTGAPISMMELATELL 269

Query: 812  SCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIEQY 991
            SCGA++S + L+KKGGLMSEL+RR+IKVL DK+DLSFKTAMKA+L+IAGSAVC+SWI+ Y
Sbjct: 270  SCGASVSAVALSKKGGLMSELSRRRIKVLDDKADLSFKTAMKADLVIAGSAVCASWIDGY 329

Query: 992  LSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHLKD 1171
            +     G+SQ+ WWIMENRREYF+RSK VL+RVK LIF+SE QSKQWL+W +EENI L+ 
Sbjct: 330  IEHFPAGASQVAWWIMENRREYFNRSKVVLDRVKMLIFISELQSKQWLNWSQEENIKLRS 389

Query: 1172 EPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLAVSL 1351
            +PA+VPLSVNDELAF AGISCSLNT S + E MLEK+Q LR   R+EMG+ D+D++ ++L
Sbjct: 390  QPAIVPLSVNDELAFVAGISCSLNTESSSPEKMLEKKQLLRNTTRKEMGVGDNDVVVMTL 449

Query: 1352 SSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSILLDHDYYSRALLQNGKRDNESSNID 1531
            SSINPGKG  LL+ES+ L+I++G K                         + D +  N D
Sbjct: 450  SSINPGKGHFLLLESSNLLIDRGLK-------------------------RDDPKIRNPD 484

Query: 1532 TPTKKRIRSSRIFTNEGRLNSAMYGREARMRKMLSENVGKKGQNLKVLIGSVGSKSNKVA 1711
              +  R + +R      R   A+  +    R++L++       + K+LIGSVGSKSNKV 
Sbjct: 485  DSSPSRPKLAR-----RRYMRALLQKLNDRRRLLADGGELPETSFKLLIGSVGSKSNKVV 539

Query: 1712 YVKTLLTYLSTHSNLSKS 1765
            YVK LL +LS HSNLS+S
Sbjct: 540  YVKRLLRFLSQHSNLSQS 557

