BLASTX nr result

ID: Mentha27_contig00015816 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00015816
         (1974 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU21559.1| hypothetical protein MIMGU_mgv1a002407mg [Mimulus...   793   0.0  
ref|XP_006343109.1| PREDICTED: uncharacterized protein LOC102601...   720   0.0  
ref|XP_004235700.1| PREDICTED: uncharacterized protein LOC101247...   711   0.0  
emb|CAN71826.1| hypothetical protein VITISV_013841 [Vitis vinifera]   709   0.0  
ref|XP_002284822.1| PREDICTED: uncharacterized protein LOC100246...   708   0.0  
ref|XP_007217014.1| hypothetical protein PRUPE_ppa002059mg [Prun...   706   0.0  
ref|XP_002528176.1| glycosyltransferase, putative [Ricinus commu...   699   0.0  
ref|XP_004302927.1| PREDICTED: uncharacterized protein LOC101300...   699   0.0  
ref|XP_006465456.1| PREDICTED: uncharacterized protein LOC102612...   695   0.0  
ref|XP_006427083.1| hypothetical protein CICLE_v10024994mg [Citr...   693   0.0  
ref|XP_002298139.1| glycosyl transferase family 1 family protein...   693   0.0  
emb|CBI36173.3| unnamed protein product [Vitis vinifera]              688   0.0  
ref|XP_007024055.1| UDP-Glycosyltransferase superfamily protein ...   684   0.0  
ref|XP_007024056.1| UDP-Glycosyltransferase superfamily protein ...   679   0.0  
gb|EXC25804.1| Putative glycosyltransferase ytcC [Morus notabilis]    674   0.0  
ref|XP_004149847.1| PREDICTED: uncharacterized protein LOC101207...   664   0.0  
ref|XP_007150675.1| hypothetical protein PHAVU_005G172300g [Phas...   663   0.0  
ref|XP_004486717.1| PREDICTED: uncharacterized protein LOC101501...   658   0.0  
ref|XP_006597141.1| PREDICTED: uncharacterized protein LOC100793...   654   0.0  
ref|XP_003542107.1| PREDICTED: uncharacterized protein LOC100795...   653   0.0  

>gb|EYU21559.1| hypothetical protein MIMGU_mgv1a002407mg [Mimulus guttatus]
          Length = 678

 Score =  793 bits (2047), Expect = 0.0
 Identities = 431/619 (69%), Positives = 475/619 (76%), Gaps = 15/619 (2%)
 Frame = -1

Query: 1974 GGESKGGK------SMRRDLSAAVGTGALKLKNETSNSSLEN--VDVVLAKSRSGDSLXX 1819
            GGES G K      + RRDL A V + A++LKN+T+  SL    +DVVLAK+ + D    
Sbjct: 106  GGESGGDKFEPQIKNRRRDLIAKVDSAAVELKNDTNELSLNKSVMDVVLAKNTTLDKNKP 165

Query: 1818 XXXXXXXXXXXXXXXXXXXXXVAQD-VESEVDLPIEDI-PKKNTTYGFLVGPFGSVEDSI 1645
                                 +A++ VESEVD+  E+I PKKNTTYGFLVGPFGSVEDSI
Sbjct: 166  SKRRSKRSLRRKKPVSSKPKAMAEEEVESEVDMQTEEIIPKKNTTYGFLVGPFGSVEDSI 225

Query: 1644 LEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATEFLSCGATISVI 1465
            LEWS +KRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAM+ELATEFLSCGATISVI
Sbjct: 226  LEWSAEKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMLELATEFLSCGATISVI 285

Query: 1464 VLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIEQYLSRTVLGSS 1285
            VLNK+GGLMSEL+RRKIKVL DK+DLSFKTAMKA++IIAGSAVCSSWIEQYLSRTVLGSS
Sbjct: 286  VLNKRGGLMSELSRRKIKVLEDKTDLSFKTAMKADIIIAGSAVCSSWIEQYLSRTVLGSS 345

Query: 1284 QIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHLKDEPALVPLSV 1105
            QIMWWIMENRREYFDRSK VLNRVKKLIFLS+SQSKQWL WCEEE I LK EPALVPLSV
Sbjct: 346  QIMWWIMENRREYFDRSKLVLNRVKKLIFLSKSQSKQWLSWCEEEKIQLKSEPALVPLSV 405

Query: 1104 NDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVVSLSSINPGKGQ 925
            NDELAF AGI CSLNTPSF+TE M+EKR  LR  VREEMGL++DDML VSLSSINPGKGQ
Sbjct: 406  NDELAFVAGIPCSLNTPSFSTEKMMEKRGLLRSAVREEMGLSEDDMLAVSLSSINPGKGQ 465

Query: 924  LLLMESARLVIEQGQ----KLNNSGSKDSVLLDHDYYS-RALLQNGKRDNESSNIDTPTK 760
            LLL+E+ R +IEQ +     L  S   DS++ D D    R LL  G              
Sbjct: 466  LLLLEAGRFLIEQPRTDQTNLRLSSEFDSMVFDGDSSGLRKLLSEG-------------- 511

Query: 759  KRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGSVGSKSNKVAYVKT 580
                                            N+GKKG NLK+L+GSVGSKSNKV YVKT
Sbjct: 512  --------------------------------NIGKKGGNLKILVGSVGSKSNKVPYVKT 539

Query: 579  LLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGRVTIEAMAFGLPVL 400
            LL +LS HSNLSK V+WTP+TTRVASLYAAADVYVMNSQGIGETFGRVTIEAMAFGLPVL
Sbjct: 540  LLNFLSMHSNLSKVVIWTPSTTRVASLYAAADVYVMNSQGIGETFGRVTIEAMAFGLPVL 599

Query: 399  GTDSGGTREIVEHNITGLLHPLGRPGSQVLARNFEFFLENPQARHEMGMRGREKVEKMYL 220
            GTDSGGTREIVEHNITGLLHPLGR G+++LA N +F LENP AR EMG++GREKVEKMYL
Sbjct: 600  GTDSGGTREIVEHNITGLLHPLGRAGARILANNLQFLLENPNARQEMGLKGREKVEKMYL 659

Query: 219  KKHMFQKFGEVLYKCMRIK 163
            KKHMFQKFGEVLYKCMRIK
Sbjct: 660  KKHMFQKFGEVLYKCMRIK 678


>ref|XP_006343109.1| PREDICTED: uncharacterized protein LOC102601346 [Solanum tuberosum]
          Length = 711

 Score =  720 bits (1858), Expect = 0.0
 Identities = 382/612 (62%), Positives = 461/612 (75%), Gaps = 9/612 (1%)
 Frame = -1

Query: 1971 GESKGGKSMRRDLSAAVGTGALKLKNETSNSSLENVDVVLAKSRSG---DSLXXXXXXXX 1801
            G S+  +  +R L A   + A+K  +  +  +  ++DVVLAK  +    D +        
Sbjct: 106  GTSQPEEKNQRILVANEESLAVKPPSNKTQGNSMDLDVVLAKQGNSVVSDKVSSSKKKSK 165

Query: 1800 XXXXXXXXXXXXXXXVAQDVESE-VDLPIEDIPKKNTTYGFLVGPFGSVEDSILEWSPDK 1624
                           V  +V+++ +++  E+IPK+NTTYG LVGPFGS+ED ILEWSP+K
Sbjct: 166  KSTRASRRKTHGKKKVVAEVKTDDIEVQEEEIPKRNTTYGLLVGPFGSIEDKILEWSPEK 225

Query: 1623 RSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATEFLSCGATISVIVLNKKGG 1444
            RSGTCDRK  FARLVWSRKFVLI HELSMTGAPLAM+ELATE LSCGAT+ V+ L+K+GG
Sbjct: 226  RSGTCDRKSQFARLVWSRKFVLILHELSMTGAPLAMLELATELLSCGATVYVVPLSKRGG 285

Query: 1443 LMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIEQYLSRTVLGSSQIMWWIM 1264
            LMSEL+RRKIKVL DKSDLSFKTAMKA+LIIAGSAVC+SWIEQY +RTVLGSSQI WWIM
Sbjct: 286  LMSELSRRKIKVLEDKSDLSFKTAMKADLIIAGSAVCASWIEQYAARTVLGSSQITWWIM 345

Query: 1263 ENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHLKDEPALVPLSVNDELAFA 1084
            ENRREYFDR+K   NRVKKLIFLSESQSK+WL WCEEE+I LK +PALVPLS++DELAF 
Sbjct: 346  ENRREYFDRAKLAFNRVKKLIFLSESQSKRWLAWCEEEHIKLKTQPALVPLSISDELAFV 405

Query: 1083 AGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVVSLSSINPGKGQLLLMESA 904
            AGI CSL+TP F+ E MLEKRQ LR  VR+EMGL D+DMLV+SLSSINPGKGQ LL+E+ 
Sbjct: 406  AGIPCSLSTPLFSPEKMLEKRQLLRDFVRKEMGLTDNDMLVMSLSSINPGKGQFLLLETT 465

Query: 903  RLVIEQGQKLNNSGSKDSVLLDHDYYSRALLQN----GKRDNESSNI-DTPTKKRIRSSR 739
            RL+IE    LN S  K       +Y  R LL N    G+   ESS + + P  + ++  +
Sbjct: 466  RLLIEGAPPLNGSAVK-----RREYQKRTLLYNWKQFGEWKKESSTLSNNPQTETLQVPQ 520

Query: 738  IFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGSVGSKSNKVAYVKTLLTYLST 559
            +F  +G   +A    D   RK+ S   GK+G+ LKVLIGSVGSKSNKV YVK LL +L+ 
Sbjct: 521  LFI-KGVNYTAGIENDRGTRKLFSLTEGKQGEKLKVLIGSVGSKSNKVPYVKALLNFLNQ 579

Query: 558  HSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGRVTIEAMAFGLPVLGTDSGGT 379
            HSNLS +VLWTP+TTRVA+LYAAAD YVMNSQG+GETFGRVTIEAMAFGLPVLGTD+GGT
Sbjct: 580  HSNLSNTVLWTPSTTRVAALYAAADAYVMNSQGLGETFGRVTIEAMAFGLPVLGTDAGGT 639

Query: 378  REIVEHNITGLLHPLGRPGSQVLARNFEFFLENPQARHEMGMRGREKVEKMYLKKHMFQK 199
            +EIVEHN+TGLLH LGRPG+Q+LA N ++ L NP  R  +G  GR+KV+ MYLKKHM+++
Sbjct: 640  KEIVEHNVTGLLHTLGRPGTQILANNLQYLLNNPSERQRLGSNGRKKVKDMYLKKHMYKR 699

Query: 198  FGEVLYKCMRIK 163
            FGEVLY CMRIK
Sbjct: 700  FGEVLYDCMRIK 711


>ref|XP_004235700.1| PREDICTED: uncharacterized protein LOC101247116 [Solanum
            lycopersicum]
          Length = 711

 Score =  711 bits (1836), Expect = 0.0
 Identities = 362/534 (67%), Positives = 425/534 (79%), Gaps = 4/534 (0%)
 Frame = -1

Query: 1752 AQDVESEVDLPIEDIPKKNTTYGFLVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWS 1573
            A+    ++++  E+IPK+NTTYG LVGPFGS+ED ILEWSP+KR+GTCDRK  FARLVWS
Sbjct: 183  AEVKSDDIEIQEEEIPKRNTTYGLLVGPFGSIEDKILEWSPEKRTGTCDRKSQFARLVWS 242

Query: 1572 RKFVLIFHELSMTGAPLAMMELATEFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKS 1393
            RKFVLI HELSMTGAPLAM+ELATE LSCGAT+ V+ L+K+GGLMSEL+RRKIKVL DKS
Sbjct: 243  RKFVLILHELSMTGAPLAMLELATELLSCGATVYVVPLSKRGGLMSELSRRKIKVLEDKS 302

Query: 1392 DLSFKTAMKANLIIAGSAVCSSWIEQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRV 1213
            DLSFKTAMKA+LIIAGSAVC+SWIEQY +RTVLGS+QI WWIMENRREYFDR+K   NRV
Sbjct: 303  DLSFKTAMKADLIIAGSAVCASWIEQYAARTVLGSTQITWWIMENRREYFDRAKLAFNRV 362

Query: 1212 KKLIFLSESQSKQWLDWCEEENIHLKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENM 1033
            KKLIFLSESQSK+WL WCEEE+I LK +PAL+PLS++DELAF AGI CSL+TP F+ E M
Sbjct: 363  KKLIFLSESQSKRWLAWCEEEHIKLKTQPALIPLSISDELAFVAGIPCSLSTPLFSPEKM 422

Query: 1032 LEKRQSLRKVVREEMGLNDDDMLVVSLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKD 853
            LEKRQ LR  VR+EMGL D+DMLV+SLSSINPGKGQ LL+E+ RL+IE    L  S  K 
Sbjct: 423  LEKRQLLRDFVRKEMGLTDNDMLVMSLSSINPGKGQFLLLETTRLLIEGAPPLYGSAVK- 481

Query: 852  SVLLDHDYYSRALLQN----GKRDNESSNIDTPTKKRIRSSRIFTNEGRLNSARYGRDAR 685
                  +Y  R LL N    G+   ESS +    +           +G   +A    D  
Sbjct: 482  ----RREYQKRTLLYNWKQFGEWKKESSTLSNNQETEALQVPQLFIKGVNYTAGIENDRG 537

Query: 684  MRKMLSENVGKKGQNLKVLIGSVGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVA 505
             RK+ S   GK+G+ LKVLIGSVGSKSNKV YVK LL +L+ HSNLS +VLWTP+TTRVA
Sbjct: 538  TRKLFSLPEGKQGEKLKVLIGSVGSKSNKVPYVKALLNFLNQHSNLSNTVLWTPSTTRVA 597

Query: 504  SLYAAADVYVMNSQGIGETFGRVTIEAMAFGLPVLGTDSGGTREIVEHNITGLLHPLGRP 325
            +LYAAAD YVMNSQG+GETFGRVTIEAMAFGLPVLGTD+GGT+EIVEHN+TGLLH LGRP
Sbjct: 598  ALYAAADAYVMNSQGLGETFGRVTIEAMAFGLPVLGTDAGGTKEIVEHNVTGLLHSLGRP 657

Query: 324  GSQVLARNFEFFLENPQARHEMGMRGREKVEKMYLKKHMFQKFGEVLYKCMRIK 163
            G+QVLA+N ++ L NP  R  +G  GR+KV+ MYLKKHM+++FGEVLY CMRIK
Sbjct: 658  GTQVLAQNLQYLLNNPSERQRLGSNGRKKVKDMYLKKHMYRRFGEVLYDCMRIK 711


>emb|CAN71826.1| hypothetical protein VITISV_013841 [Vitis vinifera]
          Length = 734

 Score =  709 bits (1831), Expect = 0.0
 Identities = 381/633 (60%), Positives = 457/633 (72%), Gaps = 29/633 (4%)
 Frame = -1

Query: 1974 GGESKGGKS---MRRDLSAAVGTGALKLKNETSNSSL---ENVDVVLAKSRSGDSLXXXX 1813
            GG+   G S   + R          L +KN +  + +   + VDVVLAK   G+S+    
Sbjct: 104  GGKPNNGISDSELNRKAPLIANDKLLAVKNGSDKNPVGSGKKVDVVLAKK--GNSVPSRR 161

Query: 1812 XXXXXXXXXXXXXXXXXXXVAQDVESEVDLPIED-----IPKKNTTYGFLVGPFGSVEDS 1648
                                 Q  ++EV++   D     IPK NT+YG LVGPFGS ED 
Sbjct: 162  SASSKKRSKKSERSLRGKTRKQKTKTEVEVTEMDEQEQEIPKLNTSYGLLVGPFGSTEDR 221

Query: 1647 ILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATEFLSCGATISV 1468
            ILEWSP+KRSGTCDR+G  ARLVWSRKFVLIFHELSMTGAPL+MMELATE LSCGAT+S 
Sbjct: 222  ILEWSPEKRSGTCDRRGELARLVWSRKFVLIFHELSMTGAPLSMMELATELLSCGATVSA 281

Query: 1467 IVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIEQYLSRTVLGS 1288
            +VL+KKGGLM EL RR+IKVL D++DLSFKTAMKA+L+IAGSAVC+SWIEQY++    GS
Sbjct: 282  VVLSKKGGLMPELARRRIKVLEDRADLSFKTAMKADLVIAGSAVCASWIEQYIAHFTAGS 341

Query: 1287 SQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHLKDEPALVPLS 1108
            SQI+WWIMENRREYFDRSK V+NRVK LIFLSESQSKQWL WC+EENI L  +PA+VPLS
Sbjct: 342  SQIVWWIMENRREYFDRSKLVINRVKMLIFLSESQSKQWLTWCKEENIRLISQPAVVPLS 401

Query: 1107 VNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVVSLSSINPGKG 928
            VNDELAF AGI+CSLNTPSFTTE M EKR+ LR  +R+EMGL D DML++SLSSINPGKG
Sbjct: 402  VNDELAFVAGITCSLNTPSFTTEKMQEKRRLLRDSIRKEMGLTDTDMLLLSLSSINPGKG 461

Query: 927  QLLLMESARLVIEQGQKLNNSGSKDSVLLDHD-------YYSRALLQNGKRDNESSN--- 778
            Q  L+ES R +IEQ    ++   KD   +  D       +YSRALLQN    + SS+   
Sbjct: 462  QFFLLESVRSMIEQEPSQDDPELKDLAKIGQDQSNFSGKHYSRALLQNVNHFSVSSSGLR 521

Query: 777  --------IDTPTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIG 622
                    ++ P  K +    +F +    ++   G   + RK+LSEN G + Q LKVLIG
Sbjct: 522  LSNESFIELNGPKSKNLMLPSLFPSISPSDAVSIGSGYKRRKVLSENEGTQEQALKVLIG 581

Query: 621  SVGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFG 442
            SVGSKSNKV YVK LL +L  HSNLSKSVLWTPATTRVASLY+AADVYV+NSQG+GETFG
Sbjct: 582  SVGSKSNKVPYVKGLLRFLXRHSNLSKSVLWTPATTRVASLYSAADVYVINSQGMGETFG 641

Query: 441  RVTIEAMAFGLPVLGTDSGGTREIVEHNITGLLHPLGRPGSQVLARNFEFFLENPQARHE 262
            RV+IEAMAFGL VLGTD+GGT EIVE N+TGLLHP+G  G+Q+L+ N  F L+NP AR +
Sbjct: 642  RVSIEAMAFGLTVLGTDAGGTXEIVEQNVTGLLHPVGHLGTQILSENIRFLLKNPSAREQ 701

Query: 261  MGMRGREKVEKMYLKKHMFQKFGEVLYKCMRIK 163
            MG RGR+KVE+MYLK+HM+++  EVLYKCMRIK
Sbjct: 702  MGKRGRKKVERMYLKRHMYKRLAEVLYKCMRIK 734


>ref|XP_002284822.1| PREDICTED: uncharacterized protein LOC100246448 [Vitis vinifera]
          Length = 691

 Score =  708 bits (1827), Expect = 0.0
 Identities = 379/622 (60%), Positives = 452/622 (72%), Gaps = 18/622 (2%)
 Frame = -1

Query: 1974 GGESKGGKS---MRRDLSAAVGTGALKLKNETSNSSL---ENVDVVLAKSRSGDSLXXXX 1813
            GG+   G S   + R          L +KN +  + +   + VDVVLAK   G+S+    
Sbjct: 93   GGKPNNGISDSELNRKAPLIANDKLLAVKNGSDKNPVGSGKKVDVVLAKK--GNSVPSRR 150

Query: 1812 XXXXXXXXXXXXXXXXXXXVAQDVESEVDLPIED-----IPKKNTTYGFLVGPFGSVEDS 1648
                                 Q  ++EV++   D     IPK NT+YG LVGPFGS ED 
Sbjct: 151  SASSKKRSKKSERSLRGKTRKQKTKTEVEVTEMDEQEQEIPKLNTSYGLLVGPFGSTEDR 210

Query: 1647 ILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATEFLSCGATISV 1468
            ILEWSP+KRSGTCDR+G  ARLVWSRKFVLIFHELSMTGAPL+MMELATE LSCGAT+S 
Sbjct: 211  ILEWSPEKRSGTCDRRGELARLVWSRKFVLIFHELSMTGAPLSMMELATELLSCGATVSA 270

Query: 1467 IVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIEQYLSRTVLGS 1288
            +VL+KKGGLM EL RR+IKVL D++DLSFKTAMKA+L+IAGSAVC+SWIEQY++    GS
Sbjct: 271  VVLSKKGGLMPELARRRIKVLEDRADLSFKTAMKADLVIAGSAVCASWIEQYIAHFTAGS 330

Query: 1287 SQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHLKDEPALVPLS 1108
            SQI+WWIMENRREYFDRSK V+NRVK LIFLSESQSKQWL WC+EENI L  +PA+VPLS
Sbjct: 331  SQIVWWIMENRREYFDRSKLVINRVKMLIFLSESQSKQWLTWCKEENIRLISQPAVVPLS 390

Query: 1107 VNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVVSLSSINPGKG 928
            VNDELAF AGI+CSLNTPSFTTE M EKR+ LR  +R+EMGL D DML++SLSSINPGKG
Sbjct: 391  VNDELAFVAGITCSLNTPSFTTEKMQEKRRLLRDSIRKEMGLTDTDMLLLSLSSINPGKG 450

Query: 927  QLLLMESARLVIEQGQKLNNSGSKDSVLLDHD-------YYSRALLQNGKRDNESSNIDT 769
            Q  L+ES R +IEQ    ++   KD V +  D       +YSRALLQN    + SS+   
Sbjct: 451  QFFLLESVRSMIEQEPSQDDPELKDLVKIGQDQSNFSGKHYSRALLQNVNHFSVSSS--- 507

Query: 768  PTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGSVGSKSNKVAY 589
                              +    G   + RK+LSEN G + Q LKVLIGSVGSKSNKV Y
Sbjct: 508  ------------------DEVSIGSGYKRRKVLSENEGTQEQALKVLIGSVGSKSNKVPY 549

Query: 588  VKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGRVTIEAMAFGL 409
            VK LL +L+ HSNLSKSVLWTPATTRVASLY+AADVYV+NSQG+GETFGRVTIEAMAFGL
Sbjct: 550  VKGLLRFLTRHSNLSKSVLWTPATTRVASLYSAADVYVINSQGMGETFGRVTIEAMAFGL 609

Query: 408  PVLGTDSGGTREIVEHNITGLLHPLGRPGSQVLARNFEFFLENPQARHEMGMRGREKVEK 229
            PVLGTD+GGT+E+VE N+TGLLHP+G  G+Q+L+ N  F L+NP +R +MG RGR+KVE+
Sbjct: 610  PVLGTDAGGTKEVVEQNVTGLLHPVGHLGTQILSENIRFLLKNPSSREQMGKRGRKKVER 669

Query: 228  MYLKKHMFQKFGEVLYKCMRIK 163
            MYLK+HM+++  EVLYKCMRIK
Sbjct: 670  MYLKRHMYKRLAEVLYKCMRIK 691


>ref|XP_007217014.1| hypothetical protein PRUPE_ppa002059mg [Prunus persica]
            gi|462413164|gb|EMJ18213.1| hypothetical protein
            PRUPE_ppa002059mg [Prunus persica]
          Length = 723

 Score =  706 bits (1821), Expect = 0.0
 Identities = 382/628 (60%), Positives = 461/628 (73%), Gaps = 25/628 (3%)
 Frame = -1

Query: 1971 GESKGGKSMRRDLSAAVGTGALKLKNETSNSSLE---NVDVVLAKSRSGDSLXXXXXXXX 1801
            G S   ++ RRDL A+    ++ +KNET+ + ++   ++DVVL K  +G S         
Sbjct: 100  GNSDTEQNARRDLLAS--DSSMAVKNETNQNQVKAGKSIDVVLTKKENGVSSRRSASSKK 157

Query: 1800 XXXXXXXXXXXXXXXVAQ---DVES-EVDLPIEDIPKKNTTYGFLVGPFGSVEDSILEWS 1633
                             +   +VE  E +    DIPK NT+YG LVGPFG VED  LEWS
Sbjct: 158  RSKKSARSLRGKVHGKQKKTVEVEGHETEEQELDIPKTNTSYGMLVGPFGFVEDRTLEWS 217

Query: 1632 PDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATEFLSCGATISVIVLNK 1453
            P  RSGTCDRKG FARLVWSR+F+LIFHELSMTGAPL+MMELATE LSCGAT+S +VL+K
Sbjct: 218  PKTRSGTCDRKGDFARLVWSRRFLLIFHELSMTGAPLSMMELATELLSCGATVSAVVLSK 277

Query: 1452 KGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIEQYLSRTVLGSSQIMW 1273
            KGGLM EL RR+IKVL DK + SFKTAMKA+L+IAGSAVC+SWI+QY+     G+SQI W
Sbjct: 278  KGGLMPELARRRIKVLEDKVEQSFKTAMKADLVIAGSAVCASWIDQYMDHFPAGASQIAW 337

Query: 1272 WIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHLKDEPALVPLSVNDEL 1093
            WIMENRREYFDR+K VLNRVK L FLSESQSKQWLDWCEEE I L+ +PA+VPLS+NDEL
Sbjct: 338  WIMENRREYFDRAKVVLNRVKMLAFLSESQSKQWLDWCEEEKIKLRSQPAVVPLSINDEL 397

Query: 1092 AFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVVSLSSINPGKGQLLLM 913
            AF AGI CSLNTPS +TE MLEKRQ LR  VR+EMGL D+DMLV+SLSSINPGKGQLLL+
Sbjct: 398  AFVAGIGCSLNTPSSSTEKMLEKRQLLRDSVRKEMGLTDNDMLVMSLSSINPGKGQLLLL 457

Query: 912  ESARLVIEQGQKLNNSGSKDSV-------LLDHDYYSRALLQNGKRDNESSN-------- 778
            ESARLVIE+  K  NS  K+ V        L   ++ RAL Q    D  SSN        
Sbjct: 458  ESARLVIEEPLKY-NSKIKNPVRKRQARSTLARKHHLRALFQELNDDGVSSNELPLSNES 516

Query: 777  ---IDTPTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGSVGSK 607
               ++ P KK++R   ++T+        +      RK+LS+N G   Q++K LIGSVGSK
Sbjct: 517  DVQLNEPQKKKLRLRSLYTSFDDTGDLTF-NVTHKRKVLSDNGGTLEQSVKFLIGSVGSK 575

Query: 606  SNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGRVTIE 427
            SNKV YVK LL +LS HSN+SKSVLWTPATTRVA+LY+AADVYVMNSQG+GETFGRVTIE
Sbjct: 576  SNKVLYVKELLGFLSQHSNMSKSVLWTPATTRVAALYSAADVYVMNSQGLGETFGRVTIE 635

Query: 426  AMAFGLPVLGTDSGGTREIVEHNITGLLHPLGRPGSQVLARNFEFFLENPQARHEMGMRG 247
            AMAFGLPVLGT++GGT EIVEHN+TGLLHP+G PG++VLA N  F L++P AR +MG++G
Sbjct: 636  AMAFGLPVLGTEAGGTTEIVEHNVTGLLHPVGHPGTRVLAENIRFLLKSPNARKQMGLKG 695

Query: 246  REKVEKMYLKKHMFQKFGEVLYKCMRIK 163
            REKVE+MYLK+HM+++F +VL KCMR K
Sbjct: 696  REKVERMYLKRHMYKRFVDVLLKCMRPK 723


>ref|XP_002528176.1| glycosyltransferase, putative [Ricinus communis]
            gi|223532388|gb|EEF34183.1| glycosyltransferase, putative
            [Ricinus communis]
          Length = 686

 Score =  699 bits (1804), Expect = 0.0
 Identities = 362/535 (67%), Positives = 430/535 (80%), Gaps = 7/535 (1%)
 Frame = -1

Query: 1746 DVESE-VDLPIEDIPKKNTTYGFLVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSR 1570
            +VESE V++   DIP+KNTTYGFLVGPFGS ED ILEWSP+KR+GTCDRKG FARLVWSR
Sbjct: 191  EVESEDVEVQEPDIPQKNTTYGFLVGPFGSTEDRILEWSPEKRTGTCDRKGDFARLVWSR 250

Query: 1569 KFVLIFHELSMTGAPLAMMELATEFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSD 1390
            KFVLIFHELSMTGAPL+MMELATEFLSCGAT+S +VL+KKGGLMSELNRR+IKVL DK+D
Sbjct: 251  KFVLIFHELSMTGAPLSMMELATEFLSCGATVSAVVLSKKGGLMSELNRRRIKVLEDKAD 310

Query: 1389 LSFKTAMKANLIIAGSAVCSSWIEQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVK 1210
            LSFKTAMKA+L+IAGSAVC+SWI+QY++R   G SQI+WWIMENRREYFDRSK VLNRVK
Sbjct: 311  LSFKTAMKADLVIAGSAVCASWIDQYMTRFPAGGSQIVWWIMENRREYFDRSKIVLNRVK 370

Query: 1209 KLIFLSESQSKQWLDWCEEENIHLKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENML 1030
             L+FLSESQ++QWL WC+EE I L+  PA+VPLS+NDELAF AGI+CSLNTPS + E ML
Sbjct: 371  MLVFLSESQTEQWLSWCDEEKIKLRAPPAIVPLSINDELAFVAGIACSLNTPSSSPEKML 430

Query: 1029 EKRQSLRKVVREEMGLNDDDMLVVSLSSINPGKGQLLLMESARLVIEQG--QKLNNS--- 865
            EKR+ L   VR+EMGL DDD+L+VSLSSINPGKGQLL++ESA+L+IE    QKL +S   
Sbjct: 431  EKRRLLADSVRKEMGLTDDDVLLVSLSSINPGKGQLLILESAKLLIEPEPLQKLRSSVGI 490

Query: 864  GSKDS-VLLDHDYYSRALLQNGKRDNESSNIDTPTKKRIRSSRIFTNEGRLNSARYGRDA 688
            G + S + + H  + RALLQ  ++    S++    +K +++                   
Sbjct: 491  GEEQSRIAVKH--HLRALLQ--EKSKAVSDLKEGQEKYLKA------------------- 527

Query: 687  RMRKMLSENVGKKGQNLKVLIGSVGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRV 508
                            LKVLIGSVGSKSNKV YVK +L+YL+ HSNLSKSVLWTPATTRV
Sbjct: 528  ----------------LKVLIGSVGSKSNKVPYVKEMLSYLTQHSNLSKSVLWTPATTRV 571

Query: 507  ASLYAAADVYVMNSQGIGETFGRVTIEAMAFGLPVLGTDSGGTREIVEHNITGLLHPLGR 328
            ASLY+AAD YV+NSQG+GETFGRVTIEAMAFGLPVLGTD+GGT+EIVEHN+TGLLHP+GR
Sbjct: 572  ASLYSAADAYVINSQGLGETFGRVTIEAMAFGLPVLGTDAGGTKEIVEHNVTGLLHPVGR 631

Query: 327  PGSQVLARNFEFFLENPQARHEMGMRGREKVEKMYLKKHMFQKFGEVLYKCMRIK 163
            PG+ VLA+N  F L NP  R +MGM GR+KVE+MYLK+HM++KF EVLYKCMR+K
Sbjct: 632  PGTHVLAQNLRFLLRNPSVREQMGMAGRKKVERMYLKRHMYKKFSEVLYKCMRVK 686


>ref|XP_004302927.1| PREDICTED: uncharacterized protein LOC101300160 [Fragaria vesca
            subsp. vesca]
          Length = 720

 Score =  699 bits (1803), Expect = 0.0
 Identities = 377/626 (60%), Positives = 457/626 (73%), Gaps = 24/626 (3%)
 Frame = -1

Query: 1968 ESKGGKSMRRDLSAAVGTGALKLKNETSNSSLE---NVDVVLAKSRSGDSLXXXXXXXXX 1798
            +S   ++ RRDL  +     +KLKNET  +  E    +DVVLAK   G +          
Sbjct: 102  KSDAEQNQRRDLLDS----PVKLKNETGQNQPEAGKTIDVVLAKKDDGVASRRSLSSKKK 157

Query: 1797 XXXXXXXXXXXXXXVAQDVE-SEVDLPIEDIPKKNTTYGFLVGPFGSVEDSILEWSPDKR 1621
                              +E  E++    DIPK N +YG LVGPFGS ED ILEW+P  R
Sbjct: 158  SKKAARGKSHGKPKKTVAIEIHEIEEQEPDIPKTNASYGMLVGPFGSTEDRILEWNPKTR 217

Query: 1620 SGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATEFLSCGATISVIVLNKKGGL 1441
            +GTCDRKG F+RLVWSR+F+LIFHELSMTGAPL+MMELATE LSCGAT+S IVL+KKGGL
Sbjct: 218  TGTCDRKGDFSRLVWSRRFLLIFHELSMTGAPLSMMELATELLSCGATVSAIVLSKKGGL 277

Query: 1440 MSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIEQYLSRTVLGSSQIMWWIME 1261
            M EL RR+IKVL DK+D SFKTAMK +L+IAGSAVC+SWI+QY+ +   G+SQI WWIME
Sbjct: 278  MPELTRRRIKVLEDKADHSFKTAMKQDLVIAGSAVCASWIDQYIDKFPAGASQIAWWIME 337

Query: 1260 NRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHLKDEPALVPLSVNDELAFAA 1081
            NRREYFDR+K VL+RVK L FLSESQSKQWLDWCEEE I L+ +PA+VPLS+NDELAF A
Sbjct: 338  NRREYFDRAKVVLDRVKMLAFLSESQSKQWLDWCEEEKIKLRSQPAIVPLSINDELAFVA 397

Query: 1080 GISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVVSLSSINPGKGQLLLMESAR 901
            GI CSLNTPS + E MLEK + LR  VR+EMGL D+DML +SLSSINPGKGQLL++ SAR
Sbjct: 398  GIGCSLNTPSSSIEKMLEKMKLLRDAVRKEMGLTDNDMLAISLSSINPGKGQLLVLNSAR 457

Query: 900  LVIEQGQKLNNSGSKDSV-------LLDHDYYSRALLQNGKRDNESSNIDTPTKKRIRSS 742
            LVIE+  + +NS  K+SV        L   ++ RALLQ G  D+ +S    P      SS
Sbjct: 458  LVIEEEPQPDNSKIKNSVRKGRVRSALARKHHIRALLQ-GSNDHSASLNGFPLS--TESS 514

Query: 741  RIFTNEGRLNSARYGRDARM-------------RKMLSENVGKKGQNLKVLIGSVGSKSN 601
              F  + + +   + R A +             RK+L++N G   Q+ K LIGSVGSKSN
Sbjct: 515  VHFKEDQKKHLHLHNRFASVDDTDAMNFDVTYKRKVLADNGGTVKQSAKFLIGSVGSKSN 574

Query: 600  KVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGRVTIEAM 421
            KVAYVK LL+YLS HSNLSKSVLWTP+TTRVA+LY+AADVYVMNSQG+GETFGRVTIEAM
Sbjct: 575  KVAYVKELLSYLSQHSNLSKSVLWTPSTTRVAALYSAADVYVMNSQGLGETFGRVTIEAM 634

Query: 420  AFGLPVLGTDSGGTREIVEHNITGLLHPLGRPGSQVLARNFEFFLENPQARHEMGMRGRE 241
            AFGLPVLGTD+GGT+EIV+HN+TGLLHPLG PG+QVLA+N    L+NP+ R +MG++GRE
Sbjct: 635  AFGLPVLGTDAGGTKEIVDHNVTGLLHPLGHPGTQVLAKNLRLLLKNPELRKQMGVKGRE 694

Query: 240  KVEKMYLKKHMFQKFGEVLYKCMRIK 163
            KVE+MYLK+HM++KF +VL KCMR K
Sbjct: 695  KVERMYLKRHMYKKFVDVLLKCMRPK 720


>ref|XP_006465456.1| PREDICTED: uncharacterized protein LOC102612096 isoform X1 [Citrus
            sinensis] gi|568822059|ref|XP_006465457.1| PREDICTED:
            uncharacterized protein LOC102612096 isoform X2 [Citrus
            sinensis]
          Length = 732

 Score =  695 bits (1793), Expect = 0.0
 Identities = 365/623 (58%), Positives = 453/623 (72%), Gaps = 26/623 (4%)
 Frame = -1

Query: 1953 KSMRRDLSAA-----VGTGALKLKNETSNSSLENVDVVLAKSRSGDSLXXXXXXXXXXXX 1789
            ++ RRDL A      +  G +K    T  +  + +D+VL + R+ D+             
Sbjct: 114  QNKRRDLIANHSDLDINNGTIK----TLGADSKKMDMVLTQRRNNDASRRSVAKRKKSKR 169

Query: 1788 XXXXXXXXXXXVAQDVESE-VDLPIEDIPKKNTTYGFLVGPFGSVEDSILEWSPDKRSGT 1612
                          DVES  ++  + +IP  N +YG LVGPFG  ED ILEWSP+KRSGT
Sbjct: 170  SSRGKGRGKQKAKLDVESNYMEAQLPEIPMTNASYGLLVGPFGLTEDRILEWSPEKRSGT 229

Query: 1611 CDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATEFLSCGATISVIVLNKKGGLMSE 1432
            CDRKG FAR VWSRKF+LIFHELSMTGAPL+MMELATE LSCGAT+S +VL+K+GGLM E
Sbjct: 230  CDRKGDFARFVWSRKFILIFHELSMTGAPLSMMELATELLSCGATVSAVVLSKRGGLMPE 289

Query: 1431 LNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIEQYLSRTVLGSSQIMWWIMENRR 1252
            L RRKIKVL D+ + SFKT+MKA+L+IAGSAVC++WI+QY++R   G SQ++WWIMENRR
Sbjct: 290  LARRKIKVLEDRGEPSFKTSMKADLVIAGSAVCATWIDQYITRFPAGGSQVVWWIMENRR 349

Query: 1251 EYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHLKDEPALVPLSVNDELAFAAGIS 1072
            EYFDR+K VL+RVK L+FLSESQ+KQWL WCEEE + L+ +PA+VPLSVNDELAF AG +
Sbjct: 350  EYFDRAKLVLDRVKLLVFLSESQTKQWLTWCEEEKLKLRSQPAVVPLSVNDELAFVAGFT 409

Query: 1071 CSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVVSLSSINPGKGQLLLMESARLVI 892
            CSLNTP+ + E M EKR  LR  VR+EMGL D DMLV+SLSSINPGKGQLLL+ESA+L+I
Sbjct: 410  CSLNTPTSSPEKMREKRNLLRDSVRKEMGLTDQDMLVLSLSSINPGKGQLLLVESAQLMI 469

Query: 891  EQG--------QKLNNSGSKDSVLLD-HDYYSRALLQN----GKRDNESS-------NID 772
            EQ         +K  N G K S L   H    R LLQ     G   NE S        ++
Sbjct: 470  EQEPSMDDSKIRKSRNVGRKKSSLTSRHHLRGRGLLQMSDDVGLSSNELSVSSESFTQLN 529

Query: 771  TPTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGSVGSKSNKVA 592
             P +K + S  +FT+ G  ++  +G     RK+LS++ GK+ Q LK+LIGSVGSKSNKV 
Sbjct: 530  EPVRKNLLSPSLFTSIGNTDAVSFGSGHLRRKVLSKSDGKQQQALKILIGSVGSKSNKVP 589

Query: 591  YVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGRVTIEAMAFG 412
            YVK +L +LS HSNLSK++LWTPATTRVASLY+AADVYV+NSQG+GETFGRVTIEAMAFG
Sbjct: 590  YVKEILEFLSQHSNLSKAMLWTPATTRVASLYSAADVYVINSQGLGETFGRVTIEAMAFG 649

Query: 411  LPVLGTDSGGTREIVEHNITGLLHPLGRPGSQVLARNFEFFLENPQARHEMGMRGREKVE 232
            +PVLGTD+GGT+EIVEHN+TGLLHP G PG+QVLA+N  + L+NP  R  M M GR+KVE
Sbjct: 650  VPVLGTDAGGTKEIVEHNVTGLLHPPGHPGAQVLAQNLRYLLKNPSVRERMAMEGRKKVE 709

Query: 231  KMYLKKHMFQKFGEVLYKCMRIK 163
            +MYLKKHM++K  +V+YKCM+ K
Sbjct: 710  RMYLKKHMYKKLSQVIYKCMKPK 732


>ref|XP_006427083.1| hypothetical protein CICLE_v10024994mg [Citrus clementina]
            gi|557529073|gb|ESR40323.1| hypothetical protein
            CICLE_v10024994mg [Citrus clementina]
          Length = 732

 Score =  693 bits (1789), Expect = 0.0
 Identities = 364/623 (58%), Positives = 452/623 (72%), Gaps = 26/623 (4%)
 Frame = -1

Query: 1953 KSMRRDLSAA-----VGTGALKLKNETSNSSLENVDVVLAKSRSGDSLXXXXXXXXXXXX 1789
            ++ RRDL A      +  G +K    T  +  + +D+VL + R+ D+             
Sbjct: 114  QNKRRDLIANHSDLDINNGTIK----TLGADSKKIDMVLTQRRNNDASRRSVAKRKKSKR 169

Query: 1788 XXXXXXXXXXXVAQDVESE-VDLPIEDIPKKNTTYGFLVGPFGSVEDSILEWSPDKRSGT 1612
                          DVES  ++  + +IP  N +YG LVGPFG  ED ILEWSP+KRSGT
Sbjct: 170  SSRGKGRGKQKAKLDVESNYMEAQLPEIPMTNASYGLLVGPFGLTEDRILEWSPEKRSGT 229

Query: 1611 CDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATEFLSCGATISVIVLNKKGGLMSE 1432
            CDRKG FAR VWSRKF+LIFHELSMTGAPL+MMELATE LSCGAT+S +VL+K+GGLM E
Sbjct: 230  CDRKGDFARFVWSRKFILIFHELSMTGAPLSMMELATELLSCGATVSAVVLSKRGGLMPE 289

Query: 1431 LNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIEQYLSRTVLGSSQIMWWIMENRR 1252
            L RRKIKVL D+ + SFKT+MKA+L+IAGSAVC++WI+QY++R   G SQ++WWIMENRR
Sbjct: 290  LARRKIKVLEDRGEPSFKTSMKADLVIAGSAVCATWIDQYITRFPAGGSQVVWWIMENRR 349

Query: 1251 EYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHLKDEPALVPLSVNDELAFAAGIS 1072
            EYFDR+K VL+RVK L+FLSESQ+KQWL WCEEE + L+ +PA+VPLSVNDELAF AG +
Sbjct: 350  EYFDRAKLVLDRVKMLVFLSESQTKQWLTWCEEEKLKLRSQPAVVPLSVNDELAFVAGFT 409

Query: 1071 CSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVVSLSSINPGKGQLLLMESARLVI 892
            CSLNTP+ + E M EKR  LR  VR+EMGL D DMLV+SLSSINPGKGQLLL+ESA+L+I
Sbjct: 410  CSLNTPTSSPEKMCEKRNLLRDSVRKEMGLTDQDMLVLSLSSINPGKGQLLLVESAQLMI 469

Query: 891  EQG--------QKLNNSGSKDSVLLD-HDYYSRALLQN----GKRDNESS-------NID 772
            EQ         +K  N G K S L   H    R LLQ     G   NE S        ++
Sbjct: 470  EQEPSMDDSKIRKSRNVGRKKSSLTSRHHLRGRGLLQMSDDVGLSSNELSVSSESFTQLN 529

Query: 771  TPTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGSVGSKSNKVA 592
             P +K + S  +FT+ G  ++  +G     RK+LS++ GK+ Q LK+LIGSVGSKSNKV 
Sbjct: 530  EPVRKNLLSPSLFTSIGNTDAVSFGSGHLRRKVLSKSDGKQQQALKILIGSVGSKSNKVP 589

Query: 591  YVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGRVTIEAMAFG 412
            YVK +L +LS HSNLSK++LWTPATTRVASLY+AADVYV+NSQG+GETFGRVTIEAMAFG
Sbjct: 590  YVKEILEFLSQHSNLSKAMLWTPATTRVASLYSAADVYVINSQGLGETFGRVTIEAMAFG 649

Query: 411  LPVLGTDSGGTREIVEHNITGLLHPLGRPGSQVLARNFEFFLENPQARHEMGMRGREKVE 232
            +PVLGTD+GGT+EIVEHN+TGLLHP G PG+QVLA+N  + L+NP  R  M M GR+KVE
Sbjct: 650  VPVLGTDAGGTKEIVEHNVTGLLHPPGHPGAQVLAQNLRYLLKNPSVRERMAMEGRKKVE 709

Query: 231  KMYLKKHMFQKFGEVLYKCMRIK 163
            +MYLKK M++K  +V+YKCM+ K
Sbjct: 710  RMYLKKQMYKKLSQVIYKCMKPK 732


>ref|XP_002298139.1| glycosyl transferase family 1 family protein [Populus trichocarpa]
            gi|222845397|gb|EEE82944.1| glycosyl transferase family 1
            family protein [Populus trichocarpa]
          Length = 681

 Score =  693 bits (1788), Expect = 0.0
 Identities = 376/620 (60%), Positives = 448/620 (72%), Gaps = 16/620 (2%)
 Frame = -1

Query: 1974 GGESKGG-----KSMRRDLSAAVGTGALKLKNETSNSSLEN---VDVVLAKSRSGDSLXX 1819
            GG+S  G     +  RRDL A      + + N T+   + N   +DVVLAK  +G S   
Sbjct: 104  GGKSSNGLLDAEQHTRRDLLA--NDSLVVVNNGTNKIQVRNAKKIDVVLAKKGNGVSSNR 161

Query: 1818 XXXXXXXXXXXXXXXXXXXXXVAQD----VESE-VDLPIEDIPKKNTTYGFLVGPFGSVE 1654
                                   Q     VES+ V++   D+PK N +YG LVGPFG +E
Sbjct: 162  RATPKKKKSKRGGRRSRAKAHDKQKATVVVESDDVEVAEPDVPKNNASYGLLVGPFGPIE 221

Query: 1653 DSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATEFLSCGATI 1474
            D ILEWSP+KRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPL+M+ELATEFLSCGAT+
Sbjct: 222  DRILEWSPEKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLSMLELATEFLSCGATV 281

Query: 1473 SVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIEQYLSRTVL 1294
            S +VL+KKGGLM EL RR+IKVL D++DLSFKTAMKA+L+IAGSAVC+SWI+QY++R   
Sbjct: 282  SAVVLSKKGGLMPELARRRIKVLEDRADLSFKTAMKADLVIAGSAVCTSWIDQYIARFPA 341

Query: 1293 GSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHLKDEPALVP 1114
            G SQ++WWIMENRREYFDRSK +LNRVK L+FLSESQ KQW  WCEEENI L+  PA+V 
Sbjct: 342  GGSQVVWWIMENRREYFDRSKIILNRVKMLVFLSESQMKQWQTWCEEENIRLRSPPAVVQ 401

Query: 1113 LSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVVSLSSINPG 934
            LSVNDELAF AGI+CSLNTP+ ++E MLEKRQ LR+ VR+EMGL D+DMLV+SLSSIN G
Sbjct: 402  LSVNDELAFVAGIACSLNTPTSSSEKMLEKRQLLRESVRKEMGLTDNDMLVMSLSSINAG 461

Query: 933  KGQLLLMESARLVIE--QGQKLNNSGSK-DSVLLDHDYYSRALLQNGKRDNESSNIDTPT 763
            KGQLLL+ESA LVIE     K+ NS  K +   L   ++ RAL                 
Sbjct: 462  KGQLLLLESANLVIEPDPSPKITNSVDKGNQSTLAAKHHLRAL----------------- 504

Query: 762  KKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGSVGSKSNKVAYVK 583
                                     R RK+L+++ G   Q LKVLIGSVGSKSNKV YVK
Sbjct: 505  -----------------------SHRKRKLLADSEGTHEQALKVLIGSVGSKSNKVPYVK 541

Query: 582  TLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGRVTIEAMAFGLPV 403
             +L ++S HSNLSKSVLWT ATTRVASLY+AADVY+ NSQG+GETFGRVTIEAMAFGLPV
Sbjct: 542  EILRFISQHSNLSKSVLWTSATTRVASLYSAADVYITNSQGLGETFGRVTIEAMAFGLPV 601

Query: 402  LGTDSGGTREIVEHNITGLLHPLGRPGSQVLARNFEFFLENPQARHEMGMRGREKVEKMY 223
            LGTD+GGT+EIVEHNITGLLHP+GRPGS+VLA+N E  L+NP  R +MG++GR+KVEKMY
Sbjct: 602  LGTDAGGTQEIVEHNITGLLHPVGRPGSRVLAQNIELLLKNPSVRKQMGIKGRKKVEKMY 661

Query: 222  LKKHMFQKFGEVLYKCMRIK 163
            LK+HM++K  EVLYKCMR+K
Sbjct: 662  LKRHMYKKIWEVLYKCMRVK 681


>emb|CBI36173.3| unnamed protein product [Vitis vinifera]
          Length = 683

 Score =  688 bits (1776), Expect = 0.0
 Identities = 373/622 (59%), Positives = 443/622 (71%), Gaps = 18/622 (2%)
 Frame = -1

Query: 1974 GGESKGGKS---MRRDLSAAVGTGALKLKNETSNSSL---ENVDVVLAKSRSGDSLXXXX 1813
            GG+   G S   + R          L +KN +  + +   + VDVVLAK   G+S+    
Sbjct: 104  GGKPNNGISDSELNRKAPLIANDKLLAVKNGSDKNPVGSGKKVDVVLAKK--GNSVPSRR 161

Query: 1812 XXXXXXXXXXXXXXXXXXXVAQDVESEVDLPIED-----IPKKNTTYGFLVGPFGSVEDS 1648
                                 Q  ++EV++   D     IPK NT+YG LVGPFGS ED 
Sbjct: 162  SASSKKRSKKSERSLRGKTRKQKTKTEVEVTEMDEQEQEIPKLNTSYGLLVGPFGSTEDR 221

Query: 1647 ILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATEFLSCGATISV 1468
            ILEWSP+KRSGTCDR+G  ARLVWSRKFVLIFHELSMTGAPL+MMELATE LSCGAT+S 
Sbjct: 222  ILEWSPEKRSGTCDRRGELARLVWSRKFVLIFHELSMTGAPLSMMELATELLSCGATVSA 281

Query: 1467 IVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIEQYLSRTVLGS 1288
            +VL+KKGGLM EL RR+IKVL D++DLSFKTAMKA+L+IAGSAVC+SWIEQY++    GS
Sbjct: 282  VVLSKKGGLMPELARRRIKVLEDRADLSFKTAMKADLVIAGSAVCASWIEQYIAHFTAGS 341

Query: 1287 SQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHLKDEPALVPLS 1108
            SQI+WWIMENRREYFDRSK V+NRVK LIFLSESQSKQWL WC+EENI L  +PA+VPLS
Sbjct: 342  SQIVWWIMENRREYFDRSKLVINRVKMLIFLSESQSKQWLTWCKEENIRLISQPAVVPLS 401

Query: 1107 VNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVVSLSSINPGKG 928
            VNDELAF AGI+CSLNTPSFTTE M EKR+ LR  +R+EMGL D DML++SLSSINPGKG
Sbjct: 402  VNDELAFVAGITCSLNTPSFTTEKMQEKRRLLRDSIRKEMGLTDTDMLLLSLSSINPGKG 461

Query: 927  QLLLMESARLVIEQGQKLNNSGSKDSVLLDHD-------YYSRALLQNGKRDNESSNIDT 769
            Q  L+ES R +IEQ    ++   KD V +  D       +YSRALLQN            
Sbjct: 462  QFFLLESVRSMIEQEPSQDDPELKDLVKIGQDQSNFSGKHYSRALLQN------------ 509

Query: 768  PTKKRIRSSRIFTNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGSVGSKSNKVAY 589
                             LN  +           S+N+    Q LKVLIGSVGSKSNKV Y
Sbjct: 510  -----------------LNGPK-----------SKNLMLPKQALKVLIGSVGSKSNKVPY 541

Query: 588  VKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGRVTIEAMAFGL 409
            VK LL +L+ HSNLSKSVLWTPATTRVASLY+AADVYV+NSQG+GETFGRVTIEAMAFGL
Sbjct: 542  VKGLLRFLTRHSNLSKSVLWTPATTRVASLYSAADVYVINSQGMGETFGRVTIEAMAFGL 601

Query: 408  PVLGTDSGGTREIVEHNITGLLHPLGRPGSQVLARNFEFFLENPQARHEMGMRGREKVEK 229
            PVLGTD+GGT+E+VE N+TGLLHP+G  G+Q+L+ N  F L+NP +R +MG RGR+KVE+
Sbjct: 602  PVLGTDAGGTKEVVEQNVTGLLHPVGHLGTQILSENIRFLLKNPSSREQMGKRGRKKVER 661

Query: 228  MYLKKHMFQKFGEVLYKCMRIK 163
            MYLK+HM+++  EVLYKCMRIK
Sbjct: 662  MYLKRHMYKRLAEVLYKCMRIK 683


>ref|XP_007024055.1| UDP-Glycosyltransferase superfamily protein isoform 1 [Theobroma
            cacao] gi|508779421|gb|EOY26677.1|
            UDP-Glycosyltransferase superfamily protein isoform 1
            [Theobroma cacao]
          Length = 702

 Score =  684 bits (1764), Expect = 0.0
 Identities = 365/607 (60%), Positives = 445/607 (73%), Gaps = 13/607 (2%)
 Frame = -1

Query: 1944 RRDLSA-----AVGTGALKLKNETSNSSLENVDVVLAKSRSGDSLXXXXXXXXXXXXXXX 1780
            RRDL A     AV  G     N+T   S    DV+LAK R+  S                
Sbjct: 110  RRDLLADDSLVAVNNGT----NKTQVYSDRKFDVILAKKRNEVSFNKKRSRRSKRAGRNL 165

Query: 1779 XXXXXXXXVAQDVES-EVDLPIEDIPKKNTTYGFLVGPFGSVEDSILEWSPDKRSGTCDR 1603
                       ++E+ E +    +I +KN+TYG LVGPFGSVED ILEWSP+KRSGTCDR
Sbjct: 166  SKMRGKRKATINIENGETEGQEHEILQKNSTYGLLVGPFGSVEDRILEWSPEKRSGTCDR 225

Query: 1602 KGAFARLVWSRKFVLIFHELSMTGAPLAMMELATEFLSCGATISVIVLNKKGGLMSELNR 1423
            KG FARLVWSR+ VL+FHELSMTGAP++MMELATE LSCGAT+S +VL+KKGGLMSEL R
Sbjct: 226  KGDFARLVWSRRLVLVFHELSMTGAPISMMELATELLSCGATVSAVVLSKKGGLMSELAR 285

Query: 1422 RKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIEQYLSRTVLGSSQIMWWIMENRREYF 1243
            R+IKV+ D++DLSFKTAMKA+L+IAGSAVC+SWI+QY++    G SQI WWIMENRREYF
Sbjct: 286  RRIKVIEDRADLSFKTAMKADLVIAGSAVCASWIDQYIAHFPAGGSQIAWWIMENRREYF 345

Query: 1242 DRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHLKDEPALVPLSVNDELAFAAGISCSL 1063
            DRSK VL+RVK LIFLSE QSKQWL WC+EENI L+ +PALVPL+VNDELAF AGI CSL
Sbjct: 346  DRSKLVLHRVKMLIFLSELQSKQWLTWCQEENIKLRSQPALVPLAVNDELAFVAGIPCSL 405

Query: 1062 NTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVVSLSSINPGKGQLLLMESARLVIEQG 883
            NTPS + E MLEKRQ LR  VR+EMGL D+DMLV+SLSSIN GKGQLLL+E+A L+I+Q 
Sbjct: 406  NTPSASPEKMLEKRQLLRDAVRKEMGLTDNDMLVMSLSSINTGKGQLLLLEAAGLMIDQD 465

Query: 882  QKLNNSGSKDSVLLDHD-------YYSRALLQNGKRDNESSNIDTPTKKRIRSSRIFTNE 724
                +S    S+ +  D       ++ R LLQ      +SS++D  +       R+F + 
Sbjct: 466  PLQTDSEVTKSLDIRQDQSTLTVKHHLRGLLQ------KSSDVDVSS----TDLRLFASV 515

Query: 723  GRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGSVGSKSNKVAYVKTLLTYLSTHSNLS 544
               N+       R R ML ++ G + Q LK+LIGSVGSKSNK+ YVK +L +LS H+ LS
Sbjct: 516  NGTNAVSIDSSHRRRNMLFDSKGTQEQALKILIGSVGSKSNKMPYVKEILRFLSQHAKLS 575

Query: 543  KSVLWTPATTRVASLYAAADVYVMNSQGIGETFGRVTIEAMAFGLPVLGTDSGGTREIVE 364
            +SVLWTPATT VASLY+AADVYVMNSQG+GETFGRVT+EAMAFGLPVLGTD+GGT+EIVE
Sbjct: 576  ESVLWTPATTHVASLYSAADVYVMNSQGLGETFGRVTVEAMAFGLPVLGTDAGGTKEIVE 635

Query: 363  HNITGLLHPLGRPGSQVLARNFEFFLENPQARHEMGMRGREKVEKMYLKKHMFQKFGEVL 184
            +N+TGL HP+G PG+Q LA N  F L+NP AR +MGM GR+KVE+ YLK+HM+++F EVL
Sbjct: 636  NNVTGLFHPMGHPGAQALAGNLRFLLKNPSARKQMGMEGRKKVERKYLKRHMYKRFVEVL 695

Query: 183  YKCMRIK 163
             +CMRIK
Sbjct: 696  TRCMRIK 702


>ref|XP_007024056.1| UDP-Glycosyltransferase superfamily protein isoform 2 [Theobroma
            cacao] gi|508779422|gb|EOY26678.1|
            UDP-Glycosyltransferase superfamily protein isoform 2
            [Theobroma cacao]
          Length = 703

 Score =  679 bits (1752), Expect = 0.0
 Identities = 365/608 (60%), Positives = 445/608 (73%), Gaps = 14/608 (2%)
 Frame = -1

Query: 1944 RRDLSA-----AVGTGALKLKNETSNSSLENVDVVLAKSRSGDSLXXXXXXXXXXXXXXX 1780
            RRDL A     AV  G     N+T   S    DV+LAK R+  S                
Sbjct: 110  RRDLLADDSLVAVNNGT----NKTQVYSDRKFDVILAKKRNEVSFNKKRSRRSKRAGRNL 165

Query: 1779 XXXXXXXXVAQDVES-EVDLPIEDIPKKNTTYGFLVGPFGSVEDSILEWSPDKRSGTCDR 1603
                       ++E+ E +    +I +KN+TYG LVGPFGSVED ILEWSP+KRSGTCDR
Sbjct: 166  SKMRGKRKATINIENGETEGQEHEILQKNSTYGLLVGPFGSVEDRILEWSPEKRSGTCDR 225

Query: 1602 KGAFARLVWSRKFVLIFHELSMTGAPLAMMELATEFLSCGATISVIVLNKKGGLMSELNR 1423
            KG FARLVWSR+ VL+FHELSMTGAP++MMELATE LSCGAT+S +VL+KKGGLMSEL R
Sbjct: 226  KGDFARLVWSRRLVLVFHELSMTGAPISMMELATELLSCGATVSAVVLSKKGGLMSELAR 285

Query: 1422 RKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIEQYLSRTVLGSSQIMWWIMENRREYF 1243
            R+IKV+ D++DLSFKTAMKA+L+IAGSAVC+SWI+QY++    G SQI WWIMENRREYF
Sbjct: 286  RRIKVIEDRADLSFKTAMKADLVIAGSAVCASWIDQYIAHFPAGGSQIAWWIMENRREYF 345

Query: 1242 DRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHLKDEPALVPLSVNDELAFAAGISCSL 1063
            DRSK VL+RVK LIFLSE QSKQWL WC+EENI L+ +PALVPL+VNDELAF AGI CSL
Sbjct: 346  DRSKLVLHRVKMLIFLSELQSKQWLTWCQEENIKLRSQPALVPLAVNDELAFVAGIPCSL 405

Query: 1062 NTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVVSLSSINPGKGQLLLMESARLVIEQG 883
            NTPS + E MLEKRQ LR  VR+EMGL D+DMLV+SLSSIN GKGQLLL+E+A L+I+Q 
Sbjct: 406  NTPSASPEKMLEKRQLLRDAVRKEMGLTDNDMLVMSLSSINTGKGQLLLLEAAGLMIDQD 465

Query: 882  QKLNNSGSKDSVLLDHD-------YYSRALLQNGKRDNESSNIDTPTKKRIRSSRIFTNE 724
                +S    S+ +  D       ++ R LLQ      +SS++D  +       R+F + 
Sbjct: 466  PLQTDSEVTKSLDIRQDQSTLTVKHHLRGLLQ------KSSDVDVSS----TDLRLFASV 515

Query: 723  GRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGSVGSKSNKVAYVKTLLTYLSTHSNLS 544
               N+       R R ML ++ G + Q LK+LIGSVGSKSNK+ YVK +L +LS H+ LS
Sbjct: 516  NGTNAVSIDSSHRRRNMLFDSKGTQEQALKILIGSVGSKSNKMPYVKEILRFLSQHAKLS 575

Query: 543  KSVLWTPATTRVASLYAAADVYVMNS-QGIGETFGRVTIEAMAFGLPVLGTDSGGTREIV 367
            +SVLWTPATT VASLY+AADVYVMNS QG+GETFGRVT+EAMAFGLPVLGTD+GGT+EIV
Sbjct: 576  ESVLWTPATTHVASLYSAADVYVMNSQQGLGETFGRVTVEAMAFGLPVLGTDAGGTKEIV 635

Query: 366  EHNITGLLHPLGRPGSQVLARNFEFFLENPQARHEMGMRGREKVEKMYLKKHMFQKFGEV 187
            E+N+TGL HP+G PG+Q LA N  F L+NP AR +MGM GR+KVE+ YLK+HM+++F EV
Sbjct: 636  ENNVTGLFHPMGHPGAQALAGNLRFLLKNPSARKQMGMEGRKKVERKYLKRHMYKRFVEV 695

Query: 186  LYKCMRIK 163
            L +CMRIK
Sbjct: 696  LTRCMRIK 703


>gb|EXC25804.1| Putative glycosyltransferase ytcC [Morus notabilis]
          Length = 688

 Score =  674 bits (1740), Expect = 0.0
 Identities = 363/610 (59%), Positives = 442/610 (72%), Gaps = 9/610 (1%)
 Frame = -1

Query: 1965 SKGGKSMRRDLSAAVGTGALKLKNETSNSSLEN---VDVVLAKSRSGDSLXXXXXXXXXX 1795
            S+  +++RRDL A     +L +KN T  + + +   +DVVLA    G S           
Sbjct: 105  SETEQNLRRDLIAT--DISLAVKNGTGKNQVSDGKRMDVVLAGRNDGISSHRKLNSKKKK 162

Query: 1794 XXXXXXXXXXXXXVAQDVESEV-DLPIE----DIPKKNTTYGFLVGPFGSVEDSILEWSP 1630
                           Q +  EV ++ IE    DIPK N +YG LVGPFGS+ED ILEWSP
Sbjct: 163  TKRANRSLRSKVHGKQKMTMEVKNVEIEEQEPDIPKTNASYGMLVGPFGSLEDRILEWSP 222

Query: 1629 DKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATEFLSCGATISVIVLNKK 1450
            +KRSGTCDRKG FAR+VWSR+FVLIFHELSMTG+PL+MMELATE LSCGAT+S + L+KK
Sbjct: 223  EKRSGTCDRKGDFARIVWSRRFVLIFHELSMTGSPLSMMELATELLSCGATVSAVALSKK 282

Query: 1449 GGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIEQYLSRTVLGSSQIMWW 1270
            GGLMSEL RR+IKVL DK+DLSFKTAMKA+L+IAGSAVC+SWI+Q++     G+SQ+ WW
Sbjct: 283  GGLMSELARRRIKVLEDKADLSFKTAMKADLVIAGSAVCASWIDQFIEHFPAGASQVAWW 342

Query: 1269 IMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHLKDEPALVPLSVNDELA 1090
            IMENRREYFDR+K VLNRVK L+F+SE Q KQWL W EEE I+L+ +P LVPLS+NDE+A
Sbjct: 343  IMENRREYFDRAKVVLNRVKMLVFISELQWKQWLAWAEEEKIYLRSQPVLVPLSINDEMA 402

Query: 1089 FAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVVSLSSINPGKGQLLLME 910
            F AGI+C+LNTPSFTTE M+EKRQ LR   R+EMGL D+DMLV+SLSSINPGKGQ LL+ 
Sbjct: 403  FVAGIACTLNTPSFTTEKMIEKRQLLRDSARKEMGLKDNDMLVMSLSSINPGKGQHLLLG 462

Query: 909  SARLVIEQGQKLNNSGSKDSVLLDHDYYSRALLQNGKRDNESSNIDTPTKKRIRSSRIFT 730
            S RL+IE+      S  K+ V + H                         K  R  R+ T
Sbjct: 463  SGRLMIEKEAFEEKSNIKNPVDIKHH----------------------QSKSTRKHRLKT 500

Query: 729  NEGRLN-SARYGRDARMRKMLSENVGKKGQNLKVLIGSVGSKSNKVAYVKTLLTYLSTHS 553
               +LN S  +G     RK + ++ G + +++K+LIGSVGSKSNKV YVK LL YLS H 
Sbjct: 501  VFQKLNGSMAFG--GTHRKEMLDSGGMRERSVKILIGSVGSKSNKVVYVKELLNYLSQHP 558

Query: 552  NLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGRVTIEAMAFGLPVLGTDSGGTRE 373
            N SKSVLWTPA+TRVA+LYAAADVYV+NSQG+GETFGRVTIEAMAF LPVLGTD+GGT+E
Sbjct: 559  NTSKSVLWTPASTRVAALYAAADVYVINSQGLGETFGRVTIEAMAFSLPVLGTDAGGTKE 618

Query: 372  IVEHNITGLLHPLGRPGSQVLARNFEFFLENPQARHEMGMRGREKVEKMYLKKHMFQKFG 193
            IVEHN+TGLLHP G PG+ VLA N EF L+NP  R EMGM+GREKVE+MYLK+H+++KF 
Sbjct: 619  IVEHNVTGLLHPTGSPGAPVLAGNLEFLLKNPVTRKEMGMKGREKVERMYLKRHLYKKFV 678

Query: 192  EVLYKCMRIK 163
            +VL KCMR K
Sbjct: 679  DVLVKCMRPK 688


>ref|XP_004149847.1| PREDICTED: uncharacterized protein LOC101207532 [Cucumis sativus]
            gi|449496350|ref|XP_004160111.1| PREDICTED:
            uncharacterized protein LOC101223486 [Cucumis sativus]
          Length = 682

 Score =  664 bits (1714), Expect = 0.0
 Identities = 351/610 (57%), Positives = 438/610 (71%), Gaps = 6/610 (0%)
 Frame = -1

Query: 1974 GGESKGGK---SMRRDLSAAVGTGALKLKNETSNSSLEN---VDVVLAKSRSGDSLXXXX 1813
            GG+    K      + LS       L ++N +  +   +   V+VVLAK  +G S     
Sbjct: 103  GGQQSNQKLDSEQNQSLSLISTNNRLVVENRSGENDRSDGGVVNVVLAKKANGVSASKKT 162

Query: 1812 XXXXXXXXXXXXXXXXXXXVAQDVESEVDLPIEDIPKKNTTYGFLVGPFGSVEDSILEWS 1633
                                A+    +++    +IP KN++YG LVGPFGS ED ILEWS
Sbjct: 163  KPRKRSKRSKRDKVHKGKIPAEVTNHDIEEQEPEIPLKNSSYGMLVGPFGSTEDRILEWS 222

Query: 1632 PDKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLAMMELATEFLSCGATISVIVLNK 1453
            P+KRSGTCDRKG FARLVWSR+FVLIFHELSMTGAP++MMELATE LSCGA++S + L+K
Sbjct: 223  PEKRSGTCDRKGDFARLVWSRRFVLIFHELSMTGAPISMMELATELLSCGASVSAVALSK 282

Query: 1452 KGGLMSELNRRKIKVLVDKSDLSFKTAMKANLIIAGSAVCSSWIEQYLSRTVLGSSQIMW 1273
            KGGLMSEL+RR+IKVL DK+DLSFKTAMKA+L+IAGSAVC+SWI+ Y+     G+SQ+ W
Sbjct: 283  KGGLMSELSRRRIKVLDDKADLSFKTAMKADLVIAGSAVCASWIDGYIEHFPAGASQVAW 342

Query: 1272 WIMENRREYFDRSKHVLNRVKKLIFLSESQSKQWLDWCEEENIHLKDEPALVPLSVNDEL 1093
            WIMENRREYF+RSK VL+RVK LIF+SE QSKQWL+W +EENI L+ +PA+VPLSVNDEL
Sbjct: 343  WIMENRREYFNRSKVVLDRVKMLIFISELQSKQWLNWSQEENIKLRSQPAIVPLSVNDEL 402

Query: 1092 AFAAGISCSLNTPSFTTENMLEKRQSLRKVVREEMGLNDDDMLVVSLSSINPGKGQLLLM 913
            AF AGISCSLNT S + E MLEK+Q LR   R+EMG+ D+D++V++LSSINPGKG  LL+
Sbjct: 403  AFVAGISCSLNTESSSPEKMLEKKQLLRNTTRKEMGVGDNDVVVMTLSSINPGKGHFLLL 462

Query: 912  ESARLVIEQGQKLNNSGSKDSVLLDHDYYSRALLQNGKRDNESSNIDTPTKKRIRSSRIF 733
            ES+ L+I++G K ++                       R+ + S+   P   R R  R  
Sbjct: 463  ESSNLLIDRGLKRDDPKI--------------------RNPDDSSPSRPKLARRRYMRAL 502

Query: 732  TNEGRLNSARYGRDARMRKMLSENVGKKGQNLKVLIGSVGSKSNKVAYVKTLLTYLSTHS 553
                +LN          R++L++       + K+LIGSVGSKSNKV YVK LL +LS HS
Sbjct: 503  LQ--KLND--------RRRLLADGGELPETSFKLLIGSVGSKSNKVVYVKRLLRFLSQHS 552

Query: 552  NLSKSVLWTPATTRVASLYAAADVYVMNSQGIGETFGRVTIEAMAFGLPVLGTDSGGTRE 373
            NLS+SVLWTPATTRVASLY+AAD+YV+NSQGIGETFGRVTIEAMAFGLPVLGTD+GGT+E
Sbjct: 553  NLSQSVLWTPATTRVASLYSAADIYVINSQGIGETFGRVTIEAMAFGLPVLGTDAGGTKE 612

Query: 372  IVEHNITGLLHPLGRPGSQVLARNFEFFLENPQARHEMGMRGREKVEKMYLKKHMFQKFG 193
            IVEHN+TGLLHPLGRPG+QVLA+N EF L+NPQ R +MG  GR+KV+K+YLK+HM++KF 
Sbjct: 613  IVEHNVTGLLHPLGRPGTQVLAQNLEFLLKNPQVREKMGAEGRKKVKKIYLKRHMYKKFV 672

Query: 192  EVLYKCMRIK 163
            EV+ KCMR K
Sbjct: 673  EVIVKCMRTK 682


>ref|XP_007150675.1| hypothetical protein PHAVU_005G172300g [Phaseolus vulgaris]
            gi|593700475|ref|XP_007150676.1| hypothetical protein
            PHAVU_005G172300g [Phaseolus vulgaris]
            gi|561023939|gb|ESW22669.1| hypothetical protein
            PHAVU_005G172300g [Phaseolus vulgaris]
            gi|561023940|gb|ESW22670.1| hypothetical protein
            PHAVU_005G172300g [Phaseolus vulgaris]
          Length = 701

 Score =  663 bits (1710), Expect = 0.0
 Identities = 336/533 (63%), Positives = 415/533 (77%), Gaps = 7/533 (1%)
 Frame = -1

Query: 1740 ESEVDLPIEDIPKKNTTYGFLVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFV 1561
            +++++    +IP  N TYG LVGPFG VED ILEWSP+KRSGTC+RKG FARLVWSR+F+
Sbjct: 191  DADIEEQKPEIPTANGTYGLLVGPFGPVEDRILEWSPEKRSGTCNRKGDFARLVWSRRFI 250

Query: 1560 LIFHELSMTGAPLAMMELATEFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSF 1381
            L+FHELSMTGAPL+MMELATE LSCGAT+S +VL+KKGGLMSEL RR+IKVL DK+DLSF
Sbjct: 251  LVFHELSMTGAPLSMMELATELLSCGATVSAVVLSKKGGLMSELARRRIKVLEDKADLSF 310

Query: 1380 KTAMKANLIIAGSAVCSSWIEQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLI 1201
            KTAMKA+L+IAGSAVC+SWI+QY+ R   G+SQ++WWIMENRREYFD SK  L+RVK L+
Sbjct: 311  KTAMKADLVIAGSAVCASWIDQYIERFPAGASQVVWWIMENRREYFDLSKDALDRVKMLV 370

Query: 1200 FLSESQSKQWLDWCEEENIHLKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKR 1021
            FLSESQSKQWL WCEEE+I L+  P ++PLSVNDELAF AGI  +LNTPSF+T+ M+EKR
Sbjct: 371  FLSESQSKQWLKWCEEESIKLRSYPEIIPLSVNDELAFVAGIPSTLNTPSFSTDKMVEKR 430

Query: 1020 QSLRKVVREEMGLNDDDMLVVSLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSVLL 841
            Q LR+ VR+E+GLND DMLV+SLSSINPGKGQLLL+ES   V+EQG              
Sbjct: 431  QLLRESVRKEIGLNDSDMLVISLSSINPGKGQLLLLESVSSVLEQG-------------- 476

Query: 840  DHDYYSRALLQNGKRDNESSNI-----DTPTKKRIRSSRIFTNEGRL--NSARYGRDARM 682
                     LQ+ K+  + SNI         K RIR        G++  N       +R 
Sbjct: 477  --------WLQDDKKMKKVSNIKEGISTLARKHRIRKLLPVLKNGKVVSNDISSNSLSRR 528

Query: 681  RKMLSENVGKKGQNLKVLIGSVGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVAS 502
            +++L ++ G   ++LK+LIGSVGSKSNK  YVK+LL +L  H N SKS+ WTPATTRVAS
Sbjct: 529  KQVLPDDKGTIQKSLKLLIGSVGSKSNKADYVKSLLNFLEQHPNTSKSIFWTPATTRVAS 588

Query: 501  LYAAADVYVMNSQGIGETFGRVTIEAMAFGLPVLGTDSGGTREIVEHNITGLLHPLGRPG 322
            LY+AADVYV+NSQG+GETFGRVTIEAMAFGLPVLGT++GGT+EIVEHN+TGLLHP+G PG
Sbjct: 589  LYSAADVYVINSQGLGETFGRVTIEAMAFGLPVLGTEAGGTKEIVEHNVTGLLHPVGHPG 648

Query: 321  SQVLARNFEFFLENPQARHEMGMRGREKVEKMYLKKHMFQKFGEVLYKCMRIK 163
            + VLA+N  F L+N  AR +MG+ GR+KV++MYLK+HM++KF EV+ +CMR K
Sbjct: 649  NLVLAQNLRFLLKNQLARKQMGVEGRKKVQQMYLKQHMYKKFVEVIVRCMRSK 701


>ref|XP_004486717.1| PREDICTED: uncharacterized protein LOC101501726 [Cicer arietinum]
          Length = 709

 Score =  658 bits (1697), Expect = 0.0
 Identities = 339/525 (64%), Positives = 413/525 (78%), Gaps = 8/525 (1%)
 Frame = -1

Query: 1713 DIPKKNTTYGFLVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMT 1534
            +IP+ N+TYG LVGPFGS ED ILEWSP KRSGTC+RKG FARLVWSR+F+LIFHELSMT
Sbjct: 207  EIPETNSTYGLLVGPFGSTEDRILEWSPQKRSGTCNRKGDFARLVWSRRFILIFHELSMT 266

Query: 1533 GAPLAMMELATEFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLI 1354
            GAPL+MMELATE LSCGAT+S + L++KGGLMSEL RR+IK+L DK+DLSFKTAMKA+L+
Sbjct: 267  GAPLSMMELATELLSCGATVSAVALSRKGGLMSELARRRIKLLEDKADLSFKTAMKADLV 326

Query: 1353 IAGSAVCSSWIEQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQ 1174
            IAGSAVC+SWIEQY+     G+SQ+ WWIMENRREYF+R+K VL+RVK L+FLSESQSKQ
Sbjct: 327  IAGSAVCASWIEQYIEHFPAGASQVAWWIMENRREYFNRTKGVLDRVKMLVFLSESQSKQ 386

Query: 1173 WLDWCEEENIHLKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVRE 994
            W  WCEEENI L+  P ++PLSVNDELAF AGI  +LNTPSF T+ M+EK+Q LR+ VR+
Sbjct: 387  WQKWCEEENIKLRSRPEIIPLSVNDELAFVAGIPSTLNTPSFDTDKMIEKKQLLRESVRK 446

Query: 993  EMGLNDDDMLVVSLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSVLLDHDYYSRAL 814
            EMGL D DMLV+SLSSINPGKGQLLL+ESA  V+E GQ                      
Sbjct: 447  EMGLTDHDMLVISLSSINPGKGQLLLLESAISVVEHGQ---------------------- 484

Query: 813  LQNGKRDNESSNI----DTPTKK-RIRSSRIFTNEGR--LNSARYGRDARMRKMLSENVG 655
            LQ+ K+  +SSNI     T T+K RIR       +G+  L        +R +++L  N  
Sbjct: 485  LQDDKKMKKSSNIKEGLSTLTRKQRIRKLLPMLKDGKVALKDISINSLSRRKQVLPNNKT 544

Query: 654  KKGQNLKVLIGSVGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYV 475
               Q+LKVLIGSVGSKSNK  YVK+LL++L+ H N SK+VLWTP+TT+VASLY+AADVYV
Sbjct: 545  TTQQSLKVLIGSVGSKSNKADYVKSLLSFLAQHPNTSKTVLWTPSTTQVASLYSAADVYV 604

Query: 474  MNSQGIGETFGRVTIEAMAFGLPVLGTDSGGTREIVEHNITGLLHPLGR-PGSQVLARNF 298
            +NSQG+GETFGRVTIEAMAFGLPVLGTD+GGT+EIVE+N+TGLLHP+GR  G+ VLA+N 
Sbjct: 605  INSQGLGETFGRVTIEAMAFGLPVLGTDAGGTKEIVENNVTGLLHPVGRAAGNDVLAQNL 664

Query: 297  EFFLENPQARHEMGMRGREKVEKMYLKKHMFQKFGEVLYKCMRIK 163
             + L+N  AR +MGM GR+KVE+MYLK+HM++KF EV+ +CMR K
Sbjct: 665  VYLLKNQLARKQMGMEGRKKVERMYLKQHMYKKFVEVIVRCMRNK 709


>ref|XP_006597141.1| PREDICTED: uncharacterized protein LOC100793827 isoform X1 [Glycine
            max] gi|571514725|ref|XP_006597142.1| PREDICTED:
            uncharacterized protein LOC100793827 isoform X2 [Glycine
            max]
          Length = 701

 Score =  654 bits (1688), Expect = 0.0
 Identities = 334/519 (64%), Positives = 405/519 (78%), Gaps = 2/519 (0%)
 Frame = -1

Query: 1713 DIPKKNTTYGFLVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMT 1534
            +IP  N+TYG LVGPFG +ED ILEWSP+KRSGTC+RK  FARLVWSR+F+LIFHELSMT
Sbjct: 200  EIPTTNSTYGLLVGPFGPMEDRILEWSPEKRSGTCNRKEDFARLVWSRRFILIFHELSMT 259

Query: 1533 GAPLAMMELATEFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLI 1354
            GAPL+MMELATE LSCGAT+S +VL++KGGLMSEL RR+IKVL DK+DLSFKTAMKA+L+
Sbjct: 260  GAPLSMMELATELLSCGATVSAVVLSRKGGLMSELARRRIKVLEDKADLSFKTAMKADLV 319

Query: 1353 IAGSAVCSSWIEQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQ 1174
            IAGSAVC+SWIEQY+     G+SQ+ WWIMENRREYFDRSK VL+RVK L+FLSESQSKQ
Sbjct: 320  IAGSAVCASWIEQYIEHFPAGASQVAWWIMENRREYFDRSKDVLHRVKMLVFLSESQSKQ 379

Query: 1173 WLDWCEEENIHLKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVRE 994
            W  WCEEE+I L+  P +VPLSVNDELAF AGI  +LNTPSF+TE M+EK+Q LR+ VR+
Sbjct: 380  WQKWCEEESIKLRSHPEIVPLSVNDELAFVAGIPSTLNTPSFSTEKMVEKKQLLRESVRK 439

Query: 993  EMGLNDDDMLVVSLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSVLLDHDYYSRAL 814
            EMGL D+DMLV+SLSSINPGKGQLLL+ES   V+EQGQ   +   K+   +     S A 
Sbjct: 440  EMGLTDNDMLVISLSSINPGKGQLLLLESVSSVLEQGQSPGDKKMKEVSNIKEGLSSLA- 498

Query: 813  LQNGKRDNESSNIDTPTKKRIRSSRIFTNEGRL--NSARYGRDARMRKMLSENVGKKGQN 640
                             K RIR      + G++  NS      +R +++L  + G   Q+
Sbjct: 499  ----------------RKHRIRKLLPLMSNGKVASNSISSNSLSRRKQVLPNDKGTIQQS 542

Query: 639  LKVLIGSVGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYVMNSQG 460
            LK+LIGSV SKSNK  YVK+LL++L  H N S S+ WTPATTRVASLY+AADVYV+NSQG
Sbjct: 543  LKLLIGSVRSKSNKADYVKSLLSFLEQHPNTSTSIFWTPATTRVASLYSAADVYVINSQG 602

Query: 459  IGETFGRVTIEAMAFGLPVLGTDSGGTREIVEHNITGLLHPLGRPGSQVLARNFEFFLEN 280
            +GETFGRVTIEAMAFGLPVLGTD+GGT+EIVEHN+TGLLHP+G PG+ VLA+N  F L+N
Sbjct: 603  LGETFGRVTIEAMAFGLPVLGTDAGGTQEIVEHNVTGLLHPVGHPGNLVLAQNLWFLLKN 662

Query: 279  PQARHEMGMRGREKVEKMYLKKHMFQKFGEVLYKCMRIK 163
              AR +MG+ GR+KV+KMYLK+ M++ F EV+ +CMR K
Sbjct: 663  QSARKQMGVVGRKKVQKMYLKQQMYKNFVEVIARCMRSK 701


>ref|XP_003542107.1| PREDICTED: uncharacterized protein LOC100795000 isoform X1 [Glycine
            max] gi|571503664|ref|XP_006595144.1| PREDICTED:
            uncharacterized protein LOC100795000 isoform X2 [Glycine
            max]
          Length = 701

 Score =  653 bits (1685), Expect = 0.0
 Identities = 335/524 (63%), Positives = 405/524 (77%), Gaps = 7/524 (1%)
 Frame = -1

Query: 1713 DIPKKNTTYGFLVGPFGSVEDSILEWSPDKRSGTCDRKGAFARLVWSRKFVLIFHELSMT 1534
            +IP  N TYG LVGPFG +ED ILEWSP+KRSGTC+RK  FARLVWSR+F+LIFHELSMT
Sbjct: 200  EIPTTNNTYGLLVGPFGPMEDRILEWSPEKRSGTCNRKEDFARLVWSRRFILIFHELSMT 259

Query: 1533 GAPLAMMELATEFLSCGATISVIVLNKKGGLMSELNRRKIKVLVDKSDLSFKTAMKANLI 1354
            GAPL+MMELATE LSCGAT+S +VL++KGGLMSEL RR+IKVL DKSDLSFKTAMKA+L+
Sbjct: 260  GAPLSMMELATELLSCGATVSAVVLSRKGGLMSELARRRIKVLEDKSDLSFKTAMKADLV 319

Query: 1353 IAGSAVCSSWIEQYLSRTVLGSSQIMWWIMENRREYFDRSKHVLNRVKKLIFLSESQSKQ 1174
            IAGSAVC+SWIEQY+     G+SQ+ WWIMENRREYFDRSK +L+RVK L+FLSESQSKQ
Sbjct: 320  IAGSAVCASWIEQYIDHFPAGASQVAWWIMENRREYFDRSKDILHRVKMLVFLSESQSKQ 379

Query: 1173 WLDWCEEENIHLKDEPALVPLSVNDELAFAAGISCSLNTPSFTTENMLEKRQSLRKVVRE 994
            W  WCEEE+I L+  P +V LSVN+ELAF AGI  +LNTPSF+TE M+EK+Q LR+ VR+
Sbjct: 380  WQKWCEEESIKLRSLPEIVALSVNEELAFVAGIPSTLNTPSFSTEKMVEKKQLLRESVRK 439

Query: 993  EMGLNDDDMLVVSLSSINPGKGQLLLMESARLVIEQGQKLNNSGSKDSVLLDHDYYSRAL 814
            EMGL D+DMLV+SLSSINPGKGQLLL+ES   V+EQGQ                      
Sbjct: 440  EMGLTDNDMLVISLSSINPGKGQLLLLESVSSVLEQGQ---------------------- 477

Query: 813  LQNGKRDNESSNI-----DTPTKKRIRSSRIFTNEGRL--NSARYGRDARMRKMLSENVG 655
            LQ+ K+  + SNI         K RIR        G++  NS      +R +++L    G
Sbjct: 478  LQDDKKMKKVSNIKEGLSSLTRKHRIRKLLPLMKNGKVASNSISSNSLSRRKQVLPNGKG 537

Query: 654  KKGQNLKVLIGSVGSKSNKVAYVKTLLTYLSTHSNLSKSVLWTPATTRVASLYAAADVYV 475
               Q+LK+LIGSV SKSNK  YVK+LL++L  H N S S+ WTPATTRVASLY+AADVYV
Sbjct: 538  TIQQSLKLLIGSVRSKSNKADYVKSLLSFLEQHPNASTSIFWTPATTRVASLYSAADVYV 597

Query: 474  MNSQGIGETFGRVTIEAMAFGLPVLGTDSGGTREIVEHNITGLLHPLGRPGSQVLARNFE 295
            +NSQG+GETFGRVTIEAMA+GLPVLGTD+GGTREIVE+N+TGLLHP+G PG+ VLA+N  
Sbjct: 598  INSQGLGETFGRVTIEAMAYGLPVLGTDAGGTREIVENNVTGLLHPVGHPGNDVLAQNLR 657

Query: 294  FFLENPQARHEMGMRGREKVEKMYLKKHMFQKFGEVLYKCMRIK 163
            F L+N  AR +MG+ GR+KV+KMYLK+HM++ F EV+ +CMR K
Sbjct: 658  FLLKNQLARKQMGVEGRKKVQKMYLKQHMYKNFVEVITRCMRSK 701

