BLASTX nr result

ID: Akebia25_contig00004295 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00004295
         (1935 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007210253.1| hypothetical protein PRUPE_ppa003418mg [Prun...   581   e-163
ref|XP_006368254.1| hypothetical protein POPTR_0001s01040g [Popu...   580   e-163
ref|XP_004300397.1| PREDICTED: uncharacterized protein LOC101306...   577   e-162
ref|NP_974886.1| major facilitator protein [Arabidopsis thaliana...   575   e-161
ref|XP_002863496.1| hypothetical protein ARALYDRAFT_494458 [Arab...   575   e-161
gb|EXC35192.1| hypothetical protein L484_022746 [Morus notabilis]     575   e-161
gb|AAV84499.1| At5g45275 [Arabidopsis thaliana] gi|56790236|gb|A...   574   e-161
ref|XP_006398191.1| hypothetical protein EUTSA_v10000837mg [Eutr...   571   e-160
ref|XP_006280245.1| hypothetical protein CARUB_v10026159mg [Caps...   570   e-160
ref|XP_002274370.1| PREDICTED: uncharacterized protein LOC100263...   570   e-160
ref|XP_007040912.1| Major facilitator superfamily protein isofor...   568   e-159
ref|XP_007218877.1| hypothetical protein PRUPE_ppa003313mg [Prun...   568   e-159
ref|XP_004308650.1| PREDICTED: uncharacterized protein LOC101296...   566   e-158
gb|AAM60820.1| unknown [Arabidopsis thaliana]                         563   e-158
ref|XP_007029848.1| Major facilitator superfamily protein isofor...   563   e-157
ref|XP_002869987.1| hypothetical protein ARALYDRAFT_492916 [Arab...   562   e-157
emb|CBI29223.3| unnamed protein product [Vitis vinifera]              562   e-157
ref|NP_567588.1| major facilitator protein [Arabidopsis thaliana...   562   e-157
ref|XP_006283421.1| hypothetical protein CARUB_v10004470mg [Caps...   562   e-157
ref|XP_006471628.1| PREDICTED: uncharacterized protein LOC102626...   561   e-157

>ref|XP_007210253.1| hypothetical protein PRUPE_ppa003418mg [Prunus persica]
            gi|462405988|gb|EMJ11452.1| hypothetical protein
            PRUPE_ppa003418mg [Prunus persica]
          Length = 576

 Score =  581 bits (1498), Expect = e-163
 Identities = 324/577 (56%), Positives = 380/577 (65%), Gaps = 40/577 (6%)
 Frame = -2

Query: 1835 GQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLATASDLGKALGWSS 1656
            GQSRKW+ILVA IWIQAFTGTNFDFS+YSS LKSV+GISQV+LNYLATASDLGK LGWSS
Sbjct: 4    GQSRKWMILVATIWIQAFTGTNFDFSSYSSSLKSVLGISQVQLNYLATASDLGKVLGWSS 63

Query: 1655 GLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLLAGCSICWFNTVC 1476
            GLAL+YFPLW VLF+A  MGFVGYG+QWL+IR +ISLPYFL+FL CLLAGCSICWFNTVC
Sbjct: 64   GLALMYFPLWVVLFIAAFMGFVGYGIQWLVIRQIISLPYFLMFLLCLLAGCSICWFNTVC 123

Query: 1475 FVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLILNXXXXXXXXXXX 1296
            FVLCIRNFP N+ LA+SLTV FNG+S ALY +   AI S+  SL+LILN           
Sbjct: 124  FVLCIRNFPANQPLAISLTVSFNGVSAALYNLAADAIDSSSTSLFLILNAVIPLLTSVAA 183

Query: 1295 XXXXLNQSPQQPEPKDIVPHDXXXXXXXXXXXXITGFPXXXXXXXXXIASTPWPLVTAAI 1116
                + Q    P P D V  D            +TG              T    +  AI
Sbjct: 184  LIPIVRQPSLDPLPPDGVRRDSLIFLLLNILAVLTGIYLLLFGSTSYDTETARLFLGGAI 243

Query: 1115 F-LVLPLCVPGIVLARN--HRPL---------------VNDLELHKVQIEDNDCMGNDNK 990
            F L+ PL +PGIV AR+  HR +               V+DLELHK  +   + +   N 
Sbjct: 244  FLLIFPLFIPGIVYARDWFHRAIHSSIRIEGSGFVLVDVDDLELHKELLTRENSLNYGNG 303

Query: 989  A------NDDA---------RHGVEPSGGC-------RLARLGEEHSAQVLLSRWDFWLY 876
            +      N+D          ++G + +G C       +LA LGE+H+A+ L+ R DFWLY
Sbjct: 304  SVVQPVNNNDGPTTTLSFRQKNGYQSAGCCGAIVGKDQLAMLGEDHTARALVRRLDFWLY 363

Query: 875  YVVYLCGGTIGLVYGNNLGQIAQSLGYSSKTTTTLVTLYSSCSFFGRLLSAAPDFMPRKV 696
            YV Y CGGTIGLVY NN+GQIAQSLG SS  TTTLVTLYSS SFFGRLLSAAPD++  K+
Sbjct: 364  YVAYFCGGTIGLVYSNNMGQIAQSLGQSS-NTTTLVTLYSSFSFFGRLLSAAPDYIRAKL 422

Query: 695  YFARTGWLTIALLPNTIXXXXXXXXXXXXXLHIGTALIGLSSGFIFAAAVSITSELFGSN 516
            YFARTGWL IALLP  I             LH GTALIGLSSGFIF+AAVSITSELFG N
Sbjct: 423  YFARTGWLAIALLPTPIAFMLLASSGGSLALHTGTALIGLSSGFIFSAAVSITSELFGPN 482

Query: 515  SVGVNHNLLVTNIPIGSLLYGLLAALVYDANVSSNFGDSGIGSDEAMECMGRKCYLRTFV 336
            SVGVNHN+++TNIPIGSL+YGLLAA+VYD+N SS  G S +   +++ CMGR CY  TFV
Sbjct: 483  SVGVNHNIVITNIPIGSLVYGLLAAIVYDSNASS--GLSILTFSDSVVCMGRDCYFLTFV 540

Query: 335  WWGIXXXXXXXXXXXXXLRTKPAYNRFEQNRITSQVY 225
            WW               LRT+ AY+ FE NR T Q+Y
Sbjct: 541  WWACISVLGLASSVLLFLRTRHAYDHFEHNRST-QLY 576


>ref|XP_006368254.1| hypothetical protein POPTR_0001s01040g [Populus trichocarpa]
            gi|550346157|gb|ERP64823.1| hypothetical protein
            POPTR_0001s01040g [Populus trichocarpa]
          Length = 598

 Score =  580 bits (1496), Expect = e-163
 Identities = 316/556 (56%), Positives = 372/556 (66%), Gaps = 13/556 (2%)
 Frame = -2

Query: 1835 GQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLATASDLGKALGWSS 1656
            GQSRKW+ILVA +WIQAFTGTNFDFSAYSSDLKSV+GISQV+LNYLA ASDLGK  GWSS
Sbjct: 52   GQSRKWMILVATVWIQAFTGTNFDFSAYSSDLKSVLGISQVQLNYLAVASDLGKVFGWSS 111

Query: 1655 GLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLLAGCSICWFNTVC 1476
            GLALLYFPLW VLFMA  MG  GYG+QWL++R +ISLPY LVFL CLLAGCSICWFNTVC
Sbjct: 112  GLALLYFPLWVVLFMAAFMGLFGYGLQWLVMRDIISLPYILVFLLCLLAGCSICWFNTVC 171

Query: 1475 FVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLILNXXXXXXXXXXX 1296
            FVLCI+NFP NR LALSLT+ FNG+S ALYT+   AI S+ + +YL+LN           
Sbjct: 172  FVLCIQNFPANRPLALSLTIAFNGVSAALYTLAGNAIDSSSNDIYLLLNAFIPLITSVVS 231

Query: 1295 XXXXLNQSPQQPEPKDIVPHDXXXXXXXXXXXXITGFPXXXXXXXXXIASTPWPLVTAAI 1116
                + Q    P P D V  D            +TG             +    L+  AI
Sbjct: 232  LIPIIRQPSLDPLPPDGVRRDSLIFLILNFLAILTGIYLLLFGSSSSDGTRARLLLGGAI 291

Query: 1115 F-LVLPLCVPGIVLARN--HRPLVNDLELHK---VQIEDNDCMGNDNKANDDARHGVEPS 954
            F L+ PLC+PGIV AR   HR + +   +H    + ++ +D   +      + +   E  
Sbjct: 292  FLLIFPLCIPGIVYAREWFHRTIHSSFSIHGSGFILVDVDDLELHKELITRERKSSGEKE 351

Query: 953  GGC-------RLARLGEEHSAQVLLSRWDFWLYYVVYLCGGTIGLVYGNNLGQIAQSLGY 795
            G C       RLA LGEEH   +L+SR DFWLYY  Y+CGGTIGLVY NNLGQIAQSLG 
Sbjct: 352  GCCDSIVKKDRLAMLGEEHPVSLLVSRLDFWLYYTAYVCGGTIGLVYSNNLGQIAQSLGQ 411

Query: 794  SSKTTTTLVTLYSSCSFFGRLLSAAPDFMPRKVYFARTGWLTIALLPNTIXXXXXXXXXX 615
            SS  TTTLVTLYSS SFFGRLLSAAPD++  K+YFART WLTIAL+P  I          
Sbjct: 412  SS-NTTTLVTLYSSFSFFGRLLSAAPDYIRAKMYFARTAWLTIALVPTPIAFFLLAASGN 470

Query: 614  XXXLHIGTALIGLSSGFIFAAAVSITSELFGSNSVGVNHNLLVTNIPIGSLLYGLLAALV 435
               LHI TAL+GLSSGFIFAAAVSITSELFG NSVGVNHN+L+TNIPIGSL+YG LAA+V
Sbjct: 471  AVALHISTALVGLSSGFIFAAAVSITSELFGPNSVGVNHNILITNIPIGSLVYGFLAAIV 530

Query: 434  YDANVSSNFGDSGIGSDEAMECMGRKCYLRTFVWWGIXXXXXXXXXXXXXLRTKPAYNRF 255
            YD++VSS+     I +D  + CMGR+CY  TFVWWG              LRT+ AY++F
Sbjct: 531  YDSHVSSSL---NIITDSVV-CMGRQCYFLTFVWWGCLSVLGLTSSLLLFLRTRHAYDQF 586

Query: 254  EQNRITSQVY*LTPFF 207
            E  RI+S    +TP +
Sbjct: 587  EAKRISS----MTPLY 598


>ref|XP_004300397.1| PREDICTED: uncharacterized protein LOC101306433 [Fragaria vesca
            subsp. vesca]
          Length = 627

 Score =  577 bits (1486), Expect = e-162
 Identities = 321/575 (55%), Positives = 375/575 (65%), Gaps = 38/575 (6%)
 Frame = -2

Query: 1835 GQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLATASDLGKALGWSS 1656
            GQSRKW+ILVA IWIQAFTGTNFDFS+YSS LKSV+GISQV+LNYLATASDLGK  GWSS
Sbjct: 58   GQSRKWMILVATIWIQAFTGTNFDFSSYSSVLKSVLGISQVQLNYLATASDLGKVFGWSS 117

Query: 1655 GLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLLAGCSICWFNTVC 1476
            GLAL+YFPLW VLF+A  MGFVGYG+QWL+IR +I LPYF++FL CLLAGCSICWFNTVC
Sbjct: 118  GLALMYFPLWVVLFIAAFMGFVGYGLQWLVIRQIIVLPYFVMFLLCLLAGCSICWFNTVC 177

Query: 1475 FVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLILNXXXXXXXXXXX 1296
            FVLCIRNFP N+ LA+SLTV FNG+S ALY +   AI S+  ++YLILN           
Sbjct: 178  FVLCIRNFPANQPLAISLTVSFNGVSAALYNLAADAIDSSSTTIYLILNAVIPLITSVAA 237

Query: 1295 XXXXLNQSPQQPEPKDIVPHDXXXXXXXXXXXXITGFPXXXXXXXXXIASTPWPLVTAAI 1116
                L Q    P P D V  D            +TG             +T    +  AI
Sbjct: 238  LIPILRQPSLDPLPPDGVRRDSVIFLFLNILAVLTGIYLLLFGSSSFDTATARVFLAGAI 297

Query: 1115 F-LVLPLCVPGIVLARN------HRPL-----------VNDLELHK-VQIEDNDCMGNDN 993
            F L+ PLC+PGIV AR+      H  +           V+DLELHK +   +    GN +
Sbjct: 298  FLLIFPLCIPGIVYARDWFRRAVHSNIRIEGSGFILVDVDDLELHKELLTREPSINGNGS 357

Query: 992  KANDDARHG------------VEPSGGC-------RLARLGEEHSAQVLLSRWDFWLYYV 870
             ++    +G            +E +G C       +LA LGEEHSA++L+ R DFWLYY+
Sbjct: 358  LSHLLVYNGASAQSLSFRNKSMECTGCCGTLVGKDQLAMLGEEHSARLLVRRLDFWLYYI 417

Query: 869  VYLCGGTIGLVYGNNLGQIAQSLGYSSKTTTTLVTLYSSCSFFGRLLSAAPDFMPRKVYF 690
             Y CGGTIGLVY NNLGQIAQSLG SS  TTTLVTLYSS SFFGRLLSA PD++  K YF
Sbjct: 418  AYFCGGTIGLVYSNNLGQIAQSLGQSS-NTTTLVTLYSSFSFFGRLLSAVPDYIRAKFYF 476

Query: 689  ARTGWLTIALLPNTIXXXXXXXXXXXXXLHIGTALIGLSSGFIFAAAVSITSELFGSNSV 510
            ARTGWLTIALLP  +             LH GTALIGLSSGFIFAAAVSITSELFG NSV
Sbjct: 477  ARTGWLTIALLPTPVAFILLAASSGTMALHAGTALIGLSSGFIFAAAVSITSELFGPNSV 536

Query: 509  GVNHNLLVTNIPIGSLLYGLLAALVYDANVSSNFGDSGIGSDEAMECMGRKCYLRTFVWW 330
            GVNHN+L+TNIP+GSL+YG LAA+VYDANVS+      I + + + CMGR CY  TFVWW
Sbjct: 537  GVNHNILITNIPLGSLVYGFLAAIVYDANVSTGL---SIVTSDTIVCMGRNCYFLTFVWW 593

Query: 329  GIXXXXXXXXXXXXXLRTKPAYNRFEQNRITSQVY 225
                            RT+ AY+ FE NR T Q+Y
Sbjct: 594  ACISVLGLASSVLLFFRTRHAYDHFEHNRST-QLY 627


>ref|NP_974886.1| major facilitator protein [Arabidopsis thaliana]
            gi|332007841|gb|AED95224.1| major facilitator protein
            [Arabidopsis thaliana]
          Length = 570

 Score =  575 bits (1483), Expect = e-161
 Identities = 320/572 (55%), Positives = 368/572 (64%), Gaps = 36/572 (6%)
 Frame = -2

Query: 1835 GQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLATASDLGKALGWSS 1656
            GQSRKW+ILVA IWIQAFTGTNFDFS YSS+LKSV+GISQV+LNYLA ASDLGK  GWSS
Sbjct: 3    GQSRKWMILVATIWIQAFTGTNFDFSTYSSNLKSVLGISQVQLNYLAVASDLGKVFGWSS 62

Query: 1655 GLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLLAGCSICWFNTVC 1476
            GLALLYFPLWTVLF A +MGFVGYGVQWL+I ++ISLPY LVFL CLLAG SICWFNTVC
Sbjct: 63   GLALLYFPLWTVLFAAAIMGFVGYGVQWLVITNVISLPYILVFLCCLLAGLSICWFNTVC 122

Query: 1475 FVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLILNXXXXXXXXXXX 1296
            FVLCIRNFP NR+LALSLTV FNG+S ALYT+   AI+     LYL+LN           
Sbjct: 123  FVLCIRNFPANRSLALSLTVSFNGVSAALYTLAYNAINPVSTELYLLLNALVPLFVSFAA 182

Query: 1295 XXXXLNQSPQQPEPKDIVPHDXXXXXXXXXXXXITGFPXXXXXXXXXIASTPWPLVTAAI 1116
                L Q P +P P D V  D            + G             ++   L   +I
Sbjct: 183  LIPILRQPPLEPLPPDGVRRDSLMFLLLNILAVLNGVYLLLFRSKTSDVTSARLLFGGSI 242

Query: 1115 -FLVLPLCVPGIVLARN------HRPL-----------VNDLELHKVQIEDNDCMGNDNK 990
              L+LPLC+PG+V ARN      H              V++LE+HK  +     +     
Sbjct: 243  LLLILPLCLPGLVYARNWYLHNIHSSFRLEGSGFILVDVDELEMHKGMVTREASLEGYQL 302

Query: 989  ANDDA---------RHGVEPSGGC---------RLARLGEEHSAQVLLSRWDFWLYYVVY 864
             NDD          +  +E   GC         +L  LGEEH    LL R DFWLYY+ Y
Sbjct: 303  LNDDVVRAVNTPDQKSFIEDDDGCCCTKVITRNQLGMLGEEHPLSFLLCRSDFWLYYIAY 362

Query: 863  LCGGTIGLVYGNNLGQIAQSLGYSSKTTTTLVTLYSSCSFFGRLLSAAPDFMPRKVYFAR 684
             CGGTIGLVY NNLGQIAQSLG SS+ TTTLVTLYSS SFFGRLLSA PD++  KVYFAR
Sbjct: 363  FCGGTIGLVYSNNLGQIAQSLGQSSE-TTTLVTLYSSFSFFGRLLSATPDYIRAKVYFAR 421

Query: 683  TGWLTIALLPNTIXXXXXXXXXXXXXLHIGTALIGLSSGFIFAAAVSITSELFGSNSVGV 504
            TGWL +ALLP TI             L  GTALIGLSSGFIFAAAVSITSELFG NSVGV
Sbjct: 422  TGWLAVALLPTTIALFLLASSGSLAALQAGTALIGLSSGFIFAAAVSITSELFGPNSVGV 481

Query: 503  NHNLLVTNIPIGSLLYGLLAALVYDANVSSNFGDSGIGSDEAMECMGRKCYLRTFVWWGI 324
            NHN+L+TNIPIGSL+YG LAALVY+++  +          E++ CMGR CYL+TF+WWG 
Sbjct: 482  NHNILITNIPIGSLVYGFLAALVYESHSVAG------SKTESVICMGRDCYLQTFMWWGC 535

Query: 323  XXXXXXXXXXXXXLRTKPAYNRFEQNRITSQV 228
                         LRT+ AY RFEQ+RITS +
Sbjct: 536  LSVIGLASSVVLFLRTRRAYQRFEQDRITSSM 567


>ref|XP_002863496.1| hypothetical protein ARALYDRAFT_494458 [Arabidopsis lyrata subsp.
            lyrata] gi|297309331|gb|EFH39755.1| hypothetical protein
            ARALYDRAFT_494458 [Arabidopsis lyrata subsp. lyrata]
          Length = 570

 Score =  575 bits (1483), Expect = e-161
 Identities = 320/572 (55%), Positives = 368/572 (64%), Gaps = 36/572 (6%)
 Frame = -2

Query: 1835 GQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLATASDLGKALGWSS 1656
            GQSRKW+ILVA IWIQAFTGTNFDFS YSS+LKSV+GISQV+LNYLA ASDLGK  GWSS
Sbjct: 3    GQSRKWMILVATIWIQAFTGTNFDFSTYSSNLKSVLGISQVQLNYLAVASDLGKVFGWSS 62

Query: 1655 GLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLLAGCSICWFNTVC 1476
            GLALLYFPLWTVLF A +MGFVGYGVQWL+I ++ISLPY LVFL CLLAG SICWFNTVC
Sbjct: 63   GLALLYFPLWTVLFAAAIMGFVGYGVQWLVITNVISLPYILVFLCCLLAGLSICWFNTVC 122

Query: 1475 FVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLILNXXXXXXXXXXX 1296
            FVLCIRNFP NR+LALSLTV FNG+S ALYT+   AI+     LYL+LN           
Sbjct: 123  FVLCIRNFPANRSLALSLTVSFNGVSAALYTLAYNAINPVSTELYLLLNALVPLFVSFAA 182

Query: 1295 XXXXLNQSPQQPEPKDIVPHDXXXXXXXXXXXXITGFPXXXXXXXXXIASTPWPLVTAA- 1119
                L Q P +P P D V  D            + G             ++   L   + 
Sbjct: 183  LIPILRQPPLEPLPPDGVRRDSLMFLLLNILAVLNGVYLLLFRSKTSDVTSARLLFGGSL 242

Query: 1118 IFLVLPLCVPGIVLARN------HRPL-----------VNDLELHKVQIEDNDCMGNDNK 990
            + L+LPLC+PG+V ARN      H              V++LE+HK  +     +     
Sbjct: 243  LLLILPLCLPGLVYARNWYLHNIHSSFRLEGSGFILVDVDELEMHKGMVTREASLEGYQL 302

Query: 989  ANDDA---------RHGVEPSGGC---------RLARLGEEHSAQVLLSRWDFWLYYVVY 864
             NDD          +  +E   GC         +L  LGEEH   +LL R DFWLYY+ Y
Sbjct: 303  LNDDVVRAVNTPDQKSFIEDDDGCCCTKLITRNQLGMLGEEHPLSLLLCRSDFWLYYIAY 362

Query: 863  LCGGTIGLVYGNNLGQIAQSLGYSSKTTTTLVTLYSSCSFFGRLLSAAPDFMPRKVYFAR 684
             CGGTIGLVY NNLGQIAQSLG SS+ TTTLVTLYSS SFFGRLLSA PD++  KVYFAR
Sbjct: 363  FCGGTIGLVYSNNLGQIAQSLGQSSE-TTTLVTLYSSFSFFGRLLSATPDYIRAKVYFAR 421

Query: 683  TGWLTIALLPNTIXXXXXXXXXXXXXLHIGTALIGLSSGFIFAAAVSITSELFGSNSVGV 504
            TGWL +ALLP TI             L  GTALIGLSSGFIFAAAVSITSELFG NSVGV
Sbjct: 422  TGWLAVALLPTTIALFLLASSGSLAALQAGTALIGLSSGFIFAAAVSITSELFGPNSVGV 481

Query: 503  NHNLLVTNIPIGSLLYGLLAALVYDANVSSNFGDSGIGSDEAMECMGRKCYLRTFVWWGI 324
            NHN+L+TNIPIGSL+YG LAALVY+++  +          E++ CMGR CYL TFVWWG 
Sbjct: 482  NHNILITNIPIGSLVYGFLAALVYESHSVAG------SKTESVICMGRDCYLLTFVWWGC 535

Query: 323  XXXXXXXXXXXXXLRTKPAYNRFEQNRITSQV 228
                         LRT+ AY RFEQ+RITS +
Sbjct: 536  LSVIGLASSVVLFLRTRRAYQRFEQDRITSSM 567


>gb|EXC35192.1| hypothetical protein L484_022746 [Morus notabilis]
          Length = 583

 Score =  575 bits (1481), Expect = e-161
 Identities = 329/586 (56%), Positives = 378/586 (64%), Gaps = 49/586 (8%)
 Frame = -2

Query: 1835 GQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLATASDLGKALGWSS 1656
            GQSRKW+ILVA IWIQAFTGTNFDFSAYSS LKSV+GISQV+LNYLA ASD+GK LGWSS
Sbjct: 3    GQSRKWMILVATIWIQAFTGTNFDFSAYSSALKSVLGISQVQLNYLAVASDMGKVLGWSS 62

Query: 1655 GLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLLAGCSICWFNTVC 1476
            GLAL+YFPL+ VLFMA  MGF+GYG+QWL+I  LISLPYFLVFL CLLAGCSICWFNTVC
Sbjct: 63   GLALMYFPLYVVLFMAAFMGFLGYGIQWLVITQLISLPYFLVFLLCLLAGCSICWFNTVC 122

Query: 1475 FVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLILNXXXXXXXXXXX 1296
            FVLCIRNFP N+ LA+SLTV FNG+S ALYT+   AI+ +   LYLILN           
Sbjct: 123  FVLCIRNFPSNQPLAISLTVSFNGVSAALYTLAANAINPSSPPLYLILNAFIPLVASFAA 182

Query: 1295 XXXXLNQSPQQPEPKDIVPHDXXXXXXXXXXXXITG-FPXXXXXXXXXIASTPWPLVTAA 1119
                L Q P  P P D V  D            +TG +           AST   L   +
Sbjct: 183  LIPILRQPPLDPLPPDGVRRDSLIFLILNVLAVLTGLYLLLFGSKPSDPASTARLLFGGS 242

Query: 1118 IF-LVLPLCVPGIVLARN--HRPL----------------VNDLELHKVQIEDNDCMGND 996
            IF L+ PLC+PG+V AR+  +R +                V+DLELHK  +   +   + 
Sbjct: 243  IFLLIFPLCIPGVVYARDWFYRAVNSSFSLDGGSGFILVDVDDLELHKELMITREAASSA 302

Query: 995  N---------------------KANDDARHGVEPSGGC--------RLARLGEEHSAQVL 903
            N                     K +       + S GC        RLA LGEEHSA+ L
Sbjct: 303  NGNGSVLDSPLLSESVGVLGLLKNSSTTYSSAKISEGCCETMIGKDRLAMLGEEHSARDL 362

Query: 902  LSRWDFWLYYVVYLCGGTIGLVYGNNLGQIAQSLGYSSKTTTTLVTLYSSCSFFGRLLSA 723
            + R DFWLY+V Y CGGTIGLVY NNLGQIAQSLG+ SK TT LVTLYSS SFFGRLLSA
Sbjct: 363  VRRLDFWLYFVAYFCGGTIGLVYSNNLGQIAQSLGHGSK-TTALVTLYSSFSFFGRLLSA 421

Query: 722  APDFMPRKVYFARTGWLTIALLPNTIXXXXXXXXXXXXXLHIGTALIGLSSGFIFAAAVS 543
            APD++  K YFARTGWLTIAL+P                L +GTALIGLSSGFIFAAAVS
Sbjct: 422  APDYVRAKFYFARTGWLTIALVPTPFAFFLLAALGNAAALQVGTALIGLSSGFIFAAAVS 481

Query: 542  ITSELFGSNSVGVNHNLLVTNIPIGSLLYGLLAALVYDANVSSNFGDSGIGSDEAMECMG 363
            ITSELFG NSV VNHN+L+TNIPIGSL+YGLLAA+VYDAN SS   D  + +D A+ CMG
Sbjct: 482  ITSELFGPNSVSVNHNILITNIPIGSLVYGLLAAIVYDANGSSGLRD--VITDSAV-CMG 538

Query: 362  RKCYLRTFVWWGIXXXXXXXXXXXXXLRTKPAYNRFEQNRITSQVY 225
            R+CY  TFVWWG              LRT+ AY+ FE  R TSQ+Y
Sbjct: 539  RQCYSSTFVWWGCISVLGLASSLMLFLRTRHAYDHFEHQR-TSQLY 583


>gb|AAV84499.1| At5g45275 [Arabidopsis thaliana] gi|56790236|gb|AAW30035.1| At5g45275
            [Arabidopsis thaliana]
          Length = 570

 Score =  574 bits (1479), Expect = e-161
 Identities = 319/572 (55%), Positives = 367/572 (64%), Gaps = 36/572 (6%)
 Frame = -2

Query: 1835 GQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLATASDLGKALGWSS 1656
            GQSRKW+ILVA IWIQAFTGTNFDFS YSS+LKSV+GISQV+LNYLA ASDLGK  GWSS
Sbjct: 3    GQSRKWMILVATIWIQAFTGTNFDFSTYSSNLKSVLGISQVQLNYLAVASDLGKVFGWSS 62

Query: 1655 GLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLLAGCSICWFNTVC 1476
            GLALLYFPLWTVLF A +MGFVGYGVQWL+I ++ISLPY LVFL CLLAG SICWFNTVC
Sbjct: 63   GLALLYFPLWTVLFAAAIMGFVGYGVQWLVITNVISLPYILVFLCCLLAGLSICWFNTVC 122

Query: 1475 FVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLILNXXXXXXXXXXX 1296
            FVLCIRNFP NR+LALSLTV FNG+S ALYT+   AI+     LYL+LN           
Sbjct: 123  FVLCIRNFPANRSLALSLTVSFNGVSAALYTLAYNAINPVSTELYLLLNALVPLFVSFAA 182

Query: 1295 XXXXLNQSPQQPEPKDIVPHDXXXXXXXXXXXXITGFPXXXXXXXXXIASTPWPLVTAAI 1116
                L Q P +P P D V  D            + G             ++   L   +I
Sbjct: 183  LIPILRQPPLEPLPPDGVRRDSLMFLLLNILAVLNGVYLLLFRSKTSDVTSARLLFGGSI 242

Query: 1115 -FLVLPLCVPGIVLARN------HRPL-----------VNDLELHKVQIEDNDCMGNDNK 990
              L+LPLC+PG+V ARN      H              V++LE+HK  +     +     
Sbjct: 243  LLLILPLCLPGLVYARNWYLHNIHSSFRLEGSGFILVDVDELEMHKGMVTREASLEGYQL 302

Query: 989  ANDDA---------RHGVEPSGGC---------RLARLGEEHSAQVLLSRWDFWLYYVVY 864
             NDD          +  +E   GC         +L  LGEEH    LL R DFWLYY+ Y
Sbjct: 303  LNDDVVRAVNTPDQKSFIEDDDGCCCTKVITRNQLGMLGEEHPLSFLLCRSDFWLYYIAY 362

Query: 863  LCGGTIGLVYGNNLGQIAQSLGYSSKTTTTLVTLYSSCSFFGRLLSAAPDFMPRKVYFAR 684
             CGGTIGLVY NNLGQIAQSLG SS+ TTTLVTLYSS SFFGRLLSA PD++  KVYFAR
Sbjct: 363  FCGGTIGLVYSNNLGQIAQSLGQSSE-TTTLVTLYSSFSFFGRLLSATPDYIRAKVYFAR 421

Query: 683  TGWLTIALLPNTIXXXXXXXXXXXXXLHIGTALIGLSSGFIFAAAVSITSELFGSNSVGV 504
            TGWL +ALLP TI             L  GTALIGLSSGFIFAAAVSITSELFG NSVGV
Sbjct: 422  TGWLAVALLPTTIALFLLASSGSLAALQAGTALIGLSSGFIFAAAVSITSELFGPNSVGV 481

Query: 503  NHNLLVTNIPIGSLLYGLLAALVYDANVSSNFGDSGIGSDEAMECMGRKCYLRTFVWWGI 324
            NHN+L+TNIPIGSL+YG LAAL Y+++  +          E++ CMGR CYL+TF+WWG 
Sbjct: 482  NHNILITNIPIGSLVYGFLAALAYESHSVAG------SKTESVICMGRDCYLQTFMWWGC 535

Query: 323  XXXXXXXXXXXXXLRTKPAYNRFEQNRITSQV 228
                         LRT+ AY RFEQ+RITS +
Sbjct: 536  LSVIGLASSVVLFLRTRRAYQRFEQDRITSSM 567


>ref|XP_006398191.1| hypothetical protein EUTSA_v10000837mg [Eutrema salsugineum]
            gi|557099280|gb|ESQ39644.1| hypothetical protein
            EUTSA_v10000837mg [Eutrema salsugineum]
          Length = 570

 Score =  571 bits (1472), Expect = e-160
 Identities = 319/572 (55%), Positives = 367/572 (64%), Gaps = 36/572 (6%)
 Frame = -2

Query: 1835 GQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLATASDLGKALGWSS 1656
            GQSRKW+ILVA IWIQAFTGTNFDFS YSSDLKSV+GISQV+LNYLA ASDLGK  GWSS
Sbjct: 3    GQSRKWMILVATIWIQAFTGTNFDFSTYSSDLKSVLGISQVQLNYLAVASDLGKVFGWSS 62

Query: 1655 GLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLLAGCSICWFNTVC 1476
            GLALLYFPLWTVLF A +MGFVGYGVQWL+I ++ISLPY LVFL CLLAG SICWFNTVC
Sbjct: 63   GLALLYFPLWTVLFAAAIMGFVGYGVQWLVITNIISLPYILVFLCCLLAGLSICWFNTVC 122

Query: 1475 FVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLILNXXXXXXXXXXX 1296
            FVLCIRNFP NR+LALSLTV FNG+S ALYT+   AI+     LYL+LN           
Sbjct: 123  FVLCIRNFPANRSLALSLTVSFNGVSAALYTLAYNAINPVSTQLYLLLNALIPLIISFAA 182

Query: 1295 XXXXLNQSPQQPEPKDIVPHDXXXXXXXXXXXXITGFPXXXXXXXXXIASTPWPLVTAAI 1116
                L Q P +P P D V  D            + G             ++   L   +I
Sbjct: 183  LIPILRQPPLEPLPPDGVRRDSLMFLLLNILAVLNGVYLLLFGSKTSDVTSARLLFGGSI 242

Query: 1115 -FLVLPLCVPGIVLARN------HRPL-----------VNDLELHKVQIEDNDCMGNDNK 990
              L+LPLC+PG+V ARN      H              V+DLE+HK        +     
Sbjct: 243  LLLILPLCLPGLVYARNWYLHNVHSSFRLEGSGFILVDVDDLEMHKGLATREASLEGYQL 302

Query: 989  ANDD-ARHGVEP---------SGGC--------RLARLGEEHSAQVLLSRWDFWLYYVVY 864
             NDD  R  + P         +  C        +L  LGEEH   +LL R DFWLYY+ Y
Sbjct: 303  LNDDVVRTAITPDQKSFIEDDNVSCCNKLITRNQLGMLGEEHPLSLLLCRSDFWLYYIAY 362

Query: 863  LCGGTIGLVYGNNLGQIAQSLGYSSKTTTTLVTLYSSCSFFGRLLSAAPDFMPRKVYFAR 684
             CGGTIGLVY NNLGQIAQSLG+SS+ TTTLVTLYSS SFFGRLLSA PD++  K YFAR
Sbjct: 363  FCGGTIGLVYSNNLGQIAQSLGHSSE-TTTLVTLYSSFSFFGRLLSATPDYIRAKFYFAR 421

Query: 683  TGWLTIALLPNTIXXXXXXXXXXXXXLHIGTALIGLSSGFIFAAAVSITSELFGSNSVGV 504
            TGWLT+ALLP T+             L  GTALIGLSSGFIFAAAVSITSELFG NSVGV
Sbjct: 422  TGWLTVALLPTTVALFLLASSGSLSALQAGTALIGLSSGFIFAAAVSITSELFGPNSVGV 481

Query: 503  NHNLLVTNIPIGSLLYGLLAALVYDANVSSNFGDSGIGSDEAMECMGRKCYLRTFVWWGI 324
            NHN+L+TNIPIGSL+YG LAALVY+++  +          E++ CMGR CYL TF+WWG 
Sbjct: 482  NHNILITNIPIGSLVYGFLAALVYESHSIAG------SKTESVICMGRDCYLLTFLWWGC 535

Query: 323  XXXXXXXXXXXXXLRTKPAYNRFEQNRITSQV 228
                         +RT+ AY RFEQ+RI S +
Sbjct: 536  LSVIGLGSSVFLFMRTRRAYKRFEQDRIASSM 567


>ref|XP_006280245.1| hypothetical protein CARUB_v10026159mg [Capsella rubella]
            gi|482548949|gb|EOA13143.1| hypothetical protein
            CARUB_v10026159mg [Capsella rubella]
          Length = 570

 Score =  570 bits (1469), Expect = e-160
 Identities = 319/572 (55%), Positives = 365/572 (63%), Gaps = 36/572 (6%)
 Frame = -2

Query: 1835 GQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLATASDLGKALGWSS 1656
            GQSRKW+ILVA IWIQAFTGTNFDFS YSS+LKSV+GISQV+LNYLA ASDLGK  GWSS
Sbjct: 3    GQSRKWMILVATIWIQAFTGTNFDFSTYSSNLKSVLGISQVQLNYLAVASDLGKVFGWSS 62

Query: 1655 GLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLLAGCSICWFNTVC 1476
            GLALLYFPLWTVLF A  MGFVGYGVQWL+I ++I+LPY LVFL CLLAG SICWFNTVC
Sbjct: 63   GLALLYFPLWTVLFAAASMGFVGYGVQWLVITNVIALPYILVFLCCLLAGLSICWFNTVC 122

Query: 1475 FVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLILNXXXXXXXXXXX 1296
            FVLCIRNFP NR+LALSLTV FNG+S ALYT+   AI+      YL+LN           
Sbjct: 123  FVLCIRNFPANRSLALSLTVSFNGVSAALYTLAYNAINPVSTQQYLLLNSLIPLVVSFAS 182

Query: 1295 XXXXLNQSPQQPEPKDIVPHDXXXXXXXXXXXXITGFPXXXXXXXXXIASTPWPLVTAAI 1116
                L Q P +P P D V  D            + G             ++   L   +I
Sbjct: 183  LIPILRQPPLEPLPPDGVRRDSLMFLLLNILAVLNGVYLLLFGSKTSDVTSARLLFGGSI 242

Query: 1115 -FLVLPLCVPGIVLARN------HRPL-----------VNDLELHKVQIEDNDCMGNDNK 990
              L+LPLC+PG+V ARN      H              V++LE+HK  +  +  +     
Sbjct: 243  LLLILPLCLPGLVYARNWYLHNVHSSFRLEGSGFILVDVDELEMHKGMVTRDASVDGYQL 302

Query: 989  ANDDARHGV---------EPSGGC---------RLARLGEEHSAQVLLSRWDFWLYYVVY 864
             NDD    V         E   GC          L  LGEEH   +LL R DFWLYY+ Y
Sbjct: 303  LNDDVMRAVITSDQKSFIEDDDGCCCNKLITRNLLGMLGEEHPLYLLLRRSDFWLYYIAY 362

Query: 863  LCGGTIGLVYGNNLGQIAQSLGYSSKTTTTLVTLYSSCSFFGRLLSAAPDFMPRKVYFAR 684
             CGGTIGLVY NNLGQIAQSLG SS+ TTTLVTLYSS SFFGRLLSA PD++  KVYFAR
Sbjct: 363  FCGGTIGLVYSNNLGQIAQSLGQSSE-TTTLVTLYSSFSFFGRLLSATPDYIRAKVYFAR 421

Query: 683  TGWLTIALLPNTIXXXXXXXXXXXXXLHIGTALIGLSSGFIFAAAVSITSELFGSNSVGV 504
            TGWL +ALLP TI             L  GTALIGLSSGFIFAAAVSITSELFG NSVGV
Sbjct: 422  TGWLAVALLPTTIALFLLASSGSLAALQAGTALIGLSSGFIFAAAVSITSELFGPNSVGV 481

Query: 503  NHNLLVTNIPIGSLLYGLLAALVYDANVSSNFGDSGIGSDEAMECMGRKCYLRTFVWWGI 324
            NHN+L+TNIPIGSL+YG LAALVY+++  +        + E++ CMGR CY  TFVWWG 
Sbjct: 482  NHNILITNIPIGSLVYGFLAALVYESHSMAG------SNTESVICMGRDCYFLTFVWWGC 535

Query: 323  XXXXXXXXXXXXXLRTKPAYNRFEQNRITSQV 228
                         LRT+ AY RFEQ+RITS +
Sbjct: 536  LSVIGLASSVFLFLRTRRAYQRFEQDRITSSM 567


>ref|XP_002274370.1| PREDICTED: uncharacterized protein LOC100263024 isoform 2 [Vitis
            vinifera]
          Length = 570

 Score =  570 bits (1469), Expect = e-160
 Identities = 316/573 (55%), Positives = 380/573 (66%), Gaps = 36/573 (6%)
 Frame = -2

Query: 1835 GQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLATASDLGKALGWSS 1656
            GQSRKW+ILVA IWIQAFTGTNFDFSAYSS+LK+V+G+SQV+LNYLATASDLGK  GWSS
Sbjct: 3    GQSRKWMILVATIWIQAFTGTNFDFSAYSSELKTVLGVSQVQLNYLATASDLGKLFGWSS 62

Query: 1655 GLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLLAGCSICWFNTVC 1476
            GLAL+Y PLW V+FM+  MGF  YG+QWL+IRS+I+LPYFLVFL CLLAGCSICWFNTVC
Sbjct: 63   GLALMYMPLWVVMFMSAFMGFFAYGLQWLVIRSIITLPYFLVFLLCLLAGCSICWFNTVC 122

Query: 1475 FVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLILNXXXXXXXXXXX 1296
            FVLC +NFP NR LA+SLTV FNG+S ALY +   AI+ + DSLYL+LN           
Sbjct: 123  FVLCTQNFPANRPLAISLTVSFNGVSAALYALAADAINPSSDSLYLLLNAVIPLLTSIVA 182

Query: 1295 XXXXLNQSPQQPEPKDIVPHDXXXXXXXXXXXXITGFPXXXXXXXXXIASTPWPLVTAAI 1116
                L Q    P P D V  D            +TG            A+T   L + AI
Sbjct: 183  LPPILRQPSLDPLPPDAVRRDSLIFLILNFLAVLTGVYLLLISSISSNATTSRLLFSGAI 242

Query: 1115 F-LVLPLCVPGIVLARN-HRPLVN----------------DLELHKVQIED--------- 1017
            F LVLP+C+PG+V A+N  R  VN                DLELHK  I           
Sbjct: 243  FLLVLPICIPGVVYAKNWFRRTVNSSFRLDGSGFILVDADDLELHKELITRSGSGYGNGI 302

Query: 1016 NDCMGNDNKANDDARH-GVEPSGGC-------RLARLGEEHSAQVLLSRWDFWLYYVVYL 861
            +D + ++   ++  R+  VE    C       +L  LGEEH A++L+ R DFWLYY+ Y 
Sbjct: 303  SDIIKSNGSTHEIVRYNSVERESCCEKLMGKDQLVMLGEEHRARMLVRRLDFWLYYIAYF 362

Query: 860  CGGTIGLVYGNNLGQIAQSLGYSSKTTTTLVTLYSSCSFFGRLLSAAPDFMPRKVYFART 681
            CGGTIGLVY NNLGQIAQSLG SS  T+ L+T+YS+ S+FGRLLSAAPD+M  KVYFART
Sbjct: 363  CGGTIGLVYSNNLGQIAQSLGNSS-DTSALITIYSAFSYFGRLLSAAPDYMRAKVYFART 421

Query: 680  GWLTIALLPNTI-XXXXXXXXXXXXXLHIGTALIGLSSGFIFAAAVSITSELFGSNSVGV 504
            GWL+IALLP  +              LH  TAL+GLSSGFIFAAAVSITSELFG NSVGV
Sbjct: 422  GWLSIALLPTPVAFFLLAASGSSGSILHASTALVGLSSGFIFAAAVSITSELFGPNSVGV 481

Query: 503  NHNLLVTNIPIGSLLYGLLAALVYDANVSSNFGDSGIGSDEAMECMGRKCYLRTFVWWGI 324
            NHN+L+TNIPIGSL+YG+LAA++YDAN+ S+     + +D A+ CMG +CY  TFV WG 
Sbjct: 482  NHNILITNIPIGSLVYGMLAAIIYDANIGSSL---RMVTDTAV-CMGTRCYFLTFVLWGS 537

Query: 323  XXXXXXXXXXXXXLRTKPAYNRFEQNRITSQVY 225
                         LRT+ AY+RFE NRI+SQ+Y
Sbjct: 538  LSVIGLVCSVLLFLRTRHAYDRFEHNRISSQLY 570


>ref|XP_007040912.1| Major facilitator superfamily protein isoform 1 [Theobroma cacao]
            gi|508778157|gb|EOY25413.1| Major facilitator superfamily
            protein isoform 1 [Theobroma cacao]
          Length = 610

 Score =  568 bits (1464), Expect = e-159
 Identities = 318/583 (54%), Positives = 374/583 (64%), Gaps = 36/583 (6%)
 Frame = -2

Query: 1871 LLRYLIMADQGKGQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLAT 1692
            LLR+++      GQSRKW+ILVA IWIQAFTGTNFDFSAYS+++K V+GISQV+LNYLA 
Sbjct: 33   LLRFVMA-----GQSRKWMILVATIWIQAFTGTNFDFSAYSTEMKRVLGISQVQLNYLAV 87

Query: 1691 ASDLGKALGWSSGLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLL 1512
            ASD+GKA GWSSGLAL YFPLW VLFMA  MG  GYG+QWL+IR++ISLPY LVF  CLL
Sbjct: 88   ASDMGKAFGWSSGLALTYFPLWVVLFMAAFMGLFGYGIQWLVIRNVISLPYMLVFCLCLL 147

Query: 1511 AGCSICWFNTVCFVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLIL 1332
            AGCSICWFNTVCFVLCI+NFP NRALALSLTV +NG+S ALY +   AI+++  SLYL+L
Sbjct: 148  AGCSICWFNTVCFVLCIKNFPANRALALSLTVSYNGVSAALYALAGDAINASSSSLYLLL 207

Query: 1331 NXXXXXXXXXXXXXXXLNQSPQQPEPKDIVPHDXXXXXXXXXXXXITGFPXXXXXXXXXI 1152
            N               L Q P  P   + V  D            +TG            
Sbjct: 208  NSLVPLIISIAALVPILRQPPVDPLSPEAVRSDSIMFLLLNVLALLTGVYLLIFGSNATD 267

Query: 1151 ASTPWPLVTAAIF-LVLPLCVPGIVLARN--HRPL---------------VNDLELHKVQ 1026
            ++T   L   AIF LV PLCVPG+V AR+  H  +                +DLELHK +
Sbjct: 268  STTARLLFGGAIFLLVFPLCVPGVVYARHWFHHTVHSSFQLGGSGFILVDDDDLELHK-R 326

Query: 1025 IEDNDCMGNDNKAN------------------DDARHGVEPSGGCRLARLGEEHSAQVLL 900
            +   +   ND   +                  D AR   +  G  +L  LGEEH AQVL+
Sbjct: 327  LLSREASFNDRNGSLSDDASEYKMGSQKCIDEDSARCCEKMIGKDQLVILGEEHPAQVLV 386

Query: 899  SRWDFWLYYVVYLCGGTIGLVYGNNLGQIAQSLGYSSKTTTTLVTLYSSCSFFGRLLSAA 720
             RWDFWLYYV Y CGGTIGLVY NNLGQIAQSLG SS  T  L+TLYSS SFFGRLLSAA
Sbjct: 387  RRWDFWLYYVAYFCGGTIGLVYSNNLGQIAQSLGESS-NTALLLTLYSSFSFFGRLLSAA 445

Query: 719  PDFMPRKVYFARTGWLTIALLPNTIXXXXXXXXXXXXXLHIGTALIGLSSGFIFAAAVSI 540
            PD++  K+YFARTGWL IALLP  I             L  GTALIGLSSGFIFAAAVS+
Sbjct: 446  PDYVRAKMYFARTGWLAIALLPTPIAFFLLAGLGNSMALRAGTALIGLSSGFIFAAAVSV 505

Query: 539  TSELFGSNSVGVNHNLLVTNIPIGSLLYGLLAALVYDANVSSNFGDSGIGSDEAMECMGR 360
            TSELFG NSVGVNHN+L+TNIPIGSL+YG+LAA+VYDAN         +   +++ CMGR
Sbjct: 506  TSELFGPNSVGVNHNILITNIPIGSLVYGVLAAIVYDANAGKGL---KLSFADSVVCMGR 562

Query: 359  KCYLRTFVWWGIXXXXXXXXXXXXXLRTKPAYNRFEQNRITSQ 231
            +CY  TFVWWG              LRTK AY+ FE+NR  ++
Sbjct: 563  QCYFLTFVWWGCLSILGLASSLLLFLRTKHAYDAFERNRALAE 605


>ref|XP_007218877.1| hypothetical protein PRUPE_ppa003313mg [Prunus persica]
            gi|462415339|gb|EMJ20076.1| hypothetical protein
            PRUPE_ppa003313mg [Prunus persica]
          Length = 585

 Score =  568 bits (1464), Expect = e-159
 Identities = 310/567 (54%), Positives = 371/567 (65%), Gaps = 32/567 (5%)
 Frame = -2

Query: 1835 GQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLATASDLGKALGWSS 1656
            GQSRKW+ILVA  WIQAFTGTNFDFS+YSSDLKSV+GISQV+LNYL+ ASD+GKALGW S
Sbjct: 3    GQSRKWMILVAATWIQAFTGTNFDFSSYSSDLKSVLGISQVQLNYLSVASDMGKALGWCS 62

Query: 1655 GLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLLAGCSICWFNTVC 1476
            G++L+YFPLW V+FMA  MG  GYG+QW +I+ LI+LPY LVF+ CLLAGCSICWFNTVC
Sbjct: 63   GVSLMYFPLWAVMFMAAFMGLFGYGLQWFVIQRLITLPYVLVFILCLLAGCSICWFNTVC 122

Query: 1475 FVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLILNXXXXXXXXXXX 1296
            +VLCI++F  NRALALSLTV FNG+S ALYT++  AI+  DD++YL LN           
Sbjct: 123  YVLCIKHFQANRALALSLTVSFNGVSAALYTLIANAINPNDDTIYLFLNALVPLFTSSVA 182

Query: 1295 XXXXLNQSPQQPEPKDIVPHDXXXXXXXXXXXXITGFPXXXXXXXXXIASTPWPLVTAAI 1116
                L Q P Q  P D    D            ITG             S    L+  A+
Sbjct: 183  LIPVLRQPPIQSLPADATRRDSIIFLCLNILAVITGLYLLLLNSLSSHVSKARMLLVGAL 242

Query: 1115 F-LVLPLCVPGIVL----ARNHRP------------LV--NDLELHKVQIEDNDCMGNDN 993
            F L+LPLC+PGI      AR + P            LV  +DLELHKV I  ++     +
Sbjct: 243  FLLILPLCLPGIAYGREWARRNFPSRFPSDNSSTFNLVDPDDLELHKVLIAGSESTNATS 302

Query: 992  KANDDARHGVEPSG--GC-----------RLARLGEEHSAQVLLSRWDFWLYYVVYLCGG 852
              N ++    +  G   C           RL  LGEEHSA++L+ R DFWLYY  Y CGG
Sbjct: 303  ATNANSLGMTDTEGFFRCFKCFGKVMEKGRLTVLGEEHSAKLLVRRRDFWLYYAAYFCGG 362

Query: 851  TIGLVYGNNLGQIAQSLGYSSKTTTTLVTLYSSCSFFGRLLSAAPDFMPRKVYFARTGWL 672
            TIGLVY NNLGQI+QSLGYSS  T++LVTLYSSCSFFGRLLSAAPDF+  K+YFARTGWL
Sbjct: 363  TIGLVYSNNLGQISQSLGYSS-LTSSLVTLYSSCSFFGRLLSAAPDFLRDKIYFARTGWL 421

Query: 671  TIALLPNTIXXXXXXXXXXXXXLHIGTALIGLSSGFIFAAAVSITSELFGSNSVGVNHNL 492
             +AL+P  I             L  GT LIG+SSGF+F+AAVS+TSELFG NS GVNHN+
Sbjct: 422  AVALVPTPIAFFLLAASGSEAMLRAGTGLIGISSGFVFSAAVSVTSELFGPNSAGVNHNI 481

Query: 491  LVTNIPIGSLLYGLLAALVYDANVSSNFGDSGIGSDEAMECMGRKCYLRTFVWWGIXXXX 312
            L+TNIPIGSLLYGLLAALVYD+N  S+     +  D  + CMGR CY +TF+WWG     
Sbjct: 482  LITNIPIGSLLYGLLAALVYDSNEGSSIIGVSLLKDATL-CMGRSCYRQTFIWWGCISIV 540

Query: 311  XXXXXXXXXLRTKPAYNRFEQNRITSQ 231
                     LRT+ AYNRFE+NR  +Q
Sbjct: 541  GLASSLFLFLRTRTAYNRFERNRNRTQ 567


>ref|XP_004308650.1| PREDICTED: uncharacterized protein LOC101296911 [Fragaria vesca
            subsp. vesca]
          Length = 565

 Score =  566 bits (1459), Expect = e-158
 Identities = 308/566 (54%), Positives = 372/566 (65%), Gaps = 31/566 (5%)
 Frame = -2

Query: 1835 GQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLATASDLGKALGWSS 1656
            G+SRKW+ILV   WIQAFTGTNFDFS+YSS+LK+V+GISQV+LNYL+ ASD+GKALGW S
Sbjct: 3    GESRKWMILVVTTWIQAFTGTNFDFSSYSSELKTVLGISQVQLNYLSVASDMGKALGWCS 62

Query: 1655 GLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLLAGCSICWFNTVC 1476
            G++L+Y PLW V+FMA LMG  GYG+QW +I  +I+LPY LVF  CLLAGCSICWFNTVC
Sbjct: 63   GVSLMYLPLWVVMFMAALMGLFGYGLQWFVIERMITLPYVLVFFLCLLAGCSICWFNTVC 122

Query: 1475 FVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLILNXXXXXXXXXXX 1296
            +VLCIR+F  NRALALSLT+ FNG+S ALY+++  AI+ +DD+LYL LN           
Sbjct: 123  YVLCIRHFQANRALALSLTISFNGVSAALYSLIANAINPSDDNLYLFLNALVPLFISGVA 182

Query: 1295 XXXXLNQSPQ-QPEPKDIVPHDXXXXXXXXXXXXITGFPXXXXXXXXXIASTPWPLVTAA 1119
                L Q P  QP   + +  D            +TG             S    L+  A
Sbjct: 183  LVPLLRQPPPIQPLSSEAIHRDSMVFLCLYILAVVTGLYLLLLNSLSSNISNARILLVGA 242

Query: 1118 IF-LVLPLCVPGIVLAR----NHRP------------LVN--DLELHKVQIEDNDCMGND 996
            IF L+LPLC+PGI  AR     + P            LVN  DLELHK  I +N+   N 
Sbjct: 243  IFLLILPLCLPGIAYAREWACRNMPFSFQNENSSDFNLVNPDDLELHKELIGENE---NG 299

Query: 995  NKANDDARHGVEPSGGC-----------RLARLGEEHSAQVLLSRWDFWLYYVVYLCGGT 849
            N  N  + +G+    GC           RL  LGEEHSA++L+ RWDFWLYY  Y CGGT
Sbjct: 300  NVVNATS-YGLIDKEGCLWCFGKVMEKDRLTVLGEEHSARLLVRRWDFWLYYAAYFCGGT 358

Query: 848  IGLVYGNNLGQIAQSLGYSSKTTTTLVTLYSSCSFFGRLLSAAPDFMPRKVYFARTGWLT 669
            IGLVY NNLGQI++SLGYSS  T++LVTLYSSCSFFGRLL+AAPDF+  K+YFARTGWL 
Sbjct: 359  IGLVYSNNLGQISESLGYSS-MTSSLVTLYSSCSFFGRLLAAAPDFLRDKIYFARTGWLA 417

Query: 668  IALLPNTIXXXXXXXXXXXXXLHIGTALIGLSSGFIFAAAVSITSELFGSNSVGVNHNLL 489
            +AL+P  I             L  GT LIG+SSGF+F+AAVS+TSELFG NS GVNHN+L
Sbjct: 418  VALVPTPIGFLLLTLSGSEAMLRAGTGLIGISSGFVFSAAVSVTSELFGPNSAGVNHNIL 477

Query: 488  VTNIPIGSLLYGLLAALVYDANVSSNFGDSGIGSDEAMECMGRKCYLRTFVWWGIXXXXX 309
            +TNIPIGSLLYGLLAALVYDAN  S+     +   EA  CMGR CY +TF+WW       
Sbjct: 478  ITNIPIGSLLYGLLAALVYDANEGSSVIQVNL-LKEATLCMGRSCYRQTFIWWSCISVVG 536

Query: 308  XXXXXXXXLRTKPAYNRFEQNRITSQ 231
                    LRT+ AYNRFE+NR  S+
Sbjct: 537  LASSILLFLRTRAAYNRFERNRCQSE 562


>gb|AAM60820.1| unknown [Arabidopsis thaliana]
          Length = 572

 Score =  563 bits (1452), Expect = e-158
 Identities = 320/571 (56%), Positives = 363/571 (63%), Gaps = 35/571 (6%)
 Frame = -2

Query: 1835 GQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLATASDLGKALGWSS 1656
            GQSRKW+ILVA IWIQAFTGTNFDFSAYSSDLKSV+GISQV+LNYLA ASDLGK  GWSS
Sbjct: 3    GQSRKWMILVATIWIQAFTGTNFDFSAYSSDLKSVLGISQVQLNYLAVASDLGKVFGWSS 62

Query: 1655 GLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLLAGCSICWFNTVC 1476
            GLAL+YFPLWTVLF A  MGFVGYGVQWL+I   ISLPY +VFL CLLAG SICWFNTVC
Sbjct: 63   GLALMYFPLWTVLFAAAFMGFVGYGVQWLVITHFISLPYIMVFLCCLLAGLSICWFNTVC 122

Query: 1475 FVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLILNXXXXXXXXXXX 1296
            FVLCI NFP NR+LALSLTV FNG+S ALYT+   AI+ T   LYL+LN           
Sbjct: 123  FVLCISNFPANRSLALSLTVSFNGVSAALYTLAYNAINPTSPELYLLLNALIPLIVSFTA 182

Query: 1295 XXXXLNQSPQQPEPKDIVPHDXXXXXXXXXXXXITGFPXXXXXXXXXIASTPWPLVTAAI 1116
                L Q P +P P D V  D            + G             ++   L   AI
Sbjct: 183  IIPILRQPPFEPLPPDGVRRDSLMFLLLNILAALNGVYLLLFGSNSSDLTSARLLFGGAI 242

Query: 1115 -FLVLPLCVPGIVLARN--HRPL---------------VNDLELHKVQI------EDNDC 1008
              LV PLC+PG+V+ARN  +R +                ++LELHK  +      E    
Sbjct: 243  LLLVFPLCIPGLVIARNWYNRTIHTSFRLEGSGFILVDPDELELHKGMLAHEANREGYQL 302

Query: 1007 MGNDNKANDDARHGVEPSGG----CR-------LARLGEEHSAQVLLSRWDFWLYYVVYL 861
            + +D   N      VE        C+       L  LG EHS  +LL+R DFWLYY+ Y 
Sbjct: 303  LSDDVVQNPVKSVAVEEEDSDESCCKKLITRDQLEGLGIEHSLSLLLTRSDFWLYYITYF 362

Query: 860  CGGTIGLVYGNNLGQIAQSLGYSSKTTTTLVTLYSSCSFFGRLLSAAPDFMPRKVYFART 681
            CGGTIGLVY NNLGQIAQSLG SS  TTTLVTLYSS SFFGRLLSA PD++  KVYFART
Sbjct: 363  CGGTIGLVYSNNLGQIAQSLGQSS-NTTTLVTLYSSFSFFGRLLSATPDYIRAKVYFART 421

Query: 680  GWLTIALLPNTIXXXXXXXXXXXXXLHIGTALIGLSSGFIFAAAVSITSELFGSNSVGVN 501
            GWL IALLP                L  GTAL+GLSSGFIFAAAVSITSELFG NSVGVN
Sbjct: 422  GWLAIALLPTPFALFLLASSGNASALQAGTALMGLSSGFIFAAAVSITSELFGPNSVGVN 481

Query: 500  HNLLVTNIPIGSLLYGLLAALVYDANVSSNFGDSGIGSDEAMECMGRKCYLRTFVWWGIX 321
            HN+L+TNIPIGSL+YG LAALVYD   S  F  +   + E++ CMGR CY  TFVWWG  
Sbjct: 482  HNILITNIPIGSLIYGFLAALVYD---SHGFTGTKSMTSESVVCMGRDCYYLTFVWWGCL 538

Query: 320  XXXXXXXXXXXXLRTKPAYNRFEQNRITSQV 228
                        +RT+ AY RFEQ RI+S +
Sbjct: 539  SLLGLGSSLVLFIRTRRAYQRFEQARISSNI 569


>ref|XP_007029848.1| Major facilitator superfamily protein isoform 1 [Theobroma cacao]
            gi|508718453|gb|EOY10350.1| Major facilitator superfamily
            protein isoform 1 [Theobroma cacao]
          Length = 558

 Score =  563 bits (1451), Expect = e-157
 Identities = 308/559 (55%), Positives = 369/559 (66%), Gaps = 24/559 (4%)
 Frame = -2

Query: 1835 GQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLATASDLGKALGWSS 1656
            GQSRKW++L+A  W+QAFTGTNFDFS+YSS LK+V+GISQV+LNYL+ ASD+GKA GW S
Sbjct: 3    GQSRKWMMLIATTWVQAFTGTNFDFSSYSSTLKTVLGISQVQLNYLSVASDMGKAFGWCS 62

Query: 1655 GLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLLAGCSICWFNTVC 1476
            G++L+Y PLW V+FMA  +G +GYGVQW +I+ +I+LPYFLVFL CL+AGCSICWFNTVC
Sbjct: 63   GVSLMYLPLWVVMFMAAFLGLLGYGVQWFVIKQVITLPYFLVFLLCLVAGCSICWFNTVC 122

Query: 1475 FVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLILNXXXXXXXXXXX 1296
            FVLCIRNF  +RALALSLT+ FNG+S ALYT++  AI+  DD+LYL LN           
Sbjct: 123  FVLCIRNFANSRALALSLTISFNGVSAALYTLIANAINPDDDTLYLFLNALVPLLASSLA 182

Query: 1295 XXXXLNQSPQQPEPKDIVPHDXXXXXXXXXXXXITGFPXXXXXXXXXIAS-TPWPLVTAA 1119
                L Q P Q    D V  D            ITG            AS     LV A 
Sbjct: 183  LIPILRQPPLQLLSTDAVSQDSFIFIILNVLAVITGLYLLLLNSLSSEASRARILLVGAL 242

Query: 1118 IFLVLPLCVPGIVLARN------HRPLV-----------NDLELHKVQI--EDNDCMGND 996
            I L+LPLC+PGIV  RN      H               +DLELHK  +  + N+ +   
Sbjct: 243  ILLLLPLCLPGIVCGRNWARHNIHTSFCLDGSTFSLVDPDDLELHKELLGSDYNNSLSVS 302

Query: 995  NKANDDARHG----VEPSGGCRLARLGEEHSAQVLLSRWDFWLYYVVYLCGGTIGLVYGN 828
            N      R G    V   G  RL  LGEEH A++L+ RWDFWLYY+ Y CGGTIGLVY N
Sbjct: 303  NSFCVTNREGFFKKVMEKG--RLTVLGEEHPARLLVHRWDFWLYYLAYFCGGTIGLVYSN 360

Query: 827  NLGQIAQSLGYSSKTTTTLVTLYSSCSFFGRLLSAAPDFMPRKVYFARTGWLTIALLPNT 648
            NLGQIAQS G+ S+  +T+VTLYSS SFFGRLLSAAPDF+  KVYFARTGWL +AL+P  
Sbjct: 361  NLGQIAQSRGFYSQ-ISTVVTLYSSFSFFGRLLSAAPDFLRDKVYFARTGWLAVALVPTP 419

Query: 647  IXXXXXXXXXXXXXLHIGTALIGLSSGFIFAAAVSITSELFGSNSVGVNHNLLVTNIPIG 468
            I             LH GTA+IGLSSGF+F+AAVSITSELFG NS  VNHN+L+TNIPIG
Sbjct: 420  IAFFLLAASGSEVALHAGTAMIGLSSGFVFSAAVSITSELFGPNSASVNHNILITNIPIG 479

Query: 467  SLLYGLLAALVYDANVSSNFGDSGIGSDEAMECMGRKCYLRTFVWWGIXXXXXXXXXXXX 288
            SLLYGLLAALVYD+NV S   ++ +G  EAM CMGR CY +TF++WG             
Sbjct: 480  SLLYGLLAALVYDSNVKSYSQENSLG--EAMVCMGRDCYQKTFIYWGCISLLGLISSFLL 537

Query: 287  XLRTKPAYNRFEQNRITSQ 231
             LRTKPAY+  E+NR  +Q
Sbjct: 538  FLRTKPAYDHLERNRSRAQ 556


>ref|XP_002869987.1| hypothetical protein ARALYDRAFT_492916 [Arabidopsis lyrata subsp.
            lyrata] gi|297315823|gb|EFH46246.1| hypothetical protein
            ARALYDRAFT_492916 [Arabidopsis lyrata subsp. lyrata]
          Length = 572

 Score =  562 bits (1449), Expect = e-157
 Identities = 320/571 (56%), Positives = 362/571 (63%), Gaps = 35/571 (6%)
 Frame = -2

Query: 1835 GQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLATASDLGKALGWSS 1656
            GQSRKW+ILVA IWIQAFTGTNFDFSAYSSDLKSV+GISQV+LNYLA ASDLGK  GWSS
Sbjct: 3    GQSRKWMILVATIWIQAFTGTNFDFSAYSSDLKSVLGISQVQLNYLAVASDLGKVFGWSS 62

Query: 1655 GLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLLAGCSICWFNTVC 1476
            GLAL+YFPLWTVLF A  MGFVGYGVQWL+I   ISLPY +VFL CLLAG SICWFNTVC
Sbjct: 63   GLALMYFPLWTVLFAAAFMGFVGYGVQWLVITHFISLPYIMVFLCCLLAGLSICWFNTVC 122

Query: 1475 FVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLILNXXXXXXXXXXX 1296
            FVLCI NFP NR+LALSLTV FNG+S ALYT+   AI+ T   LYL+LN           
Sbjct: 123  FVLCISNFPANRSLALSLTVSFNGVSAALYTLAYNAINPTSPELYLLLNALIPLIVSFTA 182

Query: 1295 XXXXLNQSPQQPEPKDIVPHDXXXXXXXXXXXXITGFPXXXXXXXXXIASTPWPLVTAAI 1116
                L Q P +P P D V  D            + G             ++   L   AI
Sbjct: 183  IIPILRQPPFEPLPPDGVRRDSLMFLLLNILAALNGVYLLLFGSNSTDLTSARLLFGGAI 242

Query: 1115 -FLVLPLCVPGIVLARN--HRPL---------------VNDLELHKVQI------EDNDC 1008
              L+ PLC+PG+V+ARN  +R +                +DLELHK  +      E    
Sbjct: 243  VLLIFPLCIPGLVIARNWYNRTIHTSFRLEGSGFILVDPDDLELHKGMLAHEANREGYQL 302

Query: 1007 MGNDNKANDDARHGVEPSGG----CR-------LARLGEEHSAQVLLSRWDFWLYYVVYL 861
            + +D   N      VE        C+       L  LG EHS  +LL R DFWLYY+ Y 
Sbjct: 303  LNDDVVQNPVKTVAVEEDDSDESCCKKLITRDQLEGLGIEHSLSLLLRRSDFWLYYIAYF 362

Query: 860  CGGTIGLVYGNNLGQIAQSLGYSSKTTTTLVTLYSSCSFFGRLLSAAPDFMPRKVYFART 681
            CGGTIGLVY NNLGQIAQSLG SS  TTTLVTLYS+ SFFGRLLSA PD++  KVYFART
Sbjct: 363  CGGTIGLVYSNNLGQIAQSLGQSS-NTTTLVTLYSAFSFFGRLLSATPDYIRAKVYFART 421

Query: 680  GWLTIALLPNTIXXXXXXXXXXXXXLHIGTALIGLSSGFIFAAAVSITSELFGSNSVGVN 501
            GWL IALLP                L  GTAL+GLSSGFIFAAAVSITSELFG NSVGVN
Sbjct: 422  GWLAIALLPTPFALFLLASSGNASALQAGTALMGLSSGFIFAAAVSITSELFGPNSVGVN 481

Query: 500  HNLLVTNIPIGSLLYGLLAALVYDANVSSNFGDSGIGSDEAMECMGRKCYLRTFVWWGIX 321
            HN+L+TNIPIGSL+YG LAALVYD   S  F  +   + E++ CMGR CY  TFVWWG  
Sbjct: 482  HNILITNIPIGSLIYGFLAALVYD---SHGFTGTKSMTAESVVCMGRDCYYLTFVWWGCL 538

Query: 320  XXXXXXXXXXXXLRTKPAYNRFEQNRITSQV 228
                        +RT+ AY RFEQ RI+S V
Sbjct: 539  SLFGLGSSLVLFIRTRRAYQRFEQARISSNV 569


>emb|CBI29223.3| unnamed protein product [Vitis vinifera]
          Length = 507

 Score =  562 bits (1449), Expect = e-157
 Identities = 303/539 (56%), Positives = 361/539 (66%), Gaps = 2/539 (0%)
 Frame = -2

Query: 1835 GQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLATASDLGKALGWSS 1656
            GQSRKW+ILVA IWIQAFTGTNFDFSAYSS+LK+V+G+SQV+LNYLATASDLGK  GWSS
Sbjct: 3    GQSRKWMILVATIWIQAFTGTNFDFSAYSSELKTVLGVSQVQLNYLATASDLGKLFGWSS 62

Query: 1655 GLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLLAGCSICWFNTVC 1476
            GLAL+Y PLW V+FM+  MGF  YG+QWL+IRS+I+LPYFLVFL CLLAGCSICWFNTVC
Sbjct: 63   GLALMYMPLWVVMFMSAFMGFFAYGLQWLVIRSIITLPYFLVFLLCLLAGCSICWFNTVC 122

Query: 1475 FVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLILNXXXXXXXXXXX 1296
            FVLC +NFP NR LA+SLTV FNG+S ALY +   AI+ + DSLYL+LN           
Sbjct: 123  FVLCTQNFPANRPLAISLTVSFNGVSAALYALAADAINPSSDSLYLLLNAVIPLLTSIVA 182

Query: 1295 XXXXLNQSPQQPEPKDIVPHDXXXXXXXXXXXXITGFPXXXXXXXXXIASTPWPLVTAAI 1116
                L Q    P P D V  D            +TG            A+T   L + AI
Sbjct: 183  LPPILRQPSLDPLPPDAVRRDSLIFLILNFLAVLTGVYLLLISSISSNATTSRLLFSGAI 242

Query: 1115 F-LVLPLCVPGIVLARNHRPLVNDLELHKVQIEDNDCMGNDNKANDDARHGVEPSGGCRL 939
            F LVLP+C+PG+V A+N                               R  +    G +L
Sbjct: 243  FLLVLPICIPGVVYAKNW-----------------------------FRRTLITRSGNQL 273

Query: 938  ARLGEEHSAQVLLSRWDFWLYYVVYLCGGTIGLVYGNNLGQIAQSLGYSSKTTTTLVTLY 759
              LGEEH A++L+ R DFWLYY+ Y CGGTIGLVY NNLGQIAQSLG SS  T+ L+T+Y
Sbjct: 274  VMLGEEHRARMLVRRLDFWLYYIAYFCGGTIGLVYSNNLGQIAQSLGNSS-DTSALITIY 332

Query: 758  SSCSFFGRLLSAAPDFMPRKVYFARTGWLTIALLPNTI-XXXXXXXXXXXXXLHIGTALI 582
            S+ S+FGRLLSAAPD+M  KVYFARTGWL+IALLP  +              LH  TAL+
Sbjct: 333  SAFSYFGRLLSAAPDYMRAKVYFARTGWLSIALLPTPVAFFLLAASGSSGSILHASTALV 392

Query: 581  GLSSGFIFAAAVSITSELFGSNSVGVNHNLLVTNIPIGSLLYGLLAALVYDANVSSNFGD 402
            GLSSGFIFAAAVSITSELFG NSVGVNHN+L+TNIPIGSL+YG+LAA++YDAN+ S+   
Sbjct: 393  GLSSGFIFAAAVSITSELFGPNSVGVNHNILITNIPIGSLVYGMLAAIIYDANIGSSL-- 450

Query: 401  SGIGSDEAMECMGRKCYLRTFVWWGIXXXXXXXXXXXXXLRTKPAYNRFEQNRITSQVY 225
              + +D A+ CMG +CY  TFV WG              LRT+ AY+RFE NRI+SQ+Y
Sbjct: 451  -RMVTDTAV-CMGTRCYFLTFVLWGSLSVIGLVCSVLLFLRTRHAYDRFEHNRISSQLY 507


>ref|NP_567588.1| major facilitator protein [Arabidopsis thaliana]
            gi|24030181|gb|AAN41272.1| unknown protein [Arabidopsis
            thaliana] gi|332658784|gb|AEE84184.1| major facilitator
            protein [Arabidopsis thaliana]
          Length = 572

 Score =  562 bits (1449), Expect = e-157
 Identities = 319/571 (55%), Positives = 363/571 (63%), Gaps = 35/571 (6%)
 Frame = -2

Query: 1835 GQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLATASDLGKALGWSS 1656
            GQSRKW+ILVA IWIQAFTGTNFDFSAYSSDLKSV+GISQV+LNYLA ASDLGK  GWSS
Sbjct: 3    GQSRKWMILVATIWIQAFTGTNFDFSAYSSDLKSVLGISQVQLNYLAVASDLGKVFGWSS 62

Query: 1655 GLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLLAGCSICWFNTVC 1476
            GLAL+YFPLWTVLF A  MGFVGYGVQWL+I   ISLPY +VFL CLLAG SICWFNTVC
Sbjct: 63   GLALMYFPLWTVLFAAAFMGFVGYGVQWLVITHFISLPYIMVFLCCLLAGLSICWFNTVC 122

Query: 1475 FVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLILNXXXXXXXXXXX 1296
            FVLCI NFP NR+LALSLTV FNG+S ALYT+   AI+ T   LYL+LN           
Sbjct: 123  FVLCISNFPANRSLALSLTVSFNGVSAALYTLAYNAINPTSPELYLLLNALIPLIVSFTA 182

Query: 1295 XXXXLNQSPQQPEPKDIVPHDXXXXXXXXXXXXITGFPXXXXXXXXXIASTPWPLVTAAI 1116
                L Q P +P P D V  D            + G             ++   L   AI
Sbjct: 183  IIPILRQPPFEPLPPDGVRRDSLMFLLLNILAALNGVYLLLFGSNSSDLTSARLLFGGAI 242

Query: 1115 -FLVLPLCVPGIVLARN--HRPL---------------VNDLELHKVQI------EDNDC 1008
              LV PLC+PG+V+ARN  +R +                ++LELHK  +      E    
Sbjct: 243  LLLVFPLCIPGLVIARNWYNRTIHTSFRLEGSGFILVDPDELELHKGMLAHEANREGYQL 302

Query: 1007 MGNDNKANDDARHGVEPSGG----CR-------LARLGEEHSAQVLLSRWDFWLYYVVYL 861
            + +D   N      VE        C+       L  LG EHS  +LL+R DFWLYY+ Y 
Sbjct: 303  LSDDVVQNPVKSVAVEEEDSDESCCKKLITRDQLEGLGIEHSLSLLLTRSDFWLYYITYF 362

Query: 860  CGGTIGLVYGNNLGQIAQSLGYSSKTTTTLVTLYSSCSFFGRLLSAAPDFMPRKVYFART 681
            CGGTIGLVY NNLGQIAQSLG SS  TTTLVTLYS+ SFFGRLLSA PD++  KVYFART
Sbjct: 363  CGGTIGLVYSNNLGQIAQSLGQSS-NTTTLVTLYSAFSFFGRLLSATPDYIRAKVYFART 421

Query: 680  GWLTIALLPNTIXXXXXXXXXXXXXLHIGTALIGLSSGFIFAAAVSITSELFGSNSVGVN 501
            GWL IALLP                L  GTAL+GLSSGFIFAAAVSITSELFG NSVGVN
Sbjct: 422  GWLAIALLPTPFALFLLASSGTASALQAGTALMGLSSGFIFAAAVSITSELFGPNSVGVN 481

Query: 500  HNLLVTNIPIGSLLYGLLAALVYDANVSSNFGDSGIGSDEAMECMGRKCYLRTFVWWGIX 321
            HN+L+TNIPIGSL+YG LAALVYD   S  F  +   + E++ CMGR CY  TFVWWG  
Sbjct: 482  HNILITNIPIGSLIYGFLAALVYD---SHGFTGTKSMTSESVVCMGRDCYYLTFVWWGCL 538

Query: 320  XXXXXXXXXXXXLRTKPAYNRFEQNRITSQV 228
                        +RT+ AY RFEQ RI+S +
Sbjct: 539  SLLGLGSSLVLFIRTRRAYQRFEQARISSNI 569


>ref|XP_006283421.1| hypothetical protein CARUB_v10004470mg [Capsella rubella]
            gi|482552126|gb|EOA16319.1| hypothetical protein
            CARUB_v10004470mg [Capsella rubella]
          Length = 571

 Score =  562 bits (1448), Expect = e-157
 Identities = 320/573 (55%), Positives = 363/573 (63%), Gaps = 37/573 (6%)
 Frame = -2

Query: 1835 GQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLATASDLGKALGWSS 1656
            GQSRKW+ILVA IWIQAFTGTNFDFSAYSSDLKSV+GISQV+LNYLA ASDLGK  GWSS
Sbjct: 3    GQSRKWMILVATIWIQAFTGTNFDFSAYSSDLKSVLGISQVQLNYLAVASDLGKVFGWSS 62

Query: 1655 GLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLLAGCSICWFNTVC 1476
            GLAL+YFPLWTVLF A  MGFVGYGVQWL+I   +SLPY +VFL CLLAG SICWFNTVC
Sbjct: 63   GLALMYFPLWTVLFAAAFMGFVGYGVQWLVITHFLSLPYIMVFLCCLLAGLSICWFNTVC 122

Query: 1475 FVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLILNXXXXXXXXXXX 1296
            FVLCI NFP NR+LALSLTV FNG+S ALYT+   AI+     LYL+LN           
Sbjct: 123  FVLCISNFPANRSLALSLTVSFNGVSAALYTLAYNAINPASPELYLLLNALIPLIVSFTA 182

Query: 1295 XXXXLNQSPQQPEPKDIVPHDXXXXXXXXXXXXITGFPXXXXXXXXXIASTPWPLVTAAI 1116
                L Q P +P P D V  D            + G             ++   L   AI
Sbjct: 183  IIPILRQPPFEPLPPDGVRRDSLMFLVLNILAALNGVYLLLFGSNSSDLTSARLLFGGAI 242

Query: 1115 -FLVLPLCVPGIVLARN------HRPL-----------VNDLELHKVQI------EDNDC 1008
              L+ PLC+PG+V+AR+      H               ++LELHK  +      E    
Sbjct: 243  VLLIFPLCIPGLVIARSWYDRTIHTSFRLEGSGFILVDPDELELHKGMLAHEANRESYQL 302

Query: 1007 MGNDNKANDDARHGVEP---SGGC--------RLARLGEEHSAQVLLSRWDFWLYYVVYL 861
            + +D   N      VE     G C        +L  LG EHS  +LL R DFWLYY+ Y 
Sbjct: 303  LSDDVVQNPVKTVAVEEDDIDGSCCTKLITKDQLEGLGIEHSLSLLLHRSDFWLYYIAYF 362

Query: 860  CGGTIGLVYGNNLGQIAQSLGYSSKTTTTLVTLYSSCSFFGRLLSAAPDFMPRKVYFART 681
            CGGTIGLVY NNLGQIAQSLG SS  TTTLVTLYSS SFFGRLLSA PD++  KVYFART
Sbjct: 363  CGGTIGLVYSNNLGQIAQSLGQSS-NTTTLVTLYSSFSFFGRLLSATPDYIRAKVYFART 421

Query: 680  GWLTIALLPNTIXXXXXXXXXXXXXLHIGTALIGLSSGFIFAAAVSITSELFGSNSVGVN 501
            GWL IALLP                L  GTAL+GLSSGFIFAAAVSITSELFG NSVGVN
Sbjct: 422  GWLAIALLPTPFALFLLALSGNASALQAGTALMGLSSGFIFAAAVSITSELFGPNSVGVN 481

Query: 500  HNLLVTNIPIGSLLYGLLAALVYDANVSSNFGDSGIGS--DEAMECMGRKCYLRTFVWWG 327
            HN+L+TNIPIGSL+YG LAALVYD++     G SGI S   E++ CMGR CY  TFVWWG
Sbjct: 482  HNILITNIPIGSLIYGFLAALVYDSH-----GFSGIKSVTSESVVCMGRDCYYLTFVWWG 536

Query: 326  IXXXXXXXXXXXXXLRTKPAYNRFEQNRITSQV 228
                          +RT+ AY RFEQ RI+S +
Sbjct: 537  CLSLIGLGSSLVLFIRTRRAYRRFEQARISSNI 569


>ref|XP_006471628.1| PREDICTED: uncharacterized protein LOC102626277 [Citrus sinensis]
          Length = 622

 Score =  561 bits (1446), Expect = e-157
 Identities = 318/578 (55%), Positives = 370/578 (64%), Gaps = 41/578 (7%)
 Frame = -2

Query: 1835 GQSRKWLILVAVIWIQAFTGTNFDFSAYSSDLKSVVGISQVKLNYLATASDLGKALGWSS 1656
            GQSRKW+ILVA IWIQAFTGTNFDFSAYSSDLKSV+G+SQV+LNYLATASDLGK  GW S
Sbjct: 47   GQSRKWMILVATIWIQAFTGTNFDFSAYSSDLKSVLGVSQVQLNYLATASDLGKLFGWLS 106

Query: 1655 GLALLYFPLWTVLFMAVLMGFVGYGVQWLIIRSLISLPYFLVFLSCLLAGCSICWFNTVC 1476
            GLAL+YFPLW VLFMA  MGF GYG+QWL+I  +ISLPY LVF  CLL+G SI WFNTVC
Sbjct: 107  GLALMYFPLWVVLFMAAFMGFFGYGLQWLVISHVISLPYILVFFLCLLSGLSITWFNTVC 166

Query: 1475 FVLCIRNFPKNRALALSLTVGFNGISPALYTILMKAISSTDDSLYLILNXXXXXXXXXXX 1296
            FVLCI+NFP NRALALSLTV FNG+S A+Y +   AI+ +  +LYL+LN           
Sbjct: 167  FVLCIQNFPANRALALSLTVSFNGVSAAIYALAGNAINPSSHALYLLLNALIPLITSLAA 226

Query: 1295 XXXXLNQSPQQPEPKDIVPHDXXXXXXXXXXXXITGFPXXXXXXXXXIASTPWPLVTAAI 1116
                L Q    P P + V  D            +TG             +    L   A+
Sbjct: 227  LIPILRQPSLDPLPPEGVKRDSFIFLILNIIAILTGVYLLLFGAHSSDLTVSRLLFGGAL 286

Query: 1115 FLVL-PLCVPGIVLARN------HRPL-----------VNDLELHKVQIEDNDCMGNDNK 990
            FL++ PLC+PGIV AR+      H              V+DLELHK  +       N+ K
Sbjct: 287  FLLMFPLCIPGIVYARDWFKRTIHSSFRLDGSGFLLIDVDDLELHKELLMREAEASNNGK 346

Query: 989  ANDD--------------ARHGVEPSGGC--------RLARLGEEHSAQVLLSRWDFWLY 876
              D                R      GGC        +LA LG+EHSA++L+ R DFWLY
Sbjct: 347  ELDQPLLSTDDISMTYSLTRTKSFEKGGCCETIIGKDQLAMLGQEHSARLLVLRLDFWLY 406

Query: 875  YVVYLCGGTIGLVYGNNLGQIAQSLGYSSKTTTTLVTLYSSCSFFGRLLSAAPDFMPRKV 696
            Y+ Y CGG IGLVY NNLGQIAQSLG SS+ TTTL+TLYSS SFFGRLLSAAPD+M  KV
Sbjct: 407  YIAYFCGGAIGLVYSNNLGQIAQSLGESSR-TTTLLTLYSSFSFFGRLLSAAPDYMRAKV 465

Query: 695  YFARTGWLTIALLPNTIXXXXXXXXXXXXXLHIGTALIGLSSGFIFAAAVSITSELFGSN 516
            YFARTGWLTIALLP  +             L  GT+LIGLSSGFIFAAAVSITSELFG N
Sbjct: 466  YFARTGWLTIALLPTPVAFCLLATSGNAVALQAGTSLIGLSSGFIFAAAVSITSELFGPN 525

Query: 515  SVGVNHNLLVTNIPIGSLLYGLLAALVYDANVSSNFGDSGIGSDEAMECMGRKCYLRTFV 336
            SVGVNHN+L+TNIPIGSL+YG LAA+VYD+NVSS  G   + SD  + CMGR CY  TFV
Sbjct: 526  SVGVNHNILITNIPIGSLVYGFLAAIVYDSNVSSGIGIGNVVSDSVV-CMGRHCYFLTFV 584

Query: 335  WWGIXXXXXXXXXXXXXLRTKPAYNRFEQNRITS-QVY 225
             WG              LRT+ AY+ FE+ R++S Q+Y
Sbjct: 585  LWGCVSVVGLAASVLLFLRTRHAYDCFERKRLSSTQLY 622