BLASTX nr result

ID: Mentha24_contig00031594 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00031594
         (874 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU46643.1| hypothetical protein MIMGU_mgv1a001710mg [Mimulus...   220   6e-55
emb|CAN67843.1| hypothetical protein VITISV_016666 [Vitis vinifera]   196   1e-47
ref|XP_002520708.1| conserved hypothetical protein [Ricinus comm...   167   4e-39
gb|EXC34665.1| hypothetical protein L484_020433 [Morus notabilis]     162   1e-37
ref|XP_007020458.1| Sequence-specific DNA binding,sequence-speci...   157   4e-36
ref|XP_007020457.1| NDX1 homeobox protein, putative isoform 2 [T...   157   4e-36
ref|XP_007020456.1| Sequence-specific DNA binding,sequence-speci...   157   4e-36
ref|XP_007150278.1| hypothetical protein PHAVU_005G140400g [Phas...   155   1e-35
ref|XP_003542016.1| PREDICTED: uncharacterized protein LOC100781...   151   4e-34
emb|CAA09794.1| NDX1 homeobox protein [Glycine max]                   150   6e-34
ref|XP_006597288.1| PREDICTED: uncharacterized protein LOC547668...   150   6e-34
ref|XP_007150277.1| hypothetical protein PHAVU_005G140400g [Phas...   150   8e-34
emb|CBI32285.3| unnamed protein product [Vitis vinifera]              149   1e-33
ref|XP_006479839.1| PREDICTED: uncharacterized protein LOC102620...   149   2e-33
ref|XP_006479838.1| PREDICTED: uncharacterized protein LOC102620...   149   2e-33
ref|XP_006479836.1| PREDICTED: uncharacterized protein LOC102620...   149   2e-33
ref|XP_006444197.1| hypothetical protein CICLE_v10018730mg [Citr...   149   2e-33
ref|XP_006366379.1| PREDICTED: uncharacterized protein LOC102594...   148   2e-33
emb|CAA09791.1| NDX1 homeobox protein [Lotus japonicus]               146   9e-33
ref|XP_004247476.1| PREDICTED: uncharacterized protein LOC101264...   144   6e-32

>gb|EYU46643.1| hypothetical protein MIMGU_mgv1a001710mg [Mimulus guttatus]
          Length = 770

 Score =  220 bits (560), Expect = 6e-55
 Identities = 131/307 (42%), Positives = 167/307 (54%), Gaps = 17/307 (5%)
 Frame = +3

Query: 3    DHLAHDGKNVGMYSSPLHKEITPDHGTNVVQMERGTPDLGSREVDQFDVSRNGDGQ---- 170
            D L  D ++ G+      KE+  D G +    E+ T +  + + +  D SRN + Q    
Sbjct: 455  DRLVQDSQHKGV-----PKEV--DRGYSDSNAEKRTLENVALQENHLDASRNRNSQCFDG 507

Query: 171  -----FMEQDRSNGPSINSRENEKDARNFETSGSDSSPTRGKTPIDRMDVDHIKGGSYDE 335
                  +EQ  SNG +IN RE E+D+R  ETSG+DSSPTRGK   D MDVDH+KG  ++E
Sbjct: 508  ERKYGMVEQCTSNGDNINFREFERDSRTVETSGTDSSPTRGKNSSDLMDVDHVKGSGFEE 567

Query: 336  AAEEDKVDATHSDEKQQRKRKRTVMNDKQIALIESALIDEPDMHRNAISLRSWADKLSIH 515
              E++K DA +SDEKQQRKRKRT+MND+QIALIESAL+DEPDMHRN  SLR+WAD+LS+ 
Sbjct: 568  TMEDEKADAMYSDEKQQRKRKRTIMNDRQIALIESALVDEPDMHRNLTSLRNWADRLSLQ 627

Query: 516  GAEVTTSRLKNWXXXXXXXXXXXXXDV--------SLERLGSSGHLDSPRSSMDDARVSL 671
            GAEVTTSRLKNW             DV        +L R G SG+L+SP ++        
Sbjct: 628  GAEVTTSRLKNWLNNRKARLARVAKDVRVPYEGDKNLNRQGGSGNLESPLNT-------- 679

Query: 672  AARGSVENEATNIEVTASVDEEDMGTSRRNNPARTLSFEPGQYVMLVDEMGNEVGKGSVV 851
                                                 FE GQYV+LV E    +GK  V 
Sbjct: 680  ------------------------------------DFEAGQYVILVGEKAETIGKAKVF 703

Query: 852  QVNGNWC 872
            Q+ GNWC
Sbjct: 704  QIGGNWC 710


>emb|CAN67843.1| hypothetical protein VITISV_016666 [Vitis vinifera]
          Length = 1134

 Score =  196 bits (497), Expect = 1e-47
 Identities = 129/312 (41%), Positives = 170/312 (54%), Gaps = 28/312 (8%)
 Frame = +3

Query: 18   DGKNVGMYSSPLHKEITPDHGTNVVQMERGTPDLGS-REVDQFDVSRNGD--GQFMEQDR 188
            + ++ G  SSPL ++  PD       ++ GT +  + +EVDQF   RN D     M QDR
Sbjct: 683  EAQSTGGCSSPLLRKAAPDVTNRSANLKEGTSENSTLQEVDQF-FGRNMDQADDVMRQDR 741

Query: 189  ---SNGPSINSRENEKDARNFETSGSDSSPTRGKTPIDRMDV-------DHIKGGSYDEA 338
                N      R+ EKD +N ETSGSDSS TRGK   D++D        +HIK       
Sbjct: 742  RKDKNKLGRALRDGEKDVQNVETSGSDSSSTRGKNSTDQIDNSEFPKSNEHIKASGSGGV 801

Query: 339  AEEDKVDATHSDEKQQRKRKRTVMNDKQIALIESALIDEPDMHRNAISLRSWADKLSIHG 518
             E++KV+   S+EKQ+RKRKRT+MND Q+ LIE AL+DEPDM RNA  ++SWADKLS HG
Sbjct: 802  QEDEKVEIIPSEEKQRRKRKRTIMNDTQMTLIEKALVDEPDMQRNAALIQSWADKLSFHG 861

Query: 519  AEVTTSRLKNWXXXXXXXXXXXXXDVSL----------ERLGSS-GHL-DSPRSSMDDAR 662
             E+T S+LKNW             DV +          +++GS  G L DSP S  +D  
Sbjct: 862  PELTASQLKNWLNNRKARLARAAKDVRVASEVDSTFPDKQVGSGVGSLHDSPESPGEDFF 921

Query: 663  VSLAARGSVENEATNIEVT-ASVDEEDMGTSR--RNNPARTLSFEPGQYVMLVDEMGNEV 833
                ARG     A    V+ A  D  +  T+     NPA  +  EPGQYV+L+D  G+++
Sbjct: 922  APSTARGGTHQSAIGGSVSRAGADNAEAATAEFVDINPAEFVRREPGQYVVLLDGQGDDI 981

Query: 834  GKGSVVQVNGNW 869
            GKG V QV G W
Sbjct: 982  GKGKVHQVQGKW 993


>ref|XP_002520708.1| conserved hypothetical protein [Ricinus communis]
            gi|223540093|gb|EEF41670.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 957

 Score =  167 bits (424), Expect = 4e-39
 Identities = 111/316 (35%), Positives = 157/316 (49%), Gaps = 32/316 (10%)
 Frame = +3

Query: 18   DGKNVGMYSSPLHKEITPDHGTNVVQMERGTPDLGSREVDQFD-----VSRNGDGQFMEQ 182
            + ++ G YSS L K+   +   +  + E  + +    E +Q       +    D    E+
Sbjct: 591  EAQSTGGYSSALSKKELSNRNISSNRKEEISENSAFLEEEQLSFRNEHMKYGDDAMREEK 650

Query: 183  DRSNGP-SINSRENEKDARNFETSGSDSSPTRGKTPIDRM-------DVDHIKGGSYDEA 338
            D+S G  S   RE ++D +N ETSGSD+S TRGK    ++         +H K       
Sbjct: 651  DKSGGTASTIKREIDRDFQNIETSGSDTSSTRGKNFAGQLGNSDFPKSSEHKKENGLQGV 710

Query: 339  AEEDKVDATHSDEKQQRKRKRTVMNDKQIALIESALIDEPDMHRNAISLRSWADKLSIHG 518
             E +KV+    +EKQ RKRKRT+MN+ Q++LIE AL+DEPDMHRNA SL+SWADKLS+HG
Sbjct: 711  QEGEKVETIQFEEKQPRKRKRTIMNEYQMSLIEEALVDEPDMHRNAASLQSWADKLSLHG 770

Query: 519  AEVTTSRLKNWXXXXXXXXXXXXXDVSLERLGSSGHLDSPRSSMD--------------- 653
            +EVT+S+LKNW                +       H  S + S+                
Sbjct: 771  SEVTSSQLKNWLNNRKARLARAGAGKDVRTPMEVDHALSEKQSVPALRHSHDSSESHGEV 830

Query: 654  ----DARVSLAARGSVENEATNIEVTASVDEEDMGTSRRNNPARTLSFEPGQYVMLVDEM 821
                 AR+S A  GS EN   ++     +D            A  +  +PGQYV+LVD+ 
Sbjct: 831  NVPAGARLSTARIGSAENAEISLAQFFGID-----------AAELVQCKPGQYVVLVDKQ 879

Query: 822  GNEVGKGSVVQVNGNW 869
            G+E+GKG V QV G W
Sbjct: 880  GDEIGKGKVYQVQGKW 895


>gb|EXC34665.1| hypothetical protein L484_020433 [Morus notabilis]
          Length = 965

 Score =  162 bits (411), Expect = 1e-37
 Identities = 110/310 (35%), Positives = 157/310 (50%), Gaps = 26/310 (8%)
 Frame = +3

Query: 18   DGKNVGMYSSPLHKEITPDHGTNVVQM-ERGTPDLGSREVDQF-----DVSRNGDGQFME 179
            + ++ G  SSPL  +  P+       + E  + +   ++ DQ        ++ GD    +
Sbjct: 597  EAQSAGGCSSPLLMKEPPNLNNRSSSLKEEMSENSAIQDADQKYQNIEHTAQGGDAVRED 656

Query: 180  QDRSNGPSINSR-ENEKDARNFETSGSDSSPTRGKTPIDRMDVDHI--------KGGSYD 332
            + +S+  +     E +KDA+N ETSGSD+S TRGK  +D+MD            + G   
Sbjct: 657  KGKSSRSAFGGTVEIDKDAQNVETSGSDTSSTRGKN-VDQMDNSEFPKSSAPTKESGYGR 715

Query: 333  EAAEEDKVDATHSDEKQQRKRKRTVMNDKQIALIESALIDEPDMHRNAISLRSWADKLSI 512
             AAEE KV+    DEKQ+RKRKRT+MNDKQ+ L+E AL+DEPDM RNA  +++WADKLS 
Sbjct: 716  NAAEEKKVETVQHDEKQRRKRKRTIMNDKQVELMERALVDEPDMQRNASLIQAWADKLSF 775

Query: 513  HGAEVTTSRLKNWXXXXXXXXXXXXXDVSLERLGSSGHLD-----------SPRSSMDDA 659
            HG+EVT+S+LKNW             DV       +  L+           SP S  +DA
Sbjct: 776  HGSEVTSSQLKNWLNNRKARLARTGKDVRPTLEAENSFLEKQGGPILRSNYSPESPGEDA 835

Query: 660  RVSLAARGSVENEATNIEVTASVDEEDMGTSRRNNPARTLSFEPGQYVMLVDEMGNEVGK 839
             V        + +A      A+   E         P+  +  EPGQ V++VD  G E+ K
Sbjct: 836  TVQ--PNVGRDPQAMTWRTNAAETSEVAPAEAAFGPSEFVQCEPGQQVVIVDAAGEEIAK 893

Query: 840  GSVVQVNGNW 869
            G V QV+G W
Sbjct: 894  GKVFQVHGKW 903


>ref|XP_007020458.1| Sequence-specific DNA binding,sequence-specific DNA binding
            transcription factors, putative isoform 3 [Theobroma
            cacao] gi|508720086|gb|EOY11983.1| Sequence-specific DNA
            binding,sequence-specific DNA binding transcription
            factors, putative isoform 3 [Theobroma cacao]
          Length = 874

 Score =  157 bits (398), Expect = 4e-36
 Identities = 104/313 (33%), Positives = 160/313 (51%), Gaps = 23/313 (7%)
 Frame = +3

Query: 3    DHLAHDGKNVGMYSSPLHKEITPDHGT-----------NVVQMERGTPDLGSREVDQFDV 149
            ++   + +++G  SSPL +   P+              N    E     + S  +DQ D 
Sbjct: 510  ENRVQEDRSLGGCSSPLLRTEPPNRNNRNGNLKEEMSENSAFQEEEQCYVRSNHMDQADD 569

Query: 150  SRNGDGQFMEQDRSNGPSINSRENEKDARNFETSGSDSSPTRGKTPIDRMDVDHIKGGSY 329
                D    ++D+S  P I  +E ++D +N ETSGSD+S T+GK  +D++ V+ ++  + 
Sbjct: 570  ITRQD-MMDDKDKSVTP-IGLKEIDRDVQNVETSGSDTSSTKGKNAVDKL-VERLRDSTP 626

Query: 330  DEAAEEDKVDATHSDEKQQRKRKRTVMNDKQIALIESALIDEPDMHRNAISLRSWADKLS 509
                E++KV+   ++EKQ+RKRKRT+MND+Q+ +IE AL+DEP+M RN  S++SWADKL 
Sbjct: 627  AGVREDEKVETVQTEEKQRRKRKRTIMNDEQVTIIERALLDEPEMQRNTASIQSWADKLC 686

Query: 510  IHGAEVTTSRLKNWXXXXXXXXXXXXXD-----------VSLERLGSSGH-LDSPRSSMD 653
             HG+EVT S+L+NW             D              +     GH   +P SS +
Sbjct: 687  HHGSEVTCSQLRNWLNNRKARLARASKDARPPPEPDNAFAGKQGGPQPGHPFKAPDSSGE 746

Query: 654  DARVSLAARGSVENEATNIEVTASVDEEDMGTSRRNNPARTLSFEPGQYVMLVDEMGNEV 833
            +A  S        +  +  E   + +  D G       A  +  +PGQ+V+LVD  G E+
Sbjct: 747  EAAPSNTRGTRSMSRISTSENPEAPEFVDFGA------AEFVQCKPGQFVVLVDGRGEEI 800

Query: 834  GKGSVVQVNGNWC 872
            GKG V QV G WC
Sbjct: 801  GKGKVHQVQGKWC 813


>ref|XP_007020457.1| NDX1 homeobox protein, putative isoform 2 [Theobroma cacao]
            gi|508720085|gb|EOY11982.1| NDX1 homeobox protein,
            putative isoform 2 [Theobroma cacao]
          Length = 926

 Score =  157 bits (398), Expect = 4e-36
 Identities = 104/313 (33%), Positives = 160/313 (51%), Gaps = 23/313 (7%)
 Frame = +3

Query: 3    DHLAHDGKNVGMYSSPLHKEITPDHGT-----------NVVQMERGTPDLGSREVDQFDV 149
            ++   + +++G  SSPL +   P+              N    E     + S  +DQ D 
Sbjct: 562  ENRVQEDRSLGGCSSPLLRTEPPNRNNRNGNLKEEMSENSAFQEEEQCYVRSNHMDQADD 621

Query: 150  SRNGDGQFMEQDRSNGPSINSRENEKDARNFETSGSDSSPTRGKTPIDRMDVDHIKGGSY 329
                D    ++D+S  P I  +E ++D +N ETSGSD+S T+GK  +D++ V+ ++  + 
Sbjct: 622  ITRQD-MMDDKDKSVTP-IGLKEIDRDVQNVETSGSDTSSTKGKNAVDKL-VERLRDSTP 678

Query: 330  DEAAEEDKVDATHSDEKQQRKRKRTVMNDKQIALIESALIDEPDMHRNAISLRSWADKLS 509
                E++KV+   ++EKQ+RKRKRT+MND+Q+ +IE AL+DEP+M RN  S++SWADKL 
Sbjct: 679  AGVREDEKVETVQTEEKQRRKRKRTIMNDEQVTIIERALLDEPEMQRNTASIQSWADKLC 738

Query: 510  IHGAEVTTSRLKNWXXXXXXXXXXXXXD-----------VSLERLGSSGH-LDSPRSSMD 653
             HG+EVT S+L+NW             D              +     GH   +P SS +
Sbjct: 739  HHGSEVTCSQLRNWLNNRKARLARASKDARPPPEPDNAFAGKQGGPQPGHPFKAPDSSGE 798

Query: 654  DARVSLAARGSVENEATNIEVTASVDEEDMGTSRRNNPARTLSFEPGQYVMLVDEMGNEV 833
            +A  S        +  +  E   + +  D G       A  +  +PGQ+V+LVD  G E+
Sbjct: 799  EAAPSNTRGTRSMSRISTSENPEAPEFVDFGA------AEFVQCKPGQFVVLVDGRGEEI 852

Query: 834  GKGSVVQVNGNWC 872
            GKG V QV G WC
Sbjct: 853  GKGKVHQVQGKWC 865


>ref|XP_007020456.1| Sequence-specific DNA binding,sequence-specific DNA binding
            transcription factors, putative isoform 1 [Theobroma
            cacao] gi|508720084|gb|EOY11981.1| Sequence-specific DNA
            binding,sequence-specific DNA binding transcription
            factors, putative isoform 1 [Theobroma cacao]
          Length = 1035

 Score =  157 bits (398), Expect = 4e-36
 Identities = 104/313 (33%), Positives = 160/313 (51%), Gaps = 23/313 (7%)
 Frame = +3

Query: 3    DHLAHDGKNVGMYSSPLHKEITPDHGT-----------NVVQMERGTPDLGSREVDQFDV 149
            ++   + +++G  SSPL +   P+              N    E     + S  +DQ D 
Sbjct: 671  ENRVQEDRSLGGCSSPLLRTEPPNRNNRNGNLKEEMSENSAFQEEEQCYVRSNHMDQADD 730

Query: 150  SRNGDGQFMEQDRSNGPSINSRENEKDARNFETSGSDSSPTRGKTPIDRMDVDHIKGGSY 329
                D    ++D+S  P I  +E ++D +N ETSGSD+S T+GK  +D++ V+ ++  + 
Sbjct: 731  ITRQD-MMDDKDKSVTP-IGLKEIDRDVQNVETSGSDTSSTKGKNAVDKL-VERLRDSTP 787

Query: 330  DEAAEEDKVDATHSDEKQQRKRKRTVMNDKQIALIESALIDEPDMHRNAISLRSWADKLS 509
                E++KV+   ++EKQ+RKRKRT+MND+Q+ +IE AL+DEP+M RN  S++SWADKL 
Sbjct: 788  AGVREDEKVETVQTEEKQRRKRKRTIMNDEQVTIIERALLDEPEMQRNTASIQSWADKLC 847

Query: 510  IHGAEVTTSRLKNWXXXXXXXXXXXXXD-----------VSLERLGSSGH-LDSPRSSMD 653
             HG+EVT S+L+NW             D              +     GH   +P SS +
Sbjct: 848  HHGSEVTCSQLRNWLNNRKARLARASKDARPPPEPDNAFAGKQGGPQPGHPFKAPDSSGE 907

Query: 654  DARVSLAARGSVENEATNIEVTASVDEEDMGTSRRNNPARTLSFEPGQYVMLVDEMGNEV 833
            +A  S        +  +  E   + +  D G       A  +  +PGQ+V+LVD  G E+
Sbjct: 908  EAAPSNTRGTRSMSRISTSENPEAPEFVDFGA------AEFVQCKPGQFVVLVDGRGEEI 961

Query: 834  GKGSVVQVNGNWC 872
            GKG V QV G WC
Sbjct: 962  GKGKVHQVQGKWC 974


>ref|XP_007150278.1| hypothetical protein PHAVU_005G140400g [Phaseolus vulgaris]
            gi|561023542|gb|ESW22272.1| hypothetical protein
            PHAVU_005G140400g [Phaseolus vulgaris]
          Length = 934

 Score =  155 bits (393), Expect = 1e-35
 Identities = 96/239 (40%), Positives = 134/239 (56%), Gaps = 19/239 (7%)
 Frame = +3

Query: 210  SRENEKDARNFETSGSDSSPTRGKTPIDRMDV-------DHIKGGSYDEAAEEDKVDATH 368
            +R+ +KDA+N ETSGSD+S  +GK  +D MD+       + +K  + +E  E++K++ + 
Sbjct: 650  ARDMDKDAQNVETSGSDTSSAKGKNVVDHMDIGELSKSNERLKRTAVEENPEDEKIELS- 708

Query: 369  SDEKQQRKRKRTVMNDKQIALIESALIDEPDMHRNAISLRSWADKLSIHGAEVTTSRLKN 548
                Q+RKRKRT+MNDKQ+ LIE AL DEPDM RNA+SL+SWA+KLS+HG+EVT+S+LKN
Sbjct: 709  ----QRRKRKRTIMNDKQVLLIERALKDEPDMQRNAVSLQSWAEKLSVHGSEVTSSQLKN 764

Query: 549  WXXXXXXXXXXXXXDV------------SLERLGSSGHLDSPRSSMDDARVSLAARGSVE 692
            W             DV              +R    G  DSP S  D + V+  A G  +
Sbjct: 765  WLNNRKARLARTARDVRTAGGDADNPVLEKQRGPVPGSYDSPESPGDVSHVARIASGDNK 824

Query: 693  NEATNIEVTASVDEEDMGTSRRNNPARTLSFEPGQYVMLVDEMGNEVGKGSVVQVNGNW 869
             E +   +   VD       R N          GQYV+LV   G+E+G+G V QV+G W
Sbjct: 825  PEPS---LARFVDIGSPEFGRCN---------AGQYVVLVGVRGDEIGRGKVFQVHGKW 871


>ref|XP_003542016.1| PREDICTED: uncharacterized protein LOC100781915 isoform X1 [Glycine
            max] gi|571502767|ref|XP_006595007.1| PREDICTED:
            uncharacterized protein LOC100781915 isoform X2 [Glycine
            max] gi|571502774|ref|XP_006595008.1| PREDICTED:
            uncharacterized protein LOC100781915 isoform X3 [Glycine
            max]
          Length = 945

 Score =  151 bits (381), Expect = 4e-34
 Identities = 97/238 (40%), Positives = 132/238 (55%), Gaps = 18/238 (7%)
 Frame = +3

Query: 210  SRENEKDARNFETSGSDSSPTRGKTPIDRMDV-------DHIKGGSYDEAAEEDKVDATH 368
            +RE +KDA+N ETSGSDSS  +GK  +D MD        + +K  + +E  E++K++ + 
Sbjct: 661  AREMDKDAQNVETSGSDSSSAKGKNVVDNMDNGELSKSNERLKRTAVEENPEDEKIELS- 719

Query: 369  SDEKQQRKRKRTVMNDKQIALIESALIDEPDMHRNAISLRSWADKLSIHGAEVTTSRLKN 548
                Q+RKRKRT+MNDKQ+ LIE AL DEPDM RNA SL+SWADKLS HG+EVT+S+LKN
Sbjct: 720  ----QRRKRKRTIMNDKQVMLIERALKDEPDMQRNAASLQSWADKLSGHGSEVTSSQLKN 775

Query: 549  WXXXXXXXXXXXXXDVSL-----------ERLGSSGHLDSPRSSMDDARVSLAARGSVEN 695
            W             DV             +R    G  DSP S  D + V+  A G  ++
Sbjct: 776  WLNNRKARLARTARDVKAAAGDDNPVPDKQRGPVPGSYDSPGSPGDVSHVARIASGDNKS 835

Query: 696  EATNIEVTASVDEEDMGTSRRNNPARTLSFEPGQYVMLVDEMGNEVGKGSVVQVNGNW 869
            E +     A     D+G+    +         GQYV+LV    +E+G+G V QV+G W
Sbjct: 836  EPS----LALARFVDIGSPEFGH------CNAGQYVVLVGVRQDEIGRGKVFQVHGKW 883


>emb|CAA09794.1| NDX1 homeobox protein [Glycine max]
          Length = 626

 Score =  150 bits (379), Expect = 6e-34
 Identities = 96/238 (40%), Positives = 130/238 (54%), Gaps = 18/238 (7%)
 Frame = +3

Query: 210  SRENEKDARNFETSGSDSSPTRGKTPIDRMDV-------DHIKGGSYDEAAEEDKVDATH 368
            +RE +KDA+N ETSGSDSS  +GK  +D MD        + +K  + +E  E++K++ + 
Sbjct: 346  AREMDKDAQNVETSGSDSSSAKGKNVVDNMDNGELSKSNERLKRTAVEENPEDEKIELS- 404

Query: 369  SDEKQQRKRKRTVMNDKQIALIESALIDEPDMHRNAISLRSWADKLSIHGAEVTTSRLKN 548
                Q+RKRKRT+MNDKQ+ LIE AL DEPDM RNA SL+SWADKLS HG+EVT+S+LKN
Sbjct: 405  ----QRRKRKRTIMNDKQVMLIERALKDEPDMQRNAASLQSWADKLSGHGSEVTSSQLKN 460

Query: 549  WXXXXXXXXXXXXXDVSL-----------ERLGSSGHLDSPRSSMDDARVSLAARGSVEN 695
            W             DV             +R    G  DSP S  D + V+  A G  ++
Sbjct: 461  WLNNRKARLARTARDVKAAAGDDNPVPEKQRGPVPGSYDSPGSPGDVSHVARIASGDNKS 520

Query: 696  EATNIEVTASVDEEDMGTSRRNNPARTLSFEPGQYVMLVDEMGNEVGKGSVVQVNGNW 869
            E             D+G+    +         GQ V+LV   G+E+G+G V QV+G W
Sbjct: 521  ELARF--------VDIGSPEFGH------CNAGQNVVLVGVRGDEIGRGKVFQVHGKW 564


>ref|XP_006597288.1| PREDICTED: uncharacterized protein LOC547668 isoform X1 [Glycine max]
            gi|571515697|ref|XP_006597289.1| PREDICTED:
            uncharacterized protein LOC547668 isoform X2 [Glycine
            max] gi|571515700|ref|XP_006597290.1| PREDICTED:
            uncharacterized protein LOC547668 isoform X3 [Glycine
            max] gi|571515704|ref|XP_006597291.1| PREDICTED:
            uncharacterized protein LOC547668 isoform X4 [Glycine
            max]
          Length = 941

 Score =  150 bits (379), Expect = 6e-34
 Identities = 96/238 (40%), Positives = 130/238 (54%), Gaps = 18/238 (7%)
 Frame = +3

Query: 210  SRENEKDARNFETSGSDSSPTRGKTPIDRMDV-------DHIKGGSYDEAAEEDKVDATH 368
            +RE +KDA+N ETSGSDSS  +GK  +D MD        + +K  + +E  E++K++ + 
Sbjct: 661  AREMDKDAQNVETSGSDSSSAKGKNVVDNMDNGELSKSNERLKRTAVEENPEDEKIELS- 719

Query: 369  SDEKQQRKRKRTVMNDKQIALIESALIDEPDMHRNAISLRSWADKLSIHGAEVTTSRLKN 548
                Q+RKRKRT+MNDKQ+ LIE AL DEPDM RNA SL+SWADKLS HG+EVT+S+LKN
Sbjct: 720  ----QRRKRKRTIMNDKQVMLIERALKDEPDMQRNAASLQSWADKLSGHGSEVTSSQLKN 775

Query: 549  WXXXXXXXXXXXXXDVSL-----------ERLGSSGHLDSPRSSMDDARVSLAARGSVEN 695
            W             DV             +R    G  DSP S  D + V+  A G  ++
Sbjct: 776  WLNNRKARLARTARDVKAAAGDDNPVPEKQRGPVPGSYDSPGSPGDVSHVARIASGDNKS 835

Query: 696  EATNIEVTASVDEEDMGTSRRNNPARTLSFEPGQYVMLVDEMGNEVGKGSVVQVNGNW 869
            E             D+G+    +         GQ V+LV   G+E+G+G V QV+G W
Sbjct: 836  ELARF--------VDIGSPEFGH------CNAGQNVVLVGVRGDEIGRGKVFQVHGKW 879


>ref|XP_007150277.1| hypothetical protein PHAVU_005G140400g [Phaseolus vulgaris]
            gi|561023541|gb|ESW22271.1| hypothetical protein
            PHAVU_005G140400g [Phaseolus vulgaris]
          Length = 898

 Score =  150 bits (378), Expect = 8e-34
 Identities = 90/227 (39%), Positives = 125/227 (55%), Gaps = 7/227 (3%)
 Frame = +3

Query: 210  SRENEKDARNFETSGSDSSPTRGKTPIDRMDV-------DHIKGGSYDEAAEEDKVDATH 368
            +R+ +KDA+N ETSGSD+S  +GK  +D MD+       + +K  + +E  E++K++ + 
Sbjct: 650  ARDMDKDAQNVETSGSDTSSAKGKNVVDHMDIGELSKSNERLKRTAVEENPEDEKIELS- 708

Query: 369  SDEKQQRKRKRTVMNDKQIALIESALIDEPDMHRNAISLRSWADKLSIHGAEVTTSRLKN 548
                Q+RKRKRT+MNDKQ+ LIE AL DEPDM RNA+SL+SWA+KLS+HG+EVT+S+LKN
Sbjct: 709  ----QRRKRKRTIMNDKQVLLIERALKDEPDMQRNAVSLQSWAEKLSVHGSEVTSSQLKN 764

Query: 549  WXXXXXXXXXXXXXDVSLERLGSSGHLDSPRSSMDDARVSLAARGSVENEATNIEVTASV 728
            W             DV      + G  D+P        V    RG V     + E     
Sbjct: 765  WLNNRKARLARTARDVRT----AGGDADNP--------VLEKQRGPVPGSYDSPE----- 807

Query: 729  DEEDMGTSRRNNPARTLSFEPGQYVMLVDEMGNEVGKGSVVQVNGNW 869
                                PGQYV+LV   G+E+G+G V QV+G W
Sbjct: 808  -------------------SPGQYVVLVGVRGDEIGRGKVFQVHGKW 835


>emb|CBI32285.3| unnamed protein product [Vitis vinifera]
          Length = 878

 Score =  149 bits (376), Expect = 1e-33
 Identities = 103/290 (35%), Positives = 145/290 (50%), Gaps = 6/290 (2%)
 Frame = +3

Query: 18   DGKNVGMYSSPLHKEITPDHGTNVVQMERGTPDLGS-REVDQFDVSRNGD--GQFMEQDR 188
            + ++ G  SSPL ++  PD       ++ GT +  + +EVDQF   RN D     M QDR
Sbjct: 578  EAQSTGGCSSPLLRKAAPDVTNRSANLKEGTSENSTLQEVDQF-FGRNMDQADDVMRQDR 636

Query: 189  ---SNGPSINSRENEKDARNFETSGSDSSPTRGKTPIDRMDVDHIKGGSYDEAAEEDKVD 359
                N      R+ EKD +N ETSGSDSS TRGK   D++D                +  
Sbjct: 637  RKDKNKLGRALRDGEKDVQNVETSGSDSSSTRGKNSTDQID--------------NSEFP 682

Query: 360  ATHSDEKQQRKRKRTVMNDKQIALIESALIDEPDMHRNAISLRSWADKLSIHGAEVTTSR 539
             ++   K   KRKRT+MND Q+ LIE AL+DEPDM RNA  ++SWADKLS HG E+T S+
Sbjct: 683  KSNEHIKASGKRKRTIMNDTQMTLIEKALVDEPDMQRNAALIQSWADKLSFHGPELTASQ 742

Query: 540  LKNWXXXXXXXXXXXXXDVSLERLGSSGHLDSPRSSMDDARVSLAARGSVENEATNIEVT 719
            LKNW                         L++ ++ +  A   +     V++   + +V 
Sbjct: 743  LKNW-------------------------LNNRKARLARAAKDVRVASEVDSTFPDKQVG 777

Query: 720  ASVDEEDMGTSRRNNPARTLSFEPGQYVMLVDEMGNEVGKGSVVQVNGNW 869
            + V       S  ++P       PGQYV+L+D  G+++GKG V QV G W
Sbjct: 778  SGVG------SLHDSPE-----SPGQYVVLLDGQGDDIGKGKVHQVQGKW 816


>ref|XP_006479839.1| PREDICTED: uncharacterized protein LOC102620367 isoform X4 [Citrus
            sinensis]
          Length = 932

 Score =  149 bits (375), Expect = 2e-33
 Identities = 101/274 (36%), Positives = 139/274 (50%), Gaps = 29/274 (10%)
 Frame = +3

Query: 135  DQFDVSRN----GDGQFMEQDRSNGPSI----NSRENEKDARNFETSGSDSSPTRGKTPI 290
            D+FD   N    GD    + +R N   +    +SRE +KD +   +SGSD+SP  GK  +
Sbjct: 602  DRFDSRSNLMDQGDDMMRQDNRENKDKVGMPGSSREVDKDVQIVGSSGSDTSPLGGKNFV 661

Query: 291  DRMDV-------DHIKGGSYDEAAEEDKVDATHSDEKQQRKRKRTVMNDKQIALIESALI 449
            D+++        + IK   +    EE+KV+   S+EKQQRKRKRT+MND Q+ALIE AL+
Sbjct: 662  DQVENVEFPKPNEPIKESVFGGVQEEEKVETVQSEEKQQRKRKRTIMNDNQMALIERALL 721

Query: 450  DEPDMHRNAISLRSWADKLSIHGAEVTTSRLKNWXXXXXXXXXXXXXDVSLERLGSSGHL 629
            DEPDM RN  S+R WA +LS HG+EVT+S+LKNW             D        +   
Sbjct: 722  DEPDMQRNTSSIRLWASRLSHHGSEVTSSQLKNWLNNRKARLARASKDARASSEADNSFT 781

Query: 630  ------------DSPRSSMDDARVSLAARGSVENEATNIE--VTASVDEEDMGTSRRNNP 767
                        DSP S  +D  + L +RG+     T  +  + A  D  D+G S     
Sbjct: 782  GKQSGPGLRQSHDSPDSPGED-HLPLNSRGTRSTLRTGADDNLEALTDIVDIGAS----- 835

Query: 768  ARTLSFEPGQYVMLVDEMGNEVGKGSVVQVNGNW 869
                  + GQ V+L+D  G E+G G V QV G W
Sbjct: 836  -EFAQRKAGQLVVLLDGQGEEIGSGRVHQVYGKW 868


>ref|XP_006479838.1| PREDICTED: uncharacterized protein LOC102620367 isoform X3 [Citrus
            sinensis]
          Length = 954

 Score =  149 bits (375), Expect = 2e-33
 Identities = 101/274 (36%), Positives = 139/274 (50%), Gaps = 29/274 (10%)
 Frame = +3

Query: 135  DQFDVSRN----GDGQFMEQDRSNGPSI----NSRENEKDARNFETSGSDSSPTRGKTPI 290
            D+FD   N    GD    + +R N   +    +SRE +KD +   +SGSD+SP  GK  +
Sbjct: 624  DRFDSRSNLMDQGDDMMRQDNRENKDKVGMPGSSREVDKDVQIVGSSGSDTSPLGGKNFV 683

Query: 291  DRMDV-------DHIKGGSYDEAAEEDKVDATHSDEKQQRKRKRTVMNDKQIALIESALI 449
            D+++        + IK   +    EE+KV+   S+EKQQRKRKRT+MND Q+ALIE AL+
Sbjct: 684  DQVENVEFPKPNEPIKESVFGGVQEEEKVETVQSEEKQQRKRKRTIMNDNQMALIERALL 743

Query: 450  DEPDMHRNAISLRSWADKLSIHGAEVTTSRLKNWXXXXXXXXXXXXXDVSLERLGSSGHL 629
            DEPDM RN  S+R WA +LS HG+EVT+S+LKNW             D        +   
Sbjct: 744  DEPDMQRNTSSIRLWASRLSHHGSEVTSSQLKNWLNNRKARLARASKDARASSEADNSFT 803

Query: 630  ------------DSPRSSMDDARVSLAARGSVENEATNIE--VTASVDEEDMGTSRRNNP 767
                        DSP S  +D  + L +RG+     T  +  + A  D  D+G S     
Sbjct: 804  GKQSGPGLRQSHDSPDSPGED-HLPLNSRGTRSTLRTGADDNLEALTDIVDIGAS----- 857

Query: 768  ARTLSFEPGQYVMLVDEMGNEVGKGSVVQVNGNW 869
                  + GQ V+L+D  G E+G G V QV G W
Sbjct: 858  -EFAQRKAGQLVVLLDGQGEEIGSGRVHQVYGKW 890


>ref|XP_006479836.1| PREDICTED: uncharacterized protein LOC102620367 isoform X1 [Citrus
            sinensis] gi|568852343|ref|XP_006479837.1| PREDICTED:
            uncharacterized protein LOC102620367 isoform X2 [Citrus
            sinensis]
          Length = 957

 Score =  149 bits (375), Expect = 2e-33
 Identities = 101/274 (36%), Positives = 139/274 (50%), Gaps = 29/274 (10%)
 Frame = +3

Query: 135  DQFDVSRN----GDGQFMEQDRSNGPSI----NSRENEKDARNFETSGSDSSPTRGKTPI 290
            D+FD   N    GD    + +R N   +    +SRE +KD +   +SGSD+SP  GK  +
Sbjct: 627  DRFDSRSNLMDQGDDMMRQDNRENKDKVGMPGSSREVDKDVQIVGSSGSDTSPLGGKNFV 686

Query: 291  DRMDV-------DHIKGGSYDEAAEEDKVDATHSDEKQQRKRKRTVMNDKQIALIESALI 449
            D+++        + IK   +    EE+KV+   S+EKQQRKRKRT+MND Q+ALIE AL+
Sbjct: 687  DQVENVEFPKPNEPIKESVFGGVQEEEKVETVQSEEKQQRKRKRTIMNDNQMALIERALL 746

Query: 450  DEPDMHRNAISLRSWADKLSIHGAEVTTSRLKNWXXXXXXXXXXXXXDVSLERLGSSGHL 629
            DEPDM RN  S+R WA +LS HG+EVT+S+LKNW             D        +   
Sbjct: 747  DEPDMQRNTSSIRLWASRLSHHGSEVTSSQLKNWLNNRKARLARASKDARASSEADNSFT 806

Query: 630  ------------DSPRSSMDDARVSLAARGSVENEATNIE--VTASVDEEDMGTSRRNNP 767
                        DSP S  +D  + L +RG+     T  +  + A  D  D+G S     
Sbjct: 807  GKQSGPGLRQSHDSPDSPGED-HLPLNSRGTRSTLRTGADDNLEALTDIVDIGAS----- 860

Query: 768  ARTLSFEPGQYVMLVDEMGNEVGKGSVVQVNGNW 869
                  + GQ V+L+D  G E+G G V QV G W
Sbjct: 861  -EFAQRKAGQLVVLLDGQGEEIGSGRVHQVYGKW 893


>ref|XP_006444197.1| hypothetical protein CICLE_v10018730mg [Citrus clementina]
            gi|567903420|ref|XP_006444198.1| hypothetical protein
            CICLE_v10018730mg [Citrus clementina]
            gi|567903422|ref|XP_006444199.1| hypothetical protein
            CICLE_v10018730mg [Citrus clementina]
            gi|557546459|gb|ESR57437.1| hypothetical protein
            CICLE_v10018730mg [Citrus clementina]
            gi|557546460|gb|ESR57438.1| hypothetical protein
            CICLE_v10018730mg [Citrus clementina]
            gi|557546461|gb|ESR57439.1| hypothetical protein
            CICLE_v10018730mg [Citrus clementina]
          Length = 957

 Score =  149 bits (375), Expect = 2e-33
 Identities = 101/274 (36%), Positives = 139/274 (50%), Gaps = 29/274 (10%)
 Frame = +3

Query: 135  DQFDVSRN----GDGQFMEQDRSNGPSI----NSRENEKDARNFETSGSDSSPTRGKTPI 290
            D+FD   N    GD    + +R N   +    +SRE +KD +   +SGSD+SP  GK  +
Sbjct: 627  DRFDSRSNLMDQGDDMMRQDNRENKDKVGMPGSSREVDKDVQIVGSSGSDTSPLGGKNFV 686

Query: 291  DRMDV-------DHIKGGSYDEAAEEDKVDATHSDEKQQRKRKRTVMNDKQIALIESALI 449
            D+++        + IK   +    EE+KV+   S+EKQQRKRKRT+MND Q+ALIE AL+
Sbjct: 687  DQVENVEFPKPNEPIKESVFGGVQEEEKVETVQSEEKQQRKRKRTIMNDNQMALIERALL 746

Query: 450  DEPDMHRNAISLRSWADKLSIHGAEVTTSRLKNWXXXXXXXXXXXXXDVSLERLGSSGHL 629
            DEPDM RN  S+R WA +LS HG+EVT+S+LKNW             D        +   
Sbjct: 747  DEPDMQRNTSSIRLWASRLSHHGSEVTSSQLKNWLNNRKARLARASKDARASSEADNSFT 806

Query: 630  ------------DSPRSSMDDARVSLAARGSVENEATNIE--VTASVDEEDMGTSRRNNP 767
                        DSP S  +D  + L +RG+     T  +  + A  D  D+G S     
Sbjct: 807  GKQSGPGLRQSHDSPDSPGED-HLPLNSRGTRSTLRTGADDNLEALTDIVDIGAS----- 860

Query: 768  ARTLSFEPGQYVMLVDEMGNEVGKGSVVQVNGNW 869
                  + GQ V+L+D  G E+G G V QV G W
Sbjct: 861  -EFAQRKAGQLVVLLDGQGEEIGSGRVHQVYGKW 893


>ref|XP_006366379.1| PREDICTED: uncharacterized protein LOC102594863 [Solanum tuberosum]
          Length = 934

 Score =  148 bits (374), Expect = 2e-33
 Identities = 108/310 (34%), Positives = 155/310 (50%), Gaps = 21/310 (6%)
 Frame = +3

Query: 3    DHLAHDGKNVGMYSSPLHKEITPDHGTNVVQMERGTPD---------LGSREVDQFDVSR 155
            ++   + +N+G Y  P  +E++ D             D         L SR  D+   S 
Sbjct: 568  ENRVQEAQNLGGYLPPQLREVSLDLNNRSANSREDILDNSSLQRLNQLNSRFNDEGQSSE 627

Query: 156  NGD-GQFMEQDRSNGPSINSRENEKDARNFETSGSDSSPTRGKTPIDRMD-VDHIKGGSY 329
             G  G+  E +R    SI+ ++ E   +N ETSGSDSS TR + P D++  V  I     
Sbjct: 628  AGTKGEMTEHERFIATSIDMKDIE--TQNVETSGSDSSSTRSRHPTDQVGKVGQINCNGP 685

Query: 330  DEAAEEDKVDATHSDEKQQRKRKRTVMNDKQIALIESALIDEPDMHRNAISLRSWADKLS 509
             E  E++ V+A H +EKQQRKRKRT+MND QI+L+E AL+ EPDM RN   L  WA KLS
Sbjct: 686  GEVREDETVEAQH-EEKQQRKRKRTIMNDTQISLVEKALMGEPDMQRNKTLLEKWAVKLS 744

Query: 510  IHGAEVTTSRLKNWXXXXXXXXXXXXXDVSL----ERLGSSGHL------DSPRSSMDDA 659
             HG+EVT S+LKNW             D  +    + L   G L      DSP S ++D 
Sbjct: 745  DHGSEVTKSQLKNWLNNRKARLARAAKDGRMLSEGDSLDKQGGLLTLLPSDSPGSPVEDV 804

Query: 660  RVSLAARGSVENEATNIEVTASVDEEDMGTSRRNNPARTLSFEPGQYVMLVDEMGNEVGK 839
             +  AAR +     T +  +++   E+  T+     +       G YV+L++E   E+G+
Sbjct: 805  GILSAARENAP-RLTGLAPSSTCLTENT-TAVPAASSEQAKCVAGDYVVLINEKAEEIGR 862

Query: 840  GSVVQVNGNW 869
            G V QV+G W
Sbjct: 863  GKVCQVSGKW 872


>emb|CAA09791.1| NDX1 homeobox protein [Lotus japonicus]
          Length = 958

 Score =  146 bits (369), Expect = 9e-33
 Identities = 91/238 (38%), Positives = 124/238 (52%), Gaps = 15/238 (6%)
 Frame = +3

Query: 201  SINSRENEKDARNFETSGSDSSPTRGKTPIDRMD-------VDHIKGGSYDEAAEEDKVD 359
            S  +R+ +KD +N ETS SD+S  +GK+ ID MD       V H K  +  E  E++KV+
Sbjct: 666  SRGARDFDKDCQNAETSSSDTSSAKGKSVIDHMDSGELSKSVAHPKKVTVGETPEDEKVE 725

Query: 360  ATHSDEKQQRKRKRTVMNDKQIALIESALIDEPDMHRNAISLRSWADKLSIHGAEVTTSR 539
                    +RKRKRT+MND+Q+ LIE AL+DEPDM RNA SL+SWADKLS+HG++VT S+
Sbjct: 726  TV-----PRRKRKRTIMNDEQVMLIERALLDEPDMQRNAASLQSWADKLSLHGSDVTPSQ 780

Query: 540  LKNWXXXXXXXXXXXXXDVSLERLGSSGHLDSPRSSMDDARVSLAARGSVENEATNIEVT 719
            +KNW             DV    +  S   D PR        S    G   N   ++   
Sbjct: 781  IKNWLNNRKARLARTAKDVPAADVAKSVP-DKPRGPSLGPYASPDNYGDASNARQDLLSL 839

Query: 720  ASVDEED--------MGTSRRNNPARTLSFEPGQYVMLVDEMGNEVGKGSVVQVNGNW 869
            A +   D        +     + P   +    GQ+V+L D  G E+G+G VVQV G W
Sbjct: 840  AKIASGDNPEPSLAELKAELVDAPPEIVRCNVGQHVVLTDTRGKEIGRGKVVQVQGKW 897


>ref|XP_004247476.1| PREDICTED: uncharacterized protein LOC101264065 [Solanum
            lycopersicum]
          Length = 934

 Score =  144 bits (362), Expect = 6e-32
 Identities = 104/311 (33%), Positives = 156/311 (50%), Gaps = 22/311 (7%)
 Frame = +3

Query: 3    DHLAHDGKNVGMYSSPLHKEITPDHGTNVVQMERGTPDLGS-REVDQF-----DVSRNGD 164
            ++   + +N+G Y  P  +E++               D  S + ++Q      D  ++G+
Sbjct: 568  ENRVQEAQNLGGYLPPQLREVSLGLNNRSANSREDILDNSSLQRLNQLNSRTNDAGQSGE 627

Query: 165  ----GQFMEQDRSNGPSINSRENEKDARNFETSGSDSSPTRGKTPIDRMD-VDHIKGGSY 329
                G+ +E +R     I  ++ E   +N ETSGSDSS TR + P D++  V+ I     
Sbjct: 628  AGTKGEMIEHERFIATCIEMKDIE--TQNVETSGSDSSSTRSRHPTDQVGKVEQINCNGP 685

Query: 330  DEAAEEDKVDATHSDEKQQRKRKRTVMNDKQIALIESALIDEPDMHRNAISLRSWADKLS 509
             E  E++ V+A H +EKQQRKRKRT+MNDKQI+L+E AL+ EPDM RN   L  WA KLS
Sbjct: 686  GEVREDETVEAQH-EEKQQRKRKRTIMNDKQISLVEKALMGEPDMQRNKNLLEKWAVKLS 744

Query: 510  IHGAEVTTSRLKNWXXXXXXXXXXXXXDVSL----ERLGSSGHL------DSPRSSMDDA 659
             HG+EVT S+LKNW             D  +    + L   G L       SP S ++D 
Sbjct: 745  DHGSEVTKSQLKNWLNNRKARLARAAKDGRVLSEGDSLDKQGGLLTLLPCGSPGSPVEDV 804

Query: 660  RVSLAARGSVENEATNIEVTASVDEEDMGT-SRRNNPARTLSFEPGQYVMLVDEMGNEVG 836
             +  AAR +          +  + E      +  + PA  ++   G YV+L++E   E+G
Sbjct: 805  GILSAARENAPRLTGLAPSSTCLTENTTAVPAASSEPAVCVA---GDYVVLINEKAEEIG 861

Query: 837  KGSVVQVNGNW 869
            +G V QV+G W
Sbjct: 862  RGKVCQVSGKW 872

