BLASTX nr result

ID: Mentha29_contig00004759 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00004759
         (1297 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004243958.1| PREDICTED: putative GATA transcription facto...   137   1e-29
ref|XP_006346565.1| PREDICTED: GATA transcription factor 21-like...   130   2e-27
ref|XP_002514107.1| hypothetical protein RCOM_1046780 [Ricinus c...   129   3e-27
ref|XP_002282173.1| PREDICTED: uncharacterized protein LOC100261...   125   4e-26
ref|XP_003546455.1| PREDICTED: putative GATA transcription facto...   125   5e-26
ref|XP_007012845.1| GATA type zinc finger transcription factor f...   124   7e-26
ref|XP_006451458.1| hypothetical protein CICLE_v10009004mg [Citr...   119   2e-24
ref|XP_003595135.1| GATA transcription factor [Medicago truncatu...   111   6e-22
ref|XP_007154661.1| hypothetical protein PHAVU_003G137100g [Phas...   111   8e-22
gb|EXB38836.1| Putative GATA transcription factor 22 [Morus nota...   109   3e-21
ref|XP_004507931.1| PREDICTED: GATA transcription factor 21-like...   108   4e-21
gb|AFK42954.1| unknown [Medicago truncatula]                          108   5e-21
ref|XP_002308561.2| hypothetical protein POPTR_0006s24560g [Popu...    92   4e-16
gb|EYU27295.1| hypothetical protein MIMGU_mgv1a020800mg [Mimulus...    92   5e-16
ref|XP_007012281.1| GATA type zinc finger transcription factor f...    92   5e-16
ref|XP_004251667.1| PREDICTED: putative GATA transcription facto...    92   5e-16
ref|XP_007138732.1| hypothetical protein PHAVU_009G232700g [Phas...    92   6e-16
gb|AGU42761.1| GATA nirate-inducible carbon-metabolism involved ...    91   1e-15
ref|XP_003543725.1| PREDICTED: GATA transcription factor 21-like...    90   2e-15
ref|XP_006353530.1| PREDICTED: putative GATA transcription facto...    89   4e-15

>ref|XP_004243958.1| PREDICTED: putative GATA transcription factor 22-like [Solanum
           lycopersicum]
          Length = 266

 Score =  137 bits (345), Expect = 1e-29
 Identities = 103/286 (36%), Positives = 129/286 (45%), Gaps = 16/286 (5%)
 Frame = +2

Query: 152 DENQETFTFGLNHHRHHQIVXXXXXXXXXCHIFFNQAQDH-AGFFXXXXXXXXXXXXXDD 328
           D     F FGLN+  ++ +V          H FFN   +  A F              + 
Sbjct: 5   DHITPNFPFGLNNSNNNSLVTPNY------HFFFNSTTNQTASFHHQHTQYYMQHEQLEV 58

Query: 329 XXXXXXXXXXEIENKVEKGLKLSLQKKEDENMLAKERSSGDRVKWMPRLMKKKDEEGSMM 508
                       +N+V  GLKLSL K+ED+ +L+ E    D+ K          ++ S  
Sbjct: 59  DNDGGSSYDLGKKNEVGSGLKLSLWKREDK-LLSSEIKKLDQEK----------KKNSTN 107

Query: 509 NGCAKMEVKKVKQSSSVETDLXXXXXXXXXXTPIRVCSDCHTTKTPLWRSGPKGPKSLCN 688
           + C K+++   KQ   ++TD            PIRVC+DC+TTKTPLWRSGPKGPKSLCN
Sbjct: 108 SACIKLKLGDQKQKP-IQTDYCSNNI------PIRVCTDCNTTKTPLWRSGPKGPKSLCN 160

Query: 689 ACGIRQRK---XXXXXXXXXXXXXXXXXXCDTXXXXXXXXXXXXXXXXRCKSGGE----- 844
           ACGIRQRK                                        RCK G       
Sbjct: 161 ACGIRQRKARRAMAAAAAEGKTDQKVQQHKQNITTKVTSNNDVKPLKKRCKFGPSSSSTN 220

Query: 845 -------LEEFLIRLSEKLAIHRVFPQDEKEAAILLMALSSGLVHG 961
                   E+FLI LS KLA  ++FPQDE EAAILLMALSSGLVHG
Sbjct: 221 NAPKKLGFEDFLINLSNKLAFQQIFPQDEMEAAILLMALSSGLVHG 266


>ref|XP_006346565.1| PREDICTED: GATA transcription factor 21-like [Solanum tuberosum]
          Length = 222

 Score =  130 bits (326), Expect = 2e-27
 Identities = 87/218 (39%), Positives = 108/218 (49%), Gaps = 19/218 (8%)
 Frame = +2

Query: 365 ENKVEKGLKLSLQKKEDENMLAKERSSGDRVKWMPRLMKKKDEEGSMMNGCAKMEVKKVK 544
           +NK   GLKLSL K+ED+ +++ E    D+ +          ++    N C K+++   K
Sbjct: 22  KNKGGSGLKLSLWKREDKLVMSSEIKDLDQER----------KKNITNNDCIKLKLGDQK 71

Query: 545 QSSSVETDLXXXXXXXXXXTPIRVCSDCHTTKTPLWRSGPKGPKSLCNACGIRQRKXXXX 724
           Q   ++TD            PIRVC+DC+TTKTPLWRSGPKGPKSLCNACGIRQRK    
Sbjct: 72  QQP-IQTDYSSNNI------PIRVCTDCNTTKTPLWRSGPKGPKSLCNACGIRQRKARRA 124

Query: 725 XXXXXXXXXXXXXXC-------DTXXXXXXXXXXXXXXXXRCKSGGE------------L 847
                                                   RCK G               
Sbjct: 125 MAAAANGKTDHQTAMKIKVQQHKPNITKVRTNNHVTPFKKRCKLGPSSSGTNNAPKKLGF 184

Query: 848 EEFLIRLSEKLAIHRVFPQDEKEAAILLMALSSGLVHG 961
           E+ LI LS +LA  ++FPQDEKEAAILLMALSSGLVHG
Sbjct: 185 EDLLINLSNQLAFQQIFPQDEKEAAILLMALSSGLVHG 222


>ref|XP_002514107.1| hypothetical protein RCOM_1046780 [Ricinus communis]
           gi|223546563|gb|EEF48061.1| hypothetical protein
           RCOM_1046780 [Ricinus communis]
          Length = 312

 Score =  129 bits (324), Expect = 3e-27
 Identities = 100/301 (33%), Positives = 130/301 (43%), Gaps = 37/301 (12%)
 Frame = +2

Query: 170 FTFGLNHHRHH-QIVXXXXXXXXXCH--------IFFNQAQDHAGFFXXXXXXXXXXXXX 322
           FT  LN  +HH Q++                   IF N  Q+  G++             
Sbjct: 12  FTIDLNEDQHHHQLIFCSKTTTEDASSSSSISYPIFINPPQEEVGYYHKELQPLHHQEVD 71

Query: 323 DDXXXXXXXXXXEI-ENKVEKGLKLSLQKKEDENMLAKERSSGDRVKWMP---RLMKKKD 490
           +            I +N+ E G +LS+ KKED++   +++     VKWM    RLM+K  
Sbjct: 72  NIYASHGRSWDHRIIKNENENGQELSVCKKEDKSTSIEDQRDNSSVKWMSSKMRLMRKMM 131

Query: 491 EEGSMMN------GCAKMEVKKVKQSSSVETDLXXXXXXXXXXTPIRVCSDCHTTKTPLW 652
                +N         K+E K+  +S  ++ D             IRVCSDC+TTKTPLW
Sbjct: 132 TTDQTVNTTQHTSSMHKLEDKEKSRSLPLQDDYSSKNLSDNSNNTIRVCSDCNTTKTPLW 191

Query: 653 RSGPKGPKSLCNACGIRQRKXXXXXXXXXXXXXXXXXXCDT--------XXXXXXXXXXX 808
           RSGP+GPKSLCNACGIRQRK                   DT                   
Sbjct: 192 RSGPRGPKSLCNACGIRQRKARRALAAAQASANGTIFAPDTAAMKTNKVQNKEKRTNNSH 251

Query: 809 XXXXXRCK----SGGELEEFLIR------LSEKLAIHRVFPQDEKEAAILLMALSSGLVH 958
                RCK    S G  ++          LS+  A  ++FPQDEKEAAILLMALS GLVH
Sbjct: 252 LPFKKRCKFTAQSRGSRKKLCFEDLSSTILSKNSAFQQLFPQDEKEAAILLMALSYGLVH 311

Query: 959 G 961
           G
Sbjct: 312 G 312


>ref|XP_002282173.1| PREDICTED: uncharacterized protein LOC100261004 [Vitis vinifera]
           gi|297738668|emb|CBI27913.3| unnamed protein product
           [Vitis vinifera]
          Length = 309

 Score =  125 bits (314), Expect = 4e-26
 Identities = 102/299 (34%), Positives = 126/299 (42%), Gaps = 35/299 (11%)
 Frame = +2

Query: 170 FTFGLNHHRHHQIVXXXXXXXXX-------CHIFFNQAQDHAGFFXXXXXXXXXXXXXDD 328
           F   LN  +HHQ++                C IFF+  ++  G                D
Sbjct: 14  FPLQLNEDQHHQLLFSPKPQPSSSSSSSLTCPIFFSPTKEQGGCHYRDLHQAQPQQEAHD 73

Query: 329 XXXXXXXXXXE--IENKVEKGLKLSLQKKEDENMLAKERSSGDRVKWMP---RLMKK--- 484
                        +E++ + GLKL++ K ED N   +  S    VKWM    R+M+K   
Sbjct: 74  KFVFRGGSYDHPTLESESDNGLKLTIWKTEDRN---ENHSENGSVKWMSSKMRVMQKMMI 130

Query: 485 KDEEGSMMNGCAKMEVKKVKQSS-SVETDLXXXXXXXXXXT-PIRVCSDCHTTKTPLWRS 658
            D+ G+       +     KQ S   ETD              IRVC+DC+TTKTPLWRS
Sbjct: 131 SDQTGAQKPSNTALNFGDHKQQSLPSETDYNSINSSNINSNNTIRVCADCNTTKTPLWRS 190

Query: 659 GPKGPKSLCNACGIRQRKXXXXXXXXXXXXXXXXXXCDT--------XXXXXXXXXXXXX 814
           GP+GPKSLCNACGIRQRK                   +T                     
Sbjct: 191 GPRGPKSLCNACGIRQRKARRAMAAAAATANGTILPTNTAPTKTKAKHKDKKSSNGHVSH 250

Query: 815 XXXRCKSGGE---------LEEFLIRLSEKLAIHRVFPQDE-KEAAILLMALSSGLVHG 961
              RCK              E+F I LS+  A HRVF QDE KEAAILLMALS GLVHG
Sbjct: 251 YKKRCKLAAAPSCETKKLCFEDFTISLSKNSAFHRVFLQDEIKEAAILLMALSCGLVHG 309


>ref|XP_003546455.1| PREDICTED: putative GATA transcription factor 22-like [Glycine max]
          Length = 315

 Score =  125 bits (313), Expect = 5e-26
 Identities = 90/231 (38%), Positives = 108/231 (46%), Gaps = 32/231 (13%)
 Frame = +2

Query: 365 ENKVEKGLKLSLQKKED--ENMLAKERSSGDRVKWMP---RLMKK---------KDEEGS 502
           ENK +  LKL + KKED  EN   ++ S+    KWMP   R+M++          D EG 
Sbjct: 83  ENKSD--LKLRVWKKEDKCENFQGEDNST----KWMPLKMRMMRRLMVSDQTGSDDTEGM 136

Query: 503 MMNGCA-KMEVKKVKQSSSVETDLXXXXXXXXXXTPIRVCSDCHTTKTPLWRSGPKGPKS 679
           + N    K E K    S     D             +RVCSDCHTTKTPLWRSGPKGPKS
Sbjct: 137 ISNSQKIKYEEKNSPLSPLGTDDSNYNSSSNHSNITVRVCSDCHTTKTPLWRSGPKGPKS 196

Query: 680 LCNACGIRQRKXXXXXXXXXXXXXXXXXXCDTXXXXXXXXXXXXXXXXRC---------- 829
           LCNACGIRQRK                   +                 +           
Sbjct: 197 LCNACGIRQRKVRRAIAAAATSNGTNPVEAEKSQVKKGNTLHSKGMKSKTEGAQQMKKNR 256

Query: 830 -------KSGGELEEFLIRLSEKLAIHRVFPQDEKEAAILLMALSSGLVHG 961
                  K  G  E+  +RLS+  A+ +VFPQDEKEAAILLMALS GL+HG
Sbjct: 257 KLGARYRKRFGAFEDLTVRLSKNFALQQVFPQDEKEAAILLMALSYGLLHG 307


>ref|XP_007012845.1| GATA type zinc finger transcription factor family protein, putative
           [Theobroma cacao] gi|508783208|gb|EOY30464.1| GATA type
           zinc finger transcription factor family protein,
           putative [Theobroma cacao]
          Length = 302

 Score =  124 bits (312), Expect = 7e-26
 Identities = 88/218 (40%), Positives = 105/218 (48%), Gaps = 23/218 (10%)
 Frame = +2

Query: 377 EKGLKLSLQKKEDENMLAKERSSGDRVKWMP---RLMKK---KDEEGSMMNGCAKMEVKK 538
           + GL LSL+KKE+ N   +   S    KWM    R+M+K    D      +   K+E  K
Sbjct: 88  DSGLNLSLRKKEEGNEHHQIEDSS--AKWMSSKMRMMRKMMSSDRADLSNSSTPKLEEPK 145

Query: 539 VKQSSSVETDLXXXXXXXXXXTPIRVCSDCHTTKTPLWRSGPKGPKSLCNACGIRQRKXX 718
            + SSS +             T IRVC+DC+TTKTPLWRSGP+GPKSLCNACGIRQRK  
Sbjct: 146 QQPSSSPDNSSNSSYNNNDNIT-IRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKAR 204

Query: 719 XXXXXXXXXXXXXXXXCDT---------XXXXXXXXXXXXXXXXRCKSGGE--------L 847
                             T                         +CK   +         
Sbjct: 205 RAMAAAAAANGAIVAAQTTPTMKSKVQDKSKRSSNSGCVAQLKKKCKHSSQSQGRKKLCF 264

Query: 848 EEFLIRLSEKLAIHRVFPQDEKEAAILLMALSSGLVHG 961
           E+  I LS+  A HRVFPQDEKEAAILLMALS GLVHG
Sbjct: 265 EDLRIILSKNSAFHRVFPQDEKEAAILLMALSYGLVHG 302


>ref|XP_006451458.1| hypothetical protein CICLE_v10009004mg [Citrus clementina]
           gi|568843031|ref|XP_006475428.1| PREDICTED: putative
           GATA transcription factor 22-like [Citrus sinensis]
           gi|557554684|gb|ESR64698.1| hypothetical protein
           CICLE_v10009004mg [Citrus clementina]
          Length = 306

 Score =  119 bits (299), Expect = 2e-24
 Identities = 85/218 (38%), Positives = 103/218 (47%), Gaps = 25/218 (11%)
 Frame = +2

Query: 383 GLKLSLQKKEDENMLAKERSSGDRVKWMP---RLMKKKDEEGSMMNGCAKMEV-KKVKQS 550
           GLKLS+  +++E     +  +   VKWM    RLMKK            K+E  +K   S
Sbjct: 92  GLKLSMSSEKEERNDQNQSENSSSVKWMSSKMRLMKKMMYSSPDAAAMQKLEDHQKQPPS 151

Query: 551 SSVETDLXXXXXXXXXXTPIRVCSDCHTTKTPLWRSGPKGPKSLCNACGIRQRKXXXXXX 730
           SS+E D             IRVC+DC+TTKTPLWRSGP+GPKSLCNACGIRQRK      
Sbjct: 152 SSLEPD---NGNNNNNTNTIRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMA 208

Query: 731 XXXXXXXXXXXXCD--------TXXXXXXXXXXXXXXXXRCKSGG-----------ELEE 853
                        D        +                RCK                E+
Sbjct: 209 AAAANGTAVQLAADDTSSNKKKSKTPRPSNNNSCLPFKKRCKYNSNSPSRGKKKLCSFED 268

Query: 854 FLIRLSE--KLAIHRVFPQDEKEAAILLMALSSGLVHG 961
             + LS+    A+ RVFPQ+EKEAAILLMALS GLVHG
Sbjct: 269 LTLNLSKNNSSALQRVFPQEEKEAAILLMALSYGLVHG 306


>ref|XP_003595135.1| GATA transcription factor [Medicago truncatula]
           gi|355484183|gb|AES65386.1| GATA transcription factor
           [Medicago truncatula]
          Length = 297

 Score =  111 bits (278), Expect = 6e-22
 Identities = 77/211 (36%), Positives = 105/211 (49%), Gaps = 14/211 (6%)
 Frame = +2

Query: 371 KVEKGLKLSLQKKEDENMLAKERSSGDRVKWMP---RLMKKKDEEGSMMNGCAKMEVKKV 541
           +VEK  K   +++++EN   + R S   +KWMP   R++K+  E+        + ++K++
Sbjct: 86  EVEKDAKDWKKEEDNENFRDEGRIS---MKWMPSKKRMIKRMMEDQRASEQEFEKQIKQL 142

Query: 542 KQSSSVETDLXXXXXXXXXXTPIRVCSDCHTTKTPLWRSGPKGPKSLCNACGIRQRKXXX 721
             +     D           + +RVC+DCHTTKTPLWRSGP GPKSLCNACGIRQRK   
Sbjct: 143 SPNLVGTED--SSNNNFSNNSTVRVCTDCHTTKTPLWRSGPTGPKSLCNACGIRQRKARR 200

Query: 722 XXXXXXXXXXXXXXXCDTXXXXXXXXXXXXXXXXRC----KSGGE-------LEEFLIRL 868
                                             +C    K  G+        E+ +   
Sbjct: 201 ALAAAANGETLVVAEKPYVKGKKLQIKRKRSKTDQCAQLLKRKGKSENKCNNFEDLITSW 260

Query: 869 SEKLAIHRVFPQDEKEAAILLMALSSGLVHG 961
           S  LA H+VFPQD KEAAILLMALSSGL++G
Sbjct: 261 SNNLASHQVFPQDVKEAAILLMALSSGLLNG 291


>ref|XP_007154661.1| hypothetical protein PHAVU_003G137100g [Phaseolus vulgaris]
           gi|561028015|gb|ESW26655.1| hypothetical protein
           PHAVU_003G137100g [Phaseolus vulgaris]
          Length = 309

 Score =  111 bits (277), Expect = 8e-22
 Identities = 77/225 (34%), Positives = 106/225 (47%), Gaps = 30/225 (13%)
 Frame = +2

Query: 377 EKGLKLSLQK---KEDENMLAKERSSGDRVKWMPRLMKKK---DEEGSMMNGCAKMEVKK 538
           E  LK+++ K   + +++  A E  S + +    R+M+K    D+ G+ +      + + 
Sbjct: 87  ESELKVAVWKNKERSEDHEAAAEDGSVNLMSLKMRMMRKTMVPDQTGAYIEDRTMHKFED 146

Query: 539 VKQSSS---VETDLXXXXXXXXXXTPIRVCSDCHTTKTPLWRSGPKGPKSLCNACGIRQR 709
            KQ  S    +               +RVC+DCHTTKTPLWRSGP+GPKSLCNACGIRQR
Sbjct: 147 QKQPLSPLGTDNSSSSNNYSNHSNNTVRVCADCHTTKTPLWRSGPRGPKSLCNACGIRQR 206

Query: 710 KXXXXXXXXXXXXXXXXXXCDTXXXXXXXXXXXXXXXXRCKSGGEL-------------- 847
           K                   +T                R +   ++              
Sbjct: 207 K--ARRAMAAAASGNGTVILETQKSVKGNKLQKKEKKTRTQGAPQMKKKRNHGVGAKPSQ 264

Query: 848 -------EEFLIRLSEKLAIHRVFPQDEKEAAILLMALSSGLVHG 961
                  E+  +RL + LA+H+VFPQDEKEAAILLMALS GLVHG
Sbjct: 265 SRNKFGFEDLTLRLRKSLAMHQVFPQDEKEAAILLMALSYGLVHG 309


>gb|EXB38836.1| Putative GATA transcription factor 22 [Morus notabilis]
          Length = 335

 Score =  109 bits (272), Expect = 3e-21
 Identities = 84/238 (35%), Positives = 108/238 (45%), Gaps = 43/238 (18%)
 Frame = +2

Query: 377 EKGLKLSLQKKE--------DENMLAKERSSGDRVKWMP---RLMKK--------KDEEG 499
           +  LKLS+ K          D++    + ++G   KWMP   R+M+K          +  
Sbjct: 98  QNDLKLSIWKSSTEDSNYDHDKSSHVSDNNAGYSAKWMPSKMRMMRKMIVNPDQTNIDHH 157

Query: 500 SMMNGCAKME--VKKVKQSSSVETD-LXXXXXXXXXXTPIRVCSDCHTTKTPLWRSGPKG 670
           + +N   K +  +K+   +S + TD              IRVC+DC+TTKTPLWRSGP+G
Sbjct: 158 TPLNFTHKFDQVMKRKHPASPLGTDHSSTSSSNNNNNNTIRVCADCNTTKTPLWRSGPRG 217

Query: 671 PKSLCNACGIRQRKXXXXXXXXXXXXXXXXXXCDT------------XXXXXXXXXXXXX 814
           PKSLCNACGIRQRK                   D                          
Sbjct: 218 PKSLCNACGIRQRKARRAMAAAAAAANGTILATDATTMKSSTKVQRKEKKPKNGNGVVPQ 277

Query: 815 XXXRCK-----SGGE----LEEFLIRLSEKLAIHRVFPQDEKEAAILLMALSSGLVHG 961
              RCK     S G      E+  I +S+  A  RVFPQDEK+AAILLMALS GLVHG
Sbjct: 278 FKKRCKLTASPSRGRKKICFEDLAISISKNSAFQRVFPQDEKDAAILLMALSYGLVHG 335


>ref|XP_004507931.1| PREDICTED: GATA transcription factor 21-like [Cicer arietinum]
          Length = 266

 Score =  108 bits (271), Expect = 4e-21
 Identities = 80/234 (34%), Positives = 101/234 (43%), Gaps = 37/234 (15%)
 Frame = +2

Query: 371 KVEKGLKLSLQKKEDENMLAKERSSGDRVKWMPRLMKKKDEEGSMMNGCAKMEV------ 532
           +VEK +  + +K +  +M  K       ++W      KK+E   M N    +E       
Sbjct: 44  EVEKMIPSTAEKHDSRSMKQKLT-----IRW------KKEESDEMNNNIESVESADTKMI 92

Query: 533 -------KKVKQSSSVETDLXXXXXXXXXXTP--IRVCSDCHTTKTPLWRSGPKGPKSLC 685
                  KK  + S V  D           T   +RVCSDC+TTKTPLWRSGPKGPK+LC
Sbjct: 93  DSDDVPNKKQLEVSPVGRDNSSSSSNNNYSTTPTVRVCSDCNTTKTPLWRSGPKGPKTLC 152

Query: 686 NACGIRQRKXXXXXXXXXXXXXXXXXXCDTXXXXXXXXXXXXXXXXRCKSGGEL------ 847
           NACGIRQRK                   D                 +     ++      
Sbjct: 153 NACGIRQRKARRAMAAAAAAAANGVMTVDKTSSVKQNKLQKKENKSKIHHSSKVPHMKKK 212

Query: 848 ----------------EEFLIRLSEKLAIHRVFPQDEKEAAILLMALSSGLVHG 961
                           E+F + LS  LA+ +VFPQDEKEAAILLMALS GLVHG
Sbjct: 213 RKVETKPSQTTNNFSFEDFTLMLSNNLAVQQVFPQDEKEAAILLMALSYGLVHG 266


>gb|AFK42954.1| unknown [Medicago truncatula]
          Length = 302

 Score =  108 bits (270), Expect = 5e-21
 Identities = 76/211 (36%), Positives = 104/211 (49%), Gaps = 14/211 (6%)
 Frame = +2

Query: 371 KVEKGLKLSLQKKEDENMLAKERSSGDRVKWMP---RLMKKKDEEGSMMNGCAKMEVKKV 541
           +VEK  K   +++++EN   + R S   +KWMP   R++K+  E+        + ++K++
Sbjct: 91  EVEKDAKDWKKEEDNENFRDEGRIS---MKWMPSKKRMIKRMMEDQRASEQEFEKQIKQL 147

Query: 542 KQSSSVETDLXXXXXXXXXXTPIRVCSDCHTTKTPLWRSGPKGPKSLCNACGIRQRKXXX 721
             +     D           + +RVC+DC TTKTPLWRSGP GPKSLCNACGIRQRK   
Sbjct: 148 SPNLVGTED--SSNNNFSNNSTVRVCTDCRTTKTPLWRSGPTGPKSLCNACGIRQRKARR 205

Query: 722 XXXXXXXXXXXXXXXCDTXXXXXXXXXXXXXXXXRC----KSGGE-------LEEFLIRL 868
                                             +C    K  G+        E+ +   
Sbjct: 206 ALAAAANGETLVVAEKPYVKGKKLQIKRKRSKTDQCAQLLKRKGKSENKCNNFEDLITSW 265

Query: 869 SEKLAIHRVFPQDEKEAAILLMALSSGLVHG 961
           S  LA H+VFPQD KEAAILLMALSSGL++G
Sbjct: 266 SNNLASHQVFPQDVKEAAILLMALSSGLLNG 296


>ref|XP_002308561.2| hypothetical protein POPTR_0006s24560g [Populus trichocarpa]
           gi|118487597|gb|ABK95624.1| unknown [Populus
           trichocarpa] gi|550337006|gb|EEE92084.2| hypothetical
           protein POPTR_0006s24560g [Populus trichocarpa]
          Length = 303

 Score = 92.4 bits (228), Expect = 4e-16
 Identities = 69/211 (32%), Positives = 88/211 (41%), Gaps = 22/211 (10%)
 Frame = +2

Query: 392 LSLQKKEDENMLAKERSSGDRVKWMP---RLMKKKDEEGSMMNGCAKMEVKKVKQSSSVE 562
           LS  K ED      E S    VKWMP   RLM+K     S  +    M +K + +  + +
Sbjct: 98  LSSSKMED----GAEESGESSVKWMPSKMRLMQKMTN--SNCSETDHMPMKFMLKFHNQQ 151

Query: 563 TDLXXXXXXXXXXTPIRVCSDCHTTKTPLWRSGPKGPKSLCNACGIRQRKXXXXXXXXXX 742
                        + IRVCSDC+TT TPLWRSGP+GPKSLCNACGIRQRK          
Sbjct: 152 YQNNEINSSSNSNSNIRVCSDCNTTSTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAA 211

Query: 743 XXXXXXXXCDTXXXXXXXXXXXXXXXXRCKSGGE-------------------LEEFLIR 865
                    +                 R     +                    +   + 
Sbjct: 212 AANGTVIAIEASSSTRSTKVNNKVKKSRTNHVSQNKKLSKPPESSLQSQKKLCFKNLALS 271

Query: 866 LSEKLAIHRVFPQDEKEAAILLMALSSGLVH 958
           LS+  A+ +V P D +EAAILLM LS G +H
Sbjct: 272 LSKNPALQQVLPHDVEEAAILLMELSCGFIH 302


>gb|EYU27295.1| hypothetical protein MIMGU_mgv1a020800mg [Mimulus guttatus]
          Length = 315

 Score = 92.0 bits (227), Expect = 5e-16
 Identities = 74/221 (33%), Positives = 97/221 (43%), Gaps = 14/221 (6%)
 Frame = +2

Query: 92  MNLNLAPSPFPMEQTKEKEHDENQETFTFGLNHHRHHQIVXXXXXXXXXCHIFFNQAQDH 271
           MNLN  P    M   K+ +   NQ+   F L    H+Q+V           +FF     H
Sbjct: 1   MNLNSLPILEQMNNHKDHDQQHNQQQLPFALIA-THNQLVSSSSSSSSS-QLFFTTPPHH 58

Query: 272 AGFFXXXXXXXXXXXXXDDXXXXXXXXXXEIENKVEKGLKLSLQKKEDENMLAKERSSGD 451
             +               +             N    GLK++L KKE +   A + +   
Sbjct: 59  QLYNQPHFQDHMIKNSNSNNN----------NNNNNNGLKITLWKKEPDEGAAADINP-- 106

Query: 452 RVKWMP---RLMKKKDEEGSMMNGCAKMEVKKVKQSSSVETDLXXXXXXXXXXT------ 604
            VKWM    RLMK+ ++     N  AK ++   +  SS  + L          +      
Sbjct: 107 -VKWMSSKIRLMKRMNK-----NIPAKSKIDSDQNPSSNSSLLESSDHLSSGNSSSYNNN 160

Query: 605 -----PIRVCSDCHTTKTPLWRSGPKGPKSLCNACGIRQRK 712
                PIRVC+DC+TTKTPLWRSGPKGPKSLCNACGIRQRK
Sbjct: 161 NNSNYPIRVCADCNTTKTPLWRSGPKGPKSLCNACGIRQRK 201



 Score = 65.5 bits (158), Expect = 5e-08
 Identities = 32/38 (84%), Positives = 34/38 (89%)
 Frame = +2

Query: 848 EEFLIRLSEKLAIHRVFPQDEKEAAILLMALSSGLVHG 961
           EEFLI LS  L+IHRVFP DEK+AAILLMALSSGLVHG
Sbjct: 278 EEFLINLSNNLSIHRVFPDDEKDAAILLMALSSGLVHG 315


>ref|XP_007012281.1| GATA type zinc finger transcription factor family protein, putative
           [Theobroma cacao] gi|508782644|gb|EOY29900.1| GATA type
           zinc finger transcription factor family protein,
           putative [Theobroma cacao]
          Length = 311

 Score = 92.0 bits (227), Expect = 5e-16
 Identities = 72/226 (31%), Positives = 93/226 (41%), Gaps = 18/226 (7%)
 Frame = +2

Query: 89  PMNLNLAPSPFPMEQTKEKEHDE------------NQETFTFGLNHHRHHQIVXXXXXXX 232
           P+ LN  P PFP+ + KE++H +            +  TF          Q V       
Sbjct: 3   PVYLNPPPLPFPLVKLKEEQHLQLFLSPQQAATSLSASTFLNSNTASHQDQTVTKPEESK 62

Query: 233 XXCHIFFNQAQDHAGFFXXXXXXXXXXXXXDDXXXXXXXXXXEIENKVEKGLKLSLQKKE 412
              H   NQ   H G               D            ++     G  LS  +KE
Sbjct: 63  PHDHKG-NQFMTHEGSI-------------DQQASSSSSLQSAVDQSTANGYNLSFSRKE 108

Query: 413 DENMLAKERSSGDRVKWMP---RLMKKKDEEGSMMNGCAKMEVKKVKQSSSVET---DLX 574
           D +  +    +G  VKWM    RLMKK      M + C+  + K  K +   +    D  
Sbjct: 109 DGDCESAS-GNGSSVKWMSSKVRLMKKM-----MNSNCSGADDKPPKFTQRFQYPVHDSD 162

Query: 575 XXXXXXXXXTPIRVCSDCHTTKTPLWRSGPKGPKSLCNACGIRQRK 712
                      +RVCSDC+TT TPLWRSGP+GPKSLCNACGIRQRK
Sbjct: 163 ETNSFSKANNTVRVCSDCNTTTTPLWRSGPRGPKSLCNACGIRQRK 208


>ref|XP_004251667.1| PREDICTED: putative GATA transcription factor 22-like [Solanum
           lycopersicum]
          Length = 326

 Score = 92.0 bits (227), Expect = 5e-16
 Identities = 71/208 (34%), Positives = 88/208 (42%), Gaps = 8/208 (3%)
 Frame = +2

Query: 113 SPFPMEQTKEKEHD---ENQETFTFGLNHHRHHQIVXXXXXXXXXCHIFFN-----QAQD 268
           S FP E T E  HD    N    +    ++ ++Q           C  FFN       QD
Sbjct: 12  SSFPFELTNEVHHDYLSHNNNNMSLVSPYNNNYQFASSSTNSS--CQNFFNISTTTNIQD 69

Query: 269 HAGFFXXXXXXXXXXXXXDDXXXXXXXXXXEIENKVEKGLKLSLQKKEDENMLAKERSSG 448
            +G+              D+           ++ K  KGLKL+L KK            G
Sbjct: 70  QSGY-DYQFHQPQHHHEVDNFASRSSGSHDHVDKK-NKGLKLTLWKK-----------GG 116

Query: 449 DRVKWMPRLMKKKDEEGSMMNGCAKMEVKKVKQSSSVETDLXXXXXXXXXXTPIRVCSDC 628
            +VK                     ++V+  KQ   +ETD            PIRVCSDC
Sbjct: 117 QKVK--------------------NLKVEDQKQQI-IETDYSSNSSSNNNIIPIRVCSDC 155

Query: 629 HTTKTPLWRSGPKGPKSLCNACGIRQRK 712
           +TTKTPLWRSGPKGPKSLCNACGIRQRK
Sbjct: 156 NTTKTPLWRSGPKGPKSLCNACGIRQRK 183



 Score = 64.3 bits (155), Expect = 1e-07
 Identities = 32/38 (84%), Positives = 33/38 (86%)
 Frame = +2

Query: 848 EEFLIRLSEKLAIHRVFPQDEKEAAILLMALSSGLVHG 961
           E+F I LS  LAIHRVFPQDEKEAAILLMALSS LVHG
Sbjct: 289 EDFFINLSNNLAIHRVFPQDEKEAAILLMALSSDLVHG 326


>ref|XP_007138732.1| hypothetical protein PHAVU_009G232700g [Phaseolus vulgaris]
           gi|561011819|gb|ESW10726.1| hypothetical protein
           PHAVU_009G232700g [Phaseolus vulgaris]
          Length = 306

 Score = 91.7 bits (226), Expect = 6e-16
 Identities = 56/125 (44%), Positives = 71/125 (56%), Gaps = 7/125 (5%)
 Frame = +2

Query: 359 EIENKVEKGLKLSLQKKED--ENMLAKERSSGDRVKWMPRLMKKKDEEGSMMNGCAKMEV 532
           +IEN+ +  LKL + KKE+  +N+  ++ S+      M R M   DE  S +   +  + 
Sbjct: 75  KIENRSD--LKLRVWKKEEGCDNLKGEDSSTMSSKMRMVRKMIVSDETDSDIADISSSKQ 132

Query: 533 KKVKQ-----SSSVETDLXXXXXXXXXXTPIRVCSDCHTTKTPLWRSGPKGPKSLCNACG 697
            K K+     S  V  D            P+RVC DCHTTKTPLWRSGPKGPKSLCNACG
Sbjct: 133 IKYKKKNPELSPLVTDDSNCNSSSNQNSVPLRVCVDCHTTKTPLWRSGPKGPKSLCNACG 192

Query: 698 IRQRK 712
           IRQRK
Sbjct: 193 IRQRK 197


>gb|AGU42761.1| GATA nirate-inducible carbon-metabolism involved protein [Populus
           nigra x Populus x canadensis]
          Length = 303

 Score = 90.9 bits (224), Expect = 1e-15
 Identities = 66/209 (31%), Positives = 87/209 (41%), Gaps = 20/209 (9%)
 Frame = +2

Query: 392 LSLQKKEDENMLAKERSSGDRVKWMP-RLMKKKDEEGSMMNGCAKMEVKKVKQSSSVETD 568
           LS  K ED      E S    VKWMP ++M  +    S  +    M +K + +  + +  
Sbjct: 98  LSSSKMED----GAEESGESSVKWMPSKMMLMQKMTNSNCSETDHMPMKFMLKFHNQQYW 153

Query: 569 LXXXXXXXXXXTPIRVCSDCHTTKTPLWRSGPKGPKSLCNACGIRQRKXXXXXXXXXXXX 748
                      + IRVCSDC+TT TPLWRSGP+GPKSLCNACGIRQRK            
Sbjct: 154 NNEINSSSNSNSNIRVCSDCNTTSTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAAA 213

Query: 749 XXXXXXCDTXXXXXXXXXXXXXXXXRCKSGGE-------------------LEEFLIRLS 871
                  +                 R     +                    +   + LS
Sbjct: 214 NGTVIAIEASSSTRSTKVNNKVKKSRTSHVSQNKKLSKPPESSLQSQKKLCFKNLALSLS 273

Query: 872 EKLAIHRVFPQDEKEAAILLMALSSGLVH 958
           +  A+ +V P D +EAAILLM LS G +H
Sbjct: 274 KNPALQQVLPHDVEEAAILLMELSCGFIH 302


>ref|XP_003543725.1| PREDICTED: GATA transcription factor 21-like [Glycine max]
          Length = 314

 Score = 89.7 bits (221), Expect = 2e-15
 Identities = 54/119 (45%), Positives = 69/119 (57%), Gaps = 11/119 (9%)
 Frame = +2

Query: 389 KLSLQKKEDENMLAKERSSGDRVKWMP---RLMKKK------DEEGSMMNGCAKMEVKKV 541
           K+++ +KE+ N    E  S   VKWMP   R+M+K       D   S  N   K +  K 
Sbjct: 94  KVTVWRKEERNENLAEDGS---VKWMPSKMRIMRKMLVSNQTDAYTSDNNTTHKFDDHKQ 150

Query: 542 KQSS--SVETDLXXXXXXXXXXTPIRVCSDCHTTKTPLWRSGPKGPKSLCNACGIRQRK 712
           + SS   ++ +           + +RVCSDCHTTKTPLWRSGP+GPKSLCNACGIRQRK
Sbjct: 151 QLSSPLGIDDNSSNNYSDKSNNSIVRVCSDCHTTKTPLWRSGPRGPKSLCNACGIRQRK 209


>ref|XP_006353530.1| PREDICTED: putative GATA transcription factor 22-like [Solanum
           tuberosum]
          Length = 323

 Score = 89.0 bits (219), Expect = 4e-15
 Identities = 68/212 (32%), Positives = 85/212 (40%), Gaps = 12/212 (5%)
 Frame = +2

Query: 113 SPFPMEQTKEKEHD-----ENQETFTFGLNHHRHHQIVXXXXXXXXXCHIFFN-----QA 262
           S FP E   E  HD      N       L    ++            C  FFN       
Sbjct: 11  SSFPFELNNEVHHDYLSHHNNNNNNIMSLVSPYNNNYQFSSSSTNSSCQTFFNISTTTNI 70

Query: 263 QDHAGF--FXXXXXXXXXXXXXDDXXXXXXXXXXEIENKVEKGLKLSLQKKEDENMLAKE 436
           QD +G+                D+           +E K  KGLKL+L KK ++ M    
Sbjct: 71  QDQSGYDYHSHQFHQPQHQHEVDNFASRSSGSHDHLEKK-NKGLKLTLCKKGEQKM---- 125

Query: 437 RSSGDRVKWMPRLMKKKDEEGSMMNGCAKMEVKKVKQSSSVETDLXXXXXXXXXXTPIRV 616
                      + +K +D++  +                 +ETD            PIRV
Sbjct: 126 -----------KNLKLEDQKQQI-----------------IETDYSSNSSSNNNIIPIRV 157

Query: 617 CSDCHTTKTPLWRSGPKGPKSLCNACGIRQRK 712
           CSDC+TTKTPLWRSGPKGPKSLCNACGIRQRK
Sbjct: 158 CSDCNTTKTPLWRSGPKGPKSLCNACGIRQRK 189



 Score = 66.6 bits (161), Expect = 2e-08
 Identities = 32/38 (84%), Positives = 34/38 (89%)
 Frame = +2

Query: 848 EEFLIRLSEKLAIHRVFPQDEKEAAILLMALSSGLVHG 961
           E+F + LS  LAIHRVFPQDEKEAAILLMALSSGLVHG
Sbjct: 286 EDFFVNLSNNLAIHRVFPQDEKEAAILLMALSSGLVHG 323

