BLASTX nr result

ID: Astragalus22_contig00013149 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00013149
         (1029 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007154661.1| hypothetical protein PHAVU_003G137100g [Phas...   206   1e-60
ref|XP_020205216.1| GATA transcription factor 21 [Cajanus cajan]...   205   3e-60
ref|XP_006600457.1| PREDICTED: GATA transcription factor 21-like...   202   2e-59
ref|XP_003550634.1| PREDICTED: GATA transcription factor 21-like...   202   3e-59
ref|XP_019464266.1| PREDICTED: GATA transcription factor 21-like...   199   7e-58
ref|XP_017410381.1| PREDICTED: GATA transcription factor 21-like...   197   2e-57
ref|XP_019433018.1| PREDICTED: GATA transcription factor 21-like...   195   2e-56
gb|OIV89729.1| hypothetical protein TanjilG_03518 [Lupinus angus...   195   3e-56
ref|XP_014507425.1| GATA transcription factor 21 isoform X1 [Vig...   194   5e-56
ref|XP_014507426.1| GATA transcription factor 21 isoform X2 [Vig...   187   1e-54
gb|KHN06609.1| GATA transcription factor 21 [Glycine soja]            186   6e-53
ref|XP_003543725.1| PREDICTED: GATA transcription factor 21-like...   186   6e-53
ref|XP_013458498.1| GATA type zinc finger transcription factor f...   184   5e-52
dbj|GAU34770.1| hypothetical protein TSUD_205740 [Trifolium subt...   172   2e-48
gb|PNX96606.1| GATA transcription factor 22-like protein [Trifol...   166   8e-46
gb|KHN35841.1| Putative GATA transcription factor 22 [Glycine soja]   162   9e-44
gb|POF08592.1| putative gata transcription factor 22 [Quercus su...   158   7e-43
ref|XP_020224491.1| GATA transcription factor 21-like [Cajanus c...   159   1e-42
ref|XP_015954445.1| GATA transcription factor 21 [Arachis durane...   157   2e-41
dbj|GAU34769.1| hypothetical protein TSUD_205730 [Trifolium subt...   155   2e-41

>ref|XP_007154661.1| hypothetical protein PHAVU_003G137100g [Phaseolus vulgaris]
 gb|ESW26655.1| hypothetical protein PHAVU_003G137100g [Phaseolus vulgaris]
          Length = 309

 Score =  206 bits (523), Expect = 1e-60
 Identities = 131/257 (50%), Positives = 149/257 (57%), Gaps = 13/257 (5%)
 Frame = +2

Query: 2   EKIIPSSRSWDHSAEENKESSSKRKLVVWK-KDRNENHEAAAEPEGGTXXXXXXXXXXXX 178
           EKI P+  SWDHS  E     S+ K+ VWK K+R+E+HEAAAE +G              
Sbjct: 72  EKINPTRGSWDHSVTE-----SELKVAVWKNKERSEDHEAAAE-DGSVNLMSLKMRMMRK 125

Query: 179 XXXX------VSDH-QHKFQDQKQPLSPLGTVTSGSNNN--NYSNHIVRVCSDCHTTKTP 331
                     + D   HKF+DQKQPLSPLGT  S S+NN  N+SN+ VRVC+DCHTTKTP
Sbjct: 126 TMVPDQTGAYIEDRTMHKFEDQKQPLSPLGTDNSSSSNNYSNHSNNTVRVCADCHTTKTP 185

Query: 332 LWRSGPRGPKSLCNACGIRQRKXXXXXXXXXXXXXXXXXXXXXETTVXXXXXXXXXXXXX 511
           LWRSGPRGPKSLCNACGIRQRK                      T +             
Sbjct: 186 LWRSGPRGPKSLCNACGIRQRKARRAMAAAASGNG---------TVILETQKSVKGNKLQ 236

Query: 512 XGEFASSTY---MNKKKRKLVVGSKPISDNSQSRMKLGFEDLRLSLSKNLALQQVFPQDE 682
             E  + T      KKKR   VG+KP    SQSR K GFEDL L L K+LA+ QVFPQDE
Sbjct: 237 KKEKKTRTQGAPQMKKKRNHGVGAKP----SQSRNKFGFEDLTLRLRKSLAMHQVFPQDE 292

Query: 683 KEAAILLMALSYGLVHG 733
           KEAAILLMALSYGLVHG
Sbjct: 293 KEAAILLMALSYGLVHG 309


>ref|XP_020205216.1| GATA transcription factor 21 [Cajanus cajan]
 gb|KYP36941.1| Putative GATA transcription factor 20 [Cajanus cajan]
          Length = 312

 Score =  205 bits (521), Expect = 3e-60
 Identities = 130/252 (51%), Positives = 144/252 (57%), Gaps = 8/252 (3%)
 Frame = +2

Query: 2   EKIIPSSRSWDHSAEENKESSSKRKLVVWKKD-RNENHEAAAEPEGGTXXXXXXXXXXXX 178
           EKIIP S S D S  E     S++K+ VWKK+ RNEN EA AE                 
Sbjct: 76  EKIIPPSGSRDQSVAE-----SEQKVTVWKKEERNENLEAVAEDGSMNWMSSKMRMTRKM 130

Query: 179 XXXXVSDH------QHKFQDQKQPLSPLGTVTSGSNN-NNYSNHIVRVCSDCHTTKTPLW 337
                +D       +HKF+DQKQPLSPLGT  S SNN +N+ N+ VRVC+DCHTTKTPLW
Sbjct: 131 VVSDQTDACVADNTRHKFEDQKQPLSPLGTDNSSSNNYSNHGNNTVRVCADCHTTKTPLW 190

Query: 338 RSGPRGPKSLCNACGIRQRKXXXXXXXXXXXXXXXXXXXXXETTVXXXXXXXXXXXXXXG 517
           RSGPRGPKSLCNACGIRQRK                     E +V               
Sbjct: 191 RSGPRGPKSLCNACGIRQRKARRAMAAAAAAAGNGTVLVEAEKSVKGNKLQKKEKKSR-- 248

Query: 518 EFASSTYMNKKKRKLVVGSKPISDNSQSRMKLGFEDLRLSLSKNLALQQVFPQDEKEAAI 697
                    KKKRKL  G+KP    SQSR K GFEDL L L KNLA+ QVFPQDEKEAAI
Sbjct: 249 --IEGAPQMKKKRKL--GAKP----SQSRSKFGFEDLTLRLRKNLAMHQVFPQDEKEAAI 300

Query: 698 LLMALSYGLVHG 733
           LLMALSYGLVHG
Sbjct: 301 LLMALSYGLVHG 312


>ref|XP_006600457.1| PREDICTED: GATA transcription factor 21-like isoform X2 [Glycine
           max]
 gb|KRH02717.1| hypothetical protein GLYMA_17G055200 [Glycine max]
          Length = 310

 Score =  202 bits (515), Expect = 2e-59
 Identities = 132/258 (51%), Positives = 144/258 (55%), Gaps = 15/258 (5%)
 Frame = +2

Query: 2   EKIIPSSRSWDHSAEENKESSSKRKLVVWKK--DRNENHEAAAEPEGGTXXXXXXXXXXX 175
           EKIIPSS SWDHS  E++ +    K  VWKK  +RNEN E+ A  +G             
Sbjct: 63  EKIIPSSGSWDHSVAESEHN----KATVWKKAEERNENLESVAAEDGSLKWMPAKMRIMR 118

Query: 176 XXXXXVSDHQ-----------HKFQDQKQPLS-PLGTVTSGSNN-NNYSNHIVRVCSDCH 316
                VSD             HKF DQKQ LS PLGT  S SNN +N+SN+ VRVCSDCH
Sbjct: 119 KML--VSDQTDTYTNSDNNTTHKFDDQKQQLSSPLGTDNSSSNNYSNHSNNTVRVCSDCH 176

Query: 317 TTKTPLWRSGPRGPKSLCNACGIRQRKXXXXXXXXXXXXXXXXXXXXXETTVXXXXXXXX 496
           TTKTPLWRSGPRGPKSLCNACGIRQRK                                 
Sbjct: 177 TTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAASASGNGTVIVEAKKSVKGRNKLQ 236

Query: 497 XXXXXXGEFASSTYMNKKKRKLVVGSKPISDNSQSRMKLGFEDLRLSLSKNLALQQVFPQ 676
                      +  M KKKRKL VGS   +  SQSR K GFEDL L L KNLA+ QVFPQ
Sbjct: 237 KKKEKKTRTEGAAQM-KKKRKLGVGS---AKASQSRNKFGFEDLTLRLRKNLAMHQVFPQ 292

Query: 677 DEKEAAILLMALSYGLVH 730
           DEKEAAILLMALSYGLVH
Sbjct: 293 DEKEAAILLMALSYGLVH 310


>ref|XP_003550634.1| PREDICTED: GATA transcription factor 21-like isoform X1 [Glycine
           max]
 gb|KHN17667.1| Putative GATA transcription factor 22 [Glycine soja]
 gb|KRH02716.1| hypothetical protein GLYMA_17G055200 [Glycine max]
          Length = 322

 Score =  202 bits (515), Expect = 3e-59
 Identities = 132/258 (51%), Positives = 144/258 (55%), Gaps = 15/258 (5%)
 Frame = +2

Query: 2   EKIIPSSRSWDHSAEENKESSSKRKLVVWKK--DRNENHEAAAEPEGGTXXXXXXXXXXX 175
           EKIIPSS SWDHS  E++ +    K  VWKK  +RNEN E+ A  +G             
Sbjct: 75  EKIIPSSGSWDHSVAESEHN----KATVWKKAEERNENLESVAAEDGSLKWMPAKMRIMR 130

Query: 176 XXXXXVSDHQ-----------HKFQDQKQPLS-PLGTVTSGSNN-NNYSNHIVRVCSDCH 316
                VSD             HKF DQKQ LS PLGT  S SNN +N+SN+ VRVCSDCH
Sbjct: 131 KML--VSDQTDTYTNSDNNTTHKFDDQKQQLSSPLGTDNSSSNNYSNHSNNTVRVCSDCH 188

Query: 317 TTKTPLWRSGPRGPKSLCNACGIRQRKXXXXXXXXXXXXXXXXXXXXXETTVXXXXXXXX 496
           TTKTPLWRSGPRGPKSLCNACGIRQRK                                 
Sbjct: 189 TTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAASASGNGTVIVEAKKSVKGRNKLQ 248

Query: 497 XXXXXXGEFASSTYMNKKKRKLVVGSKPISDNSQSRMKLGFEDLRLSLSKNLALQQVFPQ 676
                      +  M KKKRKL VGS   +  SQSR K GFEDL L L KNLA+ QVFPQ
Sbjct: 249 KKKEKKTRTEGAAQM-KKKRKLGVGS---AKASQSRNKFGFEDLTLRLRKNLAMHQVFPQ 304

Query: 677 DEKEAAILLMALSYGLVH 730
           DEKEAAILLMALSYGLVH
Sbjct: 305 DEKEAAILLMALSYGLVH 322


>ref|XP_019464266.1| PREDICTED: GATA transcription factor 21-like [Lupinus
           angustifolius]
 gb|OIV99760.1| hypothetical protein TanjilG_26098 [Lupinus angustifolius]
          Length = 310

 Score =  199 bits (505), Expect = 7e-58
 Identities = 124/252 (49%), Positives = 141/252 (55%), Gaps = 9/252 (3%)
 Frame = +2

Query: 5   KIIP-SSRSWDHSAEENKESSSKRKLVVWKK-DRNENHEAAAE-------PEGGTXXXXX 157
           KIIP S  SWD +A EN ESS   K+ VWKK D  EN +A  E       P         
Sbjct: 71  KIIPLSGSSWDQTASENHESSIGSKVTVWKKEDMAENLQAGDEDGSLKLLPSKMRIMRKM 130

Query: 158 XXXXXXXXXXXVSDHQHKFQDQKQPLSPLGTVTSGSNNNNYSNHIVRVCSDCHTTKTPLW 337
                            KF+DQKQPLSPLGT  S +N   +SN+IVRVCSDCHTTKTPLW
Sbjct: 131 MVSGQTTDSYVGGSSMQKFEDQKQPLSPLGTDNSSNNYPKHSNNIVRVCSDCHTTKTPLW 190

Query: 338 RSGPRGPKSLCNACGIRQRKXXXXXXXXXXXXXXXXXXXXXETTVXXXXXXXXXXXXXXG 517
           RSGPRGPKSLCNACGIRQRK                      T V               
Sbjct: 191 RSGPRGPKSLCNACGIRQRKARRAMAVAAAASENG-------TIVVAAAQKSVKGKEKKS 243

Query: 518 EFASSTYMNKKKRKLVVGSKPISDNSQSRMKLGFEDLRLSLSKNLALQQVFPQDEKEAAI 697
           +   +    K+KRKL+  +KP   +S+SR K  FEDL L LSKN+A +QVFPQDE+EAAI
Sbjct: 244 KVEYAPQQMKRKRKLI--AKP---SSESRNKFSFEDLTLRLSKNVAFKQVFPQDEREAAI 298

Query: 698 LLMALSYGLVHG 733
           LLMALSYGLVHG
Sbjct: 299 LLMALSYGLVHG 310


>ref|XP_017410381.1| PREDICTED: GATA transcription factor 21-like [Vigna angularis]
 gb|KOM29610.1| hypothetical protein LR48_Vigan728s003300 [Vigna angularis]
 dbj|BAT76812.1| hypothetical protein VIGAN_01486900 [Vigna angularis var.
           angularis]
          Length = 306

 Score =  197 bits (502), Expect = 2e-57
 Identities = 127/255 (49%), Positives = 147/255 (57%), Gaps = 11/255 (4%)
 Frame = +2

Query: 2   EKIIPSSRSWDHSAEENKESSSKRKLVVWK-KDRNENHEAAAEPEG-----GTXXXXXXX 163
           EKI P+  SWDHS  +     S+ K+ V K K+R+E+HEAAAE                 
Sbjct: 70  EKINPTMGSWDHSVAQ-----SELKVTVCKQKERSEDHEAAAEDGSVKLMSSKMRMMQKM 124

Query: 164 XXXXXXXXXVSDHQ-HKFQDQKQPLSPLGTVTSGSNN-NNYSNHIVRVCSDCHTTKTPLW 337
                    + D   +KF+D+KQPLSPLGT  S SNN +N+SN+ VRVC+DCHTTKTPLW
Sbjct: 125 MGSDQTGAYIEDSTVNKFEDEKQPLSPLGTDNSSSNNCSNHSNNTVRVCADCHTTKTPLW 184

Query: 338 RSGPRGPKSLCNACGIRQRKXXXXXXXXXXXXXXXXXXXXXETTVXXXXXXXXXXXXXXG 517
           RSGPRGPKSLCNACGIRQRK                      T +               
Sbjct: 185 RSGPRGPKSLCNACGIRQRKARRAMAAAASGNG---------TVIFETEKSVKGNKLQKK 235

Query: 518 EFASSTY---MNKKKRKLVVGSKPISDNSQSRMKLGFEDLRLSLSKNLALQQVFPQDEKE 688
           E  + T      KKKRK  VG+KP    SQSR K GFEDL L L K+LA+ QVFPQDEKE
Sbjct: 236 EKKARTQGAPQMKKKRKHGVGAKP----SQSRNKFGFEDLTLRLRKSLAMHQVFPQDEKE 291

Query: 689 AAILLMALSYGLVHG 733
           AAILLMALSYGLVHG
Sbjct: 292 AAILLMALSYGLVHG 306


>ref|XP_019433018.1| PREDICTED: GATA transcription factor 21-like [Lupinus
           angustifolius]
          Length = 315

 Score =  195 bits (495), Expect = 2e-56
 Identities = 125/253 (49%), Positives = 139/253 (54%), Gaps = 10/253 (3%)
 Frame = +2

Query: 5   KIIPSSRS-WDHSAE-ENKESSSKRKLVVWKKDRNENHEAAAE-------PEGGTXXXXX 157
           KIIPSS S WDHSA  EN ++    K+ VW++DR EN +A AE       P         
Sbjct: 77  KIIPSSESSWDHSAAAENHDNIIGSKVTVWEEDRGENLQADAEDGSMKWMPSKMRIMRKM 136

Query: 158 XXXXXXXXXXXVSDHQHKFQDQKQPLSPLGTVTSGSNNNNYS-NHIVRVCSDCHTTKTPL 334
                            KF+ QKQPLSPLGT  S +N  N+S N+ VRVCSDCHTTKTPL
Sbjct: 137 MASDQTKGSYVAGSSMKKFEHQKQPLSPLGTDNSSNNYPNHSTNNTVRVCSDCHTTKTPL 196

Query: 335 WRSGPRGPKSLCNACGIRQRKXXXXXXXXXXXXXXXXXXXXXETTVXXXXXXXXXXXXXX 514
           WRSGPRGPKSLCNACGIRQRK                      T V              
Sbjct: 197 WRSGPRGPKSLCNACGIRQRKARRAMAAAAAANG---------TIVMAAQKSVKGKEKKK 247

Query: 515 GEFASSTYMNKKKRKLVVGSKPISDNSQSRMKLGFEDLRLSLSKNLALQQVFPQDEKEAA 694
            +   +    KKKRKL   S P     QSR K  FEDL L LSKN+A +QVFPQDEKEAA
Sbjct: 248 SKTECAPPKMKKKRKLQSKSSP-----QSRNKFTFEDLTLRLSKNVAFKQVFPQDEKEAA 302

Query: 695 ILLMALSYGLVHG 733
           ILLMALSYGLVHG
Sbjct: 303 ILLMALSYGLVHG 315


>gb|OIV89729.1| hypothetical protein TanjilG_03518 [Lupinus angustifolius]
          Length = 316

 Score =  195 bits (495), Expect = 3e-56
 Identities = 125/253 (49%), Positives = 139/253 (54%), Gaps = 10/253 (3%)
 Frame = +2

Query: 5   KIIPSSRS-WDHSAE-ENKESSSKRKLVVWKKDRNENHEAAAE-------PEGGTXXXXX 157
           KIIPSS S WDHSA  EN ++    K+ VW++DR EN +A AE       P         
Sbjct: 78  KIIPSSESSWDHSAAAENHDNIIGSKVTVWEEDRGENLQADAEDGSMKWMPSKMRIMRKM 137

Query: 158 XXXXXXXXXXXVSDHQHKFQDQKQPLSPLGTVTSGSNNNNYS-NHIVRVCSDCHTTKTPL 334
                            KF+ QKQPLSPLGT  S +N  N+S N+ VRVCSDCHTTKTPL
Sbjct: 138 MASDQTKGSYVAGSSMKKFEHQKQPLSPLGTDNSSNNYPNHSTNNTVRVCSDCHTTKTPL 197

Query: 335 WRSGPRGPKSLCNACGIRQRKXXXXXXXXXXXXXXXXXXXXXETTVXXXXXXXXXXXXXX 514
           WRSGPRGPKSLCNACGIRQRK                      T V              
Sbjct: 198 WRSGPRGPKSLCNACGIRQRKARRAMAAAAAANG---------TIVMAAQKSVKGKEKKK 248

Query: 515 GEFASSTYMNKKKRKLVVGSKPISDNSQSRMKLGFEDLRLSLSKNLALQQVFPQDEKEAA 694
            +   +    KKKRKL   S P     QSR K  FEDL L LSKN+A +QVFPQDEKEAA
Sbjct: 249 SKTECAPPKMKKKRKLQSKSSP-----QSRNKFTFEDLTLRLSKNVAFKQVFPQDEKEAA 303

Query: 695 ILLMALSYGLVHG 733
           ILLMALSYGLVHG
Sbjct: 304 ILLMALSYGLVHG 316


>ref|XP_014507425.1| GATA transcription factor 21 isoform X1 [Vigna radiata var.
           radiata]
          Length = 306

 Score =  194 bits (492), Expect = 5e-56
 Identities = 126/255 (49%), Positives = 145/255 (56%), Gaps = 11/255 (4%)
 Frame = +2

Query: 2   EKIIPSSRSWDHSAEENKESSSKRKLVVWK-KDRNENHEAAAEPEG-----GTXXXXXXX 163
           EKI P+  SWDHS  +     S+ K+ V K K+R+E+H AAAE                 
Sbjct: 70  EKINPTMGSWDHSVAQ-----SELKVTVCKQKERSEDHVAAAEDGSVKLMPSKMRMMQKM 124

Query: 164 XXXXXXXXXVSDHQ-HKFQDQKQPLSPLGTVTSGSNN-NNYSNHIVRVCSDCHTTKTPLW 337
                    + D   HKF+D+KQPLSPLGT  S SNN +N+SN+ VRVC+DCHTTKTPLW
Sbjct: 125 MGPDQTGAYIEDSTVHKFEDEKQPLSPLGTDNSSSNNCSNHSNNTVRVCADCHTTKTPLW 184

Query: 338 RSGPRGPKSLCNACGIRQRKXXXXXXXXXXXXXXXXXXXXXETTVXXXXXXXXXXXXXXG 517
           RSGPRGPKSLCNACGIRQRK                      T +               
Sbjct: 185 RSGPRGPKSLCNACGIRQRKARRAMAAAASGNG---------TVILKTEKSVKGNKLQKK 235

Query: 518 EFASSTYM---NKKKRKLVVGSKPISDNSQSRMKLGFEDLRLSLSKNLALQQVFPQDEKE 688
           E    T +    KKKRK  VG+KP    SQSR K GFEDL L L K+LA+ QVFPQDEKE
Sbjct: 236 EKKVRTQVAPQMKKKRKHGVGAKP----SQSRNKFGFEDLTLRLRKSLAMHQVFPQDEKE 291

Query: 689 AAILLMALSYGLVHG 733
           AAILLMALSYGLV G
Sbjct: 292 AAILLMALSYGLVQG 306


>ref|XP_014507426.1| GATA transcription factor 21 isoform X2 [Vigna radiata var.
           radiata]
          Length = 231

 Score =  187 bits (476), Expect = 1e-54
 Identities = 122/247 (49%), Positives = 140/247 (56%), Gaps = 11/247 (4%)
 Frame = +2

Query: 26  SWDHSAEENKESSSKRKLVVWK-KDRNENHEAAAEPEG-----GTXXXXXXXXXXXXXXX 187
           SWDHS  +     S+ K+ V K K+R+E+H AAAE                         
Sbjct: 3   SWDHSVAQ-----SELKVTVCKQKERSEDHVAAAEDGSVKLMPSKMRMMQKMMGPDQTGA 57

Query: 188 XVSDHQ-HKFQDQKQPLSPLGTVTSGSNN-NNYSNHIVRVCSDCHTTKTPLWRSGPRGPK 361
            + D   HKF+D+KQPLSPLGT  S SNN +N+SN+ VRVC+DCHTTKTPLWRSGPRGPK
Sbjct: 58  YIEDSTVHKFEDEKQPLSPLGTDNSSSNNCSNHSNNTVRVCADCHTTKTPLWRSGPRGPK 117

Query: 362 SLCNACGIRQRKXXXXXXXXXXXXXXXXXXXXXETTVXXXXXXXXXXXXXXGEFASSTYM 541
           SLCNACGIRQRK                      T +               E    T +
Sbjct: 118 SLCNACGIRQRKARRAMAAAASGNG---------TVILKTEKSVKGNKLQKKEKKVRTQV 168

Query: 542 ---NKKKRKLVVGSKPISDNSQSRMKLGFEDLRLSLSKNLALQQVFPQDEKEAAILLMAL 712
               KKKRK  VG+KP    SQSR K GFEDL L L K+LA+ QVFPQDEKEAAILLMAL
Sbjct: 169 APQMKKKRKHGVGAKP----SQSRNKFGFEDLTLRLRKSLAMHQVFPQDEKEAAILLMAL 224

Query: 713 SYGLVHG 733
           SYGLV G
Sbjct: 225 SYGLVQG 231


>gb|KHN06609.1| GATA transcription factor 21 [Glycine soja]
          Length = 314

 Score =  186 bits (472), Expect = 6e-53
 Identities = 125/252 (49%), Positives = 141/252 (55%), Gaps = 9/252 (3%)
 Frame = +2

Query: 2   EKIIPSSRSWDHSAEENKESSSKRKLVVWKKD-RNEN--HEAAAEPEGGTXXXXXXXXXX 172
           EKIIP+S SW HS EE     S+ K+ VW+K+ RNEN   + + +               
Sbjct: 75  EKIIPTSGSWGHSVEE-----SEHKVTVWRKEERNENLAEDGSVKWMPSKMRIMRKMLVS 129

Query: 173 XXXXXXVSDHQ--HKFQDQKQPLS-PLGTVTSGSNN--NNYSNHIVRVCSDCHTTKTPLW 337
                  SD+   HKF D KQ LS PLG   + SNN  +  +N IVRVCSDCHTTKTPLW
Sbjct: 130 NQTDAYTSDNNTTHKFDDHKQQLSSPLGIDDNSSNNYSDKSNNSIVRVCSDCHTTKTPLW 189

Query: 338 RSGPRGPKSLCNACGIRQRK-XXXXXXXXXXXXXXXXXXXXXETTVXXXXXXXXXXXXXX 514
           RSGPRGPKSLCNACGIRQRK                      E +V              
Sbjct: 190 RSGPRGPKSLCNACGIRQRKARRAMAAAAAAALGDGAVIVEAEKSVKGKKLQKKKEKKTR 249

Query: 515 GEFASSTYMNKKKRKLVVGSKPISDNSQSRMKLGFEDLRLSLSKNLALQQVFPQDEKEAA 694
            E A+     K KRKL VG+K     SQSR K GFEDL L L KNLA+ QVFPQDEKEAA
Sbjct: 250 IEGAAQM---KMKRKLGVGAKA----SQSRNKFGFEDLTLRLRKNLAMHQVFPQDEKEAA 302

Query: 695 ILLMALSYGLVH 730
           ILLMALSYGLVH
Sbjct: 303 ILLMALSYGLVH 314


>ref|XP_003543725.1| PREDICTED: GATA transcription factor 21-like [Glycine max]
 gb|KRH19153.1| hypothetical protein GLYMA_13G103900 [Glycine max]
          Length = 314

 Score =  186 bits (472), Expect = 6e-53
 Identities = 125/252 (49%), Positives = 141/252 (55%), Gaps = 9/252 (3%)
 Frame = +2

Query: 2   EKIIPSSRSWDHSAEENKESSSKRKLVVWKKD-RNEN--HEAAAEPEGGTXXXXXXXXXX 172
           EKIIP+S SW HS EE     S+ K+ VW+K+ RNEN   + + +               
Sbjct: 75  EKIIPTSGSWGHSVEE-----SEHKVTVWRKEERNENLAEDGSVKWMPSKMRIMRKMLVS 129

Query: 173 XXXXXXVSDHQ--HKFQDQKQPLS-PLGTVTSGSNN--NNYSNHIVRVCSDCHTTKTPLW 337
                  SD+   HKF D KQ LS PLG   + SNN  +  +N IVRVCSDCHTTKTPLW
Sbjct: 130 NQTDAYTSDNNTTHKFDDHKQQLSSPLGIDDNSSNNYSDKSNNSIVRVCSDCHTTKTPLW 189

Query: 338 RSGPRGPKSLCNACGIRQRK-XXXXXXXXXXXXXXXXXXXXXETTVXXXXXXXXXXXXXX 514
           RSGPRGPKSLCNACGIRQRK                      E +V              
Sbjct: 190 RSGPRGPKSLCNACGIRQRKARRAMAAAAAAALGDGAVIVEAEKSVKGKKLQKKKEKKTR 249

Query: 515 GEFASSTYMNKKKRKLVVGSKPISDNSQSRMKLGFEDLRLSLSKNLALQQVFPQDEKEAA 694
            E A+     K KRKL VG+K     SQSR K GFEDL L L KNLA+ QVFPQDEKEAA
Sbjct: 250 IEGAAQM---KMKRKLGVGAKA----SQSRNKFGFEDLTLRLRKNLAMHQVFPQDEKEAA 302

Query: 695 ILLMALSYGLVH 730
           ILLMALSYGLVH
Sbjct: 303 ILLMALSYGLVH 314


>ref|XP_013458498.1| GATA type zinc finger transcription factor family protein [Medicago
           truncatula]
 gb|KEH32529.1| GATA type zinc finger transcription factor family protein [Medicago
           truncatula]
          Length = 327

 Score =  184 bits (467), Expect = 5e-52
 Identities = 127/263 (48%), Positives = 147/263 (55%), Gaps = 19/263 (7%)
 Frame = +2

Query: 2   EKI-IPSSRSWDHSAEENKES-SSKRKLVV-WKKDR-----NENHEAAAEPEGGTXXXXX 157
           EKI IPSS SW+ S  EN E+  +K KL + WKK++     N N EA    + GT     
Sbjct: 76  EKINIPSSGSWNSSTAENHENYKTKHKLTIRWKKEQISDEMNNNQEA---DQDGTSVKWM 132

Query: 158 XXXXXXXXXXXVSDH-----------QHKFQDQKQPLSPLGTVTSGSNNNNYSNHIVRVC 304
                      VSD            Q KF+DQKQPLSP GT    S++NNYS   +RVC
Sbjct: 133 SSKMRIMKKMMVSDQTGSSNLTSNSKQIKFEDQKQPLSPQGT--DNSSSNNYST--IRVC 188

Query: 305 SDCHTTKTPLWRSGPRGPKSLCNACGIRQRKXXXXXXXXXXXXXXXXXXXXXETTVXXXX 484
           SDC+TTKTPLWRSGPRGPKSLCNACGIRQRK                       +V    
Sbjct: 189 SDCNTTKTPLWRSGPRGPKSLCNACGIRQRK-ARRALAAAAASANGTTIADQTASVKRKK 247

Query: 485 XXXXXXXXXXGEFASSTYMNKKKRKLVVGSKPISDNSQSRMKLGFEDLRLSLSKNLALQQ 664
                      EF  ST   KKK KL   +KP S  S+    + FEDL+LSLS+NL +QQ
Sbjct: 248 LQKKKENKSKIEFDCSTVHMKKKHKL--EAKPPSHQSRKEF-ITFEDLKLSLSENLGVQQ 304

Query: 665 VFPQDEKEAAILLMALSYGLVHG 733
           VFPQDE+EAAILLMALSYGLVHG
Sbjct: 305 VFPQDEREAAILLMALSYGLVHG 327


>dbj|GAU34770.1| hypothetical protein TSUD_205740 [Trifolium subterraneum]
          Length = 254

 Score =  172 bits (437), Expect = 2e-48
 Identities = 121/264 (45%), Positives = 146/264 (55%), Gaps = 25/264 (9%)
 Frame = +2

Query: 17  SSRSWDHSA--EENKESSSKRKLVV-WKKDR---NENHEAAAEPE----GGTXXXXXXXX 166
           SS SWD+++  E ++   SK KL + WKK+    N N EAA         GT        
Sbjct: 3   SSGSWDNNSTGENHEIIKSKHKLTIRWKKEEIINNNNIEAADHHHHHHHDGTSVKWMSSK 62

Query: 167 XXXXXXXXVSDH------------QHKFQDQKQPLSPLGTVTSGSNNNNYSNHIVRVCSD 310
                   VSD             + KF+DQKQPLSPLG+    S+ NNYSN I RVCSD
Sbjct: 63  MRMMRKMIVSDQTSGGSSNIASNSKQKFEDQKQPLSPLGS----SSTNNYSNQI-RVCSD 117

Query: 311 CHTTKTPLWRSGPRGPKSLCNACGIRQRKXXXXXXXXXXXXXXXXXXXXXETTVXXXXXX 490
           C+TTKTPLWRSGPRGPKSLCNACGIRQRK                      ++V      
Sbjct: 118 CNTTKTPLWRSGPRGPKSLCNACGIRQRKARRALALAAASANGTTVTADQTSSVKRKKLQ 177

Query: 491 XXXXXXXXGEFASSTYMNKKKRKLVVGSKPISDNSQSRMK--LGFEDLRLSLSKNLALQQ 664
                    + ++ST++ KK       +K  S+ SQ   K  + FEDLRLSLSKNL++QQ
Sbjct: 178 TKKENKSKIDCSTSTHLKKK-------TKFESEPSQISKKELITFEDLRLSLSKNLSVQQ 230

Query: 665 VFPQDEK-EAAILLMALSYGLVHG 733
           VFPQDE+ EAAILLMALSYGLVHG
Sbjct: 231 VFPQDEREEAAILLMALSYGLVHG 254


>gb|PNX96606.1| GATA transcription factor 22-like protein [Trifolium pratense]
          Length = 263

 Score =  166 bits (420), Expect = 8e-46
 Identities = 115/257 (44%), Positives = 139/257 (54%), Gaps = 18/257 (7%)
 Frame = +2

Query: 17  SSRSWDHSA--EENKESSSKRKLVV-WKKD--RNENHEAAAEPEGGTXXXXXXXXXXXXX 181
           SS SWD+++  E ++   SK KL + WKK+   N   E     + GT             
Sbjct: 3   SSGSWDNNSTGENHEIIKSKHKLTIRWKKEGINNNIEEVNHHHDDGTSVKWMSSKMRMMR 62

Query: 182 XXXVSDH------------QHKFQDQKQPLSPLGTVTSGSNNNNYSNHIVRVCSDCHTTK 325
               SD             + KF+DQKQPLSPLG     S+ NNYSN I RVCSDC+TTK
Sbjct: 63  KMIDSDQTSGGSSNIASNSKQKFEDQKQPLSPLG-----SSTNNYSNQI-RVCSDCNTTK 116

Query: 326 TPLWRSGPRGPKSLCNACGIRQRKXXXXXXXXXXXXXXXXXXXXXETTVXXXXXXXXXXX 505
           TPLWRSGPRGPKSLCNACGIRQRK                       +V           
Sbjct: 117 TPLWRSGPRGPKSLCNACGIRQRKARRAMALAAASANGTTVTADQTCSVKRKKLQKKKEN 176

Query: 506 XXXGEFASSTYMNKKKRKLVVGSKPISDNSQSRMKLGFEDLRLSLSKNLALQQVFPQDEK 685
               ++  ST++ KK +     S+P   +   +  + FEDLRLSLSKNL++QQVFPQDEK
Sbjct: 177 KSKIDYC-STHLKKKTK---FESEP--SHQTKKEFITFEDLRLSLSKNLSVQQVFPQDEK 230

Query: 686 -EAAILLMALSYGLVHG 733
            EAAILLMALSYGLVHG
Sbjct: 231 EEAAILLMALSYGLVHG 247


>gb|KHN35841.1| Putative GATA transcription factor 22 [Glycine soja]
          Length = 325

 Score =  162 bits (411), Expect = 9e-44
 Identities = 110/261 (42%), Positives = 134/261 (51%), Gaps = 17/261 (6%)
 Frame = +2

Query: 2   EKIIPSSRSWDHSAEENKESSSKRKLVVWKK-DRNENHEAAAEPEGGTXXXXXXXXXXXX 178
           +KI+PSS SW+H   E  E+ S  KL VWKK D+ EN +                     
Sbjct: 71  QKIVPSSESWEHPVSEKDENRSDLKLRVWKKEDKCENFQVE-----DNSTKWMPLKMRMM 125

Query: 179 XXXXVSDH-------------QHKFQDQKQPLSPLGTVTSGSNNN--NYSNHIVRVCSDC 313
               VSD              Q K +++  PL+PLGT  S + N+  N+S   VRVCSDC
Sbjct: 126 RRMMVSDQTGFDTEGMISNSKQIKNEEKNPPLTPLGTDDSNNYNSSANHSKITVRVCSDC 185

Query: 314 HTTKTPLWRSGPRGPKSLCNACGIRQRKXXXXXXXXXXXXXXXXXXXXXETTVXXXXXXX 493
           HTTKTPLWRSGP+GPK+LCNACGIRQRK                                
Sbjct: 186 HTTKTPLWRSGPKGPKTLCNACGIRQRKARRAIAVAATANGMNPVEAEKSQV---KKGNK 242

Query: 494 XXXXXXXGEFASSTYMNKKKRKLVVGSKPISDNSQSRMKLG-FEDLRLSLSKNLALQQVF 670
                   +   + +M KKKRKL          ++ R + G FEDL + LSKNLALQ+VF
Sbjct: 243 LHSKGMKSKTKGAPHM-KKKRKL---------GAKYRKRFGAFEDLTVRLSKNLALQKVF 292

Query: 671 PQDEKEAAILLMALSYGLVHG 733
           P DEKEAAILLMALSYGL+HG
Sbjct: 293 PPDEKEAAILLMALSYGLLHG 313


>gb|POF08592.1| putative gata transcription factor 22 [Quercus suber]
          Length = 248

 Score =  158 bits (399), Expect = 7e-43
 Identities = 103/241 (42%), Positives = 125/241 (51%), Gaps = 5/241 (2%)
 Frame = +2

Query: 26  SWDHSAEENKESSSKRKLVVWKK-----DRNENHEAAAEPEGGTXXXXXXXXXXXXXXXX 190
           S DH + +N ES S+ K   W K     D++E       P                    
Sbjct: 22  SCDHISLKN-ESESENKFSFWNKESKIEDQSETFSVKWMPSKMRMMRKMINSEQTGHADI 80

Query: 191 VSDHQHKFQDQKQPLSPLGTVTSGSNNNNYSNHIVRVCSDCHTTKTPLWRSGPRGPKSLC 370
             +   KF+DQKQP++P  T  S SNN+  +N IVRVC+DC+TTKTPLWRSGPRGPKSLC
Sbjct: 81  PLNSMKKFEDQKQPMAPAKTDNS-SNNSFNNNPIVRVCADCNTTKTPLWRSGPRGPKSLC 139

Query: 371 NACGIRQRKXXXXXXXXXXXXXXXXXXXXXETTVXXXXXXXXXXXXXXGEFASSTYMNKK 550
           NACGIRQRK                      +                 + AS +Y+ K 
Sbjct: 140 NACGIRQRKARRAMAAAAAAANGTILATNPPS-------MKSTKVQHKDKRASKSYVPKF 192

Query: 551 KRKLVVGSKPISDNSQSRMKLGFEDLRLSLSKNLALQQVFPQDEKEAAILLMALSYGLVH 730
           K+K       ++     R K+ FED  +SLSKN A QQVFPQDEKEAAILLMALSYGLVH
Sbjct: 193 KKKC-----KLNTPDHGRKKVCFEDFTISLSKNSAFQQVFPQDEKEAAILLMALSYGLVH 247

Query: 731 G 733
           G
Sbjct: 248 G 248


>ref|XP_020224491.1| GATA transcription factor 21-like [Cajanus cajan]
 ref|XP_020224492.1| GATA transcription factor 21-like [Cajanus cajan]
 gb|KYP59312.1| Putative GATA transcription factor 20 [Cajanus cajan]
          Length = 299

 Score =  159 bits (401), Expect = 1e-42
 Identities = 108/245 (44%), Positives = 130/245 (53%), Gaps = 4/245 (1%)
 Frame = +2

Query: 2   EKIIPSSRSWDHSAEENKESSSKRKLVVWKK-DRNEN--HEAAAEPEGGTXXXXXXXXXX 172
           EKI PS  SWDH  E+N E+ S  K  VWKK DR EN   E ++     +          
Sbjct: 56  EKIDPSGGSWDHPIEKNDENRSDLKQRVWKKKDRCENLQGEDSSRKWMPSKIRMMRKMMV 115

Query: 173 XXXXXXVSDHQHKFQDQKQPLSPLGTVTSG-SNNNNYSNHIVRVCSDCHTTKTPLWRSGP 349
                  +  Q K +++  PLSP G      S+++N+SN  VRVC+DCHTT+TPLWR+GP
Sbjct: 116 SDIKSVSNSKQIKCEEKNSPLSPQGPDNINYSSSSNHSNITVRVCADCHTTETPLWRTGP 175

Query: 350 RGPKSLCNACGIRQRKXXXXXXXXXXXXXXXXXXXXXETTVXXXXXXXXXXXXXXGEFAS 529
            GPKSLCNACGIRQRK                     ++ V               E A 
Sbjct: 176 NGPKSLCNACGIRQRK-ARRAIAAAASANGTSLVEPDKSQVKKGKKLHKKRMKSKAECAP 234

Query: 530 STYMNKKKRKLVVGSKPISDNSQSRMKLGFEDLRLSLSKNLALQQVFPQDEKEAAILLMA 709
                KKKRKL        D  + R +  +EDL +SLSKNL LQQVFPQDEKEAAILLMA
Sbjct: 235 QL---KKKRKL-------GDKYRKRFE-NYEDLTISLSKNLDLQQVFPQDEKEAAILLMA 283

Query: 710 LSYGL 724
           LSYGL
Sbjct: 284 LSYGL 288


>ref|XP_015954445.1| GATA transcription factor 21 [Arachis duranensis]
          Length = 358

 Score =  157 bits (398), Expect = 2e-41
 Identities = 121/294 (41%), Positives = 149/294 (50%), Gaps = 50/294 (17%)
 Frame = +2

Query: 2   EKI-IPSSR-SWDH--------------SAEENKESSSKRKLVVWKKD-RNENH----EA 118
           EKI +PSS  SWDH                + NK+SS   KL + KK+ RNENH    +A
Sbjct: 61  EKIHVPSSGGSWDHIHDHRKKEEKEEEEEKDGNKKSSKLLKLKILKKEERNENHHLDNQA 120

Query: 119 AAEPEGGTXXXXXXXXXXXXXXXXVSDHQHKFQD----QKQPLSPLGTVTSGSN--NNNY 280
             + E                    ++ + +F +    Q+ PLSPLGT  S SN  NNN 
Sbjct: 121 HHDEEDHGSVKWMSSKMRIMGGSDTNNFRLRFDEEGPKQQAPLSPLGTDNSSSNSSNNNS 180

Query: 281 S--------NHIVRVCSDCHTTKTPLWRSGPRGPKSLCNACGIRQRKXXXXXXXXXXXXX 436
           S        N IVRVCSDCHTTKTPLWRSGPRGPKSLCNACGIRQRK             
Sbjct: 181 SSNRHENNNNMIVRVCSDCHTTKTPLWRSGPRGPKSLCNACGIRQRK---ARRAAAVAAA 237

Query: 437 XXXXXXXXETTVXXXXXXXXXXXXXXGEF-----------ASSTYMNKKKRKLVVGSKPI 583
                   +TT+               +            A  +  N+ K+K  +G+   
Sbjct: 238 AEVAASENDTTLMASTDDDDGMKKKEKKLHKHNNKDKKLKAKCSAPNQLKKKHKIGTNNN 297

Query: 584 SDNS----QSRMKLGFEDLRLSLSKNLALQQVFPQDEKEAAILLMALSYGLVHG 733
           ++ +    + R K+GFEDL +SLSKNLAL  VFP DEKEAAILLMALSYGL+HG
Sbjct: 298 NNTNKLSHRGRKKVGFEDLTISLSKNLAL-NVFPHDEKEAAILLMALSYGLLHG 350


>dbj|GAU34769.1| hypothetical protein TSUD_205730 [Trifolium subterraneum]
          Length = 286

 Score =  155 bits (392), Expect = 2e-41
 Identities = 95/174 (54%), Positives = 111/174 (63%), Gaps = 3/174 (1%)
 Frame = +2

Query: 218 DQKQPLSPLGTVTSGSNNNNYSNHIVRVCSDCHTTKTPLWRSGPRGPKSLCNACGIRQRK 397
           DQKQPLSPLG+    S+ NNYSN I RVCSDC+TTKTPLWRSGPRGPKSLCNACGIRQRK
Sbjct: 81  DQKQPLSPLGS----SSTNNYSNQI-RVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRK 135

Query: 398 XXXXXXXXXXXXXXXXXXXXXETTVXXXXXXXXXXXXXXGEFASSTYMNKKKRKLVVGSK 577
                                 ++V               + ++ST++ KK       +K
Sbjct: 136 ARRALALAAASANGTTVTADQTSSVKRKKLQTKKENKSKIDCSTSTHLKKK-------TK 188

Query: 578 PISDNSQSRMK--LGFEDLRLSLSKNLALQQVFPQDEK-EAAILLMALSYGLVH 730
             S+ SQ   K  + FEDLRLSLSKNL++QQVFPQDE+ EAAILLMALSYGLVH
Sbjct: 189 FESEPSQISKKELITFEDLRLSLSKNLSVQQVFPQDEREEAAILLMALSYGLVH 242

