BLASTX nr result

ID: Astragalus22_contig00009941

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00009941
         (1263 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_013467175.1| GATA type zinc finger transcription factor f...   288   1e-92
ref|NP_001242460.2| GATA transcription factor 1-like [Glycine ma...   266   8e-84
gb|KRH33499.1| hypothetical protein GLYMA_10G126900 [Glycine max]     266   8e-84
gb|KHN48447.1| GATA transcription factor 1 [Glycine soja]             265   2e-83
ref|XP_007145298.1| hypothetical protein PHAVU_007G227300g [Phas...   263   1e-82
ref|XP_019450003.1| PREDICTED: GATA transcription factor 1-like ...   259   2e-81
ref|XP_019414036.1| PREDICTED: GATA transcription factor 1-like ...   259   3e-81
gb|PNY15886.1| GATA transcription factor 1-like protein, partial...   257   4e-80
gb|KYP34580.1| GATA transcription factor 1 [Cajanus cajan]            252   1e-78
ref|XP_022640410.1| GATA transcription factor 1 [Vigna radiata v...   248   1e-76
gb|KOM26490.1| hypothetical protein LR48_Vigan277s001000 [Vigna ...   247   2e-76
ref|XP_020993399.1| GATA transcription factor 1-like [Arachis du...   246   8e-76
dbj|GAU30487.1| hypothetical protein TSUD_18670 [Trifolium subte...   224   2e-68
ref|XP_003610840.1| GATA type zinc finger transcription factor f...   218   3e-64
ref|XP_012092669.1| GATA transcription factor 1 [Jatropha curcas...   199   7e-58
ref|XP_004497744.1| PREDICTED: LOW QUALITY PROTEIN: GATA transcr...   194   1e-56
ref|XP_021607544.1| GATA transcription factor 1-like isoform X1 ...   196   1e-56
ref|XP_021676729.1| GATA transcription factor 1-like [Hevea bras...   196   2e-56
ref|XP_018845952.1| PREDICTED: LOW QUALITY PROTEIN: GATA transcr...   195   3e-56
ref|XP_002303808.2| hypothetical protein POPTR_0003s17340g [Popu...   195   4e-56

>ref|XP_013467175.1| GATA type zinc finger transcription factor family protein [Medicago
           truncatula]
 gb|KEH41211.1| GATA type zinc finger transcription factor family protein [Medicago
           truncatula]
          Length = 244

 Score =  288 bits (737), Expect = 1e-92
 Identities = 154/242 (63%), Positives = 173/242 (71%), Gaps = 18/242 (7%)
 Frame = +3

Query: 69  MEALGSVDDLLXXXXXXXXXXXXXXKPMKAFPSLKPKCSDPPSLNPL----------DEF 218
           MEALGSVDDLL              KP KAFPSLKP+CSDPPSLNPL          +E 
Sbjct: 1   MEALGSVDDLLDFSSDIGEDDDDD-KPKKAFPSLKPECSDPPSLNPLALDDPINSLSEEV 59

Query: 219 AEEELEWLSNKDAFPAVETFVELTSIQPIISQHQRTATVPMLEYXXXXXXXXXXXXXXXX 398
           AEEELEWLSNKDAFPAVETFV+L+ IQP + +HQ T+  PMLE                 
Sbjct: 60  AEEELEWLSNKDAFPAVETFVDLSCIQPDLLKHQMTS--PMLENSTSSSNSNNSSNSITL 117

Query: 399 XXXX---KVPVRARSKSRSRPRTGLADASSQKFSWRQPSSKISKEVI-----IGRKCHHC 554
                  K PVRARSKSRS+PR GLADAS+ +F W+QPS+K SKE +     IGRKCHHC
Sbjct: 118 LSGYNHMKFPVRARSKSRSKPRLGLADASNLQFPWKQPSTKTSKEKVKQTPTIGRKCHHC 177

Query: 555 GAEKTPQWRAGPHGPKTLCNACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIVEMRK 734
           G + TPQWRAGP+GPKTLCNACGVR+KSGRLVPEYRPA+SPTFRSD+HSNSHRK+VEMRK
Sbjct: 178 GVDDTPQWRAGPNGPKTLCNACGVRYKSGRLVPEYRPANSPTFRSDVHSNSHRKVVEMRK 237

Query: 735 QK 740
           QK
Sbjct: 238 QK 239


>ref|NP_001242460.2| GATA transcription factor 1-like [Glycine max]
 gb|KRH33500.1| hypothetical protein GLYMA_10G126900 [Glycine max]
 gb|KRH33501.1| hypothetical protein GLYMA_10G126900 [Glycine max]
          Length = 245

 Score =  266 bits (679), Expect = 8e-84
 Identities = 148/243 (60%), Positives = 163/243 (67%), Gaps = 18/243 (7%)
 Frame = +3

Query: 69  MEALGSVDDLLXXXXXXXXXXXXXXKPMKAFPSLKPKCSDPPSLNPL---------DEFA 221
           ME +GSVDDLL              KP KA PSL  KC+ P   NPL          EFA
Sbjct: 1   METIGSVDDLLDFSSDIGEEDDYDDKPRKACPSLNSKCAGPSLFNPLVQVDPNHSFSEFA 60

Query: 222 EEELEWLSNKDAFPAVETFVELTSIQPIISQHQRTATVPMLEYXXXXXXXXXXXXXXXXX 401
           EEELEWLSNKDAFP+VETFV+L+SIQP  +++Q++A  P+LE                  
Sbjct: 61  EEELEWLSNKDAFPSVETFVDLSSIQPGTTKNQKSA--PVLECSTGSSNSNNSTNSISLL 118

Query: 402 XXX---KVPVRARSKSRSRPRTGLADASSQKFSWRQPSSKISKE------VIIGRKCHHC 554
                 KVPVRARSKSRSR R GLA+ SSQ+  WRQPS+  SK         IGRKC HC
Sbjct: 119 NSCDHLKVPVRARSKSRSRHRPGLAENSSQQVWWRQPSNGTSKADEGMKISSIGRKCQHC 178

Query: 555 GAEKTPQWRAGPHGPKTLCNACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIVEMRK 734
           GAEKTPQWRAGP GPKTLCNACGVRFKSGRLVPEYRPASSPTF SDLHSNSHRKIVEMR+
Sbjct: 179 GAEKTPQWRAGPSGPKTLCNACGVRFKSGRLVPEYRPASSPTFHSDLHSNSHRKIVEMRR 238

Query: 735 QKQ 743
           QKQ
Sbjct: 239 QKQ 241


>gb|KRH33499.1| hypothetical protein GLYMA_10G126900 [Glycine max]
          Length = 256

 Score =  266 bits (680), Expect = 8e-84
 Identities = 149/247 (60%), Positives = 165/247 (66%), Gaps = 18/247 (7%)
 Frame = +3

Query: 57  SKAGMEALGSVDDLLXXXXXXXXXXXXXXKPMKAFPSLKPKCSDPPSLNPL--------- 209
           S + ME +GSVDDLL              KP KA PSL  KC+ P   NPL         
Sbjct: 8   SLSRMETIGSVDDLLDFSSDIGEEDDYDDKPRKACPSLNSKCAGPSLFNPLVQVDPNHSF 67

Query: 210 DEFAEEELEWLSNKDAFPAVETFVELTSIQPIISQHQRTATVPMLEYXXXXXXXXXXXXX 389
            EFAEEELEWLSNKDAFP+VETFV+L+SIQP  +++Q++A  P+LE              
Sbjct: 68  SEFAEEELEWLSNKDAFPSVETFVDLSSIQPGTTKNQKSA--PVLECSTGSSNSNNSTNS 125

Query: 390 XXXXXXX---KVPVRARSKSRSRPRTGLADASSQKFSWRQPSSKISKE------VIIGRK 542
                     KVPVRARSKSRSR R GLA+ SSQ+  WRQPS+  SK         IGRK
Sbjct: 126 ISLLNSCDHLKVPVRARSKSRSRHRPGLAENSSQQVWWRQPSNGTSKADEGMKISSIGRK 185

Query: 543 CHHCGAEKTPQWRAGPHGPKTLCNACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIV 722
           C HCGAEKTPQWRAGP GPKTLCNACGVRFKSGRLVPEYRPASSPTF SDLHSNSHRKIV
Sbjct: 186 CQHCGAEKTPQWRAGPSGPKTLCNACGVRFKSGRLVPEYRPASSPTFHSDLHSNSHRKIV 245

Query: 723 EMRKQKQ 743
           EMR+QKQ
Sbjct: 246 EMRRQKQ 252


>gb|KHN48447.1| GATA transcription factor 1 [Glycine soja]
          Length = 245

 Score =  265 bits (676), Expect = 2e-83
 Identities = 148/243 (60%), Positives = 162/243 (66%), Gaps = 18/243 (7%)
 Frame = +3

Query: 69  MEALGSVDDLLXXXXXXXXXXXXXXKPMKAFPSLKPKCSDPPSLNPL---------DEFA 221
           ME +GSVDDLL              KP KA PSL  KC+ P   NPL          EFA
Sbjct: 1   METIGSVDDLLDFSSDIGEEDDYDDKPRKACPSLNSKCAGPSLFNPLVQVDPNHSFSEFA 60

Query: 222 EEELEWLSNKDAFPAVETFVELTSIQPIISQHQRTATVPMLEYXXXXXXXXXXXXXXXXX 401
           EEELEWLSNKDAFP+VETFV+L+SIQP   ++Q++A  P+LE                  
Sbjct: 61  EEELEWLSNKDAFPSVETFVDLSSIQPGTIKNQKSA--PVLECSTGSSNSNNSTNSISLL 118

Query: 402 XXX---KVPVRARSKSRSRPRTGLADASSQKFSWRQPSSKISKE------VIIGRKCHHC 554
                 KVPVRARSKSRSR R GLA+ SSQ+  WRQPS+  SK         IGRKC HC
Sbjct: 119 NSCDHLKVPVRARSKSRSRHRPGLAENSSQQVWWRQPSNGTSKADEGMKISSIGRKCQHC 178

Query: 555 GAEKTPQWRAGPHGPKTLCNACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIVEMRK 734
           GAEKTPQWRAGP GPKTLCNACGVRFKSGRLVPEYRPASSPTF SDLHSNSHRKIVEMR+
Sbjct: 179 GAEKTPQWRAGPSGPKTLCNACGVRFKSGRLVPEYRPASSPTFHSDLHSNSHRKIVEMRR 238

Query: 735 QKQ 743
           QKQ
Sbjct: 239 QKQ 241


>ref|XP_007145298.1| hypothetical protein PHAVU_007G227300g [Phaseolus vulgaris]
 ref|XP_007145299.1| hypothetical protein PHAVU_007G227300g [Phaseolus vulgaris]
 gb|ESW17292.1| hypothetical protein PHAVU_007G227300g [Phaseolus vulgaris]
 gb|ESW17293.1| hypothetical protein PHAVU_007G227300g [Phaseolus vulgaris]
          Length = 250

 Score =  263 bits (672), Expect = 1e-82
 Identities = 147/246 (59%), Positives = 163/246 (66%), Gaps = 21/246 (8%)
 Frame = +3

Query: 69  MEALGSVDDLLXXXXXXXXXXXXXXKPMKAFPSLKPKCSDPPSLNPL---------DEFA 221
           MEA+GSVDDLL              KP K  PSL  KC +P   NPL          EF 
Sbjct: 1   MEAIGSVDDLLDFSLDIGEEDDDEDKPRKPCPSLNSKCGNPSLFNPLVPDDPNHSYSEFV 60

Query: 222 EEELEWLSNKDAFPAVETFVELTSIQPIISQHQRT--ATVPMLEYXXXXXXXXXXXXXXX 395
           EEELEWLSNKDAFP+VETFV+L+ IQP  ++ ++T  AT PMLEY               
Sbjct: 61  EEELEWLSNKDAFPSVETFVDLSCIQPDTAKMRKTTPATTPMLEYSSGSSNSNNSSNSIS 120

Query: 396 XXXXX---KVPVRARSKSRSRPRTGLADASS-QKFSWRQPSSKISKEVI------IGRKC 545
                   KVPVRARSK RSR R G+AD +S Q+F WRQPS++ SK         IGRKC
Sbjct: 121 LLNSCDHLKVPVRARSKRRSRCRPGIADENSGQQFWWRQPSNETSKAEEGMKISPIGRKC 180

Query: 546 HHCGAEKTPQWRAGPHGPKTLCNACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIVE 725
            HCGAEKTPQWRAGP GPKTLCNACGVRFKSGRLVPEYRPASSP+FRSDLHSNSHRKI E
Sbjct: 181 QHCGAEKTPQWRAGPLGPKTLCNACGVRFKSGRLVPEYRPASSPSFRSDLHSNSHRKITE 240

Query: 726 MRKQKQ 743
           MR+QKQ
Sbjct: 241 MRRQKQ 246


>ref|XP_019450003.1| PREDICTED: GATA transcription factor 1-like [Lupinus angustifolius]
 ref|XP_019450004.1| PREDICTED: GATA transcription factor 1-like [Lupinus angustifolius]
 gb|OIW07684.1| hypothetical protein TanjilG_07726 [Lupinus angustifolius]
          Length = 243

 Score =  259 bits (663), Expect = 2e-81
 Identities = 146/239 (61%), Positives = 160/239 (66%), Gaps = 14/239 (5%)
 Frame = +3

Query: 69  MEALGSVDDLLXXXXXXXXXXXXXXKPMKAFPSLKPKCSDPPSLNPLD---------EFA 221
           MEA+G VDDLL              K  KAF  L PKCSDP SL PLD         EFA
Sbjct: 1   MEAIGFVDDLLDFSLGMGEEDDDEDKNRKAFLELNPKCSDPASLCPLDMGDPSPPFSEFA 60

Query: 222 EEELEWLSNKDAFPAVETFVELTSIQPIISQHQRTATVPMLEYXXXXXXXXXXXXXXXXX 401
           EEELEWLSNKDAFPAVETFV++TSIQP +S+HQ  + +                      
Sbjct: 61  EEELEWLSNKDAFPAVETFVDITSIQPNLSKHQTGSMLEHSTSSSNSNNSTNSISLLAGY 120

Query: 402 XXXKVPVRARSKSRSRPRTGLADASSQKFSWRQPSSKISK-EVI----IGRKCHHCGAEK 566
              KVPVRARSKSRSR   G +  S+Q    RQPS + +K EVI    IGRKC HCGAEK
Sbjct: 121 DNLKVPVRARSKSRSRRLPGNSGISAQHSWTRQPSKENAKAEVITIPTIGRKCLHCGAEK 180

Query: 567 TPQWRAGPHGPKTLCNACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIVEMRKQKQ 743
           TPQWRAGP GPKTLCNACGVRFKSGRLVPEYRPASSP+FRSDLHSNSHRK++EMRKQKQ
Sbjct: 181 TPQWRAGPLGPKTLCNACGVRFKSGRLVPEYRPASSPSFRSDLHSNSHRKVMEMRKQKQ 239


>ref|XP_019414036.1| PREDICTED: GATA transcription factor 1-like [Lupinus angustifolius]
 gb|OIV98677.1| hypothetical protein TanjilG_23969 [Lupinus angustifolius]
          Length = 245

 Score =  259 bits (662), Expect = 3e-81
 Identities = 148/244 (60%), Positives = 166/244 (68%), Gaps = 19/244 (7%)
 Frame = +3

Query: 69  MEALGSVDDLLXXXXXXXXXXXXXXKPMKAFPSLKPKCSDPPSLNPLD---------EFA 221
           MEA+GSVD+LL              K  KAFP L  KCSDPPSL+PLD         EFA
Sbjct: 1   MEAIGSVDELLDFSLDVGEVDDDDDKNRKAFPKLDLKCSDPPSLSPLDLGDPSPPFSEFA 60

Query: 222 EEELEWLSNKDAFPAVETFVELTSIQPIISQHQRTATVPMLEYXXXXXXXXXXXXXXXXX 401
           EEELEWLSNKDAFP VETFV+L SIQP +S+H+   T  MLEY                 
Sbjct: 61  EEELEWLSNKDAFPEVETFVDLPSIQPNLSKHE---TGSMLEYSTSSSNSNNSPNSISLL 117

Query: 402 XXX---KVPVRARSKSRSRPRTGLADA--SSQKFSWRQPSSKISK-EVI----IGRKCHH 551
                  VPVR RSKSRSR R   +++  SSQ+  WRQP ++ +K EVI    IGRKC H
Sbjct: 118 SGYDNLNVPVRPRSKSRSRSRHLASNSGISSQQSWWRQPINESAKLEVITMSTIGRKCQH 177

Query: 552 CGAEKTPQWRAGPHGPKTLCNACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIVEMR 731
           CGAEKTPQWRAGP GPKTLCNACGVR+KSGRLVPEYRPASSP+FRSDLHSNSHRK++EMR
Sbjct: 178 CGAEKTPQWRAGPLGPKTLCNACGVRYKSGRLVPEYRPASSPSFRSDLHSNSHRKVMEMR 237

Query: 732 KQKQ 743
           KQKQ
Sbjct: 238 KQKQ 241


>gb|PNY15886.1| GATA transcription factor 1-like protein, partial [Trifolium
           pratense]
          Length = 266

 Score =  257 bits (656), Expect = 4e-80
 Identities = 139/243 (57%), Positives = 165/243 (67%), Gaps = 18/243 (7%)
 Frame = +3

Query: 66  GMEALGSVDDLLXXXXXXXXXXXXXXKPMKAFPSLKPKCSDPPSLNPLD---------EF 218
           GM+ L  VDDLL              K  K+ PSLKPKCSDPPSL+PL          E+
Sbjct: 27  GMDGLSIVDDLLDFSSDIGEDDDDD-KSKKSVPSLKPKCSDPPSLSPLGLDDANHSFPEY 85

Query: 219 AEEELEWLSNKDAFPAVETFVELTSIQPIISQHQRTATVPMLEYXXXXXXXXXXXXXXXX 398
           AEEELEWLSNKDAFPAVETFV+++ IQP +S++Q+T   P LE                 
Sbjct: 86  AEEELEWLSNKDAFPAVETFVDISCIQPDMSKYQKTT--PTLENSTSSSNNSNNSSNSIT 143

Query: 399 XXXX----KVPVRARSKSRSRPRTGLADASSQKFSWRQPSSKISKEVI-----IGRKCHH 551
                   K PVRARSKSRS+PR    D  +Q+F W+QPS+KIS+E +     I RKCHH
Sbjct: 144 LLSGYNQMKFPVRARSKSRSKPRL---DTLNQQFPWKQPSTKISREQVRPTSNIERKCHH 200

Query: 552 CGAEKTPQWRAGPHGPKTLCNACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIVEMR 731
           CGA+ TPQWRAGP+GPKTLCNACGVR+KSGRLVPEYRPA+SPTFR D+HSNSHRK++EMR
Sbjct: 201 CGADNTPQWRAGPNGPKTLCNACGVRYKSGRLVPEYRPANSPTFRRDVHSNSHRKVLEMR 260

Query: 732 KQK 740
           +QK
Sbjct: 261 RQK 263


>gb|KYP34580.1| GATA transcription factor 1 [Cajanus cajan]
          Length = 245

 Score =  252 bits (644), Expect = 1e-78
 Identities = 146/242 (60%), Positives = 160/242 (66%), Gaps = 18/242 (7%)
 Frame = +3

Query: 69  MEALGSVDDLLXXXXXXXXXXXXXXKPMKAFPSLKPKCSDPPSLNPLD---------EFA 221
           ME + SVDDLL              K  KA PSL  KC DP   N LD         EFA
Sbjct: 1   METIDSVDDLLEFASDIGQEDDDDEKSRKACPSLNSKCGDPSFFNSLDLDDLNQSLSEFA 60

Query: 222 EEELEWLSNKDAFPAVETFVELTSIQPIISQHQRTATVPMLEYXXXXXXXXXXXXXXXXX 401
           EE+LEWLSNKDAFPAVETFV+L+SIQP  +++Q+TA  P+LE                  
Sbjct: 61  EEDLEWLSNKDAFPAVETFVDLSSIQPDTTKNQKTA--PVLENSTSSSNSNNSSNSISLL 118

Query: 402 XXX---KVPVRARSKSRSRPRTGLADASSQKFSWRQPSSKISK--EVI----IGRKCHHC 554
                 KVPVRARSK+R+R R G AD SSQ     QP ++ISK  E I    IGRKC HC
Sbjct: 119 NSCDHLKVPVRARSKTRNRRRPGNADNSSQTVWGGQPINEISKAEEGIQISPIGRKCQHC 178

Query: 555 GAEKTPQWRAGPHGPKTLCNACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIVEMRK 734
           GAEKTPQWRAGP GPKTLCNACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKI+EMRK
Sbjct: 179 GAEKTPQWRAGPLGPKTLCNACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIIEMRK 238

Query: 735 QK 740
           QK
Sbjct: 239 QK 240


>ref|XP_022640410.1| GATA transcription factor 1 [Vigna radiata var. radiata]
 ref|XP_022640411.1| GATA transcription factor 1 [Vigna radiata var. radiata]
 ref|XP_022640412.1| GATA transcription factor 1 [Vigna radiata var. radiata]
          Length = 250

 Score =  248 bits (632), Expect = 1e-76
 Identities = 141/246 (57%), Positives = 160/246 (65%), Gaps = 21/246 (8%)
 Frame = +3

Query: 69  MEALGSVDDLLXXXXXXXXXXXXXXKPMKAFPSLKPKCSDPPSLNPL---------DEFA 221
           ME +GSVDDLL              K  K+ PSL  KC +P   N L          EF 
Sbjct: 1   METIGSVDDLLDFSLDIGEEDDDENKHRKSCPSLNSKCGNPSLFNSLVPDDPNHSYSEFV 60

Query: 222 EEELEWLSNKDAFPAVETFVELTSIQPIISQHQRTA--TVPMLEYXXXXXXXXXXXXXXX 395
           EEELEWLSNKDAFP+VETFV+L+ IQP  ++ +++   T P+LE                
Sbjct: 61  EEELEWLSNKDAFPSVETFVDLSCIQPDTAKIKKSTPVTTPVLEDSTGSSNSNNSSNSIS 120

Query: 396 XXXXX---KVPVRARSKSRSRPRTGLADASS-QKFSWRQPSSKISKEVI------IGRKC 545
                   KVPVRARSK RSR R G+AD +S Q+  WRQPS++ISK         IGRKC
Sbjct: 121 LLNSCDHLKVPVRARSKRRSRCRPGIADENSGQQVWWRQPSNEISKAEEGMKISPIGRKC 180

Query: 546 HHCGAEKTPQWRAGPHGPKTLCNACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIVE 725
            HCGAEKTPQWRAGP GPKTLCNACGVRFKSGRLVPEYRPASSP+FRSDLHSNSHRKIVE
Sbjct: 181 QHCGAEKTPQWRAGPLGPKTLCNACGVRFKSGRLVPEYRPASSPSFRSDLHSNSHRKIVE 240

Query: 726 MRKQKQ 743
           MR+QKQ
Sbjct: 241 MRRQKQ 246


>gb|KOM26490.1| hypothetical protein LR48_Vigan277s001000 [Vigna angularis]
 dbj|BAT96054.1| hypothetical protein VIGAN_08292600 [Vigna angularis var.
           angularis]
          Length = 250

 Score =  247 bits (630), Expect = 2e-76
 Identities = 141/246 (57%), Positives = 160/246 (65%), Gaps = 21/246 (8%)
 Frame = +3

Query: 69  MEALGSVDDLLXXXXXXXXXXXXXXKPMKAFPSLKPKCSDPPSLNPL---------DEFA 221
           ME +GSVDDLL              K  K+ PSL  KC +P   N L          EF 
Sbjct: 1   METIGSVDDLLDFSLDIGEEDDDEDKHRKSCPSLNSKCGNPSLFNSLVPDDPNHSYSEFV 60

Query: 222 EEELEWLSNKDAFPAVETFVELTSIQPIISQHQRTATV--PMLEYXXXXXXXXXXXXXXX 395
           EEELEWLSNKDAFP+VETFV+L+ IQP  ++ +++  V  P+LE                
Sbjct: 61  EEELEWLSNKDAFPSVETFVDLSCIQPDTAKIKKSTPVTSPVLEDSTGSSNSNNSSNSIS 120

Query: 396 XXXXX---KVPVRARSKSRSRPRTGLADASS-QKFSWRQPSSKISKEVI------IGRKC 545
                   KVPVRARSK RSR R G+AD +S Q+  WRQPS++ISK         IGR+C
Sbjct: 121 LLNSCDHLKVPVRARSKRRSRCRPGIADENSGQQVWWRQPSNEISKAEEGMKISPIGRQC 180

Query: 546 HHCGAEKTPQWRAGPHGPKTLCNACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIVE 725
            HCGAEKTPQWRAGP GPKTLCNACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIVE
Sbjct: 181 QHCGAEKTPQWRAGPLGPKTLCNACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIVE 240

Query: 726 MRKQKQ 743
           MR+QKQ
Sbjct: 241 MRRQKQ 246


>ref|XP_020993399.1| GATA transcription factor 1-like [Arachis duranensis]
          Length = 268

 Score =  246 bits (628), Expect = 8e-76
 Identities = 141/248 (56%), Positives = 160/248 (64%), Gaps = 15/248 (6%)
 Frame = +3

Query: 45  KYTTSKAGMEALGSVDDLLXXXXXXXXXXXXXX-KPMKAFPSLKPKCSDPPSLNPL---- 209
           K    + GMEALG+VDDLL               +  K FP   P+C  P S  PL    
Sbjct: 19  KLKAFELGMEALGTVDDLLDFSSDVGEDNDVVVDRCRKGFPC-NPECKQP-SFTPLAMDD 76

Query: 210 -----DEFAEEELEWLSNKDAFPAVETFVELTSIQPIISQHQRTATVPMLEYXXXXXXXX 374
                 EFAEEELEWLSNKDAFPAVETFV++ SI+P +S+HQ TA+V             
Sbjct: 77  PNYSFSEFAEEELEWLSNKDAFPAVETFVDIPSIRPNMSKHQGTASVLEYRRSIPNNNCT 136

Query: 375 XXXXXXXXXXXXKVPVRARSKSRSRPRTGLADASSQKFSWRQPSSKIS-KEVI----IGR 539
                       KVPVRARSK RSRPR  +AD SS +  WR  S +IS  EVI    IGR
Sbjct: 137 NNITLLNGFDHLKVPVRARSKYRSRPRLAIADVSSHQSWWRLSSREISGAEVIKIPTIGR 196

Query: 540 KCHHCGAEKTPQWRAGPHGPKTLCNACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKI 719
           KC HCG+E+TPQWR+GP GPKTLCNACGVRFKSGRLVPEYRPA+SPTFR +LHSNSHRKI
Sbjct: 197 KCQHCGSEETPQWRSGPLGPKTLCNACGVRFKSGRLVPEYRPATSPTFRHELHSNSHRKI 256

Query: 720 VEMRKQKQ 743
           +EMRKQKQ
Sbjct: 257 IEMRKQKQ 264


>dbj|GAU30487.1| hypothetical protein TSUD_18670 [Trifolium subterraneum]
          Length = 200

 Score =  224 bits (572), Expect = 2e-68
 Identities = 116/193 (60%), Positives = 138/193 (71%), Gaps = 10/193 (5%)
 Frame = +3

Query: 192 PSLNPLDEFAEEELEWLSNKDAFPAVETFVELTSIQPIISQHQRTATVPMLEYXXXXXXX 371
           P +    E+AEEELEWLSNKDAFPAVETFV+++ IQ  +S++Q+T   P LE        
Sbjct: 9   PVVFMFQEYAEEELEWLSNKDAFPAVETFVDISCIQTDMSKYQKTT--PTLENSTSSSNN 66

Query: 372 XXXXXXXXXXXXX----KVPVRARSKSRSRPRTGLADASSQKFSWRQPSSKISKEVI--- 530
                            K PVRARSKSRS+PR  L D  +Q+F W+QPS+KIS+E +   
Sbjct: 67  SNNSSNSITLLSGYNQMKFPVRARSKSRSKPR--LVDTLNQQFPWKQPSNKISREQVRQT 124

Query: 531 ---IGRKCHHCGAEKTPQWRAGPHGPKTLCNACGVRFKSGRLVPEYRPASSPTFRSDLHS 701
               GRKCHHCGA+ TPQWRAGP GPKTLCNACGVRFKSGRLVPEYRPA+SPTFR D+HS
Sbjct: 125 SNNTGRKCHHCGADSTPQWRAGPDGPKTLCNACGVRFKSGRLVPEYRPANSPTFRRDVHS 184

Query: 702 NSHRKIVEMRKQK 740
           NSHRK++EMR+QK
Sbjct: 185 NSHRKVLEMRRQK 197


>ref|XP_003610840.1| GATA type zinc finger transcription factor family protein [Medicago
           truncatula]
 gb|AES93798.1| GATA type zinc finger transcription factor family protein [Medicago
           truncatula]
          Length = 331

 Score =  218 bits (556), Expect = 3e-64
 Identities = 120/228 (52%), Positives = 149/228 (65%), Gaps = 8/228 (3%)
 Frame = +3

Query: 69  MEALGSVDDLLXXXXXXXXXXXXXXKPMKAFPSLKPKCSDPPSLNPLDEFAEEELEWLSN 248
           MEAL SVDDL               K  KAFPS+    ++    +   EFA E+LEWLSN
Sbjct: 1   MEALDSVDDL--WGFLSDIGEDDYDKSRKAFPSVDLDDTN----HSFSEFAVEDLEWLSN 54

Query: 249 KDAFPAVETFVELTSIQPIISQHQRTATVPMLEYXXXXXXXXXXXXXXXXXXXX---KVP 419
           KDAFPAVETFV+ + IQP ISQ+Q+ A  P++E                        K P
Sbjct: 55  KDAFPAVETFVDFSCIQPDISQNQKIA--PIVENSTSSSNSNNSSNSITLLSGYNHVKFP 112

Query: 420 VRARSKSRSRPRTGLADASSQKFSWRQPSSKISKEVI-----IGRKCHHCGAEKTPQWRA 584
           VRARSKSRS+PR G++D  + +F+W+QP++K SKE       IGR+CHHCGA+ TP WR 
Sbjct: 113 VRARSKSRSKPRLGISDTWNHQFAWKQPNNKTSKEQAKQTSTIGRQCHHCGADNTPLWRT 172

Query: 585 GPHGPKTLCNACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIVEM 728
           GP GPKTLCNACGVR++SGRLVPEYRPA SPTF +++HSNSHRK+VE+
Sbjct: 173 GPGGPKTLCNACGVRYRSGRLVPEYRPAKSPTFCNNVHSNSHRKVVEI 220



 Score =  159 bits (402), Expect = 2e-41
 Identities = 70/105 (66%), Positives = 88/105 (83%), Gaps = 5/105 (4%)
 Frame = +3

Query: 444 SRPRTGLADASSQKFSWRQPSSKISKEV-----IIGRKCHHCGAEKTPQWRAGPHGPKTL 608
           S+P  G++D  +++F+W+QPS+  SKE       IGRKCHHCGA+ TPQWR GP GPKTL
Sbjct: 223 SKPHLGISDTWNRQFTWKQPSNNTSKEQSKKTSTIGRKCHHCGADNTPQWRVGPDGPKTL 282

Query: 609 CNACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIVEMRKQKQ 743
           CNACGVR++SGRLVPEYRPA+SPTF S++HSNSHRK+VE+RKQK+
Sbjct: 283 CNACGVRYRSGRLVPEYRPANSPTFCSNVHSNSHRKVVEIRKQKR 327


>ref|XP_012092669.1| GATA transcription factor 1 [Jatropha curcas]
 gb|KDP20343.1| hypothetical protein JCGZ_06429 [Jatropha curcas]
          Length = 260

 Score =  199 bits (507), Expect = 7e-58
 Identities = 116/220 (52%), Positives = 134/220 (60%), Gaps = 20/220 (9%)
 Frame = +3

Query: 144 KPMKAFPSLKPKCSDPP----------SLNPLDEFAEEELEWLSNKDAFPAVETFVELTS 293
           KP KA P+L P    P           S +PL EFAEEELEWLSNKDAFPAVETFV++ S
Sbjct: 32  KPRKALPTLNPNGLHPAPFDVLDHPDDSTHPLPEFAEEELEWLSNKDAFPAVETFVDIIS 91

Query: 294 IQPIISQHQRTATVPMLE------YXXXXXXXXXXXXXXXXXXXXKVPVRARSKSRSRPR 455
             P     QR + V +LE                           +VPV+ARSK   R R
Sbjct: 92  ENPGSLPKQR-SPVSVLENSTTSSTSISGNSSTNGSVIMNYCRSLQVPVKARSKHHRRRR 150

Query: 456 TGLADASSQKFSWRQPSSKISKEVI----IGRKCHHCGAEKTPQWRAGPHGPKTLCNACG 623
               D  + +  W Q + K  +  +    +GRKC HCGAEKTPQWRAGP GPKTLCNACG
Sbjct: 151 ---RDLQAHQCWWNQENLKKVRPPVTSSTMGRKCQHCGAEKTPQWRAGPLGPKTLCNACG 207

Query: 624 VRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIVEMRKQKQ 743
           VRFKSGRLVPEYRPASSP+F S +HSNSHRK++EMRKQKQ
Sbjct: 208 VRFKSGRLVPEYRPASSPSFCSKMHSNSHRKVLEMRKQKQ 247


>ref|XP_004497744.1| PREDICTED: LOW QUALITY PROTEIN: GATA transcription factor 1 [Cicer
           arietinum]
          Length = 194

 Score =  194 bits (493), Expect = 1e-56
 Identities = 111/187 (59%), Positives = 121/187 (64%), Gaps = 17/187 (9%)
 Frame = +3

Query: 69  MEALGSVDDLLXXXXXXXXXXXXXXKPMKAFPSLKPKCSDPPSLNPLD---------EFA 221
           MEALGSVDDLL              KP KAFPSLKPKCSDP SLNPLD         EF 
Sbjct: 1   MEALGSVDDLLDFSSDIGEDVDD--KPRKAFPSLKPKCSDPSSLNPLDLSDPNHSFSEFV 58

Query: 222 EEELEWLSNKDAFPAVETFVELTSIQPIISQHQRTATVPMLEY---XXXXXXXXXXXXXX 392
           EEELEWLSNKDAFP+VETFV+L SIQP IS++QR  T PMLEY                 
Sbjct: 59  EEELEWLSNKDAFPSVETFVDLPSIQPFISKNQR--TTPMLEYSTSSSNSNNSTNSISLL 116

Query: 393 XXXXXXKVPVRARSKSRSRPRTGLADASSQKFSWRQPSSKISKE-----VIIGRKCHHCG 557
                 K PVRARSKSRSRPR G+A+ S+Q+FSWRQP +KISK+       IGRKCHHCG
Sbjct: 117 SGYDHMKFPVRARSKSRSRPRIGIAETSNQQFSWRQPCNKISKDQGMQISTIGRKCHHCG 176

Query: 558 AEKTPQW 578
           AE TPQW
Sbjct: 177 AESTPQW 183


>ref|XP_021607544.1| GATA transcription factor 1-like isoform X1 [Manihot esculenta]
 ref|XP_021607545.1| GATA transcription factor 1-like isoform X2 [Manihot esculenta]
 gb|OAY55441.1| hypothetical protein MANES_03G154500 [Manihot esculenta]
 gb|OAY55443.1| hypothetical protein MANES_03G154500 [Manihot esculenta]
          Length = 261

 Score =  196 bits (499), Expect = 1e-56
 Identities = 119/222 (53%), Positives = 136/222 (61%), Gaps = 23/222 (10%)
 Frame = +3

Query: 144 KPMKAFPSLKPKCS------------DPPSLNPLDEFAEEELEWLSNKDAFPAVETFVEL 287
           KP KAFP L P  +            D P  +P  EFAEEELEWLSNKDAFPAVETFV++
Sbjct: 33  KPTKAFPPLNPSPNGLAVAPLPFDVFDHPDPSP--EFAEEELEWLSNKDAFPAVETFVDI 90

Query: 288 TSIQPIISQHQRTATVPMLE------YXXXXXXXXXXXXXXXXXXXXKVPVRARSK-SRS 446
            S  P     QR + V +LE                           +VPV+ARSK  RS
Sbjct: 91  ISENPGGLPKQR-SPVSVLENSTTSSTSNSGNSGTNGSITMDYCWSLQVPVKARSKHHRS 149

Query: 447 RPRTGLADASSQKFSWRQPSSKISKEVI----IGRKCHHCGAEKTPQWRAGPHGPKTLCN 614
           R R    D   Q+  W   + +  K  +    +GRKC HCGAEKTPQWRAGP GPKTLCN
Sbjct: 150 RRR----DLQGQQCWWSLENLRKVKPAVTSSTMGRKCQHCGAEKTPQWRAGPLGPKTLCN 205

Query: 615 ACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIVEMRKQK 740
           ACGVR+KSGRLVPEYRPASSPTFRS+LHSNSHRK++EMRKQK
Sbjct: 206 ACGVRYKSGRLVPEYRPASSPTFRSELHSNSHRKVMEMRKQK 247


>ref|XP_021676729.1| GATA transcription factor 1-like [Hevea brasiliensis]
          Length = 264

 Score =  196 bits (498), Expect = 2e-56
 Identities = 117/224 (52%), Positives = 134/224 (59%), Gaps = 24/224 (10%)
 Frame = +3

Query: 144 KPMKAFPSLKPKCSD-----PP---------SLNPLDEFAEEELEWLSNKDAFPAVETFV 281
           KP  AFPSL P  +      PP         S  P  EFAEEELEWLSNKDAFPA+ETFV
Sbjct: 32  KPRNAFPSLNPSPNGLAVVPPPFDVFDHPDDSTRPSPEFAEEELEWLSNKDAFPALETFV 91

Query: 282 ELTSIQPIISQHQRTATVPMLEYXXXXXXXXXXXXXXXXXXXXK------VPVRARSKSR 443
           ++ S  P     QR + V +LE                            VPV+ARSK +
Sbjct: 92  DVLSEHPGSLPKQR-SPVSVLENSTTSSTSNSGNSGANGSVIMNYCRSPHVPVKARSKHQ 150

Query: 444 SRPRTGLADASSQKFSWRQPSSKISKEVI----IGRKCHHCGAEKTPQWRAGPHGPKTLC 611
            R R    D  +Q+  W   + K  K  +    +GRKC HCGAEKTPQWRAGP GPKTLC
Sbjct: 151 RRRR---RDLQAQQCWWSLENLKKLKPAVTSSTMGRKCQHCGAEKTPQWRAGPLGPKTLC 207

Query: 612 NACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIVEMRKQKQ 743
           NACGVR+KSGRLVPEYRPASSPTF S+ HSNSHRK++EMRKQKQ
Sbjct: 208 NACGVRYKSGRLVPEYRPASSPTFCSEWHSNSHRKVMEMRKQKQ 251


>ref|XP_018845952.1| PREDICTED: LOW QUALITY PROTEIN: GATA transcription factor 1
           [Juglans regia]
          Length = 261

 Score =  195 bits (496), Expect = 3e-56
 Identities = 113/234 (48%), Positives = 133/234 (56%), Gaps = 16/234 (6%)
 Frame = +3

Query: 87  VDDLLXXXXXXXXXXXXXXKPMKAFPSLKPKC----------SDPPSLNPLDEFAEEELE 236
           VDDLL              KP KA P L  +           SD P L   +E AEE+LE
Sbjct: 11  VDDLLDFASDIGEEDDDEDKPRKALPPLNRRGHGPLSFDLLHSDDPGLPSSEELAEEDLE 70

Query: 237 WLSNKDAFPAVETFVELTSIQP--IISQHQRTATVPMLEYXXXXXXXXXXXXXXXXXXXX 410
           W+SNKDAFPAVETF  + S  P  I   H   + +                         
Sbjct: 71  WISNKDAFPAVETFAGILSEHPGSISKHHSPVSLLESSTTSSLTNSTTNSSTLVRCCGSL 130

Query: 411 KVPVRARSKSRSRPRTGLADASSQKFSWRQPSSKISKEVI----IGRKCHHCGAEKTPQW 578
           K PVRARSK R + R  +       +S +Q ++K  K V     IGRKC HCG+EKTPQW
Sbjct: 131 KFPVRARSKCRQKRRRYMPCQLQLWWSRQQATTKNVKPVASTATIGRKCQHCGSEKTPQW 190

Query: 579 RAGPHGPKTLCNACGVRFKSGRLVPEYRPASSPTFRSDLHSNSHRKIVEMRKQK 740
           RAGP GPKTLCNACGVR+KSGRLVPEYRPASSP+F ++LHSNSHRKI+EMR+QK
Sbjct: 191 RAGPFGPKTLCNACGVRYKSGRLVPEYRPASSPSFSAELHSNSHRKILEMRRQK 244


>ref|XP_002303808.2| hypothetical protein POPTR_0003s17340g [Populus trichocarpa]
 gb|PNT46147.1| hypothetical protein POPTR_003G174800v3 [Populus trichocarpa]
          Length = 258

 Score =  195 bits (495), Expect = 4e-56
 Identities = 111/212 (52%), Positives = 130/212 (61%), Gaps = 12/212 (5%)
 Frame = +3

Query: 144 KPMKAFPSLKPKCSDPPSLNPLD-----EFAEEELEWLSNKDAFPAVETFVELTSIQP-I 305
           KP K  PSL P      S N L+     EFAEEELEWLSNKDAFPAVET   + S +P  
Sbjct: 37  KPRKGLPSLNPNALASASFNVLEHTLLPEFAEEELEWLSNKDAFPAVETCFGILSEEPGS 96

Query: 306 ISQHQRTATV--PMLEYXXXXXXXXXXXXXXXXXXXXKVPVRARSKSRSRPRTGLADA-- 473
           I +H    +V                           +VPV+ARSK R R    + +   
Sbjct: 97  IPKHHSPVSVLENSTTSSTSISGNSSNSSIIMSYCSLRVPVKARSKRRHRRPREIREQER 156

Query: 474 --SSQKFSWRQPSSKISKEVIIGRKCHHCGAEKTPQWRAGPHGPKTLCNACGVRFKSGRL 647
             S +  + R+P+  ++K   +GRKC HCG EKTPQWRAGP GPKTLCNACGVR+KSGRL
Sbjct: 157 WWSRENSTRRKPAVSVAK---MGRKCQHCGVEKTPQWRAGPDGPKTLCNACGVRYKSGRL 213

Query: 648 VPEYRPASSPTFRSDLHSNSHRKIVEMRKQKQ 743
           VPEYRPA+SPTF S LHSNSHRK+VEMRKQKQ
Sbjct: 214 VPEYRPANSPTFSSKLHSNSHRKVVEMRKQKQ 245

