BLASTX nr result

ID: Zanthoxylum22_contig00023967 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zanthoxylum22_contig00023967
         (1309 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006492137.1| PREDICTED: GATA transcription factor 1-like ...   358   6e-96
gb|KDO46287.1| hypothetical protein CISIN_1g024771mg [Citrus sin...   356   3e-95
ref|XP_006420528.1| hypothetical protein CICLE_v10005658mg [Citr...   354   1e-94
gb|KDO46288.1| hypothetical protein CISIN_1g024771mg [Citrus sin...   290   2e-75
ref|XP_012092669.1| PREDICTED: GATA transcription factor 1 [Jatr...   236   2e-59
ref|XP_007034503.1| GATA transcription factor 1, putative [Theob...   224   1e-55
ref|XP_002303808.2| hypothetical protein POPTR_0003s17340g [Popu...   222   5e-55
ref|XP_011020707.1| PREDICTED: GATA transcription factor 1 [Popu...   221   1e-54
ref|XP_002299291.2| hypothetical protein POPTR_0001s14130g [Popu...   221   1e-54
ref|XP_002518163.1| GATA transcription factor, putative [Ricinus...   216   3e-53
ref|XP_011002163.1| PREDICTED: GATA transcription factor 1-like ...   214   2e-52
gb|KHG24532.1| GATA transcription factor 1 -like protein [Gossyp...   207   2e-50
ref|XP_012484192.1| PREDICTED: GATA transcription factor 1-like ...   206   4e-50
gb|KJB34235.1| hypothetical protein B456_006G054800 [Gossypium r...   206   4e-50
ref|XP_012484193.1| PREDICTED: GATA transcription factor 1-like ...   206   4e-50
ref|XP_010264014.1| PREDICTED: GATA transcription factor 1 [Nelu...   198   1e-47
ref|XP_008460722.1| PREDICTED: GATA transcription factor 1 isofo...   191   1e-45
gb|KRH33500.1| hypothetical protein GLYMA_10G126900 [Glycine max...   191   2e-45
gb|KRH33499.1| hypothetical protein GLYMA_10G126900 [Glycine max]     191   2e-45
ref|XP_008460721.1| PREDICTED: GATA transcription factor 1 isofo...   191   2e-45

>ref|XP_006492137.1| PREDICTED: GATA transcription factor 1-like [Citrus sinensis]
          Length = 262

 Score =  358 bits (919), Expect = 6e-96
 Identities = 187/262 (71%), Positives = 200/262 (76%), Gaps = 4/262 (1%)
 Frame = -2

Query: 1185 MESLDLQGCCXXXXXXXXXXXDEYSKPNKRTKNALSSLNRNGRDFDVSEADDDPDRLFPE 1006
            MESLDLQ CC           DE  KPNKR +NALSS+NRNG DFDV EA DD DRLFPE
Sbjct: 1    MESLDLQVCCIDDLLDFNINDDECGKPNKRPRNALSSVNRNGCDFDVFEAGDDTDRLFPE 60

Query: 1005 CAEEELEWLSTFPTVETYLDISSNANISKQQSPVSVLEXXXXXXXXXXXXXXXXXXXXXX 826
            CAEEELEWLS FPTVET++DISSN NI KQQSP SVLE                      
Sbjct: 61   CAEEELEWLSNFPTVETFVDISSNPNILKQQSPNSVLENSNSSSSTSTNGSTITNGNNNS 120

Query: 825  XI--MNCFGSLRVPVRARSKRCTR-RRDLLYQEAWWG-VHENVKTVKPVISKVIIGRKCQ 658
                MNC G+LRVPVRARSK  TR RR+LL QEAWWG VH +VK  KPV+SKVIIGRKCQ
Sbjct: 121  NSIIMNCCGNLRVPVRARSKLRTRCRRELLNQEAWWGSVHGSVKAAKPVVSKVIIGRKCQ 180

Query: 657  HCGAEKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPACSPTFSSEFHSNSHRKVVEM 478
            HCGAEKTPQWRAGPMGPKTLCNACGVR+KSGRLVPEYRPA SPTFSSE HSNSHRKVVEM
Sbjct: 181  HCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRLVPEYRPANSPTFSSELHSNSHRKVVEM 240

Query: 477  RRKKQMLGIEIGTVAVKPVDKG 412
            RR+KQM+GIE+G + VKPVDKG
Sbjct: 241  RRQKQMMGIELGVLGVKPVDKG 262


>gb|KDO46287.1| hypothetical protein CISIN_1g024771mg [Citrus sinensis]
          Length = 262

 Score =  356 bits (913), Expect = 3e-95
 Identities = 186/262 (70%), Positives = 199/262 (75%), Gaps = 4/262 (1%)
 Frame = -2

Query: 1185 MESLDLQGCCXXXXXXXXXXXDEYSKPNKRTKNALSSLNRNGRDFDVSEADDDPDRLFPE 1006
            MESLDLQ CC           DE  KP KR +NALSS+NRNG DFDV EA DD DRLFPE
Sbjct: 1    MESLDLQVCCIDDLLDFNINDDECGKPTKRPRNALSSVNRNGCDFDVFEAGDDTDRLFPE 60

Query: 1005 CAEEELEWLSTFPTVETYLDISSNANISKQQSPVSVLEXXXXXXXXXXXXXXXXXXXXXX 826
            CAEEELEWLS FPTVET++DISSN NI KQQSP SVLE                      
Sbjct: 61   CAEEELEWLSNFPTVETFVDISSNPNILKQQSPNSVLENSNSSSSTSTNGSTITNGNNNS 120

Query: 825  XI--MNCFGSLRVPVRARSKRCTR-RRDLLYQEAWWG-VHENVKTVKPVISKVIIGRKCQ 658
                MNC G+LRVPVRARSK  TR RR+LL QEAWWG VH +VK  KPV+SKVIIGRKCQ
Sbjct: 121  NSIIMNCCGNLRVPVRARSKLRTRCRRELLNQEAWWGSVHGSVKAAKPVVSKVIIGRKCQ 180

Query: 657  HCGAEKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPACSPTFSSEFHSNSHRKVVEM 478
            HCGAEKTPQWRAGPMGPKTLCNACGVR+KSGRLVPEYRPA SPTFSSE HSNSHRKVVEM
Sbjct: 181  HCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRLVPEYRPANSPTFSSELHSNSHRKVVEM 240

Query: 477  RRKKQMLGIEIGTVAVKPVDKG 412
            RR+KQM+GIE+G + VKPVDKG
Sbjct: 241  RRQKQMMGIELGVLGVKPVDKG 262


>ref|XP_006420528.1| hypothetical protein CICLE_v10005658mg [Citrus clementina]
            gi|557522401|gb|ESR33768.1| hypothetical protein
            CICLE_v10005658mg [Citrus clementina]
          Length = 262

 Score =  354 bits (908), Expect = 1e-94
 Identities = 185/262 (70%), Positives = 198/262 (75%), Gaps = 4/262 (1%)
 Frame = -2

Query: 1185 MESLDLQGCCXXXXXXXXXXXDEYSKPNKRTKNALSSLNRNGRDFDVSEADDDPDRLFPE 1006
            MESLDLQ CC           DE  KP KR +NALSS+NRNG DFDV EA DD D LFPE
Sbjct: 1    MESLDLQVCCIDDLLDFNINDDECGKPTKRPRNALSSVNRNGCDFDVFEAGDDTDHLFPE 60

Query: 1005 CAEEELEWLSTFPTVETYLDISSNANISKQQSPVSVLEXXXXXXXXXXXXXXXXXXXXXX 826
            CAEEELEWLS FPTVET++DISSN NI KQQSP SVLE                      
Sbjct: 61   CAEEELEWLSNFPTVETFVDISSNPNILKQQSPNSVLENSNSSSSTSTNGSTITNGNNNS 120

Query: 825  XI--MNCFGSLRVPVRARSKRCTR-RRDLLYQEAWWG-VHENVKTVKPVISKVIIGRKCQ 658
                MNC G+LRVPVRARSK  TR RR+LL QEAWWG VH +VK  KPV+SKVIIGRKCQ
Sbjct: 121  NSIIMNCCGNLRVPVRARSKLRTRCRRELLNQEAWWGSVHGSVKAAKPVVSKVIIGRKCQ 180

Query: 657  HCGAEKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPACSPTFSSEFHSNSHRKVVEM 478
            HCGAEKTPQWRAGPMGPKTLCNACGVR+KSGRLVPEYRPA SPTFSSE HSNSHRKVVEM
Sbjct: 181  HCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRLVPEYRPANSPTFSSELHSNSHRKVVEM 240

Query: 477  RRKKQMLGIEIGTVAVKPVDKG 412
            RR+KQM+GIE+G + VKPVDKG
Sbjct: 241  RRQKQMMGIELGVLGVKPVDKG 262


>gb|KDO46288.1| hypothetical protein CISIN_1g024771mg [Citrus sinensis]
          Length = 238

 Score =  290 bits (741), Expect = 2e-75
 Identities = 149/203 (73%), Positives = 160/203 (78%), Gaps = 4/203 (1%)
 Frame = -2

Query: 1008 ECAEEELEWLSTFPTVETYLDISSNANISKQQSPVSVLEXXXXXXXXXXXXXXXXXXXXX 829
            ECAEEELEWLS FPTVET++DISSN NI KQQSP SVLE                     
Sbjct: 36   ECAEEELEWLSNFPTVETFVDISSNPNILKQQSPNSVLENSNSSSSTSTNGSTITNGNNN 95

Query: 828  XXI--MNCFGSLRVPVRARSKRCTR-RRDLLYQEAWWG-VHENVKTVKPVISKVIIGRKC 661
                 MNC G+LRVPVRARSK  TR RR+LL QEAWWG VH +VK  KPV+SKVIIGRKC
Sbjct: 96   SNSIIMNCCGNLRVPVRARSKLRTRCRRELLNQEAWWGSVHGSVKAAKPVVSKVIIGRKC 155

Query: 660  QHCGAEKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPACSPTFSSEFHSNSHRKVVE 481
            QHCGAEKTPQWRAGPMGPKTLCNACGVR+KSGRLVPEYRPA SPTFSSE HSNSHRKVVE
Sbjct: 156  QHCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRLVPEYRPANSPTFSSELHSNSHRKVVE 215

Query: 480  MRRKKQMLGIEIGTVAVKPVDKG 412
            MRR+KQM+GIE+G + VKPVDKG
Sbjct: 216  MRRQKQMMGIELGVLGVKPVDKG 238


>ref|XP_012092669.1| PREDICTED: GATA transcription factor 1 [Jatropha curcas]
            gi|643701029|gb|KDP20343.1| hypothetical protein
            JCGZ_06429 [Jatropha curcas]
          Length = 260

 Score =  236 bits (603), Expect = 2e-59
 Identities = 134/245 (54%), Positives = 162/245 (66%), Gaps = 9/245 (3%)
 Frame = -2

Query: 1119 EYSKPNKRTKNALSSLNRNG---RDFDVSEADDDPDRLFPECAEEELEWLST---FPTVE 958
            E++KP K    AL +LN NG     FDV +  DD     PE AEEELEWLS    FP VE
Sbjct: 29   EHNKPRK----ALPTLNPNGLHPAPFDVLDHPDDSTHPLPEFAEEELEWLSNKDAFPAVE 84

Query: 957  TYLDISSN--ANISKQQSPVSVLEXXXXXXXXXXXXXXXXXXXXXXXIMNCFGSLRVPVR 784
            T++DI S    ++ KQ+SPVSVLE                        MN   SL+VPV+
Sbjct: 85   TFVDIISENPGSLPKQRSPVSVLENSTTSSTSISGNSSTNGSVI----MNYCRSLQVPVK 140

Query: 783  ARSKRCTRRR-DLLYQEAWWGVHENVKTVKPVISKVIIGRKCQHCGAEKTPQWRAGPMGP 607
            ARSK   RRR DL   + WW   EN+K V+P ++   +GRKCQHCGAEKTPQWRAGP+GP
Sbjct: 141  ARSKHHRRRRRDLQAHQCWWN-QENLKKVRPPVTSSTMGRKCQHCGAEKTPQWRAGPLGP 199

Query: 606  KTLCNACGVRYKSGRLVPEYRPACSPTFSSEFHSNSHRKVVEMRRKKQMLGIEIGTVAVK 427
            KTLCNACGVR+KSGRLVPEYRPA SP+F S+ HSNSHRKV+EMR++KQM+G+    V VK
Sbjct: 200  KTLCNACGVRFKSGRLVPEYRPASSPSFCSKMHSNSHRKVLEMRKQKQMMGL----VVVK 255

Query: 426  PVDKG 412
            P++KG
Sbjct: 256  PMEKG 260


>ref|XP_007034503.1| GATA transcription factor 1, putative [Theobroma cacao]
            gi|508713532|gb|EOY05429.1| GATA transcription factor 1,
            putative [Theobroma cacao]
          Length = 243

 Score =  224 bits (571), Expect = 1e-55
 Identities = 131/247 (53%), Positives = 152/247 (61%), Gaps = 24/247 (9%)
 Frame = -2

Query: 1080 SSLNRNGRDF--DVSEADDDPD----------------RLFPECAEEELEWLST---FPT 964
            +S + N  DF  DV E D+D +                R FPE AEEELEW+S    FP+
Sbjct: 8    ASFDENLLDFGSDVGEEDEDEENNKSSKLNTSSSLNANRSFPEFAEEELEWISNKDAFPS 67

Query: 963  VETYLDISSNANISKQQSPVSVLEXXXXXXXXXXXXXXXXXXXXXXXIMNCFGSLRVPVR 784
            VET++DI   A  +K QSPVSVL+                        M C G+L+VPV+
Sbjct: 68   VETFVDILGTA--AKHQSPVSVLDNSNSSSNSSGSSTLTNGNIV----MYCCGNLKVPVK 121

Query: 783  ARSKRCTRRRDLLYQEAWWGVHENVKTVKPVISKV---IIGRKCQHCGAEKTPQWRAGPM 613
            ARSKR  + RDL  QE  W V ENVK     +       IGRKCQHCGAEKTPQWRAGP+
Sbjct: 122  ARSKRLRKCRDLRNQENSWWVQENVKNASAHVKGAGSRTIGRKCQHCGAEKTPQWRAGPL 181

Query: 612  GPKTLCNACGVRYKSGRLVPEYRPACSPTFSSEFHSNSHRKVVEMRRKKQMLGIEIGTVA 433
            GPKTLCNACGVRYKSGRLVPEYRPA SPTFS E HSNSHRK++EMRR+KQ      G  A
Sbjct: 182  GPKTLCNACGVRYKSGRLVPEYRPASSPTFSIELHSNSHRKILEMRRQKQ-----FGFSA 236

Query: 432  VKPVDKG 412
            +KP+DKG
Sbjct: 237  MKPMDKG 243


>ref|XP_002303808.2| hypothetical protein POPTR_0003s17340g [Populus trichocarpa]
            gi|550343381|gb|EEE78787.2| hypothetical protein
            POPTR_0003s17340g [Populus trichocarpa]
          Length = 258

 Score =  222 bits (566), Expect = 5e-55
 Identities = 133/245 (54%), Positives = 152/245 (62%), Gaps = 9/245 (3%)
 Frame = -2

Query: 1119 EYSKPNKRTKNALSSLNRNG---RDFDVSEADDDPDRLFPECAEEELEWLST---FPTVE 958
            E+   NK+ +  L SLN N      F+V E       L PE AEEELEWLS    FP VE
Sbjct: 30   EHQNNNKKPRKGLPSLNPNALASASFNVLE-----HTLLPEFAEEELEWLSNKDAFPAVE 84

Query: 957  TYLDISSN--ANISKQQSPVSVLEXXXXXXXXXXXXXXXXXXXXXXXIMNCFGSLRVPVR 784
            T   I S    +I K  SPVSVLE                        +  + SLRVPV+
Sbjct: 85   TCFGILSEEPGSIPKHHSPVSVLENSTTSSTSISGNSSNSSI------IMSYCSLRVPVK 138

Query: 783  ARSKRCTRR-RDLLYQEAWWGVHENVKTVKPVISKVIIGRKCQHCGAEKTPQWRAGPMGP 607
            ARSKR  RR R++  QE WW   EN    KP +S   +GRKCQHCG EKTPQWRAGP GP
Sbjct: 139  ARSKRRHRRPREIREQERWWS-RENSTRRKPAVSVAKMGRKCQHCGVEKTPQWRAGPDGP 197

Query: 606  KTLCNACGVRYKSGRLVPEYRPACSPTFSSEFHSNSHRKVVEMRRKKQMLGIEIGTVAVK 427
            KTLCNACGVRYKSGRLVPEYRPA SPTFSS+ HSNSHRKVVEMR++KQM+    G++ VK
Sbjct: 198  KTLCNACGVRYKSGRLVPEYRPANSPTFSSKLHSNSHRKVVEMRKQKQMM----GSLVVK 253

Query: 426  PVDKG 412
            P+DKG
Sbjct: 254  PMDKG 258


>ref|XP_011020707.1| PREDICTED: GATA transcription factor 1 [Populus euphratica]
          Length = 258

 Score =  221 bits (562), Expect = 1e-54
 Identities = 133/245 (54%), Positives = 151/245 (61%), Gaps = 9/245 (3%)
 Frame = -2

Query: 1119 EYSKPNKRTKNALSSLNRNG---RDFDVSEADDDPDRLFPECAEEELEWLST---FPTVE 958
            E+   NK+ +  L SLN N      F+V E       L PE AEEELEWLS    FP VE
Sbjct: 30   EHQSNNKKPRKGLPSLNPNALASTSFNVLE-----HALLPEFAEEELEWLSNKDAFPAVE 84

Query: 957  TYLDISSNA--NISKQQSPVSVLEXXXXXXXXXXXXXXXXXXXXXXXIMNCFGSLRVPVR 784
            T   I S    +I K  SPVSVLE                        +  + SLRVPV+
Sbjct: 85   TCFGIVSEEPDSIPKHHSPVSVLENSTTSSTSISGNSSNSSI------IMSYCSLRVPVK 138

Query: 783  ARSKRCTRR-RDLLYQEAWWGVHENVKTVKPVISKVIIGRKCQHCGAEKTPQWRAGPMGP 607
            ARSKR  RR R++  QE WW   EN    KP +S   +GRKCQHCG EKTPQWRAGP GP
Sbjct: 139  ARSKRRHRRPREIREQERWWS-RENSTRRKPAVSVAKMGRKCQHCGVEKTPQWRAGPDGP 197

Query: 606  KTLCNACGVRYKSGRLVPEYRPACSPTFSSEFHSNSHRKVVEMRRKKQMLGIEIGTVAVK 427
            KTLCNACGVRYKSGRLVPEYRPA SPTFSS+ HSNSHRKV+EMRR+KQM     G++ VK
Sbjct: 198  KTLCNACGVRYKSGRLVPEYRPANSPTFSSKLHSNSHRKVLEMRRQKQM----TGSLVVK 253

Query: 426  PVDKG 412
            P+DKG
Sbjct: 254  PMDKG 258


>ref|XP_002299291.2| hypothetical protein POPTR_0001s14130g [Populus trichocarpa]
            gi|550347223|gb|EEE84096.2| hypothetical protein
            POPTR_0001s14130g [Populus trichocarpa]
          Length = 308

 Score =  221 bits (562), Expect = 1e-54
 Identities = 132/245 (53%), Positives = 152/245 (62%), Gaps = 9/245 (3%)
 Frame = -2

Query: 1119 EYSKPNKRTKNALSSLNRNG---RDFDVSEADDDPDRLFPECAEEELEWLST---FPTVE 958
            E+ + +K+++ AL SLN N      F+V E       L PE AEEELEWLS    FPTVE
Sbjct: 80   EHQRNSKKSRRALPSLNPNALHPASFNVLEHS-----LLPEFAEEELEWLSNKDAFPTVE 134

Query: 957  TYLDISSN--ANISKQQSPVSVLEXXXXXXXXXXXXXXXXXXXXXXXIMNCFGSLRVPVR 784
            T     S    +I K  SPVSVLE                        +  +  LRVPV+
Sbjct: 135  TCFGSLSGEPGSIPKHHSPVSVLENSTTSSTSNSGNSSNSNI------IMSYCRLRVPVK 188

Query: 783  ARSKRCTRR-RDLLYQEAWWGVHENVKTVKPVISKVIIGRKCQHCGAEKTPQWRAGPMGP 607
            ARSKR  R  R++  QE WW   EN  T KP +S   +GRKCQHCG EKTPQWRAGP GP
Sbjct: 189  ARSKRHHRHPREIQEQECWWS-QENFITRKPAVSVAKLGRKCQHCGVEKTPQWRAGPDGP 247

Query: 606  KTLCNACGVRYKSGRLVPEYRPACSPTFSSEFHSNSHRKVVEMRRKKQMLGIEIGTVAVK 427
            KTLCNACGVRYKSGRLVPEYRPA SPTFSS+ HSNSHRKVVEMRR+KQM G+    +  K
Sbjct: 248  KTLCNACGVRYKSGRLVPEYRPANSPTFSSKLHSNSHRKVVEMRRQKQMTGL----LVAK 303

Query: 426  PVDKG 412
            P+DKG
Sbjct: 304  PMDKG 308


>ref|XP_002518163.1| GATA transcription factor, putative [Ricinus communis]
            gi|223542759|gb|EEF44296.1| GATA transcription factor,
            putative [Ricinus communis]
          Length = 205

 Score =  216 bits (551), Expect = 3e-53
 Identities = 124/208 (59%), Positives = 142/208 (68%), Gaps = 7/208 (3%)
 Frame = -2

Query: 1014 FPECAEEELEWLST---FPTVETYLDISSN--ANISKQQSPVSVLEXXXXXXXXXXXXXX 850
            + E AEEELEWLS    FP+VET++DI +    ++ K +SPVSVLE              
Sbjct: 7    YREFAEEELEWLSNKDAFPSVETFVDILTENPGSLQKHRSPVSVLENSTTSSTSNSGHSG 66

Query: 849  XXXXXXXXXIMNCFGSLRVPVRARSK-RCTRRRDLLYQEAWWGVHENVKTVKPV-ISKVI 676
                      MN   SL VPV+ARSK    RRRDL  Q+ WW   EN+K VK V  S   
Sbjct: 67   TNDSVI----MNYCRSLHVPVKARSKPHRRRRRDLGGQQCWWS-QENLKKVKVVKSSSST 121

Query: 675  IGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPACSPTFSSEFHSNSH 496
            IGRKCQHCGAEKTPQWRAGP+GPKTLCNACGVRYKSGRLVPEYRPA SPTFSS  HSNSH
Sbjct: 122  IGRKCQHCGAEKTPQWRAGPLGPKTLCNACGVRYKSGRLVPEYRPASSPTFSSVLHSNSH 181

Query: 495  RKVVEMRRKKQMLGIEIGTVAVKPVDKG 412
            RKV+EMRR+KQM+GI    + VKP++KG
Sbjct: 182  RKVLEMRRQKQMMGI----MVVKPMEKG 205


>ref|XP_011002163.1| PREDICTED: GATA transcription factor 1-like [Populus euphratica]
            gi|743935693|ref|XP_011012220.1| PREDICTED: GATA
            transcription factor 1-like [Populus euphratica]
          Length = 256

 Score =  214 bits (544), Expect = 2e-52
 Identities = 132/245 (53%), Positives = 153/245 (62%), Gaps = 9/245 (3%)
 Frame = -2

Query: 1119 EYSKPNKRTKNALSSLNRNG---RDFDVSEADDDPDRLFPECAEEELEWLS---TFPTVE 958
            E+ + +K+++ AL SLN N      F+V E       L PE AEE+LEWLS    FPTVE
Sbjct: 29   EHQRNSKKSRRALPSLNPNDLHPASFNVLEHS-----LLPEFAEEDLEWLSNKDAFPTVE 83

Query: 957  TYLDISSN--ANISKQQSPVSVLEXXXXXXXXXXXXXXXXXXXXXXXIMNCFGSLRVPVR 784
            T     S    +I K  SPVSVLE                       + +C   LRVPV+
Sbjct: 84   TCFGSLSGEPGSIPKHHSPVSVLE----NSTTSSTSNSGNSSNSNIIMSSC--RLRVPVK 137

Query: 783  ARSKRCTRR-RDLLYQEAWWGVHENVKTVKPVISKVIIGRKCQHCGAEKTPQWRAGPMGP 607
            ARSKR  R  R++  QE WW   EN  T KP  S   +GRKCQHCG EKTPQWRAGP GP
Sbjct: 138  ARSKRHHRHPREIQEQECWWS-QENF-TRKPAESVAKLGRKCQHCGVEKTPQWRAGPDGP 195

Query: 606  KTLCNACGVRYKSGRLVPEYRPACSPTFSSEFHSNSHRKVVEMRRKKQMLGIEIGTVAVK 427
            KTLCNACGVRYKSGRLVPEYRPA SPTFSS+ HSNSHRKVVEMRR+KQM+G+    +  K
Sbjct: 196  KTLCNACGVRYKSGRLVPEYRPANSPTFSSKLHSNSHRKVVEMRRQKQMMGL----LVAK 251

Query: 426  PVDKG 412
            P+DKG
Sbjct: 252  PMDKG 256


>gb|KHG24532.1| GATA transcription factor 1 -like protein [Gossypium arboreum]
          Length = 228

 Score =  207 bits (527), Expect = 2e-50
 Identities = 125/267 (46%), Positives = 153/267 (57%), Gaps = 10/267 (3%)
 Frame = -2

Query: 1185 MESLDLQGCCXXXXXXXXXXXDEYSKP---NKRTKNALSSLNRNGRDFDVSEADDDPDRL 1015
            ME+LD+  C             E  +    NK++  + SSLN N             +  
Sbjct: 1    MEALDMAACFEDNLLDFASDVGEEDEDKEHNKKSSTSSSSLNPN-------------NSC 47

Query: 1014 FPECAEEELEWLST---FPTVET-YLDISSNANISKQQSPVSVLEXXXXXXXXXXXXXXX 847
            F E AEEELEWLS    FP VET ++D+   A  +K QS +++                 
Sbjct: 48   FSEFAEEELEWLSNKDAFPAVETSFVDVLGTA--TKHQSSLTLANGNVV----------- 94

Query: 846  XXXXXXXXIMNCFGSLRVPVRARSKRCTRRRDLLYQEAWWGVHENVKTVKPVISKV---I 676
                     M CFG++++PV+ARSKR  + RDL   E  W VHENVKT            
Sbjct: 95   ---------MYCFGNVKIPVKARSKRLRKCRDLRDHEKNWRVHENVKTSNATAKGNRWRT 145

Query: 675  IGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPACSPTFSSEFHSNSH 496
            +GRKCQHCGAEKTPQWRAGP+GPKTLCNACGVRYKSGRLVPEYRPA SPTFSS  HSNSH
Sbjct: 146  MGRKCQHCGAEKTPQWRAGPLGPKTLCNACGVRYKSGRLVPEYRPASSPTFSSRLHSNSH 205

Query: 495  RKVVEMRRKKQMLGIEIGTVAVKPVDK 415
            RK++EMRR KQ     +G  ++KP+DK
Sbjct: 206  RKILEMRRHKQ-----LGFPSMKPMDK 227


>ref|XP_012484192.1| PREDICTED: GATA transcription factor 1-like isoform X1 [Gossypium
            raimondii]
          Length = 236

 Score =  206 bits (524), Expect = 4e-50
 Identities = 124/267 (46%), Positives = 153/267 (57%), Gaps = 10/267 (3%)
 Frame = -2

Query: 1185 MESLDLQGCCXXXXXXXXXXXDEYSKP---NKRTKNALSSLNRNGRDFDVSEADDDPDRL 1015
            ME+ D+  C             E  +    NK++  + SSLN N             +  
Sbjct: 1    MEAFDMAACFEDNLLDFASDVGEEDEDKEHNKKSSTSSSSLNPN-------------NSC 47

Query: 1014 FPECAEEELEWLST---FPTVET-YLDISSNANISKQQSPVSVLEXXXXXXXXXXXXXXX 847
            F E AEEELEWLS    FP VET ++D+   A  +K QS +++                 
Sbjct: 48   FSEFAEEELEWLSNKDAFPAVETSFVDVLGTA--TKHQSSLTLTNGNVV----------- 94

Query: 846  XXXXXXXXIMNCFGSLRVPVRARSKRCTRRRDLLYQEAWWGVHENVKTVKPVISKV---I 676
                     M CFG++++PV+ARSKR  + RDL   E  W VHENVKT            
Sbjct: 95   ---------MYCFGNVKIPVKARSKRLRKCRDLRDHEKNWWVHENVKTSNATAKGNRWRT 145

Query: 675  IGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPACSPTFSSEFHSNSH 496
            +GRKCQHCGAEKTPQWRAGP+GPKTLCNACGVRYKSGRLVPEYRPA SPTFSS+ HSNSH
Sbjct: 146  MGRKCQHCGAEKTPQWRAGPLGPKTLCNACGVRYKSGRLVPEYRPASSPTFSSKLHSNSH 205

Query: 495  RKVVEMRRKKQMLGIEIGTVAVKPVDK 415
            RK++EMRR KQ     +G  ++KP+DK
Sbjct: 206  RKILEMRRHKQ-----LGFPSMKPMDK 227


>gb|KJB34235.1| hypothetical protein B456_006G054800 [Gossypium raimondii]
          Length = 228

 Score =  206 bits (524), Expect = 4e-50
 Identities = 124/267 (46%), Positives = 153/267 (57%), Gaps = 10/267 (3%)
 Frame = -2

Query: 1185 MESLDLQGCCXXXXXXXXXXXDEYSKP---NKRTKNALSSLNRNGRDFDVSEADDDPDRL 1015
            ME+ D+  C             E  +    NK++  + SSLN N             +  
Sbjct: 1    MEAFDMAACFEDNLLDFASDVGEEDEDKEHNKKSSTSSSSLNPN-------------NSC 47

Query: 1014 FPECAEEELEWLST---FPTVET-YLDISSNANISKQQSPVSVLEXXXXXXXXXXXXXXX 847
            F E AEEELEWLS    FP VET ++D+   A  +K QS +++                 
Sbjct: 48   FSEFAEEELEWLSNKDAFPAVETSFVDVLGTA--TKHQSSLTLTNGNVV----------- 94

Query: 846  XXXXXXXXIMNCFGSLRVPVRARSKRCTRRRDLLYQEAWWGVHENVKTVKPVISKV---I 676
                     M CFG++++PV+ARSKR  + RDL   E  W VHENVKT            
Sbjct: 95   ---------MYCFGNVKIPVKARSKRLRKCRDLRDHEKNWWVHENVKTSNATAKGNRWRT 145

Query: 675  IGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPACSPTFSSEFHSNSH 496
            +GRKCQHCGAEKTPQWRAGP+GPKTLCNACGVRYKSGRLVPEYRPA SPTFSS+ HSNSH
Sbjct: 146  MGRKCQHCGAEKTPQWRAGPLGPKTLCNACGVRYKSGRLVPEYRPASSPTFSSKLHSNSH 205

Query: 495  RKVVEMRRKKQMLGIEIGTVAVKPVDK 415
            RK++EMRR KQ     +G  ++KP+DK
Sbjct: 206  RKILEMRRHKQ-----LGFPSMKPMDK 227


>ref|XP_012484193.1| PREDICTED: GATA transcription factor 1-like isoform X2 [Gossypium
            raimondii] gi|763767019|gb|KJB34234.1| hypothetical
            protein B456_006G054800 [Gossypium raimondii]
          Length = 229

 Score =  206 bits (524), Expect = 4e-50
 Identities = 124/267 (46%), Positives = 153/267 (57%), Gaps = 10/267 (3%)
 Frame = -2

Query: 1185 MESLDLQGCCXXXXXXXXXXXDEYSKP---NKRTKNALSSLNRNGRDFDVSEADDDPDRL 1015
            ME+ D+  C             E  +    NK++  + SSLN N             +  
Sbjct: 1    MEAFDMAACFEDNLLDFASDVGEEDEDKEHNKKSSTSSSSLNPN-------------NSC 47

Query: 1014 FPECAEEELEWLST---FPTVET-YLDISSNANISKQQSPVSVLEXXXXXXXXXXXXXXX 847
            F E AEEELEWLS    FP VET ++D+   A  +K QS +++                 
Sbjct: 48   FSEFAEEELEWLSNKDAFPAVETSFVDVLGTA--TKHQSSLTLTNGNVV----------- 94

Query: 846  XXXXXXXXIMNCFGSLRVPVRARSKRCTRRRDLLYQEAWWGVHENVKTVKPVISKV---I 676
                     M CFG++++PV+ARSKR  + RDL   E  W VHENVKT            
Sbjct: 95   ---------MYCFGNVKIPVKARSKRLRKCRDLRDHEKNWWVHENVKTSNATAKGNRWRT 145

Query: 675  IGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPACSPTFSSEFHSNSH 496
            +GRKCQHCGAEKTPQWRAGP+GPKTLCNACGVRYKSGRLVPEYRPA SPTFSS+ HSNSH
Sbjct: 146  MGRKCQHCGAEKTPQWRAGPLGPKTLCNACGVRYKSGRLVPEYRPASSPTFSSKLHSNSH 205

Query: 495  RKVVEMRRKKQMLGIEIGTVAVKPVDK 415
            RK++EMRR KQ     +G  ++KP+DK
Sbjct: 206  RKILEMRRHKQ-----LGFPSMKPMDK 227


>ref|XP_010264014.1| PREDICTED: GATA transcription factor 1 [Nelumbo nucifera]
          Length = 278

 Score =  198 bits (503), Expect = 1e-47
 Identities = 116/224 (51%), Positives = 137/224 (61%), Gaps = 18/224 (8%)
 Frame = -2

Query: 1029 DPDR---LFPECAEEELEWLST---FPTVETYLD--ISSNANISKQQSPVSVLEXXXXXX 874
            DPD     FPE  EE+LEWLS    FP VE + D  +   +   KQQSPVSVLE      
Sbjct: 72   DPDEHHHSFPELLEEDLEWLSNEDAFPAVEAFDDFLLGKLSKGPKQQSPVSVLENSSNSA 131

Query: 873  XXXXXXXXXXXXXXXXXIMNCFGSLRVPVRARSKRCTRRR----DLLYQEAWWGVHENVK 706
                              M+C G+L+VPVRARSKR  RRR    D+  Q+ WW      K
Sbjct: 132  INSSSSI-----------MSCCGNLQVPVRARSKRRRRRRSGFSDISGQQWWWWWEPKNK 180

Query: 705  TV------KPVISKVIIGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYR 544
            ++      K   +   +GR+C HC AEKTPQWRAGP+GPKTLCNACGVRYKSGRLVPEYR
Sbjct: 181  SIGGGGAAKVTKTTASMGRRCLHCLAEKTPQWRAGPLGPKTLCNACGVRYKSGRLVPEYR 240

Query: 543  PACSPTFSSEFHSNSHRKVVEMRRKKQMLGIEIGTVAVKPVDKG 412
            PACSPTFSSE HSNSHRK++EMRR+KQ        + +K +DKG
Sbjct: 241  PACSPTFSSELHSNSHRKILEMRRQKQK------ELLLKSMDKG 278


>ref|XP_008460722.1| PREDICTED: GATA transcription factor 1 isoform X2 [Cucumis melo]
          Length = 287

 Score =  191 bits (485), Expect = 1e-45
 Identities = 123/264 (46%), Positives = 146/264 (55%), Gaps = 30/264 (11%)
 Frame = -2

Query: 1113 SKPNKRTKNALSSLNRNGRDFDVSEADDDPDRLFPE-CAEEELEWLST---FPTVETYLD 946
            SK +  T    S LN      D    D    R+ PE  AEEELEWLS    FP VET++D
Sbjct: 35   SKSSSTTAPDSSDLNAAAMHPD----DSSSCRVLPEDYAEEELEWLSNEDAFPAVETFVD 90

Query: 945  ISSN------------ANISKQQSPVSVLEXXXXXXXXXXXXXXXXXXXXXXXI-MNCFG 805
            I S+             ++SKQ SPVSVLE                       I M+C G
Sbjct: 91   ILSDHHHHHAPQPPPLTSVSKQNSPVSVLESTSISSHGETINGGNKTSVHGSSILMSCCG 150

Query: 804  SLRVPVRARSKRCTRRRDLLYQEAWWGVHENVKTVKPVISKVI-------------IGRK 664
             L+VP +ARSKR  R R +     W+    + K +K V+                 IGRK
Sbjct: 151  GLKVPGKARSKR-RRGRHISGHHLWFKQQPSSKNLKQVVPTTETAAAVAATTGAAGIGRK 209

Query: 663  CQHCGAEKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPACSPTFSSEFHSNSHRKVV 484
            C HCGAEKTPQWRAGP GPKTLCNACGVR+KSGRLVPEYRPA SPTFS++ HSNSHRKV+
Sbjct: 210  CLHCGAEKTPQWRAGPFGPKTLCNACGVRFKSGRLVPEYRPASSPTFSADLHSNSHRKVM 269

Query: 483  EMRRKKQMLGIEIGTVAVKPVDKG 412
            EMRR+KQ+       + V P+DKG
Sbjct: 270  EMRRQKQL------GMVVNPMDKG 287


>gb|KRH33500.1| hypothetical protein GLYMA_10G126900 [Glycine max]
            gi|947084780|gb|KRH33501.1| hypothetical protein
            GLYMA_10G126900 [Glycine max]
          Length = 245

 Score =  191 bits (484), Expect = 2e-45
 Identities = 108/199 (54%), Positives = 124/199 (62%), Gaps = 9/199 (4%)
 Frame = -2

Query: 1029 DPDRLFPECAEEELEWLST---FPTVETYLDISS-NANISKQQSPVSVLEXXXXXXXXXX 862
            DP+  F E AEEELEWLS    FP+VET++D+SS     +K Q    VLE          
Sbjct: 51   DPNHSFSEFAEEELEWLSNKDAFPSVETFVDLSSIQPGTTKNQKSAPVLECSTGSSNSNN 110

Query: 861  XXXXXXXXXXXXXIMNCFGSLRVPVRARSKRCTRRRDLLY----QEAWWGVHENVKTVKP 694
                          +N    L+VPVRARSK  +R R  L     Q+ WW    N  +   
Sbjct: 111  STNSISL-------LNSCDHLKVPVRARSKSRSRHRPGLAENSSQQVWWRQPSNGTSKAD 163

Query: 693  VISKVI-IGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPACSPTFSS 517
               K+  IGRKCQHCGAEKTPQWRAGP GPKTLCNACGVR+KSGRLVPEYRPA SPTF S
Sbjct: 164  EGMKISSIGRKCQHCGAEKTPQWRAGPSGPKTLCNACGVRFKSGRLVPEYRPASSPTFHS 223

Query: 516  EFHSNSHRKVVEMRRKKQM 460
            + HSNSHRK+VEMRR+KQM
Sbjct: 224  DLHSNSHRKIVEMRRQKQM 242


>gb|KRH33499.1| hypothetical protein GLYMA_10G126900 [Glycine max]
          Length = 256

 Score =  191 bits (484), Expect = 2e-45
 Identities = 108/199 (54%), Positives = 124/199 (62%), Gaps = 9/199 (4%)
 Frame = -2

Query: 1029 DPDRLFPECAEEELEWLST---FPTVETYLDISS-NANISKQQSPVSVLEXXXXXXXXXX 862
            DP+  F E AEEELEWLS    FP+VET++D+SS     +K Q    VLE          
Sbjct: 62   DPNHSFSEFAEEELEWLSNKDAFPSVETFVDLSSIQPGTTKNQKSAPVLECSTGSSNSNN 121

Query: 861  XXXXXXXXXXXXXIMNCFGSLRVPVRARSKRCTRRRDLLY----QEAWWGVHENVKTVKP 694
                          +N    L+VPVRARSK  +R R  L     Q+ WW    N  +   
Sbjct: 122  STNSISL-------LNSCDHLKVPVRARSKSRSRHRPGLAENSSQQVWWRQPSNGTSKAD 174

Query: 693  VISKVI-IGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPACSPTFSS 517
               K+  IGRKCQHCGAEKTPQWRAGP GPKTLCNACGVR+KSGRLVPEYRPA SPTF S
Sbjct: 175  EGMKISSIGRKCQHCGAEKTPQWRAGPSGPKTLCNACGVRFKSGRLVPEYRPASSPTFHS 234

Query: 516  EFHSNSHRKVVEMRRKKQM 460
            + HSNSHRK+VEMRR+KQM
Sbjct: 235  DLHSNSHRKIVEMRRQKQM 253


>ref|XP_008460721.1| PREDICTED: GATA transcription factor 1 isoform X1 [Cucumis melo]
          Length = 288

 Score =  191 bits (484), Expect = 2e-45
 Identities = 123/265 (46%), Positives = 146/265 (55%), Gaps = 31/265 (11%)
 Frame = -2

Query: 1113 SKPNKRTKNALSSLNRNGRDFDVSEADDDPDRLFPE--CAEEELEWLST---FPTVETYL 949
            SK +  T    S LN      D    D    R+ PE   AEEELEWLS    FP VET++
Sbjct: 35   SKSSSTTAPDSSDLNAAAMHPD----DSSSCRVLPEEDYAEEELEWLSNEDAFPAVETFV 90

Query: 948  DISSN------------ANISKQQSPVSVLEXXXXXXXXXXXXXXXXXXXXXXXI-MNCF 808
            DI S+             ++SKQ SPVSVLE                       I M+C 
Sbjct: 91   DILSDHHHHHAPQPPPLTSVSKQNSPVSVLESTSISSHGETINGGNKTSVHGSSILMSCC 150

Query: 807  GSLRVPVRARSKRCTRRRDLLYQEAWWGVHENVKTVKPVISKVI-------------IGR 667
            G L+VP +ARSKR  R R +     W+    + K +K V+                 IGR
Sbjct: 151  GGLKVPGKARSKR-RRGRHISGHHLWFKQQPSSKNLKQVVPTTETAAAVAATTGAAGIGR 209

Query: 666  KCQHCGAEKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPACSPTFSSEFHSNSHRKV 487
            KC HCGAEKTPQWRAGP GPKTLCNACGVR+KSGRLVPEYRPA SPTFS++ HSNSHRKV
Sbjct: 210  KCLHCGAEKTPQWRAGPFGPKTLCNACGVRFKSGRLVPEYRPASSPTFSADLHSNSHRKV 269

Query: 486  VEMRRKKQMLGIEIGTVAVKPVDKG 412
            +EMRR+KQ+       + V P+DKG
Sbjct: 270  MEMRRQKQL------GMVVNPMDKG 288

