BLASTX nr result

ID: Catharanthus23_contig00002601 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00002601
         (1297 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAC98492.1| AG-motif binding protein-2 [Nicotiana tabacum]        237   9e-60
ref|XP_004247814.1| PREDICTED: GATA transcription factor 1-like ...   234   6e-59
ref|XP_006340920.1| PREDICTED: GATA transcription factor 1-like ...   228   4e-57
ref|XP_002518163.1| GATA transcription factor, putative [Ricinus...   198   5e-48
gb|EOY05429.1| GATA transcription factor 1, putative [Theobroma ...   188   5e-45
ref|XP_006355741.1| PREDICTED: GATA transcription factor 1-like ...   186   2e-44
ref|XP_004150343.1| PREDICTED: GATA transcription factor 1-like ...   186   2e-44
ref|XP_002299291.2| hypothetical protein POPTR_0001s14130g [Popu...   185   4e-44
ref|XP_004239871.1| PREDICTED: GATA transcription factor 1-like ...   185   4e-44
gb|EXB66651.1| GATA transcription factor 1 [Morus notabilis]          181   8e-43
ref|XP_004309759.1| PREDICTED: GATA transcription factor 1-like ...   178   4e-42
ref|XP_004309758.1| PREDICTED: GATA transcription factor 1-like ...   178   4e-42
gb|ESW17292.1| hypothetical protein PHAVU_007G227300g [Phaseolus...   176   2e-41
ref|XP_006492137.1| PREDICTED: GATA transcription factor 1-like ...   171   6e-40
ref|XP_006420528.1| hypothetical protein CICLE_v10005658mg [Citr...   171   6e-40
ref|XP_002285624.1| PREDICTED: GATA transcription factor 1-like ...   166   2e-38
ref|XP_006418795.1| hypothetical protein EUTSA_v10002609mg [Eutr...   165   3e-38
ref|NP_189047.1| GATA transcription factor 1 [Arabidopsis thalia...   162   2e-37
gb|AAM65139.1| GATA transcription factor 1 (AtGATA-1) [Arabidops...   161   5e-37
ref|XP_002885621.1| hypothetical protein ARALYDRAFT_479930 [Arab...   161   5e-37

>dbj|BAC98492.1| AG-motif binding protein-2 [Nicotiana tabacum]
          Length = 289

 Score =  237 bits (604), Expect = 9e-60
 Identities = 126/223 (56%), Positives = 153/223 (68%), Gaps = 5/223 (2%)
 Frame = -2

Query: 954 PEFVEEELEWLSNKDAFPTVETCFDIFSENPDIMGLDHQSPVSVLEXXXXXXXXXXXXXN 775
           PE VEEELEWLSNKDAFP VE  F I ++NP I+  DH SPVSVLE              
Sbjct: 74  PECVEEELEWLSNKDAFPAVE--FGILADNPSIV-FDHHSPVSVLENSSSTCNSSGNGSA 130

Query: 774 ---AIMSCCRSLQVPTSFPVXXXXXXXXXXRIGGFGDLPSQHLLWSN--NVKKMLQQELA 610
              A MSCC SL+VP ++PV          + G F DLPS+H +  N  + K + Q+E  
Sbjct: 131 NANAYMSCCASLKVPVNYPVRARSKRRRRRQRGSFADLPSEHCMSVNKPSFKSVKQRE-- 188

Query: 609 LPPLPTTRTNRNASSIGRRCLHCGADKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRP 430
            P L     +  ++SIGRRC HCGADKTPQWRAGP+GPKTLCNACGVRYKSGRL+PEYRP
Sbjct: 189 -PLLSLPLNSAKSASIGRRCQHCGADKTPQWRAGPLGPKTLCNACGVRYKSGRLLPEYRP 247

Query: 429 ASSPTFCAALHSNSHRKIVEMRRQKQPGIGGIMANESIGYRLG 301
           A+SPTF   +HSNSHRK++EMR+QK  G+GG+M +E+ GYR+G
Sbjct: 248 ANSPTFSPTVHSNSHRKVLEMRKQK-IGVGGMMIHEACGYRVG 289


>ref|XP_004247814.1| PREDICTED: GATA transcription factor 1-like [Solanum lycopersicum]
          Length = 285

 Score =  234 bits (597), Expect = 6e-59
 Identities = 127/225 (56%), Positives = 156/225 (69%), Gaps = 7/225 (3%)
 Frame = -2

Query: 954 PEFVEEELEWLSNKDAFPTVETCFDIFSENPDIMGLDHQSPVSVLEXXXXXXXXXXXXXN 775
           PE VEEELEWLSNKDAFP +E  F I SENP ++  DH SPVSVLE              
Sbjct: 61  PECVEEELEWLSNKDAFPAIE--FGILSENPGMV-FDHHSPVSVLENSSSTSHSSGNGVV 117

Query: 774 ---AIMSCCRSLQVPTSFPVXXXXXXXXXXRIGGFGDLPSQHLL--WSNNVKKMLQQE-- 616
              A  SCC +L+VP ++PV          R GGF D+PS+H L     + K + Q+E  
Sbjct: 118 SGNAYTSCCVNLKVPVNYPVRARSKRRRRRRRGGFADMPSEHCLPVTQPSFKNVKQREPL 177

Query: 615 LALPPLPTTRTNRNASSIGRRCLHCGADKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEY 436
           L+LP      + ++A+SIGRRC HCGADKTPQWRAGP+GPKTLCNACGVRYKSGRL+PEY
Sbjct: 178 LSLP----MNSAKSAASIGRRCQHCGADKTPQWRAGPLGPKTLCNACGVRYKSGRLLPEY 233

Query: 435 RPASSPTFCAALHSNSHRKIVEMRRQKQPGIGGIMANESIGYRLG 301
           RPA+SPTF AA HSNSHRK++EMR+ K  G+GG++ +E+ GYR+G
Sbjct: 234 RPANSPTFSAAAHSNSHRKVLEMRKHK-IGVGGMLIHEACGYRVG 277


>ref|XP_006340920.1| PREDICTED: GATA transcription factor 1-like [Solanum tuberosum]
          Length = 285

 Score =  228 bits (581), Expect = 4e-57
 Identities = 123/221 (55%), Positives = 151/221 (68%), Gaps = 4/221 (1%)
 Frame = -2

Query: 954 PEFVEEELEWLSNKDAFPTVETCFDIFSENPDIMGLDHQSPVSVLEXXXXXXXXXXXXXN 775
           PE VEEELEWLSNKDAFP +E  F I SENP ++  DH SPVSVLE              
Sbjct: 61  PECVEEELEWLSNKDAFPAIE--FGILSENPGMV-FDHHSPVSVLENSSSTSHSSGNGVV 117

Query: 774 ---AIMSCCRSLQVPTSFPVXXXXXXXXXXRIGGFGDLPSQHLLWSNNVK-KMLQQELAL 607
              A  SCC +L+VP ++PV          R GGF ++PS+H L       K ++Q   L
Sbjct: 118 NGNAYTSCCVNLKVPVNYPVRARSKRRRRRRRGGFANMPSEHCLPVTQPSFKNVKQHEPL 177

Query: 606 PPLPTTRTNRNASSIGRRCLHCGADKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPA 427
             LP     ++A+SIGRRC HCGADKTPQWRAGP+GPKTLCNACGVRYKSGRL+PEYRPA
Sbjct: 178 LSLPMNSA-KSAASIGRRCQHCGADKTPQWRAGPLGPKTLCNACGVRYKSGRLLPEYRPA 236

Query: 426 SSPTFCAALHSNSHRKIVEMRRQKQPGIGGIMANESIGYRL 304
           +SP+F AA HSNSHRK++EMR+ K  G+GG++ +E+ GYR+
Sbjct: 237 NSPSFSAAAHSNSHRKVLEMRKHK-IGVGGMLIHEACGYRV 276


>ref|XP_002518163.1| GATA transcription factor, putative [Ricinus communis]
           gi|223542759|gb|EEF44296.1| GATA transcription factor,
           putative [Ricinus communis]
          Length = 205

 Score =  198 bits (503), Expect = 5e-48
 Identities = 111/207 (53%), Positives = 128/207 (61%), Gaps = 5/207 (2%)
 Frame = -2

Query: 957 FPEFVEEELEWLSNKDAFPTVETCFDIFSENPDIMGLDHQSPVSVLEXXXXXXXXXXXXX 778
           + EF EEELEWLSNKDAFP+VET  DI +ENP  +   H+SPVSVLE             
Sbjct: 7   YREFAEEELEWLSNKDAFPSVETFVDILTENPGSLQ-KHRSPVSVLENSTTSSTSNSGHS 65

Query: 777 NA----IMSCCRSLQVPTSFPVXXXXXXXXXXRIGGFGDLPSQHLLWSN-NVKKMLQQEL 613
                 IM+ CRSL VP                     DL  Q   WS  N+KK+     
Sbjct: 66  GTNDSVIMNYCRSLHVPVKARSKPHRRRRR--------DLGGQQCWWSQENLKKVKV--- 114

Query: 612 ALPPLPTTRTNRNASSIGRRCLHCGADKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYR 433
                       ++S+IGR+C HCGA+KTPQWRAGP+GPKTLCNACGVRYKSGRLVPEYR
Sbjct: 115 ---------VKSSSSTIGRKCQHCGAEKTPQWRAGPLGPKTLCNACGVRYKSGRLVPEYR 165

Query: 432 PASSPTFCAALHSNSHRKIVEMRRQKQ 352
           PASSPTF + LHSNSHRK++EMRRQKQ
Sbjct: 166 PASSPTFSSVLHSNSHRKVLEMRRQKQ 192


>gb|EOY05429.1| GATA transcription factor 1, putative [Theobroma cacao]
          Length = 243

 Score =  188 bits (477), Expect = 5e-45
 Identities = 108/214 (50%), Positives = 123/214 (57%), Gaps = 6/214 (2%)
 Frame = -2

Query: 957 FPEFVEEELEWLSNKDAFPTVETCFDIFSENPDIMGLDHQSPVSVLEXXXXXXXXXXXXX 778
           FPEF EEELEW+SNKDAFP+VET  DI           HQSPVSVL+             
Sbjct: 48  FPEFAEEELEWISNKDAFPSVETFVDILGT-----AAKHQSPVSVLDNSNSSSNSSGSST 102

Query: 777 NA----IMSCCRSLQVPTSFPVXXXXXXXXXXRIGGFGDLPSQHLLW--SNNVKKMLQQE 616
                 +M CC +L+VP               R+    DL +Q   W    NVK      
Sbjct: 103 LTNGNIVMYCCGNLKVPVK---------ARSKRLRKCRDLRNQENSWWVQENVKNASAHV 153

Query: 615 LALPPLPTTRTNRNASSIGRRCLHCGADKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEY 436
                         + +IGR+C HCGA+KTPQWRAGP+GPKTLCNACGVRYKSGRLVPEY
Sbjct: 154 ----------KGAGSRTIGRKCQHCGAEKTPQWRAGPLGPKTLCNACGVRYKSGRLVPEY 203

Query: 435 RPASSPTFCAALHSNSHRKIVEMRRQKQPGIGGI 334
           RPASSPTF   LHSNSHRKI+EMRRQKQ G   +
Sbjct: 204 RPASSPTFSIELHSNSHRKILEMRRQKQFGFSAM 237


>ref|XP_006355741.1| PREDICTED: GATA transcription factor 1-like [Solanum tuberosum]
          Length = 255

 Score =  186 bits (471), Expect = 2e-44
 Identities = 105/210 (50%), Positives = 125/210 (59%), Gaps = 9/210 (4%)
 Frame = -2

Query: 957 FPEFVEEELEWLSNKDAFPTVETCFDIFSENPDIMGLDHQSPVSVLEXXXXXXXXXXXXX 778
           FP++VEEELEWLSNKDAFP VE  FDIFS++   +  DH SP SVLE             
Sbjct: 53  FPDYVEEELEWLSNKDAFPAVE--FDIFSDHVPNVIFDHHSPNSVLENSSSNNNNNNNCN 110

Query: 777 NAIMSCCRS------LQVPTSFPVXXXXXXXXXXRIGGFGDLPSQHLLWSNNVK---KML 625
             +     +      LQVP + PV           +           +W N VK      
Sbjct: 111 VNVKKNAFTSHTSSLLQVPINHPVGARSKRRRRIALQC-----DNSCVWGNQVKFNNTST 165

Query: 624 QQELALPPLPTTRTNRNASSIGRRCLHCGADKTPQWRAGPMGPKTLCNACGVRYKSGRLV 445
           +Q L L  +  T+  R  +SIGRRC HCG DKTPQWRAGP GPKTLCNACGVRYKSGRL 
Sbjct: 166 KQGLTLLKISMTKAKRG-TSIGRRCQHCGVDKTPQWRAGPTGPKTLCNACGVRYKSGRLF 224

Query: 444 PEYRPASSPTFCAALHSNSHRKIVEMRRQK 355
           PEYRPA+SPTF   LHS+SHRK++EMR+Q+
Sbjct: 225 PEYRPANSPTFSVDLHSSSHRKVLEMRKQR 254


>ref|XP_004150343.1| PREDICTED: GATA transcription factor 1-like [Cucumis sativus]
           gi|449514819|ref|XP_004164489.1| PREDICTED: GATA
           transcription factor 1-like [Cucumis sativus]
          Length = 287

 Score =  186 bits (471), Expect = 2e-44
 Identities = 111/227 (48%), Positives = 131/227 (57%), Gaps = 24/227 (10%)
 Frame = -2

Query: 951 EFVEEELEWLSNKDAFPTVETCFDIFSEN--------PDIMGLDHQ-SPVSVLEXXXXXX 799
           E+ EEELEWLSN+DAFP VET  DI S++        P +  +  Q SPVSVLE      
Sbjct: 67  EYAEEELEWLSNEDAFPAVETFVDILSDHHHHHAPQPPPLPSVSKQNSPVSVLESTSISS 126

Query: 798 XXXXXXXN---------AIMSCCRSLQVPTSFPVXXXXXXXXXXRIGGFGDLPSQHLLWS 646
                             +MSCC SL+VP+                  F   PS     S
Sbjct: 127 HGETTNGGNKTSVHSSSILMSCCGSLKVPSKARSKRRRGRHISGHHLLFKQQPS-----S 181

Query: 645 NNVKKMLQQELALPPLPTTRT------NRNASSIGRRCLHCGADKTPQWRAGPMGPKTLC 484
            N+K+++         PTT T          + IGR+CLHCGA+KTPQWRAGP GPKTLC
Sbjct: 182 KNLKQVV---------PTTATAAVVAATTGTAGIGRKCLHCGAEKTPQWRAGPFGPKTLC 232

Query: 483 NACGVRYKSGRLVPEYRPASSPTFCAALHSNSHRKIVEMRRQKQPGI 343
           NACGVR+KSGRLVPEYRPASSPTF A LHSNSHRK++EMRRQKQ G+
Sbjct: 233 NACGVRFKSGRLVPEYRPASSPTFSAELHSNSHRKVMEMRRQKQLGM 279


>ref|XP_002299291.2| hypothetical protein POPTR_0001s14130g [Populus trichocarpa]
           gi|550347223|gb|EEE84096.2| hypothetical protein
           POPTR_0001s14130g [Populus trichocarpa]
          Length = 308

 Score =  185 bits (469), Expect = 4e-44
 Identities = 107/204 (52%), Positives = 121/204 (59%), Gaps = 3/204 (1%)
 Frame = -2

Query: 954 PEFVEEELEWLSNKDAFPTVETCFDIFSENPDIMGLDHQSPVSVLEXXXXXXXXXXXXXN 775
           PEF EEELEWLSNKDAFPTVETCF   S  P  +   H SPVSVLE             +
Sbjct: 114 PEFAEEELEWLSNKDAFPTVETCFGSLSGEPGSIP-KHHSPVSVLENSTTSSTSNSGNSS 172

Query: 774 ---AIMSCCRSLQVPTSFPVXXXXXXXXXXRIGGFGDLPSQHLLWSNNVKKMLQQELALP 604
               IMS CR L+VP                     ++  Q   WS        QE  + 
Sbjct: 173 NSNIIMSYCR-LRVPVKARSKRHHRHPR--------EIQEQECWWS--------QENFIT 215

Query: 603 PLPTTRTNRNASSIGRRCLHCGADKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPAS 424
             P      + + +GR+C HCG +KTPQWRAGP GPKTLCNACGVRYKSGRLVPEYRPA+
Sbjct: 216 RKPAV----SVAKLGRKCQHCGVEKTPQWRAGPDGPKTLCNACGVRYKSGRLVPEYRPAN 271

Query: 423 SPTFCAALHSNSHRKIVEMRRQKQ 352
           SPTF + LHSNSHRK+VEMRRQKQ
Sbjct: 272 SPTFSSKLHSNSHRKVVEMRRQKQ 295


>ref|XP_004239871.1| PREDICTED: GATA transcription factor 1-like [Solanum lycopersicum]
          Length = 247

 Score =  185 bits (469), Expect = 4e-44
 Identities = 103/204 (50%), Positives = 121/204 (59%), Gaps = 3/204 (1%)
 Frame = -2

Query: 957 FPEFVEEELEWLSNKDAFPTVETCFDIFSENPDIMGLDHQSPVSVLEXXXXXXXXXXXXX 778
           FP++VEEELEWLSNKDAFP VE  FD+FS++   +  DH SP SVLE             
Sbjct: 54  FPDYVEEELEWLSNKDAFPAVE--FDLFSDH---VIFDHHSPNSVLENNNNNCNVNLKDN 108

Query: 777 NAIMSCCRSLQVPTSFPVXXXXXXXXXXRIGGFGDLPSQHLLWSNNVK---KMLQQELAL 607
                    LQVP + PV           +           +W N VK      +Q L L
Sbjct: 109 AFTSHASSLLQVPMNHPVGTRSKRRRRIALQC-----DNSCVWGNQVKFNNTSTKQGLTL 163

Query: 606 PPLPTTRTNRNASSIGRRCLHCGADKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPA 427
             +   +  R  +SIGR C HCG DKTPQWRAGP GPKTLCNACGVRYKSGRL PEYRPA
Sbjct: 164 LKISMAKAKRG-TSIGRTCQHCGVDKTPQWRAGPTGPKTLCNACGVRYKSGRLFPEYRPA 222

Query: 426 SSPTFCAALHSNSHRKIVEMRRQK 355
           +SPTF   LHSNSHRK++EMR+Q+
Sbjct: 223 NSPTFSVELHSNSHRKVLEMRKQR 246


>gb|EXB66651.1| GATA transcription factor 1 [Morus notabilis]
          Length = 518

 Score =  181 bits (458), Expect = 8e-43
 Identities = 107/216 (49%), Positives = 121/216 (56%), Gaps = 16/216 (7%)
 Frame = -2

Query: 951 EFVEEELEWLSNKDAFPTVETCFDIFSENPDIMGLDHQSPVSVLEXXXXXXXXXXXXXNA 772
           E VEEELEW+SNKDAFP VE+   I  +NP    L H SPVSVL+             N+
Sbjct: 133 ELVEEELEWISNKDAFPAVESFVGILPDNPSGAILKHHSPVSVLDGGSGGSSTISCNSNS 192

Query: 771 -----------IMSCCRSLQVPTSFPVXXXXXXXXXXRIGGFGDLPSQHLLWS-----NN 640
                      + SC  SL+ P                    GD+  + L WS     NN
Sbjct: 193 NCSNSSSSIATLTSCFSSLKAPRRARSKRRCRRRG-------GDITGRQLCWSQANNNNN 245

Query: 639 VKKMLQQELALPPLPTTRTNRNASSIGRRCLHCGADKTPQWRAGPMGPKTLCNACGVRYK 460
            +     E A     T  T    + IGR+C HCGADKTPQWRAGP GPKTLCNACGVRYK
Sbjct: 246 NESFTGYEKATRKTTTMTT----TIIGRKCQHCGADKTPQWRAGPYGPKTLCNACGVRYK 301

Query: 459 SGRLVPEYRPASSPTFCAALHSNSHRKIVEMRRQKQ 352
           SGRLV EYRPASSPTF + LHSNSHRKI+EMRR KQ
Sbjct: 302 SGRLVSEYRPASSPTFSSELHSNSHRKILEMRRTKQ 337


>ref|XP_004309759.1| PREDICTED: GATA transcription factor 1-like isoform 2 [Fragaria
           vesca subsp. vesca]
          Length = 194

 Score =  178 bits (452), Expect = 4e-42
 Identities = 101/218 (46%), Positives = 128/218 (58%), Gaps = 5/218 (2%)
 Frame = -2

Query: 951 EFVEEELEWLSNKDAFPTVETCFDIFSENPDIMGLDHQSPVSVLEXXXXXXXXXXXXXNA 772
           E  EEELEW+SNKDAFP VET F +  +   I    HQSPVSVLE              +
Sbjct: 6   EEAEEELEWISNKDAFPAVET-FILSEQVGGIAIAKHQSPVSVLETSTNSSSA------S 58

Query: 771 IMSCCRSLQVPTSFPVXXXXXXXXXXRIGGFGDLPSQHLLWSNNVKKMLQQELALPPLPT 592
           +MS C  L+ P                     ++P Q L W+             PP+ +
Sbjct: 59  LMSSCGGLKPPHRARTKGRRRR---------SEIPPQQLFWNQ------------PPIES 97

Query: 591 TRTNRNASS-----IGRRCLHCGADKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPA 427
           ++ +R++ S     IGR+CLHCG D+TPQWRAGP GPKTLCNACGVRYKSGRL PEYRPA
Sbjct: 98  SKPSRSSGSASKLDIGRKCLHCGTDQTPQWRAGPHGPKTLCNACGVRYKSGRLCPEYRPA 157

Query: 426 SSPTFCAALHSNSHRKIVEMRRQKQPGIGGIMANESIG 313
           SSP+F + +HSNSHRK++EMR+ K  G+G ++  E  G
Sbjct: 158 SSPSFSSQMHSNSHRKVLEMRKHKY-GVGMVVKPEDKG 194


>ref|XP_004309758.1| PREDICTED: GATA transcription factor 1-like isoform 1 [Fragaria
           vesca subsp. vesca]
          Length = 227

 Score =  178 bits (452), Expect = 4e-42
 Identities = 101/218 (46%), Positives = 128/218 (58%), Gaps = 5/218 (2%)
 Frame = -2

Query: 951 EFVEEELEWLSNKDAFPTVETCFDIFSENPDIMGLDHQSPVSVLEXXXXXXXXXXXXXNA 772
           E  EEELEW+SNKDAFP VET F +  +   I    HQSPVSVLE              +
Sbjct: 39  EEAEEELEWISNKDAFPAVET-FILSEQVGGIAIAKHQSPVSVLETSTNSSSA------S 91

Query: 771 IMSCCRSLQVPTSFPVXXXXXXXXXXRIGGFGDLPSQHLLWSNNVKKMLQQELALPPLPT 592
           +MS C  L+ P                     ++P Q L W+             PP+ +
Sbjct: 92  LMSSCGGLKPPHRARTKGRRRR---------SEIPPQQLFWNQ------------PPIES 130

Query: 591 TRTNRNASS-----IGRRCLHCGADKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPA 427
           ++ +R++ S     IGR+CLHCG D+TPQWRAGP GPKTLCNACGVRYKSGRL PEYRPA
Sbjct: 131 SKPSRSSGSASKLDIGRKCLHCGTDQTPQWRAGPHGPKTLCNACGVRYKSGRLCPEYRPA 190

Query: 426 SSPTFCAALHSNSHRKIVEMRRQKQPGIGGIMANESIG 313
           SSP+F + +HSNSHRK++EMR+ K  G+G ++  E  G
Sbjct: 191 SSPSFSSQMHSNSHRKVLEMRKHKY-GVGMVVKPEDKG 227


>gb|ESW17292.1| hypothetical protein PHAVU_007G227300g [Phaseolus vulgaris]
           gi|561018489|gb|ESW17293.1| hypothetical protein
           PHAVU_007G227300g [Phaseolus vulgaris]
          Length = 250

 Score =  176 bits (445), Expect = 2e-41
 Identities = 100/209 (47%), Positives = 123/209 (58%), Gaps = 3/209 (1%)
 Frame = -2

Query: 957 FPEFVEEELEWLSNKDAFPTVETCFDIFSENPDIMGLDHQSPVS--VLE-XXXXXXXXXX 787
           + EFVEEELEWLSNKDAFP+VET  D+    PD   +   +P +  +LE           
Sbjct: 56  YSEFVEEELEWLSNKDAFPSVETFVDLSCIQPDTAKMRKTTPATTPMLEYSSGSSNSNNS 115

Query: 786 XXXNAIMSCCRSLQVPTSFPVXXXXXXXXXXRIGGFGDLPSQHLLWSNNVKKMLQQELAL 607
               ++++ C  L+V    PV          R G   +   Q   W     +  + E  +
Sbjct: 116 SNSISLLNSCDHLKV----PVRARSKRRSRCRPGIADENSGQQFWWRQPSNETSKAEEGM 171

Query: 606 PPLPTTRTNRNASSIGRRCLHCGADKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPA 427
                       S IGR+C HCGA+KTPQWRAGP+GPKTLCNACGVR+KSGRLVPEYRPA
Sbjct: 172 ----------KISPIGRKCQHCGAEKTPQWRAGPLGPKTLCNACGVRFKSGRLVPEYRPA 221

Query: 426 SSPTFCAALHSNSHRKIVEMRRQKQPGIG 340
           SSP+F + LHSNSHRKI EMRRQKQ G+G
Sbjct: 222 SSPSFRSDLHSNSHRKITEMRRQKQTGMG 250


>ref|XP_006492137.1| PREDICTED: GATA transcription factor 1-like [Citrus sinensis]
          Length = 262

 Score =  171 bits (433), Expect = 6e-40
 Identities = 106/212 (50%), Positives = 122/212 (57%), Gaps = 10/212 (4%)
 Frame = -2

Query: 957 FPEFVEEELEWLSNKDAFPTVETCFDIFSENPDIMGLDHQSPVSVLEXXXXXXXXXXXXX 778
           FPE  EEELEWLSN   FPTVET  DI S NP+I  L  QSP SVLE             
Sbjct: 58  FPECAEEELEWLSN---FPTVETFVDI-SSNPNI--LKQQSPNSVLENSNSSSSTSTNGS 111

Query: 777 NA----------IMSCCRSLQVPTSFPVXXXXXXXXXXRIGGFGDLPSQHLLWSNNVKKM 628
                       IM+CC +L+VP                     +L +Q   W +    +
Sbjct: 112 TITNGNNNSNSIIMNCCGNLRVPVRARSKLRTRCRR--------ELLNQEAWWGSVHGSV 163

Query: 627 LQQELALPPLPTTRTNRNASSIGRRCLHCGADKTPQWRAGPMGPKTLCNACGVRYKSGRL 448
              + A P +           IGR+C HCGA+KTPQWRAGPMGPKTLCNACGVR+KSGRL
Sbjct: 164 ---KAAKPVVSKV-------IIGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRL 213

Query: 447 VPEYRPASSPTFCAALHSNSHRKIVEMRRQKQ 352
           VPEYRPA+SPTF + LHSNSHRK+VEMRRQKQ
Sbjct: 214 VPEYRPANSPTFSSELHSNSHRKVVEMRRQKQ 245


>ref|XP_006420528.1| hypothetical protein CICLE_v10005658mg [Citrus clementina]
           gi|557522401|gb|ESR33768.1| hypothetical protein
           CICLE_v10005658mg [Citrus clementina]
          Length = 262

 Score =  171 bits (433), Expect = 6e-40
 Identities = 106/212 (50%), Positives = 122/212 (57%), Gaps = 10/212 (4%)
 Frame = -2

Query: 957 FPEFVEEELEWLSNKDAFPTVETCFDIFSENPDIMGLDHQSPVSVLEXXXXXXXXXXXXX 778
           FPE  EEELEWLSN   FPTVET  DI S NP+I  L  QSP SVLE             
Sbjct: 58  FPECAEEELEWLSN---FPTVETFVDI-SSNPNI--LKQQSPNSVLENSNSSSSTSTNGS 111

Query: 777 NA----------IMSCCRSLQVPTSFPVXXXXXXXXXXRIGGFGDLPSQHLLWSNNVKKM 628
                       IM+CC +L+VP                     +L +Q   W +    +
Sbjct: 112 TITNGNNNSNSIIMNCCGNLRVPVRARSKLRTRCRR--------ELLNQEAWWGSVHGSV 163

Query: 627 LQQELALPPLPTTRTNRNASSIGRRCLHCGADKTPQWRAGPMGPKTLCNACGVRYKSGRL 448
              + A P +           IGR+C HCGA+KTPQWRAGPMGPKTLCNACGVR+KSGRL
Sbjct: 164 ---KAAKPVVSKV-------IIGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRL 213

Query: 447 VPEYRPASSPTFCAALHSNSHRKIVEMRRQKQ 352
           VPEYRPA+SPTF + LHSNSHRK+VEMRRQKQ
Sbjct: 214 VPEYRPANSPTFSSELHSNSHRKVVEMRRQKQ 245


>ref|XP_002285624.1| PREDICTED: GATA transcription factor 1-like [Vitis vinifera]
          Length = 251

 Score =  166 bits (420), Expect = 2e-38
 Identities = 97/201 (48%), Positives = 120/201 (59%), Gaps = 3/201 (1%)
 Frame = -2

Query: 945 VEEELEWLSNKDAFPTVETCFDIFSENPDIMGLDHQSPVSVLEXXXXXXXXXXXXXNA-- 772
           VEEELEWL NKD FP VET  D    + + +    QSP+SVLE             +   
Sbjct: 53  VEEELEWL-NKDVFPGVETFLDYLPTSVENIP-KQQSPISVLENSSHSSSSNNSNSSTTT 110

Query: 771 IMSCCRSLQVPTSFPVXXXXXXXXXXRIGGFGDLPSQHLLWSNNVKKMLQQELALPPLPT 592
           IMSCC + +VP+                  F D+P Q   W ++         +    PT
Sbjct: 111 IMSCCENFRVPSRARSKRRRRRHKD-----FSDIPGQPWWWWSSQGNTNANHSS----PT 161

Query: 591 -TRTNRNASSIGRRCLHCGADKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPASSPT 415
            ++    +S+IGR+C HC A+KTPQWRAGP+GPKTLCNACGVRYKSGRLV EYRPASSPT
Sbjct: 162 NSKQTITSSTIGRKCQHCQAEKTPQWRAGPLGPKTLCNACGVRYKSGRLVAEYRPASSPT 221

Query: 414 FCAALHSNSHRKIVEMRRQKQ 352
           F + +HSNSHRKI+EMR+ KQ
Sbjct: 222 FSSKVHSNSHRKIMEMRKLKQ 242


>ref|XP_006418795.1| hypothetical protein EUTSA_v10002609mg [Eutrema salsugineum]
           gi|557096723|gb|ESQ37231.1| hypothetical protein
           EUTSA_v10002609mg [Eutrema salsugineum]
          Length = 319

 Score =  165 bits (418), Expect = 3e-38
 Identities = 103/230 (44%), Positives = 123/230 (53%), Gaps = 27/230 (11%)
 Frame = -2

Query: 954 PEFVEEELEWLSNKDAFPTVETC--------FDIFSENPDIMGLDHQSPVSVLEXXXXXX 799
           P  VEE+LEW+SNKDAFP +ET         F + S   +       SPVSVLE      
Sbjct: 97  PGVVEEDLEWISNKDAFPVIETFVGVLPSEHFRLSSPEGEATEGKQLSPVSVLETSSHNS 156

Query: 798 XXXXXXXNA-------------------IMSCCRSLQVPTSFPVXXXXXXXXXXRIGGFG 676
                  ++                   +M+CC  L VP                  G  
Sbjct: 157 SITTATTSSGGSNGSTVAATATAATTTTMMNCCVGLNVPGKARSKRRRT--------GRR 208

Query: 675 DLPSQHLLWSNNVKKMLQQELALPPLPTTRTNRNASSIGRRCLHCGADKTPQWRAGPMGP 496
           DL    +LW+ N ++  Q++       T      A S+GR+C HCGA+KTPQWRAGP GP
Sbjct: 209 DLK---VLWTGNNEQGPQKK------KTPSVAAAAVSLGRKCQHCGAEKTPQWRAGPSGP 259

Query: 495 KTLCNACGVRYKSGRLVPEYRPASSPTFCAALHSNSHRKIVEMRRQKQPG 346
           KTLCNACGVRYKSGRLVPEYRPA+SPTF A LHSNSHRKIVEMR+Q Q G
Sbjct: 260 KTLCNACGVRYKSGRLVPEYRPANSPTFSAELHSNSHRKIVEMRKQFQSG 309


>ref|NP_189047.1| GATA transcription factor 1 [Arabidopsis thaliana]
           gi|62900367|sp|Q8LAU9.2|GATA1_ARATH RecName: Full=GATA
           transcription factor 1; Short=AtGATA-1
           gi|2959730|emb|CAA73999.1| homologous to GATA-binding
           transcription factors [Arabidopsis thaliana]
           gi|9294674|dbj|BAB03023.1| protein homologous to
           GATA-binding transcription factors [Arabidopsis
           thaliana] gi|87116628|gb|ABD19678.1| At3g24050
           [Arabidopsis thaliana] gi|332643327|gb|AEE76848.1| GATA
           transcription factor 1 [Arabidopsis thaliana]
          Length = 274

 Score =  162 bits (411), Expect = 2e-37
 Identities = 102/226 (45%), Positives = 122/226 (53%), Gaps = 25/226 (11%)
 Frame = -2

Query: 942 EEELEWLSNKDAFPTVETCFDIF-SENPDIMGLDHQ--------SPVSVLEXXXXXXXXX 790
           EE+LEW+SNK+AFP +ET   +  SE+  I  L  +        SPVSVLE         
Sbjct: 58  EEDLEWISNKNAFPVIETFVGVLPSEHFPITSLLEREATEVKQLSPVSVLETSSHSSTTT 117

Query: 789 XXXXNA----------------IMSCCRSLQVPTSFPVXXXXXXXXXXRIGGFGDLPSQH 658
               +                 IMSCC   + P                  G  DL    
Sbjct: 118 TSNSSGGSNGSTAVATTTTTPTIMSCCVGFKAPAKARSKRRRT--------GRRDL---R 166

Query: 657 LLWSNNVKKMLQQELALPPLPTTRTNRNASSIGRRCLHCGADKTPQWRAGPMGPKTLCNA 478
           +LW+ N +  +Q++       T      A  +GR+C HCGA+KTPQWRAGP GPKTLCNA
Sbjct: 167 VLWTGNEQGGIQKK------KTMTVAAAALIMGRKCQHCGAEKTPQWRAGPAGPKTLCNA 220

Query: 477 CGVRYKSGRLVPEYRPASSPTFCAALHSNSHRKIVEMRRQKQPGIG 340
           CGVRYKSGRLVPEYRPA+SPTF A LHSNSHRKIVEMR+Q Q G G
Sbjct: 221 CGVRYKSGRLVPEYRPANSPTFTAELHSNSHRKIVEMRKQYQSGDG 266


>gb|AAM65139.1| GATA transcription factor 1 (AtGATA-1) [Arabidopsis thaliana]
          Length = 268

 Score =  161 bits (408), Expect = 5e-37
 Identities = 101/226 (44%), Positives = 122/226 (53%), Gaps = 25/226 (11%)
 Frame = -2

Query: 942 EEELEWLSNKDAFPTVETCFDIF-SENPDIMGLDHQ--------SPVSVLEXXXXXXXXX 790
           EE+L+W+SNK+AFP +ET   +  SE+  I  L  +        SPVSVLE         
Sbjct: 52  EEDLQWISNKNAFPVIETFVGVLPSEHFPITSLLEREATEVKQLSPVSVLETSSHSSTTT 111

Query: 789 XXXXNA----------------IMSCCRSLQVPTSFPVXXXXXXXXXXRIGGFGDLPSQH 658
               +                 IMSCC   + P                  G  DL    
Sbjct: 112 TSNSSGGSNGSTAVATTTTTPTIMSCCVGFKAPAKARSKRRRT--------GRRDL---R 160

Query: 657 LLWSNNVKKMLQQELALPPLPTTRTNRNASSIGRRCLHCGADKTPQWRAGPMGPKTLCNA 478
           +LW+ N +  +Q++       T      A  +GR+C HCGA+KTPQWRAGP GPKTLCNA
Sbjct: 161 VLWTGNEQGGIQKK------KTMTVAAAALIMGRKCQHCGAEKTPQWRAGPAGPKTLCNA 214

Query: 477 CGVRYKSGRLVPEYRPASSPTFCAALHSNSHRKIVEMRRQKQPGIG 340
           CGVRYKSGRLVPEYRPA+SPTF A LHSNSHRKIVEMR+Q Q G G
Sbjct: 215 CGVRYKSGRLVPEYRPANSPTFTAELHSNSHRKIVEMRKQYQSGDG 260


>ref|XP_002885621.1| hypothetical protein ARALYDRAFT_479930 [Arabidopsis lyrata subsp.
           lyrata] gi|297331461|gb|EFH61880.1| hypothetical protein
           ARALYDRAFT_479930 [Arabidopsis lyrata subsp. lyrata]
          Length = 270

 Score =  161 bits (408), Expect = 5e-37
 Identities = 98/221 (44%), Positives = 120/221 (54%), Gaps = 22/221 (9%)
 Frame = -2

Query: 942 EEELEWLSNKDAFPTVETCFDIFSENPDIMGLDHQ--SPVSVLEXXXXXXXXXXXXXN-- 775
           EE+LEW+SNK+AFP +ET   +   +P+    + +  SPVSVLE             +  
Sbjct: 58  EEDLEWISNKNAFPVIETFVGVLPLSPEREATEGKQLSPVSVLETSSHSSTTTTATTSNS 117

Query: 774 ------------------AIMSCCRSLQVPTSFPVXXXXXXXXXXRIGGFGDLPSQHLLW 649
                              IMSCC   + P                  G  DL    +LW
Sbjct: 118 SGGSNGSTAVATTATTTTTIMSCCVGFKAPAKARSKRRRT--------GRRDLG---VLW 166

Query: 648 SNNVKKMLQQELALPPLPTTRTNRNASSIGRRCLHCGADKTPQWRAGPMGPKTLCNACGV 469
           + N +  +Q+    P +        A  +GR+C HCGA+KTPQWRAGP GPKTLCNACGV
Sbjct: 167 TGNEQVGIQKRKT-PSVAAAA----AMIMGRKCQHCGAEKTPQWRAGPAGPKTLCNACGV 221

Query: 468 RYKSGRLVPEYRPASSPTFCAALHSNSHRKIVEMRRQKQPG 346
           RYKSGRLVPEYRPA+SPTF A LHSNSHRKIVEMR+Q Q G
Sbjct: 222 RYKSGRLVPEYRPANSPTFTAELHSNSHRKIVEMRKQYQSG 262

