BLASTX nr result

ID: Zanthoxylum22_contig00028881

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zanthoxylum22_contig00028881
         (871 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006424556.1| hypothetical protein CICLE_v10029015mg [Citr...   347   5e-93
ref|XP_006488078.1| PREDICTED: GATA transcription factor 11-like...   337   7e-90
gb|KHG27547.1| GATA transcription factor 10 -like protein [Gossy...   210   1e-51
ref|XP_012471520.1| PREDICTED: GATA transcription factor 11-like...   210   1e-51
ref|XP_012471521.1| PREDICTED: GATA transcription factor 11-like...   210   1e-51
ref|XP_007016761.1| GATA zinc finger protein regulating nitrogen...   207   1e-50
ref|XP_012443598.1| PREDICTED: GATA transcription factor 11-like...   191   6e-46
gb|KHG05024.1| GATA transcription factor 11 -like protein [Gossy...   191   6e-46
gb|KJB26178.1| hypothetical protein B456_004G230100 [Gossypium r...   189   3e-45
gb|KJB20301.1| hypothetical protein B456_003G142600 [Gossypium r...   186   3e-44
gb|KJB56141.1| hypothetical protein B456_009G107900 [Gossypium r...   178   5e-42
gb|KJB56138.1| hypothetical protein B456_009G107900 [Gossypium r...   178   5e-42
ref|XP_012064808.1| PREDICTED: GATA transcription factor 11 [Jat...   176   3e-41
gb|KHG08445.1| GATA transcription factor 11 -like protein [Gossy...   167   7e-39
ref|XP_010031984.1| PREDICTED: GATA transcription factor 10-like...   165   5e-38
ref|XP_010031983.1| PREDICTED: GATA transcription factor 10-like...   165   5e-38
ref|XP_011040891.1| PREDICTED: GATA transcription factor 11-like...   163   1e-37
ref|XP_011040892.1| PREDICTED: GATA transcription factor 11-like...   162   4e-37
gb|KHG09661.1| GATA transcription factor 11 -like protein [Gossy...   159   2e-36
ref|XP_002529940.1| GATA transcription factor, putative [Ricinus...   156   2e-35

>ref|XP_006424556.1| hypothetical protein CICLE_v10029015mg [Citrus clementina]
           gi|557526490|gb|ESR37796.1| hypothetical protein
           CICLE_v10029015mg [Citrus clementina]
           gi|641854508|gb|KDO73316.1| hypothetical protein
           CISIN_1g023588mg [Citrus sinensis]
          Length = 280

 Score =  347 bits (891), Expect = 5e-93
 Identities = 181/258 (70%), Positives = 193/258 (74%)
 Frame = -3

Query: 869 LEDLEANNLGVEDWKLKFQALEPPPFSSLTDFSEVPRSNSISCNIGPQVKQQQGSTDTFS 690
           LEDLEA N+GV+DW   F+ALEPPPF   TDF  VP SN ISC+  PQVKQ+  STDT S
Sbjct: 22  LEDLEACNIGVDDWNANFEALEPPPFG-WTDFPVVPTSNHISCH-RPQVKQKPSSTDTSS 79

Query: 689 SRSSFICNNSNDGKYLLLXXXXXXXXXXXXXXXXXXVKHVPINPKLVFXXXXXXXXXXRP 510
           SRSS++CN SNDGKYLLL                   KHVPINPKLVF          RP
Sbjct: 80  SRSSYVCNKSNDGKYLLLSQTSSPISVLESGGSCSADKHVPINPKLVFAVKRARSKRRRP 139

Query: 509 ATLNPLFVYHFIPSTSSTSEDYNPSTASESGYEINLTEXXXXXXXXXXKNLTMLSGPREI 330
           ATLNPLF+Y FI STSSTSEDY+P TASESG E+NLTE          KNLT+LSG RE 
Sbjct: 140 ATLNPLFIYPFISSTSSTSEDYHPETASESGSEMNLTEKPVRKKQKRKKNLTVLSGSREN 199

Query: 329 KKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNACGVRYRSGRLVPEYRPAASPTF 150
           KK SFQQTDT RKCMHCEV ETPQWREGPMGPKTLCNACGVRYRSGRLVPEYRPAASPTF
Sbjct: 200 KKLSFQQTDTPRKCMHCEVAETPQWREGPMGPKTLCNACGVRYRSGRLVPEYRPAASPTF 259

Query: 149 LPSLHSNSHKRIMEMRNK 96
           +PSLHSNSHKRIMEMRNK
Sbjct: 260 VPSLHSNSHKRIMEMRNK 277


>ref|XP_006488078.1| PREDICTED: GATA transcription factor 11-like [Citrus sinensis]
          Length = 277

 Score =  337 bits (864), Expect = 7e-90
 Identities = 178/258 (68%), Positives = 190/258 (73%)
 Frame = -3

Query: 869 LEDLEANNLGVEDWKLKFQALEPPPFSSLTDFSEVPRSNSISCNIGPQVKQQQGSTDTFS 690
           LEDLEA N+GV+DW   F+ALEPPPF   TDF  VP SN ISC+  PQVKQ+  STDT S
Sbjct: 22  LEDLEACNIGVDDWNANFEALEPPPFG-WTDFPVVPTSNHISCH-RPQVKQKPSSTDTSS 79

Query: 689 SRSSFICNNSNDGKYLLLXXXXXXXXXXXXXXXXXXVKHVPINPKLVFXXXXXXXXXXRP 510
           SRSS++CN SNDGKYLLL                   KHVPINPKLVF          RP
Sbjct: 80  SRSSYVCNKSNDGKYLLLSQTSSPISVLESGGSCSAEKHVPINPKLVFAVKRARSKRRRP 139

Query: 509 ATLNPLFVYHFIPSTSSTSEDYNPSTASESGYEINLTEXXXXXXXXXXKNLTMLSGPREI 330
           ATLNPLF+Y FI   SSTSEDY+P TASESG E+NLTE          KNLT+LSG RE 
Sbjct: 140 ATLNPLFIYPFI---SSTSEDYHPETASESGSEMNLTEKPVRKKQKRKKNLTVLSGSREN 196

Query: 329 KKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNACGVRYRSGRLVPEYRPAASPTF 150
           KK SFQQTD  RKCMHCEV ETPQWREGPMGPKTLCNACGVRYRSGRLVPEYRPAASPTF
Sbjct: 197 KKLSFQQTDAPRKCMHCEVAETPQWREGPMGPKTLCNACGVRYRSGRLVPEYRPAASPTF 256

Query: 149 LPSLHSNSHKRIMEMRNK 96
           +PSLHSNSHKRIMEMRNK
Sbjct: 257 VPSLHSNSHKRIMEMRNK 274


>gb|KHG27547.1| GATA transcription factor 10 -like protein [Gossypium arboreum]
          Length = 343

 Score =  210 bits (535), Expect = 1e-51
 Identities = 122/282 (43%), Positives = 155/282 (54%), Gaps = 11/282 (3%)
 Frame = -3

Query: 866 EDLEANNLGV-EDWKLKFQALEPPPFSSLTDFSE----------VPRSNSISCNIGPQVK 720
           +D+E NN G  E+W   FQ LEPPP + L   S            P+  ++SC+   Q+ 
Sbjct: 55  DDVEHNNSGGGEEWDCNFQNLEPPPANVLAGMSSGFSGDFFNDSSPKCLTVSCDGSSQLN 114

Query: 719 QQQGSTDTFSSRSSFICNNSNDGKYLLLXXXXXXXXXXXXXXXXXXVKHVPINPKLVFXX 540
           Q    T+  S RS  + +   D K  +                      VPINPKL F  
Sbjct: 115 QWSSVTEASSGRSVTLHSEPTDVKGSIRFQTSSPVSVLESSSTCSAANLVPINPKLGFLV 174

Query: 539 XXXXXXXXRPATLNPLFVYHFIPSTSSTSEDYNPSTASESGYEINLTEXXXXXXXXXXKN 360
                   R +  N  F   F  STSSTS+  N S  SES  E +LTE           N
Sbjct: 175 KRGRSKRRRASAFNVHFTLPFTSSTSSTSQGSNSSVGSESESESHLTEEPAKKRLKKKMN 234

Query: 359 LTMLSGPREIKKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNACGVRYRSGRLVP 180
           LT LS   E+KK   +Q   +RKC+HCEVT+TPQWREGPMGPKTLCNACGVR+RSGRL+P
Sbjct: 235 LTWLSDFSEMKKSPSRQPIEVRKCLHCEVTKTPQWREGPMGPKTLCNACGVRFRSGRLLP 294

Query: 179 EYRPAASPTFLPSLHSNSHKRIMEMRNKDGM*TTTLPQVVPL 54
           EYRPAASPTF+P+LHSNSHK+++EMR +  + TT  P ++ +
Sbjct: 295 EYRPAASPTFVPALHSNSHKKVVEMRKQANLPTTGTPPMLSI 336


>ref|XP_012471520.1| PREDICTED: GATA transcription factor 11-like isoform X1 [Gossypium
           raimondii]
          Length = 352

 Score =  210 bits (534), Expect = 1e-51
 Identities = 122/282 (43%), Positives = 154/282 (54%), Gaps = 11/282 (3%)
 Frame = -3

Query: 866 EDLEANNLGV-EDWKLKFQALEPPPFSSLTDFSE----------VPRSNSISCNIGPQVK 720
           +D+E NN G  E+W   FQ LEPPP + L   S            P+  ++SC+   Q+ 
Sbjct: 64  DDVEQNNSGGGEEWDCNFQNLEPPPANVLAGMSSGFSGDFFNDSSPKCLTVSCDGSSQLN 123

Query: 719 QQQGSTDTFSSRSSFICNNSNDGKYLLLXXXXXXXXXXXXXXXXXXVKHVPINPKLVFXX 540
           Q    T+  S RS  + + S D K  +                      VPINPKL F  
Sbjct: 124 QWSSVTEASSGRSVTLHSESTDVKGSIRFQTSSPVSVLESSSTCSAANPVPINPKLGFLV 183

Query: 539 XXXXXXXXRPATLNPLFVYHFIPSTSSTSEDYNPSTASESGYEINLTEXXXXXXXXXXKN 360
                   R +  N  F   F  STSSTS   N S  SES  E +LTE           N
Sbjct: 184 KRGRSKRRRASAFNVHFTLPFTSSTSSTSRGSNSSVGSESESESHLTEEPAKKRLKKKMN 243

Query: 359 LTMLSGPREIKKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNACGVRYRSGRLVP 180
           LT LS   E+KK   +Q   +RKC+HCEVT+TPQWREGPMGPKTLCNACGVR+RSGRL+P
Sbjct: 244 LTWLSDFSEMKKSPSRQPIEVRKCLHCEVTKTPQWREGPMGPKTLCNACGVRFRSGRLLP 303

Query: 179 EYRPAASPTFLPSLHSNSHKRIMEMRNKDGM*TTTLPQVVPL 54
           EYRPAASPTF+P+LHSNSHK+++EMR +  +  T  P ++ +
Sbjct: 304 EYRPAASPTFVPALHSNSHKKVVEMRKQANLPMTGTPPMLSI 345


>ref|XP_012471521.1| PREDICTED: GATA transcription factor 11-like isoform X2 [Gossypium
           raimondii] gi|763752912|gb|KJB20300.1| hypothetical
           protein B456_003G142600 [Gossypium raimondii]
          Length = 343

 Score =  210 bits (534), Expect = 1e-51
 Identities = 122/282 (43%), Positives = 154/282 (54%), Gaps = 11/282 (3%)
 Frame = -3

Query: 866 EDLEANNLGV-EDWKLKFQALEPPPFSSLTDFSE----------VPRSNSISCNIGPQVK 720
           +D+E NN G  E+W   FQ LEPPP + L   S            P+  ++SC+   Q+ 
Sbjct: 55  DDVEQNNSGGGEEWDCNFQNLEPPPANVLAGMSSGFSGDFFNDSSPKCLTVSCDGSSQLN 114

Query: 719 QQQGSTDTFSSRSSFICNNSNDGKYLLLXXXXXXXXXXXXXXXXXXVKHVPINPKLVFXX 540
           Q    T+  S RS  + + S D K  +                      VPINPKL F  
Sbjct: 115 QWSSVTEASSGRSVTLHSESTDVKGSIRFQTSSPVSVLESSSTCSAANPVPINPKLGFLV 174

Query: 539 XXXXXXXXRPATLNPLFVYHFIPSTSSTSEDYNPSTASESGYEINLTEXXXXXXXXXXKN 360
                   R +  N  F   F  STSSTS   N S  SES  E +LTE           N
Sbjct: 175 KRGRSKRRRASAFNVHFTLPFTSSTSSTSRGSNSSVGSESESESHLTEEPAKKRLKKKMN 234

Query: 359 LTMLSGPREIKKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNACGVRYRSGRLVP 180
           LT LS   E+KK   +Q   +RKC+HCEVT+TPQWREGPMGPKTLCNACGVR+RSGRL+P
Sbjct: 235 LTWLSDFSEMKKSPSRQPIEVRKCLHCEVTKTPQWREGPMGPKTLCNACGVRFRSGRLLP 294

Query: 179 EYRPAASPTFLPSLHSNSHKRIMEMRNKDGM*TTTLPQVVPL 54
           EYRPAASPTF+P+LHSNSHK+++EMR +  +  T  P ++ +
Sbjct: 295 EYRPAASPTFVPALHSNSHKKVVEMRKQANLPMTGTPPMLSI 336


>ref|XP_007016761.1| GATA zinc finger protein regulating nitrogen assimilation, putative
           [Theobroma cacao] gi|508787124|gb|EOY34380.1| GATA zinc
           finger protein regulating nitrogen assimilation,
           putative [Theobroma cacao]
          Length = 342

 Score =  207 bits (526), Expect = 1e-50
 Identities = 121/273 (44%), Positives = 150/273 (54%), Gaps = 10/273 (3%)
 Frame = -3

Query: 842 GVEDWKLKFQALEPPPFSSLTDFSE----------VPRSNSISCNIGPQVKQQQGSTDTF 693
           G E+W   FQ LEPPP + L   S           + ++ ++SC+   Q  QQ  +T   
Sbjct: 63  GGEEWDCNFQNLEPPPANVLAGLSSGFYGDFFGDNLAKNLTVSCDGSSQPNQQTSTTKAS 122

Query: 692 SSRSSFICNNSNDGKYLLLXXXXXXXXXXXXXXXXXXVKHVPINPKLVFXXXXXXXXXXR 513
           SSRS  + + S D K                          PI+P L F          R
Sbjct: 123 SSRSITLNSESADLKGSNRFQTSSPVSVLESSSSCSAANPTPIDPNLSFPVKRSRSKRRR 182

Query: 512 PATLNPLFVYHFIPSTSSTSEDYNPSTASESGYEINLTEXXXXXXXXXXKNLTMLSGPRE 333
            +T N      FI STSSTS   N    SES  E +LTE          +NLT+LSG  E
Sbjct: 183 VSTFNLHVSLPFISSTSSTSRGSNSLVGSESESESHLTEKSAKKRQKKKRNLTLLSGSSE 242

Query: 332 IKKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNACGVRYRSGRLVPEYRPAASPT 153
           IKK   QQ   +RKCMHCEVT+TPQWREGPMGPKTLCNACGVRYRSGRL+PEYRPAASPT
Sbjct: 243 IKKSPSQQPVVVRKCMHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLLPEYRPAASPT 302

Query: 152 FLPSLHSNSHKRIMEMRNKDGM*TTTLPQVVPL 54
           F+ SLHSNSHK+++EMR K  +  + +P ++ +
Sbjct: 303 FVSSLHSNSHKKVVEMRKKAKLPISVMPSMLSI 335


>ref|XP_012443598.1| PREDICTED: GATA transcription factor 11-like [Gossypium raimondii]
           gi|763789143|gb|KJB56139.1| hypothetical protein
           B456_009G107900 [Gossypium raimondii]
           gi|763789144|gb|KJB56140.1| hypothetical protein
           B456_009G107900 [Gossypium raimondii]
          Length = 329

 Score =  191 bits (485), Expect = 6e-46
 Identities = 117/266 (43%), Positives = 139/266 (52%), Gaps = 11/266 (4%)
 Frame = -3

Query: 860 LEANNLGVEDWKLKFQALEPPPFSSLTDFSE----------VPRSNSISCNIGPQVKQQQ 711
           +E NN G EDW   F+ LEPPP + L   S           + ++ + SC+   Q+ Q  
Sbjct: 53  VEENNDGGEDWDCDFENLEPPPTNVLASLSSGFYGDFFSDSLAQNLTDSCDGSSQLNQLS 112

Query: 710 GSTDTFSSRSSFICNNSNDGKYLLLXXXXXXXXXXXXXXXXXXVKHVPINPKLVFXXXXX 531
            +T           ++  D K                          PI+PKL F     
Sbjct: 113 STTSITPH------SDCTDVKGSTWFQTSSPVSVLESSSPCSAANPTPIDPKLSFLVKKR 166

Query: 530 XXXXXRPA-TLNPLFVYHFIPSTSSTSEDYNPSTASESGYEINLTEXXXXXXXXXXKNLT 354
                RPA T N  F++  I STSS S   N    SES  E N TE          KNLT
Sbjct: 167 GRSKRRPASTFNQQFIFSSISSTSSASRGTNYVVGSESESENNPTEKPAKKRQKKKKNLT 226

Query: 353 MLSGPREIKKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNACGVRYRSGRLVPEY 174
           +LSG  E KKP   Q   + KC HCEVTETPQWREGPMGPKTLCNACGVRYRSGRL PEY
Sbjct: 227 LLSGCNETKKPPSLQPIVIMKCTHCEVTETPQWREGPMGPKTLCNACGVRYRSGRLFPEY 286

Query: 173 RPAASPTFLPSLHSNSHKRIMEMRNK 96
           RPAASPTF+ S+HSNSHK+++EMR K
Sbjct: 287 RPAASPTFVASMHSNSHKKVVEMRKK 312


>gb|KHG05024.1| GATA transcription factor 11 -like protein [Gossypium arboreum]
          Length = 347

 Score =  191 bits (485), Expect = 6e-46
 Identities = 125/292 (42%), Positives = 152/292 (52%), Gaps = 25/292 (8%)
 Frame = -3

Query: 860 LEANNLGVEDWKLKFQALEPPPFSSLTDFSEVPR----SNSISCNIG------------- 732
           +E NN G EDW   F+ LEPPP + L   S        S+S++ N+              
Sbjct: 53  VEENNDGGEDWDCDFENLEPPPTNVLASLSSGFYGDFFSDSLAQNLTDSPLSAVAHSVKD 112

Query: 731 ----PQVKQQQGST--DTFSSRSSFICNNS-NDGKYLLLXXXXXXXXXXXXXXXXXXVKH 573
               P +KQ  GS+  +  SS +S    +   D K                         
Sbjct: 113 ALPEPWLKQCDGSSQLNQLSSTTSITPRSDCTDVKGSTWFQTSSPVSVLESSSSCSAANP 172

Query: 572 VPINPKLVFXXXXXXXXXXRPA-TLNPLFVYHFIPSTSSTSEDYNPSTASESGYEINLTE 396
            PI+PKL F          RPA T N  F++  I STSS S   N    SES  E N TE
Sbjct: 173 TPIDPKLSFLVKKRGRSKRRPASTFNRQFIFPSISSTSSASRGTNYVVGSESESENNPTE 232

Query: 395 XXXXXXXXXXKNLTMLSGPREIKKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNA 216
                     KNLT+LSG  E KKP + Q   + KC HCEVTETPQWREGPMGPKTLCNA
Sbjct: 233 KPAKKRQKKKKNLTLLSGCNETKKPPYLQPVVIMKCTHCEVTETPQWREGPMGPKTLCNA 292

Query: 215 CGVRYRSGRLVPEYRPAASPTFLPSLHSNSHKRIMEMRNKDGM*TTTLPQVV 60
           CGVRYRSGRL PEYRPAASPTF+ S+HSNSHK+++EMR K  +  +++P  V
Sbjct: 293 CGVRYRSGRLFPEYRPAASPTFVASMHSNSHKKVVEMRKKAKLPLSSIPPKV 344


>gb|KJB26178.1| hypothetical protein B456_004G230100 [Gossypium raimondii]
          Length = 346

 Score =  189 bits (479), Expect = 3e-45
 Identities = 117/260 (45%), Positives = 136/260 (52%), Gaps = 11/260 (4%)
 Frame = -3

Query: 842 GVEDWKLKFQALEPPPFSSLTDFSE----------VPRSNSISCNIGPQVKQQQGSTDTF 693
           G EDW   FQ LEPPP + L   S           +     +SC+   ++ Q  G T   
Sbjct: 71  GGEDWDCNFQNLEPPPANVLAGLSSGFCGDFFSDTLENKFPVSCDESSELNQLSGITKAS 130

Query: 692 SSRSSFICNNSN-DGKYLLLXXXXXXXXXXXXXXXXXXVKHVPINPKLVFXXXXXXXXXX 516
           SSRS+   N  + D K                       K  PINPKL F          
Sbjct: 131 SSRSTTTLNGESVDVKGSTWFQTSSPVSVLETSSACSSAKPAPINPKLSFLVKRDRSKRR 190

Query: 515 RPATLNPLFVYHFIPSTSSTSEDYNPSTASESGYEINLTEXXXXXXXXXXKNLTMLSGPR 336
           R +T N  F    I ST STS        SES  E +LTE           NLT LSG  
Sbjct: 191 RASTFNLQFTLPSISSTGSTS-----LVGSESESESHLTEKSAKKRQKKK-NLTWLSGSC 244

Query: 335 EIKKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNACGVRYRSGRLVPEYRPAASP 156
           EI K S ++   +RKC HCEVT+TPQWREGP GPKTLCNACGVRYRSGRL PEYRPAASP
Sbjct: 245 EIDKSSSEEAAVVRKCTHCEVTKTPQWREGPKGPKTLCNACGVRYRSGRLFPEYRPAASP 304

Query: 155 TFLPSLHSNSHKRIMEMRNK 96
           TF+PSLHSNSHK+++EMR +
Sbjct: 305 TFVPSLHSNSHKKVIEMRKQ 324


>gb|KJB20301.1| hypothetical protein B456_003G142600 [Gossypium raimondii]
          Length = 248

 Score =  186 bits (471), Expect = 3e-44
 Identities = 104/230 (45%), Positives = 129/230 (56%)
 Frame = -3

Query: 743 CNIGPQVKQQQGSTDTFSSRSSFICNNSNDGKYLLLXXXXXXXXXXXXXXXXXXVKHVPI 564
           C+   Q+ Q    T+  S RS  + + S D K  +                      VPI
Sbjct: 12  CDGSSQLNQWSSVTEASSGRSVTLHSESTDVKGSIRFQTSSPVSVLESSSTCSAANPVPI 71

Query: 563 NPKLVFXXXXXXXXXXRPATLNPLFVYHFIPSTSSTSEDYNPSTASESGYEINLTEXXXX 384
           NPKL F          R +  N  F   F  STSSTS   N S  SES  E +LTE    
Sbjct: 72  NPKLGFLVKRGRSKRRRASAFNVHFTLPFTSSTSSTSRGSNSSVGSESESESHLTEEPAK 131

Query: 383 XXXXXXKNLTMLSGPREIKKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNACGVR 204
                  NLT LS   E+KK   +Q   +RKC+HCEVT+TPQWREGPMGPKTLCNACGVR
Sbjct: 132 KRLKKKMNLTWLSDFSEMKKSPSRQPIEVRKCLHCEVTKTPQWREGPMGPKTLCNACGVR 191

Query: 203 YRSGRLVPEYRPAASPTFLPSLHSNSHKRIMEMRNKDGM*TTTLPQVVPL 54
           +RSGRL+PEYRPAASPTF+P+LHSNSHK+++EMR +  +  T  P ++ +
Sbjct: 192 FRSGRLLPEYRPAASPTFVPALHSNSHKKVVEMRKQANLPMTGTPPMLSI 241


>gb|KJB56141.1| hypothetical protein B456_009G107900 [Gossypium raimondii]
          Length = 234

 Score =  178 bits (451), Expect = 5e-42
 Identities = 95/159 (59%), Positives = 105/159 (66%), Gaps = 1/159 (0%)
 Frame = -3

Query: 569 PINPKLVFXXXXXXXXXXRPA-TLNPLFVYHFIPSTSSTSEDYNPSTASESGYEINLTEX 393
           PI+PKL F          RPA T N  F++  I STSS S   N    SES  E N TE 
Sbjct: 59  PIDPKLSFLVKKRGRSKRRPASTFNQQFIFSSISSTSSASRGTNYVVGSESESENNPTEK 118

Query: 392 XXXXXXXXXKNLTMLSGPREIKKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNAC 213
                    KNLT+LSG  E KKP   Q   + KC HCEVTETPQWREGPMGPKTLCNAC
Sbjct: 119 PAKKRQKKKKNLTLLSGCNETKKPPSLQPIVIMKCTHCEVTETPQWREGPMGPKTLCNAC 178

Query: 212 GVRYRSGRLVPEYRPAASPTFLPSLHSNSHKRIMEMRNK 96
           GVRYRSGRL PEYRPAASPTF+ S+HSNSHK+++EMR K
Sbjct: 179 GVRYRSGRLFPEYRPAASPTFVASMHSNSHKKVVEMRKK 217


>gb|KJB56138.1| hypothetical protein B456_009G107900 [Gossypium raimondii]
          Length = 276

 Score =  178 bits (451), Expect = 5e-42
 Identities = 95/159 (59%), Positives = 105/159 (66%), Gaps = 1/159 (0%)
 Frame = -3

Query: 569 PINPKLVFXXXXXXXXXXRPA-TLNPLFVYHFIPSTSSTSEDYNPSTASESGYEINLTEX 393
           PI+PKL F          RPA T N  F++  I STSS S   N    SES  E N TE 
Sbjct: 101 PIDPKLSFLVKKRGRSKRRPASTFNQQFIFSSISSTSSASRGTNYVVGSESESENNPTEK 160

Query: 392 XXXXXXXXXKNLTMLSGPREIKKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNAC 213
                    KNLT+LSG  E KKP   Q   + KC HCEVTETPQWREGPMGPKTLCNAC
Sbjct: 161 PAKKRQKKKKNLTLLSGCNETKKPPSLQPIVIMKCTHCEVTETPQWREGPMGPKTLCNAC 220

Query: 212 GVRYRSGRLVPEYRPAASPTFLPSLHSNSHKRIMEMRNK 96
           GVRYRSGRL PEYRPAASPTF+ S+HSNSHK+++EMR K
Sbjct: 221 GVRYRSGRLFPEYRPAASPTFVASMHSNSHKKVVEMRKK 259


>ref|XP_012064808.1| PREDICTED: GATA transcription factor 11 [Jatropha curcas]
           gi|643738052|gb|KDP44040.1| hypothetical protein
           JCGZ_05507 [Jatropha curcas]
          Length = 324

 Score =  176 bits (445), Expect = 3e-41
 Identities = 115/277 (41%), Positives = 148/277 (53%), Gaps = 12/277 (4%)
 Frame = -3

Query: 869 LEDLEANNLGVEDWKLKFQALEPPPFSSLT--------DFSEVPRSNSISCNIGPQVKQQ 714
           LE++E N+LG EDW++KFQ L PPP + L         D  +V +S+S+  +   Q KQ 
Sbjct: 36  LENVELNDLG-EDWEVKFQQLVPPPSNVLAGLCGENGNDILKVKKSSSVLHDESSQPKQW 94

Query: 713 QGSTDTFSSRSSFICNNSNDGKYLLLXXXXXXXXXXXXXXXXXXVKHVPI-NPKLVFXXX 537
             + +  S R   +  +S++ KY  L                   ++  + +PK V    
Sbjct: 95  PSTAEASSGRGIPLNYDSSEAKYSRLFWTSSPVSVLESSSSSSSAENAVVQHPKFVIPVK 154

Query: 536 XXXXXXXRPATLNPLFVYHFIPSTSSTSEDYNPSTASESGYEINLTEXXXXXXXXXXK-- 363
                   P +  P       P  S   + Y+P   SES  E    E          K  
Sbjct: 155 RPRSKR--PRSKRPHPHRRTFPFISYAPKQYHP-LGSESETESCPDEKMLNVAKRKQKKK 211

Query: 362 -NLTMLSGPREIKKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNACGVRYRSGRL 186
            NL +LS   E+KK S QQ   +RKC HCEVT+TPQWREGPMGPKTLCNACGVRYRSGRL
Sbjct: 212 RNLMLLSCTVEMKKSSTQQPIEIRKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRL 271

Query: 185 VPEYRPAASPTFLPSLHSNSHKRIMEMRNKDGM*TTT 75
            PEYRPAASPTF+PSLHSNSH++++EMR K     TT
Sbjct: 272 FPEYRPAASPTFVPSLHSNSHRKVVEMRKKTTRPMTT 308


>gb|KHG08445.1| GATA transcription factor 11 -like protein [Gossypium arboreum]
          Length = 390

 Score =  167 bits (424), Expect = 7e-39
 Identities = 102/220 (46%), Positives = 119/220 (54%), Gaps = 1/220 (0%)
 Frame = -3

Query: 752 SISCNIGPQVKQQQGSTDTFSSRSSFICNNSN-DGKYLLLXXXXXXXXXXXXXXXXXXVK 576
           S  C+   ++ Q  G T   SSRS+   N  + D K                       K
Sbjct: 153 SKQCDESSELNQLSGITKASSSRSTTTLNGESVDVKGSTWFQTSSPVSVLETSSACSSAK 212

Query: 575 HVPINPKLVFXXXXXXXXXXRPATLNPLFVYHFIPSTSSTSEDYNPSTASESGYEINLTE 396
             PINPKL F          R +T N  F   FI ST STS        SES  E +LTE
Sbjct: 213 PAPINPKLSFLVKRDRSKRRRASTFNLQFTLPFISSTGSTS-----LVGSESESESHLTE 267

Query: 395 XXXXXXXXXXKNLTMLSGPREIKKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNA 216
                      NLT LSG  E  + S ++   +RKC HCEVT+TPQWREGP GPKTLCNA
Sbjct: 268 KSAKKRQKKK-NLTWLSGSSENDRSSSEEAAVVRKCTHCEVTKTPQWREGPKGPKTLCNA 326

Query: 215 CGVRYRSGRLVPEYRPAASPTFLPSLHSNSHKRIMEMRNK 96
           CGVRYRSGRL PEYRPAASPTF+ SLHSNSHK+++EMR +
Sbjct: 327 CGVRYRSGRLFPEYRPAASPTFVASLHSNSHKKVIEMRKQ 366


>ref|XP_010031984.1| PREDICTED: GATA transcription factor 10-like isoform X2 [Eucalyptus
           grandis]
          Length = 318

 Score =  165 bits (417), Expect = 5e-38
 Identities = 112/267 (41%), Positives = 132/267 (49%), Gaps = 11/267 (4%)
 Frame = -3

Query: 869 LEDLEANNLGVEDWKLKFQALEPPPFSSLTDFSEV----------PRSNSISCNIGP-QV 723
           LED+E N L  +DW   FQ LE P    L  F                 S+S N    +V
Sbjct: 36  LEDVEGNILS-DDWDASFQLLELPSSDVLEGFPSSYCGKKCEDISAPDRSLSRNEDKAKV 94

Query: 722 KQQQGSTDTFSSRSSFICNNSNDGKYLLLXXXXXXXXXXXXXXXXXXVKHVPINPKLVFX 543
           K     T+T S+RS      SND    L                      +  + K +  
Sbjct: 95  KASPSYTETNSNRSICHHQTSNDKDSSL-----------ENSSSGSIENSLLFSSKSLIP 143

Query: 542 XXXXXXXXXRPATLNPLFVYHFIPSTSSTSEDYNPSTASESGYEINLTEXXXXXXXXXXK 363
                    RP T    F + F  S SS S  Y+P  AS S  E   T+          K
Sbjct: 144 VKRPRSKRLRPQTPKTRFNFPFASSASSASGIYHPLAASRSSSEGRHTKKSSGKRKGKEK 203

Query: 362 NLTMLSGPREIKKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNACGVRYRSGRLV 183
           NL +LSG  E+   S Q     RKCMHCEVT+TPQWREGPMGPKTLCNACGVRYRSGRL 
Sbjct: 204 NLHLLSGAEEMNSAS-QNPSATRKCMHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLF 262

Query: 182 PEYRPAASPTFLPSLHSNSHKRIMEMR 102
           PEYRPAASPTF+ SLHSNSH++++EMR
Sbjct: 263 PEYRPAASPTFISSLHSNSHRKVIEMR 289


>ref|XP_010031983.1| PREDICTED: GATA transcription factor 10-like isoform X1 [Eucalyptus
           grandis] gi|629085021|gb|KCW51378.1| hypothetical
           protein EUGRSUZ_J00922 [Eucalyptus grandis]
          Length = 319

 Score =  165 bits (417), Expect = 5e-38
 Identities = 112/267 (41%), Positives = 132/267 (49%), Gaps = 11/267 (4%)
 Frame = -3

Query: 869 LEDLEANNLGVEDWKLKFQALEPPPFSSLTDFSEV----------PRSNSISCNIGP-QV 723
           LED+E N L  +DW   FQ LE P    L  F                 S+S N    +V
Sbjct: 37  LEDVEGNILS-DDWDASFQLLELPSSDVLEGFPSSYCGKKCEDISAPDRSLSRNEDKAKV 95

Query: 722 KQQQGSTDTFSSRSSFICNNSNDGKYLLLXXXXXXXXXXXXXXXXXXVKHVPINPKLVFX 543
           K     T+T S+RS      SND    L                      +  + K +  
Sbjct: 96  KASPSYTETNSNRSICHHQTSNDKDSSL-----------ENSSSGSIENSLLFSSKSLIP 144

Query: 542 XXXXXXXXXRPATLNPLFVYHFIPSTSSTSEDYNPSTASESGYEINLTEXXXXXXXXXXK 363
                    RP T    F + F  S SS S  Y+P  AS S  E   T+          K
Sbjct: 145 VKRPRSKRLRPQTPKTRFNFPFASSASSASGIYHPLAASRSSSEGRHTKKSSGKRKGKEK 204

Query: 362 NLTMLSGPREIKKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNACGVRYRSGRLV 183
           NL +LSG  E+   S Q     RKCMHCEVT+TPQWREGPMGPKTLCNACGVRYRSGRL 
Sbjct: 205 NLHLLSGAEEMNSAS-QNPSATRKCMHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLF 263

Query: 182 PEYRPAASPTFLPSLHSNSHKRIMEMR 102
           PEYRPAASPTF+ SLHSNSH++++EMR
Sbjct: 264 PEYRPAASPTFISSLHSNSHRKVIEMR 290


>ref|XP_011040891.1| PREDICTED: GATA transcription factor 11-like isoform X1 [Populus
           euphratica]
          Length = 320

 Score =  163 bits (413), Expect = 1e-37
 Identities = 111/292 (38%), Positives = 147/292 (50%), Gaps = 16/292 (5%)
 Frame = -3

Query: 869 LEDLEANNLGVEDWKLKFQALEPPPFSSLTDFSEV----------PRSNSISCNIGP--Q 726
           LED+E N    EDW+ KF+ LEPP    LT FS            P  NS S  +    Q
Sbjct: 34  LEDVEPNGDDGEDWESKFRHLEPPSSHLLTTFSTALCGEDASSLEPNYNSCSVLLDGSLQ 93

Query: 725 VKQQQGSTDTFSSRSSFICNNSNDGKYLLLXXXXXXXXXXXXXXXXXXVKHVPI-NPKLV 549
           +K    S +  SSRS      S+D KY  L                   ++     PK V
Sbjct: 94  LKHWASSAEASSSRSKPNLCRSSDSKYSHLFQATSPVSVLESSGSSCPTENATTCYPKFV 153

Query: 548 FXXXXXXXXXXRPATLNPLFVYHFIPSTSSTSEDY-NPSTASESGYEINLTEXXXXXXXX 372
                      RP        + FIP+  ++ + Y + S+  E  Y  +           
Sbjct: 154 TPVKRPRSKLTRPRR----HTFPFIPTACASKKFYCSASSDPELEYYNDEEILDSSRKTQ 209

Query: 371 XXKNLTMLSGPREIKKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNACGVRYRSG 192
             +NL +LS   E+     Q  +T R+C HC+VT+TPQWREGP+GPKTLCNACGVRYRSG
Sbjct: 210 KKRNLMLLSSAVEMAPKKKQPVET-RRCTHCQVTKTPQWREGPLGPKTLCNACGVRYRSG 268

Query: 191 RLVPEYRPAASPTFLPSLHSNSHKRIMEMRNKDGM*T--TTLPQVVPLGDDH 42
           RL+PEYRPAASPTF+P LHSNSH++++EMR +    T   T+  +VP+  +H
Sbjct: 269 RLLPEYRPAASPTFVPFLHSNSHRKVLEMRKQTEQVTPMATMDVLVPMAPEH 320


>ref|XP_011040892.1| PREDICTED: GATA transcription factor 11-like isoform X2 [Populus
           euphratica]
          Length = 314

 Score =  162 bits (409), Expect = 4e-37
 Identities = 109/288 (37%), Positives = 146/288 (50%), Gaps = 12/288 (4%)
 Frame = -3

Query: 869 LEDLEANNLGVEDWKLKFQALEPPPFSSLTDFSEVP--------RSNSISCNIGPQVKQQ 714
           LED+E N    EDW+ KF+ LEPP    LT FS             N  SC++   +K  
Sbjct: 34  LEDVEPNGDDGEDWESKFRHLEPPSSHLLTTFSTALCGEDASSLEPNYNSCSV--LLKHW 91

Query: 713 QGSTDTFSSRSSFICNNSNDGKYLLLXXXXXXXXXXXXXXXXXXVKHVPI-NPKLVFXXX 537
             S +  SSRS      S+D KY  L                   ++     PK V    
Sbjct: 92  ASSAEASSSRSKPNLCRSSDSKYSHLFQATSPVSVLESSGSSCPTENATTCYPKFVTPVK 151

Query: 536 XXXXXXXRPATLNPLFVYHFIPSTSSTSEDY-NPSTASESGYEINLTEXXXXXXXXXXKN 360
                  RP        + FIP+  ++ + Y + S+  E  Y  +             +N
Sbjct: 152 RPRSKLTRPRR----HTFPFIPTACASKKFYCSASSDPELEYYNDEEILDSSRKTQKKRN 207

Query: 359 LTMLSGPREIKKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNACGVRYRSGRLVP 180
           L +LS   E+     Q  +T R+C HC+VT+TPQWREGP+GPKTLCNACGVRYRSGRL+P
Sbjct: 208 LMLLSSAVEMAPKKKQPVET-RRCTHCQVTKTPQWREGPLGPKTLCNACGVRYRSGRLLP 266

Query: 179 EYRPAASPTFLPSLHSNSHKRIMEMRNKDGM*T--TTLPQVVPLGDDH 42
           EYRPAASPTF+P LHSNSH++++EMR +    T   T+  +VP+  +H
Sbjct: 267 EYRPAASPTFVPFLHSNSHRKVLEMRKQTEQVTPMATMDVLVPMAPEH 314


>gb|KHG09661.1| GATA transcription factor 11 -like protein [Gossypium arboreum]
          Length = 314

 Score =  159 bits (403), Expect = 2e-36
 Identities = 86/159 (54%), Positives = 102/159 (64%)
 Frame = -3

Query: 572 VPINPKLVFXXXXXXXXXXRPATLNPLFVYHFIPSTSSTSEDYNPSTASESGYEINLTEX 393
           +PINP L F          R +TLN  F   F+ ST STS+ +     S+S    +L++ 
Sbjct: 133 IPINPNLSFQGKRCRTNRRRASTLNSPFTLPFVSSTFSTSQRFYFPVGSKSESVSHLSQK 192

Query: 392 XXXXXXXXXKNLTMLSGPREIKKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNAC 213
                     NL +LSG  E    S QQ   +RKC HCEVTETPQWR+GPMGPKTLCNAC
Sbjct: 193 PTKKRLKKK-NLKLLSGFSESNNSSSQQL-VIRKCGHCEVTETPQWRKGPMGPKTLCNAC 250

Query: 212 GVRYRSGRLVPEYRPAASPTFLPSLHSNSHKRIMEMRNK 96
           GVRYRSGRL PEYRPA SPTF+ SLHSNSHK+++EMR K
Sbjct: 251 GVRYRSGRLFPEYRPAGSPTFIASLHSNSHKKVVEMRRK 289


>ref|XP_002529940.1| GATA transcription factor, putative [Ricinus communis]
           gi|223530570|gb|EEF32448.1| GATA transcription factor,
           putative [Ricinus communis]
          Length = 323

 Score =  156 bits (394), Expect = 2e-35
 Identities = 105/277 (37%), Positives = 140/277 (50%), Gaps = 22/277 (7%)
 Frame = -3

Query: 866 EDLEANNLGVEDWKLKFQALEPPPFSSLTDFSE-----------VPRSNSISCNIGPQVK 720
           ED+E+N+  +EDW+ +FQ L P P + L DF+                +S+SC+   Q K
Sbjct: 31  EDVESND-AIEDWESQFQQL-PTPSNILADFTSGICDQISKDSLKLEKSSVSCDESSQPK 88

Query: 719 QQQGSTDTFSSRSSFICNNSNDGKYLLLXXXXXXXXXXXXXXXXXXVKHVPI-NPKLVFX 543
               + +  SSR+  +  + ++GKY  L                   ++  + +PK  F 
Sbjct: 89  PWLRAAEAPSSRNIPLNYDPSEGKYSHLFWTSSPVSVLESSSSSSSAENSTVYHPK--FA 146

Query: 542 XXXXXXXXXRPATLNPLFVYHFIPSTSSTSEDYNPSTASESGYEINLTEXXXXXXXXXXK 363
                     P      F +     ++S +   NP   SES  E                
Sbjct: 147 KPVKRPRSKCPRRRRCTFPF----LSTSYAPKNNPLGGSESESESESESESESNPDEKML 202

Query: 362 NLT----------MLSGPREIKKPSFQQTDTLRKCMHCEVTETPQWREGPMGPKTLCNAC 213
           NL           MLS   E KKPS +    +RKC HCEVT+TPQWREGPMGPKTLCNAC
Sbjct: 203 NLAKKIQKKKDLMMLSCTVEKKKPSSEVPGEIRKCTHCEVTKTPQWREGPMGPKTLCNAC 262

Query: 212 GVRYRSGRLVPEYRPAASPTFLPSLHSNSHKRIMEMR 102
           GVRYRSGRL PEYRPAASPTF+P+LHSNSH++++EMR
Sbjct: 263 GVRYRSGRLFPEYRPAASPTFVPALHSNSHRKVIEMR 299

