BLASTX nr result

ID: Aconitum23_contig00029104

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Aconitum23_contig00029104
         (1180 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010262144.1| PREDICTED: putative GATA transcription facto...   189   3e-45
ref|XP_010656197.1| PREDICTED: GATA transcription factor 21 isof...   184   1e-43
ref|XP_002282173.1| PREDICTED: GATA transcription factor 21 isof...   184   1e-43
ref|XP_010242203.1| PREDICTED: GATA transcription factor 21-like...   178   9e-42
ref|XP_007012281.1| GATA type zinc finger transcription factor f...   159   4e-36
ref|XP_007012845.1| GATA type zinc finger transcription factor f...   158   1e-35
gb|KHG09089.1| Putative GATA transcription factor 22 -like prote...   155   5e-35
ref|XP_010090039.1| Putative GATA transcription factor 22 [Morus...   155   6e-35
ref|XP_006450838.1| hypothetical protein CICLE_v10008968mg [Citr...   154   1e-34
ref|XP_006451458.1| hypothetical protein CICLE_v10009004mg [Citr...   152   4e-34
ref|XP_006450839.1| hypothetical protein CICLE_v10008968mg [Citr...   152   5e-34
ref|XP_006600457.1| PREDICTED: GATA transcription factor 21-like...   150   2e-33
ref|XP_003550634.1| PREDICTED: GATA transcription factor 21-like...   150   2e-33
emb|CDP03165.1| unnamed protein product [Coffea canephora]            150   2e-33
ref|XP_010942001.1| PREDICTED: putative GATA transcription facto...   150   3e-33
ref|XP_002514107.1| hypothetical protein RCOM_1046780 [Ricinus c...   149   3e-33
gb|KDO57760.1| hypothetical protein CISIN_1g021859mg [Citrus sin...   149   4e-33
ref|NP_001280882.1| transcription factor GATA-5 [Malus domestica...   149   4e-33
ref|XP_013458498.1| GATA type zinc finger transcription factor f...   148   8e-33
gb|KOM29610.1| hypothetical protein LR48_Vigan728s003300 [Vigna ...   148   1e-32

>ref|XP_010262144.1| PREDICTED: putative GATA transcription factor 22 [Nelumbo nucifera]
          Length = 316

 Score =  189 bits (481), Expect = 3e-45
 Identities = 115/260 (44%), Positives = 149/260 (57%), Gaps = 24/260 (9%)
 Frame = -1

Query: 922 EQFKQYQPSQEVVDNVSQGGSSDHQSVSPPSLSTLESG----------SEYGDQNKSHST 773
           E   + Q  Q+      +G    H     P L + ++G           E  D+++S+ST
Sbjct: 63  EAHDREQQEQQQRQEADKGSEGYHLDFPYPPLQSSKNGINSSLELSIKQEIRDESQSNST 122

Query: 772 PSEKGSGSWMSSKMRLMKKMINSD--GSNDALKPIRKRFW--FQDP---KLSLSGDNLAS 614
               GS  WMSSKMRLM+KM+NSD  G++       ++F    Q P   ++  S  N +S
Sbjct: 123 ----GSARWMSSKMRLMRKMMNSDRMGADKPASGNTQKFQDHHQQPSSLEMDSSSSNSSS 178

Query: 613 NNN--IIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGILTT 440
           NN+   +R CSDCNTT+TPLWRSGP+GPKSLCNACGI              + T  +L  
Sbjct: 179 NNSNITVRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAASGT--LLPA 236

Query: 439 DTPSVASKVHCSKKKSEKGHVAQHKKQCK-----SKTNNIGFEDFIVNLSNNLVLHRVFP 275
           DTPS+  KVH  +K+SE G+V Q+KK+CK          + FEDF +NLS N   HRVFP
Sbjct: 237 DTPSLQRKVHHKEKRSETGYVPQYKKRCKLAPSPRSRKKLCFEDFTINLSKNSAFHRVFP 296

Query: 274 QDETEAAILLMALSCGLVNG 215
           QDE EAAILLMALSCGLV+G
Sbjct: 297 QDEKEAAILLMALSCGLVHG 316


>ref|XP_010656197.1| PREDICTED: GATA transcription factor 21 isoform X1 [Vitis vinifera]
          Length = 310

 Score =  184 bits (468), Expect = 1e-43
 Identities = 120/260 (46%), Positives = 147/260 (56%), Gaps = 28/260 (10%)
 Frame = -1

Query: 910 QYQPSQEVVDN--VSQGGSSDHQSVSPPSLSTLESGSEYG---------DQNKSHSTPSE 764
           Q QP QEV  +  V +GGS DH         TLES S+ G         D+N++HS   E
Sbjct: 64  QAQPQQEVAHDKFVFRGGSYDHP--------TLESESDNGLKLTIWKTEDRNENHS---E 112

Query: 763 KGSGSWMSSKMRLMKKMINSDGSNDALKPIRKRFWFQDPKL----------SLSGDNLAS 614
            GS  WMSSKMR+M+KM+ SD +  A KP      F D K           S++  N+ S
Sbjct: 113 NGSVKWMSSKMRVMQKMMISDQTG-AQKPSNTALNFGDHKQQSLPSETDYNSINSSNINS 171

Query: 613 NNNIIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGILTTDT 434
           NN I R C+DCNTT+TPLWRSGP+GPKSLCNACGI               +   IL T+T
Sbjct: 172 NNTI-RVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAATANGTILPTNT 230

Query: 433 PSVASKVHCSKKKSEKGHVAQHKKQCK------SKTNNIGFEDFIVNLSNNLVLHRVFPQ 272
               +K     KKS  GHV+ +KK+CK       +T  + FEDF ++LS N   HRVF Q
Sbjct: 231 APTKTKAKHKDKKSSNGHVSHYKKRCKLAAAPSCETKKLCFEDFTISLSKNSAFHRVFLQ 290

Query: 271 DE-TEAAILLMALSCGLVNG 215
           DE  EAAILLMALSCGLV+G
Sbjct: 291 DEIKEAAILLMALSCGLVHG 310


>ref|XP_002282173.1| PREDICTED: GATA transcription factor 21 isoform X2 [Vitis vinifera]
           gi|297738668|emb|CBI27913.3| unnamed protein product
           [Vitis vinifera]
          Length = 309

 Score =  184 bits (468), Expect = 1e-43
 Identities = 120/259 (46%), Positives = 146/259 (56%), Gaps = 27/259 (10%)
 Frame = -1

Query: 910 QYQPSQEVVDN-VSQGGSSDHQSVSPPSLSTLESGSEYG---------DQNKSHSTPSEK 761
           Q QP QE  D  V +GGS DH         TLES S+ G         D+N++HS   E 
Sbjct: 64  QAQPQQEAHDKFVFRGGSYDHP--------TLESESDNGLKLTIWKTEDRNENHS---EN 112

Query: 760 GSGSWMSSKMRLMKKMINSDGSNDALKPIRKRFWFQDPKL----------SLSGDNLASN 611
           GS  WMSSKMR+M+KM+ SD +  A KP      F D K           S++  N+ SN
Sbjct: 113 GSVKWMSSKMRVMQKMMISDQTG-AQKPSNTALNFGDHKQQSLPSETDYNSINSSNINSN 171

Query: 610 NNIIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGILTTDTP 431
           N I R C+DCNTT+TPLWRSGP+GPKSLCNACGI               +   IL T+T 
Sbjct: 172 NTI-RVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAATANGTILPTNTA 230

Query: 430 SVASKVHCSKKKSEKGHVAQHKKQCK------SKTNNIGFEDFIVNLSNNLVLHRVFPQD 269
              +K     KKS  GHV+ +KK+CK       +T  + FEDF ++LS N   HRVF QD
Sbjct: 231 PTKTKAKHKDKKSSNGHVSHYKKRCKLAAAPSCETKKLCFEDFTISLSKNSAFHRVFLQD 290

Query: 268 E-TEAAILLMALSCGLVNG 215
           E  EAAILLMALSCGLV+G
Sbjct: 291 EIKEAAILLMALSCGLVHG 309


>ref|XP_010242203.1| PREDICTED: GATA transcription factor 21-like [Nelumbo nucifera]
          Length = 305

 Score =  178 bits (451), Expect = 9e-42
 Identities = 111/245 (45%), Positives = 141/245 (57%), Gaps = 27/245 (11%)
 Frame = -1

Query: 868 GGSSDHQ----SVSPPSLST-LESGSEYGDQNKSHS---TPSEKGSGSWMSSKMRLMKKM 713
           GG SDHQ       PP++   + SG E  +  +  +   +    GS  WMSSKMRLM+KM
Sbjct: 68  GGPSDHQYFPDDPPPPTVEDDINSGLELSNSKQRENRGGSQGNMGSVRWMSSKMRLMRKM 127

Query: 712 INSD--GSNDALKPIRKRF-----------WFQDPKLSLSGDNLASNNNIIRTCSDCNTT 572
            NSD  G +  +     +F           W  D   + S +N    NN +R CSDCNTT
Sbjct: 128 KNSDRVGMDKPVNTNMHKFQQDHHHRSPSPWEMDTSSNSSSNNA---NNTVRVCSDCNTT 184

Query: 571 RTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGILTTDTPSVASKVHCSKKK- 395
           +TPLWRSGP+GPKSLCNACGI              A+   +L T+  S+ +KVH  +K+ 
Sbjct: 185 KTPLWRSGPRGPKSLCNACGI----RQRKARRAMAAANGTLLPTEASSMKNKVHHKEKRS 240

Query: 394 SEKGHVAQHKKQCKSKTN-----NIGFEDFIVNLSNNLVLHRVFPQDETEAAILLMALSC 230
           SE G+V Q+KK+CK  T+      + FEDF +NLS N   HRVFPQDE EAAILLMALSC
Sbjct: 241 SETGYVQQYKKRCKLATSPRSMKKVCFEDFTINLSKNSSFHRVFPQDEKEAAILLMALSC 300

Query: 229 GLVNG 215
           GLV+G
Sbjct: 301 GLVHG 305


>ref|XP_007012281.1| GATA type zinc finger transcription factor family protein, putative
           [Theobroma cacao] gi|508782644|gb|EOY29900.1| GATA type
           zinc finger transcription factor family protein,
           putative [Theobroma cacao]
          Length = 311

 Score =  159 bits (402), Expect = 4e-36
 Identities = 111/260 (42%), Positives = 137/260 (52%), Gaps = 20/260 (7%)
 Frame = -1

Query: 937 QDQLKEQFKQYQPSQEVVDN-VSQGGSSDHQSVSPPSLSTLESGSEYGDQNKSHSTP--- 770
           QDQ   + ++ +P     +  ++  GS D Q+ S  SL +    S     N S S     
Sbjct: 51  QDQTVTKPEESKPHDHKGNQFMTHEGSIDQQASSSSSLQSAVDQSTANGYNLSFSRKEDG 110

Query: 769 ---SEKGSGS---WMSSKMRLMKKMINSDGSNDALKPIRKRFWFQDPKLSLSGDNLASN- 611
              S  G+GS   WMSSK+RLMKKM+NS+ S    KP +    FQ P       N  S  
Sbjct: 111 DCESASGNGSSVKWMSSKVRLMKKMMNSNCSGADDKPPKFTQRFQYPVHDSDETNSFSKA 170

Query: 610 NNIIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTD--GILTTD 437
           NN +R CSDCNTT TPLWRSGP+GPKSLCNACGI              A+ +       D
Sbjct: 171 NNTVRVCSDCNTTTTPLWRSGPRGPKSLCNACGIRQRKARRAMEAAAAAAAENGAAAAAD 230

Query: 436 TPSVASKVHCSK-KKSEKGHVAQHKKQCK------SKTNNIGFEDFIVNLSNNLVLHRVF 278
             S+  KVH  K KKS   HVAQ KKQ K           + F++F ++LS N  L RVF
Sbjct: 231 ASSMKIKVHIHKEKKSRTSHVAQCKKQVKPPYYSPQSQKKLCFKEFALSLSKNSALQRVF 290

Query: 277 PQDETEAAILLMALSCGLVN 218
           PQD  +AAILLM LSCGLV+
Sbjct: 291 PQDVEDAAILLMELSCGLVH 310


>ref|XP_007012845.1| GATA type zinc finger transcription factor family protein, putative
           [Theobroma cacao] gi|508783208|gb|EOY30464.1| GATA type
           zinc finger transcription factor family protein,
           putative [Theobroma cacao]
          Length = 302

 Score =  158 bits (399), Expect = 1e-35
 Identities = 110/249 (44%), Positives = 139/249 (55%), Gaps = 14/249 (5%)
 Frame = -1

Query: 919 QFKQYQPSQEVVDNVSQGGSSDHQSVSPPSLSTLESGSEYGDQNKSHSTPSEKGSGSWMS 740
           Q  QYQ  Q  +  V Q    +  S    SL   E G+E+           E  S  WMS
Sbjct: 66  QHFQYQEDQAKI-YVPQDEPLESDSGLNLSLRKKEEGNEHHQ--------IEDSSAKWMS 116

Query: 739 SKMRLMKKMINSDGSNDALKPIRKRFWFQDPKL--SLSGDNLAS-----NNNI-IRTCSD 584
           SKMR+M+KM++SD ++ +     K    ++PK   S S DN ++     N+NI IR C+D
Sbjct: 117 SKMRMMRKMMSSDRADLSNSSTPK---LEEPKQQPSSSPDNSSNSSYNNNDNITIRVCAD 173

Query: 583 CNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGILTTDTPSVASKVH-C 407
           CNTT+TPLWRSGP+GPKSLCNACGI              A+   +    TP++ SKV   
Sbjct: 174 CNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAANGAIVAAQTTPTMKSKVQDK 233

Query: 406 SKKKSEKGHVAQHKKQCKSKTNNIG-----FEDFIVNLSNNLVLHRVFPQDETEAAILLM 242
           SK+ S  G VAQ KK+CK  + + G     FED  + LS N   HRVFPQDE EAAILLM
Sbjct: 234 SKRSSNSGCVAQLKKKCKHSSQSQGRKKLCFEDLRIILSKNSAFHRVFPQDEKEAAILLM 293

Query: 241 ALSCGLVNG 215
           ALS GLV+G
Sbjct: 294 ALSYGLVHG 302


>gb|KHG09089.1| Putative GATA transcription factor 22 -like protein [Gossypium
           arboreum]
          Length = 305

 Score =  155 bits (393), Expect = 5e-35
 Identities = 105/250 (42%), Positives = 135/250 (54%), Gaps = 7/250 (2%)
 Frame = -1

Query: 943 AFQDQLKEQFKQYQPSQEVVDNVSQGGSSDHQSVSPPSLSTLESGSEYGDQNKSHSTPSE 764
           AF   L +QF   Q  QE + +V Q G    +S     LS  +        ++SH     
Sbjct: 68  AFYQSLPQQFHDDQQDQEKI-HVPQDGPL--RSDCELRLSIWKKEERVETHHQSHD---- 120

Query: 763 KGSGSWMSSKMRLMKKMINSDGSNDALKPIRKRFWFQDPKL-SLSGDNLASNNNIIRTCS 587
             S  WM SKMR+M+KM+NSD ++ +  P  K    Q+ K  S S DN   NN+ IR C+
Sbjct: 121 --SAKWMPSKMRMMRKMMNSDHTDLSNSPTPKSEDHQEQKQPSSSPDN---NNSTIRVCA 175

Query: 586 DCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGILTTDTPSVASKVHC 407
           DCNTT+TPLWRSGP+GPKSLCNACGI              A++  +     PS+ S+V  
Sbjct: 176 DCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAAAVAAAAASSVVAAETPPSMRSEVQL 235

Query: 406 SKKKSEKGHVAQHK-KQCKSKTNN-----IGFEDFIVNLSNNLVLHRVFPQDETEAAILL 245
             K+S    V   K K+CK  + +     + FED  + LS N   H VFPQDE EAAILL
Sbjct: 236 KAKRSSNNGVPHLKNKKCKHNSQSQSRKKLCFEDLRIILSKNSAFHGVFPQDEKEAAILL 295

Query: 244 MALSCGLVNG 215
           MALS GLV+G
Sbjct: 296 MALSYGLVHG 305


>ref|XP_010090039.1| Putative GATA transcription factor 22 [Morus notabilis]
           gi|587848577|gb|EXB38836.1| Putative GATA transcription
           factor 22 [Morus notabilis]
          Length = 335

 Score =  155 bits (392), Expect = 6e-35
 Identities = 109/263 (41%), Positives = 137/263 (52%), Gaps = 42/263 (15%)
 Frame = -1

Query: 877 VSQGGSSDHQSVSPPSLSTLESG-----------------SEYGDQNKSHSTPSEKG-SG 752
           VS GGSSD   + PP ++  ES                  S Y     SH + +  G S 
Sbjct: 76  VSSGGSSD---IHPPRVAESESDHHQNDLKLSIWKSSTEDSNYDHDKSSHVSDNNAGYSA 132

Query: 751 SWMSSKMRLMKKMI-NSDGSN-DALKPIRKRFWFQD------PKLSLSGDNLAS------ 614
            WM SKMR+M+KMI N D +N D   P+     F        P   L  D+ ++      
Sbjct: 133 KWMPSKMRMMRKMIVNPDQTNIDHHTPLNFTHKFDQVMKRKHPASPLGTDHSSTSSSNNN 192

Query: 613 NNNIIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGILTTDT 434
           NNN IR C+DCNTT+TPLWRSGP+GPKSLCNACGI              A+   IL TD 
Sbjct: 193 NNNTIRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAAANGTILATDA 252

Query: 433 PSVAS--KVHCSKKKSEKGH--VAQHKKQCKSKTN------NIGFEDFIVNLSNNLVLHR 284
            ++ S  KV   +KK + G+  V Q KK+CK   +       I FED  +++S N    R
Sbjct: 253 TTMKSSTKVQRKEKKPKNGNGVVPQFKKRCKLTASPSRGRKKICFEDLAISISKNSAFQR 312

Query: 283 VFPQDETEAAILLMALSCGLVNG 215
           VFPQDE +AAILLMALS GLV+G
Sbjct: 313 VFPQDEKDAAILLMALSYGLVHG 335


>ref|XP_006450838.1| hypothetical protein CICLE_v10008968mg [Citrus clementina]
           gi|568844084|ref|XP_006475926.1| PREDICTED: putative
           GATA transcription factor 22-like isoform X2 [Citrus
           sinensis] gi|557554064|gb|ESR64078.1| hypothetical
           protein CICLE_v10008968mg [Citrus clementina]
           gi|641861410|gb|KDO80098.1| hypothetical protein
           CISIN_1g021329mg [Citrus sinensis]
          Length = 312

 Score =  154 bits (389), Expect = 1e-34
 Identities = 103/267 (38%), Positives = 138/267 (51%), Gaps = 26/267 (9%)
 Frame = -1

Query: 940 FQDQLKEQFKQYQPSQEVVDNVSQGGSSDHQSVSPPSLSTLESGSEYGDQ------NKSH 779
           FQDQ     ++ Q   + VD+    GSS+ Q  S  S+ T +  +   ++          
Sbjct: 48  FQDQRMIIMEESQQHDQKVDH---SGSSNLQVFSSSSIQTKKMNNITNNKLPIRKREVGE 104

Query: 778 STPSEKGS---GSWMSSKMRLMKKMINSDGSNDA-----LKPIRKRFWFQ-DPKLSLSGD 626
            T SE GS   G WMSSK+RLM KMINS  ++ A     +K  +K  + Q      ++  
Sbjct: 105 GTTSENGSSSSGKWMSSKIRLMHKMINSSSNSTATHELAVKVTQKLQYHQLHDNSEVNSF 164

Query: 625 NLASNNNIIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGIL 446
           N +++NN +R CSDCNTT TPLWRSGP+GPKSLCNACGI                T  I 
Sbjct: 165 NSSNSNNTMRACSDCNTTTTPLWRSGPRGPKSLCNACGIRQRKARRAMQAAAAVETGTIA 224

Query: 445 TT-DTPSVASKVHCSKKKSEKGHVAQHKKQCKS----------KTNNIGFEDFIVNLSNN 299
            T  +P    K+    KK    HV+Q+KKQ ++              + F+DF + LS N
Sbjct: 225 ATGGSPFAKIKLQIKDKKPRTSHVSQNKKQYRTLDPDPTHQYQSQRKLCFKDFAIALSKN 284

Query: 298 LVLHRVFPQDETEAAILLMALSCGLVN 218
             L +VFPQD  EAAILLM LSCG ++
Sbjct: 285 SALKQVFPQDVEEAAILLMELSCGFIH 311


>ref|XP_006451458.1| hypothetical protein CICLE_v10009004mg [Citrus clementina]
           gi|568843031|ref|XP_006475428.1| PREDICTED: putative
           GATA transcription factor 22-like [Citrus sinensis]
           gi|557554684|gb|ESR64698.1| hypothetical protein
           CICLE_v10009004mg [Citrus clementina]
          Length = 306

 Score =  152 bits (385), Expect = 4e-34
 Identities = 98/217 (45%), Positives = 118/217 (54%), Gaps = 12/217 (5%)
 Frame = -1

Query: 829 LSTLESGSEYGDQNKSHSTPSEKGSGSWMSSKMRLMKKMINSDGSNDALKPIRKRFWFQD 650
           LS      E  DQN+S ++ S K    WMSSKMRLMKKM+ S     A++ +      Q 
Sbjct: 95  LSMSSEKEERNDQNQSENSSSVK----WMSSKMRLMKKMMYSSPDAAAMQKLEDH-QKQP 149

Query: 649 PKLSLSGDNLASNNNI--IRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXX 476
           P  SL  DN  +NNN   IR C+DCNTT+TPLWRSGP+GPKSLCNACGI           
Sbjct: 150 PSSSLEPDNGNNNNNTNTIRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAA 209

Query: 475 XXXASTDGILTTDTPSVASKVHCSKKKSEKGHVAQHKKQCKSKTNN--------IGFEDF 320
                T   L  D  S   K   + + S        KK+CK  +N+          FED 
Sbjct: 210 AAANGTAVQLAADDTSSNKKKSKTPRPSNNNSCLPFKKRCKYNSNSPSRGKKKLCSFEDL 269

Query: 319 IVNLS--NNLVLHRVFPQDETEAAILLMALSCGLVNG 215
            +NLS  N+  L RVFPQ+E EAAILLMALS GLV+G
Sbjct: 270 TLNLSKNNSSALQRVFPQEEKEAAILLMALSYGLVHG 306


>ref|XP_006450839.1| hypothetical protein CICLE_v10008968mg [Citrus clementina]
           gi|568844082|ref|XP_006475925.1| PREDICTED: putative
           GATA transcription factor 22-like isoform X1 [Citrus
           sinensis] gi|557554065|gb|ESR64079.1| hypothetical
           protein CICLE_v10008968mg [Citrus clementina]
           gi|641861411|gb|KDO80099.1| hypothetical protein
           CISIN_1g021329mg [Citrus sinensis]
          Length = 314

 Score =  152 bits (384), Expect = 5e-34
 Identities = 102/267 (38%), Positives = 136/267 (50%), Gaps = 26/267 (9%)
 Frame = -1

Query: 940 FQDQLKEQFKQYQPSQEVVDNVSQGGSSDHQSVSPPSLSTLESGSEYGDQ------NKSH 779
           FQDQ     ++ Q   +    V   GSS+ Q  S  S+ T +  +   ++          
Sbjct: 48  FQDQRMIIMEESQQHDQKA-RVDHSGSSNLQVFSSSSIQTKKMNNITNNKLPIRKREVGE 106

Query: 778 STPSEKGS---GSWMSSKMRLMKKMINSDGSNDA-----LKPIRKRFWFQ-DPKLSLSGD 626
            T SE GS   G WMSSK+RLM KMINS  ++ A     +K  +K  + Q      ++  
Sbjct: 107 GTTSENGSSSSGKWMSSKIRLMHKMINSSSNSTATHELAVKVTQKLQYHQLHDNSEVNSF 166

Query: 625 NLASNNNIIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGIL 446
           N +++NN +R CSDCNTT TPLWRSGP+GPKSLCNACGI                T  I 
Sbjct: 167 NSSNSNNTMRACSDCNTTTTPLWRSGPRGPKSLCNACGIRQRKARRAMQAAAAVETGTIA 226

Query: 445 TT-DTPSVASKVHCSKKKSEKGHVAQHKKQCKS----------KTNNIGFEDFIVNLSNN 299
            T  +P    K+    KK    HV+Q+KKQ ++              + F+DF + LS N
Sbjct: 227 ATGGSPFAKIKLQIKDKKPRTSHVSQNKKQYRTLDPDPTHQYQSQRKLCFKDFAIALSKN 286

Query: 298 LVLHRVFPQDETEAAILLMALSCGLVN 218
             L +VFPQD  EAAILLM LSCG ++
Sbjct: 287 SALKQVFPQDVEEAAILLMELSCGFIH 313


>ref|XP_006600457.1| PREDICTED: GATA transcription factor 21-like isoform X2 [Glycine
           max] gi|947053264|gb|KRH02717.1| hypothetical protein
           GLYMA_17G055200 [Glycine max]
          Length = 310

 Score =  150 bits (380), Expect = 2e-33
 Identities = 105/263 (39%), Positives = 138/263 (52%), Gaps = 28/263 (10%)
 Frame = -1

Query: 922 EQFKQYQPS--QEVVDNVSQGGSSDHQ-SVSPPSLSTLESGSEYGDQNKSHSTPSEKGSG 752
           E  KQY PS  +E    +   GS DH  + S  + +T+   +E  ++N   S  +E GS 
Sbjct: 49  EPTKQYLPSHEEETEKIIPSSGSWDHSVAESEHNKATVWKKAEERNENLE-SVAAEDGSL 107

Query: 751 SWMSSKMRLMKKMINSDGSNDALKPIRKRFW-FQDPKLSLSG----DNLASNN------N 605
            WM +KMR+M+KM+ SD ++            F D K  LS     DN +SNN      N
Sbjct: 108 KWMPAKMRIMRKMLVSDQTDTYTNSDNNTTHKFDDQKQQLSSPLGTDNSSSNNYSNHSNN 167

Query: 604 IIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDG--ILTTDTP 431
            +R CSDC+TT+TPLWRSGP+GPKSLCNACGI              +++    ++     
Sbjct: 168 TVRVCSDCHTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAASASGNGTVIVEAKK 227

Query: 430 SVASKVHCSKKKSEKGH---VAQHKKQCK---------SKTNNIGFEDFIVNLSNNLVLH 287
           SV  +    KKK +K      AQ KK+ K            N  GFED  + L  NL +H
Sbjct: 228 SVKGRNKLQKKKEKKTRTEGAAQMKKKRKLGVGSAKASQSRNKFGFEDLTLRLRKNLAMH 287

Query: 286 RVFPQDETEAAILLMALSCGLVN 218
           +VFPQDE EAAILLMALS GLV+
Sbjct: 288 QVFPQDEKEAAILLMALSYGLVH 310


>ref|XP_003550634.1| PREDICTED: GATA transcription factor 21-like isoform X1 [Glycine
           max] gi|734365288|gb|KHN17667.1| Putative GATA
           transcription factor 22 [Glycine soja]
           gi|947053263|gb|KRH02716.1| hypothetical protein
           GLYMA_17G055200 [Glycine max]
          Length = 322

 Score =  150 bits (380), Expect = 2e-33
 Identities = 105/263 (39%), Positives = 138/263 (52%), Gaps = 28/263 (10%)
 Frame = -1

Query: 922 EQFKQYQPS--QEVVDNVSQGGSSDHQ-SVSPPSLSTLESGSEYGDQNKSHSTPSEKGSG 752
           E  KQY PS  +E    +   GS DH  + S  + +T+   +E  ++N   S  +E GS 
Sbjct: 61  EPTKQYLPSHEEETEKIIPSSGSWDHSVAESEHNKATVWKKAEERNENLE-SVAAEDGSL 119

Query: 751 SWMSSKMRLMKKMINSDGSNDALKPIRKRFW-FQDPKLSLSG----DNLASNN------N 605
            WM +KMR+M+KM+ SD ++            F D K  LS     DN +SNN      N
Sbjct: 120 KWMPAKMRIMRKMLVSDQTDTYTNSDNNTTHKFDDQKQQLSSPLGTDNSSSNNYSNHSNN 179

Query: 604 IIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDG--ILTTDTP 431
            +R CSDC+TT+TPLWRSGP+GPKSLCNACGI              +++    ++     
Sbjct: 180 TVRVCSDCHTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAASASGNGTVIVEAKK 239

Query: 430 SVASKVHCSKKKSEKGH---VAQHKKQCK---------SKTNNIGFEDFIVNLSNNLVLH 287
           SV  +    KKK +K      AQ KK+ K            N  GFED  + L  NL +H
Sbjct: 240 SVKGRNKLQKKKEKKTRTEGAAQMKKKRKLGVGSAKASQSRNKFGFEDLTLRLRKNLAMH 299

Query: 286 RVFPQDETEAAILLMALSCGLVN 218
           +VFPQDE EAAILLMALS GLV+
Sbjct: 300 QVFPQDEKEAAILLMALSYGLVH 322


>emb|CDP03165.1| unnamed protein product [Coffea canephora]
          Length = 318

 Score =  150 bits (379), Expect = 2e-33
 Identities = 102/259 (39%), Positives = 130/259 (50%), Gaps = 24/259 (9%)
 Frame = -1

Query: 919 QFKQYQPSQEVVDNVSQGGSSD-HQSVSPPSLSTLESGSEYGDQNKSHSTPSEKGSGSWM 743
           Q  Q +  Q+V ++    GS D  +  +  S  +L   +  G+Q   H   +   +  W+
Sbjct: 63  QMHQQEYQQQVENHAPYTGSQDPEKKANKGSKISLWKNNTNGNQADDHEEINPVNN-KWV 121

Query: 742 SSKMRLMKKMINSDGSNDALKPIRKRFWFQDPK-----LSLSGDNLASN------NNIIR 596
           SSK++LM+KM N     +          F+D +      S   DN +SN      N  IR
Sbjct: 122 SSKVKLMQKM-NKPDLKEITSSTTTTMKFEDHQKQPTSASPEADNFSSNSSSNISNTPIR 180

Query: 595 TCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGILTTDTPSVASK 416
            C+DCNTT+TPLWRSGPKGPKSLCNACGI              A+      T   +   K
Sbjct: 181 VCADCNTTKTPLWRSGPKGPKSLCNACGIRQRKARRAMAAAAAAANGTSPPTYDTTAPLK 240

Query: 415 VHCSKKKSEKGHVAQHKKQCKSKTN------------NIGFEDFIVNLSNNLVLHRVFPQ 272
           V    K   K +  Q KK+CK  T+              GFEDF+ NLS NL  HRVFPQ
Sbjct: 241 VKVQNKDKLKNN-GQFKKRCKLNTSAESSQNLHAVQKKSGFEDFLFNLSKNLAFHRVFPQ 299

Query: 271 DETEAAILLMALSCGLVNG 215
           DE EAAILLMALSCGLV+G
Sbjct: 300 DEKEAAILLMALSCGLVHG 318


>ref|XP_010942001.1| PREDICTED: putative GATA transcription factor 22 [Elaeis
           guineensis]
          Length = 291

 Score =  150 bits (378), Expect = 3e-33
 Identities = 97/245 (39%), Positives = 133/245 (54%), Gaps = 12/245 (4%)
 Frame = -1

Query: 916 FKQYQPSQEVVDNVSQGGSSDHQSVSPPSLSTLESGSEYGDQ---NKSHSTPSEKGSGSW 746
           + Q Q  ++  + V   GSSD      P  +  +  ++  DQ   N  H      GS  W
Sbjct: 60  YHQQQQQEKPNEFVLIDGSSDF-----PQPTNTDDNNDKMDQYVCNGYHEDEDGHGSVKW 114

Query: 745 MSSKMRLMKKMINSDGSNDALKPIRKRFWFQDPKLSLSGDNLASNNNI----IRTCSDCN 578
           M SKMR M+KM+ S+ +  + KP R+    +D +     +   SN+N     IR CSDC+
Sbjct: 115 MPSKMRWMRKMVASEQTVRS-KPARRSM--EDLQEEKQHNQDMSNSNFPSGTIRVCSDCS 171

Query: 577 TTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGILTTDTPSVASKVHCSKK 398
           TT+TPLWRSGP+GPKSLCNACGI               S  G+  T+TP    K    +K
Sbjct: 172 TTKTPLWRSGPQGPKSLCNACGIRQRKARRAMAAAATGS--GLRATNTPRKVQK----EK 225

Query: 397 KSEKGHVAQHKKQCKSKT-----NNIGFEDFIVNLSNNLVLHRVFPQDETEAAILLMALS 233
           +  + H   +KK+CK  T       +  +D +++LSNN   HRVFPQDET+AAILLMALS
Sbjct: 226 RRGRDHTIPNKKRCKIDTTRTAQRKLEIDDIMMSLSNNSAFHRVFPQDETDAAILLMALS 285

Query: 232 CGLVN 218
           CGL++
Sbjct: 286 CGLIH 290


>ref|XP_002514107.1| hypothetical protein RCOM_1046780 [Ricinus communis]
           gi|223546563|gb|EEF48061.1| hypothetical protein
           RCOM_1046780 [Ricinus communis]
          Length = 312

 Score =  149 bits (377), Expect = 3e-33
 Identities = 104/269 (38%), Positives = 142/269 (52%), Gaps = 25/269 (9%)
 Frame = -1

Query: 946 NAFQDQLKEQFKQYQP-SQEVVDNV--SQGGSSDHQSVSPPSLSTLESGSEYG-----DQ 791
           N  Q+++    K+ QP   + VDN+  S G S DH+ +   +    E+G E       D+
Sbjct: 49  NPPQEEVGYYHKELQPLHHQEVDNIYASHGRSWDHRIIKNEN----ENGQELSVCKKEDK 104

Query: 790 NKSHSTPSEKGSGSWMSSKMRLMKKMINSDGSNDALKPIRKRFWFQDPKLS--------L 635
           + S     +  S  WMSSKMRLM+KM+ +D + +  +        +D + S         
Sbjct: 105 STSIEDQRDNSSVKWMSSKMRLMRKMMTTDQTVNTTQHTSSMHKLEDKEKSRSLPLQDDY 164

Query: 634 SGDNLASN-NNIIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXAST 458
           S  NL+ N NN IR CSDCNTT+TPLWRSGP+GPKSLCNACGI              ++ 
Sbjct: 165 SSKNLSDNSNNTIRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRALAAAQASAN 224

Query: 457 DGILTTDTPSV-ASKVHCSKKKSEKGHVAQHKKQCKSKTNNIG------FEDFIVN-LSN 302
             I   DT ++  +KV   +K++   H+   KK+CK    + G      FED     LS 
Sbjct: 225 GTIFAPDTAAMKTNKVQNKEKRTNNSHL-PFKKRCKFTAQSRGSRKKLCFEDLSSTILSK 283

Query: 301 NLVLHRVFPQDETEAAILLMALSCGLVNG 215
           N    ++FPQDE EAAILLMALS GLV+G
Sbjct: 284 NSAFQQLFPQDEKEAAILLMALSYGLVHG 312


>gb|KDO57760.1| hypothetical protein CISIN_1g021859mg [Citrus sinensis]
          Length = 306

 Score =  149 bits (376), Expect = 4e-33
 Identities = 96/217 (44%), Positives = 118/217 (54%), Gaps = 12/217 (5%)
 Frame = -1

Query: 829 LSTLESGSEYGDQNKSHSTPSEKGSGSWMSSKMRLMKKMINSDGSNDALKPIRKRFWFQD 650
           LS      E  DQN+S ++ S K    WMSSKMRLMKKM+ S     A++ +      Q 
Sbjct: 95  LSMSSEKEERNDQNQSENSSSVK----WMSSKMRLMKKMMYSSPDAAAMQKLEDH-QKQP 149

Query: 649 PKLSLSGDNLASNNNI--IRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXX 476
           P  SL  DN  +NNN   IR C+DCNTT+TPLWRSGP+GPKSLCNACGI           
Sbjct: 150 PSSSLEPDNGNNNNNTNTIRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAA 209

Query: 475 XXXASTDGILTTDTPSVASKVHCSKKKSEKGHVAQHKKQCKSKTNN--------IGFED- 323
                T   L  D  S   K   + + S        KK+CK  +N+          FED 
Sbjct: 210 AAANGTAVQLAADDTSSNKKKSKTPRPSNNNSCLPFKKRCKYNSNSPSRGKKKLCSFEDL 269

Query: 322 -FIVNLSNNLVLHRVFPQDETEAAILLMALSCGLVNG 215
             I++ +N+  L RVFPQ+E EAAILLMALS GLV+G
Sbjct: 270 TLILSKNNSSALQRVFPQEEKEAAILLMALSYGLVHG 306


>ref|NP_001280882.1| transcription factor GATA-5 [Malus domestica]
           gi|302398801|gb|ADL36695.1| GATA domain class
           transcription factor [Malus domestica]
          Length = 359

 Score =  149 bits (376), Expect = 4e-33
 Identities = 107/286 (37%), Positives = 139/286 (48%), Gaps = 54/286 (18%)
 Frame = -1

Query: 910 QYQPSQEVVDNVSQGGSSDHQ---------SVSPPSLSTLESGSEYGDQNKSHSTPSEKG 758
           Q+Q  +   + V  GGS DH          S +   LS  ++G+  G+ N      +   
Sbjct: 75  QFQLLEADHNIVPHGGSHDHDHQAIENEGGSGTVLKLSISKNGA-VGNGNPGTDHETSTS 133

Query: 757 SGSWMSSKMRLMKKMINSDGSNDAL-----KPIRKRFW--------FQDPKLSLSGDNLA 617
           S  WMSSKMR+M+KM N D ++ +      KPI  +           Q P   L  D ++
Sbjct: 134 SVKWMSSKMRMMRKMSNPDQTSSSSTSSDDKPISMKLSSHKFEEQKLQHPSSQLGADMIS 193

Query: 616 SNNN---------IIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXA 464
            +NN         IIR CSDCNTT+TPLWRSGP+GPKSLCNACGI              A
Sbjct: 194 CSNNSSNNMNNVPIIRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAA 253

Query: 463 STDGILTTDTPSV-ASKVHCSKKKSEKGHVAQHKKQ----------CKSKTNNIGFEDFI 317
           ++   LT   PS+ +SKV     KS        KK+           + K+  + FEDF 
Sbjct: 254 ASGTTLTVAAPSMKSSKVQPKANKSRVSSTVPFKKRPYNKLSSSPSSRGKSKKLCFEDFT 313

Query: 316 VNLSNN------------LVLHRVFPQDETEAAILLMALSCGLVNG 215
           +++ NN              L RVFPQDE EAAILLMALSCGLV+G
Sbjct: 314 ISMKNNSSSGNPTAATTTTALQRVFPQDEKEAAILLMALSCGLVHG 359


>ref|XP_013458498.1| GATA type zinc finger transcription factor family protein [Medicago
           truncatula] gi|657391198|gb|KEH32529.1| GATA type zinc
           finger transcription factor family protein [Medicago
           truncatula]
          Length = 327

 Score =  148 bits (374), Expect = 8e-33
 Identities = 95/214 (44%), Positives = 120/214 (56%), Gaps = 20/214 (9%)
 Frame = -1

Query: 796 DQNKSHSTPSEKGSGSWMSSKMRLMKKMINSDGSNDA-LKPIRKRFWFQDPKLSLSG--- 629
           + N +     +  S  WMSSKMR+MKKM+ SD +  + L    K+  F+D K  LS    
Sbjct: 115 EMNNNQEADQDGTSVKWMSSKMRIMKKMMVSDQTGSSNLTSNSKQIKFEDQKQPLSPQGT 174

Query: 628 DNLASNN-NIIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDG 452
           DN +SNN + IR CSDCNTT+TPLWRSGP+GPKSLCNACGI              ++   
Sbjct: 175 DNSSSNNYSTIRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRALAAAAASANGT 234

Query: 451 ILTTDTPSVASKVHCSKKKSEKGHV----------AQHKKQCK-----SKTNNIGFEDFI 317
            +   T SV  K    KKK  K  +           +HK + K     S+   I FED  
Sbjct: 235 TIADQTASVKRK-KLQKKKENKSKIEFDCSTVHMKKKHKLEAKPPSHQSRKEFITFEDLK 293

Query: 316 VNLSNNLVLHRVFPQDETEAAILLMALSCGLVNG 215
           ++LS NL + +VFPQDE EAAILLMALS GLV+G
Sbjct: 294 LSLSENLGVQQVFPQDEREAAILLMALSYGLVHG 327


>gb|KOM29610.1| hypothetical protein LR48_Vigan728s003300 [Vigna angularis]
          Length = 306

 Score =  148 bits (373), Expect = 1e-32
 Identities = 96/236 (40%), Positives = 129/236 (54%), Gaps = 19/236 (8%)
 Frame = -1

Query: 865 GSSDHQSVSPPSLSTLESGSEYGDQNKSHSTPSEKGSGSWMSSKMRLMKKMINSDGSNDA 686
           GS DH SV+   L       +  ++++ H   +E GS   MSSKMR+M+KM+ SD +   
Sbjct: 77  GSWDH-SVAQSELKVTVCKQK--ERSEDHEAAAEDGSVKLMSSKMRMMQKMMGSDQTGAY 133

Query: 685 LKP--IRKRFWFQDPKLSLSG---DNLASNN------NIIRTCSDCNTTRTPLWRSGPKG 539
           ++   + K   F+D K  LS    DN +SNN      N +R C+DC+TT+TPLWRSGP+G
Sbjct: 134 IEDSTVNK---FEDEKQPLSPLGTDNSSSNNCSNHSNNTVRVCADCHTTKTPLWRSGPRG 190

Query: 538 PKSLCNACGIXXXXXXXXXXXXXXASTDGILTTDTPSVASKVHCSKKKSEKGHVAQHKKQ 359
           PKSLCNACGI               +   I  T+     +K+   +KK+      Q KK+
Sbjct: 191 PKSLCNACGIRQRKARRAMAAAASGNGTVIFETEKSVKGNKLQKKEKKARTQGAPQMKKK 250

Query: 358 CK--------SKTNNIGFEDFIVNLSNNLVLHRVFPQDETEAAILLMALSCGLVNG 215
            K           N  GFED  + L  +L +H+VFPQDE EAAILLMALS GLV+G
Sbjct: 251 RKHGVGAKPSQSRNKFGFEDLTLRLRKSLAMHQVFPQDEKEAAILLMALSYGLVHG 306

