BLASTX nr result

ID: Rehmannia28_contig00009198 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia28_contig00009198
         (1367 letters)

Database: ./nr 
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011093726.1| PREDICTED: putative GATA transcription facto...   231   3e-69
gb|EYU27295.1| hypothetical protein MIMGU_mgv1a020800mg [Erythra...   212   7e-62
ref|XP_002282173.1| PREDICTED: GATA transcription factor 21 isof...   206   9e-60
ref|XP_010656197.1| PREDICTED: GATA transcription factor 21 isof...   204   7e-59
ref|XP_011078200.1| PREDICTED: putative GATA transcription facto...   198   1e-57
emb|CDP03165.1| unnamed protein product [Coffea canephora]            197   3e-56
ref|XP_006346565.2| PREDICTED: GATA transcription factor 21 [Sol...   194   3e-55
ref|XP_009768461.1| PREDICTED: putative GATA transcription facto...   192   5e-54
ref|XP_007012845.1| GATA type zinc finger transcription factor f...   190   2e-53
ref|XP_009604300.1| PREDICTED: putative GATA transcription facto...   188   1e-52
ref|XP_004243958.1| PREDICTED: GATA transcription factor 21 [Sol...   184   1e-51
ref|XP_010262144.1| PREDICTED: putative GATA transcription facto...   184   3e-51
ref|XP_012848547.1| PREDICTED: putative GATA transcription facto...   180   7e-51
ref|XP_015080599.1| PREDICTED: putative GATA transcription facto...   180   4e-50
ref|XP_010242203.1| PREDICTED: GATA transcription factor 21-like...   178   7e-49
ref|XP_012452366.1| PREDICTED: GATA transcription factor 21 [Gos...   177   1e-48
ref|XP_010090039.1| Putative GATA transcription factor 22 [Morus...   176   1e-47
gb|KHG09089.1| Putative GATA transcription factor 22 -like prote...   172   1e-46
gb|EEF48061.1| hypothetical protein RCOM_1046780 [Ricinus communis]   171   5e-46
ref|XP_010049743.1| PREDICTED: GATA transcription factor 21 [Euc...   171   7e-46

>ref|XP_011093726.1| PREDICTED: putative GATA transcription factor 22 [Sesamum indicum]
          Length = 308

 Score =  231 bits (589), Expect = 3e-69
 Identities = 146/307 (47%), Positives = 174/307 (56%), Gaps = 20/307 (6%)
 Frame = +2

Query: 227  DNNDQHHQPFGPCH------------------IFFNSTQDHMMESYNYDHHPHLYQPRQI 352
            D++ Q  QPF P                    IFF+S QDH    ++  +HPH   P++ 
Sbjct: 20   DHDHQFLQPFAPNQYHNQVVSSSPSSSASSSPIFFSSAQDHTGFYHHQLYHPH---PQED 76

Query: 353  SDDNLGYHGGSTY-EIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLM 529
                 GY GGS+  E+KNKV+ SGLKLTLWK +D      E + +K+ PVKWMS+KMR+M
Sbjct: 77   DSTTYGYRGGSSCDEMKNKVN-SGLKLTLWKNKD------EGLAKKDIPVKWMSTKMRVM 129

Query: 530  QKMKTPDRVALKITSTATTKLEQPXXXXXXXXXXXXXXXXXXXPIRVCSDCNTTKTPLWR 709
            QKMK  DR+++    ++  +L QP                   PIRVC+DCNTTKTPLWR
Sbjct: 130  QKMKNTDRISV----SSFKQLLQPSSSMETDLSSNSSSYNSNSPIRVCADCNTTKTPLWR 185

Query: 710  SGPKGPKSLCNACGIRQRKXXXXXXXXXXXXNGTADQPPA-MKIKVQHKLEKTGKNGHAS 886
            SGPKGPKSLCNACGIRQRK               A+  PA +KIKVQHK  KT     + 
Sbjct: 186  SGPKGPKSLCNACGIRQRKARRAMAAAAANGMVVANARPATLKIKVQHKEMKTTGKNVSG 245

Query: 887  HFKKRCKXXXXXXXXXXGSPSDGPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMAL 1066
              KK CK           S S   K    E+FLINLS NLA  RVFPEDEKDAAILLMAL
Sbjct: 246  GLKKGCK----IASAGSSSSSSSSKASRLEEFLINLSNNLAVHRVFPEDEKDAAILLMAL 301

Query: 1067 SSGLVHG 1087
            SSGLVHG
Sbjct: 302  SSGLVHG 308


>gb|EYU27295.1| hypothetical protein MIMGU_mgv1a020800mg [Erythranthe guttata]
          Length = 315

 Score =  212 bits (540), Expect = 7e-62
 Identities = 135/286 (47%), Positives = 159/286 (55%), Gaps = 13/286 (4%)
 Frame = +2

Query: 269  IFFNSTQDHMMESYNYDHHPHLYQPRQISDDNLGYHGGSTYEIKNKVDQSGLKLTLWKKE 448
            +FF +   H +  YN  H    +Q   I + N   +        N  + +GLK+TLWKKE
Sbjct: 50   LFFTTPPHHQL--YNQPH----FQDHMIKNSNSNNN--------NNNNNNGLKITLWKKE 95

Query: 449  DHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVALKITS-----TATTKLEQPXXXX 613
                  DE      NPVKWMSSK+RLM++M        KI S     + ++ LE      
Sbjct: 96   P-----DEGAAADINPVKWMSSKIRLMKRMNKNIPAKSKIDSDQNPSSNSSLLESSDHLS 150

Query: 614  XXXXXXXXXXXXXXXPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRKXXXXXXXXX 793
                           PIRVC+DCNTTKTPLWRSGPKGPKSLCNACGIRQRK         
Sbjct: 151  SGNSSSYNNNNNSNYPIRVCADCNTTKTPLWRSGPKGPKSLCNACGIRQRKARRAMAAAA 210

Query: 794  XXXNGTA----DQPPAMKIKVQHKLEKTGK-NGHASHFKKRCKXXXXXXXXXXG---SPS 949
               +G        PP +KIKVQHK EK GK NGH+S  KKR K              S +
Sbjct: 211  AAASGAVVAANQPPPVLKIKVQHK-EKMGKNNGHSSLLKKRFKTADNNTNAAGSSADSTN 269

Query: 950  DGPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVHG 1087
            +G KKLGFE+FLINLS NL+  RVFP+DEKDAAILLMALSSGLVHG
Sbjct: 270  NGKKKLGFEEFLINLSNNLSIHRVFPDDEKDAAILLMALSSGLVHG 315


>ref|XP_002282173.1| PREDICTED: GATA transcription factor 21 isoform X2 [Vitis vinifera]
            gi|297738668|emb|CBI27913.3| unnamed protein product
            [Vitis vinifera]
          Length = 309

 Score =  206 bits (525), Expect = 9e-60
 Identities = 139/308 (45%), Positives = 163/308 (52%), Gaps = 22/308 (7%)
 Frame = +2

Query: 230  NNDQHHQP-FGP-------------CHIFFNSTQDHMMESYNYDHHPHLYQPRQISDDNL 367
            N DQHHQ  F P             C IFF+ T++     Y   H     QP+Q + D  
Sbjct: 19   NEDQHHQLLFSPKPQPSSSSSSSLTCPIFFSPTKEQGGCHYRDLHQA---QPQQEAHDKF 75

Query: 368  GYHGGS-TYEIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKT 544
             + GGS  +        +GLKLT+WK ED   +  E     N  VKWMSSKMR+MQKM  
Sbjct: 76   VFRGGSYDHPTLESESDNGLKLTIWKTEDRNENHSE-----NGSVKWMSSKMRVMQKMMI 130

Query: 545  PDRV-ALKITSTATT---KLEQPXXXXXXXXXXXXXXXXXXXPIRVCSDCNTTKTPLWRS 712
             D+  A K ++TA       +Q                     IRVC+DCNTTKTPLWRS
Sbjct: 131  SDQTGAQKPSNTALNFGDHKQQSLPSETDYNSINSSNINSNNTIRVCADCNTTKTPLWRS 190

Query: 713  GPKGPKSLCNACGIRQRKXXXXXXXXXXXXNGT--ADQPPAMKIKVQHKLEKTGKNGHAS 886
            GP+GPKSLCNACGIRQRK            NGT         K K +HK +K   NGH S
Sbjct: 191  GPRGPKSLCNACGIRQRKARRAMAAAAATANGTILPTNTAPTKTKAKHK-DKKSSNGHVS 249

Query: 887  HFKKRCKXXXXXXXXXXGSPSDGPKKLGFEDFLINLSKNLAFGRVFPEDE-KDAAILLMA 1063
            H+KKRCK           +PS   KKL FEDF I+LSKN AF RVF +DE K+AAILLMA
Sbjct: 250  HYKKRCK--------LAAAPSCETKKLCFEDFTISLSKNSAFHRVFLQDEIKEAAILLMA 301

Query: 1064 LSSGLVHG 1087
            LS GLVHG
Sbjct: 302  LSCGLVHG 309


>ref|XP_010656197.1| PREDICTED: GATA transcription factor 21 isoform X1 [Vitis vinifera]
          Length = 310

 Score =  204 bits (519), Expect = 7e-59
 Identities = 139/309 (44%), Positives = 164/309 (53%), Gaps = 23/309 (7%)
 Frame = +2

Query: 230  NNDQHHQP-FGP-------------CHIFFNSTQDHMMESYNYDHHPHLYQPRQ-ISDDN 364
            N DQHHQ  F P             C IFF+ T++     Y   H     QP+Q ++ D 
Sbjct: 19   NEDQHHQLLFSPKPQPSSSSSSSLTCPIFFSPTKEQGGCHYRDLHQA---QPQQEVAHDK 75

Query: 365  LGYHGGS-TYEIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMK 541
              + GGS  +        +GLKLT+WK ED   +  E     N  VKWMSSKMR+MQKM 
Sbjct: 76   FVFRGGSYDHPTLESESDNGLKLTIWKTEDRNENHSE-----NGSVKWMSSKMRVMQKMM 130

Query: 542  TPDRV-ALKITSTATT---KLEQPXXXXXXXXXXXXXXXXXXXPIRVCSDCNTTKTPLWR 709
              D+  A K ++TA       +Q                     IRVC+DCNTTKTPLWR
Sbjct: 131  ISDQTGAQKPSNTALNFGDHKQQSLPSETDYNSINSSNINSNNTIRVCADCNTTKTPLWR 190

Query: 710  SGPKGPKSLCNACGIRQRKXXXXXXXXXXXXNGT--ADQPPAMKIKVQHKLEKTGKNGHA 883
            SGP+GPKSLCNACGIRQRK            NGT         K K +HK +K   NGH 
Sbjct: 191  SGPRGPKSLCNACGIRQRKARRAMAAAAATANGTILPTNTAPTKTKAKHK-DKKSSNGHV 249

Query: 884  SHFKKRCKXXXXXXXXXXGSPSDGPKKLGFEDFLINLSKNLAFGRVFPEDE-KDAAILLM 1060
            SH+KKRCK           +PS   KKL FEDF I+LSKN AF RVF +DE K+AAILLM
Sbjct: 250  SHYKKRCK--------LAAAPSCETKKLCFEDFTISLSKNSAFHRVFLQDEIKEAAILLM 301

Query: 1061 ALSSGLVHG 1087
            ALS GLVHG
Sbjct: 302  ALSCGLVHG 310


>ref|XP_011078200.1| PREDICTED: putative GATA transcription factor 22 [Sesamum indicum]
          Length = 231

 Score =  198 bits (504), Expect = 1e-57
 Identities = 122/221 (55%), Positives = 135/221 (61%), Gaps = 24/221 (10%)
 Frame = +2

Query: 176 MNLNXXXXXXXXXXXXNDNNDQHHQPFGP-------CHIFFNS-TQDHMMESYNYDHHPH 331
           MNLN            + +  Q HQPF P       CHIFFNS TQDH    Y+YD    
Sbjct: 1   MNLNSSPIEQISDDHHHHHQQQPHQPFAPNPSPSVSCHIFFNSATQDHT--GYSYDRR-Q 57

Query: 332 LYQPRQISDDNLG--YHGGSTYEIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKN--NPV 499
            Y P+   DDN G  YHGGST EIKNKV+  GLKLTLWKKED   +++  +  K+  NPV
Sbjct: 58  FYHPQHHLDDNYGANYHGGSTCEIKNKVE-GGLKLTLWKKEDEE-YNESIVADKSTTNPV 115

Query: 500 KWMSSKMRLMQKMKTP------DRVALKITSTAT--TKLE----QPXXXXXXXXXXXXXX 643
           KWMSSKMRLM KMK P      DRVALKI+S+AT  TKLE    QP              
Sbjct: 116 KWMSSKMRLMHKMKNPTTTTTTDRVALKISSSATAPTKLEDQKLQPSSSLETDLSSNSFP 175

Query: 644 XXXXXPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRK 766
                PIRVC+DCNTTKTPLWRSGPKGPKSLCNACGIRQRK
Sbjct: 176 SNNNSPIRVCADCNTTKTPLWRSGPKGPKSLCNACGIRQRK 216


>emb|CDP03165.1| unnamed protein product [Coffea canephora]
          Length = 318

 Score =  197 bits (502), Expect = 3e-56
 Identities = 130/283 (45%), Positives = 155/283 (54%), Gaps = 12/283 (4%)
 Frame = +2

Query: 275  FNSTQDHMMESYNYDHHPHLYQPRQISDDNLGYHGGSTYEIKNKVDQSGLKLTLWKKEDH 454
            F S      E Y+   H   YQ  Q  +++  Y G    E   K    G K++LWK   +
Sbjct: 49   FRSIALDQTEDYHAQMHQQEYQ--QQVENHAPYTGSQDPE---KKANKGSKISLWKNNTN 103

Query: 455  MGHDDEHIPQKNNPV--KWMSSKMRLMQKMKTPDRVALKITSTATTKLE----QPXXXXX 616
                D+H  ++ NPV  KW+SSK++LMQKM  PD   +  ++T T K E    QP     
Sbjct: 104  GNQADDH--EEINPVNNKWVSSKVKLMQKMNKPDLKEITSSTTTTMKFEDHQKQPTSASP 161

Query: 617  XXXXXXXXXXXXXX--PIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRKXXXXXXXX 790
                            PIRVC+DCNTTKTPLWRSGPKGPKSLCNACGIRQRK        
Sbjct: 162  EADNFSSNSSSNISNTPIRVCADCNTTKTPLWRSGPKGPKSLCNACGIRQRKARRAMAAA 221

Query: 791  XXXXNGTA----DQPPAMKIKVQHKLEKTGKNGHASHFKKRCKXXXXXXXXXXGSPSDGP 958
                NGT+    D    +K+KVQ+K +K   NG    FKKRCK           +     
Sbjct: 222  AAAANGTSPPTYDTTAPLKVKVQNK-DKLKNNG---QFKKRCKLNTSAESSQ--NLHAVQ 275

Query: 959  KKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVHG 1087
            KK GFEDFL NLSKNLAF RVFP+DEK+AAILLMALS GLVHG
Sbjct: 276  KKSGFEDFLFNLSKNLAFHRVFPQDEKEAAILLMALSCGLVHG 318


>ref|XP_006346565.2| PREDICTED: GATA transcription factor 21 [Solanum tuberosum]
          Length = 286

 Score =  194 bits (493), Expect = 3e-55
 Identities = 130/304 (42%), Positives = 164/304 (53%), Gaps = 16/304 (5%)
 Frame = +2

Query: 224  NDNNDQHHQPFGPC-----------HIFFNSTQDHMMESYNYDHHPHLYQPRQISDDNLG 370
            N+NND ++    P            H FFNST +     ++     ++    Q+  DN  
Sbjct: 18   NNNNDLNNSLVTPNYQASSSHSSYNHFFFNSTTNQTASFHHQHTQYYMQNEHQLEVDN-- 75

Query: 371  YHGGSTYEI--KNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKT 544
              GGS+Y++  KNK   SGLKL+LWK+ED +    E        +K +  + +  + +  
Sbjct: 76   -DGGSSYDLGKKNK-GGSGLKLSLWKREDKLVMSSE--------IKDLDQERK--KNITN 123

Query: 545  PDRVALKITSTATTKLEQPXXXXXXXXXXXXXXXXXXXPIRVCSDCNTTKTPLWRSGPKG 724
             D + LK+      + +QP                   PIRVC+DCNTTKTPLWRSGPKG
Sbjct: 124  NDCIKLKLGD----QKQQPIQTDYSSNNI---------PIRVCTDCNTTKTPLWRSGPKG 170

Query: 725  PKSLCNACGIRQRKXXXXXXXXXXXXNGTADQPPAMKIKV-QHK--LEKTGKNGHASHFK 895
            PKSLCNACGIRQRK            NG  D   AMKIKV QHK  + K   N H + FK
Sbjct: 171  PKSLCNACGIRQRK---ARRAMAAAANGKTDHQTAMKIKVQQHKPNITKVRTNNHVTPFK 227

Query: 896  KRCKXXXXXXXXXXGSPSDGPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSG 1075
            KRCK             ++ PKKLGFED LINLS  LAF ++FP+DEK+AAILLMALSSG
Sbjct: 228  KRCK-----LGPSSSGTNNAPKKLGFEDLLINLSNQLAFQQIFPQDEKEAAILLMALSSG 282

Query: 1076 LVHG 1087
            LVHG
Sbjct: 283  LVHG 286


>ref|XP_009768461.1| PREDICTED: putative GATA transcription factor 22 [Nicotiana
            sylvestris]
          Length = 317

 Score =  192 bits (487), Expect = 5e-54
 Identities = 129/285 (45%), Positives = 154/285 (54%), Gaps = 9/285 (3%)
 Frame = +2

Query: 260  PCHIFFNSTQDHMMESYNYDHHPHLYQPRQISDDNLGYHGGSTYEIKNKVDQSGLKLTLW 439
            P   FFNST +     Y Y H  +   P Q   DN G  G    E KNKV  SGLKLTLW
Sbjct: 49   PYQFFFNSTTNQNQTGY-YHHDEYYTPPHQPEGDNEG--GSYDVEKKNKVG-SGLKLTLW 104

Query: 440  KKEDHMGHDDEHIPQKNNPVKWMSSKMRLM---QKMKTPDRVALKITSTATTKLEQPXXX 610
            K+ED   H++++   + +PVKWMSSK+      +  KT +   +K+      +       
Sbjct: 105  KREDKY-HENQN---EKDPVKWMSSKINTSDGARMEKTTNNTYIKLKLEDQKQQPSYPLE 160

Query: 611  XXXXXXXXXXXXXXXXPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRKXXXXXXXX 790
                            PIRVC+DCNTTKTPLWRSGPKGPK+LCNACGIRQRK        
Sbjct: 161  STDYSSNSSNNSNNNIPIRVCADCNTTKTPLWRSGPKGPKTLCNACGIRQRK--ARRAMA 218

Query: 791  XXXXNG----TADQPPAMKIKVQH-KLEKTGKNGHASHFKKRCKXXXXXXXXXXGSPSDG 955
                NG    T     +MK KV+H   EK         +KKRCK           S +  
Sbjct: 219  AAAANGETLTTETSSSSMKKKVKHLHKEKITNVNVTLPYKKRCK------FGPSSSSNTA 272

Query: 956  PKKLG-FEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVHG 1087
            PKKLG FEDFLI LS NLA  ++FP+DEK+AAILLMALSSGLVHG
Sbjct: 273  PKKLGNFEDFLIKLSNNLALHQIFPQDEKEAAILLMALSSGLVHG 317


>ref|XP_007012845.1| GATA type zinc finger transcription factor family protein, putative
            [Theobroma cacao] gi|508783208|gb|EOY30464.1| GATA type
            zinc finger transcription factor family protein, putative
            [Theobroma cacao]
          Length = 302

 Score =  190 bits (482), Expect = 2e-53
 Identities = 127/306 (41%), Positives = 159/306 (51%), Gaps = 19/306 (6%)
 Frame = +2

Query: 227  DNNDQHHQPFG-------------PCHIFFNSTQDHMMESYNYDHHPHLYQPRQISDDNL 367
            D+  Q HQ F               C I FN         +  + H H     Q  +D  
Sbjct: 20   DDQHQQHQLFSLKPQPPSLSSSSLTCPILFNPVVQEQAGGHQREPHQHF----QYQEDQA 75

Query: 368  GYHGGSTYEIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTP 547
              +      +++    SGL L+L KKE+     +EH   +++  KWMSSKMR+M+KM + 
Sbjct: 76   KIYVPQDEPLES---DSGLNLSLRKKEE----GNEHHQIEDSSAKWMSSKMRMMRKMMSS 128

Query: 548  DRVALKITSTATTKLEQPXXXXXXXXXXXXXXXXXXXP---IRVCSDCNTTKTPLWRSGP 718
            DR  L  ++++T KLE+P                       IRVC+DCNTTKTPLWRSGP
Sbjct: 129  DRADL--SNSSTPKLEEPKQQPSSSPDNSSNSSYNNNDNITIRVCADCNTTKTPLWRSGP 186

Query: 719  KGPKSLCNACGIRQRKXXXXXXXXXXXXNG---TADQPPAMKIKVQHKLEKTGKNGHASH 889
            +GPKSLCNACGIRQRK            NG    A   P MK KVQ K +++  +G  + 
Sbjct: 187  RGPKSLCNACGIRQRK-ARRAMAAAAAANGAIVAAQTTPTMKSKVQDKSKRSSNSGCVAQ 245

Query: 890  FKKRCKXXXXXXXXXXGSPSDGPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALS 1069
             KK+CK           S S G KKL FED  I LSKN AF RVFP+DEK+AAILLMALS
Sbjct: 246  LKKKCK---------HSSQSQGRKKLCFEDLRIILSKNSAFHRVFPQDEKEAAILLMALS 296

Query: 1070 SGLVHG 1087
             GLVHG
Sbjct: 297  YGLVHG 302


>ref|XP_009604300.1| PREDICTED: putative GATA transcription factor 22 [Nicotiana
            tomentosiformis]
          Length = 315

 Score =  188 bits (477), Expect = 1e-52
 Identities = 133/285 (46%), Positives = 155/285 (54%), Gaps = 13/285 (4%)
 Frame = +2

Query: 272  FFNSTQDHMMESYNYDHHPHLYQPRQISDDNLGYHGGSTYEIKNKVDQSGLKLTLWKKED 451
            FFNST +     Y Y    +   P Q   DN G  G    E KNKV  SGLKLTLWK+ED
Sbjct: 53   FFNSTTNQNQTGY-YHRDEYYTPPHQPEVDNEG--GLYDVEKKNKVG-SGLKLTLWKRED 108

Query: 452  HMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVALKITSTATT----KLE--QPXXXX 613
               + +E      NPVKWMSSK+      K  D+  L+ T+  +T    KLE  +     
Sbjct: 109  KNENQNE-----KNPVKWMSSKI------KASDQTILEKTTNNSTYIKLKLEDQKQQPSY 157

Query: 614  XXXXXXXXXXXXXXXPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRKXXXXXXXXX 793
                           PIRVC+DCNTTKTPLWRSGPKGPK+LCNACGIRQRK         
Sbjct: 158  PLETDYSSNNSNNNIPIRVCADCNTTKTPLWRSGPKGPKTLCNACGIRQRK-ARRAMAAA 216

Query: 794  XXXNG----TADQPPAMKIKVQHKL--EKTGKNGHASHFKKRCKXXXXXXXXXXGSPSDG 955
               NG    T     +MK KV   L  EK  K      +KKRCK           S +  
Sbjct: 217  AAANGETLTTETSSSSMKKKVNKHLHKEKITKVNVTVPYKKRCK------FGQSSSSNTE 270

Query: 956  PKKLG-FEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVHG 1087
            PKKLG FEDFLINL+ NLA  ++FP+DEK+AA+LLMALSSGLVHG
Sbjct: 271  PKKLGNFEDFLINLTNNLALHQIFPQDEKEAAVLLMALSSGLVHG 315


>ref|XP_004243958.1| PREDICTED: GATA transcription factor 21 [Solanum lycopersicum]
          Length = 266

 Score =  184 bits (466), Expect = 1e-51
 Identities = 128/293 (43%), Positives = 160/293 (54%), Gaps = 5/293 (1%)
 Frame = +2

Query: 224  NDNNDQHHQPFGPCHIFFNSTQDHMMESYNYDHHPHLYQPRQISDDNLGYHGGSTYEI-- 397
            N NN+    P    H FFNST +    S+++ H  +  Q  Q+  DN    GGS+Y++  
Sbjct: 17   NSNNNSLVTP--NYHFFFNSTTNQTA-SFHHQHTQYYMQHEQLEVDN---DGGSSYDLGK 70

Query: 398  KNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVALKITST 577
            KN+V  SGLKL+LWK+ED                K +SS+++ + + K  +      T++
Sbjct: 71   KNEVG-SGLKLSLWKRED----------------KLLSSEIKKLDQEKKKNS-----TNS 108

Query: 578  ATTKLEQPXXXXXXXXXXXXXXXXXXXPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIR 757
            A  KL+                     PIRVC+DCNTTKTPLWRSGPKGPKSLCNACGIR
Sbjct: 109  ACIKLK---LGDQKQKPIQTDYCSNNIPIRVCTDCNTTKTPLWRSGPKGPKSLCNACGIR 165

Query: 758  QRKXXXXXXXXXXXXNGTADQPPAMKIKVQHKLEKTGK---NGHASHFKKRCKXXXXXXX 928
            QRK             G  DQ    K++ QHK   T K   N      KKRCK       
Sbjct: 166  QRK--ARRAMAAAAAEGKTDQ----KVQ-QHKQNITTKVTSNNDVKPLKKRCK-----FG 213

Query: 929  XXXGSPSDGPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVHG 1087
                S ++ PKKLGFEDFLINLS  LAF ++FP+DE +AAILLMALSSGLVHG
Sbjct: 214  PSSSSTNNAPKKLGFEDFLINLSNKLAFQQIFPQDEMEAAILLMALSSGLVHG 266


>ref|XP_010262144.1| PREDICTED: putative GATA transcription factor 22 [Nelumbo nucifera]
          Length = 316

 Score =  184 bits (468), Expect = 3e-51
 Identities = 130/311 (41%), Positives = 160/311 (51%), Gaps = 23/311 (7%)
 Frame = +2

Query: 224  NDNNDQHHQPFGPCHIFFNSTQDHMMESYN-------YDHHPH-----LYQPRQISDDNL 367
            ND + Q+ Q F P     NS+      S+N       +DH  H       Q RQ +D   
Sbjct: 23   NDEDQQYCQLFSPPPSQTNSSLHSTPLSFNSAREGGSHDHEAHDREQQEQQQRQEADKGS 82

Query: 368  -GYHGGSTY----EIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQ 532
             GYH    Y      KN ++ S L+L++ K+E      DE         +WMSSKMRLM+
Sbjct: 83   EGYHLDFPYPPLQSSKNGINSS-LELSI-KQEIR----DESQSNSTGSARWMSSKMRLMR 136

Query: 533  KMKTPDRVALKITSTATTKL-----EQPXXXXXXXXXXXXXXXXXXXPIRVCSDCNTTKT 697
            KM   DR+     ++  T+      +QP                    +RVCSDCNTTKT
Sbjct: 137  KMMNSDRMGADKPASGNTQKFQDHHQQPSSLEMDSSSSNSSSNNSNITVRVCSDCNTTKT 196

Query: 698  PLWRSGPKGPKSLCNACGIRQRKXXXXXXXXXXXXNGTADQPPAMKIKVQHKLEKTGKNG 877
            PLWRSGP+GPKSLCNACGIRQRK                   P+++ KV HK EK  + G
Sbjct: 197  PLWRSGPRGPKSLCNACGIRQRKARRAMAAAAASGTLLPADTPSLQRKVHHK-EKRSETG 255

Query: 878  HASHFKKRCKXXXXXXXXXXGSPSD-GPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAIL 1054
            +   +KKRCK           +PS    KKL FEDF INLSKN AF RVFP+DEK+AAIL
Sbjct: 256  YVPQYKKRCKL----------APSPRSRKKLCFEDFTINLSKNSAFHRVFPQDEKEAAIL 305

Query: 1055 LMALSSGLVHG 1087
            LMALS GLVHG
Sbjct: 306  LMALSCGLVHG 316


>ref|XP_012848547.1| PREDICTED: putative GATA transcription factor 22 [Erythranthe
            guttata]
          Length = 206

 Score =  180 bits (456), Expect = 7e-51
 Identities = 109/207 (52%), Positives = 124/207 (59%), Gaps = 13/207 (6%)
 Frame = +2

Query: 506  MSSKMRLMQKMKTPDRVALKITS-----TATTKLEQPXXXXXXXXXXXXXXXXXXXPIRV 670
            MSSK+RLM++M        KI S     + ++ LE                     PIRV
Sbjct: 1    MSSKIRLMKRMNKNIPAKSKIDSDQNPSSNSSLLESSDHLSSGNSSSYNNNNNSNYPIRV 60

Query: 671  CSDCNTTKTPLWRSGPKGPKSLCNACGIRQRKXXXXXXXXXXXXNGTA----DQPPAMKI 838
            C+DCNTTKTPLWRSGPKGPKSLCNACGIRQRK            +G        PP +KI
Sbjct: 61   CADCNTTKTPLWRSGPKGPKSLCNACGIRQRKARRAMAAAAAAASGAVVAANQPPPVLKI 120

Query: 839  KVQHKLEKTGK-NGHASHFKKRCKXXXXXXXXXXG---SPSDGPKKLGFEDFLINLSKNL 1006
            KVQHK EK GK NGH+S  KKR K              S ++G KKLGFE+FLINLS NL
Sbjct: 121  KVQHK-EKMGKNNGHSSLLKKRFKTADNNTNAAGSSADSTNNGKKKLGFEEFLINLSNNL 179

Query: 1007 AFGRVFPEDEKDAAILLMALSSGLVHG 1087
            +  RVFP+DEKDAAILLMALSSGLVHG
Sbjct: 180  SIHRVFPDDEKDAAILLMALSSGLVHG 206


>ref|XP_015080599.1| PREDICTED: putative GATA transcription factor 22 [Solanum pennellii]
          Length = 267

 Score =  180 bits (456), Expect = 4e-50
 Identities = 125/293 (42%), Positives = 157/293 (53%), Gaps = 5/293 (1%)
 Frame = +2

Query: 224  NDNNDQHHQPFGPCHIFFNSTQDHMMESYNYDHHPHLYQPRQISDDNLGYHGGSTYEI-- 397
            N NN+    P    H FFNST +     ++     ++    Q+  DN    GGS+Y++  
Sbjct: 17   NSNNNSLVTP--NYHFFFNSTTNQTASFHHQHTQYYMQHEHQLEVDN---DGGSSYDLGK 71

Query: 398  KNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVALKITST 577
            KN+V  SGLKL+LWK+ED                K +SS+++ + + K  +      T++
Sbjct: 72   KNEVG-SGLKLSLWKRED----------------KLLSSEIKKLDQEKKKNS-----TNS 109

Query: 578  ATTKLEQPXXXXXXXXXXXXXXXXXXXPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIR 757
            A  KL+                     PIRVC+DCNTTKTPLWRSGPKGPKSLCNACGIR
Sbjct: 110  ACIKLK---LGDQKQKPIQTDYCSNNIPIRVCTDCNTTKTPLWRSGPKGPKSLCNACGIR 166

Query: 758  QRKXXXXXXXXXXXXNGTADQPPAMKIKVQHKLEKTGK---NGHASHFKKRCKXXXXXXX 928
            QRK             G  DQ    K++ QHK   T K   N      KKRCK       
Sbjct: 167  QRK--ARRAMAAAAAEGKTDQ----KVQ-QHKQNITTKVTSNNDVKPLKKRCK-----FG 214

Query: 929  XXXGSPSDGPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVHG 1087
                S ++ PKKLGFEDFLINLS  LAF ++FP+DE +AAILLMALSSGLVHG
Sbjct: 215  PSSSSTNNAPKKLGFEDFLINLSNKLAFQQIFPQDEMEAAILLMALSSGLVHG 267


>ref|XP_010242203.1| PREDICTED: GATA transcription factor 21-like [Nelumbo nucifera]
          Length = 305

 Score =  178 bits (451), Expect = 7e-49
 Identities = 108/232 (46%), Positives = 135/232 (58%), Gaps = 8/232 (3%)
 Frame = +2

Query: 416  SGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVAL-KITSTATTKL 592
            SGL+L+  K+ ++ G    ++      V+WMSSKMRLM+KMK  DRV + K  +T   K 
Sbjct: 91   SGLELSNSKQRENRGGSQGNM----GSVRWMSSKMRLMRKMKNSDRVGMDKPVNTNMHKF 146

Query: 593  EQPXXXXXXXXXXXXXXXXXXX-----PIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIR 757
            +Q                          +RVCSDCNTTKTPLWRSGP+GPKSLCNACGIR
Sbjct: 147  QQDHHHRSPSPWEMDTSSNSSSNNANNTVRVCSDCNTTKTPLWRSGPRGPKSLCNACGIR 206

Query: 758  QRKXXXXXXXXXXXXNGT--ADQPPAMKIKVQHKLEKTGKNGHASHFKKRCKXXXXXXXX 931
            QRK            NGT    +  +MK KV HK +++ + G+   +KKRCK        
Sbjct: 207  QRK----ARRAMAAANGTLLPTEASSMKNKVHHKEKRSSETGYVQQYKKRCK-------- 254

Query: 932  XXGSPSDGPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVHG 1087
               +     KK+ FEDF INLSKN +F RVFP+DEK+AAILLMALS GLVHG
Sbjct: 255  -LATSPRSMKKVCFEDFTINLSKNSSFHRVFPQDEKEAAILLMALSCGLVHG 305


>ref|XP_012452366.1| PREDICTED: GATA transcription factor 21 [Gossypium raimondii]
            gi|763798209|gb|KJB65164.1| hypothetical protein
            B456_010G082800 [Gossypium raimondii]
          Length = 298

 Score =  177 bits (449), Expect = 1e-48
 Identities = 120/299 (40%), Positives = 155/299 (51%), Gaps = 12/299 (4%)
 Frame = +2

Query: 227  DNNDQHHQPFGPCHIFFNSTQDHMMESYNYDHH---PHLYQPRQISDDNLGYHGGSTYEI 397
            D+  QHH      H+F   +Q     S +  HH       +  Q+  D    +      +
Sbjct: 20   DDQHQHH------HLFTFKSQPSSSSSSSTVHHLAAGSCQREPQVFQDQAKIYVSKDGAL 73

Query: 398  KNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVALKITST 577
            +N     GLKL+LWKKE+ +  D  H   +++  KWM SK+R+++KM +     L  +S+
Sbjct: 74   ENS--DCGLKLSLWKKEERVESDHHH---EDSSTKWMPSKLRILRKMMSSHHTDLSKSSS 128

Query: 578  ATT---KLE-QPXXXXXXXXXXXXXXXXXXXPIRVCSDCNTTKTPLWRSGPKGPKSLCNA 745
                  KL+ QP                   PIRVC+DCNTTKTPLWRSGP+GPKSLCNA
Sbjct: 129  PKIEDQKLQNQPSPSPDNSCNSSYNNGINNSPIRVCADCNTTKTPLWRSGPRGPKSLCNA 188

Query: 746  CGIRQRK--XXXXXXXXXXXXNG---TADQPPAMKIKVQHKLEKTGKNGHASHFKKRCKX 910
            CGIRQRK              NG   TA+   +MK KVQ+K +++     A    K+CK 
Sbjct: 189  CGIRQRKARRAMAAAAAATASNGTIVTAETTTSMKNKVQNKAKRSSNGCVAKLKNKKCK- 247

Query: 911  XXXXXXXXXGSPSDGPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVHG 1087
                      S S G  KL FED  I LSK+ AF  VFP+DEK+AAILLMALS GLVHG
Sbjct: 248  --------LSSQSQGRNKLCFEDLRIILSKSSAFHGVFPQDEKEAAILLMALSYGLVHG 298


>ref|XP_010090039.1| Putative GATA transcription factor 22 [Morus notabilis]
            gi|587848577|gb|EXB38836.1| Putative GATA transcription
            factor 22 [Morus notabilis]
          Length = 335

 Score =  176 bits (445), Expect = 1e-47
 Identities = 124/279 (44%), Positives = 145/279 (51%), Gaps = 22/279 (7%)
 Frame = +2

Query: 317  DHHPHLYQPRQISDDNLGYHGGSTYEIKNKVDQSGLKLTLWK---KEDHMGHDDEHIPQK 487
            DHH  L      SD     H     E ++   Q+ LKL++WK   ++ +  HD       
Sbjct: 70   DHHHKLVSSGGSSD----IHPPRVAESESDHHQNDLKLSIWKSSTEDSNYDHDKSSHVSD 125

Query: 488  NNP---VKWMSSKMRLMQKM-KTPDRVALKITSTA--TTKLEQ-------PXXXXXXXXX 628
            NN     KWM SKMR+M+KM   PD+  +   +    T K +Q                 
Sbjct: 126  NNAGYSAKWMPSKMRMMRKMIVNPDQTNIDHHTPLNFTHKFDQVMKRKHPASPLGTDHSS 185

Query: 629  XXXXXXXXXXPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRKXXXXXXXXXXXXNG 808
                       IRVC+DCNTTKTPLWRSGP+GPKSLCNACGIRQRK            NG
Sbjct: 186  TSSSNNNNNNTIRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAAANG 245

Query: 809  T--ADQPPAMK--IKVQHKLEKTGKNGH--ASHFKKRCKXXXXXXXXXXGSPSDGPKKLG 970
            T  A     MK   KVQ K EK  KNG+     FKKRCK           SPS G KK+ 
Sbjct: 246  TILATDATTMKSSTKVQRK-EKKPKNGNGVVPQFKKRCK--------LTASPSRGRKKIC 296

Query: 971  FEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVHG 1087
            FED  I++SKN AF RVFP+DEKDAAILLMALS GLVHG
Sbjct: 297  FEDLAISISKNSAFQRVFPQDEKDAAILLMALSYGLVHG 335


>gb|KHG09089.1| Putative GATA transcription factor 22 -like protein [Gossypium
            arboreum]
          Length = 305

 Score =  172 bits (436), Expect = 1e-46
 Identities = 122/320 (38%), Positives = 152/320 (47%), Gaps = 33/320 (10%)
 Frame = +2

Query: 227  DNNDQHHQPFG----------------------PCHIFFNSTQDHMMESYNYDHHPHLYQ 340
            D+  QHHQ F                        C   F+ T  H   ++        YQ
Sbjct: 20   DDQHQHHQLFNLISQPSSSSSSSSSSSSPSTSLTCPFSFSPTVQHQQAAF--------YQ 71

Query: 341  --PRQISDDNLGYHGGSTYEIKNKVDQSG-------LKLTLWKKEDHMGHDDEHIPQKNN 493
              P+Q  DD            K  V Q G       L+L++WKKE+ +    E   Q ++
Sbjct: 72   SLPQQFHDDQQDQE-------KIHVPQDGPLRSDCELRLSIWKKEERV----ETHHQSHD 120

Query: 494  PVKWMSSKMRLMQKMKTPDRVALKITSTATTKLEQPXXXXXXXXXXXXXXXXXXXPIRVC 673
              KWM SKMR+M+KM   D   L  + T  ++  Q                     IRVC
Sbjct: 121  SAKWMPSKMRMMRKMMNSDHTDLSNSPTPKSEDHQEQKQPSSSPDNNNST------IRVC 174

Query: 674  SDCNTTKTPLWRSGPKGPKSLCNACGIRQRKXXXXXXXXXXXXNG--TADQPPAMKIKVQ 847
            +DCNTTKTPLWRSGP+GPKSLCNACGIRQRK            +    A+ PP+M+ +VQ
Sbjct: 175  ADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAAAVAAAAASSVVAAETPPSMRSEVQ 234

Query: 848  HKLEKTGKNGHASHFKKRCKXXXXXXXXXXGSPSDGPKKLGFEDFLINLSKNLAFGRVFP 1027
             K +++  NG      K+CK           S S   KKL FED  I LSKN AF  VFP
Sbjct: 235  LKAKRSSNNGVPHLKNKKCK---------HNSQSQSRKKLCFEDLRIILSKNSAFHGVFP 285

Query: 1028 EDEKDAAILLMALSSGLVHG 1087
            +DEK+AAILLMALS GLVHG
Sbjct: 286  QDEKEAAILLMALSYGLVHG 305


>gb|EEF48061.1| hypothetical protein RCOM_1046780 [Ricinus communis]
          Length = 312

 Score =  171 bits (432), Expect = 5e-46
 Identities = 131/312 (41%), Positives = 153/312 (49%), Gaps = 26/312 (8%)
 Frame = +2

Query: 230  NNDQHHQPFGPCH----------------IFFNSTQDHMMESYNYDHHPHLYQPRQISDD 361
            N DQHH     C                 IF N  Q    E   Y H        Q  D+
Sbjct: 17   NEDQHHHQLIFCSKTTTEDASSSSSISYPIFINPPQ----EEVGYYHKELQPLHHQEVDN 72

Query: 362  NLGYHGGS-TYEIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKM 538
                HG S  + I    +++G +L++ KKED     ++   + N+ VKWMSSKMRLM+KM
Sbjct: 73   IYASHGRSWDHRIIKNENENGQELSVCKKEDKSTSIEDQ--RDNSSVKWMSSKMRLMRKM 130

Query: 539  KTPDRVALKITSTATT-KLEQPXXXXXXXXXXXXXXXXXXX----PIRVCSDCNTTKTPL 703
             T D+       T++  KLE                          IRVCSDCNTTKTPL
Sbjct: 131  MTTDQTVNTTQHTSSMHKLEDKEKSRSLPLQDDYSSKNLSDNSNNTIRVCSDCNTTKTPL 190

Query: 704  WRSGPKGPKSLCNACGIRQRKXXXXXXXXXXXXNGT--ADQPPAMKI-KVQHKLEKTGKN 874
            WRSGP+GPKSLCNACGIRQRK            NGT  A    AMK  KVQ+K EK   N
Sbjct: 191  WRSGPRGPKSLCNACGIRQRKARRALAAAQASANGTIFAPDTAAMKTNKVQNK-EKRTNN 249

Query: 875  GHASHFKKRCKXXXXXXXXXXGSPSDGPKKLGFEDFLIN-LSKNLAFGRVFPEDEKDAAI 1051
             H   FKKRCK                 KKL FED     LSKN AF ++FP+DEK+AAI
Sbjct: 250  SHLP-FKKRCK--------FTAQSRGSRKKLCFEDLSSTILSKNSAFQQLFPQDEKEAAI 300

Query: 1052 LLMALSSGLVHG 1087
            LLMALS GLVHG
Sbjct: 301  LLMALSYGLVHG 312


>ref|XP_010049743.1| PREDICTED: GATA transcription factor 21 [Eucalyptus grandis]
            gi|629117828|gb|KCW82503.1| hypothetical protein
            EUGRSUZ_C03895 [Eucalyptus grandis]
          Length = 340

 Score =  171 bits (433), Expect = 7e-46
 Identities = 122/304 (40%), Positives = 149/304 (49%), Gaps = 28/304 (9%)
 Frame = +2

Query: 260  PCHIFFNST-QDHMMESYNY-DHHP--HLYQPRQISDDNLGY---HGGSTYEIKNKVDQS 418
            P  IF+N   QDH   S  Y DHHP   L  P+++    +     HGG            
Sbjct: 48   PSPIFYNLLGQDHQGGSCKYEDHHPLEKLQHPQEVDHKPVSQGESHGGH----------- 96

Query: 419  GLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKM-----------KTPDRVALK 565
            GL +TLWKKE+    D+     K   VKWMSSKMR+M+KM            TP   +  
Sbjct: 97   GLNITLWKKEEGGVRDESDQVDKYYSVKWMSSKMRIMKKMMSSDHHEQSQDNTPMMTSGS 156

Query: 566  ITSTATTKL------EQPXXXXXXXXXXXXXXXXXXXPIRVCSDCNTTKTPLWRSGPKGP 727
            I++    KL        P                   PIRVCSDCNTTKTPLWRSGP+GP
Sbjct: 157  ISAGGDHKLFVEDHKRLPSSSSLETDSSSNSSSYDTNPIRVCSDCNTTKTPLWRSGPRGP 216

Query: 728  KSLCNACGIRQRKXXXXXXXXXXXXNGT--ADQPPAMKIKVQHKLE--KTGKNGHASHFK 895
            KSLCNACGIRQRK              T   D PPA  +  + KL+  K  ++  A+   
Sbjct: 217  KSLCNACGIRQRKARRAMAAAAAAAAATDSGDLPPATTLTPKSKLKQHKNKRSSRATVCG 276

Query: 896  KRCKXXXXXXXXXXGSPSDGPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSG 1075
            +                S G +KL  E+F I LSK+LAF RVFP++EK+AAILLMALS G
Sbjct: 277  QESIKTKRKAKPHKNHASVGGRKLCLEEFAIRLSKHLAFQRVFPQEEKEAAILLMALSYG 336

Query: 1076 LVHG 1087
            LV G
Sbjct: 337  LVPG 340

