BLASTX nr result

ID: Cornus23_contig00018751

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00018751
         (944 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010656197.1| PREDICTED: GATA transcription factor 21 isof...   212   4e-52
ref|XP_002282173.1| PREDICTED: GATA transcription factor 21 isof...   212   4e-52
ref|XP_007012845.1| GATA type zinc finger transcription factor f...   185   5e-44
ref|XP_009604300.1| PREDICTED: putative GATA transcription facto...   182   4e-43
ref|XP_002514107.1| hypothetical protein RCOM_1046780 [Ricinus c...   177   8e-42
ref|XP_012452366.1| PREDICTED: GATA transcription factor 21 [Gos...   177   1e-41
ref|XP_010262144.1| PREDICTED: putative GATA transcription facto...   175   4e-41
ref|XP_010242203.1| PREDICTED: GATA transcription factor 21-like...   172   4e-40
ref|XP_009768461.1| PREDICTED: putative GATA transcription facto...   170   2e-39
ref|XP_010090039.1| Putative GATA transcription factor 22 [Morus...   169   3e-39
ref|XP_006346565.1| PREDICTED: GATA transcription factor 21-like...   169   3e-39
emb|CDP03165.1| unnamed protein product [Coffea canephora]            166   2e-38
gb|KHG09089.1| Putative GATA transcription factor 22 -like prote...   166   3e-38
ref|XP_012447374.1| PREDICTED: GATA transcription factor 21-like...   164   9e-38
ref|XP_004243958.1| PREDICTED: GATA transcription factor 21 [Sol...   164   9e-38
ref|XP_012076922.1| PREDICTED: putative GATA transcription facto...   160   2e-36
ref|XP_012076920.1| PREDICTED: putative GATA transcription facto...   160   2e-36
gb|EYU27295.1| hypothetical protein MIMGU_mgv1a020800mg [Erythra...   158   7e-36
ref|XP_006451458.1| hypothetical protein CICLE_v10009004mg [Citr...   155   4e-35
emb|CAN63090.1| hypothetical protein VITISV_032017 [Vitis vinifera]   154   7e-35

>ref|XP_010656197.1| PREDICTED: GATA transcription factor 21 isoform X1 [Vitis vinifera]
          Length = 310

 Score =  212 bits (539), Expect = 4e-52
 Identities = 130/241 (53%), Positives = 155/241 (64%), Gaps = 18/241 (7%)
 Frame = -1

Query: 881 NNFGWHGGSYD---LENESDSGLNLSLWKKEDRDENHTENNSVXXXXXXXXXXXXXXXSD 711
           + F + GGSYD   LE+ESD+GL L++WK EDR+ENH+EN SV               SD
Sbjct: 74  DKFVFRGGSYDHPTLESESDNGLKLTIWKTEDRNENHSENGSVKWMSSKMRVMQKMMISD 133

Query: 710 QLIAKK-------LEDLKQPISSPTATDDHN------NSNIPVRVCSDCNTSKTPLWRSG 570
           Q  A+K         D KQ  S P+ TD ++      NSN  +RVC+DCNT+KTPLWRSG
Sbjct: 134 QTGAQKPSNTALNFGDHKQQ-SLPSETDYNSINSSNINSNNTIRVCADCNTTKTPLWRSG 192

Query: 569 PRGPKSLCNACGIRQXXXXXXXXXXXXXXKGTILATDEASTMKTKVALKNKRSSNGHVAR 390
           PRGPKSLCNACGIRQ               GTIL T+ A T KTK   K+K+SSNGHV+ 
Sbjct: 193 PRGPKSLCNACGIRQRKARRAMAAAAATANGTILPTNTAPT-KTKAKHKDKKSSNGHVSH 251

Query: 389 HKKRSKF-AIPSDDHAREKLCFEDFFINLNTNLSFQRVFPQDE-KEAAILLMALSCGLVH 216
           +KKR K  A PS +   +KLCFEDF I+L+ N +F RVF QDE KEAAILLMALSCGLVH
Sbjct: 252 YKKRCKLAAAPSCE--TKKLCFEDFTISLSKNSAFHRVFLQDEIKEAAILLMALSCGLVH 309

Query: 215 G 213
           G
Sbjct: 310 G 310


>ref|XP_002282173.1| PREDICTED: GATA transcription factor 21 isoform X2 [Vitis vinifera]
           gi|297738668|emb|CBI27913.3| unnamed protein product
           [Vitis vinifera]
          Length = 309

 Score =  212 bits (539), Expect = 4e-52
 Identities = 130/241 (53%), Positives = 155/241 (64%), Gaps = 18/241 (7%)
 Frame = -1

Query: 881 NNFGWHGGSYD---LENESDSGLNLSLWKKEDRDENHTENNSVXXXXXXXXXXXXXXXSD 711
           + F + GGSYD   LE+ESD+GL L++WK EDR+ENH+EN SV               SD
Sbjct: 73  DKFVFRGGSYDHPTLESESDNGLKLTIWKTEDRNENHSENGSVKWMSSKMRVMQKMMISD 132

Query: 710 QLIAKK-------LEDLKQPISSPTATDDHN------NSNIPVRVCSDCNTSKTPLWRSG 570
           Q  A+K         D KQ  S P+ TD ++      NSN  +RVC+DCNT+KTPLWRSG
Sbjct: 133 QTGAQKPSNTALNFGDHKQQ-SLPSETDYNSINSSNINSNNTIRVCADCNTTKTPLWRSG 191

Query: 569 PRGPKSLCNACGIRQXXXXXXXXXXXXXXKGTILATDEASTMKTKVALKNKRSSNGHVAR 390
           PRGPKSLCNACGIRQ               GTIL T+ A T KTK   K+K+SSNGHV+ 
Sbjct: 192 PRGPKSLCNACGIRQRKARRAMAAAAATANGTILPTNTAPT-KTKAKHKDKKSSNGHVSH 250

Query: 389 HKKRSKF-AIPSDDHAREKLCFEDFFINLNTNLSFQRVFPQDE-KEAAILLMALSCGLVH 216
           +KKR K  A PS +   +KLCFEDF I+L+ N +F RVF QDE KEAAILLMALSCGLVH
Sbjct: 251 YKKRCKLAAAPSCE--TKKLCFEDFTISLSKNSAFHRVFLQDEIKEAAILLMALSCGLVH 308

Query: 215 G 213
           G
Sbjct: 309 G 309


>ref|XP_007012845.1| GATA type zinc finger transcription factor family protein, putative
           [Theobroma cacao] gi|508783208|gb|EOY30464.1| GATA type
           zinc finger transcription factor family protein,
           putative [Theobroma cacao]
          Length = 302

 Score =  185 bits (469), Expect = 5e-44
 Identities = 117/220 (53%), Positives = 139/220 (63%), Gaps = 11/220 (5%)
 Frame = -1

Query: 839 ESDSGLNLSLWKKEDRDENHT-ENNSVXXXXXXXXXXXXXXXSDQL-----IAKKLEDLK 678
           ESDSGLNLSL KKE+ +E+H  E++S                SD+         KLE+ K
Sbjct: 86  ESDSGLNLSLRKKEEGNEHHQIEDSSAKWMSSKMRMMRKMMSSDRADLSNSSTPKLEEPK 145

Query: 677 Q-PISSPTATDD---HNNSNIPVRVCSDCNTSKTPLWRSGPRGPKSLCNACGIRQXXXXX 510
           Q P SSP  + +   +NN NI +RVC+DCNT+KTPLWRSGPRGPKSLCNACGIRQ     
Sbjct: 146 QQPSSSPDNSSNSSYNNNDNITIRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQ-RKAR 204

Query: 509 XXXXXXXXXKGTILATDEASTMKTKVALKNKRSSN-GHVARHKKRSKFAIPSDDHAREKL 333
                     G I+A     TMK+KV  K+KRSSN G VA+ KK+ K +  S    R+KL
Sbjct: 205 RAMAAAAAANGAIVAAQTTPTMKSKVQDKSKRSSNSGCVAQLKKKCKHS--SQSQGRKKL 262

Query: 332 CFEDFFINLNTNLSFQRVFPQDEKEAAILLMALSCGLVHG 213
           CFED  I L+ N +F RVFPQDEKEAAILLMALS GLVHG
Sbjct: 263 CFEDLRIILSKNSAFHRVFPQDEKEAAILLMALSYGLVHG 302


>ref|XP_009604300.1| PREDICTED: putative GATA transcription factor 22 [Nicotiana
           tomentosiformis]
          Length = 315

 Score =  182 bits (461), Expect = 4e-43
 Identities = 107/232 (46%), Positives = 132/232 (56%), Gaps = 15/232 (6%)
 Frame = -1

Query: 863 GGSYDLE--NESDSGLNLSLWKKEDRDENHTENNSVXXXXXXXXXXXXXXXS-----DQL 705
           GG YD+E  N+  SGL L+LWK+ED++EN  E N V                        
Sbjct: 84  GGLYDVEKKNKVGSGLKLTLWKREDKNENQNEKNPVKWMSSKIKASDQTILEKTTNNSTY 143

Query: 704 IAKKLEDLKQPISSPTATD---DHNNSNIPVRVCSDCNTSKTPLWRSGPRGPKSLCNACG 534
           I  KLED KQ  S P  TD   +++N+NIP+RVC+DCNT+KTPLWRSGP+GPK+LCNACG
Sbjct: 144 IKLKLEDQKQQPSYPLETDYSSNNSNNNIPIRVCADCNTTKTPLWRSGPKGPKTLCNACG 203

Query: 533 IRQXXXXXXXXXXXXXXKGTILATDEASTMKTKV---ALKNKRSSNGHVARHKKRSKFAI 363
           IRQ                T+     +S+MK KV     K K +       +KKR KF  
Sbjct: 204 IRQRKARRAMAAAAAANGETLTTETSSSSMKKKVNKHLHKEKITKVNVTVPYKKRCKFGQ 263

Query: 362 PSDDHAREKLC--FEDFFINLNTNLSFQRVFPQDEKEAAILLMALSCGLVHG 213
            S  +   K    FEDF INL  NL+  ++FPQDEKEAA+LLMALS GLVHG
Sbjct: 264 SSSSNTEPKKLGNFEDFLINLTNNLALHQIFPQDEKEAAVLLMALSSGLVHG 315


>ref|XP_002514107.1| hypothetical protein RCOM_1046780 [Ricinus communis]
           gi|223546563|gb|EEF48061.1| hypothetical protein
           RCOM_1046780 [Ricinus communis]
          Length = 312

 Score =  177 bits (450), Expect = 8e-42
 Identities = 112/243 (46%), Positives = 141/243 (58%), Gaps = 20/243 (8%)
 Frame = -1

Query: 881 NNFGWHGGSYD---LENESDSGLNLSLWKKEDRD---ENHTENNSVXXXXXXXXXXXXXX 720
           N +  HG S+D   ++NE+++G  LS+ KKED+    E+  +N+SV              
Sbjct: 72  NIYASHGRSWDHRIIKNENENGQELSVCKKEDKSTSIEDQRDNSSVKWMSSKMRLMRKMM 131

Query: 719 XSDQLI--------AKKLEDLKQPISSPTATDDHN-----NSNIPVRVCSDCNTSKTPLW 579
            +DQ +          KLED ++  S P   D  +     NSN  +RVCSDCNT+KTPLW
Sbjct: 132 TTDQTVNTTQHTSSMHKLEDKEKSRSLPLQDDYSSKNLSDNSNNTIRVCSDCNTTKTPLW 191

Query: 578 RSGPRGPKSLCNACGIRQXXXXXXXXXXXXXXKGTILATDEASTMKTKVALKNKRSSNGH 399
           RSGPRGPKSLCNACGIRQ               GTI A D A+    KV  K KR++N H
Sbjct: 192 RSGPRGPKSLCNACGIRQRKARRALAAAQASANGTIFAPDTAAMKTNKVQNKEKRTNNSH 251

Query: 398 VARHKKRSKFAIPSDDHAREKLCFEDFFIN-LNTNLSFQRVFPQDEKEAAILLMALSCGL 222
           +   KKR KF   S   +R+KLCFED     L+ N +FQ++FPQDEKEAAILLMALS GL
Sbjct: 252 LP-FKKRCKFTAQSRG-SRKKLCFEDLSSTILSKNSAFQQLFPQDEKEAAILLMALSYGL 309

Query: 221 VHG 213
           VHG
Sbjct: 310 VHG 312


>ref|XP_012452366.1| PREDICTED: GATA transcription factor 21 [Gossypium raimondii]
           gi|763798209|gb|KJB65164.1| hypothetical protein
           B456_010G082800 [Gossypium raimondii]
          Length = 298

 Score =  177 bits (448), Expect = 1e-41
 Identities = 111/228 (48%), Positives = 136/228 (59%), Gaps = 16/228 (7%)
 Frame = -1

Query: 848 LENESDSGLNLSLWKKEDR---DENHTENNSVXXXXXXXXXXXXXXXSDQLIAK----KL 690
           LEN SD GL LSLWKKE+R   D +H ++++                    ++K    K+
Sbjct: 73  LEN-SDCGLKLSLWKKEERVESDHHHEDSSTKWMPSKLRILRKMMSSHHTDLSKSSSPKI 131

Query: 689 EDLK---QPISSP--TATDDHNN--SNIPVRVCSDCNTSKTPLWRSGPRGPKSLCNACGI 531
           ED K   QP  SP  +    +NN  +N P+RVC+DCNT+KTPLWRSGPRGPKSLCNACGI
Sbjct: 132 EDQKLQNQPSPSPDNSCNSSYNNGINNSPIRVCADCNTTKTPLWRSGPRGPKSLCNACGI 191

Query: 530 RQ--XXXXXXXXXXXXXXKGTILATDEASTMKTKVALKNKRSSNGHVARHKKRSKFAIPS 357
           RQ                 GTI+  +  ++MK KV  K KRSSNG VA+ K + K  + S
Sbjct: 192 RQRKARRAMAAAAAATASNGTIVTAETTTSMKNKVQNKAKRSSNGCVAKLKNK-KCKLSS 250

Query: 356 DDHAREKLCFEDFFINLNTNLSFQRVFPQDEKEAAILLMALSCGLVHG 213
               R KLCFED  I L+ + +F  VFPQDEKEAAILLMALS GLVHG
Sbjct: 251 QSQGRNKLCFEDLRIILSKSSAFHGVFPQDEKEAAILLMALSYGLVHG 298


>ref|XP_010262144.1| PREDICTED: putative GATA transcription factor 22 [Nelumbo nucifera]
          Length = 316

 Score =  175 bits (444), Expect = 4e-41
 Identities = 113/225 (50%), Positives = 137/225 (60%), Gaps = 14/225 (6%)
 Frame = -1

Query: 845 ENESDSGLNLSLWKKEDRDENHTENN-SVXXXXXXXXXXXXXXXSDQLIAKK-------- 693
           +N  +S L LS+ K+E RDE+ + +  S                SD++ A K        
Sbjct: 98  KNGINSSLELSI-KQEIRDESQSNSTGSARWMSSKMRLMRKMMNSDRMGADKPASGNTQK 156

Query: 692 -LEDLKQPIS----SPTATDDHNNSNIPVRVCSDCNTSKTPLWRSGPRGPKSLCNACGIR 528
             +  +QP S    S ++    NNSNI VRVCSDCNT+KTPLWRSGPRGPKSLCNACGIR
Sbjct: 157 FQDHHQQPSSLEMDSSSSNSSSNNSNITVRVCSDCNTTKTPLWRSGPRGPKSLCNACGIR 216

Query: 527 QXXXXXXXXXXXXXXKGTILATDEASTMKTKVALKNKRSSNGHVARHKKRSKFAIPSDDH 348
           Q               GT+L  D  S ++ KV  K KRS  G+V ++KKR K A PS   
Sbjct: 217 Q--RKARRAMAAAAASGTLLPADTPS-LQRKVHHKEKRSETGYVPQYKKRCKLA-PS-PR 271

Query: 347 AREKLCFEDFFINLNTNLSFQRVFPQDEKEAAILLMALSCGLVHG 213
           +R+KLCFEDF INL+ N +F RVFPQDEKEAAILLMALSCGLVHG
Sbjct: 272 SRKKLCFEDFTINLSKNSAFHRVFPQDEKEAAILLMALSCGLVHG 316


>ref|XP_010242203.1| PREDICTED: GATA transcription factor 21-like [Nelumbo nucifera]
          Length = 305

 Score =  172 bits (435), Expect = 4e-40
 Identities = 94/150 (62%), Positives = 108/150 (72%), Gaps = 1/150 (0%)
 Frame = -1

Query: 659 TATDDHNNSNIPVRVCSDCNTSKTPLWRSGPRGPKSLCNACGIRQXXXXXXXXXXXXXXK 480
           ++    NN+N  VRVCSDCNT+KTPLWRSGPRGPKSLCNACGIRQ               
Sbjct: 163 SSNSSSNNANNTVRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQ----RKARRAMAAAN 218

Query: 479 GTILATDEASTMKTKVALKNKRSS-NGHVARHKKRSKFAIPSDDHAREKLCFEDFFINLN 303
           GT+L T EAS+MK KV  K KRSS  G+V ++KKR K A  +   + +K+CFEDF INL+
Sbjct: 219 GTLLPT-EASSMKNKVHHKEKRSSETGYVQQYKKRCKLA--TSPRSMKKVCFEDFTINLS 275

Query: 302 TNLSFQRVFPQDEKEAAILLMALSCGLVHG 213
            N SF RVFPQDEKEAAILLMALSCGLVHG
Sbjct: 276 KNSSFHRVFPQDEKEAAILLMALSCGLVHG 305


>ref|XP_009768461.1| PREDICTED: putative GATA transcription factor 22 [Nicotiana
           sylvestris]
          Length = 317

 Score =  170 bits (430), Expect = 2e-39
 Identities = 107/235 (45%), Positives = 137/235 (58%), Gaps = 18/235 (7%)
 Frame = -1

Query: 863 GGSYDLE--NESDSGLNLSLWKKEDR-DENHTENNSVXXXXXXXXXXXXXXXS----DQL 705
           GGSYD+E  N+  SGL L+LWK+ED+  EN  E + V                    +  
Sbjct: 84  GGSYDVEKKNKVGSGLKLTLWKREDKYHENQNEKDPVKWMSSKINTSDGARMEKTTNNTY 143

Query: 704 IAKKLEDLKQPISSPTATDDHN-------NSNIPVRVCSDCNTSKTPLWRSGPRGPKSLC 546
           I  KLED KQ  S P  + D++       N+NIP+RVC+DCNT+KTPLWRSGP+GPK+LC
Sbjct: 144 IKLKLEDQKQQPSYPLESTDYSSNSSNNSNNNIPIRVCADCNTTKTPLWRSGPKGPKTLC 203

Query: 545 NACGIRQXXXXXXXXXXXXXXKGTILATDEASTMKTKVA-LKNKRSSNGHVA-RHKKRSK 372
           NACGIRQ              + T+     +S+MK KV  L  ++ +N +V   +KKR K
Sbjct: 204 NACGIRQRKARRAMAAAAANGE-TLTTETSSSSMKKKVKHLHKEKITNVNVTLPYKKRCK 262

Query: 371 FAIPSDDHAREKLC--FEDFFINLNTNLSFQRVFPQDEKEAAILLMALSCGLVHG 213
           F   S  +   K    FEDF I L+ NL+  ++FPQDEKEAAILLMALS GLVHG
Sbjct: 263 FGPSSSSNTAPKKLGNFEDFLIKLSNNLALHQIFPQDEKEAAILLMALSSGLVHG 317


>ref|XP_010090039.1| Putative GATA transcription factor 22 [Morus notabilis]
           gi|587848577|gb|EXB38836.1| Putative GATA transcription
           factor 22 [Morus notabilis]
          Length = 335

 Score =  169 bits (428), Expect = 3e-39
 Identities = 113/259 (43%), Positives = 140/259 (54%), Gaps = 42/259 (16%)
 Frame = -1

Query: 863 GGSYDL------ENESD---SGLNLSLWKKEDRDENHTENNSVXXXXXXXXXXXXXXXSD 711
           GGS D+      E+ESD   + L LS+WK    D N+  + S                S 
Sbjct: 79  GGSSDIHPPRVAESESDHHQNDLKLSIWKSSTEDSNYDHDKSSHVSDNNAGYSAKWMPSK 138

Query: 710 QLIAKKLE--------DLKQPIS---------------SPTATD------DHNNSNIPVR 618
             + +K+         D   P++               SP  TD       +NN+N  +R
Sbjct: 139 MRMMRKMIVNPDQTNIDHHTPLNFTHKFDQVMKRKHPASPLGTDHSSTSSSNNNNNNTIR 198

Query: 617 VCSDCNTSKTPLWRSGPRGPKSLCNACGIRQXXXXXXXXXXXXXXKGTILATDEASTMK- 441
           VC+DCNT+KTPLWRSGPRGPKSLCNACGIRQ               GTILATD A+TMK 
Sbjct: 199 VCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAAANGTILATD-ATTMKS 257

Query: 440 -TKVALKNKRSSNGH--VARHKKRSKFAIPSDDHAREKLCFEDFFINLNTNLSFQRVFPQ 270
            TKV  K K+  NG+  V + KKR K    S    R+K+CFED  I+++ N +FQRVFPQ
Sbjct: 258 STKVQRKEKKPKNGNGVVPQFKKRCKLT-ASPSRGRKKICFEDLAISISKNSAFQRVFPQ 316

Query: 269 DEKEAAILLMALSCGLVHG 213
           DEK+AAILLMALS GLVHG
Sbjct: 317 DEKDAAILLMALSYGLVHG 335


>ref|XP_006346565.1| PREDICTED: GATA transcription factor 21-like [Solanum tuberosum]
          Length = 222

 Score =  169 bits (428), Expect = 3e-39
 Identities = 109/232 (46%), Positives = 134/232 (57%), Gaps = 8/232 (3%)
 Frame = -1

Query: 884 VNNFGWHGGSYDL--ENESDSGLNLSLWKKEDRDENHTENNSVXXXXXXXXXXXXXXXSD 711
           V+N G  G SYDL  +N+  SGL LSLWK+ED+    +E   +                +
Sbjct: 9   VDNDG--GSSYDLGKKNKGGSGLKLSLWKREDKLVMSSEIKDLDQERKKNITN------N 60

Query: 710 QLIAKKLEDLKQPISSPTATDDHNNSNIPVRVCSDCNTSKTPLWRSGPRGPKSLCNACGI 531
             I  KL D KQ    P  TD ++++NIP+RVC+DCNT+KTPLWRSGP+GPKSLCNACGI
Sbjct: 61  DCIKLKLGDQKQ---QPIQTD-YSSNNIPIRVCTDCNTTKTPLWRSGPKGPKSLCNACGI 116

Query: 530 RQXXXXXXXXXXXXXXKGTILATDEASTMKTKVALK----NKRSSNGHVARHKKRSKFAI 363
           RQ                    TD  + MK KV        K  +N HV   KKR K   
Sbjct: 117 RQRKARRAMAAAANG------KTDHQTAMKIKVQQHKPNITKVRTNNHVTPFKKRCKLGP 170

Query: 362 PSD--DHAREKLCFEDFFINLNTNLSFQRVFPQDEKEAAILLMALSCGLVHG 213
            S   ++A +KL FED  INL+  L+FQ++FPQDEKEAAILLMALS GLVHG
Sbjct: 171 SSSGTNNAPKKLGFEDLLINLSNQLAFQQIFPQDEKEAAILLMALSSGLVHG 222


>emb|CDP03165.1| unnamed protein product [Coffea canephora]
          Length = 318

 Score =  166 bits (421), Expect = 2e-38
 Identities = 106/246 (43%), Positives = 134/246 (54%), Gaps = 22/246 (8%)
 Frame = -1

Query: 884 VNNFGWHGGSYDLENESDSGLNLSLWKK-------EDRDENHTENNSVXXXXXXXXXXXX 726
           V N   + GS D E +++ G  +SLWK        +D +E +  NN              
Sbjct: 73  VENHAPYTGSQDPEKKANKGSKISLWKNNTNGNQADDHEEINPVNNKWVSSKVKLMQKMN 132

Query: 725 XXXSDQLIAK-----KLED-LKQPISSPTATDDH------NNSNIPVRVCSDCNTSKTPL 582
                ++ +      K ED  KQP S+    D+       N SN P+RVC+DCNT+KTPL
Sbjct: 133 KPDLKEITSSTTTTMKFEDHQKQPTSASPEADNFSSNSSSNISNTPIRVCADCNTTKTPL 192

Query: 581 WRSGPRGPKSLCNACGIRQXXXXXXXXXXXXXXKGTILAT-DEASTMKTKVALKNKRSSN 405
           WRSGP+GPKSLCNACGIRQ               GT   T D  + +K KV  K+K  +N
Sbjct: 193 WRSGPKGPKSLCNACGIRQRKARRAMAAAAAAANGTSPPTYDTTAPLKVKVQNKDKLKNN 252

Query: 404 GHVARHKKRSKFAIPSDD-HA-REKLCFEDFFINLNTNLSFQRVFPQDEKEAAILLMALS 231
           G   +  K +  A  S + HA ++K  FEDF  NL+ NL+F RVFPQDEKEAAILLMALS
Sbjct: 253 GQFKKRCKLNTSAESSQNLHAVQKKSGFEDFLFNLSKNLAFHRVFPQDEKEAAILLMALS 312

Query: 230 CGLVHG 213
           CGLVHG
Sbjct: 313 CGLVHG 318


>gb|KHG09089.1| Putative GATA transcription factor 22 -like protein [Gossypium
           arboreum]
          Length = 305

 Score =  166 bits (419), Expect = 3e-38
 Identities = 107/217 (49%), Positives = 127/217 (58%), Gaps = 9/217 (4%)
 Frame = -1

Query: 836 SDSGLNLSLWKKEDRDENHTENN-SVXXXXXXXXXXXXXXXSDQLIAK-----KLED--- 684
           SD  L LS+WKKE+R E H +++ S                SD          K ED   
Sbjct: 97  SDCELRLSIWKKEERVETHHQSHDSAKWMPSKMRMMRKMMNSDHTDLSNSPTPKSEDHQE 156

Query: 683 LKQPISSPTATDDHNNSNIPVRVCSDCNTSKTPLWRSGPRGPKSLCNACGIRQXXXXXXX 504
            KQP SSP    D+NNS I  RVC+DCNT+KTPLWRSGPRGPKSLCNACGIRQ       
Sbjct: 157 QKQPSSSP----DNNNSTI--RVCADCNTTKTPLWRSGPRGPKSLCNACGIRQ-RKARRA 209

Query: 503 XXXXXXXKGTILATDEASTMKTKVALKNKRSSNGHVARHKKRSKFAIPSDDHAREKLCFE 324
                    +++A +   +M+++V LK KRSSN  V  H K  K    S   +R+KLCFE
Sbjct: 210 AAVAAAAASSVVAAETPPSMRSEVQLKAKRSSNNGVP-HLKNKKCKHNSQSQSRKKLCFE 268

Query: 323 DFFINLNTNLSFQRVFPQDEKEAAILLMALSCGLVHG 213
           D  I L+ N +F  VFPQDEKEAAILLMALS GLVHG
Sbjct: 269 DLRIILSKNSAFHGVFPQDEKEAAILLMALSYGLVHG 305


>ref|XP_012447374.1| PREDICTED: GATA transcription factor 21-like [Gossypium raimondii]
           gi|763787365|gb|KJB54361.1| hypothetical protein
           B456_009G031400 [Gossypium raimondii]
          Length = 301

 Score =  164 bits (415), Expect = 9e-38
 Identities = 105/214 (49%), Positives = 125/214 (58%), Gaps = 6/214 (2%)
 Frame = -1

Query: 836 SDSGLNLSLWKKEDRDENHTENNSVXXXXXXXXXXXXXXXSD--QLIAKKLED---LKQP 672
           SD  L LS+WKKE+R E H +++                 +D       K ED    KQP
Sbjct: 97  SDCELRLSIWKKEERVETHHQSHDSAKWMPSKTRMMSSDHTDLSNSPTPKSEDHQEQKQP 156

Query: 671 ISSPTATDDHNNSNIPVRVCSDCNTSKTPLWRSGPRGPKSLCNACGIRQXXXXXXXXXXX 492
            SSP    D+NNS I  RVC+DCNT+KTPLWRSGPRGPKSLCNACGIRQ           
Sbjct: 157 SSSP----DNNNSTI--RVCADCNTTKTPLWRSGPRGPKSLCNACGIRQ--RKARRAAAA 208

Query: 491 XXXKGTILATDEASTMKTKV-ALKNKRSSNGHVARHKKRSKFAIPSDDHAREKLCFEDFF 315
               G ++A +   +MK++V  LK KRS N     H K  K  + S   +R+KLCFED  
Sbjct: 209 AAAAGIVVAAETPPSMKSEVQRLKAKRSINDGFP-HLKNKKCKLNSQSQSRKKLCFEDLR 267

Query: 314 INLNTNLSFQRVFPQDEKEAAILLMALSCGLVHG 213
           I L+ N +F  VFPQDEKEAAILLMALS GLVHG
Sbjct: 268 IILSKNSAFHGVFPQDEKEAAILLMALSYGLVHG 301


>ref|XP_004243958.1| PREDICTED: GATA transcription factor 21 [Solanum lycopersicum]
          Length = 266

 Score =  164 bits (415), Expect = 9e-38
 Identities = 110/237 (46%), Positives = 136/237 (57%), Gaps = 13/237 (5%)
 Frame = -1

Query: 884 VNNFGWHGGSYDL--ENESDSGLNLSLWKKEDR---------DENHTENNSVXXXXXXXX 738
           V+N G  G SYDL  +NE  SGL LSLWK+ED+         D+   +N++         
Sbjct: 58  VDNDG--GSSYDLGKKNEVGSGLKLSLWKREDKLLSSEIKKLDQEKKKNST--------- 106

Query: 737 XXXXXXXSDQLIAKKLEDLKQPISSPTATDDHNNSNIPVRVCSDCNTSKTPLWRSGPRGP 558
                  +   I  KL D KQ    P  TD  +N NIP+RVC+DCNT+KTPLWRSGP+GP
Sbjct: 107 -------NSACIKLKLGDQKQ---KPIQTDYCSN-NIPIRVCTDCNTTKTPLWRSGPKGP 155

Query: 557 KSLCNACGIRQXXXXXXXXXXXXXXKGTILATDEASTMKTKVALKNKRSSNGHVARHKKR 378
           KSLCNACGIRQ                    TD+    + K  +  K +SN  V   KKR
Sbjct: 156 KSLCNACGIRQRKARRAMAAAAAEG-----KTDQ-KVQQHKQNITTKVTSNNDVKPLKKR 209

Query: 377 SKF--AIPSDDHAREKLCFEDFFINLNTNLSFQRVFPQDEKEAAILLMALSCGLVHG 213
            KF  +  S ++A +KL FEDF INL+  L+FQ++FPQDE EAAILLMALS GLVHG
Sbjct: 210 CKFGPSSSSTNNAPKKLGFEDFLINLSNKLAFQQIFPQDEMEAAILLMALSSGLVHG 266


>ref|XP_012076922.1| PREDICTED: putative GATA transcription factor 22 isoform X2
           [Jatropha curcas]
          Length = 304

 Score =  160 bits (404), Expect = 2e-36
 Identities = 101/226 (44%), Positives = 134/226 (59%), Gaps = 14/226 (6%)
 Frame = -1

Query: 848 LENESDSGLNLSLWKKED-RDENHTENNSVXXXXXXXXXXXXXXXSDQLIA-------KK 693
           L NE+ +GL + + KK++  D++  EN SV               SDQ+ +       ++
Sbjct: 82  LSNENKNGLTIPVSKKQETNDQDQRENTSVKWMSSKMRLMRKMMSSDQMESNNYVHKFEE 141

Query: 692 LEDLKQPI----SSPTATDDHNNSNIPVRVCSDCNTSKTPLWRSGPRGPKSLCNACGIRQ 525
            +D   P+    SS   ++++NN+N  +RVC+DCNT+KTPLWRSGPRGPKSLCNACGIRQ
Sbjct: 142 KKDRSSPLQDDNSSKNFSNNNNNNNNSIRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQ 201

Query: 524 XXXXXXXXXXXXXXKGTILATDEASTMKTKVALKNKRSSNGHVARHKKRSKFAIPSDDHA 345
                          GTI A  E   MKTK   K  +++N H+   KKR +F   +    
Sbjct: 202 RKARRALAAAQANANGTICA-PEIPAMKTKAQSKEGKANNNHLP-FKKRCRFTAQARGR- 258

Query: 344 REKLCFEDFF-INLNTNLSF-QRVFPQDEKEAAILLMALSCGLVHG 213
           + KLCFED+  I L+ N +F Q+VFPQDEKEAAILLMALS GLVHG
Sbjct: 259 KNKLCFEDYLSIILSKNSAFNQQVFPQDEKEAAILLMALSYGLVHG 304


>ref|XP_012076920.1| PREDICTED: putative GATA transcription factor 22 isoform X1
           [Jatropha curcas] gi|643724627|gb|KDP33828.1|
           hypothetical protein JCGZ_07399 [Jatropha curcas]
          Length = 305

 Score =  160 bits (404), Expect = 2e-36
 Identities = 101/226 (44%), Positives = 134/226 (59%), Gaps = 14/226 (6%)
 Frame = -1

Query: 848 LENESDSGLNLSLWKKED-RDENHTENNSVXXXXXXXXXXXXXXXSDQLIA-------KK 693
           L NE+ +GL + + KK++  D++  EN SV               SDQ+ +       ++
Sbjct: 83  LSNENKNGLTIPVSKKQETNDQDQRENTSVKWMSSKMRLMRKMMSSDQMESNNYVHKFEE 142

Query: 692 LEDLKQPI----SSPTATDDHNNSNIPVRVCSDCNTSKTPLWRSGPRGPKSLCNACGIRQ 525
            +D   P+    SS   ++++NN+N  +RVC+DCNT+KTPLWRSGPRGPKSLCNACGIRQ
Sbjct: 143 KKDRSSPLQDDNSSKNFSNNNNNNNNSIRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQ 202

Query: 524 XXXXXXXXXXXXXXKGTILATDEASTMKTKVALKNKRSSNGHVARHKKRSKFAIPSDDHA 345
                          GTI A  E   MKTK   K  +++N H+   KKR +F   +    
Sbjct: 203 RKARRALAAAQANANGTICA-PEIPAMKTKAQSKEGKANNNHLP-FKKRCRFTAQARGR- 259

Query: 344 REKLCFEDFF-INLNTNLSF-QRVFPQDEKEAAILLMALSCGLVHG 213
           + KLCFED+  I L+ N +F Q+VFPQDEKEAAILLMALS GLVHG
Sbjct: 260 KNKLCFEDYLSIILSKNSAFNQQVFPQDEKEAAILLMALSYGLVHG 305


>gb|EYU27295.1| hypothetical protein MIMGU_mgv1a020800mg [Erythranthe guttata]
          Length = 315

 Score =  158 bits (399), Expect = 7e-36
 Identities = 101/245 (41%), Positives = 132/245 (53%), Gaps = 27/245 (11%)
 Frame = -1

Query: 866 HGGSYDLENESDSGLNLSLWKKEDRDENHTENNSVXXXXXXXXXXXXXXXSDQLIAKKLE 687
           +  S +  N +++GL ++LWKKE  +    + N V                  + AK   
Sbjct: 73  NSNSNNNNNNNNNGLKITLWKKEPDEGAAADINPVKWMSSKIRLMKRMNK--NIPAKSKI 130

Query: 686 DLKQPISSPTA---TDDH------------NNSNIPVRVCSDCNTSKTPLWRSGPRGPKS 552
           D  Q  SS ++   + DH            NNSN P+RVC+DCNT+KTPLWRSGP+GPKS
Sbjct: 131 DSDQNPSSNSSLLESSDHLSSGNSSSYNNNNNSNYPIRVCADCNTTKTPLWRSGPKGPKS 190

Query: 551 LCNACGIRQXXXXXXXXXXXXXXKGTILATDE-ASTMKTKVALKNKR-SSNGHVARHKKR 378
           LCNACGIRQ               G ++A ++    +K KV  K K   +NGH +  KKR
Sbjct: 191 LCNACGIRQRKARRAMAAAAAAASGAVVAANQPPPVLKIKVQHKEKMGKNNGHSSLLKKR 250

Query: 377 SKFA----------IPSDDHAREKLCFEDFFINLNTNLSFQRVFPQDEKEAAILLMALSC 228
            K A            S ++ ++KL FE+F INL+ NLS  RVFP DEK+AAILLMALS 
Sbjct: 251 FKTADNNTNAAGSSADSTNNGKKKLGFEEFLINLSNNLSIHRVFPDDEKDAAILLMALSS 310

Query: 227 GLVHG 213
           GLVHG
Sbjct: 311 GLVHG 315


>ref|XP_006451458.1| hypothetical protein CICLE_v10009004mg [Citrus clementina]
           gi|568843031|ref|XP_006475428.1| PREDICTED: putative
           GATA transcription factor 22-like [Citrus sinensis]
           gi|557554684|gb|ESR64698.1| hypothetical protein
           CICLE_v10009004mg [Citrus clementina]
          Length = 306

 Score =  155 bits (392), Expect = 4e-35
 Identities = 102/224 (45%), Positives = 125/224 (55%), Gaps = 11/224 (4%)
 Frame = -1

Query: 851 DLENESDSGLNLSLW--KKEDRDENHTENNSVXXXXXXXXXXXXXXXS---DQLIAKKLE 687
           D      +GL LS+   K+E  D+N +EN+S                    D    +KLE
Sbjct: 84  DESGSESTGLKLSMSSEKEERNDQNQSENSSSVKWMSSKMRLMKKMMYSSPDAAAMQKLE 143

Query: 686 D-LKQPISSPTATDDHNNSNIP--VRVCSDCNTSKTPLWRSGPRGPKSLCNACGIRQXXX 516
           D  KQP SS    D+ NN+N    +RVC+DCNT+KTPLWRSGPRGPKSLCNACGIRQ   
Sbjct: 144 DHQKQPPSSSLEPDNGNNNNNTNTIRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKA 203

Query: 515 XXXXXXXXXXXKGTILATDEASTMKTKVALKNKRSSNGHVARHKKRSKFAIPSDDHAREK 336
                          LA D+ S+ K K +   + S+N      KKR K+   S    ++K
Sbjct: 204 RRAMAAAAANGTAVQLAADDTSSNKKK-SKTPRPSNNNSCLPFKKRCKYNSNSPSRGKKK 262

Query: 335 LC-FEDFFINLNTNLS--FQRVFPQDEKEAAILLMALSCGLVHG 213
           LC FED  +NL+ N S   QRVFPQ+EKEAAILLMALS GLVHG
Sbjct: 263 LCSFEDLTLNLSKNNSSALQRVFPQEEKEAAILLMALSYGLVHG 306


>emb|CAN63090.1| hypothetical protein VITISV_032017 [Vitis vinifera]
          Length = 211

 Score =  154 bits (390), Expect = 7e-35
 Identities = 90/212 (42%), Positives = 118/212 (55%), Gaps = 1/212 (0%)
 Frame = -1

Query: 848 LENESDSGLNLSLWKKEDRDE-NHTENNSVXXXXXXXXXXXXXXXSDQLIAKKLEDLKQP 672
           + +++ S   LS++KKE+ DE N +    +               +   I +K+ED +Q 
Sbjct: 1   MADDNKSSHKLSVFKKEEGDEGNKSTEKWMSSKMRLMRKMMNSDCTTAKIEQKVEDHQQW 60

Query: 671 ISSPTATDDHNNSNIPVRVCSDCNTSKTPLWRSGPRGPKSLCNACGIRQXXXXXXXXXXX 492
            +       +N SNIP+RVCSDCNT+KTPLWRSGPRGPKSLCNACGIRQ           
Sbjct: 61  DNINEXNSSNNTSNIPIRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAA 120

Query: 491 XXXKGTILATDEASTMKTKVALKNKRSSNGHVARHKKRSKFAIPSDDHAREKLCFEDFFI 312
                      E S MK K+  K K+    +V + KK  K   P      +KLCFEDF  
Sbjct: 121 AAAANGTAVGTEISPMKMKLPNKEKKMHTSNVGQQKKLCK--PPCPPPTEKKLCFEDFTS 178

Query: 311 NLNTNLSFQRVFPQDEKEAAILLMALSCGLVH 216
           ++  N  F+RVFP+DE+EAAILLMALSC LV+
Sbjct: 179 SICKNSGFRRVFPRDEEEAAILLMALSCDLVY 210

