BLASTX nr result

ID: Panax25_contig00024425 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Panax25_contig00024425
         (1149 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_002282173.1 PREDICTED: GATA transcription factor 21 [Vitis vi...   176   9e-49
XP_011019465.1 PREDICTED: putative GATA transcription factor 22 ...   173   1e-47
XP_002308561.2 hypothetical protein POPTR_0006s24560g [Populus t...   172   3e-47
XP_015884441.1 PREDICTED: GATA transcription factor 21-like [Ziz...   172   3e-47
XP_017221410.1 PREDICTED: putative GATA transcription factor 22 ...   171   3e-47
XP_006450838.1 hypothetical protein CICLE_v10008968mg [Citrus cl...   171   8e-47
AGU42761.1 GATA nirate-inducible carbon-metabolism involved prot...   170   1e-46
XP_010262144.1 PREDICTED: putative GATA transcription factor 22 ...   171   1e-46
XP_006450839.1 hypothetical protein CICLE_v10008968mg [Citrus cl...   168   1e-45
XP_010242203.1 PREDICTED: GATA transcription factor 21-like [Nel...   164   2e-44
XP_002279283.1 PREDICTED: putative GATA transcription factor 22 ...   164   2e-44
XP_006600457.1 PREDICTED: GATA transcription factor 21-like isof...   164   2e-44
XP_003550634.1 PREDICTED: GATA transcription factor 21-like isof...   164   3e-44
EEF48061.1 hypothetical protein RCOM_1046780 [Ricinus communis]       164   4e-44
KZN11770.1 hypothetical protein DCAR_004426 [Daucus carota subsp...   159   6e-44
ABK96478.1 unknown [Populus trichocarpa x Populus deltoides]          163   8e-44
XP_002514107.2 PREDICTED: putative GATA transcription factor 22 ...   164   2e-43
ABK96296.1 unknown [Populus trichocarpa x Populus deltoides]          161   3e-43
XP_007012845.2 PREDICTED: putative GATA transcription factor 22 ...   160   8e-43
XP_017982034.1 PREDICTED: putative GATA transcription factor 22 ...   160   1e-42

>XP_002282173.1 PREDICTED: GATA transcription factor 21 [Vitis vinifera] CBI27913.3
           unnamed protein product, partial [Vitis vinifera]
          Length = 309

 Score =  176 bits (446), Expect = 9e-49
 Identities = 129/312 (41%), Positives = 168/312 (53%), Gaps = 41/312 (13%)
 Frame = +2

Query: 128 IDLNEDHHLEHLISLPHHPTPSIST----PIYFNLNQEYQICGSHSRESLQNQKKVEKNN 295
           + LNED H + L S    P+ S S+    PI+F+  +E   C  H R+  Q Q + E ++
Sbjct: 16  LQLNEDQHHQLLFSPKPQPSSSSSSSLTCPIFFSPTKEQGGC--HYRDLHQAQPQQEAHD 73

Query: 296 IVLFXXXXXXXXXXXXXXLLEPTAGD----MVCKREDHEDEGKSSRITDHDGSMKWMSSK 463
             +F               LE  + +     + K ED  +    +      GS+KWMSSK
Sbjct: 74  KFVFRGGSYDHPT------LESESDNGLKLTIWKTEDRNENHSEN------GSVKWMSSK 121

Query: 464 MRVMPKMMNSN---SDKPACRSVVHKFNDQIKYS------------TNYNSNDFVRVCSD 598
           MRVM KMM S+   + KP+  ++   F D  + S            +N NSN+ +RVC+D
Sbjct: 122 MRVMQKMMISDQTGAQKPSNTAL--NFGDHKQQSLPSETDYNSINSSNINSNNTIRVCAD 179

Query: 599 CNTTKTPLWRGGPQGPKSLCNACGIRQRKARRAMAEA---ANGTELNTGT---------- 739
           CNTTKTPLWR GP+GPKSLCNACGIRQRKARRAMA A   ANGT L T T          
Sbjct: 180 CNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAATANGTILPTNTAPTKTKAKHK 239

Query: 740 -KKSRTSYIAQNKQHFKLIGTDSAHTSDQKKLSFEDFALSLTSKK---NLTIPDE-EEAA 904
            KKS   +++  K+  KL    S  T   KKL FEDF +SL+       + + DE +EAA
Sbjct: 240 DKKSSNGHVSHYKKRCKLAAAPSCET---KKLCFEDFTISLSKNSAFHRVFLQDEIKEAA 296

Query: 905 ILLMALSCGLIH 940
           ILLMALSCGL+H
Sbjct: 297 ILLMALSCGLVH 308


>XP_011019465.1 PREDICTED: putative GATA transcription factor 22 [Populus
           euphratica]
          Length = 303

 Score =  173 bits (438), Expect = 1e-47
 Identities = 128/310 (41%), Positives = 162/310 (52%), Gaps = 38/310 (12%)
 Frame = +2

Query: 128 IDLNEDHHLEHLISLPHHPTPSISTPI-YFNLNQEYQICGSHSRESLQNQKKVEKNNIVL 304
           +DL E+ +L+  +S PH    S+S P  +FN +        H RE+   + +   N  V 
Sbjct: 16  VDLREEQNLQLFLS-PHQAATSLSGPTNFFNASAH-----DHQRETKPGESRQHDNQEV- 68

Query: 305 FXXXXXXXXXXXXXXLLEPTAGD----------MVCKREDHEDEGKSSRITDHDGSMKWM 454
                            +P   D             K ED  +E   S       S+KWM
Sbjct: 69  ---DMYNISHGGSSSSFQPEVNDHNYNSNFRNLSSSKMEDGAEESGES-------SVKWM 118

Query: 455 SSKMRVMPKMMNSNSDKPACRSV--VHKF------NDQIKYSTNYNSNDFVRVCSDCNTT 610
            SKMR+M KM NSN  +   + +  + KF      N++I  S+N NSN  +RVCSDCNTT
Sbjct: 119 PSKMRLMQKMTNSNCSETDHKPMKFMLKFHNQQYQNNEINSSSNSNSN--IRVCSDCNTT 176

Query: 611 KTPLWRGGPQGPKSLCNACGIRQRKARRAM---AEAANGT-------------ELNTGTK 742
            TPLWR GP+GPKSLCNACGIRQRKARRAM   A AANGT             ++N   K
Sbjct: 177 STPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAAANGTVIAIEASSSTRSSKVNNKVK 236

Query: 743 KSRTSYIAQNKQHFKLIGTDSAHTSDQKKLSFEDFALSLTSKKNL--TIP-DEEEAAILL 913
           KSRTS+++QNK   KL     +    QKKL F++ ALSL+    L   +P D EEAAILL
Sbjct: 237 KSRTSHVSQNK---KLSKPPESSLQSQKKLCFKNLALSLSKNPALQQVLPHDVEEAAILL 293

Query: 914 MALSCGLIHS 943
           M LSCG IHS
Sbjct: 294 MELSCGFIHS 303


>XP_002308561.2 hypothetical protein POPTR_0006s24560g [Populus trichocarpa]
           ABK95624.1 unknown [Populus trichocarpa] EEE92084.2
           hypothetical protein POPTR_0006s24560g [Populus
           trichocarpa]
          Length = 303

 Score =  172 bits (435), Expect = 3e-47
 Identities = 128/310 (41%), Positives = 161/310 (51%), Gaps = 38/310 (12%)
 Frame = +2

Query: 128 IDLNEDHHLEHLISLPHHPTPSISTPI-YFNLNQEYQICGSHSRESLQNQKKVEKNNIVL 304
           +DL E+ +L+  +S PH    S+S P  +FN +        H RE+   + +   N  V 
Sbjct: 16  VDLREEQNLQLFLS-PHQAATSLSGPTNFFNTSAH-----DHQRETKPGESRQHDNQEV- 68

Query: 305 FXXXXXXXXXXXXXXLLEPTAGDMVCKREDHEDEGKSSRITDH-----DGSMKWMSSKMR 469
                            +P   D       H     SS++ D      + S+KWM SKMR
Sbjct: 69  ---DMYNISHGGSSSSFQPEVNDHNYNSNFHNLS--SSKMEDGAEESGESSVKWMPSKMR 123

Query: 470 VMPKMMNSNSDKPACRSVVH-------KF------NDQIKYSTNYNSNDFVRVCSDCNTT 610
           +M KM NSN     C    H       KF      N++I  S+N NSN  +RVCSDCNTT
Sbjct: 124 LMQKMTNSN-----CSETDHMPMKFMLKFHNQQYQNNEINSSSNSNSN--IRVCSDCNTT 176

Query: 611 KTPLWRGGPQGPKSLCNACGIRQRKARRAM---AEAANG-------------TELNTGTK 742
            TPLWR GP+GPKSLCNACGIRQRKARRAM   A AANG             T++N   K
Sbjct: 177 STPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAAANGTVIAIEASSSTRSTKVNNKVK 236

Query: 743 KSRTSYIAQNKQHFKLIGTDSAHTSDQKKLSFEDFALSLTSKKNL--TIP-DEEEAAILL 913
           KSRT++++QNK   KL     +    QKKL F++ ALSL+    L   +P D EEAAILL
Sbjct: 237 KSRTNHVSQNK---KLSKPPESSLQSQKKLCFKNLALSLSKNPALQQVLPHDVEEAAILL 293

Query: 914 MALSCGLIHS 943
           M LSCG IHS
Sbjct: 294 MELSCGFIHS 303


>XP_015884441.1 PREDICTED: GATA transcription factor 21-like [Ziziphus jujuba]
          Length = 316

 Score =  172 bits (436), Expect = 3e-47
 Identities = 119/302 (39%), Positives = 158/302 (52%), Gaps = 34/302 (11%)
 Frame = +2

Query: 140 EDHHLEHLI-SLPHH-PTPSISTPIYFNLNQEYQI-CGSHSRESLQNQKKVEK---NNIV 301
           ED HL+ L  S+P+  P+ S+STP +FN  Q++Q   G+   ES +   K +K       
Sbjct: 21  EDQHLKLLFNSVPYQTPSNSLSTPTFFNSLQDHQDQSGTTVEESQERDHKADKPIWKEAA 80

Query: 302 LFXXXXXXXXXXXXXXLLEPTAGDMVCKREDHEDEGKSSRITDHDGSMKWMSSKMRVMPK 481
                           +    A D +  R+  + +   S+    +GS+KWMSSKMR+M K
Sbjct: 81  GSSSYYYKVCSSSSTSVQPVVASDFMSNRQSEDQDEDDSK--SKNGSVKWMSSKMRLMQK 138

Query: 482 MMNSNSDKPACR-SVVHKFNDQIKYSTNYNSNDFVRVCSDCNTTKTPLWRGGPQGPKSLC 658
           MMN     PA       + N   +++ ++N+N+ VRVCSDCNTT TPLWR GP+GPKSLC
Sbjct: 139 MMNPPDHIPAATVDKRERNNSTTQFNFSFNANNTVRVCSDCNTTTTPLWRSGPRGPKSLC 198

Query: 659 NACGIRQRKARRAMAE---AANGTELNTGT----------------KKSRTSYIAQNKQH 781
           NACGIRQRKARRAM+E   AANG  + T T                KKSR  ++AQ K  
Sbjct: 199 NACGIRQRKARRAMSEAAAAANGFLIATDTSSSSMSTKNIKVHNKEKKSRAGHVAQYKNK 258

Query: 782 FKLI--------GTDSAHTSDQKKLSFEDFALSLTSKKNLTIPDEEEAAILLMALSCGLI 937
            KL+         T ++ +S +K   F DF         +   D  EAAILLM LSCG I
Sbjct: 259 CKLVDSSVTSTSSTTTSISSQRKLCRFNDFGFG----HGVFPQDVAEAAILLMELSCGFI 314

Query: 938 HS 943
           HS
Sbjct: 315 HS 316


>XP_017221410.1 PREDICTED: putative GATA transcription factor 22 [Daucus carota
           subsp. sativus]
          Length = 291

 Score =  171 bits (434), Expect = 3e-47
 Identities = 124/290 (42%), Positives = 163/290 (56%), Gaps = 20/290 (6%)
 Frame = +2

Query: 128 IDLNEDHHLE---------HLISLPHHPTPSISTPIYFNLNQEYQICGSHSRESLQNQKK 280
           +DLN+DH  E         H   L      S   P +FNL  ++Q  GSH  +  Q+ +K
Sbjct: 17  LDLNKDHMPEYQQYASTTNHSALLQQATATSSKCPAFFNLTPDHQT-GSHLGDYPQSLQK 75

Query: 281 VEKNNIVLFXXXXXXXXXXXXXXLLEPTAGDMVCKREDHEDEGKSSRITDHDGSMKWMSS 460
           V + + VL                 E    D   K  DHE+   S   T+ DGSM+WMSS
Sbjct: 76  VTEKS-VLEGSSAGHVFYASSPQTEEDNTKDHK-KMVDHENISTS---TEGDGSMRWMSS 130

Query: 461 KMRVMPKMMNSNSDKPACRSVVHKFNDQIKYSTNYNSND--FVRVCSDCNTTKTPLWRGG 634
           KMRV  K++ S+++           +DQI +S + NS+    VRVCSDCNTT+TPLWRGG
Sbjct: 131 KMRVKRKILPSSNNH----------SDQINFSDHGNSSTDVVVRVCSDCNTTRTPLWRGG 180

Query: 635 PQGPKSLCNACGIRQRKARRAMAEAANGTELNTGTKKSRTSYIAQNKQH--FKLIG---- 796
           P+GPKSLCNACGIR+RKAR+A+A AA+ + + + +K  + S    +K H   KL      
Sbjct: 181 PRGPKSLCNACGIRRRKARKAIALAAHNS-IESVSKMDQHSSTRPSKFHKGKKLHTTYHD 239

Query: 797 -TDSAHTSDQKKLSFEDFALSLTSKKNLTI--PDEEEAAILLMALSCGLI 937
            T S  T+   KLSFEDFA+SL +KK+  +   DEEEAAILLM LSC LI
Sbjct: 240 VTHSRQTTGNSKLSFEDFAMSLVNKKSAELFPGDEEEAAILLMTLSCSLI 289


>XP_006450838.1 hypothetical protein CICLE_v10008968mg [Citrus clementina]
           XP_006475926.1 PREDICTED: putative GATA transcription
           factor 22 isoform X2 [Citrus sinensis] ESR64078.1
           hypothetical protein CICLE_v10008968mg [Citrus
           clementina] KDO80098.1 hypothetical protein
           CISIN_1g021329mg [Citrus sinensis]
          Length = 312

 Score =  171 bits (433), Expect = 8e-47
 Identities = 113/298 (37%), Positives = 155/298 (52%), Gaps = 30/298 (10%)
 Frame = +2

Query: 140 EDHHLEHLISLPHHPTPSISTPIYFNLNQEYQICGSHSRESLQNQKKVEKNNIVLFXXXX 319
           +D HL HL+    H   + S+  + N   +  I    S++  Q       +N+ +F    
Sbjct: 23  DDQHL-HLLHSSSHNRAASSSVSWTNFQDQRMIIMEESQQHDQKVDHSGSSNLQVFSSSS 81

Query: 320 XXXXXXXXXXLLEPTAGDMVCKREDHEDEGKSSRITDHDGSMKWMSSKMRVMPKMMNSNS 499
                      +     + +  R+    EG +S       S KWMSSK+R+M KM+NS+S
Sbjct: 82  IQTKK------MNNITNNKLPIRKREVGEGTTSE-NGSSSSGKWMSSKIRLMHKMINSSS 134

Query: 500 DKPACRSVVHKFNDQIKYS-----------TNYNSNDFVRVCSDCNTTKTPLWRGGPQGP 646
           +  A   +  K   +++Y             + NSN+ +R CSDCNTT TPLWR GP+GP
Sbjct: 135 NSTATHELAVKVTQKLQYHQLHDNSEVNSFNSSNSNNTMRACSDCNTTTTPLWRSGPRGP 194

Query: 647 KSLCNACGIRQRKARRAMAEAA---NGTELNTG------------TKKSRTSYIAQNKQH 781
           KSLCNACGIRQRKARRAM  AA    GT   TG             KK RTS+++QNK+ 
Sbjct: 195 KSLCNACGIRQRKARRAMQAAAAVETGTIAATGGSPFAKIKLQIKDKKPRTSHVSQNKKQ 254

Query: 782 FKLIGTDSAHT-SDQKKLSFEDFALSLTSK---KNLTIPDEEEAAILLMALSCGLIHS 943
           ++ +  D  H    Q+KL F+DFA++L+     K +   D EEAAILLM LSCG IHS
Sbjct: 255 YRTLDPDPTHQYQSQRKLCFKDFAIALSKNSALKQVFPQDVEEAAILLMELSCGFIHS 312


>AGU42761.1 GATA nirate-inducible carbon-metabolism involved protein [Populus
           nigra x Populus x canadensis]
          Length = 303

 Score =  170 bits (431), Expect = 1e-46
 Identities = 126/310 (40%), Positives = 159/310 (51%), Gaps = 38/310 (12%)
 Frame = +2

Query: 128 IDLNEDHHLEHLISLPHHPTPSISTPI-YFNLNQEYQICGSHSRESLQNQKKVEKNNIVL 304
           +DL E+ +L+  +S PH    S+S P  +FN +        H RE+   + +   N  V 
Sbjct: 16  VDLREEQNLQLFLS-PHQAATSLSGPTNFFNTSAH-----DHQRETKTGESRQHDNLEV- 68

Query: 305 FXXXXXXXXXXXXXXLLEPTAGDMVCKREDHEDEGKSSRITDH-----DGSMKWMSSKMR 469
                            +P   D       H     SS++ D      + S+KWM SKM 
Sbjct: 69  ---DMYNISHGGSSSSFQPEVNDQNYNSNFHNLS--SSKMEDGAEESGESSVKWMPSKMM 123

Query: 470 VMPKMMNSNSDKPACRSVVHK-------------FNDQIKYSTNYNSNDFVRVCSDCNTT 610
           +M KM NSN     C    H              +N++I  S+N NSN  +RVCSDCNTT
Sbjct: 124 LMQKMTNSN-----CSETDHMPMKFMLKFHNQQYWNNEINSSSNSNSN--IRVCSDCNTT 176

Query: 611 KTPLWRGGPQGPKSLCNACGIRQRKARRAM---AEAANG-------------TELNTGTK 742
            TPLWR GP+GPKSLCNACGIRQRKARRAM   A AANG             T++N   K
Sbjct: 177 STPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAAANGTVIAIEASSSTRSTKVNNKVK 236

Query: 743 KSRTSYIAQNKQHFKLIGTDSAHTSDQKKLSFEDFALSLTSKKNL--TIP-DEEEAAILL 913
           KSRTS+++QNK   KL     +    QKKL F++ ALSL+    L   +P D EEAAILL
Sbjct: 237 KSRTSHVSQNK---KLSKPPESSLQSQKKLCFKNLALSLSKNPALQQVLPHDVEEAAILL 293

Query: 914 MALSCGLIHS 943
           M LSCG IHS
Sbjct: 294 MELSCGFIHS 303


>XP_010262144.1 PREDICTED: putative GATA transcription factor 22 [Nelumbo nucifera]
          Length = 316

 Score =  171 bits (432), Expect = 1e-46
 Identities = 123/300 (41%), Positives = 157/300 (52%), Gaps = 32/300 (10%)
 Frame = +2

Query: 137 NEDHHLEHLISLPHHPTPSI--STPIYFNLNQEYQICGSHSRESLQNQKKVEKNNIVLFX 310
           +ED     L S P   T S   STP+ FN  +E    GSH  E+   +++ ++       
Sbjct: 24  DEDQQYCQLFSPPPSQTNSSLHSTPLSFNSAREG---GSHDHEAHDREQQEQQQRQEADK 80

Query: 311 XXXXXXXXXXXXXLLEPTAGDMVCKREDHEDEGKSSRITDHDGSMKWMSSKMRVMPKMMN 490
                        L     G         + E +    ++  GS +WMSSKMR+M KMMN
Sbjct: 81  GSEGYHLDFPYPPLQSSKNGINSSLELSIKQEIRDESQSNSTGSARWMSSKMRLMRKMMN 140

Query: 491 SN---SDKPACRSVVHKFNDQIKY------------STNYNSNDFVRVCSDCNTTKTPLW 625
           S+   +DKPA  +   KF D  +             S++ NSN  VRVCSDCNTTKTPLW
Sbjct: 141 SDRMGADKPASGNT-QKFQDHHQQPSSLEMDSSSSNSSSNNSNITVRVCSDCNTTKTPLW 199

Query: 626 RGGPQGPKSLCNACGIRQRKARRAM-AEAANGTELNTGT-----------KKSRTSYIAQ 769
           R GP+GPKSLCNACGIRQRKARRAM A AA+GT L   T           K+S T Y+ Q
Sbjct: 200 RSGPRGPKSLCNACGIRQRKARRAMAAAAASGTLLPADTPSLQRKVHHKEKRSETGYVPQ 259

Query: 770 NKQHFKLIGTDSAHTSDQKKLSFEDFALSLTSKK---NLTIPDEEEAAILLMALSCGLIH 940
            K+  KL    +     +KKL FEDF ++L+       +   DE+EAAILLMALSCGL+H
Sbjct: 260 YKKRCKL----APSPRSRKKLCFEDFTINLSKNSAFHRVFPQDEKEAAILLMALSCGLVH 315


>XP_006450839.1 hypothetical protein CICLE_v10008968mg [Citrus clementina]
           XP_006475925.1 PREDICTED: putative GATA transcription
           factor 22 isoform X1 [Citrus sinensis] ESR64079.1
           hypothetical protein CICLE_v10008968mg [Citrus
           clementina] KDO80099.1 hypothetical protein
           CISIN_1g021329mg [Citrus sinensis]
          Length = 314

 Score =  168 bits (425), Expect = 1e-45
 Identities = 100/216 (46%), Positives = 128/216 (59%), Gaps = 30/216 (13%)
 Frame = +2

Query: 386 REDHEDEGKSSRITDHDGSMKWMSSKMRVMPKMMNSNSDKPACRSVVHKFNDQIKYS--- 556
           R+    EG +S       S KWMSSK+R+M KM+NS+S+  A   +  K   +++Y    
Sbjct: 100 RKREVGEGTTSE-NGSSSSGKWMSSKIRLMHKMINSSSNSTATHELAVKVTQKLQYHQLH 158

Query: 557 --------TNYNSNDFVRVCSDCNTTKTPLWRGGPQGPKSLCNACGIRQRKARRAMAEAA 712
                    + NSN+ +R CSDCNTT TPLWR GP+GPKSLCNACGIRQRKARRAM  AA
Sbjct: 159 DNSEVNSFNSSNSNNTMRACSDCNTTTTPLWRSGPRGPKSLCNACGIRQRKARRAMQAAA 218

Query: 713 ---NGTELNTG------------TKKSRTSYIAQNKQHFKLIGTDSAHT-SDQKKLSFED 844
               GT   TG             KK RTS+++QNK+ ++ +  D  H    Q+KL F+D
Sbjct: 219 AVETGTIAATGGSPFAKIKLQIKDKKPRTSHVSQNKKQYRTLDPDPTHQYQSQRKLCFKD 278

Query: 845 FALSLTSK---KNLTIPDEEEAAILLMALSCGLIHS 943
           FA++L+     K +   D EEAAILLM LSCG IHS
Sbjct: 279 FAIALSKNSALKQVFPQDVEEAAILLMELSCGFIHS 314


>XP_010242203.1 PREDICTED: GATA transcription factor 21-like [Nelumbo nucifera]
          Length = 305

 Score =  164 bits (416), Expect = 2e-44
 Identities = 101/199 (50%), Positives = 123/199 (61%), Gaps = 31/199 (15%)
 Frame = +2

Query: 437 GSMKWMSSKMRVMPKMMNSNS---DKPACRSVVHKFN-------------DQIKYSTNYN 568
           GS++WMSSKMR+M KM NS+    DKP   + +HKF              D    S++ N
Sbjct: 112 GSVRWMSSKMRLMRKMKNSDRVGMDKPV-NTNMHKFQQDHHHRSPSPWEMDTSSNSSSNN 170

Query: 569 SNDFVRVCSDCNTTKTPLWRGGPQGPKSLCNACGIRQRKARRAMAEAANGTELNTGT--- 739
           +N+ VRVCSDCNTTKTPLWR GP+GPKSLCNACGIRQRKARRAMA AANGT L T     
Sbjct: 171 ANNTVRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMA-AANGTLLPTEASSM 229

Query: 740 ---------KKSRTSYIAQNKQHFKLIGTDSAHTSDQKKLSFEDFALSLTSKKNL--TIP 886
                    + S T Y+ Q K+  KL    +      KK+ FEDF ++L+   +     P
Sbjct: 230 KNKVHHKEKRSSETGYVQQYKKRCKL----ATSPRSMKKVCFEDFTINLSKNSSFHRVFP 285

Query: 887 -DEEEAAILLMALSCGLIH 940
            DE+EAAILLMALSCGL+H
Sbjct: 286 QDEKEAAILLMALSCGLVH 304


>XP_002279283.1 PREDICTED: putative GATA transcription factor 22 [Vitis vinifera]
           CBI20665.3 unnamed protein product, partial [Vitis
           vinifera]
          Length = 306

 Score =  164 bits (416), Expect = 2e-44
 Identities = 123/305 (40%), Positives = 158/305 (51%), Gaps = 33/305 (10%)
 Frame = +2

Query: 128 IDLNEDHHLEHLISLPHHPTPSIST----PIYFNLNQEYQICGSHSRESLQNQKKVEKNN 295
           ++L EDH    L+   + P+   S+    P +FN + + Q  G HS    Q  +  +K++
Sbjct: 17  LELKEDHQHFQLLFSTNPPSYQASSSHPCPSFFNSSTQSQR-GDHSPRDPQQHE--DKDD 73

Query: 296 IVLFXXXXXXXXXXXXXXLLEPTAGD-------MVCKREDHEDEGKSSRITDHDGSMKWM 454
             +               LL+P A D        V K+E+ ++  KS+         KWM
Sbjct: 74  KYISHGGCGESQVFSSSSLLQPMADDNKSSHKLSVFKKEEGDEGNKSTE--------KWM 125

Query: 455 SSKMRVMPKMMNSNSDKPACRSVV--HKFNDQIKY--STNYNSNDFVRVCSDCNTTKTPL 622
           SSKMR+M KMMNS+         V  H+  D I    S+N  SN  +RVCSDCNTTKTPL
Sbjct: 126 SSKMRLMRKMMNSDCTTAKIEQKVEDHQQWDNINEFNSSNNTSNIPIRVCSDCNTTKTPL 185

Query: 623 WRGGPQGPKSLCNACGIRQRKARRAM----AEAANGTELNT-----------GTKKSRTS 757
           WR GP+GPKSLCNACGIRQRKARRAM    A AANGT + T             KK  TS
Sbjct: 186 WRSGPRGPKSLCNACGIRQRKARRAMAAAAAAAANGTAVGTEISPMKMKLPNKEKKMHTS 245

Query: 758 YIAQNKQHFKLIGTDSAHTSDQKKLSFEDFALSLTSK---KNLTIPDEEEAAILLMALSC 928
            + Q K+  K           +KKL FEDF  S+      + +   DEEEAAILLMALSC
Sbjct: 246 NVGQQKKLCK----PPCPPPTEKKLCFEDFTSSICKNSGFRRVFPRDEEEAAILLMALSC 301

Query: 929 GLIHS 943
            L++S
Sbjct: 302 DLVYS 306


>XP_006600457.1 PREDICTED: GATA transcription factor 21-like isoform X2 [Glycine
           max] KRH02717.1 hypothetical protein GLYMA_17G055200
           [Glycine max]
          Length = 310

 Score =  164 bits (416), Expect = 2e-44
 Identities = 129/317 (40%), Positives = 163/317 (51%), Gaps = 46/317 (14%)
 Frame = +2

Query: 128 IDLNEDHHLEHLISLPHHPTPSIST----PIYFNL-NQEYQICGSHSRESLQNQKKVEKN 292
           +DLNED + E   S  HHP+ S S+    PI FN  NQ+ +    +   + Q     E+ 
Sbjct: 3   LDLNEDQNHE-FFSPTHHPSSSFSSLSSYPILFNPPNQDQEARSYYWEPTKQYLPSHEEE 61

Query: 293 NIVLFXXXXXXXXXXXXXXLLEPTAGDMVCKREDHEDEGKSSRITDHDGSMKWMSSKMRV 472
              +                 + T    V K+ +  +E   S +   DGS+KWM +KMR+
Sbjct: 62  TEKIIPSSGSWDHSVAESEHNKAT----VWKKAEERNENLES-VAAEDGSLKWMPAKMRI 116

Query: 473 MPKMMNSNSDKPACRS---VVHKFNDQIKY-----------STNYN--SNDFVRVCSDCN 604
           M KM+ S+       S     HKF+DQ +            S NY+  SN+ VRVCSDC+
Sbjct: 117 MRKMLVSDQTDTYTNSDNNTTHKFDDQKQQLSSPLGTDNSSSNNYSNHSNNTVRVCSDCH 176

Query: 605 TTKTPLWRGGPQGPKSLCNACGIRQRKARRAMAEAA-----NGTEL-------------- 727
           TTKTPLWR GP+GPKSLCNACGIRQRKARRAMA AA     NGT +              
Sbjct: 177 TTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAASASGNGTVIVEAKKSVKGRNKLQ 236

Query: 728 NTGTKKSRTSYIAQNKQHFKLIGTDSAHTSDQK-KLSFEDFALSLTSKKNLTI-----PD 889
               KK+RT   AQ K+  KL G  SA  S  + K  FED  L L  +KNL +      D
Sbjct: 237 KKKEKKTRTEGAAQMKKKRKL-GVGSAKASQSRNKFGFEDLTLRL--RKNLAMHQVFPQD 293

Query: 890 EEEAAILLMALSCGLIH 940
           E+EAAILLMALS GL+H
Sbjct: 294 EKEAAILLMALSYGLVH 310


>XP_003550634.1 PREDICTED: GATA transcription factor 21-like isoform X1 [Glycine
           max] KHN17667.1 Putative GATA transcription factor 22
           [Glycine soja] KRH02716.1 hypothetical protein
           GLYMA_17G055200 [Glycine max]
          Length = 322

 Score =  164 bits (416), Expect = 3e-44
 Identities = 129/317 (40%), Positives = 163/317 (51%), Gaps = 46/317 (14%)
 Frame = +2

Query: 128 IDLNEDHHLEHLISLPHHPTPSIST----PIYFNL-NQEYQICGSHSRESLQNQKKVEKN 292
           +DLNED + E   S  HHP+ S S+    PI FN  NQ+ +    +   + Q     E+ 
Sbjct: 15  LDLNEDQNHE-FFSPTHHPSSSFSSLSSYPILFNPPNQDQEARSYYWEPTKQYLPSHEEE 73

Query: 293 NIVLFXXXXXXXXXXXXXXLLEPTAGDMVCKREDHEDEGKSSRITDHDGSMKWMSSKMRV 472
              +                 + T    V K+ +  +E   S +   DGS+KWM +KMR+
Sbjct: 74  TEKIIPSSGSWDHSVAESEHNKAT----VWKKAEERNENLES-VAAEDGSLKWMPAKMRI 128

Query: 473 MPKMMNSNSDKPACRS---VVHKFNDQIKY-----------STNYN--SNDFVRVCSDCN 604
           M KM+ S+       S     HKF+DQ +            S NY+  SN+ VRVCSDC+
Sbjct: 129 MRKMLVSDQTDTYTNSDNNTTHKFDDQKQQLSSPLGTDNSSSNNYSNHSNNTVRVCSDCH 188

Query: 605 TTKTPLWRGGPQGPKSLCNACGIRQRKARRAMAEAA-----NGTEL-------------- 727
           TTKTPLWR GP+GPKSLCNACGIRQRKARRAMA AA     NGT +              
Sbjct: 189 TTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAASASGNGTVIVEAKKSVKGRNKLQ 248

Query: 728 NTGTKKSRTSYIAQNKQHFKLIGTDSAHTSDQK-KLSFEDFALSLTSKKNLTI-----PD 889
               KK+RT   AQ K+  KL G  SA  S  + K  FED  L L  +KNL +      D
Sbjct: 249 KKKEKKTRTEGAAQMKKKRKL-GVGSAKASQSRNKFGFEDLTLRL--RKNLAMHQVFPQD 305

Query: 890 EEEAAILLMALSCGLIH 940
           E+EAAILLMALS GL+H
Sbjct: 306 EKEAAILLMALSYGLVH 322


>EEF48061.1 hypothetical protein RCOM_1046780 [Ricinus communis]
          Length = 312

 Score =  164 bits (415), Expect = 4e-44
 Identities = 131/313 (41%), Positives = 165/313 (52%), Gaps = 42/313 (13%)
 Frame = +2

Query: 128 IDLNEDHHLEHLISLPHHPTP------SISTPIYFNLNQEYQICGSHSRESLQNQKKVEK 289
           IDLNED H   LI      T       SIS PI+ N  QE    G + +E LQ     E 
Sbjct: 14  IDLNEDQHHHQLIFCSKTTTEDASSSSSISYPIFINPPQEE--VGYYHKE-LQPLHHQEV 70

Query: 290 NNIVLFXXXXXXXXXXXXXXLLEPTAGDMVCKREDHEDEGKSSRITDH--DGSMKWMSSK 463
           +NI  +                E      VCK+ED     KS+ I D   + S+KWMSSK
Sbjct: 71  DNI--YASHGRSWDHRIIKNENENGQELSVCKKED-----KSTSIEDQRDNSSVKWMSSK 123

Query: 464 MRVMPKMMNSNSDKPACR--SVVHKFNDQIK---------YSTNY---NSNDFVRVCSDC 601
           MR+M KMM ++      +  S +HK  D+ K         YS+     NSN+ +RVCSDC
Sbjct: 124 MRLMRKMMTTDQTVNTTQHTSSMHKLEDKEKSRSLPLQDDYSSKNLSDNSNNTIRVCSDC 183

Query: 602 NTTKTPLWRGGPQGPKSLCNACGIRQRKARRAMAEA---ANGTELNTGTKKSRTSYIAQN 772
           NTTKTPLWR GP+GPKSLCNACGIRQRKARRA+A A   ANGT     T   +T+ + QN
Sbjct: 184 NTTKTPLWRSGPRGPKSLCNACGIRQRKARRALAAAQASANGTIFAPDTAAMKTNKV-QN 242

Query: 773 KQHFKLIGTDSAH-------------TSDQKKLSFEDFALSLTSK----KNLTIPDEEEA 901
           K+      T+++H                +KKL FED + ++ SK    + L   DE+EA
Sbjct: 243 KEK----RTNNSHLPFKKRCKFTAQSRGSRKKLCFEDLSSTILSKNSAFQQLFPQDEKEA 298

Query: 902 AILLMALSCGLIH 940
           AILLMALS GL+H
Sbjct: 299 AILLMALSYGLVH 311


>KZN11770.1 hypothetical protein DCAR_004426 [Daucus carota subsp. sativus]
          Length = 183

 Score =  159 bits (403), Expect = 6e-44
 Identities = 99/193 (51%), Positives = 128/193 (66%), Gaps = 11/193 (5%)
 Frame = +2

Query: 392 DHEDEGKSSRITDHDGSMKWMSSKMRVMPKMMNSNSDKPACRSVVHKFNDQIKYSTNYNS 571
           DHE+   S   T+ DGSM+WMSSKMRV  K++ S+++           +DQI +S + NS
Sbjct: 3   DHENISTS---TEGDGSMRWMSSKMRVKRKILPSSNNH----------SDQINFSDHGNS 49

Query: 572 ND--FVRVCSDCNTTKTPLWRGGPQGPKSLCNACGIRQRKARRAMAEAANGTELNTGTKK 745
           +    VRVCSDCNTT+TPLWRGGP+GPKSLCNACGIR+RKAR+A+A AA+ + + + +K 
Sbjct: 50  STDVVVRVCSDCNTTRTPLWRGGPRGPKSLCNACGIRRRKARKAIALAAHNS-IESVSKM 108

Query: 746 SRTSYIAQNKQH--FKLIG-----TDSAHTSDQKKLSFEDFALSLTSKKNLTI--PDEEE 898
            + S    +K H   KL       T S  T+   KLSFEDFA+SL +KK+  +   DEEE
Sbjct: 109 DQHSSTRPSKFHKGKKLHTTYHDVTHSRQTTGNSKLSFEDFAMSLVNKKSAELFPGDEEE 168

Query: 899 AAILLMALSCGLI 937
           AAILLM LSC LI
Sbjct: 169 AAILLMTLSCSLI 181


>ABK96478.1 unknown [Populus trichocarpa x Populus deltoides]
          Length = 303

 Score =  163 bits (412), Expect = 8e-44
 Identities = 120/301 (39%), Positives = 159/301 (52%), Gaps = 29/301 (9%)
 Frame = +2

Query: 128 IDLNEDHHLEHLISLPHHPTPSISTPIYFNLNQEYQICGSHSRESLQNQK-KVEKNNIVL 304
           +DL E+ HL+  +S PH    S+S P  F  N  +    S   ES Q+   +V+K +I L
Sbjct: 16  VDLKEEQHLQLFLS-PHQAATSLSGPTNF-FNTTHDQRESKLAESRQHDDHEVDKYSISL 73

Query: 305 FXXXXXXXXXXXXXXLLEPTAGDMVCKREDHEDEGK----SSRITDH-----DGSMKWMS 457
                           L P++       +D +D       SS+  D      D S+ WM 
Sbjct: 74  ---------GRSSDHKLFPSSSFQPVVNDDDDDSNFHKLFSSKTEDGTEGSGDSSVNWMP 124

Query: 458 SKMRVMPKMMNSN---SDKPACRSVVHKFNDQIKYST-NYNSNDFVRVCSDCNTTKTPLW 625
           S+M  M +M NSN   +D    + ++   N Q + +  N +SN  +RVCSDCNTT TPLW
Sbjct: 125 SRMTTMQEMSNSNRSETDHQPMKFMLKFHNQQCQNNDINSSSNSNIRVCSDCNTTSTPLW 184

Query: 626 RGGPQGPKSLCNACGIRQRKARRAMAEAANG------------TELNTGTKKSRTSYIAQ 769
           R GP+GPKSLCNACGIRQRKARRAMA A NG            +++N+  KK RTS++ Q
Sbjct: 185 RSGPRGPKSLCNACGIRQRKARRAMAAAENGAVISVEASSSTKSKVNSKVKKLRTSHVVQ 244

Query: 770 NKQHFKLIGTDSAHTSDQKKLSFEDFALSLTSKKNL--TIP-DEEEAAILLMALSCGLIH 940
            K+        +     QKKL F++ ALSL+    L   +P D EEAAILLM LSCG IH
Sbjct: 245 GKKLSN--KPPNPPLQSQKKLCFKNLALSLSKNPALRQVLPHDVEEAAILLMELSCGFIH 302

Query: 941 S 943
           S
Sbjct: 303 S 303


>XP_002514107.2 PREDICTED: putative GATA transcription factor 22 [Ricinus communis]
          Length = 377

 Score =  164 bits (415), Expect = 2e-43
 Identities = 131/313 (41%), Positives = 165/313 (52%), Gaps = 42/313 (13%)
 Frame = +2

Query: 128 IDLNEDHHLEHLISLPHHPTP------SISTPIYFNLNQEYQICGSHSRESLQNQKKVEK 289
           IDLNED H   LI      T       SIS PI+ N  QE    G + +E LQ     E 
Sbjct: 79  IDLNEDQHHHQLIFCSKTTTEDASSSSSISYPIFINPPQEE--VGYYHKE-LQPLHHQEV 135

Query: 290 NNIVLFXXXXXXXXXXXXXXLLEPTAGDMVCKREDHEDEGKSSRITDH--DGSMKWMSSK 463
           +NI  +                E      VCK+ED     KS+ I D   + S+KWMSSK
Sbjct: 136 DNI--YASHGRSWDHRIIKNENENGQELSVCKKED-----KSTSIEDQRDNSSVKWMSSK 188

Query: 464 MRVMPKMMNSNSDKPACR--SVVHKFNDQIK---------YSTNY---NSNDFVRVCSDC 601
           MR+M KMM ++      +  S +HK  D+ K         YS+     NSN+ +RVCSDC
Sbjct: 189 MRLMRKMMTTDQTVNTTQHTSSMHKLEDKEKSRSLPLQDDYSSKNLSDNSNNTIRVCSDC 248

Query: 602 NTTKTPLWRGGPQGPKSLCNACGIRQRKARRAMAEA---ANGTELNTGTKKSRTSYIAQN 772
           NTTKTPLWR GP+GPKSLCNACGIRQRKARRA+A A   ANGT     T   +T+ + QN
Sbjct: 249 NTTKTPLWRSGPRGPKSLCNACGIRQRKARRALAAAQASANGTIFAPDTAAMKTNKV-QN 307

Query: 773 KQHFKLIGTDSAH-------------TSDQKKLSFEDFALSLTSK----KNLTIPDEEEA 901
           K+      T+++H                +KKL FED + ++ SK    + L   DE+EA
Sbjct: 308 KEK----RTNNSHLPFKKRCKFTAQSRGSRKKLCFEDLSSTILSKNSAFQQLFPQDEKEA 363

Query: 902 AILLMALSCGLIH 940
           AILLMALS GL+H
Sbjct: 364 AILLMALSYGLVH 376


>ABK96296.1 unknown [Populus trichocarpa x Populus deltoides]
          Length = 306

 Score =  161 bits (408), Expect = 3e-43
 Identities = 119/304 (39%), Positives = 159/304 (52%), Gaps = 32/304 (10%)
 Frame = +2

Query: 128 IDLNEDHHLEHLISLPHHPTPSISTPIYFNLNQEYQICGSHSRESLQNQK-KVEKNNIVL 304
           +DL E+ HL+  +S PH    S+S P  F  N  +    S   ES Q+   +V+K +I L
Sbjct: 16  VDLKEEQHLQLFLS-PHQAATSLSGPTNF-FNTTHDQRESKLAESRQHDDHEVDKYSISL 73

Query: 305 FXXXXXXXXXXXXXXLLEPTAGDMVCKREDHEDEGK-----SSRITDH-----DGSMKWM 454
                           L P++       +D +D+       SS+  D      D S+ WM
Sbjct: 74  ---------GRSSDHKLFPSSSFQPVVNDDDDDDSNFHKLFSSKTEDGTEGSGDSSVNWM 124

Query: 455 SSKMRVMPKMMNSN---SDKPACRSVVHKFNDQIKYSTN---YNSNDFVRVCSDCNTTKT 616
            S+M  M +M  SN   +D    + ++   N Q + + N    +SN  +RVCSDCNTT T
Sbjct: 125 PSRMTTMQEMTTSNRSETDHQPMKFMLKFHNQQCQNNVNDINSSSNSNIRVCSDCNTTST 184

Query: 617 PLWRGGPQGPKSLCNACGIRQRKARRAMAEAANG------------TELNTGTKKSRTSY 760
           PLWR GP+GPKSLCNACGIRQRKARRAMA A NG            +++N+  KK RTS+
Sbjct: 185 PLWRSGPRGPKSLCNACGIRQRKARRAMAAAENGAVISVEASSSTKSKVNSKVKKLRTSH 244

Query: 761 IAQNKQHFKLIGTDSAHTSDQKKLSFEDFALSLTSKKNL--TIP-DEEEAAILLMALSCG 931
           + Q K+        +     QKKL F++ ALSL+    L   +P D EEAAILLM LSCG
Sbjct: 245 VVQGKKLSN--KPPNPPLQSQKKLCFKNLALSLSKNPVLRQVLPHDVEEAAILLMELSCG 302

Query: 932 LIHS 943
            IHS
Sbjct: 303 FIHS 306


>XP_007012845.2 PREDICTED: putative GATA transcription factor 22 [Theobroma cacao]
          Length = 302

 Score =  160 bits (405), Expect = 8e-43
 Identities = 127/306 (41%), Positives = 160/306 (52%), Gaps = 35/306 (11%)
 Frame = +2

Query: 128 IDLNED--HHLEHLISLPHHP----TPSISTPIYFNLNQEYQICGSHSRESLQN-QKKVE 286
           IDLNED  H    L SL   P    + S++ PI FN   + Q  G H RE  Q+ Q + +
Sbjct: 15  IDLNEDDQHQQHQLFSLKPQPPSLSSSSLTCPILFNPVVQEQ-AGGHQREPHQHFQYQED 73

Query: 287 KNNIVLFXXXXXXXXXXXXXXLLEPTAGDMVCKREDHEDEGKSSRITDHDGSMKWMSSKM 466
           +  I +               L +   G+     E+H+ E         D S KWMSSKM
Sbjct: 74  QAKIYVPQDEPLESDSGLNLSLFKKEEGN-----ENHQIE---------DSSAKWMSSKM 119

Query: 467 RVMPKMMNS------NSDKPACRSVVHKFNDQIKYSTN--YNSND--FVRVCSDCNTTKT 616
           R+M KMM+S      NS  P       + +     S+N  YN+ND   +RVC+DCNTTKT
Sbjct: 120 RMMRKMMSSDRADLSNSSTPKLEEPKQQPSSSPDNSSNSSYNNNDNITIRVCADCNTTKT 179

Query: 617 PLWRGGPQGPKSLCNACGIRQRKARRAM--AEAANGTELNTGT-------------KKSR 751
           PLWR GP+GPKSLCNACGIRQRKARRAM  A AANGT +   T             + S 
Sbjct: 180 PLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAANGTIVAAQTTPTMKSKVQDKSKRSSN 239

Query: 752 TSYIAQNKQHFKLIGTDSAHTSDQKKLSFED--FALSLTSKKNLTIP-DEEEAAILLMAL 922
           +  +AQ K+  K     S+ +  +KKL FED    LS  S  +   P DE+EAAILLMAL
Sbjct: 240 SGCVAQLKKKCK----HSSQSQGRKKLCFEDLRIILSKNSAFHRVFPQDEKEAAILLMAL 295

Query: 923 SCGLIH 940
           S GL+H
Sbjct: 296 SYGLVH 301


>XP_017982034.1 PREDICTED: putative GATA transcription factor 22 [Theobroma cacao]
          Length = 311

 Score =  160 bits (404), Expect = 1e-42
 Identities = 103/214 (48%), Positives = 124/214 (57%), Gaps = 32/214 (14%)
 Frame = +2

Query: 398 EDEGKSSRITDHDGSMKWMSSKMRVMPKMMNSN----SDKPACRSVVHKFNDQIKY---- 553
           +++G     + +  S+KWMSSK+R+M KMMNSN     DKP       KF  +++Y    
Sbjct: 107 KEDGDCESASGNGSSVKWMSSKVRLMKKMMNSNCSGVDDKPP------KFTQRLQYPVHD 160

Query: 554 STNYNS----NDFVRVCSDCNTTKTPLWRGGPQGPKSLCNACGIRQRKARRAMAEAANGT 721
           S   NS    N+ VRVCSDCNTT TPLWR GP+GPKSLCNACGIRQRKARRAM  AA   
Sbjct: 161 SDETNSFSKANNTVRVCSDCNTTTTPLWRSGPRGPKSLCNACGIRQRKARRAMEAAAAAA 220

Query: 722 ELNTGT-----------------KKSRTSYIAQNKQHFKLIGTDSAHTSDQKKLSFEDFA 850
             N                    KKSRTS++AQ K+  K           QKKL F++FA
Sbjct: 221 AENGAAAAADASSMKIKVHIHKEKKSRTSHVAQCKKQVK---PPYYSPQSQKKLCFKEFA 277

Query: 851 LSLTSKKNL--TIP-DEEEAAILLMALSCGLIHS 943
           LSL+    L    P D E+AAILLM LSCGL+HS
Sbjct: 278 LSLSKNSALQRVFPQDVEDAAILLMELSCGLVHS 311

