BLASTX nr result
ID: Aconitum23_contig00029104
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Aconitum23_contig00029104
         (1180 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_010262144.1| PREDICTED: putative GATA transcription facto...   189   3e-45
ref|XP_010656197.1| PREDICTED: GATA transcription factor 21 isof...   184   1e-43
ref|XP_002282173.1| PREDICTED: GATA transcription factor 21 isof...   184   1e-43
ref|XP_010242203.1| PREDICTED: GATA transcription factor 21-like...   178   9e-42
ref|XP_007012281.1| GATA type zinc finger transcription factor f...   159   4e-36
ref|XP_007012845.1| GATA type zinc finger transcription factor f...   158   1e-35
gb|KHG09089.1| Putative GATA transcription factor 22 -like prote...   155   5e-35
ref|XP_010090039.1| Putative GATA transcription factor 22 [Morus...   155   6e-35
ref|XP_006450838.1| hypothetical protein CICLE_v10008968mg [Citr...   154   1e-34
ref|XP_006451458.1| hypothetical protein CICLE_v10009004mg [Citr...   152   4e-34
ref|XP_006450839.1| hypothetical protein CICLE_v10008968mg [Citr...   152   5e-34
ref|XP_006600457.1| PREDICTED: GATA transcription factor 21-like...   150   2e-33
ref|XP_003550634.1| PREDICTED: GATA transcription factor 21-like...   150   2e-33
emb|CDP03165.1| unnamed protein product [Coffea canephora]            150   2e-33
ref|XP_010942001.1| PREDICTED: putative GATA transcription facto...   150   3e-33
ref|XP_002514107.1| hypothetical protein RCOM_1046780 [Ricinus c...   149   3e-33
gb|KDO57760.1| hypothetical protein CISIN_1g021859mg [Citrus sin...   149   4e-33
ref|NP_001280882.1| transcription factor GATA-5 [Malus domestica...   149   4e-33
ref|XP_013458498.1| GATA type zinc finger transcription factor f...
148 8e-33 gb|KOM29610.1| hypothetical protein LR48_Vigan728s003300 [Vigna ... 148 1e-32 >ref|XP_010262144.1| PREDICTED: putative GATA transcription factor 22 [Nelumbo nucifera] Length = 316 Score = 189 bits (481), Expect = 3e-45 Identities = 115/260 (44%), Positives = 149/260 (57%), Gaps = 24/260 (9%) Frame = -1 Query: 922 EQFKQYQPSQEVVDNVSQGGSSDHQSVSPPSLSTLESG----------SEYGDQNKSHST 773 E + Q Q+ +G H P L + ++G E D+++S+ST Sbjct: 63 EAHDREQQEQQQRQEADKGSEGYHLDFPYPPLQSSKNGINSSLELSIKQEIRDESQSNST 122 Query: 772 PSEKGSGSWMSSKMRLMKKMINSD--GSNDALKPIRKRFW--FQDP---KLSLSGDNLAS 614 GS WMSSKMRLM+KM+NSD G++ ++F Q P ++ S N +S Sbjct: 123 ----GSARWMSSKMRLMRKMMNSDRMGADKPASGNTQKFQDHHQQPSSLEMDSSSSNSSS 178 Query: 613 NNN--IIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGILTT 440 NN+ +R CSDCNTT+TPLWRSGP+GPKSLCNACGI + T +L Sbjct: 179 NNSNITVRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAASGT--LLPA 236 Query: 439 DTPSVASKVHCSKKKSEKGHVAQHKKQCK-----SKTNNIGFEDFIVNLSNNLVLHRVFP 275 DTPS+ KVH +K+SE G+V Q+KK+CK + FEDF +NLS N HRVFP Sbjct: 237 DTPSLQRKVHHKEKRSETGYVPQYKKRCKLAPSPRSRKKLCFEDFTINLSKNSAFHRVFP 296 Query: 274 QDETEAAILLMALSCGLVNG 215 QDE EAAILLMALSCGLV+G Sbjct: 297 QDEKEAAILLMALSCGLVHG 316 >ref|XP_010656197.1| PREDICTED: GATA transcription factor 21 isoform X1 [Vitis vinifera] Length = 310 Score = 184 bits (468), Expect = 1e-43 Identities = 120/260 (46%), Positives = 147/260 (56%), Gaps = 28/260 (10%) Frame = -1 Query: 910 QYQPSQEVVDN--VSQGGSSDHQSVSPPSLSTLESGSEYG---------DQNKSHSTPSE 764 Q QP QEV + V +GGS DH TLES S+ G D+N++HS E Sbjct: 64 QAQPQQEVAHDKFVFRGGSYDHP--------TLESESDNGLKLTIWKTEDRNENHS---E 112 Query: 763 KGSGSWMSSKMRLMKKMINSDGSNDALKPIRKRFWFQDPKL----------SLSGDNLAS 614 GS WMSSKMR+M+KM+ SD + A KP F D K S++ N+ S Sbjct: 113 NGSVKWMSSKMRVMQKMMISDQTG-AQKPSNTALNFGDHKQQSLPSETDYNSINSSNINS 171 Query: 613 NNNIIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGILTTDT 434 NN I R C+DCNTT+TPLWRSGP+GPKSLCNACGI + IL T+T Sbjct: 172 NNTI-RVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAATANGTILPTNT 230 Query: 
433 PSVASKVHCSKKKSEKGHVAQHKKQCK------SKTNNIGFEDFIVNLSNNLVLHRVFPQ 272 +K KKS GHV+ +KK+CK +T + FEDF ++LS N HRVF Q Sbjct: 231 APTKTKAKHKDKKSSNGHVSHYKKRCKLAAAPSCETKKLCFEDFTISLSKNSAFHRVFLQ 290 Query: 271 DE-TEAAILLMALSCGLVNG 215 DE EAAILLMALSCGLV+G Sbjct: 291 DEIKEAAILLMALSCGLVHG 310 >ref|XP_002282173.1| PREDICTED: GATA transcription factor 21 isoform X2 [Vitis vinifera] gi|297738668|emb|CBI27913.3| unnamed protein product [Vitis vinifera] Length = 309 Score = 184 bits (468), Expect = 1e-43 Identities = 120/259 (46%), Positives = 146/259 (56%), Gaps = 27/259 (10%) Frame = -1 Query: 910 QYQPSQEVVDN-VSQGGSSDHQSVSPPSLSTLESGSEYG---------DQNKSHSTPSEK 761 Q QP QE D V +GGS DH TLES S+ G D+N++HS E Sbjct: 64 QAQPQQEAHDKFVFRGGSYDHP--------TLESESDNGLKLTIWKTEDRNENHS---EN 112 Query: 760 GSGSWMSSKMRLMKKMINSDGSNDALKPIRKRFWFQDPKL----------SLSGDNLASN 611 GS WMSSKMR+M+KM+ SD + A KP F D K S++ N+ SN Sbjct: 113 GSVKWMSSKMRVMQKMMISDQTG-AQKPSNTALNFGDHKQQSLPSETDYNSINSSNINSN 171 Query: 610 NNIIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGILTTDTP 431 N I R C+DCNTT+TPLWRSGP+GPKSLCNACGI + IL T+T Sbjct: 172 NTI-RVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAATANGTILPTNTA 230 Query: 430 SVASKVHCSKKKSEKGHVAQHKKQCK------SKTNNIGFEDFIVNLSNNLVLHRVFPQD 269 +K KKS GHV+ +KK+CK +T + FEDF ++LS N HRVF QD Sbjct: 231 PTKTKAKHKDKKSSNGHVSHYKKRCKLAAAPSCETKKLCFEDFTISLSKNSAFHRVFLQD 290 Query: 268 E-TEAAILLMALSCGLVNG 215 E EAAILLMALSCGLV+G Sbjct: 291 EIKEAAILLMALSCGLVHG 309 >ref|XP_010242203.1| PREDICTED: GATA transcription factor 21-like [Nelumbo nucifera] Length = 305 Score = 178 bits (451), Expect = 9e-42 Identities = 111/245 (45%), Positives = 141/245 (57%), Gaps = 27/245 (11%) Frame = -1 Query: 868 GGSSDHQ----SVSPPSLST-LESGSEYGDQNKSHS---TPSEKGSGSWMSSKMRLMKKM 713 GG SDHQ PP++ + SG E + + + + GS WMSSKMRLM+KM Sbjct: 68 GGPSDHQYFPDDPPPPTVEDDINSGLELSNSKQRENRGGSQGNMGSVRWMSSKMRLMRKM 127 Query: 712 INSD--GSNDALKPIRKRF-----------WFQDPKLSLSGDNLASNNNIIRTCSDCNTT 572 NSD G + + +F W D + S +N NN +R CSDCNTT Sbjct: 128 
KNSDRVGMDKPVNTNMHKFQQDHHHRSPSPWEMDTSSNSSSNNA---NNTVRVCSDCNTT 184 Query: 571 RTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGILTTDTPSVASKVHCSKKK- 395 +TPLWRSGP+GPKSLCNACGI A+ +L T+ S+ +KVH +K+ Sbjct: 185 KTPLWRSGPRGPKSLCNACGI----RQRKARRAMAAANGTLLPTEASSMKNKVHHKEKRS 240 Query: 394 SEKGHVAQHKKQCKSKTN-----NIGFEDFIVNLSNNLVLHRVFPQDETEAAILLMALSC 230 SE G+V Q+KK+CK T+ + FEDF +NLS N HRVFPQDE EAAILLMALSC Sbjct: 241 SETGYVQQYKKRCKLATSPRSMKKVCFEDFTINLSKNSSFHRVFPQDEKEAAILLMALSC 300 Query: 229 GLVNG 215 GLV+G Sbjct: 301 GLVHG 305 >ref|XP_007012281.1| GATA type zinc finger transcription factor family protein, putative [Theobroma cacao] gi|508782644|gb|EOY29900.1| GATA type zinc finger transcription factor family protein, putative [Theobroma cacao] Length = 311 Score = 159 bits (402), Expect = 4e-36 Identities = 111/260 (42%), Positives = 137/260 (52%), Gaps = 20/260 (7%) Frame = -1 Query: 937 QDQLKEQFKQYQPSQEVVDN-VSQGGSSDHQSVSPPSLSTLESGSEYGDQNKSHSTP--- 770 QDQ + ++ +P + ++ GS D Q+ S SL + S N S S Sbjct: 51 QDQTVTKPEESKPHDHKGNQFMTHEGSIDQQASSSSSLQSAVDQSTANGYNLSFSRKEDG 110 Query: 769 ---SEKGSGS---WMSSKMRLMKKMINSDGSNDALKPIRKRFWFQDPKLSLSGDNLASN- 611 S G+GS WMSSK+RLMKKM+NS+ S KP + FQ P N S Sbjct: 111 DCESASGNGSSVKWMSSKVRLMKKMMNSNCSGADDKPPKFTQRFQYPVHDSDETNSFSKA 170 Query: 610 NNIIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTD--GILTTD 437 NN +R CSDCNTT TPLWRSGP+GPKSLCNACGI A+ + D Sbjct: 171 NNTVRVCSDCNTTTTPLWRSGPRGPKSLCNACGIRQRKARRAMEAAAAAAAENGAAAAAD 230 Query: 436 TPSVASKVHCSK-KKSEKGHVAQHKKQCK------SKTNNIGFEDFIVNLSNNLVLHRVF 278 S+ KVH K KKS HVAQ KKQ K + F++F ++LS N L RVF Sbjct: 231 ASSMKIKVHIHKEKKSRTSHVAQCKKQVKPPYYSPQSQKKLCFKEFALSLSKNSALQRVF 290 Query: 277 PQDETEAAILLMALSCGLVN 218 PQD +AAILLM LSCGLV+ Sbjct: 291 PQDVEDAAILLMELSCGLVH 310 >ref|XP_007012845.1| GATA type zinc finger transcription factor family protein, putative [Theobroma cacao] gi|508783208|gb|EOY30464.1| GATA type zinc finger transcription factor family protein, putative [Theobroma cacao] Length = 302 Score = 158 bits (399), 
Expect = 1e-35 Identities = 110/249 (44%), Positives = 139/249 (55%), Gaps = 14/249 (5%) Frame = -1 Query: 919 QFKQYQPSQEVVDNVSQGGSSDHQSVSPPSLSTLESGSEYGDQNKSHSTPSEKGSGSWMS 740 Q QYQ Q + V Q + S SL E G+E+ E S WMS Sbjct: 66 QHFQYQEDQAKI-YVPQDEPLESDSGLNLSLRKKEEGNEHHQ--------IEDSSAKWMS 116 Query: 739 SKMRLMKKMINSDGSNDALKPIRKRFWFQDPKL--SLSGDNLAS-----NNNI-IRTCSD 584 SKMR+M+KM++SD ++ + K ++PK S S DN ++ N+NI IR C+D Sbjct: 117 SKMRMMRKMMSSDRADLSNSSTPK---LEEPKQQPSSSPDNSSNSSYNNNDNITIRVCAD 173 Query: 583 CNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGILTTDTPSVASKVH-C 407 CNTT+TPLWRSGP+GPKSLCNACGI A+ + TP++ SKV Sbjct: 174 CNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAANGAIVAAQTTPTMKSKVQDK 233 Query: 406 SKKKSEKGHVAQHKKQCKSKTNNIG-----FEDFIVNLSNNLVLHRVFPQDETEAAILLM 242 SK+ S G VAQ KK+CK + + G FED + LS N HRVFPQDE EAAILLM Sbjct: 234 SKRSSNSGCVAQLKKKCKHSSQSQGRKKLCFEDLRIILSKNSAFHRVFPQDEKEAAILLM 293 Query: 241 ALSCGLVNG 215 ALS GLV+G Sbjct: 294 ALSYGLVHG 302 >gb|KHG09089.1| Putative GATA transcription factor 22 -like protein [Gossypium arboreum] Length = 305 Score = 155 bits (393), Expect = 5e-35 Identities = 105/250 (42%), Positives = 135/250 (54%), Gaps = 7/250 (2%) Frame = -1 Query: 943 AFQDQLKEQFKQYQPSQEVVDNVSQGGSSDHQSVSPPSLSTLESGSEYGDQNKSHSTPSE 764 AF L +QF Q QE + +V Q G +S LS + ++SH Sbjct: 68 AFYQSLPQQFHDDQQDQEKI-HVPQDGPL--RSDCELRLSIWKKEERVETHHQSHD---- 120 Query: 763 KGSGSWMSSKMRLMKKMINSDGSNDALKPIRKRFWFQDPKL-SLSGDNLASNNNIIRTCS 587 S WM SKMR+M+KM+NSD ++ + P K Q+ K S S DN NN+ IR C+ Sbjct: 121 --SAKWMPSKMRMMRKMMNSDHTDLSNSPTPKSEDHQEQKQPSSSPDN---NNSTIRVCA 175 Query: 586 DCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGILTTDTPSVASKVHC 407 DCNTT+TPLWRSGP+GPKSLCNACGI A++ + PS+ S+V Sbjct: 176 DCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAAAVAAAAASSVVAAETPPSMRSEVQL 235 Query: 406 SKKKSEKGHVAQHK-KQCKSKTNN-----IGFEDFIVNLSNNLVLHRVFPQDETEAAILL 245 K+S V K K+CK + + + FED + LS N H VFPQDE EAAILL Sbjct: 236 KAKRSSNNGVPHLKNKKCKHNSQSQSRKKLCFEDLRIILSKNSAFHGVFPQDEKEAAILL 295 Query: 244 MALSCGLVNG 215 MALS GLV+G Sbjct: 296 
MALSYGLVHG 305 >ref|XP_010090039.1| Putative GATA transcription factor 22 [Morus notabilis] gi|587848577|gb|EXB38836.1| Putative GATA transcription factor 22 [Morus notabilis] Length = 335 Score = 155 bits (392), Expect = 6e-35 Identities = 109/263 (41%), Positives = 137/263 (52%), Gaps = 42/263 (15%) Frame = -1 Query: 877 VSQGGSSDHQSVSPPSLSTLESG-----------------SEYGDQNKSHSTPSEKG-SG 752 VS GGSSD + PP ++ ES S Y SH + + G S Sbjct: 76 VSSGGSSD---IHPPRVAESESDHHQNDLKLSIWKSSTEDSNYDHDKSSHVSDNNAGYSA 132 Query: 751 SWMSSKMRLMKKMI-NSDGSN-DALKPIRKRFWFQD------PKLSLSGDNLAS------ 614 WM SKMR+M+KMI N D +N D P+ F P L D+ ++ Sbjct: 133 KWMPSKMRMMRKMIVNPDQTNIDHHTPLNFTHKFDQVMKRKHPASPLGTDHSSTSSSNNN 192 Query: 613 NNNIIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGILTTDT 434 NNN IR C+DCNTT+TPLWRSGP+GPKSLCNACGI A+ IL TD Sbjct: 193 NNNTIRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAAANGTILATDA 252 Query: 433 PSVAS--KVHCSKKKSEKGH--VAQHKKQCKSKTN------NIGFEDFIVNLSNNLVLHR 284 ++ S KV +KK + G+ V Q KK+CK + I FED +++S N R Sbjct: 253 TTMKSSTKVQRKEKKPKNGNGVVPQFKKRCKLTASPSRGRKKICFEDLAISISKNSAFQR 312 Query: 283 VFPQDETEAAILLMALSCGLVNG 215 VFPQDE +AAILLMALS GLV+G Sbjct: 313 VFPQDEKDAAILLMALSYGLVHG 335 >ref|XP_006450838.1| hypothetical protein CICLE_v10008968mg [Citrus clementina] gi|568844084|ref|XP_006475926.1| PREDICTED: putative GATA transcription factor 22-like isoform X2 [Citrus sinensis] gi|557554064|gb|ESR64078.1| hypothetical protein CICLE_v10008968mg [Citrus clementina] gi|641861410|gb|KDO80098.1| hypothetical protein CISIN_1g021329mg [Citrus sinensis] Length = 312 Score = 154 bits (389), Expect = 1e-34 Identities = 103/267 (38%), Positives = 138/267 (51%), Gaps = 26/267 (9%) Frame = -1 Query: 940 FQDQLKEQFKQYQPSQEVVDNVSQGGSSDHQSVSPPSLSTLESGSEYGDQ------NKSH 779 FQDQ ++ Q + VD+ GSS+ Q S S+ T + + ++ Sbjct: 48 FQDQRMIIMEESQQHDQKVDH---SGSSNLQVFSSSSIQTKKMNNITNNKLPIRKREVGE 104 Query: 778 STPSEKGS---GSWMSSKMRLMKKMINSDGSNDA-----LKPIRKRFWFQ-DPKLSLSGD 626 T SE GS G WMSSK+RLM KMINS ++ A +K +K + Q ++ 
Sbjct: 105 GTTSENGSSSSGKWMSSKIRLMHKMINSSSNSTATHELAVKVTQKLQYHQLHDNSEVNSF 164 Query: 625 NLASNNNIIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGIL 446 N +++NN +R CSDCNTT TPLWRSGP+GPKSLCNACGI T I Sbjct: 165 NSSNSNNTMRACSDCNTTTTPLWRSGPRGPKSLCNACGIRQRKARRAMQAAAAVETGTIA 224 Query: 445 TT-DTPSVASKVHCSKKKSEKGHVAQHKKQCKS----------KTNNIGFEDFIVNLSNN 299 T +P K+ KK HV+Q+KKQ ++ + F+DF + LS N Sbjct: 225 ATGGSPFAKIKLQIKDKKPRTSHVSQNKKQYRTLDPDPTHQYQSQRKLCFKDFAIALSKN 284 Query: 298 LVLHRVFPQDETEAAILLMALSCGLVN 218 L +VFPQD EAAILLM LSCG ++ Sbjct: 285 SALKQVFPQDVEEAAILLMELSCGFIH 311 >ref|XP_006451458.1| hypothetical protein CICLE_v10009004mg [Citrus clementina] gi|568843031|ref|XP_006475428.1| PREDICTED: putative GATA transcription factor 22-like [Citrus sinensis] gi|557554684|gb|ESR64698.1| hypothetical protein CICLE_v10009004mg [Citrus clementina] Length = 306 Score = 152 bits (385), Expect = 4e-34 Identities = 98/217 (45%), Positives = 118/217 (54%), Gaps = 12/217 (5%) Frame = -1 Query: 829 LSTLESGSEYGDQNKSHSTPSEKGSGSWMSSKMRLMKKMINSDGSNDALKPIRKRFWFQD 650 LS E DQN+S ++ S K WMSSKMRLMKKM+ S A++ + Q Sbjct: 95 LSMSSEKEERNDQNQSENSSSVK----WMSSKMRLMKKMMYSSPDAAAMQKLEDH-QKQP 149 Query: 649 PKLSLSGDNLASNNNI--IRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXX 476 P SL DN +NNN IR C+DCNTT+TPLWRSGP+GPKSLCNACGI Sbjct: 150 PSSSLEPDNGNNNNNTNTIRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAA 209 Query: 475 XXXASTDGILTTDTPSVASKVHCSKKKSEKGHVAQHKKQCKSKTNN--------IGFEDF 320 T L D S K + + S KK+CK +N+ FED Sbjct: 210 AAANGTAVQLAADDTSSNKKKSKTPRPSNNNSCLPFKKRCKYNSNSPSRGKKKLCSFEDL 269 Query: 319 IVNLS--NNLVLHRVFPQDETEAAILLMALSCGLVNG 215 +NLS N+ L RVFPQ+E EAAILLMALS GLV+G Sbjct: 270 TLNLSKNNSSALQRVFPQEEKEAAILLMALSYGLVHG 306 >ref|XP_006450839.1| hypothetical protein CICLE_v10008968mg [Citrus clementina] gi|568844082|ref|XP_006475925.1| PREDICTED: putative GATA transcription factor 22-like isoform X1 [Citrus sinensis] gi|557554065|gb|ESR64079.1| hypothetical protein CICLE_v10008968mg [Citrus clementina] 
gi|641861411|gb|KDO80099.1| hypothetical protein CISIN_1g021329mg [Citrus sinensis] Length = 314 Score = 152 bits (384), Expect = 5e-34 Identities = 102/267 (38%), Positives = 136/267 (50%), Gaps = 26/267 (9%) Frame = -1 Query: 940 FQDQLKEQFKQYQPSQEVVDNVSQGGSSDHQSVSPPSLSTLESGSEYGDQ------NKSH 779 FQDQ ++ Q + V GSS+ Q S S+ T + + ++ Sbjct: 48 FQDQRMIIMEESQQHDQKA-RVDHSGSSNLQVFSSSSIQTKKMNNITNNKLPIRKREVGE 106 Query: 778 STPSEKGS---GSWMSSKMRLMKKMINSDGSNDA-----LKPIRKRFWFQ-DPKLSLSGD 626 T SE GS G WMSSK+RLM KMINS ++ A +K +K + Q ++ Sbjct: 107 GTTSENGSSSSGKWMSSKIRLMHKMINSSSNSTATHELAVKVTQKLQYHQLHDNSEVNSF 166 Query: 625 NLASNNNIIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGIL 446 N +++NN +R CSDCNTT TPLWRSGP+GPKSLCNACGI T I Sbjct: 167 NSSNSNNTMRACSDCNTTTTPLWRSGPRGPKSLCNACGIRQRKARRAMQAAAAVETGTIA 226 Query: 445 TT-DTPSVASKVHCSKKKSEKGHVAQHKKQCKS----------KTNNIGFEDFIVNLSNN 299 T +P K+ KK HV+Q+KKQ ++ + F+DF + LS N Sbjct: 227 ATGGSPFAKIKLQIKDKKPRTSHVSQNKKQYRTLDPDPTHQYQSQRKLCFKDFAIALSKN 286 Query: 298 LVLHRVFPQDETEAAILLMALSCGLVN 218 L +VFPQD EAAILLM LSCG ++ Sbjct: 287 SALKQVFPQDVEEAAILLMELSCGFIH 313 >ref|XP_006600457.1| PREDICTED: GATA transcription factor 21-like isoform X2 [Glycine max] gi|947053264|gb|KRH02717.1| hypothetical protein GLYMA_17G055200 [Glycine max] Length = 310 Score = 150 bits (380), Expect = 2e-33 Identities = 105/263 (39%), Positives = 138/263 (52%), Gaps = 28/263 (10%) Frame = -1 Query: 922 EQFKQYQPS--QEVVDNVSQGGSSDHQ-SVSPPSLSTLESGSEYGDQNKSHSTPSEKGSG 752 E KQY PS +E + GS DH + S + +T+ +E ++N S +E GS Sbjct: 49 EPTKQYLPSHEEETEKIIPSSGSWDHSVAESEHNKATVWKKAEERNENLE-SVAAEDGSL 107 Query: 751 SWMSSKMRLMKKMINSDGSNDALKPIRKRFW-FQDPKLSLSG----DNLASNN------N 605 WM +KMR+M+KM+ SD ++ F D K LS DN +SNN N Sbjct: 108 KWMPAKMRIMRKMLVSDQTDTYTNSDNNTTHKFDDQKQQLSSPLGTDNSSSNNYSNHSNN 167 Query: 604 IIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDG--ILTTDTP 431 +R CSDC+TT+TPLWRSGP+GPKSLCNACGI +++ ++ Sbjct: 168 TVRVCSDCHTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAASASGNGTVIVEAKK 227 Query: 430 
SVASKVHCSKKKSEKGH---VAQHKKQCK---------SKTNNIGFEDFIVNLSNNLVLH 287 SV + KKK +K AQ KK+ K N GFED + L NL +H Sbjct: 228 SVKGRNKLQKKKEKKTRTEGAAQMKKKRKLGVGSAKASQSRNKFGFEDLTLRLRKNLAMH 287 Query: 286 RVFPQDETEAAILLMALSCGLVN 218 +VFPQDE EAAILLMALS GLV+ Sbjct: 288 QVFPQDEKEAAILLMALSYGLVH 310 >ref|XP_003550634.1| PREDICTED: GATA transcription factor 21-like isoform X1 [Glycine max] gi|734365288|gb|KHN17667.1| Putative GATA transcription factor 22 [Glycine soja] gi|947053263|gb|KRH02716.1| hypothetical protein GLYMA_17G055200 [Glycine max] Length = 322 Score = 150 bits (380), Expect = 2e-33 Identities = 105/263 (39%), Positives = 138/263 (52%), Gaps = 28/263 (10%) Frame = -1 Query: 922 EQFKQYQPS--QEVVDNVSQGGSSDHQ-SVSPPSLSTLESGSEYGDQNKSHSTPSEKGSG 752 E KQY PS +E + GS DH + S + +T+ +E ++N S +E GS Sbjct: 61 EPTKQYLPSHEEETEKIIPSSGSWDHSVAESEHNKATVWKKAEERNENLE-SVAAEDGSL 119 Query: 751 SWMSSKMRLMKKMINSDGSNDALKPIRKRFW-FQDPKLSLSG----DNLASNN------N 605 WM +KMR+M+KM+ SD ++ F D K LS DN +SNN N Sbjct: 120 KWMPAKMRIMRKMLVSDQTDTYTNSDNNTTHKFDDQKQQLSSPLGTDNSSSNNYSNHSNN 179 Query: 604 IIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDG--ILTTDTP 431 +R CSDC+TT+TPLWRSGP+GPKSLCNACGI +++ ++ Sbjct: 180 TVRVCSDCHTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAASASGNGTVIVEAKK 239 Query: 430 SVASKVHCSKKKSEKGH---VAQHKKQCK---------SKTNNIGFEDFIVNLSNNLVLH 287 SV + KKK +K AQ KK+ K N GFED + L NL +H Sbjct: 240 SVKGRNKLQKKKEKKTRTEGAAQMKKKRKLGVGSAKASQSRNKFGFEDLTLRLRKNLAMH 299 Query: 286 RVFPQDETEAAILLMALSCGLVN 218 +VFPQDE EAAILLMALS GLV+ Sbjct: 300 QVFPQDEKEAAILLMALSYGLVH 322 >emb|CDP03165.1| unnamed protein product [Coffea canephora] Length = 318 Score = 150 bits (379), Expect = 2e-33 Identities = 102/259 (39%), Positives = 130/259 (50%), Gaps = 24/259 (9%) Frame = -1 Query: 919 QFKQYQPSQEVVDNVSQGGSSD-HQSVSPPSLSTLESGSEYGDQNKSHSTPSEKGSGSWM 743 Q Q + Q+V ++ GS D + + S +L + G+Q H + + W+ Sbjct: 63 QMHQQEYQQQVENHAPYTGSQDPEKKANKGSKISLWKNNTNGNQADDHEEINPVNN-KWV 121 Query: 742 
SSKMRLMKKMINSDGSNDALKPIRKRFWFQDPK-----LSLSGDNLASN------NNIIR 596 SSK++LM+KM N + F+D + S DN +SN N IR Sbjct: 122 SSKVKLMQKM-NKPDLKEITSSTTTTMKFEDHQKQPTSASPEADNFSSNSSSNISNTPIR 180 Query: 595 TCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGILTTDTPSVASK 416 C+DCNTT+TPLWRSGPKGPKSLCNACGI A+ T + K Sbjct: 181 VCADCNTTKTPLWRSGPKGPKSLCNACGIRQRKARRAMAAAAAAANGTSPPTYDTTAPLK 240 Query: 415 VHCSKKKSEKGHVAQHKKQCKSKTN------------NIGFEDFIVNLSNNLVLHRVFPQ 272 V K K + Q KK+CK T+ GFEDF+ NLS NL HRVFPQ Sbjct: 241 VKVQNKDKLKNN-GQFKKRCKLNTSAESSQNLHAVQKKSGFEDFLFNLSKNLAFHRVFPQ 299 Query: 271 DETEAAILLMALSCGLVNG 215 DE EAAILLMALSCGLV+G Sbjct: 300 DEKEAAILLMALSCGLVHG 318 >ref|XP_010942001.1| PREDICTED: putative GATA transcription factor 22 [Elaeis guineensis] Length = 291 Score = 150 bits (378), Expect = 3e-33 Identities = 97/245 (39%), Positives = 133/245 (54%), Gaps = 12/245 (4%) Frame = -1 Query: 916 FKQYQPSQEVVDNVSQGGSSDHQSVSPPSLSTLESGSEYGDQ---NKSHSTPSEKGSGSW 746 + Q Q ++ + V GSSD P + + ++ DQ N H GS W Sbjct: 60 YHQQQQQEKPNEFVLIDGSSDF-----PQPTNTDDNNDKMDQYVCNGYHEDEDGHGSVKW 114 Query: 745 MSSKMRLMKKMINSDGSNDALKPIRKRFWFQDPKLSLSGDNLASNNNI----IRTCSDCN 578 M SKMR M+KM+ S+ + + KP R+ +D + + SN+N IR CSDC+ Sbjct: 115 MPSKMRWMRKMVASEQTVRS-KPARRSM--EDLQEEKQHNQDMSNSNFPSGTIRVCSDCS 171 Query: 577 TTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDGILTTDTPSVASKVHCSKK 398 TT+TPLWRSGP+GPKSLCNACGI S G+ T+TP K +K Sbjct: 172 TTKTPLWRSGPQGPKSLCNACGIRQRKARRAMAAAATGS--GLRATNTPRKVQK----EK 225 Query: 397 KSEKGHVAQHKKQCKSKT-----NNIGFEDFIVNLSNNLVLHRVFPQDETEAAILLMALS 233 + + H +KK+CK T + +D +++LSNN HRVFPQDET+AAILLMALS Sbjct: 226 RRGRDHTIPNKKRCKIDTTRTAQRKLEIDDIMMSLSNNSAFHRVFPQDETDAAILLMALS 285 Query: 232 CGLVN 218 CGL++ Sbjct: 286 CGLIH 290 >ref|XP_002514107.1| hypothetical protein RCOM_1046780 [Ricinus communis] gi|223546563|gb|EEF48061.1| hypothetical protein RCOM_1046780 [Ricinus communis] Length = 312 Score = 149 bits (377), Expect = 3e-33 Identities = 104/269 (38%), Positives = 142/269 (52%), Gaps = 25/269 (9%) Frame = -1 
Query: 946 NAFQDQLKEQFKQYQP-SQEVVDNV--SQGGSSDHQSVSPPSLSTLESGSEYG-----DQ 791 N Q+++ K+ QP + VDN+ S G S DH+ + + E+G E D+ Sbjct: 49 NPPQEEVGYYHKELQPLHHQEVDNIYASHGRSWDHRIIKNEN----ENGQELSVCKKEDK 104 Query: 790 NKSHSTPSEKGSGSWMSSKMRLMKKMINSDGSNDALKPIRKRFWFQDPKLS--------L 635 + S + S WMSSKMRLM+KM+ +D + + + +D + S Sbjct: 105 STSIEDQRDNSSVKWMSSKMRLMRKMMTTDQTVNTTQHTSSMHKLEDKEKSRSLPLQDDY 164 Query: 634 SGDNLASN-NNIIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXAST 458 S NL+ N NN IR CSDCNTT+TPLWRSGP+GPKSLCNACGI ++ Sbjct: 165 SSKNLSDNSNNTIRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRALAAAQASAN 224 Query: 457 DGILTTDTPSV-ASKVHCSKKKSEKGHVAQHKKQCKSKTNNIG------FEDFIVN-LSN 302 I DT ++ +KV +K++ H+ KK+CK + G FED LS Sbjct: 225 GTIFAPDTAAMKTNKVQNKEKRTNNSHL-PFKKRCKFTAQSRGSRKKLCFEDLSSTILSK 283 Query: 301 NLVLHRVFPQDETEAAILLMALSCGLVNG 215 N ++FPQDE EAAILLMALS GLV+G Sbjct: 284 NSAFQQLFPQDEKEAAILLMALSYGLVHG 312 >gb|KDO57760.1| hypothetical protein CISIN_1g021859mg [Citrus sinensis] Length = 306 Score = 149 bits (376), Expect = 4e-33 Identities = 96/217 (44%), Positives = 118/217 (54%), Gaps = 12/217 (5%) Frame = -1 Query: 829 LSTLESGSEYGDQNKSHSTPSEKGSGSWMSSKMRLMKKMINSDGSNDALKPIRKRFWFQD 650 LS E DQN+S ++ S K WMSSKMRLMKKM+ S A++ + Q Sbjct: 95 LSMSSEKEERNDQNQSENSSSVK----WMSSKMRLMKKMMYSSPDAAAMQKLEDH-QKQP 149 Query: 649 PKLSLSGDNLASNNNI--IRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXX 476 P SL DN +NNN IR C+DCNTT+TPLWRSGP+GPKSLCNACGI Sbjct: 150 PSSSLEPDNGNNNNNTNTIRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAA 209 Query: 475 XXXASTDGILTTDTPSVASKVHCSKKKSEKGHVAQHKKQCKSKTNN--------IGFED- 323 T L D S K + + S KK+CK +N+ FED Sbjct: 210 AAANGTAVQLAADDTSSNKKKSKTPRPSNNNSCLPFKKRCKYNSNSPSRGKKKLCSFEDL 269 Query: 322 -FIVNLSNNLVLHRVFPQDETEAAILLMALSCGLVNG 215 I++ +N+ L RVFPQ+E EAAILLMALS GLV+G Sbjct: 270 TLILSKNNSSALQRVFPQEEKEAAILLMALSYGLVHG 306 >ref|NP_001280882.1| transcription factor GATA-5 [Malus domestica] gi|302398801|gb|ADL36695.1| GATA domain class transcription factor [Malus domestica] Length = 359 Score = 149 bits (376), 
Expect = 4e-33 Identities = 107/286 (37%), Positives = 139/286 (48%), Gaps = 54/286 (18%) Frame = -1 Query: 910 QYQPSQEVVDNVSQGGSSDHQ---------SVSPPSLSTLESGSEYGDQNKSHSTPSEKG 758 Q+Q + + V GGS DH S + LS ++G+ G+ N + Sbjct: 75 QFQLLEADHNIVPHGGSHDHDHQAIENEGGSGTVLKLSISKNGA-VGNGNPGTDHETSTS 133 Query: 757 SGSWMSSKMRLMKKMINSDGSNDAL-----KPIRKRFW--------FQDPKLSLSGDNLA 617 S WMSSKMR+M+KM N D ++ + KPI + Q P L D ++ Sbjct: 134 SVKWMSSKMRMMRKMSNPDQTSSSSTSSDDKPISMKLSSHKFEEQKLQHPSSQLGADMIS 193 Query: 616 SNNN---------IIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXA 464 +NN IIR CSDCNTT+TPLWRSGP+GPKSLCNACGI A Sbjct: 194 CSNNSSNNMNNVPIIRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAA 253 Query: 463 STDGILTTDTPSV-ASKVHCSKKKSEKGHVAQHKKQ----------CKSKTNNIGFEDFI 317 ++ LT PS+ +SKV KS KK+ + K+ + FEDF Sbjct: 254 ASGTTLTVAAPSMKSSKVQPKANKSRVSSTVPFKKRPYNKLSSSPSSRGKSKKLCFEDFT 313 Query: 316 VNLSNN------------LVLHRVFPQDETEAAILLMALSCGLVNG 215 +++ NN L RVFPQDE EAAILLMALSCGLV+G Sbjct: 314 ISMKNNSSSGNPTAATTTTALQRVFPQDEKEAAILLMALSCGLVHG 359 >ref|XP_013458498.1| GATA type zinc finger transcription factor family protein [Medicago truncatula] gi|657391198|gb|KEH32529.1| GATA type zinc finger transcription factor family protein [Medicago truncatula] Length = 327 Score = 148 bits (374), Expect = 8e-33 Identities = 95/214 (44%), Positives = 120/214 (56%), Gaps = 20/214 (9%) Frame = -1 Query: 796 DQNKSHSTPSEKGSGSWMSSKMRLMKKMINSDGSNDA-LKPIRKRFWFQDPKLSLSG--- 629 + N + + S WMSSKMR+MKKM+ SD + + L K+ F+D K LS Sbjct: 115 EMNNNQEADQDGTSVKWMSSKMRIMKKMMVSDQTGSSNLTSNSKQIKFEDQKQPLSPQGT 174 Query: 628 DNLASNN-NIIRTCSDCNTTRTPLWRSGPKGPKSLCNACGIXXXXXXXXXXXXXXASTDG 452 DN +SNN + IR CSDCNTT+TPLWRSGP+GPKSLCNACGI ++ Sbjct: 175 DNSSSNNYSTIRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRALAAAAASANGT 234 Query: 451 ILTTDTPSVASKVHCSKKKSEKGHV----------AQHKKQCK-----SKTNNIGFEDFI 317 + T SV K KKK K + +HK + K S+ I FED Sbjct: 235 TIADQTASVKRK-KLQKKKENKSKIEFDCSTVHMKKKHKLEAKPPSHQSRKEFITFEDLK 293 Query: 316 VNLSNNLVLHRVFPQDETEAAILLMALSCGLVNG 215 ++LS NL 
+ +VFPQDE EAAILLMALS GLV+G Sbjct: 294 LSLSENLGVQQVFPQDEREAAILLMALSYGLVHG 327 >gb|KOM29610.1| hypothetical protein LR48_Vigan728s003300 [Vigna angularis] Length = 306 Score = 148 bits (373), Expect = 1e-32 Identities = 96/236 (40%), Positives = 129/236 (54%), Gaps = 19/236 (8%) Frame = -1 Query: 865 GSSDHQSVSPPSLSTLESGSEYGDQNKSHSTPSEKGSGSWMSSKMRLMKKMINSDGSNDA 686 GS DH SV+ L + ++++ H +E GS MSSKMR+M+KM+ SD + Sbjct: 77 GSWDH-SVAQSELKVTVCKQK--ERSEDHEAAAEDGSVKLMSSKMRMMQKMMGSDQTGAY 133 Query: 685 LKP--IRKRFWFQDPKLSLSG---DNLASNN------NIIRTCSDCNTTRTPLWRSGPKG 539 ++ + K F+D K LS DN +SNN N +R C+DC+TT+TPLWRSGP+G Sbjct: 134 IEDSTVNK---FEDEKQPLSPLGTDNSSSNNCSNHSNNTVRVCADCHTTKTPLWRSGPRG 190 Query: 538 PKSLCNACGIXXXXXXXXXXXXXXASTDGILTTDTPSVASKVHCSKKKSEKGHVAQHKKQ 359 PKSLCNACGI + I T+ +K+ +KK+ Q KK+ Sbjct: 191 PKSLCNACGIRQRKARRAMAAAASGNGTVIFETEKSVKGNKLQKKEKKARTQGAPQMKKK 250 Query: 358 CK--------SKTNNIGFEDFIVNLSNNLVLHRVFPQDETEAAILLMALSCGLVNG 215 K N GFED + L +L +H+VFPQDE EAAILLMALS GLV+G Sbjct: 251 RKHGVGAKPSQSRNKFGFEDLTLRLRKSLAMHQVFPQDEKEAAILLMALSYGLVHG 306