BLASTX nr result
ID: Rehmannia24_contig00001982
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia24_contig00001982
         (1288 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002282173.1| PREDICTED: uncharacterized protein LOC100261...   204   8e-50
gb|EOY30464.1| GATA type zinc finger transcription factor family...   184   5e-44
ref|XP_006346565.1| PREDICTED: GATA transcription factor 21-like...   178   5e-42
ref|XP_004243958.1| PREDICTED: putative GATA transcription facto...   177   8e-42
ref|XP_002514107.1| hypothetical protein RCOM_1046780 [Ricinus c...   171   8e-40
gb|EXB38836.1| Putative GATA transcription factor 22 [Morus nota...   168   5e-39
ref|XP_006353530.1| PREDICTED: putative GATA transcription facto...   162   2e-37
ref|XP_006451458.1| hypothetical protein CICLE_v10009004mg [Citr...   159   2e-36
ref|XP_002279283.1| PREDICTED: putative GATA transcription facto...   158   5e-36
ref|XP_006600457.1| PREDICTED: GATA transcription factor 21-like...   156   2e-35
ref|XP_003550634.1| PREDICTED: GATA transcription factor 21-like...   156   2e-35
ref|XP_003543725.1| PREDICTED: GATA transcription factor 21-like...   153   1e-34
gb|EOY29900.1| GATA type zinc finger transcription factor family...   152   3e-34
gb|EMJ04350.1| hypothetical protein PRUPE_ppa024374mg [Prunus pe...   150   1e-33
gb|ADL36695.1| GATA domain class transcription factor [Malus dom...   149   2e-33
gb|ESW26655.1| hypothetical protein PHAVU_003G137100g [Phaseolus...   148   4e-33
gb|ESW10726.1| hypothetical protein PHAVU_009G232700g [Phaseolus...   146   2e-32
ref|NP_194345.1| putative GATA transcription factor 22 [Arabidop...   145   5e-32
emb|CAN63090.1| hypothetical protein VITISV_032017 [Vitis vinifera]   144   6e-32
ref|XP_003546455.1| PREDICTED: putative GATA transcription facto...   144   8e-32

>ref|XP_002282173.1| PREDICTED: uncharacterized protein LOC100261004 [Vitis vinifera] gi|297738668|emb|CBI27913.3| unnamed protein product [Vitis vinifera] Length = 309 Score = 204 bits (518), Expect = 8e-50 Identities = 139/308 (45%), Positives = 163/308 (52%), Gaps = 22/308 (7%) Frame = -3 Query: 1103 NNDQHHQP-FGP-------------CHIFFNSTQDHMMESYNYDHHPHLYQPRQISDDNL 966 N DQHHQ F P C IFF+ T++ Y H QP+Q + D Sbjct: 19 NEDQHHQLLFSPKPQPSSSSSSSLTCPIFFSPTKEQGGCHYRDLHQA---QPQQEAHDKF 75 Query: 965 GYHGGS-TYEIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKT 789 + GGS + +GLKLT+WK ED + E N VKWMSSKMR+MQKM Sbjct: 76 VFRGGSYDHPTLESESDNGLKLTIWKTEDRNENHSE-----NGSVKWMSSKMRVMQKMMI 130 Query: 788 PDRV-ALKITSTATT---KLEQPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTPLWRS 621 D+ A K ++TA +Q + IRVC+DCNTTKTPLWRS Sbjct: 131 SDQTGAQKPSNTALNFGDHKQQSLPSETDYNSINSSNINSNNTIRVCADCNTTKTPLWRS 190 Query: 620 GPKGPKSLCNACGIRQRKXXXXXXXXXXXANGT--ADQPPAMKIKVQHKLEKTGKNGHAS 447 GP+GPKSLCNACGIRQRK ANGT K K +HK +K NGH S Sbjct: 191 GPRGPKSLCNACGIRQRKARRAMAAAAATANGTILPTNTAPTKTKAKHK-DKKSSNGHVS 249 Query: 446 HFKKRCKTATNXXXXXXXXXXXXPKKLGFEDFLINLSKNLAFGRVFPEDE-KDAAILLMA 270 H+KKRCK A KKL FEDF I+LSKN AF RVF +DE K+AAILLMA Sbjct: 250 HYKKRCKLAA--------APSCETKKLCFEDFTISLSKNSAFHRVFLQDEIKEAAILLMA 301 Query: 269 LSSGLVHG 246 LS GLVHG Sbjct: 302 LSCGLVHG 309 >gb|EOY30464.1| GATA type zinc finger transcription factor family protein, putative [Theobroma cacao] Length = 302 Score = 184 bits (468), Expect = 5e-44 Identities = 125/306 (40%), Positives = 160/306 (52%), Gaps = 19/306 (6%) Frame = -3 Query: 1106 DNNDQHHQPFG-------------PCHIFFNSTQDHMMESYNYDHHPHLYQPRQISDDNL 966 D+ Q HQ F C I FN + + H H Q +D Sbjct: 20 DDQHQQHQLFSLKPQPPSLSSSSLTCPILFNPVVQEQAGGHQREPHQHF----QYQEDQA 75 Query: 965 GYHGGSTYEIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTP 786 + +++ SGL L+L KKE+ +EH +++ 
KWMSSKMR+M+KM + Sbjct: 76 KIYVPQDEPLES---DSGLNLSLRKKEE----GNEHHQIEDSSAKWMSSKMRMMRKMMSS 128 Query: 785 DRVALKITSTATTKLEQPXXXXXXXXXXXXXXXXXXSP---IRVCSDCNTTKTPLWRSGP 615 DR L ++++T KLE+P + IRVC+DCNTTKTPLWRSGP Sbjct: 129 DRADL--SNSSTPKLEEPKQQPSSSPDNSSNSSYNNNDNITIRVCADCNTTKTPLWRSGP 186 Query: 614 KGPKSLCNACGIRQRKXXXXXXXXXXXANG---TADQPPAMKIKVQHKLEKTGKNGHASH 444 +GPKSLCNACGIRQRK ANG A P MK KVQ K +++ +G + Sbjct: 187 RGPKSLCNACGIRQRK-ARRAMAAAAAANGAIVAAQTTPTMKSKVQDKSKRSSNSGCVAQ 245 Query: 443 FKKRCKTATNXXXXXXXXXXXXPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALS 264 KK+CK ++ KKL FED I LSKN AF RVFP+DEK+AAILLMALS Sbjct: 246 LKKKCKHSSQ---------SQGRKKLCFEDLRIILSKNSAFHRVFPQDEKEAAILLMALS 296 Query: 263 SGLVHG 246 GLVHG Sbjct: 297 YGLVHG 302 >ref|XP_006346565.1| PREDICTED: GATA transcription factor 21-like [Solanum tuberosum] Length = 222 Score = 178 bits (451), Expect = 5e-42 Identities = 120/252 (47%), Positives = 145/252 (57%), Gaps = 5/252 (1%) Frame = -3 Query: 986 QISDDNLGYHGGSTYEI--KNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKM 813 Q+ DN GGS+Y++ KNK SGLKL+LWK+ED + E +K + + Sbjct: 6 QLEVDN---DGGSSYDLGKKNK-GGSGLKLSLWKREDKLVMSSE--------IKDLDQER 53 Query: 812 RLMQKMKTPDRVALKITSTATTKLEQPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTP 633 + + + D + LK+ + +QP PIRVC+DCNTTKTP Sbjct: 54 K--KNITNNDCIKLKLGD----QKQQPIQTDYSSNNI---------PIRVCTDCNTTKTP 98 Query: 632 LWRSGPKGPKSLCNACGIRQRKXXXXXXXXXXXANGTADQPPAMKIKV-QHK--LEKTGK 462 LWRSGPKGPKSLCNACGIRQRK ANG D AMKIKV QHK + K Sbjct: 99 LWRSGPKGPKSLCNACGIRQRK---ARRAMAAAANGKTDHQTAMKIKVQQHKPNITKVRT 155 Query: 461 NGHASHFKKRCKTATNXXXXXXXXXXXXPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAI 282 N H + FKKRCK + PKKLGFED LINLS LAF ++FP+DEK+AAI Sbjct: 156 NNHVTPFKKRCKLGPS-----SSGTNNAPKKLGFEDLLINLSNQLAFQQIFPQDEKEAAI 210 Query: 281 LLMALSSGLVHG 246 LLMALSSGLVHG Sbjct: 211 LLMALSSGLVHG 222 >ref|XP_004243958.1| PREDICTED: putative GATA transcription factor 22-like [Solanum lycopersicum] Length = 266 Score = 177 bits (449), Expect = 8e-42 Identities = 128/293 (43%), Positives = 159/293 (54%), Gaps = 
5/293 (1%) Frame = -3 Query: 1109 NDNNDQHHQPFGPCHIFFNSTQDHMMESYNYDHHPHLYQPRQISDDNLGYHGGSTYEI-- 936 N NN+ P H FFNST + S+++ H + Q Q+ DN GGS+Y++ Sbjct: 17 NSNNNSLVTP--NYHFFFNSTTNQTA-SFHHQHTQYYMQHEQLEVDN---DGGSSYDLGK 70 Query: 935 KNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVALKITST 756 KN+V SGLKL+LWK+ED K +SS+++ + + K + T++ Sbjct: 71 KNEVG-SGLKLSLWKRED----------------KLLSSEIKKLDQEKKKNS-----TNS 108 Query: 755 ATTKLEQPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIR 576 A KL+ PIRVC+DCNTTKTPLWRSGPKGPKSLCNACGIR Sbjct: 109 ACIKLK---LGDQKQKPIQTDYCSNNIPIRVCTDCNTTKTPLWRSGPKGPKSLCNACGIR 165 Query: 575 QRKXXXXXXXXXXXANGTADQPPAMKIKVQHKLEKTGK---NGHASHFKKRCKTATNXXX 405 QRK A G DQ K++ QHK T K N KKRCK + Sbjct: 166 QRK--ARRAMAAAAAEGKTDQ----KVQ-QHKQNITTKVTSNNDVKPLKKRCKFGPS--- 215 Query: 404 XXXXXXXXXPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVHG 246 PKKLGFEDFLINLS LAF ++FP+DE +AAILLMALSSGLVHG Sbjct: 216 --SSSTNNAPKKLGFEDFLINLSNKLAFQQIFPQDEMEAAILLMALSSGLVHG 266 >ref|XP_002514107.1| hypothetical protein RCOM_1046780 [Ricinus communis] gi|223546563|gb|EEF48061.1| hypothetical protein RCOM_1046780 [Ricinus communis] Length = 312 Score = 171 bits (432), Expect = 8e-40 Identities = 132/312 (42%), Positives = 154/312 (49%), Gaps = 26/312 (8%) Frame = -3 Query: 1103 NNDQHHQPFGPCH----------------IFFNSTQDHMMESYNYDHHPHLYQPRQISDD 972 N DQHH C IF N Q E Y H Q D+ Sbjct: 17 NEDQHHHQLIFCSKTTTEDASSSSSISYPIFINPPQ----EEVGYYHKELQPLHHQEVDN 72 Query: 971 NLGYHGGS-TYEIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKM 795 HG S + I +++G +L++ KKED ++ + N+ VKWMSSKMRLM+KM Sbjct: 73 IYASHGRSWDHRIIKNENENGQELSVCKKEDKSTSIEDQ--RDNSSVKWMSSKMRLMRKM 130 Query: 794 KTPDRVALKITSTATT-KLEQPXXXXXXXXXXXXXXXXXXS----PIRVCSDCNTTKTPL 630 T D+ T++ KLE IRVCSDCNTTKTPL Sbjct: 131 MTTDQTVNTTQHTSSMHKLEDKEKSRSLPLQDDYSSKNLSDNSNNTIRVCSDCNTTKTPL 190 Query: 629 WRSGPKGPKSLCNACGIRQRKXXXXXXXXXXXANGT--ADQPPAMKI-KVQHKLEKTGKN 459 WRSGP+GPKSLCNACGIRQRK ANGT A AMK KVQ+K EK N Sbjct: 191 
WRSGPRGPKSLCNACGIRQRKARRALAAAQASANGTIFAPDTAAMKTNKVQNK-EKRTNN 249 Query: 458 GHASHFKKRCKTATNXXXXXXXXXXXXPKKLGFEDFLIN-LSKNLAFGRVFPEDEKDAAI 282 H FKKRCK KKL FED LSKN AF ++FP+DEK+AAI Sbjct: 250 SHLP-FKKRCKFTAQ--------SRGSRKKLCFEDLSSTILSKNSAFQQLFPQDEKEAAI 300 Query: 281 LLMALSSGLVHG 246 LLMALS GLVHG Sbjct: 301 LLMALSYGLVHG 312 >gb|EXB38836.1| Putative GATA transcription factor 22 [Morus notabilis] Length = 335 Score = 168 bits (425), Expect = 5e-39 Identities = 121/279 (43%), Positives = 144/279 (51%), Gaps = 22/279 (7%) Frame = -3 Query: 1016 DHHPHLYQPRQISDDNLGYHGGSTYEIKNKVDQSGLKLTLWK---KEDHMGHDDEHIPQK 846 DHH L SD H E ++ Q+ LKL++WK ++ + HD Sbjct: 70 DHHHKLVSSGGSSD----IHPPRVAESESDHHQNDLKLSIWKSSTEDSNYDHDKSSHVSD 125 Query: 845 NNP---VKWMSSKMRLMQKM-KTPDRVALKITSTA--TTKLEQ-------PXXXXXXXXX 705 NN KWM SKMR+M+KM PD+ + + T K +Q Sbjct: 126 NNAGYSAKWMPSKMRMMRKMIVNPDQTNIDHHTPLNFTHKFDQVMKRKHPASPLGTDHSS 185 Query: 704 XXXXXXXXXSPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRKXXXXXXXXXXXANG 525 + IRVC+DCNTTKTPLWRSGP+GPKSLCNACGIRQRK ANG Sbjct: 186 TSSSNNNNNNTIRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAAANG 245 Query: 524 T--ADQPPAMK--IKVQHKLEKTGKNGH--ASHFKKRCKTATNXXXXXXXXXXXXPKKLG 363 T A MK KVQ K EK KNG+ FKKRCK + KK+ Sbjct: 246 TILATDATTMKSSTKVQRK-EKKPKNGNGVVPQFKKRCKLTAS--------PSRGRKKIC 296 Query: 362 FEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVHG 246 FED I++SKN AF RVFP+DEKDAAILLMALS GLVHG Sbjct: 297 FEDLAISISKNSAFQRVFPQDEKDAAILLMALSYGLVHG 335 >ref|XP_006353530.1| PREDICTED: putative GATA transcription factor 22-like [Solanum tuberosum] Length = 323 Score = 162 bits (411), Expect = 2e-37 Identities = 119/306 (38%), Positives = 148/306 (48%), Gaps = 31/306 (10%) Frame = -3 Query: 1070 CHIFFN-STQDHMMESYNYDHHPH-LYQPR-QISDDNLGYHGGSTYEIKNKVDQSGLKLT 900 C FFN ST ++ + YD+H H +QP+ Q DN +++ K ++ GLKLT Sbjct: 58 CQTFFNISTTTNIQDQSGYDYHSHQFHQPQHQHEVDNFASRSSGSHDHLEKKNK-GLKLT 116 Query: 899 LWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVALKITSTATTKLEQPXXXX 720 L KK + QK +K K ++++ + + S++ + Sbjct: 117 
LCKKGE----------QKMKNLKLEDQKQQIIETDYSSN-------SSSNNNI------- 152 Query: 719 XXXXXXXXXXXXXXSPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRKXXXXXXXXX 540 PIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRK Sbjct: 153 --------------IPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRKARRAAAAAA 198 Query: 539 XXANG-----TADQPPAMKIKVQ---HKLEKTGKNGHASHFKKRCKTATNXXXXXXXXXX 384 N + + MKIKVQ HK+ K N H FKKRCK +N Sbjct: 199 AATNNGTNFTSTETTTTMKIKVQQQKHKITKVNTN-HVVPFKKRCKFLSNTTTTPAPVPA 257 Query: 383 XXP--------------------KKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALS 264 P K L FEDF +NLS NLA RVFP+DEK+AAILLMALS Sbjct: 258 PAPRVGSSSSSSSYNNNNDVQQKKNLCFEDFFVNLSNNLAIHRVFPQDEKEAAILLMALS 317 Query: 263 SGLVHG 246 SGLVHG Sbjct: 318 SGLVHG 323 >ref|XP_006451458.1| hypothetical protein CICLE_v10009004mg [Citrus clementina] gi|568843031|ref|XP_006475428.1| PREDICTED: putative GATA transcription factor 22-like [Citrus sinensis] gi|557554684|gb|ESR64698.1| hypothetical protein CICLE_v10009004mg [Citrus clementina] Length = 306 Score = 159 bits (403), Expect = 2e-36 Identities = 120/291 (41%), Positives = 147/291 (50%), Gaps = 16/291 (5%) Frame = -3 Query: 1070 CHIFFNSTQDHMMESYNYD---HHPH----LYQPRQISDDNLGYHGGSTYEIKNKVDQSG 912 CH FF Q Y HP LY S D H G ++ + +G Sbjct: 37 CHNFFEPVQREGGFYYRESVLLRHPKEVRILYSQAAGSCD----HPGPAVMDESGSESTG 92 Query: 911 LKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKM--KTPDRVALKITSTATTKLE 738 LKL++ +++ +D++ + ++ VKWMSSKMRLM+KM +PD A++ KLE Sbjct: 93 LKLSMSSEKEE--RNDQNQSENSSSVKWMSSKMRLMKKMMYSSPDAAAMQ-------KLE 143 Query: 737 --QPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRKX 564 Q + IRVC+DCNTTKTPLWRSGP+GPKSLCNACGIRQRK Sbjct: 144 DHQKQPPSSSLEPDNGNNNNNTNTIRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRK- 202 Query: 563 XXXXXXXXXXANGTADQPPAMKIKVQHKLEKT---GKNGHASHFKKRCKTATNXXXXXXX 393 ANGTA Q A K KT N FKKRCK +N Sbjct: 203 -ARRAMAAAAANGTAVQLAADDTSSNKKKSKTPRPSNNNSCLPFKKRCKYNSN------S 255 Query: 392 XXXXXPKKLGFEDFLINLSKN--LAFGRVFPEDEKDAAILLMALSSGLVHG 246 K FED +NLSKN A RVFP++EK+AAILLMALS GLVHG Sbjct: 256 
PSRGKKKLCSFEDLTLNLSKNNSSALQRVFPQEEKEAAILLMALSYGLVHG 306 >ref|XP_002279283.1| PREDICTED: putative GATA transcription factor 22 [Vitis vinifera] gi|296081660|emb|CBI20665.3| unnamed protein product [Vitis vinifera] Length = 306 Score = 158 bits (399), Expect = 5e-36 Identities = 117/288 (40%), Positives = 141/288 (48%), Gaps = 13/288 (4%) Frame = -3 Query: 1073 PCHIFFNSTQDHMMESYNYDHHPHLYQPRQISDDNLGYHGG----------STYEIKNKV 924 PC FFNS+ +S DH P Q + DD HGG S + Sbjct: 44 PCPSFFNSST----QSQRGDHSPRDPQQHEDKDDKYISHGGCGESQVFSSSSLLQPMADD 99 Query: 923 DQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVALKITSTATTK 744 ++S KL+++KKE+ DE + KWMSSKMRLM+KM D KI Sbjct: 100 NKSSHKLSVFKKEE----GDEG---NKSTEKWMSSKMRLMRKMMNSDCTTAKIEQKV--- 149 Query: 743 LEQPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRK- 567 + PIRVCSDCNTTKTPLWRSGP+GPKSLCNACGIRQRK Sbjct: 150 --EDHQQWDNINEFNSSNNTSNIPIRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKA 207 Query: 566 XXXXXXXXXXXANGTA--DQPPAMKIKVQHKLEKTGKNGHASHFKKRCKTATNXXXXXXX 393 ANGTA + MK+K+ +K EK + KK CK Sbjct: 208 RRAMAAAAAAAANGTAVGTEISPMKMKLPNK-EKKMHTSNVGQQKKLCK---------PP 257 Query: 392 XXXXXPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVH 249 KKL FEDF ++ KN F RVFP DE++AAILLMALS LV+ Sbjct: 258 CPPPTEKKLCFEDFTSSICKNSGFRRVFPRDEEEAAILLMALSCDLVY 305 >ref|XP_006600457.1| PREDICTED: GATA transcription factor 21-like isoform X2 [Glycine max] Length = 310 Score = 156 bits (395), Expect = 2e-35 Identities = 108/312 (34%), Positives = 145/312 (46%), Gaps = 27/312 (8%) Frame = -3 Query: 1103 NNDQHHQPFGPCH-------------IFFNS-TQDHMMESYNYDHHPHLYQPRQISDDNL 966 N DQ+H+ F P H I FN QD SY ++ + + + Sbjct: 6 NEDQNHEFFSPTHHPSSSFSSLSSYPILFNPPNQDQEARSYYWEPTKQYLPSHEEETEKI 65 Query: 965 GYHGGSTYEIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTP 786 GS + + + K T+WKK + + E + ++ +KWM +KMR+M+KM Sbjct: 66 IPSSGSWDHSVAESEHN--KATVWKKAEERNENLESVAAEDGSLKWMPAKMRIMRKMLVS 123 Query: 785 DRVALKITSTATT-------KLEQPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTPLW 627 D+ S T K + + +RVCSDC+TTKTPLW Sbjct: 124 
DQTDTYTNSDNNTTHKFDDQKQQLSSPLGTDNSSSNNYSNHSNNTVRVCSDCHTTKTPLW 183 Query: 626 RSGPKGPKSLCNACGIRQRKXXXXXXXXXXXANGT------ADQPPAMKIKVQHKLEKTG 465 RSGP+GPKSLCNACGIRQRK A+G A + + K+Q K EK Sbjct: 184 RSGPRGPKSLCNACGIRQRKARRAMAAAAASASGNGTVIVEAKKSVKGRNKLQKKKEKKT 243 Query: 464 KNGHASHFKKRCKTATNXXXXXXXXXXXXPKKLGFEDFLINLSKNLAFGRVFPEDEKDAA 285 + A+ KK+ K K GFED + L KNLA +VFP+DEK+AA Sbjct: 244 RTEGAAQMKKKRKLGVG-----SAKASQSRNKFGFEDLTLRLRKNLAMHQVFPQDEKEAA 298 Query: 284 ILLMALSSGLVH 249 ILLMALS GLVH Sbjct: 299 ILLMALSYGLVH 310 >ref|XP_003550634.1| PREDICTED: GATA transcription factor 21-like isoform X1 [Glycine max] Length = 322 Score = 156 bits (395), Expect = 2e-35 Identities = 108/312 (34%), Positives = 145/312 (46%), Gaps = 27/312 (8%) Frame = -3 Query: 1103 NNDQHHQPFGPCH-------------IFFNS-TQDHMMESYNYDHHPHLYQPRQISDDNL 966 N DQ+H+ F P H I FN QD SY ++ + + + Sbjct: 18 NEDQNHEFFSPTHHPSSSFSSLSSYPILFNPPNQDQEARSYYWEPTKQYLPSHEEETEKI 77 Query: 965 GYHGGSTYEIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTP 786 GS + + + K T+WKK + + E + ++ +KWM +KMR+M+KM Sbjct: 78 IPSSGSWDHSVAESEHN--KATVWKKAEERNENLESVAAEDGSLKWMPAKMRIMRKMLVS 135 Query: 785 DRVALKITSTATT-------KLEQPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTPLW 627 D+ S T K + + +RVCSDC+TTKTPLW Sbjct: 136 DQTDTYTNSDNNTTHKFDDQKQQLSSPLGTDNSSSNNYSNHSNNTVRVCSDCHTTKTPLW 195 Query: 626 RSGPKGPKSLCNACGIRQRKXXXXXXXXXXXANGT------ADQPPAMKIKVQHKLEKTG 465 RSGP+GPKSLCNACGIRQRK A+G A + + K+Q K EK Sbjct: 196 RSGPRGPKSLCNACGIRQRKARRAMAAAAASASGNGTVIVEAKKSVKGRNKLQKKKEKKT 255 Query: 464 KNGHASHFKKRCKTATNXXXXXXXXXXXXPKKLGFEDFLINLSKNLAFGRVFPEDEKDAA 285 + A+ KK+ K K GFED + L KNLA +VFP+DEK+AA Sbjct: 256 RTEGAAQMKKKRKLGVG-----SAKASQSRNKFGFEDLTLRLRKNLAMHQVFPQDEKEAA 310 Query: 284 ILLMALSSGLVH 249 ILLMALS GLVH Sbjct: 311 ILLMALSYGLVH 322 >ref|XP_003543725.1| PREDICTED: GATA transcription factor 21-like [Glycine max] Length = 314 Score = 153 bits (387), Expect = 1e-34 Identities = 111/313 (35%), Positives = 150/313 (47%), Gaps = 28/313 (8%) Frame = -3 Query: 1103 
NNDQHHQPFGPCH--------------IFFNS-TQDHMMESYNYDHHPHLYQPRQISDDN 969 N DQ+H+ F P H I FN QD SY+++ HL + ++ Sbjct: 18 NEDQNHEFFSPIHHPSSSFSSLSSSYPILFNPPNQDQEARSYDWETTKHLPSHEEEAEKI 77 Query: 968 LGYHGGSTYEIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKT 789 + G + V++S K+T+W+KE+ +E++ + + VKWM SKMR+M+KM Sbjct: 78 IPTSGSWGHS----VEESEHKVTVWRKEER----NENLAEDGS-VKWMPSKMRIMRKMLV 128 Query: 788 PDRVALKITSTATT--------KLEQPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTP 633 ++ + TT +L P +RVCSDC+TTKTP Sbjct: 129 SNQTDAYTSDNNTTHKFDDHKQQLSSPLGIDDNSSNNYSDKSNNSI-VRVCSDCHTTKTP 187 Query: 632 LWRSGPKGPKSLCNACGIRQRKXXXXXXXXXXXANG-----TADQPPAMKIKVQHKLEKT 468 LWRSGP+GPKSLCNACGIRQRK A G + K+Q K EK Sbjct: 188 LWRSGPRGPKSLCNACGIRQRKARRAMAAAAAAALGDGAVIVEAEKSVKGKKLQKKKEKK 247 Query: 467 GKNGHASHFKKRCKTATNXXXXXXXXXXXXPKKLGFEDFLINLSKNLAFGRVFPEDEKDA 288 + A+ K + K K GFED + L KNLA +VFP+DEK+A Sbjct: 248 TRIEGAAQMKMKRKLGVG------AKASQSRNKFGFEDLTLRLRKNLAMHQVFPQDEKEA 301 Query: 287 AILLMALSSGLVH 249 AILLMALS GLVH Sbjct: 302 AILLMALSYGLVH 314 >gb|EOY29900.1| GATA type zinc finger transcription factor family protein, putative [Theobroma cacao] Length = 311 Score = 152 bits (384), Expect = 3e-34 Identities = 107/268 (39%), Positives = 133/268 (49%), Gaps = 7/268 (2%) Frame = -3 Query: 1031 ESYNYDHHPHLYQPRQISDDNLGYHGGSTYEIKNKVDQS---GLKLTLWKKEDHMGHDDE 861 ES +DH + + + S D S+ +++ VDQS G L+ +KED D E Sbjct: 60 ESKPHDHKGNQFMTHEGSIDQ---QASSSSSLQSAVDQSTANGYNLSFSRKEDG---DCE 113 Query: 860 HIPQKNNPVKWMSSKMRLMQKMKTPDRVALKITSTATTKLEQPXXXXXXXXXXXXXXXXX 681 + VKWMSSK+RLM+KM + K Q Sbjct: 114 SASGNGSSVKWMSSKVRLMKKMMNSNCSG---ADDKPPKFTQRFQYPVHDSDETNSFSKA 170 Query: 680 XSPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRK--XXXXXXXXXXXANGTADQPP 507 + +RVCSDCNTT TPLWRSGP+GPKSLCNACGIRQRK NG A Sbjct: 171 NNTVRVCSDCNTTTTPLWRSGPRGPKSLCNACGIRQRKARRAMEAAAAAAAENGAAAAAD 230 Query: 506 A--MKIKVQHKLEKTGKNGHASHFKKRCKTATNXXXXXXXXXXXXPKKLGFEDFLINLSK 333 A MKIKV EK + H + KK+ K KKL F++F ++LSK Sbjct: 231 ASSMKIKVHIHKEKKSRTSHVAQCKKQVK--------PPYYSPQSQKKLCFKEFALSLSK 282 
Query: 332 NLAFGRVFPEDEKDAAILLMALSSGLVH 249 N A RVFP+D +DAAILLM LS GLVH Sbjct: 283 NSALQRVFPQDVEDAAILLMELSCGLVH 310 >gb|EMJ04350.1| hypothetical protein PRUPE_ppa024374mg [Prunus persica] Length = 297 Score = 150 bits (378), Expect = 1e-33 Identities = 117/307 (38%), Positives = 152/307 (49%), Gaps = 34/307 (11%) Frame = -3 Query: 1064 IFFNSTQDHMMESYNYDHHPHLYQPRQISDDNLGYHGGST-YEIKNKVDQSG----LKLT 900 IF N +Q + + +Q + N+ +GGS Y+ + ++SG LKL+ Sbjct: 4 IFLNPSQAQAPSGHYREPQNFQFQLLEADHHNIVSYGGSCDYDPQTLENESGSGTILKLS 63 Query: 899 LWKKEDHMGHDDEHIPQKNNPV--KWMSSKMRLMQKMKTPDR------------VALKIT 762 + K E + NP KWMSSKMR+M+KM PD+ VA+K++ Sbjct: 64 ISKNE---------AGRNGNPSTDKWMSSKMRMMKKMTNPDQTSSSCTSSDDKPVAMKLS 114 Query: 761 STATTKLEQPXXXXXXXXXXXXXXXXXXSP--IRVCSDCNTTKTPLWRSGPKGPKSLCNA 588 + ++ ++P + IRVCSDCNTTKTPLWRSGP+GPKSLCNA Sbjct: 115 ISHKSEEQKPQHPDMISCSNKSSNIMNNNVPIIRVCSDCNTTKTPLWRSGPRGPKSLCNA 174 Query: 587 CGIRQRKXXXXXXXXXXXANGTA-DQPPAMK--IKVQHKLEKTGKNGHASHFKKRCKTAT 417 CGIRQRK A+GT P+MK K QHK K + FKKR Sbjct: 175 CGIRQRKARRAMAAAAAAASGTTLAAAPSMKSTSKAQHKDNKP-RGASTVPFKKR---PY 230 Query: 416 NXXXXXXXXXXXXPKKLGFEDFLINLSKN----------LAFGRVFPEDEKDAAILLMAL 267 N PKKL FEDF I++ N + RVFP+DEK+AAILLMAL Sbjct: 231 NKLSSTPPSKGRPPKKLCFEDFAISMDNNHSSSATTTTTTSLQRVFPQDEKEAAILLMAL 290 Query: 266 SSGLVHG 246 S GLVHG Sbjct: 291 SCGLVHG 297 >gb|ADL36695.1| GATA domain class transcription factor [Malus domestica] Length = 359 Score = 149 bits (377), Expect = 2e-33 Identities = 118/302 (39%), Positives = 147/302 (48%), Gaps = 45/302 (14%) Frame = -3 Query: 1016 DHH--PHLYQPRQI-SDDNLGYHGGSTYEIKNKVDQSG-----LKLTLWKK----EDHMG 873 DH+ PH +Q + + +D N+ HGGS ++ G LKL++ K + G Sbjct: 66 DHYREPHQFQFQLLEADHNIVPHGGSHDHDHQAIENEGGSGTVLKLSISKNGAVGNGNPG 125 Query: 872 HDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVALKITST--------------ATTKLEQ 735 D E + VKWMSSKMR+M+KM PD+ + TS+ KL+ Sbjct: 126 TDHE---TSTSSVKWMSSKMRMMRKMSNPDQTSSSSTSSDDKPISMKLSSHKFEEQKLQH 182 Query: 734 PXXXXXXXXXXXXXXXXXXSP----IRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRK 567 P 
IRVCSDCNTTKTPLWRSGP+GPKSLCNACGIRQRK Sbjct: 183 PSSQLGADMISCSNNSSNNMNNVPIIRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRK 242 Query: 566 XXXXXXXXXXXANGT--ADQPPAMK-IKVQHKLEKTGKNGHASHFKKRCKTATNXXXXXX 396 A+GT P+MK KVQ K K+ + FKKR + Sbjct: 243 ARRAMAAAAAAASGTTLTVAAPSMKSSKVQPKANKS-RVSSTVPFKKRPYNKLS----SS 297 Query: 395 XXXXXXPKKLGFEDFLINLSKNLAFG------------RVFPEDEKDAAILLMALSSGLV 252 KKL FEDF I++ N + G RVFP+DEK+AAILLMALS GLV Sbjct: 298 PSSRGKSKKLCFEDFTISMKNNSSSGNPTAATTTTALQRVFPQDEKEAAILLMALSCGLV 357 Query: 251 HG 246 HG Sbjct: 358 HG 359 >gb|ESW26655.1| hypothetical protein PHAVU_003G137100g [Phaseolus vulgaris] Length = 309 Score = 148 bits (374), Expect = 4e-33 Identities = 110/308 (35%), Positives = 143/308 (46%), Gaps = 22/308 (7%) Frame = -3 Query: 1103 NNDQHHQPFGPCH--------------IFFNSTQDHMMESYNYDHHPHLYQPRQISDDNL 966 N DQ+H+ F P H + FN + E+ ++ P + P + + Sbjct: 18 NEDQNHELFTPTHHAYPSFSSLSSSYPLLFNPPEQ---EAGSHYWEPTKHLPAYEQAEKI 74 Query: 965 GYHGGSTYEIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTP 786 GS + V +S LK+ +WK ++ +D ++ V MS KMR+M+K P Sbjct: 75 NPTRGSW---DHSVTESELKVAVWKNKERS--EDHEAAAEDGSVNLMSLKMRMMRKTMVP 129 Query: 785 DRVALKITSTATTKLE---QPXXXXXXXXXXXXXXXXXXS--PIRVCSDCNTTKTPLWRS 621 D+ I K E QP S +RVC+DC+TTKTPLWRS Sbjct: 130 DQTGAYIEDRTMHKFEDQKQPLSPLGTDNSSSSNNYSNHSNNTVRVCADCHTTKTPLWRS 189 Query: 620 GPKGPKSLCNACGIRQRKXXXXXXXXXXXANGTA---DQPPAMKIKVQHKLEKTGKNGHA 450 GP+GPKSLCNACGIRQRK NGT Q K+Q K +KT G Sbjct: 190 GPRGPKSLCNACGIRQRK-ARRAMAAAASGNGTVILETQKSVKGNKLQKKEKKTRTQGAP 248 Query: 449 SHFKKRCKTATNXXXXXXXXXXXXPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMA 270 KKR K GFED + L K+LA +VFP+DEK+AAILLMA Sbjct: 249 QMKKKR-------NHGVGAKPSQSRNKFGFEDLTLRLRKSLAMHQVFPQDEKEAAILLMA 301 Query: 269 LSSGLVHG 246 LS GLVHG Sbjct: 302 LSYGLVHG 309 >gb|ESW10726.1| hypothetical protein PHAVU_009G232700g [Phaseolus vulgaris] Length = 306 Score = 146 bits (369), Expect = 2e-32 Identities = 104/296 (35%), Positives = 140/296 (47%), Gaps = 8/296 (2%) Frame = -3 Query: 1109 
NDNNDQHHQPFG-PCHIFFNSTQDHMMESYNYDHHPHLYQPRQISDDNLGYHGGSTYEIK 933 N+ + H QP I FN QD Y H +Q + + + G + ++ Sbjct: 18 NEEDHTHKQPSSLSTSILFNPDQDQGGFCYWESKH---FQSDEEAQKIVPSSGSWDHPVE 74 Query: 932 NKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVALKITSTA 753 ++S LKL +WKKE+ G D+ K MSSKMR+++KM D I + Sbjct: 75 KIENRSDLKLRVWKKEE--GCDN----LKGEDSSTMSSKMRMVRKMIVSDETDSDIADIS 128 Query: 752 TTKL-------EQPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTPLWRSGPKGPKSLC 594 ++K + P+RVC DC+TTKTPLWRSGPKGPKSLC Sbjct: 129 SSKQIKYKKKNPELSPLVTDDSNCNSSSNQNSVPLRVCVDCHTTKTPLWRSGPKGPKSLC 188 Query: 593 NACGIRQRKXXXXXXXXXXXANGTADQPPAMKIKVQHKLEKTGKNGHASHFKKRCKTATN 414 NACGIRQRK ANG + ++K + K GK H+ K + + A Sbjct: 189 NACGIRQRKERRAIAAAATTANG------SNRLKAEKSEMKKGKKLHSKGKKSKTEGAPA 242 Query: 413 XXXXXXXXXXXXPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVHG 246 + FED + LS N A +VFP+DEK+AAILLMALS GL+HG Sbjct: 243 LLKKKRKPAKNRKRFRAFEDLTVRLSNNSAVQQVFPQDEKEAAILLMALSHGLLHG 298 >ref|NP_194345.1| putative GATA transcription factor 22 [Arabidopsis thaliana] gi|71660811|sp|Q9SZI6.1|GAT22_ARATH RecName: Full=Putative GATA transcription factor 22 gi|4538944|emb|CAB39680.1| putative transcription factor [Arabidopsis thaliana] gi|7269466|emb|CAB79470.1| putative transcription factor [Arabidopsis thaliana] gi|332659764|gb|AEE85164.1| putative GATA transcription factor 22 [Arabidopsis thaliana] Length = 352 Score = 145 bits (365), Expect = 5e-32 Identities = 104/295 (35%), Positives = 144/295 (48%), Gaps = 25/295 (8%) Frame = -3 Query: 1055 NSTQDHMMESYNYDHH-----PHLYQPRQISDDNLGYHGGSTYEIKNKVDQSGLKLTLWK 891 NS QD + YN + H+ QP + + + + G S+ + ++ LKLT+ K Sbjct: 66 NSRQDQVYVGYNNNTFHDVLDTHISQPLE-TKNFVSDGGSSSSDQMVPKKETRLKLTIKK 124 Query: 890 KEDHMGHDDEHIPQK-------NNPVKWMSSKMRLMQKMKTPDRVALKITSTATTKLEQP 732 K++H D +PQ N +KW+SSK+RLM+K K A+ TS ++ + Sbjct: 125 KDNHQDQTD--LPQSPIKDMTGTNSLKWISSKVRLMKKKK-----AIITTSDSSKQHTNN 177 Query: 731 XXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRKXXXXX 552 IR+CSDCNTTKTPLWRSGP+GPKSLCNACGIRQRK Sbjct: 178 
DQSSNLSNSERQNGYNNDCVIRICSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAA 237 Query: 551 XXXXXXANGTADQPPAMKIKVQHKLE-KTGKNGHASHFKKRCKT------------ATNX 411 + PP MK K+Q+K + G S + T A + Sbjct: 238 MATATATAVSGVSPPVMKKKMQNKNKISNGVYKILSPLPLKVNTCKRMITLEETALAEDL 297 Query: 410 XXXXXXXXXXXPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVHG 246 + F+D + LSK+ A+ +VFP+DEK+AAILLMALS G+VHG Sbjct: 298 ETQSNSTMLSSSDNIYFDDLALLLSKSSAYQQVFPQDEKEAAILLMALSHGMVHG 352 >emb|CAN63090.1| hypothetical protein VITISV_032017 [Vitis vinifera] Length = 211 Score = 144 bits (364), Expect = 6e-32 Identities = 101/228 (44%), Positives = 121/228 (53%), Gaps = 3/228 (1%) Frame = -3 Query: 923 DQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVALKITSTATTK 744 ++S KL+++KKE+ DE + KWMSSKMRLM+KM D KI Sbjct: 5 NKSSHKLSVFKKEE----GDEG---NKSTEKWMSSKMRLMRKMMNSDCTTAKIEQKV--- 54 Query: 743 LEQPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRK- 567 + PIRVCSDCNTTKTPLWRSGP+GPKSLCNACGIRQRK Sbjct: 55 --EDHQQWDNINEXNSSNNTSNIPIRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKA 112 Query: 566 XXXXXXXXXXXANGTA--DQPPAMKIKVQHKLEKTGKNGHASHFKKRCKTATNXXXXXXX 393 ANGTA + MK+K+ +K EK + KK CK Sbjct: 113 RRAMAAAAAAAANGTAVGTEISPMKMKLPNK-EKKMHTSNVGQQKKLCK---------PP 162 Query: 392 XXXXXPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVH 249 KKL FEDF ++ KN F RVFP DE++AAILLMALS LV+ Sbjct: 163 CPPPTEKKLCFEDFTSSICKNSGFRRVFPRDEEEAAILLMALSCDLVY 210 >ref|XP_003546455.1| PREDICTED: putative GATA transcription factor 22-like [Glycine max] Length = 315 Score = 144 bits (363), Expect = 8e-32 Identities = 90/235 (38%), Positives = 121/235 (51%), Gaps = 9/235 (3%) Frame = -3 Query: 923 DQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVALK-----ITS 759 ++S LKL +WKKED E+ ++N KWM KMR+M+++ D+ I++ Sbjct: 84 NKSDLKLRVWKKEDKC----ENFQGEDNSTKWMPLKMRMMRRLMVSDQTGSDDTEGMISN 139 Query: 758 TATTKLEQPXXXXXXXXXXXXXXXXXXS----PIRVCSDCNTTKTPLWRSGPKGPKSLCN 591 + K E+ + +RVCSDC+TTKTPLWRSGPKGPKSLCN Sbjct: 140 SQKIKYEEKNSPLSPLGTDDSNYNSSSNHSNITVRVCSDCHTTKTPLWRSGPKGPKSLCN 199 Query: 590 
ACGIRQRKXXXXXXXXXXXANGTADQPPAMKIKVQHKLEKTGKNGHASHFKKRCKTATNX 411 ACGIRQRK +NGT ++ + K G H+ K + + A Sbjct: 200 ACGIRQRK-VRRAIAAAATSNGT------NPVEAEKSQVKKGNTLHSKGMKSKTEGAQQM 252 Query: 410 XXXXXXXXXXXPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVHG 246 + FED + LSKN A +VFP+DEK+AAILLMALS GL+HG Sbjct: 253 KKNRKLGARYRKRFGAFEDLTVRLSKNFALQQVFPQDEKEAAILLMALSYGLLHG 307