BLASTX nr result
ID: Rehmannia23_contig00008923
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00008923
         (1381 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002282173.1| PREDICTED: uncharacterized protein LOC100261...   205   3e-50
gb|EOY30464.1| GATA type zinc finger transcription factor family...   187   7e-45
ref|XP_006346565.1| PREDICTED: GATA transcription factor 21-like...   182   2e-43
ref|XP_004243958.1| PREDICTED: putative GATA transcription facto...   182   2e-43
gb|EXB38836.1| Putative GATA transcription factor 22 [Morus nota...   176   2e-41
ref|XP_002514107.1| hypothetical protein RCOM_1046780 [Ricinus c...   170   1e-39
ref|XP_006451458.1| hypothetical protein CICLE_v10009004mg [Citr...   167   7e-39
ref|XP_006353530.1| PREDICTED: putative GATA transcription facto...   165   5e-38
ref|XP_002279283.1| PREDICTED: putative GATA transcription facto...   161   7e-37
ref|XP_006600457.1| PREDICTED: GATA transcription factor 21-like...   160   2e-36
ref|XP_003550634.1| PREDICTED: GATA transcription factor 21-like...   160   2e-36
gb|ESW26655.1| hypothetical protein PHAVU_003G137100g [Phaseolus...   155   5e-35
gb|EMJ04350.1| hypothetical protein PRUPE_ppa024374mg [Prunus pe...   155   5e-35
gb|EOY29900.1| GATA type zinc finger transcription factor family...   153   2e-34
ref|XP_003543725.1| PREDICTED: GATA transcription factor 21-like...   151   5e-34
gb|ADL36695.1| GATA domain class transcription factor [Malus dom...   149   2e-33
emb|CAN63090.1| hypothetical protein VITISV_032017 [Vitis vinifera]   147   1e-32
gb|ESW10726.1| hypothetical protein PHAVU_009G232700g [Phaseolus...   146   2e-32
ref|XP_004287558.1| PREDICTED: uncharacterized protein LOC101297...   145   4e-32
ref|XP_003546455.1| PREDICTED: putative GATA transcription facto...   142   3e-31

>ref|XP_002282173.1| PREDICTED: uncharacterized protein LOC100261004 [Vitis vinifera]
 gi|297738668|emb|CBI27913.3| unnamed protein product [Vitis vinifera]
          Length = 309

 Score = 205 bits (522), Expect = 3e-50
 Identities = 138/308 (44%), Positives = 162/308 (52%), Gaps = 22/308 (7%)
 Frame = -2

Query: 1140 NNDQHHQP-FGP-------------CHIFFNSTQDHMMESYNYDHHPQLYQPRQISDDNL 1003
            N DQHHQ  F P             C IFF+ T++     Y   H  Q   P+Q + D
Sbjct: 19   NEDQHHQLLFSPKPQPSSSSSSSLTCPIFFSPTKEQGGCHYRDLHQAQ---PQQEAHDKF 75

Query: 1002 GYHGGS-TYEIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKT 826
             + GGS  +        +GLKLT+WK ED   +  E     N  VKWMSSKMR+MQKM
Sbjct: 76   VFRGGSYDHPTLESESDNGLKLTIWKTEDRNENHSE-----NGSVKWMSSKMRVMQKMMI 130

Query: 825  PDRVALKITSAATTKL----EQPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTPLWRS 658
             D+   +  S          +Q                   + IRVC+DCNTTKTPLWRS
Sbjct: 131  SDQTGAQKPSNTALNFGDHKQQSLPSETDYNSINSSNINSNNTIRVCADCNTTKTPLWRS 190

Query: 657  GPKGPKSLCNACGIRQRKXXXXXXXXXXXANGT--ADQPPAMKIKVQHKLEKTGKNGHAS 484
            GP+GPKSLCNACGIRQRK           ANGT         K K +HK +K   NGH S
Sbjct: 191  GPRGPKSLCNACGIRQRKARRAMAAAAATANGTILPTNTAPTKTKAKHK-DKKSSNGHVS 249

Query: 483  HFKKRCKXXXXXXXXXAGSPSDGPKKLGFEDFLINLSKNLAFGRVFPEDE-KDAAILLMA 307
            H+KKRCK         A +PS   KKL FEDF I+LSKN AF RVF +DE K+AAILLMA
Sbjct: 250  HYKKRCK--------LAAAPSCETKKLCFEDFTISLSKNSAFHRVFLQDEIKEAAILLMA 301

Query: 306  LSSGLVHG 283
            LS GLVHG
Sbjct: 302  LSCGLVHG 309

>gb|EOY30464.1| GATA type zinc finger transcription factor family protein, putative [Theobroma cacao]
          Length = 302

 Score = 187 bits (476), Expect = 7e-45
 Identities = 128/306 (41%), Positives = 164/306 (53%), Gaps = 19/306 (6%)
 Frame = -2

Query: 1143 DNNDQHHQPFG-------------PCHIFFNSTQDHMMESYNYDHHPQLYQPRQISDDNL 1003
            D+  Q HQ F               C I FN     +++     H  + +Q  Q  +D
Sbjct: 20   DDQHQQHQLFSLKPQPPSLSSSSLTCPILFNP----VVQEQAGGHQREPHQHFQYQEDQA 75

Query: 1002 GYHGGSTYEIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTP 823
+EH +++ KWMSSKMR+M+KM + Sbjct: 76 KIYVPQDEPLES---DSGLNLSLRKKEE----GNEHHQIEDSSAKWMSSKMRMMRKMMSS 128 Query: 822 DRVALKITSAATTKLEQPXXXXXXXXXXXXXXXXXXSP---IRVCSDCNTTKTPLWRSGP 652 DR L ++++T KLE+P + IRVC+DCNTTKTPLWRSGP Sbjct: 129 DRADL--SNSSTPKLEEPKQQPSSSPDNSSNSSYNNNDNITIRVCADCNTTKTPLWRSGP 186 Query: 651 KGPKSLCNACGIRQRKXXXXXXXXXXXANG---TADQPPAMKIKVQHKLEKTGKNGHASH 481 +GPKSLCNACGIRQRK ANG A P MK KVQ K +++ +G + Sbjct: 187 RGPKSLCNACGIRQRK-ARRAMAAAAAANGAIVAAQTTPTMKSKVQDKSKRSSNSGCVAQ 245 Query: 480 FKKRCKXXXXXXXXXAGSPSDGPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALS 301 KK+CK S S G KKL FED I LSKN AF RVFP+DEK+AAILLMALS Sbjct: 246 LKKKCK---------HSSQSQGRKKLCFEDLRIILSKNSAFHRVFPQDEKEAAILLMALS 296 Query: 300 SGLVHG 283 GLVHG Sbjct: 297 YGLVHG 302 >ref|XP_006346565.1| PREDICTED: GATA transcription factor 21-like [Solanum tuberosum] Length = 222 Score = 182 bits (463), Expect = 2e-43 Identities = 120/252 (47%), Positives = 147/252 (58%), Gaps = 5/252 (1%) Frame = -2 Query: 1023 QISDDNLGYHGGSTYEI--KNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKM 850 Q+ DN GGS+Y++ KNK SGLKL+LWK+ED + E +K + + Sbjct: 6 QLEVDN---DGGSSYDLGKKNK-GGSGLKLSLWKREDKLVMSSE--------IKDLDQER 53 Query: 849 RLMQKMKTPDRVALKITSAATTKLEQPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTP 670 + + + D + LK+ + +QP PIRVC+DCNTTKTP Sbjct: 54 K--KNITNNDCIKLKLGD----QKQQPIQTDYSSNNI---------PIRVCTDCNTTKTP 98 Query: 669 LWRSGPKGPKSLCNACGIRQRKXXXXXXXXXXXANGTADQPPAMKIKV-QHK--LEKTGK 499 LWRSGPKGPKSLCNACGIRQRK ANG D AMKIKV QHK + K Sbjct: 99 LWRSGPKGPKSLCNACGIRQRK---ARRAMAAAANGKTDHQTAMKIKVQQHKPNITKVRT 155 Query: 498 NGHASHFKKRCKXXXXXXXXXAGSPSDGPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAI 319 N H + FKKRCK + ++ PKKLGFED LINLS LAF ++FP+DEK+AAI Sbjct: 156 NNHVTPFKKRCK-----LGPSSSGTNNAPKKLGFEDLLINLSNQLAFQQIFPQDEKEAAI 210 Query: 318 LLMALSSGLVHG 283 LLMALSSGLVHG Sbjct: 211 LLMALSSGLVHG 222 >ref|XP_004243958.1| PREDICTED: putative GATA transcription factor 22-like [Solanum lycopersicum] Length = 266 Score = 182 bits (463), Expect = 2e-43 Identities = 129/293 (44%), Positives = 161/293 
(54%), Gaps = 5/293 (1%) Frame = -2 Query: 1146 NDNNDQHHQPFGPCHIFFNSTQDHMMESYNYDHHPQLYQPRQISDDNLGYHGGSTYEI-- 973 N NN+ P H FFNST + S+++ H Q Q+ DN GGS+Y++ Sbjct: 17 NSNNNSLVTP--NYHFFFNSTTNQTA-SFHHQHTQYYMQHEQLEVDN---DGGSSYDLGK 70 Query: 972 KNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVALKITSA 793 KN+V SGLKL+LWK+ED K +SS+++ + + K + T++ Sbjct: 71 KNEVG-SGLKLSLWKRED----------------KLLSSEIKKLDQEKKKNS-----TNS 108 Query: 792 ATTKLEQPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIR 613 A KL+ PIRVC+DCNTTKTPLWRSGPKGPKSLCNACGIR Sbjct: 109 ACIKLK---LGDQKQKPIQTDYCSNNIPIRVCTDCNTTKTPLWRSGPKGPKSLCNACGIR 165 Query: 612 QRKXXXXXXXXXXXANGTADQPPAMKIKVQHKLEKTGK---NGHASHFKKRCKXXXXXXX 442 QRK A G DQ K++ QHK T K N KKRCK Sbjct: 166 QRK--ARRAMAAAAAEGKTDQ----KVQ-QHKQNITTKVTSNNDVKPLKKRCK-----FG 213 Query: 441 XXAGSPSDGPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVHG 283 + S ++ PKKLGFEDFLINLS LAF ++FP+DE +AAILLMALSSGLVHG Sbjct: 214 PSSSSTNNAPKKLGFEDFLINLSNKLAFQQIFPQDEMEAAILLMALSSGLVHG 266 >gb|EXB38836.1| Putative GATA transcription factor 22 [Morus notabilis] Length = 335 Score = 176 bits (447), Expect = 2e-41 Identities = 125/279 (44%), Positives = 148/279 (53%), Gaps = 22/279 (7%) Frame = -2 Query: 1053 DHHPQLYQPRQISDDNLGYHGGSTYEIKNKVDQSGLKLTLWK---KEDHMGHDDEHIPQK 883 DHH +L SD H E ++ Q+ LKL++WK ++ + HD Sbjct: 70 DHHHKLVSSGGSSD----IHPPRVAESESDHHQNDLKLSIWKSSTEDSNYDHDKSSHVSD 125 Query: 882 NNP---VKWMSSKMRLMQKM-KTPDRVALKITSAA--TTKLEQ-------PXXXXXXXXX 742 NN KWM SKMR+M+KM PD+ + + T K +Q Sbjct: 126 NNAGYSAKWMPSKMRMMRKMIVNPDQTNIDHHTPLNFTHKFDQVMKRKHPASPLGTDHSS 185 Query: 741 XXXXXXXXXSPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRKXXXXXXXXXXXANG 562 + IRVC+DCNTTKTPLWRSGP+GPKSLCNACGIRQRK ANG Sbjct: 186 TSSSNNNNNNTIRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAAANG 245 Query: 561 T--ADQPPAMK--IKVQHKLEKTGKNGH--ASHFKKRCKXXXXXXXXXAGSPSDGPKKLG 400 T A MK KVQ K EK KNG+ FKKRCK SPS G KK+ Sbjct: 246 TILATDATTMKSSTKVQRK-EKKPKNGNGVVPQFKKRCK--------LTASPSRGRKKIC 296 Query: 399 FEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVHG 
283 FED I++SKN AF RVFP+DEKDAAILLMALS GLVHG Sbjct: 297 FEDLAISISKNSAFQRVFPQDEKDAAILLMALSYGLVHG 335 >ref|XP_002514107.1| hypothetical protein RCOM_1046780 [Ricinus communis] gi|223546563|gb|EEF48061.1| hypothetical protein RCOM_1046780 [Ricinus communis] Length = 312 Score = 170 bits (430), Expect = 1e-39 Identities = 133/314 (42%), Positives = 161/314 (51%), Gaps = 28/314 (8%) Frame = -2 Query: 1140 NNDQHHQPFGPCH----------------IFFNSTQDHMMESYNYDHHPQLYQPRQISDD 1009 N DQHH C IF N Q E Y +H +L D Sbjct: 17 NEDQHHHQLIFCSKTTTEDASSSSSISYPIFINPPQ----EEVGY-YHKELQPLHHQEVD 71 Query: 1008 NLGYHGGSTYE---IKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQ 838 N+ G +++ IKN+ +++G +L++ KKED ++ + N+ VKWMSSKMRLM+ Sbjct: 72 NIYASHGRSWDHRIIKNE-NENGQELSVCKKEDKSTSIEDQ--RDNSSVKWMSSKMRLMR 128 Query: 837 KMKTPDR-VALKITSAATTKLEQPXXXXXXXXXXXXXXXXXXS----PIRVCSDCNTTKT 673 KM T D+ V +++ KLE IRVCSDCNTTKT Sbjct: 129 KMMTTDQTVNTTQHTSSMHKLEDKEKSRSLPLQDDYSSKNLSDNSNNTIRVCSDCNTTKT 188 Query: 672 PLWRSGPKGPKSLCNACGIRQRKXXXXXXXXXXXANGT--ADQPPAMKI-KVQHKLEKTG 502 PLWRSGP+GPKSLCNACGIRQRK ANGT A AMK KVQ+K EK Sbjct: 189 PLWRSGPRGPKSLCNACGIRQRKARRALAAAQASANGTIFAPDTAAMKTNKVQNK-EKRT 247 Query: 501 KNGHASHFKKRCKXXXXXXXXXAGSPSDGPKKLGFEDFLIN-LSKNLAFGRVFPEDEKDA 325 N H FKKRCK KKL FED LSKN AF ++FP+DEK+A Sbjct: 248 NNSHLP-FKKRCK--------FTAQSRGSRKKLCFEDLSSTILSKNSAFQQLFPQDEKEA 298 Query: 324 AILLMALSSGLVHG 283 AILLMALS GLVHG Sbjct: 299 AILLMALSYGLVHG 312 >ref|XP_006451458.1| hypothetical protein CICLE_v10009004mg [Citrus clementina] gi|568843031|ref|XP_006475428.1| PREDICTED: putative GATA transcription factor 22-like [Citrus sinensis] gi|557554684|gb|ESR64698.1| hypothetical protein CICLE_v10009004mg [Citrus clementina] Length = 306 Score = 167 bits (424), Expect = 7e-39 Identities = 126/292 (43%), Positives = 153/292 (52%), Gaps = 17/292 (5%) Frame = -2 Query: 1107 CHIFFNSTQDHMMESYNYD---HHPQ----LYQPRQISDDNLGYHGGSTYEIKNKVDQSG 949 CH FF Q Y HP+ LY S D H G ++ + +G Sbjct: 37 
CHNFFEPVQREGGFYYRESVLLRHPKEVRILYSQAAGSCD----HPGPAVMDESGSESTG 92 Query: 948 LKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKM--KTPDRVALKITSAATTKLE 775 LKL++ +++ +D++ + ++ VKWMSSKMRLM+KM +PD +AA KLE Sbjct: 93 LKLSMSSEKEE--RNDQNQSENSSSVKWMSSKMRLMKKMMYSSPD-------AAAMQKLE 143 Query: 774 --QPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRKX 601 Q + IRVC+DCNTTKTPLWRSGP+GPKSLCNACGIRQRK Sbjct: 144 DHQKQPPSSSLEPDNGNNNNNTNTIRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRK- 202 Query: 600 XXXXXXXXXXANGTADQPPAMKIKVQHKLEKT---GKNGHASHFKKRCKXXXXXXXXXAG 430 ANGTA Q A K KT N FKKRCK + Sbjct: 203 -ARRAMAAAAANGTAVQLAADDTSSNKKKSKTPRPSNNNSCLPFKKRCK-------YNSN 254 Query: 429 SPSDGPKKL-GFEDFLINLSKN--LAFGRVFPEDEKDAAILLMALSSGLVHG 283 SPS G KKL FED +NLSKN A RVFP++EK+AAILLMALS GLVHG Sbjct: 255 SPSRGKKKLCSFEDLTLNLSKNNSSALQRVFPQEEKEAAILLMALSYGLVHG 306 >ref|XP_006353530.1| PREDICTED: putative GATA transcription factor 22-like [Solanum tuberosum] Length = 323 Score = 165 bits (417), Expect = 5e-38 Identities = 120/306 (39%), Positives = 149/306 (48%), Gaps = 31/306 (10%) Frame = -2 Query: 1107 CHIFFN-STQDHMMESYNYDHHP-QLYQPR-QISDDNLGYHGGSTYEIKNKVDQSGLKLT 937 C FFN ST ++ + YD+H Q +QP+ Q DN +++ K ++ GLKLT Sbjct: 58 CQTFFNISTTTNIQDQSGYDYHSHQFHQPQHQHEVDNFASRSSGSHDHLEKKNK-GLKLT 116 Query: 936 LWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVALKITSAATTKLEQPXXXX 757 L KK + QK +K K ++++ + + S++ + Sbjct: 117 LCKKGE----------QKMKNLKLEDQKQQIIETDYSSN-------SSSNNNI------- 152 Query: 756 XXXXXXXXXXXXXXSPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRKXXXXXXXXX 577 PIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRK Sbjct: 153 --------------IPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRKARRAAAAAA 198 Query: 576 XXANG-----TADQPPAMKIKVQ---HKLEKTGKNGHASHFKKRCKXXXXXXXXXAGSPS 421 N + + MKIKVQ HK+ K N H FKKRCK A P+ Sbjct: 199 AATNNGTNFTSTETTTTMKIKVQQQKHKITKVNTN-HVVPFKKRCKFLSNTTTTPAPVPA 257 Query: 420 DGP--------------------KKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALS 301 P K L FEDF +NLS NLA RVFP+DEK+AAILLMALS Sbjct: 258 PAPRVGSSSSSSSYNNNNDVQQKKNLCFEDFFVNLSNNLAIHRVFPQDEKEAAILLMALS 317 
Query: 300 SGLVHG 283 SGLVHG Sbjct: 318 SGLVHG 323 >ref|XP_002279283.1| PREDICTED: putative GATA transcription factor 22 [Vitis vinifera] gi|296081660|emb|CBI20665.3| unnamed protein product [Vitis vinifera] Length = 306 Score = 161 bits (407), Expect = 7e-37 Identities = 118/288 (40%), Positives = 143/288 (49%), Gaps = 13/288 (4%) Frame = -2 Query: 1110 PCHIFFNSTQDHMMESYNYDHHPQLYQPRQISDDNLGYHGG----------STYEIKNKV 961 PC FFNS+ +S DH P+ Q + DD HGG S + Sbjct: 44 PCPSFFNSST----QSQRGDHSPRDPQQHEDKDDKYISHGGCGESQVFSSSSLLQPMADD 99 Query: 960 DQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVALKITSAATTK 781 ++S KL+++KKE+ DE + KWMSSKMRLM+KM D KI Sbjct: 100 NKSSHKLSVFKKEE----GDEG---NKSTEKWMSSKMRLMRKMMNSDCTTAKIEQKV--- 149 Query: 780 LEQPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRK- 604 + PIRVCSDCNTTKTPLWRSGP+GPKSLCNACGIRQRK Sbjct: 150 --EDHQQWDNINEFNSSNNTSNIPIRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKA 207 Query: 603 XXXXXXXXXXXANGTA--DQPPAMKIKVQHKLEKTGKNGHASHFKKRCKXXXXXXXXXAG 430 ANGTA + MK+K+ +K EK + KK CK Sbjct: 208 RRAMAAAAAAAANGTAVGTEISPMKMKLPNK-EKKMHTSNVGQQKKLCKPP--------- 257 Query: 429 SPSDGPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVH 286 P KKL FEDF ++ KN F RVFP DE++AAILLMALS LV+ Sbjct: 258 CPPPTEKKLCFEDFTSSICKNSGFRRVFPRDEEEAAILLMALSCDLVY 305 >ref|XP_006600457.1| PREDICTED: GATA transcription factor 21-like isoform X2 [Glycine max] Length = 310 Score = 160 bits (404), Expect = 2e-36 Identities = 110/312 (35%), Positives = 148/312 (47%), Gaps = 27/312 (8%) Frame = -2 Query: 1140 NNDQHHQPFGPCH-------------IFFNS-TQDHMMESYNYDHHPQLYQPRQISDDNL 1003 N DQ+H+ F P H I FN QD SY ++ Q + + + Sbjct: 6 NEDQNHEFFSPTHHPSSSFSSLSSYPILFNPPNQDQEARSYYWEPTKQYLPSHEEETEKI 65 Query: 1002 GYHGGSTYEIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTP 823 GS + + + K T+WKK + + E + ++ +KWM +KMR+M+KM Sbjct: 66 IPSSGSWDHSVAESEHN--KATVWKKAEERNENLESVAAEDGSLKWMPAKMRIMRKMLVS 123 Query: 822 DRVALKITSAATT-------KLEQPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTPLW 664 D+ S T K + + +RVCSDC+TTKTPLW Sbjct: 124 
DQTDTYTNSDNNTTHKFDDQKQQLSSPLGTDNSSSNNYSNHSNNTVRVCSDCHTTKTPLW 183 Query: 663 RSGPKGPKSLCNACGIRQRKXXXXXXXXXXXANGT------ADQPPAMKIKVQHKLEKTG 502 RSGP+GPKSLCNACGIRQRK A+G A + + K+Q K EK Sbjct: 184 RSGPRGPKSLCNACGIRQRKARRAMAAAAASASGNGTVIVEAKKSVKGRNKLQKKKEKKT 243 Query: 501 KNGHASHFKKRCKXXXXXXXXXAGSPSDGPKKLGFEDFLINLSKNLAFGRVFPEDEKDAA 322 + A+ KK+ K + S K GFED + L KNLA +VFP+DEK+AA Sbjct: 244 RTEGAAQMKKKRK-----LGVGSAKASQSRNKFGFEDLTLRLRKNLAMHQVFPQDEKEAA 298 Query: 321 ILLMALSSGLVH 286 ILLMALS GLVH Sbjct: 299 ILLMALSYGLVH 310 >ref|XP_003550634.1| PREDICTED: GATA transcription factor 21-like isoform X1 [Glycine max] Length = 322 Score = 160 bits (404), Expect = 2e-36 Identities = 110/312 (35%), Positives = 148/312 (47%), Gaps = 27/312 (8%) Frame = -2 Query: 1140 NNDQHHQPFGPCH-------------IFFNS-TQDHMMESYNYDHHPQLYQPRQISDDNL 1003 N DQ+H+ F P H I FN QD SY ++ Q + + + Sbjct: 18 NEDQNHEFFSPTHHPSSSFSSLSSYPILFNPPNQDQEARSYYWEPTKQYLPSHEEETEKI 77 Query: 1002 GYHGGSTYEIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTP 823 GS + + + K T+WKK + + E + ++ +KWM +KMR+M+KM Sbjct: 78 IPSSGSWDHSVAESEHN--KATVWKKAEERNENLESVAAEDGSLKWMPAKMRIMRKMLVS 135 Query: 822 DRVALKITSAATT-------KLEQPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTPLW 664 D+ S T K + + +RVCSDC+TTKTPLW Sbjct: 136 DQTDTYTNSDNNTTHKFDDQKQQLSSPLGTDNSSSNNYSNHSNNTVRVCSDCHTTKTPLW 195 Query: 663 RSGPKGPKSLCNACGIRQRKXXXXXXXXXXXANGT------ADQPPAMKIKVQHKLEKTG 502 RSGP+GPKSLCNACGIRQRK A+G A + + K+Q K EK Sbjct: 196 RSGPRGPKSLCNACGIRQRKARRAMAAAAASASGNGTVIVEAKKSVKGRNKLQKKKEKKT 255 Query: 501 KNGHASHFKKRCKXXXXXXXXXAGSPSDGPKKLGFEDFLINLSKNLAFGRVFPEDEKDAA 322 + A+ KK+ K + S K GFED + L KNLA +VFP+DEK+AA Sbjct: 256 RTEGAAQMKKKRK-----LGVGSAKASQSRNKFGFEDLTLRLRKNLAMHQVFPQDEKEAA 310 Query: 321 ILLMALSSGLVH 286 ILLMALS GLVH Sbjct: 311 ILLMALSYGLVH 322 >gb|ESW26655.1| hypothetical protein PHAVU_003G137100g [Phaseolus vulgaris] Length = 309 Score = 155 bits (391), Expect = 5e-35 Identities = 112/308 (36%), Positives = 145/308 (47%), Gaps = 22/308 (7%) Frame = -2 Query: 1140 
NNDQHHQPFGPCH--------------IFFNSTQDHMMESYNYDHHPQLYQPRQISDDNL 1003 N DQ+H+ F P H + FN + E+ ++ P + P + + Sbjct: 18 NEDQNHELFTPTHHAYPSFSSLSSSYPLLFNPPEQ---EAGSHYWEPTKHLPAYEQAEKI 74 Query: 1002 GYHGGSTYEIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTP 823 GS + V +S LK+ +WK ++ +D ++ V MS KMR+M+K P Sbjct: 75 NPTRGSW---DHSVTESELKVAVWKNKERS--EDHEAAAEDGSVNLMSLKMRMMRKTMVP 129 Query: 822 DRVALKITSAATTKLE---QPXXXXXXXXXXXXXXXXXXS--PIRVCSDCNTTKTPLWRS 658 D+ I K E QP S +RVC+DC+TTKTPLWRS Sbjct: 130 DQTGAYIEDRTMHKFEDQKQPLSPLGTDNSSSSNNYSNHSNNTVRVCADCHTTKTPLWRS 189 Query: 657 GPKGPKSLCNACGIRQRKXXXXXXXXXXXANGTA---DQPPAMKIKVQHKLEKTGKNGHA 487 GP+GPKSLCNACGIRQRK NGT Q K+Q K +KT G Sbjct: 190 GPRGPKSLCNACGIRQRK-ARRAMAAAASGNGTVILETQKSVKGNKLQKKEKKTRTQGAP 248 Query: 486 SHFKKRCKXXXXXXXXXAGSPSDGPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMA 307 KKR PS K GFED + L K+LA +VFP+DEK+AAILLMA Sbjct: 249 QMKKKR-------NHGVGAKPSQSRNKFGFEDLTLRLRKSLAMHQVFPQDEKEAAILLMA 301 Query: 306 LSSGLVHG 283 LS GLVHG Sbjct: 302 LSYGLVHG 309 >gb|EMJ04350.1| hypothetical protein PRUPE_ppa024374mg [Prunus persica] Length = 297 Score = 155 bits (391), Expect = 5e-35 Identities = 119/309 (38%), Positives = 155/309 (50%), Gaps = 36/309 (11%) Frame = -2 Query: 1101 IFFNSTQDHMMESYNYDHHPQLYQPRQISDDN---LGYHGGSTYEIKNKVDQSG----LK 943 IF N +Q + + PQ +Q + + D+ + Y G Y+ + ++SG LK Sbjct: 4 IFLNPSQAQAPSGHYRE--PQNFQFQLLEADHHNIVSYGGSCDYDPQTLENESGSGTILK 61 Query: 942 LTLWKKEDHMGHDDEHIPQKNNPV--KWMSSKMRLMQKMKTPDR------------VALK 805 L++ K E + NP KWMSSKMR+M+KM PD+ VA+K Sbjct: 62 LSISKNE---------AGRNGNPSTDKWMSSKMRMMKKMTNPDQTSSSCTSSDDKPVAMK 112 Query: 804 ITSAATTKLEQPXXXXXXXXXXXXXXXXXXSP--IRVCSDCNTTKTPLWRSGPKGPKSLC 631 ++ + ++ ++P + IRVCSDCNTTKTPLWRSGP+GPKSLC Sbjct: 113 LSISHKSEEQKPQHPDMISCSNKSSNIMNNNVPIIRVCSDCNTTKTPLWRSGPRGPKSLC 172 Query: 630 NACGIRQRKXXXXXXXXXXXANGTA-DQPPAMK--IKVQHKLEKTGKNGHASHFKKRCKX 460 NACGIRQRK A+GT P+MK K QHK K + FKKR Sbjct: 173 NACGIRQRKARRAMAAAAAAASGTTLAAAPSMKSTSKAQHKDNKP-RGASTVPFKKRPYN 231 Query: 459 
XXXXXXXXAGSPSDGPKKLGFEDFLINLSKN----------LAFGRVFPEDEKDAAILLM 310 G P PKKL FEDF I++ N + RVFP+DEK+AAILLM Sbjct: 232 KLSSTPPSKGRP---PKKLCFEDFAISMDNNHSSSATTTTTTSLQRVFPQDEKEAAILLM 288 Query: 309 ALSSGLVHG 283 ALS GLVHG Sbjct: 289 ALSCGLVHG 297 >gb|EOY29900.1| GATA type zinc finger transcription factor family protein, putative [Theobroma cacao] Length = 311 Score = 153 bits (386), Expect = 2e-34 Identities = 109/268 (40%), Positives = 134/268 (50%), Gaps = 7/268 (2%) Frame = -2 Query: 1068 ESYNYDHHPQLYQPRQISDDNLGYHGGSTYEIKNKVDQS---GLKLTLWKKEDHMGHDDE 898 ES +DH + + S D S+ +++ VDQS G L+ +KED D E Sbjct: 60 ESKPHDHKGNQFMTHEGSIDQ---QASSSSSLQSAVDQSTANGYNLSFSRKEDG---DCE 113 Query: 897 HIPQKNNPVKWMSSKMRLMQKMKTPDRVALKITSAATTKLEQPXXXXXXXXXXXXXXXXX 718 + VKWMSSK+RLM+KM + K Q Sbjct: 114 SASGNGSSVKWMSSKVRLMKKMMNSNCSG---ADDKPPKFTQRFQYPVHDSDETNSFSKA 170 Query: 717 XSPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRK--XXXXXXXXXXXANGTADQPP 544 + +RVCSDCNTT TPLWRSGP+GPKSLCNACGIRQRK NG A Sbjct: 171 NNTVRVCSDCNTTTTPLWRSGPRGPKSLCNACGIRQRKARRAMEAAAAAAAENGAAAAAD 230 Query: 543 A--MKIKVQHKLEKTGKNGHASHFKKRCKXXXXXXXXXAGSPSDGPKKLGFEDFLINLSK 370 A MKIKV EK + H + KK+ K SP KKL F++F ++LSK Sbjct: 231 ASSMKIKVHIHKEKKSRTSHVAQCKKQVKPPYY-------SP-QSQKKLCFKEFALSLSK 282 Query: 369 NLAFGRVFPEDEKDAAILLMALSSGLVH 286 N A RVFP+D +DAAILLM LS GLVH Sbjct: 283 NSALQRVFPQDVEDAAILLMELSCGLVH 310 >ref|XP_003543725.1| PREDICTED: GATA transcription factor 21-like [Glycine max] Length = 314 Score = 151 bits (382), Expect = 5e-34 Identities = 111/313 (35%), Positives = 150/313 (47%), Gaps = 28/313 (8%) Frame = -2 Query: 1140 NNDQHHQPFGPCH--------------IFFNS-TQDHMMESYNYDHHPQLYQPRQISDDN 1006 N DQ+H+ F P H I FN QD SY+++ L + ++ Sbjct: 18 NEDQNHEFFSPIHHPSSSFSSLSSSYPILFNPPNQDQEARSYDWETTKHLPSHEEEAEKI 77 Query: 1005 LGYHGGSTYEIKNKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKT 826 + G + V++S K+T+W+KE+ +E++ + + VKWM SKMR+M+KM Sbjct: 78 IPTSGSWGHS----VEESEHKVTVWRKEER----NENLAEDGS-VKWMPSKMRIMRKMLV 128 Query: 825 
PDRVALKITSAATT--------KLEQPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTP 670 ++ + TT +L P +RVCSDC+TTKTP Sbjct: 129 SNQTDAYTSDNNTTHKFDDHKQQLSSPLGIDDNSSNNYSDKSNNSI-VRVCSDCHTTKTP 187 Query: 669 LWRSGPKGPKSLCNACGIRQRKXXXXXXXXXXXANG-----TADQPPAMKIKVQHKLEKT 505 LWRSGP+GPKSLCNACGIRQRK A G + K+Q K EK Sbjct: 188 LWRSGPRGPKSLCNACGIRQRKARRAMAAAAAAALGDGAVIVEAEKSVKGKKLQKKKEKK 247 Query: 504 GKNGHASHFKKRCKXXXXXXXXXAGSPSDGPKKLGFEDFLINLSKNLAFGRVFPEDEKDA 325 + A+ K + K S K GFED + L KNLA +VFP+DEK+A Sbjct: 248 TRIEGAAQMKMKRK------LGVGAKASQSRNKFGFEDLTLRLRKNLAMHQVFPQDEKEA 301 Query: 324 AILLMALSSGLVH 286 AILLMALS GLVH Sbjct: 302 AILLMALSYGLVH 314 >gb|ADL36695.1| GATA domain class transcription factor [Malus domestica] Length = 359 Score = 149 bits (377), Expect = 2e-33 Identities = 121/312 (38%), Positives = 150/312 (48%), Gaps = 44/312 (14%) Frame = -2 Query: 1086 TQDHMMESYNYDHHPQLYQPRQISDDNLGYHGGSTYEIKNKVDQSG-----LKLTLWKK- 925 + DH E + + QL + +D N+ HGGS ++ G LKL++ K Sbjct: 64 SDDHYREPHQFQF--QLLE----ADHNIVPHGGSHDHDHQAIENEGGSGTVLKLSISKNG 117 Query: 924 ---EDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVALKITSA------------- 793 + G D E + VKWMSSKMR+M+KM PD+ + TS+ Sbjct: 118 AVGNGNPGTDHE---TSTSSVKWMSSKMRMMRKMSNPDQTSSSSTSSDDKPISMKLSSHK 174 Query: 792 -ATTKLEQPXXXXXXXXXXXXXXXXXXSP----IRVCSDCNTTKTPLWRSGPKGPKSLCN 628 KL+ P IRVCSDCNTTKTPLWRSGP+GPKSLCN Sbjct: 175 FEEQKLQHPSSQLGADMISCSNNSSNNMNNVPIIRVCSDCNTTKTPLWRSGPRGPKSLCN 234 Query: 627 ACGIRQRKXXXXXXXXXXXANGT--ADQPPAMK-IKVQHKLEKTGKNGHASHFKKRCKXX 457 ACGIRQRK A+GT P+MK KVQ K K+ + FKKR Sbjct: 235 ACGIRQRKARRAMAAAAAAASGTTLTVAAPSMKSSKVQPKANKS-RVSSTVPFKKR---- 289 Query: 456 XXXXXXXAGSPSD--GPKKLGFEDFLINLSKNLAFG------------RVFPEDEKDAAI 319 + SPS KKL FEDF I++ N + G RVFP+DEK+AAI Sbjct: 290 --PYNKLSSSPSSRGKSKKLCFEDFTISMKNNSSSGNPTAATTTTALQRVFPQDEKEAAI 347 Query: 318 LLMALSSGLVHG 283 LLMALS GLVHG Sbjct: 348 LLMALSCGLVHG 359 >emb|CAN63090.1| hypothetical protein VITISV_032017 [Vitis vinifera] Length = 211 Score = 147 bits (371), Expect = 1e-32 Identities = 102/228 (44%), 
Positives = 122/228 (53%), Gaps = 3/228 (1%) Frame = -2 Query: 960 DQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVALKITSAATTK 781 ++S KL+++KKE+ DE + KWMSSKMRLM+KM D KI Sbjct: 5 NKSSHKLSVFKKEE----GDEG---NKSTEKWMSSKMRLMRKMMNSDCTTAKIEQKV--- 54 Query: 780 LEQPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRK- 604 + PIRVCSDCNTTKTPLWRSGP+GPKSLCNACGIRQRK Sbjct: 55 --EDHQQWDNINEXNSSNNTSNIPIRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKA 112 Query: 603 XXXXXXXXXXXANGTA--DQPPAMKIKVQHKLEKTGKNGHASHFKKRCKXXXXXXXXXAG 430 ANGTA + MK+K+ +K EK + KK CK Sbjct: 113 RRAMAAAAAAAANGTAVGTEISPMKMKLPNK-EKKMHTSNVGQQKKLCKPP--------- 162 Query: 429 SPSDGPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVH 286 P KKL FEDF ++ KN F RVFP DE++AAILLMALS LV+ Sbjct: 163 CPPPTEKKLCFEDFTSSICKNSGFRRVFPRDEEEAAILLMALSCDLVY 210 >gb|ESW10726.1| hypothetical protein PHAVU_009G232700g [Phaseolus vulgaris] Length = 306 Score = 146 bits (368), Expect = 2e-32 Identities = 106/296 (35%), Positives = 139/296 (46%), Gaps = 8/296 (2%) Frame = -2 Query: 1146 NDNNDQHHQPFG-PCHIFFNSTQDHMMESYNYDHHPQLYQPRQISDDNLGYHGGSTYEIK 970 N+ + H QP I FN QD Y H Q + Q + G +I+ Sbjct: 18 NEEDHTHKQPSSLSTSILFNPDQDQGGFCYWESKHFQSDEEAQKIVPSSGSWDHPVEKIE 77 Query: 969 NKVDQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVALKITSAA 790 N+ D LKL +WKKE+ G D+ K MSSKMR+++KM D I + Sbjct: 78 NRSD---LKLRVWKKEE--GCDN----LKGEDSSTMSSKMRMVRKMIVSDETDSDIADIS 128 Query: 789 TTKL-------EQPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTPLWRSGPKGPKSLC 631 ++K + P+RVC DC+TTKTPLWRSGPKGPKSLC Sbjct: 129 SSKQIKYKKKNPELSPLVTDDSNCNSSSNQNSVPLRVCVDCHTTKTPLWRSGPKGPKSLC 188 Query: 630 NACGIRQRKXXXXXXXXXXXANGTADQPPAMKIKVQHKLEKTGKNGHASHFKKRCKXXXX 451 NACGIRQRK ANG + ++K + K GK H+ K + + Sbjct: 189 NACGIRQRKERRAIAAAATTANG------SNRLKAEKSEMKKGKKLHSKGKKSKTEGAPA 242 Query: 450 XXXXXAGSPSDGPKKLGFEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVHG 283 + + FED + LS N A +VFP+DEK+AAILLMALS GL+HG Sbjct: 243 LLKKKRKPAKNRKRFRAFEDLTVRLSNNSAVQQVFPQDEKEAAILLMALSHGLLHG 298 >ref|XP_004287558.1| PREDICTED: uncharacterized protein 
LOC101297577 [Fragaria vesca subsp. vesca] Length = 357 Score = 145 bits (366), Expect = 4e-32 Identities = 114/323 (35%), Positives = 147/323 (45%), Gaps = 50/323 (15%) Frame = -2 Query: 1101 IFFNSTQDHMMESYNYDHHPQLYQPRQISDDNLGYHGGSTYEIKNKVDQSGLKLTLWK-- 928 IF + Q S +Y PQ +Q + + D++ +GGS + + G K T+ Sbjct: 49 IFLSPAQVQGPISDHYYREPQDFQFQLLEADHIVSYGGSC-DHDQTLGNEGEKGTVINLS 107 Query: 927 -------KEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRV--------------- 814 +DH H++ +N VKWMSSKMR+M+KM PD+ Sbjct: 108 IDPKHGADDDHRDHENRSARAENISVKWMSSKMRIMRKMTNPDQTISSHNNTTAATNDGT 167 Query: 813 ALKITSAATTKLEQPXXXXXXXXXXXXXXXXXXSPIRVCSDCNTTKTPLWRSGPKGPKSL 634 ++ +A+ E+ PIRVCSDCNTTKTPLWRSGP+GPKSL Sbjct: 168 TARVNFSASHNFEEQKLHPLSPLGTDSSYSTN--PIRVCSDCNTTKTPLWRSGPRGPKSL 225 Query: 633 CNACGIRQRKXXXXXXXXXXXANGT---ADQPPAMKIKVQHKLEKTGKNGHASHFKKRCK 463 CNACGIRQRK AN T + P+M + KL K+ FKKRC Sbjct: 226 CNACGIRQRKARRAMAAAAAAANSTTLAVEAAPSMIKTSKVKL----KDNKTIPFKKRCH 281 Query: 462 XXXXXXXXXAGSPSDGPK---KLGFEDFLIN--------------------LSKNLAFGR 352 SPS K KL FEDF ++ + F R Sbjct: 282 KLAI-------SPSPRGKSKTKLRFEDFSVSSMNQNSGTDPPPPPTTTTTTTTTTTTFQR 334 Query: 351 VFPEDEKDAAILLMALSSGLVHG 283 VFP+DEK+AAILLMALS GLV G Sbjct: 335 VFPQDEKEAAILLMALSCGLVRG 357 >ref|XP_003546455.1| PREDICTED: putative GATA transcription factor 22-like [Glycine max] Length = 315 Score = 142 bits (359), Expect = 3e-31 Identities = 95/240 (39%), Positives = 122/240 (50%), Gaps = 14/240 (5%) Frame = -2 Query: 960 DQSGLKLTLWKKEDHMGHDDEHIPQKNNPVKWMSSKMRLMQKMKTPDRVALK-----ITS 796 ++S LKL +WKKED E+ ++N KWM KMR+M+++ D+ I++ Sbjct: 84 NKSDLKLRVWKKEDKC----ENFQGEDNSTKWMPLKMRMMRRLMVSDQTGSDDTEGMISN 139 Query: 795 AATTKLEQPXXXXXXXXXXXXXXXXXXS----PIRVCSDCNTTKTPLWRSGPKGPKSLCN 628 + K E+ + +RVCSDC+TTKTPLWRSGPKGPKSLCN Sbjct: 140 SQKIKYEEKNSPLSPLGTDDSNYNSSSNHSNITVRVCSDCHTTKTPLWRSGPKGPKSLCN 199 Query: 627 ACGIRQRKXXXXXXXXXXXANGT----ADQPPAMKIKVQHKLEKTGKNGHASHFKKRCKX 460 ACGIRQRK +NGT A++ K H K A KK K Sbjct: 200 ACGIRQRK-VRRAIAAAATSNGTNPVEAEKSQVKKGNTLHSKGMKSKTEGAQQMKKNRKL 
258 Query: 459 XXXXXXXXAGSPSDGPKKLG-FEDFLINLSKNLAFGRVFPEDEKDAAILLMALSSGLVHG 283 K+ G FED + LSKN A +VFP+DEK+AAILLMALS GL+HG Sbjct: 259 GARYR-----------KRFGAFEDLTVRLSKNFALQQVFPQDEKEAAILLMALSYGLLHG 307