BLASTX nr result
ID: Cocculus23_contig00009221
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00009221
         (1661 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002272762.1| PREDICTED: GATA transcription factor 5 [Viti...  251   9e-64
emb|CAN64003.1| hypothetical protein VITISV_037635 [Vitis vinifera]  246   2e-62
gb|ADL36694.1| GATA domain class transcription factor [Malus dom...  244   1e-61
ref|XP_007204696.1| hypothetical protein PRUPE_ppa008278mg [Prun...  241   5e-61
ref|XP_007046767.1| GATA transcription factor 5, putative [Theob...  233   1e-58
ref|XP_002305457.2| hypothetical protein POPTR_0004s16860g [Popu...  227   1e-56
ref|XP_002865076.1| zinc finger family protein [Arabidopsis lyra...  221   6e-55
ref|XP_006425558.1| hypothetical protein CICLE_v10025844mg [Citr...  221   1e-54
ref|XP_006425559.1| hypothetical protein CICLE_v10025844mg [Citr...  220   2e-54
ref|NP_201433.1| GATA transcription factor 5 [Arabidopsis thalia...  219   3e-54
ref|XP_004287842.1| PREDICTED: GATA transcription factor 5-like ...  219   4e-54
ref|XP_006280758.1| hypothetical protein CARUB_v10026725mg [Caps...  215   4e-53
ref|XP_002313763.2| hypothetical protein POPTR_0009s12620g [Popu...  213   2e-52
gb|ADL36697.1| GATA domain class transcription factor [Malus dom...  213   2e-52
ref|XP_004230938.1| PREDICTED: GATA transcription factor 5-like ...  213   3e-52
ref|XP_006340696.1| PREDICTED: GATA transcription factor 5-like ...  211   8e-52
ref|XP_006362004.1| PREDICTED: GATA transcription factor 5-like ...  211   1e-51
gb|EYU24084.1| hypothetical protein MIMGU_mgv1a009982mg [Mimulus...
208 7e-51 ref|XP_007042041.1| GATA transcription factor 5, putative [Theob... 208 7e-51 dbj|BAC98494.1| AG-motif binding protein-4 [Nicotiana tabacum] 206 2e-50 >ref|XP_002272762.1| PREDICTED: GATA transcription factor 5 [Vitis vinifera] Length = 338 Score = 251 bits (640), Expect = 9e-64 Identities = 160/343 (46%), Positives = 194/343 (56%), Gaps = 16/343 (4%) Frame = -3 Query: 1455 MECVGIKALKTSFW-PEMSLKSNHPQVICEDIWXXXXXXXXXXXDLFVEGLLDFSNEDVH 1279 MECV KALK+S PE++ K +D+ D ++ LLDF+N + Sbjct: 1 MECVE-KALKSSVVRPELAFKLTQQPACMDDMCMGNGQSGVSGDDFSIDDLLDFTNGGIG 59 Query: 1278 VGFVEDEDKD-------SLSASSS-QEGQNTSHTRSGFSLKDEPMSAPESELAVPTDDLA 1123 G ++ED++ SLS E N++ T + FS+KDE S P +EL VP DDLA Sbjct: 60 EGLFQEEDEEDEDKGCGSLSPRGELTENDNSNLTTTTFSVKDEFPSVPATELTVPADDLA 119 Query: 1122 DLEWLSHFVDDSFSEFSLQHVTKNPSEKKTQTSPKQNRPESALLKRPIGFLTSFPAKSKR 943 DLEWLSHFV+DSFSE+S +EK + PE+ L + T FPAK+ R Sbjct: 120 DLEWLSHFVEDSFSEYSAPFPHGTLTEKAQNQTENPPEPETPLQIKSC-LKTPFPAKA-R 177 Query: 942 SKRLRTTVGRVWSFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTGVQPLDLVNGAGKPPG 763 SKR RT GRVWS G Q ++ + A KPP Sbjct: 178 SKRARTG-GRVWSM-GSPSLTESSSSSSSSSSSSLSSPWLIYPNTCQNVESFHSAVKPPA 235 Query: 762 KKQKRKL------GAELPQRRCSHCGVVKTPQWRTGPKGAKTLCNACGVRYKSGRLLPEY 601 KK K++L A+ RCSHCGV KTPQWRTGP GAKTLCNACGVRYKSGRLLPEY Sbjct: 236 KKHKKRLDPEASGSAQPTPHRCSHCGVQKTPQWRTGPLGAKTLCNACGVRYKSGRLLPEY 295 Query: 600 RPACSPTFSSEVHSNNHRKVLEMRRKK-MGLPEPSSVPAVQSF 475 RPACSPTFSSE+HSN+HRKVLEMRRKK + PE PAV SF Sbjct: 296 RPACSPTFSSEIHSNHHRKVLEMRRKKEVTRPESGLAPAVPSF 338 >emb|CAN64003.1| hypothetical protein VITISV_037635 [Vitis vinifera] Length = 338 Score = 246 bits (628), Expect = 2e-62 Identities = 160/343 (46%), Positives = 191/343 (55%), Gaps = 16/343 (4%) Frame = -3 Query: 1455 MECVGIKALKTSFW-PEMSLKSNHPQVICEDIWXXXXXXXXXXXDLFVEGLLDFSNEDVH 1279 MECV KALK+S PE++ K +DI D ++ LLDF+N + Sbjct: 1 MECVE-KALKSSVVRPELAFKLTQQPACXDDICMGNGQSGVSGDDFSIDDLLDFTNGGIG 59 Query: 1278 VGFV-----EDEDKDSLSASSSQE---GQNTSHTRSGFSLKDEPMSAPESELAVPTDDLA 1123 G EDEDK S S +E N++ T + 
FS+KDE S P +EL VP DDLA Sbjct: 60 EGLFQEEDEEDEDKGCGSLSPRRELTENDNSNLTTTTFSVKDEFPSVPATELTVPADDLA 119 Query: 1122 DLEWLSHFVDDSFSEFSLQHVTKNPSEKKTQTSPKQNRPESALLKRPIGFLTSFPAKSKR 943 DLEWLSHFV+DSFSE+S +EK + PE+ L + T FPAK+ R Sbjct: 120 DLEWLSHFVEDSFSEYSAPFPPGTLTEKAQNQTENPPEPETPLQIKSC-LKTPFPAKA-R 177 Query: 942 SKRLRTTVGRVWSFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTGVQPLDLVNGAGKPPG 763 SKR RT GRVWS G Q ++ + A KPP Sbjct: 178 SKRARTG-GRVWSM-GSPSLTESSSSSSSSSSSSLSSPWLIYPNTCQNVESFHSAVKPPA 235 Query: 762 KKQKRKL------GAELPQRRCSHCGVVKTPQWRTGPKGAKTLCNACGVRYKSGRLLPEY 601 KK K++L A+ RCSHCGV KT QWRTGP GAKTLCNACGVR+KSGRLLPEY Sbjct: 236 KKHKKRLDPEASGSAQXTPHRCSHCGVQKTXQWRTGPLGAKTLCNACGVRFKSGRLLPEY 295 Query: 600 RPACSPTFSSEVHSNNHRKVLEMRRKK-MGLPEPSSVPAVQSF 475 RPACSPTFSSE+HSN+HRKVLEMRRKK + P PAV SF Sbjct: 296 RPACSPTFSSEIHSNHHRKVLEMRRKKEVTRPXSGLAPAVPSF 338 >gb|ADL36694.1| GATA domain class transcription factor [Malus domestica] Length = 331 Score = 244 bits (622), Expect = 1e-61 Identities = 170/346 (49%), Positives = 201/346 (58%), Gaps = 19/346 (5%) Frame = -3 Query: 1455 MECVGIKALKTSFWPEMSLKSNHPQVICED--IWXXXXXXXXXXXDLF-VEGLLDFSNED 1285 MECV ALKTS EM++K+ PQV+ D +W D F V+ LLDFSNED Sbjct: 1 MECVEA-ALKTSIRKEMAVKATGPQVVVFDDFLWGGAVVNGQNACDDFSVDDLLDFSNED 59 Query: 1284 VHVGFVEDE-----DKDSLSA--SSSQEGQNTSHTRSGFSLKDEPMSAPESELAVPTDDL 1126 GFVE E DK+ + S S + QN +S S K EP S EL+VP DDL Sbjct: 60 ---GFVETEAEEEGDKEKVKGFVSVSLQKQNQETEKSNLSEKIEPAS----ELSVPADDL 112 Query: 1125 ADLEWLSHFVDDSFSEFSLQHVTKNPSEKKTQTSPKQNRP--ESALLKRPIGFLTSFPAK 952 +LEWLSHFV+DSFSEF+ T P+ + + RP E+ ++P F T PAK Sbjct: 113 ENLEWLSHFVEDSFSEFT----TALPAGFLPEKPKSEKRPDLETPFPEKPC-FKTPVPAK 167 Query: 951 SKRSKRLRTTVGRVWSFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTGVQP-LDLVNGAG 775 + RSKR RT GRVWS T Q + V+ Sbjct: 168 A-RSKRRRTG-GRVWSLGSPSLTESSSSSSSSSSSSPSSPWTIYPATQNQESAEPVSSVE 225 Query: 774 KPPGKKQKRKL--GAELPQRRCSHCGVVKTPQWRTGPKGAKTLCNACGVRYKSGRLLPEY 601 KPP K ++R + + P RRCSHCGV KTPQWRTGP GAKTLCNACGVRYKSGRLLPEY Sbjct: 226 
KPPRKPKRRLVDGSSSQPPRRCSHCGVQKTPQWRTGPNGAKTLCNACGVRYKSGRLLPEY 285 Query: 600 RPACSPTFSSEVHSNNHRKVLEMRRKK--MGLPEPSSV--PAVQSF 475 RPACSPTFSSE+HSN+HRKV+EMRRKK G PEPS+ PAV SF Sbjct: 286 RPACSPTFSSELHSNHHRKVIEMRRKKEGPGTPEPSTTIPPAVPSF 331 >ref|XP_007204696.1| hypothetical protein PRUPE_ppa008278mg [Prunus persica] gi|462400227|gb|EMJ05895.1| hypothetical protein PRUPE_ppa008278mg [Prunus persica] Length = 338 Score = 241 bits (616), Expect = 5e-61 Identities = 167/348 (47%), Positives = 197/348 (56%), Gaps = 21/348 (6%) Frame = -3 Query: 1455 MECVGIKALKTSFWPEMSLKSNHPQVICEDIWXXXXXXXXXXXDLF-VEGLLDFSNEDVH 1279 MECV ALKTS EM++K++ V + +W D F V+ LLDFSNED Sbjct: 1 MECVEA-ALKTSIRKEMAVKASSQAVFDDLLWGGVNGQNGVACDDFSVDDLLDFSNED-- 57 Query: 1278 VGFVEDE----DKDSLS--ASSSQEGQNTSHTRSGFSLKDEPMSAPESELAVPTDDLADL 1117 GFVE E DKD + AS + Q S S K+E P SEL+VP DDL +L Sbjct: 58 -GFVETEAEEDDKDKVKGFASVPPQKQPQDPENSDLSEKNELGPEPTSELSVPADDLENL 116 Query: 1116 EWLSHFVDDSFSEFSLQ-HVTKNPSEKKTQTSPKQNRPESALLKRPIGFLTSFPAKSKRS 940 EWLSHFV+DSF+EF+ P + KT+ P P + L ++P F T PAK+ RS Sbjct: 117 EWLSHFVEDSFTEFTTSLPAGFIPEKPKTEKRPD---PAAPLPEKPC-FKTPVPAKA-RS 171 Query: 939 KRLRTTVGRVWSFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTGVQPLDLVNGAGKP--- 769 KR RT GRVWS G Q + G+P Sbjct: 172 KRTRTG-GRVWSL-GSPSLTETSSSSSSSSSSSSPSSPWLIYPTTQNREPAEAGGEPVGS 229 Query: 768 ---PGKKQKRKL---GAELPQRRCSHCGVVKTPQWRTGPKGAKTLCNACGVRYKSGRLLP 607 P KK KR+L + P RRCSHCGV KTPQWRTGP GAKTLCNACGVRYKSGRLLP Sbjct: 230 VEKPPKKPKRRLVDGSSSQPPRRCSHCGVQKTPQWRTGPNGAKTLCNACGVRYKSGRLLP 289 Query: 606 EYRPACSPTFSSEVHSNNHRKVLEMRRKK--MGLPEP--SSVPAVQSF 475 EYRPACSPTFSSE+HSN+HRKVLEMR+KK G+PEP + P V SF Sbjct: 290 EYRPACSPTFSSELHSNHHRKVLEMRKKKDVTGVPEPGLTRPPVVPSF 337 >ref|XP_007046767.1| GATA transcription factor 5, putative [Theobroma cacao] gi|508699028|gb|EOX90924.1| GATA transcription factor 5, putative [Theobroma cacao] Length = 389 Score = 233 bits (595), Expect = 1e-58 Identities = 159/358 (44%), Positives = 191/358 (53%), Gaps = 32/358 (8%) Frame = -3 Query: 1461 
ESMECVGIKALKTSFWPEMSLKSNHPQVICEDIWXXXXXXXXXXXDLFVEGLLDFSNEDV 1282 + MECV ALKTSF EM+LKS+ PQ EDIW D V+ L DF+NE+ Sbjct: 37 QEMECVEA-ALKTSFRKEMALKSS-PQAFLEDIWLANGQNGVSSDDFSVDDLFDFTNEE- 93 Query: 1281 HVGFVE--------------DEDKDSLSASSSQEGQNTS---HTRSGFSLKDEPMSAPES 1153 GF+E DE S+SSS + Q S H + + + S P S Sbjct: 94 --GFLEQQQQPQHEEEEEEEDEGAPISSSSSSPKRQKLSQEEHLSNDTTTNFDYGSLPTS 151 Query: 1152 ELAVPTDDLADLEWLSHFVDDSFSEFSLQHVTKNPSEKKTQTSPKQNRPESALLKRPIGF 973 ELAVP DD+A+LEWLSHFV+DSFSE S + T +E + PE ++ F Sbjct: 152 ELAVPADDVANLEWLSHFVEDSFSEHSTAYPTGTLTENPKLQADILAEPEKPVITTC--F 209 Query: 972 LTSFPAKSKRSKRLRTTVGRVWSFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTG----- 808 T PAK+ RSKR RT GRVWS Sbjct: 210 KTPVPAKA-RSKRTRTG-GRVWSLVASPSLTESSSSSTSSSSSSSPSSPWLLYPNSGSGS 267 Query: 807 -VQPLDLVNGAGKPPGKKQKRKLGAEL-------PQRRCSHCGVVKTPQWRTGPKGAKTL 652 +P + ++ KPP KK K++ + P RRCSHCGV KTPQWR GP GAKTL Sbjct: 268 TFEPSEPLS-VEKPPAKKHKKRPATDSTGGNGTQPTRRCSHCGVTKTPQWRAGPMGAKTL 326 Query: 651 CNACGVRYKSGRLLPEYRPACSPTFSSEVHSNNHRKVLEMRRKK--MGLPEPSSVPAV 484 CNACGVR+KSGRLLPEYRPACSPTFSSE+HSN+HRKVLEMRRKK +G P P V Sbjct: 327 CNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEMRRKKETLGQAGPGLAPPV 384 >ref|XP_002305457.2| hypothetical protein POPTR_0004s16860g [Populus trichocarpa] gi|550341195|gb|EEE85968.2| hypothetical protein POPTR_0004s16860g [Populus trichocarpa] Length = 327 Score = 227 bits (578), Expect = 1e-56 Identities = 154/339 (45%), Positives = 189/339 (55%), Gaps = 14/339 (4%) Frame = -3 Query: 1449 CVGIKALKTSFWPEMSLKSNHPQVICEDIWXXXXXXXXXXXDLF-VEGLLDFSNEDVHVG 1273 C+ +ALK+S E++ KS Q I ED + F V+ LDFSN + G Sbjct: 4 CMETRALKSSLRNELATKSTQ-QAISEDFFAFNASAVVSSDQDFSVDCFLDFSNGEFKDG 62 Query: 1272 FV-EDEDKDSLSASSSQEGQNTSHTRSGFSLKDEPMSAPESELAVPTDDLADLEWLSHFV 1096 + E+E+KDSLS SS + ++ S S D +S SELAVPTDD+A+LEW+SHFV Sbjct: 63 YAQEEEEKDSLSVSSQDRVDDDFNSNSS-SFSDSFLS---SELAVPTDDIAELEWVSHFV 118 Query: 1095 DDSFSEFSLQHVT---KNPSEKKTQTSPKQNRPESALLKRPIGFLTSFPAKSKRSKRLRT 925 +DS S+ SL K S K + P+ P+ +L K P F P+K+ R+KR R Sbjct: 119 
NDSLSDVSLLVPACKGKPESHAKNRFEPE---PKPSLAKTPGFFPPRVPSKA-RTKRSRR 174 Query: 924 TVGRVWSFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTGVQPLDLVNGAGKPPGKKQKRK 745 T GR WS VQ +D ++ +PP KK K++ Sbjct: 175 T-GRTWSGRSNQTETPSSSASSTSSMPCLVSANT-----VQTIDSLSWLSEPPMKKPKKR 228 Query: 744 LGAELP--------QRRCSHCGVVKTPQWRTGPKGAKTLCNACGVRYKSGRLLPEYRPAC 589 + QRRCSHC V KTPQWRTGP GAKTLCNACGVRYKSGRL PEYRPAC Sbjct: 229 PAVQTSGITAAPQFQRRCSHCQVQKTPQWRTGPHGAKTLCNACGVRYKSGRLFPEYRPAC 288 Query: 588 SPTFSSEVHSNNHRKVLEMRRKK-MGLPEPSSVPAVQSF 475 SPTFSSEVHSN+HRKVLEMRRKK MG PE V SF Sbjct: 289 SPTFSSEVHSNSHRKVLEMRRKKEMGGPESRLNQMVPSF 327 >ref|XP_002865076.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata] gi|297310911|gb|EFH41335.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata] Length = 339 Score = 221 bits (564), Expect = 6e-55 Identities = 147/329 (44%), Positives = 184/329 (55%), Gaps = 16/329 (4%) Frame = -3 Query: 1434 ALKTSFWPEMSLKSNHPQVICEDIWXXXXXXXXXXXDLFVEGLLDFSNEDVHVGFVEDED 1255 ALK+S EM+ K+ P V E + D V+ LLD SN+DV ED D Sbjct: 5 ALKSSIRKEMAFKTT-PPVYEEFLAVTTAPNGFSADDFSVDDLLDLSNDDVFAD--EDTD 61 Query: 1254 ----KDSLSASSSQEGQNTSHTRSGFSLK--DEPMSAPESELAVPTDDLADLEWLSHFVD 1093 +D + SS + + R L D+ S P SEL+VP DDLA+LEWLSHFVD Sbjct: 62 PKAQQDMVRVSSEEPNDDGDALRRSSDLSGCDDFGSLPTSELSVPADDLANLEWLSHFVD 121 Query: 1092 DSFSEFSLQHVTKNPSEKKTQTSPKQNRPESALLKRPIGFLTSFPAKSKRSKRLRTTVGR 913 DSF+E+S ++T P+EK + + + P + + F + PAK+ RSKR R V + Sbjct: 122 DSFTEYSGPNLTGTPTEKPSWLTGDRKHPVTPATEESC-FKSPVPAKA-RSKRNRNGV-K 178 Query: 912 VWSFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTGVQPLDLVNGAGKPPGKKQKRKLGAE 733 VWS +G + L+ V + +PP K+ +K AE Sbjct: 179 VWSL---GSSSSSGPSSSGSTSSSSSRPSSPWFSGAEMLEPVVTSERPPFPKKHKKRSAE 235 Query: 732 ----------LPQRRCSHCGVVKTPQWRTGPKGAKTLCNACGVRYKSGRLLPEYRPACSP 583 PQRRCSHCGV KTPQWR GP GAKTLCNACGVRYKSGRLLPEYRPACSP Sbjct: 236 SVFCGQLQQLQPQRRCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKSGRLLPEYRPACSP 295 Query: 582 TFSSEVHSNNHRKVLEMRRKKMGLPEPSS 496 TFSSE+HSN+HRKV+EMRRKK EP+S Sbjct: 296 TFSSELHSNHHRKVMEMRRKK----EPTS 320 
>ref|XP_006425558.1| hypothetical protein CICLE_v10025844mg [Citrus clementina] gi|568825030|ref|XP_006466892.1| PREDICTED: GATA transcription factor 5-like [Citrus sinensis] gi|557527548|gb|ESR38798.1| hypothetical protein CICLE_v10025844mg [Citrus clementina] Length = 381 Score = 221 bits (562), Expect = 1e-54 Identities = 155/357 (43%), Positives = 194/357 (54%), Gaps = 29/357 (8%) Frame = -3 Query: 1461 ESMECVGIKALKTSFWPEMSLKSNHPQVICEDIWXXXXXXXXXXXDLFVEGLLDFSNEDV 1282 + MECV ALKTS EM+LK + PQ + ++I D FV+ LLDFSN+DV Sbjct: 40 QDMECVEA-ALKTSLRKEMALKLS-PQAV-DEICAVNLPNGVACDDFFVDDLLDFSNDDV 96 Query: 1281 HVGFVEDEDKDSLSASSSQEGQNTS-HTRSGFSLKDEPMSA----------PESELAVPT 1135 ++ L ++G+ HT + S +D+ + P SELAVPT Sbjct: 97 VA------EQQQLQEPQQEKGEEQKKHTLTVCSKQDQDLDERLNFDDLGPIPTSELAVPT 150 Query: 1134 DDLADLEWLSHFVDDSFSEFSLQHVTKN-PSEKKTQTSPKQNRPESALLKRPIGFLTSFP 958 DD+A+LEWLSHFV+DSF+E+S P + K + +++P A+ F T P Sbjct: 151 DDVANLEWLSHFVEDSFAEYSSPFPAGTLPVKAKENGAEPEHKPALAIHC----FKTPIP 206 Query: 957 AKSKRSKRLRTTVGRVWSFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTG----VQPLDL 790 AK+ RSKR RT + R+WS G ++P + Sbjct: 207 AKA-RSKRSRTGL-RIWSLGSPSLSDSSSTSSASSSSSPSSPWPVSTNPGSLASLRPAEP 264 Query: 789 VNGAGKPPGKKQKRK-------LGAELP----QRRCSHCGVVKTPQWRTGPKGAKTLCNA 643 KPP KK K+K G + RRCSHCGV KTPQWRTGP GAKTLCNA Sbjct: 265 F--IVKPPKKKLKKKSPPEGYNAGGNISWGQFTRRCSHCGVQKTPQWRTGPLGAKTLCNA 322 Query: 642 CGVRYKSGRLLPEYRPACSPTFSSEVHSNNHRKVLEMRRKKMGL--PEPSSVPAVQS 478 CGVRYKSGRL PEYRPACSPTFSSE+HSN+HRKV+EMRRKK GL EP PAV S Sbjct: 323 CGVRYKSGRLFPEYRPACSPTFSSELHSNHHRKVMEMRRKKEGLGRTEPGLAPAVVS 379 >ref|XP_006425559.1| hypothetical protein CICLE_v10025844mg [Citrus clementina] gi|557527549|gb|ESR38799.1| hypothetical protein CICLE_v10025844mg [Citrus clementina] Length = 340 Score = 220 bits (560), Expect = 2e-54 Identities = 155/355 (43%), Positives = 193/355 (54%), Gaps = 29/355 (8%) Frame = -3 Query: 1455 MECVGIKALKTSFWPEMSLKSNHPQVICEDIWXXXXXXXXXXXDLFVEGLLDFSNEDVHV 1276 MECV ALKTS EM+LK + PQ + ++I D FV+ LLDFSN+DV Sbjct: 1 
MECVEA-ALKTSLRKEMALKLS-PQAV-DEICAVNLPNGVACDDFFVDDLLDFSNDDVVA 57 Query: 1275 GFVEDEDKDSLSASSSQEGQNTS-HTRSGFSLKDEPMSA----------PESELAVPTDD 1129 ++ L ++G+ HT + S +D+ + P SELAVPTDD Sbjct: 58 ------EQQQLQEPQQEKGEEQKKHTLTVCSKQDQDLDERLNFDDLGPIPTSELAVPTDD 111 Query: 1128 LADLEWLSHFVDDSFSEFSLQHVTKN-PSEKKTQTSPKQNRPESALLKRPIGFLTSFPAK 952 +A+LEWLSHFV+DSF+E+S P + K + +++P A+ F T PAK Sbjct: 112 VANLEWLSHFVEDSFAEYSSPFPAGTLPVKAKENGAEPEHKPALAIHC----FKTPIPAK 167 Query: 951 SKRSKRLRTTVGRVWSFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTG----VQPLDLVN 784 + RSKR RT + R+WS G ++P + Sbjct: 168 A-RSKRSRTGL-RIWSLGSPSLSDSSSTSSASSSSSPSSPWPVSTNPGSLASLRPAEPF- 224 Query: 783 GAGKPPGKKQKRK-------LGAELP----QRRCSHCGVVKTPQWRTGPKGAKTLCNACG 637 KPP KK K+K G + RRCSHCGV KTPQWRTGP GAKTLCNACG Sbjct: 225 -IVKPPKKKLKKKSPPEGYNAGGNISWGQFTRRCSHCGVQKTPQWRTGPLGAKTLCNACG 283 Query: 636 VRYKSGRLLPEYRPACSPTFSSEVHSNNHRKVLEMRRKKMGL--PEPSSVPAVQS 478 VRYKSGRL PEYRPACSPTFSSE+HSN+HRKV+EMRRKK GL EP PAV S Sbjct: 284 VRYKSGRLFPEYRPACSPTFSSELHSNHHRKVMEMRRKKEGLGRTEPGLAPAVVS 338 >ref|NP_201433.1| GATA transcription factor 5 [Arabidopsis thaliana] gi|42573812|ref|NP_975002.1| GATA transcription factor 5 [Arabidopsis thaliana] gi|71660777|sp|Q9FH57.1|GATA5_ARATH RecName: Full=GATA transcription factor 5 gi|10177426|dbj|BAB10711.1| GATA-binding transcription factor-like protein [Arabidopsis thaliana] gi|22531223|gb|AAM97115.1| GATA-binding transcription factor-like protein [Arabidopsis thaliana] gi|34098855|gb|AAQ56810.1| At5g66320 [Arabidopsis thaliana] gi|332010815|gb|AED98198.1| GATA transcription factor 5 [Arabidopsis thaliana] gi|332010816|gb|AED98199.1| GATA transcription factor 5 [Arabidopsis thaliana] Length = 339 Score = 219 bits (558), Expect = 3e-54 Identities = 144/331 (43%), Positives = 186/331 (56%), Gaps = 18/331 (5%) Frame = -3 Query: 1434 ALKTSFWPEMSLKSNHPQVICEDIWXXXXXXXXXXXDLFVEGLLDFSNEDVHVGFVEDED 1255 ALK+S EM+LK+ P V E + D V+ LLD SN+DV DE+ Sbjct: 5 ALKSSVRKEMALKTTSP-VYEEFLAVTTAQNGFSVDDFSVDDLLDLSNDDVFA----DEE 59 Query: 1254 
KD--------SLSASSSQEGQNTSHTRSGFSLKDEPMSAPESELAVPTDDLADLEWLSHF 1099 D +S+ + + S FS D+ S P SEL++P DDLA+LEWLSHF Sbjct: 60 TDLKAQHEMVRVSSEEPNDDGDALRRSSDFSGCDDFGSLPTSELSLPADDLANLEWLSHF 119 Query: 1098 VDDSFSEFSLQHVTKNPSEKKTQTSPKQNRPESALLKRPIGFLTSFPAKSKRSKRLRTTV 919 V+DSF+E+S ++T P+EK + + P +A+ + F + PAK+ RSKR R + Sbjct: 120 VEDSFTEYSGPNLTGTPTEKPAWLTGDRKHPVTAVTEETC-FKSPVPAKA-RSKRNRNGL 177 Query: 918 GRVWSFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTGVQPLDLVNGAGKPPGKKQKRKLG 739 +VWS +G + L+ V + +PP K+ +K Sbjct: 178 -KVWSL---GSSSSSGPSSSGSTSSSSSGPSSPWFSGAELLEPVVTSERPPFPKKHKKRS 233 Query: 738 AE----------LPQRRCSHCGVVKTPQWRTGPKGAKTLCNACGVRYKSGRLLPEYRPAC 589 AE PQR+CSHCGV KTPQWR GP GAKTLCNACGVRYKSGRLLPEYRPAC Sbjct: 234 AESVFSGELQQLQPQRKCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKSGRLLPEYRPAC 293 Query: 588 SPTFSSEVHSNNHRKVLEMRRKKMGLPEPSS 496 SPTFSSE+HSN+HRKV+EMRRKK EP+S Sbjct: 294 SPTFSSELHSNHHRKVIEMRRKK----EPTS 320 >ref|XP_004287842.1| PREDICTED: GATA transcription factor 5-like [Fragaria vesca subsp. vesca] Length = 333 Score = 219 bits (557), Expect = 4e-54 Identities = 155/337 (45%), Positives = 181/337 (53%), Gaps = 19/337 (5%) Frame = -3 Query: 1455 MECVGIKALKTSFWPEMSLKSNHPQVICEDIWXXXXXXXXXXXD--LFVEGLLDFSNEDV 1282 MECV ALKTS EM++K V + +W V+ LLDFSN+D Sbjct: 1 MECV---ALKTSIRTEMAVKE---AVFDDLLWGLNAQNGGVQNCEDFSVDDLLDFSNDD- 53 Query: 1281 HVGFVEDED-----KDSL--SASSSQEGQNTSHTRSGFSLKDE---PMSAPESELAVPTD 1132 GFVE E+ KDS+ S+ E + S S S K+E + P SEL VP D Sbjct: 54 --GFVEQEEQEDDKKDSVLPKKESTVEEKENSTPSSCVSEKNELGPEPAEPTSELTVPAD 111 Query: 1131 DLADLEWLSHFVDDSFSEFSLQHVTKNPSEKKTQTSPKQNRPESALLKRPIGFLTSFPAK 952 DL +LEWLSHFV+DSFS F+ + K + RPE LK F T PAK Sbjct: 112 DLENLEWLSHFVEDSFSGFNASLPAGFMAVKP------EKRPEPEALKPC--FKTPVPAK 163 Query: 951 SKRSKRLRTTVGRVWSFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTGVQPLDLVNGAGK 772 + RSKR RT GRVWS Q L + + Sbjct: 164 A-RSKRTRTG-GRVWSLGSPSFTETSSSSSSSSSTSSCPSSPWLIYNPTQGLGGFGSSVE 221 Query: 771 PPGKKQKRKL-----GAELPQRRCSHCGVVKTPQWRTGPKGAKTLCNACGVRYKSGRLLP 607 P KK KR G+ P RRCSHCGV KTPQWRTGP 
GAKTLCNACGVRYKSGRL+P Sbjct: 222 KPQKKPKRPATTEGGGSSQPPRRCSHCGVQKTPQWRTGPNGAKTLCNACGVRYKSGRLVP 281 Query: 606 EYRPACSPTFSSEVHSNNHRKVLEMRRKKMGL--PEP 502 EYRPACSPTFSSE+HSN+HRKV+E+RRKK G PEP Sbjct: 282 EYRPACSPTFSSELHSNHHRKVMEIRRKKEGPAGPEP 318 >ref|XP_006280758.1| hypothetical protein CARUB_v10026725mg [Capsella rubella] gi|565433824|ref|XP_006280759.1| hypothetical protein CARUB_v10026725mg [Capsella rubella] gi|482549462|gb|EOA13656.1| hypothetical protein CARUB_v10026725mg [Capsella rubella] gi|482549463|gb|EOA13657.1| hypothetical protein CARUB_v10026725mg [Capsella rubella] Length = 342 Score = 215 bits (548), Expect = 4e-53 Identities = 145/333 (43%), Positives = 182/333 (54%), Gaps = 20/333 (6%) Frame = -3 Query: 1434 ALKTSFWPEMSLKSNHPQVICEDIWXXXXXXXXXXXDLF-VEGLLDFSNEDVHVGFVEDE 1258 ALK+S EM+ KS P + ED D F V+ LLD SN+DV D+ Sbjct: 5 ALKSSIRKEMAFKSTLP--VYEDYLSVTTAQNGFSPDDFSVDDLLDLSNDDVFA----DD 58 Query: 1257 DKD---------SLSASSSQEGQNTSHTRSGFSLK---DEPMSAPESELAVPTDDLADLE 1114 D D +S+ +E + G +L D S P SEL+VP DDLA+LE Sbjct: 59 DTDLKPQDPVMVRVSSEEEEEEEEEELNDDGDALPRCIDFSGSLPTSELSVPADDLANLE 118 Query: 1113 WLSHFVDDSFSEFSLQHVTKNPSEKKTQTSPKQNRPESALLKRPIGFLTSFPAKSKRSKR 934 WLSHFV+DSF+E+S ++T P+EK + + P + + F + PAK+ RSKR Sbjct: 119 WLSHFVEDSFTEYSGPNLTGTPTEKPAWLTGDRKHPVTPATQESC-FKSPVPAKA-RSKR 176 Query: 933 LRTTVGRVWSFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTGVQPLDLVNGAGKPPGKKQ 754 R V + WS G +G + + + +PP K+ Sbjct: 177 HRNGV-KAWSL-GSSSSSGPSSSGSTSSSSSSSGPSSPWFSGADLFEPMVASERPPFPKK 234 Query: 753 KRKLGAEL-------PQRRCSHCGVVKTPQWRTGPKGAKTLCNACGVRYKSGRLLPEYRP 595 +K AE PQRRCSHCGV KTPQWR GP GAKTLCNACGVRYKSGRLLPEYRP Sbjct: 235 HKKRSAESAFCGQLQPQRRCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKSGRLLPEYRP 294 Query: 594 ACSPTFSSEVHSNNHRKVLEMRRKKMGLPEPSS 496 ACSPTFSSE+HSN+HRKV+EMRRKK EP+S Sbjct: 295 ACSPTFSSELHSNHHRKVMEMRRKK----EPTS 323 >ref|XP_002313763.2| hypothetical protein POPTR_0009s12620g [Populus trichocarpa] gi|550331601|gb|EEE87718.2| hypothetical protein POPTR_0009s12620g [Populus trichocarpa] Length = 329 
Score = 213 bits (543), Expect = 2e-52 Identities = 142/333 (42%), Positives = 180/333 (54%), Gaps = 12/333 (3%) Frame = -3 Query: 1437 KALKTSFWPEMSLKSNHPQVICEDIWXXXXXXXXXXXDLF-VEGLLDFSNEDVHVGFVED 1261 +ALK+S E+ K+ Q CED F V+ LDFSN + + G+V++ Sbjct: 8 RALKSSLLRELDTKTTSEQAFCEDFLALNTPGVVSFDQDFSVDCFLDFSNGEFNDGYVQE 67 Query: 1260 --EDKDSLSASSSQEGQNTSHTRSGFSLKDEPMSAPESELAVPTDDLADLEWLSHFVDDS 1087 E+KDS+S SS + ++ S S D ++ SELAVPTDD+A+LEW+SHFVDDS Sbjct: 68 QEEEKDSISVSSQDRVDDDFNSNSS-SFSDSFLA---SELAVPTDDIAELEWVSHFVDDS 123 Query: 1086 FSEFSLQHVTKNPSEKKTQTSPKQNRPESALLKRPIGFLTSFPAKSKRSKRLRTTVGRVW 907 S+ SL S K+ + + + K F + P+K+ R+KR R T GR W Sbjct: 124 VSDVSLLVPACKGSSKRHAKNRFEPETKPTFAKTSCLFPSRVPSKA-RTKRSRPT-GRTW 181 Query: 906 SFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTGVQPLDLVNGAGKPPGKKQKRK------ 745 S VQ D ++ + P K K++ Sbjct: 182 SAGSNQSETPSSSTSSTSSMPCLVATNT-----VQTADSLSWLSEQPMKISKKRPAVHTS 236 Query: 744 --LGAELPQRRCSHCGVVKTPQWRTGPKGAKTLCNACGVRYKSGRLLPEYRPACSPTFSS 571 + + QRRCSHC V KTPQWRTGP GAKTLCNACGVRYKSGRL PEYRPACSPTFSS Sbjct: 237 GLMASTQFQRRCSHCQVQKTPQWRTGPLGAKTLCNACGVRYKSGRLFPEYRPACSPTFSS 296 Query: 570 EVHSNNHRKVLEMRRKK-MGLPEPSSVPAVQSF 475 EVHSN+HRKVLEMRRKK + EP V SF Sbjct: 297 EVHSNSHRKVLEMRRKKEVAGAEPRLNQMVPSF 329 >gb|ADL36697.1| GATA domain class transcription factor [Malus domestica] Length = 321 Score = 213 bits (542), Expect = 2e-52 Identities = 140/335 (41%), Positives = 179/335 (53%), Gaps = 10/335 (2%) Frame = -3 Query: 1449 CVGIKALKTSFWPEMSLKSNHPQVICEDIWXXXXXXXXXXXDLFVEGLLDFSNEDVHVGF 1270 C+ KALK+S E+++KS V+ E++W D V+ LLD SN + G Sbjct: 4 CIEAKALKSSLRRELAVKSTQ-HVLLEELWCATGISGVPCEDFSVDDLLDLSNGEFEDGS 62 Query: 1269 VEDEDKDSLSASSSQEGQNTSHTRSGFSLKDEPMSAPESELAVPTDDLADLEWLSHFVDD 1090 VE+E+++ S S E N+S L D S ++L VP DDLA+LEW+SHFVDD Sbjct: 63 VEEEEEEKESVSVDDEISNSS----SLVLPDSD-SGLATQLLVPDDDLAELEWVSHFVDD 117 Query: 1089 SFSEFSLQHVTKNPSEKKTQTSPKQNRPESALLKRPIGFLTSFPAKSKRSKRLRTTVGRV 910 S + SL H + + + P+ L+ P+ F P K R+KR + RV Sbjct: 118 
SLPDLSLFHTIGTQKPEALLMNRFEPEPKPVPLRAPL-FPFQVPVKP-RTKRYKPA-SRV 174 Query: 909 WSFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTGVQPLDLVNGAGKPPGKKQKRK----- 745 WS VQ +D+ G+P KKQK+K Sbjct: 175 WSSSSSCSPSSSPCSSGFSFSTPCLIFNP-----VQSMDVF--VGEPAAKKQKKKPAVQT 227 Query: 744 ----LGAELPQRRCSHCGVVKTPQWRTGPKGAKTLCNACGVRYKSGRLLPEYRPACSPTF 577 +G + QRRCSHC V KTPQWRTGP G KTLCNACGVR+KSGRL PEYRPACSPTF Sbjct: 228 GEGSIGGQF-QRRCSHCQVQKTPQWRTGPLGPKTLCNACGVRFKSGRLFPEYRPACSPTF 286 Query: 576 SSEVHSNNHRKVLEMR-RKKMGLPEPSSVPAVQSF 475 S VHSN+HRKVLEMR RK +G PEP ++SF Sbjct: 287 SGAVHSNSHRKVLEMRKRKDVGEPEPLLNRMIRSF 321 >ref|XP_004230938.1| PREDICTED: GATA transcription factor 5-like [Solanum lycopersicum] Length = 325 Score = 213 bits (541), Expect = 3e-52 Identities = 138/320 (43%), Positives = 172/320 (53%), Gaps = 8/320 (2%) Frame = -3 Query: 1455 MECVGIKALKTSFWPEMSLKSNHPQVICEDIWXXXXXXXXXXXDLFVEGLLDFSNEDVHV 1276 ME + +ALK+SF +M++K N QV +DIW D V+ LLDFS++D Sbjct: 1 MELIEARALKSSFLSDMAMK-NTQQVFLDDIWCVTGINNGASEDFSVDDLLDFSDKDFKD 59 Query: 1275 GFVEDEDKDSLSASSSQEGQNTSHTRSGFSLKDEPMSAPESELAVPTDDLADLEWLSHFV 1096 + ++D+ + + SSQ+ + T SG E + EL +P DD+ +LEWLS FV Sbjct: 60 PELHEDDEKTSFSGSSQKRNSQDSTFSGM----ESFGSLAGELPIPVDDMENLEWLSQFV 115 Query: 1095 DDSFSEFSLQHVTKNPSEKKTQTSPKQNRPESALLKRPIG-------FLTSFPAKSKRSK 937 DD+ SEFSL T++ +K + ++ P + RP+ F FP K RSK Sbjct: 116 DDTPSEFSLLCPTESFKDKTGGFTESRSEP----VVRPVVKKTRVPCFPLPFPVKP-RSK 170 Query: 936 RLRTTVGRVWSFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTGVQPLDLVNGAGKPPGKK 757 R R GR WSF V DL KPP KK Sbjct: 171 RSRQA-GRTWSFPSSAVSGDSSSPTSSSYGSSPFPSGFFTNP-VYDGDLFCSVEKPPLKK 228 Query: 756 QKRKLGAELPQ-RRCSHCGVVKTPQWRTGPKGAKTLCNACGVRYKSGRLLPEYRPACSPT 580 K+ E RRC+HC V KTPQWR GP G KTLCNACGVRYKSGRL PEYRPACSPT Sbjct: 229 PKKNPSVETGSGRRCTHCQVQKTPQWRAGPLGPKTLCNACGVRYKSGRLFPEYRPACSPT 288 Query: 579 FSSEVHSNNHRKVLEMRRKK 520 FS EVHSN+HRKVLEMRRKK Sbjct: 289 FSLEVHSNSHRKVLEMRRKK 308 >ref|XP_006340696.1| PREDICTED: GATA transcription factor 5-like [Solanum tuberosum] Length = 339 Score = 211 bits (537), Expect = 
8e-52 Identities = 150/352 (42%), Positives = 184/352 (52%), Gaps = 25/352 (7%) Frame = -3 Query: 1455 MECVGIKALKTSFWPEMSLKSNHPQVICEDIWXXXXXXXXXXXD-LFVEGLLDFSNEDVH 1279 M+CV AL+ SF PE LK Q +D+ D FV+ LLDFSN V Sbjct: 1 MDCVK-GALRNSFVPETPLKMTQNQTFGDDLSAAGAGQNGVSGDDFFVDDLLDFSNGFVE 59 Query: 1278 VGFVEDEDKD--------------SLSASSSQEGQNTSHTRSGFSLKDEPMSAPESELAV 1141 E+E K+ S+S S ++ + + S+K++ S P SE++V Sbjct: 60 GEGEEEEGKNQGGEDISVQKPCSVSISVSPLKKTEIDDKDKVTISVKEDFSSLPVSEISV 119 Query: 1140 PTDDLADLEWLSHFVDDSFSEFSLQHVTKNPSEKKTQTSPKQNRPESALLKRPIGFLTSF 961 PTDDL LEWLSHFV+DSFS +SL + K + K E + ++ F T Sbjct: 120 PTDDLDSLEWLSHFVEDSFSGYSLAYPAG-----KLEVEKKTGDGEIPVEEKKPCFATPV 174 Query: 960 PAKSKRSKRLRTTVGRVW-SFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTGVQPLDLVN 784 K+ R+KR RT+V R W + G P+ Sbjct: 175 QTKA-RTKRGRTSV-RFWPACSGSLTDSSSSSTSSSSTTTMSSSPTASWFLYPTPVHSAE 232 Query: 783 GAGKPPGKKQKRKL---GAELPQ--RRCSHCGVVKTPQWRTGPKGAKTLCNACGVRYKSG 619 GKP KK K+K G PQ RRCSHCGV KTPQWR GP GAKTLCNACGVR+KSG Sbjct: 233 SPGKPLAKKLKKKPAPHGGNGPQQPRRCSHCGVQKTPQWRAGPMGAKTLCNACGVRFKSG 292 Query: 618 RLLPEYRPACSPTFSSEVHSNNHRKVLEMRRKK----MGLPEPSSVPAVQSF 475 RLLPEYRPACSPTFS+E+HSNNHRKVLEMRRKK GL +P VQSF Sbjct: 293 RLLPEYRPACSPTFSTELHSNNHRKVLEMRRKKESEETGLAQP-----VQSF 339 >ref|XP_006362004.1| PREDICTED: GATA transcription factor 5-like [Solanum tuberosum] Length = 325 Score = 211 bits (536), Expect = 1e-51 Identities = 138/320 (43%), Positives = 171/320 (53%), Gaps = 8/320 (2%) Frame = -3 Query: 1455 MECVGIKALKTSFWPEMSLKSNHPQVICEDIWXXXXXXXXXXXDLFVEGLLDFSNEDVHV 1276 ME + +ALK+SF +MS+K N QV +DIW D V+ LLDFS++D Sbjct: 1 MELIEARALKSSFLSDMSMK-NTQQVFLDDIWCVTGINNGASEDFSVDDLLDFSDKDFKD 59 Query: 1275 GFVEDEDKDSLSASSSQEGQNTSHTRSGFSLKDEPMSAPESELAVPTDDLADLEWLSHFV 1096 + ++D+ + + SSQ + T SG E + EL +P D++ +LEWLS FV Sbjct: 60 PELHEDDEKTSFSGSSQNRNSQDSTFSGM----ESFGSLAGELPIPVDEMENLEWLSQFV 115 Query: 1095 DDSFSEFSLQHVTKNPSEKKTQTSPKQNRPESALLKRPIG-------FLTSFPAKSKRSK 937 DD+ SEFSL ++ +K + ++ P + RP+ F FP K RSK Sbjct: 116 
DDTPSEFSLLCPAESFKDKTGDFTEFRSEP----VVRPVVKKMRVPCFPLPFPVKP-RSK 170 Query: 936 RLRTTVGRVWSFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTGVQPLDLVNGAGKPPGKK 757 R R GR WSF V DL KPP KK Sbjct: 171 RSRPA-GRTWSFPSSTVSGDSSSPTSSSYGSSPFPSGFFTNP-VYDGDLFCSVEKPPLKK 228 Query: 756 QKRKLGAELPQ-RRCSHCGVVKTPQWRTGPKGAKTLCNACGVRYKSGRLLPEYRPACSPT 580 K+ AE RRC+HC V KTPQWR GP G KTLCNACGVRYKSGRL PEYRPACSPT Sbjct: 229 PKKNPSAETGSGRRCTHCQVQKTPQWRAGPLGPKTLCNACGVRYKSGRLYPEYRPACSPT 288 Query: 579 FSSEVHSNNHRKVLEMRRKK 520 FS EVHSN+HRKVLEMRRKK Sbjct: 289 FSLEVHSNSHRKVLEMRRKK 308 >gb|EYU24084.1| hypothetical protein MIMGU_mgv1a009982mg [Mimulus guttatus] Length = 325 Score = 208 bits (529), Expect = 7e-51 Identities = 152/349 (43%), Positives = 182/349 (52%), Gaps = 22/349 (6%) Frame = -3 Query: 1455 MECVGIKALKTSFWPEMSLKSNHPQVICED-IWXXXXXXXXXXXDLFVEGLLDFSNEDVH 1279 MECV AL F PE KS ED + DLFV+ LLDFSN+ Sbjct: 1 MECVQ-GALVGGFEPETVFKST--AAFMEDFLGSNGVPNAVSGDDLFVDELLDFSNDFSE 57 Query: 1278 VGFVEDEDKD----------SLSASSSQEGQNTSHTRSGFSLKDEPMSAPESELAVPTDD 1129 +E+++K + S SQ+ Q T +SG S D+ S PE+EL +P + Sbjct: 58 EEEIEEDEKPLQPEEFEHHKNKFCSVSQQMQ-TPPEKSGLSANDDFDSLPETELPLPAEG 116 Query: 1128 LADLEWLSHFVDDSFSEFSLQHVTKNPSEKKTQTSPKQNRPESALLKRPIGFLTSFPAKS 949 L LEWLSHFV+DSFS++SL K +P RPE+ + T+ Sbjct: 117 LESLEWLSHFVEDSFSDYSLTG--------KLPPNPASKRPETVTAAQEQPCFTTPVQTK 168 Query: 948 KRSKRLRTTVGRVWSFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTGVQPLDLVNGAG-- 775 R+KR RT V RVW T + P G Sbjct: 169 ARTKRARTGV-RVWPV-----------LSPSFTESSTSSSSSSSTTSLSPQYAWTGESFL 216 Query: 774 --KPPGKKQKRKL---GAELPQ--RRCSHCGVVKTPQWRTGPKGAKTLCNACGVRYKSGR 616 PP KKQK+K GA +P RRCSHCGV KTPQWR GP G+KTLCNACGVR+KSGR Sbjct: 217 GYPPPVKKQKKKRADSGAAVPAQPRRCSHCGVTKTPQWRAGPLGSKTLCNACGVRFKSGR 276 Query: 615 LLPEYRPACSPTFSSEVHSNNHRKVLEMRRKKMGLPE-PSSV-PAVQSF 475 LLPEYRPACSPTFS+E+HSNNHRKVLEMRRKK E P+ V P VQSF Sbjct: 277 LLPEYRPACSPTFSTEMHSNNHRKVLEMRRKKESETEAPAGVGPPVQSF 325 >ref|XP_007042041.1| GATA transcription factor 5, putative [Theobroma cacao] gi|508705976|gb|EOX97872.1| GATA 
transcription factor 5, putative [Theobroma cacao] Length = 322 Score = 208 bits (529), Expect = 7e-51 Identities = 134/336 (39%), Positives = 178/336 (52%), Gaps = 11/336 (3%) Frame = -3 Query: 1449 CVGIKALKTSFWPEMSLKSNHPQVICEDIWXXXXXXXXXXXDLFVEGLLDFSNEDVHVGF 1270 C+ +ALK+S E++++ + + ++ V+ L+F+N + F Sbjct: 4 CMEARALKSSVRGELAMQRTQHAALDDILYMNGAAPGEDFS---VDCFLNFNNGE----F 56 Query: 1269 VEDEDKDSLSASSSQE--GQNTSHTRSGFSLKDEPMSAPESELAVPTDDLADLEWLSHFV 1096 E+E KDS S SS + +++ S FS S +EL+VP D++A LEW+SHFV Sbjct: 57 EEEEQKDSFSVSSEERVADDDSNSNSSSFSFD----SLLTNELSVPDDEIAGLEWVSHFV 112 Query: 1095 DDSFSEFSLQHVTKNPSEKKTQTSPKQNRPESALLKRPIGFLTSFPAKSKRSKRLRTTVG 916 DDSF E + P + + PE +K P F ++ P+K+ RSKR ++T G Sbjct: 113 DDSFPELPILCPVFKPQSDGHAKTLFETEPELVFMKTP-SFSSTVPSKA-RSKRAKST-G 169 Query: 915 RVWSFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTGVQPLDLVNGAGKPPGKKQKRKLGA 736 R WS VQ DL N +PP KKQK+K Sbjct: 170 RTWSVGSMPLSESSSSTITSSSTSSGFSVTSA---NVQETDLANDFTEPPTKKQKKKPAV 226 Query: 735 ELP--------QRRCSHCGVVKTPQWRTGPKGAKTLCNACGVRYKSGRLLPEYRPACSPT 580 + QRRCSHC V KTPQWRTGP GAKTLCNACGVRYKSGRL PEYRPACSPT Sbjct: 227 QASGLSSGNPFQRRCSHCQVQKTPQWRTGPLGAKTLCNACGVRYKSGRLFPEYRPACSPT 286 Query: 579 FSSEVHSNNHRKVLEMR-RKKMGLPEPSSVPAVQSF 475 FS ++HSN+HRKVLEMR RK++ EP + SF Sbjct: 287 FSGDIHSNSHRKVLEMRKRKEVAGQEPELTRMIPSF 322 >dbj|BAC98494.1| AG-motif binding protein-4 [Nicotiana tabacum] Length = 326 Score = 206 bits (525), Expect = 2e-50 Identities = 141/322 (43%), Positives = 165/322 (51%), Gaps = 10/322 (3%) Frame = -3 Query: 1455 MECVGIKALKTSFWPEMSLKSNHPQVICEDIWXXXXXXXXXXXDLFVEGLLDFSNEDVHV 1276 ME + +ALK+SF +M++K++ QV +DIW D V+ LLDFS++D Sbjct: 1 MELIEARALKSSFLSDMAMKTSQ-QVFLDDIWCVAGINNVPSDDFSVDDLLDFSDKDFKD 59 Query: 1275 G-----FVEDEDKDSLSASSSQEGQNTSHTRSGFSLKDEPMSAPESELAVPTDDLADLEW 1111 G ED++KDS S SS S+ FS D + EL VP D+L +LEW Sbjct: 60 GQSLQELHEDDEKDSFSGSSQHRNSQVSN----FSCMD----SFSGELPVPVDELENLEW 111 Query: 1110 LSHFVDDSFSEFSLQHVTKNPSEK----KTQTSPKQNRPESALLKRPIGFLTSFPAKSKR 943 LS FVDDS SEFSL + +K + S RP LK P L P K Sbjct: 
112 LSQFVDDSTSEFSLLCPAGSFKDKTGGFQVSRSEPVVRPVVQKLKVPCFPL---PVVQKP 168 Query: 942 SKRLRTTVGRVWSFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXTGVQPLDLVNGAGKPPG 763 GR WSF V DL KPP Sbjct: 169 RTYRSRPAGRKWSFSSPTVSADSCSPTSSSYGSSPFPSVLFSNP-VLDGDLFCSVEKPPL 227 Query: 762 KKQKRKLGAELPQ-RRCSHCGVVKTPQWRTGPKGAKTLCNACGVRYKSGRLLPEYRPACS 586 KK K+ AE RRC+HC V KTPQWR GP G KTLCNACGVRYKSGRL PEYRPACS Sbjct: 228 KKPKKLSTAETGSGRRCTHCQVQKTPQWRAGPLGPKTLCNACGVRYKSGRLFPEYRPACS 287 Query: 585 PTFSSEVHSNNHRKVLEMRRKK 520 PTFS EVHSN+HRKVLEMRRKK Sbjct: 288 PTFSQEVHSNSHRKVLEMRRKK 309