BLASTX nr result
ID: Ephedra29_contig00011802
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra29_contig00011802
         (2080 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

XP_006400392.1 hypothetical protein EUTSA_v10013325mg [Eutrema s...    139   1e-31
XP_008456637.1 PREDICTED: uncharacterized protein LOC103496534 i...    134   3e-29
XP_004140922.1 PREDICTED: uncharacterized protein LOC101213190 [...    132   1e-28
XP_010420604.1 PREDICTED: GATA zinc finger domain-containing pro...    129   2e-28
XP_010420607.1 PREDICTED: GATA zinc finger domain-containing pro...    127   1e-27
XP_016499078.1 PREDICTED: mediator of RNA polymerase II transcri...    129   1e-27
XP_009788231.1 PREDICTED: mediator of RNA polymerase II transcri...    129   1e-27
XP_016902016.1 PREDICTED: uncharacterized protein LOC103496534 i...    128   2e-27
GAQ88045.1 Zinc finger domain containing protein [Klebsormidium ...    130   3e-27
XP_016169675.1 PREDICTED: uncharacterized protein LOC107612498 i...    127   4e-27
XP_017984940.1 PREDICTED: uncharacterized protein LOC18586963 is...    127   6e-27
EOY19451.1 Region-like protein isoform 3 [Theobroma cacao]             126   8e-27
EOY19453.1 Region-like protein isoform 5 [Theobroma cacao]             126   8e-27
XP_010492846.1 PREDICTED: GATA zinc finger domain-containing pro...    125   8e-27
NP_001078603.1 GATA zinc finger protein [Arabidopsis thaliana] N...    124   1e-26
XP_002871816.1 hypothetical protein ARALYDRAFT_488722 [Arabidops...    124   1e-26
XP_006287638.1 hypothetical protein CARUB_v10000849mg [Capsella ...    124   2e-26
XP_010454083.1 PREDICTED: GATA zinc finger domain-containing pro...    124   2e-26
XP_007010639.2 PREDICTED: uncharacterized protein LOC18586963 is...
125 3e-26 EOY19449.1 Region-like protein isoform 1 [Theobroma cacao] 125 3e-26 >XP_006400392.1 hypothetical protein EUTSA_v10013325mg [Eutrema salsugineum] ESQ41845.1 hypothetical protein EUTSA_v10013325mg [Eutrema salsugineum] Length = 501 Score = 139 bits (351), Expect = 1e-31 Identities = 131/469 (27%), Positives = 195/469 (41%), Gaps = 19/469 (4%) Frame = -2 Query: 1665 IQNPVALLQISAFLQYMQSFAQMPSPQHQSFSQLPSQGQTGFAQMPSQNGFPQMQTQNHV 1486 + NP+ + + Q+ + + MP Q Q Q Q G P NH+ Sbjct: 61 MNNPIPMQNMPIHPQFFNNLSNMPQQQQQQ-------------QQLHQFGMP-----NHI 102 Query: 1485 NGFAQMPXXXXXXXXXXXXGVENGMPLYGLQGNVQSS--NPQLMMPHSASLVNANFPGNK 1312 N L L GN+Q + N LM HS LV NF Sbjct: 103 NQL-----------------------LPSLLGNLQFAVANNNLMGGHSLPLVQPNF-FQP 138 Query: 1311 ALLPQNFASNGVIEXXXXXXXXXXXXXXXXXXXXXNLASPSKGRPNEQSHGMNGSVSPAV 1132 +L P F S + + +P + NGS S Sbjct: 139 SLEPSPFTSQPQLNSFNSRPYPPVPTPHQNHQLHPPGFPEPRPQPVGNINNTNGSNSKGN 198 Query: 1131 DAKNKLGQESR-----PGYDKKRKNELNGTSQLSPKAKKFNNGPSRFQKHNESLGDTRKG 967 D +NK ++ + G+ + + ++ + + K + G + K L + G Sbjct: 199 DFRNKCTKQQKFKGSGQGFQRSQLHQADNAKKKFG-FNKDHMGKGNYNKMATGLDGSDSG 257 Query: 966 ----------TAKIIETAEDRQWREDRKRNFPTGKNIQKKAEDLQQKAAXXXXXXXXXXX 817 +A + E +QWRE R++N+PT N QKK++ + + Sbjct: 258 RIAKEKKRISSAMFYTSKEIQQWREARRKNWPTKLNAQKKSK--KNVSDCILDDEAKRRR 315 Query: 816 XXXXEILAKQAELGCEVAEIPREYLDDKDGHKAEVKVGNDDTEQNGRKKYQKGHCYLFQK 637 E+LAKQAELG EVAEIP YL + D +V D + NG+ +Y+ G FQ Sbjct: 316 EQLREVLAKQAELGVEVAEIPSHYLSNTDE-----QVNGDRGDNNGQFQYKDGRKGRFQN 370 Query: 636 GRCKKERHCRY--LHVKESTDDDNQNQDLTIPKRKPSLLEKLLSKDIKKEKSQLLQVFRF 463 R K RH K +D N +Q+ +I RKP+LLEKLLS DIK++KSQLLQVFRF Sbjct: 371 NRHNKRRHGGRDKFSKKPRFEDQNSSQESSITMRKPTLLEKLLSADIKRDKSQLLQVFRF 430 Query: 462 IVNNNFFDEWPNKPLEYFPWSKEEPIETVEDVQRKILLQLMNQEQTDSD 316 +V N+FF E P +PL+ P E D + ++ +++ + D D Sbjct: 431 MVINSFFQELPEQPLK-LPLVMVEETGCEHDREEDLISEVLCADLDDDD 478 >XP_008456637.1 PREDICTED: uncharacterized protein LOC103496534 isoform X2 [Cucumis melo] Length = 599 Score = 134 bits (336), Expect = 3e-29 
Identities = 105/327 (32%), Positives = 156/327 (47%), Gaps = 6/327 (1%) Frame = -2 Query: 1113 GQESRPGYDKKRKNELNGTSQLSPKAKKFNNGPSRFQKHNESLGDTRKGTAKIIETAEDR 934 G + G+ +R+N+ GT+ + + + ++ + + E R Sbjct: 308 GGQKEKGFHNERRNKFCGTNS------------------TDQVKEQKRSLSLVYTDQEIR 349 Query: 933 QWREDRKRNFPTGKNIQKKAEDLQQKAAXXXXXXXXXXXXXXXEILAKQAELGCEVAEIP 754 QWRE R++N+P+ NIQKK + Q ILAKQAELG EVAEIP Sbjct: 350 QWREARRKNYPSSTNIQKKLAEKQTNCTLVNQEAQLLRQELKE-ILAKQAELGVEVAEIP 408 Query: 753 REYLDDKDGHKAEVKVGNDDT---EQNGRK-KYQKGHCYLFQKGRCKKERHCRYLHVKES 586 EYL + H + G T E +G + +K L ++GR KK+ R K+ Sbjct: 409 PEYLSYSEKHDNRKQRGGPSTLGEEADGASIEKEKSQNRLNKRGRLKKKNRPR----KKG 464 Query: 585 TDDDNQNQDLTIPKRKPSLLEKLLSKDIKKEKSQLLQVFRFIVNNNFFDEWPNKPLEYFP 406 + + + + KR+P+LL+KLL D++K+KSQLLQ RF+V N+FF EWPNKPL+ FP Sbjct: 465 KFEKHLSNKPPLKKREPTLLQKLLKADVRKDKSQLLQALRFMVMNSFFKEWPNKPLK-FP 523 Query: 405 --WSKEEPIETVEDVQRKILLQLMNQEQTDSDSDCGREEKNLLHSRFTDDSTKKAANVET 232 KE ET + + N ++T+++S L+ + T D N + Sbjct: 524 SVTVKENEGETNVVDETPLSTGNFNLQETNNNS--------LVENNGTHDINSDNEN-DI 574 Query: 231 HDSPTKEXXXXXXXXSLEDIEEGEILD 151 DS E LE EEGEI+D Sbjct: 575 EDSDNDEKLKGDGTQVLE--EEGEIID 599 >XP_004140922.1 PREDICTED: uncharacterized protein LOC101213190 [Cucumis sativus] KGN46066.1 hypothetical protein Csa_6G046440 [Cucumis sativus] Length = 599 Score = 132 bits (332), Expect = 1e-28 Identities = 117/360 (32%), Positives = 171/360 (47%), Gaps = 21/360 (5%) Frame = -2 Query: 1167 SHGMNGSVSPAVDAKNK-LGQESRPGYDKKRKNELNGTSQLSPKAKKFNNGPSRFQK--H 997 S G NGS S + ++ ++ + S+ G+ K N T L + KKF + +K H Sbjct: 263 SDGGNGSNSISNNSAHRNFMRNSKKGFQK------NQTHHLKNEKKKFGFPGGQKEKGFH 316 Query: 996 NE------------SLGDTRKGTAKIIETAEDRQWREDRKRNFPTGKNIQKKAEDLQQKA 853 NE + + ++ + + E RQWRE R++N+P+ NIQKK Q Sbjct: 317 NERRNKFCGTNPTDQVKEQKRSLSLVYTDQEIRQWREARRKNYPSSTNIQKKLTGKQTNC 376 Query: 852 AXXXXXXXXXXXXXXXEILAKQAELGCEVAEIPREYLDDKDGHKAEVKVGNDDT--EQNG 679 ILAKQAELG EVAEIP EYL + H + G T E+ Sbjct: 377 
TLVDKEAKLLRQELKE-ILAKQAELGVEVAEIPPEYLSYSEKHDNRKQRGGRSTLGEEAE 435 Query: 678 RKKYQKGHCY--LFQKGRCKKERHCRYLHVKESTDDDNQNQDLTIPKRKPSLLEKLLSKD 505 +K + L ++GRCKK+ R K+ + + + + KR+P+LL+KLL D Sbjct: 436 EASIEKENSQNRLNKRGRCKKKNRPR----KKGKFEKHLSNKPPLKKREPTLLQKLLKAD 491 Query: 504 IKKEKSQLLQVFRFIVNNNFFDEWPNKPLEYFP--WSKEEPIETVEDVQRKILLQLMNQE 331 ++K+KSQLLQ RF V N+FF EWPNKPL+ FP KE ET + + N + Sbjct: 492 VRKDKSQLLQALRFTVMNSFFKEWPNKPLK-FPSVTVKENEGETNVVDETSLSTGNFNLQ 550 Query: 330 QTDSDSDCGREEKNLLHSRFTDDSTKKAANVETHDSPTKEXXXXXXXXSLEDIEEGEILD 151 +T+++S E + H +D+ + DS E LE EEGEI+D Sbjct: 551 ETNNNS---LVENDGSHDIDSDNEN------DIKDSNKDEKLKGDGIQVLE--EEGEIID 599 >XP_010420604.1 PREDICTED: GATA zinc finger domain-containing protein 14-like isoform X1 [Camelina sativa] XP_010420605.1 PREDICTED: GATA zinc finger domain-containing protein 14-like isoform X1 [Camelina sativa] Length = 481 Score = 129 bits (325), Expect = 2e-28 Identities = 102/316 (32%), Positives = 148/316 (46%), Gaps = 22/316 (6%) Frame = -2 Query: 1191 SKGRPNEQSHGM----NGSVSPAVDAKNKLGQESR---PGYDKKRKNELNGTSQLSPKAK 1033 S+ RP Q+ G+ NGS S D +NK + + PG +R + Sbjct: 163 SEPRPLGQTGGIVDNTNGSGSKGNDFRNKFTKHQKFKGPGQGFQRSQLHKADNGKRKSGF 222 Query: 1032 KFNNGPSRFQKHNESLGDTRKGT---------AKIIETAEDRQWREDRKRNFPTGKNIQK 880 K + G + K + G A + E +QWRE R++N+PT N+ K Sbjct: 223 KDHKGKGNYNKMTTGFNGSDAGNIPNEKKRSFALVYTPKEIKQWRESRRKNYPTKLNVAK 282 Query: 879 KAEDLQQKAAXXXXXXXXXXXXXXXEILAKQAELGCEVAEIPREYLDDKDGHKAEVKVGN 700 K + + + E+LAKQAELG EVAE+P YL + D +V Sbjct: 283 KVK--KNVSESILDEEAKMRRQQLQEVLAKQAELGIEVAEVPSHYLSNTDE-----QVNG 335 Query: 699 DDTEQNGRKKYQKGHCYLFQKGRCKKERHCRYLHVKEST--DDDNQNQDLTIPKRKPSLL 526 D NG+ +Y FQ R KK R R + T +D +Q+ ++ R+P+LL Sbjct: 336 DRGNNNGKNQYNDRKKGRFQNNRYKKRRLDRKDKSGKKTRFEDKTSSQESSVITREPTLL 395 Query: 525 EKLLSKDIKKEKSQLLQVFRFIVNNNFFDEWPNKPLEYFPWSKEEPIETVEDV----QRK 358 EKLLS DIK++KSQLLQVFRF+V N+ F E+P +PL+ P+ TVE+ R+ Sbjct: 396 EKLLSGDIKRDKSQLLQVFRFMVMNSLFKEFPEQPLKL-------PLITVEETGCEHARE 448 Query: 357 ILLQLMNQEQTDSDSD 310 + L + 
D D D Sbjct: 449 DEVSLCDDLSDDDDDD 464 >XP_010420607.1 PREDICTED: GATA zinc finger domain-containing protein 14-like isoform X2 [Camelina sativa] Length = 479 Score = 127 bits (320), Expect = 1e-27 Identities = 101/320 (31%), Positives = 146/320 (45%), Gaps = 22/320 (6%) Frame = -2 Query: 1203 LASPSKGRPNEQSHGM----NGSVSPAVDAKNKLGQESR---PGYDKKRKNELNGTSQLS 1045 L P P + G+ NGS S D +NK + + PG +R + Sbjct: 157 LQPPGFSEPRPLTGGIVDNTNGSGSKGNDFRNKFTKHQKFKGPGQGFQRSQLHKADNGKR 216 Query: 1044 PKAKKFNNGPSRFQKHNESLGDTRKGT---------AKIIETAEDRQWREDRKRNFPTGK 892 K + G + K + G A + E +QWRE R++N+PT Sbjct: 217 KSGFKDHKGKGNYNKMTTGFNGSDAGNIPNEKKRSFALVYTPKEIKQWRESRRKNYPTKL 276 Query: 891 NIQKKAEDLQQKAAXXXXXXXXXXXXXXXEILAKQAELGCEVAEIPREYLDDKDGHKAEV 712 N+ KK + + + E+LAKQAELG EVAE+P YL + D Sbjct: 277 NVAKKVK--KNVSESILDEEAKMRRQQLQEVLAKQAELGIEVAEVPSHYLSNTDE----- 329 Query: 711 KVGNDDTEQNGRKKYQKGHCYLFQKGRCKKERHCRYLHVKEST--DDDNQNQDLTIPKRK 538 +V D NG+ +Y FQ R KK R R + T +D +Q+ ++ R+ Sbjct: 330 QVNGDRGNNNGKNQYNDRKKGRFQNNRYKKRRLDRKDKSGKKTRFEDKTSSQESSVITRE 389 Query: 537 PSLLEKLLSKDIKKEKSQLLQVFRFIVNNNFFDEWPNKPLEYFPWSKEEPIETVEDV--- 367 P+LLEKLLS DIK++KSQLLQVFRF+V N+ F E+P +PL+ P+ TVE+ Sbjct: 390 PTLLEKLLSGDIKRDKSQLLQVFRFMVMNSLFKEFPEQPLKL-------PLITVEETGCE 442 Query: 366 -QRKILLQLMNQEQTDSDSD 310 R+ + L + D D D Sbjct: 443 HAREDEVSLCDDLSDDDDDD 462 >XP_016499078.1 PREDICTED: mediator of RNA polymerase II transcription subunit 15-like isoform X2 [Nicotiana tabacum] Length = 615 Score = 129 bits (324), Expect = 1e-27 Identities = 159/576 (27%), Positives = 237/576 (41%), Gaps = 67/576 (11%) Frame = -2 Query: 1812 IPHSNNNLQSAPVTATPFPLQPNQLN----------ILASNISTXXXXXXXXLGAVVTSI 1663 IP+ N+N Q +P F Q NQ+N +LA N+ ++ Sbjct: 48 IPNINSNTQFSPQQL--FAFQMNQMNNVNSQHPKGQVLAQNVVNPPQFLNQN-----VAM 100 Query: 1662 QNPVALLQISAFLQYMQSFAQM--------PSPQHQSFS-QLPSQGQTGFAQMPSQNGFP 1510 QN LLQ+ +Q M FAQ+ P+ Q Q P+ G + + +G Sbjct: 101 QNLNQLLQLQMAMQ-MPGFAQLVPGNVPLYPNQVSQGIGLQNPNFAMNGHLGLMNASGTV 159 Query: 1509 
QMQTQNHVNGFAQMPXXXXXXXXXXXXGVENGMPLYGLQGNVQSSNPQ-LMMPHSASLV- 1336 Q +++ QM ++ PL G VQ Q +P SA+L Sbjct: 160 QQSMNGNLS--KQMANATRQ--------LQGQSPLMNSFGTVQQPQTQNFSVPASANLQV 209 Query: 1335 -------NANFPGNKALLPQNFASNGVIEXXXXXXXXXXXXXXXXXXXXXNLASPSKGRP 1177 ++NF N + P N +NGV++ L +P Sbjct: 210 SQGMRPQSSNFVMNAHMGPVN--ANGVVQQSKNGFPKQMSNVTQQLQGQLPLMNPFGFVQ 267 Query: 1176 NEQSHGMNGSVSPAVDAKNKLGQESRPGYDKKRKNELNG-------TSQLSPKAKKFNNG 1018 Q+ N S A + E G +K N+ N + K+ K G Sbjct: 268 QPQAQNFNTPAS----ANTQANPEGVGGLNKSIGNQQNSYNSNFSRNQKHGAKSMKSQFG 323 Query: 1017 PSRFQKHNESL--GDTRKGTAKII-------------------ETAEDRQWREDRKRNFP 901 +F H++SL G RKG K E R+WRE+R++N+P Sbjct: 324 KGKFSPHSKSLEKGHHRKGEKKSFLANSVKPEMEKKRSLLVTYSAQEIRRWREERRKNYP 383 Query: 900 TGKNIQKKAEDLQQKAAXXXXXXXXXXXXXXXEILAKQAELGCEVAEIPREYLDDK---- 733 + N++KK + ++ EILAKQAELGCEVAEIP YL D Sbjct: 384 SKSNLEKKPAE-KRAETDDSSSAAKLRRQQLKEILAKQAELGCEVAEIPSSYLSDSEKQG 442 Query: 732 DGHKAEVKVGNDDTEQNGRKKYQKGHCYLFQKGRCKKERHCRYLHVKES-TDDDNQNQ-- 562 DG + + + + Q KK+ K + + R K+R R+ + S T D N + Sbjct: 443 DGREQKRPLSRKERFQ---KKFNKRERFN-RNDRFSKKR--RFGNSDSSITRDQNASTAG 496 Query: 561 DLTIPKRKPSLLEKLLSKDIKKEKSQLLQVFRFIVNNNFFDEWPNKPLEYFPW----SKE 394 +T R+P+LL+KLLS DI+++K LLQVFRF+ N+FF +WP KPL FP Sbjct: 497 QVTETAREPTLLQKLLSSDIRRDKRHLLQVFRFMTMNSFFKDWPEKPLR-FPQVILKETG 555 Query: 393 EPIETVEDVQRKILLQLMNQEQTDSDSDCGREEKNL 286 + IE E++ I + + DSD+D EE N+ Sbjct: 556 QEIEAAEEIYDAI-DATVEKSANDSDND---EENNI 587 >XP_009788231.1 PREDICTED: mediator of RNA polymerase II transcription subunit 15 isoform X3 [Nicotiana sylvestris] Length = 615 Score = 129 bits (324), Expect = 1e-27 Identities = 159/569 (27%), Positives = 233/569 (40%), Gaps = 60/569 (10%) Frame = -2 Query: 1812 IPHSNNNLQSAPVTATPFPLQPNQLN----------ILASNISTXXXXXXXXLGAVVTSI 1663 IP+ N+N Q +P F Q NQ+N +LA N+ ++ Sbjct: 48 IPNINSNTQFSPQQL--FAFQMNQMNNVNSQQPKGQVLAQNVVNPPQFLNQN-----VAM 100 Query: 1662 QNPVALLQISAFLQYMQSFAQMPSPQHQSFSQLPSQGQTGFAQMPSQNGFPQMQTQNHVN 1483 QN LLQ+ +Q M FAQ+ + SQG + NG + T + Sbjct: 101 
QNLNQLLQLQMAMQ-MPGFAQLVPGNVPLYPNQVSQGIGLQNPNFAMNGHLGLMT---AS 156 Query: 1482 GFAQ--MPXXXXXXXXXXXXGVENGMPLYGLQGNVQSSNPQ-LMMPHSASLV-------- 1336 G Q M ++ PL G VQ Q +P SA+L Sbjct: 157 GTVQQSMNGNLSKQMANATRQLQGQSPLMNSFGTVQQPQTQNFSVPASANLQVSQGMRPQ 216 Query: 1335 NANFPGNKALLPQNFASNGVIEXXXXXXXXXXXXXXXXXXXXXNLASPSKGRPNEQSHGM 1156 ++NF N + P N +NGV++ L +P Q+ Sbjct: 217 SSNFVMNAHMGPVN--ANGVVQQSKNGFPKQMSNVTQQLQGQLPLMNPFGFVQQPQAQNF 274 Query: 1155 NGSVSPAVDAKNKLGQESRPGYDKKRKNELNG-------TSQLSPKAKKFNNGPSRFQKH 997 N S A + E G +K N+ N + K+ K G +F H Sbjct: 275 NTPAS----ANTQANPEGVGGLNKSIGNQQNSYNSNFSRNQKHGAKSMKSQFGKGKFSPH 330 Query: 996 NESL--GDTRKGTAKII-------------------ETAEDRQWREDRKRNFPTGKNIQK 880 ++SL G RKG K E R+WRE+R++N+P+ N++K Sbjct: 331 SKSLEKGHHRKGEKKSFLANSVKPEMEKKRSLLVTYSAQEIRRWREERRKNYPSKSNLEK 390 Query: 879 KAEDLQQKAAXXXXXXXXXXXXXXXEILAKQAELGCEVAEIPREYLDDK----DGHKAEV 712 K + ++ EILAKQAELGCEVAEIP YL D DG + + Sbjct: 391 KPTE-KRAETDDSSSAAKLRRQQLKEILAKQAELGCEVAEIPSSYLSDSEKQGDGREQKR 449 Query: 711 KVGNDDTEQNGRKKYQKGHCYLFQKGRCKKERHCRYLHVKES-TDDDNQNQ--DLTIPKR 541 + + Q KK+ K + + R K+R R+ + S T D N + +T R Sbjct: 450 PLSRKERFQ---KKFNKRERFN-RNDRFSKKR--RFGNSDSSITRDQNASTAGQVTETAR 503 Query: 540 KPSLLEKLLSKDIKKEKSQLLQVFRFIVNNNFFDEWPNKPLEYFPW----SKEEPIETVE 373 +P+LL+KLLS DI+++K LLQVFRF+ N+FF +WP KPL FP + IE E Sbjct: 504 EPTLLQKLLSSDIRRDKRHLLQVFRFMTMNSFFKDWPEKPLR-FPQVILKETGQEIEAAE 562 Query: 372 DVQRKILLQLMNQEQTDSDSDCGREEKNL 286 ++ I + + DSD+D EE N+ Sbjct: 563 EIYDAI-DATVEKSANDSDND---EENNI 587 >XP_016902016.1 PREDICTED: uncharacterized protein LOC103496534 isoform X1 [Cucumis melo] Length = 604 Score = 128 bits (322), Expect = 2e-27 Identities = 105/331 (31%), Positives = 158/331 (47%), Gaps = 10/331 (3%) Frame = -2 Query: 1113 GQESRPGYDKKRKNELNGTSQLSPKAKKFNNGPSRFQKHNESLGDTRKGTAKIIETAEDR 934 G + G+ +R+N+ GT+ + + + ++ + + E R Sbjct: 308 GGQKEKGFHNERRNKFCGTNS------------------TDQVKEQKRSLSLVYTDQEIR 349 Query: 933 QWREDRKRNFPTGKNIQKKA--EDLQQKAAXXXXXXXXXXXXXXXE--ILAKQAELGCEV 766 QWRE R++N+P+ 
NIQK + + L +K ILAKQAELG EV Sbjct: 350 QWREARRKNYPSSTNIQKFSILQKLAEKQTNCTLVNQEAQLLRQELKEILAKQAELGVEV 409 Query: 765 AEIPREYLDDKDGHKAEVKVGNDDT---EQNGRK-KYQKGHCYLFQKGRCKKERHCRYLH 598 AEIP EYL + H + G T E +G + +K L ++GR KK+ R Sbjct: 410 AEIPPEYLSYSEKHDNRKQRGGPSTLGEEADGASIEKEKSQNRLNKRGRLKKKNRPR--- 466 Query: 597 VKESTDDDNQNQDLTIPKRKPSLLEKLLSKDIKKEKSQLLQVFRFIVNNNFFDEWPNKPL 418 K+ + + + + KR+P+LL+KLL D++K+KSQLLQ RF+V N+FF EWPNKPL Sbjct: 467 -KKGKFEKHLSNKPPLKKREPTLLQKLLKADVRKDKSQLLQALRFMVMNSFFKEWPNKPL 525 Query: 417 EYFP--WSKEEPIETVEDVQRKILLQLMNQEQTDSDSDCGREEKNLLHSRFTDDSTKKAA 244 + FP KE ET + + N ++T+++S L+ + T D Sbjct: 526 K-FPSVTVKENEGETNVVDETPLSTGNFNLQETNNNS--------LVENNGTHDINSDNE 576 Query: 243 NVETHDSPTKEXXXXXXXXSLEDIEEGEILD 151 N + DS E LE EEGEI+D Sbjct: 577 N-DIEDSDNDEKLKGDGTQVLE--EEGEIID 604 >GAQ88045.1 Zinc finger domain containing protein [Klebsormidium flaccidum] Length = 1941 Score = 130 bits (326), Expect = 3e-27 Identities = 81/208 (38%), Positives = 112/208 (53%), Gaps = 13/208 (6%) Frame = -2 Query: 966 TAKIIETAED-RQWREDRKRNFPTGKNIQKKAEDLQQKAAXXXXXXXXXXXXXXXE--IL 796 TA ET ED R+WRE+R++++PT N+QKKAE+ + A IL Sbjct: 1693 TANANETEEDVRKWREERRKHYPTEGNVQKKAEEAAARRARGELEDLDGAARRARLREIL 1752 Query: 795 AKQAELGCEVAEIPREYLDDKDGHKAEVKVGNDDTEQNGRKKYQKGHCYLFQKGRCKKER 616 +Q LG +E + DG K + G GR ++G C+ F KG C+K R Sbjct: 1753 QQQRALGVVGSETEAAAM--LDGGKRQAVEG-----PGGRP--ERGVCFFFLKGHCRKGR 1803 Query: 615 HCRYLHVKE----------STDDDNQNQDLTIPKRKPSLLEKLLSKDIKKEKSQLLQVFR 466 C++LH + D QNQ + KR P+LLEKLL+ +I+K+KS LLQ FR Sbjct: 1804 RCQFLHQRTPRGERGPGGPGNDRRRQNQ---VAKRAPTLLEKLLAPEIRKDKSHLLQSFR 1860 Query: 465 FIVNNNFFDEWPNKPLEYFPWSKEEPIE 382 F+VNNNFF EWP +PL+YF W + +E Sbjct: 1861 FMVNNNFFLEWPQQPLKYFEWQSSDELE 1888 >XP_016169675.1 PREDICTED: uncharacterized protein LOC107612498 isoform X2 [Arachis ipaensis] Length = 582 Score = 127 bits (319), Expect = 4e-27 Identities = 135/466 (28%), Positives = 196/466 (42%), Gaps = 17/466 (3%) Frame = -2 Query: 1758 
PLQPNQLNILASNISTXXXXXXXXLGAVVTSIQNPVALLQISAFLQYMQSFAQMPSPQHQ 1579 PLQ NQL++ +S + N A Q M + AQ+ Q Q Sbjct: 59 PLQNNQLHMSNMGMSVPPQGPSHAGFGPQNGVSNAGYNPMFQAQGQVMHNAAQINLSQFQ 118 Query: 1578 SFSQLPSQGQTGFAQMPSQN-GFPQMQTQNHV---NGFAQMPXXXXXXXXXXXXGVENGM 1411 + +Q Q P+ N P Q + N Q+P GV G Sbjct: 119 G--HILAQSILSMLQQPNMNMNIPNGQFSSQFPVQNMNQQLPMQVPNPSQVGLHGVHPGS 176 Query: 1410 -PLYGLQGNVQ----SSNPQLMMPHSASLVNAN-----FPGN-KALLPQNFASNGVIEXX 1264 P++G G V S NP LV N F N K+L+ N +N + Sbjct: 177 GPMFGFPGQVPQAMVSQNPMFSSMLHTGLVQGNQVRPQFDQNEKSLVLPNGNTNAFVSSS 236 Query: 1263 XXXXXXXXXXXXXXXXXXXNLASPSKGRPNEQSHGMNGSVSPAVDAKNKLGQESRPGYDK 1084 + S + N+++ GS S KN +++R G+ Sbjct: 237 FSSMQLQGNSSASHAQTN----ANSNTKSNDRNSSWKGSQS-----KNFKNKQTRGGFQG 287 Query: 1083 KRKNELNGTSQLSPKAKKFNNGPSRFQKHNESLGDTRKGTAKIIETAEDRQWREDRKRNF 904 + + N + KF+ P Q+ + ++ E RQWRE R++N Sbjct: 288 RFQKWPNNGRIEHQQKPKFSLNPKEQQQ------EPKRPFFVTYTDQEIRQWREARRKNH 341 Query: 903 PTGKNIQKKAEDLQQKAAXXXXXXXXXXXXXXXEILAKQAELGCEVAEIPREYLDDKD-- 730 P NIQKK Q + ++LAKQAELG EVAEIP YL D++ Sbjct: 342 PFN-NIQKK----QSEHTRSPKVDRVVLQRELKQVLAKQAELGVEVAEIPSYYLKDRENQ 396 Query: 729 GHKAEVKVGNDDTEQNGRKKYQKGHCYLFQKGRCKKERHCRYLHVKESTDDDNQNQDLTI 550 G ++E K DT N ++K+Q K K +R R+ ++ TD D+ Q +I Sbjct: 397 GPQSEGK----DTLNNKKRKFQN-------KFNRKPDRKDRFSKKQKFTDKDSLEQRPSI 445 Query: 549 PKRKPSLLEKLLSKDIKKEKSQLLQVFRFIVNNNFFDEWPNKPLEY 412 K+KP+LL+KLLS DIKK+KS L+Q FRF+V N+FF +P+KPL Y Sbjct: 446 TKKKPTLLQKLLSADIKKDKSHLIQAFRFMVTNSFFKYYPDKPLIY 491 >XP_017984940.1 PREDICTED: uncharacterized protein LOC18586963 isoform X3 [Theobroma cacao] Length = 606 Score = 127 bits (318), Expect = 6e-27 Identities = 114/373 (30%), Positives = 172/373 (46%), Gaps = 28/373 (7%) Frame = -2 Query: 1185 GRPNEQSHGMNGSVSPAVDAKNKLG------QESRPGYDKKRKNELNGTSQLSPKAKKFN 1024 G P + +N V +K G Q+SR K + ++ K + + Sbjct: 248 GSPGKSGQNLNNFPGRNVARDSKWGFHKSKFQQSRFHQVDNAKRKFASSNGQKKKGQD-D 306 Query: 1023 NGPSRFQKHNESLGDT--RKGTAKIIETAEDRQWREDRKRNFPTGKNIQKKAEDLQQKAA 850 ++F N + D RK A E RQWRE+RK+++PT NI+KK 
A Sbjct: 307 ERAAKFPHSNSTKPDKEKRKSLALTYSEQEIRQWREERKKHYPTKTNIKKKLSGKVSDAE 366 Query: 849 XXXXXXXXXXXXXXXEILAKQAELGCEVAEIPREYLDDKDGHKAEVKVGNDDTEQNGRKK 670 ILAKQAELG EVAEIP YL +E KV E+N Sbjct: 367 VAKLRSEQLKE-----ILAKQAELGVEVAEIPSHYLLG-----SEKKVNG--REENSWPL 414 Query: 669 YQKGHCYLFQKGRCKKERHCRYLHVKESTDDDNQNQDLTIPKRKPSLLEKLLSKDIKKEK 490 ++G R + ++ R+ + ST++++ + ++ KR P+LL+KLLS DI+K+K Sbjct: 415 TKRGRFEKRHDKRVRFDKRDRFSRKRRSTNEESFD-GTSVNKRSPTLLQKLLSADIRKDK 473 Query: 489 SQLLQVFRFIVNNNFFDEWPNKPLEY-----------FPWSKEEPIETVED---VQRKIL 352 S LLQVFRF+V N+FF +WP KPL+Y +E+P+ ED V K + Sbjct: 474 SHLLQVFRFMVINSFFKDWPEKPLKYPLVVVRDGLSEGEIVREKPLVVGEDKLEVCDKTM 533 Query: 351 LQLM----NQEQTDSDSDCGREEK--NLLHSRFTDDSTKKAANVETHDSPTKEXXXXXXX 190 +Q + N++ DSD+D RE K N ++ D++ + +E Sbjct: 534 IQSIVNGENKDGDDSDNDGDRESKDDNDVNGDEDDENKHDTQADQVALYAREEKADSGEG 593 Query: 189 XSLEDIEEGEILD 151 + EEGEI+D Sbjct: 594 IVRNEEEEGEIID 606 >EOY19451.1 Region-like protein isoform 3 [Theobroma cacao] Length = 605 Score = 126 bits (317), Expect = 8e-27 Identities = 114/373 (30%), Positives = 172/373 (46%), Gaps = 28/373 (7%) Frame = -2 Query: 1185 GRPNEQSHGMNGSVSPAVDAKNKLG------QESRPGYDKKRKNELNGTSQLSPKAKKFN 1024 G P + +N V +K G Q+SR K + ++ K + + Sbjct: 247 GSPGKSGQNLNNFPGRNVARDSKWGFHKSKFQQSRFHQVDNAKRKFASSNGQKKKGQD-D 305 Query: 1023 NGPSRFQKHNESLGDT--RKGTAKIIETAEDRQWREDRKRNFPTGKNIQKKAEDLQQKAA 850 ++F N + D RK A E RQWRE+RK+++PT NI+KK A Sbjct: 306 ERAAKFPHSNSTKPDKEKRKSLALTYTEQEIRQWREERKKHYPTKTNIKKKLSGKVSDAE 365 Query: 849 XXXXXXXXXXXXXXXEILAKQAELGCEVAEIPREYLDDKDGHKAEVKVGNDDTEQNGRKK 670 ILAKQAELG EVAEIP YL +E KV E+N Sbjct: 366 VAKLRSEQLKE-----ILAKQAELGVEVAEIPSHYLLG-----SEKKVNG--REENSWPL 413 Query: 669 YQKGHCYLFQKGRCKKERHCRYLHVKESTDDDNQNQDLTIPKRKPSLLEKLLSKDIKKEK 490 ++G R + ++ R+ + ST++++ + ++ KR P+LL+KLLS DI+K+K Sbjct: 414 TKRGRFEKRHDKRVRFDKRDRFSRKRRSTNEESFD-GTSVNKRSPTLLQKLLSADIRKDK 472 Query: 489 SQLLQVFRFIVNNNFFDEWPNKPLEY-----------FPWSKEEPIETVED---VQRKIL 352 S LLQVFRF+V N+FF +WP KPL+Y +E+P+ ED V K + Sbjct: 473 
SHLLQVFRFMVINSFFKDWPEKPLKYPLVVVRDGLSEGEIVREKPLVVGEDKLEVCDKTM 532 Query: 351 LQLM----NQEQTDSDSDCGREEK--NLLHSRFTDDSTKKAANVETHDSPTKEXXXXXXX 190 +Q + N++ DSD+D RE K N ++ D++ + +E Sbjct: 533 IQSIVNGENKDGDDSDNDGDRESKDDNDVNGDEDDENKHDTQADQVALYAREEKADSGEG 592 Query: 189 XSLEDIEEGEILD 151 + EEGEI+D Sbjct: 593 IVRNEEEEGEIID 605 >EOY19453.1 Region-like protein isoform 5 [Theobroma cacao] Length = 606 Score = 126 bits (317), Expect = 8e-27 Identities = 114/373 (30%), Positives = 172/373 (46%), Gaps = 28/373 (7%) Frame = -2 Query: 1185 GRPNEQSHGMNGSVSPAVDAKNKLG------QESRPGYDKKRKNELNGTSQLSPKAKKFN 1024 G P + +N V +K G Q+SR K + ++ K + + Sbjct: 248 GSPGKSGQNLNNFPGRNVARDSKWGFHKSKFQQSRFHQVDNAKRKFASSNGQKKKGQD-D 306 Query: 1023 NGPSRFQKHNESLGDT--RKGTAKIIETAEDRQWREDRKRNFPTGKNIQKKAEDLQQKAA 850 ++F N + D RK A E RQWRE+RK+++PT NI+KK A Sbjct: 307 ERAAKFPHSNSTKPDKEKRKSLALTYTEQEIRQWREERKKHYPTKTNIKKKLSGKVSDAE 366 Query: 849 XXXXXXXXXXXXXXXEILAKQAELGCEVAEIPREYLDDKDGHKAEVKVGNDDTEQNGRKK 670 ILAKQAELG EVAEIP YL +E KV E+N Sbjct: 367 VAKLRSEQLKE-----ILAKQAELGVEVAEIPSHYLLG-----SEKKVNG--REENSWPL 414 Query: 669 YQKGHCYLFQKGRCKKERHCRYLHVKESTDDDNQNQDLTIPKRKPSLLEKLLSKDIKKEK 490 ++G R + ++ R+ + ST++++ + ++ KR P+LL+KLLS DI+K+K Sbjct: 415 TKRGRFEKRHDKRVRFDKRDRFSRKRRSTNEESFD-GTSVNKRSPTLLQKLLSADIRKDK 473 Query: 489 SQLLQVFRFIVNNNFFDEWPNKPLEY-----------FPWSKEEPIETVED---VQRKIL 352 S LLQVFRF+V N+FF +WP KPL+Y +E+P+ ED V K + Sbjct: 474 SHLLQVFRFMVINSFFKDWPEKPLKYPLVVVRDGLSEGEIVREKPLVVGEDKLEVCDKTM 533 Query: 351 LQLM----NQEQTDSDSDCGREEK--NLLHSRFTDDSTKKAANVETHDSPTKEXXXXXXX 190 +Q + N++ DSD+D RE K N ++ D++ + +E Sbjct: 534 IQSIVNGENKDGDDSDNDGDRESKDDNDVNGDEDDENKHDTQADQVALYAREEKADSGEG 593 Query: 189 XSLEDIEEGEILD 151 + EEGEI+D Sbjct: 594 IVRNEEEEGEIID 606 >XP_010492846.1 PREDICTED: GATA zinc finger domain-containing protein 14-like isoform X1 [Camelina sativa] Length = 480 Score = 125 bits (313), Expect = 8e-27 Identities = 100/318 (31%), Positives = 151/318 (47%), Gaps = 24/318 (7%) Frame = -2 Query: 1191 
SKGRPNEQSHGM----NGSVSPAVDAKNKLGQESR-----PGYDK---------KRKNEL 1066 S+ RP Q+ G+ NGS D +NK + + G+ + KRK+ Sbjct: 163 SEPRPLGQTGGIVDNTNGSGPKGNDFRNKFTKHQKFNGAGQGFQRSQLHQADNGKRKSGF 222 Query: 1065 NGTSQLSPKAKKFNNGPSRFQKHNESLGDTRKGTAKIIETAEDRQWREDRKRNFPTGKNI 886 N + K G + N + + ++ A + E +QWRE R++N+PT N+ Sbjct: 223 NKDHKGKGNYNKMTTGFNGSDAGNIA-SEKKRSFALVYTPKEIKQWRESRRKNYPTKLNV 281 Query: 885 QKKAEDLQQKAAXXXXXXXXXXXXXXXEILAKQAELGCEVAEIPREYLDDKDGHKAEVKV 706 KK + + + E+LAKQAELG EVAE+P YL + D +V Sbjct: 282 AKKVK--KNASESILDEEAKMRRQQLQEVLAKQAELGVEVAEVPSHYLSNTDE-----QV 334 Query: 705 GNDDTEQNGRKKYQKGHCYLFQKGRCKKERHCRYLHVKEST--DDDNQNQDLTIPKRKPS 532 D NG+ +Y G FQ R +K R R + T +D +Q+ ++ R+P+ Sbjct: 335 NGDGGNNNGKNQYNDGRKGRFQNNRQRKRRPDRKDKSVKKTRFEDKTSSQESSVITREPT 394 Query: 531 LLEKLLSKDIKKEKSQLLQVFRFIVNNNFFDEWPNKPLEYFPWSKEEPIETVEDV----Q 364 LLEKLLS D K++KSQLLQV RF+V N+ F E+P +PL+ P+ TVE+ Sbjct: 395 LLEKLLSGDKKRDKSQLLQVIRFMVMNSLFKEFPEQPLKL-------PLITVEETGCEHA 447 Query: 363 RKILLQLMNQEQTDSDSD 310 R+ + L + D D D Sbjct: 448 REDEVSLCDDLSDDDDDD 465 >NP_001078603.1 GATA zinc finger protein [Arabidopsis thaliana] NP_197345.2 GATA zinc finger protein [Arabidopsis thaliana] AAX23912.1 hypothetical protein At5g18440 [Arabidopsis thaliana] AAZ52755.1 hypothetical protein At5g18440 [Arabidopsis thaliana] AAZ52756.1 hypothetical protein At5g18440 [Arabidopsis thaliana] AED92563.1 GATA zinc finger protein [Arabidopsis thaliana] AED92564.1 GATA zinc finger protein [Arabidopsis thaliana] OAO91046.1 NUFIP [Arabidopsis thaliana] Length = 470 Score = 124 bits (312), Expect = 1e-26 Identities = 119/426 (27%), Positives = 176/426 (41%), Gaps = 34/426 (7%) Frame = -2 Query: 1590 PQHQ--------SFSQLPSQGQTGFAQMPSQNGFPQMQTQNHVNGFA--QMPXXXXXXXX 1441 PQHQ FS Q G+ + N M Q N MP Sbjct: 17 PQHQLQQQQQINGFSNHQQQQHNGYQNPMNANQLGMMNPQMMNNPMMGHNMPMPNMPIHP 76 Query: 1440 XXXXGVENGMPLYGLQGNVQSSNPQLMMPHSASLVNANFPGNK-------ALLPQNFASN 1282 + +P + + ++ P L+ ++ N+N G+ +L P F+S Sbjct: 77 
QFFNNMPQQLPQFAMPNHINQLLPNLLGNLQFAVANSNLMGHSLPNFFQPSLEPHAFSSR 136 Query: 1281 GVIEXXXXXXXXXXXXXXXXXXXXXNLASPSKGRPNEQS-HGMNGSVSPAVDAKNKLGQE 1105 + S+ RP QS NGS D +NK + Sbjct: 137 PQLNSFNSLPYPPVPNPHQNHQSGPP--GFSEPRPQGQSVDNTNGSGPNGNDFRNKFPKH 194 Query: 1104 SR-----PGYDKKRKNELNGTSQLSPKAK----KFNNGPSRFQKHNESLG----DTRKGT 964 G+ + + ++ + + S K K NN + G + ++ Sbjct: 195 QNFKGPGQGFQRPQLHQADNGKRKSGFNKDHRGKGNNNKMKTGLDGSDTGNIAKEKKRSY 254 Query: 963 AKIIETAEDRQWREDRKRNFPTGKNIQKKAEDLQQKAAXXXXXXXXXXXXXXXEILAKQA 784 A + E +QWRE R++N+PT ++KK + + +A E+LAKQA Sbjct: 255 ALMYTPREVQQWREARRKNYPTKFLVEKKVK--KNVSASILDEEAKMRRQQLREVLAKQA 312 Query: 783 ELGCEVAEIPREYLDDKDGHKAEVKVGNDDTEQNGRKKYQKGHCYLFQKGRCKKERHCR- 607 ELG EVAE+P YL + D +V D NGRK FQ R K RH R Sbjct: 313 ELGVEVAEVPSHYLSNNDE-----QVNGDRGNNNGRKGR-------FQNNRRNKRRHDRK 360 Query: 606 --YLHVKESTDDDNQNQDLTIPKRKPSLLEKLLSKDIKKEKSQLLQVFRFIVNNNFFDEW 433 + + K +D +QD +I RKP+LLEKLLS DIK++KSQLLQVFRF+V N+ E+ Sbjct: 361 DKFDNKKPRLEDKKSSQDSSITTRKPTLLEKLLSADIKRDKSQLLQVFRFMVMNSLLKEF 420 Query: 432 PNKPLE 415 P +PL+ Sbjct: 421 PEQPLK 426 >XP_002871816.1 hypothetical protein ARALYDRAFT_488722 [Arabidopsis lyrata subsp. lyrata] EFH48075.1 hypothetical protein ARALYDRAFT_488722, partial [Arabidopsis lyrata subsp. 
lyrata] Length = 454 Score = 124 bits (311), Expect = 1e-26 Identities = 122/445 (27%), Positives = 183/445 (41%), Gaps = 20/445 (4%) Frame = -2 Query: 1641 QISAFLQYMQSFAQMPSPQHQSFSQLPSQGQTGFAQMPSQNGFPQMQTQNHVNGFAQMPX 1462 Q+S FL Y Q H + + Q G P P M H+N MP Sbjct: 24 QVSLFLFYSNHHQQ-----HNGYQNPMNSNQLGMMN-PQMMSNPMM---GHMNNPIPMPN 74 Query: 1461 XXXXXXXXXXXGVENGMPLYGLQGNVQSSNPQLMMPHSASLVNANFPGNKALLPQNFASN 1282 + + + + ++ P L+ ++ N+N G+ LP F N Sbjct: 75 MPIHPQFFNNMPQQQQLHQFAMPNHINQLLPNLLGNLQFAVANSNLMGHS--LPNFFQPN 132 Query: 1281 -GVIEXXXXXXXXXXXXXXXXXXXXXNLASPSKGRPNEQ---SHGMNGSVSPAVDAKNKL 1114 +L P P Q NGS S D +NK Sbjct: 133 LEPSAFTSRPQLNSFNSLPYPPVPNHHLRPPGFSEPRPQVGIDDRTNGSGSNGNDFRNKF 192 Query: 1113 GQESR---PGY-----------DKKRKNELNGTSQLSPKAKKFNNGPSRFQKHNESLGDT 976 + PG + KRK+ N + K NG N + + Sbjct: 193 TKHQNFKGPGQGFQRPQLHQADNGKRKSGFNKDHRGKGNYNKMKNGLDGSDADNIAK-EK 251 Query: 975 RKGTAKIIETAEDRQWREDRKRNFPTGKNIQKKAEDLQQKAAXXXXXXXXXXXXXXXEIL 796 R+ A + + QWRE R++NFPT N++KK + + +A E+L Sbjct: 252 RRSYALMYTPKDVNQWREARRKNFPTRLNVEKKVK--KNVSASILDEEAKMRRQQLREVL 309 Query: 795 AKQAELGCEVAEIPREYLDDKDGHKAEVKVGNDDTEQNGRKKYQKGHCYLFQKGRCKKER 616 AKQAELG EVA++P YL + D +V D+ +G+K+ FQ R K+ R Sbjct: 310 AKQAELGIEVADVPSHYLSNTDE-----RVHGDNGANDGQKRK-------FQNNRHKQRR 357 Query: 615 HCRYLHVKEST--DDDNQNQDLTIPKRKPSLLEKLLSKDIKKEKSQLLQVFRFIVNNNFF 442 H R ++ DD N +Q+ + +KP+LLEKLLS +IK++K LLQVFRF+V N+F Sbjct: 358 HGRKDKFDKTPRLDDKNSSQESPMTTKKPTLLEKLLSANIKRDKIHLLQVFRFMVMNSFL 417 Query: 441 DEWPNKPLEYFPWSKEEPIETVEDV 367 E+P +PL+ + EE + + DV Sbjct: 418 KEFPEQPLKLPLITVEETGDDLSDV 442 >XP_006287638.1 hypothetical protein CARUB_v10000849mg [Capsella rubella] EOA20536.1 hypothetical protein CARUB_v10000849mg [Capsella rubella] Length = 481 Score = 124 bits (311), Expect = 2e-26 Identities = 92/286 (32%), Positives = 142/286 (49%), Gaps = 27/286 (9%) Frame = -2 Query: 1191 SKGRPNEQSHGM---NGSVSPAVDAKNKLGQESR-----PGYDKKRKNELNGTSQLSPKA 1036 S+ RP QS G P D +NK + + G+ + + ++ + + S Sbjct: 170 
SEPRPQGQSVGNVDNTNGFGPKNDFRNKFSKHQKFKGPGQGFQRSQLHQADNGKRKSG-F 228 Query: 1035 KKFNNGPSRFQKHNESLG---------DTRKGTAKIIETAEDRQWREDRKRNFPTG-KNI 886 K + G + K L + ++ A I E +QWR+ R++N+PT K + Sbjct: 229 NKDHRGKGNYNKMKSGLNGSDAVDMAKEKKRSFALIYTPKEIKQWRDARRKNYPTKLKKL 288 Query: 885 QKKAED--LQQKAAXXXXXXXXXXXXXXXEILAKQAELGCEVAEIPREYLDDKDGHKAEV 712 +K A D L ++A +LAKQAELG EVAE+P YL + D Sbjct: 289 KKNASDSILDEEATLRRQQLQE--------VLAKQAELGVEVAEVPSHYLSNTDE----- 335 Query: 711 KVGNDDTEQNGRKKYQKGHCYLFQKGRCKKERHCRYLH-------VKESTDDDNQNQDLT 553 +V D +G+ +YQ G QKGR + RH + H K ++ +N +++ + Sbjct: 336 QVNGDRGNNSGKNQYQNG-----QKGRVQNNRHNKRRHDRKDRSNKKTRSEVENSSKESS 390 Query: 552 IPKRKPSLLEKLLSKDIKKEKSQLLQVFRFIVNNNFFDEWPNKPLE 415 + R+P+LLEKLLS DIK++KS LLQVFRF+V N+FF E P +PL+ Sbjct: 391 MMTREPTLLEKLLSADIKRDKSHLLQVFRFMVINSFFKELPEQPLK 436 >XP_010454083.1 PREDICTED: GATA zinc finger domain-containing protein 14-like isoform X1 [Camelina sativa] Length = 484 Score = 124 bits (311), Expect = 2e-26 Identities = 105/322 (32%), Positives = 153/322 (47%), Gaps = 28/322 (8%) Frame = -2 Query: 1191 SKGRPNEQSHGM-----NGSVSPAVDAKNKLGQESR---PGY-----------DKKRKNE 1069 S+ RP Q+ G+ NGS S D +NK + + PG + KRK+ Sbjct: 163 SEPRPLGQTGGIVDNNTNGSGSKGNDFRNKFTKHQKFNGPGQGFQRSQLHQADNGKRKSG 222 Query: 1068 LNGTSQLSPKAKKFNNGPSRFQKHNESLGDTRKGTAKIIETAEDRQWREDRKRNFPTGKN 889 N + K G + N + + ++ A + E +QWRE R++N+PT N Sbjct: 223 FNKDHRGKGNYNKMTTGFNGSDAGNIA-NEKKRSFALVYTPKEIKQWRESRRKNYPTKLN 281 Query: 888 IQKKAEDLQQKAAXXXXXXXXXXXXXXXEILAKQAELGCEVAEIPREYLDDKDGHKAEVK 709 + KK + + + E+LAKQAELG EVAE+P YL + D + Sbjct: 282 VAKKVK--KNVSESILDEEAKMRRQQLQEVLAKQAELGVEVAEVPSHYLSNTDE-----Q 334 Query: 708 VGNDDTEQNGRKKY---QKGHCYLFQKGRCKKERHCRYLHVKEST--DDDNQNQDLTIPK 544 V D NG+ +Y QKG FQ R KK R R + T +D +Q+ ++ Sbjct: 335 VNGDGGNNNGKNQYNDGQKGR--RFQNNRHKKRRPDRKDKSSKKTRFEDKTSSQESSVIT 392 Query: 543 RKPSLLEKLLSKDIKKEKSQLLQVFRFIVNNNFFDEWPNKPLEYFPWSKEEPIETVEDV- 367 R+P+LLEKLLS DIK+ KSQLLQV RF+ N+ F E+P +PL+ P+ TVE+ Sbjct: 393 
REPTLLEKLLSGDIKRNKSQLLQVIRFMAMNSLFKEFPEQPLKL-------PLITVEETG 445 Query: 366 ---QRKILLQLMNQEQTDSDSD 310 R+ + L + D D D Sbjct: 446 CEHGREDDVSLCDDLSDDDDDD 467 >XP_007010639.2 PREDICTED: uncharacterized protein LOC18586963 isoform X2 [Theobroma cacao] XP_017984938.1 PREDICTED: uncharacterized protein LOC18586963 isoform X2 [Theobroma cacao] XP_017984939.1 PREDICTED: uncharacterized protein LOC18586963 isoform X2 [Theobroma cacao] Length = 606 Score = 125 bits (313), Expect = 3e-26 Identities = 113/374 (30%), Positives = 172/374 (45%), Gaps = 29/374 (7%) Frame = -2 Query: 1185 GRPNEQSHGMNGSVSPAVDAKNKLG------QESRPGYDKKRKNELNGTSQLSPKAKKFN 1024 G P + +N V +K G Q+SR K + ++ K + + Sbjct: 247 GSPGKSGQNLNNFPGRNVARDSKWGFHKSKFQQSRFHQVDNAKRKFASSNGQKKKGQD-D 305 Query: 1023 NGPSRFQKHNESLGDTRKGTAKIIET---AEDRQWREDRKRNFPTGKNIQKKAEDLQQKA 853 ++F N + D K + T E RQWRE+RK+++PT NI+KK A Sbjct: 306 ERAAKFPHSNSTKPDKEKRKRSLALTYSEQEIRQWREERKKHYPTKTNIKKKLSGKVSDA 365 Query: 852 AXXXXXXXXXXXXXXXEILAKQAELGCEVAEIPREYLDDKDGHKAEVKVGNDDTEQNGRK 673 ILAKQAELG EVAEIP YL +E KV E+N Sbjct: 366 EVAKLRSEQLKE-----ILAKQAELGVEVAEIPSHYLLG-----SEKKVNG--REENSWP 413 Query: 672 KYQKGHCYLFQKGRCKKERHCRYLHVKESTDDDNQNQDLTIPKRKPSLLEKLLSKDIKKE 493 ++G R + ++ R+ + ST++++ + ++ KR P+LL+KLLS DI+K+ Sbjct: 414 LTKRGRFEKRHDKRVRFDKRDRFSRKRRSTNEESFD-GTSVNKRSPTLLQKLLSADIRKD 472 Query: 492 KSQLLQVFRFIVNNNFFDEWPNKPLEY-----------FPWSKEEPIETVED---VQRKI 355 KS LLQVFRF+V N+FF +WP KPL+Y +E+P+ ED V K Sbjct: 473 KSHLLQVFRFMVINSFFKDWPEKPLKYPLVVVRDGLSEGEIVREKPLVVGEDKLEVCDKT 532 Query: 354 LLQLM----NQEQTDSDSDCGREEK--NLLHSRFTDDSTKKAANVETHDSPTKEXXXXXX 193 ++Q + N++ DSD+D RE K N ++ D++ + +E Sbjct: 533 MIQSIVNGENKDGDDSDNDGDRESKDDNDVNGDEDDENKHDTQADQVALYAREEKADSGE 592 Query: 192 XXSLEDIEEGEILD 151 + EEGEI+D Sbjct: 593 GIVRNEEEEGEIID 606 >EOY19449.1 Region-like protein isoform 1 [Theobroma cacao] Length = 606 Score = 125 bits (313), Expect = 3e-26 Identities = 113/374 (30%), Positives = 172/374 (45%), Gaps = 29/374 (7%) Frame = -2 Query: 1185 
GRPNEQSHGMNGSVSPAVDAKNKLG------QESRPGYDKKRKNELNGTSQLSPKAKKFN 1024 G P + +N V +K G Q+SR K + ++ K + + Sbjct: 247 GSPGKSGQNLNNFPGRNVARDSKWGFHKSKFQQSRFHQVDNAKRKFASSNGQKKKGQD-D 305 Query: 1023 NGPSRFQKHNESLGDTRKGTAKIIET---AEDRQWREDRKRNFPTGKNIQKKAEDLQQKA 853 ++F N + D K + T E RQWRE+RK+++PT NI+KK A Sbjct: 306 ERAAKFPHSNSTKPDKEKRKRSLALTYTEQEIRQWREERKKHYPTKTNIKKKLSGKVSDA 365 Query: 852 AXXXXXXXXXXXXXXXEILAKQAELGCEVAEIPREYLDDKDGHKAEVKVGNDDTEQNGRK 673 ILAKQAELG EVAEIP YL +E KV E+N Sbjct: 366 EVAKLRSEQLKE-----ILAKQAELGVEVAEIPSHYLLG-----SEKKVNG--REENSWP 413 Query: 672 KYQKGHCYLFQKGRCKKERHCRYLHVKESTDDDNQNQDLTIPKRKPSLLEKLLSKDIKKE 493 ++G R + ++ R+ + ST++++ + ++ KR P+LL+KLLS DI+K+ Sbjct: 414 LTKRGRFEKRHDKRVRFDKRDRFSRKRRSTNEESFD-GTSVNKRSPTLLQKLLSADIRKD 472 Query: 492 KSQLLQVFRFIVNNNFFDEWPNKPLEY-----------FPWSKEEPIETVED---VQRKI 355 KS LLQVFRF+V N+FF +WP KPL+Y +E+P+ ED V K Sbjct: 473 KSHLLQVFRFMVINSFFKDWPEKPLKYPLVVVRDGLSEGEIVREKPLVVGEDKLEVCDKT 532 Query: 354 LLQLM----NQEQTDSDSDCGREEK--NLLHSRFTDDSTKKAANVETHDSPTKEXXXXXX 193 ++Q + N++ DSD+D RE K N ++ D++ + +E Sbjct: 533 MIQSIVNGENKDGDDSDNDGDRESKDDNDVNGDEDDENKHDTQADQVALYAREEKADSGE 592 Query: 192 XXSLEDIEEGEILD 151 + EEGEI+D Sbjct: 593 GIVRNEEEEGEIID 606