BLASTX nr result
ID: Rauwolfia21_contig00000969
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00000969
         (2802 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score  E
Sequences producing significant alignments:                        (bits) Value

emb|CAC28528.1| GATA-1 zinc finger protein [Nicotiana tabacum]       321  1e-84
ref|XP_006340186.1| PREDICTED: GATA transcription factor 11-like...  310  2e-81
ref|XP_004251141.1| PREDICTED: GATA transcription factor 11-like...  299  4e-78
ref|XP_006365758.1| PREDICTED: GATA transcription factor 10-like...  203  3e-49
ref|XP_004242094.1| PREDICTED: GATA transcription factor 11-like...  200  3e-48
gb|EOY34380.1| GATA zinc finger protein regulating nitrogen assi...  189  7e-45
ref|XP_006424556.1| hypothetical protein CICLE_v10029015mg [Citr...  184  2e-43
ref|XP_006488078.1| PREDICTED: GATA transcription factor 11-like...  184  2e-43
gb|EOY18663.1| Plant-specific GATA-type zinc finger transcriptio...  177  3e-41
gb|AGV54633.1| GATA transcription factor [Phaseolus vulgaris] gi...  176  7e-41
ref|XP_002273502.1| PREDICTED: GATA transcription factor 9-like ...  174  1e-40
ref|XP_003543479.1| PREDICTED: GATA transcription factor 11-like...  173  4e-40
gb|EOY18664.1| Plant-specific GATA-type zinc finger transcriptio...  172  7e-40
gb|ESW23870.1| hypothetical protein PHAVU_004G083100g [Phaseolus...  171  2e-39
ref|XP_003597258.1| GATA transcription factor [Medicago truncatu...  171  2e-39
gb|ACU24388.1| unknown [Glycine max]                                 170  3e-39
gb|EXB38685.1| Protein-tyrosine sulfotransferase [Morus notabilis]   169  5e-39
ref|XP_003540186.1| PREDICTED: GATA transcription factor 11-like...  169  5e-39
ref|XP_006378769.1| hypothetical protein POPTR_0010s23010g [Popu...
168 1e-38 dbj|BAC98495.1| AG-motif binding protein-5 [Nicotiana tabacum] 168 1e-38 >emb|CAC28528.1| GATA-1 zinc finger protein [Nicotiana tabacum] Length = 305 Score = 321 bits (823), Expect = 1e-84 Identities = 178/309 (57%), Positives = 203/309 (65%), Gaps = 9/309 (2%) Frame = -2 Query: 1625 GFLDGVPGDKGFPEDGPLDILDFLDFPMESLEDDTGG-DWDASKSHCLGPIPTDALLGLP 1449 G+LDG+P ED DIL+FLDFP+ESLE+D G +WDAS+S LGPIP DAL+ P Sbjct: 9 GYLDGIPTGPVVDEDFD-DILNFLDFPLESLEEDGQGVEWDASESKFLGPIPMDALMAFP 67 Query: 1448 PVPQDNTGNAFLNMLPQSNAPVGGAGETQESGSFQIHSPVSVLDSGGSCSVGKSLPIKSD 1269 PVPQ N GN + P SN P+ E Q SG FQ SPVSVL+S SCS GKS+ IK D Sbjct: 68 PVPQGNIGNGRVKAEPNSNHPIK-VTEGQGSGIFQTQSPVSVLESSNSCSGGKSISIKHD 126 Query: 1268 IVIPVRTRSKRARPSNLNPWFVMPPISSARXXXXXXXXXXXXXXXXXXXXXKNTDDFFQH 1089 I IPVR RSKR R S LNPW +MPPISS R + Sbjct: 127 IAIPVRPRSKRPRSSALNPWILMPPISSTRFASKKTCDARKGKEKKRKMSLLSVPQI--- 183 Query: 1088 SEEELLNASDATEKKDGSQQRTVALKKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRSG 909 +D T+KK S Q+ + KKCTHC+VTKTPQWREGP+GPKTLCNACGVRYRSG Sbjct: 184 --------ADVTKKKTTSGQQ-FSFKKCTHCQVTKTPQWREGPLGPKTLCNACGVRYRSG 234 Query: 908 RLFPEYRPAASPTFVPTLHSNSHKKVIEMRKKASVQETAVL---HDV-----PVSPPPEF 753 RLFPEYRPAASPTFVPTLHSNSH+KV+EMRKKA ET+ L H+V P+SP PEF Sbjct: 235 RLFPEYRPAASPTFVPTLHSNSHRKVVEMRKKAIYGETSALEEPHNVIVEGPPMSPAPEF 294 Query: 752 VPMSGSLFD 726 VPMS LFD Sbjct: 295 VPMSSYLFD 303 >ref|XP_006340186.1| PREDICTED: GATA transcription factor 11-like [Solanum tuberosum] Length = 337 Score = 310 bits (794), Expect = 2e-81 Identities = 174/338 (51%), Positives = 210/338 (62%), Gaps = 28/338 (8%) Frame = -2 Query: 1655 MNMVEPSYLDGFLDGVPGDKGFPEDGPLDILDFLDFPMESLEDDT--GGDWDASKSHCLG 1482 M MVE G++DGVP +D DIL+FLD PMESLE+D G +WD S+S G Sbjct: 1 MTMVEHG--GGYMDGVPTGPIVDDDFD-DILNFLDMPMESLEEDGLGGVEWDVSESKGFG 57 Query: 1481 PIPTDALLGLPPVPQDNTGNAFLNMLPQSNA-PVGGAGETQESGSFQIHSPVSVLDSGGS 1305 PIPTDAL+ PP+PQ N GN +N + +S+ P E Q +G+FQ SPVSVL+ S Sbjct: 58 PIPTDALMDFPPMPQGNIGNRRVNAVAKSHPHPPIKFTEVQGTGTFQTQSPVSVLEGSNS 117 Query: 1304 
CSVGKSLPIKSDIVIPVRTRSKRARPSNLNPWFVMPPISSARXXXXXXXXXXXXXXXXXX 1125 CS GKS+PIK DIVIPVR RSKRARPS +NPW +M PISS R Sbjct: 118 CSGGKSIPIKHDIVIPVRPRSKRARPSAVNPWVLMAPISSTRVASKKISDARKTKEKRRR 177 Query: 1124 XXXKN-----TDDFFQHSEEELLNASDATEKKDGSQQRTVALKKCTHCEVTKTPQWREGP 960 + ++ Q + L SD +KK S Q++ KKCTHCEVTKTPQWREGP Sbjct: 178 LSLLSGAKEPMKNYVQQINDAALPLSDVYKKKITSTQQSSFFKKCTHCEVTKTPQWREGP 237 Query: 959 MGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKKA---------- 810 +GPKTLCNACGVRYRSGRLFPEYRPAASPTFVP++HSNSH+KV+EMRKK Sbjct: 238 LGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPSVHSNSHRKVVEMRKKTLYGTGEVEEP 297 Query: 809 ----------SVQETAVLHDVPVSPPPEFVPMSGSLFD 726 ++ E ++ D +SP PEFVPMS LFD Sbjct: 298 PKVIMGRKSEALPEPTIVADPAMSPAPEFVPMSSYLFD 335 >ref|XP_004251141.1| PREDICTED: GATA transcription factor 11-like [Solanum lycopersicum] Length = 336 Score = 299 bits (766), Expect = 4e-78 Identities = 168/338 (49%), Positives = 205/338 (60%), Gaps = 28/338 (8%) Frame = -2 Query: 1655 MNMVEPSYLDGFLDGVPGDKGFPEDGPLDILDFLDFPMESLEDDT--GGDWDASKSHCLG 1482 M MVE G++D +P +D DIL+FLD PMESLE D G +WD S+S G Sbjct: 1 MTMVEHG--GGYMDEIPTGPIVDDDFD-DILNFLDMPMESLEGDVLGGVEWDVSESKGFG 57 Query: 1481 PIPTDALLGLPPVPQDNTGNAFLNMLPQSNAPVGGAGETQESGSFQIHSPVSVLDSGGSC 1302 PIPT+AL+ P+PQ N GN +N + S+ P+ E Q +G+FQ SPVSVL+ SC Sbjct: 58 PIPTEALMDFLPLPQSNIGNRRVNAVANSHPPIKFT-EVQGTGTFQTQSPVSVLEGSNSC 116 Query: 1301 SVGKSLPIKSDIVIPVRTRSKRARPSNLNPWFVMPPISSARXXXXXXXXXXXXXXXXXXX 1122 S GKS+PIK D VIPVR RSKRARPS +NPW +M PISS R Sbjct: 117 SGGKSVPIKHDPVIPVRPRSKRARPSAVNPWVLMAPISSTRVASKKISDARKTKERRRRL 176 Query: 1121 XXKN-----TDDFFQHSEEELLNASDATEKKDGSQQRTVALKKCTHCEVTKTPQWREGPM 957 + ++ Q + SD ++KK S Q++ KKCTHCEVTKTPQWREGP+ Sbjct: 177 SLLSGAKEPMKNYVQQISDAAPPVSDVSKKKITSTQQSSFFKKCTHCEVTKTPQWREGPL 236 Query: 956 GPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKKA----------- 810 GPKTLCNACGVRYRSGRLFPEYRPAASPTFVP++HSNSH+KV+EMRKK Sbjct: 237 GPKTLCNACGVRYRSGRLFPEYRPAASPTFVPSVHSNSHRKVVEMRKKTLYGGAGEVEEP 296 Query: 809 ----------SVQETAVLHDVPVSPPPEFVPMSGSLFD 726 ++ E + D 
+SP PEFVPMS LFD Sbjct: 297 PKVIMGRSSEALPEPTIAADPAMSPAPEFVPMSSYLFD 334 >ref|XP_006365758.1| PREDICTED: GATA transcription factor 10-like [Solanum tuberosum] Length = 258 Score = 203 bits (517), Expect = 3e-49 Identities = 127/281 (45%), Positives = 159/281 (56%), Gaps = 2/281 (0%) Frame = -2 Query: 1649 MVEPSYLDGFLDGVPGDKGFPEDGPLDILDFLDFPMESLEDDT-GGDWDASK-SHCLGPI 1476 MVE +Y+DG G D+ F IL+ LDF M++LE D DWDA+ GPI Sbjct: 1 MVEQNYMDGISMGHVVDEDFES-----ILNGLDFSMQNLEADVLEEDWDATVYGELFGPI 55 Query: 1475 PTDALLGLPPVPQDNTGNAFLNMLPQSNAPVGGAGETQESGSFQIHSPVSVLDSGGSCSV 1296 P++ L+ LP + N+ L +NAP E+Q + FQ SP+SVL++ SCS Sbjct: 56 PSETLMSLPL----DIANSCLEDRRMTNAP-NEFLESQGNALFQTGSPISVLENNRSCSG 110 Query: 1295 GKSLPIKSDIVIPVRTRSKRARPSNLNPWFVMPPISSARXXXXXXXXXXXXXXXXXXXXX 1116 G+S I + R RSKRAR S LNPW +M PI Sbjct: 111 GRSA-ISFNFGSKGR-RSKRARSSTLNPWLMMAPIPC----------------------- 145 Query: 1115 KNTDDFFQHSEEELLNASDATEKKDGSQQRTVALKKCTHCEVTKTPQWREGPMGPKTLCN 936 + SD+ K S + + K+CTHCEVTKTPQWREGP+GPKTLCN Sbjct: 146 ---------TTSAAKKNSDSKSGKLSSAKGSPLFKRCTHCEVTKTPQWREGPLGPKTLCN 196 Query: 935 ACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKK 813 ACGVRYRSGRL PEYRPAASPTF+P+LHSNSH+KV+EMR+K Sbjct: 197 ACGVRYRSGRLLPEYRPAASPTFIPSLHSNSHRKVVEMRRK 237 >ref|XP_004242094.1| PREDICTED: GATA transcription factor 11-like [Solanum lycopersicum] Length = 252 Score = 200 bits (508), Expect = 3e-48 Identities = 123/283 (43%), Positives = 155/283 (54%), Gaps = 2/283 (0%) Frame = -2 Query: 1568 ILDFLDFPMESLEDDT-GGDWDASK-SHCLGPIPTDALLGLPPVPQDNTGNAFLNMLPQS 1395 IL+ LDF +++LE D DWDA+ LGPIP++ L+ LPP+ N N F Sbjct: 17 ILNGLDFSIQNLEADRLDEDWDATVYGELLGPIPSETLMSLPPLELTNVDNVF------- 69 Query: 1394 NAPVGGAGETQESGSFQIHSPVSVLDSGGSCSVGKSLPIKSDIVIPVRTRSKRARPSNLN 1215 E Q + FQ SP+SVL++ SCS G+S I + R RSKRAR S LN Sbjct: 70 -------PEAQGNVIFQTGSPISVLENTRSCSGGRSA-ISFNFGSKGR-RSKRARSSTLN 120 Query: 1214 PWFVMPPISSARXXXXXXXXXXXXXXXXXXXXXKNTDDFFQHSEEELLNASDATEKKDGS 1035 PW M P+ T ++S+ ++ ++K S Sbjct: 121 
PWLKMAPMPCT------------------------TSAAKKNSDSKI---GKVNKRKLSS 153 Query: 1034 QQRTVALKKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTL 855 + K+CTHCEVTKTPQWREGP+GPKTLCNACGVRYRSGRL PEYRPAASPTF+P+L Sbjct: 154 AMASPLFKRCTHCEVTKTPQWREGPLGPKTLCNACGVRYRSGRLLPEYRPAASPTFIPSL 213 Query: 854 HSNSHKKVIEMRKKASVQETAVLHDVPVSPPPEFVPMSGSLFD 726 HSNSHKKV+EMR+K + P FVP+ L D Sbjct: 214 HSNSHKKVVEMRRK-------TVESSPEFDSQNFVPLGSYLLD 249 >gb|EOY34380.1| GATA zinc finger protein regulating nitrogen assimilation, putative [Theobroma cacao] Length = 342 Score = 189 bits (479), Expect = 7e-45 Identities = 124/323 (38%), Positives = 157/323 (48%), Gaps = 50/323 (15%) Frame = -2 Query: 1571 DILDFLDFPMESLE----------------------DDTGG-----DWDASKSHCLGPIP 1473 D++ +LDFP+E +E +D+GG +WD + + L P P Sbjct: 20 DVIKYLDFPLEDVEANDGSGGGSSGEDVIKDFHLPLEDSGGGGGGEEWDCNFQN-LEPPP 78 Query: 1472 TDALLGLP-----PVPQDNTGNAFLNMLPQSNAPVGGAGETQESGS-------------- 1350 + L GL DN S+ P T+ S S Sbjct: 79 ANVLAGLSSGFYGDFFGDNLAKNLTVSCDGSSQPNQQTSTTKASSSRSITLNSESADLKG 138 Query: 1349 ---FQIHSPVSVLDSGGSCSVGKSLPIKSDIVIPV-RTRSKRARPSNLNPWFVMPPISSA 1182 FQ SPVSVL+S SCS PI ++ PV R+RSKR R S N +P ISS Sbjct: 139 SNRFQTSSPVSVLESSSSCSAANPTPIDPNLSFPVKRSRSKRRRVSTFNLHVSLPFISST 198 Query: 1181 RXXXXXXXXXXXXXXXXXXXXXKNTDDFFQHSEEELLNASDATEKKDGSQQRTVALKKCT 1002 + + Q + L S ++E K Q+ V ++KC Sbjct: 199 SSTSRGSNSLVGSESESESHLTEKSAKKRQKKKRNLTLLSGSSEIKKSPSQQPVVVRKCM 258 Query: 1001 HCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEM 822 HCEVTKTPQWREGPMGPKTLCNACGVRYRSGRL PEYRPAASPTFV +LHSNSHKKV+EM Sbjct: 259 HCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLLPEYRPAASPTFVSSLHSNSHKKVVEM 318 Query: 821 RKKASVQETAVLHDVPVSPPPEF 753 RKKA + + + + + P F Sbjct: 319 RKKAKLPISVMPSMLSIPPENSF 341 >ref|XP_006424556.1| hypothetical protein CICLE_v10029015mg [Citrus clementina] gi|557526490|gb|ESR37796.1| hypothetical protein CICLE_v10029015mg [Citrus clementina] Length = 280 Score = 184 bits (467), Expect = 2e-43 Identities = 97/179 (54%), Positives = 114/179 (63%), Gaps = 1/179 
(0%) Frame = -2 Query: 1346 QIHSPVSVLDSGGSCSVGKSLPIKSDIVIPV-RTRSKRARPSNLNPWFVMPPISSARXXX 1170 Q SP+SVL+SGGSCS K +PI +V V R RSKR RP+ LNP F+ P ISS Sbjct: 99 QTSSPISVLESGGSCSADKHVPINPKLVFAVKRARSKRRRPATLNPLFIYPFISSTSSTS 158 Query: 1169 XXXXXXXXXXXXXXXXXXKNTDDFFQHSEEELLNASDATEKKDGSQQRTVALKKCTHCEV 990 + Q ++ L S + E K S Q+T +KC HCEV Sbjct: 159 EDYHPETASESGSEMNLTEKPVRKKQKRKKNLTVLSGSRENKKLSFQQTDTPRKCMHCEV 218 Query: 989 TKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKK 813 +TPQWREGPMGPKTLCNACGVRYRSGRL PEYRPAASPTFVP+LHSNSHK+++EMR K Sbjct: 219 AETPQWREGPMGPKTLCNACGVRYRSGRLVPEYRPAASPTFVPSLHSNSHKRIMEMRNK 277 >ref|XP_006488078.1| PREDICTED: GATA transcription factor 11-like [Citrus sinensis] Length = 277 Score = 184 bits (466), Expect = 2e-43 Identities = 98/179 (54%), Positives = 114/179 (63%), Gaps = 1/179 (0%) Frame = -2 Query: 1346 QIHSPVSVLDSGGSCSVGKSLPIKSDIVIPV-RTRSKRARPSNLNPWFVMPPISSARXXX 1170 Q SP+SVL+SGGSCS K +PI +V V R RSKR RP+ LNP F+ P ISS Sbjct: 99 QTSSPISVLESGGSCSAEKHVPINPKLVFAVKRARSKRRRPATLNPLFIYPFISSTSEDY 158 Query: 1169 XXXXXXXXXXXXXXXXXXKNTDDFFQHSEEELLNASDATEKKDGSQQRTVALKKCTHCEV 990 Q ++ L S + E K S Q+T A +KC HCEV Sbjct: 159 HPETASESGSEMNLTEKPVRKK---QKRKKNLTVLSGSRENKKLSFQQTDAPRKCMHCEV 215 Query: 989 TKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKK 813 +TPQWREGPMGPKTLCNACGVRYRSGRL PEYRPAASPTFVP+LHSNSHK+++EMR K Sbjct: 216 AETPQWREGPMGPKTLCNACGVRYRSGRLVPEYRPAASPTFVPSLHSNSHKRIMEMRNK 274 >gb|EOY18663.1| Plant-specific GATA-type zinc finger transcription factor family protein isoform 1 [Theobroma cacao] Length = 414 Score = 177 bits (448), Expect = 3e-41 Identities = 130/365 (35%), Positives = 167/365 (45%), Gaps = 44/365 (12%) Frame = -2 Query: 1682 GVYFVLQKKMNMVEPSYLDGFLDGVPGDKGFPEDGPLDILDFLDFPMESLEDDTGGDWDA 1503 G + + K NM+ P+ F+D + F I D LDFP E +E A Sbjct: 63 GFFIIFYIKENMIGPT---NFIDEIDCGSFFDH-----IDDLLDFPNEDVEAGLSASDSA 114 Query: 1502 SKSHCLGPIPTDALLGLPPVPQDNTGNAFLNMLPQSNAPVG-----------------GA 1374 + I T LP + N+ ++ + + P GA Sbjct: 115 
VNASAFPSIWTTHSESLPGSDSVFSNNSASDLSAELSVPYEDIVQLEWLSNFVDDSQCGA 174 Query: 1373 GET---QESGS---------FQIHSPVSVLDSGGSCSVGKSLPIKSDIVIPVR---TRSK 1239 T +ES S FQ SPVSVL+S SCS K+LP + P R RSK Sbjct: 175 SLTIKKEESSSITKDSSQHQFQTSSPVSVLESSSSCSGEKTLPRSPETAAPGRRGRARSK 234 Query: 1238 RARPSNLNPWFVMPPISSARXXXXXXXXXXXXXXXXXXXXXKNTDDFF----------QH 1089 R RP+ NP + IS + +H Sbjct: 235 RPRPTTFNPRPAIQLISPTSSVNENDIPQPFVVPKVPSDSENYAESRLLIKIPRQVNPEH 294 Query: 1088 SEEELLNASDATEKKDGSQQRT-VALKKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRS 912 +++ + S T D +Q + A++KC HCE+TKTPQWR GPMGPKTLCNACGVRY+S Sbjct: 295 KKKKKIKLSLPTAPADNNQNSSGQAVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKS 354 Query: 911 GRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKKASVQETAVLHDVPVSPPPEFVP-MSGS 735 GRLFPEYRPAASPTFVP+LHSNSHKKVIEMR K T + V+ PE +P S Sbjct: 355 GRLFPEYRPAASPTFVPSLHSNSHKKVIEMRNKGGAAPTTM-----VTSSPELIPNKSNP 409 Query: 734 LFDFI 720 DF+ Sbjct: 410 ALDFM 414 >gb|AGV54633.1| GATA transcription factor [Phaseolus vulgaris] gi|561023457|gb|ESW22187.1| hypothetical protein PHAVU_005G134400g [Phaseolus vulgaris] Length = 323 Score = 176 bits (445), Expect = 7e-41 Identities = 113/285 (39%), Positives = 149/285 (52%), Gaps = 24/285 (8%) Frame = -2 Query: 1595 GFPEDGPLDILDFLDFPMESLEDDTGGDW-----------DASKSHCLGPIPTD------ 1467 G +D D++ F DFP+E +E+D + AS + G T+ Sbjct: 24 GLSDDIFDDVVGFFDFPLEDVEEDWDSQFKCLEDQHSEIFSASSNGLCGKTQTENPQLGT 83 Query: 1466 ----ALLGLPPVPQ--DNTGNAFLNMLPQSNAPVGGAGETQESGSFQIHSPVSVLDSGGS 1305 + G+ P+ Q G + +P N G ++ F+ +SPVSV +S S Sbjct: 84 EFSVSCNGISPIKQLAKAPGPTYGKTIPLKNVTFNG----KDLHQFRTYSPVSVFESSSS 139 Query: 1304 CSVGKSLPIKSDIVIPV-RTRSKRARPSNLNPWFVMPPISSARXXXXXXXXXXXXXXXXX 1128 SV S + VIPV R RSKR R S+L+P F +P I +A+ Sbjct: 140 SSVENSNFDRP--VIPVKRARSKRQRRSSLSPLFSIPYILNAQALQNQQRTSASESDFET 197 Query: 1127 XXXXKNTDDFFQHSEEELLNASDATEKKDGSQQRTVALKKCTHCEVTKTPQWREGPMGPK 948 ++ H +++L S+ E S + +KC HCEVTKTPQWREGPMGPK Sbjct: 198 NVAGNMSNKVKSHRKKDLSLLSEDVEMMRSSHLVSDPPRKCMHCEVTKTPQWREGPMGPK 257 Query: 947 TLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKK 813 
TLCNACGVRYRSGRLFPEYRPAASPTFV +LHSN HKKV+EMR + Sbjct: 258 TLCNACGVRYRSGRLFPEYRPAASPTFVSSLHSNCHKKVVEMRSR 302 >ref|XP_002273502.1| PREDICTED: GATA transcription factor 9-like [Vitis vinifera] Length = 340 Score = 174 bits (442), Expect = 1e-40 Identities = 123/327 (37%), Positives = 164/327 (50%), Gaps = 46/327 (14%) Frame = -2 Query: 1562 DFLDFPMESLEDDT-GGDWDASKS---HCLGPIP-TDALLGLP------------PVP-Q 1437 D L+FP E + GGD ++ S + P+P D++ P VP + Sbjct: 21 DLLEFPPEDVSGGLMGGDCNSFPSIWTNASDPLPGPDSVFSGPNSNSNSDLSAELSVPYE 80 Query: 1436 DNTGNAFLNMLPQSNAPVGGAGETQESGS---------FQIHSPVSVLDSGGSCSVG--K 1290 D +L+ + + G G +E GS FQ SPVSVL+S SCS G K Sbjct: 81 DIVQLEWLSNFVEDSFSGGSIGLNKEDGSIVKDSPHHQFQTSSPVSVLESSSSCSGGGGK 140 Query: 1289 SLPIKSDIVIPVRTRSKRARPSNLNPWFVMPPISSARXXXXXXXXXXXXXXXXXXXXXKN 1110 ++P+ + R RSKR RP+ NP + IS + Sbjct: 141 TIPLSPNHRGAQRARSKRPRPATFNPRPAIQLISPTSSVTESPQPVLVPKASS------D 194 Query: 1109 TDDFFQHSEEELLNASDATEKKDGSQQR---------------TVALKKCTHCEVTKTPQ 975 ++++ + S + + A E K + + A++KC HCE+TKTPQ Sbjct: 195 SENYAESSPLKKMPKPAAAEHKKKKKMKLSLPLGPVEMNQNPPAQAVRKCMHCEITKTPQ 254 Query: 974 WREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKKASVQET 795 WR GPMGPKTLCNACGVRY+SGRLFPEYRPAASPTFVP LHSNSHKKVIEMR KA + T Sbjct: 255 WRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFVPALHSNSHKKVIEMRNKA-CENT 313 Query: 794 AVLHDVP--VSPPPEFVPMSGSLFDFI 720 A+ P + PPE +P S D++ Sbjct: 314 AMTASPPTGTTSPPELIPNSSVSLDYM 340 >ref|XP_003543479.1| PREDICTED: GATA transcription factor 11-like [Glycine max] Length = 327 Score = 173 bits (438), Expect = 4e-40 Identities = 122/312 (39%), Positives = 165/312 (52%), Gaps = 27/312 (8%) Frame = -2 Query: 1652 NMVEPSYLDGFLDGVPGDKGFPEDGPLDILDFLDFPMESLE-DDTGGDWDASKSHCLGP- 1479 NM + + D +G+ D+ F D+++F DFP+E +E + DWDA P Sbjct: 7 NMKDSWFFDNNFNGL-SDEIFD-----DVINFFDFPLEDVEANGVEEDWDAQLKCLEDPR 60 Query: 1478 --IPTDALLGLPPVPQDN----------TGNAF--LNMLPQSNAPVGGAGETQESGS--- 1350 + T + GL Q+ +GN + L ++ PV G T ++ + Sbjct: 61 VDVYTASSAGLCAKTQNEKPQLGMKFSASGNGISPIKQLGKATGPVYGKTITHQNVTSNG 120 
Query: 1349 -----FQIH--SPVSVLDSGGSCSVGKSLPIKSDIVIPV-RTRSKRARPSNLNPWFVMPP 1194 FQ + SPVSV +S S SV S + VIPV R RSKR RPS+ +P F +P Sbjct: 121 KDLHQFQTYTYSPVSVFESSSSSSVENSNFDRP--VIPVKRARSKRQRPSSFSPLFSIPF 178 Query: 1193 ISSARXXXXXXXXXXXXXXXXXXXXXKNTDDFFQHSEEELLNASDATEKKDGSQQRTVAL 1014 I ++ ++ + +++ SD E S + + Sbjct: 179 ILNSPAMQNHQRIAAADSDFGTNVAGNLSNKLKKQKKKDSSLLSDDVEMMRSSSPESGSP 238 Query: 1013 KKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKK 834 +KC HCEVTKTPQWREGP+GPKTLCNACGVRYRSGRLFPEYRPAASPTFV +LHSN HKK Sbjct: 239 RKCMHCEVTKTPQWREGPVGPKTLCNACGVRYRSGRLFPEYRPAASPTFVASLHSNCHKK 298 Query: 833 VIEMRKKASVQE 798 V+EMR +A +QE Sbjct: 299 VVEMRSRA-IQE 309 >gb|EOY18664.1| Plant-specific GATA-type zinc finger transcription factor family protein isoform 2 [Theobroma cacao] Length = 341 Score = 172 bits (436), Expect = 7e-40 Identities = 101/225 (44%), Positives = 123/225 (54%), Gaps = 15/225 (6%) Frame = -2 Query: 1349 FQIHSPVSVLDSGGSCSVGKSLPIKSDIVIPVR---TRSKRARPSNLNPWFVMPPISSAR 1179 FQ SPVSVL+S SCS K+LP + P R RSKR RP+ NP + IS Sbjct: 122 FQTSSPVSVLESSSSCSGEKTLPRSPETAAPGRRGRARSKRPRPTTFNPRPAIQLISPTS 181 Query: 1178 XXXXXXXXXXXXXXXXXXXXXKNTDDFF----------QHSEEELLNASDATEKKDGSQQ 1029 + +H +++ + S T D +Q Sbjct: 182 SVNENDIPQPFVVPKVPSDSENYAESRLLIKIPRQVNPEHKKKKKIKLSLPTAPADNNQN 241 Query: 1028 RT-VALKKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLH 852 + A++KC HCE+TKTPQWR GPMGPKTLCNACGVRY+SGRLFPEYRPAASPTFVP+LH Sbjct: 242 SSGQAVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFVPSLH 301 Query: 851 SNSHKKVIEMRKKASVQETAVLHDVPVSPPPEFVP-MSGSLFDFI 720 SNSHKKVIEMR K T + V+ PE +P S DF+ Sbjct: 302 SNSHKKVIEMRNKGGAAPTTM-----VTSSPELIPNKSNPALDFM 341 >gb|ESW23870.1| hypothetical protein PHAVU_004G083100g [Phaseolus vulgaris] Length = 336 Score = 171 bits (432), Expect = 2e-39 Identities = 99/222 (44%), Positives = 126/222 (56%), Gaps = 11/222 (4%) Frame = -2 Query: 1349 FQIHSPVSVLDSGGSCSVGKSLPIKSDIVIPV---RTRSKRARPSNLNPWFVMPPISSA- 1182 FQ SPVSVL+S CS K++P +I IPV R RSKRARP+ NP VM IS A 
Sbjct: 119 FQTASPVSVLESSSFCSGEKAVPRSPEIFIPVPCGRARSKRARPTAFNPHPVMQLISPAS 178 Query: 1181 ------RXXXXXXXXXXXXXXXXXXXXXKNTDDFFQHSEEELLNAS-DATEKKDGSQQRT 1023 + F +H +++ + + A + ++GS + Sbjct: 179 STGENTQHNTSTCKASSDSENFAESPIKTPKQAFGEHKKKKKIKVTFSAGQDQNGSPSQ- 237 Query: 1022 VALKKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNS 843 A++KC HCE+TKTPQWR GPMGPKTLCNACGVRY+SGRLFPEYRPAASPTF +HSNS Sbjct: 238 -AVRKCVHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFCAAVHSNS 296 Query: 842 HKKVIEMRKKASVQETAVLHDVPVSPPPEFVPMSGSLFDFIY 717 HKKV+EMR K+ + + PE +P + S Y Sbjct: 297 HKKVLEMRNKSDTKSGFAADS---ASSPELIPNTNSSLSLEY 335 >ref|XP_003597258.1| GATA transcription factor [Medicago truncatula] gi|355486306|gb|AES67509.1| GATA transcription factor [Medicago truncatula] Length = 312 Score = 171 bits (432), Expect = 2e-39 Identities = 117/292 (40%), Positives = 150/292 (51%), Gaps = 31/292 (10%) Frame = -2 Query: 1601 DKGFP--EDGPLDILDFLDFPMESLEDDTGG-DWDASKSHCL-----------GPIPTD- 1467 DK F D D L F DFP+E ++ +T DW A C G I T+ Sbjct: 15 DKNFNGLSDETFDDLKFFDFPLEDVDANTAEEDWSALGEPCFDVFSVSPAVFCGKIKTEN 74 Query: 1466 ---------ALLGLPPVPQD---NTGNAFLNMLPQSNAPVGGAGETQESGSFQIHSPVSV 1323 G+ P+ ++ G + +P N P E +SPVSV Sbjct: 75 PQLGEGFSAPFNGISPIIKEAARTAGPTYGKTIPNQNVPF------YEKKVVLQYSPVSV 128 Query: 1322 LDSGGSCSV---GKSLPIKSDIVIPV-RTRSKRARPSNLNPWFVMPPISSARXXXXXXXX 1155 + + SV G LP VIPV R RSKR RPS+LNP F + I+S + Sbjct: 129 FEGSSASSVENSGFDLP-----VIPVKRARSKRRRPSSLNPVFSISFIASLQALHKKISA 183 Query: 1154 XXXXXXXXXXXXXKNTDDFFQHSEEELLNASDATEKKDGSQQRTVALKKCTHCEVTKTPQ 975 + D + +++ + + D KK SQ+ +V +KCTHCEVT+TPQ Sbjct: 184 --------------SESDLNRVKKQKRMLSGDIETKKSSSQE-SVVQRKCTHCEVTETPQ 228 Query: 974 WREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMR 819 WREGP GPKTLCNACGVRYRSGRL+PEYRPA SPTFV ++HSNSHKKV+EMR Sbjct: 229 WREGPNGPKTLCNACGVRYRSGRLYPEYRPANSPTFVASVHSNSHKKVLEMR 280 >gb|ACU24388.1| unknown [Glycine max] Length = 327 Score = 170 bits (431), Expect = 3e-39 Identities = 121/312 (38%), Positives = 164/312 (52%), Gaps = 
27/312 (8%) Frame = -2 Query: 1652 NMVEPSYLDGFLDGVPGDKGFPEDGPLDILDFLDFPMESLE-DDTGGDWDASKSHCLGP- 1479 NM + + D +G+ D+ F D+++F DFP+E +E + DWDA P Sbjct: 7 NMKDSWFFDNNFNGL-SDEIFD-----DVINFFDFPLEDVEANGVEEDWDAQLKCLEDPR 60 Query: 1478 --IPTDALLGLPPVPQDN----------TGNAF--LNMLPQSNAPVGGAGETQESGS--- 1350 + T + GL Q+ +GN + L ++ PV G T ++ + Sbjct: 61 VDVYTASSAGLCAKTQNEKPQLGMKFSASGNGISPIKQLGKATGPVYGKTITHQNVTSNG 120 Query: 1349 -----FQIH--SPVSVLDSGGSCSVGKSLPIKSDIVIPV-RTRSKRARPSNLNPWFVMPP 1194 FQ + SPVSV +S S SV S + VIPV R RSKR RPS+ +P F +P Sbjct: 121 KDLHQFQTYTYSPVSVFESSSSSSVENSNFDRP--VIPVKRARSKRQRPSSFSPLFSIPF 178 Query: 1193 ISSARXXXXXXXXXXXXXXXXXXXXXKNTDDFFQHSEEELLNASDATEKKDGSQQRTVAL 1014 I ++ ++ + +++ S E S + + Sbjct: 179 ILNSPAMQNHQRIAAADSDFGTNVAGNLSNKLKKQKKKDSSLLSGDVEMMRSSSPESGSP 238 Query: 1013 KKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKK 834 +KC HCEVTKTPQWREGP+GPKTLCNACGVRYRSGRLFPEYRPAASPTFV +LHSN HKK Sbjct: 239 RKCMHCEVTKTPQWREGPVGPKTLCNACGVRYRSGRLFPEYRPAASPTFVASLHSNCHKK 298 Query: 833 VIEMRKKASVQE 798 V+EMR +A +QE Sbjct: 299 VVEMRSRA-IQE 309 >gb|EXB38685.1| Protein-tyrosine sulfotransferase [Morus notabilis] Length = 820 Score = 169 bits (429), Expect = 5e-39 Identities = 113/283 (39%), Positives = 151/283 (53%), Gaps = 28/283 (9%) Frame = -2 Query: 1571 DILDFLDFPMESLEDDTGGDWDASKSHCLGPIPTDALLGLPPV-----PQDNTGN----- 1422 D+L+ DFP+E +E G + D L +P+D +GL V +D++ Sbjct: 515 DLLNIFDFPLEDVE--VGAEKDDWNDIQLLDLPSDISMGLSSVFCSGLQKDSSKEIKNIS 572 Query: 1421 ------AFLNMLPQS--NAPVGGAGETQESGS-------FQIHSPVSVLDSGGSCSVG-- 1293 LN P + GG + +S S F+ SPVS+L+S SC Sbjct: 573 FSYDRTCRLNRSPSAAETTSSGGIVLSDDSSSDIKHIHLFKTSSPVSILESNSSCFAENP 632 Query: 1292 KSLPIKSDIVIPVRTRSK-RARPSNLNPWFVMPPISSARXXXXXXXXXXXXXXXXXXXXX 1116 ++ KS +V R RSK R+RPSN + + +P I++ Sbjct: 633 RTADQKSSVVPVKRPRSKKRSRPSNFDRLYTLPFIAALERLRPSAASESDLGAPQVGKMF 692 Query: 1115 KNTDDFFQHSEEELLNASDATEKKDGSQQRTVALKKCTHCEVTKTPQWREGPMGPKTLCN 936 K + ++ E ++ S Q++ +KKCTHC++T TPQWREGPMGPKTLCN Sbjct: 693 
KTAKKAMK--KKRATPHPIGIEVRNVSSQQSGEIKKCTHCQMTTTPQWREGPMGPKTLCN 750 Query: 935 ACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKKAS 807 ACGVR+RSGRLFPEYRPAASPTFVP+LHSNSHKKVIEMR KAS Sbjct: 751 ACGVRFRSGRLFPEYRPAASPTFVPSLHSNSHKKVIEMRNKAS 793 >ref|XP_003540186.1| PREDICTED: GATA transcription factor 11-like [Glycine max] Length = 326 Score = 169 bits (429), Expect = 5e-39 Identities = 123/316 (38%), Positives = 161/316 (50%), Gaps = 31/316 (9%) Frame = -2 Query: 1652 NMVEPSYLDGFLDGVPGDKGFPEDGPLDILDFLDFPMESLE-DDTGGDWDA--------- 1503 NM + + D +G+ D+ F D+++F DFP+E ++ + DWDA Sbjct: 7 NMKDSWFFDNNFNGL-SDEIFD-----DVINFFDFPLEDVDANGVEEDWDAQLKCLEDPR 60 Query: 1502 -------SKSHC---------LGPIPTDALLGLPPVPQ--DNTGNAFLNMLPQSNAPVGG 1377 S C LG + + G+ P+ Q G A+ +P N G Sbjct: 61 FDVYSASSAGLCAETQNEKPQLGMKLSASSNGISPIKQLAKAPGPAYGKTIPHQNVTSNG 120 Query: 1376 AGETQESGSFQIH--SPVSVLDSGGSCSVGKSLPIKSDIVIPV-RTRSKRARPSNLNPWF 1206 ++ FQ + SPVSV +S S SV S + VIPV R RSKR RPSN +P F Sbjct: 121 ----KDLHQFQTYTYSPVSVFESSSSSSVENSNFDRP--VIPVKRARSKRQRPSNFSPLF 174 Query: 1205 VMPPISSARXXXXXXXXXXXXXXXXXXXXXKNTDDFFQHSEEELLNASDATEKKDGSQQR 1026 +P I + ++ + +++L SD E S Sbjct: 175 SIPLIVNLPAVRKDQRTAASDSDFGTNVAGNLSNKVKKQRKKDLSLLSDV-EMTRSSSPE 233 Query: 1025 TVALKKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSN 846 + +KC HCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFV +LHSN Sbjct: 234 SGPPRKCMHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVASLHSN 293 Query: 845 SHKKVIEMRKKASVQE 798 HKKV+EMR + +QE Sbjct: 294 CHKKVVEMRSRV-IQE 308 >ref|XP_006378769.1| hypothetical protein POPTR_0010s23010g [Populus trichocarpa] gi|566192292|ref|XP_002316371.2| zinc finger family protein [Populus trichocarpa] gi|550330409|gb|ERP56566.1| hypothetical protein POPTR_0010s23010g [Populus trichocarpa] gi|550330410|gb|EEF02542.2| zinc finger family protein [Populus trichocarpa] Length = 352 Score = 168 bits (426), Expect = 1e-38 Identities = 105/245 (42%), Positives = 127/245 (51%), Gaps = 14/245 (5%) Frame = -2 Query: 1439 
QDNTGNAFLNMLPQSNAPVGGAGETQESGSFQIHSPVSVLDSGGSCSVGKSLPIKSDIVI 1260 +D+ L M + +A V T FQ SPVSVL+S CS K+ P +IV Sbjct: 100 EDSFSGGSLTMKKEESASVDKKDSTPHH-QFQTSSPVSVLESSSDCSGEKNAPRSPEIVA 158 Query: 1259 PV---RTRSKRARPSNLNPWFVMP---PISSARXXXXXXXXXXXXXXXXXXXXXKNTDDF 1098 R RSKR RP+ P M P SS + Sbjct: 159 SGKCGRARSKRPRPAAFTPRPAMQLVSPTSSITEVPQQFVSPRVPSDSESFAESRLVIKI 218 Query: 1097 FQHSEEE--------LLNASDATEKKDGSQQRTVALKKCTHCEVTKTPQWREGPMGPKTL 942 +H + E + S E SQ + A++KC HCE+TKTPQWR GPMGPKTL Sbjct: 219 PEHVDPEHKKKKKIKFIVPSGTVEMNQNSQPQQ-AVRKCMHCEITKTPQWRAGPMGPKTL 277 Query: 941 CNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKKASVQETAVLHDVPVSPP 762 CNACGVRY+SGRLFPEYRPAASPTFVP+LHSNSHKKV+EMR KA + T + Sbjct: 278 CNACGVRYKSGRLFPEYRPAASPTFVPSLHSNSHKKVVEMRAKAGEKITTSRPATMMVNS 337 Query: 761 PEFVP 747 PEF+P Sbjct: 338 PEFIP 342 >dbj|BAC98495.1| AG-motif binding protein-5 [Nicotiana tabacum] Length = 342 Score = 168 bits (426), Expect = 1e-38 Identities = 132/341 (38%), Positives = 169/341 (49%), Gaps = 39/341 (11%) Frame = -2 Query: 1652 NMVEPSYLDGFLDGVPGDKGFP---EDGPLDILDFLDFPMESLEDDTGGDWDA--SKSHC 1488 N+V+ F D + FP E L D DFP S+ +D D D+ S SH Sbjct: 4 NLVDEIDCGSFFDHIDDLIDFPLENESAGLSSTDCKDFP--SIWNDPLPDSDSLFSGSHR 61 Query: 1487 LGPIPTDALLGLPPVPQDNTGNAFLNMLPQSNAPVGGAG----------ETQESGSFQIH 1338 A L +P +D +L+ + + GG ET E+ FQ Sbjct: 62 NSASDFSAELSVPY--EDIVQLEWLSTFVEDSFSGGGLTLGKENFPLYKETSEA-KFQTS 118 Query: 1337 SPVSVLDSGGS-----CSVGKSLPIKSDIVI-PVRTRSKRARPSNLNPWFVMPPISSARX 1176 SPVSVL+S S CSV K++P+ S P R RSKR RP+ NP V+ IS Sbjct: 119 SPVSVLESSSSSSSSSCSVEKTVPLSSPCHRGPQRARSKRPRPATFNPAPVIQLISPTSS 178 Query: 1175 XXXXXXXXXXXXXXXXXXXXKNTDDFFQHSEEELLNASDATEKKDG-----------SQQ 1029 +++F + +++L + A +KK + Q Sbjct: 179 FTEIPQPFVARGIAS------ESENFAESPMKKILKPAVAEQKKKKLKLSFPSARVEANQ 232 Query: 1028 RTVA--LKKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTL 855 VA ++KC HCE+TKTPQWR GPMGPKTLCNACGVRY+SGRLFPEYRPAASPTFVP++ Sbjct: 233 NPVAQTIRKCQHCEMTKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFVPSI 292 Query: 854 
HSNSHKKVIEMRKKASVQETAVLHDVPVSPP-----PEFVP 747 HSNSHKKVIEMR K A + +PP PEF P Sbjct: 293 HSNSHKKVIEMRTKFVPDNNANI--ARTAPPATVTQPEFNP 331