BLASTX nr result

ID: Rauwolfia21_contig00000969 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00000969
         (2802 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAC28528.1| GATA-1 zinc finger protein [Nicotiana tabacum]        321   1e-84
ref|XP_006340186.1| PREDICTED: GATA transcription factor 11-like...   310   2e-81
ref|XP_004251141.1| PREDICTED: GATA transcription factor 11-like...   299   4e-78
ref|XP_006365758.1| PREDICTED: GATA transcription factor 10-like...   203   3e-49
ref|XP_004242094.1| PREDICTED: GATA transcription factor 11-like...   200   3e-48
gb|EOY34380.1| GATA zinc finger protein regulating nitrogen assi...   189   7e-45
ref|XP_006424556.1| hypothetical protein CICLE_v10029015mg [Citr...   184   2e-43
ref|XP_006488078.1| PREDICTED: GATA transcription factor 11-like...   184   2e-43
gb|EOY18663.1| Plant-specific GATA-type zinc finger transcriptio...   177   3e-41
gb|AGV54633.1| GATA transcription factor [Phaseolus vulgaris] gi...   176   7e-41
ref|XP_002273502.1| PREDICTED: GATA transcription factor 9-like ...   174   1e-40
ref|XP_003543479.1| PREDICTED: GATA transcription factor 11-like...   173   4e-40
gb|EOY18664.1| Plant-specific GATA-type zinc finger transcriptio...   172   7e-40
gb|ESW23870.1| hypothetical protein PHAVU_004G083100g [Phaseolus...   171   2e-39
ref|XP_003597258.1| GATA transcription factor [Medicago truncatu...   171   2e-39
gb|ACU24388.1| unknown [Glycine max]                                  170   3e-39
gb|EXB38685.1| Protein-tyrosine sulfotransferase [Morus notabilis]    169   5e-39
ref|XP_003540186.1| PREDICTED: GATA transcription factor 11-like...   169   5e-39
ref|XP_006378769.1| hypothetical protein POPTR_0010s23010g [Popu...   168   1e-38
dbj|BAC98495.1| AG-motif binding protein-5 [Nicotiana tabacum]        168   1e-38

>emb|CAC28528.1| GATA-1 zinc finger protein [Nicotiana tabacum]
          Length = 305

 Score =  321 bits (823), Expect = 1e-84
 Identities = 178/309 (57%), Positives = 203/309 (65%), Gaps = 9/309 (2%)
 Frame = -2

Query: 1625 GFLDGVPGDKGFPEDGPLDILDFLDFPMESLEDDTGG-DWDASKSHCLGPIPTDALLGLP 1449
            G+LDG+P      ED   DIL+FLDFP+ESLE+D  G +WDAS+S  LGPIP DAL+  P
Sbjct: 9    GYLDGIPTGPVVDEDFD-DILNFLDFPLESLEEDGQGVEWDASESKFLGPIPMDALMAFP 67

Query: 1448 PVPQDNTGNAFLNMLPQSNAPVGGAGETQESGSFQIHSPVSVLDSGGSCSVGKSLPIKSD 1269
            PVPQ N GN  +   P SN P+    E Q SG FQ  SPVSVL+S  SCS GKS+ IK D
Sbjct: 68   PVPQGNIGNGRVKAEPNSNHPIK-VTEGQGSGIFQTQSPVSVLESSNSCSGGKSISIKHD 126

Query: 1268 IVIPVRTRSKRARPSNLNPWFVMPPISSARXXXXXXXXXXXXXXXXXXXXXKNTDDFFQH 1089
            I IPVR RSKR R S LNPW +MPPISS R                      +       
Sbjct: 127  IAIPVRPRSKRPRSSALNPWILMPPISSTRFASKKTCDARKGKEKKRKMSLLSVPQI--- 183

Query: 1088 SEEELLNASDATEKKDGSQQRTVALKKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRSG 909
                    +D T+KK  S Q+  + KKCTHC+VTKTPQWREGP+GPKTLCNACGVRYRSG
Sbjct: 184  --------ADVTKKKTTSGQQ-FSFKKCTHCQVTKTPQWREGPLGPKTLCNACGVRYRSG 234

Query: 908  RLFPEYRPAASPTFVPTLHSNSHKKVIEMRKKASVQETAVL---HDV-----PVSPPPEF 753
            RLFPEYRPAASPTFVPTLHSNSH+KV+EMRKKA   ET+ L   H+V     P+SP PEF
Sbjct: 235  RLFPEYRPAASPTFVPTLHSNSHRKVVEMRKKAIYGETSALEEPHNVIVEGPPMSPAPEF 294

Query: 752  VPMSGSLFD 726
            VPMS  LFD
Sbjct: 295  VPMSSYLFD 303


>ref|XP_006340186.1| PREDICTED: GATA transcription factor 11-like [Solanum tuberosum]
          Length = 337

 Score =  310 bits (794), Expect = 2e-81
 Identities = 174/338 (51%), Positives = 210/338 (62%), Gaps = 28/338 (8%)
 Frame = -2

Query: 1655 MNMVEPSYLDGFLDGVPGDKGFPEDGPLDILDFLDFPMESLEDDT--GGDWDASKSHCLG 1482
            M MVE     G++DGVP      +D   DIL+FLD PMESLE+D   G +WD S+S   G
Sbjct: 1    MTMVEHG--GGYMDGVPTGPIVDDDFD-DILNFLDMPMESLEEDGLGGVEWDVSESKGFG 57

Query: 1481 PIPTDALLGLPPVPQDNTGNAFLNMLPQSNA-PVGGAGETQESGSFQIHSPVSVLDSGGS 1305
            PIPTDAL+  PP+PQ N GN  +N + +S+  P     E Q +G+FQ  SPVSVL+   S
Sbjct: 58   PIPTDALMDFPPMPQGNIGNRRVNAVAKSHPHPPIKFTEVQGTGTFQTQSPVSVLEGSNS 117

Query: 1304 CSVGKSLPIKSDIVIPVRTRSKRARPSNLNPWFVMPPISSARXXXXXXXXXXXXXXXXXX 1125
            CS GKS+PIK DIVIPVR RSKRARPS +NPW +M PISS R                  
Sbjct: 118  CSGGKSIPIKHDIVIPVRPRSKRARPSAVNPWVLMAPISSTRVASKKISDARKTKEKRRR 177

Query: 1124 XXXKN-----TDDFFQHSEEELLNASDATEKKDGSQQRTVALKKCTHCEVTKTPQWREGP 960
                +       ++ Q   +  L  SD  +KK  S Q++   KKCTHCEVTKTPQWREGP
Sbjct: 178  LSLLSGAKEPMKNYVQQINDAALPLSDVYKKKITSTQQSSFFKKCTHCEVTKTPQWREGP 237

Query: 959  MGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKKA---------- 810
            +GPKTLCNACGVRYRSGRLFPEYRPAASPTFVP++HSNSH+KV+EMRKK           
Sbjct: 238  LGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPSVHSNSHRKVVEMRKKTLYGTGEVEEP 297

Query: 809  ----------SVQETAVLHDVPVSPPPEFVPMSGSLFD 726
                      ++ E  ++ D  +SP PEFVPMS  LFD
Sbjct: 298  PKVIMGRKSEALPEPTIVADPAMSPAPEFVPMSSYLFD 335


>ref|XP_004251141.1| PREDICTED: GATA transcription factor 11-like [Solanum lycopersicum]
          Length = 336

 Score =  299 bits (766), Expect = 4e-78
 Identities = 168/338 (49%), Positives = 205/338 (60%), Gaps = 28/338 (8%)
 Frame = -2

Query: 1655 MNMVEPSYLDGFLDGVPGDKGFPEDGPLDILDFLDFPMESLEDDT--GGDWDASKSHCLG 1482
            M MVE     G++D +P      +D   DIL+FLD PMESLE D   G +WD S+S   G
Sbjct: 1    MTMVEHG--GGYMDEIPTGPIVDDDFD-DILNFLDMPMESLEGDVLGGVEWDVSESKGFG 57

Query: 1481 PIPTDALLGLPPVPQDNTGNAFLNMLPQSNAPVGGAGETQESGSFQIHSPVSVLDSGGSC 1302
            PIPT+AL+   P+PQ N GN  +N +  S+ P+    E Q +G+FQ  SPVSVL+   SC
Sbjct: 58   PIPTEALMDFLPLPQSNIGNRRVNAVANSHPPIKFT-EVQGTGTFQTQSPVSVLEGSNSC 116

Query: 1301 SVGKSLPIKSDIVIPVRTRSKRARPSNLNPWFVMPPISSARXXXXXXXXXXXXXXXXXXX 1122
            S GKS+PIK D VIPVR RSKRARPS +NPW +M PISS R                   
Sbjct: 117  SGGKSVPIKHDPVIPVRPRSKRARPSAVNPWVLMAPISSTRVASKKISDARKTKERRRRL 176

Query: 1121 XXKN-----TDDFFQHSEEELLNASDATEKKDGSQQRTVALKKCTHCEVTKTPQWREGPM 957
               +       ++ Q   +     SD ++KK  S Q++   KKCTHCEVTKTPQWREGP+
Sbjct: 177  SLLSGAKEPMKNYVQQISDAAPPVSDVSKKKITSTQQSSFFKKCTHCEVTKTPQWREGPL 236

Query: 956  GPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKKA----------- 810
            GPKTLCNACGVRYRSGRLFPEYRPAASPTFVP++HSNSH+KV+EMRKK            
Sbjct: 237  GPKTLCNACGVRYRSGRLFPEYRPAASPTFVPSVHSNSHRKVVEMRKKTLYGGAGEVEEP 296

Query: 809  ----------SVQETAVLHDVPVSPPPEFVPMSGSLFD 726
                      ++ E  +  D  +SP PEFVPMS  LFD
Sbjct: 297  PKVIMGRSSEALPEPTIAADPAMSPAPEFVPMSSYLFD 334


>ref|XP_006365758.1| PREDICTED: GATA transcription factor 10-like [Solanum tuberosum]
          Length = 258

 Score =  203 bits (517), Expect = 3e-49
 Identities = 127/281 (45%), Positives = 159/281 (56%), Gaps = 2/281 (0%)
 Frame = -2

Query: 1649 MVEPSYLDGFLDGVPGDKGFPEDGPLDILDFLDFPMESLEDDT-GGDWDASK-SHCLGPI 1476
            MVE +Y+DG   G   D+ F       IL+ LDF M++LE D    DWDA+      GPI
Sbjct: 1    MVEQNYMDGISMGHVVDEDFES-----ILNGLDFSMQNLEADVLEEDWDATVYGELFGPI 55

Query: 1475 PTDALLGLPPVPQDNTGNAFLNMLPQSNAPVGGAGETQESGSFQIHSPVSVLDSGGSCSV 1296
            P++ L+ LP     +  N+ L     +NAP     E+Q +  FQ  SP+SVL++  SCS 
Sbjct: 56   PSETLMSLPL----DIANSCLEDRRMTNAP-NEFLESQGNALFQTGSPISVLENNRSCSG 110

Query: 1295 GKSLPIKSDIVIPVRTRSKRARPSNLNPWFVMPPISSARXXXXXXXXXXXXXXXXXXXXX 1116
            G+S  I  +     R RSKRAR S LNPW +M PI                         
Sbjct: 111  GRSA-ISFNFGSKGR-RSKRARSSTLNPWLMMAPIPC----------------------- 145

Query: 1115 KNTDDFFQHSEEELLNASDATEKKDGSQQRTVALKKCTHCEVTKTPQWREGPMGPKTLCN 936
                     +       SD+   K  S + +   K+CTHCEVTKTPQWREGP+GPKTLCN
Sbjct: 146  ---------TTSAAKKNSDSKSGKLSSAKGSPLFKRCTHCEVTKTPQWREGPLGPKTLCN 196

Query: 935  ACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKK 813
            ACGVRYRSGRL PEYRPAASPTF+P+LHSNSH+KV+EMR+K
Sbjct: 197  ACGVRYRSGRLLPEYRPAASPTFIPSLHSNSHRKVVEMRRK 237


>ref|XP_004242094.1| PREDICTED: GATA transcription factor 11-like [Solanum lycopersicum]
          Length = 252

 Score =  200 bits (508), Expect = 3e-48
 Identities = 123/283 (43%), Positives = 155/283 (54%), Gaps = 2/283 (0%)
 Frame = -2

Query: 1568 ILDFLDFPMESLEDDT-GGDWDASK-SHCLGPIPTDALLGLPPVPQDNTGNAFLNMLPQS 1395
            IL+ LDF +++LE D    DWDA+     LGPIP++ L+ LPP+   N  N F       
Sbjct: 17   ILNGLDFSIQNLEADRLDEDWDATVYGELLGPIPSETLMSLPPLELTNVDNVF------- 69

Query: 1394 NAPVGGAGETQESGSFQIHSPVSVLDSGGSCSVGKSLPIKSDIVIPVRTRSKRARPSNLN 1215
                    E Q +  FQ  SP+SVL++  SCS G+S  I  +     R RSKRAR S LN
Sbjct: 70   -------PEAQGNVIFQTGSPISVLENTRSCSGGRSA-ISFNFGSKGR-RSKRARSSTLN 120

Query: 1214 PWFVMPPISSARXXXXXXXXXXXXXXXXXXXXXKNTDDFFQHSEEELLNASDATEKKDGS 1035
            PW  M P+                           T    ++S+ ++       ++K  S
Sbjct: 121  PWLKMAPMPCT------------------------TSAAKKNSDSKI---GKVNKRKLSS 153

Query: 1034 QQRTVALKKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTL 855
               +   K+CTHCEVTKTPQWREGP+GPKTLCNACGVRYRSGRL PEYRPAASPTF+P+L
Sbjct: 154  AMASPLFKRCTHCEVTKTPQWREGPLGPKTLCNACGVRYRSGRLLPEYRPAASPTFIPSL 213

Query: 854  HSNSHKKVIEMRKKASVQETAVLHDVPVSPPPEFVPMSGSLFD 726
            HSNSHKKV+EMR+K        +   P      FVP+   L D
Sbjct: 214  HSNSHKKVVEMRRK-------TVESSPEFDSQNFVPLGSYLLD 249


>gb|EOY34380.1| GATA zinc finger protein regulating nitrogen assimilation, putative
            [Theobroma cacao]
          Length = 342

 Score =  189 bits (479), Expect = 7e-45
 Identities = 124/323 (38%), Positives = 157/323 (48%), Gaps = 50/323 (15%)
 Frame = -2

Query: 1571 DILDFLDFPMESLE----------------------DDTGG-----DWDASKSHCLGPIP 1473
            D++ +LDFP+E +E                      +D+GG     +WD +  + L P P
Sbjct: 20   DVIKYLDFPLEDVEANDGSGGGSSGEDVIKDFHLPLEDSGGGGGGEEWDCNFQN-LEPPP 78

Query: 1472 TDALLGLP-----PVPQDNTGNAFLNMLPQSNAPVGGAGETQESGS-------------- 1350
             + L GL          DN           S+ P      T+ S S              
Sbjct: 79   ANVLAGLSSGFYGDFFGDNLAKNLTVSCDGSSQPNQQTSTTKASSSRSITLNSESADLKG 138

Query: 1349 ---FQIHSPVSVLDSGGSCSVGKSLPIKSDIVIPV-RTRSKRARPSNLNPWFVMPPISSA 1182
               FQ  SPVSVL+S  SCS     PI  ++  PV R+RSKR R S  N    +P ISS 
Sbjct: 139  SNRFQTSSPVSVLESSSSCSAANPTPIDPNLSFPVKRSRSKRRRVSTFNLHVSLPFISST 198

Query: 1181 RXXXXXXXXXXXXXXXXXXXXXKNTDDFFQHSEEELLNASDATEKKDGSQQRTVALKKCT 1002
                                  + +    Q  +  L   S ++E K    Q+ V ++KC 
Sbjct: 199  SSTSRGSNSLVGSESESESHLTEKSAKKRQKKKRNLTLLSGSSEIKKSPSQQPVVVRKCM 258

Query: 1001 HCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEM 822
            HCEVTKTPQWREGPMGPKTLCNACGVRYRSGRL PEYRPAASPTFV +LHSNSHKKV+EM
Sbjct: 259  HCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLLPEYRPAASPTFVSSLHSNSHKKVVEM 318

Query: 821  RKKASVQETAVLHDVPVSPPPEF 753
            RKKA +  + +   + + P   F
Sbjct: 319  RKKAKLPISVMPSMLSIPPENSF 341


>ref|XP_006424556.1| hypothetical protein CICLE_v10029015mg [Citrus clementina]
            gi|557526490|gb|ESR37796.1| hypothetical protein
            CICLE_v10029015mg [Citrus clementina]
          Length = 280

 Score =  184 bits (467), Expect = 2e-43
 Identities = 97/179 (54%), Positives = 114/179 (63%), Gaps = 1/179 (0%)
 Frame = -2

Query: 1346 QIHSPVSVLDSGGSCSVGKSLPIKSDIVIPV-RTRSKRARPSNLNPWFVMPPISSARXXX 1170
            Q  SP+SVL+SGGSCS  K +PI   +V  V R RSKR RP+ LNP F+ P ISS     
Sbjct: 99   QTSSPISVLESGGSCSADKHVPINPKLVFAVKRARSKRRRPATLNPLFIYPFISSTSSTS 158

Query: 1169 XXXXXXXXXXXXXXXXXXKNTDDFFQHSEEELLNASDATEKKDGSQQRTVALKKCTHCEV 990
                              +      Q  ++ L   S + E K  S Q+T   +KC HCEV
Sbjct: 159  EDYHPETASESGSEMNLTEKPVRKKQKRKKNLTVLSGSRENKKLSFQQTDTPRKCMHCEV 218

Query: 989  TKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKK 813
             +TPQWREGPMGPKTLCNACGVRYRSGRL PEYRPAASPTFVP+LHSNSHK+++EMR K
Sbjct: 219  AETPQWREGPMGPKTLCNACGVRYRSGRLVPEYRPAASPTFVPSLHSNSHKRIMEMRNK 277


>ref|XP_006488078.1| PREDICTED: GATA transcription factor 11-like [Citrus sinensis]
          Length = 277

 Score =  184 bits (466), Expect = 2e-43
 Identities = 98/179 (54%), Positives = 114/179 (63%), Gaps = 1/179 (0%)
 Frame = -2

Query: 1346 QIHSPVSVLDSGGSCSVGKSLPIKSDIVIPV-RTRSKRARPSNLNPWFVMPPISSARXXX 1170
            Q  SP+SVL+SGGSCS  K +PI   +V  V R RSKR RP+ LNP F+ P ISS     
Sbjct: 99   QTSSPISVLESGGSCSAEKHVPINPKLVFAVKRARSKRRRPATLNPLFIYPFISSTSEDY 158

Query: 1169 XXXXXXXXXXXXXXXXXXKNTDDFFQHSEEELLNASDATEKKDGSQQRTVALKKCTHCEV 990
                                     Q  ++ L   S + E K  S Q+T A +KC HCEV
Sbjct: 159  HPETASESGSEMNLTEKPVRKK---QKRKKNLTVLSGSRENKKLSFQQTDAPRKCMHCEV 215

Query: 989  TKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKK 813
             +TPQWREGPMGPKTLCNACGVRYRSGRL PEYRPAASPTFVP+LHSNSHK+++EMR K
Sbjct: 216  AETPQWREGPMGPKTLCNACGVRYRSGRLVPEYRPAASPTFVPSLHSNSHKRIMEMRNK 274


>gb|EOY18663.1| Plant-specific GATA-type zinc finger transcription factor family
            protein isoform 1 [Theobroma cacao]
          Length = 414

 Score =  177 bits (448), Expect = 3e-41
 Identities = 130/365 (35%), Positives = 167/365 (45%), Gaps = 44/365 (12%)
 Frame = -2

Query: 1682 GVYFVLQKKMNMVEPSYLDGFLDGVPGDKGFPEDGPLDILDFLDFPMESLEDDTGGDWDA 1503
            G + +   K NM+ P+    F+D +     F       I D LDFP E +E        A
Sbjct: 63   GFFIIFYIKENMIGPT---NFIDEIDCGSFFDH-----IDDLLDFPNEDVEAGLSASDSA 114

Query: 1502 SKSHCLGPIPTDALLGLPPVPQDNTGNAFLNMLPQSNAPVG-----------------GA 1374
              +     I T     LP      + N+  ++  + + P                   GA
Sbjct: 115  VNASAFPSIWTTHSESLPGSDSVFSNNSASDLSAELSVPYEDIVQLEWLSNFVDDSQCGA 174

Query: 1373 GET---QESGS---------FQIHSPVSVLDSGGSCSVGKSLPIKSDIVIPVR---TRSK 1239
              T   +ES S         FQ  SPVSVL+S  SCS  K+LP   +   P R    RSK
Sbjct: 175  SLTIKKEESSSITKDSSQHQFQTSSPVSVLESSSSCSGEKTLPRSPETAAPGRRGRARSK 234

Query: 1238 RARPSNLNPWFVMPPISSARXXXXXXXXXXXXXXXXXXXXXKNTDDFF----------QH 1089
            R RP+  NP   +  IS                           +             +H
Sbjct: 235  RPRPTTFNPRPAIQLISPTSSVNENDIPQPFVVPKVPSDSENYAESRLLIKIPRQVNPEH 294

Query: 1088 SEEELLNASDATEKKDGSQQRT-VALKKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRS 912
             +++ +  S  T   D +Q  +  A++KC HCE+TKTPQWR GPMGPKTLCNACGVRY+S
Sbjct: 295  KKKKKIKLSLPTAPADNNQNSSGQAVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKS 354

Query: 911  GRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKKASVQETAVLHDVPVSPPPEFVP-MSGS 735
            GRLFPEYRPAASPTFVP+LHSNSHKKVIEMR K     T +     V+  PE +P  S  
Sbjct: 355  GRLFPEYRPAASPTFVPSLHSNSHKKVIEMRNKGGAAPTTM-----VTSSPELIPNKSNP 409

Query: 734  LFDFI 720
              DF+
Sbjct: 410  ALDFM 414


>gb|AGV54633.1| GATA transcription factor [Phaseolus vulgaris]
            gi|561023457|gb|ESW22187.1| hypothetical protein
            PHAVU_005G134400g [Phaseolus vulgaris]
          Length = 323

 Score =  176 bits (445), Expect = 7e-41
 Identities = 113/285 (39%), Positives = 149/285 (52%), Gaps = 24/285 (8%)
 Frame = -2

Query: 1595 GFPEDGPLDILDFLDFPMESLEDDTGGDW-----------DASKSHCLGPIPTD------ 1467
            G  +D   D++ F DFP+E +E+D    +            AS +   G   T+      
Sbjct: 24   GLSDDIFDDVVGFFDFPLEDVEEDWDSQFKCLEDQHSEIFSASSNGLCGKTQTENPQLGT 83

Query: 1466 ----ALLGLPPVPQ--DNTGNAFLNMLPQSNAPVGGAGETQESGSFQIHSPVSVLDSGGS 1305
                +  G+ P+ Q     G  +   +P  N    G    ++   F+ +SPVSV +S  S
Sbjct: 84   EFSVSCNGISPIKQLAKAPGPTYGKTIPLKNVTFNG----KDLHQFRTYSPVSVFESSSS 139

Query: 1304 CSVGKSLPIKSDIVIPV-RTRSKRARPSNLNPWFVMPPISSARXXXXXXXXXXXXXXXXX 1128
             SV  S   +   VIPV R RSKR R S+L+P F +P I +A+                 
Sbjct: 140  SSVENSNFDRP--VIPVKRARSKRQRRSSLSPLFSIPYILNAQALQNQQRTSASESDFET 197

Query: 1127 XXXXKNTDDFFQHSEEELLNASDATEKKDGSQQRTVALKKCTHCEVTKTPQWREGPMGPK 948
                  ++    H +++L   S+  E    S   +   +KC HCEVTKTPQWREGPMGPK
Sbjct: 198  NVAGNMSNKVKSHRKKDLSLLSEDVEMMRSSHLVSDPPRKCMHCEVTKTPQWREGPMGPK 257

Query: 947  TLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKK 813
            TLCNACGVRYRSGRLFPEYRPAASPTFV +LHSN HKKV+EMR +
Sbjct: 258  TLCNACGVRYRSGRLFPEYRPAASPTFVSSLHSNCHKKVVEMRSR 302


>ref|XP_002273502.1| PREDICTED: GATA transcription factor 9-like [Vitis vinifera]
          Length = 340

 Score =  174 bits (442), Expect = 1e-40
 Identities = 123/327 (37%), Positives = 164/327 (50%), Gaps = 46/327 (14%)
 Frame = -2

Query: 1562 DFLDFPMESLEDDT-GGDWDASKS---HCLGPIP-TDALLGLP------------PVP-Q 1437
            D L+FP E +     GGD ++  S   +   P+P  D++   P             VP +
Sbjct: 21   DLLEFPPEDVSGGLMGGDCNSFPSIWTNASDPLPGPDSVFSGPNSNSNSDLSAELSVPYE 80

Query: 1436 DNTGNAFLNMLPQSNAPVGGAGETQESGS---------FQIHSPVSVLDSGGSCSVG--K 1290
            D     +L+   + +   G  G  +E GS         FQ  SPVSVL+S  SCS G  K
Sbjct: 81   DIVQLEWLSNFVEDSFSGGSIGLNKEDGSIVKDSPHHQFQTSSPVSVLESSSSCSGGGGK 140

Query: 1289 SLPIKSDIVIPVRTRSKRARPSNLNPWFVMPPISSARXXXXXXXXXXXXXXXXXXXXXKN 1110
            ++P+  +     R RSKR RP+  NP   +  IS                         +
Sbjct: 141  TIPLSPNHRGAQRARSKRPRPATFNPRPAIQLISPTSSVTESPQPVLVPKASS------D 194

Query: 1109 TDDFFQHSEEELLNASDATEKKDGSQQR---------------TVALKKCTHCEVTKTPQ 975
            ++++ + S  + +    A E K   + +                 A++KC HCE+TKTPQ
Sbjct: 195  SENYAESSPLKKMPKPAAAEHKKKKKMKLSLPLGPVEMNQNPPAQAVRKCMHCEITKTPQ 254

Query: 974  WREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKKASVQET 795
            WR GPMGPKTLCNACGVRY+SGRLFPEYRPAASPTFVP LHSNSHKKVIEMR KA  + T
Sbjct: 255  WRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFVPALHSNSHKKVIEMRNKA-CENT 313

Query: 794  AVLHDVP--VSPPPEFVPMSGSLFDFI 720
            A+    P   + PPE +P S    D++
Sbjct: 314  AMTASPPTGTTSPPELIPNSSVSLDYM 340


>ref|XP_003543479.1| PREDICTED: GATA transcription factor 11-like [Glycine max]
          Length = 327

 Score =  173 bits (438), Expect = 4e-40
 Identities = 122/312 (39%), Positives = 165/312 (52%), Gaps = 27/312 (8%)
 Frame = -2

Query: 1652 NMVEPSYLDGFLDGVPGDKGFPEDGPLDILDFLDFPMESLE-DDTGGDWDASKSHCLGP- 1479
            NM +  + D   +G+  D+ F      D+++F DFP+E +E +    DWDA       P 
Sbjct: 7    NMKDSWFFDNNFNGL-SDEIFD-----DVINFFDFPLEDVEANGVEEDWDAQLKCLEDPR 60

Query: 1478 --IPTDALLGLPPVPQDN----------TGNAF--LNMLPQSNAPVGGAGETQESGS--- 1350
              + T +  GL    Q+           +GN    +  L ++  PV G   T ++ +   
Sbjct: 61   VDVYTASSAGLCAKTQNEKPQLGMKFSASGNGISPIKQLGKATGPVYGKTITHQNVTSNG 120

Query: 1349 -----FQIH--SPVSVLDSGGSCSVGKSLPIKSDIVIPV-RTRSKRARPSNLNPWFVMPP 1194
                 FQ +  SPVSV +S  S SV  S   +   VIPV R RSKR RPS+ +P F +P 
Sbjct: 121  KDLHQFQTYTYSPVSVFESSSSSSVENSNFDRP--VIPVKRARSKRQRPSSFSPLFSIPF 178

Query: 1193 ISSARXXXXXXXXXXXXXXXXXXXXXKNTDDFFQHSEEELLNASDATEKKDGSQQRTVAL 1014
            I ++                        ++   +  +++    SD  E    S   + + 
Sbjct: 179  ILNSPAMQNHQRIAAADSDFGTNVAGNLSNKLKKQKKKDSSLLSDDVEMMRSSSPESGSP 238

Query: 1013 KKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKK 834
            +KC HCEVTKTPQWREGP+GPKTLCNACGVRYRSGRLFPEYRPAASPTFV +LHSN HKK
Sbjct: 239  RKCMHCEVTKTPQWREGPVGPKTLCNACGVRYRSGRLFPEYRPAASPTFVASLHSNCHKK 298

Query: 833  VIEMRKKASVQE 798
            V+EMR +A +QE
Sbjct: 299  VVEMRSRA-IQE 309


>gb|EOY18664.1| Plant-specific GATA-type zinc finger transcription factor family
            protein isoform 2 [Theobroma cacao]
          Length = 341

 Score =  172 bits (436), Expect = 7e-40
 Identities = 101/225 (44%), Positives = 123/225 (54%), Gaps = 15/225 (6%)
 Frame = -2

Query: 1349 FQIHSPVSVLDSGGSCSVGKSLPIKSDIVIPVR---TRSKRARPSNLNPWFVMPPISSAR 1179
            FQ  SPVSVL+S  SCS  K+LP   +   P R    RSKR RP+  NP   +  IS   
Sbjct: 122  FQTSSPVSVLESSSSCSGEKTLPRSPETAAPGRRGRARSKRPRPTTFNPRPAIQLISPTS 181

Query: 1178 XXXXXXXXXXXXXXXXXXXXXKNTDDFF----------QHSEEELLNASDATEKKDGSQQ 1029
                                    +             +H +++ +  S  T   D +Q 
Sbjct: 182  SVNENDIPQPFVVPKVPSDSENYAESRLLIKIPRQVNPEHKKKKKIKLSLPTAPADNNQN 241

Query: 1028 RT-VALKKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLH 852
             +  A++KC HCE+TKTPQWR GPMGPKTLCNACGVRY+SGRLFPEYRPAASPTFVP+LH
Sbjct: 242  SSGQAVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFVPSLH 301

Query: 851  SNSHKKVIEMRKKASVQETAVLHDVPVSPPPEFVP-MSGSLFDFI 720
            SNSHKKVIEMR K     T +     V+  PE +P  S    DF+
Sbjct: 302  SNSHKKVIEMRNKGGAAPTTM-----VTSSPELIPNKSNPALDFM 341


>gb|ESW23870.1| hypothetical protein PHAVU_004G083100g [Phaseolus vulgaris]
          Length = 336

 Score =  171 bits (432), Expect = 2e-39
 Identities = 99/222 (44%), Positives = 126/222 (56%), Gaps = 11/222 (4%)
 Frame = -2

Query: 1349 FQIHSPVSVLDSGGSCSVGKSLPIKSDIVIPV---RTRSKRARPSNLNPWFVMPPISSA- 1182
            FQ  SPVSVL+S   CS  K++P   +I IPV   R RSKRARP+  NP  VM  IS A 
Sbjct: 119  FQTASPVSVLESSSFCSGEKAVPRSPEIFIPVPCGRARSKRARPTAFNPHPVMQLISPAS 178

Query: 1181 ------RXXXXXXXXXXXXXXXXXXXXXKNTDDFFQHSEEELLNAS-DATEKKDGSQQRT 1023
                  +                          F +H +++ +  +  A + ++GS  + 
Sbjct: 179  STGENTQHNTSTCKASSDSENFAESPIKTPKQAFGEHKKKKKIKVTFSAGQDQNGSPSQ- 237

Query: 1022 VALKKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNS 843
             A++KC HCE+TKTPQWR GPMGPKTLCNACGVRY+SGRLFPEYRPAASPTF   +HSNS
Sbjct: 238  -AVRKCVHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFCAAVHSNS 296

Query: 842  HKKVIEMRKKASVQETAVLHDVPVSPPPEFVPMSGSLFDFIY 717
            HKKV+EMR K+  +          +  PE +P + S     Y
Sbjct: 297  HKKVLEMRNKSDTKSGFAADS---ASSPELIPNTNSSLSLEY 335


>ref|XP_003597258.1| GATA transcription factor [Medicago truncatula]
            gi|355486306|gb|AES67509.1| GATA transcription factor
            [Medicago truncatula]
          Length = 312

 Score =  171 bits (432), Expect = 2e-39
 Identities = 117/292 (40%), Positives = 150/292 (51%), Gaps = 31/292 (10%)
 Frame = -2

Query: 1601 DKGFP--EDGPLDILDFLDFPMESLEDDTGG-DWDASKSHCL-----------GPIPTD- 1467
            DK F    D   D L F DFP+E ++ +T   DW A    C            G I T+ 
Sbjct: 15   DKNFNGLSDETFDDLKFFDFPLEDVDANTAEEDWSALGEPCFDVFSVSPAVFCGKIKTEN 74

Query: 1466 ---------ALLGLPPVPQD---NTGNAFLNMLPQSNAPVGGAGETQESGSFQIHSPVSV 1323
                        G+ P+ ++     G  +   +P  N P        E      +SPVSV
Sbjct: 75   PQLGEGFSAPFNGISPIIKEAARTAGPTYGKTIPNQNVPF------YEKKVVLQYSPVSV 128

Query: 1322 LDSGGSCSV---GKSLPIKSDIVIPV-RTRSKRARPSNLNPWFVMPPISSARXXXXXXXX 1155
             +   + SV   G  LP     VIPV R RSKR RPS+LNP F +  I+S +        
Sbjct: 129  FEGSSASSVENSGFDLP-----VIPVKRARSKRRRPSSLNPVFSISFIASLQALHKKISA 183

Query: 1154 XXXXXXXXXXXXXKNTDDFFQHSEEELLNASDATEKKDGSQQRTVALKKCTHCEVTKTPQ 975
                          +  D  +  +++ + + D   KK  SQ+ +V  +KCTHCEVT+TPQ
Sbjct: 184  --------------SESDLNRVKKQKRMLSGDIETKKSSSQE-SVVQRKCTHCEVTETPQ 228

Query: 974  WREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMR 819
            WREGP GPKTLCNACGVRYRSGRL+PEYRPA SPTFV ++HSNSHKKV+EMR
Sbjct: 229  WREGPNGPKTLCNACGVRYRSGRLYPEYRPANSPTFVASVHSNSHKKVLEMR 280


>gb|ACU24388.1| unknown [Glycine max]
          Length = 327

 Score =  170 bits (431), Expect = 3e-39
 Identities = 121/312 (38%), Positives = 164/312 (52%), Gaps = 27/312 (8%)
 Frame = -2

Query: 1652 NMVEPSYLDGFLDGVPGDKGFPEDGPLDILDFLDFPMESLE-DDTGGDWDASKSHCLGP- 1479
            NM +  + D   +G+  D+ F      D+++F DFP+E +E +    DWDA       P 
Sbjct: 7    NMKDSWFFDNNFNGL-SDEIFD-----DVINFFDFPLEDVEANGVEEDWDAQLKCLEDPR 60

Query: 1478 --IPTDALLGLPPVPQDN----------TGNAF--LNMLPQSNAPVGGAGETQESGS--- 1350
              + T +  GL    Q+           +GN    +  L ++  PV G   T ++ +   
Sbjct: 61   VDVYTASSAGLCAKTQNEKPQLGMKFSASGNGISPIKQLGKATGPVYGKTITHQNVTSNG 120

Query: 1349 -----FQIH--SPVSVLDSGGSCSVGKSLPIKSDIVIPV-RTRSKRARPSNLNPWFVMPP 1194
                 FQ +  SPVSV +S  S SV  S   +   VIPV R RSKR RPS+ +P F +P 
Sbjct: 121  KDLHQFQTYTYSPVSVFESSSSSSVENSNFDRP--VIPVKRARSKRQRPSSFSPLFSIPF 178

Query: 1193 ISSARXXXXXXXXXXXXXXXXXXXXXKNTDDFFQHSEEELLNASDATEKKDGSQQRTVAL 1014
            I ++                        ++   +  +++    S   E    S   + + 
Sbjct: 179  ILNSPAMQNHQRIAAADSDFGTNVAGNLSNKLKKQKKKDSSLLSGDVEMMRSSSPESGSP 238

Query: 1013 KKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKK 834
            +KC HCEVTKTPQWREGP+GPKTLCNACGVRYRSGRLFPEYRPAASPTFV +LHSN HKK
Sbjct: 239  RKCMHCEVTKTPQWREGPVGPKTLCNACGVRYRSGRLFPEYRPAASPTFVASLHSNCHKK 298

Query: 833  VIEMRKKASVQE 798
            V+EMR +A +QE
Sbjct: 299  VVEMRSRA-IQE 309


>gb|EXB38685.1| Protein-tyrosine sulfotransferase [Morus notabilis]
          Length = 820

 Score =  169 bits (429), Expect = 5e-39
 Identities = 113/283 (39%), Positives = 151/283 (53%), Gaps = 28/283 (9%)
 Frame = -2

Query: 1571 DILDFLDFPMESLEDDTGGDWDASKSHCLGPIPTDALLGLPPV-----PQDNTGN----- 1422
            D+L+  DFP+E +E   G + D      L  +P+D  +GL  V      +D++       
Sbjct: 515  DLLNIFDFPLEDVE--VGAEKDDWNDIQLLDLPSDISMGLSSVFCSGLQKDSSKEIKNIS 572

Query: 1421 ------AFLNMLPQS--NAPVGGAGETQESGS-------FQIHSPVSVLDSGGSCSVG-- 1293
                    LN  P +      GG   + +S S       F+  SPVS+L+S  SC     
Sbjct: 573  FSYDRTCRLNRSPSAAETTSSGGIVLSDDSSSDIKHIHLFKTSSPVSILESNSSCFAENP 632

Query: 1292 KSLPIKSDIVIPVRTRSK-RARPSNLNPWFVMPPISSARXXXXXXXXXXXXXXXXXXXXX 1116
            ++   KS +V   R RSK R+RPSN +  + +P I++                       
Sbjct: 633  RTADQKSSVVPVKRPRSKKRSRPSNFDRLYTLPFIAALERLRPSAASESDLGAPQVGKMF 692

Query: 1115 KNTDDFFQHSEEELLNASDATEKKDGSQQRTVALKKCTHCEVTKTPQWREGPMGPKTLCN 936
            K      +  ++         E ++ S Q++  +KKCTHC++T TPQWREGPMGPKTLCN
Sbjct: 693  KTAKKAMK--KKRATPHPIGIEVRNVSSQQSGEIKKCTHCQMTTTPQWREGPMGPKTLCN 750

Query: 935  ACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKKAS 807
            ACGVR+RSGRLFPEYRPAASPTFVP+LHSNSHKKVIEMR KAS
Sbjct: 751  ACGVRFRSGRLFPEYRPAASPTFVPSLHSNSHKKVIEMRNKAS 793


>ref|XP_003540186.1| PREDICTED: GATA transcription factor 11-like [Glycine max]
          Length = 326

 Score =  169 bits (429), Expect = 5e-39
 Identities = 123/316 (38%), Positives = 161/316 (50%), Gaps = 31/316 (9%)
 Frame = -2

Query: 1652 NMVEPSYLDGFLDGVPGDKGFPEDGPLDILDFLDFPMESLE-DDTGGDWDA--------- 1503
            NM +  + D   +G+  D+ F      D+++F DFP+E ++ +    DWDA         
Sbjct: 7    NMKDSWFFDNNFNGL-SDEIFD-----DVINFFDFPLEDVDANGVEEDWDAQLKCLEDPR 60

Query: 1502 -------SKSHC---------LGPIPTDALLGLPPVPQ--DNTGNAFLNMLPQSNAPVGG 1377
                   S   C         LG   + +  G+ P+ Q     G A+   +P  N    G
Sbjct: 61   FDVYSASSAGLCAETQNEKPQLGMKLSASSNGISPIKQLAKAPGPAYGKTIPHQNVTSNG 120

Query: 1376 AGETQESGSFQIH--SPVSVLDSGGSCSVGKSLPIKSDIVIPV-RTRSKRARPSNLNPWF 1206
                ++   FQ +  SPVSV +S  S SV  S   +   VIPV R RSKR RPSN +P F
Sbjct: 121  ----KDLHQFQTYTYSPVSVFESSSSSSVENSNFDRP--VIPVKRARSKRQRPSNFSPLF 174

Query: 1205 VMPPISSARXXXXXXXXXXXXXXXXXXXXXKNTDDFFQHSEEELLNASDATEKKDGSQQR 1026
             +P I +                         ++   +  +++L   SD  E    S   
Sbjct: 175  SIPLIVNLPAVRKDQRTAASDSDFGTNVAGNLSNKVKKQRKKDLSLLSDV-EMTRSSSPE 233

Query: 1025 TVALKKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTLHSN 846
            +   +KC HCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFV +LHSN
Sbjct: 234  SGPPRKCMHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVASLHSN 293

Query: 845  SHKKVIEMRKKASVQE 798
             HKKV+EMR +  +QE
Sbjct: 294  CHKKVVEMRSRV-IQE 308


>ref|XP_006378769.1| hypothetical protein POPTR_0010s23010g [Populus trichocarpa]
            gi|566192292|ref|XP_002316371.2| zinc finger family
            protein [Populus trichocarpa] gi|550330409|gb|ERP56566.1|
            hypothetical protein POPTR_0010s23010g [Populus
            trichocarpa] gi|550330410|gb|EEF02542.2| zinc finger
            family protein [Populus trichocarpa]
          Length = 352

 Score =  168 bits (426), Expect = 1e-38
 Identities = 105/245 (42%), Positives = 127/245 (51%), Gaps = 14/245 (5%)
 Frame = -2

Query: 1439 QDNTGNAFLNMLPQSNAPVGGAGETQESGSFQIHSPVSVLDSGGSCSVGKSLPIKSDIVI 1260
            +D+     L M  + +A V     T     FQ  SPVSVL+S   CS  K+ P   +IV 
Sbjct: 100  EDSFSGGSLTMKKEESASVDKKDSTPHH-QFQTSSPVSVLESSSDCSGEKNAPRSPEIVA 158

Query: 1259 PV---RTRSKRARPSNLNPWFVMP---PISSARXXXXXXXXXXXXXXXXXXXXXKNTDDF 1098
                 R RSKR RP+   P   M    P SS                       +     
Sbjct: 159  SGKCGRARSKRPRPAAFTPRPAMQLVSPTSSITEVPQQFVSPRVPSDSESFAESRLVIKI 218

Query: 1097 FQHSEEE--------LLNASDATEKKDGSQQRTVALKKCTHCEVTKTPQWREGPMGPKTL 942
             +H + E         +  S   E    SQ +  A++KC HCE+TKTPQWR GPMGPKTL
Sbjct: 219  PEHVDPEHKKKKKIKFIVPSGTVEMNQNSQPQQ-AVRKCMHCEITKTPQWRAGPMGPKTL 277

Query: 941  CNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHKKVIEMRKKASVQETAVLHDVPVSPP 762
            CNACGVRY+SGRLFPEYRPAASPTFVP+LHSNSHKKV+EMR KA  + T       +   
Sbjct: 278  CNACGVRYKSGRLFPEYRPAASPTFVPSLHSNSHKKVVEMRAKAGEKITTSRPATMMVNS 337

Query: 761  PEFVP 747
            PEF+P
Sbjct: 338  PEFIP 342


>dbj|BAC98495.1| AG-motif binding protein-5 [Nicotiana tabacum]
          Length = 342

 Score =  168 bits (426), Expect = 1e-38
 Identities = 132/341 (38%), Positives = 169/341 (49%), Gaps = 39/341 (11%)
 Frame = -2

Query: 1652 NMVEPSYLDGFLDGVPGDKGFP---EDGPLDILDFLDFPMESLEDDTGGDWDA--SKSHC 1488
            N+V+      F D +     FP   E   L   D  DFP  S+ +D   D D+  S SH 
Sbjct: 4    NLVDEIDCGSFFDHIDDLIDFPLENESAGLSSTDCKDFP--SIWNDPLPDSDSLFSGSHR 61

Query: 1487 LGPIPTDALLGLPPVPQDNTGNAFLNMLPQSNAPVGGAG----------ETQESGSFQIH 1338
                   A L +P   +D     +L+   + +   GG            ET E+  FQ  
Sbjct: 62   NSASDFSAELSVPY--EDIVQLEWLSTFVEDSFSGGGLTLGKENFPLYKETSEA-KFQTS 118

Query: 1337 SPVSVLDSGGS-----CSVGKSLPIKSDIVI-PVRTRSKRARPSNLNPWFVMPPISSARX 1176
            SPVSVL+S  S     CSV K++P+ S     P R RSKR RP+  NP  V+  IS    
Sbjct: 119  SPVSVLESSSSSSSSSCSVEKTVPLSSPCHRGPQRARSKRPRPATFNPAPVIQLISPTSS 178

Query: 1175 XXXXXXXXXXXXXXXXXXXXKNTDDFFQHSEEELLNASDATEKKDG-----------SQQ 1029
                                  +++F +   +++L  + A +KK             + Q
Sbjct: 179  FTEIPQPFVARGIAS------ESENFAESPMKKILKPAVAEQKKKKLKLSFPSARVEANQ 232

Query: 1028 RTVA--LKKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRLFPEYRPAASPTFVPTL 855
              VA  ++KC HCE+TKTPQWR GPMGPKTLCNACGVRY+SGRLFPEYRPAASPTFVP++
Sbjct: 233  NPVAQTIRKCQHCEMTKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFVPSI 292

Query: 854  HSNSHKKVIEMRKKASVQETAVLHDVPVSPP-----PEFVP 747
            HSNSHKKVIEMR K      A +     +PP     PEF P
Sbjct: 293  HSNSHKKVIEMRTKFVPDNNANI--ARTAPPATVTQPEFNP 331

