BLASTX nr result

ID: Mentha29_contig00000574 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00000574
         (1207 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU18404.1| hypothetical protein MIMGU_mgv1a010122mg [Mimulus...   353   1e-94
ref|XP_006351719.1| PREDICTED: GATA transcription factor 28-like...   278   4e-72
ref|XP_004230570.1| PREDICTED: GATA transcription factor 24-like...   277   7e-72
emb|CBI38230.3| unnamed protein product [Vitis vinifera]              271   5e-70
ref|XP_002270361.1| PREDICTED: GATA transcription factor 24 [Vit...   271   5e-70
ref|XP_007042820.1| Zim-like 2 [Theobroma cacao] gi|508706755|gb...   253   9e-65
ref|XP_006359783.1| PREDICTED: GATA transcription factor 24-like...   250   7e-64
ref|XP_002522687.1| GATA transcription factor, putative [Ricinus...   250   7e-64
ref|NP_001265920.1| Hop-interacting protein THI008 [Solanum lyco...   249   1e-63
gb|EXC32989.1| GATA transcription factor 28 [Morus notabilis]         248   3e-63
gb|ADL36691.1| GATA domain class transcription factor [Malus dom...   246   1e-62
ref|XP_007200518.1| hypothetical protein PRUPE_ppa009401mg [Prun...   242   2e-61
ref|XP_002310482.2| hypothetical protein POPTR_0007s03130g [Popu...   241   6e-61
ref|XP_004136886.1| PREDICTED: GATA transcription factor 24-like...   235   3e-59
ref|XP_004170398.1| PREDICTED: GATA transcription factor 24-like...   232   2e-58
ref|NP_850618.1| GATA transcription factor 24 [Arabidopsis thali...   220   1e-54
ref|XP_007023733.1| ZIM-like 1 [Theobroma cacao] gi|508779099|gb...   218   3e-54
ref|XP_006406306.1| hypothetical protein EUTSA_v10020970mg [Eutr...   218   5e-54
ref|XP_006393030.1| hypothetical protein EUTSA_v10011695mg [Eutr...   217   9e-54
ref|XP_002883290.1| hypothetical protein ARALYDRAFT_479637 [Arab...   216   2e-53

>gb|EYU18404.1| hypothetical protein MIMGU_mgv1a010122mg [Mimulus guttatus]
          Length = 321

 Score =  353 bits (905), Expect = 1e-94
 Identities = 186/267 (69%), Positives = 206/267 (77%), Gaps = 10/267 (3%)
 Frame = +3

Query: 69  NPSSQIRYDPPSSAHSPHH-----ALVALDTVALYAGPSDMPPQVAPASGDGAADQLTLS 233
           N  S+I Y P +++HSPH       L  ++  ALYA  +DMPP V P  G+G ADQLTLS
Sbjct: 57  NLPSRIGYVPSNNSHSPHALSGGGGLEVIEADALYA--ADMPPPVGPVPGEGGADQLTLS 114

Query: 234 FQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIPNPGMTPQNHRNLADYPVRSSQPQRAAS 413
           FQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIP PGMTPQNHRNL DYP RSSQPQRAAS
Sbjct: 115 FQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIPTPGMTPQNHRNLGDYPGRSSQPQRAAS 174

Query: 414 LNXXXXXXXXXXXXXXIRYTVRKEVALRMQRKKGQFTSSKSALDEPGSSSADWNGNSGPE 593
           LN              IRYTVRKEVALRMQRKKGQFTSSK+  +EPG+SSADW G S  E
Sbjct: 175 LNRFREKRKERCFDKKIRYTVRKEVALRMQRKKGQFTSSKAVSEEPGASSADWTGTSVQE 234

Query: 594 EQETSCRHCGISSKSTPMMRRGPDGPRTLCNACGLKWANKGVMRVLTKPPVS--MQQH-P 764
           EQETSCRHCG SSKSTPMMRRGPDGPRTLCNACGLKWANKG++R L+K PV+   QQH  
Sbjct: 235 EQETSCRHCGNSSKSTPMMRRGPDGPRTLCNACGLKWANKGILRDLSKVPVTPVQQQHRA 294

Query: 765 MKLNGETNGEDTAV--APSNGMASSSG 839
           +KLNG+ NGEDT V   P+N + +SSG
Sbjct: 295 IKLNGDQNGEDTVVNLPPTNVITTSSG 321


>ref|XP_006351719.1| PREDICTED: GATA transcription factor 28-like [Solanum tuberosum]
          Length = 319

 Score =  278 bits (710), Expect = 4e-72
 Identities = 157/266 (59%), Positives = 177/266 (66%), Gaps = 19/266 (7%)
 Frame = +3

Query: 69  NPSSQIRYDPPSSAHSPHHAL--------------VALDTVALYAGPSDMPPQVAPASGD 206
           NP+  IRYD    +HS  HAL                +   ALY GPS    ++ P +G 
Sbjct: 50  NPTPHIRYDQHHHSHS--HALHNGGAGGSMEMNGVEGVSHNALY-GPSS---EIVPTAGS 103

Query: 207 GAADQLTLSFQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIPNPGMTPQNHRNLADYPVR 386
           GA+DQLTLSFQGEVYVFD+VSPEKVQAVLLLLGGYEVP GIP   + PQ+ R   D+P R
Sbjct: 104 GASDQLTLSFQGEVYVFDAVSPEKVQAVLLLLGGYEVPPGIPAVNVAPQSQRASGDFPGR 163

Query: 387 SSQPQRAASLNXXXXXXXXXXXXXXIRYTVRKEVALRMQRKKGQFTSSKSALDEPGSSSA 566
            +QPQRAASLN              IRYTVRKEVA+RMQRKKGQFTS+KS  DE G SSA
Sbjct: 164 LNQPQRAASLNRFREKRKERCFDKKIRYTVRKEVAMRMQRKKGQFTSAKSIPDEVG-SSA 222

Query: 567 DWNGNSGPEEQETSCRHCGISSKSTPMMRRGPDGPRTLCNACGLKWANKGVMRVLTKPPV 746
           +WN  SG EEQETSCRHC ISSKSTPMMRRGP GPR+LCNACGLKWANKG++R L+K P 
Sbjct: 223 EWNEGSGQEEQETSCRHCNISSKSTPMMRRGPAGPRSLCNACGLKWANKGILRDLSKVPA 282

Query: 747 SMQQHPM-----KLNGETNGEDTAVA 809
              Q        + NGE NG D   A
Sbjct: 283 PGAQDQTAKPSEQSNGEPNGSDAMAA 308


>ref|XP_004230570.1| PREDICTED: GATA transcription factor 24-like [Solanum lycopersicum]
          Length = 326

 Score =  277 bits (708), Expect = 7e-72
 Identities = 155/266 (58%), Positives = 176/266 (66%), Gaps = 19/266 (7%)
 Frame = +3

Query: 69  NPSSQIRYDPPSSAHSPHHAL--------------VALDTVALYAGPSDMPPQVAPASGD 206
           NP+  IRYD    +HS  HAL                +   ALY  PS+    + P +G 
Sbjct: 57  NPTPHIRYDQHHHSHS--HALHNGGAGGSMEMNGVEGVSHNALYGPPSE----IVPTAGS 110

Query: 207 GAADQLTLSFQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIPNPGMTPQNHRNLADYPVR 386
           GA+DQLTLSFQGEVYVFD+VSPEKVQAVLLLLGGYEVP GIP   + PQ+ R   D+P R
Sbjct: 111 GASDQLTLSFQGEVYVFDAVSPEKVQAVLLLLGGYEVPPGIPAVNVVPQSQRASGDFPGR 170

Query: 387 SSQPQRAASLNXXXXXXXXXXXXXXIRYTVRKEVALRMQRKKGQFTSSKSALDEPGSSSA 566
            +QP+RAASLN              IRYTVRKEVA+RMQRKKGQFTS+KS  DE G SSA
Sbjct: 171 LNQPERAASLNRFREKRKERCFDKKIRYTVRKEVAMRMQRKKGQFTSAKSIPDEVG-SSA 229

Query: 567 DWNGNSGPEEQETSCRHCGISSKSTPMMRRGPDGPRTLCNACGLKWANKGVMRVLTKPPV 746
           DWN  SG EEQETSCRHC ISSKSTPMMRRGP GPR+LCNACGLKWANKG++R L+K P 
Sbjct: 230 DWNEGSGQEEQETSCRHCNISSKSTPMMRRGPAGPRSLCNACGLKWANKGILRDLSKVPA 289

Query: 747 SMQQHPM-----KLNGETNGEDTAVA 809
              Q        + +GE NG D   A
Sbjct: 290 PGTQDQTAKPGEQSHGEPNGSDDMAA 315


>emb|CBI38230.3| unnamed protein product [Vitis vinifera]
          Length = 254

 Score =  271 bits (692), Expect = 5e-70
 Identities = 146/237 (61%), Positives = 166/237 (70%), Gaps = 6/237 (2%)
 Frame = +3

Query: 153 LYAGPSDMPPQVAPASGDGAADQLTLSFQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIP 332
           LY   SD  P      G G  DQLTLSFQGEVYVFD+VSPEKVQAVLLLLGGYEVPTGIP
Sbjct: 12  LYVPGSDFAPVAGGGGGGGGVDQLTLSFQGEVYVFDAVSPEKVQAVLLLLGGYEVPTGIP 71

Query: 333 NPGMTPQNHRNLADYPVRSSQPQRAASLNXXXXXXXXXXXXXXIRYTVRKEVALRMQRKK 512
            PGM P N R LAD+  RSSQPQRAASL+              IRYTVRKEVALRMQRKK
Sbjct: 72  APGMVPPNQRGLADFTGRSSQPQRAASLSRFREKRKERCFDKKIRYTVRKEVALRMQRKK 131

Query: 513 GQFTSSKSALDE-PGSSSADWNG--NSGPEEQETSCRHCGISSKSTPMMRRGPDGPRTLC 683
           GQFTSSK++ DE  G +S+DWN    SG +E E  C HCG SSK+TPMMRRGP GPR+LC
Sbjct: 132 GQFTSSKASSDEVGGGASSDWNAAHGSGQDEPEILCTHCGTSSKTTPMMRRGPAGPRSLC 191

Query: 684 NACGLKWANKGVMRVLTKPPVSMQQHPMKL---NGETNGEDTAVAPSNGMASSSGDN 845
           NACGLKWANKGV+R L++    +Q+  +K    NG+ N E  A+     + SS+GDN
Sbjct: 192 NACGLKWANKGVLRDLSRVSSGVQETSLKATQSNGDAN-ESGAITTVPDIVSSNGDN 247


>ref|XP_002270361.1| PREDICTED: GATA transcription factor 24 [Vitis vinifera]
          Length = 302

 Score =  271 bits (692), Expect = 5e-70
 Identities = 146/237 (61%), Positives = 166/237 (70%), Gaps = 6/237 (2%)
 Frame = +3

Query: 153 LYAGPSDMPPQVAPASGDGAADQLTLSFQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIP 332
           LY   SD  P      G G  DQLTLSFQGEVYVFD+VSPEKVQAVLLLLGGYEVPTGIP
Sbjct: 60  LYVPGSDFAPVAGGGGGGGGVDQLTLSFQGEVYVFDAVSPEKVQAVLLLLGGYEVPTGIP 119

Query: 333 NPGMTPQNHRNLADYPVRSSQPQRAASLNXXXXXXXXXXXXXXIRYTVRKEVALRMQRKK 512
            PGM P N R LAD+  RSSQPQRAASL+              IRYTVRKEVALRMQRKK
Sbjct: 120 APGMVPPNQRGLADFTGRSSQPQRAASLSRFREKRKERCFDKKIRYTVRKEVALRMQRKK 179

Query: 513 GQFTSSKSALDE-PGSSSADWNG--NSGPEEQETSCRHCGISSKSTPMMRRGPDGPRTLC 683
           GQFTSSK++ DE  G +S+DWN    SG +E E  C HCG SSK+TPMMRRGP GPR+LC
Sbjct: 180 GQFTSSKASSDEVGGGASSDWNAAHGSGQDEPEILCTHCGTSSKTTPMMRRGPAGPRSLC 239

Query: 684 NACGLKWANKGVMRVLTKPPVSMQQHPMKL---NGETNGEDTAVAPSNGMASSSGDN 845
           NACGLKWANKGV+R L++    +Q+  +K    NG+ N E  A+     + SS+GDN
Sbjct: 240 NACGLKWANKGVLRDLSRVSSGVQETSLKATQSNGDAN-ESGAITTVPDIVSSNGDN 295


>ref|XP_007042820.1| Zim-like 2 [Theobroma cacao] gi|508706755|gb|EOX98651.1| Zim-like 2
           [Theobroma cacao]
          Length = 313

 Score =  253 bits (647), Expect = 9e-65
 Identities = 141/240 (58%), Positives = 167/240 (69%), Gaps = 9/240 (3%)
 Frame = +3

Query: 153 LYAGPSDMPPQVAPASGDGAADQLTLSFQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIP 332
           +Y   SD+   V P  G+G +DQLTLSFQGEVYVFDSVSP+KVQAVLLLLGGYE+P+GIP
Sbjct: 71  IYGQGSDLT--VVP--GNGGSDQLTLSFQGEVYVFDSVSPDKVQAVLLLLGGYEIPSGIP 126

Query: 333 NPGMTPQNHRNLADYPVRSSQPQRAASLNXXXXXXXXXXXXXXIRYTVRKEVALRMQRKK 512
             G  P   R L D+P R+ QPQRAASLN              IRYTVRKEVALRMQRKK
Sbjct: 127 ALGTVPVTQRGLGDFPGRAIQPQRAASLNRFREKRKERCFDKKIRYTVRKEVALRMQRKK 186

Query: 513 GQFTSSKSALDEPGSSSADWN--GNSGPEE--QETSCRHCGISSKSTPMMRRGPDGPRTL 680
           GQFTSSK+  DE  S+S+ W+    SG +E  +ETSC HCGISSKSTPMMRRGP GPRTL
Sbjct: 187 GQFTSSKAISDEVASASSGWSVTPGSGQDESMEETSCTHCGISSKSTPMMRRGPTGPRTL 246

Query: 681 CNACGLKWANKGVMRVLTK-PPVSMQQHPMK----LNGETNGEDTAVAPSNGMASSSGDN 845
           CNACGLKWANKGV+R L+K   + +Q    K     + E N  +     ++ ++SS+GDN
Sbjct: 247 CNACGLKWANKGVLRDLSKVSTIPIQDASAKPTEQSDAEANDSEAVTVTTDVVSSSNGDN 306


>ref|XP_006359783.1| PREDICTED: GATA transcription factor 24-like [Solanum tuberosum]
          Length = 325

 Score =  250 bits (639), Expect = 7e-64
 Identities = 131/218 (60%), Positives = 157/218 (72%), Gaps = 3/218 (1%)
 Frame = +3

Query: 201 GDGAADQLTLSFQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIPNPGMTPQNHRNLADYP 380
           G G++DQLTLSF+GEV+V+D+VSPEKVQAVLLLLGGYEVP GIP   M  Q+HR  ++ P
Sbjct: 102 GGGSSDQLTLSFRGEVFVYDAVSPEKVQAVLLLLGGYEVPAGIPTVNMASQSHRASSEGP 161

Query: 381 VRSSQPQRAASLNXXXXXXXXXXXXXXIRYTVRKEVALRMQRKKGQFTSSKSALDEPGSS 560
            R +QPQRAASL+              IRYTVRKEVALRMQRKKGQFTSSK   DE  SS
Sbjct: 162 GRLNQPQRAASLSRFREKRKERCFDKKIRYTVRKEVALRMQRKKGQFTSSKPVSDEAASS 221

Query: 561 SADWNGNSGPEEQETSCRHCGISSKSTPMMRRGPDGPRTLCNACGLKWANKGVMRVLTK- 737
           SA+ N  S  EEQET CRHCG +SKSTPMMRRGP GPR+LCNACGL WANKG++R L+K 
Sbjct: 222 SAEGNAGSSQEEQETLCRHCGTNSKSTPMMRRGPAGPRSLCNACGLTWANKGILRDLSKV 281

Query: 738 PPVSMQQHPMKLNGETNGE--DTAVAPSNGMASSSGDN 845
                Q+H +K + + NGE   + V  + G+ +S  +N
Sbjct: 282 STTGAQEHSVKSSEQNNGEADGSDVMAAAGIITSDDEN 319


>ref|XP_002522687.1| GATA transcription factor, putative [Ricinus communis]
           gi|223538163|gb|EEF39774.1| GATA transcription factor,
           putative [Ricinus communis]
          Length = 311

 Score =  250 bits (639), Expect = 7e-64
 Identities = 140/237 (59%), Positives = 163/237 (68%), Gaps = 9/237 (3%)
 Frame = +3

Query: 162 GPSDMPPQVAPASGDGAADQLTLSFQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIPNPG 341
           G  D P  +    G G++DQLTLSFQGEVYVFD+VSP+KVQAVLLLLGGYE+P+GIP   
Sbjct: 70  GDPDYP--LVAVYGGGSSDQLTLSFQGEVYVFDAVSPDKVQAVLLLLGGYEIPSGIPTTE 127

Query: 342 MTPQNHRNLADYPVRSSQPQRAASLNXXXXXXXXXXXXXXIRYTVRKEVALRMQRKKGQF 521
               N R   D   RS+QP RAASL               IRYTVRKEVALRMQRKKGQF
Sbjct: 128 TVSLNQRGYTDLSGRSTQPHRAASLRRFREKRKERCFDKKIRYTVRKEVALRMQRKKGQF 187

Query: 522 TSSKSALDEPGSSSADWNG--NSGPEE--QETSCRHCGISSKSTPMMRRGPDGPRTLCNA 689
           TSSK++ DE GS S+ W+G   SG +E   ETSC HCGISSKSTPMMRRGP GPRTLCNA
Sbjct: 188 TSSKNSSDEMGSGSSLWSGPQGSGQDESLMETSCTHCGISSKSTPMMRRGPAGPRTLCNA 247

Query: 690 CGLKWANKGVMRVLTK-PPVSMQQHPMK--LNGETNGEDTAVAPSNG--MASSSGDN 845
           CGLKWANKG++R L+K P   +Q  P K    GE    +TAV  + G  +++S+GDN
Sbjct: 248 CGLKWANKGILRDLSKMPSAGIQGPPAKPMEQGEGEANNTAVVTAGGERLSTSNGDN 304


>ref|NP_001265920.1| Hop-interacting protein THI008 [Solanum lycopersicum]
           gi|365222862|gb|AEW69783.1| Hop-interacting protein
           THI008 [Solanum lycopersicum]
          Length = 317

 Score =  249 bits (637), Expect = 1e-63
 Identities = 132/219 (60%), Positives = 155/219 (70%), Gaps = 4/219 (1%)
 Frame = +3

Query: 201 GDGAADQLTLSFQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIPNPGMTPQNHRNLADYP 380
           G G++DQLTLSF+GEV+V+D+VSPEKVQAVLLLLGGYEVP GIP   M  Q+HR  ++ P
Sbjct: 95  GGGSSDQLTLSFRGEVFVYDAVSPEKVQAVLLLLGGYEVPAGIPTVNMASQSHRASSEGP 154

Query: 381 VRSSQPQRAASLNXXXXXXXXXXXXXXIRYTVRKEVALRMQRKKGQFTSSKSALDEPGSS 560
            R +QPQRAASL+              IRYTVRKEVALRMQRKKGQFTSSK+  DE  SS
Sbjct: 155 GRLNQPQRAASLSRFREKRKERCFDKKIRYTVRKEVALRMQRKKGQFTSSKTVSDEAASS 214

Query: 561 SADWNGNSGPEEQETSCRHCGISSKSTPMMRRGPDGPRTLCNACGLKWANKGVMRVLTKP 740
           SA+ N  S  EEQET CRHCG SSKSTPMMRRGP GPR+LCNACGL WANKG++R L+K 
Sbjct: 215 SAEGNAGSSQEEQETLCRHCGTSSKSTPMMRRGPAGPRSLCNACGLTWANKGILRDLSKV 274

Query: 741 PVSMQQH----PMKLNGETNGEDTAVAPSNGMASSSGDN 845
             +  Q       + NGE +G D   A   G+ +S  +N
Sbjct: 275 STTGAQELSVKSSEQNGEADGSDVMAAA--GIITSDDEN 311


>gb|EXC32989.1| GATA transcription factor 28 [Morus notabilis]
          Length = 310

 Score =  248 bits (634), Expect = 3e-63
 Identities = 142/263 (53%), Positives = 181/263 (68%), Gaps = 9/263 (3%)
 Frame = +3

Query: 81  QIRYDPPSSAHSPHHALVALDTVALYA-GPSDMPPQVAPASGDGAADQLTLSFQGEVYVF 257
           QIR+D  ++A +    +  + + ALY  G +D     AP + +G +DQLTLSFQGEVYVF
Sbjct: 45  QIRFDDAAAAMN---GIQDVPSNALYVPGVADY----APVAENGGSDQLTLSFQGEVYVF 97

Query: 258 DSVSPEKVQAVLLLLGGYEVPTGIPNPGMTPQNHRNLADYPVRSSQPQRAASLNXXXXXX 437
           D+VSP+KVQAVLLLLGGYE+P+GIP  G TP   R +  +  +  QPQRAASLN      
Sbjct: 98  DAVSPDKVQAVLLLLGGYEIPSGIPAMGATPIGQRGMNQFVAKPIQPQRAASLNRFREKR 157

Query: 438 XXXXXXXXIRYTVRKEVALRMQRKKGQFTSSKSALDEPGSSSADWNG--NSGPEE--QET 605
                   IRY VRKEVA+RMQRKKGQFTS+K++ +E GS+S+ WN    SG +E  QET
Sbjct: 158 KERCFDKKIRYNVRKEVAMRMQRKKGQFTSAKTSSEELGSASSVWNATPGSGQDENMQET 217

Query: 606 SCRHCGISSKSTPMMRRGPDGPRTLCNACGLKWANKGVMRVLTKP-PVSMQQHPMKLNGE 782
           SC HCGISSKSTPMMRRGP GPRTLCNACGLKWANKG++R L+K    ++Q   +K   +
Sbjct: 218 SCTHCGISSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVLNGNVQDASVKETEQ 277

Query: 783 TNG---EDTAVAPSNGMASSSGD 842
           ++G   +  AV  +  +ASS+GD
Sbjct: 278 SDGDANDSAAVTTTANIASSNGD 300


>gb|ADL36691.1| GATA domain class transcription factor [Malus domestica]
          Length = 294

 Score =  246 bits (629), Expect = 1e-62
 Identities = 133/237 (56%), Positives = 162/237 (68%), Gaps = 6/237 (2%)
 Frame = +3

Query: 153 LYAGPSDMPPQVAPASGDGAADQLTLSFQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIP 332
           LY   S+ PP   PA+ +GA+DQLTLSFQGEVYVFD+VSP+KVQAVLLLLGGYE+P+GIP
Sbjct: 52  LYLPSSEYPP---PAAANGASDQLTLSFQGEVYVFDAVSPDKVQAVLLLLGGYEIPSGIP 108

Query: 333 NPGMTPQNHRNLADYPVRSSQPQRAASLNXXXXXXXXXXXXXXIRYTVRKEVALRMQRKK 512
           + G  P N + + D P + +QPQRAASL+              IRYTVRKEVALRMQRKK
Sbjct: 109 SMGPVPLNQQGMNDLPAKPTQPQRAASLSRFREKRKERCFDKKIRYTVRKEVALRMQRKK 168

Query: 513 GQFTSSKSALDEPGSSSADWNGNSGPEEQETSCRHCGISSKSTPMMRRGPDGPRTLCNAC 692
           GQFTSSK++ D+ G +S+          QETSC HCGISSKSTPMMRRGP GPRTLCNAC
Sbjct: 169 GQFTSSKASSDDGGPASSTQGSGQDESMQETSCTHCGISSKSTPMMRRGPAGPRTLCNAC 228

Query: 693 GLKWANKGVMRVLTKP-PVSMQQHPMK----LNGETNGEDTAVAPSN-GMASSSGDN 845
           GLKWANKG +  + K   V +Q   +K    ++G     D     +N   +S++GDN
Sbjct: 229 GLKWANKGSLTGVPKVLNVGIQDPSLKGIEQIDGGVQDSDVVAMGANIAPSSANGDN 285


>ref|XP_007200518.1| hypothetical protein PRUPE_ppa009401mg [Prunus persica]
           gi|462395918|gb|EMJ01717.1| hypothetical protein
           PRUPE_ppa009401mg [Prunus persica]
          Length = 294

 Score =  242 bits (618), Expect = 2e-61
 Identities = 140/261 (53%), Positives = 164/261 (62%), Gaps = 10/261 (3%)
 Frame = +3

Query: 93  DPPSSAHSPHHALV---ALDTVALYAGPSDMPPQVAPASGDGAADQLTLSFQGEVYVFDS 263
           D   S  +PH       A+    LY   S+ PP  A    +G +DQLTLSFQGEVYVFD 
Sbjct: 28  DVEESIDNPHIRFEDSSAIPPNPLYLTSSEYPPAAAT---NGGSDQLTLSFQGEVYVFDE 84

Query: 264 VSPEKVQAVLLLLGGYEVPTGIPNPGMTPQNHRNLADYPVRSSQPQRAASLNXXXXXXXX 443
           VSP+KVQAVLLLLGGYE+P+GIP+ G  P N + + D PV+  QPQRAASL+        
Sbjct: 85  VSPDKVQAVLLLLGGYEIPSGIPSMGPVPLNQQGMNDLPVKPIQPQRAASLSRFREKRKE 144

Query: 444 XXXXXXIRYTVRKEVALRMQRKKGQFTSSKSALDEPGSSSADWNGNSGPEE--QETSCRH 617
                 IRYTVRKEVALRMQRKKGQFTSSK++ D+ G +S+     SG +E  QETSC H
Sbjct: 145 RCFDKKIRYTVRKEVALRMQRKKGQFTSSKASSDDGGPASSGATQGSGQDESMQETSCMH 204

Query: 618 CGISSKSTPMMRRGPDGPRTLCNACGLKWANKGVMRVLTKPPVSMQQHPM-----KLNGE 782
           CGISSKSTPMMRRGP GPRTLCNACGLKWANKGV+    K      Q P      + +GE
Sbjct: 205 CGISSKSTPMMRRGPAGPRTLCNACGLKWANKGVLTGGPKVSNIGMQDPSAKGIEQGDGE 264

Query: 783 TNGEDTAVAPSNGMASSSGDN 845
                     +N   S +GDN
Sbjct: 265 AKDSVAITMGANIAPSPNGDN 285


>ref|XP_002310482.2| hypothetical protein POPTR_0007s03130g [Populus trichocarpa]
           gi|550334020|gb|EEE90932.2| hypothetical protein
           POPTR_0007s03130g [Populus trichocarpa]
          Length = 318

 Score =  241 bits (614), Expect = 6e-61
 Identities = 138/225 (61%), Positives = 159/225 (70%), Gaps = 15/225 (6%)
 Frame = +3

Query: 216 DQLTLSFQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIPNPGMTPQNHR--NLADYPV-- 383
           DQLTLSFQGEVYVFD+V+P+KVQAVLLLLGGYE+P+GIP  G  P N R  N   Y +  
Sbjct: 88  DQLTLSFQGEVYVFDAVAPDKVQAVLLLLGGYEIPSGIPAMGTVPNNQRTPNHGIYDLSG 147

Query: 384 --RSSQPQRAASLNXXXXXXXXXXXXXXIRYTVRKEVALRMQRKKGQFTSSKSALDEPGS 557
             RS QP RAASL+              IRYTVRKEVALRMQRKKGQFTSSK+  DE GS
Sbjct: 148 TGRSIQPHRAASLSRFREKRKERCFDKKIRYTVRKEVALRMQRKKGQFTSSKANSDEGGS 207

Query: 558 SSADWNG--NSGPEEQ--ETSCRHCGISSKSTPMMRRGPDGPRTLCNACGLKWANKGVMR 725
           +S+  +G   SG +E   ET C HCGISSKSTPMMRRGP GPRTLCNACGLKWANKGV+R
Sbjct: 208 ASSGCSGMQGSGQDESMLETLCTHCGISSKSTPMMRRGPSGPRTLCNACGLKWANKGVLR 267

Query: 726 VLTKPPV-SMQQHPMK----LNGETNGEDTAVAPSNGMASSSGDN 845
            ++K P+ S+QQ  MK    +NGE N  DT  A ++   S +GDN
Sbjct: 268 NISKLPIMSIQQSSMKTVAQVNGEANNSDTITAAAD-TVSPNGDN 311


>ref|XP_004136886.1| PREDICTED: GATA transcription factor 24-like [Cucumis sativus]
          Length = 321

 Score =  235 bits (599), Expect = 3e-59
 Identities = 133/233 (57%), Positives = 158/233 (67%), Gaps = 8/233 (3%)
 Frame = +3

Query: 192 PASGDGAADQLTLSFQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIPNPGMTPQNHRNLA 371
           P +G+G ADQLTLSF+GEVY FDSVSP+KVQAVLLLLGGYE+P+GIP  G  P N +   
Sbjct: 83  PLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLGGYEIPSGIPAIGSAPVNQQGAD 142

Query: 372 DYPVRSSQPQRAASLNXXXXXXXXXXXXXXIRYTVRKEVALRMQRKKGQFTSSKSALDEP 551
            + VRS QPQRAASL+              IRY+VRKEVALRMQRKKGQF SSK+  DE 
Sbjct: 143 GFTVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKEVALRMQRKKGQFISSKAIGDEV 202

Query: 552 GSSSA-DWNGNSGPEE--QETSCRHCGISSKSTPMMRRGPDGPRTLCNACGLKWANKGVM 722
           GSSS      +SG ++   ETSC HCG SSKSTPMMRRGP GPRTLCNACGLKWANKG++
Sbjct: 203 GSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGIL 262

Query: 723 RVLTKPPVSMQQHPM-----KLNGETNGEDTAVAPSNGMASSSGDN*PPLILI 866
           R L+K      Q P      + +GE   E  A A +  + +S+GD  P  +L+
Sbjct: 263 RDLSKVSNPSIQEPSAKEIEQSDGEAANEHNA-AINVDILTSNGDKKPQKVLV 314


>ref|XP_004170398.1| PREDICTED: GATA transcription factor 24-like [Cucumis sativus]
          Length = 304

 Score =  232 bits (592), Expect = 2e-58
 Identities = 132/228 (57%), Positives = 155/228 (67%), Gaps = 8/228 (3%)
 Frame = +3

Query: 192 PASGDGAADQLTLSFQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIPNPGMTPQNHRNLA 371
           P +G+G ADQLTLSF+GEVY FDSVSP+KVQAVLLLLGGYE+P+GIP  G  P N +   
Sbjct: 74  PLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLGGYEIPSGIPAIGSAPVNQQGAD 133

Query: 372 DYPVRSSQPQRAASLNXXXXXXXXXXXXXXIRYTVRKEVALRMQRKKGQFTSSKSALDEP 551
            + VRS QPQRAASL+              IRY+VRKEVALRMQRKKGQF SSK+  DE 
Sbjct: 134 GFTVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKEVALRMQRKKGQFISSKAIGDEV 193

Query: 552 GSSSA-DWNGNSGPEE--QETSCRHCGISSKSTPMMRRGPDGPRTLCNACGLKWANKGVM 722
           GSSS      +SG ++   ETSC HCG SSKSTPMMRRGP GPRTLCNACGLKWANKG++
Sbjct: 194 GSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGIL 253

Query: 723 RVLTKPPVSMQQHPM-----KLNGETNGEDTAVAPSNGMASSSGDN*P 851
           R L+K      Q P      + +GE   E  A A +  + +S+GD  P
Sbjct: 254 RDLSKVSNPSIQEPSAKEIEQSDGEAANEHNA-AINVDILTSNGDKKP 300


>ref|NP_850618.1| GATA transcription factor 24 [Arabidopsis thaliana]
           gi|14596059|gb|AAK68757.1| Unknown protein [Arabidopsis
           thaliana] gi|17978695|gb|AAL47341.1| unknown protein
           [Arabidopsis thaliana] gi|332642950|gb|AEE76471.1| GATA
           transcription factor 24 [Arabidopsis thaliana]
          Length = 295

 Score =  220 bits (560), Expect = 1e-54
 Identities = 125/217 (57%), Positives = 142/217 (65%), Gaps = 10/217 (4%)
 Frame = +3

Query: 216 DQLTLSFQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIPNP-GMTPQNHRNLADYPVRSS 392
           DQLTLSFQG+VYVFD VSPEKVQAVLLLLGG EVP  +P   G   QN+R L+  P R S
Sbjct: 78  DQLTLSFQGQVYVFDRVSPEKVQAVLLLLGGREVPHTLPTTLGSPHQNNRGLSGTPQRLS 137

Query: 393 QPQRAASLNXXXXXXXXXXXXXXIRYTVRKEVALRMQRKKGQFTSSKSALDEPGSSSADW 572
            PQR ASL               IRYTVRKEVALRMQRKKGQFTS+KS+ D+ GS+ +DW
Sbjct: 138 VPQRLASLLRFREKRKGRNFDKTIRYTVRKEVALRMQRKKGQFTSAKSSNDDSGSTGSDW 197

Query: 573 NGNS-----GPEEQ--ETSCRHCGISSKSTPMMRRGPDGPRTLCNACGLKWANKGVMRVL 731
             N      G E Q  E  CRHCG S KSTPMMRRGPDGPRTLCNACGL WANKG +R L
Sbjct: 198 GSNQSWAVEGTETQKPEVLCRHCGTSEKSTPMMRRGPDGPRTLCNACGLMWANKGTLRDL 257

Query: 732 TK--PPVSMQQHPMKLNGETNGEDTAVAPSNGMASSS 836
           +K  PP + Q   +  N + N E   +    G  S++
Sbjct: 258 SKVPPPQTPQHLSLNKNEDANLEADQMMEVTGDISNT 294


>ref|XP_007023733.1| ZIM-like 1 [Theobroma cacao] gi|508779099|gb|EOY26355.1| ZIM-like 1
           [Theobroma cacao]
          Length = 308

 Score =  218 bits (556), Expect = 3e-54
 Identities = 130/254 (51%), Positives = 151/254 (59%), Gaps = 16/254 (6%)
 Frame = +3

Query: 69  NPSSQIRYDPPSSAHSPHHALVALDTVA------LYAG--PSDMPPQVAPASGDGAADQL 224
           N +  +  D    AH  HH     D V       + AG  PSD P  ++   G    DQL
Sbjct: 35  NGNGMVDDDDVHHAHHHHHHHDVDDNVGCGEAEGVEAGDLPSDHPGVLSDNQGPDNGDQL 94

Query: 225 TLSFQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIPNPGMTPQNHRNLADYPVRSSQPQR 404
           TLSFQG+VYV+DSV PEKVQAVLLLLGG EVP  +P   +T QN+R L   P R S PQR
Sbjct: 95  TLSFQGQVYVYDSVPPEKVQAVLLLLGGREVPPTMPAIPITTQNNRGLPGTPQRFSVPQR 154

Query: 405 AASLNXXXXXXXXXXXXXXIRYTVRKEVALRMQRKKGQFTSSKSALDEPGSSSADWNGN- 581
            ASL               IRYTVRKEVALRMQR KGQFTSSK   D+  S+++    N 
Sbjct: 155 LASLLRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPNTDDSVSAASSLGSNQ 214

Query: 582 ------SGPEEQETSCRHCGISSKSTPMMRRGPDGPRTLCNACGLKWANKGVMRVLTK-P 740
                 +G + QE  CRHCGIS KSTPMMRRGP+GPRTLCNACGL WANKG +R L+K  
Sbjct: 215 SWGADGNGSQNQEIVCRHCGISEKSTPMMRRGPEGPRTLCNACGLMWANKGTLRDLSKAA 274

Query: 741 PVSMQQHPMKLNGE 782
           P +     +  NGE
Sbjct: 275 PQTGNSSSLSKNGE 288


>ref|XP_006406306.1| hypothetical protein EUTSA_v10020970mg [Eutrema salsugineum]
           gi|557107452|gb|ESQ47759.1| hypothetical protein
           EUTSA_v10020970mg [Eutrema salsugineum]
          Length = 369

 Score =  218 bits (554), Expect = 5e-54
 Identities = 126/222 (56%), Positives = 141/222 (63%), Gaps = 11/222 (4%)
 Frame = +3

Query: 201 GDGAADQLTLSFQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIPNP-GMTPQNHRNLADY 377
           G    DQLTLSFQG+VYVFD VSPEKVQAVLLLLGG EVP  +P   G   QN+R L+  
Sbjct: 148 GSENGDQLTLSFQGQVYVFDRVSPEKVQAVLLLLGGREVPQTLPTTLGSPHQNNRGLSGT 207

Query: 378 PVRSSQPQRAASLNXXXXXXXXXXXXXXIRYTVRKEVALRMQRKKGQFTSSKSALDEPGS 557
           P R S PQR ASL               IRYTVRKEVALRMQRKKGQFTS+KS+ D+  S
Sbjct: 208 PQRFSVPQRQASLIRFREKRKERNFDKTIRYTVRKEVALRMQRKKGQFTSAKSSNDDSAS 267

Query: 558 SSADWNGNS-----GPEEQ--ETSCRHCGISSKSTPMMRRGPDGPRTLCNACGLKWANKG 716
           + +DW  +      G E Q  E  CRHCGIS KSTPMMRRGP+GPRTLCNACGL WANKG
Sbjct: 268 TGSDWGSSQSWALEGSETQKPEVLCRHCGISEKSTPMMRRGPEGPRTLCNACGLMWANKG 327

Query: 717 VMRVLTK---PPVSMQQHPMKLNGETNGEDTAVAPSNGMASS 833
            +R L+K   PP   Q  P   N + N E   +    G  SS
Sbjct: 328 TLRDLSKAPPPPQIAQNLPADTNEDPNLEADQMTGVAGDISS 369


>ref|XP_006393030.1| hypothetical protein EUTSA_v10011695mg [Eutrema salsugineum]
           gi|557089608|gb|ESQ30316.1| hypothetical protein
           EUTSA_v10011695mg [Eutrema salsugineum]
          Length = 299

 Score =  217 bits (552), Expect = 9e-54
 Identities = 120/207 (57%), Positives = 136/207 (65%), Gaps = 9/207 (4%)
 Frame = +3

Query: 201 GDGAADQLTLSFQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIPNP-GMTPQNHRNLADY 377
           G    DQLTLSFQG+VYVFDSV PEKVQAVLLLLGG E+P   P   G + QN+R L + 
Sbjct: 77  GSDQGDQLTLSFQGQVYVFDSVLPEKVQAVLLLLGGRELPQAPPTGLGSSHQNNRGLPNT 136

Query: 378 PVRSSQPQRAASLNXXXXXXXXXXXXXXIRYTVRKEVALRMQRKKGQFTSSKSALDEPGS 557
           P R S PQR ASL               IRYTVRKEVALRMQR KGQFTS+KS+ DE  S
Sbjct: 137 PQRFSMPQRLASLVRFREKRKGRNFDKKIRYTVRKEVALRMQRNKGQFTSAKSSNDEAPS 196

Query: 558 SSADWNGN-------SGPEEQETSCRHCGISSKSTPMMRRGPDGPRTLCNACGLKWANKG 716
           + + W  N       S  + QE SCRHCGI  KSTPMMRRGP+GPRTLCNACGL WANKG
Sbjct: 197 AGSSWGSNQTWAIEGSEAQNQEISCRHCGIGEKSTPMMRRGPEGPRTLCNACGLMWANKG 256

Query: 717 VMRVLTK-PPVSMQQHPMKLNGETNGE 794
            +R L+K  P + Q  P+  N + N E
Sbjct: 257 ALRDLSKGAPQTAQNLPLHKNEDANLE 283


>ref|XP_002883290.1| hypothetical protein ARALYDRAFT_479637 [Arabidopsis lyrata subsp.
           lyrata] gi|297329130|gb|EFH59549.1| hypothetical protein
           ARALYDRAFT_479637 [Arabidopsis lyrata subsp. lyrata]
          Length = 297

 Score =  216 bits (550), Expect = 2e-53
 Identities = 124/205 (60%), Positives = 138/205 (67%), Gaps = 12/205 (5%)
 Frame = +3

Query: 216 DQLTLSFQGEVYVFDSVSPEKVQAVLLLLGGYEVPTGIPNP-GMTPQNHR--NLADYPVR 386
           DQLTLSFQG+VYVFD VSPEKVQAVLLLLGG EVP  +P   G   Q +R   L+  P R
Sbjct: 78  DQLTLSFQGQVYVFDRVSPEKVQAVLLLLGGREVPQTLPTSLGSPHQINRVLGLSGTPQR 137

Query: 387 SSQPQRAASLNXXXXXXXXXXXXXXIRYTVRKEVALRMQRKKGQFTSSKSALDEPGSSSA 566
            S PQR ASL               IRYTVRKEVALRMQRKKGQFTS+KS+ D+ GS+ +
Sbjct: 138 LSVPQRLASLLRFREKRKGRNFDKTIRYTVRKEVALRMQRKKGQFTSAKSSNDDSGSTGS 197

Query: 567 DWNGNS-----GPEEQ--ETSCRHCGISSKSTPMMRRGPDGPRTLCNACGLKWANKGVMR 725
           DW  N      G E Q  E  CRHCGIS KSTPMMRRGPDGPRTLCNACGL WANKG +R
Sbjct: 198 DWGSNQNWAIEGTETQKPEVLCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANKGTLR 257

Query: 726 VLTK--PPVSMQQHPMKLNGETNGE 794
            L+K  PP + Q  P+  N + N E
Sbjct: 258 DLSKVPPPQTPQHLPLNKNEDPNLE 282

