BLASTX nr result

ID: Akebia23_contig00029418

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00029418
         (1564 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006827868.1| hypothetical protein AMTR_s00008p00092930 [A...   189   3e-45
ref|XP_006440183.1| hypothetical protein CICLE_v10019614mg [Citr...   166   2e-38
ref|XP_006477095.1| PREDICTED: GATA transcription factor 26-like...   165   5e-38
ref|XP_006838526.1| hypothetical protein AMTR_s00002p00191340 [A...   162   5e-37
ref|XP_007039699.1| GATA transcription factor, putative isoform ...   159   2e-36
ref|XP_007039698.1| GATA transcription factor, putative isoform ...   159   2e-36
ref|XP_006368951.1| zinc finger family protein [Populus trichoca...   158   6e-36
ref|XP_004147235.1| PREDICTED: GATA transcription factor 26-like...   158   6e-36
ref|XP_003549942.1| PREDICTED: GATA transcription factor 26-like...   157   2e-35
emb|CAN76534.1| hypothetical protein VITISV_006083 [Vitis vinifera]   156   2e-35
ref|XP_004300335.1| PREDICTED: GATA transcription factor 26-like...   155   6e-35
ref|XP_006385556.1| hypothetical protein POPTR_0003s08080g [Popu...   154   8e-35
ref|XP_006414213.1| hypothetical protein EUTSA_v10024940mg [Eutr...   154   1e-34
ref|XP_002531215.1| GATA transcription factor, putative [Ricinus...   152   3e-34
ref|XP_003517400.1| PREDICTED: GATA transcription factor 26-like...   150   1e-33
ref|XP_004511735.1| PREDICTED: GATA transcription factor 26-like...   150   2e-33
gb|ADL36698.1| GATA domain class transcription factor [Malus dom...   148   6e-33
ref|XP_006362478.1| PREDICTED: GATA transcription factor 26-like...   148   8e-33
ref|XP_004244556.1| PREDICTED: GATA transcription factor 26-like...   148   8e-33
ref|XP_004511736.1| PREDICTED: GATA transcription factor 26-like...   146   2e-32

>ref|XP_006827868.1| hypothetical protein AMTR_s00008p00092930 [Amborella trichopoda]
            gi|548832503|gb|ERM95284.1| hypothetical protein
            AMTR_s00008p00092930 [Amborella trichopoda]
          Length = 519

 Score =  189 bits (480), Expect = 3e-45
 Identities = 133/348 (38%), Positives = 180/348 (51%), Gaps = 17/348 (4%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSREDKSQSMTKPENWHIK--- 1389
            KPVLCNACGSRWRT+G+L NY PLHAR    +D+E+ ++ R DKS   ++P  + ++   
Sbjct: 123  KPVLCNACGSRWRTKGSLANYAPLHARGVSPIDTENHKSPRVDKSPCRSRPPQFSMRTNQ 182

Query: 1388 -DHIEGDMEDKEIFDPYPIGLEDAXXXXXXXXXXXXXSESCIQLASVDGNDFSGPVQSSF 1212
             D     +E  + +DP P GLED              SESC+Q  S D ND +G  QS  
Sbjct: 183  DDRHAERIERLQGYDPGPKGLEDDTSNRSSSGSGISYSESCVQFGSTDVNDVTGSAQSHV 242

Query: 1211 LDLHIPSKKRTNNRCPKLSLVEKLRRDLLKIL-QQEPSYLSGSSE-EVLIYAREDSTFPL 1038
             D H+PSKKRT      LS VEKLR+DL +IL +Q+ S LSG SE +VL++  E     +
Sbjct: 243  WDSHVPSKKRTCITRQYLSPVEKLRKDLCEILHEQDSSQLSGYSEDDVLLFDSETPMDSV 302

Query: 1037 EIGHGSYLLKPPISFK-EEESEARSLIIENKASCLNEPHLGSSCLSQQSQNTEVNSSQV- 864
            EIG GS L+K P+S   EEESEA SL+ E++   +NE + GSS     + N E  S QV 
Sbjct: 303  EIGLGSVLIKHPLSTSGEEESEASSLVAESRCCIVNEAYSGSSLFPAPTLNRERGSIQVD 362

Query: 863  ---------SKDGLIEEIEDNNKSEEIPSLGRPLLQDPGVLYSTHAPQISVELNKNNERF 711
                       D +  E      +E + SL        G    + +  I   + KNNE F
Sbjct: 363  DNAVKFREGEMDRIFHEKMHTLANEPLESLHSNNFDTLG-RNDSASRYIDPNVKKNNE-F 420

Query: 710  KSENENYALTSHAIRGGISQPAKRPLDPPYFQTYGGTSSDATRTFKRP 567
                   + +S  + GG+S   KRPLD    ++  G       + KRP
Sbjct: 421  AEGKGVPSCSSGLVAGGVSTAVKRPLDWEVCKSIEGAIEKPKSSLKRP 468


>ref|XP_006440183.1| hypothetical protein CICLE_v10019614mg [Citrus clementina]
            gi|567895392|ref|XP_006440184.1| hypothetical protein
            CICLE_v10019614mg [Citrus clementina]
            gi|557542445|gb|ESR53423.1| hypothetical protein
            CICLE_v10019614mg [Citrus clementina]
            gi|557542446|gb|ESR53424.1| hypothetical protein
            CICLE_v10019614mg [Citrus clementina]
          Length = 542

 Score =  166 bits (421), Expect = 2e-38
 Identities = 111/292 (38%), Positives = 159/292 (54%), Gaps = 15/292 (5%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSREDKSQSMTKPENWHIKDHI 1380
            KPVLCNACGSRWRT+GTL NYTPLHARA    + +D ++ R  K +S++  +N  +K   
Sbjct: 25   KPVLCNACGSRWRTKGTLANYTPLHARA----EPDDYEDHRVSKVKSISINKNKDVK--- 77

Query: 1379 EGDMEDKEIFDPYPIG-------------LEDAXXXXXXXXXXXXXSESCIQLASVDGND 1239
               ++ K  +D   +G             +++              SESC+Q  S D +D
Sbjct: 78   --VLKRKSNYDNVVVGGFAPDYNHGYRKVVDEDTSNRSSSGSAISNSESCVQFGSADASD 135

Query: 1238 FSGPVQSSFLDLHIPSKKRTNNRCPKLSLVEKLRRDLLKIL-QQEPSYLSGSSEEVLIYA 1062
             +GP QS+  D  +PSKKRT    PK S VEKL +DL  IL +Q+ SY SGSSEE L++ 
Sbjct: 136  LTGPAQSNVWDSVVPSKKRTCVNRPKQSPVEKLTKDLYTILHEQQSSYFSGSSEEDLLFE 195

Query: 1061 REDSTFPLEIGHGSYLLKPPISF-KEEESEARSLIIENKASCLNEPHLGSSCLSQQSQNT 885
             E     +EIGHGS L++ P S  +EEESEA SL +ENK   +NE +  S+ L   +   
Sbjct: 196  SETPMVSVEIGHGSVLIRHPSSIAREEESEASSLSVENKQYLVNESYSRSATLHVYNDYQ 255

Query: 884  EVNSSQVSKDGLIEEIEDNNKSEEIPSLGRPLLQDPGVLYSTHAPQISVELN 729
             VN S  + D     IE   + +++    +   +   +L S ++P   ++LN
Sbjct: 256  GVNFSSRNMDKAKNFIEQGMQQDQL-KRDKSQQEKLQILGSHNSPLCEIDLN 306


>ref|XP_006477095.1| PREDICTED: GATA transcription factor 26-like [Citrus sinensis]
          Length = 542

 Score =  165 bits (418), Expect = 5e-38
 Identities = 111/292 (38%), Positives = 158/292 (54%), Gaps = 15/292 (5%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSREDKSQSMTKPENWHIKDHI 1380
            KPVLCNACGSRWRT+GTL NYTPLHARA    + +D ++ R  K +S++  +N  +K   
Sbjct: 25   KPVLCNACGSRWRTKGTLANYTPLHARA----EPDDYEDHRVSKVKSISINKNKDVK--- 77

Query: 1379 EGDMEDKEIFDPYPIG-------------LEDAXXXXXXXXXXXXXSESCIQLASVDGND 1239
               ++ K  +D   +G             +++              SESC+Q  S D +D
Sbjct: 78   --VLKRKSNYDNVVVGGFAPDYNHGYRKVVDEDTSNRSSSGSAISNSESCVQFGSADASD 135

Query: 1238 FSGPVQSSFLDLHIPSKKRTNNRCPKLSLVEKLRRDLLKIL-QQEPSYLSGSSEEVLIYA 1062
             +GP QS+  D  +PSKKRT    PK S VEKL +DL  IL +Q+ SY SGSSEE L++ 
Sbjct: 136  LTGPAQSNVWDSVVPSKKRTCVNRPKQSPVEKLTKDLYTILHEQQSSYFSGSSEEDLLFE 195

Query: 1061 REDSTFPLEIGHGSYLLKPPISF-KEEESEARSLIIENKASCLNEPHLGSSCLSQQSQNT 885
             E     +EIGHGS L++ P S  +EEESEA SL +ENK   +NE +  S+ L   +   
Sbjct: 196  SETPMVSVEIGHGSVLIRHPSSIAREEESEASSLSVENKQYLVNESYSRSATLHVYNDYQ 255

Query: 884  EVNSSQVSKDGLIEEIEDNNKSEEIPSLGRPLLQDPGVLYSTHAPQISVELN 729
             VN S  + D     IE   + +++    +   +   +L S  +P   ++LN
Sbjct: 256  GVNFSSRNMDKAKNFIEQGMQQDQL-KRDKSQQEKLQILGSHTSPLCEIDLN 306


>ref|XP_006838526.1| hypothetical protein AMTR_s00002p00191340 [Amborella trichopoda]
            gi|548841032|gb|ERN01095.1| hypothetical protein
            AMTR_s00002p00191340 [Amborella trichopoda]
          Length = 525

 Score =  162 bits (409), Expect = 5e-37
 Identities = 113/291 (38%), Positives = 158/291 (54%), Gaps = 15/291 (5%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSREDKSQSMTKPENWHIK--- 1389
            KPVLCNACGSRWRT+GTLTNYTPLH+R   +++S+ S   +        K +  H +   
Sbjct: 25   KPVLCNACGSRWRTKGTLTNYTPLHSRG-EAIESDVSNFPKVKNPSLKLKEDKLHKRKQN 83

Query: 1388 DHIEGDMEDKEIFDPYPIGLEDAXXXXXXXXXXXXXSESCIQLASVDGNDFSGPVQSSFL 1209
            D IE    ++  F  Y  GLE+              SESC+Q AS D  D  G  QS+  
Sbjct: 84   DIIEEAKGEEAGFALYRRGLEEDTSTRSSSGSAISYSESCVQFASTDAKDIRGSAQSNAW 143

Query: 1208 DLHIPSKKRTNNRCPKLSLVEKLRRDLLKIL-QQEPSYLSGSSEEVLIYAREDSTFPLEI 1032
            D  IPS+KRT     K S VEKL ++L  IL +QE SYLSG+SEE L++        +EI
Sbjct: 144  DSLIPSRKRTCVNRQKPSSVEKLTKELYCILHEQELSYLSGTSEEDLLFETTTPMVSVEI 203

Query: 1031 GHGSYLLKPPISF-KEEESEARSLIIENKASCLNEPHLGSSCLSQQSQNTEVNSSQVSKD 855
            GHG  L++PP S  +EEESEA SL+ E+KA  LN+    S+       +   N S+V  D
Sbjct: 204  GHGGVLIRPPNSLAQEEESEASSLLTESKAHFLNDDCSRSTSHHVNIPSKGCNFSEVG-D 262

Query: 854  GLIEEIEDNNKSEEIPSLGRPLLQDP----------GVLYSTHAPQISVEL 732
            G+++            ++G P+ +D            +L+S+++P IS++L
Sbjct: 263  GIVK-----------TNIGEPIQEDSQRNKTSDDECDILWSSNSPLISIDL 302


>ref|XP_007039699.1| GATA transcription factor, putative isoform 2 [Theobroma cacao]
            gi|508776944|gb|EOY24200.1| GATA transcription factor,
            putative isoform 2 [Theobroma cacao]
          Length = 400

 Score =  159 bits (403), Expect = 2e-36
 Identities = 100/236 (42%), Positives = 133/236 (56%), Gaps = 6/236 (2%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSREDKSQSMTKPENWHIKDHI 1380
            KPVLCNACGSRWRT+GTL NYTPLHAR    ++ +D ++ R  + +S++  +N  IK   
Sbjct: 25   KPVLCNACGSRWRTKGTLANYTPLHAR----VEPDDYEDHRASRVKSISINKNKEIKLLK 80

Query: 1379 EGDMEDKEIFDP-YPIG----LEDAXXXXXXXXXXXXXSESCIQLASVDGNDFSGPVQSS 1215
                 D  +  P Y  G    +++              SESC Q  S D +D +GP QS+
Sbjct: 81   RKPNHDTAVVAPDYNQGFRKFVDEDTSNRSSSGSAISNSESCAQFGSGDASDLTGPAQSN 140

Query: 1214 FLDLHIPSKKRTNNRCPKLSLVEKLRRDLLKILQQEPSYLSGSSEEVLIYAREDSTFPLE 1035
              D  +PSKKRT    PK S VEKL +DL  IL ++ SY SGSSEE L+   E     +E
Sbjct: 141  VWDSMVPSKKRTCVNRPKPSPVEKLTKDLYTILHEQSSYFSGSSEEDLLLESETPMVSVE 200

Query: 1034 IGHGSYLLKPPISF-KEEESEARSLIIENKASCLNEPHLGSSCLSQQSQNTEVNSS 870
            IGHGS L++ P S  +EEESEA SL +ENK   +NE +  SS     + +  +  S
Sbjct: 201  IGHGSVLIRHPSSIAREEESEASSLSVENKQYSMNEAYSHSSSFPTHNDSEGIKFS 256


>ref|XP_007039698.1| GATA transcription factor, putative isoform 1 [Theobroma cacao]
            gi|508776943|gb|EOY24199.1| GATA transcription factor,
            putative isoform 1 [Theobroma cacao]
          Length = 538

 Score =  159 bits (403), Expect = 2e-36
 Identities = 100/236 (42%), Positives = 133/236 (56%), Gaps = 6/236 (2%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSREDKSQSMTKPENWHIKDHI 1380
            KPVLCNACGSRWRT+GTL NYTPLHAR    ++ +D ++ R  + +S++  +N  IK   
Sbjct: 25   KPVLCNACGSRWRTKGTLANYTPLHAR----VEPDDYEDHRASRVKSISINKNKEIKLLK 80

Query: 1379 EGDMEDKEIFDP-YPIG----LEDAXXXXXXXXXXXXXSESCIQLASVDGNDFSGPVQSS 1215
                 D  +  P Y  G    +++              SESC Q  S D +D +GP QS+
Sbjct: 81   RKPNHDTAVVAPDYNQGFRKFVDEDTSNRSSSGSAISNSESCAQFGSGDASDLTGPAQSN 140

Query: 1214 FLDLHIPSKKRTNNRCPKLSLVEKLRRDLLKILQQEPSYLSGSSEEVLIYAREDSTFPLE 1035
              D  +PSKKRT    PK S VEKL +DL  IL ++ SY SGSSEE L+   E     +E
Sbjct: 141  VWDSMVPSKKRTCVNRPKPSPVEKLTKDLYTILHEQSSYFSGSSEEDLLLESETPMVSVE 200

Query: 1034 IGHGSYLLKPPISF-KEEESEARSLIIENKASCLNEPHLGSSCLSQQSQNTEVNSS 870
            IGHGS L++ P S  +EEESEA SL +ENK   +NE +  SS     + +  +  S
Sbjct: 201  IGHGSVLIRHPSSIAREEESEASSLSVENKQYSMNEAYSHSSSFPTHNDSEGIKFS 256


>ref|XP_006368951.1| zinc finger family protein [Populus trichocarpa]
            gi|550347310|gb|ERP65520.1| zinc finger family protein
            [Populus trichocarpa]
          Length = 552

 Score =  158 bits (400), Expect = 6e-36
 Identities = 110/295 (37%), Positives = 155/295 (52%), Gaps = 18/295 (6%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSREDKSQSMTKPENWHIK--- 1389
            KPVLCNACGSRWRT+GTL NYTPLHARA      +D ++ R  + +S++  +N  +K   
Sbjct: 33   KPVLCNACGSRWRTKGTLANYTPLHARA----GPDDYEDHRVSRLKSISMNKNREVKLLK 88

Query: 1388 -----DHIEGDMEDKEIFDPYPIGLEDAXXXXXXXXXXXXXSESCIQLASVDGNDFSGPV 1224
                 DH   +    +  + Y   +++              SESC Q  S D +D +GP 
Sbjct: 89   RKPNYDHRVAEGVALDYNEGYRKVVDEDTSNRSSSGSAISNSESCAQFGSADASDLTGPA 148

Query: 1223 QSSFLDLHIPSKKRTNNRCPKLSLVEKLRRDLLKIL-QQEPSYLSGSSEEVLIYAREDST 1047
            QS   D  +PS+KRT    PK S VEKL +DL  IL +Q+ S  SGSSEE L++  E   
Sbjct: 149  QSVVWDSLVPSRKRTCVNRPKPSPVEKLTKDLYTILHEQQSSCFSGSSEEDLLFDNETPM 208

Query: 1046 FPLEIGHGSYLLKPPISF-KEEESEARSLIIENKASCLNEPHLGSSCLSQQSQNTEVN-- 876
              +EIGHGS L++ P S  ++EESEA SL +ENK    NE +     L   ++N  VN  
Sbjct: 209  VSVEIGHGSVLIRHPSSIARDEESEASSLSVENKQYSTNEAYSHPVILPVHNENQSVNMT 268

Query: 875  ------SSQVSKDGLIEEIEDNNKSEEIPSLGRPLLQDPGVLYSTHAPQISVELN 729
                  +  +S  G+ +E  + +KS           +   +L S ++P  SV+LN
Sbjct: 269  YPVTVKTKNLSGQGMQQEQLNRDKSPH---------EKVHILGSHNSPLCSVDLN 314


>ref|XP_004147235.1| PREDICTED: GATA transcription factor 26-like [Cucumis sativus]
            gi|449510483|ref|XP_004163679.1| PREDICTED: GATA
            transcription factor 26-like [Cucumis sativus]
          Length = 539

 Score =  158 bits (400), Expect = 6e-36
 Identities = 107/273 (39%), Positives = 150/273 (54%), Gaps = 19/273 (6%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSR-------EDKSQSMTKPEN 1401
            KPVLCNACGSRWRT+GTL NYTPLHARA    + ED + SR       ++K   + K + 
Sbjct: 25   KPVLCNACGSRWRTKGTLANYTPLHARADPD-EFEDKRISRWKNLSMCKNKEVKLLKRKQ 83

Query: 1400 WHIKDHIEGDMEDKEIFDPYPIGLEDAXXXXXXXXXXXXXSESCIQLASVDGNDFSGPVQ 1221
            +     + G + D      +   +++              SESC Q    D +D +GP Q
Sbjct: 84   YQDNGLVVGVLPDHA--QSFHKVVDEDTSNRSSSGSAISNSESCAQFGGADASDLTGPSQ 141

Query: 1220 SSFLDLHIPSKKRTNNRCPKLSLVEKLRRDLLKILQQEPSYLSGSSEEVLIYAREDSTFP 1041
            S+  +  +PS+KRT    PK + VEKL +DL  IL+++ SY SGSSEE L++  E     
Sbjct: 142  STAWEAMVPSRKRTCVGRPKSTAVEKLTKDLYTILREQQSYFSGSSEEDLLFENETPMVS 201

Query: 1040 LEIGHGSYLLKPPISF-KEEESEARSLIIENKASCLNEPHLGSSCL--SQQSQNTEVNSS 870
            +EIGHGS L++ P S  +EEESEA S+ ++NK   LNE H  SS L    ++QN  VN S
Sbjct: 202  VEIGHGSVLMRHPSSIAREEESEASSISVDNKQFSLNEVHSESSILPVHYETQNKFVNFS 261

Query: 869  --------QVSKDGLIEEIE-DNNKSEEIPSLG 798
                       +  L ++I+ D  +SE + +LG
Sbjct: 262  TLGIGRKHSTGQGFLNDQIKRDRPQSERMQALG 294


>ref|XP_003549942.1| PREDICTED: GATA transcription factor 26-like [Glycine max]
          Length = 544

 Score =  157 bits (396), Expect = 2e-35
 Identities = 101/237 (42%), Positives = 132/237 (55%), Gaps = 7/237 (2%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSREDKSQSMTKPENWHIKDHI 1380
            KPVLCNACGSRWRT+GTL NYTPLHARA  ++D ED + SR  KS S+ K     +    
Sbjct: 25   KPVLCNACGSRWRTKGTLANYTPLHARA-ENIDYEDQKVSRV-KSISLNKNTEVKLVKRK 82

Query: 1379 E--GDMEDKEIFDPYPIG----LEDAXXXXXXXXXXXXXSESCIQLASVDGNDFSGPVQS 1218
            +  G+         Y  G    +++              SESC Q    D +D +GP QS
Sbjct: 83   QNYGNAASGGFVPDYSQGYRKVVDEDTSNRSSSGSAVSNSESCAQFGGPDASDLTGPAQS 142

Query: 1217 SFLDLHIPSKKRTNNRCPKLSLVEKLRRDLLKILQQEPSYLSGSSEEVLIYAREDSTFPL 1038
               D  +PSKKRT    PK S VEKL RDL  IL ++ SY S SSEE L++  +     +
Sbjct: 143  VVWDAMVPSKKRTCAGRPKPSSVEKLTRDLCTILHEQQSYFSASSEEDLLFESDTPMVSV 202

Query: 1037 EIGHGSYLLKPPISF-KEEESEARSLIIENKASCLNEPHLGSSCLSQQSQNTEVNSS 870
            EIGHGS L++ P S  ++EESEA SL ++NK   +NE +  SS +   S  + +N S
Sbjct: 203  EIGHGSILIRHPSSIARDEESEASSLSVDNKQCLMNEAYSFSSTIPIYSDRSSMNFS 259


>emb|CAN76534.1| hypothetical protein VITISV_006083 [Vitis vinifera]
          Length = 542

 Score =  156 bits (395), Expect = 2e-35
 Identities = 97/238 (40%), Positives = 136/238 (57%), Gaps = 10/238 (4%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSREDKSQSMTKPENWHIKDHI 1380
            KPVLCNACGSRWRT+GTL NYTPLHAR    +D +D+++ R  + +S++  +N  +K   
Sbjct: 25   KPVLCNACGSRWRTKGTLENYTPLHAR----VDGDDAEDYRVSRVKSISINKNKEVKLLK 80

Query: 1379 EGDMEDKEIFD----PYPIG----LEDAXXXXXXXXXXXXXSESCIQLASVDGNDFSGPV 1224
                +D  + +     Y  G    +++              SESC Q  S D +D +GP 
Sbjct: 81   RKQNQDNVVVNGVASDYSQGSRKAIDEDTSNRSSSGSAISNSESCAQFGSADASDLTGPS 140

Query: 1223 QSSFLDLHIPSKKRTNNRCPKLSLVEKLRRDLLKIL-QQEPSYLSGSSEEVLIYAREDST 1047
            QS   D  +PS+KRT    PK S VEKL +DL  IL +Q+ SY SGSSEE L++  E   
Sbjct: 141  QSIVWDTMVPSRKRTCVNRPKPSSVEKLTKDLCTILHEQQSSYFSGSSEEDLLFESETPM 200

Query: 1046 FPLEIGHGSYLLKPPISF-KEEESEARSLIIENKASCLNEPHLGSSCLSQQSQNTEVN 876
              +EIGHGS L++ P +  +EEESEA SL ++NK+  +NE +     L   + N  +N
Sbjct: 201  VSVEIGHGSVLIRHPSAIGREEESEASSLSVDNKSYLVNEVYSRIGALPVNTNNKGIN 258


>ref|XP_004300335.1| PREDICTED: GATA transcription factor 26-like [Fragaria vesca subsp.
            vesca]
          Length = 537

 Score =  155 bits (391), Expect = 6e-35
 Identities = 117/337 (34%), Positives = 168/337 (49%), Gaps = 22/337 (6%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSREDKSQSMTKPENWHIKDHI 1380
            KPVLCNACGSRWRT+GTL NYTPLHARA    + +D ++ R  + + M+  +N  +K   
Sbjct: 25   KPVLCNACGSRWRTKGTLVNYTPLHARA----EPDDYEDHRVSRMKIMSINKNKEVKLVK 80

Query: 1379 EGDMEDK-EIFDPYPIG----LEDAXXXXXXXXXXXXXSESCIQLASVDGNDFSGPVQSS 1215
                 D   +   Y +G    +++              +ESC    S D +D +GP QS 
Sbjct: 81   RKQHPDSVGVGADYSLGFRKLVDEDTSNRSSSGSAVSNTESCAHFGSADASDLTGPAQSM 140

Query: 1214 FLDLHIPSKKRTNNRCPKLSLVEKLRRDLLKIL--QQEPSYLSGSSEEVLIYAREDSTFP 1041
              D  +PS+KRT    PK S VEKL +DL  IL  QQ+ SY SGSSEE L++  E     
Sbjct: 141  VWDSTVPSRKRTCVGRPKQSPVEKLTKDLYTILHEQQQSSYFSGSSEEDLLFESETPMVS 200

Query: 1040 LEIGHGSYLLKPPIS-FKEEESEARSLIIENKASCLNEPHLGS-SCLSQQSQNTEVNSSQ 867
            +EIGHGS L++ P S  +EEESEA SL ++N     NE +  S S L   ++   ++S+ 
Sbjct: 201  VEIGHGSVLIRHPSSIIREEESEASSLSVDNLQCHRNEAYSRSASLLVHNNEGVNMSSTV 260

Query: 866  VSK------DGLIEEIEDNNKSEEIPSLGRPLLQDPGVLYSTHAPQISVELNK--NNERF 711
            + K       G+ +E  D ++ + +  LG           + ++P   ++LN   N E F
Sbjct: 261  IGKMNSPAGQGMQQEKRDKSQHDNLQILG-----------NHNSPLRHIDLNDIVNYEEF 309

Query: 710  ---KSENENYALTSHAIRGGISQP--AKRPLDPPYFQ 615
                +  E   L  H     +  P   K   D P F+
Sbjct: 310  IRQLTNEEQQQLLKHLPPADVKFPYSLKNMFDSPQFR 346


>ref|XP_006385556.1| hypothetical protein POPTR_0003s08080g [Populus trichocarpa]
            gi|118486445|gb|ABK95062.1| unknown [Populus trichocarpa]
            gi|550342683|gb|ERP63353.1| hypothetical protein
            POPTR_0003s08080g [Populus trichocarpa]
          Length = 540

 Score =  154 bits (390), Expect = 8e-35
 Identities = 104/283 (36%), Positives = 150/283 (53%), Gaps = 6/283 (2%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSREDKSQSMTKPENWHIKDHI 1380
            KPVLCNACGSRWRT+GTL NYTPLHARA    + +D ++ R  + +S++  +N  +K   
Sbjct: 25   KPVLCNACGSRWRTKGTLANYTPLHARA----EPDDYEDHRVSRLKSVSISKNKEVKLLK 80

Query: 1379 EGDMEDKEIFDPYPIG----LEDAXXXXXXXXXXXXXSESCIQLASVDGNDFSGPVQSSF 1212
                 D  +   Y  G    +++               ESC Q  S + +D +GP QS  
Sbjct: 81   RKPNYDNRVALDYNQGYRKVVDEDTSNRSSSGSAISNPESCAQFGSAEASDLTGPAQSVV 140

Query: 1211 LDLHIPSKKRTNNRCPKLSLVEKLRRDLLKIL-QQEPSYLSGSSEEVLIYAREDSTFPLE 1035
             D  +PS+KRT    PK S VEKL +DL  IL +Q+ S  SGSSEE L++  E     +E
Sbjct: 141  WDSLVPSRKRTCVNRPKPSSVEKLTKDLYTILHEQQSSCFSGSSEEDLLFDNETPMVSVE 200

Query: 1034 IGHGSYLLKPPISF-KEEESEARSLIIENKASCLNEPHLGSSCLSQQSQNTEVNSSQVSK 858
            IGHGS L++ P S  ++EESEA SL +ENK    NE +     L   ++N  VN++    
Sbjct: 201  IGHGSVLIRHPSSIARDEESEASSLSVENKQYLTNEAYSHPVILPVHNENKSVNTTYPIT 260

Query: 857  DGLIEEIEDNNKSEEIPSLGRPLLQDPGVLYSTHAPQISVELN 729
            +          + E++     P  +   +L S ++P  S++LN
Sbjct: 261  ETTKNLTGQGMQQEQLKRDKFP-HEKVHILGSHNSPLCSIDLN 302


>ref|XP_006414213.1| hypothetical protein EUTSA_v10024940mg [Eutrema salsugineum]
            gi|312282921|dbj|BAJ34326.1| unnamed protein product
            [Thellungiella halophila] gi|557115383|gb|ESQ55666.1|
            hypothetical protein EUTSA_v10024940mg [Eutrema
            salsugineum]
          Length = 516

 Score =  154 bits (388), Expect = 1e-34
 Identities = 103/217 (47%), Positives = 130/217 (59%), Gaps = 11/217 (5%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSREDKSQSMT---KPENWHIK 1389
            KPVLCNACGSRWRT+GTL NYTPLH+RA    D ED Q  +  KS SM+   K      +
Sbjct: 25   KPVLCNACGSRWRTKGTLVNYTPLHSRADCD-DHEDHQRYQRMKSISMSSKNKETKMLKR 83

Query: 1388 DHIEGDMEDKEIFDPYPIGL-----EDAXXXXXXXXXXXXXSESCIQLASVDGNDFSGPV 1224
              I+ ++  K     +  GL     E+              SESC Q +S DG++ +GP 
Sbjct: 84   KAIQENISIKRPLLEFNYGLKKAVVEEDASNRSSSGSAISNSESCAQFSSADGSELTGPS 143

Query: 1223 QSSFLDLHIPSKKRTNNRCPKLSLVEKLRRDLLKILQ-QEPSYLSGSSEEVLIYAREDST 1047
            QS+  D  +PSK+RT    PK S VEKLR+DL  ILQ Q+ S LS SSEE L++  E S 
Sbjct: 144  QSNTWDTTVPSKRRTCVGRPKSSSVEKLRKDLYNILQEQQSSCLSVSSEEDLLFGNEMSM 203

Query: 1046 FPLEIGHGSYLLKPPISF-KEEESEARSL-IIENKAS 942
              +EIGHGS L++ P SF +EEESEA SL  +ENK+S
Sbjct: 204  VSVEIGHGSVLMRNPHSFAREEESEASSLSSVENKSS 240


>ref|XP_002531215.1| GATA transcription factor, putative [Ricinus communis]
            gi|223529175|gb|EEF31151.1| GATA transcription factor,
            putative [Ricinus communis]
          Length = 542

 Score =  152 bits (385), Expect = 3e-34
 Identities = 112/290 (38%), Positives = 156/290 (53%), Gaps = 13/290 (4%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSREDKSQSMTKPENWHIK--- 1389
            KPVLCNACGSRWRT+GTL NYTPLHARA    D +D ++ R  + +S++  +N  +K   
Sbjct: 25   KPVLCNACGSRWRTKGTLANYTPLHARA----DPDDYEDHRVSRVKSISINKNKDVKLLK 80

Query: 1388 ---DHIEGDMED--KEIFDPYPIGLEDAXXXXXXXXXXXXXSESCIQLASVDGNDFSGPV 1224
               +H  G +     +    Y   L++              SESC Q  S D +D +GP 
Sbjct: 81   RKANHDNGVVGGVVHDYNQGYRKVLDEDISNRSSSGSAISNSESCAQFGSADASDLTGPA 140

Query: 1223 QSSFLDLHIPSKKRTNNRCPKLSLVEKLRRDLLKIL-QQEPSYLSGSSEEVLIYAREDST 1047
            QS   D  +PSKKRT    PK S VEKL +DL  IL +Q+ S  SGSSEE L++  E   
Sbjct: 141  QSVVWDSMVPSKKRTCVNRPKQSPVEKLTKDLYTILHEQQSSCFSGSSEEDLLFESETPM 200

Query: 1046 FPLEIGHGSYLLKPPISF-KEEESEARSLIIENKASCLNEPHLGSSCLSQQSQNTEVNSS 870
              +EIGHGS L++ P S  ++EESEA SL +ENK    NE +  S  L     N  +++ 
Sbjct: 201  VSVEIGHGSVLIRHPSSIARDEESEASSLSVENKQCSTNEAYSHSLGLLVHIGNKNIHTP 260

Query: 869  QVSKDGLIEEIEDN-NKSEEIPSLGRPLLQDP--GVLYSTHAPQISVELN 729
             +    LIE+ ++   +  +   L R   Q     VL + ++P  +V+LN
Sbjct: 261  SL----LIEKAKNPIGQGLQHEQLKRDKFQHERVQVLGNHNSPLCNVDLN 306


>ref|XP_003517400.1| PREDICTED: GATA transcription factor 26-like [Glycine max]
          Length = 551

 Score =  150 bits (380), Expect = 1e-33
 Identities = 103/294 (35%), Positives = 152/294 (51%), Gaps = 11/294 (3%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSREDKSQSMTKPENWHI---- 1392
            KPVLCNACGSRWRT+GTL  YTPLHARA    D  D Q     KS S+ K +   +    
Sbjct: 25   KPVLCNACGSRWRTKGTLAKYTPLHARA--ETDDYDDQRVSRVKSISINKKKEVALLKRK 82

Query: 1391 --KDHIEGDMEDKEIFDPYPIGLEDAXXXXXXXXXXXXXSESCIQLA--SVDGNDFSGPV 1224
               D++       +    Y   +++              SESC Q     +D +D +GP 
Sbjct: 83   QNHDNVVSGGFAPDYNQGYQKVVDEDISNRSSSGSAISNSESCAQFGYGGMDASDLTGPA 142

Query: 1223 QSSFLDLHIPSKKRTNNRCPKLSLVEKLRRDLLKILQQEPSYLSGSSEEVLIYAREDSTF 1044
            QS   D  +PS+KRT    PK S VEKL +DL  IL ++ SY S SSEE L++  +    
Sbjct: 143  QSVVWDAMVPSRKRTCVGRPKPSSVEKLTKDLCTILHEQQSYFSVSSEEDLLFESDTPMV 202

Query: 1043 PLEIGHGSYLLK-PPISFKEEESEARSLIIENKASCLNEPHLGSSCLSQQSQNTEVNSSQ 867
             +EIGHGS L++ P    +EEESEA SL ++NK   ++E +  S  ++  + ++ + SS 
Sbjct: 203  SVEIGHGSILIRHPSYIAREEESEASSLSVDNKQCPMSEAYSFSGAIAMHNDSSRLKSSS 262

Query: 866  VSKDGLIEEIEDNNKSEEIPSLGRPLLQDPGVLYSTHAPQISVELNK--NNERF 711
            +  + +        + E++ S  +  L+   +L +  +P  S++LN   N E F
Sbjct: 263  LEVEKIGNSTGQGMQQEQLKS-DKSQLERVQILGNHESPLCSIDLNDVVNYEEF 315


>ref|XP_004511735.1| PREDICTED: GATA transcription factor 26-like isoform X1 [Cicer
            arietinum]
          Length = 541

 Score =  150 bits (379), Expect = 2e-33
 Identities = 105/292 (35%), Positives = 146/292 (50%), Gaps = 9/292 (3%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSREDKSQSMTKPENWHIKDHI 1380
            KPVLCNACGSRWRT+GTL NYTPLHARA    D  D Q +   KS S+ K +   +    
Sbjct: 25   KPVLCNACGSRWRTKGTLANYTPLHARA--ETDDCDDQRATRVKSISLNKNKEAKLLKRK 82

Query: 1379 EG--DMEDKEIFDPYPIGLEDAXXXXXXXXXXXXXS----ESCIQLASVDGNDFSGPVQS 1218
            +   ++    I   Y  G + A             +    ESC Q    D +D +GP QS
Sbjct: 83   QNHENVVSGRIASDYNHGFQKAVDEDYSTRSSSGSALSNSESCAQFGGADASDLTGPAQS 142

Query: 1217 SFLDLHIPSKKRTNNRCPKLSLVEKLRRDLLKILQQEPSYLSGSSEEVLIYAREDSTFPL 1038
               D  +PSKKRT     K S VEKL +DL  IL ++ SY S SSEE L++  E     +
Sbjct: 143  VIWDATVPSKKRTCVGRAKPSSVEKLTKDLCTILHEQQSYFSASSEEDLLFESETPMVSV 202

Query: 1037 EIGHGSYLLK-PPISFKEEESEARSLIIENKASCLNEPHLGSSCLSQQSQNTEVNSSQVS 861
            EIGHGS L++ P    +EEESEA SL  +N+   ++E +  S  +     ++  N S   
Sbjct: 203  EIGHGSVLIRHPSYVAREEESEASSLSFDNRQYPMSEAYSYSGSVLMHDSSSRSNFSSQG 262

Query: 860  KDGLIEEIEDNNKSEEIPSLGRPLLQDPGVLYSTHAPQISVELNK--NNERF 711
             + +        K E++ S  +  L+   +L +  +P   ++LN   N E F
Sbjct: 263  AEKVRNSAFHGMKHEQLKS-DKSQLERVQILGNHDSPLTLIDLNDVVNYEEF 313


>gb|ADL36698.1| GATA domain class transcription factor [Malus domestica]
          Length = 542

 Score =  148 bits (374), Expect = 6e-33
 Identities = 113/299 (37%), Positives = 154/299 (51%), Gaps = 10/299 (3%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSREDKSQSMTKPENWHIKDHI 1380
            KPVLCNACGSRWRT+GTL NYTPLHARA    D ED + SR  KS S+ K +   +    
Sbjct: 25   KPVLCNACGSRWRTKGTLVNYTPLHARAEPD-DFEDHRVSRV-KSISVNKSKEIKLVKRK 82

Query: 1379 EG--DMEDKEIFDPYPIG----LEDAXXXXXXXXXXXXXSESCIQLASVDGNDFSGPVQS 1218
            +    M    +   Y  G    +E+              SESC Q  S D +D +GP QS
Sbjct: 83   QNPESMVIGGVNSDYSHGFRKIIEEDKSNRSSSGSAVSNSESCAQFGSGDASDLTGPAQS 142

Query: 1217 SFLDLHIPSKKRTNNRCPKLSLVEKLRRDLLKIL-QQEPSYLSGSSEEVLIYAREDSTFP 1041
               D  +PS+KRT     K S VE+L +DL  IL +Q+ S  SGSSEE L++  E     
Sbjct: 143  MVWDSMVPSRKRTCIGRLKPSPVERLTKDLYTILHEQQSSCFSGSSEEDLLFESETPMVS 202

Query: 1040 LEIGHGSYLLKPPISF-KEEESEARSLIIENKASCLNEPHLGSSCLSQQSQNTEVNSSQV 864
            +EIGHGS L++ P S  +EEESEA S+ ++NK    NE +  S+ L   + N  VN +  
Sbjct: 203  VEIGHGSVLIRHPNSIAREEESEASSISVDNKQCLANEVYSRSATLFVHNNNKGVNMAS- 261

Query: 863  SKDGLIEEIEDNNKSEEIPSLGRPLLQDPGVLYSTHAPQISVELNK--NNERFKSENEN 693
            +  G +  +       E     +  L +  +L + ++P   V+LN   N E F  +  N
Sbjct: 262  TVSGRMNNVAGEGMQHEPLKRDKSQLDNFQILGNHNSPLRHVDLNDILNFEEFTRQLTN 320


>ref|XP_006362478.1| PREDICTED: GATA transcription factor 26-like [Solanum tuberosum]
          Length = 543

 Score =  148 bits (373), Expect = 8e-33
 Identities = 94/218 (43%), Positives = 127/218 (58%), Gaps = 6/218 (2%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSREDKSQSMTKPENWHIKDHI 1380
            KP+LCNACGSRWRT+GTL NYTPLHARA    D E+ + SR  K+ SM   E   +K   
Sbjct: 25   KPILCNACGSRWRTKGTLVNYTPLHARA-EPCDFEEHRVSR-FKNISMKNKEAKILKRKQ 82

Query: 1379 EGDMEDKEIFDPYPIG----LEDAXXXXXXXXXXXXXSESCIQLASVDGNDFSGPVQSSF 1212
              D  +      Y +G    L++              SESC Q  S + +D +GP QS+ 
Sbjct: 83   SHDNAEVGTPPDYNLGFRKVLDEDTSNRSSSGSAVSNSESCAQFGSAEASDLTGPAQSNI 142

Query: 1211 LDLHIPSKKRTNNRCPKLSLVEKLRRDLLKIL-QQEPSYLSGSSEEVLIYAREDSTFPLE 1035
             D  +PS+KRT    PK S VEKL +DL  IL +Q+ SYLS SSEE L++  +     +E
Sbjct: 143  WDSTVPSRKRTCFNRPKPSSVEKLTKDLYTILHEQQSSYLSASSEEELLFESDKPMVSVE 202

Query: 1034 IGHGSYLLKPPISF-KEEESEARSLIIENKASCLNEPH 924
            IGHGS L++ P +  +EEESEA SL ++NK   +++ +
Sbjct: 203  IGHGSVLMRHPSTIGREEESEASSLSVDNKHRSVSDAY 240


>ref|XP_004244556.1| PREDICTED: GATA transcription factor 26-like [Solanum lycopersicum]
          Length = 542

 Score =  148 bits (373), Expect = 8e-33
 Identities = 96/219 (43%), Positives = 130/219 (59%), Gaps = 7/219 (3%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSREDKSQSMTKPENWHIKDHI 1380
            KP+LCNACGSRWRT+GTL NYTPLHARA    D E+ + SR  K+ SM   E   +K   
Sbjct: 25   KPILCNACGSRWRTKGTLANYTPLHARA-EPCDFEEHRVSR-FKNISMKNKEAKILKR-- 80

Query: 1379 EGDMEDKEIFDP-YPIG----LEDAXXXXXXXXXXXXXSESCIQLASVDGNDFSGPVQSS 1215
            +    D E+  P Y +G    L++              SESC Q  S + +D +GP QS+
Sbjct: 81   KQSHHDAEVGTPDYSLGFRKVLDEDTSNRSSSGSAISNSESCAQFGSAEASDLTGPAQSN 140

Query: 1214 FLDLHIPSKKRTNNRCPKLSLVEKLRRDLLKIL-QQEPSYLSGSSEEVLIYAREDSTFPL 1038
              D  +PS+KRT    PK S VEKL +DL  IL +Q+ SYLS SSEE L++  +     +
Sbjct: 141  IWDSTVPSRKRTCFNRPKPSSVEKLTKDLYTILHEQQSSYLSASSEEELLFESDKPMVSV 200

Query: 1037 EIGHGSYLLKPPISF-KEEESEARSLIIENKASCLNEPH 924
            EIGHGS L++ P +  +EEESEA SL ++NK   +++ +
Sbjct: 201  EIGHGSVLMRYPSTIGREEESEASSLSVDNKHRSVSDAY 239


>ref|XP_004511736.1| PREDICTED: GATA transcription factor 26-like isoform X2 [Cicer
            arietinum]
          Length = 527

 Score =  146 bits (369), Expect = 2e-32
 Identities = 96/232 (41%), Positives = 124/232 (53%), Gaps = 9/232 (3%)
 Frame = -3

Query: 1559 KPVLCNACGSRWRTRGTLTNYTPLHARAFMSLDSEDSQNSREDKSQSMTKPENWHIKDHI 1380
            KPVLCNACGSRWRT+GTL NYTPLHARA    D  D Q +   KS S+ K +   +    
Sbjct: 42   KPVLCNACGSRWRTKGTLANYTPLHARA--ETDDCDDQRATRVKSISLNKNKEAKLLKRK 99

Query: 1379 EG--DMEDKEIFDPYPIGLEDAXXXXXXXXXXXXXS----ESCIQLASVDGNDFSGPVQS 1218
            +   ++    I   Y  G + A             +    ESC Q    D +D +GP QS
Sbjct: 100  QNHENVVSGRIASDYNHGFQKAVDEDYSTRSSSGSALSNSESCAQFGGADASDLTGPAQS 159

Query: 1217 SFLDLHIPSKKRTNNRCPKLSLVEKLRRDLLKILQQEPSYLSGSSEEVLIYAREDSTFPL 1038
               D  +PSKKRT     K S VEKL +DL  IL ++ SY S SSEE L++  E     +
Sbjct: 160  VIWDATVPSKKRTCVGRAKPSSVEKLTKDLCTILHEQQSYFSASSEEDLLFESETPMVSV 219

Query: 1037 EIGHGSYLLK-PPISFKEEESEARSLIIENKASCLNE--PHLGSSCLSQQSQ 891
            EIGHGS L++ P    +EEESEA SL  +N+   ++E   + GS   S +SQ
Sbjct: 220  EIGHGSVLIRHPSYVAREEESEASSLSFDNRQYPMSEAYSYSGSQLKSDKSQ 271

