BLASTX nr result

ID: Paeonia23_contig00005954 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00005954
         (1271 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002285624.1| PREDICTED: GATA transcription factor 1-like ...   227   7e-57
ref|XP_007034503.1| GATA transcription factor 1, putative [Theob...   212   2e-52
ref|XP_004150343.1| PREDICTED: GATA transcription factor 1-like ...   210   9e-52
ref|XP_002518163.1| GATA transcription factor, putative [Ricinus...   200   1e-48
ref|XP_006492137.1| PREDICTED: GATA transcription factor 1-like ...   195   4e-47
ref|XP_002299291.2| hypothetical protein POPTR_0001s14130g [Popu...   194   9e-47
ref|XP_006420528.1| hypothetical protein CICLE_v10005658mg [Citr...   193   1e-46
ref|XP_004309758.1| PREDICTED: GATA transcription factor 1-like ...   191   8e-46
ref|XP_002303808.2| hypothetical protein POPTR_0003s17340g [Popu...   188   4e-45
gb|EXB66651.1| GATA transcription factor 1 [Morus notabilis]          186   1e-44
ref|XP_007145298.1| hypothetical protein PHAVU_007G227300g [Phas...   184   5e-44
dbj|BAC98492.1| AG-motif binding protein-2 [Nicotiana tabacum]        179   3e-42
ref|NP_189047.1| GATA transcription factor 1 [Arabidopsis thalia...   177   9e-42
ref|XP_004309759.1| PREDICTED: GATA transcription factor 1-like ...   175   4e-41
gb|AAM65139.1| GATA transcription factor 1 (AtGATA-1) [Arabidops...   175   4e-41
ref|XP_006418795.1| hypothetical protein EUTSA_v10002609mg [Eutr...   172   3e-40
ref|XP_002885621.1| hypothetical protein ARALYDRAFT_479930 [Arab...   172   4e-40
gb|EYU32614.1| hypothetical protein MIMGU_mgv1a023497mg, partial...   170   1e-39
gb|EYU43811.1| hypothetical protein MIMGU_mgv1a011652mg [Mimulus...   169   2e-39
ref|XP_006298301.1| hypothetical protein CARUB_v10014365mg [Caps...   169   2e-39

>ref|XP_002285624.1| PREDICTED: GATA transcription factor 1-like [Vitis vinifera]
          Length = 251

 Score =  227 bits (579), Expect = 7e-57
 Identities = 135/257 (52%), Positives = 155/257 (60%), Gaps = 8/257 (3%)
 Frame = -1

Query: 1142 AACFMDDLLDFSSDIXXXXXXXEKPKARPSSSSIQSNADDPSRSLPXXXXXXXXEWLSNK 963
            AACF+DDLLDFSSDI        K + R SSS +       SRSLP        EWL NK
Sbjct: 7    AACFVDDLLDFSSDIGEDDDDDHKRRTRSSSSLLVGGH---SRSLPDPPVEEELEWL-NK 62

Query: 962  DAFPAVETFVDIFPVAPSNFSMDLVKPSNVLKDRSPVSVLEXXXXXXXXXXXXXSA--IM 789
            D FP VETF+D  P +  N          + K +SP+SVLE             S   IM
Sbjct: 63   DVFPGVETFLDYLPTSVEN----------IPKQQSPISVLENSSHSSSSNNSNSSTTTIM 112

Query: 788  SCCGGFSVPVCRRARSKRQIKRKRSFSG--NQQWWCVASSKNT----SDTMTTTYINTGC 627
            SCC  F VP   RARSKR+ +R + FS    Q WW  +S  NT    S    +    T  
Sbjct: 113  SCCENFRVP--SRARSKRRRRRHKDFSDIPGQPWWWWSSQGNTNANHSSPTNSKQTITSS 170

Query: 626  IIGRKCTHCHANKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTYSSRMHSNS 447
             IGRKC HC A KTPQWRAGP+GPKTLCNACGVRYKSGRLV EYRPASSPT+SS++HSNS
Sbjct: 171  TIGRKCQHCQAEKTPQWRAGPLGPKTLCNACGVRYKSGRLVAEYRPASSPTFSSKVHSNS 230

Query: 446  HRKIMEMRRQKEYNVVV 396
            HRKIMEMR+ K+ +VVV
Sbjct: 231  HRKIMEMRKLKQRDVVV 247


>ref|XP_007034503.1| GATA transcription factor 1, putative [Theobroma cacao]
            gi|508713532|gb|EOY05429.1| GATA transcription factor 1,
            putative [Theobroma cacao]
          Length = 243

 Score =  212 bits (540), Expect = 2e-52
 Identities = 125/260 (48%), Positives = 150/260 (57%), Gaps = 4/260 (1%)
 Frame = -1

Query: 1142 AACFMDDLLDFSSDIXXXXXXXEKPKARPSSSSIQSNADDPSRSLPXXXXXXXXEWLSNK 963
            AA F ++LLDF SD+       E  K+   ++S   NA+   RS P         W+SNK
Sbjct: 7    AASFDENLLDFGSDVGEEDEDEENNKSSKLNTSSSLNAN---RSFPEFAEEELE-WISNK 62

Query: 962  DAFPAVETFVDIFPVAPSNFSMDLVKPSNVLKDRSPVSVLEXXXXXXXXXXXXXSA---- 795
            DAFP+VETFVDI   A               K +SPVSVL+                   
Sbjct: 63   DAFPSVETFVDILGTAA--------------KHQSPVSVLDNSNSSSNSSGSSTLTNGNI 108

Query: 794  IMSCCGGFSVPVCRRARSKRQIKRKRSFSGNQQWWCVASSKNTSDTMTTTYINTGCIIGR 615
            +M CCG   VPV  +ARSKR  K +   +    WW   + KN S  +      T   IGR
Sbjct: 109  VMYCCGNLKVPV--KARSKRLRKCRDLRNQENSWWVQENVKNASAHVKGAGSRT---IGR 163

Query: 614  KCTHCHANKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTYSSRMHSNSHRKI 435
            KC HC A KTPQWRAGP+GPKTLCNACGVRYKSGRLVPEYRPASSPT+S  +HSNSHRKI
Sbjct: 164  KCQHCGAEKTPQWRAGPLGPKTLCNACGVRYKSGRLVPEYRPASSPTFSIELHSNSHRKI 223

Query: 434  MEMRRQKEYNVVVMKPVDQG 375
            +EMRRQK++    MKP+D+G
Sbjct: 224  LEMRRQKQFGFSAMKPMDKG 243


>ref|XP_004150343.1| PREDICTED: GATA transcription factor 1-like [Cucumis sativus]
            gi|449514819|ref|XP_004164489.1| PREDICTED: GATA
            transcription factor 1-like [Cucumis sativus]
          Length = 287

 Score =  210 bits (535), Expect = 9e-52
 Identities = 140/286 (48%), Positives = 163/286 (56%), Gaps = 33/286 (11%)
 Frame = -1

Query: 1133 FMDDLLDFSSDIXXXXXXXE-------KPKAR----PSSSSIQSNA---DDPS--RSLPX 1002
            FMDDLLDFSSDI       +       KPK+     P SS + + A   DD S  R LP 
Sbjct: 7    FMDDLLDFSSDIGEEDEEDDAVPPFSVKPKSSSTTAPDSSDLNAAAMHPDDSSSCRVLPE 66

Query: 1001 XXXXXXXEWLSNKDAFPAVETFVDIFPVAPSNFSMDLVKPSNVLKDRSPVSVLEXXXXXX 822
                   EWLSN+DAFPAVETFVDI      + +       +V K  SPVSVLE      
Sbjct: 67   EYAEEELEWLSNEDAFPAVETFVDILSDHHHHHAPQPPPLPSVSKQNSPVSVLESTSISS 126

Query: 821  XXXXXXXS---------AIMSCCGGFSVPVCRRARSKRQIKRKRSFSGNQQWWCVA-SSK 672
                              +MSCCG   VP   +ARSKR  +R R  SG+   +    SSK
Sbjct: 127  HGETTNGGNKTSVHSSSILMSCCGSLKVP--SKARSKR--RRGRHISGHHLLFKQQPSSK 182

Query: 671  NTSDTMTTTYI------NTGCI-IGRKCTHCHANKTPQWRAGPMGPKTLCNACGVRYKSG 513
            N    + TT         TG   IGRKC HC A KTPQWRAGP GPKTLCNACGVR+KSG
Sbjct: 183  NLKQVVPTTATAAVVAATTGTAGIGRKCLHCGAEKTPQWRAGPFGPKTLCNACGVRFKSG 242

Query: 512  RLVPEYRPASSPTYSSRMHSNSHRKIMEMRRQKEYNVVVMKPVDQG 375
            RLVPEYRPASSPT+S+ +HSNSHRK+MEMRRQK+  +VV  P+D+G
Sbjct: 243  RLVPEYRPASSPTFSAELHSNSHRKVMEMRRQKQLGMVV-NPMDKG 287


>ref|XP_002518163.1| GATA transcription factor, putative [Ricinus communis]
           gi|223542759|gb|EEF44296.1| GATA transcription factor,
           putative [Ricinus communis]
          Length = 205

 Score =  200 bits (508), Expect = 1e-48
 Identities = 108/206 (52%), Positives = 131/206 (63%), Gaps = 5/206 (2%)
 Frame = -1

Query: 977 WLSNKDAFPAVETFVDIFPVAPSNFSMDLVKPSNVLKDRSPVSVLEXXXXXXXXXXXXXS 798
           WLSNKDAFP+VETFVDI              P ++ K RSPVSVLE              
Sbjct: 17  WLSNKDAFPSVETFVDILTE----------NPGSLQKHRSPVSVLENSTTSSTSNSGHSG 66

Query: 797 A----IMSCCGGFSVPVCRRARSKRQIKRKRSFSGNQQWWCVASSKNTSDTMTTTYINTG 630
                IM+ C    VPV  +ARSK   +R+R   G Q WW   + K      +++     
Sbjct: 67  TNDSVIMNYCRSLHVPV--KARSKPHRRRRRDLGGQQCWWSQENLKKVKVVKSSS----- 119

Query: 629 CIIGRKCTHCHANKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTYSSRMHSN 450
             IGRKC HC A KTPQWRAGP+GPKTLCNACGVRYKSGRLVPEYRPASSPT+SS +HSN
Sbjct: 120 STIGRKCQHCGAEKTPQWRAGPLGPKTLCNACGVRYKSGRLVPEYRPASSPTFSSVLHSN 179

Query: 449 SHRKIMEMRRQKE-YNVVVMKPVDQG 375
           SHRK++EMRRQK+   ++V+KP+++G
Sbjct: 180 SHRKVLEMRRQKQMMGIMVVKPMEKG 205


>ref|XP_006492137.1| PREDICTED: GATA transcription factor 1-like [Citrus sinensis]
          Length = 262

 Score =  195 bits (495), Expect = 4e-47
 Identities = 128/281 (45%), Positives = 153/281 (54%), Gaps = 27/281 (9%)
 Frame = -1

Query: 1136 CFMDDLLDFSSDIXXXXXXXEKPKARPSS--SSIQSNA---------DDPSRSLPXXXXX 990
            C +DDLLDF+ +         KP  RP +  SS+  N          DD  R  P     
Sbjct: 9    CCIDDLLDFNIN----DDECGKPNKRPRNALSSVNRNGCDFDVFEAGDDTDRLFPECAEE 64

Query: 989  XXXEWLSNKDAFPAVETFVDIFPVAPSNFSMDLVKPSNVLKDRSPVSVLEXXXXXXXXXX 810
                WLSN   FP VETFVDI     SN         N+LK +SP SVLE          
Sbjct: 65   ELE-WLSN---FPTVETFVDI----SSN--------PNILKQQSPNSVLENSNSSSSTST 108

Query: 809  XXXSA----------IMSCCGGFSVPVCRRARSKRQIKRKRSFSGNQQWW-CVASSKNTS 663
               +           IM+CCG   VPV  RARSK + + +R     + WW  V  S   +
Sbjct: 109  NGSTITNGNNNSNSIIMNCCGNLRVPV--RARSKLRTRCRRELLNQEAWWGSVHGSVKAA 166

Query: 662  DTMTTTYINTGCIIGRKCTHCHANKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPAS 483
              + +  I     IGRKC HC A KTPQWRAGPMGPKTLCNACGVR+KSGRLVPEYRPA+
Sbjct: 167  KPVVSKVI-----IGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRLVPEYRPAN 221

Query: 482  SPTYSSRMHSNSHRKIMEMRRQK-----EYNVVVMKPVDQG 375
            SPT+SS +HSNSHRK++EMRRQK     E  V+ +KPVD+G
Sbjct: 222  SPTFSSELHSNSHRKVVEMRRQKQMMGIELGVLGVKPVDKG 262


>ref|XP_002299291.2| hypothetical protein POPTR_0001s14130g [Populus trichocarpa]
            gi|550347223|gb|EEE84096.2| hypothetical protein
            POPTR_0001s14130g [Populus trichocarpa]
          Length = 308

 Score =  194 bits (492), Expect = 9e-47
 Identities = 129/289 (44%), Positives = 158/289 (54%), Gaps = 15/289 (5%)
 Frame = -1

Query: 1196 ALVGFFXXXXXXXXXXXSAACFM-DDLLDFSSDIXXXXXXXE----KPKARPSSSSIQSN 1032
            + +GFF           +AACFM DDLLDF SDI       E      K+R +  S+  N
Sbjct: 39   SFLGFFFFFFEEMESLDTAACFMVDDLLDFCSDIGEEEDGEEHQRNSKKSRRALPSLNPN 98

Query: 1031 ADDPS------RSLPXXXXXXXXEWLSNKDAFPAVETFVDIFPVAPSNFSMDLVKPSNVL 870
            A  P+       SL         EWLSNKDAFP VET           F     +P ++ 
Sbjct: 99   ALHPASFNVLEHSLLPEFAEEELEWLSNKDAFPTVETC----------FGSLSGEPGSIP 148

Query: 869  KDRSPVSVLEXXXXXXXXXXXXXS---AIMSCCGGFSVPVCRRARSKRQIKRKRSFSGNQ 699
            K  SPVSVLE             S    IMS C    VPV  +ARSKR  +  R     +
Sbjct: 149  KHHSPVSVLENSTTSSTSNSGNSSNSNIIMSYCR-LRVPV--KARSKRHHRHPREIQEQE 205

Query: 698  QWWCVASSKNTSDTMTTTYINTGCIIGRKCTHCHANKTPQWRAGPMGPKTLCNACGVRYK 519
             WW      +  + +T     +   +GRKC HC   KTPQWRAGP GPKTLCNACGVRYK
Sbjct: 206  CWW------SQENFITRKPAVSVAKLGRKCQHCGVEKTPQWRAGPDGPKTLCNACGVRYK 259

Query: 518  SGRLVPEYRPASSPTYSSRMHSNSHRKIMEMRRQKEY-NVVVMKPVDQG 375
            SGRLVPEYRPA+SPT+SS++HSNSHRK++EMRRQK+   ++V KP+D+G
Sbjct: 260  SGRLVPEYRPANSPTFSSKLHSNSHRKVVEMRRQKQMTGLLVAKPMDKG 308


>ref|XP_006420528.1| hypothetical protein CICLE_v10005658mg [Citrus clementina]
            gi|557522401|gb|ESR33768.1| hypothetical protein
            CICLE_v10005658mg [Citrus clementina]
          Length = 262

 Score =  193 bits (491), Expect = 1e-46
 Identities = 128/280 (45%), Positives = 154/280 (55%), Gaps = 26/280 (9%)
 Frame = -1

Query: 1136 CFMDDLLDFSSDIXXXXXXXEKPKARPSS--SSIQSN--------ADDPSRSLPXXXXXX 987
            C +DDLLDF+ +         KP  RP +  SS+  N        A D +  L       
Sbjct: 9    CCIDDLLDFNIN----DDECGKPTKRPRNALSSVNRNGCDFDVFEAGDDTDHLFPECAEE 64

Query: 986  XXEWLSNKDAFPAVETFVDIFPVAPSNFSMDLVKPSNVLKDRSPVSVLEXXXXXXXXXXX 807
              EWLSN   FP VETFVDI     SN         N+LK +SP SVLE           
Sbjct: 65   ELEWLSN---FPTVETFVDI----SSN--------PNILKQQSPNSVLENSNSSSSTSTN 109

Query: 806  XXSA----------IMSCCGGFSVPVCRRARSKRQIKRKRSFSGNQQWW-CVASSKNTSD 660
              +           IM+CCG   VPV  RARSK + + +R     + WW  V  S   + 
Sbjct: 110  GSTITNGNNNSNSIIMNCCGNLRVPV--RARSKLRTRCRRELLNQEAWWGSVHGSVKAAK 167

Query: 659  TMTTTYINTGCIIGRKCTHCHANKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPASS 480
             + +  I     IGRKC HC A KTPQWRAGPMGPKTLCNACGVR+KSGRLVPEYRPA+S
Sbjct: 168  PVVSKVI-----IGRKCQHCGAEKTPQWRAGPMGPKTLCNACGVRFKSGRLVPEYRPANS 222

Query: 479  PTYSSRMHSNSHRKIMEMRRQK-----EYNVVVMKPVDQG 375
            PT+SS +HSNSHRK++EMRRQK     E  V+ +KPVD+G
Sbjct: 223  PTFSSELHSNSHRKVVEMRRQKQMMGIELGVLGVKPVDKG 262


>ref|XP_004309758.1| PREDICTED: GATA transcription factor 1-like isoform 1 [Fragaria vesca
            subsp. vesca]
          Length = 227

 Score =  191 bits (484), Expect = 8e-46
 Identities = 119/260 (45%), Positives = 152/260 (58%), Gaps = 4/260 (1%)
 Frame = -1

Query: 1142 AACFMDDLLDFSSDIXXXXXXXEKPKARPSSSSIQSNADDPSRSL-PXXXXXXXXEWLSN 966
            AAC +DDL +F SD+           ARP         DDPSR L P        EW+SN
Sbjct: 7    AACLVDDLRNFLSDVADHD-------ARP---------DDPSRPLVPTEEAEEELEWISN 50

Query: 965  KDAFPAVETFVDIFPVAPSNFSMDLVKPSNVLKDRSPVSVLEXXXXXXXXXXXXXSAIMS 786
            KDAFPAVETF+            + V    + K +SPVSVLE              ++MS
Sbjct: 51   KDAFPAVETFI----------LSEQVGGIAIAKHQSPVSVLETSTNSSSA------SLMS 94

Query: 785  CCGGFSVPVCRRARSKRQIKRKRSFSGNQQWWC---VASSKNTSDTMTTTYINTGCIIGR 615
             CGG   P   RAR+K + +R+      Q +W    + SSK +  + + + ++    IGR
Sbjct: 95   SCGGLKPP--HRARTKGR-RRRSEIPPQQLFWNQPPIESSKPSRSSGSASKLD----IGR 147

Query: 614  KCTHCHANKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTYSSRMHSNSHRKI 435
            KC HC  ++TPQWRAGP GPKTLCNACGVRYKSGRL PEYRPASSP++SS+MHSNSHRK+
Sbjct: 148  KCLHCGTDQTPQWRAGPHGPKTLCNACGVRYKSGRLCPEYRPASSPSFSSQMHSNSHRKV 207

Query: 434  MEMRRQKEYNVVVMKPVDQG 375
            +EMR+ K    +V+KP D+G
Sbjct: 208  LEMRKHKYGVGMVVKPEDKG 227


>ref|XP_002303808.2| hypothetical protein POPTR_0003s17340g [Populus trichocarpa]
            gi|550343381|gb|EEE78787.2| hypothetical protein
            POPTR_0003s17340g [Populus trichocarpa]
          Length = 258

 Score =  188 bits (478), Expect = 4e-45
 Identities = 124/271 (45%), Positives = 151/271 (55%), Gaps = 15/271 (5%)
 Frame = -1

Query: 1142 AACFM-DDLLDFSSDIXXXXXXXE----KPKARPSSSSIQSNA------DDPSRSLPXXX 996
            AA FM DDLLDF SDI       E      K R    S+  NA      +    +L    
Sbjct: 7    AAGFMVDDLLDFCSDIGEGDDDEEHQNNNKKPRKGLPSLNPNALASASFNVLEHTLLPEF 66

Query: 995  XXXXXEWLSNKDAFPAVETFVDIFPVAPSNFSMDLVKPSNVLKDRSPVSVLEXXXXXXXX 816
                 EWLSNKDAFPAVET   I             +P ++ K  SPVSVLE        
Sbjct: 67   AEEELEWLSNKDAFPAVETCFGILSE----------EPGSIPKHHSPVSVLENSTTSSTS 116

Query: 815  XXXXXS---AIMSCCGGFSVPVCRRARSKRQIKRKRSFSGNQQWWCVASSKNTSDTMTTT 645
                 S    IMS C    VPV  +ARSKR+ +R R     ++WW   +S      ++  
Sbjct: 117  ISGNSSNSSIIMSYCS-LRVPV--KARSKRRHRRPREIREQERWWSRENSTRRKPAVSVA 173

Query: 644  YINTGCIIGRKCTHCHANKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTYSS 465
             +      GRKC HC   KTPQWRAGP GPKTLCNACGVRYKSGRLVPEYRPA+SPT+SS
Sbjct: 174  KM------GRKCQHCGVEKTPQWRAGPDGPKTLCNACGVRYKSGRLVPEYRPANSPTFSS 227

Query: 464  RMHSNSHRKIMEMRRQKE-YNVVVMKPVDQG 375
            ++HSNSHRK++EMR+QK+    +V+KP+D+G
Sbjct: 228  KLHSNSHRKVVEMRKQKQMMGSLVVKPMDKG 258


>gb|EXB66651.1| GATA transcription factor 1 [Morus notabilis]
          Length = 518

 Score =  186 bits (473), Expect = 1e-44
 Identities = 107/218 (49%), Positives = 130/218 (59%), Gaps = 19/218 (8%)
 Frame = -1

Query: 977 WLSNKDAFPAVETFVDIFPVAPSNFSMDLVKPSNVLKDRSPVSVLEXXXXXXXXXXXXXS 798
           W+SNKDAFPAVE+FV I P  PS           +LK  SPVSVL+             +
Sbjct: 141 WISNKDAFPAVESFVGILPDNPSGA---------ILKHHSPVSVLDGGSGGSSTISCNSN 191

Query: 797 A-----------IMSCCGGFSVPVCRRARSKRQIKRKRS-FSGNQQWWCVASSKNTSDTM 654
           +           + SC      P  RRARSKR+ +R+    +G Q  W  A++ N +++ 
Sbjct: 192 SNCSNSSSSIATLTSCFSSLKAP--RRARSKRRCRRRGGDITGRQLCWSQANNNNNNESF 249

Query: 653 T-------TTYINTGCIIGRKCTHCHANKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEY 495
           T        T   T  IIGRKC HC A+KTPQWRAGP GPKTLCNACGVRYKSGRLV EY
Sbjct: 250 TGYEKATRKTTTMTTTIIGRKCQHCGADKTPQWRAGPYGPKTLCNACGVRYKSGRLVSEY 309

Query: 494 RPASSPTYSSRMHSNSHRKIMEMRRQKEYNVVVMKPVD 381
           RPASSPT+SS +HSNSHRKI+EMRR K+   +V+   D
Sbjct: 310 RPASSPTFSSELHSNSHRKILEMRRTKQMMGMVVVAFD 347


>ref|XP_007145298.1| hypothetical protein PHAVU_007G227300g [Phaseolus vulgaris]
            gi|593689360|ref|XP_007145299.1| hypothetical protein
            PHAVU_007G227300g [Phaseolus vulgaris]
            gi|561018488|gb|ESW17292.1| hypothetical protein
            PHAVU_007G227300g [Phaseolus vulgaris]
            gi|561018489|gb|ESW17293.1| hypothetical protein
            PHAVU_007G227300g [Phaseolus vulgaris]
          Length = 250

 Score =  184 bits (468), Expect = 5e-44
 Identities = 119/254 (46%), Positives = 144/254 (56%), Gaps = 14/254 (5%)
 Frame = -1

Query: 1130 MDDLLDFSSDIXXXXXXXEKP-KARPSSSSIQSNA--------DDPSRSLPXXXXXXXXE 978
            +DDLLDFS DI       +KP K  PS +S   N         DDP+ S           
Sbjct: 7    VDDLLDFSLDIGEEDDDEDKPRKPCPSLNSKCGNPSLFNPLVPDDPNHSYSEFVEEELE- 65

Query: 977  WLSNKDAFPAVETFVDIFPVAPSNFSMDLVKPSNVLKDRSPVSVLEXXXXXXXXXXXXXS 798
            WLSNKDAFP+VETFVD+  + P    M    P+          +LE             S
Sbjct: 66   WLSNKDAFPSVETFVDLSCIQPDTAKMRKTTPATT-------PMLEYSSGSSNSNNSSNS 118

Query: 797  -AIMSCCGGFSVPVCRRARSKRQIKRKRSF----SGNQQWWCVASSKNTSDTMTTTYINT 633
             ++++ C    VPV  RARSKR+ + +       SG Q WW   S++ TS       I+ 
Sbjct: 119  ISLLNSCDHLKVPV--RARSKRRSRCRPGIADENSGQQFWWRQPSNE-TSKAEEGMKISP 175

Query: 632  GCIIGRKCTHCHANKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTYSSRMHS 453
               IGRKC HC A KTPQWRAGP+GPKTLCNACGVR+KSGRLVPEYRPASSP++ S +HS
Sbjct: 176  ---IGRKCQHCGAEKTPQWRAGPLGPKTLCNACGVRFKSGRLVPEYRPASSPSFRSDLHS 232

Query: 452  NSHRKIMEMRRQKE 411
            NSHRKI EMRRQK+
Sbjct: 233  NSHRKITEMRRQKQ 246


>dbj|BAC98492.1| AG-motif binding protein-2 [Nicotiana tabacum]
          Length = 289

 Score =  179 bits (453), Expect = 3e-42
 Identities = 122/275 (44%), Positives = 152/275 (55%), Gaps = 32/275 (11%)
 Frame = -1

Query: 1142 AACFM----DDLLDFSSDIXXXXXXXEKPKA------RPSSSSIQSNADD--------PS 1017
            A+CFM    DDLL+FS +        EK          P SSS  S+ D         PS
Sbjct: 11   ASCFMVDVDDDLLNFSLEDETVFDDDEKTTKSITKHKHPLSSSYSSSLDSSNPVLSLLPS 70

Query: 1016 RSLPXXXXXXXXEWLSNKDAFPAVETFVDIFPVAPSNFSMDLVKPSNVLKDRSPVSVLEX 837
            +  P         WLSNKDAFPAVE     F +   N       PS V    SPVSVLE 
Sbjct: 71   QQHPECVEEELE-WLSNKDAFPAVE-----FGILADN-------PSIVFDHHSPVSVLEN 117

Query: 836  XXXXXXXXXXXXS---AIMSCCGGFSVPVCR--RARSKRQIKRKR-SFSGNQQWWCVASS 675
                        +   A MSCC    VPV    RARSKR+ +R+R SF+      C++ +
Sbjct: 118  SSSTCNSSGNGSANANAYMSCCASLKVPVNYPVRARSKRRRRRQRGSFADLPSEHCMSVN 177

Query: 674  KNT------SDTMTTTYINTG--CIIGRKCTHCHANKTPQWRAGPMGPKTLCNACGVRYK 519
            K +       + + +  +N+     IGR+C HC A+KTPQWRAGP+GPKTLCNACGVRYK
Sbjct: 178  KPSFKSVKQREPLLSLPLNSAKSASIGRRCQHCGADKTPQWRAGPLGPKTLCNACGVRYK 237

Query: 518  SGRLVPEYRPASSPTYSSRMHSNSHRKIMEMRRQK 414
            SGRL+PEYRPA+SPT+S  +HSNSHRK++EMR+QK
Sbjct: 238  SGRLLPEYRPANSPTFSPTVHSNSHRKVLEMRKQK 272


>ref|NP_189047.1| GATA transcription factor 1 [Arabidopsis thaliana]
            gi|62900367|sp|Q8LAU9.2|GATA1_ARATH RecName: Full=GATA
            transcription factor 1; Short=AtGATA-1
            gi|2959730|emb|CAA73999.1| homologous to GATA-binding
            transcription factors [Arabidopsis thaliana]
            gi|9294674|dbj|BAB03023.1| protein homologous to
            GATA-binding transcription factors [Arabidopsis thaliana]
            gi|87116628|gb|ABD19678.1| At3g24050 [Arabidopsis
            thaliana] gi|332643327|gb|AEE76848.1| GATA transcription
            factor 1 [Arabidopsis thaliana]
          Length = 274

 Score =  177 bits (449), Expect = 9e-42
 Identities = 112/265 (42%), Positives = 134/265 (50%), Gaps = 26/265 (9%)
 Frame = -1

Query: 1133 FMDDLLDFS----------SDIXXXXXXXEKPKARPSSSSIQSNADDPSRSLPXXXXXXX 984
            FMDDLL+FS                     K   RP+ S    N DD             
Sbjct: 6    FMDDLLNFSVPEEEEDDDEHTQPPRNITRRKTGLRPTDSFGLFNTDDLG-----VVEEED 60

Query: 983  XEWLSNKDAFPAVETFVDIFPVAPSNFSMDLVKPSNVLKDRSPVSVLEXXXXXXXXXXXX 804
             EW+SNK+AFP +ETFV + P      +  L + +  +K  SPVSVLE            
Sbjct: 61   LEWISNKNAFPVIETFVGVLPSEHFPITSLLEREATEVKQLSPVSVLETSSHSSTTTTSN 120

Query: 803  XSA----------------IMSCCGGFSVPVCRRARSKRQIKRKRSFSGNQQWWCVASSK 672
             S                 IMSCC GF  P   +ARSKR+   +R     +  W      
Sbjct: 121  SSGGSNGSTAVATTTTTPTIMSCCVGFKAPA--KARSKRRRTGRRDL---RVLWTGNEQG 175

Query: 671  NTSDTMTTTYINTGCIIGRKCTHCHANKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYR 492
                  T T      I+GRKC HC A KTPQWRAGP GPKTLCNACGVRYKSGRLVPEYR
Sbjct: 176  GIQKKKTMTVAAAALIMGRKCQHCGAEKTPQWRAGPAGPKTLCNACGVRYKSGRLVPEYR 235

Query: 491  PASSPTYSSRMHSNSHRKIMEMRRQ 417
            PA+SPT+++ +HSNSHRKI+EMR+Q
Sbjct: 236  PANSPTFTAELHSNSHRKIVEMRKQ 260


>ref|XP_004309759.1| PREDICTED: GATA transcription factor 1-like isoform 2 [Fragaria
           vesca subsp. vesca]
          Length = 194

 Score =  175 bits (443), Expect = 4e-41
 Identities = 99/204 (48%), Positives = 129/204 (63%), Gaps = 3/204 (1%)
 Frame = -1

Query: 977 WLSNKDAFPAVETFVDIFPVAPSNFSMDLVKPSNVLKDRSPVSVLEXXXXXXXXXXXXXS 798
           W+SNKDAFPAVETF+            + V    + K +SPVSVLE              
Sbjct: 14  WISNKDAFPAVETFI----------LSEQVGGIAIAKHQSPVSVLETSTNSSSA------ 57

Query: 797 AIMSCCGGFSVPVCRRARSKRQIKRKRSFSGNQQWWC---VASSKNTSDTMTTTYINTGC 627
           ++MS CGG   P   RAR+K + +R+      Q +W    + SSK +  + + + ++   
Sbjct: 58  SLMSSCGGLKPP--HRARTKGR-RRRSEIPPQQLFWNQPPIESSKPSRSSGSASKLD--- 111

Query: 626 IIGRKCTHCHANKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTYSSRMHSNS 447
            IGRKC HC  ++TPQWRAGP GPKTLCNACGVRYKSGRL PEYRPASSP++SS+MHSNS
Sbjct: 112 -IGRKCLHCGTDQTPQWRAGPHGPKTLCNACGVRYKSGRLCPEYRPASSPSFSSQMHSNS 170

Query: 446 HRKIMEMRRQKEYNVVVMKPVDQG 375
           HRK++EMR+ K    +V+KP D+G
Sbjct: 171 HRKVLEMRKHKYGVGMVVKPEDKG 194


>gb|AAM65139.1| GATA transcription factor 1 (AtGATA-1) [Arabidopsis thaliana]
          Length = 268

 Score =  175 bits (443), Expect = 4e-41
 Identities = 110/264 (41%), Positives = 133/264 (50%), Gaps = 26/264 (9%)
 Frame = -1

Query: 1130 MDDLLDFS----------SDIXXXXXXXEKPKARPSSSSIQSNADDPSRSLPXXXXXXXX 981
            MDDLL+FS                     K   RP+ S    N DD              
Sbjct: 1    MDDLLNFSVPEEEEDDDEHTQPPRNITRRKTGLRPTDSFGLFNTDDLG-----VVEEEDL 55

Query: 980  EWLSNKDAFPAVETFVDIFPVAPSNFSMDLVKPSNVLKDRSPVSVLEXXXXXXXXXXXXX 801
            +W+SNK+AFP +ETFV + P      +  L + +  +K  SPVSVLE             
Sbjct: 56   QWISNKNAFPVIETFVGVLPSEHFPITSLLEREATEVKQLSPVSVLETSSHSSTTTTSNS 115

Query: 800  SA----------------IMSCCGGFSVPVCRRARSKRQIKRKRSFSGNQQWWCVASSKN 669
            S                 IMSCC GF  P   +ARSKR+   +R     +  W       
Sbjct: 116  SGGSNGSTAVATTTTTPTIMSCCVGFKAPA--KARSKRRRTGRRDL---RVLWTGNEQGG 170

Query: 668  TSDTMTTTYINTGCIIGRKCTHCHANKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRP 489
                 T T      I+GRKC HC A KTPQWRAGP GPKTLCNACGVRYKSGRLVPEYRP
Sbjct: 171  IQKKKTMTVAAAALIMGRKCQHCGAEKTPQWRAGPAGPKTLCNACGVRYKSGRLVPEYRP 230

Query: 488  ASSPTYSSRMHSNSHRKIMEMRRQ 417
            A+SPT+++ +HSNSHRKI+EMR+Q
Sbjct: 231  ANSPTFTAELHSNSHRKIVEMRKQ 254


>ref|XP_006418795.1| hypothetical protein EUTSA_v10002609mg [Eutrema salsugineum]
            gi|557096723|gb|ESQ37231.1| hypothetical protein
            EUTSA_v10002609mg [Eutrema salsugineum]
          Length = 319

 Score =  172 bits (436), Expect = 3e-40
 Identities = 115/284 (40%), Positives = 144/284 (50%), Gaps = 31/284 (10%)
 Frame = -1

Query: 1133 FMDDLLDFS----------SDIXXXXXXXEKPKA--RPSSSSIQSNADDPSRSLPXXXXX 990
            FMDDLL+FS           +I        + K   R + S    N DDP          
Sbjct: 48   FMDDLLNFSVPEEEEDEDEGEIVRSPRNISRRKTGLRQTDSFGLFNPDDPG------VVE 101

Query: 989  XXXEWLSNKDAFPAVETFVDIFPVAPSNFSMDLVKPSNVLKDRSPVSVLEXXXXXXXXXX 810
               EW+SNKDAFP +ETFV + P      S    + +   K  SPVSVLE          
Sbjct: 102  EDLEWISNKDAFPVIETFVGVLPSEHFRLSSPEGEATEG-KQLSPVSVLETSSHNSSITT 160

Query: 809  XXXSA-------------------IMSCCGGFSVPVCRRARSKRQIKRKRSFSGNQQWWC 687
               S+                   +M+CC G +VP   +ARSKR+   +R     +  W 
Sbjct: 161  ATTSSGGSNGSTVAATATAATTTTMMNCCVGLNVP--GKARSKRRRTGRRDL---KVLWT 215

Query: 686  VASSKNTSDTMTTTYINTGCIIGRKCTHCHANKTPQWRAGPMGPKTLCNACGVRYKSGRL 507
              + +      T +       +GRKC HC A KTPQWRAGP GPKTLCNACGVRYKSGRL
Sbjct: 216  GNNEQGPQKKKTPSVAAAAVSLGRKCQHCGAEKTPQWRAGPSGPKTLCNACGVRYKSGRL 275

Query: 506  VPEYRPASSPTYSSRMHSNSHRKIMEMRRQKEYNVVVMKPVDQG 375
            VPEYRPA+SPT+S+ +HSNSHRKI+EMR+Q +   VV+   D G
Sbjct: 276  VPEYRPANSPTFSAELHSNSHRKIVEMRKQFQSGDVVVDRKDCG 319


>ref|XP_002885621.1| hypothetical protein ARALYDRAFT_479930 [Arabidopsis lyrata subsp.
            lyrata] gi|297331461|gb|EFH61880.1| hypothetical protein
            ARALYDRAFT_479930 [Arabidopsis lyrata subsp. lyrata]
          Length = 270

 Score =  172 bits (435), Expect = 4e-40
 Identities = 113/273 (41%), Positives = 141/273 (51%), Gaps = 34/273 (12%)
 Frame = -1

Query: 1133 FMDDLLDFS----------SDIXXXXXXXEKPKARPSSSSIQSNADDPSRSLPXXXXXXX 984
            FMDDLL+FS          +          K   R + S    N DD             
Sbjct: 6    FMDDLLNFSVPEEEEDDEENTQPPRNITRRKTGIRQTDSFGLFNTDDLG-----VVEEED 60

Query: 983  XEWLSNKDAFPAVETFVDIFPVAPSNFSMDLVKPSNVLKDRSPVSVLEXXXXXXXXXXXX 804
             EW+SNK+AFP +ETFV + P++P        + +   K  SPVSVLE            
Sbjct: 61   LEWISNKNAFPVIETFVGVLPLSPE-------REATEGKQLSPVSVLETSSHSSTTTTAT 113

Query: 803  XS--------------------AIMSCCGGFSVPVCRRARSKRQIKRKRS----FSGNQQ 696
             S                     IMSCC GF  P   +ARSKR+   +R     ++GN+Q
Sbjct: 114  TSNSSGGSNGSTAVATTATTTTTIMSCCVGFKAPA--KARSKRRRTGRRDLGVLWTGNEQ 171

Query: 695  WWCVASSKNTSDTMTTTYINTGCIIGRKCTHCHANKTPQWRAGPMGPKTLCNACGVRYKS 516
               V   K  + ++         I+GRKC HC A KTPQWRAGP GPKTLCNACGVRYKS
Sbjct: 172  ---VGIQKRKTPSVAAA---AAMIMGRKCQHCGAEKTPQWRAGPAGPKTLCNACGVRYKS 225

Query: 515  GRLVPEYRPASSPTYSSRMHSNSHRKIMEMRRQ 417
            GRLVPEYRPA+SPT+++ +HSNSHRKI+EMR+Q
Sbjct: 226  GRLVPEYRPANSPTFTAELHSNSHRKIVEMRKQ 258


>gb|EYU32614.1| hypothetical protein MIMGU_mgv1a023497mg, partial [Mimulus
           guttatus]
          Length = 235

 Score =  170 bits (431), Expect = 1e-39
 Identities = 105/222 (47%), Positives = 126/222 (56%), Gaps = 29/222 (13%)
 Frame = -1

Query: 977 WLSNKDAFPAVETFVDIFPVAPSNFSMDLVKPSNVLKDRSPVSVLEXXXXXXXXXXXXXS 798
           WLSNKDAFPAVET    F +   N       P  +L  +SPVSVLE             S
Sbjct: 9   WLSNKDAFPAVET---CFGILSDN-------PGLILNHQSPVSVLENNNSSISAGSNGGS 58

Query: 797 A---IMSCCGGFSVPVCR--RARSKRQIKRKRSF----SGNQQWWC-----VASSKNTSD 660
           +   I SCC    VP     RARSKR+ +R+  F    S  Q  W      V  +K    
Sbjct: 59  SGGSIASCCNSIKVPTKYPVRARSKRRRRRRTGFTDLPSQQQCVWMNQVNIVVKNKKQES 118

Query: 659 TMTTTYI---------------NTGCIIGRKCTHCHANKTPQWRAGPMGPKTLCNACGVR 525
            ++   +                 G  +GR+C HC A+KTPQWRAGPMGPKTLCNACGVR
Sbjct: 119 QLSLPPLPLAAAATADSGGGTTGGGGGMGRRCWHCQADKTPQWRAGPMGPKTLCNACGVR 178

Query: 524 YKSGRLVPEYRPASSPTYSSRMHSNSHRKIMEMRRQKEYNVV 399
           YKSGRL+PEYRPA+SPT+SS +HSNSHRK++EMRRQK+  VV
Sbjct: 179 YKSGRLLPEYRPANSPTFSSNLHSNSHRKVVEMRRQKQAVVV 220


>gb|EYU43811.1| hypothetical protein MIMGU_mgv1a011652mg [Mimulus guttatus]
          Length = 275

 Score =  169 bits (429), Expect = 2e-39
 Identities = 100/199 (50%), Positives = 124/199 (62%), Gaps = 7/199 (3%)
 Frame = -1

Query: 977 WLSNKDAFPAVETFVDIFPVAPSNFSMDLVKPSNVLKDRSPVSVLEXXXXXXXXXXXXXS 798
           WLSNK+AFPAVET    F +   N       P  +   +SPVSVLE              
Sbjct: 84  WLSNKEAFPAVET---CFGILSDN-------PELISSHKSPVSVLENSTTNNYT------ 127

Query: 797 AIMSCCGGFSVPVCR--RARSKRQIKRKRSFSGNQ--QWWCVASSKNTSDTMTTTYINTG 630
             +SC     VPV    RARSKR+ + +RS SG+   Q     S +N S   ++   N+G
Sbjct: 128 TALSCFEHLKVPVNFPVRARSKRRRRTRRSCSGDPPLQHHRPLSVENKSKVKSSCGGNSG 187

Query: 629 C---IIGRKCTHCHANKTPQWRAGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTYSSRM 459
                +GR+C HC A++TPQWRAGPMGPKTLCNACGVRYKSGRL+PEYRPA+SPT+SS +
Sbjct: 188 GGGDNMGRRCLHCQADRTPQWRAGPMGPKTLCNACGVRYKSGRLLPEYRPANSPTFSSNL 247

Query: 458 HSNSHRKIMEMRRQKEYNV 402
           HSNSHRK++EMR+QK   V
Sbjct: 248 HSNSHRKVVEMRKQKHVEV 266


>ref|XP_006298301.1| hypothetical protein CARUB_v10014365mg [Capsella rubella]
            gi|482567010|gb|EOA31199.1| hypothetical protein
            CARUB_v10014365mg [Capsella rubella]
          Length = 274

 Score =  169 bits (429), Expect = 2e-39
 Identities = 116/275 (42%), Positives = 143/275 (52%), Gaps = 36/275 (13%)
 Frame = -1

Query: 1133 FMDDLLDFS---------SDIXXXXXXXE-KPKARPSSSSIQS-NADDPSRSLPXXXXXX 987
            FMDDLL+FS          D+         K   R + SS    N+DD            
Sbjct: 6    FMDDLLNFSVPEEEEEEEDDMQPPRNLTRRKTGLRQTDSSFGLFNSDDSG-----VVEEE 60

Query: 986  XXEWLSNKDAFPAVETFVDIFPVAPSNFSMDLVKPSNV---LKDRSPVSVLEXXXXXXXX 816
              EW+SNKDAFP +ETFV + P  P +F +    P  V   +K  SPVSVLE        
Sbjct: 61   DLEWISNKDAFPVIETFVGVLP--PEHFRV--TSPERVATEIKQLSPVSVLETSSHNSST 116

Query: 815  XXXXXSA------------------IMSCCGGFSVPVCRRARSKRQIKRKRS----FSGN 702
                 +                   +MSCC  F  P   +ARSKR+   +R     ++GN
Sbjct: 117  TTSTTTTTSNSSGGSTAVTTTTAATLMSCCVSFKAPA--KARSKRRRTGRRDLRVLWTGN 174

Query: 701  QQWWCVASSKNTSDTMTTTYINTGCIIGRKCTHCHANKTPQWRAGPMGPKTLCNACGVRY 522
            +Q              TT  +    I+GRKC HC A KTPQWRAGP GPKTLCNACGVRY
Sbjct: 175  EQG---GGGGGIQKKKTTAAV----IMGRKCQHCGAEKTPQWRAGPSGPKTLCNACGVRY 227

Query: 521  KSGRLVPEYRPASSPTYSSRMHSNSHRKIMEMRRQ 417
            KSGRLVPEYRPA+SPT+++ +HSNSHRKI+EMR+Q
Sbjct: 228  KSGRLVPEYRPANSPTFTAELHSNSHRKIVEMRKQ 262

