BLASTX nr result

ID: Jatropha_contig00022889 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Jatropha_contig00022889
         (758 letters)

Database: NCBI-nr (updated 2014/02/11) 
           35,149,712 sequences; 12,374,887,350 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002511642.1| GATA transcription factor, putative [Ricinus...   273   6e-71
gb|EOX96349.1| GATA transcription factor 2, putative [Theobroma ...   243   6e-62
gb|ERP53062.1| hypothetical protein POPTR_0014s05760g [Populus t...   231   2e-58
gb|ESR58578.1| hypothetical protein CICLE_v10021733mg [Citrus cl...   222   1e-55
gb|EEE80531.2| hypothetical protein POPTR_0002s14380g [Populus t...   211   3e-52
ref|NP_191612.1| GATA transcription factor 4 [Arabidopsis thalia...   207   4e-51
ref|XP_002277959.1| PREDICTED: GATA transcription factor 2 [Viti...   206   6e-51
emb|CBI16598.3| unnamed protein product [Vitis vinifera]              204   3e-50
ref|XP_006291749.1| hypothetical protein CARUB_v10017916mg [Caps...   203   5e-50
ref|XP_006347916.1| PREDICTED: GATA transcription factor 2-like ...   201   2e-49
dbj|BAC98493.1| AG-motif binding protein-3 [Nicotiana tabacum]        201   2e-49
ref|XP_004229778.1| PREDICTED: GATA transcription factor 2-like ...   200   4e-49
dbj|BAJ34282.1| unnamed protein product [Thellungiella halophila...   197   3e-48
ref|NP_182031.1| GATA transcription factor 2 [Arabidopsis thalia...   197   3e-48
ref|XP_002880154.1| hypothetical protein ARALYDRAFT_903940 [Arab...   194   3e-47
ref|XP_002876563.1| zinc finger family protein [Arabidopsis lyra...   194   3e-47
ref|XP_006294606.1| hypothetical protein CARUB_v10023643mg [Caps...   193   4e-47
gb|ESQ39138.1| hypothetical protein EUTSA_v10001591mg [Eutrema s...   193   6e-47
ref|XP_006359592.1| PREDICTED: GATA transcription factor 2-like ...   189   6e-46
gb|ADK63416.1| GATA type zinc finger protein [Brassica rapa]          189   6e-46

>ref|XP_002511642.1| GATA transcription factor, putative [Ricinus communis]
           gi|223548822|gb|EEF50311.1| GATA transcription factor,
           putative [Ricinus communis]
          Length = 235

 Score =  273 bits (697), Expect = 6e-71
 Identities = 140/203 (68%), Positives = 157/203 (77%), Gaps = 6/203 (2%)
 Frame = +2

Query: 167 MDVYGIPTPDYFRIDDLLDFSNDEIFSSSSTATDHHLSP----PQNPSIHFPSSTPFNPA 334
           MD+YGIPTPDYFRIDDLLD SND++FSS+ST T   ++     P NPSIH  +S PFNPA
Sbjct: 1   MDIYGIPTPDYFRIDDLLDLSNDDLFSSASTCTSSSIAADIHQPLNPSIH--NSAPFNPA 58

Query: 335 LSTDFTDDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTINVQSDASFSGKAVRSKRS 514
           LSTDFTD LSVPSDDVAELEWLSQFV+DSF++FP N L GTINV+SD SFSGKA R KRS
Sbjct: 59  LSTDFTDHLSVPSDDVAELEWLSQFVDDSFIEFPPNLLTGTINVRSDTSFSGKAARRKRS 118

Query: 515 R-GTVPNGSPWTSSPETASPATGKSKLKKDT-NRASSPTANGGVRRCTHCASEKNSQWRT 688
           +  T    + WTSSPE      G+SK KK+T NR+ SPT  GG+RRCTHCASEK  QWRT
Sbjct: 119 KAATTTATTAWTSSPE-----IGQSKSKKETNNRSLSPTTEGGIRRCTHCASEKTPQWRT 173

Query: 689 GPLGPKTLCNACGVRYNLAR*SP 757
           GPLGPKTLCNACGVRY   R  P
Sbjct: 174 GPLGPKTLCNACGVRYKSGRLVP 196


>gb|EOX96349.1| GATA transcription factor 2, putative [Theobroma cacao]
          Length = 273

 Score =  243 bits (619), Expect = 6e-62
 Identities = 124/208 (59%), Positives = 150/208 (72%), Gaps = 11/208 (5%)
 Frame = +2

Query: 167 MDVYGIPTPDYFRIDDLLDFSNDEIFSSSSTAT---DHHLSPPQNPSIHFPS------ST 319
           MD+YG+  P+ FRIDDLLD SN+E+FSS+S++T   ++   PP      + S      S 
Sbjct: 1   MDMYGLSAPELFRIDDLLDLSNEELFSSASSSTASTNNDQFPPSEAPFSYASASSSSSSA 60

Query: 320 PFNPALSTDFTDDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTINVQSDASFSGKAV 499
            F+P+ STDFT DL +PSDDVAELEWLSQFVEDSF DFP+NS+AGT+N ++D+SFS KA 
Sbjct: 61  AFHPSFSTDFTHDLCLPSDDVAELEWLSQFVEDSFTDFPSNSIAGTLNPRNDSSFSSKA- 119

Query: 500 RSKRSRGT--VPNGSPWTSSPETASPATGKSKLKKDTNRASSPTANGGVRRCTHCASEKN 673
           RSKRSR    +   + WT+  E A P TG SK KK+  R +SP A+GGVRRCTHCASEK 
Sbjct: 120 RSKRSRAATAMKTTTTWTTMSEAAPPFTGNSKTKKEIQRQASPAADGGVRRCTHCASEKT 179

Query: 674 SQWRTGPLGPKTLCNACGVRYNLAR*SP 757
            QWRTGPLGPKTLCNACGVRY   R  P
Sbjct: 180 PQWRTGPLGPKTLCNACGVRYKSGRLVP 207


>gb|ERP53062.1| hypothetical protein POPTR_0014s05760g [Populus trichocarpa]
          Length = 251

 Score =  231 bits (589), Expect = 2e-58
 Identities = 130/209 (62%), Positives = 146/209 (69%), Gaps = 12/209 (5%)
 Frame = +2

Query: 167 MDVYG----IPTPDYFRIDDLLDFSNDEIFSSSSTATDHH--LSPPQNPSIH---FPSST 319
           MDVYG       PDYF IDDLLDFSND++ SS S++ DHH  L PP+  SIH   FPSST
Sbjct: 1   MDVYGGLSTTTAPDYFHIDDLLDFSNDDLLSSPSSSIDHHHHLPPPETSSIHHHHFPSST 60

Query: 320 PFN--PALSTDFTDDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTINVQSDASFSGK 493
             N   +LSTDFTD LSVP+DDVAELEWLSQFVEDSF DFP+      IN+ +D SF  K
Sbjct: 61  YINNPSSLSTDFTDHLSVPTDDVAELEWLSQFVEDSFSDFPS-----IINIPTDTSFCNK 115

Query: 494 AVRSKRSRGTVPNGSPWTSSPETASPATGKSKLKKDTNRAS-SPTANGGVRRCTHCASEK 670
           + RSKRSR T    +  +SSPE  +  TGKS+LKK+ N A  SP   G VRRCTHCASEK
Sbjct: 116 S-RSKRSRATATTAT--SSSPELETAVTGKSRLKKENNGAPHSPAEEGTVRRCTHCASEK 172

Query: 671 NSQWRTGPLGPKTLCNACGVRYNLAR*SP 757
             QWRTGPLGPKTLCNACGVRY   R  P
Sbjct: 173 TPQWRTGPLGPKTLCNACGVRYKSGRLVP 201


>gb|ESR58578.1| hypothetical protein CICLE_v10021733mg [Citrus clementina]
          Length = 263

 Score =  222 bits (565), Expect = 1e-55
 Identities = 122/207 (58%), Positives = 145/207 (70%), Gaps = 10/207 (4%)
 Frame = +2

Query: 167 MDVYGIP-----TPDYFRIDDLLDFSNDEIFSSSSTATDHHLSPPQNPSIHFP-----SS 316
           MD+YG+P     T D FRIDDLLDFSNDE+F+SSS+A   + +   + + H P     S 
Sbjct: 1   MDIYGLPSNNTTTQDLFRIDDLLDFSNDELFTSSSSAATANTTAIASDTDHLPQAQHQSF 60

Query: 317 TPFNPALSTDFTDDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTINVQSDASFSGKA 496
             FNP  S+DFT DL VPSDDVAELEWLSQFV+DS +DFPANSLAGTI V+SD S SG+ 
Sbjct: 61  DSFNP--SSDFTGDLCVPSDDVAELEWLSQFVDDSCMDFPANSLAGTI-VRSDTSLSGRG 117

Query: 497 VRSKRSRGTVPNGSPWTSSPETASPATGKSKLKKDTNRASSPTANGGVRRCTHCASEKNS 676
            RSKRS+ T    +  T +  ++   +G SK K++ +R SSP   GGVRRCTHCASEK  
Sbjct: 118 -RSKRSKATNSAANTTTWNWTSSESESGNSKQKRENHRQSSPIPEGGVRRCTHCASEKTP 176

Query: 677 QWRTGPLGPKTLCNACGVRYNLAR*SP 757
           QWRTGPLGPKTLCNACGVRY   R  P
Sbjct: 177 QWRTGPLGPKTLCNACGVRYKSGRLVP 203


>gb|EEE80531.2| hypothetical protein POPTR_0002s14380g [Populus trichocarpa]
          Length = 246

 Score =  211 bits (536), Expect = 3e-52
 Identities = 122/210 (58%), Positives = 140/210 (66%), Gaps = 13/210 (6%)
 Frame = +2

Query: 167 MDVYG---IPTPDYFRIDDLLDFSNDEIFSSSSTATDHHLSPPQNPSIH-----FPSSTP 322
           MDVYG      PDYF IDDLLDFSND++ +SS+    HHL PP+  SIH     FPS T 
Sbjct: 1   MDVYGGVSTSAPDYFLIDDLLDFSNDDLLTSSTD--HHHLPPPETSSIHHHHHFFPSPTT 58

Query: 323 F---NPALSTDFTDDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTINVQSDASFSGK 493
           +     +LSTDFTD LSVPSDDVAELEWLSQF+EDSF DFP+     TIN+ +D S   K
Sbjct: 59  YINNTSSLSTDFTDHLSVPSDDVAELEWLSQFMEDSFTDFPS-----TINIPTDTSSRIK 113

Query: 494 AVRSKRSRGTVPNGSPWTSSPETASPATGKSKLKKDTNRA--SSPTANGGVRRCTHCASE 667
           +  SKRSR T    S   SS +  +  TG+S++KK+ N A  SS    GG RRCTHCASE
Sbjct: 114 SC-SKRSRTTTTATS---SSADIETAVTGESRVKKENNGAPHSSAETEGGARRCTHCASE 169

Query: 668 KNSQWRTGPLGPKTLCNACGVRYNLAR*SP 757
           K  QWRTGPLGPKTLCNACGVRY   R  P
Sbjct: 170 KTPQWRTGPLGPKTLCNACGVRYKSGRLVP 199


>ref|NP_191612.1| GATA transcription factor 4 [Arabidopsis thaliana]
           gi|62900345|sp|O49743.1|GATA4_ARATH RecName: Full=GATA
           transcription factor 4; Short=AtGATA-4
           gi|14190407|gb|AAK55684.1|AF378881_1 AT3g60530/T8B10_190
           [Arabidopsis thaliana] gi|2959736|emb|CAA74002.1|
           homologous to GATA-binding transcription factors
           [Arabidopsis thaliana] gi|7288001|emb|CAB81839.1| GATA
           transcription factor 4 [Arabidopsis thaliana]
           gi|14517395|gb|AAK62588.1| AT3g60530/T8B10_190
           [Arabidopsis thaliana] gi|15215891|gb|AAK91489.1|
           AT3g60530/T8B10_190 [Arabidopsis thaliana]
           gi|332646554|gb|AEE80075.1| GATA transcription factor 4
           [Arabidopsis thaliana]
          Length = 240

 Score =  207 bits (526), Expect = 4e-51
 Identities = 121/202 (59%), Positives = 136/202 (67%), Gaps = 5/202 (2%)
 Frame = +2

Query: 167 MDVYGIPTPDYFRIDDLLDFSNDEIFSSSSTATDHHLSPP---QNPSIHFPSSTPFNPAL 337
           MDVYG+ +PD  RIDDLLDFSNDEIFSSSST T    S     +NP   FPSST  +P L
Sbjct: 1   MDVYGMSSPDLLRIDDLLDFSNDEIFSSSSTVTSSAASSAASSENP-FSFPSSTYTSPTL 59

Query: 338 STDFTDDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTINVQSDASFSGKAVRSKRSR 517
            TDFT DL VPSDD A LEWLS+FV+DSF DFPAN L  T+ V+ + SF+GK  RS+RSR
Sbjct: 60  LTDFTHDLCVPSDDAAHLEWLSRFVDDSFSDFPANPL--TMTVRPEISFTGKP-RSRRSR 116

Query: 518 GTVPN-GSPWTSSPET-ASPATGKSKLKKDTNRASSPTANGGVRRCTHCASEKNSQWRTG 691
              P+    W    E+    +  K K KK  N A S TA+ G RRCTHCASEK  QWRTG
Sbjct: 117 APAPSVAGTWAPMSESELCHSVAKPKPKKVYN-AESVTAD-GARRCTHCASEKTPQWRTG 174

Query: 692 PLGPKTLCNACGVRYNLAR*SP 757
           PLGPKTLCNACGVRY   R  P
Sbjct: 175 PLGPKTLCNACGVRYKSGRLVP 196


>ref|XP_002277959.1| PREDICTED: GATA transcription factor 2 [Vitis vinifera]
          Length = 270

 Score =  206 bits (524), Expect = 6e-51
 Identities = 119/225 (52%), Positives = 140/225 (62%), Gaps = 28/225 (12%)
 Frame = +2

Query: 167 MDVYGIPTPDYFRIDDLLDFSNDEIFSSSSTATDHHLSPPQ----NPSIHFPSSTPF-NP 331
           MD+YG+ T D+FRIDDLLDF+NDE+FSS++T + + L PP+    N S+    +    N 
Sbjct: 1   MDLYGLQTSDFFRIDDLLDFTNDELFSSTTTDSGN-LPPPEIASGNRSLAASGNRDQPNT 59

Query: 332 ALSTDFTDDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTINVQSDASFSGKAVRSKR 511
             S DFTDDL VPSDDVAELEWLS FV+DSF DFP N LAGT+  + D+SF G+  RSKR
Sbjct: 60  FHSADFTDDLCVPSDDVAELEWLSNFVDDSFADFPENELAGTVMARPDSSFPGR-TRSKR 118

Query: 512 SRGTVPNGSPWTSSPETASPATGKSKLKKDTNR-----------------------ASSP 622
           SR +  N   WTS P +  P  GKSK   + N                        ASSP
Sbjct: 119 SRASSTN-KVWTSLPVSEIPMIGKSKTNSNKNSIVKKESSSSSSVISGERSSSSSPASSP 177

Query: 623 TANGGVRRCTHCASEKNSQWRTGPLGPKTLCNACGVRYNLAR*SP 757
           T   G R+CTHCASEK  QWRTGPLGPKTLCNACGVRY   R  P
Sbjct: 178 T---GARKCTHCASEKTPQWRTGPLGPKTLCNACGVRYKSGRLVP 219


>emb|CBI16598.3| unnamed protein product [Vitis vinifera]
          Length = 255

 Score =  204 bits (518), Expect = 3e-50
 Identities = 114/202 (56%), Positives = 138/202 (68%), Gaps = 5/202 (2%)
 Frame = +2

Query: 167 MDVYGIPTPDYFRIDDLLDFSNDEIFSSSSTATDHHLSPPQ----NPSIHFPSSTPF-NP 331
           MD+YG+ T D+FRIDDLLDF+NDE+FSS++T + + L PP+    N S+    +    N 
Sbjct: 1   MDLYGLQTSDFFRIDDLLDFTNDELFSSTTTDSGN-LPPPEIASGNRSLAASGNRDQPNT 59

Query: 332 ALSTDFTDDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTINVQSDASFSGKAVRSKR 511
             S DFTDDL VPSDDVAELEWLS FV+DSF DFP N LAGT+  + D+SF G+  RSKR
Sbjct: 60  FHSADFTDDLCVPSDDVAELEWLSNFVDDSFADFPENELAGTVMARPDSSFPGR-TRSKR 118

Query: 512 SRGTVPNGSPWTSSPETASPATGKSKLKKDTNRASSPTANGGVRRCTHCASEKNSQWRTG 691
           SR +  N   WTSS   +S +    +    ++ ASSPT   G R+CTHCASEK  QWRTG
Sbjct: 119 SRASSTN-KVWTSS---SSSSVISGERSSSSSPASSPT---GARKCTHCASEKTPQWRTG 171

Query: 692 PLGPKTLCNACGVRYNLAR*SP 757
           PLGPKTLCNACGVRY   R  P
Sbjct: 172 PLGPKTLCNACGVRYKSGRLVP 193


>ref|XP_006291749.1| hypothetical protein CARUB_v10017916mg [Capsella rubella]
           gi|482560456|gb|EOA24647.1| hypothetical protein
           CARUB_v10017916mg [Capsella rubella]
          Length = 247

 Score =  203 bits (516), Expect = 5e-50
 Identities = 115/207 (55%), Positives = 135/207 (65%), Gaps = 10/207 (4%)
 Frame = +2

Query: 167 MDVYGIPTPDYFRIDDLLDFSNDEIFSSSS-TATDHHLS-------PPQNPSIHFPSSTP 322
           MDVYG+ +PD  RIDDLLDFSNDE+FSSSS T T    S       P +NP  +FPSS  
Sbjct: 1   MDVYGMSSPDLLRIDDLLDFSNDELFSSSSSTVTSSAASSAASSSFPSENP-FNFPSSAY 59

Query: 323 FNPALSTDFTDDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTINVQSDASFSGKAVR 502
            +P L TDFT DL VPSDD A LEWLS+FV+DSF D+P N L  T+ V+ + SF+GK  R
Sbjct: 60  TSPPLLTDFTHDLCVPSDDAAHLEWLSRFVDDSFSDYPTNPL--TMTVRPEISFTGKP-R 116

Query: 503 SKRSRGTVPN-GSPWTSSPETA-SPATGKSKLKKDTNRASSPTANGGVRRCTHCASEKNS 676
           S+RSR   P+    W   PE+    +  K+K KK+ N        GG RRCTHCASEK  
Sbjct: 117 SRRSRAPAPSVAGTWAPMPESELCHSVPKTKHKKEYNAEPVTPDVGGARRCTHCASEKTP 176

Query: 677 QWRTGPLGPKTLCNACGVRYNLAR*SP 757
           QWRTGPLGPKTLCNACGVR+   R  P
Sbjct: 177 QWRTGPLGPKTLCNACGVRFKSGRLVP 203


>ref|XP_006347916.1| PREDICTED: GATA transcription factor 2-like [Solanum tuberosum]
          Length = 260

 Score =  201 bits (512), Expect = 2e-49
 Identities = 117/208 (56%), Positives = 137/208 (65%), Gaps = 11/208 (5%)
 Frame = +2

Query: 167 MDVYGIPT-PDYFRIDDLLDFSNDEIFS----SSSTATDHHLSPPQNPSIHFPSSTPFN- 328
           MDVYG+ + PD FRIDDLLDFSNDEIFS    SS+T  +HH  P  + S    ++  ++ 
Sbjct: 1   MDVYGVHSAPDLFRIDDLLDFSNDEIFSINNNSSNTDCNHHHQPHSHNSSAAGAANYYDA 60

Query: 329 --PALSTDFTDDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTINVQSD-ASFSGKAV 499
             P  S DFTD+L VPSDDVAELEWLS FVEDSF +FPANS+ GT+N+ S+ ASF G++ 
Sbjct: 61  LLPNSSDDFTDNLCVPSDDVAELEWLSNFVEDSFSNFPANSVTGTMNISSNTASFHGRS- 119

Query: 500 RSKRSRGTVPNGSPWTSSPETASPATGKSKLKKD--TNRASSPTANGGVRRCTHCASEKN 673
           RSKRSR T    S WTSS +  +  T     +    T   SS       RRCTHCASEK 
Sbjct: 120 RSKRSRST----SSWTSSLQNTNATTSMKNKESSVYTRERSSSMDEDVPRRCTHCASEKT 175

Query: 674 SQWRTGPLGPKTLCNACGVRYNLAR*SP 757
            QWRTGPLGPKTLCNACGVRY   R  P
Sbjct: 176 PQWRTGPLGPKTLCNACGVRYKSGRLVP 203


>dbj|BAC98493.1| AG-motif binding protein-3 [Nicotiana tabacum]
          Length = 256

 Score =  201 bits (512), Expect = 2e-49
 Identities = 114/210 (54%), Positives = 136/210 (64%), Gaps = 13/210 (6%)
 Frame = +2

Query: 167 MDVYGIPTPDYFRIDDLLDFSNDEIFS-----SSSTAT---DHHLSPPQNPSIHFPSSTP 322
           MDVYG+  PD FRIDDLLDFSNDEIFS     SS+TAT    HH   P + +    ++  
Sbjct: 1   MDVYGVSAPDLFRIDDLLDFSNDEIFSINSNSSSTTATPDSQHHHHQPHSDNSSAATANY 60

Query: 323 FN---PALSTDFTDDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTINVQSD--ASFS 487
           ++   P  S DFTD+L VPSDDVAELEWLS FVEDSF +FP NS+ GT+N+ S+  ASF 
Sbjct: 61  YDALLPNCSDDFTDNLCVPSDDVAELEWLSNFVEDSFSNFPTNSITGTMNLSSNSTASFH 120

Query: 488 GKAVRSKRSRGTVPNGSPWTSSPETASPATGKSKLKKDTNRASSPTANGGVRRCTHCASE 667
            ++ RSKRSR T    S WTSS +  +      ++   T   SS   +   RRCTHCASE
Sbjct: 121 SRS-RSKRSRST----SSWTSSLQNPNTTMKNKEISVHTRERSSSMDDDVPRRCTHCASE 175

Query: 668 KNSQWRTGPLGPKTLCNACGVRYNLAR*SP 757
           K  QWRTGPLGPKTLCNACGVR+   R  P
Sbjct: 176 KTPQWRTGPLGPKTLCNACGVRFKSGRLVP 205


>ref|XP_004229778.1| PREDICTED: GATA transcription factor 2-like [Solanum lycopersicum]
          Length = 260

 Score =  200 bits (509), Expect = 4e-49
 Identities = 120/208 (57%), Positives = 142/208 (68%), Gaps = 11/208 (5%)
 Frame = +2

Query: 167 MDVYGIPT-PDYFRIDDLLDFSNDEIFS----SSSTATDHHLSP-PQNPSIHFPSS--TP 322
           MDVYG+ + PD FRIDDLLDFSNDEIFS    S++T ++HH  P   N S   P++    
Sbjct: 1   MDVYGLHSAPDLFRIDDLLDFSNDEIFSINNNSNNTDSNHHHQPHSHNSSAAGPANYYDA 60

Query: 323 FNPALSTDFTDDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTINVQSD-ASFSGKAV 499
             P  S DFTD+L VPSDDVAELEWLS FVEDSF +FPANS+ GT+N+ S+ ASF G++ 
Sbjct: 61  LLPNSSDDFTDNLCVPSDDVAELEWLSNFVEDSFSNFPANSVTGTMNITSNTASFHGRS- 119

Query: 500 RSKRSRGTVPNGSPWTSSPETASPATG-KSKLKKDTNRASSPTANGGV-RRCTHCASEKN 673
           RSKRSR T    S WTSS + ++  T  K+K      R  S + +  V RRCTHCASEK 
Sbjct: 120 RSKRSRST----SSWTSSLQNSNATTSVKNKESSVYTRERSSSMDEDVPRRCTHCASEKT 175

Query: 674 SQWRTGPLGPKTLCNACGVRYNLAR*SP 757
            QWRTGPLGPKTLCNACGVRY   R  P
Sbjct: 176 PQWRTGPLGPKTLCNACGVRYKSGRLVP 203


>dbj|BAJ34282.1| unnamed protein product [Thellungiella halophila]
           gi|557103672|gb|ESQ44026.1| hypothetical protein
           EUTSA_v10006202mg [Eutrema salsugineum]
          Length = 247

 Score =  197 bits (501), Expect = 3e-48
 Identities = 113/207 (54%), Positives = 130/207 (62%), Gaps = 10/207 (4%)
 Frame = +2

Query: 167 MDVYGIPTPDYFRIDDLLDFSNDEIFSSSSTATDHHLSPP---QNPSIHFPSSTPFN--- 328
           MDVYG+ +PD  RIDDLLDFSNDEIFSSSST T    S     +NP  +FPSS   +   
Sbjct: 1   MDVYGLSSPDLLRIDDLLDFSNDEIFSSSSTVTSSAASSAASSENP-FNFPSSASNSFHT 59

Query: 329 --PALSTDFTDDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTINVQSDASFSGKAVR 502
             P L TDFT D  VPSDD A LEWLS+FV+DSF D+PAN L  T+ V+ + SF+GK  R
Sbjct: 60  SPPPLLTDFTHDFCVPSDDAAHLEWLSRFVDDSFSDYPANPL--TMTVRPEMSFTGKP-R 116

Query: 503 SKRSRGTVPN-GSPWTSSPETA-SPATGKSKLKKDTNRASSPTANGGVRRCTHCASEKNS 676
           S+RSR   P     W   PE+    +  K+K  K           GG RRCTHCASEK  
Sbjct: 117 SRRSRAPAPPVAGTWAPMPESELCYSVAKTKPNKKFEAEPMAADGGGARRCTHCASEKTP 176

Query: 677 QWRTGPLGPKTLCNACGVRYNLAR*SP 757
           QWRTGPLGPKTLCNACGVR+   R  P
Sbjct: 177 QWRTGPLGPKTLCNACGVRFKSGRLVP 203


>ref|NP_182031.1| GATA transcription factor 2 [Arabidopsis thaliana]
           gi|62900344|sp|O49741.1|GATA2_ARATH RecName: Full=GATA
           transcription factor 2; Short=AtGATA-2
           gi|2959732|emb|CAA74000.1| homologous to GATA-binding
           transcription factors [Arabidopsis thaliana]
           gi|24030302|gb|AAN41321.1| putative GATA-type zinc
           finger transcription factor [Arabidopsis thaliana]
           gi|222423708|dbj|BAH19820.1| AT2G45050 [Arabidopsis
           thaliana] gi|225898595|dbj|BAH30428.1| hypothetical
           protein [Arabidopsis thaliana]
           gi|330255406|gb|AEC10500.1| GATA transcription factor 2
           [Arabidopsis thaliana]
          Length = 264

 Score =  197 bits (501), Expect = 3e-48
 Identities = 112/223 (50%), Positives = 135/223 (60%), Gaps = 26/223 (11%)
 Frame = +2

Query: 167 MDVYGIPTPDYFRIDDLLDFSNDEIFSSSSTA------TDHHLSPPQNPSIH---FPSST 319
           MDVYG+ +PD  RIDDLLDFSN++IFS+SS+       +     PPQNPS H    PSS 
Sbjct: 1   MDVYGLSSPDLLRIDDLLDFSNEDIFSASSSGGSTAATSSSSFPPPQNPSFHHHHLPSSA 60

Query: 320 PFNPALSTDFTDDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTI-NVQSDASFSGKA 496
             +      F  D+ VPSDD A LEWLSQFV+DSF DFPAN L GT+ +V+++ SF GK 
Sbjct: 61  DHH-----SFLHDICVPSDDAAHLEWLSQFVDDSFADFPANPLGGTMTSVKTETSFPGKP 115

Query: 497 VRSKRSRGTVPNGSPWTSSPETASP----ATGKSKLKKDTN------------RASSPTA 628
            RSKRSR   P    W+  P  +      +  K K KK+ +             +S  T 
Sbjct: 116 -RSKRSRAPAPFAGTWSPMPLESEHQQLHSAAKFKPKKEQSGGGGGGGGRHQSSSSETTE 174

Query: 629 NGGVRRCTHCASEKNSQWRTGPLGPKTLCNACGVRYNLAR*SP 757
            GG+RRCTHCASEK  QWRTGPLGPKTLCNACGVR+   R  P
Sbjct: 175 GGGMRRCTHCASEKTPQWRTGPLGPKTLCNACGVRFKSGRLVP 217


>ref|XP_002880154.1| hypothetical protein ARALYDRAFT_903940 [Arabidopsis lyrata subsp.
           lyrata] gi|297325993|gb|EFH56413.1| hypothetical protein
           ARALYDRAFT_903940 [Arabidopsis lyrata subsp. lyrata]
          Length = 262

 Score =  194 bits (493), Expect = 3e-47
 Identities = 109/221 (49%), Positives = 133/221 (60%), Gaps = 24/221 (10%)
 Frame = +2

Query: 167 MDVYGIPTPDYFRIDDLLDFSNDEIFSSSSTA------TDHHLSPPQNPSIH---FPSST 319
           MDVYG+ +PD  RIDDLLDFSN++IFS+SS+       +     PPQNP+ H    PSS 
Sbjct: 1   MDVYGLSSPDLLRIDDLLDFSNEDIFSASSSGGSTAATSSSSFPPPQNPNFHHHHLPSSA 60

Query: 320 PFNPALSTDFTDDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTI-NVQSDASFSGKA 496
             +      F  D+ VPSDD A LEWLSQFV+DSF DFPAN L GT+ + +++ SF GK 
Sbjct: 61  DHH-----SFLHDICVPSDDAAHLEWLSQFVDDSFADFPANPLGGTMTSAKTETSFPGKP 115

Query: 497 VRSKRSRGTVPNGSPWTSSPETASP----ATGKSKLKKD----------TNRASSPTANG 634
            RSKRSR   P    W+  P  +      +  K K KK+           + +S     G
Sbjct: 116 -RSKRSRAPAPFAGTWSPMPTESEHHQLHSAAKFKPKKEHSGGGGGGRHQSSSSESAEGG 174

Query: 635 GVRRCTHCASEKNSQWRTGPLGPKTLCNACGVRYNLAR*SP 757
           G+RRCTHCASEK  QWRTGPLGPKTLCNACGVR+   R  P
Sbjct: 175 GMRRCTHCASEKTPQWRTGPLGPKTLCNACGVRFKSGRLVP 215


>ref|XP_002876563.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
           gi|297322401|gb|EFH52822.1| zinc finger family protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 240

 Score =  194 bits (493), Expect = 3e-47
 Identities = 114/199 (57%), Positives = 132/199 (66%), Gaps = 9/199 (4%)
 Frame = +2

Query: 188 TPDYFRIDDLLDFSNDEIFSSSSTAT-----DHHLSPPQNPSIHFPSSTPFNPALSTDFT 352
           +PD  RIDDLLDFSNDEIFSSS+++T            +NP  +FPSS   +P L TDFT
Sbjct: 3   SPDLLRIDDLLDFSNDEIFSSSTSSTVTSSAASSAGSSENP-FNFPSSAYTSPPLLTDFT 61

Query: 353 DDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTINVQSDASFSGKAVRSKRSRGTVPN 532
            DL VPSDD A LEWLS+FV+DSF DFPAN L  T+ V+ + SF+GK  RS+RSR   P+
Sbjct: 62  HDLCVPSDDAAHLEWLSRFVDDSFSDFPANPL--TMTVRPEISFTGKP-RSRRSRAPAPS 118

Query: 533 -GSPWTSSPETA-SPATGKSKLKKDTNRASSPTAN--GGVRRCTHCASEKNSQWRTGPLG 700
               W   PE+    +  K K KK  N A S TA+  GG RRCTHCASEK  QWRTGPLG
Sbjct: 119 VAGTWAPMPESELCHSVAKPKPKKVYN-AESITADVGGGARRCTHCASEKTPQWRTGPLG 177

Query: 701 PKTLCNACGVRYNLAR*SP 757
           PKTLCNACGVRY   R  P
Sbjct: 178 PKTLCNACGVRYKSGRLVP 196


>ref|XP_006294606.1| hypothetical protein CARUB_v10023643mg [Capsella rubella]
           gi|482563314|gb|EOA27504.1| hypothetical protein
           CARUB_v10023643mg [Capsella rubella]
          Length = 322

 Score =  193 bits (491), Expect = 4e-47
 Identities = 110/222 (49%), Positives = 132/222 (59%), Gaps = 25/222 (11%)
 Frame = +2

Query: 167 MDVYGIPTPDYFRIDDLLDFSNDEIFSSSST--------ATDHHLSPPQNPSIH---FPS 313
           MD+YG+ +PD  RIDDLLDFSN++IFS+SS          +     PPQNP+ H    PS
Sbjct: 58  MDLYGLSSPDLLRIDDLLDFSNEDIFSASSNNSGGSTAATSSSSFPPPQNPNFHHHHLPS 117

Query: 314 STPFNPALSTDFTDDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTI-NVQSDASFSG 490
           S   +      F  D+ VPSDD A LEWLSQFV+DSF DFPAN L GT+ +V+S+ SF G
Sbjct: 118 SADHH-----SFLHDICVPSDDAAHLEWLSQFVDDSFADFPANPLGGTMASVKSETSFPG 172

Query: 491 KAVRSKRSRGTVPNGSPWTSSPETASPA----TGKSKLKKDTN---------RASSPTAN 631
           K  RSKRSR   P    W+  P  +         K K KK+ +          +S     
Sbjct: 173 KP-RSKRSRAPAPFAGTWSPMPPESEHQQLHNAAKFKPKKEQSGGGGGRHQSSSSESGEG 231

Query: 632 GGVRRCTHCASEKNSQWRTGPLGPKTLCNACGVRYNLAR*SP 757
           GG+RRCTHCASEK  QWRTGPLGPKTLCNACGVR+   R  P
Sbjct: 232 GGMRRCTHCASEKTPQWRTGPLGPKTLCNACGVRFKSGRLVP 273


>gb|ESQ39138.1| hypothetical protein EUTSA_v10001591mg [Eutrema salsugineum]
          Length = 260

 Score =  193 bits (490), Expect = 6e-47
 Identities = 109/211 (51%), Positives = 134/211 (63%), Gaps = 14/211 (6%)
 Frame = +2

Query: 167 MDVYGIPTPD-YFRIDDLLDFSNDEIFSSSSTATDHHLSPPQNPSIHFPSSTPFNPALST 343
           MDVYG+ +PD   RIDDLLDFSN++IFS+SS+ +    S    P  H P+    + + S 
Sbjct: 1   MDVYGLSSPDNLLRIDDLLDFSNEDIFSASSSTSTAATSSSSFPPPHNPNFLHHHLSSSA 60

Query: 344 D--FTDDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTI-NVQSDASFSGKAVRSKRS 514
           D  F  D+ VPSDD A LEWLSQFV+DSF DFPAN L GT+ +V+++ SF GK  RSKRS
Sbjct: 61  DHSFLHDICVPSDDAAHLEWLSQFVDDSFADFPANPLGGTMTSVKTETSFPGKP-RSKRS 119

Query: 515 RGTVPNGSPWTSSPETASP--ATGKSKLKKDTN--------RASSPTANGGVRRCTHCAS 664
           R        W+  PE+       GK K KK+ +          ++ TA GG+RRCTHCAS
Sbjct: 120 RAPAAFAGTWSPLPESDQQIHVAGKFKPKKEQSGGGGGRHQSTTAETAEGGMRRCTHCAS 179

Query: 665 EKNSQWRTGPLGPKTLCNACGVRYNLAR*SP 757
           EK  QWRTGPLGPKTLCNACGVR+   R  P
Sbjct: 180 EKTPQWRTGPLGPKTLCNACGVRFKSGRLVP 210


>ref|XP_006359592.1| PREDICTED: GATA transcription factor 2-like [Solanum tuberosum]
          Length = 258

 Score =  189 bits (481), Expect = 6e-46
 Identities = 111/214 (51%), Positives = 133/214 (62%), Gaps = 17/214 (7%)
 Frame = +2

Query: 167 MDVYGIPTPDYFRIDDLLDFSNDEIFSSSSTATD----HHLSPPQNPSIHFPSSTPFNPA 334
           MDVYG  TP+ FRIDDLLDFSN+EIFSSS TA D    HH  PP  P+ +  ++  +  A
Sbjct: 1   MDVYGRLTPEVFRIDDLLDFSNEEIFSSSKTAIDFDLNHHYQPP--PTDNIAAAGCYYDA 58

Query: 335 L--STDFTDDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTI----NVQSDASFSGKA 496
           L  S DFTD L VPSDDVAELEWLS FVED+  +FP+NSL  T+    N  +  +     
Sbjct: 59  LPNSVDFTDKLCVPSDDVAELEWLSNFVEDTSNNFPSNSLTQTMYHLNNTNNTTTILHSK 118

Query: 497 VRSKRSRGTVPNGSPWTSSPETASPATGKSKLKKDTN-------RASSPTANGGVRRCTH 655
            RSKRSR +    + WT+S      +T +    +D N       + SS T+N   R+CTH
Sbjct: 119 SRSKRSRNS---NTSWTTSSLQQHKSTNQKNYNQDENSGIYNRDKFSSITSNITPRKCTH 175

Query: 656 CASEKNSQWRTGPLGPKTLCNACGVRYNLAR*SP 757
           CASEK  QWRTGPLGPKTLCNACGVRY   R  P
Sbjct: 176 CASEKTPQWRTGPLGPKTLCNACGVRYKSGRLVP 209


>gb|ADK63416.1| GATA type zinc finger protein [Brassica rapa]
          Length = 256

 Score =  189 bits (481), Expect = 6e-46
 Identities = 110/215 (51%), Positives = 137/215 (63%), Gaps = 18/215 (8%)
 Frame = +2

Query: 167 MDVYGIPTPDYFRIDDLLDFSNDEIFSSSST----ATDHHLSPPQNPSIH---FPSSTPF 325
           MDVYG+ + D  R+DDLLDFSN++IFS+SS+    AT     PPQNP+ H    PSS   
Sbjct: 1   MDVYGLSSQDLLRVDDLLDFSNEDIFSASSSTSTAATSPSSFPPQNPNYHHHHLPSSADH 60

Query: 326 NPALSTDFTDDLSVPSDDVAELEWLSQFVEDSFVDFPANSLAGTI-NVQSDASFSGKAVR 502
           +      F  D+ VPSDD A LEWLSQFV+DSF DFPAN L GT+ +V+++ SF+GK  R
Sbjct: 61  S------FLHDICVPSDDAAHLEWLSQFVDDSFADFPANPLGGTMTSVKTETSFTGKP-R 113

Query: 503 SKRSRGTVPNGSPWTSSPETASP--ATGKSKLKKDTN-------RASSPTANG-GVRRCT 652
           SKRS+        W    ET       G+SK KK+ +        +S+ TA G G+RRCT
Sbjct: 114 SKRSKPPSTLVGTWAPMSETDQNIHVAGRSKPKKEHSGGGGRHQSSSAETAEGAGLRRCT 173

Query: 653 HCASEKNSQWRTGPLGPKTLCNACGVRYNLAR*SP 757
           HCA++K  QWRTGPLGPKTLCNACGVR+   R  P
Sbjct: 174 HCATDKTPQWRTGPLGPKTLCNACGVRFKSGRLVP 208


Top