BLASTX nr result

ID: Zingiber23_contig00008990 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber23_contig00008990
         (1182 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006652600.1| PREDICTED: GATA transcription factor 26-like...   139   2e-30
ref|NP_001149109.1| GATA transcription factor 29 [Zea mays] gi|1...   130   1e-27
gb|EMT15131.1| GATA transcription factor 28 [Aegilops tauschii]       125   3e-26
dbj|BAJ93785.1| predicted protein [Hordeum vulgare subsp. vulgare]    125   3e-26
gb|EEC77737.1| hypothetical protein OsI_16852 [Oryza sativa Indi...   125   3e-26
ref|NP_001053461.1| Os04g0544500 [Oryza sativa Japonica Group] g...   125   4e-26
ref|XP_003580263.1| PREDICTED: uncharacterized protein LOC100829...   119   2e-24
ref|XP_006477095.1| PREDICTED: GATA transcription factor 26-like...   119   2e-24
ref|XP_006440183.1| hypothetical protein CICLE_v10019614mg [Citr...   119   2e-24
gb|EOY24200.1| GATA transcription factor, putative isoform 2 [Th...   118   5e-24
gb|EOY24199.1| GATA transcription factor, putative isoform 1 [Th...   118   5e-24
ref|XP_002448265.1| hypothetical protein SORBIDRAFT_06g024200 [S...   118   5e-24
ref|XP_006838526.1| hypothetical protein AMTR_s00002p00191340 [A...   117   1e-23
ref|XP_006368951.1| zinc finger family protein [Populus trichoca...   116   2e-23
gb|AFW59044.1| hypothetical protein ZEAMMB73_136468 [Zea mays]        116   2e-23
ref|XP_002326479.1| predicted protein [Populus trichocarpa]           116   2e-23
ref|XP_006385556.1| hypothetical protein POPTR_0003s08080g [Popu...   114   6e-23
gb|EMJ11074.1| hypothetical protein PRUPE_ppa003888mg [Prunus pe...   114   8e-23
emb|CAN76534.1| hypothetical protein VITISV_006083 [Vitis vinifera]   114   8e-23
ref|XP_004244556.1| PREDICTED: GATA transcription factor 26-like...   114   1e-22

>ref|XP_006652600.1| PREDICTED: GATA transcription factor 26-like [Oryza brachyantha]
          Length = 450

 Score =  139 bits (351), Expect = 2e-30
 Identities = 124/398 (31%), Positives = 167/398 (41%), Gaps = 69/398 (17%)
 Frame = +3

Query: 3    NYIPLHAREAFDTAELKVPKVIAFRSNEQKLHKNNQNKWKFESECEMQ---YYGQNFCKF 173
            NY P+HAR+  D  E   P+    +    KL +  Q K K  S   M+   +  QNF K 
Sbjct: 44   NYTPMHARDDIDAEE---PRANKLKPPTLKLKEQKQLKKK-PSHITMENGPFSDQNFRKM 99

Query: 174  IEGDTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPKLS- 350
             + D                C  YGT DAS +T S QS+A +SL+PSKKRS VTRPK S 
Sbjct: 100  GDADLSNRSGSGSALSYSESCAPYGTSDASEMTASAQSHAWESLVPSKKRSCVTRPKPSP 159

Query: 351  VEKLIKELYSIWHEEQASNLSINSEDDLLYSCSTPLGSIEIGYGSVLIK----------- 497
            VEKL K+L SI HEEQ   LS +SE+DL+Y   TP  S EIGYGS+L++           
Sbjct: 160  VEKLAKDLNSIMHEEQLLFLSGSSEEDLIYHSETPADSFEIGYGSMLLRPNSKSVEEESE 219

Query: 498  ----------------------------TAXXXXXXXXXXXXXFPVDKSYNTNYA----- 578
                                         A             FPV K+ N   A     
Sbjct: 220  ASSVPADNKSYITSESYSGSASLVYSESKATSNQNVITEQPKKFPVQKTDNATRAYLHTE 279

Query: 579  ------------LSKKIDGESDRPITQNLAG-----LSDLSSLKRKHERQNQIHSDLKGT 707
                        +S  I+G++   I +          S ++ LKR H+ Q Q   +++ T
Sbjct: 280  NQDTLENANSPLVSLDIEGKNSEEIGEKTNASKRLTRSTMNPLKRPHDTQFQSSGEVRAT 339

Query: 708  ARSPKRVRHSGDDSPPSKC----LTQLDSSHDAACFSPRRVSAVLPDKSSTFSSPTQFIA 875
              SPKRV  SG  +    C    + +  +  D AC        +LP    +   P Q+  
Sbjct: 340  MWSPKRVSKSG-GAMGLNCQVPFMLKPGNGKDLACRGRGLNLFMLPPDKLSMLVPPQYTN 398

Query: 876  DSCESKMLLNVPTNTSIAEAELLYHPWKKKTNRNGSPS 989
            D  +  +LL VP N    EAELL  P +  +  + S S
Sbjct: 399  DDSDQDLLLEVPPNARHPEAELLCQPSQLSSVAHSSTS 436


>ref|NP_001149109.1| GATA transcription factor 29 [Zea mays] gi|194706816|gb|ACF87492.1|
            unknown [Zea mays] gi|195624810|gb|ACG34235.1| GATA
            transcription factor 29 [Zea mays]
            gi|414586055|tpg|DAA36626.1| TPA: GATA transcription
            factor 29 [Zea mays]
          Length = 416

 Score =  130 bits (326), Expect = 1e-27
 Identities = 117/360 (32%), Positives = 158/360 (43%), Gaps = 43/360 (11%)
 Frame = +3

Query: 3    NYIPLHAREAFDTAELKVPKVIAFRSNEQKLHKNNQNKWKFESECEMQYYGQNFCKFIEG 182
            NY P+H ++  D  E +V K+    +++ K  K   N    E+     + GQNF K  + 
Sbjct: 44   NYTPMHRKDDIDDDEPRVSKLKP-PTSKLKSQKKKPNHIIMENG---PFSGQNFRKMGDV 99

Query: 183  DTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPKLS-VEK 359
            D                C  YG  DAS +TGS QS+A +SL+PS+KRS VTRPK S VEK
Sbjct: 100  DQSYRSSSGSAVSYSESCAPYGAADASEMTGSAQSHAWESLVPSRKRSCVTRPKPSPVEK 159

Query: 360  LIKELYSIWHEEQASNLSINSEDDLLYSCSTPLGSIEIGYGSVLIKTAXXXXXXXXXXXX 539
            L K+L  I HEEQ    S +SE+DLLY   TP+GS E+G GSVL++              
Sbjct: 160  LAKDLNFIMHEEQLYYPSGSSEEDLLYHSETPVGSFEMGSGSVLLRHPNSKSLEKESEAS 219

Query: 540  XFPVD-KSYNTNYALSKK----IDGESDRPITQNLAGL---------------------- 638
              P D KSY T+ + S      I   +   I  N +                        
Sbjct: 220  SIPADNKSYITSESYSGSASFAIHNGNKAAINLNASNARLKKSPLHMEDNARRGVGSISG 279

Query: 639  ------SDLSSLKRKHERQNQIHSDLKGTARSPKRVRHSGDDSPPSKCLTQLDSS----- 785
                  S +  LKR  + Q QI ++L+GT RSP R   SG        L Q +SS     
Sbjct: 280  PEGFTKSTMKPLKRPRDTQFQIDAELEGTMRSPLRGLKSG-------ALAQFESSSLPKS 332

Query: 786  ----HDAACFSPRRVSAVLPDKSSTFSSPTQFIADSCESKMLLNVPTNTSIAEAELLYHP 953
                 D+ C        +LP +      P Q++    +  +LL +P N    EAELL  P
Sbjct: 333  GYTTKDSTCTGGALNLFMLPPE-KLLVVPPQYV--DPDQDLLLEIPLNARHPEAELLCQP 389


>gb|EMT15131.1| GATA transcription factor 28 [Aegilops tauschii]
          Length = 446

 Score =  125 bits (315), Expect = 3e-26
 Identities = 121/373 (32%), Positives = 160/373 (42%), Gaps = 59/373 (15%)
 Frame = +3

Query: 3    NYIPLHAREAFDTAELKVPKVIAFRSNEQKLHKNNQNKWKFESECEMQYYGQNFCKFIEG 182
            NY P H RE    +E +  K+   +   QK  K   N+   + E    +  QNF K    
Sbjct: 45   NYTPAHRREDTGASEARPDKL---KLKGQKQPKKRPNRSIVKDE---PWSDQNFWKMGNA 98

Query: 183  DTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPKLS-VEK 359
            DT               C  YG+ DAS I GS QS+A +SL+PS+KRS V+RPK S +E 
Sbjct: 99   DTSNRSGSGSAVSYSESCAPYGSIDASEIAGSAQSHALESLVPSRKRSCVSRPKPSALEA 158

Query: 360  LIKELYSIWHEEQASNLSINS-EDDLLYSCSTPLGSIEIGYGSVLIKTAXXXXXXXXXXX 536
            L+ +L SI HEEQ   LS  S E+DLLY   TP GS EIGYGSVL++             
Sbjct: 159  LVDDLNSIMHEEQLYCLSAGSTEEDLLYHSETPAGSFEIGYGSVLLRHPNTKSEEEESEA 218

Query: 537  XXFPVD-KSY----------------------NTNYALSK-------------------- 587
               P D KSY                      N+N A  K                    
Sbjct: 219  NSVPADTKSYITSESYSGCASFIPHSEIKGASNSNAASEKLKWSPMQTHDSARRDELHCS 278

Query: 588  ------KIDGESDRPITQNLAGL--SDLSSLKRKHERQNQIHSDLK---GTAR---SPKR 725
                    D   +   ++ + GL  S + SLKR +E Q Q  +D +   GT R   S  R
Sbjct: 279  NQHILESADSALEDNCSKEVGGLTKSSMRSLKRPYESQQQSFTDAEVRGGTMRLASSRSR 338

Query: 726  VRHSGDDSPPSKCLTQLDSSHDAACFSPRRVSAVLPDKSSTFSSPTQFIADSCESKMLLN 905
               S      S  L +  ++  AA  +P  +  + PDK S+  +P+    DS +  +LL 
Sbjct: 339  AMASSCQLRRSAFLPKSGNATGAAA-APLNLFMLAPDKLSSMLNPSD--KDSDQDSLLLE 395

Query: 906  VPTNTSIAEAELL 944
            VP N    EAELL
Sbjct: 396  VPRNARHPEAELL 408


>dbj|BAJ93785.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 441

 Score =  125 bits (315), Expect = 3e-26
 Identities = 114/392 (29%), Positives = 175/392 (44%), Gaps = 63/392 (16%)
 Frame = +3

Query: 3    NYIPLHAREAFDTAELKVPKVI--AFRSNEQKLHKNNQNKWKFESECEMQYYGQNFCKFI 176
            NY P+H+R+  D  + +V K+     R  EQ+  K   +    E+     +  QNF K  
Sbjct: 44   NYTPMHSRDDIDAEQPRVSKLKPPTLRLKEQRQVKKKPSHSIRENGA---FSDQNFWKMG 100

Query: 177  EGDTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPKLS-V 353
            + D                C  YG+ D S +TGS QS+A +SL+PSKKRS+VTR K S V
Sbjct: 101  DADPSRSSSGSALSYSES-CAPYGSADVSEMTGSAQSHAWESLVPSKKRSYVTRTKSSSV 159

Query: 354  EKLIKELYSIWHEEQASNLSINSEDDLLYSCSTPLGSIEIGYGSVLIKTAXXXXXXXXXX 533
            + L+K+L+ I HEEQ S LS +SE+DL+Y  +TP+GS EIGYGS+L++++          
Sbjct: 160  DMLVKDLHCIMHEEQLSYLSGSSEEDLIYHNATPVGSFEIGYGSMLLRSSNSKSAEEDSE 219

Query: 534  XXXFPVDKSYNTNYALSKKIDGESDRPITQNLAGLSDLSSLKRK--------HE--RQNQ 683
                P D   N +Y  S+   G +   +     G S+ ++   K        HE  ++ +
Sbjct: 220  ANSVPAD---NKSYLTSESYSGTASFVVHSESKGASNSNAAPEKPKWFPVQTHENVKRGK 276

Query: 684  IH-----------SDLKGTARSPKRVRHSGDDSPPS--KCLTQ-----LDSSHDA----- 794
            +H           S L   A   +  + +G +   S  K LT+     L   H++     
Sbjct: 277  LHYSKQHTLENVGSALVSVALEGEDTKETGGNENTSALKDLTKSNMKPLKRPHESQLQSC 336

Query: 795  ----------------------ACFSPRRVSA-----VLPDKSSTFSSPTQFIADSCESK 893
                                    F P+   A     +LP    +  +P Q++ D+ +  
Sbjct: 337  PEGTMRIAKKVCKSVTMAPQFKGSFLPKSGGAPFNLLMLPPDKISMLAPPQYM-DNSDQD 395

Query: 894  MLLNVPTNTSIAEAELLYHPWKKKTNRNGSPS 989
            +LL VP N    EAELLY P++  +    S S
Sbjct: 396  LLLEVPLNARQPEAELLYQPFQLSSVARSSTS 427


>gb|EEC77737.1| hypothetical protein OsI_16852 [Oryza sativa Indica Group]
          Length = 450

 Score =  125 bits (315), Expect = 3e-26
 Identities = 116/396 (29%), Positives = 167/396 (42%), Gaps = 67/396 (16%)
 Frame = +3

Query: 3    NYIPLHAREAFDTAELKVPKVI--AFRSNEQKLHKNNQNKWKFESECEMQYYGQNFCKFI 176
            NY P+HAR+  D  E +  K+     +  EQK  K N +    E+     +  QNF K  
Sbjct: 44   NYTPMHARDDIDAEEPRASKLKPPTLKLKEQKQLKKNPSHITMENG---PFSDQNFRKMG 100

Query: 177  EGDTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPKLS-V 353
            + D                C  YGT DAS +T S QS+A +SL+PSK+RS VTRPK S +
Sbjct: 101  DPDLSNRSGSGSALSYSESCAPYGTADASEMTASAQSHAWESLVPSKRRSCVTRPKPSQM 160

Query: 354  EKLIKELYSIWHEEQASNLSINSEDDLLYSCSTPLGSIEIGYGSVLIKTAXXXXXXXXXX 533
            EKL K+L SI HEEQ   LS +SE+DL+Y  +TP+ S E+GYGS+L++            
Sbjct: 161  EKLAKDLNSIMHEEQLLYLSGSSEEDLIYHSATPVDSFEMGYGSMLLRPNSKSLEEESEA 220

Query: 534  XXXFPVDKSYNTNYALSKKID---GESDRPITQNLAGLSDLSSLKRKHE--RQNQIHSDL 698
                  +KSY T+ + S  +     ES     QN+        L +  +  R+  +H++ 
Sbjct: 221  SSIPADNKSYITSESYSGSVSFVYSESKATSNQNVITEQPKKFLVQTSDNARRANLHTEN 280

Query: 699  KGT---ARSP-------------KRVRHSGDDSPPSKCLTQLDSSHD----------AAC 800
            + T   A SP              RV+ S  +      +  L   HD             
Sbjct: 281  QDTLEIANSPLVSLHMEGKDSEETRVKTSASNRLTKSTMNPLKRPHDTHFQSSVELRGTM 340

Query: 801  FSPRRVSA---------------------------------VLPDKSSTFSSPTQFIADS 881
             SP+RVS                                  +LP    +   P Q+  + 
Sbjct: 341  RSPKRVSKYGDAMGLKCQASFMPKPGNGKDLACSDRALNLFMLPPDKLSMLVPPQYANND 400

Query: 882  CESKMLLNVPTNTSIAEAELLYHPWKKKTNRNGSPS 989
             +  +LL+VP N    EAELL  P +  +  + S S
Sbjct: 401  SDQDLLLDVPLNARHPEAELLCQPSQLSSVAHSSTS 436


>ref|NP_001053461.1| Os04g0544500 [Oryza sativa Japonica Group]
            gi|38345953|emb|CAE04346.2| OSJNBb0038F03.10 [Oryza
            sativa Japonica Group] gi|113565032|dbj|BAF15375.1|
            Os04g0544500 [Oryza sativa Japonica Group]
            gi|215697922|dbj|BAG92113.1| unnamed protein product
            [Oryza sativa Japonica Group] gi|222629300|gb|EEE61432.1|
            hypothetical protein OsJ_15656 [Oryza sativa Japonica
            Group]
          Length = 450

 Score =  125 bits (313), Expect = 4e-26
 Identities = 116/396 (29%), Positives = 164/396 (41%), Gaps = 67/396 (16%)
 Frame = +3

Query: 3    NYIPLHAREAFDTAELKVPKVI--AFRSNEQKLHKNNQNKWKFESECEMQYYGQNFCKFI 176
            NY P+HAR+  D  E +  K+     +  EQK  K N +    E+     +  QNF K  
Sbjct: 44   NYTPMHARDDIDAEEPRASKLKPPTLKLKEQKQLKKNPSHITMENG---PFSDQNFRKMG 100

Query: 177  EGDTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPKLS-V 353
            + D                C  YGT DAS +T S QS+A +SL+PSK+RS VTRPK S +
Sbjct: 101  DPDLSNRSGSGSALSYSESCAPYGTADASEMTASAQSHAWESLVPSKRRSCVTRPKPSQM 160

Query: 354  EKLIKELYSIWHEEQASNLSINSEDDLLYSCSTPLGSIEIGYGSVLIKTAXXXXXXXXXX 533
            EKL K+L SI HEEQ   LS +SE+DL+Y  +TP+ S E+GYGS+L++            
Sbjct: 161  EKLAKDLNSIMHEEQLLYLSGSSEEDLIYHSATPVDSFEMGYGSMLLRPNSKSLEEESEA 220

Query: 534  XXXFPVDKSYNTNYALSKKID---GESDRPITQN---------LAGLSDLSSLKRKH-ER 674
                  +KSY T+ + S  +     ES     QN         L   SD +     H E 
Sbjct: 221  SSIPADNKSYITSESYSGSVSFVYSESKATSNQNVITEQPKKFLVQTSDNARRANLHTEN 280

Query: 675  QNQIHS--------DLKGTARSPKRVRHSGDDSPPSKCLTQLDSSHD----------AAC 800
            Q+ + +         ++G      RV+ S  +      +  L   HD             
Sbjct: 281  QDTLENANSPLVSLHMEGKDSEETRVKTSASNRLTKSTMNPLKRPHDTHFQSSVELRGTM 340

Query: 801  FSPRRVSA---------------------------------VLPDKSSTFSSPTQFIADS 881
             SP+RVS                                  +LP    +   P Q+    
Sbjct: 341  RSPKRVSKYGDAMGLKCQASFMPKPGNGKDLACSDRALNLFMLPPDKLSMLVPPQYANTD 400

Query: 882  CESKMLLNVPTNTSIAEAELLYHPWKKKTNRNGSPS 989
             +  +LL+VP N    EAELL  P +  +  + S S
Sbjct: 401  SDQDLLLDVPLNARHPEAELLCQPSQLSSVAHSSTS 436


>ref|XP_003580263.1| PREDICTED: uncharacterized protein LOC100829762 [Brachypodium
            distachyon]
          Length = 440

 Score =  119 bits (299), Expect = 2e-24
 Identities = 123/378 (32%), Positives = 170/378 (44%), Gaps = 64/378 (16%)
 Frame = +3

Query: 3    NYIPLHAREAFDTAELKVPKVIA--FRSNEQKLHKNNQNKWKFESECEMQYYGQNFCKFI 176
            NY P+H+R+  D  E +V K+     R  EQ+  K   +    ++E    +  QNF K  
Sbjct: 44   NYTPMHSRDDIDVEEPRVSKLKPPMSRLKEQRQLKKRPSHIIKKNE---PFSDQNFRKMG 100

Query: 177  EGDTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPKLS-V 353
            + D                C  YG+ DAS +TGS QS+A +SL+PS+KRS VTR K S V
Sbjct: 101  DADPSRSSSGSAVSYSES-CAPYGSADASEMTGSAQSHAWESLVPSRKRSCVTRSKPSQV 159

Query: 354  EKLIKELYSIWHEEQASNLSINSEDDLLYSCSTPLGSIEIGYGSVLIKTAXXXXXXXXXX 533
            EKL+K+L SI HEEQ   LS +SE+DLLY   T +GS EIGYGSVL++ A          
Sbjct: 160  EKLVKDLNSIMHEEQFYCLSGSSEEDLLYHSETAVGSFEIGYGSVLLRHANSKSVDGDSE 219

Query: 534  XXXFPVD-KSYNTNYALSKKIDGESDRPITQNLAGLSDLSSLKRK----------HERQN 680
                P D KSY T+ +LS    G +   +     G S+ ++L  K          + R++
Sbjct: 220  ANSVPADNKSYVTSESLS--YSGTASFVVHGESKGASNSNALSEKPKWFPVQIHDNARRD 277

Query: 681  QIH-----------SDLKGTARSPKRVRHSGDDSPPS--KCL-------------TQLDS 782
            ++H           S L   A   K  +  G+    S  KCL             +QL S
Sbjct: 278  KLHYSKPHTLENVDSALVSVALEVKDSKEIGEKENISAVKCLVKPAMKHLKRPHESQLQS 337

Query: 783  SHDAACFSPRRVS------------------------AVLPDKSSTFSSPTQFIADSCES 890
              +    SP+R S                         + PDK S  +   Q++ DS + 
Sbjct: 338  CQETT-RSPKRGSESGAMAPQFKGSFLPKSGGALNLFMLPPDKLSMLA--PQYVDDS-DQ 393

Query: 891  KMLLNVPTNTSIAEAELL 944
             +LL VP N    EAELL
Sbjct: 394  DLLLEVPPNGRHPEAELL 411


>ref|XP_006477095.1| PREDICTED: GATA transcription factor 26-like [Citrus sinensis]
          Length = 542

 Score =  119 bits (298), Expect = 2e-24
 Identities = 71/170 (41%), Positives = 102/170 (60%), Gaps = 5/170 (2%)
 Frame = +3

Query: 3   NYIPLHAR-EAFDTAELKVPKVIAFRSNEQKLHKNNQNKWKFESECEMQY---YGQNFCK 170
           NY PLHAR E  D  + +V KV +   N+ K  K  + K  +++     +   Y   + K
Sbjct: 44  NYTPLHARAEPDDYEDHRVSKVKSISINKNKDVKVLKRKSNYDNVVVGGFAPDYNHGYRK 103

Query: 171 FIEGDTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPKLS 350
            ++ DT               C+ +G+ DAS++TG  QSN  DS++PSKKR+ V RPK S
Sbjct: 104 VVDEDTSNRSSSGSAISNSESCVQFGSADASDLTGPAQSNVWDSVVPSKKRTCVNRPKQS 163

Query: 351 -VEKLIKELYSIWHEEQASNLSINSEDDLLYSCSTPLGSIEIGYGSVLIK 497
            VEKL K+LY+I HE+Q+S  S +SE+DLL+   TP+ S+EIG+GSVLI+
Sbjct: 164 PVEKLTKDLYTILHEQQSSYFSGSSEEDLLFESETPMVSVEIGHGSVLIR 213


>ref|XP_006440183.1| hypothetical protein CICLE_v10019614mg [Citrus clementina]
           gi|567895392|ref|XP_006440184.1| hypothetical protein
           CICLE_v10019614mg [Citrus clementina]
           gi|557542445|gb|ESR53423.1| hypothetical protein
           CICLE_v10019614mg [Citrus clementina]
           gi|557542446|gb|ESR53424.1| hypothetical protein
           CICLE_v10019614mg [Citrus clementina]
          Length = 542

 Score =  119 bits (298), Expect = 2e-24
 Identities = 71/170 (41%), Positives = 102/170 (60%), Gaps = 5/170 (2%)
 Frame = +3

Query: 3   NYIPLHAR-EAFDTAELKVPKVIAFRSNEQKLHKNNQNKWKFESECEMQY---YGQNFCK 170
           NY PLHAR E  D  + +V KV +   N+ K  K  + K  +++     +   Y   + K
Sbjct: 44  NYTPLHARAEPDDYEDHRVSKVKSISINKNKDVKVLKRKSNYDNVVVGGFAPDYNHGYRK 103

Query: 171 FIEGDTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPKLS 350
            ++ DT               C+ +G+ DAS++TG  QSN  DS++PSKKR+ V RPK S
Sbjct: 104 VVDEDTSNRSSSGSAISNSESCVQFGSADASDLTGPAQSNVWDSVVPSKKRTCVNRPKQS 163

Query: 351 -VEKLIKELYSIWHEEQASNLSINSEDDLLYSCSTPLGSIEIGYGSVLIK 497
            VEKL K+LY+I HE+Q+S  S +SE+DLL+   TP+ S+EIG+GSVLI+
Sbjct: 164 PVEKLTKDLYTILHEQQSSYFSGSSEEDLLFESETPMVSVEIGHGSVLIR 213


>gb|EOY24200.1| GATA transcription factor, putative isoform 2 [Theobroma cacao]
          Length = 400

 Score =  118 bits (295), Expect = 5e-24
 Identities = 72/167 (43%), Positives = 98/167 (58%), Gaps = 2/167 (1%)
 Frame = +3

Query: 3   NYIPLHAR-EAFDTAELKVPKVIAFRSNEQKLHKNNQNKWKFESECEMQYYGQNFCKFIE 179
           NY PLHAR E  D  + +  +V +   N+ K  K  + K   ++      Y Q F KF++
Sbjct: 44  NYTPLHARVEPDDYEDHRASRVKSISINKNKEIKLLKRKPNHDTAVVAPDYNQGFRKFVD 103

Query: 180 GDTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPKLS-VE 356
            DT               C  +G+ DAS++TG  QSN  DS++PSKKR+ V RPK S VE
Sbjct: 104 EDTSNRSSSGSAISNSESCAQFGSGDASDLTGPAQSNVWDSMVPSKKRTCVNRPKPSPVE 163

Query: 357 KLIKELYSIWHEEQASNLSINSEDDLLYSCSTPLGSIEIGYGSVLIK 497
           KL K+LY+I H EQ+S  S +SE+DLL    TP+ S+EIG+GSVLI+
Sbjct: 164 KLTKDLYTILH-EQSSYFSGSSEEDLLLESETPMVSVEIGHGSVLIR 209


>gb|EOY24199.1| GATA transcription factor, putative isoform 1 [Theobroma cacao]
          Length = 538

 Score =  118 bits (295), Expect = 5e-24
 Identities = 72/167 (43%), Positives = 98/167 (58%), Gaps = 2/167 (1%)
 Frame = +3

Query: 3   NYIPLHAR-EAFDTAELKVPKVIAFRSNEQKLHKNNQNKWKFESECEMQYYGQNFCKFIE 179
           NY PLHAR E  D  + +  +V +   N+ K  K  + K   ++      Y Q F KF++
Sbjct: 44  NYTPLHARVEPDDYEDHRASRVKSISINKNKEIKLLKRKPNHDTAVVAPDYNQGFRKFVD 103

Query: 180 GDTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPKLS-VE 356
            DT               C  +G+ DAS++TG  QSN  DS++PSKKR+ V RPK S VE
Sbjct: 104 EDTSNRSSSGSAISNSESCAQFGSGDASDLTGPAQSNVWDSMVPSKKRTCVNRPKPSPVE 163

Query: 357 KLIKELYSIWHEEQASNLSINSEDDLLYSCSTPLGSIEIGYGSVLIK 497
           KL K+LY+I H EQ+S  S +SE+DLL    TP+ S+EIG+GSVLI+
Sbjct: 164 KLTKDLYTILH-EQSSYFSGSSEEDLLLESETPMVSVEIGHGSVLIR 209


>ref|XP_002448265.1| hypothetical protein SORBIDRAFT_06g024200 [Sorghum bicolor]
           gi|241939448|gb|EES12593.1| hypothetical protein
           SORBIDRAFT_06g024200 [Sorghum bicolor]
          Length = 447

 Score =  118 bits (295), Expect = 5e-24
 Identities = 81/199 (40%), Positives = 105/199 (52%), Gaps = 2/199 (1%)
 Frame = +3

Query: 3   NYIPLHAREAFDTAELKVPKVIAFRSNEQKLHKNNQNKWKFESECEMQYYGQNFCKFIEG 182
           NY P+H ++  D  E +V K+    +++ K  K   N    E+     + GQNF K    
Sbjct: 44  NYTPMHRKDDIDDDEPRVSKLKP-PTSKSKSQKKKPNHIIAENGL---FSGQNFRKMGGV 99

Query: 183 DTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPKLS-VEK 359
           D                C  YG  DAS +TGS QS+A +SL+PS+KRS VTRPK S VEK
Sbjct: 100 DPSYQSSSGSAVSYSESCAPYGAADASEMTGSAQSHAWESLVPSRKRSCVTRPKPSPVEK 159

Query: 360 LIKELYSIWHEEQASNLSINSEDDLLYSCSTPLGSIEIGYGSVLIKTAXXXXXXXXXXXX 539
           L K+L SI H EQ  NLS +SE+DLLY   TP+GS EIG GSVL++              
Sbjct: 160 LAKDLNSIMHGEQLYNLSGSSEEDLLYHSETPVGSFEIGSGSVLLRHPNSKLLEEESEAS 219

Query: 540 XFPVD-KSYNTNYALSKKI 593
             P D KSY T+ + S  +
Sbjct: 220 SIPADNKSYITSESYSGSV 238


>ref|XP_006838526.1| hypothetical protein AMTR_s00002p00191340 [Amborella trichopoda]
           gi|548841032|gb|ERN01095.1| hypothetical protein
           AMTR_s00002p00191340 [Amborella trichopoda]
          Length = 525

 Score =  117 bits (292), Expect = 1e-23
 Identities = 74/169 (43%), Positives = 100/169 (59%), Gaps = 4/169 (2%)
 Frame = +3

Query: 3   NYIPLHAR-EAFDTAELKVPKVI--AFRSNEQKLHKNNQNKWKFESECEMQYYGQNFCKF 173
           NY PLH+R EA ++     PKV   + +  E KLHK  QN    E++ E   +   + + 
Sbjct: 44  NYTPLHSRGEAIESDVSNFPKVKNPSLKLKEDKLHKRKQNDIIEEAKGEEAGFAL-YRRG 102

Query: 174 IEGDTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPK-LS 350
           +E DT               C+ + + DA +I GS QSNA DSLIPS+KR+ V R K  S
Sbjct: 103 LEEDTSTRSSSGSAISYSESCVQFASTDAKDIRGSAQSNAWDSLIPSRKRTCVNRQKPSS 162

Query: 351 VEKLIKELYSIWHEEQASNLSINSEDDLLYSCSTPLGSIEIGYGSVLIK 497
           VEKL KELY I HE++ S LS  SE+DLL+  +TP+ S+EIG+G VLI+
Sbjct: 163 VEKLTKELYCILHEQELSYLSGTSEEDLLFETTTPMVSVEIGHGGVLIR 211


>ref|XP_006368951.1| zinc finger family protein [Populus trichocarpa]
           gi|550347310|gb|ERP65520.1| zinc finger family protein
           [Populus trichocarpa]
          Length = 552

 Score =  116 bits (290), Expect = 2e-23
 Identities = 76/203 (37%), Positives = 107/203 (52%), Gaps = 6/203 (2%)
 Frame = +3

Query: 3   NYIPLHAREAFDTAE----LKVPKVIAFRSNEQKLHKNNQNKWKFESECEMQYYGQNFCK 170
           NY PLHAR   D  E     ++  +   ++ E KL K   N     +E     Y + + K
Sbjct: 52  NYTPLHARAGPDDYEDHRVSRLKSISMNKNREVKLLKRKPNYDHRVAEGVALDYNEGYRK 111

Query: 171 FIEGDTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPKLS 350
            ++ DT               C  +G+ DAS++TG  QS   DSL+PS+KR+ V RPK S
Sbjct: 112 VVDEDTSNRSSSGSAISNSESCAQFGSADASDLTGPAQSVVWDSLVPSRKRTCVNRPKPS 171

Query: 351 -VEKLIKELYSIWHEEQASNLSINSEDDLLYSCSTPLGSIEIGYGSVLIKTAXXXXXXXX 527
            VEKL K+LY+I HE+Q+S  S +SE+DLL+   TP+ S+EIG+GSVLI+          
Sbjct: 172 PVEKLTKDLYTILHEQQSSCFSGSSEEDLLFDNETPMVSVEIGHGSVLIRHPSSIARDEE 231

Query: 528 XXXXXFPVD-KSYNTNYALSKKI 593
                  V+ K Y+TN A S  +
Sbjct: 232 SEASSLSVENKQYSTNEAYSHPV 254



 Score = 60.8 bits (146), Expect = 1e-06
 Identities = 40/123 (32%), Positives = 64/123 (52%), Gaps = 2/123 (1%)
 Frame = +3

Query: 585 KKIDGESDRPITQNLAGLSDLSSLKRKHERQNQIHSDLKGTARSPKRVRHSGDDSPPSKC 764
           K + G+   P   N+   S+L   KR  +  +Q  S+ K + +SPKR+          K 
Sbjct: 418 KSLVGKGPNP---NVVASSNLIGAKRSRDNLSQKFSEAK-SMKSPKRI--------VMKA 465

Query: 765 LTQLDS--SHDAACFSPRRVSAVLPDKSSTFSSPTQFIADSCESKMLLNVPTNTSIAEAE 938
             ++     +D +CFSPR + A+ PD SS       F+ +S +  +LL++P+N S A+AE
Sbjct: 466 TYEIKELIDNDGSCFSPRSLFALPPDGSSLMLDSLHFVDESSDQDLLLDIPSNGSFAQAE 525

Query: 939 LLY 947
           LLY
Sbjct: 526 LLY 528


>gb|AFW59044.1| hypothetical protein ZEAMMB73_136468 [Zea mays]
          Length = 543

 Score =  116 bits (290), Expect = 2e-23
 Identities = 80/196 (40%), Positives = 103/196 (52%), Gaps = 2/196 (1%)
 Frame = +3

Query: 3   NYIPLHAREAFDTAELKVPKVIAFRSNEQKLHKNNQNKWKFESECEMQYYGQNFCKFIEG 182
           NY P+H  +  D  E +V K+    +++ K  K   N    E+     + GQNF K  + 
Sbjct: 44  NYTPMHRNDNIDDDEPRVSKLKP-PTSKLKSQKKKTNHIIMENG---PFSGQNFRKMGDV 99

Query: 183 DTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPKLS-VEK 359
           D                C  YG  DAS +TGS QS+A +SL+PS+KRS VTRPK S VEK
Sbjct: 100 DPSYRSSSGSAVSYSESCAPYGAADASEMTGSAQSHAWESLVPSRKRSCVTRPKPSPVEK 159

Query: 360 LIKELYSIWHEEQASNLSINSEDDLLYSCSTPLGSIEIGYGSVLIKTAXXXXXXXXXXXX 539
           L KEL  I HEE+   LS +SE+DLLY   TP+GS EIG GSVL++              
Sbjct: 160 LAKELNYIMHEEKLYYLSESSEEDLLYHSETPIGSFEIGSGSVLLRHPNSKSLEEESKTS 219

Query: 540 XFPVD-KSYNTNYALS 584
             P D KSY T+ + S
Sbjct: 220 SIPADNKSYITSESYS 235


>ref|XP_002326479.1| predicted protein [Populus trichocarpa]
          Length = 544

 Score =  116 bits (290), Expect = 2e-23
 Identities = 76/203 (37%), Positives = 107/203 (52%), Gaps = 6/203 (2%)
 Frame = +3

Query: 3   NYIPLHAREAFDTAE----LKVPKVIAFRSNEQKLHKNNQNKWKFESECEMQYYGQNFCK 170
           NY PLHAR   D  E     ++  +   ++ E KL K   N     +E     Y + + K
Sbjct: 44  NYTPLHARAGPDDYEDHRVSRLKSISMNKNREVKLLKRKPNYDHRVAEGVALDYNEGYRK 103

Query: 171 FIEGDTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPKLS 350
            ++ DT               C  +G+ DAS++TG  QS   DSL+PS+KR+ V RPK S
Sbjct: 104 VVDEDTSNRSSSGSAISNSESCAQFGSADASDLTGPAQSVVWDSLVPSRKRTCVNRPKPS 163

Query: 351 -VEKLIKELYSIWHEEQASNLSINSEDDLLYSCSTPLGSIEIGYGSVLIKTAXXXXXXXX 527
            VEKL K+LY+I HE+Q+S  S +SE+DLL+   TP+ S+EIG+GSVLI+          
Sbjct: 164 PVEKLTKDLYTILHEQQSSCFSGSSEEDLLFDNETPMVSVEIGHGSVLIRHPSSIARDEE 223

Query: 528 XXXXXFPVD-KSYNTNYALSKKI 593
                  V+ K Y+TN A S  +
Sbjct: 224 SEASSLSVENKQYSTNEAYSHPV 246



 Score = 60.8 bits (146), Expect = 1e-06
 Identities = 40/123 (32%), Positives = 64/123 (52%), Gaps = 2/123 (1%)
 Frame = +3

Query: 585 KKIDGESDRPITQNLAGLSDLSSLKRKHERQNQIHSDLKGTARSPKRVRHSGDDSPPSKC 764
           K + G+   P   N+   S+L   KR  +  +Q  S+ K + +SPKR+          K 
Sbjct: 410 KSLVGKGPNP---NVVASSNLIGAKRSRDNLSQKFSEAK-SMKSPKRI--------VMKA 457

Query: 765 LTQLDS--SHDAACFSPRRVSAVLPDKSSTFSSPTQFIADSCESKMLLNVPTNTSIAEAE 938
             ++     +D +CFSPR + A+ PD SS       F+ +S +  +LL++P+N S A+AE
Sbjct: 458 TYEIKELIDNDGSCFSPRSLFALPPDGSSLMLDSLHFVDESSDQDLLLDIPSNGSFAQAE 517

Query: 939 LLY 947
           LLY
Sbjct: 518 LLY 520


>ref|XP_006385556.1| hypothetical protein POPTR_0003s08080g [Populus trichocarpa]
           gi|118486445|gb|ABK95062.1| unknown [Populus
           trichocarpa] gi|550342683|gb|ERP63353.1| hypothetical
           protein POPTR_0003s08080g [Populus trichocarpa]
          Length = 540

 Score =  114 bits (286), Expect = 6e-23
 Identities = 85/255 (33%), Positives = 128/255 (50%), Gaps = 16/255 (6%)
 Frame = +3

Query: 3   NYIPLHAR-EAFDTAELKVPKVIAFRSNEQKLHKNNQNKWKFESECEMQYYGQNFCKFIE 179
           NY PLHAR E  D  + +V ++ +   ++ K  K  + K  +++   + Y  Q + K ++
Sbjct: 44  NYTPLHARAEPDDYEDHRVSRLKSVSISKNKEVKLLKRKPNYDNRVALDY-NQGYRKVVD 102

Query: 180 GDTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPK-LSVE 356
            DT               C  +G+ +AS++TG  QS   DSL+PS+KR+ V RPK  SVE
Sbjct: 103 EDTSNRSSSGSAISNPESCAQFGSAEASDLTGPAQSVVWDSLVPSRKRTCVNRPKPSSVE 162

Query: 357 KLIKELYSIWHEEQASNLSINSEDDLLYSCSTPLGSIEIGYGSVLIKTAXXXXXXXXXXX 536
           KL K+LY+I HE+Q+S  S +SE+DLL+   TP+ S+EIG+GSVLI+             
Sbjct: 163 KLTKDLYTILHEQQSSCFSGSSEEDLLFDNETPMVSVEIGHGSVLIRHPSSIARDEESEA 222

Query: 537 XXFPVD-KSYNTNYALSKKI---------DGESDRPITQNLAGLS----DLSSLKRKHER 674
               V+ K Y TN A S  +            +  PIT+    L+        LKR    
Sbjct: 223 SSLSVENKQYLTNEAYSHPVILPVHNENKSVNTTYPITETTKNLTGQGMQQEQLKRDKFP 282

Query: 675 QNQIHSDLKGTARSP 719
             ++H  + G+  SP
Sbjct: 283 HEKVH--ILGSHNSP 295


>gb|EMJ11074.1| hypothetical protein PRUPE_ppa003888mg [Prunus persica]
          Length = 542

 Score =  114 bits (285), Expect = 8e-23
 Identities = 68/170 (40%), Positives = 95/170 (55%), Gaps = 5/170 (2%)
 Frame = +3

Query: 3   NYIPLHAREAFDTAE----LKVPKVIAFRSNEQKLHKNNQNKWKFESECEMQYYGQNFCK 170
           NY PLHAR   D  E     +V  +   ++ E KL K  QN            Y   F K
Sbjct: 44  NYTPLHARAEPDDYEDHRVSRVKSISINKNKEIKLVKRKQNPDSVMVGGVAADYAHGFRK 103

Query: 171 FIEGDTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPKLS 350
             + DT               C  +G+ DAS++TG  QS   DS++PS+KR+ + RPK S
Sbjct: 104 VTDEDTSNRSSSGSAVSNSESCAQFGSADASDLTGPAQSMVWDSMVPSRKRTCIGRPKPS 163

Query: 351 -VEKLIKELYSIWHEEQASNLSINSEDDLLYSCSTPLGSIEIGYGSVLIK 497
            VE+L K+LY+I HE+Q+S  S +SE+DLL+ C TP+ S+EIG+GSVL++
Sbjct: 164 PVERLTKDLYTILHEQQSSYFSGSSEEDLLFECETPMVSVEIGHGSVLMR 213



 Score = 60.1 bits (144), Expect = 2e-06
 Identities = 43/129 (33%), Positives = 64/129 (49%), Gaps = 2/129 (1%)
 Frame = +3

Query: 573 YALSKKIDGESDRPITQ--NLAGLSDLSSLKRKHERQNQIHSDLKGTARSPKRVRHSGDD 746
           Y L KK      + +    N    S+   +KR  + + Q   D+K   +SPKR+   G +
Sbjct: 398 YHLLKKCKTSPGKSVISGPNTLASSNFRHVKRLRDSETQSFPDVKMMMKSPKRIIVKGSN 457

Query: 747 SPPSKCLTQLDSSHDAACFSPRRVSAVLPDKSSTFSSPTQFIADSCESKMLLNVPTNTSI 926
              +K L   D S    CFSPR + A+  D SS       F+ +S +  +LL++P+N S 
Sbjct: 458 E--NKDLMDYDGS----CFSPRSLFALPADGSSFLMESMNFVDESSDQDLLLHLPSNGSF 511

Query: 927 AEAELLYHP 953
           A+AELL HP
Sbjct: 512 AQAELL-HP 519


>emb|CAN76534.1| hypothetical protein VITISV_006083 [Vitis vinifera]
          Length = 542

 Score =  114 bits (285), Expect = 8e-23
 Identities = 77/201 (38%), Positives = 104/201 (51%), Gaps = 6/201 (2%)
 Frame = +3

Query: 3   NYIPLHAREAFDTAE----LKVPKVIAFRSNEQKLHKNNQNKWKFESECEMQYYGQNFCK 170
           NY PLHAR   D AE     +V  +   ++ E KL K  QN+           Y Q   K
Sbjct: 44  NYTPLHARVDGDDAEDYRVSRVKSISINKNKEVKLLKRKQNQDNVVVNGVASDYSQGSRK 103

Query: 171 FIEGDTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPK-L 347
            I+ DT               C  +G+ DAS++TG  QS   D+++PS+KR+ V RPK  
Sbjct: 104 AIDEDTSNRSSSGSAISNSESCAQFGSADASDLTGPSQSIVWDTMVPSRKRTCVNRPKPS 163

Query: 348 SVEKLIKELYSIWHEEQASNLSINSEDDLLYSCSTPLGSIEIGYGSVLIKTAXXXXXXXX 527
           SVEKL K+L +I HE+Q+S  S +SE+DLL+   TP+ S+EIG+GSVLI+          
Sbjct: 164 SVEKLTKDLCTILHEQQSSYFSGSSEEDLLFESETPMVSVEIGHGSVLIRHPSAIGREEE 223

Query: 528 XXXXXFPVD-KSYNTNYALSK 587
                  VD KSY  N   S+
Sbjct: 224 SEASSLSVDNKSYLVNEVYSR 244


>ref|XP_004244556.1| PREDICTED: GATA transcription factor 26-like [Solanum lycopersicum]
          Length = 542

 Score =  114 bits (284), Expect = 1e-22
 Identities = 69/169 (40%), Positives = 100/169 (59%), Gaps = 4/169 (2%)
 Frame = +3

Query: 3   NYIPLHAR-EAFDTAELKVP--KVIAFRSNEQKLHKNNQNKWKFESECEMQYYGQNFCKF 173
           NY PLHAR E  D  E +V   K I+ ++ E K+ K  Q+    ++E     Y   F K 
Sbjct: 44  NYTPLHARAEPCDFEEHRVSRFKNISMKNKEAKILKRKQSH--HDAEVGTPDYSLGFRKV 101

Query: 174 IEGDTXXXXXXXXXXXXXXXCLHYGTDDASNITGSVQSNACDSLIPSKKRSFVTRPK-LS 350
           ++ DT               C  +G+ +AS++TG  QSN  DS +PS+KR+   RPK  S
Sbjct: 102 LDEDTSNRSSSGSAISNSESCAQFGSAEASDLTGPAQSNIWDSTVPSRKRTCFNRPKPSS 161

Query: 351 VEKLIKELYSIWHEEQASNLSINSEDDLLYSCSTPLGSIEIGYGSVLIK 497
           VEKL K+LY+I HE+Q+S LS +SE++LL+    P+ S+EIG+GSVL++
Sbjct: 162 VEKLTKDLYTILHEQQSSYLSASSEEELLFESDKPMVSVEIGHGSVLMR 210

