BLASTX nr result

ID: Cornus23_contig00018015 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00018015
         (957 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN68079.1| hypothetical protein VITISV_006312 [Vitis vinifera]   104   9e-20
ref|XP_010647950.1| PREDICTED: homeobox protein HAT3.1 isoform X...   104   1e-19
ref|XP_010647949.1| PREDICTED: homeobox protein HAT3.1 isoform X...   104   1e-19
emb|CBI22504.3| unnamed protein product [Vitis vinifera]              104   1e-19
ref|XP_002269077.1| PREDICTED: homeobox protein HAT3.1 isoform X...   104   1e-19
ref|XP_010099058.1| Homeobox protein [Morus notabilis] gi|587887...    87   1e-14
ref|XP_008236405.1| PREDICTED: LOW QUALITY PROTEIN: homeobox pro...    85   7e-14
ref|XP_011457795.1| PREDICTED: homeobox protein HAT3.1 isoform X...    82   5e-13
ref|XP_004289744.1| PREDICTED: homeobox protein HAT3.1 isoform X...    82   5e-13
ref|XP_007042568.1| Homeodomain-like protein with RING/FYVE/PHD-...    82   8e-13
ref|XP_007200058.1| hypothetical protein PRUPE_ppa023106mg [Prun...    81   1e-12
gb|KHG23766.1| Homeobox-leucine zipper protein HAT3 [Gossypium a...    77   3e-11
gb|KJB32306.1| hypothetical protein B456_005G234300 [Gossypium r...    76   3e-11
ref|XP_012480183.1| PREDICTED: homeobox protein HAT3.1-like isof...    76   3e-11
ref|XP_012480184.1| PREDICTED: homeobox protein HAT3.1-like isof...    76   3e-11
gb|KJB32302.1| hypothetical protein B456_005G234300 [Gossypium r...    76   3e-11
gb|KJB32301.1| hypothetical protein B456_005G234300 [Gossypium r...    76   3e-11
ref|XP_011088190.1| PREDICTED: pathogenesis-related homeodomain ...    76   3e-11
ref|XP_011088187.1| PREDICTED: pathogenesis-related homeodomain ...    76   3e-11
ref|XP_009351161.1| PREDICTED: homeobox protein HAT3.1-like [Pyr...    73   4e-10

>emb|CAN68079.1| hypothetical protein VITISV_006312 [Vitis vinifera]
          Length = 611

 Score =  104 bits (260), Expect = 9e-20
 Identities = 89/229 (38%), Positives = 115/229 (50%), Gaps = 4/229 (1%)
 Frame = -1

Query: 675 TPKQTTSEQIPEFGSESLHGESVEQKQAIGSEDFQNRQAETRMADSICMDVEQSGLPPEH 496
           +PKQ   E+  +  SES+  ES EQK+   SE+ Q+  AE     S C+  EQS LPPE 
Sbjct: 17  SPKQNILEEARKL-SESVCSESSEQKRX--SENGQHEPAEISPVLSNCIVTEQSELPPED 73

Query: 495 VTGSSMQPDVEQSGLPPEHATGKSSCEQLRSPPQDVAMKSCLEQLES-PEDAAVNSNLER 319
           V  + +       GLPP   T  S  E L  PP+D       EQL   PE    +S +E+
Sbjct: 74  VGDTIL-------GLPPADVTKNSLXEHLGLPPEDAIKNDGTEQLGXFPEVVTKSSIIEK 126

Query: 318 LGH---PPDSASQNPCLEQIGPQLPDADKNPCELEHRDKRTSAKSRKGKYAVGPSVGCSR 148
           LG    PP++ ++   L+Q G    D           +KRT AK  K KY +  SV  SR
Sbjct: 127 LGQSEPPPENVARYSGLDQSGSAPKDL---------ANKRT-AKLVKRKYKLRSSVSGSR 176

Query: 147 VLRSRSQEKSKAQEPVNTLAEQGANEEXXXXXXKRQEERTTINEFARIR 1
           VLRSRSQEK KA +P +      A+ E      KR   +TT +EFARIR
Sbjct: 177 VLRSRSQEKPKASQPSDNFVNASASRERKGRKKKRM-NKTTADEFARIR 224


>ref|XP_010647950.1| PREDICTED: homeobox protein HAT3.1 isoform X3 [Vitis vinifera]
          Length = 717

 Score =  104 bits (259), Expect = 1e-19
 Identities = 89/229 (38%), Positives = 115/229 (50%), Gaps = 4/229 (1%)
 Frame = -1

Query: 675 TPKQTTSEQIPEFGSESLHGESVEQKQAIGSEDFQNRQAETRMADSICMDVEQSGLPPEH 496
           +PKQ   E+  +  SES+  ES EQK+   SE+ Q+  AE     S C+  EQS LPPE 
Sbjct: 17  SPKQNILEEARKL-SESVCSESSEQKRP--SENGQHEPAEISPVLSNCIVTEQSELPPED 73

Query: 495 VTGSSMQPDVEQSGLPPEHATGKSSCEQLRSPPQDVAMKSCLEQLES-PEDAAVNSNLER 319
           V  + +       GLPP   T  S  E L  PP+D       EQL   PE    +S +E+
Sbjct: 74  VGDTIL-------GLPPADVTKNSLTEHLGLPPEDAIKNDGTEQLGFFPEVVTKSSIIEK 126

Query: 318 LGH---PPDSASQNPCLEQIGPQLPDADKNPCELEHRDKRTSAKSRKGKYAVGPSVGCSR 148
           LG    PP++ ++   L+Q G    D           +KRT AK  K KY +  SV  SR
Sbjct: 127 LGQSEPPPENVARYSGLDQSGSAPKDL---------ANKRT-AKLVKRKYKLRSSVSGSR 176

Query: 147 VLRSRSQEKSKAQEPVNTLAEQGANEEXXXXXXKRQEERTTINEFARIR 1
           VLRSRSQEK KA +P +      A+ E      KR   +TT +EFARIR
Sbjct: 177 VLRSRSQEKPKASQPSDNFVNASASRERKGRKKKRM-NKTTADEFARIR 224


>ref|XP_010647949.1| PREDICTED: homeobox protein HAT3.1 isoform X2 [Vitis vinifera]
          Length = 915

 Score =  104 bits (259), Expect = 1e-19
 Identities = 89/229 (38%), Positives = 115/229 (50%), Gaps = 4/229 (1%)
 Frame = -1

Query: 675 TPKQTTSEQIPEFGSESLHGESVEQKQAIGSEDFQNRQAETRMADSICMDVEQSGLPPEH 496
           +PKQ   E+  +  SES+  ES EQK+   SE+ Q+  AE     S C+  EQS LPPE 
Sbjct: 17  SPKQNILEEARKL-SESVCSESSEQKRP--SENGQHEPAEISPVLSNCIVTEQSELPPED 73

Query: 495 VTGSSMQPDVEQSGLPPEHATGKSSCEQLRSPPQDVAMKSCLEQLES-PEDAAVNSNLER 319
           V  + +       GLPP   T  S  E L  PP+D       EQL   PE    +S +E+
Sbjct: 74  VGDTIL-------GLPPADVTKNSLTEHLGLPPEDAIKNDGTEQLGFFPEVVTKSSIIEK 126

Query: 318 LGH---PPDSASQNPCLEQIGPQLPDADKNPCELEHRDKRTSAKSRKGKYAVGPSVGCSR 148
           LG    PP++ ++   L+Q G    D           +KRT AK  K KY +  SV  SR
Sbjct: 127 LGQSEPPPENVARYSGLDQSGSAPKDL---------ANKRT-AKLVKRKYKLRSSVSGSR 176

Query: 147 VLRSRSQEKSKAQEPVNTLAEQGANEEXXXXXXKRQEERTTINEFARIR 1
           VLRSRSQEK KA +P +      A+ E      KR   +TT +EFARIR
Sbjct: 177 VLRSRSQEKPKASQPSDNFVNASASRERKGRKKKRM-NKTTADEFARIR 224


>emb|CBI22504.3| unnamed protein product [Vitis vinifera]
          Length = 977

 Score =  104 bits (259), Expect = 1e-19
 Identities = 89/229 (38%), Positives = 115/229 (50%), Gaps = 4/229 (1%)
 Frame = -1

Query: 675 TPKQTTSEQIPEFGSESLHGESVEQKQAIGSEDFQNRQAETRMADSICMDVEQSGLPPEH 496
           +PKQ   E+  +  SES+  ES EQK+   SE+ Q+  AE     S C+  EQS LPPE 
Sbjct: 17  SPKQNILEEARKL-SESVCSESSEQKRP--SENGQHEPAEISPVLSNCIVTEQSELPPED 73

Query: 495 VTGSSMQPDVEQSGLPPEHATGKSSCEQLRSPPQDVAMKSCLEQLES-PEDAAVNSNLER 319
           V  + +       GLPP   T  S  E L  PP+D       EQL   PE    +S +E+
Sbjct: 74  VGDTIL-------GLPPADVTKNSLTEHLGLPPEDAIKNDGTEQLGFFPEVVTKSSIIEK 126

Query: 318 LGH---PPDSASQNPCLEQIGPQLPDADKNPCELEHRDKRTSAKSRKGKYAVGPSVGCSR 148
           LG    PP++ ++   L+Q G    D           +KRT AK  K KY +  SV  SR
Sbjct: 127 LGQSEPPPENVARYSGLDQSGSAPKDL---------ANKRT-AKLVKRKYKLRSSVSGSR 176

Query: 147 VLRSRSQEKSKAQEPVNTLAEQGANEEXXXXXXKRQEERTTINEFARIR 1
           VLRSRSQEK KA +P +      A+ E      KR   +TT +EFARIR
Sbjct: 177 VLRSRSQEKPKASQPSDNFVNASASRERKGRKKKRM-NKTTADEFARIR 224


>ref|XP_002269077.1| PREDICTED: homeobox protein HAT3.1 isoform X1 [Vitis vinifera]
          Length = 968

 Score =  104 bits (259), Expect = 1e-19
 Identities = 89/229 (38%), Positives = 115/229 (50%), Gaps = 4/229 (1%)
 Frame = -1

Query: 675 TPKQTTSEQIPEFGSESLHGESVEQKQAIGSEDFQNRQAETRMADSICMDVEQSGLPPEH 496
           +PKQ   E+  +  SES+  ES EQK+   SE+ Q+  AE     S C+  EQS LPPE 
Sbjct: 17  SPKQNILEEARKL-SESVCSESSEQKRP--SENGQHEPAEISPVLSNCIVTEQSELPPED 73

Query: 495 VTGSSMQPDVEQSGLPPEHATGKSSCEQLRSPPQDVAMKSCLEQLES-PEDAAVNSNLER 319
           V  + +       GLPP   T  S  E L  PP+D       EQL   PE    +S +E+
Sbjct: 74  VGDTIL-------GLPPADVTKNSLTEHLGLPPEDAIKNDGTEQLGFFPEVVTKSSIIEK 126

Query: 318 LGH---PPDSASQNPCLEQIGPQLPDADKNPCELEHRDKRTSAKSRKGKYAVGPSVGCSR 148
           LG    PP++ ++   L+Q G    D           +KRT AK  K KY +  SV  SR
Sbjct: 127 LGQSEPPPENVARYSGLDQSGSAPKDL---------ANKRT-AKLVKRKYKLRSSVSGSR 176

Query: 147 VLRSRSQEKSKAQEPVNTLAEQGANEEXXXXXXKRQEERTTINEFARIR 1
           VLRSRSQEK KA +P +      A+ E      KR   +TT +EFARIR
Sbjct: 177 VLRSRSQEKPKASQPSDNFVNASASRERKGRKKKRM-NKTTADEFARIR 224


>ref|XP_010099058.1| Homeobox protein [Morus notabilis] gi|587887924|gb|EXB76647.1|
           Homeobox protein [Morus notabilis]
          Length = 1031

 Score = 87.4 bits (215), Expect = 1e-14
 Identities = 95/320 (29%), Positives = 137/320 (42%), Gaps = 3/320 (0%)
 Frame = -1

Query: 951 AELNNEFSWRSVCSEQLEQKVDIVNENVPNGPAKTTTAEFNCVGIEQLRPVPEELTTNSS 772
           AE+ +     ++C+EQ   K D V+ N+ N   KT     + V  ++L  V E+ +  S 
Sbjct: 148 AEVKHGCGSGNLCAEQAGTKND-VDSNLQNEIRKTDITVSSFVFTQKLEIVSEKRSLISG 206

Query: 771 CKQLKSAAEDATMDSNLGQFELLPKDAVEGIPTPKQTTSEQIPEFGSESLHGESVEQKQA 592
              L   +ED                  E    P+Q+T  QI +F    L GE+ +Q+  
Sbjct: 207 -GNLAVPSEDVVRHCQ-----------TENSSCPQQSTLGQIKDFDCGCLLGETPKQEDH 254

Query: 591 IGSEDFQNRQAETRMADSICMDVEQSGLPPEHVTGSSMQPDVEQSGLPPEHATGKSSC-- 418
           +G+E  QN   ETR+A S       +G+  EH              L P    G  S   
Sbjct: 255 LGTELVQNVLVETRIAAS-------NGIVSEH--------------LEPPVGDGSDSYID 293

Query: 417 EQLRSPPQDVAMKSCLEQLESPEDAAVNSNLERLGHPPDSASQNPCLEQIGPQLPDADKN 238
           +Q+  P +DV+  S LEQLE+   + VN                                
Sbjct: 294 KQVEQPSEDVSKSSSLEQLETSSKSLVNK------------------------------- 322

Query: 237 PCELEHRDKRTSAKSRKGKYAVGPSVGCSRVLRSRSQEKSKAQEPVNTLAEQGAN-EEXX 61
           P +L  +DK+TS KSRK +Y +   V   RVLRSR+QEK K+ E  NTL+  G   E+  
Sbjct: 323 PSQLGRKDKQTS-KSRKKQYMLRSLVHSDRVLRSRTQEKLKSHELSNTLSNIGNGVEKRM 381

Query: 60  XXXXKRQEERTTINEFARIR 1
               KR+  R   +EF+RIR
Sbjct: 382 KERKKRRGTRVIADEFSRIR 401


>ref|XP_008236405.1| PREDICTED: LOW QUALITY PROTEIN: homeobox protein HAT3.1 [Prunus
           mume]
          Length = 1040

 Score = 85.1 bits (209), Expect = 7e-14
 Identities = 94/328 (28%), Positives = 130/328 (39%), Gaps = 12/328 (3%)
 Frame = -1

Query: 948 ELNNEFSWRSVCSEQLEQKVDIVNENVPNGPAKTTTAEFNCVGIEQLRPVPEELTTNSSC 769
           E  N+  + +  SE  E+K    +  V N   +T      C G EQL+P+ E +   S  
Sbjct: 138 EQTNDSGFGTSSSEPAEEKHPSGSFCVQNELLQTIMPLPICSGSEQLQPISENVNMASLN 197

Query: 768 KQLKSAAEDATMDSNLGQFELLPKDAVEGIPTPKQTTSEQIPEFGSESLHGESVEQKQAI 589
            Q     ED +                E I    Q T +QI EFGS S+  E  +QK  +
Sbjct: 198 DQAGLPPEDVSKTCQS-----------EKISCSHQITLQQINEFGSGSVPSEPAKQKYEL 246

Query: 588 GSEDFQNRQAETRMADSICMDVEQSGLPPEHVTGSSMQPDVEQSGLPPEHATGKSSCEQL 409
            S   QN + +T  A S  +  EQ G   E +T  S    +  S  PPE A    S +++
Sbjct: 247 DSVPAQNDEVKTSKAVSSSIVFEQPGPSIEAMTEDS---PIGHSEPPPEDAIKSLSDKEM 303

Query: 408 RSPPQDVAMKSCLEQLESPEDAAVNSNLERLGHPPDSASQNPCLEQIGPQLPDADKNPCE 229
              P+DV   S L+Q E+P   A+                + CL       P   KNP  
Sbjct: 304 EPLPEDVTQNSSLQQSETPSKNALKI--------------SSCLG------PKDKKNP-- 341

Query: 228 LEHRDKRTSAKSRKGKYAVGPSVGCSRVLRSRSQEKSKAQ-----------EPVNTLAE- 85
                     KSRK KY     V   RVLRSR+ EK K +           E  N++A  
Sbjct: 342 ----------KSRKRKYMSRSFVRSDRVLRSRTGEKEKPKDLKLSNNVATLESSNSIANV 391

Query: 84  QGANEEXXXXXXKRQEERTTINEFARIR 1
               E+       R++ R   +EF+RIR
Sbjct: 392 SNGEEKKRKKRKNRRDNRAIADEFSRIR 419


>ref|XP_011457795.1| PREDICTED: homeobox protein HAT3.1 isoform X2 [Fragaria vesca subsp.
            vesca]
          Length = 1202

 Score = 82.4 bits (202), Expect = 5e-13
 Identities = 93/335 (27%), Positives = 133/335 (39%), Gaps = 16/335 (4%)
 Frame = -1

Query: 957  TMAELNNEFSWRSVCSEQLEQKVDIVNENVPNGPAKTTTAEFNCVGIEQLRPVPEELTTN 778
            T+ ++N+  S  S  S+  E+  ++ +  V + P +T     +  G EQLR V E ++  
Sbjct: 309  TLKQINDVSSGTSY-SQPTEENQNLGSSFVQDEPLQTIIPVVSSGGNEQLRVVNENVSVP 367

Query: 777  SSCKQLKSAAEDATMDSNLGQFELLPKDAVEGIPTPK-----QTTSEQIPEFGSESLHGE 613
            S  +Q                  LLP+   +   T K      T S+QI E GS S+  E
Sbjct: 368  SLGEQAG----------------LLPEAVSKTCQTDKLSRSLHTASDQINESGSGSVQCE 411

Query: 612  SVEQKQAIGSEDFQNRQAETRMADSICMDVEQSGLPPEHVTGSSMQPDVEQSGLPPEHAT 433
              EQ+  +GS   QN Q +   A S  +  EQSG                          
Sbjct: 412  PQEQRDQLGSLPSQNDQVKNSTAVSSSIGFEQSG-------------------------- 445

Query: 432  GKSSCEQLRSPPQDVAMKSCLEQLESP-EDAAVNSNLERLGHPPDSASQNPCLEQIGPQL 256
                      P  D    S +  LE P EDA+ + N E +    + A+QN CLE      
Sbjct: 446  ----------PSVDEMNNSVIGHLEPPPEDASKDHNKELIKPHTNDATQNSCLEPSETAS 495

Query: 255  PDADKNPCELEHRDKRTSAKSRKGKYAVGPSVGCSRVLRSRSQEKSKAQEPVNTLAE--- 85
             +A KN  +   +DKR S+  RK +      V   RVLRSR+ EK +A E  N +A    
Sbjct: 496  KNASKNSTQFGCKDKRNSSSRRKSR----SLVSSDRVLRSRTSEKPEAPELSNNVATLDT 551

Query: 84   -------QGANEEXXXXXXKRQEERTTINEFARIR 1
                       E       K+  ER   +EF+RIR
Sbjct: 552  SNSVANVSNEKEGKRKKRKKKHRERVAADEFSRIR 586


>ref|XP_004289744.1| PREDICTED: homeobox protein HAT3.1 isoform X1 [Fragaria vesca subsp.
            vesca] gi|764524477|ref|XP_011457794.1| PREDICTED:
            homeobox protein HAT3.1 isoform X1 [Fragaria vesca subsp.
            vesca]
          Length = 1227

 Score = 82.4 bits (202), Expect = 5e-13
 Identities = 93/335 (27%), Positives = 133/335 (39%), Gaps = 16/335 (4%)
 Frame = -1

Query: 957  TMAELNNEFSWRSVCSEQLEQKVDIVNENVPNGPAKTTTAEFNCVGIEQLRPVPEELTTN 778
            T+ ++N+  S  S  S+  E+  ++ +  V + P +T     +  G EQLR V E ++  
Sbjct: 334  TLKQINDVSSGTSY-SQPTEENQNLGSSFVQDEPLQTIIPVVSSGGNEQLRVVNENVSVP 392

Query: 777  SSCKQLKSAAEDATMDSNLGQFELLPKDAVEGIPTPK-----QTTSEQIPEFGSESLHGE 613
            S  +Q                  LLP+   +   T K      T S+QI E GS S+  E
Sbjct: 393  SLGEQAG----------------LLPEAVSKTCQTDKLSRSLHTASDQINESGSGSVQCE 436

Query: 612  SVEQKQAIGSEDFQNRQAETRMADSICMDVEQSGLPPEHVTGSSMQPDVEQSGLPPEHAT 433
              EQ+  +GS   QN Q +   A S  +  EQSG                          
Sbjct: 437  PQEQRDQLGSLPSQNDQVKNSTAVSSSIGFEQSG-------------------------- 470

Query: 432  GKSSCEQLRSPPQDVAMKSCLEQLESP-EDAAVNSNLERLGHPPDSASQNPCLEQIGPQL 256
                      P  D    S +  LE P EDA+ + N E +    + A+QN CLE      
Sbjct: 471  ----------PSVDEMNNSVIGHLEPPPEDASKDHNKELIKPHTNDATQNSCLEPSETAS 520

Query: 255  PDADKNPCELEHRDKRTSAKSRKGKYAVGPSVGCSRVLRSRSQEKSKAQEPVNTLAE--- 85
             +A KN  +   +DKR S+  RK +      V   RVLRSR+ EK +A E  N +A    
Sbjct: 521  KNASKNSTQFGCKDKRNSSSRRKSR----SLVSSDRVLRSRTSEKPEAPELSNNVATLDT 576

Query: 84   -------QGANEEXXXXXXKRQEERTTINEFARIR 1
                       E       K+  ER   +EF+RIR
Sbjct: 577  SNSVANVSNEKEGKRKKRKKKHRERVAADEFSRIR 611


>ref|XP_007042568.1| Homeodomain-like protein with RING/FYVE/PHD-type zinc finger
           domain, putative isoform 1 [Theobroma cacao]
           gi|590687101|ref|XP_007042569.1| Homeodomain-like
           protein with RING/FYVE/PHD-type zinc finger domain,
           putative isoform 1 [Theobroma cacao]
           gi|508706503|gb|EOX98399.1| Homeodomain-like protein
           with RING/FYVE/PHD-type zinc finger domain, putative
           isoform 1 [Theobroma cacao] gi|508706504|gb|EOX98400.1|
           Homeodomain-like protein with RING/FYVE/PHD-type zinc
           finger domain, putative isoform 1 [Theobroma cacao]
          Length = 950

 Score = 81.6 bits (200), Expect = 8e-13
 Identities = 86/292 (29%), Positives = 127/292 (43%), Gaps = 29/292 (9%)
 Frame = -1

Query: 789 LTTNSSCKQLKSAAEDATMDSNLGQFELLPKDAVEGIPT-----PKQTTSEQIPEFGSES 625
           L  +   K L++ +E  + ++      LLP+D+ +   T     P+  +SE    FGS +
Sbjct: 150 LVCDLPAKNLQTFSEGLSENAITESLGLLPEDSSKHTKTDKLSCPQLVSSEPTVNFGSGN 209

Query: 624 LH---GESVEQKQAIGSEDFQNRQAETRMADS-------ICMDVEQSGL---------PP 502
           +    GES EQ+Q + SE   N   E+ +A S       + +  E  G          PP
Sbjct: 210 VCKELGESPEQRQQLDSESLPNGIEESTIAVSSNVSNQALQLKPEDMGKSHCGGHLHSPP 269

Query: 501 EHVTG---SSMQPDVEQSGLPPEHATGKSSCEQLRSPPQDVAMKSCLEQLES-PEDAAVN 334
           E VT    SS  P VE  GLP E A G  S +Q   P +D+A  S +EQ E+ P++   N
Sbjct: 270 EGVTNVIQSSKSPLVEPLGLPQEFAQGNPSTQQSGLPCEDMAQNSGVEQHETKPKNLLEN 329

Query: 333 SNLERLGHPPDSASQNPCLEQIGPQLPDADKNPCELEHRDKRTSAKSRKGKYAVGPSVGC 154
           S   R G                                    ++K+ K KY +      
Sbjct: 330 SGRRRNGK-----------------------------------TSKTIKKKYMLRSLRSS 354

Query: 153 SRVLRSRSQEKSKAQEPVNTLAEQGANEEXXXXXXKRQE-ERTTINEFARIR 1
            RVLRS+ QEK KA E  N LA+ G++E+      +R++  R   +EF+RIR
Sbjct: 355 DRVLRSKLQEKPKATESSNNLADVGSSEQQKRRKRRRRKANREVADEFSRIR 406


>ref|XP_007200058.1| hypothetical protein PRUPE_ppa023106mg [Prunus persica]
           gi|462395458|gb|EMJ01257.1| hypothetical protein
           PRUPE_ppa023106mg [Prunus persica]
          Length = 1058

 Score = 80.9 bits (198), Expect = 1e-12
 Identities = 91/328 (27%), Positives = 130/328 (39%), Gaps = 12/328 (3%)
 Frame = -1

Query: 948 ELNNEFSWRSVCSEQLEQKVDIVNENVPNGPAKTTTAEFNCVGIEQLRPVPEELTTNSSC 769
           E  N+  + +  SE  E++    +  V N   +T      C G EQ++P+ E +   S  
Sbjct: 138 EQTNDSGFGTSSSEPAEERHPSGSFCVQNELLQTIMPLPICGGSEQVQPISENVNMASLN 197

Query: 768 KQLKSAAEDATMDSNLGQFELLPKDAVEGIPTPKQTTSEQIPEFGSESLHGESVEQKQAI 589
            Q     ED +                + I  P Q TS QI EFGS S+  E  +QK  +
Sbjct: 198 DQAGLPPEDVSKTCQ-----------TQKISCPHQITSHQINEFGSGSVPSEPAKQKDQL 246

Query: 588 GSEDFQNRQAETRMADSICMDVEQSGLPPEHVTGSSMQPDVEQSGLPPEHATGKSSCEQL 409
            S   QN +A+T  A S     EQ G   E +T  S    +  S  P E  +   S +++
Sbjct: 247 DSVPAQNDEAKTSKAVSSSTVFEQPGPSIEAMTEDS---PIGHSEPPLEDLSKSLSDKEM 303

Query: 408 RSPPQDVAMKSCLEQLESPEDAAVNSNLERLGHPPDSASQNPCLEQIGPQLPDADKNPCE 229
              P+DV   S L+QLE+    A+  +               CL       P   KNP  
Sbjct: 304 EPLPEDVTQNSSLQQLETASKNALKIS--------------SCLG------PKDKKNP-- 341

Query: 228 LEHRDKRTSAKSRKGKYAVGPSVGCSRVLRSRSQEKSKAQ-----------EPVNTLAE- 85
                     KSRK KY     V   RVLRS++ EK K +           E  N++A  
Sbjct: 342 ----------KSRKRKYMSRSFVRSDRVLRSKTGEKEKPKDLKLSNNVATLESSNSIANV 391

Query: 84  QGANEEXXXXXXKRQEERTTINEFARIR 1
               E+       R++ R   +EF+RIR
Sbjct: 392 SNGEEKKRKKRKNRRDNRAIADEFSRIR 419


>gb|KHG23766.1| Homeobox-leucine zipper protein HAT3 [Gossypium arboreum]
          Length = 928

 Score = 76.6 bits (187), Expect = 3e-11
 Identities = 82/282 (29%), Positives = 128/282 (45%), Gaps = 10/282 (3%)
 Frame = -1

Query: 816 EQLRPVPEELTTNSSCKQLKSAAEDATMDSNLGQFELLPKDAVEGIPTPKQTTSEQIPEF 637
           E LRP  EE + N+  ++L    ED++  + + Q           +  P+  + E    F
Sbjct: 156 EHLRPCSEEFSKNTLTERLGVLPEDSSKCTQIDQ-----------LSCPQLGSVEPTAAF 204

Query: 636 GSESLH---GESVEQKQAIGSEDFQNRQAETRMADSICMDVEQSGLPPEHVT----GSSM 478
           GS +     GE  EQ+Q +GSE   N   ++  A S     +   L PE +     G S+
Sbjct: 205 GSINTSKELGEPTEQQQQLGSESLSNGIVKSPTATSHNAFYQALELNPEVMNQSNCGQSL 264

Query: 477 QPDVEQSGLPPEHATGKSSC-EQLRSPPQDVAMKSCLEQLE-SPEDAAVNSNLERLGHPP 304
           Q   E + +  +   GKS   E L  PP   +  SC++Q +   ED A +S +E+    P
Sbjct: 265 QSPSEGASIVSQ--IGKSYLVEPLGLPPGFESGNSCVQQPKLHSEDMAQSSGVEQHEATP 322

Query: 303 DSASQNPCLEQIGPQLPDADKNPCELEHRDKRTSAKSRKGKYAVGPSVGCSRVLRSRSQE 124
            +  +N                   ++ RD  +S K+RK KY   P     RVLRS+SQE
Sbjct: 323 KNFLEN------------------SVQGRDGESS-KTRK-KYTPRPLSSSDRVLRSKSQE 362

Query: 123 KSKAQEPVNTLAEQGANEEXXXXXXKRQ-EERTTINEFARIR 1
           KSKA E  N + + G++E+       +  E+R   +E++RIR
Sbjct: 363 KSKASELSNNITDIGSSEQQKGKNRNKMIEKREVSDEYSRIR 404


>gb|KJB32306.1| hypothetical protein B456_005G234300 [Gossypium raimondii]
          Length = 940

 Score = 76.3 bits (186), Expect = 3e-11
 Identities = 82/280 (29%), Positives = 126/280 (45%), Gaps = 8/280 (2%)
 Frame = -1

Query: 816 EQLRPVPEELTTNSSCKQLKSAAEDATMDSNLGQFELLPKDAVEGIPTPKQTTSEQIPEF 637
           E LRP  ++ + N+  + L+   ED++  + + Q   L   +VE  PT    +     E 
Sbjct: 168 EHLRPCSKDFSKNARTESLRVLPEDSSKCTQIDQLSCLQLGSVE--PTAAFGSINTCKEL 225

Query: 636 GSESLHGESVEQKQAIGSEDFQNRQAETRMADSICMDVEQSGLPPEHVTGSSMQPDVEQS 457
           G      E  EQ+Q +GSE   N   ++  A S     +   L PE +  S+     E+ 
Sbjct: 226 G------EPTEQQQQLGSESLSNGIGKSPTATSSNAFYQALELYPEVMNQSNCG---ERF 276

Query: 456 GLPPEHA-----TGKSSC-EQLRSPPQDVAMKSCLEQLE-SPEDAAVNSNLERLGHPPDS 298
             P E A     TGKS   + L  PP   +  SC++Q     ED A +S +E+    P +
Sbjct: 277 QSPSEGASIVSQTGKSYLVDPLGLPPGFESGNSCVQQPRLHSEDMARSSGVEQHEATPKN 336

Query: 297 ASQNPCLEQIGPQLPDADKNPCELEHRDKRTSAKSRKGKYAVGPSVGCSRVLRSRSQEKS 118
             +N                   ++ RD  +S K+RK KY   P     RVLRS+SQEKS
Sbjct: 337 LLEN------------------SVQGRDGESS-KTRK-KYTPRPLTSSDRVLRSKSQEKS 376

Query: 117 KAQEPVNTLAEQGANEEXXXXXXKRQ-EERTTINEFARIR 1
           KA E  N + + G++E+       +  E+R   +E++RIR
Sbjct: 377 KASELSNNITDIGSSEQQKGRNRNKMIEKREVSDEYSRIR 416


>ref|XP_012480183.1| PREDICTED: homeobox protein HAT3.1-like isoform X1 [Gossypium
           raimondii] gi|763765051|gb|KJB32305.1| hypothetical
           protein B456_005G234300 [Gossypium raimondii]
          Length = 928

 Score = 76.3 bits (186), Expect = 3e-11
 Identities = 82/280 (29%), Positives = 126/280 (45%), Gaps = 8/280 (2%)
 Frame = -1

Query: 816 EQLRPVPEELTTNSSCKQLKSAAEDATMDSNLGQFELLPKDAVEGIPTPKQTTSEQIPEF 637
           E LRP  ++ + N+  + L+   ED++  + + Q   L   +VE  PT    +     E 
Sbjct: 156 EHLRPCSKDFSKNARTESLRVLPEDSSKCTQIDQLSCLQLGSVE--PTAAFGSINTCKEL 213

Query: 636 GSESLHGESVEQKQAIGSEDFQNRQAETRMADSICMDVEQSGLPPEHVTGSSMQPDVEQS 457
           G      E  EQ+Q +GSE   N   ++  A S     +   L PE +  S+     E+ 
Sbjct: 214 G------EPTEQQQQLGSESLSNGIGKSPTATSSNAFYQALELYPEVMNQSNCG---ERF 264

Query: 456 GLPPEHA-----TGKSSC-EQLRSPPQDVAMKSCLEQLE-SPEDAAVNSNLERLGHPPDS 298
             P E A     TGKS   + L  PP   +  SC++Q     ED A +S +E+    P +
Sbjct: 265 QSPSEGASIVSQTGKSYLVDPLGLPPGFESGNSCVQQPRLHSEDMARSSGVEQHEATPKN 324

Query: 297 ASQNPCLEQIGPQLPDADKNPCELEHRDKRTSAKSRKGKYAVGPSVGCSRVLRSRSQEKS 118
             +N                   ++ RD  +S K+RK KY   P     RVLRS+SQEKS
Sbjct: 325 LLEN------------------SVQGRDGESS-KTRK-KYTPRPLTSSDRVLRSKSQEKS 364

Query: 117 KAQEPVNTLAEQGANEEXXXXXXKRQ-EERTTINEFARIR 1
           KA E  N + + G++E+       +  E+R   +E++RIR
Sbjct: 365 KASELSNNITDIGSSEQQKGRNRNKMIEKREVSDEYSRIR 404


>ref|XP_012480184.1| PREDICTED: homeobox protein HAT3.1-like isoform X2 [Gossypium
           raimondii] gi|763765049|gb|KJB32303.1| hypothetical
           protein B456_005G234300 [Gossypium raimondii]
           gi|763765050|gb|KJB32304.1| hypothetical protein
           B456_005G234300 [Gossypium raimondii]
          Length = 814

 Score = 76.3 bits (186), Expect = 3e-11
 Identities = 82/280 (29%), Positives = 126/280 (45%), Gaps = 8/280 (2%)
 Frame = -1

Query: 816 EQLRPVPEELTTNSSCKQLKSAAEDATMDSNLGQFELLPKDAVEGIPTPKQTTSEQIPEF 637
           E LRP  ++ + N+  + L+   ED++  + + Q   L   +VE  PT    +     E 
Sbjct: 42  EHLRPCSKDFSKNARTESLRVLPEDSSKCTQIDQLSCLQLGSVE--PTAAFGSINTCKEL 99

Query: 636 GSESLHGESVEQKQAIGSEDFQNRQAETRMADSICMDVEQSGLPPEHVTGSSMQPDVEQS 457
           G      E  EQ+Q +GSE   N   ++  A S     +   L PE +  S+     E+ 
Sbjct: 100 G------EPTEQQQQLGSESLSNGIGKSPTATSSNAFYQALELYPEVMNQSNCG---ERF 150

Query: 456 GLPPEHA-----TGKSSC-EQLRSPPQDVAMKSCLEQLE-SPEDAAVNSNLERLGHPPDS 298
             P E A     TGKS   + L  PP   +  SC++Q     ED A +S +E+    P +
Sbjct: 151 QSPSEGASIVSQTGKSYLVDPLGLPPGFESGNSCVQQPRLHSEDMARSSGVEQHEATPKN 210

Query: 297 ASQNPCLEQIGPQLPDADKNPCELEHRDKRTSAKSRKGKYAVGPSVGCSRVLRSRSQEKS 118
             +N                   ++ RD  +S K+RK KY   P     RVLRS+SQEKS
Sbjct: 211 LLEN------------------SVQGRDGESS-KTRK-KYTPRPLTSSDRVLRSKSQEKS 250

Query: 117 KAQEPVNTLAEQGANEEXXXXXXKRQ-EERTTINEFARIR 1
           KA E  N + + G++E+       +  E+R   +E++RIR
Sbjct: 251 KASELSNNITDIGSSEQQKGRNRNKMIEKREVSDEYSRIR 290


>gb|KJB32302.1| hypothetical protein B456_005G234300 [Gossypium raimondii]
          Length = 869

 Score = 76.3 bits (186), Expect = 3e-11
 Identities = 82/280 (29%), Positives = 126/280 (45%), Gaps = 8/280 (2%)
 Frame = -1

Query: 816 EQLRPVPEELTTNSSCKQLKSAAEDATMDSNLGQFELLPKDAVEGIPTPKQTTSEQIPEF 637
           E LRP  ++ + N+  + L+   ED++  + + Q   L   +VE  PT    +     E 
Sbjct: 156 EHLRPCSKDFSKNARTESLRVLPEDSSKCTQIDQLSCLQLGSVE--PTAAFGSINTCKEL 213

Query: 636 GSESLHGESVEQKQAIGSEDFQNRQAETRMADSICMDVEQSGLPPEHVTGSSMQPDVEQS 457
           G      E  EQ+Q +GSE   N   ++  A S     +   L PE +  S+     E+ 
Sbjct: 214 G------EPTEQQQQLGSESLSNGIGKSPTATSSNAFYQALELYPEVMNQSNCG---ERF 264

Query: 456 GLPPEHA-----TGKSSC-EQLRSPPQDVAMKSCLEQLE-SPEDAAVNSNLERLGHPPDS 298
             P E A     TGKS   + L  PP   +  SC++Q     ED A +S +E+    P +
Sbjct: 265 QSPSEGASIVSQTGKSYLVDPLGLPPGFESGNSCVQQPRLHSEDMARSSGVEQHEATPKN 324

Query: 297 ASQNPCLEQIGPQLPDADKNPCELEHRDKRTSAKSRKGKYAVGPSVGCSRVLRSRSQEKS 118
             +N                   ++ RD  +S K+RK KY   P     RVLRS+SQEKS
Sbjct: 325 LLEN------------------SVQGRDGESS-KTRK-KYTPRPLTSSDRVLRSKSQEKS 364

Query: 117 KAQEPVNTLAEQGANEEXXXXXXKRQ-EERTTINEFARIR 1
           KA E  N + + G++E+       +  E+R   +E++RIR
Sbjct: 365 KASELSNNITDIGSSEQQKGRNRNKMIEKREVSDEYSRIR 404


>gb|KJB32301.1| hypothetical protein B456_005G234300 [Gossypium raimondii]
          Length = 906

 Score = 76.3 bits (186), Expect = 3e-11
 Identities = 82/280 (29%), Positives = 126/280 (45%), Gaps = 8/280 (2%)
 Frame = -1

Query: 816 EQLRPVPEELTTNSSCKQLKSAAEDATMDSNLGQFELLPKDAVEGIPTPKQTTSEQIPEF 637
           E LRP  ++ + N+  + L+   ED++  + + Q   L   +VE  PT    +     E 
Sbjct: 156 EHLRPCSKDFSKNARTESLRVLPEDSSKCTQIDQLSCLQLGSVE--PTAAFGSINTCKEL 213

Query: 636 GSESLHGESVEQKQAIGSEDFQNRQAETRMADSICMDVEQSGLPPEHVTGSSMQPDVEQS 457
           G      E  EQ+Q +GSE   N   ++  A S     +   L PE +  S+     E+ 
Sbjct: 214 G------EPTEQQQQLGSESLSNGIGKSPTATSSNAFYQALELYPEVMNQSNCG---ERF 264

Query: 456 GLPPEHA-----TGKSSC-EQLRSPPQDVAMKSCLEQLE-SPEDAAVNSNLERLGHPPDS 298
             P E A     TGKS   + L  PP   +  SC++Q     ED A +S +E+    P +
Sbjct: 265 QSPSEGASIVSQTGKSYLVDPLGLPPGFESGNSCVQQPRLHSEDMARSSGVEQHEATPKN 324

Query: 297 ASQNPCLEQIGPQLPDADKNPCELEHRDKRTSAKSRKGKYAVGPSVGCSRVLRSRSQEKS 118
             +N                   ++ RD  +S K+RK KY   P     RVLRS+SQEKS
Sbjct: 325 LLEN------------------SVQGRDGESS-KTRK-KYTPRPLTSSDRVLRSKSQEKS 364

Query: 117 KAQEPVNTLAEQGANEEXXXXXXKRQ-EERTTINEFARIR 1
           KA E  N + + G++E+       +  E+R   +E++RIR
Sbjct: 365 KASELSNNITDIGSSEQQKGRNRNKMIEKREVSDEYSRIR 404


>ref|XP_011088190.1| PREDICTED: pathogenesis-related homeodomain protein isoform X2
           [Sesamum indicum]
          Length = 715

 Score = 76.3 bits (186), Expect = 3e-11
 Identities = 72/249 (28%), Positives = 112/249 (44%), Gaps = 1/249 (0%)
 Frame = -1

Query: 744 DATMDSNLGQFELLPKDAVEGIPTPKQTTSEQIPEFGSESLHGESVEQKQAIGSEDFQNR 565
           ++ M   L   E L +D   G  TP     +      SE+L  E++E+K+  GS++F+  
Sbjct: 12  ESNMIEPLETSENLAQDPKSGPLTPANYKMD------SETLVTETMEKKEVTGSQNFRKN 65

Query: 564 QAETRMADSICMDVEQSGLPPEHVTGSSMQPDVEQSGLPPEHATGKSSCEQLRSPPQDVA 385
                  + I   ++++G  PE +   S   D E+   P E A   S  + L    Q+  
Sbjct: 66  IGSV---EEISDQIKETGPNPEDI---SQNLDAEKEEPPLESAKTLSVAQNLEVISQNGL 119

Query: 384 MKSCLEQLESPEDAAVNSNLERLGHPPDSASQNPCLEQIGPQLPDADKNPCELEHRDKRT 205
                            +NLE +   P++AS N    ++     D  KN  +L   D+  
Sbjct: 120 -----------------TNLENMCISPEAASANHGCGKLETVHIDETKNSGQLGTEDRGC 162

Query: 204 SAKSRKGKYAVGPSVGCSRVLRSRSQEKSKAQEPVNTLAEQGAN-EEXXXXXXKRQEERT 28
           S +SRK K  +   V  S VLRS+SQEK KA EP   + E  AN E+      K+  ++T
Sbjct: 163 SVQSRKRKAGLKSPVTSSWVLRSKSQEKPKAPEPNENVKEDSANGEKKKRGRKKKPMQKT 222

Query: 27  TINEFARIR 1
           T+NEF+R +
Sbjct: 223 TVNEFSRTK 231


>ref|XP_011088187.1| PREDICTED: pathogenesis-related homeodomain protein isoform X1
           [Sesamum indicum] gi|747081793|ref|XP_011088188.1|
           PREDICTED: pathogenesis-related homeodomain protein
           isoform X1 [Sesamum indicum]
          Length = 835

 Score = 76.3 bits (186), Expect = 3e-11
 Identities = 72/249 (28%), Positives = 112/249 (44%), Gaps = 1/249 (0%)
 Frame = -1

Query: 744 DATMDSNLGQFELLPKDAVEGIPTPKQTTSEQIPEFGSESLHGESVEQKQAIGSEDFQNR 565
           ++ M   L   E L +D   G  TP     +      SE+L  E++E+K+  GS++F+  
Sbjct: 12  ESNMIEPLETSENLAQDPKSGPLTPANYKMD------SETLVTETMEKKEVTGSQNFRKN 65

Query: 564 QAETRMADSICMDVEQSGLPPEHVTGSSMQPDVEQSGLPPEHATGKSSCEQLRSPPQDVA 385
                  + I   ++++G  PE +   S   D E+   P E A   S  + L    Q+  
Sbjct: 66  IGSV---EEISDQIKETGPNPEDI---SQNLDAEKEEPPLESAKTLSVAQNLEVISQNGL 119

Query: 384 MKSCLEQLESPEDAAVNSNLERLGHPPDSASQNPCLEQIGPQLPDADKNPCELEHRDKRT 205
                            +NLE +   P++AS N    ++     D  KN  +L   D+  
Sbjct: 120 -----------------TNLENMCISPEAASANHGCGKLETVHIDETKNSGQLGTEDRGC 162

Query: 204 SAKSRKGKYAVGPSVGCSRVLRSRSQEKSKAQEPVNTLAEQGAN-EEXXXXXXKRQEERT 28
           S +SRK K  +   V  S VLRS+SQEK KA EP   + E  AN E+      K+  ++T
Sbjct: 163 SVQSRKRKAGLKSPVTSSWVLRSKSQEKPKAPEPNENVKEDSANGEKKKRGRKKKPMQKT 222

Query: 27  TINEFARIR 1
           T+NEF+R +
Sbjct: 223 TVNEFSRTK 231


>ref|XP_009351161.1| PREDICTED: homeobox protein HAT3.1-like [Pyrus x bretschneideri]
          Length = 1052

 Score = 72.8 bits (177), Expect = 4e-10
 Identities = 90/330 (27%), Positives = 138/330 (41%), Gaps = 15/330 (4%)
 Frame = -1

Query: 948 ELNNEFSWRSVCSEQLEQKVDIVNENVPNGPAKTTTAEFNCVGIEQLRPVPEELTTNSSC 769
           E  N+  + +  SE  E+K    +++V N               +QL+ + +     S+ 
Sbjct: 174 EQKNDCGFGTSSSELAEEKHPSASDHVQN---------------DQLQAIIQVPICGSN- 217

Query: 768 KQLKSAAEDATMDSNLGQFELLPKDAVEGIPTPK-----QTTSEQIPEFGSESLHGESVE 604
           + L+S++E+    S   Q  L P+D  +   T K     QTT ++I EFG  S+HGE   
Sbjct: 218 EHLQSSSENVNRTSLNKQAGLPPEDLSKTCQTDKVSCCNQTTLQEINEFGCGSVHGEPET 277

Query: 603 QKQAIGSEDFQNRQAETRMADSICMDVEQSGLPPEHVTGSSMQPDVEQSGLPPEHATGKS 424
           QK  + S    N +  T  A    +  EQS    E +T  S    +E   LP E A+   
Sbjct: 278 QKYQLDSVPAHNNEVTTTEAAPSSIVFEQSRPCIEAMTRDSPTGHLE---LPLEDAS--- 331

Query: 423 SCEQLRSPPQDVAMKSCLEQLESPEDAAVNSNLERLGHPPDSASQNPCLEQIGPQLPDAD 244
                +SPP D  M+        P D   NS+LE+   P  +A             P   
Sbjct: 332 -----KSPPIDKEMEPL------PADVTQNSSLEKTEKPSKNA-------------PKEK 367

Query: 243 KNPCELEHRDKRTSAKSRKGKYAVGPSVGCSRVLRSRSQEKS---KAQEPVNTLAEQG-- 79
           +NP            KSRK KY    S+G  RVLRS++ EK+   K    V+TL      
Sbjct: 368 QNP------------KSRKKKYVSKSSIGSDRVLRSKTGEKTKNPKLSNDVSTLESSNSV 415

Query: 78  ---ANEEXXXXXXKRQEERTTI--NEFARI 4
              +N E      +++ +R  +  +EF+R+
Sbjct: 416 ANPSNVEGKRRKKRKKRQRNKVIDDEFSRV 445

