BLASTX nr result

ID: Cinnamomum24_contig00009520 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum24_contig00009520
         (3263 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010271495.1| PREDICTED: homeobox protein HOX1A [Nelumbo n...   406   e-110
ref|XP_008373077.1| PREDICTED: homeobox protein HAT3.1-like isof...   378   e-101
ref|XP_008373078.1| PREDICTED: homeobox protein HAT3.1-like isof...   376   e-101
ref|XP_008373076.1| PREDICTED: homeobox protein HAT3.1-like isof...   375   e-100
ref|XP_011001393.1| PREDICTED: homeobox protein HAT3.1-like [Pop...   374   e-100
ref|XP_002313886.2| hypothetical protein POPTR_0009s09600g [Popu...   374   e-100
ref|XP_009346873.1| PREDICTED: homeobox protein HAT3.1-like, par...   371   2e-99
ref|XP_010099058.1| Homeobox protein [Morus notabilis] gi|587887...   371   2e-99
ref|XP_012093068.1| PREDICTED: homeobox protein HAT3.1 [Jatropha...   369   9e-99
gb|KDP44446.1| hypothetical protein JCGZ_16279 [Jatropha curcas]      369   9e-99
ref|XP_007042568.1| Homeodomain-like protein with RING/FYVE/PHD-...   367   3e-98
ref|XP_009399131.1| PREDICTED: homeobox protein HOX1A isoform X1...   367   3e-98
ref|XP_009351161.1| PREDICTED: homeobox protein HAT3.1-like [Pyr...   367   4e-98
ref|XP_002300247.2| homeobox family protein [Populus trichocarpa...   366   6e-98
ref|XP_008236405.1| PREDICTED: LOW QUALITY PROTEIN: homeobox pro...   362   1e-96
ref|XP_007200058.1| hypothetical protein PRUPE_ppa023106mg [Prun...   361   2e-96
ref|XP_011457795.1| PREDICTED: homeobox protein HAT3.1 isoform X...   344   3e-91
ref|XP_004289744.1| PREDICTED: homeobox protein HAT3.1 isoform X...   344   3e-91
ref|XP_011629041.1| PREDICTED: homeobox protein HOX1A [Amborella...   311   2e-81
gb|ERM96685.1| hypothetical protein AMTR_s00001p00272780 [Ambore...   311   2e-81

>ref|XP_010271495.1| PREDICTED: homeobox protein HOX1A [Nelumbo nucifera]
            gi|719970501|ref|XP_010271502.1| PREDICTED: homeobox
            protein HOX1A [Nelumbo nucifera]
          Length = 789

 Score =  406 bits (1044), Expect = e-110
 Identities = 278/804 (34%), Positives = 374/804 (46%), Gaps = 8/804 (0%)
 Frame = -1

Query: 3050 KSESEQMDVSHPVEEHCHEQNSLDTKDKHTGAEETMRVGSDSIDTDRPKPSPEDSVDFEA 2871
            K   ++  +S P E    +   LD+ +  +  +E M  GSD  DT++ +P  +       
Sbjct: 10   KESDDKQKISSPEESKLGQNVQLDSGNIQSEPKEPMAGGSDVADTEKLEPESK------- 62

Query: 2870 NLRSDGTPQAATNDSGTPSQIEKRVNSKLGGKRYPLRSSLEGTRVLRSRSNGVCXXXXXX 2691
                       T +S   +  + +V+SK   ++Y LRSS   TRVLRS S G        
Sbjct: 63   ---------VVTKNSSRSAYKKNKVSSKSIKRKYMLRSSTSSTRVLRSMSRGTSKPPVPS 113

Query: 2690 XXXXXXXXXAQXXXXXXXXKGQNKVSNDELADIRRRVKYLSARMNYEQSLIDAYSGEGWK 2511
                      +        K  +K  NDE + IR+R++YL  RMNYEQSLIDAYSGEGWK
Sbjct: 114  SNMGNATT--ESGKKRKRKKKVSKTLNDEFSTIRKRIRYLLTRMNYEQSLIDAYSGEGWK 171

Query: 2510 RQSAEKVRPEKELQRASSEILRRKLELRDLFQRLDSICAEGKLQDSLFDSEGQIDSEDIF 2331
              S EK++PEKELQRA++EILR KL +R+LFQ L S+C+ G+LQ+SLFDSEGQI SEDIF
Sbjct: 172  GNSLEKIKPEKELQRATAEILRCKLRIRELFQHLSSLCSVGRLQESLFDSEGQIYSEDIF 231

Query: 2330 CAKCGSKDFSANNDIVLCDGGCDRGFHQMCLVPPLLN--IPPGDDSWLCPGCDCKVDCIG 2157
            CAKCGSKD S +NDI+LCDG CDRGFHQMCL PPL    IPPGD+ WLCPGCDCKVDCI 
Sbjct: 232  CAKCGSKDLSTDNDIILCDGICDRGFHQMCLEPPLSKEEIPPGDEGWLCPGCDCKVDCIE 291

Query: 2156 VLNDIQGTSLSIEDKWEKVFPEAAA--AASGDKQQXXXXXXXXXXXXXXXXXXXXXXXEK 1983
            +LN+++G  LSI D WEK+FPEAAA  AA  ++                         EK
Sbjct: 292  LLNELRGLDLSINDNWEKIFPEAAAAAAAGDNQDGDFGFPSDDSEDYDYDPNGPQVDDEK 351

Query: 1982 IQKXXXXXXXXXXXXXXXXXXXXXXDKQHINPGFXXXXXXXXXXXXDAPELDEKVQSEGL 1803
            +Q                       D  ++  G              A + DE    E  
Sbjct: 352  VQTDDSSSEESDFTSASDDSGPPPNDDLYL--GLPSDDSEDNDYDPTARDPDEHANRE-- 407

Query: 1802 SSDESDFSSDTNDLIPSKYXXXXXXXXXXXXXNHAEHIGGFDGGEPEASQXXXXXXXXXX 1623
             S  SDF+SD+ D                   +  +      G    +            
Sbjct: 408  -SSNSDFTSDSEDFSALSDHNIPMVTDEIPVSSSVDGTKPLTGSSERSKMDRKRKTPIHS 466

Query: 1622 XXXXXRDQSGEVFLPVSGKRLRERLDYKKLNDETYGNIHXXXXXXXDWTDMNTPKKVKRR 1443
                      E  LPVSGKR RE LDYKKL+DETYGN+        DWT  + P K    
Sbjct: 467  ELLSKLQPDEENALPVSGKRHRELLDYKKLHDETYGNLPSDSSDDEDWTATDAPSKGNNC 526

Query: 1442 GESKSTVMASLQNTQTIQDGENTKQSQRKPE--GEVLRKQEKLPDTRTLHNLESEGANHT 1269
               KST ++   N  TI +G  TK  ++  E      ++  ++P+     +   +     
Sbjct: 527  AV-KSTSVSPNGNLPTINNGITTKGERQNLEVTNNTPKETHQIPELGDASHTADKTNEDD 585

Query: 1268 MKPCYMRRKLTTSRPGCFGREVSQRLCESFKVNAYPTQEMLENLSKEIGITFHQVSIWFE 1089
             +PC + +  TT +    G+ V+Q+L E+F+ N YP +   ENL KE+GIT  QVS WFE
Sbjct: 586  QEPCSIEKTSTTPKHRSLGKAVTQKLYEAFRRNRYPDRATKENLVKELGITLRQVSKWFE 645

Query: 1088 NTRRSFCLLETKEESKEENGTPNEDSISTKANLEEPRXXXXXXXXXXXXXKTENNTPSQD 909
            N RRS  L   + E    N        + K    EP+                 +  S  
Sbjct: 646  NARRSLRLSANEAEPTSANKVSALAPENGKVLEPEPKMPSKDDATNDGM-----DRESSK 700

Query: 908  SNHTEMGVKE--PGTETKVVSTNKEASKTEINTPNKDDIPPKTTVNEPQPEEKVVSATKE 735
              HTE    E   G E K      E+S+ ++  PN      K  +++  P E + S    
Sbjct: 701  EGHTEAVAMECLSGKEEKKDLETGESSQQKLAAPNS----RKAHLDDQTPIETLNSGETP 756

Query: 734  ASEAENSTPNKDSISATTNGKEPG 663
                +     K+ ++A   G   G
Sbjct: 757  MEAQKGQATQKNDLNANQQGTLSG 780


>ref|XP_008373077.1| PREDICTED: homeobox protein HAT3.1-like isoform X2 [Malus domestica]
          Length = 1081

 Score =  378 bits (970), Expect = e-101
 Identities = 270/771 (35%), Positives = 374/771 (48%), Gaps = 46/771 (5%)
 Frame = -1

Query: 2867 LRSDGTPQAATNDSGTPSQI--EKRVNSKLGGKRYPLRSSLEGTRVLRSR---------- 2724
            L +D T  ++   +  PS+   + + N K   K+Y  +SS+   RVLRS+          
Sbjct: 343  LPADVTQNSSLEKTEKPSKNAPKDKQNPKSRKKKYVSKSSVGSDRVLRSKTGEKTKNPKL 402

Query: 2723 SNGVCXXXXXXXXXXXXXXXAQXXXXXXXXKGQNKVSNDELADIRRRVKYLSARMNYEQS 2544
            SN V                 +        +  NKV +DE + +R+ ++YL  R++YE+S
Sbjct: 403  SNDVSTLESSNSVANPSNVEGKRRKKRKKRQ-LNKVIDDEFSRVRKHLRYLLNRISYEKS 461

Query: 2543 LIDAYSGEGWKRQSAEKVRPEKELQRASSEILRRKLELRDLFQRLDSICAEGKLQDSLFD 2364
            LIDAYSGEGWK  S EK++PEKELQRA+SEIL+RKL++RDLFQRLDS+C+EG   +SLFD
Sbjct: 462  LIDAYSGEGWKGSSLEKLKPEKELQRATSEILQRKLKIRDLFQRLDSLCSEGMFPESLFD 521

Query: 2363 SEGQIDSEDIFCAKCGSKDFSANNDIVLCDGGCDRGFHQMCLVPPLL--NIPPGDDSWLC 2190
            SEGQIDSEDIFCAKCGSKD S  NDI+LCDG CDRGFHQ CL PPLL  +IPP D+ WLC
Sbjct: 522  SEGQIDSEDIFCAKCGSKDVSLQNDIILCDGACDRGFHQFCLEPPLLSEDIPPDDEGWLC 581

Query: 2189 PGCDCKVDCIGVLNDIQGTSLSIEDKWEKVFPEAAAAASGDKQQXXXXXXXXXXXXXXXX 2010
            PGCDCKVDC  +LND QGT LS+ D WEKVFPEAAAAASG  Q+                
Sbjct: 582  PGCDCKVDCFDLLNDSQGTDLSVADSWEKVFPEAAAAASGHNQEHTHGLPSDDSDDNDYD 641

Query: 2009 XXXXXXXEKIQKXXXXXXXXXXXXXXXXXXXXXXDKQHINPGFXXXXXXXXXXXXDAPEL 1830
                   +++Q                       +      G             DAPE+
Sbjct: 642  PDGPETDDEVQGEESSSDDESKYASASDGLETPKNNDEQYLGLPSDDSEDDDYNPDAPEV 701

Query: 1829 DEKVQSEGLSSDESDFSSDTNDLIPS-KYXXXXXXXXXXXXXNHAEHIGGFDGGEPEASQ 1653
             E+++ E   S  SDF+SD+ DL  S                   +  G   G   ++S+
Sbjct: 702  TEELKKE---SSSSDFTSDSEDLGASLDDNNMFSEDVESPKSMSLDESGPLRGSGKQSSR 758

Query: 1652 XXXXXXXXXXXXXXXRD----QSGEVFLPVSGKRLRERLDYKKLNDETYGNIHXXXXXXX 1485
                            +    Q+G    PVSGKR  ERL+YKKL+DETYGN+        
Sbjct: 759  RGQKKQPLKDELLSLLESGPGQAGAA--PVSGKRHIERLNYKKLHDETYGNVRTDSSDDE 816

Query: 1484 DWTDMNTPKKVKR----------RGESKSTVMASLQNT--QTIQDGENT-KQSQRKPEGE 1344
            +W D   P+K K+           G+S +     + N     + + ENT K++ R+ +  
Sbjct: 817  EWNDTAGPRKRKKVTTQAPTMSPNGDSSNVKNGMITNNIKHDLDENENTPKRTPRRNKNT 876

Query: 1343 VLR--KQEKLPDTRTLHNLESEGANHTMKPCYMRRKLTTSRPGCFGREVSQRLCESFKVN 1170
              R  ++ K+ DT  L N   +G+  +          + S     G   +QRL +SFK N
Sbjct: 877  PKRAHRKSKVEDTSNLSNKSQKGSTQSASTSEQGGS-SRSTYRKLGEAATQRLSKSFKEN 935

Query: 1169 AYPTQEMLENLSKEIGITFHQVSIWFENTRRSFCLLETKEESKEENGTPNEDSISTKANL 990
             YP + M E+L++E+GI   QVS WFEN R  + +  + ++S   NGTP   +   +   
Sbjct: 936  HYPDRSMKESLARELGIMAKQVSKWFENARHFWKV--SVDKSAAGNGTPLPQTNGKQLE- 992

Query: 989  EEPRXXXXXXXXXXXXXKTENNTPSQDSNHTEMGVKE-PGTETKVVST-----------N 846
                               + +TP  DS+ +    KE P T   +  +            
Sbjct: 993  -------------------KGDTPIGDSDQSGAQNKELPRTNDPMTGSCSGDAKDGELVT 1033

Query: 845  KEASKTEINTPNKDDIPPKTTVNEPQPEEKVVSATKEASEAENSTPNKDSI 693
             ++SK +  TPN      K+  ++P PE K    T    E+E   P  D++
Sbjct: 1034 PKSSKRKAITPNNRKRXRKS--DDPDPENK-TPETNRKGESEEIAPLTDAL 1081


>ref|XP_008373078.1| PREDICTED: homeobox protein HAT3.1-like isoform X3 [Malus domestica]
          Length = 1078

 Score =  376 bits (965), Expect = e-101
 Identities = 259/738 (35%), Positives = 361/738 (48%), Gaps = 39/738 (5%)
 Frame = -1

Query: 2867 LRSDGTPQAATNDSGTPSQI--EKRVNSKLGGKRYPLRSSLEGTRVLRSR---------- 2724
            L +D T  ++   +  PS+   + + N K   K+Y  +SS+   RVLRS+          
Sbjct: 343  LPADVTQNSSLEKTEKPSKNAPKDKQNPKSRKKKYVSKSSVGSDRVLRSKTGEKTKNPKL 402

Query: 2723 SNGVCXXXXXXXXXXXXXXXAQXXXXXXXXKGQNKVSNDELADIRRRVKYLSARMNYEQS 2544
            SN V                 +        +  NKV +DE + +R+ ++YL  R++YE+S
Sbjct: 403  SNDVSTLESSNSVANPSNVEGKRRKKRKKRQ-LNKVIDDEFSRVRKHLRYLLNRISYEKS 461

Query: 2543 LIDAYSGEGWKRQSAEKVRPEKELQRASSEILRRKLELRDLFQRLDSICAEGKLQDSLFD 2364
            LIDAYSGEGWK  S EK++PEKELQRA+SEIL+RKL++RDLFQRLDS+C+EG   +SLFD
Sbjct: 462  LIDAYSGEGWKGSSLEKLKPEKELQRATSEILQRKLKIRDLFQRLDSLCSEGMFPESLFD 521

Query: 2363 SEGQIDSEDIFCAKCGSKDFSANNDIVLCDGGCDRGFHQMCLVPPLL--NIPPGDDSWLC 2190
            SEGQIDSEDIFCAKCGSKD S  NDI+LCDG CDRGFHQ CL PPLL  +IPP D+ WLC
Sbjct: 522  SEGQIDSEDIFCAKCGSKDVSLQNDIILCDGACDRGFHQFCLEPPLLSEDIPPDDEGWLC 581

Query: 2189 PGCDCKVDCIGVLNDIQGTSLSIEDKWEKVFPEAAAAASGDKQQXXXXXXXXXXXXXXXX 2010
            PGCDCKVDC  +LND QGT LS+ D WEKVFPEAAAAASG  Q+                
Sbjct: 582  PGCDCKVDCFDLLNDSQGTDLSVADSWEKVFPEAAAAASGHNQEHTHGLPSDDSDDNDYD 641

Query: 2009 XXXXXXXEKIQKXXXXXXXXXXXXXXXXXXXXXXDKQHINPGFXXXXXXXXXXXXDAPEL 1830
                   +++Q                       +      G             DAPE+
Sbjct: 642  PDGPETDDEVQGEESSSDDESKYASASDGLETPKNNDEQYLGLPSDDSEDDDYNPDAPEV 701

Query: 1829 DEKVQSEGLSSDESDFSSDTNDLIPS-KYXXXXXXXXXXXXXNHAEHIGGFDGGEPEASQ 1653
             E+++ E   S  SDF+SD+ DL  S                   +  G   G   ++S+
Sbjct: 702  TEELKKE---SSSSDFTSDSEDLGASLDDNNMFSEDVESPKSMSLDESGPLRGSGKQSSR 758

Query: 1652 XXXXXXXXXXXXXXXRD----QSGEVFLPVSGKRLRERLDYKKLNDETYGNIHXXXXXXX 1485
                            +    Q+G    PVSGKR  ERL+YKKL+DETYGN+        
Sbjct: 759  RGQKKQPLKDELLSLLESGPGQAGAA--PVSGKRHIERLNYKKLHDETYGNVRTDSSDDE 816

Query: 1484 DWTDMNTPKKVKR----------RGESKSTVMASLQNT--QTIQDGENT-KQSQRKPEGE 1344
            +W D   P+K K+           G+S +     + N     + + ENT K++ R+ +  
Sbjct: 817  EWNDTAGPRKRKKVTTQAPTMSPNGDSSNVKNGMITNNIKHDLDENENTPKRTPRRNKNT 876

Query: 1343 VLR--KQEKLPDTRTLHNLESEGANHTMKPCYMRRKLTTSRPGCFGREVSQRLCESFKVN 1170
              R  ++ K+ DT  L N   +G+  +          + S     G   +QRL +SFK N
Sbjct: 877  PKRAHRKSKVEDTSNLSNKSQKGSTQSASTSEQGGS-SRSTYRKLGEAATQRLSKSFKEN 935

Query: 1169 AYPTQEMLENLSKEIGITFHQVSIWFENTRRSFCLLETKEESKEENGTPNEDSISTKANL 990
             YP + M E+L++E+GI   QVS WFEN R  + +  + ++S   NGTP   +   +   
Sbjct: 936  HYPDRSMKESLARELGIMAKQVSKWFENARHFWKV--SVDKSAAGNGTPLPQTNGKQLEK 993

Query: 989  EEPRXXXXXXXXXXXXXKTENNTP-----SQDSNHTEMGVKEPGTETKVVSTNKEASKTE 825
             +                   N P     S D+   E+ V    ++ K ++ N      +
Sbjct: 994  GDTPIGDSDQSGAQNKELPRTNDPMTGSCSGDAKDGEL-VTPKSSKRKAITPNNRKRXRK 1052

Query: 824  INTPNKDDIPPKTTVNEP 771
             + P+ ++  P+T    P
Sbjct: 1053 SDDPDPENKTPETNRKAP 1070


>ref|XP_008373076.1| PREDICTED: homeobox protein HAT3.1-like isoform X1 [Malus domestica]
          Length = 1081

 Score =  375 bits (962), Expect = e-100
 Identities = 247/649 (38%), Positives = 334/649 (51%), Gaps = 34/649 (5%)
 Frame = -1

Query: 2867 LRSDGTPQAATNDSGTPSQI--EKRVNSKLGGKRYPLRSSLEGTRVLRSR---------- 2724
            L +D T  ++   +  PS+   + + N K   K+Y  +SS+   RVLRS+          
Sbjct: 343  LPADVTQNSSLEKTEKPSKNAPKDKQNPKSRKKKYVSKSSVGSDRVLRSKTGEKTKNPKL 402

Query: 2723 SNGVCXXXXXXXXXXXXXXXAQXXXXXXXXKGQNKVSNDELADIRRRVKYLSARMNYEQS 2544
            SN V                 +        +  NKV +DE + +R+ ++YL  R++YE+S
Sbjct: 403  SNDVSTLESSNSVANPSNVEGKRRKKRKKRQ-LNKVIDDEFSRVRKHLRYLLNRISYEKS 461

Query: 2543 LIDAYSGEGWKRQSAEKVRPEKELQRASSEILRRKLELRDLFQRLDSICAEGKLQDSLFD 2364
            LIDAYSGEGWK  S EK++PEKELQRA+SEIL+RKL++RDLFQRLDS+C+EG   +SLFD
Sbjct: 462  LIDAYSGEGWKGSSLEKLKPEKELQRATSEILQRKLKIRDLFQRLDSLCSEGMFPESLFD 521

Query: 2363 SEGQIDSEDIFCAKCGSKDFSANNDIVLCDGGCDRGFHQMCLVPPLL--NIPPGDDSWLC 2190
            SEGQIDSEDIFCAKCGSKD S  NDI+LCDG CDRGFHQ CL PPLL  +IPP D+ WLC
Sbjct: 522  SEGQIDSEDIFCAKCGSKDVSLQNDIILCDGACDRGFHQFCLEPPLLSEDIPPDDEGWLC 581

Query: 2189 PGCDCKVDCIGVLNDIQGTSLSIEDKWEKVFPEAAAAASGDKQQXXXXXXXXXXXXXXXX 2010
            PGCDCKVDC  +LND QGT LS+ D WEKVFPEAAAAASG  Q+                
Sbjct: 582  PGCDCKVDCFDLLNDSQGTDLSVADSWEKVFPEAAAAASGHNQEHTHGLPSDDSDDNDYD 641

Query: 2009 XXXXXXXEKIQKXXXXXXXXXXXXXXXXXXXXXXDKQHINPGFXXXXXXXXXXXXDAPEL 1830
                   +++Q                       +      G             DAPE+
Sbjct: 642  PDGPETDDEVQGEESSSDDESKYASASDGLETPKNNDEQYLGLPSDDSEDDDYNPDAPEV 701

Query: 1829 DEKVQSEGLSSDESDFSSDTNDLIPS-KYXXXXXXXXXXXXXNHAEHIGGFDGGEPEASQ 1653
             E+++ E   S  SDF+SD+ DL  S                   +  G   G   ++S+
Sbjct: 702  TEELKKE---SSSSDFTSDSEDLGASLDDNNMFSEDVESPKSMSLDESGPLRGSGKQSSR 758

Query: 1652 XXXXXXXXXXXXXXXRD----QSGEVFLPVSGKRLRERLDYKKLNDETYGNIHXXXXXXX 1485
                            +    Q+G    PVSGKR  ERL+YKKL+DETYGN+        
Sbjct: 759  RGQKKQPLKDELLSLLESGPGQAGAA--PVSGKRHIERLNYKKLHDETYGNVRTDSSDDE 816

Query: 1484 DWTDMNTPKKVKR----------RGESKSTVMASLQNT--QTIQDGENT-KQSQRKPEGE 1344
            +W D   P+K K+           G+S +     + N     + + ENT K++ R+ +  
Sbjct: 817  EWNDTAGPRKRKKVTTQAPTMSPNGDSSNVKNGMITNNIKHDLDENENTPKRTPRRNKNT 876

Query: 1343 VLR--KQEKLPDTRTLHNLESEGANHTMKPCYMRRKLTTSRPGCFGREVSQRLCESFKVN 1170
              R  ++ K+ DT  L N   +G+  +          + S     G   +QRL +SFK N
Sbjct: 877  PKRAHRKSKVEDTSNLSNKSQKGSTQSASTSEQGGS-SRSTYRKLGEAATQRLSKSFKEN 935

Query: 1169 AYPTQEMLENLSKEIGITFHQVSIWFENTRRSFCLLETKEESKEENGTP 1023
             YP + M E+L++E+GI   QVS WFEN R  + +  + ++S   NGTP
Sbjct: 936  HYPDRSMKESLARELGIMAKQVSKWFENARHFWKV--SVDKSAAGNGTP 982


>ref|XP_011001393.1| PREDICTED: homeobox protein HAT3.1-like [Populus euphratica]
            gi|743794901|ref|XP_011001400.1| PREDICTED: homeobox
            protein HAT3.1-like [Populus euphratica]
            gi|743794905|ref|XP_011001405.1| PREDICTED: homeobox
            protein HAT3.1-like [Populus euphratica]
          Length = 934

 Score =  374 bits (960), Expect = e-100
 Identities = 260/704 (36%), Positives = 344/704 (48%), Gaps = 23/704 (3%)
 Frame = -1

Query: 3068 PKKERRK-----SESEQMDVSHPVEEHCHEQNSLDTKDKHTGAEETMRVG---SDSI--- 2922
            P +ER+K     SE+E   +   +      +NS       T +     VG    DSI   
Sbjct: 218  PSEERQKPGSELSENESTGIDTELYCGIAIKNSEPLTQLVTKSSPIKHVGLLPGDSIIIP 277

Query: 2921 --DTDRPKPSPEDSVDFEANLRSDGTPQAATNDSGTPSQIEKRVNSKLGGKRYPLRSSLE 2748
              +  RP    ED      +L +           G PS    +  S+L  K Y LRS   
Sbjct: 278  ANEQTRPTHDDEDKGPDHEHLETPSRVAIGITRRGRPSG---KSASRLSRKIYMLRSLRS 334

Query: 2747 GTRVLRSRSNGVCXXXXXXXXXXXXXXXAQXXXXXXXXKGQNKVSNDELADIRRRVKYLS 2568
              RVLRSRS                             +    +  DE + IR  ++YL 
Sbjct: 335  SDRVLRSRSQVKPKAPESSNNSGNVNSTGDKKGKRRKKRRGKNIVADEYSKIRAHLRYLL 394

Query: 2567 ARMNYEQSLIDAYSGEGWKRQSAEKVRPEKELQRASSEILRRKLELRDLFQRLDSICAEG 2388
             RM+YEQSLI AYSGEGWK  S EK++PEKELQRA+SEI RRK+++RDLFQ +D +C+EG
Sbjct: 395  NRMSYEQSLITAYSGEGWKGLSLEKLKPEKELQRATSEITRRKVKIRDLFQHIDYLCSEG 454

Query: 2387 KLQDSLFDSEGQIDSEDIFCAKCGSKDFSANNDIVLCDGGCDRGFHQMCLVPPLL--NIP 2214
            +   SLFDSEGQIDSEDIFCAKCGSKD +A+NDI+LCDG CDRGFHQ CL+PPLL  +IP
Sbjct: 455  RFPSSLFDSEGQIDSEDIFCAKCGSKDLNADNDIILCDGACDRGFHQFCLIPPLLREDIP 514

Query: 2213 PGDDSWLCPGCDCKVDCIGVLNDIQGTSLSIEDKWEKVFPEAAAAASGDKQQXXXXXXXX 2034
            P D+ WLCPGCDCKVDCI +LND QGT++SI D WEKVFPEAAA  SG K          
Sbjct: 515  PDDEGWLCPGCDCKVDCIDLLNDSQGTNISISDSWEKVFPEAAATVSGQKLDHNFGPSSD 574

Query: 2033 XXXXXXXXXXXXXXXEKIQKXXXXXXXXXXXXXXXXXXXXXXDKQHINPGFXXXXXXXXX 1854
                           +K Q+                       K+++  G          
Sbjct: 575  DSDDNDYDPDGPDIDKKSQEEESSSDESDFTSASDEFKAPPDGKEYL--GLSSDDSEDDD 632

Query: 1853 XXXDAPELDEKVQSEGLSSDESDFSSDTNDLIPSKYXXXXXXXXXXXXXNHAEHIGGFDG 1674
               DAP L+EK++ E   S  SDF+SD+ DL  S                  E  G  +G
Sbjct: 633  YDPDAPVLEEKLKQE---SSSSDFTSDSEDL--SATINSDGLPLEDECHMPIETRGVSNG 687

Query: 1673 GEPEASQXXXXXXXXXXXXXXXRDQSGEVFLPVSGKRLRERLDYKKLNDETYGNIHXXXX 1494
             + +                   D   +    VSGKR  +RLDYKKL DETYGNI     
Sbjct: 688  RKSKFDGKKMQSLNSELLSMLEPDLCRDESATVSGKRNVDRLDYKKLYDETYGNI--STS 745

Query: 1493 XXXDWTDMNTPKKVKRRGESKSTVMASLQNTQTIQDGENTKQSQRKPEGEVLRKQEKLPD 1314
               D+TD   P+K ++     +TV A+  +    ++G N+K   ++     L++ ++ P+
Sbjct: 746  SDDDYTDTVGPRKRRKNAGDVATVTAN-GDASVTENGMNSKNMNQE-----LKENKRNPE 799

Query: 1313 TRTLHNLESEGANHTMKPCYMRRKLTTS-----RPGCF---GREVSQRLCESFKVNAYPT 1158
              T HN   +  N +    Y+   L+ S     RP  +   G  V+QRL   FK N YP 
Sbjct: 800  RGTCHNSSFQETNVSPAKSYVGASLSGSSGKSVRPSAYKKLGEAVTQRLYSYFKENQYPD 859

Query: 1157 QEMLENLSKEIGITFHQVSIWFENTRRSFCLLETKEESKEENGT 1026
            +    +L++E+GITF QV+ WF N R SF    +   SK E+ +
Sbjct: 860  RAAKASLAEELGITFEQVNKWFVNARWSFNHSSSTGASKAESAS 903


>ref|XP_002313886.2| hypothetical protein POPTR_0009s09600g [Populus trichocarpa]
            gi|550331388|gb|EEE87841.2| hypothetical protein
            POPTR_0009s09600g [Populus trichocarpa]
          Length = 934

 Score =  374 bits (959), Expect = e-100
 Identities = 257/697 (36%), Positives = 342/697 (49%), Gaps = 16/697 (2%)
 Frame = -1

Query: 3068 PKKERRKSESEQMDV---SHPVEEHCHEQNSLDTKD---KHTGAEETMRVGSDSIDTDRP 2907
            P  E  ++ES  +D    S    E+      L TK    KH G      +   + +  RP
Sbjct: 225  PGSELSENESTGIDTELYSGIAIENSEPLTQLVTKRSPIKHVGLLPGDSIIIPANEQTRP 284

Query: 2906 KPSPEDSVDFEANLRSDGTPQAATNDSGTPSQIEKRVNSKLGGKRYPLRSSLEGTRVLRS 2727
                ED      +L +           G P     +  S+L  K Y LRS     RVLRS
Sbjct: 285  THDDEDKGPDHEHLETPSRVAIGITRRGRP---RGKSASRLSRKIYMLRSLRSSDRVLRS 341

Query: 2726 RSNGVCXXXXXXXXXXXXXXXAQXXXXXXXXKGQNKVSNDELADIRRRVKYLSARMNYEQ 2547
            RS                             +    +  DE + IR  ++YL  RM+YEQ
Sbjct: 342  RSQEKPKAPESSNNSGNVNSTGDKKGKRRKKRRGKNIVADEYSKIRAHLRYLLNRMSYEQ 401

Query: 2546 SLIDAYSGEGWKRQSAEKVRPEKELQRASSEILRRKLELRDLFQRLDSICAEGKLQDSLF 2367
            SLI AYSGEGWK  S EK++PEKELQRA+SEI RRK+++RDLFQ +DS+C+EG+   SLF
Sbjct: 402  SLITAYSGEGWKGLSLEKLKPEKELQRATSEITRRKVKIRDLFQHIDSLCSEGRFPSSLF 461

Query: 2366 DSEGQIDSEDIFCAKCGSKDFSANNDIVLCDGGCDRGFHQMCLVPPLL--NIPPGDDSWL 2193
            DSEGQIDSEDIFCAKCGSKD +A+NDI+LCDG CDRGFHQ CL+PPLL  +IPP D+ WL
Sbjct: 462  DSEGQIDSEDIFCAKCGSKDLNADNDIILCDGACDRGFHQFCLIPPLLREDIPPDDEGWL 521

Query: 2192 CPGCDCKVDCIGVLNDIQGTSLSIEDKWEKVFPEAAAAASGDKQQXXXXXXXXXXXXXXX 2013
            CPGCDCKVDCIG+LND QGT++SI D WEKVFPEAAA ASG K                 
Sbjct: 522  CPGCDCKVDCIGLLNDSQGTNISISDSWEKVFPEAAATASGQKLDHNFGPSSDDSDDNDY 581

Query: 2012 XXXXXXXXEKIQKXXXXXXXXXXXXXXXXXXXXXXDKQHINPGFXXXXXXXXXXXXDAPE 1833
                    +K Q+                       K+++  G             DAP 
Sbjct: 582  EPDGPDIDKKSQEEESSSDESDFTSASDEFKAPPDGKEYL--GLSSDDSEDDDYDPDAPV 639

Query: 1832 LDEKVQSEGLSSDESDFSSDTNDLIPSKYXXXXXXXXXXXXXNHAEHIGGFDGGEPEASQ 1653
            L+EK++ E   S  SDF+SD+ DL  +                  E  G  +G + +   
Sbjct: 640  LEEKLKQE---SSSSDFTSDSEDLAAT--INGDGLSLEDECHMPIEPRGVSNGRKSKFDG 694

Query: 1652 XXXXXXXXXXXXXXXRDQSGEVFLPVSGKRLRERLDYKKLNDETYGNIHXXXXXXXDWTD 1473
                            D   +    VSGKR  +RLDYKKL DETYGNI        D+TD
Sbjct: 695  KKMQSLNSELLSMLEPDLCQDESATVSGKRNVDRLDYKKLYDETYGNI--STSSDDDYTD 752

Query: 1472 MNTPKKVKRRGESKSTVMASLQNTQTIQDGENTKQSQRKPEGEVLRKQEKLPDTRTLHNL 1293
               P+K ++     +TV A+  +    ++G N+K   ++     L++ ++ P+  T  N 
Sbjct: 753  TVGPRKRRKNTGDVATVTAN-GDASVTENGMNSKNMNQE-----LKENKRNPERGTCQNS 806

Query: 1292 ESEGANHTMKPCYMRRKLTTS-----RPGCF---GREVSQRLCESFKVNAYPTQEMLENL 1137
              +  N +    Y+   L+ S     RP  +   G  V+QRL   F+ N YP +    +L
Sbjct: 807  SFQETNVSPAKSYVGASLSGSSGKSVRPSAYKKLGEAVTQRLYSYFRENQYPDRAAKASL 866

Query: 1136 SKEIGITFHQVSIWFENTRRSFCLLETKEESKEENGT 1026
            ++E+GITF QV+ WF N R SF    +   SK E+ +
Sbjct: 867  AEELGITFEQVNKWFVNARWSFNHSSSTGTSKAESAS 903


>ref|XP_009346873.1| PREDICTED: homeobox protein HAT3.1-like, partial [Pyrus x
            bretschneideri]
          Length = 695

 Score =  371 bits (953), Expect = 2e-99
 Identities = 253/706 (35%), Positives = 344/706 (48%), Gaps = 36/706 (5%)
 Frame = -1

Query: 2795 NSKLGGKRYPLRSSLEGTRVLRSR----------SNGVCXXXXXXXXXXXXXXXAQXXXX 2646
            N K   K+Y  +SS+   RVLRS+          SN V                 +    
Sbjct: 3    NPKSRKKKYVSKSSIGSDRVLRSKTGEKTKNPKLSNDVSTLESSNSVANPSNVEGKRRKK 62

Query: 2645 XXXXKGQNKVSNDELADIRRRVKYLSARMNYEQSLIDAYSGEGWKRQSAEKVRPEKELQR 2466
                + +NKV +DE + +R+ ++YL   ++YE+SLIDAYSGEGWK  S EK++PEKELQR
Sbjct: 63   RKKRQ-RNKVIDDEFSRVRKHLRYLLNXISYEKSLIDAYSGEGWKGSSLEKLKPEKELQR 121

Query: 2465 ASSEILRRKLELRDLFQRLDSICAEGKLQDSLFDSEGQIDSEDIFCAKCGSKDFSANNDI 2286
            A+SEILRRKL++RDLFQRLDS+C+EG   +SLFDSEGQIDSEDIFCAKCGSKD S  NDI
Sbjct: 122  ATSEILRRKLKIRDLFQRLDSLCSEGMFPESLFDSEGQIDSEDIFCAKCGSKDVSLQNDI 181

Query: 2285 VLCDGGCDRGFHQMCLVPPLL--NIPPGDDSWLCPGCDCKVDCIGVLNDIQGTSLSIEDK 2112
            +LCDG CDRGFHQ+CL P LL  +IPP D+ WLCPGCDCKVDC  +LN+ QGT LS+ D 
Sbjct: 182  ILCDGACDRGFHQLCLEPSLLSEDIPPDDEGWLCPGCDCKVDCFDLLNESQGTDLSVADS 241

Query: 2111 WEKVFPEAAAAASGDKQQXXXXXXXXXXXXXXXXXXXXXXXEKIQKXXXXXXXXXXXXXX 1932
            WEKVFPEAAAAASG  Q+                       +++Q               
Sbjct: 242  WEKVFPEAAAAASGHNQEHTHGLPSDDSDDNDYDPDGSETDDEVQGEESSSDDESKYASA 301

Query: 1931 XXXXXXXXDKQHINPGFXXXXXXXXXXXXDAPELDEKVQSEGLSSDESDFSSDTNDLIPS 1752
                    +      G             DAPE+ ++++ E   S  SDF+SD+ DL  S
Sbjct: 302  SDGLETPKNNDEQYFGLPSDDSEDDDYNPDAPEVTDELKKE---SSSSDFTSDSEDLGAS 358

Query: 1751 -KYXXXXXXXXXXXXXNHAEHIGGFDGGEPEASQXXXXXXXXXXXXXXXRD----QSGEV 1587
                               +  G   G   ++S+                +    Q G  
Sbjct: 359  LDDNNMSAEDVESPKSMSLDESGPLRGSGKQSSRHGQKKQPLKDELLSLLESGPGQGGAA 418

Query: 1586 FLPVSGKRLRERLDYKKLNDETYGNIHXXXXXXXDWTDMNTPKKVKR----------RGE 1437
              PVSGKR  ERL+YKKL+DETYGN+        +W D   P+K K+           G+
Sbjct: 419  --PVSGKRHIERLNYKKLHDETYGNVRTDSSDDEEWNDTAGPRKRKKVTTQAPMMSPNGD 476

Query: 1436 SKSTVMASLQNTQTIQDGENTKQSQRKPEG-----EVLRKQEKLPDTRTLHNLESEGANH 1272
            S +     + N       EN    +R P G     +   ++ K+ DT  L N   +G+  
Sbjct: 477  SSNVKNVMITNNIKHDLDENENTPKRTPRGSKNTPKRAHRKSKVEDTSNLSNKSQKGSTQ 536

Query: 1271 TMKPCYMRRKLTTSRPGCFGREVSQRLCESFKVNAYPTQEMLENLSKEIGITFHQVSIWF 1092
            +      +   + S     G   +QRL +SFK N YP + M E+L++E+GI   QVS WF
Sbjct: 537  SASTS-EKGGSSRSTYRKLGEAATQRLSKSFKENHYPDRSMKESLAQELGIMAKQVSKWF 595

Query: 1091 ENTRRSFCLLETKEESKEENGTPNEDSISTKANLEEPRXXXXXXXXXXXXXKTENNTPSQ 912
            EN R   C   + ++S   NGTP   +   +                      + +TP  
Sbjct: 596  ENARH--CWKVSLDKSAAGNGTPLPQTNGKQLE--------------------QGDTPIG 633

Query: 911  DSNHTEMGVKE-PGTE---TKVVSTNKEASKTEINTPNKDDIPPKT 786
            DS+      KE P T+    K ++ N    K + + P+ ++  P+T
Sbjct: 634  DSDRGGAQNKELPRTDDRKRKAMTPNNRKRKRKSDDPDPENKTPET 679


>ref|XP_010099058.1| Homeobox protein [Morus notabilis] gi|587887924|gb|EXB76647.1|
            Homeobox protein [Morus notabilis]
          Length = 1031

 Score =  371 bits (952), Expect = 2e-99
 Identities = 251/665 (37%), Positives = 331/665 (49%), Gaps = 17/665 (2%)
 Frame = -1

Query: 2936 GSDSIDTDRPKPSPEDSVDFEANLRSDGTPQAATNDSGTPSQIEKR--VNSKLGGKRYPL 2763
            GSDS   D+    P + V   ++L    T   +  +   PSQ+ ++    SK   K+Y L
Sbjct: 287  GSDSY-IDKQVEQPSEDVSKSSSLEQLETSSKSLVNK--PSQLGRKDKQTSKSRKKQYML 343

Query: 2762 RSSLEGTRVLRSRSNGVCXXXXXXXXXXXXXXXAQXXXXXXXXKGQNKVSNDELADIRRR 2583
            RS +   RVLRSR+                    +        +   +V  DE + IR+R
Sbjct: 344  RSLVHSDRVLRSRTQEKLKSHELSNTLSNIGNGVEKRMKERKKRRGTRVIADEFSRIRKR 403

Query: 2582 VKYLSARMNYEQSLIDAYSGEGWKRQSAEKVRPEKELQRASSEILRRKLELRDLFQRLDS 2403
            +KY   R++YEQ+LIDAYS EGWK  S EK++PEKELQRA SEI RRKL++RDLFQ+LDS
Sbjct: 404  LKYFFNRIHYEQNLIDAYSSEGWKGTSLEKLKPEKELQRAKSEIFRRKLKIRDLFQQLDS 463

Query: 2402 ICAEGKLQDSLFDSEGQIDSEDIFCAKCGSKDFSANNDIVLCDGGCDRGFHQMCLVPPLL 2223
            +CAEG+   SLFDSEGQIDSEDIFCAKCGSKD SANNDI+LCDG CDRGFHQ CL PPLL
Sbjct: 464  LCAEGRFPKSLFDSEGQIDSEDIFCAKCGSKDMSANNDIILCDGACDRGFHQFCLEPPLL 523

Query: 2222 N--IPPGDDSWLCPGCDCKVDCIGVLNDIQGTSLSIEDKWEKVFPEAAA-AASGDKQQXX 2052
            +  IPP D+ WLCPGCDCKVDC  +LND  GT+LS+ D WEKVFPEAAA A  G  Q   
Sbjct: 524  SEDIPPDDEGWLCPGCDCKVDCFDLLNDSYGTNLSVTDSWEKVFPEAAAAAREGKDQDHN 583

Query: 2051 XXXXXXXXXXXXXXXXXXXXXEKIQKXXXXXXXXXXXXXXXXXXXXXXDKQHINPGFXXX 1872
                                 EK++                        K     G    
Sbjct: 584  LEFPSDDSEDDDYDPYGPEIVEKVEGDESSSDESEYTSACDELEGEAPPKDEQYFGLSSD 643

Query: 1871 XXXXXXXXXDAPELDEKVQSEGLSSDESDFSSDTNDLIPSKYXXXXXXXXXXXXXNHAEH 1692
                     D  ++DE  + E   S  SDF+SD+ DL  +               +    
Sbjct: 644  DSEDNDFDPDDQDVDENAKQE---SSSSDFTSDSEDLAFTLDEGQIAEKDEVSSLDPTRS 700

Query: 1691 IGG-FDGGEPEASQXXXXXXXXXXXXXXXRDQSGEVFLPVSGKRLRERLDYKKLNDETYG 1515
            +G                             Q G    P+SGKR  ERLDYK+L+DETYG
Sbjct: 701  LGNAVMQSSKRGGNKSSIKDELLDILESGTGQDGSP--PISGKRHVERLDYKRLHDETYG 758

Query: 1514 NIHXXXXXXXDWTDMNTPKKVKRRGESKSTVM----ASLQNTQTIQDGENTK-------Q 1368
            ++        DWTD   P+K KR     S+V     AS+   QT  D  N          
Sbjct: 759  HLPSDSSDDEDWTDYAAPRKRKRTTGQVSSVSPNENASIIKNQTTTDAANNDLEDNEYVP 818

Query: 1367 SQRKPEGEVLRKQEKLPDTRTLHNLESEGANHTMKPCYMRRKLTTSRPGCFGREVSQRLC 1188
             +R  +  V+  +  +P+ + L      G+         RR+L+T+R    G  V+QRL 
Sbjct: 819  RRRSRQNSVVTDENNIPN-KLLQGSPKSGSTG------RRRELSTNRR--LGEAVTQRLY 869

Query: 1187 ESFKVNAYPTQEMLENLSKEIGITFHQVSIWFENTRRSFCLLETKEESKEENGTPNEDSI 1008
            +SFK N Y  +   E+L++E+G+T +QVS WFEN R S+    +K+    E+ +  E ++
Sbjct: 870  QSFKENQYLDRATKESLAQELGLTSYQVSKWFENARWSYRHSSSKKPGISEHAS-KESTL 928

Query: 1007 STKAN 993
            S + N
Sbjct: 929  SPQTN 933


>ref|XP_012093068.1| PREDICTED: homeobox protein HAT3.1 [Jatropha curcas]
          Length = 1015

 Score =  369 bits (947), Expect = 9e-99
 Identities = 260/698 (37%), Positives = 342/698 (48%), Gaps = 27/698 (3%)
 Frame = -1

Query: 2999 HEQNSLDTKD---KHTGAEETMRVGSDSIDTDRPKPSPEDSVDFEANLRSDGTPQAATND 2829
            H    L TK    +H G      + + + +   P   P D++D   NL+   TP    + 
Sbjct: 239  HLGTQLTTKSSPLEHLGMPSDSEINTCATEKLEP---PHDNMDNHLNLQQSDTPSKDVSI 295

Query: 2828 SGTPSQIEKRVNSKLGGKRYPLRSSLEGTRVLRSRSNGVCXXXXXXXXXXXXXXXAQXXX 2649
            + +   +  +  +K   K+Y LRS     RV +SRS                    +   
Sbjct: 296  NSSRVGVRVKRTAKSTRKKYVLRSLRRSDRVRQSRSQEKPKGPDPNADMANASSNIEKTR 355

Query: 2648 XXXXXKGQNKVSNDELADIRRRVKYLSARMNYEQSLIDAYSGEGWKRQSAEKVRPEKELQ 2469
                 + +  V  DE + IR+ ++YL  R++YEQSLI AYS EGWK  S EK++PEKELQ
Sbjct: 356  KKRKKRQRKSVEGDEYSRIRKHLRYLLNRISYEQSLITAYSAEGWKGLSLEKLKPEKELQ 415

Query: 2468 RASSEILRRKLELRDLFQRLDSICAEGKLQDSLFDSEGQIDSEDIFCAKCGSKDFSANND 2289
            RA+SEILRRKL++RDLFQR+DS+CAEG+L +SLFDS+GQI SEDIFCAKCGSKD +A+ND
Sbjct: 416  RATSEILRRKLKIRDLFQRVDSLCAEGRLPESLFDSDGQISSEDIFCAKCGSKDMTADND 475

Query: 2288 IVLCDGGCDRGFHQMCLVPPLL--NIPPGDDSWLCPGCDCKVDCIGVLNDIQGTSLSIED 2115
            I+LCDG CDRGFHQ CL+PPLL  +IPP D+ WLCPGCDCKVDCI +LND QGT++SI D
Sbjct: 476  IILCDGACDRGFHQFCLLPPLLKEDIPPDDEGWLCPGCDCKVDCIELLNDSQGTNISISD 535

Query: 2114 KWEKVFPEAAAAASGDKQQXXXXXXXXXXXXXXXXXXXXXXXEKIQKXXXXXXXXXXXXX 1935
            +WEKVFPEAA  A+G                           EK Q              
Sbjct: 536  RWEKVFPEAA--AAGQNPDPNFGLPSDDSDDNDYDPDGPEIDEKSQGDESSNDESDYTSA 593

Query: 1934 XXXXXXXXXDKQHINPGFXXXXXXXXXXXXDAPELDEKVQSEGLSSDESDFSSDTNDLIP 1755
                     D+Q +  G             DA + DE V+     S  SDF+SD+ DL  
Sbjct: 594  SDELEASPGDEQQL--GLSSDDSEDDDYDPDALDRDENVEE----SSSSDFTSDSEDLTA 647

Query: 1754 S--KYXXXXXXXXXXXXXNHAEHIGGFDGGEPEASQXXXXXXXXXXXXXXXRDQSGEVFL 1581
            +                  H +     +G +   S+                D SG    
Sbjct: 648  TLDDNHLSGEDENHMSIGLHGDSKHRGNGKQSTHSELSLLDLNSRK------DGSG---- 697

Query: 1580 PVSGKRLRERLDYKKLNDETYGNIHXXXXXXXDWTDMNTPKKVKRRGESKSTVMASLQNT 1401
            P+SGKR  ERLDYKKL DETYGN         D+TD   P+K +R+    ST   S  + 
Sbjct: 698  PISGKRDVERLDYKKLYDETYGNASSDSSDDEDFTDDVEPRK-RRKETYGSTSSDSSDDE 756

Query: 1400 QTIQDGENTKQSQRKPEGEVL--------------------RKQEKLPDTRTLHNLESEG 1281
              I D E  K+ +    G+                      R++ K  +T T      EG
Sbjct: 757  DFIDDVEPRKRRRSTEVGQASVNANAFVSKTAKQDTTPKRHRQKSKFANTSTSSTKGHEG 816

Query: 1280 ANHTMKPCYMRRKLTTSRPGCFGREVSQRLCESFKVNAYPTQEMLENLSKEIGITFHQVS 1101
            A+ +       + + +S     G  V+Q L +SFK N YP +   E+L+KE+GITF QVS
Sbjct: 817  ASPSSSS---GKPVKSSGYRRLGETVTQGLYKSFKENQYPDRAKKESLAKELGITFQQVS 873

Query: 1100 IWFENTRRSFCLLETKEESKEENGTPNEDSISTKANLE 987
             WFENTR SF    + + S     T  EDS   K N E
Sbjct: 874  KWFENTRWSFNHPPSTDASTVRK-TTKEDSQLPKTNTE 910


>gb|KDP44446.1| hypothetical protein JCGZ_16279 [Jatropha curcas]
          Length = 1009

 Score =  369 bits (947), Expect = 9e-99
 Identities = 260/698 (37%), Positives = 342/698 (48%), Gaps = 27/698 (3%)
 Frame = -1

Query: 2999 HEQNSLDTKD---KHTGAEETMRVGSDSIDTDRPKPSPEDSVDFEANLRSDGTPQAATND 2829
            H    L TK    +H G      + + + +   P   P D++D   NL+   TP    + 
Sbjct: 233  HLGTQLTTKSSPLEHLGMPSDSEINTCATEKLEP---PHDNMDNHLNLQQSDTPSKDVSI 289

Query: 2828 SGTPSQIEKRVNSKLGGKRYPLRSSLEGTRVLRSRSNGVCXXXXXXXXXXXXXXXAQXXX 2649
            + +   +  +  +K   K+Y LRS     RV +SRS                    +   
Sbjct: 290  NSSRVGVRVKRTAKSTRKKYVLRSLRRSDRVRQSRSQEKPKGPDPNADMANASSNIEKTR 349

Query: 2648 XXXXXKGQNKVSNDELADIRRRVKYLSARMNYEQSLIDAYSGEGWKRQSAEKVRPEKELQ 2469
                 + +  V  DE + IR+ ++YL  R++YEQSLI AYS EGWK  S EK++PEKELQ
Sbjct: 350  KKRKKRQRKSVEGDEYSRIRKHLRYLLNRISYEQSLITAYSAEGWKGLSLEKLKPEKELQ 409

Query: 2468 RASSEILRRKLELRDLFQRLDSICAEGKLQDSLFDSEGQIDSEDIFCAKCGSKDFSANND 2289
            RA+SEILRRKL++RDLFQR+DS+CAEG+L +SLFDS+GQI SEDIFCAKCGSKD +A+ND
Sbjct: 410  RATSEILRRKLKIRDLFQRVDSLCAEGRLPESLFDSDGQISSEDIFCAKCGSKDMTADND 469

Query: 2288 IVLCDGGCDRGFHQMCLVPPLL--NIPPGDDSWLCPGCDCKVDCIGVLNDIQGTSLSIED 2115
            I+LCDG CDRGFHQ CL+PPLL  +IPP D+ WLCPGCDCKVDCI +LND QGT++SI D
Sbjct: 470  IILCDGACDRGFHQFCLLPPLLKEDIPPDDEGWLCPGCDCKVDCIELLNDSQGTNISISD 529

Query: 2114 KWEKVFPEAAAAASGDKQQXXXXXXXXXXXXXXXXXXXXXXXEKIQKXXXXXXXXXXXXX 1935
            +WEKVFPEAA  A+G                           EK Q              
Sbjct: 530  RWEKVFPEAA--AAGQNPDPNFGLPSDDSDDNDYDPDGPEIDEKSQGDESSNDESDYTSA 587

Query: 1934 XXXXXXXXXDKQHINPGFXXXXXXXXXXXXDAPELDEKVQSEGLSSDESDFSSDTNDLIP 1755
                     D+Q +  G             DA + DE V+     S  SDF+SD+ DL  
Sbjct: 588  SDELEASPGDEQQL--GLSSDDSEDDDYDPDALDRDENVEE----SSSSDFTSDSEDLTA 641

Query: 1754 S--KYXXXXXXXXXXXXXNHAEHIGGFDGGEPEASQXXXXXXXXXXXXXXXRDQSGEVFL 1581
            +                  H +     +G +   S+                D SG    
Sbjct: 642  TLDDNHLSGEDENHMSIGLHGDSKHRGNGKQSTHSELSLLDLNSRK------DGSG---- 691

Query: 1580 PVSGKRLRERLDYKKLNDETYGNIHXXXXXXXDWTDMNTPKKVKRRGESKSTVMASLQNT 1401
            P+SGKR  ERLDYKKL DETYGN         D+TD   P+K +R+    ST   S  + 
Sbjct: 692  PISGKRDVERLDYKKLYDETYGNASSDSSDDEDFTDDVEPRK-RRKETYGSTSSDSSDDE 750

Query: 1400 QTIQDGENTKQSQRKPEGEVL--------------------RKQEKLPDTRTLHNLESEG 1281
              I D E  K+ +    G+                      R++ K  +T T      EG
Sbjct: 751  DFIDDVEPRKRRRSTEVGQASVNANAFVSKTAKQDTTPKRHRQKSKFANTSTSSTKGHEG 810

Query: 1280 ANHTMKPCYMRRKLTTSRPGCFGREVSQRLCESFKVNAYPTQEMLENLSKEIGITFHQVS 1101
            A+ +       + + +S     G  V+Q L +SFK N YP +   E+L+KE+GITF QVS
Sbjct: 811  ASPSSSS---GKPVKSSGYRRLGETVTQGLYKSFKENQYPDRAKKESLAKELGITFQQVS 867

Query: 1100 IWFENTRRSFCLLETKEESKEENGTPNEDSISTKANLE 987
             WFENTR SF    + + S     T  EDS   K N E
Sbjct: 868  KWFENTRWSFNHPPSTDASTVRK-TTKEDSQLPKTNTE 904


>ref|XP_007042568.1| Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain,
            putative isoform 1 [Theobroma cacao]
            gi|590687101|ref|XP_007042569.1| Homeodomain-like protein
            with RING/FYVE/PHD-type zinc finger domain, putative
            isoform 1 [Theobroma cacao] gi|508706503|gb|EOX98399.1|
            Homeodomain-like protein with RING/FYVE/PHD-type zinc
            finger domain, putative isoform 1 [Theobroma cacao]
            gi|508706504|gb|EOX98400.1| Homeodomain-like protein with
            RING/FYVE/PHD-type zinc finger domain, putative isoform 1
            [Theobroma cacao]
          Length = 950

 Score =  367 bits (943), Expect = 3e-98
 Identities = 247/650 (38%), Positives = 323/650 (49%), Gaps = 11/650 (1%)
 Frame = -1

Query: 2903 PSPEDS-VDFEANLRSDGTPQAATNDSGTPSQIEKRVNSKLGG---KRYPLRSSLEGTRV 2736
            PS + S +  E   ++ G  Q  T          +R N K      K+Y LRS     RV
Sbjct: 298  PSTQQSGLPCEDMAQNSGVEQHETKPKNLLENSGRRRNGKTSKTIKKKYMLRSLRSSDRV 357

Query: 2735 LRSRSNGVCXXXXXXXXXXXXXXXAQXXXXXXXXKGQNKVSNDELADIRRRVKYLSARMN 2556
            LRS+                     Q        +  N+   DE + IR  ++YL  R+N
Sbjct: 358  LRSKLQEKPKATESSNNLADVGSSEQQKRRKRRRRKANREVADEFSRIRTHLRYLLNRIN 417

Query: 2555 YEQSLIDAYSGEGWKRQSAEKVRPEKELQRASSEILRRKLELRDLFQRLDSICAEGKLQD 2376
            YE+SLI AYS EGWK  S EK++PEKELQRA+SEILRRKL++RDLFQ +DS+CAEGKL +
Sbjct: 418  YERSLIAAYSTEGWKGLSLEKLKPEKELQRATSEILRRKLKIRDLFQHIDSLCAEGKLPE 477

Query: 2375 SLFDSEGQIDSEDIFCAKCGSKDFSANNDIVLCDGGCDRGFHQMCLVPPLL--NIPPGDD 2202
            SLFDSEGQIDSEDIFCAKCGSKD SANNDI+LCDG CDRGFHQ CL PPLL  +IPP D+
Sbjct: 478  SLFDSEGQIDSEDIFCAKCGSKDLSANNDIILCDGACDRGFHQYCLQPPLLKEDIPPDDE 537

Query: 2201 SWLCPGCDCKVDCIGVLNDIQGTSLSIEDKWEKVFPEAAAAASGDKQQXXXXXXXXXXXX 2022
             WLCPGCDCKVDCI ++N+ QGTS SI D WEKVFPEAA AA+G  Q             
Sbjct: 538  GWLCPGCDCKVDCIELVNESQGTSFSITDSWEKVFPEAAVAAAGQNQDPNFGLPSDDSDD 597

Query: 2021 XXXXXXXXXXXEKIQKXXXXXXXXXXXXXXXXXXXXXXDKQHINPGFXXXXXXXXXXXXD 1842
                       EK                           Q++  G             D
Sbjct: 598  NDYNPDGSETDEKDHGDESSSEESEFTSTSEELEVPAKVDQYL--GLPSDDSEDDDYDPD 655

Query: 1841 APELDEKVQSEGLSSDESDFSSDTNDLIPSKYXXXXXXXXXXXXXNHAEHIGGFDGGEPE 1662
             P  DE V+ E   S  SDFSSD+ DL                  N A         +P+
Sbjct: 656  GPNHDEVVKPE---SSSSDFSSDSEDLDAMLEEDITSQKDEGPMANSAPR--DSKRRKPK 710

Query: 1661 ASQXXXXXXXXXXXXXXXRDQSGEVFLPVSGKRLRERLDYKKLNDETYGNIHXXXXXXXD 1482
              +                +Q G     +S KR  ERLDYK+L DETYGN+        D
Sbjct: 711  LGEKESMNDELLSIMEPASEQDGSA---ISKKRSIERLDYKRLYDETYGNVPSSSSDDED 767

Query: 1481 WTDMNTPKKVKRRGESKSTVMASLQN-----TQTIQDGENTKQSQRKPEGEVLRKQEKLP 1317
            W+D+  P+K   R +  + V ++ +N     ++T+   +  KQ+  + E +  RK  ++ 
Sbjct: 768  WSDITAPRK---RNKCTAEVASAPENGNVSVSRTVSVSDGLKQNPEETEHKPRRKTRQMS 824

Query: 1316 DTRTLHNLESEGANHTMKPCYMRRKLTTSRPGCFGREVSQRLCESFKVNAYPTQEMLENL 1137
              +   +  +E   +T       +K  +S     G  V QRL +SFK N YP +   ++L
Sbjct: 825  RFKDTDSSPAEIQGNTSVSGSSGKKAGSSTYKRLGEAVKQRLYKSFKENQYPDRATKQSL 884

Query: 1136 SKEIGITFHQVSIWFENTRRSFCLLETKEESKEENGTPNEDSISTKANLE 987
            +KE+ +TF QVS WF+N R SF    +  E+   N +  +D  S+  N E
Sbjct: 885  AKELDMTFQQVSKWFDNARWSFNNSPSSHETIANNAS-EKDITSSLPNKE 933


>ref|XP_009399131.1| PREDICTED: homeobox protein HOX1A isoform X1 [Musa acuminata subsp.
            malaccensis]
          Length = 740

 Score =  367 bits (942), Expect = 3e-98
 Identities = 226/610 (37%), Positives = 315/610 (51%), Gaps = 13/610 (2%)
 Frame = -1

Query: 2864 RSDGTPQAATNDSGTPSQIEKRVNSKLGGKRYPLRSSLEGTRVLRSRSNGVCXXXXXXXX 2685
            ++D    +A ++S  P +  K V+ K+  KRY LRSSL+  R+LRS +NG          
Sbjct: 37   KNDRCTPSAKHNSKKPVRKRKEVSQKVNSKRYSLRSSLDDVRILRSMTNG---KDKTSAG 93

Query: 2684 XXXXXXXAQXXXXXXXXKGQNKVSNDELADIRRRVKYLSARMNYEQSLIDAYSGEGWKRQ 2505
                   +         KG++ ++N+ L  I++RV+YL  R+NYEQSLIDAYS EGWK Q
Sbjct: 94   SSNSTAISMTKTRKKRRKGESALNNEFLL-IKKRVRYLLTRINYEQSLIDAYSNEGWKGQ 152

Query: 2504 SAEKVRPEKELQRASSEILRRKLELRDLFQRLDSICAEGKLQDSLFDSEGQIDSEDIFCA 2325
            S EK+RPEKEL+RA SEILR KL +R+ F+ LDS+ +EGKL+++LFD +GQIDSE IFCA
Sbjct: 153  SLEKIRPEKELERAKSEILRCKLRIRESFRHLDSLLSEGKLEENLFDDDGQIDSEAIFCA 212

Query: 2324 KCGSKDFSANNDIVLCDGGCDRGFHQMCLVPPLL--NIPPGDDSWLCPGCDCKVDCIGVL 2151
            KCGSKD SA+NDI+LCDG CDRGFHQ CL PPL    IPPGD  WLCP CDCKVDC+ +L
Sbjct: 213  KCGSKDLSADNDIILCDGNCDRGFHQKCLNPPLATDEIPPGDQGWLCPACDCKVDCLDLL 272

Query: 2150 NDIQGTSLSIEDKWEKVFPEAAAAASGDKQQXXXXXXXXXXXXXXXXXXXXXXXEKIQKX 1971
            N+ QG+ LSIED WEK+FPEAA  A+G+KQ                         + Q+ 
Sbjct: 273  NEFQGSDLSIEDTWEKIFPEAAVVANGNKQFDDSNDSSDDSEDHDYNPDTPEVGIEDQEE 332

Query: 1970 XXXXXXXXXXXXXXXXXXXXXDKQHINPGFXXXXXXXXXXXXDAPELDEKVQSEGLSSDE 1791
                                      + G             + P+ D+ VQ EG  S E
Sbjct: 333  GSSSEESDSISLSEEAPGSPRHNNFNDLGLPSDDSEDDDYDPERPDPDKDVQKEGSDSSE 392

Query: 1790 SDFSSDTNDLIPSKYXXXXXXXXXXXXXNHAEHIGGFDGGEPEASQXXXXXXXXXXXXXX 1611
            SDF+SD+++                   +  + + G   G  E  +              
Sbjct: 393  SDFTSDSDEFCVELSKSTNINEESSFSLSEPKLLDGSCEGRDETHE----SPINAKPPPV 448

Query: 1610 XRDQSGEV-FLPVSGKRLRERLDYKKLNDETYGNIHXXXXXXXDWTDMNTPKKVKRRGES 1434
               + G+V   PVS KR RE LDY+KL DE YG          DW++ +  KK K+  + 
Sbjct: 449  MEGEPGQVNTFPVSKKREREHLDYEKLYDEAYGKESPVSSKDEDWSEESAAKKAKKDDDK 508

Query: 1433 KSTVMASLQNTQTIQDGENTK-QSQRKPEGEVLRKQEKLPDT--RTLHNLESEGANHTMK 1263
            +          Q+  +  +   + + + + E+       P     T  +   +G  + ++
Sbjct: 509  REDAKLPGAKAQSANNRRSLGIKGKVEDDNEIDLPNLDQPQVSKATSESTPDKGHENLVE 568

Query: 1262 PCYMRRKL-------TTSRPGCFGREVSQRLCESFKVNAYPTQEMLENLSKEIGITFHQV 1104
             C   + L       T S    FG+E SQ+L E FK N YP++E  ENL++E+G+T  ++
Sbjct: 569  QCDGDQLLGLDGITVTYSARKYFGQETSQKLQEIFKENQYPSRETKENLAEELGVTAKRI 628

Query: 1103 SIWFENTRRS 1074
            S WFEN R +
Sbjct: 629  SKWFENARHN 638


>ref|XP_009351161.1| PREDICTED: homeobox protein HAT3.1-like [Pyrus x bretschneideri]
          Length = 1052

 Score =  367 bits (941), Expect = 4e-98
 Identities = 256/731 (35%), Positives = 353/731 (48%), Gaps = 37/731 (5%)
 Frame = -1

Query: 2867 LRSDGTPQAATNDSGTPSQI--EKRVNSKLGGKRYPLRSSLEGTRVLRSR---------- 2724
            L +D T  ++   +  PS+   +++ N K   K+Y  +SS+   RVLRS+          
Sbjct: 343  LPADVTQNSSLEKTEKPSKNAPKEKQNPKSRKKKYVSKSSIGSDRVLRSKTGEKTKNPKL 402

Query: 2723 SNGVCXXXXXXXXXXXXXXXAQXXXXXXXXKGQNKVSNDELADIRRRVKYLSARMNYEQS 2544
            SN V                 +        + +NKV +DE + +    + L  R++YE+S
Sbjct: 403  SNDVSTLESSNSVANPSNVEGKRRKKRKKRQ-RNKVIDDEFSRVXXXXRDLLNRISYEKS 461

Query: 2543 LIDAYSGEGWKRQSAEKVRPEKELQRASSEILRRKLELRDLFQRLDSICAEGKLQDSLFD 2364
            LIDAYSGEGWK  S EK++PEKELQRA+SEILRRKL++RDLFQRLDS+C+EG   +SLFD
Sbjct: 462  LIDAYSGEGWKGSSLEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCSEGMFPESLFD 521

Query: 2363 SEGQIDSEDIFCAKCGSKDFSANNDIVLCDGGCDRGFHQMCLVPPLL--NIPPGDDSWLC 2190
            SEGQIDSEDIFCAKCGSKD S  NDI+LCDG CDRGFHQ CL P LL  +IPP D+ WLC
Sbjct: 522  SEGQIDSEDIFCAKCGSKDVSLQNDIILCDGACDRGFHQFCLEPSLLSEDIPPDDEGWLC 581

Query: 2189 PGCDCKVDCIGVLNDIQGTSLSIEDKWEKVFPEAAAAASGDKQQXXXXXXXXXXXXXXXX 2010
            PGCDCKVDC  +LN+ QGT LS+ D WEKVFPEAAAAASG  Q+                
Sbjct: 582  PGCDCKVDCFDLLNESQGTDLSVADSWEKVFPEAAAAASGHNQEHTHGLPSDDSDDNDYD 641

Query: 2009 XXXXXXXEKIQKXXXXXXXXXXXXXXXXXXXXXXDKQHINPGFXXXXXXXXXXXXDAPEL 1830
                   +++Q                       +    + G             DAPE+
Sbjct: 642  PDGPETDDEVQGEESSSDDESKYASASDGLETPKNNDEQDFGLPSDDSEDDDYNPDAPEV 701

Query: 1829 DEKVQSEGLSSDESDFSSDTNDLIPSKYXXXXXXXXXXXXXNHAEHIGGFDGGEPEASQX 1650
             ++++ E   S  SDF+SD+ DL  S                  +  G   G   ++S+ 
Sbjct: 702  TDELKKE---SSSSDFTSDSEDLGAS--------LDDNNMSAEDDESGPLRGSGKQSSRH 750

Query: 1649 XXXXXXXXXXXXXXRD----QSGEVFLPVSGKRLRERLDYKKLNDETYGNIHXXXXXXXD 1482
                           +    Q G    PVSGKR  ERL+YKKL+DETYGN+        +
Sbjct: 751  GQKKQPLKDELLSLLESGPGQGGAA--PVSGKRHIERLNYKKLHDETYGNVRTDSSDDEE 808

Query: 1481 WTDMNTPKKVKR----------RGESKSTVMASLQNTQTIQDGENTKQSQRKPEG----- 1347
            W D   P+K K+           G+S +     + N       EN    +R P G     
Sbjct: 809  WNDTAGPRKRKKVTTQAPTMSPNGDSSNVKNVMITNNIKHDLDENENTPKRTPRGNKNTP 868

Query: 1346 EVLRKQEKLPDTRTLHNLESEGANHTMKPCYMRRKLTTSRPGCFGREVSQRLCESFKVNA 1167
            +   ++ K+ DT  L N   +G+  +      +   + S     G   +QRL + FK N 
Sbjct: 869  KRAHRKSKVEDTSNLSNKSQKGSTQSASTS-EKGGSSRSTYRKLGEAAAQRLSKLFKENH 927

Query: 1166 YPTQEMLENLSKEIGITFHQVSIWFENTRRSFCLLETKEESKEENGTPNEDSISTKANLE 987
            YP + M E+L++E+GI   QVS WFEN R   C   + ++S   NGTP   +   +    
Sbjct: 928  YPDRSMKESLAQELGIMAKQVSKWFENARH--CWKVSVDKSAAGNGTPLPQTNGKQLE-- 983

Query: 986  EPRXXXXXXXXXXXXXKTENNTPSQDSNHTEMGVKE-PGTE---TKVVSTNKEASKTEIN 819
                              + +TP  DS+      KE P T+    K ++ N    K + +
Sbjct: 984  ------------------QGDTPIGDSDRGGAQNKELPRTDDRKRKAMTPNNRKRKRKSD 1025

Query: 818  TPNKDDIPPKT 786
             P+ ++  P+T
Sbjct: 1026 DPDPENKTPET 1036


>ref|XP_002300247.2| homeobox family protein [Populus trichocarpa]
            gi|550348560|gb|EEE85052.2| homeobox family protein
            [Populus trichocarpa]
          Length = 930

 Score =  366 bits (940), Expect = 6e-98
 Identities = 261/697 (37%), Positives = 339/697 (48%), Gaps = 16/697 (2%)
 Frame = -1

Query: 3068 PKKERRKSESEQMDVSHP---VEEHCHEQNSLDTKD---KHTGAEETMRVGSDSIDTDRP 2907
            P  E  ++ES ++ +  P     E+      L TK    KH G      +   + +  RP
Sbjct: 224  PGSELAENESMEIGIGLPSGIAIENLEPLTELVTKSCPIKHIGLPPGDDISIPANEQIRP 283

Query: 2906 KPSPEDSVDFEANLRSDGTPQAATNDSGTPSQIEKRVNSKLGGKRYPLRSSLEGTRVLRS 2727
                E       +L             G PS    +  SKL GK+Y   SS +  RVLRS
Sbjct: 284  THDKESKYPDCEHLEKLSGIVIGITSQGVPSV---KRTSKLSGKKYT-SSSRKSDRVLRS 339

Query: 2726 RSNGVCXXXXXXXXXXXXXXXAQXXXXXXXXKGQNKVSNDELADIRRRVKYLSARMNYEQ 2547
             S                    +        +    +  DE + IR R++YL  RM+YEQ
Sbjct: 340  NSQEKPKAPEPSNNSTNVNSTGEEKGKRRKKRRGKSIVADEYSRIRARLRYLLNRMSYEQ 399

Query: 2546 SLIDAYSGEGWKRQSAEKVRPEKELQRASSEILRRKLELRDLFQRLDSICAEGKLQDSLF 2367
            SLI AYSGEGWK  S EK++PEKELQRA+SEI+RRK+++RDLFQ +DS+C EG+   SLF
Sbjct: 400  SLITAYSGEGWKGLSLEKLKPEKELQRATSEIIRRKVKIRDLFQHIDSLCGEGRFPASLF 459

Query: 2366 DSEGQIDSEDIFCAKCGSKDFSANNDIVLCDGGCDRGFHQMCLVPPLL--NIPPGDDSWL 2193
            DSEGQIDSEDIFCAKCGSKD +A+NDI+LCDG CDRGFHQ CLVPPLL  +IPPGD+ WL
Sbjct: 460  DSEGQIDSEDIFCAKCGSKDLTADNDIILCDGACDRGFHQFCLVPPLLREDIPPGDEGWL 519

Query: 2192 CPGCDCKVDCIGVLNDIQGTSLSIEDKWEKVFPEAAAAASGDKQQXXXXXXXXXXXXXXX 2013
            CPGCDCKVDCI +LND QGT++SI D+W+ VFPEAAA ASG K                 
Sbjct: 520  CPGCDCKVDCIDLLNDSQGTNISISDRWDNVFPEAAAVASGQKLDYNFGLSSDDSDDNDY 579

Query: 2012 XXXXXXXXEKIQKXXXXXXXXXXXXXXXXXXXXXXDKQHINPGFXXXXXXXXXXXXDAPE 1833
                    EK Q+                      DKQ++  G             DAP 
Sbjct: 580  DPDGPDIDEKSQE-ESSSDESDFSSASDEFEAPPDDKQYL--GLPSDDSEDDDYDPDAPV 636

Query: 1832 LDEKVQSEGLSSDESDFSSDTNDLIPSKYXXXXXXXXXXXXXNHAEHIGGFDGGEPEASQ 1653
            L+EK++ E   S  SDF+SD+ DL                     E     +G       
Sbjct: 637  LEEKLKQE---SSSSDFTSDSEDL--DATLNGDGLSLGDEYHMPIEPHEDSNGRRSRFGG 691

Query: 1652 XXXXXXXXXXXXXXXRDQSGEVFLPVSGKRLRERLDYKKLNDETYGNIHXXXXXXXDWTD 1473
                            D   E   PVSGKR  ERLDYKKL DETYGNI        D+TD
Sbjct: 692  KKNHSLNSKLLSMLEPDSHQEKSAPVSGKRNIERLDYKKLYDETYGNI--STSSDDDYTD 749

Query: 1472 MNTPKKVKRRGESKSTVMASLQNTQTIQDGENTKQSQRKPEGEVLRKQEKLPDTRTLHNL 1293
               P+K +R+      +  +  +    ++G N+K   ++     L+K E     RT  N 
Sbjct: 750  TVAPRK-RRKNTGDVAMGIANGDASVTENGLNSKNMNQE-----LKKNEH-TSGRTHQNS 802

Query: 1292 ESEGANHTMKPCYMRRKLTTS-----RPGCF---GREVSQRLCESFKVNAYPTQEMLENL 1137
              +  N +    ++   L+ S     RP  +   G  V+Q+L   FK N YP Q    +L
Sbjct: 803  SFQDTNVSPAKTHVGESLSGSSSKRVRPSAYKKLGEAVTQKLYSFFKENRYPDQAAKASL 862

Query: 1136 SKEIGITFHQVSIWFENTRRSFCLLETKEESKEENGT 1026
            ++E+GITF QV+ WF N R SF     +  SK E+ +
Sbjct: 863  AEELGITFEQVNKWFMNARWSFNHSSPEGTSKAESAS 899


>ref|XP_008236405.1| PREDICTED: LOW QUALITY PROTEIN: homeobox protein HAT3.1 [Prunus mume]
          Length = 1040

 Score =  362 bits (929), Expect = 1e-96
 Identities = 269/790 (34%), Positives = 375/790 (47%), Gaps = 75/790 (9%)
 Frame = -1

Query: 3176 LRTLSPLFYIHPERKKIVAPLIFHASASTVAAIAGFPKKERRKS-ESEQMDVSHPV---- 3012
            L+T+ PL       +  + P+  + + +++   AG P ++  K+ +SE++  SH +    
Sbjct: 169  LQTIMPLPICSGSEQ--LQPISENVNMASLNDQAGLPPEDVSKTCQSEKISCSHQITLQQ 226

Query: 3011 ----------EEHCHEQNSLDTKDKHTGAEETMRVGSDSIDTDRPKPS------------ 2898
                       E   ++  LD+        +T +  S SI  ++P PS            
Sbjct: 227  INEFGSGSVPSEPAKQKYELDSVPAQNDEVKTSKAVSSSIVFEQPGPSIEAMTEDSPIGH 286

Query: 2897 ----PEDSVDFEANLRSDGTPQAATNDSG-----TPSQIEKRVNSKLGGK---------- 2775
                PED++   ++   +  P+  T +S      TPS+   +++S LG K          
Sbjct: 287  SEPPPEDAIKSLSDKEMEPLPEDVTQNSSLQQSETPSKNALKISSCLGPKDKKNPKSRKR 346

Query: 2774 RYPLRSSLEGTRVLRSR------------SNGVCXXXXXXXXXXXXXXXAQXXXXXXXXK 2631
            +Y  RS +   RVLRSR            SN V                 +         
Sbjct: 347  KYMSRSFVRSDRVLRSRTGEKEKPKDLKLSNNVATLESSNSIANVSNGEEKKRKKR---- 402

Query: 2630 GQNKVSNDELADIRRRVKYLSARMNYEQSLIDAYSGEGWKRQSAEKVRPEKELQRASSEI 2451
             +N+  N  +AD   R+     R + E+SLIDAYSGEGWK  S EK++PEKELQRA+SEI
Sbjct: 403  -KNRRDNRAIADEFSRI-----RTHXEKSLIDAYSGEGWKGSSLEKLKPEKELQRATSEI 456

Query: 2450 LRRKLELRDLFQRLDSICAEGKLQDSLFDSEGQIDSEDIFCAKCGSKDFSANNDIVLCDG 2271
            LRRKL++RDLFQRL+S+CAEG   +SLFDSEGQIDSEDIFCAKCGSKD S +NDI+LCDG
Sbjct: 457  LRRKLKIRDLFQRLESLCAEGMFPESLFDSEGQIDSEDIFCAKCGSKDVSLDNDIILCDG 516

Query: 2270 GCDRGFHQMCLVPPLL--NIPPGDDSWLCPGCDCKVDCIGVLNDIQGTSLSIEDKWEKVF 2097
             CDRGFHQ CL PPLL  +IPP D+ WLCPGCDCKVDCI +LND QGT LS+ D WEKVF
Sbjct: 517  ACDRGFHQFCLEPPLLSEDIPPDDEGWLCPGCDCKVDCIDLLNDSQGTDLSVTDSWEKVF 576

Query: 2096 PEAAAAASGDKQQXXXXXXXXXXXXXXXXXXXXXXXEKIQ-KXXXXXXXXXXXXXXXXXX 1920
            PEAAAAAS  + Q                        K++ +                  
Sbjct: 577  PEAAAAASAGENQDNHGLPSDDSDDNDYDPDGPETDNKVEGEESSSDESEYASASDGLET 636

Query: 1919 XXXXDKQHINPGFXXXXXXXXXXXXDAPELDEKVQSEGLSSDESDFSSDTNDLIPSKYXX 1740
                D+Q++  G              AP+++E V+ E   S  SDF+SD+ DL  +    
Sbjct: 637  PKSNDEQYL--GLPSEDSEDDDYNPYAPDVNEDVKQE---SSSSDFTSDSEDLGAALDDN 691

Query: 1739 XXXXXXXXXXXNHAEHIGGFDGGEPEASQXXXXXXXXXXXXXXXRDQSGE---VFLPVSG 1569
                       + +        G  E S                  +SG        +SG
Sbjct: 692  IMSSEDVEGSKSTSLDDSKPHRGSGERSSRRGQKKHSLKDELISLLESGPGQGESASLSG 751

Query: 1568 KRLRERLDYKKLNDETYGNIHXXXXXXXDWTDMNTPKKVKRRGESKSTVMASLQNTQTIQ 1389
            KR  ERLDYK+L+DE YGN+        DW D+   +K +++G  +    +    T  I+
Sbjct: 752  KRHIERLDYKRLHDEAYGNVPTDSSDDEDWNDIAIQRK-RKKGTGQVANRSPNGKTSNIK 810

Query: 1388 DG-----------ENTKQSQRKPEGEVLRKQEKLPDTRTLHNLESEGANHTMKPCYMRRK 1242
            +G           EN    +RKP      ++  + DT    N   +G+  +      R  
Sbjct: 811  NGVITKDIKPDVDENENTPRRKP-----HRKSNVEDTSNFSNKSPKGSTKSGSTS-GRAG 864

Query: 1241 LTTSRPGCFGREVSQRLCESFKVNAYPTQEMLENLSKEIGITFHQVSIWFENTRRSFCLL 1062
             + S     G  V+QRLC+SFK N YP + M E+L++E+G+   QVS WFEN R   CL 
Sbjct: 865  SSRSTYSRLGEAVTQRLCKSFKENHYPDRSMKESLARELGLMAKQVSKWFENARH--CLK 922

Query: 1061 ETKEESKEEN 1032
               ++S  EN
Sbjct: 923  VGVDKSASEN 932


>ref|XP_007200058.1| hypothetical protein PRUPE_ppa023106mg [Prunus persica]
            gi|462395458|gb|EMJ01257.1| hypothetical protein
            PRUPE_ppa023106mg [Prunus persica]
          Length = 1058

 Score =  361 bits (927), Expect = 2e-96
 Identities = 271/793 (34%), Positives = 372/793 (46%), Gaps = 54/793 (6%)
 Frame = -1

Query: 2978 TKDKHTG-AEETMRVGSDSIDTDRPKPSPEDSVDFEANLRSDGTPQAATNDSGTPSQIEK 2802
            T+D   G +E  +   S S+     +P PED     +  + +   + A   S      +K
Sbjct: 279  TEDSPIGHSEPPLEDLSKSLSDKEMEPLPEDVTQNSSLQQLETASKNALKISSCLGPKDK 338

Query: 2801 RVNSKLGGKRYPLRSSLEGTRVLRSR------------SNGVCXXXXXXXXXXXXXXXAQ 2658
            + N K   ++Y  RS +   RVLRS+            SN V                 +
Sbjct: 339  K-NPKSRKRKYMSRSFVRSDRVLRSKTGEKEKPKDLKLSNNVATLESSNSIANVSNGEEK 397

Query: 2657 XXXXXXXXKGQNKVSNDELADIRRRVKYLSARMNYEQSLIDAYSGEGWKRQSAEKVRPEK 2478
                    +  N+   DE + IR  ++YL  R+ YE+SLIDAYSGEGWK  S EK++PEK
Sbjct: 398  KRKKRKNRR-DNRAIADEFSRIRTHLRYLLNRIGYEKSLIDAYSGEGWKGSSLEKLKPEK 456

Query: 2477 ELQRASSEILRRKLELRDLFQRLDSICAEGKLQDSLFDSEGQIDSEDIFCAKCGSKDFSA 2298
            ELQRA+SEILRRKL++RDLFQRL+S+CAEG   +SLFDSEGQIDSEDIFC KCGSKD S 
Sbjct: 457  ELQRATSEILRRKLKIRDLFQRLESLCAEGMFPESLFDSEGQIDSEDIFCGKCGSKDVSL 516

Query: 2297 NNDIVLCDGGCDRGFHQMCLVPPLL--NIPPGDDSWLCPGCDCKVDCIGVLNDIQGTSLS 2124
            +NDI+LCDG CDRGFHQ CL PPLL  +IPP D+ WLCPGCDCKVDCI +LND QGT LS
Sbjct: 517  DNDIILCDGACDRGFHQFCLEPPLLSEDIPPDDEGWLCPGCDCKVDCIDLLNDSQGTDLS 576

Query: 2123 IEDKWEKVFPEAAAAASGDKQQXXXXXXXXXXXXXXXXXXXXXXXEKIQ-KXXXXXXXXX 1947
            + D WEKVFPEAAAAAS  + Q                        K+Q +         
Sbjct: 577  VTDSWEKVFPEAAAAASAGENQDNHGLPSDDSDDNDYDPDGPETDNKVQGEESSSDESEY 636

Query: 1946 XXXXXXXXXXXXXDKQHINPGFXXXXXXXXXXXXDAPELDEKVQSEGLSSDESDFSSDTN 1767
                         D+Q++  G              AP+++E V+ E   S  SDF+SD+ 
Sbjct: 637  ASASDGLETPKSNDEQYL--GLPSEDSEDDDYNPYAPDVNEDVKQE---SSSSDFTSDSE 691

Query: 1766 DLIPSKYXXXXXXXXXXXXXNHAEHIGGFDGGEPEASQXXXXXXXXXXXXXXXRDQSGE- 1590
            DL  +               + +        G  E S                  +SG  
Sbjct: 692  DLGAALDDNIMSSEDVEGPKSTSLDDSKPHRGSGEQSSISGQKKHSLKDELISLLESGPG 751

Query: 1589 --VFLPVSGKRLRERLDYKKLNDETYGNIHXXXXXXXDWTDMNTPKKVKRRGESKSTVMA 1416
                 P+SGKR  ERLDYK+L+DE YGN+        DW D+ T +K +++G  +    +
Sbjct: 752  QGESAPLSGKRHIERLDYKRLHDEAYGNVPTDSSDDEDWNDIATQRK-RKKGTGQVANRS 810

Query: 1415 SLQNTQTIQDGENTK-------QSQRKPEGEVLRKQEKLPDTRTLHNLESEGANHTMKPC 1257
                T  I++G  TK       +++  P   +  ++  + DT  L N   +G+  +    
Sbjct: 811  PNGKTSNIKNGVITKDIKPDVDENENTPR-RMPHRKSNVEDTSNLSNKSPKGSTKSGSTS 869

Query: 1256 YMRRKLTTSRPGCFGREVSQRLCESFKVNAYPTQEMLENLSKEIGITFHQ---------V 1104
              R   + S     G   +QRLC+SFK N YP + M E+L++E+G+   Q         V
Sbjct: 870  -GRAGSSRSTYSRLGEAATQRLCKSFKENHYPDRSMKESLARELGLMAKQVIPSFILASV 928

Query: 1103 SIWFENTRRSFCLLETKEESKEENGTPNEDSISTKANLEEPRXXXXXXXXXXXXXKTENN 924
            S WFEN R   CL    ++S  EN  P     + +  LE                  + +
Sbjct: 929  SKWFENARH--CLKVGVDKSASENCAPPPQ--TNRRQLE------------------QGD 966

Query: 923  TPSQDSNHTEMGVKE-PGTETKVVS-----------TNKEASKTEINTPN-------KDD 801
                DS+H     KE  GT+  ++                +S+++++TPN        DD
Sbjct: 967  AIVGDSDHNGAQNKELHGTDDPMIGCCSRDVMDSELATLGSSRSKLSTPNNRKRKRRSDD 1026

Query: 800  IPPKTTVNEPQPE 762
              PKT    P  E
Sbjct: 1027 PDPKTETPTPPAE 1039


>ref|XP_011457795.1| PREDICTED: homeobox protein HAT3.1 isoform X2 [Fragaria vesca subsp.
            vesca]
          Length = 1202

 Score =  344 bits (882), Expect = 3e-91
 Identities = 241/642 (37%), Positives = 314/642 (48%), Gaps = 33/642 (5%)
 Frame = -1

Query: 2906 KPSPED-SVDFEANL----RSDGTPQAATNDSGTPSQIEKRVNSKLGGK-------RYPL 2763
            +P PED S D    L     +D T  +    S T S+   + +++ G K       R   
Sbjct: 460  EPPPEDASKDHNKELIKPHTNDATQNSCLEPSETASKNASKNSTQFGCKDKRNSSSRRKS 519

Query: 2762 RSSLEGTRVLRSRSNGVCXXXXXXXXXXXXXXXAQXXXXXXXXKGQNK---------VSN 2610
            RS +   RVLRSR++                            +G+ K         V+ 
Sbjct: 520  RSLVSSDRVLRSRTSEKPEAPELSNNVATLDTSNSVANVSNEKEGKRKKRKKKHRERVAA 579

Query: 2609 DELADIRRRVKYLSARMNYEQSLIDAYSGEGWKRQSAEKVRPEKELQRASSEILRRKLEL 2430
            DE + IR  ++Y   R+NYE+SLIDAYS EGWK  S EK++PEKELQRA+SEILRRK ++
Sbjct: 580  DEFSRIRSHLRYFLNRINYEKSLIDAYSSEGWKGNSLEKLKPEKELQRATSEILRRKSKI 639

Query: 2429 RDLFQRLDSICAEGKLQDSLFDSEGQIDSEDIFCAKCGSKDFSANNDIVLCDGGCDRGFH 2250
            RDLFQRLDS+CAEG   +SLFD EGQIDSEDIFCAKCGS D  A+NDI+LCDG CDRGFH
Sbjct: 640  RDLFQRLDSLCAEGMFPESLFDEEGQIDSEDIFCAKCGSLDVYADNDIILCDGACDRGFH 699

Query: 2249 QMCLVPPLLN--IPPGDDSWLCPGCDCKVDCIGVLNDIQGTSLSIEDKWEKVFPEAAAAA 2076
            Q CL PPLL+  IPP D+ WLCPGCDCKVDCI +LND QGT LSI D WEKVFPEAA AA
Sbjct: 700  QHCLEPPLLSEEIPPDDEGWLCPGCDCKVDCIDLLNDSQGTDLSITDSWEKVFPEAAVAA 759

Query: 2075 S-GDKQQXXXXXXXXXXXXXXXXXXXXXXXEKIQKXXXXXXXXXXXXXXXXXXXXXXDKQ 1899
            S G  Q+                       E++Q+                      + +
Sbjct: 760  SAGQHQENNQGLPSEDSDDDDYDPDGPETDEEVQEGESSSDESEYASASDGLETPKTNDE 819

Query: 1898 HINPGFXXXXXXXXXXXXDAPELDEKVQSEGLSSDESDFSSDTNDL---IPSKYXXXXXX 1728
                G             DAP+  E V+     S  SDF+SD+ DL   +          
Sbjct: 820  QY-LGIPSDDSEDDDFNPDAPDPTEDVKQ---GSSSSDFTSDSEDLAAVLDEDRKSFENG 875

Query: 1727 XXXXXXXNHAEHIGGFDGGEPEASQXXXXXXXXXXXXXXXRDQSGEVFLPVSGKRLRERL 1548
                     A  +    GG+                     D   +   PVSGKR  ERL
Sbjct: 876  EGPQSSVLEASTLLRGSGGKGSKRGQKRHFIKDELSSLIESDPGQDGSTPVSGKRHVERL 935

Query: 1547 DYKKLNDETYGNIHXXXXXXXDWTDMNTPKKVKRRGESKSTVMASLQNTQTIQDGENTKQ 1368
            DYKKL+DE YG+I        ++ +   P+K +++G  + +  +      TI+ G+ TK 
Sbjct: 936  DYKKLHDEEYGDI--PTSDDEEYIETAVPRK-RKKGAGQVSPGSLKGKPSTIKKGKTTKD 992

Query: 1367 SQRKPEGEVLRKQEKLPDTRTLHNLESEGANHTMKPCYM------RRKLTTSRPGCFGRE 1206
             +  P+        + P  ++  N  S   N ++K          R K +T R    G  
Sbjct: 993  IKDDPDKNE-HTPRRTPRRKSSANDNSSSPNESLKSSPKSGSTSGRAKGSTYRR--LGEA 1049

Query: 1205 VSQRLCESFKVNAYPTQEMLENLSKEIGITFHQVSIWFENTR 1080
            V+QRL  SFK N YP + M E L++E+G+   QVS WFEN R
Sbjct: 1050 VTQRLYTSFKENQYPDRSMKERLAQELGVMAKQVSKWFENAR 1091


>ref|XP_004289744.1| PREDICTED: homeobox protein HAT3.1 isoform X1 [Fragaria vesca subsp.
            vesca] gi|764524477|ref|XP_011457794.1| PREDICTED:
            homeobox protein HAT3.1 isoform X1 [Fragaria vesca subsp.
            vesca]
          Length = 1227

 Score =  344 bits (882), Expect = 3e-91
 Identities = 241/642 (37%), Positives = 314/642 (48%), Gaps = 33/642 (5%)
 Frame = -1

Query: 2906 KPSPED-SVDFEANL----RSDGTPQAATNDSGTPSQIEKRVNSKLGGK-------RYPL 2763
            +P PED S D    L     +D T  +    S T S+   + +++ G K       R   
Sbjct: 485  EPPPEDASKDHNKELIKPHTNDATQNSCLEPSETASKNASKNSTQFGCKDKRNSSSRRKS 544

Query: 2762 RSSLEGTRVLRSRSNGVCXXXXXXXXXXXXXXXAQXXXXXXXXKGQNK---------VSN 2610
            RS +   RVLRSR++                            +G+ K         V+ 
Sbjct: 545  RSLVSSDRVLRSRTSEKPEAPELSNNVATLDTSNSVANVSNEKEGKRKKRKKKHRERVAA 604

Query: 2609 DELADIRRRVKYLSARMNYEQSLIDAYSGEGWKRQSAEKVRPEKELQRASSEILRRKLEL 2430
            DE + IR  ++Y   R+NYE+SLIDAYS EGWK  S EK++PEKELQRA+SEILRRK ++
Sbjct: 605  DEFSRIRSHLRYFLNRINYEKSLIDAYSSEGWKGNSLEKLKPEKELQRATSEILRRKSKI 664

Query: 2429 RDLFQRLDSICAEGKLQDSLFDSEGQIDSEDIFCAKCGSKDFSANNDIVLCDGGCDRGFH 2250
            RDLFQRLDS+CAEG   +SLFD EGQIDSEDIFCAKCGS D  A+NDI+LCDG CDRGFH
Sbjct: 665  RDLFQRLDSLCAEGMFPESLFDEEGQIDSEDIFCAKCGSLDVYADNDIILCDGACDRGFH 724

Query: 2249 QMCLVPPLLN--IPPGDDSWLCPGCDCKVDCIGVLNDIQGTSLSIEDKWEKVFPEAAAAA 2076
            Q CL PPLL+  IPP D+ WLCPGCDCKVDCI +LND QGT LSI D WEKVFPEAA AA
Sbjct: 725  QHCLEPPLLSEEIPPDDEGWLCPGCDCKVDCIDLLNDSQGTDLSITDSWEKVFPEAAVAA 784

Query: 2075 S-GDKQQXXXXXXXXXXXXXXXXXXXXXXXEKIQKXXXXXXXXXXXXXXXXXXXXXXDKQ 1899
            S G  Q+                       E++Q+                      + +
Sbjct: 785  SAGQHQENNQGLPSEDSDDDDYDPDGPETDEEVQEGESSSDESEYASASDGLETPKTNDE 844

Query: 1898 HINPGFXXXXXXXXXXXXDAPELDEKVQSEGLSSDESDFSSDTNDL---IPSKYXXXXXX 1728
                G             DAP+  E V+     S  SDF+SD+ DL   +          
Sbjct: 845  QY-LGIPSDDSEDDDFNPDAPDPTEDVKQ---GSSSSDFTSDSEDLAAVLDEDRKSFENG 900

Query: 1727 XXXXXXXNHAEHIGGFDGGEPEASQXXXXXXXXXXXXXXXRDQSGEVFLPVSGKRLRERL 1548
                     A  +    GG+                     D   +   PVSGKR  ERL
Sbjct: 901  EGPQSSVLEASTLLRGSGGKGSKRGQKRHFIKDELSSLIESDPGQDGSTPVSGKRHVERL 960

Query: 1547 DYKKLNDETYGNIHXXXXXXXDWTDMNTPKKVKRRGESKSTVMASLQNTQTIQDGENTKQ 1368
            DYKKL+DE YG+I        ++ +   P+K +++G  + +  +      TI+ G+ TK 
Sbjct: 961  DYKKLHDEEYGDI--PTSDDEEYIETAVPRK-RKKGAGQVSPGSLKGKPSTIKKGKTTKD 1017

Query: 1367 SQRKPEGEVLRKQEKLPDTRTLHNLESEGANHTMKPCYM------RRKLTTSRPGCFGRE 1206
             +  P+        + P  ++  N  S   N ++K          R K +T R    G  
Sbjct: 1018 IKDDPDKNE-HTPRRTPRRKSSANDNSSSPNESLKSSPKSGSTSGRAKGSTYRR--LGEA 1074

Query: 1205 VSQRLCESFKVNAYPTQEMLENLSKEIGITFHQVSIWFENTR 1080
            V+QRL  SFK N YP + M E L++E+G+   QVS WFEN R
Sbjct: 1075 VTQRLYTSFKENQYPDRSMKERLAQELGVMAKQVSKWFENAR 1116


>ref|XP_011629041.1| PREDICTED: homeobox protein HOX1A [Amborella trichopoda]
          Length = 750

 Score =  311 bits (798), Expect = 2e-81
 Identities = 167/331 (50%), Positives = 211/331 (63%), Gaps = 7/331 (2%)
 Frame = -1

Query: 3032 MDVSHPVEEHCHEQNSLDTKDKHTGAEETMR-----VGSDSIDTDRPKPSPEDSVDFEAN 2868
            MD +   E++C     LD++   T  E+T +     +G  S++ +R  P+P D      N
Sbjct: 1    MDEASHGEQNCPRSLILDSERCSTSFEQTTKEEVPSIGVHSLEIERLTPAPIDPGYAGPN 60

Query: 2867 LRSDGTPQAATNDSGTPSQIEKRVNSKLGGKRYPLRSSLEGTRVLRSRSNGVCXXXXXXX 2688
                G   A+  +S       K+V S++G + Y LRSS  G RVLR RS G         
Sbjct: 61   SGIIGRNTASKGNSSRQEWKGKKVASQVGSRSYFLRSSSNGVRVLRPRSIGTSKTSPAAS 120

Query: 2687 XXXXXXXXAQXXXXXXXXKGQNKVSNDELADIRRRVKYLSARMNYEQSLIDAYSGEGWKR 2508
                     +        K +  +SNDE +  R+ V+YL AR+N+EQ LIDAYSGEGWK 
Sbjct: 121  SKSSPIMPERRKSRREKRKLKEVLSNDEYSRTRKSVRYLLARINFEQGLIDAYSGEGWKG 180

Query: 2507 QSAEKVRPEKELQRASSEILRRKLELRDLFQRLDSICAEGKLQDSLFDSEGQIDSEDIFC 2328
            QS EKV+PEKEL+RA  EI+RRKL +RDLFQ L ++C EG++ +SLFDSEG+I SEDIFC
Sbjct: 181  QSQEKVKPEKELKRAEDEIVRRKLRIRDLFQHLQTLCEEGRIHESLFDSEGKIYSEDIFC 240

Query: 2327 AKCGSKDFSANNDIVLCDGGCDRGFHQMCLVPPLL--NIPPGDDSWLCPGCDCKVDCIGV 2154
            AKCGSKD   +NDI+LCDG C+RGFHQMCLVPPLL   IPPGD+ WLCPGC+CK  C+ +
Sbjct: 241  AKCGSKDVPPDNDIILCDGICNRGFHQMCLVPPLLKEQIPPGDEGWLCPGCECKAFCVDL 300

Query: 2153 LNDIQGTSLSIEDKWEKVFPEAAAAASGDKQ 2061
            +ND  GT L IED WEKVF EAAA ASGDKQ
Sbjct: 301  VNDYLGTDLLIEDGWEKVFAEAAALASGDKQ 331



 Score =  105 bits (261), Expect = 3e-19
 Identities = 82/274 (29%), Positives = 119/274 (43%), Gaps = 21/274 (7%)
 Frame = -1

Query: 1838 PELDEKVQSEGLSSDESDFSSDTNDLIPSKYXXXXXXXXXXXXXNHAEHIGGFD---GGE 1668
            P++D++ Q+   SS+ESD +S ++D   S               +        D    G 
Sbjct: 352  PDIDDEAQNSSSSSEESDMTSGSSDSESSSSDDEASSLDEGSGSSLPGPFLSADLSLNGS 411

Query: 1667 PEASQXXXXXXXXXXXXXXXRDQSGEVFLPVSGKRLRERLDYKKLNDETYGNIHXXXXXX 1488
               S                 + +G+V  P+ GKR RERLDYKKL+DE YGN+       
Sbjct: 412  EGRSNQKKPRMNSELLSILEPESNGKVVSPLPGKRNRERLDYKKLHDEDYGNVSSDSSDD 471

Query: 1487 XDWTDMNTPKKVKRRGESKSTVM---------ASLQNTQTIQDGENTKQSQRKPEGEVLR 1335
             DW  M+T K+ K  G  + T +          SL+  ++I     T+   +KP  E ++
Sbjct: 472  EDWVAMDTSKRKKSGGVGRGTRLPTKHCTLSPGSLKIYESIPSLPETQILLQKPNSETIQ 531

Query: 1334 KQEKLPDTRT------LHNLESEGANHTM---KPCYMRRKLTTSRPGCFGREVSQRLCES 1182
                L           +H + + G    +   +    R    T     FGR V+Q L  S
Sbjct: 532  VGSSLTHNIPGNSQIQVHGVSASGVKSHVGGGEHISSRNGPVTPLSKRFGRLVTQSLHNS 591

Query: 1181 FKVNAYPTQEMLENLSKEIGITFHQVSIWFENTR 1080
            FK N YPT+E    L++E+GITF QVS WFEN R
Sbjct: 592  FKENMYPTKETRAKLAEELGITFKQVSKWFENAR 625


>gb|ERM96685.1| hypothetical protein AMTR_s00001p00272780 [Amborella trichopoda]
          Length = 800

 Score =  311 bits (798), Expect = 2e-81
 Identities = 167/331 (50%), Positives = 211/331 (63%), Gaps = 7/331 (2%)
 Frame = -1

Query: 3032 MDVSHPVEEHCHEQNSLDTKDKHTGAEETMR-----VGSDSIDTDRPKPSPEDSVDFEAN 2868
            MD +   E++C     LD++   T  E+T +     +G  S++ +R  P+P D      N
Sbjct: 1    MDEASHGEQNCPRSLILDSERCSTSFEQTTKEEVPSIGVHSLEIERLTPAPIDPGYAGPN 60

Query: 2867 LRSDGTPQAATNDSGTPSQIEKRVNSKLGGKRYPLRSSLEGTRVLRSRSNGVCXXXXXXX 2688
                G   A+  +S       K+V S++G + Y LRSS  G RVLR RS G         
Sbjct: 61   SGIIGRNTASKGNSSRQEWKGKKVASQVGSRSYFLRSSSNGVRVLRPRSIGTSKTSPAAS 120

Query: 2687 XXXXXXXXAQXXXXXXXXKGQNKVSNDELADIRRRVKYLSARMNYEQSLIDAYSGEGWKR 2508
                     +        K +  +SNDE +  R+ V+YL AR+N+EQ LIDAYSGEGWK 
Sbjct: 121  SKSSPIMPERRKSRREKRKLKEVLSNDEYSRTRKSVRYLLARINFEQGLIDAYSGEGWKG 180

Query: 2507 QSAEKVRPEKELQRASSEILRRKLELRDLFQRLDSICAEGKLQDSLFDSEGQIDSEDIFC 2328
            QS EKV+PEKEL+RA  EI+RRKL +RDLFQ L ++C EG++ +SLFDSEG+I SEDIFC
Sbjct: 181  QSQEKVKPEKELKRAEDEIVRRKLRIRDLFQHLQTLCEEGRIHESLFDSEGKIYSEDIFC 240

Query: 2327 AKCGSKDFSANNDIVLCDGGCDRGFHQMCLVPPLL--NIPPGDDSWLCPGCDCKVDCIGV 2154
            AKCGSKD   +NDI+LCDG C+RGFHQMCLVPPLL   IPPGD+ WLCPGC+CK  C+ +
Sbjct: 241  AKCGSKDVPPDNDIILCDGICNRGFHQMCLVPPLLKEQIPPGDEGWLCPGCECKAFCVDL 300

Query: 2153 LNDIQGTSLSIEDKWEKVFPEAAAAASGDKQ 2061
            +ND  GT L IED WEKVF EAAA ASGDKQ
Sbjct: 301  VNDYLGTDLLIEDGWEKVFAEAAALASGDKQ 331



 Score =  105 bits (261), Expect = 3e-19
 Identities = 82/274 (29%), Positives = 119/274 (43%), Gaps = 21/274 (7%)
 Frame = -1

Query: 1838 PELDEKVQSEGLSSDESDFSSDTNDLIPSKYXXXXXXXXXXXXXNHAEHIGGFD---GGE 1668
            P++D++ Q+   SS+ESD +S ++D   S               +        D    G 
Sbjct: 352  PDIDDEAQNSSSSSEESDMTSGSSDSESSSSDDEASSLDEGSGSSLPGPFLSADLSLNGS 411

Query: 1667 PEASQXXXXXXXXXXXXXXXRDQSGEVFLPVSGKRLRERLDYKKLNDETYGNIHXXXXXX 1488
               S                 + +G+V  P+ GKR RERLDYKKL+DE YGN+       
Sbjct: 412  EGRSNQKKPRMNSELLSILEPESNGKVVSPLPGKRNRERLDYKKLHDEDYGNVSSDSSDD 471

Query: 1487 XDWTDMNTPKKVKRRGESKSTVM---------ASLQNTQTIQDGENTKQSQRKPEGEVLR 1335
             DW  M+T K+ K  G  + T +          SL+  ++I     T+   +KP  E ++
Sbjct: 472  EDWVAMDTSKRKKSGGVGRGTRLPTKHCTLSPGSLKIYESIPSLPETQILLQKPNSETIQ 531

Query: 1334 KQEKLPDTRT------LHNLESEGANHTM---KPCYMRRKLTTSRPGCFGREVSQRLCES 1182
                L           +H + + G    +   +    R    T     FGR V+Q L  S
Sbjct: 532  VGSSLTHNIPGNSQIQVHGVSASGVKSHVGGGEHISSRNGPVTPLSKRFGRLVTQSLHNS 591

Query: 1181 FKVNAYPTQEMLENLSKEIGITFHQVSIWFENTR 1080
            FK N YPT+E    L++E+GITF QVS WFEN R
Sbjct: 592  FKENMYPTKETRAKLAEELGITFKQVSKWFENAR 625

