BLASTX nr result

ID: Gardenia21_contig00007530 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Gardenia21_contig00007530
         (4780 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CDO97241.1| unnamed protein product [Coffea canephora]           1286   0.0  
ref|XP_011088187.1| PREDICTED: pathogenesis-related homeodomain ...   497   e-137
ref|XP_009592467.1| PREDICTED: pathogenesis-related homeodomain ...   484   e-133
ref|XP_009775281.1| PREDICTED: pathogenesis-related homeodomain ...   480   e-132
ref|XP_011088190.1| PREDICTED: pathogenesis-related homeodomain ...   456   e-125
ref|XP_006346339.1| PREDICTED: pathogenesis-related homeodomain ...   456   e-124
ref|XP_012836886.1| PREDICTED: homeobox protein HAT3.1 [Erythran...   455   e-124
ref|XP_004230722.1| PREDICTED: pathogenesis-related homeodomain ...   452   e-123
ref|XP_007200058.1| hypothetical protein PRUPE_ppa023106mg [Prun...   433   e-118
ref|XP_008373076.1| PREDICTED: homeobox protein HAT3.1-like isof...   432   e-117
ref|XP_008373078.1| PREDICTED: homeobox protein HAT3.1-like isof...   429   e-116
ref|XP_008373077.1| PREDICTED: homeobox protein HAT3.1-like isof...   429   e-116
ref|XP_002527467.1| Homeobox protein HAT3.1, putative [Ricinus c...   427   e-116
ref|XP_007042568.1| Homeodomain-like protein with RING/FYVE/PHD-...   410   e-111
emb|CDP08734.1| unnamed protein product [Coffea canephora]            390   e-105
emb|CDP17419.1| unnamed protein product [Coffea canephora]            389   e-104
ref|XP_012093068.1| PREDICTED: homeobox protein HAT3.1 [Jatropha...   328   2e-86
gb|KDP44446.1| hypothetical protein JCGZ_16279 [Jatropha curcas]      328   2e-86
ref|XP_011001393.1| PREDICTED: homeobox protein HAT3.1-like [Pop...   327   7e-86
ref|XP_010099058.1| Homeobox protein [Morus notabilis] gi|587887...   326   1e-85

>emb|CDO97241.1| unnamed protein product [Coffea canephora]
          Length = 881

 Score = 1286 bits (3327), Expect = 0.0
 Identities = 677/883 (76%), Positives = 712/883 (80%), Gaps = 1/883 (0%)
 Frame = -2

Query: 2928 MDVVRXXXXXXXXXSPEQRALELGNGFVSGKLCTELVVQKREMVKDAQMDPEETGIRKSN 2749
            MDVVR         SPEQRALELGNGFVSG  CTE V+QK EMVKDA + PEETGIRKSN
Sbjct: 1    MDVVRESSLSESHLSPEQRALELGNGFVSGNRCTESVIQKCEMVKDAHIGPEETGIRKSN 60

Query: 2748 AHSVENLKTVDGLTNNADVKSLGLHNIQYLPESANAEPLEQKQVAGDDNDDNKLTETEIA 2569
            A+SVE LKTVDGLTNN+D +S  LHNIQYLPESANAEPLEQKQV GDDN DN+LTETEIA
Sbjct: 61   ANSVEILKTVDGLTNNSDFESFRLHNIQYLPESANAEPLEQKQVVGDDNVDNRLTETEIA 120

Query: 2568 APDLAGLEEYIQISVSPCENVAMVPAFSALGSDPQDASMHIDPQQTEPTQQKGAINAGGE 2389
            APDL GLEEYIQISVSPCENVA+VPAF++ GS+PQDASMH+DPQQTE TQ KGA+NAGGE
Sbjct: 121  APDLTGLEEYIQISVSPCENVAVVPAFASPGSEPQDASMHVDPQQTESTQ-KGAVNAGGE 179

Query: 2388 SGLDKRTPFQSRKRKSTLTIPVTARVLRSRSQXXXXXXXXXXXXXXXDAIEANXXXXXXX 2209
            S LDKRTPF+SRKRKST TIPVTARVLRSRSQ                A EA        
Sbjct: 180  SVLDKRTPFESRKRKSTSTIPVTARVLRSRSQEKSKESEKKDVVEDA-ATEAYRRKRGKK 238

Query: 2208 XXXXKIPANEFSRIRAHLRYLLHRIKYEQNLIDAYSGEGWKGQSLEKIKPEKELQRAKSQ 2029
                 IP NEFSRIRAHLRYLLHRIKYEQNLIDAYSGEGWKGQSLEKIKPEKELQRAKSQ
Sbjct: 239  KQRRNIPVNEFSRIRAHLRYLLHRIKYEQNLIDAYSGEGWKGQSLEKIKPEKELQRAKSQ 298

Query: 2028 IFRYKLKIRDLFRQIDLSLAEGKLPESLFDSDGQIDSEDIFCAKCGSKDLTLDNDIILCD 1849
            IFRYKLKIRDLFRQIDL LAEGKLPESLFDSDGQIDSEDIFCAKCGSKDLTLDNDIILCD
Sbjct: 299  IFRYKLKIRDLFRQIDLLLAEGKLPESLFDSDGQIDSEDIFCAKCGSKDLTLDNDIILCD 358

Query: 1848 GACERGFHQYCLEPPLLKEDIPPDEEGWLCPGCDCKVDCIELLSDFQGSNLSVLDKWEKV 1669
            GACERGFHQ+CLEPPLLKEDIPPDEEGWLCPGCDCKVDCIELLSDFQGSNLSVLDKWEKV
Sbjct: 359  GACERGFHQFCLEPPLLKEDIPPDEEGWLCPGCDCKVDCIELLSDFQGSNLSVLDKWEKV 418

Query: 1668 FPEEAAAAASGMKMXXXXXXXXXXXXXXXXXXDKPEVDNMVLGXXXXXXXXXXXXXXXEP 1489
            FPEEAAAAASGMKM                  DKPEVDNMVLG               EP
Sbjct: 419  FPEEAAAAASGMKMDDYSGLPSDDSDDDDYDPDKPEVDNMVLGEESSSDESDYFSASEEP 478

Query: 1488 VSALKADHILGLXXXXXXXXXXXXXXXXXXERVKQXXXXXXXXXXXXXFGGMFHEKEPLG 1309
            VSA+KA+ ILGL                  E  KQ             FG MFHEKEPLG
Sbjct: 479  VSAVKAEQILGLPSDDSEDDDFDPSAADHGELAKQESSSSDFSSDSEDFGAMFHEKEPLG 538

Query: 1308 EEAGCVSSVSTQSNPGVGFIDQILKVGADKKHXXXXXXXXXXXSNDAPVSGKRHVERLDY 1129
            EEAG VSSVSTQSN  VG I  I KVG DK+H           SNDAPVSGKRHVERLDY
Sbjct: 539  EEAGHVSSVSTQSNLAVGSIGPIFKVGRDKRHSLSDELSFLLESNDAPVSGKRHVERLDY 598

Query: 1128 KKLHEETYGDTSDDSSDEDYGETVGSKGRKKSTGKAISVPY-EPETIHKGPDTKDENCNQ 952
            KKLHEETYGDTS DSSDEDYGETVG + RKKSTGKAI VP  EPETIHKG D KDENCNQ
Sbjct: 599  KKLHEETYGDTSSDSSDEDYGETVGPRRRKKSTGKAILVPSNEPETIHKGADIKDENCNQ 658

Query: 951  KDVEMTPVEKIDKKFEIEGSNNMSVDSPRISTKGGSSGKRMSRPYQRLGDGVVQRLLESF 772
            KD EMTPVEKI+KKFEIEGSNNMSVDSPRIST+GGSSGKR  RPYQRLGDG+VQRLLESF
Sbjct: 659  KDFEMTPVEKINKKFEIEGSNNMSVDSPRISTEGGSSGKRTGRPYQRLGDGIVQRLLESF 718

Query: 771  RENQYPKHAVKESLAKELGLRIQQVSKWFENARWSARHSSHMDSRLTGSTSVNGTSLPEI 592
            RENQYPK+ VKESLAKELGLRIQQVSKWFENARWS RHSS MDS++TG+TS+NGT LPEI
Sbjct: 719  RENQYPKNGVKESLAKELGLRIQQVSKWFENARWSCRHSSRMDSKMTGTTSINGTCLPEI 778

Query: 591  SEKVPKLGEQSNLESAICNEEGKMALPQTSAYVDGQHVAGTGEGNSAVGFSPESINGRCI 412
            +EKVPK GEQSNLESA CNEEGKMALPQT+  V+GQH+AGTGEGNSA+ FSP+SINGRC 
Sbjct: 779  NEKVPKHGEQSNLESATCNEEGKMALPQTNPCVEGQHIAGTGEGNSAIDFSPDSINGRCT 838

Query: 411  NVDDQKPDQLSSAEKTSKQDSHVNASKSQSVRRSDRLQARSSN 283
             VD+QKPDQLSSAE+TSKQ S+VNASKSQSVRRS RLQARS N
Sbjct: 839  QVDEQKPDQLSSAEETSKQVSNVNASKSQSVRRSGRLQARSGN 881


>ref|XP_011088187.1| PREDICTED: pathogenesis-related homeodomain protein isoform X1
            [Sesamum indicum] gi|747081793|ref|XP_011088188.1|
            PREDICTED: pathogenesis-related homeodomain protein
            isoform X1 [Sesamum indicum]
          Length = 835

 Score =  497 bits (1280), Expect = e-137
 Identities = 339/880 (38%), Positives = 443/880 (50%), Gaps = 40/880 (4%)
 Frame = -2

Query: 2802 MVKDAQMDPEETGIRKSNAHSVENLKTVDGLTNNADVKSLGLHNIQYLPESANAEPLEQK 2623
            ++K    DPE        ++ +E L+T + L  +     L   N +   E+   E +E+K
Sbjct: 3    LIKIGTQDPE--------SNMIEPLETSENLAQDPKSGPLTPANYKMDSETLVTETMEKK 54

Query: 2622 QVAGDDN-----------------------DDNKLTETEIAAPDLAG---------LEEY 2539
            +V G  N                       D ++  + E   P L           LE  
Sbjct: 55   EVTGSQNFRKNIGSVEEISDQIKETGPNPEDISQNLDAEKEEPPLESAKTLSVAQNLEVI 114

Query: 2538 IQISVSPCENVAMVP-AFSALGSDPQDASMHIDPQQTEPTQQKGAINAGGESGLDKRTPF 2362
             Q  ++  EN+ + P A SA     +  ++HID  +          N+G     D+    
Sbjct: 115  SQNGLTNLENMCISPEAASANHGCGKLETVHIDETK----------NSGQLGTEDRGCSV 164

Query: 2361 QSRKRKSTLTIPVTAR-VLRSRSQXXXXXXXXXXXXXXXDAIEANXXXXXXXXXXXKIPA 2185
            QSRKRK+ L  PVT+  VLRS+SQ                A               K   
Sbjct: 165  QSRKRKAGLKSPVTSSWVLRSKSQEKPKAPEPNENVKEDSANGEKKKRGRKKKPMQKTTV 224

Query: 2184 NEFSRIRAHLRYLLHRIKYEQNLIDAYSGEGWKGQSLEKIKPEKELQRAKSQIFRYKLKI 2005
            NEFSR + HLRYLLHRIKYEQ+LIDAYS EGWKGQSL+K+KPEKELQRAKS I RYKLKI
Sbjct: 225  NEFSRTKTHLRYLLHRIKYEQSLIDAYSAEGWKGQSLDKLKPEKELQRAKSHILRYKLKI 284

Query: 2004 RDLFRQIDLSLAEGKLPESLFDSDGQIDSEDIFCAKCGSKDLTLDNDIILCDGACERGFH 1825
            R L +++D+SLA GKLPESLFDS G+IDSEDIFCAKCGSKDL LDNDIILCDGACERGFH
Sbjct: 285  RALIQRLDMSLAVGKLPESLFDSHGEIDSEDIFCAKCGSKDLPLDNDIILCDGACERGFH 344

Query: 1824 QYCLEPPLLKEDIPPDEEGWLCPGCDCKVDCIELLSDFQGSNLSVLDKWEKVFPEEAAAA 1645
            Q+CLEPPLLKEDIPP +EGW+CPGCDCK+DCI++L DFQG+ +S  D WEK+FP EAAAA
Sbjct: 345  QFCLEPPLLKEDIPPGDEGWICPGCDCKIDCIDMLKDFQGTKISHTDSWEKIFP-EAAAA 403

Query: 1644 ASGMKMXXXXXXXXXXXXXXXXXXDKPEVDNMVLGXXXXXXXXXXXXXXXEPVSALKADH 1465
            ASG  +                  DKP+    V G               +  ++L  + 
Sbjct: 404  ASGKTLDNGSGSSSDDSDDDDYDPDKPDAVEKVEGDESSSDESNYFSASDDLAASLNNEK 463

Query: 1464 ILGLXXXXXXXXXXXXXXXXXXERVKQXXXXXXXXXXXXXFGGMFHEKEPLGEEAGCVSS 1285
             LGL                  ++ +Q              G +  + E  GE+ G +S 
Sbjct: 464  YLGLPSDDSEDDDFDPSALDPDKQAEQESSSSDFTSDSEDLGALLDDTE-AGEDLGHISP 522

Query: 1284 VSTQSNPGVGFIDQILKVGADKKHXXXXXXXXXXXSNDAPVSGKRHVERLDYKKLHEETY 1105
             S Q+    G  ++ +KVG  K+            ++  PVSG+RHVER DYK LH+ETY
Sbjct: 523  SSYQNQSSTGSKEENVKVGGTKRQSLKDELSYLLETSGEPVSGRRHVERWDYKSLHDETY 582

Query: 1104 GDTSDDSSDEDYGETVGSKGRKKSTGKA-ISVPYEPETIHKGPDTKDENCNQKDVEMTPV 928
            G++S DSSDED+ +T   K R+    K  ++ P +          KDEN           
Sbjct: 583  GNSSSDSSDEDFVDTTAPKRRRIDREKTEVTSPNKTPITENNMKAKDEN----------- 631

Query: 927  EKIDKKFEIEGSNNMSVDSPRISTKGGSSGKRMSR-PYQRLGDGVVQRLLESFRENQYPK 751
            +K  K        N+  D+   S+K GS+     R   +RLG+ + QRL  SF ENQYP+
Sbjct: 632  QKESKHLRERTRKNIG-DTIESSSKVGSASTGTKRSANKRLGEAITQRLYASFNENQYPE 690

Query: 750  HAVKESLAKELGLRIQQVSKWFENARWSARHSSHMDSRLTGSTSVNGTSLPEISEKVPKL 571
             AVKE+LAKELGL+IQQVSKWFENARWS +H S +     GS S          EK P+ 
Sbjct: 691  RAVKENLAKELGLKIQQVSKWFENARWSFQHRSRV-----GSNS---------DEKPPE- 735

Query: 570  GEQSNLESAICNEEGKMALPQTSAYVDGQHVAGTGEGNSAVGFSPESINGRCINVD---- 403
              Q  + +   +    M L ++SA    +H+      N A     E+   R         
Sbjct: 736  -PQPTISTDNHSSNQNMELQESSALRAREHITEAKLDNLATNSCKENSGTRDTRKRRAKI 794

Query: 402  DQKPDQLSSAEKTSKQDSHVNASKSQSVRRSDRLQARSSN 283
            DQ PD +   +KT +Q   V+    Q    S + Q RSSN
Sbjct: 795  DQAPDDIHVDDKTQEQKMLVDMRSPQPC--SSKRQTRSSN 832


>ref|XP_009592467.1| PREDICTED: pathogenesis-related homeodomain protein-like [Nicotiana
            tomentosiformis] gi|697167241|ref|XP_009592468.1|
            PREDICTED: pathogenesis-related homeodomain protein-like
            [Nicotiana tomentosiformis]
          Length = 740

 Score =  484 bits (1246), Expect = e-133
 Identities = 306/707 (43%), Positives = 382/707 (54%), Gaps = 10/707 (1%)
 Frame = -2

Query: 2388 SGLDKRTPFQSRKRKSTLTIPVTA-RVLRSRSQXXXXXXXXXXXXXXXDAIEANXXXXXX 2212
            S   ++TP + RKRKST   P+ + R+LRS+S+               +A E        
Sbjct: 54   SQCQEKTPGRPRKRKSTSGTPINSTRLLRSKSKEKSVASEANNTVATHEANEEKKRKRRK 113

Query: 2211 XXXXXKIPANEFSRIRAHLRYLLHRIKYEQNLIDAYSGEGWKGQSLEKIKPEKELQRAKS 2032
                  I  NEF+RIR HLRYLL RIKYEQ LI+AYSGEGWKGQSLEKIK EKELQRAK+
Sbjct: 114  KKQSKHIAVNEFTRIRGHLRYLLQRIKYEQTLIEAYSGEGWKGQSLEKIKLEKELQRAKA 173

Query: 2031 QIFRYKLKIRDLFRQIDLSLAEGKLPESLFDSDGQIDSEDIFCAKCGSKDLTLDNDIILC 1852
             IFRYKLKIRDLF+++D  LA+G+LP SLFD++G+IDSEDIFCAKC +KDL  DNDIILC
Sbjct: 174  HIFRYKLKIRDLFQRLDTLLAQGRLPASLFDNEGEIDSEDIFCAKCSAKDLPADNDIILC 233

Query: 1851 DGACERGFHQYCLEPPLLKEDIPPDEEGWLCPGCDCKVDCIELLSDFQGSNLSVLDKWEK 1672
            DGACERGFHQ CLEPPLLKEDIPPD+EGWLCPGCDCKVDCI+LL+D QG+NLSV D WEK
Sbjct: 234  DGACERGFHQLCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLLNDLQGTNLSVTDSWEK 293

Query: 1671 VFPEEAAAAASGMKMXXXXXXXXXXXXXXXXXXDKPEVDNMVLGXXXXXXXXXXXXXXXE 1492
            V+P+EAAAA SG K+                  + P+V+    G               +
Sbjct: 294  VYPKEAAAAESGEKLDDISGLPSDDSEDDDYNPENPDVEKNDSGDESSSDESDFFSASED 353

Query: 1491 PVSA-LKADHILGLXXXXXXXXXXXXXXXXXXERVKQXXXXXXXXXXXXXFGGMFHEKEP 1315
                  K D ILGL                  E VK               G +      
Sbjct: 354  LEEVPPKDDEILGLPSEDSEDDDYSPDDPDKNEPVKAESSSSDFTSDSEDLGLIVDANRL 413

Query: 1314 LGEEAGCVSSVSTQSNPGVGFIDQILKVGADKKHXXXXXXXXXXXSNDAPVSGKRHVERL 1135
             G+E G  SSV   S P     +   K G  K++           S+   VSGKRH+ERL
Sbjct: 414  PGDEQGVSSSVD-NSRPSSASQEDKPKAGRAKRNSLKVELSDLMLSHSPVVSGKRHIERL 472

Query: 1134 DYKKLHEETYGDTSDDSSDEDYGETVGSKGRKKSTGKAISVPYEPETIHKGPDTKDENCN 955
            DYKKLH+ETYG+ S DSSDED+      K R+  + KA      P +     DTK +N  
Sbjct: 473  DYKKLHDETYGNESSDSSDEDFEGGPSPKVREIRSAKAAMT--SPSS--TPADTKYQNGK 528

Query: 954  QKDVEMTPVEKIDKKFEIEGSNNMSVDSPRISTKGGSSGKRMSRPYQRLGDGVVQRLLES 775
            QK    T    + +K +I G   M    PR      SSGK+     +  G+G ++RL ES
Sbjct: 529  QKGSRHTSDRGLCEKLKIGG---MDTSEPR------SSGKK-----KTYGEGAIKRLYES 574

Query: 774  FRENQYPKHAVKESLAKELGLRIQQVSKWFENARWSARHSSHMDSRLTGSTSVNGTSLPE 595
            F+ENQYP    KE L KELGL   QVSKWFENAR   RHSS  +S ++   S    S P+
Sbjct: 575  FKENQYPDRDAKEKLGKELGLTAHQVSKWFENARHCHRHSSRWNSIMSQKVSKESPSSPD 634

Query: 594  ISEKVPKLGEQSNLESAICNEEGKMALPQTSAYVDGQH-VAGTGEGNSAV-------GFS 439
            I  +      +S   + +CN   KM  P+     +  H +    EG   +          
Sbjct: 635  IMGEPLGTESKSTTNNVLCNGVEKMEPPKQCLNGEKCHAIDNKSEGELLIQEASGKKSRK 694

Query: 438  PESINGRCINVDDQKPDQLSSAEKTSKQDSHVNASKSQSVRRSDRLQ 298
            P++ N    +  D+  D   S +   KQ++ V+   SQ+VRRS RLQ
Sbjct: 695  PKAKN----DSTDRGLDDTPSNKTYKKQNAQVDTPNSQNVRRSSRLQ 737


>ref|XP_009775281.1| PREDICTED: pathogenesis-related homeodomain protein [Nicotiana
            sylvestris] gi|698572952|ref|XP_009775282.1| PREDICTED:
            pathogenesis-related homeodomain protein [Nicotiana
            sylvestris]
          Length = 747

 Score =  480 bits (1235), Expect = e-132
 Identities = 304/702 (43%), Positives = 383/702 (54%), Gaps = 5/702 (0%)
 Frame = -2

Query: 2388 SGLDKRTPFQSRKRKSTLTIPVTA-RVLRSRSQXXXXXXXXXXXXXXXDAIEANXXXXXX 2212
            S   ++TP Q RKRKST   P+++ R+LRS+S+               +A E        
Sbjct: 63   SECQEKTPGQPRKRKSTSGTPISSTRLLRSKSKEKSGASEVNNTVVTDEANEEKKRKRRK 122

Query: 2211 XXXXXKIPANEFSRIRAHLRYLLHRIKYEQNLIDAYSGEGWKGQSLEKIKPEKELQRAKS 2032
                  I  NEF+ IR HLRYLL RIKYEQ LI+AYSGEGWKGQSLEKIK EKEL+RAK+
Sbjct: 123  KKHSKHIAVNEFTSIRGHLRYLLQRIKYEQTLIEAYSGEGWKGQSLEKIKLEKELERAKA 182

Query: 2031 QIFRYKLKIRDLFRQIDLSLAEGKLPESLFDSDGQIDSEDIFCAKCGSKDLTLDNDIILC 1852
             IFRYKLKIRDLF+++D  LA+G+LP SLFD++G+IDSEDIFCAKCG+KDL  DNDIILC
Sbjct: 183  HIFRYKLKIRDLFQRVDALLAQGRLPASLFDNEGEIDSEDIFCAKCGAKDLPADNDIILC 242

Query: 1851 DGACERGFHQYCLEPPLLKEDIPPDEEGWLCPGCDCKVDCIELLSDFQGSNLSVLDKWEK 1672
            DGACERGFHQ CLEPPLLKEDIPPD+EGWLCPGCDCKVDCI+LL+D QG+NLS+ D WEK
Sbjct: 243  DGACERGFHQLCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLLNDLQGTNLSITDSWEK 302

Query: 1671 VFPEEAAAAASGMKMXXXXXXXXXXXXXXXXXXDKPEVDNMVLGXXXXXXXXXXXXXXXE 1492
            V+P+EAAAAASG K+                  + P+V+    G               +
Sbjct: 303  VYPKEAAAAASGEKLDDISGLPSDDSEDDDYNPENPDVEKNDSGDESSSDESDFFSASED 362

Query: 1491 PVSA-LKADHILGLXXXXXXXXXXXXXXXXXXERVKQXXXXXXXXXXXXXFGGMFHEKEP 1315
                  K D IL L                  E  K               G +      
Sbjct: 363  LEEVPPKDDEILALPSEDSEDGDYSPDDPDKDEPAKTESSSSDFTSDSEDLGLIVDTNRL 422

Query: 1314 LGEEAGCVSSVSTQSNPGVGFIDQILKVGADKKHXXXXXXXXXXXSNDAPVSGKRHVERL 1135
             G+E G  SSV   S P +   ++  K G  K++           S    VS KRH+ERL
Sbjct: 423  PGDELGVSSSVD-NSKPSLASQEEKPKGGRAKRNSLNNELSDLMLSYSPLVSCKRHIERL 481

Query: 1134 DYKKLHEETYGDTSDDSSDEDYGETVGSKGRKKSTGKAISVPYEPETIHKGPDTKDENCN 955
            DYKKLH+ETYG+ S DSSDED+      K R+  + KA      P +     DTK ++  
Sbjct: 482  DYKKLHDETYGNESSDSSDEDFEGDPLPKVREIRSAKAART--SPSS--TPADTKYQSGK 537

Query: 954  QKDVEMTPVEKIDKKFEIEGSNNMSVDSPRISTKGGSSGKRMSRPYQRLGDGVVQRLLES 775
            QK V       + K+ +I G   M    P       SSGK+     +  G+G ++RL ES
Sbjct: 538  QK-VSRHTDRGLCKQLKIGG---MDTSEPH------SSGKK-----KTYGEGAIKRLYES 582

Query: 774  FRENQYPKHAVKESLAKELGLRIQQVSKWFENARWSARHSSHMDSRLTGSTSVNGTSLPE 595
            F+ENQYP    KE L KELGL   QVSKWFENAR   RHSS  DS ++   S    S P 
Sbjct: 583  FKENQYPDRDAKEKLGKELGLTAHQVSKWFENARHCHRHSSRWDSIMSQKVSKESPSSPN 642

Query: 594  ISEKVPKLGEQSNLESAICNEEGKMALPQTSAYVDGQHVAGTGEGNSAV--GFSPESING 421
            I  +      +S + + +CN  GK+  P+     +  H  G  EG+  +      +S   
Sbjct: 643  IIGEPLGTESKSTINNVLCNGVGKVEPPKQCLNGENCHAIGKSEGDLLIRETSGKKSRKP 702

Query: 420  RCINVDDQKPDQLSSAEKTSKQ-DSHVNASKSQSVRRSDRLQ 298
            +  N    +    S A KTSK+ ++ V+   SQ+VRRS RLQ
Sbjct: 703  KAKNDTTDRGLDDSPANKTSKKWNAQVDTPNSQNVRRSSRLQ 744


>ref|XP_011088190.1| PREDICTED: pathogenesis-related homeodomain protein isoform X2
            [Sesamum indicum]
          Length = 715

 Score =  456 bits (1174), Expect = e-125
 Identities = 295/739 (39%), Positives = 385/739 (52%), Gaps = 36/739 (4%)
 Frame = -2

Query: 2802 MVKDAQMDPEETGIRKSNAHSVENLKTVDGLTNNADVKSLGLHNIQYLPESANAEPLEQK 2623
            ++K    DPE        ++ +E L+T + L  +     L   N +   E+   E +E+K
Sbjct: 3    LIKIGTQDPE--------SNMIEPLETSENLAQDPKSGPLTPANYKMDSETLVTETMEKK 54

Query: 2622 QVAGDDN-----------------------DDNKLTETEIAAPDLAG---------LEEY 2539
            +V G  N                       D ++  + E   P L           LE  
Sbjct: 55   EVTGSQNFRKNIGSVEEISDQIKETGPNPEDISQNLDAEKEEPPLESAKTLSVAQNLEVI 114

Query: 2538 IQISVSPCENVAMVP-AFSALGSDPQDASMHIDPQQTEPTQQKGAINAGGESGLDKRTPF 2362
             Q  ++  EN+ + P A SA     +  ++HID  +          N+G     D+    
Sbjct: 115  SQNGLTNLENMCISPEAASANHGCGKLETVHIDETK----------NSGQLGTEDRGCSV 164

Query: 2361 QSRKRKSTLTIPVTAR-VLRSRSQXXXXXXXXXXXXXXXDAIEANXXXXXXXXXXXKIPA 2185
            QSRKRK+ L  PVT+  VLRS+SQ                A               K   
Sbjct: 165  QSRKRKAGLKSPVTSSWVLRSKSQEKPKAPEPNENVKEDSANGEKKKRGRKKKPMQKTTV 224

Query: 2184 NEFSRIRAHLRYLLHRIKYEQNLIDAYSGEGWKGQSLEKIKPEKELQRAKSQIFRYKLKI 2005
            NEFSR + HLRYLLHRIKYEQ+LIDAYS EGWKGQSL+K+KPEKELQRAKS I RYKLKI
Sbjct: 225  NEFSRTKTHLRYLLHRIKYEQSLIDAYSAEGWKGQSLDKLKPEKELQRAKSHILRYKLKI 284

Query: 2004 RDLFRQIDLSLAEGKLPESLFDSDGQIDSEDIFCAKCGSKDLTLDNDIILCDGACERGFH 1825
            R L +++D+SLA GKLPESLFDS G+IDSEDIFCAKCGSKDL LDNDIILCDGACERGFH
Sbjct: 285  RALIQRLDMSLAVGKLPESLFDSHGEIDSEDIFCAKCGSKDLPLDNDIILCDGACERGFH 344

Query: 1824 QYCLEPPLLKEDIPPDEEGWLCPGCDCKVDCIELLSDFQGSNLSVLDKWEKVFPEEAAAA 1645
            Q+CLEPPLLKEDIPP +EGW+CPGCDCK+DCI++L DFQG+ +S  D WEK+FP EAAAA
Sbjct: 345  QFCLEPPLLKEDIPPGDEGWICPGCDCKIDCIDMLKDFQGTKISHTDSWEKIFP-EAAAA 403

Query: 1644 ASGMKMXXXXXXXXXXXXXXXXXXDKPEVDNMVLGXXXXXXXXXXXXXXXEPVSALKADH 1465
            ASG  +                  DKP+    V G               +  ++L  + 
Sbjct: 404  ASGKTLDNGSGSSSDDSDDDDYDPDKPDAVEKVEGDESSSDESNYFSASDDLAASLNNEK 463

Query: 1464 ILGLXXXXXXXXXXXXXXXXXXERVKQXXXXXXXXXXXXXFGGMFHEKEPLGEEAGCVSS 1285
             LGL                  ++ +Q              G +  + E  GE+ G +S 
Sbjct: 464  YLGLPSDDSEDDDFDPSALDPDKQAEQESSSSDFTSDSEDLGALLDDTE-AGEDLGHISP 522

Query: 1284 VSTQSNPGVGFIDQILKVGADKKHXXXXXXXXXXXSNDAPVSGKRHVERLDYKKLHEETY 1105
             S Q+    G  ++ +KVG  K+            ++  PVSG+RHVER DYK LH+ETY
Sbjct: 523  SSYQNQSSTGSKEENVKVGGTKRQSLKDELSYLLETSGEPVSGRRHVERWDYKSLHDETY 582

Query: 1104 GDTSDDSSDEDYGETVGSKGRKKSTGKA-ISVPYEPETIHKGPDTKDENCNQKDVEMTPV 928
            G++S DSSDED+ +T   K R+    K  ++ P +          KDEN           
Sbjct: 583  GNSSSDSSDEDFVDTTAPKRRRIDREKTEVTSPNKTPITENNMKAKDEN----------- 631

Query: 927  EKIDKKFEIEGSNNMSVDSPRISTKGGSSGKRMSR-PYQRLGDGVVQRLLESFRENQYPK 751
            +K  K        N+  D+   S+K GS+     R   +RLG+ + QRL  SF ENQYP+
Sbjct: 632  QKESKHLRERTRKNIG-DTIESSSKVGSASTGTKRSANKRLGEAITQRLYASFNENQYPE 690

Query: 750  HAVKESLAKELGLRIQQVS 694
             AVKE+LAKELGL+IQQ++
Sbjct: 691  RAVKENLAKELGLKIQQIT 709


>ref|XP_006346339.1| PREDICTED: pathogenesis-related homeodomain protein-like isoform X1
            [Solanum tuberosum] gi|565359059|ref|XP_006346340.1|
            PREDICTED: pathogenesis-related homeodomain protein-like
            isoform X2 [Solanum tuberosum]
            gi|565359061|ref|XP_006346341.1| PREDICTED:
            pathogenesis-related homeodomain protein-like isoform X3
            [Solanum tuberosum] gi|565359063|ref|XP_006346342.1|
            PREDICTED: pathogenesis-related homeodomain protein-like
            isoform X4 [Solanum tuberosum]
          Length = 798

 Score =  456 bits (1172), Expect = e-124
 Identities = 292/704 (41%), Positives = 374/704 (53%), Gaps = 6/704 (0%)
 Frame = -2

Query: 2415 KGAINAGGESGLDKRTPFQSRKRKSTLTIPVTA-RVLRSRSQXXXXXXXXXXXXXXXDAI 2239
            + A+    +S   ++TP Q RKRKS    P+++ R+LRS+S+               DA 
Sbjct: 44   ENAVQNLNQSEYREKTPGQPRKRKSISGSPISSTRLLRSKSKEKSGASEANNTVVTHDAT 103

Query: 2238 EANXXXXXXXXXXXKIPANEFSRIRAHLRYLLHRIKYEQNLIDAYSGEGWKGQSLEKIKP 2059
            E              I  NEF+RIR HLRYLL RI YEQ LI+AYSGEGWKGQSLEKIK 
Sbjct: 104  EEKKRKRRKKKHSKHIAVNEFTRIRGHLRYLLQRITYEQTLIEAYSGEGWKGQSLEKIKL 163

Query: 2058 EKELQRAKSQIFRYKLKIRDLFRQIDLSLAEGKLPESLFDSDGQIDSEDIFCAKCGSKDL 1879
            EKELQRAK+ IFRYKLKIRDLF+++D  LAEG+LP SLFD++G+IDSEDIFCAKCGS DL
Sbjct: 164  EKELQRAKTHIFRYKLKIRDLFQRLDTLLAEGRLPASLFDNEGEIDSEDIFCAKCGSMDL 223

Query: 1878 TLDNDIILCDGACERGFHQYCLEPPLLKEDIPPDEEGWLCPGCDCKVDCIELLSDFQGSN 1699
              DNDIILCDGACERGFHQ C+EPPLLKEDIPPD+EGWLCPGCDCKVDCI+LL+D QG++
Sbjct: 224  PADNDIILCDGACERGFHQLCVEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLLNDLQGTD 283

Query: 1698 LSVLDKWEKVFPEEAAAAASGMKMXXXXXXXXXXXXXXXXXXDKPEV-DNMVLGXXXXXX 1522
            LSV D WEKV+P+EAAAAASG K+                  + P+V  N          
Sbjct: 284  LSVTDSWEKVYPKEAAAAASGEKLDDISGLPSDDSEDDDYNPETPDVGKNDSEDESSSDE 343

Query: 1521 XXXXXXXXXEPVSALKADHILGLXXXXXXXXXXXXXXXXXXERVKQXXXXXXXXXXXXXF 1342
                        +  K D ILG+                  E VK              F
Sbjct: 344  SDFYSASEDLAEAPPKDDEILGISSEDSEDDDFNPDDPDKDEPVKTESSSSDFTSDSEDF 403

Query: 1341 GGMFHEKEPLGEEAGCVSSVSTQSNPGVGFIDQILKVGADKKHXXXXXXXXXXXSNDAPV 1162
              +       G+E G  SSV   S P     ++  KVG  K +           S+   V
Sbjct: 404  NLIVDTNRLQGDEQGVSSSVD-NSMPNSASQEEKAKVGKAKGNSLKDELSYLMQSDSPLV 462

Query: 1161 SGKRHVERLDYKKLHEETYGDTSDDSSDEDYGETVGSKGRKKSTGK-AISVPYEPETIHK 985
            S KRH+ERLDYKKLH+ETYG+ S +SSDEDY +    K RK    K A++ P        
Sbjct: 463  SAKRHIERLDYKKLHDETYGNGSSESSDEDYDDGPLPKVRKLRNAKGAMTSPSSTPA--- 519

Query: 984  GPDTKDENCNQKDVEMTPVEKIDKKFEIEGSNNMSVDSPRISTKGGSSGKRMSRPYQRLG 805
              D K ++  QK         I +K ++ G+          +++  SSGKR +      G
Sbjct: 520  --DIKHQSGKQKGSGRASDSGISEKLKVGGAG---------TSESPSSGKRKTH-----G 563

Query: 804  DGVVQRLLESFRENQYPKHAVKESLAKELGLRIQQVSKWFENARWSARHSSHMDSRLTGS 625
            +   +RL ESF++NQYP    K  L KELGL   QVSKWFENAR   RHSSH ++ ++  
Sbjct: 564  EVATKRLYESFKDNQYPDRDAKGKLGKELGLTAYQVSKWFENARHCHRHSSHWNTIMSQK 623

Query: 624  TSVNGTSLPEISEKVPKLGEQSNLESAICNEEGKMALPQTSAYVDGQHVAGTGEGNSAV- 448
             S    S  +I  +   LG +SN   A CN  GK+  P+     +  H     E +  + 
Sbjct: 624  VSKESPSKLQIIGE--PLGTESNSIIAFCNGVGKLEQPKQRLNGEKGHAIDKSEEDLFIQ 681

Query: 447  --GFSPESINGRCINVDDQKPDQLSSAEKTSKQDSHVNASKSQS 322
                   S   + +   +Q  +     + + KQ + V  + SQ+
Sbjct: 682  DASGKKSSEPTKKVYTTNQGSEDTPRNKTSKKQKAKVGTANSQN 725


>ref|XP_012836886.1| PREDICTED: homeobox protein HAT3.1 [Erythranthe guttatus]
            gi|848872657|ref|XP_012836887.1| PREDICTED: homeobox
            protein HAT3.1 [Erythranthe guttatus]
            gi|604333260|gb|EYU37611.1| hypothetical protein
            MIMGU_mgv1a001571mg [Erythranthe guttata]
            gi|604333261|gb|EYU37612.1| hypothetical protein
            MIMGU_mgv1a001571mg [Erythranthe guttata]
          Length = 793

 Score =  455 bits (1170), Expect = e-124
 Identities = 308/764 (40%), Positives = 385/764 (50%), Gaps = 58/764 (7%)
 Frame = -2

Query: 2640 EPLEQKQVAGDDNDDNKLTET-EIA--APDLAGLEEYIQISVSP------CENVAMVPAF 2488
            E +EQK+V       N L  T EI+    ++   +E I ++          ENV  +P F
Sbjct: 45   ETVEQKEVTAPQTIVNVLVSTVEISDKTTEIQPKQEDISLNAGAEKQEPLLENVEELPGF 104

Query: 2487 SA-------------LGSDPQDASMHIDPQQTEPTQQKGAINAGGESGLDKRTPFQSRKR 2347
                           LG+    AS   +  + EP Q    I++G     D     QSRKR
Sbjct: 105  ENTEVASNGSTNHENLGTPLGAASDDPNCGKVEPVQIDFTIDSGQIDNEDGAASGQSRKR 164

Query: 2346 KSTLTIPVTAR-VLRSRSQXXXXXXXXXXXXXXXDAIEANXXXXXXXXXXXK-------- 2194
            KS +  PV +   LRS+SQ               + ++A+                    
Sbjct: 165  KSRVKGPVISSWSLRSKSQERPKAPEPDETVKADETVKADETVKADETVKAGSSNGEKKK 224

Query: 2193 -----------IPANEFSRIRAHLRYLLHRIKYEQNLIDAYSGEGWKGQSLEKIKPEKEL 2047
                          NE+SR R HLRYLLHRIKYEQ+LIDAY  EGWKGQSLEK+KPEKEL
Sbjct: 225  KGRKKKQVKNNTTVNEYSRTRTHLRYLLHRIKYEQSLIDAYCTEGWKGQSLEKLKPEKEL 284

Query: 2046 QRAKSQIFRYKLKIRDLFRQIDLSLAEGKLPESLFDSDGQIDSEDIFCAKCGSKDLTLDN 1867
            QRAKS I RYKL+IR LF  +DLSLA GKLP SLFDS G+IDSEDIFCAKCGSK+L LDN
Sbjct: 285  QRAKSHILRYKLRIRALFENLDLSLAVGKLPTSLFDSQGEIDSEDIFCAKCGSKELPLDN 344

Query: 1866 DIILCDGACERGFHQYCLEPPLLKEDIPPDEEGWLCPGCDCKVDCIELLSDFQGSNLSVL 1687
            DIILCDGACERGFHQ+CL+PPLLKE IPP +EGWLCPGCDCKVDCI++L DFQG+ +S+L
Sbjct: 345  DIILCDGACERGFHQFCLDPPLLKEQIPPGDEGWLCPGCDCKVDCIDMLKDFQGTKISIL 404

Query: 1686 DKWEKVFPEEAAAAASGMKMXXXXXXXXXXXXXXXXXXDKPE----------VDNMVLGX 1537
            D WEK+FP EAAAAASG K+                  DKP+           D  V G 
Sbjct: 405  DSWEKIFP-EAAAAASGKKLDDCSGSSSDDAEDDDYDPDKPDADENNVDENNADEKVEGD 463

Query: 1536 XXXXXXXXXXXXXXEPVSALKADHILGLXXXXXXXXXXXXXXXXXXERVKQXXXXXXXXX 1357
                             + L  D   GL                  E+VKQ         
Sbjct: 464  ESSSDESDYFSASDGVAAPLNNDKYEGLPSEDSEDDDFDPSAPDEDEQVKQDSSGSDFTS 523

Query: 1356 XXXXFGGMFHEK--EPLGEEAGCVSSVSTQSNPGVGFIDQILKVGADKKHXXXXXXXXXX 1183
                   +  E   EP G++ G     + Q  P  G  D+  KVG  K+           
Sbjct: 524  DSEDLDALLEENATEP-GQDPG---QTADQKQPSTGSNDENPKVGRMKRTSLKDELVYLM 579

Query: 1182 XSNDAPVSGKRHVERLDYKKLHEETYGDTSDDSSDEDYGETVGSKGRKKSTGKAISVPYE 1003
             ++  PV+GKR V+RLDYKKL +ETYG+ S DSSDED+ +    K RK    K+     E
Sbjct: 580  ETDAQPVAGKRQVKRLDYKKLLDETYGNASSDSSDEDFDDGTTRKRRKIDPEKS-----E 634

Query: 1002 PETIHKGPDTKDENCNQKDVEMTPVEKIDKKFEIEGSNNMSVDSPRISTKGGSSGKRMSR 823
             ++  K P TK  N N  D      ++  K+   + ++  + +SP      GSS     R
Sbjct: 635  RKSRDKTPITK-SNTNTTDENQKASKRSSKRPRKKVADGGTNESP---ANNGSSTTSKKR 690

Query: 822  PYQRLGDGVVQRLLESFRENQYPKHAVKESLAKELGLRIQQVSKWFENARWSARHSSHMD 643
            P +RLG+   QRL  SF ENQYP+ A KE+LA ELG+ ++QVSKWFENARWS  H    +
Sbjct: 691  PLKRLGEATTQRLYVSFSENQYPQRAAKENLANELGITVRQVSKWFENARWSYNHRPQTE 750

Query: 642  SRLTGSTSVNGTSLPEISEKVPKLGEQSN----LESAICNEEGK 523
            S  T          PE    V   G  SN    LE+ + N  G+
Sbjct: 751  SNSTEKKP------PEPQTSVGTEGNNSNQTLGLENVVNNASGE 788


>ref|XP_004230722.1| PREDICTED: pathogenesis-related homeodomain protein [Solanum
            lycopersicum]
          Length = 796

 Score =  452 bits (1164), Expect = e-123
 Identities = 283/633 (44%), Positives = 350/633 (55%), Gaps = 5/633 (0%)
 Frame = -2

Query: 2391 ESGLDKRTPFQSRKRKSTLTIPVTA-RVLRSRSQXXXXXXXXXXXXXXXDAIEANXXXXX 2215
            +S   +++P Q RKRKS    P+++ R+LRS+S+               DA E       
Sbjct: 51   QSEYREKSPGQPRKRKSISGSPISSTRLLRSKSKEKSGASEAKNTVVTHDATEEKKRKRR 110

Query: 2214 XXXXXXKIPANEFSRIRAHLRYLLHRIKYEQNLIDAYSGEGWKGQSLEKIKPEKELQRAK 2035
                   I ANEF+RIR HLRYLL RIKYEQ LI+AYSGEGWKGQSLEKIK EKELQRAK
Sbjct: 111  KKKHSKHIAANEFTRIRGHLRYLLQRIKYEQTLIEAYSGEGWKGQSLEKIKLEKELQRAK 170

Query: 2034 SQIFRYKLKIRDLFRQIDLSLAEGKLPESLFDSDGQIDSEDIFCAKCGSKDLTLDNDIIL 1855
            + IFRYKLKIRDLF+++D  LAEG+LP SLFD++G+IDSEDIFCAKCGS DL  DNDIIL
Sbjct: 171  THIFRYKLKIRDLFQRLDTLLAEGRLPASLFDNEGEIDSEDIFCAKCGSMDLPADNDIIL 230

Query: 1854 CDGACERGFHQYCLEPPLLKEDIPPDEEGWLCPGCDCKVDCIELLSDFQGSNLSVLDKWE 1675
            CDGACERGFHQ C+EPPLLKEDIPPD+EGWLCPGCDCKVDCI+LL+D QG++LSV D WE
Sbjct: 231  CDGACERGFHQLCVEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLLNDLQGTDLSVTDSWE 290

Query: 1674 KVFPEEAAAAASGMKMXXXXXXXXXXXXXXXXXXDKPEV---DNMVLGXXXXXXXXXXXX 1504
            KV+P+EAAAAASG K+                  + P+V   D+                
Sbjct: 291  KVYPKEAAAAASGEKLDDISGLPSDDSEDDDYNPEAPDVGKNDSEDESSSDESESDFYSA 350

Query: 1503 XXXEPVSALKADHILGLXXXXXXXXXXXXXXXXXXERVKQXXXXXXXXXXXXXFGGMFHE 1324
                  +  K D ILGL                  E VK              F  +   
Sbjct: 351  SEDLAEAPTKDDEILGLSSEDSEDDDYNPDDPDKDEPVKTESSSSDFTSDSEDFSLIVDT 410

Query: 1323 KEPLGEEAGCVSSVSTQSNPGVGFIDQILKVGADKKHXXXXXXXXXXXSNDAPVSGKRHV 1144
                G+E G  SSV   S P    + +  KVG  K +           S+   VS KRH+
Sbjct: 411  NRLRGDEQGVSSSVD-NSMPNSVSLKEKAKVGKAKGNSLKDELSYLMQSDSPLVSAKRHI 469

Query: 1143 ERLDYKKLHEETYGDTSDDSSDEDYGETVGSKGRKKSTGK-AISVPYEPETIHKGPDTKD 967
            ERLDYKKLH+ETYG+ S DSSDEDY +    K RK    K A++ P          D K 
Sbjct: 470  ERLDYKKLHDETYGNGSSDSSDEDYDDGPLPKVRKLRNAKGAMAAPSSTPA-----DIKY 524

Query: 966  ENCNQKDVEMTPVEKIDKKFEIEGSNNMSVDSPRISTKGGSSGKRMSRPYQRLGDGVVQR 787
            ++  QK         I +K ++ G+          +++  SSGKR     +  G+   +R
Sbjct: 525  QSGKQKGSGHASDSGISEKLKVGGTG---------TSESPSSGKR-----KTYGEVSTKR 570

Query: 786  LLESFRENQYPKHAVKESLAKELGLRIQQVSKWFENARWSARHSSHMDSRLTGSTSVNGT 607
            L ESF++NQYP    KE L KELGL   QVSKWFENAR   RHS +    ++   S    
Sbjct: 571  LYESFKDNQYPDRDAKEKLGKELGLTAHQVSKWFENARHCHRHSPNWKKIMSHKVSEESP 630

Query: 606  SLPEISEKVPKLGEQSNLESAICNEEGKMALPQ 508
            S  +I  +   LG +SN   A CN   K+  P+
Sbjct: 631  SKSQIIGE--PLGTESNSIIASCNGVEKLEQPK 661


>ref|XP_007200058.1| hypothetical protein PRUPE_ppa023106mg [Prunus persica]
            gi|462395458|gb|EMJ01257.1| hypothetical protein
            PRUPE_ppa023106mg [Prunus persica]
          Length = 1058

 Score =  433 bits (1114), Expect = e-118
 Identities = 291/707 (41%), Positives = 370/707 (52%), Gaps = 47/707 (6%)
 Frame = -2

Query: 2652 SANAEPLEQKQVAGD---DNDDNKLTET-------EIAAPDLAGLEEYIQISVS--PCEN 2509
            S  +EP +QK         ND+ K ++        E   P +  + E   I  S  P E+
Sbjct: 234  SVPSEPAKQKDQLDSVPAQNDEAKTSKAVSSSTVFEQPGPSIEAMTEDSPIGHSEPPLED 293

Query: 2508 VAMVPAFSALGSDPQDASMHIDPQQTEPTQQKGAINAGGESG-LDKRTPFQSRKRK-STL 2335
            ++   +   +   P+D + +   QQ E T  K A+      G  DK+ P +SRKRK  + 
Sbjct: 294  LSKSLSDKEMEPLPEDVTQNSSLQQLE-TASKNALKISSCLGPKDKKNP-KSRKRKYMSR 351

Query: 2334 TIPVTARVLRSRSQXXXXXXXXXXXXXXXDAIEANXXXXXXXXXXXKIP----------- 2188
            +   + RVLRS++                    +N           K             
Sbjct: 352  SFVRSDRVLRSKTGEKEKPKDLKLSNNVATLESSNSIANVSNGEEKKRKKRKNRRDNRAI 411

Query: 2187 ANEFSRIRAHLRYLLHRIKYEQNLIDAYSGEGWKGQSLEKIKPEKELQRAKSQIFRYKLK 2008
            A+EFSRIR HLRYLL+RI YE++LIDAYSGEGWKG SLEK+KPEKELQRA S+I R KLK
Sbjct: 412  ADEFSRIRTHLRYLLNRIGYEKSLIDAYSGEGWKGSSLEKLKPEKELQRATSEILRRKLK 471

Query: 2007 IRDLFRQIDLSLAEGKLPESLFDSDGQIDSEDIFCAKCGSKDLTLDNDIILCDGACERGF 1828
            IRDLF++++   AEG  PESLFDS+GQIDSEDIFC KCGSKD++LDNDIILCDGAC+RGF
Sbjct: 472  IRDLFQRLESLCAEGMFPESLFDSEGQIDSEDIFCGKCGSKDVSLDNDIILCDGACDRGF 531

Query: 1827 HQYCLEPPLLKEDIPPDEEGWLCPGCDCKVDCIELLSDFQGSNLSVLDKWEKVFPEEAAA 1648
            HQ+CLEPPLL EDIPPD+EGWLCPGCDCKVDCI+LL+D QG++LSV D WEKVFPE AAA
Sbjct: 532  HQFCLEPPLLSEDIPPDDEGWLCPGCDCKVDCIDLLNDSQGTDLSVTDSWEKVFPEAAAA 591

Query: 1647 AASGMKMXXXXXXXXXXXXXXXXXXDKPEVDNMVLGXXXXXXXXXXXXXXXEPVSALKAD 1468
            A++G                       PE DN V G                  +    D
Sbjct: 592  ASAGENQDNHGLPSDDSDDNDYDPDG-PETDNKVQGEESSSDESEYASASDGLETPKSND 650

Query: 1467 -HILGLXXXXXXXXXXXXXXXXXXERVKQXXXXXXXXXXXXXFGGMFHEKEPLGEEA-GC 1294
               LGL                  E VKQ              G    +     E+  G 
Sbjct: 651  EQYLGLPSEDSEDDDYNPYAPDVNEDVKQESSSSDFTSDSEDLGAALDDNIMSSEDVEGP 710

Query: 1293 VSSVSTQSNPGVGFIDQILKVGADKKH-----XXXXXXXXXXXSNDAPVSGKRHVERLDY 1129
             S+    S P  G  +Q   +   KKH                   AP+SGKRH+ERLDY
Sbjct: 711  KSTSLDDSKPHRGSGEQ-SSISGQKKHSLKDELISLLESGPGQGESAPLSGKRHIERLDY 769

Query: 1128 KKLHEETYGDTSDDSS-DEDYGETVGSKGRKKSTGKAISVPYEPET--IHKGPDTKDENC 958
            K+LH+E YG+   DSS DED+ +    + RKK TG+  +     +T  I  G  TKD   
Sbjct: 770  KRLHDEAYGNVPTDSSDDEDWNDIATQRKRKKGTGQVANRSPNGKTSNIKNGVITKDIKP 829

Query: 957  NQKDVEMTPVEKIDKKFEIEGSNNMSVDSPRISTKGGSSGKRMS---RPYQRLGDGVVQR 787
            +  + E TP     +K  +E ++N+S  SP+ STK GS+  R       Y RLG+   QR
Sbjct: 830  DVDENENTPRRMPHRKSNVEDTSNLSNKSPKGSTKSGSTSGRAGSSRSTYSRLGEAATQR 889

Query: 786  LLESFRENQYPKHAVKESLAKELGLRIQQ---------VSKWFENAR 673
            L +SF+EN YP  ++KESLA+ELGL  +Q         VSKWFENAR
Sbjct: 890  LCKSFKENHYPDRSMKESLARELGLMAKQVIPSFILASVSKWFENAR 936


>ref|XP_008373076.1| PREDICTED: homeobox protein HAT3.1-like isoform X1 [Malus domestica]
          Length = 1081

 Score =  432 bits (1111), Expect = e-117
 Identities = 302/824 (36%), Positives = 428/824 (51%), Gaps = 46/824 (5%)
 Frame = -2

Query: 2652 SANAEPLEQK-QVAGDDNDDNKLTETEIAAPDLAGLEEY-----IQISVSPCENVAMVPA 2491
            S + EP  QK Q+      +N++T TE AAP     E+           SP  ++ + P 
Sbjct: 270  SVHGEPETQKDQLDSVPAHNNEVTTTE-AAPSSIVFEQSRPCIEAMTQDSPTGHLEL-PL 327

Query: 2490 FSALGSDPQDASMHIDPQQ-TEPTQQKGAINAGGESGLDKRTPFQSRKRKSTLTIPVTAR 2314
              A  S P D  M   P   T+ +  +        +  DK+ P   +K+  + +   + R
Sbjct: 328  KDASKSPPIDKEMEQLPADVTQNSSLEKTEKPSKNAPKDKQNPKSRKKKYVSKSSVGSDR 387

Query: 2313 VLRSRSQXXXXXXXXXXXXXXXDA---------IEANXXXXXXXXXXXKIPANEFSRIRA 2161
            VLRS++                ++         +E             K+  +EFSR+R 
Sbjct: 388  VLRSKTGEKTKNPKLSNDVSTLESSNSVANPSNVEGKRRKKRKKRQLNKVIDDEFSRVRK 447

Query: 2160 HLRYLLHRIKYEQNLIDAYSGEGWKGQSLEKIKPEKELQRAKSQIFRYKLKIRDLFRQID 1981
            HLRYLL+RI YE++LIDAYSGEGWKG SLEK+KPEKELQRA S+I + KLKIRDLF+++D
Sbjct: 448  HLRYLLNRISYEKSLIDAYSGEGWKGSSLEKLKPEKELQRATSEILQRKLKIRDLFQRLD 507

Query: 1980 LSLAEGKLPESLFDSDGQIDSEDIFCAKCGSKDLTLDNDIILCDGACERGFHQYCLEPPL 1801
               +EG  PESLFDS+GQIDSEDIFCAKCGSKD++L NDIILCDGAC+RGFHQ+CLEPPL
Sbjct: 508  SLCSEGMFPESLFDSEGQIDSEDIFCAKCGSKDVSLQNDIILCDGACDRGFHQFCLEPPL 567

Query: 1800 LKEDIPPDEEGWLCPGCDCKVDCIELLSDFQGSNLSVLDKWEKVFPEEAAAAASGMKMXX 1621
            L EDIPPD+EGWLCPGCDCKVDC +LL+D QG++LSV D WEKVFP EAAAAASG     
Sbjct: 568  LSEDIPPDDEGWLCPGCDCKVDCFDLLNDSQGTDLSVADSWEKVFP-EAAAAASGHNQEH 626

Query: 1620 XXXXXXXXXXXXXXXXDKPEVDNMVLGXXXXXXXXXXXXXXXEPVSALK--ADHILGLXX 1447
                            D PE D+ V G               + +   K   +  LGL  
Sbjct: 627  THGLPSDDSDDNDYDPDGPETDDEVQGEESSSDDESKYASASDGLETPKNNDEQYLGLPS 686

Query: 1446 XXXXXXXXXXXXXXXXERVKQXXXXXXXXXXXXXFGGMFHEKEPLGEEAGCVSSVS-TQS 1270
                            E +K+              G    +     E+     S+S  +S
Sbjct: 687  DDSEDDDYNPDAPEVTEELKKESSSSDFTSDSEDLGASLDDNNMFSEDVESPKSMSLDES 746

Query: 1269 NPGVGFIDQILKVGADKK----HXXXXXXXXXXXSNDAPVSGKRHVERLDYKKLHEETYG 1102
             P  G   Q  + G  K+                +  APVSGKRH+ERL+YKKLH+ETYG
Sbjct: 747  GPLRGSGKQSSRRGQKKQPLKDELLSLLESGPGQAGAAPVSGKRHIERLNYKKLHDETYG 806

Query: 1101 DTSDDSS-DEDYGETVGSKGRKKSTGKA--ISVPYEPETIHKGPDTKDENCNQKDVEMTP 931
            +   DSS DE++ +T G + RKK T +A  +S   +   +  G  T +   +  + E TP
Sbjct: 807  NVRTDSSDDEEWNDTAGPRKRKKVTTQAPTMSPNGDSSNVKNGMITNNIKHDLDENENTP 866

Query: 930  ----------VEKIDKKFEIEGSNNMSVDSPRISTKGGSS---GKRMSRPYQRLGDGVVQ 790
                       ++  +K ++E ++N+S  S + ST+  S+   G      Y++LG+   Q
Sbjct: 867  KRTPRRNKNTPKRAHRKSKVEDTSNLSNKSQKGSTQSASTSEQGGSSRSTYRKLGEAATQ 926

Query: 789  RLLESFRENQYPKHAVKESLAKELGLRIQQVSKWFENARWSARHSSHMDSRLTGSTSVNG 610
            RL +SF+EN YP  ++KESLA+ELG+  +QVSKWFENAR   + S      +  S + NG
Sbjct: 927  RLSKSFKENHYPDRSMKESLARELGIMAKQVSKWFENARHFWKVS------VDKSAAGNG 980

Query: 609  TSLPEISEKVPKLGEQSNLESAICNEEGKMALPQTSAYVDGQHVAGTGEGNSAVGFSPES 430
            T LP+ + K  + G+    +S     + K  LP+T+  + G   + +G+       +P+S
Sbjct: 981  TPLPQTNGKQLEKGDTPIGDSDQSGAQNK-ELPRTNDPMTG---SCSGDAKDGELVTPKS 1036

Query: 429  INGRCINVDDQK-------PDQLSSAEKTSKQDSHVNASKSQSV 319
               + I  +++K       PD  +   +T+++ + V   + +S+
Sbjct: 1037 SKRKAITPNNRKRXRKSDDPDPENKTPETNRKGTGVMTRQRKSI 1080


>ref|XP_008373078.1| PREDICTED: homeobox protein HAT3.1-like isoform X3 [Malus domestica]
          Length = 1078

 Score =  429 bits (1104), Expect = e-116
 Identities = 300/812 (36%), Positives = 422/812 (51%), Gaps = 46/812 (5%)
 Frame = -2

Query: 2652 SANAEPLEQK-QVAGDDNDDNKLTETEIAAPDLAGLEEY-----IQISVSPCENVAMVPA 2491
            S + EP  QK Q+      +N++T TE AAP     E+           SP  ++ + P 
Sbjct: 270  SVHGEPETQKDQLDSVPAHNNEVTTTE-AAPSSIVFEQSRPCIEAMTQDSPTGHLEL-PL 327

Query: 2490 FSALGSDPQDASMHIDPQQ-TEPTQQKGAINAGGESGLDKRTPFQSRKRKSTLTIPVTAR 2314
              A  S P D  M   P   T+ +  +        +  DK+ P   +K+  + +   + R
Sbjct: 328  KDASKSPPIDKEMEQLPADVTQNSSLEKTEKPSKNAPKDKQNPKSRKKKYVSKSSVGSDR 387

Query: 2313 VLRSRSQXXXXXXXXXXXXXXXDA---------IEANXXXXXXXXXXXKIPANEFSRIRA 2161
            VLRS++                ++         +E             K+  +EFSR+R 
Sbjct: 388  VLRSKTGEKTKNPKLSNDVSTLESSNSVANPSNVEGKRRKKRKKRQLNKVIDDEFSRVRK 447

Query: 2160 HLRYLLHRIKYEQNLIDAYSGEGWKGQSLEKIKPEKELQRAKSQIFRYKLKIRDLFRQID 1981
            HLRYLL+RI YE++LIDAYSGEGWKG SLEK+KPEKELQRA S+I + KLKIRDLF+++D
Sbjct: 448  HLRYLLNRISYEKSLIDAYSGEGWKGSSLEKLKPEKELQRATSEILQRKLKIRDLFQRLD 507

Query: 1980 LSLAEGKLPESLFDSDGQIDSEDIFCAKCGSKDLTLDNDIILCDGACERGFHQYCLEPPL 1801
               +EG  PESLFDS+GQIDSEDIFCAKCGSKD++L NDIILCDGAC+RGFHQ+CLEPPL
Sbjct: 508  SLCSEGMFPESLFDSEGQIDSEDIFCAKCGSKDVSLQNDIILCDGACDRGFHQFCLEPPL 567

Query: 1800 LKEDIPPDEEGWLCPGCDCKVDCIELLSDFQGSNLSVLDKWEKVFPEEAAAAASGMKMXX 1621
            L EDIPPD+EGWLCPGCDCKVDC +LL+D QG++LSV D WEKVFP EAAAAASG     
Sbjct: 568  LSEDIPPDDEGWLCPGCDCKVDCFDLLNDSQGTDLSVADSWEKVFP-EAAAAASGHNQEH 626

Query: 1620 XXXXXXXXXXXXXXXXDKPEVDNMVLGXXXXXXXXXXXXXXXEPVSALK--ADHILGLXX 1447
                            D PE D+ V G               + +   K   +  LGL  
Sbjct: 627  THGLPSDDSDDNDYDPDGPETDDEVQGEESSSDDESKYASASDGLETPKNNDEQYLGLPS 686

Query: 1446 XXXXXXXXXXXXXXXXERVKQXXXXXXXXXXXXXFGGMFHEKEPLGEEAGCVSSVS-TQS 1270
                            E +K+              G    +     E+     S+S  +S
Sbjct: 687  DDSEDDDYNPDAPEVTEELKKESSSSDFTSDSEDLGASLDDNNMFSEDVESPKSMSLDES 746

Query: 1269 NPGVGFIDQILKVGADKK----HXXXXXXXXXXXSNDAPVSGKRHVERLDYKKLHEETYG 1102
             P  G   Q  + G  K+                +  APVSGKRH+ERL+YKKLH+ETYG
Sbjct: 747  GPLRGSGKQSSRRGQKKQPLKDELLSLLESGPGQAGAAPVSGKRHIERLNYKKLHDETYG 806

Query: 1101 DTSDDSS-DEDYGETVGSKGRKKSTGKA--ISVPYEPETIHKGPDTKDENCNQKDVEMTP 931
            +   DSS DE++ +T G + RKK T +A  +S   +   +  G  T +   +  + E TP
Sbjct: 807  NVRTDSSDDEEWNDTAGPRKRKKVTTQAPTMSPNGDSSNVKNGMITNNIKHDLDENENTP 866

Query: 930  ----------VEKIDKKFEIEGSNNMSVDSPRISTKGGSS---GKRMSRPYQRLGDGVVQ 790
                       ++  +K ++E ++N+S  S + ST+  S+   G      Y++LG+   Q
Sbjct: 867  KRTPRRNKNTPKRAHRKSKVEDTSNLSNKSQKGSTQSASTSEQGGSSRSTYRKLGEAATQ 926

Query: 789  RLLESFRENQYPKHAVKESLAKELGLRIQQVSKWFENARWSARHSSHMDSRLTGSTSVNG 610
            RL +SF+EN YP  ++KESLA+ELG+  +QVSKWFENAR   + S      +  S + NG
Sbjct: 927  RLSKSFKENHYPDRSMKESLARELGIMAKQVSKWFENARHFWKVS------VDKSAAGNG 980

Query: 609  TSLPEISEKVPKLGEQSNLESAICNEEGKMALPQTSAYVDGQHVAGTGEGNSAVGFSPES 430
            T LP+ + K  + G+    +S     + K  LP+T+  + G   + +G+       +P+S
Sbjct: 981  TPLPQTNGKQLEKGDTPIGDSDQSGAQNK-ELPRTNDPMTG---SCSGDAKDGELVTPKS 1036

Query: 429  INGRCINVDDQK-------PDQLSSAEKTSKQ 355
               + I  +++K       PD  +   +T+++
Sbjct: 1037 SKRKAITPNNRKRXRKSDDPDPENKTPETNRK 1068


>ref|XP_008373077.1| PREDICTED: homeobox protein HAT3.1-like isoform X2 [Malus domestica]
          Length = 1081

 Score =  429 bits (1104), Expect = e-116
 Identities = 300/812 (36%), Positives = 422/812 (51%), Gaps = 46/812 (5%)
 Frame = -2

Query: 2652 SANAEPLEQK-QVAGDDNDDNKLTETEIAAPDLAGLEEY-----IQISVSPCENVAMVPA 2491
            S + EP  QK Q+      +N++T TE AAP     E+           SP  ++ + P 
Sbjct: 270  SVHGEPETQKDQLDSVPAHNNEVTTTE-AAPSSIVFEQSRPCIEAMTQDSPTGHLEL-PL 327

Query: 2490 FSALGSDPQDASMHIDPQQ-TEPTQQKGAINAGGESGLDKRTPFQSRKRKSTLTIPVTAR 2314
              A  S P D  M   P   T+ +  +        +  DK+ P   +K+  + +   + R
Sbjct: 328  KDASKSPPIDKEMEQLPADVTQNSSLEKTEKPSKNAPKDKQNPKSRKKKYVSKSSVGSDR 387

Query: 2313 VLRSRSQXXXXXXXXXXXXXXXDA---------IEANXXXXXXXXXXXKIPANEFSRIRA 2161
            VLRS++                ++         +E             K+  +EFSR+R 
Sbjct: 388  VLRSKTGEKTKNPKLSNDVSTLESSNSVANPSNVEGKRRKKRKKRQLNKVIDDEFSRVRK 447

Query: 2160 HLRYLLHRIKYEQNLIDAYSGEGWKGQSLEKIKPEKELQRAKSQIFRYKLKIRDLFRQID 1981
            HLRYLL+RI YE++LIDAYSGEGWKG SLEK+KPEKELQRA S+I + KLKIRDLF+++D
Sbjct: 448  HLRYLLNRISYEKSLIDAYSGEGWKGSSLEKLKPEKELQRATSEILQRKLKIRDLFQRLD 507

Query: 1980 LSLAEGKLPESLFDSDGQIDSEDIFCAKCGSKDLTLDNDIILCDGACERGFHQYCLEPPL 1801
               +EG  PESLFDS+GQIDSEDIFCAKCGSKD++L NDIILCDGAC+RGFHQ+CLEPPL
Sbjct: 508  SLCSEGMFPESLFDSEGQIDSEDIFCAKCGSKDVSLQNDIILCDGACDRGFHQFCLEPPL 567

Query: 1800 LKEDIPPDEEGWLCPGCDCKVDCIELLSDFQGSNLSVLDKWEKVFPEEAAAAASGMKMXX 1621
            L EDIPPD+EGWLCPGCDCKVDC +LL+D QG++LSV D WEKVFP EAAAAASG     
Sbjct: 568  LSEDIPPDDEGWLCPGCDCKVDCFDLLNDSQGTDLSVADSWEKVFP-EAAAAASGHNQEH 626

Query: 1620 XXXXXXXXXXXXXXXXDKPEVDNMVLGXXXXXXXXXXXXXXXEPVSALK--ADHILGLXX 1447
                            D PE D+ V G               + +   K   +  LGL  
Sbjct: 627  THGLPSDDSDDNDYDPDGPETDDEVQGEESSSDDESKYASASDGLETPKNNDEQYLGLPS 686

Query: 1446 XXXXXXXXXXXXXXXXERVKQXXXXXXXXXXXXXFGGMFHEKEPLGEEAGCVSSVS-TQS 1270
                            E +K+              G    +     E+     S+S  +S
Sbjct: 687  DDSEDDDYNPDAPEVTEELKKESSSSDFTSDSEDLGASLDDNNMFSEDVESPKSMSLDES 746

Query: 1269 NPGVGFIDQILKVGADKK----HXXXXXXXXXXXSNDAPVSGKRHVERLDYKKLHEETYG 1102
             P  G   Q  + G  K+                +  APVSGKRH+ERL+YKKLH+ETYG
Sbjct: 747  GPLRGSGKQSSRRGQKKQPLKDELLSLLESGPGQAGAAPVSGKRHIERLNYKKLHDETYG 806

Query: 1101 DTSDDSS-DEDYGETVGSKGRKKSTGKA--ISVPYEPETIHKGPDTKDENCNQKDVEMTP 931
            +   DSS DE++ +T G + RKK T +A  +S   +   +  G  T +   +  + E TP
Sbjct: 807  NVRTDSSDDEEWNDTAGPRKRKKVTTQAPTMSPNGDSSNVKNGMITNNIKHDLDENENTP 866

Query: 930  ----------VEKIDKKFEIEGSNNMSVDSPRISTKGGSS---GKRMSRPYQRLGDGVVQ 790
                       ++  +K ++E ++N+S  S + ST+  S+   G      Y++LG+   Q
Sbjct: 867  KRTPRRNKNTPKRAHRKSKVEDTSNLSNKSQKGSTQSASTSEQGGSSRSTYRKLGEAATQ 926

Query: 789  RLLESFRENQYPKHAVKESLAKELGLRIQQVSKWFENARWSARHSSHMDSRLTGSTSVNG 610
            RL +SF+EN YP  ++KESLA+ELG+  +QVSKWFENAR   + S      +  S + NG
Sbjct: 927  RLSKSFKENHYPDRSMKESLARELGIMAKQVSKWFENARHFWKVS------VDKSAAGNG 980

Query: 609  TSLPEISEKVPKLGEQSNLESAICNEEGKMALPQTSAYVDGQHVAGTGEGNSAVGFSPES 430
            T LP+ + K  + G+    +S     + K  LP+T+  + G   + +G+       +P+S
Sbjct: 981  TPLPQTNGKQLEKGDTPIGDSDQSGAQNK-ELPRTNDPMTG---SCSGDAKDGELVTPKS 1036

Query: 429  INGRCINVDDQK-------PDQLSSAEKTSKQ 355
               + I  +++K       PD  +   +T+++
Sbjct: 1037 SKRKAITPNNRKRXRKSDDPDPENKTPETNRK 1068


>ref|XP_002527467.1| Homeobox protein HAT3.1, putative [Ricinus communis]
            gi|223533107|gb|EEF34865.1| Homeobox protein HAT3.1,
            putative [Ricinus communis]
          Length = 896

 Score =  427 bits (1099), Expect = e-116
 Identities = 295/761 (38%), Positives = 388/761 (50%), Gaps = 16/761 (2%)
 Frame = -2

Query: 2520 PCENVAMVPAFSALGSDPQDASMHIDPQQTEPTQQKGAINAGGESGLDKRTPFQSRKRKS 2341
            P  N   VPA   LG        H +  Q+E    K A++     G   +T  +SRK+  
Sbjct: 153  PPNNEMKVPASEKLGPPHDAEDKHWNGTQSE-ILSKDAVSNSSRLGRRVKTTAKSRKKYM 211

Query: 2340 TLTIPVTARVLRSRSQXXXXXXXXXXXXXXXDAIEANXXXXXXXXXXXKIPANEFSRIRA 2161
               +  + RV++ RSQ                +                + A+E+S IR 
Sbjct: 212  LRCLRRSDRVMQYRSQEKPKAPESSTNLPNVSSNVEKTRKKKKKRERKSVEADEYSIIRK 271

Query: 2160 HLRYLLHRIKYEQNLIDAYSGEGWKGQSLEKIKPEKELQRAKSQIFRYKLKIRDLFRQID 1981
            +LRYLL+RI YEQ+LI AYS EGWKG SLEK+KPEKELQRA S+I R K KIRDLF++ID
Sbjct: 272  NLRYLLNRIGYEQSLITAYSAEGWKGLSLEKLKPEKELQRATSEILRRKSKIRDLFQRID 331

Query: 1980 LSLAEGKLPESLFDSDGQIDSEDIFCAKCGSKDLTLDNDIILCDGACERGFHQYCLEPPL 1801
                EG+ PESLFDSDGQI SEDIFCAKCGSKDLT DNDIILCDGAC+RGFHQYCL PPL
Sbjct: 332  SLCGEGRFPESLFDSDGQISSEDIFCAKCGSKDLTADNDIILCDGACDRGFHQYCLVPPL 391

Query: 1800 LKEDIPPDEEGWLCPGCDCKVDCIELLSDFQGSNLSVLDKWEKVFPEEAAAAASGMKMXX 1621
            LKEDIPPD++GWLCPGCDCKVDCI+LL++ QG+N+S+ D WEKVFPE   AAA G     
Sbjct: 392  LKEDIPPDDQGWLCPGCDCKVDCIDLLNESQGTNISISDSWEKVFPE---AAAPGQNPDQ 448

Query: 1620 XXXXXXXXXXXXXXXXDKPEVDNMVLGXXXXXXXXXXXXXXXEPVSALKAD-HILGLXXX 1444
                            D PE+D    G               + + A   D   LGL   
Sbjct: 449  NFGPPSDDSDDNDYDPDIPEIDEKSQGDESSSDDSDDSDFTSDELEAPPGDKQQLGLSSE 508

Query: 1443 XXXXXXXXXXXXXXXERVKQXXXXXXXXXXXXXFGGMFHEKEPLGEEAGCVSSVSTQSNP 1264
                           + VK+                     E  GE+   + SV T+ + 
Sbjct: 509  DSGDDDYDPDAPDLDDIVKEESSSSDFTSDSEDLAATLDNNELSGEDERRI-SVGTRGDS 567

Query: 1263 GVGFIDQILKVGADKKHXXXXXXXXXXXSN-----DAPVSGKRHVERLDYKKLHEETYGD 1099
                  +  K G  KK             N      AP+SGKR+VERLDYKKL++ETYG+
Sbjct: 568  ----TKEGSKRGRKKKQSLQSELLSIEEPNPSQDGSAPISGKRNVERLDYKKLYDETYGN 623

Query: 1098 TSDDSS-DEDYGETVGSKGRKKSTGKAISVPYEPETIHKGPDTKDENCNQKDVEMTPVEK 922
             S DSS DED+ + VG+  R+KST  A+       ++    DT  ++   K+ E  P  K
Sbjct: 624  VSSDSSDDEDFTDDVGAVKRRKSTQAALGSANGNASV---TDTGKQDL--KETEYVP--K 676

Query: 921  IDKKFEIEGSNNMSVDSPRISTKGGSSGKRMSRP--YQRLGDGVVQRLLESFRENQYPKH 748
              ++  I  + +++       T   SS  +  RP  Y+RLG+ V + L  SF+ENQYP  
Sbjct: 677  RSRQRLISENTSITPTKAHEGTSPSSSCGKTVRPSGYRRLGETVTKGLYRSFKENQYPDR 736

Query: 747  AVKESLAKELGLRIQQVSKWFENARWSARHSSHMDSRLTGSTSVNGTSLPE----ISEKV 580
              KE LA+ELG+  QQV+KWFENARWS  HSS MD+   G T  N + + +    + E  
Sbjct: 737  DRKEHLAEELGITYQQVTKWFENARWSFNHSSSMDANRIGKTPENNSPVSKTTTILLESA 796

Query: 579  PKLGEQSNLESAICNEEG-KMALPQTSAYVDG--QHVAGTGEGNSAVGFSPESINGRCIN 409
            P+    + ++SA   EE  K+       YV+   + V G  +  +    +P+S   R  N
Sbjct: 797  PETVSGAAIDSAAQREESPKIGDAMVEIYVEDARETVLGIPKCCAQNSKTPKS-RKRKHN 855

Query: 408  VDDQKPDQLSSAEKTSKQDSHVNASKSQSVRRSDRLQARSS 286
              D+  D  S  E+   + +  N  K+Q  R   R+    S
Sbjct: 856  SGDRLSDLESKKEEA--KIAPANLPKAQETRVGGRVTRSKS 894


>ref|XP_007042568.1| Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain,
            putative isoform 1 [Theobroma cacao]
            gi|590687101|ref|XP_007042569.1| Homeodomain-like protein
            with RING/FYVE/PHD-type zinc finger domain, putative
            isoform 1 [Theobroma cacao] gi|508706503|gb|EOX98399.1|
            Homeodomain-like protein with RING/FYVE/PHD-type zinc
            finger domain, putative isoform 1 [Theobroma cacao]
            gi|508706504|gb|EOX98400.1| Homeodomain-like protein with
            RING/FYVE/PHD-type zinc finger domain, putative isoform 1
            [Theobroma cacao]
          Length = 950

 Score =  410 bits (1053), Expect = e-111
 Identities = 271/689 (39%), Positives = 353/689 (51%), Gaps = 30/689 (4%)
 Frame = -2

Query: 2553 GLEEYIQISVSPCENVAMVPAFSALGSDP--------QDASMHIDPQQTEPTQQKGAINA 2398
            G+   IQ S SP      +P   A G+          +D + +   +Q E T+ K  +  
Sbjct: 271  GVTNVIQSSKSPLVEPLGLPQEFAQGNPSTQQSGLPCEDMAQNSGVEQHE-TKPKNLLEN 329

Query: 2397 GGESGLDKRTPFQSRKRKSTLTIPVTARVLRSRSQXXXXXXXXXXXXXXXDAIEANXXXX 2218
             G     K T    +K+    ++  + RVLRS+ Q                + E      
Sbjct: 330  SGRRRNGK-TSKTIKKKYMLRSLRSSDRVLRSKLQEKPKATESSNNLADVGSSEQQKRRK 388

Query: 2217 XXXXXXXKIPANEFSRIRAHLRYLLHRIKYEQNLIDAYSGEGWKGQSLEKIKPEKELQRA 2038
                   +  A+EFSRIR HLRYLL+RI YE++LI AYS EGWKG SLEK+KPEKELQRA
Sbjct: 389  RRRRKANREVADEFSRIRTHLRYLLNRINYERSLIAAYSTEGWKGLSLEKLKPEKELQRA 448

Query: 2037 KSQIFRYKLKIRDLFRQIDLSLAEGKLPESLFDSDGQIDSEDIFCAKCGSKDLTLDNDII 1858
             S+I R KLKIRDLF+ ID   AEGKLPESLFDS+GQIDSEDIFCAKCGSKDL+ +NDII
Sbjct: 449  TSEILRRKLKIRDLFQHIDSLCAEGKLPESLFDSEGQIDSEDIFCAKCGSKDLSANNDII 508

Query: 1857 LCDGACERGFHQYCLEPPLLKEDIPPDEEGWLCPGCDCKVDCIELLSDFQGSNLSVLDKW 1678
            LCDGAC+RGFHQYCL+PPLLKEDIPPD+EGWLCPGCDCKVDCIEL+++ QG++ S+ D W
Sbjct: 509  LCDGACDRGFHQYCLQPPLLKEDIPPDDEGWLCPGCDCKVDCIELVNESQGTSFSITDSW 568

Query: 1677 EKVFPEEAAAAASGMKMXXXXXXXXXXXXXXXXXXDKPEVDNMVLGXXXXXXXXXXXXXX 1498
            EKVFPE AA AA+G                     D  E D    G              
Sbjct: 569  EKVFPE-AAVAAAGQNQDPNFGLPSDDSDDNDYNPDGSETDEKDHGDESSSEESEFTSTS 627

Query: 1497 XEPVSALKADHILGLXXXXXXXXXXXXXXXXXXERVKQXXXXXXXXXXXXXFGGMFHE-- 1324
             E     K D  LGL                  E VK                 M  E  
Sbjct: 628  EELEVPAKVDQYLGLPSDDSEDDDYDPDGPNHDEVVKPESSSSDFSSDSEDLDAMLEEDI 687

Query: 1323 --KEPLGEEAGCVSSVSTQSNPGVG----FIDQILKVGADKKHXXXXXXXXXXXSNDAPV 1162
              ++  G  A      S +  P +G      D++L +                  + + +
Sbjct: 688  TSQKDEGPMANSAPRDSKRRKPKLGEKESMNDELLSIMEPASEQ-----------DGSAI 736

Query: 1161 SGKRHVERLDYKKLHEETYGDT-SDDSSDEDYGETVGSKGRKKSTGKAISVPYE------ 1003
            S KR +ERLDYK+L++ETYG+  S  S DED+ +    + R K T +  S P        
Sbjct: 737  SKKRSIERLDYKRLYDETYGNVPSSSSDDEDWSDITAPRKRNKCTAEVASAPENGNVSVS 796

Query: 1002 -----PETIHKGPDTKDENCNQKDVEMTPVEKIDKK-FEIEGSNNMSVDSPRISTKGGSS 841
                  + + + P+  +    +K  +M+  +  D    EI+G+ ++S          GSS
Sbjct: 797  RTVSVSDGLKQNPEETEHKPRRKTRQMSRFKDTDSSPAEIQGNTSVS----------GSS 846

Query: 840  GKRM-SRPYQRLGDGVVQRLLESFRENQYPKHAVKESLAKELGLRIQQVSKWFENARWSA 664
            GK+  S  Y+RLG+ V QRL +SF+ENQYP  A K+SLAKEL +  QQVSKWF+NARWS 
Sbjct: 847  GKKAGSSTYKRLGEAVKQRLYKSFKENQYPDRATKQSLAKELDMTFQQVSKWFDNARWSF 906

Query: 663  RHSSHMDSRLTGSTSVNGTSLPEISEKVP 577
             +S       +  T  N  S  +I+  +P
Sbjct: 907  NNSPS-----SHETIANNASEKDITSSLP 930


>emb|CDP08734.1| unnamed protein product [Coffea canephora]
          Length = 296

 Score =  390 bits (1003), Expect = e-105
 Identities = 211/296 (71%), Positives = 224/296 (75%), Gaps = 3/296 (1%)
 Frame = -2

Query: 4371 MLRESEKRNSSME-EDQGGNQILEINLISAQGLKTPSGTRRRMHTYAVTWIDPTEKLRTR 4195
            M RE EKRNSSME EDQ G+QILEINLISAQGLKTPSG+RRRMHTYA+ W+DPT KLRTR
Sbjct: 1    MYREREKRNSSMEREDQEGDQILEINLISAQGLKTPSGSRRRMHTYALAWVDPTAKLRTR 60

Query: 4194 TDRVGAENPTWNDKFLFRVSSHFLACETSGVTVEIYAVGYIRDYLIGTVRFLLSSCLGKF 4015
            TDRVGAENPTWNDKFLFRVSS FLACETSGVTVEIYAVGYIRDYLIGTVRFLLSSCLGKF
Sbjct: 61   TDRVGAENPTWNDKFLFRVSSRFLACETSGVTVEIYAVGYIRDYLIGTVRFLLSSCLGKF 120

Query: 4014 PTSAD--AIGTPAFTAVQIRRPSGRFHGVLNIAASVSSSTCSDFVLFGRASAISFRDLMG 3841
            P+SAD  AIGTPAFTAVQI+RPSGRFHGVLNIAASV SS CSDF +F  ASAISFRDL+G
Sbjct: 121  PSSADAIAIGTPAFTAVQIQRPSGRFHGVLNIAASVCSSACSDFEIFSGASAISFRDLVG 180

Query: 3840 AELEKEKEVDXXXXXXXXXXXXXXXXXXXXXXSCXXXXXXXXXXXXXXXXXXXXXXXXXX 3661
            AELEKEKEVD                      SC                          
Sbjct: 181  AELEKEKEVDRRQRRRLSRIGSSSSVRSSGGESCDFDFSSLDLSSDGAESTTSSSSTASN 240

Query: 3660 ALKEWNGVRTEVAEKLKEMKNEXXXXXXXXXLVLQRKIRFCQSDHNLRFWEESLES 3493
            ALKEWNGVRTEVA+KLKEMKN+         L+LQR++RFCQSD NLRFWEESLES
Sbjct: 241  ALKEWNGVRTEVAQKLKEMKNKGGGEGLLCGLMLQRRVRFCQSDQNLRFWEESLES 296


>emb|CDP17419.1| unnamed protein product [Coffea canephora]
          Length = 293

 Score =  389 bits (999), Expect = e-104
 Identities = 207/293 (70%), Positives = 217/293 (74%)
 Frame = -2

Query: 4371 MLRESEKRNSSMEEDQGGNQILEINLISAQGLKTPSGTRRRMHTYAVTWIDPTEKLRTRT 4192
            M RE EKRNS MEEDQG NQILEINLISAQGLKTPSG+RRRM TYA+ W+DP  KLRTRT
Sbjct: 1    MFREREKRNSLMEEDQGENQILEINLISAQGLKTPSGSRRRMQTYALAWVDPATKLRTRT 60

Query: 4191 DRVGAENPTWNDKFLFRVSSHFLACETSGVTVEIYAVGYIRDYLIGTVRFLLSSCLGKFP 4012
            DRVGAENPTWN+ FLFRVSSHFLACE SGVTVEIYAVGYIRDYLIGTVRFLLSSCLGKFP
Sbjct: 61   DRVGAENPTWNELFLFRVSSHFLACEPSGVTVEIYAVGYIRDYLIGTVRFLLSSCLGKFP 120

Query: 4011 TSADAIGTPAFTAVQIRRPSGRFHGVLNIAASVSSSTCSDFVLFGRASAISFRDLMGAEL 3832
            +SADAIGTPAFTAVQIRRPSGRFHGVLNIAASV SSTCSDF +F  ASAISFRDLMGAE+
Sbjct: 121  SSADAIGTPAFTAVQIRRPSGRFHGVLNIAASVCSSTCSDFEIFSGASAISFRDLMGAEI 180

Query: 3831 EKEKEVDXXXXXXXXXXXXXXXXXXXXXXSCXXXXXXXXXXXXXXXXXXXXXXXXXXALK 3652
            +KEKE D                      SC                           LK
Sbjct: 181  KKEKEDDRRRRRRLSRIGSSRSMQSCGGESCDFDFSSLDLSSDGAESTTSSSSTASNVLK 240

Query: 3651 EWNGVRTEVAEKLKEMKNEXXXXXXXXXLVLQRKIRFCQSDHNLRFWEESLES 3493
            EWNGVRTEVA KLKE+KNE         L+LQRKIRFC SD NLRFWEESLES
Sbjct: 241  EWNGVRTEVAGKLKELKNEGGGGGLLCGLMLQRKIRFCPSDPNLRFWEESLES 293


>ref|XP_012093068.1| PREDICTED: homeobox protein HAT3.1 [Jatropha curcas]
          Length = 1015

 Score =  328 bits (842), Expect = 2e-86
 Identities = 182/384 (47%), Positives = 236/384 (61%), Gaps = 17/384 (4%)
 Frame = -2

Query: 2748 AHSVENLKTV------DGLTNNADVKSLGLHNIQYLPESANAEPLEQKQVAGDDNDDNKL 2587
            AH  +N  T+      +   N     S     +++  + A  +PLE+ +V   D   ++L
Sbjct: 165  AHIAKNSLTMGLELPCEDAINRCHQLSTSEQKVEFASDDATCDPLEESKVPASDLLRDEL 224

Query: 2586 TETE---IAAPDLAGLEEYIQISVSPCENVAMVPAFSALGS------DPQDASM--HIDP 2440
             E             L   +    SP E++ M P+ S + +      +P   +M  H++ 
Sbjct: 225  VEINNELSCCTATRHLGTQLTTKSSPLEHLGM-PSDSEINTCATEKLEPPHDNMDNHLNL 283

Query: 2439 QQTEPTQQKGAINAGGESGLDKRTPFQSRKRKSTLTIPVTARVLRSRSQXXXXXXXXXXX 2260
            QQ++   +  +IN+       KRT   +RK+    ++  + RV +SRSQ           
Sbjct: 284  QQSDTPSKDVSINSSRVGVRVKRTAKSTRKKYVLRSLRRSDRVRQSRSQEKPKGPDPNAD 343

Query: 2259 XXXXDAIEANXXXXXXXXXXXKIPANEFSRIRAHLRYLLHRIKYEQNLIDAYSGEGWKGQ 2080
                 +                +  +E+SRIR HLRYLL+RI YEQ+LI AYS EGWKG 
Sbjct: 344  MANASSNIEKTRKKRKKRQRKSVEGDEYSRIRKHLRYLLNRISYEQSLITAYSAEGWKGL 403

Query: 2079 SLEKIKPEKELQRAKSQIFRYKLKIRDLFRQIDLSLAEGKLPESLFDSDGQIDSEDIFCA 1900
            SLEK+KPEKELQRA S+I R KLKIRDLF+++D   AEG+LPESLFDSDGQI SEDIFCA
Sbjct: 404  SLEKLKPEKELQRATSEILRRKLKIRDLFQRVDSLCAEGRLPESLFDSDGQISSEDIFCA 463

Query: 1899 KCGSKDLTLDNDIILCDGACERGFHQYCLEPPLLKEDIPPDEEGWLCPGCDCKVDCIELL 1720
            KCGSKD+T DNDIILCDGAC+RGFHQ+CL PPLLKEDIPPD+EGWLCPGCDCKVDCIELL
Sbjct: 464  KCGSKDMTADNDIILCDGACDRGFHQFCLLPPLLKEDIPPDDEGWLCPGCDCKVDCIELL 523

Query: 1719 SDFQGSNLSVLDKWEKVFPEEAAA 1648
            +D QG+N+S+ D+WEKVFPE AAA
Sbjct: 524  NDSQGTNISISDRWEKVFPEAAAA 547



 Score =  134 bits (338), Expect = 6e-28
 Identities = 92/281 (32%), Positives = 135/281 (48%), Gaps = 18/281 (6%)
 Frame = -2

Query: 1167 PVSGKRHVERLDYKKLHEETYGDTSDDSSD-EDYGETVGSKGRKKSTGKAISVP------ 1009
            P+SGKR VERLDYKKL++ETYG+ S DSSD ED+ + V  + R+K T  + S        
Sbjct: 698  PISGKRDVERLDYKKLYDETYGNASSDSSDDEDFTDDVEPRKRRKETYGSTSSDSSDDED 757

Query: 1008 ----YEPETIHKGPDTKDENCNQKDV------EMTPVEKIDKKFEIEGSNNMSVDSPRIS 859
                 EP    +  +    + N          + T  ++  +K +   ++  S      +
Sbjct: 758  FIDDVEPRKRRRSTEVGQASVNANAFVSKTAKQDTTPKRHRQKSKFANTSTSSTKGHEGA 817

Query: 858  TKGGSSGKRM-SRPYQRLGDGVVQRLLESFRENQYPKHAVKESLAKELGLRIQQVSKWFE 682
            +   SSGK + S  Y+RLG+ V Q L +SF+ENQYP  A KESLAKELG+  QQVSKWFE
Sbjct: 818  SPSSSSGKPVKSSGYRRLGETVTQGLYKSFKENQYPDRAKKESLAKELGITFQQVSKWFE 877

Query: 681  NARWSARHSSHMDSRLTGSTSVNGTSLPEISEKVPKLGEQSNLESAICNEEGKMALPQTS 502
            N RWS  H    D+     T+   + LP+ + ++     +    +   N       P+  
Sbjct: 878  NTRWSFNHPPSTDASTVRKTTKEDSQLPKTNTELCTPEPEKICRNTTSNGAQSEESPKVD 937

Query: 501  AYVDGQHVAGTGEGNSAVGFSPESINGRCINVDDQKPDQLS 379
                G ++  T +       S ES   +    D +K   +S
Sbjct: 938  DATGGSYIGDTRDTKMG---SQESCKQKSKTPDSRKRKHIS 975


>gb|KDP44446.1| hypothetical protein JCGZ_16279 [Jatropha curcas]
          Length = 1009

 Score =  328 bits (842), Expect = 2e-86
 Identities = 182/384 (47%), Positives = 236/384 (61%), Gaps = 17/384 (4%)
 Frame = -2

Query: 2748 AHSVENLKTV------DGLTNNADVKSLGLHNIQYLPESANAEPLEQKQVAGDDNDDNKL 2587
            AH  +N  T+      +   N     S     +++  + A  +PLE+ +V   D   ++L
Sbjct: 159  AHIAKNSLTMGLELPCEDAINRCHQLSTSEQKVEFASDDATCDPLEESKVPASDLLRDEL 218

Query: 2586 TETE---IAAPDLAGLEEYIQISVSPCENVAMVPAFSALGS------DPQDASM--HIDP 2440
             E             L   +    SP E++ M P+ S + +      +P   +M  H++ 
Sbjct: 219  VEINNELSCCTATRHLGTQLTTKSSPLEHLGM-PSDSEINTCATEKLEPPHDNMDNHLNL 277

Query: 2439 QQTEPTQQKGAINAGGESGLDKRTPFQSRKRKSTLTIPVTARVLRSRSQXXXXXXXXXXX 2260
            QQ++   +  +IN+       KRT   +RK+    ++  + RV +SRSQ           
Sbjct: 278  QQSDTPSKDVSINSSRVGVRVKRTAKSTRKKYVLRSLRRSDRVRQSRSQEKPKGPDPNAD 337

Query: 2259 XXXXDAIEANXXXXXXXXXXXKIPANEFSRIRAHLRYLLHRIKYEQNLIDAYSGEGWKGQ 2080
                 +                +  +E+SRIR HLRYLL+RI YEQ+LI AYS EGWKG 
Sbjct: 338  MANASSNIEKTRKKRKKRQRKSVEGDEYSRIRKHLRYLLNRISYEQSLITAYSAEGWKGL 397

Query: 2079 SLEKIKPEKELQRAKSQIFRYKLKIRDLFRQIDLSLAEGKLPESLFDSDGQIDSEDIFCA 1900
            SLEK+KPEKELQRA S+I R KLKIRDLF+++D   AEG+LPESLFDSDGQI SEDIFCA
Sbjct: 398  SLEKLKPEKELQRATSEILRRKLKIRDLFQRVDSLCAEGRLPESLFDSDGQISSEDIFCA 457

Query: 1899 KCGSKDLTLDNDIILCDGACERGFHQYCLEPPLLKEDIPPDEEGWLCPGCDCKVDCIELL 1720
            KCGSKD+T DNDIILCDGAC+RGFHQ+CL PPLLKEDIPPD+EGWLCPGCDCKVDCIELL
Sbjct: 458  KCGSKDMTADNDIILCDGACDRGFHQFCLLPPLLKEDIPPDDEGWLCPGCDCKVDCIELL 517

Query: 1719 SDFQGSNLSVLDKWEKVFPEEAAA 1648
            +D QG+N+S+ D+WEKVFPE AAA
Sbjct: 518  NDSQGTNISISDRWEKVFPEAAAA 541



 Score =  134 bits (338), Expect = 6e-28
 Identities = 92/281 (32%), Positives = 135/281 (48%), Gaps = 18/281 (6%)
 Frame = -2

Query: 1167 PVSGKRHVERLDYKKLHEETYGDTSDDSSD-EDYGETVGSKGRKKSTGKAISVP------ 1009
            P+SGKR VERLDYKKL++ETYG+ S DSSD ED+ + V  + R+K T  + S        
Sbjct: 692  PISGKRDVERLDYKKLYDETYGNASSDSSDDEDFTDDVEPRKRRKETYGSTSSDSSDDED 751

Query: 1008 ----YEPETIHKGPDTKDENCNQKDV------EMTPVEKIDKKFEIEGSNNMSVDSPRIS 859
                 EP    +  +    + N          + T  ++  +K +   ++  S      +
Sbjct: 752  FIDDVEPRKRRRSTEVGQASVNANAFVSKTAKQDTTPKRHRQKSKFANTSTSSTKGHEGA 811

Query: 858  TKGGSSGKRM-SRPYQRLGDGVVQRLLESFRENQYPKHAVKESLAKELGLRIQQVSKWFE 682
            +   SSGK + S  Y+RLG+ V Q L +SF+ENQYP  A KESLAKELG+  QQVSKWFE
Sbjct: 812  SPSSSSGKPVKSSGYRRLGETVTQGLYKSFKENQYPDRAKKESLAKELGITFQQVSKWFE 871

Query: 681  NARWSARHSSHMDSRLTGSTSVNGTSLPEISEKVPKLGEQSNLESAICNEEGKMALPQTS 502
            N RWS  H    D+     T+   + LP+ + ++     +    +   N       P+  
Sbjct: 872  NTRWSFNHPPSTDASTVRKTTKEDSQLPKTNTELCTPEPEKICRNTTSNGAQSEESPKVD 931

Query: 501  AYVDGQHVAGTGEGNSAVGFSPESINGRCINVDDQKPDQLS 379
                G ++  T +       S ES   +    D +K   +S
Sbjct: 932  DATGGSYIGDTRDTKMG---SQESCKQKSKTPDSRKRKHIS 969


>ref|XP_011001393.1| PREDICTED: homeobox protein HAT3.1-like [Populus euphratica]
            gi|743794901|ref|XP_011001400.1| PREDICTED: homeobox
            protein HAT3.1-like [Populus euphratica]
            gi|743794905|ref|XP_011001405.1| PREDICTED: homeobox
            protein HAT3.1-like [Populus euphratica]
          Length = 934

 Score =  327 bits (838), Expect = 7e-86
 Identities = 183/365 (50%), Positives = 231/365 (63%), Gaps = 15/365 (4%)
 Frame = -2

Query: 2676 HNIQYLPESANAEPLEQKQVAGDDNDDNKLT--ETE----IAAPDLAGLEEYIQISVSPC 2515
            H ++ L + A  EP E++Q  G +  +N+ T  +TE    IA  +   L + +  S SP 
Sbjct: 205  HTLELLSDRACCEPSEERQKPGSELSENESTGIDTELYCGIAIKNSEPLTQLVTKS-SPI 263

Query: 2514 ENVAMVPAFSAL---------GSDPQDASMHIDPQQTEPTQQKGAINAGGESGLDKRTPF 2362
            ++V ++P  S +           D +D     +  +T      G    G  SG  K    
Sbjct: 264  KHVGLLPGDSIIIPANEQTRPTHDDEDKGPDHEHLETPSRVAIGITRRGRPSG--KSASR 321

Query: 2361 QSRKRKSTLTIPVTARVLRSRSQXXXXXXXXXXXXXXXDAIEANXXXXXXXXXXXKIPAN 2182
             SRK     ++  + RVLRSRSQ               ++                I A+
Sbjct: 322  LSRKIYMLRSLRSSDRVLRSRSQVKPKAPESSNNSGNVNSTGDKKGKRRKKRRGKNIVAD 381

Query: 2181 EFSRIRAHLRYLLHRIKYEQNLIDAYSGEGWKGQSLEKIKPEKELQRAKSQIFRYKLKIR 2002
            E+S+IRAHLRYLL+R+ YEQ+LI AYSGEGWKG SLEK+KPEKELQRA S+I R K+KIR
Sbjct: 382  EYSKIRAHLRYLLNRMSYEQSLITAYSGEGWKGLSLEKLKPEKELQRATSEITRRKVKIR 441

Query: 2001 DLFRQIDLSLAEGKLPESLFDSDGQIDSEDIFCAKCGSKDLTLDNDIILCDGACERGFHQ 1822
            DLF+ ID   +EG+ P SLFDS+GQIDSEDIFCAKCGSKDL  DNDIILCDGAC+RGFHQ
Sbjct: 442  DLFQHIDYLCSEGRFPSSLFDSEGQIDSEDIFCAKCGSKDLNADNDIILCDGACDRGFHQ 501

Query: 1821 YCLEPPLLKEDIPPDEEGWLCPGCDCKVDCIELLSDFQGSNLSVLDKWEKVFPEEAAAAA 1642
            +CL PPLL+EDIPPD+EGWLCPGCDCKVDCI+LL+D QG+N+S+ D WEKVFP EAAA  
Sbjct: 502  FCLIPPLLREDIPPDDEGWLCPGCDCKVDCIDLLNDSQGTNISISDSWEKVFP-EAAATV 560

Query: 1641 SGMKM 1627
            SG K+
Sbjct: 561  SGQKL 565



 Score =  138 bits (348), Expect = 4e-29
 Identities = 86/220 (39%), Positives = 125/220 (56%), Gaps = 4/220 (1%)
 Frame = -2

Query: 1170 APVSGKRHVERLDYKKLHEETYGDTSDDSSDEDYGETVGSKGRKKSTGKAISVPY--EPE 997
            A VSGKR+V+RLDYKKL++ETYG+ S  SSD+DY +TVG + R+K+ G   +V    +  
Sbjct: 718  ATVSGKRNVDRLDYKKLYDETYGNIST-SSDDDYTDTVGPRKRRKNAGDVATVTANGDAS 776

Query: 996  TIHKGPDTKDENCNQKDVEMTPVEKIDKKFEIEGSNNMSVDSPRISTKGGSSGKRMSRP- 820
                G ++K+ N   K+ +  P          + +N     S   ++  GSSGK + RP 
Sbjct: 777  VTENGMNSKNMNQELKENKRNPERGTCHNSSFQETNVSPAKSYVGASLSGSSGKSV-RPS 835

Query: 819  -YQRLGDGVVQRLLESFRENQYPKHAVKESLAKELGLRIQQVSKWFENARWSARHSSHMD 643
             Y++LG+ V QRL   F+ENQYP  A K SLA+ELG+  +QV+KWF NARWS  HSS   
Sbjct: 836  AYKKLGEAVTQRLYSYFKENQYPDRAAKASLAEELGITFEQVNKWFVNARWSFNHSSSTG 895

Query: 642  SRLTGSTSVNGTSLPEISEKVPKLGEQSNLESAICNEEGK 523
            +    S S  G+   ++ +   K   +SN +     + G+
Sbjct: 896  ASKAESASGKGSCDGQVRDSELK-NRKSNKQKTNTPKSGR 934


>ref|XP_010099058.1| Homeobox protein [Morus notabilis] gi|587887924|gb|EXB76647.1|
            Homeobox protein [Morus notabilis]
          Length = 1031

 Score =  326 bits (835), Expect = 1e-85
 Identities = 193/440 (43%), Positives = 254/440 (57%), Gaps = 24/440 (5%)
 Frame = -2

Query: 2883 PEQRALELGNGFVSGKLCTELVVQKREMVKDAQMDPEETGIRKSNAHSVENLKTVDGLTN 2704
            P++ A E+ +G  SG LC E    K ++  + Q +  +T I  S+    + L+ V    +
Sbjct: 143  PKEAAAEVKHGCGSGNLCAEQAGTKNDVDSNLQNEIRKTDITVSSFVFTQKLEIVSEKRS 202

Query: 2703 NADVKSLGLHNIQYL----------PESAN-------------AEPLEQKQVAGDDNDDN 2593
                 +L + +   +          P+ +               E  +Q+   G +   N
Sbjct: 203  LISGGNLAVPSEDVVRHCQTENSSCPQQSTLGQIKDFDCGCLLGETPKQEDHLGTELVQN 262

Query: 2592 KLTETEIAAPDLAGLEEYIQISVSPCENVAMVPAFSALGSDPQDASMHIDPQQTEPTQQK 2413
             L ET IAA +   + E+++  V    +  +      +    +D S     +Q E T  K
Sbjct: 263  VLVETRIAASN-GIVSEHLEPPVGDGSDSYID---KQVEQPSEDVSKSSSLEQLE-TSSK 317

Query: 2412 GAINAGGESGLDKRTPFQSRKRKSTLTIPVTA-RVLRSRSQXXXXXXXXXXXXXXXDAIE 2236
              +N   + G   +   +SRK++  L   V + RVLRSR+Q                   
Sbjct: 318  SLVNKPSQLGRKDKQTSKSRKKQYMLRSLVHSDRVLRSRTQEKLKSHELSNTLSNIGNGV 377

Query: 2235 ANXXXXXXXXXXXKIPANEFSRIRAHLRYLLHRIKYEQNLIDAYSGEGWKGQSLEKIKPE 2056
                         ++ A+EFSRIR  L+Y  +RI YEQNLIDAYS EGWKG SLEK+KPE
Sbjct: 378  EKRMKERKKRRGTRVIADEFSRIRKRLKYFFNRIHYEQNLIDAYSSEGWKGTSLEKLKPE 437

Query: 2055 KELQRAKSQIFRYKLKIRDLFRQIDLSLAEGKLPESLFDSDGQIDSEDIFCAKCGSKDLT 1876
            KELQRAKS+IFR KLKIRDLF+Q+D   AEG+ P+SLFDS+GQIDSEDIFCAKCGSKD++
Sbjct: 438  KELQRAKSEIFRRKLKIRDLFQQLDSLCAEGRFPKSLFDSEGQIDSEDIFCAKCGSKDMS 497

Query: 1875 LDNDIILCDGACERGFHQYCLEPPLLKEDIPPDEEGWLCPGCDCKVDCIELLSDFQGSNL 1696
             +NDIILCDGAC+RGFHQ+CLEPPLL EDIPPD+EGWLCPGCDCKVDC +LL+D  G+NL
Sbjct: 498  ANNDIILCDGACDRGFHQFCLEPPLLSEDIPPDDEGWLCPGCDCKVDCFDLLNDSYGTNL 557

Query: 1695 SVLDKWEKVFPEEAAAAASG 1636
            SV D WEKVFPE AAAA  G
Sbjct: 558  SVTDSWEKVFPEAAAAAREG 577



 Score =  163 bits (413), Expect = 1e-36
 Identities = 99/240 (41%), Positives = 138/240 (57%), Gaps = 4/240 (1%)
 Frame = -2

Query: 1167 PVSGKRHVERLDYKKLHEETYGDT-SDDSSDEDYGETVGSKGRKKSTGKAISV-PYEPET 994
            P+SGKRHVERLDYK+LH+ETYG   SD S DED+ +    + RK++TG+  SV P E  +
Sbjct: 737  PISGKRHVERLDYKRLHDETYGHLPSDSSDDEDWTDYAAPRKRKRTTGQVSSVSPNENAS 796

Query: 993  IHKGPDTKDE-NCNQKDVEMTPVEKIDKKFEIEGSNNMSVDSPRISTKGGSSGKRMS-RP 820
            I K   T D  N + +D E  P  +  +   +   NN+     + S K GS+G+R     
Sbjct: 797  IIKNQTTTDAANNDLEDNEYVPRRRSRQNSVVTDENNIPNKLLQGSPKSGSTGRRRELST 856

Query: 819  YQRLGDGVVQRLLESFRENQYPKHAVKESLAKELGLRIQQVSKWFENARWSARHSSHMDS 640
             +RLG+ V QRL +SF+ENQY   A KESLA+ELGL   QVSKWFENARWS RHSS    
Sbjct: 857  NRRLGEAVTQRLYQSFKENQYLDRATKESLAQELGLTSYQVSKWFENARWSYRHSSSKKP 916

Query: 639  RLTGSTSVNGTSLPEISEKVPKLGEQSNLESAICNEEGKMALPQTSAYVDGQHVAGTGEG 460
             ++   S   T  P+ ++K+ +    +++ ++ CN      LP+T   +        G+G
Sbjct: 917  GISEHASKESTLSPQTNKKLFETELNTSITNSTCNGALNNELPRTGNAMPESCSGDVGDG 976

