BLASTX nr result

ID: Akebia26_contig00013614 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia26_contig00013614
         (3018 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI22504.3| unnamed protein product [Vitis vinifera]              432   e-118
ref|XP_002269077.1| PREDICTED: homeobox protein HAT3.1-like [Vit...   432   e-118
ref|XP_006486963.1| PREDICTED: homeobox protein HAT3.1-like isof...   429   e-117
ref|XP_006422879.1| hypothetical protein CICLE_v10027725mg [Citr...   429   e-117
ref|XP_007200058.1| hypothetical protein PRUPE_ppa023106mg [Prun...   428   e-117
gb|EXB76647.1| Homeobox protein [Morus notabilis]                     419   e-114
ref|XP_002313886.2| hypothetical protein POPTR_0009s09600g [Popu...   409   e-111
ref|XP_002300247.2| homeobox family protein [Populus trichocarpa...   408   e-111
ref|XP_007042568.1| Homeodomain-like protein with RING/FYVE/PHD-...   404   e-109
ref|XP_004289744.1| PREDICTED: uncharacterized protein LOC101296...   389   e-105
ref|XP_002527467.1| Homeobox protein HAT3.1, putative [Ricinus c...   387   e-104
emb|CAN68079.1| hypothetical protein VITISV_006312 [Vitis vinifera]   379   e-102
ref|XP_004140812.1| PREDICTED: uncharacterized protein LOC101204...   374   e-100
ref|XP_006829269.1| hypothetical protein AMTR_s00001p00272780 [A...   368   8e-99
ref|XP_007143079.1| hypothetical protein PHAVU_007G041800g [Phas...   363   3e-97
ref|XP_003555282.1| PREDICTED: homeobox protein HAT3.1-like isof...   361   1e-96
ref|XP_006605989.1| PREDICTED: homeobox protein HAT3.1-like isof...   359   5e-96
ref|XP_004161446.1| PREDICTED: homeobox protein HAT3.1-like [Cuc...   356   4e-95
ref|XP_006589630.1| PREDICTED: pathogenesis-related homeodomain ...   352   8e-94
ref|XP_004496910.1| PREDICTED: pathogenesis-related homeodomain ...   337   1e-89

>emb|CBI22504.3| unnamed protein product [Vitis vinifera]
          Length = 977

 Score =  432 bits (1110), Expect = e-118
 Identities = 298/725 (41%), Positives = 374/725 (51%), Gaps = 42/725 (5%)
 Frame = -1

Query: 2226 SSARENCQTIESSSLKQNRLQGKHELG-------------SEHAQSEPMETKIVESSVLE 2086
            S A+E+ +T +SSS KQN L+   +L              SE+ Q EP E     S VL 
Sbjct: 4    SPAQESNRTRKSSSPKQNILEEARKLSESVCSESSEQKRPSENGQHEPAEI----SPVLS 59

Query: 2085 NCQKIESSSPKQNTTQEKQNG----QTTESSSPQKIMFGEKNELGLEHVQSETLEIRIVG 1918
            NC   E S        +   G      T++S  + +    ++ +  +  +       +V 
Sbjct: 60   NCIVTEQSELPPEDVGDTILGLPPADVTKNSLTEHLGLPPEDAIKNDGTEQLGFFPEVVT 119

Query: 1917 SDSVDIKQLGT---PPKDAAK------STNRSSRKKNQVSSRPLKRKYMLRSSISSPTVL 1765
              S+ I++LG    PP++ A+      S +      N+ +++ +KRKY LRSS+S   VL
Sbjct: 120  KSSI-IEKLGQSEPPPENVARYSGLDQSGSAPKDLANKRTAKLVKRKYKLRSSVSGSRVL 178

Query: 1764 QSSSHGTSKPPDPSIS-ANVXXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEH 1588
            +S S    K   PS +  N                KT  DEF+R RK LRYLLNRM YE 
Sbjct: 179  RSRSQEKPKASQPSDNFVNASASRERKGRKKKRMNKTTADEFARIRKHLRYLLNRMSYEQ 238

Query: 1587 NLIDAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLF 1408
            NLIDAYS EGWKG S+EK+KPE+ELQRA+SEI R KL+IRDLFQHLDSLCAEGR   SLF
Sbjct: 239  NLIDAYSAEGWKGQSVEKLKPEKELQRASSEISRRKLQIRDLFQHLDSLCAEGRFPESLF 298

Query: 1407 DSEGQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWL 1228
            DSEGQI SEDIFCAKC SKD+S DNDIILCDG CDRGFH  CL PPL K++IPP+DEGWL
Sbjct: 299  DSEGQIDSEDIFCAKCESKDMSADNDIILCDGACDRGFHQFCLEPPLLKEEIPPDDEGWL 358

Query: 1227 CPGCDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXX 1048
            CP CDCKVDC+ LLND QGT LS+ D WEKVFPEAAA  +G+N D+N GF          
Sbjct: 359  CPACDCKVDCMDLLNDSQGTKLSVIDSWEKVFPEAAA--AGNNQDNNSGFSSDDSEDNDY 416

Query: 1047 XXDGPELDERVQ------------XXXXXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXX 904
              D PE+DE+ Q                            D+  SP+ EQ LGL      
Sbjct: 417  DPDCPEVDEKGQGDKSSSDKFDESDEFDESDESDFTSASDDMVVSPNNEQCLGLPSDDSE 476

Query: 903  XXXXXXXXXXXDEQIKKXXXXXXXXXXXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGS 724
                       DEQ+ +               A  D +  S        + DG       
Sbjct: 477  DDDFDPDAPEIDEQVNQGSSSSDFTSDSEDFTATLDRRNFS-------DNEDGL------ 523

Query: 723  SNRRSQMGRKKKQSINAELLSILEPGPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXX 544
             + + + GRKKK ++  ELLS+LE    Q NA P+ +KR+ ERLDYKKLHDEAYGNV   
Sbjct: 524  -DEQRRFGRKKKDTLKDELLSVLESNSGQDNA-PLSAKRHVERLDYKKLHDEAYGNVSSD 581

Query: 543  XXXXXXXXXXEAPRKRKNDGGIGNTSVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPTRR 364
                        PRKRKN    GN + V P G                +L+    TP RR
Sbjct: 582  SSDDEDWTENVIPRKRKNLS--GNVASVSPNGNTSITENGTNTKDIKHDLEAAGCTPKRR 639

Query: 363  TRQKLDSDGAYNIAAETHKDSLGP-DYAVKVATASASRKNTGSTQRNLK--ETKNTPKRR 193
            TRQKL+ +   N  AE+HKDS  P     K   +S  +     T+R  K  +    P R 
Sbjct: 640  TRQKLNFESTNNSLAESHKDSRSPGSTGEKSGQSSYKKLGEAVTERLYKSFQENQYPDRA 699

Query: 192  TREKL 178
             +EKL
Sbjct: 700  MKEKL 704


>ref|XP_002269077.1| PREDICTED: homeobox protein HAT3.1-like [Vitis vinifera]
          Length = 968

 Score =  432 bits (1110), Expect = e-118
 Identities = 298/725 (41%), Positives = 374/725 (51%), Gaps = 42/725 (5%)
 Frame = -1

Query: 2226 SSARENCQTIESSSLKQNRLQGKHELG-------------SEHAQSEPMETKIVESSVLE 2086
            S A+E+ +T +SSS KQN L+   +L              SE+ Q EP E     S VL 
Sbjct: 4    SPAQESNRTRKSSSPKQNILEEARKLSESVCSESSEQKRPSENGQHEPAEI----SPVLS 59

Query: 2085 NCQKIESSSPKQNTTQEKQNG----QTTESSSPQKIMFGEKNELGLEHVQSETLEIRIVG 1918
            NC   E S        +   G      T++S  + +    ++ +  +  +       +V 
Sbjct: 60   NCIVTEQSELPPEDVGDTILGLPPADVTKNSLTEHLGLPPEDAIKNDGTEQLGFFPEVVT 119

Query: 1917 SDSVDIKQLGT---PPKDAAK------STNRSSRKKNQVSSRPLKRKYMLRSSISSPTVL 1765
              S+ I++LG    PP++ A+      S +      N+ +++ +KRKY LRSS+S   VL
Sbjct: 120  KSSI-IEKLGQSEPPPENVARYSGLDQSGSAPKDLANKRTAKLVKRKYKLRSSVSGSRVL 178

Query: 1764 QSSSHGTSKPPDPSIS-ANVXXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEH 1588
            +S S    K   PS +  N                KT  DEF+R RK LRYLLNRM YE 
Sbjct: 179  RSRSQEKPKASQPSDNFVNASASRERKGRKKKRMNKTTADEFARIRKHLRYLLNRMSYEQ 238

Query: 1587 NLIDAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLF 1408
            NLIDAYS EGWKG S+EK+KPE+ELQRA+SEI R KL+IRDLFQHLDSLCAEGR   SLF
Sbjct: 239  NLIDAYSAEGWKGQSVEKLKPEKELQRASSEISRRKLQIRDLFQHLDSLCAEGRFPESLF 298

Query: 1407 DSEGQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWL 1228
            DSEGQI SEDIFCAKC SKD+S DNDIILCDG CDRGFH  CL PPL K++IPP+DEGWL
Sbjct: 299  DSEGQIDSEDIFCAKCESKDMSADNDIILCDGACDRGFHQFCLEPPLLKEEIPPDDEGWL 358

Query: 1227 CPGCDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXX 1048
            CP CDCKVDC+ LLND QGT LS+ D WEKVFPEAAA  +G+N D+N GF          
Sbjct: 359  CPACDCKVDCMDLLNDSQGTKLSVIDSWEKVFPEAAA--AGNNQDNNSGFSSDDSEDNDY 416

Query: 1047 XXDGPELDERVQ------------XXXXXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXX 904
              D PE+DE+ Q                            D+  SP+ EQ LGL      
Sbjct: 417  DPDCPEVDEKGQGDKSSSDKFDESDEFDESDESDFTSASDDMVVSPNNEQCLGLPSDDSE 476

Query: 903  XXXXXXXXXXXDEQIKKXXXXXXXXXXXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGS 724
                       DEQ+ +               A  D +  S        + DG       
Sbjct: 477  DDDFDPDAPEIDEQVNQGSSSSDFTSDSEDFTATLDRRNFS-------DNEDGL------ 523

Query: 723  SNRRSQMGRKKKQSINAELLSILEPGPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXX 544
             + + + GRKKK ++  ELLS+LE    Q NA P+ +KR+ ERLDYKKLHDEAYGNV   
Sbjct: 524  -DEQRRFGRKKKDTLKDELLSVLESNSGQDNA-PLSAKRHVERLDYKKLHDEAYGNVSSD 581

Query: 543  XXXXXXXXXXEAPRKRKNDGGIGNTSVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPTRR 364
                        PRKRKN    GN + V P G                +L+    TP RR
Sbjct: 582  SSDDEDWTENVIPRKRKNLS--GNVASVSPNGNTSITENGTNTKDIKHDLEAAGCTPKRR 639

Query: 363  TRQKLDSDGAYNIAAETHKDSLGP-DYAVKVATASASRKNTGSTQRNLK--ETKNTPKRR 193
            TRQKL+ +   N  AE+HKDS  P     K   +S  +     T+R  K  +    P R 
Sbjct: 640  TRQKLNFESTNNSLAESHKDSRSPGSTGEKSGQSSYKKLGEAVTERLYKSFQENQYPDRA 699

Query: 192  TREKL 178
             +EKL
Sbjct: 700  MKEKL 704


>ref|XP_006486963.1| PREDICTED: homeobox protein HAT3.1-like isoform X1 [Citrus sinensis]
            gi|568867273|ref|XP_006486964.1| PREDICTED: homeobox
            protein HAT3.1-like isoform X2 [Citrus sinensis]
          Length = 1063

 Score =  429 bits (1102), Expect = e-117
 Identities = 338/978 (34%), Positives = 450/978 (46%), Gaps = 45/978 (4%)
 Frame = -1

Query: 2799 MQEKNELGSEHAQSGPMETKIVESNA-EENCQTIESSS----------PKQNTLQGNHAL 2653
            ++ K+ LGSE  ++ P ET I  S   +  C  +  SS            + + Q N A 
Sbjct: 45   LESKSLLGSEAVENEPRETSIPNSEKLQAFCGDVPDSSFTDHLAPPSEDMRKSTQTNKAS 104

Query: 2652 CSEHAQSEPVETKILGSNAQEN-----CQTIESSSPKQNILQGKLELGSEHAQSEPMETK 2488
            CS+   SE      L  N Q       C  I    P+   L        ++A  +P+   
Sbjct: 105  CSQQNTSEQKHGTELMHNEQSEQKHQLCYQIVFDKPQATSLV-------DNATLQPVSKD 157

Query: 2487 IVESNAQESYKTIESSS-----------LKQNTLQGKHELGSEHAQSEPME--MKIVEPS 2347
            + +S+   + + ++  S           +    L  KH+LGSE  Q+EP     ++    
Sbjct: 158  VSKSSQTGTRQALDFLSGNRCNELDVDCVHSEPLDQKHQLGSEIIQNEPAVNIARLPSDG 217

Query: 2346 AQENCQTIESSSLKQNTSQGKHELGSEHAQSEPMETKIVESSARENCQTIESSSLKQNRL 2167
             +EN QTI     K            E +QS P +       A ++CQ  E S L+Q+  
Sbjct: 218  VEENLQTISEDLTKV--------CPVEPSQSPPRD-------ANKSCQAGEISYLQQSSS 262

Query: 2166 QGKHELGSEHAQSEPMETKIVESSVLENCQKIESSSPKQNTTQEKQNGQTTESSSPQK-- 1993
            +   E     +  EP        S LE  +  E+S+ +   + E     + E     +  
Sbjct: 263  EQTPEFTPGISSHEPSVVNYKLGSQLEQTELGETSAGELGASLELVVKSSIEQLKQPEVP 322

Query: 1992 IMFGEKNELGLEHVQSETLEIRIVGSDSVD----IKQLGTPPKDAAKSTNRSSRKKNQVS 1825
            I          +H+QS         SD ++    ++Q  TPP   A ++    RK  + +
Sbjct: 323  ITIPSTKTSATKHLQSS--------SDLMEKKSCLEQSETPPNYVANNSACLGRKGKR-A 373

Query: 1824 SRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPSIS---ANVXXXXXXXXXXXXXXXKTL 1654
            ++ LK  Y +RS I S  VL+S S     PP+ SI+    N                K +
Sbjct: 374  TKSLKNNYTVRSLIGSDRVLRSRSGERPIPPESSINLADVNSIGERKQKKRNKIRRKKIV 433

Query: 1653 NDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLK 1474
             DE+SR R  LRYLLNR+ YE NLIDAYS EGWKGLS+EK+KPE+ELQRATSEILR KLK
Sbjct: 434  ADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLK 493

Query: 1473 IRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGF 1294
            IRDLFQ LDSLCA G    SLFDSEGQI SEDI+CAKCGSKDLS DNDIILCDG CDRGF
Sbjct: 494  IRDLFQRLDSLCAGG-FPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGF 552

Query: 1293 HLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAAT 1114
            H  CL PPL K+ IPP+DEGWLCPGCDCKVDCI L+N++QGT L I D WEKVFPEAAA 
Sbjct: 553  HQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAA- 611

Query: 1113 GSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQXXXXXXXXXXXXXXXXD-----LGAS 949
              G N D NFG             DG   DE+ +                      + A 
Sbjct: 612  --GHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAP 669

Query: 948  PHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXXXXXXALS--DGKGSSGT 775
              ++ YLG                  D+++ +                 +  +   SSG 
Sbjct: 670  ADDKTYLGRSSEDSEDDEYNPDAPDLDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGN 729

Query: 774  NVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEPGPSQGNASPVLSKRNRER 595
            +       +GA    G SN +        +S+N ELLSI++PG  Q  A+PV  KR+ ER
Sbjct: 730  D-------EGAASPLGHSNGQRYKDGGNNESLNNELLSIIKPG--QDGAAPVYGKRSSER 780

Query: 594  LDYKKLHDEAYGNVPXXXXXXXXXXXXEAPRKRKNDGGIGNTSVVPPKGXXXXXXXXXXX 415
            LDYKKL+DE YGNVP                   +DGG                      
Sbjct: 781  LDYKKLYDETYGNVPYDSSDDESW---------SDDGG---------------------- 809

Query: 414  XXXXXNLKDIESTPTRRTRQKLDSDGAYNIAAETHKDSLGPDYAVKVATASASRKNTGST 235
                         P +RT+   +   A             PD    V      RK+T + 
Sbjct: 810  -------------PRKRTKSTKEGSSA------------SPDGKTPVIRR---RKSTKAA 841

Query: 234  QRNLKETKNTPKRRTREKLDFEGADNLAAETPKDNLGPDSSGKIATATPSRTLGKAVTQG 55
            +  L ET+NTPKRR R KL+ E ++   A++ +    P S G+    T  R +G+ VTQ 
Sbjct: 842  KEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGR-RHRTSYRKIGEEVTQK 900

Query: 54   LLESFRENEYPTRDKKEN 1
            L  SF+EN+YP R  KE+
Sbjct: 901  LYNSFKENQYPNRTTKES 918



 Score =  411 bits (1056), Expect = e-111
 Identities = 325/939 (34%), Positives = 439/939 (46%), Gaps = 88/939 (9%)
 Frame = -1

Query: 2730 SNAEENCQTIESSSPKQNTLQGNHALCSEHAQSEPVETK-ILGSNAQENCQTIESSSPKQ 2554
            S ++    T + S  KQ T +     C+E   SEP+E+K +LGS A EN +  E+S P  
Sbjct: 10   SPSQVTSSTRDHSCLKQTTPEQMCKFCAECIHSEPLESKSLLGSEAVEN-EPRETSIPNS 68

Query: 2553 NILQGKLELGSEHAQSEPMETKIVESNAQESYKTIESSSLKQNTLQGKHELGSEHAQSEP 2374
              LQ     G     S          + ++S +T ++S  +QNT + KH  G+E   +E 
Sbjct: 69   EKLQA--FCGDVPDSSFTDHLAPPSEDMRKSTQTNKASCSQQNTSEQKH--GTELMHNEQ 124

Query: 2373 MEMKIVEPSAQENCQTI-----ESSSLKQNTSQGKHELGSEHAQSEPMETK-IVESSARE 2212
             E K       + C  I     +++SL  N +     +  + ++S    T+  ++  +  
Sbjct: 125  SEQK------HQLCYQIVFDKPQATSLVDNAT--LQPVSKDVSKSSQTGTRQALDFLSGN 176

Query: 2211 NCQTIESSSLKQNRLQGKHELGSEHAQSEPME--TKIVESSVLENCQKIE---------- 2068
             C  ++   +    L  KH+LGSE  Q+EP     ++    V EN Q I           
Sbjct: 177  RCNELDVDCVHSEPLDQKHQLGSEIIQNEPAVNIARLPSDGVEENLQTISEDLTKVCPVE 236

Query: 2067 -SSSPKQNTTQEKQNGQTT---ESSSPQKIMFG-------------------EKNELGLE 1957
             S SP ++  +  Q G+ +   +SSS Q   F                    E+ ELG  
Sbjct: 237  PSQSPPRDANKSCQAGEISYLQQSSSEQTPEFTPGISSHEPSVVNYKLGSQLEQTELGET 296

Query: 1956 HVQSETLEIRIVGSDSVD----------------------------------IKQLGTPP 1879
                    + +V   S++                                  ++Q  TPP
Sbjct: 297  SAGELGASLELVVKSSIEQLKQPEVPITIPSTKTSATKHLQSSSDLMEKKSCLEQSETPP 356

Query: 1878 KDAAKSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPSIS---ANV 1708
               A ++    RK  + +++ LK  Y +RS I S  VL+S S     PP+ SI+    N 
Sbjct: 357  NYVANNSACLGRKGKR-ATKSLKNNYTVRSLIGSDRVLRSRSGERPIPPESSINLADVNS 415

Query: 1707 XXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIK 1528
                           K + DE+SR R  LRYLLNR+ YE NLIDAYS EGWKGLS+EK+K
Sbjct: 416  IGERKQKKRNKIRRKKIVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLK 475

Query: 1527 PEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKD 1348
            PE+ELQRATSEILR KLKIRDLFQ LDSLCA G    SLFDSEGQI SEDI+CAKCGSKD
Sbjct: 476  PEKELQRATSEILRRKLKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKD 534

Query: 1347 LSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGT 1168
            LS DNDIILCDG CDRGFH  CL PPL K+ IPP+DEGWLCPGCDCKVDCI L+N++QGT
Sbjct: 535  LSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGT 594

Query: 1167 NLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQ-----XXX 1003
             L I D WEKVFPEAAA   G N D NFG             DG   DE+ +        
Sbjct: 595  RLFITDNWEKVFPEAAA---GHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDG 651

Query: 1002 XXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQI--KKXXXXXXXX 829
                         ++ A   ++ YLG                  D+++  +         
Sbjct: 652  SSSDDSDFTSTSDEVEAPADDKTYLGRSSEDSEDDEYNPDAPDLDDKVTQESSSSGSDFT 711

Query: 828  XXXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEP 649
                   A+ +   SSG +       +GA    G SN +        +S+N ELLSI++P
Sbjct: 712  SDSEDLAAVLEDNRSSGND-------EGAASPLGHSNGQRYKDGGNNESLNNELLSIIKP 764

Query: 648  GPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXXXEAPRKRKNDGGIGNT 469
            G  Q  A+PV  KR+ ERLDYKKL+DE YGNVP              PRKR      G++
Sbjct: 765  G--QDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSS 822

Query: 468  SVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPTRRTRQKLDSDGAYNIAAETHKDSLGPD 289
            +   P G                 L + E+TP RR R KL+++ +    A++H+    P 
Sbjct: 823  A--SPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPG 880

Query: 288  YAVKVATASASRKNTGSTQR--NLKETKNTPKRRTREKL 178
               +    S  +     TQ+  N  +    P R T+E L
Sbjct: 881  SRGRRHRTSYRKIGEEVTQKLYNSFKENQYPNRTTKESL 919


>ref|XP_006422879.1| hypothetical protein CICLE_v10027725mg [Citrus clementina]
            gi|557524813|gb|ESR36119.1| hypothetical protein
            CICLE_v10027725mg [Citrus clementina]
          Length = 1063

 Score =  429 bits (1102), Expect = e-117
 Identities = 341/978 (34%), Positives = 448/978 (45%), Gaps = 45/978 (4%)
 Frame = -1

Query: 2799 MQEKNELGSEHAQSGPMETKIVESNA-EENCQTIESSS----------PKQNTLQGNHAL 2653
            ++ K+ LGSE  ++ P ET I  S   +  C  +  SS            + + Q N A 
Sbjct: 45   LESKSLLGSEAVENEPRETSIPNSEKLQAFCGDVPDSSFTDHLAPPSEDMRKSTQTNKAS 104

Query: 2652 CSEHAQSEPVETKILGSNAQEN-----CQTIESSSPKQNILQGKLELGSEHAQSEPMETK 2488
            CS+   SE      L  N Q       C  I    P+   L        ++A  +P+   
Sbjct: 105  CSQQNTSEQKHGTELMHNEQSEQKHQLCYQIVFDKPQATSLV-------DNATLQPVSKD 157

Query: 2487 IVESNAQESYKTIESSS-----------LKQNTLQGKHELGSEHAQSEPMEMKIVEPS-- 2347
            + +S+   + + ++  S           +    L  KH+LGSE  Q+EP       PS  
Sbjct: 158  VSKSSQTGNRQALDFLSGNRCNELDVDCVHSEPLNQKHQLGSEIIQNEPAVNVARLPSDG 217

Query: 2346 AQENCQTIESSSLKQNTSQGKHELGSEHAQSEPMETKIVESSARENCQTIESSSLKQNRL 2167
             +EN QTI     K            E +QS P +       A ++CQ  E S L+Q+  
Sbjct: 218  VEENLQTISEDLTKV--------CPVEPSQSPPRD-------ANKSCQAGEISCLQQSSS 262

Query: 2166 QGKHELGSEHAQSEPMETKIVESSVLENCQKIESSSPKQNTTQEKQNGQTTESSSPQK-- 1993
            +   E     +  EP        S LE  +  E+S+ +   + E     + E     +  
Sbjct: 263  EQTPEFTPGISSHEPSVVNYKLGSQLEQTELGETSAGELGASLELVVKSSIEQLKQLEVP 322

Query: 1992 IMFGEKNELGLEHVQSETLEIRIVGSDSVD----IKQLGTPPKDAAKSTNRSSRKKNQVS 1825
            I          +H+QS         SD ++    ++Q  TPP   A ++    RK  + +
Sbjct: 323  ITIPSTKTSATKHLQSS--------SDLMEKKSCLEQSETPPNYVANNSACLGRKGKR-A 373

Query: 1824 SRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPS---ISANVXXXXXXXXXXXXXXXKTL 1654
            ++ LK  Y +RS I S  VL+S S     PP+ S      N                K +
Sbjct: 374  TKSLKNNYTVRSLIGSDRVLRSRSGERPLPPESSNNLADVNSIGERKQKKRNKIRRKKIV 433

Query: 1653 NDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLK 1474
             DE+SR R  LRYLLNR+ YE NLIDAYS EGWKGLS+EK+KPE+ELQRATSEILR KLK
Sbjct: 434  ADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLK 493

Query: 1473 IRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGF 1294
            IRDLFQ LDSLCA G    SLFDSEGQI SEDI+CAKCGSKDLS DNDIILCDG CDRGF
Sbjct: 494  IRDLFQRLDSLCAGG-FPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGF 552

Query: 1293 HLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAAT 1114
            H  CL PPL K+ IPP+DEGWLCPGCDCKVDCI L+N++QGT L I D WEKVFPEAAA 
Sbjct: 553  HQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAA- 611

Query: 1113 GSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQXXXXXXXXXXXXXXXXD-----LGAS 949
              G N D NFG             DG   DE+ +                      + A 
Sbjct: 612  --GHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAP 669

Query: 948  PHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXXXXXXALS--DGKGSSGT 775
              ++ YLGL                 D+++ +                 +  +   SSG 
Sbjct: 670  ADDKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGN 729

Query: 774  NVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEPGPSQGNASPVLSKRNRER 595
            +       +GA    G SN +        +S+N ELLSI++PG  Q  A PV  KR+ ER
Sbjct: 730  D-------EGAASPLGHSNGQRYKDGGNNESLNNELLSIIKPG--QDGAVPVYGKRSSER 780

Query: 594  LDYKKLHDEAYGNVPXXXXXXXXXXXXEAPRKRKNDGGIGNTSVVPPKGXXXXXXXXXXX 415
            LDYKKL+DE YGNVP                   +DGG                      
Sbjct: 781  LDYKKLYDETYGNVPYDSSDDESW---------SDDGG---------------------- 809

Query: 414  XXXXXNLKDIESTPTRRTRQKLDSDGAYNIAAETHKDSLGPDYAVKVATASASRKNTGST 235
                         P +RT+   +   A             PD    V      RK+T + 
Sbjct: 810  -------------PRKRTKSTKEGSSA------------SPDGKTPVIRR---RKSTKAA 841

Query: 234  QRNLKETKNTPKRRTREKLDFEGADNLAAETPKDNLGPDSSGKIATATPSRTLGKAVTQG 55
            +  L ET+NTPKRR R KL+ E ++   A++ +    P S G+    T  R LG+ VTQ 
Sbjct: 842  KEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGR-RHRTSYRKLGEEVTQK 900

Query: 54   LLESFRENEYPTRDKKEN 1
            L  SF+EN+YP R  KE+
Sbjct: 901  LYNSFKENQYPNRTTKES 918



 Score =  412 bits (1058), Expect = e-112
 Identities = 334/939 (35%), Positives = 445/939 (47%), Gaps = 88/939 (9%)
 Frame = -1

Query: 2730 SNAEENCQTIESSSPKQNTLQGNHALCSEHAQSEPVETK-ILGSNAQENCQTIESSSPKQ 2554
            S ++    T + S  KQ T +     C+E   SEP+E+K +LGS A EN +  E+S P  
Sbjct: 10   SPSQVTSSTRDHSCLKQTTPEQMCKFCAECIHSEPLESKSLLGSEAVEN-EPRETSIPNS 68

Query: 2553 NILQGKLELGSEHAQSEPMETKIVESNAQESYKTIESSSLKQNTLQGKHELGSEHAQSEP 2374
              LQ     G     S          + ++S +T ++S  +QNT + KH  G+E   +E 
Sbjct: 69   EKLQA--FCGDVPDSSFTDHLAPPSEDMRKSTQTNKASCSQQNTSEQKH--GTELMHNEQ 124

Query: 2373 MEMKIVEPSAQENCQTI-----ESSSLKQN-TSQGKHELGSEHAQSEPMETKIVESSARE 2212
             E K       + C  I     +++SL  N T Q   +  S+ +Q+     + ++  +  
Sbjct: 125  SEQK------HQLCYQIVFDKPQATSLVDNATLQPVSKDVSKSSQTG--NRQALDFLSGN 176

Query: 2211 NCQTIESSSLKQNRLQGKHELGSEHAQSEPM--ETKIVESSVLENCQKIE---------- 2068
             C  ++   +    L  KH+LGSE  Q+EP     ++    V EN Q I           
Sbjct: 177  RCNELDVDCVHSEPLNQKHQLGSEIIQNEPAVNVARLPSDGVEENLQTISEDLTKVCPVE 236

Query: 2067 -SSSPKQNTTQEKQNGQTT---ESSSPQKIMFG-------------------EKNELG-- 1963
             S SP ++  +  Q G+ +   +SSS Q   F                    E+ ELG  
Sbjct: 237  PSQSPPRDANKSCQAGEISCLQQSSSEQTPEFTPGISSHEPSVVNYKLGSQLEQTELGET 296

Query: 1962 --------LEHV------QSETLEIRIV--------------GSDSVD----IKQLGTPP 1879
                    LE V      Q + LE+ I                SD ++    ++Q  TPP
Sbjct: 297  SAGELGASLELVVKSSIEQLKQLEVPITIPSTKTSATKHLQSSSDLMEKKSCLEQSETPP 356

Query: 1878 KDAAKSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPS---ISANV 1708
               A ++    RK  + +++ LK  Y +RS I S  VL+S S     PP+ S      N 
Sbjct: 357  NYVANNSACLGRKGKR-ATKSLKNNYTVRSLIGSDRVLRSRSGERPLPPESSNNLADVNS 415

Query: 1707 XXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIK 1528
                           K + DE+SR R  LRYLLNR+ YE NLIDAYS EGWKGLS+EK+K
Sbjct: 416  IGERKQKKRNKIRRKKIVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLK 475

Query: 1527 PEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKD 1348
            PE+ELQRATSEILR KLKIRDLFQ LDSLCA G    SLFDSEGQI SEDI+CAKCGSKD
Sbjct: 476  PEKELQRATSEILRRKLKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKD 534

Query: 1347 LSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGT 1168
            LS DNDIILCDG CDRGFH  CL PPL K+ IPP+DEGWLCPGCDCKVDCI L+N++QGT
Sbjct: 535  LSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGT 594

Query: 1167 NLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQ-----XXX 1003
             L I D WEKVFPEAAA   G N D NFG             DG   DE+ +        
Sbjct: 595  RLFITDNWEKVFPEAAA---GHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDG 651

Query: 1002 XXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQI--KKXXXXXXXX 829
                         ++ A   ++ YLGL                 D+++  +         
Sbjct: 652  SSSDDSDFTSTSDEVEAPADDKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFT 711

Query: 828  XXXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEP 649
                   A+ +   SSG +       +GA    G SN +        +S+N ELLSI++P
Sbjct: 712  SDSEDLAAVLEDNRSSGND-------EGAASPLGHSNGQRYKDGGNNESLNNELLSIIKP 764

Query: 648  GPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXXXEAPRKRKNDGGIGNT 469
            G  Q  A PV  KR+ ERLDYKKL+DE YGNVP              PRKR      G++
Sbjct: 765  G--QDGAVPVYGKRSSERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSS 822

Query: 468  SVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPTRRTRQKLDSDGAYNIAAETHKDSLGPD 289
            +   P G                 L + E+TP RR R KL+++ +    A++H+    P 
Sbjct: 823  A--SPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPG 880

Query: 288  YAVKVATASASRKNTGSTQR--NLKETKNTPKRRTREKL 178
               +    S  +     TQ+  N  +    P R T+E L
Sbjct: 881  SRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESL 919


>ref|XP_007200058.1| hypothetical protein PRUPE_ppa023106mg [Prunus persica]
            gi|462395458|gb|EMJ01257.1| hypothetical protein
            PRUPE_ppa023106mg [Prunus persica]
          Length = 1058

 Score =  428 bits (1100), Expect = e-117
 Identities = 319/905 (35%), Positives = 429/905 (47%), Gaps = 54/905 (5%)
 Frame = -1

Query: 2730 SNAEENCQTIESSSPKQNTLQGNHALCSEHAQSEPVETKILGSNAQENCQTIESSSPKQN 2551
            S+    CQ  E S+  Q    G+ + CSE  + E +  K++ +   E C+   +   +  
Sbjct: 15   SSQTAKCQ--EESTSGQIHEIGSESQCSEKTK-ENIGCKVVQNELLEICKASNNPDEQSQ 71

Query: 2550 ILQGKLELGSEHAQSEPMETKIVESNAQESYKTIESSSLK-------------------- 2431
                 L   S H ++  +  + V+ ++Q   + +  +SL                     
Sbjct: 72   SFSENLTENS-HVENLGLPAEDVDKSSQNGAQNVTKNSLTEQLEMPREDPDVNNQSDKTS 130

Query: 2430 ---QNTLQGKHELGSEHAQSEPMEMKIVEPS---AQENCQTIESSSLKQNTSQGKHELGS 2269
               Q +L+  ++ G   + SEP E +    S     E  QTI    +   + Q +    +
Sbjct: 131  CSGQMSLEQTNDSGFGTSSSEPAEERHPSGSFCVQNELLQTIMPLPICGGSEQVQPISEN 190

Query: 2268 EHAQSEPMETKIVESSARENCQTIESSSLKQNRLQGKHELGSEHAQSEPMETKIVESSVL 2089
             +  S   +  +      + CQT + S   Q      +E GS    SEP + K    SV 
Sbjct: 191  VNMASLNDQAGLPPEDVSKTCQTQKISCPHQITSHQINEFGSGSVPSEPAKQKDQLDSVP 250

Query: 2088 ENCQKIESSSPKQNTTQEKQNGQTTESSSPQKIMFGEKNEL-----GLEHVQSETLEIRI 1924
                + ++S    ++T  +Q G + E+ +    +   +  L      L   + E L   +
Sbjct: 251  AQNDEAKTSKAVSSSTVFEQPGPSIEAMTEDSPIGHSEPPLEDLSKSLSDKEMEPLPEDV 310

Query: 1923 VGSDSVDIKQLGTPPKDAAK-STNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHG 1747
              + S  ++QL T  K+A K S+    + K    SR  KRKYM RS + S  VL+S +  
Sbjct: 311  TQNSS--LQQLETASKNALKISSCLGPKDKKNPKSR--KRKYMSRSFVRSDRVLRSKTGE 366

Query: 1746 TSKPPDPSISANVXXXXXXXXXXXXXXXK--------------TLNDEFSRTRKRLRYLL 1609
              KP D  +S NV               +               + DEFSR R  LRYLL
Sbjct: 367  KEKPKDLKLSNNVATLESSNSIANVSNGEEKKRKKRKNRRDNRAIADEFSRIRTHLRYLL 426

Query: 1608 NRMGYEHNLIDAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEG 1429
            NR+GYE +LIDAYSGEGWKG S+EK+KPE+ELQRATSEILR KLKIRDLFQ L+SLCAEG
Sbjct: 427  NRIGYEKSLIDAYSGEGWKGSSLEKLKPEKELQRATSEILRRKLKIRDLFQRLESLCAEG 486

Query: 1428 RLQPSLFDSEGQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIP 1249
                SLFDSEGQI SEDIFC KCGSKD+S DNDIILCDG CDRGFH  CL PPL  + IP
Sbjct: 487  MFPESLFDSEGQIDSEDIFCGKCGSKDVSLDNDIILCDGACDRGFHQFCLEPPLLSEDIP 546

Query: 1248 PEDEGWLCPGCDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXX 1069
            P+DEGWLCPGCDCKVDCI LLND QGT+LS+ D WEKVFPEAAA  S     DN G    
Sbjct: 547  PDDEGWLCPGCDCKVDCIDLLNDSQGTDLSVTDSWEKVFPEAAAAASAGENQDNHGLPSD 606

Query: 1068 XXXXXXXXXDGPELDERVQXXXXXXXXXXXXXXXXDLGA-SPHEEQYLGLXXXXXXXXXX 892
                     DGPE D +VQ                 L     ++EQYLGL          
Sbjct: 607  DSDDNDYDPDGPETDNKVQGEESSSDESEYASASDGLETPKSNDEQYLGLPSEDSEDDDY 666

Query: 891  XXXXXXXDEQIKKXXXXXXXXXXXXXXXALSDGKGSSGTNV--LVHSSVDGAKLLTGSSN 718
                   +E +K+               A  D    S  +V     +S+D +K   GS  
Sbjct: 667  NPYAPDVNEDVKQESSSSDFTSDSEDLGAALDDNIMSSEDVEGPKSTSLDDSKPHRGSGE 726

Query: 717  RRSQMGRKKKQSINAELLSILEPGPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXXXX 538
            + S  G +KK S+  EL+S+LE GP QG ++P+  KR+ ERLDYK+LHDEAYGNVP    
Sbjct: 727  QSSISG-QKKHSLKDELISLLESGPGQGESAPLSGKRHIERLDYKRLHDEAYGNVPTDSS 785

Query: 537  XXXXXXXXEAPRKRKNDGGIGNTSVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPTRRTR 358
                       RKRK   G G  +   P G                ++ + E+TP R   
Sbjct: 786  DDEDWNDIATQRKRKK--GTGQVANRSPNGKTSNIKNGVITKDIKPDVDENENTPRRMPH 843

Query: 357  QKLDSDGAYNIAAETHKDSLGPDYAVKVATASA---SRKNTGSTQRNLKETKNT--PKRR 193
            +K + +   N++ ++ K S         A +S    SR    +TQR  K  K    P R 
Sbjct: 844  RKSNVEDTSNLSNKSPKGSTKSGSTSGRAGSSRSTYSRLGEAATQRLCKSFKENHYPDRS 903

Query: 192  TREKL 178
             +E L
Sbjct: 904  MKESL 908


>gb|EXB76647.1| Homeobox protein [Morus notabilis]
          Length = 1031

 Score =  419 bits (1077), Expect = e-114
 Identities = 333/952 (34%), Positives = 450/952 (47%), Gaps = 87/952 (9%)
 Frame = -1

Query: 2595 QENCQTIESSSPKQNILQGKLELG-SEH---AQSEPMETKIVESNAQESYKTIE-----S 2443
            Q N QT + + P+QN+L+   E   SEH   ++++ +E    ES+   ++   E     S
Sbjct: 13   QVNGQTAKCTHPEQNMLEQVKETEPSEHIHISEAQTVENLRQESSIAATHVFCEQLQPVS 72

Query: 2442 SSLKQNTL-----QGKHELG----------SEHAQSEPM-----------EMKIVEPSAQ 2341
             S+ +N+L     Q    L           ++H +S P            E+++ +  A 
Sbjct: 73   DSVAKNSLPEAPGQQNEPLEPIAAVGMCSTNKHPESSPRNLTNDSLCNSKELELPQKDAG 132

Query: 2340 ENCQTIESSSLKQNTSQGKHELGSEHAQSEPMETKI-----VESSARENCQTIESSSLKQ 2176
             N QT + S  K+  ++ KH  GS +  +E   TK      +++  R+   T+ S    Q
Sbjct: 133  ANGQTEKISFPKEAAAEVKHGCGSGNLCAEQAGTKNDVDSNLQNEIRKTDITVSSFVFTQ 192

Query: 2175 NR--LQGKHELGSEHAQSEPMETKIVESSVLENCQKIESSSPKQNTTQEKQN-------G 2023
                +  K  L S    + P E       V+ +CQ   SS P+Q+T  + ++       G
Sbjct: 193  KLEIVSEKRSLISGGNLAVPSE------DVVRHCQTENSSCPQQSTLGQIKDFDCGCLLG 246

Query: 2022 QTTESSSPQKIMFGEKNELGLEHVQSETLEIRIV----------------GSDSVDIKQL 1891
            +T +          +++ LG E VQ+  +E RI                 GSDS   KQ+
Sbjct: 247  ETPK----------QEDHLGTELVQNVLVETRIAASNGIVSEHLEPPVGDGSDSYIDKQV 296

Query: 1890 GTPPKDAAKST-------------NRSSR--KKNQVSSRPLKRKYMLRSSISSPTVLQSS 1756
              P +D +KS+             N+ S+  +K++ +S+  K++YMLRS + S  VL+S 
Sbjct: 297  EQPSEDVSKSSSLEQLETSSKSLVNKPSQLGRKDKQTSKSRKKQYMLRSLVHSDRVLRSR 356

Query: 1755 SHGTSKPPDPSIS-ANVXXXXXXXXXXXXXXXKT--LNDEFSRTRKRLRYLLNRMGYEHN 1585
            +    K  + S + +N+                T  + DEFSR RKRL+Y  NR+ YE N
Sbjct: 357  TQEKLKSHELSNTLSNIGNGVEKRMKERKKRRGTRVIADEFSRIRKRLKYFFNRIHYEQN 416

Query: 1584 LIDAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFD 1405
            LIDAYS EGWKG S+EK+KPE+ELQRA SEI R KLKIRDLFQ LDSLCAEGR   SLFD
Sbjct: 417  LIDAYSSEGWKGTSLEKLKPEKELQRAKSEIFRRKLKIRDLFQQLDSLCAEGRFPKSLFD 476

Query: 1404 SEGQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLC 1225
            SEGQI SEDIFCAKCGSKD+S +NDIILCDG CDRGFH  CL PPL  + IPP+DEGWLC
Sbjct: 477  SEGQIDSEDIFCAKCGSKDMSANNDIILCDGACDRGFHQFCLEPPLLSEDIPPDDEGWLC 536

Query: 1224 PGCDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAATG-SGDNPDDNFGFXXXXXXXXXX 1048
            PGCDCKVDC  LLND  GTNLS+ D WEKVFPEAAA    G + D N  F          
Sbjct: 537  PGCDCKVDCFDLLNDSYGTNLSVTDSWEKVFPEAAAAAREGKDQDHNLEFPSDDSEDDDY 596

Query: 1047 XXDGPELDERVQXXXXXXXXXXXXXXXXDL--GASPHEEQYLGLXXXXXXXXXXXXXXXX 874
               GPE+ E+V+                +L   A P +EQY GL                
Sbjct: 597  DPYGPEIVEKVEGDESSSDESEYTSACDELEGEAPPKDEQYFGLSSDDSEDNDFDPDDQD 656

Query: 873  XDEQIK-KXXXXXXXXXXXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGR 697
             DE  K +                L +G+ +    V   SS+D  + L  +  + S+ G 
Sbjct: 657  VDENAKQESSSSDFTSDSEDLAFTLDEGQIAEKDEV---SSLDPTRSLGNAVMQSSKRG- 712

Query: 696  KKKQSINAELLSILEPGPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXX 517
              K SI  ELL ILE G  Q  + P+  KR+ ERLDYK+LHDE YG++P           
Sbjct: 713  GNKSSIKDELLDILESGTGQDGSPPISGKRHVERLDYKRLHDETYGHLPSDSSDDEDWTD 772

Query: 516  XEAPRKRKNDGGIGNTSVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPTRRTRQKLDSDG 337
              APRKRK     G  S V P                  +L+D E  P RR+RQ      
Sbjct: 773  YAAPRKRKRT--TGQVSSVSPNENASIIKNQTTTDAANNDLEDNEYVPRRRSRQ------ 824

Query: 336  AYNIAAETHKDSLGPDYAVKVATASASRKNTGSTQRNLKETKNTPKRRTREKLDFEGADN 157
                                         N+  T  N     N P +             
Sbjct: 825  -----------------------------NSVVTDEN-----NIPNK------------- 837

Query: 156  LAAETPKDNLGPDSSGKIATATPSRTLGKAVTQGLLESFRENEYPTRDKKEN 1
            L   +PK      S+G+    + +R LG+AVTQ L +SF+EN+Y  R  KE+
Sbjct: 838  LLQGSPKSG----STGRRRELSTNRRLGEAVTQRLYQSFKENQYLDRATKES 885


>ref|XP_002313886.2| hypothetical protein POPTR_0009s09600g [Populus trichocarpa]
            gi|550331388|gb|EEE87841.2| hypothetical protein
            POPTR_0009s09600g [Populus trichocarpa]
          Length = 934

 Score =  409 bits (1051), Expect = e-111
 Identities = 311/933 (33%), Positives = 436/933 (46%), Gaps = 25/933 (2%)
 Frame = -1

Query: 2730 SNAEENCQTIESSSPKQNTLQGNHALCSEH------AQSEPVETKILGSNAQENCQTIES 2569
            S ++ + QT   S P Q  L+  H   +E+       +   +E++I+ + A ++   +  
Sbjct: 10   SPSQVSSQTKSYSCPSQTKLENTHGFTAEYNCGGYSEEKHKLESEIIQTEAGDSGTAVLQ 69

Query: 2568 SSPKQNILQGKLEL-GSEHAQSEPMETKIVESNAQESYKTIESSSLKQNTLQGKHELGSE 2392
            S   + +     ++  +     +P       +   ES + I  +++ Q    G   + + 
Sbjct: 70   SGAGETVEPSTEDVTNNSFTDLDPPPEDARGATFDESSRPI-LTAIDQKLEPGARSVNTA 128

Query: 2391 --HAQSEPMEMKIVEPSAQENCQTIESSSLKQNTSQGKHELGSEHAQSEPMETKIVESSA 2218
              H +S       +      N     SS +   TSQ   E  + ++ +E +     ++S 
Sbjct: 129  CTHGESSKATDSGILQDEPGNTNAASSSCIANETSQASLENLANNSCTEDVGPPYGDASK 188

Query: 2217 RENCQTIESSSLKQNRLQGKH-ELGSEHAQSEPMETKIVESSVLENCQKIESSSPKQNTT 2041
                  I+ SS  Q  + G   EL S+ A  E  E +    S L       S +      
Sbjct: 189  GNQ---IDKSSYPQQTISGHALELLSDRACCERSEERQKPGSEL-------SENESTGID 238

Query: 2040 QEKQNGQTTESSSPQKIMFGEKNELGLEHV------------QSETLEIRIVGSDSVDIK 1897
             E  +G   E+S P   +  +++ +  +HV              +T           D +
Sbjct: 239  TELYSGIAIENSEPLTQLVTKRSPI--KHVGLLPGDSIIIPANEQTRPTHDDEDKGPDHE 296

Query: 1896 QLGTPPKDAAKSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPSIS 1717
             L TP + A   T R  R + + +SR  ++ YMLRS  SS  VL+S S    K P+ S +
Sbjct: 297  HLETPSRVAIGITRRG-RPRGKSASRLSRKIYMLRSLRSSDRVLRSRSQEKPKAPESSNN 355

Query: 1716 A---NVXXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGL 1546
            +   N                  + DE+S+ R  LRYLLNRM YE +LI AYSGEGWKGL
Sbjct: 356  SGNVNSTGDKKGKRRKKRRGKNIVADEYSKIRAHLRYLLNRMSYEQSLITAYSGEGWKGL 415

Query: 1545 SMEKIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCA 1366
            S+EK+KPE+ELQRATSEI R K+KIRDLFQH+DSLC+EGR   SLFDSEGQI SEDIFCA
Sbjct: 416  SLEKLKPEKELQRATSEITRRKVKIRDLFQHIDSLCSEGRFPSSLFDSEGQIDSEDIFCA 475

Query: 1365 KCGSKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLL 1186
            KCGSKDL+ DNDIILCDG CDRGFH  CL PPL ++ IPP+DEGWLCPGCDCKVDCI LL
Sbjct: 476  KCGSKDLNADNDIILCDGACDRGFHQFCLIPPLLREDIPPDDEGWLCPGCDCKVDCIGLL 535

Query: 1185 NDVQGTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQXX 1006
            ND QGTN+SI D WEKVFPEAAAT SG   D NFG             DGP++D++ Q  
Sbjct: 536  NDSQGTNISISDSWEKVFPEAAATASGQKLDHNFGPSSDDSDDNDYEPDGPDIDKKSQEE 595

Query: 1005 XXXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXX 826
                          +  A P  ++YLGL                 +E++K+         
Sbjct: 596  ESSSDESDFTSASDEFKAPPDGKEYLGLSSDDSEDDDYDPDAPVLEEKLKQESSSSDFTS 655

Query: 825  XXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEPG 646
                  A  +G G S  +   H  ++   +   S+ R+S+   KK QS+N+ELLS+LEP 
Sbjct: 656  DSEDLAATINGDGLSLEDE-CHMPIEPRGV---SNGRKSKFDGKKMQSLNSELLSMLEPD 711

Query: 645  PSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXXXEAPRKRKNDGGIGNTS 466
              Q  ++ V  KRN +RLDYKKL+DE YGN+               PRKR+ +   G+ +
Sbjct: 712  LCQDESATVSGKRNVDRLDYKKLYDETYGNI--STSSDDDYTDTVGPRKRRKN--TGDVA 767

Query: 465  VVPPKGXXXXXXXXXXXXXXXXNLKDIESTPTRRTRQKLDSDGAYNIAAETHKDSLGPDY 286
             V   G                 LK+ +  P R T Q                       
Sbjct: 768  TVTANGDASVTENGMNSKNMNQELKENKRNPERGTCQ----------------------- 804

Query: 285  AVKVATASASRKNTGSTQRNLKETKNTPKRRTREKLDFEGADNLAAETPKDNLGPDSSGK 106
                               + +ET  +P +       + GA +L+  + K ++ P +  K
Sbjct: 805  -----------------NSSFQETNVSPAK------SYVGA-SLSGSSGK-SVRPSAYKK 839

Query: 105  IATATPSRTLGKAVTQGLLESFRENEYPTRDKK 7
            +         G+AVTQ L   FREN+YP R  K
Sbjct: 840  L---------GEAVTQRLYSYFRENQYPDRAAK 863


>ref|XP_002300247.2| homeobox family protein [Populus trichocarpa]
            gi|550348560|gb|EEE85052.2| homeobox family protein
            [Populus trichocarpa]
          Length = 930

 Score =  408 bits (1049), Expect = e-111
 Identities = 318/965 (32%), Positives = 434/965 (44%), Gaps = 16/965 (1%)
 Frame = -1

Query: 2853 SDAKNCETMESSSPKHNTMQEKNELGSEHAQSG------PMETKIVESNAEENCQTIESS 2692
            S +K   T   S P   T++  +E  +E+   G       +E +I+++ A +N   +  S
Sbjct: 10   SPSKVSHTKSYSCPAQTTLENTHEPSAEYKFGGYPEERHKLECEIIQTEAGDNRAAVLQS 69

Query: 2691 SPKQNTLQGNHALCSEHAQSEPVETKILGSNAQENCQTIESSSPKQNILQGKLELGSEHA 2512
               +        L                S   +N     S++  Q +  G   + +   
Sbjct: 70   CSGEVVQPSTDDLTKSPLIDLDPPPDDARSALFDNSPRPISTAMDQKLEPGATSVNTACV 129

Query: 2511 QSEPMETKIVESNA----QESYKTIESSSLKQNTLQGKHELGSEHAQSEPMEMKIVEPSA 2344
             SE   +K ++S+       +  T  SS +   T Q   E  +  +++E   + +VE S 
Sbjct: 130  HSE--SSKAIDSSILLDEPRNSNTELSSCIANETSQASLEGLANDSRAEDAGLSLVEAS- 186

Query: 2343 QENCQTIESSSLKQNTSQGK-HELGSEHAQSEPMETKIVESSARENCQTIESSSLKQNRL 2167
              N   I+ SS  Q T+ G+  E  S+ A  +P+E +                       
Sbjct: 187  --NSDLIDESSYSQQTTSGQTREFHSDRACCKPLEER----------------------- 221

Query: 2166 QGKHELGSEHAQSEPMETKIVESS--VLENCQKIESSSPKQNTTQEKQNGQTTESSSPQK 1993
                + GSE A++E ME  I   S   +EN + +     K    +        + S P  
Sbjct: 222  ---QKPGSELAENESMEIGIGLPSGIAIENLEPLTELVTKSCPIKHIGLPPGDDISIPAN 278

Query: 1992 IMFGEKNELGLEHVQSETLEIRIVGSDSVDIKQLGTPPKDAAKSTNRSSRKKNQVSSRPL 1813
                  ++   ++   E LE   +    + I   G P   + K T++ S KK   SSR  
Sbjct: 279  EQIRPTHDKESKYPDCEHLEK--LSGIVIGITSQGVP---SVKRTSKLSGKKYTSSSR-- 331

Query: 1812 KRKYMLRSSISSPTVLQSSSHGTSKPPDPS---ISANVXXXXXXXXXXXXXXXKTLNDEF 1642
                       S  VL+S+S    K P+PS    + N                  + DE+
Sbjct: 332  ----------KSDRVLRSNSQEKPKAPEPSNNSTNVNSTGEEKGKRRKKRRGKSIVADEY 381

Query: 1641 SRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLKIRDL 1462
            SR R RLRYLLNRM YE +LI AYSGEGWKGLS+EK+KPE+ELQRATSEI+R K+KIRDL
Sbjct: 382  SRIRARLRYLLNRMSYEQSLITAYSGEGWKGLSLEKLKPEKELQRATSEIIRRKVKIRDL 441

Query: 1461 FQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGFHLMC 1282
            FQH+DSLC EGR   SLFDSEGQI SEDIFCAKCGSKDL+ DNDIILCDG CDRGFH  C
Sbjct: 442  FQHIDSLCGEGRFPASLFDSEGQIDSEDIFCAKCGSKDLTADNDIILCDGACDRGFHQFC 501

Query: 1281 LNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAATGSGD 1102
            L PPL ++ IPP DEGWLCPGCDCKVDCI LLND QGTN+SI D+W+ VFPEAAA  SG 
Sbjct: 502  LVPPLLREDIPPGDEGWLCPGCDCKVDCIDLLNDSQGTNISISDRWDNVFPEAAAVASGQ 561

Query: 1101 NPDDNFGFXXXXXXXXXXXXDGPELDERVQXXXXXXXXXXXXXXXXDLGASPHEEQYLGL 922
              D NFG             DGP++DE+ Q                +  A P ++QYLGL
Sbjct: 562  KLDYNFGLSSDDSDDNDYDPDGPDIDEKSQ-EESSSDESDFSSASDEFEAPPDDKQYLGL 620

Query: 921  XXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXXXXXXALSDGKGSSGTNVLVHSSVDGA 742
                             +E++K+               A  +G G S      H  ++  
Sbjct: 621  PSDDSEDDDYDPDAPVLEEKLKQESSSSDFTSDSEDLDATLNGDGLS-LGDEYHMPIEPH 679

Query: 741  KLLTGSSNRRSQMGRKKKQSINAELLSILEPGPSQGNASPVLSKRNRERLDYKKLHDEAY 562
            +    S+ RRS+ G KK  S+N++LLS+LEP   Q  ++PV  KRN ERLDYKKL+DE Y
Sbjct: 680  E---DSNGRRSRFGGKKNHSLNSKLLSMLEPDSHQEKSAPVSGKRNIERLDYKKLYDETY 736

Query: 561  GNVPXXXXXXXXXXXXEAPRKRKNDGGIGNTSVVPPKGXXXXXXXXXXXXXXXXNLKDIE 382
            GN+              APRKR+ +   G+ ++    G                 LK  E
Sbjct: 737  GNI--STSSDDDYTDTVAPRKRRKN--TGDVAMGIANGDASVTENGLNSKNMNQELKKNE 792

Query: 381  STPTRRTRQKLDSDGAYNIAAETHKDSLGPDYAVKVATASASRKNTGSTQRNLKETKNTP 202
             T  R                 TH++S   D  V  A        +GS+ + ++      
Sbjct: 793  HTSGR-----------------THQNSSFQDTNVSPAKTHVGESLSGSSSKRVR------ 829

Query: 201  KRRTREKLDFEGADNLAAETPKDNLGPDSSGKIATATPSRTLGKAVTQGLLESFRENEYP 22
                                                +  + LG+AVTQ L   F+EN YP
Sbjct: 830  -----------------------------------PSAYKKLGEAVTQKLYSFFKENRYP 854

Query: 21   TRDKK 7
             +  K
Sbjct: 855  DQAAK 859


>ref|XP_007042568.1| Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain,
            putative isoform 1 [Theobroma cacao]
            gi|590687101|ref|XP_007042569.1| Homeodomain-like protein
            with RING/FYVE/PHD-type zinc finger domain, putative
            isoform 1 [Theobroma cacao] gi|508706503|gb|EOX98399.1|
            Homeodomain-like protein with RING/FYVE/PHD-type zinc
            finger domain, putative isoform 1 [Theobroma cacao]
            gi|508706504|gb|EOX98400.1| Homeodomain-like protein with
            RING/FYVE/PHD-type zinc finger domain, putative isoform 1
            [Theobroma cacao]
          Length = 950

 Score =  404 bits (1039), Expect = e-109
 Identities = 315/916 (34%), Positives = 429/916 (46%), Gaps = 37/916 (4%)
 Frame = -1

Query: 2814 PKHNTMQEKNELGSEHAQSGPMETK------IVESNAEENCQTIESSSPKQNTLQGNHAL 2653
            P+ +T ++ +E GSE+  +   E K        ++ + EN   + SS        G H  
Sbjct: 24   PEESTSEQAHEFGSEYLLTELSENKNQCGYAATQNESAENATGVSSS--------GVHER 75

Query: 2652 CSEHA--QSEPVETKILGSNAQENCQTIESSSPKQNILQGKLELGSEHAQSEPMETK--- 2488
              E+    S P  + +L      +  T +S   ++ +     E   E+ ++E  E K   
Sbjct: 76   SPEYVAKNSSPERSGLLPKGVMGHNHTDKSFYAQETVSGKTHEYDCEYVRTETSEEKHQP 135

Query: 2487 ---IVESNAQESYKTIESSSLKQNTLQGKHELGSEHAQSEPMEMKIVEPSAQE------N 2335
               IV++  +E+   +    L    LQ   E  SE+A +E + + + E S++       +
Sbjct: 136  GSEIVQNELEEACSLV--CDLPAKNLQTFSEGLSENAITESLGL-LPEDSSKHTKTDKLS 192

Query: 2334 CQTIESSSLKQNTSQGK--HELGSEHAQSEPMETKIVESSARENCQTIESSSLKQNRLQG 2161
            C  + SS    N   G    ELG    Q + ++++ + +   E+   + SS++    LQ 
Sbjct: 193  CPQLVSSEPTVNFGSGNVCKELGESPEQRQQLDSESLPNGIEESTIAV-SSNVSNQALQL 251

Query: 2160 KHE-LGSEHAQSEPMETKIVESSVLENCQKIESSSPKQNTTQEKQNGQTTESSSPQKIMF 1984
            K E +G  H                         SP +  T   Q+     S SP     
Sbjct: 252  KPEDMGKSHCGGH-------------------LHSPPEGVTNVIQS-----SKSPLVEPL 287

Query: 1983 GEKNEL--GLEHVQSETLEIRIVGSDSVDIKQLGTPPKDAAKSTNRSSRKKNQVSSRPLK 1810
            G   E   G    Q   L    +  +S  ++Q  T PK+  ++   S R++N  +S+ +K
Sbjct: 288  GLPQEFAQGNPSTQQSGLPCEDMAQNS-GVEQHETKPKNLLEN---SGRRRNGKTSKTIK 343

Query: 1809 RKYMLRSSISSPTVLQSSSHGTSKPPDPSIS-ANVXXXXXXXXXXXXXXXKT--LNDEFS 1639
            +KYMLRS  SS  VL+S      K  + S + A+V                   + DEFS
Sbjct: 344  KKYMLRSLRSSDRVLRSKLQEKPKATESSNNLADVGSSEQQKRRKRRRRKANREVADEFS 403

Query: 1638 RTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLKIRDLF 1459
            R R  LRYLLNR+ YE +LI AYS EGWKGLS+EK+KPE+ELQRATSEILR KLKIRDLF
Sbjct: 404  RIRTHLRYLLNRINYERSLIAAYSTEGWKGLSLEKLKPEKELQRATSEILRRKLKIRDLF 463

Query: 1458 QHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGFHLMCL 1279
            QH+DSLCAEG+L  SLFDSEGQI SEDIFCAKCGSKDLS +NDIILCDG CDRGFH  CL
Sbjct: 464  QHIDSLCAEGKLPESLFDSEGQIDSEDIFCAKCGSKDLSANNDIILCDGACDRGFHQYCL 523

Query: 1278 NPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAATGSGDN 1099
             PPL K+ IPP+DEGWLCPGCDCKVDCI L+N+ QGT+ SI D WEKVFPEAA   +G N
Sbjct: 524  QPPLLKEDIPPDDEGWLCPGCDCKVDCIELVNESQGTSFSITDSWEKVFPEAAVAAAGQN 583

Query: 1098 PDDNFGFXXXXXXXXXXXXDGPELDERVQXXXXXXXXXXXXXXXXDLGASPHEEQYLGLX 919
             D NFG             DG E DE+                  +L      +QYLGL 
Sbjct: 584  QDPNFGLPSDDSDDNDYNPDGSETDEKDHGDESSSEESEFTSTSEELEVPAKVDQYLGLP 643

Query: 918  XXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXXXXXXALSDGKGSSGTNVLVHSSVDGAK 739
                            DE +K                       S   + ++   +   K
Sbjct: 644  SDDSEDDDYDPDGPNHDEVVKPESSSSDF------------SSDSEDLDAMLEEDITSQK 691

Query: 738  ---LLTGSSNRRSQMGRKK---KQSINAELLSILEPGPSQGNASPVLSKRNRERLDYKKL 577
                +  S+ R S+  + K   K+S+N ELLSI+EP  S+ + S +  KR+ ERLDYK+L
Sbjct: 692  DEGPMANSAPRDSKRRKPKLGEKESMNDELLSIMEPA-SEQDGSAISKKRSIERLDYKRL 750

Query: 576  HDEAYGNVPXXXXXXXXXXXXEAPRKRKNDGGIGNTSVVPPKGXXXXXXXXXXXXXXXXN 397
            +DE YGNVP             APRKR         +  P  G                N
Sbjct: 751  YDETYGNVPSSSSDDEDWSDITAPRKRNK--CTAEVASAPENGNVSVSRTVSVSDGLKQN 808

Query: 396  LKDIESTPTRRTRQKLDSDGAYNIAAETH-KDSLGPDYAVKVATASASRKNTGSTQRNLK 220
             ++ E  P R+TRQ        +  AE     S+      K  +++  R      QR  K
Sbjct: 809  PEETEHKPRRKTRQMSRFKDTDSSPAEIQGNTSVSGSSGKKAGSSTYKRLGEAVKQRLYK 868

Query: 219  ETKNT--PKRRTREKL 178
              K    P R T++ L
Sbjct: 869  SFKENQYPDRATKQSL 884


>ref|XP_004289744.1| PREDICTED: uncharacterized protein LOC101296723 [Fragaria vesca
            subsp. vesca]
          Length = 1227

 Score =  389 bits (999), Expect = e-105
 Identities = 317/993 (31%), Positives = 433/993 (43%), Gaps = 60/993 (6%)
 Frame = -1

Query: 2802 TMQEKNELGSEHAQSGPMETKI-VESNAEENCQTIESSSPKQNTLQGNHALCSEHAQSEP 2626
            T +EK  +GS+  Q+  ++T I V S   E  Q I        T + N    +E A+  P
Sbjct: 196  TREEKQHIGSDSVQNELLQTSIPVSSCGSEQLQLI--------TERVNVTFPNEQARVPP 247

Query: 2625 VETKILGSNAQENCQTIESSSPKQNILQGKLELGSEHAQSEPMETKIVESNAQESYKTIE 2446
             +     S   E        +    + + K   GS+  Q+E ++++      +    + +
Sbjct: 248  KDV----SKTLEPTNNFGFGTSSSVLAEDKQHCGSDVPQNEALQSQYWTRTIRLEVPSCD 303

Query: 2445 SSSLKQNTLQGKHELGSEHAQSEPMEMKIVEPSAQENCQTIESSSLKQNTSQGKHELGSE 2266
             +    +  +       +     P E   ++ + ++       +S  Q T + ++ LGS 
Sbjct: 304  GNEPLLSFCENVDICSLDEKAGLPCEG--LKKTLKQINDVSSGTSYSQPTEENQN-LGSS 360

Query: 2265 HAQSEPMET--KIVESSAREN---------------------------CQTIESSSLKQN 2173
              Q EP++T   +V S   E                            CQT + S     
Sbjct: 361  FVQDEPLQTIIPVVSSGGNEQLRVVNENVSVPSLGEQAGLLPEAVSKTCQTDKLSRSLHT 420

Query: 2172 RLQGKHELGSEHAQSEPMETKIVESSVLENCQKIESSSPKQNTTQEKQNGQTTESSS--- 2002
                 +E GS   Q EP E +    S+     ++++S+   ++   +Q+G + +  +   
Sbjct: 421  ASDQINESGSGSVQCEPQEQRDQLGSLPSQNDQVKNSTAVSSSIGFEQSGPSVDEMNNSV 480

Query: 2001 -------PQKIMFGEKNELGLEHVQSETLEIRIVGSDSVDIKQLGTPPKDAAKSTNRSSR 1843
                   P+        EL   H    T    +  S+        T  K+A+K++ +   
Sbjct: 481  IGHLEPPPEDASKDHNKELIKPHTNDATQNSCLEPSE--------TASKNASKNSTQFGC 532

Query: 1842 KKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPSISANVXXXXXXXXXXXXXXX 1663
            K  + SS   K     RS +SS  VL+S +  + KP  P +S NV               
Sbjct: 533  KDKRNSSSRRKS----RSLVSSDRVLRSRT--SEKPEAPELSNNVATLDTSNSVANVSNE 586

Query: 1662 K--------------TLNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIKP 1525
            K                 DEFSR R  LRY LNR+ YE +LIDAYS EGWKG S+EK+KP
Sbjct: 587  KEGKRKKRKKKHRERVAADEFSRIRSHLRYFLNRINYEKSLIDAYSSEGWKGNSLEKLKP 646

Query: 1524 EQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKDL 1345
            E+ELQRATSEILR K KIRDLFQ LDSLCAEG    SLFD EGQI SEDIFCAKCGS D+
Sbjct: 647  EKELQRATSEILRRKSKIRDLFQRLDSLCAEGMFPESLFDEEGQIDSEDIFCAKCGSLDV 706

Query: 1344 SDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGTN 1165
              DNDIILCDG CDRGFH  CL PPL  ++IPP+DEGWLCPGCDCKVDCI LLND QGT+
Sbjct: 707  YADNDIILCDGACDRGFHQHCLEPPLLSEEIPPDDEGWLCPGCDCKVDCIDLLNDSQGTD 766

Query: 1164 LSIDDKWEKVFPEAA-ATGSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQXXXXXXXX 988
            LSI D WEKVFPEAA A  +G + ++N G             DGPE DE VQ        
Sbjct: 767  LSITDSWEKVFPEAAVAASAGQHQENNQGLPSEDSDDDDYDPDGPETDEEVQEGESSSDE 826

Query: 987  XXXXXXXXDLGA-SPHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXXXXX 811
                     L     ++EQYLG+                  E +K+              
Sbjct: 827  SEYASASDGLETPKTNDEQYLGIPSDDSEDDDFNPDAPDPTEDVKQGSSSSDFTSDSEDL 886

Query: 810  XAL--SDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEPGPSQ 637
             A+   D K          S ++ + LL GS  + S+ G +K+  I  EL S++E  P Q
Sbjct: 887  AAVLDEDRKSFENGEGPQSSVLEASTLLRGSGGKGSKRG-QKRHFIKDELSSLIESDPGQ 945

Query: 636  GNASPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXXXEAPRKRKNDGGIGNTSVVP 457
              ++PV  KR+ ERLDYKKLHDE YG++P              PRKRK   G G  S   
Sbjct: 946  DGSTPVSGKRHVERLDYKKLHDEEYGDIP--TSDDEEYIETAVPRKRKK--GAGQVSPGS 1001

Query: 456  PKGXXXXXXXXXXXXXXXXNLKDIESTPTRRTRQKLDSDGAYNIAAETHKDSLGPDYAVK 277
             KG                +    E TP R  R+K          +  + +S  P+ ++K
Sbjct: 1002 LKGKPSTIKKGKTTKDIKDDPDKNEHTPRRTPRRK----------SSANDNSSSPNESLK 1051

Query: 276  VA--TASASRKNTGSTQRNLKETKNTPKRRTREKLDFEGADNLAAETPKDNLGPDSSGKI 103
             +  + S S +  GST R L                                        
Sbjct: 1052 SSPKSGSTSGRAKGSTYRRL---------------------------------------- 1071

Query: 102  ATATPSRTLGKAVTQGLLESFRENEYPTRDKKE 4
                     G+AVTQ L  SF+EN+YP R  KE
Sbjct: 1072 ---------GEAVTQRLYTSFKENQYPDRSMKE 1095


>ref|XP_002527467.1| Homeobox protein HAT3.1, putative [Ricinus communis]
            gi|223533107|gb|EEF34865.1| Homeobox protein HAT3.1,
            putative [Ricinus communis]
          Length = 896

 Score =  387 bits (995), Expect = e-104
 Identities = 254/588 (43%), Positives = 311/588 (52%), Gaps = 21/588 (3%)
 Frame = -1

Query: 1878 KDAAKSTNRSSRK-KNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPD-----PSIS 1717
            KDA  +++R  R+ K    SR   +KYMLR    S  V+Q  S    K P+     P++S
Sbjct: 187  KDAVSNSSRLGRRVKTTAKSR---KKYMLRCLRRSDRVMQYRSQEKPKAPESSTNLPNVS 243

Query: 1716 ANVXXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSME 1537
            +NV               +   DE+S  RK LRYLLNR+GYE +LI AYS EGWKGLS+E
Sbjct: 244  SNVEKTRKKKKKRERKSVEA--DEYSIIRKNLRYLLNRIGYEQSLITAYSAEGWKGLSLE 301

Query: 1536 KIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCG 1357
            K+KPE+ELQRATSEILR K KIRDLFQ +DSLC EGR   SLFDS+GQI SEDIFCAKCG
Sbjct: 302  KLKPEKELQRATSEILRRKSKIRDLFQRIDSLCGEGRFPESLFDSDGQISSEDIFCAKCG 361

Query: 1356 SKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDV 1177
            SKDL+ DNDIILCDG CDRGFH  CL PPL K+ IPP+D+GWLCPGCDCKVDCI LLN+ 
Sbjct: 362  SKDLTADNDIILCDGACDRGFHQYCLVPPLLKEDIPPDDQGWLCPGCDCKVDCIDLLNES 421

Query: 1176 QGTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQ-XXXX 1000
            QGTN+SI D WEKVFPEAAA   G NPD NFG             D PE+DE+ Q     
Sbjct: 422  QGTNISISDSWEKVFPEAAA--PGQNPDQNFGPPSDDSDDNDYDPDIPEIDEKSQGDESS 479

Query: 999  XXXXXXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXX 820
                        +L A P ++Q LGL                 D+ +K+           
Sbjct: 480  SDDSDDSDFTSDELEAPPGDKQQLGLSSEDSGDDDYDPDAPDLDDIVKEESSSSDFTSDS 539

Query: 819  XXXXALSDGKGSSGTNVLVHSSVDGAKLLTG----SSNRRSQMGRKKKQSINAELLSILE 652
                A  D    SG         D  ++  G    S+   S+ GRKKKQS+ +ELLSI E
Sbjct: 540  EDLAATLDNNELSGE--------DERRISVGTRGDSTKEGSKRGRKKKQSLQSELLSIEE 591

Query: 651  PGPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXXXEAPRKRKND----- 487
            P PSQ  ++P+  KRN ERLDYKKL+DE YGNV                 KR+       
Sbjct: 592  PNPSQDGSAPISGKRNVERLDYKKLYDETYGNVSSDSSDDEDFTDDVGAVKRRKSTQAAL 651

Query: 486  -GGIGNTSVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPTRRTRQKLDSDGAYNIAAETH 310
                GN SV                     +LK+ E  P +R+RQ+L S+       + H
Sbjct: 652  GSANGNASVT---------------DTGKQDLKETEYVP-KRSRQRLISENTSITPTKAH 695

Query: 309  KDSLGPDYAVKVATASASRK----NTGSTQRNLKETKNTPKRRTREKL 178
            + +       K    S  R+     T    R+ KE +  P R  +E L
Sbjct: 696  EGTSPSSSCGKTVRPSGYRRLGETVTKGLYRSFKENQ-YPDRDRKEHL 742


>emb|CAN68079.1| hypothetical protein VITISV_006312 [Vitis vinifera]
          Length = 611

 Score =  379 bits (974), Expect = e-102
 Identities = 274/656 (41%), Positives = 335/656 (51%), Gaps = 13/656 (1%)
 Frame = -1

Query: 2478 SNAQESYKTIESSSLKQNTLQGKHELGSEHAQSEPMEMKIVEPSAQENCQTIESSSLKQN 2299
            S AQES +T +SSS KQN L+   +L SE   SE  E K                     
Sbjct: 4    SPAQESNRTRKSSSPKQNILEEARKL-SESVCSESSEQK--------------------- 41

Query: 2298 TSQGKHELGSEHAQSEPMETKIVESSARENCQTIESSSLKQNRLQGKHELGSEHAQSEPM 2119
                     SE+ Q EP E   V S    NC   E S L    + G   LG   A     
Sbjct: 42   -------RXSENGQHEPAEISPVLS----NCIVTEQSELPPEDV-GDTILGLPPAD---- 85

Query: 2118 ETKIVESSVLENCQKIESSSPKQNTTQEKQNGQTTESSSPQKIMFGEKNELGLEHVQSET 1939
               + ++S+ E+       + K + T+  Q G   E  +   I+     +LG    QSE 
Sbjct: 86   ---VTKNSLXEHLGLPPEDAIKNDGTE--QLGXFPEVVTKSSII----EKLG----QSEP 132

Query: 1938 LEIRIVGSDSVDIKQLGTPPKDAAKSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQS 1759
                +     +D  Q G+ PKD A          N+ +++ +KRKY LRSS+S   VL+S
Sbjct: 133  PPENVARYSGLD--QSGSAPKDLA----------NKRTAKLVKRKYKLRSSVSGSRVLRS 180

Query: 1758 SSHGTSKPPDPSIS-ANVXXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNL 1582
             S    K   PS +  N                KT  DEF+R RK LRYLLNRM YE NL
Sbjct: 181  RSQEKPKASQPSDNFVNASASRERKGRKKKRMNKTTADEFARIRKHLRYLLNRMSYEQNL 240

Query: 1581 IDAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDS 1402
            IDAYS EGWKG S+EK+KPE+ELQRA+SEI R KL IRDLFQHLDSLCAEGR   SLFDS
Sbjct: 241  IDAYSAEGWKGQSVEKLKPEKELQRASSEISRRKLXIRDLFQHLDSLCAEGRFPESLFDS 300

Query: 1401 EGQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCP 1222
            EGQI SEDIFCAKC SKD+S DNDIILCDG CDRGFH  CL PPL K++IPP+DEGWLCP
Sbjct: 301  EGQIDSEDIFCAKCESKDMSADNDIILCDGACDRGFHQFCLEPPLLKEEIPPDDEGWLCP 360

Query: 1221 GCDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXX 1042
             CDCKVDC+ LLND QGT LS+ D WEKVFPEAAA  +G+N D+N GF            
Sbjct: 361  ACDCKVDCMDLLNDSQGTKLSVIDSWEKVFPEAAA--AGNNQDNNSGFSSDDSEDNDYDP 418

Query: 1041 DGPELDERVQ------------XXXXXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXXXX 898
            D PE+DE+ Q                            D+  SP+ EQ LGL        
Sbjct: 419  DCPEVDEKGQGDKSSSDKFDESDEFDESDESDFTSASDDMVVSPNNEQCLGLPSDDSEDD 478

Query: 897  XXXXXXXXXDEQIKKXXXXXXXXXXXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGSSN 718
                     DEQ+ +               A  D +  S        + DG        +
Sbjct: 479  DFDPDAPEIDEQVNQGSSSSDFTSDSEDFTATLDRRNFS-------DNEDGL-------D 524

Query: 717  RRSQMGRKKKQSINAELLSILEPGPSQGNASPVLSKRNRERLDYKKLHDEAYGNVP 550
             + + GRKKK ++  ELLS+LE    Q NA P+ +KR+ ERLDYKKLHD +  + P
Sbjct: 525  EQRRFGRKKKDTLKDELLSVLESNSGQDNA-PLSAKRHVERLDYKKLHDVSAHDSP 579


>ref|XP_004140812.1| PREDICTED: uncharacterized protein LOC101204775 [Cucumis sativus]
          Length = 1061

 Score =  374 bits (959), Expect = e-100
 Identities = 300/912 (32%), Positives = 420/912 (46%), Gaps = 35/912 (3%)
 Frame = -1

Query: 2631 EPVETKILGSNAQENCQTIESSSPKQNILQGKLELGSEHAQSEPMETKIVESNAQESYKT 2452
            E V+  +   N +E  ++  + S   N  +   E  +     E  E  + ES    + + 
Sbjct: 19   EAVQEAMASDNMEERDESTGTESRPNNNAEAVQEAKASDNMKERDENTVTESRPNNNAEA 78

Query: 2451 IESSSLKQNTLQGKHELGSEHAQSEPMEMKIVEPSAQENCQTIESSSLKQNTSQGKHELG 2272
             +      N      E   E+  +E    KI E + QE   ++E   L   +++ K+  G
Sbjct: 79   AQEGKASDNM-----EERDENTDTESRPNKIAE-AVQEAKASVEVEVLTCLSNEAKYS-G 131

Query: 2271 SEHAQSEPMETKIVESSARENCQTIESSSLKQNRLQGKHELGSEHAQSEPMETKIVESSV 2092
             +   + P      E S++ +    E + ++QN      ELGS +  SE  E      S 
Sbjct: 132  YQELGTTP------EFSSKIDGPDEEKAGVQQNM-----ELGSGYLLSELSEKDNQTISN 180

Query: 2091 LENCQKIESSSPKQNTTQEKQNGQTTESSSPQKIMFGEKNELGLEHVQSETLEIRI--VG 1918
              +  ++E+ +   N    K    + E  +    +  E +EL LE V    +E     +G
Sbjct: 181  HADNDRVEAGNLLSNDKDTKNLKLSIEDEATT--LLNECSELPLEDVTKNYIEKMNPPIG 238

Query: 1917 --SDSVDIKQLGTPPKDAAKSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGT 1744
              +    I+ L T P     S ++ S +K+++  +  K+ Y LRS +SS  VL+S +   
Sbjct: 239  DLTQITSIQSLETIP-----SNSQQSARKDKIFLKSKKKNYKLRSHVSSDRVLRSRTQEK 293

Query: 1743 SKPPDPS-----ISANVXXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLI 1579
            +K P+ S      +A                     DE+S  R  LRYLLNR+ YE +LI
Sbjct: 294  AKAPERSNDLNNFTAEEDGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLI 353

Query: 1578 DAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSE 1399
            +AYS EGWKG S +K+KPE+ELQRA++EI+R KLKIRDLFQ +D+LCAEGRL  SLFDSE
Sbjct: 354  EAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSE 413

Query: 1398 GQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPG 1219
            GQI SEDIFCAKCGSK+LS +NDIILCDGICDRGFH  CL PPL    IPP+DEGWLCPG
Sbjct: 414  GQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPG 473

Query: 1218 CDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXD 1039
            CDCK DC+ LLN+ QG+NLSI D WEKV+PEAAA  +G N D   G             D
Sbjct: 474  CDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPD 533

Query: 1038 GPEL-------------DERVQXXXXXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXXXX 898
             P+               ++                   L  S +++QYLGL        
Sbjct: 534  VPDTIDQDNELSSDESSSDQSNSDPSNSDTSGYASASEGLEVSSNDDQYLGLPSDDSEDN 593

Query: 897  XXXXXXXXXDEQIKKXXXXXXXXXXXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGSSN 718
                     DE +++               AL D   SS    LV S  +   +     N
Sbjct: 594  DYDPSVPELDEGVRQESSSSDFTSDSEDLAAL-DNNCSSKDGDLVSSLNNTLPV----KN 648

Query: 717  RRSQMGRKKKQSINAELLSILEPGPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXXXX 538
               Q     K +++ EL S+L+ GP +    PV  +R  ERLDYKKLHDE YGNVP    
Sbjct: 649  SNGQSSGPNKSALHNELSSLLDSGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPTDSS 708

Query: 537  XXXXXXXXEAPRKRKNDGGIGNTSVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPT--RR 364
                    ++   R  D G   T    PK                 +L ++++  +  RR
Sbjct: 709  DDTYGSTLDSSDDRGWDSG---TRKRGPK----TLVLALSNNGSNDDLTNVKTKRSYKRR 761

Query: 363  TRQKLDSDGAYNIAAETHKDSLGPDYAVKVATASASRKNTGSTQRNLK---ETKNTPKRR 193
            TRQK  +    N   ET  D+     +VK +T+S++R+ +      L    +    PKR 
Sbjct: 762  TRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLASFQENEYPKRA 821

Query: 192  TREKLDFEGADNLA------AETPKDNLGPDSSGKIATATPSRTLGKAVTQGLLESFREN 31
            T++ L  E    L         T      P SSGK A ++   ++  +   G L S  E 
Sbjct: 822  TKQSLAQELGLGLKQVSKWFENTRWSTRHPSSSGKKAKSSSRMSIYLSQASGEL-SKNEP 880

Query: 30   EYPT--RDKKEN 1
            E  T  RD   N
Sbjct: 881  ESATCFRDTDSN 892


>ref|XP_006829269.1| hypothetical protein AMTR_s00001p00272780 [Amborella trichopoda]
            gi|548834248|gb|ERM96685.1| hypothetical protein
            AMTR_s00001p00272780 [Amborella trichopoda]
          Length = 800

 Score =  368 bits (945), Expect = 8e-99
 Identities = 258/694 (37%), Positives = 339/694 (48%), Gaps = 25/694 (3%)
 Frame = -1

Query: 2013 ESSSPQKIMF-GEKNELGLEHVQSETLEIRIVGSDSVDIKQLGTPPKD------------ 1873
            E + P+ ++   E+     E    E  E+  +G  S++I++L   P D            
Sbjct: 8    EQNCPRSLILDSERCSTSFEQTTKE--EVPSIGVHSLEIERLTPAPIDPGYAGPNSGIIG 65

Query: 1872 ---AAK-STNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSK--PPDPSISAN 1711
               A+K +++R   K  +V+S+   R Y LRSS +   VL+  S GTSK  P   S S+ 
Sbjct: 66   RNTASKGNSSRQEWKGKKVASQVGSRSYFLRSSSNGVRVLRPRSIGTSKTSPAASSKSSP 125

Query: 1710 VXXXXXXXXXXXXXXXKTL-NDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEK 1534
            +               + L NDE+SRTRK +RYLL R+ +E  LIDAYSGEGWKG S EK
Sbjct: 126  IMPERRKSRREKRKLKEVLSNDEYSRTRKSVRYLLARINFEQGLIDAYSGEGWKGQSQEK 185

Query: 1533 IKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGS 1354
            +KPE+EL+RA  EI+R KL+IRDLFQHL +LC EGR+  SLFDSEG+I SEDIFCAKCGS
Sbjct: 186  VKPEKELKRAEDEIVRRKLRIRDLFQHLQTLCEEGRIHESLFDSEGKIYSEDIFCAKCGS 245

Query: 1353 KDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQ 1174
            KD+  DNDIILCDGIC+RGFH MCL PPL K++IPP DEGWLCPGC+CK  C+ L+ND  
Sbjct: 246  KDVPPDNDIILCDGICNRGFHQMCLVPPLLKEQIPPGDEGWLCPGCECKAFCVDLVNDYL 305

Query: 1173 GTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQXXXXXX 994
            GT+L I+D WEKVF EAAA  SGD   D+ G             DGP++D+  Q      
Sbjct: 306  GTDLLIEDGWEKVFAEAAALASGDKQYDDLGLPSDDSEDNDYNPDGPDIDDEAQNSSSSS 365

Query: 993  XXXXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXXXX 814
                      D  +S  +++   L                                    
Sbjct: 366  EESDMTSGSSDSESSSSDDEASSL------------------------------------ 389

Query: 813  XXALSDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEPGPSQG 634
                 +G GSS     + + +     L GS  R +Q    KK  +N+ELLSILEP  +  
Sbjct: 390  ----DEGSGSSLPGPFLSADLS----LNGSEGRSNQ----KKPRMNSELLSILEPESNGK 437

Query: 633  NASPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXXXEAPRKRKNDGGIGNTSVVPP 454
              SP+  KRNRERLDYKKLHDE YGNV             +   KRK  GG+G  + +P 
Sbjct: 438  VVSPLPGKRNRERLDYKKLHDEDYGNVSSDSSDDEDWVAMDT-SKRKKSGGVGRGTRLPT 496

Query: 453  KGXXXXXXXXXXXXXXXXNLKDIESTP----TRRTRQKLDSDGAYNIAAETHKDSLGPDY 286
            K                 +LK  ES P    T+   QK +S+     ++ TH        
Sbjct: 497  K----------HCTLSPGSLKIYESIPSLPETQILLQKPNSETIQVGSSLTHNIPGNSQI 546

Query: 285  AVKVATASASRKNTGSTQRNLKETKNTPKRRTREKLDFEGADNLAAETPKDNLGPDSSGK 106
             V   +AS  + + G                                      G   S +
Sbjct: 547  QVHGVSASGVKSHVGG-------------------------------------GEHISSR 569

Query: 105  IATATP-SRTLGKAVTQGLLESFRENEYPTRDKK 7
                TP S+  G+ VTQ L  SF+EN YPT++ +
Sbjct: 570  NGPVTPLSKRFGRLVTQSLHNSFKENMYPTKETR 603


>ref|XP_007143079.1| hypothetical protein PHAVU_007G041800g [Phaseolus vulgaris]
            gi|561016269|gb|ESW15073.1| hypothetical protein
            PHAVU_007G041800g [Phaseolus vulgaris]
          Length = 826

 Score =  363 bits (931), Expect = 3e-97
 Identities = 277/824 (33%), Positives = 382/824 (46%), Gaps = 34/824 (4%)
 Frame = -1

Query: 2454 TIESSSLKQNTLQGKHELGSEHAQSEPMEMKIVEPSAQENCQTIESSSLKQNTSQGKHEL 2275
            T    S +Q+ L  K +LG+EH   E  E   V  S+       +S+ +  N ++    L
Sbjct: 15   TAGGMSTEQSGLSEKTQLGTEHLLKEQRESGTVLTSS---VMDEKSNQVSANATENSVIL 71

Query: 2274 GSEHAQSEPMETKIVESSARENCQTIESSSLKQNRLQGKHELGSEHAQSEPMETKIVESS 2095
                 Q +            +NCQT+E S L+Q+      E+  + +  EP         
Sbjct: 72   LPAPPQHD----------LEKNCQTVEGSHLQQST---DKEVSLQLSNDEPENPS----- 113

Query: 2094 VLENCQKIESSSPKQNTTQEKQNGQTTESSSPQKIMFGEKNELGLEHVQSETLEIRIVGS 1915
                 Q +  + P ++      +GQ  + SSP        N +           +     
Sbjct: 114  -----QPLSENEPVESAPAFAGDGQ--KQSSPALANTSYVNNM-----------LDPPSG 155

Query: 1914 DSVDIKQLGTPPKDAAKSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKP 1735
            D+V    +    K +    N   R+K + +S+ LK+ YMLRS  SS   L+S +    K 
Sbjct: 156  DAV----INCSEKVSNSPANSQLRRKGKKNSKFLKKTYMLRSVGSSDRALRSKTKENPKT 211

Query: 1734 PDPSISA--------NVXXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLI 1579
            P+P+ +         N                  + D+FSR +  LRYLLNR+GYE NLI
Sbjct: 212  PEPNSNLVDCNNNNNNDGVKKKSFKKKRKSGEVGITDQFSRIKSHLRYLLNRIGYEKNLI 271

Query: 1578 DAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSE 1399
            DAYS EGWKG SMEK+KPE+ELQRA SEI+R KL IR+LF++LDSLC EG+L  SLFDSE
Sbjct: 272  DAYSAEGWKGYSMEKLKPEKELQRAKSEIIRRKLNIRELFRNLDSLCTEGKLPESLFDSE 331

Query: 1398 GQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPG 1219
            G+I SEDIFCAKC SK+LS +NDIILCDG+CDRGFH +CL+PPL  + IPP DEGWLCPG
Sbjct: 332  GEIDSEDIFCAKCHSKELSSNNDIILCDGVCDRGFHQLCLDPPLLTEDIPPGDEGWLCPG 391

Query: 1218 CDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXD 1039
            CDCK DC+ L+ND  GT+LSI D WE+VFPEAAA  +G+  D+N G             +
Sbjct: 392  CDCKDDCMDLINDSFGTSLSISDTWERVFPEAAA-AAGNKTDNNSGLPSDDSDDDDYNPN 450

Query: 1038 GPELDERVQXXXXXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQI 859
            GPE D +V+                +L  S H +QYLGL                 D ++
Sbjct: 451  GPE-DVKVEGDESSSDESDYASASENLEGS-HGDQYLGLPSDDSDDGDYDPAAPDADSKV 508

Query: 858  KKXXXXXXXXXXXXXXXALSDGKGSSGTNVLVHS-SVDGAKLLTGSSNRRSQMGRKKKQS 682
                             A      S G +  + S S+D  K L     R+ + G  KK S
Sbjct: 509  NVESSSSDFTSDSDDLPAAIVENTSPGQDGEIRSASLDDVKCLNSYGKRKGKAG--KKLS 566

Query: 681  INAELLSILEPGPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXXXEAPR 502
            +  EL S+LEP   Q  ++PV  +RN ERLDYKKL+DEAY +                 R
Sbjct: 567  MADELSSLLEPDSGQEGSTPVSGRRNLERLDYKKLYDEAYHS--DTSEDEDWTATVTPSR 624

Query: 501  KRKN-------DGGIGNTSVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPTRRTRQKLDS 343
            K+K        DG   N S+  PK                 +L D   + +R+ + K   
Sbjct: 625  KKKGNATPVSPDGNASNNSMHTPKRNGHQKKFENTKNSPAKSLDDHVKSDSRKQKSK--- 681

Query: 342  DGAYNIAAE------------------THKDSLGPDYAVKVATASASRKNTGSTQRNLKE 217
              AY    E                  T K+SL  +  +     +    NT  + R+  +
Sbjct: 682  SSAYKRLGEAVVERLHISFKENQYPDRTTKESLAQELGLTCQQVAKWFDNTRWSFRHSSQ 741

Query: 216  TKNTPKRRTREKLDFEGADNLAAETPKDNLGPDSSGKIATATPS 85
             +    R T ++     A+N   E   +   P+ SG+I+ A  S
Sbjct: 742  METNSGRNTLQQSTDSRAENEGGEC--ELRSPEVSGEISKAPSS 783


>ref|XP_003555282.1| PREDICTED: homeobox protein HAT3.1-like isoform X1 [Glycine max]
          Length = 820

 Score =  361 bits (926), Expect = 1e-96
 Identities = 275/809 (33%), Positives = 387/809 (47%), Gaps = 51/809 (6%)
 Frame = -1

Query: 2355 EPSAQENCQTIE------SSSLKQNTSQGKHELGS-------EHAQSEPMETKIVESSA- 2218
            EP A E C+  E      S  L++   +   EL S           ++  E  +++  A 
Sbjct: 17   EPMATEQCELSEKTPQIGSEGLEREQKELLTELTSFVIDEKSNQVSADVTENSVIQLPAP 76

Query: 2217 -----RENCQTIESSSLKQNRLQGKHELGSEHAQSEPMETKIVESSVLENCQKIESSSPK 2053
                  +NCQT+E S L+Q+ +    E  S    ++  E K    S  EN Q    S P 
Sbjct: 77   PQHDFEKNCQTVEGSCLEQSTV----EQVSVDLSNDKSENKCKPLS--ENVQ----SEPV 126

Query: 2052 QNTTQEKQNGQTTESSSPQKIMFGEKNELGLEHVQSETLEIRIVGSDSVDIKQLGTPPKD 1873
            ++      +GQ    SSP +      NEL L+    +     +V + +   +++   P  
Sbjct: 127  ESIPAFVVDGQM--QSSPAQANMSSVNEL-LDQPSGD-----VVNNITNCSEKMSNSP-- 176

Query: 1872 AAKSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPSIS-----ANV 1708
                ++  SR+K + +S+ LK+KYMLRS  SS   L+S +    K P+P+ +     +N 
Sbjct: 177  ----SHSQSRRKGKRNSKLLKKKYMLRSLGSSGRALRSRTKEKPKEPEPTSNLVDGNSND 232

Query: 1707 XXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIK 1528
                           + + D+FSR R  LRYLLNR+ YE++LIDAYSGEGWKG SMEK+K
Sbjct: 233  GVKRKSGRKKKKRREEGITDQFSRIRSHLRYLLNRISYENSLIDAYSGEGWKGYSMEKLK 292

Query: 1527 PEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKD 1348
            PE+ELQRA SEILR KLKIRDLF++LDSLCAEG+   SLFDS G+I SEDIFCAKC SK+
Sbjct: 293  PEKELQRAKSEILRRKLKIRDLFRNLDSLCAEGKFPESLFDSAGEIDSEDIFCAKCQSKE 352

Query: 1347 LSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGT 1168
            LS +NDIILCDG+CDRGFH +CL+PPL  + IPP DEGWLCPGCDCK DC+ L+ND  GT
Sbjct: 353  LSTNNDIILCDGVCDRGFHQLCLDPPLLTEDIPPGDEGWLCPGCDCKDDCMDLVNDSFGT 412

Query: 1167 NLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQXXXXXXXX 988
            +LSI D WE+VFPEAA+  +G+N D+N G             +G + D +++        
Sbjct: 413  SLSISDTWERVFPEAASF-AGNNMDNNLGLPSDDSDDDDYNPNGSD-DVKIEGDESSSDE 470

Query: 987  XXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXXXXXX 808
                     L    HE+QYLGL                 D ++ +               
Sbjct: 471  SEYASASEKLEGGSHEDQYLGLPSEDSDDGDYDPDAPDVDCKVNEESSSSDFTSDSEDLA 530

Query: 807  ALSDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEPGPSQGNA 628
            A  +   S G +              G  N   + G+  K S+  EL S+LEP   QG  
Sbjct: 531  AAFEDNTSPGQD--------------GGINSSKKKGKVGKLSMADELSSLLEPDSGQGGP 576

Query: 627  SPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXXXEAP-RKRKNDGGIGNTSVVPPK 451
            +PV  KR+ ERLDYKKL++E Y +               AP RK+K  G +  T V P  
Sbjct: 577  TPVSGKRHVERLDYKKLYEETYHS---DTSDDEDWNDAAAPSRKKKLTGNV--TPVSPNA 631

Query: 450  GXXXXXXXXXXXXXXXXNLKDIESTPTRR--------TRQKLDSDGAYNIAAE------- 316
                              +++  S+PT+         +R K     A+    E       
Sbjct: 632  NASNNSIHTLKRNAHQNKVENTNSSPTKSLDGRSKSGSRDKRSGSSAHKRLGEAVVQRLH 691

Query: 315  -THKDSLGPDYAVKVATASASR----------KNTGSTQRNLKETKNTPKRRTREKLDFE 169
             + K++  PD + K + A               NT  + R+  + +    R    +    
Sbjct: 692  KSFKENQYPDRSTKESLAQELGLTYQQVAKWFDNTRWSFRHSSQMETNSGRNASPEATDG 751

Query: 168  GADNLAAETPKDNLGPDSSGKIATATPSR 82
             A+N   E   +++ P+ SGK +  T SR
Sbjct: 752  RAEN-EGEKQCESMSPEVSGKNSKTTSSR 779


>ref|XP_006605989.1| PREDICTED: homeobox protein HAT3.1-like isoform X2 [Glycine max]
          Length = 751

 Score =  359 bits (921), Expect = 5e-96
 Identities = 240/622 (38%), Positives = 326/622 (52%), Gaps = 24/622 (3%)
 Frame = -1

Query: 2355 EPSAQENCQTIE------SSSLKQNTSQGKHELGS-------EHAQSEPMETKIVESSA- 2218
            EP A E C+  E      S  L++   +   EL S           ++  E  +++  A 
Sbjct: 17   EPMATEQCELSEKTPQIGSEGLEREQKELLTELTSFVIDEKSNQVSADVTENSVIQLPAP 76

Query: 2217 -----RENCQTIESSSLKQNRLQGKHELGSEHAQSEPMETKIVESSVLENCQKIESSSPK 2053
                  +NCQT+E S L+Q+ +    E  S    ++  E K    S  EN Q    S P 
Sbjct: 77   PQHDFEKNCQTVEGSCLEQSTV----EQVSVDLSNDKSENKCKPLS--ENVQ----SEPV 126

Query: 2052 QNTTQEKQNGQTTESSSPQKIMFGEKNELGLEHVQSETLEIRIVGSDSVDIKQLGTPPKD 1873
            ++      +GQ    SSP +      NEL L+    +     +V + +   +++   P  
Sbjct: 127  ESIPAFVVDGQM--QSSPAQANMSSVNEL-LDQPSGD-----VVNNITNCSEKMSNSP-- 176

Query: 1872 AAKSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPSIS-----ANV 1708
                ++  SR+K + +S+ LK+KYMLRS  SS   L+S +    K P+P+ +     +N 
Sbjct: 177  ----SHSQSRRKGKRNSKLLKKKYMLRSLGSSGRALRSRTKEKPKEPEPTSNLVDGNSND 232

Query: 1707 XXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIK 1528
                           + + D+FSR R  LRYLLNR+ YE++LIDAYSGEGWKG SMEK+K
Sbjct: 233  GVKRKSGRKKKKRREEGITDQFSRIRSHLRYLLNRISYENSLIDAYSGEGWKGYSMEKLK 292

Query: 1527 PEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKD 1348
            PE+ELQRA SEILR KLKIRDLF++LDSLCAEG+   SLFDS G+I SEDIFCAKC SK+
Sbjct: 293  PEKELQRAKSEILRRKLKIRDLFRNLDSLCAEGKFPESLFDSAGEIDSEDIFCAKCQSKE 352

Query: 1347 LSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGT 1168
            LS +NDIILCDG+CDRGFH +CL+PPL  + IPP DEGWLCPGCDCK DC+ L+ND  GT
Sbjct: 353  LSTNNDIILCDGVCDRGFHQLCLDPPLLTEDIPPGDEGWLCPGCDCKDDCMDLVNDSFGT 412

Query: 1167 NLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQXXXXXXXX 988
            +LSI D WE+VFPEAA+  +G+N D+N G             +G + D +++        
Sbjct: 413  SLSISDTWERVFPEAASF-AGNNMDNNLGLPSDDSDDDDYNPNGSD-DVKIEGDESSSDE 470

Query: 987  XXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXXXXXX 808
                     L    HE+QYLGL                 D ++ +               
Sbjct: 471  SEYASASEKLEGGSHEDQYLGLPSEDSDDGDYDPDAPDVDCKVNEESSSSDFTSDSEDLA 530

Query: 807  ALSDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEPGPSQGNA 628
            A  +   S G +              G  N   + G+  K S+  EL S+LEP   QG  
Sbjct: 531  AAFEDNTSPGQD--------------GGINSSKKKGKVGKLSMADELSSLLEPDSGQGGP 576

Query: 627  SPVLSKRNRERLDYKKLHDEAY 562
            +PV  KR+ ERLDYKKL++E Y
Sbjct: 577  TPVSGKRHVERLDYKKLYEETY 598


>ref|XP_004161446.1| PREDICTED: homeobox protein HAT3.1-like [Cucumis sativus]
          Length = 749

 Score =  356 bits (913), Expect = 4e-95
 Identities = 246/652 (37%), Positives = 323/652 (49%), Gaps = 31/652 (4%)
 Frame = -1

Query: 1863 STNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPS-----ISANVXXX 1699
            S ++ S +K+++  +  K+ Y LRS +SS  VL+S +   +K P+ S      +A     
Sbjct: 22   SNSQQSARKDKIFLKSKKKNYKLRSHVSSDRVLRSRTQEKAKAPERSNDLNNFTAEEDGK 81

Query: 1698 XXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIKPEQ 1519
                            DE+S  R  LRYLLNR+ YE +LI+AYS EGWKG S +K+KPE+
Sbjct: 82   RKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEK 141

Query: 1518 ELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKDLSD 1339
            ELQRA++EI+R KLKIRDLFQ +D+LCAEGRL  SLFDSEGQI SEDIFCAKCGSK+LS 
Sbjct: 142  ELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSL 201

Query: 1338 DNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGTNLS 1159
            +NDIILCDGICDRGFH  CL PPL    IPP+DEGWLCPGCDCK DC+ LLN+ QG+NLS
Sbjct: 202  ENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLS 261

Query: 1158 IDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPEL-------------DER 1018
            I D WEKV+PEAAA  +G N D   G             D P+               ++
Sbjct: 262  ITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNELSSDESSSDQ 321

Query: 1017 VQXXXXXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXX 838
                               L  S +++QYLGL                 DE +++     
Sbjct: 322  SNSDPSNSDTSGYASASEGLEVSSNDDQYLGLPSDDSEDNDYDPSVPELDEGVRQESSSS 381

Query: 837  XXXXXXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSI 658
                      AL D   SS    LV S  +   +     N   Q     K +++ EL S+
Sbjct: 382  DFTSDSEDLAAL-DNNCSSKDGDLVSSLNNTLPV----KNSNGQSSGPNKSALHNELSSL 436

Query: 657  LEPGPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXXXEAPRKRKNDGGI 478
            L+ GP +    PV  +R  ERLDYKKLHDE YGNVP            ++   R  D G 
Sbjct: 437  LDSGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSG- 495

Query: 477  GNTSVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPT--RRTRQKLDSDGAYNIAAETHKD 304
              T    PK                 +L ++++  +  RRTRQK  +    N   ET  D
Sbjct: 496  --TRKRGPK----TLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVD 549

Query: 303  SLGPDYAVKVATASASRKNTGSTQRNLK---ETKNTPKRRTREKLDFEGADNLA------ 151
            +     +VK +T+S++R+ +      L    +    PKR T++ L  E    L       
Sbjct: 550  TAKSSSSVKKSTSSSNRRLSQPALERLLASFQENEYPKRATKQSLAQELGLGLKQVSKWF 609

Query: 150  AETPKDNLGPDSSGKIATATPSRTLGKAVTQGLLESFRENEYPT--RDKKEN 1
              T      P SSGK A ++   ++  +   G L S  E E  T  RD   N
Sbjct: 610  ENTRWSTRHPSSSGKKAKSSSRMSIYLSQASGEL-SKNEPESATCFRDTDSN 660


>ref|XP_006589630.1| PREDICTED: pathogenesis-related homeodomain protein isoform X1
            [Glycine max]
          Length = 820

 Score =  352 bits (902), Expect = 8e-94
 Identities = 239/619 (38%), Positives = 331/619 (53%), Gaps = 4/619 (0%)
 Frame = -1

Query: 2406 ELGSEHAQSEPMEMKIVEPSAQENCQTIESSSLKQNTSQGKHELGSEHAQSEPMETKIVE 2227
            ++GSE  ++E  E+            T  +SS+ +  S     + +E+A  +  E   ++
Sbjct: 32   QIGSEGLENEQKELG-----------TELTSSVIEEKSNQVSAIVTENAVIQLPEP--LQ 78

Query: 2226 SSARENCQTIESSSLKQNRLQGKHELGSEHAQSEPMETKIVESSVLENCQKIESSSPKQN 2047
               ++NCQT+E S L+Q+ ++   ++  + +  +P E K    S  EN Q    S P ++
Sbjct: 79   HDLQKNCQTVEGSCLEQSTVE---QVTVDLSNDKP-ENKCKPLS--ENVQ----SEPVES 128

Query: 2046 TTQEKQNGQTTESSSPQKIMFGEKNELGLEHVQSETLEIRIVGSDSVDIKQLGTPPKDAA 1867
                   GQ    S+P +      NEL           +     D+V+        K + 
Sbjct: 129  IPAVVVEGQM--QSNPSQANMSSVNEL-----------LDQPSGDAVNNISSNCSEKMSN 175

Query: 1866 KSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPSISA----NVXXX 1699
              T+  SR+K + +S+ LK KYMLRS  SS   L+S +    K P+P+ +     N    
Sbjct: 176  SPTHSQSRRKGKKNSKLLK-KYMLRSLGSSDRALRSRTKEKPKEPEPTSNLVDGNNNGVK 234

Query: 1698 XXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIKPEQ 1519
                        + + ++FSR R  LRYLLNR+ YE++LIDAYSGEGWKG S+EK+KPE+
Sbjct: 235  RKSGRKKKKRKEEGITNQFSRIRSHLRYLLNRISYENSLIDAYSGEGWKGYSIEKLKPEK 294

Query: 1518 ELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKDLSD 1339
            ELQRA SEILR KLKIRDLFQ+LDSLCAEG+   SLFDS G+I SEDIFCAKC SK+LS 
Sbjct: 295  ELQRAKSEILRRKLKIRDLFQNLDSLCAEGKFPESLFDSAGEIDSEDIFCAKCQSKELST 354

Query: 1338 DNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGTNLS 1159
            +NDIILCDG+CDRGFH +CL+PP+  + IPP DEGWLCPGCDCK DC+ L+ND  GT+LS
Sbjct: 355  NNDIILCDGVCDRGFHQLCLDPPMLTEDIPPGDEGWLCPGCDCKDDCMDLVNDSFGTSLS 414

Query: 1158 IDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQXXXXXXXXXXX 979
            I D WE+VFPEAA+  +G+N D+N G             +GP+ D +V+           
Sbjct: 415  ISDTWERVFPEAASF-AGNNMDNNSGVPSDDSDDDDYNPNGPD-DVKVEGDESSSDESEY 472

Query: 978  XXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXXXXXXALS 799
                  L    HE+QYLGL                 + ++ +               A  
Sbjct: 473  ASASEKLEGGSHEDQYLGLPSEDSDDGDYDPDAPDVECKVNEESSSSDFTSDSEDLAAAI 532

Query: 798  DGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEPGPSQGNASPV 619
            +   S G         DG      SS ++ ++G  KK S+  EL S+LEP   Q   +PV
Sbjct: 533  EDNTSPGQ--------DGG---ISSSKKKGKVG--KKLSLPDELSSLLEPDSGQEAPTPV 579

Query: 618  LSKRNRERLDYKKLHDEAY 562
              KR+ ERLDYKKL++E Y
Sbjct: 580  SGKRHVERLDYKKLYEETY 598


>ref|XP_004496910.1| PREDICTED: pathogenesis-related homeodomain protein-like isoform X1
            [Cicer arietinum]
          Length = 995

 Score =  337 bits (865), Expect = 1e-89
 Identities = 240/753 (31%), Positives = 358/753 (47%), Gaps = 72/753 (9%)
 Frame = -1

Query: 2604 SNAQENCQTIESSSPKQNILQGKLELGSEHAQSEP------METKIVESNAQESYKTIES 2443
            S +Q +C T +  +   ++     ++GSE   +E       +   +++  + + Y  +  
Sbjct: 8    SISQYDCTTEKMGTEHCDLSVKSAQIGSECLDNEQSVVDTVLTNSVIDEKSNQDYVNMTE 67

Query: 2442 SSLKQ----------NTLQGKHELGSEHAQSEPMEMKIVEPSAQENCQTIESSSLKQNTS 2293
            +S+ Q          NT Q       + + SE + + + +   +  C     +   + ++
Sbjct: 68   NSVVQLPDPTHHYIENTCQTLEGSCLQQSTSEQVPVWLCDGKLENKCPQNVQNEAGEMSN 127

Query: 2292 QGKHELGSEHAQSEPMETKIV--------------ESSARENCQTIESSSLKQNRLQGKH 2155
                 +  E  QS P +  +V              ++   E  + + +  +++       
Sbjct: 128  AVAALVVEEQTQSVPAQVNVVLDPPSGDVAKNVSFQNEPGEMSEAVAALVVEEQTQSVPA 187

Query: 2154 ELG-------SEHAQSEPMETKIVESS--VLENCQKIESSSPKQNTTQEKQNGQTTESSS 2002
            ++         + A+S   + ++ E S  V++  +    S P Q  T          S  
Sbjct: 188  QVNVVLDPPSGDVAESVSFQNELAEMSDAVIDVVEDQTQSGPAQVNTDSVNEPLDPPSGE 247

Query: 2001 PQKIMF-----GEKNELGLEHVQSETLEIRIVGSDSVDIKQLGTPP-------------- 1879
              KI+      GE ++  +  V+ +T  I       V+   +  P               
Sbjct: 248  VAKIVNLQNEPGEMSDAVIGIVEYQTQSIPXXXXXPVNTYSVNDPSDPPSEDVVKNISSD 307

Query: 1878 ---KDAAKSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDP-----S 1723
               + +  S +  SR K + +S+ L +KY+LRS  SS   L+S +    K P+P      
Sbjct: 308  CSERKSKSSAHLRSRHKGKSNSK-LSKKYILRSLGSSDRALRSRTRDKPKDPEPINNVVD 366

Query: 1722 ISANVXXXXXXXXXXXXXXXKT-LNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGL 1546
            +S +                K  +ND++S+ R  LRYLLNR+ YE NLIDAYSGEGWKG 
Sbjct: 367  VSNDAMKTKRGKKKKKKRPRKEGINDQYSKIRAHLRYLLNRISYEQNLIDAYSGEGWKGY 426

Query: 1545 SMEKIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCA 1366
            S+EK+KPE+E+QRA SEILR KLKIRDLFQ+LDSLCAEGRL  SLFDS+G+I SEDIFCA
Sbjct: 427  SLEKLKPEKEIQRAKSEILRRKLKIRDLFQNLDSLCAEGRLPESLFDSKGEIDSEDIFCA 486

Query: 1365 KCGSKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLL 1186
            KC +K L  DNDIILCDG CDRGFH +CL+PPL  + IPP DEGWLCPGCDCK DCI L+
Sbjct: 487  KCQTKVLGTDNDIILCDGACDRGFHQLCLDPPLLTEDIPPGDEGWLCPGCDCKDDCIELV 546

Query: 1185 NDVQGTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPE----LDER 1018
            ND+ GTNLS+ + WE+VFPE AAT +G   D N G             +GPE     D  
Sbjct: 547  NDLLGTNLSLTNTWERVFPE-AATAAGSILDHNSGLPSDDSEDDDYNPNGPEDVEVEDAE 605

Query: 1017 VQXXXXXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQI-KKXXXX 841
            V+                 L  S HE+QYLGL                   ++ ++    
Sbjct: 606  VEGDESSSDESEYASASEKLEDSRHEDQYLGLPSEDSEDDDFDPDAPDLGGKVTEESSSS 665

Query: 840  XXXXXXXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLS 661
                        + D   +     +    +D  K L G S +  ++  +KK S+  EL S
Sbjct: 666  DFTSDSEDLAATIKDNMSTGQDGDITSPLLDDVKNLKGFSRQNHKV--RKKPSMADELSS 723

Query: 660  ILEPGPSQGNASPVLSKRNRERLDYKKLHDEAY 562
            +L+    Q + +P+ +KRN ERLDY+KL++E Y
Sbjct: 724  LLKSDLGQEDITPITAKRNVERLDYQKLYEETY 756


Top