BLASTX nr result

ID: Catharanthus23_contig00009008 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00009008
         (3267 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004230722.1| PREDICTED: pathogenesis-related homeodomain ...   505   e-140
ref|XP_006346339.1| PREDICTED: pathogenesis-related homeodomain ...   501   e-139
ref|XP_002313886.2| hypothetical protein POPTR_0009s09600g [Popu...   484   e-134
ref|XP_002300247.2| homeobox family protein [Populus trichocarpa...   483   e-133
ref|XP_004289744.1| PREDICTED: uncharacterized protein LOC101296...   476   e-131
gb|EXB76647.1| Homeobox protein [Morus notabilis]                     472   e-130
emb|CBI22504.3| unnamed protein product [Vitis vinifera]              465   e-128
ref|XP_002269077.1| PREDICTED: homeobox protein HAT3.1-like [Vit...   465   e-128
ref|XP_002527467.1| Homeobox protein HAT3.1, putative [Ricinus c...   464   e-128
gb|EMJ01257.1| hypothetical protein PRUPE_ppa023106mg [Prunus pe...   461   e-127
ref|XP_006486963.1| PREDICTED: homeobox protein HAT3.1-like isof...   458   e-126
ref|XP_006422879.1| hypothetical protein CICLE_v10027725mg [Citr...   457   e-125
gb|EOX98399.1| Homeodomain-like protein with RING/FYVE/PHD-type ...   455   e-125
ref|XP_006589630.1| PREDICTED: pathogenesis-related homeodomain ...   451   e-124
ref|XP_003555282.1| PREDICTED: homeobox protein HAT3.1-like isof...   447   e-122
ref|XP_004496910.1| PREDICTED: pathogenesis-related homeodomain ...   445   e-122
gb|ESW15073.1| hypothetical protein PHAVU_007G041800g [Phaseolus...   433   e-118
sp|P48786.1|PRH_PETCR RecName: Full=Pathogenesis-related homeodo...   429   e-117
ref|XP_004161446.1| PREDICTED: homeobox protein HAT3.1-like [Cuc...   415   e-113
ref|XP_004140812.1| PREDICTED: uncharacterized protein LOC101204...   415   e-113

>ref|XP_004230722.1| PREDICTED: pathogenesis-related homeodomain protein-like [Solanum
            lycopersicum]
          Length = 796

 Score =  505 bits (1301), Expect = e-140
 Identities = 310/740 (41%), Positives = 411/740 (55%), Gaps = 15/740 (2%)
 Frame = -1

Query: 2712 MGSATCAPENMGSATCDAH--------ENHLDLQHSEPAEKDATNVASESVPHEGTSLPS 2557
            +G+ + +PE   +A    H        EN    Q  E  E    N+       +    P 
Sbjct: 4    LGNTSVSPEKARTAGGGHHTASAGNMSENLGADQSRESCENTVQNLNQSEYREKSPGQPR 63

Query: 2556 RKQISSLLPPVSNRVLRSRSQEKPKASESKAVEVENSANEGRKSKQRKGRMKK-IPVNEF 2380
            +++  S  P  S R+LRS+S+EK  ASE+K   V + A E +K K+RK +  K I  NEF
Sbjct: 64   KRKSISGSPISSTRLLRSKSKEKSGASEAKNTVVTHDATEEKKRKRRKKKHSKHIAANEF 123

Query: 2379 SRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDL 2200
            +RIR HLRYLL R+KYE+ LI+AYS EGWKGQSLEK+K EKELQRA++ IFRYKLKIRDL
Sbjct: 124  TRIRGHLRYLLQRIKYEQTLIEAYSGEGWKGQSLEKIKLEKELQRAKTHIFRYKLKIRDL 183

Query: 2199 FQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFC 2020
            FQ+LD  L EGR P SLFD+EG+IDSEDIFCAKCGS DL  +NDIILCDGACERGFHQ C
Sbjct: 184  FQRLDTLLAEGRLPASLFDNEGEIDSEDIFCAKCGSMDLPADNDIILCDGACERGFHQLC 243

Query: 2019 LDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASG 1840
            ++PPLLKEDIPPDDEGWLCPGCDCK+DC+ +LND Q + LSVTD+WEKV+P+EAAAAASG
Sbjct: 244  VEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLLNDLQGTDLSVTDSWEKVYPKEAAAAASG 303

Query: 1839 KTMDEGTXXXXXXXXXXXXXXXXXXXDHEVSRGESTSD--GSDYFSASDDIV-PPLDNKQ 1669
            + +D+ +                       S  ES+SD   SD++SAS+D+   P  + +
Sbjct: 304  EKLDDISGLPSDDSEDDDYNPEAPDVGKNDSEDESSSDESESDFYSASEDLAEAPTKDDE 363

Query: 1668 IFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPS 1489
            I                                      D   I++       ++G   S
Sbjct: 364  ILGLSSEDSEDDDYNPDDPDKDEPVKTESSSSDFTSDSEDFSLIVDTNRLRGDEQGVSSS 423

Query: 1488 VSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLTESNTEAVSGKRRGERLDYKKLHDEAY 1309
            V ++S PN+V+  EK K GK KG SL DELSYL +S++  VS KR  ERLDYKKLHDE Y
Sbjct: 424  V-DNSMPNSVSLKEKAKVGKAKGNSLKDELSYLMQSDSPLVSAKRHIERLDYKKLHDETY 482

Query: 1308 RNATSDSSDEDYTETAGAKRRKSNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVE 1129
             N +SDSSDEDY +    K RK    K      +     +   D    +   K + H+ +
Sbjct: 483  GNGSSDSSDEDYDDGPLPKVRKLRNAKGAMAAPS-----STPADIKYQSGKQKGSGHASD 537

Query: 1128 GKSHKKLKVXXXXXXXXXXXXXXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKENQYPKQ 949
                +KLKV                       ++ + K  GE   +RL ESFK+NQYP +
Sbjct: 538  SGISEKLKV--------------GGTGTSESPSSGKRKTYGEVSTKRLYESFKDNQYPDR 583

Query: 948  DVKQSLAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAA 769
            D K+ L KELGL   QV KWFENAR   RHS   + K+++   +  S  ++++    L  
Sbjct: 584  DAKEKLGKELGLTAHQVSKWFENARHCHRHSPNWK-KIMSHKVSEESPSKSQIIGEPLGT 642

Query: 768  KQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGEGKSVESEASGEKS---TRKRKVDN 598
            + + ++   S +G+E L   +     E+    D  E + +  + SG+KS   T+K    N
Sbjct: 643  ESNSII--ASCNGVEKLEQPKQCLNGEKGHAIDKSEEELLIQDTSGKKSSEPTKKVHTTN 700

Query: 597  QGSSAGNCMKQDQHDDTPKS 538
            +GS           +DTP+S
Sbjct: 701  EGS-----------EDTPRS 709


>ref|XP_006346339.1| PREDICTED: pathogenesis-related homeodomain protein-like isoform X1
            [Solanum tuberosum] gi|565359059|ref|XP_006346340.1|
            PREDICTED: pathogenesis-related homeodomain protein-like
            isoform X2 [Solanum tuberosum]
            gi|565359061|ref|XP_006346341.1| PREDICTED:
            pathogenesis-related homeodomain protein-like isoform X3
            [Solanum tuberosum] gi|565359063|ref|XP_006346342.1|
            PREDICTED: pathogenesis-related homeodomain protein-like
            isoform X4 [Solanum tuberosum]
          Length = 798

 Score =  501 bits (1291), Expect = e-139
 Identities = 314/741 (42%), Positives = 414/741 (55%), Gaps = 16/741 (2%)
 Frame = -1

Query: 2712 MGSATCAPENMGSATCDAHEN--------HLDLQHSEPAEKDATNVASESVPHEGTSLPS 2557
            +G+ + +PE +       H          +L +  S  A ++A    ++S   E T    
Sbjct: 4    LGNTSVSPEKVARTAGGGHRTASVGNMSENLGVDQSGEACENAVQNLNQSEYREKTPGQP 63

Query: 2556 RKQISSLLPPVSN-RVLRSRSQEKPKASESKAVEVENSANEGRKSKQRKGRMKK-IPVNE 2383
            RK+ S    P+S+ R+LRS+S+EK  ASE+    V + A E +K K+RK +  K I VNE
Sbjct: 64   RKRKSISGSPISSTRLLRSKSKEKSGASEANNTVVTHDATEEKKRKRRKKKHSKHIAVNE 123

Query: 2382 FSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRD 2203
            F+RIR HLRYLL R+ YE+ LI+AYS EGWKGQSLEK+K EKELQRA++ IFRYKLKIRD
Sbjct: 124  FTRIRGHLRYLLQRITYEQTLIEAYSGEGWKGQSLEKIKLEKELQRAKTHIFRYKLKIRD 183

Query: 2202 LFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQF 2023
            LFQ+LD  L EGR P SLFD+EG+IDSEDIFCAKCGS DL  +NDIILCDGACERGFHQ 
Sbjct: 184  LFQRLDTLLAEGRLPASLFDNEGEIDSEDIFCAKCGSMDLPADNDIILCDGACERGFHQL 243

Query: 2022 CLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAAS 1843
            C++PPLLKEDIPPDDEGWLCPGCDCK+DC+ +LND Q + LSVTD+WEKV+P+EAAAAAS
Sbjct: 244  CVEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLLNDLQGTDLSVTDSWEKVYPKEAAAAAS 303

Query: 1842 GKTMDEGTXXXXXXXXXXXXXXXXXXXDHEVSRGESTSDGSDYFSASDDI--VPPLDNKQ 1669
            G+ +D+ +                       S  ES+SD SD++SAS+D+   PP D+ +
Sbjct: 304  GEKLDDISGLPSDDSEDDDYNPETPDVGKNDSEDESSSDESDFYSASEDLAEAPPKDD-E 362

Query: 1668 IFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPS 1489
            I                                      D   I++       ++G   S
Sbjct: 363  ILGISSEDSEDDDFNPDDPDKDEPVKTESSSSDFTSDSEDFNLIVDTNRLQGDEQGVSSS 422

Query: 1488 VSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLTESNTEAVSGKRRGERLDYKKLHDEAY 1309
            V ++S PN+ +  EK K GK KG SL DELSYL +S++  VS KR  ERLDYKKLHDE Y
Sbjct: 423  V-DNSMPNSASQEEKAKVGKAKGNSLKDELSYLMQSDSPLVSAKRHIERLDYKKLHDETY 481

Query: 1308 RNATSDSSDEDYTETAGAKRRK-SNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSV 1132
             N +S+SSDEDY +    K RK  NA  A+   ++   D   ++    G+    ++  S 
Sbjct: 482  GNGSSESSDEDYDDGPLPKVRKLRNAKGAMTSPSSTPADIKHQSGKQKGSGRASDSGIS- 540

Query: 1131 EGKSHKKLKVXXXXXXXXXXXXXXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKENQYPK 952
                 +KLKV                       ++ + K  GE   +RL ESFK+NQYP 
Sbjct: 541  -----EKLKV--------------GGAGTSESPSSGKRKTHGEVATKRLYESFKDNQYPD 581

Query: 951  QDVKQSLAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLA 772
            +D K  L KELGL   QV KWFENAR   RHSS   + M    S    S + ++    L 
Sbjct: 582  RDAKGKLGKELGLTAYQVSKWFENARHCHRHSSHWNTIMSQKVSKESPS-KLQIIGEPLG 640

Query: 771  AKQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGEGKSVESEASGEKS---TRKRKVD 601
             + + ++     +G+  L   + R   E+    D  E      +ASG+KS   T+K    
Sbjct: 641  TESNSII--AFCNGVGKLEQPKQRLNGEKGHAIDKSEEDLFIQDASGKKSSEPTKKVYTT 698

Query: 600  NQGSSAGNCMKQDQHDDTPKS 538
            NQGS           +DTP++
Sbjct: 699  NQGS-----------EDTPRN 708


>ref|XP_002313886.2| hypothetical protein POPTR_0009s09600g [Populus trichocarpa]
            gi|550331388|gb|EEE87841.2| hypothetical protein
            POPTR_0009s09600g [Populus trichocarpa]
          Length = 934

 Score =  484 bits (1247), Expect = e-134
 Identities = 298/684 (43%), Positives = 373/684 (54%), Gaps = 17/684 (2%)
 Frame = -1

Query: 2808 IVISNKDPISNSIPGDFRLPHEN---GAAICAPENMGSATCAPENMGSATCDAHENHLDL 2638
            I I N +P++  +     + H     G +I  P N              T D  +   D 
Sbjct: 245  IAIENSEPLTQLVTKRSPIKHVGLLPGDSIIIPAN---------EQTRPTHDDEDKGPDH 295

Query: 2637 QHSEPAEKDATNVASESVPH-EGTSLPSRKQISSLLPPVSNRVLRSRSQEKPKASESKAV 2461
            +H E   + A  +     P  +  S  SRK         S+RVLRSRSQEKPKA ES   
Sbjct: 296  EHLETPSRVAIGITRRGRPRGKSASRLSRKIYMLRSLRSSDRVLRSRSQEKPKAPESSNN 355

Query: 2460 EVENSANEGRKSKQRKGRM-KKIPVNEFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQ 2284
                ++   +K K+RK R  K I  +E+S+IR HLRYLL+R+ YE++LI AYS EGWKG 
Sbjct: 356  SGNVNSTGDKKGKRRKKRRGKNIVADEYSKIRAHLRYLLNRMSYEQSLITAYSGEGWKGL 415

Query: 2283 SLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCA 2104
            SLEKLKPEKELQRA S+I R K+KIRDLFQ +D   +EGRFP SLFDSEGQIDSEDIFCA
Sbjct: 416  SLEKLKPEKELQRATSEITRRKVKIRDLFQHIDSLCSEGRFPSSLFDSEGQIDSEDIFCA 475

Query: 2103 KCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGML 1924
            KCGSKDL  +NDIILCDGAC+RGFHQFCL PPLL+EDIPPDDEGWLCPGCDCK+DC+G+L
Sbjct: 476  KCGSKDLNADNDIILCDGACDRGFHQFCLIPPLLREDIPPDDEGWLCPGCDCKVDCIGLL 535

Query: 1923 NDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXXXXXXXXXXXXXXXDHEVSR 1744
            ND Q + +S++D+WEKVFP EAAA ASG+ +D                      D +   
Sbjct: 536  NDSQGTNISISDSWEKVFP-EAAATASGQKLDHNFGPSSDDSDDNDYEPDGPDIDKKSQE 594

Query: 1743 GESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1564
             ES+SD SD+ SASD+   P D K+                                   
Sbjct: 595  EESSSDESDFTSASDEFKAPPDGKEYLGLSSDDSEDDDYDPDAPVLEEKLKQESSSSDFT 654

Query: 1563 XXXXDLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLTE 1384
                DL A +       +DE H P      +P  V++G K K   +K +SLN EL  + E
Sbjct: 655  SDSEDLAATINGDGLSLEDECHMP-----IEPRGVSNGRKSKFDGKKMQSLNSELLSMLE 709

Query: 1383 -----SNTEAVSGKRRGERLDYKKLHDEAYRNATSDSSDEDYTETAGAKRRKSNAGKAIR 1219
                   +  VSGKR  +RLDYKKL+DE Y N  S SSD+DYT+T G ++R+ N G    
Sbjct: 710  PDLCQDESATVSGKRNVDRLDYKKLYDETYGN-ISTSSDDDYTDTVGPRKRRKNTGDVAT 768

Query: 1218 ICTNKIQDRNDRTDTMDG------NQNHKENDHSVEGKSHKKLKVXXXXXXXXXXXXXXX 1057
            +  N      D + T +G      NQ  KEN  + E  + +                   
Sbjct: 769  VTAN-----GDASVTENGMNSKNMNQELKENKRNPERGTCQNSSFQETNVSPAKSYVGAS 823

Query: 1056 XXXXXGKDA-AKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVGKWFEN 880
                 GK      YK+LGEAV QRL   F+ENQYP +  K SLA+ELG+  +QV KWF N
Sbjct: 824  LSGSSGKSVRPSAYKKLGEAVTQRLYSYFRENQYPDRAAKASLAEELGITFEQVNKWFVN 883

Query: 879  ARWSFRHSSRMESKMVAAASTNGS 808
            ARWSF HSS   +    +AS  GS
Sbjct: 884  ARWSFNHSSSTGTSKAESASGKGS 907


>ref|XP_002300247.2| homeobox family protein [Populus trichocarpa]
            gi|550348560|gb|EEE85052.2| homeobox family protein
            [Populus trichocarpa]
          Length = 930

 Score =  483 bits (1243), Expect = e-133
 Identities = 315/809 (38%), Positives = 416/809 (51%), Gaps = 12/809 (1%)
 Frame = -1

Query: 3198 LHNCLLSGVSSSPTKEPPLKHGDEFVSGGGEPVVQKSVTTKSQQISLSEASVRECDCDSV 3019
            +H+     + SS   + P     E  S       Q S+   +      +A +   +  + 
Sbjct: 129  VHSESSKAIDSSILLDEPRNSNTELSSCIANETSQASLEGLANDSRAEDAGLSLVEASNS 188

Query: 3018 DNLKILDGSSMNSSFESLTAHDLGSDNI--EPLEQKQDVAQDIGRKSPSETGVVASSELP 2845
            D   ++D SS +    S    +  SD    +PLE++Q    ++      E G+   S   
Sbjct: 189  D---LIDESSYSQQTTSGQTREFHSDRACCKPLEERQKPGSELAENESMEIGIGLPSG-- 243

Query: 2844 GPEYLEHSNGEQIVISNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMGSATC 2665
                        I I N +P++  +     + H     I  P     +  A E +   T 
Sbjct: 244  ------------IAIENLEPLTELVTKSCPIKH-----IGLPPGDDISIPANEQI-RPTH 285

Query: 2664 DAHENHLDLQHSEPAEKDATNVASESVPH--EGTSLPSRKQISSLLPPVSNRVLRSRSQE 2491
            D    + D +H E        + S+ VP     + L  +K  SS     S+RVLRS SQE
Sbjct: 286  DKESKYPDCEHLEKLSGIVIGITSQGVPSVKRTSKLSGKKYTSSSRK--SDRVLRSNSQE 343

Query: 2490 KPKASESKAVEVE-NSANEGRKSKQRKGRMKKIPVNEFSRIRTHLRYLLHRVKYERNLID 2314
            KPKA E        NS  E +  +++K R K I  +E+SRIR  LRYLL+R+ YE++LI 
Sbjct: 344  KPKAPEPSNNSTNVNSTGEEKGKRRKKRRGKSIVADEYSRIRARLRYLLNRMSYEQSLIT 403

Query: 2313 AYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEG 2134
            AYS EGWKG SLEKLKPEKELQRA S+I R K+KIRDLFQ +D    EGRFP SLFDSEG
Sbjct: 404  AYSGEGWKGLSLEKLKPEKELQRATSEIIRRKVKIRDLFQHIDSLCGEGRFPASLFDSEG 463

Query: 2133 QIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGC 1954
            QIDSEDIFCAKCGSKDLT +NDIILCDGAC+RGFHQFCL PPLL+EDIPP DEGWLCPGC
Sbjct: 464  QIDSEDIFCAKCGSKDLTADNDIILCDGACDRGFHQFCLVPPLLREDIPPGDEGWLCPGC 523

Query: 1953 DCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXXXXXXXXXXX 1774
            DCK+DC+ +LND Q + +S++D W+ VFP EAAA ASG+ +D                  
Sbjct: 524  DCKVDCIDLLNDSQGTNISISDRWDNVFP-EAAAVASGQKLDY-NFGLSSDDSDDNDYDP 581

Query: 1773 XXXXDHEVSRGESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXXXXXXXXXX 1594
                  E S+ ES+SD SD+ SASD+   P D+KQ                         
Sbjct: 582  DGPDIDEKSQEESSSDESDFSSASDEFEAPPDDKQYLGLPSDDSEDDDYDPDAPVLEEKL 641

Query: 1593 XXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRS 1414
                          DL A L        DE H P      +P+  ++G + + G +K  S
Sbjct: 642  KQESSSSDFTSDSEDLDATLNGDGLSLGDEYHMP-----IEPHEDSNGRRSRFGGKKNHS 696

Query: 1413 LNDELSYLTESNTE-----AVSGKRRGERLDYKKLHDEAYRNATSDSSDEDYTETAGAKR 1249
            LN +L  + E ++       VSGKR  ERLDYKKL+DE Y N  S SSD+DYT+T   ++
Sbjct: 697  LNSKLLSMLEPDSHQEKSAPVSGKRNIERLDYKKLYDETYGN-ISTSSDDDYTDTVAPRK 755

Query: 1248 RKSNAGK-AIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXX 1072
            R+ N G  A+ I         +  ++ + NQ  K+N+H+  G++H+              
Sbjct: 756  RRKNTGDVAMGIANGDASVTENGLNSKNMNQELKKNEHT-SGRTHQNSSFQDTNVSPAKT 814

Query: 1071 XXXXXXXXXXGKDA-AKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVG 895
                       K      YK+LGEAV Q+L   FKEN+YP Q  K SLA+ELG+  +QV 
Sbjct: 815  HVGESLSGSSSKRVRPSAYKKLGEAVTQKLYSFFKENRYPDQAAKASLAEELGITFEQVN 874

Query: 894  KWFENARWSFRHSSRMESKMVAAASTNGS 808
            KWF NARWSF HSS   +    +AS  GS
Sbjct: 875  KWFMNARWSFNHSSPEGTSKAESASGKGS 903


>ref|XP_004289744.1| PREDICTED: uncharacterized protein LOC101296723 [Fragaria vesca
            subsp. vesca]
          Length = 1227

 Score =  476 bits (1224), Expect = e-131
 Identities = 328/893 (36%), Positives = 460/893 (51%), Gaps = 27/893 (3%)
 Frame = -1

Query: 3174 VSSSPTKEPPLKHGDEFVSGGGEP---VVQKSVTTKS---QQISLSEASVRECDCDSVDN 3013
            + SS  ++ PL+     VS GG     VV ++V+  S   Q   L EA  + C  D +  
Sbjct: 357  LGSSFVQDEPLQTIIPVVSSGGNEQLRVVNENVSVPSLGEQAGLLPEAVSKTCQTDKLSR 416

Query: 3012 LKILDGSSMNSSFESLTAHDLGSDNIEPLEQKQDVAQDIGRKSPSETGVVASSELPGPEY 2833
                   S++++ + +     GS   EP EQ+  +        PS+   V +S       
Sbjct: 417  -------SLHTASDQINESGSGSVQCEPQEQRDQLGS-----LPSQNDQVKNSTAVSSSI 464

Query: 2832 LEHSNGEQIVISNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMGSATCDAHE 2653
                +G  +     D ++NS+ G    P E+     A ++       P      T DA +
Sbjct: 465  GFEQSGPSV-----DEMNNSVIGHLEPPPED-----ASKDHNKELIKPH-----TNDATQ 509

Query: 2652 NHLDLQHSEPAEKDATNVASESVPHEGTSLPSRKQISSLLPPVSNRVLRSRSQEKPKASE 2473
            N   L+ SE A K+A+  +++    +  +  SR++  SL+   S+RVLRSR+ EKP+A E
Sbjct: 510  NSC-LEPSETASKNASKNSTQFGCKDKRNSSSRRKSRSLVS--SDRVLRSRTSEKPEAPE 566

Query: 2472 ----------SKAVEVENSANEGRKSKQRKGRMKKIPVNEFSRIRTHLRYLLHRVKYERN 2323
                      S +V   ++  EG++ K++K   +++  +EFSRIR+HLRY L+R+ YE++
Sbjct: 567  LSNNVATLDTSNSVANVSNEKEGKRKKRKKKHRERVAADEFSRIRSHLRYFLNRINYEKS 626

Query: 2322 LIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFD 2143
            LIDAYS EGWKG SLEKLKPEKELQRA S+I R K KIRDLFQ+LD    EG FPESLFD
Sbjct: 627  LIDAYSSEGWKGNSLEKLKPEKELQRATSEILRRKSKIRDLFQRLDSLCAEGMFPESLFD 686

Query: 2142 SEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLC 1963
             EGQIDSEDIFCAKCGS D+  +NDIILCDGAC+RGFHQ CL+PPLL E+IPPDDEGWLC
Sbjct: 687  EEGQIDSEDIFCAKCGSLDVYADNDIILCDGACDRGFHQHCLEPPLLSEEIPPDDEGWLC 746

Query: 1962 PGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXXXXXXXX 1783
            PGCDCK+DC+ +LND Q + LS+TD+WEKVFPE A AA++G+  +               
Sbjct: 747  PGCDCKVDCIDLLNDSQGTDLSITDSWEKVFPEAAVAASAGQHQENNQGLPSEDSDDDDY 806

Query: 1782 XXXXXXXDHEVSRGESTSDGSDYFSASDDI-VPPLDNKQIFXXXXXXXXXXXXXXXXXXX 1606
                   D EV  GES+SD S+Y SASD +  P  +++Q                     
Sbjct: 807  DPDGPETDEEVQEGESSSDESEYASASDGLETPKTNDEQYLGIPSDDSEDDDFNPDAPDP 866

Query: 1605 XXXXXXXXXXXXXXXXXXDLGAIL-EDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGK 1429
                              DL A+L ED +S    EG   SV E S     + G+  K G+
Sbjct: 867  TEDVKQGSSSSDFTSDSEDLAAVLDEDRKSFENGEGPQSSVLEASTLLRGSGGKGSKRGQ 926

Query: 1428 RKGRSLNDELSYLTESN-----TEAVSGKRRGERLDYKKLHDEAYRNATSDSSDEDYTET 1264
            ++   + DELS L ES+     +  VSGKR  ERLDYKKLHDE Y +  + S DE+Y ET
Sbjct: 927  KR-HFIKDELSSLIESDPGQDGSTPVSGKRHVERLDYKKLHDEEYGDIPT-SDDEEYIET 984

Query: 1263 AGAKRRKSNAGK-AIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXX 1087
            A  ++RK  AG+ +      K         T D   +  +N+H+      +K        
Sbjct: 985  AVPRKRKKGAGQVSPGSLKGKPSTIKKGKTTKDIKDDPDKNEHTPRRTPRRKSSANDNSS 1044

Query: 1086 XXXXXXXXXXXXXXXGKDA-AKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLR 910
                              A    Y+RLGEAV QRL  SFKENQYP + +K+ LA+ELG+ 
Sbjct: 1045 SPNESLKSSPKSGSTSGRAKGSTYRRLGEAVTQRLYTSFKENQYPDRSMKERLAQELGVM 1104

Query: 909  VQQVGKWFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAAKQSPVLPSPSDSG 730
             +QV KWFENAR   +    +   M    +   +S++   H  +   +      + + S 
Sbjct: 1105 AKQVSKWFENARHCVKAGLALPQAMRTQPNQAETSIKDAHHDGAQKNESPGTADAVAGSC 1164

Query: 729  MENLISSQVRPGNEECQITDAGEG--KSVESEASGEKSTRKRKVDNQGSSAGN 577
             +++  +++         T A +G  +  +S+  G     K K   + SS G+
Sbjct: 1165 SQDVKDNKLATPKSSRAKTSAPKGRKRKSKSDPGGSDLDEKFKTPPETSSRGD 1217


>gb|EXB76647.1| Homeobox protein [Morus notabilis]
          Length = 1031

 Score =  472 bits (1214), Expect = e-130
 Identities = 298/694 (42%), Positives = 381/694 (54%), Gaps = 20/694 (2%)
 Frame = -1

Query: 2640 LQHSEPAEKDATNVASESVPHEGTSLPSRKQISSLLPPV-SNRVLRSRSQEKPKASE-SK 2467
            L+  E + K   N  S+    +  +  SRK+   L   V S+RVLRSR+QEK K+ E S 
Sbjct: 309  LEQLETSSKSLVNKPSQLGRKDKQTSKSRKKQYMLRSLVHSDRVLRSRTQEKLKSHELSN 368

Query: 2466 AVEVENSANEGRKSKQRKGRMKKIPVNEFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKG 2287
             +    +  E R  +++K R  ++  +EFSRIR  L+Y  +R+ YE+NLIDAYS EGWKG
Sbjct: 369  TLSNIGNGVEKRMKERKKRRGTRVIADEFSRIRKRLKYFFNRIHYEQNLIDAYSSEGWKG 428

Query: 2286 QSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFC 2107
             SLEKLKPEKELQRA+S+IFR KLKIRDLFQQLD    EGRFP+SLFDSEGQIDSEDIFC
Sbjct: 429  TSLEKLKPEKELQRAKSEIFRRKLKIRDLFQQLDSLCAEGRFPKSLFDSEGQIDSEDIFC 488

Query: 2106 AKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGM 1927
            AKCGSKD++  NDIILCDGAC+RGFHQFCL+PPLL EDIPPDDEGWLCPGCDCK+DC  +
Sbjct: 489  AKCGSKDMSANNDIILCDGACDRGFHQFCLEPPLLSEDIPPDDEGWLCPGCDCKVDCFDL 548

Query: 1926 LNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXXXXXXXXXXXXXXXDHEVS 1747
            LND   + LSVTD+WEKVFPE AAAA  GK  D                        +V 
Sbjct: 549  LNDSYGTNLSVTDSWEKVFPEAAAAAREGKDQDHNLEFPSDDSEDDDYDPYGPEIVEKVE 608

Query: 1746 RGESTSDGSDYFSASDDI---VPPLDNKQIFXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1576
              ES+SD S+Y SA D++    PP D +Q F                             
Sbjct: 609  GDESSSDESEYTSACDELEGEAPPKD-EQYFGLSSDDSEDNDFDPDDQDVDENAKQESSS 667

Query: 1575 XXXXXXXXDLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGR--SLNDE 1402
                    DL   L++G+   KDE           P        +++ KR G   S+ DE
Sbjct: 668  SDFTSDSEDLAFTLDEGQIAEKDE------VSSLDPTRSLGNAVMQSSKRGGNKSSIKDE 721

Query: 1401 LSYLTESNT-----EAVSGKRRGERLDYKKLHDEAYRNATSDSS-DEDYTETAGAKRRKS 1240
            L  + ES T       +SGKR  ERLDYK+LHDE Y +  SDSS DED+T+ A  ++RK 
Sbjct: 722  LLDILESGTGQDGSPPISGKRHVERLDYKRLHDETYGHLPSDSSDDEDWTDYAAPRKRKR 781

Query: 1239 NAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKV--XXXXXXXXXXXX 1066
              G+   +  N+         T D   N  E++  V  +  ++  V              
Sbjct: 782  TTGQVSSVSPNENASIIKNQTTTDAANNDLEDNEYVPRRRSRQNSVVTDENNIPNKLLQG 841

Query: 1065 XXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVGKWF 886
                     +      +RLGEAV QRL +SFKENQY  +  K+SLA+ELGL   QV KWF
Sbjct: 842  SPKSGSTGRRRELSTNRRLGEAVTQRLYQSFKENQYLDRATKESLAQELGLTSYQVSKWF 901

Query: 885  ENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAAKQSPVLPSPSDSGMENLISSQ 706
            ENARWS+RHSS  +  +   AS   S+L  + +      + +  + + + +G  N  +  
Sbjct: 902  ENARWSYRHSSSKKPGISEHASKE-STLSPQTNKKLFETELNTSITNSTCNGALN--NEL 958

Query: 705  VRPGN---EECQITDAGEGK--SVESEASGEKST 619
             R GN   E C   D G+GK      E+SG+ ST
Sbjct: 959  PRTGNAMPESCS-GDVGDGKVEMPTKESSGQTST 991


>emb|CBI22504.3| unnamed protein product [Vitis vinifera]
          Length = 977

 Score =  465 bits (1196), Expect = e-128
 Identities = 311/779 (39%), Positives = 406/779 (52%), Gaps = 33/779 (4%)
 Frame = -1

Query: 2850 LPGPEYLEHSNGEQIVISNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMGSA 2671
            LP  +  ++S  E + +  +D I N          E        E +G +   PEN+   
Sbjct: 81   LPPADVTKNSLTEHLGLPPEDAIKNDGTEQLGFFPEVVTKSSIIEKLGQSEPPPENVA-- 138

Query: 2670 TCDAHENHLDLQHSEPAEKDATNVASESVPHEGTSLPSRKQISSLLPPVSNRVLRSRSQE 2491
                   +  L  S  A KD  N  +  +      L  R  +S       +RVLRSRSQE
Sbjct: 139  ------RYSGLDQSGSAPKDLANKRTAKLVKRKYKL--RSSVSG------SRVLRSRSQE 184

Query: 2490 KPKASESKAVEVENSANEGRKSKQRKGRMKKIPVNEFSRIRTHLRYLLHRVKYERNLIDA 2311
            KPKAS+     V  SA+  RK +++K RM K   +EF+RIR HLRYLL+R+ YE+NLIDA
Sbjct: 185  KPKASQPSDNFVNASASRERKGRKKK-RMNKTTADEFARIRKHLRYLLNRMSYEQNLIDA 243

Query: 2310 YSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQ 2131
            YS EGWKGQS+EKLKPEKELQRA S+I R KL+IRDLFQ LD    EGRFPESLFDSEGQ
Sbjct: 244  YSAEGWKGQSVEKLKPEKELQRASSEISRRKLQIRDLFQHLDSLCAEGRFPESLFDSEGQ 303

Query: 2130 IDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCD 1951
            IDSEDIFCAKC SKD++ +NDIILCDGAC+RGFHQFCL+PPLLKE+IPPDDEGWLCP CD
Sbjct: 304  IDSEDIFCAKCESKDMSADNDIILCDGACDRGFHQFCLEPPLLKEEIPPDDEGWLCPACD 363

Query: 1950 CKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAA-----ASGKTMDEGTXXXXXXXXXXX 1786
            CK+DC+ +LND Q + LSV D+WEKVFPE AAA       SG + D+             
Sbjct: 364  CKVDCMDLLNDSQGTKLSVIDSWEKVFPEAAAAGNNQDNNSGFSSDDSEDNDYDPDCPEV 423

Query: 1785 XXXXXXXXDHEVSRGES----TSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXX 1618
                           ES     SD SD+ SASDD+V   +N+Q                 
Sbjct: 424  DEKGQGDKSSSDKFDESDEFDESDESDFTSASDDMVVSPNNEQCLGLPSDDSEDDDFDPD 483

Query: 1617 XXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPSVSED---------SKPN 1465
                                       +++  +       F S SED            N
Sbjct: 484  APE------------------------IDEQVNQGSSSSDFTSDSEDFTATLDRRNFSDN 519

Query: 1464 AVASGEKLKAGKRKGRSLNDELSYLTESNT----EAVSGKRRGERLDYKKLHDEAYRNAT 1297
                 E+ + G++K  +L DEL  + ESN+      +S KR  ERLDYKKLHDEAY N +
Sbjct: 520  EDGLDEQRRFGRKKKDTLKDELLSVLESNSGQDNAPLSAKRHVERLDYKKLHDEAYGNVS 579

Query: 1296 SDSS-DEDYTETAGAKRRKSNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEG-- 1126
            SDSS DED+TE    ++RK+ +G    +        N  T   +   N K+  H +E   
Sbjct: 580  SDSSDDEDWTENVIPRKRKNLSGNVASV------SPNGNTSITENGTNTKDIKHDLEAAG 633

Query: 1125 -----KSHKKLKV-XXXXXXXXXXXXXXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKEN 964
                 ++ +KL                        K     YK+LGEAV +RL +SF+EN
Sbjct: 634  CTPKRRTRQKLNFESTNNSLAESHKDSRSPGSTGEKSGQSSYKKLGEAVTERLYKSFQEN 693

Query: 963  QYPKQDVKQSLAKELGLRVQQVGKWFENARWSFRHSSRME-SKMVAAASTNGSSLRTRVH 787
            QYP + +K+ LA+ELG+  +QV KWFENARWSFRH    E S   +A   + S+ +T   
Sbjct: 694  QYPDRAMKEKLAEELGITSRQVSKWFENARWSFRHRPPKEASAGKSAVKKDASTSQT--- 750

Query: 786  VMSLAAKQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGEGKS-VESEASGEKSTRK 613
                  +Q  VL   S +G+    S +      + +  +A  GKS V+ +AS  ++ +K
Sbjct: 751  --DQKPEQEVVLRESSHNGVGKKESPKAGASKVD-RSKEANAGKSAVKKDASTSQTDQK 806


>ref|XP_002269077.1| PREDICTED: homeobox protein HAT3.1-like [Vitis vinifera]
          Length = 968

 Score =  465 bits (1196), Expect = e-128
 Identities = 311/779 (39%), Positives = 406/779 (52%), Gaps = 33/779 (4%)
 Frame = -1

Query: 2850 LPGPEYLEHSNGEQIVISNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMGSA 2671
            LP  +  ++S  E + +  +D I N          E        E +G +   PEN+   
Sbjct: 81   LPPADVTKNSLTEHLGLPPEDAIKNDGTEQLGFFPEVVTKSSIIEKLGQSEPPPENVA-- 138

Query: 2670 TCDAHENHLDLQHSEPAEKDATNVASESVPHEGTSLPSRKQISSLLPPVSNRVLRSRSQE 2491
                   +  L  S  A KD  N  +  +      L  R  +S       +RVLRSRSQE
Sbjct: 139  ------RYSGLDQSGSAPKDLANKRTAKLVKRKYKL--RSSVSG------SRVLRSRSQE 184

Query: 2490 KPKASESKAVEVENSANEGRKSKQRKGRMKKIPVNEFSRIRTHLRYLLHRVKYERNLIDA 2311
            KPKAS+     V  SA+  RK +++K RM K   +EF+RIR HLRYLL+R+ YE+NLIDA
Sbjct: 185  KPKASQPSDNFVNASASRERKGRKKK-RMNKTTADEFARIRKHLRYLLNRMSYEQNLIDA 243

Query: 2310 YSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQ 2131
            YS EGWKGQS+EKLKPEKELQRA S+I R KL+IRDLFQ LD    EGRFPESLFDSEGQ
Sbjct: 244  YSAEGWKGQSVEKLKPEKELQRASSEISRRKLQIRDLFQHLDSLCAEGRFPESLFDSEGQ 303

Query: 2130 IDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCD 1951
            IDSEDIFCAKC SKD++ +NDIILCDGAC+RGFHQFCL+PPLLKE+IPPDDEGWLCP CD
Sbjct: 304  IDSEDIFCAKCESKDMSADNDIILCDGACDRGFHQFCLEPPLLKEEIPPDDEGWLCPACD 363

Query: 1950 CKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAA-----ASGKTMDEGTXXXXXXXXXXX 1786
            CK+DC+ +LND Q + LSV D+WEKVFPE AAA       SG + D+             
Sbjct: 364  CKVDCMDLLNDSQGTKLSVIDSWEKVFPEAAAAGNNQDNNSGFSSDDSEDNDYDPDCPEV 423

Query: 1785 XXXXXXXXDHEVSRGES----TSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXX 1618
                           ES     SD SD+ SASDD+V   +N+Q                 
Sbjct: 424  DEKGQGDKSSSDKFDESDEFDESDESDFTSASDDMVVSPNNEQCLGLPSDDSEDDDFDPD 483

Query: 1617 XXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPSVSED---------SKPN 1465
                                       +++  +       F S SED            N
Sbjct: 484  APE------------------------IDEQVNQGSSSSDFTSDSEDFTATLDRRNFSDN 519

Query: 1464 AVASGEKLKAGKRKGRSLNDELSYLTESNT----EAVSGKRRGERLDYKKLHDEAYRNAT 1297
                 E+ + G++K  +L DEL  + ESN+      +S KR  ERLDYKKLHDEAY N +
Sbjct: 520  EDGLDEQRRFGRKKKDTLKDELLSVLESNSGQDNAPLSAKRHVERLDYKKLHDEAYGNVS 579

Query: 1296 SDSS-DEDYTETAGAKRRKSNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEG-- 1126
            SDSS DED+TE    ++RK+ +G    +        N  T   +   N K+  H +E   
Sbjct: 580  SDSSDDEDWTENVIPRKRKNLSGNVASV------SPNGNTSITENGTNTKDIKHDLEAAG 633

Query: 1125 -----KSHKKLKV-XXXXXXXXXXXXXXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKEN 964
                 ++ +KL                        K     YK+LGEAV +RL +SF+EN
Sbjct: 634  CTPKRRTRQKLNFESTNNSLAESHKDSRSPGSTGEKSGQSSYKKLGEAVTERLYKSFQEN 693

Query: 963  QYPKQDVKQSLAKELGLRVQQVGKWFENARWSFRHSSRME-SKMVAAASTNGSSLRTRVH 787
            QYP + +K+ LA+ELG+  +QV KWFENARWSFRH    E S   +A   + S+ +T   
Sbjct: 694  QYPDRAMKEKLAEELGITSRQVSKWFENARWSFRHRPPKEASAGKSAVKKDASTSQT--- 750

Query: 786  VMSLAAKQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGEGKS-VESEASGEKSTRK 613
                  +Q  VL   S +G+    S +      + +  +A  GKS V+ +AS  ++ +K
Sbjct: 751  --DQKPEQEVVLRESSHNGVGKKESPKAGASKVD-RSKEANAGKSAVKKDASTSQTDQK 806


>ref|XP_002527467.1| Homeobox protein HAT3.1, putative [Ricinus communis]
            gi|223533107|gb|EEF34865.1| Homeobox protein HAT3.1,
            putative [Ricinus communis]
          Length = 896

 Score =  464 bits (1195), Expect = e-128
 Identities = 292/696 (41%), Positives = 386/696 (55%), Gaps = 9/696 (1%)
 Frame = -1

Query: 2721 PENMGSATCAPENMGSATCDAHENHLDLQHSEPAEKDATNVASESVPHEGTSLPSRKQIS 2542
            P N      A E +G    DA + H +   SE   KDA + +S       T+  SRK+  
Sbjct: 153  PPNNEMKVPASEKLGPPH-DAEDKHWNGTQSEILSKDAVSNSSRLGRRVKTTAKSRKKYM 211

Query: 2541 SLLPPVSNRVLRSRSQEKPKASESKAVEVENSAN-EGRKSKQRKGRMKKIPVNEFSRIRT 2365
                  S+RV++ RSQEKPKA ES       S+N E  + K++K   K +  +E+S IR 
Sbjct: 212  LRCLRRSDRVMQYRSQEKPKAPESSTNLPNVSSNVEKTRKKKKKRERKSVEADEYSIIRK 271

Query: 2364 HLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLD 2185
            +LRYLL+R+ YE++LI AYS EGWKG SLEKLKPEKELQRA S+I R K KIRDLFQ++D
Sbjct: 272  NLRYLLNRIGYEQSLITAYSAEGWKGLSLEKLKPEKELQRATSEILRRKSKIRDLFQRID 331

Query: 2184 RSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPL 2005
                EGRFPESLFDS+GQI SEDIFCAKCGSKDLT +NDIILCDGAC+RGFHQ+CL PPL
Sbjct: 332  SLCGEGRFPESLFDSDGQISSEDIFCAKCGSKDLTADNDIILCDGACDRGFHQYCLVPPL 391

Query: 2004 LKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDE 1825
            LKEDIPPDD+GWLCPGCDCK+DC+ +LN+ Q + +S++D+WEKVFPE   AAA G+  D+
Sbjct: 392  LKEDIPPDDQGWLCPGCDCKVDCIDLLNESQGTNISISDSWEKVFPE---AAAPGQNPDQ 448

Query: 1824 GTXXXXXXXXXXXXXXXXXXXDHEVSRGESTSDGSDYFS-ASDDIVPPLDNKQIFXXXXX 1648
                                 D +    ES+SD SD     SD++  P  +KQ       
Sbjct: 449  NFGPPSDDSDDNDYDPDIPEIDEKSQGDESSSDDSDDSDFTSDELEAPPGDKQQLGLSSE 508

Query: 1647 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPSVSEDSKP 1468
                                            DL A L++ E   +DE     +S  ++ 
Sbjct: 509  DSGDDDYDPDAPDLDDIVKEESSSSDFTSDSEDLAATLDNNELSGEDERR---ISVGTRG 565

Query: 1467 NAVASGEKLKAGKRKGRSLNDELSYLTESN-----TEAVSGKRRGERLDYKKLHDEAYRN 1303
            ++   G   K G++K +SL  EL  + E N     +  +SGKR  ERLDYKKL+DE Y N
Sbjct: 566  DSTKEGS--KRGRKKKQSLQSELLSIEEPNPSQDGSAPISGKRNVERLDYKKLYDETYGN 623

Query: 1302 ATSDSS-DEDYTETAGAKRRKSNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEG 1126
             +SDSS DED+T+  GA +R+    K+ +            TDT  G Q+ KE ++ V  
Sbjct: 624  VSSDSSDDEDFTDDVGAVKRR----KSTQAALGSANGNASVTDT--GKQDLKETEY-VPK 676

Query: 1125 KSHKKLKVXXXXXXXXXXXXXXXXXXXXGKDAAKQ-YKRLGEAVVQRLVESFKENQYPKQ 949
            +S ++L                      GK      Y+RLGE V + L  SFKENQYP +
Sbjct: 677  RSRQRLISENTSITPTKAHEGTSPSSSCGKTVRPSGYRRLGETVTKGLYRSFKENQYPDR 736

Query: 948  DVKQSLAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAA 769
            D K+ LA+ELG+  QQV KWFENARWSF HSS M++  +     N S +     ++  +A
Sbjct: 737  DRKEHLAEELGITYQQVTKWFENARWSFNHSSSMDANRIGKTPENNSPVSKTTTILLESA 796

Query: 768  KQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGE 661
             ++ V  +  DS  +   S ++     E  + DA E
Sbjct: 797  PET-VSGAAIDSAAQREESPKIGDAMVEIYVEDARE 831


>gb|EMJ01257.1| hypothetical protein PRUPE_ppa023106mg [Prunus persica]
          Length = 1058

 Score =  461 bits (1187), Expect = e-127
 Identities = 311/741 (41%), Positives = 398/741 (53%), Gaps = 43/741 (5%)
 Frame = -1

Query: 2967 LTAHDL-----GSDNIEPLEQKQDVAQDIGRKSPSETGVVASS----ELPGPEYLEHSNG 2815
            +T+H +     GS   EP +QK  +     +   ++T    SS    E PGP        
Sbjct: 222  ITSHQINEFGSGSVPSEPAKQKDQLDSVPAQNDEAKTSKAVSSSTVFEQPGPSIE----- 276

Query: 2814 EQIVISNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMGSATCDAHENHLDLQ 2635
                ++   PI +S P               P    S + + + M     D  +N   LQ
Sbjct: 277  ---AMTEDSPIGHSEP---------------PLEDLSKSLSDKEMEPLPEDVTQNS-SLQ 317

Query: 2634 HSEPAEKDATNVASESVPHEGTSLPSRKQISSLLPPV-SNRVLRSRSQEKPKASESK--- 2467
              E A K+A  ++S   P +  +  SRK+       V S+RVLRS++ EK K  + K   
Sbjct: 318  QLETASKNALKISSCLGPKDKKNPKSRKRKYMSRSFVRSDRVLRSKTGEKEKPKDLKLSN 377

Query: 2466 ---AVEVENSA-----NEGRKSKQRKGRMKKIPV-NEFSRIRTHLRYLLHRVKYERNLID 2314
                +E  NS       E +K K+RK R     + +EFSRIRTHLRYLL+R+ YE++LID
Sbjct: 378  NVATLESSNSIANVSNGEEKKRKKRKNRRDNRAIADEFSRIRTHLRYLLNRIGYEKSLID 437

Query: 2313 AYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEG 2134
            AYS EGWKG SLEKLKPEKELQRA S+I R KLKIRDLFQ+L+    EG FPESLFDSEG
Sbjct: 438  AYSGEGWKGSSLEKLKPEKELQRATSEILRRKLKIRDLFQRLESLCAEGMFPESLFDSEG 497

Query: 2133 QIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGC 1954
            QIDSEDIFC KCGSKD++++NDIILCDGAC+RGFHQFCL+PPLL EDIPPDDEGWLCPGC
Sbjct: 498  QIDSEDIFCGKCGSKDVSLDNDIILCDGACDRGFHQFCLEPPLLSEDIPPDDEGWLCPGC 557

Query: 1953 DCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXXXXXXXXXXX 1774
            DCK+DC+ +LND Q + LSVTD+WEKVFPE AAAA++G+  D                  
Sbjct: 558  DCKVDCIDLLNDSQGTDLSVTDSWEKVFPEAAAAASAGENQD-NHGLPSDDSDDNDYDPD 616

Query: 1773 XXXXDHEVSRGESTSDGSDYFSASDDIVPPLDN-KQIFXXXXXXXXXXXXXXXXXXXXXX 1597
                D++V   ES+SD S+Y SASD +  P  N +Q                        
Sbjct: 617  GPETDNKVQGEESSSDESEYASASDGLETPKSNDEQYLGLPSEDSEDDDYNPYAPDVNED 676

Query: 1596 XXXXXXXXXXXXXXXDLGAILEDGESPNKD-EGHFPSVSEDSKPNAVASGEKLKAGKRKG 1420
                           DLGA L+D    ++D EG   +  +DSKP+   SGE+     +K 
Sbjct: 677  VKQESSSSDFTSDSEDLGAALDDNIMSSEDVEGPKSTSLDDSKPHR-GSGEQSSISGQKK 735

Query: 1419 RSLNDELSYLTES-----NTEAVSGKRRGERLDYKKLHDEAYRNATSDSS-DEDYTETAG 1258
             SL DEL  L ES      +  +SGKR  ERLDYK+LHDEAY N  +DSS DED+ + A 
Sbjct: 736  HSLKDELISLLESGPGQGESAPLSGKRHIERLDYKRLHDEAYGNVPTDSSDDEDWNDIAT 795

Query: 1257 AKRRKSNAGK-AIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXXXX 1081
             ++RK   G+ A R    K  +  +   T D   +  EN+++     H+K  V       
Sbjct: 796  QRKRKKGTGQVANRSPNGKTSNIKNGVITKDIKPDVDENENTPRRMPHRKSNVEDTSNLS 855

Query: 1080 XXXXXXXXXXXXXGKDAAKQ---YKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLR 910
                            A      Y RLGEA  QRL +SFKEN YP + +K+SLA+ELGL 
Sbjct: 856  NKSPKGSTKSGSTSGRAGSSRSTYSRLGEAATQRLCKSFKENHYPDRSMKESLARELGLM 915

Query: 909  VQQ---------VGKWFENAR 874
             +Q         V KWFENAR
Sbjct: 916  AKQVIPSFILASVSKWFENAR 936


>ref|XP_006486963.1| PREDICTED: homeobox protein HAT3.1-like isoform X1 [Citrus sinensis]
            gi|568867273|ref|XP_006486964.1| PREDICTED: homeobox
            protein HAT3.1-like isoform X2 [Citrus sinensis]
          Length = 1063

 Score =  458 bits (1178), Expect = e-126
 Identities = 313/796 (39%), Positives = 411/796 (51%), Gaps = 16/796 (2%)
 Frame = -1

Query: 2925 EQKQDVAQDIGRKSPSETGVVASSELPGPEYLEHSNGEQIVISNKDPISNSIPGDFRLPH 2746
            EQ  +    I    PS       S+L   E  E S GE  + ++ + +  S     + P 
Sbjct: 263  EQTPEFTPGISSHEPSVVNYKLGSQLEQTELGETSAGE--LGASLELVVKSSIEQLKQPE 320

Query: 2745 ENGAAICAPENMGSATCAPENMGSATCDAHENHLDLQHSE-PAEKDATNVA--SESVPHE 2575
                 I  P    SAT   +++ S++ D  E    L+ SE P    A N A         
Sbjct: 321  ---VPITIPSTKTSAT---KHLQSSS-DLMEKKSCLEQSETPPNYVANNSACLGRKGKRA 373

Query: 2574 GTSLPSRKQISSLLPPVSNRVLRSRSQEKPKASESKAVEVE-NSANEGRKSKQRKGRMKK 2398
              SL +   + SL+   S+RVLRSRS E+P   ES     + NS  E ++ K+ K R KK
Sbjct: 374  TKSLKNNYTVRSLIG--SDRVLRSRSGERPIPPESSINLADVNSIGERKQKKRNKIRRKK 431

Query: 2397 IPVNEFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYK 2218
            I  +E+SRIRTHLRYLL+R+ YE+NLIDAYS EGWKG S+EKLKPEKELQRA S+I R K
Sbjct: 432  IVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRK 491

Query: 2217 LKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACER 2038
            LKIRDLFQ+LD SL  G FP+SLFDSEGQIDSEDI+CAKCGSKDL+ +NDIILCDGAC+R
Sbjct: 492  LKIRDLFQRLD-SLCAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDR 550

Query: 2037 GFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEA 1858
            GFHQ+CL+PPLLKEDIPPDDEGWLCPGCDCK+DC+ ++N+ Q + L +TDNWEKVFPE  
Sbjct: 551  GFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPE-- 608

Query: 1857 AAAASGKTMDEGTXXXXXXXXXXXXXXXXXXXDHEVSRGESTSDG-----SDYFSASDDI 1693
              AA+G   D                      D +    ES+SDG     SD+ S SD++
Sbjct: 609  --AAAGHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEV 666

Query: 1692 VPPLDNKQIF--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLGAILEDGES 1519
              P D+K                                          DL A+LED  S
Sbjct: 667  EAPADDKTYLGRSSEDSEDDEYNPDAPDLDDKVTQESSSSGSDFTSDSEDLAAVLEDNRS 726

Query: 1518 PNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLTESNTEA---VSGKRRG 1348
               DEG        + P   ++G++ K G     SLN+EL  + +   +    V GKR  
Sbjct: 727  SGNDEG-------AASPLGHSNGQRYKDG-GNNESLNNELLSIIKPGQDGAAPVYGKRSS 778

Query: 1347 ERLDYKKLHDEAYRNATSDSSDEDYTETAGAKRRKSNAGKAIRICT--NKIQDRNDRTDT 1174
            ERLDYKKL+DE Y N   DSSD++     G  R+++ + K     +   K      R  T
Sbjct: 779  ERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKST 838

Query: 1173 MDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXXXXXXXXXXXXGKDAAKQYKRLGEAVV 994
                +   E +++ + +   KL                      G+     Y+++GE V 
Sbjct: 839  KAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKIGEEVT 898

Query: 993  QRLVESFKENQYPKQDVKQSLAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAASTN 814
            Q+L  SFKENQYP +  K+SLAKELGL   QV KWFEN RWSF H S   +K+  A S  
Sbjct: 899  QKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHPSSKNAKL--ANSEK 956

Query: 813  GSSLRTRVHVMSLAAKQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGEGKSVESEAS 634
            G+         +  + ++ V    + +G EN+ SS+    +  C   D  +  + E  + 
Sbjct: 957  GT--------CTPQSNKNTVGRVSNCNGAENVQSSKTGVDDTGCMTGDV-KNNTQECNSI 1007

Query: 633  GEKSTRKRKVDNQGSS 586
               S   RK D  G S
Sbjct: 1008 KPTSQTSRKRDRDGKS 1023


>ref|XP_006422879.1| hypothetical protein CICLE_v10027725mg [Citrus clementina]
            gi|557524813|gb|ESR36119.1| hypothetical protein
            CICLE_v10027725mg [Citrus clementina]
          Length = 1063

 Score =  457 bits (1175), Expect = e-125
 Identities = 314/796 (39%), Positives = 408/796 (51%), Gaps = 16/796 (2%)
 Frame = -1

Query: 2925 EQKQDVAQDIGRKSPSETGVVASSELPGPEYLEHSNGEQIVISNKDPISNSIPGDFRLPH 2746
            EQ  +    I    PS       S+L   E  E S GE +  S +  + +SI    +L  
Sbjct: 263  EQTPEFTPGISSHEPSVVNYKLGSQLEQTELGETSAGE-LGASLELVVKSSIEQLKQLE- 320

Query: 2745 ENGAAICAPENMGSATCAPENMGSATCDAHENHLDLQHSE-PAEKDATNVA--SESVPHE 2575
                 I  P    SAT   +++ S++ D  E    L+ SE P    A N A         
Sbjct: 321  ---VPITIPSTKTSAT---KHLQSSS-DLMEKKSCLEQSETPPNYVANNSACLGRKGKRA 373

Query: 2574 GTSLPSRKQISSLLPPVSNRVLRSRSQEKPKASESKAVEVE-NSANEGRKSKQRKGRMKK 2398
              SL +   + SL+   S+RVLRSRS E+P   ES     + NS  E ++ K+ K R KK
Sbjct: 374  TKSLKNNYTVRSLIG--SDRVLRSRSGERPLPPESSNNLADVNSIGERKQKKRNKIRRKK 431

Query: 2397 IPVNEFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYK 2218
            I  +E+SRIRTHLRYLL+R+ YE+NLIDAYS EGWKG S+EKLKPEKELQRA S+I R K
Sbjct: 432  IVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRK 491

Query: 2217 LKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACER 2038
            LKIRDLFQ+LD SL  G FP+SLFDSEGQIDSEDI+CAKCGSKDL+ +NDIILCDGAC+R
Sbjct: 492  LKIRDLFQRLD-SLCAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDR 550

Query: 2037 GFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEA 1858
            GFHQ+CL+PPLLKEDIPPDDEGWLCPGCDCK+DC+ ++N+ Q + L +TDNWEKVFPE  
Sbjct: 551  GFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPE-- 608

Query: 1857 AAAASGKTMDEGTXXXXXXXXXXXXXXXXXXXDHEVSRGESTSDG-----SDYFSASDDI 1693
              AA+G   D                      D +    ES+SDG     SD+ S SD++
Sbjct: 609  --AAAGHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEV 666

Query: 1692 VPPLDNKQI--FXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLGAILEDGES 1519
              P D+K                                          DL A+LED  S
Sbjct: 667  EAPADDKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRS 726

Query: 1518 PNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLTESNTEA---VSGKRRG 1348
               DEG        + P   ++G++ K G     SLN+EL  + +   +    V GKR  
Sbjct: 727  SGNDEG-------AASPLGHSNGQRYKDG-GNNESLNNELLSIIKPGQDGAVPVYGKRSS 778

Query: 1347 ERLDYKKLHDEAYRNATSDSSDEDYTETAGAKRRKSNAGKAIRICT--NKIQDRNDRTDT 1174
            ERLDYKKL+DE Y N   DSSD++     G  R+++ + K     +   K      R  T
Sbjct: 779  ERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKST 838

Query: 1173 MDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXXXXXXXXXXXXGKDAAKQYKRLGEAVV 994
                +   E +++ + +   KL                      G+     Y++LGE V 
Sbjct: 839  KAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVT 898

Query: 993  QRLVESFKENQYPKQDVKQSLAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAASTN 814
            Q+L  SFKENQYP +  K+SLAKELGL   QV KWFEN RWSF H S          S N
Sbjct: 899  QKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHPS----------SKN 948

Query: 813  GSSLRTRVHVMSLAAKQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGEGKSVESEAS 634
                 +     +  + ++ V    + +G EN+ SS+    +  C   D  +  + E  + 
Sbjct: 949  AELANSEKGTCTPQSNKNTVGRVSNCNGAENVQSSKTGVDDTGCMTGDV-KNNTQECNSI 1007

Query: 633  GEKSTRKRKVDNQGSS 586
               S   RK D  G S
Sbjct: 1008 KPTSQTSRKRDRDGKS 1023


>gb|EOX98399.1| Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain,
            putative isoform 1 [Theobroma cacao]
            gi|508706504|gb|EOX98400.1| Homeodomain-like protein with
            RING/FYVE/PHD-type zinc finger domain, putative isoform 1
            [Theobroma cacao]
          Length = 950

 Score =  455 bits (1171), Expect = e-125
 Identities = 308/762 (40%), Positives = 395/762 (51%), Gaps = 35/762 (4%)
 Frame = -1

Query: 3000 DGSSMNSSFESLTAHDLGSDNIEPLEQKQDVAQDIGRKSPSETGVVASSELPGPEYLEHS 2821
            + SS ++  + L+   L S          +V +++G +SP +   + S  LP        
Sbjct: 180  EDSSKHTKTDKLSCPQLVSSEPTVNFGSGNVCKELG-ESPEQRQQLDSESLP-------- 230

Query: 2820 NG-EQIVISNKDPISNSI----PGDFRLPHENGAAICAPENM-----GSATCAPENMGSA 2671
            NG E+  I+    +SN      P D    H  G     PE +      S +   E +G  
Sbjct: 231  NGIEESTIAVSSNVSNQALQLKPEDMGKSHCGGHLHSPPEGVTNVIQSSKSPLVEPLGLP 290

Query: 2670 TCDAHENHLDLQHSEPAEKDATNVASESVPHEG----------------TSLPSRKQISS 2539
               A  N    Q   P E  A N   E   HE                 TS   +K+   
Sbjct: 291  QEFAQGNPSTQQSGLPCEDMAQNSGVEQ--HETKPKNLLENSGRRRNGKTSKTIKKKYML 348

Query: 2538 LLPPVSNRVLRSRSQEKPKASESKAVEVENSANEGRKSKQRKGRMKKIPV-NEFSRIRTH 2362
                 S+RVLRS+ QEKPKA+ES     +  ++E +K ++R+ R     V +EFSRIRTH
Sbjct: 349  RSLRSSDRVLRSKLQEKPKATESSNNLADVGSSEQQKRRKRRRRKANREVADEFSRIRTH 408

Query: 2361 LRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDR 2182
            LRYLL+R+ YER+LI AYS EGWKG SLEKLKPEKELQRA S+I R KLKIRDLFQ +D 
Sbjct: 409  LRYLLNRINYERSLIAAYSTEGWKGLSLEKLKPEKELQRATSEILRRKLKIRDLFQHIDS 468

Query: 2181 SLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLL 2002
               EG+ PESLFDSEGQIDSEDIFCAKCGSKDL+  NDIILCDGAC+RGFHQ+CL PPLL
Sbjct: 469  LCAEGKLPESLFDSEGQIDSEDIFCAKCGSKDLSANNDIILCDGACDRGFHQYCLQPPLL 528

Query: 2001 KEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEG 1822
            KEDIPPDDEGWLCPGCDCK+DC+ ++N+ Q ++ S+TD+WEKVFP EAA AA+G+  D  
Sbjct: 529  KEDIPPDDEGWLCPGCDCKVDCIELVNESQGTSFSITDSWEKVFP-EAAVAAAGQNQDPN 587

Query: 1821 TXXXXXXXXXXXXXXXXXXXDHEVSRGESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXX 1642
                                D +    ES+S+ S++ S S+++  P    Q         
Sbjct: 588  FGLPSDDSDDNDYNPDGSETDEKDHGDESSSEESEFTSTSEELEVPAKVDQYLGLPSDDS 647

Query: 1641 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFP-SVSEDSKPN 1465
                                          DL A+LE+  +  KDEG    S   DSK  
Sbjct: 648  EDDDYDPDGPNHDEVVKPESSSSDFSSDSEDLDAMLEEDITSQKDEGPMANSAPRDSKRR 707

Query: 1464 AVASGEKLKAGKRKGRSLNDELSYLTESNTE----AVSGKRRGERLDYKKLHDEAYRNAT 1297
                GEK         S+NDEL  + E  +E    A+S KR  ERLDYK+L+DE Y N  
Sbjct: 708  KPKLGEK--------ESMNDELLSIMEPASEQDGSAISKKRSIERLDYKRLYDETYGNVP 759

Query: 1296 SDSS-DEDYTETAGAKRRKSNAGKAIRICTNKIQDRNDRTDTMDG-NQNHKENDHSVEGK 1123
            S SS DED+++    ++R     +      N     +      DG  QN +E +H    K
Sbjct: 760  SSSSDDEDWSDITAPRKRNKCTAEVASAPENGNVSVSRTVSVSDGLKQNPEETEHKPRRK 819

Query: 1122 SHKKLKVXXXXXXXXXXXXXXXXXXXXGKDA-AKQYKRLGEAVVQRLVESFKENQYPKQD 946
            + +  +                     GK A +  YKRLGEAV QRL +SFKENQYP + 
Sbjct: 820  TRQMSRFKDTDSSPAEIQGNTSVSGSSGKKAGSSTYKRLGEAVKQRLYKSFKENQYPDRA 879

Query: 945  VKQSLAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAAS 820
             KQSLAKEL +  QQV KWF+NARWSF +S      +   AS
Sbjct: 880  TKQSLAKELDMTFQQVSKWFDNARWSFNNSPSSHETIANNAS 921


>ref|XP_006589630.1| PREDICTED: pathogenesis-related homeodomain protein isoform X1
            [Glycine max]
          Length = 820

 Score =  451 bits (1161), Expect = e-124
 Identities = 298/693 (43%), Positives = 375/693 (54%), Gaps = 22/693 (3%)
 Frame = -1

Query: 2832 LEHSNGEQIVI--SNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPE--NMGSATC 2665
            LE S  EQ+ +  SN  P +   P    +  E   +I A    G     P   NM S   
Sbjct: 93   LEQSTVEQVTVDLSNDKPENKCKPLSENVQSEPVESIPAVVVEGQMQSNPSQANMSSV-- 150

Query: 2664 DAHENHLDLQHSEPAEKDATNVASESVPHEGTSLPSR---KQISSLLPPV-------SNR 2515
                N L  Q S  A  + ++  SE + +  T   SR   K+ S LL          S+R
Sbjct: 151  ----NELLDQPSGDAVNNISSNCSEKMSNSPTHSQSRRKGKKNSKLLKKYMLRSLGSSDR 206

Query: 2514 VLRSRSQEKPKASE--SKAVEVENSANEGRKSKQRKGRMKKIPVNEFSRIRTHLRYLLHR 2341
             LRSR++EKPK  E  S  V+  N+  + +  +++K R ++   N+FSRIR+HLRYLL+R
Sbjct: 207  ALRSRTKEKPKEPEPTSNLVDGNNNGVKRKSGRKKKKRKEEGITNQFSRIRSHLRYLLNR 266

Query: 2340 VKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRF 2161
            + YE +LIDAYS EGWKG S+EKLKPEKELQRA+S+I R KLKIRDLFQ LD    EG+F
Sbjct: 267  ISYENSLIDAYSGEGWKGYSIEKLKPEKELQRAKSEILRRKLKIRDLFQNLDSLCAEGKF 326

Query: 2160 PESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPD 1981
            PESLFDS G+IDSEDIFCAKC SK+L+  NDIILCDG C+RGFHQ CLDPP+L EDIPP 
Sbjct: 327  PESLFDSAGEIDSEDIFCAKCQSKELSTNNDIILCDGVCDRGFHQLCLDPPMLTEDIPPG 386

Query: 1980 DEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXX 1801
            DEGWLCPGCDCK DC+ ++ND   ++LS++D WE+VFPE  AA+ +G  MD  +      
Sbjct: 387  DEGWLCPGCDCKDDCMDLVNDSFGTSLSISDTWERVFPE--AASFAGNNMDNNSGVPSDD 444

Query: 1800 XXXXXXXXXXXXXDHEVSRGESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXX 1621
                           +V   ES+SD S+Y SAS+ +       Q                
Sbjct: 445  SDDDDYNPNGPDDV-KVEGDESSSDESEYASASEKLEGGSHEDQYLGLPSEDSDDGDYDP 503

Query: 1620 XXXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKL 1441
                                   DL A +ED  SP +D G    +S   K   V  G+KL
Sbjct: 504  DAPDVECKVNEESSSSDFTSDSEDLAAAIEDNTSPGQDGG----ISSSKKKGKV--GKKL 557

Query: 1440 KAGKRKGRSLNDELSYLTE--SNTEA---VSGKRRGERLDYKKLHDEAYRNATSDSSDED 1276
                    SL DELS L E  S  EA   VSGKR  ERLDYKKL++E Y + TSD  DED
Sbjct: 558  --------SLPDELSSLLEPDSGQEAPTPVSGKRHVERLDYKKLYEETYHSDTSD--DED 607

Query: 1275 YTETAGAKRRKSNAGKAIRICTNKIQDRND-RTDTMDGNQNHKENDHSVEGKSHKKLKVX 1099
            + +TA    +K   G    +  N     N   T   + +QN+ EN ++   KS       
Sbjct: 608  WNDTAAPSGKKKLTGNVTPVSPNGNASNNSIHTPKRNAHQNNVENTNNSPTKS------- 660

Query: 1098 XXXXXXXXXXXXXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKEL 919
                                K  +  +KRLGEAVVQRL +SFKENQYP +  K+SLA+EL
Sbjct: 661  --------LEGCSKSGSRDKKSGSSAHKRLGEAVVQRLHKSFKENQYPDRTTKESLAQEL 712

Query: 918  GLRVQQVGKWFENARWSFRHSSRMESKMVAAAS 820
            GL  QQV KWF N RWSFRHSS+ME+     AS
Sbjct: 713  GLTYQQVAKWFGNTRWSFRHSSQMETNSGINAS 745


>ref|XP_003555282.1| PREDICTED: homeobox protein HAT3.1-like isoform X1 [Glycine max]
          Length = 820

 Score =  447 bits (1149), Expect = e-122
 Identities = 304/773 (39%), Positives = 395/773 (51%), Gaps = 31/773 (4%)
 Frame = -1

Query: 2832 LEHSNGEQIVIS-------NK-DPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMG 2677
            LE S  EQ+ +        NK  P+S ++  +   P E+  A      M S+  A  NM 
Sbjct: 93   LEQSTVEQVSVDLSNDKSENKCKPLSENVQSE---PVESIPAFVVDGQMQSSP-AQANMS 148

Query: 2676 SATCDAHENHLDLQHSEPAEKDATNVASE--SVPHEGTSLPSRKQISSLLPPV------- 2524
            S       N L  Q S     + TN + +  + P    S    K+ S LL          
Sbjct: 149  SV------NELLDQPSGDVVNNITNCSEKMSNSPSHSQSRRKGKRNSKLLKKKYMLRSLG 202

Query: 2523 -SNRVLRSRSQEKPKASESKAVEVENSANEGRKSK---QRKGRMKKIPVNEFSRIRTHLR 2356
             S R LRSR++EKPK  E  +  V+ ++N+G K K   ++K R ++   ++FSRIR+HLR
Sbjct: 203  SSGRALRSRTKEKPKEPEPTSNLVDGNSNDGVKRKSGRKKKKRREEGITDQFSRIRSHLR 262

Query: 2355 YLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSL 2176
            YLL+R+ YE +LIDAYS EGWKG S+EKLKPEKELQRA+S+I R KLKIRDLF+ LD   
Sbjct: 263  YLLNRISYENSLIDAYSGEGWKGYSMEKLKPEKELQRAKSEILRRKLKIRDLFRNLDSLC 322

Query: 2175 TEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKE 1996
             EG+FPESLFDS G+IDSEDIFCAKC SK+L+  NDIILCDG C+RGFHQ CLDPPLL E
Sbjct: 323  AEGKFPESLFDSAGEIDSEDIFCAKCQSKELSTNNDIILCDGVCDRGFHQLCLDPPLLTE 382

Query: 1995 DIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTX 1816
            DIPP DEGWLCPGCDCK DC+ ++ND   ++LS++D WE+VFPE  AA+ +G  MD    
Sbjct: 383  DIPPGDEGWLCPGCDCKDDCMDLVNDSFGTSLSISDTWERVFPE--AASFAGNNMDNNLG 440

Query: 1815 XXXXXXXXXXXXXXXXXXDHEVSRGESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXX 1636
                                ++   ES+SD S+Y SAS+ +       Q           
Sbjct: 441  LPSDDSDDDDYNPNGSDDV-KIEGDESSSDESEYASASEKLEGGSHEDQYLGLPSEDSDD 499

Query: 1635 XXXXXXXXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPSVSEDSKPNAVA 1456
                                        DL A  ED  SP +D G              +
Sbjct: 500  GDYDPDAPDVDCKVNEESSSSDFTSDSEDLAAAFEDNTSPGQDGG------------INS 547

Query: 1455 SGEKLKAGKRKGRSLNDELSYLTESNT-----EAVSGKRRGERLDYKKLHDEAYRNATSD 1291
            S +K K GK    S+ DELS L E ++       VSGKR  ERLDYKKL++E Y + TSD
Sbjct: 548  SKKKGKVGK---LSMADELSSLLEPDSGQGGPTPVSGKRHVERLDYKKLYEETYHSDTSD 604

Query: 1290 SSDEDYTETAGAKRRKSNAGKAIRICTNKIQDRND-RTDTMDGNQNHKENDHSVEGKSHK 1114
              DED+ + A   R+K   G    +  N     N   T   + +QN  EN +S   KS  
Sbjct: 605  --DEDWNDAAAPSRKKKLTGNVTPVSPNANASNNSIHTLKRNAHQNKVENTNSSPTKS-- 660

Query: 1113 KLKVXXXXXXXXXXXXXXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQS 934
                                     +  +  +KRLGEAVVQRL +SFKENQYP +  K+S
Sbjct: 661  -------------LDGRSKSGSRDKRSGSSAHKRLGEAVVQRLHKSFKENQYPDRSTKES 707

Query: 933  LAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAAKQSPV 754
            LA+ELGL  QQV KWF+N RWSFRHSS+ME+     AS   +  R            SP 
Sbjct: 708  LAQELGLTYQQVAKWFDNTRWSFRHSSQMETNSGRNASPEATDGRAENEGEKQCESMSPE 767

Query: 753  LPSPSDSGMENLISSQVRPGNEECQITDAGEGKSV----ESEASGEKSTRKRK 607
            +   +     +     +     E Q+   G   S     +++   +  TRKRK
Sbjct: 768  VSGKNSKTTSSRKRKHLSEPLSEAQLDINGLATSSPNVHQTQVGNKMKTRKRK 820


>ref|XP_004496910.1| PREDICTED: pathogenesis-related homeodomain protein-like isoform X1
            [Cicer arietinum]
          Length = 995

 Score =  445 bits (1145), Expect = e-122
 Identities = 287/695 (41%), Positives = 372/695 (53%), Gaps = 11/695 (1%)
 Frame = -1

Query: 2631 SEPAEKDATNVASESVPHEGTSLPSRKQISSLLPPVSNRVLRSRSQEKPKASE--SKAVE 2458
            SE   K + ++ S       + L  +  + SL    S+R LRSR+++KPK  E  +  V+
Sbjct: 309  SERKSKSSAHLRSRHKGKSNSKLSKKYILRSL--GSSDRALRSRTRDKPKDPEPINNVVD 366

Query: 2457 VENSANEGRKSKQRKG-RMKKIPVNE-FSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQ 2284
            V N A + ++ K++K  R +K  +N+ +S+IR HLRYLL+R+ YE+NLIDAYS EGWKG 
Sbjct: 367  VSNDAMKTKRGKKKKKKRPRKEGINDQYSKIRAHLRYLLNRISYEQNLIDAYSGEGWKGY 426

Query: 2283 SLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCA 2104
            SLEKLKPEKE+QRA+S+I R KLKIRDLFQ LD    EGR PESLFDS+G+IDSEDIFCA
Sbjct: 427  SLEKLKPEKEIQRAKSEILRRKLKIRDLFQNLDSLCAEGRLPESLFDSKGEIDSEDIFCA 486

Query: 2103 KCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGML 1924
            KC +K L  +NDIILCDGAC+RGFHQ CLDPPLL EDIPP DEGWLCPGCDCK DC+ ++
Sbjct: 487  KCQTKVLGTDNDIILCDGACDRGFHQLCLDPPLLTEDIPPGDEGWLCPGCDCKDDCIELV 546

Query: 1923 NDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEG--TXXXXXXXXXXXXXXXXXXXDHEV 1750
            ND   + LS+T+ WE+VFPE A AA S    + G  +                   D EV
Sbjct: 547  NDLLGTNLSLTNTWERVFPEAATAAGSILDHNSGLPSDDSEDDDYNPNGPEDVEVEDAEV 606

Query: 1749 SRGESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1570
               ES+SD S+Y SAS+ +       Q                                 
Sbjct: 607  EGDESSSDESEYASASEKLEDSRHEDQYLGLPSEDSEDDDFDPDAPDLGGKVTEESSSSD 666

Query: 1569 XXXXXXDLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRSLNDELSYL 1390
                  DL A ++D  S  +D      + +D K     S +  K   RK  S+ DELS L
Sbjct: 667  FTSDSEDLAATIKDNMSTGQDGDITSPLLDDVKNLKGFSRQNHKV--RKKPSMADELSSL 724

Query: 1389 TES-----NTEAVSGKRRGERLDYKKLHDEAYRNATSDSSDEDYTETAGAKRRKSNAGKA 1225
             +S     +   ++ KR  ERLDY+KL++E Y++ TSD  DED+  +A   R+K  AGK 
Sbjct: 725  LKSDLGQEDITPITAKRNVERLDYQKLYEETYQSDTSD--DEDWDASATPSRKKKLAGKM 782

Query: 1224 IRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXXXXXXXXXXX 1045
              +  N     N R       Q HK     VE  ++   K                    
Sbjct: 783  TPVSPNGNASNNSRHTASRNTQQHK-----VENTNNSPTKT----------LEGCTKSGS 827

Query: 1044 XGKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVGKWFENARWSF 865
              K     YKRLGEAVVQRL +SFKENQYP++  K+SLA+ELGL  QQV KWF N RWSF
Sbjct: 828  RDKRRGLTYKRLGEAVVQRLYKSFKENQYPERTTKESLAQELGLTFQQVDKWFGNTRWSF 887

Query: 864  RHSSRMESKMVAAASTNGSSLRTRVHVMSLAAKQSPVLPSPSDSGMENLISSQVRPGNEE 685
            RHSS  E    A+  +N S   T     +   + +    +    G+EN        G  E
Sbjct: 888  RHSSHTE----ASPGSNASQQATDSGAENKEERGNASQQATDSPGVEN-------KGEGE 936

Query: 684  CQITDAGEGKSVESEASGEKSTRKRKVDNQGSSAG 580
            C++   G  +      S +K  RKR  + Q S AG
Sbjct: 937  CELVSQGTSREKSRTQSSKK--RKRLSEPQVSEAG 969


>gb|ESW15073.1| hypothetical protein PHAVU_007G041800g [Phaseolus vulgaris]
          Length = 826

 Score =  433 bits (1113), Expect = e-118
 Identities = 288/686 (41%), Positives = 375/686 (54%), Gaps = 21/686 (3%)
 Frame = -1

Query: 2835 YLEHSNGEQIVI--SNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMGSATCD 2662
            +L+ S  +++ +  SN +P + S P     P E+  A        S+         A   
Sbjct: 91   HLQQSTDKEVSLQLSNDEPENPSQPLSENEPVESAPAFAGDGQKQSSPAL------ANTS 144

Query: 2661 AHENHLDLQHSEP----AEKDATNVASESVPHEGTSLPSRKQISSLLPPV--SNRVLRSR 2500
               N LD    +     +EK + + A+  +  +G       + + +L  V  S+R LRS+
Sbjct: 145  YVNNMLDPPSGDAVINCSEKVSNSPANSQLRRKGKKNSKFLKKTYMLRSVGSSDRALRSK 204

Query: 2499 SQEKPKASESKAVEVE---NSANEGRKSKQRKGRMKKIPV---NEFSRIRTHLRYLLHRV 2338
            ++E PK  E  +  V+   N+ N+G K K  K + K   V   ++FSRI++HLRYLL+R+
Sbjct: 205  TKENPKTPEPNSNLVDCNNNNNNDGVKKKSFKKKRKSGEVGITDQFSRIKSHLRYLLNRI 264

Query: 2337 KYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFP 2158
             YE+NLIDAYS EGWKG S+EKLKPEKELQRA+S+I R KL IR+LF+ LD   TEG+ P
Sbjct: 265  GYEKNLIDAYSAEGWKGYSMEKLKPEKELQRAKSEIIRRKLNIRELFRNLDSLCTEGKLP 324

Query: 2157 ESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDD 1978
            ESLFDSEG+IDSEDIFCAKC SK+L+  NDIILCDG C+RGFHQ CLDPPLL EDIPP D
Sbjct: 325  ESLFDSEGEIDSEDIFCAKCHSKELSSNNDIILCDGVCDRGFHQLCLDPPLLTEDIPPGD 384

Query: 1977 EGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXXX 1798
            EGWLCPGCDCK DC+ ++ND   ++LS++D WE+VFP EAAAAA  KT  +         
Sbjct: 385  EGWLCPGCDCKDDCMDLINDSFGTSLSISDTWERVFP-EAAAAAGNKT--DNNSGLPSDD 441

Query: 1797 XXXXXXXXXXXXDHEVSRGESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXX 1618
                        D +V   ES+SD SDY SAS+++       Q                 
Sbjct: 442  SDDDDYNPNGPEDVKVEGDESSSDESDYASASENL-EGSHGDQYLGLPSDDSDDGDYDPA 500

Query: 1617 XXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGE-KL 1441
                                  DL A + +  SP +D G   S S D      + G+ K 
Sbjct: 501  APDADSKVNVESSSSDFTSDSDDLPAAIVENTSPGQD-GEIRSASLDDVKCLNSYGKRKG 559

Query: 1440 KAGKRKGRSLNDELSYLTE-----SNTEAVSGKRRGERLDYKKLHDEAYRNATSDSSDED 1276
            KAGK+   S+ DELS L E       +  VSG+R  ERLDYKKL+DEAY + TS+  DED
Sbjct: 560  KAGKK--LSMADELSSLLEPDSGQEGSTPVSGRRNLERLDYKKLYDEAYHSDTSE--DED 615

Query: 1275 YTETAGAKRRKSNAGKAIRICTNKIQDRND-RTDTMDGNQNHKENDHSVEGKSHKKLKVX 1099
            +T T    R+K   G A  +  +     N   T   +G+Q   EN  +   KS       
Sbjct: 616  WTATVTPSRKKK--GNATPVSPDGNASNNSMHTPKRNGHQKKFENTKNSPAKS------- 666

Query: 1098 XXXXXXXXXXXXXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKEL 919
                                K  +  YKRLGEAVV+RL  SFKENQYP +  K+SLA+EL
Sbjct: 667  --------LDDHVKSDSRKQKSKSSAYKRLGEAVVERLHISFKENQYPDRTTKESLAQEL 718

Query: 918  GLRVQQVGKWFENARWSFRHSSRMES 841
            GL  QQV KWF+N RWSFRHSS+ME+
Sbjct: 719  GLTCQQVAKWFDNTRWSFRHSSQMET 744


>sp|P48786.1|PRH_PETCR RecName: Full=Pathogenesis-related homeodomain protein; Short=PRHP
            gi|666128|gb|AAA62237.1| homeodomain protein
            [Petroselinum crispum]
          Length = 1088

 Score =  429 bits (1103), Expect = e-117
 Identities = 265/640 (41%), Positives = 354/640 (55%), Gaps = 26/640 (4%)
 Frame = -1

Query: 2565 LPSRKQISSLLPPVSNRVLRSRSQEKPKASESKAVEVENSANEGRKSKQRKGRMKKIPVN 2386
            +P + + S  L   S+R LRSRSQEK    +   +  +  A+  +  K+RK RM++  V+
Sbjct: 429  VPEKGKDSQELSVNSSRSLRSRSQEKSIEPDVNNIVADEGADREKPRKKRKKRMEENRVD 488

Query: 2385 EFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIR 2206
            EF RIRTHLRYLLHR+KYE+N +DAYS EGWKGQSL+K+KPEKEL+RA+++IF  KLKIR
Sbjct: 489  EFCRIRTHLRYLLHRIKYEKNFLDAYSGEGWKGQSLDKIKPEKELKRAKAEIFGRKLKIR 548

Query: 2205 DLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQ 2026
            DLFQ+LD + +EGR PE LFDS G+IDSEDIFCAKCGSKD+T+ NDIILCDGAC+RGFHQ
Sbjct: 549  DLFQRLDLARSEGRLPEILFDSRGEIDSEDIFCAKCGSKDVTLSNDIILCDGACDRGFHQ 608

Query: 2025 FCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAA 1846
            FCLDPPLLKE IPPDDEGWLCPGC+CK+DC+ +LND QE+ + + D+WEKVF EEAAAAA
Sbjct: 609  FCLDPPLLKEYIPPDDEGWLCPGCECKIDCIKLLNDSQETNILLGDSWEKVFAEEAAAAA 668

Query: 1845 SGKTMDEGTXXXXXXXXXXXXXXXXXXXDHEVSRGESTSDGSDYFSASDDIVPPLDNKQI 1666
            SGK +D+ +                   D +V   +S++D SDY S SDD+       Q+
Sbjct: 669  SGKNLDDNSGLPSDDSEDDDYDPGGPDLDEKVQGDDSSTDESDYQSESDDM-------QV 721

Query: 1665 FXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPSV 1486
                                                     +   D  S ++D   F  V
Sbjct: 722  IRQKNSRGLPSDDSEDDEYDPSGLVTDQMYK---------DSSCSDFTSDSED---FTGV 769

Query: 1485 SEDSKPNAVASGEKLKAGKRKGRSLNDELSYLTESNTEAVSGKRRGERLDYKKLHD---- 1318
             +D K    A G  L +     R+  +   +  + +T  +  +R+ E LDYKKL+D    
Sbjct: 770  FDDYKDTGKAQG-PLASTPDHVRNNEEGCGHPEQGDTAPLYPRRQVESLDYKKLNDIEFS 828

Query: 1317 ----------------------EAYRNATSDSSDEDYTETAGAKRRKSNAGKAIRICTNK 1204
                                  E Y N +SDSSDEDY  T+     K+N+ K        
Sbjct: 829  KMCDILDILSSQLDVIICTGNQEEYGNTSSDSSDEDYMVTSSPD--KNNSDKEA-----T 881

Query: 1203 IQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXXXXXXXXXXXXGKDAAK 1024
              +R   +  ++ +Q  +E+ H+   +  KK  V                     K  +K
Sbjct: 882  AMERGRESGDLELDQKARESTHN--RRYIKKFAVEGTDSFLSRSCEDSAAPVAGSKSTSK 939

Query: 1023 QYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVGKWFENARWSFRHSSRME 844
                 GE   QRL++SFKENQYP++ VK+SLA EL L V+QV  WF N RWSFRHSSR+ 
Sbjct: 940  TLH--GEHATQRLLQSFKENQYPQRAVKESLAAELALSVRQVSNWFNNRRWSFRHSSRIG 997

Query: 843  SKMVAAASTNGSSLRTRVHVMSLAAKQSPVLPSPSDSGME 724
            S  VA   +N +  +  + +   + K   VL S + S +E
Sbjct: 998  SD-VAKFDSNDTPRQKSIDMSGPSLKS--VLDSATYSEIE 1034


>ref|XP_004161446.1| PREDICTED: homeobox protein HAT3.1-like [Cucumis sativus]
          Length = 749

 Score =  415 bits (1067), Expect = e-113
 Identities = 280/718 (38%), Positives = 375/718 (52%), Gaps = 31/718 (4%)
 Frame = -1

Query: 2598 ASESVPHEGTSLPSRKQISSLLPPVSN-RVLRSRSQEKPKASESKAVEVENSANEGRKSK 2422
            + +S   +   L S+K+   L   VS+ RVLRSR+QEK KA E        +A E  K K
Sbjct: 24   SQQSARKDKIFLKSKKKNYKLRSHVSSDRVLRSRTQEKAKAPERSNDLNNFTAEEDGKRK 83

Query: 2421 QRKGRM---KKIPVNEFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKEL 2251
            ++K R    K   V+E+S IR HLRYLL+R++YE++LI+AYS EGWKG S +KLKPEKEL
Sbjct: 84   KKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKEL 143

Query: 2250 QRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIEN 2071
            QRA ++I R KLKIRDLFQ++D    EGR  ESLFDSEGQIDSEDIFCAKCGSK+L++EN
Sbjct: 144  QRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLEN 203

Query: 2070 DIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVT 1891
            DIILCDG C+RGFHQFCL+PPLL  DIPPDDEGWLCPGCDCK DC+ +LN+FQ S LS+T
Sbjct: 204  DIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSIT 263

Query: 1890 DNWEKVFPEEAAAAAS-------GKTMDEGTXXXXXXXXXXXXXXXXXXXDHEVSRGEST 1732
            D WEKV+PE AAAAA        G   D+                       E S  +S 
Sbjct: 264  DGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNELSSDESSSDQSN 323

Query: 1731 SDGSD-----YFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1567
            SD S+     Y SAS+ +    ++ Q                                  
Sbjct: 324  SDPSNSDTSGYASASEGLEVSSNDDQYLGLPSDDSEDNDYDPSVPELDEGVRQESSSSDF 383

Query: 1566 XXXXXDLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLT 1387
                 DL A+  D    +KD G   S   ++ P   ++G+   +G  K  +L++ELS L 
Sbjct: 384  TSDSEDLAAL--DNNCSSKD-GDLVSSLNNTLPVKNSNGQS--SGPNKS-ALHNELSSLL 437

Query: 1386 ESNT-----EAVSGKRRGERLDYKKLHDEAYRNATSDSSDEDYTET----------AGAK 1252
            +S       E VSG+R+ ERLDYKKLHDE Y N  +DSSD+ Y  T          +G +
Sbjct: 438  DSGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTR 497

Query: 1251 RRKSNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXX 1072
            +R    G    +        ND    +   +++K       G  +    V          
Sbjct: 498  KR----GPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKS 553

Query: 1071 XXXXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVGK 892
                       K  +   +RL +  ++RL+ SF+EN+YPK+  KQSLA+ELGL ++QV K
Sbjct: 554  SSSVK------KSTSSSNRRLSQPALERLLASFQENEYPKRATKQSLAQELGLGLKQVSK 607

Query: 891  WFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAAKQSPVLPSPSDSGMENLIS 712
            WFEN RWS RH S    K         SS R  +++   + + S   P  S +   +  S
Sbjct: 608  WFENTRWSTRHPSSSGKKAK-------SSSRMSIYLSQASGELSKNEPE-SATCFRDTDS 659

Query: 711  SQVRPGNEECQITDAGEGKSVESEASGEKSTRKRKVDNQGSSAGNCMKQDQHDDTPKS 538
            +  R  +++  + ++    S +S  +G+K    RK     SSA    K+    D   S
Sbjct: 660  NGAR--HQDLPMANSVVA-SCQSGDTGDKKLSSRKTKRADSSATKSRKRKGRSDNTAS 714


>ref|XP_004140812.1| PREDICTED: uncharacterized protein LOC101204775 [Cucumis sativus]
          Length = 1061

 Score =  415 bits (1067), Expect = e-113
 Identities = 280/718 (38%), Positives = 375/718 (52%), Gaps = 31/718 (4%)
 Frame = -1

Query: 2598 ASESVPHEGTSLPSRKQISSLLPPVSN-RVLRSRSQEKPKASESKAVEVENSANEGRKSK 2422
            + +S   +   L S+K+   L   VS+ RVLRSR+QEK KA E        +A E  K K
Sbjct: 256  SQQSARKDKIFLKSKKKNYKLRSHVSSDRVLRSRTQEKAKAPERSNDLNNFTAEEDGKRK 315

Query: 2421 QRKGRM---KKIPVNEFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKEL 2251
            ++K R    K   V+E+S IR HLRYLL+R++YE++LI+AYS EGWKG S +KLKPEKEL
Sbjct: 316  KKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKEL 375

Query: 2250 QRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIEN 2071
            QRA ++I R KLKIRDLFQ++D    EGR  ESLFDSEGQIDSEDIFCAKCGSK+L++EN
Sbjct: 376  QRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLEN 435

Query: 2070 DIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVT 1891
            DIILCDG C+RGFHQFCL+PPLL  DIPPDDEGWLCPGCDCK DC+ +LN+FQ S LS+T
Sbjct: 436  DIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSIT 495

Query: 1890 DNWEKVFPEEAAAAAS-------GKTMDEGTXXXXXXXXXXXXXXXXXXXDHEVSRGEST 1732
            D WEKV+PE AAAAA        G   D+                       E S  +S 
Sbjct: 496  DGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNELSSDESSSDQSN 555

Query: 1731 SDGSD-----YFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1567
            SD S+     Y SAS+ +    ++ Q                                  
Sbjct: 556  SDPSNSDTSGYASASEGLEVSSNDDQYLGLPSDDSEDNDYDPSVPELDEGVRQESSSSDF 615

Query: 1566 XXXXXDLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLT 1387
                 DL A+  D    +KD G   S   ++ P   ++G+   +G  K  +L++ELS L 
Sbjct: 616  TSDSEDLAAL--DNNCSSKD-GDLVSSLNNTLPVKNSNGQS--SGPNKS-ALHNELSSLL 669

Query: 1386 ESNT-----EAVSGKRRGERLDYKKLHDEAYRNATSDSSDEDYTET----------AGAK 1252
            +S       E VSG+R+ ERLDYKKLHDE Y N  +DSSD+ Y  T          +G +
Sbjct: 670  DSGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTR 729

Query: 1251 RRKSNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXX 1072
            +R    G    +        ND    +   +++K       G  +    V          
Sbjct: 730  KR----GPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKS 785

Query: 1071 XXXXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVGK 892
                       K  +   +RL +  ++RL+ SF+EN+YPK+  KQSLA+ELGL ++QV K
Sbjct: 786  SSSVK------KSTSSSNRRLSQPALERLLASFQENEYPKRATKQSLAQELGLGLKQVSK 839

Query: 891  WFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAAKQSPVLPSPSDSGMENLIS 712
            WFEN RWS RH S    K         SS R  +++   + + S   P  S +   +  S
Sbjct: 840  WFENTRWSTRHPSSSGKKAK-------SSSRMSIYLSQASGELSKNEPE-SATCFRDTDS 891

Query: 711  SQVRPGNEECQITDAGEGKSVESEASGEKSTRKRKVDNQGSSAGNCMKQDQHDDTPKS 538
            +  R  +++  + ++    S +S  +G+K    RK     SSA    K+    D   S
Sbjct: 892  NGAR--HQDLPMANSVVA-SCQSGDTGDKKLSSRKTKRADSSATKSRKRKGRSDNTAS 946


Top