BLASTX nr result

ID: Akebia23_contig00016634 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00016634
         (3158 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002265036.1| PREDICTED: uncharacterized protein LOC100266...   527   e-146
ref|XP_006436656.1| hypothetical protein CICLE_v10030776mg [Citr...   496   e-137
ref|XP_002315450.2| hypothetical protein POPTR_0010s24240g [Popu...   494   e-136
emb|CBI40243.3| unnamed protein product [Vitis vinifera]              494   e-136
ref|XP_002532013.1| conserved hypothetical protein [Ricinus comm...   483   e-133
ref|XP_002311034.2| hypothetical protein POPTR_0008s02470g [Popu...   472   e-130
ref|XP_007010267.1| Enhancer of polycomb-like transcription fact...   469   e-129
ref|XP_006360530.1| PREDICTED: uncharacterized protein LOC102597...   459   e-126
ref|XP_006360531.1| PREDICTED: uncharacterized protein LOC102597...   459   e-126
ref|XP_007010268.1| Enhancer of polycomb-like transcription fact...   459   e-126
ref|XP_007221419.1| hypothetical protein PRUPE_ppa001422mg [Prun...   456   e-125
ref|XP_004243418.1| PREDICTED: uncharacterized protein LOC101263...   451   e-124
gb|EXC25392.1| hypothetical protein L484_016774 [Morus notabilis]     437   e-119
ref|XP_004166800.1| PREDICTED: uncharacterized LOC101207239 [Cuc...   417   e-113
ref|XP_004140897.1| PREDICTED: uncharacterized protein LOC101207...   417   e-113
gb|EYU39775.1| hypothetical protein MIMGU_mgv1a001436mg [Mimulus...   414   e-112
ref|XP_006398922.1| hypothetical protein EUTSA_v10012741mg [Eutr...   414   e-112
ref|NP_196087.1| Enhancer of polycomb-like transcription factor ...   409   e-111
ref|XP_007221418.1| hypothetical protein PRUPE_ppa001422mg [Prun...   408   e-111
ref|XP_007145542.1| hypothetical protein PHAVU_007G247300g [Phas...   402   e-109

>ref|XP_002265036.1| PREDICTED: uncharacterized protein LOC100266152 [Vitis vinifera]
          Length = 791

 Score =  527 bits (1357), Expect = e-146
 Identities = 321/800 (40%), Positives = 452/800 (56%), Gaps = 36/800 (4%)
 Frame = -2

Query: 2689 MPLVEMRRSTRVFVPKSVVKDT-DGARVLRSGKRFSLDCGEEKPVRGNGDEWFCIIDDTA 2513
            MP V MRR+TRVFVPK+  K    GARVLRSG+R   D GE K  R    +WF ++ ++ 
Sbjct: 1    MPSVGMRRTTRVFVPKTAAKGAAGGARVLRSGRRLWPDSGEGKLTRDA--DWFRLLHNSG 58

Query: 2512 DVPR-------CKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMH-GIVY 2357
                        K   W+EV+ ++++D  D  + ++               D    GIVY
Sbjct: 59   GGGGGAGGGGGLKENGWHEVNSKQEVDDVDAEVAVSESRNVAGKCGDDQGSDYSRWGIVY 118

Query: 2356 NRKRRRLPGNKFVSSPR-----DRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXX 2192
            +R+ +R      +S  +     D+ +GI F RKQRRKR                      
Sbjct: 119  SRRTKRSDSKSLLSPEKKRGFEDKRFGIRFSRKQRRKRMEESE----------------- 161

Query: 2191 XXXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMC 2012
                  G    ++++      +V++SS +   RFT FL SIL +++ SR+ L     F+ 
Sbjct: 162  -----EGGYVCVEMV-----TVVIDSSRSGRCRFTSFLNSILGYMRRSRVRLWGLYEFLT 211

Query: 2011 SEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMR 1832
             EP++  F+ HGV FL +     S+       GIC+IF AR+FIP+FS+DF A P  FM 
Sbjct: 212  WEPMMDAFSSHGVRFLRDPPCARSF-------GICKIFGARRFIPLFSVDFSAVPSCFMY 264

Query: 1831 LHFSVVLRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGSNSMASWS-SSV 1655
            LH S++LR   LP VL+     +     E  D+++ L CIP++    GS S+   + +S 
Sbjct: 265  LHSSMLLRFGCLPFVLVNNSMSVCSNGEEPIDSEENLLCIPSKKDHFGSKSITLENDNSG 324

Query: 1654 KKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHHD------- 1496
            K+R +   +  + F+ R+   R+  NSR++Q++R+S RS R RN S +G H         
Sbjct: 325  KRRMLQPTIGTSRFSGRNAQWRNGVNSRSIQKRRSSQRSRRVRNPSLVGIHKSNGALVSD 384

Query: 1495 ----------LFRAGYKHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIES 1346
                           Y  + R+ A+ +  +N++ELKST V +K+ +DSVCCSANIL++ES
Sbjct: 385  FITNRNKGIPFSSVVYNQELRRSARHASATNIRELKSTSVVVKEEIDSVCCSANILIVES 444

Query: 1345 DRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENG 1166
            DRCFRE GA VMLE  A  +W++  K  G  +Y +KAE  MR ++ NR THAMIW GE+G
Sbjct: 445  DRCFRENGANVMLEVSASKEWFIAVKKDGSMKYSHKAEKDMRYAS-NRHTHAMIWNGEDG 503

Query: 1165 WKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYI 986
            WKLEF NR+DW+IFKEL+  C DRN++  S + IPVPGV+EV  Y D    PF RP  YI
Sbjct: 504  WKLEFPNRQDWMIFKELYKECCDRNVEAPSVKIIPVPGVHEVTDYGDYKGDPFSRPDTYI 563

Query: 985  TMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEK 806
              ++DEV+RA+ +  A+YDMDS DEEWL+KLN+  F  +  +  ++  E FE M+DAFEK
Sbjct: 564  AFKNDEVSRAMAKTTASYDMDSEDEEWLKKLNS-EFHAENDLHGHVSEEDFELMVDAFEK 622

Query: 805  AAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQL 626
            A Y +PD+  D + A   C  L  ++ +A VY YWMKKRK+   +L+RVFQ    R +QL
Sbjct: 623  AVYCSPDDYPDANGAADLCVDLGSREAIACVYGYWMKKRKRKRGSLVRVFQGHHLRKAQL 682

Query: 625  MQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVE----PEQDAMQRVQKAKSLADISMES 458
            + KPV RKKRSF RQ+ + GRGKQQ    A A +     E  A  + Q+A+   D S + 
Sbjct: 683  IPKPVLRKKRSFSRQVGKFGRGKQQNVMQALAAQRKAIDETSAKLKAQEARVSLDRSEKL 742

Query: 457  VLLKRRRAQILMDNADLATY 398
             + KR RAQ LM+NADLATY
Sbjct: 743  AIRKRVRAQSLMENADLATY 762


>ref|XP_006436656.1| hypothetical protein CICLE_v10030776mg [Citrus clementina]
            gi|568878428|ref|XP_006492195.1| PREDICTED:
            uncharacterized protein LOC102612244 [Citrus sinensis]
            gi|557538852|gb|ESR49896.1| hypothetical protein
            CICLE_v10030776mg [Citrus clementina]
          Length = 758

 Score =  496 bits (1276), Expect = e-137
 Identities = 317/796 (39%), Positives = 437/796 (54%), Gaps = 32/796 (4%)
 Frame = -2

Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGN-GDEWFC--IIDD 2519
            MP V MRR+TRVF    VVK  DGARVLRSG+R   D G+ K  R N GD+W+   +I+ 
Sbjct: 1    MPSVGMRRTTRVF---GVVKGVDGARVLRSGRRLWPDSGDGKLRRTNYGDDWYHHPVINK 57

Query: 2518 T---ADVPRCKSIDWY-EVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMHGIVYNR 2351
                   P+CK   W   +D   D+ V   N                   D M+GIVY+R
Sbjct: 58   KNGGPGGPKCKPNGWAAHLD---DLKVYANNDEKKEVKMCKKVKEELKGADLMYGIVYSR 114

Query: 2350 KRRRLPGNKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXXXSG 2171
            KR+R  G K     + + YGI F R+QRRK+S                            
Sbjct: 115  KRKRNDGEKSKILEKKK-YGIQFSRRQRRKKSE--------------------------- 146

Query: 2170 HESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPLVRV 1991
                  ++  ++  + +ESS  S+     FL S+L  ++ + + L   A+F+ SE +  V
Sbjct: 147  -----KIVPFSVFGVGLESS--SSGFLVSFLSSVLGCMRRATVELPRLASFLLSETISGV 199

Query: 1990 FAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFSVVL 1811
            F+  G+ F        SW   +   G+C+IF   Q IPMFSLDF A P  FM +H  +++
Sbjct: 200  FSLRGIRF--------SWDPPIARTGMCRIFGTMQLIPMFSLDFSAVPSCFMYIHHCMLV 251

Query: 1810 RSLYLPDVLIRYLSGLIKKARE---ITDNKKCLPCIPTEMGFPGSNSMASWSSSVKKRKV 1640
            R +  P V          +  +   + ++K   P +                +SV K  +
Sbjct: 252  RFMRPPSVNSSASEDDSSEEEDVDYVCESKTVTPVV---------------DNSVNKVAL 296

Query: 1639 DFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFH-------HDL---- 1493
               VR++  A R+V  R S NSR +Q++R+SLR  RARN S +G          DL    
Sbjct: 297  HPSVRSSKLAARNVQYRSSLNSRAIQKRRSSLRRRRARNPSLIGSQKASGALVSDLTSCR 356

Query: 1492 ------FRAGYKHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESDRCFR 1331
                    A  K K R   Q S   ++KE+ ST+  L  ++D  CC  +ILV+ESDRC R
Sbjct: 357  KSSIPSSSAVSKSKLRSSLQHSSVLSIKEVSSTVDSLMLDLDRSCCCVSILVMESDRCCR 416

Query: 1330 EEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGWKLEF 1151
             EGA V+LE     +W+LV K  G +RY +KA+ +MRPS+ NRFTHA++W G++ WKLEF
Sbjct: 417  VEGANVILEMSHSKEWHLVVKKDGETRYSFKAQRIMRPSSFNRFTHAILWAGDDNWKLEF 476

Query: 1150 SNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYITMRDD 971
            SNR+DWL FK+L+  C DRN QV  ++ IP+PGVYEV  Y+DS  VPF RP +YI++  D
Sbjct: 477  SNRQDWLNFKDLYKECSDRNAQVSVSKVIPIPGVYEVLGYEDSNTVPFCRPDSYISVNVD 536

Query: 970  EVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKAAYHT 791
            EV+RAL ++ ANYDMDS DEEWL+K NN  F  +  +  ++  + FE ++DAFEKA + +
Sbjct: 537  EVSRALAKRTANYDMDSEDEEWLKKFNN-EFVTENELHEHVSEDTFELIVDAFEKAYFCS 595

Query: 790  PDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLMQKPV 611
            PD+ S+E  A + C  L RK+ V AVY +W +KRKQ   ALLRVFQ   P+   L+ KP 
Sbjct: 596  PDDYSNEEAAVNLCLELGRKEVVLAVYNHWKQKRKQKRAALLRVFQGRQPKKPSLIPKPA 655

Query: 610  FRKKRSFKRQMRQSGRGKQQIFFHASAVE-----PEQDAMQRVQKAKSLADISMESVLLK 446
             RK+RSFKRQ  Q GRGK  +      V       EQ+AM+RV++AK+ A  S+E  +LK
Sbjct: 656  LRKRRSFKRQASQPGRGKPPVVLLPEVVTQQDALEEQNAMRRVEEAKASAKRSLEEAVLK 715

Query: 445  RRRAQILMDNADLATY 398
            R+RAQ+LM NADLATY
Sbjct: 716  RQRAQLLMQNADLATY 731


>ref|XP_002315450.2| hypothetical protein POPTR_0010s24240g [Populus trichocarpa]
            gi|550330500|gb|EEF01621.2| hypothetical protein
            POPTR_0010s24240g [Populus trichocarpa]
          Length = 777

 Score =  494 bits (1272), Expect = e-136
 Identities = 312/802 (38%), Positives = 440/802 (54%), Gaps = 38/802 (4%)
 Frame = -2

Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGN-GDEWFCII---- 2525
            MP V +RR+TRVF    V+K  DGARVLRSG+R   + G+ K  R N GDEW+  I    
Sbjct: 1    MPSVGLRRTTRVF---GVIKGVDGARVLRSGRRLWQESGDGKLRRSNDGDEWYHTIIKND 57

Query: 2524 -------DDTADVPRCKSIDWYEVDP-ERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMH 2369
                   +  +D+   ++  W   D  ++D+ V                       +K  
Sbjct: 58   NYQTKNQNKNSDLKYKENSGWAHDDKLKKDLGVV--------IAIAAPKRIKRVKSEKKF 109

Query: 2368 GIVYNRKRRRLPGNKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXX 2189
            GIVY RKR+RL G K   S  D+ +GI F R+QRR                         
Sbjct: 110  GIVYRRKRKRLGGEKSEDS-EDKKFGIQFSRRQRRSLDD--------------------- 147

Query: 2188 XXXXSGHESSIDVIHGAILDIVVES-SCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMC 2012
                   ESS  ++    L ++VE  S +S++  +CFL S+L+++K   +SL+E A F+ 
Sbjct: 148  -------ESSESLVCTPELVVLVEDFSSSSSNGLSCFLSSVLRYIKRVNLSLSELADFLL 200

Query: 2011 SEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMR 1832
            SEP+  VFA +G+HF  +LS       D I  GIC+ F  RQ +PMFS+DF + P  F+ 
Sbjct: 201  SEPISSVFASNGLHFARDLSA------DRI--GICKFFGTRQLLPMFSVDFSSIPSCFVH 252

Query: 1831 LHFSVVLRSLYLPDVLIRYLSGLIKKAREI--TDNKKCLPCIPTEMGFPGS-NSMASWSS 1661
            +H S+ +R  +L  + +        +  ++  + +K    C   +  F     ++    +
Sbjct: 253  MHLSLFVRFKFLSPIPVNNSLDEDDEDDDVMMSGSKVDQSCTTMKTDFALKITAVPEIDN 312

Query: 1660 SVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH------ 1499
            S  K  V   VRA+  A RS   R+  NSR +Q++R+SLR  R RNS+  G H       
Sbjct: 313  SGSKAVVHPSVRASKLAGRSTQYRNGLNSRGIQKRRSSLRRGRPRNSAIAGLHKASGALV 372

Query: 1498 -DLF---RAGY-------KHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVI 1352
             DL    R G        K+K R+  + SP +N+KE+ S  V +K++M+   CSANILV 
Sbjct: 373  SDLISSRRKGIPFSSVVSKNKLRRSVRSSPAANIKEMNSAAVGVKKDMNMSSCSANILVS 432

Query: 1351 ESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGE 1172
            ESDRC+R EGA VM E     +W LV K  GL+RY + A+  MR    NRFTH +IWTG+
Sbjct: 433  ESDRCYRIEGATVMFEFTGSREWVLVVKKDGLTRYTHLAQKSMRTCASNRFTHDIIWTGD 492

Query: 1171 NGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIA 992
            + WKLEF NR+DW IFKEL+  C D N+    ++ I VPGV EV  Y++ G  PF+RP A
Sbjct: 493  DNWKLEFPNRQDWFIFKELYKECSDCNVPASVSKVISVPGVREVLGYENGGGAPFLRPYA 552

Query: 991  YITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAF 812
            YI+  +DEVARAL R  A+YDMDS DEEWL+K NN      +   +++  + FE ++DA 
Sbjct: 553  YISSENDEVARALARSTASYDMDSEDEEWLKKYNNDF----LAESDHLSEDNFELLIDAL 608

Query: 811  EKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSS 632
            EK+ Y  PD+ +DE+ A  +C    R++   AVY YWMKKRKQ    LLRVFQ    + +
Sbjct: 609  EKSYYCNPDDFTDENAAAKYCKDFGRREVAEAVYSYWMKKRKQKCSPLLRVFQGHQAKKT 668

Query: 631  QLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVEPE----QDAMQRVQKAKSLADISM 464
             ++ KPV RK+RSFKR   Q GRGKQ       A + +     +AM ++++A++    S+
Sbjct: 669  PVIPKPVLRKRRSFKRPPSQFGRGKQPSLLPVMAADQDALEGYNAMHKIEEAENSVKRSL 728

Query: 463  ESVLLKRRRAQILMDNADLATY 398
            E+ +LKRRRAQ+LM NADLATY
Sbjct: 729  EAAILKRRRAQLLMKNADLATY 750


>emb|CBI40243.3| unnamed protein product [Vitis vinifera]
          Length = 734

 Score =  494 bits (1272), Expect = e-136
 Identities = 292/730 (40%), Positives = 416/730 (56%), Gaps = 28/730 (3%)
 Frame = -2

Query: 2503 RCKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMH-GIVYNRKRRRLPGN 2327
            RC+   W+EV+ ++++D  D  + ++               D    GIVY+R+ +R    
Sbjct: 12   RCRLNGWHEVNSKQEVDDVDAEVAVSESRNVAGKCGDDQGSDYSRWGIVYSRRTKRSDSK 71

Query: 2326 KFVSSPR-----DRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXXXSGHES 2162
              +S  +     D+ +GI F RKQRRKR                            G   
Sbjct: 72   SLLSPEKKRGFEDKRFGIRFSRKQRRKRMEESE----------------------EGGYV 109

Query: 2161 SIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPLVRVFAQ 1982
             ++++      +V++SS +   RFT FL SIL +++ SR+ L     F+  EP++  F+ 
Sbjct: 110  CVEMV-----TVVIDSSRSGRCRFTSFLNSILGYMRRSRVRLWGLYEFLTWEPMMDAFSS 164

Query: 1981 HGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFSVVLRSL 1802
            HGV FL +     S+       GIC+IF AR+FIP+FS+DF A P  FM LH S++LR  
Sbjct: 165  HGVRFLRDPPCARSF-------GICKIFGARRFIPLFSVDFSAVPSCFMYLHSSMLLRFG 217

Query: 1801 YLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGSNSMASWS-SSVKKRKVDFIVR 1625
             LP VL+     +     E  D+++ L CIP++    GS S+   + +S K+R +   + 
Sbjct: 218  CLPFVLVNNSMSVCSNGEEPIDSEENLLCIPSKKDHFGSKSITLENDNSGKRRMLQPTIG 277

Query: 1624 ATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHHD----------------- 1496
             + F+ R+   R+  NSR++Q++R+S RS R RN S +G H                   
Sbjct: 278  TSRFSGRNAQWRNGVNSRSIQKRRSSQRSRRVRNPSLVGIHKSNGALVSDFITNRNKGIP 337

Query: 1495 LFRAGYKHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESDRCFREEGAK 1316
                 Y  + R+ A+ +  +N++ELKST V +K+ +DSVCCSANIL++ESDRCFRE GA 
Sbjct: 338  FSSVVYNQELRRSARHASATNIRELKSTSVVVKEEIDSVCCSANILIVESDRCFRENGAN 397

Query: 1315 VMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGWKLEFSNRRD 1136
            VMLE  A  +W++  K  G  +Y +KAE  MR ++ NR THAMIW GE+GWKLEF NR+D
Sbjct: 398  VMLEVSASKEWFIAVKKDGSMKYSHKAEKDMRYAS-NRHTHAMIWNGEDGWKLEFPNRQD 456

Query: 1135 WLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYITMRDDEVARA 956
            W+IFKEL+  C DRN++  S + IPVPGV+EV  Y D    PF RP  YI  ++DEV+RA
Sbjct: 457  WMIFKELYKECCDRNVEAPSVKIIPVPGVHEVTDYGDYKGDPFSRPDTYIAFKNDEVSRA 516

Query: 955  LVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKAAYHTPDEVS 776
            + +  A+YDMDS DEEWL+KLN+  F  +  +  ++  E FE M+DAFEKA Y +PD+  
Sbjct: 517  MAKTTASYDMDSEDEEWLKKLNS-EFHAENDLHGHVSEEDFELMVDAFEKAVYCSPDDYP 575

Query: 775  DESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLMQKPVFRKKR 596
            D + A   C  L  ++ +A VY YWMKKRK+   +L+RVFQ    R +QL+ KPV RKKR
Sbjct: 576  DANGAADLCVDLGSREAIACVYGYWMKKRKRKRGSLVRVFQGHHLRKAQLIPKPVLRKKR 635

Query: 595  SFKRQMRQSGRGKQQIFFHASAVE----PEQDAMQRVQKAKSLADISMESVLLKRRRAQI 428
            SF RQ+ + GRGKQQ    A A +     E  A  + Q+A+   D S +  + KR RAQ 
Sbjct: 636  SFSRQVGKFGRGKQQNVMQALAAQRKAIDETSAKLKAQEARVSLDRSEKLAIRKRVRAQS 695

Query: 427  LMDNADLATY 398
            LM+NADLATY
Sbjct: 696  LMENADLATY 705


>ref|XP_002532013.1| conserved hypothetical protein [Ricinus communis]
            gi|223528325|gb|EEF30368.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 781

 Score =  483 bits (1242), Expect = e-133
 Identities = 308/802 (38%), Positives = 432/802 (53%), Gaps = 38/802 (4%)
 Frame = -2

Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGN-GDEWFCII---- 2525
            MP V MRRSTRVF    VVK  DGARVLRSG+R  +  GE K  R N GDEW   +    
Sbjct: 1    MPSVGMRRSTRVF---GVVKGVDGARVLRSGRRLLIGAGENKFKRANDGDEWLHTMIKNH 57

Query: 2524 ---DDTADVPRC-KSIDWYEVDP-------ERDIDVTDFNLNLAXXXXXXXXXXXXXSRD 2378
                + + + +C K   W +          ER   V    L +              S +
Sbjct: 58   HHNHNNSPIMKCNKENGWTQTQTHVSKLKKERPSPVA---LGVGAGAGNEVAKKVNDSGN 114

Query: 2377 KMHGIVYNRKRRRLPG-NKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXX 2201
            KM GIVY+RKRRR+ G +K     R++ +GI F R+QRR+                    
Sbjct: 115  KMWGIVYSRKRRRMSGIDKLEILGRNKKFGIQFSRRQRRRVLK----------------- 157

Query: 2200 XXXXXXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAA 2021
                       ++ ++    A+L I+V+ SC+S+     FL  +L +++ + +S+ E   
Sbjct: 158  -----------DNEVESFEPALLGIIVDGSCSSSGLAASFLHLVLGYIRRTNLSIAELVP 206

Query: 2020 FMCSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFT 1841
            F+ SE +   FA  G+ FL + +   +        GIC+IF     +P+FSLDF A PF 
Sbjct: 207  FLLSESVKCAFASDGLRFLQDTTANRN--------GICKIFGGMSTVPIFSLDFSAVPFC 258

Query: 1840 FMRLHFSVVLRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGSNSMASWSS 1661
            F+ +H  +  R   L    +            I+++++   C     G   +++     +
Sbjct: 259  FLCMHLRLAFRVKCLSFEPVNNSLDEDSSQEVISESEEDHSC-----GLVRTDTFLLTDN 313

Query: 1660 SVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFH------- 1502
            S  K  +   + A+  A R    R+  NSR +Q++R++ R  RARN S +G H       
Sbjct: 314  SGGKVSLHPSLIASKLAGRHSQYRNVLNSRGIQKRRSAFRRRRARNPSGVGIHKANGALV 373

Query: 1501 HDLFRAG----------YKHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVI 1352
             DL  +            K K R+  + +P +N+KE+  T V+  + MDS  CSAN+LVI
Sbjct: 374  SDLISSRKNGIPFSTVVSKDKLRRSLRLTPAANLKEVNPTAVQTSRVMDSSSCSANLLVI 433

Query: 1351 ESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGE 1172
            ESDRC+R  GA V LE   L +W LV K  GL+R  + A+  MRP + NR TH +IWTG+
Sbjct: 434  ESDRCYRMVGATVALEISDLKEWVLVVKKDGLTRCTHLAQKSMRPCSSNRITHDVIWTGD 493

Query: 1171 NGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIA 992
            + WKLEF NR+DWLIFK+L+  C DRN+    ++ IPVPGV EV  Y+DS  +PF R  A
Sbjct: 494  DSWKLEFPNRQDWLIFKDLYKECYDRNVPAPISKAIPVPGVREVLGYEDSSSLPFSRQDA 553

Query: 991  YITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAF 812
            YI+  +DEV RAL ++ ANYDMD  DEEWL+K N+  F  +     ++  EKFE M+D  
Sbjct: 554  YISFNNDEVVRALTKRTANYDMDCEDEEWLKKFNSEFF-VESEEQEHLSEEKFELMIDTL 612

Query: 811  EKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSS 632
            E+A Y +PD+  D   A +FC  L R++ V AVY YWMKK+KQ   ALLRVFQ    + +
Sbjct: 613  ERAFYSSPDDFVDGRAAVNFCIDLGRREVVEAVYGYWMKKQKQRRSALLRVFQLHQGKKA 672

Query: 631  QLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVE----PEQDAMQRVQKAKSLADISM 464
             L+ KP  RK+RSFKRQ  Q GRGK+     A A E     EQ+AM+ ++ AK+ A  S+
Sbjct: 673  SLIPKPGLRKRRSFKRQASQFGRGKKPSLLQAMAAEHDALEEQNAMRNLEAAKASAKSSV 732

Query: 463  ESVLLKRRRAQILMDNADLATY 398
            ES +LKRRRAQ+LM+NADLA Y
Sbjct: 733  ESAILKRRRAQMLMENADLAVY 754


>ref|XP_002311034.2| hypothetical protein POPTR_0008s02470g [Populus trichocarpa]
            gi|550332250|gb|EEE88401.2| hypothetical protein
            POPTR_0008s02470g [Populus trichocarpa]
          Length = 774

 Score =  472 bits (1215), Expect = e-130
 Identities = 311/792 (39%), Positives = 429/792 (54%), Gaps = 28/792 (3%)
 Frame = -2

Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGN-GDEWF-CIIDDT 2516
            MP V +RR+TRVF   SVVK  DGARVLRSG+R   + G+ K  R + GDE +  II +T
Sbjct: 1    MPSVGLRRTTRVF---SVVKGVDGARVLRSGRRLWPESGDGKLRRSSDGDELYQTIIKNT 57

Query: 2515 ADVPRCKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMH----GIVYNRK 2348
             +  + ++ +      E +    D  L                 R K      GIVY+RK
Sbjct: 58   NNHIKNQNSNSNLKYKENNGWTHDVKLKKDRGIVIAIAAPKKIKRVKSEKEKFGIVYSRK 117

Query: 2347 RRRLPGNKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXXXSGH 2168
            R+RL G K   +P D+ +GI F R+QRR+                             G 
Sbjct: 118  RKRLGGEKS-ENPEDKKFGIQFSRRQRRRE----------------------------GS 148

Query: 2167 ESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPLVRVF 1988
            ES   ++    L  +VE   +S    +CFL S+L       +SL+E A F+ S+P+  VF
Sbjct: 149  ESQESLVCTPQLVALVEGCSSSNGWLSCFLSSVLGHAMRVSLSLSELADFLLSDPISSVF 208

Query: 1987 AQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASP--FTFMRLHFSVV 1814
            A +G+HF+ +L       +D I  GIC+ FE RQ +PMFS+DF A P  F FM L   V 
Sbjct: 209  ASNGLHFVRDLP------SDRI--GICKFFETRQLLPMFSVDFSAIPSCFAFMHLSLFVK 260

Query: 1813 LRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGSNSMASWSSSVKKRKVDF 1634
             R L L  V    + G       ++++K    C  T+  F    ++   + S   R V  
Sbjct: 261  FRCLSLIPVN-NSVDGDDDDDEIMSESKGDQSCTSTKTDFTQKITVVPKTDSYGCRVVLH 319

Query: 1633 -IVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH-------DLFRAGY 1478
              VRA+    R+   R+  NSR +Q++R+SLR  R RNSS  G H        DL  +  
Sbjct: 320  PSVRASKLTGRNTQHRNGLNSRGIQKRRSSLRRGRPRNSSIGGLHKANGALVSDLISSRK 379

Query: 1477 ----------KHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESDRCFRE 1328
                      K K R+  Q SP +++KEL    V +K+ M+   CSANIL+ E+DRC+R 
Sbjct: 380  IGIPFSSVVSKEKLRRSIQSSPAASIKELNCAAVGVKKGMNLSSCSANILITETDRCYRI 439

Query: 1327 EGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGWKLEFS 1148
            EGA VMLE     +W LV K +GL+RY + A+ +MR    NRFTH +IW G++ WKLEF 
Sbjct: 440  EGATVMLEFTDSKEWVLVVKKNGLTRYSHLAQKIMRTCVSNRFTHDIIWNGDDNWKLEFP 499

Query: 1147 NRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYITMRDDE 968
            NR+DW IFKEL+  C D N+    ++ IPVPGV  V    D G  PF RP AYI+  +DE
Sbjct: 500  NRQDWFIFKELYKECSDHNVPASVSKAIPVPGVRGVLDNGDCGSAPFSRPYAYISSNNDE 559

Query: 967  VARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKAAYHTP 788
            VARAL R  A+YDMDS DEEWL+K N       +   +++  + FE M+DA E++ +  P
Sbjct: 560  VARALSRSTASYDMDSEDEEWLKKYNKEF----LAESDHLSEDNFELMIDALERSYFCDP 615

Query: 787  DEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLMQKPVF 608
            D+ +DES A  +C    R++   AVY YWMKKRKQ    LLRVFQ    + + L+ KPV 
Sbjct: 616  DDFTDESAAAKYCKDFGRRELAKAVYGYWMKKRKQKRSPLLRVFQGHQAKKTPLIPKPVL 675

Query: 607  RKKRSFKRQMRQSGRGKQQIFFHASAVEPE--QDAMQRVQKAKSLADISMESVLLKRRRA 434
            RK+RSFKR   Q GRGKQ     A A E +    A+++V++A++    S+E+ +LKR++A
Sbjct: 676  RKRRSFKRPPSQFGRGKQPSLLQAMAAEKDALHSALRKVEEARNSVKRSVEAAMLKRQKA 735

Query: 433  QILMDNADLATY 398
            Q+LM NADLAT+
Sbjct: 736  QLLMKNADLATF 747


>ref|XP_007010267.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 1 [Theobroma cacao] gi|508727180|gb|EOY19077.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 1 [Theobroma cacao]
          Length = 767

 Score =  469 bits (1208), Expect = e-129
 Identities = 307/795 (38%), Positives = 444/795 (55%), Gaps = 31/795 (3%)
 Frame = -2

Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVR--GNGDEWFCIIDDT 2516
            MP V MRR+TRVF    +VK ++ ARVLRSG+R   D GE KP R    GDE + ++   
Sbjct: 1    MPSVGMRRTTRVF---RMVKSSEVARVLRSGRRLWPDSGEAKPKRLANEGDENYNLMKKA 57

Query: 2515 ADVPRCKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMHGIVYNRKR--R 2342
                           P+ +++     ++                R K  G   N ++  R
Sbjct: 58   ---------------PKSEVNGVAAEVS---------------GRPKRLGNEENPRKQSR 87

Query: 2341 RLPGNKF-VSSPRDRMYGISFVRKQRRKR-SSGHSINELPRKDCQXXXXXXXXXXXXSGH 2168
            ++    F  S   D+M+GI + RK++R    +GH      + +              + +
Sbjct: 88   KMKAGAFNTSGSVDKMFGIVYTRKRKRNGVQNGHLSGNSGQGNYGKKISRRQAIENRNTN 147

Query: 2167 ESSIDVIHGAILDIVVESS-CNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPLVRV 1991
            E   DV    +   VVE+  CN    F+ FL+ +L +VK + + L+E AAF+ S+P+  V
Sbjct: 148  E---DVEEPKMFSFVVENGDCNGC--FSNFLILVLGYVKRAEVRLSELAAFLMSQPISSV 202

Query: 1990 FAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFSVVL 1811
            ++ +GV+F      R          GIC+ F A+  IP+FSLDF A P  F+ +H+S VL
Sbjct: 203  YSSNGVNFFWGPRNRT---------GICKFFGAKDSIPLFSLDFSAVPRYFLYMHYSKVL 253

Query: 1810 RSLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGS---NSMASWSSSVKKRKV 1640
            R       L R     +     ++D+++  PC+ + +    S   N+     +   K  +
Sbjct: 254  R-------LKRIQIVPVNSDEIVSDSEEDEPCVTSVVDVCKSTSGNAAVEIDNLGSKVVL 306

Query: 1639 DFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH-------DLF--- 1490
               VRA+    R+   R+  +SR++Q++R+SLR  RARN S +G H        DL    
Sbjct: 307  HPSVRASKLTGRNAQCRNGLSSRSIQKRRSSLRRRRARNPSIVGIHKANGALMSDLISSR 366

Query: 1489 RAGY-------KHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESDRCFR 1331
            R G        K+K R   + S  +NV ++ S++ +L QN+DS  CSANILVIE+DRC+R
Sbjct: 367  RNGIPFSSVVSKNKLRSSVRNSSVANVSDVGSSISDLMQNVDSSQCSANILVIEADRCYR 426

Query: 1330 EEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGWKLEF 1151
            EEGA V LE  A  +W LV K    +++  KA+  MRPS+ NRFTHA+IWTG++ WKLEF
Sbjct: 427  EEGAIVTLELSASREWLLVVKKGSSTKFACKADKFMRPSSCNRFTHAIIWTGDDNWKLEF 486

Query: 1150 SNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYITMRDD 971
             NR+DW+IFK+L+  C +RN+   + + IPVPGV+EVP Y+D   VPF RP  YI++  D
Sbjct: 487  PNRQDWIIFKDLYKECSERNVPASTVKAIPVPGVHEVPGYEDRRSVPFCRPDFYISLDGD 546

Query: 970  EVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKAAYHT 791
            EV+RAL ++ ANYDMDS DEEWL+K NN  F G+ G   ++  + FE M+DAFEKA + +
Sbjct: 547  EVSRALAKRTANYDMDSEDEEWLKKFNNEFFSGN-GHCEHLSEDCFELMVDAFEKAYFCS 605

Query: 790  PDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLMQKPV 611
            PD+ S+E+ A   C  L  +  V AV+ YW++KRKQ   ALLRVFQ    + + L+ KP 
Sbjct: 606  PDDYSNENAAAHLCLDLGTRGLVEAVHTYWLRKRKQRRSALLRVFQGHQVKKAPLVPKPF 665

Query: 610  FRKKRSFKRQMRQSGRGKQQIFFHASAVE----PEQDAMQRVQKAKSLADISMESVLLKR 443
             RK+RSFKRQ    GRGKQ     A A E     EQ+AM ++++A+  A  S+E  +LKR
Sbjct: 666  LRKRRSFKRQ-ASHGRGKQPYLLQALAAERDSMAEQNAMLKLEEARVSASRSVELAVLKR 724

Query: 442  RRAQILMDNADLATY 398
            +R Q+LM+NADLATY
Sbjct: 725  QRTQLLMENADLATY 739


>ref|XP_006360530.1| PREDICTED: uncharacterized protein LOC102597035 isoform X1 [Solanum
            tuberosum]
          Length = 781

 Score =  459 bits (1182), Expect = e-126
 Identities = 299/822 (36%), Positives = 433/822 (52%), Gaps = 37/822 (4%)
 Frame = -2

Query: 2674 MRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGNGDEWFCIIDDT-----AD 2510
            MRR+TR+F          G RVLRSG+R S   GE K  + +GDEW  ++D+      AD
Sbjct: 7    MRRTTRIF----------GTRVLRSGRRLSTP-GEAKRAK-HGDEWIGLLDNVGGGGAAD 54

Query: 2509 VPRCKSIDWYEVD-------PERDIDVTDFNLNLAXXXXXXXXXXXXXSR--DKMHGIVY 2357
              RCK   W + +        E DIDV   +++               +   D+M G+VY
Sbjct: 55   ATRCKKNGWLKKEVALNLEADEMDIDVDSKSMDELESPEAPVVETISPNSNIDRMWGLVY 114

Query: 2356 NRKRRRLPGNKFVSSPRD-RMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXX 2180
             RKR+R+  +       D R YG  FVRK++ + +    +                    
Sbjct: 115  TRKRKRVADSVKGKVLTDVRRYGKQFVRKKKVRSAYAKDL-------------------- 154

Query: 2179 XSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPL 2000
              G      V  G    ++V +S  S +  +C L  IL +++ S +SL +   F+ S+PL
Sbjct: 155  --GKSEDGQVSSGI---VIVNTSYGSGYWVSCLLNCILMYLRRSTVSLQQIFGFINSKPL 209

Query: 1999 VRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFS 1820
              V +  G+    + + R       I  G C I   R  +P+F+LDF   P  F+ LH S
Sbjct: 210  RDVNSLQGILLFKDQTPR------KIKTGACVISGVRCSVPVFTLDFSTVPCFFLYLHSS 263

Query: 1819 VVLRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCI-PTEMGFPGSNSMASWS----SSV 1655
            ++LR + +   L+   +  I +   +T++K+ + C+ P        N+ +        + 
Sbjct: 264  LLLRFVPMSYALVMQPTVAIDEV-TVTNDKEIVSCLSPVTQSELDVNTQSGLDVVAPGAY 322

Query: 1654 KKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSS-------------S 1514
              +K++ +       + +       NSRN+Q++R+SLRS R R+SS              
Sbjct: 323  DSKKIEVVNPTVGLPKLAARHLQPRNSRNIQKRRSSLRSMRGRHSSFGTQNATGVLTSDR 382

Query: 1513 MGFHHDLFRAGYK---HKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESD 1343
            + F  D  R   +   ++ R   QK+   +VKELKS LV L QN++S  CSAN+LVIE D
Sbjct: 383  LRFRRDGLRFSSRTPHYELRSSRQKTSTPSVKELKSALVGLTQNIESTSCSANVLVIEPD 442

Query: 1342 RCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGW 1163
            +C+REEGA + +E  A   W L  K  G+ R+    E VMRP + NR TH +IW G+NGW
Sbjct: 443  KCYREEGAVIGMELSAAKQWILAVKIGGVRRFNLTTEKVMRPCSSNRVTHDIIWVGDNGW 502

Query: 1162 KLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYIT 983
            KLEF  R+DWLIFKEL+  C DRN+Q  +   IPVPGV EV  Y +S    F RP++YIT
Sbjct: 503  KLEFPIRQDWLIFKELYKGCSDRNVQPPAVSIIPVPGVREVSGYAESNPPEFARPVSYIT 562

Query: 982  MRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKA 803
            ++DDE+ARAL R  ANYDMD  DEEWL   N    D      +++  + FE ++D FEK 
Sbjct: 563  VKDDELARALARSTANYDMDGDDEEWLRNFN----DQPSLENDHLSADSFELLIDNFEKG 618

Query: 802  AYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLM 623
             Y  PD+ SDE  A S C + E+K+ V AVY YW+KKRKQN  +L+++FQC  PR +Q++
Sbjct: 619  FYCNPDDYSDEKAAVSSCPNKEKKEIVEAVYNYWLKKRKQNRSSLIKIFQCYQPRRTQVI 678

Query: 622  QKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVEPE-QDAMQRVQKAKSLADISMESVLLK 446
             K +FRKKRSFKRQ  ++GRGK + F  A   E E Q+A+ +V++AK+ A+ S +  +  
Sbjct: 679  PKSIFRKKRSFKRQGSKAGRGKHRPFLPAVVAEKEQQNAVLKVKEAKAAANKSEDLAVRM 738

Query: 445  RRRAQILMDNADLATYXXXXXXXXXXXXXXXESPEAYVPFIL 320
            R++AQ LM+NADLATY               +S EA  P  L
Sbjct: 739  RQKAQQLMENADLATYKAMMALKIAEAAKIAKSKEAVGPIFL 780


>ref|XP_006360531.1| PREDICTED: uncharacterized protein LOC102597035 isoform X2 [Solanum
            tuberosum]
          Length = 779

 Score =  459 bits (1180), Expect = e-126
 Identities = 300/822 (36%), Positives = 434/822 (52%), Gaps = 37/822 (4%)
 Frame = -2

Query: 2674 MRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGNGDEWFCIIDDT-----AD 2510
            MRR+TR+F          G RVLRSG+R S   GE K  + +GDEW  ++D+      AD
Sbjct: 7    MRRTTRIF----------GTRVLRSGRRLSTP-GEAKRAK-HGDEWIGLLDNVGGGGAAD 54

Query: 2509 VPRCKSIDWYEVD-------PERDIDVTDFNLNLAXXXXXXXXXXXXXSR--DKMHGIVY 2357
              RCK   W + +        E DIDV   +++               +   D+M G+VY
Sbjct: 55   ATRCKKNGWLKKEVALNLEADEMDIDVDSKSMDELESPEAPVVETISPNSNIDRMWGLVY 114

Query: 2356 NRKRRRLPGNKFVSSPRD-RMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXX 2180
             RKR+R+  +       D R YG  FVRK++ + +    +                    
Sbjct: 115  TRKRKRVADSVKGKVLTDVRRYGKQFVRKKKVRSAYAKDL-------------------- 154

Query: 2179 XSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPL 2000
              G      V  G    ++V +S  S +  +C L  IL +++ S +SL +   F+ S+PL
Sbjct: 155  --GKSEDGQVSSGI---VIVNTSYGSGYWVSCLLNCILMYLRRSTVSLQQIFGFINSKPL 209

Query: 1999 VRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFS 1820
              V +  G+     L ++   K   I  G C I   R  +P+F+LDF   P  F+ LH S
Sbjct: 210  RDVNSLQGI-----LLFKTPRK---IKTGACVISGVRCSVPVFTLDFSTVPCFFLYLHSS 261

Query: 1819 VVLRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCI-PTEMGFPGSNSMASWS----SSV 1655
            ++LR + +   L+   +  I +   +T++K+ + C+ P        N+ +        + 
Sbjct: 262  LLLRFVPMSYALVMQPTVAIDEV-TVTNDKEIVSCLSPVTQSELDVNTQSGLDVVAPGAY 320

Query: 1654 KKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSS-------------S 1514
              +K++ +       + +       NSRN+Q++R+SLRS R R+SS              
Sbjct: 321  DSKKIEVVNPTVGLPKLAARHLQPRNSRNIQKRRSSLRSMRGRHSSFGTQNATGVLTSDR 380

Query: 1513 MGFHHDLFRAGYK---HKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESD 1343
            + F  D  R   +   ++ R   QK+   +VKELKS LV L QN++S  CSAN+LVIE D
Sbjct: 381  LRFRRDGLRFSSRTPHYELRSSRQKTSTPSVKELKSALVGLTQNIESTSCSANVLVIEPD 440

Query: 1342 RCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGW 1163
            +C+REEGA + +E  A   W L  K  G+ R+    E VMRP + NR TH +IW G+NGW
Sbjct: 441  KCYREEGAVIGMELSAAKQWILAVKIGGVRRFNLTTEKVMRPCSSNRVTHDIIWVGDNGW 500

Query: 1162 KLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYIT 983
            KLEF  R+DWLIFKEL+  C DRN+Q  +   IPVPGV EV  Y +S    F RP++YIT
Sbjct: 501  KLEFPIRQDWLIFKELYKGCSDRNVQPPAVSIIPVPGVREVSGYAESNPPEFARPVSYIT 560

Query: 982  MRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKA 803
            ++DDE+ARAL R  ANYDMD  DEEWL   N    D      +++  + FE ++D FEK 
Sbjct: 561  VKDDELARALARSTANYDMDGDDEEWLRNFN----DQPSLENDHLSADSFELLIDNFEKG 616

Query: 802  AYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLM 623
             Y  PD+ SDE  A S C + E+K+ V AVY YW+KKRKQN  +L+++FQC  PR +Q++
Sbjct: 617  FYCNPDDYSDEKAAVSSCPNKEKKEIVEAVYNYWLKKRKQNRSSLIKIFQCYQPRRTQVI 676

Query: 622  QKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVEPE-QDAMQRVQKAKSLADISMESVLLK 446
             K +FRKKRSFKRQ  ++GRGK + F  A   E E Q+A+ +V++AK+ A+ S +  +  
Sbjct: 677  PKSIFRKKRSFKRQGSKAGRGKHRPFLPAVVAEKEQQNAVLKVKEAKAAANKSEDLAVRM 736

Query: 445  RRRAQILMDNADLATYXXXXXXXXXXXXXXXESPEAYVPFIL 320
            R++AQ LM+NADLATY               +S EA  P  L
Sbjct: 737  RQKAQQLMENADLATYKAMMALKIAEAAKIAKSKEAVGPIFL 778


>ref|XP_007010268.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 2 [Theobroma cacao] gi|508727181|gb|EOY19078.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 2 [Theobroma cacao]
          Length = 784

 Score =  459 bits (1180), Expect = e-126
 Identities = 307/812 (37%), Positives = 444/812 (54%), Gaps = 48/812 (5%)
 Frame = -2

Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVR--GNGDEWFCIIDDT 2516
            MP V MRR+TRVF    +VK ++ ARVLRSG+R   D GE KP R    GDE + ++   
Sbjct: 1    MPSVGMRRTTRVF---RMVKSSEVARVLRSGRRLWPDSGEAKPKRLANEGDENYNLMKKA 57

Query: 2515 ADVPRCKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMHGIVYNRKR--R 2342
                           P+ +++     ++                R K  G   N ++  R
Sbjct: 58   ---------------PKSEVNGVAAEVS---------------GRPKRLGNEENPRKQSR 87

Query: 2341 RLPGNKF-VSSPRDRMYGISFVRKQRRKR-SSGHSINELPRKDCQXXXXXXXXXXXXSGH 2168
            ++    F  S   D+M+GI + RK++R    +GH      + +              + +
Sbjct: 88   KMKAGAFNTSGSVDKMFGIVYTRKRKRNGVQNGHLSGNSGQGNYGKKISRRQAIENRNTN 147

Query: 2167 ESSIDVIHGAILDIVVESS-CNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPLVRV 1991
            E   DV    +   VVE+  CN    F+ FL+ +L +VK + + L+E AAF+ S+P+  V
Sbjct: 148  E---DVEEPKMFSFVVENGDCNGC--FSNFLILVLGYVKRAEVRLSELAAFLMSQPISSV 202

Query: 1990 FAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFSVVL 1811
            ++ +GV+F      R          GIC+ F A+  IP+FSLDF A P  F+ +H+S VL
Sbjct: 203  YSSNGVNFFWGPRNRT---------GICKFFGAKDSIPLFSLDFSAVPRYFLYMHYSKVL 253

Query: 1810 RSLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGS---NSMASWSSSVKKRKV 1640
            R       L R     +     ++D+++  PC+ + +    S   N+     +   K  +
Sbjct: 254  R-------LKRIQIVPVNSDEIVSDSEEDEPCVTSVVDVCKSTSGNAAVEIDNLGSKVVL 306

Query: 1639 DFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH-------DLF--- 1490
               VRA+    R+   R+  +SR++Q++R+SLR  RARN S +G H        DL    
Sbjct: 307  HPSVRASKLTGRNAQCRNGLSSRSIQKRRSSLRRRRARNPSIVGIHKANGALMSDLISSR 366

Query: 1489 RAGY-------KHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESDRCFR 1331
            R G        K+K R   + S  +NV ++ S++ +L QN+DS  CSANILVIE+DRC+R
Sbjct: 367  RNGIPFSSVVSKNKLRSSVRNSSVANVSDVGSSISDLMQNVDSSQCSANILVIEADRCYR 426

Query: 1330 EEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGWKLEF 1151
            EEGA V LE  A  +W LV K    +++  KA+  MRPS+ NRFTHA+IWTG++ WKLEF
Sbjct: 427  EEGAIVTLELSASREWLLVVKKGSSTKFACKADKFMRPSSCNRFTHAIIWTGDDNWKLEF 486

Query: 1150 SNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYITMRDD 971
             NR+DW+IFK+L+  C +RN+   + + IPVPGV+EVP Y+D   VPF RP  YI++  D
Sbjct: 487  PNRQDWIIFKDLYKECSERNVPASTVKAIPVPGVHEVPGYEDRRSVPFCRPDFYISLDGD 546

Query: 970  EVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKAAYHT 791
            EV+RAL ++ ANYDMDS DEEWL+K NN  F G+ G   ++  + FE M+DAFEKA + +
Sbjct: 547  EVSRALAKRTANYDMDSEDEEWLKKFNNEFFSGN-GHCEHLSEDCFELMVDAFEKAYFCS 605

Query: 790  PDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLMQKPV 611
            PD+ S+E+ A   C  L  +  V AV+ YW++KRKQ   ALLRVFQ    + + L+ KP 
Sbjct: 606  PDDYSNENAAAHLCLDLGTRGLVEAVHTYWLRKRKQRRSALLRVFQGHQVKKAPLVPKPF 665

Query: 610  FRKKRSFKRQMRQSGRGKQQIFFH-----------------ASAVE----PEQDAMQRVQ 494
             RK+RSFKRQ    GRGKQ                      A A E     EQ+AM +++
Sbjct: 666  LRKRRSFKRQ-ASHGRGKQPYLLQGPRFRYNAETSIICNCAALAAERDSMAEQNAMLKLE 724

Query: 493  KAKSLADISMESVLLKRRRAQILMDNADLATY 398
            +A+  A  S+E  +LKR+R Q+LM+NADLATY
Sbjct: 725  EARVSASRSVELAVLKRQRTQLLMENADLATY 756


>ref|XP_007221419.1| hypothetical protein PRUPE_ppa001422mg [Prunus persica]
            gi|462418131|gb|EMJ22618.1| hypothetical protein
            PRUPE_ppa001422mg [Prunus persica]
          Length = 832

 Score =  456 bits (1173), Expect = e-125
 Identities = 314/835 (37%), Positives = 433/835 (51%), Gaps = 42/835 (5%)
 Frame = -2

Query: 2695 TLMPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRG-NGDE-WFCIID 2522
            T MP VEMRR+TRVF    V    DGARVLRSG+R   +  E K  R  NGDE W  ++ 
Sbjct: 52   TEMPSVEMRRTTRVFGMGMVKGGVDGARVLRSGRRLWPESSESKLERARNGDEDWLKLMK 111

Query: 2521 DTA--DVPRCKSIDWYEVD----PERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMHGIV 2360
              A   V       W   +    P R+  V     +L                 K +GIV
Sbjct: 112  SHAGESVVGLNHKKWAGANQVGSPRRNTPV--LKTSLVKKPQSNELLADLLKEHKRYGIV 169

Query: 2359 YNRKRRRLPGNKFVSSPR-----DRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXX 2195
            Y RKR+R   +   +  +     DRMYG  F R+QR K+S      EL            
Sbjct: 170  YTRKRKRASASFLGNVEKENGSDDRMYGRRFARRQRMKKSK-----EL------------ 212

Query: 2194 XXXXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFM 2015
                     +S    +   +L   VESS    +    FL S+L ++  + + LTEF+ F+
Sbjct: 213  ---------DSHPGFVCPEVLCFSVESSWAQGYWAGRFLYSVLVYMTRASLGLTEFSEFL 263

Query: 2014 CSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFM 1835
              EP+  +FA +G+ F  + S            G+C++F A QFIP+FS+DF A P  FM
Sbjct: 264  ALEPIGSIFASYGIQFSRDRSCTRR-------SGVCKLFGAEQFIPLFSVDFSAVPGCFM 316

Query: 1834 RLHFSVVLR---SLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGSNSMASWS 1664
             +  S+ LR    L + +++  + +G      +  D+ + +  I         N  A  S
Sbjct: 317  FMQTSMHLRFRCHLTVNNLIDGHENGEFIDQGDDDDDGEKVDFI--------ENRHALHS 368

Query: 1663 SSVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH----- 1499
            S          VR    A RS   R+   SR +Q++R+SLR  R+RN S +         
Sbjct: 369  S----------VRVPKLACRSTQYRNGLTSRGIQKRRSSLRRRRSRNPSLVSLRKPNGAL 418

Query: 1498 -----DLFRAGY-------KHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILV 1355
                  + + G        KH  RK    S   N+K    T+   K+++DS  CSANIL 
Sbjct: 419  VSELISIRKNGLPFSSVESKHMLRKSVSLSLAGNLKAESLTIEGSKRDLDSTSCSANILF 478

Query: 1354 IESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWT- 1178
             E D+C+RE+GA VMLE  +  +W LV K +GL+RY +KAE VMRP + NR T A+IW+ 
Sbjct: 479  TELDKCYREDGATVMLEMSSSGEWLLVVKKNGLTRYTHKAEKVMRPCSKNRITQAIIWSA 538

Query: 1177 ---GENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPF 1007
               G+N WKLEF NR DW IFK+L+  C DR +   + + IPVPGV EVP Y DS    F
Sbjct: 539  DSNGDNNWKLEFPNRCDWAIFKDLYKECSDRVVPAPAIKFIPVPGVREVPGYADSHSTLF 598

Query: 1006 VRPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEE 827
             RP +YI + DDEV+RA+ ++ ANYDMDS DEEWL+K N+  F  +  + +++  + FE 
Sbjct: 599  DRPESYIYLNDDEVSRAMAKRTANYDMDSDDEEWLKKFNSDFF-AENELHDHVSEDNFEL 657

Query: 826  MMDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNH-VALLRVFQC 650
            M+DAFEKA Y  P + +DE+ A + C  + R++ V A+Y YWM KRKQ    +LLRVFQ 
Sbjct: 658  MVDAFEKAFYCRPYDFADENAAANLCLDMGRREVVEAIYSYWMNKRKQKRSSSLLRVFQG 717

Query: 649  PPPRSSQLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVE----PEQDAMQRVQKAKS 482
               + + L  KPV RK+RSFKRQ  Q GRGKQ  F  A A E     EQ+A+ +V++AK+
Sbjct: 718  HQSKRALLDPKPVLRKRRSFKRQPSQFGRGKQPSFLQAMAAEQDALQEQNAIHKVEEAKA 777

Query: 481  LADISMESVLLKRRRAQILMDNADLATYXXXXXXXXXXXXXXXESPEAYVPFILD 317
             AD S+E  + KR+RAQ+LM NADL TY                SP+A   ++LD
Sbjct: 778  EADRSVELAIRKRKRAQLLMQNADLVTYKATMAFRIAEAAQVLGSPDAAAAYVLD 832


>ref|XP_004243418.1| PREDICTED: uncharacterized protein LOC101263728 [Solanum
            lycopersicum]
          Length = 790

 Score =  451 bits (1161), Expect = e-124
 Identities = 292/832 (35%), Positives = 438/832 (52%), Gaps = 47/832 (5%)
 Frame = -2

Query: 2674 MRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGNGDEWFCIIDDT------- 2516
            MRR+TR+F          G RVLRSG+R S    E K  + +GDEW  ++D+        
Sbjct: 7    MRRTTRIF----------GTRVLRSGRRLSTSF-EAKRAK-HGDEWIGLLDNVGGGGGAA 54

Query: 2515 ADVPRCKSIDWYEVDPERDIDVTDFNLNL---------AXXXXXXXXXXXXXSRDKMHGI 2363
            AD  RCK   W + +   +++  + N+++                         D+M G+
Sbjct: 55   ADATRCKKKGWLKKEVALNLEADEMNIDVDSKSMDEQETVEAPVVDTVSPKSYIDRMWGL 114

Query: 2362 VYNRKRRRLPGNKFVSSPRDRM------YGISFVRKQRRKRSSGHSINELPRKDCQXXXX 2201
            VY RKR+R+   +   S R ++      YG  F+RK  +K  S ++ +    +D Q    
Sbjct: 115  VYTRKRKRVDLKRH-DSVRGKVLTDVMRYGKQFIRK--KKHRSAYAKDSDKSEDGQF--- 168

Query: 2200 XXXXXXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAA 2021
                         S D++       +V +S  S +  +C L  +L +++ S +SL +   
Sbjct: 169  -------------SSDIV-------IVNTSYGSGYWVSCLLNCMLMYLRRSTVSLQQIFG 208

Query: 2020 FMCSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFT 1841
            F+ S+PL  V++  G+  L + + R       I  G C I   R  +P+F+LDF   P  
Sbjct: 209  FINSKPLRDVWSLQGILLLKDQTSR------KIKTGACVISGVRCSVPVFTLDFSTVPCF 262

Query: 1840 FMRLHFSVVLRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCI-PTEMGFPGSNSMASWS 1664
            F+ LH S++LR + +   L+   +  I +   +T++ + + C+ P  +     N+ +   
Sbjct: 263  FLYLHSSLLLRFVPMSYALVMQPTVAIDEVT-VTNDMELVSCLTPVTLSELDVNTQSGHD 321

Query: 1663 ----SSVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSS------- 1517
                 +   +K++ +       + +       NSRN+Q++R+SLRS R R+SS       
Sbjct: 322  VVAPGAYDSKKIEVVNTTVGLPKSTARHLQPRNSRNIQKRRSSLRSMRGRHSSFGTQNAS 381

Query: 1516 ------SMGFHHDLFRAGYK---HKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSAN 1364
                   + F  D  R   +   ++ R   QK+   +VKELKS LV L QN+++  CSAN
Sbjct: 382  GVLTSDRLRFRRDGLRFSSRTPHYELRSSRQKTSMPSVKELKSALVRLTQNIETASCSAN 441

Query: 1363 ILVIESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMI 1184
            ILV E D+C+REEGA + +E  A   W L  K  G+ R+    E VMRP + NR TH +I
Sbjct: 442  ILVTEPDKCYREEGAVIGMELSAAKQWILAVKIGGVRRFNLTTEKVMRPCSSNRVTHDLI 501

Query: 1183 WTGENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFV 1004
            W G++GWKLEF +R+DWLIFKEL+  C DRN+Q  +   IPVPGV EV  Y +S    F 
Sbjct: 502  WVGDSGWKLEFPDRQDWLIFKELYKGCSDRNVQPPAVSIIPVPGVSEVSGYAESNPPFFA 561

Query: 1003 RPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEM 824
            RP++YIT++DDE+ARAL R  ANYDMD  DEEWL   N    D      +++  + FE +
Sbjct: 562  RPVSYITVKDDELARALARSTANYDMDGDDEEWLRNFN----DQPSLENDHLSTDSFELL 617

Query: 823  MDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPP 644
            +D FEK  Y  PD+ SDE  A S C + E+K+ V AVY YW KKRKQN  +L+++FQC  
Sbjct: 618  IDHFEKGFYCNPDDYSDEKAAVSSCPNKEKKEIVEAVYSYWSKKRKQNRSSLIKIFQCYQ 677

Query: 643  PRSSQLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVE----PEQDAMQRVQKAKSLA 476
            PR +Q++ K +FRKKRSFKRQ  ++GRGK + F  A   E     +Q+A+ +V++AK+ A
Sbjct: 678  PRRTQVIPKSIFRKKRSFKRQGSKAGRGKHRPFLPAVVAENDAVQQQNAVLKVKEAKAAA 737

Query: 475  DISMESVLLKRRRAQILMDNADLATYXXXXXXXXXXXXXXXESPEAYVPFIL 320
            + S +  +  R++AQ LM+NADLATY               +S EA  P  L
Sbjct: 738  NKSEDLAVRMRQKAQQLMENADLATYKAMMALRIAEAAKIAKSKEAVAPIFL 789


>gb|EXC25392.1| hypothetical protein L484_016774 [Morus notabilis]
          Length = 795

 Score =  437 bits (1123), Expect = e-119
 Identities = 302/819 (36%), Positives = 435/819 (53%), Gaps = 55/819 (6%)
 Frame = -2

Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGNGD--EWFCI---- 2528
            MP V MRR+TRVF    VVK  DGARVLRSG+R   D GE K +R + D  +WF I    
Sbjct: 1    MPSVGMRRTTRVF---GVVKGVDGARVLRSGRRLWPDSGEVK-LRRHSDVYDWFKIGKGD 56

Query: 2527 ----------IDDTADVPRCKSIDWYEVD-PERDIDVTDFNLNLAXXXXXXXXXXXXXSR 2381
                        +T   P+ K+    E+  P+ + +     ++LA               
Sbjct: 57   GGLGYDSNGWAHNTNSKPK-KTPPVAEIKAPKPEDNNRGVGVDLAHGGRRP--------- 106

Query: 2380 DKMHGIVYNRKRRRLP----GNKFVSSPR-----DRMYGISFVRKQRRKRSSGHSINELP 2228
            D+M G+VY+RKR+ L     GN  V+S        + YG  FVR+QRRK +SG S     
Sbjct: 107  DRMFGLVYSRKRKNLAVRSSGNASVNSETLGGSVGKRYGRRFVRRQRRKLNSGESFAVAD 166

Query: 2227 RKDCQXXXXXXXXXXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSS 2048
              D                  S ++     ++ +V  SS +        L SIL ++  +
Sbjct: 167  DSD------------------SRLEFTPSEVVSVVFGSSMDRNFYAVGVLCSILVYLTRA 208

Query: 2047 RISLTEFAAFMCSEPLVRVFAQHGVH-FLANLSYRISWKNDLISPGICQIFEARQFIPMF 1871
            R+ LT+  AF+ SEP+ RV +  G++ FL + S +            C++F A +F+P+F
Sbjct: 209  RLRLTDLFAFLVSEPISRVNSSCGINIFLDHPSIKRF--------ASCKLFGAPEFVPLF 260

Query: 1870 SLDFCASPFTFMRLHFSVVLRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFP 1691
             +DF A P  FM +H  +  R    P      L+G  +    I+D+++       ++  P
Sbjct: 261  CVDFSAIPLCFMHMHSCMFFRYKRQPS-----LAGNNEIDEMISDDEE------DQLSSP 309

Query: 1690 GSNSM-------ASWSSSVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTR 1532
            G +++       A  + S  +   +   +A+ FA RS   R+   SR +Q++R+SLR  +
Sbjct: 310  GKDALESKPLLSAEANHSENRLASNPSFKASKFACRSNQYRNGLISRGIQKRRSSLRRRK 369

Query: 1531 ARNSSSMGFHH-------DL--FRAGY-------KHKKRKLAQKSPCSNVKELKSTLVEL 1400
            ARN S  G          DL  FR           +K R+  + +    +KE+ ST+ + 
Sbjct: 370  ARNPSLCGVQKPNNALLSDLVSFRKNSVSLSLTSNNKLRRSLRSNSARKLKEVSSTVADS 429

Query: 1399 KQNMDSVCCSANILVIESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMR 1220
             Q+MDS  C AN+L+IE ++C+RE G  ++LE   L  W +  K  G +++ +KAE VMR
Sbjct: 430  TQDMDSTSCCANVLIIEPEKCYREGGFSIVLESSPLGGWLIAVKKDGSTKFTHKAEKVMR 489

Query: 1219 PSTPNRFTHAMIWTGENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEV 1040
            P + NRFTH ++WT ++GWKLEF NR+DWLIFK+L+  C DRN+     + +P+PGV EV
Sbjct: 490  PCSSNRFTHDIMWTADDGWKLEFPNRKDWLIFKDLYQECSDRNMLAPGVKVVPIPGVNEV 549

Query: 1039 PCYDDSGYVPFVRPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGV 860
                DS    F RP +YI+++DDE+ RAL RK +NYDMD  DEEWL KLNN  F  +   
Sbjct: 550  SQKGDSHCTLFRRPDSYISVKDDELCRALKRKTSNYDMDLEDEEWLNKLNN-EFSVENET 608

Query: 859  PNYILPEKFEEMMDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYW-MKKRKQ 683
               +  +KFE M+DAFEKA + +P + SD    T  CS L     + A+Y YW MKKRKQ
Sbjct: 609  YECVSDDKFESMIDAFEKAFFCSPYDNSDVKSLTDLCSHLGGDKAIEAIYVYWTMKKRKQ 668

Query: 682  NHVALLRVFQCPPPRSSQLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVE----PEQ 515
               +L+R+FQ    R + L+ KP  RKKRSF RQ  Q GRGKQ  F  A   E     EQ
Sbjct: 669  KRPSLIRIFQLYQGRRT-LVPKPAIRKKRSFNRQPSQVGRGKQSSFLQAMVAERDAAEEQ 727

Query: 514  DAMQRVQKAKSLADISMESVLLKRRRAQILMDNADLATY 398
            +AM RV++AK+ A+  +E  +  R+RAQ+LM+NADLATY
Sbjct: 728  NAMHRVEEAKASANRCVELAVESRQRAQLLMNNADLATY 766


>ref|XP_004166800.1| PREDICTED: uncharacterized LOC101207239 [Cucumis sativus]
          Length = 819

 Score =  417 bits (1073), Expect = e-113
 Identities = 293/833 (35%), Positives = 422/833 (50%), Gaps = 56/833 (6%)
 Frame = -2

Query: 2668 RSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRG-NGDEWFCIIDDTADVPRC-- 2498
            R TRVF    +VK +DGARVLRSG+R   + GE K  +  +  +W+ IID   +      
Sbjct: 6    RRTRVF---GLVKGSDGARVLRSGRRLWPESGEVKLKKSKDASDWYPIIDGRGNGGGSGH 62

Query: 2497 -----KSIDWYEVDPERDIDVT---DFNLNLAXXXXXXXXXXXXXSRDKMHGI------V 2360
                 K      V P+R + V    D +  +              + DK  G+      V
Sbjct: 63   GRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPVKVFPRIGNDDKSSGVDRMFGKV 122

Query: 2359 YNRKRRR-----------LPGNKFVSSPRDRMYGISFVRKQR-RKRSSGHSINELPRKDC 2216
            Y+RKR+R           +  +  +S   DRM+G+ F+R+QR RK    H  +    +  
Sbjct: 123  YSRKRKRGRLEDGEVFDEMESDNVLSG--DRMFGLRFIRRQRSRKTDVEHWESTAGGRTS 180

Query: 2215 QXXXXXXXXXXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISL 2036
                           H   I       L I   SS +    F+ F++++L+  KS  +S+
Sbjct: 181  NLHF-----------HRQRILHPRDCALTIFAGSSVDGGC-FSDFILTVLRHFKSPGLSV 228

Query: 2035 TEFAAFMCSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGIC---QIFEARQFIPMFSL 1865
             +F+AF+ S P+  VFA  G+ FL                G C    IF +RQ IPMF L
Sbjct: 229  AKFSAFLLSNPINEVFALKGMRFLQGYP----------PTGCCGMFAIFGSRQSIPMFHL 278

Query: 1864 DFCASPFTFMRLHFSVVLRSLYLPDVLIRYLSGL-IKKAREITDNKKCLPCIPTEMGFPG 1688
            DF A P  FM L+  + LR   +   L+   + L +  + +  ++      +P+ +    
Sbjct: 279  DFSAIPLPFMFLYSEMFLRVTRIQARLVYNNNQLDVDISSDSEEDSVEELHVPSPVSSLE 338

Query: 1687 SNSMASWSSSVKKRKVDF-IVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSM 1511
               MA      K R V    VRAT    R++  R+  +SR ++++R+SLR  R R+ S  
Sbjct: 339  RKPMAFLFDRPKTRSVSHPSVRATRLGTRTMQYRNGFSSRGIRKRRSSLRIRRPRSHSLS 398

Query: 1510 GFHHDL-------------FRAGYKHKKRKL-AQKSPCSNVKELKSTLVELKQNMDSVCC 1373
                 +             F +G    + K  A +     ++E  ST +    ++DS CC
Sbjct: 399  AMQKSIGPLAVDDVKLGVSFPSGASCNRHKSSAVRDSAGRIRETNSTALRSAMDVDSSCC 458

Query: 1372 SANILVIESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTH 1193
             ANIL++E+D+C REEGA ++LE  A  +W LV K  G +RY +KAE VM+PS+ NRFTH
Sbjct: 459  KANILIVEADKCLREEGANIVLEFSASCEWLLVVKKDGSTRYTHKAERVMKPSSCNRFTH 518

Query: 1192 AMIWTGENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYV 1013
            A++W+ +NGWKLEF NRRDW IFK+L+  C DRN+  + A+ IPVP V EVP Y DS   
Sbjct: 519  AILWSIDNGWKLEFPNRRDWFIFKDLYKECSDRNIPCLIAKAIPVPRVSEVPDYVDSSGA 578

Query: 1012 PFVRPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKF 833
             F RP  YI++ DDEV RA+ +  ANYDMDS DEEWL + N+G+   D         + F
Sbjct: 579  SFQRPDTYISVNDDEVCRAMTKSTANYDMDSEDEEWLVEFNDGLIATDKH-QECFSEDNF 637

Query: 832  EEMMDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQ 653
            E M+DAFEK  Y  PD  SDE      C+ L     V ++Y YW KKRKQ   +L+RVFQ
Sbjct: 638  ESMVDAFEKGFYCNPDAFSDEKVPADICTPLASPSIVESLYTYWTKKRKQRKSSLIRVFQ 697

Query: 652  C-PPPRSSQLMQKPVFRKKRSFKRQMRQSGRGK-------QQIFFHASAVEPEQDAMQRV 497
                 R   L+ KP+ R+KRS KRQ  QSG G+       + I +   AVE +Q+AMQ+ 
Sbjct: 698  AYQSKRKPPLVPKPMMRRKRSLKRQPSQSGSGRTPQPSILEAILWRRDAVE-DQNAMQKY 756

Query: 496  QKAKSLADISMESVLLKRRRAQILMDNADLATYXXXXXXXXXXXXXXXESPEA 338
            +++K+  +  +E+ + KR+RAQ+L++NADLA Y               +SPEA
Sbjct: 757  EESKAAVEKCIENAVNKRQRAQLLLENADLAVYKAMSALRIAEAIETSDSPEA 809


>ref|XP_004140897.1| PREDICTED: uncharacterized protein LOC101207239 [Cucumis sativus]
          Length = 819

 Score =  417 bits (1073), Expect = e-113
 Identities = 293/833 (35%), Positives = 422/833 (50%), Gaps = 56/833 (6%)
 Frame = -2

Query: 2668 RSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRG-NGDEWFCIIDDTADVPRC-- 2498
            R TRVF    +VK +DGARVLRSG+R   + GE K  +  +  +W+ IID   +      
Sbjct: 6    RRTRVF---GLVKGSDGARVLRSGRRLWPESGEVKLKKSKDASDWYPIIDGRGNGGGSGH 62

Query: 2497 -----KSIDWYEVDPERDIDVT---DFNLNLAXXXXXXXXXXXXXSRDKMHGI------V 2360
                 K      V P+R + V    D +  +              + DK  G+      V
Sbjct: 63   GRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPVKVFPRIGNDDKSSGVDRMFGKV 122

Query: 2359 YNRKRRR-----------LPGNKFVSSPRDRMYGISFVRKQR-RKRSSGHSINELPRKDC 2216
            Y+RKR+R           +  +  +S   DRM+G+ F+R+QR RK    H  +    +  
Sbjct: 123  YSRKRKRGRLEDGEVFDEMESDNVLSG--DRMFGLRFIRRQRSRKTDVEHWESTAGGRTS 180

Query: 2215 QXXXXXXXXXXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISL 2036
                           H   I       L I   SS +    F+ F++++L+  KS  +S+
Sbjct: 181  NLHF-----------HRQRILHPRDCALTIFAGSSVDGGC-FSDFILTVLRHFKSPGLSV 228

Query: 2035 TEFAAFMCSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGIC---QIFEARQFIPMFSL 1865
             +F+AF+ S P+  VFA  G+ FL                G C    IF +RQ IPMF L
Sbjct: 229  AKFSAFLLSNPINEVFALKGMRFLQGYP----------PTGCCGMFAIFGSRQSIPMFHL 278

Query: 1864 DFCASPFTFMRLHFSVVLRSLYLPDVLIRYLSGL-IKKAREITDNKKCLPCIPTEMGFPG 1688
            DF A P  FM L+  + LR   +   L+   + L +  + +  ++      +P+ +    
Sbjct: 279  DFSAIPLPFMFLYSEMFLRVTRIQARLVYNNNQLDVDISSDSEEDSVEELHVPSPVSSLE 338

Query: 1687 SNSMASWSSSVKKRKVDF-IVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSM 1511
               MA      K R V    VRAT    R++  R+  +SR ++++R+SLR  R R+ S  
Sbjct: 339  RKPMAFLFDRPKTRSVSHPSVRATRLGTRTMQYRNGFSSRGIRKRRSSLRIRRPRSHSLA 398

Query: 1510 GFHHDL-------------FRAGYKHKKRKL-AQKSPCSNVKELKSTLVELKQNMDSVCC 1373
                 +             F +G    + K  A +     ++E  ST +    ++DS CC
Sbjct: 399  AMQKSIGPLAVDDVKLGVSFPSGASCNRHKSSAVRDSAGRIRETNSTALGSAMDVDSSCC 458

Query: 1372 SANILVIESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTH 1193
             ANIL++E+D+C REEGA ++LE  A  +W LV K  G +RY +KAE VM+PS+ NRFTH
Sbjct: 459  KANILIVEADKCLREEGANIVLEFSASCEWLLVVKKDGSTRYTHKAERVMKPSSCNRFTH 518

Query: 1192 AMIWTGENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYV 1013
            A++W+ +NGWKLEF NRRDW IFK+L+  C DRN+  + A+ IPVP V EVP Y DS   
Sbjct: 519  AILWSIDNGWKLEFPNRRDWFIFKDLYKECSDRNIPCLIAKAIPVPRVSEVPDYVDSSGA 578

Query: 1012 PFVRPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKF 833
             F RP  YI++ DDEV RA+ +  ANYDMDS DEEWL + N+G+   D         + F
Sbjct: 579  SFQRPDTYISVNDDEVCRAMTKSTANYDMDSEDEEWLIEFNDGLIATDKH-QECFSEDNF 637

Query: 832  EEMMDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQ 653
            E M+DAFEK  Y  PD  SDE      C+ L     V ++Y YW KKRKQ   +L+RVFQ
Sbjct: 638  ESMVDAFEKGFYCNPDAFSDEKAPADICTPLASPSIVESLYTYWTKKRKQRKSSLIRVFQ 697

Query: 652  C-PPPRSSQLMQKPVFRKKRSFKRQMRQSGRGK-------QQIFFHASAVEPEQDAMQRV 497
                 R   L+ KP+ R+KRS KRQ  QSG G+       + I +   AVE +Q+AMQ+ 
Sbjct: 698  AYQSKRKPPLVPKPMMRRKRSLKRQPSQSGSGRTPQPSILEAILWRRDAVE-DQNAMQKY 756

Query: 496  QKAKSLADISMESVLLKRRRAQILMDNADLATYXXXXXXXXXXXXXXXESPEA 338
            +++K+  +  +E+ + KR+RAQ+L++NADLA Y               +SPEA
Sbjct: 757  EESKAAVEKCIENAVSKRQRAQLLLENADLAVYKAMSALRIAEAIETSDSPEA 809


>gb|EYU39775.1| hypothetical protein MIMGU_mgv1a001436mg [Mimulus guttatus]
          Length = 820

 Score =  414 bits (1063), Expect = e-112
 Identities = 298/836 (35%), Positives = 420/836 (50%), Gaps = 72/836 (8%)
 Frame = -2

Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGE---EKPVRGNGDE--WFCII 2525
            MP V MRR+TRVF          G RVLRSG+R   +  +    K  R +  E  W  I 
Sbjct: 1    MPSVGMRRNTRVF----------GTRVLRSGRRLWTEPSKGSNNKNARASHAENKWTDIP 50

Query: 2524 DDTADVPRCKSIDWYEVDPERDIDVTDFNL----NLAXXXXXXXXXXXXXSRDKMHGIVY 2357
            D         + D     P  D +    ++     +               RD+M GIVY
Sbjct: 51   DGGGGGGGDAASDRLNHTPREDKNSASSDMIVDPTIEERAPEGGGAVEVKDRDRMCGIVY 110

Query: 2356 NRKRRR-LPGNKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXX 2180
             RKR+R L          D+ YG  FVR++ RKR       E   K              
Sbjct: 111  RRKRKRKLVELGKTGLTEDKRYGKKFVRERWRKRFGATESFESCAK----------FGGS 160

Query: 2179 XSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPL 2000
              G    + V++        ESS    +   CFL  +L ++   RI +   +AFM S+P+
Sbjct: 161  VRGRRELVVVVN--------ESSNWCGYWVACFLSCVLSYMTKVRIGMRRMSAFMLSKPI 212

Query: 1999 VRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFS 1820
              V++ HGV F+ +     +  N +  PG+C I  +R  +P+FS+DF A P  F+ +  S
Sbjct: 213  FDVYSSHGVLFVQDAI--TARNNGIKKPGLCIISGSRSLVPIFSVDFSAIPSVFVHMQTS 270

Query: 1819 VVLRSLYLPDVLI-RYLSGLIKKAREIT--DNKKCLPCIPTEMGFPG-SNSMASWSSSVK 1652
            + LRS +L  +L+ R      ++  E+T  D +  L        FP    +  S  S ++
Sbjct: 271  LYLRSEHLAFLLVARSTDDDYEEDEEVTAMDEEPYL--------FPSCEQNQDSLDSPIR 322

Query: 1651 KRKV-DFIVRATDFARRSV-STRHSA---------------NSRNVQRKRTSLRSTRARN 1523
                 D +    D +R  + S+ HS                NSRN++++R+SLR  R R 
Sbjct: 323  DVSCSDVLAFGNDDSRGKIESSSHSPLGLPKSSALRSLQLRNSRNIKKRRSSLRRKRGRP 382

Query: 1522 SSSM-------GFHHDLFR-------------------AGYKHKKRKLA----------- 1454
             SS            D FR                   +  K+  +K +           
Sbjct: 383  PSSFRTQKSSGALASDFFRIRNDAVQFSALSPTRLLRSSDKKNSDKKKSDKNSSDKKSSD 442

Query: 1453 QKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESDRCFREEGAKVMLECLALNDWYLV 1274
            +KS  SN+KE K       Q++    CSANIL+ E+D+C+REEGA V LE      W+LV
Sbjct: 443  KKSSTSNIKETKPAT----QDIYPSTCSANILITETDKCYREEGATVALELSPSKQWFLV 498

Query: 1273 AKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGWKLEFSNRRDWLIFKELHMACMDR 1094
                G  RY   AE VMRPS  NRF+HA+IW+G+  +KLEFSN++DW +FKEL+  C +R
Sbjct: 499  IGKDGTKRYNLTAEKVMRPSCSNRFSHAVIWSGDCNFKLEFSNKQDWFVFKELYKQCSER 558

Query: 1093 NLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYITMRDDEVARALVRKPANYDMDSGD 914
            N+Q  S   IPVPGV EV     + ++P+VRP  YIT++DDE+ RALV+K ANYDMDS D
Sbjct: 559  NMQSPSVSVIPVPGVQEVSMPFYNNFMPYVRPDNYITVKDDELIRALVKKGANYDMDSDD 618

Query: 913  EEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKAAYHTPDEVSDESRATSFCSSLER 734
            EEWL + N+ +  G + +   + PE FE ++DA EK  +  PDE  +E  A  FC  LER
Sbjct: 619  EEWLSEFNDELC-GGMELQEPVSPECFELVIDALEKGVHCNPDENFEELAAYDFCMHLER 677

Query: 733  KDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLMQKPVFRKKRSFKRQMRQSGRGKQ 554
            ++ + A+  YW+KKRKQ   AL+R+FQ   PR  Q++ K VFRKKRSFKRQ  Q GRGKQ
Sbjct: 678  REVIEAIRNYWVKKRKQKRSALVRIFQLYQPRRIQVIPKSVFRKKRSFKRQASQGGRGKQ 737

Query: 553  QIFFHASAVE----PEQDAMQRVQKAKSLADISMESVLLKRRRAQILMDNADLATY 398
            +    A A E     +Q+  Q++Q+AK+ A+      + KR+RAQ+LM+NADLATY
Sbjct: 738  RPILQAIAAERDALEQQNNAQKLQEAKAAAERFEALAVEKRQRAQMLMENADLATY 793


>ref|XP_006398922.1| hypothetical protein EUTSA_v10012741mg [Eutrema salsugineum]
            gi|557100012|gb|ESQ40375.1| hypothetical protein
            EUTSA_v10012741mg [Eutrema salsugineum]
          Length = 777

 Score =  414 bits (1063), Expect = e-112
 Identities = 281/806 (34%), Positives = 418/806 (51%), Gaps = 42/806 (5%)
 Frame = -2

Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGNG---DEWFC---- 2531
            MP V MRR+TRVF    VVK  DGARVLRSG+R   +  E K  R +     +W C    
Sbjct: 1    MPSVGMRRTTRVF---GVVKAADGARVLRSGRRIWPNVDEPKVKRAHDVVDRDWNCLNPS 57

Query: 2530 ------IIDDTADVPRCKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMH 2369
                  +    ++    +     E+  E+D    DF +                + DK+ 
Sbjct: 58   KGKGNKVSGGRSNGAGSRPCSPREISSEKDDKEIDFPVRKRRKVATAEAVGDEKTVDKLF 117

Query: 2368 GIVYNRKRRRLPGNKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXX 2189
            G+VY+RKR+RL G    +   + +  + F    RRKR S   ++  PR+           
Sbjct: 118  GVVYSRKRKRLSGQSSDNRSEEPLRSLKFYC--RRKRLSDRVVS--PRR----------- 162

Query: 2188 XXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCS 2009
                         ++G ++ + V++SC  +   T F++ ++++V+  ++ L+  A+F  S
Sbjct: 163  -------------LYGPVITLTVDASCEESWFSTVFVL-VMRYVRRGQLGLSSLASFFLS 208

Query: 2008 EPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRL 1829
            +P+  VFA HGV FLA        +  L S G+C+ F A   +P+FS DF A P  FM +
Sbjct: 209  QPINDVFADHGVRFLA--------EPPLSSRGVCKFFGALNCLPLFSADFNAIPRCFMDM 260

Query: 1828 HFSVVLRSLYLPDVLIRYLSGLIKKAREITDNKKCL----PCIPTEMGFPGSNSMASWSS 1661
            HF++ LR +      ++    L+    E +D++  +    PC P      G +       
Sbjct: 261  HFTLFLRVVPRSFAFVKKSLYLLNNPVEESDSESEIVLSEPCNPRNGVVVGLHPS----- 315

Query: 1660 SVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH------ 1499
                      V A+     +   R S    ++Q++R+SLR  RARN S  G H       
Sbjct: 316  ----------VTASKLTGGNAQYRGSLGFHSIQKRRSSLRRRRARNLSH-GVHKPHNGTP 364

Query: 1498 -DLFRAGYKHKKRKLAQKSPCSNVKELKS---------TLVELKQNMDSVCCSANILVIE 1349
                   +K++   ++ +   S+V    S         +    K+ +DS+CCSANILVI 
Sbjct: 365  VSELSGNWKNRTTSVSSRKLRSSVLNNSSPSSNGISTISKPRTKEELDSLCCSANILVIG 424

Query: 1348 SDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGEN 1169
            SDRC REEG  VMLE  +  +W++V K  G  RY ++A   MRP + NRFT +++W G+N
Sbjct: 425  SDRCTREEGCGVMLEFSSSKEWFVVIKKDGAIRYRHRARKTMRPCSCNRFTQSIVWLGDN 484

Query: 1168 GWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCY--DDSGYVPFVRPI 995
             WKLEF +++DWL FKE++  C +RN+   +A+ IP+PGV EV  Y  D + +  FV P+
Sbjct: 485  DWKLEFCDKQDWLGFKEIYNECYERNILEQNAKVIPIPGVREVSGYSEDIADFPSFVMPV 544

Query: 994  AYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDA 815
             YI++++DEV RA+ R  A YDMDS DEEWLE+ N  +   +      +  + FE M+D 
Sbjct: 545  PYISVKEDEVTRAMARNIAIYDMDSEDEEWLERQNEEMLGEEHEQSQRLEQDAFELMIDG 604

Query: 814  FEKAAYHTP-DEVSDESRAT-SFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPP 641
            FEK  + +P D++ +E  AT +  S L R++ V AV+ YW +KRKQ    LLRVFQ    
Sbjct: 605  FEKCFFQSPADDLLNEKAATVASLSYLGRQEVVEAVHDYWARKRKQRKAPLLRVFQGHQA 664

Query: 640  RSSQLMQKPVFRKKRSFKRQMRQ-SGRGKQQIFFHASAVE----PEQDAMQRVQKAKSLA 476
            + S L+ K VFRK+RSFKRQ  Q  G+ KQ       A E     EQ+   RV++AK+LA
Sbjct: 665  KKSPLLFKHVFRKRRSFKRQGSQLHGKSKQLSLVGVKAAEQEASEEQNDYLRVEEAKALA 724

Query: 475  DISMESVLLKRRRAQILMDNADLATY 398
            D +ME  + KRRRAQ+L +NADLA Y
Sbjct: 725  DRAMEIAIAKRRRAQVLAENADLAVY 750


>ref|NP_196087.1| Enhancer of polycomb-like transcription factor protein [Arabidopsis
            thaliana] gi|7413529|emb|CAB86009.1| putative protein
            [Arabidopsis thaliana] gi|332003387|gb|AED90770.1|
            Enhancer of polycomb-like transcription factor protein
            [Arabidopsis thaliana]
          Length = 766

 Score =  409 bits (1052), Expect = e-111
 Identities = 291/810 (35%), Positives = 411/810 (50%), Gaps = 46/810 (5%)
 Frame = -2

Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGNGDEWFCIIDDTAD 2510
            MP V MRR+TRVF    VVK  DGARVLRSG+R   + GE K  R +      ++D   D
Sbjct: 1    MPSVGMRRTTRVF---GVVKAADGARVLRSGRRIWPNVGEPKVRRAHD-----VVDRDCD 52

Query: 2509 -------------VPRCKSIDW----YEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSR 2381
                         V   KS        +V  E++  V DF +                  
Sbjct: 53   SVLKNQNKSKGNKVSSGKSNSQPCSPKQVSSEKEDKVDDFPVTKRRKVRNEGVGDEKTV- 111

Query: 2380 DKMHGIVYNRKRRRLPGNKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXX 2201
            DKM GIVY+RKR+RL          + +  + F R+ RRK S   S              
Sbjct: 112  DKMFGIVYSRKRKRLCEPSSSDRSEEPLRSLKFYRR-RRKLSQRVS-------------- 156

Query: 2200 XXXXXXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAA 2021
                                ++L + V+ SC      T F ++ +++++   + L+  A+
Sbjct: 157  --------------------SVLTLTVDWSCEDCWFLTVFGLA-MRYIRREELRLSSLAS 195

Query: 2020 FMCSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFT 1841
            F  S+P+ +VFA HGV FL         ++ L S G+C+ F A   +P+FS DF   P  
Sbjct: 196  FFLSQPINQVFADHGVRFLV--------RSPLSSRGVCKFFGAMSCLPLFSADFAVIPRW 247

Query: 1840 FMRLHFSVVLRSLYLPDVLIRYLSGLIKKAREITDNKKCL----PCIPTEMGFPGSNSMA 1673
            FM +HF++ +R L      +     L+    E +D++  L    PC P      G +   
Sbjct: 248  FMDMHFTLFVRVLPRSFFFVEKSLYLLNNPIEESDSESELALPEPCTPRNGVVVGLHPS- 306

Query: 1672 SWSSSVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSS----SMGF 1505
                          VRA+     +   R +  S + Q++R+SLR  RARN S     +  
Sbjct: 307  --------------VRASKLTGGNAQYRGNLGSHSFQKRRSSLRRRRARNLSHNAHKLNN 352

Query: 1504 HHDLFRAGYKHKKRK------------LAQKSPCSNVKELKSTLVELKQNMDSVCCSANI 1361
               +F      K R             L+  SP SN   +   + + K+ +DS+CCSANI
Sbjct: 353  GTPVFDISGSRKNRTAAVSSKKLRSSVLSNSSPVSNGISI-IPMTKTKEELDSICCSANI 411

Query: 1360 LVIESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIW 1181
            L+I SDRC REEG  VMLE  +  +W+LV K  G  RY + A+  MRP + NR THA +W
Sbjct: 412  LMIHSDRCTREEGFSVMLEASSSKEWFLVIKKDGAIRYSHMAQRTMRPFSSNRITHATVW 471

Query: 1180 TGENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDD--SGYVPF 1007
             G + WKLEF +R+DWL FK+++  C +RNL   S + IP+PGV EV  Y +    +  F
Sbjct: 472  MGGDNWKLEFCDRQDWLGFKDIYKECYERNLLEQSVKVIPIPGVREVCGYAEYIDNFPSF 531

Query: 1006 VR-PIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFE 830
             R P++YI++ +DEV+RA+ R  A YDMDS DEEWLE+ N  + + +      +  E FE
Sbjct: 532  SRPPVSYISVNEDEVSRAMARSIALYDMDSEDEEWLERQNQKMLNEEDDQYLQLQREAFE 591

Query: 829  EMMDAFEKAAYHTP-DEVSDESRAT-SFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVF 656
             M+D FEK  +H+P D++ DE  AT    S L R++ V AV+ YW+KKRKQ    LLR+F
Sbjct: 592  LMIDGFEKYHFHSPADDLLDEKAATIGSISYLGRQEVVEAVHDYWLKKRKQRKAPLLRIF 651

Query: 655  QCPPPRSSQLMQKPVFRKKRSFKRQMRQ-SGRGKQQI--FFHASAVEP-EQDAMQRVQKA 488
            Q    + +QL+ KPVFRK+RSFKRQ  Q  G+ KQ         A EP E+D + R+++A
Sbjct: 652  QGHQVKKTQLLSKPVFRKRRSFKRQGSQLHGKAKQTSPWMVAVKAAEPEEEDDILRMEEA 711

Query: 487  KSLADISMESVLLKRRRAQILMDNADLATY 398
            K LAD +ME+ + KRRRAQIL +NADLA Y
Sbjct: 712  KVLADKTMETAIAKRRRAQILAENADLAVY 741


>ref|XP_007221418.1| hypothetical protein PRUPE_ppa001422mg [Prunus persica]
            gi|462418130|gb|EMJ22617.1| hypothetical protein
            PRUPE_ppa001422mg [Prunus persica]
          Length = 768

 Score =  408 bits (1048), Expect = e-111
 Identities = 283/759 (37%), Positives = 390/759 (51%), Gaps = 38/759 (5%)
 Frame = -2

Query: 2695 TLMPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRG-NGDE-WFCIID 2522
            T MP VEMRR+TRVF    V    DGARVLRSG+R   +  E K  R  NGDE W  ++ 
Sbjct: 52   TEMPSVEMRRTTRVFGMGMVKGGVDGARVLRSGRRLWPESSESKLERARNGDEDWLKLMK 111

Query: 2521 DTA--DVPRCKSIDWYEVD----PERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMHGIV 2360
              A   V       W   +    P R+  V     +L                 K +GIV
Sbjct: 112  SHAGESVVGLNHKKWAGANQVGSPRRNTPV--LKTSLVKKPQSNELLADLLKEHKRYGIV 169

Query: 2359 YNRKRRRLPGNKFVSSPR-----DRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXX 2195
            Y RKR+R   +   +  +     DRMYG  F R+QR K+S      EL            
Sbjct: 170  YTRKRKRASASFLGNVEKENGSDDRMYGRRFARRQRMKKSK-----EL------------ 212

Query: 2194 XXXXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFM 2015
                     +S    +   +L   VESS    +    FL S+L ++  + + LTEF+ F+
Sbjct: 213  ---------DSHPGFVCPEVLCFSVESSWAQGYWAGRFLYSVLVYMTRASLGLTEFSEFL 263

Query: 2014 CSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFM 1835
              EP+  +FA +G+ F  + S            G+C++F A QFIP+FS+DF A P  FM
Sbjct: 264  ALEPIGSIFASYGIQFSRDRSCTRR-------SGVCKLFGAEQFIPLFSVDFSAVPGCFM 316

Query: 1834 RLHFSVVLR---SLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGSNSMASWS 1664
             +  S+ LR    L + +++  + +G      +  D+ + +  I         N  A  S
Sbjct: 317  FMQTSMHLRFRCHLTVNNLIDGHENGEFIDQGDDDDDGEKVDFI--------ENRHALHS 368

Query: 1663 SSVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH----- 1499
            S          VR    A RS   R+   SR +Q++R+SLR  R+RN S +         
Sbjct: 369  S----------VRVPKLACRSTQYRNGLTSRGIQKRRSSLRRRRSRNPSLVSLRKPNGAL 418

Query: 1498 -----DLFRAGY-------KHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILV 1355
                  + + G        KH  RK    S   N+K    T+   K+++DS  CSANIL 
Sbjct: 419  VSELISIRKNGLPFSSVESKHMLRKSVSLSLAGNLKAESLTIEGSKRDLDSTSCSANILF 478

Query: 1354 IESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWT- 1178
             E D+C+RE+GA VMLE  +  +W LV K +GL+RY +KAE VMRP + NR T A+IW+ 
Sbjct: 479  TELDKCYREDGATVMLEMSSSGEWLLVVKKNGLTRYTHKAEKVMRPCSKNRITQAIIWSA 538

Query: 1177 ---GENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPF 1007
               G+N WKLEF NR DW IFK+L+  C DR +   + + IPVPGV EVP Y DS    F
Sbjct: 539  DSNGDNNWKLEFPNRCDWAIFKDLYKECSDRVVPAPAIKFIPVPGVREVPGYADSHSTLF 598

Query: 1006 VRPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEE 827
             RP +YI + DDEV+RA+ ++ ANYDMDS DEEWL+K N+  F  +  + +++  + FE 
Sbjct: 599  DRPESYIYLNDDEVSRAMAKRTANYDMDSDDEEWLKKFNSDFF-AENELHDHVSEDNFEL 657

Query: 826  MMDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNH-VALLRVFQC 650
            M+DAFEKA Y  P + +DE+ A + C  + R++ V A+Y YWM KRKQ    +LLRVFQ 
Sbjct: 658  MVDAFEKAFYCRPYDFADENAAANLCLDMGRREVVEAIYSYWMNKRKQKRSSSLLRVFQG 717

Query: 649  PPPRSSQLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHAS 533
               + + L  KPV RK+RSFKRQ  Q GRGKQ  F   +
Sbjct: 718  HQSKRALLDPKPVLRKRRSFKRQPSQFGRGKQPSFLQGT 756


>ref|XP_007145542.1| hypothetical protein PHAVU_007G247300g [Phaseolus vulgaris]
            gi|561018732|gb|ESW17536.1| hypothetical protein
            PHAVU_007G247300g [Phaseolus vulgaris]
          Length = 734

 Score =  402 bits (1032), Expect = e-109
 Identities = 284/808 (35%), Positives = 393/808 (48%), Gaps = 44/808 (5%)
 Frame = -2

Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGN-GDEWFCIIDDTA 2513
            MP   MRR+TRVF     +K  D ARVLRSG+R   D GE K  R + GDE         
Sbjct: 1    MPAAGMRRTTRVFG----MKGADTARVLRSGRRLWPDSGEVKTKRSSDGDE--------- 47

Query: 2512 DVPRCKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMHGIVYNRKRRRLP 2333
                      + V P +                            KM  ++  R   +  
Sbjct: 48   ----------WAVTPAKAA--------------------------KMDAVMTPRGTAKGK 71

Query: 2332 GNKFVSSPRD----RMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXXXSGHE 2165
              + V   RD    R +GI +VR+++  +  G                            
Sbjct: 72   RQEAVVDARDSTVDRRFGIVYVRRRKGLKKEGS--------------------------R 105

Query: 2164 SSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPLVRVFA 1985
             S++V    +L +VV      +  F   L S++++ K  R+S  + + F  S  +  VFA
Sbjct: 106  RSVEVSR-CVLSVVVSRCAGKSALFLRLLASVVRYAKRVRVSPRKLSGFFMSGAVNGVFA 164

Query: 1984 QHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLH----FSV 1817
              G+ F+             ++ GICQ F   +F+P+FS+DF A P  F  LH    F  
Sbjct: 165  SQGMQFVKG--------PPAVNSGICQFFGVTEFVPLFSVDFSAVPLCFEYLHSAMFFKS 216

Query: 1816 VLRSLYL----------------PDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGS 1685
            +LRSL+L                 D L+ Y +   K+    T   +    +         
Sbjct: 217  MLRSLFLVCNPINVRSDVEDMESDDDLLEYQNE--KQISSNTFKGELSETVTVTSDVIEI 274

Query: 1684 NSMASWSSSVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGF 1505
            N + S  SSVK          T  A R+   R+  NSR +Q++R+SLR  +ARN S  G 
Sbjct: 275  NDVLSLQSSVKS--------TTRAAGRNGQYRNMLNSRGIQKRRSSLRKRKARNPSMGGL 326

Query: 1504 H---------------HDLFRAGYKHKK-RKLAQKSPCSNVKELKSTLVELKQNMDSVCC 1373
                            ++ F      K+ R LA  S   ++KE  S +V+ K+ +    C
Sbjct: 327  RRNGAVAFELTGGRKGNNQFSGVTSSKRLRSLANGSTTGSLKEASSAIVDSKERLGLSSC 386

Query: 1372 SANILVIESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTH 1193
            SAN+LV E  +C R EGA V LE  A  +W L  K   L+R  +KAE VMRP + NRFTH
Sbjct: 387  SANLLVSEIHQCHRVEGAIVTLEMSASKEWLLTVKKDELTRSTFKAEKVMRPCSSNRFTH 446

Query: 1192 AMIWTGENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYV 1013
            A++++ +NGWKLEF+NR+DW +FK+L+  C DRN+   +A+ IPVPGV EV  Y +S   
Sbjct: 447  AIMYSLDNGWKLEFTNRQDWNVFKDLYKKCSDRNIPSTAAKFIPVPGVREVSSYAESNSF 506

Query: 1012 PFVRPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKF 833
            PF RP  YI++  DE+ RA+ R  ANYDMDS DEEWL+K NN          N +  + F
Sbjct: 507  PFHRPDTYISVFGDELTRAMARTTANYDMDSEDEEWLKKFNN-------ECQNPVSDDNF 559

Query: 832  EEMMDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQ 653
            E ++D  EK  Y  PDE+ DE  AT+ C  L  K+ V AVY YWM+KRKQ    L+RVFQ
Sbjct: 560  ELIIDTLEKVYYCNPDELFDEKSATNGCQDLGSKEVVEAVYNYWMRKRKQKRSLLIRVFQ 619

Query: 652  CPPPRSSQLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVEP---EQDAMQRVQKAKS 482
                + + L+ KP+ RK+RSFKRQ  Q GR  Q     A A E    E++AM R+++AK+
Sbjct: 620  GHQSKRAPLIPKPLLRKRRSFKRQPSQFGRSNQPSVLKAFAAEQDAMEENAMLRIEEAKA 679

Query: 481  LADISMESVLLKRRRAQILMDNADLATY 398
             A++SME  + KRRRAQ L  NADLATY
Sbjct: 680  NANMSMELAIHKRRRAQSLAQNADLATY 707


Top