BLASTX nr result

ID: Akebia22_contig00009626 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00009626
         (3522 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002265036.1| PREDICTED: uncharacterized protein LOC100266...   527   e-146
ref|XP_006436656.1| hypothetical protein CICLE_v10030776mg [Citr...   496   e-137
ref|XP_002315450.2| hypothetical protein POPTR_0010s24240g [Popu...   494   e-137
emb|CBI40243.3| unnamed protein product [Vitis vinifera]              494   e-137
ref|XP_002532013.1| conserved hypothetical protein [Ricinus comm...   482   e-133
ref|XP_002311034.2| hypothetical protein POPTR_0008s02470g [Popu...   473   e-130
ref|XP_007010267.1| Enhancer of polycomb-like transcription fact...   470   e-129
ref|XP_006360530.1| PREDICTED: uncharacterized protein LOC102597...   459   e-126
ref|XP_007010268.1| Enhancer of polycomb-like transcription fact...   459   e-126
ref|XP_006360531.1| PREDICTED: uncharacterized protein LOC102597...   459   e-126
ref|XP_007221419.1| hypothetical protein PRUPE_ppa001422mg [Prun...   456   e-125
ref|XP_004243418.1| PREDICTED: uncharacterized protein LOC101263...   452   e-124
gb|EXC25392.1| hypothetical protein L484_016774 [Morus notabilis]     437   e-119
ref|XP_004166800.1| PREDICTED: uncharacterized LOC101207239 [Cuc...   418   e-113
ref|XP_004140897.1| PREDICTED: uncharacterized protein LOC101207...   418   e-113
gb|EYU39775.1| hypothetical protein MIMGU_mgv1a001436mg [Mimulus...   414   e-112
ref|XP_006398922.1| hypothetical protein EUTSA_v10012741mg [Eutr...   413   e-112
ref|NP_196087.1| Enhancer of polycomb-like transcription factor ...   409   e-111
ref|XP_007221418.1| hypothetical protein PRUPE_ppa001422mg [Prun...   408   e-111
ref|XP_007145542.1| hypothetical protein PHAVU_007G247300g [Phas...   402   e-109

>ref|XP_002265036.1| PREDICTED: uncharacterized protein LOC100266152 [Vitis vinifera]
          Length = 791

 Score =  527 bits (1358), Expect = e-146
 Identities = 322/800 (40%), Positives = 452/800 (56%), Gaps = 36/800 (4%)
 Frame = -3

Query: 2992 MPLVEMRRSTRVFVPKSVVKDT-DGARVLRSGKRFSLDCGEEKPVRGNGDEWFCIIDDTA 2816
            MP V MRR+TRVFVPK+  K    GARVLRSG+R   D GE K  R    +WF ++ ++ 
Sbjct: 1    MPSVGMRRTTRVFVPKTAAKGAAGGARVLRSGRRLWPDSGEGKLTRDA--DWFRLLHNSG 58

Query: 2815 DVPR-------CKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMH-GIVY 2660
                        K   W+EV+ ++++D  D  + ++               D    GIVY
Sbjct: 59   GGGGGAGGGGGLKENGWHEVNSKQEVDDVDAEVAVSESRNVAGKCGDDQGSDYSRWGIVY 118

Query: 2659 NRKRRRLPGNKFVSSPR-----DRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXX 2495
            +R+ +R      +S  +     D+ +GI F RKQRRKR                      
Sbjct: 119  SRRTKRSDSKSLLSPEKKRGFEDKRFGIRFSRKQRRKRMEESE----------------- 161

Query: 2494 XXXXXSGHESSIDVVHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMC 2315
                  G    +++V      +V++SS +   RFT FL SIL +++ SR+ L     F+ 
Sbjct: 162  -----EGGYVCVEMV-----TVVIDSSRSGRCRFTSFLNSILGYMRRSRVRLWGLYEFLT 211

Query: 2314 SEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMR 2135
             EP++  F+ HGV FL +     S+       GIC+IF AR+FIP+FS+DF A P  FM 
Sbjct: 212  WEPMMDAFSSHGVRFLRDPPCARSF-------GICKIFGARRFIPLFSVDFSAVPSCFMY 264

Query: 2134 LHFSVVLRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGSNSMASWS-SSV 1958
            LH S++LR   LP VL+     +     E  D+++ L CIP++    GS S+   + +S 
Sbjct: 265  LHSSMLLRFGCLPFVLVNNSMSVCSNGEEPIDSEENLLCIPSKKDHFGSKSITLENDNSG 324

Query: 1957 KKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHHD------- 1799
            K+R +   +  + F+ R+   R+  NSR++Q++R+S RS R RN S +G H         
Sbjct: 325  KRRMLQPTIGTSRFSGRNAQWRNGVNSRSIQKRRSSQRSRRVRNPSLVGIHKSNGALVSD 384

Query: 1798 ----------LFRAGYKHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIES 1649
                           Y  + R+ A+ +  +N++ELKST V +K+ +DSVCCSANIL++ES
Sbjct: 385  FITNRNKGIPFSSVVYNQELRRSARHASATNIRELKSTSVVVKEEIDSVCCSANILIVES 444

Query: 1648 DRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENG 1469
            DRCFRE GA VMLE  A  +W++  K  G  +Y +KAE  MR ++ NR THAMIW GE+G
Sbjct: 445  DRCFRENGANVMLEVSASKEWFIAVKKDGSMKYSHKAEKDMRYAS-NRHTHAMIWNGEDG 503

Query: 1468 WKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYI 1289
            WKLEF NR+DW+IFKEL+  C DRN++  S + IPVPGV+EV  Y D    PF RP  YI
Sbjct: 504  WKLEFPNRQDWMIFKELYKECCDRNVEAPSVKIIPVPGVHEVTDYGDYKGDPFSRPDTYI 563

Query: 1288 TMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEK 1109
              ++DEV+RA+ +  A+YDMDS DEEWL+KLN+  F  +  +  ++  E FE M+DAFEK
Sbjct: 564  AFKNDEVSRAMAKTTASYDMDSEDEEWLKKLNS-EFHAENDLHGHVSEEDFELMVDAFEK 622

Query: 1108 AAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQL 929
            A Y +PD+  D + A   C  L  ++ +A VY YWMKKRK+   +L+RVFQ    R +QL
Sbjct: 623  AVYCSPDDYPDANGAADLCVDLGSREAIACVYGYWMKKRKRKRGSLVRVFQGHHLRKAQL 682

Query: 928  MQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVE----PEQDAMQRVQKAKSLADISMES 761
            + KPV RKKRSF RQ+ + GRGKQQ    A A +     E  A  + Q+A+   D S + 
Sbjct: 683  IPKPVLRKKRSFSRQVGKFGRGKQQNVMQALAAQRKAIDETSAKLKAQEARVSLDRSEKL 742

Query: 760  VLLKRRRAQILMDNADLATY 701
             + KR RAQ LM+NADLATY
Sbjct: 743  AIRKRVRAQSLMENADLATY 762


>ref|XP_006436656.1| hypothetical protein CICLE_v10030776mg [Citrus clementina]
            gi|568878428|ref|XP_006492195.1| PREDICTED:
            uncharacterized protein LOC102612244 [Citrus sinensis]
            gi|557538852|gb|ESR49896.1| hypothetical protein
            CICLE_v10030776mg [Citrus clementina]
          Length = 758

 Score =  496 bits (1277), Expect = e-137
 Identities = 318/796 (39%), Positives = 437/796 (54%), Gaps = 32/796 (4%)
 Frame = -3

Query: 2992 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGN-GDEWFC--IIDD 2822
            MP V MRR+TRVF    VVK  DGARVLRSG+R   D G+ K  R N GD+W+   +I+ 
Sbjct: 1    MPSVGMRRTTRVF---GVVKGVDGARVLRSGRRLWPDSGDGKLRRTNYGDDWYHHPVINK 57

Query: 2821 T---ADVPRCKSIDWY-EVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMHGIVYNR 2654
                   P+CK   W   +D   D+ V   N                   D M+GIVY+R
Sbjct: 58   KNGGPGGPKCKPNGWAAHLD---DLKVYANNDEKKEVKMCKKVKEELKGADLMYGIVYSR 114

Query: 2653 KRRRLPGNKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXXXSG 2474
            KR+R  G K     + + YGI F R+QRRK+S                            
Sbjct: 115  KRKRNDGEKSKILEKKK-YGIQFSRRQRRKKSE--------------------------- 146

Query: 2473 HESSIDVVHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPLVRV 2294
                  +V  ++  + +ESS  S+     FL S+L  ++ + + L   A+F+ SE +  V
Sbjct: 147  -----KIVPFSVFGVGLESS--SSGFLVSFLSSVLGCMRRATVELPRLASFLLSETISGV 199

Query: 2293 FAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFSVVL 2114
            F+  G+ F        SW   +   G+C+IF   Q IPMFSLDF A P  FM +H  +++
Sbjct: 200  FSLRGIRF--------SWDPPIARTGMCRIFGTMQLIPMFSLDFSAVPSCFMYIHHCMLV 251

Query: 2113 RSLYLPDVLIRYLSGLIKKARE---ITDNKKCLPCIPTEMGFPGSNSMASWSSSVKKRKV 1943
            R +  P V          +  +   + ++K   P +                +SV K  +
Sbjct: 252  RFMRPPSVNSSASEDDSSEEEDVDYVCESKTVTPVV---------------DNSVNKVAL 296

Query: 1942 DFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFH-------HDL---- 1796
               VR++  A R+V  R S NSR +Q++R+SLR  RARN S +G          DL    
Sbjct: 297  HPSVRSSKLAARNVQYRSSLNSRAIQKRRSSLRRRRARNPSLIGSQKASGALVSDLTSCR 356

Query: 1795 ------FRAGYKHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESDRCFR 1634
                    A  K K R   Q S   ++KE+ ST+  L  ++D  CC  +ILV+ESDRC R
Sbjct: 357  KSSIPSSSAVSKSKLRSSLQHSSVLSIKEVSSTVDSLMLDLDRSCCCVSILVMESDRCCR 416

Query: 1633 EEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGWKLEF 1454
             EGA V+LE     +W+LV K  G +RY +KA+ +MRPS+ NRFTHA++W G++ WKLEF
Sbjct: 417  VEGANVILEMSHSKEWHLVVKKDGETRYSFKAQRIMRPSSFNRFTHAILWAGDDNWKLEF 476

Query: 1453 SNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYITMRDD 1274
            SNR+DWL FK+L+  C DRN QV  ++ IP+PGVYEV  Y+DS  VPF RP +YI++  D
Sbjct: 477  SNRQDWLNFKDLYKECSDRNAQVSVSKVIPIPGVYEVLGYEDSNTVPFCRPDSYISVNVD 536

Query: 1273 EVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKAAYHT 1094
            EV+RAL ++ ANYDMDS DEEWL+K NN  F  +  +  ++  + FE ++DAFEKA + +
Sbjct: 537  EVSRALAKRTANYDMDSEDEEWLKKFNN-EFVTENELHEHVSEDTFELIVDAFEKAYFCS 595

Query: 1093 PDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLMQKPV 914
            PD+ S+E  A + C  L RK+ V AVY +W +KRKQ   ALLRVFQ   P+   L+ KP 
Sbjct: 596  PDDYSNEEAAVNLCLELGRKEVVLAVYNHWKQKRKQKRAALLRVFQGRQPKKPSLIPKPA 655

Query: 913  FRKKRSFKRQMRQSGRGKQQIFFHASAVE-----PEQDAMQRVQKAKSLADISMESVLLK 749
             RK+RSFKRQ  Q GRGK  +      V       EQ+AM+RV++AK+ A  S+E  +LK
Sbjct: 656  LRKRRSFKRQASQPGRGKPPVVLLPEVVTQQDALEEQNAMRRVEEAKASAKRSLEEAVLK 715

Query: 748  RRRAQILMDNADLATY 701
            R+RAQ+LM NADLATY
Sbjct: 716  RQRAQLLMQNADLATY 731


>ref|XP_002315450.2| hypothetical protein POPTR_0010s24240g [Populus trichocarpa]
            gi|550330500|gb|EEF01621.2| hypothetical protein
            POPTR_0010s24240g [Populus trichocarpa]
          Length = 777

 Score =  494 bits (1273), Expect = e-137
 Identities = 313/802 (39%), Positives = 440/802 (54%), Gaps = 38/802 (4%)
 Frame = -3

Query: 2992 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGN-GDEWFCII---- 2828
            MP V +RR+TRVF    V+K  DGARVLRSG+R   + G+ K  R N GDEW+  I    
Sbjct: 1    MPSVGLRRTTRVF---GVIKGVDGARVLRSGRRLWQESGDGKLRRSNDGDEWYHTIIKND 57

Query: 2827 -------DDTADVPRCKSIDWYEVDP-ERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMH 2672
                   +  +D+   ++  W   D  ++D+ V                       +K  
Sbjct: 58   NYQTKNQNKNSDLKYKENSGWAHDDKLKKDLGVV--------IAIAAPKRIKRVKSEKKF 109

Query: 2671 GIVYNRKRRRLPGNKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXX 2492
            GIVY RKR+RL G K   S  D+ +GI F R+QRR                         
Sbjct: 110  GIVYRRKRKRLGGEKSEDS-EDKKFGIQFSRRQRRSLDD--------------------- 147

Query: 2491 XXXXSGHESSIDVVHGAILDIVVES-SCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMC 2315
                   ESS  +V    L ++VE  S +S++  +CFL S+L+++K   +SL+E A F+ 
Sbjct: 148  -------ESSESLVCTPELVVLVEDFSSSSSNGLSCFLSSVLRYIKRVNLSLSELADFLL 200

Query: 2314 SEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMR 2135
            SEP+  VFA +G+HF  +LS       D I  GIC+ F  RQ +PMFS+DF + P  F+ 
Sbjct: 201  SEPISSVFASNGLHFARDLSA------DRI--GICKFFGTRQLLPMFSVDFSSIPSCFVH 252

Query: 2134 LHFSVVLRSLYLPDVLIRYLSGLIKKAREI--TDNKKCLPCIPTEMGFPGS-NSMASWSS 1964
            +H S+ +R  +L  + +        +  ++  + +K    C   +  F     ++    +
Sbjct: 253  MHLSLFVRFKFLSPIPVNNSLDEDDEDDDVMMSGSKVDQSCTTMKTDFALKITAVPEIDN 312

Query: 1963 SVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH------ 1802
            S  K  V   VRA+  A RS   R+  NSR +Q++R+SLR  R RNS+  G H       
Sbjct: 313  SGSKAVVHPSVRASKLAGRSTQYRNGLNSRGIQKRRSSLRRGRPRNSAIAGLHKASGALV 372

Query: 1801 -DLF---RAGY-------KHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVI 1655
             DL    R G        K+K R+  + SP +N+KE+ S  V +K++M+   CSANILV 
Sbjct: 373  SDLISSRRKGIPFSSVVSKNKLRRSVRSSPAANIKEMNSAAVGVKKDMNMSSCSANILVS 432

Query: 1654 ESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGE 1475
            ESDRC+R EGA VM E     +W LV K  GL+RY + A+  MR    NRFTH +IWTG+
Sbjct: 433  ESDRCYRIEGATVMFEFTGSREWVLVVKKDGLTRYTHLAQKSMRTCASNRFTHDIIWTGD 492

Query: 1474 NGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIA 1295
            + WKLEF NR+DW IFKEL+  C D N+    ++ I VPGV EV  Y++ G  PF+RP A
Sbjct: 493  DNWKLEFPNRQDWFIFKELYKECSDCNVPASVSKVISVPGVREVLGYENGGGAPFLRPYA 552

Query: 1294 YITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAF 1115
            YI+  +DEVARAL R  A+YDMDS DEEWL+K NN      +   +++  + FE ++DA 
Sbjct: 553  YISSENDEVARALARSTASYDMDSEDEEWLKKYNNDF----LAESDHLSEDNFELLIDAL 608

Query: 1114 EKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSS 935
            EK+ Y  PD+ +DE+ A  +C    R++   AVY YWMKKRKQ    LLRVFQ    + +
Sbjct: 609  EKSYYCNPDDFTDENAAAKYCKDFGRREVAEAVYSYWMKKRKQKCSPLLRVFQGHQAKKT 668

Query: 934  QLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVEPE----QDAMQRVQKAKSLADISM 767
             ++ KPV RK+RSFKR   Q GRGKQ       A + +     +AM ++++A++    S+
Sbjct: 669  PVIPKPVLRKRRSFKRPPSQFGRGKQPSLLPVMAADQDALEGYNAMHKIEEAENSVKRSL 728

Query: 766  ESVLLKRRRAQILMDNADLATY 701
            E+ +LKRRRAQ+LM NADLATY
Sbjct: 729  EAAILKRRRAQLLMKNADLATY 750


>emb|CBI40243.3| unnamed protein product [Vitis vinifera]
          Length = 734

 Score =  494 bits (1273), Expect = e-137
 Identities = 293/730 (40%), Positives = 416/730 (56%), Gaps = 28/730 (3%)
 Frame = -3

Query: 2806 RCKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMH-GIVYNRKRRRLPGN 2630
            RC+   W+EV+ ++++D  D  + ++               D    GIVY+R+ +R    
Sbjct: 12   RCRLNGWHEVNSKQEVDDVDAEVAVSESRNVAGKCGDDQGSDYSRWGIVYSRRTKRSDSK 71

Query: 2629 KFVSSPR-----DRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXXXSGHES 2465
              +S  +     D+ +GI F RKQRRKR                            G   
Sbjct: 72   SLLSPEKKRGFEDKRFGIRFSRKQRRKRMEESE----------------------EGGYV 109

Query: 2464 SIDVVHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPLVRVFAQ 2285
             +++V      +V++SS +   RFT FL SIL +++ SR+ L     F+  EP++  F+ 
Sbjct: 110  CVEMV-----TVVIDSSRSGRCRFTSFLNSILGYMRRSRVRLWGLYEFLTWEPMMDAFSS 164

Query: 2284 HGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFSVVLRSL 2105
            HGV FL +     S+       GIC+IF AR+FIP+FS+DF A P  FM LH S++LR  
Sbjct: 165  HGVRFLRDPPCARSF-------GICKIFGARRFIPLFSVDFSAVPSCFMYLHSSMLLRFG 217

Query: 2104 YLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGSNSMASWS-SSVKKRKVDFIVR 1928
             LP VL+     +     E  D+++ L CIP++    GS S+   + +S K+R +   + 
Sbjct: 218  CLPFVLVNNSMSVCSNGEEPIDSEENLLCIPSKKDHFGSKSITLENDNSGKRRMLQPTIG 277

Query: 1927 ATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHHD----------------- 1799
             + F+ R+   R+  NSR++Q++R+S RS R RN S +G H                   
Sbjct: 278  TSRFSGRNAQWRNGVNSRSIQKRRSSQRSRRVRNPSLVGIHKSNGALVSDFITNRNKGIP 337

Query: 1798 LFRAGYKHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESDRCFREEGAK 1619
                 Y  + R+ A+ +  +N++ELKST V +K+ +DSVCCSANIL++ESDRCFRE GA 
Sbjct: 338  FSSVVYNQELRRSARHASATNIRELKSTSVVVKEEIDSVCCSANILIVESDRCFRENGAN 397

Query: 1618 VMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGWKLEFSNRRD 1439
            VMLE  A  +W++  K  G  +Y +KAE  MR ++ NR THAMIW GE+GWKLEF NR+D
Sbjct: 398  VMLEVSASKEWFIAVKKDGSMKYSHKAEKDMRYAS-NRHTHAMIWNGEDGWKLEFPNRQD 456

Query: 1438 WLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYITMRDDEVARA 1259
            W+IFKEL+  C DRN++  S + IPVPGV+EV  Y D    PF RP  YI  ++DEV+RA
Sbjct: 457  WMIFKELYKECCDRNVEAPSVKIIPVPGVHEVTDYGDYKGDPFSRPDTYIAFKNDEVSRA 516

Query: 1258 LVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKAAYHTPDEVS 1079
            + +  A+YDMDS DEEWL+KLN+  F  +  +  ++  E FE M+DAFEKA Y +PD+  
Sbjct: 517  MAKTTASYDMDSEDEEWLKKLNS-EFHAENDLHGHVSEEDFELMVDAFEKAVYCSPDDYP 575

Query: 1078 DESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLMQKPVFRKKR 899
            D + A   C  L  ++ +A VY YWMKKRK+   +L+RVFQ    R +QL+ KPV RKKR
Sbjct: 576  DANGAADLCVDLGSREAIACVYGYWMKKRKRKRGSLVRVFQGHHLRKAQLIPKPVLRKKR 635

Query: 898  SFKRQMRQSGRGKQQIFFHASAVE----PEQDAMQRVQKAKSLADISMESVLLKRRRAQI 731
            SF RQ+ + GRGKQQ    A A +     E  A  + Q+A+   D S +  + KR RAQ 
Sbjct: 636  SFSRQVGKFGRGKQQNVMQALAAQRKAIDETSAKLKAQEARVSLDRSEKLAIRKRVRAQS 695

Query: 730  LMDNADLATY 701
            LM+NADLATY
Sbjct: 696  LMENADLATY 705


>ref|XP_002532013.1| conserved hypothetical protein [Ricinus communis]
            gi|223528325|gb|EEF30368.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 781

 Score =  482 bits (1241), Expect = e-133
 Identities = 308/802 (38%), Positives = 432/802 (53%), Gaps = 38/802 (4%)
 Frame = -3

Query: 2992 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGN-GDEWFCII---- 2828
            MP V MRRSTRVF    VVK  DGARVLRSG+R  +  GE K  R N GDEW   +    
Sbjct: 1    MPSVGMRRSTRVF---GVVKGVDGARVLRSGRRLLIGAGENKFKRANDGDEWLHTMIKNH 57

Query: 2827 ---DDTADVPRC-KSIDWYEVDP-------ERDIDVTDFNLNLAXXXXXXXXXXXXXSRD 2681
                + + + +C K   W +          ER   V    L +              S +
Sbjct: 58   HHNHNNSPIMKCNKENGWTQTQTHVSKLKKERPSPVA---LGVGAGAGNEVAKKVNDSGN 114

Query: 2680 KMHGIVYNRKRRRLPG-NKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXX 2504
            KM GIVY+RKRRR+ G +K     R++ +GI F R+QRR+                    
Sbjct: 115  KMWGIVYSRKRRRMSGIDKLEILGRNKKFGIQFSRRQRRRVLK----------------- 157

Query: 2503 XXXXXXXXSGHESSIDVVHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAA 2324
                       ++ ++    A+L I+V+ SC+S+     FL  +L +++ + +S+ E   
Sbjct: 158  -----------DNEVESFEPALLGIIVDGSCSSSGLAASFLHLVLGYIRRTNLSIAELVP 206

Query: 2323 FMCSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFT 2144
            F+ SE +   FA  G+ FL + +   +        GIC+IF     +P+FSLDF A PF 
Sbjct: 207  FLLSESVKCAFASDGLRFLQDTTANRN--------GICKIFGGMSTVPIFSLDFSAVPFC 258

Query: 2143 FMRLHFSVVLRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGSNSMASWSS 1964
            F+ +H  +  R   L    +            I+++++   C     G   +++     +
Sbjct: 259  FLCMHLRLAFRVKCLSFEPVNNSLDEDSSQEVISESEEDHSC-----GLVRTDTFLLTDN 313

Query: 1963 SVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFH------- 1805
            S  K  +   + A+  A R    R+  NSR +Q++R++ R  RARN S +G H       
Sbjct: 314  SGGKVSLHPSLIASKLAGRHSQYRNVLNSRGIQKRRSAFRRRRARNPSGVGIHKANGALV 373

Query: 1804 HDLFRAG----------YKHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVI 1655
             DL  +            K K R+  + +P +N+KE+  T V+  + MDS  CSAN+LVI
Sbjct: 374  SDLISSRKNGIPFSTVVSKDKLRRSLRLTPAANLKEVNPTAVQTSRVMDSSSCSANLLVI 433

Query: 1654 ESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGE 1475
            ESDRC+R  GA V LE   L +W LV K  GL+R  + A+  MRP + NR TH +IWTG+
Sbjct: 434  ESDRCYRMVGATVALEISDLKEWVLVVKKDGLTRCTHLAQKSMRPCSSNRITHDVIWTGD 493

Query: 1474 NGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIA 1295
            + WKLEF NR+DWLIFK+L+  C DRN+    ++ IPVPGV EV  Y+DS  +PF R  A
Sbjct: 494  DSWKLEFPNRQDWLIFKDLYKECYDRNVPAPISKAIPVPGVREVLGYEDSSSLPFSRQDA 553

Query: 1294 YITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAF 1115
            YI+  +DEV RAL ++ ANYDMD  DEEWL+K N+  F  +     ++  EKFE M+D  
Sbjct: 554  YISFNNDEVVRALTKRTANYDMDCEDEEWLKKFNSEFF-VESEEQEHLSEEKFELMIDTL 612

Query: 1114 EKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSS 935
            E+A Y +PD+  D   A +FC  L R++ V AVY YWMKK+KQ   ALLRVFQ    + +
Sbjct: 613  ERAFYSSPDDFVDGRAAVNFCIDLGRREVVEAVYGYWMKKQKQRRSALLRVFQLHQGKKA 672

Query: 934  QLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVE----PEQDAMQRVQKAKSLADISM 767
             L+ KP  RK+RSFKRQ  Q GRGK+     A A E     EQ+AM+ ++ AK+ A  S+
Sbjct: 673  SLIPKPGLRKRRSFKRQASQFGRGKKPSLLQAMAAEHDALEEQNAMRNLEAAKASAKSSV 732

Query: 766  ESVLLKRRRAQILMDNADLATY 701
            ES +LKRRRAQ+LM+NADLA Y
Sbjct: 733  ESAILKRRRAQMLMENADLAVY 754


>ref|XP_002311034.2| hypothetical protein POPTR_0008s02470g [Populus trichocarpa]
            gi|550332250|gb|EEE88401.2| hypothetical protein
            POPTR_0008s02470g [Populus trichocarpa]
          Length = 774

 Score =  473 bits (1216), Expect = e-130
 Identities = 312/792 (39%), Positives = 429/792 (54%), Gaps = 28/792 (3%)
 Frame = -3

Query: 2992 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGN-GDEWF-CIIDDT 2819
            MP V +RR+TRVF   SVVK  DGARVLRSG+R   + G+ K  R + GDE +  II +T
Sbjct: 1    MPSVGLRRTTRVF---SVVKGVDGARVLRSGRRLWPESGDGKLRRSSDGDELYQTIIKNT 57

Query: 2818 ADVPRCKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMH----GIVYNRK 2651
             +  + ++ +      E +    D  L                 R K      GIVY+RK
Sbjct: 58   NNHIKNQNSNSNLKYKENNGWTHDVKLKKDRGIVIAIAAPKKIKRVKSEKEKFGIVYSRK 117

Query: 2650 RRRLPGNKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXXXSGH 2471
            R+RL G K   +P D+ +GI F R+QRR+                             G 
Sbjct: 118  RKRLGGEKS-ENPEDKKFGIQFSRRQRRRE----------------------------GS 148

Query: 2470 ESSIDVVHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPLVRVF 2291
            ES   +V    L  +VE   +S    +CFL S+L       +SL+E A F+ S+P+  VF
Sbjct: 149  ESQESLVCTPQLVALVEGCSSSNGWLSCFLSSVLGHAMRVSLSLSELADFLLSDPISSVF 208

Query: 2290 AQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASP--FTFMRLHFSVV 2117
            A +G+HF+ +L       +D I  GIC+ FE RQ +PMFS+DF A P  F FM L   V 
Sbjct: 209  ASNGLHFVRDLP------SDRI--GICKFFETRQLLPMFSVDFSAIPSCFAFMHLSLFVK 260

Query: 2116 LRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGSNSMASWSSSVKKRKVDF 1937
             R L L  V    + G       ++++K    C  T+  F    ++   + S   R V  
Sbjct: 261  FRCLSLIPVN-NSVDGDDDDDEIMSESKGDQSCTSTKTDFTQKITVVPKTDSYGCRVVLH 319

Query: 1936 -IVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH-------DLFRAGY 1781
              VRA+    R+   R+  NSR +Q++R+SLR  R RNSS  G H        DL  +  
Sbjct: 320  PSVRASKLTGRNTQHRNGLNSRGIQKRRSSLRRGRPRNSSIGGLHKANGALVSDLISSRK 379

Query: 1780 ----------KHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESDRCFRE 1631
                      K K R+  Q SP +++KEL    V +K+ M+   CSANIL+ E+DRC+R 
Sbjct: 380  IGIPFSSVVSKEKLRRSIQSSPAASIKELNCAAVGVKKGMNLSSCSANILITETDRCYRI 439

Query: 1630 EGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGWKLEFS 1451
            EGA VMLE     +W LV K +GL+RY + A+ +MR    NRFTH +IW G++ WKLEF 
Sbjct: 440  EGATVMLEFTDSKEWVLVVKKNGLTRYSHLAQKIMRTCVSNRFTHDIIWNGDDNWKLEFP 499

Query: 1450 NRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYITMRDDE 1271
            NR+DW IFKEL+  C D N+    ++ IPVPGV  V    D G  PF RP AYI+  +DE
Sbjct: 500  NRQDWFIFKELYKECSDHNVPASVSKAIPVPGVRGVLDNGDCGSAPFSRPYAYISSNNDE 559

Query: 1270 VARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKAAYHTP 1091
            VARAL R  A+YDMDS DEEWL+K N       +   +++  + FE M+DA E++ +  P
Sbjct: 560  VARALSRSTASYDMDSEDEEWLKKYNKEF----LAESDHLSEDNFELMIDALERSYFCDP 615

Query: 1090 DEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLMQKPVF 911
            D+ +DES A  +C    R++   AVY YWMKKRKQ    LLRVFQ    + + L+ KPV 
Sbjct: 616  DDFTDESAAAKYCKDFGRRELAKAVYGYWMKKRKQKRSPLLRVFQGHQAKKTPLIPKPVL 675

Query: 910  RKKRSFKRQMRQSGRGKQQIFFHASAVEPE--QDAMQRVQKAKSLADISMESVLLKRRRA 737
            RK+RSFKR   Q GRGKQ     A A E +    A+++V++A++    S+E+ +LKR++A
Sbjct: 676  RKRRSFKRPPSQFGRGKQPSLLQAMAAEKDALHSALRKVEEARNSVKRSVEAAMLKRQKA 735

Query: 736  QILMDNADLATY 701
            Q+LM NADLAT+
Sbjct: 736  QLLMKNADLATF 747


>ref|XP_007010267.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 1 [Theobroma cacao] gi|508727180|gb|EOY19077.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 1 [Theobroma cacao]
          Length = 767

 Score =  470 bits (1209), Expect = e-129
 Identities = 307/795 (38%), Positives = 444/795 (55%), Gaps = 31/795 (3%)
 Frame = -3

Query: 2992 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVR--GNGDEWFCIIDDT 2819
            MP V MRR+TRVF    +VK ++ ARVLRSG+R   D GE KP R    GDE + ++   
Sbjct: 1    MPSVGMRRTTRVF---RMVKSSEVARVLRSGRRLWPDSGEAKPKRLANEGDENYNLMKKA 57

Query: 2818 ADVPRCKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMHGIVYNRKR--R 2645
                           P+ +++     ++                R K  G   N ++  R
Sbjct: 58   ---------------PKSEVNGVAAEVS---------------GRPKRLGNEENPRKQSR 87

Query: 2644 RLPGNKF-VSSPRDRMYGISFVRKQRRKR-SSGHSINELPRKDCQXXXXXXXXXXXXSGH 2471
            ++    F  S   D+M+GI + RK++R    +GH      + +              + +
Sbjct: 88   KMKAGAFNTSGSVDKMFGIVYTRKRKRNGVQNGHLSGNSGQGNYGKKISRRQAIENRNTN 147

Query: 2470 ESSIDVVHGAILDIVVESS-CNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPLVRV 2294
            E   DV    +   VVE+  CN    F+ FL+ +L +VK + + L+E AAF+ S+P+  V
Sbjct: 148  E---DVEEPKMFSFVVENGDCNGC--FSNFLILVLGYVKRAEVRLSELAAFLMSQPISSV 202

Query: 2293 FAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFSVVL 2114
            ++ +GV+F      R          GIC+ F A+  IP+FSLDF A P  F+ +H+S VL
Sbjct: 203  YSSNGVNFFWGPRNRT---------GICKFFGAKDSIPLFSLDFSAVPRYFLYMHYSKVL 253

Query: 2113 RSLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGS---NSMASWSSSVKKRKV 1943
            R       L R     +     ++D+++  PC+ + +    S   N+     +   K  +
Sbjct: 254  R-------LKRIQIVPVNSDEIVSDSEEDEPCVTSVVDVCKSTSGNAAVEIDNLGSKVVL 306

Query: 1942 DFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH-------DLF--- 1793
               VRA+    R+   R+  +SR++Q++R+SLR  RARN S +G H        DL    
Sbjct: 307  HPSVRASKLTGRNAQCRNGLSSRSIQKRRSSLRRRRARNPSIVGIHKANGALMSDLISSR 366

Query: 1792 RAGY-------KHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESDRCFR 1634
            R G        K+K R   + S  +NV ++ S++ +L QN+DS  CSANILVIE+DRC+R
Sbjct: 367  RNGIPFSSVVSKNKLRSSVRNSSVANVSDVGSSISDLMQNVDSSQCSANILVIEADRCYR 426

Query: 1633 EEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGWKLEF 1454
            EEGA V LE  A  +W LV K    +++  KA+  MRPS+ NRFTHA+IWTG++ WKLEF
Sbjct: 427  EEGAIVTLELSASREWLLVVKKGSSTKFACKADKFMRPSSCNRFTHAIIWTGDDNWKLEF 486

Query: 1453 SNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYITMRDD 1274
             NR+DW+IFK+L+  C +RN+   + + IPVPGV+EVP Y+D   VPF RP  YI++  D
Sbjct: 487  PNRQDWIIFKDLYKECSERNVPASTVKAIPVPGVHEVPGYEDRRSVPFCRPDFYISLDGD 546

Query: 1273 EVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKAAYHT 1094
            EV+RAL ++ ANYDMDS DEEWL+K NN  F G+ G   ++  + FE M+DAFEKA + +
Sbjct: 547  EVSRALAKRTANYDMDSEDEEWLKKFNNEFFSGN-GHCEHLSEDCFELMVDAFEKAYFCS 605

Query: 1093 PDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLMQKPV 914
            PD+ S+E+ A   C  L  +  V AV+ YW++KRKQ   ALLRVFQ    + + L+ KP 
Sbjct: 606  PDDYSNENAAAHLCLDLGTRGLVEAVHTYWLRKRKQRRSALLRVFQGHQVKKAPLVPKPF 665

Query: 913  FRKKRSFKRQMRQSGRGKQQIFFHASAVE----PEQDAMQRVQKAKSLADISMESVLLKR 746
             RK+RSFKRQ    GRGKQ     A A E     EQ+AM ++++A+  A  S+E  +LKR
Sbjct: 666  LRKRRSFKRQ-ASHGRGKQPYLLQALAAERDSMAEQNAMLKLEEARVSASRSVELAVLKR 724

Query: 745  RRAQILMDNADLATY 701
            +R Q+LM+NADLATY
Sbjct: 725  QRTQLLMENADLATY 739


>ref|XP_006360530.1| PREDICTED: uncharacterized protein LOC102597035 isoform X1 [Solanum
            tuberosum]
          Length = 781

 Score =  459 bits (1182), Expect = e-126
 Identities = 299/822 (36%), Positives = 433/822 (52%), Gaps = 37/822 (4%)
 Frame = -3

Query: 2977 MRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGNGDEWFCIIDDT-----AD 2813
            MRR+TR+F          G RVLRSG+R S   GE K  + +GDEW  ++D+      AD
Sbjct: 7    MRRTTRIF----------GTRVLRSGRRLSTP-GEAKRAK-HGDEWIGLLDNVGGGGAAD 54

Query: 2812 VPRCKSIDWYEVD-------PERDIDVTDFNLNLAXXXXXXXXXXXXXSR--DKMHGIVY 2660
              RCK   W + +        E DIDV   +++               +   D+M G+VY
Sbjct: 55   ATRCKKNGWLKKEVALNLEADEMDIDVDSKSMDELESPEAPVVETISPNSNIDRMWGLVY 114

Query: 2659 NRKRRRLPGNKFVSSPRD-RMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXX 2483
             RKR+R+  +       D R YG  FVRK++ + +    +                    
Sbjct: 115  TRKRKRVADSVKGKVLTDVRRYGKQFVRKKKVRSAYAKDL-------------------- 154

Query: 2482 XSGHESSIDVVHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPL 2303
              G      V  G    ++V +S  S +  +C L  IL +++ S +SL +   F+ S+PL
Sbjct: 155  --GKSEDGQVSSGI---VIVNTSYGSGYWVSCLLNCILMYLRRSTVSLQQIFGFINSKPL 209

Query: 2302 VRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFS 2123
              V +  G+    + + R       I  G C I   R  +P+F+LDF   P  F+ LH S
Sbjct: 210  RDVNSLQGILLFKDQTPR------KIKTGACVISGVRCSVPVFTLDFSTVPCFFLYLHSS 263

Query: 2122 VVLRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCI-PTEMGFPGSNSMASWS----SSV 1958
            ++LR + +   L+   +  I +   +T++K+ + C+ P        N+ +        + 
Sbjct: 264  LLLRFVPMSYALVMQPTVAIDEV-TVTNDKEIVSCLSPVTQSELDVNTQSGLDVVAPGAY 322

Query: 1957 KKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSS-------------S 1817
              +K++ +       + +       NSRN+Q++R+SLRS R R+SS              
Sbjct: 323  DSKKIEVVNPTVGLPKLAARHLQPRNSRNIQKRRSSLRSMRGRHSSFGTQNATGVLTSDR 382

Query: 1816 MGFHHDLFRAGYK---HKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESD 1646
            + F  D  R   +   ++ R   QK+   +VKELKS LV L QN++S  CSAN+LVIE D
Sbjct: 383  LRFRRDGLRFSSRTPHYELRSSRQKTSTPSVKELKSALVGLTQNIESTSCSANVLVIEPD 442

Query: 1645 RCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGW 1466
            +C+REEGA + +E  A   W L  K  G+ R+    E VMRP + NR TH +IW G+NGW
Sbjct: 443  KCYREEGAVIGMELSAAKQWILAVKIGGVRRFNLTTEKVMRPCSSNRVTHDIIWVGDNGW 502

Query: 1465 KLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYIT 1286
            KLEF  R+DWLIFKEL+  C DRN+Q  +   IPVPGV EV  Y +S    F RP++YIT
Sbjct: 503  KLEFPIRQDWLIFKELYKGCSDRNVQPPAVSIIPVPGVREVSGYAESNPPEFARPVSYIT 562

Query: 1285 MRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKA 1106
            ++DDE+ARAL R  ANYDMD  DEEWL   N    D      +++  + FE ++D FEK 
Sbjct: 563  VKDDELARALARSTANYDMDGDDEEWLRNFN----DQPSLENDHLSADSFELLIDNFEKG 618

Query: 1105 AYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLM 926
             Y  PD+ SDE  A S C + E+K+ V AVY YW+KKRKQN  +L+++FQC  PR +Q++
Sbjct: 619  FYCNPDDYSDEKAAVSSCPNKEKKEIVEAVYNYWLKKRKQNRSSLIKIFQCYQPRRTQVI 678

Query: 925  QKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVEPE-QDAMQRVQKAKSLADISMESVLLK 749
             K +FRKKRSFKRQ  ++GRGK + F  A   E E Q+A+ +V++AK+ A+ S +  +  
Sbjct: 679  PKSIFRKKRSFKRQGSKAGRGKHRPFLPAVVAEKEQQNAVLKVKEAKAAANKSEDLAVRM 738

Query: 748  RRRAQILMDNADLATYXXXXXXXXXXXXXXXESPEAYVPFIL 623
            R++AQ LM+NADLATY               +S EA  P  L
Sbjct: 739  RQKAQQLMENADLATYKAMMALKIAEAAKIAKSKEAVGPIFL 780


>ref|XP_007010268.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 2 [Theobroma cacao] gi|508727181|gb|EOY19078.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 2 [Theobroma cacao]
          Length = 784

 Score =  459 bits (1181), Expect = e-126
 Identities = 307/812 (37%), Positives = 444/812 (54%), Gaps = 48/812 (5%)
 Frame = -3

Query: 2992 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVR--GNGDEWFCIIDDT 2819
            MP V MRR+TRVF    +VK ++ ARVLRSG+R   D GE KP R    GDE + ++   
Sbjct: 1    MPSVGMRRTTRVF---RMVKSSEVARVLRSGRRLWPDSGEAKPKRLANEGDENYNLMKKA 57

Query: 2818 ADVPRCKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMHGIVYNRKR--R 2645
                           P+ +++     ++                R K  G   N ++  R
Sbjct: 58   ---------------PKSEVNGVAAEVS---------------GRPKRLGNEENPRKQSR 87

Query: 2644 RLPGNKF-VSSPRDRMYGISFVRKQRRKR-SSGHSINELPRKDCQXXXXXXXXXXXXSGH 2471
            ++    F  S   D+M+GI + RK++R    +GH      + +              + +
Sbjct: 88   KMKAGAFNTSGSVDKMFGIVYTRKRKRNGVQNGHLSGNSGQGNYGKKISRRQAIENRNTN 147

Query: 2470 ESSIDVVHGAILDIVVESS-CNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPLVRV 2294
            E   DV    +   VVE+  CN    F+ FL+ +L +VK + + L+E AAF+ S+P+  V
Sbjct: 148  E---DVEEPKMFSFVVENGDCNGC--FSNFLILVLGYVKRAEVRLSELAAFLMSQPISSV 202

Query: 2293 FAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFSVVL 2114
            ++ +GV+F      R          GIC+ F A+  IP+FSLDF A P  F+ +H+S VL
Sbjct: 203  YSSNGVNFFWGPRNRT---------GICKFFGAKDSIPLFSLDFSAVPRYFLYMHYSKVL 253

Query: 2113 RSLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGS---NSMASWSSSVKKRKV 1943
            R       L R     +     ++D+++  PC+ + +    S   N+     +   K  +
Sbjct: 254  R-------LKRIQIVPVNSDEIVSDSEEDEPCVTSVVDVCKSTSGNAAVEIDNLGSKVVL 306

Query: 1942 DFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH-------DLF--- 1793
               VRA+    R+   R+  +SR++Q++R+SLR  RARN S +G H        DL    
Sbjct: 307  HPSVRASKLTGRNAQCRNGLSSRSIQKRRSSLRRRRARNPSIVGIHKANGALMSDLISSR 366

Query: 1792 RAGY-------KHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESDRCFR 1634
            R G        K+K R   + S  +NV ++ S++ +L QN+DS  CSANILVIE+DRC+R
Sbjct: 367  RNGIPFSSVVSKNKLRSSVRNSSVANVSDVGSSISDLMQNVDSSQCSANILVIEADRCYR 426

Query: 1633 EEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGWKLEF 1454
            EEGA V LE  A  +W LV K    +++  KA+  MRPS+ NRFTHA+IWTG++ WKLEF
Sbjct: 427  EEGAIVTLELSASREWLLVVKKGSSTKFACKADKFMRPSSCNRFTHAIIWTGDDNWKLEF 486

Query: 1453 SNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYITMRDD 1274
             NR+DW+IFK+L+  C +RN+   + + IPVPGV+EVP Y+D   VPF RP  YI++  D
Sbjct: 487  PNRQDWIIFKDLYKECSERNVPASTVKAIPVPGVHEVPGYEDRRSVPFCRPDFYISLDGD 546

Query: 1273 EVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKAAYHT 1094
            EV+RAL ++ ANYDMDS DEEWL+K NN  F G+ G   ++  + FE M+DAFEKA + +
Sbjct: 547  EVSRALAKRTANYDMDSEDEEWLKKFNNEFFSGN-GHCEHLSEDCFELMVDAFEKAYFCS 605

Query: 1093 PDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLMQKPV 914
            PD+ S+E+ A   C  L  +  V AV+ YW++KRKQ   ALLRVFQ    + + L+ KP 
Sbjct: 606  PDDYSNENAAAHLCLDLGTRGLVEAVHTYWLRKRKQRRSALLRVFQGHQVKKAPLVPKPF 665

Query: 913  FRKKRSFKRQMRQSGRGKQQIFFH-----------------ASAVE----PEQDAMQRVQ 797
             RK+RSFKRQ    GRGKQ                      A A E     EQ+AM +++
Sbjct: 666  LRKRRSFKRQ-ASHGRGKQPYLLQGPRFRYNAETSIICNCAALAAERDSMAEQNAMLKLE 724

Query: 796  KAKSLADISMESVLLKRRRAQILMDNADLATY 701
            +A+  A  S+E  +LKR+R Q+LM+NADLATY
Sbjct: 725  EARVSASRSVELAVLKRQRTQLLMENADLATY 756


>ref|XP_006360531.1| PREDICTED: uncharacterized protein LOC102597035 isoform X2 [Solanum
            tuberosum]
          Length = 779

 Score =  459 bits (1180), Expect = e-126
 Identities = 300/822 (36%), Positives = 434/822 (52%), Gaps = 37/822 (4%)
 Frame = -3

Query: 2977 MRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGNGDEWFCIIDDT-----AD 2813
            MRR+TR+F          G RVLRSG+R S   GE K  + +GDEW  ++D+      AD
Sbjct: 7    MRRTTRIF----------GTRVLRSGRRLSTP-GEAKRAK-HGDEWIGLLDNVGGGGAAD 54

Query: 2812 VPRCKSIDWYEVD-------PERDIDVTDFNLNLAXXXXXXXXXXXXXSR--DKMHGIVY 2660
              RCK   W + +        E DIDV   +++               +   D+M G+VY
Sbjct: 55   ATRCKKNGWLKKEVALNLEADEMDIDVDSKSMDELESPEAPVVETISPNSNIDRMWGLVY 114

Query: 2659 NRKRRRLPGNKFVSSPRD-RMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXX 2483
             RKR+R+  +       D R YG  FVRK++ + +    +                    
Sbjct: 115  TRKRKRVADSVKGKVLTDVRRYGKQFVRKKKVRSAYAKDL-------------------- 154

Query: 2482 XSGHESSIDVVHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPL 2303
              G      V  G    ++V +S  S +  +C L  IL +++ S +SL +   F+ S+PL
Sbjct: 155  --GKSEDGQVSSGI---VIVNTSYGSGYWVSCLLNCILMYLRRSTVSLQQIFGFINSKPL 209

Query: 2302 VRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFS 2123
              V +  G+     L ++   K   I  G C I   R  +P+F+LDF   P  F+ LH S
Sbjct: 210  RDVNSLQGI-----LLFKTPRK---IKTGACVISGVRCSVPVFTLDFSTVPCFFLYLHSS 261

Query: 2122 VVLRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCI-PTEMGFPGSNSMASWS----SSV 1958
            ++LR + +   L+   +  I +   +T++K+ + C+ P        N+ +        + 
Sbjct: 262  LLLRFVPMSYALVMQPTVAIDEV-TVTNDKEIVSCLSPVTQSELDVNTQSGLDVVAPGAY 320

Query: 1957 KKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSS-------------S 1817
              +K++ +       + +       NSRN+Q++R+SLRS R R+SS              
Sbjct: 321  DSKKIEVVNPTVGLPKLAARHLQPRNSRNIQKRRSSLRSMRGRHSSFGTQNATGVLTSDR 380

Query: 1816 MGFHHDLFRAGYK---HKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESD 1646
            + F  D  R   +   ++ R   QK+   +VKELKS LV L QN++S  CSAN+LVIE D
Sbjct: 381  LRFRRDGLRFSSRTPHYELRSSRQKTSTPSVKELKSALVGLTQNIESTSCSANVLVIEPD 440

Query: 1645 RCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGW 1466
            +C+REEGA + +E  A   W L  K  G+ R+    E VMRP + NR TH +IW G+NGW
Sbjct: 441  KCYREEGAVIGMELSAAKQWILAVKIGGVRRFNLTTEKVMRPCSSNRVTHDIIWVGDNGW 500

Query: 1465 KLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYIT 1286
            KLEF  R+DWLIFKEL+  C DRN+Q  +   IPVPGV EV  Y +S    F RP++YIT
Sbjct: 501  KLEFPIRQDWLIFKELYKGCSDRNVQPPAVSIIPVPGVREVSGYAESNPPEFARPVSYIT 560

Query: 1285 MRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKA 1106
            ++DDE+ARAL R  ANYDMD  DEEWL   N    D      +++  + FE ++D FEK 
Sbjct: 561  VKDDELARALARSTANYDMDGDDEEWLRNFN----DQPSLENDHLSADSFELLIDNFEKG 616

Query: 1105 AYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLM 926
             Y  PD+ SDE  A S C + E+K+ V AVY YW+KKRKQN  +L+++FQC  PR +Q++
Sbjct: 617  FYCNPDDYSDEKAAVSSCPNKEKKEIVEAVYNYWLKKRKQNRSSLIKIFQCYQPRRTQVI 676

Query: 925  QKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVEPE-QDAMQRVQKAKSLADISMESVLLK 749
             K +FRKKRSFKRQ  ++GRGK + F  A   E E Q+A+ +V++AK+ A+ S +  +  
Sbjct: 677  PKSIFRKKRSFKRQGSKAGRGKHRPFLPAVVAEKEQQNAVLKVKEAKAAANKSEDLAVRM 736

Query: 748  RRRAQILMDNADLATYXXXXXXXXXXXXXXXESPEAYVPFIL 623
            R++AQ LM+NADLATY               +S EA  P  L
Sbjct: 737  RQKAQQLMENADLATYKAMMALKIAEAAKIAKSKEAVGPIFL 778


>ref|XP_007221419.1| hypothetical protein PRUPE_ppa001422mg [Prunus persica]
            gi|462418131|gb|EMJ22618.1| hypothetical protein
            PRUPE_ppa001422mg [Prunus persica]
          Length = 832

 Score =  456 bits (1174), Expect = e-125
 Identities = 315/835 (37%), Positives = 433/835 (51%), Gaps = 42/835 (5%)
 Frame = -3

Query: 2998 TLMPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRG-NGDE-WFCIID 2825
            T MP VEMRR+TRVF    V    DGARVLRSG+R   +  E K  R  NGDE W  ++ 
Sbjct: 52   TEMPSVEMRRTTRVFGMGMVKGGVDGARVLRSGRRLWPESSESKLERARNGDEDWLKLMK 111

Query: 2824 DTA--DVPRCKSIDWYEVD----PERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMHGIV 2663
              A   V       W   +    P R+  V     +L                 K +GIV
Sbjct: 112  SHAGESVVGLNHKKWAGANQVGSPRRNTPV--LKTSLVKKPQSNELLADLLKEHKRYGIV 169

Query: 2662 YNRKRRRLPGNKFVSSPR-----DRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXX 2498
            Y RKR+R   +   +  +     DRMYG  F R+QR K+S      EL            
Sbjct: 170  YTRKRKRASASFLGNVEKENGSDDRMYGRRFARRQRMKKSK-----EL------------ 212

Query: 2497 XXXXXXSGHESSIDVVHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFM 2318
                     +S    V   +L   VESS    +    FL S+L ++  + + LTEF+ F+
Sbjct: 213  ---------DSHPGFVCPEVLCFSVESSWAQGYWAGRFLYSVLVYMTRASLGLTEFSEFL 263

Query: 2317 CSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFM 2138
              EP+  +FA +G+ F  + S            G+C++F A QFIP+FS+DF A P  FM
Sbjct: 264  ALEPIGSIFASYGIQFSRDRSCTRR-------SGVCKLFGAEQFIPLFSVDFSAVPGCFM 316

Query: 2137 RLHFSVVLR---SLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGSNSMASWS 1967
             +  S+ LR    L + +++  + +G      +  D+ + +  I         N  A  S
Sbjct: 317  FMQTSMHLRFRCHLTVNNLIDGHENGEFIDQGDDDDDGEKVDFI--------ENRHALHS 368

Query: 1966 SSVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH----- 1802
            S          VR    A RS   R+   SR +Q++R+SLR  R+RN S +         
Sbjct: 369  S----------VRVPKLACRSTQYRNGLTSRGIQKRRSSLRRRRSRNPSLVSLRKPNGAL 418

Query: 1801 -----DLFRAGY-------KHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILV 1658
                  + + G        KH  RK    S   N+K    T+   K+++DS  CSANIL 
Sbjct: 419  VSELISIRKNGLPFSSVESKHMLRKSVSLSLAGNLKAESLTIEGSKRDLDSTSCSANILF 478

Query: 1657 IESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWT- 1481
             E D+C+RE+GA VMLE  +  +W LV K +GL+RY +KAE VMRP + NR T A+IW+ 
Sbjct: 479  TELDKCYREDGATVMLEMSSSGEWLLVVKKNGLTRYTHKAEKVMRPCSKNRITQAIIWSA 538

Query: 1480 ---GENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPF 1310
               G+N WKLEF NR DW IFK+L+  C DR +   + + IPVPGV EVP Y DS    F
Sbjct: 539  DSNGDNNWKLEFPNRCDWAIFKDLYKECSDRVVPAPAIKFIPVPGVREVPGYADSHSTLF 598

Query: 1309 VRPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEE 1130
             RP +YI + DDEV+RA+ ++ ANYDMDS DEEWL+K N+  F  +  + +++  + FE 
Sbjct: 599  DRPESYIYLNDDEVSRAMAKRTANYDMDSDDEEWLKKFNSDFF-AENELHDHVSEDNFEL 657

Query: 1129 MMDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNH-VALLRVFQC 953
            M+DAFEKA Y  P + +DE+ A + C  + R++ V A+Y YWM KRKQ    +LLRVFQ 
Sbjct: 658  MVDAFEKAFYCRPYDFADENAAANLCLDMGRREVVEAIYSYWMNKRKQKRSSSLLRVFQG 717

Query: 952  PPPRSSQLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVE----PEQDAMQRVQKAKS 785
               + + L  KPV RK+RSFKRQ  Q GRGKQ  F  A A E     EQ+A+ +V++AK+
Sbjct: 718  HQSKRALLDPKPVLRKRRSFKRQPSQFGRGKQPSFLQAMAAEQDALQEQNAIHKVEEAKA 777

Query: 784  LADISMESVLLKRRRAQILMDNADLATYXXXXXXXXXXXXXXXESPEAYVPFILD 620
             AD S+E  + KR+RAQ+LM NADL TY                SP+A   ++LD
Sbjct: 778  EADRSVELAIRKRKRAQLLMQNADLVTYKATMAFRIAEAAQVLGSPDAAAAYVLD 832


>ref|XP_004243418.1| PREDICTED: uncharacterized protein LOC101263728 [Solanum
            lycopersicum]
          Length = 790

 Score =  452 bits (1162), Expect = e-124
 Identities = 293/832 (35%), Positives = 438/832 (52%), Gaps = 47/832 (5%)
 Frame = -3

Query: 2977 MRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGNGDEWFCIIDDT------- 2819
            MRR+TR+F          G RVLRSG+R S    E K  + +GDEW  ++D+        
Sbjct: 7    MRRTTRIF----------GTRVLRSGRRLSTSF-EAKRAK-HGDEWIGLLDNVGGGGGAA 54

Query: 2818 ADVPRCKSIDWYEVDPERDIDVTDFNLNL---------AXXXXXXXXXXXXXSRDKMHGI 2666
            AD  RCK   W + +   +++  + N+++                         D+M G+
Sbjct: 55   ADATRCKKKGWLKKEVALNLEADEMNIDVDSKSMDEQETVEAPVVDTVSPKSYIDRMWGL 114

Query: 2665 VYNRKRRRLPGNKFVSSPRDRM------YGISFVRKQRRKRSSGHSINELPRKDCQXXXX 2504
            VY RKR+R+   +   S R ++      YG  F+RK  +K  S ++ +    +D Q    
Sbjct: 115  VYTRKRKRVDLKRH-DSVRGKVLTDVMRYGKQFIRK--KKHRSAYAKDSDKSEDGQF--- 168

Query: 2503 XXXXXXXXSGHESSIDVVHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAA 2324
                         S D+V       +V +S  S +  +C L  +L +++ S +SL +   
Sbjct: 169  -------------SSDIV-------IVNTSYGSGYWVSCLLNCMLMYLRRSTVSLQQIFG 208

Query: 2323 FMCSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFT 2144
            F+ S+PL  V++  G+  L + + R       I  G C I   R  +P+F+LDF   P  
Sbjct: 209  FINSKPLRDVWSLQGILLLKDQTSR------KIKTGACVISGVRCSVPVFTLDFSTVPCF 262

Query: 2143 FMRLHFSVVLRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCI-PTEMGFPGSNSMASWS 1967
            F+ LH S++LR + +   L+   +  I +   +T++ + + C+ P  +     N+ +   
Sbjct: 263  FLYLHSSLLLRFVPMSYALVMQPTVAIDEVT-VTNDMELVSCLTPVTLSELDVNTQSGHD 321

Query: 1966 ----SSVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSS------- 1820
                 +   +K++ +       + +       NSRN+Q++R+SLRS R R+SS       
Sbjct: 322  VVAPGAYDSKKIEVVNTTVGLPKSTARHLQPRNSRNIQKRRSSLRSMRGRHSSFGTQNAS 381

Query: 1819 ------SMGFHHDLFRAGYK---HKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSAN 1667
                   + F  D  R   +   ++ R   QK+   +VKELKS LV L QN+++  CSAN
Sbjct: 382  GVLTSDRLRFRRDGLRFSSRTPHYELRSSRQKTSMPSVKELKSALVRLTQNIETASCSAN 441

Query: 1666 ILVIESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMI 1487
            ILV E D+C+REEGA + +E  A   W L  K  G+ R+    E VMRP + NR TH +I
Sbjct: 442  ILVTEPDKCYREEGAVIGMELSAAKQWILAVKIGGVRRFNLTTEKVMRPCSSNRVTHDLI 501

Query: 1486 WTGENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFV 1307
            W G++GWKLEF +R+DWLIFKEL+  C DRN+Q  +   IPVPGV EV  Y +S    F 
Sbjct: 502  WVGDSGWKLEFPDRQDWLIFKELYKGCSDRNVQPPAVSIIPVPGVSEVSGYAESNPPFFA 561

Query: 1306 RPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEM 1127
            RP++YIT++DDE+ARAL R  ANYDMD  DEEWL   N    D      +++  + FE +
Sbjct: 562  RPVSYITVKDDELARALARSTANYDMDGDDEEWLRNFN----DQPSLENDHLSTDSFELL 617

Query: 1126 MDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPP 947
            +D FEK  Y  PD+ SDE  A S C + E+K+ V AVY YW KKRKQN  +L+++FQC  
Sbjct: 618  IDHFEKGFYCNPDDYSDEKAAVSSCPNKEKKEIVEAVYSYWSKKRKQNRSSLIKIFQCYQ 677

Query: 946  PRSSQLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVE----PEQDAMQRVQKAKSLA 779
            PR +Q++ K +FRKKRSFKRQ  ++GRGK + F  A   E     +Q+A+ +V++AK+ A
Sbjct: 678  PRRTQVIPKSIFRKKRSFKRQGSKAGRGKHRPFLPAVVAENDAVQQQNAVLKVKEAKAAA 737

Query: 778  DISMESVLLKRRRAQILMDNADLATYXXXXXXXXXXXXXXXESPEAYVPFIL 623
            + S +  +  R++AQ LM+NADLATY               +S EA  P  L
Sbjct: 738  NKSEDLAVRMRQKAQQLMENADLATYKAMMALRIAEAAKIAKSKEAVAPIFL 789


>gb|EXC25392.1| hypothetical protein L484_016774 [Morus notabilis]
          Length = 795

 Score =  437 bits (1124), Expect = e-119
 Identities = 302/819 (36%), Positives = 435/819 (53%), Gaps = 55/819 (6%)
 Frame = -3

Query: 2992 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGNGD--EWFCI---- 2831
            MP V MRR+TRVF    VVK  DGARVLRSG+R   D GE K +R + D  +WF I    
Sbjct: 1    MPSVGMRRTTRVF---GVVKGVDGARVLRSGRRLWPDSGEVK-LRRHSDVYDWFKIGKGD 56

Query: 2830 ----------IDDTADVPRCKSIDWYEVD-PERDIDVTDFNLNLAXXXXXXXXXXXXXSR 2684
                        +T   P+ K+    E+  P+ + +     ++LA               
Sbjct: 57   GGLGYDSNGWAHNTNSKPK-KTPPVAEIKAPKPEDNNRGVGVDLAHGGRRP--------- 106

Query: 2683 DKMHGIVYNRKRRRLP----GNKFVSSPR-----DRMYGISFVRKQRRKRSSGHSINELP 2531
            D+M G+VY+RKR+ L     GN  V+S        + YG  FVR+QRRK +SG S     
Sbjct: 107  DRMFGLVYSRKRKNLAVRSSGNASVNSETLGGSVGKRYGRRFVRRQRRKLNSGESFAVAD 166

Query: 2530 RKDCQXXXXXXXXXXXXSGHESSIDVVHGAILDIVVESSCNSAHRFTCFLVSILKFVKSS 2351
              D                  S ++     ++ +V  SS +        L SIL ++  +
Sbjct: 167  DSD------------------SRLEFTPSEVVSVVFGSSMDRNFYAVGVLCSILVYLTRA 208

Query: 2350 RISLTEFAAFMCSEPLVRVFAQHGVH-FLANLSYRISWKNDLISPGICQIFEARQFIPMF 2174
            R+ LT+  AF+ SEP+ RV +  G++ FL + S +            C++F A +F+P+F
Sbjct: 209  RLRLTDLFAFLVSEPISRVNSSCGINIFLDHPSIKRF--------ASCKLFGAPEFVPLF 260

Query: 2173 SLDFCASPFTFMRLHFSVVLRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFP 1994
             +DF A P  FM +H  +  R    P      L+G  +    I+D+++       ++  P
Sbjct: 261  CVDFSAIPLCFMHMHSCMFFRYKRQPS-----LAGNNEIDEMISDDEE------DQLSSP 309

Query: 1993 GSNSM-------ASWSSSVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTR 1835
            G +++       A  + S  +   +   +A+ FA RS   R+   SR +Q++R+SLR  +
Sbjct: 310  GKDALESKPLLSAEANHSENRLASNPSFKASKFACRSNQYRNGLISRGIQKRRSSLRRRK 369

Query: 1834 ARNSSSMGFHH-------DL--FRAGY-------KHKKRKLAQKSPCSNVKELKSTLVEL 1703
            ARN S  G          DL  FR           +K R+  + +    +KE+ ST+ + 
Sbjct: 370  ARNPSLCGVQKPNNALLSDLVSFRKNSVSLSLTSNNKLRRSLRSNSARKLKEVSSTVADS 429

Query: 1702 KQNMDSVCCSANILVIESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMR 1523
             Q+MDS  C AN+L+IE ++C+RE G  ++LE   L  W +  K  G +++ +KAE VMR
Sbjct: 430  TQDMDSTSCCANVLIIEPEKCYREGGFSIVLESSPLGGWLIAVKKDGSTKFTHKAEKVMR 489

Query: 1522 PSTPNRFTHAMIWTGENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEV 1343
            P + NRFTH ++WT ++GWKLEF NR+DWLIFK+L+  C DRN+     + +P+PGV EV
Sbjct: 490  PCSSNRFTHDIMWTADDGWKLEFPNRKDWLIFKDLYQECSDRNMLAPGVKVVPIPGVNEV 549

Query: 1342 PCYDDSGYVPFVRPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGV 1163
                DS    F RP +YI+++DDE+ RAL RK +NYDMD  DEEWL KLNN  F  +   
Sbjct: 550  SQKGDSHCTLFRRPDSYISVKDDELCRALKRKTSNYDMDLEDEEWLNKLNN-EFSVENET 608

Query: 1162 PNYILPEKFEEMMDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYW-MKKRKQ 986
               +  +KFE M+DAFEKA + +P + SD    T  CS L     + A+Y YW MKKRKQ
Sbjct: 609  YECVSDDKFESMIDAFEKAFFCSPYDNSDVKSLTDLCSHLGGDKAIEAIYVYWTMKKRKQ 668

Query: 985  NHVALLRVFQCPPPRSSQLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVE----PEQ 818
               +L+R+FQ    R + L+ KP  RKKRSF RQ  Q GRGKQ  F  A   E     EQ
Sbjct: 669  KRPSLIRIFQLYQGRRT-LVPKPAIRKKRSFNRQPSQVGRGKQSSFLQAMVAERDAAEEQ 727

Query: 817  DAMQRVQKAKSLADISMESVLLKRRRAQILMDNADLATY 701
            +AM RV++AK+ A+  +E  +  R+RAQ+LM+NADLATY
Sbjct: 728  NAMHRVEEAKASANRCVELAVESRQRAQLLMNNADLATY 766


>ref|XP_004166800.1| PREDICTED: uncharacterized LOC101207239 [Cucumis sativus]
          Length = 819

 Score =  418 bits (1074), Expect = e-113
 Identities = 293/833 (35%), Positives = 422/833 (50%), Gaps = 56/833 (6%)
 Frame = -3

Query: 2971 RSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRG-NGDEWFCIIDDTADVPRC-- 2801
            R TRVF    +VK +DGARVLRSG+R   + GE K  +  +  +W+ IID   +      
Sbjct: 6    RRTRVF---GLVKGSDGARVLRSGRRLWPESGEVKLKKSKDASDWYPIIDGRGNGGGSGH 62

Query: 2800 -----KSIDWYEVDPERDIDVT---DFNLNLAXXXXXXXXXXXXXSRDKMHGI------V 2663
                 K      V P+R + V    D +  +              + DK  G+      V
Sbjct: 63   GRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPVKVFPRIGNDDKSSGVDRMFGKV 122

Query: 2662 YNRKRRR-----------LPGNKFVSSPRDRMYGISFVRKQR-RKRSSGHSINELPRKDC 2519
            Y+RKR+R           +  +  +S   DRM+G+ F+R+QR RK    H  +    +  
Sbjct: 123  YSRKRKRGRLEDGEVFDEMESDNVLSG--DRMFGLRFIRRQRSRKTDVEHWESTAGGRTS 180

Query: 2518 QXXXXXXXXXXXXSGHESSIDVVHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISL 2339
                           H   I       L I   SS +    F+ F++++L+  KS  +S+
Sbjct: 181  NLHF-----------HRQRILHPRDCALTIFAGSSVDGGC-FSDFILTVLRHFKSPGLSV 228

Query: 2338 TEFAAFMCSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGIC---QIFEARQFIPMFSL 2168
             +F+AF+ S P+  VFA  G+ FL                G C    IF +RQ IPMF L
Sbjct: 229  AKFSAFLLSNPINEVFALKGMRFLQGYP----------PTGCCGMFAIFGSRQSIPMFHL 278

Query: 2167 DFCASPFTFMRLHFSVVLRSLYLPDVLIRYLSGL-IKKAREITDNKKCLPCIPTEMGFPG 1991
            DF A P  FM L+  + LR   +   L+   + L +  + +  ++      +P+ +    
Sbjct: 279  DFSAIPLPFMFLYSEMFLRVTRIQARLVYNNNQLDVDISSDSEEDSVEELHVPSPVSSLE 338

Query: 1990 SNSMASWSSSVKKRKVDF-IVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSM 1814
               MA      K R V    VRAT    R++  R+  +SR ++++R+SLR  R R+ S  
Sbjct: 339  RKPMAFLFDRPKTRSVSHPSVRATRLGTRTMQYRNGFSSRGIRKRRSSLRIRRPRSHSLS 398

Query: 1813 GFHHDL-------------FRAGYKHKKRKL-AQKSPCSNVKELKSTLVELKQNMDSVCC 1676
                 +             F +G    + K  A +     ++E  ST +    ++DS CC
Sbjct: 399  AMQKSIGPLAVDDVKLGVSFPSGASCNRHKSSAVRDSAGRIRETNSTALRSAMDVDSSCC 458

Query: 1675 SANILVIESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTH 1496
             ANIL++E+D+C REEGA ++LE  A  +W LV K  G +RY +KAE VM+PS+ NRFTH
Sbjct: 459  KANILIVEADKCLREEGANIVLEFSASCEWLLVVKKDGSTRYTHKAERVMKPSSCNRFTH 518

Query: 1495 AMIWTGENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYV 1316
            A++W+ +NGWKLEF NRRDW IFK+L+  C DRN+  + A+ IPVP V EVP Y DS   
Sbjct: 519  AILWSIDNGWKLEFPNRRDWFIFKDLYKECSDRNIPCLIAKAIPVPRVSEVPDYVDSSGA 578

Query: 1315 PFVRPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKF 1136
             F RP  YI++ DDEV RA+ +  ANYDMDS DEEWL + N+G+   D         + F
Sbjct: 579  SFQRPDTYISVNDDEVCRAMTKSTANYDMDSEDEEWLVEFNDGLIATDKH-QECFSEDNF 637

Query: 1135 EEMMDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQ 956
            E M+DAFEK  Y  PD  SDE      C+ L     V ++Y YW KKRKQ   +L+RVFQ
Sbjct: 638  ESMVDAFEKGFYCNPDAFSDEKVPADICTPLASPSIVESLYTYWTKKRKQRKSSLIRVFQ 697

Query: 955  C-PPPRSSQLMQKPVFRKKRSFKRQMRQSGRGK-------QQIFFHASAVEPEQDAMQRV 800
                 R   L+ KP+ R+KRS KRQ  QSG G+       + I +   AVE +Q+AMQ+ 
Sbjct: 698  AYQSKRKPPLVPKPMMRRKRSLKRQPSQSGSGRTPQPSILEAILWRRDAVE-DQNAMQKY 756

Query: 799  QKAKSLADISMESVLLKRRRAQILMDNADLATYXXXXXXXXXXXXXXXESPEA 641
            +++K+  +  +E+ + KR+RAQ+L++NADLA Y               +SPEA
Sbjct: 757  EESKAAVEKCIENAVNKRQRAQLLLENADLAVYKAMSALRIAEAIETSDSPEA 809


>ref|XP_004140897.1| PREDICTED: uncharacterized protein LOC101207239 [Cucumis sativus]
          Length = 819

 Score =  418 bits (1074), Expect = e-113
 Identities = 293/833 (35%), Positives = 422/833 (50%), Gaps = 56/833 (6%)
 Frame = -3

Query: 2971 RSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRG-NGDEWFCIIDDTADVPRC-- 2801
            R TRVF    +VK +DGARVLRSG+R   + GE K  +  +  +W+ IID   +      
Sbjct: 6    RRTRVF---GLVKGSDGARVLRSGRRLWPESGEVKLKKSKDASDWYPIIDGRGNGGGSGH 62

Query: 2800 -----KSIDWYEVDPERDIDVT---DFNLNLAXXXXXXXXXXXXXSRDKMHGI------V 2663
                 K      V P+R + V    D +  +              + DK  G+      V
Sbjct: 63   GRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPVKVFPRIGNDDKSSGVDRMFGKV 122

Query: 2662 YNRKRRR-----------LPGNKFVSSPRDRMYGISFVRKQR-RKRSSGHSINELPRKDC 2519
            Y+RKR+R           +  +  +S   DRM+G+ F+R+QR RK    H  +    +  
Sbjct: 123  YSRKRKRGRLEDGEVFDEMESDNVLSG--DRMFGLRFIRRQRSRKTDVEHWESTAGGRTS 180

Query: 2518 QXXXXXXXXXXXXSGHESSIDVVHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISL 2339
                           H   I       L I   SS +    F+ F++++L+  KS  +S+
Sbjct: 181  NLHF-----------HRQRILHPRDCALTIFAGSSVDGGC-FSDFILTVLRHFKSPGLSV 228

Query: 2338 TEFAAFMCSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGIC---QIFEARQFIPMFSL 2168
             +F+AF+ S P+  VFA  G+ FL                G C    IF +RQ IPMF L
Sbjct: 229  AKFSAFLLSNPINEVFALKGMRFLQGYP----------PTGCCGMFAIFGSRQSIPMFHL 278

Query: 2167 DFCASPFTFMRLHFSVVLRSLYLPDVLIRYLSGL-IKKAREITDNKKCLPCIPTEMGFPG 1991
            DF A P  FM L+  + LR   +   L+   + L +  + +  ++      +P+ +    
Sbjct: 279  DFSAIPLPFMFLYSEMFLRVTRIQARLVYNNNQLDVDISSDSEEDSVEELHVPSPVSSLE 338

Query: 1990 SNSMASWSSSVKKRKVDF-IVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSM 1814
               MA      K R V    VRAT    R++  R+  +SR ++++R+SLR  R R+ S  
Sbjct: 339  RKPMAFLFDRPKTRSVSHPSVRATRLGTRTMQYRNGFSSRGIRKRRSSLRIRRPRSHSLA 398

Query: 1813 GFHHDL-------------FRAGYKHKKRKL-AQKSPCSNVKELKSTLVELKQNMDSVCC 1676
                 +             F +G    + K  A +     ++E  ST +    ++DS CC
Sbjct: 399  AMQKSIGPLAVDDVKLGVSFPSGASCNRHKSSAVRDSAGRIRETNSTALGSAMDVDSSCC 458

Query: 1675 SANILVIESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTH 1496
             ANIL++E+D+C REEGA ++LE  A  +W LV K  G +RY +KAE VM+PS+ NRFTH
Sbjct: 459  KANILIVEADKCLREEGANIVLEFSASCEWLLVVKKDGSTRYTHKAERVMKPSSCNRFTH 518

Query: 1495 AMIWTGENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYV 1316
            A++W+ +NGWKLEF NRRDW IFK+L+  C DRN+  + A+ IPVP V EVP Y DS   
Sbjct: 519  AILWSIDNGWKLEFPNRRDWFIFKDLYKECSDRNIPCLIAKAIPVPRVSEVPDYVDSSGA 578

Query: 1315 PFVRPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKF 1136
             F RP  YI++ DDEV RA+ +  ANYDMDS DEEWL + N+G+   D         + F
Sbjct: 579  SFQRPDTYISVNDDEVCRAMTKSTANYDMDSEDEEWLIEFNDGLIATDKH-QECFSEDNF 637

Query: 1135 EEMMDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQ 956
            E M+DAFEK  Y  PD  SDE      C+ L     V ++Y YW KKRKQ   +L+RVFQ
Sbjct: 638  ESMVDAFEKGFYCNPDAFSDEKAPADICTPLASPSIVESLYTYWTKKRKQRKSSLIRVFQ 697

Query: 955  C-PPPRSSQLMQKPVFRKKRSFKRQMRQSGRGK-------QQIFFHASAVEPEQDAMQRV 800
                 R   L+ KP+ R+KRS KRQ  QSG G+       + I +   AVE +Q+AMQ+ 
Sbjct: 698  AYQSKRKPPLVPKPMMRRKRSLKRQPSQSGSGRTPQPSILEAILWRRDAVE-DQNAMQKY 756

Query: 799  QKAKSLADISMESVLLKRRRAQILMDNADLATYXXXXXXXXXXXXXXXESPEA 641
            +++K+  +  +E+ + KR+RAQ+L++NADLA Y               +SPEA
Sbjct: 757  EESKAAVEKCIENAVSKRQRAQLLLENADLAVYKAMSALRIAEAIETSDSPEA 809


>gb|EYU39775.1| hypothetical protein MIMGU_mgv1a001436mg [Mimulus guttatus]
          Length = 820

 Score =  414 bits (1064), Expect = e-112
 Identities = 299/836 (35%), Positives = 420/836 (50%), Gaps = 72/836 (8%)
 Frame = -3

Query: 2992 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGE---EKPVRGNGDE--WFCII 2828
            MP V MRR+TRVF          G RVLRSG+R   +  +    K  R +  E  W  I 
Sbjct: 1    MPSVGMRRNTRVF----------GTRVLRSGRRLWTEPSKGSNNKNARASHAENKWTDIP 50

Query: 2827 DDTADVPRCKSIDWYEVDPERDIDVTDFNL----NLAXXXXXXXXXXXXXSRDKMHGIVY 2660
            D         + D     P  D +    ++     +               RD+M GIVY
Sbjct: 51   DGGGGGGGDAASDRLNHTPREDKNSASSDMIVDPTIEERAPEGGGAVEVKDRDRMCGIVY 110

Query: 2659 NRKRRR-LPGNKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXX 2483
             RKR+R L          D+ YG  FVR++ RKR       E   K              
Sbjct: 111  RRKRKRKLVELGKTGLTEDKRYGKKFVRERWRKRFGATESFESCAK----------FGGS 160

Query: 2482 XSGHESSIDVVHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPL 2303
              G    + VV+        ESS    +   CFL  +L ++   RI +   +AFM S+P+
Sbjct: 161  VRGRRELVVVVN--------ESSNWCGYWVACFLSCVLSYMTKVRIGMRRMSAFMLSKPI 212

Query: 2302 VRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFS 2123
              V++ HGV F+ +     +  N +  PG+C I  +R  +P+FS+DF A P  F+ +  S
Sbjct: 213  FDVYSSHGVLFVQDAI--TARNNGIKKPGLCIISGSRSLVPIFSVDFSAIPSVFVHMQTS 270

Query: 2122 VVLRSLYLPDVLI-RYLSGLIKKAREIT--DNKKCLPCIPTEMGFPG-SNSMASWSSSVK 1955
            + LRS +L  +L+ R      ++  E+T  D +  L        FP    +  S  S ++
Sbjct: 271  LYLRSEHLAFLLVARSTDDDYEEDEEVTAMDEEPYL--------FPSCEQNQDSLDSPIR 322

Query: 1954 KRKV-DFIVRATDFARRSV-STRHSA---------------NSRNVQRKRTSLRSTRARN 1826
                 D +    D +R  + S+ HS                NSRN++++R+SLR  R R 
Sbjct: 323  DVSCSDVLAFGNDDSRGKIESSSHSPLGLPKSSALRSLQLRNSRNIKKRRSSLRRKRGRP 382

Query: 1825 SSSM-------GFHHDLFR-------------------AGYKHKKRKLA----------- 1757
             SS            D FR                   +  K+  +K +           
Sbjct: 383  PSSFRTQKSSGALASDFFRIRNDAVQFSALSPTRLLRSSDKKNSDKKKSDKNSSDKKSSD 442

Query: 1756 QKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESDRCFREEGAKVMLECLALNDWYLV 1577
            +KS  SN+KE K       Q++    CSANIL+ E+D+C+REEGA V LE      W+LV
Sbjct: 443  KKSSTSNIKETKPAT----QDIYPSTCSANILITETDKCYREEGATVALELSPSKQWFLV 498

Query: 1576 AKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGWKLEFSNRRDWLIFKELHMACMDR 1397
                G  RY   AE VMRPS  NRF+HA+IW+G+  +KLEFSN++DW +FKEL+  C +R
Sbjct: 499  IGKDGTKRYNLTAEKVMRPSCSNRFSHAVIWSGDCNFKLEFSNKQDWFVFKELYKQCSER 558

Query: 1396 NLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYITMRDDEVARALVRKPANYDMDSGD 1217
            N+Q  S   IPVPGV EV     + ++P+VRP  YIT++DDE+ RALV+K ANYDMDS D
Sbjct: 559  NMQSPSVSVIPVPGVQEVSMPFYNNFMPYVRPDNYITVKDDELIRALVKKGANYDMDSDD 618

Query: 1216 EEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKAAYHTPDEVSDESRATSFCSSLER 1037
            EEWL + N+ +  G + +   + PE FE ++DA EK  +  PDE  +E  A  FC  LER
Sbjct: 619  EEWLSEFNDELC-GGMELQEPVSPECFELVIDALEKGVHCNPDENFEELAAYDFCMHLER 677

Query: 1036 KDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLMQKPVFRKKRSFKRQMRQSGRGKQ 857
            ++ + A+  YW+KKRKQ   AL+R+FQ   PR  Q++ K VFRKKRSFKRQ  Q GRGKQ
Sbjct: 678  REVIEAIRNYWVKKRKQKRSALVRIFQLYQPRRIQVIPKSVFRKKRSFKRQASQGGRGKQ 737

Query: 856  QIFFHASAVE----PEQDAMQRVQKAKSLADISMESVLLKRRRAQILMDNADLATY 701
            +    A A E     +Q+  Q++Q+AK+ A+      + KR+RAQ+LM+NADLATY
Sbjct: 738  RPILQAIAAERDALEQQNNAQKLQEAKAAAERFEALAVEKRQRAQMLMENADLATY 793


>ref|XP_006398922.1| hypothetical protein EUTSA_v10012741mg [Eutrema salsugineum]
            gi|557100012|gb|ESQ40375.1| hypothetical protein
            EUTSA_v10012741mg [Eutrema salsugineum]
          Length = 777

 Score =  413 bits (1062), Expect = e-112
 Identities = 281/806 (34%), Positives = 418/806 (51%), Gaps = 42/806 (5%)
 Frame = -3

Query: 2992 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGNG---DEWFC---- 2834
            MP V MRR+TRVF    VVK  DGARVLRSG+R   +  E K  R +     +W C    
Sbjct: 1    MPSVGMRRTTRVF---GVVKAADGARVLRSGRRIWPNVDEPKVKRAHDVVDRDWNCLNPS 57

Query: 2833 ------IIDDTADVPRCKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMH 2672
                  +    ++    +     E+  E+D    DF +                + DK+ 
Sbjct: 58   KGKGNKVSGGRSNGAGSRPCSPREISSEKDDKEIDFPVRKRRKVATAEAVGDEKTVDKLF 117

Query: 2671 GIVYNRKRRRLPGNKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXX 2492
            G+VY+RKR+RL G    +   + +  + F    RRKR S   ++  PR+           
Sbjct: 118  GVVYSRKRKRLSGQSSDNRSEEPLRSLKFYC--RRKRLSDRVVS--PRR----------- 162

Query: 2491 XXXXSGHESSIDVVHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCS 2312
                         ++G ++ + V++SC  +   T F++ ++++V+  ++ L+  A+F  S
Sbjct: 163  -------------LYGPVITLTVDASCEESWFSTVFVL-VMRYVRRGQLGLSSLASFFLS 208

Query: 2311 EPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRL 2132
            +P+  VFA HGV FLA        +  L S G+C+ F A   +P+FS DF A P  FM +
Sbjct: 209  QPINDVFADHGVRFLA--------EPPLSSRGVCKFFGALNCLPLFSADFNAIPRCFMDM 260

Query: 2131 HFSVVLRSLYLPDVLIRYLSGLIKKAREITDNKKCL----PCIPTEMGFPGSNSMASWSS 1964
            HF++ LR +      ++    L+    E +D++  +    PC P      G +       
Sbjct: 261  HFTLFLRVVPRSFAFVKKSLYLLNNPVEESDSESEIVLSEPCNPRNGVVVGLHPS----- 315

Query: 1963 SVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH------ 1802
                      V A+     +   R S    ++Q++R+SLR  RARN S  G H       
Sbjct: 316  ----------VTASKLTGGNAQYRGSLGFHSIQKRRSSLRRRRARNLSH-GVHKPHNGTP 364

Query: 1801 -DLFRAGYKHKKRKLAQKSPCSNVKELKS---------TLVELKQNMDSVCCSANILVIE 1652
                   +K++   ++ +   S+V    S         +    K+ +DS+CCSANILVI 
Sbjct: 365  VSELSGNWKNRTTSVSSRKLRSSVLNNSSPSSNGISTISKPRTKEELDSLCCSANILVIG 424

Query: 1651 SDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGEN 1472
            SDRC REEG  VMLE  +  +W++V K  G  RY ++A   MRP + NRFT +++W G+N
Sbjct: 425  SDRCTREEGCGVMLEFSSSKEWFVVIKKDGAIRYRHRARKTMRPCSCNRFTQSIVWLGDN 484

Query: 1471 GWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCY--DDSGYVPFVRPI 1298
             WKLEF +++DWL FKE++  C +RN+   +A+ IP+PGV EV  Y  D + +  FV P+
Sbjct: 485  DWKLEFCDKQDWLGFKEIYNECYERNILEQNAKVIPIPGVREVSGYSEDIADFPSFVMPV 544

Query: 1297 AYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDA 1118
             YI++++DEV RA+ R  A YDMDS DEEWLE+ N  +   +      +  + FE M+D 
Sbjct: 545  PYISVKEDEVTRAMARNIAIYDMDSEDEEWLERQNEEMLGEEHEQSQRLEQDAFELMIDG 604

Query: 1117 FEKAAYHTP-DEVSDESRAT-SFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPP 944
            FEK  + +P D++ +E  AT +  S L R++ V AV+ YW +KRKQ    LLRVFQ    
Sbjct: 605  FEKCFFQSPADDLLNEKAATVASLSYLGRQEVVEAVHDYWARKRKQRKAPLLRVFQGHQA 664

Query: 943  RSSQLMQKPVFRKKRSFKRQMRQ-SGRGKQQIFFHASAVE----PEQDAMQRVQKAKSLA 779
            + S L+ K VFRK+RSFKRQ  Q  G+ KQ       A E     EQ+   RV++AK+LA
Sbjct: 665  KKSPLLFKHVFRKRRSFKRQGSQLHGKSKQLSLVGVKAAEQEASEEQNDYLRVEEAKALA 724

Query: 778  DISMESVLLKRRRAQILMDNADLATY 701
            D +ME  + KRRRAQ+L +NADLA Y
Sbjct: 725  DRAMEIAIAKRRRAQVLAENADLAVY 750


>ref|NP_196087.1| Enhancer of polycomb-like transcription factor protein [Arabidopsis
            thaliana] gi|7413529|emb|CAB86009.1| putative protein
            [Arabidopsis thaliana] gi|332003387|gb|AED90770.1|
            Enhancer of polycomb-like transcription factor protein
            [Arabidopsis thaliana]
          Length = 766

 Score =  409 bits (1052), Expect = e-111
 Identities = 291/810 (35%), Positives = 411/810 (50%), Gaps = 46/810 (5%)
 Frame = -3

Query: 2992 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGNGDEWFCIIDDTAD 2813
            MP V MRR+TRVF    VVK  DGARVLRSG+R   + GE K  R +      ++D   D
Sbjct: 1    MPSVGMRRTTRVF---GVVKAADGARVLRSGRRIWPNVGEPKVRRAHD-----VVDRDCD 52

Query: 2812 -------------VPRCKSIDW----YEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSR 2684
                         V   KS        +V  E++  V DF +                  
Sbjct: 53   SVLKNQNKSKGNKVSSGKSNSQPCSPKQVSSEKEDKVDDFPVTKRRKVRNEGVGDEKTV- 111

Query: 2683 DKMHGIVYNRKRRRLPGNKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXX 2504
            DKM GIVY+RKR+RL          + +  + F R+ RRK S   S              
Sbjct: 112  DKMFGIVYSRKRKRLCEPSSSDRSEEPLRSLKFYRR-RRKLSQRVS-------------- 156

Query: 2503 XXXXXXXXSGHESSIDVVHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAA 2324
                                ++L + V+ SC      T F ++ +++++   + L+  A+
Sbjct: 157  --------------------SVLTLTVDWSCEDCWFLTVFGLA-MRYIRREELRLSSLAS 195

Query: 2323 FMCSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFT 2144
            F  S+P+ +VFA HGV FL         ++ L S G+C+ F A   +P+FS DF   P  
Sbjct: 196  FFLSQPINQVFADHGVRFLV--------RSPLSSRGVCKFFGAMSCLPLFSADFAVIPRW 247

Query: 2143 FMRLHFSVVLRSLYLPDVLIRYLSGLIKKAREITDNKKCL----PCIPTEMGFPGSNSMA 1976
            FM +HF++ +R L      +     L+    E +D++  L    PC P      G +   
Sbjct: 248  FMDMHFTLFVRVLPRSFFFVEKSLYLLNNPIEESDSESELALPEPCTPRNGVVVGLHPS- 306

Query: 1975 SWSSSVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSS----SMGF 1808
                          VRA+     +   R +  S + Q++R+SLR  RARN S     +  
Sbjct: 307  --------------VRASKLTGGNAQYRGNLGSHSFQKRRSSLRRRRARNLSHNAHKLNN 352

Query: 1807 HHDLFRAGYKHKKRK------------LAQKSPCSNVKELKSTLVELKQNMDSVCCSANI 1664
               +F      K R             L+  SP SN   +   + + K+ +DS+CCSANI
Sbjct: 353  GTPVFDISGSRKNRTAAVSSKKLRSSVLSNSSPVSNGISI-IPMTKTKEELDSICCSANI 411

Query: 1663 LVIESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIW 1484
            L+I SDRC REEG  VMLE  +  +W+LV K  G  RY + A+  MRP + NR THA +W
Sbjct: 412  LMIHSDRCTREEGFSVMLEASSSKEWFLVIKKDGAIRYSHMAQRTMRPFSSNRITHATVW 471

Query: 1483 TGENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDD--SGYVPF 1310
             G + WKLEF +R+DWL FK+++  C +RNL   S + IP+PGV EV  Y +    +  F
Sbjct: 472  MGGDNWKLEFCDRQDWLGFKDIYKECYERNLLEQSVKVIPIPGVREVCGYAEYIDNFPSF 531

Query: 1309 VR-PIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFE 1133
             R P++YI++ +DEV+RA+ R  A YDMDS DEEWLE+ N  + + +      +  E FE
Sbjct: 532  SRPPVSYISVNEDEVSRAMARSIALYDMDSEDEEWLERQNQKMLNEEDDQYLQLQREAFE 591

Query: 1132 EMMDAFEKAAYHTP-DEVSDESRAT-SFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVF 959
             M+D FEK  +H+P D++ DE  AT    S L R++ V AV+ YW+KKRKQ    LLR+F
Sbjct: 592  LMIDGFEKYHFHSPADDLLDEKAATIGSISYLGRQEVVEAVHDYWLKKRKQRKAPLLRIF 651

Query: 958  QCPPPRSSQLMQKPVFRKKRSFKRQMRQ-SGRGKQQI--FFHASAVEP-EQDAMQRVQKA 791
            Q    + +QL+ KPVFRK+RSFKRQ  Q  G+ KQ         A EP E+D + R+++A
Sbjct: 652  QGHQVKKTQLLSKPVFRKRRSFKRQGSQLHGKAKQTSPWMVAVKAAEPEEEDDILRMEEA 711

Query: 790  KSLADISMESVLLKRRRAQILMDNADLATY 701
            K LAD +ME+ + KRRRAQIL +NADLA Y
Sbjct: 712  KVLADKTMETAIAKRRRAQILAENADLAVY 741


>ref|XP_007221418.1| hypothetical protein PRUPE_ppa001422mg [Prunus persica]
            gi|462418130|gb|EMJ22617.1| hypothetical protein
            PRUPE_ppa001422mg [Prunus persica]
          Length = 768

 Score =  408 bits (1049), Expect = e-111
 Identities = 284/759 (37%), Positives = 390/759 (51%), Gaps = 38/759 (5%)
 Frame = -3

Query: 2998 TLMPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRG-NGDE-WFCIID 2825
            T MP VEMRR+TRVF    V    DGARVLRSG+R   +  E K  R  NGDE W  ++ 
Sbjct: 52   TEMPSVEMRRTTRVFGMGMVKGGVDGARVLRSGRRLWPESSESKLERARNGDEDWLKLMK 111

Query: 2824 DTA--DVPRCKSIDWYEVD----PERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMHGIV 2663
              A   V       W   +    P R+  V     +L                 K +GIV
Sbjct: 112  SHAGESVVGLNHKKWAGANQVGSPRRNTPV--LKTSLVKKPQSNELLADLLKEHKRYGIV 169

Query: 2662 YNRKRRRLPGNKFVSSPR-----DRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXX 2498
            Y RKR+R   +   +  +     DRMYG  F R+QR K+S      EL            
Sbjct: 170  YTRKRKRASASFLGNVEKENGSDDRMYGRRFARRQRMKKSK-----EL------------ 212

Query: 2497 XXXXXXSGHESSIDVVHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFM 2318
                     +S    V   +L   VESS    +    FL S+L ++  + + LTEF+ F+
Sbjct: 213  ---------DSHPGFVCPEVLCFSVESSWAQGYWAGRFLYSVLVYMTRASLGLTEFSEFL 263

Query: 2317 CSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFM 2138
              EP+  +FA +G+ F  + S            G+C++F A QFIP+FS+DF A P  FM
Sbjct: 264  ALEPIGSIFASYGIQFSRDRSCTRR-------SGVCKLFGAEQFIPLFSVDFSAVPGCFM 316

Query: 2137 RLHFSVVLR---SLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGSNSMASWS 1967
             +  S+ LR    L + +++  + +G      +  D+ + +  I         N  A  S
Sbjct: 317  FMQTSMHLRFRCHLTVNNLIDGHENGEFIDQGDDDDDGEKVDFI--------ENRHALHS 368

Query: 1966 SSVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH----- 1802
            S          VR    A RS   R+   SR +Q++R+SLR  R+RN S +         
Sbjct: 369  S----------VRVPKLACRSTQYRNGLTSRGIQKRRSSLRRRRSRNPSLVSLRKPNGAL 418

Query: 1801 -----DLFRAGY-------KHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILV 1658
                  + + G        KH  RK    S   N+K    T+   K+++DS  CSANIL 
Sbjct: 419  VSELISIRKNGLPFSSVESKHMLRKSVSLSLAGNLKAESLTIEGSKRDLDSTSCSANILF 478

Query: 1657 IESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWT- 1481
             E D+C+RE+GA VMLE  +  +W LV K +GL+RY +KAE VMRP + NR T A+IW+ 
Sbjct: 479  TELDKCYREDGATVMLEMSSSGEWLLVVKKNGLTRYTHKAEKVMRPCSKNRITQAIIWSA 538

Query: 1480 ---GENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPF 1310
               G+N WKLEF NR DW IFK+L+  C DR +   + + IPVPGV EVP Y DS    F
Sbjct: 539  DSNGDNNWKLEFPNRCDWAIFKDLYKECSDRVVPAPAIKFIPVPGVREVPGYADSHSTLF 598

Query: 1309 VRPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEE 1130
             RP +YI + DDEV+RA+ ++ ANYDMDS DEEWL+K N+  F  +  + +++  + FE 
Sbjct: 599  DRPESYIYLNDDEVSRAMAKRTANYDMDSDDEEWLKKFNSDFF-AENELHDHVSEDNFEL 657

Query: 1129 MMDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNH-VALLRVFQC 953
            M+DAFEKA Y  P + +DE+ A + C  + R++ V A+Y YWM KRKQ    +LLRVFQ 
Sbjct: 658  MVDAFEKAFYCRPYDFADENAAANLCLDMGRREVVEAIYSYWMNKRKQKRSSSLLRVFQG 717

Query: 952  PPPRSSQLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHAS 836
               + + L  KPV RK+RSFKRQ  Q GRGKQ  F   +
Sbjct: 718  HQSKRALLDPKPVLRKRRSFKRQPSQFGRGKQPSFLQGT 756


>ref|XP_007145542.1| hypothetical protein PHAVU_007G247300g [Phaseolus vulgaris]
            gi|561018732|gb|ESW17536.1| hypothetical protein
            PHAVU_007G247300g [Phaseolus vulgaris]
          Length = 734

 Score =  402 bits (1032), Expect = e-109
 Identities = 284/808 (35%), Positives = 393/808 (48%), Gaps = 44/808 (5%)
 Frame = -3

Query: 2992 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGN-GDEWFCIIDDTA 2816
            MP   MRR+TRVF     +K  D ARVLRSG+R   D GE K  R + GDE         
Sbjct: 1    MPAAGMRRTTRVFG----MKGADTARVLRSGRRLWPDSGEVKTKRSSDGDE--------- 47

Query: 2815 DVPRCKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMHGIVYNRKRRRLP 2636
                      + V P +                            KM  ++  R   +  
Sbjct: 48   ----------WAVTPAKAA--------------------------KMDAVMTPRGTAKGK 71

Query: 2635 GNKFVSSPRD----RMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXXXSGHE 2468
              + V   RD    R +GI +VR+++  +  G                            
Sbjct: 72   RQEAVVDARDSTVDRRFGIVYVRRRKGLKKEGS--------------------------R 105

Query: 2467 SSIDVVHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPLVRVFA 2288
             S++V    +L +VV      +  F   L S++++ K  R+S  + + F  S  +  VFA
Sbjct: 106  RSVEVSR-CVLSVVVSRCAGKSALFLRLLASVVRYAKRVRVSPRKLSGFFMSGAVNGVFA 164

Query: 2287 QHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLH----FSV 2120
              G+ F+             ++ GICQ F   +F+P+FS+DF A P  F  LH    F  
Sbjct: 165  SQGMQFVKG--------PPAVNSGICQFFGVTEFVPLFSVDFSAVPLCFEYLHSAMFFKS 216

Query: 2119 VLRSLYL----------------PDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGS 1988
            +LRSL+L                 D L+ Y +   K+    T   +    +         
Sbjct: 217  MLRSLFLVCNPINVRSDVEDMESDDDLLEYQNE--KQISSNTFKGELSETVTVTSDVIEI 274

Query: 1987 NSMASWSSSVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGF 1808
            N + S  SSVK          T  A R+   R+  NSR +Q++R+SLR  +ARN S  G 
Sbjct: 275  NDVLSLQSSVKS--------TTRAAGRNGQYRNMLNSRGIQKRRSSLRKRKARNPSMGGL 326

Query: 1807 H---------------HDLFRAGYKHKK-RKLAQKSPCSNVKELKSTLVELKQNMDSVCC 1676
                            ++ F      K+ R LA  S   ++KE  S +V+ K+ +    C
Sbjct: 327  RRNGAVAFELTGGRKGNNQFSGVTSSKRLRSLANGSTTGSLKEASSAIVDSKERLGLSSC 386

Query: 1675 SANILVIESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTH 1496
            SAN+LV E  +C R EGA V LE  A  +W L  K   L+R  +KAE VMRP + NRFTH
Sbjct: 387  SANLLVSEIHQCHRVEGAIVTLEMSASKEWLLTVKKDELTRSTFKAEKVMRPCSSNRFTH 446

Query: 1495 AMIWTGENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYV 1316
            A++++ +NGWKLEF+NR+DW +FK+L+  C DRN+   +A+ IPVPGV EV  Y +S   
Sbjct: 447  AIMYSLDNGWKLEFTNRQDWNVFKDLYKKCSDRNIPSTAAKFIPVPGVREVSSYAESNSF 506

Query: 1315 PFVRPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKF 1136
            PF RP  YI++  DE+ RA+ R  ANYDMDS DEEWL+K NN          N +  + F
Sbjct: 507  PFHRPDTYISVFGDELTRAMARTTANYDMDSEDEEWLKKFNN-------ECQNPVSDDNF 559

Query: 1135 EEMMDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQ 956
            E ++D  EK  Y  PDE+ DE  AT+ C  L  K+ V AVY YWM+KRKQ    L+RVFQ
Sbjct: 560  ELIIDTLEKVYYCNPDELFDEKSATNGCQDLGSKEVVEAVYNYWMRKRKQKRSLLIRVFQ 619

Query: 955  CPPPRSSQLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVEP---EQDAMQRVQKAKS 785
                + + L+ KP+ RK+RSFKRQ  Q GR  Q     A A E    E++AM R+++AK+
Sbjct: 620  GHQSKRAPLIPKPLLRKRRSFKRQPSQFGRSNQPSVLKAFAAEQDAMEENAMLRIEEAKA 679

Query: 784  LADISMESVLLKRRRAQILMDNADLATY 701
             A++SME  + KRRRAQ L  NADLATY
Sbjct: 680  NANMSMELAIHKRRRAQSLAQNADLATY 707

