BLASTX nr result

ID: Mentha28_contig00013648 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00013648
         (3466 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU31274.1| hypothetical protein MIMGU_mgv1a000087mg [Mimulus...  1222   0.0  
ref|XP_002525350.1| DNA binding protein, putative [Ricinus commu...   728   0.0  
ref|XP_006365207.1| PREDICTED: methyl-CpG-binding domain-contain...   722   0.0  
ref|XP_004239350.1| PREDICTED: methyl-CpG-binding domain-contain...   721   0.0  
ref|XP_006483833.1| PREDICTED: methyl-CpG-binding domain-contain...   713   0.0  
ref|XP_007217135.1| hypothetical protein PRUPE_ppa000046mg [Prun...   709   0.0  
ref|XP_007031430.1| Methyl-CpG-binding domain-containing protein...   703   0.0  
ref|XP_002274643.2| PREDICTED: methyl-CpG-binding domain-contain...   693   0.0  
ref|XP_006446469.1| hypothetical protein CICLE_v10014026mg [Citr...   692   0.0  
ref|XP_006470356.1| PREDICTED: methyl-CpG-binding domain-contain...   692   0.0  
ref|XP_006470355.1| PREDICTED: methyl-CpG-binding domain-contain...   692   0.0  
ref|XP_004167238.1| PREDICTED: LOW QUALITY PROTEIN: methyl-CpG-b...   669   0.0  
ref|XP_004141185.1| PREDICTED: methyl-CpG-binding domain-contain...   669   0.0  
ref|XP_002884279.1| methyl-CpG-binding domain 9 [Arabidopsis lyr...   664   0.0  
ref|XP_006594288.1| PREDICTED: methyl-CpG-binding domain-contain...   659   0.0  
ref|XP_006603816.1| PREDICTED: methyl-CpG-binding domain-contain...   655   0.0  
ref|NP_186795.1| methyl-CPG-binding domain 9 [Arabidopsis thalia...   655   0.0  
gb|EXC31622.1| Methyl-CpG-binding domain-containing protein 9 [M...   655   0.0  
ref|XP_007031432.1| Methyl-CpG-binding domain-containing protein...   649   0.0  
ref|XP_006296811.1| hypothetical protein CARUB_v10012794mg [Caps...   649   0.0  

>gb|EYU31274.1| hypothetical protein MIMGU_mgv1a000087mg [Mimulus guttatus]
          Length = 1861

 Score = 1222 bits (3161), Expect = 0.0
 Identities = 639/997 (64%), Positives = 748/997 (75%), Gaps = 2/997 (0%)
 Frame = -3

Query: 3464 ASDEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFV 3285
            +SDED+VFCNLL RI+LNPNDNDDEG+LGYPAMVSRPLDFRTIDLRLA+GAYGGSHE F 
Sbjct: 892  SSDEDRVFCNLLARIVLNPNDNDDEGVLGYPAMVSRPLDFRTIDLRLAAGAYGGSHETFF 951

Query: 3284 DDVREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSL 3105
            DDV+EVWRNI  AYGDR DLIDV +NLSKKFE+LYEKEV+T + K+A+  +  DSSAD++
Sbjct: 952  DDVQEVWRNIRIAYGDRPDLIDVVENLSKKFEELYEKEVMTFVHKIAENVNASDSSADAI 1011

Query: 3104 KERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIP 2925
            KERDDLLVQ CNS LPRAPWDEG+CKVCGMDKDDDNVLLCDKCDSEYHRYCL PPLL+IP
Sbjct: 1012 KERDDLLVQACNSSLPRAPWDEGICKVCGMDKDDDNVLLCDKCDSEYHRYCLSPPLLKIP 1071

Query: 2924 EGNWYCPSCVTGQSLPSGTGYASGSNQRRKRRNQGEFSSKFLEELSRFAKLMEMKEHWEF 2745
            EGNWYCPSCVTGQ++   T Y S + Q RKR++QGEF+SKFLEEL+R AKLME+KE+WEF
Sbjct: 1072 EGNWYCPSCVTGQAISYSTSYGSVATQCRKRKHQGEFTSKFLEELARLAKLMEIKEYWEF 1131

Query: 2744 TVEERIFFLKFLFDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXLS 2565
            T+EERIFF+KFLFDEALNSAT+R+HM+Q +SRAADLQ KLRSLT E            LS
Sbjct: 1132 TIEERIFFMKFLFDEALNSATIREHMDQSSSRAADLQQKLRSLTYELKVLKAKEDMLGLS 1191

Query: 2564 AEKTNSGVLNVRGDLNSDASSSQHASENITRGKPSEKLVGDQSQPEKIIVKTSEGPNWLS 2385
             EK NSG    RGD+ SDASSS   +EN +R  PSEK  G          +  E P+ L+
Sbjct: 1192 TEKVNSG---GRGDMKSDASSSLLLTENSSR-IPSEK--GSHLSSLSAFTRLEERPS-LN 1244

Query: 2384 EKPISVQQPRSDQGHTSLLNNVQSPLFSSPTSERETELVQCPNQGDMPSSQLNNLKACTV 2205
            E+P    QP        LL+ + +P+ S+  S             D  SSQ N+LKA TV
Sbjct: 1245 EQP---NQP-------PLLSTIPAPVSSAQESR---------GNPDKLSSQDNSLKAATV 1285

Query: 2204 KQEITNLLDSIADIELELVKISLRRDFLGRDSIGRVYWAFCYPGARPWIIACGGTAHKER 2025
            K +I+++ DSIA IELEL+K+SLR+DFLGRDS GRVYW F  PGARPWI+ACG  A KER
Sbjct: 1286 KSDISSMRDSIASIELELLKVSLRKDFLGRDSNGRVYWGFYCPGARPWIMACGDLAFKER 1345

Query: 2024 CPRDFSSIPDSDKWMYYESDSEIEKLVGWLRENNVREKELKESISQFQANKLKDSEYTED 1845
            CP +F  +PDS KWMYYESD EIEKLVGWLRENN REKELKESI Q Q NKLKDS+YTE+
Sbjct: 1346 CPEEFIGVPDSHKWMYYESDDEIEKLVGWLRENNPREKELKESILQLQNNKLKDSQYTEN 1405

Query: 1844 HILKKREINHGGRKTVSADFLATNAMNALEKKFGPCKRIETTAVPQNLVMGASQSGGMHR 1665
            HIL K E N   RK  SA+ L+T AM +LE KFGP      T   QNL  G S    M+R
Sbjct: 1406 HILSKAEENRSERKASSANILSTKAMASLENKFGPLLGTRATDARQNLASGLSPDCRMYR 1465

Query: 1664 CECLEMLWPSKDHCGSCHQSFSTSEELRQHVKENCKAVASGSKRSQAAEDMSKRKKARNA 1485
            CECLE+LWPS +HC SCHQSF T+EEL QH+KENCK  A   KRSQ  ED+SKRKK +  
Sbjct: 1466 CECLELLWPSNNHCASCHQSFPTTEELGQHLKENCKPAAPVPKRSQTTEDVSKRKKLKIV 1525

Query: 1484 VSQEKRPASVGIPQGSTLEKQIDGSASVESYNADCPFNFEEIMTRFIVPSSVKDGVNEIG 1305
             SQEKRP  +GI Q ST +KQ DGS+  + Y ADCPFNFEEIMTRF+VP S+KD VN IG
Sbjct: 1526 SSQEKRPGDMGILQTSTSKKQNDGSSFADRYYADCPFNFEEIMTRFVVPGSIKDAVNSIG 1585

Query: 1304 LIGSGGLPSFLQGQSPYVSDSALTVGLERTNEASSRSNDLRSKQKNA--EPGDVMNGRGF 1131
            LIG+GG+PSF    S Y+               S    DL SKQ ++  E    MN +  
Sbjct: 1586 LIGNGGIPSFSSSGSLYL---------------SGMPTDLSSKQHHSSNEGSAAMNTKDN 1630

Query: 1130 KDSNRSSRSAENGLSDELSIVGRLKSILTSEKDQVTPMKDRSSLLGLSKSTIIRESSSRP 951
            K+S+R S  AE  L ++ S VGRLKSI  S ++ V+ MK+++SLLGLSKS++IRESS RP
Sbjct: 1631 KESSRLSSCAETFLGEKGSGVGRLKSISMSGREHVSSMKNKNSLLGLSKSSLIRESSQRP 1690

Query: 950  LVGRASEILRVLKINLLDMDAALPEDAFRKSRSSPDRRCAWRAFLKSSKSIYEMVQATIV 771
            LVGRASEILR LKINLLDMDAALP+DA R SRS+  RR AWRAF+KS+KSIYEMVQA I+
Sbjct: 1691 LVGRASEILRFLKINLLDMDAALPQDALRTSRSNEGRRYAWRAFVKSAKSIYEMVQAMII 1750

Query: 770  FEDMIKTEYLRNDWWYWSSPSTAARITTLSALALRIYSLDSAISYEEPLPTAAAMEVSEP 591
             ED I++EYLRNDWWYWSSPSTAA+ TTLS+LALRIYSLD+AISYE+PL    ++E+ EP
Sbjct: 1751 LEDTIRSEYLRNDWWYWSSPSTAAKTTTLSSLALRIYSLDAAISYEKPLQN-GSIEMPEP 1809

Query: 590  SNAIEEEIQKSPTLKNLASPSSPTLQKTPEPDSSENP 480
            S A+E+E   S  LKNL SPSSP+LQKTPEPDS+ENP
Sbjct: 1810 SCALEDEAPLSKLLKNLPSPSSPSLQKTPEPDSAENP 1846


>ref|XP_002525350.1| DNA binding protein, putative [Ricinus communis]
            gi|223535313|gb|EEF36988.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 1794

 Score =  728 bits (1878), Expect = 0.0
 Identities = 429/1036 (41%), Positives = 601/1036 (58%), Gaps = 45/1036 (4%)
 Frame = -3

Query: 3464 ASDEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFV 3285
            A+DE+K+FCNLLGR +LN +DNDDEGLLG+P MVSRPLDFRTIDLRLA GAYGGSHEAF+
Sbjct: 777  AADEEKIFCNLLGRTLLNTSDNDDEGLLGFPTMVSRPLDFRTIDLRLAFGAYGGSHEAFL 836

Query: 3284 DDVREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSL 3105
            +DVREVW +I TAY D+SDL+ +A+ LS+ FE LY+ EVLTL+QK+ D + V+ S++++ 
Sbjct: 837  EDVREVWHHIRTAYADQSDLVHLAEKLSQNFEALYKNEVLTLVQKLTDYAAVECSNSEAK 896

Query: 3104 KERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIP 2925
            KE +D+L     S +P+APWDEG+CKVCG+DKDDDNVLLCDKCDS YH YCL+PPL RIP
Sbjct: 897  KEMEDILEHA--SQMPKAPWDEGVCKVCGVDKDDDNVLLCDKCDSGYHTYCLNPPLARIP 954

Query: 2924 EGNWYCPSCVT--GQSLPSGTGYASGSNQRRKRRNQGEFSSKFLEELSRFAKLMEMKEHW 2751
            EGNWYCPSC+T     +P    +       RK+R QGEF+   LE L+     ME+ ++W
Sbjct: 955  EGNWYCPSCITQGASQVPQFVSHC------RKKRRQGEFTHGVLEALAHLGTTMEITDYW 1008

Query: 2750 EFTVEERIFFLKFLFDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXX 2571
            +++VEERIF LKFL DE LNSA +R+H++QCAS +ADLQ KLRSL+ E            
Sbjct: 1009 DYSVEERIFLLKFLGDEVLNSANIREHLDQCASVSADLQQKLRSLSMEWRNLKFKEELML 1068

Query: 2570 LSAEKT-NSGVLNVRGDLNSDASSSQHASENITRGKPSEKLVGDQSQPEKIIVKTSEGPN 2394
                K+   G   V  + +     +   S   +      + + D  +  +    T   P 
Sbjct: 1069 NGVGKSGKEGTTTVLPNYDKLLGQTHSRSSLCSTSFIDLEHLKDGPRFPRTNDFTKR-PC 1127

Query: 2393 WLSEKPISVQQPRSDQGHTSLLNNVQSPLFSSPTSERETELVQCPNQGDMPSSQLNNLKA 2214
            W+  K + VQQP S+      +++ +  +                NQ D+   Q +NL++
Sbjct: 1128 WVYPKGVQVQQPISNGSQVFTISDTECQV----------------NQPDVNQLQTSNLES 1171

Query: 2213 CTVKQEITNLLDSIADIELELVKISLRRDFLGRDSIGRVYWAFCYPGARPWIIACGGT-- 2040
              ++ + + L DS+  +EL+L K SLR++FLGRDS GRVYWAF   G+ PW++  G T  
Sbjct: 1172 IFIRDKASVLQDSVTSLELQLQKASLRKEFLGRDSAGRVYWAFSRTGSLPWVVIDGTTVV 1231

Query: 2039 -----AHKERCPR-----DFSSIPDSD-------------------------KWMYYESD 1965
                 A + R  R       SSI   D                         +W  ++S 
Sbjct: 1232 QQSSIAEENRVLRFNNLTFRSSIGAQDLLRFKGSNVFSPYASDLTSGISVYFQWFSHQSY 1291

Query: 1964 SEIEKLVGWLRENNVREKELKESISQFQANKLKDSEYTEDHILKKRE---INHGGRKTVS 1794
            +EIE+L+ WLR+N+  ++EL ES+ Q       +S    +++L+  +   +     KT+ 
Sbjct: 1292 AEIEELIKWLRDNDPMQRELIESLLQRLNFGYSNSNKAANYVLEMNQPASMPVNIEKTLK 1351

Query: 1793 ADFLATNAMNALEKKFGPCKRIETT--AVPQNLVMGASQSGGMHRCECLEMLWPSKDHCG 1620
               L T A+ ALEKK+GPC  ++ T  +V  +  +  +    M RCECLE +WPS+ HC 
Sbjct: 1352 PKSLETRALTALEKKYGPCMELDVTNISVKFSRNLKVTYDDRMCRCECLEAIWPSRHHCL 1411

Query: 1619 SCHQSFSTSEELRQHVKENCKAVASGSKRSQAAEDMSKRKKARNAVSQEKRPASVGIPQG 1440
            SCH+SFS+  EL +H    C A A   + S+  +D+SK K    A   E +  + G   G
Sbjct: 1412 SCHRSFSSRCELEEHNDGKCGAGAHTPQNSRVTDDVSKEKVLMRAEHGEWQCKAGG--AG 1469

Query: 1439 STLEKQIDGSASVESYNADCPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQS 1260
              +E  + G           P+N EEI  +F+  SS K+ V EIGL+GS G+PS +   S
Sbjct: 1470 HEIEFGLIGFRK----EFMSPYNLEEISAKFVTRSSNKELVKEIGLLGSNGIPSLVPCSS 1525

Query: 1259 PYVSDSALTVGLERTNEASSRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSRSAENGLSDE 1080
            PY+ D  L + L   NE          +  + +     + R    SN +       L +E
Sbjct: 1526 PYLIDPTLKLVLPCVNEVCQSVQSTNVENGSLQGDTTTSKRHANKSNATKDCTAVDLYEE 1585

Query: 1079 LSIVGRLKSILTSEKDQVTPMKDRSSLLGLSKSTIIRESSSRPLVGRASEILRVLKINLL 900
            L  +GR  S L ++    + ++   + LG   S  IR S+ RPLVG+ + ILR LKINLL
Sbjct: 1586 LQEIGR--SYLMNQ----SSLRFSCTKLGNPLSE-IRGSALRPLVGKGAHILRQLKINLL 1638

Query: 899  DMDAALPEDAFRKSRSSPDRRCAWRAFLKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYW 720
            DMDAALPE+A + S    ++RCAWRAF+KS+KS++EMVQATIV E+MIKT++LRN+WWYW
Sbjct: 1639 DMDAALPEEAVKSSNIYLEKRCAWRAFVKSAKSVFEMVQATIVLENMIKTDFLRNEWWYW 1698

Query: 719  SSPSTAARITTLSALALRIYSLDSAISYEEPLPTAAAMEVSEPSNAIEEEIQKSPTLKNL 540
            SS S AA+I T+S+LALRIY+LD+AI YE+ LP     +++E  +  +          N 
Sbjct: 1699 SSLSAAAKIATISSLALRIYTLDAAIVYEKTLPFTPPKDIAEVGSKSD----------NN 1748

Query: 539  ASPSSPTLQKTPEPDS 492
             SP    L+  P+P S
Sbjct: 1749 NSPPHTDLESNPKPSS 1764


>ref|XP_006365207.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
            [Solanum tuberosum]
          Length = 2173

 Score =  722 bits (1864), Expect = 0.0
 Identities = 444/1104 (40%), Positives = 619/1104 (56%), Gaps = 111/1104 (10%)
 Frame = -3

Query: 3458 DEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFVDD 3279
            D+DKVFCNL GR +L+PNDND+EGLLG+PAMVSRPLDFRTID++LA+G+YGGSHE+F+D+
Sbjct: 1067 DKDKVFCNLSGRTVLSPNDNDNEGLLGHPAMVSRPLDFRTIDVKLAAGSYGGSHESFIDE 1126

Query: 3278 VREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSLKE 3099
            VREVW NI TAY ++S+L+++A +L +KFE+ YEKEVL L+Q + + S+  + S++  K+
Sbjct: 1127 VREVWHNIRTAYCNKSNLLELAGSLLQKFEEDYEKEVLPLVQ-IIECSNDSNLSSEVAKD 1185

Query: 3098 RDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPEG 2919
            RD LL  V  S LP+APW+EGLCKVC MDKDD NVLLCDKCDSEYH YCLDPPL+++P G
Sbjct: 1186 RDGLLAHVNESSLPKAPWEEGLCKVCSMDKDDVNVLLCDKCDSEYHTYCLDPPLVKVPIG 1245

Query: 2918 NWYCPSCVTGQSLPSGTGYASGSNQRR---KRRNQGEFSSKFLEELSRFAKLMEMKEHWE 2748
             WYCP C     +      +SGS+  R   KRR   + + KF+E+LS+  + ME+KE+WE
Sbjct: 1246 PWYCPDCEA--KISRSQNASSGSHTIRQCVKRRLHRKLTHKFMEKLSQLTRTMELKEYWE 1303

Query: 2747 FTVEERIFFLKFLFDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXL 2568
              +E+RIF LKFL DE LNSA +RDH+++ AS +A+LQ KLRSL +E             
Sbjct: 1304 LPLEDRIFLLKFLCDEMLNSAILRDHIDRSASLSAELQQKLRSLGAELKLLKHKKEILTA 1363

Query: 2567 SAEK---------------TNSGVLNVRG-DLNSDASSSQHASENITRGKPSEKLVGDQS 2436
              +                +N   L V+G D  S  SS       +  G    K      
Sbjct: 1364 KLKNDARSSGDTGSDTSLWSNDCKLKVQGPDSGSHNSSISGGCRQLDDGTQHNKCNDYNK 1423

Query: 2435 QP-----EKIIVKT-SEGPNWLSEKPISVQQPRSDQ---GHTSLLN------------NV 2319
            Q      + I  KT + G N +   P  +   +  Q    +T  LN            N+
Sbjct: 1424 QSCLYTSKNIQDKTCASGTNHIRNSPDPINHLQHQQLLKENTRSLNTSSHAKCGTEEANL 1483

Query: 2318 QSPLFSSPTSERETELVQ--------------------------CPNQGDMPSSQLNNLK 2217
            Q+ LF S T ++ET+ +                           C      P  +    +
Sbjct: 1484 QNDLFISTTLQQETDQIPGNCLESTPSSSKSIMLFATHIVSATTCSGSVSNPLEEAFLFE 1543

Query: 2216 ACTVKQEITNLLDSIADIELELVKISLRRDFLGRDSIGRVYWAFCYPGARPWIIACGGTA 2037
               +K+EI  L DSIA  ELEL ++S+R++++G+DS GR+YW F    +   +     + 
Sbjct: 1544 MSAIKKEIRALEDSIAAKELELQEVSVRKEYMGQDSEGRLYWTFGRSTSSRLVAYASTST 1603

Query: 2036 HKER---------------------CPRDFSSIPDSDKWMYYESDSEIEKLVGWLRENNV 1920
              E                       P +   +P+ D+W  Y+SD + E L+ WL+E++ 
Sbjct: 1604 QPESSGHLWSYGVESSRRSGVFDSSAPWENMGMPNLDQWTSYQSDVDTEILIRWLKEHDP 1663

Query: 1919 REKELKESISQFQANKLKDSEYTEDHILKKREI-----NHGGRKTVSADFLATNAMNALE 1755
            RE+ELKESI Q++  +     Y E H   K  +     +       ++D L T A+ A++
Sbjct: 1664 RERELKESILQWRDTRKMIYYYLESHGHDKVRLITSIPSEDSASCFNSDSLVTRAVTAIK 1723

Query: 1754 KKFGPCKRIETTAVPQNL--VMGASQSGGMHRCECLEMLWPSKDHCGSCHQSFSTSEELR 1581
            K    C   E T +  NL   +  S  G ++RCECLE LWPS+ HC SCHQ+FS ++E  
Sbjct: 1724 KMVSGCSAEEETEICTNLGVKVRVSFDGELYRCECLEPLWPSRPHCLSCHQTFSDAKERL 1783

Query: 1580 QHVKENCKAVASG--SKRSQAAEDMSKRKKARNA-----------VSQEKRPASVGIPQG 1440
            +H  E C+  +     +  + +E  +KRK+  N            VSQ  +   +G  + 
Sbjct: 1784 KHANEKCRIDSPSPIQRDGETSEQPAKRKRTANNEILQDNSLSNDVSQASKSKKLGNGEA 1843

Query: 1439 STLEKQIDGSASVESYNA-DCPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQ 1263
            S  +K  +  AS E+    +CPF FEEI  +FI   S+K+ VNEIGLIG  G PSF+   
Sbjct: 1844 SRRDKHGNAPASAENQTKQECPFKFEEIKAQFITQRSLKELVNEIGLIGCNGTPSFIPCT 1903

Query: 1262 SPYVSDSALTVGLERTNEA-SSRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSRSAENGLS 1086
            SPY+ DSAL +  +R +E     S DL S +     G  ++     D+   +    NGL+
Sbjct: 1904 SPYLCDSALELLSQREDEVCGGNSTDLLSSEHQLRNGVKVSCINNSDNPNCTG---NGLA 1960

Query: 1085 DELSIVGRLKSILTSEKDQVTPMKDRSSLLGLSKSTIIRESSSRPLVGRASEILRVLKIN 906
                + GRLKS     ++Q +  KD+    G++   +I ESS  P+ GRAS ILR LKIN
Sbjct: 1961 GAGPVFGRLKSATKRGRNQFSSTKDKILEFGVNMYFVIPESSLHPVAGRASVILRCLKIN 2020

Query: 905  LLDMDAALPEDAFRKSRSSPDRRCAWRAFLKSSKSIYEMVQATIVFEDMIKTEYLRNDWW 726
            LLD+DAALPE+A R SR   +RR  WRAF+KS+ +IYEMVQATI+ ED IKTEYL+NDWW
Sbjct: 2021 LLDIDAALPEEALRVSRLQSERRRVWRAFVKSAATIYEMVQATIILEDAIKTEYLKNDWW 2080

Query: 725  YWSSPSTAARITTLSALALRIYSLDSAISYEEPLPTAAAMEVSEPSNAIEEEIQKSPTLK 546
            YWSSPS AARI+TLSALALR+Y+LDSAI Y++     ++ + SE     +EE +  P   
Sbjct: 2081 YWSSPSAAARISTLSALALRVYALDSAILYDK----LSSQDASETD--CKEEREPPPRNS 2134

Query: 545  NLASPSSPTLQK--TPEPDSSENP 480
               + +SP+ +K   PEP  S  P
Sbjct: 2135 VPTNTASPSKKKPLDPEPAESSRP 2158


>ref|XP_004239350.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
            [Solanum lycopersicum]
          Length = 2151

 Score =  721 bits (1860), Expect = 0.0
 Identities = 442/1092 (40%), Positives = 618/1092 (56%), Gaps = 99/1092 (9%)
 Frame = -3

Query: 3458 DEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFVDD 3279
            D++KVFCNL GR +L+PNDND+EGLLG+PAMVSRPLDFRTID++LA+G+YGGSHE+F+D+
Sbjct: 1062 DKNKVFCNLSGRTVLSPNDNDNEGLLGHPAMVSRPLDFRTIDVKLAAGSYGGSHESFIDE 1121

Query: 3278 VREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSLKE 3099
            VREVW NI TAY ++S+L+++A +L +KFE+ YEKEVL L+Q + + S+  + S++  K+
Sbjct: 1122 VREVWHNIRTAYCNKSNLLELAGSLLQKFEEDYEKEVLPLVQ-IIECSNDSNLSSEVAKD 1180

Query: 3098 RDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPEG 2919
            RD LL  V  S LP+APW+EGLCKVC MDKDD NVLLCDKCDSEYH YCLDPPL+++P G
Sbjct: 1181 RDGLLAHVNESSLPKAPWEEGLCKVCSMDKDDVNVLLCDKCDSEYHTYCLDPPLVKVPIG 1240

Query: 2918 NWYCPSCVTGQSLPSGTGYASGSNQRR---KRRNQGEFSSKFLEELSRFAKLMEMKEHWE 2748
             WYCP C     +      +SGS+  R   KRR + + + KF+E+LS+  + ME+KE+WE
Sbjct: 1241 PWYCPDCEA--KISRSQNASSGSHTIRQCVKRRLRRKLTHKFMEKLSQLTRTMELKEYWE 1298

Query: 2747 FTVEERIFFLKFLFDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXL 2568
              +E+RIF LKFL  E L+SA +RDH+++ AS +A+LQ KLRSL +E             
Sbjct: 1299 IPLEDRIFLLKFLCGEMLSSAILRDHIDRSASLSAELQQKLRSLGAELKLLKHKKEILTA 1358

Query: 2567 SAEK---------------TNSGVLNVRG-DLNSDASSSQHASENITRGKPSEKLVGDQS 2436
              +                +N   L V+G D  S  SS       +  G    K      
Sbjct: 1359 KLKNDARSSGDAGSDTSLWSNDCKLKVQGPDSGSHNSSISGGCRQLDDGTQHNKCNDFNK 1418

Query: 2435 QP----EKIIV-KT-SEGPNWLSEKPISVQQPRSDQ---GHTSLLN------------NV 2319
            Q      KII  KT + G N +   P  +   +  Q    +   LN            N+
Sbjct: 1419 QSCLYTSKIIQDKTCASGTNHIRNSPDPINHLQHQQLLKENARSLNTSSHAKCGTEETNL 1478

Query: 2318 QSPLFSSPTSERETELVQ--------------------------CPNQGDMPSSQLNNLK 2217
            Q+ LF S T ++ET+ +                           C      P  +    +
Sbjct: 1479 QNDLFMSTTVQQETDQIPGNRLESAQSSSKSIMLFATHIVSATTCLGSVSNPLEEALLFE 1538

Query: 2216 ACTVKQEITNLLDSIADIELELVKISLRRDFLGRDSIGRVYWAFCYPGARPWIIACGGTA 2037
               +K+EI  L DSIA  EL+L ++S+R++++G+DS GR+YW F    +   +     + 
Sbjct: 1539 MSAIKKEIRALEDSIAAKELDLQEVSVRKEYMGQDSEGRLYWTFGRSTSSRLVAYASTST 1598

Query: 2036 HKER---------------------CPRDFSSIPDSDKWMYYESDSEIEKLVGWLRENNV 1920
              E                       P +   +P+ ++W  Y+SD + E L+ WL+E++ 
Sbjct: 1599 QPESSGHLWSYGVESSRRSGVLDSSAPWENMGLPNLEQWTSYQSDVDTEILIRWLKEHDP 1658

Query: 1919 REKELKESISQFQANKLKDSEYTEDHILKKREIN-----HGGRKTVSADFLATNAMNALE 1755
            RE+ELKESI Q++  +     Y E H      +N            ++D L T A+ A++
Sbjct: 1659 RERELKESILQWRDTRKMIYYYLESHGHDTVGLNTSIPSEDSGSCFNSDSLVTRAVTAIK 1718

Query: 1754 KKFGPCKRIETTAVPQNL--VMGASQSGGMHRCECLEMLWPSKDHCGSCHQSFSTSEELR 1581
            K    C   E T +  NL   +  S  G ++RCECLE LWPS+ HC SCHQ+FS ++E +
Sbjct: 1719 KMVSGCSTEEETGICTNLGVKVRVSFDGELYRCECLEPLWPSRPHCLSCHQTFSDAKERQ 1778

Query: 1580 QHVKENCKAVASGSKRSQAAEDMSK-RKKARNAVSQEKRPASVGIPQGSTLEKQIDGSAS 1404
            +H  E C+  +S  +  + +E   K ++KA N + Q+   +++   +    +K  +  AS
Sbjct: 1779 KHANEKCRIDSSIQRDGETSEQPVKCKRKANNEILQDNSLSTIDCRR----DKHGNAPAS 1834

Query: 1403 VESYNA-DCPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSALTVG 1227
             E+    +CPF  EEI  +FI  SS+K+ VNEIGLIG  G PSF+ G SPY+ DSAL + 
Sbjct: 1835 AENQTKQECPFKLEEIKAQFITQSSLKELVNEIGLIGCNGTPSFVPGTSPYLCDSALGLL 1894

Query: 1226 LERTNEA-SSRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSRS--AENGLSDELSIVGRLK 1056
             +R +E     S DL S +       + NG  F   N S +     NGL+    + GRLK
Sbjct: 1895 SQREDEVCGGNSTDLLSSEHQ-----LRNGVKFSCINNSDKPNCTGNGLAGAGPVFGRLK 1949

Query: 1055 SILTSEKDQVTPMKDRSSLLGLSKSTIIRESSSRPLVGRASEILRVLKINLLDMDAALPE 876
            S     +D+ +  KD+    G++   +I ESS  P+ GRAS ILR LKINLLD+DAALPE
Sbjct: 1950 SATKRGRDKFSSTKDKILEFGVNMYFVIPESSLHPVAGRASVILRCLKINLLDIDAALPE 2009

Query: 875  DAFRKSRSSPDRRCAWRAFLKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYWSSPSTAAR 696
            +A R SR  P+RR  WRAF+KS+ +IYEMVQATI+ ED IKTEYL+NDWWYWSSPS AAR
Sbjct: 2010 EALRVSRLQPERRRVWRAFVKSAATIYEMVQATIILEDAIKTEYLKNDWWYWSSPSAAAR 2069

Query: 695  ITTLSALALRIYSLDSAISYEEPLPTAAAMEVSEPSNAIEEEIQKSPTLKNLASPSSPTL 516
             +TLSALALR+Y+LDSAI Y++     ++ + SE     E E  ++    N ASPS    
Sbjct: 2070 NSTLSALALRVYALDSAILYDK----LSSQDASETDCKEEREPPRNSVPTNTASPSKKK- 2124

Query: 515  QKTPEPDSSENP 480
               PEP  S  P
Sbjct: 2125 PLDPEPAESSRP 2136


>ref|XP_006483833.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
            isoform X2 [Citrus sinensis]
          Length = 2084

 Score =  713 bits (1841), Expect = 0.0
 Identities = 429/1074 (39%), Positives = 609/1074 (56%), Gaps = 81/1074 (7%)
 Frame = -3

Query: 3464 ASDEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFV 3285
            A+DE++VFCNLLGR +LN +DNDDEGLLG+PAMVSRPLDFRTIDLRLA GAYGGSHEAF+
Sbjct: 1015 AADEERVFCNLLGRTLLNTSDNDDEGLLGFPAMVSRPLDFRTIDLRLAFGAYGGSHEAFL 1074

Query: 3284 DDVREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSL 3105
            +DVREVW +I TAY D+SDL+ +A  L + FE LY+KEVLTL+QK AD   ++  ++++ 
Sbjct: 1075 EDVREVWHHICTAYSDQSDLLQLAGKLCQNFEVLYKKEVLTLVQKFADYPSLECLNSEAK 1134

Query: 3104 KERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIP 2925
            KE +D+L     S +P+APWDEG+CKVCG+DKDDDNVLLCD CDS YH YCL PPL R+P
Sbjct: 1135 KEMEDILESA--SEIPKAPWDEGVCKVCGIDKDDDNVLLCDTCDSGYHTYCLTPPLTRVP 1192

Query: 2924 EGNWYCPSCVTGQSLPSGTGYASGSNQR-RKRRNQGEFSSKFLEELSRFAKLMEMKEHWE 2748
            EGNWYCP C++G             + R  KRR+QGEF+ + LEE+   A  MEM+++W+
Sbjct: 1193 EGNWYCPPCLSGNCKNKYMSQVPHVSSRIPKRRHQGEFTCRILEEVFHLAATMEMRDYWD 1252

Query: 2747 FTVEERIFFLKFLFDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXL 2568
            ++ +ERIF LKFL DE LNS  +R+H+E+CAS + DLQ K+RSL+ E             
Sbjct: 1253 YSDKERIFLLKFLCDELLNSTNIREHLERCASVSVDLQQKIRSLSLEWRNLKFREEILAG 1312

Query: 2567 SAEKTNSGVLNVRGDLNSDASSSQHASENITRGKPS------EKLVGDQSQPEKIIV--K 2412
               +  + VL+  G   ++  ++ +        +PS        L  D +  E  +   +
Sbjct: 1313 KVARDKASVLSGTGKCGTEGVATLYPHYGKLMRQPSGGGGYFSSLASDLALSEDGLQLNE 1372

Query: 2411 TSEGPNWLSEKPISVQQPRSDQGHTSLLNNVQSPLFSSPTSERETELVQCPNQGDMPSSQ 2232
            + +   W + K IS++QP   +         +S +     SE++   V    Q D+P S 
Sbjct: 1373 SRKLSCWFNLKGISMRQPSCSRNQIGEAPYTESQVHQE--SEKDNIRVD-DLQYDVPHSA 1429

Query: 2231 LNNLKACTVKQEIT----------------------------------------NLLDSI 2172
                K  T  +  T                                        +L DSI
Sbjct: 1430 SQPQKQDTAGEYATWRNKGQDLENGHTSGPLQPNCEASQSHFSSDHTNGNQVAEHLCDSI 1489

Query: 2171 ADIELELVKISLRRDFLGRDSIGRVYWAFCYPGARPWIIACGGTA-HKERCPRD------ 2013
            A +E + + +SLR++ LGRDS GR+YWAF  P   PW++    T   +ER  ++      
Sbjct: 1490 AGLESQQLAVSLRKELLGRDSAGRLYWAFFRPNTSPWLLVDATTVLEQERILKEHGDSLA 1549

Query: 2012 -------FSSIPDSDKWMYYESDSEIEKLVGWLRENNVREKELKESISQFQANKLKD--- 1863
                   ++ I  S  W  Y+SD+EIE+L+ WL +++ R+KEL ESI ++     KD   
Sbjct: 1550 NSPFEEEYNGISASSSWFSYQSDTEIEELIQWLSDSDPRDKELAESILRWTKIGYKDLKI 1609

Query: 1862 -SEYTEDHILKKREINHGGRKTVSADFLATNAMNALEKKFGPCKRIETTAVPQNLVMGAS 1686
               + ED  +           TV +  L T A+  LE+K GPC   E   +   L   + 
Sbjct: 1610 AGNHIEDESVPSSSKCRKSEATVKSSGLVTKALTVLEEKHGPCLEPEVLKMSMKLDTNSE 1669

Query: 1685 QS--GGMHRCECLEMLWPSKDHCGSCHQSFSTSEELRQHVKENCKAVASGSKRSQAAEDM 1512
             +    M+RCECLE + P++ HC  CH SFS   EL +H    C   A+ S+ S+  ++ 
Sbjct: 1670 LTCKERMYRCECLEPVLPTRFHCRRCHLSFSARNELEEHNDAKCILSATSSQNSKEDDE- 1728

Query: 1511 SKRKKARNAVSQEKRPASVGIPQGSTLEKQIDGSASVESYNAD----CPFNFEEIMTRFI 1344
              R K    +  E   A      G  + + +    ++ S+       CPFNFEEI T+FI
Sbjct: 1729 --RTKGAGTIRTETLQAECMETAGKGMSQSLKHGTAMGSFEIPKEFACPFNFEEISTKFI 1786

Query: 1343 VPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSALTVGLERTNEAS--SRSNDLRSKQK 1170
              +S+K+ V EIGLIGS G+P+F+   SPY+ D +L +     NE +  ++S +L +  +
Sbjct: 1787 TKNSIKELVQEIGLIGSNGVPAFVPSTSPYLCDPSLKLVEMCKNEINRGNKSTNLENLFQ 1846

Query: 1169 NAEPGDVMNGRGFKD--SNRSSRSAENGLSDELSIVGRLKSILTSEKDQVTPMKDRSSLL 996
             +  GD+++G    +  +N S R   +   D++    RL     +EK      +D+S  L
Sbjct: 1847 YSIVGDMVSGLEHDNISNNSSRRCTVSHNDDDVLKCRRLNPNFMNEK------RDQSFSL 1900

Query: 995  ----GLSKSTIIRESSSRPLVGRASEILRVLKINLLDMDAALPEDAFRKSRSSPDRRCAW 828
                G+  S+I+R++S  PL+GR  EILR LKINLLDMDAA+PE+A R S++  + R AW
Sbjct: 1901 SLKPGIGNSSIVRDTSLMPLMGRGIEILRQLKINLLDMDAAVPEEALRSSKACWENRSAW 1960

Query: 827  RAFLKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYWSSPSTAARITTLSALALRIYSLDS 648
            RAF+KS+KSI+EMVQATIVFEDMIKT+YLRN WWYWSS S AA I T+SALALR+Y+LD+
Sbjct: 1961 RAFVKSAKSIFEMVQATIVFEDMIKTDYLRNGWWYWSSLSGAANIATVSALALRLYTLDA 2020

Query: 647  AISYEEPLPTAAAMEVSEPSNAIEEEIQKSPTLKNLASPSSPTLQKTPEPDSSE 486
            AI YE+    + ++E+ E  +  ++E       K+   PS   L KT   D +E
Sbjct: 2021 AIVYEK---HSDSIEIQEHISQPDKETSPCKDSKSNPKPSKAIL-KTQSSDLTE 2070


>ref|XP_007217135.1| hypothetical protein PRUPE_ppa000046mg [Prunus persica]
            gi|462413285|gb|EMJ18334.1| hypothetical protein
            PRUPE_ppa000046mg [Prunus persica]
          Length = 2154

 Score =  709 bits (1830), Expect = 0.0
 Identities = 442/1112 (39%), Positives = 619/1112 (55%), Gaps = 124/1112 (11%)
 Frame = -3

Query: 3464 ASDEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFV 3285
            A+D+ KVFCNLLGR ++N +DNDDEGLLG PAMVSRPLDFRTIDLRLA+G+YGGSHEAF+
Sbjct: 1065 AADDTKVFCNLLGRKLINSSDNDDEGLLGSPAMVSRPLDFRTIDLRLAAGSYGGSHEAFL 1124

Query: 3284 DDVREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSL 3105
            +DVRE+W N+  AYGD+ DL+++A+ L++ FE LYEKEV+TL+ K+A+ + ++  SA+  
Sbjct: 1125 EDVRELWSNLRIAYGDQPDLVELAETLAQTFETLYEKEVITLVHKLAETAKLECLSAERK 1184

Query: 3104 KERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIP 2925
            KE DDLL     S +P+APWD+G+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL RIP
Sbjct: 1185 KEIDDLLAS--TSGIPKAPWDDGVCKVCGIDKDDDSVLLCDTCDAEYHTYCLNPPLARIP 1242

Query: 2924 EGNWYCPSCVTG-QSLPSGTGYASGSNQRRKRRNQGEFSSKFLEELSRFAKLMEMKEHWE 2748
            EGNWYCPSCV   Q +   + +     + R++  QGE +  +LE L+  +  ME  E+WE
Sbjct: 1243 EGNWYCPSCVVSKQMVQDASEHHQVIRKCRRKNYQGEVTRTYLEALTLLSMKMEENEYWE 1302

Query: 2747 FTVEERIFFLKFLFDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXL 2568
            F V+ER F LKFL DE LNSA +R H+E C+  +A+LQ KLRSL++E             
Sbjct: 1303 FNVDERTFLLKFLCDELLNSAVIRQHLEHCSETSAELQQKLRSLSAEWKNLKSKEEILIA 1362

Query: 2567 SAEKTN--------------------------------SGVLNV-----------RG-DL 2520
             A K +                                S   NV           RG D 
Sbjct: 1363 KAAKVDPSLEEDGVKEGLSTSVENHEKFVLQAHALSGRSNSFNVVSDDVPALEGARGLDK 1422

Query: 2519 NSDAS----SSQHASENITRGKPSEKLVGDQSQPEKI----IVKTSEGPNWLSEKPISVQ 2364
            +  AS    SSQH+ +   R K     V D   P  +      + S+  + L E P S  
Sbjct: 1423 HPSASNAEYSSQHSVDTEARAKDVHAAVHDTGTPGNVSSNAASEKSDISSRLIEFPSSNS 1482

Query: 2363 QPRSDQGHTSLLN-----------NVQSPLFSS----PTSERETELVQCPNQGDMPSSQL 2229
             P    G    +            +V  PL       P+  R   + Q  +   +  SQ 
Sbjct: 1483 LPHEINGSIGKIGCLGHPQDNMEMDVSLPLDQQGVCIPSDVRSNHVGQHMSPASVNESQA 1542

Query: 2228 NNLKACTVKQEITNLLDSIADIELELVKISLRRDFLGRDSIGRVYWAFCY---------- 2079
             +L+  +VK +++ L DSI  ++ EL K+S+RR+FLG DS+G +YWA  +          
Sbjct: 1543 YHLELNSVKSDLSLLQDSITSVDFELSKLSVRREFLGIDSLGGLYWASGHSRIVVDRTVS 1602

Query: 2078 --------PGARP-W----IIACGGT---------AHKERCPRDF---SSIPDSDKWMYY 1974
                     G  P W      +C  T           K  CP  F   S++  S  W+ Y
Sbjct: 1603 VQDGMNMTDGRDPVWRGSVTQSCASTGVDSSLPLEGSKAGCPYLFEPNSAVAFSAPWVSY 1662

Query: 1973 ESDSEIEKLVGWLRENNVREKELKESISQFQANKL----KDSEYTEDHILKKREINHGGR 1806
            ++D+EI+ L+GWL++ N +E+ELKESI Q++ ++     K    ++D +L    +   G 
Sbjct: 1663 QTDAEIDGLIGWLKDKNPKERELKESILQWKKSRFHKFQKTRSQSQDELLTAISVARNGE 1722

Query: 1805 KTVSADFLATNAMNALEKKFGPCKRIETTAVPQNLVMGA--SQSGGMHRCECLEMLWPSK 1632
            KT S D L T A   LEK +GPC  +ETT + +     A  +    M+RCECLE +WP++
Sbjct: 1723 KTES-DCLVTRAATLLEKMYGPCSELETTDISKKRGKRARLTNDEKMYRCECLEPIWPNR 1781

Query: 1631 DHCGSCHQSFSTSEELRQHVKENCKAVASGSKRSQAAEDMSK----------RKKARNAV 1482
             HC SCH++F    EL  H    C   ++  ++ +   D SK          R++ R  +
Sbjct: 1782 HHCLSCHRTFVADAELEGHNDGRCVPFSAACEKGKEISDSSKVKGSLKCEINREECRGEL 1841

Query: 1481 SQEKRPASVGIPQGSTLEKQIDGSASVESYNADCPFNFEEIMTRFIVPSSVKDGVNEIGL 1302
            +  +   SV     + L K  +G          CP++FEEI ++F+   S KD + EIGL
Sbjct: 1842 NSVETSKSVHSELSAKLIKFQNGGLV-------CPYDFEEICSKFVTNDSNKDLIQEIGL 1894

Query: 1301 IGSGGLPSFLQGQSPYVSDSALTVGLERTNEASSRSNDLRSKQKNAEPGDVMNGRGFKD- 1125
            IGS G+PSF+   SPY+SDS  T  L    +     N   + ++      V+ G+   D 
Sbjct: 1895 IGSQGVPSFVPSLSPYLSDS--TQQLVTQKDVGVHGNGPEAAEQL-----VLQGKTNVDI 1947

Query: 1124 SNRSSRSAENG--LSDELSIVGRLKSILTSEKDQVTPMKDRSSLLGLSKSTIIRESSSRP 951
            +  SS S + G  L+  +  +G L      EK +  P    SS++G  +  ++ +SS RP
Sbjct: 1948 AGCSSLSGKGGGLLNANIPTLGCL------EKREKRPSGSHSSVVGAGRFCVVPQSSLRP 2001

Query: 950  LVGRASEILRVLKINLLDMDAALPEDAFRKSRSSPDRRCAWRAFLKSSKSIYEMVQATIV 771
            LVG+  +I R LKINLLD+DAALPE+A R S+S  +RR AWR F+K++ +IYEMVQATIV
Sbjct: 2002 LVGKVCQISRRLKINLLDIDAALPEEALRPSKSHLERRWAWRTFVKAAVTIYEMVQATIV 2061

Query: 770  FEDMIKTEYLRNDWWYWSSPSTAARITTLSALALRIYSLDSAISYEEPLPTAAAMEVSEP 591
             EDMIKTEYLRN+WWYWSS S AA+I+TLSALALRIYSLDSAI YE+  P++  ++  EP
Sbjct: 2062 LEDMIKTEYLRNEWWYWSSFSAAAKISTLSALALRIYSLDSAIMYEKMFPSSDPVDKLEP 2121

Query: 590  SNAIEEEIQK--SPTLKNLASPSSPTLQKTPE 501
            S+ ++ ++      T +   S  S   +K PE
Sbjct: 2122 SSVLDLKLLPILDSTERTKLSRKSNKKRKEPE 2153


>ref|XP_007031430.1| Methyl-CpG-binding domain-containing protein 9, putative isoform 1
            [Theobroma cacao] gi|590645754|ref|XP_007031431.1|
            Methyl-CpG-binding domain-containing protein 9, putative
            isoform 1 [Theobroma cacao] gi|508710459|gb|EOY02356.1|
            Methyl-CpG-binding domain-containing protein 9, putative
            isoform 1 [Theobroma cacao] gi|508710460|gb|EOY02357.1|
            Methyl-CpG-binding domain-containing protein 9, putative
            isoform 1 [Theobroma cacao]
          Length = 2225

 Score =  703 bits (1814), Expect = 0.0
 Identities = 430/1102 (39%), Positives = 618/1102 (56%), Gaps = 112/1102 (10%)
 Frame = -3

Query: 3464 ASDEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFV 3285
            A+D+ K+FCNLLGR ++N +DNDDEGLLG PAMVSRPLDFRTIDLRLA GAYGGSHEAF+
Sbjct: 1148 AADDSKIFCNLLGRKLMNSSDNDDEGLLGSPAMVSRPLDFRTIDLRLAVGAYGGSHEAFL 1207

Query: 3284 DDVREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSL 3105
             DVRE+W N+ TAY D+ DL+++A++LS+ FE LYE+EVLTL+QK+A+ + ++  +A++ 
Sbjct: 1208 KDVRELWSNVRTAYTDQPDLVELAESLSQNFESLYEQEVLTLVQKLAEYAKLECLNAETK 1267

Query: 3104 KERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIP 2925
            KE +DLL     S +P+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL RIP
Sbjct: 1268 KEINDLLAS--TSEIPKAPWDEGVCKVCGIDKDDDSVLLCDTCDAEYHTYCLNPPLARIP 1325

Query: 2924 EGNWYCPSCVTGQSL-PSGTGYASGSNQRRKRRNQGEFSSKFLEELSRFAKLMEMKEHWE 2748
            EGNWYCPSCV  + +    + ++    +RR ++ QGE +  +LE L+    ++E KE+W+
Sbjct: 1326 EGNWYCPSCVLSKRMVQDASEHSQVIIRRRDKKYQGEVTRGYLEALAHLGAVLEEKEYWQ 1385

Query: 2747 FTVEERIFFLKFLFDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXL 2568
            F+++ERIF LKFL DE LNSA +R H+EQCA   ++L  KLRS   E             
Sbjct: 1386 FSIDERIFLLKFLCDELLNSALIRQHLEQCA-ETSELHQKLRSAYVEWKNLKSREDFVAA 1444

Query: 2567 SAEKTNSGVLNVRGD---------LNSDA-------------SSSQHASENITRG----- 2469
             A K ++ + N  GD         L SD              +S+ +  +N T       
Sbjct: 1445 KAAKIDTSMSNAVGDVGVKDGDDWLPSDGGKEGADLNGSNKYASATYTEKNFTANGQTLN 1504

Query: 2468 --KPSEKLVGDQSQPEKIIVKTSEG-----------PNWLSEKPISVQQPRSDQGHTSLL 2328
                  +L GDQ+  +   V + +            PN LS++  +  +  S QG    L
Sbjct: 1505 PMDTEAQLKGDQAIVDASKVSSQKSDKSFRPSELLVPNHLSQEIENSSKETSFQGK---L 1561

Query: 2327 NNVQSPLFSSPTSERETELVQCPNQG--DMPS-----SQLNNLKACTVKQEITNLLDSIA 2169
               +    +SP S  +      P+     +PS     SQ ++L+  T+K +I  L D I 
Sbjct: 1562 EESKGMDVASPPSPSDCNGQFPPSDAAKQVPSVTENESQSHHLELNTIKNDIQRLQDLIT 1621

Query: 2168 DIELELVKISLRRDFLGRDSIGRVYWAFCYPGARP------------------------- 2064
             +E +L+K+S+R++FLG DS GR+YW    PG  P                         
Sbjct: 1622 SLESQLLKLSVRKEFLGSDSAGRLYWISAMPGGYPQVIVDGSLVLQKKRKFLGYEERVQN 1681

Query: 2063 ---WIIACGGT-------AHKERCPRDFSS---IPDSDKWMYYESDSEIEKLVGWLRENN 1923
               W  A  GT         K  CP  ++S   I     W+ Y++++EIE L+ WL +N 
Sbjct: 1682 TFIWNSASAGTDNGMKAEGSKASCPFLYNSKDAISVGSPWVTYQTEAEIEGLIDWLNDNE 1741

Query: 1922 VREKELKESISQFQANKLKDSEY------TEDHILKKREINHGGRKTVSADFLATNAMNA 1761
             +EKELKE+I Q    KLK  +Y       +D       ++ G  K   + FL T A   
Sbjct: 1742 PKEKELKEAILQ----KLKFQDYQKMKNQDQDECQTAFSMSSGSDKGSFSSFLGTKAAML 1797

Query: 1760 LEKKFGPCKRIETTAVPQNLVMGASQSGG--MHRCECLEMLWPSKDHCGSCHQSFSTSEE 1587
            LEKK+GPC + E T   +     A    G  M+RC+CLE +WPS++HC SCH++F +  E
Sbjct: 1798 LEKKYGPCFKSEITDSLKKRGKKARVINGDKMYRCKCLEPIWPSRNHCISCHKTFFSDVE 1857

Query: 1586 LRQHVKENCKAVASGSKRSQAAEDMSKRKKARNA-VSQEKRPASVGIPQGSTLEKQIDGS 1410
               H    C   +  +++S +  D  K K   N  +++      + I + S        S
Sbjct: 1858 FEDHNDGKCNLGSPLNEKSTSVGDSLKGKGNMNIDINRVDCTVDMEIVETSKSGHSELSS 1917

Query: 1409 ASVESYNAD--CPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSAL 1236
              ++  N    CP+NFEEI T+F+   S ++ V EIGLIGS G+PSF+   S +VSDS L
Sbjct: 1918 RLIKFQNEGLVCPYNFEEISTKFVTRDSNEELVREIGLIGSNGVPSFVSSVSHFVSDSTL 1977

Query: 1235 TVGLERTNEASSRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSRSAENGLSDELSIVGRLK 1056
                            +R  Q+  + GD +        ++ +RS  NG+++ LS     +
Sbjct: 1978 MT--------------VRPHQERGDLGDKLKATEMPGFSQGNRSVANGINERLSDNSFRR 2023

Query: 1055 SILT---------------SEKDQVTPMKDRSSLLGLSKSTIIRESSSRPLVGRASEILR 921
            S+ +                ++D+++     S  LG+ +  ++ +SS RPLVG+ S+I R
Sbjct: 2024 SVASEIEVQRTIRPALRCLEQRDRISSADKYSPELGIGRCCVVPQSSLRPLVGKVSQISR 2083

Query: 920  VLKINLLDMDAALPEDAFRKSRSSPDRRCAWRAFLKSSKSIYEMVQATIVFEDMIKTEYL 741
             LKINLLDMDAAL E+A R S++  +RR AWR+F+KS+++IYEMVQATIV EDMIKTEYL
Sbjct: 2084 QLKINLLDMDAALSEEALRPSKACMERRWAWRSFVKSAETIYEMVQATIVLEDMIKTEYL 2143

Query: 740  RNDWWYWSSPSTAARITTLSALALRIYSLDSAISYEEPLPTAAAMEVSEPSNAIEEEIQK 561
            RN+WWYWSS S A +I+T+S+LALRIYSLDSAI YE+      +++  +PS+  + ++  
Sbjct: 2144 RNEWWYWSSLSAAVKISTVSSLALRIYSLDSAIIYEKSF-EFHSIDNLKPSSIPDPKLLP 2202

Query: 560  SPTLKNLASPSSPTLQKTPEPD 495
            +  L      S  T +K  EP+
Sbjct: 2203 NLDLAEKCKVSRKTSKKRKEPE 2224


>ref|XP_002274643.2| PREDICTED: methyl-CpG-binding domain-containing protein 9-like [Vitis
            vinifera]
          Length = 2164

 Score =  693 bits (1788), Expect = 0.0
 Identities = 425/1075 (39%), Positives = 605/1075 (56%), Gaps = 115/1075 (10%)
 Frame = -3

Query: 3464 ASDEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFV 3285
            A+D+ KVFC LLG  ++N  DNDDEGLLG PAMVSRPLDFRTIDLRLA GAYGGS E F+
Sbjct: 1079 AADDAKVFCTLLGSKLINSIDNDDEGLLGTPAMVSRPLDFRTIDLRLAVGAYGGSWETFL 1138

Query: 3284 DDVREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSL 3105
            +DVRE+W NIHTAY D+ D +++A  LS+ FE ++EKEVL L+QK  + +  +  SA++ 
Sbjct: 1139 EDVRELWNNIHTAYADQPDSVELARTLSQNFESMFEKEVLPLVQKFTEYAKSECLSAETE 1198

Query: 3104 KERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIP 2925
            KE DD LV    S +P+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL RIP
Sbjct: 1199 KEIDDFLVSA--SEIPKAPWDEGVCKVCGIDKDDDSVLLCDMCDAEYHTYCLNPPLARIP 1256

Query: 2924 EGNWYCPSCVTGQSLPSGTGYASGSNQRRKRRNQGEFSSKFLEELSRFAKLMEMKEHWEF 2745
            EGNWYCPSCV G S+   + +     QR+ +  QG+F+  +LE L+  A  ME KE+WE 
Sbjct: 1257 EGNWYCPSCVAGISMVDVSEHTHVIAQRQGKNCQGDFTHAYLESLAHLAAAMEEKEYWEL 1316

Query: 2744 TVEERIFFLKFLFDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXLS 2565
            +V++R F  KFL DE LN+A +R H+EQCA  +A+LQ KLRS++ E              
Sbjct: 1317 SVDQRTFLFKFLCDELLNTALIRQHLEQCAESSAELQQKLRSISVEWKNLKLKEENLAAR 1376

Query: 2564 AEKTNSGVLNVRGDLNSDASSSQHASEN----------ITRGKPSEKLVGDQSQPEKIIV 2415
            A K +SG++ V G++ ++   S   + N            R K    L  DQ Q E    
Sbjct: 1377 APKVDSGMIYVAGEVGTEGGLSSALTNNGKCIAKPHTLSDRPKDFGILSNDQLQVE---- 1432

Query: 2414 KTSEG--PNWLSEKPIS-------VQQPRSDQGHT----SLLNNVQSPLFSSP------- 2295
              SEG  PN L + P S         +P  ++G      ++++  Q  +   P       
Sbjct: 1433 GGSEGIRPNGLDKHPSSNCSEGNCTLKPIDNEGQLKEVHAVVDETQVSVDHFPHMVYQGN 1492

Query: 2294 -TSERETEL-VQCPNQGDM--------------PSSQLNNLKAC---------------- 2211
             +S R  EL +Q P Q +M               + + N+L+                  
Sbjct: 1493 GSSCRPNELHLQNPLQQEMDGLGTEFNLQVNMCENMEKNDLQGLHHPSDIRIVHVAEHDS 1552

Query: 2210 ---TVKQEITNLLDSIADIELELVKISLRRDFLGRDSIGRVYWAFCYPGARPWIIACGGT 2040
               ++K +I++L DS+A IE +L+K+S+RR+FLG DS GR+YW    PG  PW++  G  
Sbjct: 1553 ELNSIKNDISDLQDSMASIESQLLKLSVRREFLGSDSAGRLYWILAKPGWHPWVLVDGSM 1612

Query: 2039 AHKER-----------------------------------CP---RDFSSIPDSDKWMYY 1974
            A +++                                   CP   R  +SI    +W+ Y
Sbjct: 1613 ALQKKEKMRYLKNPGDSSVQKNSTSLSMDILSTLGGSNASCPFLYRPNASISICSQWVSY 1672

Query: 1973 ESDSEIEKLVGWLRENNVREKELKESISQFQANKLKDSEYT--EDHILKKREINH-GGRK 1803
            +S  EI+ L+GWL++ + REKELKESI      + +D + T   D +  +  ++     +
Sbjct: 1673 QSGEEIDALIGWLKDADPREKELKESILHLHKLRFRDWKLTGDPDQVDSQTTLSRFPNSE 1732

Query: 1802 TVSADFLATNAMNALEKKFGPC--KRIETTAVPQNLVMGASQSGGMHRCECLEMLWPSKD 1629
               +D L T A   L KK+GP     I  ++   +L    +    M+RCECLE +W S+ 
Sbjct: 1733 NAFSDGLLTKAGILLGKKYGPWFEPEIADSSKKWDLRSKVTNESKMYRCECLEPIWSSRH 1792

Query: 1628 HCGSCHQSFSTSEELRQHVKENCKAVASGSKRSQAAEDMSKRKKARNAV-SQEKRPASVG 1452
            HC SCH++F T  +L +H   +C+   SG   S+ +++ S   K +  + S+  R  S G
Sbjct: 1793 HCPSCHRTFFTDIQLEEHNDGSCR---SGPPTSEKSKENSSHLKGKGTMKSKISREESTG 1849

Query: 1451 ------IPQGSTLEKQIDGSASVESYNADCPFNFEEIMTRFIVPSSVKDGVNEIGLIGSG 1290
                  IP+G   + +       ++    CP++FEEI ++F+  +S K+ V EIGLIGS 
Sbjct: 1850 DIDMVEIPKGGCSQPR-SRLIKFQNEGLVCPYDFEEICSKFVTKNSNKELVQEIGLIGSK 1908

Query: 1289 GLPSFLQGQSPYVSDSALTVGLERTNEASSRSNDLRSKQKNAEPGDVMNGRGFKDSNRSS 1110
            G+PSF+  + PY+SD+ L   L  + E  + + D+   Q N  P     G G    N S 
Sbjct: 1909 GVPSFVSSRPPYISDATLL--LVPSGELKA-TGDMMLAQGNRIPA---GGSGSFSDNSSR 1962

Query: 1109 RSAENGLSDELSIVGRLKSILTSEKDQVTPMKDRSSLLGLSKSTIIRESSSRPLVGRASE 930
             SA N    E S   R       +KD+   + +    + + +  +I +SS RPLVG+  +
Sbjct: 1963 DSAAN----ETSAASRTDKSALEQKDKKYSLNNNGPEMEVGRCCVIPQSSLRPLVGKVYQ 2018

Query: 929  ILRVLKINLLDMDAALPEDAFRKSRSSPDRRCAWRAFLKSSKSIYEMVQATIVFEDMIKT 750
            ILR LKINLLDMDAALPE+A + SR+  ++R AWRAF+KS+++I+EMVQATI+ EDMIKT
Sbjct: 2019 ILRQLKINLLDMDAALPEEALKPSRADLEKRLAWRAFVKSAETIFEMVQATIMLEDMIKT 2078

Query: 749  EYLRNDWWYWSSPSTAARITTLSALALRIYSLDSAISYEEPLPTAAAMEVSEPSN 585
            EYL N WWYWSS S AA+ +T+S+LALRIYSLD+AI+YE+        +  +PS+
Sbjct: 2079 EYLMNGWWYWSSLSAAAKTSTVSSLALRIYSLDAAIAYEKISSNLDLTDSPKPSS 2133


>ref|XP_006446469.1| hypothetical protein CICLE_v10014026mg [Citrus clementina]
            gi|557549080|gb|ESR59709.1| hypothetical protein
            CICLE_v10014026mg [Citrus clementina]
          Length = 1680

 Score =  692 bits (1786), Expect = 0.0
 Identities = 399/1081 (36%), Positives = 585/1081 (54%), Gaps = 91/1081 (8%)
 Frame = -3

Query: 3464 ASDEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFV 3285
            A+D++KVFCNLLGR  L+  DNDDEG LG PAMVSRPLDFRTIDLRLA GAY GS ++F+
Sbjct: 610  AADDEKVFCNLLGRKPLSSTDNDDEGFLGSPAMVSRPLDFRTIDLRLAVGAYDGSRDSFL 669

Query: 3284 DDVREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSL 3105
             DVRE W N+ TA+GD+ D +D+A+ LS+ FE LYE E++TLLQK+   + ++  S ++ 
Sbjct: 670  QDVREFWNNVRTAFGDQPDFVDLAEKLSRNFESLYENEIVTLLQKLVGYAKLESLSEETT 729

Query: 3104 KERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIP 2925
            KE +D+LVQ   S +P+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL+RIP
Sbjct: 730  KEINDILVQ--TSEIPKAPWDEGICKVCGVDKDDDSVLLCDTCDAEYHTYCLEPPLVRIP 787

Query: 2924 EGNWYCPSCVTGQSLPSGTG-YASGSNQRRKRRNQGEFSSKFLEELSRFAKLMEMKEHWE 2748
            EGNWYCPSCV   S+  G   ++    Q + ++ QGE +   LEEL     +ME KE+WE
Sbjct: 788  EGNWYCPSCVVRNSMVQGASEHSQVGGQHKGKKYQGEITRLCLEELRHLTTVMEEKEYWE 847

Query: 2747 FTVEERIFFLKFLFDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXL 2568
            F V ER F LKFL DE LNSA +R H+EQC    A+LQ KLRS + E             
Sbjct: 848  FNVHERTFLLKFLCDELLNSALLRQHLEQCTEVTAELQQKLRSFSVEFKNLKSREETVAA 907

Query: 2567 SAEKTNSGVLNVRGDLNSDASSSQHASENITRGKPSEKLVGDQSQPE-KIIVKTSEGPNW 2391
               K  + + N   ++      +     N   GK  E+     ++    +I     GP +
Sbjct: 908  RVAKVEASMTNSVAEICMKEGPATVIRNN---GKCIEQPQNSSNRSNCSVIALEESGPMY 964

Query: 2390 LSEKPISVQQPRSDQGHTSLLNNVQS------PLFSSPTSE------------------- 2286
             ++    +++P  D        N +S      PL SS   E                   
Sbjct: 965  PTDAEGQIEEPHGDNSKMPSQKNDESIKPNEHPLASSLPQEIDNLSGEIRSQHNLQELAR 1024

Query: 2285 -RETELVQCPNQGDMPS------------------SQLNNLKACTVKQEITNLLDSIADI 2163
             R+   +  P+    PS                   Q +NL+   ++ +I  L +SI  +
Sbjct: 1025 ARDAATLASPSNNQGPSVPNELHVTEGTCSVTMNEPQAHNLELNNIRNDILLLQESITSL 1084

Query: 2162 ELELVKISLRRDFLGRDSIGRVYWAFCYPGARPWIIACGGTAHKER-------------- 2025
            E +L+K+S+RR+FLG DS GR+YW    PG  P +I  G    +++              
Sbjct: 1085 EQQLLKLSVRREFLGSDSSGRLYWVLPLPGMHPCLIVDGSPELQQKRKILDFRGPVDKGL 1144

Query: 2024 ----------------------CP---RDFSSIPDSDKWMYYESDSEIEKLVGWLRENNV 1920
                                  CP     ++    S  W+ Y++D+EIE+LV WLR+N+ 
Sbjct: 1145 VLKNSSSSGSDAYSSSKGSKACCPFQYDPYAVTATSSHWILYQTDAEIEELVNWLRDNDP 1204

Query: 1919 REKELKESISQFQANKLKDSEYTE----DHILKKREINHGGRKTVSADFLATNAMNALEK 1752
            +E+ELK+SI  ++  + +DS++T+    D             K    D L T A   LEK
Sbjct: 1205 KERELKDSILNWKKIRFQDSQHTKKQSWDEYQSASSAPTNSDKVDCFDCLVTKAATLLEK 1264

Query: 1751 KFGPCKRIETTAVPQNLVMGASQSGGMHRCECLEMLWPSKDHCGSCHQSFSTSEELRQHV 1572
            K+GPC   E            +    M+RCECLE +WPS++HC SCH++FST+ E  +H 
Sbjct: 1265 KYGPCFESEEVLKKGGKRARVTSQEKMYRCECLEPIWPSRNHCLSCHRTFSTAVEFEEHN 1324

Query: 1571 KENCKAVASGSKRSQAAEDMSKRKKARNAVSQEKRPASVGIPQGSTLEKQIDGSASVESY 1392
                 A  +  K  +A+  +  +   ++ +S+      V + + S        S  +   
Sbjct: 1325 DTCNSAPPAYEKNKEASNSLKGKGNKKSDISRAACGTDVELVETS------KPSGLIRFQ 1378

Query: 1391 NADCPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSALTVGLERTN 1212
            N  CPF+  EI ++F+   S K+ V EIGL+GS G+PS +   SP++SDS L +   +  
Sbjct: 1379 NDGCPFDLNEISSKFMTQDSNKELVQEIGLLGSKGIPSLIPSVSPFLSDSTLMLMSSQKE 1438

Query: 1211 EASSRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSRSAENGLSDELSIVGRLKSIL--TSE 1038
                    + S+  ++  G         D+     S ++G +    ++   K     + +
Sbjct: 1439 VGVPDGQLMASETLSSSQGKQSMKNAGNDNMADDASRKSGSNGTHEVLKSKKPAFGCSEQ 1498

Query: 1037 KDQVTPMKDRSSLLGLSKSTIIRESSSRPLVGRASEILRVLKINLLDMDAALPEDAFRKS 858
            +D+ +    R   +G+++  ++ +SS RPL+GR S+I R LK+NLLD+DAALPE+A R S
Sbjct: 1499 RDRKSSSHVRVPKVGINQCCVVPQSSLRPLIGRTSQIKRRLKVNLLDIDAALPEEALRPS 1558

Query: 857  RSSPDRRCAWRAFLKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYWSSPSTAARITTLSA 678
            ++  +RR AWRAF+KS+++IYEMVQATI+ EDMIKTE+LRN+WWYWSS S AA+ +T+S+
Sbjct: 1559 KAHLERRWAWRAFVKSAETIYEMVQATIILEDMIKTEFLRNEWWYWSSLSAAAKTSTMSS 1618

Query: 677  LALRIYSLDSAISYEEPLPTAAAMEVSEPSNAIEEEIQKSPTLKNLASPSSPTLQKTPEP 498
            LALRIYSLD+AI Y++       +E  +  +  E +      L   +  S  + +K  EP
Sbjct: 1619 LALRIYSLDAAIIYDKSTTNLNPVENLKLDSTPEHKPLPGVELLEKSKVSRKSNRKRKEP 1678

Query: 497  D 495
            +
Sbjct: 1679 E 1679


>ref|XP_006470356.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
            isoform X2 [Citrus sinensis]
          Length = 2023

 Score =  692 bits (1785), Expect = 0.0
 Identities = 399/1079 (36%), Positives = 583/1079 (54%), Gaps = 89/1079 (8%)
 Frame = -3

Query: 3464 ASDEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFV 3285
            A+D++KVFCNLLGR  L+  DNDDEG LG PAMVSRPLDFRTIDLRLA GAY GSH++F+
Sbjct: 955  AADDEKVFCNLLGRKPLSSTDNDDEGFLGSPAMVSRPLDFRTIDLRLAVGAYDGSHDSFL 1014

Query: 3284 DDVREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSL 3105
             DVRE W N+ TA+GD+ D +D+A+ LS+ FE LYE E++TLLQK+   + ++  S ++ 
Sbjct: 1015 QDVREFWNNVRTAFGDQPDFVDLAEKLSRNFESLYENEIVTLLQKLVGYAKLESLSEETT 1074

Query: 3104 KERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIP 2925
            KE +D+LVQ   S +P+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL+RIP
Sbjct: 1075 KEINDILVQ--TSEIPKAPWDEGICKVCGVDKDDDSVLLCDTCDAEYHTYCLEPPLVRIP 1132

Query: 2924 EGNWYCPSCVTGQSLPSGTG-YASGSNQRRKRRNQGEFSSKFLEELSRFAKLMEMKEHWE 2748
            EGNWYCPSCV   S+  G   ++    Q + + NQGE +   LE L     +ME KE+WE
Sbjct: 1133 EGNWYCPSCVVRNSMVQGASEHSQVGGQHKGKNNQGEITRLCLEALRHLTTVMEEKEYWE 1192

Query: 2747 FTVEERIFFLKFLFDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXL 2568
            F V ER F LKFL DE LNSA +R H+EQC    A+LQ KLRS + E             
Sbjct: 1193 FNVHERTFLLKFLCDELLNSALLRQHLEQCTEVTAELQQKLRSFSVEFKNLKSREETVAA 1252

Query: 2567 SAEKTNSGVLNVRGDLNSDASSSQHASENITRGKPSEKLVGDQSQPE-KIIVKTSEGPNW 2391
               K  + +     ++      +     N   GK  E+     ++    +I     GP +
Sbjct: 1253 RVAKVEASMTYSVAEVCMKEGPATVIRNN---GKCIEQPQNSSNRSNCSVIALEESGPMY 1309

Query: 2390 LSEKPISVQQPRSDQGHTSLLNNVQS------PLFSSPTSE------------------R 2283
             ++    +++P  D        N +S      PL SS   E                  R
Sbjct: 1310 PTDAEGQIEEPHGDNSKMPSQKNDESIKPNEHPLASSLPQEIDNLSGEIRSQHNLQELAR 1369

Query: 2282 ETELVQCPNQGDMPS------------------SQLNNLKACTVKQEITNLLDSIADIEL 2157
            +   +  P+    PS                   Q +NL+   ++ +I  L +SI  +E 
Sbjct: 1370 DAATLASPSNNHGPSVPNELHVTEGTCSVTMNEPQAHNLELNNIRNDILLLQESITSLEQ 1429

Query: 2156 ELVKISLRRDFLGRDSIGRVYWAFCYPGARPWIIACGGTAHKER---------------- 2025
            +L+K+S+RR+FLG DS GR+YW    PG  P +I  G    +++                
Sbjct: 1430 QLLKLSVRREFLGSDSSGRLYWVLPLPGMHPCLIVDGSPELQQKRKILDFRGPVDKGLVL 1489

Query: 2024 --------------------CP---RDFSSIPDSDKWMYYESDSEIEKLVGWLRENNVRE 1914
                                CP     ++    S  W+ Y++D+EIE+LV WLR+N+ +E
Sbjct: 1490 KNSSSSGSDAYSSSKGSKACCPFQYDPYAVTATSSHWILYQTDAEIEELVNWLRDNDPKE 1549

Query: 1913 KELKESISQFQANKLKDSEYTE----DHILKKREINHGGRKTVSADFLATNAMNALEKKF 1746
            +ELK+SI  ++  + +DS++T+    D             K    D L T A   LEKK+
Sbjct: 1550 RELKDSILNWKKIRFQDSQHTKKQSWDEYQSASSAPTNSDKVDCFDCLVTKAATLLEKKY 1609

Query: 1745 GPCKRIETTAVPQNLVMGASQSGGMHRCECLEMLWPSKDHCGSCHQSFSTSEELRQHVKE 1566
            GPC   E            +    M+RCECLE +WPS++HC SCH++FST+ E  +H   
Sbjct: 1610 GPCFESEEVLKKGGKRARVTSQEKMYRCECLEPIWPSRNHCLSCHRTFSTAVEFEEHNDT 1669

Query: 1565 NCKAVASGSKRSQAAEDMSKRKKARNAVSQEKRPASVGIPQGSTLEKQIDGSASVESYNA 1386
               A  +  K  +A+  +  +   ++ +S       V + + S        S  +   N 
Sbjct: 1670 CNSAPPAYEKNKEASNSLKGKGNKKSDISHAAGGTDVELVETS------KPSGLIRFQND 1723

Query: 1385 DCPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSALTVGLERTNEA 1206
             CPF+  EI ++F+   S K+ V EIGL+GS G+PS +   SP++SDS L +   +    
Sbjct: 1724 GCPFDLNEISSKFMTQDSNKELVQEIGLLGSKGIPSLIPSVSPFLSDSTLMLMSPQKEVG 1783

Query: 1205 SSRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSRSAENGLSDELSIVGRLKSIL--TSEKD 1032
                  + S+  ++  G         D+     S ++G +    ++   K     + ++D
Sbjct: 1784 VPDGQLMASETLSSSQGKQSMKNAGNDNMADDASRKSGSNGTHEVLKSKKPAFGCSEQRD 1843

Query: 1031 QVTPMKDRSSLLGLSKSTIIRESSSRPLVGRASEILRVLKINLLDMDAALPEDAFRKSRS 852
            + +    R   +G+++  ++ +SS RPL+GR S+I R LK+NLLD+DAALPE+A R S++
Sbjct: 1844 RKSSSHVRVPKVGINQCCVVPQSSLRPLIGRTSQIKRRLKVNLLDIDAALPEEALRPSKA 1903

Query: 851  SPDRRCAWRAFLKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYWSSPSTAARITTLSALA 672
              +RR AWRAF+KS+++IYEMVQATI+ EDMIKTE+LRN+WWYWSS S AA+ +T+S+LA
Sbjct: 1904 HLERRWAWRAFVKSAETIYEMVQATIILEDMIKTEFLRNEWWYWSSLSAAAKTSTMSSLA 1963

Query: 671  LRIYSLDSAISYEEPLPTAAAMEVSEPSNAIEEEIQKSPTLKNLASPSSPTLQKTPEPD 495
            LRIYSLD+AI Y++       +E  +  +  E +      L   +  S  + +K  EP+
Sbjct: 1964 LRIYSLDAAIIYDKSTTNLNPVENLKLDSTPEHKPLPGVELLEKSKVSRKSNRKRKEPE 2022


>ref|XP_006470355.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
            isoform X1 [Citrus sinensis]
          Length = 2159

 Score =  692 bits (1785), Expect = 0.0
 Identities = 399/1079 (36%), Positives = 583/1079 (54%), Gaps = 89/1079 (8%)
 Frame = -3

Query: 3464 ASDEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFV 3285
            A+D++KVFCNLLGR  L+  DNDDEG LG PAMVSRPLDFRTIDLRLA GAY GSH++F+
Sbjct: 1091 AADDEKVFCNLLGRKPLSSTDNDDEGFLGSPAMVSRPLDFRTIDLRLAVGAYDGSHDSFL 1150

Query: 3284 DDVREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSL 3105
             DVRE W N+ TA+GD+ D +D+A+ LS+ FE LYE E++TLLQK+   + ++  S ++ 
Sbjct: 1151 QDVREFWNNVRTAFGDQPDFVDLAEKLSRNFESLYENEIVTLLQKLVGYAKLESLSEETT 1210

Query: 3104 KERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIP 2925
            KE +D+LVQ   S +P+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL+RIP
Sbjct: 1211 KEINDILVQ--TSEIPKAPWDEGICKVCGVDKDDDSVLLCDTCDAEYHTYCLEPPLVRIP 1268

Query: 2924 EGNWYCPSCVTGQSLPSGTG-YASGSNQRRKRRNQGEFSSKFLEELSRFAKLMEMKEHWE 2748
            EGNWYCPSCV   S+  G   ++    Q + + NQGE +   LE L     +ME KE+WE
Sbjct: 1269 EGNWYCPSCVVRNSMVQGASEHSQVGGQHKGKNNQGEITRLCLEALRHLTTVMEEKEYWE 1328

Query: 2747 FTVEERIFFLKFLFDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXL 2568
            F V ER F LKFL DE LNSA +R H+EQC    A+LQ KLRS + E             
Sbjct: 1329 FNVHERTFLLKFLCDELLNSALLRQHLEQCTEVTAELQQKLRSFSVEFKNLKSREETVAA 1388

Query: 2567 SAEKTNSGVLNVRGDLNSDASSSQHASENITRGKPSEKLVGDQSQPE-KIIVKTSEGPNW 2391
               K  + +     ++      +     N   GK  E+     ++    +I     GP +
Sbjct: 1389 RVAKVEASMTYSVAEVCMKEGPATVIRNN---GKCIEQPQNSSNRSNCSVIALEESGPMY 1445

Query: 2390 LSEKPISVQQPRSDQGHTSLLNNVQS------PLFSSPTSE------------------R 2283
             ++    +++P  D        N +S      PL SS   E                  R
Sbjct: 1446 PTDAEGQIEEPHGDNSKMPSQKNDESIKPNEHPLASSLPQEIDNLSGEIRSQHNLQELAR 1505

Query: 2282 ETELVQCPNQGDMPS------------------SQLNNLKACTVKQEITNLLDSIADIEL 2157
            +   +  P+    PS                   Q +NL+   ++ +I  L +SI  +E 
Sbjct: 1506 DAATLASPSNNHGPSVPNELHVTEGTCSVTMNEPQAHNLELNNIRNDILLLQESITSLEQ 1565

Query: 2156 ELVKISLRRDFLGRDSIGRVYWAFCYPGARPWIIACGGTAHKER---------------- 2025
            +L+K+S+RR+FLG DS GR+YW    PG  P +I  G    +++                
Sbjct: 1566 QLLKLSVRREFLGSDSSGRLYWVLPLPGMHPCLIVDGSPELQQKRKILDFRGPVDKGLVL 1625

Query: 2024 --------------------CP---RDFSSIPDSDKWMYYESDSEIEKLVGWLRENNVRE 1914
                                CP     ++    S  W+ Y++D+EIE+LV WLR+N+ +E
Sbjct: 1626 KNSSSSGSDAYSSSKGSKACCPFQYDPYAVTATSSHWILYQTDAEIEELVNWLRDNDPKE 1685

Query: 1913 KELKESISQFQANKLKDSEYTE----DHILKKREINHGGRKTVSADFLATNAMNALEKKF 1746
            +ELK+SI  ++  + +DS++T+    D             K    D L T A   LEKK+
Sbjct: 1686 RELKDSILNWKKIRFQDSQHTKKQSWDEYQSASSAPTNSDKVDCFDCLVTKAATLLEKKY 1745

Query: 1745 GPCKRIETTAVPQNLVMGASQSGGMHRCECLEMLWPSKDHCGSCHQSFSTSEELRQHVKE 1566
            GPC   E            +    M+RCECLE +WPS++HC SCH++FST+ E  +H   
Sbjct: 1746 GPCFESEEVLKKGGKRARVTSQEKMYRCECLEPIWPSRNHCLSCHRTFSTAVEFEEHNDT 1805

Query: 1565 NCKAVASGSKRSQAAEDMSKRKKARNAVSQEKRPASVGIPQGSTLEKQIDGSASVESYNA 1386
               A  +  K  +A+  +  +   ++ +S       V + + S        S  +   N 
Sbjct: 1806 CNSAPPAYEKNKEASNSLKGKGNKKSDISHAAGGTDVELVETS------KPSGLIRFQND 1859

Query: 1385 DCPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSALTVGLERTNEA 1206
             CPF+  EI ++F+   S K+ V EIGL+GS G+PS +   SP++SDS L +   +    
Sbjct: 1860 GCPFDLNEISSKFMTQDSNKELVQEIGLLGSKGIPSLIPSVSPFLSDSTLMLMSPQKEVG 1919

Query: 1205 SSRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSRSAENGLSDELSIVGRLKSIL--TSEKD 1032
                  + S+  ++  G         D+     S ++G +    ++   K     + ++D
Sbjct: 1920 VPDGQLMASETLSSSQGKQSMKNAGNDNMADDASRKSGSNGTHEVLKSKKPAFGCSEQRD 1979

Query: 1031 QVTPMKDRSSLLGLSKSTIIRESSSRPLVGRASEILRVLKINLLDMDAALPEDAFRKSRS 852
            + +    R   +G+++  ++ +SS RPL+GR S+I R LK+NLLD+DAALPE+A R S++
Sbjct: 1980 RKSSSHVRVPKVGINQCCVVPQSSLRPLIGRTSQIKRRLKVNLLDIDAALPEEALRPSKA 2039

Query: 851  SPDRRCAWRAFLKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYWSSPSTAARITTLSALA 672
              +RR AWRAF+KS+++IYEMVQATI+ EDMIKTE+LRN+WWYWSS S AA+ +T+S+LA
Sbjct: 2040 HLERRWAWRAFVKSAETIYEMVQATIILEDMIKTEFLRNEWWYWSSLSAAAKTSTMSSLA 2099

Query: 671  LRIYSLDSAISYEEPLPTAAAMEVSEPSNAIEEEIQKSPTLKNLASPSSPTLQKTPEPD 495
            LRIYSLD+AI Y++       +E  +  +  E +      L   +  S  + +K  EP+
Sbjct: 2100 LRIYSLDAAIIYDKSTTNLNPVENLKLDSTPEHKPLPGVELLEKSKVSRKSNRKRKEPE 2158


>ref|XP_004167238.1| PREDICTED: LOW QUALITY PROTEIN: methyl-CpG-binding domain-containing
            protein 9-like [Cucumis sativus]
          Length = 1277

 Score =  669 bits (1727), Expect = 0.0
 Identities = 406/1049 (38%), Positives = 602/1049 (57%), Gaps = 84/1049 (8%)
 Frame = -3

Query: 3464 ASDEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFV 3285
            A+D+ KVFCNLLGR ++  +DNDDEGLLG P MVSRPLDFRTIDLRLASG+Y GSHEAF+
Sbjct: 221  AADDAKVFCNLLGRKLMASSDNDDEGLLGPPGMVSRPLDFRTIDLRLASGSYDGSHEAFL 280

Query: 3284 DDVREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSL 3105
            +DV+E+W N+  AYGD+ DL+++ + LS+ FE LYE EVL+L++K+ + S ++  SA++ 
Sbjct: 281  EDVQELWNNLRYAYGDQPDLVELVETLSENFERLYENEVLSLIEKLKEFSKLESLSAETK 340

Query: 3104 KERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIP 2925
             E D  LV +  + +P+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL RIP
Sbjct: 341  VEVDGFLVSL--NEIPKAPWDEGVCKVCGIDKDDDSVLLCDTCDAEYHTYCLNPPLARIP 398

Query: 2924 EGNWYCPSCVTGQSLPSGTGYASGS--NQRRKRRNQGEFSSKFLEELSRFAKLMEMKEHW 2751
            EGNWYCPSCV G  +       +    N  + ++ +GE +  FL +L+  A  +E KE+W
Sbjct: 399  EGNWYCPSCVMGTRMVEDPSEHTKHIINLHKGKKFRGEVTRDFLNKLANLAAALEEKEYW 458

Query: 2750 EFTVEERIFFLKFLFDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXX 2571
            EF+V+ER+F LK+L DE L+SA +R H+EQC    A+LQ KLRS   E            
Sbjct: 459  EFSVDERLFLLKYLCDELLSSALIRQHLEQCVEALAELQQKLRSCFIEWKNLKCREEVVA 518

Query: 2570 LSAEKTNSGVLNV----RGDLN------SDASSSQHASENITRGKPS--EKL-----VGD 2442
              A K ++ +L+     +G  +      SD  SS  + EN      S  E++     V D
Sbjct: 519  ARAAKLDTTMLSAVREGQGSCDGARLGASDQYSSLTSLENKCHNHASFQEQMSSAHDVTD 578

Query: 2441 QSQPEKIIVKTSEGPNWLSEKPISVQQPRSDQGHTSLLNNVQSPLFSSPTSERETELVQC 2262
             +     ++ +S   N  S KP+   +P         L+ +   +  S  S  ETE+   
Sbjct: 579  NNDAGGNVLSSSGSQN--SGKPVKFNEPS--------LSGLPQEVDGSDQSNMETEISIL 628

Query: 2261 PN-----------------QGDMPS-SQLNNLKACTVKQEITNLLDSIADIELELVKISL 2136
            P+                 Q   P+ SQ  + +  ++K++I  + DSIA  ELEL+KIS+
Sbjct: 629  PSGKQYFTPCDANGVPVAPQVPPPNESQAYHSELDSIKKDILQVQDSIASTELELLKISV 688

Query: 2135 RRDFLGRDSIGRVYWAFCYPGARPWIIACGGTAH----------KERCPRDFSSIPDSDK 1986
            RR+FLG D+ GR+YWA       P II+ G + H          K R  ++++S  +++ 
Sbjct: 689  RREFLGSDAAGRLYWASVMSNGLPQIISSGSSVHIGSESRDRVVKGRFFKNYTSTSNANS 748

Query: 1985 W-----MY------------------YESDSEIEKLVGWLRENNVREKELKESISQFQAN 1875
                  MY                  Y+++++I +L+ WL++++ +E+ELKESI Q+   
Sbjct: 749  STLNSNMYSSLLHLPKDFIGNSPCISYQTEADILELIDWLKDSDPKERELKESILQWLKP 808

Query: 1874 KLKDS----EYTEDHILKKREINHGGRKTVSADFLATNAMNALEKKFGPCKRIETTAVPQ 1707
            KL+ S      + +  LK    +    K   + FL   A   LE K+GP     T   P 
Sbjct: 809  KLQTSSRSNNQSPEEQLKDSSSSSDVEKLECSGFLVNRASALLESKYGPFLEFVT---PD 865

Query: 1706 NL-----VMGASQSGGMHRCECLEMLWPSKDHCGSCHQSFSTSEELRQHVKENCKAVASG 1542
            +L         ++   M RC C+E +WPS+ HC SCH+SFST  EL +H    C ++ + 
Sbjct: 866  DLNRWLDKARLAEDEKMFRCVCMEPVWPSRYHCLSCHKSFSTDVELEEHDNGQCSSLPAS 925

Query: 1541 SKRSQAAEDMSKRKKARNAVSQEKRPASVGIPQGSTLEKQIDGSASVESYNAD---CPFN 1371
                +   D SK K      S+++  +S+ I +  T     + S  +  Y  D   CP++
Sbjct: 926  CDGIKEVGDSSKSKCNIKFESKQEESSSMVIAE--TSRGYFNHSMGLIKYQNDGMMCPYD 983

Query: 1370 FEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSALTVGLERTNEASSRSN 1191
            FE I ++F+   S KD + EIGLI S G+PSFL   SPY+ +S L V   + + ++    
Sbjct: 984  FELICSKFLTKDSNKDLIKEIGLISSNGVPSFLSSVSPYIMESTLNVIDLKKDSSTPEDG 1043

Query: 1190 DLRSKQKNAEPGDVMNGRGFKDSNRSSRSAENGLSDELSI--VGRLKSILTSEKDQVTPM 1017
             L S+  + E  +++   G   S+    S +    +E+S     RL +     K + + M
Sbjct: 1044 TLLSEWPSLE--NIILENGCHQSSSIDSSIQKPAGNEISAPKTKRLAAGCLEPKSKKSXM 1101

Query: 1016 KDRSSLLGLSKSTIIRESSSRPLVGRASEILRVLKINLLDMDAALPEDAFRKSRSSPDRR 837
             +R S  G+ +  +I +SS RPLVG+  +++R LK+NLLDMDAALP++A + S+   +RR
Sbjct: 1102 DNRFSEFGIGRCFVIPQSSQRPLVGKILQVVRGLKMNLLDMDAALPDEALKPSKLHIERR 1161

Query: 836  CAWRAFLKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYWSSPSTAARITTLSALALRIYS 657
             AWRAF+KS+ +IYEMVQATI  EDMI+TEYL+N+WWYWSS S AA+I+T+S+LALRI+S
Sbjct: 1162 WAWRAFVKSAGTIYEMVQATIALEDMIRTEYLKNEWWYWSSLSAAAKISTVSSLALRIFS 1221

Query: 656  LDSAISYEEPLPTAAAMEVSEPSNAIEEE 570
            LD+AI YE+  P   + +  + +++I E+
Sbjct: 1222 LDAAIIYEKISPNQDSNDYLDTTSSIPEQ 1250


>ref|XP_004141185.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
            [Cucumis sativus]
          Length = 2131

 Score =  669 bits (1726), Expect = 0.0
 Identities = 408/1050 (38%), Positives = 602/1050 (57%), Gaps = 85/1050 (8%)
 Frame = -3

Query: 3464 ASDEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFV 3285
            A+D+ KVFCNLLGR ++  +DNDDEGLLG P MVSRPLDFRTIDLRLASG+Y GSHEAF+
Sbjct: 1074 AADDAKVFCNLLGRKLMASSDNDDEGLLGPPGMVSRPLDFRTIDLRLASGSYDGSHEAFL 1133

Query: 3284 DDVREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSL 3105
            +DV+E+W N+  AYGD+ DL+++ + LS+ FE LYE EVL+L++K+ + S ++  SA++ 
Sbjct: 1134 EDVQELWNNLRYAYGDQPDLVELVETLSENFERLYENEVLSLIEKLKEFSKLESLSAETK 1193

Query: 3104 KERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIP 2925
             E D  LV +  + +P+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL RIP
Sbjct: 1194 VEVDGFLVSL--NEIPKAPWDEGVCKVCGIDKDDDSVLLCDTCDAEYHTYCLNPPLARIP 1251

Query: 2924 EGNWYCPSCVTGQSL---PSGTGYASGSNQRRKRRNQGEFSSKFLEELSRFAKLMEMKEH 2754
            EGNWYCPSCV G  +   PS        N  + ++ +GE +  FL +L+  A  +E KE+
Sbjct: 1252 EGNWYCPSCVMGTRMVEDPSEHTKNHIINLHKGKKFRGEVTRDFLNKLANLAAALEEKEY 1311

Query: 2753 WEFTVEERIFFLKFLFDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXX 2574
            WEF+V+ER+F LK+L DE L+SA +R H+EQC    A+LQ KLRS   E           
Sbjct: 1312 WEFSVDERLFLLKYLCDELLSSALIRQHLEQCVEALAELQQKLRSCFIEWKNLKCREEVV 1371

Query: 2573 XLSAEKTNSGVLNV----RGDLN------SDASSSQHASENITRGKPS--EKL-----VG 2445
               A K ++ +L+     +G  +      SD  SS  + EN      S  E++     V 
Sbjct: 1372 AARAAKLDTTMLSAVREGQGSCDGARLGASDQYSSLTSLENKCHNHASFQEQMSSAHDVT 1431

Query: 2444 DQSQPEKIIVKTSEGPNWLSEKPISVQQPRSDQGHTSLLNNVQSPLFSSPTSERETELVQ 2265
            D +     ++ +S   N  S KP+   +P         L+ +   +  S  S  ETE+  
Sbjct: 1432 DNNDAGGNVLSSSGSQN--SGKPVKFNEPS--------LSGLPQEVDGSDQSNMETEISI 1481

Query: 2264 CPN-----------------QGDMPS-SQLNNLKACTVKQEITNLLDSIADIELELVKIS 2139
             P+                 Q   P+ SQ  + +  ++K++I  + DSIA  ELEL+KIS
Sbjct: 1482 LPSGKQYFTPCDANGVPVAPQVPPPNESQAYHSELDSIKKDILQVQDSIASTELELLKIS 1541

Query: 2138 LRRDFLGRDSIGRVYWAFCYPGARPWIIACGGTAH----------KERCPRDFSSIPDSD 1989
            +RR+FLG D+ GR+YWA       P II+ G + H          K R  ++++S  +++
Sbjct: 1542 VRREFLGSDAAGRLYWASVMSNGLPQIISSGSSVHIGSESRDRVVKGRFFKNYTSTSNAN 1601

Query: 1988 KW-----MY------------------YESDSEIEKLVGWLRENNVREKELKESISQFQA 1878
                   MY                  Y+++++I +L+ WL++++ +E+ELKESI Q+  
Sbjct: 1602 SSTLNSNMYSSLLHLPKDFIGNSPCISYQTEADILELIDWLKDSDPKERELKESILQWLK 1661

Query: 1877 NKLKDS----EYTEDHILKKREINHGGRKTVSADFLATNAMNALEKKFGPCKRIETTAVP 1710
             KL+ S      + +  LK    +    K   + FL   A   LE K+GP     T   P
Sbjct: 1662 PKLQTSSRSNNQSPEEQLKDSSSSSDVEKLECSGFLVNRASALLESKYGPFLEFVT---P 1718

Query: 1709 QNL-----VMGASQSGGMHRCECLEMLWPSKDHCGSCHQSFSTSEELRQHVKENCKAVAS 1545
             +L         ++   M RC C+E +WPS+ HC SCH+SFST  EL +H    C ++ +
Sbjct: 1719 DDLNRWLDKARLAEDEKMFRCVCMEPVWPSRYHCLSCHRSFSTDVELEEHDNGQCSSLPA 1778

Query: 1544 GSKRSQAAEDMSKRKKARNAVSQEKRPASVGIPQGSTLEKQIDGSASVESYNAD---CPF 1374
                 +   D SK K      S+++  +S+ I +  T     + S  +  Y  D   CP+
Sbjct: 1779 SCDGIKEVGDSSKSKCNIKFESKQEESSSMVIAE--TSRGYFNHSMGLIKYQNDGMMCPY 1836

Query: 1373 NFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSALTVGLERTNEASSRS 1194
            +FE I ++F+   S KD + EIGLI S G+PSFL   SPY+ +S L V   + + ++   
Sbjct: 1837 DFELICSKFLTKDSNKDLIKEIGLISSNGVPSFLSSVSPYIMESTLNVIDLKKDSSTPED 1896

Query: 1193 NDLRSKQKNAEPGDVMNGRGFKDSNRSSRSAENGLSDELSI--VGRLKSILTSEKDQVTP 1020
              L S+  + E  +++   G   S+    S +    +E+S     RL +     K +   
Sbjct: 1897 GTLLSEWPSLE--NIILENGCHQSSSIDSSIQKPAGNEISAPKTKRLAAGCLEPKSKKIC 1954

Query: 1019 MKDRSSLLGLSKSTIIRESSSRPLVGRASEILRVLKINLLDMDAALPEDAFRKSRSSPDR 840
            M +R S  G+ +  +I +SS RPLVG+  +++R LK+NLLDMDAALP++A + S+   +R
Sbjct: 1955 MDNRFSEFGIGRCFVIPQSSQRPLVGKILQVVRGLKMNLLDMDAALPDEALKPSKLHIER 2014

Query: 839  RCAWRAFLKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYWSSPSTAARITTLSALALRIY 660
            R AWRAF+KS+ +IYEMVQATI  EDMI+TEYL+N+WWYWSS S AA+I+T+S+LALRI+
Sbjct: 2015 RWAWRAFVKSAGTIYEMVQATIALEDMIRTEYLKNEWWYWSSLSAAAKISTVSSLALRIF 2074

Query: 659  SLDSAISYEEPLPTAAAMEVSEPSNAIEEE 570
            SLD+AI YE+  P   + +  + +++I E+
Sbjct: 2075 SLDAAIIYEKISPNQDSNDYLDTTSSIPEQ 2104


>ref|XP_002884279.1| methyl-CpG-binding domain 9 [Arabidopsis lyrata subsp. lyrata]
            gi|297330119|gb|EFH60538.1| methyl-CpG-binding domain 9
            [Arabidopsis lyrata subsp. lyrata]
          Length = 2183

 Score =  664 bits (1713), Expect = 0.0
 Identities = 397/1061 (37%), Positives = 582/1061 (54%), Gaps = 69/1061 (6%)
 Frame = -3

Query: 3464 ASDEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFV 3285
            A+DEDKVFC LLGR +LN +DNDD+GLLG PAMVSRPLDFRTIDLRLA+GAY GS EAF+
Sbjct: 1148 AADEDKVFCTLLGRKLLNSSDNDDDGLLGTPAMVSRPLDFRTIDLRLAAGAYDGSTEAFL 1207

Query: 3284 DDVREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSL 3105
            +DV E+W +I   Y D+ D +++   LS+KF+ LYE EVL L+QK+ +   ++  SA+  
Sbjct: 1208 EDVLELWSSIRVMYADQPDYVELVATLSEKFKSLYEAEVLPLVQKLMEYRKLECLSAEMK 1267

Query: 3104 KERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIP 2925
            KE  D++V V  + LP+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL+RIP
Sbjct: 1268 KEIKDIVVSV--NKLPKAPWDEGVCKVCGVDKDDDSVLLCDTCDAEYHTYCLNPPLIRIP 1325

Query: 2924 EGNWYCPSCVTGQSLPSGTGYASGSNQRRK-RRNQGEFSSKFLEELSRFAKLMEMKEHWE 2748
            EGNWYCPSCV  + +      +    +RRK R+ QG+ +   +E  +  A +ME K++WE
Sbjct: 1326 EGNWYCPSCVIAKRMAQEALESYKLVRRRKGRKYQGQLTRTSMEMTAHLADVMEEKDYWE 1385

Query: 2747 FTVEERIFFLKFLFDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXL 2568
            F+ EERI  LK L DE L+S+ V  H+EQCA    ++Q KLRSL+SE             
Sbjct: 1386 FSAEERILLLKLLCDELLSSSLVHQHLEQCAEAIIEMQQKLRSLSSEWKNAKMRQEFLTA 1445

Query: 2567 SAEKTNSGVLNVRGDL-NSDASSSQHASENITRGKPSEKLVGDQSQPEKIIVKTSEGPNW 2391
               K    +L   G+  NS   + Q   +   +    + +  D S    +     + P  
Sbjct: 1446 KLAKVEPSILKEVGEPHNSGHFADQMGCDQRPQEGVGDGVTHDDSSTAYLNKNKGKAPLE 1505

Query: 2390 LSEKPISVQQPRSDQGHTSLLNNVQSP-LFSSP--------------------------- 2295
               +P   Q  +  + H +  + + SP   SSP                           
Sbjct: 1506 TDSQPGEFQDSQPGESHVNFESKISSPETISSPGRHEKPIADTSPHVTDNPSFEKYTSET 1565

Query: 2294 ----------TSERETELVQCPNQGDMPSSQLNNLKAC-----TVKQEITNLLDSIADIE 2160
                      T    +  V+ P   D  S     L+AC         EI NL  SI  IE
Sbjct: 1566 LHKSVGRNHETHSLNSNAVEIPTAHDASSQASQELQACLQDLNATSHEIHNLQQSIRSIE 1625

Query: 2159 LELVKISLRRDFLGRDSIGRVYWAFCYPGARPWIIACGGTAHKERCPRDF--SSIPDS-- 1992
             +L+K S+RRDFLG D+ GR+YW  C+P   P I+  G  + ++    D   S +P    
Sbjct: 1626 SQLLKQSIRRDFLGNDASGRLYWGCCFPDENPRILVDGSISLQKPVQADLMGSKVPSPFL 1685

Query: 1991 ----------DKWMYYESDSEIEKLVGWLRENNVREKELKESISQFQANKLKDSEYTEDH 1842
                        W YYE+++EI +LV WL +++++E++L+ESI  ++  +  D       
Sbjct: 1686 HAVDHGRLRLSPWTYYETETEISELVQWLHDDDLKERDLRESILCWKRLRFGD------- 1738

Query: 1841 ILKKREINHGGRKTVSADFLATNAMNALEKKFGPCKRIET-TAVPQNLVMGASQSGGMHR 1665
            + K+++        + A  L T A  ++EKK+GPC ++ET T   +      SQ   + R
Sbjct: 1739 VQKEKKQAQNLSAPILARGLETKAAMSMEKKYGPCIKLETETLKKRGKKTKVSQREKLCR 1798

Query: 1664 CECLEMLWPSKDHCGSCHQSFSTSEELRQHVKENCKAVASGSKRSQAAEDMSKRKKA-RN 1488
            CECLE + PS  HC  CH++F++ +E  +H +  C   +  ++ S+   D SK K++ ++
Sbjct: 1799 CECLESILPSMIHCLICHKTFASDDEFEEHTESKCIPYSLATEESKEISDSSKAKESLKS 1858

Query: 1487 AVSQEKRPASVGIPQGSTLEKQIDGSASVESYNADCPFNFEEIMTRFIVPSSVKDGVNEI 1308
                 K  A   + + S + +   G    +   +  P++FEEI ++F+   S +D V EI
Sbjct: 1859 DYLNVKSSAGKAVGEISNVSELDSGLIRYQEEESISPYHFEEICSKFVTKDSNRDLVKEI 1918

Query: 1307 GLIGSGGLPSFLQGQSPYVSDSALTVGLERTNEASSRSNDLRSKQKNAEPGDVMNGRGFK 1128
            GLIGS G+P+FL   S + +DS L              N   +K    + GD +   G  
Sbjct: 1919 GLIGSNGIPTFLPASSTHHNDSVLI-------------NANPNKLDGGDSGDQVIFAG-P 1964

Query: 1127 DSNRSSRSAENGLSDELSIV----GRLKSILTSEKDQVTPMKDRSSLLGLSKSTIIRESS 960
            ++N    ++E+ LS + S+     G L  +             +SS  GL    ++ +++
Sbjct: 1965 ETNVEGLNSESNLSFDGSVTDNHGGPLNKLTGLGFGFSEQKNKKSSGSGLKSCCVVPQAA 2024

Query: 959  SRPLVGRASEILRVLKINLLDMDAALPEDAFRKSRSSPDRRCAWRAFLKSSKSIYEMVQA 780
             + + G+A  + R LK NLLDMD ALPE+A R S+S PDRR AWR F+KS++SIYE+VQA
Sbjct: 2025 LKRITGKALPVFRFLKTNLLDMDVALPEEALRPSKSHPDRRRAWRVFVKSAQSIYELVQA 2084

Query: 779  TIVFEDMIKTEYLRNDWWYWSSPSTAARITTLSALALRIYSLDSAISYEEPLPTAAAMEV 600
            T V EDMIKTEYL+N+WWYWSS S AA+I+TLSAL++RI+SLD+AI Y++P+  +   + 
Sbjct: 2085 TFVVEDMIKTEYLKNEWWYWSSLSAAAKISTLSALSVRIFSLDAAIIYDKPITPSDHNDE 2144

Query: 599  SEPSNAIEEEIQKSPTLKNLASPSS----PTLQKTPEPDSS 489
            ++P   I    QKS  + +    SS     + +K  EP+ S
Sbjct: 2145 TKP--IISSPDQKSQPVSDSQEKSSRVNRRSGKKRKEPEGS 2183


>ref|XP_006594288.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
            [Glycine max]
          Length = 2202

 Score =  659 bits (1699), Expect = 0.0
 Identities = 417/1086 (38%), Positives = 602/1086 (55%), Gaps = 95/1086 (8%)
 Frame = -3

Query: 3464 ASDEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFV 3285
            A+D+ KVFCNLLGR ++N +DNDDEGLLG PAMV+RPLDFRTIDLRLA+GAYGGSHEAF+
Sbjct: 1132 AADDSKVFCNLLGRKLINSSDNDDEGLLGSPAMVARPLDFRTIDLRLATGAYGGSHEAFL 1191

Query: 3284 DDVREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSL 3105
            +DVRE+W N+  A+GD+ DL+++A+ L++ FE LY +EV+T +Q+  + + ++  SA+  
Sbjct: 1192 EDVRELWNNVRVAFGDQPDLVELAEKLTQNFESLYNEEVVTYVQRFVEYAKLECLSAEMR 1251

Query: 3104 KERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIP 2925
            KE  D +     + +P+APWDEG+CKVCG+D+DDD+VLLCD CD+EYH YCL+PPL RIP
Sbjct: 1252 KEVGDFIES--TNEIPKAPWDEGVCKVCGIDRDDDSVLLCDTCDAEYHTYCLNPPLARIP 1309

Query: 2924 EGNWYCPSCVTGQ-SLPSGTGYASGSNQRRKRRNQGEFSSKFLEELSRFAKLMEMKEHWE 2748
            EGNWYCPSCV G+ +  + T       +R+ ++ QGE +S +LE L+  +  +E KE+WE
Sbjct: 1310 EGNWYCPSCVVGKHATQNVTERTQVIGKRQSKKFQGEVNSLYLESLAHLSAAIEEKEYWE 1369

Query: 2747 FTVEERIFFLKFLFDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXL 2568
            ++V ER F LKFL DE LNS+ +  H+EQCA  +A+L  KLR+ ++E             
Sbjct: 1370 YSVGERTFLLKFLCDELLNSSLIHQHLEQCAELSAELHQKLRAHSAEWKSLKTREDILST 1429

Query: 2567 SAEKTNSGVLNVRGDLNSDASSSQHASE--------NITRGKPSEKLVGDQSQPEKIIVK 2412
             A K ++  LN  G++      +   S         +     PS   V   S P + + K
Sbjct: 1430 KAAKIDTFSLNTAGEVGLKEGFASLLSNTGKCLVQPHTAVDNPSNFGVFVDSLPSEEVTK 1489

Query: 2411 TSEGPNWLSEKPISVQQPRSDQGHTSLLN------NVQSPLFSSPTSE------------ 2286
                 + + +K ISV    SD  + + ++      NV   + S  T +            
Sbjct: 1490 DKYRFDSV-DKSISVTNSDSDSQNMNSIDVEGQFRNVSGAVESQCTDKSPKSFPLPNHMP 1548

Query: 2285 RET------ELVQCPNQG------------------DMPSSQLN-----NLKACTVKQEI 2193
            +ET       LVQ  NQ                   D+P   +N     +L+   +K++I
Sbjct: 1549 QETNGAGGASLVQGKNQKCEGKDIPTPVSYQQGMPVDVPQISVNESEPYHLELIAIKRDI 1608

Query: 2192 TNLLDSIADIELELVKISLRRDFLGRDSIGRVYWAFCYPGARPWIIACGGTA-------- 2037
            + L DSI  +  +L+K+S+RR+ LG DSIGR+YWA   PG R  I+     A        
Sbjct: 1609 SLLQDSITSVASQLLKLSVRRECLGIDSIGRLYWASALPGGRSRIVVDASAALLHGRGMT 1668

Query: 2036 -------------HKERCPRDFS-------SIPDSDKWMYYESDSEIEKLVGWLRENNVR 1917
                         H     +D S        + +S  W+ YE+D EIE+L+GWL +++ +
Sbjct: 1669 FSRDYVEKFSVLQHCALSDKDSSLMSQPSNPLGNSSPWIAYETDVEIEELLGWLDDSDPK 1728

Query: 1916 EKELKESISQFQANKLKD--SEYTEDHILKKREIN--HGGRKTVSADFLATNAMNALEKK 1749
            E+ELK+SI     ++ +   +  TED    +  ++      KTVS + L T A + LEKK
Sbjct: 1729 ERELKDSIMLGPKSRFQQFINAQTEDRAKDQGNVSMPRNREKTVS-NSLVTKATSLLEKK 1787

Query: 1748 FGPCKRIETTAV--PQNLVMGASQSGGMHRCECLEMLWPSKDHCGSCHQSFSTSEELRQH 1575
            FGP    + + V   QN     +    ++RCECLE + PS+ HC  CH++ ++  E   H
Sbjct: 1788 FGPFVEWDNSEVLKKQNRKTRTTNDEKLYRCECLEPILPSRKHCTHCHKTVASDIEFDGH 1847

Query: 1574 VKENCKAVASGSKRSQAAEDMSKRK---KARNAVSQEKRPASVGIPQGSTLEKQIDGSAS 1404
                C A     ++++     SK +   K      + +  A   +   S   K       
Sbjct: 1848 NDGKCNAGLLAIEKNKDKNGSSKGRGNLKCDTLHEKFRADAETALTSVSGSSKLSSRLIK 1907

Query: 1403 VESYNADCPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSALTVGL 1224
              +  + CPFNFE+I ++F+   S K+ V+EIGLIGS G+PSF+   SP+VS+  L+   
Sbjct: 1908 FSNEESTCPFNFEDICSKFVTNDSNKELVSEIGLIGSDGIPSFVPSVSPFVSEYTLSAQK 1967

Query: 1223 ERTNEAS-SRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSRSAENGLSDELSIVGRLKSIL 1047
            + +     S  ++ R  Q N +      G G    ++S  S     ++E +     KS L
Sbjct: 1968 DESIVGGVSIVSESRVSQGNTD------GAGTCLDHKSGISTGKLAANESNKSN--KSSL 2019

Query: 1046 TSEKDQVTPMKDRSSLLGLSKSTIIRESSSRPLVGRASEILRVLKINLLDMDAALPEDAF 867
              ++D        +S++G     ++   S RPLVG+AS ILR LKINLLDMDAAL   A 
Sbjct: 2020 REQRDGKFSFCSPASVMGADGCCVVPSPSLRPLVGKASHILRQLKINLLDMDAALLAIAL 2079

Query: 866  RKSRSSPDRRCAWRAFLKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYWSSPSTAARITT 687
            R S++ PDRR AWR F+KS+K+IYEM+QAT   EDMIKTEYLRNDWWYWSS S AA+ +T
Sbjct: 2080 RPSKAVPDRRQAWRTFVKSAKTIYEMIQATFTLEDMIKTEYLRNDWWYWSSFSAAAKSST 2139

Query: 686  LSALALRIYSLDSAISYEEPLPTAAAMEVSEPSNAIE-EEIQKSPTLKNLASPSSPTLQK 510
            L +LALRIYSLD AI YE+ +P ++  + SEPS   E + +    T K+ AS  S   +K
Sbjct: 2140 LPSLALRIYSLDLAIIYEK-MPNSSFTDSSEPSVIAEPKPLMNVDTEKSKASRKS--TRK 2196

Query: 509  TPEPDS 492
              E DS
Sbjct: 2197 RKESDS 2202


>ref|XP_006603816.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
            isoform X1 [Glycine max] gi|571553376|ref|XP_006603817.1|
            PREDICTED: methyl-CpG-binding domain-containing protein
            9-like isoform X2 [Glycine max]
          Length = 2175

 Score =  655 bits (1690), Expect = 0.0
 Identities = 416/1061 (39%), Positives = 587/1061 (55%), Gaps = 98/1061 (9%)
 Frame = -3

Query: 3464 ASDEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFV 3285
            A+D+ KVFCNLLGR + N +DNDDEGLLG PAMV+RPLDFRTIDLRLA+GAYGGSHEAF+
Sbjct: 1110 AADDSKVFCNLLGRKLTNSSDNDDEGLLGSPAMVARPLDFRTIDLRLATGAYGGSHEAFL 1169

Query: 3284 DDVREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSL 3105
            +DV E+W N+  A+GD+ DLI++A+ LS  FE LY +EV++ +QK  + + V+  SA+  
Sbjct: 1170 EDVHELWNNVRVAFGDQPDLIELAEKLSLNFESLYNEEVVSYVQKFVEYAKVECLSAEMR 1229

Query: 3104 KERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIP 2925
            KE  D +     + +P+APWDEG+CKVCG+D+DDD+VLLCD CD+EYH YCL+PPL RIP
Sbjct: 1230 KEVVDFIES--TNEIPKAPWDEGVCKVCGIDRDDDSVLLCDTCDAEYHTYCLNPPLARIP 1287

Query: 2924 EGNWYCPSCVTG-QSLPSGTGYASGSNQRRKRRNQGEFSSKFLEELSRFAKLMEMKEHWE 2748
            EGNWYCPSCV G ++    T       +R+ ++ QGE +S +LE L+  + ++E KE+WE
Sbjct: 1288 EGNWYCPSCVDGKRATQDVTERTKIIGKRQSKKFQGEVNSLYLESLTHLSSVIEEKEYWE 1347

Query: 2747 FTVEERIFFLKFLFDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXL 2568
            ++V ER F LKFL DE LNS+ +R H+EQCA  +A+L  KLR+ ++E             
Sbjct: 1348 YSVGERTFLLKFLCDELLNSSLIRQHLEQCAELSAELHQKLRAHSAEWKSLKTREDILST 1407

Query: 2567 SAEKTNSGVLNVRGD--LNSDASSSQHASENITRGKPSEKLVGDQSQPEKIIVKTSEGPN 2394
             A K ++  +N  G+  L    +       +     PS   V   S P + + K     +
Sbjct: 1408 KAAKMDTFSVNTAGEVGLKEGFTGKCPVQPHTAVDNPSNFGVFVDSLPSEEVTKERYRFD 1467

Query: 2393 WLSEKPISVQQPRSDQGHTSLLN------NVQSPLFSS---------PTSERETELVQCP 2259
             + +K ISV    SD  + + ++      NV + + S          P+    ++ + C 
Sbjct: 1468 SV-DKSISVTNSDSDSQNMNSIDVEGQFRNVSAAVESQCTDKSPKSFPSPNHMSQEINCA 1526

Query: 2258 N-----QG-----------------------DMPSSQLN-----NLKACTVKQEITNLLD 2178
                  QG                       D+P   LN     +L+   +K++I+ L D
Sbjct: 1527 GGEAHVQGNHQKCEGTDRPIPVSYQQGGVPVDVPQIGLNESEPYHLELNAIKRDISLLQD 1586

Query: 2177 SIADIELELVKISLRRDFLGRDSIGRVYWAFCYPGARPWII--ACGGTAHKERCP--RDF 2010
            SI  +  +L+K+S+RR+FLG DSIG++YWA   PG    II  A     H    P  RD+
Sbjct: 1587 SITSVVSQLLKLSVRREFLGIDSIGQLYWASALPGGHSRIIVDASAALLHGRGMPFSRDY 1646

Query: 2009 ------------------------SSIPDSDKWMYYESDSEIEKLVGWLRENNVREKELK 1902
                                    +S+ +   W+ YE+D+EIE+L+GWL  ++ +E+ELK
Sbjct: 1647 AEKFSVLQHCALSDKDSSLMSQPSNSLGNRSPWIAYETDAEIEELLGWLDYSDPKERELK 1706

Query: 1901 ESI-----SQFQA---NKLKDSEYTEDHILKKREINHGGRKTVSADFLATNAMNALEKKF 1746
            +SI     S+FQ     + +D    + HI   R       KTVS + L T A + LEKKF
Sbjct: 1707 DSIMLGPKSRFQEFINAQTEDQGEDQGHISMPR----NREKTVS-NSLVTKATSLLEKKF 1761

Query: 1745 GPCKRIETTAV--PQNLVMGASQSGGMHRCECLEMLWPSKDHCGSCHQSFSTSEELRQHV 1572
            GP    +   V   QN     +    ++RCECLE +WPS+ HC  CH++  +  E   H 
Sbjct: 1762 GPFVEWDNVEVLKKQNRKARTTNDEKLYRCECLEPIWPSRKHCTYCHKTVVSDVEFDGHN 1821

Query: 1571 KENCKAVASGSKRSQAAEDMSK-RKKARNAVSQEKRPASVGIPQGSTLEKQIDGSASVES 1395
               C A     ++ +     SK R   +   S EK  A        T    + GS+ + S
Sbjct: 1822 DGKCIAGLPAVEKKKDKNGSSKGRGNLKCDASHEKFRA-----DAETAVTSVSGSSKLSS 1876

Query: 1394 -------YNADCPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSAL 1236
                     + CPF+FE+I ++F+   S K+ V EIGLIGS G+PS +   SP+VS+  L
Sbjct: 1877 RLIKFSNEESTCPFSFEDICSKFVTNDSNKELVREIGLIGSDGIPSLVPSVSPFVSEYTL 1936

Query: 1235 TVGL-ERTNEASSRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSRSAENGLSDELSIVGRL 1059
            +    ER     S++++ +  Q N +       R  K S  + R A N  +         
Sbjct: 1937 SAQKDERIVGGVSKASESQVSQGNTDGAGTCLDR--KSSISTGRLAANESNKS------N 1988

Query: 1058 KSILTSEKDQVTPMKDRSSLLGLSKSTIIRESSSRPLVGRASEILRVLKINLLDMDAALP 879
            KS    ++D      + +S +G     ++   S RPLVG+AS ILR LKINLLDMDAAL 
Sbjct: 1989 KSSSREQRDGKLSFCNPASGMGADGYCVVPSPSLRPLVGKASHILRQLKINLLDMDAALT 2048

Query: 878  EDAFRKSRSSPDRRCAWRAFLKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYWSSPSTAA 699
              A R S++  DRR AWR F+KS+K+IYEM+QAT   EDMIKTEYLRNDWWYWSS S AA
Sbjct: 2049 AIALRPSKAESDRRQAWRTFVKSAKTIYEMIQATFTLEDMIKTEYLRNDWWYWSSFSAAA 2108

Query: 698  RITTLSALALRIYSLDSAISYEEPLPTAAAMEVSEPSNAIE 576
            + +TL +LALRIYSLD AI YE+ +P ++  + SEPS  +E
Sbjct: 2109 KSSTLPSLALRIYSLDLAIIYEK-MPNSSFTDSSEPSAIVE 2148


>ref|NP_186795.1| methyl-CPG-binding domain 9 [Arabidopsis thaliana]
            gi|75337201|sp|Q9SGH2.1|MBD9_ARATH RecName:
            Full=Methyl-CpG-binding domain-containing protein 9;
            Short=AtMBD9; Short=MBD09; AltName: Full=Histone acetyl
            transferase MBD9; AltName: Full=Methyl-CpG-binding
            protein MBD9 gi|6692266|gb|AAF24616.1|AC010870_9 unknown
            protein [Arabidopsis thaliana]
            gi|332640148|gb|AEE73669.1| methyl-CPG-binding domain 9
            [Arabidopsis thaliana]
          Length = 2176

 Score =  655 bits (1690), Expect = 0.0
 Identities = 393/1054 (37%), Positives = 580/1054 (55%), Gaps = 62/1054 (5%)
 Frame = -3

Query: 3464 ASDEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFV 3285
            A+DEDKV C LLGR +LN +DNDD+GLLG PAMVSRPLDFRTIDLRLA+GAY GS EAF+
Sbjct: 1148 AADEDKVLCTLLGRKLLNSSDNDDDGLLGSPAMVSRPLDFRTIDLRLAAGAYDGSTEAFL 1207

Query: 3284 DDVREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSL 3105
            +DV E+W +I   Y D+ D +D+   LS+KF+ LYE EV+ L+QK+ D   ++  SA+  
Sbjct: 1208 EDVLELWSSIRVMYADQPDCVDLVATLSEKFKSLYEAEVVPLVQKLKDYRKLECLSAEMK 1267

Query: 3104 KERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIP 2925
            KE  D++V V  + LP+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL+RIP
Sbjct: 1268 KEIKDIVVSV--NKLPKAPWDEGVCKVCGVDKDDDSVLLCDTCDAEYHTYCLNPPLIRIP 1325

Query: 2924 EGNWYCPSCVTGQSLPSGTGYASGSNQRRK-RRNQGEFSSKFLEELSRFAKLMEMKEHWE 2748
            +GNWYCPSCV  + +      +    +RRK R+ QGE +   +E  +  A +ME K++WE
Sbjct: 1326 DGNWYCPSCVIAKRMAQEALESYKLVRRRKGRKYQGELTRASMELTAHLADVMEEKDYWE 1385

Query: 2747 FTVEERIFFLKFLFDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXL 2568
            F+ EERI  LK L DE L+S+ V  H+EQCA    ++Q KLRSL+SE             
Sbjct: 1386 FSAEERILLLKLLCDELLSSSLVHQHLEQCAEAIIEMQQKLRSLSSEWKNAKMRQEFLTA 1445

Query: 2567 SAEKTNSGVLNVRGD----------LNSDASSSQHASENITRGKPSEKLVGDQSQPEKII 2418
               K    +L   G+          +  D    +   + +TR   +           K  
Sbjct: 1446 KLAKVEPSILKEVGEPHNSSYFADQMGCDPQPQEGVGDGVTRDDETSSTAYLNKNQGKSP 1505

Query: 2417 VKTSEGPNW----LSEKPISVQQPRSDQGHTSLLNNVQSPLFSSPTSERETE-------- 2274
            ++T   P        E  IS  +  S  G   L     SPL +    E++T         
Sbjct: 1506 LETDTQPGESHVNFGESKISSPETISSPGRHELPIADTSPLVTDNLPEKDTSETLLKSVG 1565

Query: 2273 -----------LVQCPNQGDMPSSQLNNLKAC-----TVKQEITNLLDSIADIELELVKI 2142
                        V+ P   D  S     L+AC         EI NL  SI  IE +L+K 
Sbjct: 1566 RNHETHSPNSNAVELPTAHDASSQASQELQACQQDLSATSNEIQNLQQSIRSIESQLLKQ 1625

Query: 2141 SLRRDFLGRDSIGRVYWAFCYPGARPWIIACGGTAHKERCPRDF--SSIPDS-------- 1992
            S+RRDFLG D+ GR+YW  C+P   P I+  G  + ++    D   S +P          
Sbjct: 1626 SIRRDFLGTDASGRLYWGCCFPDENPRILVDGSISLQKPVQADLIGSKVPSPFLHTVDHG 1685

Query: 1991 ----DKWMYYESDSEIEKLVGWLRENNVREKELKESISQFQANKLKDSEYTEDHILKKRE 1824
                  W YYE+++EI +LV WL +++++E++L+ESI  ++  +  D       + K+++
Sbjct: 1686 RLRLSPWTYYETETEISELVQWLHDDDLKERDLRESILWWKRLRYGD-------VQKEKK 1738

Query: 1823 INHGGRKTVSADFLATNAMNALEKKFGPCKRIET-TAVPQNLVMGASQSGGMHRCECLEM 1647
                    V A  L T A  ++EK++GPC ++E  T   +      ++   + RCECLE 
Sbjct: 1739 QAQNLSAPVFATGLETKAAMSMEKRYGPCIKLEMETLKKRGKKTKVAEREKLCRCECLES 1798

Query: 1646 LWPSKDHCGSCHQSFSTSEELRQHVKENCKAVASGSKRSQAAEDMSKRKKA-RNAVSQEK 1470
            + PS  HC  CH++F++ +E   H +  C   +  ++  +   D SK K++ ++     K
Sbjct: 1799 ILPSMIHCLICHKTFASDDEFEDHTESKCIPYSLATEEGKDISDSSKAKESLKSDYLNVK 1858

Query: 1469 RPASVGIPQGSTLEKQIDGSASVESYNADCPFNFEEIMTRFIVPSSVKDGVNEIGLIGSG 1290
              A   + + S + +   G    +   +  P++FEEI ++F+     +D V EIGLI S 
Sbjct: 1859 SSAGKDVAEISNVSELDSGLIRYQEEESISPYHFEEICSKFVTKDCNRDLVKEIGLISSN 1918

Query: 1289 GLPSFLQGQSPYVSDSALTVGLERTNEASSRSNDLRSKQKNAEPGDVMNGRGFKDSNRSS 1110
            G+P+FL   S +++DS L          S++SN    K    + GD +   G  ++N   
Sbjct: 1919 GIPTFLPSSSTHLNDSVLI---------SAKSN----KPDGGDSGDQVIFAG-PETNVEG 1964

Query: 1109 RSAENGLSDELSIVGRLKSILTSEKDQVTPMKD----RSSLLGLSKSTIIRESSSRPLVG 942
             ++E+ +S + S+       L           +    +SS  GL    ++ +++ + + G
Sbjct: 1965 LNSESNMSFDRSVTDSHGGPLDKPSGLGFGFSEQKNKKSSGSGLKSCCVVPQAALKRVTG 2024

Query: 941  RASEILRVLKINLLDMDAALPEDAFRKSRSSPDRRCAWRAFLKSSKSIYEMVQATIVFED 762
            +A    R LK NLLDMD ALPE+A R S+S P+RR AWR F+KSS+SIYE+VQATIV ED
Sbjct: 2025 KALPGFRFLKTNLLDMDVALPEEALRPSKSHPNRRRAWRVFVKSSQSIYELVQATIVVED 2084

Query: 761  MIKTEYLRNDWWYWSSPSTAARITTLSALALRIYSLDSAISYEEPLPTAAAMEVSEPSNA 582
            MIKTEYL+N+WWYWSS S AA+I+TLSAL++RI+SLD+AI Y++P+  +  ++ ++P  +
Sbjct: 2085 MIKTEYLKNEWWYWSSLSAAAKISTLSALSVRIFSLDAAIIYDKPITPSNPIDETKPIIS 2144

Query: 581  IEEEIQKSPTLKNLASPSSPTL---QKTPEPDSS 489
            + +  QKS  + +    SS      +K  EP+ S
Sbjct: 2145 LPD--QKSQPVSDSQERSSRVRRSGKKRKEPEGS 2176


>gb|EXC31622.1| Methyl-CpG-binding domain-containing protein 9 [Morus notabilis]
          Length = 2259

 Score =  655 bits (1689), Expect = 0.0
 Identities = 411/1101 (37%), Positives = 599/1101 (54%), Gaps = 136/1101 (12%)
 Frame = -3

Query: 3464 ASDEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFV 3285
            A+D+ KVFCNLLGR ++N +DNDDEGLLG PAMVSRPLDFRTIDLRLA+GAYGGSHEAF+
Sbjct: 1150 AADDSKVFCNLLGRKLINSSDNDDEGLLGSPAMVSRPLDFRTIDLRLAAGAYGGSHEAFL 1209

Query: 3284 DDVREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSL 3105
            +DVRE+W  +  A+GD+ DL+++A+ LS+ FE LYE EV++L+ K ++++ ++  +A+  
Sbjct: 1210 EDVRELWSIVRNAFGDQPDLVELAETLSQNFESLYENEVISLVGKFSELAKLQCLNAEMR 1269

Query: 3104 KERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIP 2925
            KE D LL     +V+P+APWDEG+CKVCG+D+DDD+VLLCD CD+EYH YCL+PPLLRIP
Sbjct: 1270 KEIDYLLSS--TNVIPKAPWDEGVCKVCGIDRDDDSVLLCDTCDAEYHTYCLNPPLLRIP 1327

Query: 2924 EGNWYCPSCVTG----QSLPSGTGYASGSNQRRKRRNQGEFSSKFLEELSRFAKLMEMKE 2757
            EGNWYCPSCV G    Q +P          QR  ++ QGE +  +LE L+  A  ME KE
Sbjct: 1328 EGNWYCPSCVVGRRTVQDVPENVQVI---RQRSGKKYQGEVTRVYLEALAHLATKMEEKE 1384

Query: 2756 HWEFTVEE----------------------------------------RIFFLKFLFDEA 2697
            +WEF+V+E                                        R F +KFL DE 
Sbjct: 1385 YWEFSVDESMLLLRPTLRKGRPGEGRLGKARVGHPEWAAVDVGVGSVVRSFLMKFLCDEL 1444

Query: 2696 LNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXLSAEKTNSGVLN------ 2535
            LNSA +R H+EQCA  + +LQ KLR+L  E              A K +  +LN      
Sbjct: 1445 LNSAIIRQHLEQCADTSTELQQKLRALFVEWKILKSREEILVARAAKHDPNILNSLGAVG 1504

Query: 2534 VRGDLNSDASSSQH---ASENITRGKPSEKLV----GDQSQPEKIIVKTSEGPNWLS--E 2382
            +R  L S+ +  Q    +  +   G  ++ L     G ++     + ++S   +  S  +
Sbjct: 1505 IRESLFSNHNKGQTPALSDRSNCCGMSTDDLSTLGGGREAIEPSGLDRSSSATDSQSNCQ 1564

Query: 2381 KPISVQQPRSD-----QGHTSLLNNVQSPL---------FSSPTSERETELVQCPNQG-- 2250
             P+  +    D     +   ++LN   +             S   +  + L      G  
Sbjct: 1565 NPLDTEDQLKDAHASVEESNTVLNEADASCGAICSTGNPHESVGKDSSSTLKPVGQHGHS 1624

Query: 2249 ---DMPSSQLNNLKACTVKQ------EITNLLDSIADIELELVKI-------SLRRDFLG 2118
               D+ S+   ++ A TV +      E+ ++ + I  +E  +  +       S+RR+FLG
Sbjct: 1625 NASDVRSTIGQSVPAATVNELQGHHVELKSVKNDITILEESITSVESELLKVSVRREFLG 1684

Query: 2117 RDSIGRVYWA--------------------------FCYPGARPWIIACGGTAHKERCPR 2016
             D +G +YW                           F  P  +  ++ C   +   +C R
Sbjct: 1685 SDFVGCLYWVSGTPTGSSCIIVDRSAALRSGKKMNNFQRPVGKSSVLQCSIQSVPIQCER 1744

Query: 2015 DFSSIPDSDKWMYYESDSEIEKLVGWLRENNVREKELKESISQFQANKLKDSEYTEDHIL 1836
            + S +     W+ Y++D +I++LV  L+ N+ +E+ELKESI  +Q  KL+  E+ ++ I 
Sbjct: 1745 N-SVVASDSPWVSYQTDGDIDQLVSCLKTNDTKERELKESILHWQ--KLRFQEFQKNKIR 1801

Query: 1835 KKRE-----INHGGRKTVSADFLATNAMNALEKKFGPCKRIETTAVPQNLVMGA--SQSG 1677
             + E      +  G K   +D L T A N LEK++GPC ++ETT + +     A  +   
Sbjct: 1802 GQAECAAFAASISGEKATFSDGLVTRAANLLEKRYGPCNQLETTDILKKRGKKARLTDDN 1861

Query: 1676 GMHRCECLEMLWPSKDHCGSCHQSFSTSEELRQHVKENCKAVASGSKRSQAAEDMSKRKK 1497
             M+RCECLE++WP + HC SCH++F    EL  H +  C +VA   ++ +   D SK K 
Sbjct: 1862 KMYRCECLELIWPCRHHCLSCHRTFFNDIELEGHNEGKCNSVALAQEKRKEISDSSKAKD 1921

Query: 1496 A----RNAVSQEKRPASVGIPQGSTLEKQIDGSASVESYNADCPFNFEEIMTRFIVPSSV 1329
            +     N        + V IP+    E         +     CP++FEEI ++F+   S 
Sbjct: 1922 SLKSDANREDSTGEMSRVEIPKTGFSELSAK-LIKFQDEGLSCPYDFEEICSKFVTKDSC 1980

Query: 1328 KDGVNEIGLIGSGGLPSFLQGQSPYVSDSALT-------VGLE-RTNEASSRSNDLRSKQ 1173
            KD V EIGLIGS G+PSF+   SP + DS L        VG +   +EA+ R   L +  
Sbjct: 1981 KDLVQEIGLIGSKGVPSFVSSMSPCLDDSTLALISPQKDVGAQGGGSEAAERPVSLGTGT 2040

Query: 1172 KNAEPGDVMNGRGFKDSNRSSRSAENGLSDELSIVGRLKSILTSEKDQVTPMKDRSSLLG 993
                  D+++ R  K   RS+    N +  +   +G ++     +++ +      SS +G
Sbjct: 2041 ITIAGWDILSDRSPK---RSAMKEINAVKSQRLTLGYIE-----QREGIRCSGSHSSEMG 2092

Query: 992  LSKSTIIRESSSRPLVGRASEILRVLKINLLDMDAALPEDAFRKSRSSPDRRCAWRAFLK 813
             ++  ++ + S RPLVG+ S+I R LKINLLDMDAALPE+A R S+S   RR AWRAF+K
Sbjct: 2093 ATRCCVVPQFSLRPLVGKVSQIYRRLKINLLDMDAALPEEALRPSKSHLGRRWAWRAFVK 2152

Query: 812  SSKSIYEMVQATIVFEDMIKTEYLRNDWWYWSSPSTAARITTLSALALRIYSLDSAISYE 633
            S+ +IYEMVQATIV EDMIKTEYL+N+WWYWSS S AAR +T+S+LALRIYSLD+AI YE
Sbjct: 2153 SATTIYEMVQATIVLEDMIKTEYLKNEWWYWSSFSAAARTSTMSSLALRIYSLDAAIIYE 2212

Query: 632  EPLPTAAAMEVSEPSNAIEEE 570
            +    +   + SEPSN  E++
Sbjct: 2213 KISSESDPTDKSEPSNLSEQK 2233


>ref|XP_007031432.1| Methyl-CpG-binding domain-containing protein 9, putative isoform 3
            [Theobroma cacao] gi|508710461|gb|EOY02358.1|
            Methyl-CpG-binding domain-containing protein 9, putative
            isoform 3 [Theobroma cacao]
          Length = 2195

 Score =  649 bits (1673), Expect = 0.0
 Identities = 411/1102 (37%), Positives = 592/1102 (53%), Gaps = 112/1102 (10%)
 Frame = -3

Query: 3464 ASDEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFV 3285
            A+D+ K+FCNLLGR ++N +DNDDEGLLG PAMVSRPLDFRTIDLRLA GAYGGSHEAF+
Sbjct: 1148 AADDSKIFCNLLGRKLMNSSDNDDEGLLGSPAMVSRPLDFRTIDLRLAVGAYGGSHEAFL 1207

Query: 3284 DDVREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSL 3105
             DVRE+W N+ TAY D+ DL+++A++LS+ FE LYE+EVLTL+QK+A+ + ++  +A++ 
Sbjct: 1208 KDVRELWSNVRTAYTDQPDLVELAESLSQNFESLYEQEVLTLVQKLAEYAKLECLNAETK 1267

Query: 3104 KERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIP 2925
            KE +DLL     S +P+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL RIP
Sbjct: 1268 KEINDLLAS--TSEIPKAPWDEGVCKVCGIDKDDDSVLLCDTCDAEYHTYCLNPPLARIP 1325

Query: 2924 EGNWYCPSCVTGQSL-PSGTGYASGSNQRRKRRNQGEFSSKFLEELSRFAKLMEMKEHWE 2748
            EGNWYCPSCV  + +    + ++    +RR ++ QGE +  +LE L+    ++E KE+W+
Sbjct: 1326 EGNWYCPSCVLSKRMVQDASEHSQVIIRRRDKKYQGEVTRGYLEALAHLGAVLEEKEYWQ 1385

Query: 2747 FTVEERIFFLKFLFDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXL 2568
            F+++ERIF LKFL DE LNSA +R H+EQCA   ++L  KLRS   E             
Sbjct: 1386 FSIDERIFLLKFLCDELLNSALIRQHLEQCA-ETSELHQKLRSAYVEWKNLKSREDFVAA 1444

Query: 2567 SAEKTNSGVLNVRGD---------LNSDA-------------SSSQHASENITRG----- 2469
             A K ++ + N  GD         L SD              +S+ +  +N T       
Sbjct: 1445 KAAKIDTSMSNAVGDVGVKDGDDWLPSDGGKEGADLNGSNKYASATYTEKNFTANGQTLN 1504

Query: 2468 --KPSEKLVGDQSQPEKIIVKTSEG-----------PNWLSEKPISVQQPRSDQGHTSLL 2328
                  +L GDQ+  +   V + +            PN LS++  +  +  S QG    L
Sbjct: 1505 PMDTEAQLKGDQAIVDASKVSSQKSDKSFRPSELLVPNHLSQEIENSSKETSFQGK---L 1561

Query: 2327 NNVQSPLFSSPTSERETELVQCPNQG--DMPS-----SQLNNLKACTVKQEITNLLDSIA 2169
               +    +SP S  +      P+     +PS     SQ ++L+  T+K +I  L D I 
Sbjct: 1562 EESKGMDVASPPSPSDCNGQFPPSDAAKQVPSVTENESQSHHLELNTIKNDIQRLQDLIT 1621

Query: 2168 DIELELVKISLRRDFLGRDSIGRVYWAFCYPGARP------------------------- 2064
             +E +L+K+S+R++FLG DS GR+YW    PG  P                         
Sbjct: 1622 SLESQLLKLSVRKEFLGSDSAGRLYWISAMPGGYPQVIVDGSLVLQKKRKFLGYEERVQN 1681

Query: 2063 ---WIIACGGT-------AHKERCPRDFSS---IPDSDKWMYYESDSEIEKLVGWLRENN 1923
               W  A  GT         K  CP  ++S   I     W+ Y++++EIE L+ WL +N 
Sbjct: 1682 TFIWNSASAGTDNGMKAEGSKASCPFLYNSKDAISVGSPWVTYQTEAEIEGLIDWLNDNE 1741

Query: 1922 VREKELKESISQFQANKLKDSEY------TEDHILKKREINHGGRKTVSADFLATNAMNA 1761
             +EKELKE+I Q    KLK  +Y       +D       ++ G  K   + FL T A   
Sbjct: 1742 PKEKELKEAILQ----KLKFQDYQKMKNQDQDECQTAFSMSSGSDKGSFSSFLGTKAAML 1797

Query: 1760 LEKKFGPCKRIETTAVPQNLVMGASQSGG--MHRCECLEMLWPSKDHCGSCHQSFSTSEE 1587
            LEKK+GPC + E T   +     A    G  M+RC+CLE +WPS++HC SCH++F +  E
Sbjct: 1798 LEKKYGPCFKSEITDSLKKRGKKARVINGDKMYRCKCLEPIWPSRNHCISCHKTFFSDVE 1857

Query: 1586 LRQHVKENCKAVASGSKRSQAAEDMSKRKKARNA-VSQEKRPASVGIPQGSTLEKQIDGS 1410
               H    C   +  +++S +  D  K K   N  +++      + I + S        S
Sbjct: 1858 FEDHNDGKCNLGSPLNEKSTSVGDSLKGKGNMNIDINRVDCTVDMEIVETSKSGHSELSS 1917

Query: 1409 ASVESYNAD--CPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSAL 1236
              ++  N    CP+NFEEI T+F+   S ++ V EIGLIGS G+PSF+   S +VSDS L
Sbjct: 1918 RLIKFQNEGLVCPYNFEEISTKFVTRDSNEELVREIGLIGSNGVPSFVSSVSHFVSDSTL 1977

Query: 1235 TVGLERTNEASSRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSRSAENGLSDELSIVGRLK 1056
                            +R  Q+  + GD +        ++ +RS  NG+++ LS     +
Sbjct: 1978 MT--------------VRPHQERGDLGDKLKATEMPGFSQGNRSVANGINERLSDNSFRR 2023

Query: 1055 SILT---------------SEKDQVTPMKDRSSLLGLSKSTIIRESSSRPLVGRASEILR 921
            S+ +                ++D+++     S  LG+ +  ++ +SS RPLVG+ S+I R
Sbjct: 2024 SVASEIEVQRTIRPALRCLEQRDRISSADKYSPELGIGRCCVVPQSSLRPLVGKVSQISR 2083

Query: 920  VLKINLLDMDAALPEDAFRKSRSSPDRRCAWRAFLKSSKSIYEMVQATIVFEDMIKTEYL 741
             LKINLLDMDAAL E+A R S+                              DMIKTEYL
Sbjct: 2084 QLKINLLDMDAALSEEALRPSK------------------------------DMIKTEYL 2113

Query: 740  RNDWWYWSSPSTAARITTLSALALRIYSLDSAISYEEPLPTAAAMEVSEPSNAIEEEIQK 561
            RN+WWYWSS S A +I+T+S+LALRIYSLDSAI YE+      +++  +PS+  + ++  
Sbjct: 2114 RNEWWYWSSLSAAVKISTVSSLALRIYSLDSAIIYEKSF-EFHSIDNLKPSSIPDPKLLP 2172

Query: 560  SPTLKNLASPSSPTLQKTPEPD 495
            +  L      S  T +K  EP+
Sbjct: 2173 NLDLAEKCKVSRKTSKKRKEPE 2194


>ref|XP_006296811.1| hypothetical protein CARUB_v10012794mg [Capsella rubella]
            gi|482565520|gb|EOA29709.1| hypothetical protein
            CARUB_v10012794mg [Capsella rubella]
          Length = 2177

 Score =  649 bits (1673), Expect = 0.0
 Identities = 385/1048 (36%), Positives = 582/1048 (55%), Gaps = 60/1048 (5%)
 Frame = -3

Query: 3464 ASDEDKVFCNLLGRIILNPNDNDDEGLLGYPAMVSRPLDFRTIDLRLASGAYGGSHEAFV 3285
            A+DEDKVFC LLGR +L+ +DNDD+GLLG PAMVSRPLDFRTIDLRLA+GAY GS EAF+
Sbjct: 1151 AADEDKVFCTLLGRKLLHSSDNDDDGLLGSPAMVSRPLDFRTIDLRLAAGAYDGSTEAFL 1210

Query: 3284 DDVREVWRNIHTAYGDRSDLIDVADNLSKKFEDLYEKEVLTLLQKVADISDVKDSSADSL 3105
            +DV E+W +I   Y D+ D +++   LSK F+ LYE EVL L+QK  D   ++  +A+  
Sbjct: 1211 EDVHELWSSIRVMYADQPDCVELVATLSKTFKSLYEAEVLPLVQKFVDFRKLECLNAEMK 1270

Query: 3104 KERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIP 2925
            KE  D++V +  + LP+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL+RIP
Sbjct: 1271 KEIKDIIVSI--NKLPKAPWDEGVCKVCGVDKDDDSVLLCDTCDAEYHTYCLNPPLIRIP 1328

Query: 2924 EGNWYCPSCVTGQSLPSGTGYASGSNQRRK-RRNQGEFSSKFLEELSRFAKLMEMKEHWE 2748
            +GNWYCPSCV  + +      +    +RRK R+ QGE +   +E  +  A +ME K++WE
Sbjct: 1329 DGNWYCPSCVIAKRMAQEALESYKLVRRRKGRKYQGELTQASMEMTAHLAGVMEEKDYWE 1388

Query: 2747 FTVEERIFFLKFLFDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXL 2568
            F+VEERI  LK L DE L+S+ V  H+EQCA    ++Q KLRSL+SE             
Sbjct: 1389 FSVEERILLLKVLCDELLSSSLVHQHLEQCAEAIIEMQQKLRSLSSEWKNAKMRQEFLMA 1448

Query: 2567 SAEKTNSGVLNVRGDLNS----------DASSSQHASENITRGKPSEKLVGDQSQPEKII 2418
               K    +L    +L++          D  + +   + +T    +           K  
Sbjct: 1449 KLAKVEPSILKEASELHNSSHFADQMGCDERTHEGVGDGVTHDDETSSTAFLNKNQGKAP 1508

Query: 2417 VKTSEGPNWL-----SEKPISVQQPRSDQGHTSLLNNVQSPLFSSPTSERET-------- 2277
            ++T+  P  L       K  S ++  S   H  L+ ++      + T E++T        
Sbjct: 1509 LETNSQPGDLHVDSGGNKVSSQKKITSPGRHELLVADISPRATDNLTFEKDTLHKSVGRI 1568

Query: 2276 --------ELVQCPNQGDMPSSQLNNLKAC-----TVKQEITNLLDSIADIELELVKISL 2136
                      V+  +  D  S     L+AC         EI NL  SI  +E +L+K S+
Sbjct: 1569 HETHPLHSNAVELQSVHDASSQASQELQACQQDLNATSNEIQNLQLSIRSVESQLLKQSI 1628

Query: 2135 RRDFLGRDSIGRVYWAFCYPGARPWIIACGGTAHKE---------RCPRDFSSIPDSDK- 1986
            RRDFLG DS GR+YW  C+P   P ++  G  + ++         R P  F    D  + 
Sbjct: 1629 RRDFLGNDSSGRLYWGCCFPDENPRVLVDGSISLQKPVQANLTGSRAPSPFLQAVDHGRL 1688

Query: 1985 ----WMYYESDSEIEKLVGWLRENNVREKELKESISQFQANKLKDSEYTEDHILKKREIN 1818
                W YYE++SEI +LV WL +++ +E++L+ESI  ++  +  D       + K++E  
Sbjct: 1689 TLSPWTYYETESEISELVQWLHDDDPKERDLRESILCWKRLRFGD-------VQKEKENA 1741

Query: 1817 HGGRKTVSADFLATNAMNALEKKFGPCKRIETTAVPQNLVMGASQSGGMHRCECLEMLWP 1638
                  + +  L T A  ++EK+FGPC ++ET  + +       +     RCECLE + P
Sbjct: 1742 ENLSSPIFSRGLVTKAAMSMEKRFGPCIKLETETLKKRGKKTKVEREKFCRCECLEAILP 1801

Query: 1637 SKDHCGSCHQSFSTSEELRQHVKENCKAVASGSKRSQAAEDMSKRKKA-RNAVSQEKRPA 1461
            S  HC  CH++F++ +E   H +  C   +  ++  +   D SK K++ ++     K  A
Sbjct: 1802 SMIHCLICHKTFASDDEFENHSESKCIPYSLATEEGKEISDFSKAKESLKSDYLNVKSSA 1861

Query: 1460 SVGIPQGSTLEKQIDGSASVESYNADCPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGGLP 1281
               + + S + +   G    +   +  P++FEEI ++F+   S +D V +IGLIGS G+P
Sbjct: 1862 GKDVSEISNVSELDSGLIRYQEEESISPYHFEEICSKFVTKDSNRDLVKDIGLIGSNGIP 1921

Query: 1280 SFLQGQSPYVSDSALTVGLERTNEASSRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSRSA 1101
            +FL     +++DS L          S+ S    SK    + GD +   G  ++N    ++
Sbjct: 1922 TFLPSSYTHLNDSMLI---------SANS----SKLDGDDSGDQVVFAG-SETNVEGLNS 1967

Query: 1100 ENGLSDELSI---VGRLKSILTSEKDQVTPMKDRSSL-LGLSKSTIIRESSSRPLVGRAS 933
            E  +S + S+   +G   S  +      +  K + SL  GL    ++ ++S + + G+A 
Sbjct: 1968 EFNMSFDRSVTHDLGGPPSKPSGLGFGFSEQKIKKSLGSGLKSCCVVPQASLKRITGKAL 2027

Query: 932  EILRVLKINLLDMDAALPEDAFRKSRSSPDRRCAWRAFLKSSKSIYEMVQATIVFEDMIK 753
             + R LK NLLDMD ALPE+  R S+S P RR AWR F+KSS+SIYE+VQAT+V EDM+K
Sbjct: 2028 PVFRFLKTNLLDMDVALPEEGLRPSKSHPGRRRAWRLFVKSSQSIYELVQATVVLEDMVK 2087

Query: 752  TEYLRNDWWYWSSPSTAARITTLSALALRIYSLDSAISYEEPLPTAAAMEVSEPSNAIEE 573
            TEYL+N+WWYWSS S AA+I+TLSAL++RI++LD+AI Y++ L  +  ++ ++P  ++ +
Sbjct: 2088 TEYLKNEWWYWSSLSAAAKISTLSALSVRIFALDAAIMYDKLLTPSDPIDETKPIISLPD 2147

Query: 572  E----IQKSPTLKNLASPSSPTLQKTPE 501
            +    +  S    + A+  S   +K PE
Sbjct: 2148 QKSQPVSDSQERSSRANRRSGKKRKEPE 2175

