BLASTX nr result

ID: Catharanthus22_contig00003477 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00003477
         (3169 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002525350.1| DNA binding protein, putative [Ricinus commu...   527   e-146
ref|XP_006365207.1| PREDICTED: methyl-CpG-binding domain-contain...   526   e-146
ref|XP_006483833.1| PREDICTED: methyl-CpG-binding domain-contain...   511   e-142
ref|XP_006446469.1| hypothetical protein CICLE_v10014026mg [Citr...   504   e-140
ref|XP_006470356.1| PREDICTED: methyl-CpG-binding domain-contain...   503   e-139
ref|XP_006470355.1| PREDICTED: methyl-CpG-binding domain-contain...   503   e-139
gb|EXC31622.1| Methyl-CpG-binding domain-containing protein 9 [M...   475   e-131
ref|XP_002517349.1| DNA binding protein, putative [Ricinus commu...   474   e-130
ref|XP_004141185.1| PREDICTED: methyl-CpG-binding domain-contain...   472   e-130
ref|XP_004167238.1| PREDICTED: LOW QUALITY PROTEIN: methyl-CpG-b...   471   e-129
gb|EOY02356.1| Methyl-CpG-binding domain-containing protein 9, p...   470   e-129
ref|XP_002884279.1| methyl-CpG-binding domain 9 [Arabidopsis lyr...   468   e-129
ref|XP_006408507.1| hypothetical protein EUTSA_v10019872mg [Eutr...   467   e-128
ref|XP_006296811.1| hypothetical protein CARUB_v10012794mg [Caps...   461   e-126
ref|XP_006646998.1| PREDICTED: methyl-CpG-binding domain-contain...   457   e-125
ref|NP_186795.1| methyl-CPG-binding domain 9 [Arabidopsis thalia...   457   e-125
gb|ESW23089.1| hypothetical protein PHAVU_004G017600g [Phaseolus...   448   e-123
ref|XP_006594288.1| PREDICTED: methyl-CpG-binding domain-contain...   443   e-121
ref|NP_001046163.1| Os02g0192400 [Oryza sativa Japonica Group] g...   434   e-119
gb|EEE56485.1| hypothetical protein OsJ_05715 [Oryza sativa Japo...   434   e-119

>ref|XP_002525350.1| DNA binding protein, putative [Ricinus communis]
            gi|223535313|gb|EEF36988.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 1794

 Score =  527 bits (1358), Expect = e-146
 Identities = 340/924 (36%), Positives = 488/924 (52%), Gaps = 57/924 (6%)
 Frame = -3

Query: 3164 TESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPL 2985
            +E++KE +++L   + S +PKAPW+EG+CKVCG+DKDD NVLLCD CDS YHTYCLNPPL
Sbjct: 893  SEAKKEMEDILE--HASQMPKAPWDEGVCKVCGVDKDDDNVLLCDKCDSGYHTYCLNPPL 950

Query: 2984 GKVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELK 2805
             ++P+GNWYCPSC+T     + A+   Q V+ C K+R Q +     LE LA L   ME+ 
Sbjct: 951  ARIPEGNWYCPSCIT-----QGASQVPQFVSHCRKKRRQGEFTHGVLEALAHLGTTMEIT 1005

Query: 2804 EYWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEE 2625
            +YW+++VEERI LLKF  DE +NSA IR+H+DQCAS+  DL Q+LRSL+ E + LK KEE
Sbjct: 1006 DYWDYSVEERIFLLKFLGDEVLNSANIREHLDQCASVSADLQQKLRSLSMEWRNLKFKEE 1065

Query: 2624 SLAANVAKMKGNVHTGGGELASVLADESQL--PVDNKVSSFSGGSVPMD---GGPHTKDQ 2460
             +   V K      +G     +VL +  +L     ++ S  S   + ++    GP     
Sbjct: 1066 LMLNGVGK------SGKEGTTTVLPNYDKLLGQTHSRSSLCSTSFIDLEHLKDGPRFPRT 1119

Query: 2459 VSILRSDVNLHPKLGDTXXXXXXXXXXXQMLGYGLSNTVSKNITVHASSFPGHQYSNQPN 2280
                +    ++PK                  G  +   +S    V   S    Q  NQP+
Sbjct: 1120 NDFTKRPCWVYPK------------------GVQVQQPISNGSQVFTISDTECQV-NQPD 1160

Query: 2279 ANSLLDYNAELSKLQQEISGLQESISTAESELLKVSLRKEFLGRDSYGRLYWVFGSYDAS 2100
             N L   N E   ++ + S LQ+S+++ E +L K SLRKEFLGRDS GR+YW F    + 
Sbjct: 1161 VNQLQTSNLESIFIRDKASVLQDSVTSLELQLQKASLRKEFLGRDSAGRVYWAFSRTGSL 1220

Query: 2099 PWIVANG-------SLNPESEFGLDNHFPKSSS--------------------------- 2022
            PW+V +G       S+  E+     N+    SS                           
Sbjct: 1221 PWVVIDGTTVVQQSSIAEENRVLRFNNLTFRSSIGAQDLLRFKGSNVFSPYASDLTSGIS 1280

Query: 2021 ----WMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQSNKSLNSNYQRNDFLK-GKPS 1857
                W  + + AEI EL+KWL D D  +R+L ES+LQ  +    NSN   N  L+  +P+
Sbjct: 1281 VYFQWFSHQSYAEIEELIKWLRDNDPMQRELIESLLQRLNFGYSNSNKAANYVLEMNQPA 1340

Query: 1856 SSVISSEQRVPNHNCRVLKGVRALEKKFGLGMDMGADYINKNLEHNHEMTYQGRMCRCVC 1677
            S  ++ E+ +   +    + + ALEKK+G  M++    I+     N ++TY  RMCRC C
Sbjct: 1341 SMPVNIEKTLKPKSLET-RALTALEKKYGPCMELDVTNISVKFSRNLKVTYDDRMCRCEC 1399

Query: 1676 LELIWPSRHHCFSCHQTFSASEELDQHGE-KCRAAATGAAGSLKHDKTVMNQGIRICPGS 1500
            LE IWPSRHHC SCH++FS+  EL++H + KC A A     S   D  V  + + +    
Sbjct: 1400 LEAIWPSRHHCLSCHRSFSSRCELEEHNDGKCGAGAHTPQNSRVTD-DVSKEKVLMRAEH 1458

Query: 1499 SNIPQSVLNEKHDTQSNCVKXXXXXXXXXXXXEIMANFKVDYSIEEDIKGIGLFGTNGVV 1320
                       H+ +   +             EI A F    S +E +K IGL G+NG+ 
Sbjct: 1459 GEWQCKAGGAGHEIEFGLIGFRKEFMSPYNLEEISAKFVTRSSNKELVKEIGLLGSNGIP 1518

Query: 1319 PFLPNSSPCFNDPALTLVPPSGNEVSMEDQTSVPKSQQKLSDDLAKTASVVNDKSSGIEK 1140
              +P SSP   DP L LV P  NEV    Q++  ++     D    T S  +   S   K
Sbjct: 1519 SLVPCSSPYLIDPTLKLVLPCVNEVCQSVQSTNVENGSLQGD---TTTSKRHANKSNATK 1575

Query: 1139 GKGKVSEVECMKSRVLCERGRLXXXXXXXXXSTARR-----SIIRESSLRPKVGYATEVL 975
                V   E      L E GR           +  +     S IR S+LRP VG    +L
Sbjct: 1576 DCTAVDLYE-----ELQEIGRSYLMNQSSLRFSCTKLGNPLSEIRGSALRPLVGKGAHIL 1630

Query: 974  RLLKISMLDIDAALPEAALRASRSHLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDY 795
            R LKI++LD+DAALPE A+++S  +L++RCAWR FVK A +++EM+ A I+LE+ I+TD+
Sbjct: 1631 RQLKINLLDMDAALPEEAVKSSNIYLEKRCAWRAFVKSAKSVFEMVQATIVLENMIKTDF 1690

Query: 794  LKNNWWYWSSPSAASNMPTLSALALRIFTLDSAILYEKSSTDDTAGTSIPVSQP------ 633
            L+N WWYWSS SAA+ + T+S+LALRI+TLD+AI+YEK         ++P + P      
Sbjct: 1691 LRNEWWYWSSLSAAAKIATISSLALRIYTLDAAIVYEK---------TLPFTPPKDIAEV 1741

Query: 632  -DKEASSSAIPTTEMKSAEQPMQK 564
              K  ++++ P T+++S  +P  K
Sbjct: 1742 GSKSDNNNSPPHTDLESNPKPSSK 1765


>ref|XP_006365207.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
            [Solanum tuberosum]
          Length = 2173

 Score =  526 bits (1356), Expect = e-146
 Identities = 358/976 (36%), Positives = 502/976 (51%), Gaps = 111/976 (11%)
 Frame = -3

Query: 3164 TESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPL 2985
            +E  K+RD +LA VN+SSLPKAPWEEG+CKVC MDKDDVNVLLCD CDSEYHTYCL+PPL
Sbjct: 1180 SEVAKDRDGLLAHVNESSLPKAPWEEGLCKVCSMDKDDVNVLLCDKCDSEYHTYCLDPPL 1239

Query: 2984 GKVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELK 2805
             KVP G WYCP C    S S++A+ G+  + QC KRR  RKL  + +E L+QL   MELK
Sbjct: 1240 VKVPIGPWYCPDCEAKISRSQNASSGSHTIRQCVKRRLHRKLTHKFMEKLSQLTRTMELK 1299

Query: 2804 EYWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEE 2625
            EYWE  +E+RI LLKF  DE +NSA +RDHID+ AS+  +L Q+LRSL +E K+LK K+E
Sbjct: 1300 EYWELPLEDRIFLLKFLCDEMLNSAILRDHIDRSASLSAELQQKLRSLGAELKLLKHKKE 1359

Query: 2624 SLAANVAKMKGNVHTGG--GELASVLADESQLPVD-----NKVSSFSGGSVPMDGGP-HT 2469
             L A   K+K +  + G  G   S+ +++ +L V      +  SS SGG   +D G  H 
Sbjct: 1360 ILTA---KLKNDARSSGDTGSDTSLWSNDCKLKVQGPDSGSHNSSISGGCRQLDDGTQHN 1416

Query: 2468 K----DQVSILRSDVNLHPKL--GDTXXXXXXXXXXXQMLGYGL--SNTVSKNITVHAS- 2316
            K    ++ S L +  N+  K     T            +    L   NT S N + HA  
Sbjct: 1417 KCNDYNKQSCLYTSKNIQDKTCASGTNHIRNSPDPINHLQHQQLLKENTRSLNTSSHAKC 1476

Query: 2315 ----------------------SFPGHQYSNQPNANSLLDYNA----------------- 2253
                                    PG+   + P+++  +   A                 
Sbjct: 1477 GTEEANLQNDLFISTTLQQETDQIPGNCLESTPSSSKSIMLFATHIVSATTCSGSVSNPL 1536

Query: 2252 ------ELSKLQQEISGLQESISTAESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWI 2091
                  E+S +++EI  L++SI+  E EL +VS+RKE++G+DS GRLYW FG   +S  +
Sbjct: 1537 EEAFLFEMSAIKKEIRALEDSIAAKELELQEVSVRKEYMGQDSEGRLYWTFGRSTSSRLV 1596

Query: 2090 V-ANGSLNPESE-----FGLDNH----------------FPKSSSWMYYDTVAEIGELVK 1977
              A+ S  PES      +G+++                  P    W  Y +  +   L++
Sbjct: 1597 AYASTSTQPESSGHLWSYGVESSRRSGVFDSSAPWENMGMPNLDQWTSYQSDVDTEILIR 1656

Query: 1976 WLDDCDIRERDLKESILQWQSNKSLNSNYQRNDFLKGKPSSSVISSEQRVP--NHNCRVL 1803
            WL + D RER+LKESILQW+  + +   Y  +         + I SE      N +  V 
Sbjct: 1657 WLKEHDPRERELKESILQWRDTRKMIYYYLESHGHDKVRLITSIPSEDSASCFNSDSLVT 1716

Query: 1802 KGVRALEKKFGLGMDMGADYINKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTF 1623
            + V A++K            I  NL     +++ G + RC CLE +WPSR HC SCHQTF
Sbjct: 1717 RAVTAIKKMVSGCSAEEETEICTNLGVKVRVSFDGELYRCECLEPLWPSRPHCLSCHQTF 1776

Query: 1622 S-ASEELDQHGEKCRAAATG--------AAGSLKHDKTVMNQGIRICPGSSNIPQSVLN- 1473
            S A E L    EKCR  +          +    K  +T  N+ ++    S+++ Q+  + 
Sbjct: 1777 SDAKERLKHANEKCRIDSPSPIQRDGETSEQPAKRKRTANNEILQDNSLSNDVSQASKSK 1836

Query: 1472 ----------EKHDTQSNCVKXXXXXXXXXXXXEIMANFKVDYSIEEDIKGIGLFGTNGV 1323
                      +KH       +            EI A F    S++E +  IGL G NG 
Sbjct: 1837 KLGNGEASRRDKHGNAPASAENQTKQECPFKFEEIKAQFITQRSLKELVNEIGLIGCNGT 1896

Query: 1322 VPFLPNSSPCFNDPALTLVPPSGNEVSMEDQTSVPKSQQKLSDDLAKTASVVNDKSSGIE 1143
              F+P +SP   D AL L+    +EV   + T +  S+ +L + +    S +N+  +   
Sbjct: 1897 PSFIPCTSPYLCDSALELLSQREDEVCGGNSTDLLSSEHQLRNGVK--VSCINNSDNPNC 1954

Query: 1142 KGKGKVSEVECM-KSRVLCERGR---LXXXXXXXXXSTARRSIIRESSLRPKVGYATEVL 975
             G G         + +   +RGR                   +I ESSL P  G A+ +L
Sbjct: 1955 TGNGLAGAGPVFGRLKSATKRGRNQFSSTKDKILEFGVNMYFVIPESSLHPVAGRASVIL 2014

Query: 974  RLLKISMLDIDAALPEAALRASRSHLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDY 795
            R LKI++LDIDAALPE ALR SR   +RR  WR FVK A T+YEM+ A IILED I+T+Y
Sbjct: 2015 RCLKINLLDIDAALPEEALRVSRLQSERRRVWRAFVKSAATIYEMVQATIILEDAIKTEY 2074

Query: 794  LKNNWWYWSSPSAASNMPTLSALALRIFTLDSAILYEKSSTDDTAGTSIPVSQPDKEASS 615
            LKN+WWYWSSPSAA+ + TLSALALR++ LDSAILY+K S+ D + T     +  +    
Sbjct: 2075 LKNDWWYWSSPSAAARISTLSALALRVYALDSAILYDKLSSQDASETD--CKEEREPPPR 2132

Query: 614  SAIPT-TEMKSAEQPM 570
            +++PT T   S ++P+
Sbjct: 2133 NSVPTNTASPSKKKPL 2148


>ref|XP_006483833.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
            isoform X2 [Citrus sinensis]
          Length = 2084

 Score =  511 bits (1316), Expect = e-142
 Identities = 338/956 (35%), Positives = 505/956 (52%), Gaps = 81/956 (8%)
 Frame = -3

Query: 3164 TESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPL 2985
            +E++KE +++L   + S +PKAPW+EG+CKVCG+DKDD NVLLCDTCDS YHTYCL PPL
Sbjct: 1131 SEAKKEMEDILE--SASEIPKAPWDEGVCKVCGIDKDDDNVLLCDTCDSGYHTYCLTPPL 1188

Query: 2984 GKVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELK 2805
             +VP+GNWYCP C++GN  ++  +    V ++  KRR+Q +     LE +  LA  ME++
Sbjct: 1189 TRVPEGNWYCPPCLSGNCKNKYMSQVPHVSSRIPKRRHQGEFTCRILEEVFHLAATMEMR 1248

Query: 2804 EYWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEE 2625
            +YW+++ +ERI LLKF  DE +NS  IR+H+++CAS+ VDL Q++RSL+ E + LK +EE
Sbjct: 1249 DYWDYSDKERIFLLKFLCDELLNSTNIREHLERCASVSVDLQQKIRSLSLEWRNLKFREE 1308

Query: 2624 SLAANVAKMKGNV-------------------------HTGGG----ELASVLA---DES 2541
             LA  VA+ K +V                          +GGG     LAS LA   D  
Sbjct: 1309 ILAGKVARDKASVLSGTGKCGTEGVATLYPHYGKLMRQPSGGGGYFSSLASDLALSEDGL 1368

Query: 2540 QLPVDNKVSSF-----------SGGSVPMDGGPHTKDQV--SILRSDVNLHPKLGDTXXX 2400
            QL    K+S +           S     +   P+T+ QV     + ++ +     D    
Sbjct: 1369 QLNESRKLSCWFNLKGISMRQPSCSRNQIGEAPYTESQVHQESEKDNIRVDDLQYDVPHS 1428

Query: 2399 XXXXXXXXQMLGYGLSNTVSKNITVHASSFPGHQYSNQPNAN-SLLDYNAELSKLQQEIS 2223
                        Y       +++    +S P      QPN   S   ++++ +   Q   
Sbjct: 1429 ASQPQKQDTAGEYATWRNKGQDLENGHTSGP-----LQPNCEASQSHFSSDHTNGNQVAE 1483

Query: 2222 GLQESISTAESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGSLNPESEFGLDN 2043
             L +SI+  ES+ L VSLRKE LGRDS GRLYW F   + SPW++ + +   E E  L  
Sbjct: 1484 HLCDSIAGLESQQLAVSLRKELLGRDSAGRLYWAFFRPNTSPWLLVDATTVLEQERILKE 1543

Query: 2042 H---------------FPKSSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQS-- 1914
            H                  SSSW  Y +  EI EL++WL D D R+++L ESIL+W    
Sbjct: 1544 HGDSLANSPFEEEYNGISASSSWFSYQSDTEIEELIQWLSDSDPRDKELAESILRWTKIG 1603

Query: 1913 --NKSLNSNYQRNDFLKGKPSSSVISSEQRVPNHNCRVLKGVRALEKKFGLGMDMGADYI 1740
              +  +  N+  ++ +   PSSS     +     +  V K +  LE+K G  ++     +
Sbjct: 1604 YKDLKIAGNHIEDESV---PSSSKCRKSEATVKSSGLVTKALTVLEEKHGPCLEPEVLKM 1660

Query: 1739 NKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGE-KCRAAATGA 1563
            +  L+ N E+T + RM RC CLE + P+R HC  CH +FSA  EL++H + KC  +AT +
Sbjct: 1661 SMKLDTNSELTCKERMYRCECLEPVLPTRFHCRRCHLSFSARNELEEHNDAKCILSATSS 1720

Query: 1562 AGSLKHDKTVMNQG-IRI------C--PGSSNIPQSVLNEKHDTQSNCVKXXXXXXXXXX 1410
              S + D+     G IR       C       + QS+   KH T     +          
Sbjct: 1721 QNSKEDDERTKGAGTIRTETLQAECMETAGKGMSQSL---KHGTAMGSFEIPKEFACPFN 1777

Query: 1409 XXEIMANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSMEDQ 1230
              EI   F    SI+E ++ IGL G+NGV  F+P++SP   DP+L LV    NE++  ++
Sbjct: 1778 FEEISTKFITKNSIKELVQEIGLIGSNGVPAFVPSTSPYLCDPSLKLVEMCKNEINRGNK 1837

Query: 1229 TSVPKS--QQKLSDDLAKTASVVNDKSSGIEKGKGKVSEVECMKSRVL----CERGRLXX 1068
            ++  ++  Q  +  D+       N  ++   +     ++ + +K R L        R   
Sbjct: 1838 STNLENLFQYSIVGDMVSGLEHDNISNNSSRRCTVSHNDDDVLKCRRLNPNFMNEKRDQS 1897

Query: 1067 XXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRR 888
                        SI+R++SL P +G   E+LR LKI++LD+DAA+PE ALR+S++  + R
Sbjct: 1898 FSLSLKPGIGNSSIVRDTSLMPLMGRGIEILRQLKINLLDMDAAVPEEALRSSKACWENR 1957

Query: 887  CAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFT 708
             AWR FVK A +++EM+ A I+ ED I+TDYL+N WWYWSS S A+N+ T+SALALR++T
Sbjct: 1958 SAWRAFVKSAKSIFEMVQATIVFEDMIKTDYLRNGWWYWSSLSGAANIATVSALALRLYT 2017

Query: 707  LDSAILYEKSSTDDTAGTSIPVSQPDKEASSSAIPTTEMKSAEQPMQKVNDSDSGE 540
            LD+AI+YEK S  D+      +SQPDKE S    P  + KS  +P + +  + S +
Sbjct: 2018 LDAAIVYEKHS--DSIEIQEHISQPDKETS----PCKDSKSNPKPSKAILKTQSSD 2067


>ref|XP_006446469.1| hypothetical protein CICLE_v10014026mg [Citrus clementina]
            gi|557549080|gb|ESR59709.1| hypothetical protein
            CICLE_v10014026mg [Citrus clementina]
          Length = 1680

 Score =  504 bits (1298), Expect = e-140
 Identities = 329/958 (34%), Positives = 495/958 (51%), Gaps = 90/958 (9%)
 Frame = -3

Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982
            E+ KE +++L  V  S +PKAPW+EGICKVCG+DKDD +VLLCDTCD+EYHTYCL PPL 
Sbjct: 727  ETTKEINDIL--VQTSEIPKAPWDEGICKVCGVDKDDDSVLLCDTCDAEYHTYCLEPPLV 784

Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802
            ++P+GNWYCPSCV  NS+ + A+  +QV  Q   ++YQ ++    LE L  L   ME KE
Sbjct: 785  RIPEGNWYCPSCVVRNSMVQGASEHSQVGGQHKGKKYQGEITRLCLEELRHLTTVMEEKE 844

Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEES 2622
            YWEF V ER  LLKF  DE +NSA +R H++QC  +  +L Q+LRS + E K LK +EE+
Sbjct: 845  YWEFNVHERTFLLKFLCDELLNSALLRQHLEQCTEVTAELQQKLRSFSVEFKNLKSREET 904

Query: 2621 LAANVAKMKGNVHTGGGEL------ASVLADESQLPVDNKVSSF-----------SGGSV 2493
            +AA VAK++ ++     E+      A+V+ +  +     + SS            SG   
Sbjct: 905  VAARVAKVEASMTNSVAEICMKEGPATVIRNNGKCIEQPQNSSNRSNCSVIALEESGPMY 964

Query: 2492 PMDGG-----PH-TKDQVSILRSDVNLHPK---LGDTXXXXXXXXXXXQMLGYGLSNTVS 2340
            P D       PH    ++   ++D ++ P    L  +               + L     
Sbjct: 965  PTDAEGQIEEPHGDNSKMPSQKNDESIKPNEHPLASSLPQEIDNLSGEIRSQHNLQELAR 1024

Query: 2339 KNITVHASSFPGHQYSNQPNA------------NSLLDYNAELSKLQQEISGLQESISTA 2196
                   +S   +Q  + PN             N    +N EL+ ++ +I  LQESI++ 
Sbjct: 1025 ARDAATLASPSNNQGPSVPNELHVTEGTCSVTMNEPQAHNLELNNIRNDILLLQESITSL 1084

Query: 2195 ESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGS-------------------- 2076
            E +LLK+S+R+EFLG DS GRLYWV       P ++ +GS                    
Sbjct: 1085 EQQLLKLSVRREFLGSDSSGRLYWVLPLPGMHPCLIVDGSPELQQKRKILDFRGPVDKGL 1144

Query: 2075 -LNPESEFGLDNHFPK-------------------SSSWMYYDTVAEIGELVKWLDDCDI 1956
             L   S  G D +                      SS W+ Y T AEI ELV WL D D 
Sbjct: 1145 VLKNSSSSGSDAYSSSKGSKACCPFQYDPYAVTATSSHWILYQTDAEIEELVNWLRDNDP 1204

Query: 1955 RERDLKESILQWQSNKSLNSNY-QRNDFLKGKPSSSVISSEQRVPNHNCRVLKGVRALEK 1779
            +ER+LK+SIL W+  +  +S + ++  + + + +SS  ++  +V   +C V K    LEK
Sbjct: 1205 KERELKDSILNWKKIRFQDSQHTKKQSWDEYQSASSAPTNSDKVDCFDCLVTKAATLLEK 1264

Query: 1778 KFGLGMDMGADYINKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQ 1599
            K+G   +  ++ + K       +T Q +M RC CLE IWPSR+HC SCH+TFS + E ++
Sbjct: 1265 KYGPCFE--SEEVLKKGGKRARVTSQEKMYRCECLEPIWPSRNHCLSCHRTFSTAVEFEE 1322

Query: 1598 HGEKCRAAATGAAGSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSNCVKXXXXXXX 1419
            H + C +A      + +   ++  +G +    S     + +     ++ + +        
Sbjct: 1323 HNDTCNSAPPAYEKNKEASNSLKGKGNKKSDISRAACGTDVELVETSKPSGLIRFQNDGC 1382

Query: 1418 XXXXXEIMANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSM 1239
                 EI + F    S +E ++ IGL G+ G+   +P+ SP  +D  L L+  S  EV +
Sbjct: 1383 PFDLNEISSKFMTQDSNKELVQEIGLLGSKGIPSLIPSVSPFLSDSTLMLMS-SQKEVGV 1441

Query: 1238 ED-----QTSVPKSQQKLS------DDLAKTASVVNDKSSGIEKGKGKVSEVECMKSRVL 1092
             D       ++  SQ K S      D++A  AS  +  +   E  K K     C + R  
Sbjct: 1442 PDGQLMASETLSSSQGKQSMKNAGNDNMADDASRKSGSNGTHEVLKSKKPAFGCSEQRDR 1501

Query: 1091 CERGRLXXXXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRA 912
                 +               ++ +SSLRP +G  +++ R LK+++LDIDAALPE ALR 
Sbjct: 1502 KSSSHVRVPKVGINQCC----VVPQSSLRPLIGRTSQIKRRLKVNLLDIDAALPEEALRP 1557

Query: 911  SRSHLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLS 732
            S++HL+RR AWR FVK A T+YEM+ A IILED I+T++L+N WWYWSS SAA+   T+S
Sbjct: 1558 SKAHLERRWAWRAFVKSAETIYEMVQATIILEDMIKTEFLRNEWWYWSSLSAAAKTSTMS 1617

Query: 731  ALALRIFTLDSAILYEKSSTDDTAGTSIPVSQPDKEASSSAIPTTEMKSAEQPMQKVN 558
            +LALRI++LD+AI+Y+KS+T+     ++ +   D       +P  E+    +  +K N
Sbjct: 1618 SLALRIYSLDAAIIYDKSTTNLNPVENLKL---DSTPEHKPLPGVELLEKSKVSRKSN 1672


>ref|XP_006470356.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
            isoform X2 [Citrus sinensis]
          Length = 2023

 Score =  503 bits (1296), Expect = e-139
 Identities = 327/956 (34%), Positives = 496/956 (51%), Gaps = 88/956 (9%)
 Frame = -3

Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982
            E+ KE +++L  V  S +PKAPW+EGICKVCG+DKDD +VLLCDTCD+EYHTYCL PPL 
Sbjct: 1072 ETTKEINDIL--VQTSEIPKAPWDEGICKVCGVDKDDDSVLLCDTCDAEYHTYCLEPPLV 1129

Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802
            ++P+GNWYCPSCV  NS+ + A+  +QV  Q   +  Q ++    LE L  L   ME KE
Sbjct: 1130 RIPEGNWYCPSCVVRNSMVQGASEHSQVGGQHKGKNNQGEITRLCLEALRHLTTVMEEKE 1189

Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEES 2622
            YWEF V ER  LLKF  DE +NSA +R H++QC  +  +L Q+LRS + E K LK +EE+
Sbjct: 1190 YWEFNVHERTFLLKFLCDELLNSALLRQHLEQCTEVTAELQQKLRSFSVEFKNLKSREET 1249

Query: 2621 LAANVAKMKGNVHTGGGEL------ASVLADESQLPVDNKVSSF-----------SGGSV 2493
            +AA VAK++ ++     E+      A+V+ +  +     + SS            SG   
Sbjct: 1250 VAARVAKVEASMTYSVAEVCMKEGPATVIRNNGKCIEQPQNSSNRSNCSVIALEESGPMY 1309

Query: 2492 PMDGG-----PH-TKDQVSILRSDVNLHPK---LGDTXXXXXXXXXXXQMLGYGLSNTVS 2340
            P D       PH    ++   ++D ++ P    L  +               + L     
Sbjct: 1310 PTDAEGQIEEPHGDNSKMPSQKNDESIKPNEHPLASSLPQEIDNLSGEIRSQHNLQELAR 1369

Query: 2339 KNITV------HASSFPGHQYSNQPNANSLLD----YNAELSKLQQEISGLQESISTAES 2190
               T+      H  S P   +  +   +  ++    +N EL+ ++ +I  LQESI++ E 
Sbjct: 1370 DAATLASPSNNHGPSVPNELHVTEGTCSVTMNEPQAHNLELNNIRNDILLLQESITSLEQ 1429

Query: 2189 ELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGS---------------------L 2073
            +LLK+S+R+EFLG DS GRLYWV       P ++ +GS                     L
Sbjct: 1430 QLLKLSVRREFLGSDSSGRLYWVLPLPGMHPCLIVDGSPELQQKRKILDFRGPVDKGLVL 1489

Query: 2072 NPESEFGLDNHFPK-------------------SSSWMYYDTVAEIGELVKWLDDCDIRE 1950
               S  G D +                      SS W+ Y T AEI ELV WL D D +E
Sbjct: 1490 KNSSSSGSDAYSSSKGSKACCPFQYDPYAVTATSSHWILYQTDAEIEELVNWLRDNDPKE 1549

Query: 1949 RDLKESILQWQSNKSLNSNY-QRNDFLKGKPSSSVISSEQRVPNHNCRVLKGVRALEKKF 1773
            R+LK+SIL W+  +  +S + ++  + + + +SS  ++  +V   +C V K    LEKK+
Sbjct: 1550 RELKDSILNWKKIRFQDSQHTKKQSWDEYQSASSAPTNSDKVDCFDCLVTKAATLLEKKY 1609

Query: 1772 GLGMDMGADYINKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHG 1593
            G   +  ++ + K       +T Q +M RC CLE IWPSR+HC SCH+TFS + E ++H 
Sbjct: 1610 GPCFE--SEEVLKKGGKRARVTSQEKMYRCECLEPIWPSRNHCLSCHRTFSTAVEFEEHN 1667

Query: 1592 EKCRAAATGAAGSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSNCVKXXXXXXXXX 1413
            + C +A      + +   ++  +G +    S     + +     ++ + +          
Sbjct: 1668 DTCNSAPPAYEKNKEASNSLKGKGNKKSDISHAAGGTDVELVETSKPSGLIRFQNDGCPF 1727

Query: 1412 XXXEIMANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSMED 1233
               EI + F    S +E ++ IGL G+ G+   +P+ SP  +D  L L+ P   EV + D
Sbjct: 1728 DLNEISSKFMTQDSNKELVQEIGLLGSKGIPSLIPSVSPFLSDSTLMLMSPQ-KEVGVPD 1786

Query: 1232 -----QTSVPKSQQKLS------DDLAKTASVVNDKSSGIEKGKGKVSEVECMKSRVLCE 1086
                   ++  SQ K S      D++A  AS  +  +   E  K K     C + R    
Sbjct: 1787 GQLMASETLSSSQGKQSMKNAGNDNMADDASRKSGSNGTHEVLKSKKPAFGCSEQRDRKS 1846

Query: 1085 RGRLXXXXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASR 906
               +            +  ++ +SSLRP +G  +++ R LK+++LDIDAALPE ALR S+
Sbjct: 1847 SSHVRVPKVGIN----QCCVVPQSSLRPLIGRTSQIKRRLKVNLLDIDAALPEEALRPSK 1902

Query: 905  SHLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSAL 726
            +HL+RR AWR FVK A T+YEM+ A IILED I+T++L+N WWYWSS SAA+   T+S+L
Sbjct: 1903 AHLERRWAWRAFVKSAETIYEMVQATIILEDMIKTEFLRNEWWYWSSLSAAAKTSTMSSL 1962

Query: 725  ALRIFTLDSAILYEKSSTDDTAGTSIPVSQPDKEASSSAIPTTEMKSAEQPMQKVN 558
            ALRI++LD+AI+Y+KS+T+     ++ +   D       +P  E+    +  +K N
Sbjct: 1963 ALRIYSLDAAIIYDKSTTNLNPVENLKL---DSTPEHKPLPGVELLEKSKVSRKSN 2015


>ref|XP_006470355.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
            isoform X1 [Citrus sinensis]
          Length = 2159

 Score =  503 bits (1296), Expect = e-139
 Identities = 327/956 (34%), Positives = 496/956 (51%), Gaps = 88/956 (9%)
 Frame = -3

Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982
            E+ KE +++L  V  S +PKAPW+EGICKVCG+DKDD +VLLCDTCD+EYHTYCL PPL 
Sbjct: 1208 ETTKEINDIL--VQTSEIPKAPWDEGICKVCGVDKDDDSVLLCDTCDAEYHTYCLEPPLV 1265

Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802
            ++P+GNWYCPSCV  NS+ + A+  +QV  Q   +  Q ++    LE L  L   ME KE
Sbjct: 1266 RIPEGNWYCPSCVVRNSMVQGASEHSQVGGQHKGKNNQGEITRLCLEALRHLTTVMEEKE 1325

Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEES 2622
            YWEF V ER  LLKF  DE +NSA +R H++QC  +  +L Q+LRS + E K LK +EE+
Sbjct: 1326 YWEFNVHERTFLLKFLCDELLNSALLRQHLEQCTEVTAELQQKLRSFSVEFKNLKSREET 1385

Query: 2621 LAANVAKMKGNVHTGGGEL------ASVLADESQLPVDNKVSSF-----------SGGSV 2493
            +AA VAK++ ++     E+      A+V+ +  +     + SS            SG   
Sbjct: 1386 VAARVAKVEASMTYSVAEVCMKEGPATVIRNNGKCIEQPQNSSNRSNCSVIALEESGPMY 1445

Query: 2492 PMDGG-----PH-TKDQVSILRSDVNLHPK---LGDTXXXXXXXXXXXQMLGYGLSNTVS 2340
            P D       PH    ++   ++D ++ P    L  +               + L     
Sbjct: 1446 PTDAEGQIEEPHGDNSKMPSQKNDESIKPNEHPLASSLPQEIDNLSGEIRSQHNLQELAR 1505

Query: 2339 KNITV------HASSFPGHQYSNQPNANSLLD----YNAELSKLQQEISGLQESISTAES 2190
               T+      H  S P   +  +   +  ++    +N EL+ ++ +I  LQESI++ E 
Sbjct: 1506 DAATLASPSNNHGPSVPNELHVTEGTCSVTMNEPQAHNLELNNIRNDILLLQESITSLEQ 1565

Query: 2189 ELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGS---------------------L 2073
            +LLK+S+R+EFLG DS GRLYWV       P ++ +GS                     L
Sbjct: 1566 QLLKLSVRREFLGSDSSGRLYWVLPLPGMHPCLIVDGSPELQQKRKILDFRGPVDKGLVL 1625

Query: 2072 NPESEFGLDNHFPK-------------------SSSWMYYDTVAEIGELVKWLDDCDIRE 1950
               S  G D +                      SS W+ Y T AEI ELV WL D D +E
Sbjct: 1626 KNSSSSGSDAYSSSKGSKACCPFQYDPYAVTATSSHWILYQTDAEIEELVNWLRDNDPKE 1685

Query: 1949 RDLKESILQWQSNKSLNSNY-QRNDFLKGKPSSSVISSEQRVPNHNCRVLKGVRALEKKF 1773
            R+LK+SIL W+  +  +S + ++  + + + +SS  ++  +V   +C V K    LEKK+
Sbjct: 1686 RELKDSILNWKKIRFQDSQHTKKQSWDEYQSASSAPTNSDKVDCFDCLVTKAATLLEKKY 1745

Query: 1772 GLGMDMGADYINKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHG 1593
            G   +  ++ + K       +T Q +M RC CLE IWPSR+HC SCH+TFS + E ++H 
Sbjct: 1746 GPCFE--SEEVLKKGGKRARVTSQEKMYRCECLEPIWPSRNHCLSCHRTFSTAVEFEEHN 1803

Query: 1592 EKCRAAATGAAGSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSNCVKXXXXXXXXX 1413
            + C +A      + +   ++  +G +    S     + +     ++ + +          
Sbjct: 1804 DTCNSAPPAYEKNKEASNSLKGKGNKKSDISHAAGGTDVELVETSKPSGLIRFQNDGCPF 1863

Query: 1412 XXXEIMANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSMED 1233
               EI + F    S +E ++ IGL G+ G+   +P+ SP  +D  L L+ P   EV + D
Sbjct: 1864 DLNEISSKFMTQDSNKELVQEIGLLGSKGIPSLIPSVSPFLSDSTLMLMSPQ-KEVGVPD 1922

Query: 1232 -----QTSVPKSQQKLS------DDLAKTASVVNDKSSGIEKGKGKVSEVECMKSRVLCE 1086
                   ++  SQ K S      D++A  AS  +  +   E  K K     C + R    
Sbjct: 1923 GQLMASETLSSSQGKQSMKNAGNDNMADDASRKSGSNGTHEVLKSKKPAFGCSEQRDRKS 1982

Query: 1085 RGRLXXXXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASR 906
               +            +  ++ +SSLRP +G  +++ R LK+++LDIDAALPE ALR S+
Sbjct: 1983 SSHVRVPKVGIN----QCCVVPQSSLRPLIGRTSQIKRRLKVNLLDIDAALPEEALRPSK 2038

Query: 905  SHLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSAL 726
            +HL+RR AWR FVK A T+YEM+ A IILED I+T++L+N WWYWSS SAA+   T+S+L
Sbjct: 2039 AHLERRWAWRAFVKSAETIYEMVQATIILEDMIKTEFLRNEWWYWSSLSAAAKTSTMSSL 2098

Query: 725  ALRIFTLDSAILYEKSSTDDTAGTSIPVSQPDKEASSSAIPTTEMKSAEQPMQKVN 558
            ALRI++LD+AI+Y+KS+T+     ++ +   D       +P  E+    +  +K N
Sbjct: 2099 ALRIYSLDAAIIYDKSTTNLNPVENLKL---DSTPEHKPLPGVELLEKSKVSRKSN 2151


>gb|EXC31622.1| Methyl-CpG-binding domain-containing protein 9 [Morus notabilis]
          Length = 2259

 Score =  475 bits (1222), Expect = e-131
 Identities = 331/1003 (33%), Positives = 480/1003 (47%), Gaps = 135/1003 (13%)
 Frame = -3

Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982
            E +KE D +L+  N   +PKAPW+EG+CKVCG+D+DD +VLLCDTCD+EYHTYCLNPPL 
Sbjct: 1267 EMRKEIDYLLSSTN--VIPKAPWDEGVCKVCGIDRDDDSVLLCDTCDAEYHTYCLNPPLL 1324

Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802
            ++P+GNWYCPSCV G    +D     QV+ Q   ++YQ ++    LE LA LA KME KE
Sbjct: 1325 RIPEGNWYCPSCVVGRRTVQDVPENVQVIRQRSGKKYQGEVTRVYLEALAHLATKMEEKE 1384

Query: 2801 YWEFTVEERILLL----------------------------------------KFFSDEA 2742
            YWEF+V+E +LLL                                        KF  DE 
Sbjct: 1385 YWEFSVDESMLLLRPTLRKGRPGEGRLGKARVGHPEWAAVDVGVGSVVRSFLMKFLCDEL 1444

Query: 2741 MNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEESLAANVAKMKGNVHTGGGELA 2562
            +NSA IR H++QCA    +L Q+LR+L  E KILK +EE L A  AK   N+    G + 
Sbjct: 1445 LNSAIIRQHLEQCADTSTELQQKLRALFVEWKILKSREEILVARAAKHDPNILNSLGAVG 1504

Query: 2561 --------------SVLADESQ---LPVDNKVSSFSGGSVP-----MDGGPHTKDQVSIL 2448
                            L+D S    +  D+ +S+  GG        +D      D  S  
Sbjct: 1505 IRESLFSNHNKGQTPALSDRSNCCGMSTDD-LSTLGGGREAIEPSGLDRSSSATDSQSNC 1563

Query: 2447 RSDVNLHPKLGDTXXXXXXXXXXXQMLGYGLS---------NTVSKNITVHASSFPGHQY 2295
            ++ ++   +L D                              +V K+ +        H +
Sbjct: 1564 QNPLDTEDQLKDAHASVEESNTVLNEADASCGAICSTGNPHESVGKDSSSTLKPVGQHGH 1623

Query: 2294 SNQPNA-------------NSLLDYNAELSKLQQEISGLQESISTAESELLKVSLRKEFL 2154
            SN  +              N L  ++ EL  ++ +I+ L+ESI++ ESELLKVS+R+EFL
Sbjct: 1624 SNASDVRSTIGQSVPAATVNELQGHHVELKSVKNDITILEESITSVESELLKVSVRREFL 1683

Query: 2153 GRDSYGRLYWVFGSYDASPWIVANGSLNPESEFGLDNH---------------------- 2040
            G D  G LYWV G+   S  I+ + S    S   ++N                       
Sbjct: 1684 GSDFVGCLYWVSGTPTGSSCIIVDRSAALRSGKKMNNFQRPVGKSSVLQCSIQSVPIQCE 1743

Query: 2039 ----FPKSSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQSNKSLNSNYQRNDFL 1872
                    S W+ Y T  +I +LV  L   D +ER+LKESIL WQ  K     +Q+N  +
Sbjct: 1744 RNSVVASDSPWVSYQTDGDIDQLVSCLKTNDTKERELKESILHWQ--KLRFQEFQKNK-I 1800

Query: 1871 KGKPSSSVIS---SEQRVPNHNCRVLKGVRALEKKFGLGMDMGADYINKNLEHNHEMTYQ 1701
            +G+   +  +   S ++    +  V +    LEK++G    +    I K       +T  
Sbjct: 1801 RGQAECAAFAASISGEKATFSDGLVTRAANLLEKRYGPCNQLETTDILKKRGKKARLTDD 1860

Query: 1700 GRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGE-KCRAAATG------------AA 1560
             +M RC CLELIWP RHHC SCH+TF    EL+ H E KC + A              A 
Sbjct: 1861 NKMYRCECLELIWPCRHHCLSCHRTFFNDIELEGHNEGKCNSVALAQEKRKEISDSSKAK 1920

Query: 1559 GSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSNCVKXXXXXXXXXXXXE-IMANFK 1383
             SLK D    +    +      IP++  +E     +  +K            E I + F 
Sbjct: 1921 DSLKSDANREDSTGEM--SRVEIPKTGFSE---LSAKLIKFQDEGLSCPYDFEEICSKFV 1975

Query: 1382 VDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSMEDQTSVPKSQQK 1203
               S ++ ++ IGL G+ GV  F+ + SPC +D  L L+ P  +  +    +   +    
Sbjct: 1976 TKDSCKDLVQEIGLIGSKGVPSFVSSMSPCLDDSTLALISPQKDVGAQGGGSEAAERPVS 2035

Query: 1202 LSDDLAKTAS--VVNDKSSGIEKGKGKVSEVECMKSRVLC------ERGRLXXXXXXXXX 1047
            L       A   +++D+S      +  + E+  +KS+ L         G           
Sbjct: 2036 LGTGTITIAGWDILSDRSPK----RSAMKEINAVKSQRLTLGYIEQREGIRCSGSHSSEM 2091

Query: 1046 STARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRRCAWRTFV 867
               R  ++ + SLRP VG  +++ R LKI++LD+DAALPE ALR S+SHL RR AWR FV
Sbjct: 2092 GATRCCVVPQFSLRPLVGKVSQIYRRLKINLLDMDAALPEEALRPSKSHLGRRWAWRAFV 2151

Query: 866  KCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFTLDSAILY 687
            K A T+YEM+ A I+LED I+T+YLKN WWYWSS SAA+   T+S+LALRI++LD+AI+Y
Sbjct: 2152 KSATTIYEMVQATIVLEDMIKTEYLKNEWWYWSSFSAAARTSTMSSLALRIYSLDAAIIY 2211

Query: 686  EKSSTDDTAGTSIPVSQPDKEASSSAIPTTEMKSAEQPMQKVN 558
            EK S++         S+P   +    +P  ++    +  ++ N
Sbjct: 2212 EKISSESDPTDK---SEPSNLSEQKPVPVIDLTEKTKITRRSN 2251


>ref|XP_002517349.1| DNA binding protein, putative [Ricinus communis]
            gi|223543360|gb|EEF44891.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 2145

 Score =  474 bits (1220), Expect = e-130
 Identities = 320/919 (34%), Positives = 475/919 (51%), Gaps = 87/919 (9%)
 Frame = -3

Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982
            E++K+ D VLA  N+  +PKAPW+EG+CKVCG DKDD +VLLCDTCD+EYHTYCLNPPL 
Sbjct: 1198 ETKKDLDIVLASTNE--IPKAPWDEGVCKVCGFDKDDDSVLLCDTCDAEYHTYCLNPPLA 1255

Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802
            ++P+GNWYCPSCV+   + E A+  TQV+ Q   ++YQ ++    LE L  LA+ ME K+
Sbjct: 1256 RIPEGNWYCPSCVSVRMVQE-ASVSTQVIGQNSCKKYQGEMTRIYLETLVHLASAMEEKD 1314

Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEES 2622
            YW+F V+ER  LLKF  DE +NSA +R H++QC     ++ Q+LR+L +E K LK KEE 
Sbjct: 1315 YWDFGVDERTFLLKFLCDELLNSALVRQHLEQCMESTAEVQQKLRTLYAEWKNLKSKEEF 1374

Query: 2621 LAANVAKM----KGNVHTGGGELASVLADE----SQLPV-DNKVSSFSGGS---VPMDGG 2478
            +A   AKM     G V  G   L S L D+     Q PV  +K S     S     +DG 
Sbjct: 1375 MALKSAKMGTGASGEVKEG---LVSALKDQGKSVGQPPVLGDKPSDCCAPSDDVSAVDGS 1431

Query: 2477 PHTKDQVSILR--SDVNLHPK-------LGDTXXXXXXXXXXXQMLGYGLSNTVSK-NIT 2328
            P         +  S++N   K       +  T            M G   SN  SK N  
Sbjct: 1432 PEGNGINGFDKHPSEINYEKKPSHDSQNIDSTNNHGPVKDMHDAMEG---SNDPSKENSK 1488

Query: 2327 VHASSFPGHQYSNQPNANSLLD-----------YNAELSKLQQEISGLQESISTAESELL 2181
                + PG   S+  NA  +L+           Y+ ++S ++ +I  LQ  IS+ ES+L 
Sbjct: 1489 PLGPNHPGFSLSSDMNALVVLNLPSVTMNESQAYHTDVSAIKDDILRLQNLISSMESQLS 1548

Query: 2180 KVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGS------------------LNPESEF 2055
            K SLR+EFLG DS G LYW   + +  P IV + S                  L   S  
Sbjct: 1549 KQSLRREFLGSDSRGHLYWASATPNGHPQIVVDRSLTFQHRKISHHRLGNSSVLQHSSSS 1608

Query: 2054 GLD-------------------NHFPKSSSWMYYDTVAEIGELVKWLDDCDIRERDLKES 1932
            G+D                        SS+W+ Y+T AEI EL+ WL + + +E +LKES
Sbjct: 1609 GIDACLNLEGSRACFPFLFNPNGTLSMSSAWVSYETDAEIEELIGWLGNNNQKEIELKES 1668

Query: 1931 ILQWQSNKSLNSNYQRNDFLKG-KPSSSVISSEQRVPNHNCRVLKGVRALEKKFGLGMDM 1755
            I+QW   +   S   R+   +  +   S I +  +    NC + K    LEK +G  +++
Sbjct: 1669 IMQWLKLRFQESQRIRDPVQEECRAGLSTIRNNDQTAFSNC-LTKATLLLEKNYGAFVEL 1727

Query: 1754 GADYINKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGE-KCRA 1578
                + K        T + +  RC CLELIWPSR+HC+SCH+T S   E + H + +C +
Sbjct: 1728 DTSDMLKKRGKKARGTNEEKTYRCDCLELIWPSRNHCYSCHRTSSNDVEFEGHSDGRCSS 1787

Query: 1577 AATGAAGSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSNCVK--------XXXXXX 1422
                   S + + ++  +G      +    +S +++ H +     +              
Sbjct: 1788 VPQSREKSEETNDSLKGRGNVKAEVTWKEKKSEIDKLHSSMGGLSELRARLIKFQNEGIN 1847

Query: 1421 XXXXXXEIMANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVS 1242
                  +I + F  + S +E ++ IGL G+NG+ PF+ + SP  +D    L+ P  N   
Sbjct: 1848 CPYDLLDICSKFVTEDSNKELVQDIGLIGSNGIPPFVTSISPYLSDSISVLISPENNTRI 1907

Query: 1241 MEDQTSVPKSQQKLSDDLAKTASVVNDKSSGIEKGKGKVSEV-ECMKSR-----VLCERG 1080
              D+ +V + Q     +  +  +V+   S    + K  ++E+ E +K+       L  RG
Sbjct: 1908 PGDECNVDERQVFPQGNWNENRAVLQSSSDNSTR-KTSINEIGEVLKTNKPPLGCLQRRG 1966

Query: 1079 -RLXXXXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRS 903
             +                ++ ESSL P VG  + +LR LKI++LD++AALPE ALR ++ 
Sbjct: 1967 KKSSLGKCFPEMGPGCCCVVPESSLMPLVGKVSSILRQLKINLLDMEAALPEEALRPAKG 2026

Query: 902  HLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALA 723
             L RR AWR +VK A ++Y+M+ A I+LE+ I+T+YL+N WWYWSS SAA+   T+++LA
Sbjct: 2027 QLGRRWAWRAYVKSAESIYQMVRATIMLEEMIKTEYLRNEWWYWSSLSAAAKTSTVASLA 2086

Query: 722  LRIFTLDSAILYEKSSTDD 666
            LRI++LD+ I+YEK+S  D
Sbjct: 2087 LRIYSLDACIVYEKNSNSD 2105


>ref|XP_004141185.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
            [Cucumis sativus]
          Length = 2131

 Score =  472 bits (1215), Expect = e-130
 Identities = 320/938 (34%), Positives = 481/938 (51%), Gaps = 82/938 (8%)
 Frame = -3

Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982
            E++ E D  L  +N+  +PKAPW+EG+CKVCG+DKDD +VLLCDTCD+EYHTYCLNPPL 
Sbjct: 1191 ETKVEVDGFLVSLNE--IPKAPWDEGVCKVCGIDKDDDSVLLCDTCDAEYHTYCLNPPLA 1248

Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQ--VVNQCGKRRYQRKLIDETLEGLAQLANKMEL 2808
            ++P+GNWYCPSCV G  + ED +  T+  ++N    ++++ ++  + L  LA LA  +E 
Sbjct: 1249 RIPEGNWYCPSCVMGTRMVEDPSEHTKNHIINLHKGKKFRGEVTRDFLNKLANLAAALEE 1308

Query: 2807 KEYWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKE 2628
            KEYWEF+V+ER+ LLK+  DE ++SA IR H++QC     +L Q+LRS   E K LK +E
Sbjct: 1309 KEYWEFSVDERLFLLKYLCDELLSSALIRQHLEQCVEALAELQQKLRSCFIEWKNLKCRE 1368

Query: 2627 ESLAANVAKM---------KGNVHTGGGELASVLADESQLPVDNKVSSFSGGSVPMDGGP 2475
            E +AA  AK+         +G     G  L +     S   ++NK  + +     M    
Sbjct: 1369 EVVAARAAKLDTTMLSAVREGQGSCDGARLGASDQYSSLTSLENKCHNHASFQEQMSSAH 1428

Query: 2474 HTKDQVSILRSDVNLHPKLGDTXXXXXXXXXXXQMLG--YGLSNTVSKNITVHASSFP-G 2304
               D      +  N+    G              + G    +  +   N+    S  P G
Sbjct: 1429 DVTDNND---AGGNVLSSSGSQNSGKPVKFNEPSLSGLPQEVDGSDQSNMETEISILPSG 1485

Query: 2303 HQYSNQPNANSL------------LDYNAELSKLQQEISGLQESISTAESELLKVSLRKE 2160
             QY    +AN +              Y++EL  ++++I  +Q+SI++ E ELLK+S+R+E
Sbjct: 1486 KQYFTPCDANGVPVAPQVPPPNESQAYHSELDSIKKDILQVQDSIASTELELLKISVRRE 1545

Query: 2159 FLGRDSYGRLYWVFGSYDASPWIVANGS---LNPESE----------------------- 2058
            FLG D+ GRLYW     +  P I+++GS   +  ES                        
Sbjct: 1546 FLGSDAAGRLYWASVMSNGLPQIISSGSSVHIGSESRDRVVKGRFFKNYTSTSNANSSTL 1605

Query: 2057 ----FGLDNHFPK----SSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQSNK-- 1908
                +    H PK    +S  + Y T A+I EL+ WL D D +ER+LKESILQW   K  
Sbjct: 1606 NSNMYSSLLHLPKDFIGNSPCISYQTEADILELIDWLKDSDPKERELKESILQWLKPKLQ 1665

Query: 1907 --SLNSNYQRNDFLKGKPSSSVISSEQRVPNHNCRVLKGVRALEKKFGLGMD-MGADYIN 1737
              S ++N    + LK   SSS +   +++      V +    LE K+G  ++ +  D +N
Sbjct: 1666 TSSRSNNQSPEEQLKDSSSSSDV---EKLECSGFLVNRASALLESKYGPFLEFVTPDDLN 1722

Query: 1736 KNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGEKCRAAATGAAG 1557
            + L+    +    +M RCVC+E +WPSR+HC SCH++FS   EL++H     ++   +  
Sbjct: 1723 RWLD-KARLAEDEKMFRCVCMEPVWPSRYHCLSCHRSFSTDVELEEHDNGQCSSLPASCD 1781

Query: 1556 SLKH--DKTVMNQGIRICPGSSNIPQSVLNEKHDTQSN----CVK-XXXXXXXXXXXXEI 1398
             +K   D +     I+           V+ E      N     +K              I
Sbjct: 1782 GIKEVGDSSKSKCNIKFESKQEESSSMVIAETSRGYFNHSMGLIKYQNDGMMCPYDFELI 1841

Query: 1397 MANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSMEDQTSV- 1221
             + F    S ++ IK IGL  +NGV  FL + SP   +  L ++    +  + ED T + 
Sbjct: 1842 CSKFLTKDSNKDLIKEIGLISSNGVPSFLSSVSPYIMESTLNVIDLKKDSSTPEDGTLLS 1901

Query: 1220 --PKSQQKLSDDLAKTASVVN---DKSSG--IEKGKGKVSEVECM--KSRVLCERGRLXX 1068
              P  +  + ++    +S ++    K +G  I   K K     C+  KS+ +C   R   
Sbjct: 1902 EWPSLENIILENGCHQSSSIDSSIQKPAGNEISAPKTKRLAAGCLEPKSKKICMDNRF-- 1959

Query: 1067 XXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRR 888
                      R  +I +SS RP VG   +V+R LK+++LD+DAALP+ AL+ S+ H++RR
Sbjct: 1960 ----SEFGIGRCFVIPQSSQRPLVGKILQVVRGLKMNLLDMDAALPDEALKPSKLHIERR 2015

Query: 887  CAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFT 708
             AWR FVK A T+YEM+ A I LED IRT+YLKN WWYWSS SAA+ + T+S+LALRIF+
Sbjct: 2016 WAWRAFVKSAGTIYEMVQATIALEDMIRTEYLKNEWWYWSSLSAAAKISTVSSLALRIFS 2075

Query: 707  LDSAILYEKSSTDDTAGTSIPVSQPDKEASSSAIPTTE 594
            LD+AI+YEK S +  +   +  +    E     +  TE
Sbjct: 2076 LDAAIIYEKISPNQDSNDYLDTTSSIPEQKLGGVDLTE 2113


>ref|XP_004167238.1| PREDICTED: LOW QUALITY PROTEIN: methyl-CpG-binding domain-containing
            protein 9-like [Cucumis sativus]
          Length = 1277

 Score =  471 bits (1211), Expect = e-129
 Identities = 317/933 (33%), Positives = 477/933 (51%), Gaps = 77/933 (8%)
 Frame = -3

Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982
            E++ E D  L  +N+  +PKAPW+EG+CKVCG+DKDD +VLLCDTCD+EYHTYCLNPPL 
Sbjct: 338  ETKVEVDGFLVSLNE--IPKAPWDEGVCKVCGIDKDDDSVLLCDTCDAEYHTYCLNPPLA 395

Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQ-VVNQCGKRRYQRKLIDETLEGLAQLANKMELK 2805
            ++P+GNWYCPSCV G  + ED +  T+ ++N    ++++ ++  + L  LA LA  +E K
Sbjct: 396  RIPEGNWYCPSCVMGTRMVEDPSEHTKHIINLHKGKKFRGEVTRDFLNKLANLAAALEEK 455

Query: 2804 EYWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEE 2625
            EYWEF+V+ER+ LLK+  DE ++SA IR H++QC     +L Q+LRS   E K LK +EE
Sbjct: 456  EYWEFSVDERLFLLKYLCDELLSSALIRQHLEQCVEALAELQQKLRSCFIEWKNLKCREE 515

Query: 2624 SLAANVAKM---------KGNVHTGGGELASVLADESQLPVDNKVSSFSGGSVPMDGGPH 2472
             +AA  AK+         +G     G  L +     S   ++NK  + +     M     
Sbjct: 516  VVAARAAKLDTTMLSAVREGQGSCDGARLGASDQYSSLTSLENKCHNHASFQEQMSSAHD 575

Query: 2471 TKDQVSILRSDVNLHPKLGDTXXXXXXXXXXXQMLG--YGLSNTVSKNITVHASSFP-GH 2301
              D      +  N+    G              + G    +  +   N+    S  P G 
Sbjct: 576  VTDNND---AGGNVLSSSGSQNSGKPVKFNEPSLSGLPQEVDGSDQSNMETEISILPSGK 632

Query: 2300 QYSNQPNANSL------------LDYNAELSKLQQEISGLQESISTAESELLKVSLRKEF 2157
            QY    +AN +              Y++EL  ++++I  +Q+SI++ E ELLK+S+R+EF
Sbjct: 633  QYFTPCDANGVPVAPQVPPPNESQAYHSELDSIKKDILQVQDSIASTELELLKISVRREF 692

Query: 2156 LGRDSYGRLYWVFGSYDASPWIVANGS---LNPESE------------------------ 2058
            LG D+ GRLYW     +  P I+++GS   +  ES                         
Sbjct: 693  LGSDAAGRLYWASVMSNGLPQIISSGSSVHIGSESRDRVVKGRFFKNYTSTSNANSSTLN 752

Query: 2057 ---FGLDNHFPK----SSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQSNK--- 1908
               +    H PK    +S  + Y T A+I EL+ WL D D +ER+LKESILQW   K   
Sbjct: 753  SNMYSSLLHLPKDFIGNSPCISYQTEADILELIDWLKDSDPKERELKESILQWLKPKLQT 812

Query: 1907 -SLNSNYQRNDFLKGKPSSSVISSEQRVPNHNCRVLKGVRALEKKFGLGMD-MGADYINK 1734
             S ++N    + LK   SSS +   +++      V +    LE K+G  ++ +  D +N+
Sbjct: 813  SSRSNNQSPEEQLKDSSSSSDV---EKLECSGFLVNRASALLESKYGPFLEFVTPDDLNR 869

Query: 1733 NLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGEKCRAAATGAAGS 1554
             L+    +    +M RCVC+E +WPSR+HC SCH++FS   EL++H     ++   +   
Sbjct: 870  WLD-KARLAEDEKMFRCVCMEPVWPSRYHCLSCHKSFSTDVELEEHDNGQCSSLPASCDG 928

Query: 1553 LKH--DKTVMNQGIRICPGSSNIPQSVLNEKHDTQSN----CVK-XXXXXXXXXXXXEIM 1395
            +K   D +     I+           V+ E      N     +K              I 
Sbjct: 929  IKEVGDSSKSKCNIKFESKQEESSSMVIAETSRGYFNHSMGLIKYQNDGMMCPYDFELIC 988

Query: 1394 ANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSMEDQTSVPK 1215
            + F    S ++ IK IGL  +NGV  FL + SP   +  L ++    +  + ED T + +
Sbjct: 989  SKFLTKDSNKDLIKEIGLISSNGVPSFLSSVSPYIMESTLNVIDLKKDSSTPEDGTLLSE 1048

Query: 1214 SQQKLSDDLAKTASVVNDKSSGIEKGKGKVSEVECMKSRVLC------ERGRLXXXXXXX 1053
                 +  L       +   S I+K  G  +E+   K++ L       +  +        
Sbjct: 1049 WPSLENIILENGCHQSSSIDSSIQKPAG--NEISAPKTKRLAAGCLEPKSKKSXMDNRFS 1106

Query: 1052 XXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRRCAWRT 873
                 R  +I +SS RP VG   +V+R LK+++LD+DAALP+ AL+ S+ H++RR AWR 
Sbjct: 1107 EFGIGRCFVIPQSSQRPLVGKILQVVRGLKMNLLDMDAALPDEALKPSKLHIERRWAWRA 1166

Query: 872  FVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFTLDSAI 693
            FVK A T+YEM+ A I LED IRT+YLKN WWYWSS SAA+ + T+S+LALRIF+LD+AI
Sbjct: 1167 FVKSAGTIYEMVQATIALEDMIRTEYLKNEWWYWSSLSAAAKISTVSSLALRIFSLDAAI 1226

Query: 692  LYEKSSTDDTAGTSIPVSQPDKEASSSAIPTTE 594
            +YEK S +  +   +  +    E     +  TE
Sbjct: 1227 IYEKISPNQDSNDYLDTTSSIPEQKLGGVDLTE 1259


>gb|EOY02356.1| Methyl-CpG-binding domain-containing protein 9, putative isoform 1
            [Theobroma cacao] gi|508710460|gb|EOY02357.1|
            Methyl-CpG-binding domain-containing protein 9, putative
            isoform 1 [Theobroma cacao]
          Length = 2225

 Score =  470 bits (1209), Expect = e-129
 Identities = 329/944 (34%), Positives = 486/944 (51%), Gaps = 105/944 (11%)
 Frame = -3

Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982
            E++KE +++LA  + S +PKAPW+EG+CKVCG+DKDD +VLLCDTCD+EYHTYCLNPPL 
Sbjct: 1265 ETKKEINDLLA--STSEIPKAPWDEGVCKVCGIDKDDDSVLLCDTCDAEYHTYCLNPPLA 1322

Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802
            ++P+GNWYCPSCV    + +DA+  +QV+ +   ++YQ ++    LE LA L   +E KE
Sbjct: 1323 RIPEGNWYCPSCVLSKRMVQDASEHSQVIIRRRDKKYQGEVTRGYLEALAHLGAVLEEKE 1382

Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEES 2622
            YW+F+++ERI LLKF  DE +NSA IR H++QCA    +LHQ+LRS   E K LK +E+ 
Sbjct: 1383 YWQFSIDERIFLLKFLCDELLNSALIRQHLEQCAETS-ELHQKLRSAYVEWKNLKSREDF 1441

Query: 2621 LAANVAKMKGNVHTGGGELASVLADESQLPVD-----------NKVSSFS---------G 2502
            +AA  AK+  ++    G++  V   +  LP D           NK +S +         G
Sbjct: 1442 VAAKAAKIDTSMSNAVGDVG-VKDGDDWLPSDGGKEGADLNGSNKYASATYTEKNFTANG 1500

Query: 2501 GSV-PMDGGPHTK--------DQVSILRSDVNLHP-----------KLGDTXXXXXXXXX 2382
             ++ PMD     K         +VS  +SD +  P           ++ ++         
Sbjct: 1501 QTLNPMDTEAQLKGDQAIVDASKVSSQKSDKSFRPSELLVPNHLSQEIENSSKETSFQGK 1560

Query: 2381 XXQMLGYGLSNTVSKNITVHASSFPGHQYSNQPNA---NSLLDYNAELSKLQQEISGLQE 2211
              +  G  +++  S +       FP    + Q  +   N    ++ EL+ ++ +I  LQ+
Sbjct: 1561 LEESKGMDVASPPSPSDC--NGQFPPSDAAKQVPSVTENESQSHHLELNTIKNDIQRLQD 1618

Query: 2210 SISTAESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGSLNPESE--------- 2058
             I++ ES+LLK+S+RKEFLG DS GRLYW+       P ++ +GSL  + +         
Sbjct: 1619 LITSLESQLLKLSVRKEFLGSDSAGRLYWISAMPGGYPQVIVDGSLVLQKKRKFLGYEER 1678

Query: 2057 -----------FGLDNHFPKSSS-------------------WMYYDTVAEIGELVKWLD 1968
                        G DN      S                   W+ Y T AEI  L+ WL+
Sbjct: 1679 VQNTFIWNSASAGTDNGMKAEGSKASCPFLYNSKDAISVGSPWVTYQTEAEIEGLIDWLN 1738

Query: 1967 DCDIRERDLKESILQWQSNKSLNSNYQR---NDFLKGKPSSSVISSEQRVPNHNCRVLKG 1797
            D + +E++LKE+ILQ    K    +YQ+    D  + + + S+ S   +    +    K 
Sbjct: 1739 DNEPKEKELKEAILQ----KLKFQDYQKMKNQDQDECQTAFSMSSGSDKGSFSSFLGTKA 1794

Query: 1796 VRALEKKFGLGMDMGADYINKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSA 1617
               LEKK+G           K       +    +M RC CLE IWPSR+HC SCH+TF +
Sbjct: 1795 AMLLEKKYGPCFKSEITDSLKKRGKKARVINGDKMYRCKCLEPIWPSRNHCISCHKTFFS 1854

Query: 1616 SEELDQHGE-KCRAAA------TGAAGSLKHDKTVMNQGIRI--CPGSSNIPQSVLNEKH 1464
              E + H + KC   +      T    SLK  K  MN  I    C     I ++  +   
Sbjct: 1855 DVEFEDHNDGKCNLGSPLNEKSTSVGDSLK-GKGNMNIDINRVDCTVDMEIVETSKSGHS 1913

Query: 1463 DTQSNCVKXXXXXXXXXXXXE-IMANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFN 1287
            +  S  +K            E I   F    S EE ++ IGL G+NGV  F+ + S   +
Sbjct: 1914 ELSSRLIKFQNEGLVCPYNFEEISTKFVTRDSNEELVREIGLIGSNGVPSFVSSVSHFVS 1973

Query: 1286 DPALTLVPPSGNEVSMEDQTSVPKSQQKLSDDLAKTASVVNDKSSGIEKGKGKVSEVECM 1107
            D  L  V P      + D+    +     S      A+ +N++ S     +   SE+E  
Sbjct: 1974 DSTLMTVRPHQERGDLGDKLKATE-MPGFSQGNRSVANGINERLSDNSFRRSVASEIEVQ 2032

Query: 1106 KS-----RVLCERGRLXXXXXXXXXS-TARRSIIRESSLRPKVGYATEVLRLLKISMLDI 945
            ++     R L +R R+             R  ++ +SSLRP VG  +++ R LKI++LD+
Sbjct: 2033 RTIRPALRCLEQRDRISSADKYSPELGIGRCCVVPQSSLRPLVGKVSQISRQLKINLLDM 2092

Query: 944  DAALPEAALRASRSHLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSS 765
            DAAL E ALR S++ ++RR AWR+FVK A T+YEM+ A I+LED I+T+YL+N WWYWSS
Sbjct: 2093 DAALSEEALRPSKACMERRWAWRSFVKSAETIYEMVQATIVLEDMIKTEYLRNEWWYWSS 2152

Query: 764  PSAASNMPTLSALALRIFTLDSAILYEKS----STDDTAGTSIP 645
             SAA  + T+S+LALRI++LDSAI+YEKS    S D+   +SIP
Sbjct: 2153 LSAAVKISTVSSLALRIYSLDSAIIYEKSFEFHSIDNLKPSSIP 2196


>ref|XP_002884279.1| methyl-CpG-binding domain 9 [Arabidopsis lyrata subsp. lyrata]
            gi|297330119|gb|EFH60538.1| methyl-CpG-binding domain 9
            [Arabidopsis lyrata subsp. lyrata]
          Length = 2183

 Score =  468 bits (1204), Expect = e-129
 Identities = 317/944 (33%), Positives = 474/944 (50%), Gaps = 84/944 (8%)
 Frame = -3

Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982
            E +KE  +++  VN   LPKAPW+EG+CKVCG+DKDD +VLLCDTCD+EYHTYCLNPPL 
Sbjct: 1265 EMKKEIKDIVVSVN--KLPKAPWDEGVCKVCGVDKDDDSVLLCDTCDAEYHTYCLNPPLI 1322

Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802
            ++P+GNWYCPSCV    ++++A    ++V +   R+YQ +L   ++E  A LA+ ME K+
Sbjct: 1323 RIPEGNWYCPSCVIAKRMAQEALESYKLVRRRKGRKYQGQLTRTSMEMTAHLADVMEEKD 1382

Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEES 2622
            YWEF+ EERILLLK   DE ++S+ +  H++QCA   +++ Q+LRSL+SE K  K+++E 
Sbjct: 1383 YWEFSAEERILLLKLLCDELLSSSLVHQHLEQCAEAIIEMQQKLRSLSSEWKNAKMRQEF 1442

Query: 2621 LAANVAKMK-------GNVHTGG--------------GELASVLADESQLPVDNKVSSFS 2505
            L A +AK++       G  H  G              G    V  D+S     NK    +
Sbjct: 1443 LTAKLAKVEPSILKEVGEPHNSGHFADQMGCDQRPQEGVGDGVTHDDSSTAYLNK----N 1498

Query: 2504 GGSVPM--DGGPHTKDQVSILRSDVNLHPK----------------LGDTXXXXXXXXXX 2379
             G  P+  D  P          S VN   K                + DT          
Sbjct: 1499 KGKAPLETDSQPGEFQDSQPGESHVNFESKISSPETISSPGRHEKPIADTSPHVTDNPSF 1558

Query: 2378 XQMLGYGLSNTVSKNITVH-----ASSFPGHQYSNQPNANSLLDYNAELSKLQQEISGLQ 2214
             +     L  +V +N   H     A   P    ++   +  L     +L+    EI  LQ
Sbjct: 1559 EKYTSETLHKSVGRNHETHSLNSNAVEIPTAHDASSQASQELQACLQDLNATSHEIHNLQ 1618

Query: 2213 ESISTAESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGSLNPESE-------- 2058
            +SI + ES+LLK S+R++FLG D+ GRLYW     D +P I+ +GS++ +          
Sbjct: 1619 QSIRSIESQLLKQSIRRDFLGNDASGRLYWGCCFPDENPRILVDGSISLQKPVQADLMGS 1678

Query: 2057 -------FGLDNHFPKSSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQSNKSLN 1899
                     +D+   + S W YY+T  EI ELV+WL D D++ERDL+ESIL W+      
Sbjct: 1679 KVPSPFLHAVDHGRLRLSPWTYYETETEISELVQWLHDDDLKERDLRESILCWK------ 1732

Query: 1898 SNYQRNDFLKGKPSSSVISSEQRVPNHNCRVLKGVRALEKKFGLGMDMGADYINKNLEHN 1719
               +  D  K K  +  +S+            K   ++EKK+G  + +  + + K  +  
Sbjct: 1733 -RLRFGDVQKEKKQAQNLSAPILARGLE---TKAAMSMEKKYGPCIKLETETLKKRGKKT 1788

Query: 1718 HEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGE-KC------RAAATGAA 1560
             +++ + ++CRC CLE I PS  HC  CH+TF++ +E ++H E KC         +   +
Sbjct: 1789 -KVSQREKLCRCECLESILPSMIHCLICHKTFASDDEFEEHTESKCIPYSLATEESKEIS 1847

Query: 1559 GSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSNCVKXXXXXXXXXXXXEIMANFKV 1380
             S K  +++ +  + +   +      + N                       EI + F  
Sbjct: 1848 DSSKAKESLKSDYLNVKSSAGKAVGEISNVSELDSGLIRYQEEESISPYHFEEICSKFVT 1907

Query: 1379 DYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPP-------SGNEV---SMEDQ 1230
              S  + +K IGL G+NG+  FLP SS   ND  L    P       SG++V     E  
Sbjct: 1908 KDSNRDLVKEIGLIGSNGIPTFLPASSTHHNDSVLINANPNKLDGGDSGDQVIFAGPETN 1967

Query: 1229 TSVPKSQQKLSDDLAKT------ASVVNDKSSGIEKGKGKVSEVECMKSRVLCERGRLXX 1068
                 S+  LS D + T       + +     G  + K K S    +KS   C       
Sbjct: 1968 VEGLNSESNLSFDGSVTDNHGGPLNKLTGLGFGFSEQKNKKSSGSGLKS---C------- 2017

Query: 1067 XXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRR 888
                         ++ +++L+   G A  V R LK ++LD+D ALPE ALR S+SH DRR
Sbjct: 2018 ------------CVVPQAALKRITGKALPVFRFLKTNLLDMDVALPEEALRPSKSHPDRR 2065

Query: 887  CAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFT 708
             AWR FVK A ++YE++ A  ++ED I+T+YLKN WWYWSS SAA+ + TLSAL++RIF+
Sbjct: 2066 RAWRVFVKSAQSIYELVQATFVVEDMIKTEYLKNEWWYWSSLSAAAKISTLSALSVRIFS 2125

Query: 707  LDSAILYEKSST--DDTAGTSIPVSQPDKEASSSAIPTTEMKSA 582
            LD+AI+Y+K  T  D    T   +S PD++  S  +  ++ KS+
Sbjct: 2126 LDAAIIYDKPITPSDHNDETKPIISSPDQK--SQPVSDSQEKSS 2167


>ref|XP_006408507.1| hypothetical protein EUTSA_v10019872mg [Eutrema salsugineum]
            gi|557109653|gb|ESQ49960.1| hypothetical protein
            EUTSA_v10019872mg [Eutrema salsugineum]
          Length = 2173

 Score =  467 bits (1201), Expect = e-128
 Identities = 312/941 (33%), Positives = 478/941 (50%), Gaps = 65/941 (6%)
 Frame = -3

Query: 3164 TESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPL 2985
            TE +KE  +++  +N   LPKAPW+EG+CKVCG+DKDD +VLLCDTCD+EYHTYCLNPPL
Sbjct: 1262 TEVKKEIKDIVVSIN--KLPKAPWDEGVCKVCGVDKDDDSVLLCDTCDAEYHTYCLNPPL 1319

Query: 2984 GKVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELK 2805
             ++PDGNWYCPSCV    +++DA    ++V +   R+YQ +L   ++E  A LA+ ME K
Sbjct: 1320 IRIPDGNWYCPSCVIAKRIAQDALESYKLVRRRKGRKYQGELTRASMETTAHLADVMEEK 1379

Query: 2804 EYWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEE 2625
            +YWEF+ EERILLLK   DE ++S+ +  H++QCA   +++ Q+LRSL+SE K  K+++E
Sbjct: 1380 DYWEFSTEERILLLKLLCDELLSSSLVHQHLEQCAEAIIEMQQKLRSLSSEWKNTKMRQE 1439

Query: 2624 SLAANVAKMKGNVHTGGGELASVLADESQLPVDNKVSSFSGGSVPMDGGPHTKDQVSILR 2445
             L A +AK++ ++    GE  +  +   Q+  + +     G  V  D         + L 
Sbjct: 1440 FLTAKLAKVEPSILKELGEPQNSSSFAEQIRCNQQQQEGVGDRVTHD---DDTSSAAFLN 1496

Query: 2444 SDVNLHPKLGDTXXXXXXXXXXXQMLGY-----------------------GLSNTVSKN 2334
             +    P + D            + +                          LS     +
Sbjct: 1497 KNQRTTPLMTDAQTEELHVISGERKISTPENVTSPGRPELPIADASPHGTDNLSCEKDSS 1556

Query: 2333 ITVHASSFPGHQY----SNQPNANSLLDYNA-----------ELSKLQQEISGLQESIST 2199
             T+H S    H+     SN   + +  D ++           EL+    EI  LQ+SI +
Sbjct: 1557 DTLHKSVGGNHEIHTLKSNAVESQTAHDASSMASQELQASQQELNATSNEIQNLQQSIRS 1616

Query: 2198 AESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGSLNPESE------------- 2058
             ES+LL+ S+R++FLG D+ GRLYW     +  P I+ +GS++ +               
Sbjct: 1617 IESQLLRQSIRRDFLGSDASGRLYWGCCFPEEHPRILVDGSISLQKSVQVNLTGSKVLSP 1676

Query: 2057 --FGLDNHFPKSSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQSNKSLNSNYQR 1884
                +D+     S W YY+T AEI ELV+WL D D +ER+L+ESIL W   K L     +
Sbjct: 1677 FLHAVDHGRLLVSPWTYYETEAEISELVQWLHDDDPKERELRESILCW---KRLRFGDLQ 1733

Query: 1883 NDFLKGKPSSSVISSEQRVPNHNCRVLKGVRALEKKFGLGMDMGADYINKNLEHNHEMTY 1704
                + + SS  IS+          V K   ++EK++G  + +  + + K  +   ++  
Sbjct: 1734 RGMKQAQNSSCPISA-------GSLVTKAAMSMEKRYGPCIKLETETLKKRGKKT-KVAE 1785

Query: 1703 QGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGE-KC-RAAATGAAGSLKHDKTVM 1530
            + ++CRC CLE I PS  HC  CH+TF++ +E ++H E KC   +     G    D +  
Sbjct: 1786 REKLCRCECLEPILPSMIHCLICHKTFASDDEFEEHTESKCIPYSLASEEGKEISDSSKA 1845

Query: 1529 NQGIR---ICPGSSNIPQSVLNEKHDTQSNCVK-XXXXXXXXXXXXEIMANFKVDYSIEE 1362
              G++   +   ++    + ++   +  S  ++             EI + F    S  +
Sbjct: 1846 KDGLKSDYLNVYNAGKDVAEMSNVSELDSGLIRYQEEESISPYHFEEICSKFVTRDSNRD 1905

Query: 1361 DIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVP----PSGNEVSMEDQTSVPKSQQKLSD 1194
             +K IGL G+NG   FLP SS   ND  L          G+ V     T    + + L+ 
Sbjct: 1906 LVKEIGLIGSNGTPTFLP-SSTFLNDSMLISATCNKLDGGDSVDQVIFTGSEANDEGLNS 1964

Query: 1193 D--LAKTASVVNDKSSGIEKGKGKVSEVECMKSRVLCERGRLXXXXXXXXXSTARRSIIR 1020
            +  ++    V ND    + K  G    +   K++    RG                 ++ 
Sbjct: 1965 ESNMSFNRIVTNDLGGPLNKPSGLSFGLSDQKNKKSSGRG------------LEGCCVVP 2012

Query: 1019 ESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRRCAWRTFVKCANTLYEM 840
            +SSL+   G A  V R LK +MLD+D ALPE ALR S+SH DRR AWR FVK A +++E+
Sbjct: 2013 QSSLKRITGKALSVFRFLKTNMLDMDVALPEEALRPSKSHPDRRRAWRAFVKSAQSIFEL 2072

Query: 839  MLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFTLDSAILYEKSSTDDTA 660
            + A I++ED I+T+YLKN WWYWSS SAA+ + TLSAL++R+F+LD+AILYEK       
Sbjct: 2073 VQAAIVVEDMIKTEYLKNEWWYWSSLSAAAKISTLSALSVRLFSLDAAILYEK------- 2125

Query: 659  GTSIPVSQPDKEASSSAIPTTEMKSAEQPMQKVNDSDSGEN 537
                P++Q D +  +  I   + +S  QP+    +  S  N
Sbjct: 2126 ----PINQSDPKDETKTISLPDQRS--QPVSDPQERSSRSN 2160


>ref|XP_006296811.1| hypothetical protein CARUB_v10012794mg [Capsella rubella]
            gi|482565520|gb|EOA29709.1| hypothetical protein
            CARUB_v10012794mg [Capsella rubella]
          Length = 2177

 Score =  461 bits (1185), Expect = e-126
 Identities = 311/920 (33%), Positives = 474/920 (51%), Gaps = 73/920 (7%)
 Frame = -3

Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982
            E +KE  +++  +N   LPKAPW+EG+CKVCG+DKDD +VLLCDTCD+EYHTYCLNPPL 
Sbjct: 1268 EMKKEIKDIIVSIN--KLPKAPWDEGVCKVCGVDKDDDSVLLCDTCDAEYHTYCLNPPLI 1325

Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802
            ++PDGNWYCPSCV    ++++A    ++V +   R+YQ +L   ++E  A LA  ME K+
Sbjct: 1326 RIPDGNWYCPSCVIAKRMAQEALESYKLVRRRKGRKYQGELTQASMEMTAHLAGVMEEKD 1385

Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEES 2622
            YWEF+VEERILLLK   DE ++S+ +  H++QCA   +++ Q+LRSL+SE K  K+++E 
Sbjct: 1386 YWEFSVEERILLLKVLCDELLSSSLVHQHLEQCAEAIIEMQQKLRSLSSEWKNAKMRQEF 1445

Query: 2621 LAANVAKMKGNVHTGGGELASVLADESQLPVDNKVSSFSGGSVPMDGGPHT--------- 2469
            L A +AK++ ++     EL +      Q+  D +     G  V  D    +         
Sbjct: 1446 LMAKLAKVEPSILKEASELHNSSHFADQMGCDERTHEGVGDGVTHDDETSSTAFLNKNQG 1505

Query: 2468 KDQVSILRSDVNLHPKLGD---TXXXXXXXXXXXQMLGYGLSNTVSKNI-----TVHASS 2313
            K  +       +LH   G    +           ++L   +S   + N+     T+H S 
Sbjct: 1506 KAPLETNSQPGDLHVDSGGNKVSSQKKITSPGRHELLVADISPRATDNLTFEKDTLHKSV 1565

Query: 2312 FPGHQ----YSNQPNANSLLDYNAELSKLQQ-----------EISGLQESISTAESELLK 2178
               H+    +SN     S+ D +++ S+  Q           EI  LQ SI + ES+LLK
Sbjct: 1566 GRIHETHPLHSNAVELQSVHDASSQASQELQACQQDLNATSNEIQNLQLSIRSVESQLLK 1625

Query: 2177 VSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGSLNPESEF---------------GLDN 2043
             S+R++FLG DS GRLYW     D +P ++ +GS++ +                   +D+
Sbjct: 1626 QSIRRDFLGNDSSGRLYWGCCFPDENPRVLVDGSISLQKPVQANLTGSRAPSPFLQAVDH 1685

Query: 2042 HFPKSSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQSNKSLNSNYQRNDFLKGK 1863
                 S W YY+T +EI ELV+WL D D +ERDL+ESIL W+         +  D  K K
Sbjct: 1686 GRLTLSPWTYYETESEISELVQWLHDDDPKERDLRESILCWK-------RLRFGDVQKEK 1738

Query: 1862 PSSSVISSEQRVPNHNCRVLKGVRALEKKFGLGMDMGADYINKNLEHNHEMTYQGRMCRC 1683
             ++  +SS          V K   ++EK+FG  + +  + + K          + + CRC
Sbjct: 1739 ENAENLSSP---IFSRGLVTKAAMSMEKRFGPCIKLETETLKK--RGKKTKVEREKFCRC 1793

Query: 1682 VCLELIWPSRHHCFSCHQTFSASEELDQHGE-KC--RAAATGAAGSL----KHDKTVMNQ 1524
             CLE I PS  HC  CH+TF++ +E + H E KC   + AT     +    K  +++ + 
Sbjct: 1794 ECLEAILPSMIHCLICHKTFASDDEFENHSESKCIPYSLATEEGKEISDFSKAKESLKSD 1853

Query: 1523 GIRICPGSSNIPQSVLNEKHDTQSNCVK-XXXXXXXXXXXXEIMANFKVDYSIEEDIKGI 1347
             + +   S+    S ++   +  S  ++             EI + F    S  + +K I
Sbjct: 1854 YLNV-KSSAGKDVSEISNVSELDSGLIRYQEEESISPYHFEEICSKFVTKDSNRDLVKDI 1912

Query: 1346 GLFGTNGVVPFLPNSSPCFNDPALTLVPPS--------------GNEVSMEDQTSV--PK 1215
            GL G+NG+  FLP+S    ND  L     S              G+E ++E   S     
Sbjct: 1913 GLIGSNGIPTFLPSSYTHLNDSMLISANSSKLDGDDSGDQVVFAGSETNVEGLNSEFNMS 1972

Query: 1214 SQQKLSDDLAKTASVVNDKSSGIEKGKGKVSEVECMKSRVLCERGRLXXXXXXXXXSTAR 1035
              + ++ DL    S  +    G  + K K S    +KS   C                  
Sbjct: 1973 FDRSVTHDLGGPPSKPSGLGFGFSEQKIKKSLGSGLKS---C------------------ 2011

Query: 1034 RSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRRCAWRTFVKCAN 855
              ++ ++SL+   G A  V R LK ++LD+D ALPE  LR S+SH  RR AWR FVK + 
Sbjct: 2012 -CVVPQASLKRITGKALPVFRFLKTNLLDMDVALPEEGLRPSKSHPGRRRAWRLFVKSSQ 2070

Query: 854  TLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFTLDSAILYEK-- 681
            ++YE++ A ++LED ++T+YLKN WWYWSS SAA+ + TLSAL++RIF LD+AI+Y+K  
Sbjct: 2071 SIYELVQATVVLEDMVKTEYLKNEWWYWSSLSAAAKISTLSALSVRIFALDAAIMYDKLL 2130

Query: 680  SSTDDTAGTSIPVSQPDKEA 621
            + +D    T   +S PD+++
Sbjct: 2131 TPSDPIDETKPIISLPDQKS 2150


>ref|XP_006646998.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like [Oryza
            brachyantha]
          Length = 1852

 Score =  457 bits (1177), Expect = e-125
 Identities = 322/926 (34%), Positives = 463/926 (50%), Gaps = 91/926 (9%)
 Frame = -3

Query: 3167 GTESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPP 2988
            G+E  +E  ++L   N  SLPKAPWE+G+CKVCG+D+DD +VLLCD CDSEYHTYCLNPP
Sbjct: 955  GSEMHEELHDILTASN--SLPKAPWEDGVCKVCGIDRDDDSVLLCDKCDSEYHTYCLNPP 1012

Query: 2987 LGKVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETL----EGLAQLAN 2820
            L ++P+GNWYCPSC+ G   +     G Q V     +R Q+K + E      E L +L  
Sbjct: 1013 LARIPEGNWYCPSCMLGQKKAH-LDQGAQDV-----KRQQKKFVGEEAHAFQEELNKLVT 1066

Query: 2819 KMELKEYWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKIL 2640
             ME KEYW+  ++ERI LLKF  DE +N+A IR+H+DQC+    DL Q+ RS N E K L
Sbjct: 1067 AMEEKEYWDLRIQERIYLLKFLCDEMLNTALIREHLDQCSDKLGDLQQKFRSSNFELKDL 1126

Query: 2639 KLKEESLAANVAKMKGNV-----------------------HTGGGELASVLADESQLPV 2529
            K KEE   ++  + + +                        H   GEL +V  +     +
Sbjct: 1127 KYKEEIRTSHARQSRSSKTEQHFSNISGPVENQQCTPKALDHLEEGELGNVGVN-----L 1181

Query: 2528 DNKVSSFSGGSVPMDGGPHTKDQVSILRSDVNLHPKLGDTXXXXXXXXXXXQ-------- 2373
            +N       G + + G PH  DQ     S V  H  LG +                    
Sbjct: 1182 NNPADGVRDGQLNV-GRPHKSDQDISSTSMVEEHKSLGLSEQPSGMAIDQIDGDAIDEGS 1240

Query: 2372 --------MLGYGLSNTVSKNITVHASSFPGHQYSNQPNANSLLDY-------------- 2259
                     LG   S   + N+    +S PG    ++  + S  D               
Sbjct: 1241 QTQSCEKRPLGVKSSTCDNLNLRETETSTPGRDLPDENASASFQDNLEASTTKSMEFDAD 1300

Query: 2258 NAELSKLQQEISGLQESISTAESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANG 2079
            N E+  L  +IS LQ+SIS  ES++   S R+E LG+DS GRLYWV G     PW+VA+G
Sbjct: 1301 NNEMDTLSDDISKLQDSISLLESQINMASSRRECLGKDSIGRLYWVIGRPGKHPWLVADG 1360

Query: 2078 SL--NPESEFGLDNHFP---------KSSSWMYYDTVAEIGELVKWLDDCDIRERDLKES 1932
            S+  + E +  + N +P          S+S   Y++  EI  LV WL D D RE++LK+S
Sbjct: 1361 SMLISKERDISMVNSYPLSAFDCRGWNSASIFIYESDEEIQCLVDWLRDYDPREKELKDS 1420

Query: 1931 ILQWQSNKSLNSNYQRNDFLKGKPSSSVISSEQRV--PNHNCRVLKGVRALEKKFGLGMD 1758
            ILQWQ +      +Q +  L   P S+   SEQ +  P     VL     LE+K+GL +D
Sbjct: 1421 ILQWQRHLC----HQSSSPLIDPPVSNFSKSEQLIDLPRTKASVL-----LEQKYGLQLD 1471

Query: 1757 MGADYINKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGE-KCR 1581
                 ++K      ++  + R  RC CLE IWPSR+HC  CH+T+    E + H + KC 
Sbjct: 1472 QDTSDLSKKRGKKVKLGSEERTYRCDCLEPIWPSRNHCLICHETYLVYTEFEGHNDGKCS 1531

Query: 1580 AAATGAAGSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSNCV----KXXXXXXXXX 1413
                    S ++D++ +            +P+S + EK     + V              
Sbjct: 1532 KIHQSPDESKENDESKVK-----------VPKSDMKEKDSLDRSSVIEPSSDRKFMQCPY 1580

Query: 1412 XXXEIMANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSMED 1233
               EI   F  + S +E +K IGL G+NGV  F+P S   F +PA+ L   +  +  + D
Sbjct: 1581 DFEEICRKFITNDSNKETVKQIGLNGSNGVPSFVP-SPAFFLEPAIVL-NQNRKDGELND 1638

Query: 1232 QTSVPK-----SQQKLSDDLAKTASVV--NDKSSGIEKGK--------GKVSEVECMK-S 1101
             TS  +     S QKL  +++K+A +   N     ++K K        G+ +     K +
Sbjct: 1639 WTSCLEECNAMSAQKLGQEVSKSAQICPGNMGDEKVQKSKKPTPDNTSGEEAHSTTGKPT 1698

Query: 1100 RVLCERGRLXXXXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAA 921
            RVL   G L                + ESSLRP +G  + +L+  KI++LDI+A LPE A
Sbjct: 1699 RVLAVNGGL----------------VPESSLRPVLGRNSHILKQQKINLLDIEATLPEEA 1742

Query: 920  LRASRSHLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMP 741
            LRAS+S   RR +WR FVK A+++ +M+LA  +LE  ++ ++LKN+WWYWSS +AA    
Sbjct: 1743 LRASKSQQIRRRSWRAFVKDADSISQMVLAANLLEGMVKAEFLKNDWWYWSSFTAAMKTS 1802

Query: 740  TLSALALRIFTLDSAILYEKSSTDDT 663
            T+S+LALRI+TLD  I+Y K    ++
Sbjct: 1803 TVSSLALRIYTLDDCIIYSKDQVSNS 1828


>ref|NP_186795.1| methyl-CPG-binding domain 9 [Arabidopsis thaliana]
            gi|75337201|sp|Q9SGH2.1|MBD9_ARATH RecName:
            Full=Methyl-CpG-binding domain-containing protein 9;
            Short=AtMBD9; Short=MBD09; AltName: Full=Histone acetyl
            transferase MBD9; AltName: Full=Methyl-CpG-binding
            protein MBD9 gi|6692266|gb|AAF24616.1|AC010870_9 unknown
            protein [Arabidopsis thaliana]
            gi|332640148|gb|AEE73669.1| methyl-CPG-binding domain 9
            [Arabidopsis thaliana]
          Length = 2176

 Score =  457 bits (1175), Expect = e-125
 Identities = 312/923 (33%), Positives = 480/923 (52%), Gaps = 76/923 (8%)
 Frame = -3

Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982
            E +KE  +++  VN   LPKAPW+EG+CKVCG+DKDD +VLLCDTCD+EYHTYCLNPPL 
Sbjct: 1265 EMKKEIKDIVVSVN--KLPKAPWDEGVCKVCGVDKDDDSVLLCDTCDAEYHTYCLNPPLI 1322

Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802
            ++PDGNWYCPSCV    ++++A    ++V +   R+YQ +L   ++E  A LA+ ME K+
Sbjct: 1323 RIPDGNWYCPSCVIAKRMAQEALESYKLVRRRKGRKYQGELTRASMELTAHLADVMEEKD 1382

Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEES 2622
            YWEF+ EERILLLK   DE ++S+ +  H++QCA   +++ Q+LRSL+SE K  K+++E 
Sbjct: 1383 YWEFSAEERILLLKLLCDELLSSSLVHQHLEQCAEAIIEMQQKLRSLSSEWKNAKMRQEF 1442

Query: 2621 LAANVAKMKGNVHTGGGELASVLADESQLPVDNKVSSFSGGSVPMDGGPHT-----KDQ- 2460
            L A +AK++ ++    GE  +      Q+  D +     G  V  D    +     K+Q 
Sbjct: 1443 LTAKLAKVEPSILKEVGEPHNSSYFADQMGCDPQPQEGVGDGVTRDDETSSTAYLNKNQG 1502

Query: 2459 VSILRSDV---NLHPKLGDTXXXXXXXXXXXQMLGYGLSNT---VSKNI-------TVHA 2319
             S L +D      H   G++                 +++T   V+ N+       T+  
Sbjct: 1503 KSPLETDTQPGESHVNFGESKISSPETISSPGRHELPIADTSPLVTDNLPEKDTSETLLK 1562

Query: 2318 SSFPGHQYSNQPNANS----------------LLDYNAELSKLQQEISGLQESISTAESE 2187
            S    H+ ++ PN+N+                L     +LS    EI  LQ+SI + ES+
Sbjct: 1563 SVGRNHE-THSPNSNAVELPTAHDASSQASQELQACQQDLSATSNEIQNLQQSIRSIESQ 1621

Query: 2186 LLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGSLNPESE---------------FG 2052
            LLK S+R++FLG D+ GRLYW     D +P I+ +GS++ +                   
Sbjct: 1622 LLKQSIRRDFLGTDASGRLYWGCCFPDENPRILVDGSISLQKPVQADLIGSKVPSPFLHT 1681

Query: 2051 LDNHFPKSSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQSNKSLNSNYQRNDFL 1872
            +D+   + S W YY+T  EI ELV+WL D D++ERDL+ESIL W+         +  D  
Sbjct: 1682 VDHGRLRLSPWTYYETETEISELVQWLHDDDLKERDLRESILWWK-------RLRYGDVQ 1734

Query: 1871 KGKPSSSVISSEQRVPNHNCRVLKGVRALEKKFGLGMDMGADYINKNLEHNHEMTYQGRM 1692
            K K  +  +S+            K   ++EK++G  + +  + + K  +   ++  + ++
Sbjct: 1735 KEKKQAQNLSAPVFATGLE---TKAAMSMEKRYGPCIKLEMETLKKRGKKT-KVAEREKL 1790

Query: 1691 CRCVCLELIWPSRHHCFSCHQTFSASEELDQHGE-KC------RAAATGAAGSLKHDKTV 1533
            CRC CLE I PS  HC  CH+TF++ +E + H E KC             + S K  +++
Sbjct: 1791 CRCECLESILPSMIHCLICHKTFASDDEFEDHTESKCIPYSLATEEGKDISDSSKAKESL 1850

Query: 1532 MNQGIRICPGSSNIPQSVLNEKHDTQSNCVKXXXXXXXXXXXXEIMANFKVDYSIEED-I 1356
             +  + +   S+    + ++   +  S  ++            E + +  V      D +
Sbjct: 1851 KSDYLNV-KSSAGKDVAEISNVSELDSGLIRYQEEESISPYHFEEICSKFVTKDCNRDLV 1909

Query: 1355 KGIGLFGTNGVVPFLPNSSPCFNDPALTLVP-------PSGNEV---SMEDQTSVPKSQQ 1206
            K IGL  +NG+  FLP+SS   ND  L            SG++V     E       S+ 
Sbjct: 1910 KEIGLISSNGIPTFLPSSSTHLNDSVLISAKSNKPDGGDSGDQVIFAGPETNVEGLNSES 1969

Query: 1205 KLSDDLAKTASVVN--DKSSGIEKG----KGKVSEVECMKSRVLCERGRLXXXXXXXXXS 1044
             +S D + T S     DK SG+  G    K K S    +KS   C               
Sbjct: 1970 NMSFDRSVTDSHGGPLDKPSGLGFGFSEQKNKKSSGSGLKS---C--------------- 2011

Query: 1043 TARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRRCAWRTFVK 864
                 ++ +++L+   G A    R LK ++LD+D ALPE ALR S+SH +RR AWR FVK
Sbjct: 2012 ----CVVPQAALKRVTGKALPGFRFLKTNLLDMDVALPEEALRPSKSHPNRRRAWRVFVK 2067

Query: 863  CANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFTLDSAILYE 684
             + ++YE++ A I++ED I+T+YLKN WWYWSS SAA+ + TLSAL++RIF+LD+AI+Y+
Sbjct: 2068 SSQSIYELVQATIVVEDMIKTEYLKNEWWYWSSLSAAAKISTLSALSVRIFSLDAAIIYD 2127

Query: 683  KSSTDDTA--GTSIPVSQPDKEA 621
            K  T       T   +S PD+++
Sbjct: 2128 KPITPSNPIDETKPIISLPDQKS 2150


>gb|ESW23089.1| hypothetical protein PHAVU_004G017600g [Phaseolus vulgaris]
          Length = 2204

 Score =  448 bits (1152), Expect = e-123
 Identities = 311/970 (32%), Positives = 467/970 (48%), Gaps = 99/970 (10%)
 Frame = -3

Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982
            E +KE D+ +  + ++  PKAPW+EG+CKVCG+D+DD +VLLCDTCD+EYHTYCLNPPL 
Sbjct: 1254 EMRKEVDDFIESMKET--PKAPWDEGVCKVCGIDRDDDSVLLCDTCDAEYHTYCLNPPLA 1311

Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802
            ++P+GNWYCPSCV G   ++D    TQV+ +C  +++Q ++    LE L  L+  +E KE
Sbjct: 1312 RIPEGNWYCPSCVDGKHATQDVTERTQVIGKCRSKKFQGEVNSLFLESLTHLSTVIEEKE 1371

Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLR-------SLNSERKI 2643
            YWE ++ ER  LLKF  DE +NS+ IR H++QC+ +  +LHQ+LR       +L +   I
Sbjct: 1372 YWEHSLGERTFLLKFLCDELLNSSMIRQHLEQCSELSAELHQKLRAHSAEWKNLKTREDI 1431

Query: 2642 LKLKEESLAANVAKMKGNVHTGGGELASVLADESQLPVD--NKVSSFSGGSVPMDGGPH- 2472
            L  K   +        G V    G + ++L +  +  V     V + S   V +D  P  
Sbjct: 1432 LSTKAAKIDTFSLNTAGEVGLREG-VTTLLTNTGKCLVQPHTAVDNPSNFGVFVDSLPSE 1490

Query: 2471 --TKDQVSILRSDVNLHPKLGDTXXXXXXXXXXXQMLGYGLSNTVSKNITVHASSFPGHQ 2298
              TK++      D ++     D+                      S++      SFP   
Sbjct: 1491 ETTKEKYRFDSVDKSMSVTNSDSDSQNMNSLDVEGQFRNVSGAVESQSTDKSPKSFPSPN 1550

Query: 2297 YSNQPNA---------------------------------------NSLLDYNAELSKLQ 2235
             S + N                                        N    Y+ EL+ ++
Sbjct: 1551 LSQEINGSGGAAHAQSNHQKCEGRDISTPVTCQQGGVTVDASHTALNESEPYHLELNAIK 1610

Query: 2234 QEISGLQESISTAESELLKVSLRKEFLGRDSYGRLYWVFGS--------YDAS------- 2100
            ++IS LQ+SI++  S+LL++S+R+EFLG DS GRLYW             DAS       
Sbjct: 1611 RDISVLQDSITSVVSQLLRLSVRREFLGIDSIGRLYWASTLPGGRSRIVVDASAALLHGR 1670

Query: 2099 --PW---------IVANGSLNPESEFGLDNHFPKSSSWMYYDTVAEIGELVKWLDDCDIR 1953
              P+         ++ + SL+ +    L N    SS W+ Y+T AEI EL+ WLDD D +
Sbjct: 1671 GIPFSRDYVEKFSVLQHSSLSEKDSSQLRNALANSSPWIAYETDAEIEELLGWLDDSDPK 1730

Query: 1952 ERDLKESILQWQSNKS---LNSNYQRNDFLKGKPSSSVISSEQRVPNHNCRVLKGVRALE 1782
            ER+LK+SI+Q   ++    LN+  +     +G P S  I+ E+ V +    V K    LE
Sbjct: 1731 ERELKDSIMQGPRSRFQEFLNAQTEEQVEDRG-PISMPINREKTVSSS--LVTKATSLLE 1787

Query: 1781 KKFGLGMDMGADYINKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELD 1602
            KK+G   +   +   K  + +   T   ++ RC CLE IW  R HC  CH+T S+  E D
Sbjct: 1788 KKYGPFFEWDIEMSRKQNKKSRT-TNDEKLFRCECLEPIWFDRRHCTYCHKTVSSDGEFD 1846

Query: 1601 QHGE-KCRAAATGAAGSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSN-------- 1449
             H + KC A    A  +           I  C G  N+      EK    +         
Sbjct: 1847 GHNDGKCNAGLPVAEKN--------RNKIGSCKGKGNLRCDTSREKFRADAETAGTKVGG 1898

Query: 1448 CVKXXXXXXXXXXXXE--------IMANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPC 1293
            C K                     I + F+   S  E +K IGL GT+G+  F+P+ SP 
Sbjct: 1899 CSKLSSRLIKFSNEESTCPFNFEDICSKFETSESNRELVKEIGLIGTDGIPSFVPSVSPL 1958

Query: 1292 FNDPALTLVPPSGNEVSMEDQTSVPKSQQKLSDDLAKTASVVNDKSSGIEKGKGKVSEVE 1113
             ++      P     + +  + +  +  Q  +D     A    D +SGI  G+   +E+ 
Sbjct: 1959 VSEYTRFSTPKDDAIIGVLSKPTETRGSQGNTDG----AGACLDHNSGISTGRLAANEIN 2014

Query: 1112 CMKSRVLCER--GRLXXXXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDA 939
                    E+  G+                ++  SSL+P VG  + +LR LKI++LD+DA
Sbjct: 2015 KSNKSSSGEQRDGKFSFCGPASDMGVDGCCVVPLSSLKPLVGKVSHILRQLKINLLDMDA 2074

Query: 938  ALPEAALRASRSHLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPS 759
            ALP +ALR S++  +RR AWR FVK A T+YEM+ A   LED I+T+YL+N+WWYWSS S
Sbjct: 2075 ALPASALRPSKAESERRQAWRAFVKSAETIYEMIQATFTLEDMIKTEYLRNDWWYWSSFS 2134

Query: 758  AASNMPTLSALALRIFTLDSAILYEKSSTDDTAGTSIPVSQPDKEASSSAIPTTEMKSAE 579
            AA+   TL +LALR+++LD AI+YEK+       +S P    +     + + T + K   
Sbjct: 2135 AAAKTSTLPSLALRLYSLDLAIIYEKTPNSTFTDSSEPSGTAETRPPMN-VDTEKSKGNR 2193

Query: 578  QPMQKVNDSD 549
            +  +K  +SD
Sbjct: 2194 KSNRKRKESD 2203


>ref|XP_006594288.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
            [Glycine max]
          Length = 2202

 Score =  443 bits (1140), Expect = e-121
 Identities = 320/977 (32%), Positives = 482/977 (49%), Gaps = 105/977 (10%)
 Frame = -3

Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982
            E +KE  + +   N+  +PKAPW+EG+CKVCG+D+DD +VLLCDTCD+EYHTYCLNPPL 
Sbjct: 1249 EMRKEVGDFIESTNE--IPKAPWDEGVCKVCGIDRDDDSVLLCDTCDAEYHTYCLNPPLA 1306

Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802
            ++P+GNWYCPSCV G   +++    TQV+ +   +++Q ++    LE LA L+  +E KE
Sbjct: 1307 RIPEGNWYCPSCVVGKHATQNVTERTQVIGKRQSKKFQGEVNSLYLESLAHLSAAIEEKE 1366

Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLR-------SLNSERKI 2643
            YWE++V ER  LLKF  DE +NS+ I  H++QCA +  +LHQ+LR       SL +   I
Sbjct: 1367 YWEYSVGERTFLLKFLCDELLNSSLIHQHLEQCAELSAELHQKLRAHSAEWKSLKTREDI 1426

Query: 2642 LKLKEESLAANVAKMKGNVHTGGGELASVLADESQLPVD--NKVSSFSGGSVPMDGGPH- 2472
            L  K   +        G V    G  AS+L++  +  V     V + S   V +D  P  
Sbjct: 1427 LSTKAAKIDTFSLNTAGEVGLKEG-FASLLSNTGKCLVQPHTAVDNPSNFGVFVDSLPSE 1485

Query: 2471 --TKDQ---------VSILRSD--------VNLHPKLGDTXXXXXXXXXXXQMLGYGLSN 2349
              TKD+         +S+  SD        +++  +  +                + L N
Sbjct: 1486 EVTKDKYRFDSVDKSISVTNSDSDSQNMNSIDVEGQFRNVSGAVESQCTDKSPKSFPLPN 1545

Query: 2348 TV--------------SKNITVHASSFP---GHQYSN-----QPNANSLLDYNAELSKLQ 2235
             +               KN        P    +Q        Q + N    Y+ EL  ++
Sbjct: 1546 HMPQETNGAGGASLVQGKNQKCEGKDIPTPVSYQQGMPVDVPQISVNESEPYHLELIAIK 1605

Query: 2234 QEISGLQESISTAESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGSLNPESEF 2055
            ++IS LQ+SI++  S+LLK+S+R+E LG DS GRLYW          IV + S       
Sbjct: 1606 RDISLLQDSITSVASQLLKLSVRRECLGIDSIGRLYWASALPGGRSRIVVDASAALLHGR 1665

Query: 2054 GL-----------------------------DNHFPKSSSWMYYDTVAEIGELVKWLDDC 1962
            G+                              N    SS W+ Y+T  EI EL+ WLDD 
Sbjct: 1666 GMTFSRDYVEKFSVLQHCALSDKDSSLMSQPSNPLGNSSPWIAYETDVEIEELLGWLDDS 1725

Query: 1961 DIRERDLKESILQWQSNKSLNS-NYQRNDFLKGKPSSSVISSEQRVPNHNCRVLKGVRAL 1785
            D +ER+LK+SI+    ++     N Q  D  K + + S+  + ++  + N  V K    L
Sbjct: 1726 DPKERELKDSIMLGPKSRFQQFINAQTEDRAKDQGNVSMPRNREKTVS-NSLVTKATSLL 1784

Query: 1784 EKKFGLGMDMGADYINKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEEL 1605
            EKKFG  ++     + K        T   ++ RC CLE I PSR HC  CH+T ++  E 
Sbjct: 1785 EKKFGPFVEWDNSEVLKKQNRKTRTTNDEKLYRCECLEPILPSRKHCTHCHKTVASDIEF 1844

Query: 1604 DQHGE-KCRAA------------ATGAAGSLKHD------KTVMNQGIRICPGSSNIPQS 1482
            D H + KC A             ++   G+LK D      +      +    GSS +   
Sbjct: 1845 DGHNDGKCNAGLLAIEKNKDKNGSSKGRGNLKCDTLHEKFRADAETALTSVSGSSKLSSR 1904

Query: 1481 VLNEKHDTQSNCVKXXXXXXXXXXXXEIMANFKVDYSIEEDIKGIGLFGTNGVVPFLPNS 1302
            ++   ++ +S C              +I + F  + S +E +  IGL G++G+  F+P+ 
Sbjct: 1905 LIKFSNE-ESTC---------PFNFEDICSKFVTNDSNKELVSEIGLIGSDGIPSFVPSV 1954

Query: 1301 SPCFNDPALTLVPPSGNEVSMEDQTSVPKSQQKLSDDLAKTASVVNDKSSGIEKGKGKVS 1122
            SP  ++  L+    +  + S+    S+  S+ ++S      A    D  SGI  GK   +
Sbjct: 1955 SPFVSEYTLS----AQKDESIVGGVSIV-SESRVSQGNTDGAGTCLDHKSGISTGKLAAN 2009

Query: 1121 EVECMKSRVLCER--GRLXXXXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLD 948
            E        L E+  G+                ++   SLRP VG A+ +LR LKI++LD
Sbjct: 2010 ESNKSNKSSLREQRDGKFSFCSPASVMGADGCCVVPSPSLRPLVGKASHILRQLKINLLD 2069

Query: 947  IDAALPEAALRASRSHLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWS 768
            +DAAL   ALR S++  DRR AWRTFVK A T+YEM+ A   LED I+T+YL+N+WWYWS
Sbjct: 2070 MDAALLAIALRPSKAVPDRRQAWRTFVKSAKTIYEMIQATFTLEDMIKTEYLRNDWWYWS 2129

Query: 767  SPSAASNMPTLSALALRIFTLDSAILYEK---SSTDDTAGTSIPVSQPDKEASSSAIPTT 597
            S SAA+   TL +LALRI++LD AI+YEK   SS  D++  S+ +++P    +   + T 
Sbjct: 2130 SFSAAAKSSTLPSLALRIYSLDLAIIYEKMPNSSFTDSSEPSV-IAEPKPLMN---VDTE 2185

Query: 596  EMKSAEQPMQKVNDSDS 546
            + K++ +  +K  +SDS
Sbjct: 2186 KSKASRKSTRKRKESDS 2202


>ref|NP_001046163.1| Os02g0192400 [Oryza sativa Japonica Group]
            gi|46389826|dbj|BAD15389.1| PHD finger-like protein
            [Oryza sativa Japonica Group] gi|50726413|dbj|BAD34024.1|
            PHD finger-like protein [Oryza sativa Japonica Group]
            gi|113535694|dbj|BAF08077.1| Os02g0192400 [Oryza sativa
            Japonica Group]
          Length = 929

 Score =  434 bits (1117), Expect = e-119
 Identities = 308/903 (34%), Positives = 444/903 (49%), Gaps = 74/903 (8%)
 Frame = -3

Query: 3167 GTESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPP 2988
            G+E  +E  ++L   N  SLPKAPWE+G+CKVCG+D+DD +VLLCD CDSEYHTYCLNPP
Sbjct: 34   GSEMHEELHDILTAAN--SLPKAPWEDGVCKVCGIDRDDDSVLLCDKCDSEYHTYCLNPP 91

Query: 2987 LGKVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETL----EGLAQLAN 2820
            L ++P+GNWYCPSC+ G + +     G Q V     +R Q+K + E      E L +LA 
Sbjct: 92   LARIPEGNWYCPSCMLGQTKAHH-DQGVQDV-----KRQQKKFVGEEAHAFQEELNKLAT 145

Query: 2819 KMELKEYWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKIL 2640
             ME KEYW+  ++ERI LLKF  DE +N+A IR+H+DQC+    DL Q+ R+ N E K L
Sbjct: 146  AMEEKEYWDLNMQERIYLLKFLCDEMLNTALIREHLDQCSDKLGDLQQKFRASNFELKDL 205

Query: 2639 KLKEE---SLAANVAKMKGNVHTGGGELASVLADESQLPVDNKVSSFSGGSV------PM 2487
            K KEE   S A      K   H            +      + +     G+V      P 
Sbjct: 206  KYKEEMRTSYARQSRSSKTEQHFNNSSGPVENQQQCTPTALDHLEEAEQGNVGVNLNNPA 265

Query: 2486 DGGP-----------HTKDQVSILRSDVNLHPKLGDTXXXXXXXXXXXQMLGYGLSNTVS 2340
            DG P             KD  S    +      L +              +  G  +   
Sbjct: 266  DGVPDGQLNVGKPYKSDKDISSASMVEERKSSGLSEQPSGMAIDQIDGDAIDEGSQSCEK 325

Query: 2339 KNITVHAS------------SFPGHQYSNQPNANSLLDYNAELSK--------------- 2241
            +++   +S            S PG +  ++  + S  D N E S                
Sbjct: 326  RSLGAKSSTCDNLNLKDTEFSTPGRELPDERASTSFQD-NLEASSTKSIELDADNNEMDT 384

Query: 2240 LQQEISGLQESISTAESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGS-LNP- 2067
            L  +IS LQ+SIS  ES++   S R+E LG+DS GRLYWV G     PW+VA+GS L P 
Sbjct: 385  LSDDISKLQDSISLLESQINMASSRRECLGKDSIGRLYWVIGRPGKRPWLVADGSMLKPK 444

Query: 2066 ESEFGLDNHFP---------KSSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQS 1914
            E +  + N +P          S+S   Y++  EI  L+ WL D D RE++LK+SILQWQ 
Sbjct: 445  ERDISMVNSYPPSAFDCKGWNSASIFIYESDEEIQCLLDWLRDYDPREKELKDSILQWQR 504

Query: 1913 NKSLNSNYQRNDFLKGKPSSSVISSEQ--RVPNHNCRVLKGVRALEKKFGLGMDMGADYI 1740
            +    S+    D     P  S    EQ   +PN    V+     LE+K+GL +D     +
Sbjct: 505  HFCHQSSSPLVD-----PPISGPKGEQLMELPNTKAAVI-----LEQKYGLQLDQDTSDL 554

Query: 1739 NKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGE-KCRAAATGA 1563
             K      +++ + R  RC CLE +WPSR+HC +CH+T+  S E + H + KC       
Sbjct: 555  PKKRGKKIKLSSEDRTYRCDCLEPVWPSRYHCLTCHETYLISTEFEGHNDGKCSKIHQSP 614

Query: 1562 AGSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSNCV----KXXXXXXXXXXXXEIM 1395
              S ++D+  +            + +S   EK   + + V                 EI 
Sbjct: 615  DESRENDEPKV-----------KVTKSDTKEKDSLECSSVIEPSSDRKLMQCPYDFEEIC 663

Query: 1394 ANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSMEDQTSVPK 1215
              F  + S +E +K IGL G+NGV  F+P S   F +PA+ +   +  +  ++D TS  +
Sbjct: 664  RKFVTNDSNKETVKQIGLNGSNGVPSFVP-SPAFFLEPAI-VQSQNRKDDELKDWTSSLE 721

Query: 1214 SQQKLSDDLAKTASVVNDKSSGIEKGKGKVSEVECMKSRVLCERGRLXXXXXXXXXSTAR 1035
                +S        +V + S   +   G V + +  KS+                    R
Sbjct: 722  ECNAMS-----AQKLVQEVSKSGQSCPGNVGDEKVQKSKKPTPDNTSGEEAHSTTGKPTR 776

Query: 1034 -----RSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRRCAWRTF 870
                   ++ ESSLRP +G  + +L+  KI++LDI+AALPE ALRAS+    RR +WR F
Sbjct: 777  LLAVNGGLVPESSLRPLIGRNSHILKQQKINLLDIEAALPEEALRASKCQQIRRRSWRAF 836

Query: 869  VKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFTLDSAIL 690
            VK A ++ +M+LA  +LE  I+ ++LKN+WWYWSS +AA    T+S+LALR++TLD  I+
Sbjct: 837  VKDAESISQMVLAANLLEGMIKAEFLKNDWWYWSSFTAAMKTSTVSSLALRVYTLDDCII 896

Query: 689  YEK 681
            Y K
Sbjct: 897  YSK 899


>gb|EEE56485.1| hypothetical protein OsJ_05715 [Oryza sativa Japonica Group]
          Length = 1949

 Score =  434 bits (1117), Expect = e-119
 Identities = 308/903 (34%), Positives = 444/903 (49%), Gaps = 74/903 (8%)
 Frame = -3

Query: 3167 GTESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPP 2988
            G+E  +E  ++L   N  SLPKAPWE+G+CKVCG+D+DD +VLLCD CDSEYHTYCLNPP
Sbjct: 1054 GSEMHEELHDILTAAN--SLPKAPWEDGVCKVCGIDRDDDSVLLCDKCDSEYHTYCLNPP 1111

Query: 2987 LGKVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETL----EGLAQLAN 2820
            L ++P+GNWYCPSC+ G + +     G Q V     +R Q+K + E      E L +LA 
Sbjct: 1112 LARIPEGNWYCPSCMLGQTKAHH-DQGVQDV-----KRQQKKFVGEEAHAFQEELNKLAT 1165

Query: 2819 KMELKEYWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKIL 2640
             ME KEYW+  ++ERI LLKF  DE +N+A IR+H+DQC+    DL Q+ R+ N E K L
Sbjct: 1166 AMEEKEYWDLNMQERIYLLKFLCDEMLNTALIREHLDQCSDKLGDLQQKFRASNFELKDL 1225

Query: 2639 KLKEE---SLAANVAKMKGNVHTGGGELASVLADESQLPVDNKVSSFSGGSV------PM 2487
            K KEE   S A      K   H            +      + +     G+V      P 
Sbjct: 1226 KYKEEMRTSYARQSRSSKTEQHFNNSSGPVENQQQCTPTALDHLEEAEQGNVGVNLNNPA 1285

Query: 2486 DGGP-----------HTKDQVSILRSDVNLHPKLGDTXXXXXXXXXXXQMLGYGLSNTVS 2340
            DG P             KD  S    +      L +              +  G  +   
Sbjct: 1286 DGVPDGQLNVGKPYKSDKDISSASMVEERKSSGLSEQPSGMAIDQIDGDAIDEGSQSCEK 1345

Query: 2339 KNITVHAS------------SFPGHQYSNQPNANSLLDYNAELSK--------------- 2241
            +++   +S            S PG +  ++  + S  D N E S                
Sbjct: 1346 RSLGAKSSTCDNLNLKDTEFSTPGRELPDERASTSFQD-NLEASSTKSIELDADNNEMDT 1404

Query: 2240 LQQEISGLQESISTAESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGS-LNP- 2067
            L  +IS LQ+SIS  ES++   S R+E LG+DS GRLYWV G     PW+VA+GS L P 
Sbjct: 1405 LSDDISKLQDSISLLESQINMASSRRECLGKDSIGRLYWVIGRPGKRPWLVADGSMLKPK 1464

Query: 2066 ESEFGLDNHFP---------KSSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQS 1914
            E +  + N +P          S+S   Y++  EI  L+ WL D D RE++LK+SILQWQ 
Sbjct: 1465 ERDISMVNSYPPSAFDCKGWNSASIFIYESDEEIQCLLDWLRDYDPREKELKDSILQWQR 1524

Query: 1913 NKSLNSNYQRNDFLKGKPSSSVISSEQ--RVPNHNCRVLKGVRALEKKFGLGMDMGADYI 1740
            +    S+    D     P  S    EQ   +PN    V+     LE+K+GL +D     +
Sbjct: 1525 HFCHQSSSPLVD-----PPISGPKGEQLMELPNTKAAVI-----LEQKYGLQLDQDTSDL 1574

Query: 1739 NKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGE-KCRAAATGA 1563
             K      +++ + R  RC CLE +WPSR+HC +CH+T+  S E + H + KC       
Sbjct: 1575 PKKRGKKIKLSSEDRTYRCDCLEPVWPSRYHCLTCHETYLISTEFEGHNDGKCSKIHQSP 1634

Query: 1562 AGSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSNCV----KXXXXXXXXXXXXEIM 1395
              S ++D+  +            + +S   EK   + + V                 EI 
Sbjct: 1635 DESRENDEPKV-----------KVTKSDTKEKDSLECSSVIEPSSDRKLMQCPYDFEEIC 1683

Query: 1394 ANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSMEDQTSVPK 1215
              F  + S +E +K IGL G+NGV  F+P S   F +PA+ +   +  +  ++D TS  +
Sbjct: 1684 RKFVTNDSNKETVKQIGLNGSNGVPSFVP-SPAFFLEPAI-VQSQNRKDDELKDWTSSLE 1741

Query: 1214 SQQKLSDDLAKTASVVNDKSSGIEKGKGKVSEVECMKSRVLCERGRLXXXXXXXXXSTAR 1035
                +S        +V + S   +   G V + +  KS+                    R
Sbjct: 1742 ECNAMS-----AQKLVQEVSKSGQSCPGNVGDEKVQKSKKPTPDNTSGEEAHSTTGKPTR 1796

Query: 1034 -----RSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRRCAWRTF 870
                   ++ ESSLRP +G  + +L+  KI++LDI+AALPE ALRAS+    RR +WR F
Sbjct: 1797 LLAVNGGLVPESSLRPLIGRNSHILKQQKINLLDIEAALPEEALRASKCQQIRRRSWRAF 1856

Query: 869  VKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFTLDSAIL 690
            VK A ++ +M+LA  +LE  I+ ++LKN+WWYWSS +AA    T+S+LALR++TLD  I+
Sbjct: 1857 VKDAESISQMVLAANLLEGMIKAEFLKNDWWYWSSFTAAMKTSTVSSLALRVYTLDDCII 1916

Query: 689  YEK 681
            Y K
Sbjct: 1917 YSK 1919

