BLASTX nr result
ID: Catharanthus22_contig00003477
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00003477 (3169 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002525350.1| DNA binding protein, putative [Ricinus commu...   527   e-146
ref|XP_006365207.1| PREDICTED: methyl-CpG-binding domain-contain...   526   e-146
ref|XP_006483833.1| PREDICTED: methyl-CpG-binding domain-contain...   511   e-142
ref|XP_006446469.1| hypothetical protein CICLE_v10014026mg [Citr...   504   e-140
ref|XP_006470356.1| PREDICTED: methyl-CpG-binding domain-contain...   503   e-139
ref|XP_006470355.1| PREDICTED: methyl-CpG-binding domain-contain...   503   e-139
gb|EXC31622.1| Methyl-CpG-binding domain-containing protein 9 [M...   475   e-131
ref|XP_002517349.1| DNA binding protein, putative [Ricinus commu...   474   e-130
ref|XP_004141185.1| PREDICTED: methyl-CpG-binding domain-contain...   472   e-130
ref|XP_004167238.1| PREDICTED: LOW QUALITY PROTEIN: methyl-CpG-b...   471   e-129
gb|EOY02356.1| Methyl-CpG-binding domain-containing protein 9, p...   470   e-129
ref|XP_002884279.1| methyl-CpG-binding domain 9 [Arabidopsis lyr...   468   e-129
ref|XP_006408507.1| hypothetical protein EUTSA_v10019872mg [Eutr...   467   e-128
ref|XP_006296811.1| hypothetical protein CARUB_v10012794mg [Caps...   461   e-126
ref|XP_006646998.1| PREDICTED: methyl-CpG-binding domain-contain...   457   e-125
ref|NP_186795.1| methyl-CPG-binding domain 9 [Arabidopsis thalia...   457   e-125
gb|ESW23089.1| hypothetical protein PHAVU_004G017600g [Phaseolus...   448   e-123
ref|XP_006594288.1| PREDICTED: methyl-CpG-binding domain-contain...
443 e-121 ref|NP_001046163.1| Os02g0192400 [Oryza sativa Japonica Group] g... 434 e-119 gb|EEE56485.1| hypothetical protein OsJ_05715 [Oryza sativa Japo... 434 e-119 >ref|XP_002525350.1| DNA binding protein, putative [Ricinus communis] gi|223535313|gb|EEF36988.1| DNA binding protein, putative [Ricinus communis] Length = 1794 Score = 527 bits (1358), Expect = e-146 Identities = 340/924 (36%), Positives = 488/924 (52%), Gaps = 57/924 (6%) Frame = -3 Query: 3164 TESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPL 2985 +E++KE +++L + S +PKAPW+EG+CKVCG+DKDD NVLLCD CDS YHTYCLNPPL Sbjct: 893 SEAKKEMEDILE--HASQMPKAPWDEGVCKVCGVDKDDDNVLLCDKCDSGYHTYCLNPPL 950 Query: 2984 GKVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELK 2805 ++P+GNWYCPSC+T + A+ Q V+ C K+R Q + LE LA L ME+ Sbjct: 951 ARIPEGNWYCPSCIT-----QGASQVPQFVSHCRKKRRQGEFTHGVLEALAHLGTTMEIT 1005 Query: 2804 EYWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEE 2625 +YW+++VEERI LLKF DE +NSA IR+H+DQCAS+ DL Q+LRSL+ E + LK KEE Sbjct: 1006 DYWDYSVEERIFLLKFLGDEVLNSANIREHLDQCASVSADLQQKLRSLSMEWRNLKFKEE 1065 Query: 2624 SLAANVAKMKGNVHTGGGELASVLADESQL--PVDNKVSSFSGGSVPMD---GGPHTKDQ 2460 + V K +G +VL + +L ++ S S + ++ GP Sbjct: 1066 LMLNGVGK------SGKEGTTTVLPNYDKLLGQTHSRSSLCSTSFIDLEHLKDGPRFPRT 1119 Query: 2459 VSILRSDVNLHPKLGDTXXXXXXXXXXXQMLGYGLSNTVSKNITVHASSFPGHQYSNQPN 2280 + ++PK G + +S V S Q NQP+ Sbjct: 1120 NDFTKRPCWVYPK------------------GVQVQQPISNGSQVFTISDTECQV-NQPD 1160 Query: 2279 ANSLLDYNAELSKLQQEISGLQESISTAESELLKVSLRKEFLGRDSYGRLYWVFGSYDAS 2100 N L N E ++ + S LQ+S+++ E +L K SLRKEFLGRDS GR+YW F + Sbjct: 1161 VNQLQTSNLESIFIRDKASVLQDSVTSLELQLQKASLRKEFLGRDSAGRVYWAFSRTGSL 1220 Query: 2099 PWIVANG-------SLNPESEFGLDNHFPKSSS--------------------------- 2022 PW+V +G S+ E+ N+ SS Sbjct: 1221 PWVVIDGTTVVQQSSIAEENRVLRFNNLTFRSSIGAQDLLRFKGSNVFSPYASDLTSGIS 1280 Query: 2021 ----WMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQSNKSLNSNYQRNDFLK-GKPS 1857 W + + AEI EL+KWL D D +R+L ES+LQ + NSN N L+ +P+ Sbjct: 1281 
VYFQWFSHQSYAEIEELIKWLRDNDPMQRELIESLLQRLNFGYSNSNKAANYVLEMNQPA 1340 Query: 1856 SSVISSEQRVPNHNCRVLKGVRALEKKFGLGMDMGADYINKNLEHNHEMTYQGRMCRCVC 1677 S ++ E+ + + + + ALEKK+G M++ I+ N ++TY RMCRC C Sbjct: 1341 SMPVNIEKTLKPKSLET-RALTALEKKYGPCMELDVTNISVKFSRNLKVTYDDRMCRCEC 1399 Query: 1676 LELIWPSRHHCFSCHQTFSASEELDQHGE-KCRAAATGAAGSLKHDKTVMNQGIRICPGS 1500 LE IWPSRHHC SCH++FS+ EL++H + KC A A S D V + + + Sbjct: 1400 LEAIWPSRHHCLSCHRSFSSRCELEEHNDGKCGAGAHTPQNSRVTD-DVSKEKVLMRAEH 1458 Query: 1499 SNIPQSVLNEKHDTQSNCVKXXXXXXXXXXXXEIMANFKVDYSIEEDIKGIGLFGTNGVV 1320 H+ + + EI A F S +E +K IGL G+NG+ Sbjct: 1459 GEWQCKAGGAGHEIEFGLIGFRKEFMSPYNLEEISAKFVTRSSNKELVKEIGLLGSNGIP 1518 Query: 1319 PFLPNSSPCFNDPALTLVPPSGNEVSMEDQTSVPKSQQKLSDDLAKTASVVNDKSSGIEK 1140 +P SSP DP L LV P NEV Q++ ++ D T S + S K Sbjct: 1519 SLVPCSSPYLIDPTLKLVLPCVNEVCQSVQSTNVENGSLQGD---TTTSKRHANKSNATK 1575 Query: 1139 GKGKVSEVECMKSRVLCERGRLXXXXXXXXXSTARR-----SIIRESSLRPKVGYATEVL 975 V E L E GR + + S IR S+LRP VG +L Sbjct: 1576 DCTAVDLYE-----ELQEIGRSYLMNQSSLRFSCTKLGNPLSEIRGSALRPLVGKGAHIL 1630 Query: 974 RLLKISMLDIDAALPEAALRASRSHLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDY 795 R LKI++LD+DAALPE A+++S +L++RCAWR FVK A +++EM+ A I+LE+ I+TD+ Sbjct: 1631 RQLKINLLDMDAALPEEAVKSSNIYLEKRCAWRAFVKSAKSVFEMVQATIVLENMIKTDF 1690 Query: 794 LKNNWWYWSSPSAASNMPTLSALALRIFTLDSAILYEKSSTDDTAGTSIPVSQP------ 633 L+N WWYWSS SAA+ + T+S+LALRI+TLD+AI+YEK ++P + P Sbjct: 1691 LRNEWWYWSSLSAAAKIATISSLALRIYTLDAAIVYEK---------TLPFTPPKDIAEV 1741 Query: 632 -DKEASSSAIPTTEMKSAEQPMQK 564 K ++++ P T+++S +P K Sbjct: 1742 GSKSDNNNSPPHTDLESNPKPSSK 1765 >ref|XP_006365207.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like [Solanum tuberosum] Length = 2173 Score = 526 bits (1356), Expect = e-146 Identities = 358/976 (36%), Positives = 502/976 (51%), Gaps = 111/976 (11%) Frame = -3 Query: 3164 TESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPL 2985 +E K+RD +LA VN+SSLPKAPWEEG+CKVC MDKDDVNVLLCD CDSEYHTYCL+PPL Sbjct: 1180 
SEVAKDRDGLLAHVNESSLPKAPWEEGLCKVCSMDKDDVNVLLCDKCDSEYHTYCLDPPL 1239 Query: 2984 GKVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELK 2805 KVP G WYCP C S S++A+ G+ + QC KRR RKL + +E L+QL MELK Sbjct: 1240 VKVPIGPWYCPDCEAKISRSQNASSGSHTIRQCVKRRLHRKLTHKFMEKLSQLTRTMELK 1299 Query: 2804 EYWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEE 2625 EYWE +E+RI LLKF DE +NSA +RDHID+ AS+ +L Q+LRSL +E K+LK K+E Sbjct: 1300 EYWELPLEDRIFLLKFLCDEMLNSAILRDHIDRSASLSAELQQKLRSLGAELKLLKHKKE 1359 Query: 2624 SLAANVAKMKGNVHTGG--GELASVLADESQLPVD-----NKVSSFSGGSVPMDGGP-HT 2469 L A K+K + + G G S+ +++ +L V + SS SGG +D G H Sbjct: 1360 ILTA---KLKNDARSSGDTGSDTSLWSNDCKLKVQGPDSGSHNSSISGGCRQLDDGTQHN 1416 Query: 2468 K----DQVSILRSDVNLHPKL--GDTXXXXXXXXXXXQMLGYGL--SNTVSKNITVHAS- 2316 K ++ S L + N+ K T + L NT S N + HA Sbjct: 1417 KCNDYNKQSCLYTSKNIQDKTCASGTNHIRNSPDPINHLQHQQLLKENTRSLNTSSHAKC 1476 Query: 2315 ----------------------SFPGHQYSNQPNANSLLDYNA----------------- 2253 PG+ + P+++ + A Sbjct: 1477 GTEEANLQNDLFISTTLQQETDQIPGNCLESTPSSSKSIMLFATHIVSATTCSGSVSNPL 1536 Query: 2252 ------ELSKLQQEISGLQESISTAESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWI 2091 E+S +++EI L++SI+ E EL +VS+RKE++G+DS GRLYW FG +S + Sbjct: 1537 EEAFLFEMSAIKKEIRALEDSIAAKELELQEVSVRKEYMGQDSEGRLYWTFGRSTSSRLV 1596 Query: 2090 V-ANGSLNPESE-----FGLDNH----------------FPKSSSWMYYDTVAEIGELVK 1977 A+ S PES +G+++ P W Y + + L++ Sbjct: 1597 AYASTSTQPESSGHLWSYGVESSRRSGVFDSSAPWENMGMPNLDQWTSYQSDVDTEILIR 1656 Query: 1976 WLDDCDIRERDLKESILQWQSNKSLNSNYQRNDFLKGKPSSSVISSEQRVP--NHNCRVL 1803 WL + D RER+LKESILQW+ + + Y + + I SE N + V Sbjct: 1657 WLKEHDPRERELKESILQWRDTRKMIYYYLESHGHDKVRLITSIPSEDSASCFNSDSLVT 1716 Query: 1802 KGVRALEKKFGLGMDMGADYINKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTF 1623 + V A++K I NL +++ G + RC CLE +WPSR HC SCHQTF Sbjct: 1717 RAVTAIKKMVSGCSAEEETEICTNLGVKVRVSFDGELYRCECLEPLWPSRPHCLSCHQTF 1776 Query: 1622 S-ASEELDQHGEKCRAAATG--------AAGSLKHDKTVMNQGIRICPGSSNIPQSVLN- 1473 S A E L EKCR + + K +T N+ ++ S+++ Q+ + Sbjct: 1777 
SDAKERLKHANEKCRIDSPSPIQRDGETSEQPAKRKRTANNEILQDNSLSNDVSQASKSK 1836 Query: 1472 ----------EKHDTQSNCVKXXXXXXXXXXXXEIMANFKVDYSIEEDIKGIGLFGTNGV 1323 +KH + EI A F S++E + IGL G NG Sbjct: 1837 KLGNGEASRRDKHGNAPASAENQTKQECPFKFEEIKAQFITQRSLKELVNEIGLIGCNGT 1896 Query: 1322 VPFLPNSSPCFNDPALTLVPPSGNEVSMEDQTSVPKSQQKLSDDLAKTASVVNDKSSGIE 1143 F+P +SP D AL L+ +EV + T + S+ +L + + S +N+ + Sbjct: 1897 PSFIPCTSPYLCDSALELLSQREDEVCGGNSTDLLSSEHQLRNGVK--VSCINNSDNPNC 1954 Query: 1142 KGKGKVSEVECM-KSRVLCERGR---LXXXXXXXXXSTARRSIIRESSLRPKVGYATEVL 975 G G + + +RGR +I ESSL P G A+ +L Sbjct: 1955 TGNGLAGAGPVFGRLKSATKRGRNQFSSTKDKILEFGVNMYFVIPESSLHPVAGRASVIL 2014 Query: 974 RLLKISMLDIDAALPEAALRASRSHLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDY 795 R LKI++LDIDAALPE ALR SR +RR WR FVK A T+YEM+ A IILED I+T+Y Sbjct: 2015 RCLKINLLDIDAALPEEALRVSRLQSERRRVWRAFVKSAATIYEMVQATIILEDAIKTEY 2074 Query: 794 LKNNWWYWSSPSAASNMPTLSALALRIFTLDSAILYEKSSTDDTAGTSIPVSQPDKEASS 615 LKN+WWYWSSPSAA+ + TLSALALR++ LDSAILY+K S+ D + T + + Sbjct: 2075 LKNDWWYWSSPSAAARISTLSALALRVYALDSAILYDKLSSQDASETD--CKEEREPPPR 2132 Query: 614 SAIPT-TEMKSAEQPM 570 +++PT T S ++P+ Sbjct: 2133 NSVPTNTASPSKKKPL 2148 >ref|XP_006483833.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like isoform X2 [Citrus sinensis] Length = 2084 Score = 511 bits (1316), Expect = e-142 Identities = 338/956 (35%), Positives = 505/956 (52%), Gaps = 81/956 (8%) Frame = -3 Query: 3164 TESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPL 2985 +E++KE +++L + S +PKAPW+EG+CKVCG+DKDD NVLLCDTCDS YHTYCL PPL Sbjct: 1131 SEAKKEMEDILE--SASEIPKAPWDEGVCKVCGIDKDDDNVLLCDTCDSGYHTYCLTPPL 1188 Query: 2984 GKVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELK 2805 +VP+GNWYCP C++GN ++ + V ++ KRR+Q + LE + LA ME++ Sbjct: 1189 TRVPEGNWYCPPCLSGNCKNKYMSQVPHVSSRIPKRRHQGEFTCRILEEVFHLAATMEMR 1248 Query: 2804 EYWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEE 2625 +YW+++ +ERI LLKF DE +NS IR+H+++CAS+ VDL Q++RSL+ E + LK +EE Sbjct: 1249 
DYWDYSDKERIFLLKFLCDELLNSTNIREHLERCASVSVDLQQKIRSLSLEWRNLKFREE 1308 Query: 2624 SLAANVAKMKGNV-------------------------HTGGG----ELASVLA---DES 2541 LA VA+ K +V +GGG LAS LA D Sbjct: 1309 ILAGKVARDKASVLSGTGKCGTEGVATLYPHYGKLMRQPSGGGGYFSSLASDLALSEDGL 1368 Query: 2540 QLPVDNKVSSF-----------SGGSVPMDGGPHTKDQV--SILRSDVNLHPKLGDTXXX 2400 QL K+S + S + P+T+ QV + ++ + D Sbjct: 1369 QLNESRKLSCWFNLKGISMRQPSCSRNQIGEAPYTESQVHQESEKDNIRVDDLQYDVPHS 1428 Query: 2399 XXXXXXXXQMLGYGLSNTVSKNITVHASSFPGHQYSNQPNAN-SLLDYNAELSKLQQEIS 2223 Y +++ +S P QPN S ++++ + Q Sbjct: 1429 ASQPQKQDTAGEYATWRNKGQDLENGHTSGP-----LQPNCEASQSHFSSDHTNGNQVAE 1483 Query: 2222 GLQESISTAESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGSLNPESEFGLDN 2043 L +SI+ ES+ L VSLRKE LGRDS GRLYW F + SPW++ + + E E L Sbjct: 1484 HLCDSIAGLESQQLAVSLRKELLGRDSAGRLYWAFFRPNTSPWLLVDATTVLEQERILKE 1543 Query: 2042 H---------------FPKSSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQS-- 1914 H SSSW Y + EI EL++WL D D R+++L ESIL+W Sbjct: 1544 HGDSLANSPFEEEYNGISASSSWFSYQSDTEIEELIQWLSDSDPRDKELAESILRWTKIG 1603 Query: 1913 --NKSLNSNYQRNDFLKGKPSSSVISSEQRVPNHNCRVLKGVRALEKKFGLGMDMGADYI 1740 + + N+ ++ + PSSS + + V K + LE+K G ++ + Sbjct: 1604 YKDLKIAGNHIEDESV---PSSSKCRKSEATVKSSGLVTKALTVLEEKHGPCLEPEVLKM 1660 Query: 1739 NKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGE-KCRAAATGA 1563 + L+ N E+T + RM RC CLE + P+R HC CH +FSA EL++H + KC +AT + Sbjct: 1661 SMKLDTNSELTCKERMYRCECLEPVLPTRFHCRRCHLSFSARNELEEHNDAKCILSATSS 1720 Query: 1562 AGSLKHDKTVMNQG-IRI------C--PGSSNIPQSVLNEKHDTQSNCVKXXXXXXXXXX 1410 S + D+ G IR C + QS+ KH T + Sbjct: 1721 QNSKEDDERTKGAGTIRTETLQAECMETAGKGMSQSL---KHGTAMGSFEIPKEFACPFN 1777 Query: 1409 XXEIMANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSMEDQ 1230 EI F SI+E ++ IGL G+NGV F+P++SP DP+L LV NE++ ++ Sbjct: 1778 FEEISTKFITKNSIKELVQEIGLIGSNGVPAFVPSTSPYLCDPSLKLVEMCKNEINRGNK 1837 Query: 1229 TSVPKS--QQKLSDDLAKTASVVNDKSSGIEKGKGKVSEVECMKSRVL----CERGRLXX 1068 ++ ++ Q + D+ N ++ + ++ + +K R L R Sbjct: 1838 STNLENLFQYSIVGDMVSGLEHDNISNNSSRRCTVSHNDDDVLKCRRLNPNFMNEKRDQS 
1897 Query: 1067 XXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRR 888 SI+R++SL P +G E+LR LKI++LD+DAA+PE ALR+S++ + R Sbjct: 1898 FSLSLKPGIGNSSIVRDTSLMPLMGRGIEILRQLKINLLDMDAAVPEEALRSSKACWENR 1957 Query: 887 CAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFT 708 AWR FVK A +++EM+ A I+ ED I+TDYL+N WWYWSS S A+N+ T+SALALR++T Sbjct: 1958 SAWRAFVKSAKSIFEMVQATIVFEDMIKTDYLRNGWWYWSSLSGAANIATVSALALRLYT 2017 Query: 707 LDSAILYEKSSTDDTAGTSIPVSQPDKEASSSAIPTTEMKSAEQPMQKVNDSDSGE 540 LD+AI+YEK S D+ +SQPDKE S P + KS +P + + + S + Sbjct: 2018 LDAAIVYEKHS--DSIEIQEHISQPDKETS----PCKDSKSNPKPSKAILKTQSSD 2067 >ref|XP_006446469.1| hypothetical protein CICLE_v10014026mg [Citrus clementina] gi|557549080|gb|ESR59709.1| hypothetical protein CICLE_v10014026mg [Citrus clementina] Length = 1680 Score = 504 bits (1298), Expect = e-140 Identities = 329/958 (34%), Positives = 495/958 (51%), Gaps = 90/958 (9%) Frame = -3 Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982 E+ KE +++L V S +PKAPW+EGICKVCG+DKDD +VLLCDTCD+EYHTYCL PPL Sbjct: 727 ETTKEINDIL--VQTSEIPKAPWDEGICKVCGVDKDDDSVLLCDTCDAEYHTYCLEPPLV 784 Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802 ++P+GNWYCPSCV NS+ + A+ +QV Q ++YQ ++ LE L L ME KE Sbjct: 785 RIPEGNWYCPSCVVRNSMVQGASEHSQVGGQHKGKKYQGEITRLCLEELRHLTTVMEEKE 844 Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEES 2622 YWEF V ER LLKF DE +NSA +R H++QC + +L Q+LRS + E K LK +EE+ Sbjct: 845 YWEFNVHERTFLLKFLCDELLNSALLRQHLEQCTEVTAELQQKLRSFSVEFKNLKSREET 904 Query: 2621 LAANVAKMKGNVHTGGGEL------ASVLADESQLPVDNKVSSF-----------SGGSV 2493 +AA VAK++ ++ E+ A+V+ + + + SS SG Sbjct: 905 VAARVAKVEASMTNSVAEICMKEGPATVIRNNGKCIEQPQNSSNRSNCSVIALEESGPMY 964 Query: 2492 PMDGG-----PH-TKDQVSILRSDVNLHPK---LGDTXXXXXXXXXXXQMLGYGLSNTVS 2340 P D PH ++ ++D ++ P L + + L Sbjct: 965 PTDAEGQIEEPHGDNSKMPSQKNDESIKPNEHPLASSLPQEIDNLSGEIRSQHNLQELAR 1024 Query: 2339 KNITVHASSFPGHQYSNQPNA------------NSLLDYNAELSKLQQEISGLQESISTA 2196 +S +Q + 
PN N +N EL+ ++ +I LQESI++ Sbjct: 1025 ARDAATLASPSNNQGPSVPNELHVTEGTCSVTMNEPQAHNLELNNIRNDILLLQESITSL 1084 Query: 2195 ESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGS-------------------- 2076 E +LLK+S+R+EFLG DS GRLYWV P ++ +GS Sbjct: 1085 EQQLLKLSVRREFLGSDSSGRLYWVLPLPGMHPCLIVDGSPELQQKRKILDFRGPVDKGL 1144 Query: 2075 -LNPESEFGLDNHFPK-------------------SSSWMYYDTVAEIGELVKWLDDCDI 1956 L S G D + SS W+ Y T AEI ELV WL D D Sbjct: 1145 VLKNSSSSGSDAYSSSKGSKACCPFQYDPYAVTATSSHWILYQTDAEIEELVNWLRDNDP 1204 Query: 1955 RERDLKESILQWQSNKSLNSNY-QRNDFLKGKPSSSVISSEQRVPNHNCRVLKGVRALEK 1779 +ER+LK+SIL W+ + +S + ++ + + + +SS ++ +V +C V K LEK Sbjct: 1205 KERELKDSILNWKKIRFQDSQHTKKQSWDEYQSASSAPTNSDKVDCFDCLVTKAATLLEK 1264 Query: 1778 KFGLGMDMGADYINKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQ 1599 K+G + ++ + K +T Q +M RC CLE IWPSR+HC SCH+TFS + E ++ Sbjct: 1265 KYGPCFE--SEEVLKKGGKRARVTSQEKMYRCECLEPIWPSRNHCLSCHRTFSTAVEFEE 1322 Query: 1598 HGEKCRAAATGAAGSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSNCVKXXXXXXX 1419 H + C +A + + ++ +G + S + + ++ + + Sbjct: 1323 HNDTCNSAPPAYEKNKEASNSLKGKGNKKSDISRAACGTDVELVETSKPSGLIRFQNDGC 1382 Query: 1418 XXXXXEIMANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSM 1239 EI + F S +E ++ IGL G+ G+ +P+ SP +D L L+ S EV + Sbjct: 1383 PFDLNEISSKFMTQDSNKELVQEIGLLGSKGIPSLIPSVSPFLSDSTLMLMS-SQKEVGV 1441 Query: 1238 ED-----QTSVPKSQQKLS------DDLAKTASVVNDKSSGIEKGKGKVSEVECMKSRVL 1092 D ++ SQ K S D++A AS + + E K K C + R Sbjct: 1442 PDGQLMASETLSSSQGKQSMKNAGNDNMADDASRKSGSNGTHEVLKSKKPAFGCSEQRDR 1501 Query: 1091 CERGRLXXXXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRA 912 + ++ +SSLRP +G +++ R LK+++LDIDAALPE ALR Sbjct: 1502 KSSSHVRVPKVGINQCC----VVPQSSLRPLIGRTSQIKRRLKVNLLDIDAALPEEALRP 1557 Query: 911 SRSHLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLS 732 S++HL+RR AWR FVK A T+YEM+ A IILED I+T++L+N WWYWSS SAA+ T+S Sbjct: 1558 SKAHLERRWAWRAFVKSAETIYEMVQATIILEDMIKTEFLRNEWWYWSSLSAAAKTSTMS 1617 Query: 731 ALALRIFTLDSAILYEKSSTDDTAGTSIPVSQPDKEASSSAIPTTEMKSAEQPMQKVN 558 +LALRI++LD+AI+Y+KS+T+ ++ + D +P 
E+ + +K N Sbjct: 1618 SLALRIYSLDAAIIYDKSTTNLNPVENLKL---DSTPEHKPLPGVELLEKSKVSRKSN 1672 >ref|XP_006470356.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like isoform X2 [Citrus sinensis] Length = 2023 Score = 503 bits (1296), Expect = e-139 Identities = 327/956 (34%), Positives = 496/956 (51%), Gaps = 88/956 (9%) Frame = -3 Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982 E+ KE +++L V S +PKAPW+EGICKVCG+DKDD +VLLCDTCD+EYHTYCL PPL Sbjct: 1072 ETTKEINDIL--VQTSEIPKAPWDEGICKVCGVDKDDDSVLLCDTCDAEYHTYCLEPPLV 1129 Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802 ++P+GNWYCPSCV NS+ + A+ +QV Q + Q ++ LE L L ME KE Sbjct: 1130 RIPEGNWYCPSCVVRNSMVQGASEHSQVGGQHKGKNNQGEITRLCLEALRHLTTVMEEKE 1189 Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEES 2622 YWEF V ER LLKF DE +NSA +R H++QC + +L Q+LRS + E K LK +EE+ Sbjct: 1190 YWEFNVHERTFLLKFLCDELLNSALLRQHLEQCTEVTAELQQKLRSFSVEFKNLKSREET 1249 Query: 2621 LAANVAKMKGNVHTGGGEL------ASVLADESQLPVDNKVSSF-----------SGGSV 2493 +AA VAK++ ++ E+ A+V+ + + + SS SG Sbjct: 1250 VAARVAKVEASMTYSVAEVCMKEGPATVIRNNGKCIEQPQNSSNRSNCSVIALEESGPMY 1309 Query: 2492 PMDGG-----PH-TKDQVSILRSDVNLHPK---LGDTXXXXXXXXXXXQMLGYGLSNTVS 2340 P D PH ++ ++D ++ P L + + L Sbjct: 1310 PTDAEGQIEEPHGDNSKMPSQKNDESIKPNEHPLASSLPQEIDNLSGEIRSQHNLQELAR 1369 Query: 2339 KNITV------HASSFPGHQYSNQPNANSLLD----YNAELSKLQQEISGLQESISTAES 2190 T+ H S P + + + ++ +N EL+ ++ +I LQESI++ E Sbjct: 1370 DAATLASPSNNHGPSVPNELHVTEGTCSVTMNEPQAHNLELNNIRNDILLLQESITSLEQ 1429 Query: 2189 ELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGS---------------------L 2073 +LLK+S+R+EFLG DS GRLYWV P ++ +GS L Sbjct: 1430 QLLKLSVRREFLGSDSSGRLYWVLPLPGMHPCLIVDGSPELQQKRKILDFRGPVDKGLVL 1489 Query: 2072 NPESEFGLDNHFPK-------------------SSSWMYYDTVAEIGELVKWLDDCDIRE 1950 S G D + SS W+ Y T AEI ELV WL D D +E Sbjct: 1490 KNSSSSGSDAYSSSKGSKACCPFQYDPYAVTATSSHWILYQTDAEIEELVNWLRDNDPKE 1549 Query: 1949 RDLKESILQWQSNKSLNSNY-QRNDFLKGKPSSSVISSEQRVPNHNCRVLKGVRALEKKF 1773 
R+LK+SIL W+ + +S + ++ + + + +SS ++ +V +C V K LEKK+ Sbjct: 1550 RELKDSILNWKKIRFQDSQHTKKQSWDEYQSASSAPTNSDKVDCFDCLVTKAATLLEKKY 1609 Query: 1772 GLGMDMGADYINKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHG 1593 G + ++ + K +T Q +M RC CLE IWPSR+HC SCH+TFS + E ++H Sbjct: 1610 GPCFE--SEEVLKKGGKRARVTSQEKMYRCECLEPIWPSRNHCLSCHRTFSTAVEFEEHN 1667 Query: 1592 EKCRAAATGAAGSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSNCVKXXXXXXXXX 1413 + C +A + + ++ +G + S + + ++ + + Sbjct: 1668 DTCNSAPPAYEKNKEASNSLKGKGNKKSDISHAAGGTDVELVETSKPSGLIRFQNDGCPF 1727 Query: 1412 XXXEIMANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSMED 1233 EI + F S +E ++ IGL G+ G+ +P+ SP +D L L+ P EV + D Sbjct: 1728 DLNEISSKFMTQDSNKELVQEIGLLGSKGIPSLIPSVSPFLSDSTLMLMSPQ-KEVGVPD 1786 Query: 1232 -----QTSVPKSQQKLS------DDLAKTASVVNDKSSGIEKGKGKVSEVECMKSRVLCE 1086 ++ SQ K S D++A AS + + E K K C + R Sbjct: 1787 GQLMASETLSSSQGKQSMKNAGNDNMADDASRKSGSNGTHEVLKSKKPAFGCSEQRDRKS 1846 Query: 1085 RGRLXXXXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASR 906 + + ++ +SSLRP +G +++ R LK+++LDIDAALPE ALR S+ Sbjct: 1847 SSHVRVPKVGIN----QCCVVPQSSLRPLIGRTSQIKRRLKVNLLDIDAALPEEALRPSK 1902 Query: 905 SHLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSAL 726 +HL+RR AWR FVK A T+YEM+ A IILED I+T++L+N WWYWSS SAA+ T+S+L Sbjct: 1903 AHLERRWAWRAFVKSAETIYEMVQATIILEDMIKTEFLRNEWWYWSSLSAAAKTSTMSSL 1962 Query: 725 ALRIFTLDSAILYEKSSTDDTAGTSIPVSQPDKEASSSAIPTTEMKSAEQPMQKVN 558 ALRI++LD+AI+Y+KS+T+ ++ + D +P E+ + +K N Sbjct: 1963 ALRIYSLDAAIIYDKSTTNLNPVENLKL---DSTPEHKPLPGVELLEKSKVSRKSN 2015 >ref|XP_006470355.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like isoform X1 [Citrus sinensis] Length = 2159 Score = 503 bits (1296), Expect = e-139 Identities = 327/956 (34%), Positives = 496/956 (51%), Gaps = 88/956 (9%) Frame = -3 Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982 E+ KE +++L V S +PKAPW+EGICKVCG+DKDD +VLLCDTCD+EYHTYCL PPL Sbjct: 1208 ETTKEINDIL--VQTSEIPKAPWDEGICKVCGVDKDDDSVLLCDTCDAEYHTYCLEPPLV 1265 Query: 2981 
KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802 ++P+GNWYCPSCV NS+ + A+ +QV Q + Q ++ LE L L ME KE Sbjct: 1266 RIPEGNWYCPSCVVRNSMVQGASEHSQVGGQHKGKNNQGEITRLCLEALRHLTTVMEEKE 1325 Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEES 2622 YWEF V ER LLKF DE +NSA +R H++QC + +L Q+LRS + E K LK +EE+ Sbjct: 1326 YWEFNVHERTFLLKFLCDELLNSALLRQHLEQCTEVTAELQQKLRSFSVEFKNLKSREET 1385 Query: 2621 LAANVAKMKGNVHTGGGEL------ASVLADESQLPVDNKVSSF-----------SGGSV 2493 +AA VAK++ ++ E+ A+V+ + + + SS SG Sbjct: 1386 VAARVAKVEASMTYSVAEVCMKEGPATVIRNNGKCIEQPQNSSNRSNCSVIALEESGPMY 1445 Query: 2492 PMDGG-----PH-TKDQVSILRSDVNLHPK---LGDTXXXXXXXXXXXQMLGYGLSNTVS 2340 P D PH ++ ++D ++ P L + + L Sbjct: 1446 PTDAEGQIEEPHGDNSKMPSQKNDESIKPNEHPLASSLPQEIDNLSGEIRSQHNLQELAR 1505 Query: 2339 KNITV------HASSFPGHQYSNQPNANSLLD----YNAELSKLQQEISGLQESISTAES 2190 T+ H S P + + + ++ +N EL+ ++ +I LQESI++ E Sbjct: 1506 DAATLASPSNNHGPSVPNELHVTEGTCSVTMNEPQAHNLELNNIRNDILLLQESITSLEQ 1565 Query: 2189 ELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGS---------------------L 2073 +LLK+S+R+EFLG DS GRLYWV P ++ +GS L Sbjct: 1566 QLLKLSVRREFLGSDSSGRLYWVLPLPGMHPCLIVDGSPELQQKRKILDFRGPVDKGLVL 1625 Query: 2072 NPESEFGLDNHFPK-------------------SSSWMYYDTVAEIGELVKWLDDCDIRE 1950 S G D + SS W+ Y T AEI ELV WL D D +E Sbjct: 1626 KNSSSSGSDAYSSSKGSKACCPFQYDPYAVTATSSHWILYQTDAEIEELVNWLRDNDPKE 1685 Query: 1949 RDLKESILQWQSNKSLNSNY-QRNDFLKGKPSSSVISSEQRVPNHNCRVLKGVRALEKKF 1773 R+LK+SIL W+ + +S + ++ + + + +SS ++ +V +C V K LEKK+ Sbjct: 1686 RELKDSILNWKKIRFQDSQHTKKQSWDEYQSASSAPTNSDKVDCFDCLVTKAATLLEKKY 1745 Query: 1772 GLGMDMGADYINKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHG 1593 G + ++ + K +T Q +M RC CLE IWPSR+HC SCH+TFS + E ++H Sbjct: 1746 GPCFE--SEEVLKKGGKRARVTSQEKMYRCECLEPIWPSRNHCLSCHRTFSTAVEFEEHN 1803 Query: 1592 EKCRAAATGAAGSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSNCVKXXXXXXXXX 1413 + C +A + + ++ +G + S + + ++ + + Sbjct: 1804 DTCNSAPPAYEKNKEASNSLKGKGNKKSDISHAAGGTDVELVETSKPSGLIRFQNDGCPF 1863 Query: 1412 
XXXEIMANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSMED 1233 EI + F S +E ++ IGL G+ G+ +P+ SP +D L L+ P EV + D Sbjct: 1864 DLNEISSKFMTQDSNKELVQEIGLLGSKGIPSLIPSVSPFLSDSTLMLMSPQ-KEVGVPD 1922 Query: 1232 -----QTSVPKSQQKLS------DDLAKTASVVNDKSSGIEKGKGKVSEVECMKSRVLCE 1086 ++ SQ K S D++A AS + + E K K C + R Sbjct: 1923 GQLMASETLSSSQGKQSMKNAGNDNMADDASRKSGSNGTHEVLKSKKPAFGCSEQRDRKS 1982 Query: 1085 RGRLXXXXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASR 906 + + ++ +SSLRP +G +++ R LK+++LDIDAALPE ALR S+ Sbjct: 1983 SSHVRVPKVGIN----QCCVVPQSSLRPLIGRTSQIKRRLKVNLLDIDAALPEEALRPSK 2038 Query: 905 SHLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSAL 726 +HL+RR AWR FVK A T+YEM+ A IILED I+T++L+N WWYWSS SAA+ T+S+L Sbjct: 2039 AHLERRWAWRAFVKSAETIYEMVQATIILEDMIKTEFLRNEWWYWSSLSAAAKTSTMSSL 2098 Query: 725 ALRIFTLDSAILYEKSSTDDTAGTSIPVSQPDKEASSSAIPTTEMKSAEQPMQKVN 558 ALRI++LD+AI+Y+KS+T+ ++ + D +P E+ + +K N Sbjct: 2099 ALRIYSLDAAIIYDKSTTNLNPVENLKL---DSTPEHKPLPGVELLEKSKVSRKSN 2151 >gb|EXC31622.1| Methyl-CpG-binding domain-containing protein 9 [Morus notabilis] Length = 2259 Score = 475 bits (1222), Expect = e-131 Identities = 331/1003 (33%), Positives = 480/1003 (47%), Gaps = 135/1003 (13%) Frame = -3 Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982 E +KE D +L+ N +PKAPW+EG+CKVCG+D+DD +VLLCDTCD+EYHTYCLNPPL Sbjct: 1267 EMRKEIDYLLSSTN--VIPKAPWDEGVCKVCGIDRDDDSVLLCDTCDAEYHTYCLNPPLL 1324 Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802 ++P+GNWYCPSCV G +D QV+ Q ++YQ ++ LE LA LA KME KE Sbjct: 1325 RIPEGNWYCPSCVVGRRTVQDVPENVQVIRQRSGKKYQGEVTRVYLEALAHLATKMEEKE 1384 Query: 2801 YWEFTVEERILLL----------------------------------------KFFSDEA 2742 YWEF+V+E +LLL KF DE Sbjct: 1385 YWEFSVDESMLLLRPTLRKGRPGEGRLGKARVGHPEWAAVDVGVGSVVRSFLMKFLCDEL 1444 Query: 2741 MNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEESLAANVAKMKGNVHTGGGELA 2562 +NSA IR H++QCA +L Q+LR+L E KILK +EE L A AK N+ G + Sbjct: 1445 
LNSAIIRQHLEQCADTSTELQQKLRALFVEWKILKSREEILVARAAKHDPNILNSLGAVG 1504 Query: 2561 --------------SVLADESQ---LPVDNKVSSFSGGSVP-----MDGGPHTKDQVSIL 2448 L+D S + D+ +S+ GG +D D S Sbjct: 1505 IRESLFSNHNKGQTPALSDRSNCCGMSTDD-LSTLGGGREAIEPSGLDRSSSATDSQSNC 1563 Query: 2447 RSDVNLHPKLGDTXXXXXXXXXXXQMLGYGLS---------NTVSKNITVHASSFPGHQY 2295 ++ ++ +L D +V K+ + H + Sbjct: 1564 QNPLDTEDQLKDAHASVEESNTVLNEADASCGAICSTGNPHESVGKDSSSTLKPVGQHGH 1623 Query: 2294 SNQPNA-------------NSLLDYNAELSKLQQEISGLQESISTAESELLKVSLRKEFL 2154 SN + N L ++ EL ++ +I+ L+ESI++ ESELLKVS+R+EFL Sbjct: 1624 SNASDVRSTIGQSVPAATVNELQGHHVELKSVKNDITILEESITSVESELLKVSVRREFL 1683 Query: 2153 GRDSYGRLYWVFGSYDASPWIVANGSLNPESEFGLDNH---------------------- 2040 G D G LYWV G+ S I+ + S S ++N Sbjct: 1684 GSDFVGCLYWVSGTPTGSSCIIVDRSAALRSGKKMNNFQRPVGKSSVLQCSIQSVPIQCE 1743 Query: 2039 ----FPKSSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQSNKSLNSNYQRNDFL 1872 S W+ Y T +I +LV L D +ER+LKESIL WQ K +Q+N + Sbjct: 1744 RNSVVASDSPWVSYQTDGDIDQLVSCLKTNDTKERELKESILHWQ--KLRFQEFQKNK-I 1800 Query: 1871 KGKPSSSVIS---SEQRVPNHNCRVLKGVRALEKKFGLGMDMGADYINKNLEHNHEMTYQ 1701 +G+ + + S ++ + V + LEK++G + I K +T Sbjct: 1801 RGQAECAAFAASISGEKATFSDGLVTRAANLLEKRYGPCNQLETTDILKKRGKKARLTDD 1860 Query: 1700 GRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGE-KCRAAATG------------AA 1560 +M RC CLELIWP RHHC SCH+TF EL+ H E KC + A A Sbjct: 1861 NKMYRCECLELIWPCRHHCLSCHRTFFNDIELEGHNEGKCNSVALAQEKRKEISDSSKAK 1920 Query: 1559 GSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSNCVKXXXXXXXXXXXXE-IMANFK 1383 SLK D + + IP++ +E + +K E I + F Sbjct: 1921 DSLKSDANREDSTGEM--SRVEIPKTGFSE---LSAKLIKFQDEGLSCPYDFEEICSKFV 1975 Query: 1382 VDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSMEDQTSVPKSQQK 1203 S ++ ++ IGL G+ GV F+ + SPC +D L L+ P + + + + Sbjct: 1976 TKDSCKDLVQEIGLIGSKGVPSFVSSMSPCLDDSTLALISPQKDVGAQGGGSEAAERPVS 2035 Query: 1202 LSDDLAKTAS--VVNDKSSGIEKGKGKVSEVECMKSRVLC------ERGRLXXXXXXXXX 1047 L A +++D+S + + E+ +KS+ L G Sbjct: 2036 LGTGTITIAGWDILSDRSPK----RSAMKEINAVKSQRLTLGYIEQREGIRCSGSHSSEM 2091 Query: 1046 
STARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRRCAWRTFV 867 R ++ + SLRP VG +++ R LKI++LD+DAALPE ALR S+SHL RR AWR FV Sbjct: 2092 GATRCCVVPQFSLRPLVGKVSQIYRRLKINLLDMDAALPEEALRPSKSHLGRRWAWRAFV 2151 Query: 866 KCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFTLDSAILY 687 K A T+YEM+ A I+LED I+T+YLKN WWYWSS SAA+ T+S+LALRI++LD+AI+Y Sbjct: 2152 KSATTIYEMVQATIVLEDMIKTEYLKNEWWYWSSFSAAARTSTMSSLALRIYSLDAAIIY 2211 Query: 686 EKSSTDDTAGTSIPVSQPDKEASSSAIPTTEMKSAEQPMQKVN 558 EK S++ S+P + +P ++ + ++ N Sbjct: 2212 EKISSESDPTDK---SEPSNLSEQKPVPVIDLTEKTKITRRSN 2251 >ref|XP_002517349.1| DNA binding protein, putative [Ricinus communis] gi|223543360|gb|EEF44891.1| DNA binding protein, putative [Ricinus communis] Length = 2145 Score = 474 bits (1220), Expect = e-130 Identities = 320/919 (34%), Positives = 475/919 (51%), Gaps = 87/919 (9%) Frame = -3 Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982 E++K+ D VLA N+ +PKAPW+EG+CKVCG DKDD +VLLCDTCD+EYHTYCLNPPL Sbjct: 1198 ETKKDLDIVLASTNE--IPKAPWDEGVCKVCGFDKDDDSVLLCDTCDAEYHTYCLNPPLA 1255 Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802 ++P+GNWYCPSCV+ + E A+ TQV+ Q ++YQ ++ LE L LA+ ME K+ Sbjct: 1256 RIPEGNWYCPSCVSVRMVQE-ASVSTQVIGQNSCKKYQGEMTRIYLETLVHLASAMEEKD 1314 Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEES 2622 YW+F V+ER LLKF DE +NSA +R H++QC ++ Q+LR+L +E K LK KEE Sbjct: 1315 YWDFGVDERTFLLKFLCDELLNSALVRQHLEQCMESTAEVQQKLRTLYAEWKNLKSKEEF 1374 Query: 2621 LAANVAKM----KGNVHTGGGELASVLADE----SQLPV-DNKVSSFSGGS---VPMDGG 2478 +A AKM G V G L S L D+ Q PV +K S S +DG Sbjct: 1375 MALKSAKMGTGASGEVKEG---LVSALKDQGKSVGQPPVLGDKPSDCCAPSDDVSAVDGS 1431 Query: 2477 PHTKDQVSILR--SDVNLHPK-------LGDTXXXXXXXXXXXQMLGYGLSNTVSK-NIT 2328 P + S++N K + T M G SN SK N Sbjct: 1432 PEGNGINGFDKHPSEINYEKKPSHDSQNIDSTNNHGPVKDMHDAMEG---SNDPSKENSK 1488 Query: 2327 VHASSFPGHQYSNQPNANSLLD-----------YNAELSKLQQEISGLQESISTAESELL 2181 + PG S+ NA +L+ Y+ ++S ++ +I LQ IS+ ES+L Sbjct: 1489 
PLGPNHPGFSLSSDMNALVVLNLPSVTMNESQAYHTDVSAIKDDILRLQNLISSMESQLS 1548 Query: 2180 KVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGS------------------LNPESEF 2055 K SLR+EFLG DS G LYW + + P IV + S L S Sbjct: 1549 KQSLRREFLGSDSRGHLYWASATPNGHPQIVVDRSLTFQHRKISHHRLGNSSVLQHSSSS 1608 Query: 2054 GLD-------------------NHFPKSSSWMYYDTVAEIGELVKWLDDCDIRERDLKES 1932 G+D SS+W+ Y+T AEI EL+ WL + + +E +LKES Sbjct: 1609 GIDACLNLEGSRACFPFLFNPNGTLSMSSAWVSYETDAEIEELIGWLGNNNQKEIELKES 1668 Query: 1931 ILQWQSNKSLNSNYQRNDFLKG-KPSSSVISSEQRVPNHNCRVLKGVRALEKKFGLGMDM 1755 I+QW + S R+ + + S I + + NC + K LEK +G +++ Sbjct: 1669 IMQWLKLRFQESQRIRDPVQEECRAGLSTIRNNDQTAFSNC-LTKATLLLEKNYGAFVEL 1727 Query: 1754 GADYINKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGE-KCRA 1578 + K T + + RC CLELIWPSR+HC+SCH+T S E + H + +C + Sbjct: 1728 DTSDMLKKRGKKARGTNEEKTYRCDCLELIWPSRNHCYSCHRTSSNDVEFEGHSDGRCSS 1787 Query: 1577 AATGAAGSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSNCVK--------XXXXXX 1422 S + + ++ +G + +S +++ H + + Sbjct: 1788 VPQSREKSEETNDSLKGRGNVKAEVTWKEKKSEIDKLHSSMGGLSELRARLIKFQNEGIN 1847 Query: 1421 XXXXXXEIMANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVS 1242 +I + F + S +E ++ IGL G+NG+ PF+ + SP +D L+ P N Sbjct: 1848 CPYDLLDICSKFVTEDSNKELVQDIGLIGSNGIPPFVTSISPYLSDSISVLISPENNTRI 1907 Query: 1241 MEDQTSVPKSQQKLSDDLAKTASVVNDKSSGIEKGKGKVSEV-ECMKSR-----VLCERG 1080 D+ +V + Q + + +V+ S + K ++E+ E +K+ L RG Sbjct: 1908 PGDECNVDERQVFPQGNWNENRAVLQSSSDNSTR-KTSINEIGEVLKTNKPPLGCLQRRG 1966 Query: 1079 -RLXXXXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRS 903 + ++ ESSL P VG + +LR LKI++LD++AALPE ALR ++ Sbjct: 1967 KKSSLGKCFPEMGPGCCCVVPESSLMPLVGKVSSILRQLKINLLDMEAALPEEALRPAKG 2026 Query: 902 HLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALA 723 L RR AWR +VK A ++Y+M+ A I+LE+ I+T+YL+N WWYWSS SAA+ T+++LA Sbjct: 2027 QLGRRWAWRAYVKSAESIYQMVRATIMLEEMIKTEYLRNEWWYWSSLSAAAKTSTVASLA 2086 Query: 722 LRIFTLDSAILYEKSSTDD 666 LRI++LD+ I+YEK+S D Sbjct: 2087 LRIYSLDACIVYEKNSNSD 2105 >ref|XP_004141185.1| PREDICTED: methyl-CpG-binding 
domain-containing protein 9-like [Cucumis sativus] Length = 2131 Score = 472 bits (1215), Expect = e-130 Identities = 320/938 (34%), Positives = 481/938 (51%), Gaps = 82/938 (8%) Frame = -3 Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982 E++ E D L +N+ +PKAPW+EG+CKVCG+DKDD +VLLCDTCD+EYHTYCLNPPL Sbjct: 1191 ETKVEVDGFLVSLNE--IPKAPWDEGVCKVCGIDKDDDSVLLCDTCDAEYHTYCLNPPLA 1248 Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQ--VVNQCGKRRYQRKLIDETLEGLAQLANKMEL 2808 ++P+GNWYCPSCV G + ED + T+ ++N ++++ ++ + L LA LA +E Sbjct: 1249 RIPEGNWYCPSCVMGTRMVEDPSEHTKNHIINLHKGKKFRGEVTRDFLNKLANLAAALEE 1308 Query: 2807 KEYWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKE 2628 KEYWEF+V+ER+ LLK+ DE ++SA IR H++QC +L Q+LRS E K LK +E Sbjct: 1309 KEYWEFSVDERLFLLKYLCDELLSSALIRQHLEQCVEALAELQQKLRSCFIEWKNLKCRE 1368 Query: 2627 ESLAANVAKM---------KGNVHTGGGELASVLADESQLPVDNKVSSFSGGSVPMDGGP 2475 E +AA AK+ +G G L + S ++NK + + M Sbjct: 1369 EVVAARAAKLDTTMLSAVREGQGSCDGARLGASDQYSSLTSLENKCHNHASFQEQMSSAH 1428 Query: 2474 HTKDQVSILRSDVNLHPKLGDTXXXXXXXXXXXQMLG--YGLSNTVSKNITVHASSFP-G 2304 D + N+ G + G + + N+ S P G Sbjct: 1429 DVTDNND---AGGNVLSSSGSQNSGKPVKFNEPSLSGLPQEVDGSDQSNMETEISILPSG 1485 Query: 2303 HQYSNQPNANSL------------LDYNAELSKLQQEISGLQESISTAESELLKVSLRKE 2160 QY +AN + Y++EL ++++I +Q+SI++ E ELLK+S+R+E Sbjct: 1486 KQYFTPCDANGVPVAPQVPPPNESQAYHSELDSIKKDILQVQDSIASTELELLKISVRRE 1545 Query: 2159 FLGRDSYGRLYWVFGSYDASPWIVANGS---LNPESE----------------------- 2058 FLG D+ GRLYW + P I+++GS + ES Sbjct: 1546 FLGSDAAGRLYWASVMSNGLPQIISSGSSVHIGSESRDRVVKGRFFKNYTSTSNANSSTL 1605 Query: 2057 ----FGLDNHFPK----SSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQSNK-- 1908 + H PK +S + Y T A+I EL+ WL D D +ER+LKESILQW K Sbjct: 1606 NSNMYSSLLHLPKDFIGNSPCISYQTEADILELIDWLKDSDPKERELKESILQWLKPKLQ 1665 Query: 1907 --SLNSNYQRNDFLKGKPSSSVISSEQRVPNHNCRVLKGVRALEKKFGLGMD-MGADYIN 1737 S ++N + LK SSS + +++ V + LE K+G ++ + D +N Sbjct: 1666 TSSRSNNQSPEEQLKDSSSSSDV---EKLECSGFLVNRASALLESKYGPFLEFVTPDDLN 1722 Query: 1736 
KNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGEKCRAAATGAAG 1557 + L+ + +M RCVC+E +WPSR+HC SCH++FS EL++H ++ + Sbjct: 1723 RWLD-KARLAEDEKMFRCVCMEPVWPSRYHCLSCHRSFSTDVELEEHDNGQCSSLPASCD 1781 Query: 1556 SLKH--DKTVMNQGIRICPGSSNIPQSVLNEKHDTQSN----CVK-XXXXXXXXXXXXEI 1398 +K D + I+ V+ E N +K I Sbjct: 1782 GIKEVGDSSKSKCNIKFESKQEESSSMVIAETSRGYFNHSMGLIKYQNDGMMCPYDFELI 1841 Query: 1397 MANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSMEDQTSV- 1221 + F S ++ IK IGL +NGV FL + SP + L ++ + + ED T + Sbjct: 1842 CSKFLTKDSNKDLIKEIGLISSNGVPSFLSSVSPYIMESTLNVIDLKKDSSTPEDGTLLS 1901 Query: 1220 --PKSQQKLSDDLAKTASVVN---DKSSG--IEKGKGKVSEVECM--KSRVLCERGRLXX 1068 P + + ++ +S ++ K +G I K K C+ KS+ +C R Sbjct: 1902 EWPSLENIILENGCHQSSSIDSSIQKPAGNEISAPKTKRLAAGCLEPKSKKICMDNRF-- 1959 Query: 1067 XXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRR 888 R +I +SS RP VG +V+R LK+++LD+DAALP+ AL+ S+ H++RR Sbjct: 1960 ----SEFGIGRCFVIPQSSQRPLVGKILQVVRGLKMNLLDMDAALPDEALKPSKLHIERR 2015 Query: 887 CAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFT 708 AWR FVK A T+YEM+ A I LED IRT+YLKN WWYWSS SAA+ + T+S+LALRIF+ Sbjct: 2016 WAWRAFVKSAGTIYEMVQATIALEDMIRTEYLKNEWWYWSSLSAAAKISTVSSLALRIFS 2075 Query: 707 LDSAILYEKSSTDDTAGTSIPVSQPDKEASSSAIPTTE 594 LD+AI+YEK S + + + + E + TE Sbjct: 2076 LDAAIIYEKISPNQDSNDYLDTTSSIPEQKLGGVDLTE 2113 >ref|XP_004167238.1| PREDICTED: LOW QUALITY PROTEIN: methyl-CpG-binding domain-containing protein 9-like [Cucumis sativus] Length = 1277 Score = 471 bits (1211), Expect = e-129 Identities = 317/933 (33%), Positives = 477/933 (51%), Gaps = 77/933 (8%) Frame = -3 Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982 E++ E D L +N+ +PKAPW+EG+CKVCG+DKDD +VLLCDTCD+EYHTYCLNPPL Sbjct: 338 ETKVEVDGFLVSLNE--IPKAPWDEGVCKVCGIDKDDDSVLLCDTCDAEYHTYCLNPPLA 395 Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQ-VVNQCGKRRYQRKLIDETLEGLAQLANKMELK 2805 ++P+GNWYCPSCV G + ED + T+ ++N ++++ ++ + L LA LA +E K Sbjct: 396 RIPEGNWYCPSCVMGTRMVEDPSEHTKHIINLHKGKKFRGEVTRDFLNKLANLAAALEEK 455 
Query: 2804 EYWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEE 2625 EYWEF+V+ER+ LLK+ DE ++SA IR H++QC +L Q+LRS E K LK +EE Sbjct: 456 EYWEFSVDERLFLLKYLCDELLSSALIRQHLEQCVEALAELQQKLRSCFIEWKNLKCREE 515 Query: 2624 SLAANVAKM---------KGNVHTGGGELASVLADESQLPVDNKVSSFSGGSVPMDGGPH 2472 +AA AK+ +G G L + S ++NK + + M Sbjct: 516 VVAARAAKLDTTMLSAVREGQGSCDGARLGASDQYSSLTSLENKCHNHASFQEQMSSAHD 575 Query: 2471 TKDQVSILRSDVNLHPKLGDTXXXXXXXXXXXQMLG--YGLSNTVSKNITVHASSFP-GH 2301 D + N+ G + G + + N+ S P G Sbjct: 576 VTDNND---AGGNVLSSSGSQNSGKPVKFNEPSLSGLPQEVDGSDQSNMETEISILPSGK 632 Query: 2300 QYSNQPNANSL------------LDYNAELSKLQQEISGLQESISTAESELLKVSLRKEF 2157 QY +AN + Y++EL ++++I +Q+SI++ E ELLK+S+R+EF Sbjct: 633 QYFTPCDANGVPVAPQVPPPNESQAYHSELDSIKKDILQVQDSIASTELELLKISVRREF 692 Query: 2156 LGRDSYGRLYWVFGSYDASPWIVANGS---LNPESE------------------------ 2058 LG D+ GRLYW + P I+++GS + ES Sbjct: 693 LGSDAAGRLYWASVMSNGLPQIISSGSSVHIGSESRDRVVKGRFFKNYTSTSNANSSTLN 752 Query: 2057 ---FGLDNHFPK----SSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQSNK--- 1908 + H PK +S + Y T A+I EL+ WL D D +ER+LKESILQW K Sbjct: 753 SNMYSSLLHLPKDFIGNSPCISYQTEADILELIDWLKDSDPKERELKESILQWLKPKLQT 812 Query: 1907 -SLNSNYQRNDFLKGKPSSSVISSEQRVPNHNCRVLKGVRALEKKFGLGMD-MGADYINK 1734 S ++N + LK SSS + +++ V + LE K+G ++ + D +N+ Sbjct: 813 SSRSNNQSPEEQLKDSSSSSDV---EKLECSGFLVNRASALLESKYGPFLEFVTPDDLNR 869 Query: 1733 NLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGEKCRAAATGAAGS 1554 L+ + +M RCVC+E +WPSR+HC SCH++FS EL++H ++ + Sbjct: 870 WLD-KARLAEDEKMFRCVCMEPVWPSRYHCLSCHKSFSTDVELEEHDNGQCSSLPASCDG 928 Query: 1553 LKH--DKTVMNQGIRICPGSSNIPQSVLNEKHDTQSN----CVK-XXXXXXXXXXXXEIM 1395 +K D + I+ V+ E N +K I Sbjct: 929 IKEVGDSSKSKCNIKFESKQEESSSMVIAETSRGYFNHSMGLIKYQNDGMMCPYDFELIC 988 Query: 1394 ANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSMEDQTSVPK 1215 + F S ++ IK IGL +NGV FL + SP + L ++ + + ED T + + Sbjct: 989 SKFLTKDSNKDLIKEIGLISSNGVPSFLSSVSPYIMESTLNVIDLKKDSSTPEDGTLLSE 1048 Query: 1214 SQQKLSDDLAKTASVVNDKSSGIEKGKGKVSEVECMKSRVLC------ERGRLXXXXXXX 
1053 + L + S I+K G +E+ K++ L + + Sbjct: 1049 WPSLENIILENGCHQSSSIDSSIQKPAG--NEISAPKTKRLAAGCLEPKSKKSXMDNRFS 1106 Query: 1052 XXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRRCAWRT 873 R +I +SS RP VG +V+R LK+++LD+DAALP+ AL+ S+ H++RR AWR Sbjct: 1107 EFGIGRCFVIPQSSQRPLVGKILQVVRGLKMNLLDMDAALPDEALKPSKLHIERRWAWRA 1166 Query: 872 FVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFTLDSAI 693 FVK A T+YEM+ A I LED IRT+YLKN WWYWSS SAA+ + T+S+LALRIF+LD+AI Sbjct: 1167 FVKSAGTIYEMVQATIALEDMIRTEYLKNEWWYWSSLSAAAKISTVSSLALRIFSLDAAI 1226 Query: 692 LYEKSSTDDTAGTSIPVSQPDKEASSSAIPTTE 594 +YEK S + + + + E + TE Sbjct: 1227 IYEKISPNQDSNDYLDTTSSIPEQKLGGVDLTE 1259 >gb|EOY02356.1| Methyl-CpG-binding domain-containing protein 9, putative isoform 1 [Theobroma cacao] gi|508710460|gb|EOY02357.1| Methyl-CpG-binding domain-containing protein 9, putative isoform 1 [Theobroma cacao] Length = 2225 Score = 470 bits (1209), Expect = e-129 Identities = 329/944 (34%), Positives = 486/944 (51%), Gaps = 105/944 (11%) Frame = -3 Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982 E++KE +++LA + S +PKAPW+EG+CKVCG+DKDD +VLLCDTCD+EYHTYCLNPPL Sbjct: 1265 ETKKEINDLLA--STSEIPKAPWDEGVCKVCGIDKDDDSVLLCDTCDAEYHTYCLNPPLA 1322 Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802 ++P+GNWYCPSCV + +DA+ +QV+ + ++YQ ++ LE LA L +E KE Sbjct: 1323 RIPEGNWYCPSCVLSKRMVQDASEHSQVIIRRRDKKYQGEVTRGYLEALAHLGAVLEEKE 1382 Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEES 2622 YW+F+++ERI LLKF DE +NSA IR H++QCA +LHQ+LRS E K LK +E+ Sbjct: 1383 YWQFSIDERIFLLKFLCDELLNSALIRQHLEQCAETS-ELHQKLRSAYVEWKNLKSREDF 1441 Query: 2621 LAANVAKMKGNVHTGGGELASVLADESQLPVD-----------NKVSSFS---------G 2502 +AA AK+ ++ G++ V + LP D NK +S + G Sbjct: 1442 VAAKAAKIDTSMSNAVGDVG-VKDGDDWLPSDGGKEGADLNGSNKYASATYTEKNFTANG 1500 Query: 2501 GSV-PMDGGPHTK--------DQVSILRSDVNLHP-----------KLGDTXXXXXXXXX 2382 ++ PMD K +VS +SD + P ++ ++ Sbjct: 1501 
QTLNPMDTEAQLKGDQAIVDASKVSSQKSDKSFRPSELLVPNHLSQEIENSSKETSFQGK 1560 Query: 2381 XXQMLGYGLSNTVSKNITVHASSFPGHQYSNQPNA---NSLLDYNAELSKLQQEISGLQE 2211 + G +++ S + FP + Q + N ++ EL+ ++ +I LQ+ Sbjct: 1561 LEESKGMDVASPPSPSDC--NGQFPPSDAAKQVPSVTENESQSHHLELNTIKNDIQRLQD 1618 Query: 2210 SISTAESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGSLNPESE--------- 2058 I++ ES+LLK+S+RKEFLG DS GRLYW+ P ++ +GSL + + Sbjct: 1619 LITSLESQLLKLSVRKEFLGSDSAGRLYWISAMPGGYPQVIVDGSLVLQKKRKFLGYEER 1678 Query: 2057 -----------FGLDNHFPKSSS-------------------WMYYDTVAEIGELVKWLD 1968 G DN S W+ Y T AEI L+ WL+ Sbjct: 1679 VQNTFIWNSASAGTDNGMKAEGSKASCPFLYNSKDAISVGSPWVTYQTEAEIEGLIDWLN 1738 Query: 1967 DCDIRERDLKESILQWQSNKSLNSNYQR---NDFLKGKPSSSVISSEQRVPNHNCRVLKG 1797 D + +E++LKE+ILQ K +YQ+ D + + + S+ S + + K Sbjct: 1739 DNEPKEKELKEAILQ----KLKFQDYQKMKNQDQDECQTAFSMSSGSDKGSFSSFLGTKA 1794 Query: 1796 VRALEKKFGLGMDMGADYINKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSA 1617 LEKK+G K + +M RC CLE IWPSR+HC SCH+TF + Sbjct: 1795 AMLLEKKYGPCFKSEITDSLKKRGKKARVINGDKMYRCKCLEPIWPSRNHCISCHKTFFS 1854 Query: 1616 SEELDQHGE-KCRAAA------TGAAGSLKHDKTVMNQGIRI--CPGSSNIPQSVLNEKH 1464 E + H + KC + T SLK K MN I C I ++ + Sbjct: 1855 DVEFEDHNDGKCNLGSPLNEKSTSVGDSLK-GKGNMNIDINRVDCTVDMEIVETSKSGHS 1913 Query: 1463 DTQSNCVKXXXXXXXXXXXXE-IMANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFN 1287 + S +K E I F S EE ++ IGL G+NGV F+ + S + Sbjct: 1914 ELSSRLIKFQNEGLVCPYNFEEISTKFVTRDSNEELVREIGLIGSNGVPSFVSSVSHFVS 1973 Query: 1286 DPALTLVPPSGNEVSMEDQTSVPKSQQKLSDDLAKTASVVNDKSSGIEKGKGKVSEVECM 1107 D L V P + D+ + S A+ +N++ S + SE+E Sbjct: 1974 DSTLMTVRPHQERGDLGDKLKATE-MPGFSQGNRSVANGINERLSDNSFRRSVASEIEVQ 2032 Query: 1106 KS-----RVLCERGRLXXXXXXXXXS-TARRSIIRESSLRPKVGYATEVLRLLKISMLDI 945 ++ R L +R R+ R ++ +SSLRP VG +++ R LKI++LD+ Sbjct: 2033 RTIRPALRCLEQRDRISSADKYSPELGIGRCCVVPQSSLRPLVGKVSQISRQLKINLLDM 2092 Query: 944 DAALPEAALRASRSHLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSS 765 DAAL E ALR S++ ++RR AWR+FVK A T+YEM+ A I+LED I+T+YL+N WWYWSS Sbjct: 2093 
DAALSEEALRPSKACMERRWAWRSFVKSAETIYEMVQATIVLEDMIKTEYLRNEWWYWSS 2152 Query: 764 PSAASNMPTLSALALRIFTLDSAILYEKS----STDDTAGTSIP 645 SAA + T+S+LALRI++LDSAI+YEKS S D+ +SIP Sbjct: 2153 LSAAVKISTVSSLALRIYSLDSAIIYEKSFEFHSIDNLKPSSIP 2196 >ref|XP_002884279.1| methyl-CpG-binding domain 9 [Arabidopsis lyrata subsp. lyrata] gi|297330119|gb|EFH60538.1| methyl-CpG-binding domain 9 [Arabidopsis lyrata subsp. lyrata] Length = 2183 Score = 468 bits (1204), Expect = e-129 Identities = 317/944 (33%), Positives = 474/944 (50%), Gaps = 84/944 (8%) Frame = -3 Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982 E +KE +++ VN LPKAPW+EG+CKVCG+DKDD +VLLCDTCD+EYHTYCLNPPL Sbjct: 1265 EMKKEIKDIVVSVN--KLPKAPWDEGVCKVCGVDKDDDSVLLCDTCDAEYHTYCLNPPLI 1322 Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802 ++P+GNWYCPSCV ++++A ++V + R+YQ +L ++E A LA+ ME K+ Sbjct: 1323 RIPEGNWYCPSCVIAKRMAQEALESYKLVRRRKGRKYQGQLTRTSMEMTAHLADVMEEKD 1382 Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEES 2622 YWEF+ EERILLLK DE ++S+ + H++QCA +++ Q+LRSL+SE K K+++E Sbjct: 1383 YWEFSAEERILLLKLLCDELLSSSLVHQHLEQCAEAIIEMQQKLRSLSSEWKNAKMRQEF 1442 Query: 2621 LAANVAKMK-------GNVHTGG--------------GELASVLADESQLPVDNKVSSFS 2505 L A +AK++ G H G G V D+S NK + Sbjct: 1443 LTAKLAKVEPSILKEVGEPHNSGHFADQMGCDQRPQEGVGDGVTHDDSSTAYLNK----N 1498 Query: 2504 GGSVPM--DGGPHTKDQVSILRSDVNLHPK----------------LGDTXXXXXXXXXX 2379 G P+ D P S VN K + DT Sbjct: 1499 KGKAPLETDSQPGEFQDSQPGESHVNFESKISSPETISSPGRHEKPIADTSPHVTDNPSF 1558 Query: 2378 XQMLGYGLSNTVSKNITVH-----ASSFPGHQYSNQPNANSLLDYNAELSKLQQEISGLQ 2214 + L +V +N H A P ++ + L +L+ EI LQ Sbjct: 1559 EKYTSETLHKSVGRNHETHSLNSNAVEIPTAHDASSQASQELQACLQDLNATSHEIHNLQ 1618 Query: 2213 ESISTAESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGSLNPESE-------- 2058 +SI + ES+LLK S+R++FLG D+ GRLYW D +P I+ +GS++ + Sbjct: 1619 QSIRSIESQLLKQSIRRDFLGNDASGRLYWGCCFPDENPRILVDGSISLQKPVQADLMGS 1678 Query: 2057 -------FGLDNHFPKSSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQSNKSLN 1899 
+D+ + S W YY+T EI ELV+WL D D++ERDL+ESIL W+ Sbjct: 1679 KVPSPFLHAVDHGRLRLSPWTYYETETEISELVQWLHDDDLKERDLRESILCWK------ 1732 Query: 1898 SNYQRNDFLKGKPSSSVISSEQRVPNHNCRVLKGVRALEKKFGLGMDMGADYINKNLEHN 1719 + D K K + +S+ K ++EKK+G + + + + K + Sbjct: 1733 -RLRFGDVQKEKKQAQNLSAPILARGLE---TKAAMSMEKKYGPCIKLETETLKKRGKKT 1788 Query: 1718 HEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGE-KC------RAAATGAA 1560 +++ + ++CRC CLE I PS HC CH+TF++ +E ++H E KC + + Sbjct: 1789 -KVSQREKLCRCECLESILPSMIHCLICHKTFASDDEFEEHTESKCIPYSLATEESKEIS 1847 Query: 1559 GSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSNCVKXXXXXXXXXXXXEIMANFKV 1380 S K +++ + + + + + N EI + F Sbjct: 1848 DSSKAKESLKSDYLNVKSSAGKAVGEISNVSELDSGLIRYQEEESISPYHFEEICSKFVT 1907 Query: 1379 DYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPP-------SGNEV---SMEDQ 1230 S + +K IGL G+NG+ FLP SS ND L P SG++V E Sbjct: 1908 KDSNRDLVKEIGLIGSNGIPTFLPASSTHHNDSVLINANPNKLDGGDSGDQVIFAGPETN 1967 Query: 1229 TSVPKSQQKLSDDLAKT------ASVVNDKSSGIEKGKGKVSEVECMKSRVLCERGRLXX 1068 S+ LS D + T + + G + K K S +KS C Sbjct: 1968 VEGLNSESNLSFDGSVTDNHGGPLNKLTGLGFGFSEQKNKKSSGSGLKS---C------- 2017 Query: 1067 XXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRR 888 ++ +++L+ G A V R LK ++LD+D ALPE ALR S+SH DRR Sbjct: 2018 ------------CVVPQAALKRITGKALPVFRFLKTNLLDMDVALPEEALRPSKSHPDRR 2065 Query: 887 CAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFT 708 AWR FVK A ++YE++ A ++ED I+T+YLKN WWYWSS SAA+ + TLSAL++RIF+ Sbjct: 2066 RAWRVFVKSAQSIYELVQATFVVEDMIKTEYLKNEWWYWSSLSAAAKISTLSALSVRIFS 2125 Query: 707 LDSAILYEKSST--DDTAGTSIPVSQPDKEASSSAIPTTEMKSA 582 LD+AI+Y+K T D T +S PD++ S + ++ KS+ Sbjct: 2126 LDAAIIYDKPITPSDHNDETKPIISSPDQK--SQPVSDSQEKSS 2167 >ref|XP_006408507.1| hypothetical protein EUTSA_v10019872mg [Eutrema salsugineum] gi|557109653|gb|ESQ49960.1| hypothetical protein EUTSA_v10019872mg [Eutrema salsugineum] Length = 2173 Score = 467 bits (1201), Expect = e-128 Identities = 312/941 (33%), Positives = 478/941 (50%), Gaps = 65/941 (6%) Frame = -3 Query: 3164 
TESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPL 2985 TE +KE +++ +N LPKAPW+EG+CKVCG+DKDD +VLLCDTCD+EYHTYCLNPPL Sbjct: 1262 TEVKKEIKDIVVSIN--KLPKAPWDEGVCKVCGVDKDDDSVLLCDTCDAEYHTYCLNPPL 1319 Query: 2984 GKVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELK 2805 ++PDGNWYCPSCV +++DA ++V + R+YQ +L ++E A LA+ ME K Sbjct: 1320 IRIPDGNWYCPSCVIAKRIAQDALESYKLVRRRKGRKYQGELTRASMETTAHLADVMEEK 1379 Query: 2804 EYWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEE 2625 +YWEF+ EERILLLK DE ++S+ + H++QCA +++ Q+LRSL+SE K K+++E Sbjct: 1380 DYWEFSTEERILLLKLLCDELLSSSLVHQHLEQCAEAIIEMQQKLRSLSSEWKNTKMRQE 1439 Query: 2624 SLAANVAKMKGNVHTGGGELASVLADESQLPVDNKVSSFSGGSVPMDGGPHTKDQVSILR 2445 L A +AK++ ++ GE + + Q+ + + G V D + L Sbjct: 1440 FLTAKLAKVEPSILKELGEPQNSSSFAEQIRCNQQQQEGVGDRVTHD---DDTSSAAFLN 1496 Query: 2444 SDVNLHPKLGDTXXXXXXXXXXXQMLGY-----------------------GLSNTVSKN 2334 + P + D + + LS + Sbjct: 1497 KNQRTTPLMTDAQTEELHVISGERKISTPENVTSPGRPELPIADASPHGTDNLSCEKDSS 1556 Query: 2333 ITVHASSFPGHQY----SNQPNANSLLDYNA-----------ELSKLQQEISGLQESIST 2199 T+H S H+ SN + + D ++ EL+ EI LQ+SI + Sbjct: 1557 DTLHKSVGGNHEIHTLKSNAVESQTAHDASSMASQELQASQQELNATSNEIQNLQQSIRS 1616 Query: 2198 AESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGSLNPESE------------- 2058 ES+LL+ S+R++FLG D+ GRLYW + P I+ +GS++ + Sbjct: 1617 IESQLLRQSIRRDFLGSDASGRLYWGCCFPEEHPRILVDGSISLQKSVQVNLTGSKVLSP 1676 Query: 2057 --FGLDNHFPKSSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQSNKSLNSNYQR 1884 +D+ S W YY+T AEI ELV+WL D D +ER+L+ESIL W K L + Sbjct: 1677 FLHAVDHGRLLVSPWTYYETEAEISELVQWLHDDDPKERELRESILCW---KRLRFGDLQ 1733 Query: 1883 NDFLKGKPSSSVISSEQRVPNHNCRVLKGVRALEKKFGLGMDMGADYINKNLEHNHEMTY 1704 + + SS IS+ V K ++EK++G + + + + K + ++ Sbjct: 1734 RGMKQAQNSSCPISA-------GSLVTKAAMSMEKRYGPCIKLETETLKKRGKKT-KVAE 1785 Query: 1703 QGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGE-KC-RAAATGAAGSLKHDKTVM 1530 + ++CRC CLE I PS HC CH+TF++ +E ++H E KC + G D + Sbjct: 1786 REKLCRCECLEPILPSMIHCLICHKTFASDDEFEEHTESKCIPYSLASEEGKEISDSSKA 1845 Query: 1529 
NQGIR---ICPGSSNIPQSVLNEKHDTQSNCVK-XXXXXXXXXXXXEIMANFKVDYSIEE 1362 G++ + ++ + ++ + S ++ EI + F S + Sbjct: 1846 KDGLKSDYLNVYNAGKDVAEMSNVSELDSGLIRYQEEESISPYHFEEICSKFVTRDSNRD 1905 Query: 1361 DIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVP----PSGNEVSMEDQTSVPKSQQKLSD 1194 +K IGL G+NG FLP SS ND L G+ V T + + L+ Sbjct: 1906 LVKEIGLIGSNGTPTFLP-SSTFLNDSMLISATCNKLDGGDSVDQVIFTGSEANDEGLNS 1964 Query: 1193 D--LAKTASVVNDKSSGIEKGKGKVSEVECMKSRVLCERGRLXXXXXXXXXSTARRSIIR 1020 + ++ V ND + K G + K++ RG ++ Sbjct: 1965 ESNMSFNRIVTNDLGGPLNKPSGLSFGLSDQKNKKSSGRG------------LEGCCVVP 2012 Query: 1019 ESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRRCAWRTFVKCANTLYEM 840 +SSL+ G A V R LK +MLD+D ALPE ALR S+SH DRR AWR FVK A +++E+ Sbjct: 2013 QSSLKRITGKALSVFRFLKTNMLDMDVALPEEALRPSKSHPDRRRAWRAFVKSAQSIFEL 2072 Query: 839 MLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFTLDSAILYEKSSTDDTA 660 + A I++ED I+T+YLKN WWYWSS SAA+ + TLSAL++R+F+LD+AILYEK Sbjct: 2073 VQAAIVVEDMIKTEYLKNEWWYWSSLSAAAKISTLSALSVRLFSLDAAILYEK------- 2125 Query: 659 GTSIPVSQPDKEASSSAIPTTEMKSAEQPMQKVNDSDSGEN 537 P++Q D + + I + +S QP+ + S N Sbjct: 2126 ----PINQSDPKDETKTISLPDQRS--QPVSDPQERSSRSN 2160 >ref|XP_006296811.1| hypothetical protein CARUB_v10012794mg [Capsella rubella] gi|482565520|gb|EOA29709.1| hypothetical protein CARUB_v10012794mg [Capsella rubella] Length = 2177 Score = 461 bits (1185), Expect = e-126 Identities = 311/920 (33%), Positives = 474/920 (51%), Gaps = 73/920 (7%) Frame = -3 Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982 E +KE +++ +N LPKAPW+EG+CKVCG+DKDD +VLLCDTCD+EYHTYCLNPPL Sbjct: 1268 EMKKEIKDIIVSIN--KLPKAPWDEGVCKVCGVDKDDDSVLLCDTCDAEYHTYCLNPPLI 1325 Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802 ++PDGNWYCPSCV ++++A ++V + R+YQ +L ++E A LA ME K+ Sbjct: 1326 RIPDGNWYCPSCVIAKRMAQEALESYKLVRRRKGRKYQGELTQASMEMTAHLAGVMEEKD 1385 Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEES 2622 YWEF+VEERILLLK DE ++S+ + H++QCA +++ Q+LRSL+SE K K+++E Sbjct: 1386 
YWEFSVEERILLLKVLCDELLSSSLVHQHLEQCAEAIIEMQQKLRSLSSEWKNAKMRQEF 1445 Query: 2621 LAANVAKMKGNVHTGGGELASVLADESQLPVDNKVSSFSGGSVPMDGGPHT--------- 2469 L A +AK++ ++ EL + Q+ D + G V D + Sbjct: 1446 LMAKLAKVEPSILKEASELHNSSHFADQMGCDERTHEGVGDGVTHDDETSSTAFLNKNQG 1505 Query: 2468 KDQVSILRSDVNLHPKLGD---TXXXXXXXXXXXQMLGYGLSNTVSKNI-----TVHASS 2313 K + +LH G + ++L +S + N+ T+H S Sbjct: 1506 KAPLETNSQPGDLHVDSGGNKVSSQKKITSPGRHELLVADISPRATDNLTFEKDTLHKSV 1565 Query: 2312 FPGHQ----YSNQPNANSLLDYNAELSKLQQ-----------EISGLQESISTAESELLK 2178 H+ +SN S+ D +++ S+ Q EI LQ SI + ES+LLK Sbjct: 1566 GRIHETHPLHSNAVELQSVHDASSQASQELQACQQDLNATSNEIQNLQLSIRSVESQLLK 1625 Query: 2177 VSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGSLNPESEF---------------GLDN 2043 S+R++FLG DS GRLYW D +P ++ +GS++ + +D+ Sbjct: 1626 QSIRRDFLGNDSSGRLYWGCCFPDENPRVLVDGSISLQKPVQANLTGSRAPSPFLQAVDH 1685 Query: 2042 HFPKSSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQSNKSLNSNYQRNDFLKGK 1863 S W YY+T +EI ELV+WL D D +ERDL+ESIL W+ + D K K Sbjct: 1686 GRLTLSPWTYYETESEISELVQWLHDDDPKERDLRESILCWK-------RLRFGDVQKEK 1738 Query: 1862 PSSSVISSEQRVPNHNCRVLKGVRALEKKFGLGMDMGADYINKNLEHNHEMTYQGRMCRC 1683 ++ +SS V K ++EK+FG + + + + K + + CRC Sbjct: 1739 ENAENLSSP---IFSRGLVTKAAMSMEKRFGPCIKLETETLKK--RGKKTKVEREKFCRC 1793 Query: 1682 VCLELIWPSRHHCFSCHQTFSASEELDQHGE-KC--RAAATGAAGSL----KHDKTVMNQ 1524 CLE I PS HC CH+TF++ +E + H E KC + AT + K +++ + Sbjct: 1794 ECLEAILPSMIHCLICHKTFASDDEFENHSESKCIPYSLATEEGKEISDFSKAKESLKSD 1853 Query: 1523 GIRICPGSSNIPQSVLNEKHDTQSNCVK-XXXXXXXXXXXXEIMANFKVDYSIEEDIKGI 1347 + + S+ S ++ + S ++ EI + F S + +K I Sbjct: 1854 YLNV-KSSAGKDVSEISNVSELDSGLIRYQEEESISPYHFEEICSKFVTKDSNRDLVKDI 1912 Query: 1346 GLFGTNGVVPFLPNSSPCFNDPALTLVPPS--------------GNEVSMEDQTSV--PK 1215 GL G+NG+ FLP+S ND L S G+E ++E S Sbjct: 1913 GLIGSNGIPTFLPSSYTHLNDSMLISANSSKLDGDDSGDQVVFAGSETNVEGLNSEFNMS 1972 Query: 1214 SQQKLSDDLAKTASVVNDKSSGIEKGKGKVSEVECMKSRVLCERGRLXXXXXXXXXSTAR 1035 + ++ DL S + G + K K S +KS C Sbjct: 1973 FDRSVTHDLGGPPSKPSGLGFGFSEQKIKKSLGSGLKS---C------------------ 2011 Query: 
1034 RSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRRCAWRTFVKCAN 855 ++ ++SL+ G A V R LK ++LD+D ALPE LR S+SH RR AWR FVK + Sbjct: 2012 -CVVPQASLKRITGKALPVFRFLKTNLLDMDVALPEEGLRPSKSHPGRRRAWRLFVKSSQ 2070 Query: 854 TLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFTLDSAILYEK-- 681 ++YE++ A ++LED ++T+YLKN WWYWSS SAA+ + TLSAL++RIF LD+AI+Y+K Sbjct: 2071 SIYELVQATVVLEDMVKTEYLKNEWWYWSSLSAAAKISTLSALSVRIFALDAAIMYDKLL 2130 Query: 680 SSTDDTAGTSIPVSQPDKEA 621 + +D T +S PD+++ Sbjct: 2131 TPSDPIDETKPIISLPDQKS 2150 >ref|XP_006646998.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like [Oryza brachyantha] Length = 1852 Score = 457 bits (1177), Expect = e-125 Identities = 322/926 (34%), Positives = 463/926 (50%), Gaps = 91/926 (9%) Frame = -3 Query: 3167 GTESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPP 2988 G+E +E ++L N SLPKAPWE+G+CKVCG+D+DD +VLLCD CDSEYHTYCLNPP Sbjct: 955 GSEMHEELHDILTASN--SLPKAPWEDGVCKVCGIDRDDDSVLLCDKCDSEYHTYCLNPP 1012 Query: 2987 LGKVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETL----EGLAQLAN 2820 L ++P+GNWYCPSC+ G + G Q V +R Q+K + E E L +L Sbjct: 1013 LARIPEGNWYCPSCMLGQKKAH-LDQGAQDV-----KRQQKKFVGEEAHAFQEELNKLVT 1066 Query: 2819 KMELKEYWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKIL 2640 ME KEYW+ ++ERI LLKF DE +N+A IR+H+DQC+ DL Q+ RS N E K L Sbjct: 1067 AMEEKEYWDLRIQERIYLLKFLCDEMLNTALIREHLDQCSDKLGDLQQKFRSSNFELKDL 1126 Query: 2639 KLKEESLAANVAKMKGNV-----------------------HTGGGELASVLADESQLPV 2529 K KEE ++ + + + H GEL +V + + Sbjct: 1127 KYKEEIRTSHARQSRSSKTEQHFSNISGPVENQQCTPKALDHLEEGELGNVGVN-----L 1181 Query: 2528 DNKVSSFSGGSVPMDGGPHTKDQVSILRSDVNLHPKLGDTXXXXXXXXXXXQ-------- 2373 +N G + + G PH DQ S V H LG + Sbjct: 1182 NNPADGVRDGQLNV-GRPHKSDQDISSTSMVEEHKSLGLSEQPSGMAIDQIDGDAIDEGS 1240 Query: 2372 --------MLGYGLSNTVSKNITVHASSFPGHQYSNQPNANSLLDY-------------- 2259 LG S + N+ +S PG ++ + S D Sbjct: 1241 QTQSCEKRPLGVKSSTCDNLNLRETETSTPGRDLPDENASASFQDNLEASTTKSMEFDAD 1300 Query: 2258 NAELSKLQQEISGLQESISTAESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANG 2079 N 
E+ L +IS LQ+SIS ES++ S R+E LG+DS GRLYWV G PW+VA+G Sbjct: 1301 NNEMDTLSDDISKLQDSISLLESQINMASSRRECLGKDSIGRLYWVIGRPGKHPWLVADG 1360 Query: 2078 SL--NPESEFGLDNHFP---------KSSSWMYYDTVAEIGELVKWLDDCDIRERDLKES 1932 S+ + E + + N +P S+S Y++ EI LV WL D D RE++LK+S Sbjct: 1361 SMLISKERDISMVNSYPLSAFDCRGWNSASIFIYESDEEIQCLVDWLRDYDPREKELKDS 1420 Query: 1931 ILQWQSNKSLNSNYQRNDFLKGKPSSSVISSEQRV--PNHNCRVLKGVRALEKKFGLGMD 1758 ILQWQ + +Q + L P S+ SEQ + P VL LE+K+GL +D Sbjct: 1421 ILQWQRHLC----HQSSSPLIDPPVSNFSKSEQLIDLPRTKASVL-----LEQKYGLQLD 1471 Query: 1757 MGADYINKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGE-KCR 1581 ++K ++ + R RC CLE IWPSR+HC CH+T+ E + H + KC Sbjct: 1472 QDTSDLSKKRGKKVKLGSEERTYRCDCLEPIWPSRNHCLICHETYLVYTEFEGHNDGKCS 1531 Query: 1580 AAATGAAGSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSNCV----KXXXXXXXXX 1413 S ++D++ + +P+S + EK + V Sbjct: 1532 KIHQSPDESKENDESKVK-----------VPKSDMKEKDSLDRSSVIEPSSDRKFMQCPY 1580 Query: 1412 XXXEIMANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSMED 1233 EI F + S +E +K IGL G+NGV F+P S F +PA+ L + + + D Sbjct: 1581 DFEEICRKFITNDSNKETVKQIGLNGSNGVPSFVP-SPAFFLEPAIVL-NQNRKDGELND 1638 Query: 1232 QTSVPK-----SQQKLSDDLAKTASVV--NDKSSGIEKGK--------GKVSEVECMK-S 1101 TS + S QKL +++K+A + N ++K K G+ + K + Sbjct: 1639 WTSCLEECNAMSAQKLGQEVSKSAQICPGNMGDEKVQKSKKPTPDNTSGEEAHSTTGKPT 1698 Query: 1100 RVLCERGRLXXXXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAA 921 RVL G L + ESSLRP +G + +L+ KI++LDI+A LPE A Sbjct: 1699 RVLAVNGGL----------------VPESSLRPVLGRNSHILKQQKINLLDIEATLPEEA 1742 Query: 920 LRASRSHLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMP 741 LRAS+S RR +WR FVK A+++ +M+LA +LE ++ ++LKN+WWYWSS +AA Sbjct: 1743 LRASKSQQIRRRSWRAFVKDADSISQMVLAANLLEGMVKAEFLKNDWWYWSSFTAAMKTS 1802 Query: 740 TLSALALRIFTLDSAILYEKSSTDDT 663 T+S+LALRI+TLD I+Y K ++ Sbjct: 1803 TVSSLALRIYTLDDCIIYSKDQVSNS 1828 >ref|NP_186795.1| methyl-CPG-binding domain 9 [Arabidopsis thaliana] gi|75337201|sp|Q9SGH2.1|MBD9_ARATH RecName: Full=Methyl-CpG-binding domain-containing protein 9; Short=AtMBD9; 
Short=MBD09; AltName: Full=Histone acetyl transferase MBD9; AltName: Full=Methyl-CpG-binding protein MBD9 gi|6692266|gb|AAF24616.1|AC010870_9 unknown protein [Arabidopsis thaliana] gi|332640148|gb|AEE73669.1| methyl-CPG-binding domain 9 [Arabidopsis thaliana] Length = 2176 Score = 457 bits (1175), Expect = e-125 Identities = 312/923 (33%), Positives = 480/923 (52%), Gaps = 76/923 (8%) Frame = -3 Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982 E +KE +++ VN LPKAPW+EG+CKVCG+DKDD +VLLCDTCD+EYHTYCLNPPL Sbjct: 1265 EMKKEIKDIVVSVN--KLPKAPWDEGVCKVCGVDKDDDSVLLCDTCDAEYHTYCLNPPLI 1322 Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802 ++PDGNWYCPSCV ++++A ++V + R+YQ +L ++E A LA+ ME K+ Sbjct: 1323 RIPDGNWYCPSCVIAKRMAQEALESYKLVRRRKGRKYQGELTRASMELTAHLADVMEEKD 1382 Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKILKLKEES 2622 YWEF+ EERILLLK DE ++S+ + H++QCA +++ Q+LRSL+SE K K+++E Sbjct: 1383 YWEFSAEERILLLKLLCDELLSSSLVHQHLEQCAEAIIEMQQKLRSLSSEWKNAKMRQEF 1442 Query: 2621 LAANVAKMKGNVHTGGGELASVLADESQLPVDNKVSSFSGGSVPMDGGPHT-----KDQ- 2460 L A +AK++ ++ GE + Q+ D + G V D + K+Q Sbjct: 1443 LTAKLAKVEPSILKEVGEPHNSSYFADQMGCDPQPQEGVGDGVTRDDETSSTAYLNKNQG 1502 Query: 2459 VSILRSDV---NLHPKLGDTXXXXXXXXXXXQMLGYGLSNT---VSKNI-------TVHA 2319 S L +D H G++ +++T V+ N+ T+ Sbjct: 1503 KSPLETDTQPGESHVNFGESKISSPETISSPGRHELPIADTSPLVTDNLPEKDTSETLLK 1562 Query: 2318 SSFPGHQYSNQPNANS----------------LLDYNAELSKLQQEISGLQESISTAESE 2187 S H+ ++ PN+N+ L +LS EI LQ+SI + ES+ Sbjct: 1563 SVGRNHE-THSPNSNAVELPTAHDASSQASQELQACQQDLSATSNEIQNLQQSIRSIESQ 1621 Query: 2186 LLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGSLNPESE---------------FG 2052 LLK S+R++FLG D+ GRLYW D +P I+ +GS++ + Sbjct: 1622 LLKQSIRRDFLGTDASGRLYWGCCFPDENPRILVDGSISLQKPVQADLIGSKVPSPFLHT 1681 Query: 2051 LDNHFPKSSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQSNKSLNSNYQRNDFL 1872 +D+ + S W YY+T EI ELV+WL D D++ERDL+ESIL W+ + D Sbjct: 1682 VDHGRLRLSPWTYYETETEISELVQWLHDDDLKERDLRESILWWK-------RLRYGDVQ 1734 Query: 
1871 KGKPSSSVISSEQRVPNHNCRVLKGVRALEKKFGLGMDMGADYINKNLEHNHEMTYQGRM 1692 K K + +S+ K ++EK++G + + + + K + ++ + ++ Sbjct: 1735 KEKKQAQNLSAPVFATGLE---TKAAMSMEKRYGPCIKLEMETLKKRGKKT-KVAEREKL 1790 Query: 1691 CRCVCLELIWPSRHHCFSCHQTFSASEELDQHGE-KC------RAAATGAAGSLKHDKTV 1533 CRC CLE I PS HC CH+TF++ +E + H E KC + S K +++ Sbjct: 1791 CRCECLESILPSMIHCLICHKTFASDDEFEDHTESKCIPYSLATEEGKDISDSSKAKESL 1850 Query: 1532 MNQGIRICPGSSNIPQSVLNEKHDTQSNCVKXXXXXXXXXXXXEIMANFKVDYSIEED-I 1356 + + + S+ + ++ + S ++ E + + V D + Sbjct: 1851 KSDYLNV-KSSAGKDVAEISNVSELDSGLIRYQEEESISPYHFEEICSKFVTKDCNRDLV 1909 Query: 1355 KGIGLFGTNGVVPFLPNSSPCFNDPALTLVP-------PSGNEV---SMEDQTSVPKSQQ 1206 K IGL +NG+ FLP+SS ND L SG++V E S+ Sbjct: 1910 KEIGLISSNGIPTFLPSSSTHLNDSVLISAKSNKPDGGDSGDQVIFAGPETNVEGLNSES 1969 Query: 1205 KLSDDLAKTASVVN--DKSSGIEKG----KGKVSEVECMKSRVLCERGRLXXXXXXXXXS 1044 +S D + T S DK SG+ G K K S +KS C Sbjct: 1970 NMSFDRSVTDSHGGPLDKPSGLGFGFSEQKNKKSSGSGLKS---C--------------- 2011 Query: 1043 TARRSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRRCAWRTFVK 864 ++ +++L+ G A R LK ++LD+D ALPE ALR S+SH +RR AWR FVK Sbjct: 2012 ----CVVPQAALKRVTGKALPGFRFLKTNLLDMDVALPEEALRPSKSHPNRRRAWRVFVK 2067 Query: 863 CANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFTLDSAILYE 684 + ++YE++ A I++ED I+T+YLKN WWYWSS SAA+ + TLSAL++RIF+LD+AI+Y+ Sbjct: 2068 SSQSIYELVQATIVVEDMIKTEYLKNEWWYWSSLSAAAKISTLSALSVRIFSLDAAIIYD 2127 Query: 683 KSSTDDTA--GTSIPVSQPDKEA 621 K T T +S PD+++ Sbjct: 2128 KPITPSNPIDETKPIISLPDQKS 2150 >gb|ESW23089.1| hypothetical protein PHAVU_004G017600g [Phaseolus vulgaris] Length = 2204 Score = 448 bits (1152), Expect = e-123 Identities = 311/970 (32%), Positives = 467/970 (48%), Gaps = 99/970 (10%) Frame = -3 Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982 E +KE D+ + + ++ PKAPW+EG+CKVCG+D+DD +VLLCDTCD+EYHTYCLNPPL Sbjct: 1254 EMRKEVDDFIESMKET--PKAPWDEGVCKVCGIDRDDDSVLLCDTCDAEYHTYCLNPPLA 1311 Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802 ++P+GNWYCPSCV G 
++D TQV+ +C +++Q ++ LE L L+ +E KE Sbjct: 1312 RIPEGNWYCPSCVDGKHATQDVTERTQVIGKCRSKKFQGEVNSLFLESLTHLSTVIEEKE 1371 Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLR-------SLNSERKI 2643 YWE ++ ER LLKF DE +NS+ IR H++QC+ + +LHQ+LR +L + I Sbjct: 1372 YWEHSLGERTFLLKFLCDELLNSSMIRQHLEQCSELSAELHQKLRAHSAEWKNLKTREDI 1431 Query: 2642 LKLKEESLAANVAKMKGNVHTGGGELASVLADESQLPVD--NKVSSFSGGSVPMDGGPH- 2472 L K + G V G + ++L + + V V + S V +D P Sbjct: 1432 LSTKAAKIDTFSLNTAGEVGLREG-VTTLLTNTGKCLVQPHTAVDNPSNFGVFVDSLPSE 1490 Query: 2471 --TKDQVSILRSDVNLHPKLGDTXXXXXXXXXXXQMLGYGLSNTVSKNITVHASSFPGHQ 2298 TK++ D ++ D+ S++ SFP Sbjct: 1491 ETTKEKYRFDSVDKSMSVTNSDSDSQNMNSLDVEGQFRNVSGAVESQSTDKSPKSFPSPN 1550 Query: 2297 YSNQPNA---------------------------------------NSLLDYNAELSKLQ 2235 S + N N Y+ EL+ ++ Sbjct: 1551 LSQEINGSGGAAHAQSNHQKCEGRDISTPVTCQQGGVTVDASHTALNESEPYHLELNAIK 1610 Query: 2234 QEISGLQESISTAESELLKVSLRKEFLGRDSYGRLYWVFGS--------YDAS------- 2100 ++IS LQ+SI++ S+LL++S+R+EFLG DS GRLYW DAS Sbjct: 1611 RDISVLQDSITSVVSQLLRLSVRREFLGIDSIGRLYWASTLPGGRSRIVVDASAALLHGR 1670 Query: 2099 --PW---------IVANGSLNPESEFGLDNHFPKSSSWMYYDTVAEIGELVKWLDDCDIR 1953 P+ ++ + SL+ + L N SS W+ Y+T AEI EL+ WLDD D + Sbjct: 1671 GIPFSRDYVEKFSVLQHSSLSEKDSSQLRNALANSSPWIAYETDAEIEELLGWLDDSDPK 1730 Query: 1952 ERDLKESILQWQSNKS---LNSNYQRNDFLKGKPSSSVISSEQRVPNHNCRVLKGVRALE 1782 ER+LK+SI+Q ++ LN+ + +G P S I+ E+ V + V K LE Sbjct: 1731 ERELKDSIMQGPRSRFQEFLNAQTEEQVEDRG-PISMPINREKTVSSS--LVTKATSLLE 1787 Query: 1781 KKFGLGMDMGADYINKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELD 1602 KK+G + + K + + T ++ RC CLE IW R HC CH+T S+ E D Sbjct: 1788 KKYGPFFEWDIEMSRKQNKKSRT-TNDEKLFRCECLEPIWFDRRHCTYCHKTVSSDGEFD 1846 Query: 1601 QHGE-KCRAAATGAAGSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSN-------- 1449 H + KC A A + I C G N+ EK + Sbjct: 1847 GHNDGKCNAGLPVAEKN--------RNKIGSCKGKGNLRCDTSREKFRADAETAGTKVGG 1898 Query: 1448 CVKXXXXXXXXXXXXE--------IMANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPC 1293 C K I + F+ S E +K IGL GT+G+ F+P+ SP Sbjct: 1899 
CSKLSSRLIKFSNEESTCPFNFEDICSKFETSESNRELVKEIGLIGTDGIPSFVPSVSPL 1958 Query: 1292 FNDPALTLVPPSGNEVSMEDQTSVPKSQQKLSDDLAKTASVVNDKSSGIEKGKGKVSEVE 1113 ++ P + + + + + Q +D A D +SGI G+ +E+ Sbjct: 1959 VSEYTRFSTPKDDAIIGVLSKPTETRGSQGNTDG----AGACLDHNSGISTGRLAANEIN 2014 Query: 1112 CMKSRVLCER--GRLXXXXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLDIDA 939 E+ G+ ++ SSL+P VG + +LR LKI++LD+DA Sbjct: 2015 KSNKSSSGEQRDGKFSFCGPASDMGVDGCCVVPLSSLKPLVGKVSHILRQLKINLLDMDA 2074 Query: 938 ALPEAALRASRSHLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPS 759 ALP +ALR S++ +RR AWR FVK A T+YEM+ A LED I+T+YL+N+WWYWSS S Sbjct: 2075 ALPASALRPSKAESERRQAWRAFVKSAETIYEMIQATFTLEDMIKTEYLRNDWWYWSSFS 2134 Query: 758 AASNMPTLSALALRIFTLDSAILYEKSSTDDTAGTSIPVSQPDKEASSSAIPTTEMKSAE 579 AA+ TL +LALR+++LD AI+YEK+ +S P + + + T + K Sbjct: 2135 AAAKTSTLPSLALRLYSLDLAIIYEKTPNSTFTDSSEPSGTAETRPPMN-VDTEKSKGNR 2193 Query: 578 QPMQKVNDSD 549 + +K +SD Sbjct: 2194 KSNRKRKESD 2203 >ref|XP_006594288.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like [Glycine max] Length = 2202 Score = 443 bits (1140), Expect = e-121 Identities = 320/977 (32%), Positives = 482/977 (49%), Gaps = 105/977 (10%) Frame = -3 Query: 3161 ESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPPLG 2982 E +KE + + N+ +PKAPW+EG+CKVCG+D+DD +VLLCDTCD+EYHTYCLNPPL Sbjct: 1249 EMRKEVGDFIESTNE--IPKAPWDEGVCKVCGIDRDDDSVLLCDTCDAEYHTYCLNPPLA 1306 Query: 2981 KVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETLEGLAQLANKMELKE 2802 ++P+GNWYCPSCV G +++ TQV+ + +++Q ++ LE LA L+ +E KE Sbjct: 1307 RIPEGNWYCPSCVVGKHATQNVTERTQVIGKRQSKKFQGEVNSLYLESLAHLSAAIEEKE 1366 Query: 2801 YWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLR-------SLNSERKI 2643 YWE++V ER LLKF DE +NS+ I H++QCA + +LHQ+LR SL + I Sbjct: 1367 YWEYSVGERTFLLKFLCDELLNSSLIHQHLEQCAELSAELHQKLRAHSAEWKSLKTREDI 1426 Query: 2642 LKLKEESLAANVAKMKGNVHTGGGELASVLADESQLPVD--NKVSSFSGGSVPMDGGPH- 2472 L K + G V G AS+L++ + V V + S V +D P Sbjct: 1427 LSTKAAKIDTFSLNTAGEVGLKEG-FASLLSNTGKCLVQPHTAVDNPSNFGVFVDSLPSE 1485 Query: 2471 
--TKDQ---------VSILRSD--------VNLHPKLGDTXXXXXXXXXXXQMLGYGLSN 2349 TKD+ +S+ SD +++ + + + L N Sbjct: 1486 EVTKDKYRFDSVDKSISVTNSDSDSQNMNSIDVEGQFRNVSGAVESQCTDKSPKSFPLPN 1545 Query: 2348 TV--------------SKNITVHASSFP---GHQYSN-----QPNANSLLDYNAELSKLQ 2235 + KN P +Q Q + N Y+ EL ++ Sbjct: 1546 HMPQETNGAGGASLVQGKNQKCEGKDIPTPVSYQQGMPVDVPQISVNESEPYHLELIAIK 1605 Query: 2234 QEISGLQESISTAESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGSLNPESEF 2055 ++IS LQ+SI++ S+LLK+S+R+E LG DS GRLYW IV + S Sbjct: 1606 RDISLLQDSITSVASQLLKLSVRRECLGIDSIGRLYWASALPGGRSRIVVDASAALLHGR 1665 Query: 2054 GL-----------------------------DNHFPKSSSWMYYDTVAEIGELVKWLDDC 1962 G+ N SS W+ Y+T EI EL+ WLDD Sbjct: 1666 GMTFSRDYVEKFSVLQHCALSDKDSSLMSQPSNPLGNSSPWIAYETDVEIEELLGWLDDS 1725 Query: 1961 DIRERDLKESILQWQSNKSLNS-NYQRNDFLKGKPSSSVISSEQRVPNHNCRVLKGVRAL 1785 D +ER+LK+SI+ ++ N Q D K + + S+ + ++ + N V K L Sbjct: 1726 DPKERELKDSIMLGPKSRFQQFINAQTEDRAKDQGNVSMPRNREKTVS-NSLVTKATSLL 1784 Query: 1784 EKKFGLGMDMGADYINKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEEL 1605 EKKFG ++ + K T ++ RC CLE I PSR HC CH+T ++ E Sbjct: 1785 EKKFGPFVEWDNSEVLKKQNRKTRTTNDEKLYRCECLEPILPSRKHCTHCHKTVASDIEF 1844 Query: 1604 DQHGE-KCRAA------------ATGAAGSLKHD------KTVMNQGIRICPGSSNIPQS 1482 D H + KC A ++ G+LK D + + GSS + Sbjct: 1845 DGHNDGKCNAGLLAIEKNKDKNGSSKGRGNLKCDTLHEKFRADAETALTSVSGSSKLSSR 1904 Query: 1481 VLNEKHDTQSNCVKXXXXXXXXXXXXEIMANFKVDYSIEEDIKGIGLFGTNGVVPFLPNS 1302 ++ ++ +S C +I + F + S +E + IGL G++G+ F+P+ Sbjct: 1905 LIKFSNE-ESTC---------PFNFEDICSKFVTNDSNKELVSEIGLIGSDGIPSFVPSV 1954 Query: 1301 SPCFNDPALTLVPPSGNEVSMEDQTSVPKSQQKLSDDLAKTASVVNDKSSGIEKGKGKVS 1122 SP ++ L+ + + S+ S+ S+ ++S A D SGI GK + Sbjct: 1955 SPFVSEYTLS----AQKDESIVGGVSIV-SESRVSQGNTDGAGTCLDHKSGISTGKLAAN 2009 Query: 1121 EVECMKSRVLCER--GRLXXXXXXXXXSTARRSIIRESSLRPKVGYATEVLRLLKISMLD 948 E L E+ G+ ++ SLRP VG A+ +LR LKI++LD Sbjct: 2010 ESNKSNKSSLREQRDGKFSFCSPASVMGADGCCVVPSPSLRPLVGKASHILRQLKINLLD 2069 Query: 947 IDAALPEAALRASRSHLDRRCAWRTFVKCANTLYEMMLALIILEDTIRTDYLKNNWWYWS 768 +DAAL ALR S++ 
DRR AWRTFVK A T+YEM+ A LED I+T+YL+N+WWYWS Sbjct: 2070 MDAALLAIALRPSKAVPDRRQAWRTFVKSAKTIYEMIQATFTLEDMIKTEYLRNDWWYWS 2129 Query: 767 SPSAASNMPTLSALALRIFTLDSAILYEK---SSTDDTAGTSIPVSQPDKEASSSAIPTT 597 S SAA+ TL +LALRI++LD AI+YEK SS D++ S+ +++P + + T Sbjct: 2130 SFSAAAKSSTLPSLALRIYSLDLAIIYEKMPNSSFTDSSEPSV-IAEPKPLMN---VDTE 2185 Query: 596 EMKSAEQPMQKVNDSDS 546 + K++ + +K +SDS Sbjct: 2186 KSKASRKSTRKRKESDS 2202 >ref|NP_001046163.1| Os02g0192400 [Oryza sativa Japonica Group] gi|46389826|dbj|BAD15389.1| PHD finger-like protein [Oryza sativa Japonica Group] gi|50726413|dbj|BAD34024.1| PHD finger-like protein [Oryza sativa Japonica Group] gi|113535694|dbj|BAF08077.1| Os02g0192400 [Oryza sativa Japonica Group] Length = 929 Score = 434 bits (1117), Expect = e-119 Identities = 308/903 (34%), Positives = 444/903 (49%), Gaps = 74/903 (8%) Frame = -3 Query: 3167 GTESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPP 2988 G+E +E ++L N SLPKAPWE+G+CKVCG+D+DD +VLLCD CDSEYHTYCLNPP Sbjct: 34 GSEMHEELHDILTAAN--SLPKAPWEDGVCKVCGIDRDDDSVLLCDKCDSEYHTYCLNPP 91 Query: 2987 LGKVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETL----EGLAQLAN 2820 L ++P+GNWYCPSC+ G + + G Q V +R Q+K + E E L +LA Sbjct: 92 LARIPEGNWYCPSCMLGQTKAHH-DQGVQDV-----KRQQKKFVGEEAHAFQEELNKLAT 145 Query: 2819 KMELKEYWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKIL 2640 ME KEYW+ ++ERI LLKF DE +N+A IR+H+DQC+ DL Q+ R+ N E K L Sbjct: 146 AMEEKEYWDLNMQERIYLLKFLCDEMLNTALIREHLDQCSDKLGDLQQKFRASNFELKDL 205 Query: 2639 KLKEE---SLAANVAKMKGNVHTGGGELASVLADESQLPVDNKVSSFSGGSV------PM 2487 K KEE S A K H + + + G+V P Sbjct: 206 KYKEEMRTSYARQSRSSKTEQHFNNSSGPVENQQQCTPTALDHLEEAEQGNVGVNLNNPA 265 Query: 2486 DGGP-----------HTKDQVSILRSDVNLHPKLGDTXXXXXXXXXXXQMLGYGLSNTVS 2340 DG P KD S + L + + G + Sbjct: 266 DGVPDGQLNVGKPYKSDKDISSASMVEERKSSGLSEQPSGMAIDQIDGDAIDEGSQSCEK 325 Query: 2339 KNITVHAS------------SFPGHQYSNQPNANSLLDYNAELSK--------------- 2241 +++ +S S PG + ++ + S D N E S Sbjct: 326 RSLGAKSSTCDNLNLKDTEFSTPGRELPDERASTSFQD-NLEASSTKSIELDADNNEMDT 
384 Query: 2240 LQQEISGLQESISTAESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGS-LNP- 2067 L +IS LQ+SIS ES++ S R+E LG+DS GRLYWV G PW+VA+GS L P Sbjct: 385 LSDDISKLQDSISLLESQINMASSRRECLGKDSIGRLYWVIGRPGKRPWLVADGSMLKPK 444 Query: 2066 ESEFGLDNHFP---------KSSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQS 1914 E + + N +P S+S Y++ EI L+ WL D D RE++LK+SILQWQ Sbjct: 445 ERDISMVNSYPPSAFDCKGWNSASIFIYESDEEIQCLLDWLRDYDPREKELKDSILQWQR 504 Query: 1913 NKSLNSNYQRNDFLKGKPSSSVISSEQ--RVPNHNCRVLKGVRALEKKFGLGMDMGADYI 1740 + S+ D P S EQ +PN V+ LE+K+GL +D + Sbjct: 505 HFCHQSSSPLVD-----PPISGPKGEQLMELPNTKAAVI-----LEQKYGLQLDQDTSDL 554 Query: 1739 NKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGE-KCRAAATGA 1563 K +++ + R RC CLE +WPSR+HC +CH+T+ S E + H + KC Sbjct: 555 PKKRGKKIKLSSEDRTYRCDCLEPVWPSRYHCLTCHETYLISTEFEGHNDGKCSKIHQSP 614 Query: 1562 AGSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSNCV----KXXXXXXXXXXXXEIM 1395 S ++D+ + + +S EK + + V EI Sbjct: 615 DESRENDEPKV-----------KVTKSDTKEKDSLECSSVIEPSSDRKLMQCPYDFEEIC 663 Query: 1394 ANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSMEDQTSVPK 1215 F + S +E +K IGL G+NGV F+P S F +PA+ + + + ++D TS + Sbjct: 664 RKFVTNDSNKETVKQIGLNGSNGVPSFVP-SPAFFLEPAI-VQSQNRKDDELKDWTSSLE 721 Query: 1214 SQQKLSDDLAKTASVVNDKSSGIEKGKGKVSEVECMKSRVLCERGRLXXXXXXXXXSTAR 1035 +S +V + S + G V + + KS+ R Sbjct: 722 ECNAMS-----AQKLVQEVSKSGQSCPGNVGDEKVQKSKKPTPDNTSGEEAHSTTGKPTR 776 Query: 1034 -----RSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRRCAWRTF 870 ++ ESSLRP +G + +L+ KI++LDI+AALPE ALRAS+ RR +WR F Sbjct: 777 LLAVNGGLVPESSLRPLIGRNSHILKQQKINLLDIEAALPEEALRASKCQQIRRRSWRAF 836 Query: 869 VKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFTLDSAIL 690 VK A ++ +M+LA +LE I+ ++LKN+WWYWSS +AA T+S+LALR++TLD I+ Sbjct: 837 VKDAESISQMVLAANLLEGMIKAEFLKNDWWYWSSFTAAMKTSTVSSLALRVYTLDDCII 896 Query: 689 YEK 681 Y K Sbjct: 897 YSK 899 >gb|EEE56485.1| hypothetical protein OsJ_05715 [Oryza sativa Japonica Group] Length = 1949 Score = 434 bits (1117), Expect = e-119 Identities = 308/903 (34%), Positives = 444/903 (49%), 
Gaps = 74/903 (8%) Frame = -3 Query: 3167 GTESQKERDEVLALVNDSSLPKAPWEEGICKVCGMDKDDVNVLLCDTCDSEYHTYCLNPP 2988 G+E +E ++L N SLPKAPWE+G+CKVCG+D+DD +VLLCD CDSEYHTYCLNPP Sbjct: 1054 GSEMHEELHDILTAAN--SLPKAPWEDGVCKVCGIDRDDDSVLLCDKCDSEYHTYCLNPP 1111 Query: 2987 LGKVPDGNWYCPSCVTGNSLSEDAAYGTQVVNQCGKRRYQRKLIDETL----EGLAQLAN 2820 L ++P+GNWYCPSC+ G + + G Q V +R Q+K + E E L +LA Sbjct: 1112 LARIPEGNWYCPSCMLGQTKAHH-DQGVQDV-----KRQQKKFVGEEAHAFQEELNKLAT 1165 Query: 2819 KMELKEYWEFTVEERILLLKFFSDEAMNSATIRDHIDQCASMCVDLHQRLRSLNSERKIL 2640 ME KEYW+ ++ERI LLKF DE +N+A IR+H+DQC+ DL Q+ R+ N E K L Sbjct: 1166 AMEEKEYWDLNMQERIYLLKFLCDEMLNTALIREHLDQCSDKLGDLQQKFRASNFELKDL 1225 Query: 2639 KLKEE---SLAANVAKMKGNVHTGGGELASVLADESQLPVDNKVSSFSGGSV------PM 2487 K KEE S A K H + + + G+V P Sbjct: 1226 KYKEEMRTSYARQSRSSKTEQHFNNSSGPVENQQQCTPTALDHLEEAEQGNVGVNLNNPA 1285 Query: 2486 DGGP-----------HTKDQVSILRSDVNLHPKLGDTXXXXXXXXXXXQMLGYGLSNTVS 2340 DG P KD S + L + + G + Sbjct: 1286 DGVPDGQLNVGKPYKSDKDISSASMVEERKSSGLSEQPSGMAIDQIDGDAIDEGSQSCEK 1345 Query: 2339 KNITVHAS------------SFPGHQYSNQPNANSLLDYNAELSK--------------- 2241 +++ +S S PG + ++ + S D N E S Sbjct: 1346 RSLGAKSSTCDNLNLKDTEFSTPGRELPDERASTSFQD-NLEASSTKSIELDADNNEMDT 1404 Query: 2240 LQQEISGLQESISTAESELLKVSLRKEFLGRDSYGRLYWVFGSYDASPWIVANGS-LNP- 2067 L +IS LQ+SIS ES++ S R+E LG+DS GRLYWV G PW+VA+GS L P Sbjct: 1405 LSDDISKLQDSISLLESQINMASSRRECLGKDSIGRLYWVIGRPGKRPWLVADGSMLKPK 1464 Query: 2066 ESEFGLDNHFP---------KSSSWMYYDTVAEIGELVKWLDDCDIRERDLKESILQWQS 1914 E + + N +P S+S Y++ EI L+ WL D D RE++LK+SILQWQ Sbjct: 1465 ERDISMVNSYPPSAFDCKGWNSASIFIYESDEEIQCLLDWLRDYDPREKELKDSILQWQR 1524 Query: 1913 NKSLNSNYQRNDFLKGKPSSSVISSEQ--RVPNHNCRVLKGVRALEKKFGLGMDMGADYI 1740 + S+ D P S EQ +PN V+ LE+K+GL +D + Sbjct: 1525 HFCHQSSSPLVD-----PPISGPKGEQLMELPNTKAAVI-----LEQKYGLQLDQDTSDL 1574 Query: 1739 NKNLEHNHEMTYQGRMCRCVCLELIWPSRHHCFSCHQTFSASEELDQHGE-KCRAAATGA 1563 K +++ + R RC CLE +WPSR+HC +CH+T+ S E + H + KC Sbjct: 1575 PKKRGKKIKLSSEDRTYRCDCLEPVWPSRYHCLTCHETYLISTEFEGHNDGKCSKIHQSP 
1634 Query: 1562 AGSLKHDKTVMNQGIRICPGSSNIPQSVLNEKHDTQSNCV----KXXXXXXXXXXXXEIM 1395 S ++D+ + + +S EK + + V EI Sbjct: 1635 DESRENDEPKV-----------KVTKSDTKEKDSLECSSVIEPSSDRKLMQCPYDFEEIC 1683 Query: 1394 ANFKVDYSIEEDIKGIGLFGTNGVVPFLPNSSPCFNDPALTLVPPSGNEVSMEDQTSVPK 1215 F + S +E +K IGL G+NGV F+P S F +PA+ + + + ++D TS + Sbjct: 1684 RKFVTNDSNKETVKQIGLNGSNGVPSFVP-SPAFFLEPAI-VQSQNRKDDELKDWTSSLE 1741 Query: 1214 SQQKLSDDLAKTASVVNDKSSGIEKGKGKVSEVECMKSRVLCERGRLXXXXXXXXXSTAR 1035 +S +V + S + G V + + KS+ R Sbjct: 1742 ECNAMS-----AQKLVQEVSKSGQSCPGNVGDEKVQKSKKPTPDNTSGEEAHSTTGKPTR 1796 Query: 1034 -----RSIIRESSLRPKVGYATEVLRLLKISMLDIDAALPEAALRASRSHLDRRCAWRTF 870 ++ ESSLRP +G + +L+ KI++LDI+AALPE ALRAS+ RR +WR F Sbjct: 1797 LLAVNGGLVPESSLRPLIGRNSHILKQQKINLLDIEAALPEEALRASKCQQIRRRSWRAF 1856 Query: 869 VKCANTLYEMMLALIILEDTIRTDYLKNNWWYWSSPSAASNMPTLSALALRIFTLDSAIL 690 VK A ++ +M+LA +LE I+ ++LKN+WWYWSS +AA T+S+LALR++TLD I+ Sbjct: 1857 VKDAESISQMVLAANLLEGMIKAEFLKNDWWYWSSFTAAMKTSTVSSLALRVYTLDDCII 1916 Query: 689 YEK 681 Y K Sbjct: 1917 YSK 1919