BLASTX nr result
ID: Mentha29_contig00006040
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00006040
         (3087 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EYU31274.1| hypothetical protein MIMGU_mgv1a000087mg [Mimulus...  1017  0.0
ref|XP_006365207.1| PREDICTED: methyl-CpG-binding domain-contain...   573  e-160
ref|XP_004239350.1| PREDICTED: methyl-CpG-binding domain-contain...   572  e-160
ref|XP_002525350.1| DNA binding protein, putative [Ricinus commu...   558  e-156
ref|XP_006483833.1| PREDICTED: methyl-CpG-binding domain-contain...   549  e-153
ref|XP_002274643.2| PREDICTED: methyl-CpG-binding domain-contain...   543  e-151
ref|XP_007217135.1| hypothetical protein PRUPE_ppa000046mg [Prun...   542  e-151
ref|XP_006446469.1| hypothetical protein CICLE_v10014026mg [Citr...   538  e-150
ref|XP_006470356.1| PREDICTED: methyl-CpG-binding domain-contain...   535  e-149
ref|XP_006470355.1| PREDICTED: methyl-CpG-binding domain-contain...   535  e-149
ref|XP_007031430.1| Methyl-CpG-binding domain-containing protein...   534  e-149
ref|XP_002884279.1| methyl-CpG-binding domain 9 [Arabidopsis lyr...   520  e-144
ref|XP_004167238.1| PREDICTED: LOW QUALITY PROTEIN: methyl-CpG-b...   517  e-143
ref|XP_004141185.1| PREDICTED: methyl-CpG-binding domain-contain...   516  e-143
gb|EXC31622.1| Methyl-CpG-binding domain-containing protein 9 [M...   512  e-142
ref|NP_186795.1| methyl-CPG-binding domain 9 [Arabidopsis thalia...   509  e-141
ref|XP_006296811.1| hypothetical protein CARUB_v10012794mg [Caps...   505  e-140
ref|XP_006603816.1| PREDICTED: methyl-CpG-binding domain-contain...   501  e-139
ref|XP_006594288.1| PREDICTED: methyl-CpG-binding domain-contain...
501 e-139 ref|XP_006408507.1| hypothetical protein EUTSA_v10019872mg [Eutr... 496 e-137 >gb|EYU31274.1| hypothetical protein MIMGU_mgv1a000087mg [Mimulus guttatus] Length = 1861 Score = 1017 bits (2629), Expect = 0.0 Identities = 540/876 (61%), Positives = 635/876 (72%), Gaps = 2/876 (0%) Frame = +3 Query: 3 ERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPE 182 ERDDLLVQ CNS LPRAPWDEG+CKVCGMDKDDDNVLLCDKCDSEYHRYCL PPLL+IPE Sbjct: 1013 ERDDLLVQACNSSLPRAPWDEGICKVCGMDKDDDNVLLCDKCDSEYHRYCLSPPLLKIPE 1072 Query: 183 GNWYCPSCVTGQSLPSGTGYASGSNQRRKRRNQGEFSSKFLEELSRFAKLMGMKEHWEFT 362 GNWYCPSCVTGQ++ T Y S + Q RKR++QGEF+SKFLEEL+R AKLM +KE+WEFT Sbjct: 1073 GNWYCPSCVTGQAISYSTSYGSVATQCRKRKHQGEFTSKFLEELARLAKLMEIKEYWEFT 1132 Query: 363 VEERIFFLKFLLDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXXSA 542 +EERIFF+KFL DEALNSAT+R+HM+Q +SRAADLQ KLRSLT E S Sbjct: 1133 IEERIFFMKFLFDEALNSATIREHMDQSSSRAADLQQKLRSLTYELKVLKAKEDMLGLST 1192 Query: 543 EKTNSGVLNVRGDLNSDASSSQHASENITRGKPSEKLVGDQSQPEKIIVKTSEGPNWLSE 722 EK NSG RGD+ SDASSS +EN +R PSEK G + E P+ L+E Sbjct: 1193 EKVNSGG---RGDMKSDASSSLLLTENSSR-IPSEK--GSHLSSLSAFTRLEERPS-LNE 1245 Query: 723 KPISVQQPQSDQGHTSLLNNVQSPLFSSPTSERETELVQCPNQGDMPSSQLNNLKACTVK 902 +P QP LL+ + +P+ S+ S D SSQ N+LKA TVK Sbjct: 1246 QP---NQPP-------LLSTIPAPVSSAQESR---------GNPDKLSSQDNSLKAATVK 1286 Query: 903 QEITNLLDSIADIELELVKISLRRDFLGRDSIGRVYWAFCYPGARPWVIACGGTAHKERC 1082 +I+++ DSIA IELEL+K+SLR+DFLGRDS GRVYW F PGARPW++ACG A KERC Sbjct: 1287 SDISSMRDSIASIELELLKVSLRKDFLGRDSNGRVYWGFYCPGARPWIMACGDLAFKERC 1346 Query: 1083 PRDFSSIPDSDKWMYYESDSEIEKLVGWLRENNVREKELKESISQFQANKLKDSEYTEDH 1262 P +F +PDS KWMYYESD EIEKLVGWLRENN REKELKESI Q Q NKLKDS+YTE+H Sbjct: 1347 PEEFIGVPDSHKWMYYESDDEIEKLVGWLRENNPREKELKESILQLQNNKLKDSQYTENH 1406 Query: 1263 ILKKREINHGGRKTVSADFLATNAMNALEKKFGPCKRIETTAVPQNLVMGASQSGGMHRC 1442 IL K E N RK SA+ L+T AM +LE KFGP T QNL G S M+RC Sbjct: 1407 ILSKAEENRSERKASSANILSTKAMASLENKFGPLLGTRATDARQNLASGLSPDCRMYRC 1466 Query: 1443 
ECLEMLWPSKDHCGSCHQSFSTSEELRQHVKENCKAVASGSKRSQAAEDMSKRKKARNAV 1622 ECLE+LWPS +HC SCHQSF T+EEL QH+KENCK A KRSQ ED+SKRKK + Sbjct: 1467 ECLELLWPSNNHCASCHQSFPTTEELGQHLKENCKPAAPVPKRSQTTEDVSKRKKLKIVS 1526 Query: 1623 SQEKRPASVGIPQGSTLEKQIDGSASVESYNADCPFNFEEIMTRFIVPSSVKDGVNEIGL 1802 SQEKRP +GI Q ST +KQ DGS+ + Y ADCPFNFEEIMTRF+VP S+KD VN IGL Sbjct: 1527 SQEKRPGDMGILQTSTSKKQNDGSSFADRYYADCPFNFEEIMTRFVVPGSIKDAVNSIGL 1586 Query: 1803 IGSGGLPSFLQGQSPYVSDSALTVGLERTNEASSRSNDLRSKQKNA--EPGDVMNGRGFK 1976 IG+GG+PSF S Y+ S DL SKQ ++ E MN + K Sbjct: 1587 IGNGGIPSFSSSGSLYL---------------SGMPTDLSSKQHHSSNEGSAAMNTKDNK 1631 Query: 1977 DSNRSSRSAENGLSDELSIVGRLKSILTSEKDQVTPMKDRSSLLGLSKSTIIRESSSRPL 2156 +S+R S AE L ++ S VGRLKSI S ++ V+ MK+++SLLGLSKS++IRESS RPL Sbjct: 1632 ESSRLSSCAETFLGEKGSGVGRLKSISMSGREHVSSMKNKNSLLGLSKSSLIRESSQRPL 1691 Query: 2157 VGRASEILRILKINLLDMDAALPEDAFRKSRTSPDRRCAWRAFVKSSKSIYEMVQATIVF 2336 VGRASEILR LKINLLDMDAALP+DA R SR++ RR AWRAFVKS+KSIYEMVQA I+ Sbjct: 1692 VGRASEILRFLKINLLDMDAALPQDALRTSRSNEGRRYAWRAFVKSAKSIYEMVQAMIIL 1751 Query: 2337 EDMIKTEYLRNDWWYWSSPSTAARITTLSALALRIYSLDSAISYEEPLPTAAAMEVSEPC 2516 ED I++EYLRNDWWYWSSPSTAA+ TTLS+LALRIYSLD+AISYE+PL ++E+ EP Sbjct: 1752 EDTIRSEYLRNDWWYWSSPSTAAKTTTLSSLALRIYSLDAAISYEKPLQN-GSIEMPEPS 1810 Query: 2517 NAIEEEMQKSPTLKNLASPSSPTLLKTPEPDSSENP 2624 A+E+E S LKNL SPSSP+L KTPEPDS+ENP Sbjct: 1811 CALEDEAPLSKLLKNLPSPSSPSLQKTPEPDSAENP 1846 >ref|XP_006365207.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like [Solanum tuberosum] Length = 2173 Score = 573 bits (1478), Expect = e-160 Identities = 374/984 (38%), Positives = 514/984 (52%), Gaps = 110/984 (11%) Frame = +3 Query: 3 ERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPE 182 +RD LL V S LP+APW+EGLCKVC MDKDD NVLLCDKCDSEYH YCLDPPL+++P Sbjct: 1185 DRDGLLAHVNESSLPKAPWEEGLCKVCSMDKDDVNVLLCDKCDSEYHTYCLDPPLVKVPI 1244 Query: 183 GNWYCPSCVTGQSLPSGTGYASGSNQRR---KRRNQGEFSSKFLEELSRFAKLMGMKEHW 353 G WYCP C + +SGS+ R KRR + + KF+E+LS+ + M +KE+W Sbjct: 1245 
GPWYCPDCEA--KISRSQNASSGSHTIRQCVKRRLHRKLTHKFMEKLSQLTRTMELKEYW 1302 Query: 354 EFTVEERIFFLKFLLDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXX 533 E +E+RIF LKFL DE LNSA +RDH+++ AS +A+LQ KLRSL +E Sbjct: 1303 ELPLEDRIFLLKFLCDEMLNSAILRDHIDRSASLSAELQQKLRSLGAELKLLKHKKEILT 1362 Query: 534 XSAEK---------------TNSGVLNVRG-DLNSDASSSQHASENITRGKPSEKLVGDQ 665 + +N L V+G D S SS + G K Sbjct: 1363 AKLKNDARSSGDTGSDTSLWSNDCKLKVQGPDSGSHNSSISGGCRQLDDGTQHNKCNDYN 1422 Query: 666 SQP-----EKIIVKT-SEGPNWLSEKPISVQQPQSDQ---GHTSLLN------------N 782 Q + I KT + G N + P + Q Q +T LN N Sbjct: 1423 KQSCLYTSKNIQDKTCASGTNHIRNSPDPINHLQHQQLLKENTRSLNTSSHAKCGTEEAN 1482 Query: 783 VQSPLFSSPTSERETELVQ--------------------------CPNQGDMPSSQLNNL 884 +Q+ LF S T ++ET+ + C P + Sbjct: 1483 LQNDLFISTTLQQETDQIPGNCLESTPSSSKSIMLFATHIVSATTCSGSVSNPLEEAFLF 1542 Query: 885 KACTVKQEITNLLDSIADIELELVKISLRRDFLGRDSIGRVYWAFCYPGARPWVIACGGT 1064 + +K+EI L DSIA ELEL ++S+R++++G+DS GR+YW F + V + Sbjct: 1543 EMSAIKKEIRALEDSIAAKELELQEVSVRKEYMGQDSEGRLYWTFGRSTSSRLVAYASTS 1602 Query: 1065 AHKER---------------------CPRDFSSIPDSDKWMYYESDSEIEKLVGWLRENN 1181 E P + +P+ D+W Y+SD + E L+ WL+E++ Sbjct: 1603 TQPESSGHLWSYGVESSRRSGVFDSSAPWENMGMPNLDQWTSYQSDVDTEILIRWLKEHD 1662 Query: 1182 VREKELKESISQFQANKLKDSEYTEDHILKKREI-----NHGGRKTVSADFLATNAMNAL 1346 RE+ELKESI Q++ + Y E H K + + ++D L T A+ A+ Sbjct: 1663 PRERELKESILQWRDTRKMIYYYLESHGHDKVRLITSIPSEDSASCFNSDSLVTRAVTAI 1722 Query: 1347 EKKFGPCKRIETTAVPQNL--VMGASQSGGMHRCECLEMLWPSKDHCGSCHQSFSTSEEL 1520 +K C E T + NL + S G ++RCECLE LWPS+ HC SCHQ+FS ++E Sbjct: 1723 KKMVSGCSAEEETEICTNLGVKVRVSFDGELYRCECLEPLWPSRPHCLSCHQTFSDAKER 1782 Query: 1521 RQHVKENCKAVASG--SKRSQAAEDMSKRKKARNA-----------VSQEKRPASVGIPQ 1661 +H E C+ + + + +E +KRK+ N VSQ + +G + Sbjct: 1783 LKHANEKCRIDSPSPIQRDGETSEQPAKRKRTANNEILQDNSLSNDVSQASKSKKLGNGE 1842 Query: 1662 GSTLEKQIDGSASVESYNA-DCPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQG 1838 S +K + AS E+ +CPF FEEI +FI S+K+ VNEIGLIG G PSF+ Sbjct: 1843 ASRRDKHGNAPASAENQTKQECPFKFEEIKAQFITQRSLKELVNEIGLIGCNGTPSFIPC 
1902 Query: 1839 QSPYVSDSALTVGLERTNEA-SSRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSRSAENGL 2015 SPY+ DSAL + +R +E S DL S + G ++ D+ + NGL Sbjct: 1903 TSPYLCDSALELLSQREDEVCGGNSTDLLSSEHQLRNGVKVSCINNSDNPNCTG---NGL 1959 Query: 2016 SDELSIVGRLKSILTSEKDQVTPMKDRSSLLGLSKSTIIRESSSRPLVGRASEILRILKI 2195 + + GRLKS ++Q + KD+ G++ +I ESS P+ GRAS ILR LKI Sbjct: 1960 AGAGPVFGRLKSATKRGRNQFSSTKDKILEFGVNMYFVIPESSLHPVAGRASVILRCLKI 2019 Query: 2196 NLLDMDAALPEDAFRKSRTSPDRRCAWRAFVKSSKSIYEMVQATIVFEDMIKTEYLRNDW 2375 NLLD+DAALPE+A R SR +RR WRAFVKS+ +IYEMVQATI+ ED IKTEYL+NDW Sbjct: 2020 NLLDIDAALPEEALRVSRLQSERRRVWRAFVKSAATIYEMVQATIILEDAIKTEYLKNDW 2079 Query: 2376 WYWSSPSTAARITTLSALALRIYSLDSAISYEEPLPTAAAMEVSE-PCNAIEEEMQKSPT 2552 WYWSSPS AARI+TLSALALR+Y+LDSAI Y++ ++ + SE C E ++ Sbjct: 2080 WYWSSPSAAARISTLSALALRVYALDSAILYDK----LSSQDASETDCKEEREPPPRNSV 2135 Query: 2553 LKNLASPSSPTLLKTPEPDSSENP 2624 N ASPS L PEP S P Sbjct: 2136 PTNTASPSKKKPL-DPEPAESSRP 2158 >ref|XP_004239350.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like [Solanum lycopersicum] Length = 2151 Score = 572 bits (1473), Expect = e-160 Identities = 372/973 (38%), Positives = 516/973 (53%), Gaps = 99/973 (10%) Frame = +3 Query: 3 ERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPE 182 +RD LL V S LP+APW+EGLCKVC MDKDD NVLLCDKCDSEYH YCLDPPL+++P Sbjct: 1180 DRDGLLAHVNESSLPKAPWEEGLCKVCSMDKDDVNVLLCDKCDSEYHTYCLDPPLVKVPI 1239 Query: 183 GNWYCPSCVTGQSLPSGTGYASGSNQRR---KRRNQGEFSSKFLEELSRFAKLMGMKEHW 353 G WYCP C + +SGS+ R KRR + + + KF+E+LS+ + M +KE+W Sbjct: 1240 GPWYCPDCEA--KISRSQNASSGSHTIRQCVKRRLRRKLTHKFMEKLSQLTRTMELKEYW 1297 Query: 354 EFTVEERIFFLKFLLDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXX 533 E +E+RIF LKFL E L+SA +RDH+++ AS +A+LQ KLRSL +E Sbjct: 1298 EIPLEDRIFLLKFLCGEMLSSAILRDHIDRSASLSAELQQKLRSLGAELKLLKHKKEILT 1357 Query: 534 XSAEK---------------TNSGVLNVRG-DLNSDASSSQHASENITRGKPSEKLVGDQ 665 + +N L V+G D S SS + G K Sbjct: 1358 AKLKNDARSSGDAGSDTSLWSNDCKLKVQGPDSGSHNSSISGGCRQLDDGTQHNKCNDFN 1417 Query: 666 
SQP----EKIIV-KT-SEGPNWLSEKPISVQQPQSDQ---GHTSLLN------------N 782 Q KII KT + G N + P + Q Q + LN N Sbjct: 1418 KQSCLYTSKIIQDKTCASGTNHIRNSPDPINHLQHQQLLKENARSLNTSSHAKCGTEETN 1477 Query: 783 VQSPLFSSPTSERETELVQ--------------------------CPNQGDMPSSQLNNL 884 +Q+ LF S T ++ET+ + C P + Sbjct: 1478 LQNDLFMSTTVQQETDQIPGNRLESAQSSSKSIMLFATHIVSATTCLGSVSNPLEEALLF 1537 Query: 885 KACTVKQEITNLLDSIADIELELVKISLRRDFLGRDSIGRVYWAFCYPGARPWVIACGGT 1064 + +K+EI L DSIA EL+L ++S+R++++G+DS GR+YW F + V + Sbjct: 1538 EMSAIKKEIRALEDSIAAKELDLQEVSVRKEYMGQDSEGRLYWTFGRSTSSRLVAYASTS 1597 Query: 1065 AHKER---------------------CPRDFSSIPDSDKWMYYESDSEIEKLVGWLRENN 1181 E P + +P+ ++W Y+SD + E L+ WL+E++ Sbjct: 1598 TQPESSGHLWSYGVESSRRSGVLDSSAPWENMGLPNLEQWTSYQSDVDTEILIRWLKEHD 1657 Query: 1182 VREKELKESISQFQANKLKDSEYTEDHILKKREIN-----HGGRKTVSADFLATNAMNAL 1346 RE+ELKESI Q++ + Y E H +N ++D L T A+ A+ Sbjct: 1658 PRERELKESILQWRDTRKMIYYYLESHGHDTVGLNTSIPSEDSGSCFNSDSLVTRAVTAI 1717 Query: 1347 EKKFGPCKRIETTAVPQNL--VMGASQSGGMHRCECLEMLWPSKDHCGSCHQSFSTSEEL 1520 +K C E T + NL + S G ++RCECLE LWPS+ HC SCHQ+FS ++E Sbjct: 1718 KKMVSGCSTEEETGICTNLGVKVRVSFDGELYRCECLEPLWPSRPHCLSCHQTFSDAKER 1777 Query: 1521 RQHVKENCKAVASGSKRSQAAEDMSK-RKKARNAVSQEKRPASVGIPQGSTLEKQIDGSA 1697 ++H E C+ +S + + +E K ++KA N + Q+ +++ + +K + A Sbjct: 1778 QKHANEKCRIDSSIQRDGETSEQPVKCKRKANNEILQDNSLSTIDCRR----DKHGNAPA 1833 Query: 1698 SVESYNA-DCPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSALTV 1874 S E+ +CPF EEI +FI SS+K+ VNEIGLIG G PSF+ G SPY+ DSAL + Sbjct: 1834 SAENQTKQECPFKLEEIKAQFITQSSLKELVNEIGLIGCNGTPSFVPGTSPYLCDSALGL 1893 Query: 1875 GLERTNEA-SSRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSRS--AENGLSDELSIVGRL 2045 +R +E S DL S + + NG F N S + NGL+ + GRL Sbjct: 1894 LSQREDEVCGGNSTDLLSSEHQ-----LRNGVKFSCINNSDKPNCTGNGLAGAGPVFGRL 1948 Query: 2046 KSILTSEKDQVTPMKDRSSLLGLSKSTIIRESSSRPLVGRASEILRILKINLLDMDAALP 2225 KS +D+ + KD+ G++ +I ESS P+ GRAS ILR LKINLLD+DAALP Sbjct: 1949 KSATKRGRDKFSSTKDKILEFGVNMYFVIPESSLHPVAGRASVILRCLKINLLDIDAALP 2008 Query: 2226 
EDAFRKSRTSPDRRCAWRAFVKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYWSSPSTAA 2405 E+A R SR P+RR WRAFVKS+ +IYEMVQATI+ ED IKTEYL+NDWWYWSSPS AA Sbjct: 2009 EEALRVSRLQPERRRVWRAFVKSAATIYEMVQATIILEDAIKTEYLKNDWWYWSSPSAAA 2068 Query: 2406 RITTLSALALRIYSLDSAISYEEPLPTAAAMEVSEPCNAIEEEMQKSPTLKNLASPSSPT 2585 R +TLSALALR+Y+LDSAI Y++ ++ + SE E E ++ N ASPS Sbjct: 2069 RNSTLSALALRVYALDSAILYDK----LSSQDASETDCKEEREPPRNSVPTNTASPSKKK 2124 Query: 2586 LLKTPEPDSSENP 2624 L PEP S P Sbjct: 2125 PL-DPEPAESSRP 2136 >ref|XP_002525350.1| DNA binding protein, putative [Ricinus communis] gi|223535313|gb|EEF36988.1| DNA binding protein, putative [Ricinus communis] Length = 1794 Score = 558 bits (1438), Expect = e-156 Identities = 350/919 (38%), Positives = 501/919 (54%), Gaps = 46/919 (5%) Frame = +3 Query: 3 ERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPE 182 E +D+L S +P+APWDEG+CKVCG+DKDDDNVLLCDKCDS YH YCL+PPL RIPE Sbjct: 898 EMEDILEHA--SQMPKAPWDEGVCKVCGVDKDDDNVLLCDKCDSGYHTYCLNPPLARIPE 955 Query: 183 GNWYCPSCVT--GQSLPSGTGYASGSNQRRKRRNQGEFSSKFLEELSRFAKLMGMKEHWE 356 GNWYCPSC+T +P + RK+R QGEF+ LE L+ M + ++W+ Sbjct: 956 GNWYCPSCITQGASQVPQFVSHC------RKKRRQGEFTHGVLEALAHLGTTMEITDYWD 1009 Query: 357 FTVEERIFFLKFLLDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXX 536 ++VEERIF LKFL DE LNSA +R+H++QCAS +ADLQ KLRSL+ E Sbjct: 1010 YSVEERIFLLKFLGDEVLNSANIREHLDQCASVSADLQQKLRSLSMEWRNLKFKEELMLN 1069 Query: 537 SAEKTNS-GVLNVRGDLNSDASSSQHASENITRGKPSEKLVGDQSQPEKIIVKTSEGPNW 713 K+ G V + + + S + + + D + + T P W Sbjct: 1070 GVGKSGKEGTTTVLPNYDKLLGQTHSRSSLCSTSFIDLEHLKDGPRFPRTNDFTKR-PCW 1128 Query: 714 LSEKPISVQQPQSDQGHTSLLNNVQSPLFSSPTSERETELVQCPNQGDMPSSQLNNLKAC 893 + K + VQQP S+ +++ + + NQ D+ Q +NL++ Sbjct: 1129 VYPKGVQVQQPISNGSQVFTISDTECQV----------------NQPDVNQLQTSNLESI 1172 Query: 894 TVKQEITNLLDSIADIELELVKISLRRDFLGRDSIGRVYWAFCYPGARPWVIACGGT--- 1064 ++ + + L DS+ +EL+L K SLR++FLGRDS GRVYWAF G+ PWV+ G T Sbjct: 1173 FIRDKASVLQDSVTSLELQLQKASLRKEFLGRDSAGRVYWAFSRTGSLPWVVIDGTTVVQ 1232 Query: 1065 
----AHKERCPRDF-----SSIPDSD-------------------------KWMYYESDS 1142 A + R R SSI D +W ++S + Sbjct: 1233 QSSIAEENRVLRFNNLTFRSSIGAQDLLRFKGSNVFSPYASDLTSGISVYFQWFSHQSYA 1292 Query: 1143 EIEKLVGWLRENNVREKELKESISQFQANKLKDSEYTEDHILKKRE---INHGGRKTVSA 1313 EIE+L+ WLR+N+ ++EL ES+ Q +S +++L+ + + KT+ Sbjct: 1293 EIEELIKWLRDNDPMQRELIESLLQRLNFGYSNSNKAANYVLEMNQPASMPVNIEKTLKP 1352 Query: 1314 DFLATNAMNALEKKFGPCKRIETT--AVPQNLVMGASQSGGMHRCECLEMLWPSKDHCGS 1487 L T A+ ALEKK+GPC ++ T +V + + + M RCECLE +WPS+ HC S Sbjct: 1353 KSLETRALTALEKKYGPCMELDVTNISVKFSRNLKVTYDDRMCRCECLEAIWPSRHHCLS 1412 Query: 1488 CHQSFSTSEELRQHVKENCKAVASGSKRSQAAEDMSKRKKARNAVSQEKRPASVGIPQGS 1667 CH+SFS+ EL +H C A A + S+ +D+SK K A E + + G G Sbjct: 1413 CHRSFSSRCELEEHNDGKCGAGAHTPQNSRVTDDVSKEKVLMRAEHGEWQCKAGGA--GH 1470 Query: 1668 TLEKQIDGSASVESYNADCPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSP 1847 +E + G P+N EEI +F+ SS K+ V EIGL+GS G+PS + SP Sbjct: 1471 EIEFGLIGFRK----EFMSPYNLEEISAKFVTRSSNKELVKEIGLLGSNGIPSLVPCSSP 1526 Query: 1848 YVSDSALTVGLERTNEASSRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSRSAENGLSDEL 2027 Y+ D L + L NE + + + + R SN + L +EL Sbjct: 1527 YLIDPTLKLVLPCVNEVCQSVQSTNVENGSLQGDTTTSKRHANKSNATKDCTAVDLYEEL 1586 Query: 2028 SIVGRLKSILTSEKDQVTPMKDRSSLLGLSKSTIIRESSSRPLVGRASEILRILKINLLD 2207 +GR S L ++ ++ + LG S I R S+ RPLVG+ + ILR LKINLLD Sbjct: 1587 QEIGR--SYLMNQSS----LRFSCTKLGNPLSEI-RGSALRPLVGKGAHILRQLKINLLD 1639 Query: 2208 MDAALPEDAFRKSRTSPDRRCAWRAFVKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYWS 2387 MDAALPE+A + S ++RCAWRAFVKS+KS++EMVQATIV E+MIKT++LRN+WWYWS Sbjct: 1640 MDAALPEEAVKSSNIYLEKRCAWRAFVKSAKSVFEMVQATIVLENMIKTDFLRNEWWYWS 1699 Query: 2388 SPSTAARITTLSALALRIYSLDSAISYEEPLPTAAAMEVSEPCNAIEEEMQKSPT-LKNL 2564 S S AA+I T+S+LALRIY+LD+AI YE+ LP +++E + + T L++ Sbjct: 1700 SLSAAAKIATISSLALRIYTLDAAIVYEKTLPFTPPKDIAEVGSKSDNNNSPPHTDLESN 1759 Query: 2565 ASPSSPTLLKTPEPDSSEN 2621 PSS +L++ D ++N Sbjct: 1760 PKPSSKPVLRSHNLDLTDN 1778 >ref|XP_006483833.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like isoform X2 [Citrus sinensis] Length = 
2084 Score = 549 bits (1414), Expect = e-153 Identities = 348/953 (36%), Positives = 506/953 (53%), Gaps = 81/953 (8%) Frame = +3 Query: 3 ERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPE 182 E +D+L S +P+APWDEG+CKVCG+DKDDDNVLLCD CDS YH YCL PPL R+PE Sbjct: 1136 EMEDILESA--SEIPKAPWDEGVCKVCGIDKDDDNVLLCDTCDSGYHTYCLTPPLTRVPE 1193 Query: 183 GNWYCPSCVTGQSLPSGTGYASGSNQR-RKRRNQGEFSSKFLEELSRFAKLMGMKEHWEF 359 GNWYCP C++G + R KRR+QGEF+ + LEE+ A M M+++W++ Sbjct: 1194 GNWYCPPCLSGNCKNKYMSQVPHVSSRIPKRRHQGEFTCRILEEVFHLAATMEMRDYWDY 1253 Query: 360 TVEERIFFLKFLLDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXXS 539 + +ERIF LKFL DE LNS +R+H+E+CAS + DLQ K+RSL+ E Sbjct: 1254 SDKERIFLLKFLCDELLNSTNIREHLERCASVSVDLQQKIRSLSLEWRNLKFREEILAGK 1313 Query: 540 AEKTNSGVLNVRGDLNSDASSSQHASENITRGKPS------EKLVGDQSQPEKIIV--KT 695 + + VL+ G ++ ++ + +PS L D + E + ++ Sbjct: 1314 VARDKASVLSGTGKCGTEGVATLYPHYGKLMRQPSGGGGYFSSLASDLALSEDGLQLNES 1373 Query: 696 SEGPNWLSEKPISVQQPQSDQGHTSLLNNVQSPLFSSPTSERETELVQCPNQGDMPSSQL 875 + W + K IS++QP + +S + SE++ V Q D+P S Sbjct: 1374 RKLSCWFNLKGISMRQPSCSRNQIGEAPYTESQVHQE--SEKDNIRVD-DLQYDVPHSAS 1430 Query: 876 NNLKACTVKQEIT----------------------------------------NLLDSIA 935 K T + T +L DSIA Sbjct: 1431 QPQKQDTAGEYATWRNKGQDLENGHTSGPLQPNCEASQSHFSSDHTNGNQVAEHLCDSIA 1490 Query: 936 DIELELVKISLRRDFLGRDSIGRVYWAFCYPGARPWVIACGGTA-HKERCPRD------- 1091 +E + + +SLR++ LGRDS GR+YWAF P PW++ T +ER ++ Sbjct: 1491 GLESQQLAVSLRKELLGRDSAGRLYWAFFRPNTSPWLLVDATTVLEQERILKEHGDSLAN 1550 Query: 1092 ------FSSIPDSDKWMYYESDSEIEKLVGWLRENNVREKELKESISQFQANKLKD---- 1241 ++ I S W Y+SD+EIE+L+ WL +++ R+KEL ESI ++ KD Sbjct: 1551 SPFEEEYNGISASSSWFSYQSDTEIEELIQWLSDSDPRDKELAESILRWTKIGYKDLKIA 1610 Query: 1242 SEYTEDHILKKREINHGGRKTVSADFLATNAMNALEKKFGPCKRIETTAVPQNLVMGASQ 1421 + ED + TV + L T A+ LE+K GPC E + L + Sbjct: 1611 GNHIEDESVPSSSKCRKSEATVKSSGLVTKALTVLEEKHGPCLEPEVLKMSMKLDTNSEL 1670 Query: 1422 S--GGMHRCECLEMLWPSKDHCGSCHQSFSTSEELRQHVKENCKAVASGSKRSQAAEDMS 1595 + M+RCECLE + P++ HC CH SFS EL +H C A+ S+ 
S+ ++ Sbjct: 1671 TCKERMYRCECLEPVLPTRFHCRRCHLSFSARNELEEHNDAKCILSATSSQNSKEDDE-- 1728 Query: 1596 KRKKARNAVSQEKRPASVGIPQGSTLEKQIDGSASVESYNAD----CPFNFEEIMTRFIV 1763 R K + E A G + + + ++ S+ CPFNFEEI T+FI Sbjct: 1729 -RTKGAGTIRTETLQAECMETAGKGMSQSLKHGTAMGSFEIPKEFACPFNFEEISTKFIT 1787 Query: 1764 PSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSALTVGLERTNEAS--SRSNDLRSKQKN 1937 +S+K+ V EIGLIGS G+P+F+ SPY+ D +L + NE + ++S +L + + Sbjct: 1788 KNSIKELVQEIGLIGSNGVPAFVPSTSPYLCDPSLKLVEMCKNEINRGNKSTNLENLFQY 1847 Query: 1938 AEPGDVMNGRGFKD--SNRSSRSAENGLSDELSIVGRLKSILTSEKDQVTPMKDRSSLL- 2108 + GD+++G + +N S R + D++ RL +EK +D+S L Sbjct: 1848 SIVGDMVSGLEHDNISNNSSRRCTVSHNDDDVLKCRRLNPNFMNEK------RDQSFSLS 1901 Query: 2109 ---GLSKSTIIRESSSRPLVGRASEILRILKINLLDMDAALPEDAFRKSRTSPDRRCAWR 2279 G+ S+I+R++S PL+GR EILR LKINLLDMDAA+PE+A R S+ + R AWR Sbjct: 1902 LKPGIGNSSIVRDTSLMPLMGRGIEILRQLKINLLDMDAAVPEEALRSSKACWENRSAWR 1961 Query: 2280 AFVKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYWSSPSTAARITTLSALALRIYSLDSA 2459 AFVKS+KSI+EMVQATIVFEDMIKT+YLRN WWYWSS S AA I T+SALALR+Y+LD+A Sbjct: 1962 AFVKSAKSIFEMVQATIVFEDMIKTDYLRNGWWYWSSLSGAANIATVSALALRLYTLDAA 2021 Query: 2460 ISYEEPLPTAAAMEVSEPCNAIEEEMQKSPTLKNLASPSSPTLLKTPEPDSSE 2618 I YE+ + ++E+ E + ++E K+ PS +LKT D +E Sbjct: 2022 IVYEK---HSDSIEIQEHISQPDKETSPCKDSKSNPKPSK-AILKTQSSDLTE 2070 >ref|XP_002274643.2| PREDICTED: methyl-CpG-binding domain-containing protein 9-like [Vitis vinifera] Length = 2164 Score = 543 bits (1400), Expect = e-151 Identities = 352/941 (37%), Positives = 506/941 (53%), Gaps = 117/941 (12%) Frame = +3 Query: 3 ERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPE 182 E DD LV S +P+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL RIPE Sbjct: 1200 EIDDFLVSA--SEIPKAPWDEGVCKVCGIDKDDDSVLLCDMCDAEYHTYCLNPPLARIPE 1257 Query: 183 GNWYCPSCVTGQSLPSGTGYASGSNQRRKRRNQGEFSSKFLEELSRFAKLMGMKEHWEFT 362 GNWYCPSCV G S+ + + QR+ + QG+F+ +LE L+ A M KE+WE + Sbjct: 1258 GNWYCPSCVAGISMVDVSEHTHVIAQRQGKNCQGDFTHAYLESLAHLAAAMEEKEYWELS 1317 Query: 363 
VEERIFFLKFLLDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXXSA 542 V++R F KFL DE LN+A +R H+EQCA +A+LQ KLRS++ E A Sbjct: 1318 VDQRTFLFKFLCDELLNTALIRQHLEQCAESSAELQQKLRSISVEWKNLKLKEENLAARA 1377 Query: 543 EKTNSGVLNVRGDLNSDASSSQHASEN----------ITRGKPSEKLVGDQSQPEKIIVK 692 K +SG++ V G++ ++ S + N R K L DQ Q E Sbjct: 1378 PKVDSGMIYVAGEVGTEGGLSSALTNNGKCIAKPHTLSDRPKDFGILSNDQLQVEG---- 1433 Query: 693 TSEG--PNWLSEKPIS-------VQQPQSDQGHT----SLLNNVQSPLFSSP-------- 809 SEG PN L + P S +P ++G ++++ Q + P Sbjct: 1434 GSEGIRPNGLDKHPSSNCSEGNCTLKPIDNEGQLKEVHAVVDETQVSVDHFPHMVYQGNG 1493 Query: 810 TSERETEL-VQCPNQGDMP--------------SSQLNNLKAC----------------- 893 +S R EL +Q P Q +M + + N+L+ Sbjct: 1494 SSCRPNELHLQNPLQQEMDGLGTEFNLQVNMCENMEKNDLQGLHHPSDIRIVHVAEHDSE 1553 Query: 894 --TVKQEITNLLDSIADIELELVKISLRRDFLGRDSIGRVYWAFCYPGARPWVIACGGTA 1067 ++K +I++L DS+A IE +L+K+S+RR+FLG DS GR+YW PG PWV+ G A Sbjct: 1554 LNSIKNDISDLQDSMASIESQLLKLSVRREFLGSDSAGRLYWILAKPGWHPWVLVDGSMA 1613 Query: 1068 HKER-----------------------------------CP---RDFSSIPDSDKWMYYE 1133 +++ CP R +SI +W+ Y+ Sbjct: 1614 LQKKEKMRYLKNPGDSSVQKNSTSLSMDILSTLGGSNASCPFLYRPNASISICSQWVSYQ 1673 Query: 1134 SDSEIEKLVGWLRENNVREKELKESISQFQANKLKDSEYT--EDHILKKREINH-GGRKT 1304 S EI+ L+GWL++ + REKELKESI + +D + T D + + ++ + Sbjct: 1674 SGEEIDALIGWLKDADPREKELKESILHLHKLRFRDWKLTGDPDQVDSQTTLSRFPNSEN 1733 Query: 1305 VSADFLATNAMNALEKKFGPC--KRIETTAVPQNLVMGASQSGGMHRCECLEMLWPSKDH 1478 +D L T A L KK+GP I ++ +L + M+RCECLE +W S+ H Sbjct: 1734 AFSDGLLTKAGILLGKKYGPWFEPEIADSSKKWDLRSKVTNESKMYRCECLEPIWSSRHH 1793 Query: 1479 CGSCHQSFSTSEELRQHVKENCKAVASGSKRSQAAEDMSKRKKARNAV-SQEKRPASVG- 1652 C SCH++F T +L +H +C+ SG S+ +++ S K + + S+ R S G Sbjct: 1794 CPSCHRTFFTDIQLEEHNDGSCR---SGPPTSEKSKENSSHLKGKGTMKSKISREESTGD 1850 Query: 1653 -----IPQGSTLEKQIDGSASVESYNAD--CPFNFEEIMTRFIVPSSVKDGVNEIGLIGS 1811 IP+G + + S ++ N CP++FEEI ++F+ +S K+ V EIGLIGS Sbjct: 1851 IDMVEIPKGGCSQPR---SRLIKFQNEGLVCPYDFEEICSKFVTKNSNKELVQEIGLIGS 1907 Query: 1812 
GGLPSFLQGQSPYVSDSALTVGLERTNEASSRSNDLRSKQKNAEPGDVMNGRGFKDSNRS 1991 G+PSF+ + PY+SD+ L L + E + + D+ Q N P G G N S Sbjct: 1908 KGVPSFVSSRPPYISDATLL--LVPSGELKA-TGDMMLAQGNRIPA---GGSGSFSDNSS 1961 Query: 1992 SRSAENGLSDELSIVGRLKSILTSEKDQVTPMKDRSSLLGLSKSTIIRESSSRPLVGRAS 2171 SA N E S R +KD+ + + + + + +I +SS RPLVG+ Sbjct: 1962 RDSAAN----ETSAASRTDKSALEQKDKKYSLNNNGPEMEVGRCCVIPQSSLRPLVGKVY 2017 Query: 2172 EILRILKINLLDMDAALPEDAFRKSRTSPDRRCAWRAFVKSSKSIYEMVQATIVFEDMIK 2351 +ILR LKINLLDMDAALPE+A + SR ++R AWRAFVKS+++I+EMVQATI+ EDMIK Sbjct: 2018 QILRQLKINLLDMDAALPEEALKPSRADLEKRLAWRAFVKSAETIFEMVQATIMLEDMIK 2077 Query: 2352 TEYLRNDWWYWSSPSTAARITTLSALALRIYSLDSAISYEE 2474 TEYL N WWYWSS S AA+ +T+S+LALRIYSLD+AI+YE+ Sbjct: 2078 TEYLMNGWWYWSSLSAAAKTSTVSSLALRIYSLDAAIAYEK 2118 >ref|XP_007217135.1| hypothetical protein PRUPE_ppa000046mg [Prunus persica] gi|462413285|gb|EMJ18334.1| hypothetical protein PRUPE_ppa000046mg [Prunus persica] Length = 2154 Score = 542 bits (1397), Expect = e-151 Identities = 357/964 (37%), Positives = 504/964 (52%), Gaps = 122/964 (12%) Frame = +3 Query: 3 ERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPE 182 E DDLL S +P+APWD+G+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL RIPE Sbjct: 1186 EIDDLLAST--SGIPKAPWDDGVCKVCGIDKDDDSVLLCDTCDAEYHTYCLNPPLARIPE 1243 Query: 183 GNWYCPSCVTGQSLPSGTGYASGSNQRRKRRN-QGEFSSKFLEELSRFAKLMGMKEHWEF 359 GNWYCPSCV + + ++ +R+N QGE + +LE L+ + M E+WEF Sbjct: 1244 GNWYCPSCVVSKQMVQDASEHHQVIRKCRRKNYQGEVTRTYLEALTLLSMKMEENEYWEF 1303 Query: 360 TVEERIFFLKFLLDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXXS 539 V+ER F LKFL DE LNSA +R H+E C+ +A+LQ KLRSL++E Sbjct: 1304 NVDERTFLLKFLCDELLNSAVIRQHLEHCSETSAELQQKLRSLSAEWKNLKSKEEILIAK 1363 Query: 540 AEKTN--------------------------------SGVLNV-----------RG-DLN 587 A K + S NV RG D + Sbjct: 1364 AAKVDPSLEEDGVKEGLSTSVENHEKFVLQAHALSGRSNSFNVVSDDVPALEGARGLDKH 1423 Query: 588 SDAS----SSQHASENITRGKPSEKLVGDQSQPEKI----IVKTSEGPNWLSEKPISVQQ 743 AS SSQH+ + R K V D P + + S+ + L E P S Sbjct: 1424 
PSASNAEYSSQHSVDTEARAKDVHAAVHDTGTPGNVSSNAASEKSDISSRLIEFPSSNSL 1483 Query: 744 PQSDQGHTSLLN-----------NVQSPLFSS----PTSERETELVQCPNQGDMPSSQLN 878 P G + +V PL P+ R + Q + + SQ Sbjct: 1484 PHEINGSIGKIGCLGHPQDNMEMDVSLPLDQQGVCIPSDVRSNHVGQHMSPASVNESQAY 1543 Query: 879 NLKACTVKQEITNLLDSIADIELELVKISLRRDFLGRDSIGRVYWAFCYP---------- 1028 +L+ +VK +++ L DSI ++ EL K+S+RR+FLG DS+G +YWA + Sbjct: 1544 HLELNSVKSDLSLLQDSITSVDFELSKLSVRREFLGIDSLGGLYWASGHSRIVVDRTVSV 1603 Query: 1029 --------GARP-W----VIACGGTA---------HKERCPRDF---SSIPDSDKWMYYE 1133 G P W +C T K CP F S++ S W+ Y+ Sbjct: 1604 QDGMNMTDGRDPVWRGSVTQSCASTGVDSSLPLEGSKAGCPYLFEPNSAVAFSAPWVSYQ 1663 Query: 1134 SDSEIEKLVGWLRENNVREKELKESISQFQANKL----KDSEYTEDHILKKREINHGGRK 1301 +D+EI+ L+GWL++ N +E+ELKESI Q++ ++ K ++D +L + G K Sbjct: 1664 TDAEIDGLIGWLKDKNPKERELKESILQWKKSRFHKFQKTRSQSQDELLTAISVARNGEK 1723 Query: 1302 TVSADFLATNAMNALEKKFGPCKRIETTAVPQNLVMGASQSGG--MHRCECLEMLWPSKD 1475 T S D L T A LEK +GPC +ETT + + A + M+RCECLE +WP++ Sbjct: 1724 TES-DCLVTRAATLLEKMYGPCSELETTDISKKRGKRARLTNDEKMYRCECLEPIWPNRH 1782 Query: 1476 HCGSCHQSFSTSEELRQHVKENCKAVASGSKRSQAAEDMSK----------RKKARNAVS 1625 HC SCH++F EL H C ++ ++ + D SK R++ R ++ Sbjct: 1783 HCLSCHRTFVADAELEGHNDGRCVPFSAACEKGKEISDSSKVKGSLKCEINREECRGELN 1842 Query: 1626 QEKRPASVGIPQGSTLEKQIDGSASVESYNADCPFNFEEIMTRFIVPSSVKDGVNEIGLI 1805 + SV + L K +G CP++FEEI ++F+ S KD + EIGLI Sbjct: 1843 SVETSKSVHSELSAKLIKFQNGGLV-------CPYDFEEICSKFVTNDSNKDLIQEIGLI 1895 Query: 1806 GSGGLPSFLQGQSPYVSDSALTVGLERTNEASSRSNDLRSKQKNAEPGDVMNGRGFKD-S 1982 GS G+PSF+ SPY+SDS T L + N + ++ V+ G+ D + Sbjct: 1896 GSQGVPSFVPSLSPYLSDS--TQQLVTQKDVGVHGNGPEAAEQL-----VLQGKTNVDIA 1948 Query: 1983 NRSSRSAENG--LSDELSIVGRLKSILTSEKDQVTPMKDRSSLLGLSKSTIIRESSSRPL 2156 SS S + G L+ + +G L EK + P SS++G + ++ +SS RPL Sbjct: 1949 GCSSLSGKGGGLLNANIPTLGCL------EKREKRPSGSHSSVVGAGRFCVVPQSSLRPL 2002 Query: 2157 VGRASEILRILKINLLDMDAALPEDAFRKSRTSPDRRCAWRAFVKSSKSIYEMVQATIVF 2336 VG+ +I R LKINLLD+DAALPE+A R S++ +RR AWR FVK++ +IYEMVQATIV Sbjct: 2003 
VGKVCQISRRLKINLLDIDAALPEEALRPSKSHLERRWAWRTFVKAAVTIYEMVQATIVL 2062 Query: 2337 EDMIKTEYLRNDWWYWSSPSTAARITTLSALALRIYSLDSAISYEEPLPTAAAMEVSEPC 2516 EDMIKTEYLRN+WWYWSS S AA+I+TLSALALRIYSLDSAI YE+ P++ ++ EP Sbjct: 2063 EDMIKTEYLRNEWWYWSSFSAAAKISTLSALALRIYSLDSAIMYEKMFPSSDPVDKLEPS 2122 Query: 2517 NAIE 2528 + ++ Sbjct: 2123 SVLD 2126 >ref|XP_006446469.1| hypothetical protein CICLE_v10014026mg [Citrus clementina] gi|557549080|gb|ESR59709.1| hypothetical protein CICLE_v10014026mg [Citrus clementina] Length = 1680 Score = 538 bits (1386), Expect = e-150 Identities = 320/915 (34%), Positives = 474/915 (51%), Gaps = 91/915 (9%) Frame = +3 Query: 3 ERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPE 182 E +D+LVQ S +P+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL+RIPE Sbjct: 731 EINDILVQT--SEIPKAPWDEGICKVCGVDKDDDSVLLCDTCDAEYHTYCLEPPLVRIPE 788 Query: 183 GNWYCPSCVTGQSLPSGTG-YASGSNQRRKRRNQGEFSSKFLEELSRFAKLMGMKEHWEF 359 GNWYCPSCV S+ G ++ Q + ++ QGE + LEEL +M KE+WEF Sbjct: 789 GNWYCPSCVVRNSMVQGASEHSQVGGQHKGKKYQGEITRLCLEELRHLTTVMEEKEYWEF 848 Query: 360 TVEERIFFLKFLLDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXXS 539 V ER F LKFL DE LNSA +R H+EQC A+LQ KLRS + E Sbjct: 849 NVHERTFLLKFLCDELLNSALLRQHLEQCTEVTAELQQKLRSFSVEFKNLKSREETVAAR 908 Query: 540 AEKTNSGVLNVRGDLNSDASSSQHASENITRGKPSEKLVGDQSQPEKIIVKTSE-GPNWL 716 K + + N ++ + N GK E+ ++ ++ E GP + Sbjct: 909 VAKVEASMTNSVAEICMKEGPATVIRNN---GKCIEQPQNSSNRSNCSVIALEESGPMYP 965 Query: 717 SEKPISVQQPQSDQGHTSLLNNVQS------PLFSSPTSE-------------------- 818 ++ +++P D N +S PL SS E Sbjct: 966 TDAEGQIEEPHGDNSKMPSQKNDESIKPNEHPLASSLPQEIDNLSGEIRSQHNLQELARA 1025 Query: 819 RETELVQCPNQGDMPS------------------SQLNNLKACTVKQEITNLLDSIADIE 944 R+ + P+ PS Q +NL+ ++ +I L +SI +E Sbjct: 1026 RDAATLASPSNNQGPSVPNELHVTEGTCSVTMNEPQAHNLELNNIRNDILLLQESITSLE 1085 Query: 945 LELVKISLRRDFLGRDSIGRVYWAFCYPGARPWVIACGGTAHKER--------------- 1079 +L+K+S+RR+FLG DS GR+YW PG P +I G +++ Sbjct: 1086 QQLLKLSVRREFLGSDSSGRLYWVLPLPGMHPCLIVDGSPELQQKRKILDFRGPVDKGLV 1145 Query: 1080 
---------------------CPRDFSSIP---DSDKWMYYESDSEIEKLVGWLRENNVR 1187 CP + S W+ Y++D+EIE+LV WLR+N+ + Sbjct: 1146 LKNSSSSGSDAYSSSKGSKACCPFQYDPYAVTATSSHWILYQTDAEIEELVNWLRDNDPK 1205 Query: 1188 EKELKESISQFQANKLKDSEYTE----DHILKKREINHGGRKTVSADFLATNAMNALEKK 1355 E+ELK+SI ++ + +DS++T+ D K D L T A LEKK Sbjct: 1206 ERELKDSILNWKKIRFQDSQHTKKQSWDEYQSASSAPTNSDKVDCFDCLVTKAATLLEKK 1265 Query: 1356 FGPCKRIETTAVPQNLVMGASQSGGMHRCECLEMLWPSKDHCGSCHQSFSTSEELRQHVK 1535 +GPC E + M+RCECLE +WPS++HC SCH++FST+ E +H Sbjct: 1266 YGPCFESEEVLKKGGKRARVTSQEKMYRCECLEPIWPSRNHCLSCHRTFSTAVEFEEHND 1325 Query: 1536 ENCKAVASGSKRSQAAEDMSKRKKARNAVSQEKRPASVGIPQGSTLEKQIDGSASVESYN 1715 A + K +A+ + + ++ +S+ V + + S S + N Sbjct: 1326 TCNSAPPAYEKNKEASNSLKGKGNKKSDISRAACGTDVELVETSK------PSGLIRFQN 1379 Query: 1716 ADCPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSALTVGLERTNE 1895 CPF+ EI ++F+ S K+ V EIGL+GS G+PS + SP++SDS L + + Sbjct: 1380 DGCPFDLNEISSKFMTQDSNKELVQEIGLLGSKGIPSLIPSVSPFLSDSTLMLMSSQKEV 1439 Query: 1896 ASSRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSRSAENGLSDELSIVGRLKSIL--TSEK 2069 + S+ ++ G D+ S ++G + ++ K + ++ Sbjct: 1440 GVPDGQLMASETLSSSQGKQSMKNAGNDNMADDASRKSGSNGTHEVLKSKKPAFGCSEQR 1499 Query: 2070 DQVTPMKDRSSLLGLSKSTIIRESSSRPLVGRASEILRILKINLLDMDAALPEDAFRKSR 2249 D+ + R +G+++ ++ +SS RPL+GR S+I R LK+NLLD+DAALPE+A R S+ Sbjct: 1500 DRKSSSHVRVPKVGINQCCVVPQSSLRPLIGRTSQIKRRLKVNLLDIDAALPEEALRPSK 1559 Query: 2250 TSPDRRCAWRAFVKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYWSSPSTAARITTLSAL 2429 +RR AWRAFVKS+++IYEMVQATI+ EDMIKTE+LRN+WWYWSS S AA+ +T+S+L Sbjct: 1560 AHLERRWAWRAFVKSAETIYEMVQATIILEDMIKTEFLRNEWWYWSSLSAAAKTSTMSSL 1619 Query: 2430 ALRIYSLDSAISYEE 2474 ALRIYSLD+AI Y++ Sbjct: 1620 ALRIYSLDAAIIYDK 1634 >ref|XP_006470356.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like isoform X2 [Citrus sinensis] Length = 2023 Score = 535 bits (1377), Expect = e-149 Identities = 319/913 (34%), Positives = 471/913 (51%), Gaps = 89/913 (9%) Frame = +3 Query: 3 ERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPE 182 E +D+LVQ S 
+P+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL+RIPE Sbjct: 1076 EINDILVQT--SEIPKAPWDEGICKVCGVDKDDDSVLLCDTCDAEYHTYCLEPPLVRIPE 1133 Query: 183 GNWYCPSCVTGQSLPSGTG-YASGSNQRRKRRNQGEFSSKFLEELSRFAKLMGMKEHWEF 359 GNWYCPSCV S+ G ++ Q + + NQGE + LE L +M KE+WEF Sbjct: 1134 GNWYCPSCVVRNSMVQGASEHSQVGGQHKGKNNQGEITRLCLEALRHLTTVMEEKEYWEF 1193 Query: 360 TVEERIFFLKFLLDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXXS 539 V ER F LKFL DE LNSA +R H+EQC A+LQ KLRS + E Sbjct: 1194 NVHERTFLLKFLCDELLNSALLRQHLEQCTEVTAELQQKLRSFSVEFKNLKSREETVAAR 1253 Query: 540 AEKTNSGVLNVRGDLNSDASSSQHASENITRGKPSEKLVGDQSQPEKIIVKTSE-GPNWL 716 K + + ++ + N GK E+ ++ ++ E GP + Sbjct: 1254 VAKVEASMTYSVAEVCMKEGPATVIRNN---GKCIEQPQNSSNRSNCSVIALEESGPMYP 1310 Query: 717 SEKPISVQQPQSDQGHTSLLNNVQS------PLFSSPTSE------------------RE 824 ++ +++P D N +S PL SS E R+ Sbjct: 1311 TDAEGQIEEPHGDNSKMPSQKNDESIKPNEHPLASSLPQEIDNLSGEIRSQHNLQELARD 1370 Query: 825 TELVQCPNQGDMPS------------------SQLNNLKACTVKQEITNLLDSIADIELE 950 + P+ PS Q +NL+ ++ +I L +SI +E + Sbjct: 1371 AATLASPSNNHGPSVPNELHVTEGTCSVTMNEPQAHNLELNNIRNDILLLQESITSLEQQ 1430 Query: 951 LVKISLRRDFLGRDSIGRVYWAFCYPGARPWVIACGGTAHKER----------------- 1079 L+K+S+RR+FLG DS GR+YW PG P +I G +++ Sbjct: 1431 LLKLSVRREFLGSDSSGRLYWVLPLPGMHPCLIVDGSPELQQKRKILDFRGPVDKGLVLK 1490 Query: 1080 -------------------CPRDFSSIP---DSDKWMYYESDSEIEKLVGWLRENNVREK 1193 CP + S W+ Y++D+EIE+LV WLR+N+ +E+ Sbjct: 1491 NSSSSGSDAYSSSKGSKACCPFQYDPYAVTATSSHWILYQTDAEIEELVNWLRDNDPKER 1550 Query: 1194 ELKESISQFQANKLKDSEYTE----DHILKKREINHGGRKTVSADFLATNAMNALEKKFG 1361 ELK+SI ++ + +DS++T+ D K D L T A LEKK+G Sbjct: 1551 ELKDSILNWKKIRFQDSQHTKKQSWDEYQSASSAPTNSDKVDCFDCLVTKAATLLEKKYG 1610 Query: 1362 PCKRIETTAVPQNLVMGASQSGGMHRCECLEMLWPSKDHCGSCHQSFSTSEELRQHVKEN 1541 PC E + M+RCECLE +WPS++HC SCH++FST+ E +H Sbjct: 1611 PCFESEEVLKKGGKRARVTSQEKMYRCECLEPIWPSRNHCLSCHRTFSTAVEFEEHNDTC 1670 Query: 1542 CKAVASGSKRSQAAEDMSKRKKARNAVSQEKRPASVGIPQGSTLEKQIDGSASVESYNAD 1721 A + K +A+ + + ++ +S V + + S S + N Sbjct: 1671 
NSAPPAYEKNKEASNSLKGKGNKKSDISHAAGGTDVELVETSK------PSGLIRFQNDG 1724 Query: 1722 CPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSALTVGLERTNEAS 1901 CPF+ EI ++F+ S K+ V EIGL+GS G+PS + SP++SDS L + + Sbjct: 1725 CPFDLNEISSKFMTQDSNKELVQEIGLLGSKGIPSLIPSVSPFLSDSTLMLMSPQKEVGV 1784 Query: 1902 SRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSRSAENGLSDELSIVGRLKSIL--TSEKDQ 2075 + S+ ++ G D+ S ++G + ++ K + ++D+ Sbjct: 1785 PDGQLMASETLSSSQGKQSMKNAGNDNMADDASRKSGSNGTHEVLKSKKPAFGCSEQRDR 1844 Query: 2076 VTPMKDRSSLLGLSKSTIIRESSSRPLVGRASEILRILKINLLDMDAALPEDAFRKSRTS 2255 + R +G+++ ++ +SS RPL+GR S+I R LK+NLLD+DAALPE+A R S+ Sbjct: 1845 KSSSHVRVPKVGINQCCVVPQSSLRPLIGRTSQIKRRLKVNLLDIDAALPEEALRPSKAH 1904 Query: 2256 PDRRCAWRAFVKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYWSSPSTAARITTLSALAL 2435 +RR AWRAFVKS+++IYEMVQATI+ EDMIKTE+LRN+WWYWSS S AA+ +T+S+LAL Sbjct: 1905 LERRWAWRAFVKSAETIYEMVQATIILEDMIKTEFLRNEWWYWSSLSAAAKTSTMSSLAL 1964 Query: 2436 RIYSLDSAISYEE 2474 RIYSLD+AI Y++ Sbjct: 1965 RIYSLDAAIIYDK 1977 >ref|XP_006470355.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like isoform X1 [Citrus sinensis] Length = 2159 Score = 535 bits (1377), Expect = e-149 Identities = 319/913 (34%), Positives = 471/913 (51%), Gaps = 89/913 (9%) Frame = +3 Query: 3 ERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPE 182 E +D+LVQ S +P+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL+RIPE Sbjct: 1212 EINDILVQT--SEIPKAPWDEGICKVCGVDKDDDSVLLCDTCDAEYHTYCLEPPLVRIPE 1269 Query: 183 GNWYCPSCVTGQSLPSGTG-YASGSNQRRKRRNQGEFSSKFLEELSRFAKLMGMKEHWEF 359 GNWYCPSCV S+ G ++ Q + + NQGE + LE L +M KE+WEF Sbjct: 1270 GNWYCPSCVVRNSMVQGASEHSQVGGQHKGKNNQGEITRLCLEALRHLTTVMEEKEYWEF 1329 Query: 360 TVEERIFFLKFLLDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXXS 539 V ER F LKFL DE LNSA +R H+EQC A+LQ KLRS + E Sbjct: 1330 NVHERTFLLKFLCDELLNSALLRQHLEQCTEVTAELQQKLRSFSVEFKNLKSREETVAAR 1389 Query: 540 AEKTNSGVLNVRGDLNSDASSSQHASENITRGKPSEKLVGDQSQPEKIIVKTSE-GPNWL 716 K + + ++ + N GK E+ ++ ++ E GP + Sbjct: 1390 
VAKVEASMTYSVAEVCMKEGPATVIRNN---GKCIEQPQNSSNRSNCSVIALEESGPMYP 1446 Query: 717 SEKPISVQQPQSDQGHTSLLNNVQS------PLFSSPTSE------------------RE 824 ++ +++P D N +S PL SS E R+ Sbjct: 1447 TDAEGQIEEPHGDNSKMPSQKNDESIKPNEHPLASSLPQEIDNLSGEIRSQHNLQELARD 1506 Query: 825 TELVQCPNQGDMPS------------------SQLNNLKACTVKQEITNLLDSIADIELE 950 + P+ PS Q +NL+ ++ +I L +SI +E + Sbjct: 1507 AATLASPSNNHGPSVPNELHVTEGTCSVTMNEPQAHNLELNNIRNDILLLQESITSLEQQ 1566 Query: 951 LVKISLRRDFLGRDSIGRVYWAFCYPGARPWVIACGGTAHKER----------------- 1079 L+K+S+RR+FLG DS GR+YW PG P +I G +++ Sbjct: 1567 LLKLSVRREFLGSDSSGRLYWVLPLPGMHPCLIVDGSPELQQKRKILDFRGPVDKGLVLK 1626 Query: 1080 -------------------CPRDFSSIP---DSDKWMYYESDSEIEKLVGWLRENNVREK 1193 CP + S W+ Y++D+EIE+LV WLR+N+ +E+ Sbjct: 1627 NSSSSGSDAYSSSKGSKACCPFQYDPYAVTATSSHWILYQTDAEIEELVNWLRDNDPKER 1686 Query: 1194 ELKESISQFQANKLKDSEYTE----DHILKKREINHGGRKTVSADFLATNAMNALEKKFG 1361 ELK+SI ++ + +DS++T+ D K D L T A LEKK+G Sbjct: 1687 ELKDSILNWKKIRFQDSQHTKKQSWDEYQSASSAPTNSDKVDCFDCLVTKAATLLEKKYG 1746 Query: 1362 PCKRIETTAVPQNLVMGASQSGGMHRCECLEMLWPSKDHCGSCHQSFSTSEELRQHVKEN 1541 PC E + M+RCECLE +WPS++HC SCH++FST+ E +H Sbjct: 1747 PCFESEEVLKKGGKRARVTSQEKMYRCECLEPIWPSRNHCLSCHRTFSTAVEFEEHNDTC 1806 Query: 1542 CKAVASGSKRSQAAEDMSKRKKARNAVSQEKRPASVGIPQGSTLEKQIDGSASVESYNAD 1721 A + K +A+ + + ++ +S V + + S S + N Sbjct: 1807 NSAPPAYEKNKEASNSLKGKGNKKSDISHAAGGTDVELVETSK------PSGLIRFQNDG 1860 Query: 1722 CPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSALTVGLERTNEAS 1901 CPF+ EI ++F+ S K+ V EIGL+GS G+PS + SP++SDS L + + Sbjct: 1861 CPFDLNEISSKFMTQDSNKELVQEIGLLGSKGIPSLIPSVSPFLSDSTLMLMSPQKEVGV 1920 Query: 1902 SRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSRSAENGLSDELSIVGRLKSIL--TSEKDQ 2075 + S+ ++ G D+ S ++G + ++ K + ++D+ Sbjct: 1921 PDGQLMASETLSSSQGKQSMKNAGNDNMADDASRKSGSNGTHEVLKSKKPAFGCSEQRDR 1980 Query: 2076 VTPMKDRSSLLGLSKSTIIRESSSRPLVGRASEILRILKINLLDMDAALPEDAFRKSRTS 2255 + R +G+++ ++ +SS RPL+GR S+I R LK+NLLD+DAALPE+A R S+ Sbjct: 1981 KSSSHVRVPKVGINQCCVVPQSSLRPLIGRTSQIKRRLKVNLLDIDAALPEEALRPSKAH 2040 
Query: 2256 PDRRCAWRAFVKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYWSSPSTAARITTLSALAL 2435 +RR AWRAFVKS+++IYEMVQATI+ EDMIKTE+LRN+WWYWSS S AA+ +T+S+LAL Sbjct: 2041 LERRWAWRAFVKSAETIYEMVQATIILEDMIKTEFLRNEWWYWSSLSAAAKTSTMSSLAL 2100 Query: 2436 RIYSLDSAISYEE 2474 RIYSLD+AI Y++ Sbjct: 2101 RIYSLDAAIIYDK 2113 >ref|XP_007031430.1| Methyl-CpG-binding domain-containing protein 9, putative isoform 1 [Theobroma cacao] gi|590645754|ref|XP_007031431.1| Methyl-CpG-binding domain-containing protein 9, putative isoform 1 [Theobroma cacao] gi|508710459|gb|EOY02356.1| Methyl-CpG-binding domain-containing protein 9, putative isoform 1 [Theobroma cacao] gi|508710460|gb|EOY02357.1| Methyl-CpG-binding domain-containing protein 9, putative isoform 1 [Theobroma cacao] Length = 2225 Score = 534 bits (1376), Expect = e-149 Identities = 352/981 (35%), Positives = 510/981 (51%), Gaps = 112/981 (11%) Frame = +3 Query: 3 ERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPE 182 E +DLL S +P+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL RIPE Sbjct: 1269 EINDLLAST--SEIPKAPWDEGVCKVCGIDKDDDSVLLCDTCDAEYHTYCLNPPLARIPE 1326 Query: 183 GNWYCPSCVTGQSL-PSGTGYASGSNQRRKRRNQGEFSSKFLEELSRFAKLMGMKEHWEF 359 GNWYCPSCV + + + ++ +RR ++ QGE + +LE L+ ++ KE+W+F Sbjct: 1327 GNWYCPSCVLSKRMVQDASEHSQVIIRRRDKKYQGEVTRGYLEALAHLGAVLEEKEYWQF 1386 Query: 360 TVEERIFFLKFLLDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXXS 539 +++ERIF LKFL DE LNSA +R H+EQCA ++L KLRS E Sbjct: 1387 SIDERIFLLKFLCDELLNSALIRQHLEQCAE-TSELHQKLRSAYVEWKNLKSREDFVAAK 1445 Query: 540 AEKTNSGVLNVRGD---------LNSDAS--------SSQHASENITRGK---------- 638 A K ++ + N GD L SD S+++AS T Sbjct: 1446 AAKIDTSMSNAVGDVGVKDGDDWLPSDGGKEGADLNGSNKYASATYTEKNFTANGQTLNP 1505 Query: 639 --PSEKLVGDQSQPEKIIVKTSEG-----------PNWLSEKPISVQQPQSDQGHTSLLN 779 +L GDQ+ + V + + PN LS++ + + S QG L Sbjct: 1506 MDTEAQLKGDQAIVDASKVSSQKSDKSFRPSELLVPNHLSQEIENSSKETSFQGK---LE 1562 Query: 780 NVQSPLFSSPTSERETELVQCPNQG--DMPS-----SQLNNLKACTVKQEITNLLDSIAD 938 + +SP S + P+ +PS SQ ++L+ T+K +I L D I Sbjct: 1563 
ESKGMDVASPPSPSDCNGQFPPSDAAKQVPSVTENESQSHHLELNTIKNDIQRLQDLITS 1622 Query: 939 IELELVKISLRRDFLGRDSIGRVYWAFCYPGARP-------------------------- 1040 +E +L+K+S+R++FLG DS GR+YW PG P Sbjct: 1623 LESQLLKLSVRKEFLGSDSAGRLYWISAMPGGYPQVIVDGSLVLQKKRKFLGYEERVQNT 1682 Query: 1041 --WVIACGGT-------AHKERCPRDFSS---IPDSDKWMYYESDSEIEKLVGWLRENNV 1184 W A GT K CP ++S I W+ Y++++EIE L+ WL +N Sbjct: 1683 FIWNSASAGTDNGMKAEGSKASCPFLYNSKDAISVGSPWVTYQTEAEIEGLIDWLNDNEP 1742 Query: 1185 REKELKESISQFQANKLKDSEY------TEDHILKKREINHGGRKTVSADFLATNAMNAL 1346 +EKELKE+I Q KLK +Y +D ++ G K + FL T A L Sbjct: 1743 KEKELKEAILQ----KLKFQDYQKMKNQDQDECQTAFSMSSGSDKGSFSSFLGTKAAMLL 1798 Query: 1347 EKKFGPCKRIETTAVPQNLVMGASQSGG--MHRCECLEMLWPSKDHCGSCHQSFSTSEEL 1520 EKK+GPC + E T + A G M+RC+CLE +WPS++HC SCH++F + E Sbjct: 1799 EKKYGPCFKSEITDSLKKRGKKARVINGDKMYRCKCLEPIWPSRNHCISCHKTFFSDVEF 1858 Query: 1521 RQHVKENCKAVASGSKRSQAAEDMSKRKKARNA-VSQEKRPASVGIPQGSTLEKQIDGSA 1697 H C + +++S + D K K N +++ + I + S S Sbjct: 1859 EDHNDGKCNLGSPLNEKSTSVGDSLKGKGNMNIDINRVDCTVDMEIVETSKSGHSELSSR 1918 Query: 1698 SVESYNAD--CPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSALT 1871 ++ N CP+NFEEI T+F+ S ++ V EIGLIGS G+PSF+ S +VSDS L Sbjct: 1919 LIKFQNEGLVCPYNFEEISTKFVTRDSNEELVREIGLIGSNGVPSFVSSVSHFVSDSTLM 1978 Query: 1872 VGLERTNEASSRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSRSAENGLSDELSIVGRLKS 2051 +R Q+ + GD + ++ +RS NG+++ LS +S Sbjct: 1979 T--------------VRPHQERGDLGDKLKATEMPGFSQGNRSVANGINERLSDNSFRRS 2024 Query: 2052 ILT---------------SEKDQVTPMKDRSSLLGLSKSTIIRESSSRPLVGRASEILRI 2186 + + ++D+++ S LG+ + ++ +SS RPLVG+ S+I R Sbjct: 2025 VASEIEVQRTIRPALRCLEQRDRISSADKYSPELGIGRCCVVPQSSLRPLVGKVSQISRQ 2084 Query: 2187 LKINLLDMDAALPEDAFRKSRTSPDRRCAWRAFVKSSKSIYEMVQATIVFEDMIKTEYLR 2366 LKINLLDMDAAL E+A R S+ +RR AWR+FVKS+++IYEMVQATIV EDMIKTEYLR Sbjct: 2085 LKINLLDMDAALSEEALRPSKACMERRWAWRSFVKSAETIYEMVQATIVLEDMIKTEYLR 2144 Query: 2367 NDWWYWSSPSTAARITTLSALALRIYSLDSAISYEEPLPTAAAMEVSEPCNAIEEEMQKS 2546 N+WWYWSS S A +I+T+S+LALRIYSLDSAI YE+ +++ +P + + ++ + Sbjct: 2145 
NEWWYWSSLSAAVKISTVSSLALRIYSLDSAIIYEKSF-EFHSIDNLKPSSIPDPKLLPN 2203 Query: 2547 PTLKNLASPSSPTLLKTPEPD 2609 L S T K EP+ Sbjct: 2204 LDLAEKCKVSRKTSKKRKEPE 2224 >ref|XP_002884279.1| methyl-CpG-binding domain 9 [Arabidopsis lyrata subsp. lyrata] gi|297330119|gb|EFH60538.1| methyl-CpG-binding domain 9 [Arabidopsis lyrata subsp. lyrata] Length = 2183 Score = 520 bits (1338), Expect = e-144 Identities = 320/924 (34%), Positives = 481/924 (52%), Gaps = 65/924 (7%) Frame = +3 Query: 3 ERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPE 182 E D++V V + LP+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL+RIPE Sbjct: 1269 EIKDIVVSV--NKLPKAPWDEGVCKVCGVDKDDDSVLLCDTCDAEYHTYCLNPPLIRIPE 1326 Query: 183 GNWYCPSCVTGQSLPSGTGYASGSNQRRK-RRNQGEFSSKFLEELSRFAKLMGMKEHWEF 359 GNWYCPSCV + + + +RRK R+ QG+ + +E + A +M K++WEF Sbjct: 1327 GNWYCPSCVIAKRMAQEALESYKLVRRRKGRKYQGQLTRTSMEMTAHLADVMEEKDYWEF 1386 Query: 360 TVEERIFFLKFLLDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXXS 539 + EERI LK L DE L+S+ V H+EQCA ++Q KLRSL+SE Sbjct: 1387 SAEERILLLKLLCDELLSSSLVHQHLEQCAEAIIEMQQKLRSLSSEWKNAKMRQEFLTAK 1446 Query: 540 AEKTNSGVLNVRGDL-NSDASSSQHASENITRGKPSEKLVGDQSQPEKIIVKTSEGPNWL 716 K +L G+ NS + Q + + + + D S + + P Sbjct: 1447 LAKVEPSILKEVGEPHNSGHFADQMGCDQRPQEGVGDGVTHDDSSTAYLNKNKGKAPLET 1506 Query: 717 SEKPISVQQPQSDQGHTSLLNNVQSP-LFSSP---------------------------- 809 +P Q Q + H + + + SP SSP Sbjct: 1507 DSQPGEFQDSQPGESHVNFESKISSPETISSPGRHEKPIADTSPHVTDNPSFEKYTSETL 1566 Query: 810 ---------TSERETELVQCPNQGDMPSSQLNNLKAC-----TVKQEITNLLDSIADIEL 947 T + V+ P D S L+AC EI NL SI IE Sbjct: 1567 HKSVGRNHETHSLNSNAVEIPTAHDASSQASQELQACLQDLNATSHEIHNLQQSIRSIES 1626 Query: 948 ELVKISLRRDFLGRDSIGRVYWAFCYPGARPWVIACGGTAHKERCPRDF--SSIPDS--- 1112 +L+K S+RRDFLG D+ GR+YW C+P P ++ G + ++ D S +P Sbjct: 1627 QLLKQSIRRDFLGNDASGRLYWGCCFPDENPRILVDGSISLQKPVQADLMGSKVPSPFLH 1686 Query: 1113 ---------DKWMYYESDSEIEKLVGWLRENNVREKELKESISQFQANKLKDSEYTEDHI 1265 W YYE+++EI +LV WL +++++E++L+ESI ++ + D + Sbjct: 1687 
AVDHGRLRLSPWTYYETETEISELVQWLHDDDLKERDLRESILCWKRLRFGD-------V 1739 Query: 1266 LKKREINHGGRKTVSADFLATNAMNALEKKFGPCKRIET-TAVPQNLVMGASQSGGMHRC 1442 K+++ + A L T A ++EKK+GPC ++ET T + SQ + RC Sbjct: 1740 QKEKKQAQNLSAPILARGLETKAAMSMEKKYGPCIKLETETLKKRGKKTKVSQREKLCRC 1799 Query: 1443 ECLEMLWPSKDHCGSCHQSFSTSEELRQHVKENCKAVASGSKRSQAAEDMSKRKKA-RNA 1619 ECLE + PS HC CH++F++ +E +H + C + ++ S+ D SK K++ ++ Sbjct: 1800 ECLESILPSMIHCLICHKTFASDDEFEEHTESKCIPYSLATEESKEISDSSKAKESLKSD 1859 Query: 1620 VSQEKRPASVGIPQGSTLEKQIDGSASVESYNADCPFNFEEIMTRFIVPSSVKDGVNEIG 1799 K A + + S + + G + + P++FEEI ++F+ S +D V EIG Sbjct: 1860 YLNVKSSAGKAVGEISNVSELDSGLIRYQEEESISPYHFEEICSKFVTKDSNRDLVKEIG 1919 Query: 1800 LIGSGGLPSFLQGQSPYVSDSALTVGLERTNEASSRSNDLRSKQKNAEPGDVMNGRGFKD 1979 LIGS G+P+FL S + +DS L N +K + GD + G + Sbjct: 1920 LIGSNGIPTFLPASSTHHNDSVLI-------------NANPNKLDGGDSGDQVIFAG-PE 1965 Query: 1980 SNRSSRSAENGLSDELSIV----GRLKSILTSEKDQVTPMKDRSSLLGLSKSTIIRESSS 2147 +N ++E+ LS + S+ G L + +SS GL ++ +++ Sbjct: 1966 TNVEGLNSESNLSFDGSVTDNHGGPLNKLTGLGFGFSEQKNKKSSGSGLKSCCVVPQAAL 2025 Query: 2148 RPLVGRASEILRILKINLLDMDAALPEDAFRKSRTSPDRRCAWRAFVKSSKSIYEMVQAT 2327 + + G+A + R LK NLLDMD ALPE+A R S++ PDRR AWR FVKS++SIYE+VQAT Sbjct: 2026 KRITGKALPVFRFLKTNLLDMDVALPEEALRPSKSHPDRRRAWRVFVKSAQSIYELVQAT 2085 Query: 2328 IVFEDMIKTEYLRNDWWYWSSPSTAARITTLSALALRIYSLDSAISYEEPLPTAAAMEVS 2507 V EDMIKTEYL+N+WWYWSS S AA+I+TLSAL++RI+SLD+AI Y++P+ + + + Sbjct: 2086 FVVEDMIKTEYLKNEWWYWSSLSAAAKISTLSALSVRIFSLDAAIIYDKPITPSDHNDET 2145 Query: 2508 EPCNAIEEEMQKSPTLKNLASPSS 2579 +P I QKS + + SS Sbjct: 2146 KP--IISSPDQKSQPVSDSQEKSS 2167 >ref|XP_004167238.1| PREDICTED: LOW QUALITY PROTEIN: methyl-CpG-binding domain-containing protein 9-like [Cucumis sativus] Length = 1277 Score = 517 bits (1331), Expect = e-143 Identities = 332/925 (35%), Positives = 501/925 (54%), Gaps = 81/925 (8%) Frame = +3 Query: 3 ERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPE 182 E D LV + + +P+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL RIPE Sbjct: 342 
EVDGFLVSL--NEIPKAPWDEGVCKVCGIDKDDDSVLLCDTCDAEYHTYCLNPPLARIPE 399 Query: 183 GNWYCPSCVTGQSLPSGTGYASGS--NQRRKRRNQGEFSSKFLEELSRFAKLMGMKEHWE 356 GNWYCPSCV G + + N + ++ +GE + FL +L+ A + KE+WE Sbjct: 400 GNWYCPSCVMGTRMVEDPSEHTKHIINLHKGKKFRGEVTRDFLNKLANLAAALEEKEYWE 459 Query: 357 FTVEERIFFLKFLLDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXX 536 F+V+ER+F LK+L DE L+SA +R H+EQC A+LQ KLRS E Sbjct: 460 FSVDERLFLLKYLCDELLSSALIRQHLEQCVEALAELQQKLRSCFIEWKNLKCREEVVAA 519 Query: 537 SAEKTNSGVLNV----RGDLN------SDASSSQHASENITRGKPS--EKL-----VGDQ 665 A K ++ +L+ +G + SD SS + EN S E++ V D Sbjct: 520 RAAKLDTTMLSAVREGQGSCDGARLGASDQYSSLTSLENKCHNHASFQEQMSSAHDVTDN 579 Query: 666 SQPEKIIVKTSEGPNWLSEKPISVQQPQSDQGHTSLLNNVQSPLFSSPTSERETELVQCP 845 + ++ +S N S KP+ +P L+ + + S S ETE+ P Sbjct: 580 NDAGGNVLSSSGSQN--SGKPVKFNEPS--------LSGLPQEVDGSDQSNMETEISILP 629 Query: 846 N-----------------QGDMPS-SQLNNLKACTVKQEITNLLDSIADIELELVKISLR 971 + Q P+ SQ + + ++K++I + DSIA ELEL+KIS+R Sbjct: 630 SGKQYFTPCDANGVPVAPQVPPPNESQAYHSELDSIKKDILQVQDSIASTELELLKISVR 689 Query: 972 RDFLGRDSIGRVYWAFCYPGARPWVIACGGTAH----------KERCPRDFSSIPDSDKW 1121 R+FLG D+ GR+YWA P +I+ G + H K R ++++S +++ Sbjct: 690 REFLGSDAAGRLYWASVMSNGLPQIISSGSSVHIGSESRDRVVKGRFFKNYTSTSNANSS 749 Query: 1122 -----MY------------------YESDSEIEKLVGWLRENNVREKELKESISQFQANK 1232 MY Y+++++I +L+ WL++++ +E+ELKESI Q+ K Sbjct: 750 TLNSNMYSSLLHLPKDFIGNSPCISYQTEADILELIDWLKDSDPKERELKESILQWLKPK 809 Query: 1233 LKDSEYTEDHI----LKKREINHGGRKTVSADFLATNAMNALEKKFGPCKRIETTAVPQN 1400 L+ S + + LK + K + FL A LE K+GP T Sbjct: 810 LQTSSRSNNQSPEEQLKDSSSSSDVEKLECSGFLVNRASALLESKYGPFLEFVTPDDLNR 869 Query: 1401 LVMGA--SQSGGMHRCECLEMLWPSKDHCGSCHQSFSTSEELRQHVKENCKAVASGSKRS 1574 + A ++ M RC C+E +WPS+ HC SCH+SFST EL +H C ++ + Sbjct: 870 WLDKARLAEDEKMFRCVCMEPVWPSRYHCLSCHKSFSTDVELEEHDNGQCSSLPASCDGI 929 Query: 1575 QAAEDMSKRKKARNAVSQEKRPASVGIPQGSTLEKQIDGSASVESYNAD---CPFNFEEI 1745 + D SK K S+++ +S+ I + T + S + Y D CP++FE I Sbjct: 930 KEVGDSSKSKCNIKFESKQEESSSMVIAE--TSRGYFNHSMGLIKYQNDGMMCPYDFELI 987 
Query: 1746 MTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSALTVGLERTNEASSRSNDLRS 1925 ++F+ S KD + EIGLI S G+PSFL SPY+ +S L V + + ++ L S Sbjct: 988 CSKFLTKDSNKDLIKEIGLISSNGVPSFLSSVSPYIMESTLNVIDLKKDSSTPEDGTLLS 1047 Query: 1926 KQKNAEPGDVMNGRGFKDSNRSSRSAENGLSDELSI--VGRLKSILTSEKDQVTPMKDRS 2099 + + E +++ G S+ S + +E+S RL + K + + M +R Sbjct: 1048 EWPSLE--NIILENGCHQSSSIDSSIQKPAGNEISAPKTKRLAAGCLEPKSKKSXMDNRF 1105 Query: 2100 SLLGLSKSTIIRESSSRPLVGRASEILRILKINLLDMDAALPEDAFRKSRTSPDRRCAWR 2279 S G+ + +I +SS RPLVG+ +++R LK+NLLDMDAALP++A + S+ +RR AWR Sbjct: 1106 SEFGIGRCFVIPQSSQRPLVGKILQVVRGLKMNLLDMDAALPDEALKPSKLHIERRWAWR 1165 Query: 2280 AFVKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYWSSPSTAARITTLSALALRIYSLDSA 2459 AFVKS+ +IYEMVQATI EDMI+TEYL+N+WWYWSS S AA+I+T+S+LALRI+SLD+A Sbjct: 1166 AFVKSAGTIYEMVQATIALEDMIRTEYLKNEWWYWSSLSAAAKISTVSSLALRIFSLDAA 1225 Query: 2460 ISYEEPLPTAAAMEVSEPCNAIEEE 2534 I YE+ P + + + ++I E+ Sbjct: 1226 IIYEKISPNQDSNDYLDTTSSIPEQ 1250 >ref|XP_004141185.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like [Cucumis sativus] Length = 2131 Score = 516 bits (1330), Expect = e-143 Identities = 334/926 (36%), Positives = 501/926 (54%), Gaps = 82/926 (8%) Frame = +3 Query: 3 ERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPE 182 E D LV + + +P+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL RIPE Sbjct: 1195 EVDGFLVSL--NEIPKAPWDEGVCKVCGIDKDDDSVLLCDTCDAEYHTYCLNPPLARIPE 1252 Query: 183 GNWYCPSCVTGQSL---PSGTGYASGSNQRRKRRNQGEFSSKFLEELSRFAKLMGMKEHW 353 GNWYCPSCV G + PS N + ++ +GE + FL +L+ A + KE+W Sbjct: 1253 GNWYCPSCVMGTRMVEDPSEHTKNHIINLHKGKKFRGEVTRDFLNKLANLAAALEEKEYW 1312 Query: 354 EFTVEERIFFLKFLLDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXX 533 EF+V+ER+F LK+L DE L+SA +R H+EQC A+LQ KLRS E Sbjct: 1313 EFSVDERLFLLKYLCDELLSSALIRQHLEQCVEALAELQQKLRSCFIEWKNLKCREEVVA 1372 Query: 534 XSAEKTNSGVLNV----RGDLN------SDASSSQHASENITRGKPS--EKL-----VGD 662 A K ++ +L+ +G + SD SS + EN S E++ V D Sbjct: 1373 ARAAKLDTTMLSAVREGQGSCDGARLGASDQYSSLTSLENKCHNHASFQEQMSSAHDVTD 1432 Query: 663 
QSQPEKIIVKTSEGPNWLSEKPISVQQPQSDQGHTSLLNNVQSPLFSSPTSERETELVQC 842 + ++ +S N S KP+ +P L+ + + S S ETE+ Sbjct: 1433 NNDAGGNVLSSSGSQN--SGKPVKFNEPS--------LSGLPQEVDGSDQSNMETEISIL 1482 Query: 843 PN-----------------QGDMPS-SQLNNLKACTVKQEITNLLDSIADIELELVKISL 968 P+ Q P+ SQ + + ++K++I + DSIA ELEL+KIS+ Sbjct: 1483 PSGKQYFTPCDANGVPVAPQVPPPNESQAYHSELDSIKKDILQVQDSIASTELELLKISV 1542 Query: 969 RRDFLGRDSIGRVYWAFCYPGARPWVIACGGTAH----------KERCPRDFSSIPDSDK 1118 RR+FLG D+ GR+YWA P +I+ G + H K R ++++S +++ Sbjct: 1543 RREFLGSDAAGRLYWASVMSNGLPQIISSGSSVHIGSESRDRVVKGRFFKNYTSTSNANS 1602 Query: 1119 W-----MY------------------YESDSEIEKLVGWLRENNVREKELKESISQFQAN 1229 MY Y+++++I +L+ WL++++ +E+ELKESI Q+ Sbjct: 1603 STLNSNMYSSLLHLPKDFIGNSPCISYQTEADILELIDWLKDSDPKERELKESILQWLKP 1662 Query: 1230 KLKDSEYTEDHI----LKKREINHGGRKTVSADFLATNAMNALEKKFGPCKRIETTAVPQ 1397 KL+ S + + LK + K + FL A LE K+GP T Sbjct: 1663 KLQTSSRSNNQSPEEQLKDSSSSSDVEKLECSGFLVNRASALLESKYGPFLEFVTPDDLN 1722 Query: 1398 NLVMGA--SQSGGMHRCECLEMLWPSKDHCGSCHQSFSTSEELRQHVKENCKAVASGSKR 1571 + A ++ M RC C+E +WPS+ HC SCH+SFST EL +H C ++ + Sbjct: 1723 RWLDKARLAEDEKMFRCVCMEPVWPSRYHCLSCHRSFSTDVELEEHDNGQCSSLPASCDG 1782 Query: 1572 SQAAEDMSKRKKARNAVSQEKRPASVGIPQGSTLEKQIDGSASVESYNAD---CPFNFEE 1742 + D SK K S+++ +S+ I + T + S + Y D CP++FE Sbjct: 1783 IKEVGDSSKSKCNIKFESKQEESSSMVIAE--TSRGYFNHSMGLIKYQNDGMMCPYDFEL 1840 Query: 1743 IMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSALTVGLERTNEASSRSNDLR 1922 I ++F+ S KD + EIGLI S G+PSFL SPY+ +S L V + + ++ L Sbjct: 1841 ICSKFLTKDSNKDLIKEIGLISSNGVPSFLSSVSPYIMESTLNVIDLKKDSSTPEDGTLL 1900 Query: 1923 SKQKNAEPGDVMNGRGFKDSNRSSRSAENGLSDELSI--VGRLKSILTSEKDQVTPMKDR 2096 S+ + E +++ G S+ S + +E+S RL + K + M +R Sbjct: 1901 SEWPSLE--NIILENGCHQSSSIDSSIQKPAGNEISAPKTKRLAAGCLEPKSKKICMDNR 1958 Query: 2097 SSLLGLSKSTIIRESSSRPLVGRASEILRILKINLLDMDAALPEDAFRKSRTSPDRRCAW 2276 S G+ + +I +SS RPLVG+ +++R LK+NLLDMDAALP++A + S+ +RR AW Sbjct: 1959 FSEFGIGRCFVIPQSSQRPLVGKILQVVRGLKMNLLDMDAALPDEALKPSKLHIERRWAW 2018 Query: 2277 
RAFVKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYWSSPSTAARITTLSALALRIYSLDS 2456 RAFVKS+ +IYEMVQATI EDMI+TEYL+N+WWYWSS S AA+I+T+S+LALRI+SLD+ Sbjct: 2019 RAFVKSAGTIYEMVQATIALEDMIRTEYLKNEWWYWSSLSAAAKISTVSSLALRIFSLDA 2078 Query: 2457 AISYEEPLPTAAAMEVSEPCNAIEEE 2534 AI YE+ P + + + ++I E+ Sbjct: 2079 AIIYEKISPNQDSNDYLDTTSSIPEQ 2104 >gb|EXC31622.1| Methyl-CpG-binding domain-containing protein 9 [Morus notabilis] Length = 2259 Score = 512 bits (1319), Expect = e-142 Identities = 341/969 (35%), Positives = 493/969 (50%), Gaps = 136/969 (14%) Frame = +3 Query: 36 SVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPEGNWYCPSCVTG 215 +V+P+APWDEG+CKVCG+D+DDD+VLLCD CD+EYH YCL+PPLLRIPEGNWYCPSCV G Sbjct: 1280 NVIPKAPWDEGVCKVCGIDRDDDSVLLCDTCDAEYHTYCLNPPLLRIPEGNWYCPSCVVG 1339 Query: 216 ----QSLPSGTGYASGSNQRRKRRNQGEFSSKFLEELSRFAKLMGMKEHWEFTVEE---- 371 Q +P QR ++ QGE + +LE L+ A M KE+WEF+V+E Sbjct: 1340 RRTVQDVPENVQVI---RQRSGKKYQGEVTRVYLEALAHLATKMEEKEYWEFSVDESMLL 1396 Query: 372 ------------------------------------RIFFLKFLLDEALNSATVRDHMEQ 443 R F +KFL DE LNSA +R H+EQ Sbjct: 1397 LRPTLRKGRPGEGRLGKARVGHPEWAAVDVGVGSVVRSFLMKFLCDELLNSAIIRQHLEQ 1456 Query: 444 CASRAADLQNKLRSLTSEXXXXXXXXXXXXXSAEKTNSGVLN------VRGDLNSDASSS 605 CA + +LQ KLR+L E A K + +LN +R L S+ + Sbjct: 1457 CADTSTELQQKLRALFVEWKILKSREEILVARAAKHDPNILNSLGAVGIRESLFSNHNKG 1516 Query: 606 QH---ASENITRGKPSEKLVGDQSQPEKIIVKTSEGPNWLSEKPISVQQP-----QSDQG 761 Q + + G ++ L E I + + ++ + Q P Q Sbjct: 1517 QTPALSDRSNCCGMSTDDLSTLGGGREAIEPSGLDRSSSATDSQSNCQNPLDTEDQLKDA 1576 Query: 762 HTS------LLNNVQSPLFSSPTSERETELV--------------QCPNQGDMPSSQLNN 881 H S +LN + + ++ E V N D+ S+ + Sbjct: 1577 HASVEESNTVLNEADASCGAICSTGNPHESVGKDSSSTLKPVGQHGHSNASDVRSTIGQS 1636 Query: 882 LKACTVKQ-------------EITNLLDSIADIELELVKISLRRDFLGRDSIGRVYWA-- 1016 + A TV + +IT L +SI +E EL+K+S+RR+FLG D +G +YW Sbjct: 1637 VPAATVNELQGHHVELKSVKNDITILEESITSVESELLKVSVRREFLGSDFVGCLYWVSG 1696 Query: 1017 ------------------------FCYPGARPWVIACGGTAHKERCPRDFSSIPDSDKWM 1124 F P + V+ C + +C R+ S + W+ Sbjct: 1697 
TPTGSSCIIVDRSAALRSGKKMNNFQRPVGKSSVLQCSIQSVPIQCERN-SVVASDSPWV 1755 Query: 1125 YYESDSEIEKLVGWLRENNVREKELKESISQFQANKLKDSEYTEDHILKKRE-----INH 1289 Y++D +I++LV L+ N+ +E+ELKESI +Q KL+ E+ ++ I + E + Sbjct: 1756 SYQTDGDIDQLVSCLKTNDTKERELKESILHWQ--KLRFQEFQKNKIRGQAECAAFAASI 1813 Query: 1290 GGRKTVSADFLATNAMNALEKKFGPCKRIETTAVPQNLVMGA--SQSGGMHRCECLEMLW 1463 G K +D L T A N LEK++GPC ++ETT + + A + M+RCECLE++W Sbjct: 1814 SGEKATFSDGLVTRAANLLEKRYGPCNQLETTDILKKRGKKARLTDDNKMYRCECLELIW 1873 Query: 1464 PSKDHCGSCHQSFSTSEELRQHVKENCKAVASGSKRSQAAEDMSKRKKA----RNAVSQE 1631 P + HC SCH++F EL H + C +VA ++ + D SK K + N Sbjct: 1874 PCRHHCLSCHRTFFNDIELEGHNEGKCNSVALAQEKRKEISDSSKAKDSLKSDANREDST 1933 Query: 1632 KRPASVGIPQGSTLEKQIDGSASVESYNADCPFNFEEIMTRFIVPSSVKDGVNEIGLIGS 1811 + V IP+ E + CP++FEEI ++F+ S KD V EIGLIGS Sbjct: 1934 GEMSRVEIPKTGFSELSAK-LIKFQDEGLSCPYDFEEICSKFVTKDSCKDLVQEIGLIGS 1992 Query: 1812 GGLPSFLQGQSPYVSDSALT-------VGLE-RTNEASSRSNDLRSKQKNAEPGDVMNGR 1967 G+PSF+ SP + DS L VG + +EA+ R L + D+++ R Sbjct: 1993 KGVPSFVSSMSPCLDDSTLALISPQKDVGAQGGGSEAAERPVSLGTGTITIAGWDILSDR 2052 Query: 1968 GFKDSNRSSRSAENGLSDELSIVGRLKSILTSEKDQVTPMKDRSSLLGLSKSTIIRESSS 2147 K RS+ N + + +G ++ +++ + SS +G ++ ++ + S Sbjct: 2053 SPK---RSAMKEINAVKSQRLTLGYIE-----QREGIRCSGSHSSEMGATRCCVVPQFSL 2104 Query: 2148 RPLVGRASEILRILKINLLDMDAALPEDAFRKSRTSPDRRCAWRAFVKSSKSIYEMVQAT 2327 RPLVG+ S+I R LKINLLDMDAALPE+A R S++ RR AWRAFVKS+ +IYEMVQAT Sbjct: 2105 RPLVGKVSQIYRRLKINLLDMDAALPEEALRPSKSHLGRRWAWRAFVKSATTIYEMVQAT 2164 Query: 2328 IVFEDMIKTEYLRNDWWYWSSPSTAARITTLSALALRIYSLDSAISYEEPLPTAAAMEVS 2507 IV EDMIKTEYL+N+WWYWSS S AAR +T+S+LALRIYSLD+AI YE+ + + S Sbjct: 2165 IVLEDMIKTEYLKNEWWYWSSFSAAARTSTMSSLALRIYSLDAAIIYEKISSESDPTDKS 2224 Query: 2508 EPCNAIEEE 2534 EP N E++ Sbjct: 2225 EPSNLSEQK 2233 >ref|NP_186795.1| methyl-CPG-binding domain 9 [Arabidopsis thaliana] gi|75337201|sp|Q9SGH2.1|MBD9_ARATH RecName: Full=Methyl-CpG-binding domain-containing protein 9; Short=AtMBD9; Short=MBD09; AltName: Full=Histone acetyl transferase MBD9; 
AltName: Full=Methyl-CpG-binding protein MBD9 gi|6692266|gb|AAF24616.1|AC010870_9 unknown protein [Arabidopsis thaliana] gi|332640148|gb|AEE73669.1| methyl-CPG-binding domain 9 [Arabidopsis thaliana] Length = 2176 Score = 509 bits (1312), Expect = e-141 Identities = 319/933 (34%), Positives = 486/933 (52%), Gaps = 62/933 (6%) Frame = +3 Query: 3 ERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPE 182 E D++V V + LP+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL+RIP+ Sbjct: 1269 EIKDIVVSV--NKLPKAPWDEGVCKVCGVDKDDDSVLLCDTCDAEYHTYCLNPPLIRIPD 1326 Query: 183 GNWYCPSCVTGQSLPSGTGYASGSNQRRK-RRNQGEFSSKFLEELSRFAKLMGMKEHWEF 359 GNWYCPSCV + + + +RRK R+ QGE + +E + A +M K++WEF Sbjct: 1327 GNWYCPSCVIAKRMAQEALESYKLVRRRKGRKYQGELTRASMELTAHLADVMEEKDYWEF 1386 Query: 360 TVEERIFFLKFLLDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXXS 539 + EERI LK L DE L+S+ V H+EQCA ++Q KLRSL+SE Sbjct: 1387 SAEERILLLKLLCDELLSSSLVHQHLEQCAEAIIEMQQKLRSLSSEWKNAKMRQEFLTAK 1446 Query: 540 AEKTNSGVLNVRGD----------LNSDASSSQHASENITRGKPSEKLVGDQSQPEKIIV 689 K +L G+ + D + + +TR + K + Sbjct: 1447 LAKVEPSILKEVGEPHNSSYFADQMGCDPQPQEGVGDGVTRDDETSSTAYLNKNQGKSPL 1506 Query: 690 KTSEGPNW----LSEKPISVQQPQSDQGHTSLLNNVQSPLFSSPTSERETE--------- 830 +T P E IS + S G L SPL + E++T Sbjct: 1507 ETDTQPGESHVNFGESKISSPETISSPGRHELPIADTSPLVTDNLPEKDTSETLLKSVGR 1566 Query: 831 ----------LVQCPNQGDMPSSQLNNLKAC-----TVKQEITNLLDSIADIELELVKIS 965 V+ P D S L+AC EI NL SI IE +L+K S Sbjct: 1567 NHETHSPNSNAVELPTAHDASSQASQELQACQQDLSATSNEIQNLQQSIRSIESQLLKQS 1626 Query: 966 LRRDFLGRDSIGRVYWAFCYPGARPWVIACGGTAHKERCPRDF--SSIPDS--------- 1112 +RRDFLG D+ GR+YW C+P P ++ G + ++ D S +P Sbjct: 1627 IRRDFLGTDASGRLYWGCCFPDENPRILVDGSISLQKPVQADLIGSKVPSPFLHTVDHGR 1686 Query: 1113 ---DKWMYYESDSEIEKLVGWLRENNVREKELKESISQFQANKLKDSEYTEDHILKKREI 1283 W YYE+++EI +LV WL +++++E++L+ESI ++ + D + K+++ Sbjct: 1687 LRLSPWTYYETETEISELVQWLHDDDLKERDLRESILWWKRLRYGD-------VQKEKKQ 1739 Query: 1284 NHGGRKTVSADFLATNAMNALEKKFGPCKRIET-TAVPQNLVMGASQSGGMHRCECLEML 1460 V A L T A ++EK++GPC ++E T + 
++ + RCECLE + Sbjct: 1740 AQNLSAPVFATGLETKAAMSMEKRYGPCIKLEMETLKKRGKKTKVAEREKLCRCECLESI 1799 Query: 1461 WPSKDHCGSCHQSFSTSEELRQHVKENCKAVASGSKRSQAAEDMSKRKKA-RNAVSQEKR 1637 PS HC CH++F++ +E H + C + ++ + D SK K++ ++ K Sbjct: 1800 LPSMIHCLICHKTFASDDEFEDHTESKCIPYSLATEEGKDISDSSKAKESLKSDYLNVKS 1859 Query: 1638 PASVGIPQGSTLEKQIDGSASVESYNADCPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGG 1817 A + + S + + G + + P++FEEI ++F+ +D V EIGLI S G Sbjct: 1860 SAGKDVAEISNVSELDSGLIRYQEEESISPYHFEEICSKFVTKDCNRDLVKEIGLISSNG 1919 Query: 1818 LPSFLQGQSPYVSDSALTVGLERTNEASSRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSR 1997 +P+FL S +++DS L S++SN K + GD + G ++N Sbjct: 1920 IPTFLPSSSTHLNDSVLI---------SAKSN----KPDGGDSGDQVIFAG-PETNVEGL 1965 Query: 1998 SAENGLSDELSIVGRLKSILTSEKDQVTPMKD----RSSLLGLSKSTIIRESSSRPLVGR 2165 ++E+ +S + S+ L + +SS GL ++ +++ + + G+ Sbjct: 1966 NSESNMSFDRSVTDSHGGPLDKPSGLGFGFSEQKNKKSSGSGLKSCCVVPQAALKRVTGK 2025 Query: 2166 ASEILRILKINLLDMDAALPEDAFRKSRTSPDRRCAWRAFVKSSKSIYEMVQATIVFEDM 2345 A R LK NLLDMD ALPE+A R S++ P+RR AWR FVKSS+SIYE+VQATIV EDM Sbjct: 2026 ALPGFRFLKTNLLDMDVALPEEALRPSKSHPNRRRAWRVFVKSSQSIYELVQATIVVEDM 2085 Query: 2346 IKTEYLRNDWWYWSSPSTAARITTLSALALRIYSLDSAISYEEPLPTAAAMEVSEPCNAI 2525 IKTEYL+N+WWYWSS S AA+I+TLSAL++RI+SLD+AI Y++P+ + ++ ++P ++ Sbjct: 2086 IKTEYLKNEWWYWSSLSAAAKISTLSALSVRIFSLDAAIIYDKPITPSNPIDETKPIISL 2145 Query: 2526 EEEMQKSPTLKNLASPSSPTL---LKTPEPDSS 2615 + QKS + + SS K EP+ S Sbjct: 2146 PD--QKSQPVSDSQERSSRVRRSGKKRKEPEGS 2176 >ref|XP_006296811.1| hypothetical protein CARUB_v10012794mg [Capsella rubella] gi|482565520|gb|EOA29709.1| hypothetical protein CARUB_v10012794mg [Capsella rubella] Length = 2177 Score = 505 bits (1301), Expect = e-140 Identities = 313/915 (34%), Positives = 486/915 (53%), Gaps = 56/915 (6%) Frame = +3 Query: 3 ERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPE 182 E D++V + + LP+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL+RIP+ Sbjct: 1272 EIKDIIVSI--NKLPKAPWDEGVCKVCGVDKDDDSVLLCDTCDAEYHTYCLNPPLIRIPD 1329 Query: 183 
GNWYCPSCVTGQSLPSGTGYASGSNQRRK-RRNQGEFSSKFLEELSRFAKLMGMKEHWEF 359 GNWYCPSCV + + + +RRK R+ QGE + +E + A +M K++WEF Sbjct: 1330 GNWYCPSCVIAKRMAQEALESYKLVRRRKGRKYQGELTQASMEMTAHLAGVMEEKDYWEF 1389 Query: 360 TVEERIFFLKFLLDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXXS 539 +VEERI LK L DE L+S+ V H+EQCA ++Q KLRSL+SE Sbjct: 1390 SVEERILLLKVLCDELLSSSLVHQHLEQCAEAIIEMQQKLRSLSSEWKNAKMRQEFLMAK 1449 Query: 540 AEKTNSGVLNVRGDLNS----------DASSSQHASENITRGKPSEKLVGDQSQPEKIIV 689 K +L +L++ D + + + +T + K + Sbjct: 1450 LAKVEPSILKEASELHNSSHFADQMGCDERTHEGVGDGVTHDDETSSTAFLNKNQGKAPL 1509 Query: 690 KTSEGPNWL-----SEKPISVQQPQSDQGHTSLLNNVQSPLFSSPTSERET--------- 827 +T+ P L K S ++ S H L+ ++ + T E++T Sbjct: 1510 ETNSQPGDLHVDSGGNKVSSQKKITSPGRHELLVADISPRATDNLTFEKDTLHKSVGRIH 1569 Query: 828 -------ELVQCPNQGDMPSSQLNNLKAC-----TVKQEITNLLDSIADIELELVKISLR 971 V+ + D S L+AC EI NL SI +E +L+K S+R Sbjct: 1570 ETHPLHSNAVELQSVHDASSQASQELQACQQDLNATSNEIQNLQLSIRSVESQLLKQSIR 1629 Query: 972 RDFLGRDSIGRVYWAFCYPGARPWVIACGGTAHKE---------RCPRDFSSIPDSDK-- 1118 RDFLG DS GR+YW C+P P V+ G + ++ R P F D + Sbjct: 1630 RDFLGNDSSGRLYWGCCFPDENPRVLVDGSISLQKPVQANLTGSRAPSPFLQAVDHGRLT 1689 Query: 1119 ---WMYYESDSEIEKLVGWLRENNVREKELKESISQFQANKLKDSEYTEDHILKKREINH 1289 W YYE++SEI +LV WL +++ +E++L+ESI ++ + D + K++E Sbjct: 1690 LSPWTYYETESEISELVQWLHDDDPKERDLRESILCWKRLRFGD-------VQKEKENAE 1742 Query: 1290 GGRKTVSADFLATNAMNALEKKFGPCKRIETTAVPQNLVMGASQSGGMHRCECLEMLWPS 1469 + + L T A ++EK+FGPC ++ET + + + RCECLE + PS Sbjct: 1743 NLSSPIFSRGLVTKAAMSMEKRFGPCIKLETETLKKRGKKTKVEREKFCRCECLEAILPS 1802 Query: 1470 KDHCGSCHQSFSTSEELRQHVKENCKAVASGSKRSQAAEDMSKRKKA-RNAVSQEKRPAS 1646 HC CH++F++ +E H + C + ++ + D SK K++ ++ K A Sbjct: 1803 MIHCLICHKTFASDDEFENHSESKCIPYSLATEEGKEISDFSKAKESLKSDYLNVKSSAG 1862 Query: 1647 VGIPQGSTLEKQIDGSASVESYNADCPFNFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPS 1826 + + S + + G + + P++FEEI ++F+ S +D V +IGLIGS G+P+ Sbjct: 1863 KDVSEISNVSELDSGLIRYQEEESISPYHFEEICSKFVTKDSNRDLVKDIGLIGSNGIPT 1922 Query: 1827 
FLQGQSPYVSDSALTVGLERTNEASSRSNDLRSKQKNAEPGDVMNGRGFKDSNRSSRSAE 2006 FL +++DS L S+ S SK + GD + G ++N ++E Sbjct: 1923 FLPSSYTHLNDSMLI---------SANS----SKLDGDDSGDQVVFAG-SETNVEGLNSE 1968 Query: 2007 NGLSDELSI---VGRLKSILTSEKDQVTPMKDRSSL-LGLSKSTIIRESSSRPLVGRASE 2174 +S + S+ +G S + + K + SL GL ++ ++S + + G+A Sbjct: 1969 FNMSFDRSVTHDLGGPPSKPSGLGFGFSEQKIKKSLGSGLKSCCVVPQASLKRITGKALP 2028 Query: 2175 ILRILKINLLDMDAALPEDAFRKSRTSPDRRCAWRAFVKSSKSIYEMVQATIVFEDMIKT 2354 + R LK NLLDMD ALPE+ R S++ P RR AWR FVKSS+SIYE+VQAT+V EDM+KT Sbjct: 2029 VFRFLKTNLLDMDVALPEEGLRPSKSHPGRRRAWRLFVKSSQSIYELVQATVVLEDMVKT 2088 Query: 2355 EYLRNDWWYWSSPSTAARITTLSALALRIYSLDSAISYEEPLPTAAAMEVSEPCNAIEEE 2534 EYL+N+WWYWSS S AA+I+TLSAL++RI++LD+AI Y++ L + ++ ++P ++ + Sbjct: 2089 EYLKNEWWYWSSLSAAAKISTLSALSVRIFALDAAIMYDKLLTPSDPIDETKPIISLPD- 2147 Query: 2535 MQKSPTLKNLASPSS 2579 QKS + + SS Sbjct: 2148 -QKSQPVSDSQERSS 2161 >ref|XP_006603816.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like isoform X1 [Glycine max] gi|571553376|ref|XP_006603817.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like isoform X2 [Glycine max] Length = 2175 Score = 501 bits (1291), Expect = e-139 Identities = 338/927 (36%), Positives = 482/927 (51%), Gaps = 98/927 (10%) Frame = +3 Query: 42 LPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPEGNWYCPSCVTGQS 221 +P+APWDEG+CKVCG+D+DDD+VLLCD CD+EYH YCL+PPL RIPEGNWYCPSCV G+ Sbjct: 1242 IPKAPWDEGVCKVCGIDRDDDSVLLCDTCDAEYHTYCLNPPLARIPEGNWYCPSCVDGKR 1301 Query: 222 LPSG-TGYASGSNQRRKRRNQGEFSSKFLEELSRFAKLMGMKEHWEFTVEERIFFLKFLL 398 T +R+ ++ QGE +S +LE L+ + ++ KE+WE++V ER F LKFL Sbjct: 1302 ATQDVTERTKIIGKRQSKKFQGEVNSLYLESLTHLSSVIEEKEYWEYSVGERTFLLKFLC 1361 Query: 399 DEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXXSAEKTNSGVLNVRG 578 DE LNS+ +R H+EQCA +A+L KLR+ ++E A K ++ +N G Sbjct: 1362 DELLNSSLIRQHLEQCAELSAELHQKLRAHSAEWKSLKTREDILSTKAAKMDTFSVNTAG 1421 Query: 579 D--LNSDASSSQHASENITRGKPSEKLVGDQSQPEKIIVKTSEGPNWLSEKPISVQQPQS 752 + L + + PS V S P + + K + + +K ISV S Sbjct: 1422 
EVGLKEGFTGKCPVQPHTAVDNPSNFGVFVDSLPSEEVTKERYRFDSV-DKSISVTNSDS 1480 Query: 753 DQGHTSLLN------NVQSPLFSS---------PTSERETELVQCPN------------- 848 D + + ++ NV + + S P+ ++ + C Sbjct: 1481 DSQNMNSIDVEGQFRNVSAAVESQCTDKSPKSFPSPNHMSQEINCAGGEAHVQGNHQKCE 1540 Query: 849 -----------QG----DMPSSQLN-----NLKACTVKQEITNLLDSIADIELELVKISL 968 QG D+P LN +L+ +K++I+ L DSI + +L+K+S+ Sbjct: 1541 GTDRPIPVSYQQGGVPVDVPQIGLNESEPYHLELNAIKRDISLLQDSITSVVSQLLKLSV 1600 Query: 969 RRDFLGRDSIGRVYWAFCYPGARPWVIACGGTA--HKERCP--RDFS------------- 1097 RR+FLG DSIG++YWA PG +I A H P RD++ Sbjct: 1601 RREFLGIDSIGQLYWASALPGGHSRIIVDASAALLHGRGMPFSRDYAEKFSVLQHCALSD 1660 Query: 1098 -----------SIPDSDKWMYYESDSEIEKLVGWLRENNVREKELKESI-----SQFQA- 1226 S+ + W+ YE+D+EIE+L+GWL ++ +E+ELK+SI S+FQ Sbjct: 1661 KDSSLMSQPSNSLGNRSPWIAYETDAEIEELLGWLDYSDPKERELKDSIMLGPKSRFQEF 1720 Query: 1227 --NKLKDSEYTEDHILKKREINHGGRKTVSADFLATNAMNALEKKFGPCKRIETTAV--P 1394 + +D + HI R KTVS + L T A + LEKKFGP + V Sbjct: 1721 INAQTEDQGEDQGHISMPRN----REKTVS-NSLVTKATSLLEKKFGPFVEWDNVEVLKK 1775 Query: 1395 QNLVMGASQSGGMHRCECLEMLWPSKDHCGSCHQSFSTSEELRQHVKENCKAVASGSKRS 1574 QN + ++RCECLE +WPS+ HC CH++ + E H C A ++ Sbjct: 1776 QNRKARTTNDEKLYRCECLEPIWPSRKHCTYCHKTVVSDVEFDGHNDGKCIAGLPAVEKK 1835 Query: 1575 QAAEDMSK-RKKARNAVSQEKRPASVGIPQGSTLEKQIDGSASVESY-------NADCPF 1730 + SK R + S EK A T + GS+ + S + CPF Sbjct: 1836 KDKNGSSKGRGNLKCDASHEKFRADA-----ETAVTSVSGSSKLSSRLIKFSNEESTCPF 1890 Query: 1731 NFEEIMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSALTVGL-ERTNEASSR 1907 +FE+I ++F+ S K+ V EIGLIGS G+PS + SP+VS+ L+ ER S+ Sbjct: 1891 SFEDICSKFVTNDSNKELVREIGLIGSDGIPSLVPSVSPFVSEYTLSAQKDERIVGGVSK 1950 Query: 1908 SNDLRSKQKNAEPGDVMNGRGFKDSNRSSRSAENGLSDELSIVGRLKSILTSEKDQVTPM 2087 +++ + Q N + R K S + R A N + KS ++D Sbjct: 1951 ASESQVSQGNTDGAGTCLDR--KSSISTGRLAANESNKSN------KSSSREQRDGKLSF 2002 Query: 2088 KDRSSLLGLSKSTIIRESSSRPLVGRASEILRILKINLLDMDAALPEDAFRKSRTSPDRR 2267 + +S +G ++ S RPLVG+AS ILR LKINLLDMDAAL A R S+ DRR Sbjct: 2003 
CNPASGMGADGYCVVPSPSLRPLVGKASHILRQLKINLLDMDAALTAIALRPSKAESDRR 2062 Query: 2268 CAWRAFVKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYWSSPSTAARITTLSALALRIYS 2447 AWR FVKS+K+IYEM+QAT EDMIKTEYLRNDWWYWSS S AA+ +TL +LALRIYS Sbjct: 2063 QAWRTFVKSAKTIYEMIQATFTLEDMIKTEYLRNDWWYWSSFSAAAKSSTLPSLALRIYS 2122 Query: 2448 LDSAISYEEPLPTAAAMEVSEPCNAIE 2528 LD AI YE+ +P ++ + SEP +E Sbjct: 2123 LDLAIIYEK-MPNSSFTDSSEPSAIVE 2148 >ref|XP_006594288.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like [Glycine max] Length = 2202 Score = 501 bits (1291), Expect = e-139 Identities = 341/952 (35%), Positives = 492/952 (51%), Gaps = 95/952 (9%) Frame = +3 Query: 42 LPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPEGNWYCPSCVTGQS 221 +P+APWDEG+CKVCG+D+DDD+VLLCD CD+EYH YCL+PPL RIPEGNWYCPSCV G+ Sbjct: 1264 IPKAPWDEGVCKVCGIDRDDDSVLLCDTCDAEYHTYCLNPPLARIPEGNWYCPSCVVGKH 1323 Query: 222 LPSG-TGYASGSNQRRKRRNQGEFSSKFLEELSRFAKLMGMKEHWEFTVEERIFFLKFLL 398 T +R+ ++ QGE +S +LE L+ + + KE+WE++V ER F LKFL Sbjct: 1324 ATQNVTERTQVIGKRQSKKFQGEVNSLYLESLAHLSAAIEEKEYWEYSVGERTFLLKFLC 1383 Query: 399 DEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXXSAEKTNSGVLNVRG 578 DE LNS+ + H+EQCA +A+L KLR+ ++E A K ++ LN G Sbjct: 1384 DELLNSSLIHQHLEQCAELSAELHQKLRAHSAEWKSLKTREDILSTKAAKIDTFSLNTAG 1443 Query: 579 DLNSDASSSQHASE--------NITRGKPSEKLVGDQSQPEKIIVKTSEGPNWLSEKPIS 734 ++ + S + PS V S P + + K + + +K IS Sbjct: 1444 EVGLKEGFASLLSNTGKCLVQPHTAVDNPSNFGVFVDSLPSEEVTKDKYRFDSV-DKSIS 1502 Query: 735 VQQPQSDQGHTSLLN------NVQSPLFSSPTSE------------RETE------LVQC 842 V SD + + ++ NV + S T + +ET LVQ Sbjct: 1503 VTNSDSDSQNMNSIDVEGQFRNVSGAVESQCTDKSPKSFPLPNHMPQETNGAGGASLVQG 1562 Query: 843 PNQG------------------DMPSSQLN-----NLKACTVKQEITNLLDSIADIELEL 953 NQ D+P +N +L+ +K++I+ L DSI + +L Sbjct: 1563 KNQKCEGKDIPTPVSYQQGMPVDVPQISVNESEPYHLELIAIKRDISLLQDSITSVASQL 1622 Query: 954 VKISLRRDFLGRDSIGRVYWAFCYPGARPWVIACGGTA---------------------H 1070 +K+S+RR+ LG DSIGR+YWA PG R ++ A H Sbjct: 1623 LKLSVRRECLGIDSIGRLYWASALPGGRSRIVVDASAALLHGRGMTFSRDYVEKFSVLQH 1682 Query: 
1071 KERCPRDFS-------SIPDSDKWMYYESDSEIEKLVGWLRENNVREKELKESISQFQAN 1229 +D S + +S W+ YE+D EIE+L+GWL +++ +E+ELK+SI + Sbjct: 1683 CALSDKDSSLMSQPSNPLGNSSPWIAYETDVEIEELLGWLDDSDPKERELKDSIMLGPKS 1742 Query: 1230 KLKD--SEYTEDHILKKREIN--HGGRKTVSADFLATNAMNALEKKFGPCKRIETTAV-- 1391 + + + TED + ++ KTVS + L T A + LEKKFGP + + V Sbjct: 1743 RFQQFINAQTEDRAKDQGNVSMPRNREKTVS-NSLVTKATSLLEKKFGPFVEWDNSEVLK 1801 Query: 1392 PQNLVMGASQSGGMHRCECLEMLWPSKDHCGSCHQSFSTSEELRQHVKENCKAVASGSKR 1571 QN + ++RCECLE + PS+ HC CH++ ++ E H C A ++ Sbjct: 1802 KQNRKTRTTNDEKLYRCECLEPILPSRKHCTHCHKTVASDIEFDGHNDGKCNAGLLAIEK 1861 Query: 1572 SQAAEDMSKRK---KARNAVSQEKRPASVGIPQGSTLEKQIDGSASVESYNADCPFNFEE 1742 ++ SK + K + + A + S K + + CPFNFE+ Sbjct: 1862 NKDKNGSSKGRGNLKCDTLHEKFRADAETALTSVSGSSKLSSRLIKFSNEESTCPFNFED 1921 Query: 1743 IMTRFIVPSSVKDGVNEIGLIGSGGLPSFLQGQSPYVSDSALTVGLERTNEAS-SRSNDL 1919 I ++F+ S K+ V+EIGLIGS G+PSF+ SP+VS+ L+ + + S ++ Sbjct: 1922 ICSKFVTNDSNKELVSEIGLIGSDGIPSFVPSVSPFVSEYTLSAQKDESIVGGVSIVSES 1981 Query: 1920 RSKQKNAEPGDVMNGRGFKDSNRSSRSAENGLSDELSIVGRLKSILTSEKDQVTPMKDRS 2099 R Q N + G G ++S S ++E + KS L ++D + Sbjct: 1982 RVSQGNTD------GAGTCLDHKSGISTGKLAANESNKSN--KSSLREQRDGKFSFCSPA 2033 Query: 2100 SLLGLSKSTIIRESSSRPLVGRASEILRILKINLLDMDAALPEDAFRKSRTSPDRRCAWR 2279 S++G ++ S RPLVG+AS ILR LKINLLDMDAAL A R S+ PDRR AWR Sbjct: 2034 SVMGADGCCVVPSPSLRPLVGKASHILRQLKINLLDMDAALLAIALRPSKAVPDRRQAWR 2093 Query: 2280 AFVKSSKSIYEMVQATIVFEDMIKTEYLRNDWWYWSSPSTAARITTLSALALRIYSLDSA 2459 FVKS+K+IYEM+QAT EDMIKTEYLRNDWWYWSS S AA+ +TL +LALRIYSLD A Sbjct: 2094 TFVKSAKTIYEMIQATFTLEDMIKTEYLRNDWWYWSSFSAAAKSSTLPSLALRIYSLDLA 2153 Query: 2460 ISYEEPLPTAAAMEVSEPCNAIE-EEMQKSPTLKNLASPSSPTLLKTPEPDS 2612 I YE+ +P ++ + SEP E + + T K+ AS S K E DS Sbjct: 2154 IIYEK-MPNSSFTDSSEPSVIAEPKPLMNVDTEKSKASRKSTR--KRKESDS 2202 >ref|XP_006408507.1| hypothetical protein EUTSA_v10019872mg [Eutrema salsugineum] gi|557109653|gb|ESQ49960.1| hypothetical protein EUTSA_v10019872mg [Eutrema salsugineum] Length = 2173 Score = 496 bits (1278), Expect = 
e-137 Identities = 318/893 (35%), Positives = 472/893 (52%), Gaps = 67/893 (7%) Frame = +3 Query: 3 ERDDLLVQVCNSVLPRAPWDEGLCKVCGMDKDDDNVLLCDKCDSEYHRYCLDPPLLRIPE 182 E D++V + + LP+APWDEG+CKVCG+DKDDD+VLLCD CD+EYH YCL+PPL+RIP+ Sbjct: 1267 EIKDIVVSI--NKLPKAPWDEGVCKVCGVDKDDDSVLLCDTCDAEYHTYCLNPPLIRIPD 1324 Query: 183 GNWYCPSCVTGQSLPSGTGYASGSNQRRK-RRNQGEFSSKFLEELSRFAKLMGMKEHWEF 359 GNWYCPSCV + + + +RRK R+ QGE + +E + A +M K++WEF Sbjct: 1325 GNWYCPSCVIAKRIAQDALESYKLVRRRKGRKYQGELTRASMETTAHLADVMEEKDYWEF 1384 Query: 360 TVEERIFFLKFLLDEALNSATVRDHMEQCASRAADLQNKLRSLTSEXXXXXXXXXXXXXS 539 + EERI LK L DE L+S+ V H+EQCA ++Q KLRSL+SE Sbjct: 1385 STEERILLLKLLCDELLSSSLVHQHLEQCAEAIIEMQQKLRSLSSEWKNTKMRQEFLTAK 1444 Query: 540 AEKTNSGVLNVRGDLNSDASSSQHASENITRGKPSEKLVGDQ------------------ 665 K +L G + +S +E I + ++ VGD+ Sbjct: 1445 LAKVEPSILKELG----EPQNSSSFAEQIRCNQQQQEGVGDRVTHDDDTSSAAFLNKNQR 1500 Query: 666 -------SQPEKIIV-----KTSEGPNWLS----EKPI-----------SVQQPQSDQGH 764 +Q E++ V K S N S E PI S ++ SD H Sbjct: 1501 TTPLMTDAQTEELHVISGERKISTPENVTSPGRPELPIADASPHGTDNLSCEKDSSDTLH 1560 Query: 765 TSLLNNVQSPLFSSPTSERETELVQCPNQGDMPSSQL--NNLKACTVKQEITNLLDSIAD 938 S+ N + S E +T + M S +L + + EI NL SI Sbjct: 1561 KSVGGNHEIHTLKSNAVESQT----AHDASSMASQELQASQQELNATSNEIQNLQQSIRS 1616 Query: 939 IELELVKISLRRDFLGRDSIGRVYWAFCYPGARPWVIACGG-----------TAHKERCP 1085 IE +L++ S+RRDFLG D+ GR+YW C+P P ++ G T K P Sbjct: 1617 IESQLLRQSIRRDFLGSDASGRLYWGCCFPEEHPRILVDGSISLQKSVQVNLTGSKVLSP 1676 Query: 1086 RDFSSIPDSDK-----WMYYESDSEIEKLVGWLRENNVREKELKESISQFQANKLKDSEY 1250 F D + W YYE+++EI +LV WL +++ +E+EL+ESI ++ + D Sbjct: 1677 --FLHAVDHGRLLVSPWTYYETEAEISELVQWLHDDDPKERELRESILCWKRLRFGD--- 1731 Query: 1251 TEDHILKKREINHGGRKTVSADFLATNAMNALEKKFGPCKRIET-TAVPQNLVMGASQSG 1427 + + + +SA L T A ++EK++GPC ++ET T + ++ Sbjct: 1732 ----LQRGMKQAQNSSCPISAGSLVTKAAMSMEKRYGPCIKLETETLKKRGKKTKVAERE 1787 Query: 1428 GMHRCECLEMLWPSKDHCGSCHQSFSTSEELRQHVKENCKAVASGSKRSQAAEDMSKRKK 1607 + RCECLE + PS HC CH++F++ +E +H + C + S+ + D SK K Sbjct: 1788 
KLCRCECLEPILPSMIHCLICHKTFASDDEFEEHTESKCIPYSLASEEGKEISDSSKAKD 1847 Query: 1608 ARNAVSQEKRPASVGIPQGSTLEKQIDGSASVESYNADCPFNFEEIMTRFIVPSSVKDGV 1787 + A + + S + + G + + P++FEEI ++F+ S +D V Sbjct: 1848 GLKSDYLNVYNAGKDVAEMSNVSELDSGLIRYQEEESISPYHFEEICSKFVTRDSNRDLV 1907 Query: 1788 NEIGLIGSGGLPSFLQGQSPYVSDSALTVGLERTNEASSRSNDLRSKQKNAEPGDVMNGR 1967 EIGLIGS G P+FL S +++DS L + ++ + S + G N Sbjct: 1908 KEIGLIGSNGTPTFLP-SSTFLNDSML------ISATCNKLDGGDSVDQVIFTGSEANDE 1960 Query: 1968 GF-KDSNRS-SRSAENGLSDELSIVGRLKSILTSEKDQVTPMKDRSSLLGLSKSTIIRES 2141 G +SN S +R N L L+ L L+ +K++ +SS GL ++ +S Sbjct: 1961 GLNSESNMSFNRIVTNDLGGPLNKPSGLSFGLSDQKNK------KSSGRGLEGCCVVPQS 2014 Query: 2142 SSRPLVGRASEILRILKINLLDMDAALPEDAFRKSRTSPDRRCAWRAFVKSSKSIYEMVQ 2321 S + + G+A + R LK N+LDMD ALPE+A R S++ PDRR AWRAFVKS++SI+E+VQ Sbjct: 2015 SLKRITGKALSVFRFLKTNMLDMDVALPEEALRPSKSHPDRRRAWRAFVKSAQSIFELVQ 2074 Query: 2322 ATIVFEDMIKTEYLRNDWWYWSSPSTAARITTLSALALRIYSLDSAISYEEPL 2480 A IV EDMIKTEYL+N+WWYWSS S AA+I+TLSAL++R++SLD+AI YE+P+ Sbjct: 2075 AAIVVEDMIKTEYLKNEWWYWSSLSAAAKISTLSALSVRLFSLDAAILYEKPI 2127