BLASTX nr result
ID: Atropa21_contig00012307
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00012307
         (1461 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_006364215.1| PREDICTED: methyl-CpG-binding domain-contain...   359   1e-96
ref|XP_004235957.1| PREDICTED: uncharacterized protein LOC101251...   297   7e-78
ref|XP_004235958.1| PREDICTED: uncharacterized protein LOC101251...   275   5e-71
gb|ADN06078.1| DNA methylation protein MBD1 [Capsicum annuum]         226   2e-56
ref|XP_004241694.1| PREDICTED: uncharacterized protein LOC101262...   196   2e-47
ref|XP_006356218.1| PREDICTED: methyl-CpG-binding domain-contain...   190   1e-45
ref|XP_002267149.1| PREDICTED: uncharacterized protein LOC100249...   169   4e-39
ref|XP_006427282.1| hypothetical protein CICLE_v10025929mg [Citr...   161   7e-37
ref|XP_006465326.1| PREDICTED: methyl-CpG-binding domain-contain...   159   2e-36
ref|XP_003527162.1| PREDICTED: methyl-CpG-binding domain-contain...   155   3e-35
ref|XP_002892861.1| hypothetical protein ARALYDRAFT_312525 [Arab...   152   5e-34
ref|XP_003527196.1| PREDICTED: methyl-CpG-binding domain-contain...   150   1e-33
gb|EOY25807.1| Methyl-CPG-binding domain 10 [Theobroma cacao]         150   2e-33
gb|EMJ17007.1| hypothetical protein PRUPE_ppa009989mg [Prunus pe...   149   4e-33
ref|XP_004508481.1| PREDICTED: methyl-CpG-binding domain-contain...   148   7e-33
ref|XP_004251723.1| PREDICTED: uncharacterized protein LOC101259...   148   7e-33
gb|EOY31101.1| Methyl-CPG-binding domain 10, putative isoform 1 ...   146   2e-32
ref|NP_563971.1| methyl-CPG-binding domain 10 [Arabidopsis thali...   146   2e-32
gb|ESW27012.1| hypothetical protein PHAVU_003G166200g [Phaseolus...
145 6e-32 gb|EMJ03445.1| hypothetical protein PRUPE_ppa008789mg [Prunus pe... 144 7e-32 >ref|XP_006364215.1| PREDICTED: methyl-CpG-binding domain-containing protein 10-like [Solanum tuberosum] Length = 317 Score = 359 bits (922), Expect = 1e-96 Identities = 208/326 (63%), Positives = 233/326 (71%), Gaps = 15/326 (4%) Frame = +1 Query: 301 MPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLKTHDGNPGISEFDWTTGEAPRRSARIS 480 M KKG + KKNEVVFVAPTG+EIRN+RQLEKYLKTHDGNPG+SEFDWTTGEAPRRSARIS Sbjct: 1 MLKKGVRAKKNEVVFVAPTGEEIRNKRQLEKYLKTHDGNPGMSEFDWTTGEAPRRSARIS 60 Query: 481 EKVKAMPPPAVLEPXXXXXXXXXXXXXXXEMGDANVEKENMDKKDMESTIEEKEGLEK-E 657 +KVKAMP PAVLEP EM A++EKEN MES IEEKEGLEK E Sbjct: 61 QKVKAMPLPAVLEPAKKRQRTSYATKKEEEMDIADIEKEN-----MESAIEEKEGLEKEE 115 Query: 658 NVVENEIEDKEKGEMGVDKEDTAHKKEEEVSGREV----IESSKDTQVEDGG-KMAENGT 822 N+VE E+EDK GEMG +KE+T HKKEEE++G EV E+S D Q EDGG KMAEN T Sbjct: 116 NIVEVEMEDK-GGEMGANKENTEHKKEEEMAGTEVSDRKTEASNDIQFEDGGDKMAENVT 174 Query: 823 EETIEIDANIPIDSMDKDNFNGVASDXXXXXXXXXXXXXXVKEKHGYGENLDSQITID-S 999 EETI IDA+ DSMDKD F GV ++ V EK GYGENL+SQI ID S Sbjct: 175 EETIAIDADFWNDSMDKDYFKGVFANAVIGETNGAEAGIEVGEKPGYGENLESQIDIDIS 234 Query: 1000 SADDMNQDIP--------ISGSETYNIQDPAYGGEATGVRENLGEHRSLEEENNKNRVGL 1155 AD+MN DIP ++G ETYNIQDPA+ GEA+G RE E S+EEE NKNR GL Sbjct: 235 GADNMNHDIPDKGTPERKVAGGETYNIQDPAFVGEASGFRE---EPNSVEEEKNKNRAGL 291 Query: 1156 VMDNGKINQPGPAHTPQHQSGATISC 1233 VMDN +INQP AHTPQHQS ATISC Sbjct: 292 VMDNCQINQPERAHTPQHQSVATISC 317 >ref|XP_004235957.1| PREDICTED: uncharacterized protein LOC101251458 isoform 1 [Solanum lycopersicum] Length = 393 Score = 297 bits (761), Expect = 7e-78 Identities = 192/417 (46%), Positives = 227/417 (54%), Gaps = 80/417 (19%) Frame = +1 Query: 223 MASNNMENNDGLPLELPAPPSWKKRVMPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLK 402 MASNNME LPAP SWKK +MPKKG + KKNEVVFVAPTG+EIRN+RQLEKYLK Sbjct: 1 MASNNME--------LPAPLSWKKLLMPKKGIRAKKNEVVFVAPTGEEIRNKRQLEKYLK 52 Query: 403 THDGNPGISEFDWTTGEAPRRSARISEKVKAMPPPAVLEPXXXXXXXXXXXXXXXEMGDA 582 TH+GNPG+SEFDWTTGE 
PRRSARIS+KVKAMP PAVLEP EM A Sbjct: 53 THNGNPGMSEFDWTTGETPRRSARISQKVKAMPLPAVLEPAKKRQRTSSSTKKEEEMDAA 112 Query: 583 NVEKENMDKKDMESTIEEKEGLE-KENVVENEIEDKEKGEMGVDKEDTAHKKEEEVSGRE 759 N EKENMDKK+MES IE EGLE KENVV+NE DKE DT HKK+EE+SG E Sbjct: 113 NAEKENMDKKEMESVIEATEGLENKENVVDNETVDKE--------VDTEHKKDEELSGGE 164 Query: 760 VI-----ESSKDTQVEDGGKMAENGTEETIEIDANIPIDSMDKDNFNGVASDXXXXXXXX 924 V+ E+S D VEDGGK+A+NG E+TI + ++ DK + G + Sbjct: 165 VLQKQKSEASNDALVEDGGKLAQNGIEDTI-----VEVEIEDKGDEMGAYKENIEHKIEG 219 Query: 925 XXXXXXVKEK---------HGYGENLDSQITIDSSA-------DDMNQ------------ 1020 V ++ G G + +++ ++A D M++ Sbjct: 220 EISGTEVSDRKTETSNDTQFGDGSKMAENVSVVTTAIDADFWNDSMDKDYFKGAFANAVI 279 Query: 1021 --------------------------DIPISGSETYN--------------------IQD 1062 DI ISG++ N IQD Sbjct: 280 GDTNAAEAGIEVGEKPGYGENLESQIDIEISGADDMNHDMPDMVASERNVAGGETFNIQD 339 Query: 1063 PAYGGEATGVRENLGEHRSLEEENNKNRVGLVMDNGKINQPGPAHTPQHQSGATISC 1233 PA+ G E EHR LEEE NKN GLVMDNG+INQP AHTPQHQS ATISC Sbjct: 340 PAFVEGTGGFHE---EHRPLEEEKNKNSTGLVMDNGQINQPERAHTPQHQSSATISC 393 >ref|XP_004235958.1| PREDICTED: uncharacterized protein LOC101251458 isoform 2 [Solanum lycopersicum] Length = 375 Score = 275 bits (702), Expect = 5e-71 Identities = 177/391 (45%), Positives = 211/391 (53%), Gaps = 80/391 (20%) Frame = +1 Query: 301 MPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLKTHDGNPGISEFDWTTGEAPRRSARIS 480 MPKKG + KKNEVVFVAPTG+EIRN+RQLEKYLKTH+GNPG+SEFDWTTGE PRRSARIS Sbjct: 1 MPKKGIRAKKNEVVFVAPTGEEIRNKRQLEKYLKTHNGNPGMSEFDWTTGETPRRSARIS 60 Query: 481 EKVKAMPPPAVLEPXXXXXXXXXXXXXXXEMGDANVEKENMDKKDMESTIEEKEGLE-KE 657 +KVKAMP PAVLEP EM AN EKENMDKK+MES IE EGLE KE Sbjct: 61 QKVKAMPLPAVLEPAKKRQRTSSSTKKEEEMDAANAEKENMDKKEMESVIEATEGLENKE 120 Query: 658 NVVENEIEDKEKGEMGVDKEDTAHKKEEEVSGREVI-----ESSKDTQVEDGGKMAENGT 822 NVV+NE DKE DT HKK+EE+SG EV+ E+S D VEDGGK+A+NG Sbjct: 121 NVVDNETVDKE--------VDTEHKKDEELSGGEVLQKQKSEASNDALVEDGGKLAQNGI 172 Query: 823 EETIEIDANIPIDSMDKDNFNGVASDXXXXXXXXXXXXXXVKEK---------HGYGENL 975 E+TI + ++ DK + G + V ++ G G + Sbjct: 
173 EDTI-----VEVEIEDKGDEMGAYKENIEHKIEGEISGTEVSDRKTETSNDTQFGDGSKM 227 Query: 976 DSQITIDSSA-------DDMNQ-------------------------------------- 1020 +++ ++A D M++ Sbjct: 228 AENVSVVTTAIDADFWNDSMDKDYFKGAFANAVIGDTNAAEAGIEVGEKPGYGENLESQI 287 Query: 1021 DIPISGSETYN--------------------IQDPAYGGEATGVRENLGEHRSLEEENNK 1140 DI ISG++ N IQDPA+ G E EHR LEEE NK Sbjct: 288 DIEISGADDMNHDMPDMVASERNVAGGETFNIQDPAFVEGTGGFHE---EHRPLEEEKNK 344 Query: 1141 NRVGLVMDNGKINQPGPAHTPQHQSGATISC 1233 N GLVMDNG+INQP AHTPQHQS ATISC Sbjct: 345 NSTGLVMDNGQINQPERAHTPQHQSSATISC 375 >gb|ADN06078.1| DNA methylation protein MBD1 [Capsicum annuum] Length = 235 Score = 226 bits (575), Expect = 2e-56 Identities = 125/217 (57%), Positives = 149/217 (68%), Gaps = 7/217 (3%) Frame = +1 Query: 238 MENNDGLPLELPAPPSWKKRVMPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLKTHDGN 417 ME N LP+ELPAPPSWKK MPKK + KKNE+VFVAPTG+EIRNRRQLEKY+KTHDG+ Sbjct: 13 MEKNQVLPVELPAPPSWKKLAMPKKSVRAKKNEIVFVAPTGEEIRNRRQLEKYVKTHDGS 72 Query: 418 PGISEFDWTTGEAPRRSARISEKVKAMPPPAVLEPXXXXXXXXXXXXXXXEMGDANVEKE 597 PGISEFDWTTGEAPRRSARISEKVKAMPP VLEP + A+VEKE Sbjct: 73 PGISEFDWTTGEAPRRSARISEKVKAMPPVTVLEP----TMKRRRTATKKDEDVASVEKE 128 Query: 598 NMDKKDMESTIEEKEGL-EKENVVENEIEDKEKGEMGVDKEDTAHKKE-----EEVSGRE 759 NM KKDME IE KEGL E+E+ VE+E+ DK K E + +T + KE E R+ Sbjct: 129 NMGKKDMEPAIENKEGLEEEEDDVEDEVVDKGKDEQTAKEVETEYIKELEMFDGEAPKRQ 188 Query: 760 VIESSKDTQVEDGGKMAENGTEET-IEIDANIPIDSM 867 ++ D Q EDGGK+ E+ T+ET +E++ D M Sbjct: 189 KYKACNDPQYEDGGKLEEDVTQETAVEVEMENKEDEM 225 >ref|XP_004241694.1| PREDICTED: uncharacterized protein LOC101262625 [Solanum lycopersicum] Length = 353 Score = 196 bits (498), Expect = 2e-47 Identities = 128/340 (37%), Positives = 174/340 (51%), Gaps = 16/340 (4%) Frame = +1 Query: 262 LELPAPPSWKKRVMPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLKTHDGNPGISEFDW 441 +ELPAPPSWKK PK+GG KK+EVVF+APTG+E++NR+QLE+YLK H G+PGISEFDW Sbjct: 17 VELPAPPSWKKLFTPKQGGTPKKSEVVFIAPTGEEVKNRKQLEQYLKAHPGSPGISEFDW 76 Query: 442 TTGEAPRRSARISEKVKAMPPPAVLE-PXXXXXXXXXXXXXXXEMGDANVEKENMDKKDM 
618 +TGE PRRSARISEKVKAM PP++LE P E A E + + K M Sbjct: 77 STGETPRRSARISEKVKAMRPPSLLESPKKKRRTSSGTKKDSKEKAAAKAEMGSAETKGM 136 Query: 619 ESTIEEKEGLEKE-NVVENEIEDKEKGEM-GVDKEDTAHKKEEEVSGREVIESSKDTQVE 792 ES+ EE E LEK+ E E++DK K E V +++ + + + ES + + Sbjct: 137 ESSKEENENLEKKAGEAEAEMQDKGKKEAEAVVEDERIEDAKLPPAEKPDSESEEFHSAD 196 Query: 793 DGGKMAENGTEETIEIDANIPIDSMDKDN-FNGV------------ASDXXXXXXXXXXX 933 DG + E +E +++++KD G Sbjct: 197 DGNQDKSENAEAEMEDKKKKEVEAVEKDKCLKGAELPSGDEREPESEEVHSADVGRHKSE 256 Query: 934 XXXVKEKHGYGENLDSQITIDSSADDMNQDIPISGSETYNIQDPAYGGEATGVRENLGEH 1113 ++EK E L+SQ ++ A + ++ + GS + A G + N GE Sbjct: 257 NAGIEEKQVSEEKLESQNKLEELAAE-GTNVTVGGSA--GQAEHAVDGVSKDAHSNDGED 313 Query: 1114 RSLEEENNKNRVGLVMDNGKINQPGPAHTPQHQSGATISC 1233 EEE L M+N INQPG H QHQS A ISC Sbjct: 314 GPKEEEKKTEGTELAMENNNINQPGLVHPQQHQSPAPISC 353 >ref|XP_006356218.1| PREDICTED: methyl-CpG-binding domain-containing protein 10-like [Solanum tuberosum] Length = 353 Score = 190 bits (483), Expect = 1e-45 Identities = 132/342 (38%), Positives = 170/342 (49%), Gaps = 18/342 (5%) Frame = +1 Query: 262 LELPAPPSWKKRVMPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLKTHDGNPGISEFDW 441 +EL APPSWKK PK+GG KK+EVVF+ PTG+E++NR+QLE+YLK H GNP I+EFDW Sbjct: 17 VELAAPPSWKKLFTPKQGGTPKKSEVVFITPTGEEVKNRKQLEQYLKAHPGNPAIAEFDW 76 Query: 442 TTGEAPRRSARISEKVKAMPPPAVLE-PXXXXXXXXXXXXXXXEMGDANVEKENMDKKDM 618 +TGE PRRSARISEKVKA PP++LE P E A EKE+ + K M Sbjct: 77 STGETPRRSARISEKVKAKRPPSLLESPKKKRRTSSGTKKVSKEKDAAKAEKESSETKGM 136 Query: 619 ESTIEEKEGL-EKENVVENEIEDKEKGEMGVDKED---------TAHKKEEEV------- 747 ES+ EE E L +K E E+ED+ K E+ ED A K + E Sbjct: 137 ESSKEENENLGKKAGEAEGEMEDEGKKEVEAVVEDERIEDAKLPPAEKPDSESEEFHSAD 196 Query: 748 SGREVIESSKDTQVEDGGKMAENGTEETIEIDANIPIDSMDKDNFNGVASDXXXXXXXXX 927 G + + ++ED K TEE E + S DK Sbjct: 197 DGNQDKSEHAEAEMEDKKKKEVEATEED-ECLKGAELPSGDKPEPESEEVHSADVGKQNK 255 Query: 928 XXXXXVKEKHGYGENLDSQITIDSSADDMNQDIPISGSETYNIQDPAYGGEATGVRENLG 1107 ++EK E L SQ + A + ++ I GS N + A G + N Sbjct: 256 
SENADIEEKQVSEEKLVSQNKPEELAAE-GTNVTIGGSA--NQAEHAVDGVSKDAHSNDS 312 Query: 1108 EHRSLEEENNKNRVGLVMDNGKINQPGPAHTPQHQSGATISC 1233 E R EE+ + L M+N INQPG H+ QHQS A ISC Sbjct: 313 EDRPKEEKETEG-TDLAMENNNINQPGLVHSQQHQSPAPISC 353 >ref|XP_002267149.1| PREDICTED: uncharacterized protein LOC100249094 isoform 1 [Vitis vinifera] Length = 324 Score = 169 bits (427), Expect = 4e-39 Identities = 119/328 (36%), Positives = 162/328 (49%), Gaps = 4/328 (1%) Frame = +1 Query: 262 LELPAPPSWKKRVMPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLKTHDGNPGISEFDW 441 +ELPAPPSWKK MPKKG +KNE+VF+APTG+EI +R+QLE+YLK+H GNP ISEFDW Sbjct: 10 MELPAPPSWKKMFMPKKGTP-RKNEIVFIAPTGEEINSRKQLEQYLKSHPGNPAISEFDW 68 Query: 442 TTGEAPRRSARISEKVKAMPPPAVLEPXXXXXXXXXXXXXXXEMGDANVEKENMDKKDM- 618 TGE PRRSARISEK KA PP P E A E+E + M Sbjct: 69 GTGETPRRSARISEKAKATPPAESEPPKKRGRKSSGSKKDGKETEAATEEQEGKKEISMQ 128 Query: 619 ESTIEEKEGLEKENV-VENEIEDKEKGEMGVDKEDTAHKKEEEVSGREVIESSKDTQVED 795 ++ + EK E E+V E ++E+ K D+ EE E I+ + D Sbjct: 129 DADVTEKANAESEDVSKEIQVENGVKTVAEADQVKNLDVNMEEAGPVEAIDGKDEKIQSD 188 Query: 796 GG--KMAENGTEETIEIDANIPIDSMDKDNFNGVASDXXXXXXXXXXXXXXVKEKHGYGE 969 G K+A TE +A + ++ VA D K G+ Sbjct: 189 TGDSKVAATETEVVNAEEAQGEKEVKKQEVAEAVAVD-------EPAKEAGAKVTQKEGD 241 Query: 970 NLDSQITIDSSADDMNQDIPISGSETYNIQDPAYGGEATGVRENLGEHRSLEEENNKNRV 1149 L++ T++ + + +N+D P I E V +N G+ + EEN K Sbjct: 242 KLETSATVELN-EAVNKDKP----NGLGIAPEEEVKEKQEVPDNDGKCKFQVEENGKKLE 296 Query: 1150 GLVMDNGKINQPGPAHTPQHQSGATISC 1233 G V +NGK+NQ A TPQH + + +SC Sbjct: 297 GDVTENGKVNQMQRAETPQHPAPSPVSC 324 >ref|XP_006427282.1| hypothetical protein CICLE_v10025929mg [Citrus clementina] gi|557529272|gb|ESR40522.1| hypothetical protein CICLE_v10025929mg [Citrus clementina] Length = 363 Score = 161 bits (407), Expect = 7e-37 Identities = 117/349 (33%), Positives = 171/349 (48%), Gaps = 23/349 (6%) Frame = +1 Query: 256 LPLELPAPPSWKKRVMPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLKTHDGNPGISEF 435 + +ELPAPP+WKK +PKKGG +K+E++F+APTG+E+ NR+QLE+YLK+H GNP I+EF Sbjct: 45 
ISVELPAPPAWKKMYLPKKGGTPRKSEIMFIAPTGEELHNRKQLEQYLKSHPGNPAITEF 104 Query: 436 DWTTGEAPRRSARISEKVKAMPPPAVLEPXXXXXXXXXXXXXXXEMGDANVEKENMDKKD 615 DW TGE PRRS+RISEK KA P P P + G + D K+ Sbjct: 105 DWGTGETPRRSSRISEKAKATPTPEKEPP--------------KKRGRRSTSASKKDNKE 150 Query: 616 MESTIEEKEGLEKENVVENEI---EDKEKGEMGVDKE---DTAHKKEEEVSGREVIESSK 777 E+ E E+E VE ++ +K+K E V K+ +T KE E + + +K Sbjct: 151 TEA----PENAEREKEVEMQVAEETEKKKAEAEVGKDVSVETQVGKEGETQEKAEVTENK 206 Query: 778 DTQVEDGG----KMAENGTEETIEIDANIPIDSMDKDNFNGVASDXXXXXXXXXXXXXXV 945 D +E+ G A++G EET + S+ K+N A V Sbjct: 207 DANMEEAGPEQNNKAQDGAEETKNSNEEREAASV-KENEETAAEVTQNEKEKVEISVENV 265 Query: 946 -----KEKHGYGENLD---SQITIDSSA----DDMNQDIPISGSETYNIQDPAYGGEATG 1089 +E+ G +N S +T+ + D+ N P + E QD E + Sbjct: 266 PQNEAEEEDGATDNKQDNPSTVTVGENGGAEKDEGNGSAPATDGEVKEKQDGLENDEKSN 325 Query: 1090 VRENLGEHRSLEEENNKNRVGLVMDNGKINQPGPAH-TPQHQSGATISC 1233 V ++E K G V++NGK++Q G TPQH A++SC Sbjct: 326 VP---------DKETGKEIDGGVIENGKVDQMGRTDTTPQHP--ASVSC 363 >ref|XP_006465326.1| PREDICTED: methyl-CpG-binding domain-containing protein 10-like [Citrus sinensis] Length = 326 Score = 159 bits (403), Expect = 2e-36 Identities = 116/349 (33%), Positives = 171/349 (48%), Gaps = 23/349 (6%) Frame = +1 Query: 256 LPLELPAPPSWKKRVMPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLKTHDGNPGISEF 435 + +ELPAPP+WKK +PKKGG +K+E++F+APTG+E+ NR+QLE+YLK+H GNP I+EF Sbjct: 8 ISVELPAPPAWKKMYLPKKGGTPRKSEIMFIAPTGEELHNRKQLEQYLKSHPGNPAITEF 67 Query: 436 DWTTGEAPRRSARISEKVKAMPPPAVLEPXXXXXXXXXXXXXXXEMGDANVEKENMDKKD 615 DW TGE PRRS+RISEK KA P P P + G + D K+ Sbjct: 68 DWGTGETPRRSSRISEKAKATPTPEKEPP--------------KKRGRRSTSTSKKDNKE 113 Query: 616 MESTIEEKEGLEKENVVENEI---EDKEKGEMGVDKE---DTAHKKEEEVSGREVIESSK 777 E+ E E+E VE ++ +K+K E V K+ +T KE E + + +K Sbjct: 114 TEA----PENAEREKEVEMQVAEETEKKKAEAEVGKDVSVETQVGKEGETQEKAEVTENK 169 Query: 778 DTQVEDGG----KMAENGTEETIEIDANIPIDSMDKDNFNGVASDXXXXXXXXXXXXXXV 945 D +E+ G A++G EET + S K+N A V Sbjct: 170 DANMEEAGPEQNNKAQDGAEETKNSNEEREAASA-KENEETAAEVTQNEKEKVEISVENV 228 
Query: 946 -----KEKHGYGENLD---SQITIDSSA----DDMNQDIPISGSETYNIQDPAYGGEATG 1089 +E++G +N S +T+ + ++ N P + E QD E + Sbjct: 229 PQNEAEEENGATDNKQNNPSTVTVGENGGAEKEEGNGSAPATDGEVKEKQDGLENDEKSN 288 Query: 1090 VRENLGEHRSLEEENNKNRVGLVMDNGKINQPGPAH-TPQHQSGATISC 1233 V ++E K G V++NGK++Q G TPQH A++SC Sbjct: 289 VP---------DKETGKEIDGGVIENGKVDQMGRTDTTPQHP--ASVSC 326 >ref|XP_003527162.1| PREDICTED: methyl-CpG-binding domain-containing protein 10-like [Glycine max] Length = 305 Score = 155 bits (393), Expect = 3e-35 Identities = 103/301 (34%), Positives = 143/301 (47%), Gaps = 4/301 (1%) Frame = +1 Query: 244 NNDGLPLELPAPPSWKKRVMPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLKTHDGNPG 423 + + L LELPAPP WKK+ +PKK G KKNE+VF APTG+EI NR+QLEKYLK H G P Sbjct: 12 SEETLSLELPAPPGWKKQFIPKKAGTPKKNEIVFTAPTGEEINNRKQLEKYLKAHPGGPA 71 Query: 424 ISEFDWTTGEAPRRSARISEKVKAMPPPAVLEPXXXXXXXXXXXXXXXEMGDANVEKENM 603 +SEFDW TGE PRRS RISEK KA PP EP + E + Sbjct: 72 VSEFDWGTGETPRRSTRISEKAKA-APPTQREPPKKRTKRSSASQKEISQEEKEEETKEA 130 Query: 604 DKKDMESTIEEKEGLEKENVVENEIEDKEKGEMGVDKEDTAHKKEEEVSGREVIESSKDT 783 + ++ + T + +EKE VV NE DK + V+K + EE +G V + Sbjct: 131 EMQEADDTTKGDNDIEKEKVVVNENHDKSVEDTDVNK--STRYGEEAKAGENV-----EV 183 Query: 784 QVEDGGKMAENGTEETIEIDANIPIDSMDKDNFNGVASDXXXXXXXXXXXXXXVKEKHGY 963 +E+ A +G ++ D D G KE G+ Sbjct: 184 PIEEEKSNAADGELPALK-------DKADDKVTEGSEVFLRKDEEKIEQPQEETKEYSGF 236 Query: 964 G--ENLDSQITIDSS--ADDMNQDIPISGSETYNIQDPAYGGEATGVRENLGEHRSLEEE 1131 G E L++ T D + + +N++ + + + + GE G + N EH L+E Sbjct: 237 GEPEKLETCTTADKTVEVEGVNKEDHVKSTHEFEV------GEIEGTKVNGEEHHKLDEI 290 Query: 1132 N 1134 N Sbjct: 291 N 291 >ref|XP_002892861.1| hypothetical protein ARALYDRAFT_312525 [Arabidopsis lyrata subsp. lyrata] gi|297338703|gb|EFH69120.1| hypothetical protein ARALYDRAFT_312525 [Arabidopsis lyrata subsp. 
lyrata] Length = 357 Score = 152 bits (383), Expect = 5e-34 Identities = 95/213 (44%), Positives = 124/213 (58%), Gaps = 11/213 (5%) Frame = +1 Query: 238 MENNDGL-PLELPAPPSWKKRVMPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLKTHDG 414 MEN D L +ELPAP SWKK PK+ G +K E+VFVAPTG+EI +R+QLE+YLK H G Sbjct: 1 MENTDELVSIELPAPASWKKLFYPKRAGTPRKTEIVFVAPTGEEISSRKQLEQYLKAHPG 60 Query: 415 NPGISEFDWTTGEAPRRSARISEKVKAMPPPAVLEPXXXXXXXXXXXXXXXEMGDANVE- 591 NP ISEF+WTTGE PRRS+RIS+KVKA P EP E + N E Sbjct: 61 NPLISEFEWTTGETPRRSSRISQKVKATTPTPDKEPLLKKRRSSLTKKDNKEAAEKNEEA 120 Query: 592 --KENMD-KKD--MESTIEEKEGLEKENVVENEIEDKEKGEMGVDKEDTAHKKEEEVSGR 756 KENMD KD E+ +EKEG+ + E E ++ KGE +K + +K+ E V+ + Sbjct: 121 AVKENMDVDKDGQTENAEKEKEGVTEIAEAEKENKEGVKGEKEDEKAEAENKEAEVVTDK 180 Query: 757 ----EVIESSKDTQVEDGGKMAENGTEETIEID 843 EV S + + E GG G EE +++ Sbjct: 181 KESMEVDTSELEKKTESGG-----GAEEPSKVE 208 >ref|XP_003527196.1| PREDICTED: methyl-CpG-binding domain-containing protein 10-like [Glycine max] Length = 306 Score = 150 bits (380), Expect = 1e-33 Identities = 99/306 (32%), Positives = 144/306 (47%), Gaps = 5/306 (1%) Frame = +1 Query: 262 LELPAPPSWKKRVMPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLKTHDGNPGISEFDW 441 LELPAPP WKK+ +PKK G KKNE+VF +PTG+EI +R+QLEKYLK H G P +SEFDW Sbjct: 18 LELPAPPGWKKKFIPKKAGTPKKNEIVFTSPTGEEINSRKQLEKYLKAHPGGPAVSEFDW 77 Query: 442 TTGEAPRRSARISEKVKAMPPPAVLEPXXXXXXXXXXXXXXXEMGDANVEKENMDKKDME 621 TGE PRRS RISEK K + PPA EP + + E + + ++ + Sbjct: 78 GTGETPRRSTRISEKAK-VAPPAESEPPKKRTKRSSASQKETSLEEKEEETKEAEMQEAD 136 Query: 622 STIEEKEGLEKENVVENEIEDKEKGEMGVDKEDTAHKKE-EEVSGREVIESSKDTQVEDG 798 T ++ +EKE E +D +K D +AH E + V EV+ + + DG Sbjct: 137 DTTKDDNVIEKEKDFVKENQD-DKSVENTDVNKSAHSGEAKPVENVEVLIEEEKSNAADG 195 Query: 799 GKMAENGTEETIEIDANIPIDSMDKDNFNGVASDXXXXXXXXXXXXXXVKEKHGYG--EN 972 A D +D G KE G+G E Sbjct: 196 ELPALK--------------DKVDDKRTEGSEDFLRKDEEKIEHPQEETKEYRGFGEQEK 241 Query: 973 LDSQITIDS--SADDMNQDIPISGSETYNIQDPAYGGEATGVRENLGEHRSLEEENNKNR 1146 L++ +T D + +N++ + + ++++ E G + N EH L+E N K Sbjct: 242 
LETCLTADKRVEVEGVNKEEHVKSTCEFDVE------EIEGTKVNSEEHHKLDEINKKAE 295 Query: 1147 VGLVMD 1164 L ++ Sbjct: 296 AELTVN 301 >gb|EOY25807.1| Methyl-CPG-binding domain 10 [Theobroma cacao] Length = 331 Score = 150 bits (378), Expect = 2e-33 Identities = 125/375 (33%), Positives = 163/375 (43%), Gaps = 43/375 (11%) Frame = +1 Query: 238 MENNDGLPLELPAPPSWKKRVMPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLKTHDGN 417 ME+ + + +ELPAP SWKK PKK G +K E++FVAPTG+EI NR+QLE+YLK+H GN Sbjct: 1 MESLEVISVELPAPASWKKMYFPKKVGSPRKTEIMFVAPTGEEINNRKQLEQYLKSHPGN 60 Query: 418 PGISEFDWTTGEAPRRSARISEKVKAMPPPAVLEPXXXXXXXXXXXXXXXEMGDANVEKE 597 P I+EFDW TGE PRRSARISEK KA P P EKE Sbjct: 61 PPIAEFDWGTGETPRRSARISEKAKATPTP---------------------------EKE 93 Query: 598 NMDKKDMESTIEEKEGLEKENVVENEIEDKEKGEMGVDKEDTAHKKEEEVSGREVIESSK 777 K+ S +KE E E V E K +GE G +K+DT +E G E + + Sbjct: 94 PPKKRGRRSLSAKKENKETEAVPE-----KAEGEKGSEKQDTQATVKETAEG-EKEKDGE 147 Query: 778 DTQVEDGG-------------KMAENGTEETIEIDANIPIDSMDKDNFNGVASDXXXXXX 918 QVE+GG KM E G EE ++ I KD+ G A+ Sbjct: 148 VKQVENGGKTKAADLTGDSDAKMEEPGEEEA---GKDVKIPDTAKDDKKGEAA-----GT 199 Query: 919 XXXXXXXXVKEKHGYGENLDSQITIDSSADDMNQDIPISGSETYN------------IQD 1062 +EK D T + + + +P + +E N + Sbjct: 200 AENPTSMEFQEKPAEACCTDGTPTEEEKVEAPIEKVPQTQAEKENGTCEKQSENPETVTM 259 Query: 1063 PAYGG------------------EATGVRENLGEHRSLEEENNKNRVGLVMDNGKINQPG 1188 A GG E GV E G+ EE K G +++NGK+ G Sbjct: 260 EANGGVEKENPYGATFVPEGEAKEKQGVLEVSGKCNVQVEEKGKAVDGELIENGKV---G 316 Query: 1189 PAHTPQHQSGATISC 1233 PQ A ISC Sbjct: 317 QTDAPQPPGPAAISC 331 >gb|EMJ17007.1| hypothetical protein PRUPE_ppa009989mg [Prunus persica] Length = 268 Score = 149 bits (375), Expect = 4e-33 Identities = 91/212 (42%), Positives = 116/212 (54%), Gaps = 10/212 (4%) Frame = +1 Query: 250 DGLPLELPAPPSWKKRVMPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLKTHDGNPGIS 429 D + +EL APP+WKK+ PKKGG +KNE+VF++PTG+EI NR+QLE+YLK+H GNP IS Sbjct: 6 DEVSVELSAPPAWKKKFFPKKGGTPRKNEIVFISPTGEEINNRKQLEQYLKSHPGNPAIS 65 Query: 430 
EFDWTTGEAPRRSARISEKVKAMPPPAVLEPXXXXXXXXXXXXXXXEMGDANVEKEN-MD 606 EFDW+TGE PRRSARISEKVK P P P E V+ N + Sbjct: 66 EFDWSTGETPRRSARISEKVKVAPAPESEPPKKRGRKSSGSKDNKMEAAGGEVDGTNEIQ 125 Query: 607 KKDMESTIEEKEGLEKENVVENEIEDKEKGEMGVDKEDTAHKKEEEVSGREVIESSKDTQ 786 KD E + EK E E + E E KEK D +D A + E + EV ++ +D Sbjct: 126 MKDAE--VAEKRDAEAEKEKDAEAE-KEK-----DADDKAATEITENNKEEVPQAGEDQA 177 Query: 787 VEDGGK---------MAENGTEETIEIDANIP 855 GK + ENG E +DA P Sbjct: 178 NGTCGKKQDETAAVTVEENGAAEKENVDAAAP 209 >ref|XP_004508481.1| PREDICTED: methyl-CpG-binding domain-containing protein 10-like [Cicer arietinum] Length = 314 Score = 148 bits (373), Expect = 7e-33 Identities = 83/214 (38%), Positives = 116/214 (54%), Gaps = 18/214 (8%) Frame = +1 Query: 250 DGLPLELPAPPSWKKRVMPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLKTHDGNPGIS 429 + + +ELPAPP W K+ PKK G KKNE+VF APTG+EI N++QLEKYLK H G P IS Sbjct: 22 ESVSMELPAPPGWNKKFFPKKSGTPKKNEIVFTAPTGEEISNKKQLEKYLKAHPGGPNIS 81 Query: 430 EFDWTTGEAPRRSARISEKVKAMPPPAVLEPXXXXXXXXXXXXXXXEMGDANVEKENMDK 609 EFDW TGE PRRSARISEK KA PP + E ++++ Sbjct: 82 EFDWGTGETPRRSARISEKAKAAPPAESESEPPKKRVKKSPASKKGASEEEVEETKDVEM 141 Query: 610 KDMESTIEEKEGLEKENVVENEIEDKEKGEMGVDKEDTAHKKEEEVSGREV--------- 762 K+ E T ++K+ LE+E + NE ED++ E D ++ H +E G + Sbjct: 142 KEAEETKDDKD-LEQEKKIVNENEDEKGVEDAADVKEPIHPEESTHHGENIDIANDEGKS 200 Query: 763 ------IESSK---DTQVEDGGKMAENGTEETIE 837 +++SK D + +G ++ +N EE E Sbjct: 201 NTADGELQASKEKVDDKGTEGSEVVQNKDEEKFE 234 >ref|XP_004251723.1| PREDICTED: uncharacterized protein LOC101259218 [Solanum lycopersicum] Length = 434 Score = 148 bits (373), Expect = 7e-33 Identities = 87/220 (39%), Positives = 122/220 (55%), Gaps = 17/220 (7%) Frame = +1 Query: 229 SNNMENNDGLPLELPAPPSWKKRVMPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLKTH 408 + ++E N+ + +ELPAP W K+ +PKKGG KKN++VF APTG+EI R+QLE+YLK+H Sbjct: 2 AKSVEKNEVVSIELPAPSGWSKKFLPKKGGTPKKNDIVFTAPTGEEITTRKQLEQYLKSH 61 Query: 409 DGNPGISEFDWTTGEAPRRSARISEKVKAMPPPAVLEPXXXXXXXXXXXXXXXEMGDANV 588 G P ++EFDW TGE PRRSARI+ K KA P EP + + Sbjct: 62 
PGGPPVAEFDWGTGETPRRSARITGKAKATQSPTESEPAKKRGRKSSASKKDSKDKEVTK 121 Query: 589 EKENMDKKDME---------STIEEKEGLE-KENVVENEIEDKEKGEMGVDKEDTAHKKE 738 E E DME + +E +E +E KEN NE +D GE V+K+D E Sbjct: 122 ETEAAKDDDMEEAEKHEKDTAAMESEEDVEKKENENPNETQD---GESEVEKKDEIQSSE 178 Query: 739 EEV------SGREVIESSKDTQVEDGGKMAEN-GTEETIE 837 ++V G+ V + +D QVE +MA+N G + +E Sbjct: 179 KDVVKENLDEGQNVHDKVEDAQVEKDVQMADNVGPSQDVE 218 >gb|EOY31101.1| Methyl-CPG-binding domain 10, putative isoform 1 [Theobroma cacao] gi|508783846|gb|EOY31102.1| Methyl-CPG-binding domain 10, putative isoform 1 [Theobroma cacao] Length = 331 Score = 146 bits (369), Expect = 2e-32 Identities = 95/226 (42%), Positives = 126/226 (55%), Gaps = 10/226 (4%) Frame = +1 Query: 250 DGLPLELPAPPSWKKRVMPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLKTHDGNPGIS 429 D + LELPAPP WKK+ MPKKGG KKNE++F APTG+EI NR+QLE+YLK H G P +S Sbjct: 16 DVVCLELPAPPGWKKKFMPKKGGTPKKNEILFTAPTGEEISNRKQLEQYLKAHPGGPAVS 75 Query: 430 EFDWTTGEAPRRSARISEKVKAMPPPAVLEP--XXXXXXXXXXXXXXXEMGDANVEKENM 603 FDW TGE PRRSARISEKVKAMP P P E G E+ Sbjct: 76 VFDWGTGETPRRSARISEKVKAMPTPESEPPKKRGRKSSASKKDNKETETGPEGTEETKD 135 Query: 604 D-KKDMESTIEEKEG-LEKENVVENEIEDKEKGEMGVDKEDTAHKKEEEVSGREVIESSK 777 D ++ E + ++ EG K V ENE E+K K + G K ++ +EV E +++ Sbjct: 136 DHMQEAEKSEKDNEGEAGKVAVQENENENKNKTQDGDGKTEST---PQEVKLGE--DANV 190 Query: 778 DTQVEDGGKMAE------NGTEETIEIDANIPIDSMDKDNFNGVAS 897 T VE G + A+ ++ +E DA+ + +K+ G AS Sbjct: 191 STNVEYGTESADAASKKLKNPKDGVEADAS-GVAEKEKEGSEGTAS 235 >ref|NP_563971.1| methyl-CPG-binding domain 10 [Arabidopsis thaliana] gi|75215632|sp|Q9XI36.1|MBD10_ARATH RecName: Full=Methyl-CpG-binding domain-containing protein 10; Short=AtMBD10; Short=MBD10; AltName: Full=Methyl-CpG-binding protein MBD10 gi|5103831|gb|AAD39661.1|AC007591_26 ESTs gb|H37032, gb|R6425, gb|Z34651, gb|N37268, gb|AA713172 and gb|Z34241 come from this gene [Arabidopsis thaliana] gi|20453139|gb|AAM19811.1| At1g15340/F9L1_28 [Arabidopsis thaliana] gi|56382007|gb|AAV85722.1| At1g15340 [Arabidopsis thaliana] 
gi|332191184|gb|AEE29305.1| methyl-CPG-binding domain 10 [Arabidopsis thaliana] Length = 384 Score = 146 bits (368), Expect = 2e-32 Identities = 89/204 (43%), Positives = 115/204 (56%), Gaps = 7/204 (3%) Frame = +1 Query: 238 MENNDGL-PLELPAPPSWKKRVMPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLKTHDG 414 MEN D L +ELPAP SWKK PK+ G +K E+VFVAPTG+EI +R+QLE+YLK H G Sbjct: 1 MENTDELVSIELPAPASWKKLFYPKRAGTPRKTEIVFVAPTGEEISSRKQLEQYLKAHPG 60 Query: 415 NPGISEFDWTTGEAPRRSARISEKVKAMPPPAVLEPXXXXXXXXXXXXXXXEMGDANVE- 591 NP ISEF+WTTGE PRRS+RIS+KVKA P EP E + N E Sbjct: 61 NPVISEFEWTTGETPRRSSRISQKVKATTPTPDKEPLLKKRRSSLTKKDNKEAAEKNEEA 120 Query: 592 --KENM--DKKDMESTIEEKEGLEKENVVENEIEDKEKGEMGVDKEDTAHKKEEEV-SGR 756 KENM DK E ++ EKE V E +KE E + + +K+ E+ +G+ Sbjct: 121 AVKENMDVDKDGKTENAEAEKEKEKEGVTEIAEAEKENNEGEKTEAEKVNKEGEKTEAGK 180 Query: 757 EVIESSKDTQVEDGGKMAENGTEE 828 E + + E G+ AE +E Sbjct: 181 EGQTEIAEAEKEKEGEKAEAENKE 204 >gb|ESW27012.1| hypothetical protein PHAVU_003G166200g [Phaseolus vulgaris] Length = 341 Score = 145 bits (365), Expect = 6e-32 Identities = 84/217 (38%), Positives = 119/217 (54%), Gaps = 9/217 (4%) Frame = +1 Query: 256 LPLELPAPPSWKKRVMPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLKTHDGNPGISEF 435 L LELPAP WKK+ PKK G KKNE+ F APTG+EI NR+QLE+YLK H G P +SEF Sbjct: 39 LSLELPAPSGWKKKFFPKKYGTPKKNEIAFTAPTGEEIHNRKQLEQYLKAHPGGPAVSEF 98 Query: 436 DWTTGEAPRRSARISEKVKAMPPPAVLEPXXXXXXXXXXXXXXXEMGD----ANVEKENM 603 DW TGE PRRSARISEK K PPP P + + +V+ E Sbjct: 99 DWGTGETPRRSARISEKAKTTPPPEYEPPKKRGKKSPALKKEASQEEEKDETKDVQMEEA 158 Query: 604 DKKDMESTIEEKEGLEKENVVENEIEDKEKGEMGVDKEDTAHKKEEE--VSGREVIESSK 777 D+ E +E+++ + +EN E ED + E +E T K++E + + +SK Sbjct: 159 DETKDEKDLEQEKNVVEENQDEKRAEDTDGKESTHPEESTDKPKDDENFKTADAELHASK 218 Query: 778 ---DTQVEDGGKMAENGTEETIEIDANIPIDSMDKDN 879 + ++ +G ++ ++ EE IE P++ KD+ Sbjct: 219 EKLEDKLVEGSEVVQSKDEEKIE----QPLEETKKDD 251 >gb|EMJ03445.1| hypothetical protein PRUPE_ppa008789mg [Prunus persica] Length = 319 Score = 144 bits (364), Expect = 7e-32 Identities = 90/223 (40%), 
Positives = 117/223 (52%), Gaps = 10/223 (4%) Frame = +1 Query: 226 ASNNMENNDGLPLELPAPPSWKKRVMPKKGGKIKKNEVVFVAPTGDEIRNRRQLEKYLKT 405 +S E + + LELPAP W K+ +PK+ G KKNE++F APTG+EI N+RQLE+YLK Sbjct: 3 SSVEKEGEEVVSLELPAPSGWVKKFLPKQSGTPKKNEIIFTAPTGEEITNKRQLEQYLKA 62 Query: 406 HDGNPGISEFDWTTGEAPRRSARISEKVKAMPPPAVLEPXXXXXXXXXXXXXXXEMGDAN 585 H G P +SEFDW+TGE PRRSARISEK KA PPP Sbjct: 63 HPGGPAVSEFDWSTGETPRRSARISEKAKATPPP-------------------------- 96 Query: 586 VEKENMDKKDMESTIEEKEGLEK----ENVVENEIEDKEKGEMGVDKEDTAHKKEEEVSG 753 E E K+ +ST +K+ EK E E +I D + E EDT +K++ Sbjct: 97 -ESEPPKKRSRKSTSAKKDSKEKQAGPEGAEETKISDVQAAEKAEKVEDTEMEKDD---- 151 Query: 754 REVIESSKDTQVEDGGKMAENGT-----EET-IEIDANIPIDS 864 KD Q E+ A+ T EET +E +ANIP D+ Sbjct: 152 ------VKDNQDEEKAPDADTKTEVAQPEETKVEQEANIPGDA 188