BLASTX nr result
ID: Cephaelis21_contig00001056
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00001056 (1470 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002267149.1| PREDICTED: uncharacterized protein LOC100249... 170 8e-40 ref|NP_563971.1| methyl-CPG-binding domain 10 [Arabidopsis thali... 163 1e-37 ref|XP_002517633.1| Nucleosome-binding protein, putative [Ricinu... 161 5e-37 gb|AAN60295.1| unknown [Arabidopsis thaliana] 160 1e-36 ref|NP_001240074.1| uncharacterized protein LOC100776785 [Glycin... 156 1e-35 >ref|XP_002267149.1| PREDICTED: uncharacterized protein LOC100249094 isoform 1 [Vitis vinifera] Length = 324 Score = 170 bits (431), Expect = 8e-40 Identities = 122/306 (39%), Positives = 164/306 (53%), Gaps = 17/306 (5%) Frame = -1 Query: 1197 LPAPASWKKLYLPKKAGTPRKSEIVFVAPTGEEISTRRQLDQYLKSHPGSPSISEFDWST 1018 LPAP SWKK+++PKK GTPRK+EIVF+APTGEEI++R+QL+QYLKSHPG+P+ISEFDW T Sbjct: 12 LPAPPSWKKMFMPKK-GTPRKNEIVFIAPTGEEINSRKQLEQYLKSHPGNPAISEFDWGT 70 Query: 1017 GETPRRSARISEKAXXXXXXXXXXXXXKRARTSVGTKKDKEST------NDGNXXXXXXX 856 GETPRRSARISEKA KR R S G+KKD + T +G Sbjct: 71 GETPRRSARISEKA-KATPPAESEPPKKRGRKSSGSKKDGKETEAATEEQEGKKEISMQD 129 Query: 855 KMPVAGANEKKDDGGDKPSDTPAVTQGEEEGKAHYEDRNVTE--PKLKVNGQEENSATDV 682 AN + +D + V E + D N+ E P ++G++E +D Sbjct: 130 ADVTEKANAESEDVSKEIQVENGVKTVAEADQVKNLDVNMEEAGPVEAIDGKDEKIQSDT 189 Query: 681 GTEK-ARDEGSTADTVVANQPSE-KVEQKLESKAIQKPEIEAGKDASADTAEQDKVGNDS 508 G K A E + A E K ++ E+ A+ +P EAG A E DK+ + Sbjct: 190 GDSKVAATETEVVNAEEAQGEKEVKKQEVAEAVAVDEPAKEAG--AKVTQKEGDKLETSA 247 Query: 507 VVVNNGA--EGQRTDVVVPSVGEIKGQ----ENDGKLKAQVERE-NNMKGVVIENGKVDQ 349 V N A + + + + E+K + +NDGK K QVE ++G V ENGKV+Q Sbjct: 248 TVELNEAVNKDKPNGLGIAPEEEVKEKQEVPDNDGKCKFQVEENGKKLEGDVTENGKVNQ 307 Query: 348 SGPRET 331 ET Sbjct: 308 MQRAET 313 >ref|NP_563971.1| methyl-CPG-binding domain 10 [Arabidopsis thaliana] gi|75215632|sp|Q9XI36.1|MBD10_ARATH RecName: Full=Methyl-CpG-binding domain-containing protein 10; Short=AtMBD10; Short=MBD10; AltName: Full=Methyl-CpG-binding protein MBD10 gi|5103831|gb|AAD39661.1|AC007591_26 ESTs gb|H37032, gb|R6425, gb|Z34651, gb|N37268, gb|AA713172 and gb|Z34241 come from this gene [Arabidopsis thaliana] gi|20453139|gb|AAM19811.1| At1g15340/F9L1_28 [Arabidopsis thaliana] gi|56382007|gb|AAV85722.1| At1g15340 [Arabidopsis thaliana] gi|332191184|gb|AEE29305.1| methyl-CPG-binding domain 10 [Arabidopsis thaliana] Length = 384 Score = 163 bits (412), Expect = 1e-37 Identities = 100/275 (36%), Positives = 143/275 (52%), Gaps = 4/275 (1%) Frame = -1 Query: 1197 LPAPASWKKLYLPKKAGTPRKSEIVFVAPTGEEISTRRQLDQYLKSHPGSPSISEFDWST 1018 LPAPASWKKL+ PK+AGTPRK+EIVFVAPTGEEIS+R+QL+QYLK+HPG+P ISEF+W+T Sbjct: 12 LPAPASWKKLFYPKRAGTPRKTEIVFVAPTGEEISSRKQLEQYLKAHPGNPVISEFEWTT 71 Query: 1017 GETPRRSARISEKAXXXXXXXXXXXXXKRARTSVGTKKDKESTNDGNXXXXXXXKMPVAG 838 GETPRRS+RIS+K K+ R+S+ TKKD + + N M V Sbjct: 72 GETPRRSSRISQKVKATTPTPDKEPLLKKRRSSL-TKKDNKEAAEKNEEAAVKENMDVDK 130 Query: 837 ANEKKDDGGDKPSDTPAVTQGEEEGKAHYEDRNVTEPKLKVNGQE----ENSATDVGTEK 670 + ++ +K + VT+ E K + E K+ G++ + T++ + Sbjct: 131 DGKTENAEAEKEKEKEGVTEIAEAEKENNEGEKTEAEKVNKEGEKTEAGKEGQTEIAEAE 190 Query: 669 ARDEGSTADTVVANQPSEKVEQKLESKAIQKPEIEAGKDASADTAEQDKVGNDSVVVNNG 490 EG A+ N+ +E V K ES + E+E + E KV Sbjct: 191 KEKEGEKAE--AENKEAEVVRDKKESMEVDTSELEKKAGSGEGAEEPSKVEGLKDTEMKE 248 Query: 489 AEGQRTDVVVPSVGEIKGQENDGKLKAQVERENNM 385 A+ T+ V + EN G + + E N+ Sbjct: 249 AQEVVTEADVEKKPAEEKTENKGSVTTEANGEQNV 283 >ref|XP_002517633.1| Nucleosome-binding protein, putative [Ricinus communis] gi|223543265|gb|EEF44797.1| Nucleosome-binding protein, putative [Ricinus communis] Length = 331 Score = 161 bits (407), Expect = 5e-37 Identities = 113/310 (36%), Positives = 164/310 (52%), Gaps = 26/310 (8%) Frame = -1 Query: 1197 LPAPASWKKLYLPKKAGTPRKSEIVFVAPTGEEISTRRQLDQYLKSHPGSPSISEFDWST 1018 LPAP+SWKK+Y PK+AGTPRKSEI+F++PTGEEI++R+QL+QYLKSHPG+P I+EFDW T Sbjct: 12 LPAPSSWKKMYFPKRAGTPRKSEIMFISPTGEEINSRKQLEQYLKSHPGNPPIAEFDWGT 71 Query: 1017 GETPRRSARISEKAXXXXXXXXXXXXXKRARTSVGTKKDK---ESTNDGNXXXXXXXKMP 847 GETPRRSARISEK + ++S G+KKD ES ++ Sbjct: 72 GETPRRSARISEKVKATPTPEKEPAKKRGRKSSSGSKKDNKETESAHEKGEYEKEIQMQD 131 Query: 846 VAGANEKKDDGGDKPSDTP---AVTQGEEEGKA-HYEDRNVTEPKLKVNGQEENSATDVG 679 GA ++ ++ G K +D + +G+++ +A E+ ++ E K ++ D G Sbjct: 132 ADGAGKENEEAG-KENDVAKEGLIEKGDKKEEAGQTENADIEETAQKQVNKDTGVQEDAG 190 Query: 678 TEKARDEGSTADTVVANQPSEKVEQKLESKAIQKPEIEAGK-DASADTAEQDKVGNDS-- 508 +KA E + Q E E+ L+ K+ + EAG + A+ Q +VG ++ Sbjct: 191 EDKAGPE--NLQQAMEVQEQENPEEALKKKSAE----EAGSGEGIAENVLQTEVGKENDQ 244 Query: 507 -----------VVVNNGAEGQRTDVVVP-SVGEIKG----QENDGKLKAQVERENNMKGV 376 N GA + + VP S EIK QE D + + M G Sbjct: 245 GDKMDIPESVPKEANGGAAKENANGAVPVSEEEIKEKPDLQEKDNTPVDGISK--TMDGE 302 Query: 375 VIENGKVDQS 346 V ENGKV+Q+ Sbjct: 303 VTENGKVNQT 312 >gb|AAN60295.1| unknown [Arabidopsis thaliana] Length = 346 Score = 160 bits (404), Expect = 1e-36 Identities = 99/275 (36%), Positives = 140/275 (50%), Gaps = 4/275 (1%) Frame = -1 Query: 1197 LPAPASWKKLYLPKKAGTPRKSEIVFVAPTGEEISTRRQLDQYLKSHPGSPSISEFDWST 1018 LPAPASWKKL+ PK+AGTPRK+EIVFVAPTGEEIS+R+QL+QYLK+HPG+P ISEF+W+T Sbjct: 12 LPAPASWKKLFYPKRAGTPRKTEIVFVAPTGEEISSRKQLEQYLKAHPGNPVISEFEWTT 71 Query: 1017 GETPRRSARISEKAXXXXXXXXXXXXXKRARTSVGTKKDKESTNDGNXXXXXXXKMPVAG 838 GETPRRS+RIS+K + R S TKKD + + N M V Sbjct: 72 GETPRRSSRISQKV----KATPDKEPLLKKRRSSLTKKDNKEAAEKNEEAAVKENMDVDK 127 Query: 837 ANEKKDDGGDKPSDTPAVTQGEEEGKAHYEDRNVTEPKLKVNGQE----ENSATDVGTEK 670 + ++ +K + VT+ E K + E K+ G++ + T++ + Sbjct: 128 DGKTENAEAEKEKEKEGVTEIAEAEKENNEGEKTEAEKVNKEGEKTEAGKEGQTEIAEAE 187 Query: 669 ARDEGSTADTVVANQPSEKVEQKLESKAIQKPEIEAGKDASADTAEQDKVGNDSVVVNNG 490 EG A+ N+ +E V K ES + E+E + E KV Sbjct: 188 KEKEGEKAE--AENKEAEVVRDKKESMEVDTSELEKKAGSGEGAEEPSKVEGLKDTEMKE 245 Query: 489 AEGQRTDVVVPSVGEIKGQENDGKLKAQVERENNM 385 A+ T+ V + EN G + + E N+ Sbjct: 246 AQEVVTEADVEKKPAEEKTENKGSVTTEANGEQNV 280 >ref|NP_001240074.1| uncharacterized protein LOC100776785 [Glycine max] gi|255645971|gb|ACU23474.1| unknown [Glycine max] Length = 275 Score = 156 bits (395), Expect = 1e-35 Identities = 99/232 (42%), Positives = 126/232 (54%), Gaps = 7/232 (3%) Frame = -1 Query: 1197 LPAPASWKKLYLPKKAGTPRKSEIVFVAPTGEEISTRRQLDQYLKSHPGSPSISEFDWST 1018 LPAP+ W KL+ PKK GTPRKSEIVF+APTGEEIST++QL+QYLK+HPG+P ISEFDW T Sbjct: 19 LPAPSGWNKLFFPKKLGTPRKSEIVFIAPTGEEISTKKQLEQYLKAHPGNPVISEFDWGT 78 Query: 1017 GETPRRSARISEKAXXXXXXXXXXXXXKRARTSVGTKKD-KESTNDGNXXXXXXXKMPVA 841 GETPRRSARISEK KRAR S G+KKD KE+ + Sbjct: 79 GETPRRSARISEKV-KSTPPADSDTPKKRARKSSGSKKDNKETESASEEGKAKSDTEDPK 137 Query: 840 GANEKKDDGGDKPSDTPAVTQGEEEGKAHYEDRNVTEPKLKVNGQEENSATDVGTEKARD 661 A E+K++G D + Q E K D +P + + EEN D + D Sbjct: 138 AAEEEKNEGND--NSNSGGKQLENGDKTEQIDEQAKKPDVDM---EENDLNDTNNKLEND 192 Query: 660 EGS------TADTVVANQPSEKVEQKLESKAIQKPEIEAGKDASADTAEQDK 523 + V+A +P + QK E + +K EA A+TAE +K Sbjct: 193 SDEIKNSHVNGENVIAERPEGEEAQKQEVEPAEKVAEEA-----ANTAETEK 239