BLASTX nr result
ID: Ephedra25_contig00005512
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00005512 (1899 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002284140.2| PREDICTED: uncharacterized protein LOC100248...   137   2e-29
ref|XP_004238529.1| PREDICTED: uncharacterized protein LOC101267...   135   6e-29
ref|XP_006578321.1| PREDICTED: uncharacterized protein LOC100780...   129   4e-27
ref|XP_006364865.1| PREDICTED: uncharacterized protein LOC102582...   129   6e-27
ref|XP_002513931.1| hypothetical protein RCOM_1035820 [Ricinus c...   122   4e-25
ref|XP_006853188.1| hypothetical protein AMTR_s00038p00204530 [A...   122   7e-25
ref|XP_006581536.1| PREDICTED: uncharacterized protein LOC102665...   121   1e-24
ref|NP_173650.3| methyl-CPG-binding domain-containing protein [A...   116   4e-23
ref|XP_002893218.1| methyl-CpG-binding domain 8 [Arabidopsis lyr...   113   3e-22
gb|EOY15980.1| Uncharacterized protein isoform 1 [Theobroma caca...   112   4e-22
gb|ESW08251.1| hypothetical protein PHAVU_009G031600g [Phaseolus...   111   1e-21
ref|XP_006416210.1| hypothetical protein EUTSA_v10007200mg [Eutr...   105   5e-20
ref|XP_004292482.1| PREDICTED: uncharacterized protein LOC101298...   101   1e-18
ref|XP_002525855.1| hypothetical protein RCOM_0824380 [Ricinus c...    98   1e-17
gb|EMJ00867.1| hypothetical protein PRUPE_ppa001455mg [Prunus pe...    98   1e-17
ref|XP_006661339.1| PREDICTED: dentin sialophosphoprotein-like, ...    97   2e-17
ref|XP_004957094.1| PREDICTED: uncharacterized protein LOC101759...    97   2e-17
ref|NP_001123600.1| LOC100170247 [Zea mays] gi|189514249|gb|ACE0...    97   2e-17
ref|XP_002300183.1| hypothetical protein POPTR_0001s31990g [Popu...
97 3e-17 emb|CBI19167.3| unnamed protein product [Vitis vinifera] 96 4e-17 >ref|XP_002284140.2| PREDICTED: uncharacterized protein LOC100248904 [Vitis vinifera] Length = 947 Score = 137 bits (345), Expect = 2e-29 Identities = 131/475 (27%), Positives = 206/475 (43%), Gaps = 12/475 (2%) Frame = +3 Query: 60 LDFSTIPVVDLHDFSQDEINV----ASQCSDVFRFDDIIVPKIDYSVFQESSASRKQTYS 227 L +P++DL SQ E+ +S SD+ R DD+++PKID S+F ES+ SRKQTYS Sbjct: 12 LHLEALPLIDLRFLSQSELQALSLTSSHSSDLRRCDDVVIPKIDRSIFNESAGSRKQTYS 71 Query: 228 KLRLS-RKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTTHPSGNG 404 +LRL+ RK + A T+P + FS + E +E I+ ++ Sbjct: 72 RLRLAPRKPDIAATIP--RRPRFSPHLNQKAALEPVDEENTLIIGLLK------------ 117 Query: 405 SLFMSDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAASN 584 LF ++ T D+ P + Y NE ++ + + V D Sbjct: 118 GLFATE---------------THADDLIPVQVE-YRESSNEILQNIPIDVVADS------ 155 Query: 585 SAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQQY 764 R +K+ +PK + + N + I + +NG VD A Sbjct: 156 ---GRKRKRGRPKSEKTIAVYQNGGSGEGGGMGI----INNNGVV-----VDVAA----- 198 Query: 765 HDKLTRENNAPHMANANNTATFSPIFKESIFPQLKRK---FTTEPELHTFFNNIGGEWAS 935 +ANA ++ P+L+R+ TTE EL F + G+W S Sbjct: 199 ------------LANA----------EDPFGPELRRRTEGLTTEEELLGFLTGLSGQWGS 236 Query: 936 KLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVFLS 1115 + KKRKIV A DF LPQGWKLLL +++K GR + CR+YISP G QF S KE S L Sbjct: 237 RRKKRKIVEASDFGDVLPQGWKLLLSMKRKEGRVWLFCRRYISPNGQQFVSCKEVSSCLL 296 Query: 1116 TNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGVDMPNASHQNQLKAICSPD 1295 + G +D Q N H + + + + + G G+ + + + ++ L +CS Sbjct: 297 SLS----GLQDARQPNYGHNDENSQ---LAHQISPGNA-AGLTLKDDNSKDGL--VCSSP 346 Query: 1296 AGKKS----QEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIHH 1448 + + EK + + + + C KC T+ + + HL+ H Sbjct: 347 STVTTIPTHHEKQATLLNMGNSWE-VKVGEILKCHKCAMTFDEKDDLLHHLSSSH 400 >ref|XP_004238529.1| PREDICTED: uncharacterized protein LOC101267888 [Solanum lycopersicum] Length = 1192 Score = 135 bits (340), Expect = 6e-29 Identities = 171/660 (25%), Positives 
= 260/660 (39%), Gaps = 50/660 (7%) Frame = +3 Query: 60 LDFSTIPVVDLHDFSQDEINVASQCSDVF----RFDDIIVPKIDYSVFQESSASRKQTYS 227 L +IP VDL SQ E+ S CS R DD+I+PKID SVF ES+ SRKQTYS Sbjct: 18 LQAESIPTVDLRLLSQSELYSLSLCSPAAFNPCRDDDVIIPKIDRSVFNESAGSRKQTYS 77 Query: 228 KLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTTHPSGNGS 407 +LRL+ P A S + R+ N+ HP N S Sbjct: 78 RLRLA---------PAATASASSAIRSRT-----------------PHLRNSPHPLQNPS 111 Query: 408 LFMSDGALDANSSEINL---QAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAA 578 ++G ++ SS+I Q + P L +++ + + S V L A Sbjct: 112 --PNNGPANSESSQIVTLLKQLFGSGTQKNPTDLVPIRVDYSDSLSVPSHVPVPGLELAN 169 Query: 579 SNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQ 758 S I + +K+ +P++ N N ++ VD V K + Sbjct: 170 VGS-IGQKRKRGRPRK--------NENGVRVAEVK--------------VDEVVKDIVVY 206 Query: 759 QYHDKLTRENNAPHMANANNTATFSPIFKESIFP---QLKRK---FTTEPELHTFFNNIG 920 Q D +E + N + + S+ P +L+R+ + EL F + Sbjct: 207 QNVDDSDKE-----IMNKDGIPVDLAVLGASVDPFGLELRRRTEGLGSAEELLGFLGRLN 261 Query: 921 GEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEA 1100 G+W S KKR+IV+A+DF S LP+ WKLLL +++K GR + CR+YISP G QF + KE Sbjct: 262 GQWGSTRKKRRIVDADDFGSMLPKSWKLLLSIKRKEGRSWLHCRRYISPNGRQFGTCKEV 321 Query: 1101 SVFL-----STNGALPVGRKDVSQVNLDHANSRTHFDPIQ---KKET-----------HG 1223 S +L N LP V + +A + T IQ KKE+ HG Sbjct: 322 SSYLLFLRGERNENLPTYVNGSGTVEITNACALTSDLRIQDGGKKESSVFHNSSPAVGHG 381 Query: 1224 TLNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIV-------AGTKRRYNPSSL----- 1367 L ++ S + K D++ K R S+ Sbjct: 382 ELQVLLNFGELSEVQVGDLLQCDKCNVTFNNKDDLLQHQLFSHQRRKSRNGGQSITDGVI 441 Query: 1368 --NAPMNCKKCNATYPNRSSFMGHLTIHHVKRKKNAEG-IPANTNSVTSLQVQANGTNNI 1538 + C+ C+ T+ + + GH+ H K+ K +G +P V + Sbjct: 442 IRDGKFECQFCHKTFEEKHRYNGHVGNHVKKQVKTVDGSLPIKMGGGIEPVVPSGAMLRE 501 Query: 1539 PNIMAVQAYGQNM-DNVTMVYDKANN-VNSTQIQEDGVNNGKSALHIGNAEDMEKAPGEV 1712 P + +N+ +N ++ D +N +T+IQED + G + G Sbjct: 502 PIMQDSVVLPRNLTENAGVITDAGDNPAPTTKIQEDHMETDNKLEAEGTSNGCHNQEGSS 561 Query: 1713 
VTMSHISSETVRLHTENMENSSTNGNIPHDANCSSMTDIKSPSNSCSKSF-DEKYQCTVD 1889 V+ S ISS + +N P DI +SC S D K+ TVD Sbjct: 562 VSRSPISSNEKTCVDISKVIVGSNIEEPEQEGLLCSNDI---VDSCGVSMEDGKFFPTVD 618 >ref|XP_006578321.1| PREDICTED: uncharacterized protein LOC100780637 isoform X1 [Glycine max] gi|571450041|ref|XP_006578322.1| PREDICTED: uncharacterized protein LOC100780637 isoform X2 [Glycine max] Length = 863 Score = 129 bits (324), Expect = 4e-27 Identities = 118/458 (25%), Positives = 199/458 (43%), Gaps = 14/458 (3%) Frame = +3 Query: 72 TIPVVDLHDFSQDEINV-----ASQCSDVFRFDDIIVPKIDYSVFQESSASRKQTYSKLR 236 ++P+VDL SQ E+ A+ C DD ++PKID S F ES+ SRKQTYSKLR Sbjct: 18 SLPLVDLRLLSQPELYTLSLSGATHCHRRNSDDDSVIPKIDRSNFNESAGSRKQTYSKLR 77 Query: 237 LSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTTHPSGNGSLFM 416 L+++++ +P + H L + E ++E RI+ +Q+ LF Sbjct: 78 LNKRKQNP-AVPASSSFHIPLH-----ISEPEEEENSRIVALLQQ------------LFG 119 Query: 417 SDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAASNSAID 596 + +A ++ + + V + QP +++A N + +V+ Sbjct: 120 VEPLRNAPRNDAAERRLVPVQVDFKQPPPMFAAFQNVPIDVVADSS-------------Q 166 Query: 597 RSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQQYHDKL 776 R +K+ +P++ + N K N T FV+ K Sbjct: 167 RKRKRGRPRK--------DENSVTVFVEEPKKVTKEENSVTVFVEEPKKVNG-------- 210 Query: 777 TRENNAPHMANANNTATFSP---IFKESIFPQLKRK---FTTEPELHTFFNNIGGEWASK 938 N + A A T T + + ++ +LKR+ TEP++ F + GEWAS+ Sbjct: 211 ---NGEVNAAVATTTTTVNETVGLDEDPFEVELKRRTQGLETEPQVVEFLETLNGEWASQ 267 Query: 939 LKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVFLST 1118 KKR+IV A + LP GWK+++ ++ GR CR+Y+SP G QF S KEAS +L + Sbjct: 268 RKKRRIVPASELGDLLPAGWKIVIITMRRAGRASAVCRRYVSPDGHQFESCKEASAYLLS 327 Query: 1119 NGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGVDMPNASHQNQLKAICSPDA 1298 G +D S + +++ + + ++ +P + A P A Sbjct: 328 ----VFGVQDRSHLKSSYSDGAQQLSSSMNRASESSVG---HVPTGDMKTDASASYLPSA 380 Query: 1299 G---KKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNAT 1403 G S EK ++ + N +S + + CK +AT Sbjct: 381 GAPIHSSHEKQPPISSSIGSENFNS-DLALGCKLGDAT 417 
>ref|XP_006364865.1| PREDICTED: uncharacterized protein LOC102582612 isoform X2 [Solanum tuberosum] Length = 1193 Score = 129 bits (323), Expect = 6e-27 Identities = 135/490 (27%), Positives = 208/490 (42%), Gaps = 18/490 (3%) Frame = +3 Query: 60 LDFSTIPVVDLHDFSQDEINVASQCSDVF----RFDDIIVPKIDYSVFQESSASRKQTYS 227 L +IP VDL SQ E+ S CS R DD+I+PKID SVF ES+ SRKQTYS Sbjct: 18 LQAESIPTVDLRLLSQSELYSLSLCSTAAFNPCRDDDVIIPKIDRSVFNESAGSRKQTYS 77 Query: 228 KLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTTHPSGNGS 407 +LRL+ A S S RS N+ HP N S Sbjct: 78 RLRLAPAA----------AASASSSAIRSRTPHLR---------------NSPHPLQNPS 112 Query: 408 LFMSDGALDANSSEINL---QAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAA 578 ++G ++ SS+I + Q + P L +++ + + S V L A Sbjct: 113 --PNNGPANSESSQIVILLKQLFGSGTQKNPTDLVPIRVDYSDSLSVPSHVPVPGLELAN 170 Query: 579 SNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQ 758 S + + +K+ +P++ N+ +K +E+ V + +N+ Sbjct: 171 VGS-VGQKRKRGRPRK----------NENGVRVAEVKVDEV--------VKDIVVYQNVD 211 Query: 759 QYHDKLTRENNAPHMANANNTATFSPIFKESIFPQLKRK---FTTEPELHTFFNNIGGEW 929 ++ ++ P + A A P E L+R+ + EL F + G+W Sbjct: 212 DSDKEIMNKDGIP-VDLAVLGALVDPFGLE-----LRRRTEGLGSAEELLGFLGRLNGQW 265 Query: 930 ASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVF 1109 S KKR+IV+A++F S LP+ WKLLL +++K GR + CR+YISP G QF + KE S + Sbjct: 266 GSTRKKRRIVDADEFGSVLPKSWKLLLSIKRKEGRSWLHCRRYISPNGRQFGTCKEVSSY 325 Query: 1110 L-----STNGALPVGRKDVSQVNLDHANSRTHFDPIQ---KKETHGTLNDGVDMPNASHQ 1265 L LP V + +A + T IQ KKE+ N P H Sbjct: 326 LLFLHGERKENLPAYANGSGTVEITNACALTSDLRIQDGGKKESSVFHNSS---PAVGH- 381 Query: 1266 NQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIH 1445 +L+ + + E S++ G ++C KCN T+ N+ + H Sbjct: 382 GELQVLVN------FGELSEVQVGDL-----------LHCDKCNVTFNNKDDLLQHQLFS 424 Query: 1446 HVKRKKNAEG 1475 H +R+ G Sbjct: 425 HQRRRSRNGG 434 >ref|XP_002513931.1| hypothetical protein RCOM_1035820 [Ricinus communis] gi|223547017|gb|EEF48514.1| hypothetical protein RCOM_1035820 [Ricinus communis] Length = 1337 Score = 
122 bits (307), Expect = 4e-25 Identities = 136/506 (26%), Positives = 209/506 (41%), Gaps = 38/506 (7%) Frame = +3 Query: 60 LDFSTIPVVDLHDFSQDEINVASQCSDVFRFD------DIIVPKIDYSVFQESSASRKQT 221 L ++P++DL SQ E+ S CS F + D+ KID SVF ES+ SRKQT Sbjct: 24 LQMESLPLIDLRLLSQSELLSLSLCSFSFLNNPLQNEADVATLKIDRSVFNESAGSRKQT 83 Query: 222 YSKLRLSRKQEGAETLPGYKAGHFSLSKCRSM-----VDESGKQEAQRILQFIQERLNTT 386 +S+LRL+R+ HFS R+ V+ S +E +I+ I+ Sbjct: 84 FSRLRLARRNNNNS--------HFSTPSIRNQIPHQTVEISQDEENSQIIYLIK------ 129 Query: 387 HPSGNGSLFMSDGALDANSSEINLQAITRVDENCPQPLAV---YSALHNEDVRLVSSEQV 557 SLF S+ + ++E++ + D P+ + AL + V S E Sbjct: 130 ------SLFGSNFENEKENNEVDNVNLFSDDNLISVPITYNESFQALQDLAVADYSDETK 183 Query: 558 TDLVTAASNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTV 737 + TA ++S K+K + + L F+ +N+ + + EE T D+ Sbjct: 184 QAIATAITHSESTAEKRK-RGRPRKNLSDFVGNNNVDGNDNGNEKEEKEETAIT---DSK 239 Query: 738 DKAENIQQYHDKLTRENNAPHMANANNTATF-SPIFKES----------------IFPQL 866 K Q+ L NN AN A +P +E +L Sbjct: 240 RKRGRPQKDASTLGCHNNNNVNANEEKRAVCENPRTQEEEKRGMKVELGSSEEDPYAEEL 299 Query: 867 KRK---FTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRY 1037 +R+ TE EL F + GEW SK KKRKIV+A LP+ WKL+L +++ G + Sbjct: 300 RRRTMGMQTESELLGFLEGLQGEWMSKRKKRKIVDASVLGDVLPRNWKLILCNKRRAGFF 359 Query: 1038 IIECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKET 1217 ++C YISP G QF S KE S + L + VSQ + H +S + Sbjct: 360 WLDCTGYISPNGQQFMSCKEVS-----SNLLSKELQGVSQSSFGHDDSNI--------QL 406 Query: 1218 HGTLNDG--VDMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNA--PMNC 1385 GT++ G D+ +++N I SP + + A T P + NC Sbjct: 407 TGTVSYGNAADLTLKNNKNGGGFISSPALPVTKSVEHEKQATTLAAVVPPHVQTVEKYNC 466 Query: 1386 KKCNATYPNRSSFMGHLTIHHVKRKK 1463 KC + + HL H + K Sbjct: 467 HKCTMAFQEPDDLLQHLLSSHQRAPK 492 >ref|XP_006853188.1| hypothetical protein AMTR_s00038p00204530 [Amborella trichopoda] gi|548856827|gb|ERN14655.1| hypothetical protein AMTR_s00038p00204530 [Amborella trichopoda] Length = 826 Score = 122 bits (305), Expect = 7e-25 Identities = 92/303 (30%), 
Positives = 147/303 (48%), Gaps = 11/303 (3%) Frame = +3 Query: 588 AIDRSKKKLKPKEGARLKAFMN-----SNDAAAHQIPIKPEELRSNGTTNFVDTVDKAEN 752 A+ R K+++ KE AR K M+ + D A + +NG+++F T Sbjct: 260 AVIRQKRRVSKKEDARRKGLMSLAVLENGDRGA---------IDNNGSSDFNQTGIGC-- 308 Query: 753 IQQYHDKLTRENNAPHMANANNTATFSPIFKESIFPQLKRK---FTTEPELHTFFNNIGG 923 H + +N M + +E P LK++ E EL F + +GG Sbjct: 309 ----HGNVRNGDNKEKMLQNGFVEVHALASRELFVPHLKKRTAALENELELVEFLDGLGG 364 Query: 924 EWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEAS 1103 EW +K KKRK+V+A DF GLP GWK++LG+RKK G+ I+CRKYISP G +FA+ KE + Sbjct: 365 EWVTKRKKRKMVDASDFGDGLPDGWKVILGIRKKEGKLFIDCRKYISPTGQKFATCKEVT 424 Query: 1104 VFLST---NGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGVDMPNASHQNQL 1274 L + +G+L V + + N+ + RT TH ++ V P + + + Sbjct: 425 AHLLSEPQDGSLAVSAR--IEENMSGNSMRTRI----SGATHSSMK--VPAPQ-TKEPKC 475 Query: 1275 KAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIHHVK 1454 S D+GK+ I+ + + NP L + C+KCN + ++ +M HL H + Sbjct: 476 NGSISKDSGKQ------II--SHQVDNPIKLT--LECRKCNLNFNSKEVYMHHLLAVHQR 525 Query: 1455 RKK 1463 + K Sbjct: 526 KSK 528 Score = 64.3 bits (155), Expect = 2e-07 Identities = 36/68 (52%), Positives = 46/68 (67%), Gaps = 3/68 (4%) Frame = +3 Query: 60 LDFSTIPVVDLHDFSQDEIN---VASQCSDVFRFDDIIVPKIDYSVFQESSASRKQTYSK 230 L S+IP++DL SQDEI+ + S S DI+VPKID S+F ES SRKQTYS+ Sbjct: 14 LPISSIPLIDLRFLSQDEISSLALLSLPSSNPPLTDIVVPKIDRSIFNESQGSRKQTYSR 73 Query: 231 LRLSRKQE 254 LRLS K++ Sbjct: 74 LRLSHKKQ 81 >ref|XP_006581536.1| PREDICTED: uncharacterized protein LOC102665295 [Glycine max] Length = 871 Score = 121 bits (303), Expect = 1e-24 Identities = 127/455 (27%), Positives = 205/455 (45%), Gaps = 11/455 (2%) Frame = +3 Query: 72 TIPVVDLHDFSQDEINVASQCSDVFRF-----DDIIVPKIDYSVFQESSASRKQTYSKLR 236 ++P+VDL SQ E+ S R DD ++PKID S F ES+ SRKQTYSKLR Sbjct: 16 SLPLVDLRLLSQPELYTLSLSGATHRHRRNSDDDSVIPKIDRSNFNESAGSRKQTYSKLR 75 Query: 237 LSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTTHPSGNGSLFM 416 L+++++ +P + H L + E ++E RI+ + + L P N + Sbjct: 76 
LNKRKQNP-AVPASSSFHIPLH-----ISEPEEEENSRIIALLHQ-LFGVEPLRNNAPRN 128 Query: 417 SDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAASNSAID 596 +D + E L + V+ P P++V + N + D+V S Sbjct: 129 ND------APERRLVPV-HVEFKQPPPISV-ALFQNVPI---------DVVPDGSQ---- 167 Query: 597 RSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQQYHDKL 776 R +K+ +P++ NS + P K + N T FV+ K N ++ Sbjct: 168 RKRKRGRPRKDE------NSVTVFVEE-PTKVTK-EENSLTVFVEEPKKVTNEEK--SVK 217 Query: 777 TRENNAPHMANANNTATFSPIFKESIFP-QLKRK---FTTEPELHTFFNNIGGEWASKLK 944 N + A A T S E +F +LKR+ TE ++ F + GEWAS+ K Sbjct: 218 VNGNGEGNAAVATATVNESVGLDEDLFEVELKRRAQGLETESQVMEFLETLNGEWASQRK 277 Query: 945 KRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVFLSTNG 1124 KR+IV A + LP GWK+++ V ++ GR CR+Y+SP G QF S KEAS +L + Sbjct: 278 KRRIVPATELGDMLPAGWKIVIIVMRRAGRASAVCRRYVSPDGHQFESCKEASAYLLSVS 337 Query: 1125 ALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLN--DGVDMPNASHQNQLKAICSPDA 1298 G +D S + + + + + ++ DM ++ + L + +P Sbjct: 338 ----GVQDRSHLKSSYTDGAQQLSSSMNRASESSVGHVPTGDMKTVANASYLSSAGAPI- 392 Query: 1299 GKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNAT 1403 S EK +V+ + N S + + CK +AT Sbjct: 393 -DSSHEKQPLVSSSIGSENFIS-DLALGCKLGDAT 425 >ref|NP_173650.3| methyl-CPG-binding domain-containing protein [Arabidopsis thaliana] gi|75174757|sp|Q9LME6.1|MBD8_ARATH RecName: Full=Methyl-CpG-binding domain-containing protein 8; Short=AtMBD8; Short=MBD08; AltName: Full=Methyl-CpG-binding protein MBD8 gi|9392683|gb|AAF87260.1|AC068562_7 Contains a Methyl-CpG binding domain PF|01429 and two DNA binding domains with preference for A/T rich regions PF|02178. 
ESTs gb|AI998776, gb|N95984 come from this gene [Arabidopsis thaliana] gi|26452716|dbj|BAC43440.1| unknown protein [Arabidopsis thaliana] gi|332192108|gb|AEE30229.1| methyl-CPG-binding domain-containing protein [Arabidopsis thaliana] Length = 524 Score = 116 bits (290), Expect = 4e-23 Identities = 101/383 (26%), Positives = 166/383 (43%), Gaps = 32/383 (8%) Frame = +3 Query: 60 LDFSTIPVVDLHDFSQDEINVASQCSDVFRF-----------DDIIVPKIDYSVFQESSA 206 L ++P++D SQ E+ SQCS + DD + PKID SVF ES+ Sbjct: 21 LSAESLPLIDTRLLSQSELRALSQCSSLSPSSSASLAASAGGDDDLTPKIDRSVFNESAG 80 Query: 207 SRKQTYSKLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTT 386 SRKQT+ +LRL+R + E P + D+S ++E ++ ++ N Sbjct: 81 SRKQTFLRLRLARHPQPPEEPPSPQRQR----------DDSSREEQTQVASLLRSLFNVD 130 Query: 387 HPSGNGSLFMSDGALDANSSEI--NLQAITRVDENCPQPLAVYSALHNE---------DV 533 + L+ N +I N R + + Q + + N+ + Sbjct: 131 SNQSKEEEDEGEEELEDNEGQIHYNSYVYQRPNLDSIQNVLIQGTSGNKIKRKRGRPRKI 190 Query: 534 RLVSSE-QVTDLVTAASNSA-IDRSKKKLKPKE---GARLKAFMNSNDAAAHQIPIKPEE 698 R S E +V DL AS +D++ L + + NS + P EE Sbjct: 191 RNPSEENEVLDLTGEASTYVFVDKTSSNLGMVSRVGSSGISLDSNSVKRKRGRPPKNKEE 250 Query: 699 L-----RSNGTTNFVDTVDKAENIQQYHDKLTRENNAPHMANANNTATFSPIFKESIFPQ 863 + R + N + DK E + + EN + + + A+ S E + Sbjct: 251 IMNLEKRDSAIVN-ISAFDKEELV------VNLENREGTIVDLSALASVSEDPYEEELRR 303 Query: 864 LKRKFTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYII 1043 + T+ E+ F + GEW + KK+K+VNA D+ LP+GW+L+L +++K ++ Sbjct: 304 ITVGLKTKEEILGFLEQLNGEWVNIGKKKKVVNACDYGGYLPRGWRLMLYIKRKGSNLLL 363 Query: 1044 ECRKYISPAGPQFASWKEASVFL 1112 CR+YISP G QF + KE S +L Sbjct: 364 ACRRYISPDGQQFETCKEVSTYL 386 >ref|XP_002893218.1| methyl-CpG-binding domain 8 [Arabidopsis lyrata subsp. lyrata] gi|297339060|gb|EFH69477.1| methyl-CpG-binding domain 8 [Arabidopsis lyrata subsp. 
lyrata] Length = 511 Score = 113 bits (283), Expect = 3e-22 Identities = 102/387 (26%), Positives = 170/387 (43%), Gaps = 32/387 (8%) Frame = +3 Query: 48 AEVLLDFSTIPVVDLHDFSQDEINVASQCSDVFRF-----------DDIIVPKIDYSVFQ 194 A+ L ++P++D+ SQ E+ S CS + DD + PKID SVF Sbjct: 17 ADNRLSAESLPLIDMRLLSQSELRALSHCSSLSPSSSASLATSAGGDDDLTPKIDRSVFN 76 Query: 195 ESSASRKQTYSKLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQER 374 ES+ SRKQT+ +LRL+R + E P + D+S +E ++ ++ Sbjct: 77 ESAGSRKQTFLRLRLARHPQPTEKPPSPQRQR----------DDSSIEEQTQVAPLLRSL 126 Query: 375 LNTTHPSGNGSLFMSDGALDANSSEI--NLQAITRVDENCPQPLAVYSALHNE------- 527 N + ++ N +I N R + + Q + + NE Sbjct: 127 FNVDSIQSKEEEDEGEEEVEENEGQIHYNSYVYQRPNLDSVQNVLIQGTSGNEIKRKRGR 186 Query: 528 --DVRLVSSE--QVTDLVTAASNSA-IDRSKKKLKPKE---GARLKAFMNSNDAAAHQIP 683 +R S E +V DL AS +D++ L + + + NS + P Sbjct: 187 PRKIRNPSEEDTEVLDLTGEASAYVFVDKTSSNLGIESRFGSSGISMDSNSVKRKRGRPP 246 Query: 684 IKPEELRS--NGTTNFVDT--VDKAENIQQYHDKLTRENNAPHMANANNTATFSPIFKES 851 EE+ + N + V++ +DK E ++ EN + + + A+ S E Sbjct: 247 KNKEEIMNLENRDSAIVNSSALDKEELVKL-------ENREGAIVDLSALASVSEDPYEE 299 Query: 852 IFPQLKRKFTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNG 1031 ++ T+ E+ F + GEW + KK+K+V A D+ LP+GWKL+L ++KK Sbjct: 300 ELRRITVGLKTKEEILVFLEQLNGEWVNIGKKKKVVRACDYGGYLPRGWKLMLYIKKKGS 359 Query: 1032 RYIIECRKYISPAGPQFASWKEASVFL 1112 ++ CR+YISP G QF + KE S +L Sbjct: 360 SLLLACRRYISPDGQQFETCKEVSTYL 386 >gb|EOY15980.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508724084|gb|EOY15981.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1203 Score = 112 bits (281), Expect = 4e-22 Identities = 138/525 (26%), Positives = 207/525 (39%), Gaps = 53/525 (10%) Frame = +3 Query: 60 LDFSTIPVVDLHDFSQDEINVASQCSDV----FRFDDIIVPKIDYSVFQESSASRKQTYS 227 L +IPVVDL SQ E+ S CS ++ PKID SVF ES+ SRKQT+S Sbjct: 14 LHLESIPVVDLRLISQPELLSLSLCSSSPSPSNADTELFTPKIDRSVFNESAGSRKQTFS 73 Query: 228 KLRLSRKQEGA----ETLPGYKAGHFSLSKCRSMVDESG-KQEAQRILQFIQERLNTTHP 392 +LRL+ + + P K SLS+ + V+ +E+ IL ++ Sbjct: 74 
RLRLAAPRNHLPHPHHSSPSSKP-FTSLSQRLNPVNPGPLDEESSNILSLLK-------- 124 Query: 393 SGNGSLFMSDGALDANSSEINLQAITRVDENCPQPLAVY---------SALHNEDVRLVS 545 SLF D +L +N++E D+ P+ + S L N V +VS Sbjct: 125 ----SLFNIDDSLTSNTNEDEPD-----DDKDLVPVQIEYENGKDNGNSVLQNIPVGIVS 175 Query: 546 ------------SEQVTDLVTAASNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIK 689 +Q +L+ + N I+ + E A S +A I Sbjct: 176 CSGSKRKRGRPRKDQKDNLLIESENLVIEEHQ------ETAAFDRVSESVNAGG--ISSC 227 Query: 690 PEELRSNGTTNFVDTVDKAENIQQYHDKLTRENNAPHMANANNTATFSPIFKESIFPQLK 869 E R G ++++N ++ E+ +A N A I +L+ Sbjct: 228 SERKRKRGRPR----KEESQNRVIVSEEKKVESEIERVALGNVEAILG------IEEELR 277 Query: 870 RK---FTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYI 1040 R+ TE EL F + GEWASK +K++IV+A F + LPQGWKL+L V+K+ G Sbjct: 278 RRTEAIGTEAELLEFMGGLEGEWASKSQKKRIVDAAGFGNVLPQGWKLMLFVKKRAGHVW 337 Query: 1041 IECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANS-----RTHFDPIQ 1205 + C +YISP G QF S KE S L + G L + S + S +F I Sbjct: 338 LACSRYISPNGQQFVSCKEVSSCLLSAGELKDSSQSTSSLTGRGIGSGVKPTSENFPIIC 397 Query: 1206 KKETHGTLNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIVA---------------GT 1340 H + M + + + I ++ D + GT Sbjct: 398 TSSEHERQAPLLRMGSPWEVQRAETIKCHKCTMTFNQQDDFICHLLSSHQGTVKSSGHGT 457 Query: 1341 KRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIHHVKRKKNAEG 1475 N C+ C + RS + HL +H K EG Sbjct: 458 STNEEVIIKNGKYECQFCYELFEERSCYSSHLGVHMKNNTKKVEG 502 >gb|ESW08251.1| hypothetical protein PHAVU_009G031600g [Phaseolus vulgaris] Length = 841 Score = 111 bits (277), Expect = 1e-21 Identities = 109/369 (29%), Positives = 166/369 (44%), Gaps = 8/369 (2%) Frame = +3 Query: 30 EAEVKEAEVLLDFSTIPVVDLHDFSQDEINVASQCSDVFRF-----DDIIVPKIDYSVFQ 194 EAEV+ + +D ++P+VDL SQ E+ S R +D +VPKID S F Sbjct: 5 EAEVEPSSDHID--SLPLVDLRLLSQPELYTLSLSGATHRHRRANDNDSVVPKIDRSNFN 62 Query: 195 ESSASRKQTYSKLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQER 374 ES+ SRKQTYSKLRL+ K++ +P + H + E QE +I+ + ++ Sbjct: 63 ESAGSRKQTYSKLRLN-KRKQNFAVPASSSFH---------IPEPVDQENSQIISLL-QQ 111 Query: 375 
LNTTHPSGNGSLFMSDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVRLVSSEQ 554 L P N AL + + + V QP V V+ + Sbjct: 112 LFGVEPLRN--------ALRPDCGDAANHQLFPVHVEFKQPPPV----------TVTFQT 153 Query: 555 VTDLVTAASNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDT 734 V V ASN R +K+ +P++ L + ++ G + Sbjct: 154 VPIDVIDASN----RKRKRGRPRKNENLVSVFEEETKKVNE-----------GRSAVATV 198 Query: 735 VDKAENIQQYHDKLTRENNAPHMANANNTATFSPIFKESIFPQLKRK---FTTEPELHTF 905 +++ + D L +N P F E +LKR+ TEP+L F Sbjct: 199 IERGFGVDA--DGL---DNDP--------------FGE----ELKRRTAGLETEPQLLEF 235 Query: 906 FNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFA 1085 + GEWAS+ KKR+IV A D + LP GWK+++ + ++ GR + CR+Y+SP G QF Sbjct: 236 LETLNGEWASQRKKRRIVQASDLGTVLPAGWKIVITLLRRAGRASVVCRRYVSPGGHQFE 295 Query: 1086 SWKEASVFL 1112 S KEAS +L Sbjct: 296 SCKEASAYL 304 >ref|XP_006416210.1| hypothetical protein EUTSA_v10007200mg [Eutrema salsugineum] gi|557093981|gb|ESQ34563.1| hypothetical protein EUTSA_v10007200mg [Eutrema salsugineum] Length = 575 Score = 105 bits (263), Expect = 5e-20 Identities = 108/428 (25%), Positives = 185/428 (43%), Gaps = 31/428 (7%) Frame = +3 Query: 60 LDFSTIPVVDLHDFSQDEINVASQCSDVFR-------FDDIIVPKIDYSVFQESSASRKQ 218 L ++P++D SQ E+ S S DD + PKID SVF ES+ SRKQ Sbjct: 106 LSAESLPLIDTRLLSQSELRALSPSSSSSASLAASAGVDDDLTPKIDRSVFNESAGSRKQ 165 Query: 219 TYSKLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTTHPSG 398 T+ ++RL+R + D+S ++E ++ ++ Sbjct: 166 TFLRVRLARDPPPPRPPSPQRRR-----------DDSSREEKSQVASLLR---------- 204 Query: 399 NGSLFMSDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVR----LVSSEQVTDL 566 SLF D + N+ E + V+E QPL +N +V S + V + Sbjct: 205 --SLFSVD-SFQRNAEED--EGEEEVEEKEGQPLISLPIHNNGNVYRNPYFDSVKNVQGI 259 Query: 567 VTAASNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSN-GTTNFVDTVD- 740 + R +K P +G L ++ D + + + ++ RSN GT + D Sbjct: 260 SENETRRRPGRPRKIRNPSDGV-LDSYA---DESEREGTLSVDKTRSNLGTESGYDASGI 315 Query: 741 ---------KAENIQQYHDKLTRENNAPHMANANNTATFSPIF-----KESIFPQLKRKF 878 K ++ D E+ ++ N T + +E + + R+ Sbjct: 316 
SMDSNPGKRKRGRPRKSGDGCKSEDKEEIVSLENREGTMVDLSALANNEEDPYGEELRRI 375 Query: 879 T----TEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIE 1046 T T+ EL F + GEW + KK+K+V A D+ LP+GWKL+L ++KK + Sbjct: 376 TVGLGTKEELLAFLEQVNGEWVNAGKKKKVVKACDYGGYLPRGWKLMLCIKKKGSIQWLA 435 Query: 1047 CRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGT 1226 CR+YISP G +FA+ KE S +L + V + +++N +++ T +P+ E+ Sbjct: 436 CRRYISPDGQEFATCKEVSTYLQS----LVESQSKNRLNSFQSDNHTLGEPVMGNESLVG 491 Query: 1227 LNDGVDMP 1250 +D +D+P Sbjct: 492 NSDSMDLP 499 >ref|XP_004292482.1| PREDICTED: uncharacterized protein LOC101298198 [Fragaria vesca subsp. vesca] Length = 821 Score = 101 bits (251), Expect = 1e-18 Identities = 94/363 (25%), Positives = 157/363 (43%), Gaps = 15/363 (4%) Frame = +3 Query: 405 SLFMSDGALDANSSEINLQAITR--VDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAA 578 SL S+GA+D + + I R +E+ YS + L+S+ +V+ A Sbjct: 38 SLTRSNGAID----HLVVPKIDRSQFNESAGSRRQTYSRVRRRVAGLLSNPKVS-----A 88 Query: 579 SNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQ 758 + D ++ LK F+ S D QI ++P + + + + +++ + + Sbjct: 89 PPAQPDDPERNENQAIIGHLKRFI-SQDPKFDQIDLEPSPMTMKASLSGMAELERRKRKR 147 Query: 759 QYHDKLTRENNAPHM-ANANNTATFSPIFKESIFP---QLKRK---FTTEPELHTFFNNI 917 K + + N N A + S P +L+R+ TE EL F ++ Sbjct: 148 GRKPKAKGSSGGEGLIVNKNGAAVDIWALQNSENPFGDELRRRTLGLETEEELLGFMRDL 207 Query: 918 GGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKE 1097 GG+W S+ KKRKIV+A +F LP GWKLLLG+++K R I CR+YISP G QF S KE Sbjct: 208 GGQWGSRRKKRKIVDATEFGDALPLGWKLLLGLKRKERRAWIYCRRYISPTGQQFLSCKE 267 Query: 1098 ASVFL----STNGALPVGRKDVSQVNLDH--ANSRTHFDPIQKKETHGTLNDGVDMPNAS 1259 + FL S N A + D A H D +K + N G+ + S Sbjct: 268 VASFLESFFSLNNADRHDGDGGENIQEDRIVATENQHADKDGEKRQDVSFNSGILGSSIS 327 Query: 1260 HQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLT 1439 ++ + ++ + + ++ C KC+ T+ ++ S++ HL Sbjct: 328 NE------------QSNEPEKKVSISEMENLAEVQIHNLFECHKCSMTFADKDSYLQHLL 375 Query: 1440 IHH 1448 H Sbjct: 376 SFH 378 >ref|XP_002525855.1| hypothetical protein 
RCOM_0824380 [Ricinus communis] gi|223534860|gb|EEF36549.1| hypothetical protein RCOM_0824380 [Ricinus communis] Length = 697 Score = 98.2 bits (243), Expect = 1e-17 Identities = 63/200 (31%), Positives = 96/200 (48%), Gaps = 4/200 (2%) Frame = +3 Query: 861 QLKRK---FTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNG 1031 +LKR+ E EL FF ++GG+W S+ +KRKIV+A +F LP GWKLLLG+++K G Sbjct: 199 ELKRRTEGMVKEEELLGFFRDLGGQWCSRRRKRKIVDASEFGDFLPFGWKLLLGLKRKEG 258 Query: 1032 RYIIECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKK 1211 + + CR+YISP+G QF S KE S +L + DH+N Sbjct: 259 KAWVYCRRYISPSGQQFISCKEVSAYLQS-----------CLKPYDHSNGNNRQVHRVAS 307 Query: 1212 ETH-GTLNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCK 1388 E H GT D S + ++ D + E +++ + C Sbjct: 308 ENHAGTSGREEDQRQPSEHEKAVSLLGID----NLELAEV-----------QIQDLFECH 352 Query: 1389 KCNATYPNRSSFMGHLTIHH 1448 KCN T+ ++ +++ HL H Sbjct: 353 KCNMTFDDKDTYLQHLLSFH 372 >gb|EMJ00867.1| hypothetical protein PRUPE_ppa001455mg [Prunus persica] Length = 824 Score = 97.8 bits (242), Expect = 1e-17 Identities = 64/194 (32%), Positives = 95/194 (48%), Gaps = 5/194 (2%) Frame = +3 Query: 882 TEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYI 1061 TE +L F +GG+W S+ KKRKIV+A +F LP GWKLLLG+++K GR I CR++I Sbjct: 216 TEEQLLGFMRELGGQWGSRRKKRKIVDANEFGDALPVGWKLLLGLKRKEGRAWIYCRRFI 275 Query: 1062 SPAGPQFASWKEASVFLST-----NGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGT 1226 SP G QF S KE S FL + N P G H + I E + Sbjct: 276 SPTGQQFLSCKEVSSFLHSFFGFNNARQPDG----------HGGENLQEECIMTTENQHS 325 Query: 1227 LNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATY 1406 DG + + L + S + ++ +E S ++G + ++ C KC+ T+ Sbjct: 326 DKDGGRRQYVNSSSAL--VVSTISNEREKEVS--LSGME-NLAEVQIHDLFECHKCSMTF 380 Query: 1407 PNRSSFMGHLTIHH 1448 + S++ HL H Sbjct: 381 GEKDSYLQHLLSFH 394 >ref|XP_006661339.1| PREDICTED: dentin sialophosphoprotein-like, partial [Oryza brachyantha] Length = 1042 Score = 97.1 bits (240), Expect = 2e-17 Identities = 89/336 (26%), Positives = 149/336 (44%), Gaps = 
27/336 (8%) Frame = +3 Query: 885 EPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYIS 1064 E EL F N + G+W S+ ++RK V+A F LP+GWKLLLG+++K I CR+Y+S Sbjct: 136 ESELLGFMNGLEGQWGSRRRRRKFVDASMFGDHLPRGWKLLLGLKRKERVAWINCRRYVS 195 Query: 1065 PAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGVD 1244 P+G QFAS KE S +L + +G + + ++N+ H E H + G Sbjct: 196 PSGQQFASCKEISSYLIS----LLGYVEAKPTAIQNSNAGVH-------ELHTVNSVGHC 244 Query: 1245 MPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMN---CKKCNATYPNR 1415 PN++ + +P S S +R+++ + N C+KCN + ++ Sbjct: 245 QPNSTEEKH----SAPPV--TSVPVSSHYGDPQRQHDKNETQVETNGKECQKCNLIFQDQ 298 Query: 1416 SSFMGH-LTIHHVK---RKKNAEG-IPANTN-SVTSLQVQANGTNNI----PNIMAVQAY 1565 S+++ H L+ H K RK N G + N N + + ++Q + + N+ A + Sbjct: 299 SAYVQHQLSFHQRKAKRRKVNKSGEVGVNKNGTFVTQELQQTSEDKLGHIDHNVAASRNQ 358 Query: 1566 GQNMDNV-------------TMVYDKANNVNSTQIQEDGVNNGKSALHIGNAEDMEKAPG 1706 GQ + V +M + + E G + L G+ D Sbjct: 359 GQTPEKVSDETISGELGGQPSMAPEPVGFRETDGETEQGKESSAGELLSGHCNDSLHNMA 418 Query: 1707 EVVTMSHISS-ETVRLHTENMENSSTNGNIPHDANC 1811 +V S+ E V H EN+ ++ + I HD C Sbjct: 419 DVAEQEKRSAREPVTGHHENLSDNCVDHKI-HDGAC 453 >ref|XP_004957094.1| PREDICTED: uncharacterized protein LOC101759536 [Setaria italica] Length = 1141 Score = 97.1 bits (240), Expect = 2e-17 Identities = 61/200 (30%), Positives = 100/200 (50%), Gaps = 6/200 (3%) Frame = +3 Query: 882 TEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYI 1061 +E EL F N + G+W S+ ++RK V+A F LP+GWKLLLG+++K I CR+Y+ Sbjct: 198 SESELLGFMNALEGQWGSRRRRRKFVDAGMFADHLPRGWKLLLGLKRKERVAWINCRRYV 257 Query: 1062 SPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGV 1241 SP G QFA+ KE S +L + P + +Q+N ++ H + Sbjct: 258 SPKGHQFATCKEVSTYLRSLLGYPEAKPTTTQIN----SAGVH---------------DL 298 Query: 1242 DMPNASHQNQL----KAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNA-PMNCKKCNATY 1406 D+ +A HQ + + + P S G K + + + + P C+KCN T+ Sbjct: 299 DINSAGHQQTISIEQRQLAVPLTSVTLFSHSGDSHGQKLQKDEAQMEVNPKECRKCNLTF 358 Query: 1407 PNRSSFMGH-LTIHHVKRKK 1463 ++ ++M H L+ H K 
K+ Sbjct: 359 HDQGAYMQHQLSFHQRKAKR 378 >ref|NP_001123600.1| LOC100170247 [Zea mays] gi|189514249|gb|ACE07054.1| methylcytosine binding domain protein [Zea mays] gi|414589744|tpg|DAA40315.1| TPA: methylcytosine binding domain protein [Zea mays] Length = 1176 Score = 97.1 bits (240), Expect = 2e-17 Identities = 73/234 (31%), Positives = 111/234 (47%), Gaps = 12/234 (5%) Frame = +3 Query: 882 TEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYI 1061 +E EL F N + G+W S+ ++RK VNA F LP GWKLLLG+++K I CR+Y+ Sbjct: 195 SESELLGFMNALEGQWGSRRRRRKFVNAGMFGDHLPCGWKLLLGLKRKERVAWINCRRYV 254 Query: 1062 SPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGV 1241 SP G QFA+ KE S +L + + SQ+N N+ H + H Sbjct: 255 SPKGHQFATCKEVSSYLLSLLGYQEAKPTASQIN----NAGVHDLHVNSVGLHQQTISIE 310 Query: 1242 DMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSS 1421 + A N + S +G Q+K ++ P +NA C+KCN T+ ++S+ Sbjct: 311 EKQIAVPVNSVALFNS--SGDSHQQK------LQKDEAPIEVNA-KECRKCNLTFHDQSA 361 Query: 1422 FMGH-LTIHHVKRKK-----------NAEGIPANTNSVTSLQVQANGTNNIPNI 1547 +M H L+ H K K+ N +G T TS +V N ++ N+ Sbjct: 362 YMQHQLSFHQRKAKRRRVSKSGELGTNIDGNYEKTQQKTSGEVSGNFGHSAANV 415 >ref|XP_002300183.1| hypothetical protein POPTR_0001s31990g [Populus trichocarpa] gi|222847441|gb|EEE84988.1| hypothetical protein POPTR_0001s31990g [Populus trichocarpa] Length = 837 Score = 96.7 bits (239), Expect = 3e-17 Identities = 62/199 (31%), Positives = 93/199 (46%), Gaps = 3/199 (1%) Frame = +3 Query: 861 QLKRK---FTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNG 1031 +LKR+ E EL FF +GG+W S+ KKRKIV+A +F LP GWKL+LG+++K G Sbjct: 219 ELKRRTEGMEKEEELLGFFRELGGQWCSRRKKRKIVDAGEFGDFLPVGWKLILGLKRKEG 278 Query: 1032 RYIIECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKK 1211 R + CR+Y+SP+G QF S K+ S +L + VG D Q DH Sbjct: 279 RAWVYCRRYLSPSGQQFISCKDVSAYLQS----LVGPYDAQQAK-DHTG----------- 322 Query: 1212 ETHGTLNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKK 1391 H D P+A +L+ D + + + + + C K Sbjct: 323 
--HSIQQDHGGAPHAGAIERLE-----DQRQSIEHQKQVSLLETDNLAEVQIRDLFECHK 375 Query: 1392 CNATYPNRSSFMGHLTIHH 1448 C T+ + +++ HL H Sbjct: 376 CRMTFDEKGTYLEHLLSFH 394 >emb|CBI19167.3| unnamed protein product [Vitis vinifera] Length = 1129 Score = 96.3 bits (238), Expect = 4e-17 Identities = 66/204 (32%), Positives = 101/204 (49%), Gaps = 7/204 (3%) Frame = +3 Query: 858 PQLKRK---FTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKN 1028 P+L+R+ TTE EL F + G+W S+ KKRKIV A DF LPQGWKLLL +++K Sbjct: 171 PELRRRTEGLTTEEELLGFLTGLSGQWGSRRKKRKIVEASDFGDVLPQGWKLLLSMKRKE 230 Query: 1029 GRYIIECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQK 1208 GR + CR+YISP G QF S KE S L + G +D Q N H + + + Sbjct: 231 GRVWLFCRRYISPNGQQFVSCKEVSSCLLSLS----GLQDARQPNYGHNDENSQ---LAH 283 Query: 1209 KETHGTLNDGVDMPNASHQNQLKAICSPDAGKKS----QEKSDIVAGTKRRYNPSSLNAP 1376 + + G G+ + + + ++ L +CS + + EK + + + Sbjct: 284 QISPGNA-AGLTLKDDNSKDGL--VCSSPSTVTTIPTHHEKQATLLNMGNSWE-VKVGEI 339 Query: 1377 MNCKKCNATYPNRSSFMGHLTIHH 1448 + C KC T+ + + HL+ H Sbjct: 340 LKCHKCAMTFDEKDDLLHHLSSSH 363 Score = 69.7 bits (169), Expect = 4e-09 Identities = 39/88 (44%), Positives = 55/88 (62%), Gaps = 5/88 (5%) Frame = +3 Query: 24 SMEAEVKEAEVLLDFSTIPVVDLHDFSQDEINV----ASQCSDVFRFDDIIVPKIDYSVF 191 SM + A L +P++DL SQ E+ +S SD+ R DD+++PKID S+F Sbjct: 2 SMASSSSTASGGLHLEALPLIDLRFLSQSELQALSLTSSHSSDLRRCDDVVIPKIDRSIF 61 Query: 192 QESSASRKQTYSKLRLS-RKQEGAETLP 272 ES+ SRKQTYS+LRL+ RK + A T+P Sbjct: 62 NESAGSRKQTYSRLRLAPRKPDIAATIP 89