BLASTX nr result
ID: Ephedra26_contig00011640
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra26_contig00011640
         (2113 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_004238529.1| PREDICTED: uncharacterized protein LOC101267...   137   1e-29
ref|XP_002284140.2| PREDICTED: uncharacterized protein LOC100248...   137   2e-29
ref|XP_006578321.1| PREDICTED: uncharacterized protein LOC100780...   129   5e-27
ref|XP_006364865.1| PREDICTED: uncharacterized protein LOC102582...   128   9e-27
ref|XP_002513931.1| hypothetical protein RCOM_1035820 [Ricinus c...   122   5e-25
ref|XP_006853188.1| hypothetical protein AMTR_s00038p00204530 [A...   122   6e-25
ref|XP_006581536.1| PREDICTED: uncharacterized protein LOC102665...   121   1e-24
ref|NP_173650.3| methyl-CPG-binding domain-containing protein [A...   115   6e-23
gb|ESW08251.1| hypothetical protein PHAVU_009G031600g [Phaseolus...   114   1e-22
ref|XP_006433971.1| hypothetical protein CICLE_v10000205mg [Citr...   114   1e-22
gb|EOY15980.1| Uncharacterized protein isoform 1 [Theobroma caca...   112   5e-22
ref|XP_006472591.1| PREDICTED: uncharacterized protein LOC102628...   108   7e-21
ref|XP_006416210.1| hypothetical protein EUTSA_v10007200mg [Eutr...   105   6e-20
ref|XP_004292482.1| PREDICTED: uncharacterized protein LOC101298...   101   1e-18
ref|XP_006661339.1| PREDICTED: dentin sialophosphoprotein-like, ...    99   7e-18
ref|XP_004957094.1| PREDICTED: uncharacterized protein LOC101759...    99   1e-17
ref|XP_002525855.1| hypothetical protein RCOM_0824380 [Ricinus c...    98   1e-17
gb|EMJ00867.1| hypothetical protein PRUPE_ppa001455mg [Prunus pe...    98   2e-17
ref|NP_001123600.1| LOC100170247 [Zea mays] gi|189514249|gb|ACE0...
97 3e-17 ref|XP_002300183.1| hypothetical protein POPTR_0001s31990g [Popu... 97 4e-17 >ref|XP_004238529.1| PREDICTED: uncharacterized protein LOC101267888 [Solanum lycopersicum] Length = 1192 Score = 137 bits (346), Expect = 1e-29 Identities = 173/667 (25%), Positives = 263/667 (39%), Gaps = 50/667 (7%) Frame = -3 Query: 2054 LDFSTIPVVDLHDFSQDEINVASQCSDVF----RFDDIIVPKIDYSVFQESSASRKQTYS 1887 L +IP VDL SQ E+ S CS R DD+I+PKID SVF ES+ SRKQTYS Sbjct: 18 LQAESIPTVDLRLLSQSELYSLSLCSPAAFNPCRDDDVIIPKIDRSVFNESAGSRKQTYS 77 Query: 1886 KLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTMHPSGNGS 1707 +LRL+ P A S + R+ N+ HP N S Sbjct: 78 RLRLA---------PAATASASSAIRSRT-----------------PHLRNSPHPLQNPS 111 Query: 1706 LFMSDGALDANSSEINL---QAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAA 1536 ++G ++ SS+I Q + P L +++ + + S V L A Sbjct: 112 --PNNGPANSESSQIVTLLKQLFGSGTQKNPTDLVPIRVDYSDSLSVPSHVPVPGLELAN 169 Query: 1535 SNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQ 1356 S I + +K+ +P++ N N ++ VD V K + Sbjct: 170 VGS-IGQKRKRGRPRK--------NENGVRVAEVK--------------VDEVVKDIVVY 206 Query: 1355 QYHDKLTRENNAPHMANANNTATFSPIFKESIFP---QLKRK---FTTEPELHTFFNNIG 1194 Q D +E + N + + S+ P +L+R+ + EL F + Sbjct: 207 QNVDDSDKE-----IMNKDGIPVDLAVLGASVDPFGLELRRRTEGLGSAEELLGFLGRLN 261 Query: 1193 GEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEA 1014 G+W S KKR+IV+A+DF S LP+ WKLLL +++K GR + CR+YISP G QF + KE Sbjct: 262 GQWGSTRKKRRIVDADDFGSMLPKSWKLLLSIKRKEGRSWLHCRRYISPNGRQFGTCKEV 321 Query: 1013 SVFL-----STNGALPVGRKDVSQVNLDHANSRTHFDPIQ---KKET-----------HG 891 S +L N LP V + +A + T IQ KKE+ HG Sbjct: 322 SSYLLFLRGERNENLPTYVNGSGTVEITNACALTSDLRIQDGGKKESSVFHNSSPAVGHG 381 Query: 890 TLNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIV-------AGTKRRYNPSSL----- 747 L ++ S + K D++ K R S+ Sbjct: 382 ELQVLLNFGELSEVQVGDLLQCDKCNVTFNNKDDLLQHQLFSHQRRKSRNGGQSITDGVI 441 Query: 746 --NAPMNCKKCNATYPNRSSFMGHLTIHHVKRKKNAEG-IPANTNSVTSLQVQANGTNNI 576 + C+ C+ T+ + + GH+ H K+ K +G +P V + Sbjct: 442 
IRDGKFECQFCHKTFEEKHRYNGHVGNHVKKQVKTVDGSLPIKMGGGIEPVVPSGAMLRE 501 Query: 575 PNIMAVQAYGQNM-DNVAMVYDKANN-VNSTQIQEDGVNNGKSALHIGNAEDMEKAPGEV 402 P + +N+ +N ++ D +N +T+IQED + G + G Sbjct: 502 PIMQDSVVLPRNLTENAGVITDAGDNPAPTTKIQEDHMETDNKLEAEGTSNGCHNQEGSS 561 Query: 401 VTMSHISSETVRLHTENMENSSTNGNIPHDANCSSMTDIKSPSNSCSKSF-DEKYQCTVD 225 V+ S ISS + +N P DI +SC S D K+ TVD Sbjct: 562 VSRSPISSNEKTCVDISKVIVGSNIEEPEQEGLLCSNDI---VDSCGVSMEDGKFFPTVD 618 Query: 224 IGVPESG 204 E+G Sbjct: 619 ESKVENG 625 >ref|XP_002284140.2| PREDICTED: uncharacterized protein LOC100248904 [Vitis vinifera] Length = 947 Score = 137 bits (345), Expect = 2e-29 Identities = 132/477 (27%), Positives = 204/477 (42%), Gaps = 14/477 (2%) Frame = -3 Query: 2054 LDFSTIPVVDLHDFSQDEINV----ASQCSDVFRFDDIIVPKIDYSVFQESSASRKQTYS 1887 L +P++DL SQ E+ +S SD+ R DD+++PKID S+F ES+ SRKQTYS Sbjct: 12 LHLEALPLIDLRFLSQSELQALSLTSSHSSDLRRCDDVVIPKIDRSIFNESAGSRKQTYS 71 Query: 1886 KLRLS-RKQEGAETLPGYK--AGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTMHPSG 1716 +LRL+ RK + A T+P + H + VDE NT+ Sbjct: 72 RLRLAPRKPDIAATIPRRPRFSPHLNQKAALEPVDEE----------------NTLIIGL 115 Query: 1715 NGSLFMSDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAA 1536 LF ++ T D+ P + Y NE ++ + + V D Sbjct: 116 LKGLFATE---------------THADDLIPVQVE-YRESSNEILQNIPIDVVADS---- 155 Query: 1535 SNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQ 1356 R +K+ +PK + + N + I + +NG VD A Sbjct: 156 -----GRKRKRGRPKSEKTIAVYQNGGSGEGGGMGI----INNNGVV-----VDVAA--- 198 Query: 1355 QYHDKLTRENNAPHMANANNTATFSPIFKESIFPQLKRK---FTTEPELHTFFNNIGGEW 1185 +ANA ++ P+L+R+ TTE EL F + G+W Sbjct: 199 --------------LANA----------EDPFGPELRRRTEGLTTEEELLGFLTGLSGQW 234 Query: 1184 ASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVF 1005 S+ KKRKIV A DF LPQGWKLLL +++K GR + CR+YISP G QF S KE S Sbjct: 235 GSRRKKRKIVEASDFGDVLPQGWKLLLSMKRKEGRVWLFCRRYISPNGQQFVSCKEVSSC 294 Query: 1004 LSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGVDMPNASHQNQLKAICS 825 L + G +D Q N H + + + + + G G+ + + + ++ L +CS Sbjct: 295 
LLSLS----GLQDARQPNYGHNDENSQ---LAHQISPGNA-AGLTLKDDNSKDGL--VCS 344 Query: 824 PDAGKKS----QEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIHH 666 + + EK + + + + C KC T+ + + HL+ H Sbjct: 345 SPSTVTTIPTHHEKQATLLNMGNSWE-VKVGEILKCHKCAMTFDEKDDLLHHLSSSH 400 >ref|XP_006578321.1| PREDICTED: uncharacterized protein LOC100780637 isoform X1 [Glycine max] gi|571450041|ref|XP_006578322.1| PREDICTED: uncharacterized protein LOC100780637 isoform X2 [Glycine max] Length = 863 Score = 129 bits (324), Expect = 5e-27 Identities = 118/458 (25%), Positives = 199/458 (43%), Gaps = 14/458 (3%) Frame = -3 Query: 2042 TIPVVDLHDFSQDEINV-----ASQCSDVFRFDDIIVPKIDYSVFQESSASRKQTYSKLR 1878 ++P+VDL SQ E+ A+ C DD ++PKID S F ES+ SRKQTYSKLR Sbjct: 18 SLPLVDLRLLSQPELYTLSLSGATHCHRRNSDDDSVIPKIDRSNFNESAGSRKQTYSKLR 77 Query: 1877 LSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTMHPSGNGSLFM 1698 L+++++ +P + H L + E ++E RI+ +Q+ LF Sbjct: 78 LNKRKQNP-AVPASSSFHIPLH-----ISEPEEEENSRIVALLQQ------------LFG 119 Query: 1697 SDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAASNSAID 1518 + +A ++ + + V + QP +++A N + +V+ Sbjct: 120 VEPLRNAPRNDAAERRLVPVQVDFKQPPPMFAAFQNVPIDVVADSS-------------Q 166 Query: 1517 RSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQQYHDKL 1338 R +K+ +P++ + N K N T FV+ K Sbjct: 167 RKRKRGRPRK--------DENSVTVFVEEPKKVTKEENSVTVFVEEPKKVNG-------- 210 Query: 1337 TRENNAPHMANANNTATFSP---IFKESIFPQLKRK---FTTEPELHTFFNNIGGEWASK 1176 N + A A T T + + ++ +LKR+ TEP++ F + GEWAS+ Sbjct: 211 ---NGEVNAAVATTTTTVNETVGLDEDPFEVELKRRTQGLETEPQVVEFLETLNGEWASQ 267 Query: 1175 LKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVFLST 996 KKR+IV A + LP GWK+++ ++ GR CR+Y+SP G QF S KEAS +L + Sbjct: 268 RKKRRIVPASELGDLLPAGWKIVIITMRRAGRASAVCRRYVSPDGHQFESCKEASAYLLS 327 Query: 995 NGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGVDMPNASHQNQLKAICSPDA 816 G +D S + +++ + + ++ +P + A P A Sbjct: 328 ----VFGVQDRSHLKSSYSDGAQQLSSSMNRASESSVG---HVPTGDMKTDASASYLPSA 380 Query: 815 G---KKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNAT 711 G S EK ++ + N +S + + CK 
+AT Sbjct: 381 GAPIHSSHEKQPPISSSIGSENFNS-DLALGCKLGDAT 417 >ref|XP_006364865.1| PREDICTED: uncharacterized protein LOC102582612 isoform X2 [Solanum tuberosum] Length = 1193 Score = 128 bits (322), Expect = 9e-27 Identities = 135/490 (27%), Positives = 208/490 (42%), Gaps = 18/490 (3%) Frame = -3 Query: 2054 LDFSTIPVVDLHDFSQDEINVASQCSDVF----RFDDIIVPKIDYSVFQESSASRKQTYS 1887 L +IP VDL SQ E+ S CS R DD+I+PKID SVF ES+ SRKQTYS Sbjct: 18 LQAESIPTVDLRLLSQSELYSLSLCSTAAFNPCRDDDVIIPKIDRSVFNESAGSRKQTYS 77 Query: 1886 KLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTMHPSGNGS 1707 +LRL+ A S S RS N+ HP N S Sbjct: 78 RLRLAPAA----------AASASSSAIRSRTPHLR---------------NSPHPLQNPS 112 Query: 1706 LFMSDGALDANSSEINL---QAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAA 1536 ++G ++ SS+I + Q + P L +++ + + S V L A Sbjct: 113 --PNNGPANSESSQIVILLKQLFGSGTQKNPTDLVPIRVDYSDSLSVPSHVPVPGLELAN 170 Query: 1535 SNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQ 1356 S + + +K+ +P++ N+ +K +E+ V + +N+ Sbjct: 171 VGS-VGQKRKRGRPRK----------NENGVRVAEVKVDEV--------VKDIVVYQNVD 211 Query: 1355 QYHDKLTRENNAPHMANANNTATFSPIFKESIFPQLKRK---FTTEPELHTFFNNIGGEW 1185 ++ ++ P + A A P E L+R+ + EL F + G+W Sbjct: 212 DSDKEIMNKDGIP-VDLAVLGALVDPFGLE-----LRRRTEGLGSAEELLGFLGRLNGQW 265 Query: 1184 ASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVF 1005 S KKR+IV+A++F S LP+ WKLLL +++K GR + CR+YISP G QF + KE S + Sbjct: 266 GSTRKKRRIVDADEFGSVLPKSWKLLLSIKRKEGRSWLHCRRYISPNGRQFGTCKEVSSY 325 Query: 1004 L-----STNGALPVGRKDVSQVNLDHANSRTHFDPIQ---KKETHGTLNDGVDMPNASHQ 849 L LP V + +A + T IQ KKE+ N P H Sbjct: 326 LLFLHGERKENLPAYANGSGTVEITNACALTSDLRIQDGGKKESSVFHNSS---PAVGH- 381 Query: 848 NQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIH 669 +L+ + + E S++ G ++C KCN T+ N+ + H Sbjct: 382 GELQVLVN------FGELSEVQVGDL-----------LHCDKCNVTFNNKDDLLQHQLFS 424 Query: 668 HVKRKKNAEG 639 H +R+ G Sbjct: 425 HQRRRSRNGG 434 >ref|XP_002513931.1| hypothetical protein RCOM_1035820 [Ricinus communis] gi|223547017|gb|EEF48514.1| hypothetical 
protein RCOM_1035820 [Ricinus communis] Length = 1337 Score = 122 bits (307), Expect = 5e-25 Identities = 136/506 (26%), Positives = 209/506 (41%), Gaps = 38/506 (7%) Frame = -3 Query: 2054 LDFSTIPVVDLHDFSQDEINVASQCSDVFRFD------DIIVPKIDYSVFQESSASRKQT 1893 L ++P++DL SQ E+ S CS F + D+ KID SVF ES+ SRKQT Sbjct: 24 LQMESLPLIDLRLLSQSELLSLSLCSFSFLNNPLQNEADVATLKIDRSVFNESAGSRKQT 83 Query: 1892 YSKLRLSRKQEGAETLPGYKAGHFSLSKCRSM-----VDESGKQEAQRILQFIQERLNTM 1728 +S+LRL+R+ HFS R+ V+ S +E +I+ I+ Sbjct: 84 FSRLRLARRNNNNS--------HFSTPSIRNQIPHQTVEISQDEENSQIIYLIK------ 129 Query: 1727 HPSGNGSLFMSDGALDANSSEINLQAITRVDENCPQPLAV---YSALHNEDVRLVSSEQV 1557 SLF S+ + ++E++ + D P+ + AL + V S E Sbjct: 130 ------SLFGSNFENEKENNEVDNVNLFSDDNLISVPITYNESFQALQDLAVADYSDETK 183 Query: 1556 TDLVTAASNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTV 1377 + TA ++S K+K + + L F+ +N+ + + EE T D+ Sbjct: 184 QAIATAITHSESTAEKRK-RGRPRKNLSDFVGNNNVDGNDNGNEKEEKEETAIT---DSK 239 Query: 1376 DKAENIQQYHDKLTRENNAPHMANANNTATF-SPIFKES----------------IFPQL 1248 K Q+ L NN AN A +P +E +L Sbjct: 240 RKRGRPQKDASTLGCHNNNNVNANEEKRAVCENPRTQEEEKRGMKVELGSSEEDPYAEEL 299 Query: 1247 KRK---FTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRY 1077 +R+ TE EL F + GEW SK KKRKIV+A LP+ WKL+L +++ G + Sbjct: 300 RRRTMGMQTESELLGFLEGLQGEWMSKRKKRKIVDASVLGDVLPRNWKLILCNKRRAGFF 359 Query: 1076 IIECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKET 897 ++C YISP G QF S KE S + L + VSQ + H +S + Sbjct: 360 WLDCTGYISPNGQQFMSCKEVS-----SNLLSKELQGVSQSSFGHDDSNI--------QL 406 Query: 896 HGTLNDG--VDMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNA--PMNC 729 GT++ G D+ +++N I SP + + A T P + NC Sbjct: 407 TGTVSYGNAADLTLKNNKNGGGFISSPALPVTKSVEHEKQATTLAAVVPPHVQTVEKYNC 466 Query: 728 KKCNATYPNRSSFMGHLTIHHVKRKK 651 KC + + HL H + K Sbjct: 467 HKCTMAFQEPDDLLQHLLSSHQRAPK 492 >ref|XP_006853188.1| hypothetical protein AMTR_s00038p00204530 [Amborella trichopoda] gi|548856827|gb|ERN14655.1| hypothetical protein AMTR_s00038p00204530 [Amborella trichopoda] Length = 826 Score 
= 122 bits (306), Expect = 6e-25 Identities = 119/443 (26%), Positives = 200/443 (45%), Gaps = 35/443 (7%) Frame = -3 Query: 1526 AIDRSKKKLKPKEGARLKAFMN-----SNDAAAHQIPIKPEELRSNGTTNFVDTVDKAEN 1362 A+ R K+++ KE AR K M+ + D A + +NG+++F T Sbjct: 260 AVIRQKRRVSKKEDARRKGLMSLAVLENGDRGA---------IDNNGSSDFNQTGIGC-- 308 Query: 1361 IQQYHDKLTRENNAPHMANANNTATFSPIFKESIFPQLKRK---FTTEPELHTFFNNIGG 1191 H + +N M + +E P LK++ E EL F + +GG Sbjct: 309 ----HGNVRNGDNKEKMLQNGFVEVHALASRELFVPHLKKRTAALENELELVEFLDGLGG 364 Query: 1190 EWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEAS 1011 EW +K KKRK+V+A DF GLP GWK++LG+RKK G+ I+CRKYISP G +FA+ KE + Sbjct: 365 EWVTKRKKRKMVDASDFGDGLPDGWKVILGIRKKEGKLFIDCRKYISPTGQKFATCKEVT 424 Query: 1010 VFLST---NGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGVDMPNASHQNQL 840 L + +G+L V + + N+ + RT TH ++ V P + + + Sbjct: 425 AHLLSEPQDGSLAVSAR--IEENMSGNSMRTRI----SGATHSSMK--VPAPQ-TKEPKC 475 Query: 839 KAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIHHVK 660 S D+GK+ I+ + + NP L + C+KCN + ++ +M HL H + Sbjct: 476 NGSISKDSGKQ------II--SHQVDNPIKLT--LECRKCNLNFNSKEVYMHHLLAVHQR 525 Query: 659 RKKN-------AEGIPANTNS-VTSLQVQANGTNNIPN---IMAVQAYGQNMD---NVAM 522 + K EG+ V + + G + N + V+ Y ++++ + AM Sbjct: 526 KSKRCRLGKSLGEGVLIEDGKYVCQICHKVFGEKHRYNGHVGVHVRNYFKSLEASQDQAM 585 Query: 521 VYDKANNVNSTQIQEDGVNNGK------SALHIGNAEDMEKAPGEVVTMSHISSETV--- 369 + DK +S + + +++GK S GN++ M +S S E Sbjct: 586 I-DKPIAASSLDVGKPQISDGKQENSSESIEGDGNSDRMPSEDNLGALLSKSSDEPCDDL 644 Query: 368 -RLHTENMENSSTNGNIPHDANC 303 T+N++ S ++ D NC Sbjct: 645 KMATTDNLKKISEKSDVDSDENC 667 Score = 64.3 bits (155), Expect = 2e-07 Identities = 36/68 (52%), Positives = 46/68 (67%), Gaps = 3/68 (4%) Frame = -3 Query: 2054 LDFSTIPVVDLHDFSQDEIN---VASQCSDVFRFDDIIVPKIDYSVFQESSASRKQTYSK 1884 L S+IP++DL SQDEI+ + S S DI+VPKID S+F ES SRKQTYS+ Sbjct: 14 LPISSIPLIDLRFLSQDEISSLALLSLPSSNPPLTDIVVPKIDRSIFNESQGSRKQTYSR 73 Query: 1883 LRLSRKQE 1860 LRLS K++ Sbjct: 74 LRLSHKKQ 81 >ref|XP_006581536.1| PREDICTED: uncharacterized 
protein LOC102665295 [Glycine max] Length = 871 Score = 121 bits (304), Expect = 1e-24 Identities = 127/455 (27%), Positives = 206/455 (45%), Gaps = 11/455 (2%) Frame = -3 Query: 2042 TIPVVDLHDFSQDEINVASQCSDVFRF-----DDIIVPKIDYSVFQESSASRKQTYSKLR 1878 ++P+VDL SQ E+ S R DD ++PKID S F ES+ SRKQTYSKLR Sbjct: 16 SLPLVDLRLLSQPELYTLSLSGATHRHRRNSDDDSVIPKIDRSNFNESAGSRKQTYSKLR 75 Query: 1877 LSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTMHPSGNGSLFM 1698 L+++++ +P + H L + E ++E RI+ + + L + P N + Sbjct: 76 LNKRKQNP-AVPASSSFHIPLH-----ISEPEEEENSRIIALLHQ-LFGVEPLRNNAPRN 128 Query: 1697 SDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAASNSAID 1518 +D + E L + V+ P P++V + N + D+V S Sbjct: 129 ND------APERRLVPV-HVEFKQPPPISV-ALFQNVPI---------DVVPDGSQ---- 167 Query: 1517 RSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQQYHDKL 1338 R +K+ +P++ NS + P K + N T FV+ K N ++ Sbjct: 168 RKRKRGRPRKDE------NSVTVFVEE-PTKVTK-EENSLTVFVEEPKKVTNEEK--SVK 217 Query: 1337 TRENNAPHMANANNTATFSPIFKESIFP-QLKRK---FTTEPELHTFFNNIGGEWASKLK 1170 N + A A T S E +F +LKR+ TE ++ F + GEWAS+ K Sbjct: 218 VNGNGEGNAAVATATVNESVGLDEDLFEVELKRRAQGLETESQVMEFLETLNGEWASQRK 277 Query: 1169 KRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVFLSTNG 990 KR+IV A + LP GWK+++ V ++ GR CR+Y+SP G QF S KEAS +L + Sbjct: 278 KRRIVPATELGDMLPAGWKIVIIVMRRAGRASAVCRRYVSPDGHQFESCKEASAYLLSVS 337 Query: 989 ALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLN--DGVDMPNASHQNQLKAICSPDA 816 G +D S + + + + + ++ DM ++ + L + +P Sbjct: 338 ----GVQDRSHLKSSYTDGAQQLSSSMNRASESSVGHVPTGDMKTVANASYLSSAGAPI- 392 Query: 815 GKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNAT 711 S EK +V+ + N S + + CK +AT Sbjct: 393 -DSSHEKQPLVSSSIGSENFIS-DLALGCKLGDAT 425 >ref|NP_173650.3| methyl-CPG-binding domain-containing protein [Arabidopsis thaliana] gi|75174757|sp|Q9LME6.1|MBD8_ARATH RecName: Full=Methyl-CpG-binding domain-containing protein 8; Short=AtMBD8; Short=MBD08; AltName: Full=Methyl-CpG-binding protein MBD8 gi|9392683|gb|AAF87260.1|AC068562_7 Contains a Methyl-CpG binding domain PF|01429 
and two DNA binding domains with preference for A/T rich regions PF|02178. ESTs gb|AI998776, gb|N95984 come from this gene [Arabidopsis thaliana] gi|26452716|dbj|BAC43440.1| unknown protein [Arabidopsis thaliana] gi|332192108|gb|AEE30229.1| methyl-CPG-binding domain-containing protein [Arabidopsis thaliana] Length = 524 Score = 115 bits (289), Expect = 6e-23 Identities = 100/393 (25%), Positives = 166/393 (42%), Gaps = 42/393 (10%) Frame = -3 Query: 2054 LDFSTIPVVDLHDFSQDEINVASQCSDVFRF-----------DDIIVPKIDYSVFQESSA 1908 L ++P++D SQ E+ SQCS + DD + PKID SVF ES+ Sbjct: 21 LSAESLPLIDTRLLSQSELRALSQCSSLSPSSSASLAASAGGDDDLTPKIDRSVFNESAG 80 Query: 1907 SRKQTYSKLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNT- 1731 SRKQT+ +LRL+R + E P + D+S ++E ++ ++ N Sbjct: 81 SRKQTFLRLRLARHPQPPEEPPSPQRQR----------DDSSREEQTQVASLLRSLFNVD 130 Query: 1730 ------MHPSGNGSLFMSDGALDANS---SEINLQAI----------TRVDENCPQPLAV 1608 G L ++G + NS NL +I ++ +P + Sbjct: 131 SNQSKEEEDEGEEELEDNEGQIHYNSYVYQRPNLDSIQNVLIQGTSGNKIKRKRGRPRKI 190 Query: 1607 YSALHNEDVRLVSSEQVT-----------DLVTAASNSAIDRSKKKLKPKEGARLKAFMN 1461 + +V ++ E T +V+ +S I +K K G K Sbjct: 191 RNPSEENEVLDLTGEASTYVFVDKTSSNLGMVSRVGSSGISLDSNSVKRKRGRPPK---- 246 Query: 1460 SNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQQYHDKLTRENNAPHMANANNTATFS 1281 ++ I E R + N + DK E + + EN + + + A+ S Sbjct: 247 ------NKEEIMNLEKRDSAIVN-ISAFDKEELV------VNLENREGTIVDLSALASVS 293 Query: 1280 PIFKESIFPQLKRKFTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLG 1101 E ++ T+ E+ F + GEW + KK+K+VNA D+ LP+GW+L+L Sbjct: 294 EDPYEEELRRITVGLKTKEEILGFLEQLNGEWVNIGKKKKVVNACDYGGYLPRGWRLMLY 353 Query: 1100 VRKKNGRYIIECRKYISPAGPQFASWKEASVFL 1002 +++K ++ CR+YISP G QF + KE S +L Sbjct: 354 IKRKGSNLLLACRRYISPDGQQFETCKEVSTYL 386 >gb|ESW08251.1| hypothetical protein PHAVU_009G031600g [Phaseolus vulgaris] Length = 841 Score = 114 bits (286), Expect = 1e-22 Identities = 109/369 (29%), Positives = 168/369 (45%), Gaps = 8/369 (2%) Frame = -3 Query: 2084 EAEVKEAEVLLDFSTIPVVDLHDFSQDEINVASQCSDVFRF-----DDIIVPKIDYSVFQ 
1920 EAEV+ + +D ++P+VDL SQ E+ S R +D +VPKID S F Sbjct: 5 EAEVEPSSDHID--SLPLVDLRLLSQPELYTLSLSGATHRHRRANDNDSVVPKIDRSNFN 62 Query: 1919 ESSASRKQTYSKLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQER 1740 ES+ SRKQTYSKLRL+++++ +P + H + E QE +I+ +Q + Sbjct: 63 ESAGSRKQTYSKLRLNKRKQNF-AVPASSSFH---------IPEPVDQENSQIISLLQ-Q 111 Query: 1739 LNTMHPSGNGSLFMSDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVRLVSSEQ 1560 L + P N AL + + + V QP V V+ + Sbjct: 112 LFGVEPLRN--------ALRPDCGDAANHQLFPVHVEFKQPPPV----------TVTFQT 153 Query: 1559 VTDLVTAASNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDT 1380 V V ASN R +K+ +P++ L + ++ G + Sbjct: 154 VPIDVIDASN----RKRKRGRPRKNENLVSVFEEETKKVNE-----------GRSAVATV 198 Query: 1379 VDKAENIQQYHDKLTRENNAPHMANANNTATFSPIFKESIFPQLKRK---FTTEPELHTF 1209 +++ + D L +N P F E +LKR+ TEP+L F Sbjct: 199 IERGFGVDA--DGL---DNDP--------------FGE----ELKRRTAGLETEPQLLEF 235 Query: 1208 FNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFA 1029 + GEWAS+ KKR+IV A D + LP GWK+++ + ++ GR + CR+Y+SP G QF Sbjct: 236 LETLNGEWASQRKKRRIVQASDLGTVLPAGWKIVITLLRRAGRASVVCRRYVSPGGHQFE 295 Query: 1028 SWKEASVFL 1002 S KEAS +L Sbjct: 296 SCKEASAYL 304 >ref|XP_006433971.1| hypothetical protein CICLE_v10000205mg [Citrus clementina] gi|557536093|gb|ESR47211.1| hypothetical protein CICLE_v10000205mg [Citrus clementina] Length = 919 Score = 114 bits (286), Expect = 1e-22 Identities = 172/729 (23%), Positives = 297/729 (40%), Gaps = 77/729 (10%) Frame = -3 Query: 2054 LDFSTIPVVDLHDFSQDEINVASQCSDVFRF---------DDIIVPKIDYSVFQESSASR 1902 L + ++P++DL +Q E+ S CS D++ PKID SVF ES+ SR Sbjct: 12 LHYDSLPLIDLRLLAQSELLSLSLCSSRVSTTTSSQNEDEDEVSTPKIDRSVFNESAGSR 71 Query: 1901 KQTYSKLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTMHP 1722 KQT+S+LRL+ + + +P ++ ++ ++ DE Q I+ ++ N Sbjct: 72 KQTFSRLRLAPRN--SPQIPPQIP--YTAARAETL-DEDNPQ----IVGLLESLFNIQ-- 120 Query: 1721 SGNGSLFMSDGAL-------DANSSEINLQAITRVDENC---PQPLAVYSALHNEDVR-- 1578 S + S ++D L A +++N+ VDEN P + YSA + R Sbjct: 121 
SHSSSTIVNDQQLVPVQVEYKAYLNDVNVNV--NVDENLHDVPISVVTYSARKRKRGRPR 178 Query: 1577 -----------LVSSEQVTDLVTAASNSAIDR---------SKKKLKPKEGA------RL 1476 + SE ++V+ +S + D +K+ +P++ ++ Sbjct: 179 KDEMTSSDNWWFIESENKVNVVSKSSLNITDNVNVVPCKIGKRKRGRPRKSENRNNNFKV 238 Query: 1475 KAFMNSNDAAAHQIPIKPEELRS-NGTTNFVDTVDKAENIQQYHDKLTRENNAPHMANAN 1299 A S + P +P + NG + + +E+ + ++ EN N Sbjct: 239 NAVSESAPNVGKRGPGRPRKGEGKNGDKSVKKEIVVSESKEDLVNEALMENGDGIAVNLV 298 Query: 1298 NTATFSPIFKESIFPQLKRKFTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQG 1119 A F E + + E EL F + G W S KKRKIV+A +F LP+G Sbjct: 299 ALANTEDPFGEELRRRTGGSEKRE-ELLGFLTGLKGVWVSYRKKRKIVDASEFGDVLPRG 357 Query: 1118 WKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHA 939 WKL+L ++KK G + CR+YISP G QF S KE S +L + G K SQ + H Sbjct: 358 WKLMLCIKKKVGHMWLGCRRYISPNGRQFVSCKEVSSYLLSLS----GHKVASQPSAAHT 413 Query: 938 -------NSRTH---FDPIQKKETHGT-----LNDGVDMPNASHQNQ--LKAICSP--DA 816 N T DPI K + +G L + H+ Q L I SP D Sbjct: 414 GDCIQLDNKMTFGNAVDPILKDDKNGADLVFHLPFPASSVSTGHEKQATLPKIMSPGEDK 473 Query: 815 GKKSQEKSDIVAG-TKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIHHVKRKKNAEG 639 G+++ K V+ T + + + K + ++ ++ H H + Sbjct: 474 GQENCNKKYSVSNITDEKVEKMNAATEVTAAKLDVSFGAKAVMCNHQNNKHFGSCSERD- 532 Query: 638 IPANTNSVTSLQVQANGTNNIPNIMAVQAYGQNMDNVAMVYDKANNVNSTQIQED-GVNN 462 +P NT S ++ +G + + + + + G VY + +I +D G + Sbjct: 533 VPKNTISSSN---NMSGQDQVFQPLILDSSGNG------VYFSSVEKQKQEIGDDSGFVS 583 Query: 461 GKSALHIGNAEDMEKAPGEVVTMSHISSETVRLHTENME-NSSTNGNIPHDANCSSMTDI 285 + I + +++EK + S E +++ + E N + G++ CS + D Sbjct: 584 PNAKDEISSCQNLEKG------LFTSSMEHMKVDVDKCERNEAIAGSV---YGCSRLVDT 634 Query: 284 ------KSPSNSCSKSFD-EKYQCTVDIGVPESGDEQKSEKPNLFNFTSKNISADNNEQA 126 + CS + +C V +SG + SE L F S+ I +N Sbjct: 635 MTYEKGRGSFEGCSVVLSGSELKCGSMNAVNKSGRPEDSEDGLLNLFGSEKIFGFDNNLT 694 Query: 125 SISNPFLEL 99 +S +E+ Sbjct: 695 KVSVDKMEV 703 >gb|EOY15980.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508724084|gb|EOY15981.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1203 
Score = 112 bits (281), Expect = 5e-22 Identities = 138/525 (26%), Positives = 207/525 (39%), Gaps = 53/525 (10%) Frame = -3 Query: 2054 LDFSTIPVVDLHDFSQDEINVASQCSDV----FRFDDIIVPKIDYSVFQESSASRKQTYS 1887 L +IPVVDL SQ E+ S CS ++ PKID SVF ES+ SRKQT+S Sbjct: 14 LHLESIPVVDLRLISQPELLSLSLCSSSPSPSNADTELFTPKIDRSVFNESAGSRKQTFS 73 Query: 1886 KLRLSRKQEGA----ETLPGYKAGHFSLSKCRSMVDESG-KQEAQRILQFIQERLNTMHP 1722 +LRL+ + + P K SLS+ + V+ +E+ IL ++ Sbjct: 74 RLRLAAPRNHLPHPHHSSPSSKP-FTSLSQRLNPVNPGPLDEESSNILSLLK-------- 124 Query: 1721 SGNGSLFMSDGALDANSSEINLQAITRVDENCPQPLAVY---------SALHNEDVRLVS 1569 SLF D +L +N++E D+ P+ + S L N V +VS Sbjct: 125 ----SLFNIDDSLTSNTNEDEPD-----DDKDLVPVQIEYENGKDNGNSVLQNIPVGIVS 175 Query: 1568 ------------SEQVTDLVTAASNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIK 1425 +Q +L+ + N I+ + E A S +A I Sbjct: 176 CSGSKRKRGRPRKDQKDNLLIESENLVIEEHQ------ETAAFDRVSESVNAGG--ISSC 227 Query: 1424 PEELRSNGTTNFVDTVDKAENIQQYHDKLTRENNAPHMANANNTATFSPIFKESIFPQLK 1245 E R G ++++N ++ E+ +A N A I +L+ Sbjct: 228 SERKRKRGRPR----KEESQNRVIVSEEKKVESEIERVALGNVEAILG------IEEELR 277 Query: 1244 RK---FTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYI 1074 R+ TE EL F + GEWASK +K++IV+A F + LPQGWKL+L V+K+ G Sbjct: 278 RRTEAIGTEAELLEFMGGLEGEWASKSQKKRIVDAAGFGNVLPQGWKLMLFVKKRAGHVW 337 Query: 1073 IECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANS-----RTHFDPIQ 909 + C +YISP G QF S KE S L + G L + S + S +F I Sbjct: 338 LACSRYISPNGQQFVSCKEVSSCLLSAGELKDSSQSTSSLTGRGIGSGVKPTSENFPIIC 397 Query: 908 KKETHGTLNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIVA---------------GT 774 H + M + + + I ++ D + GT Sbjct: 398 TSSEHERQAPLLRMGSPWEVQRAETIKCHKCTMTFNQQDDFICHLLSSHQGTVKSSGHGT 457 Query: 773 KRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIHHVKRKKNAEG 639 N C+ C + RS + HL +H K EG Sbjct: 458 STNEEVIIKNGKYECQFCYELFEERSCYSSHLGVHMKNNTKKVEG 502 >ref|XP_006472591.1| PREDICTED: uncharacterized protein LOC102628030 [Citrus sinensis] Length = 917 Score = 108 bits (271), Expect = 7e-21 Identities = 174/731 (23%), Positives = 296/731 (40%), Gaps = 79/731 
(10%) Frame = -3 Query: 2054 LDFSTIPVVDLHDFSQDEINVASQCSDVFRF---------DDIIVPKIDYSVFQESSASR 1902 L + ++P++DL +Q E+ S CS D++ PKID SVF ES+ SR Sbjct: 12 LHYDSLPLIDLRLLAQSELLSLSLCSSRVSTTTSSQNEDEDEVSTPKIDRSVFNESAGSR 71 Query: 1901 KQTYSKLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTMHP 1722 KQT+S+LRL+ + + +P ++ ++ ++ DE Q I+ ++ N Sbjct: 72 KQTFSRLRLAPRN--SPQIPPQIP--YTAARAETL-DEDNPQ----IVGLLESLFNIQ-- 120 Query: 1721 SGNGSLFMSDGAL-------DANSSEINLQAITRVDENC---PQPLAVYSALHNEDVR-- 1578 S + S ++D L A +++N+ VDE+ P + YSA + R Sbjct: 121 SHSSSTIVNDQQLVPVQVEYKAYLNDVNVN----VDEDLHDVPISVVTYSARKRKRGRPR 176 Query: 1577 -----------LVSSEQVTDLVTAASNSAIDRSK----KKLKPKEGARLKAFMNSNDAAA 1443 + SE ++V+ +S + D K K K G K+ +N+ Sbjct: 177 KDEMTSSDNWWFIESENKVNVVSKSSLNITDNVNVVPCKTGKRKRGRPRKSENGNNNFKV 236 Query: 1442 HQI-----------PIKPEELRS-NGTTNFVDTVDKAENIQQYHDKLTRENNAPHMANAN 1299 + + P +P + NG + + +E+ + ++ E+ N Sbjct: 237 NAVSESAPNVGKRGPGRPRKGEGKNGDKSVKKEIVVSESKEDLVNEALMEDRDGIAVNLV 296 Query: 1298 NTATFSPIFKESIFPQLKRKFTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQG 1119 A F E + + E EL F + G W S KKRKIV+A +F LP+G Sbjct: 297 ALANTEDPFGEELRRRTGGSEKRE-ELLGFLTGLKGVWVSYRKKRKIVDASEFGDVLPRG 355 Query: 1118 WKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHA 939 WKL+L ++KK G + CR+YISP G QF S KE S +L + G K SQ + H Sbjct: 356 WKLMLCIKKKVGHMWLGCRRYISPNGRQFVSCKEVSSYLLSLS----GHKVASQPSAAHT 411 Query: 938 -------NSRTH---FDPIQKKETHGT-----LNDGVDMPNASHQNQ--LKAICSP--DA 816 N T DPI K + +G L + H+ Q L I SP D Sbjct: 412 GDCIQLDNKMTFGNAVDPILKDDKNGADLVFHLPFPASSVSTGHEKQATLPKIMSPGEDE 471 Query: 815 GKKSQEKSDIVAG-TKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIHHVKRKKNAEG 639 G+++ K V+ T + + + K + ++ ++ H H + Sbjct: 472 GQENCNKKYSVSNITDEKVEKMNAATEVTAAKLDVSFCAKAVMCNHQNNKHFGSCSERD- 530 Query: 638 IPANTNSVTSLQVQANGTNNI--PNIMAVQAYGQNMDNVAMVYDKANNVNSTQIQED-GV 468 +P NT S ++ +G + + P I+ G VY + +I +D G Sbjct: 531 VPKNTISSSN---NMSGQDQVFQPQILDSSGNG--------VYFSSVEKQKQEIGDDSGF 579 Query: 467 
NNGKSALHIGNAEDMEKAPGEVVTMSHISSETVRLHTENME-NSSTNGNIPHDANCSSMT 291 + + I + +++EK + S E +++ + E N + G++ CS + Sbjct: 580 VSPNAKDEISSCQNLEKG------LFTSSMEHMKVDVDKCERNEAIAGSV---YGCSRLV 630 Query: 290 DI------KSPSNSCSKSFD-EKYQCTVDIGVPESGDEQKSEKPNLFNFTSKNISADNNE 132 D + CS + +C V +SG + SE L F S+ I +N Sbjct: 631 DTMTYEKGRGSFEGCSVVLSGSELKCGSMNAVNKSGRPEDSEDGLLNLFGSEKIFGFDNN 690 Query: 131 QASISNPFLEL 99 +S +E+ Sbjct: 691 LTKVSVDKMEV 701 >ref|XP_006416210.1| hypothetical protein EUTSA_v10007200mg [Eutrema salsugineum] gi|557093981|gb|ESQ34563.1| hypothetical protein EUTSA_v10007200mg [Eutrema salsugineum] Length = 575 Score = 105 bits (263), Expect = 6e-20 Identities = 108/428 (25%), Positives = 185/428 (43%), Gaps = 31/428 (7%) Frame = -3 Query: 2054 LDFSTIPVVDLHDFSQDEINVASQCSDVFR-------FDDIIVPKIDYSVFQESSASRKQ 1896 L ++P++D SQ E+ S S DD + PKID SVF ES+ SRKQ Sbjct: 106 LSAESLPLIDTRLLSQSELRALSPSSSSSASLAASAGVDDDLTPKIDRSVFNESAGSRKQ 165 Query: 1895 TYSKLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTMHPSG 1716 T+ ++RL+R + D+S ++E ++ ++ Sbjct: 166 TFLRVRLARDPPPPRPPSPQRRR-----------DDSSREEKSQVASLLR---------- 204 Query: 1715 NGSLFMSDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVR----LVSSEQVTDL 1548 SLF D + N+ E + V+E QPL +N +V S + V + Sbjct: 205 --SLFSVD-SFQRNAEED--EGEEEVEEKEGQPLISLPIHNNGNVYRNPYFDSVKNVQGI 259 Query: 1547 VTAASNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSN-GTTNFVDTVD- 1374 + R +K P +G L ++ D + + + ++ RSN GT + D Sbjct: 260 SENETRRRPGRPRKIRNPSDGV-LDSYA---DESEREGTLSVDKTRSNLGTESGYDASGI 315 Query: 1373 ---------KAENIQQYHDKLTRENNAPHMANANNTATFSPIF-----KESIFPQLKRKF 1236 K ++ D E+ ++ N T + +E + + R+ Sbjct: 316 SMDSNPGKRKRGRPRKSGDGCKSEDKEEIVSLENREGTMVDLSALANNEEDPYGEELRRI 375 Query: 1235 T----TEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIE 1068 T T+ EL F + GEW + KK+K+V A D+ LP+GWKL+L ++KK + Sbjct: 376 TVGLGTKEELLAFLEQVNGEWVNAGKKKKVVKACDYGGYLPRGWKLMLCIKKKGSIQWLA 435 Query: 1067 CRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGT 888 CR+YISP G +FA+ KE S +L + V + +++N +++ T 
+P+ E+ Sbjct: 436 CRRYISPDGQEFATCKEVSTYLQS----LVESQSKNRLNSFQSDNHTLGEPVMGNESLVG 491 Query: 887 LNDGVDMP 864 +D +D+P Sbjct: 492 NSDSMDLP 499 >ref|XP_004292482.1| PREDICTED: uncharacterized protein LOC101298198 [Fragaria vesca subsp. vesca] Length = 821 Score = 101 bits (251), Expect = 1e-18 Identities = 94/363 (25%), Positives = 157/363 (43%), Gaps = 15/363 (4%) Frame = -3 Query: 1709 SLFMSDGALDANSSEINLQAITR--VDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAA 1536 SL S+GA+D + + I R +E+ YS + L+S+ +V+ A Sbjct: 38 SLTRSNGAID----HLVVPKIDRSQFNESAGSRRQTYSRVRRRVAGLLSNPKVS-----A 88 Query: 1535 SNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQ 1356 + D ++ LK F+ S D QI ++P + + + + +++ + + Sbjct: 89 PPAQPDDPERNENQAIIGHLKRFI-SQDPKFDQIDLEPSPMTMKASLSGMAELERRKRKR 147 Query: 1355 QYHDKLTRENNAPHM-ANANNTATFSPIFKESIFP---QLKRK---FTTEPELHTFFNNI 1197 K + + N N A + S P +L+R+ TE EL F ++ Sbjct: 148 GRKPKAKGSSGGEGLIVNKNGAAVDIWALQNSENPFGDELRRRTLGLETEEELLGFMRDL 207 Query: 1196 GGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKE 1017 GG+W S+ KKRKIV+A +F LP GWKLLLG+++K R I CR+YISP G QF S KE Sbjct: 208 GGQWGSRRKKRKIVDATEFGDALPLGWKLLLGLKRKERRAWIYCRRYISPTGQQFLSCKE 267 Query: 1016 ASVFL----STNGALPVGRKDVSQVNLDH--ANSRTHFDPIQKKETHGTLNDGVDMPNAS 855 + FL S N A + D A H D +K + N G+ + S Sbjct: 268 VASFLESFFSLNNADRHDGDGGENIQEDRIVATENQHADKDGEKRQDVSFNSGILGSSIS 327 Query: 854 HQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLT 675 ++ + ++ + + ++ C KC+ T+ ++ S++ HL Sbjct: 328 NE------------QSNEPEKKVSISEMENLAEVQIHNLFECHKCSMTFADKDSYLQHLL 375 Query: 674 IHH 666 H Sbjct: 376 SFH 378 >ref|XP_006661339.1| PREDICTED: dentin sialophosphoprotein-like, partial [Oryza brachyantha] Length = 1042 Score = 99.0 bits (245), Expect = 7e-18 Identities = 101/406 (24%), Positives = 168/406 (41%), Gaps = 42/406 (10%) Frame = -3 Query: 1229 EPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYIS 1050 E EL F N + G+W S+ ++RK V+A F LP+GWKLLLG+++K I CR+Y+S Sbjct: 136 
ESELLGFMNGLEGQWGSRRRRRKFVDASMFGDHLPRGWKLLLGLKRKERVAWINCRRYVS 195 Query: 1049 PAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGVD 870 P+G QFAS KE S +L + +G + + ++N+ H E H + G Sbjct: 196 PSGQQFASCKEISSYLIS----LLGYVEAKPTAIQNSNAGVH-------ELHTVNSVGHC 244 Query: 869 MPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMN---CKKCNATYPNR 699 PN++ + +P S S +R+++ + N C+KCN + ++ Sbjct: 245 QPNSTEEKH----SAPPV--TSVPVSSHYGDPQRQHDKNETQVETNGKECQKCNLIFQDQ 298 Query: 698 SSFMGH-LTIHHVK---RKKNAEG-IPANTN-SVTSLQVQANGTNNI----PNIMAVQAY 549 S+++ H L+ H K RK N G + N N + + ++Q + + N+ A + Sbjct: 299 SAYVQHQLSFHQRKAKRRKVNKSGEVGVNKNGTFVTQELQQTSEDKLGHIDHNVAASRNQ 358 Query: 548 GQNMDNV-------------AMVYDKANNVNSTQIQEDGVNNGKSALHIGNAEDMEKAPG 408 GQ + V +M + + E G + L G+ D Sbjct: 359 GQTPEKVSDETISGELGGQPSMAPEPVGFRETDGETEQGKESSAGELLSGHCNDSLHNMA 418 Query: 407 EVVTMSHISS-ETVRLHTENMENSST----------NGNIPHDANCSSMTDIKSPSN--- 270 +V S+ E V H EN+ ++ N PH +S SP+N Sbjct: 419 DVAEQEKRSAREPVTGHHENLSDNCVDHKIHDGACHNAEEPHAVEAASKFSTGSPANFHE 478 Query: 269 --SCSKSFDEKYQCTVDIGVPESGDEQKSEKPNLFNFTSKNISADN 138 S CT +I + E PN + S++ D+ Sbjct: 479 IDSSKDIVLSSADCTQNISKTDKTCNLLEEAPNATSTQSESKCTDD 524 >ref|XP_004957094.1| PREDICTED: uncharacterized protein LOC101759536 [Setaria italica] Length = 1141 Score = 98.6 bits (244), Expect = 1e-17 Identities = 106/468 (22%), Positives = 186/468 (39%), Gaps = 82/468 (17%) Frame = -3 Query: 1232 TEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYI 1053 +E EL F N + G+W S+ ++RK V+A F LP+GWKLLLG+++K I CR+Y+ Sbjct: 198 SESELLGFMNALEGQWGSRRRRRKFVDAGMFADHLPRGWKLLLGLKRKERVAWINCRRYV 257 Query: 1052 SPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGV 873 SP G QFA+ KE S +L + P + +Q+N ++ H + Sbjct: 258 SPKGHQFATCKEVSTYLRSLLGYPEAKPTTTQIN----SAGVH---------------DL 298 Query: 872 DMPNASHQNQL----KAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNA-PMNCKKCNATY 708 D+ +A HQ + + + P S G K + + + + P C+KCN T+ Sbjct: 299 DINSAGHQQTISIEQRQLAVPLTSVTLFSHSGDSHGQKLQKDEAQMEVNPKECRKCNLTF 358 Query: 707 
PNRSSFMGH-LTIHHVKRKKN-----------------------AEGIPANTNSVTSLQV 600 ++ ++M H L+ H K K+ EG +++ V ++ Sbjct: 359 HDQGAYMQHQLSFHQRKAKRRRVSKSSELGTYVDGNYETQQKTLGEGFGNSSHGVADVRY 418 Query: 599 QANGTNNI------------PNIMAVQAYGQNMDNVAMVYDK---------ANNVN---- 495 Q + P++ A Q M + +K NN + Sbjct: 419 QGQSPAKLFDGTFSGQLGVQPSLKAAPLGFQEMTVLPPQLEKEPFAGEPVSMNNKDPPEE 478 Query: 494 ---------STQIQEDGVNNGKSALHIGNAEDMEKAPG--EVVTMSHISSE--------- 375 + E +GK + N + EK P E V+ S ++E Sbjct: 479 MSGFLEQERESAAGEPISRHGKDPQEMINFPEQEKEPAAREAVSGSTSAAELEKGPSAGG 538 Query: 374 -TVRLHTENMENSSTNGNIPHDANCSS-----MTDIKSPSNSCSKSFDEKYQCTVDIGVP 213 T H + ++NS + HD C S D +S ++C+ + + C+ D+ + Sbjct: 539 PTSGHHLDAVDNSD---HRTHDETCDSAVASLSVDAESKLSTCNATNFHENDCSKDLELS 595 Query: 212 ESGDEQKSEKPNLFNFTSKNIS--ADNNEQASISNPFLELLQEAAVEE 75 + QKS + + K +S AD+ ++ +N +E E+ Sbjct: 596 NTDHSQKSNRSDETYGVPKEVSPAADDPVESKSTNDLMECTDITQTEQ 643 >ref|XP_002525855.1| hypothetical protein RCOM_0824380 [Ricinus communis] gi|223534860|gb|EEF36549.1| hypothetical protein RCOM_0824380 [Ricinus communis] Length = 697 Score = 98.2 bits (243), Expect = 1e-17 Identities = 63/200 (31%), Positives = 96/200 (48%), Gaps = 4/200 (2%) Frame = -3 Query: 1253 QLKRK---FTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNG 1083 +LKR+ E EL FF ++GG+W S+ +KRKIV+A +F LP GWKLLLG+++K G Sbjct: 199 ELKRRTEGMVKEEELLGFFRDLGGQWCSRRRKRKIVDASEFGDFLPFGWKLLLGLKRKEG 258 Query: 1082 RYIIECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKK 903 + + CR+YISP+G QF S KE S +L + DH+N Sbjct: 259 KAWVYCRRYISPSGQQFISCKEVSAYLQS-----------CLKPYDHSNGNNRQVHRVAS 307 Query: 902 ETH-GTLNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCK 726 E H GT D S + ++ D + E +++ + C Sbjct: 308 ENHAGTSGREEDQRQPSEHEKAVSLLGID----NLELAEV-----------QIQDLFECH 352 Query: 725 KCNATYPNRSSFMGHLTIHH 666 KCN T+ ++ +++ HL H Sbjct: 353 KCNMTFDDKDTYLQHLLSFH 372 >gb|EMJ00867.1| hypothetical protein PRUPE_ppa001455mg [Prunus persica] Length = 824 Score = 97.8 bits (242), Expect = 2e-17 Identities = 64/194 (32%), 
Positives = 95/194 (48%), Gaps = 5/194 (2%) Frame = -3 Query: 1232 TEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYI 1053 TE +L F +GG+W S+ KKRKIV+A +F LP GWKLLLG+++K GR I CR++I Sbjct: 216 TEEQLLGFMRELGGQWGSRRKKRKIVDANEFGDALPVGWKLLLGLKRKEGRAWIYCRRFI 275 Query: 1052 SPAGPQFASWKEASVFLST-----NGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGT 888 SP G QF S KE S FL + N P G H + I E + Sbjct: 276 SPTGQQFLSCKEVSSFLHSFFGFNNARQPDG----------HGGENLQEECIMTTENQHS 325 Query: 887 LNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATY 708 DG + + L + S + ++ +E S ++G + ++ C KC+ T+ Sbjct: 326 DKDGGRRQYVNSSSAL--VVSTISNEREKEVS--LSGME-NLAEVQIHDLFECHKCSMTF 380 Query: 707 PNRSSFMGHLTIHH 666 + S++ HL H Sbjct: 381 GEKDSYLQHLLSFH 394 >ref|NP_001123600.1| LOC100170247 [Zea mays] gi|189514249|gb|ACE07054.1| methylcytosine binding domain protein [Zea mays] gi|414589744|tpg|DAA40315.1| TPA: methylcytosine binding domain protein [Zea mays] Length = 1176 Score = 97.1 bits (240), Expect = 3e-17 Identities = 73/234 (31%), Positives = 111/234 (47%), Gaps = 12/234 (5%) Frame = -3 Query: 1232 TEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYI 1053 +E EL F N + G+W S+ ++RK VNA F LP GWKLLLG+++K I CR+Y+ Sbjct: 195 SESELLGFMNALEGQWGSRRRRRKFVNAGMFGDHLPCGWKLLLGLKRKERVAWINCRRYV 254 Query: 1052 SPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGV 873 SP G QFA+ KE S +L + + SQ+N N+ H + H Sbjct: 255 SPKGHQFATCKEVSSYLLSLLGYQEAKPTASQIN----NAGVHDLHVNSVGLHQQTISIE 310 Query: 872 DMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSS 693 + A N + S +G Q+K ++ P +NA C+KCN T+ ++S+ Sbjct: 311 EKQIAVPVNSVALFNS--SGDSHQQK------LQKDEAPIEVNA-KECRKCNLTFHDQSA 361 Query: 692 FMGH-LTIHHVKRKK-----------NAEGIPANTNSVTSLQVQANGTNNIPNI 567 +M H L+ H K K+ N +G T TS +V N ++ N+ Sbjct: 362 YMQHQLSFHQRKAKRRRVSKSGELGTNIDGNYEKTQQKTSGEVSGNFGHSAANV 415 >ref|XP_002300183.1| hypothetical protein POPTR_0001s31990g [Populus trichocarpa] gi|222847441|gb|EEE84988.1| hypothetical protein POPTR_0001s31990g [Populus trichocarpa] 
Length = 837 Score = 96.7 bits (239), Expect = 4e-17 Identities = 62/199 (31%), Positives = 93/199 (46%), Gaps = 3/199 (1%) Frame = -3 Query: 1253 QLKRK---FTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNG 1083 +LKR+ E EL FF +GG+W S+ KKRKIV+A +F LP GWKL+LG+++K G Sbjct: 219 ELKRRTEGMEKEEELLGFFRELGGQWCSRRKKRKIVDAGEFGDFLPVGWKLILGLKRKEG 278 Query: 1082 RYIIECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKK 903 R + CR+Y+SP+G QF S K+ S +L + VG D Q DH Sbjct: 279 RAWVYCRRYLSPSGQQFISCKDVSAYLQS----LVGPYDAQQAK-DHTG----------- 322 Query: 902 ETHGTLNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKK 723 H D P+A +L+ D + + + + + C K Sbjct: 323 --HSIQQDHGGAPHAGAIERLE-----DQRQSIEHQKQVSLLETDNLAEVQIRDLFECHK 375 Query: 722 CNATYPNRSSFMGHLTIHH 666 C T+ + +++ HL H Sbjct: 376 CRMTFDEKGTYLEHLLSFH 394