BLASTX nr result

ID: Ephedra25_contig00005512 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00005512
         (1899 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002284140.2| PREDICTED: uncharacterized protein LOC100248...   137   2e-29
ref|XP_004238529.1| PREDICTED: uncharacterized protein LOC101267...   135   6e-29
ref|XP_006578321.1| PREDICTED: uncharacterized protein LOC100780...   129   4e-27
ref|XP_006364865.1| PREDICTED: uncharacterized protein LOC102582...   129   6e-27
ref|XP_002513931.1| hypothetical protein RCOM_1035820 [Ricinus c...   122   4e-25
ref|XP_006853188.1| hypothetical protein AMTR_s00038p00204530 [A...   122   7e-25
ref|XP_006581536.1| PREDICTED: uncharacterized protein LOC102665...   121   1e-24
ref|NP_173650.3| methyl-CPG-binding domain-containing protein [A...   116   4e-23
ref|XP_002893218.1| methyl-CpG-binding domain 8 [Arabidopsis lyr...   113   3e-22
gb|EOY15980.1| Uncharacterized protein isoform 1 [Theobroma caca...   112   4e-22
gb|ESW08251.1| hypothetical protein PHAVU_009G031600g [Phaseolus...   111   1e-21
ref|XP_006416210.1| hypothetical protein EUTSA_v10007200mg [Eutr...   105   5e-20
ref|XP_004292482.1| PREDICTED: uncharacterized protein LOC101298...   101   1e-18
ref|XP_002525855.1| hypothetical protein RCOM_0824380 [Ricinus c...    98   1e-17
gb|EMJ00867.1| hypothetical protein PRUPE_ppa001455mg [Prunus pe...    98   1e-17
ref|XP_006661339.1| PREDICTED: dentin sialophosphoprotein-like, ...    97   2e-17
ref|XP_004957094.1| PREDICTED: uncharacterized protein LOC101759...    97   2e-17
ref|NP_001123600.1| LOC100170247 [Zea mays] gi|189514249|gb|ACE0...    97   2e-17
ref|XP_002300183.1| hypothetical protein POPTR_0001s31990g [Popu...    97   3e-17
emb|CBI19167.3| unnamed protein product [Vitis vinifera]               96   4e-17

>ref|XP_002284140.2| PREDICTED: uncharacterized protein LOC100248904 [Vitis vinifera]
          Length = 947

 Score =  137 bits (345), Expect = 2e-29
 Identities = 131/475 (27%), Positives = 206/475 (43%), Gaps = 12/475 (2%)
 Frame = +3

Query: 60   LDFSTIPVVDLHDFSQDEINV----ASQCSDVFRFDDIIVPKIDYSVFQESSASRKQTYS 227
            L    +P++DL   SQ E+      +S  SD+ R DD+++PKID S+F ES+ SRKQTYS
Sbjct: 12   LHLEALPLIDLRFLSQSELQALSLTSSHSSDLRRCDDVVIPKIDRSIFNESAGSRKQTYS 71

Query: 228  KLRLS-RKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTTHPSGNG 404
            +LRL+ RK + A T+P  +   FS    +    E   +E   I+  ++            
Sbjct: 72   RLRLAPRKPDIAATIP--RRPRFSPHLNQKAALEPVDEENTLIIGLLK------------ 117

Query: 405  SLFMSDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAASN 584
             LF ++               T  D+  P  +  Y    NE ++ +  + V D       
Sbjct: 118  GLFATE---------------THADDLIPVQVE-YRESSNEILQNIPIDVVADS------ 155

Query: 585  SAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQQY 764
                R +K+ +PK    +  + N        + I    + +NG       VD A      
Sbjct: 156  ---GRKRKRGRPKSEKTIAVYQNGGSGEGGGMGI----INNNGVV-----VDVAA----- 198

Query: 765  HDKLTRENNAPHMANANNTATFSPIFKESIFPQLKRK---FTTEPELHTFFNNIGGEWAS 935
                        +ANA          ++   P+L+R+    TTE EL  F   + G+W S
Sbjct: 199  ------------LANA----------EDPFGPELRRRTEGLTTEEELLGFLTGLSGQWGS 236

Query: 936  KLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVFLS 1115
            + KKRKIV A DF   LPQGWKLLL +++K GR  + CR+YISP G QF S KE S  L 
Sbjct: 237  RRKKRKIVEASDFGDVLPQGWKLLLSMKRKEGRVWLFCRRYISPNGQQFVSCKEVSSCLL 296

Query: 1116 TNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGVDMPNASHQNQLKAICSPD 1295
            +      G +D  Q N  H +  +    +  + + G    G+ + + + ++ L  +CS  
Sbjct: 297  SLS----GLQDARQPNYGHNDENSQ---LAHQISPGNA-AGLTLKDDNSKDGL--VCSSP 346

Query: 1296 AGKKS----QEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIHH 1448
            +   +     EK   +      +    +   + C KC  T+  +   + HL+  H
Sbjct: 347  STVTTIPTHHEKQATLLNMGNSWE-VKVGEILKCHKCAMTFDEKDDLLHHLSSSH 400


>ref|XP_004238529.1| PREDICTED: uncharacterized protein LOC101267888 [Solanum
            lycopersicum]
          Length = 1192

 Score =  135 bits (340), Expect = 6e-29
 Identities = 171/660 (25%), Positives = 260/660 (39%), Gaps = 50/660 (7%)
 Frame = +3

Query: 60   LDFSTIPVVDLHDFSQDEINVASQCSDVF----RFDDIIVPKIDYSVFQESSASRKQTYS 227
            L   +IP VDL   SQ E+   S CS       R DD+I+PKID SVF ES+ SRKQTYS
Sbjct: 18   LQAESIPTVDLRLLSQSELYSLSLCSPAAFNPCRDDDVIIPKIDRSVFNESAGSRKQTYS 77

Query: 228  KLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTTHPSGNGS 407
            +LRL+         P   A   S  + R+                     N+ HP  N S
Sbjct: 78   RLRLA---------PAATASASSAIRSRT-----------------PHLRNSPHPLQNPS 111

Query: 408  LFMSDGALDANSSEINL---QAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAA 578
               ++G  ++ SS+I     Q      +  P  L      +++ + + S   V  L  A 
Sbjct: 112  --PNNGPANSESSQIVTLLKQLFGSGTQKNPTDLVPIRVDYSDSLSVPSHVPVPGLELAN 169

Query: 579  SNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQ 758
              S I + +K+ +P++        N N     ++               VD V K   + 
Sbjct: 170  VGS-IGQKRKRGRPRK--------NENGVRVAEVK--------------VDEVVKDIVVY 206

Query: 759  QYHDKLTRENNAPHMANANNTATFSPIFKESIFP---QLKRK---FTTEPELHTFFNNIG 920
            Q  D   +E     + N +       +   S+ P   +L+R+     +  EL  F   + 
Sbjct: 207  QNVDDSDKE-----IMNKDGIPVDLAVLGASVDPFGLELRRRTEGLGSAEELLGFLGRLN 261

Query: 921  GEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEA 1100
            G+W S  KKR+IV+A+DF S LP+ WKLLL +++K GR  + CR+YISP G QF + KE 
Sbjct: 262  GQWGSTRKKRRIVDADDFGSMLPKSWKLLLSIKRKEGRSWLHCRRYISPNGRQFGTCKEV 321

Query: 1101 SVFL-----STNGALPVGRKDVSQVNLDHANSRTHFDPIQ---KKET-----------HG 1223
            S +L       N  LP        V + +A + T    IQ   KKE+           HG
Sbjct: 322  SSYLLFLRGERNENLPTYVNGSGTVEITNACALTSDLRIQDGGKKESSVFHNSSPAVGHG 381

Query: 1224 TLNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIV-------AGTKRRYNPSSL----- 1367
             L   ++    S       +           K D++          K R    S+     
Sbjct: 382  ELQVLLNFGELSEVQVGDLLQCDKCNVTFNNKDDLLQHQLFSHQRRKSRNGGQSITDGVI 441

Query: 1368 --NAPMNCKKCNATYPNRSSFMGHLTIHHVKRKKNAEG-IPANTNSVTSLQVQANGTNNI 1538
              +    C+ C+ T+  +  + GH+  H  K+ K  +G +P          V +      
Sbjct: 442  IRDGKFECQFCHKTFEEKHRYNGHVGNHVKKQVKTVDGSLPIKMGGGIEPVVPSGAMLRE 501

Query: 1539 PNIMAVQAYGQNM-DNVTMVYDKANN-VNSTQIQEDGVNNGKSALHIGNAEDMEKAPGEV 1712
            P +       +N+ +N  ++ D  +N   +T+IQED +         G +       G  
Sbjct: 502  PIMQDSVVLPRNLTENAGVITDAGDNPAPTTKIQEDHMETDNKLEAEGTSNGCHNQEGSS 561

Query: 1713 VTMSHISSETVRLHTENMENSSTNGNIPHDANCSSMTDIKSPSNSCSKSF-DEKYQCTVD 1889
            V+ S ISS        +     +N   P         DI    +SC  S  D K+  TVD
Sbjct: 562  VSRSPISSNEKTCVDISKVIVGSNIEEPEQEGLLCSNDI---VDSCGVSMEDGKFFPTVD 618


>ref|XP_006578321.1| PREDICTED: uncharacterized protein LOC100780637 isoform X1 [Glycine
            max] gi|571450041|ref|XP_006578322.1| PREDICTED:
            uncharacterized protein LOC100780637 isoform X2 [Glycine
            max]
          Length = 863

 Score =  129 bits (324), Expect = 4e-27
 Identities = 118/458 (25%), Positives = 199/458 (43%), Gaps = 14/458 (3%)
 Frame = +3

Query: 72   TIPVVDLHDFSQDEINV-----ASQCSDVFRFDDIIVPKIDYSVFQESSASRKQTYSKLR 236
            ++P+VDL   SQ E+       A+ C      DD ++PKID S F ES+ SRKQTYSKLR
Sbjct: 18   SLPLVDLRLLSQPELYTLSLSGATHCHRRNSDDDSVIPKIDRSNFNESAGSRKQTYSKLR 77

Query: 237  LSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTTHPSGNGSLFM 416
            L+++++    +P   + H  L      + E  ++E  RI+  +Q+            LF 
Sbjct: 78   LNKRKQNP-AVPASSSFHIPLH-----ISEPEEEENSRIVALLQQ------------LFG 119

Query: 417  SDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAASNSAID 596
             +   +A  ++   + +  V  +  QP  +++A  N  + +V+                 
Sbjct: 120  VEPLRNAPRNDAAERRLVPVQVDFKQPPPMFAAFQNVPIDVVADSS-------------Q 166

Query: 597  RSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQQYHDKL 776
            R +K+ +P++        + N         K      N  T FV+   K           
Sbjct: 167  RKRKRGRPRK--------DENSVTVFVEEPKKVTKEENSVTVFVEEPKKVNG-------- 210

Query: 777  TRENNAPHMANANNTATFSP---IFKESIFPQLKRK---FTTEPELHTFFNNIGGEWASK 938
               N   + A A  T T +    + ++    +LKR+     TEP++  F   + GEWAS+
Sbjct: 211  ---NGEVNAAVATTTTTVNETVGLDEDPFEVELKRRTQGLETEPQVVEFLETLNGEWASQ 267

Query: 939  LKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVFLST 1118
             KKR+IV A +    LP GWK+++   ++ GR    CR+Y+SP G QF S KEAS +L +
Sbjct: 268  RKKRRIVPASELGDLLPAGWKIVIITMRRAGRASAVCRRYVSPDGHQFESCKEASAYLLS 327

Query: 1119 NGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGVDMPNASHQNQLKAICSPDA 1298
                  G +D S +   +++          + +  ++     +P    +    A   P A
Sbjct: 328  ----VFGVQDRSHLKSSYSDGAQQLSSSMNRASESSVG---HVPTGDMKTDASASYLPSA 380

Query: 1299 G---KKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNAT 1403
            G     S EK   ++ +    N +S +  + CK  +AT
Sbjct: 381  GAPIHSSHEKQPPISSSIGSENFNS-DLALGCKLGDAT 417


>ref|XP_006364865.1| PREDICTED: uncharacterized protein LOC102582612 isoform X2 [Solanum
            tuberosum]
          Length = 1193

 Score =  129 bits (323), Expect = 6e-27
 Identities = 135/490 (27%), Positives = 208/490 (42%), Gaps = 18/490 (3%)
 Frame = +3

Query: 60   LDFSTIPVVDLHDFSQDEINVASQCSDVF----RFDDIIVPKIDYSVFQESSASRKQTYS 227
            L   +IP VDL   SQ E+   S CS       R DD+I+PKID SVF ES+ SRKQTYS
Sbjct: 18   LQAESIPTVDLRLLSQSELYSLSLCSTAAFNPCRDDDVIIPKIDRSVFNESAGSRKQTYS 77

Query: 228  KLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTTHPSGNGS 407
            +LRL+             A   S S  RS                     N+ HP  N S
Sbjct: 78   RLRLAPAA----------AASASSSAIRSRTPHLR---------------NSPHPLQNPS 112

Query: 408  LFMSDGALDANSSEINL---QAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAA 578
               ++G  ++ SS+I +   Q      +  P  L      +++ + + S   V  L  A 
Sbjct: 113  --PNNGPANSESSQIVILLKQLFGSGTQKNPTDLVPIRVDYSDSLSVPSHVPVPGLELAN 170

Query: 579  SNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQ 758
              S + + +K+ +P++          N+       +K +E+        V  +   +N+ 
Sbjct: 171  VGS-VGQKRKRGRPRK----------NENGVRVAEVKVDEV--------VKDIVVYQNVD 211

Query: 759  QYHDKLTRENNAPHMANANNTATFSPIFKESIFPQLKRK---FTTEPELHTFFNNIGGEW 929
                ++  ++  P +  A   A   P   E     L+R+     +  EL  F   + G+W
Sbjct: 212  DSDKEIMNKDGIP-VDLAVLGALVDPFGLE-----LRRRTEGLGSAEELLGFLGRLNGQW 265

Query: 930  ASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVF 1109
             S  KKR+IV+A++F S LP+ WKLLL +++K GR  + CR+YISP G QF + KE S +
Sbjct: 266  GSTRKKRRIVDADEFGSVLPKSWKLLLSIKRKEGRSWLHCRRYISPNGRQFGTCKEVSSY 325

Query: 1110 L-----STNGALPVGRKDVSQVNLDHANSRTHFDPIQ---KKETHGTLNDGVDMPNASHQ 1265
            L          LP        V + +A + T    IQ   KKE+    N     P   H 
Sbjct: 326  LLFLHGERKENLPAYANGSGTVEITNACALTSDLRIQDGGKKESSVFHNSS---PAVGH- 381

Query: 1266 NQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIH 1445
             +L+ + +        E S++  G             ++C KCN T+ N+   + H    
Sbjct: 382  GELQVLVN------FGELSEVQVGDL-----------LHCDKCNVTFNNKDDLLQHQLFS 424

Query: 1446 HVKRKKNAEG 1475
            H +R+    G
Sbjct: 425  HQRRRSRNGG 434


>ref|XP_002513931.1| hypothetical protein RCOM_1035820 [Ricinus communis]
            gi|223547017|gb|EEF48514.1| hypothetical protein
            RCOM_1035820 [Ricinus communis]
          Length = 1337

 Score =  122 bits (307), Expect = 4e-25
 Identities = 136/506 (26%), Positives = 209/506 (41%), Gaps = 38/506 (7%)
 Frame = +3

Query: 60   LDFSTIPVVDLHDFSQDEINVASQCSDVFRFD------DIIVPKIDYSVFQESSASRKQT 221
            L   ++P++DL   SQ E+   S CS  F  +      D+   KID SVF ES+ SRKQT
Sbjct: 24   LQMESLPLIDLRLLSQSELLSLSLCSFSFLNNPLQNEADVATLKIDRSVFNESAGSRKQT 83

Query: 222  YSKLRLSRKQEGAETLPGYKAGHFSLSKCRSM-----VDESGKQEAQRILQFIQERLNTT 386
            +S+LRL+R+             HFS    R+      V+ S  +E  +I+  I+      
Sbjct: 84   FSRLRLARRNNNNS--------HFSTPSIRNQIPHQTVEISQDEENSQIIYLIK------ 129

Query: 387  HPSGNGSLFMSDGALDANSSEINLQAITRVDENCPQPLAV---YSALHNEDVRLVSSEQV 557
                  SLF S+   +  ++E++   +   D     P+     + AL +  V   S E  
Sbjct: 130  ------SLFGSNFENEKENNEVDNVNLFSDDNLISVPITYNESFQALQDLAVADYSDETK 183

Query: 558  TDLVTAASNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTV 737
              + TA ++S     K+K + +    L  F+ +N+   +    + EE      T   D+ 
Sbjct: 184  QAIATAITHSESTAEKRK-RGRPRKNLSDFVGNNNVDGNDNGNEKEEKEETAIT---DSK 239

Query: 738  DKAENIQQYHDKLTRENNAPHMANANNTATF-SPIFKES----------------IFPQL 866
             K    Q+    L   NN    AN    A   +P  +E                    +L
Sbjct: 240  RKRGRPQKDASTLGCHNNNNVNANEEKRAVCENPRTQEEEKRGMKVELGSSEEDPYAEEL 299

Query: 867  KRK---FTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRY 1037
            +R+     TE EL  F   + GEW SK KKRKIV+A      LP+ WKL+L  +++ G +
Sbjct: 300  RRRTMGMQTESELLGFLEGLQGEWMSKRKKRKIVDASVLGDVLPRNWKLILCNKRRAGFF 359

Query: 1038 IIECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKET 1217
             ++C  YISP G QF S KE S     +  L    + VSQ +  H +S          + 
Sbjct: 360  WLDCTGYISPNGQQFMSCKEVS-----SNLLSKELQGVSQSSFGHDDSNI--------QL 406

Query: 1218 HGTLNDG--VDMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNA--PMNC 1385
             GT++ G   D+   +++N    I SP        + +  A T     P  +      NC
Sbjct: 407  TGTVSYGNAADLTLKNNKNGGGFISSPALPVTKSVEHEKQATTLAAVVPPHVQTVEKYNC 466

Query: 1386 KKCNATYPNRSSFMGHLTIHHVKRKK 1463
             KC   +      + HL   H +  K
Sbjct: 467  HKCTMAFQEPDDLLQHLLSSHQRAPK 492


>ref|XP_006853188.1| hypothetical protein AMTR_s00038p00204530 [Amborella trichopoda]
            gi|548856827|gb|ERN14655.1| hypothetical protein
            AMTR_s00038p00204530 [Amborella trichopoda]
          Length = 826

 Score =  122 bits (305), Expect = 7e-25
 Identities = 92/303 (30%), Positives = 147/303 (48%), Gaps = 11/303 (3%)
 Frame = +3

Query: 588  AIDRSKKKLKPKEGARLKAFMN-----SNDAAAHQIPIKPEELRSNGTTNFVDTVDKAEN 752
            A+ R K+++  KE AR K  M+     + D  A         + +NG+++F  T      
Sbjct: 260  AVIRQKRRVSKKEDARRKGLMSLAVLENGDRGA---------IDNNGSSDFNQTGIGC-- 308

Query: 753  IQQYHDKLTRENNAPHMANANNTATFSPIFKESIFPQLKRK---FTTEPELHTFFNNIGG 923
                H  +   +N   M         +   +E   P LK++      E EL  F + +GG
Sbjct: 309  ----HGNVRNGDNKEKMLQNGFVEVHALASRELFVPHLKKRTAALENELELVEFLDGLGG 364

Query: 924  EWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEAS 1103
            EW +K KKRK+V+A DF  GLP GWK++LG+RKK G+  I+CRKYISP G +FA+ KE +
Sbjct: 365  EWVTKRKKRKMVDASDFGDGLPDGWKVILGIRKKEGKLFIDCRKYISPTGQKFATCKEVT 424

Query: 1104 VFLST---NGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGVDMPNASHQNQL 1274
              L +   +G+L V  +   + N+   + RT         TH ++   V  P  + + + 
Sbjct: 425  AHLLSEPQDGSLAVSAR--IEENMSGNSMRTRI----SGATHSSMK--VPAPQ-TKEPKC 475

Query: 1275 KAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIHHVK 1454
                S D+GK+      I+  + +  NP  L   + C+KCN  + ++  +M HL   H +
Sbjct: 476  NGSISKDSGKQ------II--SHQVDNPIKLT--LECRKCNLNFNSKEVYMHHLLAVHQR 525

Query: 1455 RKK 1463
            + K
Sbjct: 526  KSK 528



 Score = 64.3 bits (155), Expect = 2e-07
 Identities = 36/68 (52%), Positives = 46/68 (67%), Gaps = 3/68 (4%)
 Frame = +3

Query: 60  LDFSTIPVVDLHDFSQDEIN---VASQCSDVFRFDDIIVPKIDYSVFQESSASRKQTYSK 230
           L  S+IP++DL   SQDEI+   + S  S      DI+VPKID S+F ES  SRKQTYS+
Sbjct: 14  LPISSIPLIDLRFLSQDEISSLALLSLPSSNPPLTDIVVPKIDRSIFNESQGSRKQTYSR 73

Query: 231 LRLSRKQE 254
           LRLS K++
Sbjct: 74  LRLSHKKQ 81


>ref|XP_006581536.1| PREDICTED: uncharacterized protein LOC102665295 [Glycine max]
          Length = 871

 Score =  121 bits (303), Expect = 1e-24
 Identities = 127/455 (27%), Positives = 205/455 (45%), Gaps = 11/455 (2%)
 Frame = +3

Query: 72   TIPVVDLHDFSQDEINVASQCSDVFRF-----DDIIVPKIDYSVFQESSASRKQTYSKLR 236
            ++P+VDL   SQ E+   S      R      DD ++PKID S F ES+ SRKQTYSKLR
Sbjct: 16   SLPLVDLRLLSQPELYTLSLSGATHRHRRNSDDDSVIPKIDRSNFNESAGSRKQTYSKLR 75

Query: 237  LSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTTHPSGNGSLFM 416
            L+++++    +P   + H  L      + E  ++E  RI+  + + L    P  N +   
Sbjct: 76   LNKRKQNP-AVPASSSFHIPLH-----ISEPEEEENSRIIALLHQ-LFGVEPLRNNAPRN 128

Query: 417  SDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAASNSAID 596
            +D      + E  L  +  V+   P P++V +   N  +         D+V   S     
Sbjct: 129  ND------APERRLVPV-HVEFKQPPPISV-ALFQNVPI---------DVVPDGSQ---- 167

Query: 597  RSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQQYHDKL 776
            R +K+ +P++        NS      + P K  +   N  T FV+   K  N ++     
Sbjct: 168  RKRKRGRPRKDE------NSVTVFVEE-PTKVTK-EENSLTVFVEEPKKVTNEEK--SVK 217

Query: 777  TRENNAPHMANANNTATFSPIFKESIFP-QLKRK---FTTEPELHTFFNNIGGEWASKLK 944
               N   + A A  T   S    E +F  +LKR+     TE ++  F   + GEWAS+ K
Sbjct: 218  VNGNGEGNAAVATATVNESVGLDEDLFEVELKRRAQGLETESQVMEFLETLNGEWASQRK 277

Query: 945  KRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVFLSTNG 1124
            KR+IV A +    LP GWK+++ V ++ GR    CR+Y+SP G QF S KEAS +L +  
Sbjct: 278  KRRIVPATELGDMLPAGWKIVIIVMRRAGRASAVCRRYVSPDGHQFESCKEASAYLLSVS 337

Query: 1125 ALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLN--DGVDMPNASHQNQLKAICSPDA 1298
                G +D S +   + +          + +  ++      DM   ++ + L +  +P  
Sbjct: 338  ----GVQDRSHLKSSYTDGAQQLSSSMNRASESSVGHVPTGDMKTVANASYLSSAGAPI- 392

Query: 1299 GKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNAT 1403
               S EK  +V+ +    N  S +  + CK  +AT
Sbjct: 393  -DSSHEKQPLVSSSIGSENFIS-DLALGCKLGDAT 425


>ref|NP_173650.3| methyl-CPG-binding domain-containing protein [Arabidopsis thaliana]
            gi|75174757|sp|Q9LME6.1|MBD8_ARATH RecName:
            Full=Methyl-CpG-binding domain-containing protein 8;
            Short=AtMBD8; Short=MBD08; AltName:
            Full=Methyl-CpG-binding protein MBD8
            gi|9392683|gb|AAF87260.1|AC068562_7 Contains a Methyl-CpG
            binding domain PF|01429 and two DNA binding domains with
            preference for A/T rich regions PF|02178. ESTs
            gb|AI998776, gb|N95984 come from this gene [Arabidopsis
            thaliana] gi|26452716|dbj|BAC43440.1| unknown protein
            [Arabidopsis thaliana] gi|332192108|gb|AEE30229.1|
            methyl-CPG-binding domain-containing protein [Arabidopsis
            thaliana]
          Length = 524

 Score =  116 bits (290), Expect = 4e-23
 Identities = 101/383 (26%), Positives = 166/383 (43%), Gaps = 32/383 (8%)
 Frame = +3

Query: 60   LDFSTIPVVDLHDFSQDEINVASQCSDVFRF-----------DDIIVPKIDYSVFQESSA 206
            L   ++P++D    SQ E+   SQCS +              DD + PKID SVF ES+ 
Sbjct: 21   LSAESLPLIDTRLLSQSELRALSQCSSLSPSSSASLAASAGGDDDLTPKIDRSVFNESAG 80

Query: 207  SRKQTYSKLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTT 386
            SRKQT+ +LRL+R  +  E  P  +             D+S ++E  ++   ++   N  
Sbjct: 81   SRKQTFLRLRLARHPQPPEEPPSPQRQR----------DDSSREEQTQVASLLRSLFNVD 130

Query: 387  HPSGNGSLFMSDGALDANSSEI--NLQAITRVDENCPQPLAVYSALHNE---------DV 533
                       +  L+ N  +I  N     R + +  Q + +     N+          +
Sbjct: 131  SNQSKEEEDEGEEELEDNEGQIHYNSYVYQRPNLDSIQNVLIQGTSGNKIKRKRGRPRKI 190

Query: 534  RLVSSE-QVTDLVTAASNSA-IDRSKKKLKPKE---GARLKAFMNSNDAAAHQIPIKPEE 698
            R  S E +V DL   AS    +D++   L        + +    NS      + P   EE
Sbjct: 191  RNPSEENEVLDLTGEASTYVFVDKTSSNLGMVSRVGSSGISLDSNSVKRKRGRPPKNKEE 250

Query: 699  L-----RSNGTTNFVDTVDKAENIQQYHDKLTRENNAPHMANANNTATFSPIFKESIFPQ 863
            +     R +   N +   DK E +      +  EN    + + +  A+ S    E    +
Sbjct: 251  IMNLEKRDSAIVN-ISAFDKEELV------VNLENREGTIVDLSALASVSEDPYEEELRR 303

Query: 864  LKRKFTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYII 1043
            +     T+ E+  F   + GEW +  KK+K+VNA D+   LP+GW+L+L +++K    ++
Sbjct: 304  ITVGLKTKEEILGFLEQLNGEWVNIGKKKKVVNACDYGGYLPRGWRLMLYIKRKGSNLLL 363

Query: 1044 ECRKYISPAGPQFASWKEASVFL 1112
             CR+YISP G QF + KE S +L
Sbjct: 364  ACRRYISPDGQQFETCKEVSTYL 386


>ref|XP_002893218.1| methyl-CpG-binding domain 8 [Arabidopsis lyrata subsp. lyrata]
            gi|297339060|gb|EFH69477.1| methyl-CpG-binding domain 8
            [Arabidopsis lyrata subsp. lyrata]
          Length = 511

 Score =  113 bits (283), Expect = 3e-22
 Identities = 102/387 (26%), Positives = 170/387 (43%), Gaps = 32/387 (8%)
 Frame = +3

Query: 48   AEVLLDFSTIPVVDLHDFSQDEINVASQCSDVFRF-----------DDIIVPKIDYSVFQ 194
            A+  L   ++P++D+   SQ E+   S CS +              DD + PKID SVF 
Sbjct: 17   ADNRLSAESLPLIDMRLLSQSELRALSHCSSLSPSSSASLATSAGGDDDLTPKIDRSVFN 76

Query: 195  ESSASRKQTYSKLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQER 374
            ES+ SRKQT+ +LRL+R  +  E  P  +             D+S  +E  ++   ++  
Sbjct: 77   ESAGSRKQTFLRLRLARHPQPTEKPPSPQRQR----------DDSSIEEQTQVAPLLRSL 126

Query: 375  LNTTHPSGNGSLFMSDGALDANSSEI--NLQAITRVDENCPQPLAVYSALHNE------- 527
             N             +  ++ N  +I  N     R + +  Q + +     NE       
Sbjct: 127  FNVDSIQSKEEEDEGEEEVEENEGQIHYNSYVYQRPNLDSVQNVLIQGTSGNEIKRKRGR 186

Query: 528  --DVRLVSSE--QVTDLVTAASNSA-IDRSKKKLKPKE---GARLKAFMNSNDAAAHQIP 683
               +R  S E  +V DL   AS    +D++   L  +     + +    NS      + P
Sbjct: 187  PRKIRNPSEEDTEVLDLTGEASAYVFVDKTSSNLGIESRFGSSGISMDSNSVKRKRGRPP 246

Query: 684  IKPEELRS--NGTTNFVDT--VDKAENIQQYHDKLTRENNAPHMANANNTATFSPIFKES 851
               EE+ +  N  +  V++  +DK E ++        EN    + + +  A+ S    E 
Sbjct: 247  KNKEEIMNLENRDSAIVNSSALDKEELVKL-------ENREGAIVDLSALASVSEDPYEE 299

Query: 852  IFPQLKRKFTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNG 1031
               ++     T+ E+  F   + GEW +  KK+K+V A D+   LP+GWKL+L ++KK  
Sbjct: 300  ELRRITVGLKTKEEILVFLEQLNGEWVNIGKKKKVVRACDYGGYLPRGWKLMLYIKKKGS 359

Query: 1032 RYIIECRKYISPAGPQFASWKEASVFL 1112
              ++ CR+YISP G QF + KE S +L
Sbjct: 360  SLLLACRRYISPDGQQFETCKEVSTYL 386


>gb|EOY15980.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508724084|gb|EOY15981.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1203

 Score =  112 bits (281), Expect = 4e-22
 Identities = 138/525 (26%), Positives = 207/525 (39%), Gaps = 53/525 (10%)
 Frame = +3

Query: 60   LDFSTIPVVDLHDFSQDEINVASQCSDV----FRFDDIIVPKIDYSVFQESSASRKQTYS 227
            L   +IPVVDL   SQ E+   S CS          ++  PKID SVF ES+ SRKQT+S
Sbjct: 14   LHLESIPVVDLRLISQPELLSLSLCSSSPSPSNADTELFTPKIDRSVFNESAGSRKQTFS 73

Query: 228  KLRLSRKQEGA----ETLPGYKAGHFSLSKCRSMVDESG-KQEAQRILQFIQERLNTTHP 392
            +LRL+  +        + P  K    SLS+  + V+     +E+  IL  ++        
Sbjct: 74   RLRLAAPRNHLPHPHHSSPSSKP-FTSLSQRLNPVNPGPLDEESSNILSLLK-------- 124

Query: 393  SGNGSLFMSDGALDANSSEINLQAITRVDENCPQPLAVY---------SALHNEDVRLVS 545
                SLF  D +L +N++E         D+    P+ +          S L N  V +VS
Sbjct: 125  ----SLFNIDDSLTSNTNEDEPD-----DDKDLVPVQIEYENGKDNGNSVLQNIPVGIVS 175

Query: 546  ------------SEQVTDLVTAASNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIK 689
                         +Q  +L+  + N  I+  +      E A       S +A    I   
Sbjct: 176  CSGSKRKRGRPRKDQKDNLLIESENLVIEEHQ------ETAAFDRVSESVNAGG--ISSC 227

Query: 690  PEELRSNGTTNFVDTVDKAENIQQYHDKLTRENNAPHMANANNTATFSPIFKESIFPQLK 869
             E  R  G        ++++N     ++   E+    +A  N  A         I  +L+
Sbjct: 228  SERKRKRGRPR----KEESQNRVIVSEEKKVESEIERVALGNVEAILG------IEEELR 277

Query: 870  RK---FTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYI 1040
            R+     TE EL  F   + GEWASK +K++IV+A  F + LPQGWKL+L V+K+ G   
Sbjct: 278  RRTEAIGTEAELLEFMGGLEGEWASKSQKKRIVDAAGFGNVLPQGWKLMLFVKKRAGHVW 337

Query: 1041 IECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANS-----RTHFDPIQ 1205
            + C +YISP G QF S KE S  L + G L    +  S +      S       +F  I 
Sbjct: 338  LACSRYISPNGQQFVSCKEVSSCLLSAGELKDSSQSTSSLTGRGIGSGVKPTSENFPIIC 397

Query: 1206 KKETHGTLNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIVA---------------GT 1340
                H      + M +     + + I          ++ D +                GT
Sbjct: 398  TSSEHERQAPLLRMGSPWEVQRAETIKCHKCTMTFNQQDDFICHLLSSHQGTVKSSGHGT 457

Query: 1341 KRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIHHVKRKKNAEG 1475
                     N    C+ C   +  RS +  HL +H     K  EG
Sbjct: 458  STNEEVIIKNGKYECQFCYELFEERSCYSSHLGVHMKNNTKKVEG 502


>gb|ESW08251.1| hypothetical protein PHAVU_009G031600g [Phaseolus vulgaris]
          Length = 841

 Score =  111 bits (277), Expect = 1e-21
 Identities = 109/369 (29%), Positives = 166/369 (44%), Gaps = 8/369 (2%)
 Frame = +3

Query: 30   EAEVKEAEVLLDFSTIPVVDLHDFSQDEINVASQCSDVFRF-----DDIIVPKIDYSVFQ 194
            EAEV+ +   +D  ++P+VDL   SQ E+   S      R      +D +VPKID S F 
Sbjct: 5    EAEVEPSSDHID--SLPLVDLRLLSQPELYTLSLSGATHRHRRANDNDSVVPKIDRSNFN 62

Query: 195  ESSASRKQTYSKLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQER 374
            ES+ SRKQTYSKLRL+ K++    +P   + H         + E   QE  +I+  + ++
Sbjct: 63   ESAGSRKQTYSKLRLN-KRKQNFAVPASSSFH---------IPEPVDQENSQIISLL-QQ 111

Query: 375  LNTTHPSGNGSLFMSDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVRLVSSEQ 554
            L    P  N        AL  +  +     +  V     QP  V           V+ + 
Sbjct: 112  LFGVEPLRN--------ALRPDCGDAANHQLFPVHVEFKQPPPV----------TVTFQT 153

Query: 555  VTDLVTAASNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDT 734
            V   V  ASN    R +K+ +P++   L +         ++           G +     
Sbjct: 154  VPIDVIDASN----RKRKRGRPRKNENLVSVFEEETKKVNE-----------GRSAVATV 198

Query: 735  VDKAENIQQYHDKLTRENNAPHMANANNTATFSPIFKESIFPQLKRK---FTTEPELHTF 905
            +++   +    D L   +N P              F E    +LKR+     TEP+L  F
Sbjct: 199  IERGFGVDA--DGL---DNDP--------------FGE----ELKRRTAGLETEPQLLEF 235

Query: 906  FNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFA 1085
               + GEWAS+ KKR+IV A D  + LP GWK+++ + ++ GR  + CR+Y+SP G QF 
Sbjct: 236  LETLNGEWASQRKKRRIVQASDLGTVLPAGWKIVITLLRRAGRASVVCRRYVSPGGHQFE 295

Query: 1086 SWKEASVFL 1112
            S KEAS +L
Sbjct: 296  SCKEASAYL 304


>ref|XP_006416210.1| hypothetical protein EUTSA_v10007200mg [Eutrema salsugineum]
            gi|557093981|gb|ESQ34563.1| hypothetical protein
            EUTSA_v10007200mg [Eutrema salsugineum]
          Length = 575

 Score =  105 bits (263), Expect = 5e-20
 Identities = 108/428 (25%), Positives = 185/428 (43%), Gaps = 31/428 (7%)
 Frame = +3

Query: 60   LDFSTIPVVDLHDFSQDEINVASQCSDVFR-------FDDIIVPKIDYSVFQESSASRKQ 218
            L   ++P++D    SQ E+   S  S            DD + PKID SVF ES+ SRKQ
Sbjct: 106  LSAESLPLIDTRLLSQSELRALSPSSSSSASLAASAGVDDDLTPKIDRSVFNESAGSRKQ 165

Query: 219  TYSKLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTTHPSG 398
            T+ ++RL+R           +             D+S ++E  ++   ++          
Sbjct: 166  TFLRVRLARDPPPPRPPSPQRRR-----------DDSSREEKSQVASLLR---------- 204

Query: 399  NGSLFMSDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVR----LVSSEQVTDL 566
              SLF  D +   N+ E   +    V+E   QPL      +N +V       S + V  +
Sbjct: 205  --SLFSVD-SFQRNAEED--EGEEEVEEKEGQPLISLPIHNNGNVYRNPYFDSVKNVQGI 259

Query: 567  VTAASNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSN-GTTNFVDTVD- 740
                +     R +K   P +G  L ++    D +  +  +  ++ RSN GT +  D    
Sbjct: 260  SENETRRRPGRPRKIRNPSDGV-LDSYA---DESEREGTLSVDKTRSNLGTESGYDASGI 315

Query: 741  ---------KAENIQQYHDKLTRENNAPHMANANNTATFSPIF-----KESIFPQLKRKF 878
                     K    ++  D    E+    ++  N   T   +      +E  + +  R+ 
Sbjct: 316  SMDSNPGKRKRGRPRKSGDGCKSEDKEEIVSLENREGTMVDLSALANNEEDPYGEELRRI 375

Query: 879  T----TEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIE 1046
            T    T+ EL  F   + GEW +  KK+K+V A D+   LP+GWKL+L ++KK     + 
Sbjct: 376  TVGLGTKEELLAFLEQVNGEWVNAGKKKKVVKACDYGGYLPRGWKLMLCIKKKGSIQWLA 435

Query: 1047 CRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGT 1226
            CR+YISP G +FA+ KE S +L +     V  +  +++N   +++ T  +P+   E+   
Sbjct: 436  CRRYISPDGQEFATCKEVSTYLQS----LVESQSKNRLNSFQSDNHTLGEPVMGNESLVG 491

Query: 1227 LNDGVDMP 1250
             +D +D+P
Sbjct: 492  NSDSMDLP 499


>ref|XP_004292482.1| PREDICTED: uncharacterized protein LOC101298198 [Fragaria vesca
            subsp. vesca]
          Length = 821

 Score =  101 bits (251), Expect = 1e-18
 Identities = 94/363 (25%), Positives = 157/363 (43%), Gaps = 15/363 (4%)
 Frame = +3

Query: 405  SLFMSDGALDANSSEINLQAITR--VDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAA 578
            SL  S+GA+D     + +  I R   +E+       YS +      L+S+ +V+     A
Sbjct: 38   SLTRSNGAID----HLVVPKIDRSQFNESAGSRRQTYSRVRRRVAGLLSNPKVS-----A 88

Query: 579  SNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQ 758
              +  D  ++         LK F+ S D    QI ++P  +    + + +  +++ +  +
Sbjct: 89   PPAQPDDPERNENQAIIGHLKRFI-SQDPKFDQIDLEPSPMTMKASLSGMAELERRKRKR 147

Query: 759  QYHDKLTRENNAPHM-ANANNTATFSPIFKESIFP---QLKRK---FTTEPELHTFFNNI 917
                K    +    +  N N  A      + S  P   +L+R+     TE EL  F  ++
Sbjct: 148  GRKPKAKGSSGGEGLIVNKNGAAVDIWALQNSENPFGDELRRRTLGLETEEELLGFMRDL 207

Query: 918  GGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKE 1097
            GG+W S+ KKRKIV+A +F   LP GWKLLLG+++K  R  I CR+YISP G QF S KE
Sbjct: 208  GGQWGSRRKKRKIVDATEFGDALPLGWKLLLGLKRKERRAWIYCRRYISPTGQQFLSCKE 267

Query: 1098 ASVFL----STNGALPVGRKDVSQVNLDH--ANSRTHFDPIQKKETHGTLNDGVDMPNAS 1259
             + FL    S N A          +  D   A    H D   +K    + N G+   + S
Sbjct: 268  VASFLESFFSLNNADRHDGDGGENIQEDRIVATENQHADKDGEKRQDVSFNSGILGSSIS 327

Query: 1260 HQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLT 1439
            ++            + ++ +  +            ++    C KC+ T+ ++ S++ HL 
Sbjct: 328  NE------------QSNEPEKKVSISEMENLAEVQIHNLFECHKCSMTFADKDSYLQHLL 375

Query: 1440 IHH 1448
              H
Sbjct: 376  SFH 378


>ref|XP_002525855.1| hypothetical protein RCOM_0824380 [Ricinus communis]
            gi|223534860|gb|EEF36549.1| hypothetical protein
            RCOM_0824380 [Ricinus communis]
          Length = 697

 Score = 98.2 bits (243), Expect = 1e-17
 Identities = 63/200 (31%), Positives = 96/200 (48%), Gaps = 4/200 (2%)
 Frame = +3

Query: 861  QLKRK---FTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNG 1031
            +LKR+      E EL  FF ++GG+W S+ +KRKIV+A +F   LP GWKLLLG+++K G
Sbjct: 199  ELKRRTEGMVKEEELLGFFRDLGGQWCSRRRKRKIVDASEFGDFLPFGWKLLLGLKRKEG 258

Query: 1032 RYIIECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKK 1211
            +  + CR+YISP+G QF S KE S +L +                DH+N           
Sbjct: 259  KAWVYCRRYISPSGQQFISCKEVSAYLQS-----------CLKPYDHSNGNNRQVHRVAS 307

Query: 1212 ETH-GTLNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCK 1388
            E H GT     D    S   +  ++   D    + E +++            +     C 
Sbjct: 308  ENHAGTSGREEDQRQPSEHEKAVSLLGID----NLELAEV-----------QIQDLFECH 352

Query: 1389 KCNATYPNRSSFMGHLTIHH 1448
            KCN T+ ++ +++ HL   H
Sbjct: 353  KCNMTFDDKDTYLQHLLSFH 372


>gb|EMJ00867.1| hypothetical protein PRUPE_ppa001455mg [Prunus persica]
          Length = 824

 Score = 97.8 bits (242), Expect = 1e-17
 Identities = 64/194 (32%), Positives = 95/194 (48%), Gaps = 5/194 (2%)
 Frame = +3

Query: 882  TEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYI 1061
            TE +L  F   +GG+W S+ KKRKIV+A +F   LP GWKLLLG+++K GR  I CR++I
Sbjct: 216  TEEQLLGFMRELGGQWGSRRKKRKIVDANEFGDALPVGWKLLLGLKRKEGRAWIYCRRFI 275

Query: 1062 SPAGPQFASWKEASVFLST-----NGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGT 1226
            SP G QF S KE S FL +     N   P G          H       + I   E   +
Sbjct: 276  SPTGQQFLSCKEVSSFLHSFFGFNNARQPDG----------HGGENLQEECIMTTENQHS 325

Query: 1227 LNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATY 1406
              DG      +  + L  + S  + ++ +E S  ++G +       ++    C KC+ T+
Sbjct: 326  DKDGGRRQYVNSSSAL--VVSTISNEREKEVS--LSGME-NLAEVQIHDLFECHKCSMTF 380

Query: 1407 PNRSSFMGHLTIHH 1448
              + S++ HL   H
Sbjct: 381  GEKDSYLQHLLSFH 394


>ref|XP_006661339.1| PREDICTED: dentin sialophosphoprotein-like, partial [Oryza
            brachyantha]
          Length = 1042

 Score = 97.1 bits (240), Expect = 2e-17
 Identities = 89/336 (26%), Positives = 149/336 (44%), Gaps = 27/336 (8%)
 Frame = +3

Query: 885  EPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYIS 1064
            E EL  F N + G+W S+ ++RK V+A  F   LP+GWKLLLG+++K     I CR+Y+S
Sbjct: 136  ESELLGFMNGLEGQWGSRRRRRKFVDASMFGDHLPRGWKLLLGLKRKERVAWINCRRYVS 195

Query: 1065 PAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGVD 1244
            P+G QFAS KE S +L +     +G  +     + ++N+  H       E H   + G  
Sbjct: 196  PSGQQFASCKEISSYLIS----LLGYVEAKPTAIQNSNAGVH-------ELHTVNSVGHC 244

Query: 1245 MPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMN---CKKCNATYPNR 1415
             PN++ +       +P     S   S      +R+++ +      N   C+KCN  + ++
Sbjct: 245  QPNSTEEKH----SAPPV--TSVPVSSHYGDPQRQHDKNETQVETNGKECQKCNLIFQDQ 298

Query: 1416 SSFMGH-LTIHHVK---RKKNAEG-IPANTN-SVTSLQVQANGTNNI----PNIMAVQAY 1565
            S+++ H L+ H  K   RK N  G +  N N +  + ++Q    + +     N+ A +  
Sbjct: 299  SAYVQHQLSFHQRKAKRRKVNKSGEVGVNKNGTFVTQELQQTSEDKLGHIDHNVAASRNQ 358

Query: 1566 GQNMDNV-------------TMVYDKANNVNSTQIQEDGVNNGKSALHIGNAEDMEKAPG 1706
            GQ  + V             +M  +      +    E G  +    L  G+  D      
Sbjct: 359  GQTPEKVSDETISGELGGQPSMAPEPVGFRETDGETEQGKESSAGELLSGHCNDSLHNMA 418

Query: 1707 EVVTMSHISS-ETVRLHTENMENSSTNGNIPHDANC 1811
            +V      S+ E V  H EN+ ++  +  I HD  C
Sbjct: 419  DVAEQEKRSAREPVTGHHENLSDNCVDHKI-HDGAC 453


>ref|XP_004957094.1| PREDICTED: uncharacterized protein LOC101759536 [Setaria italica]
          Length = 1141

 Score = 97.1 bits (240), Expect = 2e-17
 Identities = 61/200 (30%), Positives = 100/200 (50%), Gaps = 6/200 (3%)
 Frame = +3

Query: 882  TEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYI 1061
            +E EL  F N + G+W S+ ++RK V+A  F   LP+GWKLLLG+++K     I CR+Y+
Sbjct: 198  SESELLGFMNALEGQWGSRRRRRKFVDAGMFADHLPRGWKLLLGLKRKERVAWINCRRYV 257

Query: 1062 SPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGV 1241
            SP G QFA+ KE S +L +    P  +   +Q+N    ++  H                +
Sbjct: 258  SPKGHQFATCKEVSTYLRSLLGYPEAKPTTTQIN----SAGVH---------------DL 298

Query: 1242 DMPNASHQNQL----KAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNA-PMNCKKCNATY 1406
            D+ +A HQ  +    + +  P         S    G K + + + +   P  C+KCN T+
Sbjct: 299  DINSAGHQQTISIEQRQLAVPLTSVTLFSHSGDSHGQKLQKDEAQMEVNPKECRKCNLTF 358

Query: 1407 PNRSSFMGH-LTIHHVKRKK 1463
             ++ ++M H L+ H  K K+
Sbjct: 359  HDQGAYMQHQLSFHQRKAKR 378


>ref|NP_001123600.1| LOC100170247 [Zea mays] gi|189514249|gb|ACE07054.1| methylcytosine
            binding domain protein [Zea mays]
            gi|414589744|tpg|DAA40315.1| TPA: methylcytosine binding
            domain protein [Zea mays]
          Length = 1176

 Score = 97.1 bits (240), Expect = 2e-17
 Identities = 73/234 (31%), Positives = 111/234 (47%), Gaps = 12/234 (5%)
 Frame = +3

Query: 882  TEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYI 1061
            +E EL  F N + G+W S+ ++RK VNA  F   LP GWKLLLG+++K     I CR+Y+
Sbjct: 195  SESELLGFMNALEGQWGSRRRRRKFVNAGMFGDHLPCGWKLLLGLKRKERVAWINCRRYV 254

Query: 1062 SPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGV 1241
            SP G QFA+ KE S +L +       +   SQ+N    N+  H   +     H       
Sbjct: 255  SPKGHQFATCKEVSSYLLSLLGYQEAKPTASQIN----NAGVHDLHVNSVGLHQQTISIE 310

Query: 1242 DMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSS 1421
            +   A   N +    S  +G   Q+K       ++   P  +NA   C+KCN T+ ++S+
Sbjct: 311  EKQIAVPVNSVALFNS--SGDSHQQK------LQKDEAPIEVNA-KECRKCNLTFHDQSA 361

Query: 1422 FMGH-LTIHHVKRKK-----------NAEGIPANTNSVTSLQVQANGTNNIPNI 1547
            +M H L+ H  K K+           N +G    T   TS +V  N  ++  N+
Sbjct: 362  YMQHQLSFHQRKAKRRRVSKSGELGTNIDGNYEKTQQKTSGEVSGNFGHSAANV 415


>ref|XP_002300183.1| hypothetical protein POPTR_0001s31990g [Populus trichocarpa]
            gi|222847441|gb|EEE84988.1| hypothetical protein
            POPTR_0001s31990g [Populus trichocarpa]
          Length = 837

 Score = 96.7 bits (239), Expect = 3e-17
 Identities = 62/199 (31%), Positives = 93/199 (46%), Gaps = 3/199 (1%)
 Frame = +3

Query: 861  QLKRK---FTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNG 1031
            +LKR+      E EL  FF  +GG+W S+ KKRKIV+A +F   LP GWKL+LG+++K G
Sbjct: 219  ELKRRTEGMEKEEELLGFFRELGGQWCSRRKKRKIVDAGEFGDFLPVGWKLILGLKRKEG 278

Query: 1032 RYIIECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKK 1211
            R  + CR+Y+SP+G QF S K+ S +L +     VG  D  Q   DH             
Sbjct: 279  RAWVYCRRYLSPSGQQFISCKDVSAYLQS----LVGPYDAQQAK-DHTG----------- 322

Query: 1212 ETHGTLNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKK 1391
              H    D    P+A    +L+     D  +  + +  +            +     C K
Sbjct: 323  --HSIQQDHGGAPHAGAIERLE-----DQRQSIEHQKQVSLLETDNLAEVQIRDLFECHK 375

Query: 1392 CNATYPNRSSFMGHLTIHH 1448
            C  T+  + +++ HL   H
Sbjct: 376  CRMTFDEKGTYLEHLLSFH 394


>emb|CBI19167.3| unnamed protein product [Vitis vinifera]
          Length = 1129

 Score = 96.3 bits (238), Expect = 4e-17
 Identities = 66/204 (32%), Positives = 101/204 (49%), Gaps = 7/204 (3%)
 Frame = +3

Query: 858  PQLKRK---FTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKN 1028
            P+L+R+    TTE EL  F   + G+W S+ KKRKIV A DF   LPQGWKLLL +++K 
Sbjct: 171  PELRRRTEGLTTEEELLGFLTGLSGQWGSRRKKRKIVEASDFGDVLPQGWKLLLSMKRKE 230

Query: 1029 GRYIIECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQK 1208
            GR  + CR+YISP G QF S KE S  L +      G +D  Q N  H +  +    +  
Sbjct: 231  GRVWLFCRRYISPNGQQFVSCKEVSSCLLSLS----GLQDARQPNYGHNDENSQ---LAH 283

Query: 1209 KETHGTLNDGVDMPNASHQNQLKAICSPDAGKKS----QEKSDIVAGTKRRYNPSSLNAP 1376
            + + G    G+ + + + ++ L  +CS  +   +     EK   +      +    +   
Sbjct: 284  QISPGNA-AGLTLKDDNSKDGL--VCSSPSTVTTIPTHHEKQATLLNMGNSWE-VKVGEI 339

Query: 1377 MNCKKCNATYPNRSSFMGHLTIHH 1448
            + C KC  T+  +   + HL+  H
Sbjct: 340  LKCHKCAMTFDEKDDLLHHLSSSH 363



 Score = 69.7 bits (169), Expect = 4e-09
 Identities = 39/88 (44%), Positives = 55/88 (62%), Gaps = 5/88 (5%)
 Frame = +3

Query: 24  SMEAEVKEAEVLLDFSTIPVVDLHDFSQDEINV----ASQCSDVFRFDDIIVPKIDYSVF 191
           SM +    A   L    +P++DL   SQ E+      +S  SD+ R DD+++PKID S+F
Sbjct: 2   SMASSSSTASGGLHLEALPLIDLRFLSQSELQALSLTSSHSSDLRRCDDVVIPKIDRSIF 61

Query: 192 QESSASRKQTYSKLRLS-RKQEGAETLP 272
            ES+ SRKQTYS+LRL+ RK + A T+P
Sbjct: 62  NESAGSRKQTYSRLRLAPRKPDIAATIP 89

