BLASTX nr result

ID: Ephedra26_contig00011640 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra26_contig00011640
         (2113 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004238529.1| PREDICTED: uncharacterized protein LOC101267...   137   1e-29
ref|XP_002284140.2| PREDICTED: uncharacterized protein LOC100248...   137   2e-29
ref|XP_006578321.1| PREDICTED: uncharacterized protein LOC100780...   129   5e-27
ref|XP_006364865.1| PREDICTED: uncharacterized protein LOC102582...   128   9e-27
ref|XP_002513931.1| hypothetical protein RCOM_1035820 [Ricinus c...   122   5e-25
ref|XP_006853188.1| hypothetical protein AMTR_s00038p00204530 [A...   122   6e-25
ref|XP_006581536.1| PREDICTED: uncharacterized protein LOC102665...   121   1e-24
ref|NP_173650.3| methyl-CPG-binding domain-containing protein [A...   115   6e-23
gb|ESW08251.1| hypothetical protein PHAVU_009G031600g [Phaseolus...   114   1e-22
ref|XP_006433971.1| hypothetical protein CICLE_v10000205mg [Citr...   114   1e-22
gb|EOY15980.1| Uncharacterized protein isoform 1 [Theobroma caca...   112   5e-22
ref|XP_006472591.1| PREDICTED: uncharacterized protein LOC102628...   108   7e-21
ref|XP_006416210.1| hypothetical protein EUTSA_v10007200mg [Eutr...   105   6e-20
ref|XP_004292482.1| PREDICTED: uncharacterized protein LOC101298...   101   1e-18
ref|XP_006661339.1| PREDICTED: dentin sialophosphoprotein-like, ...    99   7e-18
ref|XP_004957094.1| PREDICTED: uncharacterized protein LOC101759...    99   1e-17
ref|XP_002525855.1| hypothetical protein RCOM_0824380 [Ricinus c...    98   1e-17
gb|EMJ00867.1| hypothetical protein PRUPE_ppa001455mg [Prunus pe...    98   2e-17
ref|NP_001123600.1| LOC100170247 [Zea mays] gi|189514249|gb|ACE0...    97   3e-17
ref|XP_002300183.1| hypothetical protein POPTR_0001s31990g [Popu...    97   4e-17

>ref|XP_004238529.1| PREDICTED: uncharacterized protein LOC101267888 [Solanum
            lycopersicum]
          Length = 1192

 Score =  137 bits (346), Expect = 1e-29
 Identities = 173/667 (25%), Positives = 263/667 (39%), Gaps = 50/667 (7%)
 Frame = -3

Query: 2054 LDFSTIPVVDLHDFSQDEINVASQCSDVF----RFDDIIVPKIDYSVFQESSASRKQTYS 1887
            L   +IP VDL   SQ E+   S CS       R DD+I+PKID SVF ES+ SRKQTYS
Sbjct: 18   LQAESIPTVDLRLLSQSELYSLSLCSPAAFNPCRDDDVIIPKIDRSVFNESAGSRKQTYS 77

Query: 1886 KLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTMHPSGNGS 1707
            +LRL+         P   A   S  + R+                     N+ HP  N S
Sbjct: 78   RLRLA---------PAATASASSAIRSRT-----------------PHLRNSPHPLQNPS 111

Query: 1706 LFMSDGALDANSSEINL---QAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAA 1536
               ++G  ++ SS+I     Q      +  P  L      +++ + + S   V  L  A 
Sbjct: 112  --PNNGPANSESSQIVTLLKQLFGSGTQKNPTDLVPIRVDYSDSLSVPSHVPVPGLELAN 169

Query: 1535 SNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQ 1356
              S I + +K+ +P++        N N     ++               VD V K   + 
Sbjct: 170  VGS-IGQKRKRGRPRK--------NENGVRVAEVK--------------VDEVVKDIVVY 206

Query: 1355 QYHDKLTRENNAPHMANANNTATFSPIFKESIFP---QLKRK---FTTEPELHTFFNNIG 1194
            Q  D   +E     + N +       +   S+ P   +L+R+     +  EL  F   + 
Sbjct: 207  QNVDDSDKE-----IMNKDGIPVDLAVLGASVDPFGLELRRRTEGLGSAEELLGFLGRLN 261

Query: 1193 GEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEA 1014
            G+W S  KKR+IV+A+DF S LP+ WKLLL +++K GR  + CR+YISP G QF + KE 
Sbjct: 262  GQWGSTRKKRRIVDADDFGSMLPKSWKLLLSIKRKEGRSWLHCRRYISPNGRQFGTCKEV 321

Query: 1013 SVFL-----STNGALPVGRKDVSQVNLDHANSRTHFDPIQ---KKET-----------HG 891
            S +L       N  LP        V + +A + T    IQ   KKE+           HG
Sbjct: 322  SSYLLFLRGERNENLPTYVNGSGTVEITNACALTSDLRIQDGGKKESSVFHNSSPAVGHG 381

Query: 890  TLNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIV-------AGTKRRYNPSSL----- 747
             L   ++    S       +           K D++          K R    S+     
Sbjct: 382  ELQVLLNFGELSEVQVGDLLQCDKCNVTFNNKDDLLQHQLFSHQRRKSRNGGQSITDGVI 441

Query: 746  --NAPMNCKKCNATYPNRSSFMGHLTIHHVKRKKNAEG-IPANTNSVTSLQVQANGTNNI 576
              +    C+ C+ T+  +  + GH+  H  K+ K  +G +P          V +      
Sbjct: 442  IRDGKFECQFCHKTFEEKHRYNGHVGNHVKKQVKTVDGSLPIKMGGGIEPVVPSGAMLRE 501

Query: 575  PNIMAVQAYGQNM-DNVAMVYDKANN-VNSTQIQEDGVNNGKSALHIGNAEDMEKAPGEV 402
            P +       +N+ +N  ++ D  +N   +T+IQED +         G +       G  
Sbjct: 502  PIMQDSVVLPRNLTENAGVITDAGDNPAPTTKIQEDHMETDNKLEAEGTSNGCHNQEGSS 561

Query: 401  VTMSHISSETVRLHTENMENSSTNGNIPHDANCSSMTDIKSPSNSCSKSF-DEKYQCTVD 225
            V+ S ISS        +     +N   P         DI    +SC  S  D K+  TVD
Sbjct: 562  VSRSPISSNEKTCVDISKVIVGSNIEEPEQEGLLCSNDI---VDSCGVSMEDGKFFPTVD 618

Query: 224  IGVPESG 204
                E+G
Sbjct: 619  ESKVENG 625


>ref|XP_002284140.2| PREDICTED: uncharacterized protein LOC100248904 [Vitis vinifera]
          Length = 947

 Score =  137 bits (345), Expect = 2e-29
 Identities = 132/477 (27%), Positives = 204/477 (42%), Gaps = 14/477 (2%)
 Frame = -3

Query: 2054 LDFSTIPVVDLHDFSQDEINV----ASQCSDVFRFDDIIVPKIDYSVFQESSASRKQTYS 1887
            L    +P++DL   SQ E+      +S  SD+ R DD+++PKID S+F ES+ SRKQTYS
Sbjct: 12   LHLEALPLIDLRFLSQSELQALSLTSSHSSDLRRCDDVVIPKIDRSIFNESAGSRKQTYS 71

Query: 1886 KLRLS-RKQEGAETLPGYK--AGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTMHPSG 1716
            +LRL+ RK + A T+P     + H +       VDE                 NT+    
Sbjct: 72   RLRLAPRKPDIAATIPRRPRFSPHLNQKAALEPVDEE----------------NTLIIGL 115

Query: 1715 NGSLFMSDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAA 1536
               LF ++               T  D+  P  +  Y    NE ++ +  + V D     
Sbjct: 116  LKGLFATE---------------THADDLIPVQVE-YRESSNEILQNIPIDVVADS---- 155

Query: 1535 SNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQ 1356
                  R +K+ +PK    +  + N        + I    + +NG       VD A    
Sbjct: 156  -----GRKRKRGRPKSEKTIAVYQNGGSGEGGGMGI----INNNGVV-----VDVAA--- 198

Query: 1355 QYHDKLTRENNAPHMANANNTATFSPIFKESIFPQLKRK---FTTEPELHTFFNNIGGEW 1185
                          +ANA          ++   P+L+R+    TTE EL  F   + G+W
Sbjct: 199  --------------LANA----------EDPFGPELRRRTEGLTTEEELLGFLTGLSGQW 234

Query: 1184 ASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVF 1005
             S+ KKRKIV A DF   LPQGWKLLL +++K GR  + CR+YISP G QF S KE S  
Sbjct: 235  GSRRKKRKIVEASDFGDVLPQGWKLLLSMKRKEGRVWLFCRRYISPNGQQFVSCKEVSSC 294

Query: 1004 LSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGVDMPNASHQNQLKAICS 825
            L +      G +D  Q N  H +  +    +  + + G    G+ + + + ++ L  +CS
Sbjct: 295  LLSLS----GLQDARQPNYGHNDENSQ---LAHQISPGNA-AGLTLKDDNSKDGL--VCS 344

Query: 824  PDAGKKS----QEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIHH 666
              +   +     EK   +      +    +   + C KC  T+  +   + HL+  H
Sbjct: 345  SPSTVTTIPTHHEKQATLLNMGNSWE-VKVGEILKCHKCAMTFDEKDDLLHHLSSSH 400


>ref|XP_006578321.1| PREDICTED: uncharacterized protein LOC100780637 isoform X1 [Glycine
            max] gi|571450041|ref|XP_006578322.1| PREDICTED:
            uncharacterized protein LOC100780637 isoform X2 [Glycine
            max]
          Length = 863

 Score =  129 bits (324), Expect = 5e-27
 Identities = 118/458 (25%), Positives = 199/458 (43%), Gaps = 14/458 (3%)
 Frame = -3

Query: 2042 TIPVVDLHDFSQDEINV-----ASQCSDVFRFDDIIVPKIDYSVFQESSASRKQTYSKLR 1878
            ++P+VDL   SQ E+       A+ C      DD ++PKID S F ES+ SRKQTYSKLR
Sbjct: 18   SLPLVDLRLLSQPELYTLSLSGATHCHRRNSDDDSVIPKIDRSNFNESAGSRKQTYSKLR 77

Query: 1877 LSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTMHPSGNGSLFM 1698
            L+++++    +P   + H  L      + E  ++E  RI+  +Q+            LF 
Sbjct: 78   LNKRKQNP-AVPASSSFHIPLH-----ISEPEEEENSRIVALLQQ------------LFG 119

Query: 1697 SDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAASNSAID 1518
             +   +A  ++   + +  V  +  QP  +++A  N  + +V+                 
Sbjct: 120  VEPLRNAPRNDAAERRLVPVQVDFKQPPPMFAAFQNVPIDVVADSS-------------Q 166

Query: 1517 RSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQQYHDKL 1338
            R +K+ +P++        + N         K      N  T FV+   K           
Sbjct: 167  RKRKRGRPRK--------DENSVTVFVEEPKKVTKEENSVTVFVEEPKKVNG-------- 210

Query: 1337 TRENNAPHMANANNTATFSP---IFKESIFPQLKRK---FTTEPELHTFFNNIGGEWASK 1176
               N   + A A  T T +    + ++    +LKR+     TEP++  F   + GEWAS+
Sbjct: 211  ---NGEVNAAVATTTTTVNETVGLDEDPFEVELKRRTQGLETEPQVVEFLETLNGEWASQ 267

Query: 1175 LKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVFLST 996
             KKR+IV A +    LP GWK+++   ++ GR    CR+Y+SP G QF S KEAS +L +
Sbjct: 268  RKKRRIVPASELGDLLPAGWKIVIITMRRAGRASAVCRRYVSPDGHQFESCKEASAYLLS 327

Query: 995  NGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGVDMPNASHQNQLKAICSPDA 816
                  G +D S +   +++          + +  ++     +P    +    A   P A
Sbjct: 328  ----VFGVQDRSHLKSSYSDGAQQLSSSMNRASESSVG---HVPTGDMKTDASASYLPSA 380

Query: 815  G---KKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNAT 711
            G     S EK   ++ +    N +S +  + CK  +AT
Sbjct: 381  GAPIHSSHEKQPPISSSIGSENFNS-DLALGCKLGDAT 417


>ref|XP_006364865.1| PREDICTED: uncharacterized protein LOC102582612 isoform X2 [Solanum
            tuberosum]
          Length = 1193

 Score =  128 bits (322), Expect = 9e-27
 Identities = 135/490 (27%), Positives = 208/490 (42%), Gaps = 18/490 (3%)
 Frame = -3

Query: 2054 LDFSTIPVVDLHDFSQDEINVASQCSDVF----RFDDIIVPKIDYSVFQESSASRKQTYS 1887
            L   +IP VDL   SQ E+   S CS       R DD+I+PKID SVF ES+ SRKQTYS
Sbjct: 18   LQAESIPTVDLRLLSQSELYSLSLCSTAAFNPCRDDDVIIPKIDRSVFNESAGSRKQTYS 77

Query: 1886 KLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTMHPSGNGS 1707
            +LRL+             A   S S  RS                     N+ HP  N S
Sbjct: 78   RLRLAPAA----------AASASSSAIRSRTPHLR---------------NSPHPLQNPS 112

Query: 1706 LFMSDGALDANSSEINL---QAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAA 1536
               ++G  ++ SS+I +   Q      +  P  L      +++ + + S   V  L  A 
Sbjct: 113  --PNNGPANSESSQIVILLKQLFGSGTQKNPTDLVPIRVDYSDSLSVPSHVPVPGLELAN 170

Query: 1535 SNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQ 1356
              S + + +K+ +P++          N+       +K +E+        V  +   +N+ 
Sbjct: 171  VGS-VGQKRKRGRPRK----------NENGVRVAEVKVDEV--------VKDIVVYQNVD 211

Query: 1355 QYHDKLTRENNAPHMANANNTATFSPIFKESIFPQLKRK---FTTEPELHTFFNNIGGEW 1185
                ++  ++  P +  A   A   P   E     L+R+     +  EL  F   + G+W
Sbjct: 212  DSDKEIMNKDGIP-VDLAVLGALVDPFGLE-----LRRRTEGLGSAEELLGFLGRLNGQW 265

Query: 1184 ASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVF 1005
             S  KKR+IV+A++F S LP+ WKLLL +++K GR  + CR+YISP G QF + KE S +
Sbjct: 266  GSTRKKRRIVDADEFGSVLPKSWKLLLSIKRKEGRSWLHCRRYISPNGRQFGTCKEVSSY 325

Query: 1004 L-----STNGALPVGRKDVSQVNLDHANSRTHFDPIQ---KKETHGTLNDGVDMPNASHQ 849
            L          LP        V + +A + T    IQ   KKE+    N     P   H 
Sbjct: 326  LLFLHGERKENLPAYANGSGTVEITNACALTSDLRIQDGGKKESSVFHNSS---PAVGH- 381

Query: 848  NQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIH 669
             +L+ + +        E S++  G             ++C KCN T+ N+   + H    
Sbjct: 382  GELQVLVN------FGELSEVQVGDL-----------LHCDKCNVTFNNKDDLLQHQLFS 424

Query: 668  HVKRKKNAEG 639
            H +R+    G
Sbjct: 425  HQRRRSRNGG 434


>ref|XP_002513931.1| hypothetical protein RCOM_1035820 [Ricinus communis]
            gi|223547017|gb|EEF48514.1| hypothetical protein
            RCOM_1035820 [Ricinus communis]
          Length = 1337

 Score =  122 bits (307), Expect = 5e-25
 Identities = 136/506 (26%), Positives = 209/506 (41%), Gaps = 38/506 (7%)
 Frame = -3

Query: 2054 LDFSTIPVVDLHDFSQDEINVASQCSDVFRFD------DIIVPKIDYSVFQESSASRKQT 1893
            L   ++P++DL   SQ E+   S CS  F  +      D+   KID SVF ES+ SRKQT
Sbjct: 24   LQMESLPLIDLRLLSQSELLSLSLCSFSFLNNPLQNEADVATLKIDRSVFNESAGSRKQT 83

Query: 1892 YSKLRLSRKQEGAETLPGYKAGHFSLSKCRSM-----VDESGKQEAQRILQFIQERLNTM 1728
            +S+LRL+R+             HFS    R+      V+ S  +E  +I+  I+      
Sbjct: 84   FSRLRLARRNNNNS--------HFSTPSIRNQIPHQTVEISQDEENSQIIYLIK------ 129

Query: 1727 HPSGNGSLFMSDGALDANSSEINLQAITRVDENCPQPLAV---YSALHNEDVRLVSSEQV 1557
                  SLF S+   +  ++E++   +   D     P+     + AL +  V   S E  
Sbjct: 130  ------SLFGSNFENEKENNEVDNVNLFSDDNLISVPITYNESFQALQDLAVADYSDETK 183

Query: 1556 TDLVTAASNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTV 1377
              + TA ++S     K+K + +    L  F+ +N+   +    + EE      T   D+ 
Sbjct: 184  QAIATAITHSESTAEKRK-RGRPRKNLSDFVGNNNVDGNDNGNEKEEKEETAIT---DSK 239

Query: 1376 DKAENIQQYHDKLTRENNAPHMANANNTATF-SPIFKES----------------IFPQL 1248
             K    Q+    L   NN    AN    A   +P  +E                    +L
Sbjct: 240  RKRGRPQKDASTLGCHNNNNVNANEEKRAVCENPRTQEEEKRGMKVELGSSEEDPYAEEL 299

Query: 1247 KRK---FTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRY 1077
            +R+     TE EL  F   + GEW SK KKRKIV+A      LP+ WKL+L  +++ G +
Sbjct: 300  RRRTMGMQTESELLGFLEGLQGEWMSKRKKRKIVDASVLGDVLPRNWKLILCNKRRAGFF 359

Query: 1076 IIECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKET 897
             ++C  YISP G QF S KE S     +  L    + VSQ +  H +S          + 
Sbjct: 360  WLDCTGYISPNGQQFMSCKEVS-----SNLLSKELQGVSQSSFGHDDSNI--------QL 406

Query: 896  HGTLNDG--VDMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNA--PMNC 729
             GT++ G   D+   +++N    I SP        + +  A T     P  +      NC
Sbjct: 407  TGTVSYGNAADLTLKNNKNGGGFISSPALPVTKSVEHEKQATTLAAVVPPHVQTVEKYNC 466

Query: 728  KKCNATYPNRSSFMGHLTIHHVKRKK 651
             KC   +      + HL   H +  K
Sbjct: 467  HKCTMAFQEPDDLLQHLLSSHQRAPK 492


>ref|XP_006853188.1| hypothetical protein AMTR_s00038p00204530 [Amborella trichopoda]
            gi|548856827|gb|ERN14655.1| hypothetical protein
            AMTR_s00038p00204530 [Amborella trichopoda]
          Length = 826

 Score =  122 bits (306), Expect = 6e-25
 Identities = 119/443 (26%), Positives = 200/443 (45%), Gaps = 35/443 (7%)
 Frame = -3

Query: 1526 AIDRSKKKLKPKEGARLKAFMN-----SNDAAAHQIPIKPEELRSNGTTNFVDTVDKAEN 1362
            A+ R K+++  KE AR K  M+     + D  A         + +NG+++F  T      
Sbjct: 260  AVIRQKRRVSKKEDARRKGLMSLAVLENGDRGA---------IDNNGSSDFNQTGIGC-- 308

Query: 1361 IQQYHDKLTRENNAPHMANANNTATFSPIFKESIFPQLKRK---FTTEPELHTFFNNIGG 1191
                H  +   +N   M         +   +E   P LK++      E EL  F + +GG
Sbjct: 309  ----HGNVRNGDNKEKMLQNGFVEVHALASRELFVPHLKKRTAALENELELVEFLDGLGG 364

Query: 1190 EWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEAS 1011
            EW +K KKRK+V+A DF  GLP GWK++LG+RKK G+  I+CRKYISP G +FA+ KE +
Sbjct: 365  EWVTKRKKRKMVDASDFGDGLPDGWKVILGIRKKEGKLFIDCRKYISPTGQKFATCKEVT 424

Query: 1010 VFLST---NGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGVDMPNASHQNQL 840
              L +   +G+L V  +   + N+   + RT         TH ++   V  P  + + + 
Sbjct: 425  AHLLSEPQDGSLAVSAR--IEENMSGNSMRTRI----SGATHSSMK--VPAPQ-TKEPKC 475

Query: 839  KAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIHHVK 660
                S D+GK+      I+  + +  NP  L   + C+KCN  + ++  +M HL   H +
Sbjct: 476  NGSISKDSGKQ------II--SHQVDNPIKLT--LECRKCNLNFNSKEVYMHHLLAVHQR 525

Query: 659  RKKN-------AEGIPANTNS-VTSLQVQANGTNNIPN---IMAVQAYGQNMD---NVAM 522
            + K         EG+       V  +  +  G  +  N    + V+ Y ++++   + AM
Sbjct: 526  KSKRCRLGKSLGEGVLIEDGKYVCQICHKVFGEKHRYNGHVGVHVRNYFKSLEASQDQAM 585

Query: 521  VYDKANNVNSTQIQEDGVNNGK------SALHIGNAEDMEKAPGEVVTMSHISSETV--- 369
            + DK    +S  + +  +++GK      S    GN++ M         +S  S E     
Sbjct: 586  I-DKPIAASSLDVGKPQISDGKQENSSESIEGDGNSDRMPSEDNLGALLSKSSDEPCDDL 644

Query: 368  -RLHTENMENSSTNGNIPHDANC 303
                T+N++  S   ++  D NC
Sbjct: 645  KMATTDNLKKISEKSDVDSDENC 667



 Score = 64.3 bits (155), Expect = 2e-07
 Identities = 36/68 (52%), Positives = 46/68 (67%), Gaps = 3/68 (4%)
 Frame = -3

Query: 2054 LDFSTIPVVDLHDFSQDEIN---VASQCSDVFRFDDIIVPKIDYSVFQESSASRKQTYSK 1884
            L  S+IP++DL   SQDEI+   + S  S      DI+VPKID S+F ES  SRKQTYS+
Sbjct: 14   LPISSIPLIDLRFLSQDEISSLALLSLPSSNPPLTDIVVPKIDRSIFNESQGSRKQTYSR 73

Query: 1883 LRLSRKQE 1860
            LRLS K++
Sbjct: 74   LRLSHKKQ 81


>ref|XP_006581536.1| PREDICTED: uncharacterized protein LOC102665295 [Glycine max]
          Length = 871

 Score =  121 bits (304), Expect = 1e-24
 Identities = 127/455 (27%), Positives = 206/455 (45%), Gaps = 11/455 (2%)
 Frame = -3

Query: 2042 TIPVVDLHDFSQDEINVASQCSDVFRF-----DDIIVPKIDYSVFQESSASRKQTYSKLR 1878
            ++P+VDL   SQ E+   S      R      DD ++PKID S F ES+ SRKQTYSKLR
Sbjct: 16   SLPLVDLRLLSQPELYTLSLSGATHRHRRNSDDDSVIPKIDRSNFNESAGSRKQTYSKLR 75

Query: 1877 LSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTMHPSGNGSLFM 1698
            L+++++    +P   + H  L      + E  ++E  RI+  + + L  + P  N +   
Sbjct: 76   LNKRKQNP-AVPASSSFHIPLH-----ISEPEEEENSRIIALLHQ-LFGVEPLRNNAPRN 128

Query: 1697 SDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAASNSAID 1518
            +D      + E  L  +  V+   P P++V +   N  +         D+V   S     
Sbjct: 129  ND------APERRLVPV-HVEFKQPPPISV-ALFQNVPI---------DVVPDGSQ---- 167

Query: 1517 RSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQQYHDKL 1338
            R +K+ +P++        NS      + P K  +   N  T FV+   K  N ++     
Sbjct: 168  RKRKRGRPRKDE------NSVTVFVEE-PTKVTK-EENSLTVFVEEPKKVTNEEK--SVK 217

Query: 1337 TRENNAPHMANANNTATFSPIFKESIFP-QLKRK---FTTEPELHTFFNNIGGEWASKLK 1170
               N   + A A  T   S    E +F  +LKR+     TE ++  F   + GEWAS+ K
Sbjct: 218  VNGNGEGNAAVATATVNESVGLDEDLFEVELKRRAQGLETESQVMEFLETLNGEWASQRK 277

Query: 1169 KRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVFLSTNG 990
            KR+IV A +    LP GWK+++ V ++ GR    CR+Y+SP G QF S KEAS +L +  
Sbjct: 278  KRRIVPATELGDMLPAGWKIVIIVMRRAGRASAVCRRYVSPDGHQFESCKEASAYLLSVS 337

Query: 989  ALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLN--DGVDMPNASHQNQLKAICSPDA 816
                G +D S +   + +          + +  ++      DM   ++ + L +  +P  
Sbjct: 338  ----GVQDRSHLKSSYTDGAQQLSSSMNRASESSVGHVPTGDMKTVANASYLSSAGAPI- 392

Query: 815  GKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNAT 711
               S EK  +V+ +    N  S +  + CK  +AT
Sbjct: 393  -DSSHEKQPLVSSSIGSENFIS-DLALGCKLGDAT 425


>ref|NP_173650.3| methyl-CPG-binding domain-containing protein [Arabidopsis thaliana]
            gi|75174757|sp|Q9LME6.1|MBD8_ARATH RecName:
            Full=Methyl-CpG-binding domain-containing protein 8;
            Short=AtMBD8; Short=MBD08; AltName:
            Full=Methyl-CpG-binding protein MBD8
            gi|9392683|gb|AAF87260.1|AC068562_7 Contains a Methyl-CpG
            binding domain PF|01429 and two DNA binding domains with
            preference for A/T rich regions PF|02178. ESTs
            gb|AI998776, gb|N95984 come from this gene [Arabidopsis
            thaliana] gi|26452716|dbj|BAC43440.1| unknown protein
            [Arabidopsis thaliana] gi|332192108|gb|AEE30229.1|
            methyl-CPG-binding domain-containing protein [Arabidopsis
            thaliana]
          Length = 524

 Score =  115 bits (289), Expect = 6e-23
 Identities = 100/393 (25%), Positives = 166/393 (42%), Gaps = 42/393 (10%)
 Frame = -3

Query: 2054 LDFSTIPVVDLHDFSQDEINVASQCSDVFRF-----------DDIIVPKIDYSVFQESSA 1908
            L   ++P++D    SQ E+   SQCS +              DD + PKID SVF ES+ 
Sbjct: 21   LSAESLPLIDTRLLSQSELRALSQCSSLSPSSSASLAASAGGDDDLTPKIDRSVFNESAG 80

Query: 1907 SRKQTYSKLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNT- 1731
            SRKQT+ +LRL+R  +  E  P  +             D+S ++E  ++   ++   N  
Sbjct: 81   SRKQTFLRLRLARHPQPPEEPPSPQRQR----------DDSSREEQTQVASLLRSLFNVD 130

Query: 1730 ------MHPSGNGSLFMSDGALDANS---SEINLQAI----------TRVDENCPQPLAV 1608
                      G   L  ++G +  NS      NL +I           ++     +P  +
Sbjct: 131  SNQSKEEEDEGEEELEDNEGQIHYNSYVYQRPNLDSIQNVLIQGTSGNKIKRKRGRPRKI 190

Query: 1607 YSALHNEDVRLVSSEQVT-----------DLVTAASNSAIDRSKKKLKPKEGARLKAFMN 1461
             +     +V  ++ E  T            +V+   +S I      +K K G   K    
Sbjct: 191  RNPSEENEVLDLTGEASTYVFVDKTSSNLGMVSRVGSSGISLDSNSVKRKRGRPPK---- 246

Query: 1460 SNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQQYHDKLTRENNAPHMANANNTATFS 1281
                  ++  I   E R +   N +   DK E +      +  EN    + + +  A+ S
Sbjct: 247  ------NKEEIMNLEKRDSAIVN-ISAFDKEELV------VNLENREGTIVDLSALASVS 293

Query: 1280 PIFKESIFPQLKRKFTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLG 1101
                E    ++     T+ E+  F   + GEW +  KK+K+VNA D+   LP+GW+L+L 
Sbjct: 294  EDPYEEELRRITVGLKTKEEILGFLEQLNGEWVNIGKKKKVVNACDYGGYLPRGWRLMLY 353

Query: 1100 VRKKNGRYIIECRKYISPAGPQFASWKEASVFL 1002
            +++K    ++ CR+YISP G QF + KE S +L
Sbjct: 354  IKRKGSNLLLACRRYISPDGQQFETCKEVSTYL 386


>gb|ESW08251.1| hypothetical protein PHAVU_009G031600g [Phaseolus vulgaris]
          Length = 841

 Score =  114 bits (286), Expect = 1e-22
 Identities = 109/369 (29%), Positives = 168/369 (45%), Gaps = 8/369 (2%)
 Frame = -3

Query: 2084 EAEVKEAEVLLDFSTIPVVDLHDFSQDEINVASQCSDVFRF-----DDIIVPKIDYSVFQ 1920
            EAEV+ +   +D  ++P+VDL   SQ E+   S      R      +D +VPKID S F 
Sbjct: 5    EAEVEPSSDHID--SLPLVDLRLLSQPELYTLSLSGATHRHRRANDNDSVVPKIDRSNFN 62

Query: 1919 ESSASRKQTYSKLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQER 1740
            ES+ SRKQTYSKLRL+++++    +P   + H         + E   QE  +I+  +Q +
Sbjct: 63   ESAGSRKQTYSKLRLNKRKQNF-AVPASSSFH---------IPEPVDQENSQIISLLQ-Q 111

Query: 1739 LNTMHPSGNGSLFMSDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVRLVSSEQ 1560
            L  + P  N        AL  +  +     +  V     QP  V           V+ + 
Sbjct: 112  LFGVEPLRN--------ALRPDCGDAANHQLFPVHVEFKQPPPV----------TVTFQT 153

Query: 1559 VTDLVTAASNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDT 1380
            V   V  ASN    R +K+ +P++   L +         ++           G +     
Sbjct: 154  VPIDVIDASN----RKRKRGRPRKNENLVSVFEEETKKVNE-----------GRSAVATV 198

Query: 1379 VDKAENIQQYHDKLTRENNAPHMANANNTATFSPIFKESIFPQLKRK---FTTEPELHTF 1209
            +++   +    D L   +N P              F E    +LKR+     TEP+L  F
Sbjct: 199  IERGFGVDA--DGL---DNDP--------------FGE----ELKRRTAGLETEPQLLEF 235

Query: 1208 FNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFA 1029
               + GEWAS+ KKR+IV A D  + LP GWK+++ + ++ GR  + CR+Y+SP G QF 
Sbjct: 236  LETLNGEWASQRKKRRIVQASDLGTVLPAGWKIVITLLRRAGRASVVCRRYVSPGGHQFE 295

Query: 1028 SWKEASVFL 1002
            S KEAS +L
Sbjct: 296  SCKEASAYL 304


>ref|XP_006433971.1| hypothetical protein CICLE_v10000205mg [Citrus clementina]
            gi|557536093|gb|ESR47211.1| hypothetical protein
            CICLE_v10000205mg [Citrus clementina]
          Length = 919

 Score =  114 bits (286), Expect = 1e-22
 Identities = 172/729 (23%), Positives = 297/729 (40%), Gaps = 77/729 (10%)
 Frame = -3

Query: 2054 LDFSTIPVVDLHDFSQDEINVASQCSDVFRF---------DDIIVPKIDYSVFQESSASR 1902
            L + ++P++DL   +Q E+   S CS              D++  PKID SVF ES+ SR
Sbjct: 12   LHYDSLPLIDLRLLAQSELLSLSLCSSRVSTTTSSQNEDEDEVSTPKIDRSVFNESAGSR 71

Query: 1901 KQTYSKLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTMHP 1722
            KQT+S+LRL+ +   +  +P      ++ ++  ++ DE   Q    I+  ++   N    
Sbjct: 72   KQTFSRLRLAPRN--SPQIPPQIP--YTAARAETL-DEDNPQ----IVGLLESLFNIQ-- 120

Query: 1721 SGNGSLFMSDGAL-------DANSSEINLQAITRVDENC---PQPLAVYSALHNEDVR-- 1578
            S + S  ++D  L        A  +++N+     VDEN    P  +  YSA   +  R  
Sbjct: 121  SHSSSTIVNDQQLVPVQVEYKAYLNDVNVNV--NVDENLHDVPISVVTYSARKRKRGRPR 178

Query: 1577 -----------LVSSEQVTDLVTAASNSAIDR---------SKKKLKPKEGA------RL 1476
                        + SE   ++V+ +S +  D           +K+ +P++        ++
Sbjct: 179  KDEMTSSDNWWFIESENKVNVVSKSSLNITDNVNVVPCKIGKRKRGRPRKSENRNNNFKV 238

Query: 1475 KAFMNSNDAAAHQIPIKPEELRS-NGTTNFVDTVDKAENIQQYHDKLTRENNAPHMANAN 1299
             A   S      + P +P +    NG  +    +  +E+ +   ++   EN      N  
Sbjct: 239  NAVSESAPNVGKRGPGRPRKGEGKNGDKSVKKEIVVSESKEDLVNEALMENGDGIAVNLV 298

Query: 1298 NTATFSPIFKESIFPQLKRKFTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQG 1119
              A     F E +  +       E EL  F   + G W S  KKRKIV+A +F   LP+G
Sbjct: 299  ALANTEDPFGEELRRRTGGSEKRE-ELLGFLTGLKGVWVSYRKKRKIVDASEFGDVLPRG 357

Query: 1118 WKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHA 939
            WKL+L ++KK G   + CR+YISP G QF S KE S +L +      G K  SQ +  H 
Sbjct: 358  WKLMLCIKKKVGHMWLGCRRYISPNGRQFVSCKEVSSYLLSLS----GHKVASQPSAAHT 413

Query: 938  -------NSRTH---FDPIQKKETHGT-----LNDGVDMPNASHQNQ--LKAICSP--DA 816
                   N  T     DPI K + +G      L       +  H+ Q  L  I SP  D 
Sbjct: 414  GDCIQLDNKMTFGNAVDPILKDDKNGADLVFHLPFPASSVSTGHEKQATLPKIMSPGEDK 473

Query: 815  GKKSQEKSDIVAG-TKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIHHVKRKKNAEG 639
            G+++  K   V+  T  +    +    +   K + ++  ++    H    H       + 
Sbjct: 474  GQENCNKKYSVSNITDEKVEKMNAATEVTAAKLDVSFGAKAVMCNHQNNKHFGSCSERD- 532

Query: 638  IPANTNSVTSLQVQANGTNNIPNIMAVQAYGQNMDNVAMVYDKANNVNSTQIQED-GVNN 462
            +P NT S ++     +G + +   + + + G        VY  +      +I +D G  +
Sbjct: 533  VPKNTISSSN---NMSGQDQVFQPLILDSSGNG------VYFSSVEKQKQEIGDDSGFVS 583

Query: 461  GKSALHIGNAEDMEKAPGEVVTMSHISSETVRLHTENME-NSSTNGNIPHDANCSSMTDI 285
              +   I + +++EK       +   S E +++  +  E N +  G++     CS + D 
Sbjct: 584  PNAKDEISSCQNLEKG------LFTSSMEHMKVDVDKCERNEAIAGSV---YGCSRLVDT 634

Query: 284  ------KSPSNSCSKSFD-EKYQCTVDIGVPESGDEQKSEKPNLFNFTSKNISADNNEQA 126
                  +     CS      + +C     V +SG  + SE   L  F S+ I   +N   
Sbjct: 635  MTYEKGRGSFEGCSVVLSGSELKCGSMNAVNKSGRPEDSEDGLLNLFGSEKIFGFDNNLT 694

Query: 125  SISNPFLEL 99
             +S   +E+
Sbjct: 695  KVSVDKMEV 703


>gb|EOY15980.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508724084|gb|EOY15981.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1203

 Score =  112 bits (281), Expect = 5e-22
 Identities = 138/525 (26%), Positives = 207/525 (39%), Gaps = 53/525 (10%)
 Frame = -3

Query: 2054 LDFSTIPVVDLHDFSQDEINVASQCSDV----FRFDDIIVPKIDYSVFQESSASRKQTYS 1887
            L   +IPVVDL   SQ E+   S CS          ++  PKID SVF ES+ SRKQT+S
Sbjct: 14   LHLESIPVVDLRLISQPELLSLSLCSSSPSPSNADTELFTPKIDRSVFNESAGSRKQTFS 73

Query: 1886 KLRLSRKQEGA----ETLPGYKAGHFSLSKCRSMVDESG-KQEAQRILQFIQERLNTMHP 1722
            +LRL+  +        + P  K    SLS+  + V+     +E+  IL  ++        
Sbjct: 74   RLRLAAPRNHLPHPHHSSPSSKP-FTSLSQRLNPVNPGPLDEESSNILSLLK-------- 124

Query: 1721 SGNGSLFMSDGALDANSSEINLQAITRVDENCPQPLAVY---------SALHNEDVRLVS 1569
                SLF  D +L +N++E         D+    P+ +          S L N  V +VS
Sbjct: 125  ----SLFNIDDSLTSNTNEDEPD-----DDKDLVPVQIEYENGKDNGNSVLQNIPVGIVS 175

Query: 1568 ------------SEQVTDLVTAASNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIK 1425
                         +Q  +L+  + N  I+  +      E A       S +A    I   
Sbjct: 176  CSGSKRKRGRPRKDQKDNLLIESENLVIEEHQ------ETAAFDRVSESVNAGG--ISSC 227

Query: 1424 PEELRSNGTTNFVDTVDKAENIQQYHDKLTRENNAPHMANANNTATFSPIFKESIFPQLK 1245
             E  R  G        ++++N     ++   E+    +A  N  A         I  +L+
Sbjct: 228  SERKRKRGRPR----KEESQNRVIVSEEKKVESEIERVALGNVEAILG------IEEELR 277

Query: 1244 RK---FTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYI 1074
            R+     TE EL  F   + GEWASK +K++IV+A  F + LPQGWKL+L V+K+ G   
Sbjct: 278  RRTEAIGTEAELLEFMGGLEGEWASKSQKKRIVDAAGFGNVLPQGWKLMLFVKKRAGHVW 337

Query: 1073 IECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANS-----RTHFDPIQ 909
            + C +YISP G QF S KE S  L + G L    +  S +      S       +F  I 
Sbjct: 338  LACSRYISPNGQQFVSCKEVSSCLLSAGELKDSSQSTSSLTGRGIGSGVKPTSENFPIIC 397

Query: 908  KKETHGTLNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIVA---------------GT 774
                H      + M +     + + I          ++ D +                GT
Sbjct: 398  TSSEHERQAPLLRMGSPWEVQRAETIKCHKCTMTFNQQDDFICHLLSSHQGTVKSSGHGT 457

Query: 773  KRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIHHVKRKKNAEG 639
                     N    C+ C   +  RS +  HL +H     K  EG
Sbjct: 458  STNEEVIIKNGKYECQFCYELFEERSCYSSHLGVHMKNNTKKVEG 502


>ref|XP_006472591.1| PREDICTED: uncharacterized protein LOC102628030 [Citrus sinensis]
          Length = 917

 Score =  108 bits (271), Expect = 7e-21
 Identities = 174/731 (23%), Positives = 296/731 (40%), Gaps = 79/731 (10%)
 Frame = -3

Query: 2054 LDFSTIPVVDLHDFSQDEINVASQCSDVFRF---------DDIIVPKIDYSVFQESSASR 1902
            L + ++P++DL   +Q E+   S CS              D++  PKID SVF ES+ SR
Sbjct: 12   LHYDSLPLIDLRLLAQSELLSLSLCSSRVSTTTSSQNEDEDEVSTPKIDRSVFNESAGSR 71

Query: 1901 KQTYSKLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTMHP 1722
            KQT+S+LRL+ +   +  +P      ++ ++  ++ DE   Q    I+  ++   N    
Sbjct: 72   KQTFSRLRLAPRN--SPQIPPQIP--YTAARAETL-DEDNPQ----IVGLLESLFNIQ-- 120

Query: 1721 SGNGSLFMSDGAL-------DANSSEINLQAITRVDENC---PQPLAVYSALHNEDVR-- 1578
            S + S  ++D  L        A  +++N+     VDE+    P  +  YSA   +  R  
Sbjct: 121  SHSSSTIVNDQQLVPVQVEYKAYLNDVNVN----VDEDLHDVPISVVTYSARKRKRGRPR 176

Query: 1577 -----------LVSSEQVTDLVTAASNSAIDRSK----KKLKPKEGARLKAFMNSNDAAA 1443
                        + SE   ++V+ +S +  D       K  K K G   K+   +N+   
Sbjct: 177  KDEMTSSDNWWFIESENKVNVVSKSSLNITDNVNVVPCKTGKRKRGRPRKSENGNNNFKV 236

Query: 1442 HQI-----------PIKPEELRS-NGTTNFVDTVDKAENIQQYHDKLTRENNAPHMANAN 1299
            + +           P +P +    NG  +    +  +E+ +   ++   E+      N  
Sbjct: 237  NAVSESAPNVGKRGPGRPRKGEGKNGDKSVKKEIVVSESKEDLVNEALMEDRDGIAVNLV 296

Query: 1298 NTATFSPIFKESIFPQLKRKFTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQG 1119
              A     F E +  +       E EL  F   + G W S  KKRKIV+A +F   LP+G
Sbjct: 297  ALANTEDPFGEELRRRTGGSEKRE-ELLGFLTGLKGVWVSYRKKRKIVDASEFGDVLPRG 355

Query: 1118 WKLLLGVRKKNGRYIIECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHA 939
            WKL+L ++KK G   + CR+YISP G QF S KE S +L +      G K  SQ +  H 
Sbjct: 356  WKLMLCIKKKVGHMWLGCRRYISPNGRQFVSCKEVSSYLLSLS----GHKVASQPSAAHT 411

Query: 938  -------NSRTH---FDPIQKKETHGT-----LNDGVDMPNASHQNQ--LKAICSP--DA 816
                   N  T     DPI K + +G      L       +  H+ Q  L  I SP  D 
Sbjct: 412  GDCIQLDNKMTFGNAVDPILKDDKNGADLVFHLPFPASSVSTGHEKQATLPKIMSPGEDE 471

Query: 815  GKKSQEKSDIVAG-TKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLTIHHVKRKKNAEG 639
            G+++  K   V+  T  +    +    +   K + ++  ++    H    H       + 
Sbjct: 472  GQENCNKKYSVSNITDEKVEKMNAATEVTAAKLDVSFCAKAVMCNHQNNKHFGSCSERD- 530

Query: 638  IPANTNSVTSLQVQANGTNNI--PNIMAVQAYGQNMDNVAMVYDKANNVNSTQIQED-GV 468
            +P NT S ++     +G + +  P I+     G        VY  +      +I +D G 
Sbjct: 531  VPKNTISSSN---NMSGQDQVFQPQILDSSGNG--------VYFSSVEKQKQEIGDDSGF 579

Query: 467  NNGKSALHIGNAEDMEKAPGEVVTMSHISSETVRLHTENME-NSSTNGNIPHDANCSSMT 291
             +  +   I + +++EK       +   S E +++  +  E N +  G++     CS + 
Sbjct: 580  VSPNAKDEISSCQNLEKG------LFTSSMEHMKVDVDKCERNEAIAGSV---YGCSRLV 630

Query: 290  DI------KSPSNSCSKSFD-EKYQCTVDIGVPESGDEQKSEKPNLFNFTSKNISADNNE 132
            D       +     CS      + +C     V +SG  + SE   L  F S+ I   +N 
Sbjct: 631  DTMTYEKGRGSFEGCSVVLSGSELKCGSMNAVNKSGRPEDSEDGLLNLFGSEKIFGFDNN 690

Query: 131  QASISNPFLEL 99
               +S   +E+
Sbjct: 691  LTKVSVDKMEV 701


>ref|XP_006416210.1| hypothetical protein EUTSA_v10007200mg [Eutrema salsugineum]
            gi|557093981|gb|ESQ34563.1| hypothetical protein
            EUTSA_v10007200mg [Eutrema salsugineum]
          Length = 575

 Score =  105 bits (263), Expect = 6e-20
 Identities = 108/428 (25%), Positives = 185/428 (43%), Gaps = 31/428 (7%)
 Frame = -3

Query: 2054 LDFSTIPVVDLHDFSQDEINVASQCSDVFR-------FDDIIVPKIDYSVFQESSASRKQ 1896
            L   ++P++D    SQ E+   S  S            DD + PKID SVF ES+ SRKQ
Sbjct: 106  LSAESLPLIDTRLLSQSELRALSPSSSSSASLAASAGVDDDLTPKIDRSVFNESAGSRKQ 165

Query: 1895 TYSKLRLSRKQEGAETLPGYKAGHFSLSKCRSMVDESGKQEAQRILQFIQERLNTMHPSG 1716
            T+ ++RL+R           +             D+S ++E  ++   ++          
Sbjct: 166  TFLRVRLARDPPPPRPPSPQRRR-----------DDSSREEKSQVASLLR---------- 204

Query: 1715 NGSLFMSDGALDANSSEINLQAITRVDENCPQPLAVYSALHNEDVR----LVSSEQVTDL 1548
              SLF  D +   N+ E   +    V+E   QPL      +N +V       S + V  +
Sbjct: 205  --SLFSVD-SFQRNAEED--EGEEEVEEKEGQPLISLPIHNNGNVYRNPYFDSVKNVQGI 259

Query: 1547 VTAASNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSN-GTTNFVDTVD- 1374
                +     R +K   P +G  L ++    D +  +  +  ++ RSN GT +  D    
Sbjct: 260  SENETRRRPGRPRKIRNPSDGV-LDSYA---DESEREGTLSVDKTRSNLGTESGYDASGI 315

Query: 1373 ---------KAENIQQYHDKLTRENNAPHMANANNTATFSPIF-----KESIFPQLKRKF 1236
                     K    ++  D    E+    ++  N   T   +      +E  + +  R+ 
Sbjct: 316  SMDSNPGKRKRGRPRKSGDGCKSEDKEEIVSLENREGTMVDLSALANNEEDPYGEELRRI 375

Query: 1235 T----TEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIE 1068
            T    T+ EL  F   + GEW +  KK+K+V A D+   LP+GWKL+L ++KK     + 
Sbjct: 376  TVGLGTKEELLAFLEQVNGEWVNAGKKKKVVKACDYGGYLPRGWKLMLCIKKKGSIQWLA 435

Query: 1067 CRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGT 888
            CR+YISP G +FA+ KE S +L +     V  +  +++N   +++ T  +P+   E+   
Sbjct: 436  CRRYISPDGQEFATCKEVSTYLQS----LVESQSKNRLNSFQSDNHTLGEPVMGNESLVG 491

Query: 887  LNDGVDMP 864
             +D +D+P
Sbjct: 492  NSDSMDLP 499


>ref|XP_004292482.1| PREDICTED: uncharacterized protein LOC101298198 [Fragaria vesca
            subsp. vesca]
          Length = 821

 Score =  101 bits (251), Expect = 1e-18
 Identities = 94/363 (25%), Positives = 157/363 (43%), Gaps = 15/363 (4%)
 Frame = -3

Query: 1709 SLFMSDGALDANSSEINLQAITR--VDENCPQPLAVYSALHNEDVRLVSSEQVTDLVTAA 1536
            SL  S+GA+D     + +  I R   +E+       YS +      L+S+ +V+     A
Sbjct: 38   SLTRSNGAID----HLVVPKIDRSQFNESAGSRRQTYSRVRRRVAGLLSNPKVS-----A 88

Query: 1535 SNSAIDRSKKKLKPKEGARLKAFMNSNDAAAHQIPIKPEELRSNGTTNFVDTVDKAENIQ 1356
              +  D  ++         LK F+ S D    QI ++P  +    + + +  +++ +  +
Sbjct: 89   PPAQPDDPERNENQAIIGHLKRFI-SQDPKFDQIDLEPSPMTMKASLSGMAELERRKRKR 147

Query: 1355 QYHDKLTRENNAPHM-ANANNTATFSPIFKESIFP---QLKRK---FTTEPELHTFFNNI 1197
                K    +    +  N N  A      + S  P   +L+R+     TE EL  F  ++
Sbjct: 148  GRKPKAKGSSGGEGLIVNKNGAAVDIWALQNSENPFGDELRRRTLGLETEEELLGFMRDL 207

Query: 1196 GGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYISPAGPQFASWKE 1017
            GG+W S+ KKRKIV+A +F   LP GWKLLLG+++K  R  I CR+YISP G QF S KE
Sbjct: 208  GGQWGSRRKKRKIVDATEFGDALPLGWKLLLGLKRKERRAWIYCRRYISPTGQQFLSCKE 267

Query: 1016 ASVFL----STNGALPVGRKDVSQVNLDH--ANSRTHFDPIQKKETHGTLNDGVDMPNAS 855
             + FL    S N A          +  D   A    H D   +K    + N G+   + S
Sbjct: 268  VASFLESFFSLNNADRHDGDGGENIQEDRIVATENQHADKDGEKRQDVSFNSGILGSSIS 327

Query: 854  HQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSSFMGHLT 675
            ++            + ++ +  +            ++    C KC+ T+ ++ S++ HL 
Sbjct: 328  NE------------QSNEPEKKVSISEMENLAEVQIHNLFECHKCSMTFADKDSYLQHLL 375

Query: 674  IHH 666
              H
Sbjct: 376  SFH 378


>ref|XP_006661339.1| PREDICTED: dentin sialophosphoprotein-like, partial [Oryza
            brachyantha]
          Length = 1042

 Score = 99.0 bits (245), Expect = 7e-18
 Identities = 101/406 (24%), Positives = 168/406 (41%), Gaps = 42/406 (10%)
 Frame = -3

Query: 1229 EPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYIS 1050
            E EL  F N + G+W S+ ++RK V+A  F   LP+GWKLLLG+++K     I CR+Y+S
Sbjct: 136  ESELLGFMNGLEGQWGSRRRRRKFVDASMFGDHLPRGWKLLLGLKRKERVAWINCRRYVS 195

Query: 1049 PAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGVD 870
            P+G QFAS KE S +L +     +G  +     + ++N+  H       E H   + G  
Sbjct: 196  PSGQQFASCKEISSYLIS----LLGYVEAKPTAIQNSNAGVH-------ELHTVNSVGHC 244

Query: 869  MPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMN---CKKCNATYPNR 699
             PN++ +       +P     S   S      +R+++ +      N   C+KCN  + ++
Sbjct: 245  QPNSTEEKH----SAPPV--TSVPVSSHYGDPQRQHDKNETQVETNGKECQKCNLIFQDQ 298

Query: 698  SSFMGH-LTIHHVK---RKKNAEG-IPANTN-SVTSLQVQANGTNNI----PNIMAVQAY 549
            S+++ H L+ H  K   RK N  G +  N N +  + ++Q    + +     N+ A +  
Sbjct: 299  SAYVQHQLSFHQRKAKRRKVNKSGEVGVNKNGTFVTQELQQTSEDKLGHIDHNVAASRNQ 358

Query: 548  GQNMDNV-------------AMVYDKANNVNSTQIQEDGVNNGKSALHIGNAEDMEKAPG 408
            GQ  + V             +M  +      +    E G  +    L  G+  D      
Sbjct: 359  GQTPEKVSDETISGELGGQPSMAPEPVGFRETDGETEQGKESSAGELLSGHCNDSLHNMA 418

Query: 407  EVVTMSHISS-ETVRLHTENMENSST----------NGNIPHDANCSSMTDIKSPSN--- 270
            +V      S+ E V  H EN+ ++            N   PH    +S     SP+N   
Sbjct: 419  DVAEQEKRSAREPVTGHHENLSDNCVDHKIHDGACHNAEEPHAVEAASKFSTGSPANFHE 478

Query: 269  --SCSKSFDEKYQCTVDIGVPESGDEQKSEKPNLFNFTSKNISADN 138
              S          CT +I   +       E PN  +  S++   D+
Sbjct: 479  IDSSKDIVLSSADCTQNISKTDKTCNLLEEAPNATSTQSESKCTDD 524


>ref|XP_004957094.1| PREDICTED: uncharacterized protein LOC101759536 [Setaria italica]
          Length = 1141

 Score = 98.6 bits (244), Expect = 1e-17
 Identities = 106/468 (22%), Positives = 186/468 (39%), Gaps = 82/468 (17%)
 Frame = -3

Query: 1232 TEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYI 1053
            +E EL  F N + G+W S+ ++RK V+A  F   LP+GWKLLLG+++K     I CR+Y+
Sbjct: 198  SESELLGFMNALEGQWGSRRRRRKFVDAGMFADHLPRGWKLLLGLKRKERVAWINCRRYV 257

Query: 1052 SPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGV 873
            SP G QFA+ KE S +L +    P  +   +Q+N    ++  H                +
Sbjct: 258  SPKGHQFATCKEVSTYLRSLLGYPEAKPTTTQIN----SAGVH---------------DL 298

Query: 872  DMPNASHQNQL----KAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNA-PMNCKKCNATY 708
            D+ +A HQ  +    + +  P         S    G K + + + +   P  C+KCN T+
Sbjct: 299  DINSAGHQQTISIEQRQLAVPLTSVTLFSHSGDSHGQKLQKDEAQMEVNPKECRKCNLTF 358

Query: 707  PNRSSFMGH-LTIHHVKRKKN-----------------------AEGIPANTNSVTSLQV 600
             ++ ++M H L+ H  K K+                         EG   +++ V  ++ 
Sbjct: 359  HDQGAYMQHQLSFHQRKAKRRRVSKSSELGTYVDGNYETQQKTLGEGFGNSSHGVADVRY 418

Query: 599  QANGTNNI------------PNIMAVQAYGQNMDNVAMVYDK---------ANNVN---- 495
            Q      +            P++ A     Q M  +    +K          NN +    
Sbjct: 419  QGQSPAKLFDGTFSGQLGVQPSLKAAPLGFQEMTVLPPQLEKEPFAGEPVSMNNKDPPEE 478

Query: 494  ---------STQIQEDGVNNGKSALHIGNAEDMEKAPG--EVVTMSHISSE--------- 375
                      +   E    +GK    + N  + EK P   E V+ S  ++E         
Sbjct: 479  MSGFLEQERESAAGEPISRHGKDPQEMINFPEQEKEPAAREAVSGSTSAAELEKGPSAGG 538

Query: 374  -TVRLHTENMENSSTNGNIPHDANCSS-----MTDIKSPSNSCSKSFDEKYQCTVDIGVP 213
             T   H + ++NS    +  HD  C S       D +S  ++C+ +   +  C+ D+ + 
Sbjct: 539  PTSGHHLDAVDNSD---HRTHDETCDSAVASLSVDAESKLSTCNATNFHENDCSKDLELS 595

Query: 212  ESGDEQKSEKPNLFNFTSKNIS--ADNNEQASISNPFLELLQEAAVEE 75
             +   QKS + +      K +S  AD+  ++  +N  +E       E+
Sbjct: 596  NTDHSQKSNRSDETYGVPKEVSPAADDPVESKSTNDLMECTDITQTEQ 643


>ref|XP_002525855.1| hypothetical protein RCOM_0824380 [Ricinus communis]
            gi|223534860|gb|EEF36549.1| hypothetical protein
            RCOM_0824380 [Ricinus communis]
          Length = 697

 Score = 98.2 bits (243), Expect = 1e-17
 Identities = 63/200 (31%), Positives = 96/200 (48%), Gaps = 4/200 (2%)
 Frame = -3

Query: 1253 QLKRK---FTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNG 1083
            +LKR+      E EL  FF ++GG+W S+ +KRKIV+A +F   LP GWKLLLG+++K G
Sbjct: 199  ELKRRTEGMVKEEELLGFFRDLGGQWCSRRRKRKIVDASEFGDFLPFGWKLLLGLKRKEG 258

Query: 1082 RYIIECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKK 903
            +  + CR+YISP+G QF S KE S +L +                DH+N           
Sbjct: 259  KAWVYCRRYISPSGQQFISCKEVSAYLQS-----------CLKPYDHSNGNNRQVHRVAS 307

Query: 902  ETH-GTLNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCK 726
            E H GT     D    S   +  ++   D    + E +++            +     C 
Sbjct: 308  ENHAGTSGREEDQRQPSEHEKAVSLLGID----NLELAEV-----------QIQDLFECH 352

Query: 725  KCNATYPNRSSFMGHLTIHH 666
            KCN T+ ++ +++ HL   H
Sbjct: 353  KCNMTFDDKDTYLQHLLSFH 372


>gb|EMJ00867.1| hypothetical protein PRUPE_ppa001455mg [Prunus persica]
          Length = 824

 Score = 97.8 bits (242), Expect = 2e-17
 Identities = 64/194 (32%), Positives = 95/194 (48%), Gaps = 5/194 (2%)
 Frame = -3

Query: 1232 TEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYI 1053
            TE +L  F   +GG+W S+ KKRKIV+A +F   LP GWKLLLG+++K GR  I CR++I
Sbjct: 216  TEEQLLGFMRELGGQWGSRRKKRKIVDANEFGDALPVGWKLLLGLKRKEGRAWIYCRRFI 275

Query: 1052 SPAGPQFASWKEASVFLST-----NGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGT 888
            SP G QF S KE S FL +     N   P G          H       + I   E   +
Sbjct: 276  SPTGQQFLSCKEVSSFLHSFFGFNNARQPDG----------HGGENLQEECIMTTENQHS 325

Query: 887  LNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATY 708
              DG      +  + L  + S  + ++ +E S  ++G +       ++    C KC+ T+
Sbjct: 326  DKDGGRRQYVNSSSAL--VVSTISNEREKEVS--LSGME-NLAEVQIHDLFECHKCSMTF 380

Query: 707  PNRSSFMGHLTIHH 666
              + S++ HL   H
Sbjct: 381  GEKDSYLQHLLSFH 394


>ref|NP_001123600.1| LOC100170247 [Zea mays] gi|189514249|gb|ACE07054.1| methylcytosine
            binding domain protein [Zea mays]
            gi|414589744|tpg|DAA40315.1| TPA: methylcytosine binding
            domain protein [Zea mays]
          Length = 1176

 Score = 97.1 bits (240), Expect = 3e-17
 Identities = 73/234 (31%), Positives = 111/234 (47%), Gaps = 12/234 (5%)
 Frame = -3

Query: 1232 TEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNGRYIIECRKYI 1053
            +E EL  F N + G+W S+ ++RK VNA  F   LP GWKLLLG+++K     I CR+Y+
Sbjct: 195  SESELLGFMNALEGQWGSRRRRRKFVNAGMFGDHLPCGWKLLLGLKRKERVAWINCRRYV 254

Query: 1052 SPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKKETHGTLNDGV 873
            SP G QFA+ KE S +L +       +   SQ+N    N+  H   +     H       
Sbjct: 255  SPKGHQFATCKEVSSYLLSLLGYQEAKPTASQIN----NAGVHDLHVNSVGLHQQTISIE 310

Query: 872  DMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKKCNATYPNRSS 693
            +   A   N +    S  +G   Q+K       ++   P  +NA   C+KCN T+ ++S+
Sbjct: 311  EKQIAVPVNSVALFNS--SGDSHQQK------LQKDEAPIEVNA-KECRKCNLTFHDQSA 361

Query: 692  FMGH-LTIHHVKRKK-----------NAEGIPANTNSVTSLQVQANGTNNIPNI 567
            +M H L+ H  K K+           N +G    T   TS +V  N  ++  N+
Sbjct: 362  YMQHQLSFHQRKAKRRRVSKSGELGTNIDGNYEKTQQKTSGEVSGNFGHSAANV 415


>ref|XP_002300183.1| hypothetical protein POPTR_0001s31990g [Populus trichocarpa]
            gi|222847441|gb|EEE84988.1| hypothetical protein
            POPTR_0001s31990g [Populus trichocarpa]
          Length = 837

 Score = 96.7 bits (239), Expect = 4e-17
 Identities = 62/199 (31%), Positives = 93/199 (46%), Gaps = 3/199 (1%)
 Frame = -3

Query: 1253 QLKRK---FTTEPELHTFFNNIGGEWASKLKKRKIVNAEDFVSGLPQGWKLLLGVRKKNG 1083
            +LKR+      E EL  FF  +GG+W S+ KKRKIV+A +F   LP GWKL+LG+++K G
Sbjct: 219  ELKRRTEGMEKEEELLGFFRELGGQWCSRRKKRKIVDAGEFGDFLPVGWKLILGLKRKEG 278

Query: 1082 RYIIECRKYISPAGPQFASWKEASVFLSTNGALPVGRKDVSQVNLDHANSRTHFDPIQKK 903
            R  + CR+Y+SP+G QF S K+ S +L +     VG  D  Q   DH             
Sbjct: 279  RAWVYCRRYLSPSGQQFISCKDVSAYLQS----LVGPYDAQQAK-DHTG----------- 322

Query: 902  ETHGTLNDGVDMPNASHQNQLKAICSPDAGKKSQEKSDIVAGTKRRYNPSSLNAPMNCKK 723
              H    D    P+A    +L+     D  +  + +  +            +     C K
Sbjct: 323  --HSIQQDHGGAPHAGAIERLE-----DQRQSIEHQKQVSLLETDNLAEVQIRDLFECHK 375

Query: 722  CNATYPNRSSFMGHLTIHH 666
            C  T+  + +++ HL   H
Sbjct: 376  CRMTFDEKGTYLEHLLSFH 394


Top