BLASTX nr result

ID: Anemarrhena21_contig00022711 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Anemarrhena21_contig00022711
         (1327 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_008811415.1| PREDICTED: embryonic stem cell-specific 5-hy...   444   e-122
ref|XP_010909447.1| PREDICTED: embryonic stem cell-specific 5-hy...   438   e-120
ref|XP_009416600.1| PREDICTED: embryonic stem cell-specific 5-hy...   410   e-111
ref|XP_010244070.1| PREDICTED: embryonic stem cell-specific 5-hy...   394   e-107
ref|XP_010244069.1| PREDICTED: embryonic stem cell-specific 5-hy...   389   e-105
ref|XP_004982141.1| PREDICTED: embryonic stem cell-specific 5-hy...   380   e-102
ref|XP_003622783.1| hypothetical protein MTR_7g052250 [Medicago ...   379   e-102
ref|XP_004492204.1| PREDICTED: embryonic stem cell-specific 5-hy...   379   e-102
ref|XP_003635244.1| PREDICTED: embryonic stem cell-specific 5-hy...   377   e-101
ref|XP_012084983.1| PREDICTED: embryonic stem cell-specific 5-hy...   373   e-100
ref|XP_007140735.1| hypothetical protein PHAVU_008G137400g [Phas...   369   2e-99
ref|XP_003532247.1| PREDICTED: embryonic stem cell-specific 5-hy...   366   2e-98
ref|XP_008244708.1| PREDICTED: embryonic stem cell-specific 5-hy...   364   7e-98
ref|XP_007199067.1| hypothetical protein PRUPE_ppa018685mg [Prun...   363   2e-97
ref|XP_004290141.1| PREDICTED: embryonic stem cell-specific 5-hy...   359   3e-96
ref|XP_009358987.1| PREDICTED: embryonic stem cell-specific 5-hy...   358   4e-96
ref|XP_002303080.1| hypothetical protein POPTR_0002s25190g [Popu...   355   3e-95
ref|XP_007049611.1| Uncharacterized protein TCM_002685 [Theobrom...   355   3e-95
gb|KHG01683.1| hypothetical protein F383_01440 [Gossypium arboreum]   354   7e-95
ref|XP_011024560.1| PREDICTED: embryonic stem cell-specific 5-hy...   353   2e-94

>ref|XP_008811415.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein [Phoenix
            dactylifera]
          Length = 373

 Score =  444 bits (1141), Expect = e-122
 Identities = 233/378 (61%), Positives = 266/378 (70%), Gaps = 17/378 (4%)
 Frame = -3

Query: 1220 MCGRARCTLNAHQVAQACGL--NPQHASSVPTQQIDRYRPSYNVSPGAYLPVFRVRRXXX 1047
            MCGRARCTLN  QVAQAC L      ASS+PT Q+DRYRPSYNVSPGAYLPV   R+   
Sbjct: 1    MCGRARCTLNPGQVAQACRLADGGGDASSIPTLQMDRYRPSYNVSPGAYLPVVAARKEAK 60

Query: 1046 XXXXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRMFNARSESIKEKASFRRLIPNNR 867
                        P++HCMKWGL+PSFTKKTEKPDH++MFNARSESIKEKASFRRLIP NR
Sbjct: 61   GSGEGREA----PVIHCMKWGLVPSFTKKTEKPDHFKMFNARSESIKEKASFRRLIPTNR 116

Query: 866  CLVAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILYTFTIVTTRSSS 687
            CLVAVEGFYEWKKDGS+KQPYYIHFKD++PLVFAALYDSW NSEGEIL+TFTI+TTRSS+
Sbjct: 117  CLVAVEGFYEWKKDGSRKQPYYIHFKDHRPLVFAALYDSWVNSEGEILHTFTILTTRSST 176

Query: 686  ALEWLHDRMPVIFRDIGSINLWLNDFSTKHEAILGPCEDPDLVWYPVTPAMGKPSFDGPE 507
            AL+WLHDRMPVI  + GSI++WL   + K EA+LGP ED DLVWYPVT A+GKPSFDGPE
Sbjct: 177  ALQWLHDRMPVILGNKGSIDVWLEKSTPKLEAVLGPYEDSDLVWYPVTTAVGKPSFDGPE 236

Query: 506  CIKEIQLKPAGEKPLSKFFSKKTEVKCQSEPGFVNSSK---QEHALDNVKEESGXXXXXX 336
            CIKEI+LK  GE P+SKFF+KK   K QSEP    SS    Q  A  +VK+E        
Sbjct: 237  CIKEIKLKSTGENPMSKFFAKKKVDKSQSEPEHKKSSNEFPQTDAFGSVKDEPDAEETEE 296

Query: 335  XXXXXXXXXXXEMH------------GIKRELEEMEPSSGLINENMHSKPTNPAKKKGKG 192
                                      GIKRE EEM  +S    E     P +P KK GK 
Sbjct: 297  LTKEEKNGKSDHFAPPKGETIGPDVCGIKREFEEMATNSISQTETAIVLPASPVKK-GKS 355

Query: 191  INNTGDKQSSLFSYFGRR 138
            + +TGD+Q+SL SYFG+R
Sbjct: 356  VKSTGDRQASLLSYFGKR 373


>ref|XP_010909447.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein [Elaeis
            guineensis]
          Length = 373

 Score =  438 bits (1126), Expect = e-120
 Identities = 230/378 (60%), Positives = 268/378 (70%), Gaps = 17/378 (4%)
 Frame = -3

Query: 1220 MCGRARCTLNAHQVAQACGL--NPQHASSVPTQQIDRYRPSYNVSPGAYLPVFRVRRXXX 1047
            MCGRARCTLN   VA+ACGL     +ASSV T ++DRYRPSYNVSPGAYLPV   R+   
Sbjct: 1    MCGRARCTLNPGLVAKACGLADGGGNASSVLTLEMDRYRPSYNVSPGAYLPVVAARKEGK 60

Query: 1046 XXXXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRMFNARSESIKEKASFRRLIPNNR 867
                        P++HCMKWGL+PSFTKKTEKPDH++MFNARSESIKEKASFRRLIP NR
Sbjct: 61   GSGQGREA----PVIHCMKWGLVPSFTKKTEKPDHFKMFNARSESIKEKASFRRLIPTNR 116

Query: 866  CLVAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILYTFTIVTTRSSS 687
            CLVAVEGFYEWKKDGS+K+PYYIHF D++PLVFAALYDSW NS GEIL+TFTI+TTRSS+
Sbjct: 117  CLVAVEGFYEWKKDGSRKKPYYIHFNDHRPLVFAALYDSWVNSGGEILHTFTILTTRSST 176

Query: 686  ALEWLHDRMPVIFRDIGSINLWLNDFSTKHEAILGPCEDPDLVWYPVTPAMGKPSFDGPE 507
            AL+WLHDRMPVI R+  SI++WL   + K EAILGP ED DLVWYPVT A+GKPSFDGPE
Sbjct: 177  ALQWLHDRMPVILRNKSSIDVWLEKSTPKLEAILGPYEDSDLVWYPVTTAVGKPSFDGPE 236

Query: 506  CIKEIQLKPAGEKPLSKFFSKKTEVKCQSEPGFVNSSK---QEHALDNVKEE-------- 360
            CIKEI+LK  GE P+SKFFSKK + K  SEP    SS    Q  A  +VK+E        
Sbjct: 237  CIKEIKLKSTGENPMSKFFSKKVD-KSHSEPEHKKSSNEFPQTDAFSSVKDELDDAEIGE 295

Query: 359  ----SGXXXXXXXXXXXXXXXXXEMHGIKRELEEMEPSSGLINENMHSKPTNPAKKKGKG 192
                                   ++ GIKRE EEM  +S    E     P +PA+KKGK 
Sbjct: 296  LTREEKNGKSDHFALAKSETIESDVCGIKREFEEMATNSVPQTETAVVLPASPAEKKGKS 355

Query: 191  INNTGDKQSSLFSYFGRR 138
            + +TGD+Q+SL SYFG+R
Sbjct: 356  VKSTGDRQASLLSYFGKR 373


>ref|XP_009416600.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein isoform X1 [Musa
            acuminata subsp. malaccensis]
          Length = 376

 Score =  410 bits (1053), Expect = e-111
 Identities = 221/386 (57%), Positives = 256/386 (66%), Gaps = 28/386 (7%)
 Frame = -3

Query: 1220 MCGRARCTLNAHQVAQACGLNPQHASSVPTQQIDRYRPSYNVSPGAYLPVFRVRRXXXXX 1041
            MCGR RCTLN  Q+A+ACGL   +A+S+ T +IDRYRPSYNVSPGAYLPV  V R     
Sbjct: 1    MCGRTRCTLNTDQIARACGLTV-NAASIRTHEIDRYRPSYNVSPGAYLPVVLVERATGEA 59

Query: 1040 XXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRMFNARSESIKEKASFRRLIPNNRCL 861
                      P + CMKWGL+PSFTKKTEKPDHY+MFNARSESIKEKASFRRL+P NRCL
Sbjct: 60   EES-------PAIRCMKWGLVPSFTKKTEKPDHYKMFNARSESIKEKASFRRLVPTNRCL 112

Query: 860  VAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILYTFTIVTTRSSSAL 681
            VAVEGFYEWKKDGSKKQPYYIHFKDN+PLVFAALYD+W+NSEG+ILYT TI+TT SSSAL
Sbjct: 113  VAVEGFYEWKKDGSKKQPYYIHFKDNRPLVFAALYDAWKNSEGDILYTLTILTTSSSSAL 172

Query: 680  EWLHDRMPVIFRDIGSINLWLNDFSTKHEAILGPCEDPDLVWYPVTPAMGKPSFDGPECI 501
            +WLHDRMPVI R+ GS+++WLN    + E +L   ED DLVWYPVT A+GKPSFDG ECI
Sbjct: 173  QWLHDRMPVILRNEGSVDVWLNKAIPEFETVLRSYEDADLVWYPVTTAVGKPSFDGSECI 232

Query: 500  KEIQLKPAGEKPLSKFFSKKTEVKCQSEPGFVNS----------------------SKQE 387
            KEIQL  A   P+SKFF+KKT+ K Q E     S                      S Q 
Sbjct: 233  KEIQLSSADRNPISKFFAKKTDDKDQMEVKHGKSLKESPKKEIFDIAAELSISSEESPQG 292

Query: 386  HALDNVKE------ESGXXXXXXXXXXXXXXXXXEMHGIKRELEEMEPSSGLINENMHSK 225
               D++KE       +                  E+ G KR    + P SGL +E   SK
Sbjct: 293  DHFDDLKEHLEFNTHANADESDHFSLLKNPSIEPEICGTKRGSGAIAPDSGLTSEK-GSK 351

Query: 224  PTNPAKKKGKGINNTGDKQSSLFSYF 147
            P    KKK + + NTGDKQ+SL SYF
Sbjct: 352  P----KKKARPVKNTGDKQASLLSYF 373


>ref|XP_010244070.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein isoform X2
            [Nelumbo nucifera]
          Length = 367

 Score =  394 bits (1012), Expect = e-107
 Identities = 211/376 (56%), Positives = 251/376 (66%), Gaps = 16/376 (4%)
 Frame = -3

Query: 1220 MCGRARCTLNAHQVAQACGLNPQHASSVPTQQIDRYRPSYNVSPGAYLPVFRVRRXXXXX 1041
            MCGR RCTL A  + +AC ++    +SV T  ++RYRPS+NVSPG+ LPV R        
Sbjct: 1    MCGRTRCTLRAEDIPRACCVD---GASVRTVDVNRYRPSFNVSPGSDLPVVR-------R 50

Query: 1040 XXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRMFNARSESIKEKASFRRLIPNNRCL 861
                     G ++HCMKWGL+PSFTKKTEKPDHYRMFNARSESI EKASFRRL+PNNRCL
Sbjct: 51   DEACGGESEGAVLHCMKWGLVPSFTKKTEKPDHYRMFNARSESICEKASFRRLVPNNRCL 110

Query: 860  VAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILYTFTIVTTRSSSAL 681
            V VEGFYEWKKDG KKQPYYIHFK++QPLV AALYDSW NSEGE+LYTFTI+TT SSS L
Sbjct: 111  VVVEGFYEWKKDGPKKQPYYIHFKNDQPLVIAALYDSWRNSEGEMLYTFTILTTCSSSNL 170

Query: 680  EWLHDRMPVIFRDIGSINLWLNDFSTKHEAILGPCEDPDLVWYPVTPAMGKPSFDGPECI 501
            +WLHDRMPVI  +  SI+ WL   ++K E++L P EDPDLVWYPVTPAMGKPSFDGPECI
Sbjct: 171  QWLHDRMPVILGNKESIDAWLKGSTSKSESVLKPYEDPDLVWYPVTPAMGKPSFDGPECI 230

Query: 500  KEIQLKPAGEKPLSKFFSKK---------TEVKCQSEPGFVNSSKQEHALDNVKEE---- 360
            KEIQLK   +  +SKFFSKK         ++VK  SE        +E  +    EE    
Sbjct: 231  KEIQLKSEEKSSISKFFSKKKGDDEQKLDSQVKVSSEESAQTRPLEEFEVKPPFEEKTEL 290

Query: 359  ---SGXXXXXXXXXXXXXXXXXEMHGIKRELEEMEPSSGLINENMHSKPTNPAKKKGKGI 189
               S                  E  GIKR  +E   +S     N+    T+P +KK + +
Sbjct: 291  PSDSNMDTGPKSNVSSLPKGEAEKCGIKRYYQEFAGNSMPAIGNIDKLQTSPIRKKKEIL 350

Query: 188  NNTGDKQSSLFSYFGR 141
            NNTGDKQ++LFSYFG+
Sbjct: 351  NNTGDKQATLFSYFGK 366


>ref|XP_010244069.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein isoform X1
            [Nelumbo nucifera]
          Length = 369

 Score =  389 bits (999), Expect = e-105
 Identities = 211/378 (55%), Positives = 251/378 (66%), Gaps = 18/378 (4%)
 Frame = -3

Query: 1220 MCGRARCTLNAHQVAQACGLNPQHASSVPTQQIDRYRPSYNVSPGAYLPVFRVRRXXXXX 1041
            MCGR RCTL A  + +AC ++    +SV T  ++RYRPS+NVSPG+ LPV R        
Sbjct: 1    MCGRTRCTLRAEDIPRACCVD---GASVRTVDVNRYRPSFNVSPGSDLPVVR-------R 50

Query: 1040 XXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRM--FNARSESIKEKASFRRLIPNNR 867
                     G ++HCMKWGL+PSFTKKTEKPDHYRM  FNARSESI EKASFRRL+PNNR
Sbjct: 51   DEACGGESEGAVLHCMKWGLVPSFTKKTEKPDHYRMLQFNARSESICEKASFRRLVPNNR 110

Query: 866  CLVAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILYTFTIVTTRSSS 687
            CLV VEGFYEWKKDG KKQPYYIHFK++QPLV AALYDSW NSEGE+LYTFTI+TT SSS
Sbjct: 111  CLVVVEGFYEWKKDGPKKQPYYIHFKNDQPLVIAALYDSWRNSEGEMLYTFTILTTCSSS 170

Query: 686  ALEWLHDRMPVIFRDIGSINLWLNDFSTKHEAILGPCEDPDLVWYPVTPAMGKPSFDGPE 507
             L+WLHDRMPVI  +  SI+ WL   ++K E++L P EDPDLVWYPVTPAMGKPSFDGPE
Sbjct: 171  NLQWLHDRMPVILGNKESIDAWLKGSTSKSESVLKPYEDPDLVWYPVTPAMGKPSFDGPE 230

Query: 506  CIKEIQLKPAGEKPLSKFFSKK---------TEVKCQSEPGFVNSSKQEHALDNVKEE-- 360
            CIKEIQLK   +  +SKFFSKK         ++VK  SE        +E  +    EE  
Sbjct: 231  CIKEIQLKSEEKSSISKFFSKKKGDDEQKLDSQVKVSSEESAQTRPLEEFEVKPPFEEKT 290

Query: 359  -----SGXXXXXXXXXXXXXXXXXEMHGIKRELEEMEPSSGLINENMHSKPTNPAKKKGK 195
                 S                  E  GIKR  +E   +S     N+    T+P +KK +
Sbjct: 291  ELPSDSNMDTGPKSNVSSLPKGEAEKCGIKRYYQEFAGNSMPAIGNIDKLQTSPIRKKKE 350

Query: 194  GINNTGDKQSSLFSYFGR 141
             +NNTGDKQ++LFSYFG+
Sbjct: 351  ILNNTGDKQATLFSYFGK 368


>ref|XP_004982141.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein [Setaria italica]
          Length = 416

 Score =  380 bits (976), Expect = e-102
 Identities = 210/417 (50%), Positives = 257/417 (61%), Gaps = 56/417 (13%)
 Frame = -3

Query: 1220 MCGRARCTLNAHQVAQACGLNPQHASS------------VPTQQIDRYRPSYNVSPGAYL 1077
            MCGRARCTL+A Q A+A G     A++            V T  +DR+RPSYNVSPGAYL
Sbjct: 1    MCGRARCTLSAAQAARAFGFPTTTAAAAGSGGGAGDAPAVRTLDLDRFRPSYNVSPGAYL 60

Query: 1076 PVFRVR-RXXXXXXXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRMFNARSESIKEK 900
            PV  VR +               P++ CMKWGL+PSFT KTEKPDH+RMFNARSES+KEK
Sbjct: 61   PVGTVRAQPAAGSDGGRGGDGAEPVIQCMKWGLVPSFTGKTEKPDHFRMFNARSESVKEK 120

Query: 899  ASFRRLIPNNRCLVAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILY 720
            ASFRRLIP NRCLVAVEGFYEWKKDGSKKQPYYIHF+D++PLVFAALYD+W NSEGE+++
Sbjct: 121  ASFRRLIPKNRCLVAVEGFYEWKKDGSKKQPYYIHFQDHRPLVFAALYDTWTNSEGEVIH 180

Query: 719  TFTIVTTRSSSALEWLHDRMPVIFRDIGSINLWLNDFSTKHEAILGPCEDPDLVWYPVTP 540
            TFTI+TTR+S++L+WLHDRMPVI  D  S+N+WLND S K E I  P E  DLVWYPVT 
Sbjct: 181  TFTILTTRASTSLKWLHDRMPVILGDNDSVNVWLNDASVKLEEITSPYEGADLVWYPVTS 240

Query: 539  AMGKPSFDGPECIKEIQLKPAGEKPLSKFFSKK-------------------------TE 435
            AMGK SFDGPECIKE+ + P+ EKP+SKFF+KK                         ++
Sbjct: 241  AMGKTSFDGPECIKELHMGPS-EKPISKFFTKKSTAHDQSVKPEKTTLEFAETHSSRASK 299

Query: 434  VKC----QSEPGFVNSSKQEH--ALDNVKEESGXXXXXXXXXXXXXXXXXEM-------- 297
            V+C    Q++P  VN    E       VK+E                    M        
Sbjct: 300  VECDESVQNQPEDVNQQHGEERTTSSTVKDEPVSLGPQVIGKPQSIKDEDTMTSTGITIE 359

Query: 296  ----HGIKRELEEMEPSSGLINENMHSKPTNPAKKKGKGINNTGDKQSSLFSYFGRR 138
                 GIKR++E+ E  + ++  ++ S       KKGKG     D Q+SL SYF R+
Sbjct: 360  KQDDFGIKRKIEDTEVKAEMMENSVWSCSRPTTTKKGKGAKAAPDGQASLLSYFARK 416


>ref|XP_003622783.1| hypothetical protein MTR_7g052250 [Medicago truncatula]
            gi|355497798|gb|AES79001.1| UPF0361 C3orf37-like protein
            [Medicago truncatula]
          Length = 354

 Score =  379 bits (974), Expect = e-102
 Identities = 207/370 (55%), Positives = 242/370 (65%), Gaps = 9/370 (2%)
 Frame = -3

Query: 1220 MCGRARCTLNAHQVAQACGLNPQHASSVPTQ--QIDRYRPSYNVSPGAYLPVFRVRRXXX 1047
            MCGR RC+L A  V +AC     H ++ P++   IDRYRPS NVSPG  +PV  VRR   
Sbjct: 1    MCGRTRCSLRADDVPRAC-----HRTTAPSRLLHIDRYRPSNNVSPGFNIPV--VRREDN 53

Query: 1046 XXXXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRMFNARSESIKEKASFRRLIPNNR 867
                         +VHCMKWGLIPSFTKKT+KPDHY+MFNARSESI EKASFRRL+P NR
Sbjct: 54   ASAESDGH-----VVHCMKWGLIPSFTKKTDKPDHYKMFNARSESIDEKASFRRLLPKNR 108

Query: 866  CLVAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILYTFTIVTTRSSS 687
            CLVAVEGFYEWKKDGSKKQPYYIHFKD +PLVFAALYDSW+NSEGEILYTFTIVTT SSS
Sbjct: 109  CLVAVEGFYEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTIVTTSSSS 168

Query: 686  ALEWLHDRMPVIFRDIGSINLWLNDFSTKHEAILGPCEDPDLVWYPVTPAMGKPSFDGPE 507
            A +WLHDRMPVI  D  + + WL+  S+  ++++ P E+ DLVWYPVTPAMGKPSFDGPE
Sbjct: 169  AFKWLHDRMPVILGDKDTTDTWLSSASS-FKSVMKPYEESDLVWYPVTPAMGKPSFDGPE 227

Query: 506  CIKEIQLKPAGEKPLSKFFSKKTEVKCQSEPGF-------VNSSKQEHALDNVKEESGXX 348
            CIKEIQ+K  G  P+SKFFSKK      ++P         V + + +   +  K E G  
Sbjct: 228  CIKEIQIKTEGYIPISKFFSKKEAEVEDTKPEHKILSHEPVKTEQTKDVSEEAKTEEG-- 285

Query: 347  XXXXXXXXXXXXXXXEMHGIKRELEEMEPSSGLINENMHSKPTNPAKKKGKGINNTGDKQ 168
                               IKRE + +   S     N      NPAKKK K      DKQ
Sbjct: 286  DTDLKSSGISPSQNVNRFAIKREYDAISSDSKPSLANNDQVSANPAKKKEKA-KTADDKQ 344

Query: 167  SSLFSYFGRR 138
             +LFSYFG+R
Sbjct: 345  PTLFSYFGKR 354


>ref|XP_004492204.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein [Cicer arietinum]
          Length = 375

 Score =  379 bits (973), Expect = e-102
 Identities = 212/383 (55%), Positives = 250/383 (65%), Gaps = 16/383 (4%)
 Frame = -3

Query: 1238 RRRKTKMCGRARCTLNAHQVAQACGLNPQHASSVPTQ--QIDRYRPSYNVSPGAYLPVFR 1065
            R R+ +MCGR RCTL    +  AC     H ++ PT+   +DRYRPS+NVSPG ++PV  
Sbjct: 15   RNREDEMCGRGRCTLRPDDIPTAC-----HRTTAPTRLLHVDRYRPSHNVSPGFHMPV-- 67

Query: 1064 VRRXXXXXXXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRMFNARSESIKEKASFRR 885
            VRR                ++HCMKWGLIPSFTKKTEKPDHYRMFNARSESI EKASFRR
Sbjct: 68   VRREDASESEGH-------VLHCMKWGLIPSFTKKTEKPDHYRMFNARSESIDEKASFRR 120

Query: 884  LIPNNRCLVAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILYTFTIV 705
            L+P NRCLVAVEGFYEWKKDGSKKQPYYIHFKD +PLVFAALYDSW+NSEGE LYTFTIV
Sbjct: 121  LLPKNRCLVAVEGFYEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGETLYTFTIV 180

Query: 704  TTRSSSALEWLHDRMPVIFRDIGSINLWLNDFSTKHEAILGPCEDPDLVWYPVTPAMGKP 525
            TT SSS L+WLHDRMPVI  D  S + WLN  S+  +++L P E+ DL WYPVTPAMGKP
Sbjct: 181  TTSSSSTLQWLHDRMPVILSDKDSTDTWLNSASS-FKSVLKPYEECDLAWYPVTPAMGKP 239

Query: 524  SFDGPECIKEIQLKPAGEKPLSKFFSKKTEVKCQSEPGFVNSS------KQEHAL----D 375
            SFDGPECIKEIQ+K  G  P+SKFFS+K      ++ G    S      K E       +
Sbjct: 240  SFDGPECIKEIQVKAEGNIPISKFFSRKGGEGEDTKSGHKILSLCHEPVKTEQTTKDLSE 299

Query: 374  NVKEESGXXXXXXXXXXXXXXXXXEMHGIKRELE----EMEPSSGLINENMHSKPTNPAK 207
              K E G                     +KRE +    + +PS G IN+ + + P  P K
Sbjct: 300  GAKTEEGESDLKSSGSSPQNVTKFT---VKREYDAISSDSKPSLG-INDQVIANP--PTK 353

Query: 206  KKGKGINNTGDKQSSLFSYFGRR 138
            KK K   N  DKQ +LFS+FG+R
Sbjct: 354  KKEKA-KNADDKQPTLFSFFGKR 375


>ref|XP_003635244.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein [Vitis vinifera]
            gi|296090568|emb|CBI40918.3| unnamed protein product
            [Vitis vinifera]
          Length = 392

 Score =  377 bits (967), Expect = e-101
 Identities = 218/407 (53%), Positives = 251/407 (61%), Gaps = 47/407 (11%)
 Frame = -3

Query: 1220 MCGRARCTLNAHQVAQACGLNPQHASSVPTQ--QIDRYRPSYNVSPGAYLPVFRVRRXXX 1047
            MCGRARCTL    +A+AC LN     ++PTQ  Q+DRYRPSYNVSPGA LPV R      
Sbjct: 1    MCGRARCTLRPDNIARACNLN-----TLPTQNIQMDRYRPSYNVSPGANLPVVR------ 49

Query: 1046 XXXXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRMFNARSESIKEKASFRRLIPNNR 867
                         IVHCMKWGL+PSFTKK+EKPDHY+MFNARSES+ EKASFRRL+P NR
Sbjct: 50   ---RGGGTEGEEAIVHCMKWGLVPSFTKKSEKPDHYKMFNARSESVCEKASFRRLVPKNR 106

Query: 866  CLVAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILYTFTIVTTRSSS 687
            CLVAVEGFYEWKKDGSKKQPYYIH KD +PLVFAAL+DSW NSEGEILYT TI+TT SSS
Sbjct: 107  CLVAVEGFYEWKKDGSKKQPYYIHLKDGRPLVFAALFDSWANSEGEILYTCTILTTSSSS 166

Query: 686  ALEWLHDRMPVIFRDIGSINLWLN-DFSTKHEAILGPCEDPDLVWYPVTPAMGKPSFDGP 510
            AL+WLHDRMPVI  D  S + WLN   S++   +L P EDPDLVWYPVT AMGKPSF+GP
Sbjct: 167  ALQWLHDRMPVILGDKESTDAWLNGSSSSQFNTVLKPYEDPDLVWYPVTQAMGKPSFEGP 226

Query: 509  ECIKEIQLKPAGEKPLSKFFSK---KTEVKCQSEP---GFVNSSKQEHALDN-------- 372
            ECIKEIQLK   ++P+SKFFS    K E    +EP       S K+E A++N        
Sbjct: 227  ECIKEIQLKNE-QRPISKFFSTKGIKNEQGLSNEPVKSNLPQSLKEEPAIENSTGLPSST 285

Query: 371  VKEESGXXXXXXXXXXXXXXXXXEMHGIKRELEEMEPSSGLINENMH------------- 231
            VK +                       +K+E  E E  +GL     H             
Sbjct: 286  VKGDHDSTCSRSIPQEESTWFTNLPKSLKQE-PETEDKTGLPFPGDHDSKCDEEATKLPI 344

Query: 230  ----------SKPT-------NPAKKKGKGINNTGDKQSSLFSYFGR 141
                      SKP        +P  KKGK   N GDKQ +LFSYFG+
Sbjct: 345  KRDFEEFSADSKPNTDTVEKPSPVTKKGKLNKNAGDKQPTLFSYFGK 391


>ref|XP_012084983.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein [Jatropha curcas]
            gi|643739501|gb|KDP45255.1| hypothetical protein
            JCGZ_15120 [Jatropha curcas]
          Length = 362

 Score =  373 bits (958), Expect = e-100
 Identities = 209/376 (55%), Positives = 247/376 (65%), Gaps = 15/376 (3%)
 Frame = -3

Query: 1220 MCGRARCTLNAHQVAQACGLNPQHASSVPTQQIDRYRPSYNVSPGAYLPVFRVRRXXXXX 1041
            MCGRARCTL A  +++AC  N     SV    +DRYRP YNVSPG+ LPV  V R     
Sbjct: 1    MCGRARCTLRADDISRACHCNGAPVRSV---NMDRYRPYYNVSPGSNLPV--VYRGDVSG 55

Query: 1040 XXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRMFNARSESIKEKASFRRLIPNNRCL 861
                        +HCM WGL+PSFTKKTEKPD YRMFNARSES++EKASFRRL+P NRCL
Sbjct: 56   GEGYS-------LHCMTWGLVPSFTKKTEKPDFYRMFNARSESVREKASFRRLLPKNRCL 108

Query: 860  VAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILYTFTIVTTRSSSAL 681
            VAVEGFYEWKKDGSKKQPYYIHFKD++PLVFAALYDSW+NSEGEIL TFTI+TT SSSAL
Sbjct: 109  VAVEGFYEWKKDGSKKQPYYIHFKDDRPLVFAALYDSWQNSEGEILDTFTILTTSSSSAL 168

Query: 680  EWLHDRMPVIFRDIGSINLWLN-DFSTKHEAILGPCEDPDLVWYPVTPAMGKPSFDGPEC 504
            +WLHDRMPVI  D G+I+ WLN   S+K + +L P E+PDLVWYPVTPAMGK SFDGPEC
Sbjct: 169  QWLHDRMPVILGDKGAIDTWLNGSSSSKFDIMLKPYENPDLVWYPVTPAMGKISFDGPEC 228

Query: 503  IKEIQLKPAGEKPLSKFFSKKTEVKCQSEPGFVNSSKQEHALDN----VKEE-------- 360
            IKEI LK   +  +SKFFS+K E+K + E     S+ ++ A  N    VKEE        
Sbjct: 229  IKEIHLKTEDKGTISKFFSRK-EIKSEQESNLQGSTCEKSADVNTPKRVKEEDVIADKLD 287

Query: 359  --SGXXXXXXXXXXXXXXXXXEMHGIKRELEEMEPSSGLINENMHSKPTNPAKKKGKGIN 186
              S                    +  KR+ EE    S L  +     P +PA+KK   + 
Sbjct: 288  IPSLVNNDSRSSVCTITKEDGTKYKTKRDYEETLNDSKLGLDKDEKPPQSPARKK-VNLK 346

Query: 185  NTGDKQSSLFSYFGRR 138
              GDKQ +LFSYF ++
Sbjct: 347  IDGDKQPTLFSYFSKK 362


>ref|XP_007140735.1| hypothetical protein PHAVU_008G137400g [Phaseolus vulgaris]
            gi|561013868|gb|ESW12729.1| hypothetical protein
            PHAVU_008G137400g [Phaseolus vulgaris]
          Length = 353

 Score =  369 bits (948), Expect = 2e-99
 Identities = 199/370 (53%), Positives = 235/370 (63%), Gaps = 9/370 (2%)
 Frame = -3

Query: 1220 MCGRARCTLNAHQVAQACGLNPQHASSVPTQQI--DRYRPSYNVSPGAYLPVFRVRRXXX 1047
            MCGR RCTL +  V +AC     H S  PT+ +  DRYRP+YNVSPG+ +PV R      
Sbjct: 1    MCGRTRCTLRSDDVPRAC-----HRSDAPTRTLHMDRYRPAYNVSPGSNMPVVRREEASD 55

Query: 1046 XXXXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRMFNARSESIKEKASFRRLIPNNR 867
                         ++H MKWGLIPSFTKKTEKPDHY+MFNARSESI EKASFRRL+P +R
Sbjct: 56   SGGY---------VLHSMKWGLIPSFTKKTEKPDHYKMFNARSESIDEKASFRRLLPKSR 106

Query: 866  CLVAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILYTFTIVTTRSSS 687
            CLVAVEGFYEWKKDGSKKQPYYIHFKD + LVFAALYDSW+NSEGE L+TFTIVTT SSS
Sbjct: 107  CLVAVEGFYEWKKDGSKKQPYYIHFKDGRRLVFAALYDSWQNSEGETLHTFTIVTTSSSS 166

Query: 686  ALEWLHDRMPVIFRDIGSINLWLNDFSTKHEAILGPCEDPDLVWYPVTPAMGKPSFDGPE 507
            AL+WLHDRMPVI     S + WL+  ++  ++++ P E+ DLVWYPVT AMGK SFDGPE
Sbjct: 167  ALQWLHDRMPVILGSKESTDTWLSSSASSFKSVMKPYEESDLVWYPVTSAMGKTSFDGPE 226

Query: 506  CIKEIQLKPAGEKPLSKFFSKKTEVKCQSEP-------GFVNSSKQEHALDNVKEESGXX 348
            CIKEIQ+K  G   +S FFSKK      ++P        FV +   E  ++  K E G  
Sbjct: 227  CIKEIQVKAEGNTSISMFFSKKGAESKDTKPEQKLSSHEFVKTEPTEDLIEGAKAEEGDN 286

Query: 347  XXXXXXXXXXXXXXXEMHGIKRELEEMEPSSGLINENMHSKPTNPAKKKGKGINNTGDKQ 168
                               IKRE E     S     N     +NPAKKK K      DKQ
Sbjct: 287  DLKFSGSSHSKNASTL--PIKREYETFSADSKPALANHDQISSNPAKKKEK-TKTANDKQ 343

Query: 167  SSLFSYFGRR 138
             +LFSYFG++
Sbjct: 344  PTLFSYFGKK 353


>ref|XP_003532247.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein-like [Glycine
            max] gi|734346058|gb|KHN10932.1| UPF0361 protein C3orf37
            like [Glycine soja]
          Length = 382

 Score =  366 bits (940), Expect = 2e-98
 Identities = 209/398 (52%), Positives = 242/398 (60%), Gaps = 38/398 (9%)
 Frame = -3

Query: 1220 MCGRARCTLNAHQVAQACGLNPQHASSVPTQ--QIDRYRPSYNVSPGAYLPVFRVRRXXX 1047
            MCGRARCTL A  V +AC     H S+ PT+   IDRYRP+YNVSPG  +PV  VRR   
Sbjct: 1    MCGRARCTLRADDVPRAC-----HRSTSPTRTLHIDRYRPAYNVSPGFDVPV--VRRDDA 53

Query: 1046 XXXXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRMFNARSESIKEKASFRRLIPNNR 867
                         ++ CMKWGLIPSFTKKTEKPDHYRMFNARSESI EKASFRRL+P +R
Sbjct: 54   SGGEGY-------VLQCMKWGLIPSFTKKTEKPDHYRMFNARSESIDEKASFRRLLPKSR 106

Query: 866  CLVAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILYTFTIVTTRSSS 687
            CLVAVEGFYEWKKDGSKKQPYYIHFKD +PLVFAALYDSW+NSEGE LYTFTIVTT SSS
Sbjct: 107  CLVAVEGFYEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGETLYTFTIVTTSSSS 166

Query: 686  ALEWLHDRMPVIFRDIGSINLWLNDFSTKHEAILGPCEDPDLVWYPVTPAMGKPSFDGPE 507
            AL+WLHDRMPVI     S ++WL+  ++  ++++ P E+ DLVWYPVT AMGK SFDGPE
Sbjct: 167  ALQWLHDRMPVILGSKESTDIWLSSSASSFKSVMKPYEESDLVWYPVTSAMGKASFDGPE 226

Query: 506  CIKEIQLKPAGEKPLSKFFSKKTEVKCQSEP----------------------------- 414
            CIKEIQ+K  G   +S FFSKK +    ++P                             
Sbjct: 227  CIKEIQVKAQGNTSISMFFSKKGDESKDTKPEQKASCPEVVKTEHTEDLTESKDTKPEQK 286

Query: 413  ----GFVNSSKQEHALDNVKEESGXXXXXXXXXXXXXXXXXEMHGIKRELEEM---EPSS 255
                 FV +   E   +  K E G                  M  IKRE E     +   
Sbjct: 287  TSSHEFVKTEPTEDLRERAKTEEG--GNDLKFHGSSHSQNVSMLPIKREYETFSAADSKP 344

Query: 254  GLINENMHSKPTNPAKKKGKGINNTGDKQSSLFSYFGR 141
             L N +  S   NPAKKK K      DKQ +LFSYFG+
Sbjct: 345  ALANHDQIS--PNPAKKKEKA-KTANDKQPTLFSYFGK 379


>ref|XP_008244708.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein [Prunus mume]
          Length = 365

 Score =  364 bits (935), Expect = 7e-98
 Identities = 204/379 (53%), Positives = 237/379 (62%), Gaps = 19/379 (5%)
 Frame = -3

Query: 1220 MCGRARCTLNAHQVAQACGLNPQHASSVPTQQIDRYRPSYNVSPGAYLPVFRVRRXXXXX 1041
            MCGRARCT+ A  + +AC    +    V T  +DR+RP +N SPG+ LPV  VRR     
Sbjct: 1    MCGRARCTVRADDIPRACH---RIHGPVRTVNMDRFRPLFNASPGSNLPV--VRREDGAD 55

Query: 1040 XXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRMFNARSESIKEKASFRRLIPNNRCL 861
                       +VHCMKWGLIPSFTKKTEKPDHY+MFNARSESI EKASFRRLIP NRCL
Sbjct: 56   GDGV-------VVHCMKWGLIPSFTKKTEKPDHYKMFNARSESICEKASFRRLIPKNRCL 108

Query: 860  VAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILYTFTIVTTRSSSAL 681
            +AVEGFYEWKKDGSKKQPYY+HF D +PL+FAALYDSWENSEGE LYTFTI+TT SSSAL
Sbjct: 109  IAVEGFYEWKKDGSKKQPYYVHFNDGRPLLFAALYDSWENSEGEKLYTFTIITTSSSSAL 168

Query: 680  EWLHDRMPVIFRDIGSINLWLNDFSTKH-EAILGPCEDPDLVWYPVTPAMGKPSFDGPEC 504
             WLHDRMPVI  D GS + WL+  ST + +++L P E PDLVWYPVTPAMGK SFDGPEC
Sbjct: 169  GWLHDRMPVILGDKGSTDSWLSGSSTSNFDSLLKPYEGPDLVWYPVTPAMGKVSFDGPEC 228

Query: 503  IKEIQLKPAGEKPLSKFFSKKTEVKCQSEP---GFVNSSKQEHALDNVKEESGXXXXXXX 333
            I EIQLK  G   ++KFF  K   K +  P    F NSS +     +VKEE         
Sbjct: 229  INEIQLKTEGNNSITKFFMSKGTKKEELNPKDTSFYNSSVKNDLPKSVKEEPESKEKTEP 288

Query: 332  XXXXXXXXXXEMHGI-------------KRELEEMEPSSGLINENMHSKPTNPAKKKGKG 192
                          +             KR+ EE    S  +         +PAKKK   
Sbjct: 289  PASTEKCENDSKCSVSTLSQEGASKGQTKRDYEEFSADSKPVAYETSEISASPAKKK--- 345

Query: 191  INNTG--DKQSSLFSYFGR 141
            +N     DKQ +LFSYFG+
Sbjct: 346  VNPKSFLDKQPTLFSYFGK 364


>ref|XP_007199067.1| hypothetical protein PRUPE_ppa018685mg [Prunus persica]
            gi|462394467|gb|EMJ00266.1| hypothetical protein
            PRUPE_ppa018685mg [Prunus persica]
          Length = 363

 Score =  363 bits (931), Expect = 2e-97
 Identities = 204/377 (54%), Positives = 238/377 (63%), Gaps = 17/377 (4%)
 Frame = -3

Query: 1220 MCGRARCTLNAHQVAQACGLNPQHASSVP--TQQIDRYRPSYNVSPGAYLPVFRVRRXXX 1047
            MCGRARCTL A  + +AC     H S  P  T  +DR+RP +N SPG+ LPV  VRR   
Sbjct: 1    MCGRARCTLRADDIPRAC-----HRSHGPVRTVNMDRFRPLFNASPGSNLPV--VRREDG 53

Query: 1046 XXXXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRMFNARSESIKEKASFRRLIPNNR 867
                         +VHCMKWGLIPSFTKKTEKPDHY+MFNARSESI EKASFRRLIP NR
Sbjct: 54   GDGDGV-------VVHCMKWGLIPSFTKKTEKPDHYKMFNARSESICEKASFRRLIPKNR 106

Query: 866  CLVAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILYTFTIVTTRSSS 687
            CL+AVEGFYEWKKDGSKKQPYY+HF D +PL+FAALYD WENSEGE LYTFTI+TT SSS
Sbjct: 107  CLIAVEGFYEWKKDGSKKQPYYVHFNDGRPLLFAALYDFWENSEGEKLYTFTIITTSSSS 166

Query: 686  ALEWLHDRMPVIFRDIGSINLWLNDFSTKH-EAILGPCEDPDLVWYPVTPAMGKPSFDGP 510
            AL WLHDRMPVI  D GS + WL+  ST + +++L P E PDLVWYPVT AMGK SFDGP
Sbjct: 167  ALGWLHDRMPVILGDKGSTDSWLSGSSTSNFDSLLKPYEGPDLVWYPVTQAMGKVSFDGP 226

Query: 509  ECIKEIQLKPAGEKPLSKFFSKKTEVKCQSEP---GFVNSSKQEHALDNVKEE------S 357
            ECI EIQLK  G   ++KFF  K   K +  P    F +SS +     +VKEE      +
Sbjct: 227  ECINEIQLKTEGNNSITKFFMSKGTKKEELNPKDTSFYDSSVKNDLPKSVKEEPEGKEKT 286

Query: 356  GXXXXXXXXXXXXXXXXXEMHGI-----KRELEEMEPSSGLINENMHSKPTNPAKKKGKG 192
                                 G+     KR+ EE    S  +         +PAKKK   
Sbjct: 287  EQPASTEKCENDSKGQTISQEGVSKGQTKRDYEEFSADSKPVAYETSEMSASPAKKKVNP 346

Query: 191  INNTGDKQSSLFSYFGR 141
             ++  DKQ +LFSYFG+
Sbjct: 347  KSSV-DKQPTLFSYFGK 362


>ref|XP_004290141.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein isoform X2
            [Fragaria vesca subsp. vesca]
          Length = 366

 Score =  359 bits (921), Expect = 3e-96
 Identities = 201/375 (53%), Positives = 239/375 (63%), Gaps = 17/375 (4%)
 Frame = -3

Query: 1220 MCGRARCTLNAHQVAQACGLNPQHASSVPTQQIDRYRPSYNVSPGAYLPVFRVRRXXXXX 1041
            MCGRARCTL A  +++AC  N     SV    +DRY+P YNVSPGA LPV  VRR     
Sbjct: 1    MCGRARCTLRADDISRACYRNHGPVRSV---NMDRYQPRYNVSPGANLPV--VRRGDGAD 55

Query: 1040 XXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRMFNARSESIKEKASFRRLIPNNRCL 861
                       ++HCMKWGLIPSFTKKTEKPDHYRMFNARSESI EKASFRRL+P +RC+
Sbjct: 56   GEDGV------VLHCMKWGLIPSFTKKTEKPDHYRMFNARSESICEKASFRRLVPKSRCV 109

Query: 860  VAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILYTFTIVTTRSSSAL 681
            VAVEGFYEWKKDGSKKQPYY+HFKD +PL+FAALYDSWENSEGE LYTFTI+TT SSSAL
Sbjct: 110  VAVEGFYEWKKDGSKKQPYYVHFKDGRPLLFAALYDSWENSEGEKLYTFTIITTSSSSAL 169

Query: 680  EWLHDRMPVIFRDIGSINLWLNDFSTKH-EAILGPCEDPDLVWYPVTPAMGKPSFDGPEC 504
             WLHDRMPV+  D  S++ WL+  S  + + +L P E PDLVWYPVTPAMGK SFDGPEC
Sbjct: 170  GWLHDRMPVVLGDKESVDTWLDGSSASNFDKLLKPYEGPDLVWYPVTPAMGKVSFDGPEC 229

Query: 503  IKEIQLKPAGEKPLSKFFSKKTEVKCQSEP---GFVNSSKQEHALDNVKEE--------- 360
              EI+LK  G   ++KFFS K   K +  P      +SS +    +++ EE         
Sbjct: 230  SNEIKLKTDGTNSITKFFSTKGTKKEEINPKDTSLHDSSVKTEFPESLNEEPETKEEKVQ 289

Query: 359  ---SGXXXXXXXXXXXXXXXXXEMHGIKRELEE-MEPSSGLINENMHSKPTNPAKKKGKG 192
               +                       KR+ EE +  S  L NE+      +PAKKK   
Sbjct: 290  PSSTVKCEDSKSSVSILSQEDASKEQTKRDYEEFLADSKPLPNESDKKSSASPAKKK-VN 348

Query: 191  INNTGDKQSSLFSYF 147
            +  + DKQ +LFSYF
Sbjct: 349  LKTSHDKQPTLFSYF 363


>ref|XP_009358987.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein-like [Pyrus x
            bretschneideri]
          Length = 365

 Score =  358 bits (920), Expect = 4e-96
 Identities = 203/379 (53%), Positives = 240/379 (63%), Gaps = 19/379 (5%)
 Frame = -3

Query: 1220 MCGRARCTLNAHQVAQACGLNPQHASSVPTQQI--DRYRPSYNVSPGAYLPVFRVRRXXX 1047
            MCGRARCTL A  V +AC     H +  P + +  DRYRPSYNVSPG+ LPV  VRR   
Sbjct: 1    MCGRARCTLRADDVTRAC-----HRTHAPVRAVNMDRYRPSYNVSPGSNLPV--VRREDG 53

Query: 1046 XXXXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRMFNARSESIKEKASFRRLIPNNR 867
                         ++ CMKWGLIPSFTKKTEKPD YRMFNARSESI EKASFRRL+P +R
Sbjct: 54   ADGDGV-------VLQCMKWGLIPSFTKKTEKPDFYRMFNARSESICEKASFRRLVPKSR 106

Query: 866  CLVAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILYTFTIVTTRSSS 687
            C+VAVEGFYEWKKDGSKKQPYY+HFKD++PL+FAALYDSWENSEGE LYTFTI+TT SSS
Sbjct: 107  CIVAVEGFYEWKKDGSKKQPYYVHFKDSRPLLFAALYDSWENSEGEKLYTFTIITTSSSS 166

Query: 686  ALEWLHDRMPVIFRDIGSINLWLNDFSTKH-EAILGPCEDPDLVWYPVTPAMGKPSFDGP 510
            AL WLHDRMPVI  D  S + WL+  S+ + +++L P E PDLVWYPVTPAMGK SFDGP
Sbjct: 167  ALGWLHDRMPVILGDKESTDTWLDGSSSSNFDSLLKPYEGPDLVWYPVTPAMGKVSFDGP 226

Query: 509  ECIKEIQLKPAGEKPLSKFFSKKTEVKCQSEPGFVNS---SKQEHALDNVKEESGXXXXX 339
            ECI EIQLK  G   ++KFFS K   K +  P   +S   S ++     VKEE       
Sbjct: 227  ECINEIQLKTEGNNSITKFFSAKGTKKEELNPKDTSSDDTSAKKDLSKMVKEEPESKEDT 286

Query: 338  XXXXXXXXXXXXEMHGI-------------KRELEEMEPSSGLINENMHSKPTNPAKKKG 198
                            +             KR+ EE    S  +    + K  + AKKK 
Sbjct: 287  EQPYSTEQCEDESKCNVSTFSQEGVSKGQAKRDYEEFSADSKPVAYVTNKKSASLAKKKV 346

Query: 197  KGINNTGDKQSSLFSYFGR 141
               ++  DKQ +LFSYFG+
Sbjct: 347  NPKSSL-DKQPTLFSYFGK 364


>ref|XP_002303080.1| hypothetical protein POPTR_0002s25190g [Populus trichocarpa]
            gi|222844806|gb|EEE82353.1| hypothetical protein
            POPTR_0002s25190g [Populus trichocarpa]
          Length = 367

 Score =  355 bits (912), Expect = 3e-95
 Identities = 201/376 (53%), Positives = 240/376 (63%), Gaps = 15/376 (3%)
 Frame = -3

Query: 1220 MCGRARCTLNAHQVAQACGLNPQHASSVPTQQIDRYRPSYNVSPGAYLPVFRVRRXXXXX 1041
            MCGRARCTL A  + +AC  N     SV    +DRYRPSYN SPG+ L V  VRR     
Sbjct: 1    MCGRARCTLRADDIPRACHRNTATVRSV---NMDRYRPSYNASPGSNLAV--VRRDDAAS 55

Query: 1040 XXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRMFNARSESIKEKASFRRLIPNNRCL 861
                       I HCMKWGLIP FTKK+EKPD Y+MFNARSES+ EKASFRRLIP +RCL
Sbjct: 56   GDGASGGDGYAI-HCMKWGLIPGFTKKSEKPDFYKMFNARSESLSEKASFRRLIPKSRCL 114

Query: 860  VAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILYTFTIVTTRSSSAL 681
            VAVEGFYEWKKDGSKKQPYYIHFKD +PLVFAALYDSW+NSEGEILYTFTIVTT +SSA+
Sbjct: 115  VAVEGFYEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTIVTTAASSAI 174

Query: 680  EWLHDRMPVIFRDIGSINLWLN-DFSTKHEAILGPCEDPDLVWYPVTPAMGKPSFDGPEC 504
            +WLH+RMPVI  D  + + WL+   ++K + +L P E  DLVWYPVTPAMGKPSFDGPEC
Sbjct: 175  QWLHERMPVILGDKEATDTWLSVSSNSKFDTVLKPYEHSDLVWYPVTPAMGKPSFDGPEC 234

Query: 503  IKEIQLKPAGEKPLSKFFSKKTEVKCQSEPGFVNSSKQ-EHALDNVKEESGXXXXXXXXX 327
            IKEI LK   +  +SKFFS+K E K +S P      K  +    +VKEE+          
Sbjct: 235  IKEIHLKMEEKGTISKFFSRK-EFKEESNPEESTHGKSLKLEPKSVKEENESEEKLETPC 293

Query: 326  XXXXXXXXEMHGI-------------KRELEEMEPSSGLINENMHSKPTNPAKKKGKGIN 186
                        +             KR+ EE+  S  L  + +     +PAKKK   + 
Sbjct: 294  SAKTVDYDLKSELETFSHEGETKCKTKRDREELVDSK-LKTDEIVKPRASPAKKKA-NLK 351

Query: 185  NTGDKQSSLFSYFGRR 138
            +  DKQ +L SYFG++
Sbjct: 352  SVDDKQPTLLSYFGKK 367


>ref|XP_007049611.1| Uncharacterized protein TCM_002685 [Theobroma cacao]
            gi|508701872|gb|EOX93768.1| Uncharacterized protein
            TCM_002685 [Theobroma cacao]
          Length = 360

 Score =  355 bits (912), Expect = 3e-95
 Identities = 197/379 (51%), Positives = 235/379 (62%), Gaps = 18/379 (4%)
 Frame = -3

Query: 1220 MCGRARCTLNAHQVAQACGLNPQHASSVPTQQI--DRYRPSYNVSPGAYLPVFRVRRXXX 1047
            MCGRARCTL A  + +A      H +  P + +  DRYRPSYNV PG  LPV  VRR   
Sbjct: 1    MCGRARCTLRADDIPRA-----SHRNDGPVRHVHMDRYRPSYNVGPGMNLPV--VRRDDG 53

Query: 1046 XXXXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRMFNARSESIKEKASFRRLIPNNR 867
                         ++HCMKWGLIPSFTKKT+KPD Y+MFNARSES+ EKASFRRL+P +R
Sbjct: 54   SNGDGGV------VLHCMKWGLIPSFTKKTDKPDFYKMFNARSESVCEKASFRRLLPKSR 107

Query: 866  CLVAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILYTFTIVTTRSSS 687
            CLVAVEGFYEWKKDGSKKQPYYIHFKD +PLVFAALYD WENSEGE LYTFTI+TT SSS
Sbjct: 108  CLVAVEGFYEWKKDGSKKQPYYIHFKDGRPLVFAALYDCWENSEGEKLYTFTILTTASSS 167

Query: 686  ALEWLHDRMPVIFRDIGSINLWLNDFSTKHEAILGPCEDPDLVWYPVTPAMGKPSFDGPE 507
            A  WLHDRMPVI  D  S + WLN   TK + +L P E+PDLVWYPVT A+GK SF+GPE
Sbjct: 168  AFLWLHDRMPVILGDKESTDTWLN--GTKIDTLLKPYENPDLVWYPVTSAIGKLSFEGPE 225

Query: 506  CIKEIQLKPAGEKPLSKFFSK---KTEVKCQSEPGFVNSSKQEHALDNVKEESGXXXXXX 336
            C+KE+ LK   + P+SKFFS    K E +   E    + S Q + L N+KEE        
Sbjct: 226  CVKEVPLKTQEKNPISKFFSTREVKREQESNMEKSLCDESVQTNLLKNLKEEPNSPEDKE 285

Query: 335  XXXXXXXXXXXEMHGI-------------KRELEEMEPSSGLINENMHSKPTNPAKKKGK 195
                           +             KR+ EE    +    + +     +PA+KKG 
Sbjct: 286  IPSLASKEDNDSKSSVLVPTCEDVRKCQTKRDYEEFSADTKPAKDEIE---VSPARKKG- 341

Query: 194  GINNTGDKQSSLFSYFGRR 138
             I     KQ +LF+YFG+R
Sbjct: 342  NIKGVAGKQPTLFAYFGKR 360


>gb|KHG01683.1| hypothetical protein F383_01440 [Gossypium arboreum]
          Length = 359

 Score =  354 bits (909), Expect = 7e-95
 Identities = 197/381 (51%), Positives = 246/381 (64%), Gaps = 20/381 (5%)
 Frame = -3

Query: 1220 MCGRARCTLNAHQVAQACGLNPQHASSVPTQQI--DRYRPSYNVSPGAYLPVFRVRRXXX 1047
            MCGRARCTL A  + +AC     H +  P + +  DRYRPSYNV PG  +PV  VRR   
Sbjct: 1    MCGRARCTLRADDIPRAC-----HRNDGPIRHVNMDRYRPSYNVGPGMNIPV--VRRDNG 53

Query: 1046 XXXXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRMFNARSESIKEKASFRRLIPNNR 867
                         ++HCMKWGLIPSFTKK++KPD ++MFNARSES+ EKASFRRL+P +R
Sbjct: 54   SNTDVAGV-----VLHCMKWGLIPSFTKKSDKPDFFKMFNARSESVCEKASFRRLLPKSR 108

Query: 866  CLVAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILYTFTIVTTRSSS 687
            CLVAVEGFYEWKKD SKKQPYYIHFKD +PLVFAALYDSWENSEGE L+TFTI+TT +SS
Sbjct: 109  CLVAVEGFYEWKKDVSKKQPYYIHFKDGRPLVFAALYDSWENSEGEKLHTFTILTTSASS 168

Query: 686  ALEWLHDRMPVIFRDIGSINLWLNDFSTKHEAILGPCEDPDLVWYPVTPAMGKPSFDGPE 507
              +WLHDRMPVI  D GS + WLN   +K + +L P E+PDLVWYPVTPA+GK SF+GPE
Sbjct: 169  TFQWLHDRMPVILGDKGSTDAWLN--GSKTDMLLKPYENPDLVWYPVTPAIGKLSFEGPE 226

Query: 506  CIKEIQLKPAGEKPLSKFFS-KKTEVKCQS--EPGFVNSSKQEHALDNVKEES------- 357
            C+KE+ LK   +  +SKFFS +K E + +S  E    + S + + L+N+KE+        
Sbjct: 227  CVKEVPLKTQEKNSISKFFSTRKVEKEQESNMEQSVCDESVKTNLLNNLKEDPRSTDDRL 286

Query: 356  ----GXXXXXXXXXXXXXXXXXEMHGIKRELEEM----EPSSGLINENMHSKPTNPAKKK 201
                                      +KR+ EE+    +PS   I        T+PA+KK
Sbjct: 287  ASLIDKDHDSKSNVPVPSLGDVGKSQVKRDYEELLADTKPSKDKIE-------TSPARKK 339

Query: 200  GKGINNTGDKQSSLFSYFGRR 138
            G  I   GDKQ +LFSYFG++
Sbjct: 340  G-NIKGGGDKQPTLFSYFGKK 359


>ref|XP_011024560.1| PREDICTED: embryonic stem cell-specific
            5-hydroxymethylcytosine-binding protein-like [Populus
            euphratica]
          Length = 362

 Score =  353 bits (905), Expect = 2e-94
 Identities = 199/376 (52%), Positives = 237/376 (63%), Gaps = 15/376 (3%)
 Frame = -3

Query: 1220 MCGRARCTLNAHQVAQACGLNPQHASSVPTQQIDRYRPSYNVSPGAYLPVFRVRRXXXXX 1041
            MCGR RCTL A  + +AC  N     SV    +DRYRPSYN SPG+ L V  VRR     
Sbjct: 1    MCGRVRCTLGADDIPRACHRNTATVRSV---NMDRYRPSYNASPGSNLAV--VRRDDAAS 55

Query: 1040 XXXXXXXXXGPIVHCMKWGLIPSFTKKTEKPDHYRMFNARSESIKEKASFRRLIPNNRCL 861
                        +HCMKWGLIPSFTKK+EKPD Y+MFNARSES+ EKASFRRLIP +RCL
Sbjct: 56   GGDGYA------IHCMKWGLIPSFTKKSEKPDFYKMFNARSESLSEKASFRRLIPKSRCL 109

Query: 860  VAVEGFYEWKKDGSKKQPYYIHFKDNQPLVFAALYDSWENSEGEILYTFTIVTTRSSSAL 681
            VAVEGFYEWKKDGSKKQPYYIHFKD +PLVFAALYDSW+NSEGEILYTFTIVTT +SSA+
Sbjct: 110  VAVEGFYEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTIVTTAASSAI 169

Query: 680  EWLHDRMPVIFRDIGSINLWLN-DFSTKHEAILGPCEDPDLVWYPVTPAMGKPSFDGPEC 504
            +WLHDRMPVI  D  + ++WL+   ++K + +L P    DLVWYPVTPAMGKPSFDGPEC
Sbjct: 170  QWLHDRMPVILGDKEATDIWLSVSSNSKFDTVLKPYGHSDLVWYPVTPAMGKPSFDGPEC 229

Query: 503  IKEIQLKPAGEKPLSKFFSKKTEVKCQSEPGFVNSSKQ-EHALDNVKEESGXXXXXXXXX 327
            IKEI LK   +  +SKFFS+K E K +S P      K  +    +VKEE           
Sbjct: 230  IKEIHLKMEEKGTISKFFSRK-EFKEESNPEESTHGKSLKLEPKSVKEEYESEEKLKTPC 288

Query: 326  XXXXXXXXEMHGI-------------KRELEEMEPSSGLINENMHSKPTNPAKKKGKGIN 186
                        +             KR+ EE+  S    +E +  KP     KK   + 
Sbjct: 289  SAKTVDYDLKSELETFSHEGETKCRTKRDREEVVDSKPKTDEIV--KPRASPAKKTANLK 346

Query: 185  NTGDKQSSLFSYFGRR 138
            +  DKQ +L SYFG++
Sbjct: 347  SVDDKQPTLLSYFGKK 362

