BLASTX nr result

ID: Catharanthus23_contig00010957 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00010957
         (1686 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006346951.1| PREDICTED: probable DNA-3-methyladenine glyc...   394   e-107
ref|XP_004233519.1| PREDICTED: probable DNA-3-methyladenine glyc...   390   e-106
gb|EPS66255.1| hypothetical protein M569_08523, partial [Genlise...   359   2e-96
emb|CAN72984.1| hypothetical protein VITISV_009035 [Vitis vinifera]   341   6e-91
ref|XP_006473513.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   340   8e-91
gb|EOY14778.1| DNA glycosylase superfamily protein [Theobroma ca...   339   2e-90
ref|XP_004147864.1| PREDICTED: uncharacterized protein LOC101202...   337   7e-90
ref|XP_006473512.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   336   2e-89
ref|XP_006435004.1| hypothetical protein CICLE_v10001539mg [Citr...   336   2e-89
ref|XP_002266618.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   334   6e-89
ref|XP_002282344.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   333   1e-88
ref|XP_002510396.1| DNA-3-methyladenine glycosylase, putative [R...   332   2e-88
ref|XP_006341950.1| PREDICTED: probable DNA-3-methyladenine glyc...   330   1e-87
gb|EXB91937.1| DNA-3-methyladenine glycosylase 1 [Morus notabilis]    328   3e-87
ref|XP_002302029.1| predicted protein [Populus trichocarpa]           328   4e-87
emb|CBI19705.3| unnamed protein product [Vitis vinifera]              323   6e-87
ref|XP_006416502.1| hypothetical protein EUTSA_v10007939mg [Eutr...   327   7e-87
ref|XP_004238277.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   327   1e-86
ref|XP_006305120.1| hypothetical protein CARUB_v10009489mg [Caps...   326   2e-86
gb|ESW08591.1| hypothetical protein PHAVU_009G058200g [Phaseolus...   325   3e-86

>ref|XP_006346951.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2-like [Solanum
            tuberosum]
          Length = 362

 Score =  394 bits (1013), Expect = e-107
 Identities = 202/278 (72%), Positives = 224/278 (80%)
 Frame = -2

Query: 1232 KNRRRSASQSSRVLPQVIKPLTAAGEIEAALRHLRSADHLLAAFIDSYPMPTFETHRSPF 1053
            KNRR+SA +SSRVLPQ+IKPL+A GEI+ AL+HLRS D LL + ID+ P P FE H S F
Sbjct: 87   KNRRKSAPKSSRVLPQIIKPLSADGEIDNALQHLRSVDPLLVSLIDTLPSPQFELHHSAF 146

Query: 1052 LALTRSILYQQLAYKAGASIYNRFVSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKASYL 873
            LAL++SILYQQLAYKAG SIY RFVSLCGGED V PD VL+LS QQLKQVG+SGRKASYL
Sbjct: 147  LALSKSILYQQLAYKAGTSIYTRFVSLCGGEDAVCPDIVLSLSPQQLKQVGISGRKASYL 206

Query: 872  YDLANKYKSGILSDETVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLG 693
            +DLANKY+SGILSDET+VKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLG
Sbjct: 207  HDLANKYRSGILSDETLVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLG 266

Query: 692  VRKGVQMLYGLEEMPRPSQMEQLCEKWKPYRSVGAWYMWRISEVKGTPNVGATPSLEVTN 513
            VRKGVQ+LYGLEE+PRPSQMEQLC+KWKPYRS GAWYMWR+ E KGTP   A P ++  N
Sbjct: 267  VRKGVQLLYGLEELPRPSQMEQLCDKWKPYRSAGAWYMWRLVEGKGTPTTAAAP-IDGGN 325

Query: 512  VXXXXXXXXXXXXXXXXXXXXXXXINGMGNLGACIWGQ 399
            V                       ING+ NLGACIW Q
Sbjct: 326  V-QALQQFPTEQETQQHQLQLLEPINGIENLGACIWSQ 362


>ref|XP_004233519.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2-like [Solanum
            lycopersicum]
          Length = 353

 Score =  390 bits (1003), Expect = e-106
 Identities = 199/278 (71%), Positives = 222/278 (79%)
 Frame = -2

Query: 1232 KNRRRSASQSSRVLPQVIKPLTAAGEIEAALRHLRSADHLLAAFIDSYPMPTFETHRSPF 1053
            KNRR++A +SSRV PQ+IKPL+A GEI+ AL+HLRS D LL + ID+ P P FE H S F
Sbjct: 78   KNRRKTAPKSSRVSPQIIKPLSADGEIDNALQHLRSVDPLLVSLIDTLPSPQFELHHSAF 137

Query: 1052 LALTRSILYQQLAYKAGASIYNRFVSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKASYL 873
            LAL++SILYQQLAYKAG SIY RFVSLCGGED V PD VLALS QQLKQVG+SGRKASYL
Sbjct: 138  LALSKSILYQQLAYKAGTSIYTRFVSLCGGEDAVCPDIVLALSPQQLKQVGISGRKASYL 197

Query: 872  YDLANKYKSGILSDETVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLG 693
            +DLANKYKSGILSDET+VKMDDRSLF MLSMVKGIGSWSVHMFMIFSLHRPD+LPVSDLG
Sbjct: 198  HDLANKYKSGILSDETLVKMDDRSLFAMLSMVKGIGSWSVHMFMIFSLHRPDILPVSDLG 257

Query: 692  VRKGVQMLYGLEEMPRPSQMEQLCEKWKPYRSVGAWYMWRISEVKGTPNVGATPSLEVTN 513
            VRKGVQ+LYGLEE+PRPSQMEQLC+KWKPYRS GAWYMWR+ E KGTP + A P ++  N
Sbjct: 258  VRKGVQLLYGLEELPRPSQMEQLCDKWKPYRSAGAWYMWRLVEGKGTPTIAAAP-IDGGN 316

Query: 512  VXXXXXXXXXXXXXXXXXXXXXXXINGMGNLGACIWGQ 399
                                    ING+ NLGACIW Q
Sbjct: 317  A-QALQQFPVEQETQQHQLQLLEPINGIENLGACIWSQ 353


>gb|EPS66255.1| hypothetical protein M569_08523, partial [Genlisea aurea]
          Length = 321

 Score =  359 bits (922), Expect = 2e-96
 Identities = 190/327 (58%), Positives = 221/327 (67%)
 Frame = -2

Query: 1520 PQKSSKIPIRPQKIRKLXXXXXXXXXXXXXXXXXXXXXSDEKLIQVPDSAAASPSPVQXX 1341
            PQ  SKIPIRPQK+RKL                      D  L   P SA  + + V   
Sbjct: 4    PQNPSKIPIRPQKMRKLSNPASICDDKAYSPQEIGA---DSPLAAPPSSALTACATV--- 57

Query: 1340 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPKNRRRSASQSSRVLPQVIKPLTAA 1161
                                                +NRRRS SQ+SRV PQ+ +PL A 
Sbjct: 58   ---------------------GAITPVTAAAATSAARNRRRSYSQASRVSPQLTRPLYAE 96

Query: 1160 GEIEAALRHLRSADHLLAAFIDSYPMPTFETHRSPFLALTRSILYQQLAYKAGASIYNRF 981
            GE+E AL HLR  D L  A ID+YP P F+TH SPF+AL +SI+YQQLA KAG SIY RF
Sbjct: 97   GELEIALNHLRVVDPLFGALIDAYPPPQFDTHPSPFIALAKSIIYQQLALKAGTSIYMRF 156

Query: 980  VSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKASYLYDLANKYKSGILSDETVVKMDDRS 801
            ++LC GE+ V PD+VL+LS+QQLKQ+G+SGRKASYLYDLANKYKSGILSDE +VKMDD+S
Sbjct: 157  IALCSGEEAVTPDSVLSLSSQQLKQIGISGRKASYLYDLANKYKSGILSDELIVKMDDKS 216

Query: 800  LFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQMLYGLEEMPRPSQMEQLC 621
            LFTMLSMVKGIGSWSVHMFM+FSL RPDVLPVSDLGVRKGVQ+LY L E+PRPSQMEQLC
Sbjct: 217  LFTMLSMVKGIGSWSVHMFMLFSLQRPDVLPVSDLGVRKGVQLLYDLGELPRPSQMEQLC 276

Query: 620  EKWKPYRSVGAWYMWRISEVKGTPNVG 540
             KW+PYRSV +WY+WRI E K +P+ G
Sbjct: 277  GKWRPYRSVASWYLWRIVEAKASPSSG 303


>emb|CAN72984.1| hypothetical protein VITISV_009035 [Vitis vinifera]
          Length = 353

 Score =  341 bits (874), Expect = 6e-91
 Identities = 163/233 (69%), Positives = 194/233 (83%)
 Frame = -2

Query: 1223 RRSASQSSRVLPQVIKPLTAAGEIEAALRHLRSADHLLAAFIDSYPMPTFETHRSPFLAL 1044
            +R+A+QS+  LP ++KPL+  GE++ ALRHL  +D LLAA I+++  PTF++   PFLAL
Sbjct: 87   KRNAAQSTAALPTIVKPLSCEGELDVALRHLTKSDPLLAALINTHQPPTFDSCHPPFLAL 146

Query: 1043 TRSILYQQLAYKAGASIYNRFVSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKASYLYDL 864
             +SILYQQLAYKA  SIY RFV+LCGGE  V PD VLALS  QL+Q+GVSGRKA YL+DL
Sbjct: 147  AKSILYQQLAYKAATSIYTRFVALCGGEAGVVPDAVLALSPSQLRQIGVSGRKAGYLHDL 206

Query: 863  ANKYKSGILSDETVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRK 684
            A+KYK+GILSD +++ MDD+SLFTML+MVKGIGSWSVHMFMIFSLHRPDVLPV D+GVRK
Sbjct: 207  ASKYKTGILSDSSIMGMDDKSLFTMLTMVKGIGSWSVHMFMIFSLHRPDVLPVGDVGVRK 266

Query: 683  GVQMLYGLEEMPRPSQMEQLCEKWKPYRSVGAWYMWRISEVKGTPNVGATPSL 525
            GVQ LYGLEE+PRPSQMEQLCEKWKPYRSVG+WYMWR  E KG P   A  +L
Sbjct: 267  GVQFLYGLEELPRPSQMEQLCEKWKPYRSVGSWYMWRFVEAKGAPPARAAVAL 319


>ref|XP_006473513.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform X2 [Citrus
            sinensis]
          Length = 371

 Score =  340 bits (873), Expect = 8e-91
 Identities = 167/279 (59%), Positives = 208/279 (74%), Gaps = 1/279 (0%)
 Frame = -2

Query: 1232 KNRRRSASQSSRVLPQVI-KPLTAAGEIEAALRHLRSADHLLAAFIDSYPMPTFETHRSP 1056
            K+R     Q +  +P++I +PL++ GE+EAA+RHLR+AD  LA+ ID +P PTF++  +P
Sbjct: 101  KSRAIQQQQQTLTVPRIIARPLSSEGEVEAAIRHLRNADRQLASLIDIHPPPTFDSFHTP 160

Query: 1055 FLALTRSILYQQLAYKAGASIYNRFVSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKASY 876
            FLALTRSILYQQLA+KAG SIY RF++LCGGE  V P+ VLAL+ QQL+Q+GVSGRKASY
Sbjct: 161  FLALTRSILYQQLAFKAGTSIYTRFIALCGGEAGVVPETVLALTPQQLRQIGVSGRKASY 220

Query: 875  LYDLANKYKSGILSDETVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDL 696
            L+DLA KY++GILSD  +V MDD+SLFTML+MV GIGSWSVHMFMIFSLHRPDVLP++DL
Sbjct: 221  LHDLARKYQNGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDL 280

Query: 695  GVRKGVQMLYGLEEMPRPSQMEQLCEKWKPYRSVGAWYMWRISEVKGTPNVGATPSLEVT 516
            GVRKGVQ+LY LEE+PRPSQM+QLCEKW+PYRSV +WY+WR  E KG P+  A  +    
Sbjct: 281  GVRKGVQLLYSLEELPRPSQMDQLCEKWRPYRSVASWYLWRFVEAKGAPSSAAAVAAGAA 340

Query: 515  NVXXXXXXXXXXXXXXXXXXXXXXXINGMGNLGACIWGQ 399
                                     IN + N+GAC WGQ
Sbjct: 341  --------LPQPQQEEQQQPQLLDQINSLINIGACAWGQ 371


>gb|EOY14778.1| DNA glycosylase superfamily protein [Theobroma cacao]
          Length = 397

 Score =  339 bits (870), Expect = 2e-90
 Identities = 165/267 (61%), Positives = 206/267 (77%), Gaps = 1/267 (0%)
 Frame = -2

Query: 1196 VLPQVI-KPLTAAGEIEAALRHLRSADHLLAAFIDSYPMPTFETHRSPFLALTRSILYQQ 1020
            V+P+++ + L+  GE+E A+RHLR+AD LLA+ ID +P PTF+T  +PFLALTRSILYQQ
Sbjct: 132  VVPRIMARSLSCEGEVETAIRHLRNADPLLASLIDIHPPPTFDTFHTPFLALTRSILYQQ 191

Query: 1019 LAYKAGASIYNRFVSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKASYLYDLANKYKSGI 840
            LA+KAG SIYNRF++LCGGE+ V P+ VL+L+ QQL+Q+GVSGRKASYL+DLA KY++GI
Sbjct: 192  LAFKAGTSIYNRFIALCGGENGVVPETVLSLTAQQLRQIGVSGRKASYLHDLARKYQTGI 251

Query: 839  LSDETVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQMLYGL 660
            LSD  +V MDD+SLFTML+MV GIGSWSVHMFMIFSLHRPDVLP++DLGVRKGVQ+LY L
Sbjct: 252  LSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNL 311

Query: 659  EEMPRPSQMEQLCEKWKPYRSVGAWYMWRISEVKGTPNVGATPSLEVTNVXXXXXXXXXX 480
            EE+PRPSQM+QLCEKW+PYRSV +WY+WR  E KG P+  A  +    ++          
Sbjct: 312  EELPRPSQMDQLCEKWRPYRSVASWYLWRFVEAKGAPSSAAAVAAG-ASLPPPQQEEQQQ 370

Query: 479  XXXXXXXXXXXXXINGMGNLGACIWGQ 399
                         IN + NLGAC WGQ
Sbjct: 371  HQQHQQQPQLLDPINSILNLGACAWGQ 397


>ref|XP_004147864.1| PREDICTED: uncharacterized protein LOC101202943 [Cucumis sativus]
            gi|449476816|ref|XP_004154842.1| PREDICTED:
            uncharacterized LOC101202943 [Cucumis sativus]
          Length = 382

 Score =  337 bits (865), Expect = 7e-90
 Identities = 167/284 (58%), Positives = 209/284 (73%), Gaps = 7/284 (2%)
 Frame = -2

Query: 1229 NRRRSASQSSRVLPQVIKP---LTAAGEIEAALRHLRSADHLLAAFIDSYPMPTFETHRS 1059
            N+ ++A Q +      + P   L+  GE+E ALRHLR+AD LLA  ID +  PTF++ ++
Sbjct: 99   NKSKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQT 158

Query: 1058 PFLALTRSILYQQLAYKAGASIYNRFVSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKAS 879
            PFLALTRSILYQQLAYKAG SIY RF++LCGGE  V P+ VLAL+ QQL+Q+G+SGRK+S
Sbjct: 159  PFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSS 218

Query: 878  YLYDLANKYKSGILSDETVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSD 699
            YL+DLA KY++GILSD  +V MDD+SLFTML+MV GIGSWSVHMFMIFSLHRPDVLP++D
Sbjct: 219  YLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPIND 278

Query: 698  LGVRKGVQMLYGLEEMPRPSQMEQLCEKWKPYRSVGAWYMWRISEVKGTPN----VGATP 531
            L VRKGVQ+LY LEE+PRPSQM+QLCEKW+PYRSVG+WYMWR++E KG  +    V A  
Sbjct: 279  LNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGA 338

Query: 530  SLEVTNVXXXXXXXXXXXXXXXXXXXXXXXINGMGNLGACIWGQ 399
            SL++ +                        +N + NLGAC WGQ
Sbjct: 339  SLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ 382


>ref|XP_006473512.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform X1 [Citrus
            sinensis]
          Length = 373

 Score =  336 bits (862), Expect = 2e-89
 Identities = 158/233 (67%), Positives = 196/233 (84%), Gaps = 1/233 (0%)
 Frame = -2

Query: 1232 KNRRRSASQSSRVLPQVI-KPLTAAGEIEAALRHLRSADHLLAAFIDSYPMPTFETHRSP 1056
            K+R     Q +  +P++I +PL++ GE+EAA+RHLR+AD  LA+ ID +P PTF++  +P
Sbjct: 101  KSRAIQQQQQTLTVPRIIARPLSSEGEVEAAIRHLRNADRQLASLIDIHPPPTFDSFHTP 160

Query: 1055 FLALTRSILYQQLAYKAGASIYNRFVSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKASY 876
            FLALTRSILYQQLA+KAG SIY RF++LCGGE  V P+ VLAL+ QQL+Q+GVSGRKASY
Sbjct: 161  FLALTRSILYQQLAFKAGTSIYTRFIALCGGEAGVVPETVLALTPQQLRQIGVSGRKASY 220

Query: 875  LYDLANKYKSGILSDETVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDL 696
            L+DLA KY++GILSD  +V MDD+SLFTML+MV GIGSWSVHMFMIFSLHRPDVLP++DL
Sbjct: 221  LHDLARKYQNGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDL 280

Query: 695  GVRKGVQMLYGLEEMPRPSQMEQLCEKWKPYRSVGAWYMWRISEVKGTPNVGA 537
            GVRKGVQ+LY LEE+PRPSQM+QLCEKW+PYRSV +WY+WR  E KG P+  A
Sbjct: 281  GVRKGVQLLYSLEELPRPSQMDQLCEKWRPYRSVASWYLWRFVEAKGAPSSAA 333


>ref|XP_006435004.1| hypothetical protein CICLE_v10001539mg [Citrus clementina]
            gi|557537126|gb|ESR48244.1| hypothetical protein
            CICLE_v10001539mg [Citrus clementina]
          Length = 373

 Score =  336 bits (862), Expect = 2e-89
 Identities = 158/233 (67%), Positives = 196/233 (84%), Gaps = 1/233 (0%)
 Frame = -2

Query: 1232 KNRRRSASQSSRVLPQVI-KPLTAAGEIEAALRHLRSADHLLAAFIDSYPMPTFETHRSP 1056
            K+R     Q +  +P++I +PL++ GE+EAA+RHLR+AD  LA+ ID +P PTF++  +P
Sbjct: 101  KSRAIQQQQQTLTVPRIIARPLSSEGEVEAAIRHLRNADRQLASLIDIHPPPTFDSFHTP 160

Query: 1055 FLALTRSILYQQLAYKAGASIYNRFVSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKASY 876
            FLALTRSILYQQLA+KAG SIY RF++LCGGE  V P+ VLAL+ QQL+Q+GVSGRKASY
Sbjct: 161  FLALTRSILYQQLAFKAGTSIYTRFIALCGGEAGVVPETVLALTPQQLRQIGVSGRKASY 220

Query: 875  LYDLANKYKSGILSDETVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDL 696
            L+DLA KY++GILSD  +V MDD+SLFTML+MV GIGSWSVHMFMIFSLHRPDVLP++DL
Sbjct: 221  LHDLARKYQNGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDL 280

Query: 695  GVRKGVQMLYGLEEMPRPSQMEQLCEKWKPYRSVGAWYMWRISEVKGTPNVGA 537
            GVRKGVQ+LY LEE+PRPSQM+QLCEKW+PYRSV +WY+WR  E KG P+  A
Sbjct: 281  GVRKGVQLLYSLEELPRPSQMDQLCEKWRPYRSVASWYLWRFVEAKGAPSSAA 333


>ref|XP_002266618.1| PREDICTED: DNA-3-methyladenine glycosylase 1 [Vitis vinifera]
            gi|297735147|emb|CBI17509.3| unnamed protein product
            [Vitis vinifera]
          Length = 329

 Score =  334 bits (857), Expect = 6e-89
 Identities = 168/259 (64%), Positives = 195/259 (75%)
 Frame = -2

Query: 1175 PLTAAGEIEAALRHLRSADHLLAAFIDSYPMPTFETHRSPFLALTRSILYQQLAYKAGAS 996
            PL+  GE++ ALRHL  +D LLAA I+++  PTF++   PFLAL +SILYQQLAYKA  S
Sbjct: 73   PLSCEGELDVALRHLTKSDPLLAALINTHQPPTFDSCHPPFLALAKSILYQQLAYKAATS 132

Query: 995  IYNRFVSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKASYLYDLANKYKSGILSDETVVK 816
            IY RFV+LCGGE  V PD VLALS  QL+Q+GVSGRKA YL+DLA+KYK+GILSD +++ 
Sbjct: 133  IYTRFVALCGGEAGVVPDAVLALSPSQLRQIGVSGRKAGYLHDLASKYKTGILSDSSIMG 192

Query: 815  MDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQMLYGLEEMPRPSQ 636
            MDD+SLFTML+MVKGIGSWSVHMFMIFSLHRPDVLPV D+GVRKGVQ LYGLEE+PRPSQ
Sbjct: 193  MDDKSLFTMLTMVKGIGSWSVHMFMIFSLHRPDVLPVGDVGVRKGVQFLYGLEELPRPSQ 252

Query: 635  MEQLCEKWKPYRSVGAWYMWRISEVKGTPNVGATPSLEVTNVXXXXXXXXXXXXXXXXXX 456
            MEQLCEKWKPYRSVG+WYMWR  E KG P   A  +L   +                   
Sbjct: 253  MEQLCEKWKPYRSVGSWYMWRFVEAKGAPPARAAVAL--VDGATSEQQQQQEQQQQPQQL 310

Query: 455  XXXXXINGMGNLGACIWGQ 399
                 ING+ NLGACIWGQ
Sbjct: 311  QLVDPINGIVNLGACIWGQ 329


>ref|XP_002282344.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like [Vitis vinifera]
          Length = 363

 Score =  333 bits (855), Expect = 1e-88
 Identities = 170/266 (63%), Positives = 198/266 (74%), Gaps = 4/266 (1%)
 Frame = -2

Query: 1184 VIKPLTAAGEIEAALRHLRSADHLLAAFIDSYPMPTFETHRSPFLALTRSILYQQLAYKA 1005
            V + L+  GEIE ALRHLR+AD  LA  ID +P PTF++  +PFLALT+SILYQQLAYKA
Sbjct: 101  VARALSCEGEIEIALRHLRNADPHLAPLIDLHPPPTFDSFHTPFLALTKSILYQQLAYKA 160

Query: 1004 GASIYNRFVSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKASYLYDLANKYKSGILSDET 825
            G SIY RFV LCGGE  V P+ VLAL+  QL+Q+GVSGRKASYL+DLA KY++GILSD  
Sbjct: 161  GTSIYTRFVGLCGGEAGVLPETVLALTPHQLRQIGVSGRKASYLHDLARKYQNGILSDTG 220

Query: 824  VVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQMLYGLEEMPR 645
            ++ MDD+SLFTML+MV GIGSWSVHMFMIFSLHRPDVLPV+DLGVRKGVQ+LYGLEE+PR
Sbjct: 221  IITMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLLYGLEELPR 280

Query: 644  PSQMEQLCEKWKPYRSVGAWYMWRISEVKGTPN----VGATPSLEVTNVXXXXXXXXXXX 477
            PSQMEQLCEKW+PYRSV +WY+WR  E KG P+    V   PSL+               
Sbjct: 281  PSQMEQLCEKWRPYRSVASWYIWRFVEGKGAPSSAAAVAGGPSLQQQQ---QQQEQQQQH 337

Query: 476  XXXXXXXXXXXXINGMGNLGACIWGQ 399
                        ING+ NLGAC WGQ
Sbjct: 338  QQQQHQQQFLDPINGILNLGACAWGQ 363


>ref|XP_002510396.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis]
            gi|223551097|gb|EEF52583.1| DNA-3-methyladenine
            glycosylase, putative [Ricinus communis]
          Length = 369

 Score =  332 bits (852), Expect = 2e-88
 Identities = 166/267 (62%), Positives = 200/267 (74%), Gaps = 3/267 (1%)
 Frame = -2

Query: 1190 PQVI-KPLTAAGEIEAALRHLRSADHLLAAFIDSYPMPTFETHRSPFLALTRSILYQQLA 1014
            P++I + L+  GE+E A+RHLR AD LL++ ID +P PTF+T  +PFLALTRSILYQQLA
Sbjct: 113  PRIIARSLSCEGEVENAIRHLREADPLLSSLIDLHPPPTFDTFHTPFLALTRSILYQQLA 172

Query: 1013 YKAGASIYNRFVSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKASYLYDLANKYKSGILS 834
            +KAG SIY RF+SLCGGE  V PD VLAL+ QQL+Q+GVSGRKASYL+DLA KY +GILS
Sbjct: 173  FKAGTSIYTRFISLCGGEAGVVPDTVLALTPQQLRQIGVSGRKASYLHDLARKYHNGILS 232

Query: 833  DETVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQMLYGLEE 654
            D  +V MDD+SLFTML+MV GIGSWSVHMFMIFSLHRPDVLP++DLGVRKGVQ+LY LE+
Sbjct: 233  DSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLED 292

Query: 653  MPRPSQMEQLCEKWKPYRSVGAWYMWRISEVKGTPN--VGATPSLEVTNVXXXXXXXXXX 480
            +PRPSQM+QLCEKW+PYRSV +WY+WR  E KG+P+  V       +T            
Sbjct: 293  LPRPSQMDQLCEKWRPYRSVASWYLWRFVEAKGSPSSAVAVATGAALTQ----------Q 342

Query: 479  XXXXXXXXXXXXXINGMGNLGACIWGQ 399
                         IN + NLGAC WGQ
Sbjct: 343  HQEDHQQPQLLDPINSILNLGACAWGQ 369


>ref|XP_006341950.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2-like isoform X1
            [Solanum tuberosum]
          Length = 334

 Score =  330 bits (845), Expect = 1e-87
 Identities = 160/273 (58%), Positives = 209/273 (76%), Gaps = 2/273 (0%)
 Frame = -2

Query: 1211 SQSSRVLPQVI-KPLTAAGEIEAALRHLRSADHLLAAFIDSYPMPTFETHRSPFLALTRS 1035
            +Q S ++P+++ + L+  GE+E+A+ +L+S+D LL+  I++YP+PT E  + PFLALT+S
Sbjct: 62   TQISTIVPRIVSRSLSYEGELESAINYLKSSDPLLSPLIETYPLPTLELFQPPFLALTKS 121

Query: 1034 ILYQQLAYKAGASIYNRFVSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKASYLYDLANK 855
            IL+QQLAYKAG+SIY RF+SLCGGE NV PD VL L+ QQL+Q+GVS RKASYL+DLA K
Sbjct: 122  ILFQQLAYKAGSSIYTRFISLCGGESNVMPDMVLGLTPQQLRQIGVSARKASYLHDLARK 181

Query: 854  YKSGILSDETVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQ 675
            Y++GILSD+++V MDD+SLFTML+MV GIGSWSVHMFMIFSLHRPDVLP+ DLG+RKGV+
Sbjct: 182  YQNGILSDKSIVDMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPIHDLGIRKGVR 241

Query: 674  MLYGLEEMPRPSQMEQLCEKWKPYRSVGAWYMWRISEVKGTPNVG-ATPSLEVTNVXXXX 498
            MLYGLE++PRPSQM+QLCEKWKPYRSV +WY+WR  E KG  + G    + +V+      
Sbjct: 242  MLYGLEDLPRPSQMDQLCEKWKPYRSVASWYIWRFVEAKGANSKGNVVGNSDVSLQQQML 301

Query: 497  XXXXXXXXXXXXXXXXXXXINGMGNLGACIWGQ 399
                               ING+ ++GAC WGQ
Sbjct: 302  SMQQQQQQQHQPNQQFLDPINGILDVGACAWGQ 334


>gb|EXB91937.1| DNA-3-methyladenine glycosylase 1 [Morus notabilis]
          Length = 451

 Score =  328 bits (842), Expect = 3e-87
 Identities = 178/336 (52%), Positives = 215/336 (63%), Gaps = 4/336 (1%)
 Frame = -2

Query: 1529 SNLPQKSS----KIPIRPQKIRKLXXXXXXXXXXXXXXXXXXXXXSDEKLIQVPDSAAAS 1362
            SN P ++S    KIP+RP+KIRKL                        +++ VP++   S
Sbjct: 48   SNAPSQTSSPPSKIPLRPRKIRKLSPDDSDSK--------------SSQVVAVPENPKPS 93

Query: 1361 PSPVQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPKNRRRSASQSSRVLPQV 1182
            P+                                           +R  A  + R+   V
Sbjct: 94   PTAAAAAKPAKAKIV-----------------------------QQRALAIAAPRI---V 121

Query: 1181 IKPLTAAGEIEAALRHLRSADHLLAAFIDSYPMPTFETHRSPFLALTRSILYQQLAYKAG 1002
             + L+  GE+E ALRHLR AD LLA  ID +  PTF+   +PFLALTRSILYQQLAYKAG
Sbjct: 122  ARSLSCEGEVEVALRHLRRADPLLAPLIDIHQPPTFDNFHTPFLALTRSILYQQLAYKAG 181

Query: 1001 ASIYNRFVSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKASYLYDLANKYKSGILSDETV 822
             SIY RF++LCGGE  V P+ VLAL+ QQL+Q+GVSGRKASYL+DLA KY++GILSD  +
Sbjct: 182  TSIYTRFIALCGGETGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAI 241

Query: 821  VKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQMLYGLEEMPRP 642
            V MDD+SLFTML+MV GIGSWSVHMFMIFSLHRPDVLP++DLGVRKGVQ+LY LEE+PRP
Sbjct: 242  VNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEELPRP 301

Query: 641  SQMEQLCEKWKPYRSVGAWYMWRISEVKGTPNVGAT 534
            SQM+QLCEKW+PYRSV AWYMWR  E KG P   AT
Sbjct: 302  SQMDQLCEKWRPYRSVAAWYMWRFVEQKGAPPNAAT 337


>ref|XP_002302029.1| predicted protein [Populus trichocarpa]
          Length = 381

 Score =  328 bits (841), Expect = 4e-87
 Identities = 164/262 (62%), Positives = 196/262 (74%), Gaps = 2/262 (0%)
 Frame = -2

Query: 1178 KPLTAAGEIEAALRHLRSADHLLAAFIDSYPMPTFETHRSPFLALTRSILYQQLAYKAGA 999
            + LT  GE+E A+RHLR+AD LLA+ ID YP PTF+T  +PFLAL RSILYQQLA+KAG 
Sbjct: 127  RSLTCEGELEIAIRHLRNADPLLASLIDIYPPPTFDTFPTPFLALARSILYQQLAFKAGT 186

Query: 998  SIYNRFVSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKASYLYDLANKYKSGILSDETVV 819
            SIY RF+SLCGGE  V P+ VLAL+ QQL+Q+GVSGRKASYL+DLA KY++GILSD  +V
Sbjct: 187  SIYTRFISLCGGEAGVLPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIV 246

Query: 818  KMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQMLYGLEEMPRPS 639
             MDD+SLFTML+MV GIGSWSVHMFMIFSLHRPDVLP++DL VRKG+Q+LY L E+PRPS
Sbjct: 247  NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLQVRKGLQVLYNLPELPRPS 306

Query: 638  QMEQLCEKWKPYRSVGAWYMWRISEVKGTPN--VGATPSLEVTNVXXXXXXXXXXXXXXX 465
            QM+ LCEKW+PYRSV +WY+WR  EVKG+P+  V    S  +T                 
Sbjct: 307  QMDHLCEKWRPYRSVASWYLWRFQEVKGSPSSAVALASSGNLTQ-------QQQEEQQHQ 359

Query: 464  XXXXXXXXINGMGNLGACIWGQ 399
                    IN + NLGAC WGQ
Sbjct: 360  QEPQLIDPINSILNLGACAWGQ 381


>emb|CBI19705.3| unnamed protein product [Vitis vinifera]
          Length = 351

 Score =  323 bits (828), Expect(2) = 6e-87
 Identities = 157/229 (68%), Positives = 184/229 (80%)
 Frame = -2

Query: 1223 RRSASQSSRVLPQVIKPLTAAGEIEAALRHLRSADHLLAAFIDSYPMPTFETHRSPFLAL 1044
            R+ +  +S   P         GEIE ALRHLR+AD  LA  ID +P PTF++  +PFLAL
Sbjct: 95   RKISPDNSESKPAGDSKTAGKGEIEIALRHLRNADPHLAPLIDLHPPPTFDSFHTPFLAL 154

Query: 1043 TRSILYQQLAYKAGASIYNRFVSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKASYLYDL 864
            T+SILYQQLAYKAG SIY RFV LCGGE  V P+ VLAL+  QL+Q+GVSGRKASYL+DL
Sbjct: 155  TKSILYQQLAYKAGTSIYTRFVGLCGGEAGVLPETVLALTPHQLRQIGVSGRKASYLHDL 214

Query: 863  ANKYKSGILSDETVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRK 684
            A KY++GILSD  ++ MDD+SLFTML+MV GIGSWSVHMFMIFSLHRPDVLPV+DLGVRK
Sbjct: 215  ARKYQNGILSDTGIITMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRK 274

Query: 683  GVQMLYGLEEMPRPSQMEQLCEKWKPYRSVGAWYMWRISEVKGTPNVGA 537
            GVQ+LYGLEE+PRPSQMEQLCEKW+PYRSV +WY+WR  E KG P+  A
Sbjct: 275  GVQLLYGLEELPRPSQMEQLCEKWRPYRSVASWYIWRFVEGKGAPSSAA 323



 Score = 26.9 bits (58), Expect(2) = 6e-87
 Identities = 11/17 (64%), Positives = 12/17 (70%)
 Frame = -1

Query: 441 HQWHGKSWGLHLGPMTG 391
           +QWH K  GL LG MTG
Sbjct: 334 NQWHSKPRGLCLGTMTG 350


>ref|XP_006416502.1| hypothetical protein EUTSA_v10007939mg [Eutrema salsugineum]
            gi|557094273|gb|ESQ34855.1| hypothetical protein
            EUTSA_v10007939mg [Eutrema salsugineum]
          Length = 378

 Score =  327 bits (839), Expect = 7e-87
 Identities = 160/225 (71%), Positives = 188/225 (83%)
 Frame = -2

Query: 1223 RRSASQSSRVLPQVIKPLTAAGEIEAALRHLRSADHLLAAFIDSYPMPTFETHRSPFLAL 1044
            R S S++  V     +PLT+ GE+E A+ HLR+AD LLAA ID YP PTFE+  +PFLAL
Sbjct: 117  RLSQSRAITVPRIQARPLTSEGELEVAIHHLRNADPLLAALIDVYPPPTFESFPTPFLAL 176

Query: 1043 TRSILYQQLAYKAGASIYNRFVSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKASYLYDL 864
             RSILYQQLA KAG SIY RFV+LCGGE+ V P+ VLAL+ QQL+Q+GVSGRKASYL+DL
Sbjct: 177  IRSILYQQLAAKAGNSIYTRFVALCGGENFVVPETVLALNPQQLRQIGVSGRKASYLHDL 236

Query: 863  ANKYKSGILSDETVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRK 684
            A KY++GILSD  ++ MDD+SLFTML+MV GIGSWSVHMFMI SLHRPDVLP++DLGVRK
Sbjct: 237  ARKYQNGILSDSAILNMDDKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPINDLGVRK 296

Query: 683  GVQMLYGLEEMPRPSQMEQLCEKWKPYRSVGAWYMWRISEVKGTP 549
            GVQMLY LEE+PRPSQMEQLC KW+PYRSV +WYMWR+ E KGTP
Sbjct: 297  GVQMLYNLEELPRPSQMEQLCVKWRPYRSVASWYMWRLIEAKGTP 341


>ref|XP_004238277.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform 1 [Solanum
            lycopersicum]
          Length = 332

 Score =  327 bits (838), Expect = 1e-86
 Identities = 160/270 (59%), Positives = 206/270 (76%), Gaps = 1/270 (0%)
 Frame = -2

Query: 1211 SQSSRVLPQVI-KPLTAAGEIEAALRHLRSADHLLAAFIDSYPMPTFETHRSPFLALTRS 1035
            +Q S ++P+++ + L+  GE+E+A+ +L+S+D LL+  I++YP PT E  + PFLALT+S
Sbjct: 62   TQISTIVPRIVSRSLSYEGELESAINYLKSSDPLLSPLIETYPPPTLELFQPPFLALTKS 121

Query: 1034 ILYQQLAYKAGASIYNRFVSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKASYLYDLANK 855
            IL+QQLAYKAG+SIY RF+SLCGGE NV PD VL L+ QQL+Q+GVS RKASYL+DLA K
Sbjct: 122  ILFQQLAYKAGSSIYTRFISLCGGESNVVPDMVLGLTPQQLRQIGVSARKASYLHDLARK 181

Query: 854  YKSGILSDETVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQ 675
            Y++GILSD+++V MDD+SLFTML+MV GIGSWSVHMFMIFSLHRPDVLP+ DLG+RKGV+
Sbjct: 182  YQNGILSDKSIVDMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPIHDLGIRKGVR 241

Query: 674  MLYGLEEMPRPSQMEQLCEKWKPYRSVGAWYMWRISEVKGTPNVGATPSLEVTNVXXXXX 495
            MLYGLE++PRPSQM+QLCEKWKPYRSV +WY+WR  E KG  + G    +  +NV     
Sbjct: 242  MLYGLEDLPRPSQMDQLCEKWKPYRSVASWYIWRFVEAKGANSKGNV--VGNSNVSLQQQ 299

Query: 494  XXXXXXXXXXXXXXXXXXINGMGNLGACIW 405
                              ING+ N+GAC W
Sbjct: 300  ILSMQQQQQQQHQQFLDPINGILNVGACAW 329


>ref|XP_006305120.1| hypothetical protein CARUB_v10009489mg [Capsella rubella]
            gi|482573831|gb|EOA38018.1| hypothetical protein
            CARUB_v10009489mg [Capsella rubella]
          Length = 371

 Score =  326 bits (836), Expect = 2e-86
 Identities = 160/227 (70%), Positives = 194/227 (85%), Gaps = 2/227 (0%)
 Frame = -2

Query: 1211 SQSSRV-LPQV-IKPLTAAGEIEAALRHLRSADHLLAAFIDSYPMPTFETHRSPFLALTR 1038
            SQS  V +P++  +PLT  GE+EAA+ +LR+AD LLAA ID +P PTFE+ ++PFLAL R
Sbjct: 106  SQSRAVNVPRIQAQPLTCEGELEAAIHYLRNADPLLAALIDVHPPPTFESFKTPFLALIR 165

Query: 1037 SILYQQLAYKAGASIYNRFVSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKASYLYDLAN 858
            SILYQQLA KAG SIY+RFVS+CGGE+ V P+ VLALS Q+L+Q+GVSGRKASYL+DLA 
Sbjct: 166  SILYQQLATKAGNSIYSRFVSICGGENMVTPETVLALSPQELRQIGVSGRKASYLHDLAR 225

Query: 857  KYKSGILSDETVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRKGV 678
            KY++GILSD  ++ MD++SLFTML+MV GIGSWSVHMFMI SLHRPDVLPV+DLGVRKGV
Sbjct: 226  KYQNGILSDSAILNMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGVRKGV 285

Query: 677  QMLYGLEEMPRPSQMEQLCEKWKPYRSVGAWYMWRISEVKGTPNVGA 537
            QMLYGL+++PRPSQMEQ C KW+PYRSVG+WYMWR+ E KGTP   A
Sbjct: 286  QMLYGLDDLPRPSQMEQHCAKWRPYRSVGSWYMWRLIESKGTPRSAA 332


>gb|ESW08591.1| hypothetical protein PHAVU_009G058200g [Phaseolus vulgaris]
          Length = 366

 Score =  325 bits (834), Expect = 3e-86
 Identities = 167/276 (60%), Positives = 202/276 (73%), Gaps = 5/276 (1%)
 Frame = -2

Query: 1211 SQSSRVLPQVI-KPLTAAGEIEAALRHLRSADHLLAAFIDSYPMPTFETHRSPFLALTRS 1035
            S+   VLP+++ + L+  GE+E ALR LR+AD LL+  ID +  PTF+   +PFLALTRS
Sbjct: 91   SRGMSVLPRLVARSLSCEGEVEIALRFLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRS 150

Query: 1034 ILYQQLAYKAGASIYNRFVSLCGGEDNVRPDNVLALSTQQLKQVGVSGRKASYLYDLANK 855
            ILYQQLAYKAG SIY RF++LCGGE+ V P+ VLAL+ QQL+Q+GVSGRKASYL+DLA K
Sbjct: 151  ILYQQLAYKAGTSIYTRFIALCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARK 210

Query: 854  YKSGILSDETVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQ 675
            Y++GILSD  +V MDD+SLFTML+MV GIGSWSVHMFMIFSLHRPDVLP++DLGVRKGVQ
Sbjct: 211  YQNGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQ 270

Query: 674  MLYGLEEMPRPSQMEQLCEKWKPYRSVGAWYMWRISEVKGTPNVG---ATPSLEVTNVXX 504
            +LY LE++PRPSQM+ LCEKW+PYRSV +WYMWR  E KGTP+     AT +        
Sbjct: 271  LLYNLEDLPRPSQMDHLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGAGLQQQHHH 330

Query: 503  XXXXXXXXXXXXXXXXXXXXXINGMGNLG-ACIWGQ 399
                                 IN M NLG AC WGQ
Sbjct: 331  QHQQHEQQQQQHPPQPQLLDPINSMFNLGAACAWGQ 366

