BLASTX nr result

ID: Forsythia22_contig00009606 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00009606
         (1777 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011099624.1| PREDICTED: uncharacterized protein LOC105177...   494   e-137
ref|XP_012834115.1| PREDICTED: uncharacterized protein LOC105954...   488   e-135
ref|XP_009804698.1| PREDICTED: probable DNA-3-methyladenine glyc...   483   e-133
ref|XP_009766232.1| PREDICTED: probable DNA-3-methyladenine glyc...   452   e-124
gb|EYU40170.1| hypothetical protein MIMGU_mgv1a021549mg [Erythra...   452   e-124
ref|XP_006346951.1| PREDICTED: probable DNA-3-methyladenine glyc...   449   e-123
emb|CDP02014.1| unnamed protein product [Coffea canephora]            449   e-123
ref|XP_009766226.1| PREDICTED: probable DNA-3-methyladenine glyc...   445   e-122
ref|XP_004233519.1| PREDICTED: probable DNA-3-methyladenine glyc...   443   e-121
ref|XP_009766219.1| PREDICTED: probable DNA-3-methyladenine glyc...   440   e-120
ref|XP_007017553.1| DNA glycosylase superfamily protein [Theobro...   386   e-104
ref|XP_010263642.1| PREDICTED: uncharacterized protein LOC104601...   381   e-102
ref|XP_010649138.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   380   e-102
ref|XP_002282344.1| PREDICTED: probable DNA-3-methyladenine glyc...   379   e-102
ref|XP_010254809.1| PREDICTED: uncharacterized protein LOC104595...   376   e-101
ref|XP_006473513.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   375   e-101
ref|XP_012446557.1| PREDICTED: probable DNA-3-methyladenine glyc...   372   e-100
ref|XP_010102087.1| DNA-3-methyladenine glycosylase 1 [Morus not...   370   1e-99
ref|XP_008466558.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   370   2e-99
ref|XP_012444918.1| PREDICTED: probable DNA-3-methyladenine glyc...   370   3e-99

>ref|XP_011099624.1| PREDICTED: uncharacterized protein LOC105177997 [Sesamum indicum]
          Length = 419

 Score =  494 bits (1273), Expect = e-137
 Identities = 277/425 (65%), Positives = 311/425 (73%), Gaps = 23/425 (5%)
 Frame = -3

Query: 1625 MGEHTRTQ------PETQSQ----------DSKPPSPFRSMKSEPNLDSNSQ-SPPQP-T 1500
            MGE T  Q      PE+QSQ          D +P S     KS P+ DS+++ S PQP +
Sbjct: 1    MGEQTHIQIETLPKPESQSQPLLETPKTQQDCQPQSSLD--KSAPSSDSSARISHPQPVS 58

Query: 1499 LSTDPATDAGEFLQNSQNHSNPSKIPIRPQKIRKLSITNTD--TTP--AADEESPPPQIX 1332
            L+      A E   N QN   PSKIPIRPQKIRKLS +  D  +TP   AD+ S      
Sbjct: 59   LAESSHATATEISHNPQN---PSKIPIRPQKIRKLSTSIPDKPSTPQTTADDSSVSASSS 115

Query: 1331 XXXXXXXXXXXXXXXTVEAPTTTITTKNRRRSASQLSRPLPQIIKPLSVNGEIELAIRHL 1152
                            V  PTTT + KNRRRSASQ SR LPQ+IKPLS +GEIELAIRHL
Sbjct: 116  LALTTTTASTTTAMTPV-TPTTTHSAKNRRRSASQASRVLPQVIKPLSADGEIELAIRHL 174

Query: 1151 RSVDPLLVSLIDTXXXXXXXXXXXXFLALTKSILYQQLAYKAGAAIYTRFIALCGGEDSV 972
            R+ D LL  LIDT            F ALTKSILYQQLAYKAG +IYTRF++LCGGE+S+
Sbjct: 175  RAADALLGPLIDTHPPPQFEFHHNPFHALTKSILYQQLAYKAGTSIYTRFVSLCGGEESI 234

Query: 971  GPDTVLALSAQQLKQIGISGRKASYLYDLANKYNSGILSDESVMKMDDRSLFTMLSMVKG 792
             PD+VLALS QQLKQIG+SGRKASYLYDLANKY SGILSD++V+KMDDRSLFTMLSMVKG
Sbjct: 235  SPDSVLALSPQQLKQIGVSGRKASYLYDLANKYKSGILSDDTVVKMDDRSLFTMLSMVKG 294

Query: 791  IGSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQLLYGLDDLPRPSQMEQLCEKWKPYRSVG 612
            IGSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQLLYGL++LPRPSQMEQLCEKWKPYRSVG
Sbjct: 295  IGSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQLLYGLEELPRPSQMEQLCEKWKPYRSVG 354

Query: 611  AWYMWRFVEGKGTPAAGATMALEGNIVQPLQQIEPQQDGRQH-HQLQFLESVNGIGNLGA 435
            AWYMWRFVEGKG P + +   L+G++VQPLQQIEPQQDG QH HQLQF+E VNGIGN+GA
Sbjct: 355  AWYMWRFVEGKGAPTSNSGGVLDGSVVQPLQQIEPQQDGHQHQHQLQFVEPVNGIGNIGA 414

Query: 434  CIWGQ 420
            CIW Q
Sbjct: 415  CIWNQ 419


>ref|XP_012834115.1| PREDICTED: uncharacterized protein LOC105954973 [Erythranthe
            guttatus]
          Length = 424

 Score =  488 bits (1255), Expect = e-135
 Identities = 272/428 (63%), Positives = 306/428 (71%), Gaps = 26/428 (6%)
 Frame = -3

Query: 1625 MGEHTRTQ------PETQSQ-----------DSKPPSPFRSMKSEPNLDSNSQSPPQPTL 1497
            MG+ T TQ      PE++S            DSKP S    + + P+ DSN Q     T 
Sbjct: 1    MGDQTHTQIETRPLPESESHSQIVEIPNTLHDSKPQSS--PLTTAPDSDSNPQICHPQTA 58

Query: 1496 STDPAT--DAGEFLQNSQNHSNPSKIPIRPQKIRKLSIT--NTDTTPAADEESPPPQIXX 1329
            S   A+   A    + S N  NPSKIPIRPQKIRKLS T   + T  +  +E+       
Sbjct: 59   SVAEASLAAAAAATEISNNSQNPSKIPIRPQKIRKLSTTAGKSSTPQSTADEASVSASPS 118

Query: 1328 XXXXXXXXXXXXXXTVEAPTTTITTKNRRRSASQLSRPLPQIIKPLSVNGEIELAIRHLR 1149
                          +   P+TT T KNRRRSASQ SR +PQIIKPLS +GEIELAIRHLR
Sbjct: 119  LPLTPAAGAASTVASPATPSTTHTAKNRRRSASQASRAMPQIIKPLSADGEIELAIRHLR 178

Query: 1148 SVDPLLVSLIDTXXXXXXXXXXXXFLALTKSILYQQLAYKAGAAIYTRFIALCGGEDSVG 969
            +VDPLL  LIDT            FLALTKSILYQQLA KAG +IYTRF++LCG E+SV 
Sbjct: 179  AVDPLLGPLIDTHLPFQFDSQQPPFLALTKSILYQQLACKAGTSIYTRFVSLCGAEESVC 238

Query: 968  PDTVLALSAQQLKQIGISGRKASYLYDLANKYNSGILSDESVMKMDDRSLFTMLSMVKGI 789
            PDTVL+LS QQLK IG+SGRKASYLYDLANKY SGILSD++V+KMDDRSLFTMLSMVKGI
Sbjct: 239  PDTVLSLSTQQLKAIGVSGRKASYLYDLANKYKSGILSDDTVVKMDDRSLFTMLSMVKGI 298

Query: 788  GSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQLLYGLDDLPRPSQMEQLCEKWKPYRSVGA 609
            GSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQ+L GLD+LPRPSQMEQLCEKWKPYRSVGA
Sbjct: 299  GSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQMLNGLDELPRPSQMEQLCEKWKPYRSVGA 358

Query: 608  WYMWRFVEGKGTPAAGATMALEGNIVQPLQQIEPQQDGRQH-----HQLQFLESVNGIGN 444
            WYMWRFVEGKG  AAG+ +ALE  +VQPLQQ+EPQQDG QH     HQLQF+E VNGIGN
Sbjct: 359  WYMWRFVEGKG--AAGSGVALEDGVVQPLQQVEPQQDGHQHQHQLQHQLQFVEPVNGIGN 416

Query: 443  LGACIWGQ 420
            +GACIW Q
Sbjct: 417  MGACIWNQ 424


>ref|XP_009804698.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 [Nicotiana
            sylvestris]
          Length = 363

 Score =  483 bits (1242), Expect = e-133
 Identities = 264/402 (65%), Positives = 294/402 (73%)
 Frame = -3

Query: 1625 MGEHTRTQPETQSQDSKPPSPFRSMKSEPNLDSNSQSPPQPTLSTDPATDAGEFLQNSQN 1446
            MGE T+TQ  T+ Q    PS      S+PN DS        TLST+P  D         N
Sbjct: 1    MGEQTQTQTITEPQT---PS-----LSQPNSDST-------TLSTNPPVDI------PPN 39

Query: 1445 HSNPSKIPIRPQKIRKLSITNTDTTPAADEESPPPQIXXXXXXXXXXXXXXXXTVEAPTT 1266
             SNPSKIPIRPQKIRKLS T   T+P +    P                     V +   
Sbjct: 40   PSNPSKIPIRPQKIRKLSST---TSPQSTNPKPADS--------------SQSVVTSNGK 82

Query: 1265 TITTKNRRRSASQLSRPLPQIIKPLSVNGEIELAIRHLRSVDPLLVSLIDTXXXXXXXXX 1086
               TKNRRRSASQL+R LPQ+IKPLS NGEIE A+RHLR  DPLL SLIDT         
Sbjct: 83   VTITKNRRRSASQLTRVLPQVIKPLSANGEIENALRHLRLADPLLCSLIDTLPLPAFDSH 142

Query: 1085 XXXFLALTKSILYQQLAYKAGAAIYTRFIALCGGEDSVGPDTVLALSAQQLKQIGISGRK 906
               FLAL KSILYQQLAYKAG +IYTRF++LCG ED+V PD VL+LSAQQLKQIGISGRK
Sbjct: 143  QLPFLALCKSILYQQLAYKAGTSIYTRFVSLCGSEDAVCPDVVLSLSAQQLKQIGISGRK 202

Query: 905  ASYLYDLANKYNSGILSDESVMKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPV 726
            ASYLYDLANKY +GIL+D++V+KMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPV
Sbjct: 203  ASYLYDLANKYKTGILADDTVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPV 262

Query: 725  SDLGVRKGVQLLYGLDDLPRPSQMEQLCEKWKPYRSVGAWYMWRFVEGKGTPAAGATMAL 546
            SDLGVRKGVQ+LYGL++LPRPSQMEQLCEKW+PYRS+GAWYMWRF+EGKGTPA  A  A+
Sbjct: 263  SDLGVRKGVQMLYGLEELPRPSQMEQLCEKWRPYRSIGAWYMWRFIEGKGTPATAAA-AM 321

Query: 545  EGNIVQPLQQIEPQQDGRQHHQLQFLESVNGIGNLGACIWGQ 420
            EG  VQPLQQIEPQQ   Q HQLQ LE ++GIG+LGACIWGQ
Sbjct: 322  EGGSVQPLQQIEPQQQPEQQHQLQLLEPIDGIGSLGACIWGQ 363


>ref|XP_009766232.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 isoform X3
            [Nicotiana sylvestris]
          Length = 360

 Score =  452 bits (1163), Expect = e-124
 Identities = 252/406 (62%), Positives = 283/406 (69%), Gaps = 4/406 (0%)
 Frame = -3

Query: 1625 MGEHTRTQ--PETQSQDSKPPSPFRSMKSEPNLDSNSQSPPQPTLSTDPATDAGEFLQNS 1452
            MGE T+ Q  PETQ+    PP      +S PN DS   S     L  +P           
Sbjct: 1    MGEQTQVQAKPETQT----PPQSQLQPRSPPNSDSTLVSNSPVDLPPNP----------- 45

Query: 1451 QNHSNPSKIPIRPQKIRKLSITNTDTTPAADEESPPPQIXXXXXXXXXXXXXXXXTVEAP 1272
               SNPSKIPIRPQKIRKLS T +  TP     S  P                       
Sbjct: 46   ---SNPSKIPIRPQKIRKLSCTPSSKTPQTTASSATPA---------------------- 80

Query: 1271 TTTITTKNRRRSA--SQLSRPLPQIIKPLSVNGEIELAIRHLRSVDPLLVSLIDTXXXXX 1098
                +TK+RR+SA  S  SR LPQIIKPLS NGEI+ A+ HLRS DPLL SLIDT     
Sbjct: 81   ----STKSRRKSAPKSSSSRGLPQIIKPLSANGEIDNALLHLRSADPLLGSLIDTLPVPQ 136

Query: 1097 XXXXXXXFLALTKSILYQQLAYKAGAAIYTRFIALCGGEDSVGPDTVLALSAQQLKQIGI 918
                   FLAL+KSILYQQLAYKAG +IYTRF++LCGGED+V PD VL+LSAQQLKQIG+
Sbjct: 137  FESHHSPFLALSKSILYQQLAYKAGTSIYTRFVSLCGGEDAVCPDVVLSLSAQQLKQIGV 196

Query: 917  SGRKASYLYDLANKYNSGILSDESVMKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPD 738
            SGRKASYLYDLANKY +GIL D++++KMDD+SLFTMLSMVKGIGSWSVHMFMIFSLHRPD
Sbjct: 197  SGRKASYLYDLANKYKNGILCDDALVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPD 256

Query: 737  VLPVSDLGVRKGVQLLYGLDDLPRPSQMEQLCEKWKPYRSVGAWYMWRFVEGKGTPAAGA 558
            VLPVSDLGVRKGVQLLYGL++LPRPSQMEQLCEKW+PYRS GAWYMWRFVE KGTP   A
Sbjct: 257  VLPVSDLGVRKGVQLLYGLEELPRPSQMEQLCEKWRPYRSAGAWYMWRFVEWKGTPTTAA 316

Query: 557  TMALEGNIVQPLQQIEPQQDGRQHHQLQFLESVNGIGNLGACIWGQ 420
              A++   VQPLQQI+  Q+  Q HQLQ LE +NGIGNLGACIW Q
Sbjct: 317  A-AIDAGNVQPLQQIQTGQE-TQQHQLQLLEPINGIGNLGACIWSQ 360


>gb|EYU40170.1| hypothetical protein MIMGU_mgv1a021549mg [Erythranthe guttata]
          Length = 407

 Score =  452 bits (1163), Expect = e-124
 Identities = 257/422 (60%), Positives = 289/422 (68%), Gaps = 26/422 (6%)
 Frame = -3

Query: 1625 MGEHTRTQ------PETQSQ-----------DSKPPSPFRSMKSEPNLDSNSQSPPQPTL 1497
            MG+ T TQ      PE++S            DSKP S    + + P+ DSN Q     T 
Sbjct: 1    MGDQTHTQIETRPLPESESHSQIVEIPNTLHDSKPQSS--PLTTAPDSDSNPQICHPQTA 58

Query: 1496 STDPAT--DAGEFLQNSQNHSNPSKIPIRPQKIRKLSIT--NTDTTPAADEESPPPQIXX 1329
            S   A+   A    + S N  NPSKIPIRPQKIRKLS T   + T  +  +E+       
Sbjct: 59   SVAEASLAAAAAATEISNNSQNPSKIPIRPQKIRKLSTTAGKSSTPQSTADEASVSASPS 118

Query: 1328 XXXXXXXXXXXXXXTVEAPTTTITTKNRRRSASQLSRPLPQIIKPLSVNGEIELAIRHLR 1149
                          +   P+TT T KNRRRSASQ SR +PQIIKPLS +GEIELAIRHLR
Sbjct: 119  LPLTPAAGAASTVASPATPSTTHTAKNRRRSASQASRAMPQIIKPLSADGEIELAIRHLR 178

Query: 1148 SVDPLLVSLIDTXXXXXXXXXXXXFLALTKSILYQQLAYKAGAAIYTRFIALCGGEDSVG 969
            +VDPLL  LIDT            FLALTKSILYQQLA KAG +IYTRF++LCG E+SV 
Sbjct: 179  AVDPLLGPLIDTHLPFQFDSQQPPFLALTKSILYQQLACKAGTSIYTRFVSLCGAEESVC 238

Query: 968  PDTVLALSAQQLKQIGISGRKASYLYDLANKYNSGILSDESVMKMDDRSLFTMLSMVKGI 789
            PDTVL+LS QQLK IG+SGRKASYLYDLANKY SGILSD++V+KMDDRSLFTMLSMVKGI
Sbjct: 239  PDTVLSLSTQQLKAIGVSGRKASYLYDLANKYKSGILSDDTVVKMDDRSLFTMLSMVKGI 298

Query: 788  GSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQLLYGLDDLPRPSQMEQLCEKWKPYRSVGA 609
            GSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQ+L GLD+LPRPSQMEQLCEKWKPYRSVGA
Sbjct: 299  GSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQMLNGLDELPRPSQMEQLCEKWKPYRSVGA 358

Query: 608  WYMWRFVEGKGTPAAGATMALEGNIVQPLQQIEPQQDGRQH-----HQLQFLESVNGIGN 444
            WYMWRFVEGKG   +G              Q+EPQQDG QH     HQLQF+E VNGIGN
Sbjct: 359  WYMWRFVEGKGAAGSGV-------------QVEPQQDGHQHQHQLQHQLQFVEPVNGIGN 405

Query: 443  LG 438
            +G
Sbjct: 406  MG 407


>ref|XP_006346951.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2-like [Solanum
            tuberosum]
          Length = 362

 Score =  449 bits (1155), Expect = e-123
 Identities = 248/403 (61%), Positives = 283/403 (70%), Gaps = 1/403 (0%)
 Frame = -3

Query: 1625 MGEHTRTQPETQSQDSKPPSPFRSMKSEPNLDSNSQSPPQPTLSTDPATDAGEFLQNSQN 1446
            M E T T P+ Q Q    P P     S    +S    PP P                   
Sbjct: 1    MSEQTLTPPQPQPQPQPLPQPLPISDSTLVSNSPVDLPPNP------------------- 41

Query: 1445 HSNPSKIPIRPQKIRKLSITNTDTTPAADEESPPPQIXXXXXXXXXXXXXXXXTVEAPTT 1266
             SNPSKIPIRPQKIRKLS     +TP+++ ++P   +                   A + 
Sbjct: 42   -SNPSKIPIRPQKIRKLS-----STPSSNGKTPETTVPSAST--------------ATSG 81

Query: 1265 TIT-TKNRRRSASQLSRPLPQIIKPLSVNGEIELAIRHLRSVDPLLVSLIDTXXXXXXXX 1089
             IT TKNRR+SA + SR LPQIIKPLS +GEI+ A++HLRSVDPLLVSLIDT        
Sbjct: 82   AITVTKNRRKSAPKSSRVLPQIIKPLSADGEIDNALQHLRSVDPLLVSLIDTLPSPQFEL 141

Query: 1088 XXXXFLALTKSILYQQLAYKAGAAIYTRFIALCGGEDSVGPDTVLALSAQQLKQIGISGR 909
                FLAL+KSILYQQLAYKAG +IYTRF++LCGGED+V PD VL+LS QQLKQ+GISGR
Sbjct: 142  HHSAFLALSKSILYQQLAYKAGTSIYTRFVSLCGGEDAVCPDIVLSLSPQQLKQVGISGR 201

Query: 908  KASYLYDLANKYNSGILSDESVMKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLP 729
            KASYL+DLANKY SGILSDE+++KMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLP
Sbjct: 202  KASYLHDLANKYRSGILSDETLVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLP 261

Query: 728  VSDLGVRKGVQLLYGLDDLPRPSQMEQLCEKWKPYRSVGAWYMWRFVEGKGTPAAGATMA 549
            VSDLGVRKGVQLLYGL++LPRPSQMEQLC+KWKPYRS GAWYMWR VEGKGTP   A   
Sbjct: 262  VSDLGVRKGVQLLYGLEELPRPSQMEQLCDKWKPYRSAGAWYMWRLVEGKGTPTTAAA-P 320

Query: 548  LEGNIVQPLQQIEPQQDGRQHHQLQFLESVNGIGNLGACIWGQ 420
            ++G  VQ LQQ   +Q+  Q HQLQ LE +NGI NLGACIW Q
Sbjct: 321  IDGGNVQALQQFPTEQE-TQQHQLQLLEPINGIENLGACIWSQ 362


>emb|CDP02014.1| unnamed protein product [Coffea canephora]
          Length = 337

 Score =  449 bits (1154), Expect = e-123
 Identities = 246/402 (61%), Positives = 272/402 (67%)
 Frame = -3

Query: 1625 MGEHTRTQPETQSQDSKPPSPFRSMKSEPNLDSNSQSPPQPTLSTDPATDAGEFLQNSQN 1446
            MGE T+ Q +TQSQ   P S      S+PN D  S + P P        D       SQ 
Sbjct: 1    MGEQTQVQTQTQSQS--PAS------SQPNSDVTSATQPTPVADVSINADV------SQK 46

Query: 1445 HSNPSKIPIRPQKIRKLSITNTDTTPAADEESPPPQIXXXXXXXXXXXXXXXXTVEAPTT 1266
             SNPSKIPIRPQKIRKLS                                       PT+
Sbjct: 47   PSNPSKIPIRPQKIRKLSSN-------------------------------------PTS 69

Query: 1265 TITTKNRRRSASQLSRPLPQIIKPLSVNGEIELAIRHLRSVDPLLVSLIDTXXXXXXXXX 1086
            TI T                IIKPLS  GEI  A+ HLR VDPLL +LIDT         
Sbjct: 70   TIATT--------------PIIKPLSAEGEINAALHHLRVVDPLLATLIDTHQPPAFESH 115

Query: 1085 XXXFLALTKSILYQQLAYKAGAAIYTRFIALCGGEDSVGPDTVLALSAQQLKQIGISGRK 906
               FLALTKSILYQQLAYKAG +IY RF+ALCGGE +V PD VL LSAQ+LKQ+G+SGRK
Sbjct: 116  HSPFLALTKSILYQQLAYKAGTSIYNRFVALCGGETAVLPDNVLGLSAQELKQVGVSGRK 175

Query: 905  ASYLYDLANKYNSGILSDESVMKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPV 726
            ASYLYDLANKY SGILSDE+V+KMDD+SLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPV
Sbjct: 176  ASYLYDLANKYKSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPV 235

Query: 725  SDLGVRKGVQLLYGLDDLPRPSQMEQLCEKWKPYRSVGAWYMWRFVEGKGTPAAGATMAL 546
            SDLGVRKGVQ+LYGL++LPRPSQMEQLCEKW+PYRSVGAWYMWRFVEGKG+  A    ++
Sbjct: 236  SDLGVRKGVQMLYGLEELPRPSQMEQLCEKWRPYRSVGAWYMWRFVEGKGSQNASVAPSV 295

Query: 545  EGNIVQPLQQIEPQQDGRQHHQLQFLESVNGIGNLGACIWGQ 420
            EG  VQPLQQIEPQQD +Q HQLQ LE +NG+GNLGACIWGQ
Sbjct: 296  EGANVQPLQQIEPQQDAQQQHQLQLLEPINGMGNLGACIWGQ 337


>ref|XP_009766226.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 isoform X2
            [Nicotiana sylvestris]
          Length = 368

 Score =  445 bits (1144), Expect = e-122
 Identities = 252/414 (60%), Positives = 283/414 (68%), Gaps = 12/414 (2%)
 Frame = -3

Query: 1625 MGEHTRTQ--PETQSQDSKPPSPFRSMKSEPNLDSNSQSPPQPTLSTDPATDAGEFLQNS 1452
            MGE T+ Q  PETQ+    PP      +S PN DS   S     L  +P           
Sbjct: 1    MGEQTQVQAKPETQT----PPQSQLQPRSPPNSDSTLVSNSPVDLPPNP----------- 45

Query: 1451 QNHSNPSKIPIRPQKIRKLSITNTDTTPAADEESPPPQIXXXXXXXXXXXXXXXXTVEAP 1272
               SNPSKIPIRPQKIRKLS T +  TP     S  P                       
Sbjct: 46   ---SNPSKIPIRPQKIRKLSCTPSSKTPQTTASSATPA---------------------- 80

Query: 1271 TTTITTKNRRRSA--SQLSRPLPQIIKPLSVNGEIELAIRHLRSVDPLLVSLIDTXXXXX 1098
                +TK+RR+SA  S  SR LPQIIKPLS NGEI+ A+ HLRS DPLL SLIDT     
Sbjct: 81   ----STKSRRKSAPKSSSSRGLPQIIKPLSANGEIDNALLHLRSADPLLGSLIDTLPVPQ 136

Query: 1097 XXXXXXXFLALTKSILYQQLAYKAGAAIYTRFIALCGGEDSVGPDTVLALSAQQLKQIGI 918
                   FLAL+KSILYQQLAYKAG +IYTRF++LCGGED+V PD VL+LSAQQLKQIG+
Sbjct: 137  FESHHSPFLALSKSILYQQLAYKAGTSIYTRFVSLCGGEDAVCPDVVLSLSAQQLKQIGV 196

Query: 917  SGRKASYLYDLANKYNSGILSDESVMKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPD 738
            SGRKASYLYDLANKY +GIL D++++KMDD+SLFTMLSMVKGIGSWSVHMFMIFSLHRPD
Sbjct: 197  SGRKASYLYDLANKYKNGILCDDALVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPD 256

Query: 737  VLPVSDLGVRKGVQLLYGLDDLPRPSQMEQLCEKWKPYRSVGAWYMWRFVEGKGTPAAGA 558
            VLPVSDLGVRKGVQLLYGL++LPRPSQMEQLCEKW+PYRS GAWYMWRFVE KGTP   A
Sbjct: 257  VLPVSDLGVRKGVQLLYGLEELPRPSQMEQLCEKWRPYRSAGAWYMWRFVEWKGTPTTAA 316

Query: 557  TMALEGNIVQPLQQIEPQQDGRQHHQLQFLESVNGIGNLG--------ACIWGQ 420
              A++   VQPLQQI+  Q+  Q HQLQ LE +NGIGNLG        ACIW Q
Sbjct: 317  A-AIDAGNVQPLQQIQTGQE-TQQHQLQLLEPINGIGNLGYLTIFRLKACIWSQ 368


>ref|XP_004233519.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 [Solanum
            lycopersicum]
          Length = 353

 Score =  443 bits (1140), Expect = e-121
 Identities = 238/374 (63%), Positives = 278/374 (74%), Gaps = 2/374 (0%)
 Frame = -3

Query: 1535 LDSNSQSPPQPT-LSTDPATDAGEFLQNSQNHSNPSKIPIRPQKIRKLSITNTDTTPAAD 1359
            +   +Q+PPQP   S+D    +   +    N SNPSKIPIRPQKIRKLS     +TP+++
Sbjct: 1    MSEQTQTPPQPLPTSSDSTLVSNSPVDLPPNPSNPSKIPIRPQKIRKLS-----STPSSN 55

Query: 1358 EESPPPQIXXXXXXXXXXXXXXXXTVEAPTTTIT-TKNRRRSASQLSRPLPQIIKPLSVN 1182
             ++P   +                   A +  IT TKNRR++A + SR  PQIIKPLS +
Sbjct: 56   GKTPETAVPSAST--------------ATSGAITVTKNRRKTAPKSSRVSPQIIKPLSAD 101

Query: 1181 GEIELAIRHLRSVDPLLVSLIDTXXXXXXXXXXXXFLALTKSILYQQLAYKAGAAIYTRF 1002
            GEI+ A++HLRSVDPLLVSLIDT            FLAL+KSILYQQLAYKAG +IYTRF
Sbjct: 102  GEIDNALQHLRSVDPLLVSLIDTLPSPQFELHHSAFLALSKSILYQQLAYKAGTSIYTRF 161

Query: 1001 IALCGGEDSVGPDTVLALSAQQLKQIGISGRKASYLYDLANKYNSGILSDESVMKMDDRS 822
            ++LCGGED+V PD VLALS QQLKQ+GISGRKASYL+DLANKY SGILSDE+++KMDDRS
Sbjct: 162  VSLCGGEDAVCPDIVLALSPQQLKQVGISGRKASYLHDLANKYKSGILSDETLVKMDDRS 221

Query: 821  LFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQLLYGLDDLPRPSQMEQLC 642
            LF MLSMVKGIGSWSVHMFMIFSLHRPD+LPVSDLGVRKGVQLLYGL++LPRPSQMEQLC
Sbjct: 222  LFAMLSMVKGIGSWSVHMFMIFSLHRPDILPVSDLGVRKGVQLLYGLEELPRPSQMEQLC 281

Query: 641  EKWKPYRSVGAWYMWRFVEGKGTPAAGATMALEGNIVQPLQQIEPQQDGRQHHQLQFLES 462
            +KWKPYRS GAWYMWR VEGKGTP   A   ++G   Q LQQ   +Q+  Q HQLQ LE 
Sbjct: 282  DKWKPYRSAGAWYMWRLVEGKGTPTIAAA-PIDGGNAQALQQFPVEQE-TQQHQLQLLEP 339

Query: 461  VNGIGNLGACIWGQ 420
            +NGI NLGACIW Q
Sbjct: 340  INGIENLGACIWSQ 353


>ref|XP_009766219.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 isoform X1
            [Nicotiana sylvestris]
          Length = 395

 Score =  440 bits (1132), Expect = e-120
 Identities = 248/403 (61%), Positives = 279/403 (69%), Gaps = 4/403 (0%)
 Frame = -3

Query: 1625 MGEHTRTQ--PETQSQDSKPPSPFRSMKSEPNLDSNSQSPPQPTLSTDPATDAGEFLQNS 1452
            MGE T+ Q  PETQ+    PP      +S PN DS   S     L  +P           
Sbjct: 1    MGEQTQVQAKPETQT----PPQSQLQPRSPPNSDSTLVSNSPVDLPPNP----------- 45

Query: 1451 QNHSNPSKIPIRPQKIRKLSITNTDTTPAADEESPPPQIXXXXXXXXXXXXXXXXTVEAP 1272
               SNPSKIPIRPQKIRKLS T +  TP     S  P                       
Sbjct: 46   ---SNPSKIPIRPQKIRKLSCTPSSKTPQTTASSATPA---------------------- 80

Query: 1271 TTTITTKNRRRSA--SQLSRPLPQIIKPLSVNGEIELAIRHLRSVDPLLVSLIDTXXXXX 1098
                +TK+RR+SA  S  SR LPQIIKPLS NGEI+ A+ HLRS DPLL SLIDT     
Sbjct: 81   ----STKSRRKSAPKSSSSRGLPQIIKPLSANGEIDNALLHLRSADPLLGSLIDTLPVPQ 136

Query: 1097 XXXXXXXFLALTKSILYQQLAYKAGAAIYTRFIALCGGEDSVGPDTVLALSAQQLKQIGI 918
                   FLAL+KSILYQQLAYKAG +IYTRF++LCGGED+V PD VL+LSAQQLKQIG+
Sbjct: 137  FESHHSPFLALSKSILYQQLAYKAGTSIYTRFVSLCGGEDAVCPDVVLSLSAQQLKQIGV 196

Query: 917  SGRKASYLYDLANKYNSGILSDESVMKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPD 738
            SGRKASYLYDLANKY +GIL D++++KMDD+SLFTMLSMVKGIGSWSVHMFMIFSLHRPD
Sbjct: 197  SGRKASYLYDLANKYKNGILCDDALVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPD 256

Query: 737  VLPVSDLGVRKGVQLLYGLDDLPRPSQMEQLCEKWKPYRSVGAWYMWRFVEGKGTPAAGA 558
            VLPVSDLGVRKGVQLLYGL++LPRPSQMEQLCEKW+PYRS GAWYMWRFVE KGTP   A
Sbjct: 257  VLPVSDLGVRKGVQLLYGLEELPRPSQMEQLCEKWRPYRSAGAWYMWRFVEWKGTPTTAA 316

Query: 557  TMALEGNIVQPLQQIEPQQDGRQHHQLQFLESVNGIGNLGACI 429
              A++   VQPLQQI+  Q+  Q HQLQ LE +NGIGNLG  I
Sbjct: 317  A-AIDAGNVQPLQQIQTGQE-TQQHQLQLLEPINGIGNLGLLI 357


>ref|XP_007017553.1| DNA glycosylase superfamily protein [Theobroma cacao]
            gi|508722881|gb|EOY14778.1| DNA glycosylase superfamily
            protein [Theobroma cacao]
          Length = 397

 Score =  386 bits (992), Expect = e-104
 Identities = 216/414 (52%), Positives = 269/414 (64%), Gaps = 12/414 (2%)
 Frame = -3

Query: 1625 MGEHTRTQ--PETQSQ---DSKPPSPFRSMKS----EPNLDSNSQSPPQPTLSTDPATDA 1473
            MGE TR Q  P+ QSQ   DS   +P +          N ++ S +    T+++   T A
Sbjct: 1    MGEQTRAQLQPQAQSQPQNDSSSSTPTQEQSQGQTQTQNPNNTSNAAVSTTVTSAVVTSA 60

Query: 1472 GEFLQN--SQNHSNPSKIPIRPQKIRKLSITNTDTTPAADEESPPPQIXXXXXXXXXXXX 1299
               L N   Q  S PSKIP RP+KIRKLS      T A+ + +                 
Sbjct: 61   PTELTNVPPQTSSPPSKIPFRPRKIRKLSPDPNSDTNASQQATTSAT------------- 107

Query: 1298 XXXXTVEAPTTTITTKNRRRSASQLSRPLPQII-KPLSVNGEIELAIRHLRSVDPLLVSL 1122
                  E P T   T   + +  +    +P+I+ + LS  GE+E AIRHLR+ DPLL SL
Sbjct: 108  ---SATEPPKTVAKTPKTKLTQHRALAVVPRIMARSLSCEGEVETAIRHLRNADPLLASL 164

Query: 1121 IDTXXXXXXXXXXXXFLALTKSILYQQLAYKAGAAIYTRFIALCGGEDSVGPDTVLALSA 942
            ID             FLALT+SILYQQLA+KAG +IY RFIALCGGE+ V P+TVL+L+A
Sbjct: 165  IDIHPPPTFDTFHTPFLALTRSILYQQLAFKAGTSIYNRFIALCGGENGVVPETVLSLTA 224

Query: 941  QQLKQIGISGRKASYLYDLANKYNSGILSDESVMKMDDRSLFTMLSMVKGIGSWSVHMFM 762
            QQL+QIG+SGRKASYL+DLA KY +GILSD +++ MDD+SLFTML+MV GIGSWSVHMFM
Sbjct: 225  QQLRQIGVSGRKASYLHDLARKYQTGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFM 284

Query: 761  IFSLHRPDVLPVSDLGVRKGVQLLYGLDDLPRPSQMEQLCEKWKPYRSVGAWYMWRFVEG 582
            IFSLHRPDVLP++DLGVRKGVQLLY L++LPRPSQM+QLCEKW+PYRSV +WY+WRFVE 
Sbjct: 285  IFSLHRPDVLPINDLGVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVASWYLWRFVEA 344

Query: 581  KGTPAAGATMALEGNIVQPLQQIEPQQDGRQHHQLQFLESVNGIGNLGACIWGQ 420
            KG P++ A +A  G  + P QQ E QQ  +   Q Q L+ +N I NLGAC WGQ
Sbjct: 345  KGAPSSAAAVA-AGASLPPPQQEEQQQHQQHQQQPQLLDPINSILNLGACAWGQ 397


>ref|XP_010263642.1| PREDICTED: uncharacterized protein LOC104601852 [Nelumbo nucifera]
          Length = 425

 Score =  381 bits (978), Expect = e-102
 Identities = 215/419 (51%), Positives = 266/419 (63%), Gaps = 11/419 (2%)
 Frame = -3

Query: 1643 SITFLAMGEHTR----------TQPETQSQDSKPPSPFRSMKSEPNLDSNSQSPPQPTLS 1494
            S++   MGE T+          +Q + QSQ    P P     + P L   + +PPQ    
Sbjct: 19   SVSLSYMGEQTQPPTQTQIQSQSQAQAQSQPLPLPPPPPPPPAPPLLHDITTNPPQLVAP 78

Query: 1493 TDPATDAGEFLQNSQNHSNPSKIPIRPQKIRKLSITNTDTTPAADEESPPPQIXXXXXXX 1314
              P T A    QNS   ++ +KIP RP+KIRK       T+     ++   +I       
Sbjct: 79   APPTTTASSAPQNS---ASSTKIPFRPRKIRK-------TSSDVSSDNSDNKIVDGECKT 128

Query: 1313 XXXXXXXXXTVEAPTTTITTKNRRRSASQLSRPLPQII-KPLSVNGEIELAIRHLRSVDP 1137
                           TT + K  R  A Q+ R +P+++ + LS  GE+ LA++HLR+ DP
Sbjct: 129  TATNGDHKTNNNTALTTTSNKKSRIVAKQV-RVVPRVVARTLSCEGEVALALQHLRNSDP 187

Query: 1136 LLVSLIDTXXXXXXXXXXXXFLALTKSILYQQLAYKAGAAIYTRFIALCGGEDSVGPDTV 957
             L  LID             FLALTKSILYQQLAYKAG +IYTRF++LCGGE  V P+ V
Sbjct: 188  QLARLIDIHQPPTFDSFHPPFLALTKSILYQQLAYKAGTSIYTRFVSLCGGEAGVVPEAV 247

Query: 956  LALSAQQLKQIGISGRKASYLYDLANKYNSGILSDESVMKMDDRSLFTMLSMVKGIGSWS 777
            LALS QQL+QIG+SGRKASYL+DLANKY +GILSD S++ MDD+SLFTML+MVKGIGSWS
Sbjct: 248  LALSPQQLRQIGVSGRKASYLHDLANKYRNGILSDASIVDMDDKSLFTMLTMVKGIGSWS 307

Query: 776  VHMFMIFSLHRPDVLPVSDLGVRKGVQLLYGLDDLPRPSQMEQLCEKWKPYRSVGAWYMW 597
            VHMFMIFSLHRPDVLPV DLGVRKGVQLLYGL++LPRPSQMEQLCEKW+PYRSV +WYMW
Sbjct: 308  VHMFMIFSLHRPDVLPVGDLGVRKGVQLLYGLEELPRPSQMEQLCEKWRPYRSVASWYMW 367

Query: 596  RFVEGKGTPAAGATMALEGNIVQPLQQIEPQQDGRQHHQLQFLESVNGIGNLGACIWGQ 420
            RF E KG PA+ A +A+  +  Q L    PQQ  +     Q ++ +NGI NLGAC WGQ
Sbjct: 368  RFAEAKGAPASAAAVAVGVSQQQQLPP-PPQQQQQPPPPPQLIDPMNGIANLGACTWGQ 425


>ref|XP_010649138.1| PREDICTED: DNA-3-methyladenine glycosylase 1 [Vitis vinifera]
          Length = 384

 Score =  380 bits (977), Expect = e-102
 Identities = 211/384 (54%), Positives = 254/384 (66%), Gaps = 3/384 (0%)
 Frame = -3

Query: 1562 FRSMKSEPN---LDSNSQSPPQPTLSTDPATDAGEFLQNSQNHSNPSKIPIRPQKIRKLS 1392
            F S   EP    L   S +  + T++T  A D       S   S+ SK+P R +KIRK+S
Sbjct: 30   FHSESHEPFTTLLQPTSTTEAESTITTVTADDI------SLQASSSSKLPFRSRKIRKIS 83

Query: 1391 ITNTDTTPAADEESPPPQIXXXXXXXXXXXXXXXXTVEAPTTTITTKNRRRSASQLSRPL 1212
              +  T   +D +S P                     E        +  +R+A+Q +  L
Sbjct: 84   --SAATPSGSDGKSEPVS-------------------EDDLLKGGNRAWKRNAAQSTAAL 122

Query: 1211 PQIIKPLSVNGEIELAIRHLRSVDPLLVSLIDTXXXXXXXXXXXXFLALTKSILYQQLAY 1032
            P I+KPLS  GE+++A+RHL   DPLL +LI+T            FLAL KSILYQQLAY
Sbjct: 123  PTIVKPLSCEGELDVALRHLTKSDPLLAALINTHQPPTFDSCHPPFLALAKSILYQQLAY 182

Query: 1031 KAGAAIYTRFIALCGGEDSVGPDTVLALSAQQLKQIGISGRKASYLYDLANKYNSGILSD 852
            KA  +IYTRF+ALCGGE  V PD VLALS  QL+QIG+SGRKA YL+DLA+KY +GILSD
Sbjct: 183  KAATSIYTRFVALCGGEAGVVPDAVLALSPSQLRQIGVSGRKAGYLHDLASKYKTGILSD 242

Query: 851  ESVMKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQLLYGLDDL 672
             S+M MDD+SLFTML+MVKGIGSWSVHMFMIFSLHRPDVLPV D+GVRKGVQ LYGL++L
Sbjct: 243  SSIMGMDDKSLFTMLTMVKGIGSWSVHMFMIFSLHRPDVLPVGDVGVRKGVQFLYGLEEL 302

Query: 671  PRPSQMEQLCEKWKPYRSVGAWYMWRFVEGKGTPAAGATMALEGNIVQPLQQIEPQQDGR 492
            PRPSQMEQLCEKWKPYRSVG+WYMWRFVE KG P A A +AL        QQ + QQ  +
Sbjct: 303  PRPSQMEQLCEKWKPYRSVGSWYMWRFVEAKGAPPARAAVALVDGATSEQQQQQEQQ--Q 360

Query: 491  QHHQLQFLESVNGIGNLGACIWGQ 420
            Q  QLQ ++ +NGI NLGACIWGQ
Sbjct: 361  QPQQLQLVDPINGIVNLGACIWGQ 384


>ref|XP_002282344.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 [Vitis
            vinifera]
          Length = 363

 Score =  379 bits (974), Expect = e-102
 Identities = 209/404 (51%), Positives = 263/404 (65%), Gaps = 2/404 (0%)
 Frame = -3

Query: 1625 MGEHTRTQPETQSQDSKPPSPFRSMKSEPNLDSNSQSPPQPTLSTDPATDAGEFLQNSQN 1446
            MGEH +T P+ Q  +    +   +        +++ +    + ST+ AT A       +N
Sbjct: 2    MGEHAQTVPKLQPDNESATATSNA--------ADTTAIQIVSTSTELATIAPP-----EN 48

Query: 1445 HSNPSKIPIRPQKIRKLSITNTDTTPAADEESPPPQIXXXXXXXXXXXXXXXXTVEAPTT 1266
             S+ S IP RP+KIRK+S  N+++ PA D +                             
Sbjct: 49   QSSASNIPFRPRKIRKISPDNSESKPAGDSK----------------------------- 79

Query: 1265 TITTKNRRRSASQLSRPLPQII-KPLSVNGEIELAIRHLRSVDPLLVSLIDTXXXXXXXX 1089
            T     + +   Q    +P ++ + LS  GEIE+A+RHLR+ DP L  LID         
Sbjct: 80   TAGKGAKNKLVPQRVPAVPNMVARALSCEGEIEIALRHLRNADPHLAPLIDLHPPPTFDS 139

Query: 1088 XXXXFLALTKSILYQQLAYKAGAAIYTRFIALCGGEDSVGPDTVLALSAQQLKQIGISGR 909
                FLALTKSILYQQLAYKAG +IYTRF+ LCGGE  V P+TVLAL+  QL+QIG+SGR
Sbjct: 140  FHTPFLALTKSILYQQLAYKAGTSIYTRFVGLCGGEAGVLPETVLALTPHQLRQIGVSGR 199

Query: 908  KASYLYDLANKYNSGILSDESVMKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLP 729
            KASYL+DLA KY +GILSD  ++ MDD+SLFTML+MV GIGSWSVHMFMIFSLHRPDVLP
Sbjct: 200  KASYLHDLARKYQNGILSDTGIITMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP 259

Query: 728  VSDLGVRKGVQLLYGLDDLPRPSQMEQLCEKWKPYRSVGAWYMWRFVEGKGTPAAGATMA 549
            V+DLGVRKGVQLLYGL++LPRPSQMEQLCEKW+PYRSV +WY+WRFVEGKG P++ A +A
Sbjct: 260  VNDLGVRKGVQLLYGLEELPRPSQMEQLCEKWRPYRSVASWYIWRFVEGKGAPSSAAAVA 319

Query: 548  LEGNIVQPLQQIEPQQD-GRQHHQLQFLESVNGIGNLGACIWGQ 420
               ++ Q  QQ E QQ   +Q HQ QFL+ +NGI NLGAC WGQ
Sbjct: 320  GGPSLQQQQQQQEQQQQHQQQQHQQQFLDPINGILNLGACAWGQ 363


>ref|XP_010254809.1| PREDICTED: uncharacterized protein LOC104595671 isoform X2 [Nelumbo
            nucifera]
          Length = 439

 Score =  376 bits (966), Expect = e-101
 Identities = 216/423 (51%), Positives = 271/423 (64%), Gaps = 14/423 (3%)
 Frame = -3

Query: 1646 SSITFLAMGEHTRTQPETQSQDSKPPSPFRSMKSEPNLDSNSQSPPQPTLSTDPATDAGE 1467
            S I+   MGE  +TQP+TQ Q    P      +  P L  ++ +P Q   +  P+T A  
Sbjct: 27   SLISLSYMGE--QTQPQTQIQ---APQSHAQSQPHPPLPHDTTTP-QVVPAAPPSTTAAH 80

Query: 1466 FLQNSQNHSNPS--KIPIRPQKIRKLSITNTDTTPAADEESPPPQIXXXXXXXXXXXXXX 1293
                   H++ S  KIP RP+KIRK+S       P+ + ++   QI              
Sbjct: 81   PSATIAPHNSASSIKIPFRPRKIRKVSSDG----PSDNSDNKSLQIVEGDCKTTTTTNGD 136

Query: 1292 XXTV--EAPTTTITTKNRRRSASQLSRPLPQII-KPLSVNGEIELAIRHLRSVDPLLVSL 1122
                   +  TT  +  + +   Q  R LP+++ + LS  GEI LA+++LR+ DP L  L
Sbjct: 137  HKPNINNSGATTTASNKKNKIVVQQVRVLPRVVARTLSCEGEIALALQYLRNSDPQLARL 196

Query: 1121 IDTXXXXXXXXXXXXFLALTKSILYQQLAYKAGAAIYTRFIALCGGEDSVGPDTVLALSA 942
            ID             FLALTKSILYQQLAYKAG +IYTRF++LCGGE  V P+ VLALS 
Sbjct: 197  IDIHQPPTFDSFHPPFLALTKSILYQQLAYKAGTSIYTRFVSLCGGEAGVVPEAVLALSP 256

Query: 941  QQLKQIGISGRKASYLYDLANKYNSGILSDESVMKMDDRSLFTMLSMVKGIGSWSVHMFM 762
            QQL+QIG+SGRKASYL+DLANKY +GILSD S++ MDD+SLFTML+MVKGIGSWSVHMFM
Sbjct: 257  QQLRQIGVSGRKASYLHDLANKYRNGILSDASIVDMDDKSLFTMLTMVKGIGSWSVHMFM 316

Query: 761  IFSLHRPDVLPVSDLGVRKGVQLLYGLDDLPRPSQMEQLCEKWKPYRSVGAWYMWRFVEG 582
            IFSLHRPDVLPV D+GVRKGVQLLYGLD LPRPSQMEQLCEKW+PYRSV +WYMWRF E 
Sbjct: 317  IFSLHRPDVLPVGDIGVRKGVQLLYGLDQLPRPSQMEQLCEKWRPYRSVASWYMWRFAEA 376

Query: 581  KGTPAAGATMALEGNIVQPLQQIEPQQDGRQHH---------QLQFLESVNGIGNLGACI 429
            KG PA+ A +A+  +  Q LQQ + QQ  +QH          Q Q ++ ++G+ NLGAC 
Sbjct: 377  KGAPASAAAVAVGVSQQQQLQQHQLQQPQQQHQQHQQHQQPPQPQLIDPMHGMANLGACA 436

Query: 428  WGQ 420
            WGQ
Sbjct: 437  WGQ 439


>ref|XP_006473513.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform X2 [Citrus
            sinensis]
          Length = 371

 Score =  375 bits (964), Expect = e-101
 Identities = 205/398 (51%), Positives = 262/398 (65%), Gaps = 3/398 (0%)
 Frame = -3

Query: 1604 QPETQSQDSKPPSPFRSMKSEPNLDSNSQSPPQPTLSTDPATDAGEFLQN--SQNHSNPS 1431
            Q ++Q+Q+   P P    +  PN DS +     P + T+ A +A     N   Q  S PS
Sbjct: 4    QTQSQTQNQPEPQPEPETQPPPNQDSTTTLAVIP-VQTETANNATITHANVTPQTSSPPS 62

Query: 1430 KIPIRPQKIRKLSITNTDTTPAADEESPPPQIXXXXXXXXXXXXXXXXTVEAPTTTITTK 1251
            KIP+RP+KIRKLS  N     ++ + +   +                      T+  +TK
Sbjct: 63   KIPLRPRKIRKLSPDNGVDQASSSQPTESSKA---------------------TSAKSTK 101

Query: 1250 NRRRSASQLSRPLPQII-KPLSVNGEIELAIRHLRSVDPLLVSLIDTXXXXXXXXXXXXF 1074
            +R     Q +  +P+II +PLS  GE+E AIRHLR+ D  L SLID             F
Sbjct: 102  SRAIQQQQQTLTVPRIIARPLSSEGEVEAAIRHLRNADRQLASLIDIHPPPTFDSFHTPF 161

Query: 1073 LALTKSILYQQLAYKAGAAIYTRFIALCGGEDSVGPDTVLALSAQQLKQIGISGRKASYL 894
            LALT+SILYQQLA+KAG +IYTRFIALCGGE  V P+TVLAL+ QQL+QIG+SGRKASYL
Sbjct: 162  LALTRSILYQQLAFKAGTSIYTRFIALCGGEAGVVPETVLALTPQQLRQIGVSGRKASYL 221

Query: 893  YDLANKYNSGILSDESVMKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLG 714
            +DLA KY +GILSD +++ MDD+SLFTML+MV GIGSWSVHMFMIFSLHRPDVLP++DLG
Sbjct: 222  HDLARKYQNGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLG 281

Query: 713  VRKGVQLLYGLDDLPRPSQMEQLCEKWKPYRSVGAWYMWRFVEGKGTPAAGATMALEGNI 534
            VRKGVQLLY L++LPRPSQM+QLCEKW+PYRSV +WY+WRFVE KG P++ A +A    +
Sbjct: 282  VRKGVQLLYSLEELPRPSQMDQLCEKWRPYRSVASWYLWRFVEAKGAPSSAAAVAAGAAL 341

Query: 533  VQPLQQIEPQQDGRQHHQLQFLESVNGIGNLGACIWGQ 420
             QP Q+        +  Q Q L+ +N + N+GAC WGQ
Sbjct: 342  PQPQQE--------EQQQPQLLDQINSLINIGACAWGQ 371


>ref|XP_012446557.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 [Gossypium
            raimondii] gi|763792804|gb|KJB59800.1| hypothetical
            protein B456_009G273100 [Gossypium raimondii]
          Length = 396

 Score =  372 bits (955), Expect = e-100
 Identities = 209/420 (49%), Positives = 270/420 (64%), Gaps = 18/420 (4%)
 Frame = -3

Query: 1625 MGEHTRTQPETQSQ-----DSKPPSPFRSMKSEPNLDSNSQS-----PPQPTLSTDPATD 1476
            MGE    QP+ Q+Q     DS   +  ++ +    L     S        PT++T     
Sbjct: 1    MGEQASGQPQPQAQSQPSNDSSDATQCQTQRQAQTLTKTENSNDAFAAAAPTVTTALVVS 60

Query: 1475 AGEFLQNSQ--NHSNPSKIPIRPQKIRKLSITNTDTTPAADEESPPPQIXXXXXXXXXXX 1302
            A   L +      S PSKIP RP+KIRKLS  ++++ P A +++                
Sbjct: 61   ASTELTDGSPLTSSPPSKIPSRPRKIRKLS-PDSNSEPNASQQATTSTTSTS-------- 111

Query: 1301 XXXXXTVEAPTTTITTKNRRRSASQLSRPLPQIIKP------LSVNGEIELAIRHLRSVD 1140
                  V  P  T+     R   ++LS+    ++ P      LS  GE+E A+RHLR+ D
Sbjct: 112  ------VAVPLKTVP----RAPKAKLSQHRALVVAPQFFARSLSCEGEVETAVRHLRNAD 161

Query: 1139 PLLVSLIDTXXXXXXXXXXXXFLALTKSILYQQLAYKAGAAIYTRFIALCGGEDSVGPDT 960
            PLL SLID             FLALT+SILYQQLA+KAG +IYTRFIALCGGE+ V P+T
Sbjct: 162  PLLASLIDLHPPPTFDTFQTPFLALTRSILYQQLAFKAGTSIYTRFIALCGGENGVVPET 221

Query: 959  VLALSAQQLKQIGISGRKASYLYDLANKYNSGILSDESVMKMDDRSLFTMLSMVKGIGSW 780
            VL+L+ QQL+QIG+SGRKASYL+DLA KY +GILSD +++ MDD+SLFTML+MV GIGSW
Sbjct: 222  VLSLTPQQLRQIGVSGRKASYLHDLARKYQTGILSDSAIVNMDDKSLFTMLTMVNGIGSW 281

Query: 779  SVHMFMIFSLHRPDVLPVSDLGVRKGVQLLYGLDDLPRPSQMEQLCEKWKPYRSVGAWYM 600
            SVHMFMIFSLHRPDVLP++DLG+RKGVQLLY L++LPRPSQM+QLCEKW+PYRSV +WY+
Sbjct: 282  SVHMFMIFSLHRPDVLPINDLGIRKGVQLLYSLEELPRPSQMDQLCEKWRPYRSVASWYL 341

Query: 599  WRFVEGKGTPAAGATMALEGNIVQPLQQIEPQQDGRQHHQLQFLESVNGIGNLGACIWGQ 420
            WRFVE KG P++ A +A  G  +QPL    PQ++ +   Q Q L+S+N I +LGAC WGQ
Sbjct: 342  WRFVEAKGAPSSAAAVA-AGASLQPL----PQEEHQHQQQPQLLDSINSILDLGACTWGQ 396


>ref|XP_010102087.1| DNA-3-methyladenine glycosylase 1 [Morus notabilis]
            gi|587903719|gb|EXB91937.1| DNA-3-methyladenine
            glycosylase 1 [Morus notabilis]
          Length = 451

 Score =  370 bits (951), Expect = 1e-99
 Identities = 210/399 (52%), Positives = 259/399 (64%), Gaps = 9/399 (2%)
 Frame = -3

Query: 1625 MGEHTRTQPETQSQDSKPPSPFRSMKSEPNLDSNSQSPPQPTLSTDPATDAGEFLQN--S 1452
            MGE T+TQ +TQ      P        E    S+S      T +  P++ A   L N  S
Sbjct: 1    MGEQTQTQTQTQQ-----PQQHHGQTQE---SSSSMVTSISTTTIAPSSTAPTELSNAPS 52

Query: 1451 QNHSNPSKIPIRPQKIRKLSITNTDTTPA---ADEESPPPQIXXXXXXXXXXXXXXXXTV 1281
            Q  S PSKIP+RP+KIRKLS  ++D+  +   A  E+P P                    
Sbjct: 53   QTSSPPSKIPLRPRKIRKLSPDDSDSKSSQVVAVPENPKP-------------------- 92

Query: 1280 EAPTTTITTKNRRRSASQ---LSRPLPQII-KPLSVNGEIELAIRHLRSVDPLLVSLIDT 1113
             +PT     K  +    Q   L+   P+I+ + LS  GE+E+A+RHLR  DPLL  LID 
Sbjct: 93   -SPTAAAAAKPAKAKIVQQRALAIAAPRIVARSLSCEGEVEVALRHLRRADPLLAPLIDI 151

Query: 1112 XXXXXXXXXXXXFLALTKSILYQQLAYKAGAAIYTRFIALCGGEDSVGPDTVLALSAQQL 933
                        FLALT+SILYQQLAYKAG +IYTRFIALCGGE  V P+TVLAL+ QQL
Sbjct: 152  HQPPTFDNFHTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGETGVVPETVLALTPQQL 211

Query: 932  KQIGISGRKASYLYDLANKYNSGILSDESVMKMDDRSLFTMLSMVKGIGSWSVHMFMIFS 753
            +QIG+SGRKASYL+DLA KY +GILSD +++ MDD+SLFTML+MV GIGSWSVHMFMIFS
Sbjct: 212  RQIGVSGRKASYLHDLARKYQNGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFS 271

Query: 752  LHRPDVLPVSDLGVRKGVQLLYGLDDLPRPSQMEQLCEKWKPYRSVGAWYMWRFVEGKGT 573
            LHRPDVLP++DLGVRKGVQLLY L++LPRPSQM+QLCEKW+PYRSV AWYMWRFVE KG 
Sbjct: 272  LHRPDVLPINDLGVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVAAWYMWRFVEQKGA 331

Query: 572  PAAGATMALEGNIVQPLQQIEPQQDGRQHHQLQFLESVN 456
            P   AT+A+  N+ Q  QQ + Q +  Q  Q Q ++ +N
Sbjct: 332  PPNAATVAVGANLQQQQQQQQQQGEPHQPQQPQLMDPLN 370


>ref|XP_008466558.1| PREDICTED: DNA-3-methyladenine glycosylase 1 [Cucumis melo]
          Length = 379

 Score =  370 bits (950), Expect = 2e-99
 Identities = 208/407 (51%), Positives = 264/407 (64%), Gaps = 5/407 (1%)
 Frame = -3

Query: 1625 MGEHTRTQPETQSQDSKPPSPFRSMKSEPNLDSNSQSP-PQPTLSTDPATDAGEFLQNSQ 1449
            MGE T+ Q +TQ+Q    P P    ++  +  SNS +P  Q T+      +A      SQ
Sbjct: 1    MGEQTQVQVQTQTQSQ--PQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAP-----SQ 53

Query: 1448 NHSNPSKIPIRPQKIRKLSITNTDTTPA---ADEESPPPQIXXXXXXXXXXXXXXXXTVE 1278
              S PSK+P+RP+KIRKLS   +D   +   A  + P P                     
Sbjct: 54   ISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPI-------------------- 93

Query: 1277 APTTTITTKNRRRSASQLSRPLPQIIKPLSVNGEIELAIRHLRSVDPLLVSLIDTXXXXX 1098
            A   +  +K  ++ A+  S  +P + + LS  GE+E+A+RHLR+ DPLL  LID      
Sbjct: 94   ATVKSNKSKTAQQRAAFASATVP-LARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPT 152

Query: 1097 XXXXXXXFLALTKSILYQQLAYKAGAAIYTRFIALCGGEDSVGPDTVLALSAQQLKQIGI 918
                   FLALT+SILYQQLAYKAG +IYTRFIALCGGE  V P+TVL+L+ QQL+QIGI
Sbjct: 153  FDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLSLNPQQLRQIGI 212

Query: 917  SGRKASYLYDLANKYNSGILSDESVMKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPD 738
            SGRK+SYL+DLA KY +GILSD +++ MDD+SLFTML+MV GIGSWSVHMFMIFSLHRPD
Sbjct: 213  SGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPD 272

Query: 737  VLPVSDLGVRKGVQLLYGLDDLPRPSQMEQLCEKWKPYRSVGAWYMWRFVEGKGTPAAGA 558
            VLP++DL VRKGVQLLY L++LPRPSQM+QLCEKW+PYRSVG+WYMWR  E KG  ++ A
Sbjct: 273  VLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAA 332

Query: 557  TMALEGNIVQPLQQIEPQQDGRQH-HQLQFLESVNGIGNLGACIWGQ 420
             +A   ++    Q    +    QH  Q Q L+ +NGI NLGAC WGQ
Sbjct: 333  AVAAGASLQLQQQDHHQEHQHPQHPQQPQLLDPLNGILNLGACAWGQ 379


>ref|XP_012444918.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 isoform X2
            [Gossypium raimondii] gi|763791263|gb|KJB58259.1|
            hypothetical protein B456_009G201500 [Gossypium
            raimondii]
          Length = 395

 Score =  370 bits (949), Expect = 3e-99
 Identities = 208/412 (50%), Positives = 266/412 (64%), Gaps = 10/412 (2%)
 Frame = -3

Query: 1625 MGEHTRTQPETQSQDSKPPSPFRSMKSEPNLDS----NSQSPPQPTLSTD----PATDAG 1470
            MGE T +QP+ Q Q   P     + +++    S    NS + P  T++T      A    
Sbjct: 1    MGEQTPSQPQPQVQSQPPNDSSTTTQAQVQTQSGDPNNSSTAPVSTVTTACTAIVACGPT 60

Query: 1469 EFLQNSQNH-SNPSKIPIRPQKIRKLSITNTDTTPAADEESPPPQIXXXXXXXXXXXXXX 1293
            E +    +  S PSKIP RP+KIRKLS  +    P A +++                   
Sbjct: 61   ELVNVPLSTLSPPSKIPSRPRKIRKLS-PDLSFDPNASQQATTSSSTSLTE--------- 110

Query: 1292 XXTVEAPTTTITTKNRRRSASQLSRPLPQII-KPLSVNGEIELAIRHLRSVDPLLVSLID 1116
                +  T   T+K +      L+   P+II + LS  GE+E AI HLR  DPLL SLID
Sbjct: 111  ----QRKTVGRTSKTKLSQHRALAVVAPRIISRSLSCEGEVENAIHHLRDADPLLASLID 166

Query: 1115 TXXXXXXXXXXXXFLALTKSILYQQLAYKAGAAIYTRFIALCGGEDSVGPDTVLALSAQQ 936
                         FLALT+SILYQQLA+KAG +IYTRFI+LCGGE+ V P+TVL+L++QQ
Sbjct: 167  LHPPPTFDTFHAPFLALTRSILYQQLAFKAGTSIYTRFISLCGGENGVVPETVLSLTSQQ 226

Query: 935  LKQIGISGRKASYLYDLANKYNSGILSDESVMKMDDRSLFTMLSMVKGIGSWSVHMFMIF 756
            L+QIG+SGRKASYL+DLA KY +GILSD +++ MDD+SLFTML+MV GIGSWSVHMFMIF
Sbjct: 227  LRQIGVSGRKASYLHDLARKYQTGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIF 286

Query: 755  SLHRPDVLPVSDLGVRKGVQLLYGLDDLPRPSQMEQLCEKWKPYRSVGAWYMWRFVEGKG 576
            SLHRPDVLP++DLGVRKGVQLLY L++LPRPSQM+QLCEKW+PYRSV +WY+WR+VE KG
Sbjct: 287  SLHRPDVLPINDLGVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVASWYLWRYVEAKG 346

Query: 575  TPAAGATMALEGNIVQPLQQIEPQQDGRQHHQLQFLESVNGIGNLGACIWGQ 420
             P++ A +A   ++    QQ EPQQ      Q Q ++ +N I NLGAC WGQ
Sbjct: 347  APSSAAAVAAGASLPPLQQQEEPQQ---HQQQPQLMDPINSILNLGACAWGQ 395

