BLASTX nr result

ID: Gardenia21_contig00006576 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Gardenia21_contig00006576
         (1856 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CDP02014.1| unnamed protein product [Coffea canephora]            464   e-127
ref|XP_011099624.1| PREDICTED: uncharacterized protein LOC105177...   442   e-121
ref|XP_009804698.1| PREDICTED: probable DNA-3-methyladenine glyc...   442   e-121
ref|XP_006346951.1| PREDICTED: probable DNA-3-methyladenine glyc...   431   e-117
ref|XP_004233519.1| PREDICTED: probable DNA-3-methyladenine glyc...   418   e-114
ref|XP_012834115.1| PREDICTED: uncharacterized protein LOC105954...   409   e-111
ref|XP_009766232.1| PREDICTED: probable DNA-3-methyladenine glyc...   404   e-109
ref|XP_009766226.1| PREDICTED: probable DNA-3-methyladenine glyc...   396   e-107
gb|EYU40170.1| hypothetical protein MIMGU_mgv1a021549mg [Erythra...   394   e-106
ref|XP_009766219.1| PREDICTED: probable DNA-3-methyladenine glyc...   392   e-106
ref|XP_010263642.1| PREDICTED: uncharacterized protein LOC104601...   383   e-103
ref|XP_010649138.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   380   e-102
ref|XP_007017553.1| DNA glycosylase superfamily protein [Theobro...   379   e-102
ref|XP_006473513.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   370   2e-99
gb|EPS66255.1| hypothetical protein M569_08523, partial [Genlise...   369   4e-99
ref|XP_010254809.1| PREDICTED: uncharacterized protein LOC104595...   367   1e-98
ref|XP_012071867.1| PREDICTED: uncharacterized protein LOC105633...   367   1e-98
ref|XP_010102087.1| DNA-3-methyladenine glycosylase 1 [Morus not...   366   3e-98
ref|XP_008466558.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   366   3e-98
ref|XP_012444918.1| PREDICTED: probable DNA-3-methyladenine glyc...   365   6e-98

>emb|CDP02014.1| unnamed protein product [Coffea canephora]
          Length = 337

 Score =  464 bits (1193), Expect = e-127
 Identities = 233/262 (88%), Positives = 236/262 (90%)
 Frame = -2

Query: 1168 IIKPLSAEGEINAALHHLRVADPLLATLIDIHQPPAFESHHSPFLALTKSILYQQLAYKA 989
            IIKPLSAEGEINAALHHLRV DPLLATLID HQPPAFESHHSPFLALTKSILYQQLAYKA
Sbjct: 76   IIKPLSAEGEINAALHHLRVVDPLLATLIDTHQPPAFESHHSPFLALTKSILYQQLAYKA 135

Query: 988  GTSIYNRFVALCGGEGAVLPDNVLGLSAQQLKQVGVSGRKASYLYDLANKYKSGILSDET 809
            GTSIYNRFVALCGGE AVLPDNVLGLSAQ+LKQVGVSGRKASYLYDLANKYKSGILSDET
Sbjct: 136  GTSIYNRFVALCGGETAVLPDNVLGLSAQELKQVGVSGRKASYLYDLANKYKSGILSDET 195

Query: 808  VVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQLLYGLEELPR 629
            VVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQ+LYGLEELPR
Sbjct: 196  VVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQMLYGLEELPR 255

Query: 628  PSQMELLCEKWRPYRSVGAWYMWRFVEGKGSQNASAASSLEGANVXXXXXXXXXXXXXXX 449
            PSQME LCEKWRPYRSVGAWYMWRFVEGKGSQNAS A S+EGANV               
Sbjct: 256  PSQMEQLCEKWRPYRSVGAWYMWRFVEGKGSQNASVAPSVEGANVQPLQQIEPQQDAQQQ 315

Query: 448  XXXXXLEPINGMGNLGACIWGQ 383
                 LEPINGMGNLGACIWGQ
Sbjct: 316  HQLQLLEPINGMGNLGACIWGQ 337



 Score =  108 bits (270), Expect = 2e-20
 Identities = 52/57 (91%), Positives = 56/57 (98%)
 Frame = -2

Query: 1594 PNSDVASATQPTPVADVSINANVSQKPTNPSKIPIRPQKIRKLSSNPTSTIATTPVV 1424
            PNSDV SATQPTPVADVSINA+VSQKP+NPSKIPIRPQKIRKLSSNPTSTIATTP++
Sbjct: 21   PNSDVTSATQPTPVADVSINADVSQKPSNPSKIPIRPQKIRKLSSNPTSTIATTPII 77


>ref|XP_011099624.1| PREDICTED: uncharacterized protein LOC105177997 [Sesamum indicum]
          Length = 419

 Score =  442 bits (1137), Expect = e-121
 Identities = 241/412 (58%), Positives = 281/412 (68%), Gaps = 8/412 (1%)
 Frame = -2

Query: 1594 PNSDVA---SATQPTPVADVS--INANVSQKPTNPSKIPIRPQKIRKLSSNPTSTIATTP 1430
            P+SD +   S  QP  +A+ S      +S  P NPSKIPIRPQKIRKLS++     +T  
Sbjct: 43   PSSDSSARISHPQPVSLAESSHATATEISHNPQNPSKIPIRPQKIRKLSTSIPDKPSTPQ 102

Query: 1429 VVL--TPVNSSSSVTTLQTSEAKSLQITDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1256
                 + V++SSS+    T+ + +  +T                                
Sbjct: 103  TTADDSSVSASSSLALTTTTASTTTAMTPVTPTTTHSA---------------------- 140

Query: 1255 XXXXXXXXXXXTPKNRRRSAVQSARVLPQIIKPLSAEGEINAALHHLRVADPLLATLIDI 1076
                         KNRRRSA Q++RVLPQ+IKPLSA+GEI  A+ HLR AD LL  LID 
Sbjct: 141  -------------KNRRRSASQASRVLPQVIKPLSADGEIELAIRHLRAADALLGPLIDT 187

Query: 1075 HQPPAFESHHSPFLALTKSILYQQLAYKAGTSIYNRFVALCGGEGAVLPDNVLGLSAQQL 896
            H PP FE HH+PF ALTKSILYQQLAYKAGTSIY RFV+LCGGE ++ PD+VL LS QQL
Sbjct: 188  HPPPQFEFHHNPFHALTKSILYQQLAYKAGTSIYTRFVSLCGGEESISPDSVLALSPQQL 247

Query: 895  KQVGVSGRKASYLYDLANKYKSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMFMIFS 716
            KQ+GVSGRKASYLYDLANKYKSGILSD+TVVKMDD+SLFTMLSMVKGIGSWSVHMFMIFS
Sbjct: 248  KQIGVSGRKASYLYDLANKYKSGILSDDTVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFS 307

Query: 715  LHRPDVLPVSDLGVRKGVQLLYGLEELPRPSQMELLCEKWRPYRSVGAWYMWRFVEGKGS 536
            LHRPDVLPVSDLGVRKGVQLLYGLEELPRPSQME LCEKW+PYRSVGAWYMWRFVEGKG+
Sbjct: 308  LHRPDVLPVSDLGVRKGVQLLYGLEELPRPSQMEQLCEKWKPYRSVGAWYMWRFVEGKGA 367

Query: 535  QNASAASSLEGANV-XXXXXXXXXXXXXXXXXXXXLEPINGMGNLGACIWGQ 383
              +++   L+G+ V                     +EP+NG+GN+GACIW Q
Sbjct: 368  PTSNSGGVLDGSVVQPLQQIEPQQDGHQHQHQLQFVEPVNGIGNIGACIWNQ 419


>ref|XP_009804698.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 [Nicotiana
            sylvestris]
          Length = 363

 Score =  442 bits (1136), Expect = e-121
 Identities = 248/404 (61%), Positives = 279/404 (69%)
 Frame = -2

Query: 1594 PNSDVASATQPTPVADVSINANVSQKPTNPSKIPIRPQKIRKLSSNPTSTIATTPVVLTP 1415
            PNSD  + +   PV       ++   P+NPSKIPIRPQKIRKLSS  TS  +T P    P
Sbjct: 21   PNSDSTTLSTNPPV-------DIPPNPSNPSKIPIRPQKIRKLSST-TSPQSTNP---KP 69

Query: 1414 VNSSSSVTTLQTSEAKSLQITDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1235
             +SS SV T   S  K + IT                                       
Sbjct: 70   ADSSQSVVT---SNGK-VTIT--------------------------------------- 86

Query: 1234 XXXXTPKNRRRSAVQSARVLPQIIKPLSAEGEINAALHHLRVADPLLATLIDIHQPPAFE 1055
                  KNRRRSA Q  RVLPQ+IKPLSA GEI  AL HLR+ADPLL +LID    PAF+
Sbjct: 87   ------KNRRRSASQLTRVLPQVIKPLSANGEIENALRHLRLADPLLCSLIDTLPLPAFD 140

Query: 1054 SHHSPFLALTKSILYQQLAYKAGTSIYNRFVALCGGEGAVLPDNVLGLSAQQLKQVGVSG 875
            SH  PFLAL KSILYQQLAYKAGTSIY RFV+LCG E AV PD VL LSAQQLKQ+G+SG
Sbjct: 141  SHQLPFLALCKSILYQQLAYKAGTSIYTRFVSLCGSEDAVCPDVVLSLSAQQLKQIGISG 200

Query: 874  RKASYLYDLANKYKSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVL 695
            RKASYLYDLANKYK+GIL+D+TVVKMDD+SLFTMLSMVKGIGSWSVHMFMIFSLHRPDVL
Sbjct: 201  RKASYLYDLANKYKTGILADDTVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVL 260

Query: 694  PVSDLGVRKGVQLLYGLEELPRPSQMELLCEKWRPYRSVGAWYMWRFVEGKGSQNASAAS 515
            PVSDLGVRKGVQ+LYGLEELPRPSQME LCEKWRPYRS+GAWYMWRF+EGKG+  A+AA+
Sbjct: 261  PVSDLGVRKGVQMLYGLEELPRPSQMEQLCEKWRPYRSIGAWYMWRFIEGKGTP-ATAAA 319

Query: 514  SLEGANVXXXXXXXXXXXXXXXXXXXXLEPINGMGNLGACIWGQ 383
            ++EG +V                    LEPI+G+G+LGACIWGQ
Sbjct: 320  AMEGGSVQPLQQIEPQQQPEQQHQLQLLEPIDGIGSLGACIWGQ 363


>ref|XP_006346951.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2-like [Solanum
            tuberosum]
          Length = 362

 Score =  431 bits (1107), Expect = e-117
 Identities = 238/399 (59%), Positives = 275/399 (68%), Gaps = 4/399 (1%)
 Frame = -2

Query: 1567 QPTPVADVSINAN----VSQKPTNPSKIPIRPQKIRKLSSNPTSTIATTPVVLTPVNSSS 1400
            QP P++D ++ +N    +   P+NPSKIPIRPQKIRKLSS P+S    TP         +
Sbjct: 20   QPLPISDSTLVSNSPVDLPPNPSNPSKIPIRPQKIRKLSSTPSSN-GKTP--------ET 70

Query: 1399 SVTTLQTSEAKSLQITDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXT 1220
            +V +  T+ + ++ +T                                            
Sbjct: 71   TVPSASTATSGAITVT-------------------------------------------- 86

Query: 1219 PKNRRRSAVQSARVLPQIIKPLSAEGEINAALHHLRVADPLLATLIDIHQPPAFESHHSP 1040
             KNRR+SA +S+RVLPQIIKPLSA+GEI+ AL HLR  DPLL +LID    P FE HHS 
Sbjct: 87   -KNRRKSAPKSSRVLPQIIKPLSADGEIDNALQHLRSVDPLLVSLIDTLPSPQFELHHSA 145

Query: 1039 FLALTKSILYQQLAYKAGTSIYNRFVALCGGEGAVLPDNVLGLSAQQLKQVGVSGRKASY 860
            FLAL+KSILYQQLAYKAGTSIY RFV+LCGGE AV PD VL LS QQLKQVG+SGRKASY
Sbjct: 146  FLALSKSILYQQLAYKAGTSIYTRFVSLCGGEDAVCPDIVLSLSPQQLKQVGISGRKASY 205

Query: 859  LYDLANKYKSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDL 680
            L+DLANKY+SGILSDET+VKMDD+SLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDL
Sbjct: 206  LHDLANKYRSGILSDETLVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDL 265

Query: 679  GVRKGVQLLYGLEELPRPSQMELLCEKWRPYRSVGAWYMWRFVEGKGSQNASAASSLEGA 500
            GVRKGVQLLYGLEELPRPSQME LC+KW+PYRS GAWYMWR VEGKG+   +AA+ ++G 
Sbjct: 266  GVRKGVQLLYGLEELPRPSQMEQLCDKWKPYRSAGAWYMWRLVEGKGTP-TTAAAPIDGG 324

Query: 499  NVXXXXXXXXXXXXXXXXXXXXLEPINGMGNLGACIWGQ 383
            NV                    LEPING+ NLGACIW Q
Sbjct: 325  NV-QALQQFPTEQETQQHQLQLLEPINGIENLGACIWSQ 362


>ref|XP_004233519.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 [Solanum
            lycopersicum]
          Length = 353

 Score =  418 bits (1075), Expect = e-114
 Identities = 235/403 (58%), Positives = 268/403 (66%), Gaps = 7/403 (1%)
 Frame = -2

Query: 1570 TQPTPVADVSINANVSQKP-------TNPSKIPIRPQKIRKLSSNPTSTIATTPVVLTPV 1412
            T P P+   S +  VS  P       +NPSKIPIRPQKIRKLSS P+S    TP      
Sbjct: 7    TPPQPLPTSSDSTLVSNSPVDLPPNPSNPSKIPIRPQKIRKLSSTPSSN-GKTP------ 59

Query: 1411 NSSSSVTTLQTSEAKSLQITDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1232
               ++V +  T+ + ++ +T                                        
Sbjct: 60   --ETAVPSASTATSGAITVT---------------------------------------- 77

Query: 1231 XXXTPKNRRRSAVQSARVLPQIIKPLSAEGEINAALHHLRVADPLLATLIDIHQPPAFES 1052
                 KNRR++A +S+RV PQIIKPLSA+GEI+ AL HLR  DPLL +LID    P FE 
Sbjct: 78   -----KNRRKTAPKSSRVSPQIIKPLSADGEIDNALQHLRSVDPLLVSLIDTLPSPQFEL 132

Query: 1051 HHSPFLALTKSILYQQLAYKAGTSIYNRFVALCGGEGAVLPDNVLGLSAQQLKQVGVSGR 872
            HHS FLAL+KSILYQQLAYKAGTSIY RFV+LCGGE AV PD VL LS QQLKQVG+SGR
Sbjct: 133  HHSAFLALSKSILYQQLAYKAGTSIYTRFVSLCGGEDAVCPDIVLALSPQQLKQVGISGR 192

Query: 871  KASYLYDLANKYKSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLP 692
            KASYL+DLANKYKSGILSDET+VKMDD+SLF MLSMVKGIGSWSVHMFMIFSLHRPD+LP
Sbjct: 193  KASYLHDLANKYKSGILSDETLVKMDDRSLFAMLSMVKGIGSWSVHMFMIFSLHRPDILP 252

Query: 691  VSDLGVRKGVQLLYGLEELPRPSQMELLCEKWRPYRSVGAWYMWRFVEGKGSQNASAASS 512
            VSDLGVRKGVQLLYGLEELPRPSQME LC+KW+PYRS GAWYMWR VEGKG+    AA+ 
Sbjct: 253  VSDLGVRKGVQLLYGLEELPRPSQMEQLCDKWKPYRSAGAWYMWRLVEGKGTPTI-AAAP 311

Query: 511  LEGANVXXXXXXXXXXXXXXXXXXXXLEPINGMGNLGACIWGQ 383
            ++G N                     LEPING+ NLGACIW Q
Sbjct: 312  IDGGNA-QALQQFPVEQETQQHQLQLLEPINGIENLGACIWSQ 353


>ref|XP_012834115.1| PREDICTED: uncharacterized protein LOC105954973 [Erythranthe
            guttatus]
          Length = 424

 Score =  409 bits (1050), Expect = e-111
 Identities = 229/403 (56%), Positives = 261/403 (64%), Gaps = 8/403 (1%)
 Frame = -2

Query: 1567 QPTPVADVSINA-----NVSQKPTNPSKIPIRPQKIRKLSSNPTSTIATTPVVLTPVNSS 1403
            Q   VA+ S+ A      +S    NPSKIPIRPQKIRKLS+  T+  ++TP       S 
Sbjct: 56   QTASVAEASLAAAAAATEISNNSQNPSKIPIRPQKIRKLST--TAGKSSTPQSTADEASV 113

Query: 1402 SSVTTLQTSEAKSLQITDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1223
            S+  +L  + A     T                                           
Sbjct: 114  SASPSLPLTPAAGAAST--------------------------------VASPATPSTTH 141

Query: 1222 TPKNRRRSAVQSARVLPQIIKPLSAEGEINAALHHLRVADPLLATLIDIHQPPAFESHHS 1043
            T KNRRRSA Q++R +PQIIKPLSA+GEI  A+ HLR  DPLL  LID H P  F+S   
Sbjct: 142  TAKNRRRSASQASRAMPQIIKPLSADGEIELAIRHLRAVDPLLGPLIDTHLPFQFDSQQP 201

Query: 1042 PFLALTKSILYQQLAYKAGTSIYNRFVALCGGEGAVLPDNVLGLSAQQLKQVGVSGRKAS 863
            PFLALTKSILYQQLA KAGTSIY RFV+LCG E +V PD VL LS QQLK +GVSGRKAS
Sbjct: 202  PFLALTKSILYQQLACKAGTSIYTRFVSLCGAEESVCPDTVLSLSTQQLKAIGVSGRKAS 261

Query: 862  YLYDLANKYKSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSD 683
            YLYDLANKYKSGILSD+TVVKMDD+SLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSD
Sbjct: 262  YLYDLANKYKSGILSDDTVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSD 321

Query: 682  LGVRKGVQLLYGLEELPRPSQMELLCEKWRPYRSVGAWYMWRFVEGKGSQNASAASS--- 512
            LGVRKGVQ+L GL+ELPRPSQME LCEKW+PYRSVGAWYMWRFVEGKG+  +  A     
Sbjct: 322  LGVRKGVQMLNGLDELPRPSQMEQLCEKWKPYRSVGAWYMWRFVEGKGAAGSGVALEDGV 381

Query: 511  LEGANVXXXXXXXXXXXXXXXXXXXXLEPINGMGNLGACIWGQ 383
            ++                        +EP+NG+GN+GACIW Q
Sbjct: 382  VQPLQQVEPQQDGHQHQHQLQHQLQFVEPVNGIGNMGACIWNQ 424


>ref|XP_009766232.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 isoform X3
            [Nicotiana sylvestris]
          Length = 360

 Score =  404 bits (1037), Expect = e-109
 Identities = 211/280 (75%), Positives = 230/280 (82%), Gaps = 2/280 (0%)
 Frame = -2

Query: 1216 KNRRRSAVQSA--RVLPQIIKPLSAEGEINAALHHLRVADPLLATLIDIHQPPAFESHHS 1043
            K+RR+SA +S+  R LPQIIKPLSA GEI+ AL HLR ADPLL +LID    P FESHHS
Sbjct: 83   KSRRKSAPKSSSSRGLPQIIKPLSANGEIDNALLHLRSADPLLGSLIDTLPVPQFESHHS 142

Query: 1042 PFLALTKSILYQQLAYKAGTSIYNRFVALCGGEGAVLPDNVLGLSAQQLKQVGVSGRKAS 863
            PFLAL+KSILYQQLAYKAGTSIY RFV+LCGGE AV PD VL LSAQQLKQ+GVSGRKAS
Sbjct: 143  PFLALSKSILYQQLAYKAGTSIYTRFVSLCGGEDAVCPDVVLSLSAQQLKQIGVSGRKAS 202

Query: 862  YLYDLANKYKSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSD 683
            YLYDLANKYK+GIL D+ +VKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSD
Sbjct: 203  YLYDLANKYKNGILCDDALVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSD 262

Query: 682  LGVRKGVQLLYGLEELPRPSQMELLCEKWRPYRSVGAWYMWRFVEGKGSQNASAASSLEG 503
            LGVRKGVQLLYGLEELPRPSQME LCEKWRPYRS GAWYMWRFVE KG+   +AA++++ 
Sbjct: 263  LGVRKGVQLLYGLEELPRPSQMEQLCEKWRPYRSAGAWYMWRFVEWKGTP-TTAAAAIDA 321

Query: 502  ANVXXXXXXXXXXXXXXXXXXXXLEPINGMGNLGACIWGQ 383
             NV                    LEPING+GNLGACIW Q
Sbjct: 322  GNV-QPLQQIQTGQETQQHQLQLLEPINGIGNLGACIWSQ 360


>ref|XP_009766226.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 isoform X2
            [Nicotiana sylvestris]
          Length = 368

 Score =  396 bits (1018), Expect = e-107
 Identities = 211/288 (73%), Positives = 230/288 (79%), Gaps = 10/288 (3%)
 Frame = -2

Query: 1216 KNRRRSAVQSA--RVLPQIIKPLSAEGEINAALHHLRVADPLLATLIDIHQPPAFESHHS 1043
            K+RR+SA +S+  R LPQIIKPLSA GEI+ AL HLR ADPLL +LID    P FESHHS
Sbjct: 83   KSRRKSAPKSSSSRGLPQIIKPLSANGEIDNALLHLRSADPLLGSLIDTLPVPQFESHHS 142

Query: 1042 PFLALTKSILYQQLAYKAGTSIYNRFVALCGGEGAVLPDNVLGLSAQQLKQVGVSGRKAS 863
            PFLAL+KSILYQQLAYKAGTSIY RFV+LCGGE AV PD VL LSAQQLKQ+GVSGRKAS
Sbjct: 143  PFLALSKSILYQQLAYKAGTSIYTRFVSLCGGEDAVCPDVVLSLSAQQLKQIGVSGRKAS 202

Query: 862  YLYDLANKYKSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSD 683
            YLYDLANKYK+GIL D+ +VKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSD
Sbjct: 203  YLYDLANKYKNGILCDDALVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSD 262

Query: 682  LGVRKGVQLLYGLEELPRPSQMELLCEKWRPYRSVGAWYMWRFVEGKGSQNASAASSLEG 503
            LGVRKGVQLLYGLEELPRPSQME LCEKWRPYRS GAWYMWRFVE KG+   +AA++++ 
Sbjct: 263  LGVRKGVQLLYGLEELPRPSQMEQLCEKWRPYRSAGAWYMWRFVEWKGTP-TTAAAAIDA 321

Query: 502  ANVXXXXXXXXXXXXXXXXXXXXLEPINGMGNLG--------ACIWGQ 383
             NV                    LEPING+GNLG        ACIW Q
Sbjct: 322  GNV-QPLQQIQTGQETQQHQLQLLEPINGIGNLGYLTIFRLKACIWSQ 368


>gb|EYU40170.1| hypothetical protein MIMGU_mgv1a021549mg [Erythranthe guttata]
          Length = 407

 Score =  394 bits (1012), Expect = e-106
 Identities = 225/394 (57%), Positives = 256/394 (64%), Gaps = 5/394 (1%)
 Frame = -2

Query: 1567 QPTPVADVSINA-----NVSQKPTNPSKIPIRPQKIRKLSSNPTSTIATTPVVLTPVNSS 1403
            Q   VA+ S+ A      +S    NPSKIPIRPQKIRKLS+  T+  ++TP       S 
Sbjct: 56   QTASVAEASLAAAAAATEISNNSQNPSKIPIRPQKIRKLST--TAGKSSTPQSTADEASV 113

Query: 1402 SSVTTLQTSEAKSLQITDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1223
            S+  +L  + A     T                                           
Sbjct: 114  SASPSLPLTPAAGAAST--------------------------------VASPATPSTTH 141

Query: 1222 TPKNRRRSAVQSARVLPQIIKPLSAEGEINAALHHLRVADPLLATLIDIHQPPAFESHHS 1043
            T KNRRRSA Q++R +PQIIKPLSA+GEI  A+ HLR  DPLL  LID H P  F+S   
Sbjct: 142  TAKNRRRSASQASRAMPQIIKPLSADGEIELAIRHLRAVDPLLGPLIDTHLPFQFDSQQP 201

Query: 1042 PFLALTKSILYQQLAYKAGTSIYNRFVALCGGEGAVLPDNVLGLSAQQLKQVGVSGRKAS 863
            PFLALTKSILYQQLA KAGTSIY RFV+LCG E +V PD VL LS QQLK +GVSGRKAS
Sbjct: 202  PFLALTKSILYQQLACKAGTSIYTRFVSLCGAEESVCPDTVLSLSTQQLKAIGVSGRKAS 261

Query: 862  YLYDLANKYKSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSD 683
            YLYDLANKYKSGILSD+TVVKMDD+SLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSD
Sbjct: 262  YLYDLANKYKSGILSDDTVVKMDDRSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSD 321

Query: 682  LGVRKGVQLLYGLEELPRPSQMELLCEKWRPYRSVGAWYMWRFVEGKGSQNASAASSLEG 503
            LGVRKGVQ+L GL+ELPRPSQME LCEKW+PYRSVGAWYMWRFVEGKG    +A S ++ 
Sbjct: 322  LGVRKGVQMLNGLDELPRPSQMEQLCEKWKPYRSVGAWYMWRFVEGKG----AAGSGVQ- 376

Query: 502  ANVXXXXXXXXXXXXXXXXXXXXLEPINGMGNLG 401
                                   +EP+NG+GN+G
Sbjct: 377  ---VEPQQDGHQHQHQLQHQLQFVEPVNGIGNMG 407


>ref|XP_009766219.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 isoform X1
            [Nicotiana sylvestris]
          Length = 395

 Score =  392 bits (1006), Expect = e-106
 Identities = 207/277 (74%), Positives = 226/277 (81%), Gaps = 2/277 (0%)
 Frame = -2

Query: 1216 KNRRRSAVQSA--RVLPQIIKPLSAEGEINAALHHLRVADPLLATLIDIHQPPAFESHHS 1043
            K+RR+SA +S+  R LPQIIKPLSA GEI+ AL HLR ADPLL +LID    P FESHHS
Sbjct: 83   KSRRKSAPKSSSSRGLPQIIKPLSANGEIDNALLHLRSADPLLGSLIDTLPVPQFESHHS 142

Query: 1042 PFLALTKSILYQQLAYKAGTSIYNRFVALCGGEGAVLPDNVLGLSAQQLKQVGVSGRKAS 863
            PFLAL+KSILYQQLAYKAGTSIY RFV+LCGGE AV PD VL LSAQQLKQ+GVSGRKAS
Sbjct: 143  PFLALSKSILYQQLAYKAGTSIYTRFVSLCGGEDAVCPDVVLSLSAQQLKQIGVSGRKAS 202

Query: 862  YLYDLANKYKSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSD 683
            YLYDLANKYK+GIL D+ +VKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSD
Sbjct: 203  YLYDLANKYKNGILCDDALVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSD 262

Query: 682  LGVRKGVQLLYGLEELPRPSQMELLCEKWRPYRSVGAWYMWRFVEGKGSQNASAASSLEG 503
            LGVRKGVQLLYGLEELPRPSQME LCEKWRPYRS GAWYMWRFVE KG+   +AA++++ 
Sbjct: 263  LGVRKGVQLLYGLEELPRPSQMEQLCEKWRPYRSAGAWYMWRFVEWKGTP-TTAAAAIDA 321

Query: 502  ANVXXXXXXXXXXXXXXXXXXXXLEPINGMGNLGACI 392
             NV                    LEPING+GNLG  I
Sbjct: 322  GNV-QPLQQIQTGQETQQHQLQLLEPINGIGNLGLLI 357


>ref|XP_010263642.1| PREDICTED: uncharacterized protein LOC104601852 [Nelumbo nucifera]
          Length = 425

 Score =  383 bits (983), Expect = e-103
 Identities = 206/393 (52%), Positives = 252/393 (64%), Gaps = 1/393 (0%)
 Frame = -2

Query: 1558 PVADVSINANVSQKPTNPSKIPIRPQKIRKLSSNPTSTIATTPVVLTPVNSSSSVTTLQT 1379
            P    +  ++  Q   + +KIP RP+KIRK SS+ +S  +   +V     ++++    +T
Sbjct: 78   PAPPTTTASSAPQNSASSTKIPFRPRKIRKTSSDVSSDNSDNKIVDGECKTTATNGDHKT 137

Query: 1378 SEAKSLQITDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTPKNRRRS 1199
            +   +L  T                                               + R 
Sbjct: 138  NNNTALTTTS--------------------------------------------NKKSRI 153

Query: 1198 AVQSARVLPQII-KPLSAEGEINAALHHLRVADPLLATLIDIHQPPAFESHHSPFLALTK 1022
              +  RV+P+++ + LS EGE+  AL HLR +DP LA LIDIHQPP F+S H PFLALTK
Sbjct: 154  VAKQVRVVPRVVARTLSCEGEVALALQHLRNSDPQLARLIDIHQPPTFDSFHPPFLALTK 213

Query: 1021 SILYQQLAYKAGTSIYNRFVALCGGEGAVLPDNVLGLSAQQLKQVGVSGRKASYLYDLAN 842
            SILYQQLAYKAGTSIY RFV+LCGGE  V+P+ VL LS QQL+Q+GVSGRKASYL+DLAN
Sbjct: 214  SILYQQLAYKAGTSIYTRFVSLCGGEAGVVPEAVLALSPQQLRQIGVSGRKASYLHDLAN 273

Query: 841  KYKSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRKGV 662
            KY++GILSD ++V MDDKSLFTML+MVKGIGSWSVHMFMIFSLHRPDVLPV DLGVRKGV
Sbjct: 274  KYRNGILSDASIVDMDDKSLFTMLTMVKGIGSWSVHMFMIFSLHRPDVLPVGDLGVRKGV 333

Query: 661  QLLYGLEELPRPSQMELLCEKWRPYRSVGAWYMWRFVEGKGSQNASAASSLEGANVXXXX 482
            QLLYGLEELPRPSQME LCEKWRPYRSV +WYMWRF E KG+  ASAA+   G +     
Sbjct: 334  QLLYGLEELPRPSQMEQLCEKWRPYRSVASWYMWRFAEAKGAP-ASAAAVAVGVSQQQQL 392

Query: 481  XXXXXXXXXXXXXXXXLEPINGMGNLGACIWGQ 383
                            ++P+NG+ NLGAC WGQ
Sbjct: 393  PPPPQQQQQPPPPPQLIDPMNGIANLGACTWGQ 425


>ref|XP_010649138.1| PREDICTED: DNA-3-methyladenine glycosylase 1 [Vitis vinifera]
          Length = 384

 Score =  380 bits (975), Expect = e-102
 Identities = 190/276 (68%), Positives = 217/276 (78%), Gaps = 1/276 (0%)
 Frame = -2

Query: 1207 RRSAVQSARVLPQIIKPLSAEGEINAALHHLRVADPLLATLIDIHQPPAFESHHSPFLAL 1028
            +R+A QS   LP I+KPLS EGE++ AL HL  +DPLLA LI+ HQPP F+S H PFLAL
Sbjct: 112  KRNAAQSTAALPTIVKPLSCEGELDVALRHLTKSDPLLAALINTHQPPTFDSCHPPFLAL 171

Query: 1027 TKSILYQQLAYKAGTSIYNRFVALCGGEGAVLPDNVLGLSAQQLKQVGVSGRKASYLYDL 848
             KSILYQQLAYKA TSIY RFVALCGGE  V+PD VL LS  QL+Q+GVSGRKA YL+DL
Sbjct: 172  AKSILYQQLAYKAATSIYTRFVALCGGEAGVVPDAVLALSPSQLRQIGVSGRKAGYLHDL 231

Query: 847  ANKYKSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRK 668
            A+KYK+GILSD +++ MDDKSLFTML+MVKGIGSWSVHMFMIFSLHRPDVLPV D+GVRK
Sbjct: 232  ASKYKTGILSDSSIMGMDDKSLFTMLTMVKGIGSWSVHMFMIFSLHRPDVLPVGDVGVRK 291

Query: 667  GVQLLYGLEELPRPSQMELLCEKWRPYRSVGAWYMWRFVEGKGSQNASAASSL-EGANVX 491
            GVQ LYGLEELPRPSQME LCEKW+PYRSVG+WYMWRFVE KG+  A AA +L +GA   
Sbjct: 292  GVQFLYGLEELPRPSQMEQLCEKWKPYRSVGSWYMWRFVEAKGAPPARAAVALVDGAT-- 349

Query: 490  XXXXXXXXXXXXXXXXXXXLEPINGMGNLGACIWGQ 383
                               ++PING+ NLGACIWGQ
Sbjct: 350  -SEQQQQQEQQQQPQQLQLVDPINGIVNLGACIWGQ 384


>ref|XP_007017553.1| DNA glycosylase superfamily protein [Theobroma cacao]
            gi|508722881|gb|EOY14778.1| DNA glycosylase superfamily
            protein [Theobroma cacao]
          Length = 397

 Score =  379 bits (972), Expect = e-102
 Identities = 212/411 (51%), Positives = 260/411 (63%), Gaps = 7/411 (1%)
 Frame = -2

Query: 1594 PNSDVASATQPTPVADVSINANVS------QKPTNPSKIPIRPQKIRKLSSNPTSTIATT 1433
            PN+   +A   T  + V  +A         Q  + PSKIP RP+KIRKLS +P S     
Sbjct: 40   PNNTSNAAVSTTVTSAVVTSAPTELTNVPPQTSSPPSKIPFRPRKIRKLSPDPNSD---- 95

Query: 1432 PVVLTPVNSSSSVTTLQTSEAKSLQITDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1253
                   N+S   TT  TS  +  +                                   
Sbjct: 96   ------TNASQQATTSATSATEPPKTV--------------------------------- 116

Query: 1252 XXXXXXXXXXTPKNRRRSAVQSARVLPQII-KPLSAEGEINAALHHLRVADPLLATLIDI 1076
                      TPK +     ++  V+P+I+ + LS EGE+  A+ HLR ADPLLA+LIDI
Sbjct: 117  --------AKTPKTKLTQH-RALAVVPRIMARSLSCEGEVETAIRHLRNADPLLASLIDI 167

Query: 1075 HQPPAFESHHSPFLALTKSILYQQLAYKAGTSIYNRFVALCGGEGAVLPDNVLGLSAQQL 896
            H PP F++ H+PFLALT+SILYQQLA+KAGTSIYNRF+ALCGGE  V+P+ VL L+AQQL
Sbjct: 168  HPPPTFDTFHTPFLALTRSILYQQLAFKAGTSIYNRFIALCGGENGVVPETVLSLTAQQL 227

Query: 895  KQVGVSGRKASYLYDLANKYKSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMFMIFS 716
            +Q+GVSGRKASYL+DLA KY++GILSD  +V MDDKSLFTML+MV GIGSWSVHMFMIFS
Sbjct: 228  RQIGVSGRKASYLHDLARKYQTGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFS 287

Query: 715  LHRPDVLPVSDLGVRKGVQLLYGLEELPRPSQMELLCEKWRPYRSVGAWYMWRFVEGKGS 536
            LHRPDVLP++DLGVRKGVQLLY LEELPRPSQM+ LCEKWRPYRSV +WY+WRFVE KG+
Sbjct: 288  LHRPDVLPINDLGVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVASWYLWRFVEAKGA 347

Query: 535  QNASAASSLEGANVXXXXXXXXXXXXXXXXXXXXLEPINGMGNLGACIWGQ 383
              +SAA+   GA++                    L+PIN + NLGAC WGQ
Sbjct: 348  P-SSAAAVAAGASLPPPQQEEQQQHQQHQQQPQLLDPINSILNLGACAWGQ 397


>ref|XP_006473513.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform X2 [Citrus
            sinensis]
          Length = 371

 Score =  370 bits (951), Expect = 2e-99
 Identities = 209/412 (50%), Positives = 258/412 (62%), Gaps = 8/412 (1%)
 Frame = -2

Query: 1594 PNSDVASATQPTPVADVSIN------ANVS-QKPTNPSKIPIRPQKIRKLSSNPTSTIAT 1436
            PN D  +     PV   + N      ANV+ Q  + PSKIP+RP+KIRKLS +       
Sbjct: 25   PNQDSTTTLAVIPVQTETANNATITHANVTPQTSSPPSKIPLRPRKIRKLSPD------- 77

Query: 1435 TPVVLTPVNSSSSVTTLQTSEAKSLQITDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1256
                   V+ +SS    ++S+A S + T                                
Sbjct: 78   -----NGVDQASSSQPTESSKATSAKST-------------------------------- 100

Query: 1255 XXXXXXXXXXXTPKNRRRSAVQSARVLPQII-KPLSAEGEINAALHHLRVADPLLATLID 1079
                         K+R     Q    +P+II +PLS+EGE+ AA+ HLR AD  LA+LID
Sbjct: 101  -------------KSRAIQQQQQTLTVPRIIARPLSSEGEVEAAIRHLRNADRQLASLID 147

Query: 1078 IHQPPAFESHHSPFLALTKSILYQQLAYKAGTSIYNRFVALCGGEGAVLPDNVLGLSAQQ 899
            IH PP F+S H+PFLALT+SILYQQLA+KAGTSIY RF+ALCGGE  V+P+ VL L+ QQ
Sbjct: 148  IHPPPTFDSFHTPFLALTRSILYQQLAFKAGTSIYTRFIALCGGEAGVVPETVLALTPQQ 207

Query: 898  LKQVGVSGRKASYLYDLANKYKSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMFMIF 719
            L+Q+GVSGRKASYL+DLA KY++GILSD  +V MDDKSLFTML+MV GIGSWSVHMFMIF
Sbjct: 208  LRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIF 267

Query: 718  SLHRPDVLPVSDLGVRKGVQLLYGLEELPRPSQMELLCEKWRPYRSVGAWYMWRFVEGKG 539
            SLHRPDVLP++DLGVRKGVQLLY LEELPRPSQM+ LCEKWRPYRSV +WY+WRFVE KG
Sbjct: 268  SLHRPDVLPINDLGVRKGVQLLYSLEELPRPSQMDQLCEKWRPYRSVASWYLWRFVEAKG 327

Query: 538  SQNASAASSLEGANVXXXXXXXXXXXXXXXXXXXXLEPINGMGNLGACIWGQ 383
            + +++AA +   A                      L+ IN + N+GAC WGQ
Sbjct: 328  APSSAAAVAAGAA--------LPQPQQEEQQQPQLLDQINSLINIGACAWGQ 371


>gb|EPS66255.1| hypothetical protein M569_08523, partial [Genlisea aurea]
          Length = 321

 Score =  369 bits (948), Expect = 4e-99
 Identities = 200/343 (58%), Positives = 231/343 (67%), Gaps = 10/343 (2%)
 Frame = -2

Query: 1525 SQKPTNPSKIPIRPQKIRKLSS----------NPTSTIATTPVVLTPVNSSSSVTTLQTS 1376
            S  P NPSKIPIRPQK+RKLS+          +P    A +P+   P   SS++T   T 
Sbjct: 1    SYNPQNPSKIPIRPQKMRKLSNPASICDDKAYSPQEIGADSPLAAPP---SSALTACATV 57

Query: 1375 EAKSLQITDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTPKNRRRSA 1196
             A +                                                 +NRRRS 
Sbjct: 58   GAIT--------------------------------------PVTAAAATSAARNRRRSY 79

Query: 1195 VQSARVLPQIIKPLSAEGEINAALHHLRVADPLLATLIDIHQPPAFESHHSPFLALTKSI 1016
             Q++RV PQ+ +PL AEGE+  AL+HLRV DPL   LID + PP F++H SPF+AL KSI
Sbjct: 80   SQASRVSPQLTRPLYAEGELEIALNHLRVVDPLFGALIDAYPPPQFDTHPSPFIALAKSI 139

Query: 1015 LYQQLAYKAGTSIYNRFVALCGGEGAVLPDNVLGLSAQQLKQVGVSGRKASYLYDLANKY 836
            +YQQLA KAGTSIY RF+ALC GE AV PD+VL LS+QQLKQ+G+SGRKASYLYDLANKY
Sbjct: 140  IYQQLALKAGTSIYMRFIALCSGEEAVTPDSVLSLSSQQLKQIGISGRKASYLYDLANKY 199

Query: 835  KSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRKGVQL 656
            KSGILSDE +VKMDDKSLFTMLSMVKGIGSWSVHMFM+FSL RPDVLPVSDLGVRKGVQL
Sbjct: 200  KSGILSDELIVKMDDKSLFTMLSMVKGIGSWSVHMFMLFSLQRPDVLPVSDLGVRKGVQL 259

Query: 655  LYGLEELPRPSQMELLCEKWRPYRSVGAWYMWRFVEGKGSQNA 527
            LY L ELPRPSQME LC KWRPYRSV +WY+WR VE K S ++
Sbjct: 260  LYDLGELPRPSQMEQLCGKWRPYRSVASWYLWRIVEAKASPSS 302


>ref|XP_010254809.1| PREDICTED: uncharacterized protein LOC104595671 isoform X2 [Nelumbo
            nucifera]
          Length = 439

 Score =  367 bits (943), Expect = 1e-98
 Identities = 187/287 (65%), Positives = 218/287 (75%), Gaps = 11/287 (3%)
 Frame = -2

Query: 1210 RRRSAVQSARVLPQII-KPLSAEGEINAALHHLRVADPLLATLIDIHQPPAFESHHSPFL 1034
            + +  VQ  RVLP+++ + LS EGEI  AL +LR +DP LA LIDIHQPP F+S H PFL
Sbjct: 154  KNKIVVQQVRVLPRVVARTLSCEGEIALALQYLRNSDPQLARLIDIHQPPTFDSFHPPFL 213

Query: 1033 ALTKSILYQQLAYKAGTSIYNRFVALCGGEGAVLPDNVLGLSAQQLKQVGVSGRKASYLY 854
            ALTKSILYQQLAYKAGTSIY RFV+LCGGE  V+P+ VL LS QQL+Q+GVSGRKASYL+
Sbjct: 214  ALTKSILYQQLAYKAGTSIYTRFVSLCGGEAGVVPEAVLALSPQQLRQIGVSGRKASYLH 273

Query: 853  DLANKYKSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGV 674
            DLANKY++GILSD ++V MDDKSLFTML+MVKGIGSWSVHMFMIFSLHRPDVLPV D+GV
Sbjct: 274  DLANKYRNGILSDASIVDMDDKSLFTMLTMVKGIGSWSVHMFMIFSLHRPDVLPVGDIGV 333

Query: 673  RKGVQLLYGLEELPRPSQMELLCEKWRPYRSVGAWYMWRFVEGKGSQNASAASSLEGAN- 497
            RKGVQLLYGL++LPRPSQME LCEKWRPYRSV +WYMWRF E KG+  ASAA+   G + 
Sbjct: 334  RKGVQLLYGLDQLPRPSQMEQLCEKWRPYRSVASWYMWRFAEAKGAP-ASAAAVAVGVSQ 392

Query: 496  ---------VXXXXXXXXXXXXXXXXXXXXLEPINGMGNLGACIWGQ 383
                                          ++P++GM NLGAC WGQ
Sbjct: 393  QQQLQQHQLQQPQQQHQQHQQHQQPPQPQLIDPMHGMANLGACAWGQ 439


>ref|XP_012071867.1| PREDICTED: uncharacterized protein LOC105633802 [Jatropha curcas]
            gi|643731174|gb|KDP38512.1| hypothetical protein
            JCGZ_04437 [Jatropha curcas]
          Length = 406

 Score =  367 bits (943), Expect = 1e-98
 Identities = 207/415 (49%), Positives = 257/415 (61%), Gaps = 13/415 (3%)
 Frame = -2

Query: 1588 SDVASATQPTPVADVSINANVS----------QKPTNPSKIP-IRPQKIRKLSSNPTSTI 1442
            + V + TQP P+ D +  + ++          Q  + P+KIP  RP+KIRKLS + T+T 
Sbjct: 53   AQVQTQTQPQPLHDSTTTSTITTTNELTTIPQQTVSPPAKIPPSRPRKIRKLSPDDTATT 112

Query: 1441 ATTP--VVLTPVNSSSSVTTLQTSEAKSLQITDXXXXXXXXXXXXXXXXXXXXXXXXXXX 1268
            AT P    LT   +    TT ++++ +  Q                              
Sbjct: 113  ATDPNSSQLTTTTNEPPKTTAKSAKTRIAQT----------------------------- 143

Query: 1267 XXXXXXXXXXXXXXXTPKNRRRSAVQSARVLPQIIKPLSAEGEINAALHHLRVADPLLAT 1088
                                +   V   R++P   + LS EGE+  A+ HLR ADPLLA+
Sbjct: 144  --------------------KAIVVAPPRIIP---RSLSCEGEVENAIRHLRDADPLLAS 180

Query: 1087 LIDIHQPPAFESHHSPFLALTKSILYQQLAYKAGTSIYNRFVALCGGEGAVLPDNVLGLS 908
            LID+H PP F++ H+PFLALT+SILYQQLA+KAGTSIY RF+ALCGGE  VLP  VL L+
Sbjct: 181  LIDLHPPPTFDTFHTPFLALTRSILYQQLAFKAGTSIYTRFIALCGGEAGVLPGTVLSLT 240

Query: 907  AQQLKQVGVSGRKASYLYDLANKYKSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMF 728
             QQL+Q+GVSGRKASYL+DLA KY +GILSD  +V MDDKSLFTML+MV GIGSWSVHMF
Sbjct: 241  PQQLRQIGVSGRKASYLHDLARKYHNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMF 300

Query: 727  MIFSLHRPDVLPVSDLGVRKGVQLLYGLEELPRPSQMELLCEKWRPYRSVGAWYMWRFVE 548
            MIFSLHRPDVLP++DLGVRKGVQLLY LE+LPRPSQM+ LCEKWRPYRSV +WY+WRFVE
Sbjct: 301  MIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMDQLCEKWRPYRSVASWYLWRFVE 360

Query: 547  GKGSQNASAASSLEGANVXXXXXXXXXXXXXXXXXXXXLEPINGMGNLGACIWGQ 383
             KGS  +SA +   GA +                    L+PIN + NLGAC WGQ
Sbjct: 361  AKGSP-SSAVAVATGAGM--------TQQQQEEQQPQLLDPINSILNLGACAWGQ 406


>ref|XP_010102087.1| DNA-3-methyladenine glycosylase 1 [Morus notabilis]
            gi|587903719|gb|EXB91937.1| DNA-3-methyladenine
            glycosylase 1 [Morus notabilis]
          Length = 451

 Score =  366 bits (940), Expect = 3e-98
 Identities = 196/357 (54%), Positives = 239/357 (66%)
 Frame = -2

Query: 1564 PTPVADVSINANVSQKPTNPSKIPIRPQKIRKLSSNPTSTIATTPVVLTPVNSSSSVTTL 1385
            P+  A   ++   SQ  + PSKIP+RP+KIRKLS + + +  ++ VV  P N   S T  
Sbjct: 39   PSSTAPTELSNAPSQTSSPPSKIPLRPRKIRKLSPDDSDS-KSSQVVAVPENPKPSPTAA 97

Query: 1384 QTSEAKSLQITDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTPKNRR 1205
              ++    +I                                                +R
Sbjct: 98   AAAKPAKAKIVQ----------------------------------------------QR 111

Query: 1204 RSAVQSARVLPQIIKPLSAEGEINAALHHLRVADPLLATLIDIHQPPAFESHHSPFLALT 1025
              A+ + R+   + + LS EGE+  AL HLR ADPLLA LIDIHQPP F++ H+PFLALT
Sbjct: 112  ALAIAAPRI---VARSLSCEGEVEVALRHLRRADPLLAPLIDIHQPPTFDNFHTPFLALT 168

Query: 1024 KSILYQQLAYKAGTSIYNRFVALCGGEGAVLPDNVLGLSAQQLKQVGVSGRKASYLYDLA 845
            +SILYQQLAYKAGTSIY RF+ALCGGE  V+P+ VL L+ QQL+Q+GVSGRKASYL+DLA
Sbjct: 169  RSILYQQLAYKAGTSIYTRFIALCGGETGVVPETVLALTPQQLRQIGVSGRKASYLHDLA 228

Query: 844  NKYKSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLPVSDLGVRKG 665
             KY++GILSD  +V MDDKSLFTML+MV GIGSWSVHMFMIFSLHRPDVLP++DLGVRKG
Sbjct: 229  RKYQNGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKG 288

Query: 664  VQLLYGLEELPRPSQMELLCEKWRPYRSVGAWYMWRFVEGKGSQNASAASSLEGANV 494
            VQLLY LEELPRPSQM+ LCEKWRPYRSV AWYMWRFVE KG+   +AA+   GAN+
Sbjct: 289  VQLLYNLEELPRPSQMDQLCEKWRPYRSVAAWYMWRFVEQKGAP-PNAATVAVGANL 344


>ref|XP_008466558.1| PREDICTED: DNA-3-methyladenine glycosylase 1 [Cucumis melo]
          Length = 379

 Score =  366 bits (940), Expect = 3e-98
 Identities = 201/404 (49%), Positives = 255/404 (63%), Gaps = 6/404 (1%)
 Frame = -2

Query: 1576 SATQPTPVADVSINANV-----SQKPTNPSKIPIRPQKIRKLSSNPTSTIATTPVVLTPV 1412
            S+   TP+A  ++  +      SQ  + PSK+P+RP+KIRKLS  P  +   +  V+   
Sbjct: 30   SSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLS--PEESDPNSSHVVAIP 87

Query: 1411 NSSSSVTTLQTSEAKSLQITDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1232
            +    + T++++++K+ Q                                          
Sbjct: 88   DGPKPIATVKSNKSKTAQ------------------------------------------ 105

Query: 1231 XXXTPKNRRRSAVQSARVLPQIIKPLSAEGEINAALHHLRVADPLLATLIDIHQPPAFES 1052
                    +R+A  SA V   + + LS EGE+  AL HLR ADPLLA LID+HQ P F+S
Sbjct: 106  --------QRAAFASATV--PLARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDS 155

Query: 1051 HHSPFLALTKSILYQQLAYKAGTSIYNRFVALCGGEGAVLPDNVLGLSAQQLKQVGVSGR 872
              +PFLALT+SILYQQLAYKAGTSIY RF+ALCGGE  VLP+ VL L+ QQL+Q+G+SGR
Sbjct: 156  FQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLSLNPQQLRQIGISGR 215

Query: 871  KASYLYDLANKYKSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLHRPDVLP 692
            K+SYL+DLA KY++GILSD  +V MDDKSLFTML+MV GIGSWSVHMFMIFSLHRPDVLP
Sbjct: 216  KSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP 275

Query: 691  VSDLGVRKGVQLLYGLEELPRPSQMELLCEKWRPYRSVGAWYMWRFVEGKG-SQNASAAS 515
            ++DL VRKGVQLLY LEELPRPSQM+ LCEKWRPYRSVG+WYMWR  E KG S +A+A +
Sbjct: 276  INDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVA 335

Query: 514  SLEGANVXXXXXXXXXXXXXXXXXXXXLEPINGMGNLGACIWGQ 383
            +     +                    L+P+NG+ NLGAC WGQ
Sbjct: 336  AGASLQLQQQDHHQEHQHPQHPQQPQLLDPLNGILNLGACAWGQ 379


>ref|XP_012444918.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 isoform X2
            [Gossypium raimondii] gi|763791263|gb|KJB58259.1|
            hypothetical protein B456_009G201500 [Gossypium
            raimondii]
          Length = 395

 Score =  365 bits (938), Expect = 6e-98
 Identities = 204/409 (49%), Positives = 254/409 (62%), Gaps = 11/409 (2%)
 Frame = -2

Query: 1576 SATQPTPVADVSINANVSQKPTN-----------PSKIPIRPQKIRKLSSNPTSTIATTP 1430
            S+T P      +  A V+  PT            PSKIP RP+KIRKLS +    ++  P
Sbjct: 39   SSTAPVSTVTTACTAIVACGPTELVNVPLSTLSPPSKIPSRPRKIRKLSPD----LSFDP 94

Query: 1429 VVLTPVNSSSSVTTLQTSEAKSLQITDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1250
                   +SSS  T  T + K++  T                                  
Sbjct: 95   NASQQATTSSS--TSLTEQRKTVGRTSKTKL----------------------------- 123

Query: 1249 XXXXXXXXXTPKNRRRSAVQSARVLPQIIKPLSAEGEINAALHHLRVADPLLATLIDIHQ 1070
                          R  AV + R+   I + LS EGE+  A+HHLR ADPLLA+LID+H 
Sbjct: 124  -----------SQHRALAVVAPRI---ISRSLSCEGEVENAIHHLRDADPLLASLIDLHP 169

Query: 1069 PPAFESHHSPFLALTKSILYQQLAYKAGTSIYNRFVALCGGEGAVLPDNVLGLSAQQLKQ 890
            PP F++ H+PFLALT+SILYQQLA+KAGTSIY RF++LCGGE  V+P+ VL L++QQL+Q
Sbjct: 170  PPTFDTFHAPFLALTRSILYQQLAFKAGTSIYTRFISLCGGENGVVPETVLSLTSQQLRQ 229

Query: 889  VGVSGRKASYLYDLANKYKSGILSDETVVKMDDKSLFTMLSMVKGIGSWSVHMFMIFSLH 710
            +GVSGRKASYL+DLA KY++GILSD  +V MDDKSLFTML+MV GIGSWSVHMFMIFSLH
Sbjct: 230  IGVSGRKASYLHDLARKYQTGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLH 289

Query: 709  RPDVLPVSDLGVRKGVQLLYGLEELPRPSQMELLCEKWRPYRSVGAWYMWRFVEGKGSQN 530
            RPDVLP++DLGVRKGVQLLY LEELPRPSQM+ LCEKWRPYRSV +WY+WR+VE KG+ +
Sbjct: 290  RPDVLPINDLGVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVASWYLWRYVEAKGAPS 349

Query: 529  ASAASSLEGANVXXXXXXXXXXXXXXXXXXXXLEPINGMGNLGACIWGQ 383
            ++AA +   A                      ++PIN + NLGAC WGQ
Sbjct: 350  SAAAVA---AGASLPPLQQQEEPQQHQQQPQLMDPINSILNLGACAWGQ 395

