BLASTX nr result

ID: Forsythia22_contig00040369 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00040369
         (1747 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011080589.1| PREDICTED: A/G-specific adenine DNA glycosyl...   650   0.0  
ref|XP_012847854.1| PREDICTED: A/G-specific adenine DNA glycosyl...   631   e-178
ref|XP_012847802.1| PREDICTED: A/G-specific adenine DNA glycosyl...   610   e-173
gb|EYU46093.1| hypothetical protein MIMGU_mgv1a022080mg, partial...   612   e-172
ref|XP_004246789.2| PREDICTED: A/G-specific adenine DNA glycosyl...   566   e-158
ref|XP_006344357.1| PREDICTED: A/G-specific adenine DNA glycosyl...   566   e-158
emb|CDP04005.1| unnamed protein product [Coffea canephora]            565   e-158
ref|XP_009769615.1| PREDICTED: A/G-specific adenine DNA glycosyl...   561   e-157
ref|XP_009610155.1| PREDICTED: A/G-specific adenine DNA glycosyl...   556   e-155
ref|XP_010251435.1| PREDICTED: A/G-specific adenine DNA glycosyl...   551   e-154
ref|XP_007049485.1| HhH-GPD base excision DNA repair family prot...   551   e-154
ref|XP_011008193.1| PREDICTED: A/G-specific adenine DNA glycosyl...   547   e-152
ref|XP_012473350.1| PREDICTED: A/G-specific adenine DNA glycosyl...   546   e-152
gb|KJB08737.1| hypothetical protein B456_001G100200 [Gossypium r...   546   e-152
gb|KDO51051.1| hypothetical protein CISIN_1g010868mg [Citrus sin...   544   e-152
ref|XP_006447890.1| hypothetical protein CICLE_v10015195mg [Citr...   543   e-151
ref|XP_010069551.1| PREDICTED: A/G-specific adenine DNA glycosyl...   543   e-151
ref|XP_008236019.1| PREDICTED: A/G-specific adenine DNA glycosyl...   542   e-151
ref|XP_012084114.1| PREDICTED: A/G-specific adenine DNA glycosyl...   541   e-151
ref|XP_010679041.1| PREDICTED: A/G-specific adenine DNA glycosyl...   539   e-150

>ref|XP_011080589.1| PREDICTED: A/G-specific adenine DNA glycosylase [Sesamum indicum]
          Length = 448

 Score =  650 bits (1676), Expect = 0.0
 Identities = 327/433 (75%), Positives = 370/433 (85%), Gaps = 1/433 (0%)
 Frame = -1

Query: 1627 KRCRQEKTKTEPRA-LXXXXXXXXXXXXXXXIRASLLKWYDENQRDLPWRRISKNGDNNG 1451
            KRCRQ  +  +P   +               IR SLL+WYDEN+RDLPWRR+S   D+  
Sbjct: 3    KRCRQAGSGNKPTLEVVDIEDISFSNKEIPKIRTSLLEWYDENRRDLPWRRLSSGQDD-- 60

Query: 1450 VSVGERERRAYAVWVSEVMLQQTRVQTVIDYFNRWMEKWPTIHDLAQADIEEVNELWAGL 1271
            V V  RER+AYAVWVSEVMLQQTRVQTV+DYFNRWMEKWPTIH LA+A IEEVNE+WAGL
Sbjct: 61   VHVEHRERKAYAVWVSEVMLQQTRVQTVVDYFNRWMEKWPTIHHLARASIEEVNEMWAGL 120

Query: 1270 GYYRRARFLLEGAKMIVEDRTEFPKTVSSLRKVKGIGDYTAGAIASIAFNETVPVVDGNV 1091
            GYYRRARFLLEGAKMIVE   EFPKT SSL+ VKGIG+YTAGAIASIAF ETVPVVDGNV
Sbjct: 121  GYYRRARFLLEGAKMIVEGGGEFPKTASSLKMVKGIGNYTAGAIASIAFEETVPVVDGNV 180

Query: 1090 IRVIARLKALSANPKDSKTVKNVWKLAGQLVDPSRPGDFNQALMELGATVCTPFGPSCSA 911
            +RVIARLKA+SANPK+S TVKN+WKLA QLVDP RPGDFNQA+MELGATVC+P  PSCS 
Sbjct: 181  VRVIARLKAISANPKNSATVKNIWKLARQLVDPKRPGDFNQAVMELGATVCSPAAPSCST 240

Query: 910  CPISHQCRAVLLSTKDKSVQVTDYPMKVVKAKQRRDFSAVSVVEIVEVDGSRSDSRFLLV 731
            CPISHQC+A+ LS  ++S+QVTDYPMKV KAKQRRD+SAVSVVEIVE +GS+SDSR+LLV
Sbjct: 241  CPISHQCQALSLSRSNESIQVTDYPMKVTKAKQRRDYSAVSVVEIVE-EGSQSDSRYLLV 299

Query: 730  KRPDNGLLAGLWEFPSVLLEGETDLASRRKAIDNFLKESFGVDIRKSCKIVLREEVGEYV 551
            KRPD GLLAG WEFPSVLL+GE DLASRRKAID FLK+SFG+D  KSCK+VLREE+GEYV
Sbjct: 300  KRPDQGLLAGQWEFPSVLLDGEADLASRRKAIDIFLKQSFGLDKEKSCKVVLREEIGEYV 359

Query: 550  HVFSHIRLKMCIELLILNLKGGKNFMQRTQESATLTWKFVDGEALSSLGLTSGVRKVYTM 371
            HVF+HIRLKM +ELLIL+LKGG NF+QR QES T+TWKFVD +ALS+LGLTSGVRKVY M
Sbjct: 360  HVFTHIRLKMHVELLILHLKGGINFLQRNQESTTMTWKFVDNKALSTLGLTSGVRKVYNM 419

Query: 370  VEKFKQNSSDSVP 332
            +E+FKQN SDS+P
Sbjct: 420  IEEFKQNRSDSLP 432


>ref|XP_012847854.1| PREDICTED: A/G-specific adenine DNA glycosylase isoform X2
            [Erythranthe guttatus]
          Length = 492

 Score =  631 bits (1628), Expect = e-178
 Identities = 333/480 (69%), Positives = 377/480 (78%), Gaps = 9/480 (1%)
 Frame = -1

Query: 1744 TFYLAGKTHSP---PNLHSVRRRSLPPTIVSTKEDATESTTMKRCRQE-----KTKTEPR 1589
            T YLAGK HSP    N      RS PP         T ++TMKRCR+E     K+  EP 
Sbjct: 2    THYLAGKLHSPLIISNFAGRHHRSPPPP-----PPPTAASTMKRCRKEESTRIKSTVEPV 56

Query: 1588 ALXXXXXXXXXXXXXXXIRASLLKWYDENQRDLPWRRISKNGDNNGVSVGERERRAYAVW 1409
             +                R SLL+WYDEN+RDLPWRRIS  G  N V V ERE+RAYAVW
Sbjct: 57   DIEDISFRGKEIQKI---RESLLEWYDENRRDLPWRRISNGG--NDVGVEEREKRAYAVW 111

Query: 1408 VSEVMLQQTRVQTVIDYFNRWMEKWPTIHDLAQADIEEVNELWAGLGYYRRARFLLEGAK 1229
            VSEVMLQQTRVQTV+DYFNRWM KWPTIH LAQA IEEVNE+WAGLGYYRRARFLLEGA+
Sbjct: 112  VSEVMLQQTRVQTVVDYFNRWMGKWPTIHHLAQASIEEVNEMWAGLGYYRRARFLLEGAQ 171

Query: 1228 MIVEDRTEFPKTVSSLRKVKGIGDYTAGAIASIAFNETVPVVDGNVIRVIARLKALSANP 1049
            M+VE   EFPKT + L  V+GIG YTAGAIASIAF+E VPVVDGNVIRVI RLKA+SANP
Sbjct: 172  MVVEGGGEFPKTATDLEMVRGIGKYTAGAIASIAFDEAVPVVDGNVIRVITRLKAISANP 231

Query: 1048 KDSKTVKNVWKLAGQLVDPSRPGDFNQALMELGATVCTPFGPSCSACPISHQCRAVLLST 869
            K++ TVKN+WKLA QLVDP RPGDFNQA+MELGAT C+   PSCS CP+SHQC+A+ LS 
Sbjct: 232  KNAATVKNIWKLARQLVDPLRPGDFNQAIMELGATACSVTSPSCSTCPVSHQCQALSLSR 291

Query: 868  KDKSVQVTDYPMKVVKAKQRRDFSAVSVVEIVEVDGSRSDSRFLLVKRPDNGLLAGLWEF 689
            K +SVQVTDYPMKV KAK R DFSAVSVVEIV+ +GS+S SR+LLVKRPD GLLAGLWEF
Sbjct: 292  KQESVQVTDYPMKVAKAKPRHDFSAVSVVEIVD-EGSQSKSRYLLVKRPDEGLLAGLWEF 350

Query: 688  PSVLLEGETDLASRRKAIDNFLKESFGVDIRKSCKIVLREEVGEYVHVFSHIRLKMCIEL 509
            PSVLL GE DLASRRKAID+FLK+SFG+D +KSCK+V REEVGE VHVF+HIRLKM IEL
Sbjct: 351  PSVLLVGEADLASRRKAIDSFLKQSFGIDTKKSCKVVSREEVGECVHVFTHIRLKMYIEL 410

Query: 508  LILNL-KGGKNFMQRTQESATLTWKFVDGEALSSLGLTSGVRKVYTMVEKFKQNSSDSVP 332
            LIL L +GG N + + QES+T+ WKFVD +ALS+LGLTSGVRKV TMVEKFKQ+  +SVP
Sbjct: 411  LILQLTEGGMNCLHKKQESSTMKWKFVDDKALSTLGLTSGVRKVCTMVEKFKQSGPNSVP 470


>ref|XP_012847802.1| PREDICTED: A/G-specific adenine DNA glycosylase isoform X1
            [Erythranthe guttatus]
          Length = 503

 Score =  610 bits (1574), Expect(2) = e-173
 Identities = 322/464 (69%), Positives = 364/464 (78%), Gaps = 9/464 (1%)
 Frame = -1

Query: 1744 TFYLAGKTHSP---PNLHSVRRRSLPPTIVSTKEDATESTTMKRCRQE-----KTKTEPR 1589
            T YLAGK HSP    N      RS PP         T ++TMKRCR+E     K+  EP 
Sbjct: 2    THYLAGKLHSPLIISNFAGRHHRSPPPP-----PPPTAASTMKRCRKEESTRIKSTVEPV 56

Query: 1588 ALXXXXXXXXXXXXXXXIRASLLKWYDENQRDLPWRRISKNGDNNGVSVGERERRAYAVW 1409
             +                R SLL+WYDEN+RDLPWRRIS  G  N V V ERE+RAYAVW
Sbjct: 57   DIEDISFRGKEIQKI---RESLLEWYDENRRDLPWRRISNGG--NDVGVEEREKRAYAVW 111

Query: 1408 VSEVMLQQTRVQTVIDYFNRWMEKWPTIHDLAQADIEEVNELWAGLGYYRRARFLLEGAK 1229
            VSEVMLQQTRVQTV+DYFNRWM KWPTIH LAQA IEEVNE+WAGLGYYRRARFLLEGA+
Sbjct: 112  VSEVMLQQTRVQTVVDYFNRWMGKWPTIHHLAQASIEEVNEMWAGLGYYRRARFLLEGAQ 171

Query: 1228 MIVEDRTEFPKTVSSLRKVKGIGDYTAGAIASIAFNETVPVVDGNVIRVIARLKALSANP 1049
            M+VE   EFPKT + L  V+GIG YTAGAIASIAF+E VPVVDGNVIRVI RLKA+SANP
Sbjct: 172  MVVEGGGEFPKTATDLEMVRGIGKYTAGAIASIAFDEAVPVVDGNVIRVITRLKAISANP 231

Query: 1048 KDSKTVKNVWKLAGQLVDPSRPGDFNQALMELGATVCTPFGPSCSACPISHQCRAVLLST 869
            K++ TVKN+WKLA QLVDP RPGDFNQA+MELGAT C+   PSCS CP+SHQC+A+ LS 
Sbjct: 232  KNAATVKNIWKLARQLVDPLRPGDFNQAIMELGATACSVTSPSCSTCPVSHQCQALSLSR 291

Query: 868  KDKSVQVTDYPMKVVKAKQRRDFSAVSVVEIVEVDGSRSDSRFLLVKRPDNGLLAGLWEF 689
            K +SVQVTDYPMKV KAK R DFSAVSVVEIV+ +GS+S SR+LLVKRPD GLLAGLWEF
Sbjct: 292  KQESVQVTDYPMKVAKAKPRHDFSAVSVVEIVD-EGSQSKSRYLLVKRPDEGLLAGLWEF 350

Query: 688  PSVLLEGETDLASRRKAIDNFLKESFGVDIRKSCKIVLREEVGEYVHVFSHIRLKMCIEL 509
            PSVLL GE DLASRRKAID+FLK+SFG+D +KSCK+V REEVGE VHVF+HIRLKM IEL
Sbjct: 351  PSVLLVGEADLASRRKAIDSFLKQSFGIDTKKSCKVVSREEVGECVHVFTHIRLKMYIEL 410

Query: 508  LILNL-KGGKNFMQRTQESATLTWKFVDGEALSSLGLTSGVRKV 380
            LIL L +GG N + + QES+T+ WKFVD +ALS+LGLTSGVRKV
Sbjct: 411  LILQLTEGGMNCLHKKQESSTMKWKFVDDKALSTLGLTSGVRKV 454



 Score = 26.9 bits (58), Expect(2) = e-173
 Identities = 13/21 (61%), Positives = 15/21 (71%)
 Frame = -2

Query: 396 LVYGRFTLWLRNSSRIVLIQS 334
           +V  RF  WLRNSSR+  IQS
Sbjct: 482 IVKCRFVPWLRNSSRVGPIQS 502


>gb|EYU46093.1| hypothetical protein MIMGU_mgv1a022080mg, partial [Erythranthe
            guttata]
          Length = 433

 Score =  612 bits (1577), Expect = e-172
 Identities = 308/402 (76%), Positives = 347/402 (86%), Gaps = 1/402 (0%)
 Frame = -1

Query: 1534 RASLLKWYDENQRDLPWRRISKNGDNNGVSVGERERRAYAVWVSEVMLQQTRVQTVIDYF 1355
            R SLL+WYDEN+RDLPWRRIS  G  N V V ERE+RAYAVWVSEVMLQQTRVQTV+DYF
Sbjct: 13   RESLLEWYDENRRDLPWRRISNGG--NDVGVEEREKRAYAVWVSEVMLQQTRVQTVVDYF 70

Query: 1354 NRWMEKWPTIHDLAQADIEEVNELWAGLGYYRRARFLLEGAKMIVEDRTEFPKTVSSLRK 1175
            NRWM KWPTIH LAQA IEEVNE+WAGLGYYRRARFLLEGA+M+VE   EFPKT + L  
Sbjct: 71   NRWMGKWPTIHHLAQASIEEVNEMWAGLGYYRRARFLLEGAQMVVEGGGEFPKTATDLEM 130

Query: 1174 VKGIGDYTAGAIASIAFNETVPVVDGNVIRVIARLKALSANPKDSKTVKNVWKLAGQLVD 995
            V+GIG YTAGAIASIAF+E VPVVDGNVIRVI RLKA+SANPK++ TVKN+WKLA QLVD
Sbjct: 131  VRGIGKYTAGAIASIAFDEAVPVVDGNVIRVITRLKAISANPKNAATVKNIWKLARQLVD 190

Query: 994  PSRPGDFNQALMELGATVCTPFGPSCSACPISHQCRAVLLSTKDKSVQVTDYPMKVVKAK 815
            P RPGDFNQA+MELGAT C+   PSCS CP+SHQC+A+ LS K +SVQVTDYPMKV KAK
Sbjct: 191  PLRPGDFNQAIMELGATACSVTSPSCSTCPVSHQCQALSLSRKQESVQVTDYPMKVAKAK 250

Query: 814  QRRDFSAVSVVEIVEVDGSRSDSRFLLVKRPDNGLLAGLWEFPSVLLEGETDLASRRKAI 635
             R DFSAVSVVEIV+ +GS+S SR+LLVKRPD GLLAGLWEFPSVLL GE DLASRRKAI
Sbjct: 251  PRHDFSAVSVVEIVD-EGSQSKSRYLLVKRPDEGLLAGLWEFPSVLLVGEADLASRRKAI 309

Query: 634  DNFLKESFGVDIRKSCKIVLREEVGEYVHVFSHIRLKMCIELLILNL-KGGKNFMQRTQE 458
            D+FLK+SFG+D +KSCK+V REEVGE VHVF+HIRLKM IELLIL L +GG N + + QE
Sbjct: 310  DSFLKQSFGIDTKKSCKVVSREEVGECVHVFTHIRLKMYIELLILQLTEGGMNCLHKKQE 369

Query: 457  SATLTWKFVDGEALSSLGLTSGVRKVYTMVEKFKQNSSDSVP 332
            S+T+ WKFVD +ALS+LGLTSGVRKV TMVEKFKQ+  +SVP
Sbjct: 370  SSTMKWKFVDDKALSTLGLTSGVRKVCTMVEKFKQSGPNSVP 411


>ref|XP_004246789.2| PREDICTED: A/G-specific adenine DNA glycosylase isoform X1 [Solanum
            lycopersicum]
          Length = 476

 Score =  567 bits (1460), Expect = e-158
 Identities = 276/404 (68%), Positives = 335/404 (82%), Gaps = 3/404 (0%)
 Frame = -1

Query: 1534 RASLLKWYDENQRDLPWRRISKNGDNNGVSVGERERRAYAVWVSEVMLQQTRVQTVIDYF 1355
            RASLL+WYDENQRDLPWRRIS   D       ER++R YAVWVSEVMLQQTRV TVIDYF
Sbjct: 70   RASLLEWYDENQRDLPWRRISGGSD-------ERDKRGYAVWVSEVMLQQTRVSTVIDYF 122

Query: 1354 NRWMEKWPTIHDLAQADIEEVNELWAGLGYYRRARFLLEGAKMIVEDRTEFPKTVSSLRK 1175
             RWM KWPT+H LAQA +EEVNE+WAGLGYYRR RFLL+GAK +VE+   FP+TVS LRK
Sbjct: 123  KRWMNKWPTLHHLAQASLEEVNEMWAGLGYYRRVRFLLQGAKEVVEEGGSFPETVSELRK 182

Query: 1174 VKGIGDYTAGAIASIAFNETVPVVDGNVIRVIARLKALSANPKDSKTVKNVWKLAGQLVD 995
            +KGIG+YTAGAIASIAF + VPVVDGNV+RVI+RLKA+SANPKD+ TVK+ WKLAGQLVD
Sbjct: 183  IKGIGEYTAGAIASIAFKKVVPVVDGNVVRVISRLKAISANPKDTATVKSFWKLAGQLVD 242

Query: 994  PSRPGDFNQALMELGATVCTPFGPSCSACPISHQCRAVLLSTKDKSVQVTDYPMKVVKAK 815
            P RPGDFNQALMELGAT+C+   P C+ CPIS QC A+ LS +++SV V+DYP KVVKAK
Sbjct: 243  PCRPGDFNQALMELGATLCSLSNPGCAVCPISAQCHALSLSRQNESVHVSDYPTKVVKAK 302

Query: 814  QRRDFSAVSVVEIV---EVDGSRSDSRFLLVKRPDNGLLAGLWEFPSVLLEGETDLASRR 644
            QR +FSAVSVVEI+   E+ GS+S+S+++LVKRP+ GLLAGLWEFPS+LLE E DLASRR
Sbjct: 303  QRHEFSAVSVVEILDCQEMTGSQSNSKYILVKRPNEGLLAGLWEFPSILLEKEADLASRR 362

Query: 643  KAIDNFLKESFGVDIRKSCKIVLREEVGEYVHVFSHIRLKMCIELLILNLKGGKNFMQRT 464
            KAIDNFL+ S  +D+++S +IV RE++GE+VHVFSHIRLKM +ELL+L+ KG ++     
Sbjct: 363  KAIDNFLQSSLNLDLKESTRIVSREDIGEFVHVFSHIRLKMYVELLVLHPKGNRSIEDEK 422

Query: 463  QESATLTWKFVDGEALSSLGLTSGVRKVYTMVEKFKQNSSDSVP 332
             +  ++TWK+VDG+ L S+GLTSGVRKVYTMV+K KQ    ++P
Sbjct: 423  LDKESITWKYVDGKNLDSMGLTSGVRKVYTMVQKHKQTEQATIP 466


>ref|XP_006344357.1| PREDICTED: A/G-specific adenine DNA glycosylase-like isoform X1
            [Solanum tuberosum]
          Length = 456

 Score =  566 bits (1458), Expect = e-158
 Identities = 277/404 (68%), Positives = 335/404 (82%), Gaps = 3/404 (0%)
 Frame = -1

Query: 1534 RASLLKWYDENQRDLPWRRISKNGDNNGVSVGERERRAYAVWVSEVMLQQTRVQTVIDYF 1355
            RASLL+WYDENQRDLPWRRIS   D       ER++R YAVWVSEVMLQQTRV TVIDYF
Sbjct: 50   RASLLEWYDENQRDLPWRRISSGFD-------ERDKRGYAVWVSEVMLQQTRVSTVIDYF 102

Query: 1354 NRWMEKWPTIHDLAQADIEEVNELWAGLGYYRRARFLLEGAKMIVEDRTEFPKTVSSLRK 1175
             RWM KWPT+H LAQA +EEVNE+WAGLGYYRR RFLL+GAK +VE+   FP+TVS LRK
Sbjct: 103  KRWMNKWPTLHHLAQASLEEVNEMWAGLGYYRRVRFLLQGAKEVVEEGGSFPETVSELRK 162

Query: 1174 VKGIGDYTAGAIASIAFNETVPVVDGNVIRVIARLKALSANPKDSKTVKNVWKLAGQLVD 995
            +KGIG+YT+GAIASIAFN+ VPVVDGNV+RVI+RLKA+SANPKD+ TVK+ WKLAGQLVD
Sbjct: 163  IKGIGEYTSGAIASIAFNKAVPVVDGNVVRVISRLKAISANPKDAATVKSFWKLAGQLVD 222

Query: 994  PSRPGDFNQALMELGATVCTPFGPSCSACPISHQCRAVLLSTKDKSVQVTDYPMKVVKAK 815
            P RPGDFNQALMELGAT+C+   P C+ACPIS QC A+ LS + +SV V+DYP KVVKAK
Sbjct: 223  PCRPGDFNQALMELGATLCSLSNPGCAACPISAQCHALSLSRQSESVHVSDYPTKVVKAK 282

Query: 814  QRRDFSAVSVVEIV---EVDGSRSDSRFLLVKRPDNGLLAGLWEFPSVLLEGETDLASRR 644
            QR +FSAVSVVEI+   E+ G +S S+++LVKRPD GLLAGLWEFPS+LLE E DLASRR
Sbjct: 283  QRHEFSAVSVVEILDCQEMTGPQSSSKYILVKRPDEGLLAGLWEFPSILLEKEADLASRR 342

Query: 643  KAIDNFLKESFGVDIRKSCKIVLREEVGEYVHVFSHIRLKMCIELLILNLKGGKNFMQRT 464
            KAIDNFL+ SF +D+++S +IV RE++GE VHVFSHIRLKM +ELL+L+ KG ++   + 
Sbjct: 343  KAIDNFLQSSFYLDLKESTRIVSREDIGECVHVFSHIRLKMYVELLVLHPKGNRSIDYKK 402

Query: 463  QESATLTWKFVDGEALSSLGLTSGVRKVYTMVEKFKQNSSDSVP 332
             +  ++TWK+VDG+ L S+GL+SGVRKVYTMV+K KQ    ++P
Sbjct: 403  LDKESITWKYVDGKNLGSMGLSSGVRKVYTMVQKHKQTEQATIP 446


>emb|CDP04005.1| unnamed protein product [Coffea canephora]
          Length = 513

 Score =  565 bits (1457), Expect = e-158
 Identities = 297/483 (61%), Positives = 354/483 (73%), Gaps = 19/483 (3%)
 Frame = -1

Query: 1723 THSPPNLHSVRRRSLPPTIVSTKE------------DATESTTMKRCRQEKTK-TEPRAL 1583
            THS    H++R R   PT VS  +            D ++    +R  + K K T+    
Sbjct: 19   THS----HTLRNRRTRPTTVSMDDIIGNTQNTVAPSDQSKKKRPRRVVRPKPKSTQVEKS 74

Query: 1582 XXXXXXXXXXXXXXXIRASLLKWYDENQRDLPWRRISKNGDN--NGVSVGERERRAYAVW 1409
                           IRASLLKWYDENQRDLPWRRIS  G++  +     E E+RAYAVW
Sbjct: 75   DDIEDINFTEDETVEIRASLLKWYDENQRDLPWRRISSKGEDEEDNEDTEESEKRAYAVW 134

Query: 1408 VSEVMLQQTRVQTVIDYFNRWMEKWPTIHDLAQADIEEVNELWAGLGYYRRARFLLEGAK 1229
            VSEVMLQQTRVQTVIDYFN+WM KWPT+  LAQA +EEVNE+WAGLGYYRRARFLLEGAK
Sbjct: 135  VSEVMLQQTRVQTVIDYFNKWMTKWPTLSHLAQASLEEVNEMWAGLGYYRRARFLLEGAK 194

Query: 1228 MIVEDRTEFPKTVSSLRKVKGIGDYTAGAIASIAFNETVPVVDGNVIRVIARLKALSANP 1049
            MIVE+   FPK V +LRKVKGIG+YTAGAIASIAF E VPVVDGNV+RVIARLKA+S NP
Sbjct: 195  MIVEEGGGFPKAVPALRKVKGIGEYTAGAIASIAFKEVVPVVDGNVVRVIARLKAVSTNP 254

Query: 1048 KDSKTVKNVWKLAGQLVDPSRPGDFNQALMELGATVCTPFGPSCSACPISHQCRAVLLST 869
            K++  VKN WKLAGQLVD  RPGDFNQALMELGATVCTP  PSC+ CPIS +CRA+LLS 
Sbjct: 255  KEAVAVKNTWKLAGQLVDLCRPGDFNQALMELGATVCTPSSPSCNECPISTKCRALLLSR 314

Query: 868  KDKSVQVTDYPMKVVKAKQRRDFSAVSVVEIVE----VDGSRSDSRFLLVKRPDNGLLAG 701
               SVQVTDYPMK+VKAKQR DF+AV+VVE++E     D +  +S+F+LVKR D GLLAG
Sbjct: 315  CHDSVQVTDYPMKIVKAKQRSDFAAVTVVEVLEGPRMKDEAHPNSKFILVKRADKGLLAG 374

Query: 700  LWEFPSVLLEGETDLASRRKAIDNFLKESFGVDIRKSCKIVLREEVGEYVHVFSHIRLKM 521
            LWEFPSVLL+GE D  +RR AID++LK +F +D  KSC I+ RE+VGEYVHVF+HIRLKM
Sbjct: 375  LWEFPSVLLDGEADSVTRRDAIDHYLKSAFDLDPTKSCDIISREDVGEYVHVFTHIRLKM 434

Query: 520  CIELLILNLKGGKNFMQRTQESATLTWKFVDGEALSSLGLTSGVRKVYTMVEKFKQNSSD 341
             +E ++L++K  K    + Q    + WKFVD + LS +GLTSGVRKVY M+E +KQ +S 
Sbjct: 435  YVEWMVLHVKCFKKLWNKKQGEDDINWKFVDQQTLSCMGLTSGVRKVYGMIENYKQRTSS 494

Query: 340  SVP 332
            S+P
Sbjct: 495  SLP 497


>ref|XP_009769615.1| PREDICTED: A/G-specific adenine DNA glycosylase [Nicotiana
            sylvestris]
          Length = 493

 Score =  561 bits (1447), Expect = e-157
 Identities = 286/448 (63%), Positives = 340/448 (75%), Gaps = 6/448 (1%)
 Frame = -1

Query: 1657 KEDATESTTMKRCRQEKTKTEPRALXXXXXXXXXXXXXXXIRASLLKWYDENQRDLPWRR 1478
            K  A      +R  Q K K E                   IRASLL+WYD NQRDLPWRR
Sbjct: 36   KRTAISKKRPRRTTQPKPKIEVPTSGDIEDFSFSKDEALQIRASLLEWYDNNQRDLPWRR 95

Query: 1477 ISKN---GDNNGVSVGERERRAYAVWVSEVMLQQTRVQTVIDYFNRWMEKWPTIHDLAQA 1307
            IS +   G        ERE+R YAVWVSEVMLQQTRV TVIDYFNRWM KWPT+H LAQA
Sbjct: 96   ISSSSSCGFKEEDDDDEREKRGYAVWVSEVMLQQTRVSTVIDYFNRWMNKWPTLHHLAQA 155

Query: 1306 DIEEVNELWAGLGYYRRARFLLEGAKMIVEDRTEFPKTVSSLRKVKGIGDYTAGAIASIA 1127
             +EEVNE+WAGLGYYRRARFLLEGAK +VE    FP+TVS LR +KGIG+YTAGAI+SIA
Sbjct: 156  SLEEVNEMWAGLGYYRRARFLLEGAKEVVEQGGTFPETVSDLRNIKGIGEYTAGAISSIA 215

Query: 1126 FNETVPVVDGNVIRVIARLKALSANPKDSKTVKNVWKLAGQLVDPSRPGDFNQALMELGA 947
            F + VPVVDGNV+RVI+RLKA+SANPKD+ TVK +WKLAGQLVDP RPGDFNQALMELGA
Sbjct: 216  FKKAVPVVDGNVVRVISRLKAISANPKDAATVKKIWKLAGQLVDPFRPGDFNQALMELGA 275

Query: 946  TVCTPFGPSCSACPISHQCRAVLLSTKDKSVQVTDYPMKVVKAKQRRDFSAVSVVEIV-- 773
            T+C+   P C+ACPIS QC A+ LS +++SV VTDYP+KV+KAKQR +FSAVSVVEI+  
Sbjct: 276  TLCSLSNPGCAACPISAQCHALSLSRQNESVHVTDYPIKVMKAKQRHEFSAVSVVEILDC 335

Query: 772  -EVDGSRSDSRFLLVKRPDNGLLAGLWEFPSVLLEGETDLASRRKAIDNFLKESFGVDIR 596
             E  G +S S+F+LVKRP+ GLLAGLWEFPSVLLE E DLASRR AID FL+ SF +D++
Sbjct: 336  QETIGPQSSSKFILVKRPNKGLLAGLWEFPSVLLEKEADLASRRIAIDKFLQSSFNLDLK 395

Query: 595  KSCKIVLREEVGEYVHVFSHIRLKMCIELLILNLKGGKNFMQRTQESATLTWKFVDGEAL 416
            +S +IV RE +GEYVHVFSHIRLKM IELL+L  KG ++   + ++  ++TWK+VD + L
Sbjct: 396  ESIRIVSREYIGEYVHVFSHIRLKMYIELLVLRPKGNRSIDYKKRDKESMTWKYVDSKNL 455

Query: 415  SSLGLTSGVRKVYTMVEKFKQNSSDSVP 332
             S+GLTSGVRKVY MV+K KQ    ++P
Sbjct: 456  DSMGLTSGVRKVYNMVQKHKQTDQGTIP 483


>ref|XP_009610155.1| PREDICTED: A/G-specific adenine DNA glycosylase [Nicotiana
            tomentosiformis]
          Length = 493

 Score =  556 bits (1433), Expect = e-155
 Identities = 277/404 (68%), Positives = 329/404 (81%), Gaps = 4/404 (0%)
 Frame = -1

Query: 1534 RASLLKWYDENQRDLPWRRISKNGDNN-GVSVGERERRAYAVWVSEVMLQQTRVQTVIDY 1358
            RASLL+WYD NQRDLPWRRIS +          ERE+R YAVWVSEVMLQQTRV TVIDY
Sbjct: 79   RASLLEWYDNNQRDLPWRRISSSSSCGFKEDDDEREKRGYAVWVSEVMLQQTRVSTVIDY 138

Query: 1357 FNRWMEKWPTIHDLAQADIEEVNELWAGLGYYRRARFLLEGAKMIVEDRTEFPKTVSSLR 1178
            FNRWM KWPT+  LAQA +EEVNE+WAGLGYYRRARFLLEGAK +VE    FP+TVS LR
Sbjct: 139  FNRWMNKWPTLRHLAQASLEEVNEMWAGLGYYRRARFLLEGAKEVVEQGGTFPETVSDLR 198

Query: 1177 KVKGIGDYTAGAIASIAFNETVPVVDGNVIRVIARLKALSANPKDSKTVKNVWKLAGQLV 998
             +KGIG+YTAGAI+SIAF + VPVVDGNV+RVI+RLKA+SANPKD+ +VKN WKLAGQLV
Sbjct: 199  NIKGIGEYTAGAISSIAFKKAVPVVDGNVVRVISRLKAISANPKDAASVKNFWKLAGQLV 258

Query: 997  DPSRPGDFNQALMELGATVCTPFGPSCSACPISHQCRAVLLSTKDKSVQVTDYPMKVVKA 818
            DP RPGDFNQALMELGAT+C+   P C+ACPIS QC A+ LS +++SV VTDYP+KV+KA
Sbjct: 259  DPFRPGDFNQALMELGATLCSLSNPGCAACPISAQCHALSLSRQNESVHVTDYPIKVMKA 318

Query: 817  KQRRDFSAVSVVEIV---EVDGSRSDSRFLLVKRPDNGLLAGLWEFPSVLLEGETDLASR 647
            KQR +FSAVSVVEI+   E  G +S S+F+LVKRP+NGLLAGLWEFPSVLLE E DLASR
Sbjct: 319  KQRHEFSAVSVVEILDCQETIGPQSSSKFILVKRPNNGLLAGLWEFPSVLLEKEADLASR 378

Query: 646  RKAIDNFLKESFGVDIRKSCKIVLREEVGEYVHVFSHIRLKMCIELLILNLKGGKNFMQR 467
            R AID FL+ SF +D+++S +IV RE +GEYVHVFSHIRLKM IELL+L  KG  +   +
Sbjct: 379  RIAIDKFLQSSFNLDLKESIRIVSREYIGEYVHVFSHIRLKMYIELLVLRPKGNNSIDYK 438

Query: 466  TQESATLTWKFVDGEALSSLGLTSGVRKVYTMVEKFKQNSSDSV 335
             Q+  ++TWK+VD + L S+GLTSGVRKVY+MV+K KQ    ++
Sbjct: 439  KQDKESMTWKYVDSKNLDSMGLTSGVRKVYSMVQKHKQTDQGTI 482


>ref|XP_010251435.1| PREDICTED: A/G-specific adenine DNA glycosylase [Nelumbo nucifera]
          Length = 486

 Score =  551 bits (1421), Expect = e-154
 Identities = 270/405 (66%), Positives = 329/405 (81%), Gaps = 4/405 (0%)
 Frame = -1

Query: 1534 RASLLKWYDENQRDLPWRRISKNGDNNGVSVGERERRAYAVWVSEVMLQQTRVQTVIDYF 1355
            R+SLL+WY ENQR LPWR+   + DNN   V +   RAYAVWVSEVMLQQTRV +VIDY+
Sbjct: 77   RSSLLQWYYENQRVLPWRKNQDDEDNNAQGVSDT--RAYAVWVSEVMLQQTRVASVIDYY 134

Query: 1354 NRWMEKWPTIHDLAQADIEEVNELWAGLGYYRRARFLLEGAKMIVEDRTEFPKTVSSLRK 1175
            NRWMEKWPT++ LAQA  EEVNE+WAGLGYYRRAR+LLEGAK+IVE R EFPKTVS+LR+
Sbjct: 135  NRWMEKWPTVYHLAQASQEEVNEMWAGLGYYRRARYLLEGAKLIVE-RGEFPKTVSALRE 193

Query: 1174 VKGIGDYTAGAIASIAFNETVPVVDGNVIRVIARLKALSANPKDSKTVKNVWKLAGQLVD 995
            + GIGDYTAGAIASIAF ETVPVVDGNV+RVIARLKA+SANPK+ KT+K+ WKLAGQLVD
Sbjct: 194  IPGIGDYTAGAIASIAFKETVPVVDGNVVRVIARLKAISANPKEGKTIKSFWKLAGQLVD 253

Query: 994  PSRPGDFNQALMELGATVCTPFGPSCSACPISHQCRAVLLSTKDKSVQVTDYPMKVVKAK 815
            P RPGDFNQALMELGAT+C P  PSCS CPIS QC A+ +S   +S+QVTDYP K+VKA+
Sbjct: 254  PLRPGDFNQALMELGATICNPSSPSCSTCPISEQCHALSVSRNCQSIQVTDYPTKIVKAE 313

Query: 814  QRRDFSAVSVVEIVE----VDGSRSDSRFLLVKRPDNGLLAGLWEFPSVLLEGETDLASR 647
            +R DF+AV VVEI E     +G      FLLVKRP+ GLLAGLWEFPSVLL GE +L +R
Sbjct: 314  KRCDFAAVCVVEISEGPDIQEGDHKSKGFLLVKRPEEGLLAGLWEFPSVLLGGEVNLITR 373

Query: 646  RKAIDNFLKESFGVDIRKSCKIVLREEVGEYVHVFSHIRLKMCIELLILNLKGGKNFMQR 467
            RK +D +LK+SF +D +++C I LRE VGEYVH+FSHI+L+M +EL++L+LKGG+N +  
Sbjct: 374  RKVMDQYLKKSFNLDAKRNCSIALREVVGEYVHIFSHIQLRMYVELMVLHLKGGENIIFP 433

Query: 466  TQESATLTWKFVDGEALSSLGLTSGVRKVYTMVEKFKQNSSDSVP 332
              +  T+TWK VDG+++ S+GLTSGVRKVY M++KFK++     P
Sbjct: 434  KMDKETVTWKLVDGKSIQSMGLTSGVRKVYNMIQKFKKSRLSKNP 478


>ref|XP_007049485.1| HhH-GPD base excision DNA repair family protein [Theobroma cacao]
            gi|508701746|gb|EOX93642.1| HhH-GPD base excision DNA
            repair family protein [Theobroma cacao]
          Length = 461

 Score =  551 bits (1419), Expect = e-154
 Identities = 279/411 (67%), Positives = 330/411 (80%), Gaps = 10/411 (2%)
 Frame = -1

Query: 1534 RASLLKWYDENQRDLPWRR-ISKNGDNNGVSVGERE---RRAYAVWVSEVMLQQTRVQTV 1367
            R+SLL+WYD+NQRDLPWRR  +K+G+   V   E E   +RAY VWVSEVMLQQTRVQTV
Sbjct: 43   RSSLLEWYDKNQRDLPWRRRTTKSGNGKNVKKEEEEDDEKRAYGVWVSEVMLQQTRVQTV 102

Query: 1366 IDYFNRWMEKWPTIHDLAQADIEEVNELWAGLGYYRRARFLLEGAKMIVEDRTEFPKTVS 1187
            IDY+ RWM+KWPT+  LAQA +EEVNE+WAGLGYYRRARFLLEGAKMIV   +EFP TVS
Sbjct: 103  IDYYKRWMQKWPTLQHLAQASLEEVNEMWAGLGYYRRARFLLEGAKMIVARGSEFPNTVS 162

Query: 1186 SLRKVKGIGDYTAGAIASIAFNETVPVVDGNVIRVIARLKALSANPKDSKTVKNVWKLAG 1007
            +LRKV GIGDYTAGAIASIAF E VPVVDGNV+RV+ARLKA+SANPKD  TVKN WKLA 
Sbjct: 163  TLRKVPGIGDYTAGAIASIAFKEVVPVVDGNVVRVLARLKAISANPKDKTTVKNFWKLAA 222

Query: 1006 QLVDPSRPGDFNQALMELGATVCTPFGPSCSACPISHQCRAVLLSTKDKSVQVTDYPMKV 827
            QLVDPSRPGDFNQ+LMELGAT+CTP  PSCS+CP+S QC A+  S  D+SV VT YP KV
Sbjct: 223  QLVDPSRPGDFNQSLMELGATLCTPLNPSCSSCPVSSQCCALYNSKNDESVVVTRYPTKV 282

Query: 826  VKAKQRRDFSAVSVVEIVEVDG----SRSDSRFLLVKRPDNGLLAGLWEFPSVLLEGETD 659
            VKAKQR+DFS V VVEI    G    S+ DSRFLLVKRPD GLLAGLWEFPSV L+ E D
Sbjct: 283  VKAKQRQDFSTVCVVEISGSQGTLHQSQPDSRFLLVKRPDEGLLAGLWEFPSVTLDEEAD 342

Query: 658  LASRRKAIDNFLKESFGVDIRKSCKIVLREEVGEYVHVFSHIRLKMCIELLILNLKGGKN 479
            LA RRK ID  LK+SF ++  K+C I+ R  VGE+VHVFSHIR K+ +ELL+L+LKGG +
Sbjct: 343  LAMRRKLIDQLLKKSFKLNPPKNCSIISRVLVGEFVHVFSHIRRKIYVELLVLHLKGGMH 402

Query: 478  FMQRTQESATLTWKFVDGEALSSLGLTSGVRKVYTMVEKFKQN--SSDSVP 332
             + + ++S T+ WK +D +A+S +GLTS V+KVY+MV+ FKQN  S+ S+P
Sbjct: 403  DLYKEKDSGTMDWKLLDSDAVSRMGLTSSVQKVYSMVQNFKQNGLSNSSIP 453


>ref|XP_011008193.1| PREDICTED: A/G-specific adenine DNA glycosylase [Populus euphratica]
          Length = 517

 Score =  547 bits (1409), Expect = e-152
 Identities = 275/406 (67%), Positives = 320/406 (78%), Gaps = 6/406 (1%)
 Frame = -1

Query: 1534 RASLLKWYDENQRDLPWRRISKNGDN--NGVSVGERERRAYAVWVSEVMLQQTRVQTVID 1361
            RASLL WYD NQRDLPWRRI++  +         E E RAY VWVSEVMLQQTRVQTVID
Sbjct: 96   RASLLDWYDHNQRDLPWRRITQTKETPFKEEEEEEEEERAYGVWVSEVMLQQTRVQTVID 155

Query: 1360 YFNRWMEKWPTIHDLAQADIEEVNELWAGLGYYRRARFLLEGAKMIVEDRTEFPKTVSSL 1181
            Y+NRWM KWPT+H LAQA +EEVNE+WAGLGYYRRARFLLEGAKMIV     FPK VSSL
Sbjct: 156  YYNRWMLKWPTLHHLAQASLEEVNEMWAGLGYYRRARFLLEGAKMIVAGGDGFPKIVSSL 215

Query: 1180 RKVKGIGDYTAGAIASIAFNETVPVVDGNVIRVIARLKALSANPKDSKTVKNVWKLAGQL 1001
            RKV GIGDYTAGAIASIAF E VPVVDGNVIRV+ARLKA+SANPKD  TVK  WKLA QL
Sbjct: 216  RKVPGIGDYTAGAIASIAFKEVVPVVDGNVIRVLARLKAISANPKDKVTVKKFWKLAAQL 275

Query: 1000 VDPSRPGDFNQALMELGATVCTPFGPSCSACPISHQCRAVLLSTKDKSVQVTDYPMKVVK 821
            VDP RPGDFNQ+LMELGATVCTP  PSCS+CP+S QCRA+ +S  DK V +TDYP K +K
Sbjct: 276  VDPHRPGDFNQSLMELGATVCTPVNPSCSSCPVSGQCRALTISKLDKLVLITDYPAKSIK 335

Query: 820  AKQRRDFSAVSVVEIVE----VDGSRSDSRFLLVKRPDNGLLAGLWEFPSVLLEGETDLA 653
             KQR +FSAV  VEI      ++G +S S FLLVKRPD GLLAGLWEFPSV+L  E DL 
Sbjct: 336  LKQRHEFSAVCAVEISGSRDLIEGDQSSSVFLLVKRPDEGLLAGLWEFPSVMLGKEADLT 395

Query: 652  SRRKAIDNFLKESFGVDIRKSCKIVLREEVGEYVHVFSHIRLKMCIELLILNLKGGKNFM 473
             RR  ++ FLK+SF +D +K+C ++LRE++GE++H+F+HIRLK+ +ELLI++LKG  + +
Sbjct: 396  RRRNEMNRFLKKSFRLDPQKTCSVLLREDIGEFIHIFTHIRLKVYVELLIVHLKGDMSDL 455

Query: 472  QRTQESATLTWKFVDGEALSSLGLTSGVRKVYTMVEKFKQNSSDSV 335
               Q    +TWK VD +ALSSLGLTSGVRKV TMV+KFKQ S  +V
Sbjct: 456  FSKQSGENMTWKCVDRKALSSLGLTSGVRKVCTMVQKFKQKSLSTV 501


>ref|XP_012473350.1| PREDICTED: A/G-specific adenine DNA glycosylase-like isoform X4
            [Gossypium raimondii]
          Length = 492

 Score =  546 bits (1407), Expect = e-152
 Identities = 274/409 (66%), Positives = 327/409 (79%), Gaps = 8/409 (1%)
 Frame = -1

Query: 1534 RASLLKWYDENQRDLPWRRISKNGDN--NGVSVGERERRAYAVWVSEVMLQQTRVQTVID 1361
            RASLL+WYD+NQRDLPWR  +K  +N  N     E E+RAY VWVSEVMLQQTRVQTVID
Sbjct: 76   RASLLEWYDKNQRDLPWRTSTKKSENGENVQEEEEEEKRAYGVWVSEVMLQQTRVQTVID 135

Query: 1360 YFNRWMEKWPTIHDLAQADIEEVNELWAGLGYYRRARFLLEGAKMIVEDRTEFPKTVSSL 1181
            Y+NRWM KWPT+  L+QA +EEVNE+WAGLGYYRRARFLLEGAKMIV + +EFP TV +L
Sbjct: 136  YYNRWMLKWPTLQHLSQASLEEVNEMWAGLGYYRRARFLLEGAKMIVAEGSEFPNTVFAL 195

Query: 1180 RKVKGIGDYTAGAIASIAFNETVPVVDGNVIRVIARLKALSANPKDSKTVKNVWKLAGQL 1001
            RKV GIGDYTAGAIASIAF + VPVVDGNV+RV+ARLKA+SANPKD  TVKN WKLA QL
Sbjct: 196  RKVPGIGDYTAGAIASIAFKQVVPVVDGNVVRVLARLKAISANPKDKTTVKNFWKLAAQL 255

Query: 1000 VDPSRPGDFNQALMELGATVCTPFGPSCSACPISHQCRAVLLSTKDKSVQVTDYPMKVVK 821
            VDPSRPGDFNQ+LMELGAT+CTP  P+C++CP+S QCRA+  S  D+SV V DYPMKVVK
Sbjct: 256  VDPSRPGDFNQSLMELGATLCTPLNPNCTSCPVSSQCRALHNSRNDESVMVMDYPMKVVK 315

Query: 820  AKQRRDFSAVSVVEIV----EVDGSRSDSRFLLVKRPDNGLLAGLWEFPSVLLEGETDLA 653
             KQR DFS VSVVEI      +  ++S+SR LLVKRPD GLLAGLWEFP V L+ E DL+
Sbjct: 316  TKQRNDFSTVSVVEISRSQDRLQQTKSNSRVLLVKRPDEGLLAGLWEFPCVTLDEEADLS 375

Query: 652  SRRKAIDNFLKESFGVDIRKSCKIVLREEVGEYVHVFSHIRLKMCIELLILNLKGGKNFM 473
             RRK ID  LK+SF ++  K+C ++ RE VGE+VHVFSHIR K+ +ELL+L+LKGGK+ +
Sbjct: 376  MRRKLIDQLLKKSFKLNPPKNCNVISRELVGEFVHVFSHIRRKIYVELLVLHLKGGKHVL 435

Query: 472  QRTQESATLTWKFVDGEALSSLGLTSGVRKVYTMVEKFKQN--SSDSVP 332
                +     WK +D EA+S +GLTS VRKVY+MV+KFKQ+  S++SVP
Sbjct: 436  FEEDDINATDWKLLDCEAVSRMGLTSSVRKVYSMVQKFKQDGTSNNSVP 484


>gb|KJB08737.1| hypothetical protein B456_001G100200 [Gossypium raimondii]
          Length = 451

 Score =  546 bits (1407), Expect = e-152
 Identities = 274/409 (66%), Positives = 327/409 (79%), Gaps = 8/409 (1%)
 Frame = -1

Query: 1534 RASLLKWYDENQRDLPWRRISKNGDN--NGVSVGERERRAYAVWVSEVMLQQTRVQTVID 1361
            RASLL+WYD+NQRDLPWR  +K  +N  N     E E+RAY VWVSEVMLQQTRVQTVID
Sbjct: 35   RASLLEWYDKNQRDLPWRTSTKKSENGENVQEEEEEEKRAYGVWVSEVMLQQTRVQTVID 94

Query: 1360 YFNRWMEKWPTIHDLAQADIEEVNELWAGLGYYRRARFLLEGAKMIVEDRTEFPKTVSSL 1181
            Y+NRWM KWPT+  L+QA +EEVNE+WAGLGYYRRARFLLEGAKMIV + +EFP TV +L
Sbjct: 95   YYNRWMLKWPTLQHLSQASLEEVNEMWAGLGYYRRARFLLEGAKMIVAEGSEFPNTVFAL 154

Query: 1180 RKVKGIGDYTAGAIASIAFNETVPVVDGNVIRVIARLKALSANPKDSKTVKNVWKLAGQL 1001
            RKV GIGDYTAGAIASIAF + VPVVDGNV+RV+ARLKA+SANPKD  TVKN WKLA QL
Sbjct: 155  RKVPGIGDYTAGAIASIAFKQVVPVVDGNVVRVLARLKAISANPKDKTTVKNFWKLAAQL 214

Query: 1000 VDPSRPGDFNQALMELGATVCTPFGPSCSACPISHQCRAVLLSTKDKSVQVTDYPMKVVK 821
            VDPSRPGDFNQ+LMELGAT+CTP  P+C++CP+S QCRA+  S  D+SV V DYPMKVVK
Sbjct: 215  VDPSRPGDFNQSLMELGATLCTPLNPNCTSCPVSSQCRALHNSRNDESVMVMDYPMKVVK 274

Query: 820  AKQRRDFSAVSVVEIV----EVDGSRSDSRFLLVKRPDNGLLAGLWEFPSVLLEGETDLA 653
             KQR DFS VSVVEI      +  ++S+SR LLVKRPD GLLAGLWEFP V L+ E DL+
Sbjct: 275  TKQRNDFSTVSVVEISRSQDRLQQTKSNSRVLLVKRPDEGLLAGLWEFPCVTLDEEADLS 334

Query: 652  SRRKAIDNFLKESFGVDIRKSCKIVLREEVGEYVHVFSHIRLKMCIELLILNLKGGKNFM 473
             RRK ID  LK+SF ++  K+C ++ RE VGE+VHVFSHIR K+ +ELL+L+LKGGK+ +
Sbjct: 335  MRRKLIDQLLKKSFKLNPPKNCNVISRELVGEFVHVFSHIRRKIYVELLVLHLKGGKHVL 394

Query: 472  QRTQESATLTWKFVDGEALSSLGLTSGVRKVYTMVEKFKQN--SSDSVP 332
                +     WK +D EA+S +GLTS VRKVY+MV+KFKQ+  S++SVP
Sbjct: 395  FEEDDINATDWKLLDCEAVSRMGLTSSVRKVYSMVQKFKQDGTSNNSVP 443


>gb|KDO51051.1| hypothetical protein CISIN_1g010868mg [Citrus sinensis]
            gi|641832008|gb|KDO51052.1| hypothetical protein
            CISIN_1g010868mg [Citrus sinensis]
          Length = 498

 Score =  544 bits (1402), Expect = e-152
 Identities = 268/407 (65%), Positives = 329/407 (80%), Gaps = 6/407 (1%)
 Frame = -1

Query: 1534 RASLLKWYDENQRDLPWRRISKNGDNNGVSVGERERRAYAVWVSEVMLQQTRVQTVIDYF 1355
            R SLL+WYD+NQR+LPWR  S++         E+E+RAY VWVSEVMLQQTRVQTVIDY+
Sbjct: 84   RQSLLQWYDKNQRELPWRERSESDKEE-----EKEKRAYGVWVSEVMLQQTRVQTVIDYY 138

Query: 1354 NRWMEKWPTIHDLAQADIEEVNELWAGLGYYRRARFLLEGAKMIVEDRTEFPKTVSSLRK 1175
            NRWM KWPTIH LA+A +EEVNE+WAGLGYYRRARFLLEGAKMIV +   FP TVS LRK
Sbjct: 139  NRWMTKWPTIHHLAKASLEEVNEMWAGLGYYRRARFLLEGAKMIVAEGDGFPNTVSDLRK 198

Query: 1174 VKGIGDYTAGAIASIAFNETVPVVDGNVIRVIARLKALSANPKDSKTVKNVWKLAGQLVD 995
            V GIG+YTAGAIASIAF E VPVVDGNVIRV+ARLKA+SANPKD+ TVKN WKLA QLVD
Sbjct: 199  VPGIGNYTAGAIASIAFKEVVPVVDGNVIRVLARLKAISANPKDTSTVKNFWKLATQLVD 258

Query: 994  PSRPGDFNQALMELGATVCTPFGPSCSACPISHQCRAVLLSTKDKSVQVTDYPMKVVKAK 815
              RPGDFNQ+LMELGA +CTP  P+C++CP+S +C+A  +S +D SV VT YPMKV+KA+
Sbjct: 259  SCRPGDFNQSLMELGAVICTPLNPNCTSCPVSDKCQAYSMSKRDNSVLVTSYPMKVLKAR 318

Query: 814  QRRDFSAVSVVEIV----EVDGSRSDSRFLLVKRPDNGLLAGLWEFPSVLLEGETDLASR 647
            QR D SA  VVEI+    E + ++ D  F+LVKR D GLLAGLWEFPS++L+GETD+ +R
Sbjct: 319  QRHDVSAACVVEILGGNDESERTQPDGVFILVKRRDEGLLAGLWEFPSIILDGETDITTR 378

Query: 646  RKAIDNFLKESFGVDIRKSCKIVLREEVGEYVHVFSHIRLKMCIELLILNLKGGKNFMQR 467
            R+A + FLK+SF +D R +C I+LRE+VGE+VH+FSHIRLK+ +ELL+L +KGG +    
Sbjct: 379  REAAECFLKKSFNLDPRNNCSIILREDVGEFVHIFSHIRLKVHVELLVLCIKGGIDKWVE 438

Query: 466  TQESATLTWKFVDGEALSSLGLTSGVRKVYTMVEKFKQN--SSDSVP 332
             Q+  TL+WK VDG  L+S+GLTSGVRKVYTMV+KFKQ   +++S+P
Sbjct: 439  KQDKGTLSWKCVDGGTLASMGLTSGVRKVYTMVQKFKQKRLTTNSIP 485


>ref|XP_006447890.1| hypothetical protein CICLE_v10015195mg [Citrus clementina]
            gi|568830187|ref|XP_006469387.1| PREDICTED: A/G-specific
            adenine DNA glycosylase-like isoform X1 [Citrus sinensis]
            gi|557550501|gb|ESR61130.1| hypothetical protein
            CICLE_v10015195mg [Citrus clementina]
          Length = 456

 Score =  543 bits (1400), Expect = e-151
 Identities = 268/407 (65%), Positives = 328/407 (80%), Gaps = 6/407 (1%)
 Frame = -1

Query: 1534 RASLLKWYDENQRDLPWRRISKNGDNNGVSVGERERRAYAVWVSEVMLQQTRVQTVIDYF 1355
            R SLL+WYD+NQR+LPWR  S++         E+E+RAY VWVSEVMLQQTRVQTVIDY+
Sbjct: 42   RQSLLQWYDKNQRELPWRERSESDKEE-----EKEKRAYGVWVSEVMLQQTRVQTVIDYY 96

Query: 1354 NRWMEKWPTIHDLAQADIEEVNELWAGLGYYRRARFLLEGAKMIVEDRTEFPKTVSSLRK 1175
            NRWM KWPTIH LA+A +EEVNE+WAGLGYYRRARFLLEGAKMIV +   FP TVS LRK
Sbjct: 97   NRWMTKWPTIHHLAKASLEEVNEMWAGLGYYRRARFLLEGAKMIVAEGDGFPNTVSDLRK 156

Query: 1174 VKGIGDYTAGAIASIAFNETVPVVDGNVIRVIARLKALSANPKDSKTVKNVWKLAGQLVD 995
            V GIG+YTAGAIASIAF E VPVVDGNVIRV+ARLKA+SANPKD+ TVKN WKLA QLVD
Sbjct: 157  VPGIGNYTAGAIASIAFKEVVPVVDGNVIRVLARLKAISANPKDTSTVKNFWKLATQLVD 216

Query: 994  PSRPGDFNQALMELGATVCTPFGPSCSACPISHQCRAVLLSTKDKSVQVTDYPMKVVKAK 815
              RPGDFNQ+LMELGA +CTP  P+C++CP+S +C+A  +S  D SV VT YPMKV+KA+
Sbjct: 217  SCRPGDFNQSLMELGAVICTPLNPNCTSCPVSDKCQAYSMSKCDNSVLVTSYPMKVLKAR 276

Query: 814  QRRDFSAVSVVEIV----EVDGSRSDSRFLLVKRPDNGLLAGLWEFPSVLLEGETDLASR 647
            QR D SA  VVEI+    E + ++ D  F+LVKR D GLLAGLWEFPS++L+GETD+ +R
Sbjct: 277  QRHDVSAACVVEILGGNDESERTQPDGVFILVKRRDEGLLAGLWEFPSIILDGETDITTR 336

Query: 646  RKAIDNFLKESFGVDIRKSCKIVLREEVGEYVHVFSHIRLKMCIELLILNLKGGKNFMQR 467
            R+A + FLK+SF +D R +C I+LRE+VGE+VH+FSHIRLK+ +ELL+L +KGG +    
Sbjct: 337  REAAECFLKKSFNLDPRNNCSIILREDVGEFVHIFSHIRLKVHVELLVLRIKGGIDKWVE 396

Query: 466  TQESATLTWKFVDGEALSSLGLTSGVRKVYTMVEKFKQN--SSDSVP 332
             Q+  TL+WK VDG  L+S+GLTSGVRKVYTMV+KFKQ   +++S+P
Sbjct: 397  KQDKGTLSWKCVDGGTLASMGLTSGVRKVYTMVQKFKQKRLTTNSIP 443


>ref|XP_010069551.1| PREDICTED: A/G-specific adenine DNA glycosylase isoform X1
            [Eucalyptus grandis]
          Length = 509

 Score =  543 bits (1399), Expect = e-151
 Identities = 278/409 (67%), Positives = 325/409 (79%), Gaps = 12/409 (2%)
 Frame = -1

Query: 1534 RASLLKWYDENQRDLPWRRISKNGDNNGVSVGERE--------RRAYAVWVSEVMLQQTR 1379
            RASLL+WYD N+RDLPWR  + NG  N  +  E E        RRAY VWVSEVMLQQTR
Sbjct: 91   RASLLEWYDRNRRDLPWR--ASNGGGNAGNAQEDEDGDGEEEDRRAYGVWVSEVMLQQTR 148

Query: 1378 VQTVIDYFNRWMEKWPTIHDLAQADIEEVNELWAGLGYYRRARFLLEGAKMIVEDRTEFP 1199
            VQTVIDY+NRWM KWPT+H LA A +EEVNE+WAGLGYYRRARFLLEGAKMIV     FP
Sbjct: 149  VQTVIDYYNRWMLKWPTLHHLASASLEEVNEMWAGLGYYRRARFLLEGAKMIVTGGEGFP 208

Query: 1198 KTVSSLRKVKGIGDYTAGAIASIAFNETVPVVDGNVIRVIARLKALSANPKDSKTVKNVW 1019
            +TV +LRK+ GIGDYTAGAIASIAFNE VPVVDGNV+RV+ARLKA+SANPKDS TVK  W
Sbjct: 209  RTVETLRKIPGIGDYTAGAIASIAFNEVVPVVDGNVVRVLARLKAVSANPKDSATVKKFW 268

Query: 1018 KLAGQLVDPSRPGDFNQALMELGATVCTPFGPSCSACPISHQCRAVLLSTKDKSVQVTDY 839
            KLA QLVDP RPGDFNQ+LMELGAT+CTP  PSCS+CPIS QC+A+ +S KD+SV VTDY
Sbjct: 269  KLAAQLVDPDRPGDFNQSLMELGATLCTPSNPSCSSCPISIQCQALAISRKDESVTVTDY 328

Query: 838  PMKVVKAKQRRDFSAVSVVEIVEVDGS----RSDSRFLLVKRPDNGLLAGLWEFPSVLLE 671
            P K +K KQR +FSAV VVEI+  D S     S+S +LLVKRPD GLLAGLWEFPSV+L+
Sbjct: 329  PSKGIKTKQREEFSAVCVVEILRGDDSFASNSSESGYLLVKRPDEGLLAGLWEFPSVMLK 388

Query: 670  GETDLASRRKAIDNFLKESFGVDIRKSCKIVLREEVGEYVHVFSHIRLKMCIELLILNLK 491
             E D  +RRKAID+FL++SFG++    C  V RE+VG++VH+FSHIRL++  ELL+L LK
Sbjct: 389  DEADSDTRRKAIDHFLEQSFGLN-STVCIPVTREDVGDFVHIFSHIRLRIFAELLVLRLK 447

Query: 490  GGKNFMQRTQESATLTWKFVDGEALSSLGLTSGVRKVYTMVEKFKQNSS 344
               +F  R     TLTWK+VD EALSSLGLTSGVRKVY M++KFK++SS
Sbjct: 448  DEMSFF-RKHSKKTLTWKYVDSEALSSLGLTSGVRKVYAMIQKFKKSSS 495


>ref|XP_008236019.1| PREDICTED: A/G-specific adenine DNA glycosylase [Prunus mume]
          Length = 453

 Score =  542 bits (1397), Expect = e-151
 Identities = 278/455 (61%), Positives = 334/455 (73%), Gaps = 4/455 (0%)
 Frame = -1

Query: 1699 SVRRRSLPPTIVSTKEDATESTTMKRCRQEKTKTEPRALXXXXXXXXXXXXXXXIRASLL 1520
            S R++     + + +  A  S   ++ ++ +   +   +               IR +LL
Sbjct: 6    SSRKKKDAAVVANKRPPAAASLPQRQTQRRRQSAKESEIQDIEDLFFSEEETQRIRKALL 65

Query: 1519 KWYDENQRDLPWRRISKNGDNNGVSVGERERRAYAVWVSEVMLQQTRVQTVIDYFNRWME 1340
            +WY  N+R+LPWR   +          + ERRAY VWVSEVMLQQTRVQTV+ YF+RWM 
Sbjct: 66   EWYGLNRRELPWREAEE----------DVERRAYRVWVSEVMLQQTRVQTVVQYFHRWMS 115

Query: 1339 KWPTIHDLAQADIEEVNELWAGLGYYRRARFLLEGAKMIVEDRTEFPKTVSSLRKVKGIG 1160
            KWPTIH LAQA +EEVNELWAGLGYYRRARFLLEGA+MIV +  +FPKTVS LRKV+GIG
Sbjct: 116  KWPTIHHLAQASLEEVNELWAGLGYYRRARFLLEGARMIVAEEVQFPKTVSQLRKVRGIG 175

Query: 1159 DYTAGAIASIAFNETVPVVDGNVIRVIARLKALSANPKDSKTVKNVWKLAGQLVDPSRPG 980
            DYTAGAIASIAF E VPVVDGNV+RVIARLKA+SANPKDS TVK  WKLA QLVD  +PG
Sbjct: 176  DYTAGAIASIAFKEVVPVVDGNVVRVIARLKAVSANPKDSSTVKKFWKLAAQLVDTFQPG 235

Query: 979  DFNQALMELGATVCTPFGPSCSACPISHQCRAVLLSTKDKSVQVTDYPMKVVKAKQRRDF 800
            DFNQALMELGATVCTP  PSC +CP+S QC A+ +S  D SV VTDYP+KVVKAKQR DF
Sbjct: 236  DFNQALMELGATVCTPLSPSCHSCPVSVQCCALSISRADSSVLVTDYPVKVVKAKQRHDF 295

Query: 799  SAVSVVEIVE----VDGSRSDSRFLLVKRPDNGLLAGLWEFPSVLLEGETDLASRRKAID 632
            SAV VV+I+      +G R+++ FLLVKRPD GLLAGLWEFPSVLL GE DL +RRKAID
Sbjct: 296  SAVCVVQILRDEELSEGHRTNNGFLLVKRPDEGLLAGLWEFPSVLLAGEADLVTRRKAID 355

Query: 631  NFLKESFGVDIRKSCKIVLREEVGEYVHVFSHIRLKMCIELLILNLKGGKNFMQRTQESA 452
             +L + F ++ R +C IV RE VGE +HVF+HIRLKM +ELL+L+LKGG   +   Q   
Sbjct: 356  QYLNKHFRLNPRNTCDIVSREYVGENIHVFTHIRLKMYVELLVLHLKGGMKDLVSKQGKE 415

Query: 451  TLTWKFVDGEALSSLGLTSGVRKVYTMVEKFKQNS 347
            T+ WK VD E LSS+GLTSGVRKVYTMV+KFK+ +
Sbjct: 416  TVPWKCVDAEVLSSMGLTSGVRKVYTMVQKFKRET 450


>ref|XP_012084114.1| PREDICTED: A/G-specific adenine DNA glycosylase isoform X1 [Jatropha
            curcas] gi|643739419|gb|KDP45173.1| hypothetical protein
            JCGZ_15038 [Jatropha curcas]
          Length = 465

 Score =  541 bits (1395), Expect = e-151
 Identities = 277/410 (67%), Positives = 329/410 (80%), Gaps = 9/410 (2%)
 Frame = -1

Query: 1534 RASLLKWYDENQRDLPWRRISKNGDNNGVSVGERE---RRAYAVWVSEVMLQQTRVQTVI 1364
            R SLL WYD NQR LPWRR  KN   N + + E E   +RAY VWVSEVMLQQTRVQTVI
Sbjct: 45   RESLLDWYDHNQRVLPWRR--KN--TNPLEIEEEEEKGKRAYGVWVSEVMLQQTRVQTVI 100

Query: 1363 DYFNRWMEKWPTIHDLAQADIEEVNELWAGLGYYRRARFLLEGAKMIVEDRTEFPKTVSS 1184
            DY+NRWM KWPT+ +LA A +EEVNE+WAGLGYYRRARFLLEGAKMIV +   FP TVSS
Sbjct: 101  DYYNRWMLKWPTLENLALASLEEVNEMWAGLGYYRRARFLLEGAKMIVAEGGGFPSTVSS 160

Query: 1183 LRKVKGIGDYTAGAIASIAFNETVPVVDGNVIRVIARLKALSANPKDSKTVKNVWKLAGQ 1004
            LRKV GIG+YTAGAIASIAF E VPVVDGNVIRV+ARLKA+S NPK+   +KN WKLA Q
Sbjct: 161  LRKVPGIGNYTAGAIASIAFGEVVPVVDGNVIRVLARLKAISTNPKNLVAIKNFWKLAAQ 220

Query: 1003 LVDPSRPGDFNQALMELGATVCTPFGPSCSACPISHQCRAVLLSTKDKSVQVTDYPMKVV 824
            LVDP RPGDFNQ+LMELGATVCTP  P+CS CP+S+QCRA+ +S +DKSV VTDYP KVV
Sbjct: 221  LVDPCRPGDFNQSLMELGATVCTPSNPNCSLCPVSNQCRALSIS-EDKSVLVTDYPAKVV 279

Query: 823  KAKQRRDFSAVSVVEIV----EVDGSRSDSRFLLVKRPDNGLLAGLWEFPSVLLEGETDL 656
            K KQR +FSAV VVEI+      DG +S+S FLLVKRPD+GLLAGLWEFP+V+L+ E DL
Sbjct: 280  KVKQRNEFSAVCVVEILGSQGPTDGDQSESGFLLVKRPDDGLLAGLWEFPTVMLDKEADL 339

Query: 655  ASRRKAIDNFLKESFGVDIRKSCKIVLREEVGEYVHVFSHIRLKMCIELLILNLKGGKNF 476
              R K I+ FLK++F +D +++C IVLRE++GE+VH+FSHIRLK+ +ELL++ LKGG   
Sbjct: 340  TKRTKEINQFLKKTFKIDPQRTCSIVLREDIGEFVHIFSHIRLKVYVELLVICLKGGTTE 399

Query: 475  MQRTQESATLTWKFVDGEALSSLGLTSGVRKVYTMVEKFKQN--SSDSVP 332
            +    +    +WK+V+ +ALS+LGLTSGVRKVYTMVEKFKQN  S+DS P
Sbjct: 400  LFSEHKKEATSWKYVNKKALSNLGLTSGVRKVYTMVEKFKQNRLSTDSAP 449


>ref|XP_010679041.1| PREDICTED: A/G-specific adenine DNA glycosylase [Beta vulgaris subsp.
            vulgaris] gi|870858670|gb|KMT10158.1| hypothetical
            protein BVRB_5g119190 [Beta vulgaris subsp. vulgaris]
          Length = 468

 Score =  539 bits (1388), Expect = e-150
 Identities = 268/400 (67%), Positives = 320/400 (80%), Gaps = 4/400 (1%)
 Frame = -1

Query: 1534 RASLLKWYDENQRDLPWRRISKNGDNNGVSVGERERRAYAVWVSEVMLQQTRVQTVIDYF 1355
            RASLL+WYD+N+RDLPWR ++   D         ER+AY VWVSEVMLQQTRV TVIDY+
Sbjct: 60   RASLLEWYDKNKRDLPWRNLNDVDDGG-------ERKAYGVWVSEVMLQQTRVVTVIDYY 112

Query: 1354 NRWMEKWPTIHDLAQADIEEVNELWAGLGYYRRARFLLEGAKMIVEDRTEFPKTVSSLRK 1175
            NRWM+KWP+IH L+ A +EEVNE+WAGLGYYRRAR+LLEG K I+E+   FP+TVSSLRK
Sbjct: 113  NRWMQKWPSIHLLSLASLEEVNEMWAGLGYYRRARYLLEGTKKIIEEGGTFPRTVSSLRK 172

Query: 1174 VKGIGDYTAGAIASIAFNETVPVVDGNVIRVIARLKALSANPKDSKTVKNVWKLAGQLVD 995
            + GIGDYT+GAIASIAFNE VPVVDGNV+RV+ARLKA+SANPKDS TVK  W+LAGQLVD
Sbjct: 173  IPGIGDYTSGAIASIAFNEVVPVVDGNVVRVLARLKAISANPKDSVTVKKFWRLAGQLVD 232

Query: 994  PSRPGDFNQALMELGATVCTPFGPSCSACPISHQCRAVLLSTKDKSVQVTDYPMKVVKAK 815
            P RPG+FNQALMELGAT CT   PSCS CP+S QC A  LS   K   VTDYP KVVKAK
Sbjct: 233  PRRPGEFNQALMELGATTCTVTSPSCSECPVSAQCHA--LSLSQKGGLVTDYPAKVVKAK 290

Query: 814  QRRDFSAVSVVEIVE----VDGSRSDSRFLLVKRPDNGLLAGLWEFPSVLLEGETDLASR 647
             R +FSAV VVEI E    V+  +  SR+LLVKRP+ GLLAGLWEFPSVLL  E++ A R
Sbjct: 291  PRNEFSAVCVVEITESRNLVEAYQGTSRYLLVKRPNEGLLAGLWEFPSVLLGKESETAIR 350

Query: 646  RKAIDNFLKESFGVDIRKSCKIVLREEVGEYVHVFSHIRLKMCIELLILNLKGGKNFMQR 467
            +K+ID FLK SF +D +K+CK++ REEVGEYVHVFSHIRLKM +E LI++LKGG +F+Q 
Sbjct: 351  KKSIDTFLKTSFNLDTKKTCKVISREEVGEYVHVFSHIRLKMYVEYLIIHLKGGLDFLQS 410

Query: 466  TQESATLTWKFVDGEALSSLGLTSGVRKVYTMVEKFKQNS 347
              +  ++ WK VD + LS +GLTSGV+KV+TMV+KFK+NS
Sbjct: 411  VPDEGSMVWKCVDWKELSRMGLTSGVKKVHTMVQKFKENS 450

