BLASTX nr result
ID: Akebia25_contig00008675
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia25_contig00008675 (1135 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002510396.1| DNA-3-methyladenine glycosylase, putative [R... 400 e-109 ref|XP_002282344.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 397 e-108 ref|XP_007017553.1| DNA glycosylase superfamily protein [Theobro... 395 e-107 ref|XP_002266618.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 395 e-107 ref|XP_006473513.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 390 e-106 ref|XP_004501010.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 388 e-105 ref|XP_007136597.1| hypothetical protein PHAVU_009G058200g [Phas... 387 e-105 ref|XP_003528090.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 386 e-105 gb|ACU22727.1| unknown [Glycine max] 386 e-105 ref|XP_004147864.1| PREDICTED: uncharacterized protein LOC101202... 385 e-104 ref|XP_003523827.1| PREDICTED: probable DNA-3-methyladenine glyc... 384 e-104 emb|CAN72984.1| hypothetical protein VITISV_009035 [Vitis vinifera] 384 e-104 gb|EXB91937.1| DNA-3-methyladenine glycosylase 1 [Morus notabilis] 382 e-103 ref|XP_006473512.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 375 e-101 ref|XP_006435004.1| hypothetical protein CICLE_v10001539mg [Citr... 375 e-101 ref|XP_004291872.1| PREDICTED: putative DNA-3-methyladenine glyc... 368 2e-99 ref|XP_007223275.1| hypothetical protein PRUPE_ppa007252mg [Prun... 366 1e-98 emb|CBI19705.3| unnamed protein product [Vitis vinifera] 361 2e-97 ref|XP_004238277.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 358 2e-96 ref|XP_006390331.1| hypothetical protein EUTSA_v10018654mg [Eutr... 358 3e-96 >ref|XP_002510396.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] gi|223551097|gb|EEF52583.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] Length = 369 Score = 400 bits (1028), Expect = e-109 Identities = 195/254 (76%), Positives = 220/254 (86%), Gaps = 6/254 (2%) Frame = +1 Query: 163 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 342 ++LSCEGE++ AIRHLR +DPLL+ +ID+H PPTFD+FH PFLALT+SILYQQLA+KAGT Sbjct: 118 RSLSCEGEVENAIRHLREADPLLSSLIDLHPPPTFDTFHTPFLALTRSILYQQLAFKAGT 177 Query: 343 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 522 SIYTRF+SLCGGEA VVP TVLALT QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 178 SIYTRFISLCGGEAGVVPDTVLALTPQQLRQIGVSGRKASYLHDLARKYHNGILSDSAIV 237 Query: 523 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 702 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LE+LPRPS Sbjct: 238 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPS 297 Query: 703 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAPANTSSVAVGDGL------QHQQPQLIDP 864 QM+QLCEKWRPYRSV SWY+WRFVE KG+P+ A +VA G L HQQPQL+DP Sbjct: 298 QMDQLCEKWRPYRSVASWYLWRFVEAKGSPSSA--VAVATGAALTQQHQEDHQQPQLLDP 355 Query: 865 MSGITGLGACAWGQ 906 ++ I LGACAWGQ Sbjct: 356 INSILNLGACAWGQ 369 >ref|XP_002282344.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like [Vitis vinifera] Length = 363 Score = 397 bits (1020), Expect = e-108 Identities = 199/263 (75%), Positives = 219/263 (83%), Gaps = 15/263 (5%) Frame = +1 Query: 163 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 342 + LSCEGEI++A+RHLR++DP LA +ID+H PPTFDSFH PFLALTKSILYQQLAYKAGT Sbjct: 103 RALSCEGEIEIALRHLRNADPHLAPLIDLHPPPTFDSFHTPFLALTKSILYQQLAYKAGT 162 Query: 343 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 522 SIYTRFV LCGGEA V+P+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSD+ I+ Sbjct: 163 SIYTRFVGLCGGEAGVLPETVLALTPHQLRQIGVSGRKASYLHDLARKYQNGILSDTGII 222 Query: 523 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 702 MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLPV DLGVRKGVQLL+GLEELPRPS Sbjct: 223 TMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLLYGLEELPRPS 282 Query: 703 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAPANTSSVAVGDGLQHQ------------- 843 QMEQLCEKWRPYRSV SWY+WRFVE KGAP+ A ++VA G LQ Q Sbjct: 283 QMEQLCEKWRPYRSVASWYIWRFVEGKGAPSSA--AAVAGGPSLQQQQQQQEQQQQHQQQ 340 Query: 844 --QPQLIDPMSGITGLGACAWGQ 906 Q Q +DP++GI LGACAWGQ Sbjct: 341 QHQQQFLDPINGILNLGACAWGQ 363 >ref|XP_007017553.1| DNA glycosylase superfamily protein [Theobroma cacao] gi|508722881|gb|EOY14778.1| DNA glycosylase superfamily protein [Theobroma cacao] Length = 397 Score = 395 bits (1016), Expect = e-107 Identities = 195/261 (74%), Positives = 222/261 (85%), Gaps = 13/261 (4%) Frame = +1 Query: 163 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 342 ++LSCEGE++ AIRHLR++DPLLA +ID+H PPTFD+FH PFLALT+SILYQQLA+KAGT Sbjct: 139 RSLSCEGEVETAIRHLRNADPLLASLIDIHPPPTFDTFHTPFLALTRSILYQQLAFKAGT 198 Query: 343 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 522 SIY RF++LCGGE VVP+TVL+LTA QLRQIGVSGRKASYLHDLA KY GILSDS+IV Sbjct: 199 SIYNRFIALCGGENGVVPETVLSLTAQQLRQIGVSGRKASYLHDLARKYQTGILSDSAIV 258 Query: 523 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 702 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LEELPRPS Sbjct: 259 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEELPRPS 318 Query: 703 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAPANTSSVAVGDGL---------QH----Q 843 QM+QLCEKWRPYRSV SWY+WRFVE KGAP+ A ++VA G L QH Q Sbjct: 319 QMDQLCEKWRPYRSVASWYLWRFVEAKGAPSSA--AAVAAGASLPPPQQEEQQQHQQHQQ 376 Query: 844 QPQLIDPMSGITGLGACAWGQ 906 QPQL+DP++ I LGACAWGQ Sbjct: 377 QPQLLDPINSILNLGACAWGQ 397 >ref|XP_002266618.1| PREDICTED: DNA-3-methyladenine glycosylase 1 [Vitis vinifera] gi|297735147|emb|CBI17509.3| unnamed protein product [Vitis vinifera] Length = 329 Score = 395 bits (1015), Expect = e-107 Identities = 200/258 (77%), Positives = 217/258 (84%), Gaps = 12/258 (4%) Frame = +1 Query: 169 LSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGTSI 348 LSCEGE+D+A+RHL SDPLLA +I+ H PPTFDS HPPFLAL KSILYQQLAYKA TSI Sbjct: 74 LSCEGELDVALRHLTKSDPLLAALINTHQPPTFDSCHPPFLALAKSILYQQLAYKAATSI 133 Query: 349 YTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIVDM 528 YTRFV+LCGGEA VVP VLAL+ QLRQIGVSGRKA YLHDLASKY GILSDSSI+ M Sbjct: 134 YTRFVALCGGEAGVVPDAVLALSPSQLRQIGVSGRKAGYLHDLASKYKTGILSDSSIMGM 193 Query: 529 DDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPSQM 708 DDKSLFTMLTMVKGIG WSVHMFMIFSLHRPDVLPVGD+GVRKGVQ L+GLEELPRPSQM Sbjct: 194 DDKSLFTMLTMVKGIGSWSVHMFMIFSLHRPDVLPVGDVGVRKGVQFLYGLEELPRPSQM 253 Query: 709 EQLCEKWRPYRSVGSWYMWRFVEVKGAPAPANTSSVAVGDGL---------QHQQP---Q 852 EQLCEKW+PYRSVGSWYMWRFVE KGAP PA ++VA+ DG Q QQP Q Sbjct: 254 EQLCEKWKPYRSVGSWYMWRFVEAKGAP-PAR-AAVALVDGATSEQQQQQEQQQQPQQLQ 311 Query: 853 LIDPMSGITGLGACAWGQ 906 L+DP++GI LGAC WGQ Sbjct: 312 LVDPINGIVNLGACIWGQ 329 >ref|XP_006473513.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform X2 [Citrus sinensis] Length = 371 Score = 390 bits (1002), Expect = e-106 Identities = 191/254 (75%), Positives = 218/254 (85%), Gaps = 6/254 (2%) Frame = +1 Query: 163 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 342 + LS EGE++ AIRHLR++D LA +ID+H PPTFDSFH PFLALT+SILYQQLA+KAGT Sbjct: 120 RPLSSEGEVEAAIRHLRNADRQLASLIDIHPPPTFDSFHTPFLALTRSILYQQLAFKAGT 179 Query: 343 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 522 SIYTRF++LCGGEA VVP+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 180 SIYTRFIALCGGEAGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIV 239 Query: 523 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 702 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LEELPRPS Sbjct: 240 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYSLEELPRPS 299 Query: 703 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAPANTSSVAVGDGL------QHQQPQLIDP 864 QM+QLCEKWRPYRSV SWY+WRFVE KGAP+ A ++VA G L + QQPQL+D Sbjct: 300 QMDQLCEKWRPYRSVASWYLWRFVEAKGAPSSA--AAVAAGAALPQPQQEEQQQPQLLDQ 357 Query: 865 MSGITGLGACAWGQ 906 ++ + +GACAWGQ Sbjct: 358 INSLINIGACAWGQ 371 >ref|XP_004501010.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform X1 [Cicer arietinum] Length = 384 Score = 388 bits (996), Expect = e-105 Identities = 190/264 (71%), Positives = 221/264 (83%), Gaps = 16/264 (6%) Frame = +1 Query: 163 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 342 ++LSCEGE+++A+R+LR++DPLL+ +ID+H PPTFD+FH PFLALT+SILYQQLA+KAGT Sbjct: 123 RSLSCEGEVEIALRYLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAFKAGT 182 Query: 343 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 522 SIYTRF++LCGGEA VVP+TVLAL QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 183 SIYTRFIALCGGEAGVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIV 242 Query: 523 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 702 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQ+L+ LE+LPRPS Sbjct: 243 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQILYNLEDLPRPS 302 Query: 703 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAPANTSSVAVGDGL---------------Q 837 QM+QLCEKWRPYRSV SWYMWRFVE KG P+ A +VA G GL Q Sbjct: 303 QMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSA--VAVATGAGLQQHQLEQHQQQQQQQQ 360 Query: 838 HQQPQLIDPMSGITGLG-ACAWGQ 906 H Q QL+DPM+ + +G ACAWGQ Sbjct: 361 HSQQQLMDPMNSMFNIGAACAWGQ 384 >ref|XP_007136597.1| hypothetical protein PHAVU_009G058200g [Phaseolus vulgaris] gi|561009684|gb|ESW08591.1| hypothetical protein PHAVU_009G058200g [Phaseolus vulgaris] Length = 366 Score = 387 bits (995), Expect = e-105 Identities = 192/266 (72%), Positives = 220/266 (82%), Gaps = 18/266 (6%) Frame = +1 Query: 163 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 342 ++LSCEGE+++A+R LR++DPLL+ +ID+H PPTFD+FH PFLALT+SILYQQLAYKAGT Sbjct: 103 RSLSCEGEVEIALRFLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAYKAGT 162 Query: 343 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 522 SIYTRF++LCGGE VVP+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 163 SIYTRFIALCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIV 222 Query: 523 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 702 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LE+LPRPS Sbjct: 223 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPS 282 Query: 703 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAPANTSSVAVGDGLQHQ------------- 843 QM+ LCEKWRPYRSV SWYMWRFVE KG P+ A +VA G GLQ Q Sbjct: 283 QMDHLCEKWRPYRSVASWYMWRFVEAKGTPSSA--VAVATGAGLQQQHHHQHQQHEQQQQ 340 Query: 844 ----QPQLIDPMSGITGLG-ACAWGQ 906 QPQL+DP++ + LG ACAWGQ Sbjct: 341 QHPPQPQLLDPINSMFNLGAACAWGQ 366 >ref|XP_003528090.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like [Glycine max] Length = 351 Score = 386 bits (992), Expect = e-105 Identities = 190/266 (71%), Positives = 221/266 (83%), Gaps = 18/266 (6%) Frame = +1 Query: 163 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 342 ++LSC+GE+++++R+LR++DPLL+ +ID+H PPTFD+FH PFLALT+SILYQQLA+KAGT Sbjct: 88 RSLSCDGEVEISLRYLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAFKAGT 147 Query: 343 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 522 SIYTRF+ LCGGE VVP+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 148 SIYTRFIGLCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIV 207 Query: 523 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 702 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LE+LPRPS Sbjct: 208 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPS 267 Query: 703 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAPANTSSVAVGDGLQHQ------------- 843 QM+QLC+KWRPYRSV SWYMWRFVE KG P+ A T VA G GLQ Q Sbjct: 268 QMDQLCDKWRPYRSVASWYMWRFVEAKGTPSSAVT--VATGAGLQQQRHHQHQQQEQQQQ 325 Query: 844 ----QPQLIDPMSGITGLG-ACAWGQ 906 QPQL+DP++ + LG ACAWGQ Sbjct: 326 QHAPQPQLLDPINSMFNLGAACAWGQ 351 >gb|ACU22727.1| unknown [Glycine max] Length = 351 Score = 386 bits (992), Expect = e-105 Identities = 190/266 (71%), Positives = 221/266 (83%), Gaps = 18/266 (6%) Frame = +1 Query: 163 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 342 ++LSC+GE+++++R+LR++DPLL+ +ID+H PPTFD+FH PFLALT+SILYQQLA+KAGT Sbjct: 88 RSLSCDGEVEISLRYLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAFKAGT 147 Query: 343 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 522 SIYTRF+ LCGGE VVP+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 148 SIYTRFIGLCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIV 207 Query: 523 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 702 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LE+LPRPS Sbjct: 208 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPS 267 Query: 703 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAPANTSSVAVGDGLQHQ------------- 843 QM+QLC+KWRPYRSV SWYMWRFVE KG P+ A T VA G GLQ Q Sbjct: 268 QMDQLCDKWRPYRSVASWYMWRFVEAKGTPSSAVT--VATGAGLQQQRHHQHQQQEQQQQ 325 Query: 844 ----QPQLIDPMSGITGLGA-CAWGQ 906 QPQL+DP++ + LGA CAWGQ Sbjct: 326 QHAPQPQLLDPINSMFNLGAVCAWGQ 351 >ref|XP_004147864.1| PREDICTED: uncharacterized protein LOC101202943 [Cucumis sativus] gi|449476816|ref|XP_004154842.1| PREDICTED: uncharacterized LOC101202943 [Cucumis sativus] Length = 382 Score = 385 bits (990), Expect = e-104 Identities = 187/264 (70%), Positives = 215/264 (81%), Gaps = 16/264 (6%) Frame = +1 Query: 163 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 342 ++LSCEGE+++A+RHLR++DPLLA++ID+H PTFDSF PFLALT+SILYQQLAYKAGT Sbjct: 119 RSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGT 178 Query: 343 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 522 SIYTRF++LCGGEA V+P+TVLAL QLRQIG+SGRK+SYLHDLA KY NGILSD +IV Sbjct: 179 SIYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIV 238 Query: 523 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 702 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DL VRKGVQLL+ LEELPRPS Sbjct: 239 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPS 298 Query: 703 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAPANTSSVAVGDGLQH-------------- 840 QM+QLCEKWRPYRSVGSWYMWR E KGA + A + LQH Sbjct: 299 QMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQ 358 Query: 841 --QQPQLIDPMSGITGLGACAWGQ 906 QQPQL+DP++ I LGACAWGQ Sbjct: 359 HPQQPQLLDPLNSILNLGACAWGQ 382 >ref|XP_003523827.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2-like [Glycine max] Length = 374 Score = 384 bits (985), Expect = e-104 Identities = 190/275 (69%), Positives = 222/275 (80%), Gaps = 27/275 (9%) Frame = +1 Query: 163 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 342 ++LSC+GE+++A+R+LR++DP+L+ +ID+H PPTFD+FH PFLALT+SILYQQLAYKAGT Sbjct: 102 RSLSCDGEVEIALRYLRNADPVLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAYKAGT 161 Query: 343 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 522 SIYTRF++LCGGE VVP+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 162 SIYTRFIALCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIV 221 Query: 523 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 702 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LE+LPRPS Sbjct: 222 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPS 281 Query: 703 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAPANTSSVAVGDGLQHQ------------- 843 QM+QLC+KWRPYRSV SWYMWRFVE KG P+ A +VA G GLQ Q Sbjct: 282 QMDQLCDKWRPYRSVASWYMWRFVEAKGTPSSA--VAVATGAGLQQQQHHQHHHQHQQQE 339 Query: 844 -------------QPQLIDPMSGITGLG-ACAWGQ 906 QPQL+DP++ + LG ACAWGQ Sbjct: 340 QQQQQQQQQQHPPQPQLLDPINSMFNLGAACAWGQ 374 >emb|CAN72984.1| hypothetical protein VITISV_009035 [Vitis vinifera] Length = 353 Score = 384 bits (985), Expect = e-104 Identities = 196/254 (77%), Positives = 213/254 (83%), Gaps = 12/254 (4%) Frame = +1 Query: 163 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 342 K LSCEGE+D+A+RHL SDPLLA +I+ H PPTFDS HPPFLAL KSILYQQLAYKA T Sbjct: 102 KPLSCEGELDVALRHLTKSDPLLAALINTHQPPTFDSCHPPFLALAKSILYQQLAYKAAT 161 Query: 343 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 522 SIYTRFV+LCGGEA VVP VLAL+ QLRQIGVSGRKA YLHDLASKY GILSDSSI+ Sbjct: 162 SIYTRFVALCGGEAGVVPDAVLALSPSQLRQIGVSGRKAGYLHDLASKYKTGILSDSSIM 221 Query: 523 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 702 MDDKSLFTMLTMVKGIG WSVHMFMIFSLHRPDVLPVGD+GVRKGVQ L+GLEELPRPS Sbjct: 222 GMDDKSLFTMLTMVKGIGSWSVHMFMIFSLHRPDVLPVGDVGVRKGVQFLYGLEELPRPS 281 Query: 703 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAPANTSSVAVGDGL---------QHQQP-- 849 QMEQLCEKW+PYRSVGSWYMWRFVE KGAP PA ++VA+ DG Q QQP Sbjct: 282 QMEQLCEKWKPYRSVGSWYMWRFVEAKGAP-PAR-AAVALVDGATSEQQQQQEQQQQPQQ 339 Query: 850 -QLIDPMSGITGLG 888 QL+DP++GI LG Sbjct: 340 LQLVDPINGIVNLG 353 >gb|EXB91937.1| DNA-3-methyladenine glycosylase 1 [Morus notabilis] Length = 451 Score = 382 bits (982), Expect = e-103 Identities = 188/250 (75%), Positives = 213/250 (85%), Gaps = 14/250 (5%) Frame = +1 Query: 163 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 342 ++LSCEGE+++A+RHLR +DPLLA +ID+H PPTFD+FH PFLALT+SILYQQLAYKAGT Sbjct: 123 RSLSCEGEVEVALRHLRRADPLLAPLIDIHQPPTFDNFHTPFLALTRSILYQQLAYKAGT 182 Query: 343 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 522 SIYTRF++LCGGE VVP+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 183 SIYTRFIALCGGETGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIV 242 Query: 523 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 702 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LEELPRPS Sbjct: 243 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEELPRPS 302 Query: 703 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAPANTSSVAVGDGLQH-------------- 840 QM+QLCEKWRPYRSV +WYMWRFVE KG AP N ++VAVG LQ Sbjct: 303 QMDQLCEKWRPYRSVAAWYMWRFVEQKG--APPNAATVAVGANLQQQQQQQQQQGEPHQP 360 Query: 841 QQPQLIDPMS 870 QQPQL+DP++ Sbjct: 361 QQPQLMDPLN 370 >ref|XP_006473512.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform X1 [Citrus sinensis] Length = 373 Score = 375 bits (963), Expect = e-101 Identities = 185/248 (74%), Positives = 212/248 (85%), Gaps = 6/248 (2%) Frame = +1 Query: 163 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 342 + LS EGE++ AIRHLR++D LA +ID+H PPTFDSFH PFLALT+SILYQQLA+KAGT Sbjct: 120 RPLSSEGEVEAAIRHLRNADRQLASLIDIHPPPTFDSFHTPFLALTRSILYQQLAFKAGT 179 Query: 343 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 522 SIYTRF++LCGGEA VVP+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 180 SIYTRFIALCGGEAGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIV 239 Query: 523 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 702 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LEELPRPS Sbjct: 240 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYSLEELPRPS 299 Query: 703 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAPANTSSVAVGDGL------QHQQPQLIDP 864 QM+QLCEKWRPYRSV SWY+WRFVE KGAP+ A ++VA G L + QQPQL+D Sbjct: 300 QMDQLCEKWRPYRSVASWYLWRFVEAKGAPSSA--AAVAAGAALPQPQQEEQQQPQLLDQ 357 Query: 865 MSGITGLG 888 ++ + +G Sbjct: 358 INSLINIG 365 >ref|XP_006435004.1| hypothetical protein CICLE_v10001539mg [Citrus clementina] gi|557537126|gb|ESR48244.1| hypothetical protein CICLE_v10001539mg [Citrus clementina] Length = 373 Score = 375 bits (963), Expect = e-101 Identities = 185/248 (74%), Positives = 212/248 (85%), Gaps = 6/248 (2%) Frame = +1 Query: 163 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 342 + LS EGE++ AIRHLR++D LA +ID+H PPTFDSFH PFLALT+SILYQQLA+KAGT Sbjct: 120 RPLSSEGEVEAAIRHLRNADRQLASLIDIHPPPTFDSFHTPFLALTRSILYQQLAFKAGT 179 Query: 343 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 522 SIYTRF++LCGGEA VVP+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 180 SIYTRFIALCGGEAGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIV 239 Query: 523 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 702 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LEELPRPS Sbjct: 240 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYSLEELPRPS 299 Query: 703 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAPANTSSVAVGDGL------QHQQPQLIDP 864 QM+QLCEKWRPYRSV SWY+WRFVE KGAP+ A ++VA G L + QQPQL+D Sbjct: 300 QMDQLCEKWRPYRSVASWYLWRFVEAKGAPSSA--AAVAAGAALPQPQQEEQQQPQLLDQ 357 Query: 865 MSGITGLG 888 ++ + +G Sbjct: 358 INSLINIG 365 >ref|XP_004291872.1| PREDICTED: putative DNA-3-methyladenine glycosylase YfjP-like isoform 1 [Fragaria vesca subsp. vesca] Length = 385 Score = 368 bits (945), Expect = 2e-99 Identities = 181/261 (69%), Positives = 213/261 (81%), Gaps = 13/261 (4%) Frame = +1 Query: 163 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 342 + L C+GE++ AIRHLR++DPLL +I+ H PP FD+FH PFLALT+SILYQQLAYKAGT Sbjct: 127 RPLRCDGEVESAIRHLRNADPLLIPLIEAHQPPQFDNFHTPFLALTRSILYQQLAYKAGT 186 Query: 343 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 522 SIYTRF+ LCGGE+AV P+TVLA +A QLRQIG+SGRKASYLHDLA KY NGILSD++IV Sbjct: 187 SIYTRFIQLCGGESAVNPETVLAQSATQLRQIGISGRKASYLHDLARKYQNGILSDTAIV 246 Query: 523 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 702 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLPV DLG+RKGVQLL+GLEELPRPS Sbjct: 247 NMDDKSLFTMLTMVSGIGSWSVHMFMIFSLHRPDVLPVNDLGIRKGVQLLYGLEELPRPS 306 Query: 703 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAPANTSSVAVG------------DGLQH-Q 843 M+QLC+KWRPYRSV +WY+WR+VE KGA + A ++VA G QH Q Sbjct: 307 HMDQLCDKWRPYRSVAAWYLWRYVESKGASSTA--AAVAAGAIAPMQQQQEDQQPQQHPQ 364 Query: 844 QPQLIDPMSGITGLGACAWGQ 906 Q QL+D +S + +GAC WGQ Sbjct: 365 QQQLMDSLSNLINIGACTWGQ 385 >ref|XP_007223275.1| hypothetical protein PRUPE_ppa007252mg [Prunus persica] gi|462420211|gb|EMJ24474.1| hypothetical protein PRUPE_ppa007252mg [Prunus persica] Length = 376 Score = 366 bits (939), Expect = 1e-98 Identities = 186/262 (70%), Positives = 211/262 (80%), Gaps = 15/262 (5%) Frame = +1 Query: 163 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 342 + LSCEGE++ AIRHLR++DPLLA +ID+H PTFD+F PFLALT+SILYQQLAYKAG Sbjct: 116 RPLSCEGEVEAAIRHLRNADPLLAPLIDLHQRPTFDTFQTPFLALTRSILYQQLAYKAGN 175 Query: 343 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 522 SIYTRFVSLCGGEA VVP+TVLA T QLRQIGVSGRKASYLHDLA KY NGILSD++IV Sbjct: 176 SIYTRFVSLCGGEACVVPETVLAQTPQQLRQIGVSGRKASYLHDLARKYQNGILSDAAIV 235 Query: 523 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 702 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DL +RKGVQLL+ L+ELPRPS Sbjct: 236 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLSMRKGVQLLYNLDELPRPS 295 Query: 703 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAPANTSSVAVGDGL-----------QH--- 840 QME LCEKWRPYRSV + YMWRF E KGAP+ A ++VA G L QH Sbjct: 296 QMEHLCEKWRPYRSVAACYMWRFSESKGAPSSA--AAVAAGATLPPQQQQEEQQQQHPQH 353 Query: 841 -QQPQLIDPMSGITGLGACAWG 903 QQ QL+D +S + +GAC WG Sbjct: 354 PQQQQLMDSLSSLINIGACTWG 375 >emb|CBI19705.3| unnamed protein product [Vitis vinifera] Length = 351 Score = 361 bits (926), Expect(2) = 2e-97 Identities = 175/208 (84%), Positives = 190/208 (91%) Frame = +1 Query: 178 EGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGTSIYTR 357 +GEI++A+RHLR++DP LA +ID+H PPTFDSFH PFLALTKSILYQQLAYKAGTSIYTR Sbjct: 115 KGEIEIALRHLRNADPHLAPLIDLHPPPTFDSFHTPFLALTKSILYQQLAYKAGTSIYTR 174 Query: 358 FVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIVDMDDK 537 FV LCGGEA V+P+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSD+ I+ MDDK Sbjct: 175 FVGLCGGEAGVLPETVLALTPHQLRQIGVSGRKASYLHDLARKYQNGILSDTGIITMDDK 234 Query: 538 SLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPSQMEQL 717 SLFTMLTMV GIG WSVHMFMIFSLHRPDVLPV DLGVRKGVQLL+GLEELPRPSQMEQL Sbjct: 235 SLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLLYGLEELPRPSQMEQL 294 Query: 718 CEKWRPYRSVGSWYMWRFVEVKGAPAPA 801 CEKWRPYRSV SWY+WRFVE KGAP+ A Sbjct: 295 CEKWRPYRSVASWYIWRFVEGKGAPSSA 322 Score = 23.5 bits (49), Expect(2) = 2e-97 Identities = 10/24 (41%), Positives = 14/24 (58%) Frame = +3 Query: 834 ATSAATAYRSNEWNHWTRGLRVGT 905 A + RSN+W+ RGL +GT Sbjct: 324 AVAGGPISRSNQWHSKPRGLCLGT 347 >ref|XP_004238277.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform 1 [Solanum lycopersicum] Length = 332 Score = 358 bits (920), Expect = 2e-96 Identities = 176/256 (68%), Positives = 209/256 (81%), Gaps = 10/256 (3%) Frame = +1 Query: 163 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 342 ++LS EGE++ AI +L+SSDPLL+ +I+ + PPT + F PPFLALTKSIL+QQLAYKAG+ Sbjct: 74 RSLSYEGELESAINYLKSSDPLLSPLIETYPPPTLELFQPPFLALTKSILFQQLAYKAGS 133 Query: 343 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 522 SIYTRF+SLCGGE+ VVP VL LT QLRQIGVS RKASYLHDLA KY NGILSD SIV Sbjct: 134 SIYTRFISLCGGESNVVPDMVLGLTPQQLRQIGVSARKASYLHDLARKYQNGILSDKSIV 193 Query: 523 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 702 DMDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLG+RKGV++L+GLE+LPRPS Sbjct: 194 DMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPIHDLGIRKGVRMLYGLEDLPRPS 253 Query: 703 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAPAN---TSSVAVGDGL-------QHQQPQ 852 QM+QLCEKW+PYRSV SWY+WRFVE KGA + N S+V++ + Q Q Q Sbjct: 254 QMDQLCEKWKPYRSVASWYIWRFVEAKGANSKGNVVGNSNVSLQQQILSMQQQQQQQHQQ 313 Query: 853 LIDPMSGITGLGACAW 900 +DP++GI +GACAW Sbjct: 314 FLDPINGILNVGACAW 329 >ref|XP_006390331.1| hypothetical protein EUTSA_v10018654mg [Eutrema salsugineum] gi|557086765|gb|ESQ27617.1| hypothetical protein EUTSA_v10018654mg [Eutrema salsugineum] Length = 403 Score = 358 bits (918), Expect = 3e-96 Identities = 182/266 (68%), Positives = 207/266 (77%), Gaps = 18/266 (6%) Frame = +1 Query: 163 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 342 ++L+CEGE++ AI HLRS DPLL +ID+H PPT++SFH PFLAL +SILYQQLA KAG Sbjct: 139 RSLTCEGELEAAICHLRSVDPLLGSLIDIHPPPTYESFHSPFLALIRSILYQQLAAKAGN 198 Query: 343 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 522 SIYTRFV+LCGGE AVVP+TVL LT QLRQIGVSGRKASYL+DLA KY NGILSDS IV Sbjct: 199 SIYTRFVALCGGENAVVPETVLPLTPQQLRQIGVSGRKASYLNDLARKYQNGILSDSGIV 258 Query: 523 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 702 +MD+KSLFTMLTMV GIG WSVHMFMI SLHRPDVLPV DLGVRKGVQ+L+ L ELPRPS Sbjct: 259 NMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGVRKGVQMLYNLPELPRPS 318 Query: 703 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAPANTSSVAVGDGL---------------- 834 QMEQLCEKWRPYRSVGSWYMWR +E K P+N +SV G L Sbjct: 319 QMEQLCEKWRPYRSVGSWYMWRLIEAKS--TPSNAASVTAGAALSFPQLEDIQQQQQQQQ 376 Query: 835 -QHQQPQLIDPMSGITGLG-ACAWGQ 906 Q QQ QL+DP++ + +G AWGQ Sbjct: 377 HQQQQSQLLDPLNSVFSIGYTQAWGQ 402