BLASTX nr result
ID: Akebia27_contig00005300
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia27_contig00005300 (1279 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002510396.1| DNA-3-methyladenine glycosylase, putative [R... 400 e-109 ref|XP_002282344.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 397 e-108 ref|XP_007017553.1| DNA glycosylase superfamily protein [Theobro... 395 e-107 ref|XP_002266618.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 394 e-107 ref|XP_006473513.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 390 e-106 ref|XP_004501010.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 387 e-105 ref|XP_007136597.1| hypothetical protein PHAVU_009G058200g [Phas... 387 e-105 ref|XP_003528090.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 386 e-104 gb|ACU22727.1| unknown [Glycine max] 386 e-104 ref|XP_004147864.1| PREDICTED: uncharacterized protein LOC101202... 385 e-104 ref|XP_003523827.1| PREDICTED: probable DNA-3-methyladenine glyc... 383 e-104 gb|EXB91937.1| DNA-3-methyladenine glycosylase 1 [Morus notabilis] 382 e-103 emb|CAN72984.1| hypothetical protein VITISV_009035 [Vitis vinifera] 382 e-103 ref|XP_006473512.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 375 e-101 ref|XP_006435004.1| hypothetical protein CICLE_v10001539mg [Citr... 375 e-101 ref|XP_004291872.1| PREDICTED: putative DNA-3-methyladenine glyc... 368 3e-99 ref|XP_007223275.1| hypothetical protein PRUPE_ppa007252mg [Prun... 365 2e-98 emb|CBI19705.3| unnamed protein product [Vitis vinifera] 361 2e-97 ref|XP_006390331.1| hypothetical protein EUTSA_v10018654mg [Eutr... 358 4e-96 ref|XP_004238277.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 358 4e-96 >ref|XP_002510396.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] gi|223551097|gb|EEF52583.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] Length = 369 Score = 400 bits (1027), Expect = e-109 Identities = 195/256 (76%), Positives = 220/256 (85%), Gaps = 8/256 (3%) Frame = -1 Query: 763 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 584 ++LSCEGE++ AIRHLR +DPLL+ +ID+H PPTFD+FH PFLALT+SILYQQLA+KAGT Sbjct: 118 RSLSCEGEVENAIRHLREADPLLSSLIDLHPPPTFDTFHTPFLALTRSILYQQLAFKAGT 177 Query: 583 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 404 SIYTRF+SLCGGEA VVP TVLALT QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 178 SIYTRFISLCGGEAGVVPDTVLALTPQQLRQIGVSGRKASYLHDLARKYHNGILSDSAIV 237 Query: 403 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 224 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LE+LPRPS Sbjct: 238 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPS 297 Query: 223 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAIANTSSVAVGDGL--------QHQQPQLI 68 QM+QLCEKWRPYRSV SWY+WRFVE KG+P +S+VAV G HQQPQL+ Sbjct: 298 QMDQLCEKWRPYRSVASWYLWRFVEAKGSP----SSAVAVATGAALTQQHQEDHQQPQLL 353 Query: 67 DPMSGITGLGACAWGQ 20 DP++ I LGACAWGQ Sbjct: 354 DPINSILNLGACAWGQ 369 >ref|XP_002282344.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like [Vitis vinifera] Length = 363 Score = 397 bits (1019), Expect = e-108 Identities = 199/263 (75%), Positives = 219/263 (83%), Gaps = 15/263 (5%) Frame = -1 Query: 763 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 584 + LSCEGEI++A+RHLR++DP LA +ID+H PPTFDSFH PFLALTKSILYQQLAYKAGT Sbjct: 103 RALSCEGEIEIALRHLRNADPHLAPLIDLHPPPTFDSFHTPFLALTKSILYQQLAYKAGT 162 Query: 583 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 404 SIYTRFV LCGGEA V+P+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSD+ I+ Sbjct: 163 SIYTRFVGLCGGEAGVLPETVLALTPHQLRQIGVSGRKASYLHDLARKYQNGILSDTGII 222 Query: 403 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 224 MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLPV DLGVRKGVQLL+GLEELPRPS Sbjct: 223 TMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLLYGLEELPRPS 282 Query: 223 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAIANTSSVAVGDGLQHQ------------- 83 QMEQLCEKWRPYRSV SWY+WRFVE KGAP+ A ++VA G LQ Q Sbjct: 283 QMEQLCEKWRPYRSVASWYIWRFVEGKGAPSSA--AAVAGGPSLQQQQQQQEQQQQHQQQ 340 Query: 82 --QPQLIDPMSGITGLGACAWGQ 20 Q Q +DP++GI LGACAWGQ Sbjct: 341 QHQQQFLDPINGILNLGACAWGQ 363 >ref|XP_007017553.1| DNA glycosylase superfamily protein [Theobroma cacao] gi|508722881|gb|EOY14778.1| DNA glycosylase superfamily protein [Theobroma cacao] Length = 397 Score = 395 bits (1015), Expect = e-107 Identities = 195/261 (74%), Positives = 222/261 (85%), Gaps = 13/261 (4%) Frame = -1 Query: 763 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 584 ++LSCEGE++ AIRHLR++DPLLA +ID+H PPTFD+FH PFLALT+SILYQQLA+KAGT Sbjct: 139 RSLSCEGEVETAIRHLRNADPLLASLIDIHPPPTFDTFHTPFLALTRSILYQQLAFKAGT 198 Query: 583 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 404 SIY RF++LCGGE VVP+TVL+LTA QLRQIGVSGRKASYLHDLA KY GILSDS+IV Sbjct: 199 SIYNRFIALCGGENGVVPETVLSLTAQQLRQIGVSGRKASYLHDLARKYQTGILSDSAIV 258 Query: 403 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 224 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LEELPRPS Sbjct: 259 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEELPRPS 318 Query: 223 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAIANTSSVAVGDGL---------QH----Q 83 QM+QLCEKWRPYRSV SWY+WRFVE KGAP+ A ++VA G L QH Q Sbjct: 319 QMDQLCEKWRPYRSVASWYLWRFVEAKGAPSSA--AAVAAGASLPPPQQEEQQQHQQHQQ 376 Query: 82 QPQLIDPMSGITGLGACAWGQ 20 QPQL+DP++ I LGACAWGQ Sbjct: 377 QPQLLDPINSILNLGACAWGQ 397 >ref|XP_002266618.1| PREDICTED: DNA-3-methyladenine glycosylase 1 [Vitis vinifera] gi|297735147|emb|CBI17509.3| unnamed protein product [Vitis vinifera] Length = 329 Score = 394 bits (1012), Expect = e-107 Identities = 198/258 (76%), Positives = 215/258 (83%), Gaps = 12/258 (4%) Frame = -1 Query: 757 LSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGTSI 578 LSCEGE+D+A+RHL SDPLLA +I+ H PPTFDS HPPFLAL KSILYQQLAYKA TSI Sbjct: 74 LSCEGELDVALRHLTKSDPLLAALINTHQPPTFDSCHPPFLALAKSILYQQLAYKAATSI 133 Query: 577 YTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIVDM 398 YTRFV+LCGGEA VVP VLAL+ QLRQIGVSGRKA YLHDLASKY GILSDSSI+ M Sbjct: 134 YTRFVALCGGEAGVVPDAVLALSPSQLRQIGVSGRKAGYLHDLASKYKTGILSDSSIMGM 193 Query: 397 DDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPSQM 218 DDKSLFTMLTMVKGIG WSVHMFMIFSLHRPDVLPVGD+GVRKGVQ L+GLEELPRPSQM Sbjct: 194 DDKSLFTMLTMVKGIGSWSVHMFMIFSLHRPDVLPVGDVGVRKGVQFLYGLEELPRPSQM 253 Query: 217 EQLCEKWRPYRSVGSWYMWRFVEVKGAPAIANTSSVAVGDGL---------QHQQP---Q 74 EQLCEKW+PYRSVGSWYMWRFVE KGAP ++VA+ DG Q QQP Q Sbjct: 254 EQLCEKWKPYRSVGSWYMWRFVEAKGAPPA--RAAVALVDGATSEQQQQQEQQQQPQQLQ 311 Query: 73 LIDPMSGITGLGACAWGQ 20 L+DP++GI LGAC WGQ Sbjct: 312 LVDPINGIVNLGACIWGQ 329 >ref|XP_006473513.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform X2 [Citrus sinensis] Length = 371 Score = 390 bits (1001), Expect = e-106 Identities = 191/254 (75%), Positives = 218/254 (85%), Gaps = 6/254 (2%) Frame = -1 Query: 763 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 584 + LS EGE++ AIRHLR++D LA +ID+H PPTFDSFH PFLALT+SILYQQLA+KAGT Sbjct: 120 RPLSSEGEVEAAIRHLRNADRQLASLIDIHPPPTFDSFHTPFLALTRSILYQQLAFKAGT 179 Query: 583 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 404 SIYTRF++LCGGEA VVP+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 180 SIYTRFIALCGGEAGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIV 239 Query: 403 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 224 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LEELPRPS Sbjct: 240 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYSLEELPRPS 299 Query: 223 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAIANTSSVAVGDGL------QHQQPQLIDP 62 QM+QLCEKWRPYRSV SWY+WRFVE KGAP+ A ++VA G L + QQPQL+D Sbjct: 300 QMDQLCEKWRPYRSVASWYLWRFVEAKGAPSSA--AAVAAGAALPQPQQEEQQQPQLLDQ 357 Query: 61 MSGITGLGACAWGQ 20 ++ + +GACAWGQ Sbjct: 358 INSLINIGACAWGQ 371 >ref|XP_004501010.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform X1 [Cicer arietinum] Length = 384 Score = 387 bits (995), Expect = e-105 Identities = 190/264 (71%), Positives = 221/264 (83%), Gaps = 16/264 (6%) Frame = -1 Query: 763 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 584 ++LSCEGE+++A+R+LR++DPLL+ +ID+H PPTFD+FH PFLALT+SILYQQLA+KAGT Sbjct: 123 RSLSCEGEVEIALRYLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAFKAGT 182 Query: 583 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 404 SIYTRF++LCGGEA VVP+TVLAL QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 183 SIYTRFIALCGGEAGVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIV 242 Query: 403 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 224 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQ+L+ LE+LPRPS Sbjct: 243 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQILYNLEDLPRPS 302 Query: 223 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAIANTSSVAVGDGL---------------Q 89 QM+QLCEKWRPYRSV SWYMWRFVE KG P+ A +VA G GL Q Sbjct: 303 QMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSA--VAVATGAGLQQHQLEQHQQQQQQQQ 360 Query: 88 HQQPQLIDPMSGITGLG-ACAWGQ 20 H Q QL+DPM+ + +G ACAWGQ Sbjct: 361 HSQQQLMDPMNSMFNIGAACAWGQ 384 >ref|XP_007136597.1| hypothetical protein PHAVU_009G058200g [Phaseolus vulgaris] gi|561009684|gb|ESW08591.1| hypothetical protein PHAVU_009G058200g [Phaseolus vulgaris] Length = 366 Score = 387 bits (994), Expect = e-105 Identities = 192/266 (72%), Positives = 220/266 (82%), Gaps = 18/266 (6%) Frame = -1 Query: 763 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 584 ++LSCEGE+++A+R LR++DPLL+ +ID+H PPTFD+FH PFLALT+SILYQQLAYKAGT Sbjct: 103 RSLSCEGEVEIALRFLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAYKAGT 162 Query: 583 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 404 SIYTRF++LCGGE VVP+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 163 SIYTRFIALCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIV 222 Query: 403 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 224 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LE+LPRPS Sbjct: 223 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPS 282 Query: 223 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAIANTSSVAVGDGLQHQ------------- 83 QM+ LCEKWRPYRSV SWYMWRFVE KG P+ A +VA G GLQ Q Sbjct: 283 QMDHLCEKWRPYRSVASWYMWRFVEAKGTPSSA--VAVATGAGLQQQHHHQHQQHEQQQQ 340 Query: 82 ----QPQLIDPMSGITGLG-ACAWGQ 20 QPQL+DP++ + LG ACAWGQ Sbjct: 341 QHPPQPQLLDPINSMFNLGAACAWGQ 366 >ref|XP_003528090.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like [Glycine max] Length = 351 Score = 386 bits (991), Expect = e-104 Identities = 190/266 (71%), Positives = 221/266 (83%), Gaps = 18/266 (6%) Frame = -1 Query: 763 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 584 ++LSC+GE+++++R+LR++DPLL+ +ID+H PPTFD+FH PFLALT+SILYQQLA+KAGT Sbjct: 88 RSLSCDGEVEISLRYLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAFKAGT 147 Query: 583 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 404 SIYTRF+ LCGGE VVP+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 148 SIYTRFIGLCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIV 207 Query: 403 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 224 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LE+LPRPS Sbjct: 208 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPS 267 Query: 223 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAIANTSSVAVGDGLQHQ------------- 83 QM+QLC+KWRPYRSV SWYMWRFVE KG P+ A T VA G GLQ Q Sbjct: 268 QMDQLCDKWRPYRSVASWYMWRFVEAKGTPSSAVT--VATGAGLQQQRHHQHQQQEQQQQ 325 Query: 82 ----QPQLIDPMSGITGLG-ACAWGQ 20 QPQL+DP++ + LG ACAWGQ Sbjct: 326 QHAPQPQLLDPINSMFNLGAACAWGQ 351 >gb|ACU22727.1| unknown [Glycine max] Length = 351 Score = 386 bits (991), Expect = e-104 Identities = 190/266 (71%), Positives = 221/266 (83%), Gaps = 18/266 (6%) Frame = -1 Query: 763 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 584 ++LSC+GE+++++R+LR++DPLL+ +ID+H PPTFD+FH PFLALT+SILYQQLA+KAGT Sbjct: 88 RSLSCDGEVEISLRYLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAFKAGT 147 Query: 583 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 404 SIYTRF+ LCGGE VVP+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 148 SIYTRFIGLCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIV 207 Query: 403 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 224 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LE+LPRPS Sbjct: 208 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPS 267 Query: 223 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAIANTSSVAVGDGLQHQ------------- 83 QM+QLC+KWRPYRSV SWYMWRFVE KG P+ A T VA G GLQ Q Sbjct: 268 QMDQLCDKWRPYRSVASWYMWRFVEAKGTPSSAVT--VATGAGLQQQRHHQHQQQEQQQQ 325 Query: 82 ----QPQLIDPMSGITGLGA-CAWGQ 20 QPQL+DP++ + LGA CAWGQ Sbjct: 326 QHAPQPQLLDPINSMFNLGAVCAWGQ 351 >ref|XP_004147864.1| PREDICTED: uncharacterized protein LOC101202943 [Cucumis sativus] gi|449476816|ref|XP_004154842.1| PREDICTED: uncharacterized LOC101202943 [Cucumis sativus] Length = 382 Score = 385 bits (989), Expect = e-104 Identities = 187/264 (70%), Positives = 215/264 (81%), Gaps = 16/264 (6%) Frame = -1 Query: 763 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 584 ++LSCEGE+++A+RHLR++DPLLA++ID+H PTFDSF PFLALT+SILYQQLAYKAGT Sbjct: 119 RSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGT 178 Query: 583 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 404 SIYTRF++LCGGEA V+P+TVLAL QLRQIG+SGRK+SYLHDLA KY NGILSD +IV Sbjct: 179 SIYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIV 238 Query: 403 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 224 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DL VRKGVQLL+ LEELPRPS Sbjct: 239 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPS 298 Query: 223 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAIANTSSVAVGDGLQH-------------- 86 QM+QLCEKWRPYRSVGSWYMWR E KGA + A + LQH Sbjct: 299 QMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQ 358 Query: 85 --QQPQLIDPMSGITGLGACAWGQ 20 QQPQL+DP++ I LGACAWGQ Sbjct: 359 HPQQPQLLDPLNSILNLGACAWGQ 382 >ref|XP_003523827.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2-like [Glycine max] Length = 374 Score = 383 bits (984), Expect = e-104 Identities = 190/275 (69%), Positives = 222/275 (80%), Gaps = 27/275 (9%) Frame = -1 Query: 763 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 584 ++LSC+GE+++A+R+LR++DP+L+ +ID+H PPTFD+FH PFLALT+SILYQQLAYKAGT Sbjct: 102 RSLSCDGEVEIALRYLRNADPVLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAYKAGT 161 Query: 583 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 404 SIYTRF++LCGGE VVP+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 162 SIYTRFIALCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIV 221 Query: 403 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 224 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LE+LPRPS Sbjct: 222 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPS 281 Query: 223 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAIANTSSVAVGDGLQHQ------------- 83 QM+QLC+KWRPYRSV SWYMWRFVE KG P+ A +VA G GLQ Q Sbjct: 282 QMDQLCDKWRPYRSVASWYMWRFVEAKGTPSSA--VAVATGAGLQQQQHHQHHHQHQQQE 339 Query: 82 -------------QPQLIDPMSGITGLG-ACAWGQ 20 QPQL+DP++ + LG ACAWGQ Sbjct: 340 QQQQQQQQQQHPPQPQLLDPINSMFNLGAACAWGQ 374 >gb|EXB91937.1| DNA-3-methyladenine glycosylase 1 [Morus notabilis] Length = 451 Score = 382 bits (982), Expect = e-103 Identities = 188/250 (75%), Positives = 213/250 (85%), Gaps = 14/250 (5%) Frame = -1 Query: 763 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 584 ++LSCEGE+++A+RHLR +DPLLA +ID+H PPTFD+FH PFLALT+SILYQQLAYKAGT Sbjct: 123 RSLSCEGEVEVALRHLRRADPLLAPLIDIHQPPTFDNFHTPFLALTRSILYQQLAYKAGT 182 Query: 583 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 404 SIYTRF++LCGGE VVP+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 183 SIYTRFIALCGGETGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIV 242 Query: 403 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 224 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LEELPRPS Sbjct: 243 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEELPRPS 302 Query: 223 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAIANTSSVAVGDGLQH-------------- 86 QM+QLCEKWRPYRSV +WYMWRFVE KGAP N ++VAVG LQ Sbjct: 303 QMDQLCEKWRPYRSVAAWYMWRFVEQKGAP--PNAATVAVGANLQQQQQQQQQQGEPHQP 360 Query: 85 QQPQLIDPMS 56 QQPQL+DP++ Sbjct: 361 QQPQLMDPLN 370 >emb|CAN72984.1| hypothetical protein VITISV_009035 [Vitis vinifera] Length = 353 Score = 382 bits (982), Expect = e-103 Identities = 194/254 (76%), Positives = 211/254 (83%), Gaps = 12/254 (4%) Frame = -1 Query: 763 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 584 K LSCEGE+D+A+RHL SDPLLA +I+ H PPTFDS HPPFLAL KSILYQQLAYKA T Sbjct: 102 KPLSCEGELDVALRHLTKSDPLLAALINTHQPPTFDSCHPPFLALAKSILYQQLAYKAAT 161 Query: 583 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 404 SIYTRFV+LCGGEA VVP VLAL+ QLRQIGVSGRKA YLHDLASKY GILSDSSI+ Sbjct: 162 SIYTRFVALCGGEAGVVPDAVLALSPSQLRQIGVSGRKAGYLHDLASKYKTGILSDSSIM 221 Query: 403 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 224 MDDKSLFTMLTMVKGIG WSVHMFMIFSLHRPDVLPVGD+GVRKGVQ L+GLEELPRPS Sbjct: 222 GMDDKSLFTMLTMVKGIGSWSVHMFMIFSLHRPDVLPVGDVGVRKGVQFLYGLEELPRPS 281 Query: 223 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAIANTSSVAVGDGL---------QHQQP-- 77 QMEQLCEKW+PYRSVGSWYMWRFVE KGAP ++VA+ DG Q QQP Sbjct: 282 QMEQLCEKWKPYRSVGSWYMWRFVEAKGAPPA--RAAVALVDGATSEQQQQQEQQQQPQQ 339 Query: 76 -QLIDPMSGITGLG 38 QL+DP++GI LG Sbjct: 340 LQLVDPINGIVNLG 353 >ref|XP_006473512.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform X1 [Citrus sinensis] Length = 373 Score = 375 bits (962), Expect = e-101 Identities = 185/248 (74%), Positives = 212/248 (85%), Gaps = 6/248 (2%) Frame = -1 Query: 763 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 584 + LS EGE++ AIRHLR++D LA +ID+H PPTFDSFH PFLALT+SILYQQLA+KAGT Sbjct: 120 RPLSSEGEVEAAIRHLRNADRQLASLIDIHPPPTFDSFHTPFLALTRSILYQQLAFKAGT 179 Query: 583 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 404 SIYTRF++LCGGEA VVP+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 180 SIYTRFIALCGGEAGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIV 239 Query: 403 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 224 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LEELPRPS Sbjct: 240 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYSLEELPRPS 299 Query: 223 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAIANTSSVAVGDGL------QHQQPQLIDP 62 QM+QLCEKWRPYRSV SWY+WRFVE KGAP+ A ++VA G L + QQPQL+D Sbjct: 300 QMDQLCEKWRPYRSVASWYLWRFVEAKGAPSSA--AAVAAGAALPQPQQEEQQQPQLLDQ 357 Query: 61 MSGITGLG 38 ++ + +G Sbjct: 358 INSLINIG 365 >ref|XP_006435004.1| hypothetical protein CICLE_v10001539mg [Citrus clementina] gi|557537126|gb|ESR48244.1| hypothetical protein CICLE_v10001539mg [Citrus clementina] Length = 373 Score = 375 bits (962), Expect = e-101 Identities = 185/248 (74%), Positives = 212/248 (85%), Gaps = 6/248 (2%) Frame = -1 Query: 763 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 584 + LS EGE++ AIRHLR++D LA +ID+H PPTFDSFH PFLALT+SILYQQLA+KAGT Sbjct: 120 RPLSSEGEVEAAIRHLRNADRQLASLIDIHPPPTFDSFHTPFLALTRSILYQQLAFKAGT 179 Query: 583 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 404 SIYTRF++LCGGEA VVP+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSDS+IV Sbjct: 180 SIYTRFIALCGGEAGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIV 239 Query: 403 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 224 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLGVRKGVQLL+ LEELPRPS Sbjct: 240 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYSLEELPRPS 299 Query: 223 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAIANTSSVAVGDGL------QHQQPQLIDP 62 QM+QLCEKWRPYRSV SWY+WRFVE KGAP+ A ++VA G L + QQPQL+D Sbjct: 300 QMDQLCEKWRPYRSVASWYLWRFVEAKGAPSSA--AAVAAGAALPQPQQEEQQQPQLLDQ 357 Query: 61 MSGITGLG 38 ++ + +G Sbjct: 358 INSLINIG 365 >ref|XP_004291872.1| PREDICTED: putative DNA-3-methyladenine glycosylase YfjP-like isoform 1 [Fragaria vesca subsp. vesca] Length = 385 Score = 368 bits (945), Expect = 3e-99 Identities = 181/261 (69%), Positives = 213/261 (81%), Gaps = 13/261 (4%) Frame = -1 Query: 763 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 584 + L C+GE++ AIRHLR++DPLL +I+ H PP FD+FH PFLALT+SILYQQLAYKAGT Sbjct: 127 RPLRCDGEVESAIRHLRNADPLLIPLIEAHQPPQFDNFHTPFLALTRSILYQQLAYKAGT 186 Query: 583 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 404 SIYTRF+ LCGGE+AV P+TVLA +A QLRQIG+SGRKASYLHDLA KY NGILSD++IV Sbjct: 187 SIYTRFIQLCGGESAVNPETVLAQSATQLRQIGISGRKASYLHDLARKYQNGILSDTAIV 246 Query: 403 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 224 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLPV DLG+RKGVQLL+GLEELPRPS Sbjct: 247 NMDDKSLFTMLTMVSGIGSWSVHMFMIFSLHRPDVLPVNDLGIRKGVQLLYGLEELPRPS 306 Query: 223 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAIANTSSVAVG------------DGLQH-Q 83 M+QLC+KWRPYRSV +WY+WR+VE KGA + A ++VA G QH Q Sbjct: 307 HMDQLCDKWRPYRSVAAWYLWRYVESKGASSTA--AAVAAGAIAPMQQQQEDQQPQQHPQ 364 Query: 82 QPQLIDPMSGITGLGACAWGQ 20 Q QL+D +S + +GAC WGQ Sbjct: 365 QQQLMDSLSNLINIGACTWGQ 385 >ref|XP_007223275.1| hypothetical protein PRUPE_ppa007252mg [Prunus persica] gi|462420211|gb|EMJ24474.1| hypothetical protein PRUPE_ppa007252mg [Prunus persica] Length = 376 Score = 365 bits (938), Expect = 2e-98 Identities = 186/262 (70%), Positives = 211/262 (80%), Gaps = 15/262 (5%) Frame = -1 Query: 763 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 584 + LSCEGE++ AIRHLR++DPLLA +ID+H PTFD+F PFLALT+SILYQQLAYKAG Sbjct: 116 RPLSCEGEVEAAIRHLRNADPLLAPLIDLHQRPTFDTFQTPFLALTRSILYQQLAYKAGN 175 Query: 583 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 404 SIYTRFVSLCGGEA VVP+TVLA T QLRQIGVSGRKASYLHDLA KY NGILSD++IV Sbjct: 176 SIYTRFVSLCGGEACVVPETVLAQTPQQLRQIGVSGRKASYLHDLARKYQNGILSDAAIV 235 Query: 403 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 224 +MDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DL +RKGVQLL+ L+ELPRPS Sbjct: 236 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLSMRKGVQLLYNLDELPRPS 295 Query: 223 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAIANTSSVAVGDGL-----------QH--- 86 QME LCEKWRPYRSV + YMWRF E KGAP+ A ++VA G L QH Sbjct: 296 QMEHLCEKWRPYRSVAACYMWRFSESKGAPSSA--AAVAAGATLPPQQQQEEQQQQHPQH 353 Query: 85 -QQPQLIDPMSGITGLGACAWG 23 QQ QL+D +S + +GAC WG Sbjct: 354 PQQQQLMDSLSSLINIGACTWG 375 >emb|CBI19705.3| unnamed protein product [Vitis vinifera] Length = 351 Score = 361 bits (926), Expect(2) = 2e-97 Identities = 178/218 (81%), Positives = 194/218 (88%) Frame = -1 Query: 748 EGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGTSIYTR 569 +GEI++A+RHLR++DP LA +ID+H PPTFDSFH PFLALTKSILYQQLAYKAGTSIYTR Sbjct: 115 KGEIEIALRHLRNADPHLAPLIDLHPPPTFDSFHTPFLALTKSILYQQLAYKAGTSIYTR 174 Query: 568 FVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIVDMDDK 389 FV LCGGEA V+P+TVLALT QLRQIGVSGRKASYLHDLA KY NGILSD+ I+ MDDK Sbjct: 175 FVGLCGGEAGVLPETVLALTPHQLRQIGVSGRKASYLHDLARKYQNGILSDTGIITMDDK 234 Query: 388 SLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPSQMEQL 209 SLFTMLTMV GIG WSVHMFMIFSLHRPDVLPV DLGVRKGVQLL+GLEELPRPSQMEQL Sbjct: 235 SLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLLYGLEELPRPSQMEQL 294 Query: 208 CEKWRPYRSVGSWYMWRFVEVKGAPAIANTSSVAVGDG 95 CEKWRPYRSV SWY+WRFVE KGAP +S+ AV G Sbjct: 295 CEKWRPYRSVASWYIWRFVEGKGAP----SSAAAVAGG 328 Score = 23.5 bits (49), Expect(2) = 2e-97 Identities = 10/24 (41%), Positives = 14/24 (58%) Frame = -3 Query: 92 ATSAATAYRSNEWNHWTRGLRVGT 21 A + RSN+W+ RGL +GT Sbjct: 324 AVAGGPISRSNQWHSKPRGLCLGT 347 >ref|XP_006390331.1| hypothetical protein EUTSA_v10018654mg [Eutrema salsugineum] gi|557086765|gb|ESQ27617.1| hypothetical protein EUTSA_v10018654mg [Eutrema salsugineum] Length = 403 Score = 358 bits (918), Expect = 4e-96 Identities = 182/266 (68%), Positives = 207/266 (77%), Gaps = 18/266 (6%) Frame = -1 Query: 763 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 584 ++L+CEGE++ AI HLRS DPLL +ID+H PPT++SFH PFLAL +SILYQQLA KAG Sbjct: 139 RSLTCEGELEAAICHLRSVDPLLGSLIDIHPPPTYESFHSPFLALIRSILYQQLAAKAGN 198 Query: 583 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 404 SIYTRFV+LCGGE AVVP+TVL LT QLRQIGVSGRKASYL+DLA KY NGILSDS IV Sbjct: 199 SIYTRFVALCGGENAVVPETVLPLTPQQLRQIGVSGRKASYLNDLARKYQNGILSDSGIV 258 Query: 403 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 224 +MD+KSLFTMLTMV GIG WSVHMFMI SLHRPDVLPV DLGVRKGVQ+L+ L ELPRPS Sbjct: 259 NMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGVRKGVQMLYNLPELPRPS 318 Query: 223 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPAIANTSSVAVGDGL---------------- 92 QMEQLCEKWRPYRSVGSWYMWR +E K P +N +SV G L Sbjct: 319 QMEQLCEKWRPYRSVGSWYMWRLIEAKSTP--SNAASVTAGAALSFPQLEDIQQQQQQQQ 376 Query: 91 -QHQQPQLIDPMSGITGLG-ACAWGQ 20 Q QQ QL+DP++ + +G AWGQ Sbjct: 377 HQQQQSQLLDPLNSVFSIGYTQAWGQ 402 >ref|XP_004238277.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform 1 [Solanum lycopersicum] Length = 332 Score = 358 bits (918), Expect = 4e-96 Identities = 175/256 (68%), Positives = 209/256 (81%), Gaps = 10/256 (3%) Frame = -1 Query: 763 KTLSCEGEIDLAIRHLRSSDPLLARIIDVHLPPTFDSFHPPFLALTKSILYQQLAYKAGT 584 ++LS EGE++ AI +L+SSDPLL+ +I+ + PPT + F PPFLALTKSIL+QQLAYKAG+ Sbjct: 74 RSLSYEGELESAINYLKSSDPLLSPLIETYPPPTLELFQPPFLALTKSILFQQLAYKAGS 133 Query: 583 SIYTRFVSLCGGEAAVVPQTVLALTAPQLRQIGVSGRKASYLHDLASKYCNGILSDSSIV 404 SIYTRF+SLCGGE+ VVP VL LT QLRQIGVS RKASYLHDLA KY NGILSD SIV Sbjct: 134 SIYTRFISLCGGESNVVPDMVLGLTPQQLRQIGVSARKASYLHDLARKYQNGILSDKSIV 193 Query: 403 DMDDKSLFTMLTMVKGIGPWSVHMFMIFSLHRPDVLPVGDLGVRKGVQLLHGLEELPRPS 224 DMDDKSLFTMLTMV GIG WSVHMFMIFSLHRPDVLP+ DLG+RKGV++L+GLE+LPRPS Sbjct: 194 DMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPIHDLGIRKGVRMLYGLEDLPRPS 253 Query: 223 QMEQLCEKWRPYRSVGSWYMWRFVEVKGAPA---IANTSSVAVGDGL-------QHQQPQ 74 QM+QLCEKW+PYRSV SWY+WRFVE KGA + + S+V++ + Q Q Q Sbjct: 254 QMDQLCEKWKPYRSVASWYIWRFVEAKGANSKGNVVGNSNVSLQQQILSMQQQQQQQHQQ 313 Query: 73 LIDPMSGITGLGACAW 26 +DP++GI +GACAW Sbjct: 314 FLDPINGILNVGACAW 329