BLASTX nr result
ID: Akebia25_contig00034822
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00034822 (1632 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002265027.2| PREDICTED: A/G-specific adenine DNA glycosyl...  556   e-156
emb|CBI25679.3| unnamed protein product [Vitis vinifera]             556   e-156
ref|XP_006344357.1| PREDICTED: A/G-specific adenine DNA glycosyl...  553   e-154
emb|CAN71629.1| hypothetical protein VITISV_015579 [Vitis vinifera]  544   e-152
ref|XP_006447890.1| hypothetical protein CICLE_v10015195mg [Citr...  543   e-151
ref|XP_006858703.1| hypothetical protein AMTR_s00066p00103210 [A...  541   e-151
ref|XP_004246789.1| PREDICTED: A/G-specific adenine DNA glycosyl...  540   e-151
ref|XP_002524570.1| A/G-specific adenine glycosylase muty, putat...  533   e-149
gb|EYU46093.1| hypothetical protein MIMGU_mgv1a022080mg, partial...  532   e-148
ref|XP_002321221.2| hypothetical protein POPTR_0014s17120g [Popu...  531   e-148
ref|XP_007049485.1| HhH-GPD base excision DNA repair family prot...  529   e-147
ref|XP_004293166.1| PREDICTED: A/G-specific adenine DNA glycosyl...  519   e-144
ref|XP_007206303.1| hypothetical protein PRUPE_ppa020735mg, part...  517   e-144
ref|XP_004510725.1| PREDICTED: A/G-specific adenine DNA glycosyl...  514   e-143
ref|XP_006583255.1| PREDICTED: A/G-specific adenine DNA glycosyl...  513   e-143
ref|XP_003528811.1| PREDICTED: A/G-specific adenine DNA glycosyl...  513   e-143
ref|XP_006282600.1| hypothetical protein CARUB_v10004796mg [Caps...  513   e-142
ref|XP_004510727.1| PREDICTED: A/G-specific adenine DNA glycosyl...  512   e-142
ref|XP_007135205.1| hypothetical protein PHAVU_010G109900g [Phas...
510 e-142 ref|XP_004510726.1| PREDICTED: A/G-specific adenine DNA glycosyl... 509 e-141 >ref|XP_002265027.2| PREDICTED: A/G-specific adenine DNA glycosylase [Vitis vinifera] Length = 464 Score = 556 bits (1434), Expect = e-156 Identities = 281/435 (64%), Positives = 340/435 (78%), Gaps = 5/435 (1%) Frame = -1 Query: 1632 KKKKRMREPQQQKTSVVGGLEDIEDFSKPDTLKIRASLLQWYDQNQRVLPWRSSSKTHHY 1453 + K R+ +Q+ TS + + DIEDF + +TLKIRASLL WYD N+R LPWR+ + T + Sbjct: 12 RDNKEKRKRKQRTTSEIEVM-DIEDFGRDETLKIRASLLGWYDLNKRNLPWRTPTTTTTH 70 Query: 1452 SQQD-----KEQNLRAYAVWVSEIMLQQTRVSTVIDYYNRWMQKWPTIHHLSQASQEEVN 1288 +D ++ + RAYAVWVSE+MLQQTRV TVIDYYNRWMQKWPT+HHLS AS EEVN Sbjct: 71 EDEDDADAHEDLDNRAYAVWVSEVMLQQTRVETVIDYYNRWMQKWPTLHHLSLASLEEVN 130 Query: 1287 EMWAGLGYYRRARFLLEGAKMVVEGGGEFPTTVSALRKVRGIGDYTAGAIASIAFKEAVP 1108 EMWAGLGYYRRAR LLEGAKM+ EG FP T SALR+V GIG+YTAGAIASIAFKEAVP Sbjct: 131 EMWAGLGYYRRARCLLEGAKMISEGKCGFPRTTSALREVPGIGNYTAGAIASIAFKEAVP 190 Query: 1107 VVDGNVIRVIARLKAISANPKERETIKDIWKLAGQLVDPCRPGDFNQALMELGSTLCTPT 928 VVDGNV+RVIARLKAIS+NPK TIK+IW+LAGQLVDPC+PGDFNQALMELG+T+CTP Sbjct: 191 VVDGNVVRVIARLKAISSNPKHSATIKNIWRLAGQLVDPCKPGDFNQALMELGATICTPL 250 Query: 927 NPDCSVCPVSGQCRALSLSRKCQSVQVTNYPIKIVKAKQRHDFCAICVVEILEEGNLGIE 748 P CS CPVS QC LS+S +S+ VT+YP+K+VKAK+RHDF A+ VV+ILEE + I Sbjct: 251 KPICSACPVSDQCSVLSMSESHRSILVTDYPVKVVKAKKRHDFSAVSVVKILEEQD--IS 308 Query: 747 DGSHKSNGFLLVKRPNEGLLAGLWEFPSVLLDGEANLAIRREAIDEYLKKLFEIDTKEKS 568 GS ++ FLLVKRPNEGLLAGLWEFPSVLLDGEA+ A RR+ ID +LK F++DTK+ Sbjct: 309 KGSQYNSRFLLVKRPNEGLLAGLWEFPSVLLDGEADGATRRKRIDRFLKS-FKLDTKKNC 367 Query: 567 CIILREDVGECVHIFTHIRLKMYVEXXXXXXXXXXXXXXXKEDRDTLEWKCVDNKSIQSM 388 I+ REDVGECVH+FTHI L MYVE ED++T+ W+ +D++++ SM Sbjct: 368 RIVSREDVGECVHVFTHIHLTMYVELLVLHLKGGMKISYENEDKETMTWRWIDSEALSSM 427 Query: 387 GLTSGVRKAFNMIQQ 343 GLTSGVRK +NMIQ+ Sbjct: 428 GLTSGVRKVYNMIQK 442 >emb|CBI25679.3| unnamed protein product [Vitis vinifera] Length = 506 Score = 556 bits (1434), Expect = e-156 Identities = 281/435 (64%), Positives = 
340/435 (78%), Gaps = 5/435 (1%) Frame = -1 Query: 1632 KKKKRMREPQQQKTSVVGGLEDIEDFSKPDTLKIRASLLQWYDQNQRVLPWRSSSKTHHY 1453 + K R+ +Q+ TS + + DIEDF + +TLKIRASLL WYD N+R LPWR+ + T + Sbjct: 54 RDNKEKRKRKQRTTSEIEVM-DIEDFGRDETLKIRASLLGWYDLNKRNLPWRTPTTTTTH 112 Query: 1452 SQQD-----KEQNLRAYAVWVSEIMLQQTRVSTVIDYYNRWMQKWPTIHHLSQASQEEVN 1288 +D ++ + RAYAVWVSE+MLQQTRV TVIDYYNRWMQKWPT+HHLS AS EEVN Sbjct: 113 EDEDDADAHEDLDNRAYAVWVSEVMLQQTRVETVIDYYNRWMQKWPTLHHLSLASLEEVN 172 Query: 1287 EMWAGLGYYRRARFLLEGAKMVVEGGGEFPTTVSALRKVRGIGDYTAGAIASIAFKEAVP 1108 EMWAGLGYYRRAR LLEGAKM+ EG FP T SALR+V GIG+YTAGAIASIAFKEAVP Sbjct: 173 EMWAGLGYYRRARCLLEGAKMISEGKCGFPRTTSALREVPGIGNYTAGAIASIAFKEAVP 232 Query: 1107 VVDGNVIRVIARLKAISANPKERETIKDIWKLAGQLVDPCRPGDFNQALMELGSTLCTPT 928 VVDGNV+RVIARLKAIS+NPK TIK+IW+LAGQLVDPC+PGDFNQALMELG+T+CTP Sbjct: 233 VVDGNVVRVIARLKAISSNPKHSATIKNIWRLAGQLVDPCKPGDFNQALMELGATICTPL 292 Query: 927 NPDCSVCPVSGQCRALSLSRKCQSVQVTNYPIKIVKAKQRHDFCAICVVEILEEGNLGIE 748 P CS CPVS QC LS+S +S+ VT+YP+K+VKAK+RHDF A+ VV+ILEE + I Sbjct: 293 KPICSACPVSDQCSVLSMSESHRSILVTDYPVKVVKAKKRHDFSAVSVVKILEEQD--IS 350 Query: 747 DGSHKSNGFLLVKRPNEGLLAGLWEFPSVLLDGEANLAIRREAIDEYLKKLFEIDTKEKS 568 GS ++ FLLVKRPNEGLLAGLWEFPSVLLDGEA+ A RR+ ID +LK F++DTK+ Sbjct: 351 KGSQYNSRFLLVKRPNEGLLAGLWEFPSVLLDGEADGATRRKRIDRFLKS-FKLDTKKNC 409 Query: 567 CIILREDVGECVHIFTHIRLKMYVEXXXXXXXXXXXXXXXKEDRDTLEWKCVDNKSIQSM 388 I+ REDVGECVH+FTHI L MYVE ED++T+ W+ +D++++ SM Sbjct: 410 RIVSREDVGECVHVFTHIHLTMYVELLVLHLKGGMKISYENEDKETMTWRWIDSEALSSM 469 Query: 387 GLTSGVRKAFNMIQQ 343 GLTSGVRK +NMIQ+ Sbjct: 470 GLTSGVRKVYNMIQK 484 >ref|XP_006344357.1| PREDICTED: A/G-specific adenine DNA glycosylase-like isoform X1 [Solanum tuberosum] Length = 456 Score = 553 bits (1424), Expect = e-154 Identities = 276/435 (63%), Positives = 340/435 (78%), Gaps = 2/435 (0%) Frame = -1 Query: 1632 KKKKRMREPQQQKTSVVGGLEDIED--FSKPDTLKIRASLLQWYDQNQRVLPWRSSSKTH 1459 K KKR R ++ V +DIED FSK +TL+IRASLL+WYD+NQR LPWR S Sbjct: 14 
KSKKRGRRNREIPRKEVPLSDDIEDISFSKDETLQIRASLLEWYDENQRDLPWRRISSG- 72 Query: 1458 HYSQQDKEQNLRAYAVWVSEIMLQQTRVSTVIDYYNRWMQKWPTIHHLSQASQEEVNEMW 1279 + ++DK R YAVWVSE+MLQQTRVSTVIDY+ RWM KWPT+HHL+QAS EEVNEMW Sbjct: 73 -FDERDK----RGYAVWVSEVMLQQTRVSTVIDYFKRWMNKWPTLHHLAQASLEEVNEMW 127 Query: 1278 AGLGYYRRARFLLEGAKMVVEGGGEFPTTVSALRKVRGIGDYTAGAIASIAFKEAVPVVD 1099 AGLGYYRR RFLL+GAK VVE GG FP TVS LRK++GIG+YT+GAIASIAF +AVPVVD Sbjct: 128 AGLGYYRRVRFLLQGAKEVVEEGGSFPETVSELRKIKGIGEYTSGAIASIAFNKAVPVVD 187 Query: 1098 GNVIRVIARLKAISANPKERETIKDIWKLAGQLVDPCRPGDFNQALMELGSTLCTPTNPD 919 GNV+RVI+RLKAISANPK+ T+K WKLAGQLVDPCRPGDFNQALMELG+TLC+ +NP Sbjct: 188 GNVVRVISRLKAISANPKDAATVKSFWKLAGQLVDPCRPGDFNQALMELGATLCSLSNPG 247 Query: 918 CSVCPVSGQCRALSLSRKCQSVQVTNYPIKIVKAKQRHDFCAICVVEILEEGNLGIEDGS 739 C+ CP+S QC ALSLSR+ +SV V++YP K+VKAKQRH+F A+ VVEIL+ + G Sbjct: 248 CAACPISAQCHALSLSRQSESVHVSDYPTKVVKAKQRHEFSAVSVVEILDCQEM---TGP 304 Query: 738 HKSNGFLLVKRPNEGLLAGLWEFPSVLLDGEANLAIRREAIDEYLKKLFEIDTKEKSCII 559 S+ ++LVKRP+EGLLAGLWEFPS+LL+ EA+LA RR+AID +L+ F +D KE + I+ Sbjct: 305 QSSSKYILVKRPDEGLLAGLWEFPSILLEKEADLASRRKAIDNFLQSSFYLDLKESTRIV 364 Query: 558 LREDVGECVHIFTHIRLKMYVEXXXXXXXXXXXXXXXKEDRDTLEWKCVDNKSIQSMGLT 379 RED+GECVH+F+HIRLKMYVE K D++++ WK VD K++ SMGL+ Sbjct: 365 SREDIGECVHVFSHIRLKMYVELLVLHPKGNRSIDYKKLDKESITWKYVDGKNLGSMGLS 424 Query: 378 SGVRKAFNMIQQHKQ 334 SGVRK + M+Q+HKQ Sbjct: 425 SGVRKVYTMVQKHKQ 439 >emb|CAN71629.1| hypothetical protein VITISV_015579 [Vitis vinifera] Length = 1031 Score = 544 bits (1401), Expect = e-152 Identities = 281/457 (61%), Positives = 340/457 (74%), Gaps = 27/457 (5%) Frame = -1 Query: 1632 KKKKRMREPQQQKTSVVGGLEDIEDFSKPDTLKIRASLLQWYDQNQRVLPWRSSSKTHHY 1453 + K R+ +Q+ TS + + DIEDF + +TLKIRASLL WYD N+R LPWR+ + T + Sbjct: 557 RDNKEKRKRKQRTTSEIEVM-DIEDFGRDETLKIRASLLGWYDLNKRNLPWRTPTTTTTH 615 Query: 1452 SQQD-----KEQNLRAYAVWVSEIMLQQTRVSTVIDYYNRWMQKWPTIHHLSQASQEEVN 1288 +D ++ + RAYAVWVSE+MLQQTRV TVIDYYNRWMQKWPT+HHLS AS EEVN Sbjct: 616 
EDEDDADAHEDLDNRAYAVWVSEVMLQQTRVETVIDYYNRWMQKWPTLHHLSLASLEEVN 675 Query: 1287 EMWAGLGYYRRARFLLEGAKMVVEGGGEFPTTVSALRKVRGIGDYTAGAIASIAFKEAVP 1108 EMWAGLGYYRRAR LLEGAKM+ EG FP T SALR+V GIG+YTAGAIASIAFKEAVP Sbjct: 676 EMWAGLGYYRRARCLLEGAKMISEGKCGFPRTTSALREVPGIGNYTAGAIASIAFKEAVP 735 Query: 1107 VVDGNVIRVIARLKAISANPKERETIKDIWKLAGQLVDPCRPGDFNQALMELGSTLCTPT 928 VVDGNV+RVIARLKAIS+NPK TIK+IW+LAGQLVDPC+PGDFNQALMELG+T+CTP Sbjct: 736 VVDGNVVRVIARLKAISSNPKHSATIKNIWRLAGQLVDPCKPGDFNQALMELGATICTPL 795 Query: 927 NPDCSVCPVSGQCRALSLSRKCQSVQVTNYPIKIVKAKQRHDFCAICVVEILEEGNLGIE 748 P CS CPVS QC LS+S +S+ VT+YP+K+VKAK+RHDF A+ VV+ILEE + I Sbjct: 796 KPICSACPVSDQCSVLSMSESHRSILVTDYPVKVVKAKKRHDFSAVSVVKILEEQD--IS 853 Query: 747 DGSHKSNGFLLVKRPNEGLLAGLWEFPSVLLDGEANLAIRREAIDEYLKKLFEIDTKEKS 568 GS ++ FLLVKRPNEGLLAGLWEFPSVLLDGEA+ A RR+ ID +LK F++DTK+ Sbjct: 854 KGSQYNSRFLLVKRPNEGLLAGLWEFPSVLLDGEADGATRRKRIDRFLKS-FKLDTKKNC 912 Query: 567 CIILREDVGECVHIFTHIRLKMYVE----------------------XXXXXXXXXXXXX 454 I+ REDVGECVH+FTHI L MYVE Sbjct: 913 RIVSREDVGECVHVFTHIHLTMYVELLVLHLKGLGLLINSHSQICMSVYVSLYPGGMKIS 972 Query: 453 XXKEDRDTLEWKCVDNKSIQSMGLTSGVRKAFNMIQQ 343 ED++T+ W+ +D++++ SMGLTSGVRK +NMIQ+ Sbjct: 973 YENEDKETMTWRWIDSEALSSMGLTSGVRKVYNMIQK 1009 >ref|XP_006447890.1| hypothetical protein CICLE_v10015195mg [Citrus clementina] gi|568830187|ref|XP_006469387.1| PREDICTED: A/G-specific adenine DNA glycosylase-like isoform X1 [Citrus sinensis] gi|557550501|gb|ESR61130.1| hypothetical protein CICLE_v10015195mg [Citrus clementina] Length = 456 Score = 543 bits (1398), Expect = e-151 Identities = 275/439 (62%), Positives = 334/439 (76%), Gaps = 1/439 (0%) Frame = -1 Query: 1629 KKKRMREPQQQKTSVVGGLEDIED-FSKPDTLKIRASLLQWYDQNQRVLPWRSSSKTHHY 1453 KKK+ R+ ++KT++ EDIED FS+ + KIR SLLQWYD+NQR LPWR S+ Sbjct: 8 KKKKERQLPEKKTALPLEEEDIEDLFSEKEVKKIRQSLLQWYDKNQRELPWRERSE---- 63 Query: 1452 SQQDKEQNLRAYAVWVSEIMLQQTRVSTVIDYYNRWMQKWPTIHHLSQASQEEVNEMWAG 1273 S +++E+ RAY VWVSE+MLQQTRV TVIDYYNRWM KWPTIHHL++AS EEVNEMWAG Sbjct: 64 
SDKEEEKEKRAYGVWVSEVMLQQTRVQTVIDYYNRWMTKWPTIHHLAKASLEEVNEMWAG 123 Query: 1272 LGYYRRARFLLEGAKMVVEGGGEFPTTVSALRKVRGIGDYTAGAIASIAFKEAVPVVDGN 1093 LGYYRRARFLLEGAKM+V G FP TVS LRKV GIG+YTAGAIASIAFKE VPVVDGN Sbjct: 124 LGYYRRARFLLEGAKMIVAEGDGFPNTVSDLRKVPGIGNYTAGAIASIAFKEVVPVVDGN 183 Query: 1092 VIRVIARLKAISANPKERETIKDIWKLAGQLVDPCRPGDFNQALMELGSTLCTPTNPDCS 913 VIRV+ARLKAISANPK+ T+K+ WKLA QLVD CRPGDFNQ+LMELG+ +CTP NP+C+ Sbjct: 184 VIRVLARLKAISANPKDTSTVKNFWKLATQLVDSCRPGDFNQSLMELGAVICTPLNPNCT 243 Query: 912 VCPVSGQCRALSLSRKCQSVQVTNYPIKIVKAKQRHDFCAICVVEILEEGNLGIEDGSHK 733 CPVS +C+A S+S+ SV VT+YP+K++KA+QRHD A CVVEIL G + + Sbjct: 244 SCPVSDKCQAYSMSKCDNSVLVTSYPMKVLKARQRHDVSAACVVEIL--GGNDESERTQP 301 Query: 732 SNGFLLVKRPNEGLLAGLWEFPSVLLDGEANLAIRREAIDEYLKKLFEIDTKEKSCIILR 553 F+LVKR +EGLLAGLWEFPS++LDGE ++ RREA + +LKK F +D + IILR Sbjct: 302 DGVFILVKRRDEGLLAGLWEFPSIILDGETDITTRREAAECFLKKSFNLDPRNNCSIILR 361 Query: 552 EDVGECVHIFTHIRLKMYVEXXXXXXXXXXXXXXXKEDRDTLEWKCVDNKSIQSMGLTSG 373 EDVGE VHIF+HIRLK++VE K+D+ TL WKCVD ++ SMGLTSG Sbjct: 362 EDVGEFVHIFSHIRLKVHVELLVLRIKGGIDKWVEKQDKGTLSWKCVDGGTLASMGLTSG 421 Query: 372 VRKAFNMIQQHKQNGLLIN 316 VRK + M+Q+ KQ L N Sbjct: 422 VRKVYTMVQKFKQKRLTTN 440 >ref|XP_006858703.1| hypothetical protein AMTR_s00066p00103210 [Amborella trichopoda] gi|548862814|gb|ERN20170.1| hypothetical protein AMTR_s00066p00103210 [Amborella trichopoda] Length = 523 Score = 541 bits (1393), Expect = e-151 Identities = 278/428 (64%), Positives = 333/428 (77%), Gaps = 5/428 (1%) Frame = -1 Query: 1581 GGLEDIEDFSKPDTLKIRASLLQWYDQNQRVLPWRSSSKTHHYSQQDKEQNLRAYAVWVS 1402 G L DIEDFS +TLKIRASLL WYD+NQR+LPWR++S ++D E RAYAVWVS Sbjct: 91 GSLRDIEDFSLEETLKIRASLLGWYDKNQRILPWRANSVRESEEREDAEA--RAYAVWVS 148 Query: 1401 EIMLQQTRVSTVIDYYNRWMQKWPTIHHLSQASQEEVNEMWAGLGYYRRARFLLEGAKMV 1222 E+MLQQTRV+TVI YY RWM+KWP+IHHL+QASQEEVNEMWAGLGYYRRAR+LLEGAK V Sbjct: 149 EVMLQQTRVATVIRYYGRWMEKWPSIHHLAQASQEEVNEMWAGLGYYRRARYLLEGAKSV 208 Query: 1221 VEGGGEFPTTVSALRKVRGIGDYTAGAIASIAFKEAVPVVDGNVIRVIARLKAISANPKE 1042 
V+GG +FP TV LRKV+G+GDYTAGAIASIAFK+AVPVVDGNVIRVIARLKAIS+NPKE Sbjct: 209 VQGG-QFPRTVPDLRKVQGVGDYTAGAIASIAFKQAVPVVDGNVIRVIARLKAISSNPKE 267 Query: 1041 RETIKDIWKLAGQLVDPCRPGDFNQALMELGSTLCTPTNPDCSVCPVSGQCRALSLSRKC 862 T+K WKLAGQLVDP RPGDFNQALMELGSTLCTP++P CS CPVS +C+ALSLS+ Sbjct: 268 STTVKGFWKLAGQLVDPERPGDFNQALMELGSTLCTPSSPSCSSCPVSKRCQALSLSKTP 327 Query: 861 QS---VQVTNYPIKIVKAKQRHDFCAICVVEILEEGNLGIEDGSHKSNGFLLVKRPNEGL 691 S + VT++P+K+ K KQR DF A+C+VEI E+ +L + + FL++KRP+EGL Sbjct: 328 NSGKEILVTDFPVKVSKVKQREDFAAVCLVEITEKLDLESWKLESEKDIFLMIKRPDEGL 387 Query: 690 LAGLWEFPSVLLDGEANLAI--RREAIDEYLKKLFEIDTKEKSCIILREDVGECVHIFTH 517 LAGLWEFPSVLLD E N+ + RR A+++YLK F ++T S +I R DVGE VHIFTH Sbjct: 388 LAGLWEFPSVLLD-ETNMGLCTRRSAMNKYLKGTFGLETNRSSRVIFRGDVGEYVHIFTH 446 Query: 516 IRLKMYVEXXXXXXXXXXXXXXXKEDRDTLEWKCVDNKSIQSMGLTSGVRKAFNMIQQHK 337 IRLKM+VE K D + W+CVD SI+++GLTSGVRK +NMIQ K Sbjct: 447 IRLKMHVELLVLNLKGGIDTSNVKNDSQGICWRCVDENSIKNIGLTSGVRKVYNMIQDFK 506 Query: 336 QNGLLINP 313 + GLL NP Sbjct: 507 KKGLLQNP 514 >ref|XP_004246789.1| PREDICTED: A/G-specific adenine DNA glycosylase-like [Solanum lycopersicum] Length = 432 Score = 540 bits (1390), Expect = e-151 Identities = 270/425 (63%), Positives = 329/425 (77%) Frame = -1 Query: 1632 KKKKRMREPQQQKTSVVGGLEDIEDFSKPDTLKIRASLLQWYDQNQRVLPWRSSSKTHHY 1453 K KKR R ++ +EDI FSK +TL+IRASLL+WYD+NQR LPWR Sbjct: 12 KSKKRARRSREIPPKESDDIEDIS-FSKDETLQIRASLLEWYDENQRDLPWR------RI 64 Query: 1452 SQQDKEQNLRAYAVWVSEIMLQQTRVSTVIDYYNRWMQKWPTIHHLSQASQEEVNEMWAG 1273 S E++ R YAVWVSE+MLQQTRVSTVIDY+ RWM KWPT+HHL+QAS EEVNEMWAG Sbjct: 65 SGGSDERDKRGYAVWVSEVMLQQTRVSTVIDYFKRWMNKWPTLHHLAQASLEEVNEMWAG 124 Query: 1272 LGYYRRARFLLEGAKMVVEGGGEFPTTVSALRKVRGIGDYTAGAIASIAFKEAVPVVDGN 1093 LGYYRR RFLL+GAK VVE GG FP TVS LRK++GIG+YTAGAIASIAFK+ VPVVDGN Sbjct: 125 LGYYRRVRFLLQGAKEVVEEGGSFPETVSELRKIKGIGEYTAGAIASIAFKKVVPVVDGN 184 Query: 1092 VIRVIARLKAISANPKERETIKDIWKLAGQLVDPCRPGDFNQALMELGSTLCTPTNPDCS 913 V+RVI+RLKAISANPK+ T+K WKLAGQLVDPCRPGDFNQALMELG+TLC+ +NP C+ Sbjct: 185 
VVRVISRLKAISANPKDTATVKSFWKLAGQLVDPCRPGDFNQALMELGATLCSLSNPGCA 244 Query: 912 VCPVSGQCRALSLSRKCQSVQVTNYPIKIVKAKQRHDFCAICVVEILEEGNLGIEDGSHK 733 VCP+S QC ALSLSR+ +SV V++YP K+VKAKQRH+F A+ VVEIL+ + GS Sbjct: 245 VCPISAQCHALSLSRQNESVHVSDYPTKVVKAKQRHEFSAVSVVEILDCQEM---TGSQS 301 Query: 732 SNGFLLVKRPNEGLLAGLWEFPSVLLDGEANLAIRREAIDEYLKKLFEIDTKEKSCIILR 553 ++ ++LVKRPNEGLLAGLWEFPS+LL+ EA+LA RR+AID +L+ +D KE + I+ R Sbjct: 302 NSKYILVKRPNEGLLAGLWEFPSILLEKEADLASRRKAIDNFLQSSLNLDLKESTRIVSR 361 Query: 552 EDVGECVHIFTHIRLKMYVEXXXXXXXXXXXXXXXKEDRDTLEWKCVDNKSIQSMGLTSG 373 ED+GE VH+F+HIRLKMYVE K D++++ WK VD K++ SMGLTSG Sbjct: 362 EDIGEFVHVFSHIRLKMYVELLVLHPKGNRSIEDEKLDKESITWKYVDGKNLDSMGLTSG 421 Query: 372 VRKAF 358 VRK + Sbjct: 422 VRKVW 426 >ref|XP_002524570.1| A/G-specific adenine glycosylase muty, putative [Ricinus communis] gi|223536123|gb|EEF37778.1| A/G-specific adenine glycosylase muty, putative [Ricinus communis] Length = 775 Score = 533 bits (1373), Expect = e-149 Identities = 261/423 (61%), Positives = 323/423 (76%) Frame = -1 Query: 1632 KKKKRMREPQQQKTSVVGGLEDIEDFSKPDTLKIRASLLQWYDQNQRVLPWRSSSKTHHY 1453 K KKR + ++ +V +EDI F +T KIR SLL+WYDQNQR LPWR T+ Sbjct: 8 KNKKRNVQLISKEQEIVVDIEDI--FIDKETQKIRESLLEWYDQNQRQLPWRRQKTTNPS 65 Query: 1452 SQQDKEQNLRAYAVWVSEIMLQQTRVSTVIDYYNRWMQKWPTIHHLSQASQEEVNEMWAG 1273 + ++E+ RAY +WVSE+MLQQTRV TVIDYYNRWM KWPTIHHL+QAS EEVNE+WAG Sbjct: 66 QESEEEKEKRAYGIWVSEVMLQQTRVQTVIDYYNRWMLKWPTIHHLAQASLEEVNEIWAG 125 Query: 1272 LGYYRRARFLLEGAKMVVEGGGEFPTTVSALRKVRGIGDYTAGAIASIAFKEAVPVVDGN 1093 LGYYRRARFLLEGAKM+V GGG FP TVS+LRKV GIGDYTAGAIASIAFKE VPVVDGN Sbjct: 126 LGYYRRARFLLEGAKMIVAGGG-FPNTVSSLRKVPGIGDYTAGAIASIAFKEVVPVVDGN 184 Query: 1092 VIRVIARLKAISANPKERETIKDIWKLAGQLVDPCRPGDFNQALMELGSTLCTPTNPDCS 913 V+RV+ RL+AISANPK+ T+K +WKLA QLVDPCRPGDFNQ+LMELG+T+C P+NP CS Sbjct: 185 VVRVLTRLRAISANPKDSMTVKKLWKLAAQLVDPCRPGDFNQSLMELGATVCAPSNPSCS 244 Query: 912 VCPVSGQCRALSLSRKCQSVQVTNYPIKIVKAKQRHDFCAICVVEILEEGNLGIEDGSHK 733 CPVS QCR LS+S + +S+ VT+YP K+VK K +H+F A+CVVEIL G+ G D 
Sbjct: 245 SCPVSSQCRVLSISNQDKSILVTDYPTKVVKVKPKHEFSAVCVVEIL--GSCGPVDNQKT 302 Query: 732 SNGFLLVKRPNEGLLAGLWEFPSVLLDGEANLAIRREAIDEYLKKLFEIDTKEKSCIILR 553 + FLLVKRP++GLLAGLWEFP+ LD EA+L RR ID ++KK F +D ++ ++LR Sbjct: 303 DSKFLLVKRPDDGLLAGLWEFPTCRLDKEADLITRRNEIDHFMKKSFRLDPEKTYSMVLR 362 Query: 552 EDVGECVHIFTHIRLKMYVEXXXXXXXXXXXXXXXKEDRDTLEWKCVDNKSIQSMGLTSG 373 ED+GE VHIFTHIRLK+YV+ K++++ WKCV+ K++ ++GLTSG Sbjct: 363 EDIGEFVHIFTHIRLKVYVDLLVIRLKGGMSQLFRKQEKEATNWKCVEKKALPNLGLTSG 422 Query: 372 VRK 364 VRK Sbjct: 423 VRK 425 >gb|EYU46093.1| hypothetical protein MIMGU_mgv1a022080mg, partial [Mimulus guttatus] Length = 433 Score = 532 bits (1370), Expect = e-148 Identities = 269/411 (65%), Positives = 322/411 (78%), Gaps = 1/411 (0%) Frame = -1 Query: 1557 FSKPDTLKIRASLLQWYDQNQRVLPWRSSSKTHHYSQQDKEQNLRAYAVWVSEIMLQQTR 1378 F + KIR SLL+WYD+N+R LPWR S + +E+ RAYAVWVSE+MLQQTR Sbjct: 4 FRGKEIQKIRESLLEWYDENRRDLPWRRISNGGN-DVGVEEREKRAYAVWVSEVMLQQTR 62 Query: 1377 VSTVIDYYNRWMQKWPTIHHLSQASQEEVNEMWAGLGYYRRARFLLEGAKMVVEGGGEFP 1198 V TV+DY+NRWM KWPTIHHL+QAS EEVNEMWAGLGYYRRARFLLEGA+MVVEGGGEFP Sbjct: 63 VQTVVDYFNRWMGKWPTIHHLAQASIEEVNEMWAGLGYYRRARFLLEGAQMVVEGGGEFP 122 Query: 1197 TTVSALRKVRGIGDYTAGAIASIAFKEAVPVVDGNVIRVIARLKAISANPKERETIKDIW 1018 T + L VRGIG YTAGAIASIAF EAVPVVDGNVIRVI RLKAISANPK T+K+IW Sbjct: 123 KTATDLEMVRGIGKYTAGAIASIAFDEAVPVVDGNVIRVITRLKAISANPKNAATVKNIW 182 Query: 1017 KLAGQLVDPCRPGDFNQALMELGSTLCTPTNPDCSVCPVSGQCRALSLSRKCQSVQVTNY 838 KLA QLVDP RPGDFNQA+MELG+T C+ T+P CS CPVS QC+ALSLSRK +SVQVT+Y Sbjct: 183 KLARQLVDPLRPGDFNQAIMELGATACSVTSPSCSTCPVSHQCQALSLSRKQESVQVTDY 242 Query: 837 PIKIVKAKQRHDFCAICVVEILEEGNLGIEDGSHKSNGFLLVKRPNEGLLAGLWEFPSVL 658 P+K+ KAK RHDF A+ VVEI++E GS + +LLVKRP+EGLLAGLWEFPSVL Sbjct: 243 PMKVAKAKPRHDFSAVSVVEIVDE-------GSQSKSRYLLVKRPDEGLLAGLWEFPSVL 295 Query: 657 LDGEANLAIRREAIDEYLKKLFEIDTKEKSCIILREDVGECVHIFTHIRLKMYVE-XXXX 481 L GEA+LA RR+AID +LK+ F IDTK+ ++ RE+VGECVH+FTHIRLKMY+E Sbjct: 296 LVGEADLASRRKAIDSFLKQSFGIDTKKSCKVVSREEVGECVHVFTHIRLKMYIELLILQ 355 
Query: 480 XXXXXXXXXXXKEDRDTLEWKCVDNKSIQSMGLTSGVRKAFNMIQQHKQNG 328 K++ T++WK VD+K++ ++GLTSGVRK M+++ KQ+G Sbjct: 356 LTEGGMNCLHKKQESSTMKWKFVDDKALSTLGLTSGVRKVCTMVEKFKQSG 406 >ref|XP_002321221.2| hypothetical protein POPTR_0014s17120g [Populus trichocarpa] gi|550324385|gb|EEE99536.2| hypothetical protein POPTR_0014s17120g [Populus trichocarpa] Length = 482 Score = 531 bits (1368), Expect = e-148 Identities = 265/434 (61%), Positives = 327/434 (75%), Gaps = 11/434 (2%) Frame = -1 Query: 1632 KKKKRMREPQQQ-----KTSVVGGLEDIEDFSKPDTLKIRASLLQWYDQNQRVLPWRSSS 1468 K+ + +P++Q K VV +ED+ FS +T KIRASLL+WYD NQR LPWR + Sbjct: 13 KRNAAIAKPKEQRQHSSKKQVVADIEDL--FSDKETQKIRASLLEWYDHNQRDLPWRRIT 70 Query: 1467 KT------HHYSQQDKEQNLRAYAVWVSEIMLQQTRVSTVIDYYNRWMQKWPTIHHLSQA 1306 +T ++++E+ RAY VWVSE+MLQQTRV TVIDYYNRWM KWPT+HHL+QA Sbjct: 71 QTKETPFKEEEEEEEEEEERRAYGVWVSEVMLQQTRVQTVIDYYNRWMLKWPTLHHLAQA 130 Query: 1305 SQEEVNEMWAGLGYYRRARFLLEGAKMVVEGGGEFPTTVSALRKVRGIGDYTAGAIASIA 1126 S EEVNE WAGLGYYRRARFLLEGAKM+V GG FP VS+LRKV GIGDYTAGAIASIA Sbjct: 131 SLEEVNEKWAGLGYYRRARFLLEGAKMIVAGGDGFPKIVSSLRKVPGIGDYTAGAIASIA 190 Query: 1125 FKEAVPVVDGNVIRVIARLKAISANPKERETIKDIWKLAGQLVDPCRPGDFNQALMELGS 946 FKE VPVVDGNVIRV+ARLKAISANPK++ T+K WKLA QLVDP RPGDFNQ+LMELG+ Sbjct: 191 FKEVVPVVDGNVIRVLARLKAISANPKDKVTVKKFWKLAAQLVDPHRPGDFNQSLMELGA 250 Query: 945 TLCTPTNPDCSVCPVSGQCRALSLSRKCQSVQVTNYPIKIVKAKQRHDFCAICVVEILEE 766 TLCTP NP CS CPVSGQCRAL++S+ + V +T+YP K +K KQRH+F A+C VEI Sbjct: 251 TLCTPVNPSCSSCPVSGQCRALTISKLDKLVLITDYPAKSIKLKQRHEFSAVCAVEI--T 308 Query: 765 GNLGIEDGSHKSNGFLLVKRPNEGLLAGLWEFPSVLLDGEANLAIRREAIDEYLKKLFEI 586 G + +G S+ FLLVKRP+EGLLAGLWEFPSV+L EA++ RR+ ++ +LKK F + Sbjct: 309 GRQDLIEGDQSSSVFLLVKRPDEGLLAGLWEFPSVMLGKEADMTRRRKEMNRFLKKSFRL 368 Query: 585 DTKEKSCIILREDVGECVHIFTHIRLKMYVEXXXXXXXXXXXXXXXKEDRDTLEWKCVDN 406 D ++ ++LRED+GE +HIFTHIRLK+YVE K+ R+ + WKCVD Sbjct: 369 DPQKTCSVLLREDIGEFIHIFTHIRLKVYVELLIVHLKGDMSDLFSKQSRENMTWKCVDR 428 Query: 405 KSIQSMGLTSGVRK 364 +++ S+GLTSGVRK Sbjct: 429 EALSSLGLTSGVRK 442 
>ref|XP_007049485.1| HhH-GPD base excision DNA repair family protein [Theobroma cacao] gi|508701746|gb|EOX93642.1| HhH-GPD base excision DNA repair family protein [Theobroma cacao] Length = 461 Score = 529 bits (1363), Expect = e-147 Identities = 268/442 (60%), Positives = 333/442 (75%), Gaps = 6/442 (1%) Frame = -1 Query: 1632 KKKKRMREP-QQQKTSVVGGLEDIEDFSKPDTLKIRASLLQWYDQNQRVLPWR-----SS 1471 KK+ ++ + ++++ V+G +ED+ FS+ DT +IR+SLL+WYD+NQR LPWR S Sbjct: 10 KKRHQLNQLIKEEQEHVMGDIEDL--FSEEDTNRIRSSLLEWYDKNQRDLPWRRRTTKSG 67 Query: 1470 SKTHHYSQQDKEQNLRAYAVWVSEIMLQQTRVSTVIDYYNRWMQKWPTIHHLSQASQEEV 1291 + + +++++ RAY VWVSE+MLQQTRV TVIDYY RWMQKWPT+ HL+QAS EEV Sbjct: 68 NGKNVKKEEEEDDEKRAYGVWVSEVMLQQTRVQTVIDYYKRWMQKWPTLQHLAQASLEEV 127 Query: 1290 NEMWAGLGYYRRARFLLEGAKMVVEGGGEFPTTVSALRKVRGIGDYTAGAIASIAFKEAV 1111 NEMWAGLGYYRRARFLLEGAKM+V G EFP TVS LRKV GIGDYTAGAIASIAFKE V Sbjct: 128 NEMWAGLGYYRRARFLLEGAKMIVARGSEFPNTVSTLRKVPGIGDYTAGAIASIAFKEVV 187 Query: 1110 PVVDGNVIRVIARLKAISANPKERETIKDIWKLAGQLVDPCRPGDFNQALMELGSTLCTP 931 PVVDGNV+RV+ARLKAISANPK++ T+K+ WKLA QLVDP RPGDFNQ+LMELG+TLCTP Sbjct: 188 PVVDGNVVRVLARLKAISANPKDKTTVKNFWKLAAQLVDPSRPGDFNQSLMELGATLCTP 247 Query: 930 TNPDCSVCPVSGQCRALSLSRKCQSVQVTNYPIKIVKAKQRHDFCAICVVEILEEGNLGI 751 NP CS CPVS QC AL S+ +SV VT YP K+VKAKQR DF +CVVEI G+ G Sbjct: 248 LNPSCSSCPVSSQCCALYNSKNDESVVVTRYPTKVVKAKQRQDFSTVCVVEI--SGSQGT 305 Query: 750 EDGSHKSNGFLLVKRPNEGLLAGLWEFPSVLLDGEANLAIRREAIDEYLKKLFEIDTKEK 571 S + FLLVKRP+EGLLAGLWEFPSV LD EA+LA+RR+ ID+ LKK F+++ + Sbjct: 306 LHQSQPDSRFLLVKRPDEGLLAGLWEFPSVTLDEEADLAMRRKLIDQLLKKSFKLNPPKN 365 Query: 570 SCIILREDVGECVHIFTHIRLKMYVEXXXXXXXXXXXXXXXKEDRDTLEWKCVDNKSIQS 391 II R VGE VH+F+HIR K+YVE ++D T++WK +D+ ++ Sbjct: 366 CSIISRVLVGEFVHVFSHIRRKIYVELLVLHLKGGMHDLYKEKDSGTMDWKLLDSDAVSR 425 Query: 390 MGLTSGVRKAFNMIQQHKQNGL 325 MGLTS V+K ++M+Q KQNGL Sbjct: 426 MGLTSSVQKVYSMVQNFKQNGL 447 >ref|XP_004293166.1| PREDICTED: A/G-specific adenine DNA glycosylase-like [Fragaria vesca subsp. 
vesca] Length = 453 Score = 519 bits (1337), Expect = e-144 Identities = 265/420 (63%), Positives = 316/420 (75%), Gaps = 4/420 (0%) Frame = -1 Query: 1572 EDIED-FSKPDTLKIRASLLQWYDQNQRVLPWRSSSKTHHYSQQDKEQNLRAYAVWVSEI 1396 +DIED FS+ +T KIRASLL+WY N+R LPWR +Q+ + +R Y VWVSE+ Sbjct: 31 QDIEDLFSQDETQKIRASLLKWYGLNRRDLPWR---------EQEDDVEVRVYRVWVSEV 81 Query: 1395 MLQQTRVSTVIDYYNRWMQKWPTIHHLSQASQEEVNEMWAGLGYYRRARFLLEGAKMVVE 1216 MLQQTRV VI Y+NRWM KWPTIH L+QAS EEVNEMWAGLGYYRRARFLLEGA+ +V Sbjct: 82 MLQQTRVQAVIHYFNRWMSKWPTIHSLAQASLEEVNEMWAGLGYYRRARFLLEGARKIVA 141 Query: 1215 GGGEFPTTVSALRKVRGIGDYTAGAIASIAFKEAVPVVDGNVIRVIARLKAISANPKERE 1036 G +FP TVS LRK+ GIGDYTAGAIASIA KEAVPVVDGNVIRV ARLKAISANPK+ Sbjct: 142 EGDQFPKTVSQLRKIPGIGDYTAGAIASIALKEAVPVVDGNVIRVTARLKAISANPKDSS 201 Query: 1035 TIKDIWKLAGQLVDPCRPGDFNQALMELGSTLCTPTNPDCSVCPVSGQCRALSLSRKCQS 856 T+K WKLA QLVDP +PGDFNQALMELG+T+CTP++P C CPVS QC ALS+SR S Sbjct: 202 TVKKFWKLAAQLVDPFQPGDFNQALMELGATVCTPSSPSCGTCPVSDQCCALSISRHDSS 261 Query: 855 VQVTNYPIKIVKAKQRHDFCAICVVEILEEGNLGIEDGSHK---SNGFLLVKRPNEGLLA 685 V VT+YPIK+VKAKQRH+F A+CVVEI +G E+ + +NGFLLVKRP+EGLLA Sbjct: 262 VVVTDYPIKVVKAKQRHEFSAVCVVEI-----VGDEESLKRHQINNGFLLVKRPDEGLLA 316 Query: 684 GLWEFPSVLLDGEANLAIRREAIDEYLKKLFEIDTKEKSCIILREDVGECVHIFTHIRLK 505 GLWEFPSV L GE +L RR+AID+YLKK F + ++ II RE VGE VH+F+HIRLK Sbjct: 317 GLWEFPSVSLAGEVDLLARRKAIDQYLKKYFTLQPRKTCDIICREHVGEYVHVFSHIRLK 376 Query: 504 MYVEXXXXXXXXXXXXXXXKEDRDTLEWKCVDNKSIQSMGLTSGVRKAFNMIQQHKQNGL 325 MYVE K D+D + WKCVD K + +MGLTSGV+K + M+Q+ ++ L Sbjct: 377 MYVELLILRVEGGINDLVSKRDKDIVPWKCVDAKVLSNMGLTSGVKKVYTMVQKFRRGNL 436 >ref|XP_007206303.1| hypothetical protein PRUPE_ppa020735mg, partial [Prunus persica] gi|462401945|gb|EMJ07502.1| hypothetical protein PRUPE_ppa020735mg, partial [Prunus persica] Length = 521 Score = 517 bits (1332), Expect = e-144 Identities = 262/434 (60%), Positives = 326/434 (75%), Gaps = 2/434 (0%) Frame = -1 Query: 1632 KKKKRMREPQQQKTSVVGGLEDIED--FSKPDTLKIRASLLQWYDQNQRVLPWRSSSKTH 1459 ++++ +EP+ 
++DIED FS+ + +IR +LL+WY N+R LPWR + Sbjct: 108 RRRQSAKEPE---------IQDIEDLFFSEEEAQRIRQALLEWYGLNRRELPWREA---- 154 Query: 1458 HYSQQDKEQNLRAYAVWVSEIMLQQTRVSTVIDYYNRWMQKWPTIHHLSQASQEEVNEMW 1279 ++D E+ RAY VWVSE+MLQQTRV TV+ Y++RWM KWPTIHHL+QAS EEVNE+W Sbjct: 155 ---EEDVER--RAYRVWVSEVMLQQTRVQTVVQYFHRWMSKWPTIHHLAQASLEEVNELW 209 Query: 1278 AGLGYYRRARFLLEGAKMVVEGGGEFPTTVSALRKVRGIGDYTAGAIASIAFKEAVPVVD 1099 AGLGYYRRARFLLEGA+M+V +FP TVS LRKVRGIGDYTAGAIASIAFKE VPVVD Sbjct: 210 AGLGYYRRARFLLEGARMIVAEEVQFPKTVSQLRKVRGIGDYTAGAIASIAFKEVVPVVD 269 Query: 1098 GNVIRVIARLKAISANPKERETIKDIWKLAGQLVDPCRPGDFNQALMELGSTLCTPTNPD 919 GNV+RVIARLKA+SANPK+ T+K WKLA QLVDP +PG+FNQALMELG+T+CTP +P Sbjct: 270 GNVVRVIARLKAVSANPKDSSTVKKFWKLAAQLVDPFQPGEFNQALMELGATVCTPLSPS 329 Query: 918 CSVCPVSGQCRALSLSRKCQSVQVTNYPIKIVKAKQRHDFCAICVVEILEEGNLGIEDGS 739 C CPVS QC ALS+SR SV VT+YP+K+VKAKQRHDF A+CVV+IL G+ + +G Sbjct: 330 CHSCPVSIQCCALSISRADSSVLVTDYPVKVVKAKQRHDFSAVCVVQIL--GDEELSEGH 387 Query: 738 HKSNGFLLVKRPNEGLLAGLWEFPSVLLDGEANLAIRREAIDEYLKKLFEIDTKEKSCII 559 +NGFLLVKRP+EGLLAGLWEFPSVLL GEA+L RR+AID+YL K F ++ + I+ Sbjct: 388 RTNNGFLLVKRPDEGLLAGLWEFPSVLLAGEADLVTRRKAIDQYLNKHFRLNPRNTCDIV 447 Query: 558 LREDVGECVHIFTHIRLKMYVEXXXXXXXXXXXXXXXKEDRDTLEWKCVDNKSIQSMGLT 379 RE VGE +H+FTHIRLKMYVE K+ ++T+ WKCVD + + SMGLT Sbjct: 448 SREYVGENIHVFTHIRLKMYVELLVLHLKGGMKDLVSKQGKETVPWKCVDAEVLSSMGLT 507 Query: 378 SGVRKAFNMIQQHK 337 SGVRK + + K Sbjct: 508 SGVRKVMSYFHKSK 521 >ref|XP_004510725.1| PREDICTED: A/G-specific adenine DNA glycosylase-like isoform X1 [Cicer arietinum] Length = 478 Score = 514 bits (1323), Expect = e-143 Identities = 266/433 (61%), Positives = 323/433 (74%), Gaps = 5/433 (1%) Frame = -1 Query: 1599 QKTSVVGGLEDIED---FSKPDTLKIRASL-LQWYDQNQRVLPWRSSSKTHHYSQQDKEQ 1432 +KT + +EDIED FSK +T K+R L WYD N+R LPWR++ +H ++DKE+ Sbjct: 36 KKTRTLVEMEDIEDSMSFSKDETHKLRXXXXLDWYDHNRRDLPWRTTF--NHNIEEDKEE 93 Query: 1431 -NLRAYAVWVSEIMLQQTRVSTVIDYYNRWMQKWPTIHHLSQASQEEVNEMWAGLGYYRR 1255 RAY VWVSE+MLQQTRV TVI YYNRWM KWPTIHHL++AS 
EEVNE+WAGLGYYRR Sbjct: 94 VEKRAYGVWVSEVMLQQTRVQTVIAYYNRWMLKWPTIHHLAKASLEEVNEIWAGLGYYRR 153 Query: 1254 ARFLLEGAKMVVEGGGEFPTTVSALRKVRGIGDYTAGAIASIAFKEAVPVVDGNVIRVIA 1075 ARFLLEGAK +V GG P T S LRK+ GIGDYT+GAIASIAFKEAVPVVDGNVIRVIA Sbjct: 154 ARFLLEGAKKIVAEGGSIPKTASMLRKIPGIGDYTSGAIASIAFKEAVPVVDGNVIRVIA 213 Query: 1074 RLKAISANPKERETIKDIWKLAGQLVDPCRPGDFNQALMELGSTLCTPTNPDCSVCPVSG 895 RL+A+S NPK+ IK W++A QLVDP RPGDFNQ+LMELG+T+CTP NP CS CP S Sbjct: 214 RLRAVSENPKDSAIIKKFWEIAAQLVDPLRPGDFNQSLMELGATVCTPLNPSCSSCPASE 273 Query: 894 QCRALSLSRKCQSVQVTNYPIKIVKAKQRHDFCAICVVEILEEGNLGIEDGSHKSNGFLL 715 C ALS+ ++ + VT+YPIK VK KQR DF A+CVVE+L G + +E +H S+ F+L Sbjct: 274 FCHALSIVKQDSTAAVTDYPIKGVKVKQRSDFSAVCVVELL-GGEVSLEK-NHSSSIFVL 331 Query: 714 VKRPNEGLLAGLWEFPSVLLDGEANLAIRREAIDEYLKKLFEIDTKEKSCIILREDVGEC 535 VKRP+EGLLAGLWEFPSVLLDGE RR+A D +LKK +ID ++ IILREDVGE Sbjct: 332 VKRPDEGLLAGLWEFPSVLLDGETAPLARRKATDCFLKKNLKIDIRKTCDIILREDVGEF 391 Query: 534 VHIFTHIRLKMYVEXXXXXXXXXXXXXXXKEDRDTLEWKCVDNKSIQSMGLTSGVRKAFN 355 VHIF+HIRLK+YVE ED +T+ WKCVD+ ++ SMGLT+ VRKA++ Sbjct: 392 VHIFSHIRLKLYVELLVLQLKGKVDDLFESEDDETITWKCVDSNALSSMGLTTSVRKAYD 451 Query: 354 MIQQHKQNGLLIN 316 M+Q+ KQ L N Sbjct: 452 MVQKFKQKRLPFN 464 >ref|XP_006583255.1| PREDICTED: A/G-specific adenine DNA glycosylase-like isoform X2 [Glycine max] Length = 470 Score = 513 bits (1322), Expect = e-143 Identities = 267/447 (59%), Positives = 321/447 (71%), Gaps = 12/447 (2%) Frame = -1 Query: 1629 KKKRMREPQQQKTSVVGG---------LEDIED---FSKPDTLKIRASLLQWYDQNQRVL 1486 +KK+ + ++ VVG +EDIED FSK +T K+R +LL WYD N+R L Sbjct: 17 EKKKKKNSTRRSVVVVGESKKPQPLVEVEDIEDSLSFSKDETHKLRVALLDWYDLNRRDL 76 Query: 1485 PWRSSSKTHHYSQQDKEQNLRAYAVWVSEIMLQQTRVSTVIDYYNRWMQKWPTIHHLSQA 1306 PWR++ K Q+D+E RAY VWVSE+MLQQTRV TVI YYNRWMQKWPTIHHL+QA Sbjct: 77 PWRTTFK-----QEDEEVERRAYGVWVSEVMLQQTRVQTVIAYYNRWMQKWPTIHHLAQA 131 Query: 1305 SQEEVNEMWAGLGYYRRARFLLEGAKMVVEGGGEFPTTVSALRKVRGIGDYTAGAIASIA 1126 S EEVNEMWAGLGYYRRARFLLEGAK +V GG+ P S LR + GIG+YT+GAIASIA Sbjct: 132 
SLEEVNEMWAGLGYYRRARFLLEGAKKIVAEGGQIPKVASMLRNIPGIGEYTSGAIASIA 191 Query: 1125 FKEAVPVVDGNVIRVIARLKAISANPKERETIKDIWKLAGQLVDPCRPGDFNQALMELGS 946 FKE VPVVDGNV+RVIARL+AISANPK+ TIK WKLA QLVDP RPGDFNQALMELG+ Sbjct: 192 FKEVVPVVDGNVVRVIARLRAISANPKDSATIKKFWKLAAQLVDPVRPGDFNQALMELGA 251 Query: 945 TLCTPTNPDCSVCPVSGQCRALSLSRKCQSVQVTNYPIKIVKAKQRHDFCAICVVEILEE 766 T+CTP NP CS CP S C ALS ++ +V VT+YP+K VK KQR DF A+CVVE++ Sbjct: 252 TVCTPLNPSCSSCPASEFCHALSNAKHDSTVAVTDYPVKGVKVKQRCDFSAVCVVELVGA 311 Query: 765 GNLGIEDGSHKSNGFLLVKRPNEGLLAGLWEFPSVLLDGEANLAIRREAIDEYLKKLFEI 586 L S K F+LVKRP EGLLAGLWEFPSVLLDGEA RREA+D +L+K +I Sbjct: 312 ETLNKNQSSSK---FILVKRPEEGLLAGLWEFPSVLLDGEAVPLARREAMDRFLEKNLKI 368 Query: 585 DTKEKSCIILREDVGECVHIFTHIRLKMYVEXXXXXXXXXXXXXXXKEDRDTLEWKCVDN 406 D ++ I+LRED+GE VHIF+HIRLK+YVE +++ T WKCV + Sbjct: 369 DIRKTCNIVLREDIGEFVHIFSHIRLKLYVELLVLQLKGVDDLFKSPDNKTT--WKCVYS 426 Query: 405 KSIQSMGLTSGVRKAFNMIQQHKQNGL 325 ++ SMGLT+ VRK +NM+Q KQ L Sbjct: 427 NALSSMGLTTSVRKVYNMVQNFKQKTL 453 >ref|XP_003528811.1| PREDICTED: A/G-specific adenine DNA glycosylase-like isoform X1 [Glycine max] Length = 471 Score = 513 bits (1322), Expect = e-143 Identities = 268/447 (59%), Positives = 319/447 (71%), Gaps = 12/447 (2%) Frame = -1 Query: 1629 KKKRMREPQQQKTSVVGG---------LEDIED---FSKPDTLKIRASLLQWYDQNQRVL 1486 +KK+ + ++ VVG +EDIED FSK +T K+R +LL WYD N+R L Sbjct: 17 EKKKKKNSTRRSVVVVGESKKPQPLVEVEDIEDSLSFSKDETHKLRVALLDWYDLNRRDL 76 Query: 1485 PWRSSSKTHHYSQQDKEQNLRAYAVWVSEIMLQQTRVSTVIDYYNRWMQKWPTIHHLSQA 1306 PWR++ K Q+D+E RAY VWVSE+MLQQTRV TVI YYNRWMQKWPTIHHL+QA Sbjct: 77 PWRTTFK-----QEDEEVERRAYGVWVSEVMLQQTRVQTVIAYYNRWMQKWPTIHHLAQA 131 Query: 1305 SQEEVNEMWAGLGYYRRARFLLEGAKMVVEGGGEFPTTVSALRKVRGIGDYTAGAIASIA 1126 S EEVNEMWAGLGYYRRARFLLEGAK +V GG+ P S LR + GIG+YT+GAIASIA Sbjct: 132 SLEEVNEMWAGLGYYRRARFLLEGAKKIVAEGGQIPKVASMLRNIPGIGEYTSGAIASIA 191 Query: 1125 FKEAVPVVDGNVIRVIARLKAISANPKERETIKDIWKLAGQLVDPCRPGDFNQALMELGS 946 FKE VPVVDGNV+RVIARL+AISANPK+ TIK WKLA QLVDP RPGDFNQALMELG+ Sbjct: 192 
FKEVVPVVDGNVVRVIARLRAISANPKDSATIKKFWKLAAQLVDPVRPGDFNQALMELGA 251 Query: 945 TLCTPTNPDCSVCPVSGQCRALSLSRKCQSVQVTNYPIKIVKAKQRHDFCAICVVEILEE 766 T+CTP NP CS CP S C ALS ++ +V VT+YP+K VK KQR DF A+CVVE++ Sbjct: 252 TVCTPLNPSCSSCPASEFCHALSNAKHDSTVAVTDYPVKGVKVKQRCDFSAVCVVELVGA 311 Query: 765 GNLGIEDGSHKSNGFLLVKRPNEGLLAGLWEFPSVLLDGEANLAIRREAIDEYLKKLFEI 586 L S K F+LVKRP EGLLAGLWEFPSVLLDGEA RREA+D +L+K +I Sbjct: 312 ETLNKNQSSSK---FILVKRPEEGLLAGLWEFPSVLLDGEAVPLARREAMDRFLEKNLKI 368 Query: 585 DTKEKSCIILREDVGECVHIFTHIRLKMYVEXXXXXXXXXXXXXXXKEDRDTLEWKCVDN 406 D ++ I+LRED+GE VHIF+HIRLK+YVE D T WKCV + Sbjct: 369 DIRKTCNIVLREDIGEFVHIFSHIRLKLYVELLVLQLKVGVDDLFKSPDNKT-TWKCVYS 427 Query: 405 KSIQSMGLTSGVRKAFNMIQQHKQNGL 325 ++ SMGLT+ VRK +NM+Q KQ L Sbjct: 428 NALSSMGLTTSVRKVYNMVQNFKQKTL 454 >ref|XP_006282600.1| hypothetical protein CARUB_v10004796mg [Capsella rubella] gi|482551305|gb|EOA15498.1| hypothetical protein CARUB_v10004796mg [Capsella rubella] Length = 450 Score = 513 bits (1320), Expect = e-142 Identities = 262/433 (60%), Positives = 321/433 (74%) Frame = -1 Query: 1632 KKKKRMREPQQQKTSVVGGLEDIEDFSKPDTLKIRASLLQWYDQNQRVLPWRSSSKTHHY 1453 KKK R +P++++ + G +ED+ FS +T +IR SLL WYD NQR LPWR Sbjct: 12 KKKSRAEKPEEEEEPLGGDIEDL--FSGNETQEIRMSLLDWYDTNQRDLPWRKRR----- 64 Query: 1452 SQQDKEQNLRAYAVWVSEIMLQQTRVSTVIDYYNRWMQKWPTIHHLSQASQEEVNEMWAG 1273 S+ +KE+ RAY VWVSEIMLQQTRV TV++YY RWM KWPTI+ L+QAS EEVNEMWAG Sbjct: 65 SESEKER--RAYEVWVSEIMLQQTRVQTVLEYYKRWMLKWPTINDLAQASLEEVNEMWAG 122 Query: 1272 LGYYRRARFLLEGAKMVVEGGGEFPTTVSALRKVRGIGDYTAGAIASIAFKEAVPVVDGN 1093 LGYYRRARFLLEGAKMVV G FP S+L KV+GIG+YTAGAIASIAF EAVPVVDGN Sbjct: 123 LGYYRRARFLLEGAKMVVAGKDGFPNQASSLMKVKGIGEYTAGAIASIAFNEAVPVVDGN 182 Query: 1092 VIRVIARLKAISANPKERETIKDIWKLAGQLVDPCRPGDFNQALMELGSTLCTPTNPDCS 913 VIRV+ARLKAISANPK+R T ++ WKLA QLVDP RPGDFNQ+LMELG+TLC+ + P CS Sbjct: 183 VIRVLARLKAISANPKDRRTARNFWKLAAQLVDPSRPGDFNQSLMELGATLCSVSKPSCS 242 Query: 912 VCPVSGQCRALSLSRKCQSVQVTNYPIKIVKAKQRHDFCAICVVEILEEGNLGIEDGSHK 733 CPVS QCRA SLS++ +++ 
VT+YP K+VKAK R DFC +CV+EIL NL + + Sbjct: 243 SCPVSSQCRAYSLSQENRTISVTDYPTKVVKAKPRCDFCCVCVLEIL---NL---ERNQS 296 Query: 732 SNGFLLVKRPNEGLLAGLWEFPSVLLDGEANLAIRREAIDEYLKKLFEIDTKEKSCIILR 553 F+LVKRP EGLLAGLWEFPSV+LD EA LA RR AI+ YLK+ F + K+ I+ R Sbjct: 297 GGRFVLVKRPEEGLLAGLWEFPSVILDKEAGLATRRNAINLYLKEAFHVQPKKTCTIVSR 356 Query: 552 EDVGECVHIFTHIRLKMYVEXXXXXXXXXXXXXXXKEDRDTLEWKCVDNKSIQSMGLTSG 373 +++GE VHIFTHIR K+YVE + DTL WKCV + ++ +MGLTS Sbjct: 357 KELGEFVHIFTHIRRKVYVELLVVQLTGGTDALLKDQANDTLTWKCVGSDALSTMGLTSA 416 Query: 372 VRKAFNMIQQHKQ 334 VRK ++M++ HKQ Sbjct: 417 VRKVYSMVEAHKQ 429 >ref|XP_004510727.1| PREDICTED: A/G-specific adenine DNA glycosylase-like isoform X3 [Cicer arietinum] Length = 478 Score = 512 bits (1318), Expect = e-142 Identities = 264/433 (60%), Positives = 322/433 (74%), Gaps = 5/433 (1%) Frame = -1 Query: 1599 QKTSVVGGLEDIED---FSKPDTLKIRASL-LQWYDQNQRVLPWRSSSKTHHYSQQDKEQ 1432 +KT + +EDIED FSK +T K+R L WYD N+R LPWR++ +H ++DKE+ Sbjct: 36 KKTRTLVEMEDIEDSMSFSKDETHKLRXXXXLDWYDHNRRDLPWRTTF--NHNIEEDKEE 93 Query: 1431 -NLRAYAVWVSEIMLQQTRVSTVIDYYNRWMQKWPTIHHLSQASQEEVNEMWAGLGYYRR 1255 RAY VWVSE+MLQQTRV TVI YYNRWM KWPTIHHL++AS EEVNE+WAGLGYYRR Sbjct: 94 VEKRAYGVWVSEVMLQQTRVQTVIAYYNRWMLKWPTIHHLAKASLEEVNEIWAGLGYYRR 153 Query: 1254 ARFLLEGAKMVVEGGGEFPTTVSALRKVRGIGDYTAGAIASIAFKEAVPVVDGNVIRVIA 1075 ARFLLEGAK +V GG P T S LRK+ GIGDYT+GAIASIAFKE +PVVDGNVIRVIA Sbjct: 154 ARFLLEGAKKIVAEGGSIPKTASMLRKIPGIGDYTSGAIASIAFKEVIPVVDGNVIRVIA 213 Query: 1074 RLKAISANPKERETIKDIWKLAGQLVDPCRPGDFNQALMELGSTLCTPTNPDCSVCPVSG 895 RL+A+S NPK+ IK W++A QLVDP RPGDFNQ+LMELG+T+CTP NP CS CP S Sbjct: 214 RLRAVSENPKDSAIIKKFWEIAAQLVDPLRPGDFNQSLMELGATVCTPLNPSCSSCPASE 273 Query: 894 QCRALSLSRKCQSVQVTNYPIKIVKAKQRHDFCAICVVEILEEGNLGIEDGSHKSNGFLL 715 C ALS+ ++ + VT+YPIK VK KQR DF A+CVVE+L G + +E +H S+ F+L Sbjct: 274 FCHALSIVKQDSTAAVTDYPIKGVKVKQRSDFSAVCVVELL-GGEVSLEK-NHSSSIFVL 331 Query: 714 VKRPNEGLLAGLWEFPSVLLDGEANLAIRREAIDEYLKKLFEIDTKEKSCIILREDVGEC 535 VKRP+EGLLAGLWEFPSVLLDGE RR+A D +LKK +ID ++ IILREDVGE 
Sbjct: 332 VKRPDEGLLAGLWEFPSVLLDGETAPLARRKATDCFLKKNLKIDIRKTCDIILREDVGEF 391 Query: 534 VHIFTHIRLKMYVEXXXXXXXXXXXXXXXKEDRDTLEWKCVDNKSIQSMGLTSGVRKAFN 355 VHIF+HIRLK+YVE ED +T+ WKCVD+ ++ SMGLT+ VRKA++ Sbjct: 392 VHIFSHIRLKLYVELLVLQLKGKVDDLFESEDDETITWKCVDSNALSSMGLTTSVRKAYD 451 Query: 354 MIQQHKQNGLLIN 316 M+Q+ KQ L N Sbjct: 452 MVQKFKQKRLPFN 464 >ref|XP_007135205.1| hypothetical protein PHAVU_010G109900g [Phaseolus vulgaris] gi|561008250|gb|ESW07199.1| hypothetical protein PHAVU_010G109900g [Phaseolus vulgaris] Length = 475 Score = 510 bits (1313), Expect = e-142 Identities = 268/450 (59%), Positives = 320/450 (71%), Gaps = 11/450 (2%) Frame = -1 Query: 1632 KKKKRMREPQ----QQKTSVVGGLEDIED---FSKPDTLKIRASLLQWYDQNQRVLPWRS 1474 +KKK MR +K + +EDIED FSK +T K+R SLL WYD N+R LPWR Sbjct: 18 EKKKSMRRRSIVGASKKPQPLVEVEDIEDAISFSKDETHKLRVSLLDWYDLNRRDLPWR- 76 Query: 1473 SSKTHHYSQQDKEQN---LRAYAVWVSEIMLQQTRVSTVIDYYNRWMQKWPTIHHLSQAS 1303 THH ++K++ RAY VWVSE+MLQQTRV TVI YYNRWMQKWPTI+HL+QAS Sbjct: 77 ---THHREDEEKQEEELERRAYGVWVSEVMLQQTRVQTVIAYYNRWMQKWPTIYHLAQAS 133 Query: 1302 QEEVNEMWAGLGYYRRARFLLEGAKMVVEGGGEFPTTVSALRKVRGIGDYTAGAIASIAF 1123 EEVNEMWAGLGYYRRARFLLEGAK VV GG+ P S L K+ GIGDYT+GAIASIAF Sbjct: 134 LEEVNEMWAGLGYYRRARFLLEGAKKVVAEGGKIPKVASMLLKIPGIGDYTSGAIASIAF 193 Query: 1122 KEAVPVVDGNVIRVIARLKAISANPKERETIKDIWKLAGQLVDPCRPGDFNQALMELGST 943 KE VPVVDGNV+RVIARL+A+S NPK+ T+K WKLA QLVDP RPGDFNQALMELG+T Sbjct: 194 KEVVPVVDGNVVRVIARLRAVSTNPKDSATVKRFWKLAAQLVDPVRPGDFNQALMELGAT 253 Query: 942 LCTPTNPDCSVCPVSGQCRALSLSRKCQSVQVTNYPIKIVKAKQRHDFCAICVVEILEEG 763 +CTP NP CS CP S C+ALS ++ +V VT+YP+K VK KQR DF A+CVVE+L G Sbjct: 254 VCTPLNPSCSSCPASEFCQALSNAKHDTAVAVTDYPVKGVKVKQRRDFSAVCVVELL--G 311 Query: 762 NLGIEDGSHKSNGFLLVKRPNEGLLAGLWEFPSVLLDGE-ANLAIRREAIDEYLKKLFEI 586 + D + + F+LVKRP EGLLAGLWEFPSVLLDGE L RREA+D +LK F+I Sbjct: 312 AEALLDKNQSISKFILVKRPEEGLLAGLWEFPSVLLDGETVPLTTRREAMDRFLKANFKI 371 Query: 585 DTKEKSCIILREDVGECVHIFTHIRLKMYVEXXXXXXXXXXXXXXXKEDRDTLEWKCVDN 406 D ++ I+LRED+GE VHIF+HIRLK+YVE K 
+ WKCV + Sbjct: 372 DVRKTCNIVLREDIGEFVHIFSHIRLKLYVELLVLQFKEGGEDDLFKSPDNKPTWKCVYS 431 Query: 405 KSIQSMGLTSGVRKAFNMIQQHKQNGLLIN 316 ++ MGLT+ VRK +NM+Q KQ L N Sbjct: 432 NALSGMGLTTSVRKVYNMVQNFKQKALPSN 461 >ref|XP_004510726.1| PREDICTED: A/G-specific adenine DNA glycosylase-like isoform X2 [Cicer arietinum] Length = 475 Score = 509 bits (1312), Expect = e-141 Identities = 266/433 (61%), Positives = 323/433 (74%), Gaps = 5/433 (1%) Frame = -1 Query: 1599 QKTSVVGGLEDIED---FSKPDTLKIRASL-LQWYDQNQRVLPWRSSSKTHHYSQQDKEQ 1432 +KT + +EDIED FSK +T K+R L WYD N+R LPWR++ +H ++DKE+ Sbjct: 36 KKTRTLVEMEDIEDSMSFSKDETHKLRXXXXLDWYDHNRRDLPWRTTF--NHNIEEDKEE 93 Query: 1431 -NLRAYAVWVSEIMLQQTRVSTVIDYYNRWMQKWPTIHHLSQASQEEVNEMWAGLGYYRR 1255 RAY VWVSE+MLQQTRV TVI YYNRWM KWPTIHHL++AS EEVNE+WAGLGYYRR Sbjct: 94 VEKRAYGVWVSEVMLQQTRVQTVIAYYNRWMLKWPTIHHLAKASLEEVNEIWAGLGYYRR 153 Query: 1254 ARFLLEGAKMVVEGGGEFPTTVSALRKVRGIGDYTAGAIASIAFKEAVPVVDGNVIRVIA 1075 ARFLLEGAK +V GG P T S LRK+ GIGDYT+GAIASIAFKEAVPVVDGNVIRVIA Sbjct: 154 ARFLLEGAKKIVAEGGSIPKTASMLRKIPGIGDYTSGAIASIAFKEAVPVVDGNVIRVIA 213 Query: 1074 RLKAISANPKERETIKDIWKLAGQLVDPCRPGDFNQALMELGSTLCTPTNPDCSVCPVSG 895 RL+A+S NPK+ IK W++A QLVDP RPGDFNQ+LMELG+T+CTP NP CS CP S Sbjct: 214 RLRAVSENPKDSAIIKKFWEIAAQLVDPLRPGDFNQSLMELGATVCTPLNPSCSSCPASE 273 Query: 894 QCRALSLSRKCQSVQVTNYPIKIVKAKQRHDFCAICVVEILEEGNLGIEDGSHKSNGFLL 715 C ALS+ ++ + VT+YPIK VK KQR DF A+CVVE+L G + +E +H S+ F+L Sbjct: 274 FCHALSIVKQDSTAAVTDYPIKGVKVKQRSDFSAVCVVELL-GGEVSLEK-NHSSSIFVL 331 Query: 714 VKRPNEGLLAGLWEFPSVLLDGEANLAIRREAIDEYLKKLFEIDTKEKSCIILREDVGEC 535 VKRP+EGLLAGLWEFPSVLLDGE RR+A D +LKK +ID ++ IILREDVGE Sbjct: 332 VKRPDEGLLAGLWEFPSVLLDGETAPLARRKATDCFLKKNLKIDIRKTCDIILREDVGEF 391 Query: 534 VHIFTHIRLKMYVEXXXXXXXXXXXXXXXKEDRDTLEWKCVDNKSIQSMGLTSGVRKAFN 355 VHIF+HIRLK+YVE ED +T+ WKCVD+ ++ SMGLT+ VRKA++ Sbjct: 392 VHIFSHIRLKLYVE---LLVLQLKDDLFESEDDETITWKCVDSNALSSMGLTTSVRKAYD 448 Query: 354 MIQQHKQNGLLIN 316 M+Q+ KQ L N Sbjct: 449 MVQKFKQKRLPFN 461