BLASTX nr result
ID: Cocculus23_contig00009382
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00009382 (1308 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                  Score   E
Sequences producing significant alignments:                       (bits)  Value

ref|XP_002263635.1| PREDICTED: formamidopyrimidine-DNA glycosyla...   344   4e-92
ref|XP_006427395.1| hypothetical protein CICLE_v10025737mg [Citr...   340   1e-90
ref|XP_006492080.1| PREDICTED: formamidopyrimidine-DNA glycosyla...   338   3e-90
ref|XP_007023286.1| MUTM-1 isoform 2 [Theobroma cacao] gi|508778...   323   8e-86
ref|XP_003542122.1| PREDICTED: formamidopyrimidine-DNA glycosyla...   323   8e-86
ref|XP_004486649.1| PREDICTED: formamidopyrimidine-DNA glycosyla...   323   1e-85
gb|AAC97952.1| putative formamidopyrimidine-DNA glycosylase 1 [A...   323   1e-85
ref|XP_006856551.1| hypothetical protein AMTR_s00046p00171520 [A...   322   3e-85
ref|NP_564608.1| formamidopyrimidine-DNA glycosylase [Arabidopsi...   320   1e-84
ref|XP_006385295.1| hypothetical protein POPTR_0003s02540g [Popu...   319   1e-84
ref|XP_007215481.1| hypothetical protein PRUPE_ppa006603mg [Prun...   319   1e-84
ref|XP_006595167.1| PREDICTED: formamidopyrimidine-DNA glycosyla...   319   2e-84
ref|XP_007150770.1| hypothetical protein PHAVU_005G1793001g [Pha...   318   3e-84
ref|XP_007023285.1| MUTM-1 isoform 1 [Theobroma cacao] gi|508778...   317   1e-83
gb|EXB67257.1| Formamidopyrimidine-DNA glycosylase [Morus notabi...   315   3e-83
ref|XP_006304948.1| hypothetical protein CARUB_v10011435mg [Caps...   311   5e-82
ref|XP_002517673.1| formamidopyrimidine-DNA glycosylase, putativ...   310   9e-82
pdb|3TWL|A Chain A, Crystal Structure Of Arabidopsis Thaliana Fp...
309 2e-81 ref|XP_003597926.1| Formamidopyrimidine-DNA glycosylase [Medicag... 307 6e-81 ref|XP_006392913.1| hypothetical protein EUTSA_v10011553mg [Eutr... 306 1e-80 >ref|XP_002263635.1| PREDICTED: formamidopyrimidine-DNA glycosylase-like [Vitis vinifera] Length = 403 Score = 344 bits (883), Expect = 4e-92 Identities = 187/288 (64%), Positives = 209/288 (72%), Gaps = 27/288 (9%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V DTDEWPSKYSKLFI+LDDGLELSFTDKRRFA+VRLL+DPASVPPISELGPDALLEPMT Sbjct: 97 VKDTDEWPSKYSKLFIELDDGLELSFTDKRRFAKVRLLEDPASVPPISELGPDALLEPMT 156 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 +++F++SLSKKKI +KALLLDQSYIAGIGNW+ADEVLY ARIHPLQ ASSL +ESCE+LH Sbjct: 157 IDEFIKSLSKKKIAIKALLLDQSYIAGIGNWLADEVLYHARIHPLQVASSLTRESCETLH 216 Query: 946 KCIKEVIEKAVEVGADSSQFPENWIFHSREKKPGKAFVDGKIIEFITAGGRTTAYVPELQ 767 +CIK+VIEKA+EVGADSSQFP NWIFHSREKKPGKAFVDGK I+FI+AGGRTTAYVPELQ Sbjct: 217 QCIKQVIEKAMEVGADSSQFPSNWIFHSREKKPGKAFVDGKKIDFISAGGRTTAYVPELQ 276 Query: 766 KLPGDQTGKELAKPKKQSP---------DGDGXXXXXXXXEILKKN----KSKKGQKTAT 626 KL G Q K KP+KQ+P D D +KN KSKKGQ Sbjct: 277 KLSGTQAAKASVKPRKQTPMRKKEENDEDDDDDDALDEPASEEEKNTKRAKSKKGQNPKG 336 Query: 625 XXXXXXXXXXXXXGRRDSGD--------------DGEQAKKKTKVTTN 524 DS D DG+Q KK +VT N Sbjct: 337 GGKKPPAKRKVEESDNDSDDNDDNNDDDDDDEDKDGDQ-KKAKRVTKN 383 >ref|XP_006427395.1| hypothetical protein CICLE_v10025737mg [Citrus clementina] gi|557529385|gb|ESR40635.1| hypothetical protein CICLE_v10025737mg [Citrus clementina] Length = 408 Score = 340 bits (871), Expect = 1e-90 Identities = 173/234 (73%), Positives = 193/234 (82%), Gaps = 10/234 (4%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V DTDEWPSKYSK F++LDDGLELSFTDKRRFA+VRLL DP SVPPISELGPDALLEPMT Sbjct: 97 VKDTDEWPSKYSKFFVELDDGLELSFTDKRRFAKVRLLNDPTSVPPISELGPDALLEPMT 156 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 +++F +SLSKKKI 
+KALLLDQSYI+GIGNW+ADEVLYQA+IHPLQ A SL+KESC +L Sbjct: 157 VDEFTDSLSKKKITLKALLLDQSYISGIGNWVADEVLYQAKIHPLQTAVSLSKESCATLL 216 Query: 946 KCIKEVIEKAVEVGADSSQFPENWIFHSREKKPGKAFVDGKIIEFITAGGRTTAYVPELQ 767 KCIKEVIEKA+EVGADSSQFP NWIFHSREKKPGKAFVDGK I+FITAGGRTTAYVPELQ Sbjct: 217 KCIKEVIEKALEVGADSSQFPSNWIFHSREKKPGKAFVDGKKIDFITAGGRTTAYVPELQ 276 Query: 766 KLPGDQTGKELAKPKKQSPDGD----------GXXXXXXXXEILKKNKSKKGQK 635 KL G Q K + KP+KQ+P G+ G EI + KSKK QK Sbjct: 277 KLNGVQAAKAVGKPRKQAPKGEDSKDDDKYNSGDESESDGEEIAENVKSKKRQK 330 >ref|XP_006492080.1| PREDICTED: formamidopyrimidine-DNA glycosylase-like isoform X1 [Citrus sinensis] Length = 408 Score = 338 bits (867), Expect = 3e-90 Identities = 173/234 (73%), Positives = 193/234 (82%), Gaps = 10/234 (4%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V DTDEWPSKYSK F++LDDGLELSFTDKRRFA+VRLL DP SVPPISELGPDALLEPMT Sbjct: 97 VKDTDEWPSKYSKFFVELDDGLELSFTDKRRFAKVRLLNDPTSVPPISELGPDALLEPMT 156 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 +++F +SLSKKKI +KALLLDQSYI+GIGNWIADEVLYQA+IHPLQ A+SL+K+SC +L Sbjct: 157 VDEFTDSLSKKKITIKALLLDQSYISGIGNWIADEVLYQAKIHPLQTAASLSKKSCATLL 216 Query: 946 KCIKEVIEKAVEVGADSSQFPENWIFHSREKKPGKAFVDGKIIEFITAGGRTTAYVPELQ 767 KCIKEVIEKA+EVGADSSQFP NWIFHSREKKPGKAFVDGK I+FITAGGRTTAYVPELQ Sbjct: 217 KCIKEVIEKALEVGADSSQFPSNWIFHSREKKPGKAFVDGKKIDFITAGGRTTAYVPELQ 276 Query: 766 KLPGDQTGKELAKPKKQSPDGD----------GXXXXXXXXEILKKNKSKKGQK 635 KL G Q K + KP+KQ P G+ G EI + KSKK QK Sbjct: 277 KLNGVQAAKAVGKPRKQVPKGEDSKDDDKYNSGDESESDGEEIAENVKSKKRQK 330 >ref|XP_007023286.1| MUTM-1 isoform 2 [Theobroma cacao] gi|508778652|gb|EOY25908.1| MUTM-1 isoform 2 [Theobroma cacao] Length = 409 Score = 323 bits (829), Expect = 8e-86 Identities = 158/203 (77%), Positives = 178/203 (87%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V D DEWPSKYSK F++L+DGLELSFTDKRRFARVRLLKDP SVPPISELGPDAL +PMT Sbjct: 97 
VKDNDEWPSKYSKFFVELEDGLELSFTDKRRFARVRLLKDPTSVPPISELGPDALFQPMT 156 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 +++F ESL+KKKI +KALLLDQS+I+GIGNWIADEVLYQARIHPLQ +SSL+KE+C +L Sbjct: 157 VDEFTESLNKKKIAIKALLLDQSFISGIGNWIADEVLYQARIHPLQISSSLSKENCATLL 216 Query: 946 KCIKEVIEKAVEVGADSSQFPENWIFHSREKKPGKAFVDGKIIEFITAGGRTTAYVPELQ 767 +CI EVIEKAVEVGADSSQFP NWIFHSREKKPGKAFVDGK I+FI AGGRT+AYVPELQ Sbjct: 217 QCINEVIEKAVEVGADSSQFPSNWIFHSREKKPGKAFVDGKKIDFINAGGRTSAYVPELQ 276 Query: 766 KLPGDQTGKELAKPKKQSPDGDG 698 KL G Q K KP+KQ+ G Sbjct: 277 KLSGKQATKAAGKPRKQASKRKG 299 >ref|XP_003542122.1| PREDICTED: formamidopyrimidine-DNA glycosylase-like isoform X1 [Glycine max] Length = 399 Score = 323 bits (829), Expect = 8e-86 Identities = 168/230 (73%), Positives = 182/230 (79%), Gaps = 6/230 (2%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V D DEWPSKYSK+FI+LDDGLELSFTDKRRFA+VRLLKDP SVPPISELGPDAL EPMT Sbjct: 97 VKDEDEWPSKYSKIFIELDDGLELSFTDKRRFAKVRLLKDPTSVPPISELGPDALFEPMT 156 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 L F ESL KKK +KALLLDQS+I+GIGNW+ADEVLYQARIHP Q ASSL+ ESC +L Sbjct: 157 LEKFTESLHKKKTEIKALLLDQSFISGIGNWVADEVLYQARIHPRQVASSLSNESCSNLS 216 Query: 946 KCIKEVIEKAVEVGADSSQFPENWIFHSREKKPGKAFVDGKIIEFITAGGRTTAYVPELQ 767 KCIKEVIEKA+EVGA+SSQ+P NWIFHSREKKPGKAFVDGK I+FITAGGRTTAYVPELQ Sbjct: 217 KCIKEVIEKAIEVGAESSQYPTNWIFHSREKKPGKAFVDGKKIDFITAGGRTTAYVPELQ 276 Query: 766 KLPGDQTGKELAKP------KKQSPDGDGXXXXXXXXEILKKNKSKKGQK 635 KL G KE KP KK D D + L KSKKG K Sbjct: 277 KLSGSLDVKETGKPNKRQASKKVRVDDDTEKPTNGEVDDLGSVKSKKGTK 326 >ref|XP_004486649.1| PREDICTED: formamidopyrimidine-DNA glycosylase-like isoform X1 [Cicer arietinum] Length = 403 Score = 323 bits (828), Expect = 1e-85 Identities = 158/198 (79%), Positives = 176/198 (88%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V+D DEWPSK+SK FI+L+DGLE+SFTDKRRFARVRLLKDP SVPPISELGPDAL EPMT Sbjct: 97 
VNDKDEWPSKHSKFFIQLNDGLEMSFTDKRRFARVRLLKDPTSVPPISELGPDALFEPMT 156 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 L+DF E L KKK +KALLLDQSYI+GIGNW+ADEVLYQARIHP Q AS+L+ E C +LH Sbjct: 157 LDDFTERLHKKKTEIKALLLDQSYISGIGNWVADEVLYQARIHPRQTASTLSGEGCSTLH 216 Query: 946 KCIKEVIEKAVEVGADSSQFPENWIFHSREKKPGKAFVDGKIIEFITAGGRTTAYVPELQ 767 KCIKEVIEKAVEVGADSSQ+P NWIFHSREKKPGKAF+DGK I+FITAGGRTTAYVPELQ Sbjct: 217 KCIKEVIEKAVEVGADSSQYPTNWIFHSREKKPGKAFIDGKNIDFITAGGRTTAYVPELQ 276 Query: 766 KLPGDQTGKELAKPKKQS 713 KL G Q KE +KP+ +S Sbjct: 277 KLSGSQELKENSKPRGKS 294 >gb|AAC97952.1| putative formamidopyrimidine-DNA glycosylase 1 [Arabidopsis thaliana] Length = 390 Score = 323 bits (827), Expect = 1e-85 Identities = 172/265 (64%), Positives = 194/265 (73%), Gaps = 8/265 (3%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V D++EWPSKYSK F++LDDGLELSFTDKRRFA+VRLL +P SV PISELGPDALLEPMT Sbjct: 97 VKDSEEWPSKYSKFFVELDDGLELSFTDKRRFAKVRLLANPTSVSPISELGPDALLEPMT 156 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 +++F ESL+KKKI +K LLLDQ YI+GIGNWIADEVLYQARIHPLQ ASSL+KE CE+LH Sbjct: 157 VDEFAESLAKKKITIKPLLLDQGYISGIGNWIADEVLYQARIHPLQTASSLSKEQCEALH 216 Query: 946 KCIKEVIEKAVEVGADSSQFPENWIFHSREKKPGKAFVDGKIIEFITAGGRTTAYVPELQ 767 IKEVIEKAVEV ADSSQFP NWIFH+REKKPGKAFVDGK I+FITAGGRTTAYVPELQ Sbjct: 217 TSIKEVIEKAVEVDADSSQFPSNWIFHNREKKPGKAFVDGKKIDFITAGGRTTAYVPELQ 276 Query: 766 KLPGD--------QTGKELAKPKKQSPDGDGXXXXXXXXEILKKNKSKKGQKTATXXXXX 611 KL G + K KPK+ DGDG + + KSKKGQK Sbjct: 277 KLYGKDAEKAAKVRPAKRGVKPKED--DGDGEEDEQETEKEDESAKSKKGQKPRGGRGKK 334 Query: 610 XXXXXXXXGRRDSGDDGEQAKKKTK 536 D GDD E ++ K Sbjct: 335 PASKTKTEESDDDGDDSEAEEEVVK 359 >ref|XP_006856551.1| hypothetical protein AMTR_s00046p00171520 [Amborella trichopoda] gi|548860432|gb|ERN18018.1| hypothetical protein AMTR_s00046p00171520 [Amborella trichopoda] Length = 385 Score = 322 bits (824), Expect = 3e-85 Identities = 158/205 (77%), Positives = 182/205 
(88%), Gaps = 3/205 (1%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V+DTDEWPSKYSK+FI+LDDGLELSFTDKRRFARVRLL+DP SVPPISELGPDALLEPMT Sbjct: 97 VNDTDEWPSKYSKVFIELDDGLELSFTDKRRFARVRLLQDPTSVPPISELGPDALLEPMT 156 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 ++F SL+KKK+G+KALLLDQSYI+GIGNW+ADEVLYQARIHPLQ A+SL+KESC +LH Sbjct: 157 ADEFANSLNKKKLGIKALLLDQSYISGIGNWVADEVLYQARIHPLQHATSLSKESCVTLH 216 Query: 946 KCIKEVIEKAVEVGADSSQFPENWIFHSREKKPGKAFVDGKIIEFITAGGRTTAYVPELQ 767 K I EVI KA+EVGADSSQFP+NW+FH REKKPGKAFVDGK IEFITAGGRT+A+VPELQ Sbjct: 217 KSINEVIHKALEVGADSSQFPKNWLFHYREKKPGKAFVDGKRIEFITAGGRTSAFVPELQ 276 Query: 766 KLPG---DQTGKELAKPKKQSPDGD 701 KL G ++ K+ PKK + D + Sbjct: 277 KLSGAAAEKVRKKTTNPKKVNEDDE 301 >ref|NP_564608.1| formamidopyrimidine-DNA glycosylase [Arabidopsis thaliana] gi|75099732|sp|O80358.1|FPG_ARATH RecName: Full=Formamidopyrimidine-DNA glycosylase; Short=Fapy-DNA glycosylase; AltName: Full=DNA-(apurinic or apyrimidinic site) lyase FPG1; AltName: Full=Formamidopyrimidine-DNA glycosylase 1; Short=AtFPG-1; AltName: Full=Formamidopyrimidine-DNA glycosylase 2; Short=AtFPG-2; AltName: Full=Protein MutM homolog 1; Short=AtMMH-1; AltName: Full=Protein MutM homolog 2; Short=AtMMH-2 gi|5903053|gb|AAD55612.1|AC008016_22 Identical to gb|AB010690 mutM homologue-1 (formamidopyrimidine-DNA glycosylase 1) from Arabidopsis thaliana. 
EST gb|Z18192 comes from this gene [Arabidopsis thaliana] gi|3550982|dbj|BAA32702.1| AtMMH-1 [Arabidopsis thaliana] gi|195947437|gb|ACG58696.1| At1g52500 [Arabidopsis thaliana] gi|332194693|gb|AEE32814.1| formamidopyrimidine-DNA glycosylase [Arabidopsis thaliana] Length = 390 Score = 320 bits (819), Expect = 1e-84 Identities = 171/265 (64%), Positives = 193/265 (72%), Gaps = 8/265 (3%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V D++EWPSKYSK F++LDDGLELSFTDKRRFA+VRLL +P SV PISELGPDALLEPMT Sbjct: 97 VKDSEEWPSKYSKFFVELDDGLELSFTDKRRFAKVRLLANPTSVSPISELGPDALLEPMT 156 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 +++F ESL+KKKI +K LLLDQ YI+GIGNWIADEVLYQARIHPLQ ASSL+KE CE+LH Sbjct: 157 VDEFAESLAKKKITIKPLLLDQGYISGIGNWIADEVLYQARIHPLQTASSLSKEQCEALH 216 Query: 946 KCIKEVIEKAVEVGADSSQFPENWIFHSREKKPGKAFVDGKIIEFITAGGRTTAYVPELQ 767 IKEVIEKAVEV ADSSQFP WIFH+REKKPGKAFVDGK I+FITAGGRTTAYVPELQ Sbjct: 217 TSIKEVIEKAVEVDADSSQFPSYWIFHNREKKPGKAFVDGKKIDFITAGGRTTAYVPELQ 276 Query: 766 KLPGD--------QTGKELAKPKKQSPDGDGXXXXXXXXEILKKNKSKKGQKTATXXXXX 611 KL G + K KPK+ DGDG + + KSKKGQK Sbjct: 277 KLYGKDAEKAAKVRPAKRGVKPKED--DGDGEEDEQETEKEDESAKSKKGQKPRGGRGKK 334 Query: 610 XXXXXXXXGRRDSGDDGEQAKKKTK 536 D GDD E ++ K Sbjct: 335 PASKTKTEESDDDGDDSEAEEEVVK 359 >ref|XP_006385295.1| hypothetical protein POPTR_0003s02540g [Populus trichocarpa] gi|550342236|gb|ERP63092.1| hypothetical protein POPTR_0003s02540g [Populus trichocarpa] Length = 407 Score = 319 bits (818), Expect = 1e-84 Identities = 158/206 (76%), Positives = 180/206 (87%), Gaps = 4/206 (1%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V+D+DEWPSKYSK F++LDDGLELSFTDKRRFA+VRLL+DPAS PPISELGPDALLEPMT Sbjct: 97 VNDSDEWPSKYSKFFVQLDDGLELSFTDKRRFAKVRLLEDPASKPPISELGPDALLEPMT 156 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 +++ SLSKKK+ +KALLLDQS+++GIGNWIADEVLYQARIHPLQ ASSL++ES +LH Sbjct: 157 
VDELHGSLSKKKVAIKALLLDQSFVSGIGNWIADEVLYQARIHPLQIASSLSRESSATLH 216 Query: 946 KCIKEVIEKAVEVGADSSQFPENWIFHSREKKPGKAFVDGKIIEFITAGGRTTAYVPELQ 767 KCIKEVIEKAVEVGADSSQFP NWIFHSREKK K F+DGK I+FI AGGRTTAYVP LQ Sbjct: 217 KCIKEVIEKAVEVGADSSQFPNNWIFHSREKKSKKTFIDGKEIDFIVAGGRTTAYVPGLQ 276 Query: 766 KLPGDQTGKELAKPK----KQSPDGD 701 KL G+Q GK + KPK K+ DGD Sbjct: 277 KLNGNQAGKAVGKPKARTSKKKRDGD 302 >ref|XP_007215481.1| hypothetical protein PRUPE_ppa006603mg [Prunus persica] gi|462411631|gb|EMJ16680.1| hypothetical protein PRUPE_ppa006603mg [Prunus persica] Length = 403 Score = 319 bits (818), Expect = 1e-84 Identities = 173/280 (61%), Positives = 195/280 (69%), Gaps = 19/280 (6%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V DTDEWPSKYSKLF++LDDGLE SFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT Sbjct: 97 VKDTDEWPSKYSKLFVELDDGLEFSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 156 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 ++ ESLSKKKI +K LLLDQSYI+GIGNW+ADEVLYQARIHP Q A+SL+KE+ +LH Sbjct: 157 GDELFESLSKKKIAIKTLLLDQSYISGIGNWVADEVLYQARIHPEQSAASLSKENYGNLH 216 Query: 946 KCIKEVIEKAVEVGADSSQFPENWIFHSREKKPGKAFVDGKIIEFITAGGRTTAYVPELQ 767 K IKEVIEK++EVGADSSQFP NWIFHSREKKPGKAFVDG+ I+FIT GGRTTAYVPELQ Sbjct: 217 KSIKEVIEKSLEVGADSSQFPSNWIFHSREKKPGKAFVDGRKIDFITVGGRTTAYVPELQ 276 Query: 766 KLPGDQTGKELAK-PKKQSPDGDGXXXXXXXXEI---------LKKNKSKKGQKTATXXX 617 KL G Q + +K K+ GDG KK + +GQ + Sbjct: 277 KLSGQQAARAGSKQANKRKGHGDGVKDDVNEAASDEEVNGSVQSKKGRKPRGQGNKSSAK 336 Query: 616 XXXXXXXXXXGRRDSGDD---------GEQAKKKTKVTTN 524 DS DD E K KT+ TN Sbjct: 337 RKSKESDDEDNANDSEDDDDDDNDDHHDEDQKNKTRKVTN 376 >ref|XP_006595167.1| PREDICTED: formamidopyrimidine-DNA glycosylase-like isoform X2 [Glycine max] Length = 400 Score = 319 bits (817), Expect = 2e-84 Identities = 168/231 (72%), Positives = 182/231 (78%), Gaps = 7/231 (3%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V D DEWPSKYSK+FI+LDDGLELSFTDKRRFA+VRLLKDP SVPPISELGPDAL 
EPMT Sbjct: 97 VKDEDEWPSKYSKIFIELDDGLELSFTDKRRFAKVRLLKDPTSVPPISELGPDALFEPMT 156 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 L F ESL KKK +KALLLDQS+I+GIGNW+ADEVLYQARIHP Q ASSL+ ESC +L Sbjct: 157 LEKFTESLHKKKTEIKALLLDQSFISGIGNWVADEVLYQARIHPRQVASSLSNESCSNLS 216 Query: 946 KCIKEVIEKAVEVGADSSQFPENWIFHSREKKPGKAFVD-GKIIEFITAGGRTTAYVPEL 770 KCIKEVIEKA+EVGA+SSQ+P NWIFHSREKKPGKAFVD GK I+FITAGGRTTAYVPEL Sbjct: 217 KCIKEVIEKAIEVGAESSQYPTNWIFHSREKKPGKAFVDAGKKIDFITAGGRTTAYVPEL 276 Query: 769 QKLPGDQTGKELAKP------KKQSPDGDGXXXXXXXXEILKKNKSKKGQK 635 QKL G KE KP KK D D + L KSKKG K Sbjct: 277 QKLSGSLDVKETGKPNKRQASKKVRVDDDTEKPTNGEVDDLGSVKSKKGTK 327 >ref|XP_007150770.1| hypothetical protein PHAVU_005G1793001g [Phaseolus vulgaris] gi|561024034|gb|ESW22764.1| hypothetical protein PHAVU_005G1793001g [Phaseolus vulgaris] Length = 313 Score = 318 bits (816), Expect = 3e-84 Identities = 170/266 (63%), Positives = 187/266 (70%), Gaps = 5/266 (1%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V D DEWPSKYSK FI+LDDGLELSFTDKRRFA+VRLLKDP SVPPISELGPDAL EPMT Sbjct: 20 VKDEDEWPSKYSKFFIELDDGLELSFTDKRRFAKVRLLKDPTSVPPISELGPDALFEPMT 79 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 L F ESL K+K +KALLLDQSYI+GIGNW+ADEVLYQARIHP Q ASSL+ SC +L+ Sbjct: 80 LEKFTESLHKRKTEIKALLLDQSYISGIGNWVADEVLYQARIHPRQAASSLSDASCSTLY 139 Query: 946 KCIKEVIEKAVEVGADSSQFPENWIFHSREKKPGKAFVDGKIIEFITAGGRTTAYVPELQ 767 K I+EVIEKAVEVGADS+Q+P +WIFHSREKKP KAFVDG I+FITAGGRTTAYVPELQ Sbjct: 140 KSIEEVIEKAVEVGADSNQYPNSWIFHSREKKPDKAFVDGNKIDFITAGGRTTAYVPELQ 199 Query: 766 KLPGDQTGKELAKPKKQ-----SPDGDGXXXXXXXXEILKKNKSKKGQKTATXXXXXXXX 602 KL G KE KPK+Q S D D L KSKKG K Sbjct: 200 KLSGSIDVKETGKPKRQASKKVSGDDDTEKPTDGEEGDLGNVKSKKGAKAGVKGRKPAIK 259 Query: 601 XXXXXGRRDSGDDGEQAKKKTKVTTN 524 D+ D + KK TN Sbjct: 260 KKSEESDEDNDSDAQVEKKNPGNVTN 285 >ref|XP_007023285.1| MUTM-1 isoform 1 [Theobroma cacao] gi|508778651|gb|EOY25907.1| MUTM-1 isoform 1 
[Theobroma cacao] Length = 416 Score = 317 bits (811), Expect = 1e-83 Identities = 158/210 (75%), Positives = 178/210 (84%), Gaps = 7/210 (3%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V D DEWPSKYSK F++L+DGLELSFTDKRRFARVRLLKDP SVPPISELGPDAL +PMT Sbjct: 97 VKDNDEWPSKYSKFFVELEDGLELSFTDKRRFARVRLLKDPTSVPPISELGPDALFQPMT 156 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 +++F ESL+KKKI +KALLLDQS+I+GIGNWIADEVLYQARIHPLQ +SSL+KE+C +L Sbjct: 157 VDEFTESLNKKKIAIKALLLDQSFISGIGNWIADEVLYQARIHPLQISSSLSKENCATLL 216 Query: 946 KCIK-------EVIEKAVEVGADSSQFPENWIFHSREKKPGKAFVDGKIIEFITAGGRTT 788 +CI EVIEKAVEVGADSSQFP NWIFHSREKKPGKAFVDGK I+FI AGGRT+ Sbjct: 217 QCINEVIRYAVEVIEKAVEVGADSSQFPSNWIFHSREKKPGKAFVDGKKIDFINAGGRTS 276 Query: 787 AYVPELQKLPGDQTGKELAKPKKQSPDGDG 698 AYVPELQKL G Q K KP+KQ+ G Sbjct: 277 AYVPELQKLSGKQATKAAGKPRKQASKRKG 306 >gb|EXB67257.1| Formamidopyrimidine-DNA glycosylase [Morus notabilis] Length = 556 Score = 315 bits (807), Expect = 3e-83 Identities = 166/257 (64%), Positives = 190/257 (73%), Gaps = 33/257 (12%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V D +EWPSKYSK+FI+LDDG+ELSFTDKRRFA+VRLLKDP SVPPISELGPDALLEPMT Sbjct: 97 VKDDEEWPSKYSKVFIELDDGMELSFTDKRRFAKVRLLKDPTSVPPISELGPDALLEPMT 156 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 +++F SLSKKKI +KALLLDQSYI+GIGNWIADEVLYQA++HPLQ A++L+KESC +L Sbjct: 157 VDEFAASLSKKKIAIKALLLDQSYISGIGNWIADEVLYQAKVHPLQVAATLSKESCATLQ 216 Query: 946 KCIKEVIEKAVEVGADSSQFPENWIFHSREKKPGKAFVD--------------------- 830 KCIKEVIEKAVEVGADSSQ+P NWIFH+REKKPGKAFVD Sbjct: 217 KCIKEVIEKAVEVGADSSQYPNNWIFHAREKKPGKAFVDGLAPDPYVINLIPYLELIILH 276 Query: 829 -----GKIIEFITAGGRTTAYVPELQKLPGDQTGKELAKPKKQS-------PDGDGXXXX 686 GK IEFITAGGRTTA+VPELQKL G Q K ++K KQS +GD Sbjct: 277 PIGLSGKKIEFITAGGRTTAFVPELQKLSGSQAAKAVSKQGKQSNRRKGRQDEGDKDEQE 336 Query: 685 XXXXEILKKNKSKKGQK 635 +I +K KK K Sbjct: 337 IDEGDIAEKTTRKKEMK 
353 >ref|XP_006304948.1| hypothetical protein CARUB_v10011435mg [Capsella rubella] gi|482573659|gb|EOA37846.1| hypothetical protein CARUB_v10011435mg [Capsella rubella] Length = 396 Score = 311 bits (796), Expect = 5e-82 Identities = 162/231 (70%), Positives = 181/231 (78%), Gaps = 6/231 (2%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V D++EWPSKYSK F++LDDGLELSFTDKRRFA+VRLL +P SV PISELGPDALLEPMT Sbjct: 97 VKDSEEWPSKYSKFFVELDDGLELSFTDKRRFAKVRLLANPTSVRPISELGPDALLEPMT 156 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 +++F ESL+KKKI +K LLLDQ +I+GIGNWIADEVLYQARIHPLQ ASSL+KE CE+LH Sbjct: 157 VDEFAESLAKKKITIKPLLLDQGFISGIGNWIADEVLYQARIHPLQTASSLSKEQCEALH 216 Query: 946 KCIKEVIEKAVEVGADSSQFPENWIFHSREKKPGKAFVDGKIIEFITAGGRTTAYVPELQ 767 I EVIEKAVEV ADSSQFP NWIFH REKKPGKAFVDGK I FITAGGRTTAYVPELQ Sbjct: 217 TSITEVIEKAVEVDADSSQFPSNWIFHDREKKPGKAFVDGKKINFITAGGRTTAYVPELQ 276 Query: 766 KLPGDQTGKEL-AKP-----KKQSPDGDGXXXXXXXXEILKKNKSKKGQKT 632 KL G K +P K + DGDG + K KKGQK+ Sbjct: 277 KLSGKDAEKAAKVRPGKRGVKSKEDDGDGEEDEQESEKEDGSAKLKKGQKS 327 >ref|XP_002517673.1| formamidopyrimidine-DNA glycosylase, putative [Ricinus communis] gi|223543305|gb|EEF44837.1| formamidopyrimidine-DNA glycosylase, putative [Ricinus communis] Length = 403 Score = 310 bits (794), Expect = 9e-82 Identities = 150/189 (79%), Positives = 171/189 (90%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V+DTDEWPSKYSKLF++LDDGLELSFTDKRRFA+VRLL +P SVPPISELGPDALL+PM Sbjct: 97 VNDTDEWPSKYSKLFVELDDGLELSFTDKRRFAKVRLLNNPVSVPPISELGPDALLQPMA 156 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 +++F +SL KKK+ +KALLLDQS+I+GIGNWIADEVLYQARIHP Q ASS KESC +L Sbjct: 157 VDEFYKSLCKKKMPIKALLLDQSFISGIGNWIADEVLYQARIHPQQSASSFTKESCATLL 216 Query: 946 KCIKEVIEKAVEVGADSSQFPENWIFHSREKKPGKAFVDGKIIEFITAGGRTTAYVPELQ 767 KCIKEVIEKA+EV ADSSQFP +WIFHSREKKPGKAF+DGK I+FIT+GGRTTAYVPELQ Sbjct: 217 
KCIKEVIEKAIEVEADSSQFPNSWIFHSREKKPGKAFIDGKKIDFITSGGRTTAYVPELQ 276 Query: 766 KLPGDQTGK 740 KL G+Q K Sbjct: 277 KLSGNQISK 285 >pdb|3TWL|A Chain A, Crystal Structure Of Arabidopsis Thaliana Fpg gi|400261074|pdb|3TWM|A Chain A, Crystal Structure Of Arabidopsis Thaliana Fpg gi|400261075|pdb|3TWM|B Chain B, Crystal Structure Of Arabidopsis Thaliana Fpg Length = 310 Score = 309 bits (792), Expect = 2e-81 Identities = 158/210 (75%), Positives = 176/210 (83%), Gaps = 8/210 (3%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V D++EWPSKYSK F++LDDGLELSFTDKRRFA+VRLL +P SV PISELGPDALLEPMT Sbjct: 97 VKDSEEWPSKYSKFFVELDDGLELSFTDKRRFAKVRLLANPTSVSPISELGPDALLEPMT 156 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 +++F ESL+KKKI +K LLLDQ YI+GIGNWIADEVLYQARIHPLQ ASSL+KE CE+LH Sbjct: 157 VDEFAESLAKKKITIKPLLLDQGYISGIGNWIADEVLYQARIHPLQTASSLSKEQCEALH 216 Query: 946 KCIKEVIEKAVEVGADSSQFPENWIFHSREKKPGKAFVDGKIIEFITAGGRTTAYVPELQ 767 IKEVIEKAVEV ADSSQFP NWIFH+REKKPGKAFVDGK I+FITAGGRTTAYVPELQ Sbjct: 217 TSIKEVIEKAVEVDADSSQFPSNWIFHNREKKPGKAFVDGKKIDFITAGGRTTAYVPELQ 276 Query: 766 KLPGD--------QTGKELAKPKKQSPDGD 701 KL G + K KPK+ DGD Sbjct: 277 KLYGKDAEKAAKVRPAKRGVKPKED--DGD 304 >ref|XP_003597926.1| Formamidopyrimidine-DNA glycosylase [Medicago truncatula] gi|355486974|gb|AES68177.1| Formamidopyrimidine-DNA glycosylase [Medicago truncatula] Length = 424 Score = 307 bits (787), Expect = 6e-81 Identities = 176/293 (60%), Positives = 193/293 (65%), Gaps = 38/293 (12%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V+D DEWPSKYSK FI+LDDGLELSFTDKRRFARVRLLKDP SVPPISELGPDAL + MT Sbjct: 97 VNDEDEWPSKYSKFFIQLDDGLELSFTDKRRFARVRLLKDPTSVPPISELGPDALFDFMT 156 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 L++F E L KKK +KALLLDQSYI+GIGNW+ADEVLYQARIHP Q ASSL+ ESC +L+ Sbjct: 157 LDEFTERLHKKKTEIKALLLDQSYISGIGNWVADEVLYQARIHPRQIASSLSGESCSTLY 216 Query: 946 
KCIKEVIEKA----------------------------VEVGADSSQFPENWIFHSREKK 851 KCIKEVI+ A VEVGADSSQ+P NWIFHSREKK Sbjct: 217 KCIKEVIQFAVEVDADCSRFPLEWLFHFRWGKKPGKISVEVGADSSQYPTNWIFHSREKK 276 Query: 850 PGKAFVDGKIIEFITAGGRTTAYVPELQKLPGDQTGKELAK-----PKKQSPDGDGXXXX 686 PGKAFVDGK IEFITAGGRTTAYVPELQKL G Q KE K KK S D D Sbjct: 277 PGKAFVDGKTIEFITAGGRTTAYVPELQKLSGSQVLKETGKLRGKASKKSSVDDDNNDGA 336 Query: 685 XXXXEILKKNK-SKKGQKTATXXXXXXXXXXXXXGRRDSG----DDGEQAKKK 542 E LK K +K G K D+G DD +Q +KK Sbjct: 337 DENLESLKSKKGTKAGAKAKKPSKRKKTEESDDDNDGDAGTDNYDDSDQVEKK 389 >ref|XP_006392913.1| hypothetical protein EUTSA_v10011553mg [Eutrema salsugineum] gi|557089491|gb|ESQ30199.1| hypothetical protein EUTSA_v10011553mg [Eutrema salsugineum] Length = 397 Score = 306 bits (785), Expect = 1e-80 Identities = 169/274 (61%), Positives = 193/274 (70%), Gaps = 17/274 (6%) Frame = -3 Query: 1306 VSDTDEWPSKYSKLFIKLDDGLELSFTDKRRFARVRLLKDPASVPPISELGPDALLEPMT 1127 V D++EWPSKYSK F++LDDGLELSFTDKRRFA+VRLL++PASV PISELGPDALLEP+T Sbjct: 97 VKDSEEWPSKYSKFFVELDDGLELSFTDKRRFAKVRLLENPASVRPISELGPDALLEPLT 156 Query: 1126 LNDFMESLSKKKIGMKALLLDQSYIAGIGNWIADEVLYQARIHPLQPASSLAKESCESLH 947 +++ +SL+KKKI +K LLLDQ +I+GIGNWIADEVLYQARIHPLQ ASSL+KE CE+LH Sbjct: 157 IDELAKSLAKKKITIKPLLLDQGFISGIGNWIADEVLYQARIHPLQTASSLSKEQCEALH 216 Query: 946 KCIKEVIEKAVEVGADSSQFPENWIFHSREKKPGKAFVDGKIIEFITAGGRTTAYVPELQ 767 IKEVIEKAVEV AD+SQFP WIFHSRE KPGKAFVDGK I+FITAGGRTTAYVPELQ Sbjct: 217 TSIKEVIEKAVEVDADTSQFPSIWIFHSREAKPGKAFVDGKKIDFITAGGRTTAYVPELQ 276 Query: 766 KLPGDQTGKELAKPKK----------QSPDGDGXXXXXXXXEILKKNKSKKGQ------- 638 KL TGK+ K K + DGDG E K KKGQ Sbjct: 277 KL----TGKDAEKATKVRAGKRGVNSKEDDGDGDEDEQESEEEDDSAKPKKGQKPKGRGK 332 Query: 637 KTATXXXXXXXXXXXXXGRRDSGDDGEQAKKKTK 536 K A+ D GDD E +K K Sbjct: 333 KPASKRKTEESDDEDDDAVADGGDDSEAEEKVIK 366