BLASTX nr result
ID: Cocculus22_contig00015804
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00015804
         (509 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006443060.1| hypothetical protein CICLE_v10020813mg [Citr...   293   2e-77
ref|XP_004507328.1| PREDICTED: endonuclease III-like protein 1-l...   291   6e-77
ref|XP_007131960.1| hypothetical protein PHAVU_011G055100g [Phas...   291   8e-77
ref|XP_007131959.1| hypothetical protein PHAVU_011G055100g [Phas...   291   8e-77
ref|XP_002534117.1| endonuclease III, putative [Ricinus communis...   291   8e-77
ref|XP_002264475.2| PREDICTED: endonuclease III-like protein 1-l...   290   1e-76
ref|XP_006494172.1| PREDICTED: endonuclease III-like protein 1-l...   289   2e-76
ref|XP_004241397.1| PREDICTED: endonuclease III-like protein 1-l...   287   9e-76
ref|XP_006347463.1| PREDICTED: endonuclease III-like protein 1-l...   287   1e-75
ref|XP_007222713.1| hypothetical protein PRUPE_ppa009900mg [Prun...   287   1e-75
ref|XP_003539044.2| PREDICTED: endonuclease III-like protein 1-l...   286   2e-75
ref|XP_007034069.1| DNA glycosylase superfamily protein isoform ...   285   4e-75
ref|XP_007034067.1| DNA glycosylase superfamily protein isoform ...   285   4e-75
emb|CBI36652.3| unnamed protein product [Vitis vinifera]              285   5e-75
ref|XP_007034070.1| DNA glycosylase superfamily protein isoform ...   280   1e-73
ref|XP_007034068.1| DNA glycosylase superfamily protein isoform ...   280   1e-73
ref|XP_004309826.1| PREDICTED: endonuclease III-like protein 1-l...   280   2e-73
ref|XP_006410282.1| hypothetical protein EUTSA_v10016815mg [Eutr...   278   4e-73
gb|AAM61598.1| putative endonuclease [Arabidopsis thaliana]           277   1e-72
ref|NP_565725.1| protein NTH1 [Arabidopsis thaliana] gi|75206080...   277   1e-72

>ref|XP_006443060.1| hypothetical protein CICLE_v10020813mg [Citrus clementina]
 gi|557545322|gb|ESR56300.1| hypothetical protein CICLE_v10020813mg [Citrus clementina]
          Length = 357

 Score = 293 bits (750), Expect = 2e-77
 Identities = 142/169 (84%), Positives = 153/169 (90%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 329
           L+GIRKMR+SEDAPVDSMGCEKAGS LPP+ERRFAVL+SSLLSSQTKD VTHGAIQRL Q
Sbjct: 122 LEGIRKMRTSEDAPVDSMGCEKAGSSLPPRERRFAVLISSLLSSQTKDNVTHGAIQRLLQ 181

Query: 328 NGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLLLLP 149
           NG+L + ID DE TIK LIYPVGFYTRKA N+KKIA ICLTKY GDIPS+L++LLLLP
Sbjct: 182 NGLLTAEAIDKADEATIKDLIYPVGFYTRKASNMKKIAPICLTKYDGDIPSSLDELLLLP 241

Query: 148 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQKTSSP 2
           GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRL WVS+PG QKTSSP
Sbjct: 242 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLGWVSQPGRKQKTSSP 290

>ref|XP_004507328.1| PREDICTED: endonuclease III-like protein 1-like [Cicer arietinum]
          Length = 387

 Score = 291 bits (745), Expect = 6e-77
 Identities = 144/169 (85%), Positives = 150/169 (88%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 329
           L+GIRKMR S DAPVD+MGCEKAGS LPPKERRFAVLVSSLLSSQTKD V HGAIQRL Q
Sbjct: 147 LEGIRKMRCSADAPVDTMGCEKAGSTLPPKERRFAVLVSSLLSSQTKDHVNHGAIQRLLQ 206

Query: 328 NGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLLLLP 149
           N +L PD I+N DEETIKKLIYPVGFYTRKA NLKKIA ICL KYGGDIPSTLE LLLLP
Sbjct: 207 NDLLTPDAINNADEETIKKLIYPVGFYTRKATNLKKIANICLMKYGGDIPSTLEQLLLLP 266

Query: 148 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQKTSSP 2
           GIGPKMAHLVMNV WNNVQGICVDTHVHRICNRL WVS+ GT QKT +P
Sbjct: 267 GIGPKMAHLVMNVAWNNVQGICVDTHVHRICNRLGWVSRLGTKQKTLTP 315

>ref|XP_007131960.1| hypothetical protein PHAVU_011G055100g [Phaseolus vulgaris]
 gi|561004960|gb|ESW03954.1| hypothetical protein PHAVU_011G055100g
[Phaseolus vulgaris]
          Length = 408

 Score = 291 bits (744), Expect = 8e-77
 Identities = 142/169 (84%), Positives = 152/169 (89%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 329
           L+GIRKMRSS DAPVD+MGCEKAG LPPKERRFAVLVSSLLSSQTKD VTHGAIQRL Q
Sbjct: 168 LEGIRKMRSSADAPVDTMGCEKAGDTLPPKERRFAVLVSSLLSSQTKDPVTHGAIQRLLQ 227

Query: 328 NGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLLLLP 149
           N +L P+ I+NVDEETIKKLIYPVGFYTRKA NLKKIA ICL KY GDIPS+++ LLLLP
Sbjct: 228 NDLLTPEAINNVDEETIKKLIYPVGFYTRKATNLKKIANICLMKYHGDIPSSIDQLLLLP 287

Query: 148 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQKTSSP 2
           GIGPKMAHLVMN GWNNVQGICVDTHVHRICNRL WVS+ GT QKTS+P
Sbjct: 288 GIGPKMAHLVMNAGWNNVQGICVDTHVHRICNRLGWVSRLGTNQKTSTP 336

>ref|XP_007131959.1| hypothetical protein PHAVU_011G055100g [Phaseolus vulgaris]
 gi|561004959|gb|ESW03953.1| hypothetical protein PHAVU_011G055100g [Phaseolus vulgaris]
          Length = 359

 Score = 291 bits (744), Expect = 8e-77
 Identities = 142/169 (84%), Positives = 152/169 (89%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 329
           L+GIRKMRSS DAPVD+MGCEKAG LPPKERRFAVLVSSLLSSQTKD VTHGAIQRL Q
Sbjct: 119 LEGIRKMRSSADAPVDTMGCEKAGDTLPPKERRFAVLVSSLLSSQTKDPVTHGAIQRLLQ 178

Query: 328 NGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLLLLP 149
           N +L P+ I+NVDEETIKKLIYPVGFYTRKA NLKKIA ICL KY GDIPS+++ LLLLP
Sbjct: 179 NDLLTPEAINNVDEETIKKLIYPVGFYTRKATNLKKIANICLMKYHGDIPSSIDQLLLLP 238

Query: 148 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQKTSSP 2
           GIGPKMAHLVMN GWNNVQGICVDTHVHRICNRL WVS+ GT QKTS+P
Sbjct: 239 GIGPKMAHLVMNAGWNNVQGICVDTHVHRICNRLGWVSRLGTNQKTSTP 287

>ref|XP_002534117.1| endonuclease III, putative [Ricinus communis]
 gi|223525829|gb|EEF28268.1| endonuclease III, putative [Ricinus communis]
          Length = 357

 Score = 291 bits (744), Expect = 8e-77
 Identities = 141/169 (83%), Positives = 151/169 (89%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 329
           L+GIRKMRSSEDAPVD+MGCEKAGSFLP KERRFAVLVSSL+SSQTKD VTHGA+QRLHQ
Sbjct: 119 LEGIRKMRSSEDAPVDTMGCEKAGSFLPSKERRFAVLVSSLMSSQTKDHVTHGAVQRLHQ 178

Query: 328 NGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLLLLP 149
           N +L D ID DE TIK LIYPVGFYTRKA NLKKIAKICL KY GDIP +LEDLL LP
Sbjct: 179 NSLLTADAIDKADETTIKDLIYPVGFYTRKASNLKKIAKICLMKYDGDIPRSLEDLLSLP 238

Query: 148 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQKTSSP 2
           GIGPKMAHLVMNV W++VQGICVDTHVHRICNRL WVS+PGT QKTS+P
Sbjct: 239 GIGPKMAHLVMNVAWDDVQGICVDTHVHRICNRLGWVSRPGTEQKTSNP 287

>ref|XP_002264475.2| PREDICTED: endonuclease III-like protein 1-like [Vitis vinifera]
          Length = 355

 Score = 290 bits (743), Expect = 1e-76
 Identities = 143/169 (84%), Positives = 151/169 (89%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 329
           L+GIRKMRSSEDAPVDSMGCEKAGS LPP+ERRFAVLVSSLLSSQTKD VTHGAIQRL Q
Sbjct: 117 LEGIRKMRSSEDAPVDSMGCEKAGSSLPPRERRFAVLVSSLLSSQTKDNVTHGAIQRLLQ 176

Query: 328 NGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLLLLP 149
           NG+L D ID DE T+K LIYPVGFY+RKA NLKKIAKICL KY GDIPS+LE+LLLLP
Sbjct: 177 NGLLVADAIDKADEATVKSLIYPVGFYSRKAGNLKKIAKICLMKYDGDIPSSLEELLLLP 236

Query: 148 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQKTSSP 2
           GIGPKMAHLVMNV WNNVQGICVDTHVHRICNRL WVS+ GT QKTS P
Sbjct: 237 GIGPKMAHLVMNVAWNNVQGICVDTHVHRICNRLGWVSRRGTKQKTSLP 285

>ref|XP_006494172.1| PREDICTED: endonuclease III-like protein 1-like [Citrus sinensis]
          Length = 357

 Score = 289 bits (740), Expect = 2e-76
 Identities = 141/169 (83%), Positives = 152/169 (89%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 329
           L+GIRKMR+SEDAPVDSMGCEKAGS LPP+ERRFAVL+SSLLSSQTKD VTHGAIQRL Q
Sbjct: 122 LEGIRKMRTSEDAPVDSMGCEKAGSSLPPRERRFAVLISSLLSSQTKDNVTHGAIQRLLQ 181

Query: 328 NGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLLLLP 149
           NG+L + ID DE TIK LIY VGFYTRKA N+KKIA ICLTKY GDIPS+L++LLLLP
Sbjct: 182 NGLLTAEAIDKADEATIKDLIYLVGFYTRKASNMKKIAPICLTKYDGDIPSSLDELLLLP 241

Query: 148 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQKTSSP 2
           GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRL WVS+PG QKTSSP
Sbjct: 242 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLGWVSQPGRKQKTSSP 290

>ref|XP_004241397.1| PREDICTED: endonuclease III-like protein 1-like [Solanum lycopersicum]
          Length = 380

 Score = 287 bits (735), Expect = 9e-76
 Identities = 140/169 (82%), Positives = 152/169 (89%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 329
           L+GIRKMRS+EDAPVDSMGCEKAGS LP KERRFAVLVSSLLSSQTKD V HGA+QRL Q
Sbjct: 149 LEGIRKMRSAEDAPVDSMGCEKAGSSLPAKERRFAVLVSSLLSSQTKDQVNHGAVQRLLQ 208

Query: 328 NGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLLLLP 149
           NG+LA D ID+ +EETIK LIYPVGFYTRKA NLKK+AKICL+KY GDIPS+LE+LLLLP
Sbjct: 209 NGLLAADAIDSANEETIKSLIYPVGFYTRKASNLKKVAKICLSKYNGDIPSSLEELLLLP 268

Query: 148 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQKTSSP 2
           GIGPKMAHLVMNV W NVQGICVDTHVHRI NRL WVS+PGT QKT +P
Sbjct: 269 GIGPKMAHLVMNVAWENVQGICVDTHVHRISNRLEWVSRPGTKQKTRTP 317

>ref|XP_006347463.1| PREDICTED: endonuclease III-like protein 1-like [Solanum tuberosum]
          Length = 422

 Score = 287 bits (734), Expect = 1e-75
 Identities = 141/169 (83%), Positives = 152/169 (89%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 329
           L+GIRKMRS+EDAPVDSMGCEKAGS LP KERRFAVLVSSLLSSQTKD V HGAIQRL Q
Sbjct: 191 LEGIRKMRSAEDAPVDSMGCEKAGSSLPAKERRFAVLVSSLLSSQTKDQVNHGAIQRLLQ 250

Query: 328 NGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLLLLP 149
           NG+LA D ID+ +EETIK LIYPVGFYTRKA NLKK+AKICL+KY GDIPS+LE+LLLLP
Sbjct: 251 NGLLAADAIDSANEETIKSLIYPVGFYTRKASNLKKVAKICLSKYNGDIPSSLEELLLLP 310

Query: 148 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQKTSSP 2
           GIGPKMAHLVMNV W NVQGICVDTHVHRI NRL WVS+PGT QKT +P
Sbjct: 311 GIGPKMAHLVMNVAWENVQGICVDTHVHRISNRLGWVSRPGTKQKTRTP 359

>ref|XP_007222713.1| hypothetical protein PRUPE_ppa009900mg [Prunus persica]
 gi|462419649|gb|EMJ23912.1| hypothetical protein PRUPE_ppa009900mg [Prunus persica]
          Length = 272

 Score = 287 bits (734), Expect = 1e-75
 Identities = 143/169 (84%), Positives = 151/169 (89%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 329
           L+GIRKMRSSEDAPVDSMGCEKAGS LPPKERRFAVLVSSLLSSQTKD VTHGAIQRL Q
Sbjct: 34  LEGIRKMRSSEDAPVDSMGCEKAGSALPPKERRFAVLVSSLLSSQTKDHVTHGAIQRLLQ 93

Query: 328 NGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLLLLP 149
           N +LA D ID +E TIK LIYPVGFYTRKA NLKKIAKICLTKY GDIPS+L++LL LP
Sbjct: 94  NNLLAADSIDKAEEATIKSLIYPVGFYTRKATNLKKIAKICLTKYDGDIPSSLDELLSLP 153

Query: 148 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQKTSSP 2
           GIGPKMAHLVMNVGWNNVQGICVDTHVHRI NRL WVS+ G QKTS+P
Sbjct: 154 GIGPKMAHLVMNVGWNNVQGICVDTHVHRISNRLGWVSREGRKQKTSNP 202

>ref|XP_003539044.2| PREDICTED: endonuclease III-like protein 1-like [Glycine max]
          Length = 357

 Score = 286 bits (732), Expect = 2e-75
 Identities = 141/169 (83%), Positives = 150/169 (88%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 329
           L+GIRKMR S DAPVD+MGCEKAG LPPKERRFAVLVSSLLSSQTKD VTHGAIQRL Q
Sbjct: 116 LEGIRKMRCSADAPVDTMGCEKAGETLPPKERRFAVLVSSLLSSQTKDPVTHGAIQRLLQ 175

Query: 328 NGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLLLLP 149
           N +L D I++ DEETIKKLIYPVGFYTRKA NLKKIA ICL KY GDIPS++E LLLLP
Sbjct: 176 NDLLTADAINDADEETIKKLIYPVGFYTRKASNLKKIANICLMKYDGDIPSSIEQLLLLP 235

Query: 148 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQKTSSP 2
           GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRL WVS+ GT QKTS+P
Sbjct: 236 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLGWVSRLGTKQKTSTP 284

>ref|XP_007034069.1| DNA glycosylase superfamily protein isoform 3 [Theobroma cacao]
 gi|508713098|gb|EOY04995.1| DNA glycosylase superfamily protein isoform 3 [Theobroma cacao]
          Length = 364

 Score = 285 bits (730), Expect = 4e-75
 Identities = 138/169 (81%), Positives = 151/169 (89%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 329
           L+GIRKMRS+EDAPVD+MGCEKAGS LPPKERRFAVL+SSLLSSQTKD VTHGAIQRL Q
Sbjct: 123 LEGIRKMRSAEDAPVDTMGCEKAGSVLPPKERRFAVLISSLLSSQTKDHVTHGAIQRLIQ 182

Query: 328 NGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLLLLP 149
           N ++ PD ID DE TIK LIYPVGFYTRKA N+KKIAKICL KY GDIPS+LE+LLLLP
Sbjct: 183 NCLMTPDAIDKADEATIKDLIYPVGFYTRKAINVKKIAKICLMKYDGDIPSSLEELLLLP 242

Query: 148 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQKTSSP 2
           GIGPKMAHLVMN+ W++VQGICVDTHVHRICNRL WVS+PGT QKT P
Sbjct: 243 GIGPKMAHLVMNIAWDDVQGICVDTHVHRICNRLGWVSRPGTKQKTLYP 291

>ref|XP_007034067.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao]
 gi|508713096|gb|EOY04993.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao]
          Length = 387

 Score = 285 bits (730), Expect = 4e-75
 Identities = 138/169 (81%), Positives = 151/169 (89%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 329
           L+GIRKMRS+EDAPVD+MGCEKAGS LPPKERRFAVL+SSLLSSQTKD VTHGAIQRL Q
Sbjct: 146 LEGIRKMRSAEDAPVDTMGCEKAGSVLPPKERRFAVLISSLLSSQTKDHVTHGAIQRLIQ 205

Query: 328 NGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLLLLP 149
           N ++ PD ID DE TIK LIYPVGFYTRKA N+KKIAKICL KY GDIPS+LE+LLLLP
Sbjct: 206 NCLMTPDAIDKADEATIKDLIYPVGFYTRKAINVKKIAKICLMKYDGDIPSSLEELLLLP 265

Query: 148 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQKTSSP 2
           GIGPKMAHLVMN+ W++VQGICVDTHVHRICNRL WVS+PGT QKT P
Sbjct: 266 GIGPKMAHLVMNIAWDDVQGICVDTHVHRICNRLGWVSRPGTKQKTLYP 314

>emb|CBI36652.3| unnamed protein product [Vitis vinifera]
          Length = 379

 Score = 285 bits (729), Expect = 5e-75
 Identities = 143/172 (83%), Positives = 151/172 (87%), Gaps = 3/172 (1%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHG---AIQR 338
           L+GIRKMRSSEDAPVDSMGCEKAGS LPP+ERRFAVLVSSLLSSQTKD VTHG AIQR
Sbjct: 138 LEGIRKMRSSEDAPVDSMGCEKAGSSLPPRERRFAVLVSSLLSSQTKDNVTHGNAGAIQR 197

Query: 337 LHQNGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLL 158
           L QNG+L D ID DE T+K LIYPVGFY+RKA NLKKIAKICL KY GDIPS+LE+LL
Sbjct: 198 LLQNGLLVADAIDKADEATVKSLIYPVGFYSRKAGNLKKIAKICLMKYDGDIPSSLEELL 257

Query: 157 LLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQKTSSP 2
           LLPGIGPKMAHLVMNV WNNVQGICVDTHVHRICNRL WVS+ GT QKTS P
Sbjct: 258 LLPGIGPKMAHLVMNVAWNNVQGICVDTHVHRICNRLGWVSRRGTKQKTSLP 309

>ref|XP_007034070.1| DNA glycosylase superfamily protein isoform 4 [Theobroma cacao]
 gi|508713099|gb|EOY04996.1| DNA glycosylase superfamily protein isoform 4 [Theobroma cacao]
          Length = 336

 Score = 280 bits (717), Expect = 1e-73
 Identities = 135/164 (82%), Positives = 148/164 (90%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 329
           L+GIRKMRS+EDAPVD+MGCEKAGS LPPKERRFAVL+SSLLSSQTKD VTHGAIQRL Q
Sbjct: 123 LEGIRKMRSAEDAPVDTMGCEKAGSVLPPKERRFAVLISSLLSSQTKDHVTHGAIQRLIQ 182

Query: 328 NGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLLLLP 149
           N ++ PD ID DE TIK LIYPVGFYTRKA N+KKIAKICL KY GDIPS+LE+LLLLP
Sbjct: 183 NCLMTPDAIDKADEATIKDLIYPVGFYTRKAINVKKIAKICLMKYDGDIPSSLEELLLLP 242

Query: 148 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQ 17
           GIGPKMAHLVMN+ W++VQGICVDTHVHRICNRL WVS+PGT Q
Sbjct: 243 GIGPKMAHLVMNIAWDDVQGICVDTHVHRICNRLGWVSRPGTKQ 286

>ref|XP_007034068.1| DNA glycosylase superfamily protein isoform 2 [Theobroma cacao]
 gi|508713097|gb|EOY04994.1| DNA glycosylase superfamily protein isoform 2 [Theobroma cacao]
          Length = 359

 Score = 280 bits (717), Expect = 1e-73
 Identities = 135/164 (82%), Positives = 148/164 (90%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 329
           L+GIRKMRS+EDAPVD+MGCEKAGS LPPKERRFAVL+SSLLSSQTKD VTHGAIQRL Q
Sbjct: 146 LEGIRKMRSAEDAPVDTMGCEKAGSVLPPKERRFAVLISSLLSSQTKDHVTHGAIQRLIQ 205

Query: 328 NGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLLLLP 149
           N ++ PD ID DE TIK LIYPVGFYTRKA N+KKIAKICL KY GDIPS+LE+LLLLP
Sbjct: 206 NCLMTPDAIDKADEATIKDLIYPVGFYTRKAINVKKIAKICLMKYDGDIPSSLEELLLLP 265

Query: 148 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQ 17
           GIGPKMAHLVMN+ W++VQGICVDTHVHRICNRL WVS+PGT Q
Sbjct: 266 GIGPKMAHLVMNIAWDDVQGICVDTHVHRICNRLGWVSRPGTKQ 309

>ref|XP_004309826.1| PREDICTED: endonuclease III-like protein 1-like [Fragaria vesca subsp.
vesca]
          Length = 341

 Score = 280 bits (715), Expect = 2e-73
 Identities = 140/169 (82%), Positives = 149/169 (88%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 329
           L+GIRKMRS+EDAPVDSMGCEKAGS LPPKERRFAVLVSSLLSSQTKD VTHGA+QRL Q
Sbjct: 104 LEGIRKMRSAEDAPVDSMGCEKAGSALPPKERRFAVLVSSLLSSQTKDQVTHGAVQRLLQ 163

Query: 328 NGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLLLLP 149
           NGML+ D ID DE TIK LIYPVGFYTRKA NLKKIA ICL KY GDIPS+LE+LL LP
Sbjct: 164 NGMLSADAIDKGDEPTIKSLIYPVGFYTRKASNLKKIANICLVKYDGDIPSSLEELLSLP 223

Query: 148 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQKTSSP 2
           GIGPKMAHLVMNV W+NVQGICVDTHVHRICNRL WV + G QKTS+P
Sbjct: 224 GIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWV-RAGKKQKTSNP 271

>ref|XP_006410282.1| hypothetical protein EUTSA_v10016815mg [Eutrema salsugineum]
 gi|557111451|gb|ESQ51735.1| hypothetical protein EUTSA_v10016815mg [Eutrema salsugineum]
          Length = 373

 Score = 278 bits (712), Expect = 4e-73
 Identities = 129/169 (76%), Positives = 150/169 (88%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 329
           L+GIR+MRSSEDAPVDSMGC+KAGSFLPP ERRFAVL+ +LLSSQTKD V + AI RLHQ
Sbjct: 135 LEGIRQMRSSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDEVNNAAIHRLHQ 194

Query: 328 NGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLLLLP 149
           NG+L P+ +D DE T+++LIYPVGFYTRKA +KKIAKICL KY GDIPS+L+DLL LP
Sbjct: 195 NGLLTPEAVDKADESTLRELIYPVGFYTRKATYMKKIAKICLVKYNGDIPSSLDDLLALP 254

Query: 148 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQKTSSP 2
           GIGPKMAHL++++ WN+VQGICVDTHVHRICNRL WVS+PGT QKTSSP
Sbjct: 255 GIGPKMAHLILHIAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTSSP 303

>gb|AAM61598.1| putative endonuclease [Arabidopsis thaliana]
          Length = 379

 Score = 277 bits (709), Expect = 1e-72
 Identities = 129/169 (76%), Positives = 150/169 (88%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 329
           L+GIR+MRSSEDAPVDSMGC+KAGSFLPP ERRFAVL+ +LLSSQTKD V + AI RLHQ
Sbjct: 141 LEGIRQMRSSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDQVNNAAIHRLHQ 200

Query: 328 NGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLLLLP 149
           NG+L P+ +D DE TIK+LIYPVGFYTRKA +KKIA+ICL KY GDIPS+L+DLL LP
Sbjct: 201 NGLLTPEAVDKADESTIKELIYPVGFYTRKATYMKKIARICLVKYDGDIPSSLDDLLSLP 260

Query: 148 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQKTSSP 2
           GIGPKMAHL++++ WN+VQGICVDTHVHRICNRL WVS+PGT QKT+SP
Sbjct: 261 GIGPKMAHLILHIAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTTSP 309

>ref|NP_565725.1| protein NTH1 [Arabidopsis thaliana]
 gi|75206080|sp|Q9SIC4.2|NTH1_ARATH RecName: Full=Endonuclease III homolog 1, chloroplastic; Short=AtNTH1; AltName: Full=Bifunctional DNA N-glycoslyase/DNA-(apurinic or apyrimidinic site) lyase 1; Short=DNA glycoslyase/AP lyase 1; Flags: Precursor
 gi|20198157|gb|AAD26474.2| putative endonuclease [Arabidopsis thaliana]
 gi|330253455|gb|AEC08549.1| protein NTH1 [Arabidopsis thaliana]
          Length = 379

 Score = 277 bits (709), Expect = 1e-72
 Identities = 129/169 (76%), Positives = 150/169 (88%)
 Frame = -2

Query: 508 LDGIRKMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 329
           L+GIR+MRSSEDAPVDSMGC+KAGSFLPP ERRFAVL+ +LLSSQTKD V + AI RLHQ
Sbjct: 141 LEGIRQMRSSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDQVNNAAIHRLHQ 200

Query: 328 NGMLAPDVIDNVDEETIKKLIYPVGFYTRKACNLKKIAKICLTKYGGDIPSTLEDLLLLP 149
           NG+L P+ +D DE TIK+LIYPVGFYTRKA +KKIA+ICL KY GDIPS+L+DLL LP
Sbjct: 201 NGLLTPEAVDKADESTIKELIYPVGFYTRKATYMKKIARICLVKYDGDIPSSLDDLLSLP 260

Query: 148 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGQKTSSP 2
           GIGPKMAHL++++ WN+VQGICVDTHVHRICNRL WVS+PGT QKT+SP
Sbjct: 261 GIGPKMAHLILHIAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTTSP 309