BLASTX nr result
ID: Akebia25_contig00047049
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00047049 (726 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002264475.2| PREDICTED: endonuclease III-like protein 1-l...   247   3e-63
ref|XP_006443060.1| hypothetical protein CICLE_v10020813mg [Citr...   241   1e-61
emb|CBI36652.3| unnamed protein product [Vitis vinifera]              241   1e-61
ref|XP_006494172.1| PREDICTED: endonuclease III-like protein 1-l...   238   2e-60
ref|XP_007034068.1| DNA glycosylase superfamily protein isoform ...   236   8e-60
ref|XP_007034067.1| DNA glycosylase superfamily protein isoform ...   236   8e-60
ref|XP_002534117.1| endonuclease III, putative [Ricinus communis...   228   1e-57
ref|XP_004241397.1| PREDICTED: endonuclease III-like protein 1-l...   216   5e-54
ref|XP_007034070.1| DNA glycosylase superfamily protein isoform ...   215   1e-53
ref|XP_007034069.1| DNA glycosylase superfamily protein isoform ...   215   1e-53
ref|XP_007131959.1| hypothetical protein PHAVU_011G055100g [Phas...   202   1e-49
ref|XP_004152104.1| PREDICTED: endonuclease III-like protein 1-l...   198   2e-48
ref|XP_006347463.1| PREDICTED: endonuclease III-like protein 1-l...   197   3e-48
ref|XP_006410282.1| hypothetical protein EUTSA_v10016815mg [Eutr...   195   2e-47
gb|EYU42853.1| hypothetical protein MIMGU_mgv1a009936mg [Mimulus...   194   2e-47
ref|XP_003539044.2| PREDICTED: endonuclease III-like protein 1-l...   194   2e-47
ref|XP_007131960.1| hypothetical protein PHAVU_011G055100g [Phas...   194   3e-47
ref|XP_004507328.1| PREDICTED: endonuclease III-like protein 1-l...   194   3e-47
ref|NP_565725.1| protein NTH1 [Arabidopsis thaliana] gi|75206080...
193 4e-47 emb|CAC16135.1| endonuclease III homologue [Arabidopsis thaliana] 193 4e-47 >ref|XP_002264475.2| PREDICTED: endonuclease III-like protein 1-like [Vitis vinifera] Length = 355 Score = 247 bits (630), Expect = 3e-63 Identities = 128/192 (66%), Positives = 147/192 (76%) Frame = +3 Query: 150 KFPVKSEIPNPESNPEIRVFVRKRRLKKTVEIPAEQPKIEPLQQKKKLCGLPDIEEFAYG 329 K E PN S E+RVFVRK+R+K VE P ++ K EP QQK +C LPDIEEF Y Sbjct: 18 KTSAHEETPNGVSGSEVRVFVRKKRVKMAVETPEKEIKAEPQQQK--ICELPDIEEFTYR 75 Query: 330 NVNESAQTRHTKPTSNMLAVGSEDAFPIKTKVEPPSNWEEVLEGIRKMRSSEDAPVDSMG 509 S R +KPTS++ G+E I+ E P+NWE++LEGIRKMRSSEDAPVDSMG Sbjct: 76 KGKRSTHLRKSKPTSDVPPGGTEITSSIRPAAELPANWEKILEGIRKMRSSEDAPVDSMG 135 Query: 510 CEKAGSILPPKERRFAILVSSLLSSQTKDEVTHGAIQRLLQNGLLTADAIDNAEEATIKN 689 CEKAGS LPP+ERRFA+LVSSLLSSQTKD VTHGAIQRLLQNGLL ADAID A+EAT+K+ Sbjct: 136 CEKAGSSLPPRERRFAVLVSSLLSSQTKDNVTHGAIQRLLQNGLLVADAIDKADEATVKS 195 Query: 690 LIYPVGFYSRKA 725 LIYPVGFYSRKA Sbjct: 196 LIYPVGFYSRKA 207 >ref|XP_006443060.1| hypothetical protein CICLE_v10020813mg [Citrus clementina] gi|557545322|gb|ESR56300.1| hypothetical protein CICLE_v10020813mg [Citrus clementina] Length = 357 Score = 241 bits (616), Expect = 1e-61 Identities = 130/212 (61%), Positives = 159/212 (75%), Gaps = 6/212 (2%) Frame = +3 Query: 108 YLLPQMSEIRLFSKKF-PVKSEIPNPESNPEIRVFVRKRRLKKTVEIPAEQPKIE-PLQQ 281 ++L +M R +SK+ + NPE+RVFVR++R K ++I E+PK E P++ Sbjct: 3 HILLKMPNSRFYSKRLLQPNANFSTSPPNPELRVFVRRKRQKNALQISKEEPKNEAPIEH 62 Query: 282 KKKLCGLPDIEEFAYGNVNESAQTRH----TKPTSNMLAVGSEDAFPIKTKVEPPSNWEE 449 K CGLPDIEEFAY N SA + +K T +M VG+E A + + EPP+NWE Sbjct: 63 KS--CGLPDIEEFAYKEANGSALSSKIAGKSKSTQDMPVVGTEVASLNRMRGEPPANWER 120 Query: 450 VLEGIRKMRSSEDAPVDSMGCEKAGSILPPKERRFAILVSSLLSSQTKDEVTHGAIQRLL 629 VLEGIRKMR+SEDAPVDSMGCEKAGS LPP+ERRFA+L+SSLLSSQTKD VTHGAIQRLL Sbjct: 121 VLEGIRKMRTSEDAPVDSMGCEKAGSSLPPRERRFAVLISSLLSSQTKDNVTHGAIQRLL 180 Query: 630 QNGLLTADAIDNAEEATIKNLIYPVGFYSRKA 725 QNGLLTA+AID A+EATIK+LIYPVGFY+RKA Sbjct: 181 
QNGLLTAEAIDKADEATIKDLIYPVGFYTRKA 212 >emb|CBI36652.3| unnamed protein product [Vitis vinifera] Length = 379 Score = 241 bits (616), Expect = 1e-61 Identities = 128/195 (65%), Positives = 147/195 (75%), Gaps = 3/195 (1%) Frame = +3 Query: 150 KFPVKSEIPNPESNPEIRVFVRKRRLKKTVEIPAEQPKIEPLQQKKKLCGLPDIEEFAYG 329 K E PN S E+RVFVRK+R+K VE P ++ K EP QQK +C LPDIEEF Y Sbjct: 39 KTSAHEETPNGVSGSEVRVFVRKKRVKMAVETPEKEIKAEPQQQK--ICELPDIEEFTYR 96 Query: 330 NVNESAQTRHTKPTSNMLAVGSEDAFPIKTKVEPPSNWEEVLEGIRKMRSSEDAPVDSMG 509 S R +KPTS++ G+E I+ E P+NWE++LEGIRKMRSSEDAPVDSMG Sbjct: 97 KGKRSTHLRKSKPTSDVPPGGTEITSSIRPAAELPANWEKILEGIRKMRSSEDAPVDSMG 156 Query: 510 CEKAGSILPPKERRFAILVSSLLSSQTKDEVTH---GAIQRLLQNGLLTADAIDNAEEAT 680 CEKAGS LPP+ERRFA+LVSSLLSSQTKD VTH GAIQRLLQNGLL ADAID A+EAT Sbjct: 157 CEKAGSSLPPRERRFAVLVSSLLSSQTKDNVTHGNAGAIQRLLQNGLLVADAIDKADEAT 216 Query: 681 IKNLIYPVGFYSRKA 725 +K+LIYPVGFYSRKA Sbjct: 217 VKSLIYPVGFYSRKA 231 >ref|XP_006494172.1| PREDICTED: endonuclease III-like protein 1-like [Citrus sinensis] Length = 357 Score = 238 bits (606), Expect = 2e-60 Identities = 129/212 (60%), Positives = 158/212 (74%), Gaps = 6/212 (2%) Frame = +3 Query: 108 YLLPQMSEIRLFSKKF-PVKSEIPNPESNPEIRVFVRKRRLKKTVEIPAEQPKIE-PLQQ 281 ++L +M R +SK+ + NPE+RVFVR++R K ++I E+PK E P++ Sbjct: 3 HILLKMPNSRFYSKRLLQPNANFSTSPPNPELRVFVRRKRQKNALQISKEEPKNEAPIEH 62 Query: 282 KKKLCGLPDIEEFAYGNVNESAQTRH----TKPTSNMLAVGSEDAFPIKTKVEPPSNWEE 449 K CGLPDIEEFAY N SA + +K T +M VG+E A + + EPP+NWE Sbjct: 63 KS--CGLPDIEEFAYKEANGSALSSKIAGKSKSTQDMPVVGTEVASLNRMRGEPPANWER 120 Query: 450 VLEGIRKMRSSEDAPVDSMGCEKAGSILPPKERRFAILVSSLLSSQTKDEVTHGAIQRLL 629 VLEGIRKMR+SEDAPVDSMGCEKAGS LPP+ERRFA+L+SSLLSSQTKD VTHGAIQRLL Sbjct: 121 VLEGIRKMRTSEDAPVDSMGCEKAGSSLPPRERRFAVLISSLLSSQTKDNVTHGAIQRLL 180 Query: 630 QNGLLTADAIDNAEEATIKNLIYPVGFYSRKA 725 QNGLLTA+AID A+EATIK+LIY VGFY+RKA Sbjct: 181 QNGLLTAEAIDKADEATIKDLIYLVGFYTRKA 212 >ref|XP_007034068.1| DNA glycosylase superfamily protein isoform 2 [Theobroma cacao] gi|508713097|gb|EOY04994.1| DNA 
glycosylase superfamily protein isoform 2 [Theobroma cacao] Length = 359 Score = 236 bits (601), Expect = 8e-60 Identities = 124/196 (63%), Positives = 150/196 (76%), Gaps = 7/196 (3%) Frame = +3 Query: 159 VKSEIPNPESN-------PEIRVFVRKRRLKKTVEIPAEQPKIEPLQQKKKLCGLPDIEE 317 V S PNP S P +RVF RK+R+KKTV++ E PK E + KLCGLPDIEE Sbjct: 43 VPSSDPNPGSETTDNVSVPAVRVFTRKKRVKKTVDVVQEIPKAE--NKGLKLCGLPDIEE 100 Query: 318 FAYGNVNESAQTRHTKPTSNMLAVGSEDAFPIKTKVEPPSNWEEVLEGIRKMRSSEDAPV 497 FAY V+ + + +K TS+ + VG+ A P+ P+NWE+VLEGIRKMRS+EDAPV Sbjct: 101 FAYKKVDGPSLSGKSKSTSDEINVGTGIASPVGIGGNAPANWEKVLEGIRKMRSAEDAPV 160 Query: 498 DSMGCEKAGSILPPKERRFAILVSSLLSSQTKDEVTHGAIQRLLQNGLLTADAIDNAEEA 677 D+MGCEKAGS+LPPKERRFA+L+SSLLSSQTKD VTHGAIQRL+QN L+T DAID A+EA Sbjct: 161 DTMGCEKAGSVLPPKERRFAVLISSLLSSQTKDHVTHGAIQRLIQNCLMTPDAIDKADEA 220 Query: 678 TIKNLIYPVGFYSRKA 725 TIK+LIYPVGFY+RKA Sbjct: 221 TIKDLIYPVGFYTRKA 236 >ref|XP_007034067.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508713096|gb|EOY04993.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] Length = 387 Score = 236 bits (601), Expect = 8e-60 Identities = 124/196 (63%), Positives = 150/196 (76%), Gaps = 7/196 (3%) Frame = +3 Query: 159 VKSEIPNPESN-------PEIRVFVRKRRLKKTVEIPAEQPKIEPLQQKKKLCGLPDIEE 317 V S PNP S P +RVF RK+R+KKTV++ E PK E + KLCGLPDIEE Sbjct: 43 VPSSDPNPGSETTDNVSVPAVRVFTRKKRVKKTVDVVQEIPKAE--NKGLKLCGLPDIEE 100 Query: 318 FAYGNVNESAQTRHTKPTSNMLAVGSEDAFPIKTKVEPPSNWEEVLEGIRKMRSSEDAPV 497 FAY V+ + + +K TS+ + VG+ A P+ P+NWE+VLEGIRKMRS+EDAPV Sbjct: 101 FAYKKVDGPSLSGKSKSTSDEINVGTGIASPVGIGGNAPANWEKVLEGIRKMRSAEDAPV 160 Query: 498 DSMGCEKAGSILPPKERRFAILVSSLLSSQTKDEVTHGAIQRLLQNGLLTADAIDNAEEA 677 D+MGCEKAGS+LPPKERRFA+L+SSLLSSQTKD VTHGAIQRL+QN L+T DAID A+EA Sbjct: 161 DTMGCEKAGSVLPPKERRFAVLISSLLSSQTKDHVTHGAIQRLIQNCLMTPDAIDKADEA 220 Query: 678 TIKNLIYPVGFYSRKA 725 TIK+LIYPVGFY+RKA Sbjct: 221 TIKDLIYPVGFYTRKA 236 >ref|XP_002534117.1| endonuclease III, putative [Ricinus communis] gi|223525829|gb|EEF28268.1| 
endonuclease III, putative [Ricinus communis] Length = 357 Score = 228 bits (582), Expect = 1e-57 Identities = 124/207 (59%), Positives = 152/207 (73%), Gaps = 10/207 (4%) Frame = +3 Query: 135 RLFSKKFPVKSEI------PNPESN----PEIRVFVRKRRLKKTVEIPAEQPKIEPLQQK 284 R SK K+EI P P SN P RV+VRK+R K+T+E+ ++ K+E + K Sbjct: 5 RFSSKSLQSKTEIQILSSDPIPGSNEATEPASRVYVRKKRAKRTLEVAEKELKVETKEVK 64 Query: 285 KKLCGLPDIEEFAYGNVNESAQTRHTKPTSNMLAVGSEDAFPIKTKVEPPSNWEEVLEGI 464 + LPDIE+F++ N SA R +KP+ ++L V +E A I+ EPP+NWE VLEGI Sbjct: 65 QS--ALPDIEDFSFKGTNGSAYLRKSKPSRDVLPVDNEVACTIRPSDEPPANWEIVLEGI 122 Query: 465 RKMRSSEDAPVDSMGCEKAGSILPPKERRFAILVSSLLSSQTKDEVTHGAIQRLLQNGLL 644 RKMRSSEDAPVD+MGCEKAGS LP KERRFA+LVSSL+SSQTKD VTHGA+QRL QN LL Sbjct: 123 RKMRSSEDAPVDTMGCEKAGSFLPSKERRFAVLVSSLMSSQTKDHVTHGAVQRLHQNSLL 182 Query: 645 TADAIDNAEEATIKNLIYPVGFYSRKA 725 TADAID A+E TIK+LIYPVGFY+RKA Sbjct: 183 TADAIDKADETTIKDLIYPVGFYTRKA 209 >ref|XP_004241397.1| PREDICTED: endonuclease III-like protein 1-like [Solanum lycopersicum] Length = 380 Score = 216 bits (551), Expect = 5e-54 Identities = 115/187 (61%), Positives = 139/187 (74%), Gaps = 7/187 (3%) Frame = +3 Query: 186 SNPEIRVFVRKRRLKKTVEIPAEQPKIEPLQQKKKLCGLPDIEEFAYGNVNESAQTRHTK 365 S PE+RVF+R++R+KKTVE+ A++ K E +K L LPDIE+F+Y Q+ +K Sbjct: 53 SVPELRVFIRRKRVKKTVEVIAKEVKEESSGKKVMLVRLPDIEDFSYSKDITHPQSTPSK 112 Query: 366 P-------TSNMLAVGSEDAFPIKTKVEPPSNWEEVLEGIRKMRSSEDAPVDSMGCEKAG 524 T L F + ++PPSNWE+VLEGIRKMRS+EDAPVDSMGCEKAG Sbjct: 113 TVRLTGEKTLPQLMQTEIKGFSLSDPLQPPSNWEKVLEGIRKMRSAEDAPVDSMGCEKAG 172 Query: 525 SILPPKERRFAILVSSLLSSQTKDEVTHGAIQRLLQNGLLTADAIDNAEEATIKNLIYPV 704 S LP KERRFA+LVSSLLSSQTKD+V HGA+QRLLQNGLL ADAID+A E TIK+LIYPV Sbjct: 173 SSLPAKERRFAVLVSSLLSSQTKDQVNHGAVQRLLQNGLLAADAIDSANEETIKSLIYPV 232 Query: 705 GFYSRKA 725 GFY+RKA Sbjct: 233 GFYTRKA 239 >ref|XP_007034070.1| DNA glycosylase superfamily protein isoform 4 [Theobroma cacao] gi|508713099|gb|EOY04996.1| DNA glycosylase superfamily protein isoform 4 [Theobroma cacao] Length = 336 Score = 
215 bits (547), Expect = 1e-53 Identities = 117/196 (59%), Positives = 139/196 (70%), Gaps = 7/196 (3%) Frame = +3 Query: 159 VKSEIPNPESN-------PEIRVFVRKRRLKKTVEIPAEQPKIEPLQQKKKLCGLPDIEE 317 V S PNP S P +RVF RK+R+KKTV++ E PK E + KLCGLPDIEE Sbjct: 43 VPSSDPNPGSETTDNVSVPAVRVFTRKKRVKKTVDVVQEIPKAE--NKGLKLCGLPDIEE 100 Query: 318 FAYGNVNESAQTRHTKPTSNMLAVGSEDAFPIKTKVEPPSNWEEVLEGIRKMRSSEDAPV 497 FAY V+ + + + P+NWE+VLEGIRKMRS+EDAPV Sbjct: 101 FAYKKVDGPSLSGNA-----------------------PANWEKVLEGIRKMRSAEDAPV 137 Query: 498 DSMGCEKAGSILPPKERRFAILVSSLLSSQTKDEVTHGAIQRLLQNGLLTADAIDNAEEA 677 D+MGCEKAGS+LPPKERRFA+L+SSLLSSQTKD VTHGAIQRL+QN L+T DAID A+EA Sbjct: 138 DTMGCEKAGSVLPPKERRFAVLISSLLSSQTKDHVTHGAIQRLIQNCLMTPDAIDKADEA 197 Query: 678 TIKNLIYPVGFYSRKA 725 TIK+LIYPVGFY+RKA Sbjct: 198 TIKDLIYPVGFYTRKA 213 >ref|XP_007034069.1| DNA glycosylase superfamily protein isoform 3 [Theobroma cacao] gi|508713098|gb|EOY04995.1| DNA glycosylase superfamily protein isoform 3 [Theobroma cacao] Length = 364 Score = 215 bits (547), Expect = 1e-53 Identities = 117/196 (59%), Positives = 139/196 (70%), Gaps = 7/196 (3%) Frame = +3 Query: 159 VKSEIPNPESN-------PEIRVFVRKRRLKKTVEIPAEQPKIEPLQQKKKLCGLPDIEE 317 V S PNP S P +RVF RK+R+KKTV++ E PK E + KLCGLPDIEE Sbjct: 43 VPSSDPNPGSETTDNVSVPAVRVFTRKKRVKKTVDVVQEIPKAE--NKGLKLCGLPDIEE 100 Query: 318 FAYGNVNESAQTRHTKPTSNMLAVGSEDAFPIKTKVEPPSNWEEVLEGIRKMRSSEDAPV 497 FAY V+ + + + P+NWE+VLEGIRKMRS+EDAPV Sbjct: 101 FAYKKVDGPSLSGNA-----------------------PANWEKVLEGIRKMRSAEDAPV 137 Query: 498 DSMGCEKAGSILPPKERRFAILVSSLLSSQTKDEVTHGAIQRLLQNGLLTADAIDNAEEA 677 D+MGCEKAGS+LPPKERRFA+L+SSLLSSQTKD VTHGAIQRL+QN L+T DAID A+EA Sbjct: 138 DTMGCEKAGSVLPPKERRFAVLISSLLSSQTKDHVTHGAIQRLIQNCLMTPDAIDKADEA 197 Query: 678 TIKNLIYPVGFYSRKA 725 TIK+LIYPVGFY+RKA Sbjct: 198 TIKDLIYPVGFYTRKA 213 >ref|XP_007131959.1| hypothetical protein PHAVU_011G055100g [Phaseolus vulgaris] gi|561004959|gb|ESW03953.1| hypothetical protein PHAVU_011G055100g [Phaseolus vulgaris] Length = 359 Score = 202 bits (513), 
Expect = 1e-49 Identities = 116/210 (55%), Positives = 144/210 (68%), Gaps = 9/210 (4%) Frame = +3 Query: 123 MSE-IRLFSKKFPVKSEIPNP---ESNPEIRVFVRKRRLKKTVEIPAEQPKIEPLQQKKK 290 MSE R F K P P SN ++RVFVR+ + + + + E+ PL Q K Sbjct: 1 MSEKTRPFCKVTPPNPNTPTSFVESSNSKVRVFVRRNKKPRKMAVKLEEEDHLPLTQDHK 60 Query: 291 L-----CGLPDIEEFAYGNVNESAQTRHTKPTSNMLAVGSEDAFPIKTKVEPPSNWEEVL 455 + GLP+IE+FAY NE + R ++ S++ +V SE A + + P++WE+VL Sbjct: 61 VPVTQKFGLPEIEDFAYCGGNELTRRRKSEMESDVASVASEVA-STRPGGKSPAHWEKVL 119 Query: 456 EGIRKMRSSEDAPVDSMGCEKAGSILPPKERRFAILVSSLLSSQTKDEVTHGAIQRLLQN 635 EGIRKMRSS DAPVD+MGCEKAG LPPKERRFA+LVSSLLSSQTKD VTHGAIQRLLQN Sbjct: 120 EGIRKMRSSADAPVDTMGCEKAGDTLPPKERRFAVLVSSLLSSQTKDPVTHGAIQRLLQN 179 Query: 636 GLLTADAIDNAEEATIKNLIYPVGFYSRKA 725 LLT +AI+N +E TIK LIYPVGFY+RKA Sbjct: 180 DLLTPEAINNVDEETIKKLIYPVGFYTRKA 209 >ref|XP_004152104.1| PREDICTED: endonuclease III-like protein 1-like [Cucumis sativus] gi|449521044|ref|XP_004167541.1| PREDICTED: endonuclease III-like protein 1-like [Cucumis sativus] Length = 386 Score = 198 bits (503), Expect = 2e-48 Identities = 111/188 (59%), Positives = 135/188 (71%), Gaps = 5/188 (2%) Frame = +3 Query: 177 NPESNPEIRVFVRKRRLKKTVEIPAEQPKIEPLQQKKKLCGLPDIEEFAYGNVNESAQTR 356 N S PE RVFVR RR+KK E ++EP K+ C P+IE+FA+ +S +R Sbjct: 53 NGVSEPETRVFVR-RRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSR 110 Query: 357 HTKPTSNMLAVGSEDAFPI--KTKVE---PPSNWEEVLEGIRKMRSSEDAPVDSMGCEKA 521 KP ++L G ED+ P K K E PP NWE+VL+GIR+MRSSE+APVD+MGC +A Sbjct: 111 KLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRA 170 Query: 522 GSILPPKERRFAILVSSLLSSQTKDEVTHGAIQRLLQNGLLTADAIDNAEEATIKNLIYP 701 GS LPPKERRFA+L SSLLSSQTKD VTHGA RL ++GLLTADA+D A+E TIK+LIYP Sbjct: 171 GSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETIKSLIYP 230 Query: 702 VGFYSRKA 725 VGFYS KA Sbjct: 231 VGFYSTKA 238 >ref|XP_006347463.1| PREDICTED: endonuclease III-like protein 1-like [Solanum tuberosum] Length = 422 Score = 197 bits (501), Expect = 3e-48 Identities = 116/231 (50%), Positives = 
143/231 (61%), Gaps = 51/231 (22%) Frame = +3 Query: 186 SNPEIRVFVRKRRLKKTVEIPAEQPKIEPLQQKKKLCGLPDIEEFAYGNVNESAQTR--- 356 S PE+RVF+R++R+KKTVEI A++ K E KKL LP+IE+F+Y +Q + Sbjct: 53 SVPELRVFIRRKRVKKTVEIIAKEVKEE--SSGKKLVKLPEIEDFSYSKEATHSQPKLCH 110 Query: 357 ----------------------------------HT----KPTSNMLAVGSE-------- 398 H+ P+ ++ G + Sbjct: 111 KYKLSVTSAALLFYDPVHQHLDFPNFLVFHPCANHSLLCAAPSKSVRLTGEKALSQLTQT 170 Query: 399 --DAFPIKTKVEPPSNWEEVLEGIRKMRSSEDAPVDSMGCEKAGSILPPKERRFAILVSS 572 F + ++PP NWE+VLEGIRKMRS+EDAPVDSMGCEKAGS LP KERRFA+LVSS Sbjct: 171 EIKGFSLSDPLQPPLNWEKVLEGIRKMRSAEDAPVDSMGCEKAGSSLPAKERRFAVLVSS 230 Query: 573 LLSSQTKDEVTHGAIQRLLQNGLLTADAIDNAEEATIKNLIYPVGFYSRKA 725 LLSSQTKD+V HGAIQRLLQNGLL ADAID+A E TIK+LIYPVGFY+RKA Sbjct: 231 LLSSQTKDQVNHGAIQRLLQNGLLAADAIDSANEETIKSLIYPVGFYTRKA 281 >ref|XP_006410282.1| hypothetical protein EUTSA_v10016815mg [Eutrema salsugineum] gi|557111451|gb|ESQ51735.1| hypothetical protein EUTSA_v10016815mg [Eutrema salsugineum] Length = 373 Score = 195 bits (495), Expect = 2e-47 Identities = 105/186 (56%), Positives = 131/186 (70%) Frame = +3 Query: 168 EIPNPESNPEIRVFVRKRRLKKTVEIPAEQPKIEPLQQKKKLCGLPDIEEFAYGNVNESA 347 E P S E RV+ RK+RLK+ P E+ + +K+LC LPDIEEFAY S+ Sbjct: 49 EPAKPASGSETRVYTRKKRLKQEAFQPLEKDSC--INTQKQLCRLPDIEEFAYKKNTRSS 106 Query: 348 QTRHTKPTSNMLAVGSEDAFPIKTKVEPPSNWEEVLEGIRKMRSSEDAPVDSMGCEKAGS 527 +R + TS + V S +KT P NW +VLEGIR+MRSSEDAPVDSMGC+KAGS Sbjct: 107 SSRRSTETS--ITVTS-----VKTAGNAPENWVKVLEGIRQMRSSEDAPVDSMGCDKAGS 159 Query: 528 ILPPKERRFAILVSSLLSSQTKDEVTHGAIQRLLQNGLLTADAIDNAEEATIKNLIYPVG 707 LPP ERRFA+L+ +LLSSQTKDEV + AI RL QNGLLT +A+D A+E+T++ LIYPVG Sbjct: 160 FLPPTERRFAVLLGALLSSQTKDEVNNAAIHRLHQNGLLTPEAVDKADESTLRELIYPVG 219 Query: 708 FYSRKA 725 FY+RKA Sbjct: 220 FYTRKA 225 >gb|EYU42853.1| hypothetical protein MIMGU_mgv1a009936mg [Mimulus guttatus] Length = 319 Score = 194 bits (494), Expect = 2e-47 Identities = 107/176 (60%), Positives = 125/176 (71%) Frame = +3 Query: 198 
IRVFVRKRRLKKTVEIPAEQPKIEPLQQKKKLCGLPDIEEFAYGNVNESAQTRHTKPTSN 377 +RV++RK+R KTV+ E+ E + +K C LP+IE+FAYGN N ++ Sbjct: 65 VRVYIRKKRSNKTVQPIVEEINPEIIDEKP--CSLPEIEDFAYGNGNSVSRL-------- 114 Query: 378 MLAVGSEDAFPIKTKVEPPSNWEEVLEGIRKMRSSEDAPVDSMGCEKAGSILPPKERRFA 557 TK+ P NWE+VLEGIR MRSSEDAPVDSMGCEKAGS LPPKERRFA Sbjct: 115 -------------TKL--PENWEKVLEGIRTMRSSEDAPVDSMGCEKAGSSLPPKERRFA 159 Query: 558 ILVSSLLSSQTKDEVTHGAIQRLLQNGLLTADAIDNAEEATIKNLIYPVGFYSRKA 725 +LVSSLLSSQTKD+VTHGAIQRLL+ LLTA+AID A E IK LIYPVGFYSRKA Sbjct: 160 VLVSSLLSSQTKDQVTHGAIQRLLEKDLLTAEAIDGANEGAIKELIYPVGFYSRKA 215 >ref|XP_003539044.2| PREDICTED: endonuclease III-like protein 1-like [Glycine max] Length = 357 Score = 194 bits (494), Expect = 2e-47 Identities = 105/181 (58%), Positives = 132/181 (72%), Gaps = 1/181 (0%) Frame = +3 Query: 186 SNPEIRVFVRKRRLKKTVEIPAEQPKIEPLQQK-KKLCGLPDIEEFAYGNVNESAQTRHT 362 ++ ++RVF+R+ + + + + EQ + L+ GLP+IEEFAY E Q + Sbjct: 27 THSQVRVFMRRNKRPRNMALKLEQSDHQDLKVPVTHKFGLPEIEEFAYCGAKELTQCGKS 86 Query: 363 KPTSNMLAVGSEDAFPIKTKVEPPSNWEEVLEGIRKMRSSEDAPVDSMGCEKAGSILPPK 542 + S+ + V SE A ++ E P+ WE+VLEGIRKMR S DAPVD+MGCEKAG LPPK Sbjct: 87 EMGSDAIPVASEVA-STRSSGESPAQWEKVLEGIRKMRCSADAPVDTMGCEKAGETLPPK 145 Query: 543 ERRFAILVSSLLSSQTKDEVTHGAIQRLLQNGLLTADAIDNAEEATIKNLIYPVGFYSRK 722 ERRFA+LVSSLLSSQTKD VTHGAIQRLLQN LLTADAI++A+E TIK LIYPVGFY+RK Sbjct: 146 ERRFAVLVSSLLSSQTKDPVTHGAIQRLLQNDLLTADAINDADEETIKKLIYPVGFYTRK 205 Query: 723 A 725 A Sbjct: 206 A 206 >ref|XP_007131960.1| hypothetical protein PHAVU_011G055100g [Phaseolus vulgaris] gi|561004960|gb|ESW03954.1| hypothetical protein PHAVU_011G055100g [Phaseolus vulgaris] Length = 408 Score = 194 bits (493), Expect = 3e-47 Identities = 106/185 (57%), Positives = 134/185 (72%), Gaps = 5/185 (2%) Frame = +3 Query: 186 SNPEIRVFVRKRRLKKTVEIPAEQPKIEPLQQKKKL-----CGLPDIEEFAYGNVNESAQ 350 S+ + RVFVR+ + + + + E+ P Q K+ GLP+IE+FAY NE + Sbjct: 75 SHSKARVFVRRNKNPRKMAVKLEEEDHLPSTQDHKVPVTQKFGLPEIEDFAYCGGNELTR 134 Query: 351 
TRHTKPTSNMLAVGSEDAFPIKTKVEPPSNWEEVLEGIRKMRSSEDAPVDSMGCEKAGSI 530 R ++ S++ +V SE A + + P++WE+VLEGIRKMRSS DAPVD+MGCEKAG Sbjct: 135 RRKSEMESDVASVASEVA-STRPGGKSPAHWEKVLEGIRKMRSSADAPVDTMGCEKAGDT 193 Query: 531 LPPKERRFAILVSSLLSSQTKDEVTHGAIQRLLQNGLLTADAIDNAEEATIKNLIYPVGF 710 LPPKERRFA+LVSSLLSSQTKD VTHGAIQRLLQN LLT +AI+N +E TIK LIYPVGF Sbjct: 194 LPPKERRFAVLVSSLLSSQTKDPVTHGAIQRLLQNDLLTPEAINNVDEETIKKLIYPVGF 253 Query: 711 YSRKA 725 Y+RKA Sbjct: 254 YTRKA 258 >ref|XP_004507328.1| PREDICTED: endonuclease III-like protein 1-like [Cicer arietinum] Length = 387 Score = 194 bits (493), Expect = 3e-47 Identities = 107/184 (58%), Positives = 131/184 (71%), Gaps = 9/184 (4%) Frame = +3 Query: 201 RVFVRK------RRLKKTVEIPAEQPK-IEPLQQKKKLCGLPDIEEFAYGNVNESAQTRH 359 RV+VR+ +R K +Q + P Q KK GLP+IE+FAY NE Q R Sbjct: 54 RVYVRRNNSNNNKRAKGITTTKLQQNHHLPPTQTHKKFGGLPEIEDFAYRGPNELTQFRK 113 Query: 360 TKPTSNMLAVGSEDAFPIKT--KVEPPSNWEEVLEGIRKMRSSEDAPVDSMGCEKAGSIL 533 ++ +S+++ +E++ + E P++WEE LEGIRKMR S DAPVD+MGCEKAGS L Sbjct: 114 SEISSDVIVKPAEESEVASAAHRSESPADWEETLEGIRKMRCSADAPVDTMGCEKAGSTL 173 Query: 534 PPKERRFAILVSSLLSSQTKDEVTHGAIQRLLQNGLLTADAIDNAEEATIKNLIYPVGFY 713 PPKERRFA+LVSSLLSSQTKD V HGAIQRLLQN LLT DAI+NA+E TIK LIYPVGFY Sbjct: 174 PPKERRFAVLVSSLLSSQTKDHVNHGAIQRLLQNDLLTPDAINNADEETIKKLIYPVGFY 233 Query: 714 SRKA 725 +RKA Sbjct: 234 TRKA 237 >ref|NP_565725.1| protein NTH1 [Arabidopsis thaliana] gi|75206080|sp|Q9SIC4.2|NTH1_ARATH RecName: Full=Endonuclease III homolog 1, chloroplastic; Short=AtNTH1; AltName: Full=Bifunctional DNA N-glycoslyase/DNA-(apurinic or apyrimidinic site) lyase 1; Short=DNA glycoslyase/AP lyase 1; Flags: Precursor gi|20198157|gb|AAD26474.2| putative endonuclease [Arabidopsis thaliana] gi|330253455|gb|AEC08549.1| protein NTH1 [Arabidopsis thaliana] Length = 379 Score = 193 bits (491), Expect = 4e-47 Identities = 110/203 (54%), Positives = 138/203 (67%), Gaps = 9/203 (4%) Frame = +3 Query: 144 SKKFPVKSEIPNPESNPEI---------RVFVRKRRLKKTVEIPAEQPKIEPLQQKKKLC 296 SK +K++ P +SN E+ 
RV+ RK+RLK+ P E+ + + K LC Sbjct: 37 SKHISLKTQHPLSDSNSELAYGASGSETRVYTRKKRLKQEPFEPLEKYSGKGVNTHK-LC 95 Query: 297 GLPDIEEFAYGNVNESAQTRHTKPTSNMLAVGSEDAFPIKTKVEPPSNWEEVLEGIRKMR 476 GLPDIE+FAY S + + TS + V S +KT PP NW EVLEGIR+MR Sbjct: 96 GLPDIEDFAYKKTIGSPSSSRSTETS--ITVTS-----VKTAGYPPENWVEVLEGIRQMR 148 Query: 477 SSEDAPVDSMGCEKAGSILPPKERRFAILVSSLLSSQTKDEVTHGAIQRLLQNGLLTADA 656 SSEDAPVDSMGC+KAGS LPP ERRFA+L+ +LLSSQTKD+V + AI RL QNGLLT +A Sbjct: 149 SSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDQVNNAAIHRLHQNGLLTPEA 208 Query: 657 IDNAEEATIKNLIYPVGFYSRKA 725 +D A+E+TIK LIYPVGFY+RKA Sbjct: 209 VDKADESTIKELIYPVGFYTRKA 231 >emb|CAC16135.1| endonuclease III homologue [Arabidopsis thaliana] Length = 354 Score = 193 bits (491), Expect = 4e-47 Identities = 110/203 (54%), Positives = 138/203 (67%), Gaps = 9/203 (4%) Frame = +3 Query: 144 SKKFPVKSEIPNPESNPEI---------RVFVRKRRLKKTVEIPAEQPKIEPLQQKKKLC 296 SK +K++ P +SN E+ RV+ RK+RLK+ P E+ + + K LC Sbjct: 12 SKHISLKTQHPLSDSNSELAYGASGSETRVYTRKKRLKQEPFEPLEKYSGKGVNTHK-LC 70 Query: 297 GLPDIEEFAYGNVNESAQTRHTKPTSNMLAVGSEDAFPIKTKVEPPSNWEEVLEGIRKMR 476 GLPDIE+FAY S + + TS + V S +KT PP NW EVLEGIR+MR Sbjct: 71 GLPDIEDFAYKKTIGSPSSSRSTETS--ITVTS-----VKTAGYPPENWVEVLEGIRQMR 123 Query: 477 SSEDAPVDSMGCEKAGSILPPKERRFAILVSSLLSSQTKDEVTHGAIQRLLQNGLLTADA 656 SSEDAPVDSMGC+KAGS LPP ERRFA+L+ +LLSSQTKD+V + AI RL QNGLLT +A Sbjct: 124 SSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDQVNNAAIHRLHQNGLLTPEA 183 Query: 657 IDNAEEATIKNLIYPVGFYSRKA 725 +D A+E+TIK LIYPVGFY+RKA Sbjct: 184 VDKADESTIKELIYPVGFYTRKA 206