BLASTX nr result
ID: Sinomenium21_contig00017764
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00017764 (1286 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006443060.1| hypothetical protein CICLE_v10020813mg [Citr...   453   e-125
ref|XP_002264475.2| PREDICTED: endonuclease III-like protein 1-l...   450   e-124
ref|XP_006494172.1| PREDICTED: endonuclease III-like protein 1-l...   449   e-123
ref|XP_002534117.1| endonuclease III, putative [Ricinus communis...   447   e-123
ref|XP_007034069.1| DNA glycosylase superfamily protein isoform ...   445   e-122
emb|CBI36652.3| unnamed protein product [Vitis vinifera]              444   e-122
ref|XP_007034067.1| DNA glycosylase superfamily protein isoform ...   439   e-120
ref|XP_004241397.1| PREDICTED: endonuclease III-like protein 1-l...   430   e-118
ref|XP_007222713.1| hypothetical protein PRUPE_ppa009900mg [Prun...   423   e-116
ref|XP_004309826.1| PREDICTED: endonuclease III-like protein 1-l...   420   e-115
ref|XP_007131959.1| hypothetical protein PHAVU_011G055100g [Phas...   418   e-114
ref|XP_007131960.1| hypothetical protein PHAVU_011G055100g [Phas...   415   e-113
ref|XP_003539044.2| PREDICTED: endonuclease III-like protein 1-l...   412   e-112
ref|XP_004507328.1| PREDICTED: endonuclease III-like protein 1-l...   410   e-112
ref|XP_002309812.1| endonuclease-related family protein [Populus...   409   e-111
ref|XP_006347463.1| PREDICTED: endonuclease III-like protein 1-l...   409   e-111
ref|XP_006410282.1| hypothetical protein EUTSA_v10016815mg [Eutr...   407   e-111
ref|XP_002881177.1| predicted protein [Arabidopsis lyrata subsp....
404 e-110 emb|CAC16135.1| endonuclease III homologue [Arabidopsis thaliana] 404 e-110 ref|NP_565725.1| protein NTH1 [Arabidopsis thaliana] gi|75206080... 403 e-110 >ref|XP_006443060.1| hypothetical protein CICLE_v10020813mg [Citrus clementina] gi|557545322|gb|ESR56300.1| hypothetical protein CICLE_v10020813mg [Citrus clementina] Length = 357 Score = 453 bits (1165), Expect = e-125 Identities = 229/356 (64%), Positives = 268/356 (75%), Gaps = 25/356 (7%) Frame = -2 Query: 1267 LLRPMPETRSVSAKSQSKPEIPKPKAEPNAGIRVFARKRRSKCTVETHVEEHKIESPQQK 1088 +L MP +R S K +P + PN +RVF R++R K ++ EE K E+P + Sbjct: 4 ILLKMPNSRFYS-KRLLQPNANFSTSPPNPELRVFVRRKRQKNALQISKEEPKNEAPIEH 62 Query: 1087 KKLCDLSDIEEFAYGEVNGSAQMSKV-------------------------DAPANWEEV 983 K C L DIEEFAY E NGSA SK+ + PANWE V Sbjct: 63 KS-CGLPDIEEFAYKEANGSALSSKIAGKSKSTQDMPVVGTEVASLNRMRGEPPANWERV 121 Query: 982 LDGIRNMRSYEDAPVDSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 803 L+GIR MR+ EDAPVDSMGCEKAGSSLPP+ERRFAVL+SSLLSSQTKD VTHGAIQRL Q Sbjct: 122 LEGIRKMRTSEDAPVDSMGCEKAGSSLPPRERRFAVLISSLLSSQTKDNVTHGAIQRLLQ 181 Query: 802 NDLLAPDAIDNTEEETIKNLIYPVGFYSRKASNLKKIAKICLMKYGGDIPSSLKDLLLLP 623 N LL +AID +E TIK+LIYPVGFY+RKASN+KKIA ICL KY GDIPSSL +LLLLP Sbjct: 182 NGLLTAEAIDKADEATIKDLIYPVGFYTRKASNMKKIAPICLTKYDGDIPSSLDELLLLP 241 Query: 622 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSRPGTGQKTSSPEETRESLQLWL 443 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRL WVS+PG QKTSSPE+TRE LQLWL Sbjct: 242 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLGWVSQPGRKQKTSSPEQTREVLQLWL 301 Query: 442 PKDEWIAINPLLVGFGQTVCTPLRPRCGMCSINRLCPSAFKETTSPVPRTKNSGRR 275 PK+EW+ INPLLVGFGQT+CTP+RPRCGMCS++ LCPSAFK+++SP +++ S ++ Sbjct: 302 PKEEWVPINPLLVGFGQTICTPIRPRCGMCSVSELCPSAFKDSSSPSSKSRKSAQK 357 >ref|XP_002264475.2| PREDICTED: endonuclease III-like protein 1-like [Vitis vinifera] Length = 355 Score = 450 bits (1157), Expect = e-124 Identities = 232/352 (65%), Positives = 258/352 (73%), Gaps = 32/352 (9%) Frame = -2 Query: 1231 AKSQSKPEIPK-----------PKAEPNAGIRVFARKRRSKCTVETHVEEHKIESPQQKK 1085 A S SKP 
+P P + +RVF RK+R K VET +E K E QQK Sbjct: 4 ATSSSKPLLPALQSKTSAHEETPNGVSGSEVRVFVRKKRVKMAVETPEKEIKAEPQQQK- 62 Query: 1084 KLCDLSDIEEFAYGEVNGSAQMSK---------------------VDAPANWEEVLDGIR 968 +C+L DIEEF Y + S + K + PANWE++L+GIR Sbjct: 63 -ICELPDIEEFTYRKGKRSTHLRKSKPTSDVPPGGTEITSSIRPAAELPANWEKILEGIR 121 Query: 967 NMRSYEDAPVDSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQNDLLA 788 MRS EDAPVDSMGCEKAGSSLPP+ERRFAVLVSSLLSSQTKD VTHGAIQRL QN LL Sbjct: 122 KMRSSEDAPVDSMGCEKAGSSLPPRERRFAVLVSSLLSSQTKDNVTHGAIQRLLQNGLLV 181 Query: 787 PDAIDNTEEETIKNLIYPVGFYSRKASNLKKIAKICLMKYGGDIPSSLKDLLLLPGIGPK 608 DAID +E T+K+LIYPVGFYSRKA NLKKIAKICLMKY GDIPSSL++LLLLPGIGPK Sbjct: 182 ADAIDKADEATVKSLIYPVGFYSRKAGNLKKIAKICLMKYDGDIPSSLEELLLLPGIGPK 241 Query: 607 MAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSRPGTGQKTSSPEETRESLQLWLPKDEW 428 MAHLVMNV WNNVQGICVDTHVHRICNRL WVSR GT QKTS PEETRESLQLWLPK+EW Sbjct: 242 MAHLVMNVAWNNVQGICVDTHVHRICNRLGWVSRRGTKQKTSLPEETRESLQLWLPKEEW 301 Query: 427 IAINPLLVGFGQTVCTPLRPRCGMCSINRLCPSAFKETTSPVPRTKNSGRRK 272 + INPLLVGFGQT+CTPLRPRCG+C ++ LCPSAFKE SP + K G K Sbjct: 302 VPINPLLVGFGQTICTPLRPRCGVCGVSDLCPSAFKEAQSPSSKMKKPGTDK 353 >ref|XP_006494172.1| PREDICTED: endonuclease III-like protein 1-like [Citrus sinensis] Length = 357 Score = 449 bits (1155), Expect = e-123 Identities = 228/356 (64%), Positives = 267/356 (75%), Gaps = 25/356 (7%) Frame = -2 Query: 1267 LLRPMPETRSVSAKSQSKPEIPKPKAEPNAGIRVFARKRRSKCTVETHVEEHKIESPQQK 1088 +L MP +R S K +P + PN +RVF R++R K ++ EE K E+P + Sbjct: 4 ILLKMPNSRFYS-KRLLQPNANFSTSPPNPELRVFVRRKRQKNALQISKEEPKNEAPIEH 62 Query: 1087 KKLCDLSDIEEFAYGEVNGSAQMSKV-------------------------DAPANWEEV 983 K C L DIEEFAY E NGSA SK+ + PANWE V Sbjct: 63 KS-CGLPDIEEFAYKEANGSALSSKIAGKSKSTQDMPVVGTEVASLNRMRGEPPANWERV 121 Query: 982 LDGIRNMRSYEDAPVDSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 803 L+GIR MR+ EDAPVDSMGCEKAGSSLPP+ERRFAVL+SSLLSSQTKD VTHGAIQRL Q Sbjct: 122 LEGIRKMRTSEDAPVDSMGCEKAGSSLPPRERRFAVLISSLLSSQTKDNVTHGAIQRLLQ 181 Query: 802 
NDLLAPDAIDNTEEETIKNLIYPVGFYSRKASNLKKIAKICLMKYGGDIPSSLKDLLLLP 623 N LL +AID +E TIK+LIY VGFY+RKASN+KKIA ICL KY GDIPSSL +LLLLP Sbjct: 182 NGLLTAEAIDKADEATIKDLIYLVGFYTRKASNMKKIAPICLTKYDGDIPSSLDELLLLP 241 Query: 622 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSRPGTGQKTSSPEETRESLQLWL 443 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRL WVS+PG QKTSSPE+TRE LQLWL Sbjct: 242 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLGWVSQPGRKQKTSSPEQTREVLQLWL 301 Query: 442 PKDEWIAINPLLVGFGQTVCTPLRPRCGMCSINRLCPSAFKETTSPVPRTKNSGRR 275 PK+EW+ INPLLVGFGQT+CTP+RPRCGMCS++ LCPSAFK+++SP +++ S ++ Sbjct: 302 PKEEWVPINPLLVGFGQTICTPIRPRCGMCSVSELCPSAFKDSSSPSSKSRKSAQK 357 >ref|XP_002534117.1| endonuclease III, putative [Ricinus communis] gi|223525829|gb|EEF28268.1| endonuclease III, putative [Ricinus communis] Length = 357 Score = 447 bits (1149), Expect = e-123 Identities = 231/354 (65%), Positives = 264/354 (74%), Gaps = 29/354 (8%) Frame = -2 Query: 1255 MPETRSVSAKSQSKPEIPKPKAEPNAGI--------RVFARKRRSKCTVETHVEEHKIES 1100 MP TR S QSK EI ++P G RV+ RK+R+K T+E +E K+E+ Sbjct: 1 MPITRFSSKSLQSKTEIQILSSDPIPGSNEATEPASRVYVRKKRAKRTLEVAEKELKVET 60 Query: 1099 PQQKKKLCDLSDIEEFAYGEVNGSAQMSKV---------------------DAPANWEEV 983 + K+ L DIE+F++ NGSA + K + PANWE V Sbjct: 61 KEVKQSA--LPDIEDFSFKGTNGSAYLRKSKPSRDVLPVDNEVACTIRPSDEPPANWEIV 118 Query: 982 LDGIRNMRSYEDAPVDSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 803 L+GIR MRS EDAPVD+MGCEKAGS LP KERRFAVLVSSL+SSQTKD VTHGA+QRLHQ Sbjct: 119 LEGIRKMRSSEDAPVDTMGCEKAGSFLPSKERRFAVLVSSLMSSQTKDHVTHGAVQRLHQ 178 Query: 802 NDLLAPDAIDNTEEETIKNLIYPVGFYSRKASNLKKIAKICLMKYGGDIPSSLKDLLLLP 623 N LL DAID +E TIK+LIYPVGFY+RKASNLKKIAKICLMKY GDIP SL+DLL LP Sbjct: 179 NSLLTADAIDKADETTIKDLIYPVGFYTRKASNLKKIAKICLMKYDGDIPRSLEDLLSLP 238 Query: 622 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSRPGTGQKTSSPEETRESLQLWL 443 GIGPKMAHLVMNV W++VQGICVDTHVHRICNRL WVSRPGT QKTS+PEETR +LQLWL Sbjct: 239 GIGPKMAHLVMNVAWDDVQGICVDTHVHRICNRLGWVSRPGTEQKTSNPEETRVALQLWL 298 Query: 442 PKDEWIAINPLLVGFGQTVCTPLRPRCGMCSINRLCPSAFKETTSPVPRTKNSG 281 PK+EW+ 
INPLLVGFGQT+CTPLRPRCGMCSI CPSAFKET+SP + K SG Sbjct: 299 PKEEWVPINPLLVGFGQTICTPLRPRCGMCSITEFCPSAFKETSSPASKMKKSG 352 >ref|XP_007034069.1| DNA glycosylase superfamily protein isoform 3 [Theobroma cacao] gi|508713098|gb|EOY04995.1| DNA glycosylase superfamily protein isoform 3 [Theobroma cacao] Length = 364 Score = 445 bits (1144), Expect = e-122 Identities = 227/339 (66%), Positives = 261/339 (76%), Gaps = 11/339 (3%) Frame = -2 Query: 1255 MPETRSVSAKSQSKPEIPKPKAEPNAG-----------IRVFARKRRSKCTVETHVEEHK 1109 MP+TR S P ++PN G +RVF RK+R K TV+ E K Sbjct: 25 MPKTRLAFKTLSSSSTTEVPSSDPNPGSETTDNVSVPAVRVFTRKKRVKKTVDVVQEIPK 84 Query: 1108 IESPQQKKKLCDLSDIEEFAYGEVNGSAQMSKVDAPANWEEVLDGIRNMRSYEDAPVDSM 929 E+ + KLC L DIEEFAY +V+G + +APANWE+VL+GIR MRS EDAPVD+M Sbjct: 85 AEN--KGLKLCGLPDIEEFAYKKVDGPSLSG--NAPANWEKVLEGIRKMRSAEDAPVDTM 140 Query: 928 GCEKAGSSLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQNDLLAPDAIDNTEEETIK 749 GCEKAGS LPPKERRFAVL+SSLLSSQTKD VTHGAIQRL QN L+ PDAID +E TIK Sbjct: 141 GCEKAGSVLPPKERRFAVLISSLLSSQTKDHVTHGAIQRLIQNCLMTPDAIDKADEATIK 200 Query: 748 NLIYPVGFYSRKASNLKKIAKICLMKYGGDIPSSLKDLLLLPGIGPKMAHLVMNVGWNNV 569 +LIYPVGFY+RKA N+KKIAKICLMKY GDIPSSL++LLLLPGIGPKMAHLVMN+ W++V Sbjct: 201 DLIYPVGFYTRKAINVKKIAKICLMKYDGDIPSSLEELLLLPGIGPKMAHLVMNIAWDDV 260 Query: 568 QGICVDTHVHRICNRLRWVSRPGTGQKTSSPEETRESLQLWLPKDEWIAINPLLVGFGQT 389 QGICVDTHVHRICNRL WVSRPGT QKT PEETR +LQ WLPK+EW+ INPLLVGFGQT Sbjct: 261 QGICVDTHVHRICNRLGWVSRPGTKQKTLYPEETRVALQQWLPKEEWVPINPLLVGFGQT 320 Query: 388 VCTPLRPRCGMCSINRLCPSAFKETTSPVPRTKNSGRRK 272 +CTPLRP+C +CSI CPSAFKET+SP + K SG K Sbjct: 321 ICTPLRPQCEVCSITEFCPSAFKETSSPSSKVKKSGVTK 359 >emb|CBI36652.3| unnamed protein product [Vitis vinifera] Length = 379 Score = 444 bits (1143), Expect = e-122 Identities = 232/355 (65%), Positives = 258/355 (72%), Gaps = 35/355 (9%) Frame = -2 Query: 1231 AKSQSKPEIPK-----------PKAEPNAGIRVFARKRRSKCTVETHVEEHKIESPQQKK 1085 A S SKP +P P + +RVF RK+R K VET +E K E QQK Sbjct: 25 ATSSSKPLLPALQSKTSAHEETPNGVSGSEVRVFVRKKRVKMAVETPEKEIKAEPQQQK- 83 
Query: 1084 KLCDLSDIEEFAYGEVNGSAQMSK---------------------VDAPANWEEVLDGIR 968 +C+L DIEEF Y + S + K + PANWE++L+GIR Sbjct: 84 -ICELPDIEEFTYRKGKRSTHLRKSKPTSDVPPGGTEITSSIRPAAELPANWEKILEGIR 142 Query: 967 NMRSYEDAPVDSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDGVTHG---AIQRLHQND 797 MRS EDAPVDSMGCEKAGSSLPP+ERRFAVLVSSLLSSQTKD VTHG AIQRL QN Sbjct: 143 KMRSSEDAPVDSMGCEKAGSSLPPRERRFAVLVSSLLSSQTKDNVTHGNAGAIQRLLQNG 202 Query: 796 LLAPDAIDNTEEETIKNLIYPVGFYSRKASNLKKIAKICLMKYGGDIPSSLKDLLLLPGI 617 LL DAID +E T+K+LIYPVGFYSRKA NLKKIAKICLMKY GDIPSSL++LLLLPGI Sbjct: 203 LLVADAIDKADEATVKSLIYPVGFYSRKAGNLKKIAKICLMKYDGDIPSSLEELLLLPGI 262 Query: 616 GPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSRPGTGQKTSSPEETRESLQLWLPK 437 GPKMAHLVMNV WNNVQGICVDTHVHRICNRL WVSR GT QKTS PEETRESLQLWLPK Sbjct: 263 GPKMAHLVMNVAWNNVQGICVDTHVHRICNRLGWVSRRGTKQKTSLPEETRESLQLWLPK 322 Query: 436 DEWIAINPLLVGFGQTVCTPLRPRCGMCSINRLCPSAFKETTSPVPRTKNSGRRK 272 +EW+ INPLLVGFGQT+CTPLRPRCG+C ++ LCPSAFKE SP + K G K Sbjct: 323 EEWVPINPLLVGFGQTICTPLRPRCGVCGVSDLCPSAFKEAQSPSSKMKKPGTDK 377 >ref|XP_007034067.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508713096|gb|EOY04993.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] Length = 387 Score = 439 bits (1128), Expect = e-120 Identities = 228/360 (63%), Positives = 262/360 (72%), Gaps = 32/360 (8%) Frame = -2 Query: 1255 MPETRSVSAKSQSKPEIPKPKAEPNAG-----------IRVFARKRRSKCTVETHVEEHK 1109 MP+TR S P ++PN G +RVF RK+R K TV+ E K Sbjct: 25 MPKTRLAFKTLSSSSTTEVPSSDPNPGSETTDNVSVPAVRVFTRKKRVKKTVDVVQEIPK 84 Query: 1108 IESPQQKKKLCDLSDIEEFAYGEVNGSAQMSKV---------------------DAPANW 992 E+ + KLC L DIEEFAY +V+G + K +APANW Sbjct: 85 AEN--KGLKLCGLPDIEEFAYKKVDGPSLSGKSKSTSDEINVGTGIASPVGIGGNAPANW 142 Query: 991 EEVLDGIRNMRSYEDAPVDSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDGVTHGAIQR 812 E+VL+GIR MRS EDAPVD+MGCEKAGS LPPKERRFAVL+SSLLSSQTKD VTHGAIQR Sbjct: 143 EKVLEGIRKMRSAEDAPVDTMGCEKAGSVLPPKERRFAVLISSLLSSQTKDHVTHGAIQR 202 Query: 811 LHQNDLLAPDAIDNTEEETIKNLIYPVGFYSRKASNLKKIAKICLMKYGGDIPSSLKDLL 632 L QN 
L+ PDAID +E TIK+LIYPVGFY+RKA N+KKIAKICLMKY GDIPSSL++LL Sbjct: 203 LIQNCLMTPDAIDKADEATIKDLIYPVGFYTRKAINVKKIAKICLMKYDGDIPSSLEELL 262 Query: 631 LLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSRPGTGQKTSSPEETRESLQ 452 LLPGIGPKMAHLVMN+ W++VQGICVDTHVHRICNRL WVSRPGT QKT PEETR +LQ Sbjct: 263 LLPGIGPKMAHLVMNIAWDDVQGICVDTHVHRICNRLGWVSRPGTKQKTLYPEETRVALQ 322 Query: 451 LWLPKDEWIAINPLLVGFGQTVCTPLRPRCGMCSINRLCPSAFKETTSPVPRTKNSGRRK 272 WLPK+EW+ INPLLVGFGQT+CTPLRP+C +CSI CPSAFKET+SP + K SG K Sbjct: 323 QWLPKEEWVPINPLLVGFGQTICTPLRPQCEVCSITEFCPSAFKETSSPSSKVKKSGVTK 382 >ref|XP_004241397.1| PREDICTED: endonuclease III-like protein 1-like [Solanum lycopersicum] Length = 380 Score = 430 bits (1105), Expect = e-118 Identities = 221/346 (63%), Positives = 253/346 (73%), Gaps = 33/346 (9%) Frame = -2 Query: 1243 RSVSAKSQSKPEIPKPKAEPNAG-----IRVFARKRRSKCTVETHVEEHKIESPQQKKKL 1079 R+ S+ +Q P P + G +RVF R++R K TVE +E K ES +K L Sbjct: 29 RTRSSLNQETPSQKNPGCDGTGGSSVPELRVFIRRKRVKKTVEVIAKEVKEESSGKKVML 88 Query: 1078 CDLSDIEEFAYG----------------------------EVNGSAQMSKVDAPANWEEV 983 L DIE+F+Y E+ G + + P+NWE+V Sbjct: 89 VRLPDIEDFSYSKDITHPQSTPSKTVRLTGEKTLPQLMQTEIKGFSLSDPLQPPSNWEKV 148 Query: 982 LDGIRNMRSYEDAPVDSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 803 L+GIR MRS EDAPVDSMGCEKAGSSLP KERRFAVLVSSLLSSQTKD V HGA+QRL Q Sbjct: 149 LEGIRKMRSAEDAPVDSMGCEKAGSSLPAKERRFAVLVSSLLSSQTKDQVNHGAVQRLLQ 208 Query: 802 NDLLAPDAIDNTEEETIKNLIYPVGFYSRKASNLKKIAKICLMKYGGDIPSSLKDLLLLP 623 N LLA DAID+ EETIK+LIYPVGFY+RKASNLKK+AKICL KY GDIPSSL++LLLLP Sbjct: 209 NGLLAADAIDSANEETIKSLIYPVGFYTRKASNLKKVAKICLSKYNGDIPSSLEELLLLP 268 Query: 622 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSRPGTGQKTSSPEETRESLQLWL 443 GIGPKMAHLVMNV W NVQGICVDTHVHRI NRL WVSRPGT QKT +PEETRESLQLWL Sbjct: 269 GIGPKMAHLVMNVAWENVQGICVDTHVHRISNRLEWVSRPGTKQKTRTPEETRESLQLWL 328 Query: 442 PKDEWIAINPLLVGFGQTVCTPLRPRCGMCSINRLCPSAFKETTSP 305 PK+EW+ INPLLVGFGQT+CTPLRPRC +C+++ LCPSAFKE SP Sbjct: 329 PKEEWVPINPLLVGFGQTICTPLRPRCAICTVSDLCPSAFKEAASP 374 >ref|XP_007222713.1| hypothetical 
protein PRUPE_ppa009900mg [Prunus persica] gi|462419649|gb|EMJ23912.1| hypothetical protein PRUPE_ppa009900mg [Prunus persica] Length = 272 Score = 423 bits (1088), Expect = e-116 Identities = 209/271 (77%), Positives = 232/271 (85%) Frame = -2 Query: 1084 KLCDLSDIEEFAYGEVNGSAQMSKVDAPANWEEVLDGIRNMRSYEDAPVDSMGCEKAGSS 905 +L DIEEFAY +V+ S SK PANWE+VL+GIR MRS EDAPVDSMGCEKAGS+ Sbjct: 2 QLASPPDIEEFAYTKVSASTNSSK--PPANWEKVLEGIRKMRSSEDAPVDSMGCEKAGSA 59 Query: 904 LPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQNDLLAPDAIDNTEEETIKNLIYPVGF 725 LPPKERRFAVLVSSLLSSQTKD VTHGAIQRL QN+LLA D+ID EE TIK+LIYPVGF Sbjct: 60 LPPKERRFAVLVSSLLSSQTKDHVTHGAIQRLLQNNLLAADSIDKAEEATIKSLIYPVGF 119 Query: 724 YSRKASNLKKIAKICLMKYGGDIPSSLKDLLLLPGIGPKMAHLVMNVGWNNVQGICVDTH 545 Y+RKA+NLKKIAKICL KY GDIPSSL +LL LPGIGPKMAHLVMNVGWNNVQGICVDTH Sbjct: 120 YTRKATNLKKIAKICLTKYDGDIPSSLDELLSLPGIGPKMAHLVMNVGWNNVQGICVDTH 179 Query: 544 VHRICNRLRWVSRPGTGQKTSSPEETRESLQLWLPKDEWIAINPLLVGFGQTVCTPLRPR 365 VHRI NRL WVSR G QKTS+PEETRE+LQLWLPK+EW INPLLVGFGQTVCTPLRP Sbjct: 180 VHRISNRLGWVSREGRKQKTSNPEETREALQLWLPKEEWDPINPLLVGFGQTVCTPLRPH 239 Query: 364 CGMCSINRLCPSAFKETTSPVPRTKNSGRRK 272 CG+C++++ CPSAFKE +SP ++K SG K Sbjct: 240 CGVCNVSKFCPSAFKEASSPSSKSKKSGLSK 270 >ref|XP_004309826.1| PREDICTED: endonuclease III-like protein 1-like [Fragaria vesca subsp. 
vesca] Length = 341 Score = 420 bits (1079), Expect = e-115 Identities = 210/297 (70%), Positives = 244/297 (82%), Gaps = 1/297 (0%) Frame = -2 Query: 1159 RKRRSKCTVETHVEEHKIESPQQKKKLCDLSDIEEFAYGEVNGSAQMSKVDAP-ANWEEV 983 R +R K T E ++E + ++ L DIEEFAY + S+ + + P A+WE+V Sbjct: 50 RSKRLKTT------EQRLEIVAKPHQMDLLPDIEEFAYRNESSSSYSTDIGKPPAHWEKV 103 Query: 982 LDGIRNMRSYEDAPVDSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 803 L+GIR MRS EDAPVDSMGCEKAGS+LPPKERRFAVLVSSLLSSQTKD VTHGA+QRL Q Sbjct: 104 LEGIRKMRSAEDAPVDSMGCEKAGSALPPKERRFAVLVSSLLSSQTKDQVTHGAVQRLLQ 163 Query: 802 NDLLAPDAIDNTEEETIKNLIYPVGFYSRKASNLKKIAKICLMKYGGDIPSSLKDLLLLP 623 N +L+ DAID +E TIK+LIYPVGFY+RKASNLKKIA ICL+KY GDIPSSL++LL LP Sbjct: 164 NGMLSADAIDKGDEPTIKSLIYPVGFYTRKASNLKKIANICLVKYDGDIPSSLEELLSLP 223 Query: 622 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSRPGTGQKTSSPEETRESLQLWL 443 GIGPKMAHLVMNV W+NVQGICVDTHVHRICNRL WV R G QKTS+PEETRE+LQLWL Sbjct: 224 GIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWV-RAGKKQKTSNPEETREALQLWL 282 Query: 442 PKDEWIAINPLLVGFGQTVCTPLRPRCGMCSINRLCPSAFKETTSPVPRTKNSGRRK 272 PKDEW+ INPLLVGFGQTVCTPLRPRCG+CS++ CPSA+KET+SP+ +TK SG K Sbjct: 283 PKDEWVPINPLLVGFGQTVCTPLRPRCGVCSVSEFCPSAYKETSSPLSKTKKSGSSK 339 >ref|XP_007131959.1| hypothetical protein PHAVU_011G055100g [Phaseolus vulgaris] gi|561004959|gb|ESW03953.1| hypothetical protein PHAVU_011G055100g [Phaseolus vulgaris] Length = 359 Score = 418 bits (1074), Expect = e-114 Identities = 217/347 (62%), Positives = 252/347 (72%), Gaps = 26/347 (7%) Frame = -2 Query: 1249 ETRSVSAKSQSKPEIPKPKAEP-NAGIRVFARKRRS--KCTVETHVEEHKIESPQQKKKL 1079 +TR + P P E N+ +RVF R+ + K V+ E+H + K + Sbjct: 4 KTRPFCKVTPPNPNTPTSFVESSNSKVRVFVRRNKKPRKMAVKLEEEDHLPLTQDHKVPV 63 Query: 1078 CD---LSDIEEFAYGEVNGSAQMSKVD--------------------APANWEEVLDGIR 968 L +IE+FAY N + K + +PA+WE+VL+GIR Sbjct: 64 TQKFGLPEIEDFAYCGGNELTRRRKSEMESDVASVASEVASTRPGGKSPAHWEKVLEGIR 123 Query: 967 NMRSYEDAPVDSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQNDLLA 788 MRS DAPVD+MGCEKAG +LPPKERRFAVLVSSLLSSQTKD VTHGAIQRL QNDLL 
Sbjct: 124 KMRSSADAPVDTMGCEKAGDTLPPKERRFAVLVSSLLSSQTKDPVTHGAIQRLLQNDLLT 183 Query: 787 PDAIDNTEEETIKNLIYPVGFYSRKASNLKKIAKICLMKYGGDIPSSLKDLLLLPGIGPK 608 P+AI+N +EETIK LIYPVGFY+RKA+NLKKIA ICLMKY GDIPSS+ LLLLPGIGPK Sbjct: 184 PEAINNVDEETIKKLIYPVGFYTRKATNLKKIANICLMKYHGDIPSSIDQLLLLPGIGPK 243 Query: 607 MAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSRPGTGQKTSSPEETRESLQLWLPKDEW 428 MAHLVMN GWNNVQGICVDTHVHRICNRL WVSR GT QKTS+PEETRESLQ WLPK+EW Sbjct: 244 MAHLVMNAGWNNVQGICVDTHVHRICNRLGWVSRLGTNQKTSTPEETRESLQRWLPKEEW 303 Query: 427 IAINPLLVGFGQTVCTPLRPRCGMCSINRLCPSAFKETTSPVPRTKN 287 + INPLLVGFGQT+CTPLRPRCG CS+ LCPSAFKET++ P +K+ Sbjct: 304 VPINPLLVGFGQTICTPLRPRCGECSVRDLCPSAFKETSNSSPSSKS 350 >ref|XP_007131960.1| hypothetical protein PHAVU_011G055100g [Phaseolus vulgaris] gi|561004960|gb|ESW03954.1| hypothetical protein PHAVU_011G055100g [Phaseolus vulgaris] Length = 408 Score = 415 bits (1066), Expect = e-113 Identities = 211/320 (65%), Positives = 243/320 (75%), Gaps = 25/320 (7%) Frame = -2 Query: 1171 RVFARKRRS--KCTVETHVEEHKIESPQQKKKLCD---LSDIEEFAYGEVNGSAQMSKVD 1007 RVF R+ ++ K V+ E+H + K + L +IE+FAY N + K + Sbjct: 80 RVFVRRNKNPRKMAVKLEEEDHLPSTQDHKVPVTQKFGLPEIEDFAYCGGNELTRRRKSE 139 Query: 1006 --------------------APANWEEVLDGIRNMRSYEDAPVDSMGCEKAGSSLPPKER 887 +PA+WE+VL+GIR MRS DAPVD+MGCEKAG +LPPKER Sbjct: 140 MESDVASVASEVASTRPGGKSPAHWEKVLEGIRKMRSSADAPVDTMGCEKAGDTLPPKER 199 Query: 886 RFAVLVSSLLSSQTKDGVTHGAIQRLHQNDLLAPDAIDNTEEETIKNLIYPVGFYSRKAS 707 RFAVLVSSLLSSQTKD VTHGAIQRL QNDLL P+AI+N +EETIK LIYPVGFY+RKA+ Sbjct: 200 RFAVLVSSLLSSQTKDPVTHGAIQRLLQNDLLTPEAINNVDEETIKKLIYPVGFYTRKAT 259 Query: 706 NLKKIAKICLMKYGGDIPSSLKDLLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICN 527 NLKKIA ICLMKY GDIPSS+ LLLLPGIGPKMAHLVMN GWNNVQGICVDTHVHRICN Sbjct: 260 NLKKIANICLMKYHGDIPSSIDQLLLLPGIGPKMAHLVMNAGWNNVQGICVDTHVHRICN 319 Query: 526 RLRWVSRPGTGQKTSSPEETRESLQLWLPKDEWIAINPLLVGFGQTVCTPLRPRCGMCSI 347 RL WVSR GT QKTS+PEETRESLQ WLPK+EW+ INPLLVGFGQT+CTPLRPRCG CS+ Sbjct: 320 
RLGWVSRLGTNQKTSTPEETRESLQRWLPKEEWVPINPLLVGFGQTICTPLRPRCGECSV 379 Query: 346 NRLCPSAFKETTSPVPRTKN 287 LCPSAFKET++ P +K+ Sbjct: 380 RDLCPSAFKETSNSSPSSKS 399 >ref|XP_003539044.2| PREDICTED: endonuclease III-like protein 1-like [Glycine max] Length = 357 Score = 412 bits (1060), Expect = e-112 Identities = 221/354 (62%), Positives = 255/354 (72%), Gaps = 26/354 (7%) Frame = -2 Query: 1255 MPETRSVSAKSQSKPEIPKPKAEPNAGIRVFAR--KRRSKCTVETHVEEHK-IESPQQKK 1085 M ET K+ S ++ +RVF R KR ++ +H+ ++ P K Sbjct: 4 MSETTRSFCKATSPSNTTSIIEATHSQVRVFMRRNKRPRNMALKLEQSDHQDLKVPVTHK 63 Query: 1084 KLCDLSDIEEFAYGEVN-----GSAQM---------------SKVDAPANWEEVLDGIRN 965 L +IEEFAY G ++M S ++PA WE+VL+GIR Sbjct: 64 --FGLPEIEEFAYCGAKELTQCGKSEMGSDAIPVASEVASTRSSGESPAQWEKVLEGIRK 121 Query: 964 MRSYEDAPVDSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQNDLLAP 785 MR DAPVD+MGCEKAG +LPPKERRFAVLVSSLLSSQTKD VTHGAIQRL QNDLL Sbjct: 122 MRCSADAPVDTMGCEKAGETLPPKERRFAVLVSSLLSSQTKDPVTHGAIQRLLQNDLLTA 181 Query: 784 DAIDNTEEETIKNLIYPVGFYSRKASNLKKIAKICLMKYGGDIPSSLKDLLLLPGIGPKM 605 DAI++ +EETIK LIYPVGFY+RKASNLKKIA ICLMKY GDIPSS++ LLLLPGIGPKM Sbjct: 182 DAINDADEETIKKLIYPVGFYTRKASNLKKIANICLMKYDGDIPSSIEQLLLLPGIGPKM 241 Query: 604 AHLVMNVGWNNVQGICVDTHVHRICNRLRWVSRPGTGQKTSSPEETRESLQLWLPKDEWI 425 AHLVMNVGWNNVQGICVDTHVHRICNRL WVSR GT QKTS+PEETRE LQ WLPK+EW+ Sbjct: 242 AHLVMNVGWNNVQGICVDTHVHRICNRLGWVSRLGTKQKTSTPEETREELQRWLPKEEWV 301 Query: 424 AINPLLVGFGQTVCTPLRPRCGMCSINRLCPSAFKETTSPVP---RTKNSGRRK 272 INPLLVGFGQT+CTPLRPRCG CSI+ LCPSAFKET++ P ++K SG K Sbjct: 302 PINPLLVGFGQTICTPLRPRCGECSISELCPSAFKETSNSSPSSSKSKKSGLNK 355 >ref|XP_004507328.1| PREDICTED: endonuclease III-like protein 1-like [Cicer arietinum] Length = 387 Score = 410 bits (1053), Expect = e-112 Identities = 218/363 (60%), Positives = 251/363 (69%), Gaps = 37/363 (10%) Frame = -2 Query: 1249 ETRSVSAKSQSKPEIPKPKAEPNAG-------IRVFARK------RRSKCTVETHVEE-H 1112 +TRS S P KP N RV+ R+ +R+K T +++ H Sbjct: 21 KTRSFHKSPLSNPSSVKPSDSTNDASVSHQQVTRVYVRRNNSNNNKRAKGITTTKLQQNH 80 Query: 1111 
KIESPQQKKKLCDLSDIEEFAYGEVNGSAQMSKVD-----------------------AP 1001 + Q KK L +IE+FAY N Q K + +P Sbjct: 81 HLPPTQTHKKFGGLPEIEDFAYRGPNELTQFRKSEISSDVIVKPAEESEVASAAHRSESP 140 Query: 1000 ANWEEVLDGIRNMRSYEDAPVDSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDGVTHGA 821 A+WEE L+GIR MR DAPVD+MGCEKAGS+LPPKERRFAVLVSSLLSSQTKD V HGA Sbjct: 141 ADWEETLEGIRKMRCSADAPVDTMGCEKAGSTLPPKERRFAVLVSSLLSSQTKDHVNHGA 200 Query: 820 IQRLHQNDLLAPDAIDNTEEETIKNLIYPVGFYSRKASNLKKIAKICLMKYGGDIPSSLK 641 IQRL QNDLL PDAI+N +EETIK LIYPVGFY+RKA+NLKKIA ICLMKYGGDIPS+L+ Sbjct: 201 IQRLLQNDLLTPDAINNADEETIKKLIYPVGFYTRKATNLKKIANICLMKYGGDIPSTLE 260 Query: 640 DLLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSRPGTGQKTSSPEETRE 461 LLLLPGIGPKMAHLVMNV WNNVQGICVDTHVHRICNRL WVSR GT QKT +PEETRE Sbjct: 261 QLLLLPGIGPKMAHLVMNVAWNNVQGICVDTHVHRICNRLGWVSRLGTKQKTLTPEETRE 320 Query: 460 SLQLWLPKDEWIAINPLLVGFGQTVCTPLRPRCGMCSINRLCPSAFKETTSPVPRTKNSG 281 SLQ WLP++EW INPLLVGFGQT+CTPLRPRCG C I+ LC SAFKE + +K++ Sbjct: 321 SLQRWLPREEWDPINPLLVGFGQTICTPLRPRCGECGISHLCLSAFKEASDSSSFSKSTK 380 Query: 280 RRK 272 R+ Sbjct: 381 SRR 383 >ref|XP_002309812.1| endonuclease-related family protein [Populus trichocarpa] gi|222852715|gb|EEE90262.1| endonuclease-related family protein [Populus trichocarpa] Length = 362 Score = 409 bits (1052), Expect = e-111 Identities = 217/357 (60%), Positives = 250/357 (70%), Gaps = 27/357 (7%) Frame = -2 Query: 1261 RPMPETRSVSAKSQSKPEI------PKPKAEPNAGIRVFARKRRSKCTVETHVEEHKIES 1100 + MP TR S QSK EI P P +RVF RKR+ K TVE +E K+E Sbjct: 23 KKMPNTRFSSKSLQSKTEISTSDTVPGPNEVSVPEVRVFVRKRKVKTTVEAAEKEVKVEP 82 Query: 1099 PQQKKKLCDLSDIEEFAYGEVNGSAQMSKV---------------------DAPANWEEV 983 +K+KL L DIEEFAY + NG A + K+ + P NW++V Sbjct: 83 --RKQKLSALPDIEEFAYKKGNGPALIRKLKSTENVLPVDSEAASTIRPAGEPPLNWDKV 140 Query: 982 LDGIRNMRSYEDAPVDSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQ 803 L+GI MRS EDAPVD+MGCEKAG SLPP V++S+ GAIQRL Q Sbjct: 141 LEGIHKMRSSEDAPVDTMGCEKAGISLPP-----GVVLSA------------GAIQRLQQ 183 Query: 802 
NDLLAPDAIDNTEEETIKNLIYPVGFYSRKASNLKKIAKICLMKYGGDIPSSLKDLLLLP 623 N+LL DAID +E IK+LIYPVGFY+RKASNLKKIAKICL+KY GDIPSSL+DLL LP Sbjct: 184 NNLLTADAIDKADETAIKDLIYPVGFYTRKASNLKKIAKICLLKYDGDIPSSLEDLLSLP 243 Query: 622 GIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSRPGTGQKTSSPEETRESLQLWL 443 GIGPKMAHLVMN+ WNNVQGICVDTHVHRICNRL WV+RPGT QKTS+PEETRE+LQLWL Sbjct: 244 GIGPKMAHLVMNIAWNNVQGICVDTHVHRICNRLGWVARPGTKQKTSTPEETREALQLWL 303 Query: 442 PKDEWIAINPLLVGFGQTVCTPLRPRCGMCSINRLCPSAFKETTSPVPRTKNSGRRK 272 PKDEW+ INPLLVGFGQT+CTPLRPRCGMC I+ CPSAFKET+SP + K SG K Sbjct: 304 PKDEWVPINPLLVGFGQTICTPLRPRCGMCCISEFCPSAFKETSSPASKQKRSGGSK 360 >ref|XP_006347463.1| PREDICTED: endonuclease III-like protein 1-like [Solanum tuberosum] Length = 422 Score = 409 bits (1051), Expect = e-111 Identities = 198/271 (73%), Positives = 223/271 (82%) Frame = -2 Query: 1102 SPQQKKKLCDLSDIEEFAYGEVNGSAQMSKVDAPANWEEVLDGIRNMRSYEDAPVDSMGC 923 +P + +L + + E+ G + + P NWE+VL+GIR MRS EDAPVDSMGC Sbjct: 151 APSKSVRLTGEKALSQLTQTEIKGFSLSDPLQPPLNWEKVLEGIRKMRSAEDAPVDSMGC 210 Query: 922 EKAGSSLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQNDLLAPDAIDNTEEETIKNL 743 EKAGSSLP KERRFAVLVSSLLSSQTKD V HGAIQRL QN LLA DAID+ EETIK+L Sbjct: 211 EKAGSSLPAKERRFAVLVSSLLSSQTKDQVNHGAIQRLLQNGLLAADAIDSANEETIKSL 270 Query: 742 IYPVGFYSRKASNLKKIAKICLMKYGGDIPSSLKDLLLLPGIGPKMAHLVMNVGWNNVQG 563 IYPVGFY+RKASNLKK+AKICL KY GDIPSSL++LLLLPGIGPKMAHLVMNV W NVQG Sbjct: 271 IYPVGFYTRKASNLKKVAKICLSKYNGDIPSSLEELLLLPGIGPKMAHLVMNVAWENVQG 330 Query: 562 ICVDTHVHRICNRLRWVSRPGTGQKTSSPEETRESLQLWLPKDEWIAINPLLVGFGQTVC 383 ICVDTHVHRI NRL WVSRPGT QKT +PEETRESLQLWLPK+EW+ INPLLVGFGQT+C Sbjct: 331 ICVDTHVHRISNRLGWVSRPGTKQKTRTPEETRESLQLWLPKEEWVPINPLLVGFGQTIC 390 Query: 382 TPLRPRCGMCSINRLCPSAFKETTSPVPRTK 290 TPLRPRC +C+++ LCPSAFKE SP +K Sbjct: 391 TPLRPRCAICTVSDLCPSAFKEAASPSSTSK 421 >ref|XP_006410282.1| hypothetical protein EUTSA_v10016815mg [Eutrema salsugineum] gi|557111451|gb|ESQ51735.1| hypothetical protein EUTSA_v10016815mg [Eutrema salsugineum] Length = 373 Score = 407 bits (1046), Expect 
= e-111 Identities = 205/339 (60%), Positives = 249/339 (73%), Gaps = 16/339 (4%) Frame = -2 Query: 1240 SVSAKSQSKPEIPKPKAEPNAG--IRVFARKRRSKCTVETHVEEHKIESPQQKKKLCDLS 1067 S + S P+ A+P +G RV+ RK+R K +E+ + Q K+LC L Sbjct: 35 SKPTQQHSLPDSDPEPAKPASGSETRVYTRKKRLKQEAFQPLEKDSCINTQ--KQLCRLP 92 Query: 1066 DIEEFAYGEVNGSAQMSKV--------------DAPANWEEVLDGIRNMRSYEDAPVDSM 929 DIEEFAY + S+ + +AP NW +VL+GIR MRS EDAPVDSM Sbjct: 93 DIEEFAYKKNTRSSSSRRSTETSITVTSVKTAGNAPENWVKVLEGIRQMRSSEDAPVDSM 152 Query: 928 GCEKAGSSLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQNDLLAPDAIDNTEEETIK 749 GC+KAGS LPP ERRFAVL+ +LLSSQTKD V + AI RLHQN LL P+A+D +E T++ Sbjct: 153 GCDKAGSFLPPTERRFAVLLGALLSSQTKDEVNNAAIHRLHQNGLLTPEAVDKADESTLR 212 Query: 748 NLIYPVGFYSRKASNLKKIAKICLMKYGGDIPSSLKDLLLLPGIGPKMAHLVMNVGWNNV 569 LIYPVGFY+RKA+ +KKIAKICL+KY GDIPSSL DLL LPGIGPKMAHL++++ WN+V Sbjct: 213 ELIYPVGFYTRKATYMKKIAKICLVKYNGDIPSSLDDLLALPGIGPKMAHLILHIAWNDV 272 Query: 568 QGICVDTHVHRICNRLRWVSRPGTGQKTSSPEETRESLQLWLPKDEWIAINPLLVGFGQT 389 QGICVDTHVHRICNRL WVSRPGT QKTSSPEETR +LQ WLPK+EW+AINPLLVGFGQT Sbjct: 273 QGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRVALQQWLPKEEWVAINPLLVGFGQT 332 Query: 388 VCTPLRPRCGMCSINRLCPSAFKETTSPVPRTKNSGRRK 272 +CTPLRPRC CS+ +LCP+AFKE +SP + K S + K Sbjct: 333 ICTPLRPRCETCSVTKLCPAAFKEASSPSSKLKKSKQSK 371 >ref|XP_002881177.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297327016|gb|EFH57436.1| predicted protein [Arabidopsis lyrata subsp. 
lyrata] Length = 354 Score = 404 bits (1038), Expect = e-110 Identities = 203/345 (58%), Positives = 249/345 (72%), Gaps = 24/345 (6%) Frame = -2 Query: 1234 SAKSQSKP---EIPKPKAEPNAG-------IRVFARKRRSKCTVETHVEEHKIESPQQKK 1085 + S SKP + +P ++ N+ RV+ RK+R K +E + + K Sbjct: 8 AVSSSSKPISSKTQRPLSDSNSANGASGSVTRVYTRKKRLKQEASEPLEINPGKGVNTHK 67 Query: 1084 KLCDLSDIEEFAYGEVNGSAQMSKV--------------DAPANWEEVLDGIRNMRSYED 947 +L L DIE+FAY + GS + + P NW +VL+GIR MRS ED Sbjct: 68 QLRGLPDIEDFAYKKTIGSPSSRRSTETSITVTSVKTAGNPPENWVKVLEGIRQMRSSED 127 Query: 946 APVDSMGCEKAGSSLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQNDLLAPDAIDNT 767 APVDSMGC+KAGS LPP ERRFAVL+ +LLSSQTKD V + AI RLHQN LL P+A+D Sbjct: 128 APVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDQVNNAAIHRLHQNSLLTPEAVDKA 187 Query: 766 EEETIKNLIYPVGFYSRKASNLKKIAKICLMKYGGDIPSSLKDLLLLPGIGPKMAHLVMN 587 +E TI+ LIYPVGFY+RKA+ +KKIA+ICL+KY GDIPSSL DLL LPGIGPKMAHL+++ Sbjct: 188 DESTIRELIYPVGFYTRKATYMKKIARICLVKYNGDIPSSLDDLLSLPGIGPKMAHLILH 247 Query: 586 VGWNNVQGICVDTHVHRICNRLRWVSRPGTGQKTSSPEETRESLQLWLPKDEWIAINPLL 407 + WN+VQGICVDTHVHRICNRL WVSRPGT QKT+SPEETR +LQ WLPK+EW+AINPLL Sbjct: 248 IAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTTSPEETRVALQQWLPKEEWVAINPLL 307 Query: 406 VGFGQTVCTPLRPRCGMCSINRLCPSAFKETTSPVPRTKNSGRRK 272 VGFGQT+CTPLRPRC CS+ +LCP+AFKET+SP + K S R K Sbjct: 308 VGFGQTICTPLRPRCEACSVTKLCPAAFKETSSPSSKLKKSNRSK 352 >emb|CAC16135.1| endonuclease III homologue [Arabidopsis thaliana] Length = 354 Score = 404 bits (1038), Expect = e-110 Identities = 200/314 (63%), Positives = 238/314 (75%), Gaps = 14/314 (4%) Frame = -2 Query: 1171 RVFARKRRSKCTVETHVEEHKIESPQQKKKLCDLSDIEEFAYGEVNGSAQMSKVDA---- 1004 RV+ RK+R K +E++ + K LC L DIE+FAY + GS S+ Sbjct: 40 RVYTRKKRLKQEPFEPLEKYSGKGVNTHK-LCGLPDIEDFAYKKTIGSPSSSRSTETSIT 98 Query: 1003 ----------PANWEEVLDGIRNMRSYEDAPVDSMGCEKAGSSLPPKERRFAVLVSSLLS 854 P NW EVL+GIR MRS EDAPVDSMGC+KAGS LPP ERRFAVL+ +LLS Sbjct: 99 VTSVKTAGYPPENWVEVLEGIRQMRSSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLS 158 Query: 853 
SQTKDGVTHGAIQRLHQNDLLAPDAIDNTEEETIKNLIYPVGFYSRKASNLKKIAKICLM 674 SQTKD V + AI RLHQN LL P+A+D +E TIK LIYPVGFY+RKA+ +KKIA+ICL+ Sbjct: 159 SQTKDQVNNAAIHRLHQNGLLTPEAVDKADESTIKELIYPVGFYTRKATYMKKIARICLV 218 Query: 673 KYGGDIPSSLKDLLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSRPGTG 494 KY GDIPSSL DLL LPGIGPKMAHL++++ WN+VQGICVDTHVHRICNRL WVSRPGT Sbjct: 219 KYDGDIPSSLDDLLSLPGIGPKMAHLILHIAWNDVQGICVDTHVHRICNRLGWVSRPGTK 278 Query: 493 QKTSSPEETRESLQLWLPKDEWIAINPLLVGFGQTVCTPLRPRCGMCSINRLCPSAFKET 314 QKT+SPEETR +LQ WLPK+EW+AINPLLVGFGQ +CTPLRPRC CS+++LCP+AFKET Sbjct: 279 QKTTSPEETRVALQQWLPKEEWVAINPLLVGFGQMICTPLRPRCEACSVSKLCPAAFKET 338 Query: 313 TSPVPRTKNSGRRK 272 +SP + K S R K Sbjct: 339 SSPSSKLKKSNRSK 352 >ref|NP_565725.1| protein NTH1 [Arabidopsis thaliana] gi|75206080|sp|Q9SIC4.2|NTH1_ARATH RecName: Full=Endonuclease III homolog 1, chloroplastic; Short=AtNTH1; AltName: Full=Bifunctional DNA N-glycoslyase/DNA-(apurinic or apyrimidinic site) lyase 1; Short=DNA glycoslyase/AP lyase 1; Flags: Precursor gi|20198157|gb|AAD26474.2| putative endonuclease [Arabidopsis thaliana] gi|330253455|gb|AEC08549.1| protein NTH1 [Arabidopsis thaliana] Length = 379 Score = 403 bits (1036), Expect = e-110 Identities = 199/314 (63%), Positives = 238/314 (75%), Gaps = 14/314 (4%) Frame = -2 Query: 1171 RVFARKRRSKCTVETHVEEHKIESPQQKKKLCDLSDIEEFAYGEVNGSAQMSKVDA---- 1004 RV+ RK+R K +E++ + K LC L DIE+FAY + GS S+ Sbjct: 65 RVYTRKKRLKQEPFEPLEKYSGKGVNTHK-LCGLPDIEDFAYKKTIGSPSSSRSTETSIT 123 Query: 1003 ----------PANWEEVLDGIRNMRSYEDAPVDSMGCEKAGSSLPPKERRFAVLVSSLLS 854 P NW EVL+GIR MRS EDAPVDSMGC+KAGS LPP ERRFAVL+ +LLS Sbjct: 124 VTSVKTAGYPPENWVEVLEGIRQMRSSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLS 183 Query: 853 SQTKDGVTHGAIQRLHQNDLLAPDAIDNTEEETIKNLIYPVGFYSRKASNLKKIAKICLM 674 SQTKD V + AI RLHQN LL P+A+D +E TIK LIYPVGFY+RKA+ +KKIA+ICL+ Sbjct: 184 SQTKDQVNNAAIHRLHQNGLLTPEAVDKADESTIKELIYPVGFYTRKATYMKKIARICLV 243 Query: 673 KYGGDIPSSLKDLLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSRPGTG 494 KY GDIPSSL DLL LPGIGPKMAHL++++ 
WN+VQGICVDTHVHRICNRL WVSRPGT Sbjct: 244 KYDGDIPSSLDDLLSLPGIGPKMAHLILHIAWNDVQGICVDTHVHRICNRLGWVSRPGTK 303 Query: 493 QKTSSPEETRESLQLWLPKDEWIAINPLLVGFGQTVCTPLRPRCGMCSINRLCPSAFKET 314 QKT+SPEETR +LQ WLPK+EW+AINPLLVGFGQ +CTP+RPRC CS+++LCP+AFKET Sbjct: 304 QKTTSPEETRVALQQWLPKEEWVAINPLLVGFGQMICTPIRPRCEACSVSKLCPAAFKET 363 Query: 313 TSPVPRTKNSGRRK 272 +SP + K S R K Sbjct: 364 SSPSSKLKKSNRSK 377