BLASTX nr result
ID: Catharanthus23_contig00010564
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00010564
         (1306 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EOY04995.1| DNA glycosylase superfamily protein isoform 3 [Th...   431   e-118
ref|XP_002264475.2| PREDICTED: endonuclease III-like protein 1-l...   431   e-118
gb|EMJ23912.1| hypothetical protein PRUPE_ppa009900mg [Prunus pe...   430   e-118
emb|CBI36652.3| unnamed protein product [Vitis vinifera]              426   e-117
ref|XP_004241397.1| PREDICTED: endonuclease III-like protein 1-l...   422   e-115
ref|XP_006443060.1| hypothetical protein CICLE_v10020813mg [Citr...   418   e-114
gb|EOY04993.1| DNA glycosylase superfamily protein isoform 1 [Th...   418   e-114
ref|XP_004309826.1| PREDICTED: endonuclease III-like protein 1-l...   416   e-113
ref|XP_006494172.1| PREDICTED: endonuclease III-like protein 1-l...   414   e-113
ref|XP_002534117.1| endonuclease III, putative [Ricinus communis...   411   e-112
ref|XP_006347463.1| PREDICTED: endonuclease III-like protein 1-l...   410   e-112
gb|ESW03953.1| hypothetical protein PHAVU_011G055100g [Phaseolus...   396   e-108
gb|ESW03954.1| hypothetical protein PHAVU_011G055100g [Phaseolus...   395   e-107
ref|XP_003539044.2| PREDICTED: endonuclease III-like protein 1-l...   394   e-107
ref|XP_004507328.1| PREDICTED: endonuclease III-like protein 1-l...   391   e-106
ref|XP_004152104.1| PREDICTED: endonuclease III-like protein 1-l...   386   e-105
ref|XP_006410282.1| hypothetical protein EUTSA_v10016815mg [Eutr...   386   e-104
emb|CAC16135.1| endonuclease III homologue [Arabidopsis thaliana]     385   e-104
ref|NP_565725.1| protein NTH1 [Arabidopsis thaliana] gi|20198157...
384 e-104 ref|NP_001077988.1| protein NTH1 [Arabidopsis thaliana] gi|17380... 384 e-104 >gb|EOY04995.1| DNA glycosylase superfamily protein isoform 3 [Theobroma cacao] Length = 364 Score = 431 bits (1107), Expect = e-118 Identities = 223/351 (63%), Positives = 259/351 (73%), Gaps = 5/351 (1%) Frame = +2 Query: 2 SFPLRIPSSLPLFSARINKIKMPQTSFSSKTQNPTTRNKNPGNENVNAGSKTELQXXXXX 181 SFPL + L ++N KMP+T + KT + ++ + P ++ T+ Sbjct: 7 SFPLGF--GVGLGGMKLNS-KMPKTRLAFKTLSSSSTTEVPSSDPNPGSETTDNVSVPAV 63 Query: 182 XXXXXXXXXXXXXDITEEHVQQ-----KLCRPLDIEDFAYGKSCGYSWSTLPPDNWERVL 346 D+ +E + KLC DIE+FAY K G S S P NWE+VL Sbjct: 64 RVFTRKKRVKKTVDVVQEIPKAENKGLKLCGLPDIEEFAYKKVDGPSLSGNAPANWEKVL 123 Query: 347 EGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRLLQQ 526 EGIRKMRS+EDAPVD+MGCEKAG+ LPPKERRFAVL SSLLSSQTKDHVTHGAIQRL+Q Sbjct: 124 EGIRKMRSAEDAPVDTMGCEKAGSVLPPKERRFAVLISSLLSSQTKDHVTHGAIQRLIQN 183 Query: 527 DLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTXXXXXXXPG 706 L+TPDA+DKA+EATIK LIYPVGFYTRKA N+KKIAKICL+KY GDIPS+ PG Sbjct: 184 CLMTPDAIDKADEATIKDLIYPVGFYTRKAINVKKIAKICLMKYDGDIPSSLEELLLLPG 243 Query: 707 IGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRESLQLWLP 886 IGPKMAHLVMN+AWD+VQGICVDTHVHRICNRLGWVSRPGTKQKT PEETR +LQ WLP Sbjct: 244 IGPKMAHLVMNIAWDDVQGICVDTHVHRICNRLGWVSRPGTKQKTLYPEETRVALQQWLP 303 Query: 887 KEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSSTPRK 1039 KEEWVPINPLLVGFGQT+C+PLRP+C +C I+EFCPSAFKE SPSS +K Sbjct: 304 KEEWVPINPLLVGFGQTICTPLRPQCEVCSITEFCPSAFKETSSPSSKVKK 354 >ref|XP_002264475.2| PREDICTED: endonuclease III-like protein 1-like [Vitis vinifera] Length = 355 Score = 431 bits (1107), Expect = e-118 Identities = 222/349 (63%), Positives = 250/349 (71%), Gaps = 23/349 (6%) Frame = +2 Query: 65 MPQTSFSSKTQNPTTRNKNPGNENVNAGSKTELQXXXXXXXXXXXXXXXXXXDITEEHVQ 244 M + + SSK P ++K +E G +I E Q Sbjct: 1 MSRATSSSKPLLPALQSKTSAHEETPNGVSGSEVRVFVRKKRVKMAVETPEKEIKAEPQQ 60 Query: 245 QKLCRPLDIEDFAY--GKSCGYSWSTLP---------------------PDNWERVLEGI 355 QK+C DIE+F Y GK + + P P 
NWE++LEGI Sbjct: 61 QKICELPDIEEFTYRKGKRSTHLRKSKPTSDVPPGGTEITSSIRPAAELPANWEKILEGI 120 Query: 356 RKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRLLQQDLL 535 RKMRSSEDAPVDSMGCEKAG+SLPP+ERRFAVL SSLLSSQTKD+VTHGAIQRLLQ LL Sbjct: 121 RKMRSSEDAPVDSMGCEKAGSSLPPRERRFAVLVSSLLSSQTKDNVTHGAIQRLLQNGLL 180 Query: 536 TPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTXXXXXXXPGIGP 715 DA+DKA+EAT+KSLIYPVGFY+RKAGNLKKIAKICL+KY GDIPS+ PGIGP Sbjct: 181 VADAIDKADEATVKSLIYPVGFYSRKAGNLKKIAKICLMKYDGDIPSSLEELLLLPGIGP 240 Query: 716 KMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRESLQLWLPKEE 895 KMAHLVMNVAW+NVQGICVDTHVHRICNRLGWVSR GTKQKTS PEETRESLQLWLPKEE Sbjct: 241 KMAHLVMNVAWNNVQGICVDTHVHRICNRLGWVSRRGTKQKTSLPEETRESLQLWLPKEE 300 Query: 896 WVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSSTPRKP 1042 WVPINPLLVGFGQT+C+PLRPRCG+C +S+ CPSAFKE SPSS +KP Sbjct: 301 WVPINPLLVGFGQTICTPLRPRCGVCGVSDLCPSAFKEAQSPSSKMKKP 349 >gb|EMJ23912.1| hypothetical protein PRUPE_ppa009900mg [Prunus persica] Length = 272 Score = 430 bits (1105), Expect = e-118 Identities = 210/267 (78%), Positives = 229/267 (85%) Frame = +2 Query: 248 KLCRPLDIEDFAYGKSCGYSWSTLPPDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLP 427 +L P DIE+FAY K + S+ PP NWE+VLEGIRKMRSSEDAPVDSMGCEKAG++LP Sbjct: 2 QLASPPDIEEFAYTKVSASTNSSKPPANWEKVLEGIRKMRSSEDAPVDSMGCEKAGSALP 61 Query: 428 PKERRFAVLASSLLSSQTKDHVTHGAIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYT 607 PKERRFAVL SSLLSSQTKDHVTHGAIQRLLQ +LL D++DKAEEATIKSLIYPVGFYT Sbjct: 62 PKERRFAVLVSSLLSSQTKDHVTHGAIQRLLQNNLLAADSIDKAEEATIKSLIYPVGFYT 121 Query: 608 RKAGNLKKIAKICLLKYGGDIPSTXXXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVH 787 RKA NLKKIAKICL KY GDIPS+ PGIGPKMAHLVMNV W+NVQGICVDTHVH Sbjct: 122 RKATNLKKIAKICLTKYDGDIPSSLDELLSLPGIGPKMAHLVMNVGWNNVQGICVDTHVH 181 Query: 788 RICNRLGWVSRPGTKQKTSSPEETRESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCG 967 RI NRLGWVSR G KQKTS+PEETRE+LQLWLPKEEW PINPLLVGFGQTVC+PLRP CG Sbjct: 182 RISNRLGWVSREGRKQKTSNPEETREALQLWLPKEEWDPINPLLVGFGQTVCTPLRPHCG 241 Query: 968 ICIISEFCPSAFKEG*SPSSTPRKPRL 1048 +C +S+FCPSAFKE SPSS +K L Sbjct: 
242 VCNVSKFCPSAFKEASSPSSKSKKSGL 268 >emb|CBI36652.3| unnamed protein product [Vitis vinifera] Length = 379 Score = 426 bits (1096), Expect = e-117 Identities = 227/365 (62%), Positives = 256/365 (70%), Gaps = 26/365 (7%) Frame = +2 Query: 26 SLPLFSARINKIKMPQTSFSSKTQNPTTRNKNPGNENVNAGSKTELQXXXXXXXXXXXXX 205 +L L S RI M + + SSK P ++K +E G Sbjct: 10 TLALASVRITW-PMSRATSSSKPLLPALQSKTSAHEETPNGVSGSEVRVFVRKKRVKMAV 68 Query: 206 XXXXXDITEEHVQQKLCRPLDIEDFAY--GKSCGYSWSTLP------------------- 322 +I E QQK+C DIE+F Y GK + + P Sbjct: 69 ETPEKEIKAEPQQQKICELPDIEEFTYRKGKRSTHLRKSKPTSDVPPGGTEITSSIRPAA 128 Query: 323 --PDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVT 496 P NWE++LEGIRKMRSSEDAPVDSMGCEKAG+SLPP+ERRFAVL SSLLSSQTKD+VT Sbjct: 129 ELPANWEKILEGIRKMRSSEDAPVDSMGCEKAGSSLPPRERRFAVLVSSLLSSQTKDNVT 188 Query: 497 HG---AIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGD 667 HG AIQRLLQ LL DA+DKA+EAT+KSLIYPVGFY+RKAGNLKKIAKICL+KY GD Sbjct: 189 HGNAGAIQRLLQNGLLVADAIDKADEATVKSLIYPVGFYSRKAGNLKKIAKICLMKYDGD 248 Query: 668 IPSTXXXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSS 847 IPS+ PGIGPKMAHLVMNVAW+NVQGICVDTHVHRICNRLGWVSR GTKQKTS Sbjct: 249 IPSSLEELLLLPGIGPKMAHLVMNVAWNNVQGICVDTHVHRICNRLGWVSRRGTKQKTSL 308 Query: 848 PEETRESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSS 1027 PEETRESLQLWLPKEEWVPINPLLVGFGQT+C+PLRPRCG+C +S+ CPSAFKE SPSS Sbjct: 309 PEETRESLQLWLPKEEWVPINPLLVGFGQTICTPLRPRCGVCGVSDLCPSAFKEAQSPSS 368 Query: 1028 TPRKP 1042 +KP Sbjct: 369 KMKKP 373 >ref|XP_004241397.1| PREDICTED: endonuclease III-like protein 1-like [Solanum lycopersicum] Length = 380 Score = 422 bits (1084), Expect = e-115 Identities = 229/378 (60%), Positives = 254/378 (67%), Gaps = 35/378 (9%) Frame = +2 Query: 11 LRIPSSLPLFSARINKI---KMPQTSFSSKTQNPTTRNKNPGNENVNAGSKTELQXXXXX 181 LR + LP S I I KM +T S + P+ KNPG + S EL+ Sbjct: 6 LRNTAFLPSISLGIQTISSAKMRRTRSSLNQETPS--QKNPGCDGTGGSSVPELRVFIRR 63 Query: 182 XXXXXXXXXXXXXDITEEHVQQK--LCRPLDIEDFAYGKSCGYSWST------------- 316 ++ EE +K L R DIEDF+Y K + ST 
Sbjct: 64 KRVKKTVEVIAK-EVKEESSGKKVMLVRLPDIEDFSYSKDITHPQSTPSKTVRLTGEKTL 122 Query: 317 -----------------LPPDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRF 445 PP NWE+VLEGIRKMRS+EDAPVDSMGCEKAG+SLP KERRF Sbjct: 123 PQLMQTEIKGFSLSDPLQPPSNWEKVLEGIRKMRSAEDAPVDSMGCEKAGSSLPAKERRF 182 Query: 446 AVLASSLLSSQTKDHVTHGAIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNL 625 AVL SSLLSSQTKD V HGA+QRLLQ LL DA+D A E TIKSLIYPVGFYTRKA NL Sbjct: 183 AVLVSSLLSSQTKDQVNHGAVQRLLQNGLLAADAIDSANEETIKSLIYPVGFYTRKASNL 242 Query: 626 KKIAKICLLKYGGDIPSTXXXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRL 805 KK+AKICL KY GDIPS+ PGIGPKMAHLVMNVAW+NVQGICVDTHVHRI NRL Sbjct: 243 KKVAKICLSKYNGDIPSSLEELLLLPGIGPKMAHLVMNVAWENVQGICVDTHVHRISNRL 302 Query: 806 GWVSRPGTKQKTSSPEETRESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISE 985 WVSRPGTKQKT +PEETRESLQLWLPKEEWVPINPLLVGFGQT+C+PLRPRC IC +S+ Sbjct: 303 EWVSRPGTKQKTRTPEETRESLQLWLPKEEWVPINPLLVGFGQTICTPLRPRCAICTVSD 362 Query: 986 FCPSAFKEG*SPSSTPRK 1039 CPSAFKE SPSSTP+K Sbjct: 363 LCPSAFKEAASPSSTPKK 380 >ref|XP_006443060.1| hypothetical protein CICLE_v10020813mg [Citrus clementina] gi|557545322|gb|ESR56300.1| hypothetical protein CICLE_v10020813mg [Citrus clementina] Length = 357 Score = 418 bits (1074), Expect = e-114 Identities = 211/301 (70%), Positives = 232/301 (77%), Gaps = 27/301 (8%) Frame = +2 Query: 230 EEHVQQKLCRPLDIEDFAY---------GKSCGYSWSTL------------------PPD 328 E ++ K C DIE+FAY K G S ST PP Sbjct: 57 EAPIEHKSCGLPDIEEFAYKEANGSALSSKIAGKSKSTQDMPVVGTEVASLNRMRGEPPA 116 Query: 329 NWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHGAI 508 NWERVLEGIRKMR+SEDAPVDSMGCEKAG+SLPP+ERRFAVL SSLLSSQTKD+VTHGAI Sbjct: 117 NWERVLEGIRKMRTSEDAPVDSMGCEKAGSSLPPRERRFAVLISSLLSSQTKDNVTHGAI 176 Query: 509 QRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTXXX 688 QRLLQ LLT +A+DKA+EATIK LIYPVGFYTRKA N+KKIA ICL KY GDIPS+ Sbjct: 177 QRLLQNGLLTAEAIDKADEATIKDLIYPVGFYTRKASNMKKIAPICLTKYDGDIPSSLDE 236 Query: 689 XXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRES 868 PGIGPKMAHLVMNV 
W+NVQGICVDTHVHRICNRLGWVS+PG KQKTSSPE+TRE Sbjct: 237 LLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLGWVSQPGRKQKTSSPEQTREV 296 Query: 869 LQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSSTPRKPRL 1048 LQLWLPKEEWVPINPLLVGFGQT+C+P+RPRCG+C +SE CPSAFK+ SPSS RK Sbjct: 297 LQLWLPKEEWVPINPLLVGFGQTICTPIRPRCGMCSVSELCPSAFKDSSSPSSKSRKSAQ 356 Query: 1049 K 1051 K Sbjct: 357 K 357 >gb|EOY04993.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] Length = 387 Score = 418 bits (1074), Expect = e-114 Identities = 223/374 (59%), Positives = 259/374 (69%), Gaps = 28/374 (7%) Frame = +2 Query: 2 SFPLRIPSSLPLFSARINKIKMPQTSFSSKTQNPTTRNKNPGNENVNAGSKTELQXXXXX 181 SFPL + L ++N KMP+T + KT + ++ + P ++ T+ Sbjct: 7 SFPLGF--GVGLGGMKLNS-KMPKTRLAFKTLSSSSTTEVPSSDPNPGSETTDNVSVPAV 63 Query: 182 XXXXXXXXXXXXXDITEEHVQQ-----KLCRPLDIEDFAYGKSCGYSWSTLP-------- 322 D+ +E + KLC DIE+FAY K G S S Sbjct: 64 RVFTRKKRVKKTVDVVQEIPKAENKGLKLCGLPDIEEFAYKKVDGPSLSGKSKSTSDEIN 123 Query: 323 ---------------PDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLA 457 P NWE+VLEGIRKMRS+EDAPVD+MGCEKAG+ LPPKERRFAVL Sbjct: 124 VGTGIASPVGIGGNAPANWEKVLEGIRKMRSAEDAPVDTMGCEKAGSVLPPKERRFAVLI 183 Query: 458 SSLLSSQTKDHVTHGAIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIA 637 SSLLSSQTKDHVTHGAIQRL+Q L+TPDA+DKA+EATIK LIYPVGFYTRKA N+KKIA Sbjct: 184 SSLLSSQTKDHVTHGAIQRLIQNCLMTPDAIDKADEATIKDLIYPVGFYTRKAINVKKIA 243 Query: 638 KICLLKYGGDIPSTXXXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVS 817 KICL+KY GDIPS+ PGIGPKMAHLVMN+AWD+VQGICVDTHVHRICNRLGWVS Sbjct: 244 KICLMKYDGDIPSSLEELLLLPGIGPKMAHLVMNIAWDDVQGICVDTHVHRICNRLGWVS 303 Query: 818 RPGTKQKTSSPEETRESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPS 997 RPGTKQKT PEETR +LQ WLPKEEWVPINPLLVGFGQT+C+PLRP+C +C I+EFCPS Sbjct: 304 RPGTKQKTLYPEETRVALQQWLPKEEWVPINPLLVGFGQTICTPLRPQCEVCSITEFCPS 363 Query: 998 AFKEG*SPSSTPRK 1039 AFKE SPSS +K Sbjct: 364 AFKETSSPSSKVKK 377 >ref|XP_004309826.1| PREDICTED: endonuclease III-like protein 1-like [Fragaria vesca subsp. 
vesca] Length = 341 Score = 416 bits (1068), Expect = e-113 Identities = 202/261 (77%), Positives = 224/261 (85%), Gaps = 3/261 (1%) Frame = +2 Query: 266 DIEDFAYGKSCGYSWST---LPPDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKE 436 DIE+FAY S+ST PP +WE+VLEGIRKMRS+EDAPVDSMGCEKAG++LPPKE Sbjct: 75 DIEEFAYRNESSSSYSTDIGKPPAHWEKVLEGIRKMRSAEDAPVDSMGCEKAGSALPPKE 134 Query: 437 RRFAVLASSLLSSQTKDHVTHGAIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKA 616 RRFAVL SSLLSSQTKD VTHGA+QRLLQ +L+ DA+DK +E TIKSLIYPVGFYTRKA Sbjct: 135 RRFAVLVSSLLSSQTKDQVTHGAVQRLLQNGMLSADAIDKGDEPTIKSLIYPVGFYTRKA 194 Query: 617 GNLKKIAKICLLKYGGDIPSTXXXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRIC 796 NLKKIA ICL+KY GDIPS+ PGIGPKMAHLVMNVAWDNVQGICVDTHVHRIC Sbjct: 195 SNLKKIANICLVKYDGDIPSSLEELLSLPGIGPKMAHLVMNVAWDNVQGICVDTHVHRIC 254 Query: 797 NRLGWVSRPGTKQKTSSPEETRESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICI 976 NRLGWV R G KQKTS+PEETRE+LQLWLPK+EWVPINPLLVGFGQTVC+PLRPRCG+C Sbjct: 255 NRLGWV-RAGKKQKTSNPEETREALQLWLPKDEWVPINPLLVGFGQTVCTPLRPRCGVCS 313 Query: 977 ISEFCPSAFKEG*SPSSTPRK 1039 +SEFCPSA+KE SP S +K Sbjct: 314 VSEFCPSAYKETSSPLSKTKK 334 >ref|XP_006494172.1| PREDICTED: endonuclease III-like protein 1-like [Citrus sinensis] Length = 357 Score = 414 bits (1064), Expect = e-113 Identities = 210/301 (69%), Positives = 231/301 (76%), Gaps = 27/301 (8%) Frame = +2 Query: 230 EEHVQQKLCRPLDIEDFAY---------GKSCGYSWSTL------------------PPD 328 E ++ K C DIE+FAY K G S ST PP Sbjct: 57 EAPIEHKSCGLPDIEEFAYKEANGSALSSKIAGKSKSTQDMPVVGTEVASLNRMRGEPPA 116 Query: 329 NWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHGAI 508 NWERVLEGIRKMR+SEDAPVDSMGCEKAG+SLPP+ERRFAVL SSLLSSQTKD+VTHGAI Sbjct: 117 NWERVLEGIRKMRTSEDAPVDSMGCEKAGSSLPPRERRFAVLISSLLSSQTKDNVTHGAI 176 Query: 509 QRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTXXX 688 QRLLQ LLT +A+DKA+EATIK LIY VGFYTRKA N+KKIA ICL KY GDIPS+ Sbjct: 177 QRLLQNGLLTAEAIDKADEATIKDLIYLVGFYTRKASNMKKIAPICLTKYDGDIPSSLDE 236 Query: 689 XXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRES 868 PGIGPKMAHLVMNV 
W+NVQGICVDTHVHRICNRLGWVS+PG KQKTSSPE+TRE Sbjct: 237 LLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLGWVSQPGRKQKTSSPEQTREV 296 Query: 869 LQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSSTPRKPRL 1048 LQLWLPKEEWVPINPLLVGFGQT+C+P+RPRCG+C +SE CPSAFK+ SPSS RK Sbjct: 297 LQLWLPKEEWVPINPLLVGFGQTICTPIRPRCGMCSVSELCPSAFKDSSSPSSKSRKSAQ 356 Query: 1049 K 1051 K Sbjct: 357 K 357 >ref|XP_002534117.1| endonuclease III, putative [Ricinus communis] gi|223525829|gb|EEF28268.1| endonuclease III, putative [Ricinus communis] Length = 357 Score = 411 bits (1057), Expect = e-112 Identities = 198/252 (78%), Positives = 217/252 (86%) Frame = +2 Query: 293 SCGYSWSTLPPDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLS 472 +C S PP NWE VLEGIRKMRSSEDAPVD+MGCEKAG+ LP KERRFAVL SSL+S Sbjct: 102 ACTIRPSDEPPANWEIVLEGIRKMRSSEDAPVDTMGCEKAGSFLPSKERRFAVLVSSLMS 161 Query: 473 SQTKDHVTHGAIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLL 652 SQTKDHVTHGA+QRL Q LLT DA+DKA+E TIK LIYPVGFYTRKA NLKKIAKICL+ Sbjct: 162 SQTKDHVTHGAVQRLHQNSLLTADAIDKADETTIKDLIYPVGFYTRKASNLKKIAKICLM 221 Query: 653 KYGGDIPSTXXXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTK 832 KY GDIP + PGIGPKMAHLVMNVAWD+VQGICVDTHVHRICNRLGWVSRPGT+ Sbjct: 222 KYDGDIPRSLEDLLSLPGIGPKMAHLVMNVAWDDVQGICVDTHVHRICNRLGWVSRPGTE 281 Query: 833 QKTSSPEETRESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKEG 1012 QKTS+PEETR +LQLWLPKEEWVPINPLLVGFGQT+C+PLRPRCG+C I+EFCPSAFKE Sbjct: 282 QKTSNPEETRVALQLWLPKEEWVPINPLLVGFGQTICTPLRPRCGMCSITEFCPSAFKET 341 Query: 1013 *SPSSTPRKPRL 1048 SP+S +K L Sbjct: 342 SSPASKMKKSGL 353 >ref|XP_006347463.1| PREDICTED: endonuclease III-like protein 1-like [Solanum tuberosum] Length = 422 Score = 410 bits (1054), Expect = e-112 Identities = 197/240 (82%), Positives = 210/240 (87%) Frame = +2 Query: 320 PPDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTH 499 PP NWE+VLEGIRKMRS+EDAPVDSMGCEKAG+SLP KERRFAVL SSLLSSQTKD V H Sbjct: 183 PPLNWEKVLEGIRKMRSAEDAPVDSMGCEKAGSSLPAKERRFAVLVSSLLSSQTKDQVNH 242 Query: 500 
GAIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPST 679 GAIQRLLQ LL DA+D A E TIKSLIYPVGFYTRKA NLKK+AKICL KY GDIPS+ Sbjct: 243 GAIQRLLQNGLLAADAIDSANEETIKSLIYPVGFYTRKASNLKKVAKICLSKYNGDIPSS 302 Query: 680 XXXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEET 859 PGIGPKMAHLVMNVAW+NVQGICVDTHVHRI NRLGWVSRPGTKQKT +PEET Sbjct: 303 LEELLLLPGIGPKMAHLVMNVAWENVQGICVDTHVHRISNRLGWVSRPGTKQKTRTPEET 362 Query: 860 RESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSSTPRK 1039 RESLQLWLPKEEWVPINPLLVGFGQT+C+PLRPRC IC +S+ CPSAFKE SPSST +K Sbjct: 363 RESLQLWLPKEEWVPINPLLVGFGQTICTPLRPRCAICTVSDLCPSAFKEAASPSSTSKK 422 >gb|ESW03953.1| hypothetical protein PHAVU_011G055100g [Phaseolus vulgaris] Length = 359 Score = 396 bits (1018), Expect = e-108 Identities = 208/305 (68%), Positives = 230/305 (75%), Gaps = 30/305 (9%) Frame = +2 Query: 224 ITEEH---VQQKLCRPLDIEDFAYGKSCGYSW---------------------STLP--- 322 +T++H V QK P +IEDFAY CG + ST P Sbjct: 55 LTQDHKVPVTQKFGLP-EIEDFAY---CGGNELTRRRKSEMESDVASVASEVASTRPGGK 110 Query: 323 -PDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTH 499 P +WE+VLEGIRKMRSS DAPVD+MGCEKAG +LPPKERRFAVL SSLLSSQTKD VTH Sbjct: 111 SPAHWEKVLEGIRKMRSSADAPVDTMGCEKAGDTLPPKERRFAVLVSSLLSSQTKDPVTH 170 Query: 500 GAIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPST 679 GAIQRLLQ DLLTP+A++ +E TIK LIYPVGFYTRKA NLKKIA ICL+KY GDIPS+ Sbjct: 171 GAIQRLLQNDLLTPEAINNVDEETIKKLIYPVGFYTRKATNLKKIANICLMKYHGDIPSS 230 Query: 680 XXXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEET 859 PGIGPKMAHLVMN W+NVQGICVDTHVHRICNRLGWVSR GT QKTS+PEET Sbjct: 231 IDQLLLLPGIGPKMAHLVMNAGWNNVQGICVDTHVHRICNRLGWVSRLGTNQKTSTPEET 290 Query: 860 RESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKE--G*SPSSTP 1033 RESLQ WLPKEEWVPINPLLVGFGQT+C+PLRPRCG C + + CPSAFKE SPSS Sbjct: 291 RESLQRWLPKEEWVPINPLLVGFGQTICTPLRPRCGECSVRDLCPSAFKETSNSSPSSKS 350 Query: 1034 RKPRL 1048 +KP L Sbjct: 351 KKPGL 355 >gb|ESW03954.1| hypothetical protein PHAVU_011G055100g [Phaseolus vulgaris] Length = 408 Score = 
395 bits (1016), Expect = e-107 Identities = 208/304 (68%), Positives = 229/304 (75%), Gaps = 30/304 (9%) Frame = +2 Query: 227 TEEH---VQQKLCRPLDIEDFAYGKSCGYSW---------------------STLP---- 322 T++H V QK P +IEDFAY CG + ST P Sbjct: 105 TQDHKVPVTQKFGLP-EIEDFAY---CGGNELTRRRKSEMESDVASVASEVASTRPGGKS 160 Query: 323 PDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHG 502 P +WE+VLEGIRKMRSS DAPVD+MGCEKAG +LPPKERRFAVL SSLLSSQTKD VTHG Sbjct: 161 PAHWEKVLEGIRKMRSSADAPVDTMGCEKAGDTLPPKERRFAVLVSSLLSSQTKDPVTHG 220 Query: 503 AIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTX 682 AIQRLLQ DLLTP+A++ +E TIK LIYPVGFYTRKA NLKKIA ICL+KY GDIPS+ Sbjct: 221 AIQRLLQNDLLTPEAINNVDEETIKKLIYPVGFYTRKATNLKKIANICLMKYHGDIPSSI 280 Query: 683 XXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETR 862 PGIGPKMAHLVMN W+NVQGICVDTHVHRICNRLGWVSR GT QKTS+PEETR Sbjct: 281 DQLLLLPGIGPKMAHLVMNAGWNNVQGICVDTHVHRICNRLGWVSRLGTNQKTSTPEETR 340 Query: 863 ESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKE--G*SPSSTPR 1036 ESLQ WLPKEEWVPINPLLVGFGQT+C+PLRPRCG C + + CPSAFKE SPSS + Sbjct: 341 ESLQRWLPKEEWVPINPLLVGFGQTICTPLRPRCGECSVRDLCPSAFKETSNSSPSSKSK 400 Query: 1037 KPRL 1048 KP L Sbjct: 401 KPGL 404 >ref|XP_003539044.2| PREDICTED: endonuclease III-like protein 1-like [Glycine max] Length = 357 Score = 394 bits (1013), Expect = e-107 Identities = 193/241 (80%), Positives = 206/241 (85%), Gaps = 2/241 (0%) Frame = +2 Query: 323 PDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHG 502 P WE+VLEGIRKMR S DAPVD+MGCEKAG +LPPKERRFAVL SSLLSSQTKD VTHG Sbjct: 109 PAQWEKVLEGIRKMRCSADAPVDTMGCEKAGETLPPKERRFAVLVSSLLSSQTKDPVTHG 168 Query: 503 AIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTX 682 AIQRLLQ DLLT DA++ A+E TIK LIYPVGFYTRKA NLKKIA ICL+KY GDIPS+ Sbjct: 169 AIQRLLQNDLLTADAINDADEETIKKLIYPVGFYTRKASNLKKIANICLMKYDGDIPSSI 228 Query: 683 XXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETR 862 PGIGPKMAHLVMNV W+NVQGICVDTHVHRICNRLGWVSR GTKQKTS+PEETR Sbjct: 229 
EQLLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLGWVSRLGTKQKTSTPEETR 288 Query: 863 ESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKE--G*SPSSTPR 1036 E LQ WLPKEEWVPINPLLVGFGQT+C+PLRPRCG C ISE CPSAFKE SPSS+ Sbjct: 289 EELQRWLPKEEWVPINPLLVGFGQTICTPLRPRCGECSISELCPSAFKETSNSSPSSSKS 348 Query: 1037 K 1039 K Sbjct: 349 K 349 >ref|XP_004507328.1| PREDICTED: endonuclease III-like protein 1-like [Cicer arietinum] Length = 387 Score = 391 bits (1005), Expect = e-106 Identities = 189/235 (80%), Positives = 202/235 (85%) Frame = +2 Query: 323 PDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHG 502 P +WE LEGIRKMR S DAPVD+MGCEKAG++LPPKERRFAVL SSLLSSQTKDHV HG Sbjct: 140 PADWEETLEGIRKMRCSADAPVDTMGCEKAGSTLPPKERRFAVLVSSLLSSQTKDHVNHG 199 Query: 503 AIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTX 682 AIQRLLQ DLLTPDA++ A+E TIK LIYPVGFYTRKA NLKKIA ICL+KYGGDIPST Sbjct: 200 AIQRLLQNDLLTPDAINNADEETIKKLIYPVGFYTRKATNLKKIANICLMKYGGDIPSTL 259 Query: 683 XXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETR 862 PGIGPKMAHLVMNVAW+NVQGICVDTHVHRICNRLGWVSR GTKQKT +PEETR Sbjct: 260 EQLLLLPGIGPKMAHLVMNVAWNNVQGICVDTHVHRICNRLGWVSRLGTKQKTLTPEETR 319 Query: 863 ESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSS 1027 ESLQ WLP+EEW PINPLLVGFGQT+C+PLRPRCG C IS C SAFKE SS Sbjct: 320 ESLQRWLPREEWDPINPLLVGFGQTICTPLRPRCGECGISHLCLSAFKEASDSSS 374 >ref|XP_004152104.1| PREDICTED: endonuclease III-like protein 1-like [Cucumis sativus] gi|449521044|ref|XP_004167541.1| PREDICTED: endonuclease III-like protein 1-like [Cucumis sativus] Length = 386 Score = 386 bits (992), Expect = e-105 Identities = 208/379 (54%), Positives = 253/379 (66%), Gaps = 31/379 (8%) Frame = +2 Query: 8 PLRIPSSLPLFSARINKIKMPQTSFSSKTQNPTTRNKNPGNENV---NAGSKTELQXXXX 178 P+RIP+ F+ RI M + S SS + NPG +V N S+ E + Sbjct: 6 PIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVR 65 Query: 179 XXXXXXXXXXXXXXDITEEHVQQKLCRPLDIEDFAYGKSC-------------------- 298 ++ E + K P +IEDFA+ ++ Sbjct: 66 
RRVKKIAESQDSGFEV-EPKIDTKRSCPPNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIE 124 Query: 299 --------GYSWSTLPPDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVL 454 G + PP NWE+VL+GIR+MRSSE+APVD+MGC +AG++LPPKERRFAVL Sbjct: 125 DSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVL 184 Query: 455 ASSLLSSQTKDHVTHGAIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKI 634 ASSLLSSQTKDHVTHGA RL + LLT DA+DKA+E TIKSLIYPVGFY+ KA NLKKI Sbjct: 185 ASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKI 244 Query: 635 AKICLLKYGGDIPSTXXXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWV 814 A+ICL+KYGGDIP + PGIGPK+AHL+M +AW++VQGICVDTHVHRICNRLGWV Sbjct: 245 ARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWV 304 Query: 815 SRPGTKQKTSSPEETRESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCP 994 S G+KQKTS+PEETR L+LWLPKEEWVPINPLLVGFGQT+C+PLRP+CG C +S+ CP Sbjct: 305 SGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCP 364 Query: 995 SAFKEG*SPSSTPRKPRLK 1051 SAFKE SPS P+LK Sbjct: 365 SAFKESSSPS-----PKLK 378 >ref|XP_006410282.1| hypothetical protein EUTSA_v10016815mg [Eutrema salsugineum] gi|557111451|gb|ESQ51735.1| hypothetical protein EUTSA_v10016815mg [Eutrema salsugineum] Length = 373 Score = 386 bits (991), Expect = e-104 Identities = 199/348 (57%), Positives = 242/348 (69%), Gaps = 16/348 (4%) Frame = +2 Query: 50 INKIKMPQTSFSSKTQNPTTRNKNPGNENVNAGSKTELQXXXXXXXXXXXXXXXXXXDIT 229 +N + S S TQ + + +P +GS+T + I Sbjct: 24 MNHLNYGTVSSSKPTQQHSLPDSDPEPAKPASGSETRVYTRKKRLKQEAFQPLEKDSCI- 82 Query: 230 EEHVQQKLCRPLDIEDFAYGKSCGYSWSTLP----------------PDNWERVLEGIRK 361 + Q++LCR DIE+FAY K+ S S P+NW +VLEGIR+ Sbjct: 83 --NTQKQLCRLPDIEEFAYKKNTRSSSSRRSTETSITVTSVKTAGNAPENWVKVLEGIRQ 140 Query: 362 MRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRLLQQDLLTP 541 MRSSEDAPVDSMGC+KAG+ LPP ERRFAVL +LLSSQTKD V + AI RL Q LLTP Sbjct: 141 MRSSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDEVNNAAIHRLHQNGLLTP 200 Query: 542 DAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTXXXXXXXPGIGPKM 721 +AVDKA+E+T++ LIYPVGFYTRKA +KKIAKICL+KY GDIPS+ PGIGPKM 
Sbjct: 201 EAVDKADESTLRELIYPVGFYTRKATYMKKIAKICLVKYNGDIPSSLDDLLALPGIGPKM 260 Query: 722 AHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRESLQLWLPKEEWV 901 AHL++++AW++VQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETR +LQ WLPKEEWV Sbjct: 261 AHLILHIAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRVALQQWLPKEEWV 320 Query: 902 PINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSSTPRKPR 1045 INPLLVGFGQT+C+PLRPRC C +++ CP+AFKE SPSS +K + Sbjct: 321 AINPLLVGFGQTICTPLRPRCETCSVTKLCPAAFKEASSPSSKLKKSK 368 >emb|CAC16135.1| endonuclease III homologue [Arabidopsis thaliana] Length = 354 Score = 385 bits (988), Expect = e-104 Identities = 190/280 (67%), Positives = 219/280 (78%), Gaps = 16/280 (5%) Frame = +2 Query: 248 KLCRPLDIEDFAYGKSCGYSWST----------------LPPDNWERVLEGIRKMRSSED 379 KLC DIEDFAY K+ G S+ PP+NW VLEGIR+MRSSED Sbjct: 68 KLCGLPDIEDFAYKKTIGSPSSSRSTETSITVTSVKTAGYPPENWVEVLEGIRQMRSSED 127 Query: 380 APVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRLLQQDLLTPDAVDKA 559 APVDSMGC+KAG+ LPP ERRFAVL +LLSSQTKD V + AI RL Q LLTP+AVDKA Sbjct: 128 APVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDQVNNAAIHRLHQNGLLTPEAVDKA 187 Query: 560 EEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTXXXXXXXPGIGPKMAHLVMN 739 +E+TIK LIYPVGFYTRKA +KKIA+ICL+KY GDIPS+ PGIGPKMAHL+++ Sbjct: 188 DESTIKELIYPVGFYTRKATYMKKIARICLVKYDGDIPSSLDDLLSLPGIGPKMAHLILH 247 Query: 740 VAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRESLQLWLPKEEWVPINPLL 919 +AW++VQGICVDTHVHRICNRLGWVSRPGTKQKT+SPEETR +LQ WLPKEEWV INPLL Sbjct: 248 IAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTTSPEETRVALQQWLPKEEWVAINPLL 307 Query: 920 VGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSSTPRK 1039 VGFGQ +C+PLRPRC C +S+ CP+AFKE SPSS +K Sbjct: 308 VGFGQMICTPLRPRCEACSVSKLCPAAFKETSSPSSKLKK 347 >ref|NP_565725.1| protein NTH1 [Arabidopsis thaliana] gi|20198157|gb|AAD26474.2| putative endonuclease [Arabidopsis thaliana] gi|330253455|gb|AEC08549.1| protein NTH1 [Arabidopsis thaliana] Length = 379 Score = 384 bits (986), Expect = e-104 Identities = 189/280 (67%), Positives = 219/280 (78%), Gaps = 16/280 (5%) Frame = +2 Query: 248 
KLCRPLDIEDFAYGKSCGYSWST----------------LPPDNWERVLEGIRKMRSSED 379 KLC DIEDFAY K+ G S+ PP+NW VLEGIR+MRSSED Sbjct: 93 KLCGLPDIEDFAYKKTIGSPSSSRSTETSITVTSVKTAGYPPENWVEVLEGIRQMRSSED 152 Query: 380 APVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRLLQQDLLTPDAVDKA 559 APVDSMGC+KAG+ LPP ERRFAVL +LLSSQTKD V + AI RL Q LLTP+AVDKA Sbjct: 153 APVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDQVNNAAIHRLHQNGLLTPEAVDKA 212 Query: 560 EEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTXXXXXXXPGIGPKMAHLVMN 739 +E+TIK LIYPVGFYTRKA +KKIA+ICL+KY GDIPS+ PGIGPKMAHL+++ Sbjct: 213 DESTIKELIYPVGFYTRKATYMKKIARICLVKYDGDIPSSLDDLLSLPGIGPKMAHLILH 272 Query: 740 VAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRESLQLWLPKEEWVPINPLL 919 +AW++VQGICVDTHVHRICNRLGWVSRPGTKQKT+SPEETR +LQ WLPKEEWV INPLL Sbjct: 273 IAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTTSPEETRVALQQWLPKEEWVAINPLL 332 Query: 920 VGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSSTPRK 1039 VGFGQ +C+P+RPRC C +S+ CP+AFKE SPSS +K Sbjct: 333 VGFGQMICTPIRPRCEACSVSKLCPAAFKETSSPSSKLKK 372 >ref|NP_001077988.1| protein NTH1 [Arabidopsis thaliana] gi|17380754|gb|AAL36207.1| putative endonuclease [Arabidopsis thaliana] gi|20259623|gb|AAM14168.1| putative endonuclease [Arabidopsis thaliana] gi|330253456|gb|AEC08550.1| protein NTH1 [Arabidopsis thaliana] Length = 377 Score = 384 bits (986), Expect = e-104 Identities = 189/280 (67%), Positives = 219/280 (78%), Gaps = 16/280 (5%) Frame = +2 Query: 248 KLCRPLDIEDFAYGKSCGYSWST----------------LPPDNWERVLEGIRKMRSSED 379 KLC DIEDFAY K+ G S+ PP+NW VLEGIR+MRSSED Sbjct: 91 KLCGLPDIEDFAYKKTIGSPSSSRSTETSITVTSVKTAGYPPENWVEVLEGIRQMRSSED 150 Query: 380 APVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRLLQQDLLTPDAVDKA 559 APVDSMGC+KAG+ LPP ERRFAVL +LLSSQTKD V + AI RL Q LLTP+AVDKA Sbjct: 151 APVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDQVNNAAIHRLHQNGLLTPEAVDKA 210 Query: 560 EEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTXXXXXXXPGIGPKMAHLVMN 739 +E+TIK LIYPVGFYTRKA +KKIA+ICL+KY GDIPS+ PGIGPKMAHL+++ Sbjct: 211 DESTIKELIYPVGFYTRKATYMKKIARICLVKYDGDIPSSLDDLLSLPGIGPKMAHLILH 270 Query: 740 
VAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRESLQLWLPKEEWVPINPLL 919 +AW++VQGICVDTHVHRICNRLGWVSRPGTKQKT+SPEETR +LQ WLPKEEWV INPLL Sbjct: 271 IAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTTSPEETRVALQQWLPKEEWVAINPLL 330 Query: 920 VGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSSTPRK 1039 VGFGQ +C+P+RPRC C +S+ CP+AFKE SPSS +K Sbjct: 331 VGFGQMICTPIRPRCEACSVSKLCPAAFKETSSPSSKLKK 370
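Two details of this report are easy to misread when it is processed by script. The summary table near the top prints very small E-values without a mantissa (e-118 rather than 1e-118), and the Query coordinates in each alignment count nucleotides while the aligned residues are amino acids. The following is a minimal sketch of how both could be handled, not a full BLAST parser. The names parse_evalue, parse_hit_row and codons_spanned are hypothetical helpers introduced here for illustration, and the row layout (identifier, description, bit score, E-value) is assumed from the table in this report.

```python
import re

# Hypothetical helpers for illustration; they are not part of BLAST or any library.
# Assumed row layout: identifier, description, bit score, E-value.
HIT_ROW = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s+(?P<bits>\d+)\s+(?P<evalue>\S+)$"
)


def parse_evalue(text):
    """Convert a legacy BLAST E-value such as 'e-118' to a float."""
    # The report drops the leading mantissa, so 'e-118' is read as 1e-118.
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hit_row(line):
    """Split one summary row into identifier, description, bit score and E-value."""
    match = HIT_ROW.match(line.strip())
    if match is None:
        raise ValueError(f"not a hit summary row: {line!r}")
    return {
        "id": match.group("id"),
        "description": match.group("desc"),
        "bits": int(match.group("bits")),
        "evalue": parse_evalue(match.group("evalue")),
    }


def codons_spanned(q_start, q_end):
    """Number of codons covered by inclusive nucleotide coordinates on the query."""
    return (abs(q_end - q_start) + 1) // 3


# Row copied from the summary table of this report.
row = parse_hit_row("emb|CBI36652.3| unnamed protein product [Vitis vinifera] 426 e-117")
print(row["id"], row["bits"], row["evalue"])

# First Query segment of the first alignment: nucleotides 2 to 181 in frame +2.
print(codons_spanned(2, 181))
```

The codon count for nucleotides 2 to 181 agrees with the 60 residues shown on that Query line, which is a quick sanity check that a segment has been read with the right coordinates.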