BLASTX nr result
ID: Catharanthus22_contig00012020
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00012020 (1352 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

gb|EOY04995.1| DNA glycosylase superfamily protein isoform 3 [Th...    432   e-118
ref|XP_002264475.2| PREDICTED: endonuclease III-like protein 1-l...    431   e-118
gb|EMJ23912.1| hypothetical protein PRUPE_ppa009900mg [Prunus pe...    430   e-118
emb|CBI36652.3| unnamed protein product [Vitis vinifera]               427   e-117
ref|XP_004241397.1| PREDICTED: endonuclease III-like protein 1-l...    422   e-115
gb|EOY04993.1| DNA glycosylase superfamily protein isoform 1 [Th...    419   e-114
ref|XP_006443060.1| hypothetical protein CICLE_v10020813mg [Citr...    418   e-114
ref|XP_004309826.1| PREDICTED: endonuclease III-like protein 1-l...    416   e-113
ref|XP_006494172.1| PREDICTED: endonuclease III-like protein 1-l...    414   e-113
ref|XP_002534117.1| endonuclease III, putative [Ricinus communis...    411   e-112
ref|XP_006347463.1| PREDICTED: endonuclease III-like protein 1-l...    410   e-112
gb|ESW03954.1| hypothetical protein PHAVU_011G055100g [Phaseolus...    397   e-108
gb|ESW03953.1| hypothetical protein PHAVU_011G055100g [Phaseolus...    396   e-108
ref|XP_003539044.2| PREDICTED: endonuclease III-like protein 1-l...    394   e-107
ref|XP_004507328.1| PREDICTED: endonuclease III-like protein 1-l...    391   e-106
ref|XP_004152104.1| PREDICTED: endonuclease III-like protein 1-l...    386   e-104
ref|XP_006410282.1| hypothetical protein EUTSA_v10016815mg [Eutr...    386   e-104
emb|CAC16135.1| endonuclease III homologue [Arabidopsis thaliana]      385   e-104
ref|NP_565725.1| protein NTH1 [Arabidopsis thaliana] gi|20198157...
384 e-104 ref|NP_001077988.1| protein NTH1 [Arabidopsis thaliana] gi|17380... 384 e-104 >gb|EOY04995.1| DNA glycosylase superfamily protein isoform 3 [Theobroma cacao] Length = 364 Score = 432 bits (1111), Expect = e-118 Identities = 224/357 (62%), Positives = 261/357 (73%), Gaps = 5/357 (1%) Frame = +2 Query: 17 LYTRKMSFPLRIPSSLPLFSARINKIKMPQTSFSSKTQNPTTRNKNPGNENVNAGSKTEL 196 +Y SFPL + L ++N KMP+T + KT + ++ + P ++ T+ Sbjct: 1 MYAVPRSFPLGF--GVGLGGMKLNS-KMPKTRLAFKTLSSSSTTEVPSSDPNPGSETTDN 57 Query: 197 QXXXXXXXXXXXXXXXXXXDITEEHVQQ-----KLCRPLDIEDFAYGKSCGYSWSTLPPD 361 D+ +E + KLC DIE+FAY K G S S P Sbjct: 58 VSVPAVRVFTRKKRVKKTVDVVQEIPKAENKGLKLCGLPDIEEFAYKKVDGPSLSGNAPA 117 Query: 362 NWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHGAI 541 NWE+VLEGIRKMRS+EDAPVD+MGCEKAG+ LPPKERRFAVL SSLLSSQTKDHVTHGAI Sbjct: 118 NWEKVLEGIRKMRSAEDAPVDTMGCEKAGSVLPPKERRFAVLISSLLSSQTKDHVTHGAI 177 Query: 542 QRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTXXX 721 QRL+Q L+TPDA+DKA+EATIK LIYPVGFYTRKA N+KKIAKICL+KY GDIPS+ Sbjct: 178 QRLIQNCLMTPDAIDKADEATIKDLIYPVGFYTRKAINVKKIAKICLMKYDGDIPSSLEE 237 Query: 722 XXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRES 901 PGIGPKMAHLVMN+AWD+VQGICVDTHVHRICNRLGWVSRPGTKQKT PEETR + Sbjct: 238 LLLLPGIGPKMAHLVMNIAWDDVQGICVDTHVHRICNRLGWVSRPGTKQKTLYPEETRVA 297 Query: 902 LQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSSTPRK 1072 LQ WLPKEEWVPINPLLVGFGQT+C+PLRP+C +C I+EFCPSAFKE SPSS +K Sbjct: 298 LQQWLPKEEWVPINPLLVGFGQTICTPLRPQCEVCSITEFCPSAFKETSSPSSKVKK 354 >ref|XP_002264475.2| PREDICTED: endonuclease III-like protein 1-like [Vitis vinifera] Length = 355 Score = 431 bits (1107), Expect = e-118 Identities = 222/349 (63%), Positives = 250/349 (71%), Gaps = 23/349 (6%) Frame = +2 Query: 98 MPQTSFSSKTQNPTTRNKNPGNENVNAGSKTELQXXXXXXXXXXXXXXXXXXDITEEHVQ 277 M + + SSK P ++K +E G +I E Q Sbjct: 1 MSRATSSSKPLLPALQSKTSAHEETPNGVSGSEVRVFVRKKRVKMAVETPEKEIKAEPQQ 60 Query: 278 QKLCRPLDIEDFAY--GKSCGYSWSTLP---------------------PDNWERVLEGI 388 QK+C DIE+F Y 
GK + + P P NWE++LEGI Sbjct: 61 QKICELPDIEEFTYRKGKRSTHLRKSKPTSDVPPGGTEITSSIRPAAELPANWEKILEGI 120 Query: 389 RKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRLLQQDLL 568 RKMRSSEDAPVDSMGCEKAG+SLPP+ERRFAVL SSLLSSQTKD+VTHGAIQRLLQ LL Sbjct: 121 RKMRSSEDAPVDSMGCEKAGSSLPPRERRFAVLVSSLLSSQTKDNVTHGAIQRLLQNGLL 180 Query: 569 TPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTXXXXXXXPGIGP 748 DA+DKA+EAT+KSLIYPVGFY+RKAGNLKKIAKICL+KY GDIPS+ PGIGP Sbjct: 181 VADAIDKADEATVKSLIYPVGFYSRKAGNLKKIAKICLMKYDGDIPSSLEELLLLPGIGP 240 Query: 749 KMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRESLQLWLPKEE 928 KMAHLVMNVAW+NVQGICVDTHVHRICNRLGWVSR GTKQKTS PEETRESLQLWLPKEE Sbjct: 241 KMAHLVMNVAWNNVQGICVDTHVHRICNRLGWVSRRGTKQKTSLPEETRESLQLWLPKEE 300 Query: 929 WVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSSTPRKP 1075 WVPINPLLVGFGQT+C+PLRPRCG+C +S+ CPSAFKE SPSS +KP Sbjct: 301 WVPINPLLVGFGQTICTPLRPRCGVCGVSDLCPSAFKEAQSPSSKMKKP 349 >gb|EMJ23912.1| hypothetical protein PRUPE_ppa009900mg [Prunus persica] Length = 272 Score = 430 bits (1105), Expect = e-118 Identities = 210/267 (78%), Positives = 229/267 (85%) Frame = +2 Query: 281 KLCRPLDIEDFAYGKSCGYSWSTLPPDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLP 460 +L P DIE+FAY K + S+ PP NWE+VLEGIRKMRSSEDAPVDSMGCEKAG++LP Sbjct: 2 QLASPPDIEEFAYTKVSASTNSSKPPANWEKVLEGIRKMRSSEDAPVDSMGCEKAGSALP 61 Query: 461 PKERRFAVLASSLLSSQTKDHVTHGAIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYT 640 PKERRFAVL SSLLSSQTKDHVTHGAIQRLLQ +LL D++DKAEEATIKSLIYPVGFYT Sbjct: 62 PKERRFAVLVSSLLSSQTKDHVTHGAIQRLLQNNLLAADSIDKAEEATIKSLIYPVGFYT 121 Query: 641 RKAGNLKKIAKICLLKYGGDIPSTXXXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVH 820 RKA NLKKIAKICL KY GDIPS+ PGIGPKMAHLVMNV W+NVQGICVDTHVH Sbjct: 122 RKATNLKKIAKICLTKYDGDIPSSLDELLSLPGIGPKMAHLVMNVGWNNVQGICVDTHVH 181 Query: 821 RICNRLGWVSRPGTKQKTSSPEETRESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCG 1000 RI NRLGWVSR G KQKTS+PEETRE+LQLWLPKEEW PINPLLVGFGQTVC+PLRP CG Sbjct: 182 RISNRLGWVSREGRKQKTSNPEETREALQLWLPKEEWDPINPLLVGFGQTVCTPLRPHCG 241 Query: 1001 ICIISEFCPSAFKEG*SPSSTPRKPRL 1081 +C +S+FCPSAFKE SPSS 
+K L Sbjct: 242 VCNVSKFCPSAFKEASSPSSKSKKSGL 268 >emb|CBI36652.3| unnamed protein product [Vitis vinifera] Length = 379 Score = 427 bits (1098), Expect = e-117 Identities = 230/374 (61%), Positives = 259/374 (69%), Gaps = 26/374 (6%) Frame = +2 Query: 32 MSFPLRIPSSLPLFSARINKIKMPQTSFSSKTQNPTTRNKNPGNENVNAGSKTELQXXXX 211 MS L +L L S RI M + + SSK P ++K +E G Sbjct: 1 MSHILLKSCTLALASVRITW-PMSRATSSSKPLLPALQSKTSAHEETPNGVSGSEVRVFV 59 Query: 212 XXXXXXXXXXXXXXDITEEHVQQKLCRPLDIEDFAY--GKSCGYSWSTLP---------- 355 +I E QQK+C DIE+F Y GK + + P Sbjct: 60 RKKRVKMAVETPEKEIKAEPQQQKICELPDIEEFTYRKGKRSTHLRKSKPTSDVPPGGTE 119 Query: 356 -----------PDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLL 502 P NWE++LEGIRKMRSSEDAPVDSMGCEKAG+SLPP+ERRFAVL SSLL Sbjct: 120 ITSSIRPAAELPANWEKILEGIRKMRSSEDAPVDSMGCEKAGSSLPPRERRFAVLVSSLL 179 Query: 503 SSQTKDHVTHG---AIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAK 673 SSQTKD+VTHG AIQRLLQ LL DA+DKA+EAT+KSLIYPVGFY+RKAGNLKKIAK Sbjct: 180 SSQTKDNVTHGNAGAIQRLLQNGLLVADAIDKADEATVKSLIYPVGFYSRKAGNLKKIAK 239 Query: 674 ICLLKYGGDIPSTXXXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSR 853 ICL+KY GDIPS+ PGIGPKMAHLVMNVAW+NVQGICVDTHVHRICNRLGWVSR Sbjct: 240 ICLMKYDGDIPSSLEELLLLPGIGPKMAHLVMNVAWNNVQGICVDTHVHRICNRLGWVSR 299 Query: 854 PGTKQKTSSPEETRESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSA 1033 GTKQKTS PEETRESLQLWLPKEEWVPINPLLVGFGQT+C+PLRPRCG+C +S+ CPSA Sbjct: 300 RGTKQKTSLPEETRESLQLWLPKEEWVPINPLLVGFGQTICTPLRPRCGVCGVSDLCPSA 359 Query: 1034 FKEG*SPSSTPRKP 1075 FKE SPSS +KP Sbjct: 360 FKEAQSPSSKMKKP 373 >ref|XP_004241397.1| PREDICTED: endonuclease III-like protein 1-like [Solanum lycopersicum] Length = 380 Score = 422 bits (1084), Expect = e-115 Identities = 229/378 (60%), Positives = 254/378 (67%), Gaps = 35/378 (9%) Frame = +2 Query: 44 LRIPSSLPLFSARINKI---KMPQTSFSSKTQNPTTRNKNPGNENVNAGSKTELQXXXXX 214 LR + LP S I I KM +T S + P+ KNPG + S EL+ Sbjct: 6 LRNTAFLPSISLGIQTISSAKMRRTRSSLNQETPS--QKNPGCDGTGGSSVPELRVFIRR 63 Query: 215 
XXXXXXXXXXXXXDITEEHVQQK--LCRPLDIEDFAYGKSCGYSWST------------- 349 ++ EE +K L R DIEDF+Y K + ST Sbjct: 64 KRVKKTVEVIAK-EVKEESSGKKVMLVRLPDIEDFSYSKDITHPQSTPSKTVRLTGEKTL 122 Query: 350 -----------------LPPDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRF 478 PP NWE+VLEGIRKMRS+EDAPVDSMGCEKAG+SLP KERRF Sbjct: 123 PQLMQTEIKGFSLSDPLQPPSNWEKVLEGIRKMRSAEDAPVDSMGCEKAGSSLPAKERRF 182 Query: 479 AVLASSLLSSQTKDHVTHGAIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNL 658 AVL SSLLSSQTKD V HGA+QRLLQ LL DA+D A E TIKSLIYPVGFYTRKA NL Sbjct: 183 AVLVSSLLSSQTKDQVNHGAVQRLLQNGLLAADAIDSANEETIKSLIYPVGFYTRKASNL 242 Query: 659 KKIAKICLLKYGGDIPSTXXXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRL 838 KK+AKICL KY GDIPS+ PGIGPKMAHLVMNVAW+NVQGICVDTHVHRI NRL Sbjct: 243 KKVAKICLSKYNGDIPSSLEELLLLPGIGPKMAHLVMNVAWENVQGICVDTHVHRISNRL 302 Query: 839 GWVSRPGTKQKTSSPEETRESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISE 1018 WVSRPGTKQKT +PEETRESLQLWLPKEEWVPINPLLVGFGQT+C+PLRPRC IC +S+ Sbjct: 303 EWVSRPGTKQKTRTPEETRESLQLWLPKEEWVPINPLLVGFGQTICTPLRPRCAICTVSD 362 Query: 1019 FCPSAFKEG*SPSSTPRK 1072 CPSAFKE SPSSTP+K Sbjct: 363 LCPSAFKEAASPSSTPKK 380 >gb|EOY04993.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] Length = 387 Score = 419 bits (1078), Expect = e-114 Identities = 224/380 (58%), Positives = 261/380 (68%), Gaps = 28/380 (7%) Frame = +2 Query: 17 LYTRKMSFPLRIPSSLPLFSARINKIKMPQTSFSSKTQNPTTRNKNPGNENVNAGSKTEL 196 +Y SFPL + L ++N KMP+T + KT + ++ + P ++ T+ Sbjct: 1 MYAVPRSFPLGF--GVGLGGMKLNS-KMPKTRLAFKTLSSSSTTEVPSSDPNPGSETTDN 57 Query: 197 QXXXXXXXXXXXXXXXXXXDITEEHVQQ-----KLCRPLDIEDFAYGKSCGYSWSTLP-- 355 D+ +E + KLC DIE+FAY K G S S Sbjct: 58 VSVPAVRVFTRKKRVKKTVDVVQEIPKAENKGLKLCGLPDIEEFAYKKVDGPSLSGKSKS 117 Query: 356 ---------------------PDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKER 472 P NWE+VLEGIRKMRS+EDAPVD+MGCEKAG+ LPPKER Sbjct: 118 TSDEINVGTGIASPVGIGGNAPANWEKVLEGIRKMRSAEDAPVDTMGCEKAGSVLPPKER 177 Query: 473 RFAVLASSLLSSQTKDHVTHGAIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAG 652 RFAVL SSLLSSQTKDHVTHGAIQRL+Q L+TPDA+DKA+EATIK LIYPVGFYTRKA 
Sbjct: 178 RFAVLISSLLSSQTKDHVTHGAIQRLIQNCLMTPDAIDKADEATIKDLIYPVGFYTRKAI 237 Query: 653 NLKKIAKICLLKYGGDIPSTXXXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICN 832 N+KKIAKICL+KY GDIPS+ PGIGPKMAHLVMN+AWD+VQGICVDTHVHRICN Sbjct: 238 NVKKIAKICLMKYDGDIPSSLEELLLLPGIGPKMAHLVMNIAWDDVQGICVDTHVHRICN 297 Query: 833 RLGWVSRPGTKQKTSSPEETRESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICII 1012 RLGWVSRPGTKQKT PEETR +LQ WLPKEEWVPINPLLVGFGQT+C+PLRP+C +C I Sbjct: 298 RLGWVSRPGTKQKTLYPEETRVALQQWLPKEEWVPINPLLVGFGQTICTPLRPQCEVCSI 357 Query: 1013 SEFCPSAFKEG*SPSSTPRK 1072 +EFCPSAFKE SPSS +K Sbjct: 358 TEFCPSAFKETSSPSSKVKK 377 >ref|XP_006443060.1| hypothetical protein CICLE_v10020813mg [Citrus clementina] gi|557545322|gb|ESR56300.1| hypothetical protein CICLE_v10020813mg [Citrus clementina] Length = 357 Score = 418 bits (1074), Expect = e-114 Identities = 211/301 (70%), Positives = 232/301 (77%), Gaps = 27/301 (8%) Frame = +2 Query: 263 EEHVQQKLCRPLDIEDFAY---------GKSCGYSWSTL------------------PPD 361 E ++ K C DIE+FAY K G S ST PP Sbjct: 57 EAPIEHKSCGLPDIEEFAYKEANGSALSSKIAGKSKSTQDMPVVGTEVASLNRMRGEPPA 116 Query: 362 NWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHGAI 541 NWERVLEGIRKMR+SEDAPVDSMGCEKAG+SLPP+ERRFAVL SSLLSSQTKD+VTHGAI Sbjct: 117 NWERVLEGIRKMRTSEDAPVDSMGCEKAGSSLPPRERRFAVLISSLLSSQTKDNVTHGAI 176 Query: 542 QRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTXXX 721 QRLLQ LLT +A+DKA+EATIK LIYPVGFYTRKA N+KKIA ICL KY GDIPS+ Sbjct: 177 QRLLQNGLLTAEAIDKADEATIKDLIYPVGFYTRKASNMKKIAPICLTKYDGDIPSSLDE 236 Query: 722 XXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRES 901 PGIGPKMAHLVMNV W+NVQGICVDTHVHRICNRLGWVS+PG KQKTSSPE+TRE Sbjct: 237 LLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLGWVSQPGRKQKTSSPEQTREV 296 Query: 902 LQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSSTPRKPRL 1081 LQLWLPKEEWVPINPLLVGFGQT+C+P+RPRCG+C +SE CPSAFK+ SPSS RK Sbjct: 297 LQLWLPKEEWVPINPLLVGFGQTICTPIRPRCGMCSVSELCPSAFKDSSSPSSKSRKSAQ 356 Query: 1082 K 1084 K Sbjct: 357 K 357 >ref|XP_004309826.1| PREDICTED: endonuclease 
III-like protein 1-like [Fragaria vesca subsp. vesca] Length = 341 Score = 416 bits (1068), Expect = e-113 Identities = 202/261 (77%), Positives = 224/261 (85%), Gaps = 3/261 (1%) Frame = +2 Query: 299 DIEDFAYGKSCGYSWST---LPPDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKE 469 DIE+FAY S+ST PP +WE+VLEGIRKMRS+EDAPVDSMGCEKAG++LPPKE Sbjct: 75 DIEEFAYRNESSSSYSTDIGKPPAHWEKVLEGIRKMRSAEDAPVDSMGCEKAGSALPPKE 134 Query: 470 RRFAVLASSLLSSQTKDHVTHGAIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKA 649 RRFAVL SSLLSSQTKD VTHGA+QRLLQ +L+ DA+DK +E TIKSLIYPVGFYTRKA Sbjct: 135 RRFAVLVSSLLSSQTKDQVTHGAVQRLLQNGMLSADAIDKGDEPTIKSLIYPVGFYTRKA 194 Query: 650 GNLKKIAKICLLKYGGDIPSTXXXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRIC 829 NLKKIA ICL+KY GDIPS+ PGIGPKMAHLVMNVAWDNVQGICVDTHVHRIC Sbjct: 195 SNLKKIANICLVKYDGDIPSSLEELLSLPGIGPKMAHLVMNVAWDNVQGICVDTHVHRIC 254 Query: 830 NRLGWVSRPGTKQKTSSPEETRESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICI 1009 NRLGWV R G KQKTS+PEETRE+LQLWLPK+EWVPINPLLVGFGQTVC+PLRPRCG+C Sbjct: 255 NRLGWV-RAGKKQKTSNPEETREALQLWLPKDEWVPINPLLVGFGQTVCTPLRPRCGVCS 313 Query: 1010 ISEFCPSAFKEG*SPSSTPRK 1072 +SEFCPSA+KE SP S +K Sbjct: 314 VSEFCPSAYKETSSPLSKTKK 334 >ref|XP_006494172.1| PREDICTED: endonuclease III-like protein 1-like [Citrus sinensis] Length = 357 Score = 414 bits (1064), Expect = e-113 Identities = 210/301 (69%), Positives = 231/301 (76%), Gaps = 27/301 (8%) Frame = +2 Query: 263 EEHVQQKLCRPLDIEDFAY---------GKSCGYSWSTL------------------PPD 361 E ++ K C DIE+FAY K G S ST PP Sbjct: 57 EAPIEHKSCGLPDIEEFAYKEANGSALSSKIAGKSKSTQDMPVVGTEVASLNRMRGEPPA 116 Query: 362 NWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHGAI 541 NWERVLEGIRKMR+SEDAPVDSMGCEKAG+SLPP+ERRFAVL SSLLSSQTKD+VTHGAI Sbjct: 117 NWERVLEGIRKMRTSEDAPVDSMGCEKAGSSLPPRERRFAVLISSLLSSQTKDNVTHGAI 176 Query: 542 QRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTXXX 721 QRLLQ LLT +A+DKA+EATIK LIY VGFYTRKA N+KKIA ICL KY GDIPS+ Sbjct: 177 QRLLQNGLLTAEAIDKADEATIKDLIYLVGFYTRKASNMKKIAPICLTKYDGDIPSSLDE 236 Query: 722 
XXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRES 901 PGIGPKMAHLVMNV W+NVQGICVDTHVHRICNRLGWVS+PG KQKTSSPE+TRE Sbjct: 237 LLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLGWVSQPGRKQKTSSPEQTREV 296 Query: 902 LQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSSTPRKPRL 1081 LQLWLPKEEWVPINPLLVGFGQT+C+P+RPRCG+C +SE CPSAFK+ SPSS RK Sbjct: 297 LQLWLPKEEWVPINPLLVGFGQTICTPIRPRCGMCSVSELCPSAFKDSSSPSSKSRKSAQ 356 Query: 1082 K 1084 K Sbjct: 357 K 357 >ref|XP_002534117.1| endonuclease III, putative [Ricinus communis] gi|223525829|gb|EEF28268.1| endonuclease III, putative [Ricinus communis] Length = 357 Score = 411 bits (1057), Expect = e-112 Identities = 198/252 (78%), Positives = 217/252 (86%) Frame = +2 Query: 326 SCGYSWSTLPPDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLS 505 +C S PP NWE VLEGIRKMRSSEDAPVD+MGCEKAG+ LP KERRFAVL SSL+S Sbjct: 102 ACTIRPSDEPPANWEIVLEGIRKMRSSEDAPVDTMGCEKAGSFLPSKERRFAVLVSSLMS 161 Query: 506 SQTKDHVTHGAIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLL 685 SQTKDHVTHGA+QRL Q LLT DA+DKA+E TIK LIYPVGFYTRKA NLKKIAKICL+ Sbjct: 162 SQTKDHVTHGAVQRLHQNSLLTADAIDKADETTIKDLIYPVGFYTRKASNLKKIAKICLM 221 Query: 686 KYGGDIPSTXXXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTK 865 KY GDIP + PGIGPKMAHLVMNVAWD+VQGICVDTHVHRICNRLGWVSRPGT+ Sbjct: 222 KYDGDIPRSLEDLLSLPGIGPKMAHLVMNVAWDDVQGICVDTHVHRICNRLGWVSRPGTE 281 Query: 866 QKTSSPEETRESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKEG 1045 QKTS+PEETR +LQLWLPKEEWVPINPLLVGFGQT+C+PLRPRCG+C I+EFCPSAFKE Sbjct: 282 QKTSNPEETRVALQLWLPKEEWVPINPLLVGFGQTICTPLRPRCGMCSITEFCPSAFKET 341 Query: 1046 *SPSSTPRKPRL 1081 SP+S +K L Sbjct: 342 SSPASKMKKSGL 353 >ref|XP_006347463.1| PREDICTED: endonuclease III-like protein 1-like [Solanum tuberosum] Length = 422 Score = 410 bits (1054), Expect = e-112 Identities = 197/240 (82%), Positives = 210/240 (87%) Frame = +2 Query: 353 PPDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTH 532 PP NWE+VLEGIRKMRS+EDAPVDSMGCEKAG+SLP KERRFAVL SSLLSSQTKD V H Sbjct: 183 
PPLNWEKVLEGIRKMRSAEDAPVDSMGCEKAGSSLPAKERRFAVLVSSLLSSQTKDQVNH 242 Query: 533 GAIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPST 712 GAIQRLLQ LL DA+D A E TIKSLIYPVGFYTRKA NLKK+AKICL KY GDIPS+ Sbjct: 243 GAIQRLLQNGLLAADAIDSANEETIKSLIYPVGFYTRKASNLKKVAKICLSKYNGDIPSS 302 Query: 713 XXXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEET 892 PGIGPKMAHLVMNVAW+NVQGICVDTHVHRI NRLGWVSRPGTKQKT +PEET Sbjct: 303 LEELLLLPGIGPKMAHLVMNVAWENVQGICVDTHVHRISNRLGWVSRPGTKQKTRTPEET 362 Query: 893 RESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSSTPRK 1072 RESLQLWLPKEEWVPINPLLVGFGQT+C+PLRPRC IC +S+ CPSAFKE SPSST +K Sbjct: 363 RESLQLWLPKEEWVPINPLLVGFGQTICTPLRPRCAICTVSDLCPSAFKEAASPSSTSKK 422 >gb|ESW03954.1| hypothetical protein PHAVU_011G055100g [Phaseolus vulgaris] Length = 408 Score = 397 bits (1020), Expect = e-108 Identities = 227/387 (58%), Positives = 253/387 (65%), Gaps = 35/387 (9%) Frame = +2 Query: 26 RKMSFPLRIPSSLPLFSA-RINKIKMPQTSF----SSKTQNPTTRNKNPGNENVNAGSKT 190 RKM+ L LPL ++ TSF SK + RNKNP V + Sbjct: 41 RKMAVKLEEEDHLPLTQDHKVPVTPNSATSFIEASHSKARVFVRRNKNPRKMAVKLEEED 100 Query: 191 ELQXXXXXXXXXXXXXXXXXXDITEEH---VQQKLCRPLDIEDFAYGKSCGYSW------ 343 L T++H V QK P +IEDFAY CG + Sbjct: 101 HLPS-------------------TQDHKVPVTQKFGLP-EIEDFAY---CGGNELTRRRK 137 Query: 344 ---------------STLP----PDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPK 466 ST P P +WE+VLEGIRKMRSS DAPVD+MGCEKAG +LPPK Sbjct: 138 SEMESDVASVASEVASTRPGGKSPAHWEKVLEGIRKMRSSADAPVDTMGCEKAGDTLPPK 197 Query: 467 ERRFAVLASSLLSSQTKDHVTHGAIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRK 646 ERRFAVL SSLLSSQTKD VTHGAIQRLLQ DLLTP+A++ +E TIK LIYPVGFYTRK Sbjct: 198 ERRFAVLVSSLLSSQTKDPVTHGAIQRLLQNDLLTPEAINNVDEETIKKLIYPVGFYTRK 257 Query: 647 AGNLKKIAKICLLKYGGDIPSTXXXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRI 826 A NLKKIA ICL+KY GDIPS+ PGIGPKMAHLVMN W+NVQGICVDTHVHRI Sbjct: 258 ATNLKKIANICLMKYHGDIPSSIDQLLLLPGIGPKMAHLVMNAGWNNVQGICVDTHVHRI 317 Query: 827 CNRLGWVSRPGTKQKTSSPEETRESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGIC 1006 CNRLGWVSR GT QKTS+PEETRESLQ 
WLPKEEWVPINPLLVGFGQT+C+PLRPRCG C Sbjct: 318 CNRLGWVSRLGTNQKTSTPEETRESLQRWLPKEEWVPINPLLVGFGQTICTPLRPRCGEC 377 Query: 1007 IISEFCPSAFKE--G*SPSSTPRKPRL 1081 + + CPSAFKE SPSS +KP L Sbjct: 378 SVRDLCPSAFKETSNSSPSSKSKKPGL 404 >gb|ESW03953.1| hypothetical protein PHAVU_011G055100g [Phaseolus vulgaris] Length = 359 Score = 396 bits (1018), Expect = e-108 Identities = 208/305 (68%), Positives = 230/305 (75%), Gaps = 30/305 (9%) Frame = +2 Query: 257 ITEEH---VQQKLCRPLDIEDFAYGKSCGYSW---------------------STLP--- 355 +T++H V QK P +IEDFAY CG + ST P Sbjct: 55 LTQDHKVPVTQKFGLP-EIEDFAY---CGGNELTRRRKSEMESDVASVASEVASTRPGGK 110 Query: 356 -PDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTH 532 P +WE+VLEGIRKMRSS DAPVD+MGCEKAG +LPPKERRFAVL SSLLSSQTKD VTH Sbjct: 111 SPAHWEKVLEGIRKMRSSADAPVDTMGCEKAGDTLPPKERRFAVLVSSLLSSQTKDPVTH 170 Query: 533 GAIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPST 712 GAIQRLLQ DLLTP+A++ +E TIK LIYPVGFYTRKA NLKKIA ICL+KY GDIPS+ Sbjct: 171 GAIQRLLQNDLLTPEAINNVDEETIKKLIYPVGFYTRKATNLKKIANICLMKYHGDIPSS 230 Query: 713 XXXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEET 892 PGIGPKMAHLVMN W+NVQGICVDTHVHRICNRLGWVSR GT QKTS+PEET Sbjct: 231 IDQLLLLPGIGPKMAHLVMNAGWNNVQGICVDTHVHRICNRLGWVSRLGTNQKTSTPEET 290 Query: 893 RESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKE--G*SPSSTP 1066 RESLQ WLPKEEWVPINPLLVGFGQT+C+PLRPRCG C + + CPSAFKE SPSS Sbjct: 291 RESLQRWLPKEEWVPINPLLVGFGQTICTPLRPRCGECSVRDLCPSAFKETSNSSPSSKS 350 Query: 1067 RKPRL 1081 +KP L Sbjct: 351 KKPGL 355 >ref|XP_003539044.2| PREDICTED: endonuclease III-like protein 1-like [Glycine max] Length = 357 Score = 394 bits (1013), Expect = e-107 Identities = 193/241 (80%), Positives = 206/241 (85%), Gaps = 2/241 (0%) Frame = +2 Query: 356 PDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHG 535 P WE+VLEGIRKMR S DAPVD+MGCEKAG +LPPKERRFAVL SSLLSSQTKD VTHG Sbjct: 109 PAQWEKVLEGIRKMRCSADAPVDTMGCEKAGETLPPKERRFAVLVSSLLSSQTKDPVTHG 168 Query: 536 
AIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTX 715 AIQRLLQ DLLT DA++ A+E TIK LIYPVGFYTRKA NLKKIA ICL+KY GDIPS+ Sbjct: 169 AIQRLLQNDLLTADAINDADEETIKKLIYPVGFYTRKASNLKKIANICLMKYDGDIPSSI 228 Query: 716 XXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETR 895 PGIGPKMAHLVMNV W+NVQGICVDTHVHRICNRLGWVSR GTKQKTS+PEETR Sbjct: 229 EQLLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLGWVSRLGTKQKTSTPEETR 288 Query: 896 ESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKE--G*SPSSTPR 1069 E LQ WLPKEEWVPINPLLVGFGQT+C+PLRPRCG C ISE CPSAFKE SPSS+ Sbjct: 289 EELQRWLPKEEWVPINPLLVGFGQTICTPLRPRCGECSISELCPSAFKETSNSSPSSSKS 348 Query: 1070 K 1072 K Sbjct: 349 K 349 >ref|XP_004507328.1| PREDICTED: endonuclease III-like protein 1-like [Cicer arietinum] Length = 387 Score = 391 bits (1005), Expect = e-106 Identities = 189/235 (80%), Positives = 202/235 (85%) Frame = +2 Query: 356 PDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHG 535 P +WE LEGIRKMR S DAPVD+MGCEKAG++LPPKERRFAVL SSLLSSQTKDHV HG Sbjct: 140 PADWEETLEGIRKMRCSADAPVDTMGCEKAGSTLPPKERRFAVLVSSLLSSQTKDHVNHG 199 Query: 536 AIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTX 715 AIQRLLQ DLLTPDA++ A+E TIK LIYPVGFYTRKA NLKKIA ICL+KYGGDIPST Sbjct: 200 AIQRLLQNDLLTPDAINNADEETIKKLIYPVGFYTRKATNLKKIANICLMKYGGDIPSTL 259 Query: 716 XXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETR 895 PGIGPKMAHLVMNVAW+NVQGICVDTHVHRICNRLGWVSR GTKQKT +PEETR Sbjct: 260 EQLLLLPGIGPKMAHLVMNVAWNNVQGICVDTHVHRICNRLGWVSRLGTKQKTLTPEETR 319 Query: 896 ESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSS 1060 ESLQ WLP+EEW PINPLLVGFGQT+C+PLRPRCG C IS C SAFKE SS Sbjct: 320 ESLQRWLPREEWDPINPLLVGFGQTICTPLRPRCGECGISHLCLSAFKEASDSSS 374 >ref|XP_004152104.1| PREDICTED: endonuclease III-like protein 1-like [Cucumis sativus] gi|449521044|ref|XP_004167541.1| PREDICTED: endonuclease III-like protein 1-like [Cucumis sativus] Length = 386 Score = 386 bits (992), Expect = e-104 Identities = 208/379 (54%), Positives = 253/379 (66%), Gaps = 31/379 (8%) 
Frame = +2 Query: 41 PLRIPSSLPLFSARINKIKMPQTSFSSKTQNPTTRNKNPGNENV---NAGSKTELQXXXX 211 P+RIP+ F+ RI M + S SS + NPG +V N S+ E + Sbjct: 6 PIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVR 65 Query: 212 XXXXXXXXXXXXXXDITEEHVQQKLCRPLDIEDFAYGKSC-------------------- 331 ++ E + K P +IEDFA+ ++ Sbjct: 66 RRVKKIAESQDSGFEV-EPKIDTKRSCPPNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIE 124 Query: 332 --------GYSWSTLPPDNWERVLEGIRKMRSSEDAPVDSMGCEKAGTSLPPKERRFAVL 487 G + PP NWE+VL+GIR+MRSSE+APVD+MGC +AG++LPPKERRFAVL Sbjct: 125 DSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVL 184 Query: 488 ASSLLSSQTKDHVTHGAIQRLLQQDLLTPDAVDKAEEATIKSLIYPVGFYTRKAGNLKKI 667 ASSLLSSQTKDHVTHGA RL + LLT DA+DKA+E TIKSLIYPVGFY+ KA NLKKI Sbjct: 185 ASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKI 244 Query: 668 AKICLLKYGGDIPSTXXXXXXXPGIGPKMAHLVMNVAWDNVQGICVDTHVHRICNRLGWV 847 A+ICL+KYGGDIP + PGIGPK+AHL+M +AW++VQGICVDTHVHRICNRLGWV Sbjct: 245 ARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWV 304 Query: 848 SRPGTKQKTSSPEETRESLQLWLPKEEWVPINPLLVGFGQTVCSPLRPRCGICIISEFCP 1027 S G+KQKTS+PEETR L+LWLPKEEWVPINPLLVGFGQT+C+PLRP+CG C +S+ CP Sbjct: 305 SGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCP 364 Query: 1028 SAFKEG*SPSSTPRKPRLK 1084 SAFKE SPS P+LK Sbjct: 365 SAFKESSSPS-----PKLK 378 >ref|XP_006410282.1| hypothetical protein EUTSA_v10016815mg [Eutrema salsugineum] gi|557111451|gb|ESQ51735.1| hypothetical protein EUTSA_v10016815mg [Eutrema salsugineum] Length = 373 Score = 386 bits (991), Expect = e-104 Identities = 199/348 (57%), Positives = 242/348 (69%), Gaps = 16/348 (4%) Frame = +2 Query: 83 INKIKMPQTSFSSKTQNPTTRNKNPGNENVNAGSKTELQXXXXXXXXXXXXXXXXXXDIT 262 +N + S S TQ + + +P +GS+T + I Sbjct: 24 MNHLNYGTVSSSKPTQQHSLPDSDPEPAKPASGSETRVYTRKKRLKQEAFQPLEKDSCI- 82 Query: 263 EEHVQQKLCRPLDIEDFAYGKSCGYSWSTLP----------------PDNWERVLEGIRK 394 + Q++LCR DIE+FAY K+ S S P+NW +VLEGIR+ Sbjct: 83 --NTQKQLCRLPDIEEFAYKKNTRSSSSRRSTETSITVTSVKTAGNAPENWVKVLEGIRQ 140 Query: 395 
MRSSEDAPVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRLLQQDLLTP 574 MRSSEDAPVDSMGC+KAG+ LPP ERRFAVL +LLSSQTKD V + AI RL Q LLTP Sbjct: 141 MRSSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDEVNNAAIHRLHQNGLLTP 200 Query: 575 DAVDKAEEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTXXXXXXXPGIGPKM 754 +AVDKA+E+T++ LIYPVGFYTRKA +KKIAKICL+KY GDIPS+ PGIGPKM Sbjct: 201 EAVDKADESTLRELIYPVGFYTRKATYMKKIAKICLVKYNGDIPSSLDDLLALPGIGPKM 260 Query: 755 AHLVMNVAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRESLQLWLPKEEWV 934 AHL++++AW++VQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETR +LQ WLPKEEWV Sbjct: 261 AHLILHIAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRVALQQWLPKEEWV 320 Query: 935 PINPLLVGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSSTPRKPR 1078 INPLLVGFGQT+C+PLRPRC C +++ CP+AFKE SPSS +K + Sbjct: 321 AINPLLVGFGQTICTPLRPRCETCSVTKLCPAAFKEASSPSSKLKKSK 368 >emb|CAC16135.1| endonuclease III homologue [Arabidopsis thaliana] Length = 354 Score = 385 bits (988), Expect = e-104 Identities = 190/280 (67%), Positives = 219/280 (78%), Gaps = 16/280 (5%) Frame = +2 Query: 281 KLCRPLDIEDFAYGKSCGYSWST----------------LPPDNWERVLEGIRKMRSSED 412 KLC DIEDFAY K+ G S+ PP+NW VLEGIR+MRSSED Sbjct: 68 KLCGLPDIEDFAYKKTIGSPSSSRSTETSITVTSVKTAGYPPENWVEVLEGIRQMRSSED 127 Query: 413 APVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRLLQQDLLTPDAVDKA 592 APVDSMGC+KAG+ LPP ERRFAVL +LLSSQTKD V + AI RL Q LLTP+AVDKA Sbjct: 128 APVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDQVNNAAIHRLHQNGLLTPEAVDKA 187 Query: 593 EEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTXXXXXXXPGIGPKMAHLVMN 772 +E+TIK LIYPVGFYTRKA +KKIA+ICL+KY GDIPS+ PGIGPKMAHL+++ Sbjct: 188 DESTIKELIYPVGFYTRKATYMKKIARICLVKYDGDIPSSLDDLLSLPGIGPKMAHLILH 247 Query: 773 VAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRESLQLWLPKEEWVPINPLL 952 +AW++VQGICVDTHVHRICNRLGWVSRPGTKQKT+SPEETR +LQ WLPKEEWV INPLL Sbjct: 248 IAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTTSPEETRVALQQWLPKEEWVAINPLL 307 Query: 953 VGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSSTPRK 1072 VGFGQ +C+PLRPRC C +S+ CP+AFKE SPSS +K Sbjct: 308 VGFGQMICTPLRPRCEACSVSKLCPAAFKETSSPSSKLKK 347 >ref|NP_565725.1| protein NTH1 
[Arabidopsis thaliana] gi|20198157|gb|AAD26474.2| putative endonuclease [Arabidopsis thaliana] gi|330253455|gb|AEC08549.1| protein NTH1 [Arabidopsis thaliana] Length = 379 Score = 384 bits (986), Expect = e-104 Identities = 189/280 (67%), Positives = 219/280 (78%), Gaps = 16/280 (5%) Frame = +2 Query: 281 KLCRPLDIEDFAYGKSCGYSWST----------------LPPDNWERVLEGIRKMRSSED 412 KLC DIEDFAY K+ G S+ PP+NW VLEGIR+MRSSED Sbjct: 93 KLCGLPDIEDFAYKKTIGSPSSSRSTETSITVTSVKTAGYPPENWVEVLEGIRQMRSSED 152 Query: 413 APVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRLLQQDLLTPDAVDKA 592 APVDSMGC+KAG+ LPP ERRFAVL +LLSSQTKD V + AI RL Q LLTP+AVDKA Sbjct: 153 APVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDQVNNAAIHRLHQNGLLTPEAVDKA 212 Query: 593 EEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTXXXXXXXPGIGPKMAHLVMN 772 +E+TIK LIYPVGFYTRKA +KKIA+ICL+KY GDIPS+ PGIGPKMAHL+++ Sbjct: 213 DESTIKELIYPVGFYTRKATYMKKIARICLVKYDGDIPSSLDDLLSLPGIGPKMAHLILH 272 Query: 773 VAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRESLQLWLPKEEWVPINPLL 952 +AW++VQGICVDTHVHRICNRLGWVSRPGTKQKT+SPEETR +LQ WLPKEEWV INPLL Sbjct: 273 IAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTTSPEETRVALQQWLPKEEWVAINPLL 332 Query: 953 VGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSSTPRK 1072 VGFGQ +C+P+RPRC C +S+ CP+AFKE SPSS +K Sbjct: 333 VGFGQMICTPIRPRCEACSVSKLCPAAFKETSSPSSKLKK 372 >ref|NP_001077988.1| protein NTH1 [Arabidopsis thaliana] gi|17380754|gb|AAL36207.1| putative endonuclease [Arabidopsis thaliana] gi|20259623|gb|AAM14168.1| putative endonuclease [Arabidopsis thaliana] gi|330253456|gb|AEC08550.1| protein NTH1 [Arabidopsis thaliana] Length = 377 Score = 384 bits (986), Expect = e-104 Identities = 189/280 (67%), Positives = 219/280 (78%), Gaps = 16/280 (5%) Frame = +2 Query: 281 KLCRPLDIEDFAYGKSCGYSWST----------------LPPDNWERVLEGIRKMRSSED 412 KLC DIEDFAY K+ G S+ PP+NW VLEGIR+MRSSED Sbjct: 91 KLCGLPDIEDFAYKKTIGSPSSSRSTETSITVTSVKTAGYPPENWVEVLEGIRQMRSSED 150 Query: 413 APVDSMGCEKAGTSLPPKERRFAVLASSLLSSQTKDHVTHGAIQRLLQQDLLTPDAVDKA 592 APVDSMGC+KAG+ LPP ERRFAVL +LLSSQTKD V + AI RL Q LLTP+AVDKA 
Sbjct: 151 APVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDQVNNAAIHRLHQNGLLTPEAVDKA 210 Query: 593 EEATIKSLIYPVGFYTRKAGNLKKIAKICLLKYGGDIPSTXXXXXXXPGIGPKMAHLVMN 772 +E+TIK LIYPVGFYTRKA +KKIA+ICL+KY GDIPS+ PGIGPKMAHL+++ Sbjct: 211 DESTIKELIYPVGFYTRKATYMKKIARICLVKYDGDIPSSLDDLLSLPGIGPKMAHLILH 270 Query: 773 VAWDNVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRESLQLWLPKEEWVPINPLL 952 +AW++VQGICVDTHVHRICNRLGWVSRPGTKQKT+SPEETR +LQ WLPKEEWV INPLL Sbjct: 271 IAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTTSPEETRVALQQWLPKEEWVAINPLL 330 Query: 953 VGFGQTVCSPLRPRCGICIISEFCPSAFKEG*SPSSTPRK 1072 VGFGQ +C+P+RPRC C +S+ CP+AFKE SPSS +K Sbjct: 331 VGFGQMICTPIRPRCEACSVSKLCPAAFKETSSPSSKLKK 370