BLASTX nr result
ID: Mentha22_contig00029012
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha22_contig00029012 (1198 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar] 318 3e-84 gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar] ... 316 1e-83 gb|ACX43963.1| cysteine protease 3, partial [Brachiaria hybrid c... 292 2e-76 gb|AHW80377.1| cathepsin L protease [Cerebratulus lacteus] 277 7e-72 ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liver... 277 7e-72 gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus] gi|4887000|g... 270 7e-70 dbj|BAE27712.1| unnamed protein product [Mus musculus] 270 7e-70 ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus] gi|56... 270 7e-70 dbj|BAB27719.1| unnamed protein product [Mus musculus] 270 7e-70 dbj|BAE36450.1| unnamed protein product [Mus musculus] 270 9e-70 ref|XP_006872648.1| PREDICTED: cathepsin L1-like [Chrysochloris ... 270 1e-69 dbj|BAE35627.1| unnamed protein product [Mus musculus] 270 1e-69 gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata] 270 1e-69 ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis M... 269 2e-69 dbj|BAE38161.1| unnamed protein product [Mus musculus] 269 2e-69 dbj|BAE22939.1| unnamed protein product [Mus musculus] 269 2e-69 dbj|BAE31977.1| unnamed protein product [Mus musculus] 269 2e-69 gb|ABY58967.1| cathepsin L [Toxoplasma gondii] 269 2e-69 ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [... 269 2e-69 gb|EJY65772.1| Cathepsin L [Oxytricha trifallax] 268 3e-69 >gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar] Length = 338 Score = 318 bits (815), Expect = 3e-84 Identities = 163/312 (52%), Positives = 207/312 (66%), Gaps = 1/312 (0%) Frame = -1 Query: 1090 TERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITSTMGVNAFTDM 911 +E Q F+ F+K YSK Y+ +F +R+ FKAN++ I LHN+ A + TMG+N F D+ Sbjct: 34 SEVMLQDMFTAFMKQYSKAYSHAEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADL 93 Query: 910 TSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWSFS 731 + EF K GY ++ PTS DWRT AVTP+KDQ QCGSCW+FS Sbjct: 94 SFEEFKGKYFGYKHVEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFS 153 Query: 730 ATGSIEGAWFL-AKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEAS 554 ATGSIEGAW L K LTSLSEQQL+DCS GD C GGLMD AF+++I NKGIC+E++ Sbjct: 154 ATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYAFEYIIANKGICAESA 213 Query: 553 YPYKAIDEKCKKTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEADQPIFQ 374 YPYK + C+K+C V TIS + DV ++ + + L A +GPVS+AIEADQ FQ Sbjct: 214 YPYKGVGGLCQKSCTKVVTISGYKDV----ASGDEASLLNAVGTVGPVSVAIEADQAGFQ 269 Query: 373 MYTGGVITGPSCGTNLDHGVLLVGYGTDSKLGDYWIVKNSWGQAWGIESGYVRLARGQNE 194 Y+ GV +G +CG NLDHGVL VGYGT DYWIVKNSWG +WG ESGY+R+ R +N+ Sbjct: 270 FYSSGVFSG-TCGHNLDHGVLAVGYGTTGS-QDYWIVKNSWGTSWG-ESGYIRMIRNKNQ 326 Query: 193 CGMNSAASYPVV 158 CG+ SYP V Sbjct: 327 CGIAIQPSYPTV 338 >gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar] gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar] gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar] gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar] gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar] gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar] gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar] gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar] gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar] Length = 338 Score = 316 bits (810), Expect = 1e-83 Identities = 162/312 (51%), Positives = 207/312 (66%), Gaps = 1/312 (0%) Frame = -1 Query: 1090 TERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITSTMGVNAFTDM 911 +E Q F+ F+K YSK Y+ +F +R+ FKAN++ I LHN+ A + TMG+N F D+ Sbjct: 34 SEVMLQDMFTAFMKQYSKAYSHAEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADL 93 Query: 910 TSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWSFS 731 + EF K GY ++ PTS DWRT AVTP+KDQ QCGSCW+FS Sbjct: 94 SFEEFKGKYFGYKHVEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFS 153 Query: 730 ATGSIEGAWFL-AKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEAS 554 ATGSIEGAW L K LTSLSEQQL+DCS G+ C GGLMD AF+++I NKGIC+E++ Sbjct: 154 ATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAESA 213 Query: 553 YPYKAIDEKCKKTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEADQPIFQ 374 YPYK + C+K+C V TIS + DV ++ + + L A +GPVS+AIEADQ FQ Sbjct: 214 YPYKGVGGLCQKSCTKVVTISGYKDV----ASGDEASLLNAVGTVGPVSVAIEADQAGFQ 269 Query: 373 MYTGGVITGPSCGTNLDHGVLLVGYGTDSKLGDYWIVKNSWGQAWGIESGYVRLARGQNE 194 Y+ GV +G +CG NLDHGVL VGYGT DYWIVKNSWG +WG ESGY+R+ R +N+ Sbjct: 270 FYSSGVFSG-TCGHNLDHGVLAVGYGTTGS-QDYWIVKNSWGTSWG-ESGYIRMIRNKNQ 326 Query: 193 CGMNSAASYPVV 158 CG+ SYP V Sbjct: 327 CGIAIQPSYPTV 338 >gb|ACX43963.1| cysteine protease 3, partial [Brachiaria hybrid cultivar] Length = 319 Score = 292 bits (747), Expect = 2e-76 Identities = 152/293 (51%), Positives = 194/293 (66%), Gaps = 1/293 (0%) Frame = -1 Query: 1090 TERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITSTMGVNAFTDM 911 +E Q F+ F+K YSK Y+ +F +R+ FKA+++ I LHN+ A + TMG+N F D+ Sbjct: 34 SEVMLQDMFTAFMKQYSKAYSHAEFSSRFNQFKASVETIRLHNTLANASYTMGLNEFADL 93 Query: 910 TSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWSFS 731 + EF K G ++ PTS DWRT AVTP+KDQ QCGSCW+FS Sbjct: 94 SFEEFKGKYFGCKHVEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFS 153 Query: 730 ATGSIEGAWFL-AKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEAS 554 ATGSIEGAW L K LTSLSEQQL+DCS G+ C GGLMD AF+++I NKGIC+E++ Sbjct: 154 ATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAESA 213 Query: 553 YPYKAIDEKCKKTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEADQPIFQ 374 YPYK + C+K+C V TIS DV ++ + ++L A +GPVS+AIEADQ FQ Sbjct: 214 YPYKGVGGLCQKSCTKVVTISGHKDV----ASGDEASSLNAVGTVGPVSVAIEADQAGFQ 269 Query: 373 MYTGGVITGPSCGTNLDHGVLLVGYGTDSKLGDYWIVKNSWGQAWGIESGYVR 215 Y+ GV +G +CG NLDHGVL VGYGT DYWIVKNSWG +WG ESGY+R Sbjct: 270 FYSSGVFSG-TCGHNLDHGVLAVGYGTTGS-QDYWIVKNSWGTSWG-ESGYIR 319 >gb|AHW80377.1| cathepsin L protease [Cerebratulus lacteus] Length = 327 Score = 277 bits (708), Expect = 7e-72 Identities = 151/311 (48%), Positives = 202/311 (64%), Gaps = 7/311 (2%) Frame = -1 Query: 1069 EFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGI---TSTMGVNAFTDMTSVE 899 E++ F T+ K Y ++ R IF NL +I+ HN +A + T MGVN ++D TS E Sbjct: 25 EWTNFKATHVKTYEVEEELVRKAIFHNNLRIIQKHNVEADLGQHTYWMGVNEYSDWTSTE 84 Query: 898 FASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWSFSATGS 719 F + MNGY A+ +K VP S DWR KG VTPVK+Q QCGSCW+FSATGS Sbjct: 85 FRNYMNGYKRSNVTSGSTFMPASYIK-VPASVDWRPKGYVTPVKNQGQCGSCWAFSATGS 143 Query: 718 IEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEASYPYKA 539 +EG F GNL SLSEQ L+DCS G+ CEGGLMD AF+++ N GI +E YPYK Sbjct: 144 LEGQHFKKTGNLVSLSEQNLVDCSKSYGNMGCEGGLMDSAFKYIKANHGIDTEKCYPYKH 203 Query: 538 IDEKC--KKTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEADQPIFQMYT 365 +DE+C KK+C AT++ +VDV + + L A+ +GP+S+AI+A FQMY+ Sbjct: 204 VDERCHFKKSCIG-ATVTGYVDV----KQMDENALLQASATIGPISVAIDAGHQSFQMYS 258 Query: 364 GGVITGPSCG-TNLDHGVLLVGYGTDSKLGDYWIVKNSWGQAWGIESGYVRLARGQ-NEC 191 GV P C T LDHGVL+VGYGT+S DYW+VKNSWG WG ++GY+ ++R + N+C Sbjct: 259 HGVYNEPRCSQTQLDHGVLVVGYGTESG-QDYWLVKNSWGATWG-QNGYIMMSRNKNNQC 316 Query: 190 GMNSAASYPVV 158 G+ ++ASYP+V Sbjct: 317 GIATSASYPLV 327 >ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool] gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool] Length = 415 Score = 277 bits (708), Expect = 7e-72 Identities = 153/305 (50%), Positives = 188/305 (61%), Gaps = 7/305 (2%) Frame = -1 Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFN-RYEIFKANLDLIELHNSQAGITSTMGVNAFTDM 911 E +Q F F TY K YAT++ RY IFK NL I HN Q G + ++ +N F D+ Sbjct: 112 EEHFQNAFGSFRATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQ-GYSYSLKMNHFGDL 170 Query: 910 TSVEFASKMNGYXXXXXXXXXXXXXANTM-----KDVPTSWDWRTKGAVTPVKDQAQCGS 746 + EF K GY A + DVP++ DWR KG VTPVKDQ CGS Sbjct: 171 SREEFRRKYLGYNKSRNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVTPVKDQRDCGS 230 Query: 745 CWSFSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGIC 566 CW+FSATG++EGA G L SLSEQ+L+DCS EG+ C GG M+DAFQ+V+D+ G+C Sbjct: 231 CWAFSATGALEGAHCAKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLC 290 Query: 565 SEASYPYKAIDEKCKKTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEADQ 386 SE YPY A D +CK+ CK V TIS F DV ++TA+ AAL PVSIAIEADQ Sbjct: 291 SEEGYPYLARDGECKRACKKVVTISGFKDVP-----RKSETAMKAALAHSPVSIAIEADQ 345 Query: 385 PIFQMYTGGVITGPSCGTNLDHGVLLVGYGTDSKL-GDYWIVKNSWGQAWGIESGYVRLA 209 FQ Y GV SCGT+LDHGVLLVGYGTD + D+WI+KNSWG WG GY+ +A Sbjct: 346 LPFQFYHEGVFDA-SCGTDLDHGVLLVGYGTDKETKKDFWIMKNSWGSGWG-RDGYMYMA 403 Query: 208 RGQNE 194 + E Sbjct: 404 MHKGE 408 >gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus] gi|4887000|gb|AAD32137.1|AF121838_1 cathepsin L [Mus musculus] gi|4887002|gb|AAD32138.1|AF121839_1 cathepsin L [Mus musculus] gi|200501|gb|AAA39984.1| preprocathepsin L precursor [Mus musculus] Length = 334 Score = 270 bits (691), Expect = 7e-70 Identities = 145/320 (45%), Positives = 203/320 (63%), Gaps = 10/320 (3%) Frame = -1 Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITS---TMGVNAFT 917 ++ + E+ ++ T+ + Y T++ R I++ N+ +I+LHN + +M +NAF Sbjct: 22 DQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRIIQLHNGEYSNGQHGFSMEMNAFG 81 Query: 916 DMTSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWS 737 DMT+ EF +NGY +K +P S DWR KG VTPVK+Q QCGSCW+ Sbjct: 82 DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-IPKSVDWREKGCVTPVKNQGQCGSCWA 140 Query: 736 FSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEA 557 FSA+G +EG FL G L SLSEQ L+DCSH +G+ C GGLMD AFQ++ +N G+ SE Sbjct: 141 FSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEE 200 Query: 556 SYPYKAIDEKCKKTCK-SVATISSFVDVDFDQSNPNNDTALMAAL-QMGPVSIAIEADQP 383 SYPY+A D CK + +VA + FVD+ P + ALM A+ +GP+S+A++A P Sbjct: 201 SYPYEAKDGSCKYRAEFAVANDTGFVDI------PQQEKALMKAVATVGPISVAMDASHP 254 Query: 382 IFQMYTGGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQAWGIESGYVR 215 Q Y+ G+ P+C + NLDHGVLLVGY GTDS YW+VKNSWG WG+E GY++ Sbjct: 255 SLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGME-GYIK 313 Query: 214 LARGQ-NECGMNSAASYPVV 158 +A+ + N CG+ +AASYPVV Sbjct: 314 IAKDRDNHCGLATAASYPVV 333 >dbj|BAE27712.1| unnamed protein product [Mus musculus] Length = 334 Score = 270 bits (691), Expect = 7e-70 Identities = 145/320 (45%), Positives = 203/320 (63%), Gaps = 10/320 (3%) Frame = -1 Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITS---TMGVNAFT 917 ++ + E+ ++ T+ + Y T++ R I++ N+ +I+LHN + +M +NAF Sbjct: 22 DQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81 Query: 916 DMTSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWS 737 DMT+ EF +NGY +K +P S DWR KG VTPVK+Q QCGSCW+ Sbjct: 82 DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-IPKSVDWREKGCVTPVKNQGQCGSCWA 140 Query: 736 FSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEA 557 FSA+G +EG FL G L SLSEQ L+DCSH +G+ C GGLMD AFQ++ +N G+ SE Sbjct: 141 FSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEE 200 Query: 556 SYPYKAIDEKCKKTCK-SVATISSFVDVDFDQSNPNNDTALMAAL-QMGPVSIAIEADQP 383 SYPY+A D CK + +VA + FVD+ P + ALM A+ +GP+S+A++A P Sbjct: 201 SYPYEAKDGSCKYRAEFAVANDTGFVDI------PQQEEALMKAVATVGPISVAMDASHP 254 Query: 382 IFQMYTGGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQAWGIESGYVR 215 Q Y+ G+ P+C + NLDHGVLLVGY GTDS YW+VKNSWG WG+E GY++ Sbjct: 255 SLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGME-GYIK 313 Query: 214 LARGQ-NECGMNSAASYPVV 158 +A+ + N CG+ +AASYPVV Sbjct: 314 IAKDRDNHCGLATAASYPVV 333 >ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus] gi|568983003|ref|XP_006517143.1| PREDICTED: cathepsin L1 isoform X1 [Mus musculus] gi|568983005|ref|XP_006517144.1| PREDICTED: cathepsin L1 isoform X2 [Mus musculus] gi|115742|sp|P06797.2|CATL1_MOUSE RecName: Full=Cathepsin L1; AltName: Full=Cathepsin L; AltName: Full=Major excreted protein; Short=MEP; AltName: Full=p39 cysteine proteinase; Contains: RecName: Full=Cathepsin L1 heavy chain; Contains: RecName: Full=Cathepsin L1 light chain; Flags: Precursor gi|53047|emb|CAA29470.1| unnamed protein product [Mus musculus] gi|309186|gb|AAA37445.1| preprocysteine proteinase [Mus musculus] gi|12832050|dbj|BAB21945.1| unnamed protein product [Mus musculus] gi|26340196|dbj|BAC33761.1| unnamed protein product [Mus musculus] gi|45768760|gb|AAH68163.1| Cathepsin L [Mus musculus] gi|74139700|dbj|BAE31701.1| unnamed protein product [Mus musculus] gi|74146632|dbj|BAE41323.1| unnamed protein product [Mus musculus] gi|74151584|dbj|BAE41141.1| unnamed protein product [Mus musculus] gi|74185397|dbj|BAE30172.1| unnamed protein product [Mus musculus] gi|74197196|dbj|BAE35143.1| unnamed protein product [Mus musculus] gi|74203006|dbj|BAE26206.1| unnamed protein product [Mus musculus] gi|74219606|dbj|BAE29572.1| unnamed protein product [Mus musculus] gi|148684295|gb|EDL16242.1| cathepsin L [Mus musculus] Length = 334 Score = 270 bits (691), Expect = 7e-70 Identities = 145/320 (45%), Positives = 203/320 (63%), Gaps = 10/320 (3%) Frame = -1 Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITS---TMGVNAFT 917 ++ + E+ ++ T+ + Y T++ R I++ N+ +I+LHN + +M +NAF Sbjct: 22 DQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81 Query: 916 DMTSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWS 737 DMT+ EF +NGY +K +P S DWR KG VTPVK+Q QCGSCW+ Sbjct: 82 DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-IPKSVDWREKGCVTPVKNQGQCGSCWA 140 Query: 736 FSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEA 557 FSA+G +EG FL G L SLSEQ L+DCSH +G+ C GGLMD AFQ++ +N G+ SE Sbjct: 141 FSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEE 200 Query: 556 SYPYKAIDEKCKKTCK-SVATISSFVDVDFDQSNPNNDTALMAAL-QMGPVSIAIEADQP 383 SYPY+A D CK + +VA + FVD+ P + ALM A+ +GP+S+A++A P Sbjct: 201 SYPYEAKDGSCKYRAEFAVANDTGFVDI------PQQEKALMKAVATVGPISVAMDASHP 254 Query: 382 IFQMYTGGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQAWGIESGYVR 215 Q Y+ G+ P+C + NLDHGVLLVGY GTDS YW+VKNSWG WG+E GY++ Sbjct: 255 SLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGME-GYIK 313 Query: 214 LARGQ-NECGMNSAASYPVV 158 +A+ + N CG+ +AASYPVV Sbjct: 314 IAKDRDNHCGLATAASYPVV 333 >dbj|BAB27719.1| unnamed protein product [Mus musculus] Length = 334 Score = 270 bits (691), Expect = 7e-70 Identities = 145/320 (45%), Positives = 203/320 (63%), Gaps = 10/320 (3%) Frame = -1 Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITS---TMGVNAFT 917 ++ + E+ ++ T+ + Y T++ R I++ N+ +I+LHN + +M +NAF Sbjct: 22 DQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81 Query: 916 DMTSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWS 737 DMT+ EF +NGY +K +P S DWR KG VTPVK+Q QCGSCW+ Sbjct: 82 DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-IPKSVDWREKGCVTPVKNQGQCGSCWA 140 Query: 736 FSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEA 557 FSA+G +EG FL G L SLSEQ L+DCSH +G+ C GGLMD AFQ++ +N G+ SE Sbjct: 141 FSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDYAFQYIKENGGLDSEE 200 Query: 556 SYPYKAIDEKCKKTCK-SVATISSFVDVDFDQSNPNNDTALMAAL-QMGPVSIAIEADQP 383 SYPY+A D CK + +VA + FVD+ P + ALM A+ +GP+S+A++A P Sbjct: 201 SYPYEAKDGSCKYRAEFAVANDTGFVDI------PQQEKALMKAVATVGPISVAMDASHP 254 Query: 382 IFQMYTGGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQAWGIESGYVR 215 Q Y+ G+ P+C + NLDHGVLLVGY GTDS YW+VKNSWG WG+E GY++ Sbjct: 255 SLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGME-GYIK 313 Query: 214 LARGQ-NECGMNSAASYPVV 158 +A+ + N CG+ +AASYPVV Sbjct: 314 IAKDRDNHCGLATAASYPVV 333 >dbj|BAE36450.1| unnamed protein product [Mus musculus] Length = 334 Score = 270 bits (690), Expect = 9e-70 Identities = 145/320 (45%), Positives = 203/320 (63%), Gaps = 10/320 (3%) Frame = -1 Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITS---TMGVNAFT 917 ++ + E+ ++ T+ + Y T++ R I++ N+ +I+LHN + +M +NAF Sbjct: 22 DQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81 Query: 916 DMTSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWS 737 DMT+ EF +NGY +K +P S DWR KG VTPVK+Q QCGSCW+ Sbjct: 82 DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-IPKSVDWREKGCVTPVKNQGQCGSCWA 140 Query: 736 FSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEA 557 FSA+G +EG FL G L SLSEQ L+DCSH +G+ C GGLMD AFQ++ +N G+ SE Sbjct: 141 FSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEE 200 Query: 556 SYPYKAIDEKCKKTCK-SVATISSFVDVDFDQSNPNNDTALMAAL-QMGPVSIAIEADQP 383 SYPY+A D CK + +VA + FVD+ P + ALM A+ +GP+S+A++A P Sbjct: 201 SYPYEAKDGSCKYRAEFAVANGTGFVDI------PQQEKALMKAVATVGPISVAMDASHP 254 Query: 382 IFQMYTGGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQAWGIESGYVR 215 Q Y+ G+ P+C + NLDHGVLLVGY GTDS YW+VKNSWG WG+E GY++ Sbjct: 255 SLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGME-GYIK 313 Query: 214 LARGQ-NECGMNSAASYPVV 158 +A+ + N CG+ +AASYPVV Sbjct: 314 IAKDRDNHCGLATAASYPVV 333 >ref|XP_006872648.1| PREDICTED: cathepsin L1-like [Chrysochloris asiatica] Length = 334 Score = 270 bits (689), Expect = 1e-69 Identities = 156/329 (47%), Positives = 204/329 (62%), Gaps = 11/329 (3%) Frame = -1 Query: 1111 MANAAYVTERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHN---SQAGITS 941 +A+AA ++ +++++ TY + Y+TD+ R +++ N+ +IELHN SQ Sbjct: 14 IASAALEHDQNLDAQWNQWKSTYKRPYSTDEGGWRRSVWEKNMKMIELHNREYSQGKHGF 73 Query: 940 TMGVNAFTDMTSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQ 761 TM +NAF DMT+ EF MNG+ +VP S DW KG VTPVK+Q Sbjct: 74 TMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKMFREP-VFLEVPKSVDWTQKGYVTPVKNQ 132 Query: 760 AQCGSCWSFSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVID 581 QCGSCW+FSATG++EG F G L SLSEQ L+DCS EG++ C GGLMD AFQ+V D Sbjct: 133 GQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRSEGNNGCNGGLMDYAFQYVKD 192 Query: 580 NKGICSEASYPYKAID-EKCK-KTCKSVATISSFVDVDFDQSNPNNDTALMAAL-QMGPV 410 N GI SEASYPY A D + C K SVA + FVD+ P + ALM A+ +GPV Sbjct: 193 NLGIDSEASYPYLATDTQTCNYKPEYSVANDTGFVDI------PPREKALMKAVATVGPV 246 Query: 409 SIAIEADQPIFQMYTGGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQA 242 S+AI+A FQ Y G+ P C + +LDHGVL+VGY G DS YWIVKNSWG + Sbjct: 247 SVAIDAGHESFQFYKSGIYYEPDCSSKDLDHGVLVVGYGYEGKDSANNKYWIVKNSWGSS 306 Query: 241 WGIESGYVRLARGQ-NECGMNSAASYPVV 158 WG +GYV++A+ Q N CG+ +AASYP V Sbjct: 307 WG-TNGYVKMAKDQNNHCGIATAASYPTV 334 >dbj|BAE35627.1| unnamed protein product [Mus musculus] Length = 334 Score = 270 bits (689), Expect = 1e-69 Identities = 145/320 (45%), Positives = 202/320 (63%), Gaps = 10/320 (3%) Frame = -1 Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITS---TMGVNAFT 917 ++ + E+ ++ T+ + Y T++ R I++ N+ +I+LHN + +M +NAF Sbjct: 22 DQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81 Query: 916 DMTSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWS 737 DMT+ EF +NGY +K +P S DWR KG VTPVK+Q QCGSCW+ Sbjct: 82 DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-IPKSVDWREKGCVTPVKNQGQCGSCWA 140 Query: 736 FSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEA 557 FSA+G +EG FL G L SLSEQ L+DCSH +G+ C GGLMD AFQ++ +N G+ SE Sbjct: 141 FSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEE 200 Query: 556 SYPYKAIDEKCKKTCK-SVATISSFVDVDFDQSNPNNDTALMAAL-QMGPVSIAIEADQP 383 SYPY+A D CK + +VA + FVD+ P + ALM A+ +GP+S+A++A P Sbjct: 201 SYPYEAKDGSCKYRAEFAVANDTGFVDI------PQQEKALMKAVATVGPISVAMDASHP 254 Query: 382 IFQMYTGGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQAWGIESGYVR 215 Q Y+ G+ P+C + NLDHGVLLVGY GTDS YW+VKNSWG WG+E GY+ Sbjct: 255 SLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGME-GYIE 313 Query: 214 LARGQ-NECGMNSAASYPVV 158 +A+ + N CG+ +AASYPVV Sbjct: 314 IAKDRDNHCGLATAASYPVV 333 >gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata] Length = 330 Score = 270 bits (689), Expect = 1e-69 Identities = 149/312 (47%), Positives = 196/312 (62%), Gaps = 8/312 (2%) Frame = -1 Query: 1069 EFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGI---TSTMGVNAFTDMTSVE 899 E++ F K Y+K Y ++ R ++++NLD I LHN A T +G+N + DMT+ E Sbjct: 26 EWNIFKKQYNKLYQNEEEARRRLVWESNLDFITLHNLAADRGEHTFWVGMNEYGDMTNEE 85 Query: 898 FASKMNGYXXXXXXXXXXXXXA-NTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWSFSATG 722 F MNGY N M D+P + DWR KG VTP+K+Q QCGSCWSFSATG Sbjct: 86 FTKTMNGYRMRNKTSNAPVFMPPNNMGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSATG 145 Query: 721 SIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEASYPYK 542 S+EG F G L SLSEQ L+DCS +G+ CEGGLMDDAF ++ N GI +EASYPYK Sbjct: 146 SLEGQTFKKTGKLVSLSEQNLVDCSKKQGNHGCEGGLMDDAFTYIKANNGIDTEASYPYK 205 Query: 541 AIDEKCK-KTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEADQPIFQMYT 365 A D KC+ K+ AT + FVD+ + + A +GP+S+AI+A FQ+Y Sbjct: 206 ARDGKCEFKSADVGATDTGFVDI----KTKDEEALKQAVATVGPISVAIDASHMSFQLYR 261 Query: 364 GGVITGPSCG-TNLDHGVLLVGYGT-DSKLGDYWIVKNSWGQAWGIESGYVRLARG-QNE 194 GV C T LDHGVL VGYGT DSK DYW+VKNSWG++WG + GY++++R +N Sbjct: 262 TGVYHDWFCSQTKLDHGVLAVGYGTEDSK--DYWLVKNSWGESWG-QKGYIQMSRNRRNN 318 Query: 193 CGMNSAASYPVV 158 CG+ ++ASYP V Sbjct: 319 CGIATSASYPTV 330 >ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1] gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1] Length = 294 Score = 269 bits (688), Expect = 2e-69 Identities = 157/309 (50%), Positives = 192/309 (62%), Gaps = 9/309 (2%) Frame = -1 Query: 1057 FVKTYSKKYATDDF-FNRYEIFKANLDLIELHNSQ--AGITS-TMGVNAFTDMTSVEFAS 890 F YSK Y ++ R F+ANL+ I HN++ G+ S T+GVN F D+T EF + Sbjct: 1 FKSDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMA 60 Query: 889 KMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWSFSATGSIEG 710 T +D S DWRTKGAVTP+K+Q QCGSCWSFS TGS EG Sbjct: 61 LYVPSKFNRTMPYNTVYLPATSED---SVDWRTKGAVTPIKNQGQCGSCWSFSTTGSTEG 117 Query: 709 AWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEASYPYKAIDE 530 A +A GNL SLSEQQL+DCS G+ C GGLMDDAF+++I NKG+ +E YPY A D Sbjct: 118 AHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPYTAQDG 177 Query: 529 KC--KKTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEADQPIFQMYTGGV 356 C +K K ATISS+ DV NN+ L AA+ GPVS+AIEADQ FQ+Y GV Sbjct: 178 TCNKEKEAKHAATISSYSDVP-----KNNEDQLAAAVAKGPVSVAIEADQSGFQLYKSGV 232 Query: 355 ITGPSCGTNLDHGVLLVGYGTDSKLGDYWIVKNSWGQAWGIESGYVRLARGQNE---CGM 185 G +CGTNLDHGVL+VGY TD DYWIVKNSWG WG+E GY+ + RG + CG+ Sbjct: 233 FDG-NCGTNLDHGVLVVGY-TD----DYWIVKNSWGTTWGVE-GYINMKRGVSASGICGI 285 Query: 184 NSAASYPVV 158 SYP+V Sbjct: 286 AMQPSYPIV 294 >dbj|BAE38161.1| unnamed protein product [Mus musculus] Length = 334 Score = 269 bits (687), Expect = 2e-69 Identities = 145/320 (45%), Positives = 203/320 (63%), Gaps = 10/320 (3%) Frame = -1 Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITS---TMGVNAFT 917 ++ + E+ ++ T+ + Y T++ R I++ N+ +I+LHN + +M +NAF Sbjct: 22 DQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81 Query: 916 DMTSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWS 737 DMT+ EF +NGY +K +P S DWR KG VTPVK+Q QCGSCW+ Sbjct: 82 DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-IPKSVDWREKGCVTPVKNQGQCGSCWA 140 Query: 736 FSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEA 557 FSA+G +EG FL G L SLSEQ L+DCSH +G+ C GGLMD AFQ++ +N G+ SE Sbjct: 141 FSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEE 200 Query: 556 SYPYKAIDEKCKKTCK-SVATISSFVDVDFDQSNPNNDTALMAAL-QMGPVSIAIEADQP 383 SYPY+A D CK + +VA + FVD+ P + ALM A+ +GP+S+A++A P Sbjct: 201 SYPYEAKDGSCKYRAEFAVANDTGFVDI------PQQEKALMKAVATVGPISVAMDASHP 254 Query: 382 IFQMYTGGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQAWGIESGYVR 215 Q Y+ G+ P+C + NLDHGVLLVGY GTDS YW+VKNSWG WG+E GY++ Sbjct: 255 SLQFYSLGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGME-GYIK 313 Query: 214 LARGQ-NECGMNSAASYPVV 158 +A+ + N CG+ +AASYPVV Sbjct: 314 IAKDRDNHCGLATAASYPVV 333 >dbj|BAE22939.1| unnamed protein product [Mus musculus] Length = 308 Score = 269 bits (687), Expect = 2e-69 Identities = 145/314 (46%), Positives = 200/314 (63%), Gaps = 10/314 (3%) Frame = -1 Query: 1069 EFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITS---TMGVNAFTDMTSVE 899 E+ ++ T+ + Y T++ R I++ N+ +I+LHN + +M +NAF DMT+ E Sbjct: 2 EWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEE 61 Query: 898 FASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWSFSATGS 719 F +NGY +K +P S DWR KG VTPVK+Q QCGSCW+FSA+G Sbjct: 62 FRQVVNGYRHQKHKKGRLFQEPLMLK-IPKSVDWREKGCVTPVKNQGQCGSCWAFSASGC 120 Query: 718 IEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEASYPYKA 539 +EG FL G L SLSEQ L+DCSH +G+ C GGLMD AFQ++ +N G+ SE SYPY+A Sbjct: 121 LEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEA 180 Query: 538 IDEKCKKTCK-SVATISSFVDVDFDQSNPNNDTALMAAL-QMGPVSIAIEADQPIFQMYT 365 D CK + +VA + FVD+ P + ALM A+ +GP+S+A++A P Q Y+ Sbjct: 181 KDGSCKYRAEFAVANDTGFVDI------PQQEKALMKAVATVGPISVAMDASHPSLQFYS 234 Query: 364 GGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQAWGIESGYVRLARGQ- 200 G+ P+C + NLDHGVLLVGY GTDS YW+VKNSWG WG+E GY+++A+ + Sbjct: 235 SGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGME-GYIKIAKDRD 293 Query: 199 NECGMNSAASYPVV 158 N CG+ +AASYPVV Sbjct: 294 NHCGLATAASYPVV 307 >dbj|BAE31977.1| unnamed protein product [Mus musculus] Length = 334 Score = 269 bits (687), Expect = 2e-69 Identities = 144/320 (45%), Positives = 203/320 (63%), Gaps = 10/320 (3%) Frame = -1 Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITS---TMGVNAFT 917 ++ + E+ ++ T+ + Y T++ R I++ N+ +I+LHN + +M +NAF Sbjct: 22 DQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81 Query: 916 DMTSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWS 737 DMT+ EF +NGY +K +P S DWR KG VTPVK++ QCGSCW+ Sbjct: 82 DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-IPKSVDWREKGCVTPVKNKGQCGSCWA 140 Query: 736 FSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEA 557 FSA+G +EG FL G L SLSEQ L+DCSH +G+ C GGLMD AFQ++ +N G+ SE Sbjct: 141 FSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEE 200 Query: 556 SYPYKAIDEKCKKTCK-SVATISSFVDVDFDQSNPNNDTALMAAL-QMGPVSIAIEADQP 383 SYPY+A D CK + +VA + FVD+ P + ALM A+ +GP+S+A++A P Sbjct: 201 SYPYEAKDGSCKYRAEFAVANDTGFVDI------PQQEKALMKAVATVGPISVAMDASHP 254 Query: 382 IFQMYTGGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQAWGIESGYVR 215 Q Y+ G+ P+C + NLDHGVLLVGY GTDS YW+VKNSWG WG+E GY++ Sbjct: 255 SLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGME-GYIK 313 Query: 214 LARGQ-NECGMNSAASYPVV 158 +A+ + N CG+ +AASYPVV Sbjct: 314 IAKDRDNHCGLATAASYPVV 333 >gb|ABY58967.1| cathepsin L [Toxoplasma gondii] Length = 421 Score = 269 bits (687), Expect = 2e-69 Identities = 154/321 (47%), Positives = 197/321 (61%), Gaps = 11/321 (3%) Frame = -1 Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNR-YEIFKANLDLIELHNSQAGITSTMGVNAFTDM 911 E +Q FS F Y+K YAT++ R Y IFK NL I HN Q G + ++ +N F D+ Sbjct: 109 EAHFQDAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQ-GYSYSLKMNHFGDL 167 Query: 910 TSVEFASKMNGYXXXXXXXXXXXXXANTMKDV-----PTSWDWRTKGAVTPVKDQAQCGS 746 + EF K G+ A + +V P DWR++G VTPVKDQ CGS Sbjct: 168 SRDEFRRKYLGFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGS 227 Query: 745 CWSFSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGIC 566 CW+FS TG++EGA G L SLSEQ+LMDCS EG+ SC GG M+DAFQ+V+D+ GIC Sbjct: 228 CWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGIC 287 Query: 565 SEASYPYKAIDEKCK-KTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEAD 389 SE +YPY A DE+C+ ++C+ V I F DV ++ A+ AAL PVSIAIEAD Sbjct: 288 SEDAYPYLARDEECRAQSCEKVVKILGFKDVP-----RRSEAAMKAALAKSPVSIAIEAD 342 Query: 388 QPIFQMYTGGVITGPSCGTNLDHGVLLVGYGTDSK-LGDYWIVKNSWGQAWGIESGYVRL 212 Q FQ Y GV SCGT+LDHGVLLVGYGTD + D+WI+KNSWG WG GY+ + Sbjct: 343 QMPFQFYHEGVFDA-SCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWG-RDGYMYM 400 Query: 211 ARGQNE---CGMNSAASYPVV 158 A + E CG+ AS+PV+ Sbjct: 401 AMHKGEEGQCGLLLDASFPVM 421 >ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49] gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii] gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii] gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH] gi|523570907|gb|EPR57821.1| cathepsin CPL [Toxoplasma gondii GT1] gi|527315630|gb|EPT32244.1| cathepsin CPL [Toxoplasma gondii ME49] gi|557733437|gb|ESS29589.1| cathepsin CPL [Toxoplasma gondii VEG] Length = 422 Score = 269 bits (687), Expect = 2e-69 Identities = 154/321 (47%), Positives = 197/321 (61%), Gaps = 11/321 (3%) Frame = -1 Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNR-YEIFKANLDLIELHNSQAGITSTMGVNAFTDM 911 E +Q FS F Y+K YAT++ R Y IFK NL I HN Q G + ++ +N F D+ Sbjct: 110 EAHFQDAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQ-GYSYSLKMNHFGDL 168 Query: 910 TSVEFASKMNGYXXXXXXXXXXXXXANTMKDV-----PTSWDWRTKGAVTPVKDQAQCGS 746 + EF K G+ A + +V P DWR++G VTPVKDQ CGS Sbjct: 169 SRDEFRRKYLGFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGS 228 Query: 745 CWSFSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGIC 566 CW+FS TG++EGA G L SLSEQ+LMDCS EG+ SC GG M+DAFQ+V+D+ GIC Sbjct: 229 CWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGIC 288 Query: 565 SEASYPYKAIDEKCK-KTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEAD 389 SE +YPY A DE+C+ ++C+ V I F DV ++ A+ AAL PVSIAIEAD Sbjct: 289 SEDAYPYLARDEECRAQSCEKVVKILGFKDVP-----RRSEAAMKAALAKSPVSIAIEAD 343 Query: 388 QPIFQMYTGGVITGPSCGTNLDHGVLLVGYGTDSK-LGDYWIVKNSWGQAWGIESGYVRL 212 Q FQ Y GV SCGT+LDHGVLLVGYGTD + D+WI+KNSWG WG GY+ + Sbjct: 344 QMPFQFYHEGVFDA-SCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWG-RDGYMYM 401 Query: 211 ARGQNE---CGMNSAASYPVV 158 A + E CG+ AS+PV+ Sbjct: 402 AMHKGEEGQCGLLLDASFPVM 422 >gb|EJY65772.1| Cathepsin L [Oxytricha trifallax] Length = 338 Score = 268 bits (685), Expect = 3e-69 Identities = 144/305 (47%), Positives = 193/305 (63%), Gaps = 2/305 (0%) Frame = -1 Query: 1066 FSRFVKTYSKKYATDDFFN-RYEIFKANLDLIELHNSQAGITSTMGVNAFTDMTSVEFAS 890 F+ FV Y K Y T + ++ R ++FK NL + ++N++ +T +G+N F D T E+ Sbjct: 43 FTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNARNDVTYRLGLNKFADYTEAEY-K 101 Query: 889 KMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWSFSATGSIEG 710 ++ G+ K+ +W +GAVTPVKDQ QCGSCWSFSATG++EG Sbjct: 102 RLLGFGGQKNKNPRNIKVLGAPKN--DGVNWVEQGAVTPVKDQGQCGSCWSFSATGAMEG 159 Query: 709 AWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEASYPYKAIDE 530 + G L SLSEQQL+DCS EG++ C GG MD AFQ+V + + +E YPY+A+D+ Sbjct: 160 HAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWMDQAFQYV-EQTALETEDQYPYEAVDD 218 Query: 529 KCKKTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEADQPIFQMYTGGVIT 350 C+ + V + SFVDV PNN L AAL GPVS+AIEADQ +FQ Y+GGVI Sbjct: 219 TCRASSAGVVKVDSFVDV-----TPNNVNELKAALDKGPVSVAIEADQMVFQFYSGGVIN 273 Query: 349 GPSCGTNLDHGVLLVGYGTDSKLGDYWIVKNSWGQAWGIESGYVRLARG-QNECGMNSAA 173 SCGT LDHGVL VGYG +S DY++VKNSWG +WG E GYV++A N CG+ S A Sbjct: 274 DASCGTTLDHGVLAVGYGNESG-QDYFLVKNSWGASWG-EEGYVKIAASPDNICGILSQA 331 Query: 172 SYPVV 158 SYP++ Sbjct: 332 SYPIM 336