BLASTX nr result

ID: Mentha22_contig00029012 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00029012
         (1198 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]       318   3e-84
gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar] ...   316   1e-83
gb|ACX43963.1| cysteine protease 3, partial [Brachiaria hybrid c...   292   2e-76
gb|AHW80377.1| cathepsin L protease [Cerebratulus lacteus]            277   7e-72
ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liver...   277   7e-72
gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus] gi|4887000|g...   270   7e-70
dbj|BAE27712.1| unnamed protein product [Mus musculus]                270   7e-70
ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus] gi|56...   270   7e-70
dbj|BAB27719.1| unnamed protein product [Mus musculus]                270   7e-70
dbj|BAE36450.1| unnamed protein product [Mus musculus]                270   9e-70
ref|XP_006872648.1| PREDICTED: cathepsin L1-like [Chrysochloris ...   270   1e-69
dbj|BAE35627.1| unnamed protein product [Mus musculus]                270   1e-69
gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]       270   1e-69
ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis M...   269   2e-69
dbj|BAE38161.1| unnamed protein product [Mus musculus]                269   2e-69
dbj|BAE22939.1| unnamed protein product [Mus musculus]                269   2e-69
dbj|BAE31977.1| unnamed protein product [Mus musculus]                269   2e-69
gb|ABY58967.1| cathepsin L [Toxoplasma gondii]                        269   2e-69
ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [...   269   2e-69
gb|EJY65772.1| Cathepsin L [Oxytricha trifallax]                      268   3e-69

>gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  318 bits (815), Expect = 3e-84
 Identities = 163/312 (52%), Positives = 207/312 (66%), Gaps = 1/312 (0%)
 Frame = -1

Query: 1090 TERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITSTMGVNAFTDM 911
            +E   Q  F+ F+K YSK Y+  +F +R+  FKAN++ I LHN+ A  + TMG+N F D+
Sbjct: 34   SEVMLQDMFTAFMKQYSKAYSHAEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADL 93

Query: 910  TSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWSFS 731
            +  EF  K  GY                ++  PTS DWRT  AVTP+KDQ QCGSCW+FS
Sbjct: 94   SFEEFKGKYFGYKHVEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFS 153

Query: 730  ATGSIEGAWFL-AKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEAS 554
            ATGSIEGAW L  K  LTSLSEQQL+DCS   GD  C GGLMD AF+++I NKGIC+E++
Sbjct: 154  ATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYAFEYIIANKGICAESA 213

Query: 553  YPYKAIDEKCKKTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEADQPIFQ 374
            YPYK +   C+K+C  V TIS + DV    ++ +  + L A   +GPVS+AIEADQ  FQ
Sbjct: 214  YPYKGVGGLCQKSCTKVVTISGYKDV----ASGDEASLLNAVGTVGPVSVAIEADQAGFQ 269

Query: 373  MYTGGVITGPSCGTNLDHGVLLVGYGTDSKLGDYWIVKNSWGQAWGIESGYVRLARGQNE 194
             Y+ GV +G +CG NLDHGVL VGYGT     DYWIVKNSWG +WG ESGY+R+ R +N+
Sbjct: 270  FYSSGVFSG-TCGHNLDHGVLAVGYGTTGS-QDYWIVKNSWGTSWG-ESGYIRMIRNKNQ 326

Query: 193  CGMNSAASYPVV 158
            CG+    SYP V
Sbjct: 327  CGIAIQPSYPTV 338


>gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
            gi|260516656|gb|ACX43955.1| cysteine protease 1
            [Brachiaria hybrid cultivar] gi|260516658|gb|ACX43956.1|
            cysteine protease 1 [Brachiaria hybrid cultivar]
            gi|260516660|gb|ACX43957.1| cysteine protease 1
            [Brachiaria hybrid cultivar] gi|260516662|gb|ACX43958.1|
            cysteine protease 2 [Brachiaria hybrid cultivar]
            gi|260516664|gb|ACX43959.1| cysteine protease 2
            [Brachiaria hybrid cultivar] gi|260516666|gb|ACX43960.1|
            cysteine protease 2 [Brachiaria hybrid cultivar]
            gi|260516668|gb|ACX43961.1| cysteine protease 2
            [Brachiaria hybrid cultivar] gi|260516670|gb|ACX43962.1|
            cysteine protease 2 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  316 bits (810), Expect = 1e-83
 Identities = 162/312 (51%), Positives = 207/312 (66%), Gaps = 1/312 (0%)
 Frame = -1

Query: 1090 TERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITSTMGVNAFTDM 911
            +E   Q  F+ F+K YSK Y+  +F +R+  FKAN++ I LHN+ A  + TMG+N F D+
Sbjct: 34   SEVMLQDMFTAFMKQYSKAYSHAEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADL 93

Query: 910  TSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWSFS 731
            +  EF  K  GY                ++  PTS DWRT  AVTP+KDQ QCGSCW+FS
Sbjct: 94   SFEEFKGKYFGYKHVEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFS 153

Query: 730  ATGSIEGAWFL-AKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEAS 554
            ATGSIEGAW L  K  LTSLSEQQL+DCS   G+  C GGLMD AF+++I NKGIC+E++
Sbjct: 154  ATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAESA 213

Query: 553  YPYKAIDEKCKKTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEADQPIFQ 374
            YPYK +   C+K+C  V TIS + DV    ++ +  + L A   +GPVS+AIEADQ  FQ
Sbjct: 214  YPYKGVGGLCQKSCTKVVTISGYKDV----ASGDEASLLNAVGTVGPVSVAIEADQAGFQ 269

Query: 373  MYTGGVITGPSCGTNLDHGVLLVGYGTDSKLGDYWIVKNSWGQAWGIESGYVRLARGQNE 194
             Y+ GV +G +CG NLDHGVL VGYGT     DYWIVKNSWG +WG ESGY+R+ R +N+
Sbjct: 270  FYSSGVFSG-TCGHNLDHGVLAVGYGTTGS-QDYWIVKNSWGTSWG-ESGYIRMIRNKNQ 326

Query: 193  CGMNSAASYPVV 158
            CG+    SYP V
Sbjct: 327  CGIAIQPSYPTV 338


>gb|ACX43963.1| cysteine protease 3, partial [Brachiaria hybrid cultivar]
          Length = 319

 Score =  292 bits (747), Expect = 2e-76
 Identities = 152/293 (51%), Positives = 194/293 (66%), Gaps = 1/293 (0%)
 Frame = -1

Query: 1090 TERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITSTMGVNAFTDM 911
            +E   Q  F+ F+K YSK Y+  +F +R+  FKA+++ I LHN+ A  + TMG+N F D+
Sbjct: 34   SEVMLQDMFTAFMKQYSKAYSHAEFSSRFNQFKASVETIRLHNTLANASYTMGLNEFADL 93

Query: 910  TSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWSFS 731
            +  EF  K  G                 ++  PTS DWRT  AVTP+KDQ QCGSCW+FS
Sbjct: 94   SFEEFKGKYFGCKHVEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFS 153

Query: 730  ATGSIEGAWFL-AKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEAS 554
            ATGSIEGAW L  K  LTSLSEQQL+DCS   G+  C GGLMD AF+++I NKGIC+E++
Sbjct: 154  ATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAESA 213

Query: 553  YPYKAIDEKCKKTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEADQPIFQ 374
            YPYK +   C+K+C  V TIS   DV    ++ +  ++L A   +GPVS+AIEADQ  FQ
Sbjct: 214  YPYKGVGGLCQKSCTKVVTISGHKDV----ASGDEASSLNAVGTVGPVSVAIEADQAGFQ 269

Query: 373  MYTGGVITGPSCGTNLDHGVLLVGYGTDSKLGDYWIVKNSWGQAWGIESGYVR 215
             Y+ GV +G +CG NLDHGVL VGYGT     DYWIVKNSWG +WG ESGY+R
Sbjct: 270  FYSSGVFSG-TCGHNLDHGVLAVGYGTTGS-QDYWIVKNSWGTSWG-ESGYIR 319


>gb|AHW80377.1| cathepsin L protease [Cerebratulus lacteus]
          Length = 327

 Score =  277 bits (708), Expect = 7e-72
 Identities = 151/311 (48%), Positives = 202/311 (64%), Gaps = 7/311 (2%)
 Frame = -1

Query: 1069 EFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGI---TSTMGVNAFTDMTSVE 899
            E++ F  T+ K Y  ++   R  IF  NL +I+ HN +A +   T  MGVN ++D TS E
Sbjct: 25   EWTNFKATHVKTYEVEEELVRKAIFHNNLRIIQKHNVEADLGQHTYWMGVNEYSDWTSTE 84

Query: 898  FASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWSFSATGS 719
            F + MNGY             A+ +K VP S DWR KG VTPVK+Q QCGSCW+FSATGS
Sbjct: 85   FRNYMNGYKRSNVTSGSTFMPASYIK-VPASVDWRPKGYVTPVKNQGQCGSCWAFSATGS 143

Query: 718  IEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEASYPYKA 539
            +EG  F   GNL SLSEQ L+DCS   G+  CEGGLMD AF+++  N GI +E  YPYK 
Sbjct: 144  LEGQHFKKTGNLVSLSEQNLVDCSKSYGNMGCEGGLMDSAFKYIKANHGIDTEKCYPYKH 203

Query: 538  IDEKC--KKTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEADQPIFQMYT 365
            +DE+C  KK+C   AT++ +VDV       + +  L A+  +GP+S+AI+A    FQMY+
Sbjct: 204  VDERCHFKKSCIG-ATVTGYVDV----KQMDENALLQASATIGPISVAIDAGHQSFQMYS 258

Query: 364  GGVITGPSCG-TNLDHGVLLVGYGTDSKLGDYWIVKNSWGQAWGIESGYVRLARGQ-NEC 191
             GV   P C  T LDHGVL+VGYGT+S   DYW+VKNSWG  WG ++GY+ ++R + N+C
Sbjct: 259  HGVYNEPRCSQTQLDHGVLVVGYGTESG-QDYWLVKNSWGATWG-QNGYIMMSRNKNNQC 316

Query: 190  GMNSAASYPVV 158
            G+ ++ASYP+V
Sbjct: 317  GIATSASYPLV 327


>ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
            gi|325114397|emb|CBZ49954.1| cathepsin L, related
            [Neospora caninum Liverpool]
          Length = 415

 Score =  277 bits (708), Expect = 7e-72
 Identities = 153/305 (50%), Positives = 188/305 (61%), Gaps = 7/305 (2%)
 Frame = -1

Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFN-RYEIFKANLDLIELHNSQAGITSTMGVNAFTDM 911
            E  +Q  F  F  TY K YAT++    RY IFK NL  I  HN Q G + ++ +N F D+
Sbjct: 112  EEHFQNAFGSFRATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQ-GYSYSLKMNHFGDL 170

Query: 910  TSVEFASKMNGYXXXXXXXXXXXXXANTM-----KDVPTSWDWRTKGAVTPVKDQAQCGS 746
            +  EF  K  GY             A  +      DVP++ DWR KG VTPVKDQ  CGS
Sbjct: 171  SREEFRRKYLGYNKSRNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVTPVKDQRDCGS 230

Query: 745  CWSFSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGIC 566
            CW+FSATG++EGA     G L SLSEQ+L+DCS  EG+  C GG M+DAFQ+V+D+ G+C
Sbjct: 231  CWAFSATGALEGAHCAKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLC 290

Query: 565  SEASYPYKAIDEKCKKTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEADQ 386
            SE  YPY A D +CK+ CK V TIS F DV        ++TA+ AAL   PVSIAIEADQ
Sbjct: 291  SEEGYPYLARDGECKRACKKVVTISGFKDVP-----RKSETAMKAALAHSPVSIAIEADQ 345

Query: 385  PIFQMYTGGVITGPSCGTNLDHGVLLVGYGTDSKL-GDYWIVKNSWGQAWGIESGYVRLA 209
              FQ Y  GV    SCGT+LDHGVLLVGYGTD +   D+WI+KNSWG  WG   GY+ +A
Sbjct: 346  LPFQFYHEGVFDA-SCGTDLDHGVLLVGYGTDKETKKDFWIMKNSWGSGWG-RDGYMYMA 403

Query: 208  RGQNE 194
              + E
Sbjct: 404  MHKGE 408


>gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus] gi|4887000|gb|AAD32137.1|AF121838_1
            cathepsin L [Mus musculus]
            gi|4887002|gb|AAD32138.1|AF121839_1 cathepsin L [Mus
            musculus] gi|200501|gb|AAA39984.1| preprocathepsin L
            precursor [Mus musculus]
          Length = 334

 Score =  270 bits (691), Expect = 7e-70
 Identities = 145/320 (45%), Positives = 203/320 (63%), Gaps = 10/320 (3%)
 Frame = -1

Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITS---TMGVNAFT 917
            ++ +  E+ ++  T+ + Y T++   R  I++ N+ +I+LHN +        +M +NAF 
Sbjct: 22   DQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRIIQLHNGEYSNGQHGFSMEMNAFG 81

Query: 916  DMTSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWS 737
            DMT+ EF   +NGY                +K +P S DWR KG VTPVK+Q QCGSCW+
Sbjct: 82   DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-IPKSVDWREKGCVTPVKNQGQCGSCWA 140

Query: 736  FSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEA 557
            FSA+G +EG  FL  G L SLSEQ L+DCSH +G+  C GGLMD AFQ++ +N G+ SE 
Sbjct: 141  FSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEE 200

Query: 556  SYPYKAIDEKCKKTCK-SVATISSFVDVDFDQSNPNNDTALMAAL-QMGPVSIAIEADQP 383
            SYPY+A D  CK   + +VA  + FVD+      P  + ALM A+  +GP+S+A++A  P
Sbjct: 201  SYPYEAKDGSCKYRAEFAVANDTGFVDI------PQQEKALMKAVATVGPISVAMDASHP 254

Query: 382  IFQMYTGGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQAWGIESGYVR 215
              Q Y+ G+   P+C + NLDHGVLLVGY   GTDS    YW+VKNSWG  WG+E GY++
Sbjct: 255  SLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGME-GYIK 313

Query: 214  LARGQ-NECGMNSAASYPVV 158
            +A+ + N CG+ +AASYPVV
Sbjct: 314  IAKDRDNHCGLATAASYPVV 333


>dbj|BAE27712.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  270 bits (691), Expect = 7e-70
 Identities = 145/320 (45%), Positives = 203/320 (63%), Gaps = 10/320 (3%)
 Frame = -1

Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITS---TMGVNAFT 917
            ++ +  E+ ++  T+ + Y T++   R  I++ N+ +I+LHN +        +M +NAF 
Sbjct: 22   DQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81

Query: 916  DMTSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWS 737
            DMT+ EF   +NGY                +K +P S DWR KG VTPVK+Q QCGSCW+
Sbjct: 82   DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-IPKSVDWREKGCVTPVKNQGQCGSCWA 140

Query: 736  FSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEA 557
            FSA+G +EG  FL  G L SLSEQ L+DCSH +G+  C GGLMD AFQ++ +N G+ SE 
Sbjct: 141  FSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEE 200

Query: 556  SYPYKAIDEKCKKTCK-SVATISSFVDVDFDQSNPNNDTALMAAL-QMGPVSIAIEADQP 383
            SYPY+A D  CK   + +VA  + FVD+      P  + ALM A+  +GP+S+A++A  P
Sbjct: 201  SYPYEAKDGSCKYRAEFAVANDTGFVDI------PQQEEALMKAVATVGPISVAMDASHP 254

Query: 382  IFQMYTGGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQAWGIESGYVR 215
              Q Y+ G+   P+C + NLDHGVLLVGY   GTDS    YW+VKNSWG  WG+E GY++
Sbjct: 255  SLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGME-GYIK 313

Query: 214  LARGQ-NECGMNSAASYPVV 158
            +A+ + N CG+ +AASYPVV
Sbjct: 314  IAKDRDNHCGLATAASYPVV 333


>ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus]
            gi|568983003|ref|XP_006517143.1| PREDICTED: cathepsin L1
            isoform X1 [Mus musculus]
            gi|568983005|ref|XP_006517144.1| PREDICTED: cathepsin L1
            isoform X2 [Mus musculus]
            gi|115742|sp|P06797.2|CATL1_MOUSE RecName: Full=Cathepsin
            L1; AltName: Full=Cathepsin L; AltName: Full=Major
            excreted protein; Short=MEP; AltName: Full=p39 cysteine
            proteinase; Contains: RecName: Full=Cathepsin L1 heavy
            chain; Contains: RecName: Full=Cathepsin L1 light chain;
            Flags: Precursor gi|53047|emb|CAA29470.1| unnamed protein
            product [Mus musculus] gi|309186|gb|AAA37445.1|
            preprocysteine proteinase [Mus musculus]
            gi|12832050|dbj|BAB21945.1| unnamed protein product [Mus
            musculus] gi|26340196|dbj|BAC33761.1| unnamed protein
            product [Mus musculus] gi|45768760|gb|AAH68163.1|
            Cathepsin L [Mus musculus] gi|74139700|dbj|BAE31701.1|
            unnamed protein product [Mus musculus]
            gi|74146632|dbj|BAE41323.1| unnamed protein product [Mus
            musculus] gi|74151584|dbj|BAE41141.1| unnamed protein
            product [Mus musculus] gi|74185397|dbj|BAE30172.1|
            unnamed protein product [Mus musculus]
            gi|74197196|dbj|BAE35143.1| unnamed protein product [Mus
            musculus] gi|74203006|dbj|BAE26206.1| unnamed protein
            product [Mus musculus] gi|74219606|dbj|BAE29572.1|
            unnamed protein product [Mus musculus]
            gi|148684295|gb|EDL16242.1| cathepsin L [Mus musculus]
          Length = 334

 Score =  270 bits (691), Expect = 7e-70
 Identities = 145/320 (45%), Positives = 203/320 (63%), Gaps = 10/320 (3%)
 Frame = -1

Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITS---TMGVNAFT 917
            ++ +  E+ ++  T+ + Y T++   R  I++ N+ +I+LHN +        +M +NAF 
Sbjct: 22   DQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81

Query: 916  DMTSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWS 737
            DMT+ EF   +NGY                +K +P S DWR KG VTPVK+Q QCGSCW+
Sbjct: 82   DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-IPKSVDWREKGCVTPVKNQGQCGSCWA 140

Query: 736  FSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEA 557
            FSA+G +EG  FL  G L SLSEQ L+DCSH +G+  C GGLMD AFQ++ +N G+ SE 
Sbjct: 141  FSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEE 200

Query: 556  SYPYKAIDEKCKKTCK-SVATISSFVDVDFDQSNPNNDTALMAAL-QMGPVSIAIEADQP 383
            SYPY+A D  CK   + +VA  + FVD+      P  + ALM A+  +GP+S+A++A  P
Sbjct: 201  SYPYEAKDGSCKYRAEFAVANDTGFVDI------PQQEKALMKAVATVGPISVAMDASHP 254

Query: 382  IFQMYTGGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQAWGIESGYVR 215
              Q Y+ G+   P+C + NLDHGVLLVGY   GTDS    YW+VKNSWG  WG+E GY++
Sbjct: 255  SLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGME-GYIK 313

Query: 214  LARGQ-NECGMNSAASYPVV 158
            +A+ + N CG+ +AASYPVV
Sbjct: 314  IAKDRDNHCGLATAASYPVV 333


>dbj|BAB27719.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  270 bits (691), Expect = 7e-70
 Identities = 145/320 (45%), Positives = 203/320 (63%), Gaps = 10/320 (3%)
 Frame = -1

Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITS---TMGVNAFT 917
            ++ +  E+ ++  T+ + Y T++   R  I++ N+ +I+LHN +        +M +NAF 
Sbjct: 22   DQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81

Query: 916  DMTSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWS 737
            DMT+ EF   +NGY                +K +P S DWR KG VTPVK+Q QCGSCW+
Sbjct: 82   DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-IPKSVDWREKGCVTPVKNQGQCGSCWA 140

Query: 736  FSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEA 557
            FSA+G +EG  FL  G L SLSEQ L+DCSH +G+  C GGLMD AFQ++ +N G+ SE 
Sbjct: 141  FSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDYAFQYIKENGGLDSEE 200

Query: 556  SYPYKAIDEKCKKTCK-SVATISSFVDVDFDQSNPNNDTALMAAL-QMGPVSIAIEADQP 383
            SYPY+A D  CK   + +VA  + FVD+      P  + ALM A+  +GP+S+A++A  P
Sbjct: 201  SYPYEAKDGSCKYRAEFAVANDTGFVDI------PQQEKALMKAVATVGPISVAMDASHP 254

Query: 382  IFQMYTGGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQAWGIESGYVR 215
              Q Y+ G+   P+C + NLDHGVLLVGY   GTDS    YW+VKNSWG  WG+E GY++
Sbjct: 255  SLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGME-GYIK 313

Query: 214  LARGQ-NECGMNSAASYPVV 158
            +A+ + N CG+ +AASYPVV
Sbjct: 314  IAKDRDNHCGLATAASYPVV 333


>dbj|BAE36450.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  270 bits (690), Expect = 9e-70
 Identities = 145/320 (45%), Positives = 203/320 (63%), Gaps = 10/320 (3%)
 Frame = -1

Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITS---TMGVNAFT 917
            ++ +  E+ ++  T+ + Y T++   R  I++ N+ +I+LHN +        +M +NAF 
Sbjct: 22   DQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81

Query: 916  DMTSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWS 737
            DMT+ EF   +NGY                +K +P S DWR KG VTPVK+Q QCGSCW+
Sbjct: 82   DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-IPKSVDWREKGCVTPVKNQGQCGSCWA 140

Query: 736  FSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEA 557
            FSA+G +EG  FL  G L SLSEQ L+DCSH +G+  C GGLMD AFQ++ +N G+ SE 
Sbjct: 141  FSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEE 200

Query: 556  SYPYKAIDEKCKKTCK-SVATISSFVDVDFDQSNPNNDTALMAAL-QMGPVSIAIEADQP 383
            SYPY+A D  CK   + +VA  + FVD+      P  + ALM A+  +GP+S+A++A  P
Sbjct: 201  SYPYEAKDGSCKYRAEFAVANGTGFVDI------PQQEKALMKAVATVGPISVAMDASHP 254

Query: 382  IFQMYTGGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQAWGIESGYVR 215
              Q Y+ G+   P+C + NLDHGVLLVGY   GTDS    YW+VKNSWG  WG+E GY++
Sbjct: 255  SLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGME-GYIK 313

Query: 214  LARGQ-NECGMNSAASYPVV 158
            +A+ + N CG+ +AASYPVV
Sbjct: 314  IAKDRDNHCGLATAASYPVV 333


>ref|XP_006872648.1| PREDICTED: cathepsin L1-like [Chrysochloris asiatica]
          Length = 334

 Score =  270 bits (689), Expect = 1e-69
 Identities = 156/329 (47%), Positives = 204/329 (62%), Gaps = 11/329 (3%)
 Frame = -1

Query: 1111 MANAAYVTERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHN---SQAGITS 941
            +A+AA   ++    +++++  TY + Y+TD+   R  +++ N+ +IELHN   SQ     
Sbjct: 14   IASAALEHDQNLDAQWNQWKSTYKRPYSTDEGGWRRSVWEKNMKMIELHNREYSQGKHGF 73

Query: 940  TMGVNAFTDMTSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQ 761
            TM +NAF DMT+ EF   MNG+                  +VP S DW  KG VTPVK+Q
Sbjct: 74   TMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKMFREP-VFLEVPKSVDWTQKGYVTPVKNQ 132

Query: 760  AQCGSCWSFSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVID 581
             QCGSCW+FSATG++EG  F   G L SLSEQ L+DCS  EG++ C GGLMD AFQ+V D
Sbjct: 133  GQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRSEGNNGCNGGLMDYAFQYVKD 192

Query: 580  NKGICSEASYPYKAID-EKCK-KTCKSVATISSFVDVDFDQSNPNNDTALMAAL-QMGPV 410
            N GI SEASYPY A D + C  K   SVA  + FVD+      P  + ALM A+  +GPV
Sbjct: 193  NLGIDSEASYPYLATDTQTCNYKPEYSVANDTGFVDI------PPREKALMKAVATVGPV 246

Query: 409  SIAIEADQPIFQMYTGGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQA 242
            S+AI+A    FQ Y  G+   P C + +LDHGVL+VGY   G DS    YWIVKNSWG +
Sbjct: 247  SVAIDAGHESFQFYKSGIYYEPDCSSKDLDHGVLVVGYGYEGKDSANNKYWIVKNSWGSS 306

Query: 241  WGIESGYVRLARGQ-NECGMNSAASYPVV 158
            WG  +GYV++A+ Q N CG+ +AASYP V
Sbjct: 307  WG-TNGYVKMAKDQNNHCGIATAASYPTV 334


>dbj|BAE35627.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  270 bits (689), Expect = 1e-69
 Identities = 145/320 (45%), Positives = 202/320 (63%), Gaps = 10/320 (3%)
 Frame = -1

Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITS---TMGVNAFT 917
            ++ +  E+ ++  T+ + Y T++   R  I++ N+ +I+LHN +        +M +NAF 
Sbjct: 22   DQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81

Query: 916  DMTSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWS 737
            DMT+ EF   +NGY                +K +P S DWR KG VTPVK+Q QCGSCW+
Sbjct: 82   DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-IPKSVDWREKGCVTPVKNQGQCGSCWA 140

Query: 736  FSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEA 557
            FSA+G +EG  FL  G L SLSEQ L+DCSH +G+  C GGLMD AFQ++ +N G+ SE 
Sbjct: 141  FSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEE 200

Query: 556  SYPYKAIDEKCKKTCK-SVATISSFVDVDFDQSNPNNDTALMAAL-QMGPVSIAIEADQP 383
            SYPY+A D  CK   + +VA  + FVD+      P  + ALM A+  +GP+S+A++A  P
Sbjct: 201  SYPYEAKDGSCKYRAEFAVANDTGFVDI------PQQEKALMKAVATVGPISVAMDASHP 254

Query: 382  IFQMYTGGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQAWGIESGYVR 215
              Q Y+ G+   P+C + NLDHGVLLVGY   GTDS    YW+VKNSWG  WG+E GY+ 
Sbjct: 255  SLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGME-GYIE 313

Query: 214  LARGQ-NECGMNSAASYPVV 158
            +A+ + N CG+ +AASYPVV
Sbjct: 314  IAKDRDNHCGLATAASYPVV 333


>gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
          Length = 330

 Score =  270 bits (689), Expect = 1e-69
 Identities = 149/312 (47%), Positives = 196/312 (62%), Gaps = 8/312 (2%)
 Frame = -1

Query: 1069 EFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGI---TSTMGVNAFTDMTSVE 899
            E++ F K Y+K Y  ++   R  ++++NLD I LHN  A     T  +G+N + DMT+ E
Sbjct: 26   EWNIFKKQYNKLYQNEEEARRRLVWESNLDFITLHNLAADRGEHTFWVGMNEYGDMTNEE 85

Query: 898  FASKMNGYXXXXXXXXXXXXXA-NTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWSFSATG 722
            F   MNGY               N M D+P + DWR KG VTP+K+Q QCGSCWSFSATG
Sbjct: 86   FTKTMNGYRMRNKTSNAPVFMPPNNMGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSATG 145

Query: 721  SIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEASYPYK 542
            S+EG  F   G L SLSEQ L+DCS  +G+  CEGGLMDDAF ++  N GI +EASYPYK
Sbjct: 146  SLEGQTFKKTGKLVSLSEQNLVDCSKKQGNHGCEGGLMDDAFTYIKANNGIDTEASYPYK 205

Query: 541  AIDEKCK-KTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEADQPIFQMYT 365
            A D KC+ K+    AT + FVD+       + +    A   +GP+S+AI+A    FQ+Y 
Sbjct: 206  ARDGKCEFKSADVGATDTGFVDI----KTKDEEALKQAVATVGPISVAIDASHMSFQLYR 261

Query: 364  GGVITGPSCG-TNLDHGVLLVGYGT-DSKLGDYWIVKNSWGQAWGIESGYVRLARG-QNE 194
             GV     C  T LDHGVL VGYGT DSK  DYW+VKNSWG++WG + GY++++R  +N 
Sbjct: 262  TGVYHDWFCSQTKLDHGVLAVGYGTEDSK--DYWLVKNSWGESWG-QKGYIQMSRNRRNN 318

Query: 193  CGMNSAASYPVV 158
            CG+ ++ASYP V
Sbjct: 319  CGIATSASYPTV 330


>ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
            gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga
            brevicollis MX1]
          Length = 294

 Score =  269 bits (688), Expect = 2e-69
 Identities = 157/309 (50%), Positives = 192/309 (62%), Gaps = 9/309 (2%)
 Frame = -1

Query: 1057 FVKTYSKKYATDDF-FNRYEIFKANLDLIELHNSQ--AGITS-TMGVNAFTDMTSVEFAS 890
            F   YSK Y ++     R   F+ANL+ I  HN++   G+ S T+GVN F D+T  EF +
Sbjct: 1    FKSDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMA 60

Query: 889  KMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWSFSATGSIEG 710
                                T +D   S DWRTKGAVTP+K+Q QCGSCWSFS TGS EG
Sbjct: 61   LYVPSKFNRTMPYNTVYLPATSED---SVDWRTKGAVTPIKNQGQCGSCWSFSTTGSTEG 117

Query: 709  AWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEASYPYKAIDE 530
            A  +A GNL SLSEQQL+DCS   G+  C GGLMDDAF+++I NKG+ +E  YPY A D 
Sbjct: 118  AHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPYTAQDG 177

Query: 529  KC--KKTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEADQPIFQMYTGGV 356
             C  +K  K  ATISS+ DV       NN+  L AA+  GPVS+AIEADQ  FQ+Y  GV
Sbjct: 178  TCNKEKEAKHAATISSYSDVP-----KNNEDQLAAAVAKGPVSVAIEADQSGFQLYKSGV 232

Query: 355  ITGPSCGTNLDHGVLLVGYGTDSKLGDYWIVKNSWGQAWGIESGYVRLARGQNE---CGM 185
              G +CGTNLDHGVL+VGY TD    DYWIVKNSWG  WG+E GY+ + RG +    CG+
Sbjct: 233  FDG-NCGTNLDHGVLVVGY-TD----DYWIVKNSWGTTWGVE-GYINMKRGVSASGICGI 285

Query: 184  NSAASYPVV 158
                SYP+V
Sbjct: 286  AMQPSYPIV 294


>dbj|BAE38161.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  269 bits (687), Expect = 2e-69
 Identities = 145/320 (45%), Positives = 203/320 (63%), Gaps = 10/320 (3%)
 Frame = -1

Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITS---TMGVNAFT 917
            ++ +  E+ ++  T+ + Y T++   R  I++ N+ +I+LHN +        +M +NAF 
Sbjct: 22   DQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81

Query: 916  DMTSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWS 737
            DMT+ EF   +NGY                +K +P S DWR KG VTPVK+Q QCGSCW+
Sbjct: 82   DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-IPKSVDWREKGCVTPVKNQGQCGSCWA 140

Query: 736  FSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEA 557
            FSA+G +EG  FL  G L SLSEQ L+DCSH +G+  C GGLMD AFQ++ +N G+ SE 
Sbjct: 141  FSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEE 200

Query: 556  SYPYKAIDEKCKKTCK-SVATISSFVDVDFDQSNPNNDTALMAAL-QMGPVSIAIEADQP 383
            SYPY+A D  CK   + +VA  + FVD+      P  + ALM A+  +GP+S+A++A  P
Sbjct: 201  SYPYEAKDGSCKYRAEFAVANDTGFVDI------PQQEKALMKAVATVGPISVAMDASHP 254

Query: 382  IFQMYTGGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQAWGIESGYVR 215
              Q Y+ G+   P+C + NLDHGVLLVGY   GTDS    YW+VKNSWG  WG+E GY++
Sbjct: 255  SLQFYSLGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGME-GYIK 313

Query: 214  LARGQ-NECGMNSAASYPVV 158
            +A+ + N CG+ +AASYPVV
Sbjct: 314  IAKDRDNHCGLATAASYPVV 333


>dbj|BAE22939.1| unnamed protein product [Mus musculus]
          Length = 308

 Score =  269 bits (687), Expect = 2e-69
 Identities = 145/314 (46%), Positives = 200/314 (63%), Gaps = 10/314 (3%)
 Frame = -1

Query: 1069 EFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITS---TMGVNAFTDMTSVE 899
            E+ ++  T+ + Y T++   R  I++ N+ +I+LHN +        +M +NAF DMT+ E
Sbjct: 2    EWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEE 61

Query: 898  FASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWSFSATGS 719
            F   +NGY                +K +P S DWR KG VTPVK+Q QCGSCW+FSA+G 
Sbjct: 62   FRQVVNGYRHQKHKKGRLFQEPLMLK-IPKSVDWREKGCVTPVKNQGQCGSCWAFSASGC 120

Query: 718  IEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEASYPYKA 539
            +EG  FL  G L SLSEQ L+DCSH +G+  C GGLMD AFQ++ +N G+ SE SYPY+A
Sbjct: 121  LEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEA 180

Query: 538  IDEKCKKTCK-SVATISSFVDVDFDQSNPNNDTALMAAL-QMGPVSIAIEADQPIFQMYT 365
             D  CK   + +VA  + FVD+      P  + ALM A+  +GP+S+A++A  P  Q Y+
Sbjct: 181  KDGSCKYRAEFAVANDTGFVDI------PQQEKALMKAVATVGPISVAMDASHPSLQFYS 234

Query: 364  GGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQAWGIESGYVRLARGQ- 200
             G+   P+C + NLDHGVLLVGY   GTDS    YW+VKNSWG  WG+E GY+++A+ + 
Sbjct: 235  SGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGME-GYIKIAKDRD 293

Query: 199  NECGMNSAASYPVV 158
            N CG+ +AASYPVV
Sbjct: 294  NHCGLATAASYPVV 307


>dbj|BAE31977.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  269 bits (687), Expect = 2e-69
 Identities = 144/320 (45%), Positives = 203/320 (63%), Gaps = 10/320 (3%)
 Frame = -1

Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNRYEIFKANLDLIELHNSQAGITS---TMGVNAFT 917
            ++ +  E+ ++  T+ + Y T++   R  I++ N+ +I+LHN +        +M +NAF 
Sbjct: 22   DQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81

Query: 916  DMTSVEFASKMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWS 737
            DMT+ EF   +NGY                +K +P S DWR KG VTPVK++ QCGSCW+
Sbjct: 82   DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-IPKSVDWREKGCVTPVKNKGQCGSCWA 140

Query: 736  FSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEA 557
            FSA+G +EG  FL  G L SLSEQ L+DCSH +G+  C GGLMD AFQ++ +N G+ SE 
Sbjct: 141  FSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEE 200

Query: 556  SYPYKAIDEKCKKTCK-SVATISSFVDVDFDQSNPNNDTALMAAL-QMGPVSIAIEADQP 383
            SYPY+A D  CK   + +VA  + FVD+      P  + ALM A+  +GP+S+A++A  P
Sbjct: 201  SYPYEAKDGSCKYRAEFAVANDTGFVDI------PQQEKALMKAVATVGPISVAMDASHP 254

Query: 382  IFQMYTGGVITGPSCGT-NLDHGVLLVGY---GTDSKLGDYWIVKNSWGQAWGIESGYVR 215
              Q Y+ G+   P+C + NLDHGVLLVGY   GTDS    YW+VKNSWG  WG+E GY++
Sbjct: 255  SLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGME-GYIK 313

Query: 214  LARGQ-NECGMNSAASYPVV 158
            +A+ + N CG+ +AASYPVV
Sbjct: 314  IAKDRDNHCGLATAASYPVV 333


>gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
          Length = 421

 Score =  269 bits (687), Expect = 2e-69
 Identities = 154/321 (47%), Positives = 197/321 (61%), Gaps = 11/321 (3%)
 Frame = -1

Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNR-YEIFKANLDLIELHNSQAGITSTMGVNAFTDM 911
            E  +Q  FS F   Y+K YAT++   R Y IFK NL  I  HN Q G + ++ +N F D+
Sbjct: 109  EAHFQDAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQ-GYSYSLKMNHFGDL 167

Query: 910  TSVEFASKMNGYXXXXXXXXXXXXXANTMKDV-----PTSWDWRTKGAVTPVKDQAQCGS 746
            +  EF  K  G+             A  + +V     P   DWR++G VTPVKDQ  CGS
Sbjct: 168  SRDEFRRKYLGFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGS 227

Query: 745  CWSFSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGIC 566
            CW+FS TG++EGA     G L SLSEQ+LMDCS  EG+ SC GG M+DAFQ+V+D+ GIC
Sbjct: 228  CWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGIC 287

Query: 565  SEASYPYKAIDEKCK-KTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEAD 389
            SE +YPY A DE+C+ ++C+ V  I  F DV        ++ A+ AAL   PVSIAIEAD
Sbjct: 288  SEDAYPYLARDEECRAQSCEKVVKILGFKDVP-----RRSEAAMKAALAKSPVSIAIEAD 342

Query: 388  QPIFQMYTGGVITGPSCGTNLDHGVLLVGYGTDSK-LGDYWIVKNSWGQAWGIESGYVRL 212
            Q  FQ Y  GV    SCGT+LDHGVLLVGYGTD +   D+WI+KNSWG  WG   GY+ +
Sbjct: 343  QMPFQFYHEGVFDA-SCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWG-RDGYMYM 400

Query: 211  ARGQNE---CGMNSAASYPVV 158
            A  + E   CG+   AS+PV+
Sbjct: 401  AMHKGEEGQCGLLLDASFPVM 421


>ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
            gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
            gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma
            gondii] gi|95007485|emb|CAJ20707.1| toxopain-2
            [Toxoplasma gondii RH] gi|523570907|gb|EPR57821.1|
            cathepsin CPL [Toxoplasma gondii GT1]
            gi|527315630|gb|EPT32244.1| cathepsin CPL [Toxoplasma
            gondii ME49] gi|557733437|gb|ESS29589.1| cathepsin CPL
            [Toxoplasma gondii VEG]
          Length = 422

 Score =  269 bits (687), Expect = 2e-69
 Identities = 154/321 (47%), Positives = 197/321 (61%), Gaps = 11/321 (3%)
 Frame = -1

Query: 1087 ERQYQREFSRFVKTYSKKYATDDFFNR-YEIFKANLDLIELHNSQAGITSTMGVNAFTDM 911
            E  +Q  FS F   Y+K YAT++   R Y IFK NL  I  HN Q G + ++ +N F D+
Sbjct: 110  EAHFQDAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQ-GYSYSLKMNHFGDL 168

Query: 910  TSVEFASKMNGYXXXXXXXXXXXXXANTMKDV-----PTSWDWRTKGAVTPVKDQAQCGS 746
            +  EF  K  G+             A  + +V     P   DWR++G VTPVKDQ  CGS
Sbjct: 169  SRDEFRRKYLGFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGS 228

Query: 745  CWSFSATGSIEGAWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGIC 566
            CW+FS TG++EGA     G L SLSEQ+LMDCS  EG+ SC GG M+DAFQ+V+D+ GIC
Sbjct: 229  CWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGIC 288

Query: 565  SEASYPYKAIDEKCK-KTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEAD 389
            SE +YPY A DE+C+ ++C+ V  I  F DV        ++ A+ AAL   PVSIAIEAD
Sbjct: 289  SEDAYPYLARDEECRAQSCEKVVKILGFKDVP-----RRSEAAMKAALAKSPVSIAIEAD 343

Query: 388  QPIFQMYTGGVITGPSCGTNLDHGVLLVGYGTDSK-LGDYWIVKNSWGQAWGIESGYVRL 212
            Q  FQ Y  GV    SCGT+LDHGVLLVGYGTD +   D+WI+KNSWG  WG   GY+ +
Sbjct: 344  QMPFQFYHEGVFDA-SCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWG-RDGYMYM 401

Query: 211  ARGQNE---CGMNSAASYPVV 158
            A  + E   CG+   AS+PV+
Sbjct: 402  AMHKGEEGQCGLLLDASFPVM 422


>gb|EJY65772.1| Cathepsin L [Oxytricha trifallax]
          Length = 338

 Score =  268 bits (685), Expect = 3e-69
 Identities = 144/305 (47%), Positives = 193/305 (63%), Gaps = 2/305 (0%)
 Frame = -1

Query: 1066 FSRFVKTYSKKYATDDFFN-RYEIFKANLDLIELHNSQAGITSTMGVNAFTDMTSVEFAS 890
            F+ FV  Y K Y T + ++ R ++FK NL  + ++N++  +T  +G+N F D T  E+  
Sbjct: 43   FTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNARNDVTYRLGLNKFADYTEAEY-K 101

Query: 889  KMNGYXXXXXXXXXXXXXANTMKDVPTSWDWRTKGAVTPVKDQAQCGSCWSFSATGSIEG 710
            ++ G+                 K+     +W  +GAVTPVKDQ QCGSCWSFSATG++EG
Sbjct: 102  RLLGFGGQKNKNPRNIKVLGAPKN--DGVNWVEQGAVTPVKDQGQCGSCWSFSATGAMEG 159

Query: 709  AWFLAKGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVIDNKGICSEASYPYKAIDE 530
               +  G L SLSEQQL+DCS  EG++ C GG MD AFQ+V +   + +E  YPY+A+D+
Sbjct: 160  HAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWMDQAFQYV-EQTALETEDQYPYEAVDD 218

Query: 529  KCKKTCKSVATISSFVDVDFDQSNPNNDTALMAALQMGPVSIAIEADQPIFQMYTGGVIT 350
             C+ +   V  + SFVDV      PNN   L AAL  GPVS+AIEADQ +FQ Y+GGVI 
Sbjct: 219  TCRASSAGVVKVDSFVDV-----TPNNVNELKAALDKGPVSVAIEADQMVFQFYSGGVIN 273

Query: 349  GPSCGTNLDHGVLLVGYGTDSKLGDYWIVKNSWGQAWGIESGYVRLARG-QNECGMNSAA 173
              SCGT LDHGVL VGYG +S   DY++VKNSWG +WG E GYV++A    N CG+ S A
Sbjct: 274  DASCGTTLDHGVLAVGYGNESG-QDYFLVKNSWGASWG-EEGYVKIAASPDNICGILSQA 331

Query: 172  SYPVV 158
            SYP++
Sbjct: 332  SYPIM 336


Top