BLASTX nr result

ID: Mentha23_contig00045984 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00045984
         (577 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EGB07432.1| hypothetical protein AURANDRAFT_2413 [Aureococcus...   194   1e-47
gb|AGM32335.1| cathepsin L-like protein [Coptotermes formosanus]      190   3e-46
gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]       189   4e-46
gb|ACO15375.1| Cathepsin L1 precursor [Caligus clemensi]              189   4e-46
gb|ABY58967.1| cathepsin L [Toxoplasma gondii]                        189   4e-46
pdb|3F75|A Chain A, Activated Toxoplasma Gondii Cathepsin L (tgc...   189   4e-46
ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [...   189   4e-46
ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liver...   188   8e-46
gb|ERL86466.1| hypothetical protein D910_03872 [Dendroctonus pon...   188   1e-45
ref|XP_004344432.1| cathepsin L2 [Capsaspora owczarzaki ATCC 308...   188   1e-45
ref|XP_005320914.1| PREDICTED: cathepsin L1-like [Ictidomys trid...   187   2e-45
gb|EMS56453.1| KDEL-tailed cysteine endopeptidase CEP1 [Triticum...   187   2e-45
gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar] ...   187   2e-45
gb|ENN82151.1| hypothetical protein YQE_01471, partial [Dendroct...   186   3e-45
gb|EFZ20954.1| hypothetical protein SINV_06041 [Solenopsis invicta]   186   3e-45
ref|XP_005765534.1| hypothetical protein EMIHUDRAFT_437163 [Emil...   186   4e-45
ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongy...   186   4e-45
gb|EFN65237.1| Cathepsin L [Camponotus floridanus]                    186   5e-45
gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]                 185   6e-45
dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]              185   6e-45

>gb|EGB07432.1| hypothetical protein AURANDRAFT_2413 [Aureococcus anophagefferens]
          Length = 263

 Score =  194 bits (493), Expect = 1e-47
 Identities = 98/175 (56%), Positives = 118/175 (67%)
 Frame = +3

Query: 24  LTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAKDEKCKKTCKT 203
           LTSLSEQ L+DC     D  C GGLMD+AF+++  N GIC+EADY Y A    CK TC  
Sbjct: 99  LTSLSEQNLVDCDTT--DSGCNGGLMDNAFKWIQSNGGICSEADYAYTAAKGTCKTTCDK 156

Query: 204 VSTISSFADVDFNEGKPTDETALMAAVQLGPVSIAIEADKPYFQLYTGGVLTDPVKCGTD 383
           V+T+S   DV        DE AL  AV +GPVSIAIEADK  FQ Y+ G+L D   CGT+
Sbjct: 157 VATLSGHTDVPSG-----DEDALKTAVAIGPVSIAIEADKSVFQSYSSGIL-DSSACGTN 210

Query: 384 LDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIRLARNKNMCGLNSAASYP 548
           LDHGVL+VGYGTD  +  +YW VKNSWG  WGE G++R+AR  N+CG+ S  SYP
Sbjct: 211 LDHGVLVVGYGTDDGS--EYWKVKNSWGTTWGESGYVRIARGSNICGIASEPSYP 263


>gb|AGM32335.1| cathepsin L-like protein [Coptotermes formosanus]
          Length = 335

 Score =  190 bits (482), Expect = 3e-46
 Identities = 95/184 (51%), Positives = 124/184 (67%), Gaps = 2/184 (1%)
 Frame = +3

Query: 6   FLKTGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAKDEKC 185
           F KTG L SLSEQ L+DCS   G+  C GGLMD AFQ+V DN+GI  E  YPY A+D+KC
Sbjct: 155 FRKTGVLISLSEQNLIDCSGKYGNQGCNGGLMDQAFQYVRDNRGIDTEVSYPYEAEDDKC 214

Query: 186 KKTCKTVSTISSFADVDFNEGKPTDETALMAAVQ-LGPVSIAIEADKPYFQLYTGGVLTD 362
           +       + S   DV F + +  +E  L  AV  +GP+S+AI+A    FQ Y  GV  +
Sbjct: 215 RYD----PSESGATDVGFTDVEEGNEQQLKEAVATIGPISVAIDAGHTSFQFYKSGVYYE 270

Query: 363 PVKCGTDLDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIRLARNK-NMCGLNSAA 539
           P   GT+LDHGVL+VGYGTD+ T  DYWLVKNSWG  WG++G++++ARN+ N CG+ + A
Sbjct: 271 PECNGTNLDHGVLVVGYGTDTETGEDYWLVKNSWGTTWGDEGYVKMARNRNNHCGIATQA 330

Query: 540 SYPI 551
           SYP+
Sbjct: 331 SYPL 334


>gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  189 bits (480), Expect = 4e-46
 Identities = 97/176 (55%), Positives = 119/176 (67%), Gaps = 1/176 (0%)
 Frame = +3

Query: 24  LTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAKDEKCKKTCKT 203
           LTSLSEQQL+DCS   GD  C GGLMD AF++++ NKGICAE+ YPY      C+K+C  
Sbjct: 170 LTSLSEQQLVDCSTSYGDAGCNGGLMDYAFEYIIANKGICAESAYPYKGVGGLCQKSCTK 229

Query: 204 VSTISSFADVDFNEGKPTDETALMAAV-QLGPVSIAIEADKPYFQLYTGGVLTDPVKCGT 380
           V TIS + DV        DE +L+ AV  +GPVS+AIEAD+  FQ Y+ GV +    CG 
Sbjct: 230 VVTISGYKDV-----ASGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGVFSG--TCGH 282

Query: 381 DLDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIRLARNKNMCGLNSAASYP 548
           +LDHGVL VGYGT  +   DYW+VKNSWG  WGE G+IR+ RNKN CG+    SYP
Sbjct: 283 NLDHGVLAVGYGTTGSQ--DYWIVKNSWGTSWGESGYIRMIRNKNQCGIAIQPSYP 336


>gb|ACO15375.1| Cathepsin L1 precursor [Caligus clemensi]
          Length = 326

 Score =  189 bits (480), Expect = 4e-46
 Identities = 102/186 (54%), Positives = 122/186 (65%), Gaps = 3/186 (1%)
 Frame = +3

Query: 3   VFLKTGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAKDEK 182
           +FLK   L SLSEQQL+DCS  EG+  C GGLMD+AF++ + NKGI  E  YPY AKD  
Sbjct: 145 IFLKKKKLMSLSEQQLVDCSGDEGNLGCGGGLMDNAFKYFIANKGIANEKSYPYTAKDND 204

Query: 183 CK-KTCKTVSTISSFADVDFNEGKPTDETAL-MAAVQLGPVSIAIEADKPYFQLYTGGVL 356
           CK K   +V+TISSF DV     K  DE  L MA   +GPVS+AI+A    FQ Y  GV 
Sbjct: 205 CKYKKSMSVATISSFKDV-----KHKDEDQLKMAVANVGPVSVAIDASSSKFQFYESGVY 259

Query: 357 TDPVKCGTDLDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIRLARNK-NMCGLNS 533
            D       LDHGVL VGYGTD  + +D+WLVKNSW A WG  G+I++ARNK N CG+ +
Sbjct: 260 YDENCSSEVLDHGVLAVGYGTDKKSGMDFWLVKNSWAASWGLNGYIKMARNKDNNCGIAT 319

Query: 534 AASYPI 551
            ASYPI
Sbjct: 320 MASYPI 325


>gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
          Length = 421

 Score =  189 bits (480), Expect = 4e-46
 Identities = 97/184 (52%), Positives = 124/184 (67%), Gaps = 4/184 (2%)
 Frame = +3

Query: 12  KTGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAKDEKCK- 188
           KTG L SLSEQ+LMDCS  EG+ SC GG M+DAFQ+VLD+ GIC+E  YPY+A+DE+C+ 
Sbjct: 244 KTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDEECRA 303

Query: 189 KTCKTVSTISSFADVDFNEGKPTDETALMAAVQLGPVSIAIEADKPYFQLYTGGVLTDPV 368
           ++C+ V  I  F DV         E A+ AA+   PVSIAIEAD+  FQ Y  GV     
Sbjct: 304 QSCEKVVKILGFKDV-----PRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEGVF--DA 356

Query: 369 KCGTDLDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIRLARNK---NMCGLNSAA 539
            CGTDLDHGVL+VGYGTD  +  D+W++KNSWG  WG  G++ +A +K     CGL   A
Sbjct: 357 SCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGLLLDA 416

Query: 540 SYPI 551
           S+P+
Sbjct: 417 SFPV 420


>pdb|3F75|A Chain A, Activated Toxoplasma Gondii Cathepsin L (tgcpl) In Complex
           With Its Propeptide
          Length = 224

 Score =  189 bits (480), Expect = 4e-46
 Identities = 97/184 (52%), Positives = 124/184 (67%), Gaps = 4/184 (2%)
 Frame = +3

Query: 12  KTGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAKDEKCK- 188
           KTG L SLSEQ+LMDCS  EG+ SC GG M+DAFQ+VLD+ GIC+E  YPY+A+DE+C+ 
Sbjct: 47  KTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDEECRA 106

Query: 189 KTCKTVSTISSFADVDFNEGKPTDETALMAAVQLGPVSIAIEADKPYFQLYTGGVLTDPV 368
           ++C+ V  I  F DV         E A+ AA+   PVSIAIEAD+  FQ Y  GV     
Sbjct: 107 QSCEKVVKILGFKDV-----PRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEGVF--DA 159

Query: 369 KCGTDLDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIRLARNK---NMCGLNSAA 539
            CGTDLDHGVL+VGYGTD  +  D+W++KNSWG  WG  G++ +A +K     CGL   A
Sbjct: 160 SCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGLLLDA 219

Query: 540 SYPI 551
           S+P+
Sbjct: 220 SFPV 223


>ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
           gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma
           gondii] gi|89242977|gb|ABD64744.1| cathepsin L
           [Toxoplasma gondii] gi|95007485|emb|CAJ20707.1|
           toxopain-2 [Toxoplasma gondii RH]
           gi|523570907|gb|EPR57821.1| cathepsin CPL [Toxoplasma
           gondii GT1] gi|527315630|gb|EPT32244.1| cathepsin CPL
           [Toxoplasma gondii ME49] gi|557733437|gb|ESS29589.1|
           cathepsin CPL [Toxoplasma gondii VEG]
          Length = 422

 Score =  189 bits (480), Expect = 4e-46
 Identities = 97/184 (52%), Positives = 124/184 (67%), Gaps = 4/184 (2%)
 Frame = +3

Query: 12  KTGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAKDEKCK- 188
           KTG L SLSEQ+LMDCS  EG+ SC GG M+DAFQ+VLD+ GIC+E  YPY+A+DE+C+ 
Sbjct: 245 KTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDEECRA 304

Query: 189 KTCKTVSTISSFADVDFNEGKPTDETALMAAVQLGPVSIAIEADKPYFQLYTGGVLTDPV 368
           ++C+ V  I  F DV         E A+ AA+   PVSIAIEAD+  FQ Y  GV     
Sbjct: 305 QSCEKVVKILGFKDV-----PRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEGVF--DA 357

Query: 369 KCGTDLDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIRLARNK---NMCGLNSAA 539
            CGTDLDHGVL+VGYGTD  +  D+W++KNSWG  WG  G++ +A +K     CGL   A
Sbjct: 358 SCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGLLLDA 417

Query: 540 SYPI 551
           S+P+
Sbjct: 418 SFPV 421


>ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
           gi|325114397|emb|CBZ49954.1| cathepsin L, related
           [Neospora caninum Liverpool]
          Length = 415

 Score =  188 bits (478), Expect = 8e-46
 Identities = 92/167 (55%), Positives = 117/167 (70%)
 Frame = +3

Query: 12  KTGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAKDEKCKK 191
           KTG L SLSEQ+L+DCS  EG+  C GG M+DAFQ+V+D+ G+C+E  YPY+A+D +CK+
Sbjct: 247 KTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLCSEEGYPYLARDGECKR 306

Query: 192 TCKTVSTISSFADVDFNEGKPTDETALMAAVQLGPVSIAIEADKPYFQLYTGGVLTDPVK 371
            CK V TIS F DV         ETA+ AA+   PVSIAIEAD+  FQ Y  GV      
Sbjct: 307 ACKKVVTISGFKDV-----PRKSETAMKAALAHSPVSIAIEADQLPFQFYHEGVF--DAS 359

Query: 372 CGTDLDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIRLARNK 512
           CGTDLDHGVL+VGYGTD  T  D+W++KNSWG+ WG  G++ +A +K
Sbjct: 360 CGTDLDHGVLLVGYGTDKETKKDFWIMKNSWGSGWGRDGYMYMAMHK 406


>gb|ERL86466.1| hypothetical protein D910_03872 [Dendroctonus ponderosae]
          Length = 338

 Score =  188 bits (477), Expect = 1e-45
 Identities = 96/185 (51%), Positives = 126/185 (68%), Gaps = 3/185 (1%)
 Frame = +3

Query: 6   FLKTGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAKDEKC 185
           F +T  L SLSEQ L+DCS   G++ C GGLMD+AF+++ +N GI  EA YPY+ +DEKC
Sbjct: 158 FRQTKKLVSLSEQNLVDCSSRFGNNGCNGGLMDNAFRYIKNNGGIDTEAAYPYMGEDEKC 217

Query: 186 KKTCKTV-STISSFADVDFNEGKPTDETALMAAVQ-LGPVSIAIEADKPYFQLYTGGVLT 359
           + + K   +T   F D+        DE  L AAV  +GP+SIAI+A    FQLY+ GV +
Sbjct: 218 RYSAKNRGATDKGFVDIPSG-----DEDKLKAAVATVGPISIAIDASHESFQLYSNGVYS 272

Query: 360 DPVKCGTDLDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIRLARNK-NMCGLNSA 536
           DP    T+LDHGVL+VGYGTD  T +DYWLVKNSWG  WG  G+I++ARN+ N CG+ + 
Sbjct: 273 DPTCSSTELDHGVLVVGYGTDEKTGMDYWLVKNSWGDTWGLDGYIKMARNQDNQCGVATQ 332

Query: 537 ASYPI 551
           ASYP+
Sbjct: 333 ASYPL 337


>ref|XP_004344432.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
           gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora
           owczarzaki ATCC 30864]
          Length = 334

 Score =  188 bits (477), Expect = 1e-45
 Identities = 98/182 (53%), Positives = 121/182 (66%), Gaps = 2/182 (1%)
 Frame = +3

Query: 12  KTGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAKDEKCKK 191
           KTG L SLSEQ L+DCS  +G+  C GGLMDDAFQ+++ NKGI  EA YPY AKD  CK 
Sbjct: 158 KTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDTEASYPYTAKDGTCKF 217

Query: 192 TCKTV-STISSFADVDFNEGKPTDETALMAAVQLGPVSIAIEADKPYFQLYTGGVLTDPV 368
               V +T+SSF D+    G  +D    +A V  GPVS+AI+A K  FQLYT GV  +  
Sbjct: 218 NAANVGATLSSFQDI--TRGSESDLQNAVATV--GPVSVAIDASKNSFQLYTSGVYNEKK 273

Query: 369 KCGTDLDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIRLARN-KNMCGLNSAASY 545
              T LDHGVL  GYGT + T   YWLVKNSWG+ WG+ G+I ++RN  N CG+ ++ASY
Sbjct: 274 CSSTSLDHGVLAAGYGTSNGT--PYWLVKNSWGSSWGQAGYIWMSRNANNQCGIATSASY 331

Query: 546 PI 551
           PI
Sbjct: 332 PI 333


>ref|XP_005320914.1| PREDICTED: cathepsin L1-like [Ictidomys tridecemlineatus]
          Length = 332

 Score =  187 bits (475), Expect = 2e-45
 Identities = 94/185 (50%), Positives = 128/185 (69%), Gaps = 3/185 (1%)
 Frame = +3

Query: 3   VFLKTGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAKDEK 182
           +F KTG L SLSEQ L+DCS P+G+  C+GGLMD+AFQ++ DN G+ +E  YPY A+DE 
Sbjct: 152 MFRKTGKLISLSEQNLVDCSRPQGNLGCDGGLMDNAFQYIKDNGGLDSEDSYPYEAQDET 211

Query: 183 CK-KTCKTVSTISSFADVDFNEGKPTDETALMAAVQ-LGPVSIAIEADKPYFQLYTGGVL 356
           CK K    V+  + F D+      P  E ALM+AV  +GP+S+AI+A    FQ Y  GV 
Sbjct: 212 CKYKPEFAVANDTGFVDI------PPREKALMSAVATVGPISVAIDAGHASFQFYKSGVY 265

Query: 357 TDPVKCGTDLDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIRLARNK-NMCGLNS 533
            DP     DLDHGVL+VGYG ++ +N  +WLVKNSWG++WG  G++++A++K N CG+ +
Sbjct: 266 YDPECSSKDLDHGVLVVGYGVEANSNKKFWLVKNSWGSEWGADGYVKMAKDKNNHCGIAT 325

Query: 534 AASYP 548
           AASYP
Sbjct: 326 AASYP 330


>gb|EMS56453.1| KDEL-tailed cysteine endopeptidase CEP1 [Triticum urartu]
          Length = 339

 Score =  187 bits (475), Expect = 2e-45
 Identities = 96/188 (51%), Positives = 118/188 (62%), Gaps = 4/188 (2%)
 Frame = +3

Query: 3   VFLKTGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAKDEK 182
           V L TGNL SLSEQ+L+DC     D  CEGGLMDDAF+F++ N G+  E+ YPY   D+K
Sbjct: 160 VKLSTGNLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTKESSYPYAGADDK 219

Query: 183 CKKTCKTVSTISSFADVDFNEGKPTDETALMAAVQLGPVSIAIEADKPYFQLYTGGVLTD 362
           CK    +V+TI  + DV  N     DE ALM AV   PVS+A++     FQ Y+GGV+T 
Sbjct: 220 CKSGSNSVATIKGYEDVPTN-----DEGALMKAVASQPVSVAVDGGDMTFQFYSGGVMTG 274

Query: 363 PVKCGTDLDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIR----LARNKNMCGLN 530
              CGTDLDHG+  +GYGT S     YWL+KNSWG  WGE G++R    +A  K MCGL 
Sbjct: 275 --SCGTDLDHGIAAIGYGTTS-DGTKYWLLKNSWGTTWGENGYLRMEKDIADKKGMCGLA 331

Query: 531 SAASYPIA 554
              SYP A
Sbjct: 332 MEPSYPTA 339


>gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
           gi|260516656|gb|ACX43955.1| cysteine protease 1
           [Brachiaria hybrid cultivar] gi|260516658|gb|ACX43956.1|
           cysteine protease 1 [Brachiaria hybrid cultivar]
           gi|260516660|gb|ACX43957.1| cysteine protease 1
           [Brachiaria hybrid cultivar] gi|260516662|gb|ACX43958.1|
           cysteine protease 2 [Brachiaria hybrid cultivar]
           gi|260516664|gb|ACX43959.1| cysteine protease 2
           [Brachiaria hybrid cultivar] gi|260516666|gb|ACX43960.1|
           cysteine protease 2 [Brachiaria hybrid cultivar]
           gi|260516668|gb|ACX43961.1| cysteine protease 2
           [Brachiaria hybrid cultivar] gi|260516670|gb|ACX43962.1|
           cysteine protease 2 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  187 bits (475), Expect = 2e-45
 Identities = 96/176 (54%), Positives = 119/176 (67%), Gaps = 1/176 (0%)
 Frame = +3

Query: 24  LTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAKDEKCKKTCKT 203
           LTSLSEQQL+DCS   G+  C GGLMD AF++++ NKGICAE+ YPY      C+K+C  
Sbjct: 170 LTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAESAYPYKGVGGLCQKSCTK 229

Query: 204 VSTISSFADVDFNEGKPTDETALMAAV-QLGPVSIAIEADKPYFQLYTGGVLTDPVKCGT 380
           V TIS + DV        DE +L+ AV  +GPVS+AIEAD+  FQ Y+ GV +    CG 
Sbjct: 230 VVTISGYKDV-----ASGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGVFSG--TCGH 282

Query: 381 DLDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIRLARNKNMCGLNSAASYP 548
           +LDHGVL VGYGT  +   DYW+VKNSWG  WGE G+IR+ RNKN CG+    SYP
Sbjct: 283 NLDHGVLAVGYGTTGSQ--DYWIVKNSWGTSWGESGYIRMIRNKNQCGIAIQPSYP 336


>gb|ENN82151.1| hypothetical protein YQE_01471, partial [Dendroctonus ponderosae]
          Length = 334

 Score =  186 bits (473), Expect = 3e-45
 Identities = 95/185 (51%), Positives = 126/185 (68%), Gaps = 3/185 (1%)
 Frame = +3

Query: 6   FLKTGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAKDEKC 185
           F +T  L SLSEQ L+DCS   G++ C GGLMD+AF+++ +N GI  EA YPY+ +D+KC
Sbjct: 154 FRQTKKLVSLSEQNLVDCSSRFGNNGCNGGLMDNAFRYIKNNGGIDTEAAYPYMGEDKKC 213

Query: 186 KKTCKTV-STISSFADVDFNEGKPTDETALMAAVQ-LGPVSIAIEADKPYFQLYTGGVLT 359
           + + K   +T   F D+        DE  L AAV  +GP+SIAI+A    FQLY+ GV +
Sbjct: 214 RYSAKNRGATDKGFVDIPSG-----DEDKLKAAVATVGPISIAIDASHESFQLYSNGVYS 268

Query: 360 DPVKCGTDLDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIRLARNK-NMCGLNSA 536
           DP    T+LDHGVL+VGYGTD  T +DYWLVKNSWG  WG  G+I++ARN+ N CG+ + 
Sbjct: 269 DPTCSSTELDHGVLVVGYGTDEKTGMDYWLVKNSWGDTWGLDGYIKMARNQDNQCGVATQ 328

Query: 537 ASYPI 551
           ASYP+
Sbjct: 329 ASYPL 333


>gb|EFZ20954.1| hypothetical protein SINV_06041 [Solenopsis invicta]
          Length = 337

 Score =  186 bits (473), Expect = 3e-45
 Identities = 97/184 (52%), Positives = 125/184 (67%), Gaps = 2/184 (1%)
 Frame = +3

Query: 6   FLKTGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAKDEKC 185
           F  TG L SLSEQ L+DCS   G++ C GGLMD AFQ++ DNKG+  E  YPY A++++C
Sbjct: 157 FRSTGYLVSLSEQNLIDCSGKYGNNGCNGGLMDYAFQYIKDNKGLDTEKTYPYEAENDRC 216

Query: 186 KKTCKTVSTISSFADVDFNEGKPTDETALMAAVQ-LGPVSIAIEADKPYFQLYTGGVLTD 362
           +   +  S  +    VD  +G   DE  L AAV  +GP+S+AI+A    FQLY+ GV  D
Sbjct: 217 RYNPRN-SGATDKGYVDIPQG---DEEKLKAAVATIGPISVAIDASHESFQLYSEGVYYD 272

Query: 363 PVKCGTDLDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIRLARNK-NMCGLNSAA 539
           P     +LDHGVLIVGYGTD  +  DYWLVKNSWG  WG+KG+I++ARNK N CG+ S+A
Sbjct: 273 PDCSAENLDHGVLIVGYGTDETSGHDYWLVKNSWGKTWGQKGYIKMARNKNNHCGIASSA 332

Query: 540 SYPI 551
           SYP+
Sbjct: 333 SYPL 336


>ref|XP_005765534.1| hypothetical protein EMIHUDRAFT_437163 [Emiliania huxleyi CCMP1516]
           gi|485615557|gb|EOD13105.1| hypothetical protein
           EMIHUDRAFT_437163 [Emiliania huxleyi CCMP1516]
          Length = 452

 Score =  186 bits (472), Expect = 4e-45
 Identities = 94/185 (50%), Positives = 122/185 (65%), Gaps = 5/185 (2%)
 Frame = +3

Query: 9   LKTGNLTSLSEQQL--MDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAK--- 173
           + TG L SLSE++L  + C    GD  C+GGLMD+AF+++ +   +C E+ YPY +    
Sbjct: 146 IATGKLVSLSEEELELVQCD-TNGDHGCKGGLMDNAFEWIAEGNPLCTESTYPYTSGAGL 204

Query: 174 DEKCKKTCKTVSTISSFADVDFNEGKPTDETALMAAVQLGPVSIAIEADKPYFQLYTGGV 353
              CKK C    +++S  DV        DE AL AAV   PVS+AIEADK  FQLY  GV
Sbjct: 205 TGTCKKACNGEVSLTSHKDVPSG-----DEDALRAAVAKQPVSVAIEADKSAFQLYQSGV 259

Query: 354 LTDPVKCGTDLDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIRLARNKNMCGLNS 533
           + D   CG +LDHGVL+VGYGTD+AT  DYW +KNSWG  WGE+GF+R+ + KNMCG++S
Sbjct: 260 I-DSASCGKELDHGVLVVGYGTDTATGKDYWKIKNSWGGTWGEEGFVRVVQGKNMCGISS 318

Query: 534 AASYP 548
            ASYP
Sbjct: 319 QASYP 323


>ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongylocentrotus
           purpuratus]
          Length = 336

 Score =  186 bits (472), Expect = 4e-45
 Identities = 99/186 (53%), Positives = 125/186 (67%), Gaps = 4/186 (2%)
 Frame = +3

Query: 6   FLKTGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAKDEKC 185
           F KT  L SLSEQ L+DCS  EG+  CEGGLMD  FQ+V+DN GI +E  YPY A+DE C
Sbjct: 158 FKKTSKLVSLSEQNLVDCSRTEGNMGCEGGLMDQGFQYVIDNHGIDSEDCYPYDAEDETC 217

Query: 186 --KKTCKTVSTISSFADVDFNEGKPTDETALMAAV-QLGPVSIAIEADKPYFQLYTGGVL 356
             K +C + + ++ F DV        DE ALM AV  +GPVS+AI+A    FQLY  GV 
Sbjct: 218 HYKASCDS-AEVTGFTDVTSG-----DEQALMEAVASVGPVSVAIDASHQSFQLYESGVY 271

Query: 357 TDPVKCGTDLDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIRLARNK-NMCGLNS 533
            +P    ++LDHGVL+VGYGTD     DYWLVKNSWG  WG  G+I+++RNK N CG+ +
Sbjct: 272 DEPECSSSELDHGVLVVGYGTDGGK--DYWLVKNSWGETWGLSGYIKMSRNKSNQCGIAT 329

Query: 534 AASYPI 551
           +ASYP+
Sbjct: 330 SASYPL 335


>gb|EFN65237.1| Cathepsin L [Camponotus floridanus]
          Length = 372

 Score =  186 bits (471), Expect = 5e-45
 Identities = 95/184 (51%), Positives = 125/184 (67%), Gaps = 2/184 (1%)
 Frame = +3

Query: 6   FLKTGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAKDEKC 185
           F ++G L SLSEQ L+DCS   G++ C GGLMD AF+++ +NKG+  E  YPY A++++C
Sbjct: 192 FRQSGVLVSLSEQNLIDCSGKYGNNGCNGGLMDYAFRYIKENKGLDTEKSYPYEAENDQC 251

Query: 186 KKTCKTVSTISSFADVDFNEGKPTDETALMAAVQ-LGPVSIAIEADKPYFQLYTGGVLTD 362
           +   K     S  +DV F +    DE  L AAV  +GP+S+AI+A    F  Y+ GV  +
Sbjct: 252 RYNPKN----SGASDVGFVDIPEGDEDKLKAAVATIGPISVAIDASHESFHFYSEGVYYE 307

Query: 363 PVKCGTDLDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIRLARNK-NMCGLNSAA 539
           P     +LDHGVLIVGYGTDS T  DYWLVKNSWG  WGEKG+I++ARNK N CG+ S+A
Sbjct: 308 PECSPANLDHGVLIVGYGTDSGTGEDYWLVKNSWGETWGEKGYIKMARNKENHCGIASSA 367

Query: 540 SYPI 551
           SYP+
Sbjct: 368 SYPL 371


>gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
          Length = 324

 Score =  185 bits (470), Expect = 6e-45
 Identities = 93/185 (50%), Positives = 124/185 (67%), Gaps = 3/185 (1%)
 Frame = +3

Query: 6   FLKTGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAKDEKC 185
           F KTG L SLSEQ L+DCS   G++ C+GGLMD+AF ++ +NKGI +EA YPY A+D KC
Sbjct: 146 FKKTGKLVSLSEQNLVDCSTAYGNNGCDGGLMDNAFTYIKENKGIDSEASYPYTAEDGKC 205

Query: 186 K-KTCKTVSTISSFADV-DFNEGKPTDETALMAAVQLGPVSIAIEADKPYFQLYTGGVLT 359
             K     +T + F D+ + NE K  +     A   +GP+S+AI+A    FQ Y+ GV  
Sbjct: 206 VFKKSSVAATDTGFVDIPEGNENKLKE-----AVASVGPISVAIDASHESFQFYSSGVYN 260

Query: 360 DPVKCGTDLDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIRLARN-KNMCGLNSA 536
           +P    T+LDHGVL+VGYGT+S    DYWLVKNSW   WG+KG+I++ RN KN CG+ + 
Sbjct: 261 EPSCSSTELDHGVLVVGYGTESGK--DYWLVKNSWNTSWGDKGYIKMRRNAKNQCGIATK 318

Query: 537 ASYPI 551
           ASYP+
Sbjct: 319 ASYPL 323


>dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
          Length = 338

 Score =  185 bits (470), Expect = 6e-45
 Identities = 99/184 (53%), Positives = 124/184 (67%), Gaps = 2/184 (1%)
 Frame = +3

Query: 6   FLKTGNLTSLSEQQLMDCSHPEGDDSCEGGLMDDAFQFVLDNKGICAEADYPYVAKDEKC 185
           F KTG L SLSEQ L+DCS   G++ C GGLMD+AF+++ DN GI  E  YPY+A+DEKC
Sbjct: 159 FRKTGKLVSLSEQNLVDCSGRYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYLAEDEKC 218

Query: 186 K-KTCKTVSTISSFADVDFNEGKPTDETALMAAVQLGPVSIAIEADKPYFQLYTGGVLTD 362
             K   + +T   F D++  E    D  A +A V  GPVSIAI+A    FQLY+ GV +D
Sbjct: 219 HYKAQNSGATDKGFVDIE--EANEDDLKAAVATV--GPVSIAIDASHETFQLYSDGVYSD 274

Query: 363 PVKCGTDLDHGVLIVGYGTDSATNVDYWLVKNSWGAKWGEKGFIRLARNK-NMCGLNSAA 539
           P     +LDHGVL+VGYGT S    DYWLVKNSWG  WG  G+I++ARN+ NMCG+ S A
Sbjct: 275 PECSSQELDHGVLVVGYGT-SDDGQDYWLVKNSWGPSWGLNGYIKMARNQDNMCGVASQA 333

Query: 540 SYPI 551
           SYP+
Sbjct: 334 SYPL 337

