BLASTX nr result

ID: Mentha23_contig00020754 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00020754
         (850 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002427787.1| Cathepsin F precursor, putative [Pediculus h...   234   4e-59
ref|XP_003696602.1| PREDICTED: putative cysteine proteinase CG12...   233   9e-59
ref|XP_006567715.1| PREDICTED: putative cysteine proteinase CG12...   231   3e-58
ref|XP_392381.3| PREDICTED: putative cysteine proteinase CG12163...   231   3e-58
gb|EFO21301.2| ctsf protein [Loa loa]                                 230   6e-58
ref|XP_003142769.1| ctsf protein [Loa loa]                            230   6e-58
ref|XP_006608091.1| PREDICTED: uncharacterized protein LOC102680...   229   8e-58
ref|XP_003436277.1| AGAP002879-PB [Anopheles gambiae str. PEST] ...   229   8e-58
ref|XP_312034.5| AGAP002879-PA [Anopheles gambiae str. PEST] gi|...   229   8e-58
ref|XP_003436276.1| AGAP002879-PC [Anopheles gambiae str. PEST] ...   229   8e-58
gb|ESP02306.1| hypothetical protein LOTGIDRAFT_172186 [Lottia gi...   229   1e-57
gb|EKC42097.1| Cathepsin F [Crassostrea gigas]                        229   1e-57
gb|AFQ01139.1| cathepsin F-like protease, partial [Chilo suppres...   228   2e-57
ref|NP_001156454.1| cathepsin F isoform 2 precursor [Acyrthosiph...   226   6e-57
gb|EFN80374.1| Putative cysteine proteinase CG12163 [Harpegnatho...   226   8e-57
emb|CDJ26737.1| cathepsin F-like cysteine peptidase protein [Tit...   226   1e-56
gb|AAT07059.1| cathepsin F-like cysteine proteinase, partial [Br...   225   1e-56
ref|XP_003378245.1| cathepsin F [Trichinella spiralis] gi|316972...   224   4e-56
ref|NP_001156453.1| cathepsin F isoform 1 precursor [Acyrthosiph...   224   4e-56
gb|EJW79799.1| cysteine protease 6 [Wuchereria bancrofti]             223   5e-56

>ref|XP_002427787.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
           gi|212512256|gb|EEB15049.1| Cathepsin F precursor,
           putative [Pediculus humanus corporis]
          Length = 434

 Score =  234 bits (596), Expect = 4e-59
 Identities = 110/224 (49%), Positives = 151/224 (67%), Gaps = 6/224 (2%)
 Frame = -2

Query: 747 VFIPKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQELLDCDK 568
           V +P  FD+R   AVTPVKNQG+CGSCWAFS   NIEG+WA KK +L+ LSEQEL+DCDK
Sbjct: 213 VKLPDNFDWRHYNAVTPVKNQGSCGSCWAFSVTGNIEGLWAIKKHELLSLSEQELIDCDK 272

Query: 567 VDLGCMGGWMEDAVKELVRLGGVAKSKDYPYTAQEGQCRFKKSMAAVKVSGSRGVPANEK 388
           +D GC GG+M +  + +++LGG+    DYPY A+  +C   K+   VK++G+  +  +E 
Sbjct: 273 IDNGCNGGYMPETYEAIMKLGGLETETDYPYEAENEKCNLNKTEIKVKINGAVNLTKSEL 332

Query: 387 KIARALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIVGYGVEKS-- 214
            IA+ L KNGP+S  + A+A  QFY+ GI +P +  C+  +QD  HG+ IVGYG+ KS  
Sbjct: 333 DIAKWLYKNGPVSAGLNANA-MQFYLGGISHPPKILCNPEEQD--HGILIVGYGIHKSSI 389

Query: 213 ----LPYWVIKNSWGDSWGDKGYIKLIRGKNACAVASYVSTAYL 94
               +PYW+IKNSWG  WG+KGY +L RG   C +   VS+A +
Sbjct: 390 LKRTIPYWIIKNSWGKHWGEKGYYRLYRGSGVCGINQMVSSALI 433


>ref|XP_003696602.1| PREDICTED: putative cysteine proteinase CG12163-like [Apis florea]
          Length = 881

 Score =  233 bits (593), Expect = 9e-59
 Identities = 111/222 (50%), Positives = 151/222 (68%), Gaps = 6/222 (2%)
 Frame = -2

Query: 747  VFIPKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQELLDCDK 568
            +F+P +FD+R   AVTPVK+QG CGSCWAFS   N+EG +A K K+L+ LSEQELLDCD 
Sbjct: 660  IFLPPKFDWRDYNAVTPVKDQGLCGSCWAFSVTGNVEGQYAIKYKKLLSLSEQELLDCDT 719

Query: 567  VDLGCMGGWMEDAVKELVRLGGVAKSKDYPYTAQEGQCRFKKSMAAVKVSGSRGVPANEK 388
            +D GC GG+ME+A K + +LGG+    DYPY  +  +C F K  A V+V G+  + +NE 
Sbjct: 720  LDEGCNGGYMENAYKAIEKLGGLELESDYPYDGRNEKCHFFKKNAKVQVVGAVNITSNET 779

Query: 387  KIARALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIVGYGV----- 223
            K+A+ LIKNGP+S+ + A+A  QFY+ G+ +P    C  N +D++HGV IVGYG+     
Sbjct: 780  KMAQWLIKNGPISIGINANA-MQFYIGGVSHPFHFLC--NPKDLDHGVLIVGYGISKYPL 836

Query: 222  -EKSLPYWVIKNSWGDSWGDKGYIKLIRGKNACAVASYVSTA 100
              K LPYW+IKNSWG  WG+ GY ++ RG   C V +  S+A
Sbjct: 837  FHKELPYWIIKNSWGSRWGENGYYRVYRGDGTCGVNAMASSA 878


>ref|XP_006567715.1| PREDICTED: putative cysteine proteinase CG12163-like isoform X1 [Apis
            mellifera]
          Length = 884

 Score =  231 bits (588), Expect = 3e-58
 Identities = 110/222 (49%), Positives = 150/222 (67%), Gaps = 6/222 (2%)
 Frame = -2

Query: 747  VFIPKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQELLDCDK 568
            +F+P +FD+R    VTPVK+QG CGSCWAFS   N+EG +A K K+L+ LSEQELLDCD 
Sbjct: 663  IFLPLKFDWRDYNVVTPVKDQGLCGSCWAFSVTGNVEGQYAIKYKKLLSLSEQELLDCDT 722

Query: 567  VDLGCMGGWMEDAVKELVRLGGVAKSKDYPYTAQEGQCRFKKSMAAVKVSGSRGVPANEK 388
            +D GC GG+ME+A K + +LGG+    DYPY  +  +C F K  A V+V G+  + +NE 
Sbjct: 723  LDEGCNGGYMENAYKAIEKLGGLELESDYPYDGRNEKCHFFKKNAKVQVVGAVNITSNET 782

Query: 387  KIARALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIVGYGV----- 223
            K+A+ LIKNGP+S+ + A+A  QFY+ G+ +P    C  N +D++HGV IVGYG+     
Sbjct: 783  KMAQWLIKNGPISIGINANA-MQFYIGGVSHPFHFLC--NPKDLDHGVLIVGYGISKYPL 839

Query: 222  -EKSLPYWVIKNSWGDSWGDKGYIKLIRGKNACAVASYVSTA 100
              K LPYW+IKNSWG  WG+ GY ++ RG   C V +  S+A
Sbjct: 840  FHKKLPYWIIKNSWGSRWGENGYYRVYRGDGTCGVNAMASSA 881


>ref|XP_392381.3| PREDICTED: putative cysteine proteinase CG12163-like isoform X2 [Apis
            mellifera]
          Length = 881

 Score =  231 bits (588), Expect = 3e-58
 Identities = 110/222 (49%), Positives = 150/222 (67%), Gaps = 6/222 (2%)
 Frame = -2

Query: 747  VFIPKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQELLDCDK 568
            +F+P +FD+R    VTPVK+QG CGSCWAFS   N+EG +A K K+L+ LSEQELLDCD 
Sbjct: 660  IFLPLKFDWRDYNVVTPVKDQGLCGSCWAFSVTGNVEGQYAIKYKKLLSLSEQELLDCDT 719

Query: 567  VDLGCMGGWMEDAVKELVRLGGVAKSKDYPYTAQEGQCRFKKSMAAVKVSGSRGVPANEK 388
            +D GC GG+ME+A K + +LGG+    DYPY  +  +C F K  A V+V G+  + +NE 
Sbjct: 720  LDEGCNGGYMENAYKAIEKLGGLELESDYPYDGRNEKCHFFKKNAKVQVVGAVNITSNET 779

Query: 387  KIARALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIVGYGV----- 223
            K+A+ LIKNGP+S+ + A+A  QFY+ G+ +P    C  N +D++HGV IVGYG+     
Sbjct: 780  KMAQWLIKNGPISIGINANA-MQFYIGGVSHPFHFLC--NPKDLDHGVLIVGYGISKYPL 836

Query: 222  -EKSLPYWVIKNSWGDSWGDKGYIKLIRGKNACAVASYVSTA 100
              K LPYW+IKNSWG  WG+ GY ++ RG   C V +  S+A
Sbjct: 837  FHKKLPYWIIKNSWGSRWGENGYYRVYRGDGTCGVNAMASSA 878


>gb|EFO21301.2| ctsf protein [Loa loa]
          Length = 472

 Score =  230 bits (586), Expect = 6e-58
 Identities = 106/216 (49%), Positives = 144/216 (66%)
 Frame = -2

Query: 741 IPKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQELLDCDKVD 562
           +P+QFD+R K  VTPVKNQG+CGSCWAFS   NIEG+WA K  +LI LSEQEL+DCD++D
Sbjct: 259 LPEQFDWRTKGVVTPVKNQGSCGSCWAFSVTGNIEGLWAIKTGKLISLSEQELIDCDRID 318

Query: 561 LGCMGGWMEDAVKELVRLGGVAKSKDYPYTAQEGQCRFKKSMAAVKVSGSRGVPANEKKI 382
            GC GG   +A +E+ R+GG+     YPY A+ G C   +S  AV +  +  +P NE  +
Sbjct: 319 KGCNGGLPINAFREIQRMGGLEPEDQYPYKARNGTCHLIRSAIAVTIDDAVEIPRNETVM 378

Query: 381 ARALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIVGYGVEKSLPYW 202
              +++ GPLSV + A     +Y  GI +P R++C  +   ++HGV I GYGVE  LPYW
Sbjct: 379 KAWIVQRGPLSVGIDAKL-LAYYKSGILHPSRSRCPPS--GIDHGVLITGYGVENGLPYW 435

Query: 201 VIKNSWGDSWGDKGYIKLIRGKNACAVASYVSTAYL 94
            IKNSWGD WG+ GY +L+ GK+ C V+  VS+A +
Sbjct: 436 TIKNSWGDQWGEDGYFRLMLGKDVCGVSDLVSSAII 471


>ref|XP_003142769.1| ctsf protein [Loa loa]
          Length = 437

 Score =  230 bits (586), Expect = 6e-58
 Identities = 106/216 (49%), Positives = 144/216 (66%)
 Frame = -2

Query: 741 IPKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQELLDCDKVD 562
           +P+QFD+R K  VTPVKNQG+CGSCWAFS   NIEG+WA K  +LI LSEQEL+DCD++D
Sbjct: 224 LPEQFDWRTKGVVTPVKNQGSCGSCWAFSVTGNIEGLWAIKTGKLISLSEQELIDCDRID 283

Query: 561 LGCMGGWMEDAVKELVRLGGVAKSKDYPYTAQEGQCRFKKSMAAVKVSGSRGVPANEKKI 382
            GC GG   +A +E+ R+GG+     YPY A+ G C   +S  AV +  +  +P NE  +
Sbjct: 284 KGCNGGLPINAFREIQRMGGLEPEDQYPYKARNGTCHLIRSAIAVTIDDAVEIPRNETVM 343

Query: 381 ARALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIVGYGVEKSLPYW 202
              +++ GPLSV + A     +Y  GI +P R++C  +   ++HGV I GYGVE  LPYW
Sbjct: 344 KAWIVQRGPLSVGIDAKL-LAYYKSGILHPSRSRCPPS--GIDHGVLITGYGVENGLPYW 400

Query: 201 VIKNSWGDSWGDKGYIKLIRGKNACAVASYVSTAYL 94
            IKNSWGD WG+ GY +L+ GK+ C V+  VS+A +
Sbjct: 401 TIKNSWGDQWGEDGYFRLMLGKDVCGVSDLVSSAII 436


>ref|XP_006608091.1| PREDICTED: uncharacterized protein LOC102680728 [Apis dorsata]
          Length = 880

 Score =  229 bits (585), Expect = 8e-58
 Identities = 109/222 (49%), Positives = 150/222 (67%), Gaps = 6/222 (2%)
 Frame = -2

Query: 747  VFIPKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQELLDCDK 568
            +F+P +FD+R    VTPVK+QG CGSCWAFS   N+EG +A K K+L+ LSEQELLDCD 
Sbjct: 659  IFLPSKFDWRDYNVVTPVKDQGLCGSCWAFSVTGNVEGQYAIKYKKLLSLSEQELLDCDT 718

Query: 567  VDLGCMGGWMEDAVKELVRLGGVAKSKDYPYTAQEGQCRFKKSMAAVKVSGSRGVPANEK 388
            +D GC GG+ME+A K + +LGG+    DYPY  +  +C F K  A V+V G+  + +NE 
Sbjct: 719  LDEGCNGGYMENAYKAIEKLGGLELESDYPYDGKNEKCHFFKKNAKVQVVGAVNITSNET 778

Query: 387  KIARALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIVGYGV----- 223
            K+A+ LIKNGP+S+ + A+A  QFY+ G+ +P    C  N ++++HGV IVGYG+     
Sbjct: 779  KMAQWLIKNGPISIGINANA-MQFYIGGVSHPFHFLC--NPKNLDHGVLIVGYGISKYPL 835

Query: 222  -EKSLPYWVIKNSWGDSWGDKGYIKLIRGKNACAVASYVSTA 100
              K LPYW+IKNSWG  WG+ GY ++ RG   C V +  S+A
Sbjct: 836  FHKELPYWIIKNSWGSRWGENGYYRVYRGDGTCGVNAMASSA 877


>ref|XP_003436277.1| AGAP002879-PB [Anopheles gambiae str. PEST]
            gi|333467869|gb|EGK96736.1| AGAP002879-PB [Anopheles
            gambiae str. PEST]
          Length = 1834

 Score =  229 bits (585), Expect = 8e-58
 Identities = 112/223 (50%), Positives = 155/223 (69%), Gaps = 7/223 (3%)
 Frame = -2

Query: 741  IPKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQELLDCDKVD 562
            +P+ FD+R   AVT VKNQG+CGSCWAFSA+ N+EG+   K K+L   SEQEL+DCDKVD
Sbjct: 1614 LPRSFDWRDHGAVTEVKNQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCDKVD 1673

Query: 561  LGCMGGWMEDAVKELVRLGGVAKSKDYPYTAQ-EGQCRFKKSMAAVKVSGSRGVPANEKK 385
             GC GG+M+DA K + +LGG+    DYPY A+ +  C F +S++ V+V G+  +P NE  
Sbjct: 1674 NGCGGGYMDDAFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAVDMPKNETY 1733

Query: 384  IARALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIVGYGVE----- 220
            IA+ LIKNGP+++ + A+A  QFY  GI +P    C  N + ++HGV IVGYG++     
Sbjct: 1734 IAKYLIKNGPIAIGLNANA-MQFYRGGISHPWHPLC--NHKSIDHGVLIVGYGIKEYPMF 1790

Query: 219  -KSLPYWVIKNSWGDSWGDKGYIKLIRGKNACAVASYVSTAYL 94
             K+LPYW+IKNSWG  WG++GY ++ RG N+C V+   S+A L
Sbjct: 1791 NKTLPYWIIKNSWGPRWGEQGYYRIYRGDNSCGVSEMASSAIL 1833


>ref|XP_312034.5| AGAP002879-PA [Anopheles gambiae str. PEST]
            gi|333467868|gb|EAA08025.5| AGAP002879-PA [Anopheles
            gambiae str. PEST]
          Length = 1810

 Score =  229 bits (585), Expect = 8e-58
 Identities = 112/223 (50%), Positives = 155/223 (69%), Gaps = 7/223 (3%)
 Frame = -2

Query: 741  IPKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQELLDCDKVD 562
            +P+ FD+R   AVT VKNQG+CGSCWAFSA+ N+EG+   K K+L   SEQEL+DCDKVD
Sbjct: 1590 LPRSFDWRDHGAVTEVKNQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCDKVD 1649

Query: 561  LGCMGGWMEDAVKELVRLGGVAKSKDYPYTAQ-EGQCRFKKSMAAVKVSGSRGVPANEKK 385
             GC GG+M+DA K + +LGG+    DYPY A+ +  C F +S++ V+V G+  +P NE  
Sbjct: 1650 NGCGGGYMDDAFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAVDMPKNETY 1709

Query: 384  IARALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIVGYGVE----- 220
            IA+ LIKNGP+++ + A+A  QFY  GI +P    C  N + ++HGV IVGYG++     
Sbjct: 1710 IAKYLIKNGPIAIGLNANA-MQFYRGGISHPWHPLC--NHKSIDHGVLIVGYGIKEYPMF 1766

Query: 219  -KSLPYWVIKNSWGDSWGDKGYIKLIRGKNACAVASYVSTAYL 94
             K+LPYW+IKNSWG  WG++GY ++ RG N+C V+   S+A L
Sbjct: 1767 NKTLPYWIIKNSWGPRWGEQGYYRIYRGDNSCGVSEMASSAIL 1809


>ref|XP_003436276.1| AGAP002879-PC [Anopheles gambiae str. PEST]
            gi|333467870|gb|EGK96737.1| AGAP002879-PC [Anopheles
            gambiae str. PEST]
          Length = 953

 Score =  229 bits (585), Expect = 8e-58
 Identities = 112/223 (50%), Positives = 155/223 (69%), Gaps = 7/223 (3%)
 Frame = -2

Query: 741  IPKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQELLDCDKVD 562
            +P+ FD+R   AVT VKNQG+CGSCWAFSA+ N+EG+   K K+L   SEQEL+DCDKVD
Sbjct: 733  LPRSFDWRDHGAVTEVKNQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCDKVD 792

Query: 561  LGCMGGWMEDAVKELVRLGGVAKSKDYPYTAQ-EGQCRFKKSMAAVKVSGSRGVPANEKK 385
             GC GG+M+DA K + +LGG+    DYPY A+ +  C F +S++ V+V G+  +P NE  
Sbjct: 793  NGCGGGYMDDAFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAVDMPKNETY 852

Query: 384  IARALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIVGYGVE----- 220
            IA+ LIKNGP+++ + A+A  QFY  GI +P    C  N + ++HGV IVGYG++     
Sbjct: 853  IAKYLIKNGPIAIGLNANA-MQFYRGGISHPWHPLC--NHKSIDHGVLIVGYGIKEYPMF 909

Query: 219  -KSLPYWVIKNSWGDSWGDKGYIKLIRGKNACAVASYVSTAYL 94
             K+LPYW+IKNSWG  WG++GY ++ RG N+C V+   S+A L
Sbjct: 910  NKTLPYWIIKNSWGPRWGEQGYYRIYRGDNSCGVSEMASSAIL 952


>gb|ESP02306.1| hypothetical protein LOTGIDRAFT_172186 [Lottia gigantea]
          Length = 269

 Score =  229 bits (584), Expect = 1e-57
 Identities = 112/213 (52%), Positives = 144/213 (67%)
 Frame = -2

Query: 738 PKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQELLDCDKVDL 559
           PK +D+R    VTPVKNQG+CGSCWAFS   N+EG WA KK++L+ LSEQEL+DCDKVD 
Sbjct: 57  PKAWDWRDHGVVTPVKNQGSCGSCWAFSTTGNVEGQWAIKKQKLVSLSEQELVDCDKVDQ 116

Query: 558 GCMGGWMEDAVKELVRLGGVAKSKDYPYTAQEGQCRFKKSMAAVKVSGSRGVPANEKKIA 379
           GC GG    A   +++LGG+   K+Y Y   + +C FKKS  A K++GS  +  NE  +A
Sbjct: 117 GCNGGLPSQAYDAIMKLGGLETEKEYGYKGYDEKCFFKKSDVAAKINGSVKISENEDDMA 176

Query: 378 RALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIVGYGVEKSLPYWV 199
             L KNGP+S+ + A A  QFY+ GI +P +  C+  K D  HGV IVGYGVE S PYW+
Sbjct: 177 AWLAKNGPISIGINAFA-MQFYMGGIAHPWKIFCNPKKLD--HGVLIVGYGVEGSKPYWI 233

Query: 198 IKNSWGDSWGDKGYIKLIRGKNACAVASYVSTA 100
           IKNSWG+SWG+KGY  L RG   C V +  ++A
Sbjct: 234 IKNSWGESWGEKGYYLLYRGGGVCGVNTMCTSA 266


>gb|EKC42097.1| Cathepsin F [Crassostrea gigas]
          Length = 715

 Score =  229 bits (584), Expect = 1e-57
 Identities = 109/214 (50%), Positives = 145/214 (67%)
 Frame = -2

Query: 741  IPKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQELLDCDKVD 562
            +P  FD+R+  AVT VKNQG+CGSCWAFS   NIEG WA  KK+L+ LSEQEL+DCDKVD
Sbjct: 502  LPNSFDWREHGAVTEVKNQGSCGSCWAFSTTGNIEGQWAISKKKLVSLSEQELVDCDKVD 561

Query: 561  LGCMGGWMEDAVKELVRLGGVAKSKDYPYTAQEGQCRFKKSMAAVKVSGSRGVPANEKKI 382
             GC GG    A KE++RLGG+    DY Y     +C   KS   VK++GS  + +NE ++
Sbjct: 562  EGCNGGLPSQAYKEIIRLGGLETETDYKYRGHNEKCSMDKSKIRVKINGSVSISSNETEM 621

Query: 381  ARALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIVGYGVEKSLPYW 202
            A  L+KNGP+S+ + A A  QFY+ GI +P +  C  N ++++HGV IVGYGV+ S PYW
Sbjct: 622  AAWLVKNGPISIGINAFA-MQFYMGGISHPWKIFC--NPKELDHGVLIVGYGVKGSKPYW 678

Query: 201  VIKNSWGDSWGDKGYIKLIRGKNACAVASYVSTA 100
            +IKNSWG  WG+KGY  + RG   C + +  ++A
Sbjct: 679  IIKNSWGPDWGEKGYYLVYRGAGVCGLNTMCTSA 712


>gb|AFQ01139.1| cathepsin F-like protease, partial [Chilo suppressalis]
          Length = 537

 Score =  228 bits (582), Expect = 2e-57
 Identities = 105/220 (47%), Positives = 154/220 (70%), Gaps = 6/220 (2%)
 Frame = -2

Query: 741 IPKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQELLDCDKVD 562
           +P  FD+R   AVT VK+QGACGSCWAFS   NIEG W  K  +L+ LSEQEL+DCDK+D
Sbjct: 319 LPASFDWRPLGAVTEVKDQGACGSCWAFSVTGNIEGQWKLKTGKLLSLSEQELVDCDKMD 378

Query: 561 LGCMGGWMEDAVKELVRLGGVAKSKDYPYTAQEGQCRFKKSMAAVKVSGSRGVPANEKKI 382
            GC GG+M++A + + +LGG+   ++YPY A++ +C F KS++ V++SG+  + +NE  +
Sbjct: 379 DGCDGGYMDNAYRAIEQLGGLETEEEYPYEAEDDKCSFNKSLSKVQISGAVNISSNETNM 438

Query: 381 ARALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIVGYGVE------ 220
           A+ L+ NGP+S+ + A+A  QFYV G+ +P +  C  N ++++HGV IVGYG++      
Sbjct: 439 AKWLVHNGPISIGINANA-MQFYVGGVSHPWKALC--NPKNIDHGVLIVGYGIKEYPLFN 495

Query: 219 KSLPYWVIKNSWGDSWGDKGYIKLIRGKNACAVASYVSTA 100
           K LPYWV+KNSWG  WG++GY ++ RG   C V +  S+A
Sbjct: 496 KQLPYWVVKNSWGPGWGEQGYYRVFRGDGTCGVNTMASSA 535


>ref|NP_001156454.1| cathepsin F isoform 2 precursor [Acyrthosiphon pisum]
          Length = 586

 Score =  226 bits (577), Expect = 6e-57
 Identities = 119/233 (51%), Positives = 148/233 (63%), Gaps = 8/233 (3%)
 Frame = -2

Query: 768  MVVANDKVFIPKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQ 589
            M V      IP +FD+R    VTPVKNQGACGSCWAFSAI+NIEG +A K K+L+ LSEQ
Sbjct: 356  MAVIPQSASIPNEFDWRNHNVVTPVKNQGACGSCWAFSAIANIEGQYALKSKELLSLSEQ 415

Query: 588  ELLDCDKVDLGCMGGWMEDAVKELVRLGGVAKSKDYPYT--AQEGQCRFKKSMAAVKVSG 415
            EL+DCD +D GC GG M  A + +  LGG+    DYPY   A    C+ KKS   V +S 
Sbjct: 416  ELIDCDNLDNGCGGGLMTQAFEAVENLGGLETESDYPYEGHADRKGCQLKKSDVKVSISK 475

Query: 414  SRGVPANEKKIARALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIV 235
            +  V  +E+ IA+ L+K+GPLSV V A+A  QFY+ G+ +P    CS    D  HGV IV
Sbjct: 476  AVNVSTDEEDIAKFLVKHGPLSVGVNANA-MQFYMGGVSHPIHALCSPKSLD--HGVAIV 532

Query: 234  GYGV------EKSLPYWVIKNSWGDSWGDKGYIKLIRGKNACAVASYVSTAYL 94
            GYGV       K+LPYW+IKNSWG  WG+KGY  L RG  +C V   VS+A +
Sbjct: 533  GYGVHRTKYTHKNLPYWLIKNSWGPGWGEKGYYLLYRGDGSCGVNQMVSSAII 585


>gb|EFN80374.1| Putative cysteine proteinase CG12163 [Harpegnathos saltator]
          Length = 1032

 Score =  226 bits (576), Expect = 8e-57
 Identities = 105/220 (47%), Positives = 151/220 (68%), Gaps = 6/220 (2%)
 Frame = -2

Query: 741  IPKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQELLDCDKVD 562
            +P  FD+R+K AVTPVKNQG CGSCWAFS   N+EG +A K  +L+ LSEQEL+DCD +D
Sbjct: 813  LPNSFDWRQKGAVTPVKNQGMCGSCWAFSVTGNVEGQYAIKHNKLLSLSEQELVDCDDLD 872

Query: 561  LGCMGGWMEDAVKELVRLGGVAKSKDYPYTAQEGQCRFKKSMAAVKVSGSRGVPANEKKI 382
             GC GG  ++A + + +LGG+    DYPY A+  +C FKK+MA V+V  +  + +NE +I
Sbjct: 873  EGCNGGLPDNAYRAIEKLGGLELESDYPYEAENERCHFKKNMAKVQVGSAVNITSNETQI 932

Query: 381  ARALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIVGYGV------E 220
            A+ L+ NGP+S+ + A+A  QFY+ G+ +P +  C  N ++++HGV IVGYG        
Sbjct: 933  AQWLVANGPISIGINANA-MQFYMGGVSHPFKFLC--NPKNLDHGVLIVGYGTSNYPLFH 989

Query: 219  KSLPYWVIKNSWGDSWGDKGYIKLIRGKNACAVASYVSTA 100
            K LPYW++KNSWGD WG++GY ++ RG   C + +  S+A
Sbjct: 990  KKLPYWIVKNSWGDRWGEQGYYRVYRGDGTCGLNTMASSA 1029


>emb|CDJ26737.1| cathepsin F-like cysteine peptidase protein [Tityus serrulatus]
          Length = 446

 Score =  226 bits (575), Expect = 1e-56
 Identities = 108/215 (50%), Positives = 147/215 (68%), Gaps = 6/215 (2%)
 Frame = -2

Query: 747 VFIPKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQELLDCDK 568
           + +PK FD+R    VT VKNQ  CGSCWAFS   NIEG WA KKK+L+ LSEQEL+DCDK
Sbjct: 225 ISMPKSFDWRHYNVVTEVKNQQQCGSCWAFSTTGNIEGQWALKKKKLVSLSEQELVDCDK 284

Query: 567 VDLGCMGGWMEDAVKELVRLGGVAKSKDYPYTAQEGQCRFKKSMAAVKVSGSRGVPANEK 388
           VD GC GG   +A KE++RLGG+   K+YPY A + +C+FKKS   V ++ S  +  NE 
Sbjct: 285 VDQGCNGGLPSNAYKEIIRLGGLETEKEYPYEADDEKCQFKKSDVRVYINSSVSISQNET 344

Query: 387 KIARALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIVGYGV----- 223
           ++A  L+KNGP+S+ + A+A  QFY  GI +P +  C  N ++++HGV IVGYGV     
Sbjct: 345 EMAVWLVKNGPISIGINANA-MQFYYGGISHPWKFLC--NPENLDHGVLIVGYGVHSYPL 401

Query: 222 -EKSLPYWVIKNSWGDSWGDKGYIKLIRGKNACAV 121
            + +LP+W+IKNSWG  WG++GY ++ RG   C +
Sbjct: 402 FKTTLPFWIIKNSWGADWGEQGYYRVYRGDGTCGL 436


>gb|AAT07059.1| cathepsin F-like cysteine proteinase, partial [Brugia malayi]
          Length = 461

 Score =  225 bits (574), Expect = 1e-56
 Identities = 102/216 (47%), Positives = 143/216 (66%)
 Frame = -2

Query: 741 IPKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQELLDCDKVD 562
           +P +FD+R +  VTPVK+QG+CGSCWAFS   NIE +WA K  +LI LSEQEL+DCD +D
Sbjct: 248 LPSKFDWRTEGVVTPVKDQGSCGSCWAFSVTGNIESLWAIKTGKLISLSEQELIDCDVID 307

Query: 561 LGCMGGWMEDAVKELVRLGGVAKSKDYPYTAQEGQCRFKKSMAAVKVSGSRGVPANEKKI 382
            GC GG   +A +E+ R+GG+     YPY A+ G C   ++  AV +  +  +P NE  +
Sbjct: 308 KGCNGGLPINAFREIKRMGGLEPEDQYPYEAKNGTCHLVRAQIAVSIDDAVEIPRNETVM 367

Query: 381 ARALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIVGYGVEKSLPYW 202
              + + GPLSV + A     +Y  GI +P +++C  +K  +NHGV I GYG+E +LPYW
Sbjct: 368 KAWIAQRGPLSVGIDAEL-LSYYKSGILHPSKSRCPPSK--INHGVLITGYGIENNLPYW 424

Query: 201 VIKNSWGDSWGDKGYIKLIRGKNACAVASYVSTAYL 94
            IKNSWG+ WG+ GY +L+RGKN C V+  VS+A +
Sbjct: 425 TIKNSWGEQWGENGYFQLMRGKNICGVSDLVSSAII 460


>ref|XP_003378245.1| cathepsin F [Trichinella spiralis] gi|316972864|gb|EFV56510.1|
           cathepsin F [Trichinella spiralis]
          Length = 366

 Score =  224 bits (570), Expect = 4e-56
 Identities = 109/220 (49%), Positives = 150/220 (68%), Gaps = 4/220 (1%)
 Frame = -2

Query: 741 IPKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQELLDCDKVD 562
           I ++ D+RK  AVT VK+QG CGSCWAF  ++NIEG WA K  QLI LSEQ+L+DCD++D
Sbjct: 149 ISERMDWRKFNAVTSVKDQGNCGSCWAFCTVANIEGAWAVKTAQLISLSEQQLVDCDRLD 208

Query: 561 LGCMGGWMEDAVKELVRLGGVAKSKDYPYTAQEGQCRFKKSMAAVKVSGSRGVPANEKKI 382
            GC GG   +A  E++RLGG+ K +DY YTA+ G+C+F  + +AV ++ +  +P +E  I
Sbjct: 209 DGCEGGLPVNAYLEIIRLGGLEKEEDYKYTARSGKCKFNHTKSAVYINDTVVLPEDEDAI 268

Query: 381 ARALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIVGYGVEKSL--- 211
           AR + +NGP++V + A A   FY  GI +P R  CS +   +NHGVTIVGY V++SL   
Sbjct: 269 ARYVSENGPVAVGLNADA-MMFYRSGIAHPSRLMCSPD--GINHGVTIVGYDVKESLFWS 325

Query: 210 -PYWVIKNSWGDSWGDKGYIKLIRGKNACAVASYVSTAYL 94
            PYW+IKNSWG +WG+KGY  L RGK  C +    S+  +
Sbjct: 326 TPYWIIKNSWGPNWGEKGYYYLYRGKGVCGIDQMASSVVI 365


>ref|NP_001156453.1| cathepsin F isoform 1 precursor [Acyrthosiphon pisum]
          Length = 586

 Score =  224 bits (570), Expect = 4e-56
 Identities = 118/233 (50%), Positives = 147/233 (63%), Gaps = 8/233 (3%)
 Frame = -2

Query: 768  MVVANDKVFIPKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQ 589
            M V      IP +FD+R    VTPVKNQGACGSCWAFSAI+NIEG +A K K+L+ LSEQ
Sbjct: 356  MAVIPQSASIPNEFDWRNHNVVTPVKNQGACGSCWAFSAIANIEGQYALKSKELLSLSEQ 415

Query: 588  ELLDCDKVDLGCMGGWMEDAVKELVRLGGVAKSKDYPYT--AQEGQCRFKKSMAAVKVSG 415
            EL+DCD +D GC GG M  A + +  LGG+    DYPY   A    C+ KKS   V +S 
Sbjct: 416  ELIDCDNLDNGCGGGLMTQAFEAVENLGGLETESDYPYEGHADRKGCQLKKSDVKVSISK 475

Query: 414  SRGVPANEKKIARALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIV 235
            +  V  +E+ IA+ L+K+GPLSV V A+A  QFY+ G+ +P    CS    D  HGV IV
Sbjct: 476  AVNVSTDEEDIAKFLVKHGPLSVGVNANA-MQFYMGGVSHPIHALCSPKSLD--HGVAIV 532

Query: 234  GYGVEK------SLPYWVIKNSWGDSWGDKGYIKLIRGKNACAVASYVSTAYL 94
            GYGV K      +LP+W IKNSWGD WG +GY  L RG  +C V   VS+A +
Sbjct: 533  GYGVHKYPYLNATLPFWTIKNSWGDKWGMQGYYLLYRGDGSCGVNQMVSSAII 585


>gb|EJW79799.1| cysteine protease 6 [Wuchereria bancrofti]
          Length = 242

 Score =  223 bits (569), Expect = 5e-56
 Identities = 102/216 (47%), Positives = 140/216 (64%)
 Frame = -2

Query: 741 IPKQFDYRKKKAVTPVKNQGACGSCWAFSAISNIEGVWAKKKKQLIDLSEQELLDCDKVD 562
           +P +FD+  K  VTPVKNQG+CGSCWAFS   NIE +WA K   LI LSEQEL+DCD +D
Sbjct: 29  LPNKFDWNTKGVVTPVKNQGSCGSCWAFSVTGNIESLWAIKTGNLISLSEQELIDCDVID 88

Query: 561 LGCMGGWMEDAVKELVRLGGVAKSKDYPYTAQEGQCRFKKSMAAVKVSGSRGVPANEKKI 382
            GC GG   +A +E+ R+GG+     YPY A+ G C   ++  AV +  +  +P NE  +
Sbjct: 89  NGCNGGLPINAFREIKRMGGLEPEDQYPYKAKNGTCHLVRAQIAVTIDDAIEIPRNETVM 148

Query: 381 ARALIKNGPLSVAVVASANFQFYVKGIFNPDRNQCSTNKQDVNHGVTIVGYGVEKSLPYW 202
              + + GPLSV + A     +Y  GI +P +++C  +K  +NHGV I GYG+E  LPYW
Sbjct: 149 KAWIAQRGPLSVGIDAEL-LAYYKSGILHPSKSRCPPSK--INHGVLITGYGIENGLPYW 205

Query: 201 VIKNSWGDSWGDKGYIKLIRGKNACAVASYVSTAYL 94
            IKNSWG+ WG+ GY +L+RGK+ C V+  VS+A +
Sbjct: 206 TIKNSWGEEWGENGYFRLMRGKDICGVSDLVSSAII 241


Top