BLASTX nr result

ID: Mentha23_contig00017447 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00017447
         (989 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006360817.1| PREDICTED: pro-cathepsin H-like [Solanum tub...    79   3e-12
ref|XP_006347646.1| PREDICTED: uncharacterized protein LOC102578...    77   1e-11
ref|NP_509408.1| Protein R09F10.1 [Caenorhabditis elegans] gi|35...    74   9e-11
ref|XP_002645498.1| Hypothetical protein CBG22748 [Caenorhabditi...    74   9e-11
gb|EGT32164.1| hypothetical protein CAEBREN_11106 [Caenorhabditi...    73   2e-10
ref|XP_003100293.1| hypothetical protein CRE_21852 [Caenorhabdit...    73   2e-10
ref|NP_001163996.1| cathepsin L-like proteinase precursor [Tribo...    69   2e-09
ref|NP_001164314.1| cathepsin L precursor [Tribolium castaneum] ...    68   5e-09
emb|CDJ81168.1| Proteinase inhibitor I29 and Peptidase C1A domai...    65   4e-08
ref|XP_003601499.1| Cathepsin B [Medicago truncatula] gi|3554905...    65   6e-08
ref|XP_006396923.1| hypothetical protein EUTSA_v10028733mg [Eutr...    64   1e-07
gb|AGV15822.1| cysteine protease CP14 [Nicotiana tabacum]              64   1e-07
gb|AGB07568.1| cathepsin b-like cysteine protease 11, partial [A...    62   4e-07
gb|ABC88769.1| putative cathepsin L-like proteinase [Tenebrio mo...    62   5e-07
gb|AAP94048.2| cathepsin-L-like midgut cysteine proteinase [Tene...    62   5e-07
gb|EYC39727.1| hypothetical protein Y032_0643g1064 [Ancylostoma ...    61   8e-07
gb|EYC39726.1| hypothetical protein Y032_0643g1064 [Ancylostoma ...    61   8e-07
gb|EYC39725.1| hypothetical protein Y032_0643g1064 [Ancylostoma ...    61   8e-07
gb|ETN70308.1| papain family cysteine protease [Necator americanus]    61   8e-07
gb|ETN61493.1| cathepsin b [Anopheles darlingi]                        61   8e-07

>ref|XP_006360817.1| PREDICTED: pro-cathepsin H-like [Solanum tuberosum]
          Length = 346

 Score = 79.0 bits (193), Expect = 3e-12
 Identities = 52/163 (31%), Positives = 78/163 (47%), Gaps = 6/163 (3%)
 Frame = -2

Query: 568 PYSYRTVFKYATEIGISPVELYPWNGKCKFREWDGSVWDRVVECKGKVIEPSKKIFIDGK 389
           P  Y   F+YA E G+ P + YP+  +       G   +   E K K I+  KK+   G 
Sbjct: 190 PSHYNNYFQYAIEKGVYPDKPYPYLAE------RGECLELPNEEKTK-IKAYKKVNDLG- 241

Query: 388 KILQAREIDDYLRHQPLTGQIKVTEELNAWKGDGIYRGG------DTTCYMGSDVGKHAV 227
             L  + I++ ++ QP+ G +K+ +     KG  IY G                 G+HAV
Sbjct: 242 --LDKKSIEELIQKQPICGSVKLAKNFQKHKGKDIYMGQTKEEIYSEASKNNQSRGRHAV 299

Query: 226 TIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVARHLITNKYIP 98
            I+GFG E    Y++ +NS+G  WGY GY +V R L+T+   P
Sbjct: 300 LIIGFGIENGIEYYLIKNSWGVNWGYLGYARVERRLVTSLSFP 342


>ref|XP_006347646.1| PREDICTED: uncharacterized protein LOC102578529 [Solanum tuberosum]
          Length = 893

 Score = 77.0 bits (188), Expect = 1e-11
 Identities = 58/204 (28%), Positives = 96/204 (47%), Gaps = 14/204 (6%)
 Frame = -2

Query: 667  IPGSPQEVVDYGSLFIRPKDAPASGPYELNERVPYSYRTVFKYATEIGISPVELYPW--- 497
            +P S Q+++D   ++ + K           E  P SY   +K+A + GI+    YP+   
Sbjct: 694  VPLSKQQLID--CMYTKYKKPSYFADLGEKECFPCSYNKAYKFAMDYGITVETKYPFMEE 751

Query: 496  NGKCKFREWDGSVWDRVVECKG--KVIEPSKKIFIDGKKILQAREI-DDYLRHQPLTGQI 326
             GKC+ +        R+++  G  +V E  K++     + L  +EI +  +R QP+T   
Sbjct: 752  RGKCECQSEM-----RIIKINGFQRVSELIKELEEKAIEKLDEKEIIEKLIRQQPITCAA 806

Query: 325  KVTEELNAWKGDGIYRGGDTTCYM--------GSDVGKHAVTIVGFGGEGKDAYFVCQNS 170
                 L   +G G+Y G               G  VGKHA+ IVG+G E    +++ +NS
Sbjct: 807  LHVPSLQLHRGKGVYMGPTENEIAQVRQKETEGQVVGKHAMLIVGYGEEEGVEFYLVKNS 866

Query: 169  YGTGWGYRGYFKVARHLITNKYIP 98
            +GT WGY+GY K+ R  ++    P
Sbjct: 867  WGTEWGYQGYAKIKRSALSKLSYP 890


>ref|NP_509408.1| Protein R09F10.1 [Caenorhabditis elegans]
           gi|351061560|emb|CCD69414.1| Protein R09F10.1
           [Caenorhabditis elegans]
          Length = 383

 Score = 73.9 bits (180), Expect = 9e-11
 Identities = 59/222 (26%), Positives = 104/222 (46%), Gaps = 7/222 (3%)
 Frame = -2

Query: 769 RLMEVRNQEKTNACXXXXXXXXXFGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGP 590
           +L  ++NQ +  +C               + G  +  S QE+VD        ++   SG 
Sbjct: 179 KLTPIKNQGQCGSCWAFATVASVEAQNAIKKGKLVSLSEQEMVDCDG-----RNNGCSGG 233

Query: 589 YELNERVPYSYRTVFKYATEIGISPVELYPWNG----KCKFREWDGSVWDRVVECKGKVI 422
           Y      PY+     K+  E G+   + YP++     +C  +E D               
Sbjct: 234 YR-----PYA----MKFVKENGLESEKEYPYSALKHDQCFLKEND--------------- 269

Query: 421 EPSKKIFIDGKKILQAREID--DYLRHQ-PLTGQIKVTEELNAWKGDGIYRGGDTTCYMG 251
               ++FID  ++L   E D  +++  + P+T  + V + + +++  GI+      C   
Sbjct: 270 ---TRVFIDDFRMLSNNEEDIANWVGTKGPVTFGMNVVKAMYSYRS-GIFNPSVEDCTEK 325

Query: 250 SDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 125
           S +G HA+TI+G+GGEG+ AY++ +NS+GT WG  GYF++AR
Sbjct: 326 S-MGAHALTIIGYGGEGESAYWIVKNSWGTSWGASGYFRLAR 366


>ref|XP_002645498.1| Hypothetical protein CBG22748 [Caenorhabditis briggsae]
           gi|187021579|emb|CAP39268.1| Protein CBG22748
           [Caenorhabditis briggsae]
          Length = 379

 Score = 73.9 bits (180), Expect = 9e-11
 Identities = 60/222 (27%), Positives = 102/222 (45%), Gaps = 7/222 (3%)
 Frame = -2

Query: 769 RLMEVRNQEKTNACXXXXXXXXXFGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGP 590
           +L  ++NQ +  +C               + G  +  S QE+VD        ++   SG 
Sbjct: 175 KLTPIKNQGQCGSCWAFATVAAIEAQHAIKKGILVSLSEQEMVDCDG-----RNNGCSGG 229

Query: 589 YELNERVPYSYRTVFKYATEIGISPVELYPWNG----KCKFREWDGSVWDRVVECKGKVI 422
           Y      PY+ R    +  E G+   + YP++     +C   + D               
Sbjct: 230 YR-----PYAMR----FVKENGLETEKSYPYSALKHDQCMLHQND--------------- 265

Query: 421 EPSKKIFIDGKKILQARE--IDDYLRHQ-PLTGQIKVTEELNAWKGDGIYRGGDTTCYMG 251
               K++ID  ++L   E  I D++  + P+T  + V + + +++  GI+      C   
Sbjct: 266 ---TKVYIDDYRMLSTSEENIADWVGTKGPVTFGMNVVKAMYSYRS-GIFNPSAEDCAEK 321

Query: 250 SDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 125
           S +G HA+TIVG+GGEG  AY++ +NS+GT WG  GYF++AR
Sbjct: 322 S-MGAHALTIVGYGGEGTSAYWIVKNSWGTSWGSDGYFRLAR 362


>gb|EGT32164.1| hypothetical protein CAEBREN_11106 [Caenorhabditis brenneri]
          Length = 389

 Score = 73.2 bits (178), Expect = 2e-10
 Identities = 58/222 (26%), Positives = 103/222 (46%), Gaps = 7/222 (3%)
 Frame = -2

Query: 769 RLMEVRNQEKTNACXXXXXXXXXFGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGP 590
           +L  ++NQ +  +C               + G  +  S QE+VD        ++   SG 
Sbjct: 185 KLTPIKNQGQCGSCWAFATVAAVEAQHAIKKGQLVSLSEQEMVDCDG-----RNNGCSGG 239

Query: 589 YELNERVPYSYRTVFKYATEIGISPVELYPWNG----KCKFREWDGSVWDRVVECKGKVI 422
           Y      PY+ R    +  E G+   + YP++     +C  ++ D               
Sbjct: 240 YR-----PYAMR----FVKENGLESEKEYPYSALKHDQCFLKQND--------------- 275

Query: 421 EPSKKIFIDGKKILQAREID--DYLRHQ-PLTGQIKVTEELNAWKGDGIYRGGDTTCYMG 251
               ++FID  ++L   E D  +++  + P+T  + V + + +++  GI+      C   
Sbjct: 276 ---TRVFIDDFRMLSTNEEDIANWVGTKGPVTFGMNVVKAMYSYRS-GIFNPSSEDCAEK 331

Query: 250 SDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 125
           S +G HA+TIVG+GGEG  A+++ +NS+GT WG  GYF++AR
Sbjct: 332 S-MGAHALTIVGYGGEGSSAFWIVKNSWGTSWGSSGYFRLAR 372


>ref|XP_003100293.1| hypothetical protein CRE_21852 [Caenorhabditis remanei]
           gi|308265817|gb|EFP09770.1| hypothetical protein
           CRE_21852 [Caenorhabditis remanei]
          Length = 391

 Score = 73.2 bits (178), Expect = 2e-10
 Identities = 59/222 (26%), Positives = 104/222 (46%), Gaps = 7/222 (3%)
 Frame = -2

Query: 769 RLMEVRNQEKTNACXXXXXXXXXFGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGP 590
           +L  ++NQ +  +C               +    +  S QE+VD        K+   SG 
Sbjct: 187 KLTPIKNQGQCGSCWAFATVAAVEAQHAIRKNQLVSLSEQEMVDCDD-----KNNGCSGG 241

Query: 589 YELNERVPYSYRTVFKYATEIGISPVELYPWNG----KCKFREWDGSVWDRVVECKGKVI 422
           Y      PY+ R    +  E G+   + YP++     +C  ++ D               
Sbjct: 242 YR-----PYAMR----FVKENGLESEKEYPYSALKHDQCMLKQND--------------- 277

Query: 421 EPSKKIFIDGKKILQARE--IDDYLRHQ-PLTGQIKVTEELNAWKGDGIYRGGDTTCYMG 251
               ++FID  ++L   E  I +++  + P+T  + VT+ + +++  GI+      C   
Sbjct: 278 ---TRVFIDDFRMLSQNEEEIANWVGTKGPVTFGMSVTKAMYSYRS-GIFNPSADDCAEK 333

Query: 250 SDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 125
           S +G HA+TIVG+GGEG+ A+++ +NS+GT WG  GYF++AR
Sbjct: 334 S-MGSHALTIVGYGGEGEAAFWIVKNSWGTSWGASGYFRLAR 374


>ref|NP_001163996.1| cathepsin L-like proteinase precursor [Tribolium castaneum]
           gi|281427798|ref|NP_001164001.1| cathepsin L-like
           proteinase precursor [Tribolium castaneum]
           gi|270001241|gb|EEZ97688.1| cathepsin L precursor
           [Tribolium castaneum] gi|270016928|gb|EFA13374.1|
           hypothetical protein TcasGA2_TC001950 [Tribolium
           castaneum]
          Length = 328

 Score = 69.3 bits (168), Expect = 2e-09
 Identities = 70/252 (27%), Positives = 111/252 (44%), Gaps = 6/252 (2%)
 Frame = -2

Query: 847 KLKDSERNTLPIFASAK--ACLVTDHPERLMEVRNQEKTNACXXXXXXXXXFGAQVRQNG 674
           K K +E+  +P   S K  A  V    + + EV++Q +  +C          G Q+  +G
Sbjct: 97  KPKMNEKLRIPFVKSGKPAAAEVDWRSKAVTEVKDQGQCGSCWSFSTTGAVEG-QLAISG 155

Query: 673 HAIPG-SPQEVVDYGSLFIRPKDAPASGPYELNERVPYSYRTVFKYATEIGISPVELYPW 497
             +   S Q +VD  S +    +A  +G +           + F Y  + GI     YP+
Sbjct: 156 KGLTSLSEQNLVDCSSQY---GNAGCNGGW---------MDSAFDYIHDNGIMSESAYPY 203

Query: 496 ---NGKCKFREWDGSVWDRVVECKGKVIEPSKKIFIDGKKILQAREIDDYLRHQPLTGQI 326
              +G C+F   D S    V   +G    PS       +  LQ    D    + P+   +
Sbjct: 204 TAMDGNCRF---DAS--QSVTSLQGYYDIPS-----GDESALQ----DAVANNGPVAVAL 249

Query: 325 KVTEELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYR 146
             TEEL  + G  +Y   DTTC   +    H V +VG+G EG   Y++ +NS+G+GWG +
Sbjct: 250 DATEELQLYSGGVLY---DTTC--SAQALNHGVLVVGYGSEGGQDYWIVKNSWGSGWGEQ 304

Query: 145 GYFKVARHLITN 110
           GY++ AR+   N
Sbjct: 305 GYWRQARNRNNN 316


>ref|NP_001164314.1| cathepsin L precursor [Tribolium castaneum]
           gi|270001247|gb|EEZ97694.1| cathepsin L precursor
           [Tribolium castaneum]
          Length = 328

 Score = 68.2 bits (165), Expect = 5e-09
 Identities = 68/252 (26%), Positives = 107/252 (42%), Gaps = 6/252 (2%)
 Frame = -2

Query: 847 KLKDSERNTLPIFASAK--ACLVTDHPERLMEVRNQEKTNACXXXXXXXXXFGAQVRQNG 674
           K K +E+  LP   S K  A  V      + EV+NQ +  +C          G Q+  +G
Sbjct: 97  KPKKNEKLRLPFVQSDKPAAAEVDWRNSAVSEVKNQGQCGSCWSFSTTGAVEG-QLAISG 155

Query: 673 HAIPG-SPQEVVDYGSLFIRPKDAPASGPYELNERVPYSYRTVFKYATEIGISPVELYPW 497
             +   S Q +VD  S +    +A  +G +           + F Y  + GI     YP+
Sbjct: 156 RGLTSLSEQNLVDCSSAY---GNAGCNGGW---------MDSAFDYIHDNGIMSESAYPY 203

Query: 496 N---GKCKFREWDGSVWDRVVECKGKVIEPSKKIFIDGKKILQAREIDDYLRHQPLTGQI 326
               G C+F   +      V   +G    PS     D   +  A        + P+   +
Sbjct: 204 TASEGSCRFNPSES-----VTSLQGYYDLPSG----DENALKSA-----VANNGPIAVAL 249

Query: 325 KVTEELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYR 146
             T+EL  + G  +Y   DTTC   +    H V +VG+G EG   Y++ +NS+G+GWG +
Sbjct: 250 DATDELQFYSGGVLY---DTTC--SAQALNHGVLVVGYGSEGGQDYWIVKNSWGSGWGEQ 304

Query: 145 GYFKVARHLITN 110
           GY++ AR+   N
Sbjct: 305 GYWRQARNRNNN 316


>emb|CDJ81168.1| Proteinase inhibitor I29 and Peptidase C1A domain containing
           protein [Haemonchus contortus]
          Length = 390

 Score = 65.1 bits (157), Expect = 4e-08
 Identities = 57/218 (26%), Positives = 100/218 (45%), Gaps = 3/218 (1%)
 Frame = -2

Query: 769 RLMEVRNQEKTNACXXXXXXXXXFGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGP 590
           +L  V++Q +  +C           A   + G     S QE+VD  +     ++    G 
Sbjct: 185 KLTPVKDQGQCGSCWAFATVASIEAANAIKTGQLTRLSEQEMVDCDT-----QNNGCQGG 239

Query: 589 YELNERVPYSYRTVFKYATEIGISPVELYPWNGKCKFREWDGSVWDRVVECKGKVIEPSK 410
           Y      PY+      +  + G+   E YP++G  +              C  K    S+
Sbjct: 240 YR-----PYA----MSFVQQNGLMKEEKYPYSGTDQNT------------CLLK--RDSE 276

Query: 409 KIFIDGKKILQARE--IDDYLR-HQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDVG 239
           ++FI   ++L + E  I D++  + P+T  + VT+ + +++  GI+      C   S +G
Sbjct: 277 RVFIQSYRMLSSNEEVIADWIAANGPVTFGMNVTKSMYSYRS-GIFAPTQEDCEQHS-LG 334

Query: 238 KHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 125
            HA+T VG+G E    Y++ +NS+G+ WG  GYFK+AR
Sbjct: 335 SHALTFVGYGTENGQPYWLVKNSWGSRWGQDGYFKLAR 372


>ref|XP_003601499.1| Cathepsin B [Medicago truncatula] gi|355490547|gb|AES71750.1|
           Cathepsin B [Medicago truncatula]
          Length = 232

 Score = 64.7 bits (156), Expect = 6e-08
 Identities = 35/94 (37%), Positives = 56/94 (59%), Gaps = 9/94 (9%)
 Frame = -2

Query: 388 KILQAREIDDYLRHQ-PLTGQIKVTEELNAWKGDGIYRG-GDTTCYM---GSDVGKHAVT 224
           K L  R++ ++LR + P+  ++K  +E+  +KGDGIY G  D   ++    + VG HA+ 
Sbjct: 112 KWLPFRKMKEHLRDEGPIAVEVKWIKEMGDYKGDGIYNGPADANAFVKTVNNHVGDHALL 171

Query: 223 IVGFGGEGKDA----YFVCQNSYGTGWGYRGYFK 134
           ++GFG E  +     Y++ QNS+G GWG  GY K
Sbjct: 172 VIGFGSERIEGELVHYWIVQNSHGEGWGKEGYAK 205


>ref|XP_006396923.1| hypothetical protein EUTSA_v10028733mg [Eutrema salsugineum]
           gi|557097940|gb|ESQ38376.1| hypothetical protein
           EUTSA_v10028733mg [Eutrema salsugineum]
          Length = 379

 Score = 63.5 bits (153), Expect = 1e-07
 Identities = 37/111 (33%), Positives = 62/111 (55%), Gaps = 3/111 (2%)
 Frame = -2

Query: 439 CKGKVIEPSKKIFIDGKKILQARE---IDDYLRHQPLTGQIKVTEELNAWKGDGIYRGGD 269
           C G++ E +KK+ IDG + L A +   +   + HQP+T  I  +         G++ G  
Sbjct: 247 CDGRLKENNKKVMIDGYENLPANDEFALMKAVAHQPVTAVIDSSSRDFQLYESGVFDG-- 304

Query: 268 TTCYMGSDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVARHLI 116
            TC  G+++  H V +VG+G E    Y++ +NS+G  WG  GY K+AR+++
Sbjct: 305 -TC--GTNLN-HGVVVVGYGTENGHDYWIVRNSWGNTWGEAGYMKMARNIV 351


>gb|AGV15822.1| cysteine protease CP14 [Nicotiana tabacum]
          Length = 505

 Score = 63.5 bits (153), Expect = 1e-07
 Identities = 61/228 (26%), Positives = 92/228 (40%), Gaps = 7/228 (3%)
 Frame = -2

Query: 760 EVRNQEKTNACXXXXXXXXXFGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGPYEL 581
           EV+NQ++  AC          G      G  I  S QE+++  + +    D     P   
Sbjct: 160 EVKNQDQCGACWAFSACGAMEGINAIATGELISLSEQELINCDNSYNTGCDGGLMDP--- 216

Query: 580 NERVPYSYRTVFKYA-TEIGISPVELYPW---NGKCKFREWDGSVWDRVVECKGKVIEPS 413
                      F++     GI+    YP+    G+C + +         V  K  +I+  
Sbjct: 217 ----------AFEWVMNNSGINSEADYPYTASQGRCNYDK---------VNHKVVIIDGY 257

Query: 412 KKIFIDGKKILQAREIDDYLRHQPLTGQIKVTEELNAWKGD-GIYRGG--DTTCYMGSDV 242
           + +  D   +L A             GQ  V+  ++    D  +YRGG  D  C    D 
Sbjct: 258 QDVPEDENALLCA------------VGQQPVSVGIDGSSLDFQLYRGGIYDGECSSNPDD 305

Query: 241 GKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVARHLITNKYIP 98
             HAV IVG+G EG D Y++ +NS+GT WG  GY  + R    N Y+P
Sbjct: 306 LSHAVVIVGYGSEGDDDYWIIKNSWGTSWGMEGYAYIRR----NTYLP 349


>gb|AGB07568.1| cathepsin b-like cysteine protease 11, partial [Ancylostoma
           duodenale]
          Length = 250

 Score = 62.0 bits (149), Expect = 4e-07
 Identities = 39/142 (27%), Positives = 62/142 (43%), Gaps = 11/142 (7%)
 Frame = -2

Query: 517 PVELYP--------WNGKCKFREWDGSVWDRVVECKGKVIEPSKKIFIDGKKILQARE-- 368
           P  LYP        + G C  + WD  V     + K  +     KI+ +   I+   +  
Sbjct: 101 PYPLYPCGRHQNQTYYGPCSEKLWDTPVCRSACQFKYPIPYRQDKIYGNSTYIIPKNQTI 160

Query: 367 -IDDYLRHQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVGFGGEGKDA 191
            + + + H P+    KV E+         Y+GG      G   G HAV ++G+G E    
Sbjct: 161 IMTEIMTHGPVVATYKVYEDF------AYYKGGVYVHTAGEQKGAHAVRVIGWGEENSLP 214

Query: 190 YFVCQNSYGTGWGYRGYFKVAR 125
           Y++  NS+ T WG +GYF++ R
Sbjct: 215 YWLVANSWNTDWGEKGYFRILR 236


>gb|ABC88769.1| putative cathepsin L-like proteinase [Tenebrio molitor]
          Length = 328

 Score = 61.6 bits (148), Expect = 5e-07
 Identities = 63/248 (25%), Positives = 97/248 (39%), Gaps = 2/248 (0%)
 Frame = -2

Query: 847 KLKDSERNTLPIFASAK--ACLVTDHPERLMEVRNQEKTNACXXXXXXXXXFGAQVRQNG 674
           K K  E   +P  +S K  A  V      + EV++Q +  +C          G    Q G
Sbjct: 97  KPKHPENLRMPYVSSKKPLAASVDWRSNAVSEVKDQGQCGSCWSFSTTGAVEGQLALQRG 156

Query: 673 HAIPGSPQEVVDYGSLFIRPKDAPASGPYELNERVPYSYRTVFKYATEIGISPVELYPWN 494
                S Q ++D  S +    +A   G +           + F Y  + GI     YP+ 
Sbjct: 157 RLTSLSEQNLIDCSSSY---GNAGCDGGW---------MDSAFSYIHDYGIMSESAYPYE 204

Query: 493 GKCKFREWDGSVWDRVVECKGKVIEPSKKIFIDGKKILQAREIDDYLRHQPLTGQIKVTE 314
            +  +  +D S    V    G    PS         + QA          P+   I  T+
Sbjct: 205 AQGDYCRFDSS--QSVTTLSGYYDLPSGDENSLADAVGQAG---------PVAVAIDATD 253

Query: 313 ELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFK 134
           EL  + G   Y   D TC   SD+  H V +VG+G +    Y++ +NS+G+GWG  GY++
Sbjct: 254 ELQFYSGGLFY---DQTCNQ-SDLN-HGVLVVGYGSDNGQDYWILKNSWGSGWGESGYWR 308

Query: 133 VARHLITN 110
             R+   N
Sbjct: 309 QVRNYGNN 316


>gb|AAP94048.2| cathepsin-L-like midgut cysteine proteinase [Tenebrio molitor]
          Length = 330

 Score = 61.6 bits (148), Expect = 5e-07
 Identities = 63/248 (25%), Positives = 97/248 (39%), Gaps = 2/248 (0%)
 Frame = -2

Query: 847 KLKDSERNTLPIFASAK--ACLVTDHPERLMEVRNQEKTNACXXXXXXXXXFGAQVRQNG 674
           K K  E   +P  +S K  A  V      + EV++Q +  +C          G    Q G
Sbjct: 99  KPKHPENLRMPYVSSKKPLAASVDWRSNAVSEVKDQGQCGSCWSFSTTGAVEGQLALQRG 158

Query: 673 HAIPGSPQEVVDYGSLFIRPKDAPASGPYELNERVPYSYRTVFKYATEIGISPVELYPWN 494
                S Q ++D  S +    +A   G +           + F Y  + GI     YP+ 
Sbjct: 159 RLTSLSEQNLIDCSSSY---GNAGCDGGW---------MDSAFSYIHDYGIMSESAYPYE 206

Query: 493 GKCKFREWDGSVWDRVVECKGKVIEPSKKIFIDGKKILQAREIDDYLRHQPLTGQIKVTE 314
            +  +  +D S    V    G    PS         + QA          P+   I  T+
Sbjct: 207 AQGDYCRFDSS--QSVTTLSGYYDLPSGDENSLADAVGQAG---------PVAVAIDATD 255

Query: 313 ELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFK 134
           EL  + G   Y   D TC   SD+  H V +VG+G +    Y++ +NS+G+GWG  GY++
Sbjct: 256 ELQFYSGGLFY---DQTCNQ-SDLN-HGVLVVGYGSDNGQDYWILKNSWGSGWGESGYWR 310

Query: 133 VARHLITN 110
             R+   N
Sbjct: 311 QVRNYGNN 318


>gb|EYC39727.1| hypothetical protein Y032_0643g1064 [Ancylostoma ceylanicum]
          Length = 510

 Score = 60.8 bits (146), Expect = 8e-07
 Identities = 40/150 (26%), Positives = 68/150 (45%), Gaps = 11/150 (7%)
 Frame = -2

Query: 541 YATEIGISPVELYP--------WNGKCKFREWDGSVWDRVVECKGKVIEPSKKIFIDGKK 386
           Y  +    P  LYP        + G C  + W+  V     + K  +     KI+ +   
Sbjct: 349 YREKNACKPYPLYPCGHHQNQTFYGPCPEKLWNTPVCRSACQRKYPIPYRKDKIYGNSTY 408

Query: 385 ILQARE---IDDYLRHQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVG 215
           I+   +   + + + H P+    K+ E+ + +KG GIY         G + G HAV ++G
Sbjct: 409 IIPMNQTIIMTEIMTHGPVVATYKIYEDFSYYKG-GIY-----VHTAGEEKGAHAVRVIG 462

Query: 214 FGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 125
           +G E    Y++  NS+ T WG +GYF++ R
Sbjct: 463 WGEEKSIPYWLVANSWNTDWGEKGYFRILR 492


>gb|EYC39726.1| hypothetical protein Y032_0643g1064 [Ancylostoma ceylanicum]
          Length = 521

 Score = 60.8 bits (146), Expect = 8e-07
 Identities = 40/150 (26%), Positives = 68/150 (45%), Gaps = 11/150 (7%)
 Frame = -2

Query: 541 YATEIGISPVELYP--------WNGKCKFREWDGSVWDRVVECKGKVIEPSKKIFIDGKK 386
           Y  +    P  LYP        + G C  + W+  V     + K  +     KI+ +   
Sbjct: 360 YREKNACKPYPLYPCGHHQNQTFYGPCPEKLWNTPVCRSACQRKYPIPYRKDKIYGNSTY 419

Query: 385 ILQARE---IDDYLRHQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVG 215
           I+   +   + + + H P+    K+ E+ + +KG GIY         G + G HAV ++G
Sbjct: 420 IIPMNQTIIMTEIMTHGPVVATYKIYEDFSYYKG-GIY-----VHTAGEEKGAHAVRVIG 473

Query: 214 FGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 125
           +G E    Y++  NS+ T WG +GYF++ R
Sbjct: 474 WGEEKSIPYWLVANSWNTDWGEKGYFRILR 503


>gb|EYC39725.1| hypothetical protein Y032_0643g1064 [Ancylostoma ceylanicum]
          Length = 529

 Score = 60.8 bits (146), Expect = 8e-07
 Identities = 40/150 (26%), Positives = 68/150 (45%), Gaps = 11/150 (7%)
 Frame = -2

Query: 541 YATEIGISPVELYP--------WNGKCKFREWDGSVWDRVVECKGKVIEPSKKIFIDGKK 386
           Y  +    P  LYP        + G C  + W+  V     + K  +     KI+ +   
Sbjct: 368 YREKNACKPYPLYPCGHHQNQTFYGPCPEKLWNTPVCRSACQRKYPIPYRKDKIYGNSTY 427

Query: 385 ILQARE---IDDYLRHQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVG 215
           I+   +   + + + H P+    K+ E+ + +KG GIY         G + G HAV ++G
Sbjct: 428 IIPMNQTIIMTEIMTHGPVVATYKIYEDFSYYKG-GIY-----VHTAGEEKGAHAVRVIG 481

Query: 214 FGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 125
           +G E    Y++  NS+ T WG +GYF++ R
Sbjct: 482 WGEEKSIPYWLVANSWNTDWGEKGYFRILR 511


>gb|ETN70308.1| papain family cysteine protease [Necator americanus]
          Length = 414

 Score = 60.8 bits (146), Expect = 8e-07
 Identities = 34/99 (34%), Positives = 60/99 (60%), Gaps = 3/99 (3%)
 Frame = -2

Query: 412 KKIFIDGKKILQARE--IDDYLR-HQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDV 242
           ++++I   + L + E  + D++  + P+T  + VT+ L +++  GI+      C   S +
Sbjct: 302 ERVYIRSYRTLSSNEDAVADWIAANGPVTFGMNVTKSLYSYRS-GIFSPSKEDCEEHS-L 359

Query: 241 GKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 125
           G HA+T VG+G EG   Y++ +NS+G+ WG  GYFK+AR
Sbjct: 360 GSHALTFVGYGTEGGQPYWLVKNSWGSRWGQNGYFKMAR 398


>gb|ETN61493.1| cathepsin b [Anopheles darlingi]
          Length = 339

 Score = 60.8 bits (146), Expect = 8e-07
 Identities = 32/79 (40%), Positives = 45/79 (56%)
 Frame = -2

Query: 361 DYLRHQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVGFGGEGKDAYFV 182
           + + + P+ G   V E++  +K  G+YR        G  VGKHAV I+G+G EG   Y++
Sbjct: 249 EIMTNGPVEGGFDVYEDVFLYKS-GVYRH-----VYGEHVGKHAVRIIGWGREGGIPYWL 302

Query: 181 CQNSYGTGWGYRGYFKVAR 125
             NSYG  WG  GYFK+ R
Sbjct: 303 ISNSYGEDWGDHGYFKIVR 321


Top