BLASTX nr result
ID: Mentha23_contig00017447
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00017447
         (989 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_006360817.1| PREDICTED: pro-cathepsin H-like [Solanum tub...   79   3e-12
ref|XP_006347646.1| PREDICTED: uncharacterized protein LOC102578...   77   1e-11
ref|NP_509408.1| Protein R09F10.1 [Caenorhabditis elegans] gi|35...   74   9e-11
ref|XP_002645498.1| Hypothetical protein CBG22748 [Caenorhabditi...   74   9e-11
gb|EGT32164.1| hypothetical protein CAEBREN_11106 [Caenorhabditi...   73   2e-10
ref|XP_003100293.1| hypothetical protein CRE_21852 [Caenorhabdit...   73   2e-10
ref|NP_001163996.1| cathepsin L-like proteinase precursor [Tribo...   69   2e-09
ref|NP_001164314.1| cathepsin L precursor [Tribolium castaneum] ...   68   5e-09
emb|CDJ81168.1| Proteinase inhibitor I29 and Peptidase C1A domai...   65   4e-08
ref|XP_003601499.1| Cathepsin B [Medicago truncatula] gi|3554905...   65   6e-08
ref|XP_006396923.1| hypothetical protein EUTSA_v10028733mg [Eutr...   64   1e-07
gb|AGV15822.1| cysteine protease CP14 [Nicotiana tabacum]             64   1e-07
gb|AGB07568.1| cathepsin b-like cysteine protease 11, partial [A...   62   4e-07
gb|ABC88769.1| putative cathepsin L-like proteinase [Tenebrio mo...   62   5e-07
gb|AAP94048.2| cathepsin-L-like midgut cysteine proteinase [Tene...   62   5e-07
gb|EYC39727.1| hypothetical protein Y032_0643g1064 [Ancylostoma ...   61   8e-07
gb|EYC39726.1| hypothetical protein Y032_0643g1064 [Ancylostoma ...   61   8e-07
gb|EYC39725.1| hypothetical protein Y032_0643g1064 [Ancylostoma ...
61 8e-07 gb|ETN70308.1| papain family cysteine protease [Necator americanus] 61 8e-07 gb|ETN61493.1| cathepsin b [Anopheles darlingi] 61 8e-07 >ref|XP_006360817.1| PREDICTED: pro-cathepsin H-like [Solanum tuberosum] Length = 346 Score = 79.0 bits (193), Expect = 3e-12 Identities = 52/163 (31%), Positives = 78/163 (47%), Gaps = 6/163 (3%) Frame = -2 Query: 568 PYSYRTVFKYATEIGISPVELYPWNGKCKFREWDGSVWDRVVECKGKVIEPSKKIFIDGK 389 P Y F+YA E G+ P + YP+ + G + E K K I+ KK+ G Sbjct: 190 PSHYNNYFQYAIEKGVYPDKPYPYLAE------RGECLELPNEEKTK-IKAYKKVNDLG- 241 Query: 388 KILQAREIDDYLRHQPLTGQIKVTEELNAWKGDGIYRGG------DTTCYMGSDVGKHAV 227 L + I++ ++ QP+ G +K+ + KG IY G G+HAV Sbjct: 242 --LDKKSIEELIQKQPICGSVKLAKNFQKHKGKDIYMGQTKEEIYSEASKNNQSRGRHAV 299 Query: 226 TIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVARHLITNKYIP 98 I+GFG E Y++ +NS+G WGY GY +V R L+T+ P Sbjct: 300 LIIGFGIENGIEYYLIKNSWGVNWGYLGYARVERRLVTSLSFP 342 >ref|XP_006347646.1| PREDICTED: uncharacterized protein LOC102578529 [Solanum tuberosum] Length = 893 Score = 77.0 bits (188), Expect = 1e-11 Identities = 58/204 (28%), Positives = 96/204 (47%), Gaps = 14/204 (6%) Frame = -2 Query: 667 IPGSPQEVVDYGSLFIRPKDAPASGPYELNERVPYSYRTVFKYATEIGISPVELYPW--- 497 +P S Q+++D ++ + K E P SY +K+A + GI+ YP+ Sbjct: 694 VPLSKQQLID--CMYTKYKKPSYFADLGEKECFPCSYNKAYKFAMDYGITVETKYPFMEE 751 Query: 496 NGKCKFREWDGSVWDRVVECKG--KVIEPSKKIFIDGKKILQAREI-DDYLRHQPLTGQI 326 GKC+ + R+++ G +V E K++ + L +EI + +R QP+T Sbjct: 752 RGKCECQSEM-----RIIKINGFQRVSELIKELEEKAIEKLDEKEIIEKLIRQQPITCAA 806 Query: 325 KVTEELNAWKGDGIYRGGDTTCYM--------GSDVGKHAVTIVGFGGEGKDAYFVCQNS 170 L +G G+Y G G VGKHA+ IVG+G E +++ +NS Sbjct: 807 LHVPSLQLHRGKGVYMGPTENEIAQVRQKETEGQVVGKHAMLIVGYGEEEGVEFYLVKNS 866 Query: 169 YGTGWGYRGYFKVARHLITNKYIP 98 +GT WGY+GY K+ R ++ P Sbjct: 867 WGTEWGYQGYAKIKRSALSKLSYP 890 >ref|NP_509408.1| Protein R09F10.1 [Caenorhabditis elegans] gi|351061560|emb|CCD69414.1| Protein R09F10.1 [Caenorhabditis elegans] Length = 383 Score = 73.9 bits (180), Expect = 9e-11 Identities = 59/222 (26%), 
Positives = 104/222 (46%), Gaps = 7/222 (3%) Frame = -2 Query: 769 RLMEVRNQEKTNACXXXXXXXXXFGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGP 590 +L ++NQ + +C + G + S QE+VD ++ SG Sbjct: 179 KLTPIKNQGQCGSCWAFATVASVEAQNAIKKGKLVSLSEQEMVDCDG-----RNNGCSGG 233 Query: 589 YELNERVPYSYRTVFKYATEIGISPVELYPWNG----KCKFREWDGSVWDRVVECKGKVI 422 Y PY+ K+ E G+ + YP++ +C +E D Sbjct: 234 YR-----PYA----MKFVKENGLESEKEYPYSALKHDQCFLKEND--------------- 269 Query: 421 EPSKKIFIDGKKILQAREID--DYLRHQ-PLTGQIKVTEELNAWKGDGIYRGGDTTCYMG 251 ++FID ++L E D +++ + P+T + V + + +++ GI+ C Sbjct: 270 ---TRVFIDDFRMLSNNEEDIANWVGTKGPVTFGMNVVKAMYSYRS-GIFNPSVEDCTEK 325 Query: 250 SDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 125 S +G HA+TI+G+GGEG+ AY++ +NS+GT WG GYF++AR Sbjct: 326 S-MGAHALTIIGYGGEGESAYWIVKNSWGTSWGASGYFRLAR 366 >ref|XP_002645498.1| Hypothetical protein CBG22748 [Caenorhabditis briggsae] gi|187021579|emb|CAP39268.1| Protein CBG22748 [Caenorhabditis briggsae] Length = 379 Score = 73.9 bits (180), Expect = 9e-11 Identities = 60/222 (27%), Positives = 102/222 (45%), Gaps = 7/222 (3%) Frame = -2 Query: 769 RLMEVRNQEKTNACXXXXXXXXXFGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGP 590 +L ++NQ + +C + G + S QE+VD ++ SG Sbjct: 175 KLTPIKNQGQCGSCWAFATVAAIEAQHAIKKGILVSLSEQEMVDCDG-----RNNGCSGG 229 Query: 589 YELNERVPYSYRTVFKYATEIGISPVELYPWNG----KCKFREWDGSVWDRVVECKGKVI 422 Y PY+ R + E G+ + YP++ +C + D Sbjct: 230 YR-----PYAMR----FVKENGLETEKSYPYSALKHDQCMLHQND--------------- 265 Query: 421 EPSKKIFIDGKKILQARE--IDDYLRHQ-PLTGQIKVTEELNAWKGDGIYRGGDTTCYMG 251 K++ID ++L E I D++ + P+T + V + + +++ GI+ C Sbjct: 266 ---TKVYIDDYRMLSTSEENIADWVGTKGPVTFGMNVVKAMYSYRS-GIFNPSAEDCAEK 321 Query: 250 SDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 125 S +G HA+TIVG+GGEG AY++ +NS+GT WG GYF++AR Sbjct: 322 S-MGAHALTIVGYGGEGTSAYWIVKNSWGTSWGSDGYFRLAR 362 >gb|EGT32164.1| hypothetical protein CAEBREN_11106 [Caenorhabditis brenneri] Length = 389 Score = 73.2 bits (178), Expect = 2e-10 Identities = 58/222 (26%), Positives = 103/222 (46%), Gaps = 7/222 (3%) Frame = -2 Query: 769 
RLMEVRNQEKTNACXXXXXXXXXFGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGP 590 +L ++NQ + +C + G + S QE+VD ++ SG Sbjct: 185 KLTPIKNQGQCGSCWAFATVAAVEAQHAIKKGQLVSLSEQEMVDCDG-----RNNGCSGG 239 Query: 589 YELNERVPYSYRTVFKYATEIGISPVELYPWNG----KCKFREWDGSVWDRVVECKGKVI 422 Y PY+ R + E G+ + YP++ +C ++ D Sbjct: 240 YR-----PYAMR----FVKENGLESEKEYPYSALKHDQCFLKQND--------------- 275 Query: 421 EPSKKIFIDGKKILQAREID--DYLRHQ-PLTGQIKVTEELNAWKGDGIYRGGDTTCYMG 251 ++FID ++L E D +++ + P+T + V + + +++ GI+ C Sbjct: 276 ---TRVFIDDFRMLSTNEEDIANWVGTKGPVTFGMNVVKAMYSYRS-GIFNPSSEDCAEK 331 Query: 250 SDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 125 S +G HA+TIVG+GGEG A+++ +NS+GT WG GYF++AR Sbjct: 332 S-MGAHALTIVGYGGEGSSAFWIVKNSWGTSWGSSGYFRLAR 372 >ref|XP_003100293.1| hypothetical protein CRE_21852 [Caenorhabditis remanei] gi|308265817|gb|EFP09770.1| hypothetical protein CRE_21852 [Caenorhabditis remanei] Length = 391 Score = 73.2 bits (178), Expect = 2e-10 Identities = 59/222 (26%), Positives = 104/222 (46%), Gaps = 7/222 (3%) Frame = -2 Query: 769 RLMEVRNQEKTNACXXXXXXXXXFGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGP 590 +L ++NQ + +C + + S QE+VD K+ SG Sbjct: 187 KLTPIKNQGQCGSCWAFATVAAVEAQHAIRKNQLVSLSEQEMVDCDD-----KNNGCSGG 241 Query: 589 YELNERVPYSYRTVFKYATEIGISPVELYPWNG----KCKFREWDGSVWDRVVECKGKVI 422 Y PY+ R + E G+ + YP++ +C ++ D Sbjct: 242 YR-----PYAMR----FVKENGLESEKEYPYSALKHDQCMLKQND--------------- 277 Query: 421 EPSKKIFIDGKKILQARE--IDDYLRHQ-PLTGQIKVTEELNAWKGDGIYRGGDTTCYMG 251 ++FID ++L E I +++ + P+T + VT+ + +++ GI+ C Sbjct: 278 ---TRVFIDDFRMLSQNEEEIANWVGTKGPVTFGMSVTKAMYSYRS-GIFNPSADDCAEK 333 Query: 250 SDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 125 S +G HA+TIVG+GGEG+ A+++ +NS+GT WG GYF++AR Sbjct: 334 S-MGSHALTIVGYGGEGEAAFWIVKNSWGTSWGASGYFRLAR 374 >ref|NP_001163996.1| cathepsin L-like proteinase precursor [Tribolium castaneum] gi|281427798|ref|NP_001164001.1| cathepsin L-like proteinase precursor [Tribolium castaneum] gi|270001241|gb|EEZ97688.1| cathepsin L precursor [Tribolium castaneum] gi|270016928|gb|EFA13374.1| 
hypothetical protein TcasGA2_TC001950 [Tribolium castaneum] Length = 328 Score = 69.3 bits (168), Expect = 2e-09 Identities = 70/252 (27%), Positives = 111/252 (44%), Gaps = 6/252 (2%) Frame = -2 Query: 847 KLKDSERNTLPIFASAK--ACLVTDHPERLMEVRNQEKTNACXXXXXXXXXFGAQVRQNG 674 K K +E+ +P S K A V + + EV++Q + +C G Q+ +G Sbjct: 97 KPKMNEKLRIPFVKSGKPAAAEVDWRSKAVTEVKDQGQCGSCWSFSTTGAVEG-QLAISG 155 Query: 673 HAIPG-SPQEVVDYGSLFIRPKDAPASGPYELNERVPYSYRTVFKYATEIGISPVELYPW 497 + S Q +VD S + +A +G + + F Y + GI YP+ Sbjct: 156 KGLTSLSEQNLVDCSSQY---GNAGCNGGW---------MDSAFDYIHDNGIMSESAYPY 203 Query: 496 ---NGKCKFREWDGSVWDRVVECKGKVIEPSKKIFIDGKKILQAREIDDYLRHQPLTGQI 326 +G C+F D S V +G PS + LQ D + P+ + Sbjct: 204 TAMDGNCRF---DAS--QSVTSLQGYYDIPS-----GDESALQ----DAVANNGPVAVAL 249 Query: 325 KVTEELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYR 146 TEEL + G +Y DTTC + H V +VG+G EG Y++ +NS+G+GWG + Sbjct: 250 DATEELQLYSGGVLY---DTTC--SAQALNHGVLVVGYGSEGGQDYWIVKNSWGSGWGEQ 304 Query: 145 GYFKVARHLITN 110 GY++ AR+ N Sbjct: 305 GYWRQARNRNNN 316 >ref|NP_001164314.1| cathepsin L precursor [Tribolium castaneum] gi|270001247|gb|EEZ97694.1| cathepsin L precursor [Tribolium castaneum] Length = 328 Score = 68.2 bits (165), Expect = 5e-09 Identities = 68/252 (26%), Positives = 107/252 (42%), Gaps = 6/252 (2%) Frame = -2 Query: 847 KLKDSERNTLPIFASAK--ACLVTDHPERLMEVRNQEKTNACXXXXXXXXXFGAQVRQNG 674 K K +E+ LP S K A V + EV+NQ + +C G Q+ +G Sbjct: 97 KPKKNEKLRLPFVQSDKPAAAEVDWRNSAVSEVKNQGQCGSCWSFSTTGAVEG-QLAISG 155 Query: 673 HAIPG-SPQEVVDYGSLFIRPKDAPASGPYELNERVPYSYRTVFKYATEIGISPVELYPW 497 + S Q +VD S + +A +G + + F Y + GI YP+ Sbjct: 156 RGLTSLSEQNLVDCSSAY---GNAGCNGGW---------MDSAFDYIHDNGIMSESAYPY 203 Query: 496 N---GKCKFREWDGSVWDRVVECKGKVIEPSKKIFIDGKKILQAREIDDYLRHQPLTGQI 326 G C+F + V +G PS D + A + P+ + Sbjct: 204 TASEGSCRFNPSES-----VTSLQGYYDLPSG----DENALKSA-----VANNGPIAVAL 249 Query: 325 KVTEELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYR 146 T+EL + G +Y DTTC + H V +VG+G EG Y++ +NS+G+GWG + Sbjct: 250 
DATDELQFYSGGVLY---DTTC--SAQALNHGVLVVGYGSEGGQDYWIVKNSWGSGWGEQ 304 Query: 145 GYFKVARHLITN 110 GY++ AR+ N Sbjct: 305 GYWRQARNRNNN 316 >emb|CDJ81168.1| Proteinase inhibitor I29 and Peptidase C1A domain containing protein [Haemonchus contortus] Length = 390 Score = 65.1 bits (157), Expect = 4e-08 Identities = 57/218 (26%), Positives = 100/218 (45%), Gaps = 3/218 (1%) Frame = -2 Query: 769 RLMEVRNQEKTNACXXXXXXXXXFGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGP 590 +L V++Q + +C A + G S QE+VD + ++ G Sbjct: 185 KLTPVKDQGQCGSCWAFATVASIEAANAIKTGQLTRLSEQEMVDCDT-----QNNGCQGG 239 Query: 589 YELNERVPYSYRTVFKYATEIGISPVELYPWNGKCKFREWDGSVWDRVVECKGKVIEPSK 410 Y PY+ + + G+ E YP++G + C K S+ Sbjct: 240 YR-----PYA----MSFVQQNGLMKEEKYPYSGTDQNT------------CLLK--RDSE 276 Query: 409 KIFIDGKKILQARE--IDDYLR-HQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDVG 239 ++FI ++L + E I D++ + P+T + VT+ + +++ GI+ C S +G Sbjct: 277 RVFIQSYRMLSSNEEVIADWIAANGPVTFGMNVTKSMYSYRS-GIFAPTQEDCEQHS-LG 334 Query: 238 KHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 125 HA+T VG+G E Y++ +NS+G+ WG GYFK+AR Sbjct: 335 SHALTFVGYGTENGQPYWLVKNSWGSRWGQDGYFKLAR 372 >ref|XP_003601499.1| Cathepsin B [Medicago truncatula] gi|355490547|gb|AES71750.1| Cathepsin B [Medicago truncatula] Length = 232 Score = 64.7 bits (156), Expect = 6e-08 Identities = 35/94 (37%), Positives = 56/94 (59%), Gaps = 9/94 (9%) Frame = -2 Query: 388 KILQAREIDDYLRHQ-PLTGQIKVTEELNAWKGDGIYRG-GDTTCYM---GSDVGKHAVT 224 K L R++ ++LR + P+ ++K +E+ +KGDGIY G D ++ + VG HA+ Sbjct: 112 KWLPFRKMKEHLRDEGPIAVEVKWIKEMGDYKGDGIYNGPADANAFVKTVNNHVGDHALL 171 Query: 223 IVGFGGEGKDA----YFVCQNSYGTGWGYRGYFK 134 ++GFG E + Y++ QNS+G GWG GY K Sbjct: 172 VIGFGSERIEGELVHYWIVQNSHGEGWGKEGYAK 205 >ref|XP_006396923.1| hypothetical protein EUTSA_v10028733mg [Eutrema salsugineum] gi|557097940|gb|ESQ38376.1| hypothetical protein EUTSA_v10028733mg [Eutrema salsugineum] Length = 379 Score = 63.5 bits (153), Expect = 1e-07 Identities = 37/111 (33%), Positives = 62/111 (55%), Gaps = 3/111 (2%) Frame = -2 Query: 439 
CKGKVIEPSKKIFIDGKKILQARE---IDDYLRHQPLTGQIKVTEELNAWKGDGIYRGGD 269 C G++ E +KK+ IDG + L A + + + HQP+T I + G++ G Sbjct: 247 CDGRLKENNKKVMIDGYENLPANDEFALMKAVAHQPVTAVIDSSSRDFQLYESGVFDG-- 304 Query: 268 TTCYMGSDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVARHLI 116 TC G+++ H V +VG+G E Y++ +NS+G WG GY K+AR+++ Sbjct: 305 -TC--GTNLN-HGVVVVGYGTENGHDYWIVRNSWGNTWGEAGYMKMARNIV 351 >gb|AGV15822.1| cysteine protease CP14 [Nicotiana tabacum] Length = 505 Score = 63.5 bits (153), Expect = 1e-07 Identities = 61/228 (26%), Positives = 92/228 (40%), Gaps = 7/228 (3%) Frame = -2 Query: 760 EVRNQEKTNACXXXXXXXXXFGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGPYEL 581 EV+NQ++ AC G G I S QE+++ + + D P Sbjct: 160 EVKNQDQCGACWAFSACGAMEGINAIATGELISLSEQELINCDNSYNTGCDGGLMDP--- 216 Query: 580 NERVPYSYRTVFKYA-TEIGISPVELYPW---NGKCKFREWDGSVWDRVVECKGKVIEPS 413 F++ GI+ YP+ G+C + + V K +I+ Sbjct: 217 ----------AFEWVMNNSGINSEADYPYTASQGRCNYDK---------VNHKVVIIDGY 257 Query: 412 KKIFIDGKKILQAREIDDYLRHQPLTGQIKVTEELNAWKGD-GIYRGG--DTTCYMGSDV 242 + + D +L A GQ V+ ++ D +YRGG D C D Sbjct: 258 QDVPEDENALLCA------------VGQQPVSVGIDGSSLDFQLYRGGIYDGECSSNPDD 305 Query: 241 GKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVARHLITNKYIP 98 HAV IVG+G EG D Y++ +NS+GT WG GY + R N Y+P Sbjct: 306 LSHAVVIVGYGSEGDDDYWIIKNSWGTSWGMEGYAYIRR----NTYLP 349 >gb|AGB07568.1| cathepsin b-like cysteine protease 11, partial [Ancylostoma duodenale] Length = 250 Score = 62.0 bits (149), Expect = 4e-07 Identities = 39/142 (27%), Positives = 62/142 (43%), Gaps = 11/142 (7%) Frame = -2 Query: 517 PVELYP--------WNGKCKFREWDGSVWDRVVECKGKVIEPSKKIFIDGKKILQARE-- 368 P LYP + G C + WD V + K + KI+ + I+ + Sbjct: 101 PYPLYPCGRHQNQTYYGPCSEKLWDTPVCRSACQFKYPIPYRQDKIYGNSTYIIPKNQTI 160 Query: 367 -IDDYLRHQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVGFGGEGKDA 191 + + + H P+ KV E+ Y+GG G G HAV ++G+G E Sbjct: 161 IMTEIMTHGPVVATYKVYEDF------AYYKGGVYVHTAGEQKGAHAVRVIGWGEENSLP 214 Query: 190 YFVCQNSYGTGWGYRGYFKVAR 125 Y++ NS+ T WG +GYF++ R Sbjct: 215 YWLVANSWNTDWGEKGYFRILR 236 >gb|ABC88769.1| 
putative cathepsin L-like proteinase [Tenebrio molitor] Length = 328 Score = 61.6 bits (148), Expect = 5e-07 Identities = 63/248 (25%), Positives = 97/248 (39%), Gaps = 2/248 (0%) Frame = -2 Query: 847 KLKDSERNTLPIFASAK--ACLVTDHPERLMEVRNQEKTNACXXXXXXXXXFGAQVRQNG 674 K K E +P +S K A V + EV++Q + +C G Q G Sbjct: 97 KPKHPENLRMPYVSSKKPLAASVDWRSNAVSEVKDQGQCGSCWSFSTTGAVEGQLALQRG 156 Query: 673 HAIPGSPQEVVDYGSLFIRPKDAPASGPYELNERVPYSYRTVFKYATEIGISPVELYPWN 494 S Q ++D S + +A G + + F Y + GI YP+ Sbjct: 157 RLTSLSEQNLIDCSSSY---GNAGCDGGW---------MDSAFSYIHDYGIMSESAYPYE 204 Query: 493 GKCKFREWDGSVWDRVVECKGKVIEPSKKIFIDGKKILQAREIDDYLRHQPLTGQIKVTE 314 + + +D S V G PS + QA P+ I T+ Sbjct: 205 AQGDYCRFDSS--QSVTTLSGYYDLPSGDENSLADAVGQAG---------PVAVAIDATD 253 Query: 313 ELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFK 134 EL + G Y D TC SD+ H V +VG+G + Y++ +NS+G+GWG GY++ Sbjct: 254 ELQFYSGGLFY---DQTCNQ-SDLN-HGVLVVGYGSDNGQDYWILKNSWGSGWGESGYWR 308 Query: 133 VARHLITN 110 R+ N Sbjct: 309 QVRNYGNN 316 >gb|AAP94048.2| cathepsin-L-like midgut cysteine proteinase [Tenebrio molitor] Length = 330 Score = 61.6 bits (148), Expect = 5e-07 Identities = 63/248 (25%), Positives = 97/248 (39%), Gaps = 2/248 (0%) Frame = -2 Query: 847 KLKDSERNTLPIFASAK--ACLVTDHPERLMEVRNQEKTNACXXXXXXXXXFGAQVRQNG 674 K K E +P +S K A V + EV++Q + +C G Q G Sbjct: 99 KPKHPENLRMPYVSSKKPLAASVDWRSNAVSEVKDQGQCGSCWSFSTTGAVEGQLALQRG 158 Query: 673 HAIPGSPQEVVDYGSLFIRPKDAPASGPYELNERVPYSYRTVFKYATEIGISPVELYPWN 494 S Q ++D S + +A G + + F Y + GI YP+ Sbjct: 159 RLTSLSEQNLIDCSSSY---GNAGCDGGW---------MDSAFSYIHDYGIMSESAYPYE 206 Query: 493 GKCKFREWDGSVWDRVVECKGKVIEPSKKIFIDGKKILQAREIDDYLRHQPLTGQIKVTE 314 + + +D S V G PS + QA P+ I T+ Sbjct: 207 AQGDYCRFDSS--QSVTTLSGYYDLPSGDENSLADAVGQAG---------PVAVAIDATD 255 Query: 313 ELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFK 134 EL + G Y D TC SD+ H V +VG+G + Y++ +NS+G+GWG GY++ Sbjct: 256 ELQFYSGGLFY---DQTCNQ-SDLN-HGVLVVGYGSDNGQDYWILKNSWGSGWGESGYWR 310 Query: 133 VARHLITN 110 R+ N 
Sbjct: 311 QVRNYGNN 318 >gb|EYC39727.1| hypothetical protein Y032_0643g1064 [Ancylostoma ceylanicum] Length = 510 Score = 60.8 bits (146), Expect = 8e-07 Identities = 40/150 (26%), Positives = 68/150 (45%), Gaps = 11/150 (7%) Frame = -2 Query: 541 YATEIGISPVELYP--------WNGKCKFREWDGSVWDRVVECKGKVIEPSKKIFIDGKK 386 Y + P LYP + G C + W+ V + K + KI+ + Sbjct: 349 YREKNACKPYPLYPCGHHQNQTFYGPCPEKLWNTPVCRSACQRKYPIPYRKDKIYGNSTY 408 Query: 385 ILQARE---IDDYLRHQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVG 215 I+ + + + + H P+ K+ E+ + +KG GIY G + G HAV ++G Sbjct: 409 IIPMNQTIIMTEIMTHGPVVATYKIYEDFSYYKG-GIY-----VHTAGEEKGAHAVRVIG 462 Query: 214 FGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 125 +G E Y++ NS+ T WG +GYF++ R Sbjct: 463 WGEEKSIPYWLVANSWNTDWGEKGYFRILR 492 >gb|EYC39726.1| hypothetical protein Y032_0643g1064 [Ancylostoma ceylanicum] Length = 521 Score = 60.8 bits (146), Expect = 8e-07 Identities = 40/150 (26%), Positives = 68/150 (45%), Gaps = 11/150 (7%) Frame = -2 Query: 541 YATEIGISPVELYP--------WNGKCKFREWDGSVWDRVVECKGKVIEPSKKIFIDGKK 386 Y + P LYP + G C + W+ V + K + KI+ + Sbjct: 360 YREKNACKPYPLYPCGHHQNQTFYGPCPEKLWNTPVCRSACQRKYPIPYRKDKIYGNSTY 419 Query: 385 ILQARE---IDDYLRHQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVG 215 I+ + + + + H P+ K+ E+ + +KG GIY G + G HAV ++G Sbjct: 420 IIPMNQTIIMTEIMTHGPVVATYKIYEDFSYYKG-GIY-----VHTAGEEKGAHAVRVIG 473 Query: 214 FGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 125 +G E Y++ NS+ T WG +GYF++ R Sbjct: 474 WGEEKSIPYWLVANSWNTDWGEKGYFRILR 503 >gb|EYC39725.1| hypothetical protein Y032_0643g1064 [Ancylostoma ceylanicum] Length = 529 Score = 60.8 bits (146), Expect = 8e-07 Identities = 40/150 (26%), Positives = 68/150 (45%), Gaps = 11/150 (7%) Frame = -2 Query: 541 YATEIGISPVELYP--------WNGKCKFREWDGSVWDRVVECKGKVIEPSKKIFIDGKK 386 Y + P LYP + G C + W+ V + K + KI+ + Sbjct: 368 YREKNACKPYPLYPCGHHQNQTFYGPCPEKLWNTPVCRSACQRKYPIPYRKDKIYGNSTY 427 Query: 385 ILQARE---IDDYLRHQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVG 215 I+ + + + + H P+ K+ E+ + +KG GIY G + G HAV ++G Sbjct: 428 
IIPMNQTIIMTEIMTHGPVVATYKIYEDFSYYKG-GIY-----VHTAGEEKGAHAVRVIG 481 Query: 214 FGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 125 +G E Y++ NS+ T WG +GYF++ R Sbjct: 482 WGEEKSIPYWLVANSWNTDWGEKGYFRILR 511 >gb|ETN70308.1| papain family cysteine protease [Necator americanus] Length = 414 Score = 60.8 bits (146), Expect = 8e-07 Identities = 34/99 (34%), Positives = 60/99 (60%), Gaps = 3/99 (3%) Frame = -2 Query: 412 KKIFIDGKKILQARE--IDDYLR-HQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDV 242 ++++I + L + E + D++ + P+T + VT+ L +++ GI+ C S + Sbjct: 302 ERVYIRSYRTLSSNEDAVADWIAANGPVTFGMNVTKSLYSYRS-GIFSPSKEDCEEHS-L 359 Query: 241 GKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 125 G HA+T VG+G EG Y++ +NS+G+ WG GYFK+AR Sbjct: 360 GSHALTFVGYGTEGGQPYWLVKNSWGSRWGQNGYFKMAR 398 >gb|ETN61493.1| cathepsin b [Anopheles darlingi] Length = 339 Score = 60.8 bits (146), Expect = 8e-07 Identities = 32/79 (40%), Positives = 45/79 (56%) Frame = -2 Query: 361 DYLRHQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVGFGGEGKDAYFV 182 + + + P+ G V E++ +K G+YR G VGKHAV I+G+G EG Y++ Sbjct: 249 EIMTNGPVEGGFDVYEDVFLYKS-GVYRH-----VYGEHVGKHAVRIIGWGREGGIPYWL 302 Query: 181 CQNSYGTGWGYRGYFKVAR 125 NSYG WG GYFK+ R Sbjct: 303 ISNSYGEDWGDHGYFKIVR 321