BLASTX nr result
ID: Mentha25_contig00007772
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00007772
         (1274 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006347646.1| PREDICTED: uncharacterized protein LOC102578...    78   9e-12
ref|XP_006360817.1| PREDICTED: pro-cathepsin H-like [Solanum tub...    76   3e-11
ref|NP_509408.1| Protein R09F10.1 [Caenorhabditis elegans] gi|35...    74   1e-10
ref|XP_002645498.1| Hypothetical protein CBG22748 [Caenorhabditi...    74   1e-10
gb|EGT32164.1| hypothetical protein CAEBREN_11106 [Caenorhabditi...    73   2e-10
ref|XP_003100293.1| hypothetical protein CRE_21852 [Caenorhabdit...    73   2e-10
ref|NP_001163996.1| cathepsin L-like proteinase precursor [Tribo...    66   3e-08
ref|NP_001164314.1| cathepsin L precursor [Tribolium castaneum] ...    65   5e-08
emb|CDJ81168.1| Proteinase inhibitor I29 and Peptidase C1A domai...    65   6e-08
ref|XP_003601499.1| Cathepsin B [Medicago truncatula] gi|3554905...    65   8e-08
ref|XP_006396923.1| hypothetical protein EUTSA_v10028733mg [Eutr...    64   2e-07
gb|ABC88769.1| putative cathepsin L-like proteinase [Tenebrio mo...    63   3e-07
gb|AAP94048.2| cathepsin-L-like midgut cysteine proteinase [Tene...    63   3e-07
gb|AGB07568.1| cathepsin b-like cysteine protease 11, partial [A...    62   7e-07
gb|AGV15822.1| cysteine protease CP14 [Nicotiana tabacum]              62   7e-07
ref|XP_004307286.1| PREDICTED: oryzain beta chain-like [Fragaria...    61   9e-07
gb|ETN70308.1| papain family cysteine protease [Necator americanus]    61   1e-06
gb|ETN61493.1| cathepsin b [Anopheles darlingi]                        61   1e-06
ref|XP_005366697.1| PREDICTED: pro-cathepsin H [Microtus ochroga...
61 1e-06 dbj|BAN20308.1| cathepsin L [Riptortus pedestris] 61 1e-06 >ref|XP_006347646.1| PREDICTED: uncharacterized protein LOC102578529 [Solanum tuberosum] Length = 893 Score = 77.8 bits (190), Expect = 9e-12 Identities = 58/199 (29%), Positives = 95/199 (47%), Gaps = 14/199 (7%) Frame = +1 Query: 343 IPGSPQEVVDYGSLFIRPKDAPASGPYELNERVPYSYRTVFKYATEIGISPVELYPW--- 513 +P S Q+++D ++ + K E P SY +K+A + GI+ YP+ Sbjct: 694 VPLSKQQLID--CMYTKYKKPSYFADLGEKECFPCSYNKAYKFAMDYGITVETKYPFMEE 751 Query: 514 NGKCKFREWDGSVWDRVVKCKG--KVIEPSKKIFIDGKKILQAREI-DDYLRHQPLTGQI 684 GKC+ + R++K G +V E K++ + L +EI + +R QP+T Sbjct: 752 RGKCECQSEM-----RIIKINGFQRVSELIKELEEKAIEKLDEKEIIEKLIRQQPITCAA 806 Query: 685 KVTEELNAWKGDGIYRGGDTTCYM--------GSDVGKHAVTIVGFGGEGKDAYFVCQNS 840 L +G G+Y G G VGKHA+ IVG+G E +++ +NS Sbjct: 807 LHVPSLQLHRGKGVYMGPTENEIAQVRQKETEGQVVGKHAMLIVGYGEEEGVEFYLVKNS 866 Query: 841 YGTGWGYRGYFKVARHLIT 897 +GT WGY+GY K+ R ++ Sbjct: 867 WGTEWGYQGYAKIKRSALS 885 >ref|XP_006360817.1| PREDICTED: pro-cathepsin H-like [Solanum tuberosum] Length = 346 Score = 76.3 bits (186), Expect = 3e-11 Identities = 50/159 (31%), Positives = 77/159 (48%), Gaps = 6/159 (3%) Frame = +1 Query: 442 PYSYRTVFKYATEIGISPVELYPWNGKCKFREWDGSVWDRVVKCKGKVIEPSKKIFIDGK 621 P Y F+YA E G+ P + YP+ + G + + K K I+ KK+ G Sbjct: 190 PSHYNNYFQYAIEKGVYPDKPYPYLAE------RGECLELPNEEKTK-IKAYKKVNDLG- 241 Query: 622 KILQAREIDDYLRHQPLTGQIKVTEELNAWKGDGIYRGG------DTTCYMGSDVGKHAV 783 L + I++ ++ QP+ G +K+ + KG IY G G+HAV Sbjct: 242 --LDKKSIEELIQKQPICGSVKLAKNFQKHKGKDIYMGQTKEEIYSEASKNNQSRGRHAV 299 Query: 784 TIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVARHLITN 900 I+GFG E Y++ +NS+G WGY GY +V R L+T+ Sbjct: 300 LIIGFGIENGIEYYLIKNSWGVNWGYLGYARVERRLVTS 338 >ref|NP_509408.1| Protein R09F10.1 [Caenorhabditis elegans] gi|351061560|emb|CCD69414.1| Protein R09F10.1 [Caenorhabditis elegans] Length = 383 Score = 73.9 bits (180), Expect = 1e-10 Identities = 59/222 (26%), Positives = 104/222 (46%), Gaps = 7/222 (3%) Frame = +1 Query: 241 
RLMEVRNQEKTNACXXXXXXXXXXGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGP 420 +L ++NQ + +C + G + S QE+VD ++ SG Sbjct: 179 KLTPIKNQGQCGSCWAFATVASVEAQNAIKKGKLVSLSEQEMVDCDG-----RNNGCSGG 233 Query: 421 YELNERVPYSYRTVFKYATEIGISPVELYPWNG----KCKFREWDGSVWDRVVKCKGKVI 588 Y PY+ K+ E G+ + YP++ +C +E D Sbjct: 234 YR-----PYA----MKFVKENGLESEKEYPYSALKHDQCFLKEND--------------- 269 Query: 589 EPSKKIFIDGKKILQAREID--DYLRHQ-PLTGQIKVTEELNAWKGDGIYRGGDTTCYMG 759 ++FID ++L E D +++ + P+T + V + + +++ GI+ C Sbjct: 270 ---TRVFIDDFRMLSNNEEDIANWVGTKGPVTFGMNVVKAMYSYRS-GIFNPSVEDCTEK 325 Query: 760 SDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 885 S +G HA+TI+G+GGEG+ AY++ +NS+GT WG GYF++AR Sbjct: 326 S-MGAHALTIIGYGGEGESAYWIVKNSWGTSWGASGYFRLAR 366 >ref|XP_002645498.1| Hypothetical protein CBG22748 [Caenorhabditis briggsae] gi|187021579|emb|CAP39268.1| Protein CBG22748 [Caenorhabditis briggsae] Length = 379 Score = 73.9 bits (180), Expect = 1e-10 Identities = 60/222 (27%), Positives = 102/222 (45%), Gaps = 7/222 (3%) Frame = +1 Query: 241 RLMEVRNQEKTNACXXXXXXXXXXGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGP 420 +L ++NQ + +C + G + S QE+VD ++ SG Sbjct: 175 KLTPIKNQGQCGSCWAFATVAAIEAQHAIKKGILVSLSEQEMVDCDG-----RNNGCSGG 229 Query: 421 YELNERVPYSYRTVFKYATEIGISPVELYPWNG----KCKFREWDGSVWDRVVKCKGKVI 588 Y PY+ R + E G+ + YP++ +C + D Sbjct: 230 YR-----PYAMR----FVKENGLETEKSYPYSALKHDQCMLHQND--------------- 265 Query: 589 EPSKKIFIDGKKILQARE--IDDYLRHQ-PLTGQIKVTEELNAWKGDGIYRGGDTTCYMG 759 K++ID ++L E I D++ + P+T + V + + +++ GI+ C Sbjct: 266 ---TKVYIDDYRMLSTSEENIADWVGTKGPVTFGMNVVKAMYSYRS-GIFNPSAEDCAEK 321 Query: 760 SDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 885 S +G HA+TIVG+GGEG AY++ +NS+GT WG GYF++AR Sbjct: 322 S-MGAHALTIVGYGGEGTSAYWIVKNSWGTSWGSDGYFRLAR 362 >gb|EGT32164.1| hypothetical protein CAEBREN_11106 [Caenorhabditis brenneri] Length = 389 Score = 73.2 bits (178), Expect = 2e-10 Identities = 58/222 (26%), Positives = 103/222 (46%), Gaps = 7/222 (3%) Frame = +1 Query: 241 RLMEVRNQEKTNACXXXXXXXXXXGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGP 420 
+L ++NQ + +C + G + S QE+VD ++ SG Sbjct: 185 KLTPIKNQGQCGSCWAFATVAAVEAQHAIKKGQLVSLSEQEMVDCDG-----RNNGCSGG 239 Query: 421 YELNERVPYSYRTVFKYATEIGISPVELYPWNG----KCKFREWDGSVWDRVVKCKGKVI 588 Y PY+ R + E G+ + YP++ +C ++ D Sbjct: 240 YR-----PYAMR----FVKENGLESEKEYPYSALKHDQCFLKQND--------------- 275 Query: 589 EPSKKIFIDGKKILQAREID--DYLRHQ-PLTGQIKVTEELNAWKGDGIYRGGDTTCYMG 759 ++FID ++L E D +++ + P+T + V + + +++ GI+ C Sbjct: 276 ---TRVFIDDFRMLSTNEEDIANWVGTKGPVTFGMNVVKAMYSYRS-GIFNPSSEDCAEK 331 Query: 760 SDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 885 S +G HA+TIVG+GGEG A+++ +NS+GT WG GYF++AR Sbjct: 332 S-MGAHALTIVGYGGEGSSAFWIVKNSWGTSWGSSGYFRLAR 372 >ref|XP_003100293.1| hypothetical protein CRE_21852 [Caenorhabditis remanei] gi|308265817|gb|EFP09770.1| hypothetical protein CRE_21852 [Caenorhabditis remanei] Length = 391 Score = 73.2 bits (178), Expect = 2e-10 Identities = 59/222 (26%), Positives = 104/222 (46%), Gaps = 7/222 (3%) Frame = +1 Query: 241 RLMEVRNQEKTNACXXXXXXXXXXGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGP 420 +L ++NQ + +C + + S QE+VD K+ SG Sbjct: 187 KLTPIKNQGQCGSCWAFATVAAVEAQHAIRKNQLVSLSEQEMVDCDD-----KNNGCSGG 241 Query: 421 YELNERVPYSYRTVFKYATEIGISPVELYPWNG----KCKFREWDGSVWDRVVKCKGKVI 588 Y PY+ R + E G+ + YP++ +C ++ D Sbjct: 242 YR-----PYAMR----FVKENGLESEKEYPYSALKHDQCMLKQND--------------- 277 Query: 589 EPSKKIFIDGKKILQARE--IDDYLRHQ-PLTGQIKVTEELNAWKGDGIYRGGDTTCYMG 759 ++FID ++L E I +++ + P+T + VT+ + +++ GI+ C Sbjct: 278 ---TRVFIDDFRMLSQNEEEIANWVGTKGPVTFGMSVTKAMYSYRS-GIFNPSADDCAEK 333 Query: 760 SDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 885 S +G HA+TIVG+GGEG+ A+++ +NS+GT WG GYF++AR Sbjct: 334 S-MGSHALTIVGYGGEGEAAFWIVKNSWGTSWGASGYFRLAR 374 >ref|NP_001163996.1| cathepsin L-like proteinase precursor [Tribolium castaneum] gi|281427798|ref|NP_001164001.1| cathepsin L-like proteinase precursor [Tribolium castaneum] gi|270001241|gb|EEZ97688.1| cathepsin L precursor [Tribolium castaneum] gi|270016928|gb|EFA13374.1| hypothetical protein TcasGA2_TC001950 [Tribolium castaneum] Length = 
328 Score = 66.2 bits (160), Expect = 3e-08 Identities = 67/247 (27%), Positives = 108/247 (43%), Gaps = 6/247 (2%) Frame = +1 Query: 178 ERNTLPIFASAE--ACLVTAHPERLMEVRNQEKTNACXXXXXXXXXXGAQVRQNGHAIPG 351 E+ +P S + A V + + EV++Q + +C G Q+ +G + Sbjct: 102 EKLRIPFVKSGKPAAAEVDWRSKAVTEVKDQGQCGSCWSFSTTGAVEG-QLAISGKGLTS 160 Query: 352 -SPQEVVDYGSLFIRPKDAPASGPYELNERVPYSYRTVFKYATEIGISPVELYPW---NG 519 S Q +VD S + +A +G + + F Y + GI YP+ +G Sbjct: 161 LSEQNLVDCSSQY---GNAGCNGGW---------MDSAFDYIHDNGIMSESAYPYTAMDG 208 Query: 520 KCKFREWDGSVWDRVVKCKGKVIEPSKKIFIDGKKILQAREIDDYLRHQPLTGQIKVTEE 699 C+F D S V +G PS + LQ D + P+ + TEE Sbjct: 209 NCRF---DAS--QSVTSLQGYYDIPS-----GDESALQ----DAVANNGPVAVALDATEE 254 Query: 700 LNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKV 879 L + G +Y DTTC + H V +VG+G EG Y++ +NS+G+GWG +GY++ Sbjct: 255 LQLYSGGVLY---DTTC--SAQALNHGVLVVGYGSEGGQDYWIVKNSWGSGWGEQGYWRQ 309 Query: 880 ARHLITN 900 AR+ N Sbjct: 310 ARNRNNN 316 >ref|NP_001164314.1| cathepsin L precursor [Tribolium castaneum] gi|270001247|gb|EEZ97694.1| cathepsin L precursor [Tribolium castaneum] Length = 328 Score = 65.5 bits (158), Expect = 5e-08 Identities = 69/267 (25%), Positives = 112/267 (41%), Gaps = 6/267 (2%) Frame = +1 Query: 118 LTRVVYTSPRKHSMEVLKDPERNTLPIFASAE--ACLVTAHPERLMEVRNQEKTNACXXX 291 + R + T P+K+ E+ LP S + A V + EV+NQ + +C Sbjct: 90 VNRGLATKPKKN--------EKLRLPFVQSDKPAAAEVDWRNSAVSEVKNQGQCGSCWSF 141 Query: 292 XXXXXXXGAQVRQNGHAIPG-SPQEVVDYGSLFIRPKDAPASGPYELNERVPYSYRTVFK 468 G Q+ +G + S Q +VD S + +A +G + + F Sbjct: 142 STTGAVEG-QLAISGRGLTSLSEQNLVDCSSAY---GNAGCNGGW---------MDSAFD 188 Query: 469 YATEIGISPVELYPWN---GKCKFREWDGSVWDRVVKCKGKVIEPSKKIFIDGKKILQAR 639 Y + GI YP+ G C+F + V +G PS D + A Sbjct: 189 YIHDNGIMSESAYPYTASEGSCRFNPSES-----VTSLQGYYDLPSG----DENALKSA- 238 Query: 640 EIDDYLRHQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVGFGGEGKDA 819 + P+ + T+EL + G +Y DTTC + H V +VG+G EG Sbjct: 239 ----VANNGPIAVALDATDELQFYSGGVLY---DTTC--SAQALNHGVLVVGYGSEGGQD 289 Query: 820 
YFVCQNSYGTGWGYRGYFKVARHLITN 900 Y++ +NS+G+GWG +GY++ AR+ N Sbjct: 290 YWIVKNSWGSGWGEQGYWRQARNRNNN 316 >emb|CDJ81168.1| Proteinase inhibitor I29 and Peptidase C1A domain containing protein [Haemonchus contortus] Length = 390 Score = 65.1 bits (157), Expect = 6e-08 Identities = 57/218 (26%), Positives = 100/218 (45%), Gaps = 3/218 (1%) Frame = +1 Query: 241 RLMEVRNQEKTNACXXXXXXXXXXGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGP 420 +L V++Q + +C A + G S QE+VD + ++ G Sbjct: 185 KLTPVKDQGQCGSCWAFATVASIEAANAIKTGQLTRLSEQEMVDCDT-----QNNGCQGG 239 Query: 421 YELNERVPYSYRTVFKYATEIGISPVELYPWNGKCKFREWDGSVWDRVVKCKGKVIEPSK 600 Y PY+ + + G+ E YP++G + C K S+ Sbjct: 240 YR-----PYA----MSFVQQNGLMKEEKYPYSGTDQNT------------CLLK--RDSE 276 Query: 601 KIFIDGKKILQARE--IDDYLR-HQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDVG 771 ++FI ++L + E I D++ + P+T + VT+ + +++ GI+ C S +G Sbjct: 277 RVFIQSYRMLSSNEEVIADWIAANGPVTFGMNVTKSMYSYRS-GIFAPTQEDCEQHS-LG 334 Query: 772 KHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 885 HA+T VG+G E Y++ +NS+G+ WG GYFK+AR Sbjct: 335 SHALTFVGYGTENGQPYWLVKNSWGSRWGQDGYFKLAR 372 >ref|XP_003601499.1| Cathepsin B [Medicago truncatula] gi|355490547|gb|AES71750.1| Cathepsin B [Medicago truncatula] Length = 232 Score = 64.7 bits (156), Expect = 8e-08 Identities = 35/94 (37%), Positives = 56/94 (59%), Gaps = 9/94 (9%) Frame = +1 Query: 622 KILQAREIDDYLRHQ-PLTGQIKVTEELNAWKGDGIYRG-GDTTCYM---GSDVGKHAVT 786 K L R++ ++LR + P+ ++K +E+ +KGDGIY G D ++ + VG HA+ Sbjct: 112 KWLPFRKMKEHLRDEGPIAVEVKWIKEMGDYKGDGIYNGPADANAFVKTVNNHVGDHALL 171 Query: 787 IVGFGGEGKDA----YFVCQNSYGTGWGYRGYFK 876 ++GFG E + Y++ QNS+G GWG GY K Sbjct: 172 VIGFGSERIEGELVHYWIVQNSHGEGWGKEGYAK 205 >ref|XP_006396923.1| hypothetical protein EUTSA_v10028733mg [Eutrema salsugineum] gi|557097940|gb|ESQ38376.1| hypothetical protein EUTSA_v10028733mg [Eutrema salsugineum] Length = 379 Score = 63.5 bits (153), Expect = 2e-07 Identities = 37/111 (33%), Positives = 62/111 (55%), Gaps = 3/111 (2%) Frame = +1 Query: 571 
CKGKVIEPSKKIFIDGKKILQARE---IDDYLRHQPLTGQIKVTEELNAWKGDGIYRGGD 741 C G++ E +KK+ IDG + L A + + + HQP+T I + G++ G Sbjct: 247 CDGRLKENNKKVMIDGYENLPANDEFALMKAVAHQPVTAVIDSSSRDFQLYESGVFDG-- 304 Query: 742 TTCYMGSDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVARHLI 894 TC G+++ H V +VG+G E Y++ +NS+G WG GY K+AR+++ Sbjct: 305 -TC--GTNLN-HGVVVVGYGTENGHDYWIVRNSWGNTWGEAGYMKMARNIV 351 >gb|ABC88769.1| putative cathepsin L-like proteinase [Tenebrio molitor] Length = 328 Score = 62.8 bits (151), Expect = 3e-07 Identities = 62/246 (25%), Positives = 97/246 (39%), Gaps = 2/246 (0%) Frame = +1 Query: 169 KDPERNTLPIFASAE--ACLVTAHPERLMEVRNQEKTNACXXXXXXXXXXGAQVRQNGHA 342 K PE +P +S + A V + EV++Q + +C G Q G Sbjct: 99 KHPENLRMPYVSSKKPLAASVDWRSNAVSEVKDQGQCGSCWSFSTTGAVEGQLALQRGRL 158 Query: 343 IPGSPQEVVDYGSLFIRPKDAPASGPYELNERVPYSYRTVFKYATEIGISPVELYPWNGK 522 S Q ++D S + +A G + + F Y + GI YP+ + Sbjct: 159 TSLSEQNLIDCSSSY---GNAGCDGGW---------MDSAFSYIHDYGIMSESAYPYEAQ 206 Query: 523 CKFREWDGSVWDRVVKCKGKVIEPSKKIFIDGKKILQAREIDDYLRHQPLTGQIKVTEEL 702 + +D S V G PS + QA P+ I T+EL Sbjct: 207 GDYCRFDSS--QSVTTLSGYYDLPSGDENSLADAVGQAG---------PVAVAIDATDEL 255 Query: 703 NAWKGDGIYRGGDTTCYMGSDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVA 882 + G Y D TC SD+ H V +VG+G + Y++ +NS+G+GWG GY++ Sbjct: 256 QFYSGGLFY---DQTCNQ-SDLN-HGVLVVGYGSDNGQDYWILKNSWGSGWGESGYWRQV 310 Query: 883 RHLITN 900 R+ N Sbjct: 311 RNYGNN 316 >gb|AAP94048.2| cathepsin-L-like midgut cysteine proteinase [Tenebrio molitor] Length = 330 Score = 62.8 bits (151), Expect = 3e-07 Identities = 62/246 (25%), Positives = 97/246 (39%), Gaps = 2/246 (0%) Frame = +1 Query: 169 KDPERNTLPIFASAE--ACLVTAHPERLMEVRNQEKTNACXXXXXXXXXXGAQVRQNGHA 342 K PE +P +S + A V + EV++Q + +C G Q G Sbjct: 101 KHPENLRMPYVSSKKPLAASVDWRSNAVSEVKDQGQCGSCWSFSTTGAVEGQLALQRGRL 160 Query: 343 IPGSPQEVVDYGSLFIRPKDAPASGPYELNERVPYSYRTVFKYATEIGISPVELYPWNGK 522 S Q ++D S + +A G + + F Y + GI YP+ + Sbjct: 161 TSLSEQNLIDCSSSY---GNAGCDGGW---------MDSAFSYIHDYGIMSESAYPYEAQ 208 Query: 523 
CKFREWDGSVWDRVVKCKGKVIEPSKKIFIDGKKILQAREIDDYLRHQPLTGQIKVTEEL 702 + +D S V G PS + QA P+ I T+EL Sbjct: 209 GDYCRFDSS--QSVTTLSGYYDLPSGDENSLADAVGQAG---------PVAVAIDATDEL 257 Query: 703 NAWKGDGIYRGGDTTCYMGSDVGKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVA 882 + G Y D TC SD+ H V +VG+G + Y++ +NS+G+GWG GY++ Sbjct: 258 QFYSGGLFY---DQTCNQ-SDLN-HGVLVVGYGSDNGQDYWILKNSWGSGWGESGYWRQV 312 Query: 883 RHLITN 900 R+ N Sbjct: 313 RNYGNN 318 >gb|AGB07568.1| cathepsin b-like cysteine protease 11, partial [Ancylostoma duodenale] Length = 250 Score = 61.6 bits (148), Expect = 7e-07 Identities = 39/142 (27%), Positives = 62/142 (43%), Gaps = 11/142 (7%) Frame = +1 Query: 493 PVELYP--------WNGKCKFREWDGSVWDRVVKCKGKVIEPSKKIFIDGKKILQARE-- 642 P LYP + G C + WD V + K + KI+ + I+ + Sbjct: 101 PYPLYPCGRHQNQTYYGPCSEKLWDTPVCRSACQFKYPIPYRQDKIYGNSTYIIPKNQTI 160 Query: 643 -IDDYLRHQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVGFGGEGKDA 819 + + + H P+ KV E+ Y+GG G G HAV ++G+G E Sbjct: 161 IMTEIMTHGPVVATYKVYEDF------AYYKGGVYVHTAGEQKGAHAVRVIGWGEENSLP 214 Query: 820 YFVCQNSYGTGWGYRGYFKVAR 885 Y++ NS+ T WG +GYF++ R Sbjct: 215 YWLVANSWNTDWGEKGYFRILR 236 >gb|AGV15822.1| cysteine protease CP14 [Nicotiana tabacum] Length = 505 Score = 61.6 bits (148), Expect = 7e-07 Identities = 58/220 (26%), Positives = 89/220 (40%), Gaps = 7/220 (3%) Frame = +1 Query: 250 EVRNQEKTNACXXXXXXXXXXGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGPYEL 429 EV+NQ++ AC G G I S QE+++ + + D P Sbjct: 160 EVKNQDQCGACWAFSACGAMEGINAIATGELISLSEQELINCDNSYNTGCDGGLMDP--- 216 Query: 430 NERVPYSYRTVFKYA-TEIGISPVELYPW---NGKCKFREWDGSVWDRVVKCKGKVIEPS 597 F++ GI+ YP+ G+C + + V K +I+ Sbjct: 217 ----------AFEWVMNNSGINSEADYPYTASQGRCNYDK---------VNHKVVIIDGY 257 Query: 598 KKIFIDGKKILQAREIDDYLRHQPLTGQIKVTEELNAWKGD-GIYRGG--DTTCYMGSDV 768 + + D +L A GQ V+ ++ D +YRGG D C D Sbjct: 258 QDVPEDENALLCA------------VGQQPVSVGIDGSSLDFQLYRGGIYDGECSSNPDD 305 Query: 769 GKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVARH 888 HAV IVG+G EG D Y++ +NS+GT WG GY + R+ Sbjct: 306 
LSHAVVIVGYGSEGDDDYWIIKNSWGTSWGMEGYAYIRRN 345 >ref|XP_004307286.1| PREDICTED: oryzain beta chain-like [Fragaria vesca subsp. vesca] Length = 344 Score = 61.2 bits (147), Expect = 9e-07 Identities = 57/214 (26%), Positives = 86/214 (40%) Frame = +1 Query: 253 VRNQEKTNACXXXXXXXXXXGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGPYELN 432 VR+Q + +C G G +P S QE+VD + + G + N Sbjct: 142 VRDQGRCGSCWAFSAVAAVEGLHKINTGKLVPLSEQELVDCD---VNTGNQGCRGGFMEN 198 Query: 433 ERVPYSYRTVFKYATEIGISPVELYPWNGKCKFREWDGSVWDRVVKCKGKVIEPSKKIFI 612 F Y + GI+ + YP+ G DG+ K G I + + Sbjct: 199 ---------AFDYIRKYGITTQKDYPYTGS------DGTCNKSKQKKSGVKIGGYETVPE 243 Query: 613 DGKKILQAREIDDYLRHQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIV 792 + +K LQA + HQP++ I + GI+ G C D H VT V Sbjct: 244 NDEKSLQAA-----VAHQPVSVAIDASGFAMQLYSSGIFSG--LLCGKSLD---HGVTAV 293 Query: 793 GFGGEGKDAYFVCQNSYGTGWGYRGYFKVARHLI 894 G+G E Y++ +NS+GT WG GY ++ R I Sbjct: 294 GYGEENGLKYWIVKNSWGTNWGESGYIRITRDYI 327 >gb|ETN70308.1| papain family cysteine protease [Necator americanus] Length = 414 Score = 60.8 bits (146), Expect = 1e-06 Identities = 34/99 (34%), Positives = 60/99 (60%), Gaps = 3/99 (3%) Frame = +1 Query: 598 KKIFIDGKKILQARE--IDDYLR-HQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDV 768 ++++I + L + E + D++ + P+T + VT+ L +++ GI+ C S + Sbjct: 302 ERVYIRSYRTLSSNEDAVADWIAANGPVTFGMNVTKSLYSYRS-GIFSPSKEDCEEHS-L 359 Query: 769 GKHAVTIVGFGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 885 G HA+T VG+G EG Y++ +NS+G+ WG GYFK+AR Sbjct: 360 GSHALTFVGYGTEGGQPYWLVKNSWGSRWGQNGYFKMAR 398 >gb|ETN61493.1| cathepsin b [Anopheles darlingi] Length = 339 Score = 60.8 bits (146), Expect = 1e-06 Identities = 32/79 (40%), Positives = 45/79 (56%) Frame = +1 Query: 649 DYLRHQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIVGFGGEGKDAYFV 828 + + + P+ G V E++ +K G+YR G VGKHAV I+G+G EG Y++ Sbjct: 249 EIMTNGPVEGGFDVYEDVFLYKS-GVYRH-----VYGEHVGKHAVRIIGWGREGGIPYWL 302 Query: 829 CQNSYGTGWGYRGYFKVAR 885 NSYG WG GYFK+ R Sbjct: 303 ISNSYGEDWGDHGYFKIVR 321 >ref|XP_005366697.1| PREDICTED: pro-cathepsin H [Microtus 
ochrogaster] Length = 333 Score = 60.8 bits (146), Expect = 1e-06 Identities = 54/211 (25%), Positives = 87/211 (41%) Frame = +1 Query: 253 VRNQEKTNACXXXXXXXXXXGAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGPYELN 432 V+NQ +C A G + + Q++VD F P + Sbjct: 130 VKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNF-NNHGCQGGLPSQAF 188 Query: 433 ERVPYSYRTVFKYATEIGISPVELYPWNGKCKFREWDGSVWDRVVKCKGKVIEPSKKIFI 612 E + Y+ GI + YP+ G+ DG K V + + Sbjct: 189 EYILYNK----------GIMGEDTYPYRGR------DGHCKFNPQKAIAFVKDVANITLN 232 Query: 613 DGKKILQAREIDDYLRHQPLTGQIKVTEELNAWKGDGIYRGGDTTCYMGSDVGKHAVTIV 792 D K +++A + H P++ +VTE+ ++ GIY TTC+ D HAV V Sbjct: 233 DEKAMVEAVAL-----HNPVSFAFEVTEDFMLYR-KGIY--SSTTCHQTPDKVNHAVLAV 284 Query: 793 GFGGEGKDAYFVCQNSYGTGWGYRGYFKVAR 885 G+G + Y++ +NS+GT WG +GYF + R Sbjct: 285 GYGEQDGVPYWIVKNSWGTQWGDKGYFLIER 315 >dbj|BAN20308.1| cathepsin L [Riptortus pedestris] Length = 331 Score = 60.8 bits (146), Expect = 1e-06 Identities = 54/201 (26%), Positives = 87/201 (43%), Gaps = 5/201 (2%) Frame = +1 Query: 313 GAQVRQNGHAIPGSPQEVVDYGSLFIRPKDAPASGPYELNERVPYSYRTVFKYATEIGIS 492 G R+ G + S Q ++D S + G Y +R+ +KY + GI Sbjct: 149 GQNYRKTGRLVSLSEQNLLDCSSNIWYGNNGCNGG---------YMFRS-YKYIKKNGID 198 Query: 493 PVELYPWNGKCKFREWDGSVWDRVVKCKGKVIEPSKKIFIDGKKILQAREIDDYLRHQPL 672 E YP++GK V+KC+ ++ I + ++ ++ Y + Sbjct: 199 TEESYPYDGK-------------VIKCRFN----NETIGANITGYIRVKKDSQYALQDAV 241 Query: 673 TGQIKVTEELNAWKGDGIYRGG---DTTCYMGSDVGKHAVTIVGFGGE--GKDAYFVCQN 837 V L +K Y GG D C G+ + HA +VG+G E GKD Y++ +N Sbjct: 242 ANVGPVAVGLEVYKSFRYYNGGVYYDAQC--GTSLQNHAALVVGYGTEEDGKD-YWLVKN 298 Query: 838 SYGTGWGYRGYFKVARHLITN 900 S+GT WG GY K+ R+ T+ Sbjct: 299 SWGTHWGLDGYIKMIRNFPTS 319
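
Note: each alignment in this report carries a statistics line of the form "Score = ... bits (...), Expect = ... Identities = a/b (p%)". The following is a minimal sketch, not part of the BLAST output, of how those fields might be pulled out of the report text with a regular expression. The names HIT_STATS and parse_hit_stats are hypothetical, and the pattern assumes the legacy NCBI BLAST text layout seen above; the sample string is copied from the first alignment (XP_006347646.1).

```python
import re

# Per-alignment statistics as printed by legacy BLAST text reports, e.g.
# "Score = 77.8 bits (190), Expect = 9e-12 Identities = 58/199 (29%), ..."
# Whitespace is matched loosely so the pattern also works on flattened text.
HIT_STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\)"
)


def parse_hit_stats(text):
    """Return one dict per alignment statistics line found in `text`.

    Hypothetical helper for illustration; it does not read hit
    descriptions or the Query/Sbjct alignment rows.
    """
    hits = []
    for m in HIT_STATS.finditer(text):
        evalue = m["evalue"]
        # Legacy BLAST may print very small E-values as "e-100" (no mantissa).
        if evalue.startswith("e"):
            evalue = "1" + evalue
        ident, length = int(m["ident"]), int(m["length"])
        hits.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": float(evalue),
            "identities": ident,
            "align_length": length,
            # Recomputed from the counts rather than the rounded "(29%)".
            "pct_identity": 100.0 * ident / length,
        })
    return hits


# Statistics line of the first alignment in the report above.
sample = (
    "Score = 77.8 bits (190), Expect = 9e-12 "
    "Identities = 58/199 (29%), Positives = 95/199 (47%), "
    "Gaps = 14/199 (7%) Frame = +1"
)
stats = parse_hit_stats(sample)
print(stats)
```

Recomputing the percentage from the identity counts avoids relying on the rounded figure BLAST prints. For anything beyond a quick look, a dedicated BLAST report parser is likely the safer choice.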