BLASTX nr result
ID: Mentha23_contig00004267
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00004267
         (1344 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002864490.1| hypothetical protein ARALYDRAFT_918859 [Arab...   62   4e-07
dbj|BAB86960.1| cathepsin L [Fasciola gigantica]                      62   7e-07
gb|ABN50361.2| cathepsin L [Fasciola hepatica]                        60   3e-06
gb|AAA29137.1| cathepsin [Fasciola hepatica]                          59   4e-06
gb|ABG00259.1| cathepsin L2 [Fasciola hepatica]                       59   5e-06
ref|XP_005917627.1| PREDICTED: cathepsin K-like [Haplochromis bu...   59   6e-06
ref|XP_002160197.2| PREDICTED: dipeptidyl peptidase 1-like [Hydr...   59   6e-06
ref|XP_006787967.1| PREDICTED: cathepsin K-like [Neolamprologus ...   58   8e-06
ref|XP_005728566.1| PREDICTED: cathepsin K-like isoform X1 [Pund...   58   8e-06
ref|XP_004549306.1| PREDICTED: uncharacterized protein LOC101471...   58   8e-06
gb|AAK38169.1| cathepsin L-like [Fasciola hepatica]                   58   8e-06

>ref|XP_002864490.1| hypothetical protein ARALYDRAFT_918859 [Arabidopsis lyrata subsp. lyrata]
 gi|297310325|gb|EFH40749.1| hypothetical protein ARALYDRAFT_918859 [Arabidopsis lyrata subsp. lyrata]
Length = 274

Score = 62.4 bits (150), Expect = 4e-07
Identities = 53/251 (21%), Positives = 107/251 (42%), Gaps = 1/251 (0%)
Frame = +2

Query: 176 MRPKNVKEKDWEDFIENVYITWTDKAGTEIVMKVHNQGRKDMCWAWASTDSLSVVDYLVD 355
           M K+ KE+ WE + +W + ++ V NQ + +CWA A +++ + +
Sbjct: 31  MPSKSSKEELWE-----LPPSWDWRDYPGVIGPVMNQKLQAICWAIALVRAVTALLNINL 85

Query: 356 KNVDVCQKFSVQEFMNFLHLHENYQDDAFKSDPKVRLSGVRNTPYNAFQYVKAYGICKED 535
           + + S+Q +N +H +++ G++N AF + G C
Sbjct: 86  PHENQIVDLSIQHAVNKVHYNKD--------------DGIQNMK-RAFSFATGEGFCTAS 130

Query: 536 DCRYTGRVDRSATWETRPTGMEKLFIRKIKEYDDGRQLNCMKFFQMLSCRGPFVGVIFLC 715
           C R D + + + + K+ E++ +N + Q + + P +G++
Sbjct: 131 QCTPNTR-DNNVFKKLVCRHPDNIHYIKVDEFEYLTNVNDEEL-QAIVVQQPVIGILRNT 188

Query: 716 SS-FHEIRDELYKGPSXXXXXXXXXXXXXXXXDARYHSILIVGYGSEDDTDFFLCKNTHG 892
           + F I +Y+ PS D +H +LI+GYG ++ +++ +N++G
Sbjct: 189 NDEFLAIGSGIYRSPSGDV-------------DVNFHQVLIIGYGYDNGKPYWIIQNSYG 235

Query: 893 RSWGKGGFGKI 925
           WG GGFG +
Sbjct: 236 EGWGNGGFGYV 246

>dbj|BAB86960.1| cathepsin L [Fasciola gigantica]
Length = 326

Score = 61.6 bits (148), Expect = 7e-07
Identities = 52/217 (23%), Positives = 93/217 (42%)
Frame = +2

Query: 266 VMKVHNQGRKDMCWAWASTDSLSVVDYLVDKNVDVCQKFSVQEFMNFLHLHENYQDDAFK 445
           V +V +QG+ CWA+++T ++ Y+ ++ VD FS Q+ ++
Sbjct: 120 VTEVKDQGQCCSCWAFSTTGTME-GQYMKNERVDT--SFSEQQLVDC------------- 163

Query: 446 SDPKVRLSGVRNTPYNAFQYVKAYGICKEDDCRYTGRVDRSATWETRPTGMEKLFIRKIK 625
           S P NA+ Y++ +G+ E Y V+ S + +L + K+
Sbjct: 164 SRPWGNNGCGGGFMENAYNYLRQFGLESESSYPYQA-VEDSCQCD------RQLGVAKVT 216

Query: 626 EYDDGRQLNCMKFFQMLSCRGPFVGVIFLCSSFHEIRDELYKGPSXXXXXXXXXXXXXXX 805
           Y G N ++ ++ GP + + S F R +Y+
Sbjct: 217 GYYTGHSGNELELQSLVGAEGPAAVAVAVDSDFMMYRGGIYQSEICSLLRLN-------- 268

Query: 806 XDARYHSILIVGYGSEDDTDFFLCKNTHGRSWGKGGF 916
           H++L VGYGS+DDTD+++ KN+ G WG+ G+
Sbjct: 269 -----HAVLTVGYGSQDDTDYWIVKNSWGTCWGEYGY 300

>gb|ABN50361.2| cathepsin L [Fasciola hepatica]
Length = 326

Score = 59.7 bits (143), Expect = 3e-06
Identities = 58/248 (23%), Positives = 100/248 (40%), Gaps = 6/248 (2%)
Frame = +2

Query: 203 DWEDFIENVYITWTDKAGTEIVMKVHNQGRKDMCWAWASTDSLSVVDYLVDKNVDVCQKF 382
           DW D+ Y+T +V NQG+ CWA+++T + V+ KN F
Sbjct: 113 DWRDYY---YVT-----------EVKNQGQCGSCWAFSTTGA---VEGQFRKNERASASF 155

Query: 383 SVQEFMNFLHLHENYQ------DDAFKSDPKVRLSGVRNTPYNAFQYVKAYGICKEDDCR 544
           S Q+ +N NY ++A++ ++ +G+ Y +Q V E C+
Sbjct: 156 SEQQLVNCTRDFGNYGCGGGYVENAYEY---LKHNGLETESYYPYQAV-------EGPCQ 205

Query: 545 YTGRVDRSATWETRPTGMEKLFIRKIKEYDDGRQLNCMKFFQMLSCRGPFVGVIFLCSSF 724
           Y GR L K+ Y + ++ ++ GP + S F
Sbjct: 206 YDGR----------------LAYAKVTGYYTVHSGDEIELKNLVGTEGPAAVALDADSDF 249

Query: 725 HEIRDELYKGPSXXXXXXXXXXXXXXXXDARYHSILIVGYGSEDDTDFFLCKNTHGRSWG 904
           + +Y+ + D H++L VGYGS+D TD+++ KN+ G WG
Sbjct: 250 MMYQSGIYQSQTCLP-------------DRLTHAVLAVGYGSQDGTDYWIVKNSWGTWWG 296

Query: 905 KGGFGKIA 928
           + G+ + A
Sbjct: 297 EDGYIRFA 304

>gb|AAA29137.1| cathepsin [Fasciola hepatica]
Length = 326

Score = 59.3 bits (142), Expect = 4e-06
Identities = 52/217 (23%), Positives = 93/217 (42%)
Frame = +2

Query: 266 VMKVHNQGRKDMCWAWASTDSLSVVDYLVDKNVDVCQKFSVQEFMNFLHLHENYQDDAFK 445
           V +V +QG CWA+++T ++ Y+ KN FS Q+ ++ NY +
Sbjct: 120 VTEVKDQGGCGSCWAFSTTGAMEG-QYM--KNEKTSISFSEQQLVDCSGPFGNYGCNGGL 176

Query: 446 SDPKVRLSGVRNTPYNAFQYVKAYGICKEDDCRYTGRVDRSATWETRPTGMEKLFIRKIK 625
           + NA++Y+K +G+ E Y R+ + R E+L + K+
Sbjct: 177 ME-------------NAYEYLKRFGLETESSYPY-----RAVEGQCRYN--EQLGVAKVT 216

Query: 626 EYDDGRQLNCMKFFQMLSCRGPFVGVIFLCSSFHEIRDELYKGPSXXXXXXXXXXXXXXX 805
           Y + ++ ++ CR P + + S F R +Y+ +
Sbjct: 217 GYYTVHSGDEVELQNLVGCRRPAAVALDVESDFMMYRSGIYQSQTCSP------------ 264

Query: 806 XDARYHSILIVGYGSEDDTDFFLCKNTHGRSWGKGGF 916
           D H +L VGYG +D TD+++ KN+ G WG+ G+
Sbjct: 265 -DRLNHGVLAVGYGIQDGTDYWIVKNSWGTWWGEDGY 300

>gb|ABG00259.1| cathepsin L2 [Fasciola hepatica]
Length = 219

Score = 58.9 bits (141), Expect = 5e-06
Identities = 57/248 (22%), Positives = 103/248 (41%), Gaps = 6/248 (2%)
Frame = +2

Query: 203 DWEDFIENVYITWTDKAGTEIVMKVHNQGRKDMCWAWASTDSLSVVDYLVDKNVDVCQKF 382
           DW D+ Y+T +V +QG+ CWA+++T + V+ KN F
Sbjct: 6   DWRDYY---YVT-----------EVKDQGQCGSCWAFSTTGA---VEGQFRKNERASASF 48

Query: 383 SVQEFMNFLHLHENY------QDDAFKSDPKVRLSGVRNTPYNAFQYVKAYGICKEDDCR 544
           S Q+ ++ NY ++A++ ++ +G+ Y +Q V E C+
Sbjct: 49  SEQQLVDCTRDFGNYGCGGGYMENAYEY---LKHNGLETESYYPYQAV-------EGPCQ 98

Query: 545 YTGRVDRSATWETRPTGMEKLFIRKIKEYDDGRQLNCMKFFQMLSCRGPFVGVIFLCSSF 724
           Y GR L K+ Y + ++ ++ GP + + S F
Sbjct: 99  YDGR----------------LAYAKVTGYYTVHSGDEIELKNLVGTEGPAAIAVDVESDF 142

Query: 725 HEIRDELYKGPSXXXXXXXXXXXXXXXXDARYHSILIVGYGSEDDTDFFLCKNTHGRSWG 904
           R +Y+ + A H++L VGYG++D TD+++ KN+ G SWG
Sbjct: 143 MMYRSGIYQSQTCLPF-------------ALNHAVLAVGYGTQDGTDYWIVKNSWGLSWG 189

Query: 905 KGGFGKIA 928
           + G+ ++A
Sbjct: 190 ERGYIRMA 197

>ref|XP_005917627.1| PREDICTED: cathepsin K-like [Haplochromis burtoni]
Length = 330

Score = 58.5 bits (140), Expect = 6e-06
Identities = 57/231 (24%), Positives = 94/231 (40%), Gaps = 9/231 (3%)
Frame = +2

Query: 263 IVMKVHNQGRKDMCWAWASTDSLSVVDYLVDKNVDVCQKFSVQEFMNF------LHLHEN 424
           +V V NQG CWA++S L ++ + K+ S Q ++ L
Sbjct: 125 LVGPVRNQGLCGSCWAFSS---LGALEGQLKKHTGTLVSLSPQNLVDCSTQDGNLGCRGG 181

Query: 425 YQDDAFKSDPKVRLSGVRNTPYNAFQYVKAYGICKEDDCRYT--GRVDRSATWETRPTGM 598
           Y A+ +R GV + + +++ K CRY+ GR + + P G
Sbjct: 182 YITKAYSY--VIRNGGVDSESFYPYEH-------KNGKCRYSVQGRAGYCSKFSVLPEGD 232

Query: 599 EKLFIRKIKEYDDGRQLNCMKFFQMLSCRGPF-VGVIFLCSSFHEIRDELYKGPSXXXXX 775
           EK+ + +L+ GP V V + SFH LY PS
Sbjct: 233 EKMLQK------------------VLASVGPISVAVNAMLESFHMYSGGLYNVPSCNPKL 274

Query: 776 XXXXXXXXXXXDARYHSILIVGYGSEDDTDFFLCKNTHGRSWGKGGFGKIA 928
           H++L+VGYG++ D++L KN+ G +WG+GG+ ++A
Sbjct: 275 IN-------------HAVLLVGYGTDGGQDYWLVKNSWGTAWGEGGYIRLA 312

>ref|XP_002160197.2| PREDICTED: dipeptidyl peptidase 1-like [Hydra vulgaris]
Length = 454

Score = 58.5 bits (140), Expect = 6e-06
Identities = 65/279 (23%), Positives = 104/279 (37%), Gaps = 16/279 (5%)
Frame = +2

Query: 164 LLPVM-RPKNVKEKDWEDFIENVYITWTDKAGTEIVMKVHNQGRKDMCWAWASTDSLSVV 340
           L PV+ RP + KD D + W K + V V NQG C+A+AS L
Sbjct: 204 LSPVLPRPSLLDGKDLPDAFD-----WRSKHSSNFVSPVRNQGNCGSCYAFASMAQLEA- 257

Query: 341 DYLVDKNVDVCQKFSVQEFMNFLHLHENYQDDAFKSDPKVRLSGVRNTPY-NAFQYVKAY 517
           ++ N + FS Q ++ L + + P+ A +Y +Y
Sbjct: 258 SARIETNNRIKPVFSTQNIVSCSPLSQGCEG---------------GFPFLTAGRYAHSY 302

Query: 518 GICKEDDCRYTGRVDRSATWETRPTGMEKLFIRKIKEYDDGRQLNCMKFFQMLSC--RGP 691
           G+ ED Y G + P + F Y G C + L+ GP
Sbjct: 303 GVITEDKYPYIGNDTKC-----NPESSDYRFFASEYGYVGGFYGGCSEVLMRLALIRYGP 357

Query: 692 FVGVIFLCSSFHEIRDELYKGPSXXXXXXXXXXXXXXXXDARYHSILIVGYGSEDDTD-- 865
           I + S F + ++ P H++L+VGYG + D
Sbjct: 358 LSVGINVTSEFLHYKGGIFYQPETHLLGSKFNPFYLTN-----HAVLVVGYGVDHDNGVK 412

Query: 866 FFLCKNTHGRSWGKGGFGKI----------AVTAFSELY 952
           +++ KN+ G WG+GGF +I ++ FS++Y
Sbjct: 413 YWIVKNSWGEGWGEGGFFRIRRGTNEIGIESIAVFSKIY 451

>ref|XP_006787967.1| PREDICTED: cathepsin K-like [Neolamprologus brichardi]
Length = 330

Score = 58.2 bits (139), Expect = 8e-06
Identities = 61/252 (24%), Positives = 99/252 (39%), Gaps = 9/252 (3%)
Frame = +2

Query: 200 KDWEDFIENVYITWTDKAGTEIVMKVHNQGRKDMCWAWASTDSLSVVDYLVDKNVDVCQK 379
           KD D + W + +V V NQG CWA++S L ++ + K
Sbjct: 107 KDVSDSSLPANVDWRKEG---LVGPVRNQGLCGSCWAFSS---LGALEGQLKKRTGTLVS 160

Query: 380 FSVQEFMNF------LHLHENYQDDAFKSDPKVRLSGVRNTPYNAFQYVKAYGICKEDDC 541
           S Q ++ L Y A+ +R GV + + +++ K C
Sbjct: 161 LSPQNLVDCSTQDGNLGCRGGYITKAYSY--VIRNGGVDSESFYPYEH-------KNGKC 211

Query: 542 RYT--GRVDRSATWETRPTGMEKLFIRKIKEYDDGRQLNCMKFFQMLSCRGPF-VGVIFL 712
           RY+ GR + + P G EK+ + +L+ GP V V +
Sbjct: 212 RYSVQGRAGYCSKFSVLPEGDEKMLQK------------------VLASVGPISVAVNAM 253

Query: 713 CSSFHEIRDELYKGPSXXXXXXXXXXXXXXXXDARYHSILIVGYGSEDDTDFFLCKNTHG 892
           SFH LY PS H++L+VGYG++ D++L KN+ G
Sbjct: 254 LESFHMYSGGLYNVPSCNPKLIN-------------HAVLLVGYGTDAGQDYWLVKNSWG 300

Query: 893 RSWGKGGFGKIA 928
           +WG+GG+ ++A
Sbjct: 301 TAWGEGGYIRLA 312

>ref|XP_005728566.1| PREDICTED: cathepsin K-like isoform X1 [Pundamilia nyererei]
 gi|548352616|ref|XP_005728567.1| PREDICTED: cathepsin K-like isoform X2 [Pundamilia nyererei]
Length = 330

Score = 58.2 bits (139), Expect = 8e-06
Identities = 57/231 (24%), Positives = 93/231 (40%), Gaps = 9/231 (3%)
Frame = +2

Query: 263 IVMKVHNQGRKDMCWAWASTDSLSVVDYLVDKNVDVCQKFSVQEFMNF------LHLHEN 424
           +V V NQG CWA++S L ++ + K S Q ++ L
Sbjct: 125 LVGPVRNQGLCGSCWAFSS---LGALEGQLKKRTGTLVSLSPQNLVDCSTQDGNLGCRGG 181

Query: 425 YQDDAFKSDPKVRLSGVRNTPYNAFQYVKAYGICKEDDCRYT--GRVDRSATWETRPTGM 598
           Y A+ +R GV + + +++ K CRY+ GR + + P G
Sbjct: 182 YITKAYSY--VIRNGGVDSESFYPYEH-------KNGKCRYSVQGRAGYCSKFSVLPEGD 232

Query: 599 EKLFIRKIKEYDDGRQLNCMKFFQMLSCRGPF-VGVIFLCSSFHEIRDELYKGPSXXXXX 775
           EK+ + +L+ GP V V + SFH LY PS
Sbjct: 233 EKMLQK------------------VLASVGPISVAVNAMLESFHMYSGGLYNVPSCNPKL 274

Query: 776 XXXXXXXXXXXDARYHSILIVGYGSEDDTDFFLCKNTHGRSWGKGGFGKIA 928
           H++L+VGYG++ D++L KN+ G +WG+GG+ ++A
Sbjct: 275 IN-------------HAVLLVGYGTDGGQDYWLVKNSWGTAWGEGGYIRLA 312

>ref|XP_004549306.1| PREDICTED: uncharacterized protein LOC101471071 [Maylandia zebra]
Length = 730

Score = 58.2 bits (139), Expect = 8e-06
Identities = 57/231 (24%), Positives = 93/231 (40%), Gaps = 9/231 (3%)
Frame = +2

Query: 263 IVMKVHNQGRKDMCWAWASTDSLSVVDYLVDKNVDVCQKFSVQEFMNF------LHLHEN 424
           +V V NQG CWA++S L ++ + K S Q ++ L
Sbjct: 125 LVGPVRNQGLCGSCWAFSS---LGALEGQLKKRTGTLVSLSPQNLVDCSTQDGNLGCRGG 181

Query: 425 YQDDAFKSDPKVRLSGVRNTPYNAFQYVKAYGICKEDDCRYT--GRVDRSATWETRPTGM 598
           Y A+ +R GV + + +++ K CRY+ GR + + P G
Sbjct: 182 YITKAYSY--VIRNGGVDSESFYPYEH-------KNGKCRYSVQGRAGYCSKFSVLPEGD 232

Query: 599 EKLFIRKIKEYDDGRQLNCMKFFQMLSCRGPF-VGVIFLCSSFHEIRDELYKGPSXXXXX 775
           EK+ + +L+ GP V V + SFH LY PS
Sbjct: 233 EKMLQK------------------VLASVGPISVAVNAMLESFHMYSGGLYNVPSCNPKL 274

Query: 776 XXXXXXXXXXXDARYHSILIVGYGSEDDTDFFLCKNTHGRSWGKGGFGKIA 928
           H++L+VGYG++ D++L KN+ G +WG+GG+ ++A
Sbjct: 275 IN-------------HAVLLVGYGTDGGQDYWLVKNSWGTAWGEGGYIRLA 312

>gb|AAK38169.1| cathepsin L-like [Fasciola hepatica]
Length = 310

Score = 58.2 bits (139), Expect = 8e-06
Identities = 51/221 (23%), Positives = 96/221 (43%)
Frame = +2

Query: 266 VMKVHNQGRKDMCWAWASTDSLSVVDYLVDKNVDVCQKFSVQEFMNFLHLHENYQDDAFK 445
           V +V +QG CWA+++T ++ Y+ ++ + FS Q+ ++
Sbjct: 104 VTEVKDQGNCGSCWAFSTTGTMEG-QYMKNERTSI--SFSEQQLVDC------------- 147

Query: 446 SDPKVRLSGVRNTPYNAFQYVKAYGICKEDDCRYTGRVDRSATWETRPTGMEKLFIRKIK 625
           S P NA+QY+K +G+ E YT V+ + +L + K+
Sbjct: 148 SGPWGNNGCSGGLMENAYQYLKQFGLETESSYPYTA-VEGQCRYN------RQLGVAKVT 200

Query: 626 EYDDGRQLNCMKFFQMLSCRGPFVGVIFLCSSFHEIRDELYKGPSXXXXXXXXXXXXXXX 805
           Y + ++ ++ R P + + S F R +Y+ +
Sbjct: 201 GYYTVHSGSEVELKNLVGSRRPAAIAVDVESDFMMYRSGIYQSQTCLPF----------- 249

Query: 806 XDARYHSILIVGYGSEDDTDFFLCKNTHGRSWGKGGFGKIA 928
           A H++L VGYG++D TD+++ KN+ G SWG+ G+ ++A
Sbjct: 250 --ALNHAVLAVGYGTQDGTDYWIVKNSWGLSWGERGYIRMA 288