BLASTX nr result
ID: Mentha22_contig00041146
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00041146 (395 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_001029851.1| Papain family cysteine protease containing p...    94   2e-17
ref|XP_004351999.1| cysteine protease [Dictyostelium fasciculatu...    90   3e-16
dbj|BAK00754.1| predicted protein [Hordeum vulgare subsp. vulgare]     89   8e-16
gb|EFA83712.1| cysteine proteinase 1 [Polysphondylium pallidum P...    89   8e-16
ref|XP_005598385.1| PREDICTED: cathepsin F [Equus caballus]            87   2e-15
gb|EGB13210.1| hypothetical protein AURANDRAFT_18666 [Aureococcu...    87   3e-15
ref|XP_005305555.1| PREDICTED: cathepsin F-like isoform X1 [Chry...    85   9e-15
ref|XP_004368288.1| cysteine proteinase precursor, putative [Aca...    85   9e-15
emb|CAA26255.1| cysteine proteinase I precursor [Dictyostelium d...    85   9e-15
ref|XP_635417.1| cysteine proteinase 1 [Dictyostelium discoideum...    84   2e-14
ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dicty...    84   3e-14
ref|XP_003419606.1| PREDICTED: cathepsin F [Loxodonta africana]        83   3e-14
emb|CCD12886.1| unnamed protein product [Trypanosoma congolense ...    83   3e-14
emb|CCD15016.1| unnamed protein product [Trypanosoma congolense ...    83   3e-14
emb|CCD11901.1| unnamed protein product [Trypanosoma congolense ...    83   5e-14
ref|XP_001439792.1| hypothetical protein [Paramecium tetraurelia...    82   6e-14
emb|CCD14094.1| unnamed protein product, partial [Trypanosoma co...    82   8e-14
ref|XP_006129733.1| PREDICTED: cathepsin F-like [Pelodiscus sine...
82 1e-13 gb|AAA18215.1| cysteine protease precursor [Trypanosoma congolense] 82 1e-13 emb|CCD11724.1| unnamed protein product, partial [Trypanosoma co... 82 1e-13 >ref|XP_001029851.1| Papain family cysteine protease containing protein [Tetrahymena thermophila] gi|89284124|gb|EAR82188.1| papain family cysteine protease [Tetrahymena thermophila SB210] Length = 330 Score = 94.0 bits (232), Expect = 2e-17 Identities = 56/130 (43%), Positives = 76/130 (58%), Gaps = 5/130 (3%) Frame = +2 Query: 20 LATLAVSQAATLSGE-VMEAFAHFTETYNKKYSDAE-WSQRMAIFAENLERINEQNRQHI 193 LA ++ +T+ + + AF FT+TYNKKYS E ++ R++IF ENL RI N+ Sbjct: 10 LAACVFARFSTMQDQDIAAAFKKFTQTYNKKYSSEEHYNARLSIFKENLRRIELFNKND- 68 Query: 194 LIGGDAVFGVTQFSDLTPAEFKSMYLNYIPSNVTFPRANIELDGAP---ATVVDWRTKGA 364 +A G+TQF+DLT EF MYL Y P + +A + L P T +DW TKGA Sbjct: 69 ----EAQHGITQFADLTHEEFADMYLGYKPQ-LRNSQAKVSLSSTPFTAPTAIDWTTKGA 123 Query: 365 VTDVKDQGQC 394 VT VK+QG C Sbjct: 124 VTPVKNQGSC 133 >ref|XP_004351999.1| cysteine protease [Dictyostelium fasciculatum] gi|328866896|gb|EGG15279.1| cysteine protease [Dictyostelium fasciculatum] Length = 347 Score = 90.1 bits (222), Expect = 3e-16 Identities = 53/131 (40%), Positives = 74/131 (56%), Gaps = 6/131 (4%) Frame = +2 Query: 20 LATLAVSQAATLSGEVMEAFAHFTETYNKKYSDAEWSQRMAIFAENLERINEQNRQHILI 199 L +A++ A LS E ++ F F YNK Y E+SQ+ F +NL RI+ N Sbjct: 9 LFLVALAAARKLSPEEIQ-FRDFQVKYNKVYGSHEFSQKFVTFKDNLNRIDTLNANAAAS 67 Query: 200 GGDAVFGVTQFSDLTPAEFKSMYLNYIPSNVTFPRANIELDGAPATVV------DWRTKG 361 G D FGV +F+DL+ EF+ Y+N +P++V A + D + T+ DWRTKG Sbjct: 68 GSDTKFGVNEFADLSVQEFRKFYMNAVPASVP-SDAQVAGDYSDETLASIPSSFDWRTKG 126 Query: 362 AVTDVKDQGQC 394 AVT VK+QGQC Sbjct: 127 AVTPVKNQGQC 137 >dbj|BAK00754.1| predicted protein [Hordeum vulgare subsp. 
vulgare] Length = 341 Score = 88.6 bits (218), Expect = 8e-16 Identities = 55/136 (40%), Positives = 74/136 (54%), Gaps = 9/136 (6%) Frame = +2 Query: 14 LALATLAVSQAATLSGEVMEA----FAHFTETYNKKYSDAE-WSQRMAIFAENLERINEQ 178 LAL LA + + S ++ F FT ++K Y E ++ R A F +NLER+ + Sbjct: 7 LALCALAAAYSYPSSDFELDLNFAKFQEFTARFSKNYKSVEEYTTRYATFLDNLERVAKL 66 Query: 179 NRQHILIGGDAVFGVTQFSDLTPAEFKSMYLNYIPSNVTFPRANI----ELDGAPATVVD 346 N+ G VFGVT+F D+TPAEFK+ YL + P + P+A + VD Sbjct: 67 NQD-----GRGVFGVTKFMDMTPAEFKATYLGFKPDEMAPPKAPVARPHRAKRNATGSVD 121 Query: 347 WRTKGAVTDVKDQGQC 394 WRTKGAVT VKDQ QC Sbjct: 122 WRTKGAVTPVKDQAQC 137 >gb|EFA83712.1| cysteine proteinase 1 [Polysphondylium pallidum PN500] Length = 465 Score = 88.6 bits (218), Expect = 8e-16 Identities = 54/127 (42%), Positives = 74/127 (58%), Gaps = 5/127 (3%) Frame = +2 Query: 29 LAVSQAATLSGEVMEA-FAHFTETYNKKYSDAEWSQRMAIFAENLERINEQNRQHILIGG 205 L VS AA + E F F YNK+Y+ +E+++R A F NL+ I+E+NR Sbjct: 11 LLVSMAAAKKLSLEETQFRQFQIKYNKQYTSSEYAERFATFKSNLKVIDEKNRDAASRKS 70 Query: 206 DAVFGVTQFSDLTPAEFKSMYLNYIPSNVTFPRANIELDGAPA----TVVDWRTKGAVTD 373 FGV +F+DL+ +EF++ YLN + + V P A + D P T DWRTKGAVT Sbjct: 71 SVRFGVNEFADLSQSEFRATYLNSVQA-VRDPNAAVAAD-LPVEDLPTAFDWRTKGAVTG 128 Query: 374 VKDQGQC 394 VK+QGQC Sbjct: 129 VKNQGQC 135 >ref|XP_005598385.1| PREDICTED: cathepsin F [Equus caballus] Length = 411 Score = 87.0 bits (214), Expect = 2e-15 Identities = 55/123 (44%), Positives = 70/123 (56%), Gaps = 4/123 (3%) Frame = +2 Query: 38 SQAATLSGEVMEAFAHFTETYNKKYSDAEWSQ-RMAIFAENLERINEQNRQHILIGGDAV 214 SQ S ++ F HF TYN+ Y E +Q RM+IFA N+ R + L G A Sbjct: 101 SQTQDFSVKMASIFKHFVTTYNRTYETKEEAQWRMSIFASNMVRAQ---KIQALDRGTAQ 157 Query: 215 FGVTQFSDLTPAEFKSMYLNYI---PSNVTFPRANIELDGAPATVVDWRTKGAVTDVKDQ 385 +GVT+FSDLT EF+++YLN + V RA D AP DWR+KGAVT+VKDQ Sbjct: 158 YGVTKFSDLTEEEFRTIYLNPLLKEEPGVKMRRAKSVGDSAPPEW-DWRSKGAVTEVKDQ 216 Query: 386 GQC 394 G C Sbjct: 217 GMC 219 >gb|EGB13210.1| hypothetical protein AURANDRAFT_18666 [Aureococcus anophagefferens] Length = 346 
Score = 86.7 bits (213), Expect = 3e-15 Identities = 59/133 (44%), Positives = 74/133 (55%), Gaps = 6/133 (4%) Frame = +2 Query: 14 LALATLAVSQAATLSGEVMEAF-AHFTETYNKKYSDAEWSQRMAIFAENLERINEQNRQH 190 L A L V AA + E F + + ++YN ++AE R IF+ NL + N Q Sbjct: 2 LKAALLLVPAAALTDESLFELFKSDYVKSYNSTEAEAE---RFTIFSANLRKTEALNAQR 58 Query: 191 ILIGGDAVFGVTQFSDLTPAEFKSMYLNYIPSNVTFPR---ANIELDGAPATVVDWRTK- 358 + DA FGVTQF DLT AEFK+ YLNY+PS A E AP + +DWRTK Sbjct: 59 V-DEDDAEFGVTQFMDLTEAEFKAQYLNYVPSEQVLAEDVYAAPEGFAAPGS-LDWRTKQ 116 Query: 359 -GAVTDVKDQGQC 394 G V+DVKDQGQC Sbjct: 117 SGVVSDVKDQGQC 129 >ref|XP_005305555.1| PREDICTED: cathepsin F-like isoform X1 [Chrysemys picta bellii] gi|530637467|ref|XP_005305556.1| PREDICTED: cathepsin F-like isoform X2 [Chrysemys picta bellii] Length = 355 Score = 85.1 bits (209), Expect = 9e-15 Identities = 59/127 (46%), Positives = 72/127 (56%), Gaps = 6/127 (4%) Frame = +2 Query: 32 AVSQAATLSGEVMEAFAHFTETYNKKYSDA-EWSQRMAIFAENLERINEQNRQHILIGGD 208 A+SQ A++ +++ F F TY K Y D E +R+ IFAENLE+ L G Sbjct: 45 ALSQNASI--QLISLFKDFLTTYKKSYKDEREAERRLQIFAENLEKARTIQE---LDQGT 99 Query: 209 AVFGVTQFSDLTPAEFKSMYLNYIPSNVTFPR-----ANIELDGAPATVVDWRTKGAVTD 373 A +GVT+FSDLT EF+S+YLN P FP A I D PA DWR GAVTD Sbjct: 100 AEYGVTKFSDLTEEEFRSLYLN--PLLAKFPARPMKPAAIPSDPPPAE-WDWREHGAVTD 156 Query: 374 VKDQGQC 394 VKDQG C Sbjct: 157 VKDQGMC 163 >ref|XP_004368288.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii str. Neff] gi|440804656|gb|ELR25533.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii str. 
Neff] Length = 330 Score = 85.1 bits (209), Expect = 9e-15 Identities = 52/132 (39%), Positives = 73/132 (55%), Gaps = 7/132 (5%) Frame = +2 Query: 20 LATLAV----SQAATLSGEVMEAFAHFTETYNKKYSDAEWSQRMAIFAENLERINEQNRQ 187 LA LAV ++A T++ E + F F Y K Y+ E+ +R+ IF +NL+RI+ N Sbjct: 11 LAALAVLFIAAEAGTMTAE--QQFRQFAAQYGKSYASEEFGERLRIFRDNLDRIDALNSA 68 Query: 188 HILIGGDAVFGVTQFSDLTPAEFKSMYLNYIPS---NVTFPRANIELDGAPATVVDWRTK 358 + A +GV +F+DLTP EFK+ YL S A +++ G + DWR K Sbjct: 69 NT----GARYGVNKFADLTPKEFKATYLKGARSAGQKKAAATAKLDMTGPLPSQFDWRDK 124 Query: 359 GAVTDVKDQGQC 394 GAVT KDQGQC Sbjct: 125 GAVTPTKDQGQC 136 >emb|CAA26255.1| cysteine proteinase I precursor [Dictyostelium discoideum] Length = 343 Score = 85.1 bits (209), Expect = 9e-15 Identities = 55/132 (41%), Positives = 72/132 (54%), Gaps = 5/132 (3%) Frame = +2 Query: 14 LALATLAVSQAATLSGEVMEAFAHFTETYNKKYSDAEWSQRMAIFAENLERINEQNRQHI 193 LA+ T+ VS + E F F + +NKKYS E+ +R IF NL +I E N I Sbjct: 9 LAVFTVFVSSRG-IPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAI 67 Query: 194 LIGGDAVFGVTQFSDLTPAEFKSMYLNYIPSNVT--FPRANI---ELDGAPATVVDWRTK 358 D FGV +F+DL+ EFK+ YLN + T P A+ E + T DWRT+ Sbjct: 68 NHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTR 127 Query: 359 GAVTDVKDQGQC 394 GAVT VK+QGQC Sbjct: 128 GAVTPVKNQGQC 139 >ref|XP_635417.1| cysteine proteinase 1 [Dictyostelium discoideum AX4] gi|166201987|sp|P04988.2|CYSP1_DICDI RecName: Full=Cysteine proteinase 1; Flags: Precursor gi|60463731|gb|EAL61909.1| cysteine proteinase 1 [Dictyostelium discoideum AX4] Length = 343 Score = 84.3 bits (207), Expect = 2e-14 Identities = 55/132 (41%), Positives = 72/132 (54%), Gaps = 5/132 (3%) Frame = +2 Query: 14 LALATLAVSQAATLSGEVMEAFAHFTETYNKKYSDAEWSQRMAIFAENLERINEQNRQHI 193 LA+ T+ VS + E F F + +NKKYS E+ +R IF NL +I E N I Sbjct: 9 LAVFTVFVSSRG-IPLEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAI 67 Query: 194 LIGGDAVFGVTQFSDLTPAEFKSMYLNYIPSNVT--FPRANI---ELDGAPATVVDWRTK 358 D FGV +F+DL+ EFK+ YLN + T P A+ E + T DWRT+ Sbjct: 68 
NHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTR 127 Query: 359 GAVTDVKDQGQC 394 GAVT VK+QGQC Sbjct: 128 GAVTPVKNQGQC 139 >ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum] gi|325085467|gb|EGC38873.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum] Length = 346 Score = 83.6 bits (205), Expect = 3e-14 Identities = 54/132 (40%), Positives = 72/132 (54%), Gaps = 10/132 (7%) Frame = +2 Query: 29 LAVSQAATLSGEVMEA--FAHFTETYNKKYSDAEWSQRMAIFAENLERINEQNRQHILIG 202 L V+ AA +G +E F F + YNK YS E+S + F NL I + N++ L Sbjct: 11 LFVAFAAAKNGHTIEQTQFVAFQQKYNKVYSSNEYSAKFETFKANLGVIAQLNQKAKLHK 70 Query: 203 GDAVFGVTQFSDLTPAEFKSMYLNYIPSNVTFPRANI--------ELDGAPATVVDWRTK 358 D FGV +F+DL+ AEF+ YLN + V P A++ E+ T DWRTK Sbjct: 71 SDTKFGVNEFADLSAAEFRKYYLN---AQVAKPDASLPMAPLLTEEVLETIPTAFDWRTK 127 Query: 359 GAVTDVKDQGQC 394 GAVT VK+QGQC Sbjct: 128 GAVTGVKNQGQC 139 >ref|XP_003419606.1| PREDICTED: cathepsin F [Loxodonta africana] Length = 473 Score = 83.2 bits (204), Expect = 3e-14 Identities = 52/123 (42%), Positives = 69/123 (56%), Gaps = 5/123 (4%) Frame = +2 Query: 41 QAATLSGEVMEAFAHFTETYNKKYSDAEWSQ-RMAIFAENLERINEQNRQHILIGGDAVF 217 Q SG++ F +F TYN+ Y E ++ RM++FA N+ R + L G A + Sbjct: 164 QPQDFSGKMASIFKNFVTTYNRTYETKEETKWRMSVFANNMIRAQ---KLQALDQGTAQY 220 Query: 218 GVTQFSDLTPAEFKSMYLNYIPSNVTFPRANIELDGAPATVV----DWRTKGAVTDVKDQ 385 G+T+FSDLT EF+++YLN P P + L AP V DWRTKGAVT VKDQ Sbjct: 221 GITKFSDLTEEEFRTIYLN--PLLREDPGQKMRLGKAPKGPVPPDWDWRTKGAVTKVKDQ 278 Query: 386 GQC 394 G C Sbjct: 279 GMC 281 >emb|CCD12886.1| unnamed protein product [Trypanosoma congolense IL3000] Length = 361 Score = 83.2 bits (204), Expect = 3e-14 Identities = 51/134 (38%), Positives = 75/134 (55%), Gaps = 5/134 (3%) Frame = +2 Query: 8 IKLALATLAVSQAATLSGEVMEAFAHFTETYNKKYSDA-EWSQRMAIFAENLERINEQNR 184 + +AL L Q+ + + FA F + Y++ Y DA E + R +F +N+ER E+ Sbjct: 24 VPVALGVLHAEQS------LQQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAA 77 Query: 185 
QHILIGGDAVFGVTQFSDLTPAEFKSMYLN---YIPSNVTFPRANIELD-GAPATVVDWR 352 + A FGVT+FSD++P EF++ Y N Y + + PR + + G P VDWR Sbjct: 78 ANPY----ATFGVTRFSDMSPEEFRATYHNGAEYYAAALKRPRKVVNVSTGRPPMTVDWR 133 Query: 353 TKGAVTDVKDQGQC 394 KGAVT VKDQG+C Sbjct: 134 KKGAVTPVKDQGKC 147 >emb|CCD15016.1| unnamed protein product [Trypanosoma congolense IL3000] Length = 361 Score = 83.2 bits (204), Expect = 3e-14 Identities = 51/134 (38%), Positives = 75/134 (55%), Gaps = 5/134 (3%) Frame = +2 Query: 8 IKLALATLAVSQAATLSGEVMEAFAHFTETYNKKYSDA-EWSQRMAIFAENLERINEQNR 184 + +AL L Q+ + + FA F + Y++ Y DA E + R +F +N+ER E+ Sbjct: 24 VPVALGVLHAEQS------LQQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAA 77 Query: 185 QHILIGGDAVFGVTQFSDLTPAEFKSMYLN---YIPSNVTFPRANIELD-GAPATVVDWR 352 + A FGVT+FSD++P EF++ Y N Y + + PR + + G P VDWR Sbjct: 78 ANPY----ATFGVTRFSDMSPEEFRATYHNGAEYYAAALKRPRKVVNVSTGRPPMTVDWR 133 Query: 353 TKGAVTDVKDQGQC 394 KGAVT VKDQG+C Sbjct: 134 KKGAVTPVKDQGKC 147 >emb|CCD11901.1| unnamed protein product [Trypanosoma congolense IL3000] Length = 361 Score = 82.8 bits (203), Expect = 5e-14 Identities = 51/134 (38%), Positives = 75/134 (55%), Gaps = 5/134 (3%) Frame = +2 Query: 8 IKLALATLAVSQAATLSGEVMEAFAHFTETYNKKYSDA-EWSQRMAIFAENLERINEQNR 184 + +AL L Q+ + + FA F + Y++ Y DA E + R +F +N+ER E+ Sbjct: 24 VPVALGVLHAEQS------LQQQFAAFKQKYSRSYRDATEEAFRFRVFKQNMERAKEEAA 77 Query: 185 QHILIGGDAVFGVTQFSDLTPAEFKSMYLN---YIPSNVTFPRANIELD-GAPATVVDWR 352 + A FGVT+FSD++P EF++ Y N Y + + PR + + G P VDWR Sbjct: 78 ANPY----ATFGVTRFSDMSPEEFRATYHNGAEYYAAALKRPRKVVNVSTGRPPMTVDWR 133 Query: 353 TKGAVTDVKDQGQC 394 KGAVT VKDQG+C Sbjct: 134 KKGAVTPVKDQGKC 147 >ref|XP_001439792.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124406987|emb|CAK72395.1| unnamed protein product [Paramecium tetraurelia] Length = 350 Score = 82.4 bits (202), Expect = 6e-14 Identities = 45/108 (41%), Positives = 64/108 (59%), Gaps = 2/108 (1%) Frame = +2 Query: 77 FAHFTETYNKKYSDAEWSQRMAIFAENLERINEQNRQHILIGGDAVFGVTQFSDLTPAEF 256 F ++ T+NK+YS +E 
R+ ++ NL I +N++ G +FG TQF+DLT EF Sbjct: 62 FTNYQATFNKQYSGSELLYRLQVYEANLADIKARNQKL----GREIFGETQFTDLTDEEF 117 Query: 257 KSMYLNYI--PSNVTFPRANIELDGAPATVVDWRTKGAVTDVKDQGQC 394 + YL P ++ P+A E AT +DWRT+GAV VKDQGQC Sbjct: 118 AATYLTLKVNPDDLEVPKAQFE--NVNATPIDWRTRGAVNKVKDQGQC 163 >emb|CCD14094.1| unnamed protein product, partial [Trypanosoma congolense IL3000] Length = 307 Score = 82.0 bits (201), Expect = 8e-14 Identities = 53/135 (39%), Positives = 76/135 (56%), Gaps = 6/135 (4%) Frame = +2 Query: 8 IKLALATLAVSQAATLSGEVMEAFAHFTETYNKKYSDA-EWSQRMAIFAENLERINEQNR 184 + +AL L Q+ + + FA F + Y++ Y DA E + R +F +N+ER E+ Sbjct: 24 VPVALGVLHAEQS------LQQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAA 77 Query: 185 QHILIGGDAVFGVTQFSDLTPAEFKSMYLN---YIPSNVTFPR--ANIELDGAPATVVDW 349 + A FGVT+FSD++P EF++ Y N Y + + PR N+ AP TV DW Sbjct: 78 ANPY----ATFGVTRFSDMSPEEFRATYHNGAEYYAAALKRPRKVVNVSTGKAPKTV-DW 132 Query: 350 RTKGAVTDVKDQGQC 394 R KGAVT VKDQG+C Sbjct: 133 RKKGAVTPVKDQGKC 147 >ref|XP_006129733.1| PREDICTED: cathepsin F-like [Pelodiscus sinensis] Length = 355 Score = 81.6 bits (200), Expect = 1e-13 Identities = 56/126 (44%), Positives = 72/126 (57%), Gaps = 5/126 (3%) Frame = +2 Query: 32 AVSQAATLSGEVMEAFAHFTETYNKKYSDA-EWSQRMAIFAENLERINEQNRQHILIGGD 208 A+SQ A++ +++ F F TY K Y D E +R+ IFAENLE+ L G Sbjct: 45 ALSQNASV--QLISLFKDFLTTYKKSYKDQRETKKRLLIFAENLEKARTIQE---LDQGT 99 Query: 209 AVFGVTQFSDLTPAEFKSMYLN----YIPSNVTFPRANIELDGAPATVVDWRTKGAVTDV 376 A +GVT+FSDLT EF+++YLN +P P A I D P T DWR GAVT+V Sbjct: 100 AEYGVTKFSDLTEDEFRNLYLNPLLAKLPGRPMKPAA-IPSD-PPPTEWDWREHGAVTEV 157 Query: 377 KDQGQC 394 KDQG C Sbjct: 158 KDQGMC 163 >gb|AAA18215.1| cysteine protease precursor [Trypanosoma congolense] Length = 444 Score = 81.6 bits (200), Expect = 1e-13 Identities = 51/134 (38%), Positives = 74/134 (55%), Gaps = 5/134 (3%) Frame = +2 Query: 8 IKLALATLAVSQAATLSGEVMEAFAHFTETYNKKYSDA-EWSQRMAIFAENLERINEQNR 184 + +AL L Q+ + + FA F + Y++ Y DA E + R +F +N+ER E+ Sbjct: 24 
VPVALGVLHAEQS------LQQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAA 77 Query: 185 QHILIGGDAVFGVTQFSDLTPAEFKSMYLN---YIPSNVTFPRANIELD-GAPATVVDWR 352 + A FGVT+FSD++P EF++ Y N Y + + PR + + G VDWR Sbjct: 78 ANPY----ATFGVTRFSDMSPEEFRATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWR 133 Query: 353 TKGAVTDVKDQGQC 394 KGAVT VKDQGQC Sbjct: 134 KKGAVTPVKDQGQC 147 >emb|CCD11724.1| unnamed protein product, partial [Trypanosoma congolense IL3000] Length = 380 Score = 81.6 bits (200), Expect = 1e-13 Identities = 51/134 (38%), Positives = 74/134 (55%), Gaps = 5/134 (3%) Frame = +2 Query: 8 IKLALATLAVSQAATLSGEVMEAFAHFTETYNKKYSDA-EWSQRMAIFAENLERINEQNR 184 + +AL L Q+ + + FA F + Y++ Y DA E + R +F +N+ER E+ Sbjct: 24 VPVALGVLHAEQS------LQQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAA 77 Query: 185 QHILIGGDAVFGVTQFSDLTPAEFKSMYLN---YIPSNVTFPRANIELD-GAPATVVDWR 352 + A FGVT+FSD++P EF++ Y N Y + + PR + + G VDWR Sbjct: 78 ANPY----ATFGVTRFSDMSPEEFRATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWR 133 Query: 353 TKGAVTDVKDQGQC 394 KGAVT VKDQGQC Sbjct: 134 KKGAVTPVKDQGQC 147