BLASTX nr result
ID: Ophiopogon26_contig00055954
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Ophiopogon26_contig00055954 (1237 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAK00754.1| predicted protein [Hordeum vulgare subsp. vulgare] 321 e-104 ref|XP_009032801.1| hypothetical protein AURANDRAFT_18666 [Aureo... 301 3e-96 gb|KYQ91485.1| hypothetical protein DLAC_08453 [Tieghemostelium ... 251 1e-76 ref|XP_004368288.1| cysteine proteinase precursor, putative [Aca... 249 4e-76 ref|XP_012756472.1| hypothetical protein SAMD00019534_028850 [Ac... 248 9e-76 ref|XP_004335426.1| cathepsin L, putative [Acanthamoeba castella... 246 6e-75 ref|XP_013762925.1| cruzipain [Thecamonas trahens ATCC 50062] >g... 244 2e-73 emb|CUI14619.1| cysteine peptidase, putative [Bodo saltans] 240 3e-71 ref|XP_012759759.1| hypothetical protein SAMD00019534_034130, pa... 236 6e-71 ref|XP_004363040.1| hypothetical protein DFA_03437 [Cavenderia f... 234 4e-70 gb|KOO53669.1| cathepsin l-like protease [Chrysochromulina sp. C... 229 2e-68 ref|XP_020433674.1| hypothetical protein PPL_05546 [Heterosteliu... 229 2e-68 ref|XP_003293312.1| hypothetical protein DICPUDRAFT_41833 [Dicty... 227 2e-67 gb|KMZ58469.1| Cysteine proteinase cathepsin F [Zostera marina] 226 7e-67 ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dicty... 223 4e-66 ref|XP_020235829.1| LOW QUALITY PROTEIN: cysteine proteinase 15A... 221 4e-65 ref|XP_020435829.1| cysteine proteinase 1 [Heterostelium album P... 224 5e-65 gb|AAF75546.1| cruzipain [Trypanosoma cruzi] 224 6e-65 gb|OQR72774.1| cathepsin L-like [Tropilaelaps mercedesae] 219 9e-65 gb|AFA34858.1| cathepsin L-like protein [Trypanosoma dionisii] 223 1e-64 >dbj|BAK00754.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 341 Score = 321 bits (823), Expect = e-104 Identities = 182/355 (51%), Positives = 220/355 (61%), Gaps = 1/355 (0%) Frame = -1 Query: 1222 LLVLLGLSAFASATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYS 1043 +++LL L A A+A P SD N A F +F SK+Y S E T RY+ Sbjct: 3 VVLLLALCALAAAYSYP-SSDFELDLNFAK-------FQEFTARFSKNYKSVEEYTTRYA 54 Query: 1042 VFRSNLARIADLNSKNGSPSFGITPFADLTAHEFAKTHLGFKPSLDEESQAARLNTPVFE 863 F NL R+A LN ++G FG+T F D+T EF T+LGFKP + A PV Sbjct: 55 TFLDNLERVAKLN-QDGRGVFGVTKFMDMTPAEFKATYLGFKPD-----EMAPPKAPVAR 108 Query: 862 LDEDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVL 683 A G+ VDWR KGAVT VKDQAQCGSCWAFSATEQIES WFLA L L Sbjct: 109 -PHRAKRNATGS----VDWRTKGAVTPVKDQAQCGSCWAFSATEQIESNWFLAGNELISL 163 Query: 682 SPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGTCKFK-AASIV 506 SPQQI+SCD TD GC GG T TAY YV+SAGGL++++AYPYSSGAG TGTC AS Sbjct: 164 SPQQIVSCDTTDGGCGGGWTYTAYQYVQSAGGLDTDAAYPYSSGAGVTGTCDNPLPASPA 223 Query: 505 AKISGFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGS 326 A+ISGF YA P CS +C QDE + A + P S+CV+AE WQ YSSG++T C S Sbjct: 224 AQISGFGYAIPTCSDSCTNQDENSMAQYMQENSPLSVCVDAEPWQFYSSGIMTVDQC-PS 282 Query: 325 AYTDLDHCVQLVGYNKPASGKSYWIVRNSWGASWGYSGYLYVEYGTNACGVADEA 161 ++ LDHCVQ VGY+ S + YWIVRNSW +WG G++ + GTN CG+ D A Sbjct: 283 DFSGLDHCVQAVGYDATGS-QPYWIVRNSWNTNWGEDGFIRLALGTNTCGIGDVA 336 >ref|XP_009032801.1| hypothetical protein AURANDRAFT_18666 [Aureococcus anophagefferens] gb|EGB13210.1| hypothetical protein AURANDRAFT_18666 [Aureococcus anophagefferens] Length = 346 Score = 301 bits (770), Expect = 3e-96 Identities = 172/362 (47%), Positives = 212/362 (58%), Gaps = 20/362 (5%) Frame = -1 Query: 1186 ATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADL 1007 A L VP+ ++T E SLF+ FK D+ KSY+S E R+++F +NL + L Sbjct: 4 AALLLVPAAALTDE---------SLFELFKSDYVKSYNSTEAEAERFTIFSANLRKTEAL 54 Query: 1006 NSKN---GSPSFGITPFADLTAHEFAKTHLGFKPS---LDEESQAARLNTPVFELDEDDN 845 N++ FG+T F DLT EF +L + PS L E+ AA + Sbjct: 55 NAQRVDEDDAEFGVTQFMDLTEAEFKAQYLNYVPSEQVLAEDVYAA-----------PEG 103 Query: 844 MMAWGANNTLVDWR--QKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQ 671 A G+ +DWR Q G V+ VKDQ QCGSCWAFSATEQIES W LA V +PQQ Sbjct: 104 FAAPGS----LDWRTKQSGVVSDVKDQGQCGSCWAFSATEQIESEWVLAGNDPLVFAPQQ 159 Query: 670 ILSCDKTDAGCDGGDTPTAYAYVKSAGGLESESAYPYSSG-AGNTGTCKFKAASIVAKIS 494 I+SCDK D GC+GG+T TAYAYV+ AGG+ ESAYPY SG +GNTG CK K + + Sbjct: 160 IVSCDKVDQGCNGGNTETAYAYVEKAGGMALESAYPYKSGTSGNTGRCK-KFETAGGDVE 218 Query: 493 GFSYATPPC-SGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYT 317 FSY P C G C QDE A +A+ GPASICVNA AWQ Y+ GV+T CG A Sbjct: 219 SFSYVVPECKKGKCNDQDEDKMAAALASHGPASICVNAGAWQTYTKGVMTNLQCGSHAAN 278 Query: 316 DLDHCVQLVGY----------NKPASGKSYWIVRNSWGASWGYSGYLYVEYGTNACGVAD 167 LDHCVQ+VGY K K W VRNSWG SWGY GY+ V+ G NACG+A+ Sbjct: 279 ALDHCVQVVGYTGYTGDAKACGKGLKDKCVWNVRNSWGTSWGYQGYIRVQMGKNACGIAN 338 Query: 166 EA 161 +A Sbjct: 339 DA 340 >gb|KYQ91485.1| hypothetical protein DLAC_08453 [Tieghemostelium lacteum] Length = 354 Score = 251 bits (641), Expect = 1e-76 Identities = 146/344 (42%), Positives = 195/344 (56%), Gaps = 8/344 (2%) Frame = -1 Query: 1168 PSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKN-G 992 P+ + + E + K+ FD++ H+K YH+ E RY F++NL +I N+ + G Sbjct: 28 PNQNQQDNYIQRERILKNQFDQWVEKHAKKYHTHREYLTRYQNFKNNLKKIEQQNAAHQG 87 Query: 991 SPSFGITPFADLTAHEFAKTHL--GFKPSLDEESQAARLNTPVFE--LDEDDNMMAWGAN 824 S FG+ F+DL+ EF K +L +KP+ + + PV + D+N+ Sbjct: 88 SAKFGMNKFSDLSEEEFTKFYLMPEYKPT--PRKSLYKKHYPVMQDAQSSDENIPL---- 141 Query: 823 NTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDA 644 N VDWR +G VT VKDQ CGSCWAFSATEQIE+AW A +LS QQI+ CD D Sbjct: 142 NLKVDWRTEGLVTPVKDQGACGSCWAFSATEQIETAWIKAGNDQVILSEQQIVDCDTNDG 201 Query: 643 GCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCS 464 GC GGD TA YV AGGL SES YPY N GTC VA ISG+ AT P Sbjct: 202 GCGGGDPHTAMDYVIKAGGLTSESQYPY---IANDGTCHTNFTP-VAHISGYYAATTP-- 255 Query: 463 GACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGY 284 ++ A +V +GP SICV+A +W YSSG++ + + +DLDHCVQ+VG Sbjct: 256 -----GNDTQLAYSVMNEGPISICVDASSWMTYSSGIIRS-----NCDSDLDHCVQIVGL 305 Query: 283 NKPASGKS---YWIVRNSWGASWGYSGYLYVEYGTNACGVADEA 161 N +G + Y+I+RNSWG WG G++YVE G + CGV EA Sbjct: 306 NVDTNGTTPIPYYIIRNSWGTDWGIDGFIYVEIGHDLCGVTQEA 349 >ref|XP_004368288.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii str. Neff] gb|ELR25533.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii str. Neff] Length = 330 Score = 249 bits (635), Expect = 4e-76 Identities = 147/332 (44%), Positives = 186/332 (56%), Gaps = 7/332 (2%) Frame = -1 Query: 1138 ADEFLAKSLFDKFKLDHSKSYHSASEET-HRYSVFRSNLARIADLNSKNGSPSFGITPFA 962 A A+ F +F + KSY ASEE R +FR NL RI LNS N +G+ FA Sbjct: 23 AGTMTAEQQFRQFAAQYGKSY--ASEEFGERLRIFRDNLDRIDALNSANTGARYGVNKFA 80 Query: 961 DLTAHEFAKTHLGFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWRQKGAVTK 782 DLT EF T+L S ++ AA + G + DWR KGAVT Sbjct: 81 DLTPKEFKATYLKGARSAGQKKAAATAKLDMT-----------GPLPSQFDWRDKGAVTP 129 Query: 781 VKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKT--DAGCDGGDTPTAYA 608 KDQ QCG WAFS TE IES WFL+ L L+PQQI+ CD+ D GCDGGD PTAY Sbjct: 130 TKDQGQCG--WAFSVTEAIESQWFLSGRKLVSLAPQQIVDCDQGNGDYGCDGGDPPTAYE 187 Query: 607 YVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFA 428 YV AGGL++E +YPY++ G C FK +++ AKIS ++Y T T++E Sbjct: 188 YVIKAGGLDTEESYPYTA---EDGQCAFKPSAVGAKISNWTYITT-------TKNETEMQ 237 Query: 427 NNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGK----S 260 +A++GP SICV+A +WQ Y GV+T+ C S LDHCV + GY+ Sbjct: 238 YGLASRGPLSICVDASSWQYYIGGVITS-LCEDS----LDHCVMITGYSVQEGWDFMKYD 292 Query: 259 YWIVRNSWGASWGYSGYLYVEYGTNACGVADE 164 W +RNSWG WGY GYLYV+ G+N CGV DE Sbjct: 293 VWNIRNSWGEDWGYGGYLYVQRGSNLCGVGDE 324 >ref|XP_012756472.1| hypothetical protein SAMD00019534_028850 [Acytostelium subglobosum LB1] dbj|GAM19710.1| hypothetical protein SAMD00019534_028850 [Acytostelium subglobosum LB1] Length = 325 Score = 248 bits (632), Expect = 9e-76 Identities = 136/318 (42%), Positives = 183/318 (57%), Gaps = 2/318 (0%) Frame = -1 Query: 1111 FDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKN-GSPSFGITPFADLTAHEFAK 935 F ++ + + Y E R S F SNLA I++ N+K+ G +FG+ F+DL+ EF K Sbjct: 26 FKQWMSKYERHYVDEKEYLIRLSNFVSNLATISEYNAKHHGRATFGLNQFSDLSIEEFRK 85 Query: 934 THLGFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGS 755 THL + P+ + SQ + D N+ VDWR KG VT VK+Q QCGS Sbjct: 86 THLNYVPTHKKASQVRQ------HFDYPSNIPE------RVDWRAKGFVTPVKNQLQCGS 133 Query: 754 CWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESE 575 CWAFSATEQIE+A+ A S QQI+ CD D GC GGD TAY YV+SAGG+ ++ Sbjct: 134 CWAFSATEQIETAFIQAGNAQQFFSEQQIVDCDPFDGGCGGGDPMTAYQYVQSAGGITTD 193 Query: 574 SAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASI 395 +AYPY++ GTC+ + VA+I + YA+ +E +AA GP SI Sbjct: 194 TAYPYTA---QDGTCEANTTTKVAQIKTYGYAS-------TAGNETQMKEAIAALGPLSI 243 Query: 394 CVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN-KPASGKSYWIVRNSWGASWGY 218 CV+AE W Y SG++T DLDHCVQ+VGY+ S Y+IVRNSWG +WG Sbjct: 244 CVDAETWMTYQSGIITTDCA-----ADLDHCVQVVGYDVDTTSNIPYYIVRNSWGTTWGQ 298 Query: 217 SGYLYVEYGTNACGVADE 164 GY+Y+ G+N CG+ +E Sbjct: 299 EGYIYIGEGSNLCGITEE 316 >ref|XP_004335426.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff] gb|ELR13413.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff] Length = 331 Score = 246 bits (627), Expect = 6e-75 Identities = 145/317 (45%), Positives = 181/317 (57%), Gaps = 2/317 (0%) Frame = -1 Query: 1111 FDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSK-NGSPSFGITPFADLTAHEFAK 935 F+ F + KSY SA E R+++F NLA A LN K G FGIT FAD++ EF Sbjct: 34 FNAFVQRYGKSYASAEEAEQRFAIFTQNLAETAALNIKYEGKTQFGITKFADMSQEEFQS 93 Query: 934 THLGFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWRQK-GAVTKVKDQAQCG 758 L P + R P FE + A +T DWR K G VT V DQ QCG Sbjct: 94 RVLMSNPPPPPTEKPYR--GPKFE--------GFTAPSTF-DWRNKPGVVTPVYDQGQCG 142 Query: 757 SCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLES 578 SCWAFSATE IES W LA L LS QQI+ C D GC GG AY YV A GL++ Sbjct: 143 SCWAFSATENIESQWALAGHKLTGLSMQQIVDCSWWDDGCGGGFPSYAYDYVIDAPGLDA 202 Query: 577 ESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPAS 398 + YPY++ G+C FK + +VAKIS ++Y T +E AN +A GP S Sbjct: 203 LANYPYTAVG---GSCAFKESQVVAKISSWTYTT-------TDSNEHQMANYLAQHGPIS 252 Query: 397 ICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKSYWIVRNSWGASWGY 218 +CV+AE+W Y+ GV A ACG T +DHCV VGYN A+ YWI+RNSWG SWG Sbjct: 253 VCVDAESWPSYTGGVYRASACG----TSIDHCVLAVGYNLTAN-PPYWIIRNSWGTSWGL 307 Query: 217 SGYLYVEYGTNACGVAD 167 GY+++E+GT+AC VA+ Sbjct: 308 EGYMHLEFGTDACAVAE 324 >ref|XP_013762925.1| cruzipain [Thecamonas trahens ATCC 50062] gb|KNC45942.1| cruzipain [Thecamonas trahens ATCC 50062] Length = 394 Score = 244 bits (623), Expect = 2e-73 Identities = 141/325 (43%), Positives = 187/325 (57%), Gaps = 8/325 (2%) Frame = -1 Query: 1111 FDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKN----GSPSFGITPFADLTAHE 944 F FK + + Y S+ E + VF++N + A L + N G FG++PF DLT +E Sbjct: 23 FALFKETYKRQYASSKAEAAAFEVFKTNAEKAAKLEAANKAAGGDAKFGMSPFMDLTENE 82 Query: 943 FAKTHLGFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWR--QKGAVTKVKDQ 770 F +L K ++ E AA L PV GA DWR + +T VK+Q Sbjct: 83 FKARYLMPKGAV--EGGAAEL--PVLRASNV------GALPKAYDWRDHKPAVITPVKNQ 132 Query: 769 AQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAG 590 QCGSCWAFSA ++ES W LA L VLS QQ++ CD TD GC+GGDT +AY Y++ AG Sbjct: 133 GQCGSCWAFSAVSEVESMWALAGHELVVLSEQQVVDCDTTDDGCNGGDTISAYHYIEKAG 192 Query: 589 GLESESAYPYSSGAGNTGTCKFKAA--SIVAKISGFSYATPPCSGACKTQDEATFANNVA 416 GL E YPY++ G CK VAKI G++YAT P T++E A N+ Sbjct: 193 GLVPEKDYPYTA---RDGKCKDSVVKKDAVAKIMGYNYATSP-----STKNETQLAANLM 244 Query: 415 AKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKSYWIVRNSW 236 + GP SICV+A +WQ Y+SG+L+ CG LDHCVQ+ G+ S + YW VRNSW Sbjct: 245 STGPVSICVDASSWQTYTSGILS--HCG----KQLDHCVQITGWGTSGS-EMYWWVRNSW 297 Query: 235 GASWGYSGYLYVEYGTNACGVADEA 161 SWG SGY+ +++G N CG+ADEA Sbjct: 298 ATSWGMSGYIQLKFGQNTCGLADEA 322 >emb|CUI14619.1| cysteine peptidase, putative [Bodo saltans] Length = 466 Score = 240 bits (613), Expect = 3e-71 Identities = 137/348 (39%), Positives = 184/348 (52%), Gaps = 2/348 (0%) Frame = -1 Query: 1216 VLLGLSAFASATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVF 1037 +L L AFA+ + + +D++ ++ F+ FK H KSY + SEET+R +VF Sbjct: 7 LLCALIAFAAVSSVSATTDAL-----------RASFESFKAKHGKSYATPSEETYRLTVF 55 Query: 1036 RSNLARIADLNSKNGSPSFGITPFADLTAHEFAKTHLGFKPSLDEESQAARLNTPVFELD 857 N+ + LN+KN FG +PFAD+T EF H G K + T + Sbjct: 56 AENIRKAEILNAKNPQARFGASPFADMTETEFKSYHNGDKYFSARVQELKSDKTTYYPRY 115 Query: 856 EDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSP 677 D + A N DWR +GAVT VK+Q QCGSCWAFS T +E W LA TL LS Sbjct: 116 TDAQVKAAPTNK---DWRTEGAVTAVKNQGQCGSCWAFSTTGGVEGQWQLAGNTLVSLSE 172 Query: 676 QQILSCDKTDAGCDGGDTPTAYAYV--KSAGGLESESAYPYSSGAGNTGTCKFKAASIVA 503 QQ++SCD D+GC+GG AY ++ G SE++YPY SG G C + A Sbjct: 173 QQLVSCDTVDSGCNGGLMNNAYEWILANKGGEFVSEASYPYVSGGGTAPACDATQGTNAA 232 Query: 502 KISGFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSA 323 KI+G DE + GP S+ ++A AWQ+Y GV+T CGGSA Sbjct: 233 KITGHYNI---------YHDEDQMKAWIGENGPLSLAIDASAWQMYMGGVMT--TCGGSA 281 Query: 322 YTDLDHCVQLVGYNKPASGKSYWIVRNSWGASWGYSGYLYVEYGTNAC 179 LDH V +VGY YWI +NSWGASWG +GY+YV +G++ C Sbjct: 282 ---LDHGVLIVGYQFENQATPYWIFKNSWGASWGEAGYIYVAFGSDQC 326 >ref|XP_012759759.1| hypothetical protein SAMD00019534_034130, partial [Acytostelium subglobosum LB1] dbj|GAM20238.1| hypothetical protein SAMD00019534_034130, partial [Acytostelium subglobosum LB1] Length = 338 Score = 236 bits (601), Expect = 6e-71 Identities = 141/357 (39%), Positives = 192/357 (53%), Gaps = 4/357 (1%) Frame = -1 Query: 1222 LLVLLGLSAFASATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYS 1043 L+ L AF + +P + E E++A FDK +D ++ Y R S Sbjct: 9 LVATLTTLAFVEVNAVRLPGRTRNYEQQFREWMAD--FDKVYVDDAEYYR-------RLS 59 Query: 1042 VFRSNLARIADLNSKN-GSPSFGITPFADLTAHEFAKTHLGFKPSLDEESQAARLNTPVF 866 F +NL IA N + G +FG+ FADL+ EF +L F+ + + ++ P Sbjct: 60 NFITNLGTIARNNRMHKGRATFGVNKFADLSMEEFKSYYLNFETDRTPKREPTNVSYP-- 117 Query: 865 ELDEDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPV 686 N+ + VDWRQKG VT VK+Q QCGSCWAFSA EQIESA+ + + Sbjct: 118 -----SNIPSQ------VDWRQKGYVTPVKNQEQCGSCWAFSAAEQIESAYIMLGNEAQI 166 Query: 685 LSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIV 506 S QQI+ CD D GC GGDT TAY YV++AGGL + ++YPY++ GTC Sbjct: 167 ASEQQIVDCDSFDGGCGGGDTMTAYKYVETAGGLTTNASYPYTA---QDGTCYANKTKKF 223 Query: 505 AKISGFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGS 326 K++ ++YA+ +E +AA GP SICV+A +W Y SG++T+ CG Sbjct: 224 VKVTNYNYAS-------SQGNETQLKEAIAALGPLSICVDAISWMTYQSGIITSN-CG-- 273 Query: 325 AYTDLDHCVQLVGYNKPAS---GKSYWIVRNSWGASWGYSGYLYVEYGTNACGVADE 164 DLDHCVQLVGY +S Y+IVRNSWG WG GY+Y+ G N CG+ DE Sbjct: 274 --NDLDHCVQLVGYAIESSVTPNIPYYIVRNSWGLDWGQEGYIYIGEGQNLCGITDE 328 >ref|XP_004363040.1| hypothetical protein DFA_03437 [Cavenderia fasciculata] gb|EGG25189.1| hypothetical protein DFA_03437 [Cavenderia fasciculata] Length = 341 Score = 234 bits (596), Expect = 4e-70 Identities = 133/331 (40%), Positives = 187/331 (56%), Gaps = 4/331 (1%) Frame = -1 Query: 1144 NVADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKNG-SPSFGITP 968 + AD++ + F + ++H+K YH E R S F N+ I +N + G + +FG+ Sbjct: 23 STADDYTTR--FKTWMVEHNKMYHEEEEFYLRLSNFIRNIHSIEKMNRQYGRTATFGLNK 80 Query: 967 FADLTAHEFAKTHL--GFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWRQKG 794 F+DL+ EF K +L +KP AR+ F N+ A +DWR KG Sbjct: 81 FSDLSLDEFKKHYLMPNYKPK-------ARVTKETFNYPS--NIPA------TLDWRTKG 125 Query: 793 AVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTA 614 VT VK+Q CGSCWAFSATEQIE+A +A G + LS QQI+ CD D GC GGD TA Sbjct: 126 YVTPVKNQLMCGSCWAFSATEQIETANIMAGGQVEYLSEQQIVDCDPYDGGCGGGDPYTA 185 Query: 613 YAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEAT 434 Y YV++ GGL YPY++ G C + + +++ F YA+ +E Sbjct: 186 YQYVQNNGGLTLNVTYPYTAA---NGACYANSTAPAVQVTAFGYAS-------SQGNETQ 235 Query: 433 FANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKS-Y 257 +AA+GP SICVNAE W Y SG+ ++ + DLDHCVQ+VGY+ A+ K+ Y Sbjct: 236 LREAMAARGPLSICVNAEPWMSYQSGIFSS-----TCSDDLDHCVQIVGYDTDATSKTPY 290 Query: 256 WIVRNSWGASWGYSGYLYVEYGTNACGVADE 164 +IVRNSWG WG GY+Y++ G+N CG+ +E Sbjct: 291 FIVRNSWGTDWGLLGYIYIQAGSNLCGITNE 321 >gb|KOO53669.1| cathepsin l-like protease [Chrysochromulina sp. CCMP291] Length = 345 Score = 229 bits (585), Expect = 2e-68 Identities = 141/359 (39%), Positives = 194/359 (54%), Gaps = 7/359 (1%) Frame = -1 Query: 1216 VLLGLSAFASATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKS--YHSASEETHRYS 1043 V++ LS ++A + P + + +F+ KF D Y SA+E R++ Sbjct: 5 VVVALSIISTAAQRPADPTDMHLDPAFPQFM------KFMTDFRNGVPYSSAAETLGRFT 58 Query: 1042 VFRSNLARIADLNSKNGSPSFGITPFADLTAHEFAKTHLGFKPSLDEESQAARLNTPVFE 863 F++NL I + N+K G + GIT FADLT EF +L +P + A R Sbjct: 59 AFKANLQLIGERNAK-GQETHGITKFADLTREEFKAQYLTLRPPT---ANALR------S 108 Query: 862 LDEDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVL 683 + + D+++ DW KGA T VK+Q QCGSCWAFSATEQ+ES ++ G L L Sbjct: 109 MKQLDHLVQANYTAASTDWCAKGACTPVKNQGQCGSCWAFSATEQLESQYYQTYGKLIEL 168 Query: 682 SPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESESAYPYSSG-AGNTGTCKFKAASIV 506 SPQQ+ SCD GC+GG+ A+ YV S GG ESES YPY SG TG+C K A + Sbjct: 169 SPQQLTSCDPNCGGCNGGNPINAWIYVNSFGGQESESDYPYVSGVTKQTGSCSSKIAEVT 228 Query: 505 AKIS---GFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKA- 338 + G+ A P E+ + P SI V+AE WQ Y+ G++ K+ Sbjct: 229 EAVGADVGYFIAQRPA-------QESNMLKQIGL-SPMSIAVDAELWQTYTGGIIGPKSG 280 Query: 337 CGGSAYTDLDHCVQLVGYNKPASGKSYWIVRNSWGASWGYSGYLYVEYGTNACGVADEA 161 CG T +DH VQ+ GYN A G +YWIVRNSWG +WG SG++Y+ YG N CG+ +A Sbjct: 281 CG----TTIDHAVQVTGYN--AEG-NYWIVRNSWGPNWGESGFVYLTYGDNVCGITSQA 332 >ref|XP_020433674.1| hypothetical protein PPL_05546 [Heterostelium album PN500] gb|EFA81557.1| hypothetical protein PPL_05546 [Heterostelium album PN500] Length = 341 Score = 229 bits (584), Expect = 2e-68 Identities = 138/352 (39%), Positives = 194/352 (55%), Gaps = 2/352 (0%) Frame = -1 Query: 1213 LLGLSAFASATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFR 1034 L+ L A AS + + ++S G + A +F + F ++ H KSY SE R S + Sbjct: 8 LVCLVAIASVDAIRIQNNS--GFHRARDFEGE--FRQWMTKHEKSYADDSEYYLRLSHYI 63 Query: 1033 SNLARIADLNSKN-GSPSFGITPFADLTAHEFAKTHLGFKPSLDEESQAARLNTPVFELD 857 NL +AD N K+ G F F+DL+ EF +L + P+ + ++ + N D Sbjct: 64 KNLRTVADYNKKHAGMAKFAPNKFSDLSIEEFRAGYLNYVPNKLIKDRSTKQN-----FD 118 Query: 856 EDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSP 677 N+ +DWRQKG VT VK+Q QCGSCWAFSA EQIE+A+ +A +S Sbjct: 119 YPANIPV------SLDWRQKGFVTPVKNQEQCGSCWAFSAGEQIETAYIMAGNAAQNVSE 172 Query: 676 QQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKI 497 QQI+ CD D GC GGD TAY YV+SAGG+ + + YPY++ GTC + +I Sbjct: 173 QQIVDCDPYDGGCGGGDPMTAYQYVQSAGGITTNTDYPYTA---TDGTCYAQNTPKFTQI 229 Query: 496 SGFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYT 317 + + YA+ +E +AA+GP SICV+AE W Y SGVL + + Sbjct: 230 ASYGYAS-------NKGNETELKQAIAARGPLSICVDAETWMNYQSGVLNS-----NCPD 277 Query: 316 DLDHCVQLVGYN-KPASGKSYWIVRNSWGASWGYSGYLYVEYGTNACGVADE 164 +LDHCVQ+VGY+ + ++ Y+IVRNSWG WG GY+ V G N CG+ DE Sbjct: 278 ELDHCVQIVGYDVEQSTNTPYYIVRNSWGTDWGMEGYILVGEGQNLCGITDE 329 >ref|XP_003293312.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum] gb|EGC30167.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum] Length = 352 Score = 227 bits (579), Expect = 2e-67 Identities = 134/334 (40%), Positives = 185/334 (55%), Gaps = 13/334 (3%) Frame = -1 Query: 1123 AKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKN-GSPSFGITPFADLTAH 947 +K LF + + K Y ++ E R+S F++NL +I +LN+ + G SFG+ ++DL+ Sbjct: 35 SKDLFHHWTKQNGKIYETSEEFEKRFSNFKTNLKKIENLNNLHKGKASFGMNKYSDLSEE 94 Query: 946 EFAKTHL--GFKPSLDEESQAARL------NTPVFELDEDDNMMAWGANNTLVDWRQKGA 791 EF+ +L FK +EE + N L+ DD + A VDWR KG Sbjct: 95 EFSNFYLMKNFKGKPEEERDYIKKPENPSSNLIGGYLNTDDGLKAMYQ----VDWRNKGL 150 Query: 790 VTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAY 611 VT VKDQ QCGSC+ FSATEQIES + A +LS QQ + CD D GC GGD Y Sbjct: 151 VTPVKDQGQCGSCYIFSATEQIESEYIRAGHKAILLSEQQSVDCDTMDGGCGGGDPANVY 210 Query: 610 AYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATF 431 Y+ SAGG+ +E YPY++ GTC F V+ I+GF Y T + DE T Sbjct: 211 NYIISAGGVSTEKDYPYTA---QDGTC-FNTTRAVS-ITGFQYVT-------QNSDEDTL 258 Query: 430 ANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN----KPASGK 263 +A GP SICV+A WQ Y+ G++T ++DHCVQ+VG + P++ Sbjct: 259 ITTIANHGPVSICVDASTWQSYTGGIITT-----GCEQNIDHCVQVVGLDIDKTDPSNPI 313 Query: 262 SYWIVRNSWGASWGYSGYLYVEYGTNACGVADEA 161 Y+I+RNSWG SWG GY+YV G+N CG+ E+ Sbjct: 314 PYYIIRNSWGTSWGDKGYIYVAQGSNLCGITYES 347 >gb|KMZ58469.1| Cysteine proteinase cathepsin F [Zostera marina] Length = 377 Score = 226 bits (577), Expect = 7e-67 Identities = 142/351 (40%), Positives = 180/351 (51%), Gaps = 26/351 (7%) Frame = -1 Query: 1147 ENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKNGSPSFGITP 968 EN D+ L KS F F +SKSY + E HRY +FRSN R + S + GIT Sbjct: 45 ENEEDDHLLKSEFTSFVSRYSKSYETTEEHDHRYKIFRSNFRRAQRNQVLDPSATHGITK 104 Query: 967 FADLTAHEFAKTHLGFK---PSLDEESQAARLNT------PVFELDEDDNMMAWGANNTL 815 F+DLT EF +LG K PS+ +++ A + T P +L ED Sbjct: 105 FSDLTTEEFESQYLGLKKPKPSIFQKTNPASIGTHEAATLPTTDLPED------------ 152 Query: 814 VDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCD------- 656 DWR GAVT VKDQ CGSCW+FSA +E A +LA G L LS QQ++ CD Sbjct: 153 FDWRDLGAVTPVKDQGVCGSCWSFSAAAALEGANYLATGKLIGLSEQQMVDCDHVCDPTD 212 Query: 655 --KTDAGCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSY 482 DAGC+GG A++Y+ +GGLESE YPY+ G+ GTCKF + I A ++ FS Sbjct: 213 SRSCDAGCNGGLMTNAFSYLMQSGGLESEKDYPYT---GSDGTCKFDKSKIAASVANFSV 269 Query: 481 ATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHC 302 + DE A N+ GP ++ +NA Q Y GV C + LDH Sbjct: 270 I---------SSDEDQIAANLVKYGPLAVGINAAFMQTYIGGVSCPYICFKNY---LDHG 317 Query: 301 VQLVGYNKPASG--------KSYWIVRNSWGASWGYSGYLYVEYGTNACGV 173 V LVGY ASG K YWI++NSWG SWG GY + G N CGV Sbjct: 318 VLLVGYG--ASGYSQLRFKNKPYWIIKNSWGDSWGEDGYYKICRGNNICGV 366 >ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum] gb|EGC38873.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum] Length = 346 Score = 223 bits (569), Expect = 4e-66 Identities = 139/340 (40%), Positives = 184/340 (54%), Gaps = 20/340 (5%) Frame = -1 Query: 1126 LAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSK----NGSPSFGITPFAD 959 + ++ F F+ ++K Y S++E + ++ F++NL IA LN K FG+ FAD Sbjct: 24 IEQTQFVAFQQKYNKVY-SSNEYSAKFETFKANLGVIAQLNQKAKLHKSDTKFGVNEFAD 82 Query: 958 LTAHEFAKTHLGFKPSLDEES--QAARLNTPVFELDEDDNMMAWGANNTLVDWRQKGAVT 785 L+A EF K +L + + + S A L V E T DWR KGAVT Sbjct: 83 LSAAEFRKYYLNAQVAKPDASLPMAPLLTEEVLETIP-----------TAFDWRTKGAVT 131 Query: 784 KVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCD----------KTDAGCD 635 VK+Q QCGSCW+FS T IE W+LA TL LS Q ++ CD DAGCD Sbjct: 132 GVKNQGQCGSCWSFSTTGNIEGQWYLAGNTLVGLSEQNLVDCDHQCMEYDGQKSCDAGCD 191 Query: 634 GGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGAC 455 GG P AY YV GGL+SE++YPY + G+ +CKFK+ ++ AKIS F+ Sbjct: 192 GGLQPNAYRYVIENGGLDSENSYPYLAVTGD--SCKFKSGNVAAKISNFTMI-------- 241 Query: 454 KTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN-- 281 Q+E A +A GP +I +A WQ Y GV CG S LDH + +VG++ Sbjct: 242 -PQNETQMAGYLATHGPLAIAADAAEWQFYIGGVFDL-PCGQS----LDHGILIVGFSAE 295 Query: 280 KPASG--KSYWIVRNSWGASWGYSGYLYVEYGTNACGVAD 167 K G K YWIV+NSWGASWG GYLY+ G N CGV+D Sbjct: 296 KNIFGHLKPYWIVKNSWGASWGEQGYLYLGKGKNLCGVSD 335 >ref|XP_020235829.1| LOW QUALITY PROTEIN: cysteine proteinase 15A-like [Cajanus cajan] Length = 351 Score = 221 bits (563), Expect = 4e-65 Identities = 147/369 (39%), Positives = 190/369 (51%), Gaps = 18/369 (4%) Frame = -1 Query: 1225 SLLVLLGLSAFASATRLPVPSDSITGENVADEFL-AKSLFDKFKLDHSKSYHSASEETHR 1049 SLL LL L+A A+ R VP E D L A+ F FK KSY + E HR Sbjct: 5 SLLALLLLAAVAAXIRQVVPG----AEPEEDHLLNAEHHFSTFKARFGKSYATKEEHDHR 60 Query: 1048 YSVFRSNLARIADLNSK-NGSPSFGITPFADLTAHEFAKTHLGFKP-SLDEESQAARLNT 875 + VF SNL R A L++K + S G+T F+DLT EF + LG KP L +Q A + Sbjct: 61 FGVFESNLRR-ARLHAKLDPSAVHGVTKFSDLTPAEFRRQFLGLKPLRLPAHAQNAPV-L 118 Query: 874 PVFELDEDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGT 695 P +L +D DWR KGAVT VKDQ CGSCW+FS T +E A +LA G Sbjct: 119 PTKDLPKD------------FDWRDKGAVTNVKDQGSCGSCWSFSTTGALEGAHYLATGE 166 Query: 694 LPVLSPQQILSCDKT---------DAGCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGN 542 L S QQ++ CD DAGC+GG A+ Y+ +GG++ E YPY+ G Sbjct: 167 LLSFSEQQLVDCDHVCDPEEYGACDAGCNGGLMNNAFEYILESGGIQLEKDYPYT---GR 223 Query: 541 TGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYS 362 GTCKF + +VA +S +S + DE A N+ GP ++ +NA Q Y Sbjct: 224 DGTCKFDKSKVVATVSNYSVV---------SLDEDQIAANLVKNGPLAVGINAVYMQTYI 274 Query: 361 SGVLTAKACGGSAYTDLDHCVQLVGYNKPA------SGKSYWIVRNSWGASWGYSGYLYV 200 GV CG +LDH V LVGY + A K YWI++NSWG +WG +GY + Sbjct: 275 GGVSCPYICG----KNLDHGVLLVGYGEGAYAPIRFKEKPYWILKNSWGENWGENGYYKI 330 Query: 199 EYGTNACGV 173 G N CGV Sbjct: 331 CRGRNVCGV 339 >ref|XP_020435829.1| cysteine proteinase 1 [Heterostelium album PN500] gb|EFA83712.1| cysteine proteinase 1 [Heterostelium album PN500] Length = 465 Score = 224 bits (571), Expect = 5e-65 Identities = 130/334 (38%), Positives = 181/334 (54%), Gaps = 18/334 (5%) Frame = -1 Query: 1126 LAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLN----SKNGSPSFGITPFAD 959 L ++ F +F++ ++K Y S SE R++ F+SNL I + N S+ S FG+ FAD Sbjct: 23 LEETQFRQFQIKYNKQYTS-SEYAERFATFKSNLKVIDEKNRDAASRKSSVRFGVNEFAD 81 Query: 958 LTAHEFAKTHLGFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWRQKGAVTKV 779 L+ EF T+L ++ + + A + PV +L T DWR KGAVT V Sbjct: 82 LSQSEFRATYLNSVQAVRDPNAAVAADLPVEDLP------------TAFDWRTKGAVTGV 129 Query: 778 KDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDK----------TDAGCDGG 629 K+Q QCGSCW+FS T +E WFLA TL LS Q ++ CD D GC+GG Sbjct: 130 KNQGQCGSCWSFSTTGNVEGQWFLAGNTLTGLSEQNLVDCDHECMEYLGDNVCDQGCNGG 189 Query: 628 DTPTAYAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKT 449 P AY Y+ GG+++E++YPY G GTC FKAA+I AKIS ++Y + Sbjct: 190 LQPNAYTYIIKNGGIDTEASYPYQ---GVDGTCSFKAANIGAKISNWTYV---------S 237 Query: 448 QDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPAS 269 +E A + A GP +I +A WQ Y GV CG + LDH + +VGY+ + Sbjct: 238 SNETQMAAYLVANGPLAIAADAVEWQFYLGGVFDV-PCGNT----LDHGILIVGYSAENT 292 Query: 268 ----GKSYWIVRNSWGASWGYSGYLYVEYGTNAC 179 K+YWIV+NSWGA+WG GY+Y+ G C Sbjct: 293 IFHKDKAYWIVKNSWGATWGEQGYIYISRGNGEC 326 >gb|AAF75546.1| cruzipain [Trypanosoma cruzi] Length = 467 Score = 224 bits (571), Expect = 6e-65 Identities = 136/328 (41%), Positives = 183/328 (55%), Gaps = 2/328 (0%) Frame = -1 Query: 1138 ADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKNGSPSFGITPFAD 959 A+E LA S F +FK H + Y SA+EE R SVFR NL + N +FG+TPF+D Sbjct: 30 AEETLA-SQFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSD 88 Query: 958 LTAHEFAKTHLGFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWRQKGAVTKV 779 LT EF + + AR+ PV N+ GA VDWR +GAVT V Sbjct: 89 LTREEFRSRYHNGAAHFAAAQERARV--PV-------NVEVVGAP-AAVDWRARGAVTAV 138 Query: 778 KDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYV- 602 KDQ QCGSCWAFSA +E WFLA L LS Q ++SCDKTD+GC GG A+ ++ Sbjct: 139 KDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCGGGLMNNAFGWIV 198 Query: 601 -KSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFAN 425 ++ G + +E++YPY+SG G + C ++ A I+G + P QDEA A Sbjct: 199 QENNGAVYTENSYPYASGEGISPPCTTSGHTVGATITG--HVELP-------QDEAQIAA 249 Query: 424 NVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKSYWIVR 245 +A GP ++ V+A +W Y+ GV+T+ LDH V LVGYN A+ YWI++ Sbjct: 250 WLAVNGPVAVAVDASSWMTYTGGVMTS-----CVSEQLDHGVLLVGYNDSAA-VPYWIIK 303 Query: 244 NSWGASWGYSGYLYVEYGTNACGVADEA 161 NSW A WG GY+ + G+N C V +EA Sbjct: 304 NSWTAQWGEDGYIRIAKGSNQCLVKEEA 331 >gb|OQR72774.1| cathepsin L-like [Tropilaelaps mercedesae] Length = 331 Score = 219 bits (559), Expect = 9e-65 Identities = 143/367 (38%), Positives = 186/367 (50%), Gaps = 10/367 (2%) Frame = -1 Query: 1231 MKSLLVLLGLSAFASATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETH 1052 M SL+VLL + A A R+P P A+ + +F+ H K YH SEE Sbjct: 1 MHSLIVLLAVVGAALAVRVPRPD-------------AEHHWAEFRRTHQKQYHG-SEELQ 46 Query: 1051 RYSVFRSNLARIADLNSKNGSPS---FGITPFADLTAHEFAKTHLGFKPSLDEESQAARL 881 R +F NL I + N N S + GI FAD+T EF KT LG + S + S A Sbjct: 47 RRFIFEDNLYIIQEFNRVNASEAGFRLGINQFADMTNEEFRKTFLGHRYSANHVSHA--- 103 Query: 880 NTPVFELDEDDNMMAWGANN--TLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFL 707 D A G N VDW KG VT VK+Q QCGSCWAFS T +E F Sbjct: 104 ---------DSTFEATGIQNLPAKVDWTTKGYVTPVKNQGQCGSCWAFSTTGSLEGQHFK 154 Query: 706 AKGTLPVLSPQQILSCDKTDA--GCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGT 533 G L LS Q ++ C GC+GG A+ Y+K+ GG+++E +YPYS+ G Sbjct: 155 KTGKLVSLSEQNLIDCSDAQGNNGCNGGLMDLAFDYIKANGGIDTEQSYPYSA---VDGI 211 Query: 532 CKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEA--WQLYSS 359 C+FK +I AK++G+ K DE+ VA GP SI ++A + +QLYSS Sbjct: 212 CEFKKRAIGAKVTGYV--------DIKNGDESALKEAVATVGPVSIAIDASSPHFQLYSS 263 Query: 358 GVLTAKACGGSAYTDLDHCVQLVGYNKPASGKSYWIVRNSWGASWGYSGYL-YVEYGTNA 182 GV TA C +LDH V VGY GK YW+V+NSWG SWG GY+ + N Sbjct: 264 GVYTASDCSS---VELDHGVLAVGYGH-EDGKDYWLVKNSWGTSWGIDGYIKMIRNKDNR 319 Query: 181 CGVADEA 161 CG+A +A Sbjct: 320 CGIATQA 326 >gb|AFA34858.1| cathepsin L-like protein [Trypanosoma dionisii] Length = 467 Score = 223 bits (569), Expect = 1e-64 Identities = 134/328 (40%), Positives = 174/328 (53%), Gaps = 2/328 (0%) Frame = -1 Query: 1138 ADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKNGSPSFGITPFAD 959 A+E LA S F FK + + Y SA+EE R SVFR NL + N +FG+TPF+D Sbjct: 30 AEETLA-SQFADFKQRYGRVYKSAAEEAFRLSVFRKNLLDAKLHAAANPHATFGVTPFSD 88 Query: 958 LTAHEFAKTHLGFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWRQKGAVTKV 779 LT EF H + AR+ V G VDWR +GAVT V Sbjct: 89 LTREEFRSRHHSGAAHFAAGRKRARVPVDV----------GVGDAPAAVDWRDRGAVTPV 138 Query: 778 KDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYV- 602 KDQ QCGSCWAFSA +E WFLA L LS Q ++SCD D+GCDGG +A+ ++ Sbjct: 139 KDQGQCGSCWAFSAIGNVEGQWFLAGNALTSLSEQMLVSCDTMDSGCDGGLMNSAFEWIV 198 Query: 601 -KSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFAN 425 G + +E +Y Y+SG G C+ ++ A I+G P DEA A Sbjct: 199 EHHNGTVYTEESYRYASGDGIAQPCRTSGRTVGAVITGHVKLPP---------DEAKMAT 249 Query: 424 NVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKSYWIVR 245 +AA GP ++ V+A +W Y+ GVLT+ +LDH V LVGYN A+ YWIV+ Sbjct: 250 WLAANGPLAVAVDASSWMFYTGGVLTS-----CVSNELDHGVLLVGYNDSAA-PPYWIVK 303 Query: 244 NSWGASWGYSGYLYVEYGTNACGVADEA 161 NSWG WG GY+ + GTN C V +EA Sbjct: 304 NSWGTLWGEDGYVRIAKGTNQCLVKEEA 331