BLASTX nr result
ID: Ophiopogon25_contig00029201
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Ophiopogon25_contig00029201 (1251 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAK00754.1| predicted protein [Hordeum vulgare subsp. vulgare] 321 e-104 ref|XP_009032801.1| hypothetical protein AURANDRAFT_18666 [Aureo... 300 7e-96 ref|XP_004368288.1| cysteine proteinase precursor, putative [Aca... 251 5e-77 gb|KYQ91485.1| hypothetical protein DLAC_08453 [Tieghemostelium ... 250 2e-76 ref|XP_012756472.1| hypothetical protein SAMD00019534_028850 [Ac... 248 1e-75 ref|XP_004335426.1| cathepsin L, putative [Acanthamoeba castella... 246 7e-75 ref|XP_013762925.1| cruzipain [Thecamonas trahens ATCC 50062] >g... 243 3e-73 emb|CUI14619.1| cysteine peptidase, putative [Bodo saltans] 240 4e-71 ref|XP_012759759.1| hypothetical protein SAMD00019534_034130, pa... 236 6e-71 ref|XP_004363040.1| hypothetical protein DFA_03437 [Cavenderia f... 233 8e-70 gb|KOO53669.1| cathepsin l-like protease [Chrysochromulina sp. C... 229 2e-68 ref|XP_020433674.1| hypothetical protein PPL_05546 [Heterosteliu... 229 2e-68 ref|XP_003293312.1| hypothetical protein DICPUDRAFT_41833 [Dicty... 227 2e-67 gb|KMZ58469.1| Cysteine proteinase cathepsin F [Zostera marina] 226 8e-67 ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dicty... 223 5e-66 gb|OEU13361.1| cysteine proteinase [Fragilariopsis cylindrus CCM... 223 9e-66 gb|OQR72774.1| cathepsin L-like [Tropilaelaps mercedesae] 222 1e-65 ref|XP_020235829.1| LOW QUALITY PROTEIN: cysteine proteinase 15A... 221 4e-65 gb|AAF75546.1| cruzipain [Trypanosoma cruzi] 224 6e-65 gb|AFA34858.1| cathepsin L-like protein [Trypanosoma dionisii] 223 1e-64 >dbj|BAK00754.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 341 Score = 321 bits (823), Expect = e-104 Identities = 182/355 (51%), Positives = 220/355 (61%), Gaps = 1/355 (0%) Frame = -2 Query: 1238 LLVLLGLSAFASATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYS 1059 +++LL L A A+A P SD N A F +F SK+Y S E T RY+ Sbjct: 3 VVLLLALCALAAAYSYP-SSDFELDLNFAK-------FQEFTARFSKNYKSVEEYTTRYA 54 Query: 1058 VFRSNLARIADLNSKNGSPSFGITPFADLTAHEFAKTHLGFKPSLDEESQAARLNTPVFE 879 F NL R+A LN ++G FG+T F D+T EF T+LGFKP + A PV Sbjct: 55 TFLDNLERVAKLN-QDGRGVFGVTKFMDMTPAEFKATYLGFKPD-----EMAPPKAPVAR 108 Query: 878 LDEDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVL 699 A G+ VDWR KGAVT VKDQAQCGSCWAFSATEQIES WFLA L L Sbjct: 109 -PHRAKRNATGS----VDWRTKGAVTPVKDQAQCGSCWAFSATEQIESNWFLAGNELISL 163 Query: 698 SPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGTCKFK-AASIV 522 SPQQI+SCD TD GC GG T TAY YV+SAGGL++++AYPYSSGAG TGTC AS Sbjct: 164 SPQQIVSCDTTDGGCGGGWTYTAYQYVQSAGGLDTDAAYPYSSGAGVTGTCDNPLPASPA 223 Query: 521 AKISGFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGS 342 A+ISGF YA P CS +C QDE + A + P S+CV+AE WQ YSSG++T C S Sbjct: 224 AQISGFGYAIPTCSDSCTNQDENSMAQYMQENSPLSVCVDAEPWQFYSSGIMTVDQC-PS 282 Query: 341 AYTDLDHCVQLVGYNKPASGKDYWIVRNSWGASWGYSGYLYVEYGTNACGVADEA 177 ++ LDHCVQ VGY+ S + YWIVRNSW +WG G++ + GTN CG+ D A Sbjct: 283 DFSGLDHCVQAVGYDATGS-QPYWIVRNSWNTNWGEDGFIRLALGTNTCGIGDVA 336 >ref|XP_009032801.1| hypothetical protein AURANDRAFT_18666 [Aureococcus anophagefferens] gb|EGB13210.1| hypothetical protein AURANDRAFT_18666 [Aureococcus anophagefferens] Length = 346 Score = 300 bits (768), Expect = 7e-96 Identities = 172/362 (47%), Positives = 212/362 (58%), Gaps = 20/362 (5%) Frame = -2 Query: 1202 ATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADL 1023 A L VP+ ++T E SLF+ FK D+ KSY+S E R+++F +NL + L Sbjct: 4 AALLLVPAAALTDE---------SLFELFKSDYVKSYNSTEAEAERFTIFSANLRKTEAL 54 Query: 1022 NSKN---GSPSFGITPFADLTAHEFAKTHLGFKPS---LDEESQAARLNTPVFELDEDDN 861 N++ FG+T F DLT EF +L + PS L E+ AA + Sbjct: 55 NAQRVDEDDAEFGVTQFMDLTEAEFKAQYLNYVPSEQVLAEDVYAA-----------PEG 103 Query: 860 MMAWGANNTLVDWR--QKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQ 687 A G+ +DWR Q G V+ VKDQ QCGSCWAFSATEQIES W LA V +PQQ Sbjct: 104 FAAPGS----LDWRTKQSGVVSDVKDQGQCGSCWAFSATEQIESEWVLAGNDPLVFAPQQ 159 Query: 686 ILSCDKTDAGCDGGDTPTAYAYVKSAGGLESESAYPYSSG-AGNTGTCKFKAASIVAKIS 510 I+SCDK D GC+GG+T TAYAYV+ AGG+ ESAYPY SG +GNTG CK K + + Sbjct: 160 IVSCDKVDQGCNGGNTETAYAYVEKAGGMALESAYPYKSGTSGNTGRCK-KFETAGGDVE 218 Query: 509 GFSYATPPC-SGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYT 333 FSY P C G C QDE A +A+ GPASICVNA AWQ Y+ GV+T CG A Sbjct: 219 SFSYVVPECKKGKCNDQDEDKMAAALASHGPASICVNAGAWQTYTKGVMTNLQCGSHAAN 278 Query: 332 DLDHCVQLVGY----------NKPASGKDYWIVRNSWGASWGYSGYLYVEYGTNACGVAD 183 LDHCVQ+VGY K K W VRNSWG SWGY GY+ V+ G NACG+A+ Sbjct: 279 ALDHCVQVVGYTGYTGDAKACGKGLKDKCVWNVRNSWGTSWGYQGYIRVQMGKNACGIAN 338 Query: 182 EA 177 +A Sbjct: 339 DA 340 >ref|XP_004368288.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii str. Neff] gb|ELR25533.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii str. Neff] Length = 330 Score = 251 bits (641), Expect = 5e-77 Identities = 148/332 (44%), Positives = 187/332 (56%), Gaps = 7/332 (2%) Frame = -2 Query: 1154 ADEFLAKSLFDKFKLDHSKSYHSASEET-HRYSVFRSNLARIADLNSKNGSPSFGITPFA 978 A A+ F +F + KSY ASEE R +FR NL RI LNS N +G+ FA Sbjct: 23 AGTMTAEQQFRQFAAQYGKSY--ASEEFGERLRIFRDNLDRIDALNSANTGARYGVNKFA 80 Query: 977 DLTAHEFAKTHLGFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWRQKGAVTK 798 DLT EF T+L S ++ AA + G + DWR KGAVT Sbjct: 81 DLTPKEFKATYLKGARSAGQKKAAATAKLDMT-----------GPLPSQFDWRDKGAVTP 129 Query: 797 VKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKT--DAGCDGGDTPTAYA 624 KDQ QCG WAFS TE IES WFL+ L L+PQQI+ CD+ D GCDGGD PTAY Sbjct: 130 TKDQGQCG--WAFSVTEAIESQWFLSGRKLVSLAPQQIVDCDQGNGDYGCDGGDPPTAYE 187 Query: 623 YVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFA 444 YV AGGL++E +YPY++ G C FK +++ AKIS ++Y T T++E Sbjct: 188 YVIKAGGLDTEESYPYTA---EDGQCAFKPSAVGAKISNWTYITT-------TKNETEMQ 237 Query: 443 NNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGK----D 276 +A++GP SICV+A +WQ Y GV+T+ C S LDHCV + GY+ D Sbjct: 238 YGLASRGPLSICVDASSWQYYIGGVITS-LCEDS----LDHCVMITGYSVQEGWDFMKYD 292 Query: 275 YWIVRNSWGASWGYSGYLYVEYGTNACGVADE 180 W +RNSWG WGY GYLYV+ G+N CGV DE Sbjct: 293 VWNIRNSWGEDWGYGGYLYVQRGSNLCGVGDE 324 >gb|KYQ91485.1| hypothetical protein DLAC_08453 [Tieghemostelium lacteum] Length = 354 Score = 250 bits (639), Expect = 2e-76 Identities = 146/344 (42%), Positives = 194/344 (56%), Gaps = 8/344 (2%) Frame = -2 Query: 1184 PSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKN-G 1008 P+ + + E + K+ FD++ H+K YH+ E RY F++NL +I N+ + G Sbjct: 28 PNQNQQDNYIQRERILKNQFDQWVEKHAKKYHTHREYLTRYQNFKNNLKKIEQQNAAHQG 87 Query: 1007 SPSFGITPFADLTAHEFAKTHL--GFKPSLDEESQAARLNTPVFE--LDEDDNMMAWGAN 840 S FG+ F+DL+ EF K +L +KP+ + + PV + D+N+ Sbjct: 88 SAKFGMNKFSDLSEEEFTKFYLMPEYKPT--PRKSLYKKHYPVMQDAQSSDENIPL---- 141 Query: 839 NTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDA 660 N VDWR +G VT VKDQ CGSCWAFSATEQIE+AW A +LS QQI+ CD D Sbjct: 142 NLKVDWRTEGLVTPVKDQGACGSCWAFSATEQIETAWIKAGNDQVILSEQQIVDCDTNDG 201 Query: 659 GCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCS 480 GC GGD TA YV AGGL SES YPY N GTC VA ISG+ AT P Sbjct: 202 GCGGGDPHTAMDYVIKAGGLTSESQYPY---IANDGTCHTNFTP-VAHISGYYAATTP-- 255 Query: 479 GACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGY 300 ++ A +V +GP SICV+A +W YSSG++ + + +DLDHCVQ+VG Sbjct: 256 -----GNDTQLAYSVMNEGPISICVDASSWMTYSSGIIRS-----NCDSDLDHCVQIVGL 305 Query: 299 NKPASGK---DYWIVRNSWGASWGYSGYLYVEYGTNACGVADEA 177 N +G Y+I+RNSWG WG G++YVE G + CGV EA Sbjct: 306 NVDTNGTTPIPYYIIRNSWGTDWGIDGFIYVEIGHDLCGVTQEA 349 >ref|XP_012756472.1| hypothetical protein SAMD00019534_028850 [Acytostelium subglobosum LB1] dbj|GAM19710.1| hypothetical protein SAMD00019534_028850 [Acytostelium subglobosum LB1] Length = 325 Score = 248 bits (632), Expect = 1e-75 Identities = 136/318 (42%), Positives = 183/318 (57%), Gaps = 2/318 (0%) Frame = -2 Query: 1127 FDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKN-GSPSFGITPFADLTAHEFAK 951 F ++ + + Y E R S F SNLA I++ N+K+ G +FG+ F+DL+ EF K Sbjct: 26 FKQWMSKYERHYVDEKEYLIRLSNFVSNLATISEYNAKHHGRATFGLNQFSDLSIEEFRK 85 Query: 950 THLGFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGS 771 THL + P+ + SQ + D N+ VDWR KG VT VK+Q QCGS Sbjct: 86 THLNYVPTHKKASQVRQ------HFDYPSNIPE------RVDWRAKGFVTPVKNQLQCGS 133 Query: 770 CWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESE 591 CWAFSATEQIE+A+ A S QQI+ CD D GC GGD TAY YV+SAGG+ ++ Sbjct: 134 CWAFSATEQIETAFIQAGNAQQFFSEQQIVDCDPFDGGCGGGDPMTAYQYVQSAGGITTD 193 Query: 590 SAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASI 411 +AYPY++ GTC+ + VA+I + YA+ +E +AA GP SI Sbjct: 194 TAYPYTA---QDGTCEANTTTKVAQIKTYGYAS-------TAGNETQMKEAIAALGPLSI 243 Query: 410 CVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN-KPASGKDYWIVRNSWGASWGY 234 CV+AE W Y SG++T DLDHCVQ+VGY+ S Y+IVRNSWG +WG Sbjct: 244 CVDAETWMTYQSGIITTDCA-----ADLDHCVQVVGYDVDTTSNIPYYIVRNSWGTTWGQ 298 Query: 233 SGYLYVEYGTNACGVADE 180 GY+Y+ G+N CG+ +E Sbjct: 299 EGYIYIGEGSNLCGITEE 316 >ref|XP_004335426.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff] gb|ELR13413.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff] Length = 331 Score = 246 bits (627), Expect = 7e-75 Identities = 145/317 (45%), Positives = 181/317 (57%), Gaps = 2/317 (0%) Frame = -2 Query: 1127 FDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSK-NGSPSFGITPFADLTAHEFAK 951 F+ F + KSY SA E R+++F NLA A LN K G FGIT FAD++ EF Sbjct: 34 FNAFVQRYGKSYASAEEAEQRFAIFTQNLAETAALNIKYEGKTQFGITKFADMSQEEFQS 93 Query: 950 THLGFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWRQK-GAVTKVKDQAQCG 774 L P + R P FE + A +T DWR K G VT V DQ QCG Sbjct: 94 RVLMSNPPPPPTEKPYR--GPKFE--------GFTAPSTF-DWRNKPGVVTPVYDQGQCG 142 Query: 773 SCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLES 594 SCWAFSATE IES W LA L LS QQI+ C D GC GG AY YV A GL++ Sbjct: 143 SCWAFSATENIESQWALAGHKLTGLSMQQIVDCSWWDDGCGGGFPSYAYDYVIDAPGLDA 202 Query: 593 ESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPAS 414 + YPY++ G+C FK + +VAKIS ++Y T +E AN +A GP S Sbjct: 203 LANYPYTAVG---GSCAFKESQVVAKISSWTYTT-------TDSNEHQMANYLAQHGPIS 252 Query: 413 ICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKDYWIVRNSWGASWGY 234 +CV+AE+W Y+ GV A ACG T +DHCV VGYN A+ YWI+RNSWG SWG Sbjct: 253 VCVDAESWPSYTGGVYRASACG----TSIDHCVLAVGYNLTAN-PPYWIIRNSWGTSWGL 307 Query: 233 SGYLYVEYGTNACGVAD 183 GY+++E+GT+AC VA+ Sbjct: 308 EGYMHLEFGTDACAVAE 324 >ref|XP_013762925.1| cruzipain [Thecamonas trahens ATCC 50062] gb|KNC45942.1| cruzipain [Thecamonas trahens ATCC 50062] Length = 394 Score = 243 bits (621), Expect = 3e-73 Identities = 141/325 (43%), Positives = 187/325 (57%), Gaps = 8/325 (2%) Frame = -2 Query: 1127 FDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKN----GSPSFGITPFADLTAHE 960 F FK + + Y S+ E + VF++N + A L + N G FG++PF DLT +E Sbjct: 23 FALFKETYKRQYASSKAEAAAFEVFKTNAEKAAKLEAANKAAGGDAKFGMSPFMDLTENE 82 Query: 959 FAKTHLGFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWR--QKGAVTKVKDQ 786 F +L K ++ E AA L PV GA DWR + +T VK+Q Sbjct: 83 FKARYLMPKGAV--EGGAAEL--PVLRASNV------GALPKAYDWRDHKPAVITPVKNQ 132 Query: 785 AQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAG 606 QCGSCWAFSA ++ES W LA L VLS QQ++ CD TD GC+GGDT +AY Y++ AG Sbjct: 133 GQCGSCWAFSAVSEVESMWALAGHELVVLSEQQVVDCDTTDDGCNGGDTISAYHYIEKAG 192 Query: 605 GLESESAYPYSSGAGNTGTCKFKAA--SIVAKISGFSYATPPCSGACKTQDEATFANNVA 432 GL E YPY++ G CK VAKI G++YAT P T++E A N+ Sbjct: 193 GLVPEKDYPYTA---RDGKCKDSVVKKDAVAKIMGYNYATSP-----STKNETQLAANLM 244 Query: 431 AKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKDYWIVRNSW 252 + GP SICV+A +WQ Y+SG+L+ CG LDHCVQ+ G+ S + YW VRNSW Sbjct: 245 STGPVSICVDASSWQTYTSGILS--HCG----KQLDHCVQITGWGTSGS-EMYWWVRNSW 297 Query: 251 GASWGYSGYLYVEYGTNACGVADEA 177 SWG SGY+ +++G N CG+ADEA Sbjct: 298 ATSWGMSGYIQLKFGQNTCGLADEA 322 >emb|CUI14619.1| cysteine peptidase, putative [Bodo saltans] Length = 466 Score = 240 bits (613), Expect = 4e-71 Identities = 137/348 (39%), Positives = 184/348 (52%), Gaps = 2/348 (0%) Frame = -2 Query: 1232 VLLGLSAFASATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVF 1053 +L L AFA+ + + +D++ ++ F+ FK H KSY + SEET+R +VF Sbjct: 7 LLCALIAFAAVSSVSATTDAL-----------RASFESFKAKHGKSYATPSEETYRLTVF 55 Query: 1052 RSNLARIADLNSKNGSPSFGITPFADLTAHEFAKTHLGFKPSLDEESQAARLNTPVFELD 873 N+ + LN+KN FG +PFAD+T EF H G K + T + Sbjct: 56 AENIRKAEILNAKNPQARFGASPFADMTETEFKSYHNGDKYFSARVQELKSDKTTYYPRY 115 Query: 872 EDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSP 693 D + A N DWR +GAVT VK+Q QCGSCWAFS T +E W LA TL LS Sbjct: 116 TDAQVKAAPTNK---DWRTEGAVTAVKNQGQCGSCWAFSTTGGVEGQWQLAGNTLVSLSE 172 Query: 692 QQILSCDKTDAGCDGGDTPTAYAYV--KSAGGLESESAYPYSSGAGNTGTCKFKAASIVA 519 QQ++SCD D+GC+GG AY ++ G SE++YPY SG G C + A Sbjct: 173 QQLVSCDTVDSGCNGGLMNNAYEWILANKGGEFVSEASYPYVSGGGTAPACDATQGTNAA 232 Query: 518 KISGFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSA 339 KI+G DE + GP S+ ++A AWQ+Y GV+T CGGSA Sbjct: 233 KITGHYNI---------YHDEDQMKAWIGENGPLSLAIDASAWQMYMGGVMT--TCGGSA 281 Query: 338 YTDLDHCVQLVGYNKPASGKDYWIVRNSWGASWGYSGYLYVEYGTNAC 195 LDH V +VGY YWI +NSWGASWG +GY+YV +G++ C Sbjct: 282 ---LDHGVLIVGYQFENQATPYWIFKNSWGASWGEAGYIYVAFGSDQC 326 >ref|XP_012759759.1| hypothetical protein SAMD00019534_034130, partial [Acytostelium subglobosum LB1] dbj|GAM20238.1| hypothetical protein SAMD00019534_034130, partial [Acytostelium subglobosum LB1] Length = 338 Score = 236 bits (601), Expect = 6e-71 Identities = 141/357 (39%), Positives = 192/357 (53%), Gaps = 4/357 (1%) Frame = -2 Query: 1238 LLVLLGLSAFASATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYS 1059 L+ L AF + +P + E E++A FDK +D ++ Y R S Sbjct: 9 LVATLTTLAFVEVNAVRLPGRTRNYEQQFREWMAD--FDKVYVDDAEYYR-------RLS 59 Query: 1058 VFRSNLARIADLNSKN-GSPSFGITPFADLTAHEFAKTHLGFKPSLDEESQAARLNTPVF 882 F +NL IA N + G +FG+ FADL+ EF +L F+ + + ++ P Sbjct: 60 NFITNLGTIARNNRMHKGRATFGVNKFADLSMEEFKSYYLNFETDRTPKREPTNVSYP-- 117 Query: 881 ELDEDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPV 702 N+ + VDWRQKG VT VK+Q QCGSCWAFSA EQIESA+ + + Sbjct: 118 -----SNIPSQ------VDWRQKGYVTPVKNQEQCGSCWAFSAAEQIESAYIMLGNEAQI 166 Query: 701 LSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIV 522 S QQI+ CD D GC GGDT TAY YV++AGGL + ++YPY++ GTC Sbjct: 167 ASEQQIVDCDSFDGGCGGGDTMTAYKYVETAGGLTTNASYPYTA---QDGTCYANKTKKF 223 Query: 521 AKISGFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGS 342 K++ ++YA+ +E +AA GP SICV+A +W Y SG++T+ CG Sbjct: 224 VKVTNYNYAS-------SQGNETQLKEAIAALGPLSICVDAISWMTYQSGIITSN-CG-- 273 Query: 341 AYTDLDHCVQLVGYNKPAS---GKDYWIVRNSWGASWGYSGYLYVEYGTNACGVADE 180 DLDHCVQLVGY +S Y+IVRNSWG WG GY+Y+ G N CG+ DE Sbjct: 274 --NDLDHCVQLVGYAIESSVTPNIPYYIVRNSWGLDWGQEGYIYIGEGQNLCGITDE 328 >ref|XP_004363040.1| hypothetical protein DFA_03437 [Cavenderia fasciculata] gb|EGG25189.1| hypothetical protein DFA_03437 [Cavenderia fasciculata] Length = 341 Score = 233 bits (594), Expect = 8e-70 Identities = 133/331 (40%), Positives = 186/331 (56%), Gaps = 4/331 (1%) Frame = -2 Query: 1160 NVADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKNG-SPSFGITP 984 + AD++ + F + ++H+K YH E R S F N+ I +N + G + +FG+ Sbjct: 23 STADDYTTR--FKTWMVEHNKMYHEEEEFYLRLSNFIRNIHSIEKMNRQYGRTATFGLNK 80 Query: 983 FADLTAHEFAKTHL--GFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWRQKG 810 F+DL+ EF K +L +KP AR+ F N+ A +DWR KG Sbjct: 81 FSDLSLDEFKKHYLMPNYKPK-------ARVTKETFNYPS--NIPA------TLDWRTKG 125 Query: 809 AVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTA 630 VT VK+Q CGSCWAFSATEQIE+A +A G + LS QQI+ CD D GC GGD TA Sbjct: 126 YVTPVKNQLMCGSCWAFSATEQIETANIMAGGQVEYLSEQQIVDCDPYDGGCGGGDPYTA 185 Query: 629 YAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEAT 450 Y YV++ GGL YPY++ G C + + +++ F YA+ +E Sbjct: 186 YQYVQNNGGLTLNVTYPYTAA---NGACYANSTAPAVQVTAFGYAS-------SQGNETQ 235 Query: 449 FANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGK-DY 273 +AA+GP SICVNAE W Y SG+ ++ + DLDHCVQ+VGY+ A+ K Y Sbjct: 236 LREAMAARGPLSICVNAEPWMSYQSGIFSS-----TCSDDLDHCVQIVGYDTDATSKTPY 290 Query: 272 WIVRNSWGASWGYSGYLYVEYGTNACGVADE 180 +IVRNSWG WG GY+Y++ G+N CG+ +E Sbjct: 291 FIVRNSWGTDWGLLGYIYIQAGSNLCGITNE 321 >gb|KOO53669.1| cathepsin l-like protease [Chrysochromulina sp. CCMP291] Length = 345 Score = 229 bits (585), Expect = 2e-68 Identities = 141/359 (39%), Positives = 194/359 (54%), Gaps = 7/359 (1%) Frame = -2 Query: 1232 VLLGLSAFASATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKS--YHSASEETHRYS 1059 V++ LS ++A + P + + +F+ KF D Y SA+E R++ Sbjct: 5 VVVALSIISTAAQRPADPTDMHLDPAFPQFM------KFMTDFRNGVPYSSAAETLGRFT 58 Query: 1058 VFRSNLARIADLNSKNGSPSFGITPFADLTAHEFAKTHLGFKPSLDEESQAARLNTPVFE 879 F++NL I + N+K G + GIT FADLT EF +L +P + A R Sbjct: 59 AFKANLQLIGERNAK-GQETHGITKFADLTREEFKAQYLTLRPPT---ANALR------S 108 Query: 878 LDEDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVL 699 + + D+++ DW KGA T VK+Q QCGSCWAFSATEQ+ES ++ G L L Sbjct: 109 MKQLDHLVQANYTAASTDWCAKGACTPVKNQGQCGSCWAFSATEQLESQYYQTYGKLIEL 168 Query: 698 SPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESESAYPYSSG-AGNTGTCKFKAASIV 522 SPQQ+ SCD GC+GG+ A+ YV S GG ESES YPY SG TG+C K A + Sbjct: 169 SPQQLTSCDPNCGGCNGGNPINAWIYVNSFGGQESESDYPYVSGVTKQTGSCSSKIAEVT 228 Query: 521 AKIS---GFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKA- 354 + G+ A P E+ + P SI V+AE WQ Y+ G++ K+ Sbjct: 229 EAVGADVGYFIAQRPA-------QESNMLKQIGL-SPMSIAVDAELWQTYTGGIIGPKSG 280 Query: 353 CGGSAYTDLDHCVQLVGYNKPASGKDYWIVRNSWGASWGYSGYLYVEYGTNACGVADEA 177 CG T +DH VQ+ GYN A G +YWIVRNSWG +WG SG++Y+ YG N CG+ +A Sbjct: 281 CG----TTIDHAVQVTGYN--AEG-NYWIVRNSWGPNWGESGFVYLTYGDNVCGITSQA 332 >ref|XP_020433674.1| hypothetical protein PPL_05546 [Heterostelium album PN500] gb|EFA81557.1| hypothetical protein PPL_05546 [Heterostelium album PN500] Length = 341 Score = 229 bits (584), Expect = 2e-68 Identities = 138/352 (39%), Positives = 194/352 (55%), Gaps = 2/352 (0%) Frame = -2 Query: 1229 LLGLSAFASATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFR 1050 L+ L A AS + + ++S G + A +F + F ++ H KSY SE R S + Sbjct: 8 LVCLVAIASVDAIRIQNNS--GFHRARDFEGE--FRQWMTKHEKSYADDSEYYLRLSHYI 63 Query: 1049 SNLARIADLNSKN-GSPSFGITPFADLTAHEFAKTHLGFKPSLDEESQAARLNTPVFELD 873 NL +AD N K+ G F F+DL+ EF +L + P+ + ++ + N D Sbjct: 64 KNLRTVADYNKKHAGMAKFAPNKFSDLSIEEFRAGYLNYVPNKLIKDRSTKQN-----FD 118 Query: 872 EDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSP 693 N+ +DWRQKG VT VK+Q QCGSCWAFSA EQIE+A+ +A +S Sbjct: 119 YPANIPV------SLDWRQKGFVTPVKNQEQCGSCWAFSAGEQIETAYIMAGNAAQNVSE 172 Query: 692 QQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKI 513 QQI+ CD D GC GGD TAY YV+SAGG+ + + YPY++ GTC + +I Sbjct: 173 QQIVDCDPYDGGCGGGDPMTAYQYVQSAGGITTNTDYPYTA---TDGTCYAQNTPKFTQI 229 Query: 512 SGFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYT 333 + + YA+ +E +AA+GP SICV+AE W Y SGVL + + Sbjct: 230 ASYGYAS-------NKGNETELKQAIAARGPLSICVDAETWMNYQSGVLNS-----NCPD 277 Query: 332 DLDHCVQLVGYN-KPASGKDYWIVRNSWGASWGYSGYLYVEYGTNACGVADE 180 +LDHCVQ+VGY+ + ++ Y+IVRNSWG WG GY+ V G N CG+ DE Sbjct: 278 ELDHCVQIVGYDVEQSTNTPYYIVRNSWGTDWGMEGYILVGEGQNLCGITDE 329 >ref|XP_003293312.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum] gb|EGC30167.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum] Length = 352 Score = 227 bits (579), Expect = 2e-67 Identities = 134/334 (40%), Positives = 185/334 (55%), Gaps = 13/334 (3%) Frame = -2 Query: 1139 AKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKN-GSPSFGITPFADLTAH 963 +K LF + + K Y ++ E R+S F++NL +I +LN+ + G SFG+ ++DL+ Sbjct: 35 SKDLFHHWTKQNGKIYETSEEFEKRFSNFKTNLKKIENLNNLHKGKASFGMNKYSDLSEE 94 Query: 962 EFAKTHL--GFKPSLDEESQAARL------NTPVFELDEDDNMMAWGANNTLVDWRQKGA 807 EF+ +L FK +EE + N L+ DD + A VDWR KG Sbjct: 95 EFSNFYLMKNFKGKPEEERDYIKKPENPSSNLIGGYLNTDDGLKAMYQ----VDWRNKGL 150 Query: 806 VTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAY 627 VT VKDQ QCGSC+ FSATEQIES + A +LS QQ + CD D GC GGD Y Sbjct: 151 VTPVKDQGQCGSCYIFSATEQIESEYIRAGHKAILLSEQQSVDCDTMDGGCGGGDPANVY 210 Query: 626 AYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATF 447 Y+ SAGG+ +E YPY++ GTC F V+ I+GF Y T + DE T Sbjct: 211 NYIISAGGVSTEKDYPYTA---QDGTC-FNTTRAVS-ITGFQYVT-------QNSDEDTL 258 Query: 446 ANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN----KPASGK 279 +A GP SICV+A WQ Y+ G++T ++DHCVQ+VG + P++ Sbjct: 259 ITTIANHGPVSICVDASTWQSYTGGIITT-----GCEQNIDHCVQVVGLDIDKTDPSNPI 313 Query: 278 DYWIVRNSWGASWGYSGYLYVEYGTNACGVADEA 177 Y+I+RNSWG SWG GY+YV G+N CG+ E+ Sbjct: 314 PYYIIRNSWGTSWGDKGYIYVAQGSNLCGITYES 347 >gb|KMZ58469.1| Cysteine proteinase cathepsin F [Zostera marina] Length = 377 Score = 226 bits (577), Expect = 8e-67 Identities = 142/351 (40%), Positives = 180/351 (51%), Gaps = 26/351 (7%) Frame = -2 Query: 1163 ENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKNGSPSFGITP 984 EN D+ L KS F F +SKSY + E HRY +FRSN R + S + GIT Sbjct: 45 ENEEDDHLLKSEFTSFVSRYSKSYETTEEHDHRYKIFRSNFRRAQRNQVLDPSATHGITK 104 Query: 983 FADLTAHEFAKTHLGFK---PSLDEESQAARLNT------PVFELDEDDNMMAWGANNTL 831 F+DLT EF +LG K PS+ +++ A + T P +L ED Sbjct: 105 FSDLTTEEFESQYLGLKKPKPSIFQKTNPASIGTHEAATLPTTDLPED------------ 152 Query: 830 VDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCD------- 672 DWR GAVT VKDQ CGSCW+FSA +E A +LA G L LS QQ++ CD Sbjct: 153 FDWRDLGAVTPVKDQGVCGSCWSFSAAAALEGANYLATGKLIGLSEQQMVDCDHVCDPTD 212 Query: 671 --KTDAGCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSY 498 DAGC+GG A++Y+ +GGLESE YPY+ G+ GTCKF + I A ++ FS Sbjct: 213 SRSCDAGCNGGLMTNAFSYLMQSGGLESEKDYPYT---GSDGTCKFDKSKIAASVANFSV 269 Query: 497 ATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHC 318 + DE A N+ GP ++ +NA Q Y GV C + LDH Sbjct: 270 I---------SSDEDQIAANLVKYGPLAVGINAAFMQTYIGGVSCPYICFKNY---LDHG 317 Query: 317 VQLVGYNKPASG--------KDYWIVRNSWGASWGYSGYLYVEYGTNACGV 189 V LVGY ASG K YWI++NSWG SWG GY + G N CGV Sbjct: 318 VLLVGYG--ASGYSQLRFKNKPYWIIKNSWGDSWGEDGYYKICRGNNICGV 366 >ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum] gb|EGC38873.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum] Length = 346 Score = 223 bits (569), Expect = 5e-66 Identities = 139/340 (40%), Positives = 184/340 (54%), Gaps = 20/340 (5%) Frame = -2 Query: 1142 LAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSK----NGSPSFGITPFAD 975 + ++ F F+ ++K Y S++E + ++ F++NL IA LN K FG+ FAD Sbjct: 24 IEQTQFVAFQQKYNKVY-SSNEYSAKFETFKANLGVIAQLNQKAKLHKSDTKFGVNEFAD 82 Query: 974 LTAHEFAKTHLGFKPSLDEES--QAARLNTPVFELDEDDNMMAWGANNTLVDWRQKGAVT 801 L+A EF K +L + + + S A L V E T DWR KGAVT Sbjct: 83 LSAAEFRKYYLNAQVAKPDASLPMAPLLTEEVLETIP-----------TAFDWRTKGAVT 131 Query: 800 KVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCD----------KTDAGCD 651 VK+Q QCGSCW+FS T IE W+LA TL LS Q ++ CD DAGCD Sbjct: 132 GVKNQGQCGSCWSFSTTGNIEGQWYLAGNTLVGLSEQNLVDCDHQCMEYDGQKSCDAGCD 191 Query: 650 GGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGAC 471 GG P AY YV GGL+SE++YPY + G+ +CKFK+ ++ AKIS F+ Sbjct: 192 GGLQPNAYRYVIENGGLDSENSYPYLAVTGD--SCKFKSGNVAAKISNFTMI-------- 241 Query: 470 KTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN-- 297 Q+E A +A GP +I +A WQ Y GV CG S LDH + +VG++ Sbjct: 242 -PQNETQMAGYLATHGPLAIAADAAEWQFYIGGVFDL-PCGQS----LDHGILIVGFSAE 295 Query: 296 KPASG--KDYWIVRNSWGASWGYSGYLYVEYGTNACGVAD 183 K G K YWIV+NSWGASWG GYLY+ G N CGV+D Sbjct: 296 KNIFGHLKPYWIVKNSWGASWGEQGYLYLGKGKNLCGVSD 335 >gb|OEU13361.1| cysteine proteinase [Fragilariopsis cylindrus CCMP1102] Length = 368 Score = 223 bits (569), Expect = 9e-66 Identities = 127/359 (35%), Positives = 182/359 (50%), Gaps = 50/359 (13%) Frame = -2 Query: 1106 HSKSYHSASEETHRYSVFRSNLARIADLNSKNGSPS------FGITPFADLTAHEF-AKT 948 H+K+YHS E+ HR+S++ N AR A+ N ++G + FG F DL EF AK Sbjct: 4 HNKAYHSEEEKQHRFSIWSQNHARTAEKNRRHGPCTLTKQHVFGSNHFKDLAPEEFQAKF 63 Query: 947 HLGFKPSLDEESQAARLNTP--VFELDEDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCG 774 G+K + + + R P + L +D + A VDWR GA++ ++ Q +CG Sbjct: 64 LTGYKGAFTDVLEDKRQEQPPDIRRLRKDSGIYDADAFPNSVDWRDSGAISDIRTQGECG 123 Query: 773 SCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLES 594 +CWA +A E++ESA FL+ GTL LS +I+ CD + C GG A+ +V GGL Sbjct: 124 ACWAVTAVEEVESAVFLSTGTLYALSESEIIVCDDSCEMCSGGWPQNAFEWVMDHGGLPL 183 Query: 593 ESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPC------SGA-CKTQDEATFANNV 435 +S++PY + T + I G+ YAT C SG C+ QDE T NN+ Sbjct: 184 QSSFPYDAYTLIALTADYSNQGRYGNIRGYGYATDRCLCYSDGSGCDCEDQDEDTAINNI 243 Query: 434 AAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGY--------------- 300 A GP+ +C+ A WQ Y G++T+ + + D++HCVQ+VGY Sbjct: 244 ATYGPSVVCLEASTWQDYGGGIITSDSGCAQTFLDMNHCVQVVGYAFTTGSSDCNDSNDE 303 Query: 299 ----NKPASGKD---------------YWIVRNSWGASWGYSGYLYVEYGTNACGVADE 180 N SG D YWIVRN WG SWG +GY YV GTN CG+ ++ Sbjct: 304 GCDSNDENSGSDSGSNSGSGDSNGREGYWIVRNQWGDSWGMNGYAYVSMGTNTCGILND 362 >gb|OQR72774.1| cathepsin L-like [Tropilaelaps mercedesae] Length = 331 Score = 222 bits (565), Expect = 1e-65 Identities = 144/367 (39%), Positives = 187/367 (50%), Gaps = 10/367 (2%) Frame = -2 Query: 1247 MKSLLVLLGLSAFASATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETH 1068 M SL+VLL + A A R+P P A+ + +F+ H K YH SEE Sbjct: 1 MHSLIVLLAVVGAALAVRVPRPD-------------AEHHWAEFRRTHQKQYHG-SEELQ 46 Query: 1067 RYSVFRSNLARIADLNSKNGSPS---FGITPFADLTAHEFAKTHLGFKPSLDEESQAARL 897 R +F NL I + N N S + GI FAD+T EF KT LG + S + S A Sbjct: 47 RRFIFEDNLYIIQEFNRVNASEAGFRLGINQFADMTNEEFRKTFLGHRYSANHVSHA--- 103 Query: 896 NTPVFELDEDDNMMAWGANN--TLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFL 723 D A G N VDW KG VT VK+Q QCGSCWAFS T +E F Sbjct: 104 ---------DSTFEATGIQNLPAKVDWTTKGYVTPVKNQGQCGSCWAFSTTGSLEGQHFK 154 Query: 722 AKGTLPVLSPQQILSCDKTDA--GCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGT 549 G L LS Q ++ C GC+GG A+ Y+K+ GG+++E +YPYS+ G Sbjct: 155 KTGKLVSLSEQNLIDCSDAQGNNGCNGGLMDLAFDYIKANGGIDTEQSYPYSA---VDGI 211 Query: 548 CKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEA--WQLYSS 375 C+FK +I AK++G+ K DE+ VA GP SI ++A + +QLYSS Sbjct: 212 CEFKKRAIGAKVTGYV--------DIKNGDESALKEAVATVGPVSIAIDASSPHFQLYSS 263 Query: 374 GVLTAKACGGSAYTDLDHCVQLVGYNKPASGKDYWIVRNSWGASWGYSGYL-YVEYGTNA 198 GV TA C +LDH V VGY GKDYW+V+NSWG SWG GY+ + N Sbjct: 264 GVYTASDCSS---VELDHGVLAVGYGH-EDGKDYWLVKNSWGTSWGIDGYIKMIRNKDNR 319 Query: 197 CGVADEA 177 CG+A +A Sbjct: 320 CGIATQA 326 >ref|XP_020235829.1| LOW QUALITY PROTEIN: cysteine proteinase 15A-like [Cajanus cajan] Length = 351 Score = 221 bits (563), Expect = 4e-65 Identities = 147/369 (39%), Positives = 190/369 (51%), Gaps = 18/369 (4%) Frame = -2 Query: 1241 SLLVLLGLSAFASATRLPVPSDSITGENVADEFL-AKSLFDKFKLDHSKSYHSASEETHR 1065 SLL LL L+A A+ R VP E D L A+ F FK KSY + E HR Sbjct: 5 SLLALLLLAAVAAXIRQVVPG----AEPEEDHLLNAEHHFSTFKARFGKSYATKEEHDHR 60 Query: 1064 YSVFRSNLARIADLNSK-NGSPSFGITPFADLTAHEFAKTHLGFKP-SLDEESQAARLNT 891 + VF SNL R A L++K + S G+T F+DLT EF + LG KP L +Q A + Sbjct: 61 FGVFESNLRR-ARLHAKLDPSAVHGVTKFSDLTPAEFRRQFLGLKPLRLPAHAQNAPV-L 118 Query: 890 PVFELDEDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGT 711 P +L +D DWR KGAVT VKDQ CGSCW+FS T +E A +LA G Sbjct: 119 PTKDLPKD------------FDWRDKGAVTNVKDQGSCGSCWSFSTTGALEGAHYLATGE 166 Query: 710 LPVLSPQQILSCDKT---------DAGCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGN 558 L S QQ++ CD DAGC+GG A+ Y+ +GG++ E YPY+ G Sbjct: 167 LLSFSEQQLVDCDHVCDPEEYGACDAGCNGGLMNNAFEYILESGGIQLEKDYPYT---GR 223 Query: 557 TGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYS 378 GTCKF + +VA +S +S + DE A N+ GP ++ +NA Q Y Sbjct: 224 DGTCKFDKSKVVATVSNYSVV---------SLDEDQIAANLVKNGPLAVGINAVYMQTYI 274 Query: 377 SGVLTAKACGGSAYTDLDHCVQLVGYNKPA------SGKDYWIVRNSWGASWGYSGYLYV 216 GV CG +LDH V LVGY + A K YWI++NSWG +WG +GY + Sbjct: 275 GGVSCPYICG----KNLDHGVLLVGYGEGAYAPIRFKEKPYWILKNSWGENWGENGYYKI 330 Query: 215 EYGTNACGV 189 G N CGV Sbjct: 331 CRGRNVCGV 339 >gb|AAF75546.1| cruzipain [Trypanosoma cruzi] Length = 467 Score = 224 bits (571), Expect = 6e-65 Identities = 136/328 (41%), Positives = 183/328 (55%), Gaps = 2/328 (0%) Frame = -2 Query: 1154 ADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKNGSPSFGITPFAD 975 A+E LA S F +FK H + Y SA+EE R SVFR NL + N +FG+TPF+D Sbjct: 30 AEETLA-SQFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSD 88 Query: 974 LTAHEFAKTHLGFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWRQKGAVTKV 795 LT EF + + AR+ PV N+ GA VDWR +GAVT V Sbjct: 89 LTREEFRSRYHNGAAHFAAAQERARV--PV-------NVEVVGAP-AAVDWRARGAVTAV 138 Query: 794 KDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYV- 618 KDQ QCGSCWAFSA +E WFLA L LS Q ++SCDKTD+GC GG A+ ++ Sbjct: 139 KDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCGGGLMNNAFGWIV 198 Query: 617 -KSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFAN 441 ++ G + +E++YPY+SG G + C ++ A I+G + P QDEA A Sbjct: 199 QENNGAVYTENSYPYASGEGISPPCTTSGHTVGATITG--HVELP-------QDEAQIAA 249 Query: 440 NVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKDYWIVR 261 +A GP ++ V+A +W Y+ GV+T+ LDH V LVGYN A+ YWI++ Sbjct: 250 WLAVNGPVAVAVDASSWMTYTGGVMTS-----CVSEQLDHGVLLVGYNDSAA-VPYWIIK 303 Query: 260 NSWGASWGYSGYLYVEYGTNACGVADEA 177 NSW A WG GY+ + G+N C V +EA Sbjct: 304 NSWTAQWGEDGYIRIAKGSNQCLVKEEA 331 >gb|AFA34858.1| cathepsin L-like protein [Trypanosoma dionisii] Length = 467 Score = 223 bits (569), Expect = 1e-64 Identities = 134/328 (40%), Positives = 174/328 (53%), Gaps = 2/328 (0%) Frame = -2 Query: 1154 ADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKNGSPSFGITPFAD 975 A+E LA S F FK + + Y SA+EE R SVFR NL + N +FG+TPF+D Sbjct: 30 AEETLA-SQFADFKQRYGRVYKSAAEEAFRLSVFRKNLLDAKLHAAANPHATFGVTPFSD 88 Query: 974 LTAHEFAKTHLGFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWRQKGAVTKV 795 LT EF H + AR+ V G VDWR +GAVT V Sbjct: 89 LTREEFRSRHHSGAAHFAAGRKRARVPVDV----------GVGDAPAAVDWRDRGAVTPV 138 Query: 794 KDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYV- 618 KDQ QCGSCWAFSA +E WFLA L LS Q ++SCD D+GCDGG +A+ ++ Sbjct: 139 KDQGQCGSCWAFSAIGNVEGQWFLAGNALTSLSEQMLVSCDTMDSGCDGGLMNSAFEWIV 198 Query: 617 -KSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFAN 441 G + +E +Y Y+SG G C+ ++ A I+G P DEA A Sbjct: 199 EHHNGTVYTEESYRYASGDGIAQPCRTSGRTVGAVITGHVKLPP---------DEAKMAT 249 Query: 440 NVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKDYWIVR 261 +AA GP ++ V+A +W Y+ GVLT+ +LDH V LVGYN A+ YWIV+ Sbjct: 250 WLAANGPLAVAVDASSWMFYTGGVLTS-----CVSNELDHGVLLVGYNDSAA-PPYWIVK 303 Query: 260 NSWGASWGYSGYLYVEYGTNACGVADEA 177 NSWG WG GY+ + GTN C V +EA Sbjct: 304 NSWGTLWGEDGYVRIAKGTNQCLVKEEA 331