BLASTX nr result
ID: Ophiopogon27_contig00038764
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon27_contig00038764
         (757 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

dbj|BAK00754.1| predicted protein [Hordeum vulgare subsp. vulgare]    233   4e-72
ref|XP_009032801.1| hypothetical protein AURANDRAFT_18666 [Aureo...   228   5e-70
ref|XP_013762925.1| cruzipain [Thecamonas trahens ATCC 50062] >g...   188   5e-54
ref|XP_004368288.1| cysteine proteinase precursor, putative [Aca...   186   8e-54
ref|XP_004335426.1| cathepsin L, putative [Acanthamoeba castella...   181   7e-52
ref|XP_012756472.1| hypothetical protein SAMD00019534_028850 [Ac...   178   7e-51
gb|KYQ91485.1| hypothetical protein DLAC_08453 [Tieghemostelium ...   176   6e-50
ref|XP_004363040.1| hypothetical protein DFA_03437 [Cavenderia f...   175   2e-49
ref|XP_012759759.1| hypothetical protein SAMD00019534_034130, pa...   174   2e-49
ref|XP_009032695.1| hypothetical protein AURANDRAFT_19240 [Aureo...   172   2e-49
ref|XP_020433674.1| hypothetical protein PPL_05546 [Heterosteliu...   166   3e-46
ref|XP_009040822.1| hypothetical protein AURANDRAFT_5922, partia...   162   7e-46
gb|KOO53669.1| cathepsin l-like protease [Chrysochromulina sp. C...   164   3e-45
gb|KOO22621.1| cysteine proteinase [Chrysochromulina sp. CCMP291]     160   1e-43
ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dicty...   159   2e-43
ref|XP_004344606.1| cysteine protease 5, putative [Acanthamoeba ...   157   4e-43
ref|XP_003293312.1| hypothetical protein DICPUDRAFT_41833 [Dicty...   157   2e-42
ref|XP_012755566.1| hypothetical protein SAMD00019534_046220 [Ac...   152   6e-41
ref|XP_009037507.1| hypothetical protein AURANDRAFT_5846, partia...   148   9e-41
ref|XP_013753886.1| cysteine proteinase 1 [Thecamonas trahens AT...   149   1e-39

>dbj|BAK00754.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 341

 Score = 233 bits (594), Expect = 4e-72
 Identities = 115/198 (58%), Positives = 138/198 (69%), Gaps = 1/198 (0%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSATEQIES WFLA L LSPQQI+SCD TD GC GG T TAY YV+SAGGL++++
Sbjct: 141 WAFSATEQIESNWFLAGNELISLSPQQIVSCDTTDGGCGGGWTYTAYQYVQSAGGLDTDA 200

Query: 576 AYPYSSGAGNTGTCKFK-AASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASI 400
           AYPYSSGAG TGTC AS A+ISGF YA P CS +C QDE + A + P S+
Sbjct: 201 AYPYSSGAGVTGTCDNPLPASPAAQISGFGYAIPTCSDSCTNQDENSMAQYMQENSPLSV 260

Query: 399 CVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKSYWIVRNSWGASWGYS 220
           CV+AE WQ YSSG++T C S ++ LDHCVQ VGY+ S + YWIVRNSW +WG
Sbjct: 261 CVDAEPWQFYSSGIMTVDQC-PSDFSGLDHCVQAVGYDATGS-QPYWIVRNSWNTNWGED 318

Query: 219 GYLYVEYGTNACGVADEA 166
           G++ + GTN CG+ D A
Sbjct: 319 GFIRLALGTNTCGIGDVA 336

>ref|XP_009032801.1| hypothetical protein AURANDRAFT_18666 [Aureococcus anophagefferens]
 gb|EGB13210.1| hypothetical protein AURANDRAFT_18666 [Aureococcus anophagefferens]
          Length = 346

 Score = 228 bits (580), Expect = 5e-70
 Identities = 116/209 (55%), Positives = 135/209 (64%), Gaps = 12/209 (5%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSATEQIES W LA V +PQQI+SCDK D GC+GG+T TAYAYV+ AGG+ ES
Sbjct: 133 WAFSATEQIESEWVLAGNDPLVFAPQQIVSCDKVDQGCNGGNTETAYAYVEKAGGMALES 192

Query: 576 AYPYSSG-AGNTGTCKFKAASIVAKISGFSYATPPC-SGACKTQDEATFANNVAAKGPAS 403
           AYPY SG +GNTG CK K + + FSY P C G C QDE A +A+ GPAS
Sbjct: 193 AYPYKSGTSGNTGRCK-KFETAGGDVESFSYVVPECKKGKCNDQDEDKMAAALASHGPAS 251

Query: 402 ICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGY----------NKPASGKSYWIV 253
           ICVNA AWQ Y+ GV+T CG A LDHCVQ+VGY K K W V
Sbjct: 252 ICVNAGAWQTYTKGVMTNLQCGSHAANALDHCVQVVGYTGYTGDAKACGKGLKDKCVWNV 311

Query: 252 RNSWGASWGYSGYLYVEYGTNACGVADEA 166
           RNSWG SWGY GY+ V+ G NACG+A++A
Sbjct: 312 RNSWGTSWGYQGYIRVQMGKNACGIANDA 340

>ref|XP_013762925.1| cruzipain [Thecamonas trahens ATCC 50062]
 gb|KNC45942.1| cruzipain [Thecamonas trahens ATCC 50062]
          Length = 394

 Score = 188 bits (477), Expect = 5e-54
 Identities = 97/199 (48%), Positives = 126/199 (63%), Gaps = 2/199 (1%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSA ++ES W LA L VLS QQ++ CD TD GC+GGDT +AY Y++ AGGL E
Sbjct: 139 WAFSAVSEVESMWALAGHELVVLSEQQVVDCDTTDDGCNGGDTISAYHYIEKAGGLVPEK 198

Query: 576 AYPYSSGAGNTGTCKFKAA--SIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPAS 403
           YPY++ G CK VAKI G++YAT P T++E A N+ + GP S
Sbjct: 199 DYPYTA---RDGKCKDSVVKKDAVAKIMGYNYATSP-----STKNETQLAANLMSTGPVS 250

Query: 402 ICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKSYWIVRNSWGASWGY 223
           ICV+A +WQ Y+SG+L+ CG LDHCVQ+ G+ S + YW VRNSW SWG
Sbjct: 251 ICVDASSWQTYTSGILS--HCG----KQLDHCVQITGWGTSGS-EMYWWVRNSWATSWGM 303

Query: 222 SGYLYVEYGTNACGVADEA 166
           SGY+ +++G N CG+ADEA
Sbjct: 304 SGYIQLKFGQNTCGLADEA 322

>ref|XP_004368288.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii str. Neff]
 gb|ELR25533.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii str. Neff]
          Length = 330

 Score = 186 bits (471), Expect = 8e-54
 Identities = 97/202 (48%), Positives = 125/202 (61%), Gaps = 6/202 (2%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKT--DAGCDGGDTPTAYAYVKSAGGLES 583
           WAFS TE IES WFL+ L L+PQQI+ CD+ D GCDGGD PTAY YV AGGL++
Sbjct: 138 WAFSVTEAIESQWFLSGRKLVSLAPQQIVDCDQGNGDYGCDGGDPPTAYEYVIKAGGLDT 197

Query: 582 ESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPAS 403
           E +YPY++ G C FK +++ AKIS ++Y T T++E +A++GP S
Sbjct: 198 EESYPYTA---EDGQCAFKPSAVGAKISNWTYITT-------TKNETEMQYGLASRGPLS 247

Query: 402 ICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGK----SYWIVRNSWGA 235
           ICV+A +WQ Y GV+T+ C S LDHCV + GY+ W +RNSWG
Sbjct: 248 ICVDASSWQYYIGGVITS-LCEDS----LDHCVMITGYSVQEGWDFMKYDVWNIRNSWGE 302

Query: 234 SWGYSGYLYVEYGTNACGVADE 169
           WGY GYLYV+ G+N CGV DE
Sbjct: 303 DWGYGGYLYVQRGSNLCGVGDE 324

>ref|XP_004335426.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff]
 gb|ELR13413.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff]
          Length = 331

 Score = 181 bits (458), Expect = 7e-52
 Identities = 95/195 (48%), Positives = 121/195 (62%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSATE IES W LA L LS QQI+ C D GC GG AY YV A GL++ +
Sbjct: 145 WAFSATENIESQWALAGHKLTGLSMQQIVDCSWWDDGCGGGFPSYAYDYVIDAPGLDALA 204

Query: 576 AYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASIC 397
           YPY++ G+C FK + +VAKIS ++Y T +E AN +A GP S+C
Sbjct: 205 NYPYTAVG---GSCAFKESQVVAKISSWTYTT-------TDSNEHQMANYLAQHGPISVC 254

Query: 396 VNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKSYWIVRNSWGASWGYSG 217
           V+AE+W Y+ GV A ACG T +DHCV VGYN A+ YWI+RNSWG SWG G
Sbjct: 255 VDAESWPSYTGGVYRASACG----TSIDHCVLAVGYNLTAN-PPYWIIRNSWGTSWGLEG 309

Query: 216 YLYVEYGTNACGVAD 172
           Y+++E+GT+AC VA+
Sbjct: 310 YMHLEFGTDACAVAE 324

>ref|XP_012756472.1| hypothetical protein SAMD00019534_028850 [Acytostelium subglobosum LB1]
 dbj|GAM19710.1| hypothetical protein SAMD00019534_028850 [Acytostelium subglobosum LB1]
          Length = 325

 Score = 178 bits (451), Expect = 7e-51
 Identities = 90/197 (45%), Positives = 119/197 (60%), Gaps = 1/197 (0%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSATEQIE+A+ A S QQI+ CD D GC GGD TAY YV+SAGG+ +++
Sbjct: 135 WAFSATEQIETAFIQAGNAQQFFSEQQIVDCDPFDGGCGGGDPMTAYQYVQSAGGITTDT 194

Query: 576 AYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASIC 397
           AYPY++ GTC+ + VA+I + YA+ +E +AA GP SIC
Sbjct: 195 AYPYTA---QDGTCEANTTTKVAQIKTYGYAS-------TAGNETQMKEAIAALGPLSIC 244

Query: 396 VNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN-KPASGKSYWIVRNSWGASWGYS 220
           V+AE W Y SG++T DLDHCVQ+VGY+ S Y+IVRNSWG +WG
Sbjct: 245 VDAETWMTYQSGIITTDCA-----ADLDHCVQVVGYDVDTTSNIPYYIVRNSWGTTWGQE 299

Query: 219 GYLYVEYGTNACGVADE 169
           GY+Y+ G+N CG+ +E
Sbjct: 300 GYIYIGEGSNLCGITEE 316

>gb|KYQ91485.1| hypothetical protein DLAC_08453 [Tieghemostelium lacteum]
          Length = 354

 Score = 176 bits (447), Expect = 6e-50
 Identities = 96/200 (48%), Positives = 119/200 (59%), Gaps = 3/200 (1%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSATEQIE+AW A +LS QQI+ CD D GC GGD TA YV AGGL SES
Sbjct: 166 WAFSATEQIETAWIKAGNDQVILSEQQIVDCDTNDGGCGGGDPHTAMDYVIKAGGLTSES 225

Query: 576 AYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASIC 397
           YPY N GTC VA ISG+ AT P ++ A +V +GP SIC
Sbjct: 226 QYPY---IANDGTCHTNFTP-VAHISGYYAATTP-------GNDTQLAYSVMNEGPISIC 274

Query: 396 VNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKS---YWIVRNSWGASWG 226
           V+A +W YSSG++ + + +DLDHCVQ+VG N +G + Y+I+RNSWG WG
Sbjct: 275 VDASSWMTYSSGIIRS-----NCDSDLDHCVQIVGLNVDTNGTTPIPYYIIRNSWGTDWG 329

Query: 225 YSGYLYVEYGTNACGVADEA 166
           G++YVE G + CGV EA
Sbjct: 330 IDGFIYVEIGHDLCGVTQEA 349

>ref|XP_004363040.1| hypothetical protein DFA_03437 [Cavenderia fasciculata]
 gb|EGG25189.1| hypothetical protein DFA_03437 [Cavenderia fasciculata]
          Length = 341

 Score = 175 bits (443), Expect = 2e-49
 Identities = 88/197 (44%), Positives = 120/197 (60%), Gaps = 1/197 (0%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSATEQIE+A +A G + LS QQI+ CD D GC GGD TAY YV++ GGL
Sbjct: 140 WAFSATEQIETANIMAGGQVEYLSEQQIVDCDPYDGGCGGGDPYTAYQYVQNNGGLTLNV 199

Query: 576 AYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASIC 397
           YPY++ G C + + +++ F YA+ +E +AA+GP SIC
Sbjct: 200 TYPYTAA---NGACYANSTAPAVQVTAFGYAS-------SQGNETQLREAMAARGPLSIC 249

Query: 396 VNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKS-YWIVRNSWGASWGYS 220
           VNAE W Y SG+ ++ + DLDHCVQ+VGY+ A+ K+ Y+IVRNSWG WG
Sbjct: 250 VNAEPWMSYQSGIFSS-----TCSDDLDHCVQIVGYDTDATSKTPYFIVRNSWGTDWGLL 304

Query: 219 GYLYVEYGTNACGVADE 169
           GY+Y++ G+N CG+ +E
Sbjct: 305 GYIYIQAGSNLCGITNE 321

>ref|XP_012759759.1| hypothetical protein SAMD00019534_034130, partial [Acytostelium subglobosum LB1]
 dbj|GAM20238.1| hypothetical protein SAMD00019534_034130, partial [Acytostelium subglobosum LB1]
          Length = 338

 Score = 174 bits (442), Expect = 2e-49
 Identities = 90/199 (45%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSA EQIESA+ + + S QQI+ CD D GC GGDT TAY YV++AGGL + +
Sbjct: 145 WAFSAAEQIESAYIMLGNEAQIASEQQIVDCDSFDGGCGGGDTMTAYKYVETAGGLTTNA 204

Query: 576 AYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASIC 397
           +YPY++ GTC K++ ++YA+ +E +AA GP SIC
Sbjct: 205 SYPYTA---QDGTCYANKTKKFVKVTNYNYAS-------SQGNETQLKEAIAALGPLSIC 254

Query: 396 VNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPAS---GKSYWIVRNSWGASWG 226
           V+A +W Y SG++T+ CG DLDHCVQLVGY +S Y+IVRNSWG WG
Sbjct: 255 VDAISWMTYQSGIITSN-CG----NDLDHCVQLVGYAIESSVTPNIPYYIVRNSWGLDWG 309

Query: 225 YSGYLYVEYGTNACGVADE 169
           GY+Y+ G N CG+ DE
Sbjct: 310 QEGYIYIGEGQNLCGITDE 328

>ref|XP_009032695.1| hypothetical protein AURANDRAFT_19240 [Aureococcus anophagefferens]
 gb|EGB13096.1| hypothetical protein AURANDRAFT_19240 [Aureococcus anophagefferens]
          Length = 254

 Score = 172 bits (435), Expect = 2e-49
 Identities = 87/216 (40%), Positives = 116/216 (53%), Gaps = 19/216 (8%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKS--AGGLES 583
           WAFS +Q++S W+L S QQI+SCD GC GG +AYAY+K GL +
Sbjct: 16  WAFSVAQQVQSEWYLDGNPASEFSAQQIISCDDEMFGCGGGGPVSAYAYIKERVTPGLSN 75

Query: 582 ESAYPYSSGAGNTGTC-----------------KFKAASIVAKISGFSYATPPCSGACKT 454
           YPY G G+ TC + + A+++ +S+AT C C+
Sbjct: 76  LWYYPYVQGMGSQRTCLSPKCTETCRGIGTEVEETLLTGVYAQVANYSWATKGCFDDCED 135

Query: 453 QDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPAS 274
           QD VAA GPASICVNA W +Y+ GV++ CG + DLDHCV LVG+N S
Sbjct: 136 QDLYGLRKAVAAHGPASICVNAANWDVYAGGVMSTATCGSYNFNDLDHCVGLVGFNMD-S 194

Query: 273 GKSYWIVRNSWGASWGYSGYLYVEYGTNACGVADEA 166
           YWIV+N W +WG GY++++ N CGVAD A
Sbjct: 195 DPPYWIVKNQWSTTWGVDGYIFLDARNNTCGVADTA 230

>ref|XP_020433674.1| hypothetical protein PPL_05546 [Heterostelium album PN500]
 gb|EFA81557.1| hypothetical protein PPL_05546 [Heterostelium album PN500]
          Length = 341

 Score = 166 bits (421), Expect = 3e-46
 Identities = 86/197 (43%), Positives = 117/197 (59%), Gaps = 1/197 (0%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSA EQIE+A+ +A +S QQI+ CD D GC GGD TAY YV+SAGG+ + +
Sbjct: 148 WAFSAGEQIETAYIMAGNAAQNVSEQQIVDCDPYDGGCGGGDPMTAYQYVQSAGGITTNT 207

Query: 576 AYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASIC 397
           YPY++ GTC + +I+ + YA+ +E +AA+GP SIC
Sbjct: 208 DYPYTA---TDGTCYAQNTPKFTQIASYGYAS-------NKGNETELKQAIAARGPLSIC 257

Query: 396 VNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN-KPASGKSYWIVRNSWGASWGYS 220
           V+AE W Y SGVL + + +LDHCVQ+VGY+ + ++ Y+IVRNSWG WG
Sbjct: 258 VDAETWMNYQSGVLNS-----NCPDELDHCVQIVGYDVEQSTNTPYYIVRNSWGTDWGME 312

Query: 219 GYLYVEYGTNACGVADE 169
           GY+ V G N CG+ DE
Sbjct: 313 GYILVGEGQNLCGITDE 329

>ref|XP_009040822.1| hypothetical protein AURANDRAFT_5922, partial [Aureococcus anophagefferens]
 gb|EGB04435.1| hypothetical protein AURANDRAFT_5922, partial [Aureococcus anophagefferens]
          Length = 230

 Score = 162 bits (410), Expect = 7e-46
 Identities = 90/205 (43%), Positives = 107/205 (52%), Gaps = 8/205 (3%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDA-----GCDGGDTPTAYAYVKSA-- 598
           WAFSATEQ+ES LA LS QQ+ SC GC GGD AY Y+ A
Sbjct: 38  WAFSATEQVESQLVLAGAPQVELSTQQVASCTADPQLMCCDGCAGGDPTAAYEYLAWASR 97

Query: 597 -GGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVA 421
           GGL ++ +PY G C+ P C+ C D A +
Sbjct: 98  KGGLAPDAWWPYEQGLTPDEVCE----------------APACTKTCDKDDTDQLAARLE 141

Query: 420 AKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKSYWIVRNSW 241
           A P S+C+NA AW Y+ GVL+ ACGG D+DHCVQLVGYNK SYWIVRNSW
Sbjct: 142 AS-PLSVCLNAGAWDDYTGGVLSEAACGGHGADDVDHCVQLVGYNKTEPENSYWIVRNSW 200

Query: 240 GASWGYSGYLYVEYGTNACGVADEA 166
           SWG GY+Y+ NACGVA+EA
Sbjct: 201 STSWGEDGYIYLSMDGNACGVANEA 225

>gb|KOO53669.1| cathepsin l-like protease [Chrysochromulina sp. CCMP291]
          Length = 345

 Score = 164 bits (415), Expect = 3e-45
 Identities = 91/202 (45%), Positives = 118/202 (58%), Gaps = 5/202 (2%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSATEQ+ES ++ G L LSPQQ+ SCD GC+GG+ A+ YV S GG ESES
Sbjct: 146 WAFSATEQLESQYYQTYGKLIELSPQQLTSCDPNCGGCNGGNPINAWIYVNSFGGQESES 205

Query: 576 AYPYSSG-AGNTGTCKFKAASIVAKIS---GFSYATPPCSGACKTQDEATFANNVAAKGP 409
           YPY SG TG+C K A + + G+ A P E+ + P
Sbjct: 206 DYPYVSGVTKQTGSCSSKIAEVTEAVGADVGYFIAQRPA-------QESNMLKQIGL-SP 257

Query: 408 ASICVNAEAWQLYSSGVLTAKA-CGGSAYTDLDHCVQLVGYNKPASGKSYWIVRNSWGAS 232
           SI V+AE WQ Y+ G++ K+ CG T +DH VQ+ GYN A G +YWIVRNSWG +
Sbjct: 258 MSIAVDAELWQTYTGGIIGPKSGCG----TTIDHAVQVTGYN--AEG-NYWIVRNSWGPN 310

Query: 231 WGYSGYLYVEYGTNACGVADEA 166
           WG SG++Y+ YG N CG+ +A
Sbjct: 311 WGESGFVYLTYGDNVCGITSQA 332

>gb|KOO22621.1| cysteine proteinase [Chrysochromulina sp. CCMP291]
          Length = 386

 Score = 160 bits (406), Expect = 1e-43
 Identities = 92/215 (42%), Positives = 117/215 (54%), Gaps = 18/215 (8%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLP--VLSPQQILSCDKTDA-GCDGGDTPTAYAYVKSAGGLE 586
           WAFSATE +ES L+ G L+PQQ SC + A GC+GG T AY Y+ + GL
Sbjct: 142 WAFSATEAVESQLVLSSGNQVRIELAPQQTTSCTPSPAQGCNGGFTEAAYEYMGTVTGLT 201

Query: 585 SESAYPYSSGAGNTGTCKF----KAASI----------VAKISGFSYATPPC-SGACKTQ 451
           + YPY T + K A+I A +SG+ YAT PC SGAC Q
Sbjct: 202 NSFNYPYMQSLTATSATQACNTAKVAAIDGPMMQLTGGYAAVSGYHYATTPCTSGACANQ 261

Query: 450 DEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASG 271
           D A + P S+CVNA +W Y+ GV+T+ ACG A DHCV G+N A
Sbjct: 262 DLAALQAAIETT-PVSVCVNAASWNDYTGGVMTSAACGSMAAKAQDHCVMATGFNTTAP- 319

Query: 270 KSYWIVRNSWGASWGYSGYLYVEYGTNACGVADEA 166
           YWIVRNSW ++WG GY+Y+E N CG+AD+A
Sbjct: 320 TPYWIVRNSWSSTWGEYGYIYLEMAENTCGIADDA 354

>ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
 gb|EGC38873.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
          Length = 346

 Score = 159 bits (403), Expect = 2e-43
 Identities = 92/209 (44%), Positives = 117/209 (55%), Gaps = 14/209 (6%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCD----------KTDAGCDGGDTPTAYAYV 607
           W+FS T IE W+LA TL LS Q ++ CD DAGCDGG P AY YV
Sbjct: 143 WSFSTTGNIEGQWYLAGNTLVGLSEQNLVDCDHQCMEYDGQKSCDAGCDGGLQPNAYRYV 202

Query: 606 KSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANN 427
           GGL+SE++YPY + G++ CKFK+ ++ AKIS F+ Q+E A
Sbjct: 203 IENGGLDSENSYPYLAVTGDS--CKFKSGNVAAKISNFTMIP---------QNETQMAGY 251

Query: 426 VAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN--KPASG--KSYW 259
           +A GP +I +A WQ Y GV CG S LDH + +VG++ K G K YW
Sbjct: 252 LATHGPLAIAADAAEWQFYIGGVFDLP-CGQS----LDHGILIVGFSAEKNIFGHLKPYW 306

Query: 258 IVRNSWGASWGYSGYLYVEYGTNACGVAD 172
           IV+NSWGASWG GYLY+ G N CGV+D
Sbjct: 307 IVKNSWGASWGEQGYLYLGKGKNLCGVSD 335

>ref|XP_004344606.1| cysteine protease 5, putative [Acanthamoeba castellanii str. Neff]
 gb|ELR20863.1| cysteine protease 5, putative [Acanthamoeba castellanii str. Neff]
          Length = 315

 Score = 157 bits (398), Expect = 4e-43
 Identities = 87/193 (45%), Positives = 108/193 (55%)
 Frame = -2

Query: 753 AFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESESA 574
           AFSATE IES W LA L L+ QQI+ CD TD+GC GG AY YV SA GLE +
Sbjct: 149 AFSATENIESQWALAGNKLTELAMQQIVDCDSTDSGCGGGWPYNAYEYVMSAPGLEPLAD 208

Query: 573 YPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASICV 394
           YPY++ GN C + + +VAKIS ++Y T QDE AN +A GP S+CV
Sbjct: 209 YPYTAADGN---CAYNSGEVVAKISDWTYTT-------TDQDEHQMANYLAQHGPISVCV 258

Query: 393 NAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKSYWIVRNSWGASWGYSGY 214
           +A W LY+ VGYN A+ YWI+RNSWGA WG GY
Sbjct: 259 DASQWSLYT-----------------------VGYNL-AANPPYWIIRNSWGADWGLQGY 294

Query: 213 LYVEYGTNACGVA 175
           +Y+E+G +AC VA
Sbjct: 295 MYLEFGQDACAVA 307

>ref|XP_003293312.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
 gb|EGC30167.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
          Length = 352

 Score = 157 bits (396), Expect = 2e-42
 Identities = 85/201 (42%), Positives = 114/201 (56%), Gaps = 4/201 (1%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           + FSATEQIES + A +LS QQ + CD D GC GGD Y Y+ SAGG+ +E
Sbjct: 164 YIFSATEQIESEYIRAGHKAILLSEQQSVDCDTMDGGCGGGDPANVYNYIISAGGVSTEK 223

Query: 576 AYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASIC 397
           YPY++ GTC F V+ I+GF Y T + DE T +A GP SIC
Sbjct: 224 DYPYTA---QDGTC-FNTTRAVS-ITGFQYVT-------QNSDEDTLITTIANHGPVSIC 271

Query: 396 VNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN----KPASGKSYWIVRNSWGASW 229
           V+A WQ Y+ G++T ++DHCVQ+VG + P++ Y+I+RNSWG SW
Sbjct: 272 VDASTWQSYTGGIITT-----GCEQNIDHCVQVVGLDIDKTDPSNPIPYYIIRNSWGTSW 326

Query: 228 GYSGYLYVEYGTNACGVADEA 166
           G GY+YV G+N CG+ E+
Sbjct: 327 GDKGYIYVAQGSNLCGITYES 347

>ref|XP_012755566.1| hypothetical protein SAMD00019534_046220 [Acytostelium subglobosum LB1]
 dbj|GAM21447.1| hypothetical protein SAMD00019534_046220 [Acytostelium subglobosum LB1]
          Length = 335

 Score = 152 bits (385), Expect = 6e-41
 Identities = 82/204 (40%), Positives = 111/204 (54%), Gaps = 11/204 (5%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKT----------DAGCDGGDTPTAYAYV 607
           W+FSAT IE AWFLA L LS Q ++ CD D GC+GG P AY Y+
Sbjct: 138 WSFSATGNIEGAWFLAGNNLTGLSEQNLVDCDHECMQYLGDHVCDQGCNGGLQPNAYEYI 197

Query: 606 KSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANN 427
           GG+++E +YPY+ G T C F A++I AKIS ++Y + +E T A+
Sbjct: 198 LKNGGIDTEESYPYTGVTGTT--CNFDASNIGAKISSWTYVS---------SNETTMASY 246

Query: 426 VAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN-KPASGKSYWIVR 250
           + A GP +I +A WQ YS GV K CG LDH + + G+ + + YWIV+
Sbjct: 247 LYANGPLAIAADALTWQYYSGGVFDFKECGSV----LDHGILITGFGVDTTNNEPYWIVK 302

Query: 249 NSWGASWGYSGYLYVEYGTNACGV 178
           NSWGA WG SGY+ + G CG+
Sbjct: 303 NSWGADWGESGYMRIIRGKGLCGL 326

>ref|XP_009037507.1| hypothetical protein AURANDRAFT_5846, partial [Aureococcus anophagefferens]
 gb|EGB07772.1| hypothetical protein AURANDRAFT_5846, partial [Aureococcus anophagefferens]
          Length = 208

 Score = 148 bits (374), Expect = 9e-41
 Identities = 88/206 (42%), Positives = 108/206 (52%), Gaps = 13/206 (6%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDK--------TDAGCDGGDTPTAYAYVKS 601
           WAFS TEQ+ES ++LA G VLS QQ+ SC K GC GGD AY Y+K
Sbjct: 23  WAFSTTEQVESQFYLAGGPPVVLSAQQVTSCAKYIDDPEEGCCFGCGGGDVTVAYDYIKG 82

Query: 600 AGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQ----DEATFA 433
           A GL + +PY+ C P C+ AC D A
Sbjct: 83  AIGLSPAAYWPYTQALTPDEEC----------------LGPFCTNACDMDLSELDLGELA 126

Query: 432 NNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKS-YWI 256
           + A PA++CVNA AW Y+ GVL AC G AY D+DHCVQLVGY+ A+G+ YWI
Sbjct: 127 KTIQAT-PAAVCVNAGAWDDYTGGVLRYDACSG-AYADIDHCVQLVGYD--ATGEEPYWI 182

Query: 255 VRNSWGASWGYSGYLYVEYGTNACGV 178
           VRNSW SWG GY+ ++ N CGV
Sbjct: 183 VRNSWSTSWGEDGYIRLQMDANTCGV 208

>ref|XP_013753886.1| cysteine proteinase 1 [Thecamonas trahens ATCC 50062]
 gb|KNC54251.1| cysteine proteinase 1 [Thecamonas trahens ATCC 50062]
          Length = 343

 Score = 149 bits (377), Expect = 1e-39
 Identities = 87/213 (40%), Positives = 117/213 (54%), Gaps = 18/213 (8%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTD--------------AGCDGGDTPTA 619
           WAFSA IESAW LA L LS Q I+ C + AGC+GG P A
Sbjct: 136 WAFSAVSAIESAWALAGNPLVSLSEQNIIDCTLNNEAYCIKYHGQKSCPAGCNGGLQPEA 195

Query: 618 YAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEAT 439
           Y YV + G+++ES+YPY + GTC+FK++S+ A +S +++ + + +E
Sbjct: 196 YLYVMANHGIDTESSYPYQAV---DGTCEFKSSSVGATVSNWTFVS-----TYEAPNEEA 247

Query: 438 FANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN--KPASGKS 265
           A + GP +I +A WQLY GV CG S LDH + +VGY K GK
Sbjct: 248 VAEALVEHGPLAIAADASEWQLYMGGVFDLP-CGHS----LDHGIVIVGYGSKKTILGKE 302

Query: 264 Y--WIVRNSWGASWGYSGYLYVEYGTNACGVAD 172
           + WI+RNSWGASWG GY+Y++ G N CGVAD
Sbjct: 303 HPIWIIRNSWGASWGEKGYMYLQRGDNKCGVAD 335