BLASTX nr result
ID: Ophiopogon25_contig00055708
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Ophiopogon25_contig00055708 (718 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|OLQ02485.1| Cathepsin Z [Symbiodinium microadriaticum] 239 6e-75 gb|KOO21039.1| hypothetical protein Ctob_001092 [Chrysochromulin... 236 6e-74 ref|XP_009036506.1| hypothetical protein AURANDRAFT_25936 [Aureo... 234 4e-73 gb|AAT09097.1| cathepsin Z [Bigelowiella natans] 226 1e-69 ref|XP_009036816.1| hypothetical protein AURANDRAFT_12990, parti... 218 2e-67 gb|OLP80048.1| Cathepsin Z, partial [Symbiodinium microadriaticum] 225 1e-65 gb|KOO22687.1| hypothetical protein Ctob_008847 [Chrysochromulin... 212 2e-64 ref|XP_023253505.1| cathepsin Z-like, partial [Seriola lalandi d... 208 3e-63 ref|XP_005756898.1| hypothetical protein EMIHUDRAFT_446652 [Emil... 209 4e-63 gb|OLP88871.1| Cathepsin Z [Symbiodinium microadriaticum] 216 7e-63 ref|XP_005766319.1| hypothetical protein EMIHUDRAFT_437057 [Emil... 205 2e-61 ref|XP_014156933.1| cathepsin X [Sphaeroforma arctica JP610] >gi... 175 2e-49 ref|XP_002288580.1| probable papain cysteine protease [Thalassio... 174 2e-49 gb|EJK74471.1| hypothetical protein THAOC_03848, partial [Thalas... 166 4e-47 gb|OEU10039.1| cysteine proteinase [Fragilariopsis cylindrus CCM... 166 3e-46 ref|XP_001415619.1| predicted protein [Ostreococcus lucimarinus ... 158 1e-43 ref|XP_002507788.1| cysteine endopeptidase [Micromonas commoda] ... 159 5e-43 ref|XP_007515195.1| predicted protein [Bathycoccus prasinos] >gi... 159 7e-43 ref|XP_002177642.1| predicted protein, partial [Phaeodactylum tr... 154 9e-43 dbj|GAX22053.1| cathepsin X [Fistulifera solaris] 155 4e-42 >gb|OLQ02485.1| Cathepsin Z [Symbiodinium microadriaticum] Length = 338 Score = 239 bits (611), Expect = 6e-75 Identities = 116/193 (60%), Positives = 138/193 (71%), Gaps = 2/193 (1%) Frame = -1 Query: 718 WISSISDKTGTGLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCV 539 W+ IS K GTG+ YE+ NPY+ACTSD +G CK D SCK LNVARTCG+F + GG C Sbjct: 140 WLMEISKK-GTGISYETANPYVACTSDSDEGFCKYVDTSCKALNVARTCGSFSQEGGPCT 198 Query: 538 GLTKYPNVTISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVV 359 GL ++PN TISDYGSISGA AM KEIF+RGPISCGVDA P+++Y +GI+ +G G DHVV Sbjct: 199 GLGEFPNATISDYGSISGADAMMKEIFHRGPISCGVDANPLLNYESGIIKTKGEGVDHVV 258 Query: 358 SVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT--DE 185 SV GWG D +W+VRNSWGE WG MG+ V G AL LE+QCSWAV FT ++ Sbjct: 259 SVVGWGTDPKDG-FFWVVRNSWGEYWGEMGYFRVAKG--ALLLEDQCSWAVPATFTAAEK 315 Query: 184 NNQSHCSEDGATC 146 NQ HC E G C Sbjct: 316 KNQVHCHEGGDNC 328 >gb|KOO21039.1| hypothetical protein Ctob_001092 [Chrysochromulina sp. CCMP291] Length = 324 Score = 236 bits (603), Expect = 6e-74 Identities = 113/193 (58%), Positives = 135/193 (69%), Gaps = 2/193 (1%) Frame = -1 Query: 718 WISSISDKTGTGLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCV 539 W+ IS KTG G+ YE+ NPY+ACTSD +G CK D +CK +N+ARTCG+F + GG C Sbjct: 134 WMMEIS-KTGAGVAYETSNPYIACTSDSSEGFCKHVDTTCKAINIARTCGSFSQEGGTCT 192 Query: 538 GLTKYPNVTISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVV 359 GL+ YPN TISDYGSISG AM KEI RGPISCG+DA P+++Y GI A G G DHV+ Sbjct: 193 GLSSYPNATISDYGSISGKDAMMKEIIARGPISCGIDAGPLLNYEKGIADASGEGVDHVI 252 Query: 358 SVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT--DE 185 SV GWG + + YWIVRNSWGE WG MG+V V G AL +EEQCSWAV FT ++ Sbjct: 253 SVVGWGNE--AGVPYWIVRNSWGEYWGEMGYVRVAFG--ALKIEEQCSWAVPKDFTSAEK 308 Query: 184 NNQSHCSEDGATC 146 NQ HC E G C Sbjct: 309 ANQVHCHEGGDNC 321 >ref|XP_009036506.1| hypothetical protein AURANDRAFT_25936 [Aureococcus anophagefferens] gb|EGB08493.1| hypothetical protein AURANDRAFT_25936 [Aureococcus anophagefferens] Length = 326 Score = 234 bits (598), Expect = 4e-73 Identities = 108/193 (55%), Positives = 139/193 (72%), Gaps = 2/193 (1%) Frame = -1 Query: 718 WISSISDKTGTGLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCV 539 W+ +SDK G G+ YE+ NPY+AC+S+ +GIC G DW+C P+N+ARTC TF +G C Sbjct: 136 WLKGLSDK-GEGISYETSNPYMACSSECTEGICPGGDWTCTPINIARTCSTFPPAG-TCT 193 Query: 538 GLTKYPNVTISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVV 359 +T YP +TI DYGSISG AMQKEI+ RG I+CG+DA PI++YTTG+ T G G DHV+ Sbjct: 194 EITPYPQITIDDYGSISGQKAMQKEIYARGSIACGIDAGPILNYTTGVATGAGEGVDHVI 253 Query: 358 SVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT--DE 185 SV GWG ++ + SYWIVRNSWGE WG MG+V V G AL +EEQC+WA V +T ++ Sbjct: 254 SVVGWGVEDGT--SYWIVRNSWGEYWGEMGYVRVAFG--ALMVEEQCAWATVKDYTAPEK 309 Query: 184 NNQSHCSEDGATC 146 +NQ HC E G C Sbjct: 310 DNQVHCFEGGENC 322 >gb|AAT09097.1| cathepsin Z [Bigelowiella natans] Length = 325 Score = 226 bits (575), Expect = 1e-69 Identities = 109/193 (56%), Positives = 137/193 (70%), Gaps = 2/193 (1%) Frame = -1 Query: 718 WISSISDKTGTGLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCV 539 WI S SDK G+G+ YE+ NPYLAC+S+ ++GIC DWSC P+N ARTC TF G KC+ Sbjct: 137 WIKSKSDK-GSGISYETSNPYLACSSESEEGICGNADWSCTPMNEARTCSTFPPQG-KCL 194 Query: 538 GLTKYPNVTISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVV 359 + ++P +IS YG+ISG AMQKEI RGPISCG+DA PI++YT+GI +G DHV+ Sbjct: 195 PIKQFPMASISAYGTISGQTAMQKEIAARGPISCGIDAAPILNYTSGIADMRGEMVDHVI 254 Query: 358 SVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT--DE 185 SV GWG D+ SYWIVRNSWGE WG MG++ V G AL +EEQC+WA V +T ++ Sbjct: 255 SVVGWGKDDTKG-SYWIVRNSWGEYWGEMGYIRVAFG--ALKVEEQCAWAEVKDYTAPEK 311 Query: 184 NNQSHCSEDGATC 146 NNQ HC E G C Sbjct: 312 NNQVHCFEGGENC 324 >ref|XP_009036816.1| hypothetical protein AURANDRAFT_12990, partial [Aureococcus anophagefferens] gb|EGB08838.1| hypothetical protein AURANDRAFT_12990, partial [Aureococcus anophagefferens] Length = 272 Score = 218 bits (555), Expect = 2e-67 Identities = 101/195 (51%), Positives = 135/195 (69%), Gaps = 4/195 (2%) Frame = -1 Query: 718 WISSISDKTGTGLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCV 539 W+ +S K G G+ YE+ NPY+AC+S+ + GIC +W+C +N+ARTC TF SGG C Sbjct: 82 WLHKLSKK-GEGISYETSNPYMACSSESEDGICPHGEWTCTDINIARTCSTFPSSGGTCS 140 Query: 538 GLTKYPNVTISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQ--DH 365 + +YP + I DYGSISG AAMQKEI+ RGPI+CG+DA PI++YT G+ +G Q DH Sbjct: 141 EIAQYPQILIDDYGSISGQAAMQKEIYARGPIACGIDASPILNYTKGVFVKKGLFQMVDH 200 Query: 364 VVSVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT-- 191 V+SV GWG ++ +SYWIVRNSWGE WG MG Y+ +G +L +E QC+WA V +T Sbjct: 201 VISVVGWGVED--GQSYWIVRNSWGEYWGEMG--YIRVGFGSLKVESQCAWATVKDYTAP 256 Query: 190 DENNQSHCSEDGATC 146 ++ NQ HC E G C Sbjct: 257 EKLNQVHCYEGGENC 271 >gb|OLP80048.1| Cathepsin Z, partial [Symbiodinium microadriaticum] Length = 723 Score = 225 bits (574), Expect = 1e-65 Identities = 108/193 (55%), Positives = 133/193 (68%), Gaps = 2/193 (1%) Frame = -1 Query: 718 WISSISDKTGTGLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCV 539 WI + +TG+G+ Y + PYLAC+S+ + GICK D SCK N+ARTC TF G CV Sbjct: 72 WIYKLGKRTGSGISYFTSQPYLACSSESRDGICKNADTSCKAENIARTCSTF---GEPCV 128 Query: 538 GLTKYPNVTISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVV 359 GL++YPN TISDYGSI G AM KEI+NRGPISCG+DA P++ YTTG++ QDHVV Sbjct: 129 GLSEYPNATISDYGSIMGKNAMMKEIYNRGPISCGIDANPLLKYTTGVIRGWSLMQDHVV 188 Query: 358 SVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT--DE 185 SV GWG D YWIVRNSWGE WG G+V V+ G AL+LE C+WAV D +T ++ Sbjct: 189 SVVGWGTDPEEG-LYWIVRNSWGEYWGENGYVRVKSG--ALALERSCTWAVPDTYTAPEK 245 Query: 184 NNQSHCSEDGATC 146 NN+ HC E G C Sbjct: 246 NNEIHCYEGGENC 258 Score = 212 bits (540), Expect = 1e-60 Identities = 106/194 (54%), Positives = 127/194 (65%), Gaps = 3/194 (1%) Frame = -1 Query: 718 WISSISDKTGTGLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCV 539 W+ ISDKTG+G+ Y + PYLAC+ D G CKG D+SC NV RTC TF G KCV Sbjct: 392 WLKEISDKTGSGISYTTGQPYLACSKDSDAGFCKGVDFSCTAENVQRTCATF---GKKCV 448 Query: 538 GLTKYPNVTISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVV 359 G+ KYPN T++ YGSI G AM KEIFNRGPI+C VDA PI++YT GIVTA+ DH + Sbjct: 449 GMAKYPNATVAKYGSIQGKDAMMKEIFNRGPIACNVDAVPILNYTGGIVTAKSKDTDHSI 508 Query: 358 SVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLE-EQCSWAVVDKFT--D 188 SV GWG D+ YW VRNSWGE WG G+ V+ G AL+LE + C WA FT + Sbjct: 509 SVVGWGTDDKLG-FYWQVRNSWGEYWGEQGYFRVQSG--ALALEQDSCIWATPKDFTAPE 565 Query: 187 ENNQSHCSEDGATC 146 N HC EDG+ C Sbjct: 566 RVNLLHCYEDGSNC 579 >gb|KOO22687.1| hypothetical protein Ctob_008847 [Chrysochromulina sp. CCMP291] Length = 318 Score = 212 bits (539), Expect = 2e-64 Identities = 107/193 (55%), Positives = 132/193 (68%), Gaps = 2/193 (1%) Frame = -1 Query: 718 WISSISDKTGTGLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCV 539 WI SIS KTGTG+ Y PYLAC+S+ K+G C+ D +C LN ARTCGTF E+ CV Sbjct: 131 WIHSISSKTGTGISYALSQPYLACSSESKEGWCEHVDSTCTALNTARTCGTFGEA---CV 187 Query: 538 GLTKYPNVTISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVV 359 GLT YPN T++++GSISG AMQKEIF RGPI+C +DA PI YT GI T + DHV+ Sbjct: 188 GLTHYPNATVAEFGSISGPDAMQKEIFARGPIACTIDAAPITKYTGGIATEKSFMTDHVI 247 Query: 358 SVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT--DE 185 SV GWG D A+ YWIVRNSWGE WG G+V V+ G AL+LE+ C+WA F+ + Sbjct: 248 SVVGWGTD-ATEGLYWIVRNSWGEYWGENGYVRVKSG--ALALEDACAWATPGTFSAPEF 304 Query: 184 NNQSHCSEDGATC 146 +N C EDG+ C Sbjct: 305 DNIYKCYEDGSNC 317 >ref|XP_023253505.1| cathepsin Z-like, partial [Seriola lalandi dorsalis] Length = 302 Score = 208 bits (530), Expect = 3e-63 Identities = 101/183 (55%), Positives = 126/183 (68%), Gaps = 2/183 (1%) Frame = -1 Query: 685 GLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVTIS 506 G+ Y + PY+AC+++ K+G C DWSC P+NVARTCGTF G +CVGL+ +PN TI+ Sbjct: 98 GISYATSQPYMACSAESKEGTCAHGDWSCTPMNVARTCGTF---GQECVGLSHFPNATIA 154 Query: 505 DYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVVSVTGWGYDEAS 326 +YG ISGA AMQKEIF RGPI+C +DA PI YT GI T + DHV+SV GWG Sbjct: 155 EYGHISGARAMQKEIFARGPIACTIDAAPIEKYTGGIATQRSFMTDHVISVVGWGTCPKE 214 Query: 325 SKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT--DENNQSHCSEDGA 152 YWIVRNSWGE WG G+V V+ G AL+LE+ C+WA V FT + +NQ C EDG+ Sbjct: 215 G-LYWIVRNSWGEYWGEQGYVKVKSG--ALALEQACAWATVQDFTAPERHNQYACFEDGS 271 Query: 151 TCD 143 CD Sbjct: 272 NCD 274 >ref|XP_005756898.1| hypothetical protein EMIHUDRAFT_446652 [Emiliania huxleyi CCMP1516] gb|EOD04469.1| hypothetical protein EMIHUDRAFT_446652 [Emiliania huxleyi CCMP1516] Length = 350 Score = 209 bits (533), Expect = 4e-63 Identities = 101/183 (55%), Positives = 127/183 (69%), Gaps = 2/183 (1%) Frame = -1 Query: 685 GLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVTIS 506 G+ Y ++ PY+AC+++ K+G C DWSC P+NVARTCGTF G +CVGL+ +PN TI+ Sbjct: 146 GISYATLQPYMACSAESKEGTCAHGDWSCTPMNVARTCGTF---GQECVGLSHFPNATIA 202 Query: 505 DYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVVSVTGWGYDEAS 326 +YG ISGA AMQKEIF RGPI+C +DA PI YT GI T + DHV+SV GWG Sbjct: 203 EYGHISGARAMQKEIFARGPIACTIDAAPIEKYTGGIATQRSFMTDHVISVVGWGTCPKE 262 Query: 325 SKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT--DENNQSHCSEDGA 152 YWIVRNSWGE WG G+V V+ G AL+LE+ C+WA V FT + +NQ C EDG+ Sbjct: 263 G-LYWIVRNSWGEYWGEQGYVKVKSG--ALALEQACAWATVQDFTAPERHNQYACFEDGS 319 Query: 151 TCD 143 CD Sbjct: 320 NCD 322 >gb|OLP88871.1| Cathepsin Z [Symbiodinium microadriaticum] Length = 644 Score = 216 bits (551), Expect = 7e-63 Identities = 103/179 (57%), Positives = 126/179 (70%) Frame = -1 Query: 718 WISSISDKTGTGLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCV 539 W+ IS K GTG+ YE+ NPY+AC+SD +G CK D SCK LNVARTCG+F + GG C Sbjct: 458 WLMEISKK-GTGISYETANPYVACSSDSDEGFCKHVDTSCKALNVARTCGSFSQEGGPCT 516 Query: 538 GLTKYPNVTISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVV 359 GL ++PN TISDYGSISGA AM KEIF+RGPISCGVDA P+++Y +GI+ +G G DHVV Sbjct: 517 GLGEFPNATISDYGSISGADAMMKEIFHRGPISCGVDANPLLNYESGIIKTKGEGVDHVV 576 Query: 358 SVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFTDEN 182 SV GWG D + YWIVRNSWGE WG MG+ V G L + + +VD +D N Sbjct: 577 SVVGWGTD-SKDGMYWIVRNSWGEYWGEMGYFRVAKGALLLEAGTKITSIIVDAISDNN 634 >ref|XP_005766319.1| hypothetical protein EMIHUDRAFT_437057 [Emiliania huxleyi CCMP1516] gb|EOD13890.1| hypothetical protein EMIHUDRAFT_437057 [Emiliania huxleyi CCMP1516] Length = 350 Score = 205 bits (522), Expect = 2e-61 Identities = 100/183 (54%), Positives = 126/183 (68%), Gaps = 2/183 (1%) Frame = -1 Query: 685 GLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVTIS 506 G+ Y ++ PY+AC+++ K+G C DWSC P+NVARTCGTF G +CVGL+ +PN TI+ Sbjct: 146 GISYATLQPYMACSAESKEGTCAHGDWSCTPMNVARTCGTF---GQECVGLSHFPNATIA 202 Query: 505 DYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVVSVTGWGYDEAS 326 +YG ISGA AMQKEIF RGPI+C +DA PI YT GI T + DHV+SV GWG Sbjct: 203 EYGHISGARAMQKEIFARGPIACTIDAAPIEKYTGGIATQRSFMTDHVISVVGWGTCPKE 262 Query: 325 SKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT--DENNQSHCSEDGA 152 YWIVRNSWGE WG G+V V+ G AL+LE+ C+WA V FT + +NQ C EDG+ Sbjct: 263 G-LYWIVRNSWGEYWGEQGYVKVKSG--ALALEQACAWATVQDFTAPERHNQYACFEDGS 319 Query: 151 TCD 143 D Sbjct: 320 NWD 322 >ref|XP_014156933.1| cathepsin X [Sphaeroforma arctica JP610] gb|KNC83031.1| cathepsin X [Sphaeroforma arctica JP610] Length = 362 Score = 175 bits (443), Expect = 2e-49 Identities = 77/178 (43%), Positives = 111/178 (62%), Gaps = 1/178 (0%) Frame = -1 Query: 676 YESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVTISDYG 497 Y++ Y AC+++ +G CK +D++C N RTC TF GG C + +YPN TI++YG Sbjct: 170 YDTCLQYEACSAESTEGTCKDRDFTCTGANTCRTCSTFTAYGGFCSEVDRYPNATIAEYG 229 Query: 496 SISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIV-TAQGSGQDHVVSVTGWGYDEASSK 320 ++ G M EI+ RGPI+CG+DA P+ +Y GIV +H++SV GWG D+ + Sbjct: 230 NVVGEYNMMAEIYRRGPIACGIDASPVDNYKGGIVDNTSAKSINHIISVVGWGVDKKTDT 289 Query: 319 SYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFTDENNQSHCSEDGATC 146 YWIVRNSWGE WG MG+ V+ G + L++E C+WA +T+ N C EDG+ C Sbjct: 290 PYWIVRNSWGEYWGEMGYFRVKRGENQLAIESSCAWATPGTWTEMN--FPCYEDGSNC 345 >ref|XP_002288580.1| probable papain cysteine protease [Thalassiosira pseudonana CCMP1335] gb|EED94016.1| probable papain cysteine protease [Thalassiosira pseudonana CCMP1335] Length = 336 Score = 174 bits (441), Expect = 2e-49 Identities = 84/186 (45%), Positives = 115/186 (61%), Gaps = 9/186 (4%) Frame = -1 Query: 676 YESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVTISDYG 497 Y++ PYLAC+SD ++G C D +C LN RTCG F ESGG CVGL +PN T+++YG Sbjct: 116 YDTCQPYLACSSDSQEGFCGYVDTTCNALNTCRTCGGFSESGGSCVGLDFFPNATVAEYG 175 Query: 496 SISG------AAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSG---QDHVVSVTGW 344 +ISG ++ EI RGP++ + A P+ D+ G + + S +H+V++ GW Sbjct: 176 TISGTDDADRVKKIKMEIKARGPVAATIQAGPLRDFMGGSIFSDDSAPKFPNHIVAIVGW 235 Query: 343 GYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFTDENNQSHCS 164 G D S+KS+W+VRNSWG WG GF VEMG + L +E +WA V FT N C+ Sbjct: 236 GKDVESNKSFWVVRNSWGYYWGEEGFFRVEMGKNILGIEMGVAWATVGSFTTAN--VPCT 293 Query: 163 EDGATC 146 EDGATC Sbjct: 294 EDGATC 299 >gb|EJK74471.1| hypothetical protein THAOC_03848, partial [Thalassiosira oceanica] Length = 281 Score = 166 bits (421), Expect = 4e-47 Identities = 80/182 (43%), Positives = 110/182 (60%), Gaps = 8/182 (4%) Frame = -1 Query: 676 YESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVTISDYG 497 Y++ PYLAC+ D +G C D SC NV RTC TF GG C + +PN T+++YG Sbjct: 77 YDTCQPYLACSDDSDEGFCSSVDTSCSKHNVCRTCSTFSSRGGACTEIDFFPNATVAEYG 136 Query: 496 -----SISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIV---TAQGSGQDHVVSVTGWG 341 S + ++ EI+ RGP++ GV+A+P++DY GIV T DH+VS+ GWG Sbjct: 137 SYNLLSFNRIHKIKSEIYARGPVAAGVNADPLLDYKGGIVKEGTIVNMIIDHIVSIVGWG 196 Query: 340 YDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFTDENNQSHCSE 161 DE + YWIVRNSWG+ WG MGF +++G++ L +E +WA KFT ENN C E Sbjct: 197 TDEEGDE-YWIVRNSWGQYWGEMGFFRIKIGSNLLGIESSIAWATPGKFTTENN-FPCGE 254 Query: 160 DG 155 G Sbjct: 255 SG 256 >gb|OEU10039.1| cysteine proteinase [Fragilariopsis cylindrus CCMP1102] Length = 361 Score = 166 bits (421), Expect = 3e-46 Identities = 79/192 (41%), Positives = 117/192 (60%), Gaps = 11/192 (5%) Frame = -1 Query: 688 TGLV-YESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVT 512 TG V Y++ PYLAC+++ G C D SC LN +TC TF GGKC + +PN T Sbjct: 140 TGYVPYDTCTPYLACSAESTDGFCGKIDTSCSALNTCKTCDTFGGMGGKCTEIDFFPNAT 199 Query: 511 ISDYGSISGAA--------AMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQ--DHV 362 +++YG + + + EI++RGP++ ++AEPIV YT GI T + + +H+ Sbjct: 200 VAEYGIVDYDSNDKEGVSHKIMSEIYSRGPVAATINAEPIVKYTGGIFTDENYSEQTNHI 259 Query: 361 VSVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFTDEN 182 VS+TGWG D+ S +WIVRNSWG+ WG MG++ +EMG + L +E + +W V +T N Sbjct: 260 VSITGWGTDKESGTKFWIVRNSWGQYWGEMGYMRLEMGKNLLGIEGEIAWVVPGTYT--N 317 Query: 181 NQSHCSEDGATC 146 + C EDG+ C Sbjct: 318 HNFACYEDGSNC 329 >ref|XP_001415619.1| predicted protein [Ostreococcus lucimarinus CCE9901] gb|ABO93911.1| predicted protein [Ostreococcus lucimarinus CCE9901] Length = 316 Score = 158 bits (400), Expect = 1e-43 Identities = 77/184 (41%), Positives = 114/184 (61%), Gaps = 3/184 (1%) Frame = -1 Query: 688 TGLV-YESVNPYLACTSDLKQGIC-KGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNV 515 TG V Y++ PY AC+++ +G C +G D++C +N RTC TF E GG C L+ +PN Sbjct: 112 TGFVPYDTCLPYEACSAESTEGNCARGGDYTCTAMNTCRTCSTFAEFGGFCSALSTFPNA 171 Query: 514 TISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQ-DHVVSVTGWGY 338 T+++YG ISG + EIF RGP+S G+DA+ + Y GI + +H+VS+ GWG Sbjct: 172 TVAEYGMISGEKEIMAEIFARGPVSAGIDADGLRGYVGGIYKDTPDFEINHIVSIVGWGT 231 Query: 337 DEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFTDENNQSHCSED 158 + +K YW+VRNSWG+ WG MGF + G ++L +E++ +WA +T N C ED Sbjct: 232 ADDGTK-YWVVRNSWGQYWGEMGFFRIIRGVNSLGIEDEVAWATPGSWTHMN--FACYED 288 Query: 157 GATC 146 G+ C Sbjct: 289 GSNC 292 >ref|XP_002507788.1| cysteine endopeptidase [Micromonas commoda] gb|ACO69046.1| cysteine endopeptidase [Micromonas commoda] Length = 388 Score = 159 bits (401), Expect = 5e-43 Identities = 74/183 (40%), Positives = 108/183 (59%), Gaps = 6/183 (3%) Frame = -1 Query: 676 YESVNPYLACTSDLKQGICKGQD---WSCKPLNVARTCGTFVESGGKCVGLTKYPNVTIS 506 +E+ Y AC+++ +G C D + CKP N RTC TF + GG C L +PN +I+ Sbjct: 182 FETCLVYEACSAESSEGSCAAGDVSRYECKPENTCRTCSTFSDMGGFCSALDSFPNASIA 241 Query: 505 DYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQ---DHVVSVTGWGYD 335 +YG +SG + E++ RGP++ G+DA + +YT GI+ + +H+V++ GWG Sbjct: 242 EYGEVSGEKEIMAEVYARGPVAAGIDANLLDEYTGGILDQPADYEYEINHIVAIVGWGET 301 Query: 334 EASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFTDENNQSHCSEDG 155 + K YWIVRNSWGE WG MGF + G AL +E++CSWA +T N C EDG Sbjct: 302 KKGEK-YWIVRNSWGEYWGEMGFFRIVRGKKALGIEDECSWATPASWTTHN--QGCYEDG 358 Query: 154 ATC 146 + C Sbjct: 359 SNC 361 >ref|XP_007515195.1| predicted protein [Bathycoccus prasinos] emb|CCO14074.1| predicted protein [Bathycoccus prasinos] Length = 413 Score = 159 bits (402), Expect = 7e-43 Identities = 73/184 (39%), Positives = 112/184 (60%), Gaps = 7/184 (3%) Frame = -1 Query: 676 YESVNPYLACTSDLKQGICK--GQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVTISD 503 Y++ Y AC+++ +G C + C +N RTC TF + GG C L +PN T+ + Sbjct: 191 YDTCLSYEACSNESIEGSCGYVTDRYKCNAINTCRTCSTFSQLGGFCAPLATFPNATVKE 250 Query: 502 YGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQ-DHVVSVTGWGYDEAS 326 YG+ISG A+ KE++ RGP++ G+DA+ + YT GI T S + +H+VS+ GWG ++ + Sbjct: 251 YGTISGEEAIMKELYARGPVAAGIDADGLRSYTKGIYTDTPSYEINHIVSIVGWGVEKKT 310 Query: 325 SKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT----DENNQSHCSED 158 + YWIVRNSWG+ WG MGF + G +L +E++ +WA ++T D+ C ED Sbjct: 311 NTKYWIVRNSWGQYWGEMGFFRIVRGKKSLGIEDEVAWATPGRWTGMKHDDFANFPCFED 370 Query: 157 GATC 146 GA C Sbjct: 371 GANC 374 >ref|XP_002177642.1| predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1] gb|EEC50456.1| predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1] Length = 256 Score = 154 bits (390), Expect = 9e-43 Identities = 70/162 (43%), Positives = 100/162 (61%), Gaps = 6/162 (3%) Frame = -1 Query: 676 YESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVTISDYG 497 Y++ YLAC+ + +G C D SC P N RTC TF GG C + +PN T+++YG Sbjct: 93 YDTCMSYLACSEESTEGFCPQLDTSCTPDNTCRTCDTFAGMGGACSRIDYFPNATVAEYG 152 Query: 496 SIS----GAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQ--DHVVSVTGWGYD 335 I +Q E++ RGP++ ++AEPIV+Y G+ G Q +H+VS+TGWG D Sbjct: 153 LIDLDDFVVHKIQTELYVRGPVAATINAEPIVEYAGGVFGEDGHSQRTNHIVSITGWGTD 212 Query: 334 EASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWA 209 E + K YWIVRNSWG+ WG MGF+ +E G + L +E + +WA Sbjct: 213 EDTGKLYWIVRNSWGQYWGEMGFMRIEAGKNLLGIEGEVAWA 254 >dbj|GAX22053.1| cathepsin X [Fistulifera solaris] Length = 359 Score = 155 bits (393), Expect = 4e-42 Identities = 80/198 (40%), Positives = 117/198 (59%), Gaps = 20/198 (10%) Frame = -1 Query: 676 YESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVTISDYG 497 Y++ PYLAC+S+ ++G CK D +C P+N RTC + C L ++PN TI++YG Sbjct: 153 YDTCMPYLACSSESREGFCKSVDTTCTPMNTCRTC-----TRKGCRALERFPNATIAEYG 207 Query: 496 SIS------GAAA--MQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQ---------- 371 + S GA A ++ EI+ RGP++ GV+AEP+V+Y +G+G+ Sbjct: 208 TYSYVTDGFGAVANKIKAEIYARGPVAAGVNAEPLVNY-------KGNGEIIRETSIFKM 260 Query: 370 --DHVVSVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDK 197 +H+VS+ GWG D + YWIVRNSWG WGNMG+ + MG++AL +E + SWA Sbjct: 261 LVNHIVSIVGWGMDPETGDQYWIVRNSWGAYWGNMGYFNILMGHNALGIELEVSWATPGT 320 Query: 196 FTDENNQSHCSEDGATCD 143 FT +N C DG+ CD Sbjct: 321 FTTKNYP--CHVDGSDCD 336