BLASTX nr result

ID: Ophiopogon25_contig00055708 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon25_contig00055708
         (718 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|OLQ02485.1| Cathepsin Z [Symbiodinium microadriaticum]             239   6e-75
gb|KOO21039.1| hypothetical protein Ctob_001092 [Chrysochromulin...   236   6e-74
ref|XP_009036506.1| hypothetical protein AURANDRAFT_25936 [Aureo...   234   4e-73
gb|AAT09097.1| cathepsin Z [Bigelowiella natans]                      226   1e-69
ref|XP_009036816.1| hypothetical protein AURANDRAFT_12990, parti...   218   2e-67
gb|OLP80048.1| Cathepsin Z, partial [Symbiodinium microadriaticum]    225   1e-65
gb|KOO22687.1| hypothetical protein Ctob_008847 [Chrysochromulin...   212   2e-64
ref|XP_023253505.1| cathepsin Z-like, partial [Seriola lalandi d...   208   3e-63
ref|XP_005756898.1| hypothetical protein EMIHUDRAFT_446652 [Emil...   209   4e-63
gb|OLP88871.1| Cathepsin Z [Symbiodinium microadriaticum]             216   7e-63
ref|XP_005766319.1| hypothetical protein EMIHUDRAFT_437057 [Emil...   205   2e-61
ref|XP_014156933.1| cathepsin X [Sphaeroforma arctica JP610] >gi...   175   2e-49
ref|XP_002288580.1| probable papain cysteine protease [Thalassio...   174   2e-49
gb|EJK74471.1| hypothetical protein THAOC_03848, partial [Thalas...   166   4e-47
gb|OEU10039.1| cysteine proteinase [Fragilariopsis cylindrus CCM...   166   3e-46
ref|XP_001415619.1| predicted protein [Ostreococcus lucimarinus ...   158   1e-43
ref|XP_002507788.1| cysteine endopeptidase [Micromonas commoda] ...   159   5e-43
ref|XP_007515195.1| predicted protein [Bathycoccus prasinos] >gi...   159   7e-43
ref|XP_002177642.1| predicted protein, partial [Phaeodactylum tr...   154   9e-43
dbj|GAX22053.1| cathepsin X [Fistulifera solaris]                     155   4e-42

>gb|OLQ02485.1| Cathepsin Z [Symbiodinium microadriaticum]
          Length = 338

 Score =  239 bits (611), Expect = 6e-75
 Identities = 116/193 (60%), Positives = 138/193 (71%), Gaps = 2/193 (1%)
 Frame = -1

Query: 718 WISSISDKTGTGLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCV 539
           W+  IS K GTG+ YE+ NPY+ACTSD  +G CK  D SCK LNVARTCG+F + GG C 
Sbjct: 140 WLMEISKK-GTGISYETANPYVACTSDSDEGFCKYVDTSCKALNVARTCGSFSQEGGPCT 198

Query: 538 GLTKYPNVTISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVV 359
           GL ++PN TISDYGSISGA AM KEIF+RGPISCGVDA P+++Y +GI+  +G G DHVV
Sbjct: 199 GLGEFPNATISDYGSISGADAMMKEIFHRGPISCGVDANPLLNYESGIIKTKGEGVDHVV 258

Query: 358 SVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT--DE 185
           SV GWG D      +W+VRNSWGE WG MG+  V  G  AL LE+QCSWAV   FT  ++
Sbjct: 259 SVVGWGTDPKDG-FFWVVRNSWGEYWGEMGYFRVAKG--ALLLEDQCSWAVPATFTAAEK 315

Query: 184 NNQSHCSEDGATC 146
            NQ HC E G  C
Sbjct: 316 KNQVHCHEGGDNC 328


>gb|KOO21039.1| hypothetical protein Ctob_001092 [Chrysochromulina sp. CCMP291]
          Length = 324

 Score =  236 bits (603), Expect = 6e-74
 Identities = 113/193 (58%), Positives = 135/193 (69%), Gaps = 2/193 (1%)
 Frame = -1

Query: 718 WISSISDKTGTGLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCV 539
           W+  IS KTG G+ YE+ NPY+ACTSD  +G CK  D +CK +N+ARTCG+F + GG C 
Sbjct: 134 WMMEIS-KTGAGVAYETSNPYIACTSDSSEGFCKHVDTTCKAINIARTCGSFSQEGGTCT 192

Query: 538 GLTKYPNVTISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVV 359
           GL+ YPN TISDYGSISG  AM KEI  RGPISCG+DA P+++Y  GI  A G G DHV+
Sbjct: 193 GLSSYPNATISDYGSISGKDAMMKEIIARGPISCGIDAGPLLNYEKGIADASGEGVDHVI 252

Query: 358 SVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT--DE 185
           SV GWG +  +   YWIVRNSWGE WG MG+V V  G  AL +EEQCSWAV   FT  ++
Sbjct: 253 SVVGWGNE--AGVPYWIVRNSWGEYWGEMGYVRVAFG--ALKIEEQCSWAVPKDFTSAEK 308

Query: 184 NNQSHCSEDGATC 146
            NQ HC E G  C
Sbjct: 309 ANQVHCHEGGDNC 321


>ref|XP_009036506.1| hypothetical protein AURANDRAFT_25936 [Aureococcus anophagefferens]
 gb|EGB08493.1| hypothetical protein AURANDRAFT_25936 [Aureococcus anophagefferens]
          Length = 326

 Score =  234 bits (598), Expect = 4e-73
 Identities = 108/193 (55%), Positives = 139/193 (72%), Gaps = 2/193 (1%)
 Frame = -1

Query: 718 WISSISDKTGTGLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCV 539
           W+  +SDK G G+ YE+ NPY+AC+S+  +GIC G DW+C P+N+ARTC TF  +G  C 
Sbjct: 136 WLKGLSDK-GEGISYETSNPYMACSSECTEGICPGGDWTCTPINIARTCSTFPPAG-TCT 193

Query: 538 GLTKYPNVTISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVV 359
            +T YP +TI DYGSISG  AMQKEI+ RG I+CG+DA PI++YTTG+ T  G G DHV+
Sbjct: 194 EITPYPQITIDDYGSISGQKAMQKEIYARGSIACGIDAGPILNYTTGVATGAGEGVDHVI 253

Query: 358 SVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT--DE 185
           SV GWG ++ +  SYWIVRNSWGE WG MG+V V  G  AL +EEQC+WA V  +T  ++
Sbjct: 254 SVVGWGVEDGT--SYWIVRNSWGEYWGEMGYVRVAFG--ALMVEEQCAWATVKDYTAPEK 309

Query: 184 NNQSHCSEDGATC 146
           +NQ HC E G  C
Sbjct: 310 DNQVHCFEGGENC 322


>gb|AAT09097.1| cathepsin Z [Bigelowiella natans]
          Length = 325

 Score =  226 bits (575), Expect = 1e-69
 Identities = 109/193 (56%), Positives = 137/193 (70%), Gaps = 2/193 (1%)
 Frame = -1

Query: 718 WISSISDKTGTGLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCV 539
           WI S SDK G+G+ YE+ NPYLAC+S+ ++GIC   DWSC P+N ARTC TF   G KC+
Sbjct: 137 WIKSKSDK-GSGISYETSNPYLACSSESEEGICGNADWSCTPMNEARTCSTFPPQG-KCL 194

Query: 538 GLTKYPNVTISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVV 359
            + ++P  +IS YG+ISG  AMQKEI  RGPISCG+DA PI++YT+GI   +G   DHV+
Sbjct: 195 PIKQFPMASISAYGTISGQTAMQKEIAARGPISCGIDAAPILNYTSGIADMRGEMVDHVI 254

Query: 358 SVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT--DE 185
           SV GWG D+    SYWIVRNSWGE WG MG++ V  G  AL +EEQC+WA V  +T  ++
Sbjct: 255 SVVGWGKDDTKG-SYWIVRNSWGEYWGEMGYIRVAFG--ALKVEEQCAWAEVKDYTAPEK 311

Query: 184 NNQSHCSEDGATC 146
           NNQ HC E G  C
Sbjct: 312 NNQVHCFEGGENC 324


>ref|XP_009036816.1| hypothetical protein AURANDRAFT_12990, partial [Aureococcus
           anophagefferens]
 gb|EGB08838.1| hypothetical protein AURANDRAFT_12990, partial [Aureococcus
           anophagefferens]
          Length = 272

 Score =  218 bits (555), Expect = 2e-67
 Identities = 101/195 (51%), Positives = 135/195 (69%), Gaps = 4/195 (2%)
 Frame = -1

Query: 718 WISSISDKTGTGLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCV 539
           W+  +S K G G+ YE+ NPY+AC+S+ + GIC   +W+C  +N+ARTC TF  SGG C 
Sbjct: 82  WLHKLSKK-GEGISYETSNPYMACSSESEDGICPHGEWTCTDINIARTCSTFPSSGGTCS 140

Query: 538 GLTKYPNVTISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQ--DH 365
            + +YP + I DYGSISG AAMQKEI+ RGPI+CG+DA PI++YT G+   +G  Q  DH
Sbjct: 141 EIAQYPQILIDDYGSISGQAAMQKEIYARGPIACGIDASPILNYTKGVFVKKGLFQMVDH 200

Query: 364 VVSVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT-- 191
           V+SV GWG ++   +SYWIVRNSWGE WG MG  Y+ +G  +L +E QC+WA V  +T  
Sbjct: 201 VISVVGWGVED--GQSYWIVRNSWGEYWGEMG--YIRVGFGSLKVESQCAWATVKDYTAP 256

Query: 190 DENNQSHCSEDGATC 146
           ++ NQ HC E G  C
Sbjct: 257 EKLNQVHCYEGGENC 271


>gb|OLP80048.1| Cathepsin Z, partial [Symbiodinium microadriaticum]
          Length = 723

 Score =  225 bits (574), Expect = 1e-65
 Identities = 108/193 (55%), Positives = 133/193 (68%), Gaps = 2/193 (1%)
 Frame = -1

Query: 718 WISSISDKTGTGLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCV 539
           WI  +  +TG+G+ Y +  PYLAC+S+ + GICK  D SCK  N+ARTC TF   G  CV
Sbjct: 72  WIYKLGKRTGSGISYFTSQPYLACSSESRDGICKNADTSCKAENIARTCSTF---GEPCV 128

Query: 538 GLTKYPNVTISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVV 359
           GL++YPN TISDYGSI G  AM KEI+NRGPISCG+DA P++ YTTG++      QDHVV
Sbjct: 129 GLSEYPNATISDYGSIMGKNAMMKEIYNRGPISCGIDANPLLKYTTGVIRGWSLMQDHVV 188

Query: 358 SVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT--DE 185
           SV GWG D      YWIVRNSWGE WG  G+V V+ G  AL+LE  C+WAV D +T  ++
Sbjct: 189 SVVGWGTDPEEG-LYWIVRNSWGEYWGENGYVRVKSG--ALALERSCTWAVPDTYTAPEK 245

Query: 184 NNQSHCSEDGATC 146
           NN+ HC E G  C
Sbjct: 246 NNEIHCYEGGENC 258



 Score =  212 bits (540), Expect = 1e-60
 Identities = 106/194 (54%), Positives = 127/194 (65%), Gaps = 3/194 (1%)
 Frame = -1

Query: 718 WISSISDKTGTGLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCV 539
           W+  ISDKTG+G+ Y +  PYLAC+ D   G CKG D+SC   NV RTC TF   G KCV
Sbjct: 392 WLKEISDKTGSGISYTTGQPYLACSKDSDAGFCKGVDFSCTAENVQRTCATF---GKKCV 448

Query: 538 GLTKYPNVTISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVV 359
           G+ KYPN T++ YGSI G  AM KEIFNRGPI+C VDA PI++YT GIVTA+    DH +
Sbjct: 449 GMAKYPNATVAKYGSIQGKDAMMKEIFNRGPIACNVDAVPILNYTGGIVTAKSKDTDHSI 508

Query: 358 SVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLE-EQCSWAVVDKFT--D 188
           SV GWG D+     YW VRNSWGE WG  G+  V+ G  AL+LE + C WA    FT  +
Sbjct: 509 SVVGWGTDDKLG-FYWQVRNSWGEYWGEQGYFRVQSG--ALALEQDSCIWATPKDFTAPE 565

Query: 187 ENNQSHCSEDGATC 146
             N  HC EDG+ C
Sbjct: 566 RVNLLHCYEDGSNC 579


>gb|KOO22687.1| hypothetical protein Ctob_008847 [Chrysochromulina sp. CCMP291]
          Length = 318

 Score =  212 bits (539), Expect = 2e-64
 Identities = 107/193 (55%), Positives = 132/193 (68%), Gaps = 2/193 (1%)
 Frame = -1

Query: 718 WISSISDKTGTGLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCV 539
           WI SIS KTGTG+ Y    PYLAC+S+ K+G C+  D +C  LN ARTCGTF E+   CV
Sbjct: 131 WIHSISSKTGTGISYALSQPYLACSSESKEGWCEHVDSTCTALNTARTCGTFGEA---CV 187

Query: 538 GLTKYPNVTISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVV 359
           GLT YPN T++++GSISG  AMQKEIF RGPI+C +DA PI  YT GI T +    DHV+
Sbjct: 188 GLTHYPNATVAEFGSISGPDAMQKEIFARGPIACTIDAAPITKYTGGIATEKSFMTDHVI 247

Query: 358 SVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT--DE 185
           SV GWG D A+   YWIVRNSWGE WG  G+V V+ G  AL+LE+ C+WA    F+  + 
Sbjct: 248 SVVGWGTD-ATEGLYWIVRNSWGEYWGENGYVRVKSG--ALALEDACAWATPGTFSAPEF 304

Query: 184 NNQSHCSEDGATC 146
           +N   C EDG+ C
Sbjct: 305 DNIYKCYEDGSNC 317


>ref|XP_023253505.1| cathepsin Z-like, partial [Seriola lalandi dorsalis]
          Length = 302

 Score =  208 bits (530), Expect = 3e-63
 Identities = 101/183 (55%), Positives = 126/183 (68%), Gaps = 2/183 (1%)
 Frame = -1

Query: 685 GLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVTIS 506
           G+ Y +  PY+AC+++ K+G C   DWSC P+NVARTCGTF   G +CVGL+ +PN TI+
Sbjct: 98  GISYATSQPYMACSAESKEGTCAHGDWSCTPMNVARTCGTF---GQECVGLSHFPNATIA 154

Query: 505 DYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVVSVTGWGYDEAS 326
           +YG ISGA AMQKEIF RGPI+C +DA PI  YT GI T +    DHV+SV GWG     
Sbjct: 155 EYGHISGARAMQKEIFARGPIACTIDAAPIEKYTGGIATQRSFMTDHVISVVGWGTCPKE 214

Query: 325 SKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT--DENNQSHCSEDGA 152
              YWIVRNSWGE WG  G+V V+ G  AL+LE+ C+WA V  FT  + +NQ  C EDG+
Sbjct: 215 G-LYWIVRNSWGEYWGEQGYVKVKSG--ALALEQACAWATVQDFTAPERHNQYACFEDGS 271

Query: 151 TCD 143
            CD
Sbjct: 272 NCD 274


>ref|XP_005756898.1| hypothetical protein EMIHUDRAFT_446652 [Emiliania huxleyi CCMP1516]
 gb|EOD04469.1| hypothetical protein EMIHUDRAFT_446652 [Emiliania huxleyi CCMP1516]
          Length = 350

 Score =  209 bits (533), Expect = 4e-63
 Identities = 101/183 (55%), Positives = 127/183 (69%), Gaps = 2/183 (1%)
 Frame = -1

Query: 685 GLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVTIS 506
           G+ Y ++ PY+AC+++ K+G C   DWSC P+NVARTCGTF   G +CVGL+ +PN TI+
Sbjct: 146 GISYATLQPYMACSAESKEGTCAHGDWSCTPMNVARTCGTF---GQECVGLSHFPNATIA 202

Query: 505 DYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVVSVTGWGYDEAS 326
           +YG ISGA AMQKEIF RGPI+C +DA PI  YT GI T +    DHV+SV GWG     
Sbjct: 203 EYGHISGARAMQKEIFARGPIACTIDAAPIEKYTGGIATQRSFMTDHVISVVGWGTCPKE 262

Query: 325 SKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT--DENNQSHCSEDGA 152
              YWIVRNSWGE WG  G+V V+ G  AL+LE+ C+WA V  FT  + +NQ  C EDG+
Sbjct: 263 G-LYWIVRNSWGEYWGEQGYVKVKSG--ALALEQACAWATVQDFTAPERHNQYACFEDGS 319

Query: 151 TCD 143
            CD
Sbjct: 320 NCD 322


>gb|OLP88871.1| Cathepsin Z [Symbiodinium microadriaticum]
          Length = 644

 Score =  216 bits (551), Expect = 7e-63
 Identities = 103/179 (57%), Positives = 126/179 (70%)
 Frame = -1

Query: 718 WISSISDKTGTGLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCV 539
           W+  IS K GTG+ YE+ NPY+AC+SD  +G CK  D SCK LNVARTCG+F + GG C 
Sbjct: 458 WLMEISKK-GTGISYETANPYVACSSDSDEGFCKHVDTSCKALNVARTCGSFSQEGGPCT 516

Query: 538 GLTKYPNVTISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVV 359
           GL ++PN TISDYGSISGA AM KEIF+RGPISCGVDA P+++Y +GI+  +G G DHVV
Sbjct: 517 GLGEFPNATISDYGSISGADAMMKEIFHRGPISCGVDANPLLNYESGIIKTKGEGVDHVV 576

Query: 358 SVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFTDEN 182
           SV GWG D +    YWIVRNSWGE WG MG+  V  G   L    + +  +VD  +D N
Sbjct: 577 SVVGWGTD-SKDGMYWIVRNSWGEYWGEMGYFRVAKGALLLEAGTKITSIIVDAISDNN 634


>ref|XP_005766319.1| hypothetical protein EMIHUDRAFT_437057 [Emiliania huxleyi CCMP1516]
 gb|EOD13890.1| hypothetical protein EMIHUDRAFT_437057 [Emiliania huxleyi CCMP1516]
          Length = 350

 Score =  205 bits (522), Expect = 2e-61
 Identities = 100/183 (54%), Positives = 126/183 (68%), Gaps = 2/183 (1%)
 Frame = -1

Query: 685 GLVYESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVTIS 506
           G+ Y ++ PY+AC+++ K+G C   DWSC P+NVARTCGTF   G +CVGL+ +PN TI+
Sbjct: 146 GISYATLQPYMACSAESKEGTCAHGDWSCTPMNVARTCGTF---GQECVGLSHFPNATIA 202

Query: 505 DYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQDHVVSVTGWGYDEAS 326
           +YG ISGA AMQKEIF RGPI+C +DA PI  YT GI T +    DHV+SV GWG     
Sbjct: 203 EYGHISGARAMQKEIFARGPIACTIDAAPIEKYTGGIATQRSFMTDHVISVVGWGTCPKE 262

Query: 325 SKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT--DENNQSHCSEDGA 152
              YWIVRNSWGE WG  G+V V+ G  AL+LE+ C+WA V  FT  + +NQ  C EDG+
Sbjct: 263 G-LYWIVRNSWGEYWGEQGYVKVKSG--ALALEQACAWATVQDFTAPERHNQYACFEDGS 319

Query: 151 TCD 143
             D
Sbjct: 320 NWD 322


>ref|XP_014156933.1| cathepsin X [Sphaeroforma arctica JP610]
 gb|KNC83031.1| cathepsin X [Sphaeroforma arctica JP610]
          Length = 362

 Score =  175 bits (443), Expect = 2e-49
 Identities = 77/178 (43%), Positives = 111/178 (62%), Gaps = 1/178 (0%)
 Frame = -1

Query: 676 YESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVTISDYG 497
           Y++   Y AC+++  +G CK +D++C   N  RTC TF   GG C  + +YPN TI++YG
Sbjct: 170 YDTCLQYEACSAESTEGTCKDRDFTCTGANTCRTCSTFTAYGGFCSEVDRYPNATIAEYG 229

Query: 496 SISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIV-TAQGSGQDHVVSVTGWGYDEASSK 320
           ++ G   M  EI+ RGPI+CG+DA P+ +Y  GIV        +H++SV GWG D+ +  
Sbjct: 230 NVVGEYNMMAEIYRRGPIACGIDASPVDNYKGGIVDNTSAKSINHIISVVGWGVDKKTDT 289

Query: 319 SYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFTDENNQSHCSEDGATC 146
            YWIVRNSWGE WG MG+  V+ G + L++E  C+WA    +T+ N    C EDG+ C
Sbjct: 290 PYWIVRNSWGEYWGEMGYFRVKRGENQLAIESSCAWATPGTWTEMN--FPCYEDGSNC 345


>ref|XP_002288580.1| probable papain cysteine protease [Thalassiosira pseudonana
           CCMP1335]
 gb|EED94016.1| probable papain cysteine protease [Thalassiosira pseudonana
           CCMP1335]
          Length = 336

 Score =  174 bits (441), Expect = 2e-49
 Identities = 84/186 (45%), Positives = 115/186 (61%), Gaps = 9/186 (4%)
 Frame = -1

Query: 676 YESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVTISDYG 497
           Y++  PYLAC+SD ++G C   D +C  LN  RTCG F ESGG CVGL  +PN T+++YG
Sbjct: 116 YDTCQPYLACSSDSQEGFCGYVDTTCNALNTCRTCGGFSESGGSCVGLDFFPNATVAEYG 175

Query: 496 SISG------AAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSG---QDHVVSVTGW 344
           +ISG         ++ EI  RGP++  + A P+ D+  G + +  S     +H+V++ GW
Sbjct: 176 TISGTDDADRVKKIKMEIKARGPVAATIQAGPLRDFMGGSIFSDDSAPKFPNHIVAIVGW 235

Query: 343 GYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFTDENNQSHCS 164
           G D  S+KS+W+VRNSWG  WG  GF  VEMG + L +E   +WA V  FT  N    C+
Sbjct: 236 GKDVESNKSFWVVRNSWGYYWGEEGFFRVEMGKNILGIEMGVAWATVGSFTTAN--VPCT 293

Query: 163 EDGATC 146
           EDGATC
Sbjct: 294 EDGATC 299


>gb|EJK74471.1| hypothetical protein THAOC_03848, partial [Thalassiosira oceanica]
          Length = 281

 Score =  166 bits (421), Expect = 4e-47
 Identities = 80/182 (43%), Positives = 110/182 (60%), Gaps = 8/182 (4%)
 Frame = -1

Query: 676 YESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVTISDYG 497
           Y++  PYLAC+ D  +G C   D SC   NV RTC TF   GG C  +  +PN T+++YG
Sbjct: 77  YDTCQPYLACSDDSDEGFCSSVDTSCSKHNVCRTCSTFSSRGGACTEIDFFPNATVAEYG 136

Query: 496 -----SISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIV---TAQGSGQDHVVSVTGWG 341
                S +    ++ EI+ RGP++ GV+A+P++DY  GIV   T      DH+VS+ GWG
Sbjct: 137 SYNLLSFNRIHKIKSEIYARGPVAAGVNADPLLDYKGGIVKEGTIVNMIIDHIVSIVGWG 196

Query: 340 YDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFTDENNQSHCSE 161
            DE   + YWIVRNSWG+ WG MGF  +++G++ L +E   +WA   KFT ENN   C E
Sbjct: 197 TDEEGDE-YWIVRNSWGQYWGEMGFFRIKIGSNLLGIESSIAWATPGKFTTENN-FPCGE 254

Query: 160 DG 155
            G
Sbjct: 255 SG 256


>gb|OEU10039.1| cysteine proteinase [Fragilariopsis cylindrus CCMP1102]
          Length = 361

 Score =  166 bits (421), Expect = 3e-46
 Identities = 79/192 (41%), Positives = 117/192 (60%), Gaps = 11/192 (5%)
 Frame = -1

Query: 688 TGLV-YESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVT 512
           TG V Y++  PYLAC+++   G C   D SC  LN  +TC TF   GGKC  +  +PN T
Sbjct: 140 TGYVPYDTCTPYLACSAESTDGFCGKIDTSCSALNTCKTCDTFGGMGGKCTEIDFFPNAT 199

Query: 511 ISDYGSISGAA--------AMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQ--DHV 362
           +++YG +   +         +  EI++RGP++  ++AEPIV YT GI T +   +  +H+
Sbjct: 200 VAEYGIVDYDSNDKEGVSHKIMSEIYSRGPVAATINAEPIVKYTGGIFTDENYSEQTNHI 259

Query: 361 VSVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFTDEN 182
           VS+TGWG D+ S   +WIVRNSWG+ WG MG++ +EMG + L +E + +W V   +T  N
Sbjct: 260 VSITGWGTDKESGTKFWIVRNSWGQYWGEMGYMRLEMGKNLLGIEGEIAWVVPGTYT--N 317

Query: 181 NQSHCSEDGATC 146
           +   C EDG+ C
Sbjct: 318 HNFACYEDGSNC 329


>ref|XP_001415619.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gb|ABO93911.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 316

 Score =  158 bits (400), Expect = 1e-43
 Identities = 77/184 (41%), Positives = 114/184 (61%), Gaps = 3/184 (1%)
 Frame = -1

Query: 688 TGLV-YESVNPYLACTSDLKQGIC-KGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNV 515
           TG V Y++  PY AC+++  +G C +G D++C  +N  RTC TF E GG C  L+ +PN 
Sbjct: 112 TGFVPYDTCLPYEACSAESTEGNCARGGDYTCTAMNTCRTCSTFAEFGGFCSALSTFPNA 171

Query: 514 TISDYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQ-DHVVSVTGWGY 338
           T+++YG ISG   +  EIF RGP+S G+DA+ +  Y  GI       + +H+VS+ GWG 
Sbjct: 172 TVAEYGMISGEKEIMAEIFARGPVSAGIDADGLRGYVGGIYKDTPDFEINHIVSIVGWGT 231

Query: 337 DEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFTDENNQSHCSED 158
            +  +K YW+VRNSWG+ WG MGF  +  G ++L +E++ +WA    +T  N    C ED
Sbjct: 232 ADDGTK-YWVVRNSWGQYWGEMGFFRIIRGVNSLGIEDEVAWATPGSWTHMN--FACYED 288

Query: 157 GATC 146
           G+ C
Sbjct: 289 GSNC 292


>ref|XP_002507788.1| cysteine endopeptidase [Micromonas commoda]
 gb|ACO69046.1| cysteine endopeptidase [Micromonas commoda]
          Length = 388

 Score =  159 bits (401), Expect = 5e-43
 Identities = 74/183 (40%), Positives = 108/183 (59%), Gaps = 6/183 (3%)
 Frame = -1

Query: 676 YESVNPYLACTSDLKQGICKGQD---WSCKPLNVARTCGTFVESGGKCVGLTKYPNVTIS 506
           +E+   Y AC+++  +G C   D   + CKP N  RTC TF + GG C  L  +PN +I+
Sbjct: 182 FETCLVYEACSAESSEGSCAAGDVSRYECKPENTCRTCSTFSDMGGFCSALDSFPNASIA 241

Query: 505 DYGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQ---DHVVSVTGWGYD 335
           +YG +SG   +  E++ RGP++ G+DA  + +YT GI+      +   +H+V++ GWG  
Sbjct: 242 EYGEVSGEKEIMAEVYARGPVAAGIDANLLDEYTGGILDQPADYEYEINHIVAIVGWGET 301

Query: 334 EASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFTDENNQSHCSEDG 155
           +   K YWIVRNSWGE WG MGF  +  G  AL +E++CSWA    +T  N    C EDG
Sbjct: 302 KKGEK-YWIVRNSWGEYWGEMGFFRIVRGKKALGIEDECSWATPASWTTHN--QGCYEDG 358

Query: 154 ATC 146
           + C
Sbjct: 359 SNC 361


>ref|XP_007515195.1| predicted protein [Bathycoccus prasinos]
 emb|CCO14074.1| predicted protein [Bathycoccus prasinos]
          Length = 413

 Score =  159 bits (402), Expect = 7e-43
 Identities = 73/184 (39%), Positives = 112/184 (60%), Gaps = 7/184 (3%)
 Frame = -1

Query: 676 YESVNPYLACTSDLKQGICK--GQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVTISD 503
           Y++   Y AC+++  +G C      + C  +N  RTC TF + GG C  L  +PN T+ +
Sbjct: 191 YDTCLSYEACSNESIEGSCGYVTDRYKCNAINTCRTCSTFSQLGGFCAPLATFPNATVKE 250

Query: 502 YGSISGAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQ-DHVVSVTGWGYDEAS 326
           YG+ISG  A+ KE++ RGP++ G+DA+ +  YT GI T   S + +H+VS+ GWG ++ +
Sbjct: 251 YGTISGEEAIMKELYARGPVAAGIDADGLRSYTKGIYTDTPSYEINHIVSIVGWGVEKKT 310

Query: 325 SKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDKFT----DENNQSHCSED 158
           +  YWIVRNSWG+ WG MGF  +  G  +L +E++ +WA   ++T    D+     C ED
Sbjct: 311 NTKYWIVRNSWGQYWGEMGFFRIVRGKKSLGIEDEVAWATPGRWTGMKHDDFANFPCFED 370

Query: 157 GATC 146
           GA C
Sbjct: 371 GANC 374


>ref|XP_002177642.1| predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1]
 gb|EEC50456.1| predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 256

 Score =  154 bits (390), Expect = 9e-43
 Identities = 70/162 (43%), Positives = 100/162 (61%), Gaps = 6/162 (3%)
 Frame = -1

Query: 676 YESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVTISDYG 497
           Y++   YLAC+ +  +G C   D SC P N  RTC TF   GG C  +  +PN T+++YG
Sbjct: 93  YDTCMSYLACSEESTEGFCPQLDTSCTPDNTCRTCDTFAGMGGACSRIDYFPNATVAEYG 152

Query: 496 SIS----GAAAMQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQ--DHVVSVTGWGYD 335
            I         +Q E++ RGP++  ++AEPIV+Y  G+    G  Q  +H+VS+TGWG D
Sbjct: 153 LIDLDDFVVHKIQTELYVRGPVAATINAEPIVEYAGGVFGEDGHSQRTNHIVSITGWGTD 212

Query: 334 EASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWA 209
           E + K YWIVRNSWG+ WG MGF+ +E G + L +E + +WA
Sbjct: 213 EDTGKLYWIVRNSWGQYWGEMGFMRIEAGKNLLGIEGEVAWA 254


>dbj|GAX22053.1| cathepsin X [Fistulifera solaris]
          Length = 359

 Score =  155 bits (393), Expect = 4e-42
 Identities = 80/198 (40%), Positives = 117/198 (59%), Gaps = 20/198 (10%)
 Frame = -1

Query: 676 YESVNPYLACTSDLKQGICKGQDWSCKPLNVARTCGTFVESGGKCVGLTKYPNVTISDYG 497
           Y++  PYLAC+S+ ++G CK  D +C P+N  RTC     +   C  L ++PN TI++YG
Sbjct: 153 YDTCMPYLACSSESREGFCKSVDTTCTPMNTCRTC-----TRKGCRALERFPNATIAEYG 207

Query: 496 SIS------GAAA--MQKEIFNRGPISCGVDAEPIVDYTTGIVTAQGSGQ---------- 371
           + S      GA A  ++ EI+ RGP++ GV+AEP+V+Y       +G+G+          
Sbjct: 208 TYSYVTDGFGAVANKIKAEIYARGPVAAGVNAEPLVNY-------KGNGEIIRETSIFKM 260

Query: 370 --DHVVSVTGWGYDEASSKSYWIVRNSWGESWGNMGFVYVEMGNDALSLEEQCSWAVVDK 197
             +H+VS+ GWG D  +   YWIVRNSWG  WGNMG+  + MG++AL +E + SWA    
Sbjct: 261 LVNHIVSIVGWGMDPETGDQYWIVRNSWGAYWGNMGYFNILMGHNALGIELEVSWATPGT 320

Query: 196 FTDENNQSHCSEDGATCD 143
           FT +N    C  DG+ CD
Sbjct: 321 FTTKNYP--CHVDGSDCD 336

