BLASTX nr result

ID: Ophiopogon25_contig00053415 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon25_contig00053415
         (475 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KOO35432.1| papain family cysteine protease containing protei...   145   8e-41
gb|PRP82211.1| hypothetical protein PROFUN_10420 [Planoprotostel...   141   1e-37
gb|KOO28060.1| papain family cysteine protease containing protei...   140   2e-37
gb|ABH06549.2| cathepsin L cysteine protease ICP1, partial [Icht...   136   9e-36
ref|XP_001019547.3| papain family cysteine protease [Tetrahymena...   130   1e-33
gb|PRP89226.1| hypothetical protein PROFUN_02100 [Planoprotostel...   128   1e-32
ref|XP_023332086.1| zingipain-2-like isoform X1 [Eurytemora affi...   125   2e-31
ref|XP_001015624.1| papain family cysteine protease [Tetrahymena...   124   3e-31
ref|XP_004993594.1| hypothetical protein PTSG_05727 [Salpingoeca...   124   5e-31
ref|XP_004358308.1| papain family cysteine protease subfamily pr...   123   1e-30
gb|AII16495.1| cathepsin K, partial [Paracyclopina nana]              120   9e-30
ref|XP_023324712.1| ervatamin-C-like [Eurytemora affinis]             120   1e-29
ref|XP_001022365.1| papain family cysteine protease [Tetrahymena...   120   2e-29
ref|XP_009034544.1| hypothetical protein AURANDRAFT_22037 [Aureo...   118   7e-29
ref|XP_001012928.1| papain family cysteine protease [Tetrahymena...   118   7e-29
ref|XP_001022291.2| hypothetical protein TTHERM_00502370 [Tetrah...   117   2e-28
ref|XP_004341646.1| papain family cysteine protease containing p...   117   2e-28
ref|XP_004997373.1| hypothetical protein PTSG_11722 [Salpingoeca...   117   3e-28
ref|XP_002185251.1| predicted protein [Phaeodactylum tricornutum...   117   5e-28
gb|EJK56325.1| hypothetical protein THAOC_23815 [Thalassiosira o...   115   1e-27

>gb|KOO35432.1| papain family cysteine protease containing protein
           [Chrysochromulina sp. CCMP291]
          Length = 218

 Score =  145 bits (366), Expect = 8e-41
 Identities = 79/141 (56%), Positives = 89/141 (63%), Gaps = 5/141 (3%)
 Frame = -2

Query: 462 GITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSK----CDADVDHAIQL 298
           GITG V LP NNYT+LM+A     PIAIS  A +W  YE GV S        ++DHA+QL
Sbjct: 80  GITGKVELPTNNYTSLMHALVTAGPIAISVDA-SWGAYEGGVFSDPKSGIHTNIDHAVQL 138

Query: 297 VGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXXX 118
           VG+GT     DYWLVRNSWG SWGE GYI+I+RFGEGKEPC TD                
Sbjct: 139 VGYGT-MGGKDYWLVRNSWGASWGEQGYIKIERFGEGKEPCGTDARPGDGFGCAGGPSTI 197

Query: 117 SVCGLCGILSDSSYPTGGHLV 55
            VCG  GILS SSYPTG HL+
Sbjct: 198 QVCGTSGILSGSSYPTGAHLI 218


>gb|PRP82211.1| hypothetical protein PROFUN_10420 [Planoprotostelium fungivorum]
          Length = 380

 Score =  141 bits (356), Expect = 1e-37
 Identities = 75/138 (54%), Positives = 83/138 (60%), Gaps = 4/138 (2%)
 Frame = -2

Query: 471 VVAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCDA---DVDHAI 304
           V A ITGYV LP N Y  LM A A + PIAIS  A AW  YESGV + C+    D+DHA+
Sbjct: 240 VFANITGYVTLPTNKYKPLMIAVATLGPIAISVDASAWHDYESGVFNGCNQTNPDIDHAV 299

Query: 303 QLVGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXX 124
           QLVG+GTD+K GDYWLVRNSWG  WGE GYIRI R    +  C  D              
Sbjct: 300 QLVGYGTDEKLGDYWLVRNSWGPKWGEKGYIRIYRSDSEEGRCGEDITPRDGEGCADGPP 359

Query: 123 XXSVCGLCGILSDSSYPT 70
              VCG CGIL DS YPT
Sbjct: 360 QVEVCGTCGILFDSVYPT 377


>gb|KOO28060.1| papain family cysteine protease containing protein
           [Chrysochromulina sp. CCMP291]
          Length = 376

 Score =  140 bits (354), Expect = 2e-37
 Identities = 77/141 (54%), Positives = 95/141 (67%), Gaps = 3/141 (2%)
 Frame = -2

Query: 468 VAGITGYVALPANNYTALMNAGA-QMPIAISAAAGAWQLYESGVLSKCDAD--VDHAIQL 298
           V GI+GY  LP N+YTAL+ A   + PIAIS  A +W  YE GV S  D++  +DHA+QL
Sbjct: 239 VIGISGYEDLPMNDYTALLEALVNEGPIAISVDA-SWGAYEDGVFSS-DSNWLIDHAVQL 296

Query: 297 VGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXXX 118
           VG+GT+    DYWLVRNSWGTSWGE GYI++++FGEGKEPC TDT               
Sbjct: 297 VGYGTEG-GKDYWLVRNSWGTSWGENGYIKLEKFGEGKEPCGTDTAPASGFVCEGGPSTI 355

Query: 117 SVCGLCGILSDSSYPTGGHLV 55
           +VCG  G+LS SSYPTG  L+
Sbjct: 356 TVCGTSGMLSGSSYPTGAKLL 376


>gb|ABH06549.2| cathepsin L cysteine protease ICP1, partial [Ichthyophthirius
           multifiliis]
          Length = 374

 Score =  136 bits (343), Expect = 9e-36
 Identities = 71/140 (50%), Positives = 85/140 (60%), Gaps = 5/140 (3%)
 Frame = -2

Query: 459 ITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCDA----DVDHAIQLV 295
           + GY+ LP N+Y  L++A A + PIAIS  A  W  YE GV S CD     ++DHA+ L+
Sbjct: 236 VDGYLKLPVNSYEHLLHAIATVGPIAISVDASKWHDYEEGVYSGCDVTQNIEIDHAVTLI 295

Query: 294 GWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXXXS 115
           G+GTD+K GDYWLVRNSWGT WGE GYIR+KR  E    C TD                 
Sbjct: 296 GYGTDEKLGDYWLVRNSWGTKWGENGYIRLKR--ESTPQCGTDYTPGIGNACRGQNDAQK 353

Query: 114 VCGLCGILSDSSYPTGGHLV 55
           VCG CGILSDSSYP    +V
Sbjct: 354 VCGQCGILSDSSYPLNVRVV 373


>ref|XP_001019547.3| papain family cysteine protease [Tetrahymena thermophila SB210]
 gb|EAR99302.3| papain family cysteine protease (macronuclear) [Tetrahymena
           thermophila SB210]
          Length = 375

 Score =  130 bits (328), Expect = 1e-33
 Identities = 69/134 (51%), Positives = 84/134 (62%), Gaps = 5/134 (3%)
 Frame = -2

Query: 459 ITGYVALPANNYTALMNAGA-QMPIAISAAAGAWQLYESGVLSKCDA----DVDHAIQLV 295
           I GY+ +P N+Y +LMNA A Q P+ IS  A  +  YESGV   CD     D++HA+ LV
Sbjct: 237 IDGYLKVPENDYASLMNAVATQGPLVISVDASNFHDYESGVFHGCDGADNVDINHAVVLV 296

Query: 294 GWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXXXS 115
           G+GTD+K GDYW+VRNSWGT +GE GYIR+KR  E    CKTD                 
Sbjct: 297 GYGTDEKEGDYWIVRNSWGTRFGENGYIRVKR--EATPTCKTDFTPLDGNGCVGFAKPQK 354

Query: 114 VCGLCGILSDSSYP 73
           VCG CGILSDS+YP
Sbjct: 355 VCGQCGILSDSAYP 368


>gb|PRP89226.1| hypothetical protein PROFUN_02100 [Planoprotostelium fungivorum]
          Length = 375

 Score =  128 bits (322), Expect = 1e-32
 Identities = 65/135 (48%), Positives = 83/135 (61%), Gaps = 4/135 (2%)
 Frame = -2

Query: 465 AGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCDA---DVDHAIQL 298
           A ITG+V LP+N Y  +M A A + P+A++  A +W  YESGV + CDA   D+DH +QL
Sbjct: 240 ASITGHVVLPSNQYAPVMKALATVGPLAVNVDASSWSFYESGVYTACDAENVDIDHVVQL 299

Query: 297 VGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXXX 118
           VG+GTD++ GDYW VRNSWG  +GE GYIR+ R  + K  C  D                
Sbjct: 300 VGYGTDEQYGDYWTVRNSWGPKYGEKGYIRLARSSDVK--CGVDKTPLDGTGCAGGPSTQ 357

Query: 117 SVCGLCGILSDSSYP 73
           +VCG CGIL D SYP
Sbjct: 358 TVCGACGILFDVSYP 372


>ref|XP_023332086.1| zingipain-2-like isoform X1 [Eurytemora affinis]
          Length = 362

 Score =  125 bits (313), Expect = 2e-31
 Identities = 66/144 (45%), Positives = 89/144 (61%), Gaps = 6/144 (4%)
 Frame = -2

Query: 468 VAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCDAD----VDHAI 304
           VA ITGY  LP N+  A++   A++ P+AIS AA  ++ Y  GV   C  +    ++HA+
Sbjct: 221 VASITGYNNLPPNSMEAVIQHIAEVGPLAISVAANTFKNYNGGVFDGCSYEDNIALNHAV 280

Query: 303 QLVGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTD-TXXXXXXXXXXXX 127
           QLVG+GTD+  GDYW+VRNSWG  WGE GYIR+KR  E   PC TD T            
Sbjct: 281 QLVGYGTDEDLGDYWIVRNSWGLGWGENGYIRMKR--EANPPCGTDSTTSGHVCQGGPGS 338

Query: 126 XXXSVCGLCGILSDSSYPTGGHLV 55
              +VCG+CG+L ++S+P G HL+
Sbjct: 339 DSLTVCGMCGMLFETSFPLGAHLL 362


>ref|XP_001015624.1| papain family cysteine protease [Tetrahymena thermophila SB210]
 gb|EAR95379.1| papain family cysteine protease [Tetrahymena thermophila SB210]
          Length = 375

 Score =  124 bits (312), Expect = 3e-31
 Identities = 66/132 (50%), Positives = 78/132 (59%), Gaps = 5/132 (3%)
 Frame = -2

Query: 453 GYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCD----ADVDHAIQLVGW 289
           GY  +  N+  AL+ A A + PIAIS  A  W  YE GV   CD     D++HA+ LVG+
Sbjct: 239 GYANVTPNDQNALLEAVATVGPIAISVDASNWASYEEGVFDGCDYSKNVDINHAVVLVGY 298

Query: 288 GTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXXXSVC 109
           GTD K GDYWLVRNSWGT +GE GYIR+KR  E    C  DT                VC
Sbjct: 299 GTDPKYGDYWLVRNSWGTDYGEDGYIRVKR--ESVAQCAMDTTPTDGFGCAGDEEPIKVC 356

Query: 108 GLCGILSDSSYP 73
           G+CGILSDS+YP
Sbjct: 357 GMCGILSDSAYP 368


>ref|XP_004993594.1| hypothetical protein PTSG_05727 [Salpingoeca rosetta]
 gb|EGD74032.1| hypothetical protein PTSG_05727 [Salpingoeca rosetta]
          Length = 398

 Score =  124 bits (312), Expect = 5e-31
 Identities = 67/139 (48%), Positives = 80/139 (57%), Gaps = 4/139 (2%)
 Frame = -2

Query: 468 VAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCDA---DVDHAIQ 301
           V  +TGYV LP+N Y  LM+A A   PI+IS  A AW+ YESG+   C+    D+DHA+Q
Sbjct: 257 VVNVTGYVKLPSNQYEPLMDAIANKGPISISVEAVAWKNYESGIFDGCNQTNPDIDHAVQ 316

Query: 300 LVGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXX 121
           LVG+G D   G YWLVRNSW   WGE+GYIRI+R       C  D               
Sbjct: 317 LVGYGDDNSQG-YWLVRNSWTPHWGESGYIRIRRTANEGGRCGMDITPQDGSGCKGGPDK 375

Query: 120 XSVCGLCGILSDSSYPTGG 64
             VCG CGIL D+ YPT G
Sbjct: 376 VKVCGTCGILFDNVYPTIG 394


>ref|XP_004358308.1| papain family cysteine protease subfamily protein [Acanthamoeba
           castellanii str. Neff]
 gb|ELR25744.1| papain family cysteine protease subfamily protein [Acanthamoeba
           castellanii str. Neff]
          Length = 383

 Score =  123 bits (309), Expect = 1e-30
 Identities = 66/135 (48%), Positives = 79/135 (58%), Gaps = 4/135 (2%)
 Frame = -2

Query: 465 AGITGYVALPANNYTALMNAG-AQMPIAISAAAGAWQLYESGVLSKCDA---DVDHAIQL 298
           A IT +V LP+N +  L+ A   Q PIAIS  A +W  YE+GV + C+    D+DHA+QL
Sbjct: 249 AKITNFVKLPSNEHYPLLGAIITQGPIAISVDASSWSSYETGVYNGCNQTNPDIDHAVQL 308

Query: 297 VGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXXX 118
           VG+G D K GDYWLVRNSW  +WGEAGYIRI         C TD                
Sbjct: 309 VGYGKDPKHGDYWLVRNSWSPAWGEAGYIRISM--TSSPQCGTDLNPSDGTGCKGGPPTQ 366

Query: 117 SVCGLCGILSDSSYP 73
            VCG CGIL D+SYP
Sbjct: 367 RVCGTCGILFDNSYP 381


>gb|AII16495.1| cathepsin K, partial [Paracyclopina nana]
          Length = 376

 Score =  120 bits (302), Expect = 9e-30
 Identities = 63/140 (45%), Positives = 84/140 (60%), Gaps = 6/140 (4%)
 Frame = -2

Query: 468 VAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCD----ADVDHAI 304
           VA + GY  LP NNY A+MN  A + P++++  A +W  Y +GV   C+     +++HA+
Sbjct: 229 VATLRGYETLPRNNYEAVMNHLANVGPLSVAVDASSWSFYSTGVFDDCNYSYNIEINHAV 288

Query: 303 QLVGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTD-TXXXXXXXXXXXX 127
           QLVG+GTD+  GDYWLVRNSWG  WG+ GYI++KR  E K  C  D T            
Sbjct: 289 QLVGYGTDEFEGDYWLVRNSWGGFWGDDGYIKLKRESETK--CGIDSTPLMGTGCPNDGN 346

Query: 126 XXXSVCGLCGILSDSSYPTG 67
              +VCG CGIL D+ YP G
Sbjct: 347 EVLTVCGQCGILFDTCYPIG 366


>ref|XP_023324712.1| ervatamin-C-like [Eurytemora affinis]
          Length = 359

 Score =  120 bits (300), Expect = 1e-29
 Identities = 63/137 (45%), Positives = 81/137 (59%), Gaps = 5/137 (3%)
 Frame = -2

Query: 468 VAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCDAD----VDHAI 304
           V GITGY  +PAN+  A +   A + P+A++A A AWQ Y SGV   C  +    ++HA+
Sbjct: 217 VVGITGYNTIPANDLEATLQHVANVGPLAVAADASAWQFYGSGVFGSCAYEDNIALNHAV 276

Query: 303 QLVGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXX 124
           QLVG+G+D   GDYWLVRNSWG +WGE GYIR++R  E      +               
Sbjct: 277 QLVGYGSDDAHGDYWLVRNSWGKNWGEHGYIRLQRESELMCGINSTPMDGTACENGPGTD 336

Query: 123 XXSVCGLCGILSDSSYP 73
             +VCG CGIL DSSYP
Sbjct: 337 EQTVCGQCGILFDSSYP 353


>ref|XP_001022365.1| papain family cysteine protease [Tetrahymena thermophila SB210]
 gb|EAS02120.1| papain family cysteine protease [Tetrahymena thermophila SB210]
          Length = 376

 Score =  120 bits (300), Expect = 2e-29
 Identities = 62/145 (42%), Positives = 85/145 (58%), Gaps = 5/145 (3%)
 Frame = -2

Query: 474 TVVAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKC----DADVDH 310
           T    + GYV L AN+Y AL+ A A + P+A++  A  W+ Y+SGV + C    + DV+H
Sbjct: 233 TPEVALDGYVKLQANDYDALLYALANIGPLAVAVDASQWRNYQSGVFNGCSYTDNIDVNH 292

Query: 309 AIQLVGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXX 130
            + LVG+GTD + GDYWL+RNSWGT +GE GYIR+ R  E K  C TD            
Sbjct: 293 VVVLVGYGTDPELGDYWLIRNSWGTKFGENGYIRLAR--ESKVTCGTDYTPLDGQACAGQ 350

Query: 129 XXXXSVCGLCGILSDSSYPTGGHLV 55
                VCG CG+  D++YPT   ++
Sbjct: 351 NVPTKVCGQCGVAYDAAYPTNVRVI 375


>ref|XP_009034544.1| hypothetical protein AURANDRAFT_22037 [Aureococcus anophagefferens]
 gb|EGB10983.1| hypothetical protein AURANDRAFT_22037 [Aureococcus anophagefferens]
          Length = 372

 Score =  118 bits (296), Expect = 7e-29
 Identities = 67/143 (46%), Positives = 84/143 (58%), Gaps = 5/143 (3%)
 Frame = -2

Query: 468 VAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSK--CDADVDHAIQL 298
           VA I+G+  LP N Y  LM A A   P+A+S  A +W  YESG+      D+ +DHA+ L
Sbjct: 231 VATISGFEKLPVNEYAPLMRAVATKGPVAVSVDA-SWGGYESGIFETDTYDSVIDHAVVL 289

Query: 297 VGWGTDKKAG-DYWLVRNSWGTSWGEAGYIRIKRFGEGKEP-CKTDTXXXXXXXXXXXXX 124
           VG+GTD+  G DYWLVRNSWG +WGE GYIR++R     +  C  DT             
Sbjct: 290 VGYGTDEALGKDYWLVRNSWGPTWGEKGYIRLRRHAADLDAYCGVDTKPLDGVGCAAGPA 349

Query: 123 XXSVCGLCGILSDSSYPTGGHLV 55
             + CG  GILSDS+YP GG LV
Sbjct: 350 NMTTCGTSGILSDSAYPVGGRLV 372


>ref|XP_001012928.1| papain family cysteine protease [Tetrahymena thermophila SB210]
 gb|EAR92683.1| papain family cysteine protease (macronuclear) [Tetrahymena
           thermophila SB210]
          Length = 377

 Score =  118 bits (296), Expect = 7e-29
 Identities = 61/145 (42%), Positives = 82/145 (56%), Gaps = 5/145 (3%)
 Frame = -2

Query: 474 TVVAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCD----ADVDH 310
           T    + GYV L AN+Y AL+ A A + P+A++     W  Y+SG+ + CD     DV+H
Sbjct: 234 TPEVSLDGYVKLQANDYNALLYAVATIGPLAVAVDGAKWHSYQSGIYNGCDYSQNIDVNH 293

Query: 309 AIQLVGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXX 130
            + L G+GTD + GDYWL+RNSWGTS+GE GYIR+ R  E K  C TD            
Sbjct: 294 VVVLEGYGTDPELGDYWLIRNSWGTSFGENGYIRLAR--ESKVTCGTDYSPLDGQACSGQ 351

Query: 129 XXXXSVCGLCGILSDSSYPTGGHLV 55
                VCG CG+  DS+YP    ++
Sbjct: 352 NIPTKVCGQCGVAYDSAYPVNVQVI 376


>ref|XP_001022291.2| hypothetical protein TTHERM_00502370 [Tetrahymena thermophila
           SB210]
 gb|EAS02046.2| hypothetical protein TTHERM_00502370 (macronuclear) [Tetrahymena
           thermophila SB210]
          Length = 376

 Score =  117 bits (293), Expect = 2e-28
 Identities = 64/132 (48%), Positives = 78/132 (59%), Gaps = 5/132 (3%)
 Frame = -2

Query: 453 GYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCD----ADVDHAIQLVGW 289
           G+  +  N+  AL+ A A + PIAIS  +  W  YE GV   CD     ++DHA+ LVG+
Sbjct: 240 GFAKVTPNDQQALLEAVATIGPIAISIDSTGWDSYEEGVFDGCDYSENINIDHAVVLVGY 299

Query: 288 GTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXXXSVC 109
           GTD K GDYWLVRNS+GT +GE GYIRIKR  E    C  D+                VC
Sbjct: 300 GTDPKYGDYWLVRNSYGTEFGEDGYIRIKR--EAVPQCGIDSTPTNGFACGGDETPIKVC 357

Query: 108 GLCGILSDSSYP 73
           G+CGILSDSSYP
Sbjct: 358 GMCGILSDSSYP 369


>ref|XP_004341646.1| papain family cysteine protease containing protein [Acanthamoeba
           castellanii str. Neff]
 gb|ELR19560.1| papain family cysteine protease containing protein [Acanthamoeba
           castellanii str. Neff]
          Length = 385

 Score =  117 bits (293), Expect = 2e-28
 Identities = 61/137 (44%), Positives = 76/137 (55%), Gaps = 4/137 (2%)
 Frame = -2

Query: 471 VVAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCDA---DVDHAI 304
           VVA +  YV LP+N Y  ++ A     P+ I+  A +W  YESGV   C+    D++H +
Sbjct: 246 VVAKVKNYVVLPSNKYDPVIEALTTTGPLVINVDASSWHAYESGVFDGCNQTNPDINHVV 305

Query: 303 QLVGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXX 124
           QLVG+GTD K GDYWLVRNSW   WGE GYIR+KR       C  D              
Sbjct: 306 QLVGYGTDAKEGDYWLVRNSWSPVWGEKGYIRLKR--RSNPICGIDLKPSDGTGCKGGPA 363

Query: 123 XXSVCGLCGILSDSSYP 73
             +VCG CG+L D SYP
Sbjct: 364 TVTVCGECGLLYDVSYP 380


>ref|XP_004997373.1| hypothetical protein PTSG_11722 [Salpingoeca rosetta]
 gb|EGD80812.1| hypothetical protein PTSG_11722 [Salpingoeca rosetta]
          Length = 372

 Score =  117 bits (292), Expect = 3e-28
 Identities = 64/136 (47%), Positives = 76/136 (55%), Gaps = 4/136 (2%)
 Frame = -2

Query: 459 ITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCDA---DVDHAIQLVG 292
           +TGYV LPAN Y  LM A A   PI+IS  A  W+ YESG+ + C+    D+DH +QLVG
Sbjct: 237 VTGYVKLPANQYEPLMEAVANKGPISISVEAIHWKNYESGIFNGCNQTNPDIDHVVQLVG 296

Query: 291 WGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXXXSV 112
           +GTD   G YWLVRNSW   +GE GYIR+ R     + C  D                  
Sbjct: 297 YGTDNGQG-YWLVRNSWTPHFGEGGYIRLLRASNEGQRCGIDVKPQDGSGCKGGPPTVKA 355

Query: 111 CGLCGILSDSSYPTGG 64
           CG CGIL DS YPT G
Sbjct: 356 CGTCGILFDSVYPTLG 371


>ref|XP_002185251.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gb|EEC43383.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 424

 Score =  117 bits (292), Expect = 5e-28
 Identities = 70/157 (44%), Positives = 86/157 (54%), Gaps = 19/157 (12%)
 Frame = -2

Query: 468 VAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVL-----SKCDADVDHA 307
           VA I G+VALP NNYT LMNA A++ P+A+S AA  W LY+ GV      S+ + +V+H 
Sbjct: 267 VASIQGWVALPTNNYTTLMNAVAKVGPVAVSVAATPWALYKEGVFESSMKSEKETNVNHL 326

Query: 306 IQLVGWGTDKKAG-DYWLVRNSWGTSWGEAGYIRIKRF------------GEGKEPCKTD 166
           + L G+GTD++ G DYWLVRNSWG  WGE GYIR+KR             G    P    
Sbjct: 327 VVLDGYGTDEETGVDYWLVRNSWGPMWGEDGYIRLKRVDPVSLTDPEMDCGMDVTPSDGV 386

Query: 165 TXXXXXXXXXXXXXXXSVCGLCGILSDSSYPTGGHLV 55
                            VCG  GIL DSS P G +LV
Sbjct: 387 ACTIDDKGNSVIPPAVKVCGTSGILFDSSLPLGPYLV 423


>gb|EJK56325.1| hypothetical protein THAOC_23815 [Thalassiosira oceanica]
          Length = 418

 Score =  115 bits (289), Expect = 1e-27
 Identities = 64/150 (42%), Positives = 82/150 (54%), Gaps = 9/150 (6%)
 Frame = -2

Query: 474 TVVAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCDADVDHAIQL 298
           T   G+TG+  LP N+Y ++MNA  Q  P+AI+AAA  W LYE GV S  DA V+HAI L
Sbjct: 264 TASVGVTGWTQLPTNDYKSVMNALVQKGPVAIAAAASDWALYEKGVFSSDDATVNHAILL 323

Query: 297 VGWGTDKKAGD-YWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTD-------TXXXXXXX 142
           VG+G D+  G+ Y+ +RNSWG  +GE GYIR+ R  E    C  D               
Sbjct: 324 VGYGIDEDTGEKYYKIRNSWGPHFGEDGYIRVLRTDEDSTVCNMDNDPLVGLACALDDSG 383

Query: 141 XXXXXXXXSVCGLCGILSDSSYPTGGHLVN 52
                    VCG  G+L D SYP G H ++
Sbjct: 384 NQIDVQPVEVCGASGVLFDVSYPVGVHKID 413

