BLASTX nr result
ID: Ophiopogon25_contig00053415
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon25_contig00053415
         (475 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|KOO35432.1| papain family cysteine protease containing protei...  145   8e-41
gb|PRP82211.1| hypothetical protein PROFUN_10420 [Planoprotostel...  141   1e-37
gb|KOO28060.1| papain family cysteine protease containing protei...  140   2e-37
gb|ABH06549.2| cathepsin L cysteine protease ICP1, partial [Icht...  136   9e-36
ref|XP_001019547.3| papain family cysteine protease [Tetrahymena...  130   1e-33
gb|PRP89226.1| hypothetical protein PROFUN_02100 [Planoprotostel...  128   1e-32
ref|XP_023332086.1| zingipain-2-like isoform X1 [Eurytemora affi...  125   2e-31
ref|XP_001015624.1| papain family cysteine protease [Tetrahymena...  124   3e-31
ref|XP_004993594.1| hypothetical protein PTSG_05727 [Salpingoeca...  124   5e-31
ref|XP_004358308.1| papain family cysteine protease subfamily pr...  123   1e-30
gb|AII16495.1| cathepsin K, partial [Paracyclopina nana]             120   9e-30
ref|XP_023324712.1| ervatamin-C-like [Eurytemora affinis]            120   1e-29
ref|XP_001022365.1| papain family cysteine protease [Tetrahymena...  120   2e-29
ref|XP_009034544.1| hypothetical protein AURANDRAFT_22037 [Aureo...  118   7e-29
ref|XP_001012928.1| papain family cysteine protease [Tetrahymena...  118   7e-29
ref|XP_001022291.2| hypothetical protein TTHERM_00502370 [Tetrah...  117   2e-28
ref|XP_004341646.1| papain family cysteine protease containing p...  117   2e-28
ref|XP_004997373.1| hypothetical protein PTSG_11722 [Salpingoeca...  117   3e-28
ref|XP_002185251.1| predicted protein [Phaeodactylum tricornutum...  117   5e-28
gb|EJK56325.1| hypothetical protein THAOC_23815 [Thalassiosira o...  115   1e-27

>gb|KOO35432.1| papain family cysteine protease containing protein [Chrysochromulina sp. CCMP291]
          Length = 218

 Score = 145 bits (366), Expect = 8e-41
 Identities = 79/141 (56%), Positives = 89/141 (63%), Gaps = 5/141 (3%)
 Frame = -2

Query: 462 GITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSK----CDADVDHAIQL 298
           GITG V LP NNYT+LM+A PIAIS A +W YE GV S ++DHA+QL
Sbjct: 80  GITGKVELPTNNYTSLMHALVTAGPIAISVDA-SWGAYEGGVFSDPKSGIHTNIDHAVQL 138

Query: 297 VGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXXX 118
           VG+GT DYWLVRNSWG SWGE GYI+I+RFGEGKEPC TD
Sbjct: 139 VGYGT-MGGKDYWLVRNSWGASWGEQGYIKIERFGEGKEPCGTDARPGDGFGCAGGPSTI 197

Query: 117 SVCGLCGILSDSSYPTGGHLV 55
           VCG GILS SSYPTG HL+
Sbjct: 198 QVCGTSGILSGSSYPTGAHLI 218

>gb|PRP82211.1| hypothetical protein PROFUN_10420 [Planoprotostelium fungivorum]
          Length = 380

 Score = 141 bits (356), Expect = 1e-37
 Identities = 75/138 (54%), Positives = 83/138 (60%), Gaps = 4/138 (2%)
 Frame = -2

Query: 471 VVAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCDA---DVDHAI 304
           VVA ITGYV LP N Y LM A A + PIAIS A AW YESGV + C+ D+DHA+
Sbjct: 240 VFANITGYVTLPTNKYKPLMIAVATLGPIAISVDASAWHDYESGVFNGCNQTNPDIDHAV 299

Query: 303 QLVGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXX 124
           QLVG+GTD+K GDYWLVRNSWG WGE GYIRI R + C D
Sbjct: 300 QLVGYGTDEKLGDYWLVRNSWGPKWGEKGYIRIYRSDSEEGRCGEDITPRDGEGCADGPP 359

Query: 123 XXSVCGLCGILSDSSYPT 70
           VCG CGIL DS YPT
Sbjct: 360 QVEVCGTCGILFDSVYPT 377

>gb|KOO28060.1| papain family cysteine protease containing protein [Chrysochromulina sp.
CCMP291]
          Length = 376

 Score = 140 bits (354), Expect = 2e-37
 Identities = 77/141 (54%), Positives = 95/141 (67%), Gaps = 3/141 (2%)
 Frame = -2

Query: 468 VAGITGYVALPANNYTALMNAGA-QMPIAISAAAGAWQLYESGVLSKCDAD--VDHAIQL 298
           V GI+GY LP N+YTAL+ A + PIAIS A +W YE GV S D++ +DHA+QL
Sbjct: 239 VIGISGYEDLPMNDYTALLEALVNEGPIAISVDA-SWGAYEDGVFSS-DSNWLIDHAVQL 296

Query: 297 VGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXXX 118
           VG+GT+ DYWLVRNSWGTSWGE GYI++++FGEGKEPC TDT
Sbjct: 297 VGYGTEG-GKDYWLVRNSWGTSWGENGYIKLEKFGEGKEPCGTDTAPASGFVCEGGPSTI 355

Query: 117 SVCGLCGILSDSSYPTGGHLV 55
           +VCG G+LS SSYPTG L+
Sbjct: 356 TVCGTSGMLSGSSYPTGAKLL 376

>gb|ABH06549.2| cathepsin L cysteine protease ICP1, partial [Ichthyophthirius multifiliis]
          Length = 374

 Score = 136 bits (343), Expect = 9e-36
 Identities = 71/140 (50%), Positives = 85/140 (60%), Gaps = 5/140 (3%)
 Frame = -2

Query: 459 ITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCDA----DVDHAIQLV 295
           + GY+ LP N+Y L++A A + PIAIS A W YE GV S CD ++DHA+ L+
Sbjct: 236 VDGYLKLPVNSYEHLLHAIATVGPIAISVDASKWHDYEEGVYSGCDVTQNIEIDHAVTLI 295

Query: 294 GWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXXXS 115
           G+GTD+K GDYWLVRNSWGT WGE GYIR+KR E C TD
Sbjct: 296 GYGTDEKLGDYWLVRNSWGTKWGENGYIRLKR--ESTPQCGTDYTPGIGNACRGQNDAQK 353

Query: 114 VCGLCGILSDSSYPTGGHLV 55
           VCG CGILSDSSYP +V
Sbjct: 354 VCGQCGILSDSSYPLNVRVV 373

>ref|XP_001019547.3| papain family cysteine protease [Tetrahymena thermophila SB210]
 gb|EAR99302.3| papain family cysteine protease (macronuclear) [Tetrahymena thermophila SB210]
          Length = 375

 Score = 130 bits (328), Expect = 1e-33
 Identities = 69/134 (51%), Positives = 84/134 (62%), Gaps = 5/134 (3%)
 Frame = -2

Query: 459 ITGYVALPANNYTALMNAGA-QMPIAISAAAGAWQLYESGVLSKCDA----DVDHAIQLV 295
           I GY+ +P N+Y +LMNA A Q P+ IS A + YESGV CD D++HA+ LV
Sbjct: 237 IDGYLKVPENDYASLMNAVATQGPLVISVDASNFHDYESGVFHGCDGADNVDINHAVVLV 296

Query: 294 GWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXXXS 115
           G+GTD+K GDYW+VRNSWGT +GE GYIR+KR E CKTD
Sbjct: 297 GYGTDEKEGDYWIVRNSWGTRFGENGYIRVKR--EATPTCKTDFTPLDGNGCVGFAKPQK 354

Query: 114 VCGLCGILSDSSYP 73
           VCG CGILSDS+YP
Sbjct: 355 VCGQCGILSDSAYP 368

>gb|PRP89226.1| hypothetical protein PROFUN_02100 [Planoprotostelium fungivorum]
          Length = 375

 Score = 128 bits (322), Expect = 1e-32
 Identities = 65/135 (48%), Positives = 83/135 (61%), Gaps = 4/135 (2%)
 Frame = -2

Query: 465 AGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCDA---DVDHAIQL 298
           A ITG+V LP+N Y +M A A + P+A++ A +W YESGV + CDA D+DH +QL
Sbjct: 240 ASITGHVVLPSNQYAPVMKALATVGPLAVNVDASSWSFYESGVYTACDAENVDIDHVVQL 299

Query: 297 VGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXXX 118
           VG+GTD++ GDYW VRNSWG +GE GYIR+ R + K C D
Sbjct: 300 VGYGTDEQYGDYWTVRNSWGPKYGEKGYIRLARSSDVK--CGVDKTPLDGTGCAGGPSTQ 357

Query: 117 SVCGLCGILSDSSYP 73
           +VCG CGIL D SYP
Sbjct: 358 TVCGACGILFDVSYP 372

>ref|XP_023332086.1| zingipain-2-like isoform X1 [Eurytemora affinis]
          Length = 362

 Score = 125 bits (313), Expect = 2e-31
 Identities = 66/144 (45%), Positives = 89/144 (61%), Gaps = 6/144 (4%)
 Frame = -2

Query: 468 VAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCDAD----VDHAI 304
           VA ITGY LP N+ A++ A++ P+AIS AA ++ Y GV C + ++HA+
Sbjct: 221 VASITGYNNLPPNSMEAVIQHIAEVGPLAISVAANTFKNYNGGVFDGCSYEDNIALNHAV 280

Query: 303 QLVGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTD-TXXXXXXXXXXXX 127
           QLVG+GTD+ GDYW+VRNSWG WGE GYIR+KR E PC TD T
Sbjct: 281 QLVGYGTDEDLGDYWIVRNSWGLGWGENGYIRMKR--EANPPCGTDSTTSGHVCQGGPGS 338

Query: 126 XXXSVCGLCGILSDSSYPTGGHLV 55
           +VCG+CG+L ++S+P G HL+
Sbjct: 339 DSLTVCGMCGMLFETSFPLGAHLL 362

>ref|XP_001015624.1| papain family cysteine protease [Tetrahymena thermophila SB210]
 gb|EAR95379.1| papain family cysteine protease [Tetrahymena thermophila SB210]
          Length = 375

 Score = 124 bits (312), Expect = 3e-31
 Identities = 66/132 (50%), Positives = 78/132 (59%), Gaps = 5/132 (3%)
 Frame = -2

Query: 453 GYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCD----ADVDHAIQLVGW 289
           GY + N+ AL+ A A + PIAIS A W YE GV CD D++HA+ LVG+
Sbjct: 239 GYANVTPNDQNALLEAVATVGPIAISVDASNWASYEEGVFDGCDYSKNVDINHAVVLVGY 298

Query: 288 GTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXXXSVC 109
           GTD K GDYWLVRNSWGT +GE GYIR+KR E C DT VC
Sbjct: 299 GTDPKYGDYWLVRNSWGTDYGEDGYIRVKR--ESVAQCAMDTTPTDGFGCAGDEEPIKVC 356

Query: 108 GLCGILSDSSYP 73
           G+CGILSDS+YP
Sbjct: 357 GMCGILSDSAYP 368

>ref|XP_004993594.1| hypothetical protein PTSG_05727 [Salpingoeca rosetta]
 gb|EGD74032.1| hypothetical protein PTSG_05727 [Salpingoeca rosetta]
          Length = 398

 Score = 124 bits (312), Expect = 5e-31
 Identities = 67/139 (48%), Positives = 80/139 (57%), Gaps = 4/139 (2%)
 Frame = -2

Query: 468 VAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCDA---DVDHAIQ 301
           V +TGYV LP+N Y LM+A A PI+IS A AW+ YESG+ C+ D+DHA+Q
Sbjct: 257 VVNVTGYVKLPSNQYEPLMDAIANKGPISISVEAVAWKNYESGIFDGCNQTNPDIDHAVQ 316

Query: 300 LVGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXX 121
           LVG+G D G YWLVRNSW WGE+GYIRI+R C D
Sbjct: 317 LVGYGDDNSQG-YWLVRNSWTPHWGESGYIRIRRTANEGGRCGMDITPQDGSGCKGGPDK 375

Query: 120 XSVCGLCGILSDSSYPTGG 64
           VCG CGIL D+ YPT G
Sbjct: 376 VKVCGTCGILFDNVYPTIG 394

>ref|XP_004358308.1| papain family cysteine protease subfamily protein [Acanthamoeba castellanii str. Neff]
 gb|ELR25744.1| papain family cysteine protease subfamily protein [Acanthamoeba castellanii str.
Neff]
          Length = 383

 Score = 123 bits (309), Expect = 1e-30
 Identities = 66/135 (48%), Positives = 79/135 (58%), Gaps = 4/135 (2%)
 Frame = -2

Query: 465 AGITGYVALPANNYTALMNAG-AQMPIAISAAAGAWQLYESGVLSKCDA---DVDHAIQL 298
           A IT +V LP+N + L+ A Q PIAIS A +W YE+GV + C+ D+DHA+QL
Sbjct: 249 AKITNFVKLPSNEHYPLLGAIITQGPIAISVDASSWSSYETGVYNGCNQTNPDIDHAVQL 308

Query: 297 VGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXXX 118
           VG+G D K GDYWLVRNSW +WGEAGYIRI C TD
Sbjct: 309 VGYGKDPKHGDYWLVRNSWSPAWGEAGYIRISM--TSSPQCGTDLNPSDGTGCKGGPPTQ 366

Query: 117 SVCGLCGILSDSSYP 73
           VCG CGIL D+SYP
Sbjct: 367 RVCGTCGILFDNSYP 381

>gb|AII16495.1| cathepsin K, partial [Paracyclopina nana]
          Length = 376

 Score = 120 bits (302), Expect = 9e-30
 Identities = 63/140 (45%), Positives = 84/140 (60%), Gaps = 6/140 (4%)
 Frame = -2

Query: 468 VAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCD----ADVDHAI 304
           VA + GY LP NNY A+MN A + P++++ A +W Y +GV C+ +++HA+
Sbjct: 229 VATLRGYETLPRNNYEAVMNHLANVGPLSVAVDASSWSFYSTGVFDDCNYSYNIEINHAV 288

Query: 303 QLVGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTD-TXXXXXXXXXXXX 127
           QLVG+GTD+ GDYWLVRNSWG WG+ GYI++KR E K C D T
Sbjct: 289 QLVGYGTDEFEGDYWLVRNSWGGFWGDDGYIKLKRESETK--CGIDSTPLMGTGCPNDGN 346

Query: 126 XXXSVCGLCGILSDSSYPTG 67
           +VCG CGIL D+ YP G
Sbjct: 347 EVLTVCGQCGILFDTCYPIG 366

>ref|XP_023324712.1| ervatamin-C-like [Eurytemora affinis]
          Length = 359

 Score = 120 bits (300), Expect = 1e-29
 Identities = 63/137 (45%), Positives = 81/137 (59%), Gaps = 5/137 (3%)
 Frame = -2

Query: 468 VAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCDAD----VDHAI 304
           V GITGY +PAN+ A + A + P+A++A A AWQ Y SGV C + ++HA+
Sbjct: 217 VVGITGYNTIPANDLEATLQHVANVGPLAVAADASAWQFYGSGVFGSCAYEDNIALNHAV 276

Query: 303 QLVGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXX 124
           QLVG+G+D GDYWLVRNSWG +WGE GYIR++R E +
Sbjct: 277 QLVGYGSDDAHGDYWLVRNSWGKNWGEHGYIRLQRESELMCGINSTPMDGTACENGPGTD 336

Query: 123 XXSVCGLCGILSDSSYP 73
           +VCG CGIL DSSYP
Sbjct: 337 EQTVCGQCGILFDSSYP 353

>ref|XP_001022365.1| papain family cysteine
protease [Tetrahymena thermophila SB210]
 gb|EAS02120.1| papain family cysteine protease [Tetrahymena thermophila SB210]
          Length = 376

 Score = 120 bits (300), Expect = 2e-29
 Identities = 62/145 (42%), Positives = 85/145 (58%), Gaps = 5/145 (3%)
 Frame = -2

Query: 474 TVVAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKC----DADVDH 310
           T + GYV L AN+Y AL+ A A + P+A++ A W+ Y+SGV + C + DV+H
Sbjct: 233 TPEVALDGYVKLQANDYDALLYALANIGPLAVAVDASQWRNYQSGVFNGCSYTDNIDVNH 292

Query: 309 AIQLVGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXX 130
           + LVG+GTD + GDYWL+RNSWGT +GE GYIR+ R E K C TD
Sbjct: 293 VVVLVGYGTDPELGDYWLIRNSWGTKFGENGYIRLAR--ESKVTCGTDYTPLDGQACAGQ 350

Query: 129 XXXXSVCGLCGILSDSSYPTGGHLV 55
           VCG CG+ D++YPT ++
Sbjct: 351 NVPTKVCGQCGVAYDAAYPTNVRVI 375

>ref|XP_009034544.1| hypothetical protein AURANDRAFT_22037 [Aureococcus anophagefferens]
 gb|EGB10983.1| hypothetical protein AURANDRAFT_22037 [Aureococcus anophagefferens]
          Length = 372

 Score = 118 bits (296), Expect = 7e-29
 Identities = 67/143 (46%), Positives = 84/143 (58%), Gaps = 5/143 (3%)
 Frame = -2

Query: 468 VAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSK--CDADVDHAIQL 298
           VA I+G+ LP N Y LM A A P+A+S A +W YESG+ D+ +DHA+ L
Sbjct: 231 VATISGFEKLPVNEYAPLMRAVATKGPVAVSVDA-SWGGYESGIFETDTYDSVIDHAVVL 289

Query: 297 VGWGTDKKAG-DYWLVRNSWGTSWGEAGYIRIKRFGEGKEP-CKTDTXXXXXXXXXXXXX 124
           VG+GTD+ G DYWLVRNSWG +WGE GYIR++R + C DT
Sbjct: 290 VGYGTDEALGKDYWLVRNSWGPTWGEKGYIRLRRHAADLDAYCGVDTKPLDGVGCAAGPA 349

Query: 123 XXSVCGLCGILSDSSYPTGGHLV 55
           + CG GILSDS+YP GG LV
Sbjct: 350 NMTTCGTSGILSDSAYPVGGRLV 372

>ref|XP_001012928.1| papain family cysteine protease [Tetrahymena thermophila SB210]
 gb|EAR92683.1| papain family cysteine protease (macronuclear) [Tetrahymena thermophila SB210]
          Length = 377

 Score = 118 bits (296), Expect = 7e-29
 Identities = 61/145 (42%), Positives = 82/145 (56%), Gaps = 5/145 (3%)
 Frame = -2

Query: 474 TVVAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCD----ADVDH 310
           T + GYV L AN+Y AL+ A A + P+A++ W Y+SG+ + CD DV+H
Sbjct: 234 TPEVSLDGYVKLQANDYNALLYAVATIGPLAVAVDGAKWHSYQSGIYNGCDYSQNIDVNH 293

Query: 309 AIQLVGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXX 130
           + L G+GTD + GDYWL+RNSWGTS+GE GYIR+ R E K C TD
Sbjct: 294 VVVLEGYGTDPELGDYWLIRNSWGTSFGENGYIRLAR--ESKVTCGTDYSPLDGQACSGQ 351

Query: 129 XXXXSVCGLCGILSDSSYPTGGHLV 55
           VCG CG+ DS+YP ++
Sbjct: 352 NIPTKVCGQCGVAYDSAYPVNVQVI 376

>ref|XP_001022291.2| hypothetical protein TTHERM_00502370 [Tetrahymena thermophila SB210]
 gb|EAS02046.2| hypothetical protein TTHERM_00502370 (macronuclear) [Tetrahymena thermophila SB210]
          Length = 376

 Score = 117 bits (293), Expect = 2e-28
 Identities = 64/132 (48%), Positives = 78/132 (59%), Gaps = 5/132 (3%)
 Frame = -2

Query: 453 GYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCD----ADVDHAIQLVGW 289
           G+ + N+ AL+ A A + PIAIS + W YE GV CD ++DHA+ LVG+
Sbjct: 240 GFAKVTPNDQQALLEAVATIGPIAISIDSTGWDSYEEGVFDGCDYSENINIDHAVVLVGY 299

Query: 288 GTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXXXSVC 109
           GTD K GDYWLVRNS+GT +GE GYIRIKR E C D+ VC
Sbjct: 300 GTDPKYGDYWLVRNSYGTEFGEDGYIRIKR--EAVPQCGIDSTPTNGFACGGDETPIKVC 357

Query: 108 GLCGILSDSSYP 73
           G+CGILSDSSYP
Sbjct: 358 GMCGILSDSSYP 369

>ref|XP_004341646.1| papain family cysteine protease containing protein [Acanthamoeba castellanii str. Neff]
 gb|ELR19560.1| papain family cysteine protease containing protein [Acanthamoeba castellanii str.
Neff]
          Length = 385

 Score = 117 bits (293), Expect = 2e-28
 Identities = 61/137 (44%), Positives = 76/137 (55%), Gaps = 4/137 (2%)
 Frame = -2

Query: 471 VVAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCDA---DVDHAI 304
           VVA + YV LP+N Y ++ A P+ I+ A +W YESGV C+ D++H +
Sbjct: 246 VVAKVKNYVVLPSNKYDPVIEALTTTGPLVINVDASSWHAYESGVFDGCNQTNPDINHVV 305

Query: 303 QLVGWGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXX 124
           QLVG+GTD K GDYWLVRNSW WGE GYIR+KR C D
Sbjct: 306 QLVGYGTDAKEGDYWLVRNSWSPVWGEKGYIRLKR--RSNPICGIDLKPSDGTGCKGGPA 363

Query: 123 XXSVCGLCGILSDSSYP 73
           +VCG CG+L D SYP
Sbjct: 364 TVTVCGECGLLYDVSYP 380

>ref|XP_004997373.1| hypothetical protein PTSG_11722 [Salpingoeca rosetta]
 gb|EGD80812.1| hypothetical protein PTSG_11722 [Salpingoeca rosetta]
          Length = 372

 Score = 117 bits (292), Expect = 3e-28
 Identities = 64/136 (47%), Positives = 76/136 (55%), Gaps = 4/136 (2%)
 Frame = -2

Query: 459 ITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCDA---DVDHAIQLVG 292
           +TGYV LPAN Y LM A A PI+IS A W+ YESG+ + C+ D+DH +QLVG
Sbjct: 237 VTGYVKLPANQYEPLMEAVANKGPISISVEAIHWKNYESGIFNGCNQTNPDIDHVVQLVG 296

Query: 291 WGTDKKAGDYWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTDTXXXXXXXXXXXXXXXSV 112
           +GTD G YWLVRNSW +GE GYIR+ R + C D
Sbjct: 297 YGTDNGQG-YWLVRNSWTPHFGEGGYIRLLRASNEGQRCGIDVKPQDGSGCKGGPPTVKA 355

Query: 111 CGLCGILSDSSYPTGG 64
           CG CGIL DS YPT G
Sbjct: 356 CGTCGILFDSVYPTLG 371

>ref|XP_002185251.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gb|EEC43383.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 424

 Score = 117 bits (292), Expect = 5e-28
 Identities = 70/157 (44%), Positives = 86/157 (54%), Gaps = 19/157 (12%)
 Frame = -2

Query: 468 VAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVL-----SKCDADVDHA 307
           VA I G+VALP NNYT LMNA A++ P+A+S AA W LY+ GV S+ + +V+H
Sbjct: 267 VASIQGWVALPTNNYTTLMNAVAKVGPVAVSVAATPWALYKEGVFESSMKSEKETNVNHL 326

Query: 306 IQLVGWGTDKKAG-DYWLVRNSWGTSWGEAGYIRIKRF------------GEGKEPCKTD 166
           + L G+GTD++ G DYWLVRNSWG WGE GYIR+KR G P
Sbjct: 327 VVLDGYGTDEETGVDYWLVRNSWGPMWGEDGYIRLKRVDPVSLTDPEMDCGMDVTPSDGV 386

Query: 165 TXXXXXXXXXXXXXXXSVCGLCGILSDSSYPTGGHLV 55
           VCG GIL DSS P G +LV
Sbjct: 387 ACTIDDKGNSVIPPAVKVCGTSGILFDSSLPLGPYLV 423

>gb|EJK56325.1| hypothetical protein THAOC_23815 [Thalassiosira oceanica]
          Length = 418

 Score = 115 bits (289), Expect = 1e-27
 Identities = 64/150 (42%), Positives = 82/150 (54%), Gaps = 9/150 (6%)
 Frame = -2

Query: 474 TVVAGITGYVALPANNYTALMNAGAQM-PIAISAAAGAWQLYESGVLSKCDADVDHAIQL 298
           T G+TG+ LP N+Y ++MNA Q P+AI+AAA W LYE GV S DA V+HAI L
Sbjct: 264 TASVGVTGWTQLPTNDYKSVMNALVQKGPVAIAAAASDWALYEKGVFSSDDATVNHAILL 323

Query: 297 VGWGTDKKAGD-YWLVRNSWGTSWGEAGYIRIKRFGEGKEPCKTD-------TXXXXXXX 142
           VG+G D+ G+ Y+ +RNSWG +GE GYIR+ R E C D
Sbjct: 324 VGYGIDEDTGEKYYKIRNSWGPHFGEDGYIRVLRTDEDSTVCNMDNDPLVGLACALDDSG 383

Query: 141 XXXXXXXXSVCGLCGILSDSSYPTGGHLVN 52
           VCG G+L D SYP G H ++
Sbjct: 384 NQIDVQPVEVCGASGVLFDVSYPVGVHKID 413