BLASTX nr result

ID: Ophiopogon27_contig00038764 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon27_contig00038764
         (757 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAK00754.1| predicted protein [Hordeum vulgare subsp. vulgare]    233   4e-72
ref|XP_009032801.1| hypothetical protein AURANDRAFT_18666 [Aureo...   228   5e-70
ref|XP_013762925.1| cruzipain [Thecamonas trahens ATCC 50062] >g...   188   5e-54
ref|XP_004368288.1| cysteine proteinase precursor, putative [Aca...   186   8e-54
ref|XP_004335426.1| cathepsin L, putative [Acanthamoeba castella...   181   7e-52
ref|XP_012756472.1| hypothetical protein SAMD00019534_028850 [Ac...   178   7e-51
gb|KYQ91485.1| hypothetical protein DLAC_08453 [Tieghemostelium ...   176   6e-50
ref|XP_004363040.1| hypothetical protein DFA_03437 [Cavenderia f...   175   2e-49
ref|XP_012759759.1| hypothetical protein SAMD00019534_034130, pa...   174   2e-49
ref|XP_009032695.1| hypothetical protein AURANDRAFT_19240 [Aureo...   172   2e-49
ref|XP_020433674.1| hypothetical protein PPL_05546 [Heterosteliu...   166   3e-46
ref|XP_009040822.1| hypothetical protein AURANDRAFT_5922, partia...   162   7e-46
gb|KOO53669.1| cathepsin l-like protease [Chrysochromulina sp. C...   164   3e-45
gb|KOO22621.1| cysteine proteinase [Chrysochromulina sp. CCMP291]     160   1e-43
ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dicty...   159   2e-43
ref|XP_004344606.1| cysteine protease 5, putative [Acanthamoeba ...   157   4e-43
ref|XP_003293312.1| hypothetical protein DICPUDRAFT_41833 [Dicty...   157   2e-42
ref|XP_012755566.1| hypothetical protein SAMD00019534_046220 [Ac...   152   6e-41
ref|XP_009037507.1| hypothetical protein AURANDRAFT_5846, partia...   148   9e-41
ref|XP_013753886.1| cysteine proteinase 1 [Thecamonas trahens AT...   149   1e-39

>dbj|BAK00754.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 341

 Score =  233 bits (594), Expect = 4e-72
 Identities = 115/198 (58%), Positives = 138/198 (69%), Gaps = 1/198 (0%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSATEQIES WFLA   L  LSPQQI+SCD TD GC GG T TAY YV+SAGGL++++
Sbjct: 141 WAFSATEQIESNWFLAGNELISLSPQQIVSCDTTDGGCGGGWTYTAYQYVQSAGGLDTDA 200

Query: 576 AYPYSSGAGNTGTCKFK-AASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASI 400
           AYPYSSGAG TGTC     AS  A+ISGF YA P CS +C  QDE + A  +    P S+
Sbjct: 201 AYPYSSGAGVTGTCDNPLPASPAAQISGFGYAIPTCSDSCTNQDENSMAQYMQENSPLSV 260

Query: 399 CVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKSYWIVRNSWGASWGYS 220
           CV+AE WQ YSSG++T   C  S ++ LDHCVQ VGY+   S + YWIVRNSW  +WG  
Sbjct: 261 CVDAEPWQFYSSGIMTVDQC-PSDFSGLDHCVQAVGYDATGS-QPYWIVRNSWNTNWGED 318

Query: 219 GYLYVEYGTNACGVADEA 166
           G++ +  GTN CG+ D A
Sbjct: 319 GFIRLALGTNTCGIGDVA 336


>ref|XP_009032801.1| hypothetical protein AURANDRAFT_18666 [Aureococcus anophagefferens]
 gb|EGB13210.1| hypothetical protein AURANDRAFT_18666 [Aureococcus anophagefferens]
          Length = 346

 Score =  228 bits (580), Expect = 5e-70
 Identities = 116/209 (55%), Positives = 135/209 (64%), Gaps = 12/209 (5%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSATEQIES W LA     V +PQQI+SCDK D GC+GG+T TAYAYV+ AGG+  ES
Sbjct: 133 WAFSATEQIESEWVLAGNDPLVFAPQQIVSCDKVDQGCNGGNTETAYAYVEKAGGMALES 192

Query: 576 AYPYSSG-AGNTGTCKFKAASIVAKISGFSYATPPC-SGACKTQDEATFANNVAAKGPAS 403
           AYPY SG +GNTG CK K  +    +  FSY  P C  G C  QDE   A  +A+ GPAS
Sbjct: 193 AYPYKSGTSGNTGRCK-KFETAGGDVESFSYVVPECKKGKCNDQDEDKMAAALASHGPAS 251

Query: 402 ICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGY----------NKPASGKSYWIV 253
           ICVNA AWQ Y+ GV+T   CG  A   LDHCVQ+VGY           K    K  W V
Sbjct: 252 ICVNAGAWQTYTKGVMTNLQCGSHAANALDHCVQVVGYTGYTGDAKACGKGLKDKCVWNV 311

Query: 252 RNSWGASWGYSGYLYVEYGTNACGVADEA 166
           RNSWG SWGY GY+ V+ G NACG+A++A
Sbjct: 312 RNSWGTSWGYQGYIRVQMGKNACGIANDA 340


>ref|XP_013762925.1| cruzipain [Thecamonas trahens ATCC 50062]
 gb|KNC45942.1| cruzipain [Thecamonas trahens ATCC 50062]
          Length = 394

 Score =  188 bits (477), Expect = 5e-54
 Identities = 97/199 (48%), Positives = 126/199 (63%), Gaps = 2/199 (1%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSA  ++ES W LA   L VLS QQ++ CD TD GC+GGDT +AY Y++ AGGL  E 
Sbjct: 139 WAFSAVSEVESMWALAGHELVVLSEQQVVDCDTTDDGCNGGDTISAYHYIEKAGGLVPEK 198

Query: 576 AYPYSSGAGNTGTCKFKAA--SIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPAS 403
            YPY++     G CK        VAKI G++YAT P      T++E   A N+ + GP S
Sbjct: 199 DYPYTA---RDGKCKDSVVKKDAVAKIMGYNYATSP-----STKNETQLAANLMSTGPVS 250

Query: 402 ICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKSYWIVRNSWGASWGY 223
           ICV+A +WQ Y+SG+L+   CG      LDHCVQ+ G+    S + YW VRNSW  SWG 
Sbjct: 251 ICVDASSWQTYTSGILS--HCG----KQLDHCVQITGWGTSGS-EMYWWVRNSWATSWGM 303

Query: 222 SGYLYVEYGTNACGVADEA 166
           SGY+ +++G N CG+ADEA
Sbjct: 304 SGYIQLKFGQNTCGLADEA 322


>ref|XP_004368288.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii
           str. Neff]
 gb|ELR25533.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii
           str. Neff]
          Length = 330

 Score =  186 bits (471), Expect = 8e-54
 Identities = 97/202 (48%), Positives = 125/202 (61%), Gaps = 6/202 (2%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKT--DAGCDGGDTPTAYAYVKSAGGLES 583
           WAFS TE IES WFL+   L  L+PQQI+ CD+   D GCDGGD PTAY YV  AGGL++
Sbjct: 138 WAFSVTEAIESQWFLSGRKLVSLAPQQIVDCDQGNGDYGCDGGDPPTAYEYVIKAGGLDT 197

Query: 582 ESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPAS 403
           E +YPY++     G C FK +++ AKIS ++Y T        T++E      +A++GP S
Sbjct: 198 EESYPYTA---EDGQCAFKPSAVGAKISNWTYITT-------TKNETEMQYGLASRGPLS 247

Query: 402 ICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGK----SYWIVRNSWGA 235
           ICV+A +WQ Y  GV+T+  C  S    LDHCV + GY+            W +RNSWG 
Sbjct: 248 ICVDASSWQYYIGGVITS-LCEDS----LDHCVMITGYSVQEGWDFMKYDVWNIRNSWGE 302

Query: 234 SWGYSGYLYVEYGTNACGVADE 169
            WGY GYLYV+ G+N CGV DE
Sbjct: 303 DWGYGGYLYVQRGSNLCGVGDE 324


>ref|XP_004335426.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff]
 gb|ELR13413.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff]
          Length = 331

 Score =  181 bits (458), Expect = 7e-52
 Identities = 95/195 (48%), Positives = 121/195 (62%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSATE IES W LA   L  LS QQI+ C   D GC GG    AY YV  A GL++ +
Sbjct: 145 WAFSATENIESQWALAGHKLTGLSMQQIVDCSWWDDGCGGGFPSYAYDYVIDAPGLDALA 204

Query: 576 AYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASIC 397
            YPY++     G+C FK + +VAKIS ++Y T          +E   AN +A  GP S+C
Sbjct: 205 NYPYTAVG---GSCAFKESQVVAKISSWTYTT-------TDSNEHQMANYLAQHGPISVC 254

Query: 396 VNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKSYWIVRNSWGASWGYSG 217
           V+AE+W  Y+ GV  A ACG    T +DHCV  VGYN  A+   YWI+RNSWG SWG  G
Sbjct: 255 VDAESWPSYTGGVYRASACG----TSIDHCVLAVGYNLTAN-PPYWIIRNSWGTSWGLEG 309

Query: 216 YLYVEYGTNACGVAD 172
           Y+++E+GT+AC VA+
Sbjct: 310 YMHLEFGTDACAVAE 324


>ref|XP_012756472.1| hypothetical protein SAMD00019534_028850 [Acytostelium subglobosum
           LB1]
 dbj|GAM19710.1| hypothetical protein SAMD00019534_028850 [Acytostelium subglobosum
           LB1]
          Length = 325

 Score =  178 bits (451), Expect = 7e-51
 Identities = 90/197 (45%), Positives = 119/197 (60%), Gaps = 1/197 (0%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSATEQIE+A+  A       S QQI+ CD  D GC GGD  TAY YV+SAGG+ +++
Sbjct: 135 WAFSATEQIETAFIQAGNAQQFFSEQQIVDCDPFDGGCGGGDPMTAYQYVQSAGGITTDT 194

Query: 576 AYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASIC 397
           AYPY++     GTC+    + VA+I  + YA+          +E      +AA GP SIC
Sbjct: 195 AYPYTA---QDGTCEANTTTKVAQIKTYGYAS-------TAGNETQMKEAIAALGPLSIC 244

Query: 396 VNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN-KPASGKSYWIVRNSWGASWGYS 220
           V+AE W  Y SG++T          DLDHCVQ+VGY+    S   Y+IVRNSWG +WG  
Sbjct: 245 VDAETWMTYQSGIITTDCA-----ADLDHCVQVVGYDVDTTSNIPYYIVRNSWGTTWGQE 299

Query: 219 GYLYVEYGTNACGVADE 169
           GY+Y+  G+N CG+ +E
Sbjct: 300 GYIYIGEGSNLCGITEE 316


>gb|KYQ91485.1| hypothetical protein DLAC_08453 [Tieghemostelium lacteum]
          Length = 354

 Score =  176 bits (447), Expect = 6e-50
 Identities = 96/200 (48%), Positives = 119/200 (59%), Gaps = 3/200 (1%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSATEQIE+AW  A     +LS QQI+ CD  D GC GGD  TA  YV  AGGL SES
Sbjct: 166 WAFSATEQIETAWIKAGNDQVILSEQQIVDCDTNDGGCGGGDPHTAMDYVIKAGGLTSES 225

Query: 576 AYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASIC 397
            YPY     N GTC       VA ISG+  AT P        ++   A +V  +GP SIC
Sbjct: 226 QYPY---IANDGTCHTNFTP-VAHISGYYAATTP-------GNDTQLAYSVMNEGPISIC 274

Query: 396 VNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKS---YWIVRNSWGASWG 226
           V+A +W  YSSG++ +     +  +DLDHCVQ+VG N   +G +   Y+I+RNSWG  WG
Sbjct: 275 VDASSWMTYSSGIIRS-----NCDSDLDHCVQIVGLNVDTNGTTPIPYYIIRNSWGTDWG 329

Query: 225 YSGYLYVEYGTNACGVADEA 166
             G++YVE G + CGV  EA
Sbjct: 330 IDGFIYVEIGHDLCGVTQEA 349


>ref|XP_004363040.1| hypothetical protein DFA_03437 [Cavenderia fasciculata]
 gb|EGG25189.1| hypothetical protein DFA_03437 [Cavenderia fasciculata]
          Length = 341

 Score =  175 bits (443), Expect = 2e-49
 Identities = 88/197 (44%), Positives = 120/197 (60%), Gaps = 1/197 (0%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSATEQIE+A  +A G +  LS QQI+ CD  D GC GGD  TAY YV++ GGL    
Sbjct: 140 WAFSATEQIETANIMAGGQVEYLSEQQIVDCDPYDGGCGGGDPYTAYQYVQNNGGLTLNV 199

Query: 576 AYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASIC 397
            YPY++     G C   + +   +++ F YA+          +E      +AA+GP SIC
Sbjct: 200 TYPYTAA---NGACYANSTAPAVQVTAFGYAS-------SQGNETQLREAMAARGPLSIC 249

Query: 396 VNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKS-YWIVRNSWGASWGYS 220
           VNAE W  Y SG+ ++     +   DLDHCVQ+VGY+  A+ K+ Y+IVRNSWG  WG  
Sbjct: 250 VNAEPWMSYQSGIFSS-----TCSDDLDHCVQIVGYDTDATSKTPYFIVRNSWGTDWGLL 304

Query: 219 GYLYVEYGTNACGVADE 169
           GY+Y++ G+N CG+ +E
Sbjct: 305 GYIYIQAGSNLCGITNE 321


>ref|XP_012759759.1| hypothetical protein SAMD00019534_034130, partial [Acytostelium
           subglobosum LB1]
 dbj|GAM20238.1| hypothetical protein SAMD00019534_034130, partial [Acytostelium
           subglobosum LB1]
          Length = 338

 Score =  174 bits (442), Expect = 2e-49
 Identities = 90/199 (45%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSA EQIESA+ +      + S QQI+ CD  D GC GGDT TAY YV++AGGL + +
Sbjct: 145 WAFSAAEQIESAYIMLGNEAQIASEQQIVDCDSFDGGCGGGDTMTAYKYVETAGGLTTNA 204

Query: 576 AYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASIC 397
           +YPY++     GTC         K++ ++YA+          +E      +AA GP SIC
Sbjct: 205 SYPYTA---QDGTCYANKTKKFVKVTNYNYAS-------SQGNETQLKEAIAALGPLSIC 254

Query: 396 VNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPAS---GKSYWIVRNSWGASWG 226
           V+A +W  Y SG++T+  CG     DLDHCVQLVGY   +S      Y+IVRNSWG  WG
Sbjct: 255 VDAISWMTYQSGIITSN-CG----NDLDHCVQLVGYAIESSVTPNIPYYIVRNSWGLDWG 309

Query: 225 YSGYLYVEYGTNACGVADE 169
             GY+Y+  G N CG+ DE
Sbjct: 310 QEGYIYIGEGQNLCGITDE 328


>ref|XP_009032695.1| hypothetical protein AURANDRAFT_19240 [Aureococcus anophagefferens]
 gb|EGB13096.1| hypothetical protein AURANDRAFT_19240 [Aureococcus anophagefferens]
          Length = 254

 Score =  172 bits (435), Expect = 2e-49
 Identities = 87/216 (40%), Positives = 116/216 (53%), Gaps = 19/216 (8%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKS--AGGLES 583
           WAFS  +Q++S W+L        S QQI+SCD    GC GG   +AYAY+K     GL +
Sbjct: 16  WAFSVAQQVQSEWYLDGNPASEFSAQQIISCDDEMFGCGGGGPVSAYAYIKERVTPGLSN 75

Query: 582 ESAYPYSSGAGNTGTC-----------------KFKAASIVAKISGFSYATPPCSGACKT 454
              YPY  G G+  TC                 +     + A+++ +S+AT  C   C+ 
Sbjct: 76  LWYYPYVQGMGSQRTCLSPKCTETCRGIGTEVEETLLTGVYAQVANYSWATKGCFDDCED 135

Query: 453 QDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPAS 274
           QD       VAA GPASICVNA  W +Y+ GV++   CG   + DLDHCV LVG+N   S
Sbjct: 136 QDLYGLRKAVAAHGPASICVNAANWDVYAGGVMSTATCGSYNFNDLDHCVGLVGFNMD-S 194

Query: 273 GKSYWIVRNSWGASWGYSGYLYVEYGTNACGVADEA 166
              YWIV+N W  +WG  GY++++   N CGVAD A
Sbjct: 195 DPPYWIVKNQWSTTWGVDGYIFLDARNNTCGVADTA 230


>ref|XP_020433674.1| hypothetical protein PPL_05546 [Heterostelium album PN500]
 gb|EFA81557.1| hypothetical protein PPL_05546 [Heterostelium album PN500]
          Length = 341

 Score =  166 bits (421), Expect = 3e-46
 Identities = 86/197 (43%), Positives = 117/197 (59%), Gaps = 1/197 (0%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSA EQIE+A+ +A      +S QQI+ CD  D GC GGD  TAY YV+SAGG+ + +
Sbjct: 148 WAFSAGEQIETAYIMAGNAAQNVSEQQIVDCDPYDGGCGGGDPMTAYQYVQSAGGITTNT 207

Query: 576 AYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASIC 397
            YPY++     GTC  +      +I+ + YA+          +E      +AA+GP SIC
Sbjct: 208 DYPYTA---TDGTCYAQNTPKFTQIASYGYAS-------NKGNETELKQAIAARGPLSIC 257

Query: 396 VNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN-KPASGKSYWIVRNSWGASWGYS 220
           V+AE W  Y SGVL +     +   +LDHCVQ+VGY+ + ++   Y+IVRNSWG  WG  
Sbjct: 258 VDAETWMNYQSGVLNS-----NCPDELDHCVQIVGYDVEQSTNTPYYIVRNSWGTDWGME 312

Query: 219 GYLYVEYGTNACGVADE 169
           GY+ V  G N CG+ DE
Sbjct: 313 GYILVGEGQNLCGITDE 329


>ref|XP_009040822.1| hypothetical protein AURANDRAFT_5922, partial [Aureococcus
           anophagefferens]
 gb|EGB04435.1| hypothetical protein AURANDRAFT_5922, partial [Aureococcus
           anophagefferens]
          Length = 230

 Score =  162 bits (410), Expect = 7e-46
 Identities = 90/205 (43%), Positives = 107/205 (52%), Gaps = 8/205 (3%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDA-----GCDGGDTPTAYAYVKSA-- 598
           WAFSATEQ+ES   LA      LS QQ+ SC          GC GGD   AY Y+  A  
Sbjct: 38  WAFSATEQVESQLVLAGAPQVELSTQQVASCTADPQLMCCDGCAGGDPTAAYEYLAWASR 97

Query: 597 -GGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVA 421
            GGL  ++ +PY  G      C+                 P C+  C   D    A  + 
Sbjct: 98  KGGLAPDAWWPYEQGLTPDEVCE----------------APACTKTCDKDDTDQLAARLE 141

Query: 420 AKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKSYWIVRNSW 241
           A  P S+C+NA AW  Y+ GVL+  ACGG    D+DHCVQLVGYNK     SYWIVRNSW
Sbjct: 142 AS-PLSVCLNAGAWDDYTGGVLSEAACGGHGADDVDHCVQLVGYNKTEPENSYWIVRNSW 200

Query: 240 GASWGYSGYLYVEYGTNACGVADEA 166
             SWG  GY+Y+    NACGVA+EA
Sbjct: 201 STSWGEDGYIYLSMDGNACGVANEA 225


>gb|KOO53669.1| cathepsin l-like protease [Chrysochromulina sp. CCMP291]
          Length = 345

 Score =  164 bits (415), Expect = 3e-45
 Identities = 91/202 (45%), Positives = 118/202 (58%), Gaps = 5/202 (2%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           WAFSATEQ+ES ++   G L  LSPQQ+ SCD    GC+GG+   A+ YV S GG ESES
Sbjct: 146 WAFSATEQLESQYYQTYGKLIELSPQQLTSCDPNCGGCNGGNPINAWIYVNSFGGQESES 205

Query: 576 AYPYSSG-AGNTGTCKFKAASIVAKIS---GFSYATPPCSGACKTQDEATFANNVAAKGP 409
            YPY SG    TG+C  K A +   +    G+  A  P         E+     +    P
Sbjct: 206 DYPYVSGVTKQTGSCSSKIAEVTEAVGADVGYFIAQRPA-------QESNMLKQIGL-SP 257

Query: 408 ASICVNAEAWQLYSSGVLTAKA-CGGSAYTDLDHCVQLVGYNKPASGKSYWIVRNSWGAS 232
            SI V+AE WQ Y+ G++  K+ CG    T +DH VQ+ GYN  A G +YWIVRNSWG +
Sbjct: 258 MSIAVDAELWQTYTGGIIGPKSGCG----TTIDHAVQVTGYN--AEG-NYWIVRNSWGPN 310

Query: 231 WGYSGYLYVEYGTNACGVADEA 166
           WG SG++Y+ YG N CG+  +A
Sbjct: 311 WGESGFVYLTYGDNVCGITSQA 332


>gb|KOO22621.1| cysteine proteinase [Chrysochromulina sp. CCMP291]
          Length = 386

 Score =  160 bits (406), Expect = 1e-43
 Identities = 92/215 (42%), Positives = 117/215 (54%), Gaps = 18/215 (8%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLP--VLSPQQILSCDKTDA-GCDGGDTPTAYAYVKSAGGLE 586
           WAFSATE +ES   L+ G      L+PQQ  SC  + A GC+GG T  AY Y+ +  GL 
Sbjct: 142 WAFSATEAVESQLVLSSGNQVRIELAPQQTTSCTPSPAQGCNGGFTEAAYEYMGTVTGLT 201

Query: 585 SESAYPYSSGAGNTGTCKF----KAASI----------VAKISGFSYATPPC-SGACKTQ 451
           +   YPY      T   +     K A+I           A +SG+ YAT PC SGAC  Q
Sbjct: 202 NSFNYPYMQSLTATSATQACNTAKVAAIDGPMMQLTGGYAAVSGYHYATTPCTSGACANQ 261

Query: 450 DEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASG 271
           D A     +    P S+CVNA +W  Y+ GV+T+ ACG  A    DHCV   G+N  A  
Sbjct: 262 DLAALQAAIETT-PVSVCVNAASWNDYTGGVMTSAACGSMAAKAQDHCVMATGFNTTAP- 319

Query: 270 KSYWIVRNSWGASWGYSGYLYVEYGTNACGVADEA 166
             YWIVRNSW ++WG  GY+Y+E   N CG+AD+A
Sbjct: 320 TPYWIVRNSWSSTWGEYGYIYLEMAENTCGIADDA 354


>ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
 gb|EGC38873.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
          Length = 346

 Score =  159 bits (403), Expect = 2e-43
 Identities = 92/209 (44%), Positives = 117/209 (55%), Gaps = 14/209 (6%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCD----------KTDAGCDGGDTPTAYAYV 607
           W+FS T  IE  W+LA  TL  LS Q ++ CD            DAGCDGG  P AY YV
Sbjct: 143 WSFSTTGNIEGQWYLAGNTLVGLSEQNLVDCDHQCMEYDGQKSCDAGCDGGLQPNAYRYV 202

Query: 606 KSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANN 427
              GGL+SE++YPY +  G++  CKFK+ ++ AKIS F+            Q+E   A  
Sbjct: 203 IENGGLDSENSYPYLAVTGDS--CKFKSGNVAAKISNFTMIP---------QNETQMAGY 251

Query: 426 VAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN--KPASG--KSYW 259
           +A  GP +I  +A  WQ Y  GV     CG S    LDH + +VG++  K   G  K YW
Sbjct: 252 LATHGPLAIAADAAEWQFYIGGVFDLP-CGQS----LDHGILIVGFSAEKNIFGHLKPYW 306

Query: 258 IVRNSWGASWGYSGYLYVEYGTNACGVAD 172
           IV+NSWGASWG  GYLY+  G N CGV+D
Sbjct: 307 IVKNSWGASWGEQGYLYLGKGKNLCGVSD 335


>ref|XP_004344606.1| cysteine protease 5, putative [Acanthamoeba castellanii str. Neff]
 gb|ELR20863.1| cysteine protease 5, putative [Acanthamoeba castellanii str. Neff]
          Length = 315

 Score =  157 bits (398), Expect = 4e-43
 Identities = 87/193 (45%), Positives = 108/193 (55%)
 Frame = -2

Query: 753 AFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESESA 574
           AFSATE IES W LA   L  L+ QQI+ CD TD+GC GG    AY YV SA GLE  + 
Sbjct: 149 AFSATENIESQWALAGNKLTELAMQQIVDCDSTDSGCGGGWPYNAYEYVMSAPGLEPLAD 208

Query: 573 YPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASICV 394
           YPY++  GN   C + +  +VAKIS ++Y T         QDE   AN +A  GP S+CV
Sbjct: 209 YPYTAADGN---CAYNSGEVVAKISDWTYTT-------TDQDEHQMANYLAQHGPISVCV 258

Query: 393 NAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKSYWIVRNSWGASWGYSGY 214
           +A  W LY+                       VGYN  A+   YWI+RNSWGA WG  GY
Sbjct: 259 DASQWSLYT-----------------------VGYNL-AANPPYWIIRNSWGADWGLQGY 294

Query: 213 LYVEYGTNACGVA 175
           +Y+E+G +AC VA
Sbjct: 295 MYLEFGQDACAVA 307


>ref|XP_003293312.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
 gb|EGC30167.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
          Length = 352

 Score =  157 bits (396), Expect = 2e-42
 Identities = 85/201 (42%), Positives = 114/201 (56%), Gaps = 4/201 (1%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESES 577
           + FSATEQIES +  A     +LS QQ + CD  D GC GGD    Y Y+ SAGG+ +E 
Sbjct: 164 YIFSATEQIESEYIRAGHKAILLSEQQSVDCDTMDGGCGGGDPANVYNYIISAGGVSTEK 223

Query: 576 AYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASIC 397
            YPY++     GTC F     V+ I+GF Y T       +  DE T    +A  GP SIC
Sbjct: 224 DYPYTA---QDGTC-FNTTRAVS-ITGFQYVT-------QNSDEDTLITTIANHGPVSIC 271

Query: 396 VNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN----KPASGKSYWIVRNSWGASW 229
           V+A  WQ Y+ G++T          ++DHCVQ+VG +     P++   Y+I+RNSWG SW
Sbjct: 272 VDASTWQSYTGGIITT-----GCEQNIDHCVQVVGLDIDKTDPSNPIPYYIIRNSWGTSW 326

Query: 228 GYSGYLYVEYGTNACGVADEA 166
           G  GY+YV  G+N CG+  E+
Sbjct: 327 GDKGYIYVAQGSNLCGITYES 347


>ref|XP_012755566.1| hypothetical protein SAMD00019534_046220 [Acytostelium subglobosum
           LB1]
 dbj|GAM21447.1| hypothetical protein SAMD00019534_046220 [Acytostelium subglobosum
           LB1]
          Length = 335

 Score =  152 bits (385), Expect = 6e-41
 Identities = 82/204 (40%), Positives = 111/204 (54%), Gaps = 11/204 (5%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKT----------DAGCDGGDTPTAYAYV 607
           W+FSAT  IE AWFLA   L  LS Q ++ CD            D GC+GG  P AY Y+
Sbjct: 138 WSFSATGNIEGAWFLAGNNLTGLSEQNLVDCDHECMQYLGDHVCDQGCNGGLQPNAYEYI 197

Query: 606 KSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANN 427
              GG+++E +YPY+   G T  C F A++I AKIS ++Y +          +E T A+ 
Sbjct: 198 LKNGGIDTEESYPYTGVTGTT--CNFDASNIGAKISSWTYVS---------SNETTMASY 246

Query: 426 VAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN-KPASGKSYWIVR 250
           + A GP +I  +A  WQ YS GV   K CG      LDH + + G+     + + YWIV+
Sbjct: 247 LYANGPLAIAADALTWQYYSGGVFDFKECGSV----LDHGILITGFGVDTTNNEPYWIVK 302

Query: 249 NSWGASWGYSGYLYVEYGTNACGV 178
           NSWGA WG SGY+ +  G   CG+
Sbjct: 303 NSWGADWGESGYMRIIRGKGLCGL 326


>ref|XP_009037507.1| hypothetical protein AURANDRAFT_5846, partial [Aureococcus
           anophagefferens]
 gb|EGB07772.1| hypothetical protein AURANDRAFT_5846, partial [Aureococcus
           anophagefferens]
          Length = 208

 Score =  148 bits (374), Expect = 9e-41
 Identities = 88/206 (42%), Positives = 108/206 (52%), Gaps = 13/206 (6%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDK--------TDAGCDGGDTPTAYAYVKS 601
           WAFS TEQ+ES ++LA G   VLS QQ+ SC K           GC GGD   AY Y+K 
Sbjct: 23  WAFSTTEQVESQFYLAGGPPVVLSAQQVTSCAKYIDDPEEGCCFGCGGGDVTVAYDYIKG 82

Query: 600 AGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQ----DEATFA 433
           A GL   + +PY+        C                  P C+ AC       D    A
Sbjct: 83  AIGLSPAAYWPYTQALTPDEEC----------------LGPFCTNACDMDLSELDLGELA 126

Query: 432 NNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKS-YWI 256
             + A  PA++CVNA AW  Y+ GVL   AC G AY D+DHCVQLVGY+  A+G+  YWI
Sbjct: 127 KTIQAT-PAAVCVNAGAWDDYTGGVLRYDACSG-AYADIDHCVQLVGYD--ATGEEPYWI 182

Query: 255 VRNSWGASWGYSGYLYVEYGTNACGV 178
           VRNSW  SWG  GY+ ++   N CGV
Sbjct: 183 VRNSWSTSWGEDGYIRLQMDANTCGV 208


>ref|XP_013753886.1| cysteine proteinase 1 [Thecamonas trahens ATCC 50062]
 gb|KNC54251.1| cysteine proteinase 1 [Thecamonas trahens ATCC 50062]
          Length = 343

 Score =  149 bits (377), Expect = 1e-39
 Identities = 87/213 (40%), Positives = 117/213 (54%), Gaps = 18/213 (8%)
 Frame = -2

Query: 756 WAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTD--------------AGCDGGDTPTA 619
           WAFSA   IESAW LA   L  LS Q I+ C   +              AGC+GG  P A
Sbjct: 136 WAFSAVSAIESAWALAGNPLVSLSEQNIIDCTLNNEAYCIKYHGQKSCPAGCNGGLQPEA 195

Query: 618 YAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEAT 439
           Y YV +  G+++ES+YPY +     GTC+FK++S+ A +S +++ +       +  +E  
Sbjct: 196 YLYVMANHGIDTESSYPYQAV---DGTCEFKSSSVGATVSNWTFVS-----TYEAPNEEA 247

Query: 438 FANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN--KPASGKS 265
            A  +   GP +I  +A  WQLY  GV     CG S    LDH + +VGY   K   GK 
Sbjct: 248 VAEALVEHGPLAIAADASEWQLYMGGVFDLP-CGHS----LDHGIVIVGYGSKKTILGKE 302

Query: 264 Y--WIVRNSWGASWGYSGYLYVEYGTNACGVAD 172
           +  WI+RNSWGASWG  GY+Y++ G N CGVAD
Sbjct: 303 HPIWIIRNSWGASWGEKGYMYLQRGDNKCGVAD 335


Top