BLASTX nr result

ID: Ophiopogon25_contig00029201 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon25_contig00029201
         (1251 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAK00754.1| predicted protein [Hordeum vulgare subsp. vulgare]    321   e-104
ref|XP_009032801.1| hypothetical protein AURANDRAFT_18666 [Aureo...   300   7e-96
ref|XP_004368288.1| cysteine proteinase precursor, putative [Aca...   251   5e-77
gb|KYQ91485.1| hypothetical protein DLAC_08453 [Tieghemostelium ...   250   2e-76
ref|XP_012756472.1| hypothetical protein SAMD00019534_028850 [Ac...   248   1e-75
ref|XP_004335426.1| cathepsin L, putative [Acanthamoeba castella...   246   7e-75
ref|XP_013762925.1| cruzipain [Thecamonas trahens ATCC 50062] >g...   243   3e-73
emb|CUI14619.1| cysteine peptidase, putative [Bodo saltans]           240   4e-71
ref|XP_012759759.1| hypothetical protein SAMD00019534_034130, pa...   236   6e-71
ref|XP_004363040.1| hypothetical protein DFA_03437 [Cavenderia f...   233   8e-70
gb|KOO53669.1| cathepsin l-like protease [Chrysochromulina sp. C...   229   2e-68
ref|XP_020433674.1| hypothetical protein PPL_05546 [Heterosteliu...   229   2e-68
ref|XP_003293312.1| hypothetical protein DICPUDRAFT_41833 [Dicty...   227   2e-67
gb|KMZ58469.1| Cysteine proteinase cathepsin F [Zostera marina]       226   8e-67
ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dicty...   223   5e-66
gb|OEU13361.1| cysteine proteinase [Fragilariopsis cylindrus CCM...   223   9e-66
gb|OQR72774.1| cathepsin L-like [Tropilaelaps mercedesae]             222   1e-65
ref|XP_020235829.1| LOW QUALITY PROTEIN: cysteine proteinase 15A...   221   4e-65
gb|AAF75546.1| cruzipain [Trypanosoma cruzi]                          224   6e-65
gb|AFA34858.1| cathepsin L-like protein [Trypanosoma dionisii]        223   1e-64

>dbj|BAK00754.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 341

 Score =  321 bits (823), Expect = e-104
 Identities = 182/355 (51%), Positives = 220/355 (61%), Gaps = 1/355 (0%)
 Frame = -2

Query: 1238 LLVLLGLSAFASATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYS 1059
            +++LL L A A+A   P  SD     N A        F +F    SK+Y S  E T RY+
Sbjct: 3    VVLLLALCALAAAYSYP-SSDFELDLNFAK-------FQEFTARFSKNYKSVEEYTTRYA 54

Query: 1058 VFRSNLARIADLNSKNGSPSFGITPFADLTAHEFAKTHLGFKPSLDEESQAARLNTPVFE 879
             F  NL R+A LN ++G   FG+T F D+T  EF  T+LGFKP      + A    PV  
Sbjct: 55   TFLDNLERVAKLN-QDGRGVFGVTKFMDMTPAEFKATYLGFKPD-----EMAPPKAPVAR 108

Query: 878  LDEDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVL 699
                    A G+    VDWR KGAVT VKDQAQCGSCWAFSATEQIES WFLA   L  L
Sbjct: 109  -PHRAKRNATGS----VDWRTKGAVTPVKDQAQCGSCWAFSATEQIESNWFLAGNELISL 163

Query: 698  SPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGTCKFK-AASIV 522
            SPQQI+SCD TD GC GG T TAY YV+SAGGL++++AYPYSSGAG TGTC     AS  
Sbjct: 164  SPQQIVSCDTTDGGCGGGWTYTAYQYVQSAGGLDTDAAYPYSSGAGVTGTCDNPLPASPA 223

Query: 521  AKISGFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGS 342
            A+ISGF YA P CS +C  QDE + A  +    P S+CV+AE WQ YSSG++T   C  S
Sbjct: 224  AQISGFGYAIPTCSDSCTNQDENSMAQYMQENSPLSVCVDAEPWQFYSSGIMTVDQC-PS 282

Query: 341  AYTDLDHCVQLVGYNKPASGKDYWIVRNSWGASWGYSGYLYVEYGTNACGVADEA 177
             ++ LDHCVQ VGY+   S + YWIVRNSW  +WG  G++ +  GTN CG+ D A
Sbjct: 283  DFSGLDHCVQAVGYDATGS-QPYWIVRNSWNTNWGEDGFIRLALGTNTCGIGDVA 336


>ref|XP_009032801.1| hypothetical protein AURANDRAFT_18666 [Aureococcus anophagefferens]
 gb|EGB13210.1| hypothetical protein AURANDRAFT_18666 [Aureococcus anophagefferens]
          Length = 346

 Score =  300 bits (768), Expect = 7e-96
 Identities = 172/362 (47%), Positives = 212/362 (58%), Gaps = 20/362 (5%)
 Frame = -2

Query: 1202 ATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADL 1023
            A  L VP+ ++T E         SLF+ FK D+ KSY+S   E  R+++F +NL +   L
Sbjct: 4    AALLLVPAAALTDE---------SLFELFKSDYVKSYNSTEAEAERFTIFSANLRKTEAL 54

Query: 1022 NSKN---GSPSFGITPFADLTAHEFAKTHLGFKPS---LDEESQAARLNTPVFELDEDDN 861
            N++        FG+T F DLT  EF   +L + PS   L E+  AA            + 
Sbjct: 55   NAQRVDEDDAEFGVTQFMDLTEAEFKAQYLNYVPSEQVLAEDVYAA-----------PEG 103

Query: 860  MMAWGANNTLVDWR--QKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQ 687
              A G+    +DWR  Q G V+ VKDQ QCGSCWAFSATEQIES W LA     V +PQQ
Sbjct: 104  FAAPGS----LDWRTKQSGVVSDVKDQGQCGSCWAFSATEQIESEWVLAGNDPLVFAPQQ 159

Query: 686  ILSCDKTDAGCDGGDTPTAYAYVKSAGGLESESAYPYSSG-AGNTGTCKFKAASIVAKIS 510
            I+SCDK D GC+GG+T TAYAYV+ AGG+  ESAYPY SG +GNTG CK K  +    + 
Sbjct: 160  IVSCDKVDQGCNGGNTETAYAYVEKAGGMALESAYPYKSGTSGNTGRCK-KFETAGGDVE 218

Query: 509  GFSYATPPC-SGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYT 333
             FSY  P C  G C  QDE   A  +A+ GPASICVNA AWQ Y+ GV+T   CG  A  
Sbjct: 219  SFSYVVPECKKGKCNDQDEDKMAAALASHGPASICVNAGAWQTYTKGVMTNLQCGSHAAN 278

Query: 332  DLDHCVQLVGY----------NKPASGKDYWIVRNSWGASWGYSGYLYVEYGTNACGVAD 183
             LDHCVQ+VGY           K    K  W VRNSWG SWGY GY+ V+ G NACG+A+
Sbjct: 279  ALDHCVQVVGYTGYTGDAKACGKGLKDKCVWNVRNSWGTSWGYQGYIRVQMGKNACGIAN 338

Query: 182  EA 177
            +A
Sbjct: 339  DA 340


>ref|XP_004368288.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii
            str. Neff]
 gb|ELR25533.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii
            str. Neff]
          Length = 330

 Score =  251 bits (641), Expect = 5e-77
 Identities = 148/332 (44%), Positives = 187/332 (56%), Gaps = 7/332 (2%)
 Frame = -2

Query: 1154 ADEFLAKSLFDKFKLDHSKSYHSASEET-HRYSVFRSNLARIADLNSKNGSPSFGITPFA 978
            A    A+  F +F   + KSY  ASEE   R  +FR NL RI  LNS N    +G+  FA
Sbjct: 23   AGTMTAEQQFRQFAAQYGKSY--ASEEFGERLRIFRDNLDRIDALNSANTGARYGVNKFA 80

Query: 977  DLTAHEFAKTHLGFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWRQKGAVTK 798
            DLT  EF  T+L    S  ++  AA     +            G   +  DWR KGAVT 
Sbjct: 81   DLTPKEFKATYLKGARSAGQKKAAATAKLDMT-----------GPLPSQFDWRDKGAVTP 129

Query: 797  VKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKT--DAGCDGGDTPTAYA 624
             KDQ QCG  WAFS TE IES WFL+   L  L+PQQI+ CD+   D GCDGGD PTAY 
Sbjct: 130  TKDQGQCG--WAFSVTEAIESQWFLSGRKLVSLAPQQIVDCDQGNGDYGCDGGDPPTAYE 187

Query: 623  YVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFA 444
            YV  AGGL++E +YPY++     G C FK +++ AKIS ++Y T        T++E    
Sbjct: 188  YVIKAGGLDTEESYPYTA---EDGQCAFKPSAVGAKISNWTYITT-------TKNETEMQ 237

Query: 443  NNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGK----D 276
              +A++GP SICV+A +WQ Y  GV+T+  C  S    LDHCV + GY+          D
Sbjct: 238  YGLASRGPLSICVDASSWQYYIGGVITS-LCEDS----LDHCVMITGYSVQEGWDFMKYD 292

Query: 275  YWIVRNSWGASWGYSGYLYVEYGTNACGVADE 180
             W +RNSWG  WGY GYLYV+ G+N CGV DE
Sbjct: 293  VWNIRNSWGEDWGYGGYLYVQRGSNLCGVGDE 324


>gb|KYQ91485.1| hypothetical protein DLAC_08453 [Tieghemostelium lacteum]
          Length = 354

 Score =  250 bits (639), Expect = 2e-76
 Identities = 146/344 (42%), Positives = 194/344 (56%), Gaps = 8/344 (2%)
 Frame = -2

Query: 1184 PSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKN-G 1008
            P+ +     +  E + K+ FD++   H+K YH+  E   RY  F++NL +I   N+ + G
Sbjct: 28   PNQNQQDNYIQRERILKNQFDQWVEKHAKKYHTHREYLTRYQNFKNNLKKIEQQNAAHQG 87

Query: 1007 SPSFGITPFADLTAHEFAKTHL--GFKPSLDEESQAARLNTPVFE--LDEDDNMMAWGAN 840
            S  FG+  F+DL+  EF K +L   +KP+        + + PV +     D+N+      
Sbjct: 88   SAKFGMNKFSDLSEEEFTKFYLMPEYKPT--PRKSLYKKHYPVMQDAQSSDENIPL---- 141

Query: 839  NTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDA 660
            N  VDWR +G VT VKDQ  CGSCWAFSATEQIE+AW  A     +LS QQI+ CD  D 
Sbjct: 142  NLKVDWRTEGLVTPVKDQGACGSCWAFSATEQIETAWIKAGNDQVILSEQQIVDCDTNDG 201

Query: 659  GCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCS 480
            GC GGD  TA  YV  AGGL SES YPY     N GTC       VA ISG+  AT P  
Sbjct: 202  GCGGGDPHTAMDYVIKAGGLTSESQYPY---IANDGTCHTNFTP-VAHISGYYAATTP-- 255

Query: 479  GACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGY 300
                  ++   A +V  +GP SICV+A +W  YSSG++ +     +  +DLDHCVQ+VG 
Sbjct: 256  -----GNDTQLAYSVMNEGPISICVDASSWMTYSSGIIRS-----NCDSDLDHCVQIVGL 305

Query: 299  NKPASGK---DYWIVRNSWGASWGYSGYLYVEYGTNACGVADEA 177
            N   +G     Y+I+RNSWG  WG  G++YVE G + CGV  EA
Sbjct: 306  NVDTNGTTPIPYYIIRNSWGTDWGIDGFIYVEIGHDLCGVTQEA 349


>ref|XP_012756472.1| hypothetical protein SAMD00019534_028850 [Acytostelium subglobosum
            LB1]
 dbj|GAM19710.1| hypothetical protein SAMD00019534_028850 [Acytostelium subglobosum
            LB1]
          Length = 325

 Score =  248 bits (632), Expect = 1e-75
 Identities = 136/318 (42%), Positives = 183/318 (57%), Gaps = 2/318 (0%)
 Frame = -2

Query: 1127 FDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKN-GSPSFGITPFADLTAHEFAK 951
            F ++   + + Y    E   R S F SNLA I++ N+K+ G  +FG+  F+DL+  EF K
Sbjct: 26   FKQWMSKYERHYVDEKEYLIRLSNFVSNLATISEYNAKHHGRATFGLNQFSDLSIEEFRK 85

Query: 950  THLGFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGS 771
            THL + P+  + SQ  +        D   N+         VDWR KG VT VK+Q QCGS
Sbjct: 86   THLNYVPTHKKASQVRQ------HFDYPSNIPE------RVDWRAKGFVTPVKNQLQCGS 133

Query: 770  CWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESE 591
            CWAFSATEQIE+A+  A       S QQI+ CD  D GC GGD  TAY YV+SAGG+ ++
Sbjct: 134  CWAFSATEQIETAFIQAGNAQQFFSEQQIVDCDPFDGGCGGGDPMTAYQYVQSAGGITTD 193

Query: 590  SAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASI 411
            +AYPY++     GTC+    + VA+I  + YA+          +E      +AA GP SI
Sbjct: 194  TAYPYTA---QDGTCEANTTTKVAQIKTYGYAS-------TAGNETQMKEAIAALGPLSI 243

Query: 410  CVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN-KPASGKDYWIVRNSWGASWGY 234
            CV+AE W  Y SG++T          DLDHCVQ+VGY+    S   Y+IVRNSWG +WG 
Sbjct: 244  CVDAETWMTYQSGIITTDCA-----ADLDHCVQVVGYDVDTTSNIPYYIVRNSWGTTWGQ 298

Query: 233  SGYLYVEYGTNACGVADE 180
             GY+Y+  G+N CG+ +E
Sbjct: 299  EGYIYIGEGSNLCGITEE 316


>ref|XP_004335426.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff]
 gb|ELR13413.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff]
          Length = 331

 Score =  246 bits (627), Expect = 7e-75
 Identities = 145/317 (45%), Positives = 181/317 (57%), Gaps = 2/317 (0%)
 Frame = -2

Query: 1127 FDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSK-NGSPSFGITPFADLTAHEFAK 951
            F+ F   + KSY SA E   R+++F  NLA  A LN K  G   FGIT FAD++  EF  
Sbjct: 34   FNAFVQRYGKSYASAEEAEQRFAIFTQNLAETAALNIKYEGKTQFGITKFADMSQEEFQS 93

Query: 950  THLGFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWRQK-GAVTKVKDQAQCG 774
              L   P      +  R   P FE         + A +T  DWR K G VT V DQ QCG
Sbjct: 94   RVLMSNPPPPPTEKPYR--GPKFE--------GFTAPSTF-DWRNKPGVVTPVYDQGQCG 142

Query: 773  SCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLES 594
            SCWAFSATE IES W LA   L  LS QQI+ C   D GC GG    AY YV  A GL++
Sbjct: 143  SCWAFSATENIESQWALAGHKLTGLSMQQIVDCSWWDDGCGGGFPSYAYDYVIDAPGLDA 202

Query: 593  ESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPAS 414
             + YPY++     G+C FK + +VAKIS ++Y T          +E   AN +A  GP S
Sbjct: 203  LANYPYTAVG---GSCAFKESQVVAKISSWTYTT-------TDSNEHQMANYLAQHGPIS 252

Query: 413  ICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKDYWIVRNSWGASWGY 234
            +CV+AE+W  Y+ GV  A ACG    T +DHCV  VGYN  A+   YWI+RNSWG SWG 
Sbjct: 253  VCVDAESWPSYTGGVYRASACG----TSIDHCVLAVGYNLTAN-PPYWIIRNSWGTSWGL 307

Query: 233  SGYLYVEYGTNACGVAD 183
             GY+++E+GT+AC VA+
Sbjct: 308  EGYMHLEFGTDACAVAE 324


>ref|XP_013762925.1| cruzipain [Thecamonas trahens ATCC 50062]
 gb|KNC45942.1| cruzipain [Thecamonas trahens ATCC 50062]
          Length = 394

 Score =  243 bits (621), Expect = 3e-73
 Identities = 141/325 (43%), Positives = 187/325 (57%), Gaps = 8/325 (2%)
 Frame = -2

Query: 1127 FDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKN----GSPSFGITPFADLTAHE 960
            F  FK  + + Y S+  E   + VF++N  + A L + N    G   FG++PF DLT +E
Sbjct: 23   FALFKETYKRQYASSKAEAAAFEVFKTNAEKAAKLEAANKAAGGDAKFGMSPFMDLTENE 82

Query: 959  FAKTHLGFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWR--QKGAVTKVKDQ 786
            F   +L  K ++  E  AA L  PV            GA     DWR  +   +T VK+Q
Sbjct: 83   FKARYLMPKGAV--EGGAAEL--PVLRASNV------GALPKAYDWRDHKPAVITPVKNQ 132

Query: 785  AQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAG 606
             QCGSCWAFSA  ++ES W LA   L VLS QQ++ CD TD GC+GGDT +AY Y++ AG
Sbjct: 133  GQCGSCWAFSAVSEVESMWALAGHELVVLSEQQVVDCDTTDDGCNGGDTISAYHYIEKAG 192

Query: 605  GLESESAYPYSSGAGNTGTCKFKAA--SIVAKISGFSYATPPCSGACKTQDEATFANNVA 432
            GL  E  YPY++     G CK        VAKI G++YAT P      T++E   A N+ 
Sbjct: 193  GLVPEKDYPYTA---RDGKCKDSVVKKDAVAKIMGYNYATSP-----STKNETQLAANLM 244

Query: 431  AKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKDYWIVRNSW 252
            + GP SICV+A +WQ Y+SG+L+   CG      LDHCVQ+ G+    S + YW VRNSW
Sbjct: 245  STGPVSICVDASSWQTYTSGILS--HCG----KQLDHCVQITGWGTSGS-EMYWWVRNSW 297

Query: 251  GASWGYSGYLYVEYGTNACGVADEA 177
              SWG SGY+ +++G N CG+ADEA
Sbjct: 298  ATSWGMSGYIQLKFGQNTCGLADEA 322


>emb|CUI14619.1| cysteine peptidase, putative [Bodo saltans]
          Length = 466

 Score =  240 bits (613), Expect = 4e-71
 Identities = 137/348 (39%), Positives = 184/348 (52%), Gaps = 2/348 (0%)
 Frame = -2

Query: 1232 VLLGLSAFASATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVF 1053
            +L  L AFA+ + +   +D++           ++ F+ FK  H KSY + SEET+R +VF
Sbjct: 7    LLCALIAFAAVSSVSATTDAL-----------RASFESFKAKHGKSYATPSEETYRLTVF 55

Query: 1052 RSNLARIADLNSKNGSPSFGITPFADLTAHEFAKTHLGFKPSLDEESQAARLNTPVFELD 873
              N+ +   LN+KN    FG +PFAD+T  EF   H G K       +     T  +   
Sbjct: 56   AENIRKAEILNAKNPQARFGASPFADMTETEFKSYHNGDKYFSARVQELKSDKTTYYPRY 115

Query: 872  EDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSP 693
             D  + A   N    DWR +GAVT VK+Q QCGSCWAFS T  +E  W LA  TL  LS 
Sbjct: 116  TDAQVKAAPTNK---DWRTEGAVTAVKNQGQCGSCWAFSTTGGVEGQWQLAGNTLVSLSE 172

Query: 692  QQILSCDKTDAGCDGGDTPTAYAYV--KSAGGLESESAYPYSSGAGNTGTCKFKAASIVA 519
            QQ++SCD  D+GC+GG    AY ++     G   SE++YPY SG G    C     +  A
Sbjct: 173  QQLVSCDTVDSGCNGGLMNNAYEWILANKGGEFVSEASYPYVSGGGTAPACDATQGTNAA 232

Query: 518  KISGFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSA 339
            KI+G               DE      +   GP S+ ++A AWQ+Y  GV+T   CGGSA
Sbjct: 233  KITGHYNI---------YHDEDQMKAWIGENGPLSLAIDASAWQMYMGGVMT--TCGGSA 281

Query: 338  YTDLDHCVQLVGYNKPASGKDYWIVRNSWGASWGYSGYLYVEYGTNAC 195
               LDH V +VGY        YWI +NSWGASWG +GY+YV +G++ C
Sbjct: 282  ---LDHGVLIVGYQFENQATPYWIFKNSWGASWGEAGYIYVAFGSDQC 326


>ref|XP_012759759.1| hypothetical protein SAMD00019534_034130, partial [Acytostelium
            subglobosum LB1]
 dbj|GAM20238.1| hypothetical protein SAMD00019534_034130, partial [Acytostelium
            subglobosum LB1]
          Length = 338

 Score =  236 bits (601), Expect = 6e-71
 Identities = 141/357 (39%), Positives = 192/357 (53%), Gaps = 4/357 (1%)
 Frame = -2

Query: 1238 LLVLLGLSAFASATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYS 1059
            L+  L   AF     + +P  +   E    E++A   FDK  +D ++ Y        R S
Sbjct: 9    LVATLTTLAFVEVNAVRLPGRTRNYEQQFREWMAD--FDKVYVDDAEYYR-------RLS 59

Query: 1058 VFRSNLARIADLNSKN-GSPSFGITPFADLTAHEFAKTHLGFKPSLDEESQAARLNTPVF 882
             F +NL  IA  N  + G  +FG+  FADL+  EF   +L F+     + +   ++ P  
Sbjct: 60   NFITNLGTIARNNRMHKGRATFGVNKFADLSMEEFKSYYLNFETDRTPKREPTNVSYP-- 117

Query: 881  ELDEDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPV 702
                  N+ +       VDWRQKG VT VK+Q QCGSCWAFSA EQIESA+ +      +
Sbjct: 118  -----SNIPSQ------VDWRQKGYVTPVKNQEQCGSCWAFSAAEQIESAYIMLGNEAQI 166

Query: 701  LSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIV 522
             S QQI+ CD  D GC GGDT TAY YV++AGGL + ++YPY++     GTC        
Sbjct: 167  ASEQQIVDCDSFDGGCGGGDTMTAYKYVETAGGLTTNASYPYTA---QDGTCYANKTKKF 223

Query: 521  AKISGFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGS 342
             K++ ++YA+          +E      +AA GP SICV+A +W  Y SG++T+  CG  
Sbjct: 224  VKVTNYNYAS-------SQGNETQLKEAIAALGPLSICVDAISWMTYQSGIITSN-CG-- 273

Query: 341  AYTDLDHCVQLVGYNKPAS---GKDYWIVRNSWGASWGYSGYLYVEYGTNACGVADE 180
               DLDHCVQLVGY   +S      Y+IVRNSWG  WG  GY+Y+  G N CG+ DE
Sbjct: 274  --NDLDHCVQLVGYAIESSVTPNIPYYIVRNSWGLDWGQEGYIYIGEGQNLCGITDE 328


>ref|XP_004363040.1| hypothetical protein DFA_03437 [Cavenderia fasciculata]
 gb|EGG25189.1| hypothetical protein DFA_03437 [Cavenderia fasciculata]
          Length = 341

 Score =  233 bits (594), Expect = 8e-70
 Identities = 133/331 (40%), Positives = 186/331 (56%), Gaps = 4/331 (1%)
 Frame = -2

Query: 1160 NVADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKNG-SPSFGITP 984
            + AD++  +  F  + ++H+K YH   E   R S F  N+  I  +N + G + +FG+  
Sbjct: 23   STADDYTTR--FKTWMVEHNKMYHEEEEFYLRLSNFIRNIHSIEKMNRQYGRTATFGLNK 80

Query: 983  FADLTAHEFAKTHL--GFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWRQKG 810
            F+DL+  EF K +L   +KP        AR+    F      N+ A       +DWR KG
Sbjct: 81   FSDLSLDEFKKHYLMPNYKPK-------ARVTKETFNYPS--NIPA------TLDWRTKG 125

Query: 809  AVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTA 630
             VT VK+Q  CGSCWAFSATEQIE+A  +A G +  LS QQI+ CD  D GC GGD  TA
Sbjct: 126  YVTPVKNQLMCGSCWAFSATEQIETANIMAGGQVEYLSEQQIVDCDPYDGGCGGGDPYTA 185

Query: 629  YAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEAT 450
            Y YV++ GGL     YPY++     G C   + +   +++ F YA+          +E  
Sbjct: 186  YQYVQNNGGLTLNVTYPYTAA---NGACYANSTAPAVQVTAFGYAS-------SQGNETQ 235

Query: 449  FANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGK-DY 273
                +AA+GP SICVNAE W  Y SG+ ++     +   DLDHCVQ+VGY+  A+ K  Y
Sbjct: 236  LREAMAARGPLSICVNAEPWMSYQSGIFSS-----TCSDDLDHCVQIVGYDTDATSKTPY 290

Query: 272  WIVRNSWGASWGYSGYLYVEYGTNACGVADE 180
            +IVRNSWG  WG  GY+Y++ G+N CG+ +E
Sbjct: 291  FIVRNSWGTDWGLLGYIYIQAGSNLCGITNE 321


>gb|KOO53669.1| cathepsin l-like protease [Chrysochromulina sp. CCMP291]
          Length = 345

 Score =  229 bits (585), Expect = 2e-68
 Identities = 141/359 (39%), Positives = 194/359 (54%), Gaps = 7/359 (1%)
 Frame = -2

Query: 1232 VLLGLSAFASATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKS--YHSASEETHRYS 1059
            V++ LS  ++A + P     +  +    +F+      KF  D      Y SA+E   R++
Sbjct: 5    VVVALSIISTAAQRPADPTDMHLDPAFPQFM------KFMTDFRNGVPYSSAAETLGRFT 58

Query: 1058 VFRSNLARIADLNSKNGSPSFGITPFADLTAHEFAKTHLGFKPSLDEESQAARLNTPVFE 879
             F++NL  I + N+K G  + GIT FADLT  EF   +L  +P     + A R       
Sbjct: 59   AFKANLQLIGERNAK-GQETHGITKFADLTREEFKAQYLTLRPPT---ANALR------S 108

Query: 878  LDEDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVL 699
            + + D+++         DW  KGA T VK+Q QCGSCWAFSATEQ+ES ++   G L  L
Sbjct: 109  MKQLDHLVQANYTAASTDWCAKGACTPVKNQGQCGSCWAFSATEQLESQYYQTYGKLIEL 168

Query: 698  SPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESESAYPYSSG-AGNTGTCKFKAASIV 522
            SPQQ+ SCD    GC+GG+   A+ YV S GG ESES YPY SG    TG+C  K A + 
Sbjct: 169  SPQQLTSCDPNCGGCNGGNPINAWIYVNSFGGQESESDYPYVSGVTKQTGSCSSKIAEVT 228

Query: 521  AKIS---GFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKA- 354
              +    G+  A  P         E+     +    P SI V+AE WQ Y+ G++  K+ 
Sbjct: 229  EAVGADVGYFIAQRPA-------QESNMLKQIGL-SPMSIAVDAELWQTYTGGIIGPKSG 280

Query: 353  CGGSAYTDLDHCVQLVGYNKPASGKDYWIVRNSWGASWGYSGYLYVEYGTNACGVADEA 177
            CG    T +DH VQ+ GYN  A G +YWIVRNSWG +WG SG++Y+ YG N CG+  +A
Sbjct: 281  CG----TTIDHAVQVTGYN--AEG-NYWIVRNSWGPNWGESGFVYLTYGDNVCGITSQA 332


>ref|XP_020433674.1| hypothetical protein PPL_05546 [Heterostelium album PN500]
 gb|EFA81557.1| hypothetical protein PPL_05546 [Heterostelium album PN500]
          Length = 341

 Score =  229 bits (584), Expect = 2e-68
 Identities = 138/352 (39%), Positives = 194/352 (55%), Gaps = 2/352 (0%)
 Frame = -2

Query: 1229 LLGLSAFASATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFR 1050
            L+ L A AS   + + ++S  G + A +F  +  F ++   H KSY   SE   R S + 
Sbjct: 8    LVCLVAIASVDAIRIQNNS--GFHRARDFEGE--FRQWMTKHEKSYADDSEYYLRLSHYI 63

Query: 1049 SNLARIADLNSKN-GSPSFGITPFADLTAHEFAKTHLGFKPSLDEESQAARLNTPVFELD 873
             NL  +AD N K+ G   F    F+DL+  EF   +L + P+   + ++ + N      D
Sbjct: 64   KNLRTVADYNKKHAGMAKFAPNKFSDLSIEEFRAGYLNYVPNKLIKDRSTKQN-----FD 118

Query: 872  EDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSP 693
               N+         +DWRQKG VT VK+Q QCGSCWAFSA EQIE+A+ +A      +S 
Sbjct: 119  YPANIPV------SLDWRQKGFVTPVKNQEQCGSCWAFSAGEQIETAYIMAGNAAQNVSE 172

Query: 692  QQILSCDKTDAGCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKI 513
            QQI+ CD  D GC GGD  TAY YV+SAGG+ + + YPY++     GTC  +      +I
Sbjct: 173  QQIVDCDPYDGGCGGGDPMTAYQYVQSAGGITTNTDYPYTA---TDGTCYAQNTPKFTQI 229

Query: 512  SGFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYT 333
            + + YA+          +E      +AA+GP SICV+AE W  Y SGVL +     +   
Sbjct: 230  ASYGYAS-------NKGNETELKQAIAARGPLSICVDAETWMNYQSGVLNS-----NCPD 277

Query: 332  DLDHCVQLVGYN-KPASGKDYWIVRNSWGASWGYSGYLYVEYGTNACGVADE 180
            +LDHCVQ+VGY+ + ++   Y+IVRNSWG  WG  GY+ V  G N CG+ DE
Sbjct: 278  ELDHCVQIVGYDVEQSTNTPYYIVRNSWGTDWGMEGYILVGEGQNLCGITDE 329


>ref|XP_003293312.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
 gb|EGC30167.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
          Length = 352

 Score =  227 bits (579), Expect = 2e-67
 Identities = 134/334 (40%), Positives = 185/334 (55%), Gaps = 13/334 (3%)
 Frame = -2

Query: 1139 AKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKN-GSPSFGITPFADLTAH 963
            +K LF  +   + K Y ++ E   R+S F++NL +I +LN+ + G  SFG+  ++DL+  
Sbjct: 35   SKDLFHHWTKQNGKIYETSEEFEKRFSNFKTNLKKIENLNNLHKGKASFGMNKYSDLSEE 94

Query: 962  EFAKTHL--GFKPSLDEESQAARL------NTPVFELDEDDNMMAWGANNTLVDWRQKGA 807
            EF+  +L   FK   +EE    +       N     L+ DD + A       VDWR KG 
Sbjct: 95   EFSNFYLMKNFKGKPEEERDYIKKPENPSSNLIGGYLNTDDGLKAMYQ----VDWRNKGL 150

Query: 806  VTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAY 627
            VT VKDQ QCGSC+ FSATEQIES +  A     +LS QQ + CD  D GC GGD    Y
Sbjct: 151  VTPVKDQGQCGSCYIFSATEQIESEYIRAGHKAILLSEQQSVDCDTMDGGCGGGDPANVY 210

Query: 626  AYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATF 447
             Y+ SAGG+ +E  YPY++     GTC F     V+ I+GF Y T       +  DE T 
Sbjct: 211  NYIISAGGVSTEKDYPYTA---QDGTC-FNTTRAVS-ITGFQYVT-------QNSDEDTL 258

Query: 446  ANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN----KPASGK 279
               +A  GP SICV+A  WQ Y+ G++T          ++DHCVQ+VG +     P++  
Sbjct: 259  ITTIANHGPVSICVDASTWQSYTGGIITT-----GCEQNIDHCVQVVGLDIDKTDPSNPI 313

Query: 278  DYWIVRNSWGASWGYSGYLYVEYGTNACGVADEA 177
             Y+I+RNSWG SWG  GY+YV  G+N CG+  E+
Sbjct: 314  PYYIIRNSWGTSWGDKGYIYVAQGSNLCGITYES 347


>gb|KMZ58469.1| Cysteine proteinase cathepsin F [Zostera marina]
          Length = 377

 Score =  226 bits (577), Expect = 8e-67
 Identities = 142/351 (40%), Positives = 180/351 (51%), Gaps = 26/351 (7%)
 Frame = -2

Query: 1163 ENVADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKNGSPSFGITP 984
            EN  D+ L KS F  F   +SKSY +  E  HRY +FRSN  R       + S + GIT 
Sbjct: 45   ENEEDDHLLKSEFTSFVSRYSKSYETTEEHDHRYKIFRSNFRRAQRNQVLDPSATHGITK 104

Query: 983  FADLTAHEFAKTHLGFK---PSLDEESQAARLNT------PVFELDEDDNMMAWGANNTL 831
            F+DLT  EF   +LG K   PS+ +++  A + T      P  +L ED            
Sbjct: 105  FSDLTTEEFESQYLGLKKPKPSIFQKTNPASIGTHEAATLPTTDLPED------------ 152

Query: 830  VDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCD------- 672
             DWR  GAVT VKDQ  CGSCW+FSA   +E A +LA G L  LS QQ++ CD       
Sbjct: 153  FDWRDLGAVTPVKDQGVCGSCWSFSAAAALEGANYLATGKLIGLSEQQMVDCDHVCDPTD 212

Query: 671  --KTDAGCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSY 498
                DAGC+GG    A++Y+  +GGLESE  YPY+   G+ GTCKF  + I A ++ FS 
Sbjct: 213  SRSCDAGCNGGLMTNAFSYLMQSGGLESEKDYPYT---GSDGTCKFDKSKIAASVANFSV 269

Query: 497  ATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHC 318
                      + DE   A N+   GP ++ +NA   Q Y  GV     C  +    LDH 
Sbjct: 270  I---------SSDEDQIAANLVKYGPLAVGINAAFMQTYIGGVSCPYICFKNY---LDHG 317

Query: 317  VQLVGYNKPASG--------KDYWIVRNSWGASWGYSGYLYVEYGTNACGV 189
            V LVGY   ASG        K YWI++NSWG SWG  GY  +  G N CGV
Sbjct: 318  VLLVGYG--ASGYSQLRFKNKPYWIIKNSWGDSWGEDGYYKICRGNNICGV 366


>ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
 gb|EGC38873.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
          Length = 346

 Score =  223 bits (569), Expect = 5e-66
 Identities = 139/340 (40%), Positives = 184/340 (54%), Gaps = 20/340 (5%)
 Frame = -2

Query: 1142 LAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSK----NGSPSFGITPFAD 975
            + ++ F  F+  ++K Y S++E + ++  F++NL  IA LN K         FG+  FAD
Sbjct: 24   IEQTQFVAFQQKYNKVY-SSNEYSAKFETFKANLGVIAQLNQKAKLHKSDTKFGVNEFAD 82

Query: 974  LTAHEFAKTHLGFKPSLDEES--QAARLNTPVFELDEDDNMMAWGANNTLVDWRQKGAVT 801
            L+A EF K +L  + +  + S   A  L   V E              T  DWR KGAVT
Sbjct: 83   LSAAEFRKYYLNAQVAKPDASLPMAPLLTEEVLETIP-----------TAFDWRTKGAVT 131

Query: 800  KVKDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCD----------KTDAGCD 651
             VK+Q QCGSCW+FS T  IE  W+LA  TL  LS Q ++ CD            DAGCD
Sbjct: 132  GVKNQGQCGSCWSFSTTGNIEGQWYLAGNTLVGLSEQNLVDCDHQCMEYDGQKSCDAGCD 191

Query: 650  GGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGAC 471
            GG  P AY YV   GGL+SE++YPY +  G+  +CKFK+ ++ AKIS F+          
Sbjct: 192  GGLQPNAYRYVIENGGLDSENSYPYLAVTGD--SCKFKSGNVAAKISNFTMI-------- 241

Query: 470  KTQDEATFANNVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYN-- 297
              Q+E   A  +A  GP +I  +A  WQ Y  GV     CG S    LDH + +VG++  
Sbjct: 242  -PQNETQMAGYLATHGPLAIAADAAEWQFYIGGVFDL-PCGQS----LDHGILIVGFSAE 295

Query: 296  KPASG--KDYWIVRNSWGASWGYSGYLYVEYGTNACGVAD 183
            K   G  K YWIV+NSWGASWG  GYLY+  G N CGV+D
Sbjct: 296  KNIFGHLKPYWIVKNSWGASWGEQGYLYLGKGKNLCGVSD 335


>gb|OEU13361.1| cysteine proteinase [Fragilariopsis cylindrus CCMP1102]
          Length = 368

 Score =  223 bits (569), Expect = 9e-66
 Identities = 127/359 (35%), Positives = 182/359 (50%), Gaps = 50/359 (13%)
 Frame = -2

Query: 1106 HSKSYHSASEETHRYSVFRSNLARIADLNSKNGSPS------FGITPFADLTAHEF-AKT 948
            H+K+YHS  E+ HR+S++  N AR A+ N ++G  +      FG   F DL   EF AK 
Sbjct: 4    HNKAYHSEEEKQHRFSIWSQNHARTAEKNRRHGPCTLTKQHVFGSNHFKDLAPEEFQAKF 63

Query: 947  HLGFKPSLDEESQAARLNTP--VFELDEDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCG 774
              G+K +  +  +  R   P  +  L +D  +    A    VDWR  GA++ ++ Q +CG
Sbjct: 64   LTGYKGAFTDVLEDKRQEQPPDIRRLRKDSGIYDADAFPNSVDWRDSGAISDIRTQGECG 123

Query: 773  SCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYVKSAGGLES 594
            +CWA +A E++ESA FL+ GTL  LS  +I+ CD +   C GG    A+ +V   GGL  
Sbjct: 124  ACWAVTAVEEVESAVFLSTGTLYALSESEIIVCDDSCEMCSGGWPQNAFEWVMDHGGLPL 183

Query: 593  ESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPC------SGA-CKTQDEATFANNV 435
            +S++PY +      T  +        I G+ YAT  C      SG  C+ QDE T  NN+
Sbjct: 184  QSSFPYDAYTLIALTADYSNQGRYGNIRGYGYATDRCLCYSDGSGCDCEDQDEDTAINNI 243

Query: 434  AAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGY--------------- 300
            A  GP+ +C+ A  WQ Y  G++T+ +     + D++HCVQ+VGY               
Sbjct: 244  ATYGPSVVCLEASTWQDYGGGIITSDSGCAQTFLDMNHCVQVVGYAFTTGSSDCNDSNDE 303

Query: 299  ----NKPASGKD---------------YWIVRNSWGASWGYSGYLYVEYGTNACGVADE 180
                N   SG D               YWIVRN WG SWG +GY YV  GTN CG+ ++
Sbjct: 304  GCDSNDENSGSDSGSNSGSGDSNGREGYWIVRNQWGDSWGMNGYAYVSMGTNTCGILND 362


>gb|OQR72774.1| cathepsin L-like [Tropilaelaps mercedesae]
          Length = 331

 Score =  222 bits (565), Expect = 1e-65
 Identities = 144/367 (39%), Positives = 187/367 (50%), Gaps = 10/367 (2%)
 Frame = -2

Query: 1247 MKSLLVLLGLSAFASATRLPVPSDSITGENVADEFLAKSLFDKFKLDHSKSYHSASEETH 1068
            M SL+VLL +   A A R+P P              A+  + +F+  H K YH  SEE  
Sbjct: 1    MHSLIVLLAVVGAALAVRVPRPD-------------AEHHWAEFRRTHQKQYHG-SEELQ 46

Query: 1067 RYSVFRSNLARIADLNSKNGSPS---FGITPFADLTAHEFAKTHLGFKPSLDEESQAARL 897
            R  +F  NL  I + N  N S +    GI  FAD+T  EF KT LG + S +  S A   
Sbjct: 47   RRFIFEDNLYIIQEFNRVNASEAGFRLGINQFADMTNEEFRKTFLGHRYSANHVSHA--- 103

Query: 896  NTPVFELDEDDNMMAWGANN--TLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFL 723
                     D    A G  N    VDW  KG VT VK+Q QCGSCWAFS T  +E   F 
Sbjct: 104  ---------DSTFEATGIQNLPAKVDWTTKGYVTPVKNQGQCGSCWAFSTTGSLEGQHFK 154

Query: 722  AKGTLPVLSPQQILSCDKTDA--GCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGNTGT 549
              G L  LS Q ++ C       GC+GG    A+ Y+K+ GG+++E +YPYS+     G 
Sbjct: 155  KTGKLVSLSEQNLIDCSDAQGNNGCNGGLMDLAFDYIKANGGIDTEQSYPYSA---VDGI 211

Query: 548  CKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEA--WQLYSS 375
            C+FK  +I AK++G+           K  DE+     VA  GP SI ++A +  +QLYSS
Sbjct: 212  CEFKKRAIGAKVTGYV--------DIKNGDESALKEAVATVGPVSIAIDASSPHFQLYSS 263

Query: 374  GVLTAKACGGSAYTDLDHCVQLVGYNKPASGKDYWIVRNSWGASWGYSGYL-YVEYGTNA 198
            GV TA  C      +LDH V  VGY     GKDYW+V+NSWG SWG  GY+  +    N 
Sbjct: 264  GVYTASDCSS---VELDHGVLAVGYGH-EDGKDYWLVKNSWGTSWGIDGYIKMIRNKDNR 319

Query: 197  CGVADEA 177
            CG+A +A
Sbjct: 320  CGIATQA 326


>ref|XP_020235829.1| LOW QUALITY PROTEIN: cysteine proteinase 15A-like [Cajanus cajan]
          Length = 351

 Score =  221 bits (563), Expect = 4e-65
 Identities = 147/369 (39%), Positives = 190/369 (51%), Gaps = 18/369 (4%)
 Frame = -2

Query: 1241 SLLVLLGLSAFASATRLPVPSDSITGENVADEFL-AKSLFDKFKLDHSKSYHSASEETHR 1065
            SLL LL L+A A+  R  VP      E   D  L A+  F  FK    KSY +  E  HR
Sbjct: 5    SLLALLLLAAVAAXIRQVVPG----AEPEEDHLLNAEHHFSTFKARFGKSYATKEEHDHR 60

Query: 1064 YSVFRSNLARIADLNSK-NGSPSFGITPFADLTAHEFAKTHLGFKP-SLDEESQAARLNT 891
            + VF SNL R A L++K + S   G+T F+DLT  EF +  LG KP  L   +Q A +  
Sbjct: 61   FGVFESNLRR-ARLHAKLDPSAVHGVTKFSDLTPAEFRRQFLGLKPLRLPAHAQNAPV-L 118

Query: 890  PVFELDEDDNMMAWGANNTLVDWRQKGAVTKVKDQAQCGSCWAFSATEQIESAWFLAKGT 711
            P  +L +D             DWR KGAVT VKDQ  CGSCW+FS T  +E A +LA G 
Sbjct: 119  PTKDLPKD------------FDWRDKGAVTNVKDQGSCGSCWSFSTTGALEGAHYLATGE 166

Query: 710  LPVLSPQQILSCDKT---------DAGCDGGDTPTAYAYVKSAGGLESESAYPYSSGAGN 558
            L   S QQ++ CD           DAGC+GG    A+ Y+  +GG++ E  YPY+   G 
Sbjct: 167  LLSFSEQQLVDCDHVCDPEEYGACDAGCNGGLMNNAFEYILESGGIQLEKDYPYT---GR 223

Query: 557  TGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFANNVAAKGPASICVNAEAWQLYS 378
             GTCKF  + +VA +S +S           + DE   A N+   GP ++ +NA   Q Y 
Sbjct: 224  DGTCKFDKSKVVATVSNYSVV---------SLDEDQIAANLVKNGPLAVGINAVYMQTYI 274

Query: 377  SGVLTAKACGGSAYTDLDHCVQLVGYNKPA------SGKDYWIVRNSWGASWGYSGYLYV 216
             GV     CG     +LDH V LVGY + A        K YWI++NSWG +WG +GY  +
Sbjct: 275  GGVSCPYICG----KNLDHGVLLVGYGEGAYAPIRFKEKPYWILKNSWGENWGENGYYKI 330

Query: 215  EYGTNACGV 189
              G N CGV
Sbjct: 331  CRGRNVCGV 339


>gb|AAF75546.1| cruzipain [Trypanosoma cruzi]
          Length = 467

 Score =  224 bits (571), Expect = 6e-65
 Identities = 136/328 (41%), Positives = 183/328 (55%), Gaps = 2/328 (0%)
 Frame = -2

Query: 1154 ADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKNGSPSFGITPFAD 975
            A+E LA S F +FK  H + Y SA+EE  R SVFR NL       + N   +FG+TPF+D
Sbjct: 30   AEETLA-SQFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSD 88

Query: 974  LTAHEFAKTHLGFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWRQKGAVTKV 795
            LT  EF   +           + AR+  PV       N+   GA    VDWR +GAVT V
Sbjct: 89   LTREEFRSRYHNGAAHFAAAQERARV--PV-------NVEVVGAP-AAVDWRARGAVTAV 138

Query: 794  KDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYV- 618
            KDQ QCGSCWAFSA   +E  WFLA   L  LS Q ++SCDKTD+GC GG    A+ ++ 
Sbjct: 139  KDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCGGGLMNNAFGWIV 198

Query: 617  -KSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFAN 441
             ++ G + +E++YPY+SG G +  C     ++ A I+G  +   P       QDEA  A 
Sbjct: 199  QENNGAVYTENSYPYASGEGISPPCTTSGHTVGATITG--HVELP-------QDEAQIAA 249

Query: 440  NVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKDYWIVR 261
             +A  GP ++ V+A +W  Y+ GV+T+          LDH V LVGYN  A+   YWI++
Sbjct: 250  WLAVNGPVAVAVDASSWMTYTGGVMTS-----CVSEQLDHGVLLVGYNDSAA-VPYWIIK 303

Query: 260  NSWGASWGYSGYLYVEYGTNACGVADEA 177
            NSW A WG  GY+ +  G+N C V +EA
Sbjct: 304  NSWTAQWGEDGYIRIAKGSNQCLVKEEA 331


>gb|AFA34858.1| cathepsin L-like protein [Trypanosoma dionisii]
          Length = 467

 Score =  223 bits (569), Expect = 1e-64
 Identities = 134/328 (40%), Positives = 174/328 (53%), Gaps = 2/328 (0%)
 Frame = -2

Query: 1154 ADEFLAKSLFDKFKLDHSKSYHSASEETHRYSVFRSNLARIADLNSKNGSPSFGITPFAD 975
            A+E LA S F  FK  + + Y SA+EE  R SVFR NL       + N   +FG+TPF+D
Sbjct: 30   AEETLA-SQFADFKQRYGRVYKSAAEEAFRLSVFRKNLLDAKLHAAANPHATFGVTPFSD 88

Query: 974  LTAHEFAKTHLGFKPSLDEESQAARLNTPVFELDEDDNMMAWGANNTLVDWRQKGAVTKV 795
            LT  EF   H           + AR+   V            G     VDWR +GAVT V
Sbjct: 89   LTREEFRSRHHSGAAHFAAGRKRARVPVDV----------GVGDAPAAVDWRDRGAVTPV 138

Query: 794  KDQAQCGSCWAFSATEQIESAWFLAKGTLPVLSPQQILSCDKTDAGCDGGDTPTAYAYV- 618
            KDQ QCGSCWAFSA   +E  WFLA   L  LS Q ++SCD  D+GCDGG   +A+ ++ 
Sbjct: 139  KDQGQCGSCWAFSAIGNVEGQWFLAGNALTSLSEQMLVSCDTMDSGCDGGLMNSAFEWIV 198

Query: 617  -KSAGGLESESAYPYSSGAGNTGTCKFKAASIVAKISGFSYATPPCSGACKTQDEATFAN 441
                G + +E +Y Y+SG G    C+    ++ A I+G     P         DEA  A 
Sbjct: 199  EHHNGTVYTEESYRYASGDGIAQPCRTSGRTVGAVITGHVKLPP---------DEAKMAT 249

Query: 440  NVAAKGPASICVNAEAWQLYSSGVLTAKACGGSAYTDLDHCVQLVGYNKPASGKDYWIVR 261
             +AA GP ++ V+A +W  Y+ GVLT+         +LDH V LVGYN  A+   YWIV+
Sbjct: 250  WLAANGPLAVAVDASSWMFYTGGVLTS-----CVSNELDHGVLLVGYNDSAA-PPYWIVK 303

Query: 260  NSWGASWGYSGYLYVEYGTNACGVADEA 177
            NSWG  WG  GY+ +  GTN C V +EA
Sbjct: 304  NSWGTLWGEDGYVRIAKGTNQCLVKEEA 331


Top