BLASTX nr result

ID: Ophiopogon21_contig00046332

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon21_contig00046332
         (716 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004993594.1| hypothetical protein PTSG_05727 [Salpingoeca...   220   8e-55
ref|XP_004997373.1| hypothetical protein PTSG_11722 [Salpingoeca...   215   2e-53
ref|XP_004358308.1| papain family cysteine protease subfamily pr...   212   2e-52
ref|XP_001748864.1| hypothetical protein [Monosiga brevicollis M...   209   2e-51
ref|XP_004341646.1| papain family cysteine protease containing p...   191   4e-46
ref|XP_001022365.1| papain family cysteine protease [Tetrahymena...   169   2e-39
ref|XP_001012928.1| papain family cysteine protease [Tetrahymena...   157   8e-36
ref|XP_009309215.1| cysteine proteinase [Trypanosoma grayi] gi|6...   155   3e-35
ref|XP_005770544.1| hypothetical protein EMIHUDRAFT_461333 [Emil...   155   3e-35
ref|XP_001019547.3| papain family cysteine protease [Tetrahymena...   154   6e-35
ref|XP_001015624.1| papain family cysteine protease [Tetrahymena...   150   8e-34
ref|XP_009033689.1| hypothetical protein AURANDRAFT_11974, parti...   147   7e-33
gb|ABH06549.2| cathepsin L cysteine protease ICP1 [Ichthyophthir...   145   3e-32
gb|EKF26898.1| cysteine proteinase, putative [Trypanosoma cruzi ...   143   1e-31
gb|KPA83885.1| putative cysteine proteinase [Leptomonas pyrrhoco...   143   1e-31
ref|XP_809860.1| cysteine proteinase [Trypanosoma cruzi strain C...   140   1e-30
gb|KPI82686.1| putative cysteine proteinase [Leptomonas seymouri]     139   1e-30
ref|XP_001015629.1| papain family cysteine protease [Tetrahymena...   139   1e-30
ref|XP_811957.1| cysteine proteinase [Trypanosoma cruzi strain C...   138   3e-30
gb|ESS61639.1| cysteine proteinase [Trypanosoma cruzi Dm28c]          138   3e-30

>ref|XP_004993594.1| hypothetical protein PTSG_05727 [Salpingoeca rosetta]
           gi|326428462|gb|EGD74032.1| hypothetical protein
           PTSG_05727 [Salpingoeca rosetta]
          Length = 398

 Score =  220 bits (560), Expect = 8e-55
 Identities = 111/233 (47%), Positives = 145/233 (62%), Gaps = 1/233 (0%)
 Frame = -2

Query: 700 MRLFAALLIGLAACVNSASAKTLWHQLEDYTFEDYLAEYGKNYSGSEYHQ-RKAIFEDRL 524
           MR    +   +A    + SAKTLWHQL+ Y+FE + AEYGK Y  SE H  R+ +FE  L
Sbjct: 26  MRAVVVVAALVALVATTVSAKTLWHQLDGYSFEHFKAEYGKRYLSSEEHDFRRQVFERTL 85

Query: 523 AEIQKHNTDPSQSYKKGVNHLSDLTDDEFTRMLGYKKHVTFSSTTRVXXXXXXXXXXXXX 344
           A ++ HN+DP++++K+G+NH+SD TD EF R+LGY K + +S                  
Sbjct: 86  ASVKAHNSDPTKTWKQGINHMSDWTDGEFKRLLGYDKGIGYSLHRPTPPGFKANVDVNGL 145

Query: 343 XXXXXXXXSRDWRKSGVVTAVKDQGQCGSCWSFGTTETIESHYAIKTGHLAVLSEQQILD 164
                     DWR   VVTAVKDQGQCGSCWSFG+ ET+ESH A++TG L VLSEQ ILD
Sbjct: 146 PDSV------DWRTKHVVTAVKDQGQCGSCWSFGSAETLESHVAVQTGTLEVLSEQNILD 199

Query: 163 CTPNPQQXXXXXXXXXXTPELAYNRLIKLGGHASEWTYPYRSYFGNNFKSCSF 5
           CTPNP++          T E+AY  + K GG  +EWTYPY S++G+N+K C F
Sbjct: 200 CTPNPEECGGTGGCQGGTAEIAYEHMAKHGGLQTEWTYPYLSWYGDNYK-CHF 251


>ref|XP_004997373.1| hypothetical protein PTSG_11722 [Salpingoeca rosetta]
           gi|326435242|gb|EGD80812.1| hypothetical protein
           PTSG_11722 [Salpingoeca rosetta]
          Length = 372

 Score =  215 bits (548), Expect = 2e-53
 Identities = 113/235 (48%), Positives = 148/235 (62%), Gaps = 2/235 (0%)
 Frame = -2

Query: 700 MRLFAALLIGLAACVNSASAK-TLWHQLEDYTFEDYLAEYGKNYSGSEYHQ-RKAIFEDR 527
           M+L   L + + A   +A+AK TLWHQL+ Y+FED+  E+GK Y+  E H+ R++IFE  
Sbjct: 2   MKLCGVLALVMLASAGAATAKKTLWHQLDAYSFEDFKLEFGKTYASHEEHEYRRSIFEQT 61

Query: 526 LAEIQKHNTDPSQSYKKGVNHLSDLTDDEFTRMLGYKKHVTFSSTTRVXXXXXXXXXXXX 347
           LA ++ HN D S+++K+GVNH+SD TD+EF R+LGY K V FS                 
Sbjct: 62  LATVKAHNRDESKTWKQGVNHMSDWTDEEFKRLLGYSKDVGFSLHRPTPPDFKSNVDLES 121

Query: 346 XXXXXXXXXSRDWRKSGVVTAVKDQGQCGSCWSFGTTETIESHYAIKTGHLAVLSEQQIL 167
                      DWR   VVT VKDQGQCGSCWSF + ET+ES  A+  G+L VLSEQ IL
Sbjct: 122 LPDSV------DWRTKHVVTTVKDQGQCGSCWSFASAETLESQVAVNKGYLEVLSEQNIL 175

Query: 166 DCTPNPQQXXXXXXXXXXTPELAYNRLIKLGGHASEWTYPYRSYFGNNFKSCSFN 2
           DCTPNPQ+          T ELAY +++K GG  +EWTYPY S+ G+N+K CSF+
Sbjct: 176 DCTPNPQECGGFGGCQGGTAELAYAQMVKNGGLQTEWTYPYISWKGDNYK-CSFD 229


>ref|XP_004358308.1| papain family cysteine protease subfamily protein [Acanthamoeba
           castellanii str. Neff] gi|440804881|gb|ELR25744.1|
           papain family cysteine protease subfamily protein
           [Acanthamoeba castellanii str. Neff]
          Length = 383

 Score =  212 bits (539), Expect = 2e-52
 Identities = 111/238 (46%), Positives = 146/238 (61%), Gaps = 7/238 (2%)
 Frame = -2

Query: 694 LFAALLIGLAACVNSASAK------TLWHQLEDYTFEDYLAEYGKNYSGSEYHQ-RKAIF 536
           L  AL +   A + SASA       T WH LE Y+FE Y+ E+ K Y+  E  + R+A+F
Sbjct: 6   LLLALCLASVALLASASATSADRRTTPWHALEGYSFEAYVKEFNKVYASLEEREARRAVF 65

Query: 535 EDRLAEIQKHNTDPSQSYKKGVNHLSDLTDDEFTRMLGYKKHVTFSSTTRVXXXXXXXXX 356
           E RLA+I+ HN DP++++K+GVNHL+D  + EF R+LGYK     ++             
Sbjct: 66  EARLAKIRAHNADPTKTWKEGVNHLTDRHEHEFRRLLGYKP--ALAAKPAYLAATAPLPA 123

Query: 355 XXXXXXXXXXXXSRDWRKSGVVTAVKDQGQCGSCWSFGTTETIESHYAIKTGHLAVLSEQ 176
                         DWR+  VV+ VKDQG CGSCW+F T ET+ESH+A+KTG LAVLSEQ
Sbjct: 124 AIRDFDFTQMPEKVDWREKSVVSPVKDQGHCGSCWTFATAETVESHWALKTGQLAVLSEQ 183

Query: 175 QILDCTPNPQQXXXXXXXXXXTPELAYNRLIKLGGHASEWTYPYRSYFGNNFKSCSFN 2
           QILDCT NP Q          T ELA  R+I++GG ++EWTYPYRSY+G NF  C+F+
Sbjct: 184 QILDCTSNPNQCGGTGGCAGGTAELAMARIIEMGGLSAEWTYPYRSYWGENFSQCNFS 241


>ref|XP_001748864.1| hypothetical protein [Monosiga brevicollis MX1]
           gi|163772544|gb|EDQ86194.1| predicted protein [Monosiga
           brevicollis MX1]
          Length = 340

 Score =  209 bits (531), Expect = 2e-51
 Identities = 110/240 (45%), Positives = 142/240 (59%), Gaps = 7/240 (2%)
 Frame = -2

Query: 700 MRLFAALLIGLAACVNSASAKTLWHQLEDYTFEDYLAEYGKNYSGSEYHQ-RKAIFEDRL 524
           M+     ++ +      A+AKT WHQLE YTFE Y AEY K Y+ +  H+ R+ +FE  L
Sbjct: 1   MKATTVAVLAVVLLAGMAAAKTRWHQLEGYTFEHYKAEYKKAYATTTEHEYRRQVFEQNL 60

Query: 523 AEIQKHNTDPSQSYKKGVNHLSDLTDDEFTRMLGY------KKHVTFSSTTRVXXXXXXX 362
           A+I+ HN D ++++K+GVNH+SD T +EF R+LGY       KH    + TR+       
Sbjct: 61  AKIRAHNADTTKTWKEGVNHMSDWTSEEFRRLLGYDQSYGYSKHQPAPTQTRLGMVQLPE 120

Query: 361 XXXXXXXXXXXXXXSRDWRKSGVVTAVKDQGQCGSCWSFGTTETIESHYAIKTGHLAVLS 182
                           DWR  GVVT VKDQG CGSCWSFG+ +T+ESH AIKTG+L  LS
Sbjct: 121 TV--------------DWRTQGVVTPVKDQGNCGSCWSFGSAQTLESHVAIKTGYLETLS 166

Query: 181 EQQILDCTPNPQQXXXXXXXXXXTPELAYNRLIKLGGHASEWTYPYRSYFGNNFKSCSFN 2
           EQ ILDCTPNP +          T ELAY+   K GG  +EWTYPY S+ G NF+ C F+
Sbjct: 167 EQNILDCTPNPNECGGTGGCEGGTAELAYDAFAKNGGVQTEWTYPYISWAGKNFE-CQFD 225


>ref|XP_004341646.1| papain family cysteine protease containing protein [Acanthamoeba
           castellanii str. Neff] gi|440798492|gb|ELR19560.1|
           papain family cysteine protease containing protein
           [Acanthamoeba castellanii str. Neff]
          Length = 385

 Score =  191 bits (485), Expect = 4e-46
 Identities = 104/228 (45%), Positives = 135/228 (59%), Gaps = 5/228 (2%)
 Frame = -2

Query: 670 LAACVNSASAKTLWHQLEDYTFEDYLAEYGKNY-SGSEYHQRKAIFEDRLAEIQKHNTDP 494
           LA   + A AKT W  L+ Y+F+ Y+ E+ K Y S  E   R+AIFE RLA I+ HN D 
Sbjct: 14  LALLASGAGAKTTWDALDHYSFDRYVVEFNKAYASDDEVVSRRAIFESRLAAIKAHNRDA 73

Query: 493 SQSYKKGVNHLSDLTDDEFTRMLGYKKHVTFSSTTR---VXXXXXXXXXXXXXXXXXXXX 323
           S+S+K+GVN L+D ++ E  ++LGY K V      R                        
Sbjct: 74  SKSWKQGVNQLTDRSEAEIRQLLGYNKGVAAGLAPRGGLQWESAWTGLNEIAQKMRAAAI 133

Query: 322 XSRDWRKSGVVTAVKDQGQCGSCWSFGTTETIESHYAIKTGHLAVLSEQQILDCTPNPQQ 143
              DWR+ GV++ VKDQGQCGSCW+FG++ET+ES++A+ TG L +LSEQ ILDC PNP Q
Sbjct: 134 NHVDWREKGVISPVKDQGQCGSCWTFGSSETLESYWALATGQLPILSEQHILDCIPNPDQ 193

Query: 142 XXXXXXXXXXTPELAYNRLI-KLGGHASEWTYPYRSYFGNNFKSCSFN 2
                     T EL Y  L+ +  G ASEWTYPYRSY+G  F+ CSFN
Sbjct: 194 CGGTGGCAGGTAELVYRALMTQSSGLASEWTYPYRSYWGEAFQ-CSFN 240


>ref|XP_001022365.1| papain family cysteine protease [Tetrahymena thermophila SB210]
           gi|89304132|gb|EAS02120.1| papain family cysteine
           protease [Tetrahymena thermophila SB210]
          Length = 376

 Score =  169 bits (428), Expect = 2e-39
 Identities = 97/229 (42%), Positives = 127/229 (55%), Gaps = 3/229 (1%)
 Frame = -2

Query: 679 LIGLAACVNSASAKTLWHQLEDYTFEDYLAEYGKNYS--GSEYHQRKAIFEDRLAEIQKH 506
           LI LA  +    ++T    L+ YTF+ Y+ ++ K Y    SEY  RKAIFE +LAEI   
Sbjct: 4   LIILAIVLGLTVSQTYNSNLKQYTFDQYIQDFNKGYQYGSSEYFMRKAIFEQKLAEIIAF 63

Query: 505 NTDPSQSYKKGVNHLSDLTDDEFTR-MLGYKKHVTFSSTTRVXXXXXXXXXXXXXXXXXX 329
           N   +QSYKKGVN  +DLTD EF +  LGY K++   S  R                   
Sbjct: 64  NEQTNQSYKKGVNRFTDLTDSEFKQNSLGYSKNM---SNVRAFRNLSVKNLEVTEQQLKE 120

Query: 328 XXXSRDWRKSGVVTAVKDQGQCGSCWSFGTTETIESHYAIKTGHLAVLSEQQILDCTPNP 149
              + DWR+ GVVT VKDQG CGSCW+F +T TIES+ AI +G L  LS QQ++ C PNP
Sbjct: 121 LPVNVDWRQKGVVTPVKDQGHCGSCWAFASTATIESYAAINSGQLKTLSTQQLVSCVPNP 180

Query: 148 QQXXXXXXXXXXTPELAYNRLIKLGGHASEWTYPYRSYFGNNFKSCSFN 2
            Q            ELA+N  ++L G  SE+ YPY+SY      +C+F+
Sbjct: 181 YQCGGTGGCNGAISELAFN-YVQLYGLTSEFKYPYQSYVSGVTGNCTFD 228


>ref|XP_001012928.1| papain family cysteine protease [Tetrahymena thermophila SB210]
           gi|89294695|gb|EAR92683.1| papain family cysteine
           protease [Tetrahymena thermophila SB210]
          Length = 377

 Score =  157 bits (396), Expect = 8e-36
 Identities = 86/210 (40%), Positives = 116/210 (55%), Gaps = 3/210 (1%)
 Frame = -2

Query: 622 LEDYTFEDYLAEYGKNY--SGSEYHQRKAIFEDRLAEIQKHNTDPSQSYKKGVNHLSDLT 449
           L+ YTFE Y+ E+ KNY  +  +Y  RK+IFE  LAEI   N DP+ SYKKGVN  +D T
Sbjct: 24  LKVYTFEQYVKEFNKNYGFNSEDYQLRKSIFERNLAEIIDFNNDPNHSYKKGVNQFTDQT 83

Query: 448 DDEFT-RMLGYKKHVTFSSTTRVXXXXXXXXXXXXXXXXXXXXXSRDWRKSGVVTAVKDQ 272
            +E   + LGY K++   +  R                        DWR+ GV T VKDQ
Sbjct: 84  QNELKEKTLGYSKNM---NKIRPFRNLSVKPIEITEQQLKDLPTHVDWREKGVTTPVKDQ 140

Query: 271 GQCGSCWSFGTTETIESHYAIKTGHLAVLSEQQILDCTPNPQQXXXXXXXXXXTPELAYN 92
           G CGSCW+  +  TIE+H AI +G L +LS QQ++ C PNP Q            ELA+ 
Sbjct: 141 GHCGSCWAQASAATIEAHAAINSGQLKILSTQQLVSCVPNPYQCGGTGGCQGAISELAFT 200

Query: 91  RLIKLGGHASEWTYPYRSYFGNNFKSCSFN 2
             ++L G  SE+ YPY+SYF  N  +C+++
Sbjct: 201 -YVQLYGLTSEFKYPYQSYFSGNTGNCTYD 229


>ref|XP_009309215.1| cysteine proteinase [Trypanosoma grayi] gi|657142683|gb|KEG12550.1|
           cysteine proteinase [Trypanosoma grayi]
          Length = 378

 Score =  155 bits (391), Expect = 3e-35
 Identities = 93/235 (39%), Positives = 124/235 (52%), Gaps = 12/235 (5%)
 Frame = -2

Query: 697 RLFA-ALLIGLAACVNSASAKTLW-----HQLEDYTFEDYLAEYGKNYSGSEYHQRKAIF 536
           RLF+ A+++ LAA   SA+          H+L  YTFE YL ++GK+Y G EY  RKA F
Sbjct: 3   RLFSLAVIVALAALTASAALTATMKGPRRHRLNGYTFEQYLRDFGKSYEGQEYVHRKAFF 62

Query: 535 EDRLAEIQKHNTDPSQSYKKGVNHLSDLTDDEFTRMLGYK----KHVTFSSTTRVXXXXX 368
              LA ++ HN   ++ Y  G+NH+SD   +EF R+ G K    +H+      R      
Sbjct: 63  GQTLASVRAHNAAGNKLYVMGINHMSDWAPEEFARLNGAKPRMMRHLARPELRRTYRKSG 122

Query: 367 XXXXXXXXXXXXXXXXSRDWRKS--GVVTAVKDQGQCGSCWSFGTTETIESHYAIKTGHL 194
                             D+RKS   ++TAVKDQG CGSCW+ G  E +ESHYAI +G L
Sbjct: 123 QVLPNAV-----------DYRKSVPPILTAVKDQGGCGSCWAHGAVENMESHYAIVSGKL 171

Query: 193 AVLSEQQILDCTPNPQQXXXXXXXXXXTPELAYNRLIKLGGHASEWTYPYRSYFG 29
            VLS+QQ+  CTPN Q+          T +LAY     + G  +EW YPY SY G
Sbjct: 172 HVLSQQQLTSCTPNHQKCGGTGGCKGSTADLAYE--YAMDGLTTEWVYPYTSYGG 224


>ref|XP_005770544.1| hypothetical protein EMIHUDRAFT_461333 [Emiliania huxleyi CCMP1516]
           gi|485621672|gb|EOD18115.1| hypothetical protein
           EMIHUDRAFT_461333 [Emiliania huxleyi CCMP1516]
          Length = 537

 Score =  155 bits (391), Expect = 3e-35
 Identities = 85/201 (42%), Positives = 109/201 (54%), Gaps = 1/201 (0%)
 Frame = -2

Query: 643 AKTLWHQLEDYTFEDYLAEYGKNYS-GSEYHQRKAIFEDRLAEIQKHNTDPSQSYKKGVN 467
           A T WH+L+ Y FE Y  E+GK Y+   E   R+A F  +L E++ HN     S+K+GVN
Sbjct: 177 AATRWHELDGYHFERYRQEFGKMYATAEEAAAREAAFNKKLKEVRSHNA-AGHSWKRGVN 235

Query: 466 HLSDLTDDEFTRMLGYKKHVTFSSTTRVXXXXXXXXXXXXXXXXXXXXXSRDWRKSGVVT 287
           HL+D T +E + + G  + + FS    +                       DWR+ G VT
Sbjct: 236 HLTDRTPEELSVLHGLDRALLFSQKHALSTAKPARPFTKADEVPSGV----DWRRQGAVT 291

Query: 286 AVKDQGQCGSCWSFGTTETIESHYAIKTGHLAVLSEQQILDCTPNPQQXXXXXXXXXXTP 107
            VK+QG CGSCW+F + ET+ES + +KTG L  LSEQ ILDCTPNP            T 
Sbjct: 292 PVKNQGHCGSCWTFASAETVESRWFLKTGELQDLSEQFILDCTPNPHACGGTGGCGGGTA 351

Query: 106 ELAYNRLIKLGGHASEWTYPY 44
            LAY RL  LGG  SEW YPY
Sbjct: 352 ALAYERLKALGGLPSEWVYPY 372


>ref|XP_001019547.3| papain family cysteine protease [Tetrahymena thermophila SB210]
           gi|225566367|gb|EAR99302.3| papain family cysteine
           protease [Tetrahymena thermophila SB210]
          Length = 375

 Score =  154 bits (389), Expect = 6e-35
 Identities = 92/229 (40%), Positives = 125/229 (54%), Gaps = 3/229 (1%)
 Frame = -2

Query: 679 LIGLAACVNSASAKTLWHQLEDYTFEDYLAEYGKNYS--GSEYHQRKAIFEDRLAEIQKH 506
           L+ L A +  A+A   W+QL +YTF+DY+ ++ K Y+   +EY+QRK IFE +L EI+  
Sbjct: 4   LVLLFALIVLATAAPKWNQLVNYTFDDYVKDFNKAYTKFSAEYNQRKRIFEQKLKEIKAF 63

Query: 505 NTDPSQSYKKGVNHLSDLTDDEFTR-MLGYKKHVTFSSTTRVXXXXXXXXXXXXXXXXXX 329
           N++    YKKG+N  +D T +E     LGY K V  ++  +                   
Sbjct: 64  NSNSENGYKKGINQFTDRTAEELRETTLGYSKTVKNAANKQ---NMFRNLKTSDKINVKD 120

Query: 328 XXXSRDWRKSGVVTAVKDQGQCGSCWSFGTTETIESHYAIKTGHLAVLSEQQILDCTPNP 149
              S DWR +GVVT VKDQG CGSCW+F TT  IES+ AI TG L  LS QQ++ C  N 
Sbjct: 121 LPKSVDWRDAGVVTPVKDQGHCGSCWAFATTAVIESYAAIATGQLKTLSTQQLVSCVQNS 180

Query: 148 QQXXXXXXXXXXTPELAYNRLIKLGGHASEWTYPYRSYFGNNFKSCSFN 2
            Q            ELAYN  ++L G  SE+ Y Y SY G    +C+F+
Sbjct: 181 YQCGGQGGCNGAVSELAYN-YVQLFGLTSEYKYSYSSYQGQT-GNCTFD 227


>ref|XP_001015624.1| papain family cysteine protease [Tetrahymena thermophila SB210]
           gi|89297391|gb|EAR95379.1| papain family cysteine
           protease [Tetrahymena thermophila SB210]
          Length = 375

 Score =  150 bits (379), Expect = 8e-34
 Identities = 95/238 (39%), Positives = 121/238 (50%), Gaps = 4/238 (1%)
 Frame = -2

Query: 703 KMRLFAALLIGLAACVNSASAKTLWHQLEDYTFEDYLAEYGKNYS--GSEYHQRKAIFED 530
           K  +FA LL+ + +C      K  WHQL++YTFE Y+ ++ K Y     EY+QRK  FE 
Sbjct: 3   KYIIFATLLLSVVSC------KPKWHQLQNYTFEQYIVDFEKEYEVDSVEYNQRKQTFEK 56

Query: 529 RLAEIQKHNTDPSQSYKKGVNHLSDLTDDEFTRMLGYKKHVTFSSTTRVXXXXXXXXXXX 350
            L EI   N +   SYKKGVN  +DLT  EF   LG KK    S   R            
Sbjct: 57  NLVEIIAFN-NKDHSYKKGVNRNTDLTTKEFQVQLGLKK----SMKNRKNPIQRLLAKNN 111

Query: 349 XXXXXXXXXXSRDWRKSGVVTAVKDQGQCGSCWSFGTTETIESHYAIKT--GHLAVLSEQ 176
                     S DWR+ GVV+ VKDQG CGSCW+F +   +ES  AI    G L  LS Q
Sbjct: 112 TAASLTDLPQSVDWRQKGVVSPVKDQGGCGSCWAFASAAVLESAAAIAAGPGQLKTLSTQ 171

Query: 175 QILDCTPNPQQXXXXXXXXXXTPELAYNRLIKLGGHASEWTYPYRSYFGNNFKSCSFN 2
           Q++ C PNP Q            ELA++    L G  SE+ Y Y+SYFG  + SC ++
Sbjct: 172 QLVSCVPNPNQCGGTGGCSGAVAELAFS-YTTLYGITSEYKYSYQSYFGTTY-SCKYD 227


>ref|XP_009033689.1| hypothetical protein AURANDRAFT_11974, partial [Aureococcus
           anophagefferens] gi|323455439|gb|EGB11307.1|
           hypothetical protein AURANDRAFT_11974, partial
           [Aureococcus anophagefferens]
          Length = 330

 Score =  147 bits (371), Expect = 7e-33
 Identities = 84/206 (40%), Positives = 110/206 (53%), Gaps = 2/206 (0%)
 Frame = -2

Query: 613 YTFEDYLAEYGKNYSGSEYHQRKAIFEDRLAEIQKHNTDPSQSYKKGVNHLSDLTDDEFT 434
           YTFE Y A++GK+Y+  E  +R+ IFE RL  I +HNT+PS +YKKGVN L+D TDDE  
Sbjct: 1   YTFEQYKADFGKSYAPQEDDERRVIFEARLKSILEHNTNPS-TYKKGVNALTDRTDDERR 59

Query: 433 RMLGYKKHVTFSSTTRVXXXXXXXXXXXXXXXXXXXXXSRDWR--KSGVVTAVKDQGQCG 260
            + G  K +  S                            DWR  +  V+TAVKDQGQCG
Sbjct: 60  ALNGRDKALAASRPRAALAAPAPLAALPTTF---------DWRDRRPSVLTAVKDQGQCG 110

Query: 259 SCWSFGTTETIESHYAIKTGHLAVLSEQQILDCTPNPQQXXXXXXXXXXTPELAYNRLIK 80
           SCW+F +TETIESH A+ TG L  L+ Q ++ C  N               ELA+   ++
Sbjct: 111 SCWAFASTETIESHVALNTGELVELAPQHLVSCAANVYDCGGTGGCGGSIAELAF-EYVQ 169

Query: 79  LGGHASEWTYPYRSYFGNNFKSCSFN 2
             G A+EWTYPY +       SC +N
Sbjct: 170 THGMATEWTYPYTAGLDGKSGSCRYN 195


>gb|ABH06549.2| cathepsin L cysteine protease ICP1 [Ichthyophthirius multifiliis]
          Length = 374

 Score =  145 bits (366), Expect = 3e-32
 Identities = 87/218 (39%), Positives = 118/218 (54%), Gaps = 5/218 (2%)
 Frame = -2

Query: 661 CVNSASAKTLWHQLE-DYTFEDYLAEYGKNYSGS--EYHQRKAIFEDRLAEIQKHNTDPS 491
           C+  A+ K  WH+L  +YTF+ Y+++Y KNY+    EY QRK IFE +L EI  HN + S
Sbjct: 7   CLLLAAPK--WHELSSEYTFDQYISDYSKNYAKGTREYDQRKIIFESKLQEILSHNQNTS 64

Query: 490 QSYKKGVNHLSDLTDDEFTRM-LGYKKHVTFSSTTRVXXXXXXXXXXXXXXXXXXXXXSR 314
            +YK+G+N  +D++  EF +  LGY K    SS  +                      S 
Sbjct: 65  HTYKRGINAFTDMSHQEFKQSKLGYSKGFR-SSRNQQFRQLLLNNKKITPEQIAELPKSV 123

Query: 313 DWRKSGVVTAVKDQGQCGSCWSFGTTETIESHYAIKTG-HLAVLSEQQILDCTPNPQQXX 137
           DWR   VV+ VKDQG CGSCW+F T   IESH AI    HL VLS +Q+++C  N  Q  
Sbjct: 124 DWRDHNVVSPVKDQGHCGSCWAFATVAVIESHAAISADKHLKVLSTEQLVNCMSNEMQCG 183

Query: 136 XXXXXXXXTPELAYNRLIKLGGHASEWTYPYRSYFGNN 23
                     EL +N  ++L G  SE+ YPY+SY G +
Sbjct: 184 GQGGCNGAVAELGFN-YVQLFGLTSEYKYPYQSYQGKS 220


>gb|EKF26898.1| cysteine proteinase, putative [Trypanosoma cruzi marinkellei]
          Length = 392

 Score =  143 bits (361), Expect = 1e-31
 Identities = 82/204 (40%), Positives = 108/204 (52%), Gaps = 6/204 (2%)
 Frame = -2

Query: 622 LEDYTFEDYLAEYGKNYSGSEYHQRKAIFEDRLAEIQKHNTDPSQSYKKGVNHLSDLTDD 443
           L+ YTF+ +L EYGK Y   EY +R+AIFE  LA ++ HN   +  Y  G+NH+SD T +
Sbjct: 50  LDGYTFDRFLQEYGKKYDAKEYVRRRAIFEQTLARVRTHNEAGNHLYVMGINHMSDWTPE 109

Query: 442 EFTRMLGYK----KHVTFSSTTRVXXXXXXXXXXXXXXXXXXXXXSRDWRKSG--VVTAV 281
           EFT + G +     H+   S  R                        D+R S   ++TAV
Sbjct: 110 EFTSLNGARPRMMSHLAQKSLRRRYQASGEKIPNEV-----------DYRNSSPAILTAV 158

Query: 280 KDQGQCGSCWSFGTTETIESHYAIKTGHLAVLSEQQILDCTPNPQQXXXXXXXXXXTPEL 101
           KDQG CGSCW+ G  E +E+HYAI TG L VLS+QQ+  C PNP++          T +L
Sbjct: 159 KDQGHCGSCWAHGAAEEMETHYAILTGRLHVLSQQQLTSCAPNPKKCGGTGGCYGSTADL 218

Query: 100 AYNRLIKLGGHASEWTYPYRSYFG 29
           AY    +  G  SEW Y Y SY G
Sbjct: 219 AYEYAKQ--GITSEWMYSYTSYRG 240


>gb|KPA83885.1| putative cysteine proteinase [Leptomonas pyrrhocoris]
          Length = 369

 Score =  143 bits (360), Expect = 1e-31
 Identities = 75/200 (37%), Positives = 109/200 (54%), Gaps = 2/200 (1%)
 Frame = -2

Query: 610 TFEDYLAEYGKNYSGSEYHQRKAIFEDRLAEIQKHNTDPSQSYKKGVNHLSDLTDDEFTR 431
           +F+ Y+ +YGK YS  EY +R  IF +RL EI++ N D   SY++G+N L+D T+DE   
Sbjct: 28  SFDAYIRQYGKRYSAVEYSKRLKIFTERLREIEEFNRDGKHSYRRGLNKLTDWTEDEIGA 87

Query: 430 MLGYKKHVTFSSTTRVXXXXXXXXXXXXXXXXXXXXXSRDWRKS--GVVTAVKDQGQCGS 257
           + G +  ++ +  + V                       D+R S   V+T++KDQG CGS
Sbjct: 88  LNGARPMMSRNLRSSVPKHIYNRSSHTLPRRV-------DYRTSVPPVLTSIKDQGSCGS 140

Query: 256 CWSFGTTETIESHYAIKTGHLAVLSEQQILDCTPNPQQXXXXXXXXXXTPELAYNRLIKL 77
           CW+    E +ESH+AI TGHL VLS+QQ+  CTPNP+              LAY+ +   
Sbjct: 141 CWAHSAVEAMESHWAIATGHLHVLSQQQVTACTPNPRHCGGTGGCDGSIEALAYDYVAGA 200

Query: 76  GGHASEWTYPYRSYFGNNFK 17
           GG   EW YPY +++G   K
Sbjct: 201 GGIQEEWGYPYTAFYGETGK 220


>ref|XP_809860.1| cysteine proteinase [Trypanosoma cruzi strain CL Brener]
           gi|70874305|gb|EAN88009.1| cysteine proteinase, putative
           [Trypanosoma cruzi]
          Length = 392

 Score =  140 bits (352), Expect = 1e-30
 Identities = 80/204 (39%), Positives = 107/204 (52%), Gaps = 6/204 (2%)
 Frame = -2

Query: 622 LEDYTFEDYLAEYGKNYSGSEYHQRKAIFEDRLAEIQKHNTDPSQSYKKGVNHLSDLTDD 443
           L+ YTF+ +L EYGK Y   EY +R+A+FE  LA ++ HN   +  Y  G+NH+SD T +
Sbjct: 50  LDGYTFDRFLQEYGKKYDAREYVRRRALFEQTLARVRTHNEAGNHLYVMGINHMSDWTPE 109

Query: 442 EFTRMLGYK----KHVTFSSTTRVXXXXXXXXXXXXXXXXXXXXXSRDWRKSG--VVTAV 281
           E   + G +     H+   S  R                        D+R S   ++TAV
Sbjct: 110 ELASLNGARPRMMSHLAQKSLQRRYQAIGGRIPDEV-----------DYRNSSPAILTAV 158

Query: 280 KDQGQCGSCWSFGTTETIESHYAIKTGHLAVLSEQQILDCTPNPQQXXXXXXXXXXTPEL 101
           KDQG+CGSCW+ G  E +ESH+AI TG L  LS+QQ+  C PNP++          T +L
Sbjct: 159 KDQGRCGSCWAHGAAEEMESHFAILTGRLHALSQQQLTSCAPNPKKCGGTGGCYGSTADL 218

Query: 100 AYNRLIKLGGHASEWTYPYRSYFG 29
           AY    K  G ASEW Y Y SY G
Sbjct: 219 AYEYAEK--GIASEWVYSYTSYRG 240


>gb|KPI82686.1| putative cysteine proteinase [Leptomonas seymouri]
          Length = 371

 Score =  139 bits (351), Expect = 1e-30
 Identities = 73/196 (37%), Positives = 107/196 (54%), Gaps = 2/196 (1%)
 Frame = -2

Query: 610 TFEDYLAEYGKNYSGSEYHQRKAIFEDRLAEIQKHNTDPSQSYKKGVNHLSDLTDDEFTR 431
           +F  Y+ +YGK+Y  +EY +R  IF  RL EI++ N +   SY+KG+NH++D TDDE + 
Sbjct: 28  SFVSYVRQYGKHYDAAEYSRRLKIFTKRLHEIEEFNRNGMHSYRKGLNHMTDWTDDELSA 87

Query: 430 MLGYKKHVTFSSTTRVXXXXXXXXXXXXXXXXXXXXXSRDWRKS--GVVTAVKDQGQCGS 257
           + G +  +  +  + +                       D+R S   V+TA+K+QG CGS
Sbjct: 88  LNGARPMMARNIQSSLPKHIHNRSSHTLPHHV-------DYRTSVPPVLTAIKNQGYCGS 140

Query: 256 CWSFGTTETIESHYAIKTGHLAVLSEQQILDCTPNPQQXXXXXXXXXXTPELAYNRLIKL 77
           CW+    E +ESH+AI TG L VLS QQ+  CTPNP+              LA++ + K 
Sbjct: 141 CWAHSAVEAMESHWAILTGRLHVLSLQQVTACTPNPRHCGGTGGCSGSIEPLAFDYIAKA 200

Query: 76  GGHASEWTYPYRSYFG 29
           GG   EW YPY +Y+G
Sbjct: 201 GGIQEEWAYPYTAYYG 216


>ref|XP_001015629.1| papain family cysteine protease [Tetrahymena thermophila SB210]
           gi|89297396|gb|EAR95384.1| papain family cysteine
           protease [Tetrahymena thermophila SB210]
          Length = 380

 Score =  139 bits (351), Expect = 1e-30
 Identities = 82/216 (37%), Positives = 113/216 (52%), Gaps = 6/216 (2%)
 Frame = -2

Query: 631 WHQLEDYTFEDYLAEYGKNYSGS--EYHQRKAIFEDRLAEIQKHNTDPSQSYKKGVNHLS 458
           W+QL +YTF+ Y+ ++ K Y+ +  EYH RK IF  +L  I   N+     YKKGVN  +
Sbjct: 24  WNQLNNYTFDQYITDFNKGYTPNSPEYHMRKTIFNKKLQAIISFNSLQGTYYKKGVNQFT 83

Query: 457 DLTDDEF-TRMLGYK---KHVTFSSTTRVXXXXXXXXXXXXXXXXXXXXXSRDWRKSGVV 290
           D ++ E   + LGY    +   FSS++R+                       DWR+  VV
Sbjct: 84  DQSEQELENQTLGYVSSGQKSPFSSSSRLLSSTLSNVTLQELPASV------DWREKNVV 137

Query: 289 TAVKDQGQCGSCWSFGTTETIESHYAIKTGHLAVLSEQQILDCTPNPQQXXXXXXXXXXT 110
           T VKDQG+CGSCW+F +  TIESH AI +G L  LS QQ++ C  N              
Sbjct: 138 TPVKDQGKCGSCWAFASAATIESHAAIASGKLKTLSTQQLVSCAQNSYNCGGVGGCHGSI 197

Query: 109 PELAYNRLIKLGGHASEWTYPYRSYFGNNFKSCSFN 2
            ELA++  ++L G  S++ Y Y SY G    SCSFN
Sbjct: 198 AELAFS-YVQLFGITSDYKYSYSSYQGVEEGSCSFN 232


>ref|XP_811957.1| cysteine proteinase [Trypanosoma cruzi strain CL Brener]
           gi|70876682|gb|EAN90106.1| cysteine proteinase, putative
           [Trypanosoma cruzi]
          Length = 392

 Score =  138 bits (348), Expect = 3e-30
 Identities = 79/204 (38%), Positives = 107/204 (52%), Gaps = 6/204 (2%)
 Frame = -2

Query: 622 LEDYTFEDYLAEYGKNYSGSEYHQRKAIFEDRLAEIQKHNTDPSQSYKKGVNHLSDLTDD 443
           L+ YTF+ +L EYGK Y   EY +R+A+FE  LA ++ HN   +  Y  G+NH+SD T +
Sbjct: 50  LDGYTFDRFLQEYGKKYDAREYVRRRALFEQTLARVRTHNEAGNHLYVMGINHMSDWTPE 109

Query: 442 EFTRMLGYK----KHVTFSSTTRVXXXXXXXXXXXXXXXXXXXXXSRDWRKSG--VVTAV 281
           E   + G +     H+   S  R                        D+R S   ++TAV
Sbjct: 110 ELASLNGARPRMMSHLAQKSLQRRYQSSGGRIPDEV-----------DYRNSSPAILTAV 158

Query: 280 KDQGQCGSCWSFGTTETIESHYAIKTGHLAVLSEQQILDCTPNPQQXXXXXXXXXXTPEL 101
           KDQG+CGSCW+ G  E +ESH+AI TG L VLS+QQ+  C PNP++          T +L
Sbjct: 159 KDQGRCGSCWAHGAAEEMESHFAILTGRLHVLSQQQLTSCAPNPKKCGGTGGCYGSTADL 218

Query: 100 AYNRLIKLGGHASEWTYPYRSYFG 29
           AY    +  G  SEW Y Y SY G
Sbjct: 219 AYEYAKQ--GITSEWVYSYTSYRG 240


>gb|ESS61639.1| cysteine proteinase [Trypanosoma cruzi Dm28c]
          Length = 392

 Score =  138 bits (348), Expect = 3e-30
 Identities = 80/204 (39%), Positives = 106/204 (51%), Gaps = 6/204 (2%)
 Frame = -2

Query: 622 LEDYTFEDYLAEYGKNYSGSEYHQRKAIFEDRLAEIQKHNTDPSQSYKKGVNHLSDLTDD 443
           L  YTF+ +L EYGK Y   EY +R+A+FE  LA ++ HN   +  Y  G+NH+SD T +
Sbjct: 50  LGGYTFDRFLQEYGKKYDAREYVRRRALFEQTLARVRTHNEAGNHLYVMGINHMSDWTPE 109

Query: 442 EFTRMLGYK----KHVTFSSTTRVXXXXXXXXXXXXXXXXXXXXXSRDWRKSG--VVTAV 281
           E   + G +     H+   S  R                        D+R S   ++TAV
Sbjct: 110 ELATLNGARPRMMSHLAQKSLQRRYQASGGRIPDEV-----------DYRNSSPAILTAV 158

Query: 280 KDQGQCGSCWSFGTTETIESHYAIKTGHLAVLSEQQILDCTPNPQQXXXXXXXXXXTPEL 101
           KDQG+CGSCW+ G  E +ESH+AI TG L VLS+QQ+  C PNP++          T +L
Sbjct: 159 KDQGRCGSCWAHGAAEEMESHFAILTGRLHVLSQQQLTSCAPNPKKCGGTGGCYGSTADL 218

Query: 100 AYNRLIKLGGHASEWTYPYRSYFG 29
           AY    K  G  SEW Y Y SY G
Sbjct: 219 AYEYAKK--GITSEWVYSYTSYRG 240

