BLASTX nr result

ID: Rehmannia28_contig00051975

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia28_contig00051975
         (476 letters)

Database: ./nr 
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_012653036.1| papain family cysteine protease [Tetrahymena...   183   9e-55
ref|XP_001026313.1| papain family cysteine protease [Tetrahymena...   182   5e-54
ref|XP_001460368.1| hypothetical protein [Paramecium tetraurelia...   169   2e-49
ref|XP_001440132.1| hypothetical protein [Paramecium tetraurelia...   169   2e-49
sp|A0E358.2|CATL2_PARTE RecName: Full=Cathepsin L 2; Flags: Prec...   165   1e-47
ref|XP_001457122.1| hypothetical protein [Paramecium tetraurelia...   165   2e-47
ref|XP_001013459.1| papain family cysteine protease [Tetrahymena...   164   3e-47
ref|XP_001457867.1| hypothetical protein [Paramecium tetraurelia...   164   3e-47
sp|Q94714.1|CATL1_PARTE RecName: Full=Cathepsin L 1; Flags: Prec...   162   1e-46
ref|XP_001438701.1| hypothetical protein [Paramecium tetraurelia...   162   1e-46
ref|XP_001454745.1| hypothetical protein [Paramecium tetraurelia...   162   1e-46
ref|XP_004039041.1| papain family cysteine protease, putative, p...   161   5e-46
ref|XP_001020107.1| papain family cysteine protease [Tetrahymena...   161   7e-46
ref|XP_001013456.3| papain family cysteine protease [Tetrahymena...   160   1e-45
ref|XP_001446315.1| hypothetical protein [Paramecium tetraurelia...   158   7e-45
ref|XP_001031724.1| papain family cysteine protease [Tetrahymena...   158   9e-45
ref|XP_001462117.1| hypothetical protein [Paramecium tetraurelia...   157   1e-44
ref|XP_001427790.1| hypothetical protein [Paramecium tetraurelia...   157   1e-44
ref|XP_001008299.1| papain family cysteine protease [Tetrahymena...   157   1e-44
ref|XP_001020099.1| papain family cysteine protease [Tetrahymena...   157   2e-44

>ref|XP_012653036.1| papain family cysteine protease [Tetrahymena thermophila SB210]
           gi|586738495|gb|EWS74459.1| papain family cysteine
           protease [Tetrahymena thermophila SB210]
          Length = 319

 Score =  183 bits (465), Expect = 9e-55
 Identities = 86/165 (52%), Positives = 110/165 (66%), Gaps = 7/165 (4%)
 Frame = -2

Query: 475 CGSCWAFSATGVLESWALQRGQ---NVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKA 305
           CGSCWAFS TG LES  +  GQ    ++LSEQQLVDCS   GNQGCNGG   +A  Y+KA
Sbjct: 135 CGSCWAFSTTGALESALIVAGQATNTINLSEQQLVDCSTSYGNQGCNGGLMDNAFKYIKA 194

Query: 304 NGITTESAYPYAAKDQSCKT---QGGSFRINGYSSHSGC-NGLSSQINNSPVSVTVDATN 137
           N +TTES YPY  KD  C +   +   + + G++  +   + L + I   PV++ VDA+ 
Sbjct: 195 NQLTTESNYPYTGKDGKCNSAAIKAPLYSLKGFTDVAKTTSALQAAIQKQPVAIAVDASK 254

Query: 136 WSPYRSGVFNNCASSINHAVLLVGVVGGNWKIKNSWGTGWGENGF 2
           WS Y  GVF+NCA+ +NH VLLVG+V GNW +KNSWG  WGENG+
Sbjct: 255 WSYYTGGVFSNCATQLNHGVLLVGIVNGNWLVKNSWGASWGENGY 299


>ref|XP_001026313.1| papain family cysteine protease [Tetrahymena thermophila SB210]
           gi|89308080|gb|EAS06068.1| papain family cysteine
           protease [Tetrahymena thermophila SB210]
          Length = 328

 Score =  182 bits (461), Expect = 5e-54
 Identities = 88/161 (54%), Positives = 106/161 (65%), Gaps = 3/161 (1%)
 Frame = -2

Query: 475 CGSCWAFSATGVLE-SWALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299
           CGSCWAFS TG LE S+ L+  Q +  SEQQLVDCSR   N GCNGG    A  YVKA+G
Sbjct: 148 CGSCWAFSTTGALEGSYFLKNNQLISFSEQQLVDCSRLYLNMGCNGGLMPRAFRYVKAHG 207

Query: 298 ITTESAYPYAAKDQSCKTQGGSFRINGYSS--HSGCNGLSSQINNSPVSVTVDATNWSPY 125
           ITTE  YPY AKD  C+T+ G ++I  +S+     C+ L++ I   PVSV VDATN+  Y
Sbjct: 208 ITTEEEYPYTAKDGKCQTKQGQYKIKSFSTVPRGNCDKLAAAIAQQPVSVGVDATNFKFY 267

Query: 124 RSGVFNNCASSINHAVLLVGVVGGNWKIKNSWGTGWGENGF 2
            SGVF+NC   +NH VL  G     W IKNSWGT WG+NG+
Sbjct: 268 TSGVFDNCKKKLNHGVLATGYTADYWIIKNSWGTAWGQNGY 308


>ref|XP_001460368.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
           gi|124428198|emb|CAK92971.1| unnamed protein product
           [Paramecium tetraurelia]
          Length = 309

 Score =  169 bits (429), Expect = 2e-49
 Identities = 81/160 (50%), Positives = 112/160 (70%), Gaps = 2/160 (1%)
 Frame = -2

Query: 475 CGSCWAFSATGVLESW-ALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299
           CGSCWAFSA G +E++  +++G + +LSEQQLVDC +   + GC+GG+P  A+ Y+ ANG
Sbjct: 134 CGSCWAFSAVGAVEAFFKIKKGADHNLSEQQLVDCDK--ASNGCDGGYPDKAIKYIAANG 191

Query: 298 ITTESAYPYAAKDQSCKTQGGSFRINGYSSHSGCNGLSSQINNSPVSVTVDATNWSPYRS 119
             T++AY Y     +CK+  GS + +G S+ +  +GL + I + P+SV VDA+NWS Y+S
Sbjct: 192 SQTQAAYQYTGVKGTCKSATGSVKNSGVSTIAK-SGLQAAIKDYPISVCVDASNWSNYKS 250

Query: 118 GVFNNCASSINHAVLLVGV-VGGNWKIKNSWGTGWGENGF 2
           GVFNNC  ++NHAV+ VG    GNW IKNSW T WGE GF
Sbjct: 251 GVFNNCNKNLNHAVMAVGYDASGNWIIKNSWATSWGEKGF 290


>ref|XP_001440132.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
           gi|124407338|emb|CAK72735.1| unnamed protein product
           [Paramecium tetraurelia]
          Length = 321

 Score =  169 bits (429), Expect = 2e-49
 Identities = 82/161 (50%), Positives = 105/161 (65%), Gaps = 3/161 (1%)
 Frame = -2

Query: 475 CGSCWAFSATGVLE-SWALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299
           CGSCWAFSA G LE +  +Q  + VDLSEQ LVDC+ P GN GC+GGW  SAL+Y+  +G
Sbjct: 140 CGSCWAFSAVGALEINTKIQFNEIVDLSEQDLVDCAGPYGNAGCDGGWMESALDYIIDSG 199

Query: 298 ITTESAYPYAAKDQSCKTQGGSF-RINGYSSHSGCNGLSSQINNSPVSVTVDATNWSPYR 122
           I     YPY  +D  CK+   +F R+ GY    GC  +S+ +    VSV VDATNW  Y 
Sbjct: 200 IAETKVYPYKGEDGICKSVERNFRRVIGYVDLDGCQDISNALIQQSVSVGVDATNWRFYS 259

Query: 121 SGVFNNCASSINHAVLLVGV-VGGNWKIKNSWGTGWGENGF 2
           SGVF++C   +NH V+LVG+   G WK++NSWG  WGE G+
Sbjct: 260 SGVFSDCKKYLNHGVVLVGINKNGVWKVRNSWGQDWGEQGY 300


>sp|A0E358.2|CATL2_PARTE RecName: Full=Cathepsin L 2; Flags: Precursor
          Length = 314

 Score =  165 bits (417), Expect = 1e-47
 Identities = 80/161 (49%), Positives = 100/161 (62%), Gaps = 4/161 (2%)
 Frame = -2

Query: 475 CGSCWAFSATGVLE-SWALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299
           CGSCWAFSA G LE +  ++  +  +LSEQ LVDCS P  N+GCNGGW  SA  YV  NG
Sbjct: 132 CGSCWAFSAVGALEINTDIELNKKYELSEQDLVDCSGPYDNEGCNGGWMDSAFEYVADNG 191

Query: 298 ITTESAYPYAAKDQSCKT--QGGSFRINGYSSHSGCNGLSSQINNSPVSVTVDATNWSPY 125
           +     YPY AKD +CKT  +     + G++    C+ L+  I    VSV VDA  W  Y
Sbjct: 192 LAEAKDYPYTAKDGTCKTSVKRPYTHVQGFTDIDSCDELAQAIQERTVSVAVDANPWQFY 251

Query: 124 RSGVFNNCASSINHAVLLVGV-VGGNWKIKNSWGTGWGENG 5
           RSGV + C  ++NH V+LVGV   G WKI+NSWG+ WGE G
Sbjct: 252 RSGVLSKCTKNLNHGVVLVGVQADGAWKIRNSWGSSWGEAG 292


>ref|XP_001457122.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
           gi|124424937|emb|CAK89725.1| unnamed protein product
           [Paramecium tetraurelia]
          Length = 324

 Score =  165 bits (417), Expect = 2e-47
 Identities = 80/161 (49%), Positives = 100/161 (62%), Gaps = 4/161 (2%)
 Frame = -2

Query: 475 CGSCWAFSATGVLE-SWALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299
           CGSCWAFSA G LE +  ++  +  +LSEQ LVDCS P  N+GCNGGW  SA  YV  NG
Sbjct: 142 CGSCWAFSAVGALEINTDIELNKKYELSEQDLVDCSGPYDNEGCNGGWMDSAFEYVADNG 201

Query: 298 ITTESAYPYAAKDQSCKT--QGGSFRINGYSSHSGCNGLSSQINNSPVSVTVDATNWSPY 125
           +     YPY AKD +CKT  +     + G++    C+ L+  I    VSV VDA  W  Y
Sbjct: 202 LAEAKDYPYTAKDGTCKTSVKRPYTHVQGFTDIDSCDELAQAIQERTVSVAVDANPWQFY 261

Query: 124 RSGVFNNCASSINHAVLLVGV-VGGNWKIKNSWGTGWGENG 5
           RSGV + C  ++NH V+LVGV   G WKI+NSWG+ WGE G
Sbjct: 262 RSGVLSKCTKNLNHGVVLVGVQADGAWKIRNSWGSSWGEAG 302


>ref|XP_001013459.1| papain family cysteine protease [Tetrahymena thermophila SB210]
           gi|89295226|gb|EAR93214.1| papain family cysteine
           protease [Tetrahymena thermophila SB210]
          Length = 320

 Score =  164 bits (415), Expect = 3e-47
 Identities = 74/161 (45%), Positives = 107/161 (66%), Gaps = 3/161 (1%)
 Frame = -2

Query: 475 CGSCWAFSATGVLE-SWALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299
           CGSCW+FS TG +E +  L   + V LSEQ L+DCS+  GN+GCNGG   +A +++  NG
Sbjct: 141 CGSCWSFSTTGAVEGAHFLSSNELVSLSEQYLIDCSK-NGNEGCNGGLMDTAFDFIAQNG 199

Query: 298 ITTESAYPYAAKDQSCKTQGGSFRINGYSSHSGCNGLSSQINNSPVSVTVDATNWSPYRS 119
           I TE+AYPY A D +CK   G ++I+ Y +   CN L S++   P+++ VDA N+  Y  
Sbjct: 200 IPTENAYPYKALDGTCKMTTGPYKISSYQNIISCNDLLSKLQKQPIAIAVDANNFQFYTK 259

Query: 118 GVFNNCASSINHAVLLVGVVGGN--WKIKNSWGTGWGENGF 2
           G+F+ C  +++H VLLVG    +  WK+KNSWG+ WGE+G+
Sbjct: 260 GIFSKCGKNLDHGVLLVGYSSKDKFWKVKNSWGSSWGEDGY 300


>ref|XP_001457867.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
           gi|124425685|emb|CAK90470.1| unnamed protein product
           [Paramecium tetraurelia]
          Length = 324

 Score =  164 bits (415), Expect = 3e-47
 Identities = 76/162 (46%), Positives = 105/162 (64%), Gaps = 4/162 (2%)
 Frame = -2

Query: 475 CGSCWAFSATGVLE-SWALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299
           CGS WAFSA GVLE +  ++ G    LSEQ ++DCS P GNQGC+GGW  S   YV+ +G
Sbjct: 140 CGSSWAFSAVGVLEINSNIEFGLETTLSEQDMLDCSGPYGNQGCSGGWMDSGFEYVRDHG 199

Query: 298 ITTESAYPYAAKDQSCKTQ-GGSFR-INGYSSHSGCNGLSSQINNSPVSVTVDATNWSPY 125
           I   S YPY   DQ+C+T     F+ + G+    GCNGL + I +  +S+ VDA+NW+ Y
Sbjct: 200 IANGSVYPYVGSDQTCRTSVKRDFKYVTGFVDVDGCNGLQTAIQDQALSIGVDASNWAYY 259

Query: 124 RSGVFNNCASSINHAVLLVGV-VGGNWKIKNSWGTGWGENGF 2
           + G+FNNC  ++    +LVGV   G WK+++ WG+ WGENG+
Sbjct: 260 KGGIFNNCKQNLTSGSILVGVDQNGVWKVRHQWGSKWGENGY 301


>sp|Q94714.1|CATL1_PARTE RecName: Full=Cathepsin L 1; Flags: Precursor
           gi|1403087|emb|CAA62869.1| cathepsin L [Paramecium
           tetraurelia]
          Length = 314

 Score =  162 bits (411), Expect = 1e-46
 Identities = 79/161 (49%), Positives = 98/161 (60%), Gaps = 4/161 (2%)
 Frame = -2

Query: 475 CGSCWAFSATGVLE-SWALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299
           CGSCWAFSA G LE +  ++  +  +LSEQ LVDCS P  N GCNGGW  SA  YV  NG
Sbjct: 132 CGSCWAFSAVGALEINTDIELNRKYELSEQDLVDCSGPYDNDGCNGGWMDSAFEYVADNG 191

Query: 298 ITTESAYPYAAKDQSCKT--QGGSFRINGYSSHSGCNGLSSQINNSPVSVTVDATNWSPY 125
           +     YPY AKD +CKT  +     + G+     C+ L+  I    V+V VDA  W  Y
Sbjct: 192 LAEAKDYPYTAKDGTCKTSVKRPYTHVQGFKDIDSCDELAQTIQERTVAVAVDANPWQFY 251

Query: 124 RSGVFNNCASSINHAVLLVGV-VGGNWKIKNSWGTGWGENG 5
           RSGV + C  ++NH V+LVGV   G WKI+NSWG+ WGE G
Sbjct: 252 RSGVLSKCTKNLNHGVVLVGVQADGAWKIRNSWGSSWGEAG 292


>ref|XP_001438701.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
           gi|124405873|emb|CAK71304.1| unnamed protein product
           [Paramecium tetraurelia]
          Length = 320

 Score =  162 bits (411), Expect = 1e-46
 Identities = 85/169 (50%), Positives = 110/169 (65%), Gaps = 11/169 (6%)
 Frame = -2

Query: 475 CGSCWAFSATGVLESW-ALQRGQNVDLSEQQLVDCSR--PQGNQGCNGGWPSSALNYVKA 305
           CGSCWAFS TGVLE W  +  G+  +LSEQQLVDCS   P  NQGCNGG PS ALNYVK 
Sbjct: 133 CGSCWAFSTTGVLEGWFQINTGKLPNLSEQQLVDCSTFIPDLNQGCNGGMPSRALNYVKR 192

Query: 304 NGITTESAYPYAAKDQSCKTQGGSFRINGYSSHSGCNGLSSQ--INNSPVSVTVDATNWS 131
           NG+TT+ AYPY A DQ+CK +GG ++++G S+    N  + Q  + + PVSV V A++W 
Sbjct: 193 NGLTTQDAYPYQAVDQACKIKGGEYKVSG-STAIAANEAAHQAALQSGPVSVAVKASDWK 251

Query: 130 PYR----SGVF--NNCASSINHAVLLVGVVGGNWKIKNSWGTGWGENGF 2
            Y+    + +F  + C   +NHAVL VG       +KNSW T WG +G+
Sbjct: 252 NYKPKGDNYIFPDSECTGDVNHAVLAVGFTSEALIVKNSWNTVWGVDGY 300


>ref|XP_001454745.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
           gi|124422522|emb|CAK87348.1| unnamed protein product
           [Paramecium tetraurelia]
          Length = 324

 Score =  162 bits (411), Expect = 1e-46
 Identities = 79/161 (49%), Positives = 98/161 (60%), Gaps = 4/161 (2%)
 Frame = -2

Query: 475 CGSCWAFSATGVLE-SWALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299
           CGSCWAFSA G LE +  ++  +  +LSEQ LVDCS P  N GCNGGW  SA  YV  NG
Sbjct: 142 CGSCWAFSAVGALEINTDIELNRKYELSEQDLVDCSGPYDNDGCNGGWMDSAFEYVADNG 201

Query: 298 ITTESAYPYAAKDQSCKT--QGGSFRINGYSSHSGCNGLSSQINNSPVSVTVDATNWSPY 125
           +     YPY AKD +CKT  +     + G+     C+ L+  I    V+V VDA  W  Y
Sbjct: 202 LAEAKDYPYTAKDGTCKTSVKRPYTHVQGFKDIDSCDELAQTIQERTVAVAVDANPWQFY 261

Query: 124 RSGVFNNCASSINHAVLLVGV-VGGNWKIKNSWGTGWGENG 5
           RSGV + C  ++NH V+LVGV   G WKI+NSWG+ WGE G
Sbjct: 262 RSGVLSKCTKNLNHGVVLVGVQADGAWKIRNSWGSSWGEAG 302


>ref|XP_004039041.1| papain family cysteine protease, putative, partial
           [Ichthyophthirius multifiliis]
           gi|340508003|gb|EGR33817.1| papain family cysteine
           protease, putative, partial [Ichthyophthirius
           multifiliis]
          Length = 334

 Score =  161 bits (408), Expect = 5e-46
 Identities = 76/162 (46%), Positives = 100/162 (61%), Gaps = 4/162 (2%)
 Frame = -2

Query: 475 CGSCWAFSATGVLESWALQRGQNVD-LSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299
           CGSCWAFS TG LE     + + +   SEQQL+DCS   GN GCNGG    A  +V ++G
Sbjct: 153 CGSCWAFSTTGSLEGANYLQNKTLSAFSEQQLMDCSWLYGNLGCNGGLMPRAFKWVASHG 212

Query: 298 ITTESAYPYAAKDQ-SCKTQGGSFRINGYSSH--SGCNGLSSQINNSPVSVTVDATNWSP 128
           +TTE  YPY AK   SCK + G F+I+ Y       C+ L+  ++  P S+ VDA+NW  
Sbjct: 213 VTTEDKYPYEAKSHFSCKNKNGEFKISSYQEIPVGDCDALAQSVSQRPTSIAVDASNWQS 272

Query: 127 YRSGVFNNCASSINHAVLLVGVVGGNWKIKNSWGTGWGENGF 2
           Y SGVF++CA+ +NH VL VG     W +KNSW T WG+ G+
Sbjct: 273 YSSGVFDDCATRLNHGVLAVGYTSEYWIVKNSWNTSWGQQGY 314


>ref|XP_001020107.1| papain family cysteine protease [Tetrahymena thermophila SB210]
           gi|89301874|gb|EAR99862.1| papain family cysteine
           protease [Tetrahymena thermophila SB210]
          Length = 337

 Score =  161 bits (407), Expect = 7e-46
 Identities = 85/172 (49%), Positives = 105/172 (61%), Gaps = 14/172 (8%)
 Frame = -2

Query: 475 CGSCWAFSATGVLESWALQRGQNV-DLSEQQLVDCSRPQG---NQGCNGGWPSSALNYVK 308
           CGSCWAFSA G++ES+   + +N+ D SEQQLVDC        + GCNGGWP S L+Y  
Sbjct: 149 CGSCWAFSAAGLMESFNFIKHKNLTDFSEQQLVDCVNSANGYYSNGCNGGWPESCLDYSS 208

Query: 307 ANGITTESAYPYAAKDQSCKTQGGSFRINGYSSHS------GCNGLSSQINNSPVSVTVD 146
             GITT  +YPY    + C   G +   NG+   S          L + +NNSPVSV VD
Sbjct: 209 KFGITTLQSYPYVGVQKKCNITGAN---NGFKPKSWKQIPNTSKDLQNALNNSPVSVVVD 265

Query: 145 ATNWSPYRSGVFNNCASS---INHAVLLVGV-VGGNWKIKNSWGTGWGENGF 2
           A+ WS YRSGV+N C  +   +NHAVL VG    GNW +KNSWGTGWGE G+
Sbjct: 266 ASTWSHYRSGVYNGCDQTKIRLNHAVLAVGYDQFGNWIVKNSWGTGWGEQGY 317


>ref|XP_001013456.3| papain family cysteine protease [Tetrahymena thermophila SB210]
           gi|225565626|gb|EAR93211.3| papain family cysteine
           protease [Tetrahymena thermophila SB210]
          Length = 315

 Score =  160 bits (404), Expect = 1e-45
 Identities = 73/161 (45%), Positives = 106/161 (65%), Gaps = 3/161 (1%)
 Frame = -2

Query: 475 CGSCWAFSATGVLE-SWALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299
           CGSCW+FS TG +E +  L   +   LSEQ LVDCS+  GN+GCNGG   +A +++  +G
Sbjct: 136 CGSCWSFSTTGAVEGALFLSTKKLTSLSEQYLVDCSK-DGNEGCNGGLMDTAFDFISQHG 194

Query: 298 ITTESAYPYAAKDQSCKTQGGSFRINGYSSHSGCNGLSSQINNSPVSVTVDATNWSPYRS 119
           I TE+AYPY A D +CK   G ++I+ ++    CN L ++I   P+++ VDA N+  Y+ 
Sbjct: 195 IPTEAAYPYKAVDGTCKMTSGPYKISSHTDIQDCNDLLNKIQKQPIAIAVDANNFQYYQK 254

Query: 118 GVFNNCASSINHAVLLVG--VVGGNWKIKNSWGTGWGENGF 2
            +F++C + ++H VLLVG    G  WK+KNSWG  WGE+GF
Sbjct: 255 DIFSDCGTELDHGVLLVGYSASGKYWKVKNSWGPNWGESGF 295


>ref|XP_001446315.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
           gi|124413792|emb|CAK78918.1| unnamed protein product
           [Paramecium tetraurelia]
          Length = 317

 Score =  158 bits (399), Expect = 7e-45
 Identities = 85/164 (51%), Positives = 105/164 (64%), Gaps = 6/164 (3%)
 Frame = -2

Query: 475 CGSCWAFSATGVLESWA-LQRGQNVDLSEQQLVDCSRP-QGNQGCNGGWPSSALNYVKAN 302
           CGSCW FS TGVLES+  L  G+  DLSEQQL+DCS     N+GC+GG P+ ALNYVK N
Sbjct: 134 CGSCWTFSTTGVLESFFYLTTGELPDLSEQQLLDCSTVIDFNKGCDGGLPARALNYVKRN 193

Query: 301 GITTESAYPYAAKDQSCKTQGGSFRINGYS-SHSGCNGLSSQINNSPVSVTVDATNW--- 134
           GITT +AYPY A   +CK +GG++ I G          L + +N  PVSV VDATNW   
Sbjct: 194 GITTGAAYPYTAVQGTCKIKGGAYHIKGSQVLAKDEETLVAYLNKGPVSVGVDATNWQYY 253

Query: 133 SPYRSGVFNNCASSINHAVLLVGVVGGNWKIKNSWGTGWGENGF 2
           SP    VF++C + +NH VL VG     +K+KNSW T WG  G+
Sbjct: 254 SPKDEKVFSDCDTKMNHVVLAVGYDDKAFKLKNSWSTSWGVKGY 297


>ref|XP_001031724.1| papain family cysteine protease [Tetrahymena thermophila SB210]
           gi|89286057|gb|EAR84061.1| papain family cysteine
           protease [Tetrahymena thermophila SB210]
          Length = 325

 Score =  158 bits (399), Expect = 9e-45
 Identities = 79/168 (47%), Positives = 109/168 (64%), Gaps = 10/168 (5%)
 Frame = -2

Query: 475 CGSCWAFSATGVLESWALQRGQ---NVDLSEQQLVDCSRPQ-GNQGCNGGWPSSALNYVK 308
           CGSCW FSATG +ES  +  G+   +++LSEQQLVDC   +  N GCNGG    A  Y++
Sbjct: 138 CGSCWTFSATGAVESALIIAGKAERSINLSEQQLVDCCTAEYDNAGCNGGNKDQAFRYIE 197

Query: 307 ANGITTESAYPYAAKDQSCKTQGG----SFRINGYSS-HSGCNGLSSQINNSPVSVTVDA 143
           +N ITTE+ YPY A +Q C TQ      ++ I+ Y   ++  N L+  +   P++++VDA
Sbjct: 198 SNPITTEANYPYKAVNQKCNTQKAALTPNYTISNYKQVNASTNDLAEALKIQPIAISVDA 257

Query: 142 TNWSPYRSGVFNNCASSI-NHAVLLVGVVGGNWKIKNSWGTGWGENGF 2
           +NWS Y  G+F+NC ++  NHAVLLVG     W +KNSWGT WGENG+
Sbjct: 258 SNWSFYTGGIFSNCNNTTHNHAVLLVGFQNDAWIVKNSWGTTWGENGY 305


>ref|XP_001462117.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
           gi|124429955|emb|CAK94744.1| unnamed protein product
           [Paramecium tetraurelia]
          Length = 317

 Score =  157 bits (397), Expect = 1e-44
 Identities = 83/164 (50%), Positives = 102/164 (62%), Gaps = 6/164 (3%)
 Frame = -2

Query: 475 CGSCWAFSATGVLESWALQR-GQNVDLSEQQLVDCSRPQG-NQGCNGGWPSSALNYVKAN 302
           CGSCW F  TGVLE +     G+  +LSEQQL+DCS  Q  N GCNGG P+ AL YVK +
Sbjct: 134 CGSCWTFGTTGVLEGFFFTTTGELPNLSEQQLLDCSTFQDFNLGCNGGLPARALQYVKRS 193

Query: 301 GITTESAYPYAAKDQSCKTQGGSFRING-YSSHSGCNGLSSQINNSPVSVTVDATNWSPY 125
           GITT+ AY Y     SCK +GG++ I G  +       L S +N  PVSV VDA+NW  Y
Sbjct: 194 GITTQDAYEYKGVQGSCKIKGGAYHIKGSVALEPTEEALISYLNEGPVSVGVDASNWQYY 253

Query: 124 RSG---VFNNCASSINHAVLLVGVVGGNWKIKNSWGTGWGENGF 2
                 VF+ C  S+NHAVL VG    ++K+KNSWGT WG+ GF
Sbjct: 254 NPSDEKVFSTCEKSLNHAVLAVGYDKDSFKVKNSWGTAWGDKGF 297


>ref|XP_001427790.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
           gi|124394873|emb|CAK60392.1| unnamed protein product
           [Paramecium tetraurelia]
          Length = 317

 Score =  157 bits (397), Expect = 1e-44
 Identities = 84/164 (51%), Positives = 103/164 (62%), Gaps = 6/164 (3%)
 Frame = -2

Query: 475 CGSCWAFSATGVLESWALQR-GQNVDLSEQQLVDCSRPQG-NQGCNGGWPSSALNYVKAN 302
           CGSCW F  TGVLE +  +  G+  +LSEQQL+DCS  Q  N GCNGG P  AL YVK +
Sbjct: 134 CGSCWTFGTTGVLEGFFFKTTGELPNLSEQQLLDCSTFQDFNLGCNGGLPYRALQYVKRS 193

Query: 301 GITTESAYPYAAKDQSCKTQGGSFRING-YSSHSGCNGLSSQINNSPVSVTVDATNWSPY 125
           GITT++AYPY     SC+ +GG++RI G     +    L S +N  PVSV VDATNW  Y
Sbjct: 194 GITTQAAYPYKGVQGSCQIKGGAYRIKGAVQLEATEEALISYLNEGPVSVGVDATNWQYY 253

Query: 124 RSG---VFNNCASSINHAVLLVGVVGGNWKIKNSWGTGWGENGF 2
                 VF+ C SS+NHAVL VG    + KIKNSW   WG+ G+
Sbjct: 254 NPSDEKVFSTCDSSLNHAVLAVGYDKNSLKIKNSWSAQWGDRGY 297


>ref|XP_001008299.1| papain family cysteine protease [Tetrahymena thermophila SB210]
           gi|89290066|gb|EAR88054.1| papain family cysteine
           protease [Tetrahymena thermophila SB210]
          Length = 332

 Score =  157 bits (398), Expect = 1e-44
 Identities = 80/163 (49%), Positives = 100/163 (61%), Gaps = 5/163 (3%)
 Frame = -2

Query: 475 CGSCWAFSATGVLESWA-LQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299
           CGSCWAFSATG LES   +  G    LSEQ+LVDCS   GN+GC+GG   +A  ++  N 
Sbjct: 146 CGSCWAFSATGALESATFISTGTLPSLSEQELVDCSTSYGNEGCDGGDMDAAFKFIHDNN 205

Query: 298 ITTESAYPYAAKDQSCK-TQ-GGSFRINGYSSHSGCNGLSSQINNSPVSVTVDATNWSPY 125
           I TE  Y Y   DQ CK TQ   ++ ++ +     C+ L + I   PVSV VDATNW  Y
Sbjct: 206 IATEKEYTYRGFDQKCKGTQYPTTYGLSSFVDVQSCDELVAAIQQQPVSVAVDATNWQYY 265

Query: 124 RSGVFNNCASSINHAVLLVGVVG--GNWKIKNSWGTGWGENGF 2
             G FN+C  ++NH VLLVG       WK+KNSWGT WGE+G+
Sbjct: 266 EFGTFNDCFDNLNHGVLLVGYNSKTHQWKVKNSWGTSWGEDGY 308


>ref|XP_001020099.1| papain family cysteine protease [Tetrahymena thermophila SB210]
           gi|89301866|gb|EAR99854.1| papain family cysteine
           protease [Tetrahymena thermophila SB210]
          Length = 332

 Score =  157 bits (397), Expect = 2e-44
 Identities = 83/172 (48%), Positives = 105/172 (61%), Gaps = 14/172 (8%)
 Frame = -2

Query: 475 CGSCWAFSATGVLESWALQRGQNV-DLSEQQLVDC---SRPQGNQGCNGGWPSSALNYVK 308
           CGSCW FSA G++ES+   + +N+ + SEQQLVDC   +   G+ GCNGGWP+S L+Y  
Sbjct: 144 CGSCWTFSAAGLMESFNFIKNKNLTNFSEQQLVDCVNSANGYGSNGCNGGWPASCLDYSS 203

Query: 307 ANGITTESAYPYAAKDQSCKTQGGSFRINGYSSHS------GCNGLSSQINNSPVSVTVD 146
             GITT   YPY    + C   G +   NG+   S          L + +N SPVSV VD
Sbjct: 204 KFGITTLQNYPYVGVQKKCNITGTN---NGFKPKSWKQIPNTSKDLQNALNFSPVSVVVD 260

Query: 145 ATNWSPYRSGVFNNCASS---INHAVLLVGVVG-GNWKIKNSWGTGWGENGF 2
           A+ WS YRSGV+N C  +   +NHAVL VG    GNW +KNSWGTGWGE G+
Sbjct: 261 ASTWSHYRSGVYNGCNQTKIQLNHAVLAVGYDSVGNWIVKNSWGTGWGEQGY 312

