BLASTX nr result
ID: Rehmannia28_contig00051975
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia28_contig00051975 (476 letters) Database: ./nr 84,704,028 sequences; 31,038,470,784 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_012653036.1| papain family cysteine protease [Tetrahymena... 183 9e-55 ref|XP_001026313.1| papain family cysteine protease [Tetrahymena... 182 5e-54 ref|XP_001460368.1| hypothetical protein [Paramecium tetraurelia... 169 2e-49 ref|XP_001440132.1| hypothetical protein [Paramecium tetraurelia... 169 2e-49 sp|A0E358.2|CATL2_PARTE RecName: Full=Cathepsin L 2; Flags: Prec... 165 1e-47 ref|XP_001457122.1| hypothetical protein [Paramecium tetraurelia... 165 2e-47 ref|XP_001013459.1| papain family cysteine protease [Tetrahymena... 164 3e-47 ref|XP_001457867.1| hypothetical protein [Paramecium tetraurelia... 164 3e-47 sp|Q94714.1|CATL1_PARTE RecName: Full=Cathepsin L 1; Flags: Prec... 162 1e-46 ref|XP_001438701.1| hypothetical protein [Paramecium tetraurelia... 162 1e-46 ref|XP_001454745.1| hypothetical protein [Paramecium tetraurelia... 162 1e-46 ref|XP_004039041.1| papain family cysteine protease, putative, p... 161 5e-46 ref|XP_001020107.1| papain family cysteine protease [Tetrahymena... 161 7e-46 ref|XP_001013456.3| papain family cysteine protease [Tetrahymena... 160 1e-45 ref|XP_001446315.1| hypothetical protein [Paramecium tetraurelia... 158 7e-45 ref|XP_001031724.1| papain family cysteine protease [Tetrahymena... 158 9e-45 ref|XP_001462117.1| hypothetical protein [Paramecium tetraurelia... 157 1e-44 ref|XP_001427790.1| hypothetical protein [Paramecium tetraurelia... 157 1e-44 ref|XP_001008299.1| papain family cysteine protease [Tetrahymena... 157 1e-44 ref|XP_001020099.1| papain family cysteine protease [Tetrahymena... 157 2e-44 >ref|XP_012653036.1| papain family cysteine protease [Tetrahymena thermophila SB210] gi|586738495|gb|EWS74459.1| papain family cysteine protease [Tetrahymena thermophila SB210] Length = 319 Score = 183 bits (465), Expect = 9e-55 Identities = 86/165 (52%), Positives = 110/165 (66%), Gaps = 7/165 (4%) Frame = -2 Query: 475 CGSCWAFSATGVLESWALQRGQ---NVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKA 305 CGSCWAFS TG LES + GQ ++LSEQQLVDCS GNQGCNGG +A Y+KA Sbjct: 135 CGSCWAFSTTGALESALIVAGQATNTINLSEQQLVDCSTSYGNQGCNGGLMDNAFKYIKA 194 Query: 304 NGITTESAYPYAAKDQSCKT---QGGSFRINGYSSHSGC-NGLSSQINNSPVSVTVDATN 137 N +TTES YPY KD C + + + + G++ + + L + I PV++ VDA+ Sbjct: 195 NQLTTESNYPYTGKDGKCNSAAIKAPLYSLKGFTDVAKTTSALQAAIQKQPVAIAVDASK 254 Query: 136 WSPYRSGVFNNCASSINHAVLLVGVVGGNWKIKNSWGTGWGENGF 2 WS Y GVF+NCA+ +NH VLLVG+V GNW +KNSWG WGENG+ Sbjct: 255 WSYYTGGVFSNCATQLNHGVLLVGIVNGNWLVKNSWGASWGENGY 299 >ref|XP_001026313.1| papain family cysteine protease [Tetrahymena thermophila SB210] gi|89308080|gb|EAS06068.1| papain family cysteine protease [Tetrahymena thermophila SB210] Length = 328 Score = 182 bits (461), Expect = 5e-54 Identities = 88/161 (54%), Positives = 106/161 (65%), Gaps = 3/161 (1%) Frame = -2 Query: 475 CGSCWAFSATGVLE-SWALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299 CGSCWAFS TG LE S+ L+ Q + SEQQLVDCSR N GCNGG A YVKA+G Sbjct: 148 CGSCWAFSTTGALEGSYFLKNNQLISFSEQQLVDCSRLYLNMGCNGGLMPRAFRYVKAHG 207 Query: 298 ITTESAYPYAAKDQSCKTQGGSFRINGYSS--HSGCNGLSSQINNSPVSVTVDATNWSPY 125 ITTE YPY AKD C+T+ G ++I +S+ C+ L++ I PVSV VDATN+ Y Sbjct: 208 ITTEEEYPYTAKDGKCQTKQGQYKIKSFSTVPRGNCDKLAAAIAQQPVSVGVDATNFKFY 267 Query: 124 RSGVFNNCASSINHAVLLVGVVGGNWKIKNSWGTGWGENGF 2 SGVF+NC +NH VL G W IKNSWGT WG+NG+ Sbjct: 268 TSGVFDNCKKKLNHGVLATGYTADYWIIKNSWGTAWGQNGY 308 >ref|XP_001460368.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124428198|emb|CAK92971.1| unnamed protein product [Paramecium tetraurelia] Length = 309 Score = 169 bits (429), Expect = 2e-49 Identities = 81/160 (50%), Positives = 112/160 (70%), Gaps = 2/160 (1%) Frame = -2 Query: 475 CGSCWAFSATGVLESW-ALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299 CGSCWAFSA G +E++ +++G + +LSEQQLVDC + + GC+GG+P A+ Y+ ANG Sbjct: 134 CGSCWAFSAVGAVEAFFKIKKGADHNLSEQQLVDCDK--ASNGCDGGYPDKAIKYIAANG 191 Query: 298 ITTESAYPYAAKDQSCKTQGGSFRINGYSSHSGCNGLSSQINNSPVSVTVDATNWSPYRS 119 T++AY Y +CK+ GS + +G S+ + +GL + I + P+SV VDA+NWS Y+S Sbjct: 192 SQTQAAYQYTGVKGTCKSATGSVKNSGVSTIAK-SGLQAAIKDYPISVCVDASNWSNYKS 250 Query: 118 GVFNNCASSINHAVLLVGV-VGGNWKIKNSWGTGWGENGF 2 GVFNNC ++NHAV+ VG GNW IKNSW T WGE GF Sbjct: 251 GVFNNCNKNLNHAVMAVGYDASGNWIIKNSWATSWGEKGF 290 >ref|XP_001440132.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124407338|emb|CAK72735.1| unnamed protein product [Paramecium tetraurelia] Length = 321 Score = 169 bits (429), Expect = 2e-49 Identities = 82/161 (50%), Positives = 105/161 (65%), Gaps = 3/161 (1%) Frame = -2 Query: 475 CGSCWAFSATGVLE-SWALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299 CGSCWAFSA G LE + +Q + VDLSEQ LVDC+ P GN GC+GGW SAL+Y+ +G Sbjct: 140 CGSCWAFSAVGALEINTKIQFNEIVDLSEQDLVDCAGPYGNAGCDGGWMESALDYIIDSG 199 Query: 298 ITTESAYPYAAKDQSCKTQGGSF-RINGYSSHSGCNGLSSQINNSPVSVTVDATNWSPYR 122 I YPY +D CK+ +F R+ GY GC +S+ + VSV VDATNW Y Sbjct: 200 IAETKVYPYKGEDGICKSVERNFRRVIGYVDLDGCQDISNALIQQSVSVGVDATNWRFYS 259 Query: 121 SGVFNNCASSINHAVLLVGV-VGGNWKIKNSWGTGWGENGF 2 SGVF++C +NH V+LVG+ G WK++NSWG WGE G+ Sbjct: 260 SGVFSDCKKYLNHGVVLVGINKNGVWKVRNSWGQDWGEQGY 300 >sp|A0E358.2|CATL2_PARTE RecName: Full=Cathepsin L 2; Flags: Precursor Length = 314 Score = 165 bits (417), Expect = 1e-47 Identities = 80/161 (49%), Positives = 100/161 (62%), Gaps = 4/161 (2%) Frame = -2 Query: 475 CGSCWAFSATGVLE-SWALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299 CGSCWAFSA G LE + ++ + +LSEQ LVDCS P N+GCNGGW SA YV NG Sbjct: 132 CGSCWAFSAVGALEINTDIELNKKYELSEQDLVDCSGPYDNEGCNGGWMDSAFEYVADNG 191 Query: 298 ITTESAYPYAAKDQSCKT--QGGSFRINGYSSHSGCNGLSSQINNSPVSVTVDATNWSPY 125 + YPY AKD +CKT + + G++ C+ L+ I VSV VDA W Y Sbjct: 192 LAEAKDYPYTAKDGTCKTSVKRPYTHVQGFTDIDSCDELAQAIQERTVSVAVDANPWQFY 251 Query: 124 RSGVFNNCASSINHAVLLVGV-VGGNWKIKNSWGTGWGENG 5 RSGV + C ++NH V+LVGV G WKI+NSWG+ WGE G Sbjct: 252 RSGVLSKCTKNLNHGVVLVGVQADGAWKIRNSWGSSWGEAG 292 >ref|XP_001457122.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124424937|emb|CAK89725.1| unnamed protein product [Paramecium tetraurelia] Length = 324 Score = 165 bits (417), Expect = 2e-47 Identities = 80/161 (49%), Positives = 100/161 (62%), Gaps = 4/161 (2%) Frame = -2 Query: 475 CGSCWAFSATGVLE-SWALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299 CGSCWAFSA G LE + ++ + +LSEQ LVDCS P N+GCNGGW SA YV NG Sbjct: 142 CGSCWAFSAVGALEINTDIELNKKYELSEQDLVDCSGPYDNEGCNGGWMDSAFEYVADNG 201 Query: 298 ITTESAYPYAAKDQSCKT--QGGSFRINGYSSHSGCNGLSSQINNSPVSVTVDATNWSPY 125 + YPY AKD +CKT + + G++ C+ L+ I VSV VDA W Y Sbjct: 202 LAEAKDYPYTAKDGTCKTSVKRPYTHVQGFTDIDSCDELAQAIQERTVSVAVDANPWQFY 261 Query: 124 RSGVFNNCASSINHAVLLVGV-VGGNWKIKNSWGTGWGENG 5 RSGV + C ++NH V+LVGV G WKI+NSWG+ WGE G Sbjct: 262 RSGVLSKCTKNLNHGVVLVGVQADGAWKIRNSWGSSWGEAG 302 >ref|XP_001013459.1| papain family cysteine protease [Tetrahymena thermophila SB210] gi|89295226|gb|EAR93214.1| papain family cysteine protease [Tetrahymena thermophila SB210] Length = 320 Score = 164 bits (415), Expect = 3e-47 Identities = 74/161 (45%), Positives = 107/161 (66%), Gaps = 3/161 (1%) Frame = -2 Query: 475 CGSCWAFSATGVLE-SWALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299 CGSCW+FS TG +E + L + V LSEQ L+DCS+ GN+GCNGG +A +++ NG Sbjct: 141 CGSCWSFSTTGAVEGAHFLSSNELVSLSEQYLIDCSK-NGNEGCNGGLMDTAFDFIAQNG 199 Query: 298 ITTESAYPYAAKDQSCKTQGGSFRINGYSSHSGCNGLSSQINNSPVSVTVDATNWSPYRS 119 I TE+AYPY A D +CK G ++I+ Y + CN L S++ P+++ VDA N+ Y Sbjct: 200 IPTENAYPYKALDGTCKMTTGPYKISSYQNIISCNDLLSKLQKQPIAIAVDANNFQFYTK 259 Query: 118 GVFNNCASSINHAVLLVGVVGGN--WKIKNSWGTGWGENGF 2 G+F+ C +++H VLLVG + WK+KNSWG+ WGE+G+ Sbjct: 260 GIFSKCGKNLDHGVLLVGYSSKDKFWKVKNSWGSSWGEDGY 300 >ref|XP_001457867.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124425685|emb|CAK90470.1| unnamed protein product [Paramecium tetraurelia] Length = 324 Score = 164 bits (415), Expect = 3e-47 Identities = 76/162 (46%), Positives = 105/162 (64%), Gaps = 4/162 (2%) Frame = -2 Query: 475 CGSCWAFSATGVLE-SWALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299 CGS WAFSA GVLE + ++ G LSEQ ++DCS P GNQGC+GGW S YV+ +G Sbjct: 140 CGSSWAFSAVGVLEINSNIEFGLETTLSEQDMLDCSGPYGNQGCSGGWMDSGFEYVRDHG 199 Query: 298 ITTESAYPYAAKDQSCKTQ-GGSFR-INGYSSHSGCNGLSSQINNSPVSVTVDATNWSPY 125 I S YPY DQ+C+T F+ + G+ GCNGL + I + +S+ VDA+NW+ Y Sbjct: 200 IANGSVYPYVGSDQTCRTSVKRDFKYVTGFVDVDGCNGLQTAIQDQALSIGVDASNWAYY 259 Query: 124 RSGVFNNCASSINHAVLLVGV-VGGNWKIKNSWGTGWGENGF 2 + G+FNNC ++ +LVGV G WK+++ WG+ WGENG+ Sbjct: 260 KGGIFNNCKQNLTSGSILVGVDQNGVWKVRHQWGSKWGENGY 301 >sp|Q94714.1|CATL1_PARTE RecName: Full=Cathepsin L 1; Flags: Precursor gi|1403087|emb|CAA62869.1| cathepsin L [Paramecium tetraurelia] Length = 314 Score = 162 bits (411), Expect = 1e-46 Identities = 79/161 (49%), Positives = 98/161 (60%), Gaps = 4/161 (2%) Frame = -2 Query: 475 CGSCWAFSATGVLE-SWALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299 CGSCWAFSA G LE + ++ + +LSEQ LVDCS P N GCNGGW SA YV NG Sbjct: 132 CGSCWAFSAVGALEINTDIELNRKYELSEQDLVDCSGPYDNDGCNGGWMDSAFEYVADNG 191 Query: 298 ITTESAYPYAAKDQSCKT--QGGSFRINGYSSHSGCNGLSSQINNSPVSVTVDATNWSPY 125 + YPY AKD +CKT + + G+ C+ L+ I V+V VDA W Y Sbjct: 192 LAEAKDYPYTAKDGTCKTSVKRPYTHVQGFKDIDSCDELAQTIQERTVAVAVDANPWQFY 251 Query: 124 RSGVFNNCASSINHAVLLVGV-VGGNWKIKNSWGTGWGENG 5 RSGV + C ++NH V+LVGV G WKI+NSWG+ WGE G Sbjct: 252 RSGVLSKCTKNLNHGVVLVGVQADGAWKIRNSWGSSWGEAG 292 >ref|XP_001438701.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124405873|emb|CAK71304.1| unnamed protein product [Paramecium tetraurelia] Length = 320 Score = 162 bits (411), Expect = 1e-46 Identities = 85/169 (50%), Positives = 110/169 (65%), Gaps = 11/169 (6%) Frame = -2 Query: 475 CGSCWAFSATGVLESW-ALQRGQNVDLSEQQLVDCSR--PQGNQGCNGGWPSSALNYVKA 305 CGSCWAFS TGVLE W + G+ +LSEQQLVDCS P NQGCNGG PS ALNYVK Sbjct: 133 CGSCWAFSTTGVLEGWFQINTGKLPNLSEQQLVDCSTFIPDLNQGCNGGMPSRALNYVKR 192 Query: 304 NGITTESAYPYAAKDQSCKTQGGSFRINGYSSHSGCNGLSSQ--INNSPVSVTVDATNWS 131 NG+TT+ AYPY A DQ+CK +GG ++++G S+ N + Q + + PVSV V A++W Sbjct: 193 NGLTTQDAYPYQAVDQACKIKGGEYKVSG-STAIAANEAAHQAALQSGPVSVAVKASDWK 251 Query: 130 PYR----SGVF--NNCASSINHAVLLVGVVGGNWKIKNSWGTGWGENGF 2 Y+ + +F + C +NHAVL VG +KNSW T WG +G+ Sbjct: 252 NYKPKGDNYIFPDSECTGDVNHAVLAVGFTSEALIVKNSWNTVWGVDGY 300 >ref|XP_001454745.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124422522|emb|CAK87348.1| unnamed protein product [Paramecium tetraurelia] Length = 324 Score = 162 bits (411), Expect = 1e-46 Identities = 79/161 (49%), Positives = 98/161 (60%), Gaps = 4/161 (2%) Frame = -2 Query: 475 CGSCWAFSATGVLE-SWALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299 CGSCWAFSA G LE + ++ + +LSEQ LVDCS P N GCNGGW SA YV NG Sbjct: 142 CGSCWAFSAVGALEINTDIELNRKYELSEQDLVDCSGPYDNDGCNGGWMDSAFEYVADNG 201 Query: 298 ITTESAYPYAAKDQSCKT--QGGSFRINGYSSHSGCNGLSSQINNSPVSVTVDATNWSPY 125 + YPY AKD +CKT + + G+ C+ L+ I V+V VDA W Y Sbjct: 202 LAEAKDYPYTAKDGTCKTSVKRPYTHVQGFKDIDSCDELAQTIQERTVAVAVDANPWQFY 261 Query: 124 RSGVFNNCASSINHAVLLVGV-VGGNWKIKNSWGTGWGENG 5 RSGV + C ++NH V+LVGV G WKI+NSWG+ WGE G Sbjct: 262 RSGVLSKCTKNLNHGVVLVGVQADGAWKIRNSWGSSWGEAG 302 >ref|XP_004039041.1| papain family cysteine protease, putative, partial [Ichthyophthirius multifiliis] gi|340508003|gb|EGR33817.1| papain family cysteine protease, putative, partial [Ichthyophthirius multifiliis] Length = 334 Score = 161 bits (408), Expect = 5e-46 Identities = 76/162 (46%), Positives = 100/162 (61%), Gaps = 4/162 (2%) Frame = -2 Query: 475 CGSCWAFSATGVLESWALQRGQNVD-LSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299 CGSCWAFS TG LE + + + SEQQL+DCS GN GCNGG A +V ++G Sbjct: 153 CGSCWAFSTTGSLEGANYLQNKTLSAFSEQQLMDCSWLYGNLGCNGGLMPRAFKWVASHG 212 Query: 298 ITTESAYPYAAKDQ-SCKTQGGSFRINGYSSH--SGCNGLSSQINNSPVSVTVDATNWSP 128 +TTE YPY AK SCK + G F+I+ Y C+ L+ ++ P S+ VDA+NW Sbjct: 213 VTTEDKYPYEAKSHFSCKNKNGEFKISSYQEIPVGDCDALAQSVSQRPTSIAVDASNWQS 272 Query: 127 YRSGVFNNCASSINHAVLLVGVVGGNWKIKNSWGTGWGENGF 2 Y SGVF++CA+ +NH VL VG W +KNSW T WG+ G+ Sbjct: 273 YSSGVFDDCATRLNHGVLAVGYTSEYWIVKNSWNTSWGQQGY 314 >ref|XP_001020107.1| papain family cysteine protease [Tetrahymena thermophila SB210] gi|89301874|gb|EAR99862.1| papain family cysteine protease [Tetrahymena thermophila SB210] Length = 337 Score = 161 bits (407), Expect = 7e-46 Identities = 85/172 (49%), Positives = 105/172 (61%), Gaps = 14/172 (8%) Frame = -2 Query: 475 CGSCWAFSATGVLESWALQRGQNV-DLSEQQLVDCSRPQG---NQGCNGGWPSSALNYVK 308 CGSCWAFSA G++ES+ + +N+ D SEQQLVDC + GCNGGWP S L+Y Sbjct: 149 CGSCWAFSAAGLMESFNFIKHKNLTDFSEQQLVDCVNSANGYYSNGCNGGWPESCLDYSS 208 Query: 307 ANGITTESAYPYAAKDQSCKTQGGSFRINGYSSHS------GCNGLSSQINNSPVSVTVD 146 GITT +YPY + C G + NG+ S L + +NNSPVSV VD Sbjct: 209 KFGITTLQSYPYVGVQKKCNITGAN---NGFKPKSWKQIPNTSKDLQNALNNSPVSVVVD 265 Query: 145 ATNWSPYRSGVFNNCASS---INHAVLLVGV-VGGNWKIKNSWGTGWGENGF 2 A+ WS YRSGV+N C + +NHAVL VG GNW +KNSWGTGWGE G+ Sbjct: 266 ASTWSHYRSGVYNGCDQTKIRLNHAVLAVGYDQFGNWIVKNSWGTGWGEQGY 317 >ref|XP_001013456.3| papain family cysteine protease [Tetrahymena thermophila SB210] gi|225565626|gb|EAR93211.3| papain family cysteine protease [Tetrahymena thermophila SB210] Length = 315 Score = 160 bits (404), Expect = 1e-45 Identities = 73/161 (45%), Positives = 106/161 (65%), Gaps = 3/161 (1%) Frame = -2 Query: 475 CGSCWAFSATGVLE-SWALQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299 CGSCW+FS TG +E + L + LSEQ LVDCS+ GN+GCNGG +A +++ +G Sbjct: 136 CGSCWSFSTTGAVEGALFLSTKKLTSLSEQYLVDCSK-DGNEGCNGGLMDTAFDFISQHG 194 Query: 298 ITTESAYPYAAKDQSCKTQGGSFRINGYSSHSGCNGLSSQINNSPVSVTVDATNWSPYRS 119 I TE+AYPY A D +CK G ++I+ ++ CN L ++I P+++ VDA N+ Y+ Sbjct: 195 IPTEAAYPYKAVDGTCKMTSGPYKISSHTDIQDCNDLLNKIQKQPIAIAVDANNFQYYQK 254 Query: 118 GVFNNCASSINHAVLLVG--VVGGNWKIKNSWGTGWGENGF 2 +F++C + ++H VLLVG G WK+KNSWG WGE+GF Sbjct: 255 DIFSDCGTELDHGVLLVGYSASGKYWKVKNSWGPNWGESGF 295 >ref|XP_001446315.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124413792|emb|CAK78918.1| unnamed protein product [Paramecium tetraurelia] Length = 317 Score = 158 bits (399), Expect = 7e-45 Identities = 85/164 (51%), Positives = 105/164 (64%), Gaps = 6/164 (3%) Frame = -2 Query: 475 CGSCWAFSATGVLESWA-LQRGQNVDLSEQQLVDCSRP-QGNQGCNGGWPSSALNYVKAN 302 CGSCW FS TGVLES+ L G+ DLSEQQL+DCS N+GC+GG P+ ALNYVK N Sbjct: 134 CGSCWTFSTTGVLESFFYLTTGELPDLSEQQLLDCSTVIDFNKGCDGGLPARALNYVKRN 193 Query: 301 GITTESAYPYAAKDQSCKTQGGSFRINGYS-SHSGCNGLSSQINNSPVSVTVDATNW--- 134 GITT +AYPY A +CK +GG++ I G L + +N PVSV VDATNW Sbjct: 194 GITTGAAYPYTAVQGTCKIKGGAYHIKGSQVLAKDEETLVAYLNKGPVSVGVDATNWQYY 253 Query: 133 SPYRSGVFNNCASSINHAVLLVGVVGGNWKIKNSWGTGWGENGF 2 SP VF++C + +NH VL VG +K+KNSW T WG G+ Sbjct: 254 SPKDEKVFSDCDTKMNHVVLAVGYDDKAFKLKNSWSTSWGVKGY 297 >ref|XP_001031724.1| papain family cysteine protease [Tetrahymena thermophila SB210] gi|89286057|gb|EAR84061.1| papain family cysteine protease [Tetrahymena thermophila SB210] Length = 325 Score = 158 bits (399), Expect = 9e-45 Identities = 79/168 (47%), Positives = 109/168 (64%), Gaps = 10/168 (5%) Frame = -2 Query: 475 CGSCWAFSATGVLESWALQRGQ---NVDLSEQQLVDCSRPQ-GNQGCNGGWPSSALNYVK 308 CGSCW FSATG +ES + G+ +++LSEQQLVDC + N GCNGG A Y++ Sbjct: 138 CGSCWTFSATGAVESALIIAGKAERSINLSEQQLVDCCTAEYDNAGCNGGNKDQAFRYIE 197 Query: 307 ANGITTESAYPYAAKDQSCKTQGG----SFRINGYSS-HSGCNGLSSQINNSPVSVTVDA 143 +N ITTE+ YPY A +Q C TQ ++ I+ Y ++ N L+ + P++++VDA Sbjct: 198 SNPITTEANYPYKAVNQKCNTQKAALTPNYTISNYKQVNASTNDLAEALKIQPIAISVDA 257 Query: 142 TNWSPYRSGVFNNCASSI-NHAVLLVGVVGGNWKIKNSWGTGWGENGF 2 +NWS Y G+F+NC ++ NHAVLLVG W +KNSWGT WGENG+ Sbjct: 258 SNWSFYTGGIFSNCNNTTHNHAVLLVGFQNDAWIVKNSWGTTWGENGY 305 >ref|XP_001462117.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124429955|emb|CAK94744.1| unnamed protein product [Paramecium tetraurelia] Length = 317 Score = 157 bits (397), Expect = 1e-44 Identities = 83/164 (50%), Positives = 102/164 (62%), Gaps = 6/164 (3%) Frame = -2 Query: 475 CGSCWAFSATGVLESWALQR-GQNVDLSEQQLVDCSRPQG-NQGCNGGWPSSALNYVKAN 302 CGSCW F TGVLE + G+ +LSEQQL+DCS Q N GCNGG P+ AL YVK + Sbjct: 134 CGSCWTFGTTGVLEGFFFTTTGELPNLSEQQLLDCSTFQDFNLGCNGGLPARALQYVKRS 193 Query: 301 GITTESAYPYAAKDQSCKTQGGSFRING-YSSHSGCNGLSSQINNSPVSVTVDATNWSPY 125 GITT+ AY Y SCK +GG++ I G + L S +N PVSV VDA+NW Y Sbjct: 194 GITTQDAYEYKGVQGSCKIKGGAYHIKGSVALEPTEEALISYLNEGPVSVGVDASNWQYY 253 Query: 124 RSG---VFNNCASSINHAVLLVGVVGGNWKIKNSWGTGWGENGF 2 VF+ C S+NHAVL VG ++K+KNSWGT WG+ GF Sbjct: 254 NPSDEKVFSTCEKSLNHAVLAVGYDKDSFKVKNSWGTAWGDKGF 297 >ref|XP_001427790.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124394873|emb|CAK60392.1| unnamed protein product [Paramecium tetraurelia] Length = 317 Score = 157 bits (397), Expect = 1e-44 Identities = 84/164 (51%), Positives = 103/164 (62%), Gaps = 6/164 (3%) Frame = -2 Query: 475 CGSCWAFSATGVLESWALQR-GQNVDLSEQQLVDCSRPQG-NQGCNGGWPSSALNYVKAN 302 CGSCW F TGVLE + + G+ +LSEQQL+DCS Q N GCNGG P AL YVK + Sbjct: 134 CGSCWTFGTTGVLEGFFFKTTGELPNLSEQQLLDCSTFQDFNLGCNGGLPYRALQYVKRS 193 Query: 301 GITTESAYPYAAKDQSCKTQGGSFRING-YSSHSGCNGLSSQINNSPVSVTVDATNWSPY 125 GITT++AYPY SC+ +GG++RI G + L S +N PVSV VDATNW Y Sbjct: 194 GITTQAAYPYKGVQGSCQIKGGAYRIKGAVQLEATEEALISYLNEGPVSVGVDATNWQYY 253 Query: 124 RSG---VFNNCASSINHAVLLVGVVGGNWKIKNSWGTGWGENGF 2 VF+ C SS+NHAVL VG + KIKNSW WG+ G+ Sbjct: 254 NPSDEKVFSTCDSSLNHAVLAVGYDKNSLKIKNSWSAQWGDRGY 297 >ref|XP_001008299.1| papain family cysteine protease [Tetrahymena thermophila SB210] gi|89290066|gb|EAR88054.1| papain family cysteine protease [Tetrahymena thermophila SB210] Length = 332 Score = 157 bits (398), Expect = 1e-44 Identities = 80/163 (49%), Positives = 100/163 (61%), Gaps = 5/163 (3%) Frame = -2 Query: 475 CGSCWAFSATGVLESWA-LQRGQNVDLSEQQLVDCSRPQGNQGCNGGWPSSALNYVKANG 299 CGSCWAFSATG LES + G LSEQ+LVDCS GN+GC+GG +A ++ N Sbjct: 146 CGSCWAFSATGALESATFISTGTLPSLSEQELVDCSTSYGNEGCDGGDMDAAFKFIHDNN 205 Query: 298 ITTESAYPYAAKDQSCK-TQ-GGSFRINGYSSHSGCNGLSSQINNSPVSVTVDATNWSPY 125 I TE Y Y DQ CK TQ ++ ++ + C+ L + I PVSV VDATNW Y Sbjct: 206 IATEKEYTYRGFDQKCKGTQYPTTYGLSSFVDVQSCDELVAAIQQQPVSVAVDATNWQYY 265 Query: 124 RSGVFNNCASSINHAVLLVGVVG--GNWKIKNSWGTGWGENGF 2 G FN+C ++NH VLLVG WK+KNSWGT WGE+G+ Sbjct: 266 EFGTFNDCFDNLNHGVLLVGYNSKTHQWKVKNSWGTSWGEDGY 308 >ref|XP_001020099.1| papain family cysteine protease [Tetrahymena thermophila SB210] gi|89301866|gb|EAR99854.1| papain family cysteine protease [Tetrahymena thermophila SB210] Length = 332 Score = 157 bits (397), Expect = 2e-44 Identities = 83/172 (48%), Positives = 105/172 (61%), Gaps = 14/172 (8%) Frame = -2 Query: 475 CGSCWAFSATGVLESWALQRGQNV-DLSEQQLVDC---SRPQGNQGCNGGWPSSALNYVK 308 CGSCW FSA G++ES+ + +N+ + SEQQLVDC + G+ GCNGGWP+S L+Y Sbjct: 144 CGSCWTFSAAGLMESFNFIKNKNLTNFSEQQLVDCVNSANGYGSNGCNGGWPASCLDYSS 203 Query: 307 ANGITTESAYPYAAKDQSCKTQGGSFRINGYSSHS------GCNGLSSQINNSPVSVTVD 146 GITT YPY + C G + NG+ S L + +N SPVSV VD Sbjct: 204 KFGITTLQNYPYVGVQKKCNITGTN---NGFKPKSWKQIPNTSKDLQNALNFSPVSVVVD 260 Query: 145 ATNWSPYRSGVFNNCASS---INHAVLLVGVVG-GNWKIKNSWGTGWGENGF 2 A+ WS YRSGV+N C + +NHAVL VG GNW +KNSWGTGWGE G+ Sbjct: 261 ASTWSHYRSGVYNGCNQTKIQLNHAVLAVGYDSVGNWIVKNSWGTGWGEQGY 312