BLASTX nr result
ID: Papaver25_contig00021286
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Papaver25_contig00021286 (1181 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006291447.1| hypothetical protein CARUB_v10017584mg [Caps... 71 1e-09 ref|XP_001020178.1| Papain family cysteine protease containing p... 69 4e-09 emb|CBI99501.1| cysteine peptidase precursor [Bromelia hieronymi] 68 7e-09 ref|XP_007068163.1| PREDICTED: cathepsin S-like [Chelonia mydas] 68 9e-09 gb|EMP27944.1| Golgi phosphoprotein 3-like protein [Chelonia mydas] 68 9e-09 gb|EYU36591.1| hypothetical protein MIMGU_mgv1a009120mg [Mimulus... 67 1e-08 gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata] 67 1e-08 pdb|2PNS|A Chain A, 1.9 Angstrom Resolution Crystal Structure Of... 67 1e-08 gb|AGV15823.1| cysteine protease CP15 [Nicotiana tabacum] 67 1e-08 gb|AGI59309.1| procerain B, partial [Calotropis procera] 67 1e-08 gb|ABL85445.1| cathepsin L [Kudoa thyrsites] 67 2e-08 gb|ABL85443.1| cathepsin L [Kudoa thyrsites] 67 2e-08 gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla] 66 3e-08 emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. ... 66 3e-08 gb|ESN92362.1| hypothetical protein HELRODRAFT_157956 [Helobdell... 66 3e-08 ref|XP_004966191.1| PREDICTED: oryzain alpha chain-like [Setaria... 66 3e-08 ref|XP_004963863.1| PREDICTED: cysteine proteinase EP-B 2-like [... 66 3e-08 ref|XP_004293931.1| PREDICTED: KDEL-tailed cysteine endopeptidas... 66 3e-08 sp|P83654.1|ERVC_TABDI RecName: Full=Ervatamin-C; Short=ERV-C gi... 66 3e-08 ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula] gi... 66 3e-08 >ref|XP_006291447.1| hypothetical protein CARUB_v10017584mg [Capsella rubella] gi|482560154|gb|EOA24345.1| hypothetical protein CARUB_v10017584mg [Capsella rubella] Length = 340 Score = 70.9 bits (172), Expect = 1e-09 Identities = 31/89 (34%), Positives = 53/89 (59%) Frame = +3 Query: 879 GESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQW 1058 GES +WR++G + S++NQG CG C+ + ++ + I+ + V+LS Q+++D Sbjct: 126 GESMDWRQEGAVTSVKNQGHCGCCWAFSTVAAVEGITKISKGEL--VSLSVQQLVDCNTE 183 Query: 1059 THGCKGGNTALAFLYAIFKPGLSRIEDIP 1145 ++GC GGNT AF Y I G++ + P Sbjct: 184 SYGCGGGNTVNAFNYIIKNQGITTEDSYP 212 >ref|XP_001020178.1| Papain family cysteine protease containing protein [Tetrahymena thermophila] gi|89301945|gb|EAR99933.1| papain family cysteine protease [Tetrahymena thermophila SB210] Length = 339 Score = 68.9 bits (167), Expect = 4e-09 Identities = 33/91 (36%), Positives = 54/91 (59%), Gaps = 2/91 (2%) Frame = +3 Query: 891 NWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQW--TH 1064 +WRE+GV+ +++NQGECG+C+ +I + F++ T + LS Q++ID + H Sbjct: 128 DWREKGVVTAVKNQGECGSCWAFAAVGAIESHFSLKTGK-SPIQLSEQQLIDCARQFDNH 186 Query: 1065 GCKGGNTALAFLYAIFKPGLSRIEDIPI*GK 1157 GC GG + AF Y ++ G+ +D P GK Sbjct: 187 GCDGGLPSKAFEYIAYEGGIENSKDYPYTGK 217 >emb|CBI99501.1| cysteine peptidase precursor [Bromelia hieronymi] Length = 230 Score = 68.2 bits (165), Expect = 7e-09 Identities = 32/91 (35%), Positives = 54/91 (59%) Frame = +3 Query: 882 ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWT 1061 +S +WR+ G + S++NQG CG+C+ + ++ ++ I T N+ V+LS QE++D C + Sbjct: 4 QSIDWRDYGAVTSVKNQGRCGSCWSFSAIATVEGIYKIKTGNL--VSLSEQEVLD-CAVS 60 Query: 1062 HGCKGGNTALAFLYAIFKPGLSRIEDIPI*G 1154 HGCKGG A+ + I G++ P G Sbjct: 61 HGCKGGWVDKAYNFIISNNGVTSAAYYPYKG 91 >ref|XP_007068163.1| PREDICTED: cathepsin S-like [Chelonia mydas] Length = 326 Score = 67.8 bits (164), Expect = 9e-09 Identities = 34/99 (34%), Positives = 52/99 (52%), Gaps = 6/99 (6%) Frame = +3 Query: 867 KLKPG----ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQ 1034 K +PG +S +WR++G + ++NQG CGAC+ + ++ A + T N+ V+LS Q Sbjct: 110 KPRPGSQVPDSMDWRDKGCVTDVKNQGPCGACWAFSAVGALEAQVKLKTGNL--VSLSAQ 167 Query: 1035 EIID--RCQWTHGCKGGNTALAFLYAIFKPGLSRIEDIP 1145 ++D W HGC GG AF Y I G+ P Sbjct: 168 NLVDCSTMYWNHGCSGGFMTYAFQYIIDNDGIDSDASYP 206 >gb|EMP27944.1| Golgi phosphoprotein 3-like protein [Chelonia mydas] Length = 1089 Score = 67.8 bits (164), Expect = 9e-09 Identities = 34/99 (34%), Positives = 52/99 (52%), Gaps = 6/99 (6%) Frame = +3 Query: 867 KLKPG----ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQ 1034 K +PG +S +WR++G + ++NQG CGAC+ + ++ A + T N+ V+LS Q Sbjct: 151 KPRPGSQVPDSMDWRDKGCVTDVKNQGPCGACWAFSAVGALEAQVKLKTGNL--VSLSAQ 208 Query: 1035 EIID--RCQWTHGCKGGNTALAFLYAIFKPGLSRIEDIP 1145 ++D W HGC GG AF Y I G+ P Sbjct: 209 NLVDCSTMYWNHGCSGGFMTYAFQYIIDNDGIDSDASYP 247 Score = 57.8 bits (138), Expect = 9e-06 Identities = 29/95 (30%), Positives = 47/95 (49%), Gaps = 3/95 (3%) Frame = +3 Query: 882 ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQW- 1058 + +WR+ G + +++NQG CG+C+ + ++ T R V+LS Q ++D C W Sbjct: 413 DCIDWRKSGYVTNVKNQGSCGSCWAFSAVGALEGQLKKKTG--RLVSLSPQNLVD-CSWR 469 Query: 1059 --THGCKGGNTALAFLYAIFKPGLSRIEDIPI*GK 1157 HGC GG AF Y + G+ P G+ Sbjct: 470 YGNHGCNGGFMTKAFRYVMNNSGIDSETSYPYEGQ 504 >gb|EYU36591.1| hypothetical protein MIMGU_mgv1a009120mg [Mimulus guttatus] Length = 352 Score = 67.4 bits (163), Expect = 1e-08 Identities = 33/89 (37%), Positives = 53/89 (59%), Gaps = 1/89 (1%) Frame = +3 Query: 882 ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIID-RCQW 1058 +S +WR++G + +I+NQG CG+C+ + ++ + I T N+ + LS QE+ID + Sbjct: 136 KSVDWRKKGAVAAIKNQGSCGSCWAFSTVAAVEGINQIVTGNLTE--LSEQELIDCDTSY 193 Query: 1059 THGCKGGNTALAFLYAIFKPGLSRIEDIP 1145 +GC GG AF Y + K GL + ED P Sbjct: 194 NNGCNGGLMDYAFAYIVSKGGLHKEEDYP 222 >gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata] Length = 365 Score = 67.4 bits (163), Expect = 1e-08 Identities = 28/88 (31%), Positives = 52/88 (59%) Frame = +3 Query: 882 ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWT 1061 E +WR++G + ++NQG+CG+C+ + ++ ++ I T N+ ++LS Q+++D + Sbjct: 136 EQIDWRKKGAVTPVKNQGKCGSCWAFSTVSTVESINQIRTGNL--ISLSEQQLVDCNKKN 193 Query: 1062 HGCKGGNTALAFLYAIFKPGLSRIEDIP 1145 HGCKGG A+ Y I G+ + P Sbjct: 194 HGCKGGAFVYAYQYIIDNGGIDTEANYP 221 >pdb|2PNS|A Chain A, 1.9 Angstrom Resolution Crystal Structure Of A Plant Cysteine Protease Ervatamin-C Refinement With Cdna Derived Amino Acid Sequence gi|150261414|pdb|2PNS|B Chain B, 1.9 Angstrom Resolution Crystal Structure Of A Plant Cysteine Protease Ervatamin-C Refinement With Cdna Derived Amino Acid Sequence gi|166007115|pdb|2PRE|A Chain A, Crystal Structure Of Plant Cysteine Protease Ervatamin-C Complexed With Irreversible Inhibitor E-64 At 2.7 A Resolution gi|166007116|pdb|2PRE|B Chain B, Crystal Structure Of Plant Cysteine Protease Ervatamin-C Complexed With Irreversible Inhibitor E-64 At 2.7 A Resolution Length = 208 Score = 67.4 bits (163), Expect = 1e-08 Identities = 28/88 (31%), Positives = 52/88 (59%) Frame = +3 Query: 882 ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWT 1061 E +WR++G + ++NQG+CG+C+ + ++ ++ I T N+ ++LS Q+++D + Sbjct: 3 EQIDWRKKGAVTPVKNQGKCGSCWAFSTVSTVESINQIRTGNL--ISLSEQQLVDCNKKN 60 Query: 1062 HGCKGGNTALAFLYAIFKPGLSRIEDIP 1145 HGCKGG A+ Y I G+ + P Sbjct: 61 HGCKGGAFVYAYQYIIDNGGIDTEANYP 88 >gb|AGV15823.1| cysteine protease CP15 [Nicotiana tabacum] Length = 474 Score = 67.0 bits (162), Expect = 1e-08 Identities = 36/90 (40%), Positives = 54/90 (60%), Gaps = 2/90 (2%) Frame = +3 Query: 882 ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQW- 1058 +S +WR++GVL ++NQG+CG+C+ + SI AV I T N+ ++LS QE++D C Sbjct: 147 DSVDWRKKGVLVDVKNQGQCGSCWAFSAVASIEAVNKIMTGNL--ISLSEQELVD-CDTA 203 Query: 1059 -THGCKGGNTALAFLYAIFKPGLSRIEDIP 1145 GC+GG AF + I G+ ED P Sbjct: 204 DNQGCQGGLMDDAFKFVIQNGGIDTEEDYP 233 >gb|AGI59309.1| procerain B, partial [Calotropis procera] Length = 212 Score = 67.0 bits (162), Expect = 1e-08 Identities = 36/87 (41%), Positives = 53/87 (60%) Frame = +3 Query: 885 SFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWTH 1064 S +WRE+ V+ IRNQG+CG+C+ + SI + I + M +ALS QE++D + ++ Sbjct: 4 SVDWREKDVVFPIRNQGQCGSCWTFSAVASIETLIGIKEDRM--IALSEQELLDCERTSY 61 Query: 1065 GCKGGNTALAFLYAIFKPGLSRIEDIP 1145 GCKGG AF Y + K GL+ E P Sbjct: 62 GCKGGYYTNAFAY-VAKKGLTSREKYP 87 >gb|ABL85445.1| cathepsin L [Kudoa thyrsites] Length = 300 Score = 66.6 bits (161), Expect = 2e-08 Identities = 33/98 (33%), Positives = 55/98 (56%) Frame = +3 Query: 852 ALNAVKLKPGESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSY 1031 A +K S +W+ G + S++NQG+CG+C+ + A +I + + I T + V S Sbjct: 94 ATKDIKSTLPSSVDWKALGKVTSVKNQGQCGSCWSFSAAGAIESAYAIKTGEL--VNFSE 151 Query: 1032 QEIIDRCQWTHGCKGGNTALAFLYAIFKPGLSRIEDIP 1145 Q+++D HGC GG +AFLY I G+ +++D P Sbjct: 152 QQLVDCSTENHGCNGGLPEIAFLYVI-NNGIMKLKDYP 188 >gb|ABL85443.1| cathepsin L [Kudoa thyrsites] Length = 300 Score = 66.6 bits (161), Expect = 2e-08 Identities = 33/98 (33%), Positives = 55/98 (56%) Frame = +3 Query: 852 ALNAVKLKPGESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSY 1031 A +K S +W+ G + S++NQG+CG+C+ + A +I + + I T + V S Sbjct: 94 ATKDIKSTLPSSVDWKALGKVTSVKNQGQCGSCWSFSAAGAIESAYAIKTGEL--VNFSE 151 Query: 1032 QEIIDRCQWTHGCKGGNTALAFLYAIFKPGLSRIEDIP 1145 Q+++D HGC GG +AFLY I G+ +++D P Sbjct: 152 QQLVDCSTENHGCNGGLPEIAFLYVI-NNGIMKLKDYP 188 >gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla] Length = 501 Score = 66.2 bits (160), Expect = 3e-08 Identities = 31/87 (35%), Positives = 54/87 (62%) Frame = +3 Query: 885 SFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWTH 1064 S +WR++GV+ +++QG+CG+C+ +V+ SI + I T ++ + LS QE++D + + Sbjct: 146 SLDWRDKGVVTPMKDQGQCGSCWAFSVSGSIESANAIATGDL--IRLSEQELVDCDTYDY 203 Query: 1065 GCKGGNTALAFLYAIFKPGLSRIEDIP 1145 GC GGN A+ + I GL +D P Sbjct: 204 GCDGGNMDTAYRWIIKNGGLDSEDDYP 230 >emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. fruticosus] Length = 340 Score = 66.2 bits (160), Expect = 3e-08 Identities = 36/88 (40%), Positives = 54/88 (61%) Frame = +3 Query: 882 ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWT 1061 +S +WRE+GV+ IRNQG+CG+C+ + SI + I +M +ALS QE++D + Sbjct: 132 DSVDWREKGVVFPIRNQGKCGSCWTFSAVASIETLNGIKKGHM--IALSEQELLDCETIS 189 Query: 1062 HGCKGGNTALAFLYAIFKPGLSRIEDIP 1145 GCKGG+ AF Y + K G++ E P Sbjct: 190 QGCKGGHYNNAFAY-VAKNGITSEEKYP 216 >gb|ESN92362.1| hypothetical protein HELRODRAFT_157956 [Helobdella robusta] Length = 310 Score = 65.9 bits (159), Expect = 3e-08 Identities = 32/97 (32%), Positives = 55/97 (56%) Frame = +3 Query: 867 KLKPGESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIID 1046 ++ P F+WRE G + ++NQG CG+C+ +V +I ++I + ++LS Q+++D Sbjct: 91 QVDPPPQFDWREHGAVTPVKNQGMCGSCWAFSVTGNIEGQWSIKKKKL--LSLSEQQLVD 148 Query: 1047 RCQWTHGCKGGNTALAFLYAIFKPGLSRIEDIPI*GK 1157 + GC GG +LA+L + GL +D P GK Sbjct: 149 CDKLDEGCNGGLPSLAYLEIMRMGGLESEKDYPYSGK 185 >ref|XP_004966191.1| PREDICTED: oryzain alpha chain-like [Setaria italica] Length = 374 Score = 65.9 bits (159), Expect = 3e-08 Identities = 32/97 (32%), Positives = 53/97 (54%) Frame = +3 Query: 867 KLKPGESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIID 1046 +L+ + +WR+ G + ++NQG CG C+ + ++ + I T + V+LS QE+ID Sbjct: 138 ELQVPSAVDWRKSGAVTPVKNQGACGGCWAFSAVAAMEGINKIATGKL--VSLSEQELID 195 Query: 1047 RCQWTHGCKGGNTALAFLYAIFKPGLSRIEDIPI*GK 1157 + +HGCKGG AF + I G+ D P G+ Sbjct: 196 CDRKSHGCKGGRMDYAFQFVISNGGIDTEADYPYTGR 232 >ref|XP_004963863.1| PREDICTED: cysteine proteinase EP-B 2-like [Setaria italica] Length = 348 Score = 65.9 bits (159), Expect = 3e-08 Identities = 28/88 (31%), Positives = 51/88 (57%) Frame = +3 Query: 882 ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWT 1061 ++ +WR+QG + ++NQG CG+C+ + ++ + I T N+ ++LS Q+++D Sbjct: 140 QAVDWRKQGAVTGVKNQGTCGSCWAFSAVAAVEGIHQITTGNL--ISLSEQQVLDCSTGN 197 Query: 1062 HGCKGGNTALAFLYAIFKPGLSRIEDIP 1145 +GC GG+ AF Y I GL+ + P Sbjct: 198 NGCNGGSMDKAFQYIINNGGLTTEDTYP 225 >ref|XP_004293931.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Fragaria vesca subsp. vesca] Length = 345 Score = 65.9 bits (159), Expect = 3e-08 Identities = 30/85 (35%), Positives = 50/85 (58%) Frame = +3 Query: 885 SFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWTH 1064 S +WREQ + I++QG CGAC+ TV ++ + I T + ++LS Q+++D Sbjct: 131 SMDWREQAAVTGIKDQGRCGACWAFTVVAAVEGLTKIKTGQL--ISLSEQQLVDCSHQNG 188 Query: 1065 GCKGGNTALAFLYAIFKPGLSRIED 1139 GC+GG+ A+ Y I G++R E+ Sbjct: 189 GCRGGSLESAYEYVIQNGGIAREEN 213 >sp|P83654.1|ERVC_TABDI RecName: Full=Ervatamin-C; Short=ERV-C gi|46014979|pdb|1O0E|A Chain A, 1.9 Angstrom Crystal Structure Of A Plant Cysteine Protease Ervatamin C gi|46014980|pdb|1O0E|B Chain B, 1.9 Angstrom Crystal Structure Of A Plant Cysteine Protease Ervatamin C Length = 208 Score = 65.9 bits (159), Expect = 3e-08 Identities = 28/88 (31%), Positives = 50/88 (56%) Frame = +3 Query: 882 ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWT 1061 E +WR++G + ++NQG CG+C+ + ++ ++ I T N+ ++LS QE++D + Sbjct: 3 EQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNL--ISLSEQELVDCDKKN 60 Query: 1062 HGCKGGNTALAFLYAIFKPGLSRIEDIP 1145 HGC GG A+ Y I G+ + P Sbjct: 61 HGCLGGAFVFAYQYIINNGGIDTQANYP 88 >ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula] gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula] Length = 350 Score = 65.9 bits (159), Expect = 3e-08 Identities = 29/87 (33%), Positives = 52/87 (59%) Frame = +3 Query: 885 SFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWTH 1064 +F+WRE+GV+ ++NQ +CG C+ T ++ + I N+ ++LS Q+++D + + Sbjct: 124 NFDWREKGVVTDVKNQRQCGCCWAFTAVAAVEGIVKIKNGNL--ISLSEQQLVDCDRQSS 181 Query: 1065 GCKGGNTALAFLYAIFKPGLSRIEDIP 1145 GC GG+ LAF I G+ + +D P Sbjct: 182 GCGGGDFVLAFDSIIKSRGIVKEDDYP 208