BLASTX nr result

ID: Papaver25_contig00021286 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver25_contig00021286
         (1181 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006291447.1| hypothetical protein CARUB_v10017584mg [Caps...    71   1e-09
ref|XP_001020178.1| Papain family cysteine protease containing p...    69   4e-09
emb|CBI99501.1| cysteine peptidase precursor [Bromelia hieronymi]      68   7e-09
ref|XP_007068163.1| PREDICTED: cathepsin S-like [Chelonia mydas]       68   9e-09
gb|EMP27944.1| Golgi phosphoprotein 3-like protein [Chelonia mydas]    68   9e-09
gb|EYU36591.1| hypothetical protein MIMGU_mgv1a009120mg [Mimulus...    67   1e-08
gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]      67   1e-08
pdb|2PNS|A Chain A, 1.9 Angstrom Resolution Crystal Structure Of...    67   1e-08
gb|AGV15823.1| cysteine protease CP15 [Nicotiana tabacum]              67   1e-08
gb|AGI59309.1| procerain B, partial [Calotropis procera]               67   1e-08
gb|ABL85445.1| cathepsin L [Kudoa thyrsites]                           67   2e-08
gb|ABL85443.1| cathepsin L [Kudoa thyrsites]                           67   2e-08
gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]        66   3e-08
emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. ...    66   3e-08
gb|ESN92362.1| hypothetical protein HELRODRAFT_157956 [Helobdell...    66   3e-08
ref|XP_004966191.1| PREDICTED: oryzain alpha chain-like [Setaria...    66   3e-08
ref|XP_004963863.1| PREDICTED: cysteine proteinase EP-B 2-like [...    66   3e-08
ref|XP_004293931.1| PREDICTED: KDEL-tailed cysteine endopeptidas...    66   3e-08
sp|P83654.1|ERVC_TABDI RecName: Full=Ervatamin-C; Short=ERV-C gi...    66   3e-08
ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula] gi...    66   3e-08

>ref|XP_006291447.1| hypothetical protein CARUB_v10017584mg [Capsella rubella]
            gi|482560154|gb|EOA24345.1| hypothetical protein
            CARUB_v10017584mg [Capsella rubella]
          Length = 340

 Score = 70.9 bits (172), Expect = 1e-09
 Identities = 31/89 (34%), Positives = 53/89 (59%)
 Frame = +3

Query: 879  GESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQW 1058
            GES +WR++G + S++NQG CG C+  +   ++  +  I+   +  V+LS Q+++D    
Sbjct: 126  GESMDWRQEGAVTSVKNQGHCGCCWAFSTVAAVEGITKISKGEL--VSLSVQQLVDCNTE 183

Query: 1059 THGCKGGNTALAFLYAIFKPGLSRIEDIP 1145
            ++GC GGNT  AF Y I   G++  +  P
Sbjct: 184  SYGCGGGNTVNAFNYIIKNQGITTEDSYP 212


>ref|XP_001020178.1| Papain family cysteine protease containing protein [Tetrahymena
            thermophila] gi|89301945|gb|EAR99933.1| papain family
            cysteine protease [Tetrahymena thermophila SB210]
          Length = 339

 Score = 68.9 bits (167), Expect = 4e-09
 Identities = 33/91 (36%), Positives = 54/91 (59%), Gaps = 2/91 (2%)
 Frame = +3

Query: 891  NWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQW--TH 1064
            +WRE+GV+ +++NQGECG+C+      +I + F++ T     + LS Q++ID  +    H
Sbjct: 128  DWREKGVVTAVKNQGECGSCWAFAAVGAIESHFSLKTGK-SPIQLSEQQLIDCARQFDNH 186

Query: 1065 GCKGGNTALAFLYAIFKPGLSRIEDIPI*GK 1157
            GC GG  + AF Y  ++ G+   +D P  GK
Sbjct: 187  GCDGGLPSKAFEYIAYEGGIENSKDYPYTGK 217


>emb|CBI99501.1| cysteine peptidase precursor [Bromelia hieronymi]
          Length = 230

 Score = 68.2 bits (165), Expect = 7e-09
 Identities = 32/91 (35%), Positives = 54/91 (59%)
 Frame = +3

Query: 882  ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWT 1061
            +S +WR+ G + S++NQG CG+C+  +   ++  ++ I T N+  V+LS QE++D C  +
Sbjct: 4    QSIDWRDYGAVTSVKNQGRCGSCWSFSAIATVEGIYKIKTGNL--VSLSEQEVLD-CAVS 60

Query: 1062 HGCKGGNTALAFLYAIFKPGLSRIEDIPI*G 1154
            HGCKGG    A+ + I   G++     P  G
Sbjct: 61   HGCKGGWVDKAYNFIISNNGVTSAAYYPYKG 91


>ref|XP_007068163.1| PREDICTED: cathepsin S-like [Chelonia mydas]
          Length = 326

 Score = 67.8 bits (164), Expect = 9e-09
 Identities = 34/99 (34%), Positives = 52/99 (52%), Gaps = 6/99 (6%)
 Frame = +3

Query: 867  KLKPG----ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQ 1034
            K +PG    +S +WR++G +  ++NQG CGAC+  +   ++ A   + T N+  V+LS Q
Sbjct: 110  KPRPGSQVPDSMDWRDKGCVTDVKNQGPCGACWAFSAVGALEAQVKLKTGNL--VSLSAQ 167

Query: 1035 EIID--RCQWTHGCKGGNTALAFLYAIFKPGLSRIEDIP 1145
             ++D     W HGC GG    AF Y I   G+      P
Sbjct: 168  NLVDCSTMYWNHGCSGGFMTYAFQYIIDNDGIDSDASYP 206


>gb|EMP27944.1| Golgi phosphoprotein 3-like protein [Chelonia mydas]
          Length = 1089

 Score = 67.8 bits (164), Expect = 9e-09
 Identities = 34/99 (34%), Positives = 52/99 (52%), Gaps = 6/99 (6%)
 Frame = +3

Query: 867  KLKPG----ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQ 1034
            K +PG    +S +WR++G +  ++NQG CGAC+  +   ++ A   + T N+  V+LS Q
Sbjct: 151  KPRPGSQVPDSMDWRDKGCVTDVKNQGPCGACWAFSAVGALEAQVKLKTGNL--VSLSAQ 208

Query: 1035 EIID--RCQWTHGCKGGNTALAFLYAIFKPGLSRIEDIP 1145
             ++D     W HGC GG    AF Y I   G+      P
Sbjct: 209  NLVDCSTMYWNHGCSGGFMTYAFQYIIDNDGIDSDASYP 247



 Score = 57.8 bits (138), Expect = 9e-06
 Identities = 29/95 (30%), Positives = 47/95 (49%), Gaps = 3/95 (3%)
 Frame = +3

Query: 882  ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQW- 1058
            +  +WR+ G + +++NQG CG+C+  +   ++       T   R V+LS Q ++D C W 
Sbjct: 413  DCIDWRKSGYVTNVKNQGSCGSCWAFSAVGALEGQLKKKTG--RLVSLSPQNLVD-CSWR 469

Query: 1059 --THGCKGGNTALAFLYAIFKPGLSRIEDIPI*GK 1157
               HGC GG    AF Y +   G+      P  G+
Sbjct: 470  YGNHGCNGGFMTKAFRYVMNNSGIDSETSYPYEGQ 504


>gb|EYU36591.1| hypothetical protein MIMGU_mgv1a009120mg [Mimulus guttatus]
          Length = 352

 Score = 67.4 bits (163), Expect = 1e-08
 Identities = 33/89 (37%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
 Frame = +3

Query: 882  ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIID-RCQW 1058
            +S +WR++G + +I+NQG CG+C+  +   ++  +  I T N+ +  LS QE+ID    +
Sbjct: 136  KSVDWRKKGAVAAIKNQGSCGSCWAFSTVAAVEGINQIVTGNLTE--LSEQELIDCDTSY 193

Query: 1059 THGCKGGNTALAFLYAIFKPGLSRIEDIP 1145
             +GC GG    AF Y + K GL + ED P
Sbjct: 194  NNGCNGGLMDYAFAYIVSKGGLHKEEDYP 222


>gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
          Length = 365

 Score = 67.4 bits (163), Expect = 1e-08
 Identities = 28/88 (31%), Positives = 52/88 (59%)
 Frame = +3

Query: 882  ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWT 1061
            E  +WR++G +  ++NQG+CG+C+  +   ++ ++  I T N+  ++LS Q+++D  +  
Sbjct: 136  EQIDWRKKGAVTPVKNQGKCGSCWAFSTVSTVESINQIRTGNL--ISLSEQQLVDCNKKN 193

Query: 1062 HGCKGGNTALAFLYAIFKPGLSRIEDIP 1145
            HGCKGG    A+ Y I   G+    + P
Sbjct: 194  HGCKGGAFVYAYQYIIDNGGIDTEANYP 221


>pdb|2PNS|A Chain A, 1.9 Angstrom Resolution Crystal Structure Of A Plant
            Cysteine Protease Ervatamin-C Refinement With Cdna
            Derived Amino Acid Sequence gi|150261414|pdb|2PNS|B Chain
            B, 1.9 Angstrom Resolution Crystal Structure Of A Plant
            Cysteine Protease Ervatamin-C Refinement With Cdna
            Derived Amino Acid Sequence gi|166007115|pdb|2PRE|A Chain
            A, Crystal Structure Of Plant Cysteine Protease
            Ervatamin-C Complexed With Irreversible Inhibitor E-64 At
            2.7 A Resolution gi|166007116|pdb|2PRE|B Chain B, Crystal
            Structure Of Plant Cysteine Protease Ervatamin-C
            Complexed With Irreversible Inhibitor E-64 At 2.7 A
            Resolution
          Length = 208

 Score = 67.4 bits (163), Expect = 1e-08
 Identities = 28/88 (31%), Positives = 52/88 (59%)
 Frame = +3

Query: 882  ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWT 1061
            E  +WR++G +  ++NQG+CG+C+  +   ++ ++  I T N+  ++LS Q+++D  +  
Sbjct: 3    EQIDWRKKGAVTPVKNQGKCGSCWAFSTVSTVESINQIRTGNL--ISLSEQQLVDCNKKN 60

Query: 1062 HGCKGGNTALAFLYAIFKPGLSRIEDIP 1145
            HGCKGG    A+ Y I   G+    + P
Sbjct: 61   HGCKGGAFVYAYQYIIDNGGIDTEANYP 88


>gb|AGV15823.1| cysteine protease CP15 [Nicotiana tabacum]
          Length = 474

 Score = 67.0 bits (162), Expect = 1e-08
 Identities = 36/90 (40%), Positives = 54/90 (60%), Gaps = 2/90 (2%)
 Frame = +3

Query: 882  ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQW- 1058
            +S +WR++GVL  ++NQG+CG+C+  +   SI AV  I T N+  ++LS QE++D C   
Sbjct: 147  DSVDWRKKGVLVDVKNQGQCGSCWAFSAVASIEAVNKIMTGNL--ISLSEQELVD-CDTA 203

Query: 1059 -THGCKGGNTALAFLYAIFKPGLSRIEDIP 1145
               GC+GG    AF + I   G+   ED P
Sbjct: 204  DNQGCQGGLMDDAFKFVIQNGGIDTEEDYP 233


>gb|AGI59309.1| procerain B, partial [Calotropis procera]
          Length = 212

 Score = 67.0 bits (162), Expect = 1e-08
 Identities = 36/87 (41%), Positives = 53/87 (60%)
 Frame = +3

Query: 885  SFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWTH 1064
            S +WRE+ V+  IRNQG+CG+C+  +   SI  +  I  + M  +ALS QE++D  + ++
Sbjct: 4    SVDWREKDVVFPIRNQGQCGSCWTFSAVASIETLIGIKEDRM--IALSEQELLDCERTSY 61

Query: 1065 GCKGGNTALAFLYAIFKPGLSRIEDIP 1145
            GCKGG    AF Y + K GL+  E  P
Sbjct: 62   GCKGGYYTNAFAY-VAKKGLTSREKYP 87


>gb|ABL85445.1| cathepsin L [Kudoa thyrsites]
          Length = 300

 Score = 66.6 bits (161), Expect = 2e-08
 Identities = 33/98 (33%), Positives = 55/98 (56%)
 Frame = +3

Query: 852  ALNAVKLKPGESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSY 1031
            A   +K     S +W+  G + S++NQG+CG+C+  + A +I + + I T  +  V  S 
Sbjct: 94   ATKDIKSTLPSSVDWKALGKVTSVKNQGQCGSCWSFSAAGAIESAYAIKTGEL--VNFSE 151

Query: 1032 QEIIDRCQWTHGCKGGNTALAFLYAIFKPGLSRIEDIP 1145
            Q+++D     HGC GG   +AFLY I   G+ +++D P
Sbjct: 152  QQLVDCSTENHGCNGGLPEIAFLYVI-NNGIMKLKDYP 188


>gb|ABL85443.1| cathepsin L [Kudoa thyrsites]
          Length = 300

 Score = 66.6 bits (161), Expect = 2e-08
 Identities = 33/98 (33%), Positives = 55/98 (56%)
 Frame = +3

Query: 852  ALNAVKLKPGESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSY 1031
            A   +K     S +W+  G + S++NQG+CG+C+  + A +I + + I T  +  V  S 
Sbjct: 94   ATKDIKSTLPSSVDWKALGKVTSVKNQGQCGSCWSFSAAGAIESAYAIKTGEL--VNFSE 151

Query: 1032 QEIIDRCQWTHGCKGGNTALAFLYAIFKPGLSRIEDIP 1145
            Q+++D     HGC GG   +AFLY I   G+ +++D P
Sbjct: 152  QQLVDCSTENHGCNGGLPEIAFLYVI-NNGIMKLKDYP 188


>gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]
          Length = 501

 Score = 66.2 bits (160), Expect = 3e-08
 Identities = 31/87 (35%), Positives = 54/87 (62%)
 Frame = +3

Query: 885  SFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWTH 1064
            S +WR++GV+  +++QG+CG+C+  +V+ SI +   I T ++  + LS QE++D   + +
Sbjct: 146  SLDWRDKGVVTPMKDQGQCGSCWAFSVSGSIESANAIATGDL--IRLSEQELVDCDTYDY 203

Query: 1065 GCKGGNTALAFLYAIFKPGLSRIEDIP 1145
            GC GGN   A+ + I   GL   +D P
Sbjct: 204  GCDGGNMDTAYRWIIKNGGLDSEDDYP 230


>emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. fruticosus]
          Length = 340

 Score = 66.2 bits (160), Expect = 3e-08
 Identities = 36/88 (40%), Positives = 54/88 (61%)
 Frame = +3

Query: 882  ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWT 1061
            +S +WRE+GV+  IRNQG+CG+C+  +   SI  +  I   +M  +ALS QE++D    +
Sbjct: 132  DSVDWREKGVVFPIRNQGKCGSCWTFSAVASIETLNGIKKGHM--IALSEQELLDCETIS 189

Query: 1062 HGCKGGNTALAFLYAIFKPGLSRIEDIP 1145
             GCKGG+   AF Y + K G++  E  P
Sbjct: 190  QGCKGGHYNNAFAY-VAKNGITSEEKYP 216


>gb|ESN92362.1| hypothetical protein HELRODRAFT_157956 [Helobdella robusta]
          Length = 310

 Score = 65.9 bits (159), Expect = 3e-08
 Identities = 32/97 (32%), Positives = 55/97 (56%)
 Frame = +3

Query: 867  KLKPGESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIID 1046
            ++ P   F+WRE G +  ++NQG CG+C+  +V  +I   ++I    +  ++LS Q+++D
Sbjct: 91   QVDPPPQFDWREHGAVTPVKNQGMCGSCWAFSVTGNIEGQWSIKKKKL--LSLSEQQLVD 148

Query: 1047 RCQWTHGCKGGNTALAFLYAIFKPGLSRIEDIPI*GK 1157
              +   GC GG  +LA+L  +   GL   +D P  GK
Sbjct: 149  CDKLDEGCNGGLPSLAYLEIMRMGGLESEKDYPYSGK 185


>ref|XP_004966191.1| PREDICTED: oryzain alpha chain-like [Setaria italica]
          Length = 374

 Score = 65.9 bits (159), Expect = 3e-08
 Identities = 32/97 (32%), Positives = 53/97 (54%)
 Frame = +3

Query: 867  KLKPGESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIID 1046
            +L+   + +WR+ G +  ++NQG CG C+  +   ++  +  I T  +  V+LS QE+ID
Sbjct: 138  ELQVPSAVDWRKSGAVTPVKNQGACGGCWAFSAVAAMEGINKIATGKL--VSLSEQELID 195

Query: 1047 RCQWTHGCKGGNTALAFLYAIFKPGLSRIEDIPI*GK 1157
              + +HGCKGG    AF + I   G+    D P  G+
Sbjct: 196  CDRKSHGCKGGRMDYAFQFVISNGGIDTEADYPYTGR 232


>ref|XP_004963863.1| PREDICTED: cysteine proteinase EP-B 2-like [Setaria italica]
          Length = 348

 Score = 65.9 bits (159), Expect = 3e-08
 Identities = 28/88 (31%), Positives = 51/88 (57%)
 Frame = +3

Query: 882  ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWT 1061
            ++ +WR+QG +  ++NQG CG+C+  +   ++  +  I T N+  ++LS Q+++D     
Sbjct: 140  QAVDWRKQGAVTGVKNQGTCGSCWAFSAVAAVEGIHQITTGNL--ISLSEQQVLDCSTGN 197

Query: 1062 HGCKGGNTALAFLYAIFKPGLSRIEDIP 1145
            +GC GG+   AF Y I   GL+  +  P
Sbjct: 198  NGCNGGSMDKAFQYIINNGGLTTEDTYP 225


>ref|XP_004293931.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Fragaria
            vesca subsp. vesca]
          Length = 345

 Score = 65.9 bits (159), Expect = 3e-08
 Identities = 30/85 (35%), Positives = 50/85 (58%)
 Frame = +3

Query: 885  SFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWTH 1064
            S +WREQ  +  I++QG CGAC+  TV  ++  +  I T  +  ++LS Q+++D      
Sbjct: 131  SMDWREQAAVTGIKDQGRCGACWAFTVVAAVEGLTKIKTGQL--ISLSEQQLVDCSHQNG 188

Query: 1065 GCKGGNTALAFLYAIFKPGLSRIED 1139
            GC+GG+   A+ Y I   G++R E+
Sbjct: 189  GCRGGSLESAYEYVIQNGGIAREEN 213


>sp|P83654.1|ERVC_TABDI RecName: Full=Ervatamin-C; Short=ERV-C gi|46014979|pdb|1O0E|A Chain
            A, 1.9 Angstrom Crystal Structure Of A Plant Cysteine
            Protease Ervatamin C gi|46014980|pdb|1O0E|B Chain B, 1.9
            Angstrom Crystal Structure Of A Plant Cysteine Protease
            Ervatamin C
          Length = 208

 Score = 65.9 bits (159), Expect = 3e-08
 Identities = 28/88 (31%), Positives = 50/88 (56%)
 Frame = +3

Query: 882  ESFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWT 1061
            E  +WR++G +  ++NQG CG+C+  +   ++ ++  I T N+  ++LS QE++D  +  
Sbjct: 3    EQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNL--ISLSEQELVDCDKKN 60

Query: 1062 HGCKGGNTALAFLYAIFKPGLSRIEDIP 1145
            HGC GG    A+ Y I   G+    + P
Sbjct: 61   HGCLGGAFVFAYQYIINNGGIDTQANYP 88


>ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula] gi|355482811|gb|AES64014.1|
            Cysteine proteinase [Medicago truncatula]
          Length = 350

 Score = 65.9 bits (159), Expect = 3e-08
 Identities = 29/87 (33%), Positives = 52/87 (59%)
 Frame = +3

Query: 885  SFNWREQGVLPSIRNQGECGACYVVTVADSISAVFNINTNNMRKVALSYQEIIDRCQWTH 1064
            +F+WRE+GV+  ++NQ +CG C+  T   ++  +  I   N+  ++LS Q+++D  + + 
Sbjct: 124  NFDWREKGVVTDVKNQRQCGCCWAFTAVAAVEGIVKIKNGNL--ISLSEQQLVDCDRQSS 181

Query: 1065 GCKGGNTALAFLYAIFKPGLSRIEDIP 1145
            GC GG+  LAF   I   G+ + +D P
Sbjct: 182  GCGGGDFVLAFDSIIKSRGIVKEDDYP 208

