BLASTX nr result

ID: Atractylodes21_contig00016397 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00016397
         (1564 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          583   e-164
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   573   e-161
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   562   e-158
ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|2...   560   e-157
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   556   e-156

>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  583 bits (1504), Expect = e-164
 Identities = 271/410 (66%), Positives = 323/410 (78%)
 Frame = -1

Query: 1381 HLFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEF 1202
            HLF+ WCQQHGK+Y+++EEKL+RL VFQDNY ++  H      SYTLSLNAFADLTHHEF
Sbjct: 28   HLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHHEF 87

Query: 1201 KLARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSF 1022
            K +RL   SA+S S ++ R NR   + +   D+P S+DWR+ GAVT VKDQG+CGACWSF
Sbjct: 88   KASRLGLSSAASASLNVDRSNR--QIPDFVADVPASVDWRKNGAVTQVKDQGNCGACWSF 145

Query: 1021 SATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDY 842
            SATGA+EGIN+IVTGSL+SLSEQELVDCD+S+N+GC+GG+MDYA++FVI NHGIDTEEDY
Sbjct: 146  SATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEEDY 205

Query: 841  PYQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSK 662
            PYQGR+ SCNK K  R+VVTIDGY D+P+N+E +LLKAV  QPVSVGICGSERAFQLYSK
Sbjct: 206  PYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYSK 265

Query: 661  GVFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGI 482
            G+FTGPCSTSLDHAVLIVGY S++GVDYWI+KNSWG+ WGMDGYM+M RN+G S GLCGI
Sbjct: 266  GIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRGLCGI 325

Query: 481  NMLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVC 302
            NMLASY               P +C+LF+ C EGETCCC     GIC  W CCEL+++VC
Sbjct: 326  NMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKCCELDSAVC 385

Query: 301  CKDHHHCCPSDYPICDSERNLCLKQTGNGTVAKQAKKKSIFGKSGGRSSL 152
            CKD  HCCP DYP+CD+ RN+CLK  GN T  ++  K S  GK    SSL
Sbjct: 386  CKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAKNSSSGKFRSWSSL 435


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  573 bits (1478), Expect = e-161
 Identities = 273/408 (66%), Positives = 316/408 (77%)
 Frame = -1

Query: 1378 LFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEFK 1199
            LF+ WC +HGKSYS+ EEKLYRL VF DNY ++  H      SYTLSLN++ADLTHHEFK
Sbjct: 28   LFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTHHEFK 87

Query: 1198 LARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSFS 1019
            ++RL    A    R +  L +  SL     D+P SLDWR+KGAVT VKDQGSCGACWSFS
Sbjct: 88   VSRLGFSPALRNFRPV--LPQEPSL---PRDVPDSLDWRKKGAVTAVKDQGSCGACWSFS 142

Query: 1018 ATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDYP 839
            ATGAMEGINQI+TGSL+SLSEQEL+DCDRS+N GC GGLMDYAY+FVI NHGIDTE DYP
Sbjct: 143  ATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDTENDYP 202

Query: 838  YQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSKG 659
            YQ R+ SC K+K  RNVVTIDGY DIP NDE +LL+AV  QPVSVGICGSERAFQLYSKG
Sbjct: 203  YQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLYSKG 262

Query: 658  VFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGIN 479
            +F+GPCSTSLDHAVLIVGY S++GVDYWI+KNSWG SWGMDGYM+M RN+G+S G+CGIN
Sbjct: 263  IFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEGVCGIN 322

Query: 478  MLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVCC 299
             LASY               P KC++ + CA GETCCCA KFLG+C  W CC L+++VCC
Sbjct: 323  KLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKCCGLSSAVCC 382

Query: 298  KDHHHCCPSDYPICDSERNLCLKQTGNGTVAKQAKKKSIFGKSGGRSS 155
            KD  HCCP DYPICD++RNLCLKQT NGT  +  + +S  G SG  SS
Sbjct: 383  KDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENRSSSGSSGTWSS 430


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  562 bits (1449), Expect = e-158
 Identities = 257/383 (67%), Positives = 313/383 (81%)
 Frame = -1

Query: 1378 LFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEFK 1199
            LF+ W ++HGK+Y+++E+KLYR  +F++NY ++ +H      SYTLSLNAFADLTHHEFK
Sbjct: 31   LFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADLTHHEFK 90

Query: 1198 LARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSFS 1019
             +RL GLSA S S  L R  R   L +   D+P S+DWR+KGAV+ VKDQG+CGACWSFS
Sbjct: 91   ASRL-GLSAFSTSGKLSR--RNFPLHDFVGDVPISIDWRKKGAVSQVKDQGNCGACWSFS 147

Query: 1018 ATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDYP 839
            ATGA+EGIN+IVTGSL+SLSEQELVDCDRS+N+GC+GGLMDYAY+FVI+N+GIDTEEDYP
Sbjct: 148  ATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGIDTEEDYP 207

Query: 838  YQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSKG 659
            YQ RE +CNK K  R+VVTIDGY D+P+N+E +LLKAV  QPVSVGICGSERAFQLYSKG
Sbjct: 208  YQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSKG 267

Query: 658  VFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGIN 479
            +FTGPCSTSLDHAVLIVGY S++GVDYWI+KNSWGT WG++GYMYM RN+G+S GLCGIN
Sbjct: 268  IFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGIN 327

Query: 478  MLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVCC 299
            MLAS+               P KC+LF+ C EGETCCC  +  G+CF W CCEL+++VCC
Sbjct: 328  MLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKCCELDSAVCC 387

Query: 298  KDHHHCCPSDYPICDSERNLCLK 230
            KD  HCCP DYP+CD++RN+CLK
Sbjct: 388  KDGLHCCPHDYPVCDTKRNMCLK 410


>ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|222857137|gb|EEE94684.1|
            predicted protein [Populus trichocarpa]
          Length = 436

 Score =  560 bits (1443), Expect = e-157
 Identities = 256/406 (63%), Positives = 318/406 (78%)
 Frame = -1

Query: 1378 LFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEFK 1199
            LF+ WC++HGKSY+++EE+ +RL VF+DNY ++ +H      SY+L+LNAFADLTHHEFK
Sbjct: 28   LFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEFK 87

Query: 1198 LARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSFS 1019
             +RL GLSA+     L   +R   +     D+P S+DWR KG VT VKDQGSCGACWSFS
Sbjct: 88   TSRL-GLSAAP----LNLAHRNLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGACWSFS 142

Query: 1018 ATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDYP 839
            ATGA+EGIN+IVTGSL+SLSEQEL++CD+S+NDGC GGLMDYA++FVI NHGIDTEEDYP
Sbjct: 143  ATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEEDYP 202

Query: 838  YQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSKG 659
            Y+ R+ +CNK++  R VVTID Y D+PEN+E QLL+AV  QPVSVGICGSERAFQ+YSKG
Sbjct: 203  YRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSKG 262

Query: 658  VFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGIN 479
            +FTGPCSTSLDHAVLIVGY S++GVDYWI+KNSWGT WGM GYM+M RN+G+S G+CGIN
Sbjct: 263  IFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGIN 322

Query: 478  MLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVCC 299
            MLASY               P KCNL ++CA GETCCCA KF GIC  W CC L+++VCC
Sbjct: 323  MLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKCCGLDSAVCC 382

Query: 298  KDHHHCCPSDYPICDSERNLCLKQTGNGTVAKQAKKKSIFGKSGGR 161
            KD  HCCP DYP+CD+++N+C K+ GN T     + ++I GK+ G+
Sbjct: 383  KDRLHCCPHDYPVCDTDKNMCFKRAGNAT-----RMEAIEGKTSGK 423


>gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  556 bits (1434), Expect = e-156
 Identities = 256/398 (64%), Positives = 314/398 (78%)
 Frame = -1

Query: 1378 LFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEFK 1199
            LF  WCQ+HGK+Y +EEE+  R+ +F+DN+ ++ +H      +Y+LSLNAFADLTHHEFK
Sbjct: 31   LFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFK 90

Query: 1198 LARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSFS 1019
             +RL GLS S+PS  +I  ++G SL   S  +P S+DWR+KGAVT VKDQGSCGACWSFS
Sbjct: 91   ASRL-GLSVSAPS--VIMASKGQSL-GGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146

Query: 1018 ATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDYP 839
            ATGAMEGINQIVTG L+SLSEQEL+DCD+S+N GC+GGLMDYA+EFVIKNHGIDTE+DYP
Sbjct: 147  ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206

Query: 838  YQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSKG 659
            YQ R+ +C K+K  + VVTID Y  +  NDE  L++AV  QPVSVGICGSERAFQLYS+G
Sbjct: 207  YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSRG 266

Query: 658  VFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGIN 479
            +F+GPCSTSLDHAVLIVGY S++GVDYWI+KNSWG SWGMDG+M+M RNT +S G+CGIN
Sbjct: 267  IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGIN 326

Query: 478  MLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVCC 299
            MLASY               P KCNLF++C+ GETCCCA +  G+CF W CCE+ ++VCC
Sbjct: 327  MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCC 386

Query: 298  KDHHHCCPSDYPICDSERNLCLKQTGNGTVAKQAKKKS 185
            KD  HCCP DYP+CD+ R+LCLK+TGN T  K   KK+
Sbjct: 387  KDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKN 424


Top