BLASTX nr result

ID: Atractylodes22_contig00014990 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes22_contig00014990
         (1817 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          584   e-164
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   570   e-160
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   562   e-158
ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|2...   561   e-157
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   555   e-155

>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  584 bits (1505), Expect = e-164
 Identities = 273/410 (66%), Positives = 325/410 (79%), Gaps = 1/410 (0%)
 Frame = -3

Query: 1587 YLFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEF 1408
            +LF+ WCQQHGK+Y+++EEKL+RL VFQDNY ++  H      SYTLSLNAFADLTHHEF
Sbjct: 28   HLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHHEF 87

Query: 1407 KLARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSF 1228
            K +RL   SA+S S ++ R NR   + +   D+P S+DWR+ GAVT VKDQG+CGACWSF
Sbjct: 88   KASRLGLSSAASASLNVDRSNR--QIPDFVADVPASVDWRKNGAVTQVKDQGNCGACWSF 145

Query: 1227 SATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDY 1048
            SATGA+EGIN+IVTGSL+SLSEQELVDCD+S+N+GC+GG+MDYA++FVI NHGIDTEEDY
Sbjct: 146  SATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEEDY 205

Query: 1047 PYQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSK 868
            PYQGR+ SCNK K  R+VVTIDGY D+P+N+E +LLKAV  QPVSVGICGSERAFQLYSK
Sbjct: 206  PYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYSK 265

Query: 867  GVFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGI 688
            G+FTGPCSTSLDHAVLIVGY S++GVDYWI+KNSWG+ WGMDGYM+M RN+G S GLCGI
Sbjct: 266  GIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRGLCGI 325

Query: 687  NMLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVC 508
            NMLASY               P +C+LF+ C EGETCCC     GIC  W CCEL+++VC
Sbjct: 326  NMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKCCELDSAVC 385

Query: 507  CKDHHHCCPSDYPICDSERNLCLKQTGNGT-VAKQAKNSIFGKSGGRSSL 361
            CKD  HCCP DYP+CD+ RN+CLK  GN T + K AKNS  GK    SSL
Sbjct: 386  CKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAKNSSSGKFRSWSSL 435


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  570 bits (1470), Expect = e-160
 Identities = 274/408 (67%), Positives = 316/408 (77%), Gaps = 1/408 (0%)
 Frame = -3

Query: 1584 LFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEFK 1405
            LF+ WC +HGKSYS+ EEKLYRL VF DNY ++  H      SYTLSLN++ADLTHHEFK
Sbjct: 28   LFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTHHEFK 87

Query: 1404 LARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSFS 1225
            ++RL    A    R +  L +  SL     D+P SLDWR+KGAVT VKDQGSCGACWSFS
Sbjct: 88   VSRLGFSPALRNFRPV--LPQEPSL---PRDVPDSLDWRKKGAVTAVKDQGSCGACWSFS 142

Query: 1224 ATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDYP 1045
            ATGAMEGINQI+TGSL+SLSEQEL+DCDRS+N GC GGLMDYAY+FVI NHGIDTE DYP
Sbjct: 143  ATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDTENDYP 202

Query: 1044 YQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSKG 865
            YQ R+ SC K+K  RNVVTIDGY DIP NDE +LL+AV  QPVSVGICGSERAFQLYSKG
Sbjct: 203  YQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLYSKG 262

Query: 864  VFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGIN 685
            +F+GPCSTSLDHAVLIVGY S++GVDYWI+KNSWG SWGMDGYM+M RN+G+S G+CGIN
Sbjct: 263  IFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEGVCGIN 322

Query: 684  MLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVCC 505
             LASY               P KC++ + CA GETCCCA KFLG+C  W CC L+++VCC
Sbjct: 323  KLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKCCGLSSAVCC 382

Query: 504  KDHHHCCPSDYPICDSERNLCLKQTGNGTVAKQAKN-SIFGKSGGRSS 364
            KD  HCCP DYPICD++RNLCLKQT NGT  +  +N S  G SG  SS
Sbjct: 383  KDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENRSSSGSSGTWSS 430


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  562 bits (1449), Expect = e-158
 Identities = 257/383 (67%), Positives = 313/383 (81%)
 Frame = -3

Query: 1584 LFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEFK 1405
            LF+ W ++HGK+Y+++E+KLYR  +F++NY ++ +H      SYTLSLNAFADLTHHEFK
Sbjct: 31   LFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADLTHHEFK 90

Query: 1404 LARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSFS 1225
             +RL GLSA S S  L R  R   L +   D+P S+DWR+KGAV+ VKDQG+CGACWSFS
Sbjct: 91   ASRL-GLSAFSTSGKLSR--RNFPLHDFVGDVPISIDWRKKGAVSQVKDQGNCGACWSFS 147

Query: 1224 ATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDYP 1045
            ATGA+EGIN+IVTGSL+SLSEQELVDCDRS+N+GC+GGLMDYAY+FVI+N+GIDTEEDYP
Sbjct: 148  ATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGIDTEEDYP 207

Query: 1044 YQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSKG 865
            YQ RE +CNK K  R+VVTIDGY D+P+N+E +LLKAV  QPVSVGICGSERAFQLYSKG
Sbjct: 208  YQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSKG 267

Query: 864  VFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGIN 685
            +FTGPCSTSLDHAVLIVGY S++GVDYWI+KNSWGT WG++GYMYM RN+G+S GLCGIN
Sbjct: 268  IFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGIN 327

Query: 684  MLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVCC 505
            MLAS+               P KC+LF+ C EGETCCC  +  G+CF W CCEL+++VCC
Sbjct: 328  MLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKCCELDSAVCC 387

Query: 504  KDHHHCCPSDYPICDSERNLCLK 436
            KD  HCCP DYP+CD++RN+CLK
Sbjct: 388  KDGLHCCPHDYPVCDTKRNMCLK 410


>ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|222857137|gb|EEE94684.1|
            predicted protein [Populus trichocarpa]
          Length = 436

 Score =  561 bits (1447), Expect = e-157
 Identities = 257/412 (62%), Positives = 318/412 (77%)
 Frame = -3

Query: 1584 LFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEFK 1405
            LF+ WC++HGKSY+++EE+ +RL VF+DNY ++ +H      SY+L+LNAFADLTHHEFK
Sbjct: 28   LFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEFK 87

Query: 1404 LARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSFS 1225
             +RL GLSA+     L   +R   +     D+P S+DWR KG VT VKDQGSCGACWSFS
Sbjct: 88   TSRL-GLSAAP----LNLAHRNLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGACWSFS 142

Query: 1224 ATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDYP 1045
            ATGA+EGIN+IVTGSL+SLSEQEL++CD+S+NDGC GGLMDYA++FVI NHGIDTEEDYP
Sbjct: 143  ATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEEDYP 202

Query: 1044 YQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSKG 865
            Y+ R+ +CNK++  R VVTID Y D+PEN+E QLL+AV  QPVSVGICGSERAFQ+YSKG
Sbjct: 203  YRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSKG 262

Query: 864  VFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGIN 685
            +FTGPCSTSLDHAVLIVGY S++GVDYWI+KNSWGT WGM GYM+M RN+G+S G+CGIN
Sbjct: 263  IFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGIN 322

Query: 684  MLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVCC 505
            MLASY               P KCNL ++CA GETCCCA KF GIC  W CC L+++VCC
Sbjct: 323  MLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKCCGLDSAVCC 382

Query: 504  KDHHHCCPSDYPICDSERNLCLKQTGNGTVAKQAKNSIFGKSGGRSSLHQQY 349
            KD  HCCP DYP+CD+++N+C K+ GN T  +  +    GK G   SL + +
Sbjct: 383  KDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGKTSGKFGSWISLPEAW 434


>gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  555 bits (1430), Expect = e-155
 Identities = 257/405 (63%), Positives = 314/405 (77%)
 Frame = -3

Query: 1584 LFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEFK 1405
            LF  WCQ+HGK+Y +EEE+  R+ +F+DN+ ++ +H      +Y+LSLNAFADLTHHEFK
Sbjct: 31   LFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFK 90

Query: 1404 LARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSFS 1225
             +RL GLS S+PS  +I  ++G SL   S  +P S+DWR+KGAVT VKDQGSCGACWSFS
Sbjct: 91   ASRL-GLSVSAPS--VIMASKGQSL-GGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146

Query: 1224 ATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDYP 1045
            ATGAMEGINQIVTG L+SLSEQEL+DCD+S+N GC+GGLMDYA+EFVIKNHGIDTE+DYP
Sbjct: 147  ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206

Query: 1044 YQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSKG 865
            YQ R+ +C K+K  + VVTID Y  +  NDE  L++AV  QPVSVGICGSERAFQLYS+G
Sbjct: 207  YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSRG 266

Query: 864  VFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGIN 685
            +F+GPCSTSLDHAVLIVGY S++GVDYWI+KNSWG SWGMDG+M+M RNT +S G+CGIN
Sbjct: 267  IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGIN 326

Query: 684  MLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVCC 505
            MLASY               P KCNLF++C+ GETCCCA +  G+CF W CCE+ ++VCC
Sbjct: 327  MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCC 386

Query: 504  KDHHHCCPSDYPICDSERNLCLKQTGNGTVAKQAKNSIFGKSGGR 370
            KD  HCCP DYP+CD+ R+LCLK+TGN T  K        K  GR
Sbjct: 387  KDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSSKQLGR 431


Top