BLASTX nr result
ID: Atractylodes21_contig00016397
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Atractylodes21_contig00016397 (1564 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] 583 e-164 ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C... 573 e-161 ref|XP_002510459.1| cysteine protease, putative [Ricinus communi... 562 e-158 ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|2... 560 e-157 gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A... 556 e-156 >dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] Length = 441 Score = 583 bits (1504), Expect = e-164 Identities = 271/410 (66%), Positives = 323/410 (78%) Frame = -1 Query: 1381 HLFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEF 1202 HLF+ WCQQHGK+Y+++EEKL+RL VFQDNY ++ H SYTLSLNAFADLTHHEF Sbjct: 28 HLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHHEF 87 Query: 1201 KLARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSF 1022 K +RL SA+S S ++ R NR + + D+P S+DWR+ GAVT VKDQG+CGACWSF Sbjct: 88 KASRLGLSSAASASLNVDRSNR--QIPDFVADVPASVDWRKNGAVTQVKDQGNCGACWSF 145 Query: 1021 SATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDY 842 SATGA+EGIN+IVTGSL+SLSEQELVDCD+S+N+GC+GG+MDYA++FVI NHGIDTEEDY Sbjct: 146 SATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEEDY 205 Query: 841 PYQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSK 662 PYQGR+ SCNK K R+VVTIDGY D+P+N+E +LLKAV QPVSVGICGSERAFQLYSK Sbjct: 206 PYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYSK 265 Query: 661 GVFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGI 482 G+FTGPCSTSLDHAVLIVGY S++GVDYWI+KNSWG+ WGMDGYM+M RN+G S GLCGI Sbjct: 266 GIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRGLCGI 325 Query: 481 NMLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVC 302 NMLASY P +C+LF+ C EGETCCC GIC W CCEL+++VC Sbjct: 326 NMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKCCELDSAVC 385 Query: 301 CKDHHHCCPSDYPICDSERNLCLKQTGNGTVAKQAKKKSIFGKSGGRSSL 152 CKD HCCP DYP+CD+ RN+CLK GN T ++ K S GK SSL Sbjct: 386 CKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAKNSSSGKFRSWSSL 435 >ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 431 Score = 573 bits (1478), Expect = e-161 Identities = 273/408 (66%), Positives = 316/408 (77%) Frame = -1 Query: 1378 LFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEFK 1199 LF+ WC +HGKSYS+ EEKLYRL VF DNY ++ H SYTLSLN++ADLTHHEFK Sbjct: 28 LFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTHHEFK 87 Query: 1198 LARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSFS 1019 ++RL A R + L + SL D+P SLDWR+KGAVT VKDQGSCGACWSFS Sbjct: 88 VSRLGFSPALRNFRPV--LPQEPSL---PRDVPDSLDWRKKGAVTAVKDQGSCGACWSFS 142 Query: 1018 ATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDYP 839 ATGAMEGINQI+TGSL+SLSEQEL+DCDRS+N GC GGLMDYAY+FVI NHGIDTE DYP Sbjct: 143 ATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDTENDYP 202 Query: 838 YQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSKG 659 YQ R+ SC K+K RNVVTIDGY DIP NDE +LL+AV QPVSVGICGSERAFQLYSKG Sbjct: 203 YQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLYSKG 262 Query: 658 VFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGIN 479 +F+GPCSTSLDHAVLIVGY S++GVDYWI+KNSWG SWGMDGYM+M RN+G+S G+CGIN Sbjct: 263 IFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEGVCGIN 322 Query: 478 MLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVCC 299 LASY P KC++ + CA GETCCCA KFLG+C W CC L+++VCC Sbjct: 323 KLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKCCGLSSAVCC 382 Query: 298 KDHHHCCPSDYPICDSERNLCLKQTGNGTVAKQAKKKSIFGKSGGRSS 155 KD HCCP DYPICD++RNLCLKQT NGT + + +S G SG SS Sbjct: 383 KDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENRSSSGSSGTWSS 430 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 562 bits (1449), Expect = e-158 Identities = 257/383 (67%), Positives = 313/383 (81%) Frame = -1 Query: 1378 LFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEFK 1199 LF+ W ++HGK+Y+++E+KLYR +F++NY ++ +H SYTLSLNAFADLTHHEFK Sbjct: 31 LFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADLTHHEFK 90 Query: 1198 LARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSFS 1019 +RL GLSA S S L R R L + D+P S+DWR+KGAV+ VKDQG+CGACWSFS Sbjct: 91 ASRL-GLSAFSTSGKLSR--RNFPLHDFVGDVPISIDWRKKGAVSQVKDQGNCGACWSFS 147 Query: 1018 ATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDYP 839 ATGA+EGIN+IVTGSL+SLSEQELVDCDRS+N+GC+GGLMDYAY+FVI+N+GIDTEEDYP Sbjct: 148 ATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGIDTEEDYP 207 Query: 838 YQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSKG 659 YQ RE +CNK K R+VVTIDGY D+P+N+E +LLKAV QPVSVGICGSERAFQLYSKG Sbjct: 208 YQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSKG 267 Query: 658 VFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGIN 479 +FTGPCSTSLDHAVLIVGY S++GVDYWI+KNSWGT WG++GYMYM RN+G+S GLCGIN Sbjct: 268 IFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGIN 327 Query: 478 MLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVCC 299 MLAS+ P KC+LF+ C EGETCCC + G+CF W CCEL+++VCC Sbjct: 328 MLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKCCELDSAVCC 387 Query: 298 KDHHHCCPSDYPICDSERNLCLK 230 KD HCCP DYP+CD++RN+CLK Sbjct: 388 KDGLHCCPHDYPVCDTKRNMCLK 410 >ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa] Length = 436 Score = 560 bits (1443), Expect = e-157 Identities = 256/406 (63%), Positives = 318/406 (78%) Frame = -1 Query: 1378 LFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEFK 1199 LF+ WC++HGKSY+++EE+ +RL VF+DNY ++ +H SY+L+LNAFADLTHHEFK Sbjct: 28 LFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEFK 87 Query: 1198 LARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSFS 1019 +RL GLSA+ L +R + D+P S+DWR KG VT VKDQGSCGACWSFS Sbjct: 88 TSRL-GLSAAP----LNLAHRNLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGACWSFS 142 Query: 1018 ATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDYP 839 ATGA+EGIN+IVTGSL+SLSEQEL++CD+S+NDGC GGLMDYA++FVI NHGIDTEEDYP Sbjct: 143 ATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEEDYP 202 Query: 838 YQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSKG 659 Y+ R+ +CNK++ R VVTID Y D+PEN+E QLL+AV QPVSVGICGSERAFQ+YSKG Sbjct: 203 YRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSKG 262 Query: 658 VFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGIN 479 +FTGPCSTSLDHAVLIVGY S++GVDYWI+KNSWGT WGM GYM+M RN+G+S G+CGIN Sbjct: 263 IFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGIN 322 Query: 478 MLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVCC 299 MLASY P KCNL ++CA GETCCCA KF GIC W CC L+++VCC Sbjct: 323 MLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKCCGLDSAVCC 382 Query: 298 KDHHHCCPSDYPICDSERNLCLKQTGNGTVAKQAKKKSIFGKSGGR 161 KD HCCP DYP+CD+++N+C K+ GN T + ++I GK+ G+ Sbjct: 383 KDRLHCCPHDYPVCDTDKNMCFKRAGNAT-----RMEAIEGKTSGK 423 >gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] Length = 437 Score = 556 bits (1434), Expect = e-156 Identities = 256/398 (64%), Positives = 314/398 (78%) Frame = -1 Query: 1378 LFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEFK 1199 LF WCQ+HGK+Y +EEE+ R+ +F+DN+ ++ +H +Y+LSLNAFADLTHHEFK Sbjct: 31 LFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFK 90 Query: 1198 LARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSFS 1019 +RL GLS S+PS +I ++G SL S +P S+DWR+KGAVT VKDQGSCGACWSFS Sbjct: 91 ASRL-GLSVSAPS--VIMASKGQSL-GGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146 Query: 1018 ATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDYP 839 ATGAMEGINQIVTG L+SLSEQEL+DCD+S+N GC+GGLMDYA+EFVIKNHGIDTE+DYP Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206 Query: 838 YQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSKG 659 YQ R+ +C K+K + VVTID Y + NDE L++AV QPVSVGICGSERAFQLYS+G Sbjct: 207 YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSRG 266 Query: 658 VFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGIN 479 +F+GPCSTSLDHAVLIVGY S++GVDYWI+KNSWG SWGMDG+M+M RNT +S G+CGIN Sbjct: 267 IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGIN 326 Query: 478 MLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVCC 299 MLASY P KCNLF++C+ GETCCCA + G+CF W CCE+ ++VCC Sbjct: 327 MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCC 386 Query: 298 KDHHHCCPSDYPICDSERNLCLKQTGNGTVAKQAKKKS 185 KD HCCP DYP+CD+ R+LCLK+TGN T K KK+ Sbjct: 387 KDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKN 424