BLASTX nr result
ID: Atractylodes22_contig00014990
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Atractylodes22_contig00014990 (1817 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] 584 e-164 ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C... 570 e-160 ref|XP_002510459.1| cysteine protease, putative [Ricinus communi... 562 e-158 ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|2... 561 e-157 gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A... 555 e-155 >dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] Length = 441 Score = 584 bits (1505), Expect = e-164 Identities = 273/410 (66%), Positives = 325/410 (79%), Gaps = 1/410 (0%) Frame = -3 Query: 1587 YLFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEF 1408 +LF+ WCQQHGK+Y+++EEKL+RL VFQDNY ++ H SYTLSLNAFADLTHHEF Sbjct: 28 HLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHHEF 87 Query: 1407 KLARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSF 1228 K +RL SA+S S ++ R NR + + D+P S+DWR+ GAVT VKDQG+CGACWSF Sbjct: 88 KASRLGLSSAASASLNVDRSNR--QIPDFVADVPASVDWRKNGAVTQVKDQGNCGACWSF 145 Query: 1227 SATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDY 1048 SATGA+EGIN+IVTGSL+SLSEQELVDCD+S+N+GC+GG+MDYA++FVI NHGIDTEEDY Sbjct: 146 SATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEEDY 205 Query: 1047 PYQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSK 868 PYQGR+ SCNK K R+VVTIDGY D+P+N+E +LLKAV QPVSVGICGSERAFQLYSK Sbjct: 206 PYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYSK 265 Query: 867 GVFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGI 688 G+FTGPCSTSLDHAVLIVGY S++GVDYWI+KNSWG+ WGMDGYM+M RN+G S GLCGI Sbjct: 266 GIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRGLCGI 325 Query: 687 NMLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVC 508 NMLASY P +C+LF+ C EGETCCC GIC W CCEL+++VC Sbjct: 326 NMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKCCELDSAVC 385 Query: 507 CKDHHHCCPSDYPICDSERNLCLKQTGNGT-VAKQAKNSIFGKSGGRSSL 361 CKD HCCP DYP+CD+ RN+CLK GN T + K AKNS GK SSL Sbjct: 386 CKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAKNSSSGKFRSWSSL 435 >ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 431 Score = 570 bits (1470), Expect = e-160 Identities = 274/408 (67%), Positives = 316/408 (77%), Gaps = 1/408 (0%) Frame = -3 Query: 1584 LFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEFK 1405 LF+ WC +HGKSYS+ EEKLYRL VF DNY ++ H SYTLSLN++ADLTHHEFK Sbjct: 28 LFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTHHEFK 87 Query: 1404 LARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSFS 1225 ++RL A R + L + SL D+P SLDWR+KGAVT VKDQGSCGACWSFS Sbjct: 88 VSRLGFSPALRNFRPV--LPQEPSL---PRDVPDSLDWRKKGAVTAVKDQGSCGACWSFS 142 Query: 1224 ATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDYP 1045 ATGAMEGINQI+TGSL+SLSEQEL+DCDRS+N GC GGLMDYAY+FVI NHGIDTE DYP Sbjct: 143 ATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDTENDYP 202 Query: 1044 YQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSKG 865 YQ R+ SC K+K RNVVTIDGY DIP NDE +LL+AV QPVSVGICGSERAFQLYSKG Sbjct: 203 YQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLYSKG 262 Query: 864 VFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGIN 685 +F+GPCSTSLDHAVLIVGY S++GVDYWI+KNSWG SWGMDGYM+M RN+G+S G+CGIN Sbjct: 263 IFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEGVCGIN 322 Query: 684 MLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVCC 505 LASY P KC++ + CA GETCCCA KFLG+C W CC L+++VCC Sbjct: 323 KLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKCCGLSSAVCC 382 Query: 504 KDHHHCCPSDYPICDSERNLCLKQTGNGTVAKQAKN-SIFGKSGGRSS 364 KD HCCP DYPICD++RNLCLKQT NGT + +N S G SG SS Sbjct: 383 KDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENRSSSGSSGTWSS 430 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 562 bits (1449), Expect = e-158 Identities = 257/383 (67%), Positives = 313/383 (81%) Frame = -3 Query: 1584 LFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEFK 1405 LF+ W ++HGK+Y+++E+KLYR +F++NY ++ +H SYTLSLNAFADLTHHEFK Sbjct: 31 LFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADLTHHEFK 90 Query: 1404 LARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSFS 1225 +RL GLSA S S L R R L + D+P S+DWR+KGAV+ VKDQG+CGACWSFS Sbjct: 91 ASRL-GLSAFSTSGKLSR--RNFPLHDFVGDVPISIDWRKKGAVSQVKDQGNCGACWSFS 147 Query: 1224 ATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDYP 1045 ATGA+EGIN+IVTGSL+SLSEQELVDCDRS+N+GC+GGLMDYAY+FVI+N+GIDTEEDYP Sbjct: 148 ATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGIDTEEDYP 207 Query: 1044 YQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSKG 865 YQ RE +CNK K R+VVTIDGY D+P+N+E +LLKAV QPVSVGICGSERAFQLYSKG Sbjct: 208 YQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSKG 267 Query: 864 VFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGIN 685 +FTGPCSTSLDHAVLIVGY S++GVDYWI+KNSWGT WG++GYMYM RN+G+S GLCGIN Sbjct: 268 IFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGIN 327 Query: 684 MLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVCC 505 MLAS+ P KC+LF+ C EGETCCC + G+CF W CCEL+++VCC Sbjct: 328 MLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKCCELDSAVCC 387 Query: 504 KDHHHCCPSDYPICDSERNLCLK 436 KD HCCP DYP+CD++RN+CLK Sbjct: 388 KDGLHCCPHDYPVCDTKRNMCLK 410 >ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa] Length = 436 Score = 561 bits (1447), Expect = e-157 Identities = 257/412 (62%), Positives = 318/412 (77%) Frame = -3 Query: 1584 LFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEFK 1405 LF+ WC++HGKSY+++EE+ +RL VF+DNY ++ +H SY+L+LNAFADLTHHEFK Sbjct: 28 LFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEFK 87 Query: 1404 LARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSFS 1225 +RL GLSA+ L +R + D+P S+DWR KG VT VKDQGSCGACWSFS Sbjct: 88 TSRL-GLSAAP----LNLAHRNLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGACWSFS 142 Query: 1224 ATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDYP 1045 ATGA+EGIN+IVTGSL+SLSEQEL++CD+S+NDGC GGLMDYA++FVI NHGIDTEEDYP Sbjct: 143 ATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEEDYP 202 Query: 1044 YQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSKG 865 Y+ R+ +CNK++ R VVTID Y D+PEN+E QLL+AV QPVSVGICGSERAFQ+YSKG Sbjct: 203 YRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSKG 262 Query: 864 VFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGIN 685 +FTGPCSTSLDHAVLIVGY S++GVDYWI+KNSWGT WGM GYM+M RN+G+S G+CGIN Sbjct: 263 IFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGIN 322 Query: 684 MLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVCC 505 MLASY P KCNL ++CA GETCCCA KF GIC W CC L+++VCC Sbjct: 323 MLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKCCGLDSAVCC 382 Query: 504 KDHHHCCPSDYPICDSERNLCLKQTGNGTVAKQAKNSIFGKSGGRSSLHQQY 349 KD HCCP DYP+CD+++N+C K+ GN T + + GK G SL + + Sbjct: 383 KDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGKTSGKFGSWISLPEAW 434 >gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] Length = 437 Score = 555 bits (1430), Expect = e-155 Identities = 257/405 (63%), Positives = 314/405 (77%) Frame = -3 Query: 1584 LFQQWCQQHGKSYSTEEEKLYRLNVFQDNYAYILRHXXXXXXSYTLSLNAFADLTHHEFK 1405 LF WCQ+HGK+Y +EEE+ R+ +F+DN+ ++ +H +Y+LSLNAFADLTHHEFK Sbjct: 31 LFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFK 90 Query: 1404 LARLRGLSASSPSRDLIRLNRGSSLIESSNDLPKSLDWREKGAVTPVKDQGSCGACWSFS 1225 +RL GLS S+PS +I ++G SL S +P S+DWR+KGAVT VKDQGSCGACWSFS Sbjct: 91 ASRL-GLSVSAPS--VIMASKGQSL-GGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146 Query: 1224 ATGAMEGINQIVTGSLLSLSEQELVDCDRSFNDGCDGGLMDYAYEFVIKNHGIDTEEDYP 1045 ATGAMEGINQIVTG L+SLSEQEL+DCD+S+N GC+GGLMDYA+EFVIKNHGIDTE+DYP Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206 Query: 1044 YQGREASCNKNKRNRNVVTIDGYNDIPENDEDQLLKAVVTQPVSVGICGSERAFQLYSKG 865 YQ R+ +C K+K + VVTID Y + NDE L++AV QPVSVGICGSERAFQLYS+G Sbjct: 207 YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSRG 266 Query: 864 VFTGPCSTSLDHAVLIVGYDSKDGVDYWIIKNSWGTSWGMDGYMYMARNTGDSHGLCGIN 685 +F+GPCSTSLDHAVLIVGY S++GVDYWI+KNSWG SWGMDG+M+M RNT +S G+CGIN Sbjct: 267 IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGIN 326 Query: 684 MLASYXXXXXXXXXXXXXXXPVKCNLFSWCAEGETCCCATKFLGICFKWMCCELNASVCC 505 MLASY P KCNLF++C+ GETCCCA + G+CF W CCE+ ++VCC Sbjct: 327 MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCC 386 Query: 504 KDHHHCCPSDYPICDSERNLCLKQTGNGTVAKQAKNSIFGKSGGR 370 KD HCCP DYP+CD+ R+LCLK+TGN T K K GR Sbjct: 387 KDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSSKQLGR 431