BLASTX nr result
ID: Cnidium21_contig00017427
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cnidium21_contig00017427 (1674 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] 569 e-160 ref|XP_002510459.1| cysteine protease, putative [Ricinus communi... 558 e-156 ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|2... 557 e-156 ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C... 544 e-152 gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A... 541 e-151 >dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] Length = 441 Score = 569 bits (1467), Expect = e-160 Identities = 273/405 (67%), Positives = 317/405 (78%), Gaps = 4/405 (0%) Frame = +1 Query: 145 IYSSTTSD---LFETWCISHGKTYSSQEEKLYRLKIFEENYMYVTQHNKNNDIAANSSLS 315 ++SS++S+ LFETWC HGKTY+SQEEKL+RLK+F++NY +VT+HN + + S Sbjct: 18 LFSSSSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHN------SQGNSS 71 Query: 316 YTLSIDNAFADLTHQEFKASRLGLSSTGIIRMNLGGSSEG-SDGVTNVPTSLDWRDKGAV 492 YTLS+ NAFADLTH EFKASRLGLSS +N+ S+ D V +VP S+DWR GAV Sbjct: 72 YTLSL-NAFADLTHHEFKASRLGLSSAASASLNVDRSNRQIPDFVADVPASVDWRKNGAV 130 Query: 493 TNVKDQGSCGACWSFSATGAIEGINQIVTGSLTSLSEQELVDCDRSYNDGCEGGLMDYAY 672 T VKDQG+CGACWSFSATGAIEGIN+IVTGSL SLSEQELVDCD+SYN+GCEGG+MDYA+ Sbjct: 131 TQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAF 190 Query: 673 QFVVKNKGIDTEDDYPYQSRDMTCNKNKLNRHVVTIDGYIDVRENDEKQLLAAVAAQPVS 852 QFV+ N GIDTE+DYPYQ RD +CNK KL RHVVTIDGY+DV +N+EK+LL AVA QPVS Sbjct: 191 QFVIDNHGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVS 250 Query: 853 VGICGSERNFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKQWGMNGYM 1032 VGICGSER FQLYSKGIF GPCSTSLDHAVLIVGYGSENGVDYWI+KNSWG WGM+GYM Sbjct: 251 VGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYM 310 Query: 1033 HMQRNSGNSQGICGINMMASYXXXXXXXXXXXXXXXXXKCSLLTSCSEGETCCCARTLFG 1212 HMQRNSG+S+G+CGINM+ASY +C L T C EGETCCC +FG Sbjct: 311 HMQRNSGSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFG 370 Query: 1213 ICLSWKCCELNSAVXXXXXXXXXXXXYPICDTKRNMCLKQTGNYT 1347 ICLSWKCCEL+SAV YP+CDT RN+CLK GN T Sbjct: 371 ICLSWKCCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNAT 415 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 558 bits (1437), Expect = e-156 Identities = 270/393 (68%), Positives = 305/393 (77%) Frame = +1 Query: 151 SSTTSDLFETWCISHGKTYSSQEEKLYRLKIFEENYMYVTQHNKNNDIAANSSLSYTLSI 330 SS S LFE+W HGKTY+S+E+KLYR KIFEENY +V +HN + + SYTLS+ Sbjct: 25 SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHN------SQGNSSYTLSL 78 Query: 331 DNAFADLTHQEFKASRLGLSSTGIIRMNLGGSSEGSDGVTNVPTSLDWRDKGAVTNVKDQ 510 NAFADLTH EFKASRLGLS+ + D V +VP S+DWR KGAV+ VKDQ Sbjct: 79 -NAFADLTHHEFKASRLGLSAFSTSGKLSRRNFPLHDFVGDVPISIDWRKKGAVSQVKDQ 137 Query: 511 GSCGACWSFSATGAIEGINQIVTGSLTSLSEQELVDCDRSYNDGCEGGLMDYAYQFVVKN 690 G+CGACWSFSATGAIEGIN+IVTGSL SLSEQELVDCDRSYN+GCEGGLMDYAYQFV++N Sbjct: 138 GNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIEN 197 Query: 691 KGIDTEDDYPYQSRDMTCNKNKLNRHVVTIDGYIDVRENDEKQLLAAVAAQPVSVGICGS 870 GIDTE+DYPYQ+R+ TCNK KL RHVVTIDGY DV +N+EK+LL AVAAQPVSVGICGS Sbjct: 198 NGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGS 257 Query: 871 ERNFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKQWGMNGYMHMQRNS 1050 ER FQLYSKGIF GPCSTSLDHAVLIVGYGSENGVDYWI+KNSWG WG+NGYM+M RNS Sbjct: 258 ERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNS 317 Query: 1051 GNSQGICGINMMASYXXXXXXXXXXXXXXXXXKCSLLTSCSEGETCCCARTLFGICLSWK 1230 GNSQG+CGINM+AS+ KC L T C EGETCCC R +FG+C SWK Sbjct: 318 GNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWK 377 Query: 1231 CCELNSAVXXXXXXXXXXXXYPICDTKRNMCLK 1329 CCEL+SAV YP+CDTKRNMCLK Sbjct: 378 CCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410 >ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa] Length = 436 Score = 557 bits (1435), Expect = e-156 Identities = 269/400 (67%), Positives = 309/400 (77%), Gaps = 1/400 (0%) Frame = +1 Query: 151 SSTTSDLFETWCISHGKTYSSQEEKLYRLKIFEENYMYVTQHNKNNDIAANSSLSYTLSI 330 SS S LFETWC HGK+Y+SQEE+ +RLK+FE+NY +VT+HN NSS S L Sbjct: 22 SSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKG----NSSYSLAL-- 75 Query: 331 DNAFADLTHQEFKASRLGLSSTGIIRMNLGGSSEGSDGVT-NVPTSLDWRDKGAVTNVKD 507 NAFADLTH EFK SRLGLS+ + NL + GV ++P S+DWR+KG VTNVKD Sbjct: 76 -NAFADLTHHEFKTSRLGLSAAPL---NLAHRNLEITGVVGDIPASIDWRNKGVVTNVKD 131 Query: 508 QGSCGACWSFSATGAIEGINQIVTGSLTSLSEQELVDCDRSYNDGCEGGLMDYAYQFVVK 687 QGSCGACWSFSATGAIEGIN+IVTGSL SLSEQEL++CD+SYNDGC GGLMDYA+QFV+ Sbjct: 132 QGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVIN 191 Query: 688 NKGIDTEDDYPYQSRDMTCNKNKLNRHVVTIDGYIDVRENDEKQLLAAVAAQPVSVGICG 867 N GIDTE+DYPY++RD TCNK+++ R VVTID Y+DV EN+EKQLL AVAAQPVSVGICG Sbjct: 192 NHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICG 251 Query: 868 SERNFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKQWGMNGYMHMQRN 1047 SER FQ+YSKGIF GPCSTSLDHAVLIVGYGSENGVDYWI+KNSWG WGM GYMHMQRN Sbjct: 252 SERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRN 311 Query: 1048 SGNSQGICGINMMASYXXXXXXXXXXXXXXXXXKCSLLTSCSEGETCCCARTLFGICLSW 1227 SGNSQG+CGINM+ASY KC+LLT C+ GETCCCAR FGIC+SW Sbjct: 312 SGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISW 371 Query: 1228 KCCELNSAVXXXXXXXXXXXXYPICDTKRNMCLKQTGNYT 1347 KCC L+SAV YP+CDT +NMC K+ GN T Sbjct: 372 KCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNAT 411 >ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 431 Score = 544 bits (1402), Expect = e-152 Identities = 266/399 (66%), Positives = 303/399 (75%) Frame = +1 Query: 151 SSTTSDLFETWCISHGKTYSSQEEKLYRLKIFEENYMYVTQHNKNNDIAANSSLSYTLSI 330 +S S+LFE WC HGK+YSS EEKLYRL +F +NY +VT HN ++ SYTLS+ Sbjct: 22 TSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNS------SYTLSL 75 Query: 331 DNAFADLTHQEFKASRLGLSSTGIIRMNLGGSSEGSDGVTNVPTSLDWRDKGAVTNVKDQ 510 N++ADLTH EFK SRLG S +R + +VP SLDWR KGAVT VKDQ Sbjct: 76 -NSYADLTHHEFKVSRLGFSPA--LRNFRPVLPQEPSLPRDVPDSLDWRKKGAVTAVKDQ 132 Query: 511 GSCGACWSFSATGAIEGINQIVTGSLTSLSEQELVDCDRSYNDGCEGGLMDYAYQFVVKN 690 GSCGACWSFSATGA+EGINQI+TGSL SLSEQEL+DCDRSYN GC GGLMDYAYQFV+ N Sbjct: 133 GSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISN 192 Query: 691 KGIDTEDDYPYQSRDMTCNKNKLNRHVVTIDGYIDVRENDEKQLLAAVAAQPVSVGICGS 870 GIDTE+DYPYQ+RD +C K+KL R+VVTIDGY D+ NDE +LL AVAAQPVSVGICGS Sbjct: 193 HGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGS 252 Query: 871 ERNFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKQWGMNGYMHMQRNS 1050 ER FQLYSKGIF+GPCSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GYMHMQRNS Sbjct: 253 ERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNS 312 Query: 1051 GNSQGICGINMMASYXXXXXXXXXXXXXXXXXKCSLLTSCSEGETCCCARTLFGICLSWK 1230 GNS+G+CGIN +ASY KCS+LTSC+ GETCCCA+ G+CLSWK Sbjct: 313 GNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWK 372 Query: 1231 CCELNSAVXXXXXXXXXXXXYPICDTKRNMCLKQTGNYT 1347 CC L+SAV YPICDT RN+CLKQT N T Sbjct: 373 CCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGT 411 >gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] Length = 437 Score = 541 bits (1393), Expect = e-151 Identities = 260/401 (64%), Positives = 306/401 (76%), Gaps = 2/401 (0%) Frame = +1 Query: 151 SSTTSDLFETWCISHGKTYSSQEEKLYRLKIFEENYMYVTQHNKNNDIAANSSLSYTLSI 330 S S+LF+ WC HGKTY S+EE+ R++IF++N+ +VTQHN ++ +Y+LS+ Sbjct: 25 SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHN------LITNATYSLSL 78 Query: 331 DNAFADLTHQEFKASRLGLS--STGIIRMNLGGSSEGSDGVTNVPTSLDWRDKGAVTNVK 504 NAFADLTH EFKASRLGLS + +I + G S GS VP S+DWR KGAVTNVK Sbjct: 79 -NAFADLTHHEFKASRLGLSVSAPSVIMASKGQSLGGS---VKVPDSVDWRKKGAVTNVK 134 Query: 505 DQGSCGACWSFSATGAIEGINQIVTGSLTSLSEQELVDCDRSYNDGCEGGLMDYAYQFVV 684 DQGSCGACWSFSATGA+EGINQIVTG L SLSEQEL+DCD+SYN GC GGLMDYA++FV+ Sbjct: 135 DQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVI 194 Query: 685 KNKGIDTEDDYPYQSRDMTCNKNKLNRHVVTIDGYIDVRENDEKQLLAAVAAQPVSVGIC 864 KN GIDTE DYPYQ RD TC K+KL + VVTID Y V+ NDEK L+ AVAAQPVSVGIC Sbjct: 195 KNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGIC 254 Query: 865 GSERNFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKQWGMNGYMHMQR 1044 GSER FQLYS+GIF+GPCSTSLDHAVLIVGYGS+NGVDYWI+KNSWGK WGM+G+MHMQR Sbjct: 255 GSERAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQR 314 Query: 1045 NSGNSQGICGINMMASYXXXXXXXXXXXXXXXXXKCSLLTSCSEGETCCCARTLFGICLS 1224 N+ NS G+CGINM+ASY KC+L T CS GETCCCAR LFG+C S Sbjct: 315 NTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFS 374 Query: 1225 WKCCELNSAVXXXXXXXXXXXXYPICDTKRNMCLKQTGNYT 1347 WKCCE+ SAV YP+CDT R++CLK+TGN+T Sbjct: 375 WKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFT 415