BLASTX nr result
ID: Cnidium21_contig00001071
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cnidium21_contig00001071 (1807 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAD29957.1| cysteine protease [Daucus carota] 699 0.0 dbj|BAD29960.1| cysteine protease [Daucus carota] 634 e-179 dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus] 620 e-175 dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus] 610 e-172 dbj|BAD29958.1| cysteine protease [Daucus carota] 608 e-171 >dbj|BAD29957.1| cysteine protease [Daucus carota] Length = 437 Score = 699 bits (1803), Expect = 0.0 Identities = 334/412 (81%), Positives = 353/412 (85%), Gaps = 3/412 (0%) Frame = +2 Query: 218 ASDMSIITYDETHKPSTNSLIRTDDEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLR 397 ASDMSII YD+TH TNSLIRTDDEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLR Sbjct: 22 ASDMSIINYDQTH---TNSLIRTDDEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLR 78 Query: 398 YIDNHKADPDRSYELGLNKFADLTNEEYRSKYMGTKSRDSRPNLSKGKSDRYAPVAGETL 577 YIDNH ADPDRSYELGLN+FADLTNEEYR+KY+GTKSR+SRP LSKG SDRYAPV GE L Sbjct: 79 YIDNHNADPDRSYELGLNRFADLTNEEYRAKYLGTKSRESRPKLSKGPSDRYAPVEGEEL 138 Query: 578 PDSIDWREKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGDLITLSEQELVDCDKSYN 757 PDSIDWREKGAVAAVKDQG CGSCWAFSAI +VEGINQI+TG+LITLSEQELVDCD+SYN Sbjct: 139 PDSIDWREKGAVAAVKDQGSCGSCWAFSAIGAVEGINQITTGELITLSEQELVDCDRSYN 198 Query: 758 AGCEGGLMDYAFQFIIKNGGIDSDKDYPYTGRDGSCDKNKKNAKVVTIDSYEDVPVYDEK 937 GCEGGLMDYAF FIIKNGGIDSD DYPYTGRDG+C++NK+NAKVVTIDSYEDVPVYDEK Sbjct: 199 EGCEGGLMDYAFNFIIKNGGIDSDLDYPYTGRDGTCNQNKENAKVVTIDSYEDVPVYDEK 258 Query: 938 ALQKAVANQPISVAIEAGGIDFQLYVSGIFTGKCGTNVDHXXXXXXXXXXXXKDYWIVRN 1117 ALQKA ANQPISVAIEAGG+DFQLYVSGIFTGKCGT VDH DYWIVRN Sbjct: 259 ALQKAAANQPISVAIEAGGMDFQLYVSGIFTGKCGTAVDHGVVVVGYGSEEGMDYWIVRN 318 Query: 1118 SWGAAWGEAGYLRMERNVANKPSGLCGITIEPSY---XXXXXXXXXXXXXXXXXXXXXXX 1288 SWGAAWGEAGYL+M+RNV K SGLCGITIEPSY Sbjct: 319 SWGAAWGEAGYLKMQRNV-GKSSGLCGITIEPSYPVKNGDNPPNPGPTPPSPPSPSLPDN 377 Query: 1289 VCDEYSSCPAHTTCCCLYTYGKQCFFWGCCPLEAASCCDDGYSCCPHDYPVC 1444 VCD Y+SCPAHTTCCCLYT+GKQCF+WGCCPLEAASCCDDGYSCCPHDYPVC Sbjct: 378 VCDAYTSCPAHTTCCCLYTFGKQCFYWGCCPLEAASCCDDGYSCCPHDYPVC 429 >dbj|BAD29960.1| cysteine protease [Daucus carota] Length = 460 Score = 634 bits (1636), Expect = e-179 Identities = 308/449 (68%), Positives = 346/449 (77%), Gaps = 6/449 (1%) Frame = +2 Query: 218 ASDMSIITYDETHKPSTNSLIRTDDEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLR 397 A+DMSIITYD+TH + TDD +M Y SWLVKHGKSYNALGEKE RFQIFKDN Sbjct: 19 AADMSIITYDQTHAVGS-----TDDVIMAAYESWLVKHGKSYNALGEKEQRFQIFKDNFL 73 Query: 398 YIDNHKADPDRSYELGLNKFADLTNEEYRSKYMGTKSRDSRPNLSKGKSDRYAPVAGETL 577 YID A DRS++LGLN+FADLTNEEYRSKY G +++DSR +S GKS RYA +AGE+L Sbjct: 74 YIDEQNAAKDRSFKLGLNRFADLTNEEYRSKYTGIRTKDSRKKVS-GKSQRYASLAGESL 132 Query: 578 PDSIDWREKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGDLITLSEQELVDCDKSYN 757 P+S+DWRE GAVA+VKDQG CGSCWAFS I++VEGINQI+TG LITLSEQELVDCD+SYN Sbjct: 133 PESVDWREHGAVASVKDQGQCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYN 192 Query: 758 AGCEGGLMDYAFQFIIKNGGIDSDKDYPYTGRDGSCDKNKKNAKVVTIDSYEDVPVYDEK 937 GC GGLMD AFQFII NGGIDSD DYPYTGRDG CD+ +KNAKVVTIDSYEDVP YDEK Sbjct: 193 EGCNGGLMDDAFQFIINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEK 252 Query: 938 ALQKAVANQPISVAIEAGGIDFQLYVSGIFTGKCGTNVDHXXXXXXXXXXXXKDYWIVRN 1117 ALQKA ANQPISVAIEA G DFQ Y SGIFTGKCGT++DH KDYWIVRN Sbjct: 253 ALQKAAANQPISVAIEASGRDFQFYDSGIFTGKCGTDLDHGVVVVGYGTENGKDYWIVRN 312 Query: 1118 SWGAAWGEAGYLRMERNVANKPSGLCGITIEPSYXXXXXXXXXXXXXXXXXXXXXXXVCD 1297 SWGA WGE GYLRMER +++K +G+CGIT EPSY VCD Sbjct: 313 SWGADWGEKGYLRMERGISSK-AGICGITSEPSYPVKSGVNPPNPGPSPPSPKSPESVCD 371 Query: 1298 EYSSCPAHTTCCCLYTYGKQCFFWGCCPLEAASCCDDGYSCCPHDYPVCHVYSGTCSMSA 1477 EY +CP TTCCC+Y Y CF WGCCPLE ASCCDDGYSCCPHDYPVC+V +GTCSMS Sbjct: 372 EYYTCPMSTTCCCMYEYYGYCFAWGCCPLEGASCCDDGYSCCPHDYPVCNVRAGTCSMSN 431 Query: 1478 NSPLGVKALKRTLATP------IGRKVSA 1546 N+PLGVKA++R LATP G+KV+A Sbjct: 432 NNPLGVKAIQRILATPNWQHGSKGKKVTA 460 >dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus] Length = 462 Score = 620 bits (1600), Expect = e-175 Identities = 292/436 (66%), Positives = 337/436 (77%) Frame = +2 Query: 218 ASDMSIITYDETHKPSTNSLIRTDDEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLR 397 A DMSII YD TH ++ S RTDDEVM MY SWLVKHGKSYNALGEKE RFQIFKDNLR Sbjct: 22 ALDMSIINYDATH--ASKSSWRTDDEVMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLR 79 Query: 398 YIDNHKADPDRSYELGLNKFADLTNEEYRSKYMGTKSRDSRPNLSKGKSDRYAPVAGETL 577 +ID H A+ + SY++GLN+FADLTNEEYRS Y+G KS+ P LSK KSDRYAP G++L Sbjct: 80 FIDEHNAEENLSYKVGLNRFADLTNEEYRSTYLGAKSK---PKLSKVKSDRYAPRVGDSL 136 Query: 578 PDSIDWREKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGDLITLSEQELVDCDKSYN 757 P+S+DWR KGAVA +KDQG CGSCWAFS + +VEGINQI TG+LITLSEQELVDCDKSYN Sbjct: 137 PESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDCDKSYN 196 Query: 758 AGCEGGLMDYAFQFIIKNGGIDSDKDYPYTGRDGSCDKNKKNAKVVTIDSYEDVPVYDEK 937 GC+GGLMDY F+FII NGGID+DKDYPY GRD CD+ +KNAKVVTIDSYEDVPV +E+ Sbjct: 197 EGCDGGLMDYGFEFIINNGGIDTDKDYPYLGRDARCDQYRKNAKVVTIDSYEDVPVNNEE 256 Query: 938 ALQKAVANQPISVAIEAGGIDFQLYVSGIFTGKCGTNVDHXXXXXXXXXXXXKDYWIVRN 1117 AL+KAVA+QP+SV IE GG FQ Y SGIFTGKCGT +DH KDYWIVRN Sbjct: 257 ALKKAVASQPVSVGIEGGGRAFQFYDSGIFTGKCGTALDHGVNVVGYGTEKGKDYWIVRN 316 Query: 1118 SWGAAWGEAGYLRMERNVANKPSGLCGITIEPSYXXXXXXXXXXXXXXXXXXXXXXXVCD 1297 SWG++WGEAGY+RMERN+A G CGI +EPSY VCD Sbjct: 317 SWGSSWGEAGYIRMERNLAGTSVGKCGIAMEPSYPLKNGQNPPNPGPSPPTPVRPPTVCD 376 Query: 1298 EYSSCPAHTTCCCLYTYGKQCFFWGCCPLEAASCCDDGYSCCPHDYPVCHVYSGTCSMSA 1477 +Y +CP +TCCC+Y Y CF WGCCPL+ A+CCDD YSCCPHDYPVC+V +GTCSMS Sbjct: 377 DYYTCPESSTCCCVYEYYGYCFSWGCCPLDGATCCDDHYSCCPHDYPVCNVQAGTCSMSK 436 Query: 1478 NSPLGVKALKRTLATP 1525 N+PLGVKA++R LATP Sbjct: 437 NNPLGVKAIQRILATP 452 >dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus] Length = 461 Score = 610 bits (1572), Expect = e-172 Identities = 288/438 (65%), Positives = 333/438 (76%), Gaps = 2/438 (0%) Frame = +2 Query: 218 ASDMSIITYDETHKPSTNSL--IRTDDEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDN 391 A DMSII YD TH S++S +RTDDEV +Y SWLVKHGK+YNALGEK+ RFQIFKDN Sbjct: 20 AMDMSIINYDATHMSSSSSSAPLRTDDEVNALYESWLVKHGKTYNALGEKDRRFQIFKDN 79 Query: 392 LRYIDNHKADPDRSYELGLNKFADLTNEEYRSKYMGTKSRDSRPNLSKGKSDRYAPVAGE 571 LR+ID H + D +Y+LGLNKFADLTNEEYR Y G K+ D + LSK KSDRYA +G+ Sbjct: 80 LRFIDEHNSG-DHTYKLGLNKFADLTNEEYRMTYTGIKTIDDKKKLSKMKSDRYAYRSGD 138 Query: 572 TLPDSIDWREKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGDLITLSEQELVDCDKS 751 +LP+ +DWRE+GAV VKDQG CGSCWAFS SVEG+N+I TGDLI++SEQELV+CD S Sbjct: 139 SLPEYVDWREQGAVTDVKDQGSCGSCWAFSTTGSVEGVNKIVTGDLISVSEQELVNCDTS 198 Query: 752 YNAGCEGGLMDYAFQFIIKNGGIDSDKDYPYTGRDGSCDKNKKNAKVVTIDSYEDVPVYD 931 YN GC GGLMDYAF+FIIKNGGID+++DYPYTG+DG CDKNKKNAKVVTIDSYEDVPV D Sbjct: 199 YNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYTGKDGKCDKNKKNAKVVTIDSYEDVPVND 258 Query: 932 EKALQKAVANQPISVAIEAGGIDFQLYVSGIFTGKCGTNVDHXXXXXXXXXXXXKDYWIV 1111 E +L+KAV+NQP++VAIEAGG DFQ Y SGIFTG CGT +DH KDYW+V Sbjct: 259 ESSLKKAVSNQPVAVAIEAGGRDFQFYTSGIFTGSCGTALDHGVLAAGYGTEDGKDYWLV 318 Query: 1112 RNSWGAAWGEAGYLRMERNVANKPSGLCGITIEPSYXXXXXXXXXXXXXXXXXXXXXXXV 1291 +NSWGA WGE GYL+MERN+A+K SG CGI +E SY V Sbjct: 319 KNSWGAEWGEGGYLKMERNIADK-SGKCGIAMEASYPIKNGDNPPNPGPTPPSPAAPEVV 377 Query: 1292 CDEYSSCPAHTTCCCLYTYGKQCFFWGCCPLEAASCCDDGYSCCPHDYPVCHVYSGTCSM 1471 CDEYS+CP TTCCC+Y Y CF WGCCPLE ASCCDD YSCCPHDYP+C+V GTCS Sbjct: 378 CDEYSTCPESTTCCCIYEYYGYCFAWGCCPLEGASCCDDHYSCCPHDYPICNVRRGTCSK 437 Query: 1472 SANSPLGVKALKRTLATP 1525 S NSPL + A KR LATP Sbjct: 438 SRNSPLEISATKRILATP 455 >dbj|BAD29958.1| cysteine protease [Daucus carota] Length = 496 Score = 608 bits (1569), Expect = e-171 Identities = 290/427 (67%), Positives = 326/427 (76%) Frame = +2 Query: 218 ASDMSIITYDETHKPSTNSLIRTDDEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLR 397 A+DMSIITYDETH +TDDE T++ SWLV HGKSYNALGE+E RFQIFK+NLR Sbjct: 19 ATDMSIITYDETHAVG----FKTDDEATTLFESWLVTHGKSYNALGEEEKRFQIFKNNLR 74 Query: 398 YIDNHKADPDRSYELGLNKFADLTNEEYRSKYMGTKSRDSRPNLSKGKSDRYAPVAGETL 577 YID DR ++LGLNKFADLTNEEYRSKY G KS+D R +S KS RYA ++GE+L Sbjct: 75 YIDEQNLVEDRGFKLGLNKFADLTNEEYRSKYTGIKSKDLRKKVS-AKSGRYATLSGESL 133 Query: 578 PDSIDWREKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGDLITLSEQELVDCDKSYN 757 P+S+DWRE GAVA VKDQG CGSCWAFS I++VEGINQI+TG LITLSEQELVDCD+SYN Sbjct: 134 PESVDWRESGAVATVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYN 193 Query: 758 AGCEGGLMDYAFQFIIKNGGIDSDKDYPYTGRDGSCDKNKKNAKVVTIDSYEDVPVYDEK 937 GC GGLMDYAF+FII NGGID+D DYPYTGRDG CD+ +KNAKVVTIDSYEDVP YDE Sbjct: 194 EGCNGGLMDYAFEFIINNGGIDTDVDYPYTGRDGKCDQYRKNAKVVTIDSYEDVPAYDEL 253 Query: 938 ALQKAVANQPISVAIEAGGIDFQLYVSGIFTGKCGTNVDHXXXXXXXXXXXXKDYWIVRN 1117 AL+KA ANQPISVAIEA G DFQ Y SGIFTGKCG +DH KDYWIVRN Sbjct: 254 ALKKAAANQPISVAIEASGRDFQFYDSGIFTGKCGIALDHGVVVVGYGTENGKDYWIVRN 313 Query: 1118 SWGAAWGEAGYLRMERNVANKPSGLCGITIEPSYXXXXXXXXXXXXXXXXXXXXXXXVCD 1297 SWGA WGE GYLRMER +++K +G+CGI IEPSY VCD Sbjct: 314 SWGADWGENGYLRMERGISSK-TGICGIAIEPSYPVKTGVNPPNPGPSPPTPKTPESVCD 372 Query: 1298 EYSSCPAHTTCCCLYTYGKQCFFWGCCPLEAASCCDDGYSCCPHDYPVCHVYSGTCSMSA 1477 EY +CP TTCCC+Y Y CF WGCCPLE ASCCDDGYSCCPHDYPVC+V +GTCSM Sbjct: 373 EYYTCPMSTTCCCMYEYYGYCFAWGCCPLEGASCCDDGYSCCPHDYPVCNVRAGTCSMKY 432 Query: 1478 NSPLGVK 1498 N+PLGV+ Sbjct: 433 NNPLGVR 439