BLASTX nr result
ID: Atractylodes21_contig00008568
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Atractylodes21_contig00008568 (1862 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus] 741 0.0 dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus] 675 0.0 dbj|BAD29960.1| cysteine protease [Daucus carota] 632 e-178 gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta] 630 e-178 gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa] 630 e-178 >dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus] Length = 461 Score = 741 bits (1913), Expect = 0.0 Identities = 355/511 (69%), Positives = 399/511 (78%) Frame = -2 Query: 1783 MKLLAMTILLFFALFAVSSAMDMSIIGYDATHMTTADASSSSWRTDDEVNAMYESWLVKH 1604 MKL+ M L FFAL ++ SAMDMSII YDATHM+++ +SS+ RTDDEVNA+YESWLVKH Sbjct: 1 MKLIPMATLSFFALISIISAMDMSIINYDATHMSSS-SSSAPLRTDDEVNALYESWLVKH 59 Query: 1603 GKFYNALGEKERRFQXXXXXXXXXXXXXXXXXXXRFQIFKDNLRFIDHHNSGDHSYKLGL 1424 GK YNALGEK+RRFQ IFKDNLRFID HNSGDH+YKLGL Sbjct: 60 GKTYNALGEKDRRFQ----------------------IFKDNLRFIDEHNSGDHTYKLGL 97 Query: 1423 NKFADLSNEEYRSTYTGAKTIDSKRKLNNVKSDRYSPRSDDVLPDFVDWRSKGAVAAVKD 1244 NKFADL+NEEYR TYTG KTID K+KL+ +KSDRY+ RS D LP++VDWR +GAV VKD Sbjct: 98 NKFADLTNEEYRMTYTGIKTIDDKKKLSKMKSDRYAYRSGDSLPEYVDWREQGAVTDVKD 157 Query: 1243 QGSCXXXXXXXXXXXXXXXXXXXXXKDQGSCGSCWAFSTIGSVEGINQIVTGEMITLSEQ 1064 QGS CGSCWAFST GSVEG+N+IVTG++I++SEQ Sbjct: 158 QGS---------------------------CGSCWAFSTTGSVEGVNKIVTGDLISVSEQ 190 Query: 1063 ELVECDTSYNQGCNGGLMDYAFEFIIKNGGIDTDTDYPYTGKDGKCDKSRKNSKVVSIDS 884 ELV CDTSYNQGCNGGLMDYAFEFIIKNGGIDT+ DYPYTGKDGKCDK++KN+KVV+IDS Sbjct: 191 
ELVNCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYTGKDGKCDKNKKNAKVVTIDS 250 Query: 883 YEDVPVNDESALQKAAANQPITVAIEASSRDFQFYTSGIFSGKCGTDLDHGVVVVGYGTE 704 YEDVPVNDES+L+KA +NQP+ VAIEA RDFQFYTSGIF+G CGT LDHGV+ GYGTE Sbjct: 251 YEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSGIFTGSCGTALDHGVLAAGYGTE 310 Query: 703 DGKDYWIVRNSWGAEWGEEGYLRMERNIKENEGKCGIAMEPSYPIKNGQNXXXXXXXXXX 524 DGKDYW+V+NSWGAEWGE GYL+MERNI + GKCGIAME SYPIKNG N Sbjct: 311 DGKDYWLVKNSWGAEWGEGGYLKMERNIADKSGKCGIAMEASYPIKNGDNPPNPGPTPPS 370 Query: 523 XXXPEKVCDQYYTCPASTTCCCIYNYHGSCFAWGCCPLEGAACCDDHYSCCPHDYPICNV 344 PE VCD+Y TCP STTCCCIY Y+G CFAWGCCPLEGA+CCDDHYSCCPHDYPICNV Sbjct: 371 PAAPEVVCDEYSTCPESTTCCCIYEYYGYCFAWGCCPLEGASCCDDHYSCCPHDYPICNV 430 Query: 343 RRRTCSKRKNSPLEIEALKRILATPTNAKRN 251 RR TCSK +NSPLEI A KRILATPT KRN Sbjct: 431 RRGTCSKSRNSPLEISATKRILATPTKLKRN 461 >dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus] Length = 462 Score = 675 bits (1742), Expect = 0.0 Identities = 332/509 (65%), Positives = 380/509 (74%), Gaps = 4/509 (0%) Frame = -2 Query: 1783 MKLLA--MTILLFFALFAVSSAMDMSIIGYDATHMTTADASSSSWRTDDEVNAMYESWLV 1610 MKLL+ M I L FALF SSA+DMSII YDATH AS SSWRTDDEV AMYESWLV Sbjct: 1 MKLLSPSMAIALLFALFVASSALDMSIINYDATH-----ASKSSWRTDDEVMAMYESWLV 55 Query: 1609 KHGKFYNALGEKERRFQXXXXXXXXXXXXXXXXXXXRFQIFKDNLRFIDHHNSGDH-SYK 1433 KHGK YNALGEKE+RFQ IFKDNLRFID HN+ ++ SYK Sbjct: 56 KHGKSYNALGEKEKRFQ----------------------IFKDNLRFIDEHNAEENLSYK 93 Query: 1432 LGLNKFADLSNEEYRSTYTGAKTIDSKRKLNNVKSDRYSPRSDDVLPDFVDWRSKGAVAA 1253 +GLN+FADL+NEEYRSTY GAK SK KL+ VKSDRY+PR D LP+ VDWR+KGAVA Sbjct: 94 VGLNRFADLTNEEYRSTYLGAK---SKPKLSKVKSDRYAPRVGDSLPESVDWRAKGAVAP 150 Query: 1252 VKDQGSCXXXXXXXXXXXXXXXXXXXXXKDQGSCGSCWAFSTIGSVEGINQIVTGEMITL 1073 +KDQGS CGSCWAFST+ +VEGINQIVTGE+ITL Sbjct: 151 IKDQGS---------------------------CGSCWAFSTVNAVEGINQIVTGELITL 183 Query: 1072 SEQELVECDTSYNQGCNGGLMDYAFEFIIKNGGIDTDTDYPYTGKDGKCDKSRKNSKVVS 893 SEQELV+CD SYN+GC+GGLMDY FEFII NGGIDTD DYPY G+D +CD+ RKN+KVV+ Sbjct: 184 
SEQELVDCDKSYNEGCDGGLMDYGFEFIINNGGIDTDKDYPYLGRDARCDQYRKNAKVVT 243 Query: 892 IDSYEDVPVNDESALQKAAANQPITVAIEASSRDFQFYTSGIFSGKCGTDLDHGVVVVGY 713 IDSYEDVPVN+E AL+KA A+QP++V IE R FQFY SGIF+GKCGT LDHGV VVGY Sbjct: 244 IDSYEDVPVNNEEALKKAVASQPVSVGIEGGGRAFQFYDSGIFTGKCGTALDHGVNVVGY 303 Query: 712 GTEDGKDYWIVRNSWGAEWGEEGYLRMERNIK-ENEGKCGIAMEPSYPIKNGQNXXXXXX 536 GTE GKDYWIVRNSWG+ WGE GY+RMERN+ + GKCGIAMEPSYP+KNGQN Sbjct: 304 GTEKGKDYWIVRNSWGSSWGEAGYIRMERNLAGTSVGKCGIAMEPSYPLKNGQNPPNPGP 363 Query: 535 XXXXXXXPEKVCDQYYTCPASTTCCCIYNYHGSCFAWGCCPLEGAACCDDHYSCCPHDYP 356 P VCD YYTCP S+TCCC+Y Y+G CF+WGCCPL+GA CCDDHYSCCPHDYP Sbjct: 364 SPPTPVRPPTVCDDYYTCPESSTCCCVYEYYGYCFSWGCCPLDGATCCDDHYSCCPHDYP 423 Query: 355 ICNVRRRTCSKRKNSPLEIEALKRILATP 269 +CNV+ TCS KN+PL ++A++RILATP Sbjct: 424 VCNVQAGTCSMSKNNPLGVKAIQRILATP 452 >dbj|BAD29960.1| cysteine protease [Daucus carota] Length = 460 Score = 632 bits (1629), Expect = e-178 Identities = 312/506 (61%), Positives = 365/506 (72%), Gaps = 1/506 (0%) Frame = -2 Query: 1783 MKLLAMTILLFFALFAVSSAMDMSIIGYDATHMTTADASSSSWRTDDEVNAMYESWLVKH 1604 MK++ +++L L A +A DMSII YD TH + TDD + A YESWLVKH Sbjct: 1 MKMI-LSLLSLSLLAAAVTAADMSIITYDQTHAVGS--------TDDVIMAAYESWLVKH 51 Query: 1603 GKFYNALGEKERRFQXXXXXXXXXXXXXXXXXXXRFQIFKDNLRFIDHHNSG-DHSYKLG 1427 GK YNALGEKE+RFQ IFKDN +ID N+ D S+KLG Sbjct: 52 GKSYNALGEKEQRFQ----------------------IFKDNFLYIDEQNAAKDRSFKLG 89 Query: 1426 LNKFADLSNEEYRSTYTGAKTIDSKRKLNNVKSDRYSPRSDDVLPDFVDWRSKGAVAAVK 1247 LN+FADL+NEEYRS YTG +T DS++K++ KS RY+ + + LP+ VDWR GAVA+VK Sbjct: 90 LNRFADLTNEEYRSKYTGIRTKDSRKKVSG-KSQRYASLAGESLPESVDWREHGAVASVK 148 Query: 1246 DQGSCXXXXXXXXXXXXXXXXXXXXXKDQGSCGSCWAFSTIGSVEGINQIVTGEMITLSE 1067 DQG CGSCWAFSTI +VEGINQI TG++ITLSE Sbjct: 149 DQGQ---------------------------CGSCWAFSTISAVEGINQIATGKLITLSE 181 Query: 1066 QELVECDTSYNQGCNGGLMDYAFEFIIKNGGIDTDTDYPYTGKDGKCDKSRKNSKVVSID 887 QELV+CD SYN+GCNGGLMD AF+FII NGGID+D DYPYTG+DG+CD+ RKN+KVV+ID Sbjct: 182 QELVDCDRSYNEGCNGGLMDDAFQFIINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTID 241 Query: 
886 SYEDVPVNDESALQKAAANQPITVAIEASSRDFQFYTSGIFSGKCGTDLDHGVVVVGYGT 707 SYEDVP DE ALQKAAANQPI+VAIEAS RDFQFY SGIF+GKCGTDLDHGVVVVGYGT Sbjct: 242 SYEDVPEYDEKALQKAAANQPISVAIEASGRDFQFYDSGIFTGKCGTDLDHGVVVVGYGT 301 Query: 706 EDGKDYWIVRNSWGAEWGEEGYLRMERNIKENEGKCGIAMEPSYPIKNGQNXXXXXXXXX 527 E+GKDYWIVRNSWGA+WGE+GYLRMER I G CGI EPSYP+K+G N Sbjct: 302 ENGKDYWIVRNSWGADWGEKGYLRMERGISSKAGICGITSEPSYPVKSGVNPPNPGPSPP 361 Query: 526 XXXXPEKVCDQYYTCPASTTCCCIYNYHGSCFAWGCCPLEGAACCDDHYSCCPHDYPICN 347 PE VCD+YYTCP STTCCC+Y Y+G CFAWGCCPLEGA+CCDD YSCCPHDYP+CN Sbjct: 362 SPKSPESVCDEYYTCPMSTTCCCMYEYYGYCFAWGCCPLEGASCCDDGYSCCPHDYPVCN 421 Query: 346 VRRRTCSKRKNSPLEIEALKRILATP 269 VR TCS N+PL ++A++RILATP Sbjct: 422 VRAGTCSMSNNNPLGVKAIQRILATP 447 >gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta] Length = 467 Score = 630 bits (1626), Expect = e-178 Identities = 308/501 (61%), Positives = 355/501 (70%) Frame = -2 Query: 1771 AMTILLFFALFAVSSAMDMSIIGYDATHMTTADASSSSWRTDDEVNAMYESWLVKHGKFY 1592 AM +LLF + F +SSA DMSII YD TH A+ SSWRTDDEV A+YE WLVK GK Y Sbjct: 10 AMFVLLFLS-FTLSSASDMSIISYDQTH-----ATKSSWRTDDEVMAIYEEWLVKQGKVY 63 Query: 1591 NALGEKERRFQXXXXXXXXXXXXXXXXXXXRFQIFKDNLRFIDHHNSGDHSYKLGLNKFA 1412 NALGE+E+RFQ +FKDNLRFID HNS + +YKLGLN FA Sbjct: 64 NALGEREKRFQ----------------------VFKDNLRFIDEHNSENRTYKLGLNGFA 101 Query: 1411 DLSNEEYRSTYTGAKTIDSKRKLNNVKSDRYSPRSDDVLPDFVDWRSKGAVAAVKDQGSC 1232 DL+NEEYRSTY GA+ + +L SDRY+PR + LPD VDWR +GAVA VKDQGS Sbjct: 102 DLTNEEYRSTYLGARGGMKRNRLRKT-SDRYAPRVGESLPDSVDWRKEGAVAEVKDQGS- 159 Query: 1231 XXXXXXXXXXXXXXXXXXXXXKDQGSCGSCWAFSTIGSVEGINQIVTGEMITLSEQELVE 1052 CGSCWAFSTI +VEGIN+IVTG++I+LSEQELV+ Sbjct: 160 --------------------------CGSCWAFSTIAAVEGINKIVTGDLISLSEQELVD 193 Query: 1051 CDTSYNQGCNGGLMDYAFEFIIKNGGIDTDTDYPYTGKDGKCDKSRKNSKVVSIDSYEDV 872 CDTSYN+GCNGGLMDYAFEFII NGGIDT+ DYPY +DG+CD RKN+KVV+ID YEDV Sbjct: 194 CDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYLARDGRCDTYRKNAKVVTIDDYEDV 253 Query: 871 PVNDESALQKAAANQPITVAIEASSRDFQFYTSGIFSGKCGTDLDHGVVVVGYGTEDGKD 692 PVN E+ALQKA 
ANQP++VAIEA RDFQFY SGIFSG+CGT LDHGV VGYGTE+GKD Sbjct: 254 PVNSETALQKAVANQPVSVAIEAGGRDFQFYASGIFSGRCGTQLDHGVAAVGYGTENGKD 313 Query: 691 YWIVRNSWGAEWGEEGYLRMERNIKENEGKCGIAMEPSYPIKNGQNXXXXXXXXXXXXXP 512 YWIVRNSWG WGE GYLRM R+I G CGIAME SYPIK GQN P Sbjct: 314 YWIVRNSWGKSWGENGYLRMARSINSPTGICGIAMEASYPIKKGQNPPNPAPLPPSPVTP 373 Query: 511 EKVCDQYYTCPASTTCCCIYNYHGSCFAWGCCPLEGAACCDDHYSCCPHDYPICNVRRRT 332 VCD YY+CP + TCCC++ Y CF WGCCPLEGA CC+DHYSCCPHDYPICN+ + T Sbjct: 374 PTVCDNYYSCPDNNTCCCLFEYGNFCFEWGCCPLEGATCCEDHYSCCPHDYPICNINQGT 433 Query: 331 CSKRKNSPLEIEALKRILATP 269 C K++PL ++A+ RI A P Sbjct: 434 CLMSKDNPLAVKAMIRIPAKP 454 >gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa] Length = 461 Score = 630 bits (1625), Expect = e-178 Identities = 311/513 (60%), Positives = 358/513 (69%) Frame = -2 Query: 1789 STMKLLAMTILLFFALFAVSSAMDMSIIGYDATHMTTADASSSSWRTDDEVNAMYESWLV 1610 S+ AM +LL F+LFA+SSA+DMSIIG SS RTDDEV AMYESWLV Sbjct: 3 SSRSFTAMALLLLFSLFALSSALDMSIIG-----------ELSSSRTDDEVMAMYESWLV 51 Query: 1609 KHGKFYNALGEKERRFQXXXXXXXXXXXXXXXXXXXRFQIFKDNLRFIDHHNSGDHSYKL 1430 KHGK YNA+GEKE+RFQ IFKDNLRFID HN+ +YK+ Sbjct: 52 KHGKSYNAIGEKEKRFQ----------------------IFKDNLRFIDEHNAESRTYKV 89 Query: 1429 GLNKFADLSNEEYRSTYTGAKTIDSKRKLNNVKSDRYSPRSDDVLPDFVDWRSKGAVAAV 1250 GLN+FADL+N+EYRS Y GA+T +R +SDRY P + + LPD VDWR KGAV V Sbjct: 90 GLNRFADLTNDEYRSMYLGARTGSRRRLSTQKRSDRYVPVAGESLPDSVDWREKGAVVGV 149 Query: 1249 KDQGSCXXXXXXXXXXXXXXXXXXXXXKDQGSCGSCWAFSTIGSVEGINQIVTGEMITLS 1070 KDQGS CGSCWAFSTI +VEGINQIVTG++I+LS Sbjct: 150 KDQGS---------------------------CGSCWAFSTIAAVEGINQIVTGDLISLS 182 Query: 1069 EQELVECDTSYNQGCNGGLMDYAFEFIIKNGGIDTDTDYPYTGKDGKCDKSRKNSKVVSI 890 EQELV+CDTSYN+GCNGGLMDYAFEFIIKNGGIDT+ DYPY +DG+CD+ RKN+KVV+I Sbjct: 183 EQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTEEDYPYNARDGRCDQYRKNAKVVTI 242 Query: 889 DSYEDVPVNDESALQKAAANQPITVAIEASSRDFQFYTSGIFSGKCGTDLDHGVVVVGYG 710 D YEDVPVN+E ALQKA ANQP++VAIEAS FQFY SG+F+G CGT LDHGV VGYG Sbjct: 243 
DDYEDVPVNNEQALQKAVANQPVSVAIEASGMAFQFYESGVFTGNCGTALDHGVTAVGYG 302 Query: 709 TEDGKDYWIVRNSWGAEWGEEGYLRMERNIKENEGKCGIAMEPSYPIKNGQNXXXXXXXX 530 TE+ DYWIV+NSWG+ WGE GY+RMERN GKCGIA+EPSYPIK QN Sbjct: 303 TENSVDYWIVKNSWGSSWGESGYIRMERNTGAT-GKCGIAVEPSYPIKTSQNPPNPGPSP 361 Query: 529 XXXXXPEKVCDQYYTCPASTTCCCIYNYHGSCFAWGCCPLEGAACCDDHYSCCPHDYPIC 350 P VCD YYTCP S+TCCC+Y Y CFAWGCCPLEGA CCDDHYSCCPHDYPIC Sbjct: 362 PSPIKPPTVCDDYYTCPESSTCCCVYEYGKYCFAWGCCPLEGATCCDDHYSCCPHDYPIC 421 Query: 349 NVRRRTCSKRKNSPLEIEALKRILATPTNAKRN 251 NV TC K++PL ++A+KRI A P A N Sbjct: 422 NVYAGTCLMSKDNPLGVKAMKRIQAKPQWAFAN 454
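The Identities, Positives, and Gaps percentages in the five alignments above can be recomputed from their numerator/denominator pairs. The following is a minimal sketch, assuming the statistics lines are copied verbatim into a string. The names `STATS`, `parse_stats`, and `floor_percent` are hypothetical helpers for illustration and are not part of BLAST. For the values in this report, every printed percentage matches integer truncation of 100 * matches / columns (for example 380/509 is 74.66 and is reported as 74%). This is an observation about these figures only, not a claim about how BLAST computes them internally.

```python
import re

# Statistics lines copied from the five alignments in this report
# (Helianthus, Platycodon, Daucus, Manihot, Actinidia, in that order).
STATS = """
Identities = 355/511 (69%), Positives = 399/511 (78%)
Identities = 332/509 (65%), Positives = 380/509 (74%), Gaps = 4/509 (0%)
Identities = 312/506 (61%), Positives = 365/506 (72%), Gaps = 1/506 (0%)
Identities = 308/501 (61%), Positives = 355/501 (70%)
Identities = 311/513 (60%), Positives = 358/513 (69%)
"""

# One field looks like "Identities = 355/511 (69%)".
FIELD = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(text):
    """Return (label, matches, columns, reported_percent) tuples (hypothetical helper)."""
    return [
        (m.group(1), int(m.group(2)), int(m.group(3)), int(m.group(4)))
        for m in FIELD.finditer(text)
    ]


def floor_percent(matches, columns):
    """Percentage with the fractional part dropped (an assumption that fits this report)."""
    return 100 * matches // columns


rows = parse_stats(STATS)
for label, matches, columns, reported in rows:
    computed = floor_percent(matches, columns)
    print(f"{label:<10} {matches}/{columns} -> {computed}% (reported {reported}%)")
```

The denominator is the number of alignment columns, gaps included, which is why it can exceed the subject length (511 columns against a 461-residue subject in the first hit).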