BLASTX nr result

ID: Atractylodes21_contig00008568 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00008568
         (1862 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]               741   0.0  
dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]         675   0.0  
dbj|BAD29960.1| cysteine protease [Daucus carota]                     632   e-178
gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]              630   e-178
gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]            630   e-178

>dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
          Length = 461

 Score =  741 bits (1913), Expect = 0.0
 Identities = 355/511 (69%), Positives = 399/511 (78%)
 Frame = -2

Query: 1783 MKLLAMTILLFFALFAVSSAMDMSIIGYDATHMTTADASSSSWRTDDEVNAMYESWLVKH 1604
            MKL+ M  L FFAL ++ SAMDMSII YDATHM+++ +SS+  RTDDEVNA+YESWLVKH
Sbjct: 1    MKLIPMATLSFFALISIISAMDMSIINYDATHMSSS-SSSAPLRTDDEVNALYESWLVKH 59

Query: 1603 GKFYNALGEKERRFQXXXXXXXXXXXXXXXXXXXRFQIFKDNLRFIDHHNSGDHSYKLGL 1424
            GK YNALGEK+RRFQ                      IFKDNLRFID HNSGDH+YKLGL
Sbjct: 60   GKTYNALGEKDRRFQ----------------------IFKDNLRFIDEHNSGDHTYKLGL 97

Query: 1423 NKFADLSNEEYRSTYTGAKTIDSKRKLNNVKSDRYSPRSDDVLPDFVDWRSKGAVAAVKD 1244
            NKFADL+NEEYR TYTG KTID K+KL+ +KSDRY+ RS D LP++VDWR +GAV  VKD
Sbjct: 98   NKFADLTNEEYRMTYTGIKTIDDKKKLSKMKSDRYAYRSGDSLPEYVDWREQGAVTDVKD 157

Query: 1243 QGSCXXXXXXXXXXXXXXXXXXXXXKDQGSCGSCWAFSTIGSVEGINQIVTGEMITLSEQ 1064
            QGS                           CGSCWAFST GSVEG+N+IVTG++I++SEQ
Sbjct: 158  QGS---------------------------CGSCWAFSTTGSVEGVNKIVTGDLISVSEQ 190

Query: 1063 ELVECDTSYNQGCNGGLMDYAFEFIIKNGGIDTDTDYPYTGKDGKCDKSRKNSKVVSIDS 884
            ELV CDTSYNQGCNGGLMDYAFEFIIKNGGIDT+ DYPYTGKDGKCDK++KN+KVV+IDS
Sbjct: 191  ELVNCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYTGKDGKCDKNKKNAKVVTIDS 250

Query: 883  YEDVPVNDESALQKAAANQPITVAIEASSRDFQFYTSGIFSGKCGTDLDHGVVVVGYGTE 704
            YEDVPVNDES+L+KA +NQP+ VAIEA  RDFQFYTSGIF+G CGT LDHGV+  GYGTE
Sbjct: 251  YEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSGIFTGSCGTALDHGVLAAGYGTE 310

Query: 703  DGKDYWIVRNSWGAEWGEEGYLRMERNIKENEGKCGIAMEPSYPIKNGQNXXXXXXXXXX 524
            DGKDYW+V+NSWGAEWGE GYL+MERNI +  GKCGIAME SYPIKNG N          
Sbjct: 311  DGKDYWLVKNSWGAEWGEGGYLKMERNIADKSGKCGIAMEASYPIKNGDNPPNPGPTPPS 370

Query: 523  XXXPEKVCDQYYTCPASTTCCCIYNYHGSCFAWGCCPLEGAACCDDHYSCCPHDYPICNV 344
               PE VCD+Y TCP STTCCCIY Y+G CFAWGCCPLEGA+CCDDHYSCCPHDYPICNV
Sbjct: 371  PAAPEVVCDEYSTCPESTTCCCIYEYYGYCFAWGCCPLEGASCCDDHYSCCPHDYPICNV 430

Query: 343  RRRTCSKRKNSPLEIEALKRILATPTNAKRN 251
            RR TCSK +NSPLEI A KRILATPT  KRN
Sbjct: 431  RRGTCSKSRNSPLEISATKRILATPTKLKRN 461


>dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 462

 Score =  675 bits (1742), Expect = 0.0
 Identities = 332/509 (65%), Positives = 380/509 (74%), Gaps = 4/509 (0%)
 Frame = -2

Query: 1783 MKLLA--MTILLFFALFAVSSAMDMSIIGYDATHMTTADASSSSWRTDDEVNAMYESWLV 1610
            MKLL+  M I L FALF  SSA+DMSII YDATH     AS SSWRTDDEV AMYESWLV
Sbjct: 1    MKLLSPSMAIALLFALFVASSALDMSIINYDATH-----ASKSSWRTDDEVMAMYESWLV 55

Query: 1609 KHGKFYNALGEKERRFQXXXXXXXXXXXXXXXXXXXRFQIFKDNLRFIDHHNSGDH-SYK 1433
            KHGK YNALGEKE+RFQ                      IFKDNLRFID HN+ ++ SYK
Sbjct: 56   KHGKSYNALGEKEKRFQ----------------------IFKDNLRFIDEHNAEENLSYK 93

Query: 1432 LGLNKFADLSNEEYRSTYTGAKTIDSKRKLNNVKSDRYSPRSDDVLPDFVDWRSKGAVAA 1253
            +GLN+FADL+NEEYRSTY GAK   SK KL+ VKSDRY+PR  D LP+ VDWR+KGAVA 
Sbjct: 94   VGLNRFADLTNEEYRSTYLGAK---SKPKLSKVKSDRYAPRVGDSLPESVDWRAKGAVAP 150

Query: 1252 VKDQGSCXXXXXXXXXXXXXXXXXXXXXKDQGSCGSCWAFSTIGSVEGINQIVTGEMITL 1073
            +KDQGS                           CGSCWAFST+ +VEGINQIVTGE+ITL
Sbjct: 151  IKDQGS---------------------------CGSCWAFSTVNAVEGINQIVTGELITL 183

Query: 1072 SEQELVECDTSYNQGCNGGLMDYAFEFIIKNGGIDTDTDYPYTGKDGKCDKSRKNSKVVS 893
            SEQELV+CD SYN+GC+GGLMDY FEFII NGGIDTD DYPY G+D +CD+ RKN+KVV+
Sbjct: 184  SEQELVDCDKSYNEGCDGGLMDYGFEFIINNGGIDTDKDYPYLGRDARCDQYRKNAKVVT 243

Query: 892  IDSYEDVPVNDESALQKAAANQPITVAIEASSRDFQFYTSGIFSGKCGTDLDHGVVVVGY 713
            IDSYEDVPVN+E AL+KA A+QP++V IE   R FQFY SGIF+GKCGT LDHGV VVGY
Sbjct: 244  IDSYEDVPVNNEEALKKAVASQPVSVGIEGGGRAFQFYDSGIFTGKCGTALDHGVNVVGY 303

Query: 712  GTEDGKDYWIVRNSWGAEWGEEGYLRMERNIK-ENEGKCGIAMEPSYPIKNGQNXXXXXX 536
            GTE GKDYWIVRNSWG+ WGE GY+RMERN+   + GKCGIAMEPSYP+KNGQN      
Sbjct: 304  GTEKGKDYWIVRNSWGSSWGEAGYIRMERNLAGTSVGKCGIAMEPSYPLKNGQNPPNPGP 363

Query: 535  XXXXXXXPEKVCDQYYTCPASTTCCCIYNYHGSCFAWGCCPLEGAACCDDHYSCCPHDYP 356
                   P  VCD YYTCP S+TCCC+Y Y+G CF+WGCCPL+GA CCDDHYSCCPHDYP
Sbjct: 364  SPPTPVRPPTVCDDYYTCPESSTCCCVYEYYGYCFSWGCCPLDGATCCDDHYSCCPHDYP 423

Query: 355  ICNVRRRTCSKRKNSPLEIEALKRILATP 269
            +CNV+  TCS  KN+PL ++A++RILATP
Sbjct: 424  VCNVQAGTCSMSKNNPLGVKAIQRILATP 452


>dbj|BAD29960.1| cysteine protease [Daucus carota]
          Length = 460

 Score =  632 bits (1629), Expect = e-178
 Identities = 312/506 (61%), Positives = 365/506 (72%), Gaps = 1/506 (0%)
 Frame = -2

Query: 1783 MKLLAMTILLFFALFAVSSAMDMSIIGYDATHMTTADASSSSWRTDDEVNAMYESWLVKH 1604
            MK++ +++L    L A  +A DMSII YD TH   +        TDD + A YESWLVKH
Sbjct: 1    MKMI-LSLLSLSLLAAAVTAADMSIITYDQTHAVGS--------TDDVIMAAYESWLVKH 51

Query: 1603 GKFYNALGEKERRFQXXXXXXXXXXXXXXXXXXXRFQIFKDNLRFIDHHNSG-DHSYKLG 1427
            GK YNALGEKE+RFQ                      IFKDN  +ID  N+  D S+KLG
Sbjct: 52   GKSYNALGEKEQRFQ----------------------IFKDNFLYIDEQNAAKDRSFKLG 89

Query: 1426 LNKFADLSNEEYRSTYTGAKTIDSKRKLNNVKSDRYSPRSDDVLPDFVDWRSKGAVAAVK 1247
            LN+FADL+NEEYRS YTG +T DS++K++  KS RY+  + + LP+ VDWR  GAVA+VK
Sbjct: 90   LNRFADLTNEEYRSKYTGIRTKDSRKKVSG-KSQRYASLAGESLPESVDWREHGAVASVK 148

Query: 1246 DQGSCXXXXXXXXXXXXXXXXXXXXXKDQGSCGSCWAFSTIGSVEGINQIVTGEMITLSE 1067
            DQG                            CGSCWAFSTI +VEGINQI TG++ITLSE
Sbjct: 149  DQGQ---------------------------CGSCWAFSTISAVEGINQIATGKLITLSE 181

Query: 1066 QELVECDTSYNQGCNGGLMDYAFEFIIKNGGIDTDTDYPYTGKDGKCDKSRKNSKVVSID 887
            QELV+CD SYN+GCNGGLMD AF+FII NGGID+D DYPYTG+DG+CD+ RKN+KVV+ID
Sbjct: 182  QELVDCDRSYNEGCNGGLMDDAFQFIINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTID 241

Query: 886  SYEDVPVNDESALQKAAANQPITVAIEASSRDFQFYTSGIFSGKCGTDLDHGVVVVGYGT 707
            SYEDVP  DE ALQKAAANQPI+VAIEAS RDFQFY SGIF+GKCGTDLDHGVVVVGYGT
Sbjct: 242  SYEDVPEYDEKALQKAAANQPISVAIEASGRDFQFYDSGIFTGKCGTDLDHGVVVVGYGT 301

Query: 706  EDGKDYWIVRNSWGAEWGEEGYLRMERNIKENEGKCGIAMEPSYPIKNGQNXXXXXXXXX 527
            E+GKDYWIVRNSWGA+WGE+GYLRMER I    G CGI  EPSYP+K+G N         
Sbjct: 302  ENGKDYWIVRNSWGADWGEKGYLRMERGISSKAGICGITSEPSYPVKSGVNPPNPGPSPP 361

Query: 526  XXXXPEKVCDQYYTCPASTTCCCIYNYHGSCFAWGCCPLEGAACCDDHYSCCPHDYPICN 347
                PE VCD+YYTCP STTCCC+Y Y+G CFAWGCCPLEGA+CCDD YSCCPHDYP+CN
Sbjct: 362  SPKSPESVCDEYYTCPMSTTCCCMYEYYGYCFAWGCCPLEGASCCDDGYSCCPHDYPVCN 421

Query: 346  VRRRTCSKRKNSPLEIEALKRILATP 269
            VR  TCS   N+PL ++A++RILATP
Sbjct: 422  VRAGTCSMSNNNPLGVKAIQRILATP 447


>gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
          Length = 467

 Score =  630 bits (1626), Expect = e-178
 Identities = 308/501 (61%), Positives = 355/501 (70%)
 Frame = -2

Query: 1771 AMTILLFFALFAVSSAMDMSIIGYDATHMTTADASSSSWRTDDEVNAMYESWLVKHGKFY 1592
            AM +LLF + F +SSA DMSII YD TH     A+ SSWRTDDEV A+YE WLVK GK Y
Sbjct: 10   AMFVLLFLS-FTLSSASDMSIISYDQTH-----ATKSSWRTDDEVMAIYEEWLVKQGKVY 63

Query: 1591 NALGEKERRFQXXXXXXXXXXXXXXXXXXXRFQIFKDNLRFIDHHNSGDHSYKLGLNKFA 1412
            NALGE+E+RFQ                      +FKDNLRFID HNS + +YKLGLN FA
Sbjct: 64   NALGEREKRFQ----------------------VFKDNLRFIDEHNSENRTYKLGLNGFA 101

Query: 1411 DLSNEEYRSTYTGAKTIDSKRKLNNVKSDRYSPRSDDVLPDFVDWRSKGAVAAVKDQGSC 1232
            DL+NEEYRSTY GA+    + +L    SDRY+PR  + LPD VDWR +GAVA VKDQGS 
Sbjct: 102  DLTNEEYRSTYLGARGGMKRNRLRKT-SDRYAPRVGESLPDSVDWRKEGAVAEVKDQGS- 159

Query: 1231 XXXXXXXXXXXXXXXXXXXXXKDQGSCGSCWAFSTIGSVEGINQIVTGEMITLSEQELVE 1052
                                      CGSCWAFSTI +VEGIN+IVTG++I+LSEQELV+
Sbjct: 160  --------------------------CGSCWAFSTIAAVEGINKIVTGDLISLSEQELVD 193

Query: 1051 CDTSYNQGCNGGLMDYAFEFIIKNGGIDTDTDYPYTGKDGKCDKSRKNSKVVSIDSYEDV 872
            CDTSYN+GCNGGLMDYAFEFII NGGIDT+ DYPY  +DG+CD  RKN+KVV+ID YEDV
Sbjct: 194  CDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYLARDGRCDTYRKNAKVVTIDDYEDV 253

Query: 871  PVNDESALQKAAANQPITVAIEASSRDFQFYTSGIFSGKCGTDLDHGVVVVGYGTEDGKD 692
            PVN E+ALQKA ANQP++VAIEA  RDFQFY SGIFSG+CGT LDHGV  VGYGTE+GKD
Sbjct: 254  PVNSETALQKAVANQPVSVAIEAGGRDFQFYASGIFSGRCGTQLDHGVAAVGYGTENGKD 313

Query: 691  YWIVRNSWGAEWGEEGYLRMERNIKENEGKCGIAMEPSYPIKNGQNXXXXXXXXXXXXXP 512
            YWIVRNSWG  WGE GYLRM R+I    G CGIAME SYPIK GQN             P
Sbjct: 314  YWIVRNSWGKSWGENGYLRMARSINSPTGICGIAMEASYPIKKGQNPPNPAPLPPSPVTP 373

Query: 511  EKVCDQYYTCPASTTCCCIYNYHGSCFAWGCCPLEGAACCDDHYSCCPHDYPICNVRRRT 332
              VCD YY+CP + TCCC++ Y   CF WGCCPLEGA CC+DHYSCCPHDYPICN+ + T
Sbjct: 374  PTVCDNYYSCPDNNTCCCLFEYGNFCFEWGCCPLEGATCCEDHYSCCPHDYPICNINQGT 433

Query: 331  CSKRKNSPLEIEALKRILATP 269
            C   K++PL ++A+ RI A P
Sbjct: 434  CLMSKDNPLAVKAMIRIPAKP 454


>gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
          Length = 461

 Score =  630 bits (1625), Expect = e-178
 Identities = 311/513 (60%), Positives = 358/513 (69%)
 Frame = -2

Query: 1789 STMKLLAMTILLFFALFAVSSAMDMSIIGYDATHMTTADASSSSWRTDDEVNAMYESWLV 1610
            S+    AM +LL F+LFA+SSA+DMSIIG             SS RTDDEV AMYESWLV
Sbjct: 3    SSRSFTAMALLLLFSLFALSSALDMSIIG-----------ELSSSRTDDEVMAMYESWLV 51

Query: 1609 KHGKFYNALGEKERRFQXXXXXXXXXXXXXXXXXXXRFQIFKDNLRFIDHHNSGDHSYKL 1430
            KHGK YNA+GEKE+RFQ                      IFKDNLRFID HN+   +YK+
Sbjct: 52   KHGKSYNAIGEKEKRFQ----------------------IFKDNLRFIDEHNAESRTYKV 89

Query: 1429 GLNKFADLSNEEYRSTYTGAKTIDSKRKLNNVKSDRYSPRSDDVLPDFVDWRSKGAVAAV 1250
            GLN+FADL+N+EYRS Y GA+T   +R     +SDRY P + + LPD VDWR KGAV  V
Sbjct: 90   GLNRFADLTNDEYRSMYLGARTGSRRRLSTQKRSDRYVPVAGESLPDSVDWREKGAVVGV 149

Query: 1249 KDQGSCXXXXXXXXXXXXXXXXXXXXXKDQGSCGSCWAFSTIGSVEGINQIVTGEMITLS 1070
            KDQGS                           CGSCWAFSTI +VEGINQIVTG++I+LS
Sbjct: 150  KDQGS---------------------------CGSCWAFSTIAAVEGINQIVTGDLISLS 182

Query: 1069 EQELVECDTSYNQGCNGGLMDYAFEFIIKNGGIDTDTDYPYTGKDGKCDKSRKNSKVVSI 890
            EQELV+CDTSYN+GCNGGLMDYAFEFIIKNGGIDT+ DYPY  +DG+CD+ RKN+KVV+I
Sbjct: 183  EQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTEEDYPYNARDGRCDQYRKNAKVVTI 242

Query: 889  DSYEDVPVNDESALQKAAANQPITVAIEASSRDFQFYTSGIFSGKCGTDLDHGVVVVGYG 710
            D YEDVPVN+E ALQKA ANQP++VAIEAS   FQFY SG+F+G CGT LDHGV  VGYG
Sbjct: 243  DDYEDVPVNNEQALQKAVANQPVSVAIEASGMAFQFYESGVFTGNCGTALDHGVTAVGYG 302

Query: 709  TEDGKDYWIVRNSWGAEWGEEGYLRMERNIKENEGKCGIAMEPSYPIKNGQNXXXXXXXX 530
            TE+  DYWIV+NSWG+ WGE GY+RMERN     GKCGIA+EPSYPIK  QN        
Sbjct: 303  TENSVDYWIVKNSWGSSWGESGYIRMERNTGAT-GKCGIAVEPSYPIKTSQNPPNPGPSP 361

Query: 529  XXXXXPEKVCDQYYTCPASTTCCCIYNYHGSCFAWGCCPLEGAACCDDHYSCCPHDYPIC 350
                 P  VCD YYTCP S+TCCC+Y Y   CFAWGCCPLEGA CCDDHYSCCPHDYPIC
Sbjct: 362  PSPIKPPTVCDDYYTCPESSTCCCVYEYGKYCFAWGCCPLEGATCCDDHYSCCPHDYPIC 421

Query: 349  NVRRRTCSKRKNSPLEIEALKRILATPTNAKRN 251
            NV   TC   K++PL ++A+KRI A P  A  N
Sbjct: 422  NVYAGTCLMSKDNPLGVKAMKRIQAKPQWAFAN 454


Top