BLASTX nr result

ID: Atractylodes22_contig00001149 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes22_contig00001149
         (1655 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]               629   0.0  
dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]         566   e-176
dbj|BAD29960.1| cysteine protease [Daucus carota]                     568   e-169
gb|ABK95110.1| unknown [Populus trichocarpa]                          540   e-165
ref|XP_002326950.1| predicted protein [Populus trichocarpa] gi|2...   540   e-165

>dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
          Length = 461

 Score =  629 bits (1621), Expect(2) = 0.0
 Identities = 285/367 (77%), Positives = 318/367 (86%)
 Frame = -2

Query: 1387 LGLNRFADLSNEEYRSTYTGAKTIDSKRKLNNVKSDRYSPRSDDVLPDFVDWRSKGAVSA 1208
            LGLN+FADL+NEEYR TYTG KTID K+KL+ +KSDRY+ RS D LP++VDWR +GAV+ 
Sbjct: 95   LGLNKFADLTNEEYRMTYTGIKTIDDKKKLSKMKSDRYAYRSGDSLPEYVDWREQGAVTD 154

Query: 1207 VKDQGSCGSCWAFSTTGSVEGINKLVTGDLITLSEQELVECDTSYNQGCNGGLMDYAFEF 1028
            VKDQGSCGSCWAFSTTGSVEG+NK+VTGDLI++SEQELV CDTSYNQGCNGGLMDYAFEF
Sbjct: 155  VKDQGSCGSCWAFSTTGSVEGVNKIVTGDLISVSEQELVNCDTSYNQGCNGGLMDYAFEF 214

Query: 1027 IIKNGGIDTDTDYPYTGKDGKCDKSRKNSKVVSIDSYEDVPVNDESALQKAAANQPITVA 848
            IIKNGGIDT+ DYPYTGKDGKCDK++KN+KVV+IDSYEDVPVNDES+L+KA +NQP+ VA
Sbjct: 215  IIKNGGIDTEEDYPYTGKDGKCDKNKKNAKVVTIDSYEDVPVNDESSLKKAVSNQPVAVA 274

Query: 847  IEASSRDFQFYTSGIFSGKCGTDLDHGVVVVGYGTEDGKDYWIVRNSWGAEWGEEGYLRM 668
            IEA  RDFQFYTSGIF+G CGT LDHGV+  GYGTEDGKDYW+V+NSWGAEWGE GYL+M
Sbjct: 275  IEAGGRDFQFYTSGIFTGSCGTALDHGVLAAGYGTEDGKDYWLVKNSWGAEWGEGGYLKM 334

Query: 667  ERNIKENEGKCGIAMEPSYPIKNGQNXXXXXXXXXXXXXPEKVCDQYYTCPASTTCCCIY 488
            ERNI +  GKCGIAME SYPIKNG N             PE VCD+Y TCP STTCCCIY
Sbjct: 335  ERNIADKSGKCGIAMEASYPIKNGDNPPNPGPTPPSPAAPEVVCDEYSTCPESTTCCCIY 394

Query: 487  NYHGSCFAWGCCPLEGAACCDDHYSCCPHDYPICNVRRRTCSKRKNSPLEIEALKRILAT 308
             Y+G CFAWGCCPLEGA+CCDDHYSCCPHDYPICNVRR TCSK +NSPLEI A KRILAT
Sbjct: 395  EYYGYCFAWGCCPLEGASCCDDHYSCCPHDYPICNVRRGTCSKSRNSPLEISATKRILAT 454

Query: 307  PTNAKRN 287
            PT  KRN
Sbjct: 455  PTKLKRN 461



 Score = 77.8 bits (190), Expect(2) = 0.0
 Identities = 37/49 (75%), Positives = 44/49 (89%)
 Frame = -1

Query: 1655 IIGYDATHMTTADASSSSWRTDDEVNAMYESWLVKHGKFYNALGEKERR 1509
            II YDATHM+++ +SS+  RTDDEVNA+YESWLVKHGK YNALGEK+RR
Sbjct: 25   IINYDATHMSSS-SSSAPLRTDDEVNALYESWLVKHGKTYNALGEKDRR 72


>dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 462

 Score =  567 bits (1460), Expect(2) = e-176
 Identities = 257/362 (70%), Positives = 300/362 (82%), Gaps = 1/362 (0%)
 Frame = -2

Query: 1387 LGLNRFADLSNEEYRSTYTGAKTIDSKRKLNNVKSDRYSPRSDDVLPDFVDWRSKGAVSA 1208
            +GLNRFADL+NEEYRSTY GAK   SK KL+ VKSDRY+PR  D LP+ VDWR+KGAV+ 
Sbjct: 94   VGLNRFADLTNEEYRSTYLGAK---SKPKLSKVKSDRYAPRVGDSLPESVDWRAKGAVAP 150

Query: 1207 VKDQGSCGSCWAFSTTGSVEGINKLVTGDLITLSEQELVECDTSYNQGCNGGLMDYAFEF 1028
            +KDQGSCGSCWAFST  +VEGIN++VTG+LITLSEQELV+CD SYN+GC+GGLMDY FEF
Sbjct: 151  IKDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDCDKSYNEGCDGGLMDYGFEF 210

Query: 1027 IIKNGGIDTDTDYPYTGKDGKCDKSRKNSKVVSIDSYEDVPVNDESALQKAAANQPITVA 848
            II NGGIDTD DYPY G+D +CD+ RKN+KVV+IDSYEDVPVN+E AL+KA A+QP++V 
Sbjct: 211  IINNGGIDTDKDYPYLGRDARCDQYRKNAKVVTIDSYEDVPVNNEEALKKAVASQPVSVG 270

Query: 847  IEASSRDFQFYTSGIFSGKCGTDLDHGVVVVGYGTEDGKDYWIVRNSWGAEWGEEGYLRM 668
            IE   R FQFY SGIF+GKCGT LDHGV VVGYGTE GKDYWIVRNSWG+ WGE GY+RM
Sbjct: 271  IEGGGRAFQFYDSGIFTGKCGTALDHGVNVVGYGTEKGKDYWIVRNSWGSSWGEAGYIRM 330

Query: 667  ERNIK-ENEGKCGIAMEPSYPIKNGQNXXXXXXXXXXXXXPEKVCDQYYTCPASTTCCCI 491
            ERN+   + GKCGIAMEPSYP+KNGQN             P  VCD YYTCP S+TCCC+
Sbjct: 331  ERNLAGTSVGKCGIAMEPSYPLKNGQNPPNPGPSPPTPVRPPTVCDDYYTCPESSTCCCV 390

Query: 490  YNYHGSCFAWGCCPLEGAACCDDHYSCCPHDYPICNVRRRTCSKRKNSPLEIEALKRILA 311
            Y Y+G CF+WGCCPL+GA CCDDHYSCCPHDYP+CNV+  TCS  KN+PL ++A++RILA
Sbjct: 391  YEYYGYCFSWGCCPLDGATCCDDHYSCCPHDYPVCNVQAGTCSMSKNNPLGVKAIQRILA 450

Query: 310  TP 305
            TP
Sbjct: 451  TP 452



 Score = 79.0 bits (193), Expect(2) = e-176
 Identities = 39/49 (79%), Positives = 40/49 (81%)
 Frame = -1

Query: 1655 IIGYDATHMTTADASSSSWRTDDEVNAMYESWLVKHGKFYNALGEKERR 1509
            II YDATH     AS SSWRTDDEV AMYESWLVKHGK YNALGEKE+R
Sbjct: 27   IINYDATH-----ASKSSWRTDDEVMAMYESWLVKHGKSYNALGEKEKR 70


>dbj|BAD29960.1| cysteine protease [Daucus carota]
          Length = 460

 Score =  568 bits (1464), Expect(2) = e-169
 Identities = 259/361 (71%), Positives = 300/361 (83%)
 Frame = -2

Query: 1387 LGLNRFADLSNEEYRSTYTGAKTIDSKRKLNNVKSDRYSPRSDDVLPDFVDWRSKGAVSA 1208
            LGLNRFADL+NEEYRS YTG +T DS++K++  KS RY+  + + LP+ VDWR  GAV++
Sbjct: 88   LGLNRFADLTNEEYRSKYTGIRTKDSRKKVSG-KSQRYASLAGESLPESVDWREHGAVAS 146

Query: 1207 VKDQGSCGSCWAFSTTGSVEGINKLVTGDLITLSEQELVECDTSYNQGCNGGLMDYAFEF 1028
            VKDQG CGSCWAFST  +VEGIN++ TG LITLSEQELV+CD SYN+GCNGGLMD AF+F
Sbjct: 147  VKDQGQCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDDAFQF 206

Query: 1027 IIKNGGIDTDTDYPYTGKDGKCDKSRKNSKVVSIDSYEDVPVNDESALQKAAANQPITVA 848
            II NGGID+D DYPYTG+DG+CD+ RKN+KVV+IDSYEDVP  DE ALQKAAANQPI+VA
Sbjct: 207  IINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAAANQPISVA 266

Query: 847  IEASSRDFQFYTSGIFSGKCGTDLDHGVVVVGYGTEDGKDYWIVRNSWGAEWGEEGYLRM 668
            IEAS RDFQFY SGIF+GKCGTDLDHGVVVVGYGTE+GKDYWIVRNSWGA+WGE+GYLRM
Sbjct: 267  IEASGRDFQFYDSGIFTGKCGTDLDHGVVVVGYGTENGKDYWIVRNSWGADWGEKGYLRM 326

Query: 667  ERNIKENEGKCGIAMEPSYPIKNGQNXXXXXXXXXXXXXPEKVCDQYYTCPASTTCCCIY 488
            ER I    G CGI  EPSYP+K+G N             PE VCD+YYTCP STTCCC+Y
Sbjct: 327  ERGISSKAGICGITSEPSYPVKSGVNPPNPGPSPPSPKSPESVCDEYYTCPMSTTCCCMY 386

Query: 487  NYHGSCFAWGCCPLEGAACCDDHYSCCPHDYPICNVRRRTCSKRKNSPLEIEALKRILAT 308
             Y+G CFAWGCCPLEGA+CCDD YSCCPHDYP+CNVR  TCS   N+PL ++A++RILAT
Sbjct: 387  EYYGYCFAWGCCPLEGASCCDDGYSCCPHDYPVCNVRAGTCSMSNNNPLGVKAIQRILAT 446

Query: 307  P 305
            P
Sbjct: 447  P 447



 Score = 56.2 bits (134), Expect(2) = e-169
 Identities = 29/49 (59%), Positives = 32/49 (65%)
 Frame = -1

Query: 1655 IIGYDATHMTTADASSSSWRTDDEVNAMYESWLVKHGKFYNALGEKERR 1509
            II YD TH   +        TDD + A YESWLVKHGK YNALGEKE+R
Sbjct: 24   IITYDQTHAVGS--------TDDVIMAAYESWLVKHGKSYNALGEKEQR 64


>gb|ABK95110.1| unknown [Populus trichocarpa]
          Length = 465

 Score =  540 bits (1392), Expect(2) = e-165
 Identities = 246/361 (68%), Positives = 281/361 (77%)
 Frame = -2

Query: 1387 LGLNRFADLSNEEYRSTYTGAKTIDSKRKLNNVKSDRYSPRSDDVLPDFVDWRSKGAVSA 1208
            +GLNRFADL+NEE+RS Y G +T   KR      SDRY+PR  D LPD VDWR +GAV+ 
Sbjct: 94   VGLNRFADLTNEEFRSMYLGTRTGHKKRLPKT--SDRYAPRVGDSLPDSVDWRKEGAVAE 151

Query: 1207 VKDQGSCGSCWAFSTTGSVEGINKLVTGDLITLSEQELVECDTSYNQGCNGGLMDYAFEF 1028
            VKDQG CGSCWAFST  +VEGINK+VTGDLI LSEQELV+CDTSYN+GCNGGLMDYAFEF
Sbjct: 152  VKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGGLMDYAFEF 211

Query: 1027 IIKNGGIDTDTDYPYTGKDGKCDKSRKNSKVVSIDSYEDVPVNDESALQKAAANQPITVA 848
            II NGGIDT+ DYPY G+DG+CD  RKN+KVVSIDSYEDVP NDE+AL+KA ANQP++VA
Sbjct: 212  IINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYEDVPENDETALKKAVANQPVSVA 271

Query: 847  IEASSRDFQFYTSGIFSGKCGTDLDHGVVVVGYGTEDGKDYWIVRNSWGAEWGEEGYLRM 668
            IE   R+FQ Y SG+F+G+CGT LDHGV  VGYGTE GKDYWIVRNSWG  WGE GY+RM
Sbjct: 272  IEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYGTEKGKDYWIVRNSWGKSWGESGYIRM 331

Query: 667  ERNIKENEGKCGIAMEPSYPIKNGQNXXXXXXXXXXXXXPEKVCDQYYTCPASTTCCCIY 488
            ERNI    GKCGIA+EPSYPIK GQN             P  VCD Y++CP S+TCCCI+
Sbjct: 332  ERNIASPTGKCGIAIEPSYPIKKGQNPPNPGPSPPSPVKPPSVCDNYFSCPDSSTCCCIF 391

Query: 487  NYHGSCFAWGCCPLEGAACCDDHYSCCPHDYPICNVRRRTCSKRKNSPLEIEALKRILAT 308
             Y   CFAWGCCPLEGA CCDDHYSCCPH+YP+CNV   TC   K +P  ++AL+R  A 
Sbjct: 392  EYGKYCFAWGCCPLEGATCCDDHYSCCPHEYPVCNVNEGTCLISKGNPFGVKALRRTPAK 451

Query: 307  P 305
            P
Sbjct: 452  P 452



 Score = 71.2 bits (173), Expect(2) = e-165
 Identities = 35/49 (71%), Positives = 37/49 (75%)
 Frame = -1

Query: 1655 IIGYDATHMTTADASSSSWRTDDEVNAMYESWLVKHGKFYNALGEKERR 1509
            II Y  TH     A+ SSWRTDDEV AMYE WLVKHGK YNALGEKE+R
Sbjct: 28   IISYHQTH-----ATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKR 71


>ref|XP_002326950.1| predicted protein [Populus trichocarpa] gi|222835265|gb|EEE73700.1|
            predicted protein [Populus trichocarpa]
          Length = 456

 Score =  540 bits (1392), Expect(2) = e-165
 Identities = 246/361 (68%), Positives = 281/361 (77%)
 Frame = -2

Query: 1387 LGLNRFADLSNEEYRSTYTGAKTIDSKRKLNNVKSDRYSPRSDDVLPDFVDWRSKGAVSA 1208
            +GLNRFADL+NEE+RS Y G +T   KR      SDRY+PR  D LPD VDWR +GAV+ 
Sbjct: 85   VGLNRFADLTNEEFRSMYLGTRTGHKKRLPKT--SDRYAPRVGDSLPDSVDWRKEGAVAE 142

Query: 1207 VKDQGSCGSCWAFSTTGSVEGINKLVTGDLITLSEQELVECDTSYNQGCNGGLMDYAFEF 1028
            VKDQG CGSCWAFST  +VEGINK+VTGDLI LSEQELV+CDTSYN+GCNGGLMDYAFEF
Sbjct: 143  VKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGGLMDYAFEF 202

Query: 1027 IIKNGGIDTDTDYPYTGKDGKCDKSRKNSKVVSIDSYEDVPVNDESALQKAAANQPITVA 848
            II NGGIDT+ DYPY G+DG+CD  RKN+KVVSIDSYEDVP NDE+AL+KA ANQP++VA
Sbjct: 203  IINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYEDVPENDETALKKAVANQPVSVA 262

Query: 847  IEASSRDFQFYTSGIFSGKCGTDLDHGVVVVGYGTEDGKDYWIVRNSWGAEWGEEGYLRM 668
            IE   R+FQ Y SG+F+G+CGT LDHGV  VGYGTE GKDYWIVRNSWG  WGE GY+RM
Sbjct: 263  IEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYGTEKGKDYWIVRNSWGKSWGESGYIRM 322

Query: 667  ERNIKENEGKCGIAMEPSYPIKNGQNXXXXXXXXXXXXXPEKVCDQYYTCPASTTCCCIY 488
            ERNI    GKCGIA+EPSYPIK GQN             P  VCD Y++CP S+TCCCI+
Sbjct: 323  ERNIASPTGKCGIAIEPSYPIKKGQNPPNPGPSPPSPVKPPSVCDNYFSCPDSSTCCCIF 382

Query: 487  NYHGSCFAWGCCPLEGAACCDDHYSCCPHDYPICNVRRRTCSKRKNSPLEIEALKRILAT 308
             Y   CFAWGCCPLEGA CCDDHYSCCPH+YP+CNV   TC   K +P  ++AL+R  A 
Sbjct: 383  EYGKYCFAWGCCPLEGATCCDDHYSCCPHEYPVCNVNEGTCLISKGNPFGVKALRRTPAK 442

Query: 307  P 305
            P
Sbjct: 443  P 443



 Score = 71.2 bits (173), Expect(2) = e-165
 Identities = 35/49 (71%), Positives = 37/49 (75%)
 Frame = -1

Query: 1655 IIGYDATHMTTADASSSSWRTDDEVNAMYESWLVKHGKFYNALGEKERR 1509
            II Y  TH     A+ SSWRTDDEV AMYE WLVKHGK YNALGEKE+R
Sbjct: 19   IISYHQTH-----ATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKR 62