BLASTX nr result

ID: Atractylodes21_contig00002052 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00002052
         (1593 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor,...   536   e-150
ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis tha...   508   e-141
dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]     504   e-140
ref|XP_002884082.1| aspartyl protease family protein [Arabidopsi...   503   e-140
ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein ...   500   e-139

>ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223525945|gb|EEF28342.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 533

 Score =  536 bits (1381), Expect = e-150
 Identities = 265/471 (56%), Positives = 340/471 (72%), Gaps = 10/471 (2%)
 Frame = -1

Query: 1593 QMGSVDYYSAMARRDRIFHGRRLASVESS--LAFIDGNETYQLPALGYLHYANVSVGNPS 1420
            + GS+ YY++MA RD + HGR+L S  +S  L F  GNETY+  +LG+LHYANVS+G PS
Sbjct: 64   EKGSLHYYASMAHRDILIHGRKLVSDNTSTPLTFFSGNETYRFSSLGFLHYANVSIGTPS 123

Query: 1419 LWFLVALDTGSDLFWIPCNCRS--CVKGLVTRSGRHLDFNIYSPNTSSTSQRVPCDSSSC 1246
            L +LVALDTGSDLFW+PC+C +  CV+GL   SG  +DFNIY PN SSTSQ +PC+++ C
Sbjct: 124  LSYLVALDTGSDLFWLPCDCTNSGCVQGLQFPSGEQIDFNIYRPNASSTSQTIPCNNTLC 183

Query: 1245 KLRKQCSARPDICPYQVNYLSSNTSSTGILIEDTLHLTTEDSSLKAVDAKIKFGCGMIQT 1066
              + +C +    CPYQV YLS+ TSSTG+L+ED LHLTT+D+  +A+DAKI FGCG +QT
Sbjct: 184  SRQSRCPSAQSTCPYQVQYLSNGTSSTGVLVEDLLHLTTDDAQSRALDAKIIFGCGRVQT 243

Query: 1065 GSFLDGAAPNGLFGLGMENLSVPSILASSGLAANSFSMCFDPYGAGRINFGDKGSSDQGE 886
            GSFLDGAAPNGLFGLGM N+SVPS LA  G  +NSFSMCF   G GRI+FGD GSS QGE
Sbjct: 244  GSFLDGAAPNGLFGLGMTNISVPSTLAREGYTSNSFSMCFGRDGIGRISFGDTGSSGQGE 303

Query: 885  TPFNLETPHRTYNISMTQTVVGDNITDVDFDAIFDTGTSFTYLNDPAYSIISESFASQTK 706
            TPFNL   H TYN+S+T+  VG    D++F AIFD+GTSFTYLNDPAY++ISESF    K
Sbjct: 304  TPFNLRQLHPTYNVSITKINVGGRDADLEFSAIFDSGTSFTYLNDPAYTLISESFNIGAK 363

Query: 705  ETRSRPNSDLPFEYCYDLSPNQKTFEAPLLNLTMRGGDQFAVTDPLIIVPLEEGRSSFCL 526
            E R    SD+PFEYCY++S NQ   E P +NL M+GG QF VTDP++IV L+ G S +CL
Sbjct: 364  EKRYSSISDIPFEYCYEMSSNQTNLEIPTVNLVMQGGSQFNVTDPIVIVILQGGASIYCL 423

Query: 525  GIVKSEDVNIIGQNFMTGYRVVFDREKNVLGWKASNCYNAIESNTLXXXXXXXXXXXPAM 346
             IVKS DVNIIGQNFMTGYR+VF+RE+NVLGWKAS+CY+ +++ T            PA 
Sbjct: 424  AIVKSGDVNIIGQNFMTGYRIVFNRERNVLGWKASDCYDDMDTTTF-PVDPISPGIPPAT 482

Query: 345  SVGPEATSRNGSPNSSQAQPGSIASG------LKTLSYTHLILVISILTMV 211
            +V P+AT+ +G+       P  + +       L +L++  ++++I   T+V
Sbjct: 483  AVNPQATAGSGNTTEVSGTPPPVGNNAPKLPKLNSLTFAIIMVLIPFFTIV 533


>ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
            gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid
            DNA-binding protein [Arabidopsis thaliana]
            gi|22655368|gb|AAM98276.1| At2g17760/At2g17760
            [Arabidopsis thaliana] gi|330251585|gb|AEC06679.1|
            aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 513

 Score =  508 bits (1307), Expect = e-141
 Identities = 255/461 (55%), Positives = 326/461 (70%), Gaps = 3/461 (0%)
 Frame = -1

Query: 1584 SVDYYSAMARRDRIFHGRRLASVESSLA-FIDGNETYQLPALGYLHYANVSVGNPSLWFL 1408
            S  YY  MA RDR+  GRRLA+ + SL  F DGNET ++ ALG+LHYANV+VG PS WF+
Sbjct: 59   SSKYYRVMAHRDRLIRGRRLANEDQSLVTFSDGNETVRVDALGFLHYANVTVGTPSDWFM 118

Query: 1407 VALDTGSDLFWIPCNCRSCVKGLVTRSGRHLDFNIYSPNTSSTSQRVPCDSSSCKLRKQC 1228
            VALDTGSDLFW+PC+C +CV+ L    G  LD NIYSPN SSTS +VPC+S+ C    +C
Sbjct: 119  VALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRC 178

Query: 1227 SARPDICPYQVNYLSSNTSSTGILIEDTLHLTTEDSSLKAVDAKIKFGCGMIQTGSFLDG 1048
            ++    CPYQ+ YLS+ TSSTG+L+ED LHL + D S KA+ A++ FGCG +QTG F DG
Sbjct: 179  ASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTFGCGQVQTGVFHDG 238

Query: 1047 AAPNGLFGLGMENLSVPSILASSGLAANSFSMCFDPYGAGRINFGDKGSSDQGETPFNLE 868
            AAPNGLFGLG+E++SVPS+LA  G+AANSFSMCF   GAGRI+FGDKGS DQ ETP N+ 
Sbjct: 239  AAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIR 298

Query: 867  TPHRTYNISMTQTVVGDNITDVDFDAIFDTGTSFTYLNDPAYSIISESFASQTKETR-SR 691
             PH TYNI++T+  VG N  D++FDA+FD+GTSFTYL D AY++ISESF S   + R   
Sbjct: 299  QPHPTYNITVTKISVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQT 358

Query: 690  PNSDLPFEYCYDLSPNQKTFEAPLLNLTMRGGDQFAVTDPLIIVPLEEGRSSFCLGIVKS 511
             +S+LPFEYCY LSPN+ +F+ P +NLTM+GG  + V  PL+++P+++    +CL I+K 
Sbjct: 359  TDSELPFEYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKD-TDVYCLAIMKI 417

Query: 510  EDVNIIGQNFMTGYRVVFDREKNVLGWKASNCYNA-IESNTLXXXXXXXXXXXPAMSVGP 334
            ED++IIGQNFMTGYRVVFDREK +LGWK S+CY     + TL           PA S  P
Sbjct: 418  EDISIIGQNFMTGYRVVFDREKLILGWKESDCYTGETSARTLPSNRSSSSARPPASSFDP 477

Query: 333  EATSRNGSPNSSQAQPGSIASGLKTLSYTHLILVISILTMV 211
            EAT+       SQ    S  S   +LS +  +   SIL ++
Sbjct: 478  EATN-----IPSQRPNTSTTSAAYSLSISLSLFFFSILAIL 513


>dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
          Length = 515

 Score =  504 bits (1299), Expect = e-140
 Identities = 255/458 (55%), Positives = 327/458 (71%), Gaps = 6/458 (1%)
 Frame = -1

Query: 1584 SVDYYSAMARRDRIFHGRRLASVESSLA-FIDGNETYQLPALGYLHYANVSVGNPSLWFL 1408
            S  YY  MA RDR+  GRRLAS + SL  F DGNET ++ ALG+LHYANV+VG PS WFL
Sbjct: 59   SSKYYRVMAHRDRLIRGRRLASEDQSLVTFADGNETIRVNALGFLHYANVTVGTPSDWFL 118

Query: 1407 VALDTGSDLFWIPCNCRS-CVKGLVTRSGRHLDFNIYSPNTSSTSQRVPCDSSSCKLRKQ 1231
            VALDTGSDLFW+PC+C + CV+ L    G  LD NIYSPN SSTS +VPC+S+ C    +
Sbjct: 119  VALDTGSDLFWLPCDCSTNCVRELKAPGGSSLDLNIYSPNASSTSSKVPCNSTLCTRVDR 178

Query: 1230 CSARPDICPYQVNYLSSNTSSTGILIEDTLHLTTEDSSLKAVDAKIKFGCGMIQTGSFLD 1051
            C++    CPYQ+ YLS+ TSSTG+L+ED LHL + + + K + A+I  GCG++QTG F D
Sbjct: 179  CASPLSDCPYQIRYLSNGTSSTGVLVEDVLHLVSMEKNSKPIRARITLGCGLVQTGVFHD 238

Query: 1050 GAAPNGLFGLGMENLSVPSILASSGLAANSFSMCFDPYGAGRINFGDKGSSDQGETPFNL 871
            GAAPNGLFGLG+E++SVPS+LA  G+AANSFSMCF   GAGRI+FGDKGS DQ ETP N+
Sbjct: 239  GAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGDDGAGRISFGDKGSVDQRETPLNI 298

Query: 870  ETPHRTYNISMTQTVVGDNITDVDFDAIFDTGTSFTYLNDPAYSIISESFASQTKETRSR 691
              PH TYN+++TQ  VG N  D++FDA+FDTGTSFTYL D  Y++ISESF S   + R +
Sbjct: 299  RQPHPTYNVTVTQISVGGNTGDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLALDKRYQ 358

Query: 690  PNSDLPFEYCYDLSPNQKTFEAPLLNLTMRGGDQFAVTDPLIIVPLEEGRSSFCLGIVKS 511
             +S+LPFEYCY +SPN+K+FE P +NLTM+GG  + V  PLI+VP+E+    +CL I+KS
Sbjct: 359  TDSELPFEYCYAVSPNKKSFEYPDVNLTMKGGSSYPVYHPLIVVPIED-TVVYCLAIMKS 417

Query: 510  EDVNIIGQNFMTGYRVVFDREKNVLGWKASNCYNA-IESNTLXXXXXXXXXXXPAMSVGP 334
            ED++IIGQNFMTGYRVVFDREK +LGWK S+C      + T            PA S  P
Sbjct: 418  EDISIIGQNFMTGYRVVFDREKLILGWKESDCSTGETSARTQPSNRSSSSARPPASSFDP 477

Query: 333  EAT---SRNGSPNSSQAQPGSIASGLKTLSYTHLILVI 229
            EAT   S+  S +SS +   S++  L  L +  ++ ++
Sbjct: 478  EATNIPSQRPSSSSSSSYSYSLSLSLPFLYFFSILAIL 515


>ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297329922|gb|EFH60341.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score =  503 bits (1296), Expect = e-140
 Identities = 254/461 (55%), Positives = 325/461 (70%), Gaps = 3/461 (0%)
 Frame = -1

Query: 1584 SVDYYSAMARRDRIFHGRRLASVESSLA-FIDGNETYQLPALGYLHYANVSVGNPSLWFL 1408
            S  YY  MA RDR+  GRRLA+ + SL  F DGNET ++ ALG+LHYANV+VG PS WFL
Sbjct: 59   SSKYYRVMAHRDRLIRGRRLANEDQSLVTFSDGNETIRVDALGFLHYANVTVGTPSDWFL 118

Query: 1407 VALDTGSDLFWIPCNCRSCVKGLVTRSGRHLDFNIYSPNTSSTSQRVPCDSSSCKLRKQC 1228
            VALDTGSDLFW+PC+C +CV+ L    G  LD NIYSPN SSTS +VPC+S+ C    +C
Sbjct: 119  VALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRC 178

Query: 1227 SARPDICPYQVNYLSSNTSSTGILIEDTLHLTTEDSSLKAVDAKIKFGCGMIQTGSFLDG 1048
            ++    CPYQ+ YLS+ TSSTG+L+ED LHL + D S KA+ A++  GCG +QTG F DG
Sbjct: 179  ASPESNCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTLGCGQVQTGVFHDG 238

Query: 1047 AAPNGLFGLGMENLSVPSILASSGLAANSFSMCFDPYGAGRINFGDKGSSDQGETPFNLE 868
            AAPNGLFGLG+E++SVPS+LA  G+AANSFSMCF   GAGRI+FGDKGS DQ ETP N+ 
Sbjct: 239  AAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIR 298

Query: 867  TPHRTYNISMTQTVVGDNITDVDFDAIFDTGTSFTYLNDPAYSIISESFASQTKETR-SR 691
             PH TYNI++T+  V  N  D++FDA+FD+GTSFTYL D AY++ISESF S   + R   
Sbjct: 299  QPHPTYNITVTKISVEGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQT 358

Query: 690  PNSDLPFEYCYDLSPNQKTFEAPLLNLTMRGGDQFAVTDPLIIVPLEEGRSSFCLGIVKS 511
             +S+LPFEYCY LSPN+ +F+ P +NLTM+GG  + V  PL+++P+++    +CL I+K 
Sbjct: 359  TDSELPFEYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKD-TDVYCLAILKI 417

Query: 510  EDVNIIGQNFMTGYRVVFDREKNVLGWKASNCYNA-IESNTLXXXXXXXXXXXPAMSVGP 334
            ED++IIGQNFMTGYRVVFDREK +LGWK S+CY     + TL           PA S  P
Sbjct: 418  EDISIIGQNFMTGYRVVFDREKLILGWKESDCYTGETSARTLPSNRSSSSARPPASSFDP 477

Query: 333  EATSRNGSPNSSQAQPGSIASGLKTLSYTHLILVISILTMV 211
            EAT+       SQ    S +S   +LS +  +   SIL ++
Sbjct: 478  EATN-----IPSQRPNTSTSSAAYSLSISLSLFFFSILAIL 513


>ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 508

 Score =  500 bits (1288), Expect = e-139
 Identities = 259/438 (59%), Positives = 312/438 (71%), Gaps = 8/438 (1%)
 Frame = -1

Query: 1587 GSVDYYSAMARRDRIFHGRRLAS-VESSLAFIDGNETYQLPALGYLHYANVSVGNPSLWF 1411
            G+  YY AMA RDRIF GRRLA+   S L FI  NETYQ+ A G+LH+ANVSVG P L F
Sbjct: 55   GTRQYYVAMAHRDRIFRGRRLAAGYHSPLTFIPSNETYQIEAFGFLHFANVSVGTPPLSF 114

Query: 1410 LVALDTGSDLFWIPCNCRSCVKGLVTRSGRHLDFNIYSPNTSSTSQRVPCDSSSCKLRKQ 1231
            LVALDTGSDLFW+PCNC  CV G+   +G  + FNIY    SSTSQ V C+SS C+L++Q
Sbjct: 115  LVALDTGSDLFWLPCNCTKCVHGIGLSNGEKIAFNIYDLKGSSTSQPVLCNSSLCELQRQ 174

Query: 1230 CSARPDICPYQVNYLSSNTSSTGILIEDTLHLTTEDSSLKAVDAKIKFGCGMIQTGSFLD 1051
            C +   ICPY+VNYLS+ TS+TG L+ED LHL T+D   K  D +I FGCG +QTG+FLD
Sbjct: 175  CPSSDTICPYEVNYLSNGTSTTGFLVEDVLHLITDDDKTKDADTRITFGCGQVQTGAFLD 234

Query: 1050 GAAPNGLFGLGMENLSVPSILASSGLAANSFSMCFDPYGAGRINFGDKGSSDQGETPFNL 871
            GAAPNGLFGLGM N SVPSILA  GL +NSFSMCF   G GRI FGD  S  QG+TPFNL
Sbjct: 235  GAAPNGLFGLGMSNESVPSILAKEGLTSNSFSMCFGSDGLGRITFGDNSSLVQGKTPFNL 294

Query: 870  ETPHRTYNISMTQTVVGDNITDVDFDAIFDTGTSFTYLNDPAYSIISESFASQTKETRSR 691
               H TYNI++TQ +VG+ + D++F AIFD+GTSFTYLNDPAY  I+ SF S+ K  R  
Sbjct: 295  RALHPTYNITVTQIIVGEKVDDLEFHAIFDSGTSFTYLNDPAYKQITNSFNSEIKLQRHS 354

Query: 690  PNS--DLPFEYCYDLSPNQKTFEAPLLNLTMRGGDQFAVTDPLIIVPLEEGRSSFCLGIV 517
             +S  +LPFEYCY+LSPNQ T E   +NLTM+GGD + VTDP++ V   EG +  CLG++
Sbjct: 355  TSSSNELPFEYCYELSPNQ-TVELS-INLTMKGGDNYLVTDPIVTVS-GEGINLLCLGVL 411

Query: 516  KSEDVNIIGQNFMTGYRVVFDREKNVLGWKASNCYNAIESNTLXXXXXXXXXXXPAMSVG 337
            KS +VNIIGQNFMTGYR+VFDRE  +LGW+ SNCY+  E +TL           PA++V 
Sbjct: 412  KSNNVNIIGQNFMTGYRIVFDRENMILGWRESNCYDD-ELSTLPINRSNTPAISPAIAVN 470

Query: 336  PEATSRNG-----SPNSS 298
            PEA S        SPN S
Sbjct: 471  PEARSSQSNNPVLSPNLS 488


Top