BLASTX nr result

ID: Atractylodes21_contig00029526 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00029526
         (1494 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1...   534   e-149
ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2...   521   e-145
ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit,...   519   e-145
ref|XP_002311432.1| predicted protein [Populus trichocarpa] gi|2...   509   e-142
ref|NP_189198.1| aspartyl protease family protein [Arabidopsis t...   505   e-140

>ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  534 bits (1376), Expect = e-149
 Identities = 271/440 (61%), Positives = 322/440 (73%), Gaps = 8/440 (1%)
 Frame = +2

Query: 62   FTHICHSATINSDNH-HFLKLPLLHINPLQSPSQAFXXXXXXXXXXXXXXHRKPNL--PL 232
            FT IC++  I  +    +LKL LLHI P  +PSQA               H   +L  P+
Sbjct: 19   FTDICNALPIAQNGTVEYLKLRLLHIKPFTTPSQALSFDSHRLSFFFSALHTPQSLKSPV 78

Query: 233  TSGAYAGAGQYFVTLHLGTPPQPLLLIADTGSDLIWVACSACRDDCSLTRPAHSAFLARH 412
             SGA  G+GQYFV L LGTPPQ LLL+ADTGSDL+WV CSACR+ C+   P  SAFLARH
Sbjct: 79   VSGASTGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRN-CTRHTPG-SAFLARH 136

Query: 413  SSSFGLHHCFDPACQLVPHPRPPVACNHTRLHSPCRYAYSYADGSITNGFFAKEATSFNS 592
            S++F  +HC+D ACQLVP P+    CNH RLHSPCRY YSY DGS T+GFF+KE T+ N+
Sbjct: 137  STTFSPNHCYDSACQLVPLPKHH-RCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNT 195

Query: 593  STGKALQHDSLAFGCGFKISGPSVSGPSFNGAQGVMGLGRGSISFVTQLGRRFGNKFSYC 772
            S+G+  +   +AFGC F+ISGPSVSG SFNGA GVMGLGRG IS  +QLG RFGNKFSYC
Sbjct: 196  SSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYC 255

Query: 773  LKDYTITPPPTSYLLIGTGARN-----SRMRYTPLQTNPLSHTFYYIGIQSVSVDNVKLR 937
            L D+ I+P PTSYLLIG+   +      RMR+TPL  NPLS TFYYIGI+SVSVD +KL 
Sbjct: 256  LMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLP 315

Query: 938  VSPSVWVIDKLGNGGTIVDSGTTLTFLPDIAYRHVLAAFRRLVKLPTPSGSPPNFDICFN 1117
            ++PSVW +D+LGNGGTIVDSGTTLTFLP+ AY  +L   +R V+LP+P+   P FD+C N
Sbjct: 316  INPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPTPGFDLCVN 375

Query: 1118 VSGIRRPSLPKLSFKLGGNSVFSPPVGNYLIETAEGVKCLALQPVTSPGGFSVIGNLMQQ 1297
            VS I  P LPKLSFKLGG+SVFSPP  NY ++T E VKCLALQ V +P GFSVIGNLMQQ
Sbjct: 376  VSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSGFSVIGNLMQQ 435

Query: 1298 GFLFEFDIGKSRLGFSRRGC 1357
            GFL EFD  ++RLGFSR GC
Sbjct: 436  GFLLEFDKDRTRLGFSRHGC 455


>ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
            gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic
            proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score =  521 bits (1341), Expect = e-145
 Identities = 265/439 (60%), Positives = 319/439 (72%), Gaps = 8/439 (1%)
 Frame = +2

Query: 65   THICH-SATINSDNHHFLKLPLLHINPLQSPSQAFXXXXXXXXXXXXXXHRKPNLPLTSG 241
            TH+ + +AT  +    FLKLPLLH  P  SPSQ+               +     PL SG
Sbjct: 21   THLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFSRPNPTLKSPLISG 80

Query: 242  AYAGAGQYFVTLHLGTPPQPLLLIADTGSDLIWVACSACRDDCSLTRPAHSAFLARHSSS 421
            A  G+GQYFV + LGTPPQ LLL+ADTGSDL+WV CSACR+ CS   P  SAFL RHSSS
Sbjct: 81   ASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRN-CS-HHPPSSAFLPRHSSS 138

Query: 422  FGLHHCFDPACQLVPHPRPPVACNHTRLHSPCRYAYSYADGSITNGFFAKEATSFNSSTG 601
            F   HCFDP C+L+PH  P   CNHTRLHSPCR+ YSYADGS+++GFF+KE T+  S +G
Sbjct: 139  FSPFHCFDPHCRLLPHA-PHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSG 197

Query: 602  KALQHDSLAFGCGFKISGPSVSGPSFNGAQGVMGLGRGSISFVTQLGRRFGNKFSYCLKD 781
              +    L+FGCGF+ISGPSVSG  FNGA+GVMGLGRGSISF +QLGRRFGNKFSYCL D
Sbjct: 198  SEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLMD 257

Query: 782  YTITPPPTSYLLIGTGARN------SRMRYTPLQTNPLSHTFYYIGIQSVSVDNVKLRVS 943
            YT++PPPTS+L+IG G  +      +++ YTPLQ NPLS TFYYI I S+++D VKL ++
Sbjct: 258  YTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPIN 317

Query: 944  PSVWVIDKLGNGGTIVDSGTTLTFLPDIAYRHVLAAFRRLVKLPTPSGSPPNFDICFNVS 1123
            P+VW ID+ GNGGT+VDSGTTLT+L   AY  VL + RR VKLP  +   P FD+C N S
Sbjct: 318  PAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCVNAS 377

Query: 1124 G-IRRPSLPKLSFKLGGNSVFSPPVGNYLIETAEGVKCLALQPVTSPGGFSVIGNLMQQG 1300
            G  RRPSLP+L F+LGG +VF+PP  NY +ET EGV CLA++ V S  GFSVIGNLMQQG
Sbjct: 378  GESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQG 437

Query: 1301 FLFEFDIGKSRLGFSRRGC 1357
            FL EFD  +SRLGF+RRGC
Sbjct: 438  FLLEFDKEESRLGFTRRGC 456


>ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
            communis] gi|223536362|gb|EEF38012.1| basic 7S globulin 2
            precursor small subunit, putative [Ricinus communis]
          Length = 455

 Score =  519 bits (1336), Expect = e-145
 Identities = 263/439 (59%), Positives = 320/439 (72%), Gaps = 13/439 (2%)
 Frame = +2

Query: 80   SATINSDNHHFLKLPLLHINPLQSPSQAFXXXXXXXXXXXXXX-----HRKPNL--PLTS 238
            SA  N+    +LKLPLLH  P  SPS+A                    H++ +   P+ S
Sbjct: 19   SAAANTTTE-YLKLPLLHKTPFTSPSEALAFDINRRLSLLHHHRHQQQHKQNSFRSPVIS 77

Query: 239  GAYAGAGQYFVTLHLGTPPQPLLLIADTGSDLIWVACSACRDDCSLTRPAHSAFLARHSS 418
            GA +G+GQYFV+L +GTPPQ LLL+ADTGSDLIWV CS CR+ CS   P  SAF ARHS+
Sbjct: 78   GASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRN-CSHRSPG-SAFFARHST 135

Query: 419  SFGLHHCFDPACQLVPHPRPPVACNHTRLHSPCRYAYSYADGSITNGFFAKEATSFNSST 598
            ++   HC+ P CQLVPHP P   CN TRLHSPCRY Y+YAD S T GFF+KEA + N+ST
Sbjct: 136  TYSAIHCYSPQCQLVPHPHPN-PCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTST 194

Query: 599  GKALQHDSLAFGCGFKISGPSVSGPSFNGAQGVMGLGRGSISFVTQLGRRFGNKFSYCLK 778
            GK  + + L+FGCGF+ISGPS++G SF GAQGVMGLGR  ISF +QLGRRFG+KFSYCL 
Sbjct: 195  GKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKFSYCLM 254

Query: 779  DYTITPPPTSYLLIGTGARNSR------MRYTPLQTNPLSHTFYYIGIQSVSVDNVKLRV 940
            DYT++PPPTS+L IG GA+N        M +TPL  NPLS TFYYI I+ V V+ VKL +
Sbjct: 255  DYTLSPPPTSFLTIG-GAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPI 313

Query: 941  SPSVWVIDKLGNGGTIVDSGTTLTFLPDIAYRHVLAAFRRLVKLPTPSGSPPNFDICFNV 1120
            +PSVW ID LGNGGTI+DSGTTLTF+ + AY  +L AF++ VKLP+P+   P FD+C NV
Sbjct: 314  NPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGFDLCMNV 373

Query: 1121 SGIRRPSLPKLSFKLGGNSVFSPPVGNYLIETAEGVKCLALQPVTSPGGFSVIGNLMQQG 1300
            SG+ RP+LP++SF L G SVFSPP  NY IET + +KCLA+QPV+  GGFSV+GNLMQQG
Sbjct: 374  SGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDGGFSVLGNLMQQG 433

Query: 1301 FLFEFDIGKSRLGFSRRGC 1357
            FL EFD  KSRLGF+RRGC
Sbjct: 434  FLLEFDRDKSRLGFTRRGC 452


>ref|XP_002311432.1| predicted protein [Populus trichocarpa] gi|222851252|gb|EEE88799.1|
            predicted protein [Populus trichocarpa]
          Length = 458

 Score =  509 bits (1311), Expect = e-142
 Identities = 254/440 (57%), Positives = 313/440 (71%), Gaps = 17/440 (3%)
 Frame = +2

Query: 89   INSDNHHFLKLPLLHINPLQSPSQAFXXXXXXXXXXXXXXHRKPN-------LPLTSGAY 247
            +++    +LKLPLLH  P  +P Q+               HR  N        PL SGA 
Sbjct: 18   LSTSTTEYLKLPLLHKTPFPTPLQSLSSDLQRLSLLHHSHHRHQNHRRTSSKSPLMSGAS 77

Query: 248  AGAGQYFVTLHLGTPPQPLLLIADTGSDLIWVACSACRDDCSLTRPAHSAFLARHSSSFG 427
            +G+GQYFV++ LG+PPQ LLL+ADTGSDL WV CSAC+ +CS+  P  S FLARHS++F 
Sbjct: 78   SGSGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSI-HPPGSTFLARHSTTFS 136

Query: 428  LHHCFDPACQLVPHPRPPVACNHTRLHSPCRYAYSYADGSITNGFFAKEATSFNSSTGKA 607
              HCF   CQLVP P P   CNHTRLHS CRY Y Y+DGS T+GFF+KE T+ N+S+G+ 
Sbjct: 137  PTHCFSSLCQLVPQPNPN-PCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGRE 195

Query: 608  LQHDSLAFGCGFKISGPSVSGPSFNGAQGVMGLGRGSISFVTQLGRRFGNKFSYCLKDYT 787
            ++  S+AFGCGF  SGPS+ G SFNGA GVMGLGRG ISF +QLGRRFG  FSYCL DYT
Sbjct: 196  MKLKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSYCLLDYT 255

Query: 788  ITPPPTSYLLIGTGA-----RNSRMRYTPLQTNPLSHTFYYIGIQSVSVDNVKLRVSPSV 952
            ++PPPTSYL+IG          S M +TPL  NP + TFYYI I+ V VD VKL + PSV
Sbjct: 256  LSPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHIDPSV 315

Query: 953  WVIDKLGNGGTIVDSGTTLTFLPDIAYRHVLAAFRRLVKLPTP----SGSPPNFDICFNV 1120
            W +D+LGNGGT++DSGTTLTFL + AYR +L+AF+R VKLP+P    + +   FD+C NV
Sbjct: 316  WSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGASTRSGFDLCVNV 375

Query: 1121 SGIRRPSLPKLSFKLGGNSVFSPPVGNYLIETAEGVKCLALQPVTSPGG-FSVIGNLMQQ 1297
            +G+ RP  P+LS +LGG S++SPP  NY I+ +EG+KCLA+QPV +  G FSVIGNLMQQ
Sbjct: 376  TGVSRPRFPRLSLELGGESLYSPPPRNYFIDISEGIKCLAIQPVEAESGRFSVIGNLMQQ 435

Query: 1298 GFLFEFDIGKSRLGFSRRGC 1357
            GFL EFD GKSRLGFSRRGC
Sbjct: 436  GFLLEFDRGKSRLGFSRRGC 455


>ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
            gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA
            binding protein-like; nucellin-like protein [Arabidopsis
            thaliana] gi|189339286|gb|ACD89063.1| At3g25700
            [Arabidopsis thaliana] gi|332643533|gb|AEE77054.1|
            aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score =  505 bits (1300), Expect = e-140
 Identities = 256/428 (59%), Positives = 307/428 (71%), Gaps = 7/428 (1%)
 Frame = +2

Query: 95   SDNHHFLKLPLLHINPLQSPSQAFXXXXXXXXXXXXXXHRKP----NLPLTSGAYAGAGQ 262
            S+++ +LKLPLL  +P  SP+QA                RKP      P+ SGA +G+GQ
Sbjct: 26   SNHNKYLKLPLLRKSPFPSPTQALALDTRRLHFLSL--RRKPIPFVKSPVVSGAASGSGQ 83

Query: 263  YFVTLHLGTPPQPLLLIADTGSDLIWVACSACRDDCSLTRPAHSAFLARHSSSFGLHHCF 442
            YFV L +G PPQ LLLIADTGSDL+WV CSACR+ CS   PA + F  RHSS+F   HC+
Sbjct: 84   YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRN-CSHHSPA-TVFFPRHSSTFSPAHCY 141

Query: 443  DPACQLVPHPRPPVACNHTRLHSPCRYAYSYADGSITNGFFAKEATSFNSSTGKALQHDS 622
            DP C+LVP P     CNHTR+HS C Y Y YADGS+T+G FA+E TS  +S+GK  +  S
Sbjct: 142  DPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKS 201

Query: 623  LAFGCGFKISGPSVSGPSFNGAQGVMGLGRGSISFVTQLGRRFGNKFSYCLKDYTITPPP 802
            +AFGCGF+ISG SVSG SFNGA GVMGLGRG ISF +QLGRRFGNKFSYCL DYT++PPP
Sbjct: 202  VAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPP 261

Query: 803  TSYLLIGTGARN-SRMRYTPLQTNPLSHTFYYIGIQSVSVDNVKLRVSPSVWVIDKLGNG 979
            TSYL+IG G    S++ +TPL TNPLS TFYY+ ++SV V+  KLR+ PS+W ID  GNG
Sbjct: 262  TSYLIIGNGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNG 321

Query: 980  GTIVDSGTTLTFLPDIAYRHVLAAFRRLVKLPTPSGSPPNFDICFNVSGIRRPS--LPKL 1153
            GT+VDSGTTL FL + AYR V+AA RR VKLP      P FD+C NVSG+ +P   LP+L
Sbjct: 322  GTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPGFDLCVNVSGVTKPEKILPRL 381

Query: 1154 SFKLGGNSVFSPPVGNYLIETAEGVKCLALQPVTSPGGFSVIGNLMQQGFLFEFDIGKSR 1333
             F+  G +VF PP  NY IET E ++CLA+Q V    GFSVIGNLMQQGFLFEFD  +SR
Sbjct: 382  KFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSR 441

Query: 1334 LGFSRRGC 1357
            LGFSRRGC
Sbjct: 442  LGFSRRGC 449


Top