BLASTX nr result

ID: Bupleurum21_contig00000129 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00000129
         (1851 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABG37021.1| aspartic protease [Nicotiana tabacum]                  650   0.0  
emb|CAA70340.1| aspartic proteinase [Centaurea calcitrapa]            658   0.0  
gb|AFB73927.2| preprocirsin [Cirsium vulgare]                         652   0.0  
emb|CAA57510.1| cyprosin [Cynara cardunculus]                         658   0.0  
gb|ACX55829.1| aspartic proteinase 1 [Castanea mollissima]            636   0.0  

>gb|ABG37021.1| aspartic protease [Nicotiana tabacum]
          Length = 508

 Score =  650 bits (1676), Expect(2) = 0.0
 Identities = 314/413 (76%), Positives = 357/413 (86%)
 Frame = -3

Query: 1486 KFTVIFDTGSSNLWVPSAKCYFSVACLFXXXXXXXXXXXXXKNGKSAAIQYGTGSISGFF 1307
            KFTVIFDTGSSNLWVPSA+CYFS+AC               KNG SAAI+YGTGSISG+F
Sbjct: 97   KFTVIFDTGSSNLWVPSARCYFSLACYLHPKYKSSHSSTYKKNGTSAAIRYGTGSISGYF 156

Query: 1306 SQDSVKVGDLVVKEQDFIEATKEPGITFLAAKFDGILGLGFQEISVGNAVPVWYSMVNQG 1127
            S D+VKVGDL+VK+QDFIEAT+EPGITFLAAKFDGILGLGFQEISVG +VPVWY+MVNQG
Sbjct: 157  SNDNVKVGDLIVKDQDFIEATREPGITFLAAKFDGILGLGFQEISVGKSVPVWYNMVNQG 216

Query: 1126 LVKEPVFSFWLNRNAXXXXXXELVFGGVDTTHYKGEHTYVPVSHKGYWQFDMGDVLVGGE 947
            LVK+PVFSFW NRNA      ELVFGGVD  H+KG+HTYVPV+HKGYWQFDMGDVLVGGE
Sbjct: 217  LVKKPVFSFWFNRNAQEEEGGELVFGGVDPNHFKGKHTYVPVTHKGYWQFDMGDVLVGGE 276

Query: 946  STGLCSGGCSAIADSGTSLLAGPTTVITQINHAIGASGVMSQECKSVVSQYGKTILDLLL 767
            +TG CSGGCSAIADSGTSLLAGPTT+ITQINH IGASGV+SQECKS+V++YGKTILDLL 
Sbjct: 277  TTGFCSGGCSAIADSGTSLLAGPTTIITQINHVIGASGVVSQECKSLVTEYGKTILDLLE 336

Query: 766  SETQPLKVCSQIGLCSSHGSHDHSMIIESVVDINSGKTSGGLHDEMCTFCEMTVAWMQSQ 587
            S+  P K+CSQIGLCSS GS D SMIIESVVD ++G  S GL DEMC  CEM V WMQ+Q
Sbjct: 337  SKAAPQKICSQIGLCSSDGSRDVSMIIESVVDKHNG-ASNGLGDEMCRVCEMAVIWMQNQ 395

Query: 586  LLKNQTEDKIIDYVNELCNRLPSPMGESAVDCNTLSSLPSVSFTLGDKVFDLSAEQYVLK 407
            + +N+T D I DYVN+LC+RLPSPMGESAVDC++L+S+P+VSFT+G++ F L+ +QYVL+
Sbjct: 396  MRRNETADSIYDYVNQLCDRLPSPMGESAVDCSSLASMPNVSFTVGNQTFGLTPQQYVLQ 455

Query: 406  VGEGTAAQCISGFIALDVPPPRGPLWILGDVFMGKYHTVFDYGNMKVGFAEAA 248
            VGEG  AQCISGF ALDVPPPRGPLWILGDVFMG+YHTVFDYGN +VGFAEAA
Sbjct: 456  VGEGPVAQCISGFTALDVPPPRGPLWILGDVFMGRYHTVFDYGNSRVGFAEAA 508



 Score =  117 bits (292), Expect(2) = 0.0
 Identities = 60/103 (58%), Positives = 71/103 (68%), Gaps = 1/103 (0%)
 Frame = -1

Query: 1845 MGTNFGLCXXXXXXXXXXAPTVFSSSNDGLIRVELKKRNVDPVNRLARHVNSN-EGSARN 1669
            MGT +G C          +P VFS SNDGLIRV +KKR +D +N+    ++SN   SAR 
Sbjct: 1    MGTRYGACLSALCLLLLLSPMVFSVSNDGLIRVGIKKRKLDQINQAFGGIDSNGANSART 60

Query: 1668 YGLRGNGGDPDADIVGLKNYMDAQYFGEIGIGTPPQKFTVIFD 1540
            Y L GN GD D DI+ LKNY+DAQYFGEI IG+PPQKFTVIFD
Sbjct: 61   YHLGGNIGDSDTDIIALKNYLDAQYFGEICIGSPPQKFTVIFD 103


>emb|CAA70340.1| aspartic proteinase [Centaurea calcitrapa]
          Length = 509

 Score =  658 bits (1698), Expect(2) = 0.0
 Identities = 314/413 (76%), Positives = 359/413 (86%)
 Frame = -3

Query: 1486 KFTVIFDTGSSNLWVPSAKCYFSVACLFXXXXXXXXXXXXXKNGKSAAIQYGTGSISGFF 1307
            KFTVIFDTGSSNLWVPSAKCYFSVACLF             KNG SAAIQYGTGSISGF 
Sbjct: 97   KFTVIFDTGSSNLWVPSAKCYFSVACLFHSKYKSSHSSTYKKNGTSAAIQYGTGSISGFV 156

Query: 1306 SQDSVKVGDLVVKEQDFIEATKEPGITFLAAKFDGILGLGFQEISVGNAVPVWYSMVNQG 1127
            SQDSVK+GDLVVKEQDFIEATKEPG+TFLAAKFDGILGLGFQEISVG +VPVWY+MVNQG
Sbjct: 157  SQDSVKLGDLVVKEQDFIEATKEPGVTFLAAKFDGILGLGFQEISVGKSVPVWYNMVNQG 216

Query: 1126 LVKEPVFSFWLNRNAXXXXXXELVFGGVDTTHYKGEHTYVPVSHKGYWQFDMGDVLVGGE 947
            LV+EPVFSFW NRNA      ELVFGGVD  H+KG+HTYVPV+ KGYWQF+MGDVL+  +
Sbjct: 217  LVQEPVFSFWFNRNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFNMGDVLIEDK 276

Query: 946  STGLCSGGCSAIADSGTSLLAGPTTVITQINHAIGASGVMSQECKSVVSQYGKTILDLLL 767
            +TG C+ GC+AIADSGTSLLAGPT +ITQINHAIGA GVMSQ+CK++V QYGKTI+++LL
Sbjct: 277  TTGFCADGCAAIADSGTSLLAGPTAIITQINHAIGAKGVMSQQCKTLVDQYGKTIIEMLL 336

Query: 766  SETQPLKVCSQIGLCSSHGSHDHSMIIESVVDINSGKTSGGLHDEMCTFCEMTVAWMQSQ 587
            SE QP K+CSQ+ LC+  G+ D S IIESVVD N+GK+SGG+HDEMCTFCEM V WMQ+Q
Sbjct: 337  SEAQPDKICSQMKLCTFDGARDVSSIIESVVDKNNGKSSGGVHDEMCTFCEMAVVWMQNQ 396

Query: 586  LLKNQTEDKIIDYVNELCNRLPSPMGESAVDCNTLSSLPSVSFTLGDKVFDLSAEQYVLK 407
            + +NQTED II+YVNELC+RLPSPMGESAVDCN LSS+P+++FT+G KVF+L  EQY+LK
Sbjct: 397  IKRNQTEDNIINYVNELCDRLPSPMGESAVDCNDLSSMPNIAFTIGGKVFELCPEQYILK 456

Query: 406  VGEGTAAQCISGFIALDVPPPRGPLWILGDVFMGKYHTVFDYGNMKVGFAEAA 248
            +GEG AAQCISGF A+DV PPRGPLWILGDVFMG+YHTVFDYG ++VGFAEAA
Sbjct: 457  IGEGEAAQCISGFTAMDVAPPRGPLWILGDVFMGQYHTVFDYGKLRVGFAEAA 509



 Score =  107 bits (268), Expect(2) = 0.0
 Identities = 54/84 (64%), Positives = 65/84 (77%), Gaps = 1/84 (1%)
 Frame = -1

Query: 1788 PTVFSSSNDGLIRVELKKRNVDPVNRLARHVNSNEGSAR-NYGLRGNGGDPDADIVGLKN 1612
            PT FS+SN GL+RV LKKR VD +N+L  H  S EG AR ++G  G+  D D+DI+ LKN
Sbjct: 20   PTAFSASNGGLLRVGLKKRKVDQINQLRNHGASMEGKARKDFGFGGSLRDSDSDIIELKN 79

Query: 1611 YMDAQYFGEIGIGTPPQKFTVIFD 1540
            YMDAQY+GEIGIG+P QKFTVIFD
Sbjct: 80   YMDAQYYGEIGIGSPAQKFTVIFD 103


>gb|AFB73927.2| preprocirsin [Cirsium vulgare]
          Length = 509

 Score =  652 bits (1681), Expect(2) = 0.0
 Identities = 312/413 (75%), Positives = 358/413 (86%)
 Frame = -3

Query: 1486 KFTVIFDTGSSNLWVPSAKCYFSVACLFXXXXXXXXXXXXXKNGKSAAIQYGTGSISGFF 1307
            KFTVIFDTGSSNLWVPSAKCYFSVACLF             KNG SAAIQYGTGSISGF 
Sbjct: 97   KFTVIFDTGSSNLWVPSAKCYFSVACLFHSKYKSSHSSTYKKNGTSAAIQYGTGSISGFV 156

Query: 1306 SQDSVKVGDLVVKEQDFIEATKEPGITFLAAKFDGILGLGFQEISVGNAVPVWYSMVNQG 1127
            SQDSVK+GDLVVKEQDFIEATKEPGITFLAAKFDGILGLGFQEISVG +VPVWY+MVNQG
Sbjct: 157  SQDSVKLGDLVVKEQDFIEATKEPGITFLAAKFDGILGLGFQEISVGKSVPVWYNMVNQG 216

Query: 1126 LVKEPVFSFWLNRNAXXXXXXELVFGGVDTTHYKGEHTYVPVSHKGYWQFDMGDVLVGGE 947
            LV+EPVFSFW NRNA      ELVFGGVD  H+KG+HTYVPV+ KGYWQF+MGDVL+  +
Sbjct: 217  LVQEPVFSFWFNRNANEEEGGELVFGGVDPNHFKGKHTYVPVTEKGYWQFNMGDVLIEDK 276

Query: 946  STGLCSGGCSAIADSGTSLLAGPTTVITQINHAIGASGVMSQECKSVVSQYGKTILDLLL 767
            +TG CS GC+AIADSGTSLLAGPT +IT+INHA GA GVMSQ+CK++VSQYGK+I+++LL
Sbjct: 277  TTGFCSDGCAAIADSGTSLLAGPTAIITEINHASGAKGVMSQQCKTLVSQYGKSIIEMLL 336

Query: 766  SETQPLKVCSQIGLCSSHGSHDHSMIIESVVDINSGKTSGGLHDEMCTFCEMTVAWMQSQ 587
            SE QP K+CSQ+ LC+  G+ D S IIESVVD N+GK+SGG +DEMCTFCEM V WMQ+Q
Sbjct: 337  SEAQPDKICSQMKLCTFDGARDVSSIIESVVDKNNGKSSGGANDEMCTFCEMAVVWMQNQ 396

Query: 586  LLKNQTEDKIIDYVNELCNRLPSPMGESAVDCNTLSSLPSVSFTLGDKVFDLSAEQYVLK 407
            + +N+TED II+YVNELC+RLPSPMGESAVDCN+LSS+P+++FT+G KVF+L  EQY+LK
Sbjct: 397  IKRNETEDNIINYVNELCDRLPSPMGESAVDCNSLSSMPNIAFTIGGKVFELCPEQYILK 456

Query: 406  VGEGTAAQCISGFIALDVPPPRGPLWILGDVFMGKYHTVFDYGNMKVGFAEAA 248
            +GEG AAQCISGF A+DV PPRGPLWILGDVFMG+YHTVFDYG  +VGFAEAA
Sbjct: 457  IGEGEAAQCISGFTAMDVAPPRGPLWILGDVFMGRYHTVFDYGKSRVGFAEAA 509



 Score =  111 bits (278), Expect(2) = 0.0
 Identities = 59/103 (57%), Positives = 69/103 (66%), Gaps = 1/103 (0%)
 Frame = -1

Query: 1845 MGTNFGLCXXXXXXXXXXAPTVFSSSNDGLIRVELKKRNVDPVNRLARHVNSNEGSAR-N 1669
            MGT+              +PT  S SNDGLIRV LKKR VD +N+L+ H  S EG AR +
Sbjct: 1    MGTSIKASLLALFLLFLLSPTAISVSNDGLIRVGLKKRKVDQINQLSGHGASMEGKARKD 60

Query: 1668 YGLRGNGGDPDADIVGLKNYMDAQYFGEIGIGTPPQKFTVIFD 1540
            +G  G   D D+DI+ LKNYMDAQY+GEIGIG PPQKFTVIFD
Sbjct: 61   FGFGGTLRDSDSDIIALKNYMDAQYYGEIGIGAPPQKFTVIFD 103


>emb|CAA57510.1| cyprosin [Cynara cardunculus]
          Length = 509

 Score =  658 bits (1697), Expect(2) = 0.0
 Identities = 313/413 (75%), Positives = 360/413 (87%)
 Frame = -3

Query: 1486 KFTVIFDTGSSNLWVPSAKCYFSVACLFXXXXXXXXXXXXXKNGKSAAIQYGTGSISGFF 1307
            KFTVIFDTGSSNLWVPSAKCYFSVACLF             KNG SAAIQYGTGSISGF 
Sbjct: 97   KFTVIFDTGSSNLWVPSAKCYFSVACLFHSKYKSSHSSTYKKNGTSAAIQYGTGSISGFV 156

Query: 1306 SQDSVKVGDLVVKEQDFIEATKEPGITFLAAKFDGILGLGFQEISVGNAVPVWYSMVNQG 1127
            SQDSVK+GDLVVKEQDFIEATKEPGITFLAAKFDGILGLGFQEISVG +VP+WY+MVNQG
Sbjct: 157  SQDSVKLGDLVVKEQDFIEATKEPGITFLAAKFDGILGLGFQEISVGKSVPLWYNMVNQG 216

Query: 1126 LVKEPVFSFWLNRNAXXXXXXELVFGGVDTTHYKGEHTYVPVSHKGYWQFDMGDVLVGGE 947
            LV+EPVFSFW NRNA      ELVFGGVD  H+KG+HTYVPV+ KGYWQFDMGDVL+  +
Sbjct: 217  LVQEPVFSFWFNRNADEEEGGELVFGGVDPNHFKGKHTYVPVTEKGYWQFDMGDVLIEDK 276

Query: 946  STGLCSGGCSAIADSGTSLLAGPTTVITQINHAIGASGVMSQECKSVVSQYGKTILDLLL 767
            +TG CS GC+AIADSGTSLLAGPT +IT+INHAIGA GVMSQ+CK++VSQYGKT++++LL
Sbjct: 277  TTGFCSDGCAAIADSGTSLLAGPTAIITEINHAIGAKGVMSQQCKTLVSQYGKTMIEMLL 336

Query: 766  SETQPLKVCSQIGLCSSHGSHDHSMIIESVVDINSGKTSGGLHDEMCTFCEMTVAWMQSQ 587
            SE QP K+CSQ+ LC+  G+ D S IIESVVD N+GK+S G+HDEMCTFCEM V WMQ+Q
Sbjct: 337  SEAQPDKICSQMKLCTFDGARDASSIIESVVDENNGKSSSGVHDEMCTFCEMAVVWMQNQ 396

Query: 586  LLKNQTEDKIIDYVNELCNRLPSPMGESAVDCNTLSSLPSVSFTLGDKVFDLSAEQYVLK 407
            + +N+TED II+YVNELC+RLPSPMGESAVDCN+LSS+P+++FT+G KVF+L  EQY+LK
Sbjct: 397  IKRNETEDNIINYVNELCDRLPSPMGESAVDCNSLSSMPNIAFTIGGKVFELCPEQYILK 456

Query: 406  VGEGTAAQCISGFIALDVPPPRGPLWILGDVFMGKYHTVFDYGNMKVGFAEAA 248
            +GEG AAQCISGF A+DV PPRGPLWILGDVFMG+YHTVFDYG ++VGFAEAA
Sbjct: 457  IGEGEAAQCISGFTAMDVAPPRGPLWILGDVFMGRYHTVFDYGKLRVGFAEAA 509



 Score =  103 bits (258), Expect(2) = 0.0
 Identities = 53/84 (63%), Positives = 63/84 (75%), Gaps = 1/84 (1%)
 Frame = -1

Query: 1788 PTVFSSSNDGLIRVELKKRNVDPVNRLARHVNSNEGSAR-NYGLRGNGGDPDADIVGLKN 1612
            PT FS SN GL+RV LKKR VD +N+L+ H  S E  AR ++G  G   D  +DI+ LKN
Sbjct: 20   PTAFSVSNGGLLRVGLKKRKVDQINQLSGHGVSMEAKARKDFGFGGALRDSGSDIIALKN 79

Query: 1611 YMDAQYFGEIGIGTPPQKFTVIFD 1540
            YMDAQY+GEIGIG+PPQKFTVIFD
Sbjct: 80   YMDAQYYGEIGIGSPPQKFTVIFD 103


>gb|ACX55829.1| aspartic proteinase 1 [Castanea mollissima]
          Length = 513

 Score =  636 bits (1640), Expect(2) = 0.0
 Identities = 307/413 (74%), Positives = 350/413 (84%)
 Frame = -3

Query: 1486 KFTVIFDTGSSNLWVPSAKCYFSVACLFXXXXXXXXXXXXXKNGKSAAIQYGTGSISGFF 1307
            KFTVIFDTGSSNLWVPS+KCYFSVAC F             KNGK A I YGTG+ISG+F
Sbjct: 102  KFTVIFDTGSSNLWVPSSKCYFSVACYFHSKYKSSSSSTYKKNGKPADIHYGTGAISGYF 161

Query: 1306 SQDSVKVGDLVVKEQDFIEATKEPGITFLAAKFDGILGLGFQEISVGNAVPVWYSMVNQG 1127
            SQD VKVGDLVVK Q+FIEAT+EP ITFL AKFDGILGLGF+EISVGNAVPVWY+MV QG
Sbjct: 162  SQDHVKVGDLVVKNQEFIEATREPSITFLVAKFDGILGLGFKEISVGNAVPVWYNMVKQG 221

Query: 1126 LVKEPVFSFWLNRNAXXXXXXELVFGGVDTTHYKGEHTYVPVSHKGYWQFDMGDVLVGGE 947
            LVKEPVFSFW NRN       E+VFGGVD  HYKG+HTYVPV+ KGYWQFDMGDVL+ G+
Sbjct: 222  LVKEPVFSFWFNRNTDEEEGGEIVFGGVDPNHYKGKHTYVPVTQKGYWQFDMGDVLIDGQ 281

Query: 946  STGLCSGGCSAIADSGTSLLAGPTTVITQINHAIGASGVMSQECKSVVSQYGKTILDLLL 767
            +TG C+ GCSAIADSGTSLLAGPTT+IT++NHAIGA+GV+SQECK+VV++YG+TI+ +LL
Sbjct: 282  TTGFCARGCSAIADSGTSLLAGPTTIITEVNHAIGATGVVSQECKAVVAEYGETIIKMLL 341

Query: 766  SETQPLKVCSQIGLCSSHGSHDHSMIIESVVDINSGKTSGGLHDEMCTFCEMTVAWMQSQ 587
             + QP+K+CSQIGLC+  G    SM IESVVD N+ K S GL D MC+ CEMTV WMQ+Q
Sbjct: 342  EKDQPMKICSQIGLCTFDGVRGVSMDIESVVD-NTRKASNGLRDAMCSTCEMTVVWMQNQ 400

Query: 586  LLKNQTEDKIIDYVNELCNRLPSPMGESAVDCNTLSSLPSVSFTLGDKVFDLSAEQYVLK 407
            L +NQT+D+I+ YVNELC+RLPSPMGESAVDC +LSSLP+VS T+G +VFDLS EQYVLK
Sbjct: 401  LKQNQTQDRILTYVNELCDRLPSPMGESAVDCGSLSSLPNVSLTIGGRVFDLSPEQYVLK 460

Query: 406  VGEGTAAQCISGFIALDVPPPRGPLWILGDVFMGKYHTVFDYGNMKVGFAEAA 248
            VGEG AAQCISGF ALDVPPPRGPLWILGDVFMG+YHTVFDYGN +VGFAEAA
Sbjct: 461  VGEGEAAQCISGFTALDVPPPRGPLWILGDVFMGRYHTVFDYGNQRVGFAEAA 513



 Score =  113 bits (282), Expect(2) = 0.0
 Identities = 61/108 (56%), Positives = 69/108 (63%), Gaps = 6/108 (5%)
 Frame = -1

Query: 1845 MGTNFGLCXXXXXXXXXXAPTVFSSSNDGLIRVELKKRNVDPVNRLARHVNSNEG----- 1681
            MGT                P VFS+SN GL+R+ LKK  +D  NR+A  + S +G     
Sbjct: 1    MGTKLKTVVATFFLCFLLFPLVFSASNGGLVRIGLKKMKLDKNNRVAAQLESKDGEVRSA 60

Query: 1680 SARNYGLRGNGGDP-DADIVGLKNYMDAQYFGEIGIGTPPQKFTVIFD 1540
            S R Y LRGN GDP D DIV LKNYMDAQYFGEIG+GTPPQKFTVIFD
Sbjct: 61   SIRKYYLRGNSGDPEDIDIVSLKNYMDAQYFGEIGVGTPPQKFTVIFD 108

