BLASTX nr result
ID: Cnidium21_contig00001111
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cnidium21_contig00001111 (1974 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|AFB73927.2| preprocirsin [Cirsium vulgare] 769 0.0 emb|CAA70340.1| aspartic proteinase [Centaurea calcitrapa] 768 0.0 emb|CAA57510.1| cyprosin [Cynara cardunculus] 767 0.0 gb|ABG37021.1| aspartic protease [Nicotiana tabacum] 749 0.0 gb|ACX55829.1| aspartic proteinase 1 [Castanea mollissima] 748 0.0 >gb|AFB73927.2| preprocirsin [Cirsium vulgare] Length = 509 Score = 769 bits (1985), Expect = 0.0 Identities = 374/532 (70%), Positives = 433/532 (81%) Frame = -1 Query: 1938 MGTNLRVSAVAFLMLLLLAPTVFSSSGDGLIRVGLKKRKVDHVNRLAGGLKSKEGSAVRK 1759 MGT+++ S +A +L LL+PT S S DGLIRVGLKKRKVD +N+L+G S EG A + Sbjct: 1 MGTSIKASLLALFLLFLLSPTAISVSNDGLIRVGLKKRKVDQINQLSGHGASMEGKARKD 60 Query: 1758 YGLRGNLGDPDADIVELRNYMDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCYFSV 1579 +G G L D D+DI+ L+NYMDAQYYGEIGIG PPQKFTVIFDTGSSNLWVPSAKCYFSV Sbjct: 61 FGFGGTLRDSDSDIIALKNYMDAQYYGEIGIGAPPQKFTVIFDTGSSNLWVPSAKCYFSV 120 Query: 1578 ACLFXXXXXXXXXXXXXKNGKSAAIHYGTGSISGFFSQDNVKVGDLVVKEQDFIEATKEP 1399 ACLF KNG SAAI YGTGSISGF SQD+VK+GDLVVKEQDFIEATKEP Sbjct: 121 ACLFHSKYKSSHSSTYKKNGTSAAIQYGTGSISGFVSQDSVKLGDLVVKEQDFIEATKEP 180 Query: 1398 GVTFLAAKFDGILGLGFQEISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXXXXELV 1219 G+TFLAAKFDGILGLGFQEISVG +VPVWY+MVNQGLV+EPVFSFW NRNA ELV Sbjct: 181 GITFLAAKFDGILGLGFQEISVGKSVPVWYNMVNQGLVQEPVFSFWFNRNANEEEGGELV 240 Query: 1218 FGGVDTNHFKGEHTYVPVSQKGYWQFDMGDVLVGGESTGFCSDGCSAIADSGTSLLAGPT 1039 FGGVD NHFKG+HTYVPV++KGYWQF+MGDVL+ ++TGFCSDGC+AIADSGTSLLAGPT Sbjct: 241 FGGVDPNHFKGKHTYVPVTEKGYWQFNMGDVLIEDKTTGFCSDGCAAIADSGTSLLAGPT 300 Query: 1038 TVITQINHAIGASGVISQECKSVVSQYGKSILDLLLSEAQPRKICSQIGLCSLGSTHDRS 859 +IT+INHA GA GV+SQ+CK++VSQYGKSI+++LLSEAQP KICSQ+ LC+ D S Sbjct: 301 AIITEINHASGAKGVMSQQCKTLVSQYGKSIIEMLLSEAQPDKICSQMKLCTFDGARDVS 360 Query: 858 MIIESVVDMNSGKSSSGLHDEMCTFCEMTVVWMQSQLSRNQTEDKILDYINELCNRLPSP 679 IIESVVD N+GKSS G +DEMCTFCEM VVWMQ+Q+ RN+TED I++Y+NELC+RLPSP Sbjct: 361 SIIESVVDKNNGKSSGGANDEMCTFCEMAVVWMQNQIKRNETEDNIINYVNELCDRLPSP 420 Query: 678 MGESAVDCNSLSSMPSVTFTLGGKIFDLSADQYVLKVGXXXXXXXXXXXXXXXXXYVLKV 499 MGESAVDCNSLSSMP++ FT+GGK+F+L +Q Y+LK+ Sbjct: 421 MGESAVDCNSLSSMPNIAFTIGGKVFELCPEQ-----------------------YILKI 457 Query: 498 GEGAAAQCISGFTALDVPPPRGPLWILGDVFMGKYHTVFDYGNMKVGFAEAA 343 GEG AAQCISGFTA+DV PPRGPLWILGDVFMG+YHTVFDYG +VGFAEAA Sbjct: 458 GEGEAAQCISGFTAMDVAPPRGPLWILGDVFMGRYHTVFDYGKSRVGFAEAA 509 >emb|CAA70340.1| aspartic proteinase [Centaurea calcitrapa] Length = 509 Score = 768 bits (1984), Expect = 0.0 Identities = 373/532 (70%), Positives = 434/532 (81%) Frame = -1 Query: 1938 MGTNLRVSAVAFLMLLLLAPTVFSSSGDGLIRVGLKKRKVDHVNRLAGGLKSKEGSAVRK 1759 MGT ++ S +A + +LL+PT FS+S GL+RVGLKKRKVD +N+L S EG A + Sbjct: 1 MGTAIKASLLALFLFVLLSPTAFSASNGGLLRVGLKKRKVDQINQLRNHGASMEGKARKD 60 Query: 1758 YGLRGNLGDPDADIVELRNYMDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCYFSV 1579 +G G+L D D+DI+EL+NYMDAQYYGEIGIG+P QKFTVIFDTGSSNLWVPSAKCYFSV Sbjct: 61 FGFGGSLRDSDSDIIELKNYMDAQYYGEIGIGSPAQKFTVIFDTGSSNLWVPSAKCYFSV 120 Query: 1578 ACLFXXXXXXXXXXXXXKNGKSAAIHYGTGSISGFFSQDNVKVGDLVVKEQDFIEATKEP 1399 ACLF KNG SAAI YGTGSISGF SQD+VK+GDLVVKEQDFIEATKEP Sbjct: 121 ACLFHSKYKSSHSSTYKKNGTSAAIQYGTGSISGFVSQDSVKLGDLVVKEQDFIEATKEP 180 Query: 1398 GVTFLAAKFDGILGLGFQEISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXXXXELV 1219 GVTFLAAKFDGILGLGFQEISVG +VPVWY+MVNQGLV+EPVFSFW NRNA ELV Sbjct: 181 GVTFLAAKFDGILGLGFQEISVGKSVPVWYNMVNQGLVQEPVFSFWFNRNADEEEGGELV 240 Query: 1218 FGGVDTNHFKGEHTYVPVSQKGYWQFDMGDVLVGGESTGFCSDGCSAIADSGTSLLAGPT 1039 FGGVD NHFKG+HTYVPV+QKGYWQF+MGDVL+ ++TGFC+DGC+AIADSGTSLLAGPT Sbjct: 241 FGGVDPNHFKGKHTYVPVTQKGYWQFNMGDVLIEDKTTGFCADGCAAIADSGTSLLAGPT 300 Query: 1038 TVITQINHAIGASGVISQECKSVVSQYGKSILDLLLSEAQPRKICSQIGLCSLGSTHDRS 859 +ITQINHAIGA GV+SQ+CK++V QYGK+I+++LLSEAQP KICSQ+ LC+ D S Sbjct: 301 AIITQINHAIGAKGVMSQQCKTLVDQYGKTIIEMLLSEAQPDKICSQMKLCTFDGARDVS 360 Query: 858 MIIESVVDMNSGKSSSGLHDEMCTFCEMTVVWMQSQLSRNQTEDKILDYINELCNRLPSP 679 IIESVVD N+GKSS G+HDEMCTFCEM VVWMQ+Q+ RNQTED I++Y+NELC+RLPSP Sbjct: 361 SIIESVVDKNNGKSSGGVHDEMCTFCEMAVVWMQNQIKRNQTEDNIINYVNELCDRLPSP 420 Query: 678 MGESAVDCNSLSSMPSVTFTLGGKIFDLSADQYVLKVGXXXXXXXXXXXXXXXXXYVLKV 499 MGESAVDCN LSSMP++ FT+GGK+F+L +Q Y+LK+ Sbjct: 421 MGESAVDCNDLSSMPNIAFTIGGKVFELCPEQ-----------------------YILKI 457 Query: 498 GEGAAAQCISGFTALDVPPPRGPLWILGDVFMGKYHTVFDYGNMKVGFAEAA 343 GEG AAQCISGFTA+DV PPRGPLWILGDVFMG+YHTVFDYG ++VGFAEAA Sbjct: 458 GEGEAAQCISGFTAMDVAPPRGPLWILGDVFMGQYHTVFDYGKLRVGFAEAA 509 >emb|CAA57510.1| cyprosin [Cynara cardunculus] Length = 509 Score = 767 bits (1980), Expect = 0.0 Identities = 371/532 (69%), Positives = 434/532 (81%) Frame = -1 Query: 1938 MGTNLRVSAVAFLMLLLLAPTVFSSSGDGLIRVGLKKRKVDHVNRLAGGLKSKEGSAVRK 1759 MGT ++ S +A + LL+PT FS S GL+RVGLKKRKVD +N+L+G S E A + Sbjct: 1 MGTAIKASVLALFLFFLLSPTAFSVSNGGLLRVGLKKRKVDQINQLSGHGVSMEAKARKD 60 Query: 1758 YGLRGNLGDPDADIVELRNYMDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCYFSV 1579 +G G L D +DI+ L+NYMDAQYYGEIGIG+PPQKFTVIFDTGSSNLWVPSAKCYFSV Sbjct: 61 FGFGGALRDSGSDIIALKNYMDAQYYGEIGIGSPPQKFTVIFDTGSSNLWVPSAKCYFSV 120 Query: 1578 ACLFXXXXXXXXXXXXXKNGKSAAIHYGTGSISGFFSQDNVKVGDLVVKEQDFIEATKEP 1399 ACLF KNG SAAI YGTGSISGF SQD+VK+GDLVVKEQDFIEATKEP Sbjct: 121 ACLFHSKYKSSHSSTYKKNGTSAAIQYGTGSISGFVSQDSVKLGDLVVKEQDFIEATKEP 180 Query: 1398 GVTFLAAKFDGILGLGFQEISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXXXXELV 1219 G+TFLAAKFDGILGLGFQEISVG +VP+WY+MVNQGLV+EPVFSFW NRNA ELV Sbjct: 181 GITFLAAKFDGILGLGFQEISVGKSVPLWYNMVNQGLVQEPVFSFWFNRNADEEEGGELV 240 Query: 1218 FGGVDTNHFKGEHTYVPVSQKGYWQFDMGDVLVGGESTGFCSDGCSAIADSGTSLLAGPT 1039 FGGVD NHFKG+HTYVPV++KGYWQFDMGDVL+ ++TGFCSDGC+AIADSGTSLLAGPT Sbjct: 241 FGGVDPNHFKGKHTYVPVTEKGYWQFDMGDVLIEDKTTGFCSDGCAAIADSGTSLLAGPT 300 Query: 1038 TVITQINHAIGASGVISQECKSVVSQYGKSILDLLLSEAQPRKICSQIGLCSLGSTHDRS 859 +IT+INHAIGA GV+SQ+CK++VSQYGK+++++LLSEAQP KICSQ+ LC+ D S Sbjct: 301 AIITEINHAIGAKGVMSQQCKTLVSQYGKTMIEMLLSEAQPDKICSQMKLCTFDGARDAS 360 Query: 858 MIIESVVDMNSGKSSSGLHDEMCTFCEMTVVWMQSQLSRNQTEDKILDYINELCNRLPSP 679 IIESVVD N+GKSSSG+HDEMCTFCEM VVWMQ+Q+ RN+TED I++Y+NELC+RLPSP Sbjct: 361 SIIESVVDENNGKSSSGVHDEMCTFCEMAVVWMQNQIKRNETEDNIINYVNELCDRLPSP 420 Query: 678 MGESAVDCNSLSSMPSVTFTLGGKIFDLSADQYVLKVGXXXXXXXXXXXXXXXXXYVLKV 499 MGESAVDCNSLSSMP++ FT+GGK+F+L +Q Y+LK+ Sbjct: 421 MGESAVDCNSLSSMPNIAFTIGGKVFELCPEQ-----------------------YILKI 457 Query: 498 GEGAAAQCISGFTALDVPPPRGPLWILGDVFMGKYHTVFDYGNMKVGFAEAA 343 GEG AAQCISGFTA+DV PPRGPLWILGDVFMG+YHTVFDYG ++VGFAEAA Sbjct: 458 GEGEAAQCISGFTAMDVAPPRGPLWILGDVFMGRYHTVFDYGKLRVGFAEAA 509 >gb|ABG37021.1| aspartic protease [Nicotiana tabacum] Length = 508 Score = 749 bits (1935), Expect = 0.0 Identities = 369/532 (69%), Positives = 427/532 (80%) Frame = -1 Query: 1938 MGTNLRVSAVAFLMLLLLAPTVFSSSGDGLIRVGLKKRKVDHVNRLAGGLKSKEGSAVRK 1759 MGT A +LLLL+P VFS S DGLIRVG+KKRK+D +N+ GG+ S ++ R Sbjct: 1 MGTRYGACLSALCLLLLLSPMVFSVSNDGLIRVGIKKRKLDQINQAFGGIDSNGANSART 60 Query: 1758 YGLRGNLGDPDADIVELRNYMDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCYFSV 1579 Y L GN+GD D DI+ L+NY+DAQY+GEI IG+PPQKFTVIFDTGSSNLWVPSA+CYFS+ Sbjct: 61 YHLGGNIGDSDTDIIALKNYLDAQYFGEICIGSPPQKFTVIFDTGSSNLWVPSARCYFSL 120 Query: 1578 ACLFXXXXXXXXXXXXXKNGKSAAIHYGTGSISGFFSQDNVKVGDLVVKEQDFIEATKEP 1399 AC KNG SAAI YGTGSISG+FS DNVKVGDL+VK+QDFIEAT+EP Sbjct: 121 ACYLHPKYKSSHSSTYKKNGTSAAIRYGTGSISGYFSNDNVKVGDLIVKDQDFIEATREP 180 Query: 1398 GVTFLAAKFDGILGLGFQEISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXXXXELV 1219 G+TFLAAKFDGILGLGFQEISVG +VPVWY+MVNQGLVK+PVFSFW NRNA ELV Sbjct: 181 GITFLAAKFDGILGLGFQEISVGKSVPVWYNMVNQGLVKKPVFSFWFNRNAQEEEGGELV 240 Query: 1218 FGGVDTNHFKGEHTYVPVSQKGYWQFDMGDVLVGGESTGFCSDGCSAIADSGTSLLAGPT 1039 FGGVD NHFKG+HTYVPV+ KGYWQFDMGDVLVGGE+TGFCS GCSAIADSGTSLLAGPT Sbjct: 241 FGGVDPNHFKGKHTYVPVTHKGYWQFDMGDVLVGGETTGFCSGGCSAIADSGTSLLAGPT 300 Query: 1038 TVITQINHAIGASGVISQECKSVVSQYGKSILDLLLSEAQPRKICSQIGLCSLGSTHDRS 859 T+ITQINH IGASGV+SQECKS+V++YGK+ILDLL S+A P+KICSQIGLCS + D S Sbjct: 301 TIITQINHVIGASGVVSQECKSLVTEYGKTILDLLESKAAPQKICSQIGLCSSDGSRDVS 360 Query: 858 MIIESVVDMNSGKSSSGLHDEMCTFCEMTVVWMQSQLSRNQTEDKILDYINELCNRLPSP 679 MIIESVVD ++G +S+GL DEMC CEM V+WMQ+Q+ RN+T D I DY+N+LC+RLPSP Sbjct: 361 MIIESVVDKHNG-ASNGLGDEMCRVCEMAVIWMQNQMRRNETADSIYDYVNQLCDRLPSP 419 Query: 678 MGESAVDCNSLSSMPSVTFTLGGKIFDLSADQYVLKVGXXXXXXXXXXXXXXXXXYVLKV 499 MGESAVDC+SL+SMP+V+FT+G + F L+ Q YVL+V Sbjct: 420 MGESAVDCSSLASMPNVSFTVGNQTFGLTPQQ-----------------------YVLQV 456 Query: 498 GEGAAAQCISGFTALDVPPPRGPLWILGDVFMGKYHTVFDYGNMKVGFAEAA 343 GEG AQCISGFTALDVPPPRGPLWILGDVFMG+YHTVFDYGN +VGFAEAA Sbjct: 457 GEGPVAQCISGFTALDVPPPRGPLWILGDVFMGRYHTVFDYGNSRVGFAEAA 508 >gb|ACX55829.1| aspartic proteinase 1 [Castanea mollissima] Length = 513 Score = 748 bits (1930), Expect = 0.0 Identities = 367/537 (68%), Positives = 432/537 (80%), Gaps = 5/537 (0%) Frame = -1 Query: 1938 MGTNLRVSAVAFLMLLLLAPTVFSSSGDGLIRVGLKKRKVDHVNRLAGGLKSKEG----S 1771 MGT L+ F + LL P VFS+S GL+R+GLKK K+D NR+A L+SK+G + Sbjct: 1 MGTKLKTVVATFFLCFLLFPLVFSASNGGLVRIGLKKMKLDKNNRVAAQLESKDGEVRSA 60 Query: 1770 AVRKYGLRGNLGDP-DADIVELRNYMDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAK 1594 ++RKY LRGN GDP D DIV L+NYMDAQY+GEIG+GTPPQKFTVIFDTGSSNLWVPS+K Sbjct: 61 SIRKYYLRGNSGDPEDIDIVSLKNYMDAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSK 120 Query: 1593 CYFSVACLFXXXXXXXXXXXXXKNGKSAAIHYGTGSISGFFSQDNVKVGDLVVKEQDFIE 1414 CYFSVAC F KNGK A IHYGTG+ISG+FSQD+VKVGDLVVK Q+FIE Sbjct: 121 CYFSVACYFHSKYKSSSSSTYKKNGKPADIHYGTGAISGYFSQDHVKVGDLVVKNQEFIE 180 Query: 1413 ATKEPGVTFLAAKFDGILGLGFQEISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXX 1234 AT+EP +TFL AKFDGILGLGF+EISVGNAVPVWY+MV QGLVKEPVFSFW NRN Sbjct: 181 ATREPSITFLVAKFDGILGLGFKEISVGNAVPVWYNMVKQGLVKEPVFSFWFNRNTDEEE 240 Query: 1233 XXELVFGGVDTNHFKGEHTYVPVSQKGYWQFDMGDVLVGGESTGFCSDGCSAIADSGTSL 1054 E+VFGGVD NH+KG+HTYVPV+QKGYWQFDMGDVL+ G++TGFC+ GCSAIADSGTSL Sbjct: 241 GGEIVFGGVDPNHYKGKHTYVPVTQKGYWQFDMGDVLIDGQTTGFCARGCSAIADSGTSL 300 Query: 1053 LAGPTTVITQINHAIGASGVISQECKSVVSQYGKSILDLLLSEAQPRKICSQIGLCSLGS 874 LAGPTT+IT++NHAIGA+GV+SQECK+VV++YG++I+ +LL + QP KICSQIGLC+ Sbjct: 301 LAGPTTIITEVNHAIGATGVVSQECKAVVAEYGETIIKMLLEKDQPMKICSQIGLCTFDG 360 Query: 873 THDRSMIIESVVDMNSGKSSSGLHDEMCTFCEMTVVWMQSQLSRNQTEDKILDYINELCN 694 SM IESVVD N+ K+S+GL D MC+ CEMTVVWMQ+QL +NQT+D+IL Y+NELC+ Sbjct: 361 VRGVSMDIESVVD-NTRKASNGLRDAMCSTCEMTVVWMQNQLKQNQTQDRILTYVNELCD 419 Query: 693 RLPSPMGESAVDCNSLSSMPSVTFTLGGKIFDLSADQYVLKVGXXXXXXXXXXXXXXXXX 514 RLPSPMGESAVDC SLSS+P+V+ T+GG++FDLS +Q Sbjct: 420 RLPSPMGESAVDCGSLSSLPNVSLTIGGRVFDLSPEQ----------------------- 456 Query: 513 YVLKVGEGAAAQCISGFTALDVPPPRGPLWILGDVFMGKYHTVFDYGNMKVGFAEAA 343 YVLKVGEG AAQCISGFTALDVPPPRGPLWILGDVFMG+YHTVFDYGN +VGFAEAA Sbjct: 457 YVLKVGEGEAAQCISGFTALDVPPPRGPLWILGDVFMGRYHTVFDYGNQRVGFAEAA 513