BLASTX nr result

ID: Cnidium21_contig00001111 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cnidium21_contig00001111
         (1974 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AFB73927.2| preprocirsin [Cirsium vulgare]                         769   0.0  
emb|CAA70340.1| aspartic proteinase [Centaurea calcitrapa]            768   0.0  
emb|CAA57510.1| cyprosin [Cynara cardunculus]                         767   0.0  
gb|ABG37021.1| aspartic protease [Nicotiana tabacum]                  749   0.0  
gb|ACX55829.1| aspartic proteinase 1 [Castanea mollissima]            748   0.0  

>gb|AFB73927.2| preprocirsin [Cirsium vulgare]
          Length = 509

 Score =  769 bits (1985), Expect = 0.0
 Identities = 374/532 (70%), Positives = 433/532 (81%)
 Frame = -1

Query: 1938 MGTNLRVSAVAFLMLLLLAPTVFSSSGDGLIRVGLKKRKVDHVNRLAGGLKSKEGSAVRK 1759
            MGT+++ S +A  +L LL+PT  S S DGLIRVGLKKRKVD +N+L+G   S EG A + 
Sbjct: 1    MGTSIKASLLALFLLFLLSPTAISVSNDGLIRVGLKKRKVDQINQLSGHGASMEGKARKD 60

Query: 1758 YGLRGNLGDPDADIVELRNYMDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCYFSV 1579
            +G  G L D D+DI+ L+NYMDAQYYGEIGIG PPQKFTVIFDTGSSNLWVPSAKCYFSV
Sbjct: 61   FGFGGTLRDSDSDIIALKNYMDAQYYGEIGIGAPPQKFTVIFDTGSSNLWVPSAKCYFSV 120

Query: 1578 ACLFXXXXXXXXXXXXXKNGKSAAIHYGTGSISGFFSQDNVKVGDLVVKEQDFIEATKEP 1399
            ACLF             KNG SAAI YGTGSISGF SQD+VK+GDLVVKEQDFIEATKEP
Sbjct: 121  ACLFHSKYKSSHSSTYKKNGTSAAIQYGTGSISGFVSQDSVKLGDLVVKEQDFIEATKEP 180

Query: 1398 GVTFLAAKFDGILGLGFQEISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXXXXELV 1219
            G+TFLAAKFDGILGLGFQEISVG +VPVWY+MVNQGLV+EPVFSFW NRNA      ELV
Sbjct: 181  GITFLAAKFDGILGLGFQEISVGKSVPVWYNMVNQGLVQEPVFSFWFNRNANEEEGGELV 240

Query: 1218 FGGVDTNHFKGEHTYVPVSQKGYWQFDMGDVLVGGESTGFCSDGCSAIADSGTSLLAGPT 1039
            FGGVD NHFKG+HTYVPV++KGYWQF+MGDVL+  ++TGFCSDGC+AIADSGTSLLAGPT
Sbjct: 241  FGGVDPNHFKGKHTYVPVTEKGYWQFNMGDVLIEDKTTGFCSDGCAAIADSGTSLLAGPT 300

Query: 1038 TVITQINHAIGASGVISQECKSVVSQYGKSILDLLLSEAQPRKICSQIGLCSLGSTHDRS 859
             +IT+INHA GA GV+SQ+CK++VSQYGKSI+++LLSEAQP KICSQ+ LC+     D S
Sbjct: 301  AIITEINHASGAKGVMSQQCKTLVSQYGKSIIEMLLSEAQPDKICSQMKLCTFDGARDVS 360

Query: 858  MIIESVVDMNSGKSSSGLHDEMCTFCEMTVVWMQSQLSRNQTEDKILDYINELCNRLPSP 679
             IIESVVD N+GKSS G +DEMCTFCEM VVWMQ+Q+ RN+TED I++Y+NELC+RLPSP
Sbjct: 361  SIIESVVDKNNGKSSGGANDEMCTFCEMAVVWMQNQIKRNETEDNIINYVNELCDRLPSP 420

Query: 678  MGESAVDCNSLSSMPSVTFTLGGKIFDLSADQYVLKVGXXXXXXXXXXXXXXXXXYVLKV 499
            MGESAVDCNSLSSMP++ FT+GGK+F+L  +Q                       Y+LK+
Sbjct: 421  MGESAVDCNSLSSMPNIAFTIGGKVFELCPEQ-----------------------YILKI 457

Query: 498  GEGAAAQCISGFTALDVPPPRGPLWILGDVFMGKYHTVFDYGNMKVGFAEAA 343
            GEG AAQCISGFTA+DV PPRGPLWILGDVFMG+YHTVFDYG  +VGFAEAA
Sbjct: 458  GEGEAAQCISGFTAMDVAPPRGPLWILGDVFMGRYHTVFDYGKSRVGFAEAA 509


>emb|CAA70340.1| aspartic proteinase [Centaurea calcitrapa]
          Length = 509

 Score =  768 bits (1984), Expect = 0.0
 Identities = 373/532 (70%), Positives = 434/532 (81%)
 Frame = -1

Query: 1938 MGTNLRVSAVAFLMLLLLAPTVFSSSGDGLIRVGLKKRKVDHVNRLAGGLKSKEGSAVRK 1759
            MGT ++ S +A  + +LL+PT FS+S  GL+RVGLKKRKVD +N+L     S EG A + 
Sbjct: 1    MGTAIKASLLALFLFVLLSPTAFSASNGGLLRVGLKKRKVDQINQLRNHGASMEGKARKD 60

Query: 1758 YGLRGNLGDPDADIVELRNYMDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCYFSV 1579
            +G  G+L D D+DI+EL+NYMDAQYYGEIGIG+P QKFTVIFDTGSSNLWVPSAKCYFSV
Sbjct: 61   FGFGGSLRDSDSDIIELKNYMDAQYYGEIGIGSPAQKFTVIFDTGSSNLWVPSAKCYFSV 120

Query: 1578 ACLFXXXXXXXXXXXXXKNGKSAAIHYGTGSISGFFSQDNVKVGDLVVKEQDFIEATKEP 1399
            ACLF             KNG SAAI YGTGSISGF SQD+VK+GDLVVKEQDFIEATKEP
Sbjct: 121  ACLFHSKYKSSHSSTYKKNGTSAAIQYGTGSISGFVSQDSVKLGDLVVKEQDFIEATKEP 180

Query: 1398 GVTFLAAKFDGILGLGFQEISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXXXXELV 1219
            GVTFLAAKFDGILGLGFQEISVG +VPVWY+MVNQGLV+EPVFSFW NRNA      ELV
Sbjct: 181  GVTFLAAKFDGILGLGFQEISVGKSVPVWYNMVNQGLVQEPVFSFWFNRNADEEEGGELV 240

Query: 1218 FGGVDTNHFKGEHTYVPVSQKGYWQFDMGDVLVGGESTGFCSDGCSAIADSGTSLLAGPT 1039
            FGGVD NHFKG+HTYVPV+QKGYWQF+MGDVL+  ++TGFC+DGC+AIADSGTSLLAGPT
Sbjct: 241  FGGVDPNHFKGKHTYVPVTQKGYWQFNMGDVLIEDKTTGFCADGCAAIADSGTSLLAGPT 300

Query: 1038 TVITQINHAIGASGVISQECKSVVSQYGKSILDLLLSEAQPRKICSQIGLCSLGSTHDRS 859
             +ITQINHAIGA GV+SQ+CK++V QYGK+I+++LLSEAQP KICSQ+ LC+     D S
Sbjct: 301  AIITQINHAIGAKGVMSQQCKTLVDQYGKTIIEMLLSEAQPDKICSQMKLCTFDGARDVS 360

Query: 858  MIIESVVDMNSGKSSSGLHDEMCTFCEMTVVWMQSQLSRNQTEDKILDYINELCNRLPSP 679
             IIESVVD N+GKSS G+HDEMCTFCEM VVWMQ+Q+ RNQTED I++Y+NELC+RLPSP
Sbjct: 361  SIIESVVDKNNGKSSGGVHDEMCTFCEMAVVWMQNQIKRNQTEDNIINYVNELCDRLPSP 420

Query: 678  MGESAVDCNSLSSMPSVTFTLGGKIFDLSADQYVLKVGXXXXXXXXXXXXXXXXXYVLKV 499
            MGESAVDCN LSSMP++ FT+GGK+F+L  +Q                       Y+LK+
Sbjct: 421  MGESAVDCNDLSSMPNIAFTIGGKVFELCPEQ-----------------------YILKI 457

Query: 498  GEGAAAQCISGFTALDVPPPRGPLWILGDVFMGKYHTVFDYGNMKVGFAEAA 343
            GEG AAQCISGFTA+DV PPRGPLWILGDVFMG+YHTVFDYG ++VGFAEAA
Sbjct: 458  GEGEAAQCISGFTAMDVAPPRGPLWILGDVFMGQYHTVFDYGKLRVGFAEAA 509


>emb|CAA57510.1| cyprosin [Cynara cardunculus]
          Length = 509

 Score =  767 bits (1980), Expect = 0.0
 Identities = 371/532 (69%), Positives = 434/532 (81%)
 Frame = -1

Query: 1938 MGTNLRVSAVAFLMLLLLAPTVFSSSGDGLIRVGLKKRKVDHVNRLAGGLKSKEGSAVRK 1759
            MGT ++ S +A  +  LL+PT FS S  GL+RVGLKKRKVD +N+L+G   S E  A + 
Sbjct: 1    MGTAIKASVLALFLFFLLSPTAFSVSNGGLLRVGLKKRKVDQINQLSGHGVSMEAKARKD 60

Query: 1758 YGLRGNLGDPDADIVELRNYMDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCYFSV 1579
            +G  G L D  +DI+ L+NYMDAQYYGEIGIG+PPQKFTVIFDTGSSNLWVPSAKCYFSV
Sbjct: 61   FGFGGALRDSGSDIIALKNYMDAQYYGEIGIGSPPQKFTVIFDTGSSNLWVPSAKCYFSV 120

Query: 1578 ACLFXXXXXXXXXXXXXKNGKSAAIHYGTGSISGFFSQDNVKVGDLVVKEQDFIEATKEP 1399
            ACLF             KNG SAAI YGTGSISGF SQD+VK+GDLVVKEQDFIEATKEP
Sbjct: 121  ACLFHSKYKSSHSSTYKKNGTSAAIQYGTGSISGFVSQDSVKLGDLVVKEQDFIEATKEP 180

Query: 1398 GVTFLAAKFDGILGLGFQEISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXXXXELV 1219
            G+TFLAAKFDGILGLGFQEISVG +VP+WY+MVNQGLV+EPVFSFW NRNA      ELV
Sbjct: 181  GITFLAAKFDGILGLGFQEISVGKSVPLWYNMVNQGLVQEPVFSFWFNRNADEEEGGELV 240

Query: 1218 FGGVDTNHFKGEHTYVPVSQKGYWQFDMGDVLVGGESTGFCSDGCSAIADSGTSLLAGPT 1039
            FGGVD NHFKG+HTYVPV++KGYWQFDMGDVL+  ++TGFCSDGC+AIADSGTSLLAGPT
Sbjct: 241  FGGVDPNHFKGKHTYVPVTEKGYWQFDMGDVLIEDKTTGFCSDGCAAIADSGTSLLAGPT 300

Query: 1038 TVITQINHAIGASGVISQECKSVVSQYGKSILDLLLSEAQPRKICSQIGLCSLGSTHDRS 859
             +IT+INHAIGA GV+SQ+CK++VSQYGK+++++LLSEAQP KICSQ+ LC+     D S
Sbjct: 301  AIITEINHAIGAKGVMSQQCKTLVSQYGKTMIEMLLSEAQPDKICSQMKLCTFDGARDAS 360

Query: 858  MIIESVVDMNSGKSSSGLHDEMCTFCEMTVVWMQSQLSRNQTEDKILDYINELCNRLPSP 679
             IIESVVD N+GKSSSG+HDEMCTFCEM VVWMQ+Q+ RN+TED I++Y+NELC+RLPSP
Sbjct: 361  SIIESVVDENNGKSSSGVHDEMCTFCEMAVVWMQNQIKRNETEDNIINYVNELCDRLPSP 420

Query: 678  MGESAVDCNSLSSMPSVTFTLGGKIFDLSADQYVLKVGXXXXXXXXXXXXXXXXXYVLKV 499
            MGESAVDCNSLSSMP++ FT+GGK+F+L  +Q                       Y+LK+
Sbjct: 421  MGESAVDCNSLSSMPNIAFTIGGKVFELCPEQ-----------------------YILKI 457

Query: 498  GEGAAAQCISGFTALDVPPPRGPLWILGDVFMGKYHTVFDYGNMKVGFAEAA 343
            GEG AAQCISGFTA+DV PPRGPLWILGDVFMG+YHTVFDYG ++VGFAEAA
Sbjct: 458  GEGEAAQCISGFTAMDVAPPRGPLWILGDVFMGRYHTVFDYGKLRVGFAEAA 509


>gb|ABG37021.1| aspartic protease [Nicotiana tabacum]
          Length = 508

 Score =  749 bits (1935), Expect = 0.0
 Identities = 369/532 (69%), Positives = 427/532 (80%)
 Frame = -1

Query: 1938 MGTNLRVSAVAFLMLLLLAPTVFSSSGDGLIRVGLKKRKVDHVNRLAGGLKSKEGSAVRK 1759
            MGT       A  +LLLL+P VFS S DGLIRVG+KKRK+D +N+  GG+ S   ++ R 
Sbjct: 1    MGTRYGACLSALCLLLLLSPMVFSVSNDGLIRVGIKKRKLDQINQAFGGIDSNGANSART 60

Query: 1758 YGLRGNLGDPDADIVELRNYMDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCYFSV 1579
            Y L GN+GD D DI+ L+NY+DAQY+GEI IG+PPQKFTVIFDTGSSNLWVPSA+CYFS+
Sbjct: 61   YHLGGNIGDSDTDIIALKNYLDAQYFGEICIGSPPQKFTVIFDTGSSNLWVPSARCYFSL 120

Query: 1578 ACLFXXXXXXXXXXXXXKNGKSAAIHYGTGSISGFFSQDNVKVGDLVVKEQDFIEATKEP 1399
            AC               KNG SAAI YGTGSISG+FS DNVKVGDL+VK+QDFIEAT+EP
Sbjct: 121  ACYLHPKYKSSHSSTYKKNGTSAAIRYGTGSISGYFSNDNVKVGDLIVKDQDFIEATREP 180

Query: 1398 GVTFLAAKFDGILGLGFQEISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXXXXELV 1219
            G+TFLAAKFDGILGLGFQEISVG +VPVWY+MVNQGLVK+PVFSFW NRNA      ELV
Sbjct: 181  GITFLAAKFDGILGLGFQEISVGKSVPVWYNMVNQGLVKKPVFSFWFNRNAQEEEGGELV 240

Query: 1218 FGGVDTNHFKGEHTYVPVSQKGYWQFDMGDVLVGGESTGFCSDGCSAIADSGTSLLAGPT 1039
            FGGVD NHFKG+HTYVPV+ KGYWQFDMGDVLVGGE+TGFCS GCSAIADSGTSLLAGPT
Sbjct: 241  FGGVDPNHFKGKHTYVPVTHKGYWQFDMGDVLVGGETTGFCSGGCSAIADSGTSLLAGPT 300

Query: 1038 TVITQINHAIGASGVISQECKSVVSQYGKSILDLLLSEAQPRKICSQIGLCSLGSTHDRS 859
            T+ITQINH IGASGV+SQECKS+V++YGK+ILDLL S+A P+KICSQIGLCS   + D S
Sbjct: 301  TIITQINHVIGASGVVSQECKSLVTEYGKTILDLLESKAAPQKICSQIGLCSSDGSRDVS 360

Query: 858  MIIESVVDMNSGKSSSGLHDEMCTFCEMTVVWMQSQLSRNQTEDKILDYINELCNRLPSP 679
            MIIESVVD ++G +S+GL DEMC  CEM V+WMQ+Q+ RN+T D I DY+N+LC+RLPSP
Sbjct: 361  MIIESVVDKHNG-ASNGLGDEMCRVCEMAVIWMQNQMRRNETADSIYDYVNQLCDRLPSP 419

Query: 678  MGESAVDCNSLSSMPSVTFTLGGKIFDLSADQYVLKVGXXXXXXXXXXXXXXXXXYVLKV 499
            MGESAVDC+SL+SMP+V+FT+G + F L+  Q                       YVL+V
Sbjct: 420  MGESAVDCSSLASMPNVSFTVGNQTFGLTPQQ-----------------------YVLQV 456

Query: 498  GEGAAAQCISGFTALDVPPPRGPLWILGDVFMGKYHTVFDYGNMKVGFAEAA 343
            GEG  AQCISGFTALDVPPPRGPLWILGDVFMG+YHTVFDYGN +VGFAEAA
Sbjct: 457  GEGPVAQCISGFTALDVPPPRGPLWILGDVFMGRYHTVFDYGNSRVGFAEAA 508


>gb|ACX55829.1| aspartic proteinase 1 [Castanea mollissima]
          Length = 513

 Score =  748 bits (1930), Expect = 0.0
 Identities = 367/537 (68%), Positives = 432/537 (80%), Gaps = 5/537 (0%)
 Frame = -1

Query: 1938 MGTNLRVSAVAFLMLLLLAPTVFSSSGDGLIRVGLKKRKVDHVNRLAGGLKSKEG----S 1771
            MGT L+     F +  LL P VFS+S  GL+R+GLKK K+D  NR+A  L+SK+G    +
Sbjct: 1    MGTKLKTVVATFFLCFLLFPLVFSASNGGLVRIGLKKMKLDKNNRVAAQLESKDGEVRSA 60

Query: 1770 AVRKYGLRGNLGDP-DADIVELRNYMDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAK 1594
            ++RKY LRGN GDP D DIV L+NYMDAQY+GEIG+GTPPQKFTVIFDTGSSNLWVPS+K
Sbjct: 61   SIRKYYLRGNSGDPEDIDIVSLKNYMDAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSK 120

Query: 1593 CYFSVACLFXXXXXXXXXXXXXKNGKSAAIHYGTGSISGFFSQDNVKVGDLVVKEQDFIE 1414
            CYFSVAC F             KNGK A IHYGTG+ISG+FSQD+VKVGDLVVK Q+FIE
Sbjct: 121  CYFSVACYFHSKYKSSSSSTYKKNGKPADIHYGTGAISGYFSQDHVKVGDLVVKNQEFIE 180

Query: 1413 ATKEPGVTFLAAKFDGILGLGFQEISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXX 1234
            AT+EP +TFL AKFDGILGLGF+EISVGNAVPVWY+MV QGLVKEPVFSFW NRN     
Sbjct: 181  ATREPSITFLVAKFDGILGLGFKEISVGNAVPVWYNMVKQGLVKEPVFSFWFNRNTDEEE 240

Query: 1233 XXELVFGGVDTNHFKGEHTYVPVSQKGYWQFDMGDVLVGGESTGFCSDGCSAIADSGTSL 1054
              E+VFGGVD NH+KG+HTYVPV+QKGYWQFDMGDVL+ G++TGFC+ GCSAIADSGTSL
Sbjct: 241  GGEIVFGGVDPNHYKGKHTYVPVTQKGYWQFDMGDVLIDGQTTGFCARGCSAIADSGTSL 300

Query: 1053 LAGPTTVITQINHAIGASGVISQECKSVVSQYGKSILDLLLSEAQPRKICSQIGLCSLGS 874
            LAGPTT+IT++NHAIGA+GV+SQECK+VV++YG++I+ +LL + QP KICSQIGLC+   
Sbjct: 301  LAGPTTIITEVNHAIGATGVVSQECKAVVAEYGETIIKMLLEKDQPMKICSQIGLCTFDG 360

Query: 873  THDRSMIIESVVDMNSGKSSSGLHDEMCTFCEMTVVWMQSQLSRNQTEDKILDYINELCN 694
                SM IESVVD N+ K+S+GL D MC+ CEMTVVWMQ+QL +NQT+D+IL Y+NELC+
Sbjct: 361  VRGVSMDIESVVD-NTRKASNGLRDAMCSTCEMTVVWMQNQLKQNQTQDRILTYVNELCD 419

Query: 693  RLPSPMGESAVDCNSLSSMPSVTFTLGGKIFDLSADQYVLKVGXXXXXXXXXXXXXXXXX 514
            RLPSPMGESAVDC SLSS+P+V+ T+GG++FDLS +Q                       
Sbjct: 420  RLPSPMGESAVDCGSLSSLPNVSLTIGGRVFDLSPEQ----------------------- 456

Query: 513  YVLKVGEGAAAQCISGFTALDVPPPRGPLWILGDVFMGKYHTVFDYGNMKVGFAEAA 343
            YVLKVGEG AAQCISGFTALDVPPPRGPLWILGDVFMG+YHTVFDYGN +VGFAEAA
Sbjct: 457  YVLKVGEGEAAQCISGFTALDVPPPRGPLWILGDVFMGRYHTVFDYGNQRVGFAEAA 513

