BLASTX nr result

ID: Angelica23_contig00005970 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00005970
         (1370 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAA57510.1| cyprosin [Cynara cardunculus]                         573   e-161
emb|CAA70340.1| aspartic proteinase [Centaurea calcitrapa]            570   e-160
gb|ABG37021.1| aspartic protease [Nicotiana tabacum]                  569   e-160
emb|CAA48939.1| cyprosin [Cynara cardunculus]                         565   e-159
gb|AFB73927.2| preprocirsin [Cirsium vulgare]                         563   e-158

>emb|CAA57510.1| cyprosin [Cynara cardunculus]
          Length = 509

 Score =  573 bits (1477), Expect = e-161
 Identities = 272/382 (71%), Positives = 322/382 (84%)
 Frame = -3

Query: 1368 SGFFSQDNVKVGDLVVKEQDFIEAIKEPGLTFLAAKFDGILGLGFQEISXXXXXXXXXXX 1189
            SGF SQD+VK+GDLVVKEQDFIEA KEPG+TFLAAKFDGILGLGFQE             
Sbjct: 153  SGFVSQDSVKLGDLVVKEQDFIEATKEPGITFLAAKFDGILGLGFQE------------- 199

Query: 1188 XXXXXXXLGFQEISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXXXXELVFGGVDTN 1009
                        ISVG +VP+WY+MVNQGLV+EPVFSFW NRNA      ELVFGGVD N
Sbjct: 200  ------------ISVGKSVPLWYNMVNQGLVQEPVFSFWFNRNADEEEGGELVFGGVDPN 247

Query: 1008 HFKGEHTYVPVSQKGYWQFDMGNVLVGGESTGYCSEGCSAIADSGTSLLAGPTTVITQIN 829
            HFKG+HTYVPV++KGYWQFDMG+VL+  ++TG+CS+GC+AIADSGTSLLAGPT +IT+IN
Sbjct: 248  HFKGKHTYVPVTEKGYWQFDMGDVLIEDKTTGFCSDGCAAIADSGTSLLAGPTAIITEIN 307

Query: 828  HAIGASGVISQECKSVVSQYGKTILDLLLSEAQPRKICSQIGLCSSGSTHDRSMIIESVV 649
            HAIGA GV+SQ+CK++VSQYGKT++++LLSEAQP KICSQ+ LC+     D S IIESVV
Sbjct: 308  HAIGAKGVMSQQCKTLVSQYGKTMIEMLLSEAQPDKICSQMKLCTFDGARDASSIIESVV 367

Query: 648  DMNSGKSSSGLHDELCTFCEMTVVWMQNQLIKNQTEDKILDYINELCNRLPSPMGESAVD 469
            D N+GKSSSG+HDE+CTFCEM VVWMQNQ+ +N+TED I++Y+NELC+RLPSPMGESAVD
Sbjct: 368  DENNGKSSSGVHDEMCTFCEMAVVWMQNQIKRNETEDNIINYVNELCDRLPSPMGESAVD 427

Query: 468  CNSLSSMPSVTFTLGGKTFDLSADQYVLKVGEGTAAQCISGFTALDVPPPRGPLWILGDV 289
            CNSLSSMP++ FT+GGK F+L  +QY+LK+GEG AAQCISGFTA+DV PPRGPLWILGDV
Sbjct: 428  CNSLSSMPNIAFTIGGKVFELCPEQYILKIGEGEAAQCISGFTAMDVAPPRGPLWILGDV 487

Query: 288  FMGKYHTVFDYGNMKVGFAEAA 223
            FMG+YHTVFDYG ++VGFAEAA
Sbjct: 488  FMGRYHTVFDYGKLRVGFAEAA 509


>emb|CAA70340.1| aspartic proteinase [Centaurea calcitrapa]
          Length = 509

 Score =  570 bits (1469), Expect = e-160
 Identities = 272/382 (71%), Positives = 319/382 (83%)
 Frame = -3

Query: 1368 SGFFSQDNVKVGDLVVKEQDFIEAIKEPGLTFLAAKFDGILGLGFQEISXXXXXXXXXXX 1189
            SGF SQD+VK+GDLVVKEQDFIEA KEPG+TFLAAKFDGILGLGFQE             
Sbjct: 153  SGFVSQDSVKLGDLVVKEQDFIEATKEPGVTFLAAKFDGILGLGFQE------------- 199

Query: 1188 XXXXXXXLGFQEISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXXXXELVFGGVDTN 1009
                        ISVG +VPVWY+MVNQGLV+EPVFSFW NRNA      ELVFGGVD N
Sbjct: 200  ------------ISVGKSVPVWYNMVNQGLVQEPVFSFWFNRNADEEEGGELVFGGVDPN 247

Query: 1008 HFKGEHTYVPVSQKGYWQFDMGNVLVGGESTGYCSEGCSAIADSGTSLLAGPTTVITQIN 829
            HFKG+HTYVPV+QKGYWQF+MG+VL+  ++TG+C++GC+AIADSGTSLLAGPT +ITQIN
Sbjct: 248  HFKGKHTYVPVTQKGYWQFNMGDVLIEDKTTGFCADGCAAIADSGTSLLAGPTAIITQIN 307

Query: 828  HAIGASGVISQECKSVVSQYGKTILDLLLSEAQPRKICSQIGLCSSGSTHDRSMIIESVV 649
            HAIGA GV+SQ+CK++V QYGKTI+++LLSEAQP KICSQ+ LC+     D S IIESVV
Sbjct: 308  HAIGAKGVMSQQCKTLVDQYGKTIIEMLLSEAQPDKICSQMKLCTFDGARDVSSIIESVV 367

Query: 648  DMNSGKSSSGLHDELCTFCEMTVVWMQNQLIKNQTEDKILDYINELCNRLPSPMGESAVD 469
            D N+GKSS G+HDE+CTFCEM VVWMQNQ+ +NQTED I++Y+NELC+RLPSPMGESAVD
Sbjct: 368  DKNNGKSSGGVHDEMCTFCEMAVVWMQNQIKRNQTEDNIINYVNELCDRLPSPMGESAVD 427

Query: 468  CNSLSSMPSVTFTLGGKTFDLSADQYVLKVGEGTAAQCISGFTALDVPPPRGPLWILGDV 289
            CN LSSMP++ FT+GGK F+L  +QY+LK+GEG AAQCISGFTA+DV PPRGPLWILGDV
Sbjct: 428  CNDLSSMPNIAFTIGGKVFELCPEQYILKIGEGEAAQCISGFTAMDVAPPRGPLWILGDV 487

Query: 288  FMGKYHTVFDYGNMKVGFAEAA 223
            FMG+YHTVFDYG ++VGFAEAA
Sbjct: 488  FMGQYHTVFDYGKLRVGFAEAA 509


>gb|ABG37021.1| aspartic protease [Nicotiana tabacum]
          Length = 508

 Score =  569 bits (1466), Expect = e-160
 Identities = 277/382 (72%), Positives = 320/382 (83%)
 Frame = -3

Query: 1368 SGFFSQDNVKVGDLVVKEQDFIEAIKEPGLTFLAAKFDGILGLGFQEISXXXXXXXXXXX 1189
            SG+FS DNVKVGDL+VK+QDFIEA +EPG+TFLAAKFDGILGLGFQE             
Sbjct: 153  SGYFSNDNVKVGDLIVKDQDFIEATREPGITFLAAKFDGILGLGFQE------------- 199

Query: 1188 XXXXXXXLGFQEISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXXXXELVFGGVDTN 1009
                        ISVG +VPVWY+MVNQGLVK+PVFSFW NRNA      ELVFGGVD N
Sbjct: 200  ------------ISVGKSVPVWYNMVNQGLVKKPVFSFWFNRNAQEEEGGELVFGGVDPN 247

Query: 1008 HFKGEHTYVPVSQKGYWQFDMGNVLVGGESTGYCSEGCSAIADSGTSLLAGPTTVITQIN 829
            HFKG+HTYVPV+ KGYWQFDMG+VLVGGE+TG+CS GCSAIADSGTSLLAGPTT+ITQIN
Sbjct: 248  HFKGKHTYVPVTHKGYWQFDMGDVLVGGETTGFCSGGCSAIADSGTSLLAGPTTIITQIN 307

Query: 828  HAIGASGVISQECKSVVSQYGKTILDLLLSEAQPRKICSQIGLCSSGSTHDRSMIIESVV 649
            H IGASGV+SQECKS+V++YGKTILDLL S+A P+KICSQIGLCSS  + D SMIIESVV
Sbjct: 308  HVIGASGVVSQECKSLVTEYGKTILDLLESKAAPQKICSQIGLCSSDGSRDVSMIIESVV 367

Query: 648  DMNSGKSSSGLHDELCTFCEMTVVWMQNQLIKNQTEDKILDYINELCNRLPSPMGESAVD 469
            D ++G +S+GL DE+C  CEM V+WMQNQ+ +N+T D I DY+N+LC+RLPSPMGESAVD
Sbjct: 368  DKHNG-ASNGLGDEMCRVCEMAVIWMQNQMRRNETADSIYDYVNQLCDRLPSPMGESAVD 426

Query: 468  CNSLSSMPSVTFTLGGKTFDLSADQYVLKVGEGTAAQCISGFTALDVPPPRGPLWILGDV 289
            C+SL+SMP+V+FT+G +TF L+  QYVL+VGEG  AQCISGFTALDVPPPRGPLWILGDV
Sbjct: 427  CSSLASMPNVSFTVGNQTFGLTPQQYVLQVGEGPVAQCISGFTALDVPPPRGPLWILGDV 486

Query: 288  FMGKYHTVFDYGNMKVGFAEAA 223
            FMG+YHTVFDYGN +VGFAEAA
Sbjct: 487  FMGRYHTVFDYGNSRVGFAEAA 508


>emb|CAA48939.1| cyprosin [Cynara cardunculus]
          Length = 474

 Score =  565 bits (1457), Expect = e-159
 Identities = 271/382 (70%), Positives = 323/382 (84%)
 Frame = -3

Query: 1368 SGFFSQDNVKVGDLVVKEQDFIEAIKEPGLTFLAAKFDGILGLGFQEISXXXXXXXXXXX 1189
            SGFFSQD+VK+GDL+VKEQDFIEA KEPG+TFLAAKFDGILGLGFQE             
Sbjct: 119  SGFFSQDSVKLGDLLVKEQDFIEATKEPGITFLAAKFDGILGLGFQE------------- 165

Query: 1188 XXXXXXXLGFQEISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXXXXELVFGGVDTN 1009
                        ISVG+AVPVWY+M+NQGLV+EPVFSFWLNRNA      ELVFGGVD N
Sbjct: 166  ------------ISVGDAVPVWYTMLNQGLVQEPVFSFWLNRNADEQEGGELVFGGVDPN 213

Query: 1008 HFKGEHTYVPVSQKGYWQFDMGNVLVGGESTGYCSEGCSAIADSGTSLLAGPTTVITQIN 829
            HFKGEHTYVPV+QKGYWQF+MG+VL+G ++TG+C+ GC+AIADSGTSLLAG TT++TQIN
Sbjct: 214  HFKGEHTYVPVTQKGYWQFEMGDVLIGDKTTGFCASGCAAIADSGTSLLAGTTTIVTQIN 273

Query: 828  HAIGASGVISQECKSVVSQYGKTILDLLLSEAQPRKICSQIGLCSSGSTHDRSMIIESVV 649
             AIGA+GV+SQ+CKS+V QYGK+++++LLSE QP KICSQ+ LCS   +HD SMIIESVV
Sbjct: 274  QAIGAAGVMSQQCKSLVDQYGKSMIEMLLSEEQPEKICSQMKLCSFDGSHDTSMIIESVV 333

Query: 648  DMNSGKSSSGLHDELCTFCEMTVVWMQNQLIKNQTEDKILDYINELCNRLPSPMGESAVD 469
            D + GK SSGLHDE+CT C+M VVWMQNQ+ +N+TE+ I++Y+++LC RLPSPMGESAVD
Sbjct: 334  DKSKGK-SSGLHDEMCTMCQMAVVWMQNQIRQNETEENIINYVDKLCERLPSPMGESAVD 392

Query: 468  CNSLSSMPSVTFTLGGKTFDLSADQYVLKVGEGTAAQCISGFTALDVPPPRGPLWILGDV 289
            C+SLSSMP++ FT+GGKTF+LS +QYVLKVGEG  AQCISGFTA+DV PP GPLWILGDV
Sbjct: 393  CSSLSSMPNIAFTVGGKTFNLSPEQYVLKVGEGATAQCISGFTAMDVAPPHGPLWILGDV 452

Query: 288  FMGKYHTVFDYGNMKVGFAEAA 223
            FMG+YHTVFDYGN++VGFAEAA
Sbjct: 453  FMGQYHTVFDYGNLRVGFAEAA 474


>gb|AFB73927.2| preprocirsin [Cirsium vulgare]
          Length = 509

 Score =  563 bits (1451), Expect = e-158
 Identities = 269/382 (70%), Positives = 318/382 (83%)
 Frame = -3

Query: 1368 SGFFSQDNVKVGDLVVKEQDFIEAIKEPGLTFLAAKFDGILGLGFQEISXXXXXXXXXXX 1189
            SGF SQD+VK+GDLVVKEQDFIEA KEPG+TFLAAKFDGILGLGFQE             
Sbjct: 153  SGFVSQDSVKLGDLVVKEQDFIEATKEPGITFLAAKFDGILGLGFQE------------- 199

Query: 1188 XXXXXXXLGFQEISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXXXXELVFGGVDTN 1009
                        ISVG +VPVWY+MVNQGLV+EPVFSFW NRNA      ELVFGGVD N
Sbjct: 200  ------------ISVGKSVPVWYNMVNQGLVQEPVFSFWFNRNANEEEGGELVFGGVDPN 247

Query: 1008 HFKGEHTYVPVSQKGYWQFDMGNVLVGGESTGYCSEGCSAIADSGTSLLAGPTTVITQIN 829
            HFKG+HTYVPV++KGYWQF+MG+VL+  ++TG+CS+GC+AIADSGTSLLAGPT +IT+IN
Sbjct: 248  HFKGKHTYVPVTEKGYWQFNMGDVLIEDKTTGFCSDGCAAIADSGTSLLAGPTAIITEIN 307

Query: 828  HAIGASGVISQECKSVVSQYGKTILDLLLSEAQPRKICSQIGLCSSGSTHDRSMIIESVV 649
            HA GA GV+SQ+CK++VSQYGK+I+++LLSEAQP KICSQ+ LC+     D S IIESVV
Sbjct: 308  HASGAKGVMSQQCKTLVSQYGKSIIEMLLSEAQPDKICSQMKLCTFDGARDVSSIIESVV 367

Query: 648  DMNSGKSSSGLHDELCTFCEMTVVWMQNQLIKNQTEDKILDYINELCNRLPSPMGESAVD 469
            D N+GKSS G +DE+CTFCEM VVWMQNQ+ +N+TED I++Y+NELC+RLPSPMGESAVD
Sbjct: 368  DKNNGKSSGGANDEMCTFCEMAVVWMQNQIKRNETEDNIINYVNELCDRLPSPMGESAVD 427

Query: 468  CNSLSSMPSVTFTLGGKTFDLSADQYVLKVGEGTAAQCISGFTALDVPPPRGPLWILGDV 289
            CNSLSSMP++ FT+GGK F+L  +QY+LK+GEG AAQCISGFTA+DV PPRGPLWILGDV
Sbjct: 428  CNSLSSMPNIAFTIGGKVFELCPEQYILKIGEGEAAQCISGFTAMDVAPPRGPLWILGDV 487

Query: 288  FMGKYHTVFDYGNMKVGFAEAA 223
            FMG+YHTVFDYG  +VGFAEAA
Sbjct: 488  FMGRYHTVFDYGKSRVGFAEAA 509

