BLASTX nr result

ID: Angelica22_contig00000786

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00000786
         (1509 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAA57510.1| cyprosin [Cynara cardunculus]                         632   e-178
emb|CAA70340.1| aspartic proteinase [Centaurea calcitrapa]            629   e-178
emb|CAA48939.1| cyprosin [Cynara cardunculus]                         625   e-177
gb|AFB73927.2| preprocirsin [Cirsium vulgare]                         622   e-175
dbj|BAB20969.1| aspartic proteinase 1 [Nepenthes alata]               618   e-174

>emb|CAA57510.1| cyprosin [Cynara cardunculus]
          Length = 509

 Score =  632 bits (1629), Expect = e-178
 Identities = 301/431 (69%), Positives = 353/431 (81%)
 Frame = +3

Query: 3    SSNLWVPSAKCYFSVACLFXXXXXXXXXXXXXXNGKSASIHYGTGSVSGFFSQDNVKVGD 182
            SSNLWVPSAKCYFSVACLF              NG SA+I YGTGS+SGF SQD+VK+GD
Sbjct: 106  SSNLWVPSAKCYFSVACLFHSKYKSSHSSTYKKNGTSAAIQYGTGSISGFVSQDSVKLGD 165

Query: 183  LVVKEQDFIEAIKEPGLTFLAAKFDGILGLGFQEISXXXXXXXXXXXXXXXXXXXXLGFQ 362
            LVVKEQDFIEA KEPG+TFLAAKFDGILGLGFQE                          
Sbjct: 166  LVVKEQDFIEATKEPGITFLAAKFDGILGLGFQE-------------------------- 199

Query: 363  EISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXXXXXLVFGGVDTNHFKGEHTYVPV 542
             ISVG +VP+WY+MVNQGLV+EPVFSFW NRNA       LVFGGVD NHFKG+HTYVPV
Sbjct: 200  -ISVGKSVPLWYNMVNQGLVQEPVFSFWFNRNADEEEGGELVFGGVDPNHFKGKHTYVPV 258

Query: 543  SQKGYWQFDMGNVLVGDESTGYCSEGCSAIADSGTSLLAGPTTVITQINHAIGASGVISQ 722
            ++KGYWQFDMG+VL+ D++TG+CS+GC+AIADSGTSLLAGPT +IT+INHAIGA GV+SQ
Sbjct: 259  TEKGYWQFDMGDVLIEDKTTGFCSDGCAAIADSGTSLLAGPTAIITEINHAIGAKGVMSQ 318

Query: 723  ECKSVVSQYGKTILDLLLSEAQPRKICSQIGLCSSGSTHDRSMIIESVVDMNSGKSSSGL 902
            +CK++VSQYGKT++++LLSEAQP KICSQ+ LC+     D S IIESVVD N+GKSSSG+
Sbjct: 319  QCKTLVSQYGKTMIEMLLSEAQPDKICSQMKLCTFDGARDASSIIESVVDENNGKSSSGV 378

Query: 903  HDELCTFCEMTVVWMQNQLIKNQTEDKILDYINELCNRLPSPMGESAVDCNSLSSMPSVT 1082
            HDE+CTFCEM VVWMQNQ+ +N+TED I++Y+NELC+RLPSPMGESAVDCNSLSSMP++ 
Sbjct: 379  HDEMCTFCEMAVVWMQNQIKRNETEDNIINYVNELCDRLPSPMGESAVDCNSLSSMPNIA 438

Query: 1083 FTLGGKTFDLSADQYVLKVGEGTAAQCISGFTALDVPPPRGPLWILGDVFMGKYHTVFDY 1262
            FT+GGK F+L  +QY+LK+GEG AAQCISGFTA+DV PPRGPLWILGDVFMG+YHTVFDY
Sbjct: 439  FTIGGKVFELCPEQYILKIGEGEAAQCISGFTAMDVAPPRGPLWILGDVFMGRYHTVFDY 498

Query: 1263 GNMKVGFAEAA 1295
            G ++VGFAEAA
Sbjct: 499  GKLRVGFAEAA 509


>emb|CAA70340.1| aspartic proteinase [Centaurea calcitrapa]
          Length = 509

 Score =  629 bits (1621), Expect = e-178
 Identities = 301/431 (69%), Positives = 350/431 (81%)
 Frame = +3

Query: 3    SSNLWVPSAKCYFSVACLFXXXXXXXXXXXXXXNGKSASIHYGTGSVSGFFSQDNVKVGD 182
            SSNLWVPSAKCYFSVACLF              NG SA+I YGTGS+SGF SQD+VK+GD
Sbjct: 106  SSNLWVPSAKCYFSVACLFHSKYKSSHSSTYKKNGTSAAIQYGTGSISGFVSQDSVKLGD 165

Query: 183  LVVKEQDFIEAIKEPGLTFLAAKFDGILGLGFQEISXXXXXXXXXXXXXXXXXXXXLGFQ 362
            LVVKEQDFIEA KEPG+TFLAAKFDGILGLGFQE                          
Sbjct: 166  LVVKEQDFIEATKEPGVTFLAAKFDGILGLGFQE-------------------------- 199

Query: 363  EISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXXXXXLVFGGVDTNHFKGEHTYVPV 542
             ISVG +VPVWY+MVNQGLV+EPVFSFW NRNA       LVFGGVD NHFKG+HTYVPV
Sbjct: 200  -ISVGKSVPVWYNMVNQGLVQEPVFSFWFNRNADEEEGGELVFGGVDPNHFKGKHTYVPV 258

Query: 543  SQKGYWQFDMGNVLVGDESTGYCSEGCSAIADSGTSLLAGPTTVITQINHAIGASGVISQ 722
            +QKGYWQF+MG+VL+ D++TG+C++GC+AIADSGTSLLAGPT +ITQINHAIGA GV+SQ
Sbjct: 259  TQKGYWQFNMGDVLIEDKTTGFCADGCAAIADSGTSLLAGPTAIITQINHAIGAKGVMSQ 318

Query: 723  ECKSVVSQYGKTILDLLLSEAQPRKICSQIGLCSSGSTHDRSMIIESVVDMNSGKSSSGL 902
            +CK++V QYGKTI+++LLSEAQP KICSQ+ LC+     D S IIESVVD N+GKSS G+
Sbjct: 319  QCKTLVDQYGKTIIEMLLSEAQPDKICSQMKLCTFDGARDVSSIIESVVDKNNGKSSGGV 378

Query: 903  HDELCTFCEMTVVWMQNQLIKNQTEDKILDYINELCNRLPSPMGESAVDCNSLSSMPSVT 1082
            HDE+CTFCEM VVWMQNQ+ +NQTED I++Y+NELC+RLPSPMGESAVDCN LSSMP++ 
Sbjct: 379  HDEMCTFCEMAVVWMQNQIKRNQTEDNIINYVNELCDRLPSPMGESAVDCNDLSSMPNIA 438

Query: 1083 FTLGGKTFDLSADQYVLKVGEGTAAQCISGFTALDVPPPRGPLWILGDVFMGKYHTVFDY 1262
            FT+GGK F+L  +QY+LK+GEG AAQCISGFTA+DV PPRGPLWILGDVFMG+YHTVFDY
Sbjct: 439  FTIGGKVFELCPEQYILKIGEGEAAQCISGFTAMDVAPPRGPLWILGDVFMGQYHTVFDY 498

Query: 1263 GNMKVGFAEAA 1295
            G ++VGFAEAA
Sbjct: 499  GKLRVGFAEAA 509


>emb|CAA48939.1| cyprosin [Cynara cardunculus]
          Length = 474

 Score =  625 bits (1612), Expect = e-177
 Identities = 300/431 (69%), Positives = 355/431 (82%)
 Frame = +3

Query: 3    SSNLWVPSAKCYFSVACLFXXXXXXXXXXXXXXNGKSASIHYGTGSVSGFFSQDNVKVGD 182
            SSNLWVPS+KCYFSVACLF              NGKSA+I YGTGS+SGFFSQD+VK+GD
Sbjct: 72   SSNLWVPSSKCYFSVACLFHSKYRSTDSTTYKKNGKSAAIQYGTGSISGFFSQDSVKLGD 131

Query: 183  LVVKEQDFIEAIKEPGLTFLAAKFDGILGLGFQEISXXXXXXXXXXXXXXXXXXXXLGFQ 362
            L+VKEQDFIEA KEPG+TFLAAKFDGILGLGFQE                          
Sbjct: 132  LLVKEQDFIEATKEPGITFLAAKFDGILGLGFQE-------------------------- 165

Query: 363  EISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXXXXXLVFGGVDTNHFKGEHTYVPV 542
             ISVG+AVPVWY+M+NQGLV+EPVFSFWLNRNA       LVFGGVD NHFKGEHTYVPV
Sbjct: 166  -ISVGDAVPVWYTMLNQGLVQEPVFSFWLNRNADEQEGGELVFGGVDPNHFKGEHTYVPV 224

Query: 543  SQKGYWQFDMGNVLVGDESTGYCSEGCSAIADSGTSLLAGPTTVITQINHAIGASGVISQ 722
            +QKGYWQF+MG+VL+GD++TG+C+ GC+AIADSGTSLLAG TT++TQIN AIGA+GV+SQ
Sbjct: 225  TQKGYWQFEMGDVLIGDKTTGFCASGCAAIADSGTSLLAGTTTIVTQINQAIGAAGVMSQ 284

Query: 723  ECKSVVSQYGKTILDLLLSEAQPRKICSQIGLCSSGSTHDRSMIIESVVDMNSGKSSSGL 902
            +CKS+V QYGK+++++LLSE QP KICSQ+ LCS   +HD SMIIESVVD + GK SSGL
Sbjct: 285  QCKSLVDQYGKSMIEMLLSEEQPEKICSQMKLCSFDGSHDTSMIIESVVDKSKGK-SSGL 343

Query: 903  HDELCTFCEMTVVWMQNQLIKNQTEDKILDYINELCNRLPSPMGESAVDCNSLSSMPSVT 1082
            HDE+CT C+M VVWMQNQ+ +N+TE+ I++Y+++LC RLPSPMGESAVDC+SLSSMP++ 
Sbjct: 344  HDEMCTMCQMAVVWMQNQIRQNETEENIINYVDKLCERLPSPMGESAVDCSSLSSMPNIA 403

Query: 1083 FTLGGKTFDLSADQYVLKVGEGTAAQCISGFTALDVPPPRGPLWILGDVFMGKYHTVFDY 1262
            FT+GGKTF+LS +QYVLKVGEG  AQCISGFTA+DV PP GPLWILGDVFMG+YHTVFDY
Sbjct: 404  FTVGGKTFNLSPEQYVLKVGEGATAQCISGFTAMDVAPPHGPLWILGDVFMGQYHTVFDY 463

Query: 1263 GNMKVGFAEAA 1295
            GN++VGFAEAA
Sbjct: 464  GNLRVGFAEAA 474


>gb|AFB73927.2| preprocirsin [Cirsium vulgare]
          Length = 509

 Score =  622 bits (1603), Expect = e-175
 Identities = 298/431 (69%), Positives = 349/431 (80%)
 Frame = +3

Query: 3    SSNLWVPSAKCYFSVACLFXXXXXXXXXXXXXXNGKSASIHYGTGSVSGFFSQDNVKVGD 182
            SSNLWVPSAKCYFSVACLF              NG SA+I YGTGS+SGF SQD+VK+GD
Sbjct: 106  SSNLWVPSAKCYFSVACLFHSKYKSSHSSTYKKNGTSAAIQYGTGSISGFVSQDSVKLGD 165

Query: 183  LVVKEQDFIEAIKEPGLTFLAAKFDGILGLGFQEISXXXXXXXXXXXXXXXXXXXXLGFQ 362
            LVVKEQDFIEA KEPG+TFLAAKFDGILGLGFQE                          
Sbjct: 166  LVVKEQDFIEATKEPGITFLAAKFDGILGLGFQE-------------------------- 199

Query: 363  EISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXXXXXLVFGGVDTNHFKGEHTYVPV 542
             ISVG +VPVWY+MVNQGLV+EPVFSFW NRNA       LVFGGVD NHFKG+HTYVPV
Sbjct: 200  -ISVGKSVPVWYNMVNQGLVQEPVFSFWFNRNANEEEGGELVFGGVDPNHFKGKHTYVPV 258

Query: 543  SQKGYWQFDMGNVLVGDESTGYCSEGCSAIADSGTSLLAGPTTVITQINHAIGASGVISQ 722
            ++KGYWQF+MG+VL+ D++TG+CS+GC+AIADSGTSLLAGPT +IT+INHA GA GV+SQ
Sbjct: 259  TEKGYWQFNMGDVLIEDKTTGFCSDGCAAIADSGTSLLAGPTAIITEINHASGAKGVMSQ 318

Query: 723  ECKSVVSQYGKTILDLLLSEAQPRKICSQIGLCSSGSTHDRSMIIESVVDMNSGKSSSGL 902
            +CK++VSQYGK+I+++LLSEAQP KICSQ+ LC+     D S IIESVVD N+GKSS G 
Sbjct: 319  QCKTLVSQYGKSIIEMLLSEAQPDKICSQMKLCTFDGARDVSSIIESVVDKNNGKSSGGA 378

Query: 903  HDELCTFCEMTVVWMQNQLIKNQTEDKILDYINELCNRLPSPMGESAVDCNSLSSMPSVT 1082
            +DE+CTFCEM VVWMQNQ+ +N+TED I++Y+NELC+RLPSPMGESAVDCNSLSSMP++ 
Sbjct: 379  NDEMCTFCEMAVVWMQNQIKRNETEDNIINYVNELCDRLPSPMGESAVDCNSLSSMPNIA 438

Query: 1083 FTLGGKTFDLSADQYVLKVGEGTAAQCISGFTALDVPPPRGPLWILGDVFMGKYHTVFDY 1262
            FT+GGK F+L  +QY+LK+GEG AAQCISGFTA+DV PPRGPLWILGDVFMG+YHTVFDY
Sbjct: 439  FTIGGKVFELCPEQYILKIGEGEAAQCISGFTAMDVAPPRGPLWILGDVFMGRYHTVFDY 498

Query: 1263 GNMKVGFAEAA 1295
            G  +VGFAEAA
Sbjct: 499  GKSRVGFAEAA 509


>dbj|BAB20969.1| aspartic proteinase 1 [Nepenthes alata]
          Length = 514

 Score =  618 bits (1593), Expect = e-174
 Identities = 301/430 (70%), Positives = 342/430 (79%)
 Frame = +3

Query: 3    SSNLWVPSAKCYFSVACLFXXXXXXXXXXXXXXNGKSASIHYGTGSVSGFFSQDNVKVGD 182
            SSNLWVPSAKCYFS+AC F              NGKSA IHYGTG++SGFFSQD+VK+GD
Sbjct: 111  SSNLWVPSAKCYFSIACYFHSKYKSSLSSSYTKNGKSAEIHYGTGAISGFFSQDHVKLGD 170

Query: 183  LVVKEQDFIEAIKEPGLTFLAAKFDGILGLGFQEISXXXXXXXXXXXXXXXXXXXXLGFQ 362
            LVV+ QDFIEA +EP +TF+AAKFDGILGLGFQE                          
Sbjct: 171  LVVENQDFIEATREPSITFVAAKFDGILGLGFQE-------------------------- 204

Query: 363  EISVGNAVPVWYSMVNQGLVKEPVFSFWLNRNAXXXXXXXLVFGGVDTNHFKGEHTYVPV 542
             ISVGNAVPVWY+MV QGLV EPVFSFWLNRNA       +VFGGVD NH+KGEHT+VPV
Sbjct: 205  -ISVGNAVPVWYNMVKQGLVNEPVFSFWLNRNATEEEGGEIVFGGVDPNHYKGEHTFVPV 263

Query: 543  SQKGYWQFDMGNVLVGDESTGYCSEGCSAIADSGTSLLAGPTTVITQINHAIGASGVISQ 722
            + KGYWQFDM +VLVG E+TGYCS GCSAIADSGTSLLAGPTT++ QINHAIGASGV+SQ
Sbjct: 264  THKGYWQFDMDDVLVGGETTGYCSGGCSAIADSGTSLLAGPTTIVAQINHAIGASGVVSQ 323

Query: 723  ECKSVVSQYGKTILDLLLSEAQPRKICSQIGLCSSGSTHDRSMIIESVVDMNSGKSSSGL 902
            ECK+VV+QYG  ILD+L+SE QP+KICSQIGLC+       S+ I+SVVDMN   SSSGL
Sbjct: 324  ECKAVVAQYGTAILDMLISETQPKKICSQIGLCTFDGKRGVSVGIKSVVDMNVDGSSSGL 383

Query: 903  HDELCTFCEMTVVWMQNQLIKNQTEDKILDYINELCNRLPSPMGESAVDCNSLSSMPSVT 1082
             D  CT CEMTVVWMQNQL +NQTE++IL+Y+NELCNRLPSPMGESAVDC+SLSSMP V+
Sbjct: 384  QDATCTACEMTVVWMQNQLKQNQTEERILNYVNELCNRLPSPMGESAVDCSSLSSMPGVS 443

Query: 1083 FTLGGKTFDLSADQYVLKVGEGTAAQCISGFTALDVPPPRGPLWILGDVFMGKYHTVFDY 1262
            FT+GGK FDL  +QY+L+VGEG A QCISGFTALDV PP GPLWILGD+FMG+YHTVFDY
Sbjct: 444  FTVGGKVFDLLPEQYILQVGEGVATQCISGFTALDVAPPLGPLWILGDIFMGQYHTVFDY 503

Query: 1263 GNMKVGFAEA 1292
            GNM+VGFAEA
Sbjct: 504  GNMRVGFAEA 513

