BLASTX nr result

ID: Cimicifuga21_contig00007657 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cimicifuga21_contig00007657
         (1789 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1...   599   e-169
ref|NP_189198.1| aspartyl protease family protein [Arabidopsis t...   583   e-164
ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2...   581   e-163
ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arab...   578   e-162
ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit,...   576   e-162

>ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  599 bits (1545), Expect = e-169
 Identities = 294/426 (69%), Positives = 341/426 (80%)
 Frame = +3

Query: 216  YLKLPLLHRNPFIPPSKSLPSDSTRISLLFSTLNSQKTLKSPLVSGASGGSGQYFVDFSI 395
            YLKL LLH  PF  PS++L  DS R+S  FS L++ ++LKSP+VSGAS GSGQYFVD  +
Sbjct: 36   YLKLRLLHIKPFTTPSQALSFDSHRLSFFFSALHTPQSLKSPVVSGASTGSGQYFVDLRL 95

Query: 396  GTPPQKLLLVADTGSDLVWVKCSACRNCTNHVPGSAFFPRHSSTFSPFHCYDSACRLVPH 575
            GTPPQKLLLVADTGSDLVWVKCSACRNCT H PGSAF  RHS+TFSP HCYDSAC+LVP 
Sbjct: 96   GTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARHSTTFSPNHCYDSACQLVP- 154

Query: 576  SPETGRVCNRTRLHSTCRYEYSYADESKTSGFFSRETTRLNTSSGQEAKLKNLAFGCGFR 755
             P+  R CN  RLHS CRYEYSY D SKTSGFFS+ETT LNTSSG+EAKLK +AFGC FR
Sbjct: 155  LPKHHR-CNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKGIAFGCAFR 213

Query: 756  QSGPSVSGASFNGASGVMGLGRGPISFSTQLGKRFGNKFSYCLMDYTLSPPPTSFLMIGE 935
             SGPSVSGASFNGA GVMGLGRGPIS S+QLG RFGNKFSYCLMD+ +SP PTS+L+IG 
Sbjct: 214  ISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYCLMDHDISPSPTSYLLIGS 273

Query: 936  AHGPIKPKSKPKMSFTPLQANRLSPTFYYVQIKSVMINGVKLRIDPSVWVFDGEGNGGTV 1115
                + P  K +M FTPL  N LSPTFYY+ I+SV ++G+KL I+PSVW  D  GNGGT+
Sbjct: 274  TQNDVAP-GKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPINPSVWALDELGNGGTI 332

Query: 1116 IDSGTTLTFFAEPAYRHILMAVKRRAKLPIIDDPTKSFDFCVNVSGSVNPSLPRLSFELI 1295
            +DSGTTLTF  EPAY  IL  +KRR +LP   +PT  FD CVNVS   +P LP+LSF+L 
Sbjct: 333  VDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPTPGFDLCVNVSEIEHPRLPKLSFKLG 392

Query: 1296 GNSVFSPPSTNYFIDTADDVKCLALQPVNSPSGVSVIGNLMQQGFLFEFDREKSRLGFTQ 1475
            G+SVFSPP  NYF+DT +DVKCLALQ V +PSG SVIGNLMQQGFL EFD++++RLGF++
Sbjct: 393  GDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSGFSVIGNLMQQGFLLEFDKDRTRLGFSR 452

Query: 1476 RGCSLP 1493
             GC+LP
Sbjct: 453  HGCALP 458


>ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
            gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA
            binding protein-like; nucellin-like protein [Arabidopsis
            thaliana] gi|189339286|gb|ACD89063.1| At3g25700
            [Arabidopsis thaliana] gi|332643533|gb|AEE77054.1|
            aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score =  583 bits (1504), Expect = e-164
 Identities = 283/428 (66%), Positives = 332/428 (77%), Gaps = 2/428 (0%)
 Frame = +3

Query: 216  YLKLPLLHRNPFIPPSKSLPSDSTRISLLFSTLNSQKTLKSPLVSGASGGSGQYFVDFSI 395
            YLKLPLL ++PF  P+++L  D+ R+  L         +KSP+VSGA+ GSGQYFVD  I
Sbjct: 31   YLKLPLLRKSPFPSPTQALALDTRRLHFLSLRRKPIPFVKSPVVSGAASGSGQYFVDLRI 90

Query: 396  GTPPQKLLLVADTGSDLVWVKCSACRNCTNHVPGSAFFPRHSSTFSPFHCYDSACRLVPH 575
            G PPQ LLL+ADTGSDLVWVKCSACRNC++H P + FFPRHSSTFSP HCYD  CRLVP 
Sbjct: 91   GQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVP- 149

Query: 576  SPETGRVCNRTRLHSTCRYEYSYADESKTSGFFSRETTRLNTSSGQEAKLKNLAFGCGFR 755
             P+   +CN TR+HSTC YEY YAD S TSG F+RETT L TSSG+EA+LK++AFGCGFR
Sbjct: 150  KPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFR 209

Query: 756  QSGPSVSGASFNGASGVMGLGRGPISFSTQLGKRFGNKFSYCLMDYTLSPPPTSFLMIGE 935
             SG SVSG SFNGA+GVMGLGRGPISF++QLG+RFGNKFSYCLMDYTLSPPPTS+L+IG 
Sbjct: 210  ISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGN 269

Query: 936  AHGPIKPKSKPKMSFTPLQANRLSPTFYYVQIKSVMINGVKLRIDPSVWVFDGEGNGGTV 1115
                I      K+ FTPL  N LSPTFYYV++KSV +NG KLRIDPS+W  D  GNGGTV
Sbjct: 270  GGDGIS-----KLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTV 324

Query: 1116 IDSGTTLTFFAEPAYRHILMAVKRRAKLPIIDDPTKSFDFCVNVSGSVNPS--LPRLSFE 1289
            +DSGTTL F AEPAYR ++ AV+RR KLPI D  T  FD CVNVSG   P   LPRL FE
Sbjct: 325  VDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPGFDLCVNVSGVTKPEKILPRLKFE 384

Query: 1290 LIGNSVFSPPSTNYFIDTADDVKCLALQPVNSPSGVSVIGNLMQQGFLFEFDREKSRLGF 1469
              G +VF PP  NYFI+T + ++CLA+Q V+   G SVIGNLMQQGFLFEFDR++SRLGF
Sbjct: 385  FSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGF 444

Query: 1470 TQRGCSLP 1493
            ++RGC+LP
Sbjct: 445  SRRGCALP 452


>ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
            gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic
            proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score =  581 bits (1498), Expect = e-163
 Identities = 287/427 (67%), Positives = 336/427 (78%), Gaps = 1/427 (0%)
 Frame = +3

Query: 216  YLKLPLLHRNPFIPPSKSLPSDSTRISLLFSTLNSQKTLKSPLVSGASGGSGQYFVDFSI 395
            +LKLPLLH+ PF  PS+SL SD+ R+SLLFS  N   TLKSPL+SGAS GSGQYFVD  +
Sbjct: 37   FLKLPLLHKPPFSSPSQSLSSDTHRLSLLFSRPNP--TLKSPLISGASTGSGQYFVDIRL 94

Query: 396  GTPPQKLLLVADTGSDLVWVKCSACRNCTNHVPGSAFFPRHSSTFSPFHCYDSACRLVPH 575
            GTPPQ LLLVADTGSDLVWVKCSACRNC++H P SAF PRHSS+FSPFHC+D  CRL+PH
Sbjct: 95   GTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPH 154

Query: 576  SPETGRVCNRTRLHSTCRYEYSYADESKTSGFFSRETTRLNTSSGQEAKLKNLAFGCGFR 755
            +P    +CN TRLHS CR+ YSYAD S +SGFFS+ETT L + SG E  LK L+FGCGFR
Sbjct: 155  APH--HLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSGSEIHLKGLSFGCGFR 212

Query: 756  QSGPSVSGASFNGASGVMGLGRGPISFSTQLGKRFGNKFSYCLMDYTLSPPPTSFLMIGE 935
             SGPSVSGA FNGA GVMGLGRG ISFS+QLG+RFGNKFSYCLMDYTLSPPPTSFLMIG 
Sbjct: 213  ISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGG 272

Query: 936  AHGPIKPKSKPKMSFTPLQANRLSPTFYYVQIKSVMINGVKLRIDPSVWVFDGEGNGGTV 1115
                +   +  K+S+TPLQ N LSPTFYY+ I S+ I+GVKL I+P+VW  D +GNGGTV
Sbjct: 273  GLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPINPAVWEIDEQGNGGTV 332

Query: 1116 IDSGTTLTFFAEPAYRHILMAVKRRAKLPIIDDPTKSFDFCVNVSG-SVNPSLPRLSFEL 1292
            +DSGTTLT+  + AY  +L +V+RR KLP   + T  FD CVN SG S  PSLPRL F L
Sbjct: 333  VDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRFRL 392

Query: 1293 IGNSVFSPPSTNYFIDTADDVKCLALQPVNSPSGVSVIGNLMQQGFLFEFDREKSRLGFT 1472
             G +VF+PP  NYF++T + V CLA++ V S +G SVIGNLMQQGFL EFD+E+SRLGFT
Sbjct: 393  GGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFT 452

Query: 1473 QRGCSLP 1493
            +RGC LP
Sbjct: 453  RRGCGLP 459


>ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
            lyrata] gi|297321109|gb|EFH51530.1| hypothetical protein
            ARALYDRAFT_484331 [Arabidopsis lyrata subsp. lyrata]
          Length = 451

 Score =  578 bits (1489), Expect = e-162
 Identities = 282/428 (65%), Positives = 331/428 (77%), Gaps = 2/428 (0%)
 Frame = +3

Query: 216  YLKLPLLHRNPFIPPSKSLPSDSTRISLLFSTLNSQKTLKSPLVSGASGGSGQYFVDFSI 395
            YLKLPLL ++PF  P+++L  D+ R+  L         +KSP+VSGAS GSGQYFVD  I
Sbjct: 30   YLKLPLLRKSPFPSPTQALALDTRRLHFLSLRRKPVPFVKSPVVSGASSGSGQYFVDLRI 89

Query: 396  GTPPQKLLLVADTGSDLVWVKCSACRNCTNHVPGSAFFPRHSSTFSPFHCYDSACRLVPH 575
            G PPQ LLL+ADTGSDLVWVKCSACRNC++H P + FFPRHSSTFSP HCYD  CRLVP 
Sbjct: 90   GQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPK 149

Query: 576  SPETGRVCNRTRLHSTCRYEYSYADESKTSGFFSRETTRLNTSSGQEAKLKNLAFGCGFR 755
                 R CN TR+HSTC YEY YAD S TSG F+RETT L TSSG+EAKLK++AFGCGFR
Sbjct: 150  PGRAPR-CNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLKSVAFGCGFR 208

Query: 756  QSGPSVSGASFNGASGVMGLGRGPISFSTQLGKRFGNKFSYCLMDYTLSPPPTSFLMIGE 935
             SG SVSG SFNGA+GVMGLGRGPISF++QLG+RFGNKFSYCLMDYTLSPPPTS+L+IG+
Sbjct: 209  ISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGD 268

Query: 936  AHGPIKPKSKPKMSFTPLQANRLSPTFYYVQIKSVMINGVKLRIDPSVWVFDGEGNGGTV 1115
                +      K+ FTPL  N LSPTFYYV++KSV +NG KLRIDPS+W  D  GNGGTV
Sbjct: 269  GGDAVS-----KLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTV 323

Query: 1116 IDSGTTLTFFAEPAYRHILMAVKRRAKLPIIDDPTKSFDFCVNVSGSVNPS--LPRLSFE 1289
            +DSGTTL F A+PAYR ++ AVK+R KLP  D+ T  FD CVNVSG   P   LPRL FE
Sbjct: 324  MDSGTTLAFLADPAYRLVIAAVKQRIKLPNADELTPGFDLCVNVSGVTKPEKILPRLKFE 383

Query: 1290 LIGNSVFSPPSTNYFIDTADDVKCLALQPVNSPSGVSVIGNLMQQGFLFEFDREKSRLGF 1469
              G +VF PP  NYFI+T + ++CLA+Q V+   G SVIGNLMQQGFLFEFDR++SRLGF
Sbjct: 384  FSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGF 443

Query: 1470 TQRGCSLP 1493
            ++RGC+LP
Sbjct: 444  SRRGCALP 451


>ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
            communis] gi|223536362|gb|EEF38012.1| basic 7S globulin 2
            precursor small subunit, putative [Ricinus communis]
          Length = 455

 Score =  576 bits (1485), Expect = e-162
 Identities = 285/443 (64%), Positives = 332/443 (74%), Gaps = 5/443 (1%)
 Frame = +3

Query: 180  PSTSQXXXXXXYYLKLPLLHRNPFIPPSKSLPSD-STRISLLF----STLNSQKTLKSPL 344
            PS+S        YLKLPLLH+ PF  PS++L  D + R+SLL        + Q + +SP+
Sbjct: 16   PSSSAAANTTTEYLKLPLLHKTPFTSPSEALAFDINRRLSLLHHHRHQQQHKQNSFRSPV 75

Query: 345  VSGASGGSGQYFVDFSIGTPPQKLLLVADTGSDLVWVKCSACRNCTNHVPGSAFFPRHSS 524
            +SGAS GSGQYFV   IGTPPQ LLLVADTGSDL+WVKCS CRNC++  PGSAFF RHS+
Sbjct: 76   ISGASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSAFFARHST 135

Query: 525  TFSPFHCYDSACRLVPHSPETGRVCNRTRLHSTCRYEYSYADESKTSGFFSRETTRLNTS 704
            T+S  HCY   C+LVPH       CNRTRLHS CRY+Y+YAD S T+GFFS+E   LNTS
Sbjct: 136  TYSAIHCYSPQCQLVPHPHPNP--CNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTS 193

Query: 705  SGQEAKLKNLAFGCGFRQSGPSVSGASFNGASGVMGLGRGPISFSTQLGKRFGNKFSYCL 884
            +G+  KL  L+FGCGFR SGPS++GASF GA GVMGLGR PISFS+QLG+RFG+KFSYCL
Sbjct: 194  TGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKFSYCL 253

Query: 885  MDYTLSPPPTSFLMIGEAHGPIKPKSKPKMSFTPLQANRLSPTFYYVQIKSVMINGVKLR 1064
            MDYTLSPPPTSFL IG A   +    K  MSFTPL  N LSPTFYY+ IK V +NGVKL 
Sbjct: 254  MDYTLSPPPTSFLTIGGAQN-VAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLP 312

Query: 1065 IDPSVWVFDGEGNGGTVIDSGTTLTFFAEPAYRHILMAVKRRAKLPIIDDPTKSFDFCVN 1244
            I+PSVW  D  GNGGT+IDSGTTLTF  EPAY  IL A K+R KLP   +PT  FD C+N
Sbjct: 313  INPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGFDLCMN 372

Query: 1245 VSGSVNPSLPRLSFELIGNSVFSPPSTNYFIDTADDVKCLALQPVNSPSGVSVIGNLMQQ 1424
            VSG   P+LPR+SF L G SVFSPP  NYFI+T D +KCLA+QPV+   G SV+GNLMQQ
Sbjct: 373  VSGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDGGFSVLGNLMQQ 432

Query: 1425 GFLFEFDREKSRLGFTQRGCSLP 1493
            GFL EFDR+KSRLGFT+RGC+LP
Sbjct: 433  GFLLEFDRDKSRLGFTRRGCALP 455


Top