BLASTX nr result

ID: Coptis23_contig00012408 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00012408
         (1561 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1...   588   e-165
ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit,...   563   e-158
ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2...   556   e-156
ref|NP_189198.1| aspartyl protease family protein [Arabidopsis t...   556   e-156
ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arab...   550   e-154

>ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  588 bits (1515), Expect = e-165
 Identities = 284/426 (66%), Positives = 343/426 (80%), Gaps = 1/426 (0%)
 Frame = -3

Query: 1454 YLKLPLLHTSPFISPLKSLHSDTSRLSILFSTLTNNPKSLKSPLVSGAPMGSGQYFVDFS 1275
            YLKL LLH  PF +P ++L  D+ RLS  FS L + P+SLKSP+VSGA  GSGQYFVD  
Sbjct: 36   YLKLRLLHIKPFTTPSQALSFDSHRLSFFFSAL-HTPQSLKSPVVSGASTGSGQYFVDLR 94

Query: 1274 IGTPPQKLLLVADTGSDLVWVKCSACKDCTKHLPGSSFFARHSSTFSPFHCYDRQCRLVP 1095
            +GTPPQKLLLVADTGSDLVWVKCSAC++CT+H PGS+F ARHS+TFSP HCYD  C+LVP
Sbjct: 95   LGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARHSTTFSPNHCYDSACQLVP 154

Query: 1094 -HSERVCNSTRLHSSCRYEYTYGDESKTSGFFSKETTTLNTSSGHEAKLRNLEFGCGFRV 918
                  CN  RLHS CRYEY+YGD SKTSGFFSKETTTLNTSSG EAKL+ + FGC FR+
Sbjct: 155  LPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKGIAFGCAFRI 214

Query: 917  FGPSVSGSSFNGANGVMGLGRGPISFSTQLGKRFGNKFSYCLMDYTLSPPPTSFLMIGQA 738
             GPSVSG+SFNGA+GVMGLGRGPIS S+QLG RFGNKFSYCLMD+ +SP PTS+L+IG  
Sbjct: 215  SGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYCLMDHDISPSPTSYLLIGS- 273

Query: 737  HYSESKSIRSKPKMCFTPLQTNPISPTFYYIGIKSVLINGVNLRIDPSVWVFSEDGSGGT 558
              +++     K +M FTPL  NP+SPTFYYIGI+SV ++G+ L I+PSVW   E G+GGT
Sbjct: 274  --TQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPINPSVWALDELGNGGT 331

Query: 557  VIDSGTTLSFLAEPAYLQILTAFKRRVKYPIVDDPSLSFELCVNVSDSVNPSLPRLSFKL 378
            ++DSGTTL+FL EPAYLQILT  KRRV+ P   +P+  F+LCVNVS+  +P LP+LSFKL
Sbjct: 332  IVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPTPGFDLCVNVSEIEHPRLPKLSFKL 391

Query: 377  VGNSVFSPPSANYFINVADGIKCLALQSVISESGFSVIGNLMQQGFLFEFDRDKSRLGFT 198
             G+SVFSPP  NYF++  + +KCLALQ+V++ SGFSVIGNLMQQGFL EFD+D++RLGF+
Sbjct: 392  GGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSGFSVIGNLMQQGFLLEFDKDRTRLGFS 451

Query: 197  RRGCGI 180
            R GC +
Sbjct: 452  RHGCAL 457


>ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
            communis] gi|223536362|gb|EEF38012.1| basic 7S globulin 2
            precursor small subunit, putative [Ricinus communis]
          Length = 455

 Score =  563 bits (1451), Expect = e-158
 Identities = 279/444 (62%), Positives = 333/444 (75%), Gaps = 5/444 (1%)
 Frame = -3

Query: 1496 PSTSSQPNTQKQDFYLKLPLLHTSPFISPLKSLHSDTSR-LSILFS---TLTNNPKSLKS 1329
            PS+S+  NT  +  YLKLPLLH +PF SP ++L  D +R LS+L        +   S +S
Sbjct: 16   PSSSAAANTTTE--YLKLPLLHKTPFTSPSEALAFDINRRLSLLHHHRHQQQHKQNSFRS 73

Query: 1328 PLVSGAPMGSGQYFVDFSIGTPPQKLLLVADTGSDLVWVKCSACKDCTKHLPGSSFFARH 1149
            P++SGA  GSGQYFV   IGTPPQ LLLVADTGSDL+WVKCS C++C+   PGS+FFARH
Sbjct: 74   PVISGASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSAFFARH 133

Query: 1148 SSTFSPFHCYDRQCRLVPHSE-RVCNSTRLHSSCRYEYTYGDESKTSGFFSKETTTLNTS 972
            S+T+S  HCY  QC+LVPH     CN TRLHS CRY+YTY D S T+GFFSKE  TLNTS
Sbjct: 134  STTYSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTS 193

Query: 971  SGHEAKLRNLEFGCGFRVFGPSVSGSSFNGANGVMGLGRGPISFSTQLGKRFGNKFSYCL 792
            +G   KL  L FGCGFR+ GPS++G+SF GA GVMGLGR PISFS+QLG+RFG+KFSYCL
Sbjct: 194  TGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKFSYCL 253

Query: 791  MDYTLSPPPTSFLMIGQAHYSESKSIRSKPKMCFTPLQTNPISPTFYYIGIKSVLINGVN 612
            MDYTLSPPPTSFL IG A   ++ ++  K  M FTPL  NP+SPTFYYI IK V +NGV 
Sbjct: 254  MDYTLSPPPTSFLTIGGA---QNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVK 310

Query: 611  LRIDPSVWVFSEDGSGGTVIDSGTTLSFLAEPAYLQILTAFKRRVKYPIVDDPSLSFELC 432
            L I+PSVW   + G+GGT+IDSGTTL+F+ EPAY +IL AFK+RVK P   +P+  F+LC
Sbjct: 311  LPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGFDLC 370

Query: 431  VNVSDSVNPSLPRLSFKLVGNSVFSPPSANYFINVADGIKCLALQSVISESGFSVIGNLM 252
            +NVS    P+LPR+SF L G SVFSPP  NYFI   D IKCLA+Q V  + GFSV+GNLM
Sbjct: 371  MNVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDGGFSVLGNLM 430

Query: 251  QQGFLFEFDRDKSRLGFTRRGCGI 180
            QQGFL EFDRDKSRLGFTRRGC +
Sbjct: 431  QQGFLLEFDRDKSRLGFTRRGCAL 454


>ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
            gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic
            proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score =  556 bits (1434), Expect = e-156
 Identities = 274/427 (64%), Positives = 335/427 (78%), Gaps = 2/427 (0%)
 Frame = -3

Query: 1454 YLKLPLLHTSPFISPLKSLHSDTSRLSILFSTLTNNPKSLKSPLVSGAPMGSGQYFVDFS 1275
            +LKLPLLH  PF SP +SL SDT RLS+LFS    NP +LKSPL+SGA  GSGQYFVD  
Sbjct: 37   FLKLPLLHKPPFSSPSQSLSSDTHRLSLLFSR--PNP-TLKSPLISGASTGSGQYFVDIR 93

Query: 1274 IGTPPQKLLLVADTGSDLVWVKCSACKDCTKHLPGSSFFARHSSTFSPFHCYDRQCRLVP 1095
            +GTPPQ LLLVADTGSDLVWVKCSAC++C+ H P S+F  RHSS+FSPFHC+D  CRL+P
Sbjct: 94   LGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLP 153

Query: 1094 HS-ERVCNSTRLHSSCRYEYTYGDESKTSGFFSKETTTLNTSSGHEAKLRNLEFGCGFRV 918
            H+   +CN TRLHS CR+ Y+Y D S +SGFFSKETTTL + SG E  L+ L FGCGFR+
Sbjct: 154  HAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRI 213

Query: 917  FGPSVSGSSFNGANGVMGLGRGPISFSTQLGKRFGNKFSYCLMDYTLSPPPTSFLMIGQA 738
             GPSVSG+ FNGA GVMGLGRG ISFS+QLG+RFGNKFSYCLMDYTLSPPPTSFLMIG  
Sbjct: 214  SGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGG 273

Query: 737  HYSESKSIRSKPKMCFTPLQTNPISPTFYYIGIKSVLINGVNLRIDPSVWVFSEDGSGGT 558
             +  S  + +  K+ +TPLQ NP+SPTFYYI I S+ I+GV L I+P+VW   E G+GGT
Sbjct: 274  LH--SLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPINPAVWEIDEQGNGGT 331

Query: 557  VIDSGTTLSFLAEPAYLQILTAFKRRVKYPIVDDPSLSFELCVNVS-DSVNPSLPRLSFK 381
            V+DSGTTL++L + AY ++L + +RRVK P   + +  F+LCVN S +S  PSLPRL F+
Sbjct: 332  VVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRFR 391

Query: 380  LVGNSVFSPPSANYFINVADGIKCLALQSVISESGFSVIGNLMQQGFLFEFDRDKSRLGF 201
            L G +VF+PP  NYF+   +G+ CLA+++V S +GFSVIGNLMQQGFL EFD+++SRLGF
Sbjct: 392  LGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDKEESRLGF 451

Query: 200  TRRGCGI 180
            TRRGCG+
Sbjct: 452  TRRGCGL 458


>ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
            gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA
            binding protein-like; nucellin-like protein [Arabidopsis
            thaliana] gi|189339286|gb|ACD89063.1| At3g25700
            [Arabidopsis thaliana] gi|332643533|gb|AEE77054.1|
            aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score =  556 bits (1433), Expect = e-156
 Identities = 276/430 (64%), Positives = 328/430 (76%), Gaps = 5/430 (1%)
 Frame = -3

Query: 1454 YLKLPLLHTSPFISPLKSLHSDTSRLSILFSTLTNNP-KSLKSPLVSGAPMGSGQYFVDF 1278
            YLKLPLL  SPF SP ++L  DT RL  L  +L   P   +KSP+VSGA  GSGQYFVD 
Sbjct: 31   YLKLPLLRKSPFPSPTQALALDTRRLHFL--SLRRKPIPFVKSPVVSGAASGSGQYFVDL 88

Query: 1277 SIGTPPQKLLLVADTGSDLVWVKCSACKDCTKHLPGSSFFARHSSTFSPFHCYDRQCRLV 1098
             IG PPQ LLL+ADTGSDLVWVKCSAC++C+ H P + FF RHSSTFSP HCYD  CRLV
Sbjct: 89   RIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLV 148

Query: 1097 PHSER--VCNSTRLHSSCRYEYTYGDESKTSGFFSKETTTLNTSSGHEAKLRNLEFGCGF 924
            P  +R  +CN TR+HS+C YEY Y D S TSG F++ETT+L TSSG EA+L+++ FGCGF
Sbjct: 149  PKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGF 208

Query: 923  RVFGPSVSGSSFNGANGVMGLGRGPISFSTQLGKRFGNKFSYCLMDYTLSPPPTSFLMIG 744
            R+ G SVSG+SFNGANGVMGLGRGPISF++QLG+RFGNKFSYCLMDYTLSPPPTS+L+IG
Sbjct: 209  RISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIG 268

Query: 743  QAHYSESKSIRSKPKMCFTPLQTNPISPTFYYIGIKSVLINGVNLRIDPSVWVFSEDGSG 564
                  S       K+ FTPL TNP+SPTFYY+ +KSV +NG  LRIDPS+W   + G+G
Sbjct: 269  NGGDGIS-------KLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNG 321

Query: 563  GTVIDSGTTLSFLAEPAYLQILTAFKRRVKYPIVDDPSLSFELCVNVSDSVNPS--LPRL 390
            GTV+DSGTTL+FLAEPAY  ++ A +RRVK PI D  +  F+LCVNVS    P   LPRL
Sbjct: 322  GTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPGFDLCVNVSGVTKPEKILPRL 381

Query: 389  SFKLVGNSVFSPPSANYFINVADGIKCLALQSVISESGFSVIGNLMQQGFLFEFDRDKSR 210
             F+  G +VF PP  NYFI   + I+CLA+QSV  + GFSVIGNLMQQGFLFEFDRD+SR
Sbjct: 382  KFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSR 441

Query: 209  LGFTRRGCGI 180
            LGF+RRGC +
Sbjct: 442  LGFSRRGCAL 451


>ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
            lyrata] gi|297321109|gb|EFH51530.1| hypothetical protein
            ARALYDRAFT_484331 [Arabidopsis lyrata subsp. lyrata]
          Length = 451

 Score =  550 bits (1417), Expect = e-154
 Identities = 274/430 (63%), Positives = 327/430 (76%), Gaps = 5/430 (1%)
 Frame = -3

Query: 1454 YLKLPLLHTSPFISPLKSLHSDTSRLSILFSTLTNNPKS-LKSPLVSGAPMGSGQYFVDF 1278
            YLKLPLL  SPF SP ++L  DT RL  L  +L   P   +KSP+VSGA  GSGQYFVD 
Sbjct: 30   YLKLPLLRKSPFPSPTQALALDTRRLHFL--SLRRKPVPFVKSPVVSGASSGSGQYFVDL 87

Query: 1277 SIGTPPQKLLLVADTGSDLVWVKCSACKDCTKHLPGSSFFARHSSTFSPFHCYDRQCRLV 1098
             IG PPQ LLL+ADTGSDLVWVKCSAC++C+ H P + FF RHSSTFSP HCYD  CRLV
Sbjct: 88   RIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLV 147

Query: 1097 PHSERV--CNSTRLHSSCRYEYTYGDESKTSGFFSKETTTLNTSSGHEAKLRNLEFGCGF 924
            P   R   CN TR+HS+C YEY Y D S TSG F++ETT+L TSSG EAKL+++ FGCGF
Sbjct: 148  PKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLKSVAFGCGF 207

Query: 923  RVFGPSVSGSSFNGANGVMGLGRGPISFSTQLGKRFGNKFSYCLMDYTLSPPPTSFLMIG 744
            R+ G SVSG+SFNGANGVMGLGRGPISF++QLG+RFGNKFSYCLMDYTLSPPPTS+L+IG
Sbjct: 208  RISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIG 267

Query: 743  QAHYSESKSIRSKPKMCFTPLQTNPISPTFYYIGIKSVLINGVNLRIDPSVWVFSEDGSG 564
                + S       K+ FTPL TNP+SPTFYY+ +KSV +NG  LRIDPS+W   + G+G
Sbjct: 268  DGGDAVS-------KLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNG 320

Query: 563  GTVIDSGTTLSFLAEPAYLQILTAFKRRVKYPIVDDPSLSFELCVNVSDSVNPS--LPRL 390
            GTV+DSGTTL+FLA+PAY  ++ A K+R+K P  D+ +  F+LCVNVS    P   LPRL
Sbjct: 321  GTVMDSGTTLAFLADPAYRLVIAAVKQRIKLPNADELTPGFDLCVNVSGVTKPEKILPRL 380

Query: 389  SFKLVGNSVFSPPSANYFINVADGIKCLALQSVISESGFSVIGNLMQQGFLFEFDRDKSR 210
             F+  G +VF PP  NYFI   + I+CLA+QSV  + GFSVIGNLMQQGFLFEFDRD+SR
Sbjct: 381  KFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSR 440

Query: 209  LGFTRRGCGI 180
            LGF+RRGC +
Sbjct: 441  LGFSRRGCAL 450

