BLASTX nr result

ID: Coptis25_contig00016899

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00016899
         (1260 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor,...   506   e-141
ref|XP_002884082.1| aspartyl protease family protein [Arabidopsi...   469   e-130
dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]     467   e-129
ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis tha...   462   e-128
ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein ...   451   e-124

>ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223525945|gb|EEF28342.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 533

 Score =  506 bits (1304), Expect = e-141
 Identities = 249/420 (59%), Positives = 310/420 (73%), Gaps = 3/420 (0%)
 Frame = +3

Query: 9    ILLLLLFISRTCYGFGTVGFNFHHRFSDEVKGVLSVDDLPRKGTVDYYKAMAHRDRIVIH 188
            +L+L+L  S   YGFGT GF+ HHR+SD VKG+LSVDDLP KG++ YY +MAHRD I+IH
Sbjct: 24   LLVLMLSSSSFSYGFGTFGFDLHHRYSDPVKGMLSVDDLPEKGSLHYYASMAHRD-ILIH 82

Query: 189  GRGLATSTTGDQQQLSFENGNDTLRIPQLGFLHYANVSVGTPSLSFLVALDTGSDLFWVP 368
            GR L +  T     L+F +GN+T R   LGFLHYANVS+GTPSLS+LVALDTGSDLFW+P
Sbjct: 83   GRKLVSDNTSTP--LTFFSGNETYRFSSLGFLHYANVSIGTPSLSYLVALDTGSDLFWLP 140

Query: 369  CDCKS--CVKALVTNSGARLDLNIYXXXXXXXXXXVPCDSSACDVQGACSGVSRTCPYQV 542
            CDC +  CV+ L   SG ++D NIY          +PC+++ C  Q  C     TCPYQV
Sbjct: 141  CDCTNSGCVQGLQFPSGEQIDFNIYRPNASSTSQTIPCNNTLCSRQSRCPSAQSTCPYQV 200

Query: 543  EYLSNGTSSKGILIEDVLHLTTEGNHPEVVDAQITLGCGLVETGSFLNGAAPNGLFGLGM 722
            +YLSNGTSS G+L+ED+LHLTT+      +DA+I  GCG V+TGSFL+GAAPNGLFGLGM
Sbjct: 201  QYLSNGTSSTGVLVEDLLHLTTDDAQSRALDAKIIFGCGRVQTGSFLDGAAPNGLFGLGM 260

Query: 723  DKSSVPSILSSAGIAANSFSMCFGPDGIGRINFGDKGSVNQGETPFNMQQLQPMYNVSLT 902
               SVPS L+  G  +NSFSMCFG DGIGRI+FGD GS  QGETPFN++QL P YNVS+T
Sbjct: 261  TNISVPSTLAREGYTSNSFSMCFGRDGIGRISFGDTGSSGQGETPFNLRQLHPTYNVSIT 320

Query: 903  HVTVETNQSNVDFSAIFDSGTSFTYLNDPAYTGLSETFNFNVQDRRRKSDPNIPFEYCYD 1082
             + V    ++++FSAIFDSGTSFTYLNDPAYT +SE+FN   +++R  S  +IPFEYCY+
Sbjct: 321  KINVGGRDADLEFSAIFDSGTSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPFEYCYE 380

Query: 1083 VGPGKTTILVPNVTLTMKGGSQFFVFDPIIDISDESGAK-YCLAVVKSPDVNIIGQNFMT 1259
            +   +T + +P V L M+GGSQF V DPI+ +  + GA  YCLA+VKS DVNIIGQNFMT
Sbjct: 381  MSSNQTNLEIPTVNLVMQGGSQFNVTDPIVIVILQGGASIYCLAIVKSGDVNIIGQNFMT 440


>ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297329922|gb|EFH60341.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score =  469 bits (1207), Expect = e-130
 Identities = 230/418 (55%), Positives = 295/418 (70%), Gaps = 1/418 (0%)
 Frame = +3

Query: 9    ILLLLLFISRTCYGFGTVGFNFHHRFSDEVKGVLSVDDLPRKGTVDYYKAMAHRDRIVIH 188
            ILL   ++   C GFG  GF FHHRFSD+V GVL  D LP + +  YY+ MAHRDR+ I 
Sbjct: 16   ILLASSWVLERCEGFGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRL-IR 74

Query: 189  GRGLATSTTGDQQQLSFENGNDTLRIPQLGFLHYANVSVGTPSLSFLVALDTGSDLFWVP 368
            GR LA     DQ  ++F +GN+T+R+  LGFLHYANV+VGTPS  FLVALDTGSDLFW+P
Sbjct: 75   GRRLANE---DQSLVTFSDGNETIRVDALGFLHYANVTVGTPSDWFLVALDTGSDLFWLP 131

Query: 369  CDCKSCVKALVTNSGARLDLNIYXXXXXXXXXXVPCDSSACDVQGACSGVSRTCPYQVEY 548
            CDC +CV+ L    G+ LDLNIY          VPC+S+ C     C+     CPYQ+ Y
Sbjct: 132  CDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESNCPYQIRY 191

Query: 549  LSNGTSSKGILIEDVLHLTTEGNHPEVVDAQITLGCGLVETGSFLNGAAPNGLFGLGMDK 728
            LSNGTSS G+L+EDVLHL +     + + A++TLGCG V+TG F +GAAPNGLFGLG++ 
Sbjct: 192  LSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTLGCGQVQTGVFHDGAAPNGLFGLGLED 251

Query: 729  SSVPSILSSAGIAANSFSMCFGPDGIGRINFGDKGSVNQGETPFNMQQLQPMYNVSLTHV 908
             SVPS+L+  GIAANSFSMCFG DG GRI+FGDKGSV+Q ETP N++Q  P YN+++T +
Sbjct: 252  ISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKI 311

Query: 909  TVETNQSNVDFSAIFDSGTSFTYLNDPAYTGLSETFNFNVQDRR-RKSDPNIPFEYCYDV 1085
            +VE N  +++F A+FDSGTSFTYL D AYT +SE+FN    D+R + +D  +PFEYCY +
Sbjct: 312  SVEGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYAL 371

Query: 1086 GPGKTTILVPNVTLTMKGGSQFFVFDPIIDISDESGAKYCLAVVKSPDVNIIGQNFMT 1259
             P K +   P V LTMKGGS + V+ P++ I  +    YCLA++K  D++IIGQNFMT
Sbjct: 372  SPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAILKIEDISIIGQNFMT 429


>dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
          Length = 515

 Score =  467 bits (1201), Expect = e-129
 Identities = 230/418 (55%), Positives = 298/418 (71%), Gaps = 1/418 (0%)
 Frame = +3

Query: 9    ILLLLLFISRTCYGFGTVGFNFHHRFSDEVKGVLSVDDLPRKGTVDYYKAMAHRDRIVIH 188
            ++L+  ++   C G G  GF FHHRFSD+V GVL  D LP + +  YY+ MAHRDR+ I 
Sbjct: 16   LMLVSSWVLDRCEGLGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRL-IR 74

Query: 189  GRGLATSTTGDQQQLSFENGNDTLRIPQLGFLHYANVSVGTPSLSFLVALDTGSDLFWVP 368
            GR LA+    DQ  ++F +GN+T+R+  LGFLHYANV+VGTPS  FLVALDTGSDLFW+P
Sbjct: 75   GRRLASE---DQSLVTFADGNETIRVNALGFLHYANVTVGTPSDWFLVALDTGSDLFWLP 131

Query: 369  CDCKS-CVKALVTNSGARLDLNIYXXXXXXXXXXVPCDSSACDVQGACSGVSRTCPYQVE 545
            CDC + CV+ L    G+ LDLNIY          VPC+S+ C     C+     CPYQ+ 
Sbjct: 132  CDCSTNCVRELKAPGGSSLDLNIYSPNASSTSSKVPCNSTLCTRVDRCASPLSDCPYQIR 191

Query: 546  YLSNGTSSKGILIEDVLHLTTEGNHPEVVDAQITLGCGLVETGSFLNGAAPNGLFGLGMD 725
            YLSNGTSS G+L+EDVLHL +   + + + A+ITLGCGLV+TG F +GAAPNGLFGLG++
Sbjct: 192  YLSNGTSSTGVLVEDVLHLVSMEKNSKPIRARITLGCGLVQTGVFHDGAAPNGLFGLGLE 251

Query: 726  KSSVPSILSSAGIAANSFSMCFGPDGIGRINFGDKGSVNQGETPFNMQQLQPMYNVSLTH 905
              SVPS+L+  GIAANSFSMCFG DG GRI+FGDKGSV+Q ETP N++Q  P YNV++T 
Sbjct: 252  DISVPSVLAKEGIAANSFSMCFGDDGAGRISFGDKGSVDQRETPLNIRQPHPTYNVTVTQ 311

Query: 906  VTVETNQSNVDFSAIFDSGTSFTYLNDPAYTGLSETFNFNVQDRRRKSDPNIPFEYCYDV 1085
            ++V  N  +++F A+FD+GTSFTYL D  YT +SE+FN    D+R ++D  +PFEYCY V
Sbjct: 312  ISVGGNTGDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSELPFEYCYAV 371

Query: 1086 GPGKTTILVPNVTLTMKGGSQFFVFDPIIDISDESGAKYCLAVVKSPDVNIIGQNFMT 1259
             P K +   P+V LTMKGGS + V+ P+I +  E    YCLA++KS D++IIGQNFMT
Sbjct: 372  SPNKKSFEYPDVNLTMKGGSSYPVYHPLIVVPIEDTVVYCLAIMKSEDISIIGQNFMT 429


>ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
            gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid
            DNA-binding protein [Arabidopsis thaliana]
            gi|22655368|gb|AAM98276.1| At2g17760/At2g17760
            [Arabidopsis thaliana] gi|330251585|gb|AEC06679.1|
            aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 513

 Score =  462 bits (1190), Expect = e-128
 Identities = 227/418 (54%), Positives = 293/418 (70%), Gaps = 1/418 (0%)
 Frame = +3

Query: 9    ILLLLLFISRTCYGFGTVGFNFHHRFSDEVKGVLSVDDLPRKGTVDYYKAMAHRDRIVIH 188
            ILL   ++   C GFG  GF FHHRFSD+V GVL  D LP + +  YY+ MAHRDR+ I 
Sbjct: 16   ILLASSWVLDRCEGFGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRL-IR 74

Query: 189  GRGLATSTTGDQQQLSFENGNDTLRIPQLGFLHYANVSVGTPSLSFLVALDTGSDLFWVP 368
            GR LA     DQ  ++F +GN+T+R+  LGFLHYANV+VGTPS  F+VALDTGSDLFW+P
Sbjct: 75   GRRLANE---DQSLVTFSDGNETVRVDALGFLHYANVTVGTPSDWFMVALDTGSDLFWLP 131

Query: 369  CDCKSCVKALVTNSGARLDLNIYXXXXXXXXXXVPCDSSACDVQGACSGVSRTCPYQVEY 548
            CDC +CV+ L    G+ LDLNIY          VPC+S+ C     C+     CPYQ+ Y
Sbjct: 132  CDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRY 191

Query: 549  LSNGTSSKGILIEDVLHLTTEGNHPEVVDAQITLGCGLVETGSFLNGAAPNGLFGLGMDK 728
            LSNGTSS G+L+EDVLHL +     + + A++T GCG V+TG F +GAAPNGLFGLG++ 
Sbjct: 192  LSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLED 251

Query: 729  SSVPSILSSAGIAANSFSMCFGPDGIGRINFGDKGSVNQGETPFNMQQLQPMYNVSLTHV 908
             SVPS+L+  GIAANSFSMCFG DG GRI+FGDKGSV+Q ETP N++Q  P YN+++T +
Sbjct: 252  ISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKI 311

Query: 909  TVETNQSNVDFSAIFDSGTSFTYLNDPAYTGLSETFNFNVQDRR-RKSDPNIPFEYCYDV 1085
            +V  N  +++F A+FDSGTSFTYL D AYT +SE+FN    D+R + +D  +PFEYCY +
Sbjct: 312  SVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYAL 371

Query: 1086 GPGKTTILVPNVTLTMKGGSQFFVFDPIIDISDESGAKYCLAVVKSPDVNIIGQNFMT 1259
             P K +   P V LTMKGGS + V+ P++ I  +    YCLA++K  D++IIGQNFMT
Sbjct: 372  SPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAIMKIEDISIIGQNFMT 429


>ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 516

 Score =  451 bits (1161), Expect = e-124
 Identities = 232/423 (54%), Positives = 289/423 (68%), Gaps = 6/423 (1%)
 Frame = +3

Query: 9    ILLLLLFISRTCYGFGTVGFNFHHRFSDEVKGVLSVDDLPRKGTVDYYKAMAHRDRIVIH 188
            +LL L      CYG  T GF+ HHRFSD++KG+L +DD+P+KGT  YY  MAHRDR V  
Sbjct: 16   VLLSLAASQSCCYGLSTFGFDIHHRFSDQIKGMLGIDDVPQKGTPQYYAVMAHRDR-VFR 74

Query: 189  GRGLATSTTGDQQQLSFENGNDTLRIPQLGFLHYANVSVGTPSLSFLVALDTGSDLFWVP 368
            GR LA +       L+F  GNDT +I   GFLH+ANVSVGTP L FLVALDTGSDLFW+P
Sbjct: 75   GRRLAGAD--HHSPLTFAAGNDTHQIASSGFLHFANVSVGTPPLWFLVALDTGSDLFWLP 132

Query: 369  CDCKSCVKA-LVTNSGARLDLNIYXXXXXXXXXXVPCDSSA-CDVQGACSGVSRTCPYQV 542
            CDC SCV   L T +G  L  N Y          V C++S  C  +  C     TC YQV
Sbjct: 133  CDCISCVHGGLRTRTGKILKFNTYDLDKSSTSNEVSCNNSTFCRQRQQCPSAGSTCRYQV 192

Query: 543  EYLSNGTSSKGILIEDVLHLTTEGNHPEVVDAQITLGCGLVETGSFLNGAAPNGLFGLGM 722
            +YLSN TSS+G ++EDVLHL T+ +  +  D +I  GCG V+TG FLNGAAPNGLFGLGM
Sbjct: 193  DYLSNDTSSRGFVVEDVLHLITDDDQTKDADTRIAFGCGQVQTGVFLNGAAPNGLFGLGM 252

Query: 723  DKSSVPSILSSAGIAANSFSMCFGPDGIGRINFGDKGSVNQGETPFNMQQLQPMYNVSLT 902
            D  SVPSIL+  G+ +NSFSMCFG D  GRI FGD GS +Q +TPFN+++L P YN+++T
Sbjct: 253  DNISVPSILAREGLISNSFSMCFGSDSAGRITFGDTGSPDQRKTPFNVRKLHPTYNITIT 312

Query: 903  HVTVETNQSNVDFSAIFDSGTSFTYLNDPAYTGLSETFNFNVQDRRRKS---DPNIPFEY 1073
             + VE + ++++F AIFDSGTSFTY+NDPAYT + E +N  V+ +R  S   D NIPF+Y
Sbjct: 313  KIIVEDSVADLEFHAIFDSGTSFTYINDPAYTRIGEMYNSKVKAKRHSSQSPDSNIPFDY 372

Query: 1074 CYDVGPGKTTILVPNVTLTMKGGSQFFVFDPIIDI-SDESGAKYCLAVVKSPDVNIIGQN 1250
            CYD+   + TI VP + LTMKGG  ++V DPII + S+E G   CL + KS  VNIIGQN
Sbjct: 373  CYDISISQ-TIEVPFLNLTMKGGDDYYVMDPIIQVSSEEEGDLLCLGIQKSDSVNIIGQN 431

Query: 1251 FMT 1259
            FMT
Sbjct: 432  FMT 434

