BLASTX nr result

ID: Cimicifuga21_contig00009803 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cimicifuga21_contig00009803
         (946 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor,...   178   1e-42
ref|XP_002300215.1| predicted protein [Populus trichocarpa] gi|2...   178   2e-42
ref|XP_002876869.1| aspartyl protease family protein [Arabidopsi...   177   2e-42
ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis tha...   176   5e-42
gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein...   176   5e-42

>ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223537841|gb|EEF39457.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 442

 Score =  178 bits (452), Expect = 1e-42
 Identities = 120/311 (38%), Positives = 158/311 (50%), Gaps = 6/311 (1%)
 Frame = +3

Query: 15   PIFDPAKSSSFRPLICDDDECKKIDNNQCDPHNNCKYNITYGDDDTTIGVVVQETLQFRG 194
            PIFDP KSSSF  L C    CK +  + C   ++C+Y  TYGD  +T G +  ET  F  
Sbjct: 140  PIFDPKKSSSFSKLSCSSQLCKALPQSSCS--DSCEYLYTYGDYSSTQGTMATETFTFG- 196

Query: 195  ANENVTLEDIIIGCGYNNTNIMENLGEGIPGLIGLSRDPESFVSQLFYDKFAYCLGDVID 374
                V++ ++  GCG +N    +   +G  GL+GL R P S VSQL   KF+YCL   ID
Sbjct: 197  ---KVSIPNVGFGCGEDNEG--DGFTQG-SGLVGLGRGPLSLVSQLKEAKFSYCLTS-ID 249

Query: 375  DEAYGYARFGEAASITGFTTPIQSV----EDNGXXXXXXILEGISVDNTRLSIPNGTFSV 542
            D        G  AS+ G +  I++                LEGISV  TRL I   TF +
Sbjct: 250  DTKTSTLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQL 309

Query: 543  NKEYPSGFIIDSGATYTYLNKTAYNILINDLKQKTQGLRKVSDPNKLFELCYELEGFDLN 722
              +   G IIDSG T TYL ++A++++  +   +  GL   +      ELCY L      
Sbjct: 310  QDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQ-MGLPVDNSGATGLELCYNLPSDTSE 368

Query: 723  F-APVITLHFSGADLRLPVWNTWLTPIK-NVYCLSMFPTGGMSILGNFQQQNFNVGHDLD 896
               P + LHF+GADL LP  N  +      V CL+M  +GGMSI GN QQQN  V HDL+
Sbjct: 369  LEVPKLVLHFTGADLELPGENYMIADSSMGVICLAMGSSGGMSIFGNVQQQNMFVSHDLE 428

Query: 897  NNVVSFDLTYC 929
               +SF  T C
Sbjct: 429  KETLSFLPTNC 439


>ref|XP_002300215.1| predicted protein [Populus trichocarpa] gi|222847473|gb|EEE85020.1|
            predicted protein [Populus trichocarpa]
          Length = 439

 Score =  178 bits (451), Expect = 2e-42
 Identities = 117/311 (37%), Positives = 159/311 (51%), Gaps = 6/311 (1%)
 Frame = +3

Query: 15   PIFDPAKSSSFRPLICDDDECKKIDNNQCDPHNNCKYNITYGDDDTTIGVVVQETLQFRG 194
            PIFDP KSSSF  L C    C+ +  + C+  N C+Y  +YGD  +T G++  ETL F  
Sbjct: 137  PIFDPKKSSSFSKLSCSSQLCEALPQSSCN--NGCEYLYSYGDYSSTQGILASETLTFGK 194

Query: 195  ANENVTLEDIIIGCGYNNTNIMENLGEGIPGLIGLSRDPESFVSQLFYDKFAYCLGDVID 374
            A+    + ++  GCG +N     + G G   L+GL R P S VSQL   KF+YCL   +D
Sbjct: 195  AS----VPNVAFGCGADNEGSGFSQGAG---LVGLGRGPLSLVSQLKEPKFSYCL-TTVD 246

Query: 375  DEAYGYARFGEAASITGFTTPIQSV----EDNGXXXXXXILEGISVDNTRLSIPNGTFSV 542
            D        G  AS+   ++ I++                LEGISV +TRL I   TFS+
Sbjct: 247  DTKTSTLLMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSL 306

Query: 543  NKEYPSGFIIDSGATYTYLNKTAYNILINDLKQKTQGLRKVSDPNKLFELCYELEGFDLN 722
              +   G IIDSG T TYL ++A+N++  +   K   L   S  +   ++C+ L     N
Sbjct: 307  QDDGSGGLIIDSGTTITYLEESAFNLVAKEFTAKI-NLPVDSSGSTGLDVCFTLPSGSTN 365

Query: 723  F-APVITLHFSGADLRLPVWNTWL-TPIKNVYCLSMFPTGGMSILGNFQQQNFNVGHDLD 896
               P +  HF GADL LP  N  +      V CL+M  + GMSI GN QQQN  V HDL+
Sbjct: 366  IEVPKLVFHFDGADLELPAENYMIGDSSMGVACLAMGSSSGMSIFGNVQQQNMLVLHDLE 425

Query: 897  NNVVSFDLTYC 929
               +SF  T C
Sbjct: 426  KETLSFLPTQC 436


>ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297322707|gb|EFH53128.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score =  177 bits (450), Expect = 2e-42
 Identities = 121/323 (37%), Positives = 162/323 (50%), Gaps = 18/323 (5%)
 Frame = +3

Query: 15   PIFDPAKSSSFRPLICDDDECKKIDNNQC-DPHNNCKYNITYGDDDTTIGVVVQETLQFR 191
            PIFDP KSSS+  + C    C  +  + C +  ++C+Y  TYGD  +T G++  ET  F 
Sbjct: 148  PIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFE 207

Query: 192  GANENVTLEDIIIGCGYNNTNIMENLGEGIP---GLIGLSRDPESFVSQLFYDKFAYCLG 362
              N   ++  I  GCG      +EN G+G     GL+GL R P S +SQL   KF+YCL 
Sbjct: 208  DEN---SISGIGFGCG------VENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLT 258

Query: 363  DVIDDEAYGYARFGEAAS---------ITGFTTPIQSVEDNGXXXXXXILE--GISVDNT 509
             + D EA      G  AS         + G  T   S+  N        LE  GI+V   
Sbjct: 259  SIEDSEASSSLFIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAK 318

Query: 510  RLSIPNGTFSVNKEYPSGFIIDSGATYTYLNKTAYNILINDLKQKTQGLRKVSDPNKL-F 686
            RLS+   TF ++++   G IIDSG T TYL +TA+ +L  +   +      V D      
Sbjct: 319  RLSVEKSTFELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMS--LPVDDSGSTGL 376

Query: 687  ELCYELEGFDLNFA-PVITLHFSGADLRLPVWNTWLTPIKN-VYCLSMFPTGGMSILGNF 860
            +LC++L     N A P +  HF GADL LP  N  +      V CL+M  + GMSI GN 
Sbjct: 377  DLCFKLPNAAKNIAVPKLIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNV 436

Query: 861  QQQNFNVGHDLDNNVVSFDLTYC 929
            QQQNFNV HDL+   V+F  T C
Sbjct: 437  QQQNFNVLHDLEKETVTFVPTEC 459


>ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
            gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis
            thaliana] gi|110736021|dbj|BAE99983.1| putative
            chloroplast nucleoid DNA binding protein [Arabidopsis
            thaliana] gi|330250580|gb|AEC05674.1| aspartyl
            protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score =  176 bits (447), Expect = 5e-42
 Identities = 122/323 (37%), Positives = 160/323 (49%), Gaps = 18/323 (5%)
 Frame = +3

Query: 15   PIFDPAKSSSFRPLICDDDECKKIDNNQCDPHNN-CKYNITYGDDDTTIGVVVQETLQFR 191
            PIFDP KSSS+  + C    C  +  + C+   + C+Y  TYGD  +T G++  ET  F 
Sbjct: 147  PIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFE 206

Query: 192  GANENVTLEDIIIGCGYNNTNIMENLGEGIP---GLIGLSRDPESFVSQLFYDKFAYCLG 362
              N   ++  I  GCG      +EN G+G     GL+GL R P S +SQL   KF+YCL 
Sbjct: 207  DEN---SISGIGFGCG------VENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLT 257

Query: 363  DVIDDEAYGYARFGE---------AASITGFTTPIQSVEDNGXXXXXXILE--GISVDNT 509
             + D EA      G           AS+ G  T   S+  N        LE  GI+V   
Sbjct: 258  SIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAK 317

Query: 510  RLSIPNGTFSVNKEYPSGFIIDSGATYTYLNKTAYNILINDLKQKTQGLRKVSDPNKL-F 686
            RLS+   TF + ++   G IIDSG T TYL +TA+ +L  +   +      V D      
Sbjct: 318  RLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMS--LPVDDSGSTGL 375

Query: 687  ELCYELEGFDLNFA-PVITLHFSGADLRLPVWNTWLTPIKN-VYCLSMFPTGGMSILGNF 860
            +LC++L     N A P +  HF GADL LP  N  +      V CL+M  + GMSI GN 
Sbjct: 376  DLCFKLPDAAKNIAVPKMIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNV 435

Query: 861  QQQNFNVGHDLDNNVVSFDLTYC 929
            QQQNFNV HDL+   VSF  T C
Sbjct: 436  QQQNFNVLHDLEKETVSFVPTEC 458


>gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score =  176 bits (447), Expect = 5e-42
 Identities = 122/323 (37%), Positives = 160/323 (49%), Gaps = 18/323 (5%)
 Frame = +3

Query: 15  PIFDPAKSSSFRPLICDDDECKKIDNNQCDPHNN-CKYNITYGDDDTTIGVVVQETLQFR 191
           PIFDP KSSS+  + C    C  +  + C+   + C+Y  TYGD  +T G++  ET  F 
Sbjct: 39  PIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFE 98

Query: 192 GANENVTLEDIIIGCGYNNTNIMENLGEGIP---GLIGLSRDPESFVSQLFYDKFAYCLG 362
             N   ++  I  GCG      +EN G+G     GL+GL R P S +SQL   KF+YCL 
Sbjct: 99  DEN---SISGIGFGCG------VENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLT 149

Query: 363 DVIDDEAYGYARFGE---------AASITGFTTPIQSVEDNGXXXXXXILE--GISVDNT 509
            + D EA      G           AS+ G  T   S+  N        LE  GI+V   
Sbjct: 150 SIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAK 209

Query: 510 RLSIPNGTFSVNKEYPSGFIIDSGATYTYLNKTAYNILINDLKQKTQGLRKVSDPNKL-F 686
           RLS+   TF + ++   G IIDSG T TYL +TA+ +L  +   +      V D      
Sbjct: 210 RLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMS--LPVDDSGSTGL 267

Query: 687 ELCYELEGFDLNFA-PVITLHFSGADLRLPVWNTWLTPIKN-VYCLSMFPTGGMSILGNF 860
           +LC++L     N A P +  HF GADL LP  N  +      V CL+M  + GMSI GN 
Sbjct: 268 DLCFKLPDAAKNIAVPKMIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNV 327

Query: 861 QQQNFNVGHDLDNNVVSFDLTYC 929
           QQQNFNV HDL+   VSF  T C
Sbjct: 328 QQQNFNVLHDLEKETVSFVPTEC 350


Top