BLASTX nr result

ID: Cnidium21_contig00009071

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cnidium21_contig00009071
         (1535 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002302634.1| predicted protein [Populus trichocarpa] gi|2...   583   e-164
ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1...   578   e-162
ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor,...   568   e-159
ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis tha...   567   e-159
ref|XP_002320868.1| predicted protein [Populus trichocarpa] gi|2...   566   e-159

>ref|XP_002302634.1| predicted protein [Populus trichocarpa] gi|222844360|gb|EEE81907.1|
            predicted protein [Populus trichocarpa]
          Length = 490

 Score =  583 bits (1503), Expect = e-164
 Identities = 299/466 (64%), Positives = 349/466 (74%), Gaps = 10/466 (2%)
 Frame = +1

Query: 133  QYQTLITKSLPLLHPLTW---DQQSDTVTQLAGEDTEN------TLSLQLHHLDYLNLDH 285
            Q+QTL    LP    L+W   + +S+  TQ   + T        +LS+QLHHLD L+ D 
Sbjct: 33   QFQTLTVNPLPNKPTLSWADTEPESEPETQTLTDSTSTEASTTTSLSVQLHHLDALSSDE 92

Query: 286  TNANTTSDALFISRLTRDAGRVDALSTIAALKTNSTRTRRRGKATSDFSSSIISGLAHGS 465
            T  +     LF SRL RDA RV +L+++AA   ++ RTR RG     FSSS+ SGLA GS
Sbjct: 93   TPQD-----LFNSRLARDASRVKSLTSLAAAVGSTNRTRARGPG---FSSSVTSGLAQGS 144

Query: 466  GEYFTRLGVGTPARYSYMVLDTGSDVVWIQCSPCRKCYTQSDPVFDPTKSSTFGGVACGS 645
            GEYFTRLGVGTPARY +MVLDTGSDVVWIQC+PC+KCY+Q+DPVF+PTKS +F  + CGS
Sbjct: 145  GEYFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANIPCGS 204

Query: 646  PLCRRLDSPGCNSGKK-CMYQVSYGDGSFTVGEFSTETMTFRKNRVKNIALGCGHDNEGL 822
            PLCRRLDSPGC++ K  C+YQVSYGDGSFT GEFSTET+TFR  RV  +ALGCGHDNEGL
Sbjct: 205  PLCRRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGTRVGRVALGCGHDNEGL 264

Query: 823  FVXXXXXXXXXXXXXSFPSQAGPRFGRAFSYCLVDRXXXXXXXXXXXXXXXVSRKAVFTP 1002
            F+             SFPSQ G RF R FSYCLVDR               +SR A FTP
Sbjct: 265  FIGAAGLLGLGRGRLSFPSQIGRRFSRKFSYCLVDRSASSKPSYMVFGDSAISRTARFTP 324

Query: 1003 LISNPKLDTFYYIGLTGISVGGALVRGVSASLFKIDAAGNGGVIIDSGTSVTRLTRPAYI 1182
            L+SNPKLDTFYY+ L G+SVGG  V G++ASLFK+D+ GNGGVIIDSGTSVTRLTRPAY+
Sbjct: 325  LVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGTSVTRLTRPAYV 384

Query: 1183 AVRDAFRVGARNLIRASSFSLFDTCFDLSGVSEVKVPTVLFHFKGANVALPASNYLIPVD 1362
            A+RDAFRVGA NL RA  FSLFDTCFDLSG +EVKVPTV+ HF+GA+V+LPASNYLIPVD
Sbjct: 385  ALRDAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPASNYLIPVD 444

Query: 1363 SKGTFCFAFAGTSNGLSIIGNIQQQGFRVVYDLGRSRVGFAKDGCA 1500
            + G+FCFAFAGT +GLSI+GNIQQQGFRVVYDL  SRVGFA  GCA
Sbjct: 445  NSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGFAPRGCA 490


>ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score =  578 bits (1490), Expect = e-162
 Identities = 298/456 (65%), Positives = 345/456 (75%), Gaps = 1/456 (0%)
 Frame = +1

Query: 133  QYQTLITKSLPLLHPLTWDQQSDTVTQLAGEDTENTLSLQLHHLDYLNLDHTNANTTSDA 312
            Q QTL   SLP    ++W +     ++   +  E  LSL LHH+D L+     +N T + 
Sbjct: 31   QTQTLPLHSLPHPPAISWPE-----SESEPDPEEEALSLHLHHIDALS-----SNKTPEQ 80

Query: 313  LFISRLTRDAGRVDALSTIAALKTNSTRTRRRGKATSDFSSSIISGLAHGSGEYFTRLGV 492
            LF  RL RDA RV+ +  +AAL  N +  RR G   S FSSSIISGLA GSGEYFTR+GV
Sbjct: 81   LFQLRLQRDAKRVEGVVALAAL--NQSHARRSG---SSFSSSIISGLAQGSGEYFTRIGV 135

Query: 493  GTPARYSYMVLDTGSDVVWIQCSPCRKCYTQSDPVFDPTKSSTFGGVACGSPLCRRLDSP 672
            GTPARY YMVLDTGSDVVW+QC+PCRKCYTQ+DPVFDPTKS T+ G+ CG+PLCRRLDSP
Sbjct: 136  GTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVFDPTKSRTYAGIPCGAPLCRRLDSP 195

Query: 673  GCNSGKK-CMYQVSYGDGSFTVGEFSTETMTFRKNRVKNIALGCGHDNEGLFVXXXXXXX 849
            GCN+  K C YQVSYGDGSFT G+FSTET+TFR+ RV  +ALGCGHDNEGLF+       
Sbjct: 196  GCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRTRVTRVALGCGHDNEGLFIGAAGLLG 255

Query: 850  XXXXXXSFPSQAGPRFGRAFSYCLVDRXXXXXXXXXXXXXXXVSRKAVFTPLISNPKLDT 1029
                  SFP Q G RF + FSYCLVDR               VSR A FTPLI NPKLDT
Sbjct: 256  LGRGRLSFPVQTGRRFNQKFSYCLVDRSASAKPSSVVFGDSAVSRTARFTPLIKNPKLDT 315

Query: 1030 FYYIGLTGISVGGALVRGVSASLFKIDAAGNGGVIIDSGTSVTRLTRPAYIAVRDAFRVG 1209
            FYY+ L GISVGG+ VRG+SASLF++DAAGNGGVIIDSGTSVTRLTRPAYIA+RDAFRVG
Sbjct: 316  FYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAFRVG 375

Query: 1210 ARNLIRASSFSLFDTCFDLSGVSEVKVPTVLFHFKGANVALPASNYLIPVDSKGTFCFAF 1389
            A +L RA+ FSLFDTCFDLSG++EVKVPTV+ HF+GA+V+LPA+NYLIPVD+ G+FCFAF
Sbjct: 376  ASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVLHFRGADVSLPATNYLIPVDNSGSFCFAF 435

Query: 1390 AGTSNGLSIIGNIQQQGFRVVYDLGRSRVGFAKDGC 1497
            AGT +GLSIIGNIQQQGFRV +DL  SRVGFA  GC
Sbjct: 436  AGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAPRGC 471


>ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223537425|gb|EEF39053.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 469

 Score =  568 bits (1464), Expect = e-159
 Identities = 295/458 (64%), Positives = 335/458 (73%), Gaps = 3/458 (0%)
 Frame = +1

Query: 136  YQTLITKSLPLLHPLTW-DQQSDTVTQLAGEDTENTLSLQLHHLDYLNLDHTNANTTSDA 312
            YQTL+   L     L+W D +S T T     ++  T S+QLHH+D L+      N+T + 
Sbjct: 28   YQTLVANPLRSQPTLSWTDSESPTDTA----ESSATFSVQLHHVDALSF-----NSTPET 78

Query: 313  LFISRLTRDAGRVDALSTIAALKTNSTRTRRRGKAT-SDFSSSIISGLAHGSGEYFTRLG 489
            LF +RL RDA RV+A+S +A        T   GK   + FSSS+ISGLA GSGEYFTR+G
Sbjct: 79   LFTTRLQRDAARVEAISYLA-------ETAGTGKRVGTGFSSSVISGLAQGSGEYFTRIG 131

Query: 490  VGTPARYSYMVLDTGSDVVWIQCSPCRKCYTQSDPVFDPTKSSTFGGVACGSPLCRRLDS 669
            VGTP RY YMVLDTGSD+VWIQC+PC++CY QSDPVFDP KS +F  +AC SPLC RLDS
Sbjct: 132  VGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASIACRSPLCHRLDS 191

Query: 670  PGCNSGKK-CMYQVSYGDGSFTVGEFSTETMTFRKNRVKNIALGCGHDNEGLFVXXXXXX 846
            PGCN+ K+ CMYQVSYGDGSFT G+FSTET+TFR+ RV  +ALGCGHDNEGLFV      
Sbjct: 192  PGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTRVARVALGCGHDNEGLFVGAAGLL 251

Query: 847  XXXXXXXSFPSQAGPRFGRAFSYCLVDRXXXXXXXXXXXXXXXVSRKAVFTPLISNPKLD 1026
                   SFPSQ G RF   FSYCLVDR               VSR A FTPL+SNPKLD
Sbjct: 252  GLGRGRLSFPSQTGRRFNHKFSYCLVDRSASSKPSSMVFGDSAVSRTARFTPLVSNPKLD 311

Query: 1027 TFYYIGLTGISVGGALVRGVSASLFKIDAAGNGGVIIDSGTSVTRLTRPAYIAVRDAFRV 1206
            TFYY+ L GISVGG  V G++ASLFK+D  GNGGVIIDSGTSVTRLTRPAYIA RDAFR 
Sbjct: 312  TFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIIDSGTSVTRLTRPAYIAFRDAFRA 371

Query: 1207 GARNLIRASSFSLFDTCFDLSGVSEVKVPTVLFHFKGANVALPASNYLIPVDSKGTFCFA 1386
            GA NL RA  FSLFDTCFDLSG +EVKVPTV+ HF+GA+V+LPASNYLIPVD+ G FC A
Sbjct: 372  GASNLKRAPQFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPASNYLIPVDTSGNFCLA 431

Query: 1387 FAGTSNGLSIIGNIQQQGFRVVYDLGRSRVGFAKDGCA 1500
            FAGT  GLSIIGNIQQQGFRVVYDL  SRVGFA  GCA
Sbjct: 432  FAGTMGGLSIIGNIQQQGFRVVYDLAGSRVGFAPHGCA 469


>ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
            gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein
            [Arabidopsis thaliana] gi|22135930|gb|AAM91547.1|
            chloroplast nucleoid DNA binding protein, putative
            [Arabidopsis thaliana] gi|30387595|gb|AAP31963.1|
            At1g01300 [Arabidopsis thaliana]
            gi|332189147|gb|AEE27268.1| aspartyl protease-like
            protein [Arabidopsis thaliana]
          Length = 485

 Score =  567 bits (1462), Expect = e-159
 Identities = 296/468 (63%), Positives = 347/468 (74%), Gaps = 13/468 (2%)
 Frame = +1

Query: 136  YQTLI--TKSLPLLHPLTWDQQSDTVTQL-----AGEDTENTLSLQLHHLDYLNLDHTNA 294
            +QTL   + SLP   P+++   SD+ + L     +G D+E++ S+ L      NLDH +A
Sbjct: 28   FQTLFPNSHSLPCASPVSFQPDSDSESLLESEFESGSDSESSSSITL------NLDHIDA 81

Query: 295  ---NTTSDALFISRLTRDAGRVDALSTIAAL--KTNSTRTRRRGKATSDFSSSIISGLAH 459
               N T D LF SRL RD+ RV +++T+AA     N T   R G     FSSS++SGL+ 
Sbjct: 82   LSSNKTPDELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPG----GFSSSVVSGLSQ 137

Query: 460  GSGEYFTRLGVGTPARYSYMVLDTGSDVVWIQCSPCRKCYTQSDPVFDPTKSSTFGGVAC 639
            GSGEYFTRLGVGTPARY YMVLDTGSD+VW+QC+PCR+CY+QSDP+FDP KS T+  + C
Sbjct: 138  GSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPC 197

Query: 640  GSPLCRRLDSPGCNSGKK-CMYQVSYGDGSFTVGEFSTETMTFRKNRVKNIALGCGHDNE 816
             SP CRRLDS GCN+ +K C+YQVSYGDGSFTVG+FSTET+TFR+NRVK +ALGCGHDNE
Sbjct: 198  SSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNE 257

Query: 817  GLFVXXXXXXXXXXXXXSFPSQAGPRFGRAFSYCLVDRXXXXXXXXXXXXXXXVSRKAVF 996
            GLFV             SFP Q G RF + FSYCLVDR               VSR A F
Sbjct: 258  GLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARF 317

Query: 997  TPLISNPKLDTFYYIGLTGISVGGALVRGVSASLFKIDAAGNGGVIIDSGTSVTRLTRPA 1176
            TPL+SNPKLDTFYY+GL GISVGG  V GV+ASLFK+D  GNGGVIIDSGTSVTRL RPA
Sbjct: 318  TPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPA 377

Query: 1177 YIAVRDAFRVGARNLIRASSFSLFDTCFDLSGVSEVKVPTVLFHFKGANVALPASNYLIP 1356
            YIA+RDAFRVGA+ L RA  FSLFDTCFDLS ++EVKVPTV+ HF+GA+V+LPA+NYLIP
Sbjct: 378  YIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYLIP 437

Query: 1357 VDSKGTFCFAFAGTSNGLSIIGNIQQQGFRVVYDLGRSRVGFAKDGCA 1500
            VD+ G FCFAFAGT  GLSIIGNIQQQGFRVVYDL  SRVGFA  GCA
Sbjct: 438  VDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485


>ref|XP_002320868.1| predicted protein [Populus trichocarpa] gi|222861641|gb|EEE99183.1|
            predicted protein [Populus trichocarpa]
          Length = 488

 Score =  566 bits (1459), Expect = e-159
 Identities = 294/464 (63%), Positives = 335/464 (72%), Gaps = 8/464 (1%)
 Frame = +1

Query: 133  QYQTLITKSLPLLHPLTWDQQ-------SDTVTQLAGEDTENTLSLQLHHLDYLNLDHTN 291
            Q+QTL    LP    ++W          +D  T          LS+QLHH+D L+ D + 
Sbjct: 33   QFQTLTLNPLPNKPTISWADTEPGTQTFTDQTTSEPSSSATTFLSVQLHHIDALSSDKS- 91

Query: 292  ANTTSDALFISRLTRDAGRVDALSTIAALKTNSTRTRRRGKATSDFSSSIISGLAHGSGE 471
                S  LF SRL RDA RV +L ++AA    +  TR RG     FSSS+ISGLA GSGE
Sbjct: 92   ----SQDLFNSRLVRDAARVKSLISLAATVGGTNLTRARGPG---FSSSVISGLAQGSGE 144

Query: 472  YFTRLGVGTPARYSYMVLDTGSDVVWIQCSPCRKCYTQSDPVFDPTKSSTFGGVACGSPL 651
            YFTRLGVGTPARY YMVLDTGSD+VWIQC+PC KCY+Q+DPVFDPTKS +F  + CGSPL
Sbjct: 145  YFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKCYSQTDPVFDPTKSRSFANIPCGSPL 204

Query: 652  CRRLDSPGCNSGKK-CMYQVSYGDGSFTVGEFSTETMTFRKNRVKNIALGCGHDNEGLFV 828
            CRRLD PGC++ K+ C+YQVSYGDGSFTVGEFSTET+TFR  RV  + LGCGHDNEGLFV
Sbjct: 205  CRRLDYPGCSTKKQICLYQVSYGDGSFTVGEFSTETLTFRGTRVGRVVLGCGHDNEGLFV 264

Query: 829  XXXXXXXXXXXXXSFPSQAGPRFGRAFSYCLVDRXXXXXXXXXXXXXXXVSRKAVFTPLI 1008
                         SFPSQ G RF   FSYCL DR               +SR   FTPL+
Sbjct: 265  GAAGLLGLGRGRLSFPSQIGRRFNSKFSYCLGDRSASSRPSSIVFGDSAISRTTRFTPLL 324

Query: 1009 SNPKLDTFYYIGLTGISVGGALVRGVSASLFKIDAAGNGGVIIDSGTSVTRLTRPAYIAV 1188
            SNPKLDTFYY+ L GISVGG  V G+SASLFK+D+ GNGGVIIDSGTSVTRLTR AY+A+
Sbjct: 325  SNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTGNGGVIIDSGTSVTRLTRAAYVAL 384

Query: 1189 RDAFRVGARNLIRASSFSLFDTCFDLSGVSEVKVPTVLFHFKGANVALPASNYLIPVDSK 1368
            RDAF VGA NL RA  FSLFDTCFDLSG +EVKVPTV+ HF+GA+V LPASNYLIPVD+ 
Sbjct: 385  RDAFLVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHFRGADVPLPASNYLIPVDNS 444

Query: 1369 GTFCFAFAGTSNGLSIIGNIQQQGFRVVYDLGRSRVGFAKDGCA 1500
            G+FCFAFAGT++GLSIIGNIQQQGFRVVYDL  SRVGFA  GCA
Sbjct: 445  GSFCFAFAGTASGLSIIGNIQQQGFRVVYDLATSRVGFAPRGCA 488

