BLASTX nr result

ID: Cnidium21_contig00023171 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cnidium21_contig00023171
         (1691 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002321129.1| predicted protein [Populus trichocarpa] gi|2...   502   e-139
ref|XP_002529666.1| Protease ecfE, putative [Ricinus communis] g...   501   e-139
ref|XP_002881228.1| hypothetical protein ARALYDRAFT_482175 [Arab...   489   e-135
ref|NP_563729.1| peptidase M50-like protein [Arabidopsis thalian...   486   e-135
ref|NP_565745.1| serine protease [Arabidopsis thaliana] gi|14423...   485   e-134

>ref|XP_002321129.1| predicted protein [Populus trichocarpa] gi|222861902|gb|EEE99444.1|
            predicted protein [Populus trichocarpa]
          Length = 447

 Score =  502 bits (1293), Expect = e-139
 Identities = 273/435 (62%), Positives = 317/435 (72%), Gaps = 9/435 (2%)
 Frame = +2

Query: 5    NLTKIKNSKSPIYP-----KRXXXXXXXXXXXXNFKNQSFHYK----NRNPHQKRLKIKS 157
            +L +  NSKSP+        +               N SFH K    +R PH KRL ++S
Sbjct: 15   SLPRFSNSKSPVLEPPSLKSKTNFLKPLPSSPCYSSNLSFHPKTHLFSRCPHGKRLDLRS 74

Query: 158  FAIPGYDFSGFDGVQPVVEAAAVLTAIIVVHESGHFLAAYLQGIRVSKFAVGFGPILAKF 337
             A+ G+D   F+ V   +EAA VLTAIIVVHESGHFLAAYLQGI VSKFAVGFGP+LAKF
Sbjct: 75   CAVSGFDLGNFESV---LEAAGVLTAIIVVHESGHFLAAYLQGIHVSKFAVGFGPVLAKF 131

Query: 338  NANNVEYSLRAFPLGGFVGFXXXXXXXXXXXXXXXLLKNRPIFDRLLVVSAGVIANIVFA 517
            +A NVEYSLRAFPLGGFVGF               LLKNRPI DR +V+SAGVIANI+FA
Sbjct: 132  SAKNVEYSLRAFPLGGFVGFPDNDPESDIPVDDENLLKNRPILDRTIVISAGVIANIIFA 191

Query: 518  YLIIFVQILSVGLPVQDNFPGVLVPEVRPLSAAFRAGLLPGDVILGVNGVELSKQGPSLV 697
            Y IIFVQ+LSVGLPVQ+ FPGVLVPEVR  SAA R GLLPGDVIL VNG  L K GP+ V
Sbjct: 192  YAIIFVQVLSVGLPVQEAFPGVLVPEVRAFSAASRDGLLPGDVILAVNGTNLPKIGPNAV 251

Query: 698  SEVVNIIKKNPKADVMLKIKRSNQDIDITVSPDENSDGTGRIGVQLSPNVKFSNVRPKDL 877
            SEVV +IK +PK +V+LK+ R  QD +I V+PDE+ DGTG+IGVQLSPNVK + V  K++
Sbjct: 252  SEVVGVIKSSPKKNVLLKVGRGKQDFEIGVTPDESFDGTGKIGVQLSPNVKITKVVAKNI 311

Query: 878  IEAFKYTGREFFGLTFNVLDSLKQTFFNFSQTASKVSGPVAIIAVGAEVAKSSTNGLYQF 1057
            +EAF + G+EF GL+ NV+DSLKQTF NFSQ+ASKVSGPVAIIAVGAEVA+S+ +GLYQF
Sbjct: 312  LEAFNFAGKEFLGLSSNVVDSLKQTFLNFSQSASKVSGPVAIIAVGAEVARSNIDGLYQF 371

Query: 1058 XXXXXXXXXXXXXXXXXXXDGGSLALILVEAARGGKKLPSEIEQGIMSSGIMLVTVLGLF 1237
                               DGGSLA IL+EAARGG+KLP EIEQ IMSSGIMLV +LGLF
Sbjct: 372  AAVLNINLAVINLLPLPALDGGSLAFILIEAARGGRKLPLEIEQRIMSSGIMLVILLGLF 431

Query: 1238 LIVRDTLNLDFIKDL 1282
            LIVRDTLNLDFIKD+
Sbjct: 432  LIVRDTLNLDFIKDM 446


>ref|XP_002529666.1| Protease ecfE, putative [Ricinus communis]
            gi|223530846|gb|EEF32708.1| Protease ecfE, putative
            [Ricinus communis]
          Length = 447

 Score =  501 bits (1289), Expect = e-139
 Identities = 263/391 (67%), Positives = 308/391 (78%)
 Frame = +2

Query: 110  HYKNRNPHQKRLKIKSFAIPGYDFSGFDGVQPVVEAAAVLTAIIVVHESGHFLAAYLQGI 289
            H   R P  KRL  +S+A+ G+DFS F+ V   +EAA+VLTAII+VHESGHFLAAYLQGI
Sbjct: 59   HVFGRYPLGKRLDFRSWAVSGFDFSNFESV---LEAASVLTAIIIVHESGHFLAAYLQGI 115

Query: 290  RVSKFAVGFGPILAKFNANNVEYSLRAFPLGGFVGFXXXXXXXXXXXXXXXLLKNRPIFD 469
             VSKFAVGFGPILAKFNA NVEYS+RAFPLGGFVGF               LLKNRPI D
Sbjct: 116  HVSKFAVGFGPILAKFNAKNVEYSVRAFPLGGFVGFPDNDPESDIPPDDKNLLKNRPILD 175

Query: 470  RLLVVSAGVIANIVFAYLIIFVQILSVGLPVQDNFPGVLVPEVRPLSAAFRAGLLPGDVI 649
            R++V+SAGVIANI+FAY IIFVQ+LSVGLPVQ+ FPGVLVPEVR  SAA R GLLPGDVI
Sbjct: 176  RVIVISAGVIANIIFAYAIIFVQVLSVGLPVQEAFPGVLVPEVRAFSAASRDGLLPGDVI 235

Query: 650  LGVNGVELSKQGPSLVSEVVNIIKKNPKADVMLKIKRSNQDIDITVSPDENSDGTGRIGV 829
            L +NG++L K GPS VSEVV++IK+NPK +V+L + R  Q ++I V+PDEN DGTG+IGV
Sbjct: 236  LAINGIDLPKTGPSSVSEVVDVIKRNPKRNVLLTVGRGAQALEIGVTPDENFDGTGKIGV 295

Query: 830  QLSPNVKFSNVRPKDLIEAFKYTGREFFGLTFNVLDSLKQTFFNFSQTASKVSGPVAIIA 1009
            QLSPNVK + +  K+++EA  + G+EF GL+ NVLDSLKQTF NFSQ+ASKVSGPVAIIA
Sbjct: 296  QLSPNVKITKLVAKNVLEAINFAGKEFAGLSSNVLDSLKQTFLNFSQSASKVSGPVAIIA 355

Query: 1010 VGAEVAKSSTNGLYQFXXXXXXXXXXXXXXXXXXXDGGSLALILVEAARGGKKLPSEIEQ 1189
            VGAEVA+S+ +GLYQF                   DGGSLALIL+EAARGG+KLP EIEQ
Sbjct: 356  VGAEVARSNIDGLYQFAAVLNINLAVINLLPLPALDGGSLALILIEAARGGRKLPLEIEQ 415

Query: 1190 GIMSSGIMLVTVLGLFLIVRDTLNLDFIKDL 1282
             IMSSGIMLV +LGLFLIVRDTLNLDFI+D+
Sbjct: 416  RIMSSGIMLVILLGLFLIVRDTLNLDFIRDM 446


>ref|XP_002881228.1| hypothetical protein ARALYDRAFT_482175 [Arabidopsis lyrata subsp.
            lyrata] gi|297327067|gb|EFH57487.1| hypothetical protein
            ARALYDRAFT_482175 [Arabidopsis lyrata subsp. lyrata]
          Length = 444

 Score =  489 bits (1258), Expect = e-135
 Identities = 258/424 (60%), Positives = 313/424 (73%)
 Frame = +2

Query: 11   TKIKNSKSPIYPKRXXXXXXXXXXXXNFKNQSFHYKNRNPHQKRLKIKSFAIPGYDFSGF 190
            +K   SKS ++PK             + KN+      R P+ +R   ++ AI G D   F
Sbjct: 29   SKTHLSKSHLFPK------FTPLSNQSLKNRVLFGNKRYPNGERFDFRARAISGIDLGSF 82

Query: 191  DGVQPVVEAAAVLTAIIVVHESGHFLAAYLQGIRVSKFAVGFGPILAKFNANNVEYSLRA 370
            + V   +EA AVLT IIVVHESGHFLAA LQGI VSKFA+GFGPILAKF+ NNVEYSLRA
Sbjct: 83   ESV---LEAIAVLTTIIVVHESGHFLAASLQGIHVSKFAIGFGPILAKFDYNNVEYSLRA 139

Query: 371  FPLGGFVGFXXXXXXXXXXXXXXXLLKNRPIFDRLLVVSAGVIANIVFAYLIIFVQILSV 550
            FPLGGFVGF               LLKNRP  DR +VVSAG+IAN++FAY IIFVQ+LSV
Sbjct: 140  FPLGGFVGFPDNDPDSEIPIDDENLLKNRPTLDRSIVVSAGIIANVIFAYAIIFVQVLSV 199

Query: 551  GLPVQDNFPGVLVPEVRPLSAAFRAGLLPGDVILGVNGVELSKQGPSLVSEVVNIIKKNP 730
            GLPVQ+ FPGVLVPEV+  SAA R GLL GDVI+ V+G ELSK GP  VS++V+I+K+NP
Sbjct: 200  GLPVQEAFPGVLVPEVKTFSAASRYGLLSGDVIIAVDGTELSKTGPDAVSKIVDIVKRNP 259

Query: 731  KADVMLKIKRSNQDIDITVSPDENSDGTGRIGVQLSPNVKFSNVRPKDLIEAFKYTGREF 910
            K+DV+ +++R N+D DI V+PD+N DGTG+IGVQLSPNV+ + VRP+++ E F++ GREF
Sbjct: 260  KSDVLFRVERGNKDFDIRVTPDKNFDGTGKIGVQLSPNVRITKVRPRNIPETFRFVGREF 319

Query: 911  FGLTFNVLDSLKQTFFNFSQTASKVSGPVAIIAVGAEVAKSSTNGLYQFXXXXXXXXXXX 1090
             GL+ NVLD LKQTFFNFSQTASKV+GPVAIIAVGAEVA+S+ +GLYQF           
Sbjct: 320  MGLSSNVLDGLKQTFFNFSQTASKVAGPVAIIAVGAEVARSNIDGLYQFAALLNINLAVI 379

Query: 1091 XXXXXXXXDGGSLALILVEAARGGKKLPSEIEQGIMSSGIMLVTVLGLFLIVRDTLNLDF 1270
                    DGG+LALIL+EA RGGKKLP E+EQGIMSSGIMLV  LGLFLIV+DTL+LDF
Sbjct: 380  NLLPLPALDGGTLALILLEAVRGGKKLPVEVEQGIMSSGIMLVIFLGLFLIVKDTLSLDF 439

Query: 1271 IKDL 1282
            IK++
Sbjct: 440  IKEM 443


>ref|NP_563729.1| peptidase M50-like protein [Arabidopsis thaliana]
            gi|2388583|gb|AAB71464.1| Similar to Synechocystis
            hypothetical protein (gb|D90908) [Arabidopsis thaliana]
            gi|17065222|gb|AAL32765.1| Unknown protein [Arabidopsis
            thaliana] gi|332189673|gb|AEE27794.1| peptidase M50-like
            protein [Arabidopsis thaliana]
          Length = 441

 Score =  486 bits (1252), Expect = e-135
 Identities = 247/396 (62%), Positives = 307/396 (77%)
 Frame = +2

Query: 95   KNQSFHYKNRNPHQKRLKIKSFAIPGYDFSGFDGVQPVVEAAAVLTAIIVVHESGHFLAA 274
            KN++F+   RNP+ +   +  F         F  ++ V+EA+AVLTAIIVVHE+GHFLAA
Sbjct: 53   KNRAFYKNKRNPYNRTQALGRF--------DFGSLESVLEASAVLTAIIVVHETGHFLAA 104

Query: 275  YLQGIRVSKFAVGFGPILAKFNANNVEYSLRAFPLGGFVGFXXXXXXXXXXXXXXXLLKN 454
             LQGIRVSKFA+GFGPILAKFN+NNVEYSLRAFPLGGFVGF               LLKN
Sbjct: 105  SLQGIRVSKFAIGFGPILAKFNSNNVEYSLRAFPLGGFVGFPDNDPDSDIPVDDRNLLKN 164

Query: 455  RPIFDRLLVVSAGVIANIVFAYLIIFVQILSVGLPVQDNFPGVLVPEVRPLSAAFRAGLL 634
            RPI DR++VVSAG++AN++FAY IIF Q++SVGLPVQ++FPGVLVP+V+  SAA R GLL
Sbjct: 165  RPILDRVIVVSAGIVANVIFAYAIIFTQVVSVGLPVQESFPGVLVPDVKSFSAASRDGLL 224

Query: 635  PGDVILGVNGVELSKQGPSLVSEVVNIIKKNPKADVMLKIKRSNQDIDITVSPDENSDGT 814
            PGDVIL V+G ELS  G   VS+VV+++K+NP+ +V+L+I+R  +  +I ++PD++ DGT
Sbjct: 225  PGDVILAVDGTELSNSGSDSVSKVVDVVKRNPEHNVLLRIERGKESFEIRITPDKSFDGT 284

Query: 815  GRIGVQLSPNVKFSNVRPKDLIEAFKYTGREFFGLTFNVLDSLKQTFFNFSQTASKVSGP 994
            G+IGVQLSPNV+F  VRPK++ E F + GREFFGL++NVLDSLKQTF NFSQTASKV+GP
Sbjct: 285  GKIGVQLSPNVRFGKVRPKNIPETFSFAGREFFGLSYNVLDSLKQTFLNFSQTASKVAGP 344

Query: 995  VAIIAVGAEVAKSSTNGLYQFXXXXXXXXXXXXXXXXXXXDGGSLALILVEAARGGKKLP 1174
            VAIIAVGAEVA+S+ +GLYQF                   DGG+LALIL+EA RGG+KLP
Sbjct: 345  VAIIAVGAEVARSNADGLYQFAALLNLNLAVINLLPLPALDGGTLALILLEAVRGGRKLP 404

Query: 1175 SEIEQGIMSSGIMLVTVLGLFLIVRDTLNLDFIKDL 1282
             E+EQGIMSSGIMLV  LGLFLIV+DTLNLDFIK++
Sbjct: 405  LEVEQGIMSSGIMLVLFLGLFLIVKDTLNLDFIKEM 440


>ref|NP_565745.1| serine protease [Arabidopsis thaliana]
            gi|14423492|gb|AAK62428.1|AF386983_1 Unknown protein
            [Arabidopsis thaliana] gi|3298536|gb|AAC25930.1|
            expressed protein [Arabidopsis thaliana]
            gi|21553979|gb|AAM63060.1| unknown [Arabidopsis thaliana]
            gi|30387545|gb|AAP31938.1| At2g32480 [Arabidopsis
            thaliana] gi|330253597|gb|AEC08691.1| serine protease
            [Arabidopsis thaliana]
          Length = 447

 Score =  485 bits (1249), Expect = e-134
 Identities = 259/424 (61%), Positives = 310/424 (73%)
 Frame = +2

Query: 11   TKIKNSKSPIYPKRXXXXXXXXXXXXNFKNQSFHYKNRNPHQKRLKIKSFAIPGYDFSGF 190
            +K   SKS  +PK             + KN+      R P  +R   +S AI G D   F
Sbjct: 32   SKTHLSKSHFFPK------FTPLSNQSLKNRVLFGNKRYPDGERFDFRSRAISGIDLGSF 85

Query: 191  DGVQPVVEAAAVLTAIIVVHESGHFLAAYLQGIRVSKFAVGFGPILAKFNANNVEYSLRA 370
            + V   +EA AVLT IIVVHESGHFLAA LQGI VSKFA+GFGPILAKF+ NNVEYSLRA
Sbjct: 86   ESV---LEAIAVLTTIIVVHESGHFLAASLQGIHVSKFAIGFGPILAKFDYNNVEYSLRA 142

Query: 371  FPLGGFVGFXXXXXXXXXXXXXXXLLKNRPIFDRLLVVSAGVIANIVFAYLIIFVQILSV 550
            FPLGGFVGF               LLKNRP  DR +VVSAG+IAN++FAY IIFVQ+LSV
Sbjct: 143  FPLGGFVGFPDNDPDSEIPIDDENLLKNRPTLDRSIVVSAGIIANVIFAYAIIFVQVLSV 202

Query: 551  GLPVQDNFPGVLVPEVRPLSAAFRAGLLPGDVILGVNGVELSKQGPSLVSEVVNIIKKNP 730
            GLPVQ+ FPGVLVPEV+  SAA R GLL GDVIL V+G ELSK GP  VS++V+I+K+NP
Sbjct: 203  GLPVQEAFPGVLVPEVKTFSAASRDGLLSGDVILAVDGTELSKTGPDAVSKIVDIVKRNP 262

Query: 731  KADVMLKIKRSNQDIDITVSPDENSDGTGRIGVQLSPNVKFSNVRPKDLIEAFKYTGREF 910
            K++V+ +I+R  +D DI V+PD+N DGTG+IGVQLSPNV+ + VRP+++ E F++ GREF
Sbjct: 263  KSNVVFRIERGGEDFDIRVTPDKNFDGTGKIGVQLSPNVRITKVRPRNIPETFRFVGREF 322

Query: 911  FGLTFNVLDSLKQTFFNFSQTASKVSGPVAIIAVGAEVAKSSTNGLYQFXXXXXXXXXXX 1090
             GL+ NVLD LKQTFFNFSQTASKV+GPVAIIAVGAEVA+S+ +GLYQF           
Sbjct: 323  MGLSSNVLDGLKQTFFNFSQTASKVAGPVAIIAVGAEVARSNIDGLYQFAALLNINLAVI 382

Query: 1091 XXXXXXXXDGGSLALILVEAARGGKKLPSEIEQGIMSSGIMLVTVLGLFLIVRDTLNLDF 1270
                    DGG+LALIL+EA RGGKKLP E+EQGIMSSGIMLV  LGLFLIV+DTL+LDF
Sbjct: 383  NLLPLPALDGGTLALILLEAVRGGKKLPVEVEQGIMSSGIMLVIFLGLFLIVKDTLSLDF 442

Query: 1271 IKDL 1282
            IK++
Sbjct: 443  IKEM 446


Top