BLASTX nr result

ID: Angelica22_contig00007332 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00007332
         (1847 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002529666.1| Protease ecfE, putative [Ricinus communis] g...   418   e-114
ref|XP_002321129.1| predicted protein [Populus trichocarpa] gi|2...   418   e-114
ref|NP_563729.1| peptidase M50-like protein [Arabidopsis thalian...   404   e-110
gb|AAM98118.1| unknown protein [Arabidopsis thaliana]                 401   e-109
ref|NP_565745.1| serine protease [Arabidopsis thaliana] gi|14423...   400   e-109

>ref|XP_002529666.1| Protease ecfE, putative [Ricinus communis]
            gi|223530846|gb|EEF32708.1| Protease ecfE, putative
            [Ricinus communis]
          Length = 447

 Score =  418 bits (1074), Expect = e-114
 Identities = 236/390 (60%), Positives = 269/390 (68%), Gaps = 14/390 (3%)
 Frame = -1

Query: 1502 HYKNQNPHQKRLKIKSFAIPGYDFSGFDGVQPVVEAAAVLTAIIVVHESGHFLAAYLQGI 1323
            H   + P  KRL  +S+A+ G+DFS F+ V   +EAA+VLTAII+VHESGHFLAAYLQGI
Sbjct: 59   HVFGRYPLGKRLDFRSWAVSGFDFSNFESV---LEAASVLTAIIIVHESGHFLAAYLQGI 115

Query: 1322 RVSKFAVGFGPILAKFNANNVEYSLRAFPLGGFVGFXXXXXXXXXXXXXXDLLKNRPIFD 1143
             VSKFAVGFGPILAKFNA NVEYS+RAFPLGGFVGF              +LLKNRPI D
Sbjct: 116  HVSKFAVGFGPILAKFNAKNVEYSVRAFPLGGFVGFPDNDPESDIPPDDKNLLKNRPILD 175

Query: 1142 RVLVVSAGVIANIVFAYLIIFVQILSVGLPVQDNFPGVLVPEVRPLSAAFRAG------- 984
            RV+V+SAGVIANI+FAY IIFVQ+LSVGLPVQ+ FPGVLVPEVR  SAA R G       
Sbjct: 176  RVIVISAGVIANIIFAYAIIFVQVLSVGLPVQEAFPGVLVPEVRAFSAASRDGLLPGDVI 235

Query: 983  -------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVSPDENSDGTGRIGV 825
                                                         V+PDEN DGTG+IGV
Sbjct: 236  LAINGIDLPKTGPSSVSEVVDVIKRNPKRNVLLTVGRGAQALEIGVTPDENFDGTGKIGV 295

Query: 824  QLSPNVKFSNVRPKDLIEAFKYTGREFVGLTSNVLDSLKQTFFNFSQTASKVSGPVAIIA 645
            QLSPNVK + +  K+++EA  + G+EF GL+SNVLDSLKQTF NFSQ+ASKVSGPVAIIA
Sbjct: 296  QLSPNVKITKLVAKNVLEAINFAGKEFAGLSSNVLDSLKQTFLNFSQSASKVSGPVAIIA 355

Query: 644  VGAEVAKSSSNGLYQFXXXXXXXXXXXXXXXXXXLDGGSLALILIEAARGGKKLPSEIEQ 465
            VGAEVA+S+ +GLYQF                  LDGGSLALILIEAARGG+KLP EIEQ
Sbjct: 356  VGAEVARSNIDGLYQFAAVLNINLAVINLLPLPALDGGSLALILIEAARGGRKLPLEIEQ 415

Query: 464  GIMSSGIMLVTVLGLFLIVRDTLNLDFIKD 375
             IMSSGIMLV +LGLFLIVRDTLNLDFI+D
Sbjct: 416  RIMSSGIMLVILLGLFLIVRDTLNLDFIRD 445


>ref|XP_002321129.1| predicted protein [Populus trichocarpa] gi|222861902|gb|EEE99444.1|
            predicted protein [Populus trichocarpa]
          Length = 447

 Score =  418 bits (1074), Expect = e-114
 Identities = 238/398 (59%), Positives = 272/398 (68%), Gaps = 18/398 (4%)
 Frame = -1

Query: 1514 NQSFHYK----NQNPHQKRLKIKSFAIPGYDFSGFDGVQPVVEAAAVLTAIIVVHESGHF 1347
            N SFH K    ++ PH KRL ++S A+ G+D   F+ V   +EAA VLTAIIVVHESGHF
Sbjct: 51   NLSFHPKTHLFSRCPHGKRLDLRSCAVSGFDLGNFESV---LEAAGVLTAIIVVHESGHF 107

Query: 1346 LAAYLQGIRVSKFAVGFGPILAKFNANNVEYSLRAFPLGGFVGFXXXXXXXXXXXXXXDL 1167
            LAAYLQGI VSKFAVGFGP+LAKF+A NVEYSLRAFPLGGFVGF              +L
Sbjct: 108  LAAYLQGIHVSKFAVGFGPVLAKFSAKNVEYSLRAFPLGGFVGFPDNDPESDIPVDDENL 167

Query: 1166 LKNRPIFDRVLVVSAGVIANIVFAYLIIFVQILSVGLPVQDNFPGVLVPEVRPLSAAFRA 987
            LKNRPI DR +V+SAGVIANI+FAY IIFVQ+LSVGLPVQ+ FPGVLVPEVR  SAA R 
Sbjct: 168  LKNRPILDRTIVISAGVIANIIFAYAIIFVQVLSVGLPVQEAFPGVLVPEVRAFSAASRD 227

Query: 986  G--------------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVSPDENS 849
            G                                                    V+PDE+ 
Sbjct: 228  GLLPGDVILAVNGTNLPKIGPNAVSEVVGVIKSSPKKNVLLKVGRGKQDFEIGVTPDESF 287

Query: 848  DGTGRIGVQLSPNVKFSNVRPKDLIEAFKYTGREFVGLTSNVLDSLKQTFFNFSQTASKV 669
            DGTG+IGVQLSPNVK + V  K+++EAF + G+EF+GL+SNV+DSLKQTF NFSQ+ASKV
Sbjct: 288  DGTGKIGVQLSPNVKITKVVAKNILEAFNFAGKEFLGLSSNVVDSLKQTFLNFSQSASKV 347

Query: 668  SGPVAIIAVGAEVAKSSSNGLYQFXXXXXXXXXXXXXXXXXXLDGGSLALILIEAARGGK 489
            SGPVAIIAVGAEVA+S+ +GLYQF                  LDGGSLA ILIEAARGG+
Sbjct: 348  SGPVAIIAVGAEVARSNIDGLYQFAAVLNINLAVINLLPLPALDGGSLAFILIEAARGGR 407

Query: 488  KLPSEIEQGIMSSGIMLVTVLGLFLIVRDTLNLDFIKD 375
            KLP EIEQ IMSSGIMLV +LGLFLIVRDTLNLDFIKD
Sbjct: 408  KLPLEIEQRIMSSGIMLVILLGLFLIVRDTLNLDFIKD 445


>ref|NP_563729.1| peptidase M50-like protein [Arabidopsis thaliana]
            gi|2388583|gb|AAB71464.1| Similar to Synechocystis
            hypothetical protein (gb|D90908) [Arabidopsis thaliana]
            gi|17065222|gb|AAL32765.1| Unknown protein [Arabidopsis
            thaliana] gi|332189673|gb|AEE27794.1| peptidase M50-like
            protein [Arabidopsis thaliana]
          Length = 441

 Score =  404 bits (1037), Expect = e-110
 Identities = 221/395 (55%), Positives = 268/395 (67%), Gaps = 14/395 (3%)
 Frame = -1

Query: 1517 KNQSFHYKNQNPHQKRLKIKSFAIPGYDFSGFDGVQPVVEAAAVLTAIIVVHESGHFLAA 1338
            KN++F+   +NP+ +   +  F         F  ++ V+EA+AVLTAIIVVHE+GHFLAA
Sbjct: 53   KNRAFYKNKRNPYNRTQALGRF--------DFGSLESVLEASAVLTAIIVVHETGHFLAA 104

Query: 1337 YLQGIRVSKFAVGFGPILAKFNANNVEYSLRAFPLGGFVGFXXXXXXXXXXXXXXDLLKN 1158
             LQGIRVSKFA+GFGPILAKFN+NNVEYSLRAFPLGGFVGF              +LLKN
Sbjct: 105  SLQGIRVSKFAIGFGPILAKFNSNNVEYSLRAFPLGGFVGFPDNDPDSDIPVDDRNLLKN 164

Query: 1157 RPIFDRVLVVSAGVIANIVFAYLIIFVQILSVGLPVQDNFPGVLVPEVRPLSAAFRAGXX 978
            RPI DRV+VVSAG++AN++FAY IIF Q++SVGLPVQ++FPGVLVP+V+  SAA R G  
Sbjct: 165  RPILDRVIVVSAGIVANVIFAYAIIFTQVVSVGLPVQESFPGVLVPDVKSFSAASRDGLL 224

Query: 977  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX--------------VSPDENSDGT 840
                                                              ++PD++ DGT
Sbjct: 225  PGDVILAVDGTELSNSGSDSVSKVVDVVKRNPEHNVLLRIERGKESFEIRITPDKSFDGT 284

Query: 839  GRIGVQLSPNVKFSNVRPKDLIEAFKYTGREFVGLTSNVLDSLKQTFFNFSQTASKVSGP 660
            G+IGVQLSPNV+F  VRPK++ E F + GREF GL+ NVLDSLKQTF NFSQTASKV+GP
Sbjct: 285  GKIGVQLSPNVRFGKVRPKNIPETFSFAGREFFGLSYNVLDSLKQTFLNFSQTASKVAGP 344

Query: 659  VAIIAVGAEVAKSSSNGLYQFXXXXXXXXXXXXXXXXXXLDGGSLALILIEAARGGKKLP 480
            VAIIAVGAEVA+S+++GLYQF                  LDGG+LALIL+EA RGG+KLP
Sbjct: 345  VAIIAVGAEVARSNADGLYQFAALLNLNLAVINLLPLPALDGGTLALILLEAVRGGRKLP 404

Query: 479  SEIEQGIMSSGIMLVTVLGLFLIVRDTLNLDFIKD 375
             E+EQGIMSSGIMLV  LGLFLIV+DTLNLDFIK+
Sbjct: 405  LEVEQGIMSSGIMLVLFLGLFLIVKDTLNLDFIKE 439


>gb|AAM98118.1| unknown protein [Arabidopsis thaliana]
          Length = 441

 Score =  401 bits (1031), Expect = e-109
 Identities = 220/395 (55%), Positives = 267/395 (67%), Gaps = 14/395 (3%)
 Frame = -1

Query: 1517 KNQSFHYKNQNPHQKRLKIKSFAIPGYDFSGFDGVQPVVEAAAVLTAIIVVHESGHFLAA 1338
            KN++F+   +NP+ +   +  F         F  ++ V+EA+AVLTAIIVVHE+GHFLAA
Sbjct: 53   KNRAFYKNKRNPYNRTQALGRF--------DFGSLESVLEASAVLTAIIVVHETGHFLAA 104

Query: 1337 YLQGIRVSKFAVGFGPILAKFNANNVEYSLRAFPLGGFVGFXXXXXXXXXXXXXXDLLKN 1158
             LQGIRVSKFA+GFGPILAKFN+NNVEYSLRAFPLGGFVGF              +LLKN
Sbjct: 105  SLQGIRVSKFAIGFGPILAKFNSNNVEYSLRAFPLGGFVGFPDNDPDSDIPVDDRNLLKN 164

Query: 1157 RPIFDRVLVVSAGVIANIVFAYLIIFVQILSVGLPVQDNFPGVLVPEVRPLSAAFRAGXX 978
            RPI DRV+VVSAG++AN++FAY II  Q++SVGLPVQ++FPGVLVP+V+  SAA R G  
Sbjct: 165  RPILDRVIVVSAGIVANVIFAYAIILTQVVSVGLPVQESFPGVLVPDVKSFSAASRDGLL 224

Query: 977  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX--------------VSPDENSDGT 840
                                                              ++PD++ DGT
Sbjct: 225  PGDVILAVDGTELSNSGSDSVSKVVDVVKRNPEHNVLLRIERGKESFEIRITPDKSFDGT 284

Query: 839  GRIGVQLSPNVKFSNVRPKDLIEAFKYTGREFVGLTSNVLDSLKQTFFNFSQTASKVSGP 660
            G+IGVQLSPNV+F  VRPK++ E F + GREF GL+ NVLDSLKQTF NFSQTASKV+GP
Sbjct: 285  GKIGVQLSPNVRFGKVRPKNIPETFSFAGREFFGLSYNVLDSLKQTFLNFSQTASKVAGP 344

Query: 659  VAIIAVGAEVAKSSSNGLYQFXXXXXXXXXXXXXXXXXXLDGGSLALILIEAARGGKKLP 480
            VAIIAVGAEVA+S+++GLYQF                  LDGG+LALIL+EA RGG+KLP
Sbjct: 345  VAIIAVGAEVARSNADGLYQFAALLNLNLAVINLLPLPALDGGTLALILLEAVRGGRKLP 404

Query: 479  SEIEQGIMSSGIMLVTVLGLFLIVRDTLNLDFIKD 375
             E+EQGIMSSGIMLV  LGLFLIV+DTLNLDFIK+
Sbjct: 405  LEVEQGIMSSGIMLVLFLGLFLIVKDTLNLDFIKE 439


>ref|NP_565745.1| serine protease [Arabidopsis thaliana]
            gi|14423492|gb|AAK62428.1|AF386983_1 Unknown protein
            [Arabidopsis thaliana] gi|3298536|gb|AAC25930.1|
            expressed protein [Arabidopsis thaliana]
            gi|21553979|gb|AAM63060.1| unknown [Arabidopsis thaliana]
            gi|30387545|gb|AAP31938.1| At2g32480 [Arabidopsis
            thaliana] gi|330253597|gb|AEC08691.1| serine protease
            [Arabidopsis thaliana]
          Length = 447

 Score =  400 bits (1029), Expect = e-109
 Identities = 225/399 (56%), Positives = 265/399 (66%), Gaps = 14/399 (3%)
 Frame = -1

Query: 1529 SSNFKNQSFHYKNQNPHQKRLKIKSFAIPGYDFSGFDGVQPVVEAAAVLTAIIVVHESGH 1350
            + + KN+      + P  +R   +S AI G D   F+ V   +EA AVLT IIVVHESGH
Sbjct: 50   NQSLKNRVLFGNKRYPDGERFDFRSRAISGIDLGSFESV---LEAIAVLTTIIVVHESGH 106

Query: 1349 FLAAYLQGIRVSKFAVGFGPILAKFNANNVEYSLRAFPLGGFVGFXXXXXXXXXXXXXXD 1170
            FLAA LQGI VSKFA+GFGPILAKF+ NNVEYSLRAFPLGGFVGF              +
Sbjct: 107  FLAASLQGIHVSKFAIGFGPILAKFDYNNVEYSLRAFPLGGFVGFPDNDPDSEIPIDDEN 166

Query: 1169 LLKNRPIFDRVLVVSAGVIANIVFAYLIIFVQILSVGLPVQDNFPGVLVPEVRPLSAAFR 990
            LLKNRP  DR +VVSAG+IAN++FAY IIFVQ+LSVGLPVQ+ FPGVLVPEV+  SAA R
Sbjct: 167  LLKNRPTLDRSIVVSAGIIANVIFAYAIIFVQVLSVGLPVQEAFPGVLVPEVKTFSAASR 226

Query: 989  AGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXV--------------SPDEN 852
             G                                      V              +PD+N
Sbjct: 227  DGLLSGDVILAVDGTELSKTGPDAVSKIVDIVKRNPKSNVVFRIERGGEDFDIRVTPDKN 286

Query: 851  SDGTGRIGVQLSPNVKFSNVRPKDLIEAFKYTGREFVGLTSNVLDSLKQTFFNFSQTASK 672
             DGTG+IGVQLSPNV+ + VRP+++ E F++ GREF+GL+SNVLD LKQTFFNFSQTASK
Sbjct: 287  FDGTGKIGVQLSPNVRITKVRPRNIPETFRFVGREFMGLSSNVLDGLKQTFFNFSQTASK 346

Query: 671  VSGPVAIIAVGAEVAKSSSNGLYQFXXXXXXXXXXXXXXXXXXLDGGSLALILIEAARGG 492
            V+GPVAIIAVGAEVA+S+ +GLYQF                  LDGG+LALIL+EA RGG
Sbjct: 347  VAGPVAIIAVGAEVARSNIDGLYQFAALLNINLAVINLLPLPALDGGTLALILLEAVRGG 406

Query: 491  KKLPSEIEQGIMSSGIMLVTVLGLFLIVRDTLNLDFIKD 375
            KKLP E+EQGIMSSGIMLV  LGLFLIV+DTL+LDFIK+
Sbjct: 407  KKLPVEVEQGIMSSGIMLVIFLGLFLIVKDTLSLDFIKE 445


Top