BLASTX nr result
ID: Angelica22_contig00007332
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica22_contig00007332 (1847 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002529666.1| Protease ecfE, putative [Ricinus communis] g... 418 e-114 ref|XP_002321129.1| predicted protein [Populus trichocarpa] gi|2... 418 e-114 ref|NP_563729.1| peptidase M50-like protein [Arabidopsis thalian... 404 e-110 gb|AAM98118.1| unknown protein [Arabidopsis thaliana] 401 e-109 ref|NP_565745.1| serine protease [Arabidopsis thaliana] gi|14423... 400 e-109 >ref|XP_002529666.1| Protease ecfE, putative [Ricinus communis] gi|223530846|gb|EEF32708.1| Protease ecfE, putative [Ricinus communis] Length = 447 Score = 418 bits (1074), Expect = e-114 Identities = 236/390 (60%), Positives = 269/390 (68%), Gaps = 14/390 (3%) Frame = -1 Query: 1502 HYKNQNPHQKRLKIKSFAIPGYDFSGFDGVQPVVEAAAVLTAIIVVHESGHFLAAYLQGI 1323 H + P KRL +S+A+ G+DFS F+ V +EAA+VLTAII+VHESGHFLAAYLQGI Sbjct: 59 HVFGRYPLGKRLDFRSWAVSGFDFSNFESV---LEAASVLTAIIIVHESGHFLAAYLQGI 115 Query: 1322 RVSKFAVGFGPILAKFNANNVEYSLRAFPLGGFVGFXXXXXXXXXXXXXXDLLKNRPIFD 1143 VSKFAVGFGPILAKFNA NVEYS+RAFPLGGFVGF +LLKNRPI D Sbjct: 116 HVSKFAVGFGPILAKFNAKNVEYSVRAFPLGGFVGFPDNDPESDIPPDDKNLLKNRPILD 175 Query: 1142 RVLVVSAGVIANIVFAYLIIFVQILSVGLPVQDNFPGVLVPEVRPLSAAFRAG------- 984 RV+V+SAGVIANI+FAY IIFVQ+LSVGLPVQ+ FPGVLVPEVR SAA R G Sbjct: 176 RVIVISAGVIANIIFAYAIIFVQVLSVGLPVQEAFPGVLVPEVRAFSAASRDGLLPGDVI 235 Query: 983 -------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVSPDENSDGTGRIGV 825 V+PDEN DGTG+IGV Sbjct: 236 LAINGIDLPKTGPSSVSEVVDVIKRNPKRNVLLTVGRGAQALEIGVTPDENFDGTGKIGV 295 Query: 824 
QLSPNVKFSNVRPKDLIEAFKYTGREFVGLTSNVLDSLKQTFFNFSQTASKVSGPVAIIA 645 QLSPNVK + + K+++EA + G+EF GL+SNVLDSLKQTF NFSQ+ASKVSGPVAIIA Sbjct: 296 QLSPNVKITKLVAKNVLEAINFAGKEFAGLSSNVLDSLKQTFLNFSQSASKVSGPVAIIA 355 Query: 644 VGAEVAKSSSNGLYQFXXXXXXXXXXXXXXXXXXLDGGSLALILIEAARGGKKLPSEIEQ 465 VGAEVA+S+ +GLYQF LDGGSLALILIEAARGG+KLP EIEQ Sbjct: 356 VGAEVARSNIDGLYQFAAVLNINLAVINLLPLPALDGGSLALILIEAARGGRKLPLEIEQ 415 Query: 464 GIMSSGIMLVTVLGLFLIVRDTLNLDFIKD 375 IMSSGIMLV +LGLFLIVRDTLNLDFI+D Sbjct: 416 RIMSSGIMLVILLGLFLIVRDTLNLDFIRD 445 >ref|XP_002321129.1| predicted protein [Populus trichocarpa] gi|222861902|gb|EEE99444.1| predicted protein [Populus trichocarpa] Length = 447 Score = 418 bits (1074), Expect = e-114 Identities = 238/398 (59%), Positives = 272/398 (68%), Gaps = 18/398 (4%) Frame = -1 Query: 1514 NQSFHYK----NQNPHQKRLKIKSFAIPGYDFSGFDGVQPVVEAAAVLTAIIVVHESGHF 1347 N SFH K ++ PH KRL ++S A+ G+D F+ V +EAA VLTAIIVVHESGHF Sbjct: 51 NLSFHPKTHLFSRCPHGKRLDLRSCAVSGFDLGNFESV---LEAAGVLTAIIVVHESGHF 107 Query: 1346 LAAYLQGIRVSKFAVGFGPILAKFNANNVEYSLRAFPLGGFVGFXXXXXXXXXXXXXXDL 1167 LAAYLQGI VSKFAVGFGP+LAKF+A NVEYSLRAFPLGGFVGF +L Sbjct: 108 LAAYLQGIHVSKFAVGFGPVLAKFSAKNVEYSLRAFPLGGFVGFPDNDPESDIPVDDENL 167 Query: 1166 LKNRPIFDRVLVVSAGVIANIVFAYLIIFVQILSVGLPVQDNFPGVLVPEVRPLSAAFRA 987 LKNRPI DR +V+SAGVIANI+FAY IIFVQ+LSVGLPVQ+ FPGVLVPEVR SAA R Sbjct: 168 LKNRPILDRTIVISAGVIANIIFAYAIIFVQVLSVGLPVQEAFPGVLVPEVRAFSAASRD 227 Query: 986 G--------------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVSPDENS 849 G V+PDE+ Sbjct: 228 GLLPGDVILAVNGTNLPKIGPNAVSEVVGVIKSSPKKNVLLKVGRGKQDFEIGVTPDESF 287 Query: 848 DGTGRIGVQLSPNVKFSNVRPKDLIEAFKYTGREFVGLTSNVLDSLKQTFFNFSQTASKV 669 DGTG+IGVQLSPNVK + V K+++EAF + G+EF+GL+SNV+DSLKQTF NFSQ+ASKV Sbjct: 288 DGTGKIGVQLSPNVKITKVVAKNILEAFNFAGKEFLGLSSNVVDSLKQTFLNFSQSASKV 347 Query: 668 SGPVAIIAVGAEVAKSSSNGLYQFXXXXXXXXXXXXXXXXXXLDGGSLALILIEAARGGK 489 SGPVAIIAVGAEVA+S+ +GLYQF LDGGSLA ILIEAARGG+ Sbjct: 348 SGPVAIIAVGAEVARSNIDGLYQFAAVLNINLAVINLLPLPALDGGSLAFILIEAARGGR 407 Query: 488 
KLPSEIEQGIMSSGIMLVTVLGLFLIVRDTLNLDFIKD 375 KLP EIEQ IMSSGIMLV +LGLFLIVRDTLNLDFIKD Sbjct: 408 KLPLEIEQRIMSSGIMLVILLGLFLIVRDTLNLDFIKD 445 >ref|NP_563729.1| peptidase M50-like protein [Arabidopsis thaliana] gi|2388583|gb|AAB71464.1| Similar to Synechocystis hypothetical protein (gb|D90908) [Arabidopsis thaliana] gi|17065222|gb|AAL32765.1| Unknown protein [Arabidopsis thaliana] gi|332189673|gb|AEE27794.1| peptidase M50-like protein [Arabidopsis thaliana] Length = 441 Score = 404 bits (1037), Expect = e-110 Identities = 221/395 (55%), Positives = 268/395 (67%), Gaps = 14/395 (3%) Frame = -1 Query: 1517 KNQSFHYKNQNPHQKRLKIKSFAIPGYDFSGFDGVQPVVEAAAVLTAIIVVHESGHFLAA 1338 KN++F+ +NP+ + + F F ++ V+EA+AVLTAIIVVHE+GHFLAA Sbjct: 53 KNRAFYKNKRNPYNRTQALGRF--------DFGSLESVLEASAVLTAIIVVHETGHFLAA 104 Query: 1337 YLQGIRVSKFAVGFGPILAKFNANNVEYSLRAFPLGGFVGFXXXXXXXXXXXXXXDLLKN 1158 LQGIRVSKFA+GFGPILAKFN+NNVEYSLRAFPLGGFVGF +LLKN Sbjct: 105 SLQGIRVSKFAIGFGPILAKFNSNNVEYSLRAFPLGGFVGFPDNDPDSDIPVDDRNLLKN 164 Query: 1157 RPIFDRVLVVSAGVIANIVFAYLIIFVQILSVGLPVQDNFPGVLVPEVRPLSAAFRAGXX 978 RPI DRV+VVSAG++AN++FAY IIF Q++SVGLPVQ++FPGVLVP+V+ SAA R G Sbjct: 165 RPILDRVIVVSAGIVANVIFAYAIIFTQVVSVGLPVQESFPGVLVPDVKSFSAASRDGLL 224 Query: 977 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX--------------VSPDENSDGT 840 ++PD++ DGT Sbjct: 225 PGDVILAVDGTELSNSGSDSVSKVVDVVKRNPEHNVLLRIERGKESFEIRITPDKSFDGT 284 Query: 839 GRIGVQLSPNVKFSNVRPKDLIEAFKYTGREFVGLTSNVLDSLKQTFFNFSQTASKVSGP 660 G+IGVQLSPNV+F VRPK++ E F + GREF GL+ NVLDSLKQTF NFSQTASKV+GP Sbjct: 285 GKIGVQLSPNVRFGKVRPKNIPETFSFAGREFFGLSYNVLDSLKQTFLNFSQTASKVAGP 344 Query: 659 VAIIAVGAEVAKSSSNGLYQFXXXXXXXXXXXXXXXXXXLDGGSLALILIEAARGGKKLP 480 VAIIAVGAEVA+S+++GLYQF LDGG+LALIL+EA RGG+KLP Sbjct: 345 VAIIAVGAEVARSNADGLYQFAALLNLNLAVINLLPLPALDGGTLALILLEAVRGGRKLP 404 Query: 479 SEIEQGIMSSGIMLVTVLGLFLIVRDTLNLDFIKD 375 E+EQGIMSSGIMLV LGLFLIV+DTLNLDFIK+ Sbjct: 405 LEVEQGIMSSGIMLVLFLGLFLIVKDTLNLDFIKE 439 >gb|AAM98118.1| unknown protein [Arabidopsis thaliana] Length = 441 Score = 401 bits 
(1031), Expect = e-109 Identities = 220/395 (55%), Positives = 267/395 (67%), Gaps = 14/395 (3%) Frame = -1 Query: 1517 KNQSFHYKNQNPHQKRLKIKSFAIPGYDFSGFDGVQPVVEAAAVLTAIIVVHESGHFLAA 1338 KN++F+ +NP+ + + F F ++ V+EA+AVLTAIIVVHE+GHFLAA Sbjct: 53 KNRAFYKNKRNPYNRTQALGRF--------DFGSLESVLEASAVLTAIIVVHETGHFLAA 104 Query: 1337 YLQGIRVSKFAVGFGPILAKFNANNVEYSLRAFPLGGFVGFXXXXXXXXXXXXXXDLLKN 1158 LQGIRVSKFA+GFGPILAKFN+NNVEYSLRAFPLGGFVGF +LLKN Sbjct: 105 SLQGIRVSKFAIGFGPILAKFNSNNVEYSLRAFPLGGFVGFPDNDPDSDIPVDDRNLLKN 164 Query: 1157 RPIFDRVLVVSAGVIANIVFAYLIIFVQILSVGLPVQDNFPGVLVPEVRPLSAAFRAGXX 978 RPI DRV+VVSAG++AN++FAY II Q++SVGLPVQ++FPGVLVP+V+ SAA R G Sbjct: 165 RPILDRVIVVSAGIVANVIFAYAIILTQVVSVGLPVQESFPGVLVPDVKSFSAASRDGLL 224 Query: 977 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX--------------VSPDENSDGT 840 ++PD++ DGT Sbjct: 225 PGDVILAVDGTELSNSGSDSVSKVVDVVKRNPEHNVLLRIERGKESFEIRITPDKSFDGT 284 Query: 839 GRIGVQLSPNVKFSNVRPKDLIEAFKYTGREFVGLTSNVLDSLKQTFFNFSQTASKVSGP 660 G+IGVQLSPNV+F VRPK++ E F + GREF GL+ NVLDSLKQTF NFSQTASKV+GP Sbjct: 285 GKIGVQLSPNVRFGKVRPKNIPETFSFAGREFFGLSYNVLDSLKQTFLNFSQTASKVAGP 344 Query: 659 VAIIAVGAEVAKSSSNGLYQFXXXXXXXXXXXXXXXXXXLDGGSLALILIEAARGGKKLP 480 VAIIAVGAEVA+S+++GLYQF LDGG+LALIL+EA RGG+KLP Sbjct: 345 VAIIAVGAEVARSNADGLYQFAALLNLNLAVINLLPLPALDGGTLALILLEAVRGGRKLP 404 Query: 479 SEIEQGIMSSGIMLVTVLGLFLIVRDTLNLDFIKD 375 E+EQGIMSSGIMLV LGLFLIV+DTLNLDFIK+ Sbjct: 405 LEVEQGIMSSGIMLVLFLGLFLIVKDTLNLDFIKE 439 >ref|NP_565745.1| serine protease [Arabidopsis thaliana] gi|14423492|gb|AAK62428.1|AF386983_1 Unknown protein [Arabidopsis thaliana] gi|3298536|gb|AAC25930.1| expressed protein [Arabidopsis thaliana] gi|21553979|gb|AAM63060.1| unknown [Arabidopsis thaliana] gi|30387545|gb|AAP31938.1| At2g32480 [Arabidopsis thaliana] gi|330253597|gb|AEC08691.1| serine protease [Arabidopsis thaliana] Length = 447 Score = 400 bits (1029), Expect = e-109 Identities = 225/399 (56%), Positives = 265/399 (66%), Gaps = 14/399 (3%) Frame = -1 Query: 1529 
SSNFKNQSFHYKNQNPHQKRLKIKSFAIPGYDFSGFDGVQPVVEAAAVLTAIIVVHESGH 1350 + + KN+ + P +R +S AI G D F+ V +EA AVLT IIVVHESGH Sbjct: 50 NQSLKNRVLFGNKRYPDGERFDFRSRAISGIDLGSFESV---LEAIAVLTTIIVVHESGH 106 Query: 1349 FLAAYLQGIRVSKFAVGFGPILAKFNANNVEYSLRAFPLGGFVGFXXXXXXXXXXXXXXD 1170 FLAA LQGI VSKFA+GFGPILAKF+ NNVEYSLRAFPLGGFVGF + Sbjct: 107 FLAASLQGIHVSKFAIGFGPILAKFDYNNVEYSLRAFPLGGFVGFPDNDPDSEIPIDDEN 166 Query: 1169 LLKNRPIFDRVLVVSAGVIANIVFAYLIIFVQILSVGLPVQDNFPGVLVPEVRPLSAAFR 990 LLKNRP DR +VVSAG+IAN++FAY IIFVQ+LSVGLPVQ+ FPGVLVPEV+ SAA R Sbjct: 167 LLKNRPTLDRSIVVSAGIIANVIFAYAIIFVQVLSVGLPVQEAFPGVLVPEVKTFSAASR 226 Query: 989 AGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXV--------------SPDEN 852 G V +PD+N Sbjct: 227 DGLLSGDVILAVDGTELSKTGPDAVSKIVDIVKRNPKSNVVFRIERGGEDFDIRVTPDKN 286 Query: 851 SDGTGRIGVQLSPNVKFSNVRPKDLIEAFKYTGREFVGLTSNVLDSLKQTFFNFSQTASK 672 DGTG+IGVQLSPNV+ + VRP+++ E F++ GREF+GL+SNVLD LKQTFFNFSQTASK Sbjct: 287 FDGTGKIGVQLSPNVRITKVRPRNIPETFRFVGREFMGLSSNVLDGLKQTFFNFSQTASK 346 Query: 671 VSGPVAIIAVGAEVAKSSSNGLYQFXXXXXXXXXXXXXXXXXXLDGGSLALILIEAARGG 492 V+GPVAIIAVGAEVA+S+ +GLYQF LDGG+LALIL+EA RGG Sbjct: 347 VAGPVAIIAVGAEVARSNIDGLYQFAALLNINLAVINLLPLPALDGGTLALILLEAVRGG 406 Query: 491 KKLPSEIEQGIMSSGIMLVTVLGLFLIVRDTLNLDFIKD 375 KKLP E+EQGIMSSGIMLV LGLFLIV+DTL+LDFIK+ Sbjct: 407 KKLPVEVEQGIMSSGIMLVIFLGLFLIVKDTLSLDFIKE 445
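The per-hit statistics in the report above (Score, Expect, Identities, Positives, Gaps, Frame) can be pulled out of the text with a regular expression. The following is a minimal sketch, not part of the original record. The names STATS, parse_evalue, and parse_hsp_stats are hypothetical helpers, and the pattern assumes the legacy BLAST text layout seen here, including E-values printed without a leading digit (e-114 is read as 1e-114).

```python
import re

# Matches the statistics block that legacy BLAST text reports print per alignment,
# e.g. "Score = 418 bits (1074), Expect = e-114 Identities = 236/390 (60%), ..."
# \s* is used between tokens so the pattern survives arbitrary line wrapping.
STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\))?"
    r"\s*Frame\s*=\s*(?P<frame>[+-]?\d)"
)


def parse_evalue(text):
    """Convert a BLAST E-value string to float; 'e-114' is treated as '1e-114'."""
    text = text.rstrip(",")
    if text.lower().startswith("e"):
        text = "1" + text
    return float(text)


def parse_hsp_stats(report):
    """Return one dict per statistics block found in a BLAST text report."""
    hits = []
    for m in STATS.finditer(report):
        ident = int(m.group("ident"))
        length = int(m.group("length"))
        hits.append({
            "bits": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "evalue": parse_evalue(m.group("evalue")),
            "identities": ident,
            "positives": int(m.group("pos")),
            "gaps": int(m.group("gaps")) if m.group("gaps") else 0,
            "align_length": length,
            # Unrounded percent identity; the report itself shows a whole-number percentage.
            "pct_identity": 100.0 * ident / length,
            "frame": int(m.group("frame")),
        })
    return hits


if __name__ == "__main__":
    sample = (
        "Length = 447 Score = 418 bits (1074), Expect = e-114 "
        "Identities = 236/390 (60%), Positives = 269/390 (68%), "
        "Gaps = 14/390 (3%) Frame = -1"
    )
    for hit in parse_hsp_stats(sample):
        print(hit)
```

Applied to the full report text, the function should yield one entry per alignment (five here), each with Frame = -1, which matches the descending query coordinates in the alignments.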