BLASTX nr result

ID: Atractylodes22_contig00011210 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes22_contig00011210
         (1621 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002529666.1| Protease ecfE, putative [Ricinus communis] g...   473   e-131
ref|XP_002321129.1| predicted protein [Populus trichocarpa] gi|2...   462   e-127
ref|XP_002301619.1| predicted protein [Populus trichocarpa] gi|2...   459   e-127
ref|NP_565745.1| serine protease [Arabidopsis thaliana] gi|14423...   454   e-125
ref|XP_002881228.1| hypothetical protein ARALYDRAFT_482175 [Arab...   450   e-124

>ref|XP_002529666.1| Protease ecfE, putative [Ricinus communis]
            gi|223530846|gb|EEF32708.1| Protease ecfE, putative
            [Ricinus communis]
          Length = 447

 Score =  473 bits (1217), Expect = e-131
 Identities = 259/397 (65%), Positives = 296/397 (74%)
 Frame = -1

Query: 1447 FPLRIKEKFKTSALADFGFGGFESAQSVLEAVSVLTAIIVVHESGHFLAAYLQGIHVSKF 1268
            +PL  +  F++ A++ F F  FES   VLEA SVLTAII+VHESGHFLAAYLQGIHVSKF
Sbjct: 64   YPLGKRLDFRSWAVSGFDFSNFES---VLEAASVLTAIIIVHESGHFLAAYLQGIHVSKF 120

Query: 1267 AVGFGPILAKFNANNVEYSIRAFPLGGFVGFXXXXXXXXXXXXDENLLKNRPIPDRILVI 1088
            AVGFGPILAKFNA NVEYS+RAFPLGGFVGF            D+NLLKNRPI DR++VI
Sbjct: 121  AVGFGPILAKFNAKNVEYSVRAFPLGGFVGFPDNDPESDIPPDDKNLLKNRPILDRVIVI 180

Query: 1087 SAGVIANIVFAYVIIFAQIVFVGLPVQEAFPGVIVPEVRPFSAASRDGLLAGDVILSVND 908
            SAGVIANI+FAY IIF Q++ VGLPVQEAFPGV+VPEVR FSAASRDGLL GDVIL++N 
Sbjct: 181  SAGVIANIIFAYAIIFVQVLSVGLPVQEAFPGVLVPEVRAFSAASRDGLLPGDVILAING 240

Query: 907  IELPKTVPNSVSQVVDVIKKNPKSTVLFKVGRGGKDFYIKVTPDQNTDGSGRIGVQLSPN 728
            I+LPKT P+SVS+VVDVIK+NPK  VL  VGRG +   I VTPD+N DG+G+IGVQLSPN
Sbjct: 241  IDLPKTGPSSVSEVVDVIKRNPKRNVLLTVGRGAQALEIGVTPDENFDGTGKIGVQLSPN 300

Query: 727  VKVLKEKPKDVLEAFSFTGREFWGLTSNVLDSLKQTFLNFSQSASKVAXXXXXXXXXXXX 548
            VK+ K   K+VLEA +F G+EF GL+SNVLDSLKQTFLNFSQSASKV+            
Sbjct: 301  VKITKLVAKNVLEAINFAGKEFAGLSSNVLDSLKQTFLNFSQSASKVS----------GP 350

Query: 547  XXXXXXXXXXXXXXXXGLYQFXXXXXXXXXXXXXXXXXXLDGGSLALILVEAVRGGRKLP 368
                            GLYQF                  LDGGSLALIL+EA RGGRKLP
Sbjct: 351  VAIIAVGAEVARSNIDGLYQFAAVLNINLAVINLLPLPALDGGSLALILIEAARGGRKLP 410

Query: 367  LEVEQGIMSSGITLVFVLGLFLIIRDTLNLDFIKDLL 257
            LE+EQ IMSSGI LV +LGLFLI+RDTLNLDFI+D+L
Sbjct: 411  LEIEQRIMSSGIMLVILLGLFLIVRDTLNLDFIRDML 447


>ref|XP_002321129.1| predicted protein [Populus trichocarpa] gi|222861902|gb|EEE99444.1|
            predicted protein [Populus trichocarpa]
          Length = 447

 Score =  462 bits (1189), Expect = e-127
 Identities = 253/388 (65%), Positives = 287/388 (73%)
 Frame = -1

Query: 1420 KTSALADFGFGGFESAQSVLEAVSVLTAIIVVHESGHFLAAYLQGIHVSKFAVGFGPILA 1241
            ++ A++ F  G FES   VLEA  VLTAIIVVHESGHFLAAYLQGIHVSKFAVGFGP+LA
Sbjct: 73   RSCAVSGFDLGNFES---VLEAAGVLTAIIVVHESGHFLAAYLQGIHVSKFAVGFGPVLA 129

Query: 1240 KFNANNVEYSIRAFPLGGFVGFXXXXXXXXXXXXDENLLKNRPIPDRILVISAGVIANIV 1061
            KF+A NVEYS+RAFPLGGFVGF            DENLLKNRPI DR +VISAGVIANI+
Sbjct: 130  KFSAKNVEYSLRAFPLGGFVGFPDNDPESDIPVDDENLLKNRPILDRTIVISAGVIANII 189

Query: 1060 FAYVIIFAQIVFVGLPVQEAFPGVIVPEVRPFSAASRDGLLAGDVILSVNDIELPKTVPN 881
            FAY IIF Q++ VGLPVQEAFPGV+VPEVR FSAASRDGLL GDVIL+VN   LPK  PN
Sbjct: 190  FAYAIIFVQVLSVGLPVQEAFPGVLVPEVRAFSAASRDGLLPGDVILAVNGTNLPKIGPN 249

Query: 880  SVSQVVDVIKKNPKSTVLFKVGRGGKDFYIKVTPDQNTDGSGRIGVQLSPNVKVLKEKPK 701
            +VS+VV VIK +PK  VL KVGRG +DF I VTPD++ DG+G+IGVQLSPNVK+ K   K
Sbjct: 250  AVSEVVGVIKSSPKKNVLLKVGRGKQDFEIGVTPDESFDGTGKIGVQLSPNVKITKVVAK 309

Query: 700  DVLEAFSFTGREFWGLTSNVLDSLKQTFLNFSQSASKVAXXXXXXXXXXXXXXXXXXXXX 521
            ++LEAF+F G+EF GL+SNV+DSLKQTFLNFSQSASKV+                     
Sbjct: 310  NILEAFNFAGKEFLGLSSNVVDSLKQTFLNFSQSASKVS----------GPVAIIAVGAE 359

Query: 520  XXXXXXXGLYQFXXXXXXXXXXXXXXXXXXLDGGSLALILVEAVRGGRKLPLEVEQGIMS 341
                   GLYQF                  LDGGSLA IL+EA RGGRKLPLE+EQ IMS
Sbjct: 360  VARSNIDGLYQFAAVLNINLAVINLLPLPALDGGSLAFILIEAARGGRKLPLEIEQRIMS 419

Query: 340  SGITLVFVLGLFLIIRDTLNLDFIKDLL 257
            SGI LV +LGLFLI+RDTLNLDFIKD+L
Sbjct: 420  SGIMLVILLGLFLIVRDTLNLDFIKDML 447


>ref|XP_002301619.1| predicted protein [Populus trichocarpa] gi|222843345|gb|EEE80892.1|
            predicted protein [Populus trichocarpa]
          Length = 449

 Score =  459 bits (1182), Expect = e-127
 Identities = 254/396 (64%), Positives = 286/396 (72%)
 Frame = -1

Query: 1444 PLRIKEKFKTSALADFGFGGFESAQSVLEAVSVLTAIIVVHESGHFLAAYLQGIHVSKFA 1265
            PL  +  F++ A++ F  G FES   VLEAV VLTAIIVVHE GHFLAAYLQGIHVSKFA
Sbjct: 67   PLGKRLDFRSWAVSGFDLGNFES---VLEAVGVLTAIIVVHEGGHFLAAYLQGIHVSKFA 123

Query: 1264 VGFGPILAKFNANNVEYSIRAFPLGGFVGFXXXXXXXXXXXXDENLLKNRPIPDRILVIS 1085
            VGFGPILAKFNA NVEYSIRAFPLGGFVGF            DENLLKNRPI DR +VIS
Sbjct: 124  VGFGPILAKFNARNVEYSIRAFPLGGFVGFPDNDPESDIPVDDENLLKNRPILDRTIVIS 183

Query: 1084 AGVIANIVFAYVIIFAQIVFVGLPVQEAFPGVIVPEVRPFSAASRDGLLAGDVILSVNDI 905
            AGVIANI+FAY II AQ++ VGLPVQEAFPGV+VPEV+ FSAASRDGLL GDVIL+VN  
Sbjct: 184  AGVIANIIFAYAIILAQVLSVGLPVQEAFPGVLVPEVQAFSAASRDGLLPGDVILAVNGT 243

Query: 904  ELPKTVPNSVSQVVDVIKKNPKSTVLFKVGRGGKDFYIKVTPDQNTDGSGRIGVQLSPNV 725
             LPKT PN+VS+VVDVIK +P   VL KV RG ++F I VTPD++ DG+G+IGVQLS NV
Sbjct: 244  NLPKTGPNAVSEVVDVIKSSPNKNVLLKVERGEQNFEIGVTPDESFDGTGKIGVQLSNNV 303

Query: 724  KVLKEKPKDVLEAFSFTGREFWGLTSNVLDSLKQTFLNFSQSASKVAXXXXXXXXXXXXX 545
            K+ K   K++ EAF+F G EFWGL+SNV+DSLKQTF NFSQSASKV+             
Sbjct: 304  KITKAIAKNIFEAFNFAGEEFWGLSSNVVDSLKQTFSNFSQSASKVS----------GPV 353

Query: 544  XXXXXXXXXXXXXXXGLYQFXXXXXXXXXXXXXXXXXXLDGGSLALILVEAVRGGRKLPL 365
                           GLYQF                  LDGGSLA IL+EA RGGRKLPL
Sbjct: 354  AIIAVGAEVARSNIDGLYQFAAVLNINLAVINLLPLPALDGGSLAFILIEAARGGRKLPL 413

Query: 364  EVEQGIMSSGITLVFVLGLFLIIRDTLNLDFIKDLL 257
            E+EQ IMSSGI LV  LG FLI+RDTLNLDFIKD+L
Sbjct: 414  EIEQRIMSSGIVLVITLGFFLIVRDTLNLDFIKDML 449


>ref|NP_565745.1| serine protease [Arabidopsis thaliana]
            gi|14423492|gb|AAK62428.1|AF386983_1 Unknown protein
            [Arabidopsis thaliana] gi|3298536|gb|AAC25930.1|
            expressed protein [Arabidopsis thaliana]
            gi|21553979|gb|AAM63060.1| unknown [Arabidopsis thaliana]
            gi|30387545|gb|AAP31938.1| At2g32480 [Arabidopsis
            thaliana] gi|330253597|gb|AEC08691.1| serine protease
            [Arabidopsis thaliana]
          Length = 447

 Score =  454 bits (1168), Expect = e-125
 Identities = 239/389 (61%), Positives = 292/389 (75%)
 Frame = -1

Query: 1423 FKTSALADFGFGGFESAQSVLEAVSVLTAIIVVHESGHFLAAYLQGIHVSKFAVGFGPIL 1244
            F++ A++    G FES   VLEA++VLT IIVVHESGHFLAA LQGIHVSKFA+GFGPIL
Sbjct: 72   FRSRAISGIDLGSFES---VLEAIAVLTTIIVVHESGHFLAASLQGIHVSKFAIGFGPIL 128

Query: 1243 AKFNANNVEYSIRAFPLGGFVGFXXXXXXXXXXXXDENLLKNRPIPDRILVISAGVIANI 1064
            AKF+ NNVEYS+RAFPLGGFVGF            DENLLKNRP  DR +V+SAG+IAN+
Sbjct: 129  AKFDYNNVEYSLRAFPLGGFVGFPDNDPDSEIPIDDENLLKNRPTLDRSIVVSAGIIANV 188

Query: 1063 VFAYVIIFAQIVFVGLPVQEAFPGVIVPEVRPFSAASRDGLLAGDVILSVNDIELPKTVP 884
            +FAY IIF Q++ VGLPVQEAFPGV+VPEV+ FSAASRDGLL+GDVIL+V+  EL KT P
Sbjct: 189  IFAYAIIFVQVLSVGLPVQEAFPGVLVPEVKTFSAASRDGLLSGDVILAVDGTELSKTGP 248

Query: 883  NSVSQVVDVIKKNPKSTVLFKVGRGGKDFYIKVTPDQNTDGSGRIGVQLSPNVKVLKEKP 704
            ++VS++VD++K+NPKS V+F++ RGG+DF I+VTPD+N DG+G+IGVQLSPNV++ K +P
Sbjct: 249  DAVSKIVDIVKRNPKSNVVFRIERGGEDFDIRVTPDKNFDGTGKIGVQLSPNVRITKVRP 308

Query: 703  KDVLEAFSFTGREFWGLTSNVLDSLKQTFLNFSQSASKVAXXXXXXXXXXXXXXXXXXXX 524
            +++ E F F GREF GL+SNVLD LKQTF NFSQ+ASKVA                    
Sbjct: 309  RNIPETFRFVGREFMGLSSNVLDGLKQTFFNFSQTASKVA----------GPVAIIAVGA 358

Query: 523  XXXXXXXXGLYQFXXXXXXXXXXXXXXXXXXLDGGSLALILVEAVRGGRKLPLEVEQGIM 344
                    GLYQF                  LDGG+LALIL+EAVRGG+KLP+EVEQGIM
Sbjct: 359  EVARSNIDGLYQFAALLNINLAVINLLPLPALDGGTLALILLEAVRGGKKLPVEVEQGIM 418

Query: 343  SSGITLVFVLGLFLIIRDTLNLDFIKDLL 257
            SSGI LV  LGLFLI++DTL+LDFIK++L
Sbjct: 419  SSGIMLVIFLGLFLIVKDTLSLDFIKEML 447


>ref|XP_002881228.1| hypothetical protein ARALYDRAFT_482175 [Arabidopsis lyrata subsp.
            lyrata] gi|297327067|gb|EFH57487.1| hypothetical protein
            ARALYDRAFT_482175 [Arabidopsis lyrata subsp. lyrata]
          Length = 444

 Score =  450 bits (1158), Expect = e-124
 Identities = 242/407 (59%), Positives = 295/407 (72%)
 Frame = -1

Query: 1477 KKQFLHEKIGFPLRIKEKFKTSALADFGFGGFESAQSVLEAVSVLTAIIVVHESGHFLAA 1298
            K + L     +P   +  F+  A++    G FES   VLEA++VLT IIVVHESGHFLAA
Sbjct: 51   KNRVLFGNKRYPNGERFDFRARAISGIDLGSFES---VLEAIAVLTTIIVVHESGHFLAA 107

Query: 1297 YLQGIHVSKFAVGFGPILAKFNANNVEYSIRAFPLGGFVGFXXXXXXXXXXXXDENLLKN 1118
             LQGIHVSKFA+GFGPILAKF+ NNVEYS+RAFPLGGFVGF            DENLLKN
Sbjct: 108  SLQGIHVSKFAIGFGPILAKFDYNNVEYSLRAFPLGGFVGFPDNDPDSEIPIDDENLLKN 167

Query: 1117 RPIPDRILVISAGVIANIVFAYVIIFAQIVFVGLPVQEAFPGVIVPEVRPFSAASRDGLL 938
            RP  DR +V+SAG+IAN++FAY IIF Q++ VGLPVQEAFPGV+VPEV+ FSAASR GLL
Sbjct: 168  RPTLDRSIVVSAGIIANVIFAYAIIFVQVLSVGLPVQEAFPGVLVPEVKTFSAASRYGLL 227

Query: 937  AGDVILSVNDIELPKTVPNSVSQVVDVIKKNPKSTVLFKVGRGGKDFYIKVTPDQNTDGS 758
            +GDVI++V+  EL KT P++VS++VD++K+NPKS VLF+V RG KDF I+VTPD+N DG+
Sbjct: 228  SGDVIIAVDGTELSKTGPDAVSKIVDIVKRNPKSDVLFRVERGNKDFDIRVTPDKNFDGT 287

Query: 757  GRIGVQLSPNVKVLKEKPKDVLEAFSFTGREFWGLTSNVLDSLKQTFLNFSQSASKVAXX 578
            G+IGVQLSPNV++ K +P+++ E F F GREF GL+SNVLD LKQTF NFSQ+ASKVA  
Sbjct: 288  GKIGVQLSPNVRITKVRPRNIPETFRFVGREFMGLSSNVLDGLKQTFFNFSQTASKVA-- 345

Query: 577  XXXXXXXXXXXXXXXXXXXXXXXXXXGLYQFXXXXXXXXXXXXXXXXXXLDGGSLALILV 398
                                      GLYQF                  LDGG+LALIL+
Sbjct: 346  --------GPVAIIAVGAEVARSNIDGLYQFAALLNINLAVINLLPLPALDGGTLALILL 397

Query: 397  EAVRGGRKLPLEVEQGIMSSGITLVFVLGLFLIIRDTLNLDFIKDLL 257
            EAVRGG+KLP+EVEQGIMSSGI LV  LGLFLI++DTL+LDFIK++L
Sbjct: 398  EAVRGGKKLPVEVEQGIMSSGIMLVIFLGLFLIVKDTLSLDFIKEML 444


Top