BLASTX nr result

ID: Atractylodes22_contig00010596 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes22_contig00010596
         (1440 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003553394.1| PREDICTED: uncharacterized protein LOC100778...   164   7e-38
ref|XP_002879936.1| DNAJ heat shock N-terminal domain-containing...   148   3e-33
ref|NP_973659.1| DNAJ heat shock N-terminal domain-containing pr...   146   2e-32
ref|NP_850351.1| DNAJ heat shock N-terminal domain-containing pr...   146   2e-32
gb|AAL32666.1| Unknown protein [Arabidopsis thaliana]                 146   2e-32

>ref|XP_003553394.1| PREDICTED: uncharacterized protein LOC100778106 [Glycine max]
          Length = 1017

 Score =  164 bits (414), Expect = 7e-38
 Identities = 113/315 (35%), Positives = 158/315 (50%), Gaps = 52/315 (16%)
 Frame = +3

Query: 651  SSFTSDMFPGLGKKLEFS-KSNSVGQRKLKKTKGKLRQQARNCQQGGHTRLSKDI-PESF 824
            S F  ++FP L KK+E + K  S  ++  K  + K++  + N +Q G   LSK+   +  
Sbjct: 491  SCFKENLFPKLNKKVESTPKGRSCKEKGSKCMRKKMKPHSVNKKQSGLYHLSKENGSQKT 550

Query: 825  EESPGCSSPMDVSPYWATDCA----------------PTSTDPATSQVKNE---DDVDTT 947
             +S G  SPMD SPY  T  +                PT    + +    +   D +  T
Sbjct: 551  PDSSGIHSPMDFSPYQETTASDRVKASEKLNDLHSTMPTDRSGSVAGASADAGFDFIPNT 610

Query: 948  EK--------------LSAKNFSFSATSFANVNAPARQRPHQKKYKMKTGRG--LESTTT 1079
            EK                 K F+FSA+S  +   P+ +R  +KK++ K G    + S   
Sbjct: 611  EKQKDDVFRFVHGVNDSKGKGFAFSASSSVD-GTPSLKRQQKKKFRRKMGCNSFVNSPRV 669

Query: 1080 NSRGDASLAHEPTSRKHTKA---------------TDQETCDKWRKRGNQAYKNRDLSEA 1214
            N    +S+   P +  +  +               T    CD WR RGNQA+K+ DLS+A
Sbjct: 670  NGNFVSSVQFSPHNPANMSSHSDVQFKEGDVASLDTIPAACDTWRLRGNQAHKDGDLSKA 729

Query: 1215 EVYYSKGISSIQHTETPAFCIEPLLLCYSNRAATRMALGRMREGLNDCRMAAALDPKFMK 1394
            E  YS+GI+S+  +E      +PLLLCYSNRAATRM+LGR+RE L DC MA ALDP FMK
Sbjct: 730  EDLYSRGINSVPSSERSGCWAKPLLLCYSNRAATRMSLGRIREALEDCMMATALDPTFMK 789

Query: 1395 ANLRSANCHLFLGEV 1439
              +R+ANCHL LGEV
Sbjct: 790  VQMRTANCHLLLGEV 804


>ref|XP_002879936.1| DNAJ heat shock N-terminal domain-containing protein [Arabidopsis
            lyrata subsp. lyrata] gi|297325775|gb|EFH56195.1| DNAJ
            heat shock N-terminal domain-containing protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 1099

 Score =  148 bits (374), Expect = 3e-33
 Identities = 97/280 (34%), Positives = 150/280 (53%), Gaps = 23/280 (8%)
 Frame = +3

Query: 669  MFPGLGKKLEFSKSN-SVGQRKLKKTKGKLRQQARNCQQGGHTRLSKDIPESFE-----E 830
            +FP + +    ++SN S   ++ KK   K++Q+ ++       R +    E  E      
Sbjct: 368  LFPEVNRNPVLARSNRSSKDKRSKKVMEKIKQRKQD-------RCNDQTAEGIEAQEKLN 420

Query: 831  SPGCSSPMDVSPYW---ATDCAPTSTDPATSQVKNEDDVDTTEKLSAK--------NFSF 977
            SPG  SPMD SPY    A++  PT T P T     E     +   +A+        NFSF
Sbjct: 421  SPGYCSPMDYSPYQGETASNQLPTET-PLTPSHSREPSARDSFLFTAEDHGSSCMPNFSF 479

Query: 978  SATSFANVNAPARQRPHQKKYKMKTGRGLE----STTTNSRGDASLAHEPTSRKHTKATD 1145
            SA++ +    P ++    KKY+ K    +     +TT  +  +    +   S++ + +T 
Sbjct: 480  SAST-SQGTIPHKKLQAVKKYRRKVNNSVPKNNLNTTMRNNEENQRVNTGQSKQDSGSTS 538

Query: 1146 Q--ETCDKWRKRGNQAYKNRDLSEAEVYYSKGISSIQHTETPAFCIEPLLLCYSNRAATR 1319
               + C+ WR RGNQAYKN ++ +AE  Y+ GISS    +   + ++PL LCY NRAA R
Sbjct: 539  MMPDVCEVWRLRGNQAYKNGNMCKAEECYTHGISSSPSNDNSEYSVKPLALCYGNRAAAR 598

Query: 1320 MALGRMREGLNDCRMAAALDPKFMKANLRSANCHLFLGEV 1439
            ++LGR+RE ++DC MAA+LDP ++KA  R+ANCHL LGE+
Sbjct: 599  ISLGRLREAISDCEMAASLDPSYIKAYTRAANCHLVLGEL 638


>ref|NP_973659.1| DNAJ heat shock N-terminal domain-containing protein [Arabidopsis
            thaliana] gi|330254900|gb|AEC09994.1| DNAJ heat shock
            N-terminal domain-containing protein [Arabidopsis
            thaliana]
          Length = 1077

 Score =  146 bits (368), Expect = 2e-32
 Identities = 98/293 (33%), Positives = 145/293 (49%), Gaps = 30/293 (10%)
 Frame = +3

Query: 651  SSFTSDMFPGLGKKLEFSKSN-SVGQRKLKKTKGKLRQ-QARNCQQGGHTRLSKDIPESF 824
            S     +FP + +    ++SN S   ++ KK K K++Q +   C   G T    +  E  
Sbjct: 359  SLLKDSLFPEVDRNPVHARSNRSSKDKRSKKVKEKMKQGEPDRCN--GQTAEGIEAQEKL 416

Query: 825  EESPGCSSPMDVSPYWATDCA---PTSTDPATSQVKNEDDVDTTEKLSAK---------- 965
              SPG  SPMD SPY     +   PT T  A S  +   D  ++                
Sbjct: 417  N-SPGYCSPMDYSPYQGDKTSNQFPTETPLAPSHSREHIDSRSSNDFKVASARDSSLFTA 475

Query: 966  ---------NFSFSATSFANVNAPARQRPHQKKYKMKTGRGLESTTTNSRGDASLAHEPT 1118
                     NFSFSA++ +      ++    KKY+ K    L  +  N+    +  ++P 
Sbjct: 476  EDHGSTCIPNFSFSAST-SQETIRHKKLQAVKKYRRKVNNSLPKSNLNATMRNNQENQPV 534

Query: 1119 SRKHTKATDQET------CDKWRKRGNQAYKNRDLSEAEVYYSKGISSIQHTETPAFCIE 1280
            +    K     T      C+ WR RGNQAYKN  +S+AE  Y+ GI+S    +   + ++
Sbjct: 535  NTGQAKQDSGSTSMMPDVCEVWRLRGNQAYKNGYMSKAEECYTHGINSSPSKDNSEYSVK 594

Query: 1281 PLLLCYSNRAATRMALGRMREGLNDCRMAAALDPKFMKANLRSANCHLFLGEV 1439
            PL LCY NRAA R++LGR+RE ++DC MAA+LDP ++KA +R+ANCHL LGE+
Sbjct: 595  PLALCYGNRAAARISLGRLREAISDCEMAASLDPSYIKAYMRAANCHLVLGEL 647


>ref|NP_850351.1| DNAJ heat shock N-terminal domain-containing protein [Arabidopsis
            thaliana] gi|330254899|gb|AEC09993.1| DNAJ heat shock
            N-terminal domain-containing protein [Arabidopsis
            thaliana]
          Length = 1108

 Score =  146 bits (368), Expect = 2e-32
 Identities = 98/293 (33%), Positives = 145/293 (49%), Gaps = 30/293 (10%)
 Frame = +3

Query: 651  SSFTSDMFPGLGKKLEFSKSN-SVGQRKLKKTKGKLRQ-QARNCQQGGHTRLSKDIPESF 824
            S     +FP + +    ++SN S   ++ KK K K++Q +   C   G T    +  E  
Sbjct: 359  SLLKDSLFPEVDRNPVHARSNRSSKDKRSKKVKEKMKQGEPDRCN--GQTAEGIEAQEKL 416

Query: 825  EESPGCSSPMDVSPYWATDCA---PTSTDPATSQVKNEDDVDTTEKLSAK---------- 965
              SPG  SPMD SPY     +   PT T  A S  +   D  ++                
Sbjct: 417  N-SPGYCSPMDYSPYQGDKTSNQFPTETPLAPSHSREHIDSRSSNDFKVASARDSSLFTA 475

Query: 966  ---------NFSFSATSFANVNAPARQRPHQKKYKMKTGRGLESTTTNSRGDASLAHEPT 1118
                     NFSFSA++ +      ++    KKY+ K    L  +  N+    +  ++P 
Sbjct: 476  EDHGSTCIPNFSFSAST-SQETIRHKKLQAVKKYRRKVNNSLPKSNLNATMRNNQENQPV 534

Query: 1119 SRKHTKATDQET------CDKWRKRGNQAYKNRDLSEAEVYYSKGISSIQHTETPAFCIE 1280
            +    K     T      C+ WR RGNQAYKN  +S+AE  Y+ GI+S    +   + ++
Sbjct: 535  NTGQAKQDSGSTSMMPDVCEVWRLRGNQAYKNGYMSKAEECYTHGINSSPSKDNSEYSVK 594

Query: 1281 PLLLCYSNRAATRMALGRMREGLNDCRMAAALDPKFMKANLRSANCHLFLGEV 1439
            PL LCY NRAA R++LGR+RE ++DC MAA+LDP ++KA +R+ANCHL LGE+
Sbjct: 595  PLALCYGNRAAARISLGRLREAISDCEMAASLDPSYIKAYMRAANCHLVLGEL 647


>gb|AAL32666.1| Unknown protein [Arabidopsis thaliana]
          Length = 1108

 Score =  146 bits (368), Expect = 2e-32
 Identities = 98/293 (33%), Positives = 145/293 (49%), Gaps = 30/293 (10%)
 Frame = +3

Query: 651  SSFTSDMFPGLGKKLEFSKSN-SVGQRKLKKTKGKLRQ-QARNCQQGGHTRLSKDIPESF 824
            S     +FP + +    ++SN S   ++ KK K K++Q +   C   G T    +  E  
Sbjct: 359  SLLKDSLFPEVDRNPVHARSNRSSKDKRSKKVKEKMKQGEPDRCN--GQTAEGIEAQEKL 416

Query: 825  EESPGCSSPMDVSPYWATDCA---PTSTDPATSQVKNEDDVDTTEKLSAK---------- 965
              SPG  SPMD SPY     +   PT T  A S  +   D  ++                
Sbjct: 417  N-SPGYCSPMDYSPYQGDKTSNQFPTETPLAPSHSREHIDSRSSNDFKVASARDSSLFTA 475

Query: 966  ---------NFSFSATSFANVNAPARQRPHQKKYKMKTGRGLESTTTNSRGDASLAHEPT 1118
                     NFSFSA++ +      ++    KKY+ K    L  +  N+    +  ++P 
Sbjct: 476  EDHGSTCIPNFSFSAST-SQETIRHKKLQAVKKYRRKVNNSLPKSNLNATMRNNQENQPV 534

Query: 1119 SRKHTKATDQET------CDKWRKRGNQAYKNRDLSEAEVYYSKGISSIQHTETPAFCIE 1280
            +    K     T      C+ WR RGNQAYKN  +S+AE  Y+ GI+S    +   + ++
Sbjct: 535  NTGQAKQDSGSTSMMPDVCEVWRLRGNQAYKNGYMSKAEECYTHGINSSPSKDNSEYSVK 594

Query: 1281 PLLLCYSNRAATRMALGRMREGLNDCRMAAALDPKFMKANLRSANCHLFLGEV 1439
            PL LCY NRAA R++LGR+RE ++DC MAA+LDP ++KA +R+ANCHL LGE+
Sbjct: 595  PLALCYGNRAAARISLGRLREAISDCEMAASLDPSYIKAYMRAANCHLVLGEL 647

