BLASTX nr result

ID: Astragalus24_contig00023999 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00023999
         (1092 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU30604.1| hypothetical protein TSUD_392950 [Trifolium subt...   100   4e-21
dbj|GAU38238.1| hypothetical protein TSUD_145930 [Trifolium subt...    96   5e-19
dbj|GAU44820.1| hypothetical protein TSUD_400390 [Trifolium subt...    94   8e-19
dbj|GAU33152.1| hypothetical protein TSUD_206110 [Trifolium subt...    93   9e-19
dbj|GAU10638.1| hypothetical protein TSUD_421140, partial [Trifo...    93   3e-18
dbj|GAU32642.1| hypothetical protein TSUD_71900 [Trifolium subte...    89   6e-18
gb|PNX66369.1| ribonuclease H, partial [Trifolium pratense]            90   4e-17
gb|PNY06521.1| nucleic acid binding protein [Trifolium pratense]       87   6e-17
dbj|GAU50328.1| hypothetical protein TSUD_290640 [Trifolium subt...    92   1e-16
dbj|GAU38343.1| hypothetical protein TSUD_395970 [Trifolium subt...    85   3e-16
dbj|GAU26109.1| hypothetical protein TSUD_225720 [Trifolium subt...    86   7e-16
gb|PNX61593.1| ribonuclease H, partial [Trifolium pratense]            84   8e-16
dbj|GAU44140.1| hypothetical protein TSUD_188010 [Trifolium subt...    85   1e-15
dbj|GAU36844.1| hypothetical protein TSUD_213680 [Trifolium subt...    89   1e-15
dbj|GAU46467.1| hypothetical protein TSUD_402340 [Trifolium subt...    87   2e-15
dbj|GAU29065.1| hypothetical protein TSUD_278210 [Trifolium subt...    81   2e-15
gb|ABD33209.1| Ribonuclease H [Medicago truncatula]                    86   2e-15
dbj|GAU34105.1| hypothetical protein TSUD_256010 [Trifolium subt...    88   3e-15
gb|PNY12120.1| 3-ketoacyl-CoA synthase, partial [Trifolium prate...    88   4e-15
dbj|GAU13699.1| hypothetical protein TSUD_348030 [Trifolium subt...    82   4e-15

>dbj|GAU30604.1| hypothetical protein TSUD_392950 [Trifolium subterraneum]
          Length = 233

 Score =  100 bits (250), Expect = 4e-21
 Identities = 45/104 (43%), Positives = 65/104 (62%)
 Frame = +3

Query: 780  EVRWDP*GSEWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIGYCEPLIADLWG 959
            +VRW P    WICLN D A K+ +   GCGGL R+ DG WI  FSK +G     +A+LWG
Sbjct: 62   DVRWYPSEHGWICLNTDDAFKSHNTKDGCGGLFRDEDGHWIRGFSKSLGSATAYVAELWG 121

Query: 960  VLTALQLCRSKGWGKVIVQIDANVVCDVILGLEIGRISGWSIVR 1091
            +L  + + RS G+ K+ VQ+D+ ++  +I     G +SGWSI++
Sbjct: 122  LLEGISIARSMGFNKLEVQMDSEIIVSIINKHGHGNVSGWSIIK 165


>dbj|GAU38238.1| hypothetical protein TSUD_145930 [Trifolium subterraneum]
          Length = 246

 Score = 95.5 bits (236), Expect = 5e-19
 Identities = 45/155 (29%), Positives = 78/155 (50%), Gaps = 16/155 (10%)
 Frame = +3

Query: 675  EAKLIVSMWLGLVRVEMRNNFFGGDLQHWLQFNLV----------------EVRWDP*GS 806
            + K    +W  L+ +E R+ FF GD + W+  NL                  +RW     
Sbjct: 28   DCKSAAIVWNHLLPIEARHLFFVGDFKDWVILNLTTTGWRLEQLERKNVVAHIRWTCPQQ 87

Query: 807  EWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIGYCEPLIADLWGVLTALQLCR 986
             ++CLN DGAVK  ++ AGCGG++R++ G W+  F+K +G C   +A+LWG+   +++  
Sbjct: 88   GYLCLNNDGAVKKGNQQAGCGGVVRDNSGKWVCGFAKVLGSCSAYVAELWGIYEEIKIAN 147

Query: 987  SKGWGKVIVQIDANVVCDVILGLEIGRISGWSIVR 1091
             +   ++ +Q+D+          + G I G  +VR
Sbjct: 148  DRNLMRIEIQVDSKATLQCFTSSKTGNIRGRKLVR 182


>dbj|GAU44820.1| hypothetical protein TSUD_400390 [Trifolium subterraneum]
          Length = 198

 Score = 93.6 bits (231), Expect = 8e-19
 Identities = 38/98 (38%), Positives = 66/98 (67%)
 Frame = +3

Query: 753  QHWLQFNLVEVRWDP*GSEWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIGYC 932
            + + Q   + + W+    EWI LNCDGA K +  +AGCGGL R+S+G W+  ++++IG C
Sbjct: 15   ERYRQLETIYIGWEHPQGEWIKLNCDGAYKESMGLAGCGGLFRDSNGRWLKGYAQKIGAC 74

Query: 933  EPLIADLWGVLTALQLCRSKGWGKVIVQIDANVVCDVI 1046
            + L A++WG+ T +Q+ R +G+  +IVQ D+ ++ D++
Sbjct: 75   DALHAEMWGMYTGMQMARRQGFTHIIVQSDSKLLIDMV 112


>dbj|GAU33152.1| hypothetical protein TSUD_206110 [Trifolium subterraneum]
          Length = 190

 Score = 93.2 bits (230), Expect = 9e-19
 Identities = 38/100 (38%), Positives = 66/100 (66%)
 Frame = +3

Query: 753  QHWLQFNLVEVRWDP*GSEWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIGYC 932
            + + Q   + + W     EWI LNCDGA K +  +AGCGGL R+S+G W+  ++++IG C
Sbjct: 15   ERYRQLETIYIGWKHPQGEWIKLNCDGAYKESMGLAGCGGLFRDSNGRWLKGYAQKIGAC 74

Query: 933  EPLIADLWGVLTALQLCRSKGWGKVIVQIDANVVCDVILG 1052
            + L A++WG+ T +Q+ R +G+  +IV+ D+ ++ D++ G
Sbjct: 75   DALHAEMWGMYTGMQMARRQGFTHIIVESDSKLLIDMVTG 114


>dbj|GAU10638.1| hypothetical protein TSUD_421140, partial [Trifolium subterraneum]
          Length = 236

 Score = 93.2 bits (230), Expect = 3e-18
 Identities = 38/100 (38%), Positives = 66/100 (66%)
 Frame = +3

Query: 753  QHWLQFNLVEVRWDP*GSEWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIGYC 932
            + + Q   + + W     EWI LNCDGA K +  +AGCGGL R+S+G W+  ++++IG C
Sbjct: 53   ERYRQLETIYIGWKHPQGEWIKLNCDGAYKESMGLAGCGGLFRDSNGRWLKGYAQKIGAC 112

Query: 933  EPLIADLWGVLTALQLCRSKGWGKVIVQIDANVVCDVILG 1052
            + L A++WG+ T +Q+ R +G+  +IV+ D+ ++ D++ G
Sbjct: 113  DALHAEMWGMYTGMQMARRQGFTHIIVESDSKLLIDMVTG 152


>dbj|GAU32642.1| hypothetical protein TSUD_71900 [Trifolium subterraneum]
          Length = 109

 Score = 88.6 bits (218), Expect = 6e-18
 Identities = 35/82 (42%), Positives = 60/82 (73%)
 Frame = +3

Query: 807  EWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIGYCEPLIADLWGVLTALQLCR 986
            +WI LNCDGA K +  +AGCGGL R+S+G W+  ++++IG C+ L A++WG+ T +Q+ R
Sbjct: 6    KWIKLNCDGAYKESMGLAGCGGLFRDSNGRWLKGYAQKIGACDALHAEMWGIYTGMQMAR 65

Query: 987  SKGWGKVIVQIDANVVCDVILG 1052
             +G+  +IV+ D+ ++ D++ G
Sbjct: 66   RQGFTHIIVESDSKLLIDMVTG 87


>gb|PNX66369.1| ribonuclease H, partial [Trifolium pratense]
          Length = 252

 Score = 90.1 bits (222), Expect = 4e-17
 Identities = 44/111 (39%), Positives = 68/111 (61%)
 Frame = +3

Query: 756  HWLQFNLVEVRWDP*GSEWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIGYCE 935
            H ++ N+V + W P    W+ LN DGA K    VAGCGG++R+S+G+W   F+K +G C 
Sbjct: 74   HNVEKNVVMINWKPPSEGWVKLNTDGAYKEGS-VAGCGGVIRDSNGVWRGGFAKNLGICS 132

Query: 936  PLIADLWGVLTALQLCRSKGWGKVIVQIDANVVCDVILGLEIGRISGWSIV 1088
              +A+LWGVL  L+   S G+ +V + +D++VV  V+     GR  G ++V
Sbjct: 133  AYVAELWGVLEGLRYANSLGFNRVELNVDSSVVIHVLRRPGYGRPLGGALV 183


>gb|PNY06521.1| nucleic acid binding protein [Trifolium pratense]
          Length = 135

 Score = 86.7 bits (213), Expect = 6e-17
 Identities = 41/107 (38%), Positives = 64/107 (59%)
 Frame = +3

Query: 771  NLVEVRWDP*GSEWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIGYCEPLIAD 950
            +LV + W P  S W+ +N DGA +  D  AGCGGL+R  +  W+  FSK IG C   +++
Sbjct: 7    SLVNIGWTPPNSGWVKINXDGA-RRLDGRAGCGGLIRGENKEWLGGFSKYIGQCSAFVSE 65

Query: 951  LWGVLTALQLCRSKGWGKVIVQIDANVVCDVILGLEIGRISGWSIVR 1091
            LWGV   L+L R+KG+ KV + +D+  V + I   + G   G+ +++
Sbjct: 66   LWGVFEGLKLARAKGFEKVEICVDSQAVINSIKNRDGGNAMGYRLIQ 112


>dbj|GAU50328.1| hypothetical protein TSUD_290640 [Trifolium subterraneum]
          Length = 474

 Score = 92.0 bits (227), Expect = 1e-16
 Identities = 45/100 (45%), Positives = 60/100 (60%)
 Frame = +3

Query: 777  VEVRWDP*GSEWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIGYCEPLIADLW 956
            V++ W    + W+ LN DGA K  D  AGCGGL+RNS G WI  FS+ +G C   +A+LW
Sbjct: 358  VDIAWQQPEAGWVVLNTDGASKM-DVAAGCGGLLRNSHGQWIGGFSRHLGICSAYLAELW 416

Query: 957  GVLTALQLCRSKGWGKVIVQIDANVVCDVILGLEIGRISG 1076
            GVL  L+L R +G  K+ VQ+D+ VV   +    IG   G
Sbjct: 417  GVLDGLRLARERGITKLKVQVDSRVVVQTLNSSNIGSTVG 456


>dbj|GAU38343.1| hypothetical protein TSUD_395970 [Trifolium subterraneum]
          Length = 144

 Score = 85.1 bits (209), Expect = 3e-16
 Identities = 36/96 (37%), Positives = 61/96 (63%)
 Frame = +3

Query: 765  QFNLVEVRWDP*GSEWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIGYCEPLI 944
            Q   + + W     +WI LNCD A K +  +AGCGGL R+SDG W+  ++ +IG C+ L 
Sbjct: 19   QRETIYISWKYPHGDWIKLNCDRAYKDSMNIAGCGGLFRDSDGRWLKAYTLRIGDCDALH 78

Query: 945  ADLWGVLTALQLCRSKGWGKVIVQIDANVVCDVILG 1052
            A++WG+ T +++ R +G+  +IV+ D  ++ D++ G
Sbjct: 79   AEMWGMYTGMKMARRQGYTHLIVESDFKLLIDMVTG 114


>dbj|GAU26109.1| hypothetical protein TSUD_225720 [Trifolium subterraneum]
          Length = 198

 Score = 85.5 bits (210), Expect = 7e-16
 Identities = 40/102 (39%), Positives = 63/102 (61%), Gaps = 1/102 (0%)
 Frame = +3

Query: 750  LQHW-LQFNLVEVRWDP*GSEWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIG 926
            L  W LQ   + + W      W  LNCDGA K++ ++ GCGGL+RN+DG+ +  F+++IG
Sbjct: 13   LDGWNLQNETIFIGWKQPREGWFKLNCDGAHKSSIQLLGCGGLLRNNDGICVSSFARKIG 72

Query: 927  YCEPLIADLWGVLTALQLCRSKGWGKVIVQIDANVVCDVILG 1052
             C+ L AD+WG+   + L R KG   + V+ D+ V+ D++ G
Sbjct: 73   SCDALHADMWGMYIGMNLARRKGVTHLQVESDSKVLVDMVTG 114


>gb|PNX61593.1| ribonuclease H, partial [Trifolium pratense]
          Length = 146

 Score = 84.0 bits (206), Expect = 8e-16
 Identities = 37/105 (35%), Positives = 61/105 (58%)
 Frame = +3

Query: 777  VEVRWDP*GSEWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIGYCEPLIADLW 956
            V+VRW      ++ LN DGAVK   + AGCGG++RN  G W+  F+K +G C   +A+LW
Sbjct: 25   VQVRWMRPQQGYLSLNTDGAVKNGSQQAGCGGVIRNDSGNWVCGFAKALGPCSAFVAELW 84

Query: 957  GVLTALQLCRSKGWGKVIVQIDANVVCDVILGLEIGRISGWSIVR 1091
            G+L  + + + +   ++ VQ+D+  V   +   + G + G  +VR
Sbjct: 85   GILEGIIIAKDRNIMRIEVQVDSTAVLQCLTSSKNGSVRGRRLVR 129


>dbj|GAU44140.1| hypothetical protein TSUD_188010 [Trifolium subterraneum]
          Length = 200

 Score = 84.7 bits (208), Expect = 1e-15
 Identities = 46/104 (44%), Positives = 61/104 (58%)
 Frame = +3

Query: 777  VEVRWDP*GSEWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIGYCEPLIADLW 956
            VE+ W P    ++ LN DGA K  +K A CGG++R + G W+  F+K IG C   IA+LW
Sbjct: 51   VEIGWKPPSDNFVRLNTDGARKDNNK-AECGGIIRGNHGEWLGGFAKVIGECSAFIAELW 109

Query: 957  GVLTALQLCRSKGWGKVIVQIDANVVCDVILGLEIGRISGWSIV 1088
            GV   L L R  G+ KV V ID+ VV  VI   ++    GWS+V
Sbjct: 110  GVFEGLTLARRMGFRKVEVHIDSVVVVQVITTGKLHNKIGWSLV 153


>dbj|GAU36844.1| hypothetical protein TSUD_213680 [Trifolium subterraneum]
          Length = 1025

 Score = 89.4 bits (220), Expect = 1e-15
 Identities = 37/96 (38%), Positives = 63/96 (65%)
 Frame = +3

Query: 765  QFNLVEVRWDP*GSEWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIGYCEPLI 944
            Q   + + W     +WI LNCDGA K +  +AGCGGL R+SDG W+  ++ +IG C+ L 
Sbjct: 846  QRETIYIGWKYPHGDWIKLNCDGAYKDSMNIAGCGGLFRDSDGRWLKGYTLRIGDCDALH 905

Query: 945  ADLWGVLTALQLCRSKGWGKVIVQIDANVVCDVILG 1052
            A++WG+ T +++ R +G+  +IV+ D+ ++ D++ G
Sbjct: 906  AEMWGMYTGMKMARRQGYTHLIVESDSKLLIDMVTG 941


>dbj|GAU46467.1| hypothetical protein TSUD_402340 [Trifolium subterraneum]
          Length = 299

 Score = 86.7 bits (213), Expect = 2e-15
 Identities = 42/105 (40%), Positives = 63/105 (60%)
 Frame = +3

Query: 777  VEVRWDP*GSEWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIGYCEPLIADLW 956
            V + W+P    W+ LN DGA K  ++VAGCGG++RN+ G WI  F+K +G C   +A+LW
Sbjct: 128  VMIGWEPPSQGWVKLNTDGARKN-ERVAGCGGIIRNNIGDWIGGFAKHVGSCSAFVAELW 186

Query: 957  GVLTALQLCRSKGWGKVIVQIDANVVCDVILGLEIGRISGWSIVR 1091
            GVL  L      G+ KV ++ID+ +V D +   E     G +++R
Sbjct: 187  GVLEGLNYAWKLGFKKVELEIDSAIVVDAVNSGETNSAMGIALIR 231


>dbj|GAU29065.1| hypothetical protein TSUD_278210 [Trifolium subterraneum]
          Length = 102

 Score = 81.3 bits (199), Expect = 2e-15
 Identities = 38/95 (40%), Positives = 60/95 (63%)
 Frame = +3

Query: 777  VEVRWDP*GSEWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIGYCEPLIADLW 956
            ++V+W P    WIC+N DGAV+    VAGCGG++R+  G WI  F K + + +  I++LW
Sbjct: 8    IDVKWCPPKENWICINTDGAVQ--QDVAGCGGVIRDQIGKWIAGFVKNMRFSKDYISELW 65

Query: 957  GVLTALQLCRSKGWGKVIVQIDANVVCDVILGLEI 1061
            G    L+  RSKG+  V +++D++VV   I G ++
Sbjct: 66   GAYEVLKFARSKGFTMVELRMDSSVVVGSIKGEKV 100


>gb|ABD33209.1| Ribonuclease H [Medicago truncatula]
          Length = 302

 Score = 86.3 bits (212), Expect = 2e-15
 Identities = 38/91 (41%), Positives = 58/91 (63%)
 Frame = +3

Query: 774  LVEVRWDP*GSEWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIGYCEPLIADL 953
            +V + W      W+ LNCDGA K   ++AGCGGL+R SDG WI  FS++IG C+ L A++
Sbjct: 197  IVYIDWKRPLDGWVKLNCDGACKGNGELAGCGGLLRQSDGTWIKGFSRKIGACDALHAEM 256

Query: 954  WGVLTALQLCRSKGWGKVIVQIDANVVCDVI 1046
            WG+   L +   +G   +IV+ D+ V+ D++
Sbjct: 257  WGLYLGLDMAWREGISHLIVESDSKVLIDMV 287


>dbj|GAU34105.1| hypothetical protein TSUD_256010 [Trifolium subterraneum]
          Length = 679

 Score = 88.2 bits (217), Expect = 3e-15
 Identities = 40/96 (41%), Positives = 61/96 (63%)
 Frame = +3

Query: 765  QFNLVEVRWDP*GSEWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIGYCEPLI 944
            Q N V + W+     WI LNCDGA K +  +AGCGGL RNSDG WI  ++++IG C+ L 
Sbjct: 453  QCNTVFIGWNKPREGWIKLNCDGAYKDSLGLAGCGGLFRNSDGRWIKGYARKIGTCDALS 512

Query: 945  ADLWGVLTALQLCRSKGWGKVIVQIDANVVCDVILG 1052
            A++WG+   +QL   +G+  + V+ D+  + D++ G
Sbjct: 513  AEMWGMYLGMQLAWRQGFHLLQVESDSKTLVDMVTG 548


>gb|PNY12120.1| 3-ketoacyl-CoA synthase, partial [Trifolium pratense]
          Length = 609

 Score = 87.8 bits (216), Expect = 4e-15
 Identities = 42/90 (46%), Positives = 60/90 (66%), Gaps = 5/90 (5%)
 Frame = +3

Query: 771  NLVE-----VRWDP*GSEWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIGYCE 935
            NLVE     VRW P    W+ +N DGA K  D+VAGCGG+++  DG WI  F+K +G C 
Sbjct: 276  NLVERVVTNVRWLPLEPGWVRINTDGASKG-DEVAGCGGMIKGEDGSWICGFTKGVGVCS 334

Query: 936  PLIADLWGVLTALQLCRSKGWGKVIVQIDA 1025
              +A+LWGVL ALQ+ R++G+ +V + +D+
Sbjct: 335  AYVAELWGVLEALQIARARGFRQVELHVDS 364


>dbj|GAU13699.1| hypothetical protein TSUD_348030 [Trifolium subterraneum]
          Length = 131

 Score = 81.6 bits (200), Expect = 4e-15
 Identities = 45/105 (42%), Positives = 60/105 (57%)
 Frame = +3

Query: 777  VEVRWDP*GSEWICLNCDGAVKAADKVAGCGGLMRNSDGMWILDFSKQIGYCEPLIADLW 956
            + VRW+   + WI LN DGAV+    VAGCGG++R+  G WI  FSK IG      A+LW
Sbjct: 25   INVRWEAPRNGWISLNTDGAVQHG--VAGCGGVLRDYQGNWITGFSKFIGTASVFKAELW 82

Query: 957  GVLTALQLCRSKGWGKVIVQIDANVVCDVILGLEIGRISGWSIVR 1091
            GV   L L R +G   + +QID+  V   + G  +G   G S+VR
Sbjct: 83   GVYAGLCLARQRGINNIELQIDSLAVVRNLGGDSLGSSEGKSLVR 127

