BLASTX nr result

ID: Astragalus24_contig00002338 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00002338
         (394 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU15206.1| hypothetical protein TSUD_09440 [Trifolium subte...    72   6e-13
gb|KHN08856.1| Putative ribonuclease H protein, partial [Glycine...    68   1e-11
gb|PNX91284.1| ribonuclease H [Trifolium pratense]                     67   3e-11
dbj|GAU36544.1| hypothetical protein TSUD_277500 [Trifolium subt...    65   1e-10
gb|PNX55786.1| ribonuclease H, partial [Trifolium pratense]            65   2e-10
dbj|GAU31120.1| hypothetical protein TSUD_212270 [Trifolium subt...    66   4e-10
dbj|GAU29126.1| hypothetical protein TSUD_58920 [Trifolium subte...    61   2e-09
dbj|GAU51393.1| hypothetical protein TSUD_138060 [Trifolium subt...    61   5e-09
dbj|GAU51253.1| hypothetical protein TSUD_412460, partial [Trifo...    63   6e-09
gb|PNY12327.1| ribonuclease H [Trifolium pratense]                     63   7e-09
dbj|GAU28906.1| hypothetical protein TSUD_59160 [Trifolium subte...    59   9e-09
gb|KHN22754.1| Putative ribonuclease H protein [Glycine soja]          59   2e-08
gb|PNX75059.1| ribonuclease H [Trifolium pratense]                     62   2e-08
dbj|GAU30604.1| hypothetical protein TSUD_392950 [Trifolium subt...    60   5e-08
gb|PNY06521.1| nucleic acid binding protein [Trifolium pratense]       57   2e-07
gb|PNY06182.1| ribonuclease H [Trifolium pratense]                     59   2e-07
gb|PNX95782.1| ribonuclease H [Trifolium pratense]                     59   2e-07
dbj|GAU14934.1| hypothetical protein TSUD_47270 [Trifolium subte...    59   2e-07
gb|PNX79929.1| ribonuclease H, partial [Trifolium pratense]            59   3e-07
dbj|GAU44140.1| hypothetical protein TSUD_188010 [Trifolium subt...    57   3e-07

>dbj|GAU15206.1| hypothetical protein TSUD_09440 [Trifolium subterraneum]
          Length = 175

 Score = 71.6 bits (174), Expect = 6e-13
 Identities = 34/76 (44%), Positives = 50/76 (65%)
 Frame = -3

Query: 278 WGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*KGVLGSSEGWNMMRRIIKEIKSSNWNVQ 99
           WG YEGL LA   G  QL V++DS+ +V ++  G  GS++ W+++RR IK++ + NWNV+
Sbjct: 43  WGFYEGLRLANQMGVHQLEVQLDSSIVVDSIQHGKTGSAKAWSIIRR-IKQLLAFNWNVR 101

Query: 98  FQHCFKEANRSGHALA 51
             H + EANR    LA
Sbjct: 102 INHTYWEANRCADVLA 117


>gb|KHN08856.1| Putative ribonuclease H protein, partial [Glycine soja]
          Length = 147

 Score = 67.8 bits (164), Expect = 1e-11
 Identities = 32/77 (41%), Positives = 57/77 (74%), Gaps = 1/77 (1%)
 Frame = -3

Query: 278 WGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*KGVL-GSSEGWNMMRRIIKEIKSSNWNV 102
           WG+YEGL LA+++G+K +IV++DS +++ N+ KG   GS++GW+++R+ I+ + S++W  
Sbjct: 33  WGVYEGLKLAQNRGFKVVIVQVDS-QVIVNILKGKKDGSAQGWSLVRK-IRLLFSADWRT 90

Query: 101 QFQHCFKEANRSGHALA 51
           +  H ++EAN   H LA
Sbjct: 91  KVFHVYREANIWAHMLA 107


>gb|PNX91284.1| ribonuclease H [Trifolium pratense]
          Length = 178

 Score = 67.4 bits (163), Expect = 3e-11
 Identities = 31/76 (40%), Positives = 52/76 (68%)
 Frame = -3

Query: 278 WGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*KGVLGSSEGWNMMRRIIKEIKSSNWNVQ 99
           W ++EGL LA+    + L V++DS  +V +L +G LGS+ GW+++++ IKE+ + +WNV+
Sbjct: 69  WSLHEGLCLAREYXVRHLEVQLDSKVVVCSLQEGKLGSAAGWSLIKK-IKELLNYSWNVK 127

Query: 98  FQHCFKEANRSGHALA 51
             H ++EANR    LA
Sbjct: 128 IIHVYREANRCADILA 143


>dbj|GAU36544.1| hypothetical protein TSUD_277500 [Trifolium subterraneum]
          Length = 147

 Score = 65.1 bits (157), Expect = 1e-10
 Identities = 31/76 (40%), Positives = 49/76 (64%)
 Frame = -3

Query: 278 WGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*KGVLGSSEGWNMMRRIIKEIKSSNWNVQ 99
           WG++ GL +A+SQG  +LIV+ DS  IV +L  G  GS+ GW + ++ IK++ + NW V+
Sbjct: 34  WGLFHGLRIARSQGIGKLIVQSDSVVIVKSLQTGSEGSATGWMLFKK-IKQLLTLNWEVR 92

Query: 98  FQHCFKEANRSGHALA 51
             H ++EAN     +A
Sbjct: 93  IIHVYREANSCADIMA 108


>gb|PNX55786.1| ribonuclease H, partial [Trifolium pratense]
          Length = 166

 Score = 64.7 bits (156), Expect = 2e-10
 Identities = 32/76 (42%), Positives = 48/76 (63%)
 Frame = -3

Query: 278 WGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*KGVLGSSEGWNMMRRIIKEIKSSNWNVQ 99
           WG++EG+ LAK  G+  + ++IDS  +V NL    LGSS G  ++RR I+ +    WNV+
Sbjct: 61  WGVHEGICLAKQNGFNNIELQIDSMIVVRNLGGDRLGSSGGRCLVRR-IRSLFQEGWNVR 119

Query: 98  FQHCFKEANRSGHALA 51
            +H ++E NR   ALA
Sbjct: 120 IRHVYRETNRVADALA 135


>dbj|GAU31120.1| hypothetical protein TSUD_212270 [Trifolium subterraneum]
          Length = 347

 Score = 66.2 bits (160), Expect = 4e-10
 Identities = 33/76 (43%), Positives = 49/76 (64%)
 Frame = -3

Query: 278 WGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*KGVLGSSEGWNMMRRIIKEIKSSNWNVQ 99
           WG+Y GL LA+ +G   + ++IDS  +V NL    LGSSEG +++RR ++ +     NV+
Sbjct: 268 WGVYAGLCLARQRGINNIELQIDSLAVVRNLGDDSLGSSEGKSLVRR-VRNLFQEGLNVR 326

Query: 98  FQHCFKEANRSGHALA 51
            QH ++EANR   ALA
Sbjct: 327 IQHVYREANRVADALA 342


>dbj|GAU29126.1| hypothetical protein TSUD_58920 [Trifolium subterraneum]
          Length = 118

 Score = 61.2 bits (147), Expect = 2e-09
 Identities = 28/76 (36%), Positives = 49/76 (64%)
 Frame = -3

Query: 278 WGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*KGVLGSSEGWNMMRRIIKEIKSSNWNVQ 99
           WG+YEGL LA++ G  +L V++DS  +V  +    +G +  WN+M++ I+E+ + +W ++
Sbjct: 5   WGLYEGLCLARNLGITRLEVQVDSEALVKAIQGSNVGGNMYWNIMKK-IQELLNLDWEIR 63

Query: 98  FQHCFKEANRSGHALA 51
             H ++EANR    LA
Sbjct: 64  LTHIYREANRCADILA 79


>dbj|GAU51393.1| hypothetical protein TSUD_138060 [Trifolium subterraneum]
          Length = 167

 Score = 61.2 bits (147), Expect = 5e-09
 Identities = 28/76 (36%), Positives = 49/76 (64%)
 Frame = -3

Query: 278 WGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*KGVLGSSEGWNMMRRIIKEIKSSNWNVQ 99
           WG+YEGL LA++ G  +L V++DS  +V  +    +G +  WN+M++ I+E+ + +W ++
Sbjct: 54  WGLYEGLCLARNLGITRLEVQVDSEALVKAIQGSNVGGNMYWNIMKK-IQELLNLDWEIR 112

Query: 98  FQHCFKEANRSGHALA 51
             H ++EANR    LA
Sbjct: 113 LTHIYREANRCADILA 128


>dbj|GAU51253.1| hypothetical protein TSUD_412460, partial [Trifolium subterraneum]
          Length = 609

 Score = 63.2 bits (152), Expect = 6e-09
 Identities = 31/76 (40%), Positives = 46/76 (60%)
 Frame = -3

Query: 278 WGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*KGVLGSSEGWNMMRRIIKEIKSSNWNVQ 99
           WG+YEGL +A++ G ++L V++DS  +V    K   G +  WN+MRR I+ +   NW V+
Sbjct: 506 WGLYEGLSMARNLGIERLEVQVDSEVLVMATKKDGTGCTMSWNIMRR-IRALLDLNWEVR 564

Query: 98  FQHCFKEANRSGHALA 51
            +H F E NR    LA
Sbjct: 565 IKHIFCEGNRCADVLA 580


>gb|PNY12327.1| ribonuclease H [Trifolium pratense]
          Length = 370

 Score = 62.8 bits (151), Expect = 7e-09
 Identities = 31/76 (40%), Positives = 49/76 (64%)
 Frame = -3

Query: 278 WGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*KGVLGSSEGWNMMRRIIKEIKSSNWNVQ 99
           WG+YEGL LA+ +G   + ++IDS  +V NL    +GS+ G +++RRI  +     WNV+
Sbjct: 259 WGVYEGLCLARQRGLMNVELQIDSLAVVKNLEGKSIGSNGGRSLIRRI--QCLLQGWNVR 316

Query: 98  FQHCFKEANRSGHALA 51
            +H ++EAN+   ALA
Sbjct: 317 VRHVYREANKVADALA 332


>dbj|GAU28906.1| hypothetical protein TSUD_59160 [Trifolium subterraneum]
          Length = 114

 Score = 59.3 bits (142), Expect = 9e-09
 Identities = 27/76 (35%), Positives = 49/76 (64%)
 Frame = -3

Query: 278 WGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*KGVLGSSEGWNMMRRIIKEIKSSNWNVQ 99
           WG+YEGL LA++ G  +L V++DS  +V  +    +G +  WN+M++ I+E+ + +W ++
Sbjct: 5   WGLYEGLCLARNLGITRLEVQVDSEALVKAIQGSNVGGNMYWNIMKK-IQELLNLDWEIR 63

Query: 98  FQHCFKEANRSGHALA 51
             H +++ANR    LA
Sbjct: 64  LTHIYRDANRCTDILA 79


>gb|KHN22754.1| Putative ribonuclease H protein [Glycine soja]
          Length = 121

 Score = 58.9 bits (141), Expect = 2e-08
 Identities = 31/103 (30%), Positives = 58/103 (56%), Gaps = 10/103 (9%)
 Frame = -3

Query: 329 LLWG----WSFPWQRFV------VSVGWGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*K 180
           L+W     W + +Q+++      V+  WG+++GL LA  +G+ ++++++DS  ++  +  
Sbjct: 8   LIWDHDGRWIYGFQKYIGRSSTFVAELWGVFQGLKLAILKGFTRILLQVDSKAVILAIRS 67

Query: 179 GVLGSSEGWNMMRRIIKEIKSSNWNVQFQHCFKEANRSGHALA 51
           G  GS+ GW +++ I K I+  N   Q  H +KE NR    LA
Sbjct: 68  GNEGSASGWRLIQAIQKFIRMVN-QFQINHMYKETNRCVDKLA 109


>gb|PNX75059.1| ribonuclease H [Trifolium pratense]
          Length = 362

 Score = 61.6 bits (148), Expect = 2e-08
 Identities = 29/76 (38%), Positives = 47/76 (61%)
 Frame = -3

Query: 278 WGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*KGVLGSSEGWNMMRRIIKEIKSSNWNVQ 99
           WG+Y+GL LA+S G  Q+ V +DS+ +V  L     GS  GW +++  I+ + + +W ++
Sbjct: 249 WGVYDGLCLARSLGATQIKVHVDSSVVVQTLNSTNGGSVVGWRLVQE-IRRLLALDWEIK 307

Query: 98  FQHCFKEANRSGHALA 51
             HC++EAN    ALA
Sbjct: 308 VCHCYREANACADALA 323


>dbj|GAU30604.1| hypothetical protein TSUD_392950 [Trifolium subterraneum]
          Length = 233

 Score = 59.7 bits (143), Expect = 5e-08
 Identities = 28/76 (36%), Positives = 51/76 (67%)
 Frame = -3

Query: 278 WGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*KGVLGSSEGWNMMRRIIKEIKSSNWNVQ 99
           WG+ EG+ +A+S G+ +L V++DS  IV+ + K   G+  GW+++++ I+ + S +W+V+
Sbjct: 120 WGLLEGISIARSMGFNKLEVQMDSEIIVSIINKHGHGNVSGWSIIKK-IRSLLSLDWSVK 178

Query: 98  FQHCFKEANRSGHALA 51
             H ++EANR    LA
Sbjct: 179 ICHFYREANRCADMLA 194


>gb|PNY06521.1| nucleic acid binding protein [Trifolium pratense]
          Length = 135

 Score = 56.6 bits (135), Expect = 2e-07
 Identities = 24/74 (32%), Positives = 49/74 (66%)
 Frame = -3

Query: 290 VSVGWGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*KGVLGSSEGWNMMRRIIKEIKSSN 111
           VS  WG++EGL LA+++G++++ + +DS  ++ ++     G++ G+ +++R IK++   N
Sbjct: 63  VSELWGVFEGLKLARAKGFEKVEICVDSQAVINSIKNRDGGNAMGYRLIQR-IKQLLELN 121

Query: 110 WNVQFQHCFKEANR 69
           W V   H ++E NR
Sbjct: 122 WEVNISHSYRETNR 135


>gb|PNY06182.1| ribonuclease H [Trifolium pratense]
          Length = 686

 Score = 58.9 bits (141), Expect = 2e-07
 Identities = 25/76 (32%), Positives = 49/76 (64%)
 Frame = -3

Query: 278 WGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*KGVLGSSEGWNMMRRIIKEIKSSNWNVQ 99
           WG+YEGL LA+ + +  + +++DS  +V  +    +GS+ G  ++ R I+++ + +WNV+
Sbjct: 574 WGVYEGLCLARRKSFNNIELQVDSLVVVRGIKGEEVGSASGRILLNR-IRQLMNMDWNVR 632

Query: 98  FQHCFKEANRSGHALA 51
             H ++EAN+   A+A
Sbjct: 633 ISHVYREANKVADAIA 648


>gb|PNX95782.1| ribonuclease H [Trifolium pratense]
          Length = 360

 Score = 58.5 bits (140), Expect = 2e-07
 Identities = 28/76 (36%), Positives = 46/76 (60%)
 Frame = -3

Query: 278 WGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*KGVLGSSEGWNMMRRIIKEIKSSNWNVQ 99
           WG+YEGL +A++ G ++L V++DS  +V    K   G +  WN+M++I   +   +  V+
Sbjct: 246 WGLYEGLSMARNLGLERLEVQVDSEVLVKVTKKDGTGCTMSWNIMKKIRDLLLELDCEVR 305

Query: 98  FQHCFKEANRSGHALA 51
            +H F+E NR   ALA
Sbjct: 306 IKHIFREGNRCADALA 321


>dbj|GAU14934.1| hypothetical protein TSUD_47270 [Trifolium subterraneum]
          Length = 650

 Score = 58.5 bits (140), Expect = 2e-07
 Identities = 26/81 (32%), Positives = 50/81 (61%)
 Frame = -3

Query: 293 VVSVGWGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*KGVLGSSEGWNMMRRIIKEIKSS 114
           +++  WG+ EGL LAK + ++++ V +DS+ +V  +  G   SS G+++++  I+ +   
Sbjct: 532 IIAELWGVLEGLKLAKGRRFRKIEVNVDSSSVVKMIMNGESSSSMGFSLIKS-IRRLLDG 590

Query: 113 NWNVQFQHCFKEANRSGHALA 51
            W V+  H ++EAN+   ALA
Sbjct: 591 EWEVKISHTYREANKCADALA 611


>gb|PNX79929.1| ribonuclease H, partial [Trifolium pratense]
          Length = 709

 Score = 58.5 bits (140), Expect = 3e-07
 Identities = 29/76 (38%), Positives = 47/76 (61%)
 Frame = -3

Query: 278 WGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*KGVLGSSEGWNMMRRIIKEIKSSNWNVQ 99
           WG YEGL LA+ +G   + ++IDS  +V  +    +GS+ G ++ RR I+ +    WNV+
Sbjct: 597 WGAYEGLCLARRRGLINVELQIDSLAVVKTIGGESIGSNGGRSLTRR-IRRLIQEEWNVR 655

Query: 98  FQHCFKEANRSGHALA 51
            +H ++EAN+   ALA
Sbjct: 656 IRHVYREANKVADALA 671


>dbj|GAU44140.1| hypothetical protein TSUD_188010 [Trifolium subterraneum]
          Length = 200

 Score = 57.0 bits (136), Expect = 3e-07
 Identities = 26/76 (34%), Positives = 46/76 (60%)
 Frame = -3

Query: 278 WGIYEGLMLAKSQGYKQLIVEIDSAEIVTNL*KGVLGSSEGWNMMRRIIKEIKSSNWNVQ 99
           WG++EGL LA+  G++++ V IDS  +V  +  G L +  GW+++   I+++   +W V 
Sbjct: 109 WGVFEGLTLARRMGFRKVEVHIDSVVVVQVITTGKLHNKIGWSLVLN-IRKLLELDWEVI 167

Query: 98  FQHCFKEANRSGHALA 51
             H ++E N+   ALA
Sbjct: 168 IAHAYRETNKCADALA 183


Top