BLASTX nr result

ID: Astragalus22_contig00034695 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00034695
         (436 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNY00696.1| ribonuclease H [Trifolium pratense]                     92   7e-20
dbj|GAU30604.1| hypothetical protein TSUD_392950 [Trifolium subt...    90   4e-19
gb|PNY06182.1| ribonuclease H [Trifolium pratense]                     90   4e-18
dbj|GAU50453.1| hypothetical protein TSUD_373190 [Trifolium subt...    85   5e-18
gb|PNX55023.1| ribonuclease H [Trifolium pratense]                     85   8e-18
gb|PNY12327.1| ribonuclease H [Trifolium pratense]                     88   1e-17
dbj|GAU43826.1| hypothetical protein TSUD_399190 [Trifolium subt...    89   1e-17
dbj|GAU48590.1| hypothetical protein TSUD_405800 [Trifolium subt...    88   2e-17
gb|PNX91284.1| ribonuclease H [Trifolium pratense]                     82   8e-17
gb|PNX62306.1| ribonuclease H, partial [Trifolium pratense]            80   8e-17
gb|PNX79929.1| ribonuclease H, partial [Trifolium pratense]            85   2e-16
dbj|GAU14017.1| hypothetical protein TSUD_168420 [Trifolium subt...    81   4e-16
dbj|GAU38338.1| hypothetical protein TSUD_61990 [Trifolium subte...    84   6e-16
dbj|GAU36544.1| hypothetical protein TSUD_277500 [Trifolium subt...    79   6e-16
gb|KYP32512.1| hypothetical protein KK1_046788 [Cajanus cajan]         78   6e-16
dbj|GAU43502.1| hypothetical protein TSUD_398950 [Trifolium subt...    83   1e-15
dbj|GAU42972.1| hypothetical protein TSUD_188450 [Trifolium subt...    77   3e-15
dbj|GAU36430.1| hypothetical protein TSUD_19650 [Trifolium subte...    78   4e-15
dbj|GAU35983.1| hypothetical protein TSUD_207870 [Trifolium subt...    77   4e-15
dbj|GAU17428.1| hypothetical protein TSUD_233050 [Trifolium subt...    78   5e-15

>gb|PNY00696.1| ribonuclease H [Trifolium pratense]
          Length = 276

 Score = 92.4 bits (228), Expect = 7e-20
 Identities = 47/110 (42%), Positives = 73/110 (66%)
 Frame = -2

Query: 342 LSYLNQSRLGYMNIELQMDSVVIVNLLKNGTFGSISGRNLLRRIKYYLSFFSNVRIVHIY 163
           LSY    RLG+M +EL +DSV +VN+L  GT  S++   L+R I+  ++    V IVH Y
Sbjct: 169 LSYAR--RLGFMKVELNIDSVTVVNVLTKGTLQSLARAMLVRNIRSLIALDWEVSIVHAY 226

Query: 162 REANRVVDGLANVGCSRNEGLVVYSHPPSDIQLLVDSDVLRVSIPRLISV 13
           RE+N+  D L N+GC+ ++ ++VY   PS+I+ L+ +DVL ++ PRL+ V
Sbjct: 227 RESNQCADALVNIGCTLDKEIIVYDDCPSEIKDLLLADVLGITTPRLLHV 276


>dbj|GAU30604.1| hypothetical protein TSUD_392950 [Trifolium subterraneum]
          Length = 233

 Score = 89.7 bits (221), Expect = 4e-19
 Identities = 48/138 (34%), Positives = 85/138 (61%), Gaps = 1/138 (0%)
 Frame = -2

Query: 423 DSHWILFN*NNTDLMAETDLIIKTKDFLSYLNQSR-LGYMNIELQMDSVVIVNLLKNGTF 247
           D HWI     +  L + T  + +    L  ++ +R +G+  +E+QMDS +IV+++     
Sbjct: 98  DGHWI--RGFSKSLGSATAYVAELWGLLEGISIARSMGFNKLEVQMDSEIIVSIINKHGH 155

Query: 246 GSISGRNLLRRIKYYLSFFSNVRIVHIYREANRVVDGLANVGCSRNEGLVVYSHPPSDIQ 67
           G++SG +++++I+  LS   +V+I H YREANR  D LAN+GC  N G ++Y+ P ++++
Sbjct: 156 GNVSGWSIIKKIRSLLSLDWSVKICHFYREANRCADMLANMGCVHNHGTLIYNQPLTNLR 215

Query: 66  LLVDSDVLRVSIPRLISV 13
            L+D D   VS  RL+++
Sbjct: 216 QLLDDDNRGVSFLRLVAL 233


>gb|PNY06182.1| ribonuclease H [Trifolium pratense]
          Length = 686

 Score = 90.1 bits (222), Expect = 4e-18
 Identities = 46/103 (44%), Positives = 70/103 (67%)
 Frame = -2

Query: 321 RLGYMNIELQMDSVVIVNLLKNGTFGSISGRNLLRRIKYYLSFFSNVRIVHIYREANRVV 142
           R  + NIELQ+DS+V+V  +K    GS SGR LL RI+  ++   NVRI H+YREAN+V 
Sbjct: 585 RKSFNNIELQVDSLVVVRGIKGEEVGSASGRILLNRIRQLMNMDWNVRISHVYREANKVA 644

Query: 141 DGLANVGCSRNEGLVVYSHPPSDIQLLVDSDVLRVSIPRLISV 13
           D +A +GC+  +G   ++ PP++++ L   DV+ VS PR+I++
Sbjct: 645 DAIAALGCT-TQGFSYFNTPPANLERLCLDDVMGVSTPRIITL 686


>dbj|GAU50453.1| hypothetical protein TSUD_373190 [Trifolium subterraneum]
          Length = 167

 Score = 85.1 bits (209), Expect = 5e-18
 Identities = 47/100 (47%), Positives = 60/100 (60%)
 Frame = -2

Query: 321 RLGYMNIELQMDSVVIVNLLKNGTFGSISGRNLLRRIKYYLSFFSNVRIVHIYREANRVV 142
           RLG+  IEL +DS V+V +LKNGT  S  G +LL+ IK  L+    V I H YREAN+  
Sbjct: 68  RLGFKKIELNIDSEVVVRVLKNGTSNSAMGSSLLKHIKNLLALDWMVEISHSYREANKRA 127

Query: 141 DGLANVGCSRNEGLVVYSHPPSDIQLLVDSDVLRVSIPRL 22
           D  AN+GCS +   V Y   P  I+ L D+D+   S PRL
Sbjct: 128 DARANIGCSNSYDTVFYDWCPELIRNLYDADIQGSSTPRL 167


>gb|PNX55023.1| ribonuclease H [Trifolium pratense]
          Length = 171

 Score = 84.7 bits (208), Expect = 8e-18
 Identities = 43/101 (42%), Positives = 65/101 (64%)
 Frame = -2

Query: 321 RLGYMNIELQMDSVVIVNLLKNGTFGSISGRNLLRRIKYYLSFFSNVRIVHIYREANRVV 142
           R GY NIELQ+DS V+V+ L+    G+  GR L+ RI+  +    NVR+ H+YR+AN+V 
Sbjct: 70  RQGYTNIELQVDSSVLVSGLEGVEVGNAHGRILISRIRRLIQMNRNVRVSHVYRKANKVA 129

Query: 141 DGLANVGCSRNEGLVVYSHPPSDIQLLVDSDVLRVSIPRLI 19
           D +A++GC   +G   +  PP+ ++ L   DV+ VS PR+I
Sbjct: 130 DAIASLGCEM-QGFSYFDAPPTSLEQLCIDDVMGVSTPRVI 169


>gb|PNY12327.1| ribonuclease H [Trifolium pratense]
          Length = 370

 Score = 87.8 bits (216), Expect = 1e-17
 Identities = 45/101 (44%), Positives = 70/101 (69%)
 Frame = -2

Query: 315 GYMNIELQMDSVVIVNLLKNGTFGSISGRNLLRRIKYYLSFFSNVRIVHIYREANRVVDG 136
           G MN+ELQ+DS+ +V  L+  + GS  GR+L+RRI+  L  + NVR+ H+YREAN+V D 
Sbjct: 272 GLMNVELQIDSLAVVKNLEGKSIGSNGGRSLIRRIQCLLQGW-NVRVRHVYREANKVADA 330

Query: 135 LANVGCSRNEGLVVYSHPPSDIQLLVDSDVLRVSIPRLISV 13
           LA++GC ++ G +++  PP  I  L   D LRV+ PR++++
Sbjct: 331 LASIGC-QSVGCIMFDIPPVGIDQLCLDDRLRVTTPRIVAL 370


>dbj|GAU43826.1| hypothetical protein TSUD_399190 [Trifolium subterraneum]
          Length = 1071

 Score = 88.6 bits (218), Expect = 1e-17
 Identities = 39/101 (38%), Positives = 67/101 (66%)
 Frame = -2

Query: 321  RLGYMNIELQMDSVVIVNLLKNGTFGSISGRNLLRRIKYYLSFFSNVRIVHIYREANRVV 142
            RLGY  I+L +DS+ +  +L +G+  ++ G+NL++ I+  L     V + H YR+AN   
Sbjct: 761  RLGYQAIDLNVDSLAVKQVLTSGSSNNLLGQNLVKNIRRLLELNWKVTVEHSYRKANTCA 820

Query: 141  DGLANVGCSRNEGLVVYSHPPSDIQLLVDSDVLRVSIPRLI 19
            D LAN+GCS +  ++ Y +PP+ ++ L+++D LR++ PRLI
Sbjct: 821  DALANIGCSLDYNIIFYDNPPTQLRHLLEADALRITTPRLI 861


>dbj|GAU48590.1| hypothetical protein TSUD_405800 [Trifolium subterraneum]
          Length = 818

 Score = 88.2 bits (217), Expect = 2e-17
 Identities = 48/101 (47%), Positives = 67/101 (66%)
 Frame = -2

Query: 315  GYMNIELQMDSVVIVNLLKNGTFGSISGRNLLRRIKYYLSFFSNVRIVHIYREANRVVDG 136
            GY+N+EL+MDS+V+V  L     GS+ G  L+RRIK  L    NV I+H+YREAN+VVD 
Sbjct: 719  GYVNVELRMDSLVVVRCLNGEEVGSVDGMKLIRRIKDLLLEDWNVCIIHVYREANKVVDA 778

Query: 135  LANVGCSRNEGLVVYSHPPSDIQLLVDSDVLRVSIPRLISV 13
            LA +GC  + G+  +  P  +I+ L  +DV+ VS PR+I V
Sbjct: 779  LAALGCD-SIGISYFEDPLVEIEHLCLADVMGVSTPRVIFV 818


>gb|PNX91284.1| ribonuclease H [Trifolium pratense]
          Length = 178

 Score = 82.4 bits (202), Expect = 8e-17
 Identities = 44/94 (46%), Positives = 63/94 (67%), Gaps = 1/94 (1%)
 Frame = -2

Query: 306 NIELQMDSVVIVNLLKNGTFGSISGRNLLRRIKYYLSFFSNVRIVHIYREANRVVDGLAN 127
           ++E+Q+DS V+V  L+ G  GS +G +L+++IK  L++  NV+I+H+YREANR  D LAN
Sbjct: 85  HLEVQLDSKVVVCSLQEGKLGSAAGWSLIKKIKELLNYSWNVKIIHVYREANRCADILAN 144

Query: 126 VGCSRNEGLVVYSHP-PSDIQLLVDSDVLRVSIP 28
           +GC      +VY HP P  IQ+L D D   VS P
Sbjct: 145 IGCMSLVDSIVYEHPSPELIQVLAD-DCRGVSFP 177


>gb|PNX62306.1| ribonuclease H, partial [Trifolium pratense]
          Length = 110

 Score = 80.5 bits (197), Expect = 8e-17
 Identities = 44/99 (44%), Positives = 66/99 (66%)
 Frame = -2

Query: 315 GYMNIELQMDSVVIVNLLKNGTFGSISGRNLLRRIKYYLSFFSNVRIVHIYREANRVVDG 136
           G MN+ELQ+DS+ +V  L+  + GS  GR+L+RRI+  L  + NVR+ HIYREAN+V D 
Sbjct: 9   GLMNVELQIDSLAVVKNLEGKSIGSNGGRSLIRRIQCLLQGW-NVRVRHIYREANKVADA 67

Query: 135 LANVGCSRNEGLVVYSHPPSDIQLLVDSDVLRVSIPRLI 19
           LA++GC ++ G +++  P   I  L  +D L V+ P L+
Sbjct: 68  LASIGC-QSVGCIMFDIPSIGIDQLCLADRLGVTTPELL 105


>gb|PNX79929.1| ribonuclease H, partial [Trifolium pratense]
          Length = 709

 Score = 85.1 bits (209), Expect = 2e-16
 Identities = 42/103 (40%), Positives = 68/103 (66%)
 Frame = -2

Query: 321 RLGYMNIELQMDSVVIVNLLKNGTFGSISGRNLLRRIKYYLSFFSNVRIVHIYREANRVV 142
           R G +N+ELQ+DS+ +V  +   + GS  GR+L RRI+  +    NVRI H+YREAN+V 
Sbjct: 608 RRGLINVELQIDSLAVVKTIGGESIGSNGGRSLTRRIRRLIQEEWNVRIRHVYREANKVA 667

Query: 141 DGLANVGCSRNEGLVVYSHPPSDIQLLVDSDVLRVSIPRLISV 13
           D LA++GC ++ G +++  PP+ +  L  +D L V+ PR +++
Sbjct: 668 DALASIGC-QSVGCILFDDPPAGVDQLCFADRLGVTTPRSVAL 709


>dbj|GAU14017.1| hypothetical protein TSUD_168420 [Trifolium subterraneum]
          Length = 211

 Score = 81.3 bits (199), Expect = 4e-16
 Identities = 41/110 (37%), Positives = 71/110 (64%)
 Frame = -2

Query: 342 LSYLNQSRLGYMNIELQMDSVVIVNLLKNGTFGSISGRNLLRRIKYYLSFFSNVRIVHIY 163
           LSY+   RLG+M +EL +D V++V+++  G   S  G  L+R I+  +     V IVH Y
Sbjct: 104 LSYVR--RLGFMAVELNIDLVMVVHVITKGILQSPVGAMLVRHIQRLIDLDCEVNIVHAY 161

Query: 162 REANRVVDGLANVGCSRNEGLVVYSHPPSDIQLLVDSDVLRVSIPRLISV 13
           RE+N+  D LA++GC+ ++ ++ Y+  P +I+ L+ +DV+ ++ PR+I V
Sbjct: 162 RESNQCADALASIGCTLDKEIIYYNDCPLEIKELLLADVMGITTPRMIPV 211


>dbj|GAU38338.1| hypothetical protein TSUD_61990 [Trifolium subterraneum]
          Length = 813

 Score = 84.0 bits (206), Expect = 6e-16
 Identities = 45/110 (40%), Positives = 70/110 (63%)
 Frame = -2

Query: 342  LSYLNQSRLGYMNIELQMDSVVIVNLLKNGTFGSISGRNLLRRIKYYLSFFSNVRIVHIY 163
            LSY++  RLG+  + L +DS V+V ++KNG+  S +G +LL +I   L     V + H Y
Sbjct: 706  LSYVH--RLGFRKVVLHIDSEVVVRVIKNGSSDSSAGSSLLTQIWRLLEMDWIVEVSHTY 763

Query: 162  REANRVVDGLANVGCSRNEGLVVYSHPPSDIQLLVDSDVLRVSIPRLISV 13
            REAN   D LAN+GCS +   V+++  P  I+ + D+D++ +S PRLIS+
Sbjct: 764  REANNCADALANLGCSLDYDTVIFNDFPPQIRNIFDTDLMGISSPRLISL 813


>dbj|GAU36544.1| hypothetical protein TSUD_277500 [Trifolium subterraneum]
          Length = 147

 Score = 79.3 bits (194), Expect = 6e-16
 Identities = 43/100 (43%), Positives = 60/100 (60%)
 Frame = -2

Query: 315 GYMNIELQMDSVVIVNLLKNGTFGSISGRNLLRRIKYYLSFFSNVRIVHIYREANRVVDG 136
           G   + +Q DSVVIV  L+ G+ GS +G  L ++IK  L+    VRI+H+YREAN   D 
Sbjct: 47  GIGKLIVQSDSVVIVKSLQTGSEGSATGWMLFKKIKQLLTLNWEVRIIHVYREANSCADI 106

Query: 135 LANVGCSRNEGLVVYSHPPSDIQLLVDSDVLRVSIPRLIS 16
           +A  GCS      +YS+PP+ +   V +D   VS PRL+S
Sbjct: 107 MAGQGCSLQGDEEIYSNPPTVVLQCVSNDARGVSFPRLVS 146


>gb|KYP32512.1| hypothetical protein KK1_046788 [Cajanus cajan]
          Length = 110

 Score = 78.2 bits (191), Expect = 6e-16
 Identities = 40/106 (37%), Positives = 58/106 (54%)
 Frame = -2

Query: 318 LGYMNIELQMDSVVIVNLLKNGTFGSISGRNLLRRIKYYLSFFSNVRIVHIYREANRVVD 139
           +G   +EL +D + +V  +K  T GS+ G  L + + Y  S      + H+YREA +  D
Sbjct: 3   IGMRALELHLDLLTVVKSIKGETAGSVHGGRLFQAVPYLQSLDRQASVKHVYREARKCAD 62

Query: 138 GLANVGCSRNEGLVVYSHPPSDIQLLVDSDVLRVSIPRLISV*SFL 1
            LAN+ C +   LV+   PP  I  L+  DVL V  PRL+S+ SFL
Sbjct: 63  SLANMACYKGNNLVISESPPLHISHLLLIDVLGVCTPRLVSLLSFL 108


>dbj|GAU43502.1| hypothetical protein TSUD_398950 [Trifolium subterraneum]
          Length = 1962

 Score = 82.8 bits (203), Expect = 1e-15
 Identities = 39/101 (38%), Positives = 65/101 (64%)
 Frame = -2

Query: 321  RLGYMNIELQMDSVVIVNLLKNGTFGSISGRNLLRRIKYYLSFFSNVRIVHIYREANRVV 142
            R+G+ N+EL +DS ++V+ L +GT  S+ G  ++R+++  L    NVR+ H YREAN+  
Sbjct: 1860 RMGFANVELSIDSKIVVHALTSGTATSVDGYAIVRKVRRLLLLDWNVRVTHEYREANKCA 1919

Query: 141  DGLANVGCSRNEGLVVYSHPPSDIQLLVDSDVLRVSIPRLI 19
            D LAN+GC+ +     +   P++I+ ++ +D L  S PRLI
Sbjct: 1920 DALANIGCTLDMECTYFQECPAEIRHILLADELGTSSPRLI 1960


>dbj|GAU42972.1| hypothetical protein TSUD_188450 [Trifolium subterraneum]
          Length = 145

 Score = 77.4 bits (189), Expect = 3e-15
 Identities = 40/99 (40%), Positives = 59/99 (59%)
 Frame = -2

Query: 315 GYMNIELQMDSVVIVNLLKNGTFGSISGRNLLRRIKYYLSFFSNVRIVHIYREANRVVDG 136
           G+  IEL +DS V+V  L +   GS+ G  +++ I+  L+   +V+I H YREAN   D 
Sbjct: 45  GHKKIELHIDSNVVVQTLHSARDGSVVGWRIIQEIRRLLALDWDVKICHSYREANACADA 104

Query: 135 LANVGCSRNEGLVVYSHPPSDIQLLVDSDVLRVSIPRLI 19
           LAN+GC    GL VY   P  I  L+ +DV+ ++ PR+I
Sbjct: 105 LANLGCDHGPGLRVYEQCPPKISSLLLADVMGITTPRVI 143


>dbj|GAU36430.1| hypothetical protein TSUD_19650 [Trifolium subterraneum]
          Length = 174

 Score = 77.8 bits (190), Expect = 4e-15
 Identities = 37/103 (35%), Positives = 67/103 (65%)
 Frame = -2

Query: 336 YLNQSRLGYMNIELQMDSVVIVNLLKNGTFGSISGRNLLRRIKYYLSFFSNVRIVHIYRE 157
           +L    LG+  +E  +DS+V+V+ +KNG   S  G++L+++I+  L    +++IVH+YRE
Sbjct: 71  FLYARSLGFTAVESNIDSIVVVSAIKNGRKSSSIGKSLVKQIRRSLELDWDIKIVHVYRE 130

Query: 156 ANRVVDGLANVGCSRNEGLVVYSHPPSDIQLLVDSDVLRVSIP 28
           +N+ VD LAN+GC+ +  ++ Y   P++I+ LV  D   ++ P
Sbjct: 131 SNKCVDALANIGCTLDCEVIHYDSFPNEIRNLVLDDERGITTP 173


>dbj|GAU35983.1| hypothetical protein TSUD_207870 [Trifolium subterraneum]
          Length = 159

 Score = 77.4 bits (189), Expect = 4e-15
 Identities = 40/101 (39%), Positives = 61/101 (60%)
 Frame = -2

Query: 315 GYMNIELQMDSVVIVNLLKNGTFGSISGRNLLRRIKYYLSFFSNVRIVHIYREANRVVDG 136
           G+  I L +DS V+V  L++   GS+ G  L++ I+  L     VRI H YRE+N  VD 
Sbjct: 59  GFKKIVLHVDSNVVVQTLQSDRDGSVVGWRLIQEIQRLLVMDWEVRICHSYRESNACVDA 118

Query: 135 LANVGCSRNEGLVVYSHPPSDIQLLVDSDVLRVSIPRLISV 13
           LAN+GC    G+ VY   P+ +  L+ +DV+ ++ PR+IS+
Sbjct: 119 LANLGCDHEPGMRVYEQCPTSLSSLLLADVMGITTPRVISL 159


>dbj|GAU17428.1| hypothetical protein TSUD_233050 [Trifolium subterraneum]
          Length = 196

 Score = 78.2 bits (191), Expect = 5e-15
 Identities = 38/103 (36%), Positives = 63/103 (61%)
 Frame = -2

Query: 321 RLGYMNIELQMDSVVIVNLLKNGTFGSISGRNLLRRIKYYLSFFSNVRIVHIYREANRVV 142
           RLG+M +E+++DS  +V  LK+    S  G  L +++   L    N+ I+HIYREAN+  
Sbjct: 52  RLGFMYVEMEIDSAAVVKALKDRCVKSPMGAALAKQVWQLLDMEWNIEILHIYREANKCA 111

Query: 141 DGLANVGCSRNEGLVVYSHPPSDIQLLVDSDVLRVSIPRLISV 13
           D +AN+GCS    +++Y+  P  ++ ++  D + +S PR ISV
Sbjct: 112 DAMANLGCSLGYDVILYADCPLPLREMLAFDNMGISTPRFISV 154


Top