BLASTX nr result

ID: Astragalus22_contig00017917 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00017917
         (563 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU10400.1| hypothetical protein TSUD_423410, partial [Trifo...    89   4e-18
dbj|GAU33259.1| hypothetical protein TSUD_333820 [Trifolium subt...    87   2e-17
dbj|GAU10666.1| hypothetical protein TSUD_425710, partial [Trifo...    83   8e-17
dbj|GAU48971.1| hypothetical protein TSUD_188180 [Trifolium subt...    80   3e-16
dbj|GAU16646.1| hypothetical protein TSUD_325960 [Trifolium subt...    82   3e-16
dbj|GAU23820.1| hypothetical protein TSUD_27290 [Trifolium subte...    82   4e-16
dbj|GAU34195.1| hypothetical protein TSUD_162960 [Trifolium subt...    82   4e-16
dbj|GAU47648.1| hypothetical protein TSUD_27720 [Trifolium subte...    83   3e-15
dbj|GAU27944.1| hypothetical protein TSUD_146590 [Trifolium subt...    80   5e-15
dbj|GAU17063.1| hypothetical protein TSUD_105620 [Trifolium subt...    82   5e-15
gb|PNY12436.1| ribonuclease H [Trifolium pratense]                     78   6e-15
gb|PNX92765.1| ribonuclease H [Trifolium pratense]                     82   7e-15
dbj|GAU14768.1| hypothetical protein TSUD_204170 [Trifolium subt...    82   1e-14
gb|PNX57966.1| ribonuclease H [Trifolium pratense]                     78   2e-14
gb|PNX59771.1| ribonuclease H [Trifolium pratense] >gi|133521531...    75   3e-14
dbj|GAU50297.1| hypothetical protein TSUD_288310 [Trifolium subt...    80   5e-14
dbj|GAU31243.1| hypothetical protein TSUD_149290 [Trifolium subt...    76   6e-14
dbj|GAU51939.1| hypothetical protein TSUD_417210 [Trifolium subt...    75   2e-13
gb|KHN12872.1| Putative ribonuclease H protein, partial [Glycine...    74   2e-13
gb|PNX95949.1| ribonuclease H [Trifolium pratense]                     74   2e-13

>dbj|GAU10400.1| hypothetical protein TSUD_423410, partial [Trifolium subterraneum]
          Length = 284

 Score = 89.4 bits (220), Expect = 4e-18
 Identities = 40/76 (52%), Positives = 54/76 (71%)
 Frame = +2

Query: 2   LICYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKV 181
           L CYSDSK  LDL+S   NS++ YA +I +IQD+L   W   L H LREGN CADFLAK+
Sbjct: 191 LFCYSDSKTVLDLLSKERNSFHCYAAIIANIQDLLVLEWDVSLKHSLREGNFCADFLAKL 250

Query: 182 GASGNCKLLVFDNPPP 229
           G++ + K  ++++PPP
Sbjct: 251 GSANDEKFFIWESPPP 266


>dbj|GAU33259.1| hypothetical protein TSUD_333820 [Trifolium subterraneum]
          Length = 284

 Score = 87.4 bits (215), Expect = 2e-17
 Identities = 39/76 (51%), Positives = 54/76 (71%)
 Frame = +2

Query: 2   LICYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKV 181
           L CYSDSK  LDL+S   NS++ YA +I +IQD+L   W   L H +REGN CADFLAK+
Sbjct: 191 LFCYSDSKTVLDLLSKERNSFHCYAAIIANIQDLLVLEWDVSLKHSVREGNFCADFLAKL 250

Query: 182 GASGNCKLLVFDNPPP 229
           G++ + K  ++++PPP
Sbjct: 251 GSANDEKFSIWESPPP 266


>dbj|GAU10666.1| hypothetical protein TSUD_425710, partial [Trifolium subterraneum]
          Length = 157

 Score = 83.2 bits (204), Expect = 8e-17
 Identities = 40/62 (64%), Positives = 45/62 (72%)
 Frame = +2

Query: 2   LICYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKV 181
           L CYSDSK AL LI    N W+ YA +I +I+DILS NWR +L H LREGN CADFLAK 
Sbjct: 76  LYCYSDSKTALKLIYDHVNEWHHYAAIIYNIKDILSRNWRVRLVHTLREGNNCADFLAKF 135

Query: 182 GA 187
           GA
Sbjct: 136 GA 137


>dbj|GAU48971.1| hypothetical protein TSUD_188180 [Trifolium subterraneum]
          Length = 111

 Score = 80.5 bits (197), Expect = 3e-16
 Identities = 35/78 (44%), Positives = 55/78 (70%)
 Frame = +2

Query: 2   LICYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKV 181
           ++CYSDS+  LDLI    + ++ YA +I +IQD+L +NW   L H LREGN  ADFLAK+
Sbjct: 18  ILCYSDSQTVLDLILKGHSIYHCYAAVITNIQDMLKFNWNVTLCHSLREGNFSADFLAKL 77

Query: 182 GASGNCKLLVFDNPPPVI 235
           G++ + K+ ++++PP  +
Sbjct: 78  GSANDTKIKIWESPPKAL 95


>dbj|GAU16646.1| hypothetical protein TSUD_325960 [Trifolium subterraneum]
          Length = 157

 Score = 81.6 bits (200), Expect = 3e-16
 Identities = 39/62 (62%), Positives = 44/62 (70%)
 Frame = +2

Query: 2   LICYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKV 181
           L CYSDSK AL LI    N W+ YA +I +I+D LS NWR +L H LREGN CADFLAK 
Sbjct: 65  LCCYSDSKTALKLIYDHVNEWHHYAAIIYNIKDFLSRNWRVRLVHTLREGNNCADFLAKF 124

Query: 182 GA 187
           GA
Sbjct: 125 GA 126


>dbj|GAU23820.1| hypothetical protein TSUD_27290 [Trifolium subterraneum]
          Length = 168

 Score = 81.6 bits (200), Expect = 4e-16
 Identities = 36/62 (58%), Positives = 47/62 (75%)
 Frame = +2

Query: 2   LICYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKV 181
           L CYSDS+ A+ L+S+  N W+ YA +I +I+D+L+ NWR +L H LREGN CADFLAK 
Sbjct: 76  LWCYSDSRTAIKLLSNHVNEWHHYAAIIYNIKDLLTRNWRVKLMHTLREGNTCADFLAKF 135

Query: 182 GA 187
           GA
Sbjct: 136 GA 137


>dbj|GAU34195.1| hypothetical protein TSUD_162960 [Trifolium subterraneum]
          Length = 168

 Score = 81.6 bits (200), Expect = 4e-16
 Identities = 39/62 (62%), Positives = 44/62 (70%)
 Frame = +2

Query: 2   LICYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKV 181
           L CYSDSK AL LI    N W+ YA +I +I+D LS NWR +L H LREGN CADFLAK 
Sbjct: 76  LCCYSDSKTALKLIYDHVNEWHHYAAIIYNIKDFLSRNWRVRLVHTLREGNNCADFLAKF 135

Query: 182 GA 187
           GA
Sbjct: 136 GA 137


>dbj|GAU47648.1| hypothetical protein TSUD_27720 [Trifolium subterraneum]
          Length = 521

 Score = 83.2 bits (204), Expect = 3e-15
 Identities = 37/75 (49%), Positives = 50/75 (66%)
 Frame = +2

Query: 2   LICYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKV 181
           L CYSDS+ A+ LI+   + W+ YA ++ +I+DIL+  WR  + H  REGN CAD+LAK+
Sbjct: 429 LWCYSDSETAIKLITEPVDEWHHYAAILLNIKDILAREWRVNIAHTFREGNACADYLAKL 488

Query: 182 GASGNCKLLVFDNPP 226
           GA  N  L V  NPP
Sbjct: 489 GACNNEALSVMTNPP 503


>dbj|GAU27944.1| hypothetical protein TSUD_146590 [Trifolium subterraneum]
          Length = 204

 Score = 79.7 bits (195), Expect = 5e-15
 Identities = 35/62 (56%), Positives = 46/62 (74%)
 Frame = +2

Query: 2   LICYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKV 181
           L CYSDSK A+ L+S   N W++YA +I +I+D+ + NWR +L H LR+GN CADFLAK 
Sbjct: 112 LWCYSDSKTAIKLLSDHVNEWHQYAAIIYNIKDLFTRNWRVRLMHTLRKGNTCADFLAKF 171

Query: 182 GA 187
           GA
Sbjct: 172 GA 173


>dbj|GAU17063.1| hypothetical protein TSUD_105620 [Trifolium subterraneum]
          Length = 440

 Score = 82.4 bits (202), Expect = 5e-15
 Identities = 39/76 (51%), Positives = 51/76 (67%)
 Frame = +2

Query: 2   LICYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKV 181
           LICYSDSK A+ LI    N W+ +A ++Q+I+DIL+ +WR  + H LREGN CAD+LAK 
Sbjct: 348 LICYSDSKTAIKLIGDPINEWHHFAAILQNIKDILARDWRVTVAHTLREGNACADYLAKF 407

Query: 182 GASGNCKLLVFDNPPP 229
           GA  N K+      PP
Sbjct: 408 GAQ-NIKVFSTMTTPP 422


>gb|PNY12436.1| ribonuclease H [Trifolium pratense]
          Length = 137

 Score = 77.8 bits (190), Expect = 6e-15
 Identities = 38/76 (50%), Positives = 53/76 (69%)
 Frame = +2

Query: 2   LICYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKV 181
           + CYSDS N+L+LI  +   ++ YA LIQ+I+D+L       L H LREGN CADF+AK+
Sbjct: 44  IACYSDSLNSLNLIQYSIPRFHIYAVLIQNIKDLLLGFGTTTLIHSLREGNSCADFMAKL 103

Query: 182 GASGNCKLLVFDNPPP 229
           GAS N ++L+  +PPP
Sbjct: 104 GASSNVEVLIHSSPPP 119


>gb|PNX92765.1| ribonuclease H [Trifolium pratense]
          Length = 1310

 Score = 82.4 bits (202), Expect = 7e-15
 Identities = 37/73 (50%), Positives = 53/73 (72%)
 Frame = +2

Query: 8    CYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKVGA 187
            CYSDS+  +D IS   NS++RYA +I SI+D+L  +W  +L H LREGN  ADFLAK+G+
Sbjct: 1219 CYSDSQTVVDAISKDLNSFHRYAAVIASIKDLLQLDWEVRLSHSLREGNAGADFLAKIGS 1278

Query: 188  SGNCKLLVFDNPP 226
            + + KL  +++PP
Sbjct: 1279 ANDDKLTFWESPP 1291


>dbj|GAU14768.1| hypothetical protein TSUD_204170 [Trifolium subterraneum]
          Length = 503

 Score = 81.6 bits (200), Expect = 1e-14
 Identities = 35/78 (44%), Positives = 55/78 (70%)
 Frame = +2

Query: 2   LICYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKV 181
           ++CYSDS+  LDLI    + ++ YA +I +IQD+L +NW   L H LREGN  ADFLAK+
Sbjct: 410 ILCYSDSQTVLDLILKGHSIYHCYAAVITNIQDMLKFNWNVTLSHSLREGNFSADFLAKL 469

Query: 182 GASGNCKLLVFDNPPPVI 235
           G++ + K+ ++++PP  +
Sbjct: 470 GSANDTKIKIWESPPEAL 487


>gb|PNX57966.1| ribonuclease H [Trifolium pratense]
          Length = 192

 Score = 77.8 bits (190), Expect = 2e-14
 Identities = 33/75 (44%), Positives = 49/75 (65%)
 Frame = +2

Query: 2   LICYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKV 181
           L+C+SDS  A+ LI +  + ++ YA  I  I+ ++  +W   +DH LREGN CADFLAK+
Sbjct: 99  LVCFSDSLQAVSLIKNGVSPYHTYANEIHKIRQLIGRDWNVSIDHTLREGNACADFLAKL 158

Query: 182 GASGNCKLLVFDNPP 226
           GAS    L++ + PP
Sbjct: 159 GASSKSSLVILEAPP 173


>gb|PNX59771.1| ribonuclease H [Trifolium pratense]
 gb|PNX74266.1| ribonuclease H [Trifolium pratense]
          Length = 111

 Score = 75.1 bits (183), Expect = 3e-14
 Identities = 31/74 (41%), Positives = 50/74 (67%)
 Frame = +2

Query: 2   LICYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKV 181
           + CY D++  LDL++   ++++ YA +I +IQD+L  +W   L H LREGN C DFL K+
Sbjct: 18  IFCYFDAQTVLDLVTKGYSNFHCYAAVIANIQDLLKLDWEVSLLHTLREGNACTDFLTKL 77

Query: 182 GASGNCKLLVFDNP 223
           G+  + KL ++D+P
Sbjct: 78  GSKNDTKLSIWDSP 91


>dbj|GAU50297.1| hypothetical protein TSUD_288310 [Trifolium subterraneum]
          Length = 545

 Score = 79.7 bits (195), Expect = 5e-14
 Identities = 38/62 (61%), Positives = 44/62 (70%)
 Frame = +2

Query: 2   LICYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKV 181
           L CYSDSK AL LI    N W++YA +I +I+D LS NWR +L HMLREGN CAD L K 
Sbjct: 453 LCCYSDSKTALKLIYDHVNEWHQYAAIIYNIKDFLSRNWRVRLVHMLREGNNCADILDKF 512

Query: 182 GA 187
           GA
Sbjct: 513 GA 514


>dbj|GAU31243.1| hypothetical protein TSUD_149290 [Trifolium subterraneum]
          Length = 168

 Score = 75.9 bits (185), Expect = 6e-14
 Identities = 35/62 (56%), Positives = 44/62 (70%)
 Frame = +2

Query: 2   LICYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKV 181
           L CYSDSK A+ L+S   N W++YA +I +I++ LS NWR +L H LREGN C  FLAK 
Sbjct: 76  LCCYSDSKTAIKLLSDHVNEWHQYAAIIYNIKNFLSRNWRVRLVHTLREGNNCTYFLAKF 135

Query: 182 GA 187
           GA
Sbjct: 136 GA 137


>dbj|GAU51939.1| hypothetical protein TSUD_417210 [Trifolium subterraneum]
          Length = 181

 Score = 75.1 bits (183), Expect = 2e-13
 Identities = 34/75 (45%), Positives = 46/75 (61%)
 Frame = +2

Query: 2   LICYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKV 181
           L CYSDS  A+ LI+   + W+ YA ++ SI+DIL+ +W+  + H  REGN CAD+LAK 
Sbjct: 89  LSCYSDSATAIKLITEPVDVWHHYAAILNSIKDILNRDWQVSIFHTFREGNACADYLAKH 148

Query: 182 GASGNCKLLVFDNPP 226
           GA  N        PP
Sbjct: 149 GAHNNIVFTTIAIPP 163


>gb|KHN12872.1| Putative ribonuclease H protein, partial [Glycine soja]
          Length = 151

 Score = 74.3 bits (181), Expect = 2e-13
 Identities = 38/73 (52%), Positives = 45/73 (61%)
 Frame = +2

Query: 8   CYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKVGA 187
           C SDSK AL LIS   NS + YA +IQ IQD    +W    +H  REGN+CAD LAK G+
Sbjct: 67  CESDSKLALQLISEGRNSLHPYASIIQKIQDFKLLHWDLHFNHTFREGNMCADELAKTGS 126

Query: 188 SGNCKLLVFDNPP 226
           S  C L VF+  P
Sbjct: 127 SLQCNLQVFNGCP 139


>gb|PNX95949.1| ribonuclease H [Trifolium pratense]
          Length = 157

 Score = 74.3 bits (181), Expect = 2e-13
 Identities = 33/75 (44%), Positives = 51/75 (68%)
 Frame = +2

Query: 2   LICYSDSKNALDLISSATNSWNRYAPLIQSIQDILSWNWRAQLDHMLREGNVCADFLAKV 181
           +ICYSDS N + L+++     + YA ++Q ++++++ NW  QL H LREGN  ADFLAK+
Sbjct: 64  IICYSDSLNVVKLVTAPITPMHLYAAILQEVKNLMNRNWTVQLRHTLREGNQSADFLAKM 123

Query: 182 GASGNCKLLVFDNPP 226
           G+S + KL +   PP
Sbjct: 124 GSSCHDKLKIISAPP 138

