BLASTX nr result

ID: Astragalus23_contig00030116

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00030116
         (401 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU31849.1| hypothetical protein TSUD_114600 [Trifolium subt...    81   2e-16
gb|PNX73497.1| ribonuclease H [Trifolium pratense]                     80   3e-16
dbj|GAU48831.1| hypothetical protein TSUD_190610 [Trifolium subt...    80   2e-15
gb|KRH39041.1| hypothetical protein GLYMA_09G173900 [Glycine max]      77   2e-15
dbj|GAU10807.1| hypothetical protein TSUD_424460, partial [Trifo...    74   6e-14
gb|PNX68108.1| ribonuclease H, partial [Trifolium pratense]            71   8e-14
gb|KHN25063.1| Putative ribonuclease H protein, partial [Glycine...    73   1e-13
gb|KHN25742.1| hypothetical protein glysoja_018316, partial [Gly...    70   3e-13
dbj|GAU35993.1| hypothetical protein TSUD_211300 [Trifolium subt...    73   3e-13
gb|KHN15187.1| hypothetical protein glysoja_011805 [Glycine soja]      70   7e-13
gb|ABN09101.1| Ribonuclease H [Medicago truncatula]                    71   9e-13
gb|KHN33901.1| Putative ribonuclease H protein, partial [Glycine...    70   1e-12
gb|KHN02177.1| Putative ribonuclease H protein [Glycine soja]          71   2e-12
gb|PNY08513.1| ribonuclease H [Trifolium pratense]                     69   4e-12
gb|KHN15930.1| Putative ribonuclease H protein [Glycine soja]          68   1e-11
ref|XP_019447242.1| PREDICTED: uncharacterized protein LOC109350...    68   2e-11
gb|PNX57406.1| hypothetical protein L195_g050384, partial [Trifo...    66   2e-11
dbj|GAU35798.1| hypothetical protein TSUD_155730 [Trifolium subt...    68   3e-11
gb|KRG89307.1| hypothetical protein GLYMA_20G015000 [Glycine max]      63   2e-10
gb|KHN08093.1| hypothetical protein glysoja_023703, partial [Gly...    63   4e-10

>dbj|GAU31849.1| hypothetical protein TSUD_114600 [Trifolium subterraneum]
          Length = 171

 Score = 80.9 bits (198), Expect = 2e-16
 Identities = 38/77 (49%), Positives = 47/77 (61%)
 Frame = -2

Query: 397 GVSHTHPYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMDYGLFRWPHCPR 218
           G S  HPYA L+  IR     DW ++F H+LREGN CADWLAK G++ +  L  W  CP 
Sbjct: 94  GTSQFHPYATLIQQIRHLHQGDWNVSFQHTLREGNECADWLAKTGASSNDTLKIWNSCPP 153

Query: 217 ALSSFLLADAMGITSIR 167
            LS  LLAD +G+   R
Sbjct: 154 QLSLVLLADVVGVARPR 170


>gb|PNX73497.1| ribonuclease H [Trifolium pratense]
          Length = 183

 Score = 80.5 bits (197), Expect = 3e-16
 Identities = 37/76 (48%), Positives = 48/76 (63%)
 Frame = -2

Query: 391 SHTHPYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMDYGLFRWPHCPRAL 212
           S  HPYAPL+  IR    +DW ++F  +LREGN CADWLAK G++ +  L  W  CP  L
Sbjct: 96  SQLHPYAPLIQQIRHLHRVDWNVSFHRTLREGNECADWLAKTGASSNDTLKIWNSCPPQL 155

Query: 211 SSFLLADAMGITSIRE 164
           S  LLAD +G+   R+
Sbjct: 156 SLVLLADIVGVARRRK 171


>dbj|GAU48831.1| hypothetical protein TSUD_190610 [Trifolium subterraneum]
          Length = 259

 Score = 80.1 bits (196), Expect = 2e-15
 Identities = 38/74 (51%), Positives = 50/74 (67%)
 Frame = -2

Query: 400 DGVSHTHPYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMDYGLFRWPHCP 221
           +GV  THPYAPLV+YI+S  + +W+L   H+LREGN  ADWLAK G+++      +  CP
Sbjct: 181 EGVPSTHPYAPLVNYIQSLIHKEWKLFLVHTLREGNASADWLAKLGASLVQEPKMFSICP 240

Query: 220 RALSSFLLADAMGI 179
             LSS  L D+MGI
Sbjct: 241 SPLSSICLVDSMGI 254


>gb|KRH39041.1| hypothetical protein GLYMA_09G173900 [Glycine max]
          Length = 135

 Score = 77.0 bits (188), Expect = 2e-15
 Identities = 35/73 (47%), Positives = 46/73 (63%)
 Frame = -2

Query: 385 THPYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMDYGLFRWPHCPRALSS 206
           THP A ++  I+ F  +D  L FSH+LRE NNC DWLAK G++ D     W  CP AL  
Sbjct: 61  THPCAAIIQAIKGFMQMDRNLIFSHTLREENNCVDWLAKIGAHNDTSFRLWNVCPAALGP 120

Query: 205 FLLADAMGITSIR 167
            +LADA+G ++ R
Sbjct: 121 LILADALGKSATR 133


>dbj|GAU10807.1| hypothetical protein TSUD_424460, partial [Trifolium subterraneum]
          Length = 170

 Score = 74.3 bits (181), Expect = 6e-14
 Identities = 31/49 (63%), Positives = 36/49 (73%)
 Frame = -2

Query: 397 GVSHTHPYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMD 251
           G S  HPYAPL+  IR F N+DW +AF H+LREGN CADWLAK G+  D
Sbjct: 121 GTSPLHPYAPLIKNIRQFQNMDWTIAFHHTLREGNECADWLAKKGATSD 169


>gb|PNX68108.1| ribonuclease H, partial [Trifolium pratense]
          Length = 72

 Score = 71.2 bits (173), Expect = 8e-14
 Identities = 33/71 (46%), Positives = 43/71 (60%)
 Frame = -2

Query: 379 PYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMDYGLFRWPHCPRALSSFL 200
           PYAP++  IR     DW ++F H+LREGN C DWLAK G++ +  L  W   P  LS  L
Sbjct: 1   PYAPIIQQIRHLHQGDWNVSFKHTLREGNECVDWLAKTGASCNDILKIWNSYPPQLSLVL 60

Query: 199 LADAMGITSIR 167
           +AD MG+   R
Sbjct: 61  MADVMGVARPR 71


>gb|KHN25063.1| Putative ribonuclease H protein, partial [Glycine soja]
          Length = 145

 Score = 72.8 bits (177), Expect = 1e-13
 Identities = 34/77 (44%), Positives = 44/77 (57%)
 Frame = -2

Query: 397 GVSHTHPYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMDYGLFRWPHCPR 218
           G  HTH Y PL+  IRSF +  W L+F H  RE N CADWLAK G++    L  +  CP 
Sbjct: 67  GADHTHHYFPLISRIRSFLHHPWELSFQHEFREANYCADWLAKLGASSANHLLVFYSCPI 126

Query: 217 ALSSFLLADAMGITSIR 167
           A++    AD  G+ + R
Sbjct: 127 AMAHLSFADNRGVLNPR 143


>gb|KHN25742.1| hypothetical protein glysoja_018316, partial [Glycine soja]
          Length = 95

 Score = 70.5 bits (171), Expect = 3e-13
 Identities = 37/77 (48%), Positives = 46/77 (59%)
 Frame = -2

Query: 397 GVSHTHPYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMDYGLFRWPHCPR 218
           GV   H + P V  IRSF +LDW L F H+LRE N+CAD LAK G   D+ L      P 
Sbjct: 18  GVVSPHSFYPNVQKIRSFLDLDWHLQFQHTLREANSCADILAKQGPLEDHRLIILDTSPL 77

Query: 217 ALSSFLLADAMGITSIR 167
           +LSS L+ DA+G+   R
Sbjct: 78  SLSSKLIVDALGVIFTR 94


>dbj|GAU35993.1| hypothetical protein TSUD_211300 [Trifolium subterraneum]
          Length = 205

 Score = 73.2 bits (178), Expect = 3e-13
 Identities = 33/68 (48%), Positives = 44/68 (64%)
 Frame = -2

Query: 382 HPYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMDYGLFRWPHCPRALSSF 203
           HP+A ++  IR F   +W L+FSH+LREGN CADWLAK+G+  D  L  W   P  ++  
Sbjct: 132 HPHAIVLGQIRIFRARNWSLSFSHTLREGNECADWLAKHGAQSDVNLKLWVSPPPQIAHV 191

Query: 202 LLADAMGI 179
           LLAD  G+
Sbjct: 192 LLADVTGV 199


>gb|KHN15187.1| hypothetical protein glysoja_011805 [Glycine soja]
          Length = 128

 Score = 70.5 bits (171), Expect = 7e-13
 Identities = 35/74 (47%), Positives = 49/74 (66%)
 Frame = -2

Query: 400 DGVSHTHPYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMDYGLFRWPHCP 221
           D V  +HPY P+V  I+S   L W+L FSH+LRE N+CA+ LAK G++ D  L    + P
Sbjct: 49  DAVVRSHPYYPIVLKIQSLITLQWQLHFSHTLREANSCANSLAKTGASEDSHLVILGNAP 108

Query: 220 RALSSFLLADAMGI 179
             LS+ L+ADA+G+
Sbjct: 109 SNLSTSLMADALGV 122


>gb|ABN09101.1| Ribonuclease H [Medicago truncatula]
          Length = 170

 Score = 71.2 bits (173), Expect = 9e-13
 Identities = 33/72 (45%), Positives = 44/72 (61%)
 Frame = -2

Query: 382 HPYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMDYGLFRWPHCPRALSSF 203
           HP+A ++  IR+  + DW L F+H+LREGN CADWLAKY +  D  L  W   P   +  
Sbjct: 97  HPHAIVLGRIRTLMSRDWSLLFNHTLREGNECADWLAKYDAQSDVSLKLWVSPPPQFAHV 156

Query: 202 LLADAMGITSIR 167
           LLADA  +  +R
Sbjct: 157 LLADASCVLRLR 168


>gb|KHN33901.1| Putative ribonuclease H protein, partial [Glycine soja]
          Length = 125

 Score = 69.7 bits (169), Expect = 1e-12
 Identities = 32/76 (42%), Positives = 43/76 (56%)
 Frame = -2

Query: 394 VSHTHPYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMDYGLFRWPHCPRA 215
           V+  HPY+P+V  I     L W ++F H+L+E N CADWLAKYG+        W  CP  
Sbjct: 49  VNEYHPYSPMVQLIDHLLALTWIVSFKHTLQEENACADWLAKYGAMHAESYKIWNVCPSQ 108

Query: 214 LSSFLLADAMGITSIR 167
           LS+ ++ D MG    R
Sbjct: 109 LSTLVMTDKMGAVHFR 124


>gb|KHN02177.1| Putative ribonuclease H protein [Glycine soja]
          Length = 216

 Score = 71.2 bits (173), Expect = 2e-12
 Identities = 38/77 (49%), Positives = 51/77 (66%)
 Frame = -2

Query: 397 GVSHTHPYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMDYGLFRWPHCPR 218
           GVS THP A LV  I++F   DW+L F H+LREGN  AD LAK G+++D  L     CP 
Sbjct: 139 GVSFTHPSAALVAKIQAFKVKDWKLIFQHTLREGNFVADNLAKKGAHLDQ-LSIMNVCPN 197

Query: 217 ALSSFLLADAMGITSIR 167
           AL +   ADA+G++++R
Sbjct: 198 ALHNLCFADAIGVSTVR 214


>gb|PNY08513.1| ribonuclease H [Trifolium pratense]
          Length = 145

 Score = 68.9 bits (167), Expect = 4e-12
 Identities = 39/79 (49%), Positives = 46/79 (58%)
 Frame = -2

Query: 400 DGVSHTHPYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMDYGLFRWPHCP 221
           +GVS  H +A  V  IR     DW +A +H+LREGN CAD LAK G+  D  L +    P
Sbjct: 67  EGVSAHHRFANEVFSIRQLLTRDWEVAINHTLREGNACADALAKMGALSDSSLVKISTPP 126

Query: 220 RALSSFLLADAMGITSIRE 164
             LS  LLADA GI  IRE
Sbjct: 127 SDLSMLLLADAQGIVFIRE 145


>gb|KHN15930.1| Putative ribonuclease H protein [Glycine soja]
          Length = 159

 Score = 67.8 bits (164), Expect = 1e-11
 Identities = 30/72 (41%), Positives = 40/72 (55%)
 Frame = -2

Query: 382 HPYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMDYGLFRWPHCPRALSSF 203
           H Y P +  I S  ++DW + F H  REGN CADW AK G++   G+    H P  +   
Sbjct: 86  HQYFPTIMLIHSLKHMDWEVTFVHVYREGNKCADWFAKTGNSSQQGMIILDHRPSPIYQA 145

Query: 202 LLADAMGITSIR 167
            L DAMG+T+ R
Sbjct: 146 YLGDAMGVTTQR 157


>ref|XP_019447242.1| PREDICTED: uncharacterized protein LOC109350462 [Lupinus
           angustifolius]
          Length = 163

 Score = 67.8 bits (164), Expect = 2e-11
 Identities = 31/72 (43%), Positives = 42/72 (58%)
 Frame = -2

Query: 382 HPYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMDYGLFRWPHCPRALSSF 203
           HP   ++  I+S  NL   L   H+LREGN   DW AKYG++ D+  F W +CP  LS  
Sbjct: 91  HPQVVVIKRIKSLINLLLCLTLKHTLREGNEGVDWFAKYGADNDHLFFIWGYCPTQLSYI 150

Query: 202 LLADAMGITSIR 167
           LL D +G+  +R
Sbjct: 151 LLFDVVGVVRLR 162


>gb|PNX57406.1| hypothetical protein L195_g050384, partial [Trifolium pratense]
          Length = 92

 Score = 65.9 bits (159), Expect = 2e-11
 Identities = 31/70 (44%), Positives = 42/70 (60%)
 Frame = -2

Query: 391 SHTHPYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMDYGLFRWPHCPRAL 212
           ++ +P A ++  IR F   +W L+FSH+LRE N CADWLAKY +  D  +  W   P  +
Sbjct: 16  NNIYPRAIVLGQIRIFRTCNWSLSFSHTLREENECADWLAKYSAQSDVNIKLWTSPPLQI 75

Query: 211 SSFLLADAMG 182
              LLADA G
Sbjct: 76  VHALLADATG 85


>dbj|GAU35798.1| hypothetical protein TSUD_155730 [Trifolium subterraneum]
          Length = 212

 Score = 68.2 bits (165), Expect = 3e-11
 Identities = 32/68 (47%), Positives = 42/68 (61%)
 Frame = -2

Query: 382 HPYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMDYGLFRWPHCPRALSSF 203
           HP+A ++  IR     +W L+FSH+LRE N CADWLAK+G   D  L  W   P  ++  
Sbjct: 130 HPHAIVLGQIRILRARNWSLSFSHTLREENECADWLAKHGVQSDVNLKLWVSPPPQIAHV 189

Query: 202 LLADAMGI 179
           LLADA G+
Sbjct: 190 LLADATGV 197


>gb|KRG89307.1| hypothetical protein GLYMA_20G015000 [Glycine max]
          Length = 86

 Score = 62.8 bits (151), Expect = 2e-10
 Identities = 32/72 (44%), Positives = 39/72 (54%)
 Frame = -2

Query: 382 HPYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMDYGLFRWPHCPRALSSF 203
           H Y+ L+  I     L W L+F H  REGN  ADWLAK  S   + L    HCP AL + 
Sbjct: 14  HLYSGLIIKIHHLITLAWELSFQHVYREGNFTADWLAKQDSASTHDLQLLHHCPAALFNI 73

Query: 202 LLADAMGITSIR 167
             AD MG +S+R
Sbjct: 74  FSADVMGFSSLR 85


>gb|KHN08093.1| hypothetical protein glysoja_023703, partial [Glycine soja]
          Length = 102

 Score = 62.8 bits (151), Expect = 4e-10
 Identities = 32/72 (44%), Positives = 39/72 (54%)
 Frame = -2

Query: 382 HPYAPLVDYIRSFTNLDWRLAFSHSLREGNNCADWLAKYGSNMDYGLFRWPHCPRALSSF 203
           H Y+ L+  I     L W L+F H  REGN  ADWLAK  S   + L    HCP AL + 
Sbjct: 30  HLYSGLIIKIHHLITLAWELSFQHVYREGNFTADWLAKQDSASTHDLQLLHHCPAALFNI 89

Query: 202 LLADAMGITSIR 167
             AD MG +S+R
Sbjct: 90  FSADVMGFSSLR 101

