BLASTX nr result

ID: Astragalus24_contig00017857 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00017857
         (790 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU48831.1| hypothetical protein TSUD_190610 [Trifolium subt...   114   7e-27
dbj|GAU10807.1| hypothetical protein TSUD_424460, partial [Trifo...   100   2e-22
gb|KHN33901.1| Putative ribonuclease H protein, partial [Glycine...    99   3e-22
gb|KHN12872.1| Putative ribonuclease H protein, partial [Glycine...    92   1e-19
gb|PNX57966.1| ribonuclease H [Trifolium pratense]                     91   9e-19
dbj|GAU10804.1| hypothetical protein TSUD_425290, partial [Trifo...    90   2e-18
dbj|GAU35798.1| hypothetical protein TSUD_155730 [Trifolium subt...    91   2e-18
gb|PNX73497.1| ribonuclease H [Trifolium pratense]                     90   2e-18
gb|PNY11046.1| ribonuclease H [Trifolium pratense]                     88   3e-18
gb|KHN19802.1| hypothetical protein glysoja_036719, partial [Gly...    86   4e-18
dbj|GAU31849.1| hypothetical protein TSUD_114600 [Trifolium subt...    89   4e-18
gb|KHN13235.1| Putative ribonuclease H protein, partial [Glycine...    86   5e-18
gb|PNX87151.1| ribonuclease H, partial [Trifolium pratense]            87   5e-18
gb|PNX58316.1| hypothetical protein L195_g059131, partial [Trifo...    85   3e-17
gb|KRH66643.1| hypothetical protein GLYMA_03G119600 [Glycine max]      85   5e-17
dbj|GAU35042.1| hypothetical protein TSUD_30080 [Trifolium subte...    91   6e-17
dbj|GAU51939.1| hypothetical protein TSUD_417210 [Trifolium subt...    86   8e-17
gb|KHN15281.1| hypothetical protein glysoja_044276, partial [Gly...    83   1e-16
gb|KHN25063.1| Putative ribonuclease H protein, partial [Glycine...    84   2e-16
gb|PNX61255.1| ribonuclease H, partial [Trifolium pratense]            84   2e-16

>dbj|GAU48831.1| hypothetical protein TSUD_190610 [Trifolium subterraneum]
          Length = 259

 Score =  114 bits (285), Expect = 7e-27
 Identities = 54/100 (54%), Positives = 74/100 (74%)
 Frame = +3

Query: 183 MNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMD 362
           +NAELQ I+ GL IAWN+GF  +ICESDS TAL LI+ GVP +HPYAP+++ I+S ++ +
Sbjct: 144 INAELQVILHGLDIAWNHGFRNVICESDSQTALKLIQEGVPSTHPYAPLVNYIQSLIHKE 203

Query: 363 WQLSFTHSLREGNSCAD*LAKYGSSLGQGSHNWVQCPTAI 482
           W+L   H+LREGN+ AD LAK G+SL Q    +  CP+ +
Sbjct: 204 WKLFLVHTLREGNASADWLAKLGASLVQEPKMFSICPSPL 243


>dbj|GAU10807.1| hypothetical protein TSUD_424460, partial [Trifolium subterraneum]
          Length = 170

 Score =  100 bits (248), Expect = 2e-22
 Identities = 47/85 (55%), Positives = 59/85 (69%)
 Frame = +3

Query: 183 MNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMD 362
           + AEL AI  GL +A + GF  +I ESDST A+ L+E G    HPYAP+I NIR   NMD
Sbjct: 83  LQAELHAIYNGLCLAMDQGFNNVIIESDSTIAIGLVEHGTSPLHPYAPLIKNIRQFQNMD 142

Query: 363 WQLSFTHSLREGNSCAD*LAKYGSS 437
           W ++F H+LREGN CAD LAK G++
Sbjct: 143 WTIAFHHTLREGNECADWLAKKGAT 167


>gb|KHN33901.1| Putative ribonuclease H protein, partial [Glycine soja]
          Length = 125

 Score = 98.6 bits (244), Expect = 3e-22
 Identities = 46/100 (46%), Positives = 66/100 (66%)
 Frame = +3

Query: 183 MNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMD 362
           + AEL AI  GL IAW+NG+  ++CESDST AL LI + V + HPY+P++  I   + + 
Sbjct: 10  LKAELFAIYYGLRIAWDNGYTHVLCESDSTLALHLIHNEVNEYHPYSPMVQLIDHLLALT 69

Query: 363 WQLSFTHSLREGNSCAD*LAKYGSSLGQGSHNWVQCPTAI 482
           W +SF H+L+E N+CAD LAKYG+   +    W  CP+ +
Sbjct: 70  WIVSFKHTLQEENACADWLAKYGAMHAESYKIWNVCPSQL 109


>gb|KHN12872.1| Putative ribonuclease H protein, partial [Glycine soja]
          Length = 151

 Score = 92.4 bits (228), Expect = 1e-19
 Identities = 50/111 (45%), Positives = 59/111 (53%)
 Frame = +3

Query: 141 WHFRLGHLSNQRMKMNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPY 320
           WH            + AEL AI+ GL ++W+ GF  I CESDS  AL LI  G    HPY
Sbjct: 29  WHAGFYGSCGTATSLQAELLAILHGLNLSWDKGFRNIQCESDSKLALQLISEGRNSLHPY 88

Query: 321 APIIDNIRSSMNMDWQLSFTHSLREGNSCAD*LAKYGSSLGQGSHNWVQCP 473
           A II  I+    + W L F H+ REGN CAD LAK GSSL      +  CP
Sbjct: 89  ASIIQKIQDFKLLHWDLHFNHTFREGNMCADELAKTGSSLQCNLQVFNGCP 139


>gb|PNX57966.1| ribonuclease H [Trifolium pratense]
          Length = 192

 Score = 91.3 bits (225), Expect = 9e-19
 Identities = 42/83 (50%), Positives = 57/83 (68%)
 Frame = +3

Query: 189 AELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMDWQ 368
           AE+ A++ GL + WNNG+  ++C SDS  A+SLI++GV   H YA  I  IR  +  DW 
Sbjct: 79  AEIMAVLHGLELCWNNGYTNLVCFSDSLQAVSLIKNGVSPYHTYANEIHKIRQLIGRDWN 138

Query: 369 LSFTHSLREGNSCAD*LAKYGSS 437
           +S  H+LREGN+CAD LAK G+S
Sbjct: 139 VSIDHTLREGNACADFLAKLGAS 161


>dbj|GAU10804.1| hypothetical protein TSUD_425290, partial [Trifolium subterraneum]
          Length = 174

 Score = 90.1 bits (222), Expect = 2e-18
 Identities = 43/84 (51%), Positives = 58/84 (69%)
 Frame = +3

Query: 183 MNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMD 362
           ++AEL AI+KGL+IAW      + C SDS TA+ LI   V   H YA I++NI+  +N D
Sbjct: 60  LHAELMAILKGLLIAWELNIKDLSCYSDSATAIKLITEPVDVWHHYAAILNNIKDILNRD 119

Query: 363 WQLSFTHSLREGNSCAD*LAKYGS 434
           WQ+S  H+ REGN+CAD LAK+G+
Sbjct: 120 WQVSIFHTFREGNACADYLAKHGA 143


>dbj|GAU35798.1| hypothetical protein TSUD_155730 [Trifolium subterraneum]
          Length = 212

 Score = 90.9 bits (224), Expect = 2e-18
 Identities = 50/109 (45%), Positives = 63/109 (57%)
 Frame = +3

Query: 156 GHLSNQRMKMNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIID 335
           GH SN    + AE  AI+KGL +AW+ GFC II ESDS +A+ LI       HP+A ++ 
Sbjct: 82  GHASN----LLAEFYAILKGLQLAWDLGFCTIILESDSKSAIDLILEDDNNFHPHAIVLG 137

Query: 336 NIRSSMNMDWQLSFTHSLREGNSCAD*LAKYGSSLGQGSHNWVQCPTAI 482
            IR     +W LSF+H+LRE N CAD LAK+G         WV  P  I
Sbjct: 138 QIRILRARNWSLSFSHTLREENECADWLAKHGVQSDVNLKLWVSPPPQI 186


>gb|PNX73497.1| ribonuclease H [Trifolium pratense]
          Length = 183

 Score = 90.1 bits (222), Expect = 2e-18
 Identities = 48/95 (50%), Positives = 56/95 (58%)
 Frame = +3

Query: 189 AELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMDWQ 368
           AEL AI+ GL IA   GF  II ESDST A++       Q HPYAP+I  IR    +DW 
Sbjct: 58  AELHAILNGLKIAQAEGFRNIIIESDSTLAVNFACHRTSQLHPYAPLIQQIRHLHRVDWN 117

Query: 369 LSFTHSLREGNSCAD*LAKYGSSLGQGSHNWVQCP 473
           +SF  +LREGN CAD LAK G+S       W  CP
Sbjct: 118 VSFHRTLREGNECADWLAKTGASSNDTLKIWNSCP 152


>gb|PNY11046.1| ribonuclease H [Trifolium pratense]
          Length = 114

 Score = 87.8 bits (216), Expect = 3e-18
 Identities = 42/83 (50%), Positives = 53/83 (63%)
 Frame = +3

Query: 189 AELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMDWQ 368
           AE+ AI+ GL + WNNG+  I+C  DS  A+SLI+ GV   H +A  I NIR  +  DW 
Sbjct: 11  AEIMAILHGLQLCWNNGYRSIVCYPDSLKAVSLIKDGVSHFHTFANEIHNIRQLLRRDWN 70

Query: 369 LSFTHSLREGNSCAD*LAKYGSS 437
           +   H LREGN CAD LAK G+S
Sbjct: 71  VVIDHILREGNECADILAKLGTS 93


>gb|KHN19802.1| hypothetical protein glysoja_036719, partial [Glycine soja]
          Length = 80

 Score = 86.3 bits (212), Expect = 4e-18
 Identities = 39/75 (52%), Positives = 52/75 (69%)
 Frame = +3

Query: 213 GLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMDWQLSFTHSLR 392
           GL +A + GFC +ICESDS  AL  IE GV   HP+AP++  IR  M ++W +SF H+ R
Sbjct: 1   GLNLAGHKGFCLVICESDSKMALQFIEEGVVDCHPHAPLVAAIRLLMGLNWDVSFLHTFR 60

Query: 393 EGNSCAD*LAKYGSS 437
           EGN CAD LA+ G++
Sbjct: 61  EGNFCADALAELGAT 75


>dbj|GAU31849.1| hypothetical protein TSUD_114600 [Trifolium subterraneum]
          Length = 171

 Score = 89.0 bits (219), Expect = 4e-18
 Identities = 48/95 (50%), Positives = 55/95 (57%)
 Frame = +3

Query: 189 AELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMDWQ 368
           AEL AI+ GL IA    F  II ESDST A++    G  Q HPYA +I  IR     DW 
Sbjct: 58  AELHAILNGLKIAQAERFRNIIIESDSTLAVNFACHGTSQFHPYATLIQQIRHLHQGDWN 117

Query: 369 LSFTHSLREGNSCAD*LAKYGSSLGQGSHNWVQCP 473
           +SF H+LREGN CAD LAK G+S       W  CP
Sbjct: 118 VSFQHTLREGNECADWLAKTGASSNDTLKIWNSCP 152


>gb|KHN13235.1| Putative ribonuclease H protein, partial [Glycine soja]
          Length = 86

 Score = 86.3 bits (212), Expect = 5e-18
 Identities = 44/81 (54%), Positives = 52/81 (64%)
 Frame = +3

Query: 183 MNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMD 362
           +NAEL AI  GL   W  G   IIC+SDS  AL  +  GV QSHPYAPI+  IR  ++ D
Sbjct: 6   VNAELHAIFHGLRCEWIKGLRNIICKSDSKLALQFVTEGVIQSHPYAPIVAKIREFLSYD 65

Query: 363 WQLSFTHSLREGNSCAD*LAK 425
             LSF H+LREGN  +D LAK
Sbjct: 66  GNLSFLHTLREGNLVSDYLAK 86


>gb|PNX87151.1| ribonuclease H, partial [Trifolium pratense]
          Length = 124

 Score = 87.4 bits (215), Expect = 5e-18
 Identities = 42/93 (45%), Positives = 58/93 (62%)
 Frame = +3

Query: 162 LSNQRMKMNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNI 341
           +++Q   + AE+  I+ GL ++W NGF  I C SDS  A+SLI  GV   H YA  I  I
Sbjct: 28  VASQSSVLYAEIMTIIHGLELSWANGFRNIACYSDSLQAVSLIRDGVSAFHQYANEIQKI 87

Query: 342 RSSMNMDWQLSFTHSLREGNSCAD*LAKYGSSL 440
           R  ++ DW +   H+ REGN+CAD LAK G+S+
Sbjct: 88  RQLLSRDWNVVINHTFREGNACADFLAKMGASM 120


>gb|PNX58316.1| hypothetical protein L195_g059131, partial [Trifolium pratense]
          Length = 114

 Score = 85.1 bits (209), Expect = 3e-17
 Identities = 40/76 (52%), Positives = 53/76 (69%)
 Frame = +3

Query: 183 MNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMD 362
           +NAELQAI+    +AWNNG+  + CESD  + L+ I+  V  +H YAP+ID I+  ++  
Sbjct: 8   INAELQAILHSFQMAWNNGYRYVECESDCQSDLNFIQDEVHTTHLYAPVIDLIKRFIDYP 67

Query: 363 WQLSFTHSLREGNSCA 410
           W LSF HSLREGNSCA
Sbjct: 68  WLLSFHHSLREGNSCA 83


>gb|KRH66643.1| hypothetical protein GLYMA_03G119600 [Glycine max]
          Length = 122

 Score = 84.7 bits (208), Expect = 5e-17
 Identities = 44/89 (49%), Positives = 55/89 (61%)
 Frame = +3

Query: 183 MNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMD 362
           +N ELQ I KGL +A N  +  ++CESDS T L LI+ GV  +HPYAPI           
Sbjct: 43  LNPELQVIAKGLQLALNEWYRAVVCESDSKTTLMLIDEGVHDTHPYAPI----------- 91

Query: 363 WQLSFTHSLREGNSCAD*LAKYGSSLGQG 449
             L F HSL EGNSCAD LAK+G+++  G
Sbjct: 92  -DLVFAHSLCEGNSCADWLAKFGATMDYG 119


>dbj|GAU35042.1| hypothetical protein TSUD_30080 [Trifolium subterraneum]
          Length = 724

 Score = 90.9 bits (224), Expect = 6e-17
 Identities = 42/84 (50%), Positives = 59/84 (70%)
 Frame = +3

Query: 183 MNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMD 362
           ++AEL AI+KGL++AW      ++C SDS TA+ LI   V   H YA I++NI+  +N D
Sbjct: 610 LHAELMAILKGLLLAWELNIKDLLCYSDSATAIKLITEPVDVWHHYAAILNNIKDILNRD 669

Query: 363 WQLSFTHSLREGNSCAD*LAKYGS 434
           WQ+S  H+ REGN+CAD LAK+G+
Sbjct: 670 WQVSIFHTFREGNACADYLAKHGA 693


>dbj|GAU51939.1| hypothetical protein TSUD_417210 [Trifolium subterraneum]
          Length = 181

 Score = 85.9 bits (211), Expect = 8e-17
 Identities = 40/84 (47%), Positives = 57/84 (67%)
 Frame = +3

Query: 183 MNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMD 362
           ++AEL  I+KGL++AW      + C SDS TA+ LI   V   H YA I+++I+  +N D
Sbjct: 67  LHAELMVILKGLLLAWELNIKDLSCYSDSATAIKLITEPVDVWHHYAAILNSIKDILNRD 126

Query: 363 WQLSFTHSLREGNSCAD*LAKYGS 434
           WQ+S  H+ REGN+CAD LAK+G+
Sbjct: 127 WQVSIFHTFREGNACADYLAKHGA 150


>gb|KHN15281.1| hypothetical protein glysoja_044276, partial [Glycine soja]
 gb|KHN38112.1| hypothetical protein glysoja_007154, partial [Glycine soja]
          Length = 111

 Score = 83.2 bits (204), Expect = 1e-16
 Identities = 42/97 (43%), Positives = 55/97 (56%)
 Frame = +3

Query: 192 ELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMDWQL 371
           EL  +  GL +AWN+G+  + CES S  AL L+  GV   H Y PII+ I +     WQL
Sbjct: 1   ELHVLFHGLSLAWNSGYKSVECESHSLLALQLVAEGVTTHHSYTPIINCICTFQTKKWQL 60

Query: 372 SFTHSLREGNSCAD*LAKYGSSLGQGSHNWVQCPTAI 482
           SF H  RE N CA+ L+K GSS  +     V CP ++
Sbjct: 61  SFHHVYREANICANWLSKKGSSSPEPMSTLVPCPISL 97


>gb|KHN25063.1| Putative ribonuclease H protein, partial [Glycine soja]
          Length = 145

 Score = 83.6 bits (205), Expect = 2e-16
 Identities = 47/114 (41%), Positives = 62/114 (54%)
 Frame = +3

Query: 141 WHFRLGHLSNQRMKMNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPY 320
           W      L   +  + AEL AI+K L +AW+     +I ES+S TALSL   G   +H Y
Sbjct: 15  WVQGFAGLCGVKTNLYAELLAILKVLQMAWDYNGQALIYESNSRTALSLTLHGADHTHHY 74

Query: 321 APIIDNIRSSMNMDWQLSFTHSLREGNSCAD*LAKYGSSLGQGSHNWVQCPTAI 482
            P+I  IRS ++  W+LSF H  RE N CAD LAK G+S       +  CP A+
Sbjct: 75  FPLISRIRSFLHHPWELSFQHEFREANYCADWLAKLGASSANHLLVFYSCPIAM 128


>gb|PNX61255.1| ribonuclease H, partial [Trifolium pratense]
          Length = 146

 Score = 83.6 bits (205), Expect = 2e-16
 Identities = 41/91 (45%), Positives = 57/91 (62%)
 Frame = +3

Query: 165 SNQRMKMNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIR 344
           ++Q   + AE+ A++ GL + W NGF  I C SDS  A++LI  GV   H +A  I +IR
Sbjct: 48  ASQSSVLYAEIMAVLHGLELCWVNGFRNIACYSDSLQAVALIRDGVSLFHKFANEIQSIR 107

Query: 345 SSMNMDWQLSFTHSLREGNSCAD*LAKYGSS 437
             +  DW +   H+LREGN+CAD LAK G+S
Sbjct: 108 QLLRRDWNVVVDHTLREGNACADVLAKMGAS 138


Top