BLASTX nr result
ID: Astragalus24_contig00017857
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus24_contig00017857 (790 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|GAU48831.1| hypothetical protein TSUD_190610 [Trifolium subt... 114 7e-27 dbj|GAU10807.1| hypothetical protein TSUD_424460, partial [Trifo... 100 2e-22 gb|KHN33901.1| Putative ribonuclease H protein, partial [Glycine... 99 3e-22 gb|KHN12872.1| Putative ribonuclease H protein, partial [Glycine... 92 1e-19 gb|PNX57966.1| ribonuclease H [Trifolium pratense] 91 9e-19 dbj|GAU10804.1| hypothetical protein TSUD_425290, partial [Trifo... 90 2e-18 dbj|GAU35798.1| hypothetical protein TSUD_155730 [Trifolium subt... 91 2e-18 gb|PNX73497.1| ribonuclease H [Trifolium pratense] 90 2e-18 gb|PNY11046.1| ribonuclease H [Trifolium pratense] 88 3e-18 gb|KHN19802.1| hypothetical protein glysoja_036719, partial [Gly... 86 4e-18 dbj|GAU31849.1| hypothetical protein TSUD_114600 [Trifolium subt... 89 4e-18 gb|KHN13235.1| Putative ribonuclease H protein, partial [Glycine... 86 5e-18 gb|PNX87151.1| ribonuclease H, partial [Trifolium pratense] 87 5e-18 gb|PNX58316.1| hypothetical protein L195_g059131, partial [Trifo... 85 3e-17 gb|KRH66643.1| hypothetical protein GLYMA_03G119600 [Glycine max] 85 5e-17 dbj|GAU35042.1| hypothetical protein TSUD_30080 [Trifolium subte... 91 6e-17 dbj|GAU51939.1| hypothetical protein TSUD_417210 [Trifolium subt... 86 8e-17 gb|KHN15281.1| hypothetical protein glysoja_044276, partial [Gly... 83 1e-16 gb|KHN25063.1| Putative ribonuclease H protein, partial [Glycine... 84 2e-16 gb|PNX61255.1| ribonuclease H, partial [Trifolium pratense] 84 2e-16 >dbj|GAU48831.1| hypothetical protein TSUD_190610 [Trifolium subterraneum] Length = 259 Score = 114 bits (285), Expect = 7e-27 Identities = 54/100 (54%), Positives = 74/100 (74%) Frame = +3 Query: 183 MNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMD 362 +NAELQ I+ GL IAWN+GF +ICESDS TAL LI+ GVP +HPYAP+++ I+S ++ + Sbjct: 144 INAELQVILHGLDIAWNHGFRNVICESDSQTALKLIQEGVPSTHPYAPLVNYIQSLIHKE 203 Query: 363 WQLSFTHSLREGNSCAD*LAKYGSSLGQGSHNWVQCPTAI 482 W+L H+LREGN+ AD LAK G+SL Q + CP+ + Sbjct: 204 WKLFLVHTLREGNASADWLAKLGASLVQEPKMFSICPSPL 243 >dbj|GAU10807.1| hypothetical protein TSUD_424460, partial [Trifolium subterraneum] Length = 170 Score = 100 bits (248), Expect = 2e-22 Identities = 47/85 (55%), Positives = 59/85 (69%) Frame = +3 Query: 183 MNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMD 362 + AEL AI GL +A + GF +I ESDST A+ L+E G HPYAP+I NIR NMD Sbjct: 83 LQAELHAIYNGLCLAMDQGFNNVIIESDSTIAIGLVEHGTSPLHPYAPLIKNIRQFQNMD 142 Query: 363 WQLSFTHSLREGNSCAD*LAKYGSS 437 W ++F H+LREGN CAD LAK G++ Sbjct: 143 WTIAFHHTLREGNECADWLAKKGAT 167 >gb|KHN33901.1| Putative ribonuclease H protein, partial [Glycine soja] Length = 125 Score = 98.6 bits (244), Expect = 3e-22 Identities = 46/100 (46%), Positives = 66/100 (66%) Frame = +3 Query: 183 MNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMD 362 + AEL AI GL IAW+NG+ ++CESDST AL LI + V + HPY+P++ I + + Sbjct: 10 LKAELFAIYYGLRIAWDNGYTHVLCESDSTLALHLIHNEVNEYHPYSPMVQLIDHLLALT 69 Query: 363 WQLSFTHSLREGNSCAD*LAKYGSSLGQGSHNWVQCPTAI 482 W +SF H+L+E N+CAD LAKYG+ + W CP+ + Sbjct: 70 WIVSFKHTLQEENACADWLAKYGAMHAESYKIWNVCPSQL 109 >gb|KHN12872.1| Putative ribonuclease H protein, partial [Glycine soja] Length = 151 Score = 92.4 bits (228), Expect = 1e-19 Identities = 50/111 (45%), Positives = 59/111 (53%) Frame = +3 Query: 141 WHFRLGHLSNQRMKMNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPY 320 WH + AEL AI+ GL ++W+ GF I CESDS AL LI G HPY Sbjct: 29 WHAGFYGSCGTATSLQAELLAILHGLNLSWDKGFRNIQCESDSKLALQLISEGRNSLHPY 88 Query: 321 APIIDNIRSSMNMDWQLSFTHSLREGNSCAD*LAKYGSSLGQGSHNWVQCP 473 A II I+ + W L F H+ REGN CAD LAK GSSL + CP Sbjct: 89 ASIIQKIQDFKLLHWDLHFNHTFREGNMCADELAKTGSSLQCNLQVFNGCP 139 >gb|PNX57966.1| ribonuclease H [Trifolium pratense] Length = 192 Score = 91.3 bits (225), Expect = 9e-19 Identities = 42/83 (50%), Positives = 57/83 (68%) Frame = +3 Query: 189 AELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMDWQ 368 AE+ A++ GL + WNNG+ ++C SDS A+SLI++GV H YA I IR + DW Sbjct: 79 AEIMAVLHGLELCWNNGYTNLVCFSDSLQAVSLIKNGVSPYHTYANEIHKIRQLIGRDWN 138 Query: 369 LSFTHSLREGNSCAD*LAKYGSS 437 +S H+LREGN+CAD LAK G+S Sbjct: 139 VSIDHTLREGNACADFLAKLGAS 161 >dbj|GAU10804.1| hypothetical protein TSUD_425290, partial [Trifolium subterraneum] Length = 174 Score = 90.1 bits (222), Expect = 2e-18 Identities = 43/84 (51%), Positives = 58/84 (69%) Frame = +3 Query: 183 MNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMD 362 ++AEL AI+KGL+IAW + C SDS TA+ LI V H YA I++NI+ +N D Sbjct: 60 LHAELMAILKGLLIAWELNIKDLSCYSDSATAIKLITEPVDVWHHYAAILNNIKDILNRD 119 Query: 363 WQLSFTHSLREGNSCAD*LAKYGS 434 WQ+S H+ REGN+CAD LAK+G+ Sbjct: 120 WQVSIFHTFREGNACADYLAKHGA 143 >dbj|GAU35798.1| hypothetical protein TSUD_155730 [Trifolium subterraneum] Length = 212 Score = 90.9 bits (224), Expect = 2e-18 Identities = 50/109 (45%), Positives = 63/109 (57%) Frame = +3 Query: 156 GHLSNQRMKMNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIID 335 GH SN + AE AI+KGL +AW+ GFC II ESDS +A+ LI HP+A ++ Sbjct: 82 GHASN----LLAEFYAILKGLQLAWDLGFCTIILESDSKSAIDLILEDDNNFHPHAIVLG 137 Query: 336 NIRSSMNMDWQLSFTHSLREGNSCAD*LAKYGSSLGQGSHNWVQCPTAI 482 IR +W LSF+H+LRE N CAD LAK+G WV P I Sbjct: 138 QIRILRARNWSLSFSHTLREENECADWLAKHGVQSDVNLKLWVSPPPQI 186 >gb|PNX73497.1| ribonuclease H [Trifolium pratense] Length = 183 Score = 90.1 bits (222), Expect = 2e-18 Identities = 48/95 (50%), Positives = 56/95 (58%) Frame = +3 Query: 189 AELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMDWQ 368 AEL AI+ GL IA GF II ESDST A++ Q HPYAP+I IR +DW Sbjct: 58 AELHAILNGLKIAQAEGFRNIIIESDSTLAVNFACHRTSQLHPYAPLIQQIRHLHRVDWN 117 Query: 369 LSFTHSLREGNSCAD*LAKYGSSLGQGSHNWVQCP 473 +SF +LREGN CAD LAK G+S W CP Sbjct: 118 VSFHRTLREGNECADWLAKTGASSNDTLKIWNSCP 152 >gb|PNY11046.1| ribonuclease H [Trifolium pratense] Length = 114 Score = 87.8 bits (216), Expect = 3e-18 Identities = 42/83 (50%), Positives = 53/83 (63%) Frame = +3 Query: 189 AELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMDWQ 368 AE+ AI+ GL + WNNG+ I+C DS A+SLI+ GV H +A I NIR + DW Sbjct: 11 AEIMAILHGLQLCWNNGYRSIVCYPDSLKAVSLIKDGVSHFHTFANEIHNIRQLLRRDWN 70 Query: 369 LSFTHSLREGNSCAD*LAKYGSS 437 + H LREGN CAD LAK G+S Sbjct: 71 VVIDHILREGNECADILAKLGTS 93 >gb|KHN19802.1| hypothetical protein glysoja_036719, partial [Glycine soja] Length = 80 Score = 86.3 bits (212), Expect = 4e-18 Identities = 39/75 (52%), Positives = 52/75 (69%) Frame = +3 Query: 213 GLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMDWQLSFTHSLR 392 GL +A + GFC +ICESDS AL IE GV HP+AP++ IR M ++W +SF H+ R Sbjct: 1 GLNLAGHKGFCLVICESDSKMALQFIEEGVVDCHPHAPLVAAIRLLMGLNWDVSFLHTFR 60 Query: 393 EGNSCAD*LAKYGSS 437 EGN CAD LA+ G++ Sbjct: 61 EGNFCADALAELGAT 75 >dbj|GAU31849.1| hypothetical protein TSUD_114600 [Trifolium subterraneum] Length = 171 Score = 89.0 bits (219), Expect = 4e-18 Identities = 48/95 (50%), Positives = 55/95 (57%) Frame = +3 Query: 189 AELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMDWQ 368 AEL AI+ GL IA F II ESDST A++ G Q HPYA +I IR DW Sbjct: 58 AELHAILNGLKIAQAERFRNIIIESDSTLAVNFACHGTSQFHPYATLIQQIRHLHQGDWN 117 Query: 369 LSFTHSLREGNSCAD*LAKYGSSLGQGSHNWVQCP 473 +SF H+LREGN CAD LAK G+S W CP Sbjct: 118 VSFQHTLREGNECADWLAKTGASSNDTLKIWNSCP 152 >gb|KHN13235.1| Putative ribonuclease H protein, partial [Glycine soja] Length = 86 Score = 86.3 bits (212), Expect = 5e-18 Identities = 44/81 (54%), Positives = 52/81 (64%) Frame = +3 Query: 183 MNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMD 362 +NAEL AI GL W G IIC+SDS AL + GV QSHPYAPI+ IR ++ D Sbjct: 6 VNAELHAIFHGLRCEWIKGLRNIICKSDSKLALQFVTEGVIQSHPYAPIVAKIREFLSYD 65 Query: 363 WQLSFTHSLREGNSCAD*LAK 425 LSF H+LREGN +D LAK Sbjct: 66 GNLSFLHTLREGNLVSDYLAK 86 >gb|PNX87151.1| ribonuclease H, partial [Trifolium pratense] Length = 124 Score = 87.4 bits (215), Expect = 5e-18 Identities = 42/93 (45%), Positives = 58/93 (62%) Frame = +3 Query: 162 LSNQRMKMNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNI 341 +++Q + AE+ I+ GL ++W NGF I C SDS A+SLI GV H YA I I Sbjct: 28 VASQSSVLYAEIMTIIHGLELSWANGFRNIACYSDSLQAVSLIRDGVSAFHQYANEIQKI 87 Query: 342 RSSMNMDWQLSFTHSLREGNSCAD*LAKYGSSL 440 R ++ DW + H+ REGN+CAD LAK G+S+ Sbjct: 88 RQLLSRDWNVVINHTFREGNACADFLAKMGASM 120 >gb|PNX58316.1| hypothetical protein L195_g059131, partial [Trifolium pratense] Length = 114 Score = 85.1 bits (209), Expect = 3e-17 Identities = 40/76 (52%), Positives = 53/76 (69%) Frame = +3 Query: 183 MNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMD 362 +NAELQAI+ +AWNNG+ + CESD + L+ I+ V +H YAP+ID I+ ++ Sbjct: 8 INAELQAILHSFQMAWNNGYRYVECESDCQSDLNFIQDEVHTTHLYAPVIDLIKRFIDYP 67 Query: 363 WQLSFTHSLREGNSCA 410 W LSF HSLREGNSCA Sbjct: 68 WLLSFHHSLREGNSCA 83 >gb|KRH66643.1| hypothetical protein GLYMA_03G119600 [Glycine max] Length = 122 Score = 84.7 bits (208), Expect = 5e-17 Identities = 44/89 (49%), Positives = 55/89 (61%) Frame = +3 Query: 183 MNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMD 362 +N ELQ I KGL +A N + ++CESDS T L LI+ GV +HPYAPI Sbjct: 43 LNPELQVIAKGLQLALNEWYRAVVCESDSKTTLMLIDEGVHDTHPYAPI----------- 91 Query: 363 WQLSFTHSLREGNSCAD*LAKYGSSLGQG 449 L F HSL EGNSCAD LAK+G+++ G Sbjct: 92 -DLVFAHSLCEGNSCADWLAKFGATMDYG 119 >dbj|GAU35042.1| hypothetical protein TSUD_30080 [Trifolium subterraneum] Length = 724 Score = 90.9 bits (224), Expect = 6e-17 Identities = 42/84 (50%), Positives = 59/84 (70%) Frame = +3 Query: 183 MNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMD 362 ++AEL AI+KGL++AW ++C SDS TA+ LI V H YA I++NI+ +N D Sbjct: 610 LHAELMAILKGLLLAWELNIKDLLCYSDSATAIKLITEPVDVWHHYAAILNNIKDILNRD 669 Query: 363 WQLSFTHSLREGNSCAD*LAKYGS 434 WQ+S H+ REGN+CAD LAK+G+ Sbjct: 670 WQVSIFHTFREGNACADYLAKHGA 693 >dbj|GAU51939.1| hypothetical protein TSUD_417210 [Trifolium subterraneum] Length = 181 Score = 85.9 bits (211), Expect = 8e-17 Identities = 40/84 (47%), Positives = 57/84 (67%) Frame = +3 Query: 183 MNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMD 362 ++AEL I+KGL++AW + C SDS TA+ LI V H YA I+++I+ +N D Sbjct: 67 LHAELMVILKGLLLAWELNIKDLSCYSDSATAIKLITEPVDVWHHYAAILNSIKDILNRD 126 Query: 363 WQLSFTHSLREGNSCAD*LAKYGS 434 WQ+S H+ REGN+CAD LAK+G+ Sbjct: 127 WQVSIFHTFREGNACADYLAKHGA 150 >gb|KHN15281.1| hypothetical protein glysoja_044276, partial [Glycine soja] gb|KHN38112.1| hypothetical protein glysoja_007154, partial [Glycine soja] Length = 111 Score = 83.2 bits (204), Expect = 1e-16 Identities = 42/97 (43%), Positives = 55/97 (56%) Frame = +3 Query: 192 ELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIRSSMNMDWQL 371 EL + GL +AWN+G+ + CES S AL L+ GV H Y PII+ I + WQL Sbjct: 1 ELHVLFHGLSLAWNSGYKSVECESHSLLALQLVAEGVTTHHSYTPIINCICTFQTKKWQL 60 Query: 372 SFTHSLREGNSCAD*LAKYGSSLGQGSHNWVQCPTAI 482 SF H RE N CA+ L+K GSS + V CP ++ Sbjct: 61 SFHHVYREANICANWLSKKGSSSPEPMSTLVPCPISL 97 >gb|KHN25063.1| Putative ribonuclease H protein, partial [Glycine soja] Length = 145 Score = 83.6 bits (205), Expect = 2e-16 Identities = 47/114 (41%), Positives = 62/114 (54%) Frame = +3 Query: 141 WHFRLGHLSNQRMKMNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPY 320 W L + + AEL AI+K L +AW+ +I ES+S TALSL G +H Y Sbjct: 15 WVQGFAGLCGVKTNLYAELLAILKVLQMAWDYNGQALIYESNSRTALSLTLHGADHTHHY 74 Query: 321 APIIDNIRSSMNMDWQLSFTHSLREGNSCAD*LAKYGSSLGQGSHNWVQCPTAI 482 P+I IRS ++ W+LSF H RE N CAD LAK G+S + CP A+ Sbjct: 75 FPLISRIRSFLHHPWELSFQHEFREANYCADWLAKLGASSANHLLVFYSCPIAM 128 >gb|PNX61255.1| ribonuclease H, partial [Trifolium pratense] Length = 146 Score = 83.6 bits (205), Expect = 2e-16 Identities = 41/91 (45%), Positives = 57/91 (62%) Frame = +3 Query: 165 SNQRMKMNAELQAIVKGLMIAWNNGFCRIICESDSTTALSLIESGVPQSHPYAPIIDNIR 344 ++Q + AE+ A++ GL + W NGF I C SDS A++LI GV H +A I +IR Sbjct: 48 ASQSSVLYAEIMAVLHGLELCWVNGFRNIACYSDSLQAVALIRDGVSLFHKFANEIQSIR 107 Query: 345 SSMNMDWQLSFTHSLREGNSCAD*LAKYGSS 437 + DW + H+LREGN+CAD LAK G+S Sbjct: 108 QLLRRDWNVVVDHTLREGNACADVLAKMGAS 138