BLASTX nr result
ID: Astragalus23_contig00031264
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus23_contig00031264 (473 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|KHN04350.1| hypothetical protein glysoja_030944 [Glycine soja] 112 6e-28 gb|KHN35644.1| hypothetical protein glysoja_030996 [Glycine soja] 110 8e-28 gb|KHN18837.1| hypothetical protein glysoja_028206 [Glycine soja] 107 2e-27 gb|PNX71113.1| pantothenate synthetase [Trifolium pratense] 80 2e-15 dbj|GAU42970.1| hypothetical protein TSUD_188430 [Trifolium subt... 67 5e-14 gb|PNY01502.1| ribonuclease H [Trifolium pratense] 77 2e-13 gb|PNY05892.1| ribonuclease H [Trifolium pratense] 75 9e-13 gb|PNX84435.1| hypothetical protein L195_g040495, partial [Trifo... 69 6e-11 dbj|GAU48210.1| hypothetical protein TSUD_404970 [Trifolium subt... 64 3e-10 gb|KHN39553.1| hypothetical protein glysoja_045723, partial [Gly... 59 1e-07 ref|XP_003600144.1| hypothetical protein MTR_3g052690 [Medicago ... 49 3e-07 gb|KHN41375.1| Putative ribonuclease H protein, partial [Glycine... 59 3e-07 gb|KHN03945.1| hypothetical protein glysoja_022631, partial [Gly... 49 3e-07 gb|PNY15032.1| ribonuclease H, partial [Trifolium pratense] 59 4e-07 gb|KHN04679.1| hypothetical protein glysoja_046613, partial [Gly... 54 1e-06 gb|KHN20559.1| hypothetical protein glysoja_033247, partial [Gly... 53 2e-06 dbj|GAU46753.1| hypothetical protein TSUD_402790 [Trifolium subt... 46 2e-06 gb|KRH56422.1| hypothetical protein GLYMA_06G322900 [Glycine max] 53 3e-06 >gb|KHN04350.1| hypothetical protein glysoja_030944 [Glycine soja] Length = 229 Score = 112 bits (281), Expect = 6e-28 Identities = 47/85 (55%), Positives = 63/85 (74%) Frame = +1 Query: 88 FFGFWENSKGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWFK 267 F F + KG++ K V+HLIW+AV WNIW+ RNK++FK E A++P MI+GI D AW WF Sbjct: 145 FMAFGKLIKGKKQKRVKHLIWMAVIWNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAWFM 204 Query: 268 ARRGRSRDLGWTDWYHGPFGCLDSM 342 AR+GR+ GW+DWY+ P GCL S+ Sbjct: 205 ARQGRTCWDGWSDWYNCPMGCLLSL 229 Score = 55.8 bits (133), Expect = 2e-06 Identities = 20/44 (45%), Positives = 31/44 (70%) Frame = +2 Query: 5 CRVIKRVWQHVSTWIGAHIKDSEDMVTHFLDFGKIVKGKKQREL 136 CRV K++WQ V W+ + + ED+ HF+ FGK++KGKKQ+ + Sbjct: 117 CRVAKKIWQQVCLWLDVPVVEGEDIQAHFMAFGKLIKGKKQKRV 160 >gb|KHN35644.1| hypothetical protein glysoja_030996 [Glycine soja] Length = 160 Score = 110 bits (275), Expect = 8e-28 Identities = 47/85 (55%), Positives = 63/85 (74%) Frame = +1 Query: 88 FFGFWENSKGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWFK 267 F F + KG++ K V+HLIW+AV WNIW+ RNK++FK E A++P MI+GI D AW WFK Sbjct: 76 FMAFGKLIKGKKQKRVKHLIWMAVIWNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAWFK 135 Query: 268 ARRGRSRDLGWTDWYHGPFGCLDSM 342 AR+GR +GW+D Y+ P GCL S+ Sbjct: 136 ARQGRICWVGWSDSYNCPMGCLLSL 160 Score = 55.8 bits (133), Expect = 1e-06 Identities = 20/44 (45%), Positives = 31/44 (70%) Frame = +2 Query: 5 CRVIKRVWQHVSTWIGAHIKDSEDMVTHFLDFGKIVKGKKQREL 136 CRV K++WQ V W+ + + ED+ HF+ FGK++KGKKQ+ + Sbjct: 48 CRVAKKIWQQVCLWLDVPVVEGEDIQAHFMAFGKLIKGKKQKRV 91 >gb|KHN18837.1| hypothetical protein glysoja_028206 [Glycine soja] Length = 84 Score = 107 bits (266), Expect = 2e-27 Identities = 44/77 (57%), Positives = 59/77 (76%) Frame = +1 Query: 112 KGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWFKARRGRSRD 291 KG++ K V+HLIW+AV WNIW+ RNK++FK E A++P MI+GI D AW WF AR+GR+ Sbjct: 8 KGKKQKRVKHLIWMAVIWNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAWFMARQGRTCW 67 Query: 292 LGWTDWYHGPFGCLDSM 342 GW+ WY+ P GCL S+ Sbjct: 68 DGWSYWYNCPMGCLLSL 84 >gb|PNX71113.1| pantothenate synthetase [Trifolium pratense] Length = 255 Score = 80.5 bits (197), Expect = 2e-15 Identities = 34/86 (39%), Positives = 49/86 (56%) Frame = +1 Query: 82 DSFFGFWENSKGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCW 261 D F F E K ++ VRHL+WLA TWN+W MRNK++FKG++ ++ I +W W Sbjct: 169 DHFLLFGEFFKVKDKGHVRHLVWLATTWNLWKMRNKVIFKGDIPDTAVLLDSIKLFSWIW 228 Query: 262 FKARRGRSRDLGWTDWYHGPFGCLDS 339 F R GR+ ++ W P C+ S Sbjct: 229 FNGRYGRNVCCPFSSWCLDPISCIQS 254 >dbj|GAU42970.1| hypothetical protein TSUD_188430 [Trifolium subterraneum] Length = 767 Score = 67.0 bits (162), Expect(2) = 5e-14 Identities = 29/77 (37%), Positives = 41/77 (53%) Frame = +1 Query: 112 KGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWFKARRGRSRD 291 K + + VRHLIWLA TW+IW +RN +VF G S +++ I I+ W R G Sbjct: 690 KTKRFEKVRHLIWLATTWSIWKLRNNVVFNGVTLSSSSLVNDIKTISCLWLSGRYGHISS 749 Query: 292 LGWTDWYHGPFGCLDSM 342 + + DW P C S+ Sbjct: 750 ISFPDWCFDPMTCFQSI 766 Score = 38.1 bits (87), Expect(2) = 5e-14 Identities = 16/40 (40%), Positives = 22/40 (55%) Frame = +2 Query: 5 CRVIKRVWQHVSTWIGAHIKDSEDMVTHFLDFGKIVKGKK 124 C V+KRVW+ V W G + D HF FG ++K K+ Sbjct: 654 CSVLKRVWEEVFKWFGKSYQAEADGWNHFNIFGSLLKTKR 693 >gb|PNY01502.1| ribonuclease H [Trifolium pratense] Length = 554 Score = 77.0 bits (188), Expect = 2e-13 Identities = 32/86 (37%), Positives = 49/86 (56%) Frame = +1 Query: 82 DSFFGFWENSKGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCW 261 D F F + K ++ VRHL+WLA TWN+W +RNK++FKG++ ++ I +W W Sbjct: 468 DHFMLFGDLFKVKDKGRVRHLVWLATTWNLWKLRNKVIFKGDIPETSALLDSIKLSSWMW 527 Query: 262 FKARRGRSRDLGWTDWYHGPFGCLDS 339 F R GR+ ++ W P C+ S Sbjct: 528 FNGRYGRNVCCPFSSWCLDPMSCVQS 553 >gb|PNY05892.1| ribonuclease H [Trifolium pratense] Length = 455 Score = 75.1 bits (183), Expect = 9e-13 Identities = 38/103 (36%), Positives = 57/103 (55%) Frame = +1 Query: 34 CFYVDWGAHKGLRRHGDSFFGFWENSKGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVA 213 C Y ++G +H + F G ++ KG++ V+HLIWL TW IW +RN I+F+G +A Sbjct: 359 CAYYNYGEGW---KHFNFFGGIVKSKKGEK---VKHLIWLVTTWCIWRLRNNIIFRGALA 412 Query: 214 SVPTMIAGIMDIAWCWFKARRGRSRDLGWTDWYHGPFGCLDSM 342 ++ I I+W WF R GR ++DW P C+ SM Sbjct: 413 DCAQLVDQIKLISWVWFIGRSGRHCPYLYSDWCVNPLECILSM 455 >gb|PNX84435.1| hypothetical protein L195_g040495, partial [Trifolium pratense] Length = 248 Score = 68.6 bits (166), Expect = 6e-11 Identities = 29/87 (33%), Positives = 47/87 (54%), Gaps = 2/87 (2%) Frame = +1 Query: 88 FFGFWENSKGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWFK 267 F F + ++ + VRHL WLA TWNIW +RN +VF G + + +++ I+ +W WF Sbjct: 162 FLLFDDMITAKKGERVRHLFWLATTWNIWKLRNNVVFNGVIPNASSLVEDIIANSWLWFN 221 Query: 268 ARRGR--SRDLGWTDWYHGPFGCLDSM 342 R G + +++W H C+ M Sbjct: 222 GRYGHHSCTSMSFSNWCHDHMTCIQRM 248 >dbj|GAU48210.1| hypothetical protein TSUD_404970 [Trifolium subterraneum] Length = 1653 Score = 64.3 bits (155), Expect(2) = 3e-10 Identities = 30/80 (37%), Positives = 41/80 (51%) Frame = +1 Query: 91 FGFWENSKGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWFKA 270 FG NS E VRHLIWLA TWN+W +RN ++F G S +++ I I+ W Sbjct: 1571 FGDMVNSTNIER--VRHLIWLATTWNLWKLRNNVIFNGATPSASSLLNDIKAISCAWVSG 1628 Query: 271 RRGRSRDLGWTDWYHGPFGC 330 R G + ++ W P C Sbjct: 1629 RYGHKSCISFSLWCFDPLAC 1648 Score = 27.7 bits (60), Expect(2) = 3e-10 Identities = 12/36 (33%), Positives = 17/36 (47%) Frame = +2 Query: 5 CRVIKRVWQHVSTWIGAHIKDSEDMVTHFLDFGKIV 112 C + VW+ V WIG + +HF FG +V Sbjct: 1540 CSFVNCVWEAVYNWIGKDYHAGAEGWSHFKVFGDMV 1575 >gb|KHN39553.1| hypothetical protein glysoja_045723, partial [Glycine soja] Length = 211 Score = 59.3 bits (142), Expect = 1e-07 Identities = 28/82 (34%), Positives = 37/82 (45%), Gaps = 4/82 (4%) Frame = +1 Query: 97 FWENSK----GQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWF 264 FWE + V + WLA W IW +RN +FK E +P I I I W WF Sbjct: 130 FWEYDRLLKYNTSRNKVPFMFWLATLWIIWQVRNNSIFKEEEKDIPKTINQIKHICWAWF 189 Query: 265 KARRGRSRDLGWTDWYHGPFGC 330 + G +DW++ PF C Sbjct: 190 MGKVGGVTGSNISDWWNSPFLC 211 >ref|XP_003600144.1| hypothetical protein MTR_3g052690 [Medicago truncatula] gb|AES70395.1| hypothetical protein MTR_3g052690 [Medicago truncatula] Length = 149 Score = 48.9 bits (115), Expect(2) = 3e-07 Identities = 29/84 (34%), Positives = 41/84 (48%), Gaps = 1/84 (1%) Frame = +1 Query: 82 DSFFGFWENSK-GQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWC 258 D F F +S G+ + HLIW A W IW RN +F+G+ S M+ I +++ Sbjct: 63 DHFHQFGTSSGYGKLRCSLMHLIWFATVWEIWKERNDRIFRGQERSHYQMLEAIKLLSFW 122 Query: 259 WFKARRGRSRDLGWTDWYHGPFGC 330 WFKA + + DW PF C Sbjct: 123 WFKA-KFTVFPYCFHDWCQAPFLC 145 Score = 33.1 bits (74), Expect(2) = 3e-07 Identities = 11/33 (33%), Positives = 18/33 (54%) Frame = +2 Query: 5 CRVIKRVWQHVSTWIGAHIKDSEDMVTHFLDFG 103 C + +WQ + W+G + D ++V HF FG Sbjct: 37 CTLFGHIWQLIRNWLGVYSADPGNIVDHFHQFG 69 >gb|KHN41375.1| Putative ribonuclease H protein, partial [Glycine soja] Length = 363 Score = 58.9 bits (141), Expect = 3e-07 Identities = 28/82 (34%), Positives = 37/82 (45%), Gaps = 4/82 (4%) Frame = +1 Query: 97 FWENSK----GQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWF 264 FWE + V + WLA W IW +RN +FK E +P I I I W WF Sbjct: 282 FWEYDRLLKYNTSRNKVPFMFWLATLWIIWQVRNNSIFKEEEKDIPKTINQIKHICWAWF 341 Query: 265 KARRGRSRDLGWTDWYHGPFGC 330 + G +DW++ PF C Sbjct: 342 MGKVGGVGGSNISDWWNSPFLC 363 >gb|KHN03945.1| hypothetical protein glysoja_022631, partial [Glycine soja] Length = 114 Score = 49.3 bits (116), Expect(2) = 3e-07 Identities = 20/76 (26%), Positives = 35/76 (46%) Frame = +1 Query: 106 NSKGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWFKARRGRS 285 N +G++ + +HL+W W+IW RN ++F+ V I I ++W W + Sbjct: 39 NIRGKKKRRFKHLLWHNTCWSIWCHRNNVIFRNAEVDVNNTILFIKSMSWQWVLYKSSGK 98 Query: 286 RDLGWTDWYHGPFGCL 333 ++ W P CL Sbjct: 99 PGFFFSSWCLCPLDCL 114 Score = 32.7 bits (73), Expect(2) = 3e-07 Identities = 13/42 (30%), Positives = 22/42 (52%) Frame = +2 Query: 5 CRVIKRVWQHVSTWIGAHIKDSEDMVTHFLDFGKIVKGKKQR 130 C K+VWQ + W+G ++ +L G ++GKK+R Sbjct: 5 CPFAKQVWQGILNWLGYSFSLPNNIQELYLQLGMNIRGKKKR 46 >gb|PNY15032.1| ribonuclease H, partial [Trifolium pratense] Length = 1490 Score = 58.9 bits (141), Expect = 4e-07 Identities = 28/69 (40%), Positives = 40/69 (57%) Frame = +1 Query: 76 HGDSFFGFWENSKGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAW 255 H + FG N K + VR+L+WLA +WNIW RN ++FKG + V +++ I +W Sbjct: 1417 HHFTLFGDLFNVKDKGC--VRYLVWLATSWNIWKFRNLVIFKGVLPDVSSVVDAIKLSSW 1474 Query: 256 CWFKARRGR 282 WF R GR Sbjct: 1475 VWFTNRYGR 1483 >gb|KHN04679.1| hypothetical protein glysoja_046613, partial [Glycine soja] Length = 103 Score = 54.3 bits (129), Expect = 1e-06 Identities = 25/74 (33%), Positives = 40/74 (54%) Frame = +1 Query: 112 KGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWFKARRGRSRD 291 KG +++GV IW+A W+IW RN I+F+ + +I I +W W KA+ + Sbjct: 31 KGVKSRGVLMFIWVAAVWSIWNHRNVIIFRNQQPCAEYVIEEIKSKSWGWIKAKY-KCFQ 89 Query: 292 LGWTDWYHGPFGCL 333 + +W+ PF CL Sbjct: 90 SSYYEWHSQPFLCL 103 >gb|KHN20559.1| hypothetical protein glysoja_033247, partial [Glycine soja] Length = 86 Score = 53.1 bits (126), Expect = 2e-06 Identities = 23/74 (31%), Positives = 38/74 (51%), Gaps = 1/74 (1%) Frame = +1 Query: 112 KGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGE-VASVPTMIAGIMDIAWCWFKARRGRSR 288 +G++ + VR L+W A W +W+ RN ++FK V ++ I I+W W K + S Sbjct: 13 RGKKLRRVRLLVWHATCWCLWLYRNSVIFKDNFFPDVQNVVYHIQRISWTWMKYKGHGSS 72 Query: 289 DLGWTDWYHGPFGC 330 L + +W P C Sbjct: 73 SLSFANWCTSPLLC 86 >dbj|GAU46753.1| hypothetical protein TSUD_402790 [Trifolium subterraneum] Length = 592 Score = 45.8 bits (107), Expect(2) = 2e-06 Identities = 22/64 (34%), Positives = 33/64 (51%) Frame = +1 Query: 142 LIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWFKARRGRSRDLGWTDWYHGP 321 LIWL W +W RN F G S+ M+ I + ++ W KA+R + L + W++ P Sbjct: 527 LIWLVSVWLVWNERNSRCFSGSANSLQHMLDKIKNYSYRWLKAKR-CTLALNYHSWWYSP 585 Query: 322 FGCL 333 CL Sbjct: 586 LICL 589 Score = 33.1 bits (74), Expect(2) = 2e-06 Identities = 15/50 (30%), Positives = 22/50 (44%) Frame = +2 Query: 5 CRVIKRVWQHVSTWIGAHIKDSEDMVTHFLDFGKIVKGKKQRELGISFGW 154 C +W S+WIG+ + DS + HF F V G + R + W Sbjct: 480 CSTFGPLWPMASSWIGSPLVDSHTIPDHFAQFTLSVGGSRGRRSFMQLIW 529 >gb|KRH56422.1| hypothetical protein GLYMA_06G322900 [Glycine max] Length = 93 Score = 53.1 bits (126), Expect = 3e-06 Identities = 23/74 (31%), Positives = 38/74 (51%), Gaps = 1/74 (1%) Frame = +1 Query: 112 KGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGE-VASVPTMIAGIMDIAWCWFKARRGRSR 288 +G++ + VR L+W A W +W+ RN ++FK V ++ I I+W W K + S Sbjct: 19 RGKKLRRVRLLVWHATCWCLWLYRNSVIFKDNFFPDVQNVVYHIQRISWTWMKYKGHGSS 78 Query: 289 DLGWTDWYHGPFGC 330 L + +W P C Sbjct: 79 SLSFANWCTSPLLC 92