BLASTX nr result
ID: Astragalus22_contig00020873
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus22_contig00020873 (540 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|KHN03945.1| hypothetical protein glysoja_022631, partial [Gly... 84 2e-17 gb|KHN04350.1| hypothetical protein glysoja_030944 [Glycine soja] 82 1e-15 gb|KHN35644.1| hypothetical protein glysoja_030996 [Glycine soja] 77 2e-14 gb|PNY05892.1| ribonuclease H [Trifolium pratense] 77 3e-13 gb|KHN20429.1| hypothetical protein glysoja_044415, partial [Gly... 72 7e-13 gb|PNX71113.1| pantothenate synthetase [Trifolium pratense] 73 3e-12 gb|KHN39553.1| hypothetical protein glysoja_045723, partial [Gly... 71 7e-12 gb|KHN24583.1| hypothetical protein glysoja_039590, partial [Gly... 67 3e-11 gb|KHN10514.1| hypothetical protein glysoja_040669, partial [Gly... 68 3e-11 gb|PNY01502.1| ribonuclease H [Trifolium pratense] 71 4e-11 gb|KHN41375.1| Putative ribonuclease H protein, partial [Glycine... 71 4e-11 gb|KHN30886.1| Putative ribonuclease H protein, partial [Glycine... 71 4e-11 dbj|GAU42970.1| hypothetical protein TSUD_188430 [Trifolium subt... 70 8e-11 ref|XP_003614519.1| hypothetical protein MTR_5g054985 [Medicago ... 65 2e-10 gb|KRH23381.1| hypothetical protein GLYMA_13G353700 [Glycine max] 65 3e-10 gb|KHN32657.1| hypothetical protein glysoja_022339 [Glycine soja] 64 6e-10 dbj|GAU35033.1| hypothetical protein TSUD_103560 [Trifolium subt... 65 3e-09 dbj|GAU26034.1| hypothetical protein TSUD_224950 [Trifolium subt... 64 6e-09 dbj|GAU34177.1| hypothetical protein TSUD_162770 [Trifolium subt... 65 8e-09 gb|PNX69227.1| hypothetical protein L195_g056601 [Trifolium prat... 61 1e-08 >gb|KHN03945.1| hypothetical protein glysoja_022631, partial [Glycine soja] Length = 114 Score = 83.6 bits (205), Expect = 2e-17 Identities = 36/108 (33%), Positives = 62/108 (57%) Frame = -1 Query: 516 LFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLSTCCCW 337 LF+ C FA++VW+ +L WL N + +L +G +G+K ++F +L+W +T CW Sbjct: 1 LFVFCPFAKQVWQGILNWLGYSFSLPNNIQELYLQLGMNIRGKKKRRFKHLLWHNT--CW 58 Query: 336 CIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSW 193 I +RN ++FRN E ++ + + IK +S W +Y+ F +SW Sbjct: 59 SIWCHRNNVIFRNAEVDVNNTILFIKSMSWQWVLYKSSGKPGFFFSSW 106 >gb|KHN04350.1| hypothetical protein glysoja_030944 [Glycine soja] Length = 229 Score = 81.6 bits (200), Expect = 1e-15 Identities = 38/110 (34%), Positives = 65/110 (59%) Frame = -1 Query: 519 HLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLSTCCC 340 H+F+ C A+++W+QV LWLD+ E + HF+ G+ KG+K K+ +LIW++ Sbjct: 112 HIFLDCRVAKKIWQQVCLWLDVPVVEGEDIQAHFMAFGKLIKGKKQKRVKHLIWMAV--I 169 Query: 339 WCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSWW 190 W I L RN ++F+ E A I +++ IK + +WF+ R G + + W+ Sbjct: 170 WNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAWFMARQGRTCWDGWSDWY 219 >gb|KHN35644.1| hypothetical protein glysoja_030996 [Glycine soja] Length = 160 Score = 76.6 bits (187), Expect = 2e-14 Identities = 36/99 (36%), Positives = 59/99 (59%) Frame = -1 Query: 519 HLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLSTCCC 340 H+ + C A+++W+QV LWLD+ E + HF+ G+ KG+K K+ +LIW++ Sbjct: 43 HILLDCRVAKKIWQQVCLWLDVPVVEGEDIQAHFMAFGKLIKGKKQKRVKHLIWMAV--I 100 Query: 339 WCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMG 223 W I L RN ++F+ E A I +++ IK + +WF R G Sbjct: 101 WNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAWFKARQG 139 >gb|PNY05892.1| ribonuclease H [Trifolium pratense] Length = 455 Score = 77.4 bits (189), Expect = 3e-13 Identities = 40/113 (35%), Positives = 56/113 (49%) Frame = -1 Query: 531 EDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLS 352 ED +HLF C ++VW+ + WL HF + G K +K +K +LIWL Sbjct: 334 EDCMHLFFNCKLMQQVWRSIYKWLGCAYYNYGEGWKHFNFFGGIVKSKKGEKVKHLIWLV 393 Query: 351 TCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSW 193 T WCI RN ++FR A+ A +V IK +S WFI R G + + W Sbjct: 394 TT--WCIWRLRNNIIFRGALADCAQLVDQIKLISWVWFIGRSGRHCPYLYSDW 444 >gb|KHN20429.1| hypothetical protein glysoja_044415, partial [Glycine soja] Length = 118 Score = 71.6 bits (174), Expect = 7e-13 Identities = 32/113 (28%), Positives = 58/113 (51%) Frame = -1 Query: 531 EDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLS 352 ED HLF+ CSF ++W VL WL + + F+W+G + +++K+ ++ W Sbjct: 3 EDKNHLFVNCSFNSKIWYVVLAWLGVSVVLPNDAKSLFIWMGGFVRVRRVKRLIFIFWHV 62 Query: 351 TCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSW 193 T WC+ RN ++F+++ + + HIK +S W + G + +SW Sbjct: 63 T--VWCLWNLRNQIIFKSDSIEFLACMAHIKIISWQWLFSKNGVKTSLFFSSW 113 >gb|PNX71113.1| pantothenate synthetase [Trifolium pratense] Length = 255 Score = 72.8 bits (177), Expect = 3e-12 Identities = 39/113 (34%), Positives = 59/113 (52%) Frame = -1 Query: 531 EDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLS 352 ED HLF +C F++ VW+ VL WL L + +DHFL GE K + +L+WL+ Sbjct: 134 EDGAHLFFSCYFSKVVWRNVLKWLGLSSPLDVEGIDHFLLFGEFFKVKDKGHVRHLVWLA 193 Query: 351 TCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSW 193 T W + RN ++F+ + + A ++ IK S WF R G + +SW Sbjct: 194 T--TWNLWKMRNKVIFKGDIPDTAVLLDSIKLFSWIWFNGRYGRNVCCPFSSW 244 >gb|KHN39553.1| hypothetical protein glysoja_045723, partial [Glycine soja] Length = 211 Score = 71.2 bits (173), Expect = 7e-12 Identities = 33/116 (28%), Positives = 58/116 (50%), Gaps = 1/116 (0%) Frame = -1 Query: 534 NEDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACK-GQKLKKFNYLIW 358 +E+ +HLF C F++ +WK++L W+ + V HF K K ++ W Sbjct: 92 DENVVHLFFHCDFSKCIWKEILSWIGIVDVIAVGGVQHFWEYDRLLKYNTSRNKVPFMFW 151 Query: 357 LSTCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSWW 190 L+T W I RN +F+ EE +I + IK + +WF+ ++G +++ WW Sbjct: 152 LATL--WIIWQVRNNSIFKEEEKDIPKTINQIKHICWAWFMGKVGGVTGSNISDWW 205 >gb|KHN24583.1| hypothetical protein glysoja_039590, partial [Glycine soja] Length = 102 Score = 67.0 bits (162), Expect = 3e-11 Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 1/109 (0%) Frame = -1 Query: 516 LFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLSTCCCW 337 LF+ CSF+ +VW V WL + + H+ G + + LK + +IW C CW Sbjct: 1 LFMDCSFSFQVWNSVFRWLGVSLVQ-----QHYSQFGLVFREKNLKILHRVIW--HCTCW 53 Query: 336 CIRLYRNGLVFRN-EEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSW 193 CI L+ N ++F+N A+ ++ HI LS +W Y+ S S +W Sbjct: 54 CIWLHHNKIMFQNGRRADACEIIQHIHALSWTWARYKGSLSSGLSFGAW 102 >gb|KHN10514.1| hypothetical protein glysoja_040669, partial [Glycine soja] Length = 132 Score = 67.8 bits (164), Expect = 3e-11 Identities = 32/87 (36%), Positives = 50/87 (57%) Frame = -1 Query: 531 EDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLS 352 ED HLF C+FA VW + WL + H++ G C+G++L K + IW + Sbjct: 48 EDVDHLFPGCNFAYNVWIAIYSWLGFVMIQHNQVKYHYVQHGLVCRGKRLSKVCHFIWHA 107 Query: 351 TCCCWCIRLYRNGLVFRNEEANIASVV 271 TC WC+ L+RN ++F+ E+A++ VV Sbjct: 108 TC--WCLWLHRNRIIFQEEQADVQLVV 132 >gb|PNY01502.1| ribonuclease H [Trifolium pratense] Length = 554 Score = 71.2 bits (173), Expect = 4e-11 Identities = 36/113 (31%), Positives = 59/113 (52%) Frame = -1 Query: 531 EDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLS 352 ED HLF +C F++ VW+ VL WL L +DHF+ G+ K + + +L+WL+ Sbjct: 433 EDGAHLFFSCYFSKVVWRNVLKWLGLSIPMDVEGIDHFMLFGDLFKVKDKGRVRHLVWLA 492 Query: 351 TCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSW 193 T W + RN ++F+ + ++++ IK S WF R G + +SW Sbjct: 493 T--TWNLWKLRNKVIFKGDIPETSALLDSIKLSSWMWFNGRYGRNVCCPFSSW 543 >gb|KHN41375.1| Putative ribonuclease H protein, partial [Glycine soja] Length = 363 Score = 70.9 bits (172), Expect = 4e-11 Identities = 33/116 (28%), Positives = 58/116 (50%), Gaps = 1/116 (0%) Frame = -1 Query: 534 NEDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACK-GQKLKKFNYLIW 358 +E+ +HLF C F++ +WK++L W+ + V HF K K ++ W Sbjct: 244 DENVVHLFFHCDFSKCIWKEILSWIGIVDVIAVGGVQHFWEYDRLLKYNTSRNKVPFMFW 303 Query: 357 LSTCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSWW 190 L+T W I RN +F+ EE +I + IK + +WF+ ++G +++ WW Sbjct: 304 LATL--WIIWQVRNNSIFKEEEKDIPKTINQIKHICWAWFMGKVGGVGGSNISDWW 357 >gb|KHN30886.1| Putative ribonuclease H protein, partial [Glycine soja] Length = 373 Score = 70.9 bits (172), Expect = 4e-11 Identities = 33/116 (28%), Positives = 58/116 (50%), Gaps = 1/116 (0%) Frame = -1 Query: 534 NEDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACK-GQKLKKFNYLIW 358 +E+ +HLF C F++ +WK++L W+ + V HF K K ++ W Sbjct: 258 DENVVHLFFHCDFSKCIWKEILSWIGIVDVIAVGGVQHFWEYDRLLKYNTSRNKVPFMFW 317 Query: 357 LSTCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSWW 190 L+T W I RN +F+ EE +I + IK + +WF+ ++G +++ WW Sbjct: 318 LATL--WIIWQVRNNSIFKEEEKDIPKTINQIKHICWAWFMGKVGGVGGSNISDWW 371 >dbj|GAU42970.1| hypothetical protein TSUD_188430 [Trifolium subterraneum] Length = 767 Score = 70.5 bits (171), Expect = 8e-11 Identities = 38/114 (33%), Positives = 57/114 (50%) Frame = -1 Query: 534 NEDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWL 355 +EDS HLF CS + VW++V W + + +HF G K ++ +K +LIWL Sbjct: 644 DEDSAHLFFHCSVLKRVWEEVFKWFGKSYQAEADGWNHFNIFGSLLKTKRFEKVRHLIWL 703 Query: 354 STCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSW 193 +T W I RN +VF + +S+V IK +S W R G+ + S W Sbjct: 704 AT--TWSIWKLRNNVVFNGVTLSSSSLVNDIKTISCLWLSGRYGHISSISFPDW 755 >ref|XP_003614519.1| hypothetical protein MTR_5g054985 [Medicago truncatula] gb|AES97477.1| hypothetical protein MTR_5g054985 [Medicago truncatula] Length = 130 Score = 65.5 bits (158), Expect = 2e-10 Identities = 39/116 (33%), Positives = 55/116 (47%), Gaps = 2/116 (1%) Frame = -1 Query: 531 EDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIG--EACKGQKLKKFNYLIW 358 ED+ HLF++C F ++W + WL Q N +DH K N LIW Sbjct: 8 EDAKHLFLSCDFFGKLWYDISYWLGYQLVFPENVLDHLYQFATFSGFSNSKRSSLN-LIW 66 Query: 357 LSTCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSWW 190 LS C W I L RN +F +EA+ ++ +K L W++ S+ FS SWW Sbjct: 67 LS--CVWVIWLERNARIFHQKEASFNQLLDKVK-LQSYWWLKVNRPSFVFSYHSWW 119 >gb|KRH23381.1| hypothetical protein GLYMA_13G353700 [Glycine max] Length = 114 Score = 64.7 bits (156), Expect = 3e-10 Identities = 30/110 (27%), Positives = 55/110 (50%), Gaps = 1/110 (0%) Frame = -1 Query: 516 LFITCSFAREVWKQVLLWLDLQAEELYNCVDH-FLWIGEACKGQKLKKFNYLIWLSTCCC 340 +F++C F+ +W QV WL + L +DH + +G + G K+ + W C Sbjct: 1 MFLSCPFSSAIWNQVFRWLGIHT-VLPRHIDHLYDQMGHSIGGATNKRIKLVFW--HAAC 57 Query: 339 WCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSWW 190 W +R RN ++F +EE ++ IK ++ W Y+ G + + +SW+ Sbjct: 58 WLLRNARNSVIFNSEEPEPGGILMAIKSIAWQWIAYKKGFAVGYQFSSWF 107 >gb|KHN32657.1| hypothetical protein glysoja_022339 [Glycine soja] Length = 114 Score = 63.9 bits (154), Expect = 6e-10 Identities = 30/110 (27%), Positives = 55/110 (50%), Gaps = 1/110 (0%) Frame = -1 Query: 516 LFITCSFAREVWKQVLLWLDLQAEELYNCVDH-FLWIGEACKGQKLKKFNYLIWLSTCCC 340 +F++C F+ +W QV WL + L +DH + +G + G K+ + W C Sbjct: 1 MFLSCPFSSAIWNQVFGWLGIHT-VLPRHIDHLYDQMGHSIGGATNKRIKLVFW--HAAC 57 Query: 339 WCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSWW 190 W +R RN ++F +EE ++ IK ++ W Y+ G + + +SW+ Sbjct: 58 WLLRNARNSVIFNSEEPEPGGILMAIKSIAWQWIAYKKGFAVGYQFSSWF 107 >dbj|GAU35033.1| hypothetical protein TSUD_103560 [Trifolium subterraneum] Length = 311 Score = 65.1 bits (157), Expect = 3e-09 Identities = 33/101 (32%), Positives = 52/101 (51%) Frame = -1 Query: 531 EDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLS 352 E S+HLF+ C FA +VW+Q++ WL + + V F + E G+K ++ +IW Sbjct: 192 ETSVHLFVYCHFATQVWEQIITWLGMVFMLPQSLVSFFSFFAETSGGKKRRQGLIMIW-- 249 Query: 351 TCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYR 229 W + RN ++F N ++ VV IK S W+I R Sbjct: 250 NAVVWALWRQRNRIIFENGTGDLNGVVEEIKVSSWKWWIGR 290 >dbj|GAU26034.1| hypothetical protein TSUD_224950 [Trifolium subterraneum] Length = 225 Score = 63.5 bits (153), Expect = 6e-09 Identities = 36/115 (31%), Positives = 57/115 (49%), Gaps = 1/115 (0%) Frame = -1 Query: 531 EDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKK-FNYLIWL 355 E + HLFI+CS +W V W+ + + +N DHFL + G +++ F LIWL Sbjct: 104 ETAQHLFISCSIFGSLWSSVRSWIGFSSVDPHNLTDHFLQFTFSSGGLSVRRSFLQLIWL 163 Query: 354 STCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSWW 190 C W I RN +FRN E ++ ++ +K L W++ + + SWW Sbjct: 164 --VCVWVIWNERNQRLFRNSEQSLPQLLDKVK-LYSYWWLKTTNINLVSNYHSWW 215 >dbj|GAU34177.1| hypothetical protein TSUD_162770 [Trifolium subterraneum] Length = 800 Score = 64.7 bits (156), Expect = 8e-09 Identities = 36/115 (31%), Positives = 57/115 (49%), Gaps = 1/115 (0%) Frame = -1 Query: 531 EDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKK-FNYLIWL 355 E + HLFI+CS +W V W+D + + +N DHFL + G +++ F L WL Sbjct: 679 ETAQHLFISCSIFGSLWSSVRSWIDFSSVDPHNLTDHFLQFTFSSGGLSVRRSFLQLTWL 738 Query: 354 STCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSWW 190 C W I RN +FRN E ++ ++ +K L W++ + + SWW Sbjct: 739 G--CVWVIWNERNQRLFRNSEQSLPQLLDKVK-LYSYWWLKTTNINLVSNYHSWW 790 >gb|PNX69227.1| hypothetical protein L195_g056601 [Trifolium pratense] Length = 120 Score = 60.8 bits (146), Expect = 1e-08 Identities = 33/97 (34%), Positives = 51/97 (52%) Frame = -1 Query: 519 HLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLSTCCC 340 HLF+ C+ A +VW Q++ WL L N V + + K ++ ++ LIW S Sbjct: 8 HLFLHCTIASKVWYQIMSWLGLVVIVPQNLVTSYGMLVGCGKDKRNRECLALIWNSLM-- 65 Query: 339 WCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYR 229 W I +RN +F N+EA + +V +K LS WF+ R Sbjct: 66 WVIWRFRNDCIFNNKEATVEEMVDEVKLLSWKWFMGR 102