BLASTX nr result

ID: Astragalus23_contig00031264 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00031264
         (473 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KHN04350.1| hypothetical protein glysoja_030944 [Glycine soja]     112   6e-28
gb|KHN35644.1| hypothetical protein glysoja_030996 [Glycine soja]     110   8e-28
gb|KHN18837.1| hypothetical protein glysoja_028206 [Glycine soja]     107   2e-27
gb|PNX71113.1| pantothenate synthetase [Trifolium pratense]            80   2e-15
dbj|GAU42970.1| hypothetical protein TSUD_188430 [Trifolium subt...    67   5e-14
gb|PNY01502.1| ribonuclease H [Trifolium pratense]                     77   2e-13
gb|PNY05892.1| ribonuclease H [Trifolium pratense]                     75   9e-13
gb|PNX84435.1| hypothetical protein L195_g040495, partial [Trifo...    69   6e-11
dbj|GAU48210.1| hypothetical protein TSUD_404970 [Trifolium subt...    64   3e-10
gb|KHN39553.1| hypothetical protein glysoja_045723, partial [Gly...    59   1e-07
ref|XP_003600144.1| hypothetical protein MTR_3g052690 [Medicago ...    49   3e-07
gb|KHN41375.1| Putative ribonuclease H protein, partial [Glycine...    59   3e-07
gb|KHN03945.1| hypothetical protein glysoja_022631, partial [Gly...    49   3e-07
gb|PNY15032.1| ribonuclease H, partial [Trifolium pratense]            59   4e-07
gb|KHN04679.1| hypothetical protein glysoja_046613, partial [Gly...    54   1e-06
gb|KHN20559.1| hypothetical protein glysoja_033247, partial [Gly...    53   2e-06
dbj|GAU46753.1| hypothetical protein TSUD_402790 [Trifolium subt...    46   2e-06
gb|KRH56422.1| hypothetical protein GLYMA_06G322900 [Glycine max]      53   3e-06

>gb|KHN04350.1| hypothetical protein glysoja_030944 [Glycine soja]
          Length = 229

 Score =  112 bits (281), Expect = 6e-28
 Identities = 47/85 (55%), Positives = 63/85 (74%)
 Frame = +1

Query: 88  FFGFWENSKGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWFK 267
           F  F +  KG++ K V+HLIW+AV WNIW+ RNK++FK E A++P MI+GI D AW WF 
Sbjct: 145 FMAFGKLIKGKKQKRVKHLIWMAVIWNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAWFM 204

Query: 268 ARRGRSRDLGWTDWYHGPFGCLDSM 342
           AR+GR+   GW+DWY+ P GCL S+
Sbjct: 205 ARQGRTCWDGWSDWYNCPMGCLLSL 229



 Score = 55.8 bits (133), Expect = 2e-06
 Identities = 20/44 (45%), Positives = 31/44 (70%)
 Frame = +2

Query: 5   CRVIKRVWQHVSTWIGAHIKDSEDMVTHFLDFGKIVKGKKQREL 136
           CRV K++WQ V  W+   + + ED+  HF+ FGK++KGKKQ+ +
Sbjct: 117 CRVAKKIWQQVCLWLDVPVVEGEDIQAHFMAFGKLIKGKKQKRV 160


>gb|KHN35644.1| hypothetical protein glysoja_030996 [Glycine soja]
          Length = 160

 Score =  110 bits (275), Expect = 8e-28
 Identities = 47/85 (55%), Positives = 63/85 (74%)
 Frame = +1

Query: 88  FFGFWENSKGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWFK 267
           F  F +  KG++ K V+HLIW+AV WNIW+ RNK++FK E A++P MI+GI D AW WFK
Sbjct: 76  FMAFGKLIKGKKQKRVKHLIWMAVIWNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAWFK 135

Query: 268 ARRGRSRDLGWTDWYHGPFGCLDSM 342
           AR+GR   +GW+D Y+ P GCL S+
Sbjct: 136 ARQGRICWVGWSDSYNCPMGCLLSL 160



 Score = 55.8 bits (133), Expect = 1e-06
 Identities = 20/44 (45%), Positives = 31/44 (70%)
 Frame = +2

Query: 5   CRVIKRVWQHVSTWIGAHIKDSEDMVTHFLDFGKIVKGKKQREL 136
           CRV K++WQ V  W+   + + ED+  HF+ FGK++KGKKQ+ +
Sbjct: 48  CRVAKKIWQQVCLWLDVPVVEGEDIQAHFMAFGKLIKGKKQKRV 91


>gb|KHN18837.1| hypothetical protein glysoja_028206 [Glycine soja]
          Length = 84

 Score =  107 bits (266), Expect = 2e-27
 Identities = 44/77 (57%), Positives = 59/77 (76%)
 Frame = +1

Query: 112 KGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWFKARRGRSRD 291
           KG++ K V+HLIW+AV WNIW+ RNK++FK E A++P MI+GI D AW WF AR+GR+  
Sbjct: 8   KGKKQKRVKHLIWMAVIWNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAWFMARQGRTCW 67

Query: 292 LGWTDWYHGPFGCLDSM 342
            GW+ WY+ P GCL S+
Sbjct: 68  DGWSYWYNCPMGCLLSL 84


>gb|PNX71113.1| pantothenate synthetase [Trifolium pratense]
          Length = 255

 Score = 80.5 bits (197), Expect = 2e-15
 Identities = 34/86 (39%), Positives = 49/86 (56%)
 Frame = +1

Query: 82  DSFFGFWENSKGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCW 261
           D F  F E  K ++   VRHL+WLA TWN+W MRNK++FKG++     ++  I   +W W
Sbjct: 169 DHFLLFGEFFKVKDKGHVRHLVWLATTWNLWKMRNKVIFKGDIPDTAVLLDSIKLFSWIW 228

Query: 262 FKARRGRSRDLGWTDWYHGPFGCLDS 339
           F  R GR+    ++ W   P  C+ S
Sbjct: 229 FNGRYGRNVCCPFSSWCLDPISCIQS 254


>dbj|GAU42970.1| hypothetical protein TSUD_188430 [Trifolium subterraneum]
          Length = 767

 Score = 67.0 bits (162), Expect(2) = 5e-14
 Identities = 29/77 (37%), Positives = 41/77 (53%)
 Frame = +1

Query: 112 KGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWFKARRGRSRD 291
           K +  + VRHLIWLA TW+IW +RN +VF G   S  +++  I  I+  W   R G    
Sbjct: 690 KTKRFEKVRHLIWLATTWSIWKLRNNVVFNGVTLSSSSLVNDIKTISCLWLSGRYGHISS 749

Query: 292 LGWTDWYHGPFGCLDSM 342
           + + DW   P  C  S+
Sbjct: 750 ISFPDWCFDPMTCFQSI 766



 Score = 38.1 bits (87), Expect(2) = 5e-14
 Identities = 16/40 (40%), Positives = 22/40 (55%)
 Frame = +2

Query: 5   CRVIKRVWQHVSTWIGAHIKDSEDMVTHFLDFGKIVKGKK 124
           C V+KRVW+ V  W G   +   D   HF  FG ++K K+
Sbjct: 654 CSVLKRVWEEVFKWFGKSYQAEADGWNHFNIFGSLLKTKR 693


>gb|PNY01502.1| ribonuclease H [Trifolium pratense]
          Length = 554

 Score = 77.0 bits (188), Expect = 2e-13
 Identities = 32/86 (37%), Positives = 49/86 (56%)
 Frame = +1

Query: 82  DSFFGFWENSKGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCW 261
           D F  F +  K ++   VRHL+WLA TWN+W +RNK++FKG++     ++  I   +W W
Sbjct: 468 DHFMLFGDLFKVKDKGRVRHLVWLATTWNLWKLRNKVIFKGDIPETSALLDSIKLSSWMW 527

Query: 262 FKARRGRSRDLGWTDWYHGPFGCLDS 339
           F  R GR+    ++ W   P  C+ S
Sbjct: 528 FNGRYGRNVCCPFSSWCLDPMSCVQS 553


>gb|PNY05892.1| ribonuclease H [Trifolium pratense]
          Length = 455

 Score = 75.1 bits (183), Expect = 9e-13
 Identities = 38/103 (36%), Positives = 57/103 (55%)
 Frame = +1

Query: 34  CFYVDWGAHKGLRRHGDSFFGFWENSKGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVA 213
           C Y ++G      +H + F G  ++ KG++   V+HLIWL  TW IW +RN I+F+G +A
Sbjct: 359 CAYYNYGEGW---KHFNFFGGIVKSKKGEK---VKHLIWLVTTWCIWRLRNNIIFRGALA 412

Query: 214 SVPTMIAGIMDIAWCWFKARRGRSRDLGWTDWYHGPFGCLDSM 342
               ++  I  I+W WF  R GR     ++DW   P  C+ SM
Sbjct: 413 DCAQLVDQIKLISWVWFIGRSGRHCPYLYSDWCVNPLECILSM 455


>gb|PNX84435.1| hypothetical protein L195_g040495, partial [Trifolium pratense]
          Length = 248

 Score = 68.6 bits (166), Expect = 6e-11
 Identities = 29/87 (33%), Positives = 47/87 (54%), Gaps = 2/87 (2%)
 Frame = +1

Query: 88  FFGFWENSKGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWFK 267
           F  F +    ++ + VRHL WLA TWNIW +RN +VF G + +  +++  I+  +W WF 
Sbjct: 162 FLLFDDMITAKKGERVRHLFWLATTWNIWKLRNNVVFNGVIPNASSLVEDIIANSWLWFN 221

Query: 268 ARRGR--SRDLGWTDWYHGPFGCLDSM 342
            R G      + +++W H    C+  M
Sbjct: 222 GRYGHHSCTSMSFSNWCHDHMTCIQRM 248


>dbj|GAU48210.1| hypothetical protein TSUD_404970 [Trifolium subterraneum]
          Length = 1653

 Score = 64.3 bits (155), Expect(2) = 3e-10
 Identities = 30/80 (37%), Positives = 41/80 (51%)
 Frame = +1

Query: 91   FGFWENSKGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWFKA 270
            FG   NS   E   VRHLIWLA TWN+W +RN ++F G   S  +++  I  I+  W   
Sbjct: 1571 FGDMVNSTNIER--VRHLIWLATTWNLWKLRNNVIFNGATPSASSLLNDIKAISCAWVSG 1628

Query: 271  RRGRSRDLGWTDWYHGPFGC 330
            R G    + ++ W   P  C
Sbjct: 1629 RYGHKSCISFSLWCFDPLAC 1648



 Score = 27.7 bits (60), Expect(2) = 3e-10
 Identities = 12/36 (33%), Positives = 17/36 (47%)
 Frame = +2

Query: 5    CRVIKRVWQHVSTWIGAHIKDSEDMVTHFLDFGKIV 112
            C  +  VW+ V  WIG       +  +HF  FG +V
Sbjct: 1540 CSFVNCVWEAVYNWIGKDYHAGAEGWSHFKVFGDMV 1575


>gb|KHN39553.1| hypothetical protein glysoja_045723, partial [Glycine soja]
          Length = 211

 Score = 59.3 bits (142), Expect = 1e-07
 Identities = 28/82 (34%), Positives = 37/82 (45%), Gaps = 4/82 (4%)
 Frame = +1

Query: 97  FWENSK----GQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWF 264
           FWE  +          V  + WLA  W IW +RN  +FK E   +P  I  I  I W WF
Sbjct: 130 FWEYDRLLKYNTSRNKVPFMFWLATLWIIWQVRNNSIFKEEEKDIPKTINQIKHICWAWF 189

Query: 265 KARRGRSRDLGWTDWYHGPFGC 330
             + G       +DW++ PF C
Sbjct: 190 MGKVGGVTGSNISDWWNSPFLC 211


>ref|XP_003600144.1| hypothetical protein MTR_3g052690 [Medicago truncatula]
 gb|AES70395.1| hypothetical protein MTR_3g052690 [Medicago truncatula]
          Length = 149

 Score = 48.9 bits (115), Expect(2) = 3e-07
 Identities = 29/84 (34%), Positives = 41/84 (48%), Gaps = 1/84 (1%)
 Frame = +1

Query: 82  DSFFGFWENSK-GQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWC 258
           D F  F  +S  G+    + HLIW A  W IW  RN  +F+G+  S   M+  I  +++ 
Sbjct: 63  DHFHQFGTSSGYGKLRCSLMHLIWFATVWEIWKERNDRIFRGQERSHYQMLEAIKLLSFW 122

Query: 259 WFKARRGRSRDLGWTDWYHGPFGC 330
           WFKA +       + DW   PF C
Sbjct: 123 WFKA-KFTVFPYCFHDWCQAPFLC 145



 Score = 33.1 bits (74), Expect(2) = 3e-07
 Identities = 11/33 (33%), Positives = 18/33 (54%)
 Frame = +2

Query: 5   CRVIKRVWQHVSTWIGAHIKDSEDMVTHFLDFG 103
           C +   +WQ +  W+G +  D  ++V HF  FG
Sbjct: 37  CTLFGHIWQLIRNWLGVYSADPGNIVDHFHQFG 69


>gb|KHN41375.1| Putative ribonuclease H protein, partial [Glycine soja]
          Length = 363

 Score = 58.9 bits (141), Expect = 3e-07
 Identities = 28/82 (34%), Positives = 37/82 (45%), Gaps = 4/82 (4%)
 Frame = +1

Query: 97  FWENSK----GQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWF 264
           FWE  +          V  + WLA  W IW +RN  +FK E   +P  I  I  I W WF
Sbjct: 282 FWEYDRLLKYNTSRNKVPFMFWLATLWIIWQVRNNSIFKEEEKDIPKTINQIKHICWAWF 341

Query: 265 KARRGRSRDLGWTDWYHGPFGC 330
             + G       +DW++ PF C
Sbjct: 342 MGKVGGVGGSNISDWWNSPFLC 363


>gb|KHN03945.1| hypothetical protein glysoja_022631, partial [Glycine soja]
          Length = 114

 Score = 49.3 bits (116), Expect(2) = 3e-07
 Identities = 20/76 (26%), Positives = 35/76 (46%)
 Frame = +1

Query: 106 NSKGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWFKARRGRS 285
           N +G++ +  +HL+W    W+IW  RN ++F+     V   I  I  ++W W   +    
Sbjct: 39  NIRGKKKRRFKHLLWHNTCWSIWCHRNNVIFRNAEVDVNNTILFIKSMSWQWVLYKSSGK 98

Query: 286 RDLGWTDWYHGPFGCL 333
               ++ W   P  CL
Sbjct: 99  PGFFFSSWCLCPLDCL 114



 Score = 32.7 bits (73), Expect(2) = 3e-07
 Identities = 13/42 (30%), Positives = 22/42 (52%)
 Frame = +2

Query: 5   CRVIKRVWQHVSTWIGAHIKDSEDMVTHFLDFGKIVKGKKQR 130
           C   K+VWQ +  W+G       ++   +L  G  ++GKK+R
Sbjct: 5   CPFAKQVWQGILNWLGYSFSLPNNIQELYLQLGMNIRGKKKR 46


>gb|PNY15032.1| ribonuclease H, partial [Trifolium pratense]
          Length = 1490

 Score = 58.9 bits (141), Expect = 4e-07
 Identities = 28/69 (40%), Positives = 40/69 (57%)
 Frame = +1

Query: 76   HGDSFFGFWENSKGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAW 255
            H  + FG   N K +    VR+L+WLA +WNIW  RN ++FKG +  V +++  I   +W
Sbjct: 1417 HHFTLFGDLFNVKDKGC--VRYLVWLATSWNIWKFRNLVIFKGVLPDVSSVVDAIKLSSW 1474

Query: 256  CWFKARRGR 282
             WF  R GR
Sbjct: 1475 VWFTNRYGR 1483


>gb|KHN04679.1| hypothetical protein glysoja_046613, partial [Glycine soja]
          Length = 103

 Score = 54.3 bits (129), Expect = 1e-06
 Identities = 25/74 (33%), Positives = 40/74 (54%)
 Frame = +1

Query: 112 KGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWFKARRGRSRD 291
           KG +++GV   IW+A  W+IW  RN I+F+ +      +I  I   +W W KA+  +   
Sbjct: 31  KGVKSRGVLMFIWVAAVWSIWNHRNVIIFRNQQPCAEYVIEEIKSKSWGWIKAKY-KCFQ 89

Query: 292 LGWTDWYHGPFGCL 333
             + +W+  PF CL
Sbjct: 90  SSYYEWHSQPFLCL 103


>gb|KHN20559.1| hypothetical protein glysoja_033247, partial [Glycine soja]
          Length = 86

 Score = 53.1 bits (126), Expect = 2e-06
 Identities = 23/74 (31%), Positives = 38/74 (51%), Gaps = 1/74 (1%)
 Frame = +1

Query: 112 KGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGE-VASVPTMIAGIMDIAWCWFKARRGRSR 288
           +G++ + VR L+W A  W +W+ RN ++FK      V  ++  I  I+W W K +   S 
Sbjct: 13  RGKKLRRVRLLVWHATCWCLWLYRNSVIFKDNFFPDVQNVVYHIQRISWTWMKYKGHGSS 72

Query: 289 DLGWTDWYHGPFGC 330
            L + +W   P  C
Sbjct: 73  SLSFANWCTSPLLC 86


>dbj|GAU46753.1| hypothetical protein TSUD_402790 [Trifolium subterraneum]
          Length = 592

 Score = 45.8 bits (107), Expect(2) = 2e-06
 Identities = 22/64 (34%), Positives = 33/64 (51%)
 Frame = +1

Query: 142 LIWLAVTWNIWIMRNKIVFKGEVASVPTMIAGIMDIAWCWFKARRGRSRDLGWTDWYHGP 321
           LIWL   W +W  RN   F G   S+  M+  I + ++ W KA+R  +  L +  W++ P
Sbjct: 527 LIWLVSVWLVWNERNSRCFSGSANSLQHMLDKIKNYSYRWLKAKR-CTLALNYHSWWYSP 585

Query: 322 FGCL 333
             CL
Sbjct: 586 LICL 589



 Score = 33.1 bits (74), Expect(2) = 2e-06
 Identities = 15/50 (30%), Positives = 22/50 (44%)
 Frame = +2

Query: 5   CRVIKRVWQHVSTWIGAHIKDSEDMVTHFLDFGKIVKGKKQRELGISFGW 154
           C     +W   S+WIG+ + DS  +  HF  F   V G + R   +   W
Sbjct: 480 CSTFGPLWPMASSWIGSPLVDSHTIPDHFAQFTLSVGGSRGRRSFMQLIW 529


>gb|KRH56422.1| hypothetical protein GLYMA_06G322900 [Glycine max]
          Length = 93

 Score = 53.1 bits (126), Expect = 3e-06
 Identities = 23/74 (31%), Positives = 38/74 (51%), Gaps = 1/74 (1%)
 Frame = +1

Query: 112 KGQEAKGVRHLIWLAVTWNIWIMRNKIVFKGE-VASVPTMIAGIMDIAWCWFKARRGRSR 288
           +G++ + VR L+W A  W +W+ RN ++FK      V  ++  I  I+W W K +   S 
Sbjct: 19  RGKKLRRVRLLVWHATCWCLWLYRNSVIFKDNFFPDVQNVVYHIQRISWTWMKYKGHGSS 78

Query: 289 DLGWTDWYHGPFGC 330
            L + +W   P  C
Sbjct: 79  SLSFANWCTSPLLC 92

