BLASTX nr result

ID: Astragalus22_contig00020873 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00020873
         (540 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KHN03945.1| hypothetical protein glysoja_022631, partial [Gly...    84   2e-17
gb|KHN04350.1| hypothetical protein glysoja_030944 [Glycine soja]      82   1e-15
gb|KHN35644.1| hypothetical protein glysoja_030996 [Glycine soja]      77   2e-14
gb|PNY05892.1| ribonuclease H [Trifolium pratense]                     77   3e-13
gb|KHN20429.1| hypothetical protein glysoja_044415, partial [Gly...    72   7e-13
gb|PNX71113.1| pantothenate synthetase [Trifolium pratense]            73   3e-12
gb|KHN39553.1| hypothetical protein glysoja_045723, partial [Gly...    71   7e-12
gb|KHN24583.1| hypothetical protein glysoja_039590, partial [Gly...    67   3e-11
gb|KHN10514.1| hypothetical protein glysoja_040669, partial [Gly...    68   3e-11
gb|PNY01502.1| ribonuclease H [Trifolium pratense]                     71   4e-11
gb|KHN41375.1| Putative ribonuclease H protein, partial [Glycine...    71   4e-11
gb|KHN30886.1| Putative ribonuclease H protein, partial [Glycine...    71   4e-11
dbj|GAU42970.1| hypothetical protein TSUD_188430 [Trifolium subt...    70   8e-11
ref|XP_003614519.1| hypothetical protein MTR_5g054985 [Medicago ...    65   2e-10
gb|KRH23381.1| hypothetical protein GLYMA_13G353700 [Glycine max]      65   3e-10
gb|KHN32657.1| hypothetical protein glysoja_022339 [Glycine soja]      64   6e-10
dbj|GAU35033.1| hypothetical protein TSUD_103560 [Trifolium subt...    65   3e-09
dbj|GAU26034.1| hypothetical protein TSUD_224950 [Trifolium subt...    64   6e-09
dbj|GAU34177.1| hypothetical protein TSUD_162770 [Trifolium subt...    65   8e-09
gb|PNX69227.1| hypothetical protein L195_g056601 [Trifolium prat...    61   1e-08

>gb|KHN03945.1| hypothetical protein glysoja_022631, partial [Glycine soja]
          Length = 114

 Score = 83.6 bits (205), Expect = 2e-17
 Identities = 36/108 (33%), Positives = 62/108 (57%)
 Frame = -1

Query: 516 LFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLSTCCCW 337
           LF+ C FA++VW+ +L WL        N  + +L +G   +G+K ++F +L+W +T  CW
Sbjct: 1   LFVFCPFAKQVWQGILNWLGYSFSLPNNIQELYLQLGMNIRGKKKRRFKHLLWHNT--CW 58

Query: 336 CIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSW 193
            I  +RN ++FRN E ++ + +  IK +S  W +Y+      F  +SW
Sbjct: 59  SIWCHRNNVIFRNAEVDVNNTILFIKSMSWQWVLYKSSGKPGFFFSSW 106


>gb|KHN04350.1| hypothetical protein glysoja_030944 [Glycine soja]
          Length = 229

 Score = 81.6 bits (200), Expect = 1e-15
 Identities = 38/110 (34%), Positives = 65/110 (59%)
 Frame = -1

Query: 519 HLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLSTCCC 340
           H+F+ C  A+++W+QV LWLD+   E  +   HF+  G+  KG+K K+  +LIW++    
Sbjct: 112 HIFLDCRVAKKIWQQVCLWLDVPVVEGEDIQAHFMAFGKLIKGKKQKRVKHLIWMAV--I 169

Query: 339 WCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSWW 190
           W I L RN ++F+ E A I  +++ IK  + +WF+ R G +     + W+
Sbjct: 170 WNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAWFMARQGRTCWDGWSDWY 219


>gb|KHN35644.1| hypothetical protein glysoja_030996 [Glycine soja]
          Length = 160

 Score = 76.6 bits (187), Expect = 2e-14
 Identities = 36/99 (36%), Positives = 59/99 (59%)
 Frame = -1

Query: 519 HLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLSTCCC 340
           H+ + C  A+++W+QV LWLD+   E  +   HF+  G+  KG+K K+  +LIW++    
Sbjct: 43  HILLDCRVAKKIWQQVCLWLDVPVVEGEDIQAHFMAFGKLIKGKKQKRVKHLIWMAV--I 100

Query: 339 WCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMG 223
           W I L RN ++F+ E A I  +++ IK  + +WF  R G
Sbjct: 101 WNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAWFKARQG 139


>gb|PNY05892.1| ribonuclease H [Trifolium pratense]
          Length = 455

 Score = 77.4 bits (189), Expect = 3e-13
 Identities = 40/113 (35%), Positives = 56/113 (49%)
 Frame = -1

Query: 531 EDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLS 352
           ED +HLF  C   ++VW+ +  WL            HF + G   K +K +K  +LIWL 
Sbjct: 334 EDCMHLFFNCKLMQQVWRSIYKWLGCAYYNYGEGWKHFNFFGGIVKSKKGEKVKHLIWLV 393

Query: 351 TCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSW 193
           T   WCI   RN ++FR   A+ A +V  IK +S  WFI R G    +  + W
Sbjct: 394 TT--WCIWRLRNNIIFRGALADCAQLVDQIKLISWVWFIGRSGRHCPYLYSDW 444


>gb|KHN20429.1| hypothetical protein glysoja_044415, partial [Glycine soja]
          Length = 118

 Score = 71.6 bits (174), Expect = 7e-13
 Identities = 32/113 (28%), Positives = 58/113 (51%)
 Frame = -1

Query: 531 EDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLS 352
           ED  HLF+ CSF  ++W  VL WL +      +    F+W+G   + +++K+  ++ W  
Sbjct: 3   EDKNHLFVNCSFNSKIWYVVLAWLGVSVVLPNDAKSLFIWMGGFVRVRRVKRLIFIFWHV 62

Query: 351 TCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSW 193
           T   WC+   RN ++F+++     + + HIK +S  W   + G   +   +SW
Sbjct: 63  T--VWCLWNLRNQIIFKSDSIEFLACMAHIKIISWQWLFSKNGVKTSLFFSSW 113


>gb|PNX71113.1| pantothenate synthetase [Trifolium pratense]
          Length = 255

 Score = 72.8 bits (177), Expect = 3e-12
 Identities = 39/113 (34%), Positives = 59/113 (52%)
 Frame = -1

Query: 531 EDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLS 352
           ED  HLF +C F++ VW+ VL WL L +      +DHFL  GE  K +      +L+WL+
Sbjct: 134 EDGAHLFFSCYFSKVVWRNVLKWLGLSSPLDVEGIDHFLLFGEFFKVKDKGHVRHLVWLA 193

Query: 351 TCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSW 193
           T   W +   RN ++F+ +  + A ++  IK  S  WF  R G +     +SW
Sbjct: 194 T--TWNLWKMRNKVIFKGDIPDTAVLLDSIKLFSWIWFNGRYGRNVCCPFSSW 244


>gb|KHN39553.1| hypothetical protein glysoja_045723, partial [Glycine soja]
          Length = 211

 Score = 71.2 bits (173), Expect = 7e-12
 Identities = 33/116 (28%), Positives = 58/116 (50%), Gaps = 1/116 (0%)
 Frame = -1

Query: 534 NEDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACK-GQKLKKFNYLIW 358
           +E+ +HLF  C F++ +WK++L W+ +        V HF       K      K  ++ W
Sbjct: 92  DENVVHLFFHCDFSKCIWKEILSWIGIVDVIAVGGVQHFWEYDRLLKYNTSRNKVPFMFW 151

Query: 357 LSTCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSWW 190
           L+T   W I   RN  +F+ EE +I   +  IK +  +WF+ ++G     +++ WW
Sbjct: 152 LATL--WIIWQVRNNSIFKEEEKDIPKTINQIKHICWAWFMGKVGGVTGSNISDWW 205


>gb|KHN24583.1| hypothetical protein glysoja_039590, partial [Glycine soja]
          Length = 102

 Score = 67.0 bits (162), Expect = 3e-11
 Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 1/109 (0%)
 Frame = -1

Query: 516 LFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLSTCCCW 337
           LF+ CSF+ +VW  V  WL +   +      H+   G   + + LK  + +IW   C CW
Sbjct: 1   LFMDCSFSFQVWNSVFRWLGVSLVQ-----QHYSQFGLVFREKNLKILHRVIW--HCTCW 53

Query: 336 CIRLYRNGLVFRN-EEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSW 193
           CI L+ N ++F+N   A+   ++ HI  LS +W  Y+   S   S  +W
Sbjct: 54  CIWLHHNKIMFQNGRRADACEIIQHIHALSWTWARYKGSLSSGLSFGAW 102


>gb|KHN10514.1| hypothetical protein glysoja_040669, partial [Glycine soja]
          Length = 132

 Score = 67.8 bits (164), Expect = 3e-11
 Identities = 32/87 (36%), Positives = 50/87 (57%)
 Frame = -1

Query: 531 EDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLS 352
           ED  HLF  C+FA  VW  +  WL     +      H++  G  C+G++L K  + IW +
Sbjct: 48  EDVDHLFPGCNFAYNVWIAIYSWLGFVMIQHNQVKYHYVQHGLVCRGKRLSKVCHFIWHA 107

Query: 351 TCCCWCIRLYRNGLVFRNEEANIASVV 271
           TC  WC+ L+RN ++F+ E+A++  VV
Sbjct: 108 TC--WCLWLHRNRIIFQEEQADVQLVV 132


>gb|PNY01502.1| ribonuclease H [Trifolium pratense]
          Length = 554

 Score = 71.2 bits (173), Expect = 4e-11
 Identities = 36/113 (31%), Positives = 59/113 (52%)
 Frame = -1

Query: 531 EDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLS 352
           ED  HLF +C F++ VW+ VL WL L        +DHF+  G+  K +   +  +L+WL+
Sbjct: 433 EDGAHLFFSCYFSKVVWRNVLKWLGLSIPMDVEGIDHFMLFGDLFKVKDKGRVRHLVWLA 492

Query: 351 TCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSW 193
           T   W +   RN ++F+ +    ++++  IK  S  WF  R G +     +SW
Sbjct: 493 T--TWNLWKLRNKVIFKGDIPETSALLDSIKLSSWMWFNGRYGRNVCCPFSSW 543


>gb|KHN41375.1| Putative ribonuclease H protein, partial [Glycine soja]
          Length = 363

 Score = 70.9 bits (172), Expect = 4e-11
 Identities = 33/116 (28%), Positives = 58/116 (50%), Gaps = 1/116 (0%)
 Frame = -1

Query: 534 NEDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACK-GQKLKKFNYLIW 358
           +E+ +HLF  C F++ +WK++L W+ +        V HF       K      K  ++ W
Sbjct: 244 DENVVHLFFHCDFSKCIWKEILSWIGIVDVIAVGGVQHFWEYDRLLKYNTSRNKVPFMFW 303

Query: 357 LSTCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSWW 190
           L+T   W I   RN  +F+ EE +I   +  IK +  +WF+ ++G     +++ WW
Sbjct: 304 LATL--WIIWQVRNNSIFKEEEKDIPKTINQIKHICWAWFMGKVGGVGGSNISDWW 357


>gb|KHN30886.1| Putative ribonuclease H protein, partial [Glycine soja]
          Length = 373

 Score = 70.9 bits (172), Expect = 4e-11
 Identities = 33/116 (28%), Positives = 58/116 (50%), Gaps = 1/116 (0%)
 Frame = -1

Query: 534 NEDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACK-GQKLKKFNYLIW 358
           +E+ +HLF  C F++ +WK++L W+ +        V HF       K      K  ++ W
Sbjct: 258 DENVVHLFFHCDFSKCIWKEILSWIGIVDVIAVGGVQHFWEYDRLLKYNTSRNKVPFMFW 317

Query: 357 LSTCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSWW 190
           L+T   W I   RN  +F+ EE +I   +  IK +  +WF+ ++G     +++ WW
Sbjct: 318 LATL--WIIWQVRNNSIFKEEEKDIPKTINQIKHICWAWFMGKVGGVGGSNISDWW 371


>dbj|GAU42970.1| hypothetical protein TSUD_188430 [Trifolium subterraneum]
          Length = 767

 Score = 70.5 bits (171), Expect = 8e-11
 Identities = 38/114 (33%), Positives = 57/114 (50%)
 Frame = -1

Query: 534 NEDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWL 355
           +EDS HLF  CS  + VW++V  W     +   +  +HF   G   K ++ +K  +LIWL
Sbjct: 644 DEDSAHLFFHCSVLKRVWEEVFKWFGKSYQAEADGWNHFNIFGSLLKTKRFEKVRHLIWL 703

Query: 354 STCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSW 193
           +T   W I   RN +VF     + +S+V  IK +S  W   R G+  + S   W
Sbjct: 704 AT--TWSIWKLRNNVVFNGVTLSSSSLVNDIKTISCLWLSGRYGHISSISFPDW 755


>ref|XP_003614519.1| hypothetical protein MTR_5g054985 [Medicago truncatula]
 gb|AES97477.1| hypothetical protein MTR_5g054985 [Medicago truncatula]
          Length = 130

 Score = 65.5 bits (158), Expect = 2e-10
 Identities = 39/116 (33%), Positives = 55/116 (47%), Gaps = 2/116 (1%)
 Frame = -1

Query: 531 EDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIG--EACKGQKLKKFNYLIW 358
           ED+ HLF++C F  ++W  +  WL  Q     N +DH             K    N LIW
Sbjct: 8   EDAKHLFLSCDFFGKLWYDISYWLGYQLVFPENVLDHLYQFATFSGFSNSKRSSLN-LIW 66

Query: 357 LSTCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSWW 190
           LS  C W I L RN  +F  +EA+   ++  +K L   W++     S+ FS  SWW
Sbjct: 67  LS--CVWVIWLERNARIFHQKEASFNQLLDKVK-LQSYWWLKVNRPSFVFSYHSWW 119


>gb|KRH23381.1| hypothetical protein GLYMA_13G353700 [Glycine max]
          Length = 114

 Score = 64.7 bits (156), Expect = 3e-10
 Identities = 30/110 (27%), Positives = 55/110 (50%), Gaps = 1/110 (0%)
 Frame = -1

Query: 516 LFITCSFAREVWKQVLLWLDLQAEELYNCVDH-FLWIGEACKGQKLKKFNYLIWLSTCCC 340
           +F++C F+  +W QV  WL +    L   +DH +  +G +  G   K+   + W     C
Sbjct: 1   MFLSCPFSSAIWNQVFRWLGIHT-VLPRHIDHLYDQMGHSIGGATNKRIKLVFW--HAAC 57

Query: 339 WCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSWW 190
           W +R  RN ++F +EE     ++  IK ++  W  Y+ G +  +  +SW+
Sbjct: 58  WLLRNARNSVIFNSEEPEPGGILMAIKSIAWQWIAYKKGFAVGYQFSSWF 107


>gb|KHN32657.1| hypothetical protein glysoja_022339 [Glycine soja]
          Length = 114

 Score = 63.9 bits (154), Expect = 6e-10
 Identities = 30/110 (27%), Positives = 55/110 (50%), Gaps = 1/110 (0%)
 Frame = -1

Query: 516 LFITCSFAREVWKQVLLWLDLQAEELYNCVDH-FLWIGEACKGQKLKKFNYLIWLSTCCC 340
           +F++C F+  +W QV  WL +    L   +DH +  +G +  G   K+   + W     C
Sbjct: 1   MFLSCPFSSAIWNQVFGWLGIHT-VLPRHIDHLYDQMGHSIGGATNKRIKLVFW--HAAC 57

Query: 339 WCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSWW 190
           W +R  RN ++F +EE     ++  IK ++  W  Y+ G +  +  +SW+
Sbjct: 58  WLLRNARNSVIFNSEEPEPGGILMAIKSIAWQWIAYKKGFAVGYQFSSWF 107


>dbj|GAU35033.1| hypothetical protein TSUD_103560 [Trifolium subterraneum]
          Length = 311

 Score = 65.1 bits (157), Expect = 3e-09
 Identities = 33/101 (32%), Positives = 52/101 (51%)
 Frame = -1

Query: 531 EDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLS 352
           E S+HLF+ C FA +VW+Q++ WL +      + V  F +  E   G+K ++   +IW  
Sbjct: 192 ETSVHLFVYCHFATQVWEQIITWLGMVFMLPQSLVSFFSFFAETSGGKKRRQGLIMIW-- 249

Query: 351 TCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYR 229
               W +   RN ++F N   ++  VV  IK  S  W+I R
Sbjct: 250 NAVVWALWRQRNRIIFENGTGDLNGVVEEIKVSSWKWWIGR 290


>dbj|GAU26034.1| hypothetical protein TSUD_224950 [Trifolium subterraneum]
          Length = 225

 Score = 63.5 bits (153), Expect = 6e-09
 Identities = 36/115 (31%), Positives = 57/115 (49%), Gaps = 1/115 (0%)
 Frame = -1

Query: 531 EDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKK-FNYLIWL 355
           E + HLFI+CS    +W  V  W+   + + +N  DHFL    +  G  +++ F  LIWL
Sbjct: 104 ETAQHLFISCSIFGSLWSSVRSWIGFSSVDPHNLTDHFLQFTFSSGGLSVRRSFLQLIWL 163

Query: 354 STCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSWW 190
              C W I   RN  +FRN E ++  ++  +K L   W++     +   +  SWW
Sbjct: 164 --VCVWVIWNERNQRLFRNSEQSLPQLLDKVK-LYSYWWLKTTNINLVSNYHSWW 215


>dbj|GAU34177.1| hypothetical protein TSUD_162770 [Trifolium subterraneum]
          Length = 800

 Score = 64.7 bits (156), Expect = 8e-09
 Identities = 36/115 (31%), Positives = 57/115 (49%), Gaps = 1/115 (0%)
 Frame = -1

Query: 531  EDSIHLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKK-FNYLIWL 355
            E + HLFI+CS    +W  V  W+D  + + +N  DHFL    +  G  +++ F  L WL
Sbjct: 679  ETAQHLFISCSIFGSLWSSVRSWIDFSSVDPHNLTDHFLQFTFSSGGLSVRRSFLQLTWL 738

Query: 354  STCCCWCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYRMGNSYNFSVTSWW 190
               C W I   RN  +FRN E ++  ++  +K L   W++     +   +  SWW
Sbjct: 739  G--CVWVIWNERNQRLFRNSEQSLPQLLDKVK-LYSYWWLKTTNINLVSNYHSWW 790


>gb|PNX69227.1| hypothetical protein L195_g056601 [Trifolium pratense]
          Length = 120

 Score = 60.8 bits (146), Expect = 1e-08
 Identities = 33/97 (34%), Positives = 51/97 (52%)
 Frame = -1

Query: 519 HLFITCSFAREVWKQVLLWLDLQAEELYNCVDHFLWIGEACKGQKLKKFNYLIWLSTCCC 340
           HLF+ C+ A +VW Q++ WL L      N V  +  +    K ++ ++   LIW S    
Sbjct: 8   HLFLHCTIASKVWYQIMSWLGLVVIVPQNLVTSYGMLVGCGKDKRNRECLALIWNSLM-- 65

Query: 339 WCIRLYRNGLVFRNEEANIASVVTHIKGLSGSWFIYR 229
           W I  +RN  +F N+EA +  +V  +K LS  WF+ R
Sbjct: 66  WVIWRFRNDCIFNNKEATVEEMVDEVKLLSWKWFMGR 102

