BLASTX nr result

ID: Astragalus22_contig00030842 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00030842
         (372 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KHN04350.1| hypothetical protein glysoja_030944 [Glycine soja]      84   4e-17
gb|KHN10514.1| hypothetical protein glysoja_040669, partial [Gly...    80   1e-16
gb|KHN03945.1| hypothetical protein glysoja_022631, partial [Gly...    76   2e-15
gb|PNY05892.1| ribonuclease H [Trifolium pratense]                     80   4e-15
gb|KHN35644.1| hypothetical protein glysoja_030996 [Glycine soja]      77   4e-15
gb|KRH35933.1| hypothetical protein GLYMA_10G272900 [Glycine max]      69   8e-13
gb|PNY01502.1| ribonuclease H [Trifolium pratense]                     72   5e-12
gb|PNX71113.1| pantothenate synthetase [Trifolium pratense]            69   2e-11
gb|KYP55738.1| hypothetical protein KK1_001963 [Cajanus cajan]         67   4e-11
dbj|GAU26446.1| hypothetical protein TSUD_294120 [Trifolium subt...    69   5e-11
dbj|GAU42970.1| hypothetical protein TSUD_188430 [Trifolium subt...    68   9e-11
gb|KHN39553.1| hypothetical protein glysoja_045723, partial [Gly...    66   1e-10
gb|PNX84435.1| hypothetical protein L195_g040495, partial [Trifo...    66   2e-10
dbj|GAU46742.1| hypothetical protein TSUD_286020 [Trifolium subt...    66   3e-10
gb|KYP39460.1| hypothetical protein KK1_039239 [Cajanus cajan]         65   3e-10
ref|XP_022024524.1| uncharacterized protein LOC110924846 [Helian...    65   4e-10
ref|XP_020203066.1| uncharacterized protein LOC109788691 [Cajanu...    65   5e-10
gb|PNX95738.1| ribonuclease H, partial [Trifolium pratense]            66   6e-10
dbj|GAU48811.1| hypothetical protein TSUD_406440 [Trifolium subt...    65   7e-10
gb|KHN41375.1| Putative ribonuclease H protein, partial [Glycine...    65   9e-10

>gb|KHN04350.1| hypothetical protein glysoja_030944 [Glycine soja]
          Length = 229

 Score = 83.6 bits (205), Expect = 4e-17
 Identities = 43/109 (39%), Positives = 59/109 (54%)
 Frame = -3

Query: 364 DAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGS 185
           D +C  C    E  +H FL C  A  IWQQV   L V    E   +Q H    G L KG 
Sbjct: 97  DRRCVLCSSEDETVKHIFLDCRVAKKIWQQVCLWLDVPV-VEGEDIQAHFMAFGKLIKGK 155

Query: 184 KERKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*FI 38
           K++++K+LIW+   W IWL+ N ++FK E A I  +I  IK  +W+ F+
Sbjct: 156 KQKRVKHLIWMAVIWNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAWFM 204


>gb|KHN10514.1| hypothetical protein glysoja_040669, partial [Glycine soja]
          Length = 132

 Score = 80.1 bits (196), Expect = 1e-16
 Identities = 35/98 (35%), Positives = 54/98 (55%)
 Frame = -3

Query: 367 EDAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKG 188
           +D  C FC    ED +H F  CNFA  +W  +Y  LG     + N ++ H    G +C+G
Sbjct: 36  DDVGCVFCEHDWEDVDHLFPGCNFAYNVWIAIYSWLGFV-MIQHNQVKYHYVQHGLVCRG 94

Query: 187 SKERKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLI 74
            +  K+ + IW  TCWC+WL  N I+F+ E A++  ++
Sbjct: 95  KRLSKVCHFIWHATCWCLWLHRNRIIFQEEQADVQLVV 132


>gb|KHN03945.1| hypothetical protein glysoja_022631, partial [Glycine soja]
          Length = 114

 Score = 76.3 bits (186), Expect = 2e-15
 Identities = 35/94 (37%), Positives = 56/94 (59%)
 Frame = -3

Query: 313 FLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGSKERKIKYLIWLTTCWCI 134
           F+ C FA  +WQ +   LG + +   N +QE    LG   +G K+R+ K+L+W  TCW I
Sbjct: 2   FVFCPFAKQVWQGILNWLGYSFSLP-NNIQELYLQLGMNIRGKKKRRFKHLLWHNTCWSI 60

Query: 133 WLSLNGIVFKNE*ANIDSLIVRIKSLSWS*FIYK 32
           W   N ++F+N   ++++ I+ IKS+SW   +YK
Sbjct: 61  WCHRNNVIFRNAEVDVNNTILFIKSMSWQWVLYK 94


>gb|PNY05892.1| ribonuclease H [Trifolium pratense]
          Length = 455

 Score = 80.5 bits (197), Expect = 4e-15
 Identities = 39/104 (37%), Positives = 54/104 (51%)
 Frame = -3

Query: 349 FCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGSKERKI 170
           FC    EDC H F  C     +W+ +Y  LG A    +    +H  + G + K  K  K+
Sbjct: 328 FCFTEIEDCMHLFFNCKLMQQVWRSIYKWLGCA-YYNYGEGWKHFNFFGGIVKSKKGEKV 386

Query: 169 KYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*FI 38
           K+LIWL T WCIW   N I+F+   A+   L+ +IK +SW  FI
Sbjct: 387 KHLIWLVTTWCIWRLRNNIIFRGALADCAQLVDQIKLISWVWFI 430


>gb|KHN35644.1| hypothetical protein glysoja_030996 [Glycine soja]
          Length = 160

 Score = 76.6 bits (187), Expect = 4e-15
 Identities = 41/108 (37%), Positives = 56/108 (51%)
 Frame = -3

Query: 364 DAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGS 185
           D +   C    E  +H  L C  A  IWQQV   L V    E   +Q H    G L KG 
Sbjct: 28  DRRYVLCSSEDETVKHILLDCRVAKKIWQQVCLWLDVPV-VEGEDIQAHFMAFGKLIKGK 86

Query: 184 KERKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*F 41
           K++++K+LIW+   W IWL+ N ++FK E A I  +I  IK  +W+ F
Sbjct: 87  KQKRVKHLIWMAVIWNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAWF 134


>gb|KRH35933.1| hypothetical protein GLYMA_10G272900 [Glycine max]
          Length = 79

 Score = 68.6 bits (166), Expect = 8e-13
 Identities = 32/81 (39%), Positives = 44/81 (54%), Gaps = 1/81 (1%)
 Frame = -3

Query: 343 MQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEF-NTLQEHCYWLGTLCKGSKERKIK 167
           M+  ED  H F  C   + IW+QV   +GV  N     T  +     G L  G K R++K
Sbjct: 1   MEEEEDVHHLFYACQVTASIWRQVIAWVGV--NLVMPQTFGQLFKQFGGLLPGRKSRRVK 58

Query: 166 YLIWLTTCWCIWLSLNGIVFK 104
           +++W  TCWC+WLS N I+FK
Sbjct: 59  HILWHATCWCVWLSRNAIIFK 79


>gb|PNY01502.1| ribonuclease H [Trifolium pratense]
          Length = 554

 Score = 71.6 bits (174), Expect = 5e-12
 Identities = 34/108 (31%), Positives = 56/108 (51%)
 Frame = -3

Query: 364 DAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGS 185
           +  C FC +HRED  H F  C F+  +W+ V   LG++   +   + +H    G L K  
Sbjct: 422 ELSCVFCFRHREDGAHLFFSCYFSKVVWRNVLKWLGLSIPMDVEGI-DHFMLFGDLFKVK 480

Query: 184 KERKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*F 41
            + ++++L+WL T W +W   N ++FK +     +L+  IK  SW  F
Sbjct: 481 DKGRVRHLVWLATTWNLWKLRNKVIFKGDIPETSALLDSIKLSSWMWF 528


>gb|PNX71113.1| pantothenate synthetase [Trifolium pratense]
          Length = 255

 Score = 68.6 bits (166), Expect = 2e-11
 Identities = 32/105 (30%), Positives = 52/105 (49%)
 Frame = -3

Query: 364 DAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGS 185
           +  C FC  HRED  H F  C F+  +W+ V   LG++   +   + +H    G   K  
Sbjct: 123 ELSCVFCFWHREDGAHLFFSCYFSKVVWRNVLKWLGLSSPLDVEGI-DHFLLFGEFFKVK 181

Query: 184 KERKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSW 50
            +  +++L+WL T W +W   N ++FK +  +   L+  IK  SW
Sbjct: 182 DKGHVRHLVWLATTWNLWKMRNKVIFKGDIPDTAVLLDSIKLFSW 226


>gb|KYP55738.1| hypothetical protein KK1_001963 [Cajanus cajan]
          Length = 186

 Score = 67.0 bits (162), Expect = 4e-11
 Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 1/117 (0%)
 Frame = -3

Query: 355 CPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWL-GTLCKGSKE 179
           CPFC    E  +H FL C F+  +W  V+   G+    E  +   H ++L  ++     +
Sbjct: 57  CPFCSTTLESSQHLFLECEFSRNVWHNVFTWTGI--RLELPSSLGHLFFLLRSMFLDKVK 114

Query: 178 RKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*FIYKVVCNSNFS 8
           RK + + W  T W +W + N IVF+N+  +      +IK +SW  ++Y+  C   F+
Sbjct: 115 RKWRDIFWHATIWVLWTNRNEIVFRNKTVSHFDFPYQIKIISWHWWMYRNGCRPGFT 171


>dbj|GAU26446.1| hypothetical protein TSUD_294120 [Trifolium subterraneum]
          Length = 333

 Score = 68.6 bits (166), Expect = 5e-11
 Identities = 40/120 (33%), Positives = 58/120 (48%), Gaps = 1/120 (0%)
 Frame = -3

Query: 370 QEDAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCK 191
           +EDA CP C +  E   H FL C FAS +W +V   LG       + +  H   +G  C 
Sbjct: 201 REDALCPTCGETIETVRHLFLHCRFASAVWYRVNRWLGTMVVIPHDIIMSHGLLVG--CG 258

Query: 190 GSKE-RKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*FIYKVVCNSN 14
           G+K+ RK   ++WL   W IW   N  VF N    ++  +  I+ LSW  ++ K    S+
Sbjct: 259 GNKKVRKGYSIVWLAFVWVIWRFRNDRVFNNINGEVEDAMDSIQRLSWQWYLLKTAKGSS 318


>dbj|GAU42970.1| hypothetical protein TSUD_188430 [Trifolium subterraneum]
          Length = 767

 Score = 68.2 bits (165), Expect = 9e-11
 Identities = 34/105 (32%), Positives = 53/105 (50%)
 Frame = -3

Query: 367 EDAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKG 188
           +D  C FC  + ED  H F  C+    +W++V+   G +   E +    H    G+L K 
Sbjct: 633 QDLHCVFCSSYDEDSAHLFFHCSVLKRVWEEVFKWFGKSYQAEADGWN-HFNIFGSLLKT 691

Query: 187 SKERKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLS 53
            +  K+++LIWL T W IW   N +VF     +  SL+  IK++S
Sbjct: 692 KRFEKVRHLIWLATTWSIWKLRNNVVFNGVTLSSSSLVNDIKTIS 736


>gb|KHN39553.1| hypothetical protein glysoja_045723, partial [Glycine soja]
          Length = 211

 Score = 65.9 bits (159), Expect = 1e-10
 Identities = 35/121 (28%), Positives = 58/121 (47%), Gaps = 2/121 (1%)
 Frame = -3

Query: 364 DAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGS 185
           +++C  C    E+  H F  C+F+  IW+++   +G+        +Q    +   L   +
Sbjct: 82  NSRCSLCDSCDENVVHLFFHCDFSKCIWKEILSWIGIVDVIAVGGVQHFWEYDRLLKYNT 141

Query: 184 KERKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*FIYKV--VCNSNF 11
              K+ ++ WL T W IW   N  +FK E  +I   I +IK + W+ F+ KV  V  SN 
Sbjct: 142 SRNKVPFMFWLATLWIIWQVRNNSIFKEEEKDIPKTINQIKHICWAWFMGKVGGVTGSNI 201

Query: 10  S 8
           S
Sbjct: 202 S 202


>gb|PNX84435.1| hypothetical protein L195_g040495, partial [Trifolium pratense]
          Length = 248

 Score = 65.9 bits (159), Expect = 2e-10
 Identities = 33/99 (33%), Positives = 47/99 (47%), Gaps = 1/99 (1%)
 Frame = -3

Query: 343 MQHR-EDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGSKERKIK 167
           ++HR EDC H F  C F+ G+W+ VY  LG+           H      +    K  +++
Sbjct: 119 LEHRQEDCSHLFFHCAFSKGVWESVYRWLGMKSISAGAEGWNHFLLFDDMITAKKGERVR 178

Query: 166 YLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSW 50
           +L WL T W IW   N +VF     N  SL+  I + SW
Sbjct: 179 HLFWLATTWNIWKLRNNVVFNGVIPNASSLVEDIIANSW 217


>dbj|GAU46742.1| hypothetical protein TSUD_286020 [Trifolium subterraneum]
          Length = 258

 Score = 65.9 bits (159), Expect = 3e-10
 Identities = 35/119 (29%), Positives = 59/119 (49%), Gaps = 1/119 (0%)
 Frame = -3

Query: 367 EDAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKG 188
           ED +C  C +  E   H FL C F + +W  V   LGV      + +  +   +G  C G
Sbjct: 127 EDTRCSLCGELAETSCHLFLHCRFVAAVWYAVIKWLGVVVVLPADPIMSYGILVG--CGG 184

Query: 187 SKE-RKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*FIYKVVCNSN 14
           +K+ RK   ++W+   W +W   N  VF N   +++ +I RI+ +SW  +++K    S+
Sbjct: 185 NKKIRKSLSIVWMDFVWVLWRVRNDRVFNNVDGSVEDVIERIQRISWQWYLHKTTMGSS 243


>gb|KYP39460.1| hypothetical protein KK1_039239 [Cajanus cajan]
          Length = 223

 Score = 65.1 bits (157), Expect = 3e-10
 Identities = 31/102 (30%), Positives = 50/102 (49%)
 Frame = -3

Query: 355 CPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGSKER 176
           C FC    ED  H FL C+ AS +W  +   LG + +C  ++++     +     G K  
Sbjct: 94  CVFCNNREEDVTHLFLHCDVASSVWYSLCRWLGFS-SCLSDSVEGQLLTMSGFVAGKKPA 152

Query: 175 KIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSW 50
           ++   IW+ T W +WL  N IVF N   ++  ++  +K  SW
Sbjct: 153 RVVVTIWVATVWSLWLHRNKIVFNNGVCDVLEVVESVKYRSW 194


>ref|XP_022024524.1| uncharacterized protein LOC110924846 [Helianthus annuus]
          Length = 203

 Score = 64.7 bits (156), Expect = 4e-10
 Identities = 34/100 (34%), Positives = 52/100 (52%)
 Frame = -3

Query: 349 FCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGSKERKI 170
           FC ++ E C+H F+ C+FA  +WQ V   L +     F    +    L  L  GS+ +K+
Sbjct: 81  FCGEYVESCDHIFVSCHFAQMVWQNVARWLRIQSIIAFGI--QDLLTLHGLSSGSRRKKV 138

Query: 169 KYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSW 50
            + I L   WCIW + N IVF+N   N    +  IKS+++
Sbjct: 139 IHAIVLVVLWCIWKTRNDIVFRNVVPNYARTLDEIKSMAF 178


>ref|XP_020203066.1| uncharacterized protein LOC109788691 [Cajanus cajan]
          Length = 265

 Score = 65.1 bits (157), Expect = 5e-10
 Identities = 31/102 (30%), Positives = 50/102 (49%)
 Frame = -3

Query: 355 CPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGSKER 176
           C FC    ED  H FL C+ AS +W  +   LG + +C  ++++     +     G K  
Sbjct: 136 CVFCNNREEDVTHLFLHCDVASSVWYSLCRWLGFS-SCLSDSVEGQLLTMSGFVAGKKPA 194

Query: 175 KIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSW 50
           ++   IW+ T W +WL  N IVF N   ++  ++  +K  SW
Sbjct: 195 RVVVTIWVATVWSLWLHRNKIVFNNGVCDVLEVVESVKYRSW 236


>gb|PNX95738.1| ribonuclease H, partial [Trifolium pratense]
          Length = 1375

 Score = 65.9 bits (159), Expect = 6e-10
 Identities = 35/102 (34%), Positives = 52/102 (50%)
 Frame = -3

Query: 355  CPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGSKER 176
            C  C+   E   H FL C+FAS IW +++  LG+      N  Q    ++G    G K R
Sbjct: 1248 CAICVGVDESSVHLFLHCDFASCIWYEIFRWLGLVIVLPANLFQCFDSFIGAAV-GKKCR 1306

Query: 175  KIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSW 50
            K+  +IW T  W IW + N ++F N    ++ ++  IK LSW
Sbjct: 1307 KMFRMIWHTIVWLIWKNRNDVIFSNSSKEVNEVVDDIKQLSW 1348


>dbj|GAU48811.1| hypothetical protein TSUD_406440 [Trifolium subterraneum]
          Length = 563

 Score = 65.5 bits (158), Expect = 7e-10
 Identities = 36/112 (32%), Positives = 57/112 (50%), Gaps = 1/112 (0%)
 Frame = -3

Query: 370 QEDAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCK 191
           +E   CP C + RE   H FL C  A+ IW  +   LGV         Q +  ++G  C 
Sbjct: 431 EEVVFCPLCEEERETSCHLFLHCRVAASIWYGLTRWLGVVVVLPPLVAQSYAGFVG--CG 488

Query: 190 GSKERKIKY-LIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*FI 38
            +K+RK  + ++WL   W +W + N  VF N+   ++ ++V I+ LSW  F+
Sbjct: 489 SNKKRKKGFSIVWLAFVWALWQARNDRVFNNKEVKVEEVVVYIQRLSWRWFL 540


>gb|KHN41375.1| Putative ribonuclease H protein, partial [Glycine soja]
          Length = 363

 Score = 65.1 bits (157), Expect = 9e-10
 Identities = 35/121 (28%), Positives = 58/121 (47%), Gaps = 2/121 (1%)
 Frame = -3

Query: 364 DAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGS 185
           +++C  C    E+  H F  C+F+  IW+++   +G+        +Q    +   L   +
Sbjct: 234 NSRCSLCDSCDENVVHLFFHCDFSKCIWKEILSWIGIVDVIAVGGVQHFWEYDRLLKYNT 293

Query: 184 KERKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*FIYKV--VCNSNF 11
              K+ ++ WL T W IW   N  +FK E  +I   I +IK + W+ F+ KV  V  SN 
Sbjct: 294 SRNKVPFMFWLATLWIIWQVRNNSIFKEEEKDIPKTINQIKHICWAWFMGKVGGVGGSNI 353

Query: 10  S 8
           S
Sbjct: 354 S 354

