BLASTX nr result

ID: Astragalus23_contig00030177 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00030177
         (407 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNY01502.1| ribonuclease H [Trifolium pratense]                     99   1e-21
gb|PNX71113.1| pantothenate synthetase [Trifolium pratense]            96   2e-21
gb|KHN04350.1| hypothetical protein glysoja_030944 [Glycine soja]      92   9e-21
gb|KYP39460.1| hypothetical protein KK1_039239 [Cajanus cajan]         91   8e-20
ref|XP_020203066.1| uncharacterized protein LOC109788691 [Cajanu...    91   2e-19
gb|KHN35644.1| hypothetical protein glysoja_030996 [Glycine soja]      84   6e-18
gb|PNY05892.1| ribonuclease H [Trifolium pratense]                     87   2e-17
gb|KHN03945.1| hypothetical protein glysoja_022631, partial [Gly...    82   2e-17
gb|AFK37936.1| unknown [Lotus japonicus]                               82   4e-17
gb|KYP46173.1| Putative ribonuclease H protein At1g65750 family,...    86   5e-17
gb|KYP55738.1| hypothetical protein KK1_001963 [Cajanus cajan]         82   1e-16
gb|KYP44105.1| Putative ribonuclease H protein At1g65750 family ...    84   3e-16
gb|KYP34033.1| Putative ribonuclease H protein At1g65750 family,...    84   4e-16
gb|KYP59667.1| Putative ribonuclease H protein At1g65750 family ...    83   5e-16
ref|XP_022030815.1| uncharacterized protein LOC110931740 [Helian...    79   1e-15
dbj|GAU48210.1| hypothetical protein TSUD_404970 [Trifolium subt...    82   1e-15
dbj|GAU42970.1| hypothetical protein TSUD_188430 [Trifolium subt...    79   3e-15
ref|XP_020239954.1| uncharacterized protein LOC109818836 [Cajanu...    78   1e-14
gb|PNX84435.1| hypothetical protein L195_g040495, partial [Trifo...    77   2e-14
ref|XP_021996043.1| uncharacterized protein LOC110893235 [Helian...    75   2e-14

>gb|PNY01502.1| ribonuclease H [Trifolium pratense]
          Length = 554

 Score = 99.4 bits (246), Expect = 1e-21
 Identities = 40/108 (37%), Positives = 66/108 (61%)
 Frame = -1

Query: 407 PYDLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQG 228
           P++LSCVFCFR  E   H+F  C F + VWR V  WL ++  +  + ++ F+      + 
Sbjct: 420 PFELSCVFCFRHREDGAHLFFSCYFSKVVWRNVLKWLGLSIPMDVEGIDHFMLFGDLFKV 479

Query: 227 KSARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84
           K   R++H+ W+AT W LW +RN++IF+G +   ++++ S+K  SW W
Sbjct: 480 KDKGRVRHLVWLATTWNLWKLRNKVIFKGDIPETSALLDSIKLSSWMW 527


>gb|PNX71113.1| pantothenate synthetase [Trifolium pratense]
          Length = 255

 Score = 95.5 bits (236), Expect = 2e-21
 Identities = 40/108 (37%), Positives = 64/108 (59%)
 Frame = -1

Query: 407 PYDLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQG 228
           P++LSCVFCF   E   H+F  C F + VWR V  WL ++  L  + ++ FL   +  + 
Sbjct: 121 PFELSCVFCFWHREDGAHLFFSCYFSKVVWRNVLKWLGLSSPLDVEGIDHFLLFGEFFKV 180

Query: 227 KSARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84
           K    ++H+ W+AT W LW +RN++IF+G + +   ++ S+K  SW W
Sbjct: 181 KDKGHVRHLVWLATTWNLWKMRNKVIFKGDIPDTAVLLDSIKLFSWIW 228


>gb|KHN04350.1| hypothetical protein glysoja_030944 [Glycine soja]
          Length = 229

 Score = 92.0 bits (227), Expect(2) = 9e-21
 Identities = 38/109 (34%), Positives = 62/109 (56%)
 Frame = -1

Query: 401 DLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGKS 222
           D  CV C  E E   H+F  C   + +W+ V LWL+V     +D+   F+   K I+GK 
Sbjct: 97  DRRCVLCSSEDETVKHIFLDCRVAKKIWQQVCLWLDVPVVEGEDIQAHFMAFGKLIKGKK 156

Query: 221 ARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSWSVA 75
            +R+KH+ W+A +W +WL RN++IF+     +  +I  +K  +W+W +A
Sbjct: 157 QKRVKHLIWMAVIWNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAWFMA 205



 Score = 35.8 bits (81), Expect(2) = 9e-21
 Identities = 14/25 (56%), Positives = 18/25 (72%)
 Frame = -3

Query: 75  SRKGRNSGVTFSDWCNCPMGCIASL 1
           +R+GR     +SDW NCPMGC+ SL
Sbjct: 205 ARQGRTCWDGWSDWYNCPMGCLLSL 229


>gb|KYP39460.1| hypothetical protein KK1_039239 [Cajanus cajan]
          Length = 223

 Score = 90.9 bits (224), Expect = 8e-20
 Identities = 43/106 (40%), Positives = 55/106 (51%)
 Frame = -1

Query: 392 CVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGKSARR 213
           CVFC    E   H+F  C    SVW  +  WL  +  L D V    L  S  + GK   R
Sbjct: 94  CVFCNNREEDVTHLFLHCDVASSVWYSLCRWLGFSSCLSDSVEGQLLTMSGFVAGKKPAR 153

Query: 212 IKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSWSVA 75
           +    WVAT+W LWL RN+I+F  GV +V  V+ S+K+ SW W  A
Sbjct: 154 VVVTIWVATVWSLWLHRNKIVFNNGVCDVLEVVESVKYRSWKWLTA 199


>ref|XP_020203066.1| uncharacterized protein LOC109788691 [Cajanus cajan]
          Length = 265

 Score = 90.9 bits (224), Expect = 2e-19
 Identities = 43/106 (40%), Positives = 55/106 (51%)
 Frame = -1

Query: 392 CVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGKSARR 213
           CVFC    E   H+F  C    SVW  +  WL  +  L D V    L  S  + GK   R
Sbjct: 136 CVFCNNREEDVTHLFLHCDVASSVWYSLCRWLGFSSCLSDSVEGQLLTMSGFVAGKKPAR 195

Query: 212 IKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSWSVA 75
           +    WVAT+W LWL RN+I+F  GV +V  V+ S+K+ SW W  A
Sbjct: 196 VVVTIWVATVWSLWLHRNKIVFNNGVCDVLEVVESVKYRSWKWLTA 241


>gb|KHN35644.1| hypothetical protein glysoja_030996 [Glycine soja]
          Length = 160

 Score = 84.3 bits (207), Expect = 6e-18
 Identities = 35/106 (33%), Positives = 58/106 (54%)
 Frame = -1

Query: 401 DLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGKS 222
           D   V C  E E   H+   C   + +W+ V LWL+V     +D+   F+   K I+GK 
Sbjct: 28  DRRYVLCSSEDETVKHILLDCRVAKKIWQQVCLWLDVPVVEGEDIQAHFMAFGKLIKGKK 87

Query: 221 ARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84
            +R+KH+ W+A +W +WL RN++IF+     +  +I  +K  +W+W
Sbjct: 88  QKRVKHLIWMAVIWNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAW 133


>gb|PNY05892.1| ribonuclease H [Trifolium pratense]
          Length = 455

 Score = 87.4 bits (215), Expect(2) = 2e-17
 Identities = 35/110 (31%), Positives = 59/110 (53%)
 Frame = -1

Query: 407 PYDLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQG 228
           P+DL  VFCF E+E   H+F  C  ++ VWR ++ WL        +    F      ++ 
Sbjct: 321 PHDLPRVFCFTEIEDCMHLFFNCKLMQQVWRSIYKWLGCAYYNYGEGWKHFNFFGGIVKS 380

Query: 227 KSARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSWSV 78
           K   ++KH+ W+ T W +W +RN IIF+G + +   ++  +K +SW W +
Sbjct: 381 KKGEKVKHLIWLVTTWCIWRLRNNIIFRGALADCAQLVDQIKLISWVWFI 430



 Score = 29.3 bits (64), Expect(2) = 2e-17
 Identities = 11/24 (45%), Positives = 15/24 (62%)
 Frame = -3

Query: 72  RKGRNSGVTFSDWCNCPMGCIASL 1
           R GR+    +SDWC  P+ CI S+
Sbjct: 432 RSGRHCPYLYSDWCVNPLECILSM 455


>gb|KHN03945.1| hypothetical protein glysoja_022631, partial [Glycine soja]
          Length = 114

 Score = 81.6 bits (200), Expect = 2e-17
 Identities = 34/98 (34%), Positives = 54/98 (55%)
 Frame = -1

Query: 353 VFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGKSARRIKHIFWVATMWQL 174
           +F  C F + VW+G+  WL  +  L +++   +LQ    I+GK  RR KH+ W  T W +
Sbjct: 1   LFVFCPFAKQVWQGILNWLGYSFSLPNNIQELYLQLGMNIRGKKKRRFKHLLWHNTCWSI 60

Query: 173 WLVRNRIIFQGGVTNVNSVIYSLKFVSWSWSVALGRGE 60
           W  RN +IF+    +VN+ I  +K +SW W +    G+
Sbjct: 61  WCHRNNVIFRNAEVDVNNTILFIKSMSWQWVLYKSSGK 98


>gb|AFK37936.1| unknown [Lotus japonicus]
          Length = 138

 Score = 81.6 bits (200), Expect = 4e-17
 Identities = 39/105 (37%), Positives = 55/105 (52%)
 Frame = -1

Query: 398 LSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGKSA 219
           L+C FC  + E +DH+FC C F  ++WR V  W  V+  L   V   F+Q     +  S 
Sbjct: 8   LACSFCQLQDETSDHLFCTCAFSMAIWRMVLGWFGVSIALPSLVKALFVQFPVFGRCSSK 67

Query: 218 RRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84
           R      W+AT W LWL+RNR+IF  G  +   V+  ++  SW W
Sbjct: 68  REALVTVWMATCWSLWLMRNRVIFDNGELDTGLVLDLIQVRSWHW 112


>gb|KYP46173.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 353

 Score = 85.5 bits (210), Expect = 5e-17
 Identities = 41/113 (36%), Positives = 57/113 (50%), Gaps = 1/113 (0%)
 Frame = -1

Query: 392 CVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGKSARR 213
           CVFC    E   H+F  C    SVW  +  WL  +  L   V    L  S  + GK   R
Sbjct: 224 CVFCNNREEDVTHLFLHCDVASSVWYSLCRWLGFSSCLSGSVEGQLLTMSGFVVGKKPAR 283

Query: 212 IKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW-SVALGRGEI 57
           +    WVAT+W LWL RN+++F  GV +V  V+ S K+ +W W ++ +  G I
Sbjct: 284 VVVTIWVATVWSLWLHRNKMVFNNGVCDVLEVVESAKYRAWKWLTIGMSHGSI 336


>gb|KYP55738.1| hypothetical protein KK1_001963 [Cajanus cajan]
          Length = 186

 Score = 81.6 bits (200), Expect = 1e-16
 Identities = 37/105 (35%), Positives = 52/105 (49%)
 Frame = -1

Query: 398 LSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGKSA 219
           LSC FC   +E + H+F  C F R+VW  VF W  +   L   + + F         K  
Sbjct: 55  LSCPFCSTTLESSQHLFLECEFSRNVWHNVFTWTGIRLELPSSLGHLFFLLRSMFLDKVK 114

Query: 218 RRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84
           R+ + IFW AT+W LW  RN I+F+    +     Y +K +SW W
Sbjct: 115 RKWRDIFWHATIWVLWTNRNEIVFRNKTVSHFDFPYQIKIISWHW 159


>gb|KYP44105.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 346

 Score = 83.6 bits (205), Expect = 3e-16
 Identities = 36/107 (33%), Positives = 51/107 (47%)
 Frame = -1

Query: 404 YDLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGK 225
           ++  C+FC  ++E   H+FC C  V  VW+    WLN    L   + + F      IQ +
Sbjct: 211 HESRCIFCKVDIETTTHIFCTCPMVDRVWKQCLSWLNCPAPLPRQIFDHFSFLPAPIQNE 270

Query: 224 SARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84
           S     H  W+ATMW LW  RN+ IF GG     S++  +    W W
Sbjct: 271 SEAETWHTVWLATMWTLWRYRNKCIFDGGTFEQGSIVRDILIFCWRW 317


>gb|KYP34033.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 443

 Score = 83.6 bits (205), Expect = 4e-16
 Identities = 36/107 (33%), Positives = 51/107 (47%)
 Frame = -1

Query: 404 YDLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGK 225
           ++  C+FC  ++E   H+FC C  V  VW+    WLN    L   + + F      IQ +
Sbjct: 308 HESRCIFCKVDIETTTHIFCTCPMVDRVWKQCLSWLNCPAPLPRQIFDHFSFLPAPIQNE 367

Query: 224 SARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84
           S     H  W+ATMW LW  RN+ IF GG     S++  +    W W
Sbjct: 368 SEAETWHTVWLATMWTLWRYRNKCIFDGGTFEQGSIVRDILIFCWRW 414


>gb|KYP59667.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 409

 Score = 83.2 bits (204), Expect = 5e-16
 Identities = 36/107 (33%), Positives = 51/107 (47%)
 Frame = -1

Query: 404 YDLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGK 225
           ++  C+FC  ++E   H+FC C  V  VW+    WLN    L   + + F      IQ +
Sbjct: 274 HESKCIFCKVDIETTTHIFCTCPMVDRVWKQCLSWLNCPAPLPRQIFDHFSFLPAPIQNE 333

Query: 224 SARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84
           S     H  W+ATMW LW  RN+ IF GG     S++  +    W W
Sbjct: 334 SEAETWHTVWLATMWTLWRYRNKCIFVGGTFEQGSIVRDILIFCWRW 380


>ref|XP_022030815.1| uncharacterized protein LOC110931740 [Helianthus annuus]
          Length = 157

 Score = 78.6 bits (192), Expect = 1e-15
 Identities = 35/99 (35%), Positives = 55/99 (55%), Gaps = 1/99 (1%)
 Frame = -1

Query: 395 SCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQ-GKSA 219
           +CV C ++VE ADH+   C     VW  + LW+N+  GL    ++  LQS   +   K+ 
Sbjct: 31  TCVLCEQDVESADHILLNCRVAEEVWHRLSLWMNIPPGLNQSTVDEMLQSVNGLNVSKNR 90

Query: 218 RRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLK 102
           +RI H  ++ TMW +W  RNR IF+G + N   ++  +K
Sbjct: 91  KRIIHAIYIITMWSIWKARNRKIFEGIIVNRYKLVEDIK 129


>dbj|GAU48210.1| hypothetical protein TSUD_404970 [Trifolium subterraneum]
          Length = 1653

 Score = 82.4 bits (202), Expect = 1e-15
 Identities = 35/108 (32%), Positives = 56/108 (51%)
 Frame = -1

Query: 407  PYDLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQG 228
            P DL CVFC    E   H+F  C FV  VW  V+ W+  +     +  + F      +  
Sbjct: 1518 PQDLPCVFCSVFYEDCVHLFFHCSFVNCVWEAVYNWIGKDYHAGAEGWSHFKVFGDMVNS 1577

Query: 227  KSARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84
             +  R++H+ W+AT W LW +RN +IF G   + +S++  +K +S +W
Sbjct: 1578 TNIERVRHLIWLATTWNLWKLRNNVIFNGATPSASSLLNDIKAISCAW 1625


>dbj|GAU42970.1| hypothetical protein TSUD_188430 [Trifolium subterraneum]
          Length = 767

 Score = 78.6 bits (192), Expect(2) = 3e-15
 Identities = 34/108 (31%), Positives = 56/108 (51%)
 Frame = -1

Query: 407 PYDLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQG 228
           P DL CVFC    E + H+F  C  ++ VW  VF W   +     D  N F      ++ 
Sbjct: 632 PQDLHCVFCSSYDEDSAHLFFHCSVLKRVWEEVFKWFGKSYQAEADGWNHFNIFGSLLKT 691

Query: 227 KSARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84
           K   +++H+ W+AT W +W +RN ++F G   + +S++  +K +S  W
Sbjct: 692 KRFEKVRHLIWLATTWSIWKLRNNVVFNGVTLSSSSLVNDIKTISCLW 739



 Score = 30.4 bits (67), Expect(2) = 3e-15
 Identities = 12/26 (46%), Positives = 15/26 (57%)
 Frame = -3

Query: 78  SSRKGRNSGVTFSDWCNCPMGCIASL 1
           S R G  S ++F DWC  PM C  S+
Sbjct: 741 SGRYGHISSISFPDWCFDPMTCFQSI 766


>ref|XP_020239954.1| uncharacterized protein LOC109818836 [Cajanus cajan]
 gb|KYP41807.1| hypothetical protein KK1_036829 [Cajanus cajan]
          Length = 237

 Score = 77.8 bits (190), Expect = 1e-14
 Identities = 38/95 (40%), Positives = 48/95 (50%)
 Frame = -1

Query: 392 CVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGKSARR 213
           CVFC    E   H+F  C    SVW  +  WL  +  L D V    L  S  + GK   R
Sbjct: 136 CVFCNNREEDVTHLFLHCDVASSVWYSLCRWLGFSSCLSDSVEGQLLTMSGFVAGKKPAR 195

Query: 212 IKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYS 108
           +    WVAT+W LWL RN+I+F  GV +V  V+ S
Sbjct: 196 VVVTIWVATVWSLWLHRNKIVFNNGVCDVLEVVES 230


>gb|PNX84435.1| hypothetical protein L195_g040495, partial [Trifolium pratense]
          Length = 248

 Score = 77.4 bits (189), Expect = 2e-14
 Identities = 32/96 (33%), Positives = 50/96 (52%), Gaps = 1/96 (1%)
 Frame = -1

Query: 368 EVADHVFCRCVFVRSVWRGVFLWLNVNE-GLCDDVLNTFLQSSKCIQGKSARRIKHIFWV 192
           E   H+F  C F + VW  V+ WL +       +  N FL     I  K   R++H+FW+
Sbjct: 124 EDCSHLFFHCAFSKGVWESVYRWLGMKSISAGAEGWNHFLLFDDMITAKKGERVRHLFWL 183

Query: 191 ATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84
           AT W +W +RN ++F G + N +S++  +   SW W
Sbjct: 184 ATTWNIWKLRNNVVFNGVIPNASSLVEDIIANSWLW 219


>ref|XP_021996043.1| uncharacterized protein LOC110893235 [Helianthus annuus]
          Length = 149

 Score = 75.1 bits (183), Expect = 2e-14
 Identities = 40/110 (36%), Positives = 58/110 (52%), Gaps = 4/110 (3%)
 Frame = -1

Query: 401 DLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNV----NEGLCDDVLNTFLQSSKCI 234
           D  C  C  E E ADH+F +C+  RSVW  +F WL V    N     D+L  F  S    
Sbjct: 19  DALCANCGFEEESADHLFAKCLTARSVWWNIFSWLKVPWPSNVDSLKDLLEVFYNSP--- 75

Query: 233 QGKSARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84
             K  +R+ H+  V T+W++W  RNR +F+G   +V  ++ S+K  S+ W
Sbjct: 76  GSKVWKRLAHMVAVDTVWRIWNARNRKVFEGDGISVRKIVDSIKEESFIW 125

