BLASTX nr result

ID: Glycyrrhiza32_contig00030560 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza32_contig00030560
         (777 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

KHN03945.1 hypothetical protein glysoja_022631, partial [Glycine...    87   3e-18
KHN04350.1 hypothetical protein glysoja_030944 [Glycine soja]          86   1e-16
GAU42970.1 hypothetical protein TSUD_188430 [Trifolium subterran...    89   3e-16
KHN39553.1 hypothetical protein glysoja_045723, partial [Glycine...    84   5e-16
KHN35644.1 hypothetical protein glysoja_030996 [Glycine soja]          81   3e-15
GAU48210.1 hypothetical protein TSUD_404970 [Trifolium subterran...    86   3e-15
GAU35033.1 hypothetical protein TSUD_103560 [Trifolium subterran...    84   4e-15
KHN41375.1 Putative ribonuclease H protein, partial [Glycine soja]     83   1e-14
XP_004514417.1 PREDICTED: uncharacterized protein LOC101494040 [...    81   1e-14
GAU10013.1 hypothetical protein TSUD_415800, partial [Trifolium ...    77   1e-13
KHN30886.1 Putative ribonuclease H protein, partial [Glycine soja]     79   2e-13
KHN18837.1 hypothetical protein glysoja_028206 [Glycine soja]          74   2e-13
KHN20429.1 hypothetical protein glysoja_044415, partial [Glycine...    74   3e-13
GAU40143.1 hypothetical protein TSUD_163120 [Trifolium subterran...    77   2e-12
GAU34577.1 hypothetical protein TSUD_29230 [Trifolium subterraneum]    74   3e-12
GAU28852.1 hypothetical protein TSUD_21890 [Trifolium subterraneum]    77   3e-12
XP_003599293.1 hypothetical protein MTR_3g031290 [Medicago trunc...    75   4e-12
AFK37936.1 unknown [Lotus japonicus]                                   71   6e-12
KHN20559.1 hypothetical protein glysoja_033247, partial [Glycine...    70   6e-12
KYP53237.1 hypothetical protein KK1_024864 [Cajanus cajan]             71   6e-12

>KHN03945.1 hypothetical protein glysoja_022631, partial [Glycine soja]
          Length = 114

 Score = 87.4 bits (215), Expect = 3e-18
 Identities = 36/108 (33%), Positives = 58/108 (53%)
 Frame = +3

Query: 3   FSYGIWSAVLNWFGGVGVFQQGCIDHFLQFSEIMKGRKTRKVKNLVWMVVSWSLWTTRNN 182
           F+  +W  +LNW G          + +LQ    ++G+K R+ K+L+W    WS+W  RNN
Sbjct: 7   FAKQVWQGILNWLGYSFSLPNNIQELYLQLGMNIRGKKKRRFKHLLWHNTCWSIWCHRNN 66

Query: 183 IIFTGGVADFMSVLNQVKVLSWGWFINRAGRNLGVLFSDWCINPLNCL 326
           +IF     D  + +  +K +SW W + ++    G  FS WC+ PL+CL
Sbjct: 67  VIFRNAEVDVNNTILFIKSMSWQWVLYKSSGKPGFFFSSWCLCPLDCL 114


>KHN04350.1 hypothetical protein glysoja_030944 [Glycine soja]
          Length = 229

 Score = 86.3 bits (212), Expect = 1e-16
 Identities = 38/104 (36%), Positives = 59/104 (56%)
 Frame = +3

Query: 15  IWSAVLNWFGGVGVFQQGCIDHFLQFSEIMKGRKTRKVKNLVWMVVSWSLWTTRNNIIFT 194
           IW  V  W     V  +    HF+ F +++KG+K ++VK+L+WM V W++W TRN +IF 
Sbjct: 123 IWQQVCLWLDVPVVEGEDIQAHFMAFGKLIKGKKQKRVKHLIWMAVIWNIWLTRNKVIFK 182

Query: 195 GGVADFMSVLNQVKVLSWGWFINRAGRNLGVLFSDWCINPLNCL 326
              A    +++ +K  +W WF+ R GR     +SDW   P+ CL
Sbjct: 183 EEAAAIPVMISGIKDCAWAWFMARQGRTCWDGWSDWYNCPMGCL 226


>GAU42970.1 hypothetical protein TSUD_188430 [Trifolium subterraneum]
          Length = 767

 Score = 88.6 bits (218), Expect = 3e-16
 Identities = 34/103 (33%), Positives = 56/103 (54%)
 Frame = +3

Query: 15  IWSAVLNWFGGVGVFQQGCIDHFLQFSEIMKGRKTRKVKNLVWMVVSWSLWTTRNNIIFT 194
           +W  V  WFG     +    +HF  F  ++K ++  KV++L+W+  +WS+W  RNN++F 
Sbjct: 660 VWEEVFKWFGKSYQAEADGWNHFNIFGSLLKTKRFEKVRHLIWLATTWSIWKLRNNVVFN 719

Query: 195 GGVADFMSVLNQVKVLSWGWFINRAGRNLGVLFSDWCINPLNC 323
           G      S++N +K +S  W   R G    + F DWC +P+ C
Sbjct: 720 GVTLSSSSLVNDIKTISCLWLSGRYGHISSISFPDWCFDPMTC 762


>KHN39553.1 hypothetical protein glysoja_045723, partial [Glycine soja]
          Length = 211

 Score = 84.0 bits (206), Expect = 5e-16
 Identities = 38/108 (35%), Positives = 55/108 (50%), Gaps = 1/108 (0%)
 Frame = +3

Query: 3   FSYGIWSAVLNWFGGVGVFQQGCIDHFLQFSEIMKGRKTR-KVKNLVWMVVSWSLWTTRN 179
           FS  IW  +L+W G V V   G + HF ++  ++K   +R KV  + W+   W +W  RN
Sbjct: 104 FSKCIWKEILSWIGIVDVIAVGGVQHFWEYDRLLKYNTSRNKVPFMFWLATLWIIWQVRN 163

Query: 180 NIIFTGGVADFMSVLNQVKVLSWGWFINRAGRNLGVLFSDWCINPLNC 323
           N IF     D    +NQ+K + W WF+ + G   G   SDW  +P  C
Sbjct: 164 NSIFKEEEKDIPKTINQIKHICWAWFMGKVGGVTGSNISDWWNSPFLC 211


>KHN35644.1 hypothetical protein glysoja_030996 [Glycine soja]
          Length = 160

 Score = 80.9 bits (198), Expect = 3e-15
 Identities = 38/104 (36%), Positives = 58/104 (55%)
 Frame = +3

Query: 15  IWSAVLNWFGGVGVFQQGCIDHFLQFSEIMKGRKTRKVKNLVWMVVSWSLWTTRNNIIFT 194
           IW  V  W     V  +    HF+ F +++KG+K ++VK+L+WM V W++W TRN +IF 
Sbjct: 54  IWQQVCLWLDVPVVEGEDIQAHFMAFGKLIKGKKQKRVKHLIWMAVIWNIWLTRNKVIFK 113

Query: 195 GGVADFMSVLNQVKVLSWGWFINRAGRNLGVLFSDWCINPLNCL 326
              A    +++ +K  +W WF  R GR   V +SD    P+ CL
Sbjct: 114 EEAAAIPVMISGIKDCAWAWFKARQGRICWVGWSDSYNCPMGCL 157


>GAU48210.1 hypothetical protein TSUD_404970 [Trifolium subterraneum]
          Length = 1653

 Score = 85.5 bits (210), Expect = 3e-15
 Identities = 36/103 (34%), Positives = 53/103 (51%)
 Frame = +3

Query: 15   IWSAVLNWFGGVGVFQQGCIDHFLQFSEIMKGRKTRKVKNLVWMVVSWSLWTTRNNIIFT 194
            +W AV NW G           HF  F +++      +V++L+W+  +W+LW  RNN+IF 
Sbjct: 1546 VWEAVYNWIGKDYHAGAEGWSHFKVFGDMVNSTNIERVRHLIWLATTWNLWKLRNNVIFN 1605

Query: 195  GGVADFMSVLNQVKVLSWGWFINRAGRNLGVLFSDWCINPLNC 323
            G      S+LN +K +S  W   R G    + FS WC +PL C
Sbjct: 1606 GATPSASSLLNDIKAISCAWVSGRYGHKSCISFSLWCFDPLAC 1648


>GAU35033.1 hypothetical protein TSUD_103560 [Trifolium subterraneum]
          Length = 311

 Score = 83.6 bits (205), Expect = 4e-15
 Identities = 38/109 (34%), Positives = 60/109 (55%)
 Frame = +3

Query: 3   FSYGIWSAVLNWFGGVGVFQQGCIDHFLQFSEIMKGRKTRKVKNLVWMVVSWSLWTTRNN 182
           F+  +W  ++ W G V +  Q  +  F  F+E   G+K R+   ++W  V W+LW  RN 
Sbjct: 203 FATQVWEQIITWLGMVFMLPQSLVSFFSFFAETSGGKKRRQGLIMIWNAVVWALWRQRNR 262

Query: 183 IIFTGGVADFMSVLNQVKVLSWGWFINRAGRNLGVLFSDWCINPLNCLS 329
           IIF  G  D   V+ ++KV SW W+I R+  +  +L+ +W   PL CL+
Sbjct: 263 IIFENGTGDLNGVVEEIKVSSWKWWIGRSKSDPCLLY-EWNQEPLLCLA 310


>KHN41375.1 Putative ribonuclease H protein, partial [Glycine soja]
          Length = 363

 Score = 82.8 bits (203), Expect = 1e-14
 Identities = 38/108 (35%), Positives = 55/108 (50%), Gaps = 1/108 (0%)
 Frame = +3

Query: 3   FSYGIWSAVLNWFGGVGVFQQGCIDHFLQFSEIMKGRKTR-KVKNLVWMVVSWSLWTTRN 179
           FS  IW  +L+W G V V   G + HF ++  ++K   +R KV  + W+   W +W  RN
Sbjct: 256 FSKCIWKEILSWIGIVDVIAVGGVQHFWEYDRLLKYNTSRNKVPFMFWLATLWIIWQVRN 315

Query: 180 NIIFTGGVADFMSVLNQVKVLSWGWFINRAGRNLGVLFSDWCINPLNC 323
           N IF     D    +NQ+K + W WF+ + G   G   SDW  +P  C
Sbjct: 316 NSIFKEEEKDIPKTINQIKHICWAWFMGKVGGVGGSNISDWWNSPFLC 363


>XP_004514417.1 PREDICTED: uncharacterized protein LOC101494040 [Cicer arietinum]
          Length = 246

 Score = 80.9 bits (198), Expect = 1e-14
 Identities = 39/102 (38%), Positives = 56/102 (54%), Gaps = 1/102 (0%)
 Frame = +3

Query: 15  IWSAVLNWFGGVGVFQQGCIDHFLQFSEI-MKGRKTRKVKNLVWMVVSWSLWTTRNNIIF 191
           +W  VLNWF   GVF +  I+H  QF  + + GR TR+V NL+W  V W++W  RN  IF
Sbjct: 146 VWYGVLNWFALFGVFHKDAINHANQFDGLFLCGRVTRQVINLIWFAVMWAIWKLRNEAIF 205

Query: 192 TGGVADFMSVLNQVKVLSWGWFINRAGRNLGVLFSDWCINPL 317
              V +   ++  VK+L+W W I    +     ++ WC  PL
Sbjct: 206 RSHVLNANLIIEHVKLLAWRW-IRLHVKEFCYGYNQWCSQPL 246


>GAU10013.1 hypothetical protein TSUD_415800, partial [Trifolium subterraneum]
          Length = 169

 Score = 76.6 bits (187), Expect = 1e-13
 Identities = 41/108 (37%), Positives = 53/108 (49%)
 Frame = +3

Query: 3   FSYGIWSAVLNWFGGVGVFQQGCIDHFLQFSEIMKGRKTRKVKNLVWMVVSWSLWTTRNN 182
           FS  IW AV  W G V V        F         +K RK   L+W    W LW +RN 
Sbjct: 61  FSVQIWQAVFRWLGLVVVIPPNMFVLFDCLIGAASNKKIRKGYALIWHATIWMLWKSRNE 120

Query: 183 IIFTGGVADFMSVLNQVKVLSWGWFINRAGRNLGVLFSDWCINPLNCL 326
           IIF+ GV D   V +++K+LSW W ++R       LF +WC +P  CL
Sbjct: 121 IIFSNGVKDSEKVFDEIKLLSWRWGLSRHSIPT-CLFYEWCWDPGMCL 167


>KHN30886.1 Putative ribonuclease H protein, partial [Glycine soja]
          Length = 373

 Score = 79.3 bits (194), Expect = 2e-13
 Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 1/101 (0%)
 Frame = +3

Query: 3   FSYGIWSAVLNWFGGVGVFQQGCIDHFLQFSEIMKGRKTR-KVKNLVWMVVSWSLWTTRN 179
           FS  IW  +L+W G V V   G + HF ++  ++K   +R KV  + W+   W +W  RN
Sbjct: 270 FSKCIWKEILSWIGIVDVIAVGGVQHFWEYDRLLKYNTSRNKVPFMFWLATLWIIWQVRN 329

Query: 180 NIIFTGGVADFMSVLNQVKVLSWGWFINRAGRNLGVLFSDW 302
           N IF     D    +NQ+K + W WF+ + G   G   SDW
Sbjct: 330 NSIFKEEEKDIPKTINQIKHICWAWFMGKVGGVGGSNISDW 370


>KHN18837.1 hypothetical protein glysoja_028206 [Glycine soja]
          Length = 84

 Score = 73.6 bits (179), Expect = 2e-13
 Identities = 30/81 (37%), Positives = 50/81 (61%)
 Frame = +3

Query: 84  LQFSEIMKGRKTRKVKNLVWMVVSWSLWTTRNNIIFTGGVADFMSVLNQVKVLSWGWFIN 263
           + F +++KG+K ++VK+L+WM V W++W TRN +IF    A    +++ +K  +W WF+ 
Sbjct: 1   MAFGKLIKGKKQKRVKHLIWMAVIWNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAWFMA 60

Query: 264 RAGRNLGVLFSDWCINPLNCL 326
           R GR     +S W   P+ CL
Sbjct: 61  RQGRTCWDGWSYWYNCPMGCL 81


>KHN20429.1 hypothetical protein glysoja_044415, partial [Glycine soja]
          Length = 118

 Score = 74.3 bits (181), Expect = 3e-13
 Identities = 32/105 (30%), Positives = 51/105 (48%)
 Frame = +3

Query: 3   FSYGIWSAVLNWFGGVGVFQQGCIDHFLQFSEIMKGRKTRKVKNLVWMVVSWSLWTTRNN 182
           F+  IW  VL W G   V        F+     ++ R+ +++  + W V  W LW  RN 
Sbjct: 14  FNSKIWYVVLAWLGVSVVLPNDAKSLFIWMGGFVRVRRVKRLIFIFWHVTVWCLWNLRNQ 73

Query: 183 IIFTGGVADFMSVLNQVKVLSWGWFINRAGRNLGVLFSDWCINPL 317
           IIF     +F++ +  +K++SW W  ++ G    + FS WC  PL
Sbjct: 74  IIFKSDSIEFLACMAHIKIISWQWLFSKNGVKTSLFFSSWCCCPL 118


>GAU40143.1 hypothetical protein TSUD_163120 [Trifolium subterraneum]
          Length = 419

 Score = 77.0 bits (188), Expect = 2e-12
 Identities = 39/108 (36%), Positives = 58/108 (53%)
 Frame = +3

Query: 3   FSYGIWSAVLNWFGGVGVFQQGCIDHFLQFSEIMKGRKTRKVKNLVWMVVSWSLWTTRNN 182
           F+  IW+A+  W G V V        F  F+     +K RK   L+W    W LW +RN+
Sbjct: 311 FAGQIWNAIFRWLGLVLVIPPNFFLLFECFTGAAANKKIRKGYALIWHTTIWMLWKSRND 370

Query: 183 IIFTGGVADFMSVLNQVKVLSWGWFINRAGRNLGVLFSDWCINPLNCL 326
           I+F+ GV D   V++ +K+LSW W ++R    +  LF +WC +P  CL
Sbjct: 371 IMFSNGVIDVEKVIDDIKLLSWRWGLSRHSIPV-CLFYEWCWDPGLCL 417


>GAU34577.1 hypothetical protein TSUD_29230 [Trifolium subterraneum]
          Length = 211

 Score = 73.9 bits (180), Expect = 3e-12
 Identities = 34/107 (31%), Positives = 55/107 (51%), Gaps = 1/107 (0%)
 Frame = +3

Query: 15  IWSAVLNWFGGVGVFQQGCIDHFLQFSEIMK-GRKTRKVKNLVWMVVSWSLWTTRNNIIF 191
           +W  +LNW G  G       DH  QF  I   G++++    ++W+   WS+W  RNN +F
Sbjct: 101 LWIKILNWLGIFGPLPNVVADHVSQFCNIFPVGKESQIGSQVLWLACCWSIWKERNNRLF 160

Query: 192 TGGVADFMSVLNQVKVLSWGWFINRAGRNLGVLFSDWCINPLNCLSL 332
                   +++ +VK++SW W++N    N    F+ W  NPL CL +
Sbjct: 161 AHKELSVEALVEKVKIISW-WWLNTRKHNFDYDFNSWRSNPLVCLGI 206


>GAU28852.1 hypothetical protein TSUD_21890 [Trifolium subterraneum]
          Length = 836

 Score = 76.6 bits (187), Expect = 3e-12
 Identities = 39/105 (37%), Positives = 56/105 (53%), Gaps = 1/105 (0%)
 Frame = +3

Query: 15   IWSAVLNWFGGVGVFQQGCIDHFLQFSEIMKGRKTRKVKNLVWMVVSWSLWTTRNNIIFT 194
            IW A+  W   V V        F  F      +K RK   L+W    W LW +RN+IIF+
Sbjct: 732  IWKAIFRWLNLVIVLPPNLFMMFDCFLGAAPNKKIRKGYGLIWHATIWRLWKSRNDIIFS 791

Query: 195  GGVADFMSVLNQVKVLSWGWFINRAGRNLGV-LFSDWCINPLNCL 326
             GV D   V++++K+LSW W ++R   N+ + LF +WC +P  CL
Sbjct: 792  NGVIDAEKVIDEIKLLSWRWGLSR--HNIPICLFYEWCWDPGLCL 834


>XP_003599293.1 hypothetical protein MTR_3g031290 [Medicago truncatula] AES69544.1
           hypothetical protein MTR_3g031290 [Medicago truncatula]
          Length = 282

 Score = 74.7 bits (182), Expect = 4e-12
 Identities = 38/108 (35%), Positives = 57/108 (52%), Gaps = 2/108 (1%)
 Frame = +3

Query: 15  IWSAVLNWFGGVGVFQQGCIDHFLQFSEIMKGRKT-RKVKNLVWMVVSWSLWTTRNNIIF 191
           +W  V NW G +GV     +DHFLQF  +  G K  R    L+W++  W LW  RNN +F
Sbjct: 176 LWPMVRNWLGIIGVDTNVLLDHFLQFVHLSGGGKAVRDFLQLIWLLCVWVLWNERNNRLF 235

Query: 192 TGGVADFMSVLNQVKVLSWGWFINRAGRNLGVLFSD-WCINPLNCLSL 332
              V   + +L++VK +S  W   +A + +    +D WC +P  CL +
Sbjct: 236 NNVVTSILRLLHKVKFMSLAWL--KAKKVVFRFGTDRWCSSPFQCLDI 281


>AFK37936.1 unknown [Lotus japonicus]
          Length = 138

 Score = 71.2 bits (173), Expect = 6e-12
 Identities = 41/111 (36%), Positives = 51/111 (45%)
 Frame = +3

Query: 3   FSYGIWSAVLNWFGGVGVFQQGCIDHFLQFSEIMKGRKTRKVKNLVWMVVSWSLWTTRNN 182
           FS  IW  VL WFG            F+QF    +    R+    VWM   WSLW  RN 
Sbjct: 29  FSMAIWRMVLGWFGVSIALPSLVKALFVQFPVFGRCSSKREALVTVWMATCWSLWLMRNR 88

Query: 183 IIFTGGVADFMSVLNQVKVLSWGWFINRAGRNLGVLFSDWCINPLNCLSLL 335
           +IF  G  D   VL+ ++V SW W I     N    F +W ++PL CL  L
Sbjct: 89  VIFDNGELDTGLVLDLIQVRSWHW-IKAKRVNFQNSFYEWKLSPLACLDSL 138


>KHN20559.1 hypothetical protein glysoja_033247, partial [Glycine soja]
          Length = 86

 Score = 69.7 bits (169), Expect = 6e-12
 Identities = 28/83 (33%), Positives = 48/83 (57%), Gaps = 1/83 (1%)
 Frame = +3

Query: 78  HFLQFSEIMKGRKTRKVKNLVWMVVSWSLWTTRNNIIFTGG-VADFMSVLNQVKVLSWGW 254
           H++QF  + +G+K R+V+ LVW    W LW  RN++IF      D  +V+  ++ +SW W
Sbjct: 4   HYIQFESLFRGKKLRRVRLLVWHATCWCLWLYRNSVIFKDNFFPDVQNVVYHIQRISWTW 63

Query: 255 FINRAGRNLGVLFSDWCINPLNC 323
              +   +  + F++WC +PL C
Sbjct: 64  MKYKGHGSSSLSFANWCTSPLLC 86


>KYP53237.1 hypothetical protein KK1_024864 [Cajanus cajan]
          Length = 141

 Score = 71.2 bits (173), Expect = 6e-12
 Identities = 38/119 (31%), Positives = 59/119 (49%), Gaps = 1/119 (0%)
 Frame = +3

Query: 3   FSYGIWSAVLNWFGGVGVFQQGCIDHFLQFSEIMKGRKTRKVK-NLVWMVVSWSLWTTRN 179
           F+Y IW   LNWFG   V    C  H  QF ++       ++K + VW+ V WSLW  RN
Sbjct: 15  FAYCIWMLCLNWFGIKSVLHNSCHLHLAQFVDLPICSNVDRLKWSTVWITVLWSLWIARN 74

Query: 180 NIIFTGGVADFMSVLNQVKVLSWGWFINRAGRNLGVLFSDWCINPLNCLSLL*VWRVAD 356
             IF   V    ++L  +++ SW W +     +    +S W  +P  CL+ L ++ V++
Sbjct: 75  EAIFFDKVILHSNLLELIQMRSWKW-LKAKDLSFQYPYSSWVGSPAVCLNFLDIYSVSE 132


Top