BLASTX nr result

ID: Glycyrrhiza28_contig00009627 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza28_contig00009627
         (349 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

GAU31820.1 hypothetical protein TSUD_58210 [Trifolium subterraneum]    78   2e-14
GAU45704.1 hypothetical protein TSUD_86800 [Trifolium subterraneum]    76   7e-14
GAU50616.1 hypothetical protein TSUD_290710 [Trifolium subterran...    73   8e-13
KYP68633.1 Retrovirus-related Pol polyprotein from transposon TN...    70   1e-11
GAU44433.1 hypothetical protein TSUD_100800 [Trifolium subterran...    70   1e-11
OIT30261.1 hypothetical protein A4A49_58297, partial [Nicotiana ...    67   2e-11
KYP45328.1 hypothetical protein KK1_033114 [Cajanus cajan]             68   2e-11
XP_012570513.1 PREDICTED: LOW QUALITY PROTEIN: LINE-1 reverse tr...    69   2e-11
GAU25315.1 hypothetical protein TSUD_375780 [Trifolium subterran...    69   3e-11
GAU29576.1 hypothetical protein TSUD_153260 [Trifolium subterran...    69   3e-11
GAU31823.1 hypothetical protein TSUD_58240 [Trifolium subterraneum]    69   4e-11
GAU11490.1 hypothetical protein TSUD_344800 [Trifolium subterran...    68   5e-11
XP_019229157.1 PREDICTED: uncharacterized protein LOC109210231 [...    66   1e-10
GAU32234.1 hypothetical protein TSUD_53610 [Trifolium subterraneum]    67   1e-10
GAU36337.1 hypothetical protein TSUD_321760 [Trifolium subterran...    67   2e-10
XP_016164475.1 PREDICTED: uncharacterized protein LOC107606996 [...    66   2e-10
XP_013455653.1 hypothetical protein MTR_4g048330 [Medicago trunc...    66   3e-10
GAU51530.1 hypothetical protein TSUD_413950 [Trifolium subterran...    66   3e-10
XP_006579271.1 PREDICTED: uncharacterized protein LOC102666356 [...    65   4e-10
GAU31058.1 hypothetical protein TSUD_214940 [Trifolium subterran...    65   5e-10

>GAU31820.1 hypothetical protein TSUD_58210 [Trifolium subterraneum]
          Length = 1409

 Score = 77.8 bits (190), Expect = 2e-14
 Identities = 49/114 (42%), Positives = 70/114 (61%), Gaps = 12/114 (10%)
 Frame = +3

Query: 27  DASQSLVNAADGKKHYGRG-----RGKWSNKQCSHCGKAGHTIDVCYEIHGYPIGYKRNF 191
           D S SL+NAA   +  G+G     + K SN+QC+ C ++GHT+D CY+ HG+P  +K+N 
Sbjct: 236 DDSSSLINAAQRYEAKGKGIASSSQSKNSNRQCTFCHRSGHTVDFCYQKHGHP-SFKKN- 293

Query: 192 KPSVNLA------ADASDTKSEMGDKMASSS-ISQEEFKQLMDLLKKVNVAQPS 332
           + SVN A      A AS + +E+G    +SS ISQE+F QLM LL + N+   S
Sbjct: 294 RSSVNAANTQVVQAPASVSNTEVGSSSGTSSPISQEQFGQLMALLHQTNLLPAS 347


>GAU45704.1 hypothetical protein TSUD_86800 [Trifolium subterraneum]
          Length = 902

 Score = 76.3 bits (186), Expect = 7e-14
 Identities = 44/115 (38%), Positives = 67/115 (58%), Gaps = 13/115 (11%)
 Frame = +3

Query: 27  DASQSLVNAADGKKHYGRG--------RGKWSNKQCSHCGKAGHTIDVCYEIHGYPIGYK 182
           D S+ LVNAAD KK Y +          GK  N+ C++C + GHT+D C++ HGYP   +
Sbjct: 244 DESKVLVNAADSKKPYYKNSKPNFQSFNGK-GNRHCTYCDRQGHTVDGCFKKHGYPPHMQ 302

Query: 183 RNFKPSVNLAADASDTKSEMGDKMASS-----SISQEEFKQLMDLLKKVNVAQPS 332
           RNF    N + + SD++S+  ++  SS     S++Q++F QLM LL+   + Q S
Sbjct: 303 RNFGSVHNTSTEGSDSQSQQMERGESSNSSPASLTQDQFDQLMLLLQSSGMNQSS 357


>GAU50616.1 hypothetical protein TSUD_290710 [Trifolium subterraneum]
          Length = 404

 Score = 73.2 bits (178), Expect = 8e-13
 Identities = 36/91 (39%), Positives = 57/91 (62%), Gaps = 11/91 (12%)
 Frame = +3

Query: 69  HYGRGRGKWS--------NKQCSHCGKAGHTIDVCYEIHGYPIGYKRNFKPSVNLAADAS 224
           H GRGRG+ +        N+ C+HCG+  H +D C+E+HGYP GY+     SVN+AA AS
Sbjct: 266 HGGRGRGRGNHHGGRGPNNRTCTHCGRHNHIVDTCFELHGYPPGYQHKNSKSVNVAATAS 325

Query: 225 DTKSEMGD-KMASSSIS--QEEFKQLMDLLK 308
           +   + G   + S++I+  QE++ Q++ LL+
Sbjct: 326 NATLKEGHINLTSATINTIQEQYNQILQLLQ 356


>KYP68633.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 1108

 Score = 70.1 bits (170), Expect = 1e-11
 Identities = 36/102 (35%), Positives = 61/102 (59%), Gaps = 10/102 (9%)
 Frame = +3

Query: 57  DGKKHYGRGRGK--W---SNKQCSHCGKAGHTIDVCYEIHGYPIGY-KRNFKPSVNLAAD 218
           D +K +G G+G   W   S K CS+CGK+GHT+DVCY+ HGYP+ +  +N     N+  +
Sbjct: 242 DNRKSFGIGKGNNSWGRGSGKVCSYCGKSGHTVDVCYKKHGYPLNFGSKNNSTVQNIIQE 301

Query: 219 ASDTKSEMGDKMASSS----ISQEEFKQLMDLLKKVNVAQPS 332
            ++   +   K  SS+    I+QE+++ L+ L+++ N+   S
Sbjct: 302 ETEENEDQSRKEDSSNSQQVITQEQYRNLLALIQQSNLQASS 343


>GAU44433.1 hypothetical protein TSUD_100800 [Trifolium subterraneum]
          Length = 914

 Score = 69.7 bits (169), Expect = 1e-11
 Identities = 40/108 (37%), Positives = 55/108 (50%), Gaps = 9/108 (8%)
 Frame = +3

Query: 12  LEPIQDASQSLVNAADGKKHYGRGRGKWSNKQCSHCGKAGHTIDVCYEIHGYPIGY---- 179
           L  + D+S +L+N A  K +YG+G    SNK C+ CGK GH +D+ Y+ HGYP G+    
Sbjct: 240 LNSLDDSSSALINLAK-KPYYGKGNSSNSNKTCTFCGKGGHIVDIFYKKHGYPPGFRFRD 298

Query: 180 -----KRNFKPSVNLAADASDTKSEMGDKMASSSISQEEFKQLMDLLK 308
                K     S+N  A  +     + D     S S  E+K LM LLK
Sbjct: 299 GTVVGKSQGNSSINNIAGENVESKAVVDSDDRVSFSHVEYKALMALLK 346


>OIT30261.1 hypothetical protein A4A49_58297, partial [Nicotiana attenuata]
          Length = 197

 Score = 67.4 bits (163), Expect = 2e-11
 Identities = 39/123 (31%), Positives = 59/123 (47%), Gaps = 8/123 (6%)
 Frame = +3

Query: 3   QNGLEPIQDASQSLVNAADGKK--HYGRGRGKWSNKQCSHCGKAGHTIDVCYEIHGYPIG 176
           Q  ++P+ +  ++  N     +  H    R   +N  C +C K GHTI+ CY+IHGYP+ 
Sbjct: 23  QRSIQPLGNYPRNTANGLKNNQTQHQDYKRSTSTNLICRYCKKPGHTIEKCYKIHGYPLN 82

Query: 177 YKR------NFKPSVNLAADASDTKSEMGDKMASSSISQEEFKQLMDLLKKVNVAQPSPV 338
           +K       N     N  +   DT+S+  D   S  I QE+  QL +LL++V   Q  P 
Sbjct: 83  FKNSRPRPFNNHAHSNATSTQEDTQSQASD--VSIGIKQEQLNQLTELLQQVKFGQQGPS 140

Query: 339 LKE 347
             E
Sbjct: 141 SSE 143


>KYP45328.1 hypothetical protein KK1_033114 [Cajanus cajan]
          Length = 260

 Score = 68.2 bits (165), Expect = 2e-11
 Identities = 31/78 (39%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
 Frame = +3

Query: 3   QNGLEPIQDASQSLVNAADGKKHYGRGRGKWSNKQCSHCGKAGHTIDVCYEIHGYPIGYK 182
           QNGL    + +Q ++NA DG++  G+GRG  +++ C++CGK GHT+D CY  HG+P   K
Sbjct: 53  QNGLIHSNEENQVMINATDGRRFVGKGRG--NSRICTYCGKTGHTVDTCYRKHGFPPSLK 110

Query: 183 -RNFKPSVNLAADASDTK 233
            +    SVN   +  D++
Sbjct: 111 SKGSNSSVNCTLNYKDSE 128


>XP_012570513.1 PREDICTED: LOW QUALITY PROTEIN: LINE-1 reverse transcriptase
           homolog [Cicer arietinum]
          Length = 424

 Score = 68.9 bits (167), Expect = 2e-11
 Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 2/92 (2%)
 Frame = +3

Query: 27  DASQSLVNAADGKKHYGRGRGKWSNKQCSHCGKAGHTIDVCYEIHGYP--IGYKRNFKPS 200
           D   +LVNA DG + YG+GRG+ S K C++CGK GH ++ CY+ HG+P  +G+  N+   
Sbjct: 14  DDPTTLVNAVDGGRSYGQGRGRGSGKICTYCGKVGHMVETCYKKHGFPPNLGHDNNYA-- 71

Query: 201 VNLAADASDTKSEMGDKMASSSISQEEFKQLM 296
                 A+ TK+E  +    + + +E    L+
Sbjct: 72  ------ANYTKAESYNNCIDNGLDEEVIPNLI 97


>GAU25315.1 hypothetical protein TSUD_375780 [Trifolium subterraneum]
          Length = 758

 Score = 68.9 bits (167), Expect = 3e-11
 Identities = 43/120 (35%), Positives = 61/120 (50%), Gaps = 11/120 (9%)
 Frame = +3

Query: 6   NGLEPIQDASQSLVNAADGKKHYG--------RGRGKWSNKQCSHCGKAGHTIDVCYEIH 161
           NGLE   D++  L N A+ +K+ G         G   +SNK C++CGK+GH ID+CY  +
Sbjct: 236 NGLEVADDSTIGL-NLAEARKNNGYKGKNASNAGNNGYSNKTCTYCGKSGHNIDICYRKN 294

Query: 162 GYPIGYK---RNFKPSVNLAADASDTKSEMGDKMASSSISQEEFKQLMDLLKKVNVAQPS 332
           GYP G+K    +   S+   A AS + S   D       S+ EF+ L  LL+   V   S
Sbjct: 295 GYPPGFKYKDGSVPKSLMANAAASTSSSTKQDSKPLMGFSEAEFQALRKLLQNSQVGTTS 354


>GAU29576.1 hypothetical protein TSUD_153260 [Trifolium subterraneum]
          Length = 1100

 Score = 68.9 bits (167), Expect = 3e-11
 Identities = 37/98 (37%), Positives = 50/98 (51%), Gaps = 3/98 (3%)
 Frame = +3

Query: 48  NAADGKKHYGRGRGKWSNKQCSHCGKAGHTIDVCYEIHGYPIGYK---RNFKPSVNLAAD 218
           N   GK  Y  G   +SNK C++CGK+GH ID+CY  +GYP G+K    +   S+   A 
Sbjct: 257 NGYKGKNAYNAGNNGYSNKTCTYCGKSGHNIDICYRKNGYPPGFKYKDGSVPKSLMANAA 316

Query: 219 ASDTKSEMGDKMASSSISQEEFKQLMDLLKKVNVAQPS 332
           AS + S   D       S+ EF+ L  LL+   V   S
Sbjct: 317 ASTSSSTKQDSKPFMGFSEAEFQALRKLLQNSQVGTTS 354


>GAU31823.1 hypothetical protein TSUD_58240 [Trifolium subterraneum]
          Length = 1119

 Score = 68.6 bits (166), Expect = 4e-11
 Identities = 43/112 (38%), Positives = 60/112 (53%), Gaps = 10/112 (8%)
 Frame = +3

Query: 27  DASQSLVNAADGKKHY-----GRGRGKWSNKQCSHCGKAGHTIDVCYEIHGYP---IGYK 182
           + S+ LVNAAD KK       G  +    NK C++C K  HTI+ C++ HG+P     Y 
Sbjct: 244 EESKVLVNAADSKKQNYYASKGNSQSSKGNKYCTYCHKTNHTINECFKKHGFPPHMQKYN 303

Query: 183 RNFKPSVNLAADASDTKSEMGDKMASS--SISQEEFKQLMDLLKKVNVAQPS 332
           R    S +  +++    SE G+   SS  SISQE++ QLM LLK  +V   S
Sbjct: 304 RAASNSSHAGSNSMTNSSEHGESSRSSTPSISQEQYDQLMTLLKNSSVNHSS 355


>GAU11490.1 hypothetical protein TSUD_344800 [Trifolium subterraneum]
          Length = 551

 Score = 68.2 bits (165), Expect = 5e-11
 Identities = 38/110 (34%), Positives = 69/110 (62%), Gaps = 9/110 (8%)
 Frame = +3

Query: 18  PIQDASQSLVNAADGKKHYGRGRG----KWSNKQCSHCGKAGHTIDVCYEIHGYPIGYKR 185
           PI D+S + +NA+D +K   RG+G    + SN+ C++CGK GHT+D CY  HG+P  +K 
Sbjct: 235 PIDDSS-AFINASDARKPPFRGKGPSNGRNSNRVCTYCGKNGHTVDFCYAKHGHPNVHKG 293

Query: 186 NFKPSVNLAADA----SDTKSEMGDKMASSS-ISQEEFKQLMDLLKKVNV 320
           N     +    A    +++ +E+G   ++++ +SQ+++ QL+ LL++ N+
Sbjct: 294 NASVHASNGEVAETRFANSVTEVGSSSSNATGLSQDKYDQLISLLQQANL 343


>XP_019229157.1 PREDICTED: uncharacterized protein LOC109210231 [Nicotiana
           attenuata]
          Length = 274

 Score = 66.2 bits (160), Expect = 1e-10
 Identities = 40/111 (36%), Positives = 54/111 (48%), Gaps = 11/111 (9%)
 Frame = +3

Query: 48  NAADGKK-----HYGRGRGKWSNKQCSHCGKAGHTIDVCYEIHGYPIGYKR------NFK 194
           N A+G K     H    R   +N  C +C K GHTI+ CY+IHGYP+ +K       N  
Sbjct: 148 NTANGLKNNQTQHQDYKRSTSTNLICRYCKKPGHTIEKCYKIHGYPLNFKNSRPRPFNNH 207

Query: 195 PSVNLAADASDTKSEMGDKMASSSISQEEFKQLMDLLKKVNVAQPSPVLKE 347
              N  +   DT+S+  D   S  I QE+  QL +LL++V   Q  P   E
Sbjct: 208 AHSNATSTQEDTQSQASD--VSIGIKQEQLNQLTELLQQVKFGQQGPSSSE 256


>GAU32234.1 hypothetical protein TSUD_53610 [Trifolium subterraneum]
          Length = 1009

 Score = 67.0 bits (162), Expect = 1e-10
 Identities = 40/112 (35%), Positives = 61/112 (54%), Gaps = 10/112 (8%)
 Frame = +3

Query: 27  DASQSLVNAADGKKHY-----GRGRGKWSNKQCSHCGKAGHTIDVCYEIHGYPIGYKRNF 191
           + S+ LVNAAD KK       G  +    N+ C++C K  HTI+ C++ HG+P   +++ 
Sbjct: 244 EESKVLVNAADSKKQNYYASKGYSQSSKENRYCTYCHKTNHTINECFKKHGFPPHMQKHN 303

Query: 192 KPSVNLAADASDTKSEMGDKMASS-----SISQEEFKQLMDLLKKVNVAQPS 332
           + + N +   SD+ +   D   SS     SISQE++ QLM LLK  +V   S
Sbjct: 304 RAASNSSHAGSDSMTNSSDHGESSRSSTPSISQEQYDQLMTLLKNSSVNHSS 355


>GAU36337.1 hypothetical protein TSUD_321760 [Trifolium subterraneum]
          Length = 1094

 Score = 66.6 bits (161), Expect = 2e-10
 Identities = 49/133 (36%), Positives = 73/133 (54%), Gaps = 26/133 (19%)
 Frame = +3

Query: 9   GLEPIQDA----SQSLVNAADGK---KHYGRGRGKWS-------------NKQCSHCGKA 128
           GL P  DA    S +L N+ D     + YGRGRG +S              K C++CGK 
Sbjct: 214 GLTPQVDAKTESSDALANSVDSHGAGRGYGRGRGNFSYQGGRGRGNNSNTTKVCTYCGKN 273

Query: 129 GHTIDVCYEIHGYP--IGYKRN---FKPSV-NLAADASDTKSEMGDKMASSSISQEEFKQ 290
           GHTID+CY+ HGYP   GY R+      SV N+ AD  D   E+G+  ++ S++++++  
Sbjct: 274 GHTIDICYKKHGYPPNWGYTRSNNGGNSSVNNVEADHDD---EVGN--SNVSLTKDQYNS 328

Query: 291 LMDLLKKVNVAQP 329
           L+ LL++ N+  P
Sbjct: 329 LLALLERNNLDNP 341


>XP_016164475.1 PREDICTED: uncharacterized protein LOC107606996 [Arachis ipaensis]
          Length = 338

 Score = 66.2 bits (160), Expect = 2e-10
 Identities = 37/81 (45%), Positives = 47/81 (58%), Gaps = 4/81 (4%)
 Frame = +3

Query: 75  GRGRG--KWSNKQCSHCGKAGHTIDVCYEIHGYP--IGYKRNFKPSVNLAADASDTKSEM 242
           GRGRG  K     CSHCG  GHTID CY+IHGYP   G  RN KPSV+  + +  T    
Sbjct: 257 GRGRGMVKKDRPLCSHCGILGHTIDKCYKIHGYPPNFGKGRNMKPSVHHVSTSQST---- 312

Query: 243 GDKMASSSISQEEFKQLMDLL 305
            D   S +++  + +QL+ LL
Sbjct: 313 SDAQISPNLAPTQVQQLLTLL 333


>XP_013455653.1 hypothetical protein MTR_4g048330 [Medicago truncatula] KEH29684.1
           hypothetical protein MTR_4g048330 [Medicago truncatula]
          Length = 422

 Score = 65.9 bits (159), Expect = 3e-10
 Identities = 36/113 (31%), Positives = 65/113 (57%), Gaps = 9/113 (7%)
 Frame = +3

Query: 21  IQDASQSLVNAADGKKHYGRGRG-------KWSNKQCSHCGKAGHTIDVCYEIHGYPIGY 179
           I+D+S  LVNA D +K YGRG+        K +++ C+ C +  HT+D CY+ HGYP   
Sbjct: 171 IEDSS-ILVNALDARKLYGRGKSPSGYSQSKNTSRYCTFCHRNNHTVDFCYQKHGYPNAN 229

Query: 180 KRNFKPSVNLAADASDTKS--EMGDKMASSSISQEEFKQLMDLLKKVNVAQPS 332
           K     +V     ++D++S  E    ++ + ++QE++  L+ LL++ ++  P+
Sbjct: 230 KSLAASNVVTTESSTDSQSIGEGSSSISQTGLTQEQYVHLVSLLQQSSLVSPA 282


>GAU51530.1 hypothetical protein TSUD_413950 [Trifolium subterraneum]
          Length = 830

 Score = 65.9 bits (159), Expect = 3e-10
 Identities = 45/130 (34%), Positives = 71/130 (54%), Gaps = 23/130 (17%)
 Frame = +3

Query: 9   GLEPIQDA----SQSLVNAADGK---KHYGRGRGKWSN-------------KQCSHCGKA 128
           GL P  DA    S +L N+ D     + YGRGRG +S              K C++CGK 
Sbjct: 240 GLTPQVDAKTESSDALANSVDSHGAGRDYGRGRGNFSYQGGRGRGNNSNTAKVCTYCGKN 299

Query: 129 GHTIDVCYEIHGYP--IGYKR-NFKPSVNLAADASDTKSEMGDKMASSSISQEEFKQLMD 299
           GHTID+CY+ HGYP   GY R N   + ++    +D   E+G+  ++ S++++++  L+ 
Sbjct: 300 GHTIDICYKKHGYPPNWGYTRGNNGGNSSVNNVEADHDDEVGN--SNVSLTKDQYNSLLA 357

Query: 300 LLKKVNVAQP 329
           LL++ N+  P
Sbjct: 358 LLERNNLDNP 367


>XP_006579271.1 PREDICTED: uncharacterized protein LOC102666356 [Glycine max]
          Length = 422

 Score = 65.5 bits (158), Expect = 4e-10
 Identities = 31/84 (36%), Positives = 52/84 (61%), Gaps = 5/84 (5%)
 Frame = +3

Query: 75  GRGRGKWSNKQCSHCGKAGHTIDVCYEIHGYPIGYKRNFKPSVNLAADASDTKSEMGDKM 254
           G+G+G  + K C++CGK GHTIDVCY+ HGYP G+K N   ++  +  A++ K+     +
Sbjct: 303 GKGKGYGTRKTCTYCGKLGHTIDVCYKKHGYPPGFKFNNGKAIANSVVATEGKATDDQIL 362

Query: 255 ASSS-----ISQEEFKQLMDLLKK 311
           +  S      S E++K L+ L+++
Sbjct: 363 SQESQEQVRFSSEQYKALLALIQQ 386


>GAU31058.1 hypothetical protein TSUD_214940 [Trifolium subterraneum]
          Length = 927

 Score = 65.5 bits (158), Expect = 5e-10
 Identities = 43/129 (33%), Positives = 66/129 (51%), Gaps = 22/129 (17%)
 Frame = +3

Query: 9   GLEPIQDA----SQSLVNAAD---GKKHYGRGRGKWSN-------------KQCSHCGKA 128
           GL P  DA    S +L N+ D    ++ YGRGRG +S              K CS+CGK 
Sbjct: 240 GLTPQVDAKTESSDALANSVDRHGARRGYGRGRGNFSYQGGRGRGNNSNTAKVCSYCGKN 299

Query: 129 GHTIDVCYEIHGYP--IGYKRNFKPSVNLAADASDTKSEMGDKMASSSISQEEFKQLMDL 302
           GHTID+CY+ HGYP   GY R      +   +      + G   ++ S++++++  L+ L
Sbjct: 300 GHTIDICYKKHGYPPNWGYTRGNNGGNSSVNNVEVDHDDEGGN-SNVSLTKDQYNSLLAL 358

Query: 303 LKKVNVAQP 329
           L++ N+  P
Sbjct: 359 LERNNLDNP 367


Top