BLASTX nr result

ID: Glycyrrhiza32_contig00034237 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza32_contig00034237
         (391 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

KHN31995.1 Copia protein, partial [Glycine soja]                      168   3e-50
KYP50660.1 Retrovirus-related Pol polyprotein from transposon TN...   154   9e-46
GAU46782.1 hypothetical protein TSUD_351810 [Trifolium subterran...   166   3e-45
KYP46147.1 Copia protein, partial [Cajanus cajan]                     155   2e-44
GAU47169.1 hypothetical protein TSUD_28920 [Trifolium subterraneum]   163   4e-44
GAU11490.1 hypothetical protein TSUD_344800 [Trifolium subterran...   160   5e-44
ABE88099.1 conserved hypothetical protein [Medicago truncatula]       149   6e-44
GAU41219.1 hypothetical protein TSUD_128950 [Trifolium subterran...   158   2e-43
KHN24193.1 Retrovirus-related Pol polyprotein from transposon TN...   151   2e-43
GAU22921.1 hypothetical protein TSUD_326940 [Trifolium subterran...   159   7e-43
GAU31823.1 hypothetical protein TSUD_58240 [Trifolium subterraneum]   159   1e-42
GAU51775.1 hypothetical protein TSUD_415620 [Trifolium subterran...   157   5e-42
KYP65473.1 Copia protein [Cajanus cajan]                              145   8e-42
KYP65404.1 Copia protein, partial [Cajanus cajan]                     150   1e-41
GAU39523.1 hypothetical protein TSUD_222930 [Trifolium subterran...   156   1e-41
KYP35345.1 hypothetical protein KK1_043626, partial [Cajanus cajan]   146   3e-41
GAU31202.1 hypothetical protein TSUD_210590 [Trifolium subterran...   154   4e-41
KHN03110.1 Copia protein, partial [Glycine soja]                      145   4e-41
KYP54539.1 Copia protein [Cajanus cajan]                              145   5e-41
ABD32333.1 polyprotein-like, putative [Medicago truncatula]           152   7e-41

>KHN31995.1 Copia protein, partial [Glycine soja]
          Length = 224

 Score =  168 bits (425), Expect = 3e-50
 Identities = 78/96 (81%), Positives = 86/96 (89%)
 Frame = +3

Query: 3   SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182
           SRSSSEAEYRALST +CELQW+LY+L DL + C R P LYCDNQSA+HIAANP+FHERTK
Sbjct: 104 SRSSSEAEYRALSTTACELQWLLYLLHDLHITCTRAPALYCDNQSALHIAANPMFHERTK 163

Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTKL 290
           HLEIDCHFVR K+Q GVLRLLPISSK QLADFFTK+
Sbjct: 164 HLEIDCHFVRNKIQEGVLRLLPISSKEQLADFFTKV 199


>KYP50660.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Cajanus cajan]
          Length = 138

 Score =  154 bits (388), Expect = 9e-46
 Identities = 78/112 (69%), Positives = 88/112 (78%)
 Frame = +3

Query: 3   SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182
           SRSSS+AEY ALSTA CELQW+LY+L DL + C R PVLYCDNQS +HIAAN +FHERTK
Sbjct: 29  SRSSSKAEYSALSTAICELQWLLYLLHDLLITCTRAPVLYCDNQSDLHIAANRLFHERTK 88

Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTKLRLCIQEFLFLFSPSWA 338
           HLEIDC FVR K+Q+GVLRLLPISSK +LA+ FTK        LF   PSWA
Sbjct: 89  HLEIDCDFVRNKIQDGVLRLLPISSKEKLANCFTK--ALPLHLLFRSFPSWA 138


>GAU46782.1 hypothetical protein TSUD_351810 [Trifolium subterraneum]
          Length = 1512

 Score =  166 bits (421), Expect = 3e-45
 Identities = 77/95 (81%), Positives = 88/95 (92%)
 Frame = +3

Query: 3    SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182
            SRSSSEAEYR+LS ASCELQWI+Y+L DL + C RPPVLYCDNQSA+HIA+NPVFHERTK
Sbjct: 1379 SRSSSEAEYRSLSFASCELQWIVYLLKDLSIDCERPPVLYCDNQSAIHIASNPVFHERTK 1438

Query: 183  HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287
            HLEIDCH VR+KVQ+GV +LLPIS+K+QLADFFTK
Sbjct: 1439 HLEIDCHLVRDKVQSGVFKLLPISTKAQLADFFTK 1473


>KYP46147.1 Copia protein, partial [Cajanus cajan]
          Length = 285

 Score =  155 bits (391), Expect = 2e-44
 Identities = 74/95 (77%), Positives = 82/95 (86%)
 Frame = +3

Query: 3   SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182
           SRSS EAEYR LSTA+CELQW+LY+L DL + C R  VLYCDNQSA+HIAAN VFHERTK
Sbjct: 146 SRSSFEAEYRELSTAACELQWLLYLLHDLHITCTRAHVLYCDNQSALHIAANLVFHERTK 205

Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287
           HLEIDCHF R K+Q+GVL LLPISSK QLA+FFTK
Sbjct: 206 HLEIDCHFFRNKIQDGVLHLLPISSKEQLANFFTK 240


>GAU47169.1 hypothetical protein TSUD_28920 [Trifolium subterraneum]
          Length = 1086

 Score =  163 bits (412), Expect = 4e-44
 Identities = 74/95 (77%), Positives = 87/95 (91%)
 Frame = +3

Query: 3    SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182
            SRSSSEAEYRALS+ASCELQW+LY+L DL+V C RPPVLYCD+QSA+HIA+NP+FHERTK
Sbjct: 968  SRSSSEAEYRALSSASCELQWLLYLLNDLQVKCTRPPVLYCDSQSAIHIASNPIFHERTK 1027

Query: 183  HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287
            HL+IDCH VREKVQ G+L+LLPIS+  Q+ADF TK
Sbjct: 1028 HLKIDCHLVREKVQKGILKLLPISTNEQVADFLTK 1062


>GAU11490.1 hypothetical protein TSUD_344800 [Trifolium subterraneum]
          Length = 551

 Score =  160 bits (404), Expect = 5e-44
 Identities = 74/95 (77%), Positives = 87/95 (91%)
 Frame = +3

Query: 3   SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182
           SRSSSEAEYRAL+ A+CELQWILY+L DL+V C + PV+YCDNQSA+HIAANPVFHERTK
Sbjct: 412 SRSSSEAEYRALAAATCELQWILYLLKDLQVTCTKLPVIYCDNQSALHIAANPVFHERTK 471

Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287
           HLEIDCH VRE++Q GVL+LLP+ S++QLADFFTK
Sbjct: 472 HLEIDCHIVRERLQAGVLKLLPVLSQNQLADFFTK 506


>ABE88099.1 conserved hypothetical protein [Medicago truncatula]
          Length = 148

 Score =  149 bits (377), Expect = 6e-44
 Identities = 71/93 (76%), Positives = 83/93 (89%)
 Frame = +3

Query: 9   SSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTKHL 188
           SSSEAEYRAL++A+CELQ + Y+L DLKV C +PPVLYCDNQSA++IAANPVFHE TKHL
Sbjct: 5   SSSEAEYRALASATCELQRLTYLLRDLKVNCIKPPVLYCDNQSAIYIAANPVFHECTKHL 64

Query: 189 EIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287
           EIDCH VREK+Q G+ +LLPISSK Q+ADFFTK
Sbjct: 65  EIDCHIVREKLQAGLFKLLPISSKDQVADFFTK 97


>GAU41219.1 hypothetical protein TSUD_128950 [Trifolium subterraneum]
          Length = 539

 Score =  158 bits (400), Expect = 2e-43
 Identities = 73/95 (76%), Positives = 85/95 (89%)
 Frame = +3

Query: 3   SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182
           SRSSSEA+YRALSTA+CEL W+L++L DL   C +PPVLYCD+QSAMHIA+NPVFHERTK
Sbjct: 417 SRSSSEADYRALSTATCELIWLLFLLRDLNTTCSKPPVLYCDSQSAMHIASNPVFHERTK 476

Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287
           HLEIDCH VREKVQ G+L+LLPIS++ QLADF TK
Sbjct: 477 HLEIDCHLVREKVQQGLLKLLPISTQEQLADFLTK 511


>KHN24193.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja] KHN37451.1 Retrovirus-related Pol
           polyprotein from transposon TNT 1-94, partial [Glycine
           soja]
          Length = 234

 Score =  151 bits (381), Expect = 2e-43
 Identities = 69/95 (72%), Positives = 81/95 (85%)
 Frame = +3

Query: 3   SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182
           SRSSSEAEYRALS+A+CELQW+LY+  DL+V   R P LYCDNQSA+HIA+NPVFHERTK
Sbjct: 113 SRSSSEAEYRALSSAACELQWLLYLFADLRVQLTRTPTLYCDNQSAVHIASNPVFHERTK 172

Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287
           HLEIDCH VREK+  G L+LLP+S+  Q+ADF TK
Sbjct: 173 HLEIDCHLVREKLLKGTLKLLPVSTSDQVADFLTK 207


>GAU22921.1 hypothetical protein TSUD_326940 [Trifolium subterraneum]
          Length = 1122

 Score =  159 bits (403), Expect = 7e-43
 Identities = 75/95 (78%), Positives = 84/95 (88%)
 Frame = +3

Query: 3    SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182
            SRSSSEAEYRALS ASCELQW+L++L DL + C R PVLYCDNQSA+HIA+NPVFHERTK
Sbjct: 966  SRSSSEAEYRALSFASCELQWLLFLLRDLGLQCTRAPVLYCDNQSAVHIASNPVFHERTK 1025

Query: 183  HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287
            HLEIDCHFV +KV  G  +LLPISSKSQ+ADFFTK
Sbjct: 1026 HLEIDCHFVHDKVLQGTFKLLPISSKSQIADFFTK 1060


>GAU31823.1 hypothetical protein TSUD_58240 [Trifolium subterraneum]
          Length = 1119

 Score =  159 bits (401), Expect = 1e-42
 Identities = 71/95 (74%), Positives = 86/95 (90%)
 Frame = +3

Query: 3    SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182
            +RSSSEAEYRAL++A+CELQW+LY+L DL V C RPPVLYCD+QSA+HIA+NPVFHERTK
Sbjct: 1004 ARSSSEAEYRALASATCELQWLLYLLQDLNVECSRPPVLYCDSQSAIHIASNPVFHERTK 1063

Query: 183  HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287
            HLEIDCH +REK+Q G+L+LLPIS+  Q+ADF TK
Sbjct: 1064 HLEIDCHLIREKLQKGILKLLPISTNEQVADFLTK 1098


>GAU51775.1 hypothetical protein TSUD_415620 [Trifolium subterraneum]
          Length = 1234

 Score =  157 bits (397), Expect = 5e-42
 Identities = 73/95 (76%), Positives = 86/95 (90%)
 Frame = +3

Query: 3    SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182
            SRSSSEAEYRALS+A+CEL W+L++L DL++ C +PPVLYCD+QSAMHIA+NPVFHERTK
Sbjct: 1112 SRSSSEAEYRALSSATCELIWLLFLLKDLQIECSKPPVLYCDSQSAMHIASNPVFHERTK 1171

Query: 183  HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287
            HLEIDCH VREKVQ G+LRLLPIS++ QLAD  TK
Sbjct: 1172 HLEIDCHLVREKVQQGLLRLLPISTEDQLADCLTK 1206


>KYP65473.1 Copia protein [Cajanus cajan]
          Length = 198

 Score =  145 bits (367), Expect = 8e-42
 Identities = 74/124 (59%), Positives = 90/124 (72%)
 Frame = +3

Query: 3   SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182
           SRSSSEAEYRAL+T++ ELQW+ Y+L DLK+ C +   LYCDNQSA+H A NPVFHERTK
Sbjct: 78  SRSSSEAEYRALATSTYELQWLTYLLTDLKIQCSKSATLYCDNQSALHTATNPVFHERTK 137

Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTKLRLCIQEFLFLFSPSWA*WICTMLQ 362
           HLEIDCH V+EK Q G+++LLP+ S  QLAD FTK    +   LF  + S    + T LQ
Sbjct: 138 HLEIDCHLVQEKAQAGLMKLLPVPSFRQLADIFTK---ALPPRLFHANLSKLEMVDTPLQ 194

Query: 363 LAGG 374
            AGG
Sbjct: 195 FAGG 198


>KYP65404.1 Copia protein, partial [Cajanus cajan]
          Length = 356

 Score =  150 bits (378), Expect = 1e-41
 Identities = 71/95 (74%), Positives = 82/95 (86%)
 Frame = +3

Query: 3   SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182
           SRSSSEAEYRAL+TA+CELQW+ Y+L DLKV   +  +LYCDNQSA+ IAANPVFHERTK
Sbjct: 235 SRSSSEAEYRALATATCELQWLTYLLTDLKVPFSKQAILYCDNQSALQIAANPVFHERTK 294

Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287
           HLEIDCH VREK Q G++RLLP+SS +QLAD FTK
Sbjct: 295 HLEIDCHLVREKNQAGLMRLLPVSSSNQLADMFTK 329


>GAU39523.1 hypothetical protein TSUD_222930 [Trifolium subterraneum]
          Length = 1210

 Score =  156 bits (394), Expect = 1e-41
 Identities = 71/95 (74%), Positives = 84/95 (88%)
 Frame = +3

Query: 3    SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182
            SRSSSEAEYRALS+ +CEL W+L ++ DLK+ C +PPV+YCD+QSAMHIA+NPVFHERTK
Sbjct: 1088 SRSSSEAEYRALSSTTCELIWLLSLINDLKIQCDKPPVIYCDSQSAMHIASNPVFHERTK 1147

Query: 183  HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287
            HLEIDCH VREKVQ G+LRLLPIS++ QLAD  TK
Sbjct: 1148 HLEIDCHLVREKVQQGILRLLPISTQDQLADCLTK 1182


>KYP35345.1 hypothetical protein KK1_043626, partial [Cajanus cajan]
          Length = 296

 Score =  146 bits (369), Expect(2) = 3e-41
 Identities = 69/92 (75%), Positives = 79/92 (85%), Gaps = 3/92 (3%)
 Frame = +3

Query: 21  AEYRALSTASCELQWILYILGDLKVMCHRP---PVLYCDNQSAMHIAANPVFHERTKHLE 191
           +EYR LSTA+CELQW+L++L DL + C      PVLYCDNQ+A+HIAANPVFHERTKHLE
Sbjct: 109 SEYRELSTAACELQWLLFLLHDLHITCSLQEIAPVLYCDNQNALHIAANPVFHERTKHLE 168

Query: 192 IDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287
           IDCHFVR K+Q GV+RLLPISSK QLADFFTK
Sbjct: 169 IDCHFVRTKLQEGVMRLLPISSKEQLADFFTK 200



 Score = 49.3 bits (116), Expect(2) = 3e-41
 Identities = 20/28 (71%), Positives = 24/28 (85%)
 Frame = +2

Query: 290 KALHPGVFVPFLSKLGMMDLYHAPACRG 373
           KAL P +F PF+SKLGM+D+YHAPAC G
Sbjct: 200 KALPPPIFTPFISKLGMIDIYHAPACGG 227


>GAU31202.1 hypothetical protein TSUD_210590 [Trifolium subterraneum]
          Length = 1059

 Score =  154 bits (390), Expect = 4e-41
 Identities = 70/95 (73%), Positives = 85/95 (89%)
 Frame = +3

Query: 3    SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182
            SRSSSEAEYRALSTA+CEL W+ +++ DL + C +PPV+YCD+QSAMHIA+NPVFHERTK
Sbjct: 937  SRSSSEAEYRALSTATCELIWLTFLMKDLNIHCSKPPVIYCDSQSAMHIASNPVFHERTK 996

Query: 183  HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287
            HLEI+CHFVREK+Q G+LRLLPIS++ QLAD  TK
Sbjct: 997  HLEIECHFVREKLQQGLLRLLPISTEDQLADCLTK 1031


>KHN03110.1 Copia protein, partial [Glycine soja]
          Length = 245

 Score =  145 bits (366), Expect = 4e-41
 Identities = 68/95 (71%), Positives = 80/95 (84%)
 Frame = +3

Query: 3   SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182
           SRSSSEAEYRAL+T +CELQW+ Y+L DLK+ C +  VLYCDNQSA++IAANPVFHERTK
Sbjct: 119 SRSSSEAEYRALATNTCELQWLSYLLDDLKITCTKSTVLYCDNQSALYIAANPVFHERTK 178

Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287
           HLEIDCH +REK Q G++ LLP+ S  QLAD FTK
Sbjct: 179 HLEIDCHLIREKSQAGLMCLLPVPSCYQLADMFTK 213


>KYP54539.1 Copia protein [Cajanus cajan]
          Length = 234

 Score =  145 bits (365), Expect = 5e-41
 Identities = 67/107 (62%), Positives = 88/107 (82%)
 Frame = +3

Query: 3   SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182
           SRSSSEAEYRA+++  CELQW+ ++L D+ +   +P VLYCDN+SA+HIAANPVFHERTK
Sbjct: 119 SRSSSEAEYRAMASVVCELQWLTFLLKDIGISFIQPAVLYCDNKSALHIAANPVFHERTK 178

Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTKLRLCIQEFLFLF 323
           H+EIDCH +REKVQNG+++LLP++S +QLAD +TK  L    F FL+
Sbjct: 179 HIEIDCHIIREKVQNGLVKLLPVTSPNQLADIYTK-ALSPAAFKFLY 224


>ABD32333.1 polyprotein-like, putative [Medicago truncatula]
          Length = 635

 Score =  152 bits (385), Expect = 7e-41
 Identities = 71/108 (65%), Positives = 90/108 (83%)
 Frame = +3

Query: 3   SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182
           S+SSSEAEYRA+++A+CE+QW+LY+L DL+V C + PVLYCDNQSAMHIA+NPVFHERTK
Sbjct: 492 SKSSSEAEYRAMASATCEMQWLLYLLRDLQVQCVQLPVLYCDNQSAMHIASNPVFHERTK 551

Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTKLRLCIQEFLFLFS 326
           HLEIDCH VREK+Q G+ +LLP+++  Q+ D FTK  L +Q F  L S
Sbjct: 552 HLEIDCHIVREKLQAGIFKLLPVTTHDQIGDSFTK-ALYLQPFSLLLS 598


Top