BLASTX nr result
ID: Glycyrrhiza32_contig00034237
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza32_contig00034237 (391 letters) Database: ./nr 115,041,592 sequences; 42,171,959,267 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value KHN31995.1 Copia protein, partial [Glycine soja] 168 3e-50 KYP50660.1 Retrovirus-related Pol polyprotein from transposon TN... 154 9e-46 GAU46782.1 hypothetical protein TSUD_351810 [Trifolium subterran... 166 3e-45 KYP46147.1 Copia protein, partial [Cajanus cajan] 155 2e-44 GAU47169.1 hypothetical protein TSUD_28920 [Trifolium subterraneum] 163 4e-44 GAU11490.1 hypothetical protein TSUD_344800 [Trifolium subterran... 160 5e-44 ABE88099.1 conserved hypothetical protein [Medicago truncatula] 149 6e-44 GAU41219.1 hypothetical protein TSUD_128950 [Trifolium subterran... 158 2e-43 KHN24193.1 Retrovirus-related Pol polyprotein from transposon TN... 151 2e-43 GAU22921.1 hypothetical protein TSUD_326940 [Trifolium subterran... 159 7e-43 GAU31823.1 hypothetical protein TSUD_58240 [Trifolium subterraneum] 159 1e-42 GAU51775.1 hypothetical protein TSUD_415620 [Trifolium subterran... 157 5e-42 KYP65473.1 Copia protein [Cajanus cajan] 145 8e-42 KYP65404.1 Copia protein, partial [Cajanus cajan] 150 1e-41 GAU39523.1 hypothetical protein TSUD_222930 [Trifolium subterran... 156 1e-41 KYP35345.1 hypothetical protein KK1_043626, partial [Cajanus cajan] 146 3e-41 GAU31202.1 hypothetical protein TSUD_210590 [Trifolium subterran... 154 4e-41 KHN03110.1 Copia protein, partial [Glycine soja] 145 4e-41 KYP54539.1 Copia protein [Cajanus cajan] 145 5e-41 ABD32333.1 polyprotein-like, putative [Medicago truncatula] 152 7e-41 >KHN31995.1 Copia protein, partial [Glycine soja] Length = 224 Score = 168 bits (425), Expect = 3e-50 Identities = 78/96 (81%), Positives = 86/96 (89%) Frame = +3 Query: 3 SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182 SRSSSEAEYRALST +CELQW+LY+L DL + C R P LYCDNQSA+HIAANP+FHERTK Sbjct: 104 SRSSSEAEYRALSTTACELQWLLYLLHDLHITCTRAPALYCDNQSALHIAANPMFHERTK 163 Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTKL 290 HLEIDCHFVR K+Q GVLRLLPISSK QLADFFTK+ Sbjct: 164 HLEIDCHFVRNKIQEGVLRLLPISSKEQLADFFTKV 199 >KYP50660.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 138 Score = 154 bits (388), Expect = 9e-46 Identities = 78/112 (69%), Positives = 88/112 (78%) Frame = +3 Query: 3 SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182 SRSSS+AEY ALSTA CELQW+LY+L DL + C R PVLYCDNQS +HIAAN +FHERTK Sbjct: 29 SRSSSKAEYSALSTAICELQWLLYLLHDLLITCTRAPVLYCDNQSDLHIAANRLFHERTK 88 Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTKLRLCIQEFLFLFSPSWA 338 HLEIDC FVR K+Q+GVLRLLPISSK +LA+ FTK LF PSWA Sbjct: 89 HLEIDCDFVRNKIQDGVLRLLPISSKEKLANCFTK--ALPLHLLFRSFPSWA 138 >GAU46782.1 hypothetical protein TSUD_351810 [Trifolium subterraneum] Length = 1512 Score = 166 bits (421), Expect = 3e-45 Identities = 77/95 (81%), Positives = 88/95 (92%) Frame = +3 Query: 3 SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182 SRSSSEAEYR+LS ASCELQWI+Y+L DL + C RPPVLYCDNQSA+HIA+NPVFHERTK Sbjct: 1379 SRSSSEAEYRSLSFASCELQWIVYLLKDLSIDCERPPVLYCDNQSAIHIASNPVFHERTK 1438 Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287 HLEIDCH VR+KVQ+GV +LLPIS+K+QLADFFTK Sbjct: 1439 HLEIDCHLVRDKVQSGVFKLLPISTKAQLADFFTK 1473 >KYP46147.1 Copia protein, partial [Cajanus cajan] Length = 285 Score = 155 bits (391), Expect = 2e-44 Identities = 74/95 (77%), Positives = 82/95 (86%) Frame = +3 Query: 3 SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182 SRSS EAEYR LSTA+CELQW+LY+L DL + C R VLYCDNQSA+HIAAN VFHERTK Sbjct: 146 SRSSFEAEYRELSTAACELQWLLYLLHDLHITCTRAHVLYCDNQSALHIAANLVFHERTK 205 Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287 HLEIDCHF R K+Q+GVL LLPISSK QLA+FFTK Sbjct: 206 HLEIDCHFFRNKIQDGVLHLLPISSKEQLANFFTK 240 >GAU47169.1 hypothetical protein TSUD_28920 [Trifolium subterraneum] Length = 1086 Score = 163 bits (412), Expect = 4e-44 Identities = 74/95 (77%), Positives = 87/95 (91%) Frame = +3 Query: 3 SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182 SRSSSEAEYRALS+ASCELQW+LY+L DL+V C RPPVLYCD+QSA+HIA+NP+FHERTK Sbjct: 968 SRSSSEAEYRALSSASCELQWLLYLLNDLQVKCTRPPVLYCDSQSAIHIASNPIFHERTK 1027 Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287 HL+IDCH VREKVQ G+L+LLPIS+ Q+ADF TK Sbjct: 1028 HLKIDCHLVREKVQKGILKLLPISTNEQVADFLTK 1062 >GAU11490.1 hypothetical protein TSUD_344800 [Trifolium subterraneum] Length = 551 Score = 160 bits (404), Expect = 5e-44 Identities = 74/95 (77%), Positives = 87/95 (91%) Frame = +3 Query: 3 SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182 SRSSSEAEYRAL+ A+CELQWILY+L DL+V C + PV+YCDNQSA+HIAANPVFHERTK Sbjct: 412 SRSSSEAEYRALAAATCELQWILYLLKDLQVTCTKLPVIYCDNQSALHIAANPVFHERTK 471 Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287 HLEIDCH VRE++Q GVL+LLP+ S++QLADFFTK Sbjct: 472 HLEIDCHIVRERLQAGVLKLLPVLSQNQLADFFTK 506 >ABE88099.1 conserved hypothetical protein [Medicago truncatula] Length = 148 Score = 149 bits (377), Expect = 6e-44 Identities = 71/93 (76%), Positives = 83/93 (89%) Frame = +3 Query: 9 SSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTKHL 188 SSSEAEYRAL++A+CELQ + Y+L DLKV C +PPVLYCDNQSA++IAANPVFHE TKHL Sbjct: 5 SSSEAEYRALASATCELQRLTYLLRDLKVNCIKPPVLYCDNQSAIYIAANPVFHECTKHL 64 Query: 189 EIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287 EIDCH VREK+Q G+ +LLPISSK Q+ADFFTK Sbjct: 65 EIDCHIVREKLQAGLFKLLPISSKDQVADFFTK 97 >GAU41219.1 hypothetical protein TSUD_128950 [Trifolium subterraneum] Length = 539 Score = 158 bits (400), Expect = 2e-43 Identities = 73/95 (76%), Positives = 85/95 (89%) Frame = +3 Query: 3 SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182 SRSSSEA+YRALSTA+CEL W+L++L DL C +PPVLYCD+QSAMHIA+NPVFHERTK Sbjct: 417 SRSSSEADYRALSTATCELIWLLFLLRDLNTTCSKPPVLYCDSQSAMHIASNPVFHERTK 476 Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287 HLEIDCH VREKVQ G+L+LLPIS++ QLADF TK Sbjct: 477 HLEIDCHLVREKVQQGLLKLLPISTQEQLADFLTK 511 >KHN24193.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] KHN37451.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 234 Score = 151 bits (381), Expect = 2e-43 Identities = 69/95 (72%), Positives = 81/95 (85%) Frame = +3 Query: 3 SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182 SRSSSEAEYRALS+A+CELQW+LY+ DL+V R P LYCDNQSA+HIA+NPVFHERTK Sbjct: 113 SRSSSEAEYRALSSAACELQWLLYLFADLRVQLTRTPTLYCDNQSAVHIASNPVFHERTK 172 Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287 HLEIDCH VREK+ G L+LLP+S+ Q+ADF TK Sbjct: 173 HLEIDCHLVREKLLKGTLKLLPVSTSDQVADFLTK 207 >GAU22921.1 hypothetical protein TSUD_326940 [Trifolium subterraneum] Length = 1122 Score = 159 bits (403), Expect = 7e-43 Identities = 75/95 (78%), Positives = 84/95 (88%) Frame = +3 Query: 3 SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182 SRSSSEAEYRALS ASCELQW+L++L DL + C R PVLYCDNQSA+HIA+NPVFHERTK Sbjct: 966 SRSSSEAEYRALSFASCELQWLLFLLRDLGLQCTRAPVLYCDNQSAVHIASNPVFHERTK 1025 Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287 HLEIDCHFV +KV G +LLPISSKSQ+ADFFTK Sbjct: 1026 HLEIDCHFVHDKVLQGTFKLLPISSKSQIADFFTK 1060 >GAU31823.1 hypothetical protein TSUD_58240 [Trifolium subterraneum] Length = 1119 Score = 159 bits (401), Expect = 1e-42 Identities = 71/95 (74%), Positives = 86/95 (90%) Frame = +3 Query: 3 SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182 +RSSSEAEYRAL++A+CELQW+LY+L DL V C RPPVLYCD+QSA+HIA+NPVFHERTK Sbjct: 1004 ARSSSEAEYRALASATCELQWLLYLLQDLNVECSRPPVLYCDSQSAIHIASNPVFHERTK 1063 Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287 HLEIDCH +REK+Q G+L+LLPIS+ Q+ADF TK Sbjct: 1064 HLEIDCHLIREKLQKGILKLLPISTNEQVADFLTK 1098 >GAU51775.1 hypothetical protein TSUD_415620 [Trifolium subterraneum] Length = 1234 Score = 157 bits (397), Expect = 5e-42 Identities = 73/95 (76%), Positives = 86/95 (90%) Frame = +3 Query: 3 SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182 SRSSSEAEYRALS+A+CEL W+L++L DL++ C +PPVLYCD+QSAMHIA+NPVFHERTK Sbjct: 1112 SRSSSEAEYRALSSATCELIWLLFLLKDLQIECSKPPVLYCDSQSAMHIASNPVFHERTK 1171 Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287 HLEIDCH VREKVQ G+LRLLPIS++ QLAD TK Sbjct: 1172 HLEIDCHLVREKVQQGLLRLLPISTEDQLADCLTK 1206 >KYP65473.1 Copia protein [Cajanus cajan] Length = 198 Score = 145 bits (367), Expect = 8e-42 Identities = 74/124 (59%), Positives = 90/124 (72%) Frame = +3 Query: 3 SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182 SRSSSEAEYRAL+T++ ELQW+ Y+L DLK+ C + LYCDNQSA+H A NPVFHERTK Sbjct: 78 SRSSSEAEYRALATSTYELQWLTYLLTDLKIQCSKSATLYCDNQSALHTATNPVFHERTK 137 Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTKLRLCIQEFLFLFSPSWA*WICTMLQ 362 HLEIDCH V+EK Q G+++LLP+ S QLAD FTK + LF + S + T LQ Sbjct: 138 HLEIDCHLVQEKAQAGLMKLLPVPSFRQLADIFTK---ALPPRLFHANLSKLEMVDTPLQ 194 Query: 363 LAGG 374 AGG Sbjct: 195 FAGG 198 >KYP65404.1 Copia protein, partial [Cajanus cajan] Length = 356 Score = 150 bits (378), Expect = 1e-41 Identities = 71/95 (74%), Positives = 82/95 (86%) Frame = +3 Query: 3 SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182 SRSSSEAEYRAL+TA+CELQW+ Y+L DLKV + +LYCDNQSA+ IAANPVFHERTK Sbjct: 235 SRSSSEAEYRALATATCELQWLTYLLTDLKVPFSKQAILYCDNQSALQIAANPVFHERTK 294 Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287 HLEIDCH VREK Q G++RLLP+SS +QLAD FTK Sbjct: 295 HLEIDCHLVREKNQAGLMRLLPVSSSNQLADMFTK 329 >GAU39523.1 hypothetical protein TSUD_222930 [Trifolium subterraneum] Length = 1210 Score = 156 bits (394), Expect = 1e-41 Identities = 71/95 (74%), Positives = 84/95 (88%) Frame = +3 Query: 3 SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182 SRSSSEAEYRALS+ +CEL W+L ++ DLK+ C +PPV+YCD+QSAMHIA+NPVFHERTK Sbjct: 1088 SRSSSEAEYRALSSTTCELIWLLSLINDLKIQCDKPPVIYCDSQSAMHIASNPVFHERTK 1147 Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287 HLEIDCH VREKVQ G+LRLLPIS++ QLAD TK Sbjct: 1148 HLEIDCHLVREKVQQGILRLLPISTQDQLADCLTK 1182 >KYP35345.1 hypothetical protein KK1_043626, partial [Cajanus cajan] Length = 296 Score = 146 bits (369), Expect(2) = 3e-41 Identities = 69/92 (75%), Positives = 79/92 (85%), Gaps = 3/92 (3%) Frame = +3 Query: 21 AEYRALSTASCELQWILYILGDLKVMCHRP---PVLYCDNQSAMHIAANPVFHERTKHLE 191 +EYR LSTA+CELQW+L++L DL + C PVLYCDNQ+A+HIAANPVFHERTKHLE Sbjct: 109 SEYRELSTAACELQWLLFLLHDLHITCSLQEIAPVLYCDNQNALHIAANPVFHERTKHLE 168 Query: 192 IDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287 IDCHFVR K+Q GV+RLLPISSK QLADFFTK Sbjct: 169 IDCHFVRTKLQEGVMRLLPISSKEQLADFFTK 200 Score = 49.3 bits (116), Expect(2) = 3e-41 Identities = 20/28 (71%), Positives = 24/28 (85%) Frame = +2 Query: 290 KALHPGVFVPFLSKLGMMDLYHAPACRG 373 KAL P +F PF+SKLGM+D+YHAPAC G Sbjct: 200 KALPPPIFTPFISKLGMIDIYHAPACGG 227 >GAU31202.1 hypothetical protein TSUD_210590 [Trifolium subterraneum] Length = 1059 Score = 154 bits (390), Expect = 4e-41 Identities = 70/95 (73%), Positives = 85/95 (89%) Frame = +3 Query: 3 SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182 SRSSSEAEYRALSTA+CEL W+ +++ DL + C +PPV+YCD+QSAMHIA+NPVFHERTK Sbjct: 937 SRSSSEAEYRALSTATCELIWLTFLMKDLNIHCSKPPVIYCDSQSAMHIASNPVFHERTK 996 Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287 HLEI+CHFVREK+Q G+LRLLPIS++ QLAD TK Sbjct: 997 HLEIECHFVREKLQQGLLRLLPISTEDQLADCLTK 1031 >KHN03110.1 Copia protein, partial [Glycine soja] Length = 245 Score = 145 bits (366), Expect = 4e-41 Identities = 68/95 (71%), Positives = 80/95 (84%) Frame = +3 Query: 3 SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182 SRSSSEAEYRAL+T +CELQW+ Y+L DLK+ C + VLYCDNQSA++IAANPVFHERTK Sbjct: 119 SRSSSEAEYRALATNTCELQWLSYLLDDLKITCTKSTVLYCDNQSALYIAANPVFHERTK 178 Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTK 287 HLEIDCH +REK Q G++ LLP+ S QLAD FTK Sbjct: 179 HLEIDCHLIREKSQAGLMCLLPVPSCYQLADMFTK 213 >KYP54539.1 Copia protein [Cajanus cajan] Length = 234 Score = 145 bits (365), Expect = 5e-41 Identities = 67/107 (62%), Positives = 88/107 (82%) Frame = +3 Query: 3 SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182 SRSSSEAEYRA+++ CELQW+ ++L D+ + +P VLYCDN+SA+HIAANPVFHERTK Sbjct: 119 SRSSSEAEYRAMASVVCELQWLTFLLKDIGISFIQPAVLYCDNKSALHIAANPVFHERTK 178 Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTKLRLCIQEFLFLF 323 H+EIDCH +REKVQNG+++LLP++S +QLAD +TK L F FL+ Sbjct: 179 HIEIDCHIIREKVQNGLVKLLPVTSPNQLADIYTK-ALSPAAFKFLY 224 >ABD32333.1 polyprotein-like, putative [Medicago truncatula] Length = 635 Score = 152 bits (385), Expect = 7e-41 Identities = 71/108 (65%), Positives = 90/108 (83%) Frame = +3 Query: 3 SRSSSEAEYRALSTASCELQWILYILGDLKVMCHRPPVLYCDNQSAMHIAANPVFHERTK 182 S+SSSEAEYRA+++A+CE+QW+LY+L DL+V C + PVLYCDNQSAMHIA+NPVFHERTK Sbjct: 492 SKSSSEAEYRAMASATCEMQWLLYLLRDLQVQCVQLPVLYCDNQSAMHIASNPVFHERTK 551 Query: 183 HLEIDCHFVREKVQNGVLRLLPISSKSQLADFFTKLRLCIQEFLFLFS 326 HLEIDCH VREK+Q G+ +LLP+++ Q+ D FTK L +Q F L S Sbjct: 552 HLEIDCHIVREKLQAGIFKLLPVTTHDQIGDSFTK-ALYLQPFSLLLS 598