BLASTX nr result
ID: Glycyrrhiza35_contig00039066
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza35_contig00039066
         (297 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

GAU46782.1 hypothetical protein TSUD_351810 [Trifolium subterran...   92   1e-19
AJY78067.1 putative polyprotein [Glycine max]                         91   3e-19
KHN05285.1 Retrovirus-related Pol polyprotein from transposon TN...   91   3e-19
GAU47513.1 hypothetical protein TSUD_138850 [Trifolium subterran...   91   4e-19
GAU44842.1 hypothetical protein TSUD_400430 [Trifolium subterran...   89   1e-18
GAU41868.1 hypothetical protein TSUD_366150 [Trifolium subterran...   89   2e-18
GAU31769.1 hypothetical protein TSUD_22150 [Trifolium subterraneum]   86   2e-17
GAU39523.1 hypothetical protein TSUD_222930 [Trifolium subterran...   81   7e-16
KYP78542.1 hypothetical protein KK1_048519 [Cajanus cajan]            74   4e-15
GAU24572.1 hypothetical protein TSUD_149130 [Trifolium subterran...   78   1e-14
KYP68633.1 Retrovirus-related Pol polyprotein from transposon TN...   77   2e-14
GAU31823.1 hypothetical protein TSUD_58240 [Trifolium subterraneum]   77   2e-14
GAU13996.1 hypothetical protein TSUD_168210 [Trifolium subterran...   75   1e-13
GAU45704.1 hypothetical protein TSUD_86800 [Trifolium subterraneum]   75   1e-13
GAU31820.1 hypothetical protein TSUD_58210 [Trifolium subterraneum]   74   2e-13
GAU15708.1 hypothetical protein TSUD_307180 [Trifolium subterran...   73   7e-13
KYP36109.1 Retrovirus-related Pol polyprotein from transposon TN...   72   1e-12
XP_014632547.1 PREDICTED: uncharacterized protein LOC106799099 [...   70   8e-12
KYP56844.1 Retrovirus-related Pol polyprotein from transposon TN...   67   1e-11
KYP65402.1 Retrovirus-related Pol polyprotein from transposon TN...   69   2e-11

>GAU46782.1 hypothetical protein TSUD_351810 [Trifolium subterraneum] Length = 1512 Score = 92.0 bits (227), Expect = 1e-19 Identities = 48/96 (50%), Positives = 60/96 (62%), Gaps = 4/96 (4%) Frame = -1 Query: 279 QEQKSHRMIGLGEQCQGLYYLXXXXXXXXXXXXVH-GPD---FIPAEALWHFKLGHLSNN 112 QE+KS +MIG GE +GLYYL + P IP +ALWHF+LGHLS+ Sbjct: 489 QEKKSLKMIGSGELIEGLYYLTNKPQPVSANSSISINPSSNIHIPKQALWHFRLGHLSHA 548 Query: 111 RWSLVNSQFPFVHVDKKGVCDVCHFARHKKSSYTLS 4 R L+ S FPFV +D+ VCD+CH ARHKK +Y LS Sbjct: 549 RLLLMQSSFPFVTIDEHAVCDICHLARHKKLTYKLS 584 >AJY78067.1 putative polyprotein [Glycine max] Length = 886 Score = 90.9 bits (224), Expect = 3e-19 Identities = 46/99 (46%), Positives = 59/99 (59%), Gaps = 7/99 (7%) Frame = -1 Query: 279 QEQKSHRMIGLGEQCQGLYYLXXXXXXXXXXXXVHGPDF-------IPAEALWHFKLGHL 121 QEQKS +MIGLGE GLYYL F IP A+WHF+LGHL Sbjct: 480 QEQKSLKMIGLGESRDGLYYLTQTNKECASSNYNISSIFSSANNVHIPENAIWHFRLGHL 539 Query: 120 SNNRWSLVNSQFPFVHVDKKGVCDVCHFARHKKSSYTLS 4 S++R +L++SQFPF+ D VCD+CHFA+H+K + S Sbjct: 540 SSSRIALLHSQFPFIVNDSSSVCDICHFAKHRKLPFVHS 578 >KHN05285.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 1346 Score = 90.9 bits (224), Expect = 3e-19 Identities = 46/99 (46%), Positives = 59/99 (59%), Gaps = 7/99 (7%) Frame = -1 Query: 279 QEQKSHRMIGLGEQCQGLYYLXXXXXXXXXXXXVHGPDF-------IPAEALWHFKLGHL 121 QEQKS +MIGLGE GLYYL F IP A+WHF+LGHL Sbjct: 384 QEQKSLKMIGLGESRDGLYYLTQTNKECASSNYNISSIFSSANNVHIPENAIWHFRLGHL 443 Query: 120 SNNRWSLVNSQFPFVHVDKKGVCDVCHFARHKKSSYTLS 4 S++R +L++SQFPF+ D VCD+CHFA+H+K + S Sbjct: 444 SSSRIALLHSQFPFIVNDSSSVCDICHFAKHRKLPFVHS 482 >GAU47513.1 hypothetical protein TSUD_138850 [Trifolium subterraneum] Length = 1469 Score = 90.5 bits (223), Expect = 4e-19 Identities = 46/97 (47%), Positives = 58/97 (59%), Gaps = 5/97 (5%) Frame = -1 Query: 279 QEQKSHRMIGLGEQCQGLYYLXXXXXXXXXXXXVHGPDF-----IPAEALWHFKLGHLSN 115 Q+Q + RMIG + +GLYYL H D IP 
+ALWHF+LGH SN Sbjct: 455 QDQLTKRMIGFADLIEGLYYLTLTSKEVHA----HSIDGTQHTNIPDQALWHFRLGHTSN 510 Query: 114 NRWSLVNSQFPFVHVDKKGVCDVCHFARHKKSSYTLS 4 + SL+ S FPF+ VD KGVCD+CH A+HKK Y +S Sbjct: 511 TKMSLLQSVFPFITVDNKGVCDICHLAKHKKLYYKVS 547 >GAU44842.1 hypothetical protein TSUD_400430 [Trifolium subterraneum] Length = 1289 Score = 89.0 bits (219), Expect = 1e-18 Identities = 44/93 (47%), Positives = 53/93 (56%), Gaps = 1/93 (1%) Frame = -1 Query: 279 QEQKSHRMIGLGEQCQGLYYLXXXXXXXXXXXXVHGPDFIPAE-ALWHFKLGHLSNNRWS 103 Q+ + RMIG E GLYYL E ALWHF+LGH+S NR Sbjct: 424 QDHPTKRMIGFAEMIDGLYYLKLTSKDAHAYAVDSSDKSTILEPALWHFRLGHISLNRMH 483 Query: 102 LVNSQFPFVHVDKKGVCDVCHFARHKKSSYTLS 4 L++SQFPF+ +D KGVCD+CH ARHKK Y S Sbjct: 484 LLHSQFPFITLDNKGVCDICHLARHKKIPYNTS 516 >GAU41868.1 hypothetical protein TSUD_366150 [Trifolium subterraneum] Length = 792 Score = 88.6 bits (218), Expect = 2e-18 Identities = 43/93 (46%), Positives = 55/93 (59%), Gaps = 1/93 (1%) Frame = -1 Query: 279 QEQKSHRMIGLGEQCQGLYYLXXXXXXXXXXXXVHGPD-FIPAEALWHFKLGHLSNNRWS 103 Q+ + RMIG + +GLYYL + IP ALWHF+LGH+S +R Sbjct: 461 QDSLTKRMIGFVDMREGLYYLNLTNKDVHAYTADGSHNTHIPEPALWHFRLGHMSFSRMQ 520 Query: 102 LVNSQFPFVHVDKKGVCDVCHFARHKKSSYTLS 4 L+ SQFPF+ VD K VCD+CH ARHKK Y +S Sbjct: 521 LLKSQFPFISVDNKSVCDICHLARHKKIPYNIS 553 >GAU31769.1 hypothetical protein TSUD_22150 [Trifolium subterraneum] Length = 1372 Score = 85.5 bits (210), Expect = 2e-17 Identities = 45/97 (46%), Positives = 57/97 (58%), Gaps = 5/97 (5%) Frame = -1 Query: 279 QEQKSHRMIGLGEQCQGLYYLXXXXXXXXXXXXVHGPDF-----IPAEALWHFKLGHLSN 115 Q+Q + RMIG + +GLYYL H D IP +ALW F+LGH SN Sbjct: 414 QDQLTKRMIGSADLIEGLYYLTLTSEEVHA----HNIDGTQHTNIPDQALWLFRLGHTSN 469 Query: 114 NRWSLVNSQFPFVHVDKKGVCDVCHFARHKKSSYTLS 4 + SL+ S FPF+ VD KGVCD+CH A+HKK Y +S Sbjct: 470 TKMSLLQSVFPFIAVDNKGVCDICHLAKHKKLYYKVS 506 >GAU39523.1 hypothetical protein TSUD_222930 [Trifolium subterraneum] Length = 1210 Score = 81.3 bits (199), Expect = 7e-16 Identities = 38/86 (44%), Positives = 51/86 (59%) Frame = -1 Query: 
279 QEQKSHRMIGLGEQCQGLYYLXXXXXXXXXXXXVHGPDFIPAEALWHFKLGHLSNNRWSL 100 Q+ + MIG + +GLYYL + IP +ALWHF+LGH S +R Sbjct: 416 QDTATKMMIGFADGIEGLYYLVLQDDEVHIHAAEGSDNIIPNQALWHFRLGHPSLSRLHS 475 Query: 99 VNSQFPFVHVDKKGVCDVCHFARHKK 22 ++S+FP++ VD KGVCDVCH AR KK Sbjct: 476 LHSKFPYITVDDKGVCDVCHLARQKK 501 >KYP78542.1 hypothetical protein KK1_048519 [Cajanus cajan] Length = 112 Score = 74.3 bits (181), Expect = 4e-15 Identities = 38/92 (41%), Positives = 52/92 (56%), Gaps = 2/92 (2%) Frame = -1 Query: 270 KSHRMIGLGEQCQGLYYLXXXXXXXXXXXXVHG--PDFIPAEALWHFKLGHLSNNRWSLV 97 KS RMIGL +Q GLY L V + IP +LWHF+LGHLS+ R + Sbjct: 2 KSTRMIGLAKQVGGLYLLKAKTQEKMAEVQVSNITTESIPESSLWHFRLGHLSHERLETM 61 Query: 96 NSQFPFVHVDKKGVCDVCHFARHKKSSYTLSK 1 + + P + ++K VCD+CH A+ KK Y +SK Sbjct: 62 SRENPIIFINKYAVCDICHLAKKKKLPYLMSK 93 >GAU24572.1 hypothetical protein TSUD_149130 [Trifolium subterraneum] Length = 1067 Score = 77.8 bits (190), Expect = 1e-14 Identities = 38/94 (40%), Positives = 57/94 (60%), Gaps = 2/94 (2%) Frame = -1 Query: 279 QEQKSHRMIGLGEQCQGLYYLXXXXXXXXXXXXVHGPDF--IPAEALWHFKLGHLSNNRW 106 Q+QKS RMIG ++ +GLYY+ G ++ IP A+WH ++GH S++R Sbjct: 325 QDQKSLRMIGSADEHEGLYYVDLTDNIAHVATF-EGTNYPTIPKSAIWHSRIGHPSHHRL 383 Query: 105 SLVNSQFPFVHVDKKGVCDVCHFARHKKSSYTLS 4 ++ +FP+V D+ G+CD+CH ARHKK Y S Sbjct: 384 VSLHEKFPYVTSDQGGICDICHLARHKKLPYKTS 417 >KYP68633.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1108 Score = 77.4 bits (189), Expect = 2e-14 Identities = 39/95 (41%), Positives = 54/95 (56%), Gaps = 2/95 (2%) Frame = -1 Query: 279 QEQKSHRMIGLGEQCQGLYYLXXXXXXXXXXXXVHG--PDFIPAEALWHFKLGHLSNNRW 106 Q+ KS RMIGL +Q GLY L V + IP +LWHF+LGHLS+ R Sbjct: 447 QDMKSTRMIGLAKQVGGLYLLKAKTQEKMAEVQVSNITTESIPESSLWHFRLGHLSHERL 506 Query: 105 SLVNSQFPFVHVDKKGVCDVCHFARHKKSSYTLSK 1 ++ + P + ++K VCD+CH A+ KK Y +SK Sbjct: 507 ETMSRENPIIFINKDAVCDICHLAKKKKLPYLMSK 541 >GAU31823.1 hypothetical protein TSUD_58240 [Trifolium subterraneum] Length = 1119 Score = 77.4 bits (189), Expect = 2e-14 
Identities = 38/93 (40%), Positives = 55/93 (59%) Frame = -1 Query: 282 RQEQKSHRMIGLGEQCQGLYYLXXXXXXXXXXXXVHGPDFIPAEALWHFKLGHLSNNRWS 103 +Q + + +MIG E+ + LYYL +P ALWHF+LGH+S +R S Sbjct: 361 QQAKDNQKMIGSAEKFEDLYYLNLQDKETHVHNVSTSKT-LPTSALWHFRLGHISASRMS 419 Query: 102 LVNSQFPFVHVDKKGVCDVCHFARHKKSSYTLS 4 L+ S FP + VD K CDVCHFA+ +K S+++S Sbjct: 420 LMFSDFPSMVVDHKATCDVCHFAKQRKLSFSVS 452 >GAU13996.1 hypothetical protein TSUD_168210 [Trifolium subterraneum] Length = 684 Score = 74.7 bits (182), Expect = 1e-13 Identities = 39/90 (43%), Positives = 53/90 (58%), Gaps = 2/90 (2%) Frame = -1 Query: 276 EQKSHRMIGLGEQCQGLYYLXXXXXXXXXXXXVHGPDF--IPAEALWHFKLGHLSNNRWS 103 + ++ +MIG + GLYYL G + IP +ALWHF+LGH S++R Sbjct: 309 QDQTMKMIGFANEHGGLYYLNLTTKNASVSAID-GSSYPSIPTKALWHFRLGHPSHSRLV 367 Query: 102 LVNSQFPFVHVDKKGVCDVCHFARHKKSSY 13 + S+ P+V VD+ GVCDVCH ARHKK Y Sbjct: 368 SLKSKCPYVVVDQGGVCDVCHLARHKKLPY 397 >GAU45704.1 hypothetical protein TSUD_86800 [Trifolium subterraneum] Length = 902 Score = 74.7 bits (182), Expect = 1e-13 Identities = 38/86 (44%), Positives = 51/86 (59%) Frame = -1 Query: 279 QEQKSHRMIGLGEQCQGLYYLXXXXXXXXXXXXVHGPDFIPAEALWHFKLGHLSNNRWSL 100 QE++S RMIG EQ + LYYL + + ALWHF+LGHLS +R Sbjct: 486 QERESKRMIGSAEQIEDLYYLALQTKEVHASNV--STNSLLDSALWHFRLGHLSTSRMIS 543 Query: 99 VNSQFPFVHVDKKGVCDVCHFARHKK 22 + S+FPFV+VD VCDVC +A+ +K Sbjct: 544 MRSEFPFVNVDNNVVCDVCKYAKQRK 569 >GAU31820.1 hypothetical protein TSUD_58210 [Trifolium subterraneum] Length = 1409 Score = 74.3 bits (181), Expect = 2e-13 Identities = 39/103 (37%), Positives = 52/103 (50%), Gaps = 11/103 (10%) Frame = -1 Query: 279 QEQKSHRMIGLGEQCQGLYYLXXXXXXXXXXXXVH-----------GPDFIPAEALWHFK 133 Q+ + +MIGLG+Q GLY L IP ALWHF+ Sbjct: 481 QDMITKKMIGLGDQVDGLYRLQYNHTFLASQALPQSFPKSINNVAVNSVVIPVSALWHFR 540 Query: 132 LGHLSNNRWSLVNSQFPFVHVDKKGVCDVCHFARHKKSSYTLS 4 LGH+SNNR ++ +PF+ +D K VCDVC FAR +K + S Sbjct: 541 LGHVSNNRLLRMSKLYPFLSIDNKAVCDVCQFARKRKLPFNSS 583 >GAU15708.1 hypothetical protein TSUD_307180 [Trifolium subterraneum] 
Length = 1433 Score = 72.8 bits (177), Expect = 7e-13 Identities = 37/91 (40%), Positives = 49/91 (53%), Gaps = 5/91 (5%) Frame = -1 Query: 261 RMIGLGEQCQGLYYLXXXXXXXXXXXXVHGPDF-----IPAEALWHFKLGHLSNNRWSLV 97 RMIG G+ GLYYL IP ALWHF+ GH SN+R ++ Sbjct: 457 RMIGSGKLLNGLYYLEGTNASTHSLVKPVTGTVCTVFSIPQSALWHFRFGHASNSRLEIM 516 Query: 96 NSQFPFVHVDKKGVCDVCHFARHKKSSYTLS 4 + +P + ++K VCDVCH A+ KK SY+LS Sbjct: 517 HKSYPSISINKDCVCDVCHLAKQKKLSYSLS 547 >KYP36109.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1115 Score = 72.0 bits (175), Expect = 1e-12 Identities = 37/92 (40%), Positives = 51/92 (55%) Frame = -1 Query: 279 QEQKSHRMIGLGEQCQGLYYLXXXXXXXXXXXXVHGPDFIPAEALWHFKLGHLSNNRWSL 100 Q+ + +MIG +GL++L I LWHF+LGHLS+NR ++ Sbjct: 152 QDMNTLKMIGSANLKEGLFHLDIGKERRSSTNNATATP-INNSNLWHFRLGHLSSNRLNV 210 Query: 99 VNSQFPFVHVDKKGVCDVCHFARHKKSSYTLS 4 +N QFPF+ VCDVCHFA+ KK SY+ S Sbjct: 211 LNQQFPFISKHSNEVCDVCHFAKQKKLSYSPS 242 >XP_014632547.1 PREDICTED: uncharacterized protein LOC106799099 [Glycine max] Length = 428 Score = 69.7 bits (169), Expect = 8e-12 Identities = 35/77 (45%), Positives = 46/77 (59%), Gaps = 1/77 (1%) Frame = -1 Query: 276 EQKSHRMIGLGEQCQGLYYLXXXXXXXXXXXXVHGP-DFIPAEALWHFKLGHLSNNRWSL 100 +QKS RMIG E+ +GLY+L +P +ALWHF+LGHLS +R Sbjct: 158 DQKSQRMIGSAEKFEGLYHLVLDDKHASCSIIQTAEIQILPEDALWHFRLGHLSISRMIS 217 Query: 99 VNSQFPFVHVDKKGVCD 49 +NS+FPF+ VD K VCD Sbjct: 218 MNSEFPFIVVDPKAVCD 234 >KYP56844.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] KYP56859.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 195 Score = 67.4 bits (163), Expect = 1e-11 Identities = 34/87 (39%), Positives = 49/87 (56%), Gaps = 1/87 (1%) Frame = -1 Query: 261 RMIGLGEQCQGLYYLXXXXXXXXXXXXVHGPDFIPAEALWHFKLGHLSNNRWSLVNSQFP 82 +MIG GE GL+YL IP EALWHF+LGH+S++R + P Sbjct: 76 KMIGSGELVNGLFYLKMKKGIHESKAIAAIVASIPEEALWHFRLGHVSSSRIEGLKRIVP 135 Query: 81 FVH-VDKKGVCDVCHFARHKKSSYTLS 4 +H +K+ +CD+CHFA+ K S+ +S 
Sbjct: 136 SIHSQNKEDICDICHFAKQKHISFPIS 162 >KYP65402.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 337 Score = 68.6 bits (166), Expect = 2e-11 Identities = 36/94 (38%), Positives = 49/94 (52%), Gaps = 2/94 (2%) Frame = -1 Query: 279 QEQKSHRMIGLGEQCQGLYYLXXXXXXXXXXXXVHGPDFIPA--EALWHFKLGHLSNNRW 106 QE S RMIGL QGLY+L + LWH +LGHLS +R Sbjct: 237 QEASSLRMIGLASLKQGLYHLVINKEGRLPTFSSSANSTANTNNKNLWHLRLGHLSGSRL 296 Query: 105 SLVNSQFPFVHVDKKGVCDVCHFARHKKSSYTLS 4 +L++ QFPF+ CD+CH ++ KK SY++S Sbjct: 297 NLLHEQFPFISKHINETCDICHLSKQKKLSYSIS 330