BLASTX nr result
ID: Astragalus24_contig00015605
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00015605
         (1051 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

dbj|GAU46782.1| hypothetical protein TSUD_351810 [Trifolium subt...   215   1e-82
gb|KHN31995.1| Copia protein, partial [Glycine soja]                  222   1e-78
dbj|GAU31202.1| hypothetical protein TSUD_210590 [Trifolium subt...   193   1e-78
dbj|GAU41219.1| hypothetical protein TSUD_128950 [Trifolium subt...   199   1e-78
gb|PNX58721.1| putative copia-type protein, partial [Trifolium p...   212   5e-78
gb|PNX92373.1| retrovirus-related Pol polyprotein from transposo...   208   1e-76
gb|PNX87108.1| retrovirus-related Pol polyprotein from transposo...   207   3e-76
gb|PNX86354.1| retrovirus-related Pol polyprotein from transposo...   207   3e-76
dbj|GAU32754.1| hypothetical protein TSUD_323220 [Trifolium subt...   193   6e-76
gb|PNX92076.1| retrovirus-related Pol polyprotein from transposo...   201   8e-76
dbj|GAU51097.1| hypothetical protein TSUD_185270 [Trifolium subt...   189   1e-75
gb|PNX94461.1| retrovirus-related Pol polyprotein from transposo...   203   1e-75
gb|PNX93391.1| retrovirus-related Pol polyprotein from transposo...   197   2e-75
dbj|GAU36120.1| hypothetical protein TSUD_374830 [Trifolium subt...   192   4e-74
dbj|GAU41486.1| hypothetical protein TSUD_239620 [Trifolium subt...   186   4e-74
gb|AJY78065.1| putative polyprotein [Glycine max]                     197   6e-74
gb|KHN24193.1| Retrovirus-related Pol polyprotein from transposo...
197 1e-73 dbj|GAU39523.1| hypothetical protein TSUD_222930 [Trifolium subt... 184 5e-73 dbj|GAU51049.1| hypothetical protein TSUD_371270 [Trifolium subt... 182 1e-72 dbj|GAU46985.1| hypothetical protein TSUD_403190 [Trifolium subt... 182 2e-72 >dbj|GAU46782.1| hypothetical protein TSUD_351810 [Trifolium subterraneum] Length = 1512 Score = 215 bits (547), Expect(2) = 1e-82 Identities = 101/148 (68%), Positives = 124/148 (83%) Frame = +3 Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FF+GSSLISWR KKQ TVSRSSSEAEYR++S A ELQW+++LL+DL+I CERPP LYCD Sbjct: 1361 FFIGSSLISWRAKKQNTVSRSSSEAEYRSLSFASCELQWIVYLLKDLSIDCERPPVLYCD 1420 Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 +QSA+HIA+NPVFHERT+HLEIDCH VR+K+Q G+ KLL + +K QLADFFTKAL P F Sbjct: 1421 NQSAIHIASNPVFHERTKHLEIDCHLVRDKVQSGVFKLLPISTKAQLADFFTKALPPKVF 1480 Query: 636 IPFISKLRMINIYHGQACGGLLKYKNDD 719 F+SKL M+NI+H ACG LL +++D Sbjct: 1481 NSFLSKLNMLNIFHVPACGRLLNEEDND 1508 Score = 121 bits (304), Expect(2) = 1e-82 Identities = 58/100 (58%), Positives = 70/100 (70%), Gaps = 8/100 (8%) Frame = +2 Query: 5 DGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKG 184 D AY D YRRLIG+LLY+T TRPDI+FA+QQ+ Q ++SPT TH CRV+RYLKG Sbjct: 1263 DSSPAYDDVGGYRRLIGKLLYLTTTRPDISFAIQQLSQFLSSPTTTHFDTACRVVRYLKG 1322 Query: 185 NPGMGLMFPRNFILQLSGF--------SDTRKSVTGYCFF 280 +PG GL FPR LQL GF +DTR+S +GYCFF Sbjct: 1323 SPGRGLFFPRQSPLQLLGFADADWANCADTRRSTSGYCFF 1362 >gb|KHN31995.1| Copia protein, partial [Glycine soja] Length = 224 Score = 222 bits (566), Expect(2) = 1e-78 Identities = 105/139 (75%), Positives = 121/139 (87%) Frame = +3 Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FFLG+SLISWR KKQQTVSRSSSEAEYRA+ST ELQWLL+LL DL+I+C R P LYCD Sbjct: 86 FFLGASLISWRAKKQQTVSRSSSEAEYRALSTTACELQWLLYLLHDLHITCTRAPALYCD 145 Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 +QSALHIAANP+FHERT+HLEIDCHFVR K+QEG+++LL + SK QLADFFTK L P +F Sbjct: 146 
NQSALHIAANPMFHERTKHLEIDCHFVRNKIQEGVLRLLPISSKEQLADFFTKVLPPPSF 205 Query: 636 IPFISKLRMINIYHGQACG 692 +PFISKL MI+IYH ACG Sbjct: 206 VPFISKLGMIDIYHAPACG 224 Score = 101 bits (251), Expect(2) = 1e-78 Identities = 48/87 (55%), Positives = 64/87 (73%), Gaps = 8/87 (9%) Frame = +2 Query: 44 RLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKGNPGMGLMFPRNFI 223 RLIG+LLY+ TRP+ITFA QQ+ Q ++ PT+TH++A RV+ YLKG+PG GL FPR Sbjct: 1 RLIGKLLYLNNTRPNITFATQQLSQFLSKPTMTHYNAAYRVVIYLKGSPGQGLFFPRKSE 60 Query: 224 LQLSGFS--------DTRKSVTGYCFF 280 +QL GFS D+R+S++GYCFF Sbjct: 61 IQLLGFSNAYWAGCLDSRRSISGYCFF 87 >dbj|GAU31202.1| hypothetical protein TSUD_210590 [Trifolium subterraneum] Length = 1059 Score = 193 bits (490), Expect(2) = 1e-78 Identities = 92/140 (65%), Positives = 113/140 (80%) Frame = +3 Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FFLGSSL+SW+ KKQ TVSRSSSEAEYRA+STA EL WL FL++DLNI C +PP +YCD Sbjct: 919 FFLGSSLVSWKAKKQLTVSRSSSEAEYRALSTATCELIWLTFLMKDLNIHCSKPPVIYCD 978 Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 QSA+HIA+NPVFHERT+HLEI+CHFVREKLQ+GL++LL + ++ QLAD TK L F Sbjct: 979 SQSAMHIASNPVFHERTKHLEIECHFVREKLQQGLLRLLPISTEDQLADCLTKPLAAPKF 1038 Query: 636 IPFISKLRMINIYHGQACGG 695 FISKL +++IY + GG Sbjct: 1039 NSFISKLGLLDIYEPKLEGG 1058 Score = 130 bits (326), Expect(2) = 1e-78 Identities = 61/101 (60%), Positives = 77/101 (76%), Gaps = 8/101 (7%) Frame = +2 Query: 2 DDGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLK 181 +D G Y+D S+YRRLIGRLLY+T TRPDI+FA+QQ+ Q ++ PT+ H++A CRV+RYLK Sbjct: 820 NDAGKLYEDISAYRRLIGRLLYLTNTRPDISFAIQQLSQFLSKPTMVHYNAACRVVRYLK 879 Query: 182 GNPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280 NPG GL FPR+F LQL GF+ DTR+S TGYCFF Sbjct: 880 HNPGRGLFFPRHFDLQLLGFTDADWARCIDTRRSTTGYCFF 920 >dbj|GAU41219.1| hypothetical protein TSUD_128950 [Trifolium subterraneum] Length = 539 Score = 199 bits (505), Expect(2) = 1e-78 Identities = 94/136 (69%), Positives = 113/136 (83%) Frame = +3 Query: 276 
FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FFLG SL+SW+ KKQ TVSRSSSEA+YRA+STA EL WLLFLLRDLN +C +PP LYCD Sbjct: 399 FFLGMSLVSWKAKKQVTVSRSSSEADYRALSTATCELIWLLFLLRDLNTTCSKPPVLYCD 458 Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 QSA+HIA+NPVFHERT+HLEIDCH VREK+Q+GL+KLL + ++ QLADF TKAL F Sbjct: 459 SQSAMHIASNPVFHERTKHLEIDCHLVREKVQQGLLKLLPISTQEQLADFLTKALPSPKF 518 Query: 636 IPFISKLRMINIYHGQ 683 F+SKL M++IYH + Sbjct: 519 NSFVSKLGMLDIYHSK 534 Score = 124 bits (311), Expect(2) = 1e-78 Identities = 59/100 (59%), Positives = 73/100 (73%), Gaps = 8/100 (8%) Frame = +2 Query: 5 DGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKG 184 D G Y D SSYRRLIG+LLY+T TRPDI+FA QQ+ Q ++ PTVTH+ A CRV+RYLK Sbjct: 301 DNGETYADISSYRRLIGKLLYLTNTRPDISFATQQLSQFLHKPTVTHYKAACRVVRYLKH 360 Query: 185 NPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280 +PG GLM PRN +Q+ G+S DTR+S +GYCFF Sbjct: 361 SPGKGLMLPRNSEIQILGYSDADWAGCLDTRRSTSGYCFF 400 >gb|PNX58721.1| putative copia-type protein, partial [Trifolium pratense] Length = 277 Score = 212 bits (540), Expect(2) = 5e-78 Identities = 100/134 (74%), Positives = 118/134 (88%) Frame = +3 Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FFLG+SLISWR KKQ TVSRSSSEAEYRA+S A ELQWL++LL+DL ++C +PP LYCD Sbjct: 143 FFLGNSLISWRAKKQHTVSRSSSEAEYRALSFASCELQWLVYLLKDLQVNCIKPPVLYCD 202 Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 +QSALHIAANPVFHERT+HLEIDCHFVREKLQ+G+ KLL + +K QLADFFTKAL P +F Sbjct: 203 NQSALHIAANPVFHERTKHLEIDCHFVREKLQQGIFKLLPIHTKAQLADFFTKALPPKSF 262 Query: 636 IPFISKLRMINIYH 677 + FISKL M++IYH Sbjct: 263 LSFISKLNMLDIYH 276 Score = 108 bits (271), Expect(2) = 5e-78 Identities = 51/100 (51%), Positives = 67/100 (67%), Gaps = 8/100 (8%) Frame = +2 Query: 5 DGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKG 184 D Y D + YRRL+G+LLY+T TRPDI F QQ+ Q +++PT TH+ CRV+RYLK Sbjct: 45 DTASPYADIAGYRRLVGKLLYLTTTRPDIAFVTQQLSQFLSAPTQTHYDTACRVVRYLKN 104 Query: 185 
NPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280 +PG GL+F R+ L L GF+ DTR+S +GYCFF Sbjct: 105 SPGRGLLFRRDSQLHLLGFTDADWAGCLDTRRSTSGYCFF 144 >gb|PNX92373.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 1125 Score = 208 bits (529), Expect(2) = 1e-76 Identities = 98/150 (65%), Positives = 123/150 (82%) Frame = +3 Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FFLG SL+SWRTKKQ TV+RSSSEAEYRA+++A ELQWLL+LL+DL + C + P +YCD Sbjct: 976 FFLGQSLVSWRTKKQFTVARSSSEAEYRALASATCELQWLLYLLQDLGVPCSKLPVIYCD 1035 Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 +QSALHIAANPVFHERT+HL+IDCH VREK+ G+MKLL V SK Q+ADFFTKAL P F Sbjct: 1036 NQSALHIAANPVFHERTKHLDIDCHIVREKMLAGVMKLLPVSSKDQIADFFTKALLPQPF 1095 Query: 636 IPFISKLRMINIYHGQACGGLLKYKNDDST 725 ++KL M++IYH CG +L++K +D+T Sbjct: 1096 GILLAKLGMVDIYHPPTCGRVLEHKTEDNT 1125 Score = 108 bits (270), Expect(2) = 1e-76 Identities = 53/101 (52%), Positives = 71/101 (70%), Gaps = 8/101 (7%) Frame = +2 Query: 2 DDGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLK 181 +D G ++D S+YRRL+GRLLY+T TRPDIT+ QQ+ Q ++ PT H++A RVL+YLK Sbjct: 877 NDIGPIFEDVSAYRRLVGRLLYLTTTRPDITYVTQQLSQFLSRPTQMHYNAALRVLKYLK 936 Query: 182 GNPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280 +PG GL FPR LQL GFS D+R+S++G CFF Sbjct: 937 TSPGRGLFFPRASQLQLLGFSDADWAGCKDSRRSISGQCFF 977 >gb|PNX87108.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 517 Score = 207 bits (526), Expect(2) = 3e-76 Identities = 100/148 (67%), Positives = 118/148 (79%) Frame = +3 Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FFLG SLISWRTKKQ TVSRSSSEAEYRA++ A ELQW+L+LL+D+ I C + P +YCD Sbjct: 359 FFLGKSLISWRTKKQLTVSRSSSEAEYRALAAATCELQWILYLLKDIQIQCSKLPVIYCD 418 Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 +QSALHIAANPVFHERT+HLEIDCH VREKLQ G+MKLL V S+ Q+ADFFTKAL P F Sbjct: 419 
NQSALHIAANPVFHERTKHLEIDCHLVREKLQAGVMKLLPVTSQNQVADFFTKALLPQPF 478 Query: 636 IPFISKLRMINIYHGQACGGLLKYKNDD 719 +SKL +++IY CGGLL +D Sbjct: 479 NTLMSKLNLLDIYQPSPCGGLLHSNIED 506 Score = 108 bits (270), Expect(2) = 3e-76 Identities = 51/100 (51%), Positives = 67/100 (67%), Gaps = 8/100 (8%) Frame = +2 Query: 5 DGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKG 184 D G Y D +YRRLIGRL+Y+ TRPDIT+ QQ+ Q ++ PT H++A RVL+YLK Sbjct: 261 DNGTPYDDIPAYRRLIGRLIYLNTTRPDITYVTQQLSQFLSKPTTNHYNAAIRVLKYLKN 320 Query: 185 NPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280 +PG GL FPR+ L + GFS DTR+S++G CFF Sbjct: 321 SPGRGLFFPRDSSLHILGFSDADWAGCVDTRRSISGQCFF 360 >gb|PNX86354.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 512 Score = 207 bits (526), Expect(2) = 3e-76 Identities = 102/154 (66%), Positives = 119/154 (77%) Frame = +3 Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FFLG SLISWRTKKQ TVSRSSSEAEYRA++ A ELQW+L+LL+D+ I C + P +YCD Sbjct: 359 FFLGKSLISWRTKKQLTVSRSSSEAEYRALAAATCELQWILYLLKDIQIQCSKLPVIYCD 418 Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 +QSALHIAANPVFHERT+HLEIDCH VREKLQ G+MKLL V S+ Q+ADFFTKAL P F Sbjct: 419 NQSALHIAANPVFHERTKHLEIDCHLVREKLQAGVMKLLPVTSQNQVADFFTKALLPQPF 478 Query: 636 IPFISKLRMINIYHGQACGGLLKYKNDDSTGDLS 737 +SKL + +IY CGGLL +D LS Sbjct: 479 NTLMSKLNLQDIYQPSPCGGLLHSNIEDKDKSLS 512 Score = 108 bits (270), Expect(2) = 3e-76 Identities = 51/100 (51%), Positives = 67/100 (67%), Gaps = 8/100 (8%) Frame = +2 Query: 5 DGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKG 184 D G Y D +YRRLIGRL+Y+ TRPDIT+ QQ+ Q ++ PT H++A RVL+YLK Sbjct: 261 DNGTPYDDIPAYRRLIGRLIYLNTTRPDITYVTQQLSQFLSKPTTNHYNAAIRVLKYLKN 320 Query: 185 NPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280 +PG GL FPR+ L + GFS DTR+S++G CFF Sbjct: 321 SPGRGLFFPRDSSLHILGFSDADWAGCVDTRRSISGQCFF 360 >dbj|GAU32754.1| hypothetical protein TSUD_323220 [Trifolium subterraneum] Length = 1095 Score = 193 bits (491), 
Expect(2) = 6e-76 Identities = 97/177 (54%), Positives = 124/177 (70%) Frame = +3 Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FFLGSSLISW+ KKQ T+S+SSSEAEYRA+S++ EL WLL+LL+DL I C + P ++CD Sbjct: 587 FFLGSSLISWKAKKQLTISKSSSEAEYRALSSSTCELIWLLYLLKDLQIECTQLPVIFCD 646 Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 +QSALHIA+NPVFHERT+H+EIDCH VREK+Q GL++LL + ++ QL D TKAL F Sbjct: 647 NQSALHIASNPVFHERTKHIEIDCHLVREKVQAGLLRLLPISTQDQLTDCLTKALPTAKF 706 Query: 636 IPFISKLRMINIYHGQACGGLLKYKNDDSTGDLSNNSANEDMHKEDSSKCDKAKQSE 806 FI+KL +++IY ACG LL K S SNN + +D K D E Sbjct: 707 NHFIAKLGLLDIYQASACGRLLNIKIASSP---SNNHEEASLANDDQVKSDLYNMQE 760 Score = 120 bits (302), Expect(2) = 6e-76 Identities = 55/101 (54%), Positives = 74/101 (73%), Gaps = 8/101 (7%) Frame = +2 Query: 2 DDGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLK 181 +D G ++D S YRRL+G+LLY+T TRPDI +A QQ+ Q +++PT+TH+ A CRV+RYLK Sbjct: 488 NDNGKPFEDISLYRRLVGKLLYLTNTRPDIAYATQQLSQFLHNPTITHYKAACRVVRYLK 547 Query: 182 GNPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280 NPG GLMF RN +Q+ G+S DTR+S +GYCFF Sbjct: 548 HNPGRGLMFHRNLDIQIIGYSDADWAGCLDTRRSTSGYCFF 588 >gb|PNX92076.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 720 Score = 201 bits (510), Expect(2) = 8e-76 Identities = 97/143 (67%), Positives = 116/143 (81%) Frame = +3 Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FFLG SLISWRTKKQ TV+RSSSEAEYRA++ A ELQWL +LL+DL+I+C + P LYCD Sbjct: 557 FFLGQSLISWRTKKQLTVARSSSEAEYRALAAATCELQWLAYLLQDLHITCPKLPVLYCD 616 Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 +QSALHIAANPVFHERT+H++IDCH VREKLQ GLMKLL V SK Q+ADFFTK+L P F Sbjct: 617 NQSALHIAANPVFHERTKHIDIDCHIVREKLQAGLMKLLPVSSKDQIADFFTKSLLPQPF 676 Query: 636 IPFISKLRMINIYHGQACGGLLK 704 ++KL M +IY CG ++K Sbjct: 677 GVLLAKLGMFDIYQAPTCGRVIK 699 Score = 113 bits (282), Expect(2) = 8e-76 Identities = 55/100 (55%), Positives = 69/100 (69%), 
Gaps = 8/100 (8%) Frame = +2 Query: 5 DGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKG 184 D ++D S+YRRL+GRLLY+ TRPDITF QQ+ Q ++ PT TH+SA RVLRYLK Sbjct: 459 DDSAPFEDISAYRRLVGRLLYLNTTRPDITFITQQLSQFLSKPTHTHYSAAMRVLRYLKN 518 Query: 185 NPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280 PG GL FPRN LQ+ GFS D+R+S++G CFF Sbjct: 519 CPGRGLFFPRNSTLQILGFSDADWAGCKDSRRSISGQCFF 558 >dbj|GAU51097.1| hypothetical protein TSUD_185270 [Trifolium subterraneum] Length = 1179 Score = 189 bits (480), Expect(2) = 1e-75 Identities = 87/133 (65%), Positives = 109/133 (81%) Frame = +3 Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FFLGSSLISW+ KKQ+T++RSSS+AEY A+++A ELQWLL+LL DLN+ C RPP LYCD Sbjct: 1046 FFLGSSLISWKAKKQETIARSSSKAEYIALTSATCELQWLLYLLEDLNVKCSRPPVLYCD 1105 Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 QSA+HIA+NPVFHERT+HLEIDCH +REKLQ+G++KLL + + Q+ADF TK L F Sbjct: 1106 SQSAIHIASNPVFHERTKHLEIDCHLIREKLQKGILKLLSISTNEQVADFLTKPLVSPKF 1165 Query: 636 IPFISKLRMINIY 674 +SKL MINI+ Sbjct: 1166 KYLLSKLNMINIF 1178 Score = 124 bits (311), Expect(2) = 1e-75 Identities = 58/100 (58%), Positives = 74/100 (74%), Gaps = 8/100 (8%) Frame = +2 Query: 5 DGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKG 184 D G ++D + YRRLIGRLLY+T TRPDI+ A QQ+ Q + +PT+TH++A CR+LRYLK Sbjct: 948 DNGALFEDITQYRRLIGRLLYLTTTRPDISLATQQLSQFLQAPTITHYNAACRILRYLKQ 1007 Query: 185 NPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280 PG+GLMFPR+ LQL GF+ D+RKS TGYCFF Sbjct: 1008 EPGLGLMFPRDSELQLLGFADADWAGCVDSRKSTTGYCFF 1047 >gb|PNX94461.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 1000 Score = 203 bits (517), Expect(2) = 1e-75 Identities = 99/159 (62%), Positives = 126/159 (79%) Frame = +3 Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FFLG SLISWRTKKQ TV+RSSSEAEYRA+++A ELQWL++LL+DL+++C + P LYCD Sbjct: 840 FFLGQSLISWRTKKQITVARSSSEAEYRALASATCELQWLVYLLQDLHVTCSKLPVLYCD 899 Query: 456 
HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 +QSALHIAANPVFHERT+HL+IDCH VREKLQ GLMKLL V SK Q+ADFFTK L P F Sbjct: 900 NQSALHIAANPVFHERTKHLDIDCHVVREKLQAGLMKLLPVSSKDQIADFFTKTLLPQPF 959 Query: 636 IPFISKLRMINIYHGQACGGLLKYKNDDSTGDLSNNSAN 752 ++KL M++IY CG +L+ + +++ D N+ + Sbjct: 960 GILLAKLGMVDIYQAPPCGRVLEPIHTEASSDNKLNTTH 998 Score = 110 bits (274), Expect(2) = 1e-75 Identities = 52/100 (52%), Positives = 70/100 (70%), Gaps = 8/100 (8%) Frame = +2 Query: 5 DGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKG 184 D ++D S+YRRL+GRLLY+ TRPDITF QQ+ Q ++ PT TH++A RVL+YLK Sbjct: 742 DNSAPFEDISAYRRLVGRLLYLNTTRPDITFITQQLSQFLSKPTHTHYTAALRVLKYLKN 801 Query: 185 NPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280 PG GL FPR+ LQ+ GFS D+R+S++G+CFF Sbjct: 802 CPGRGLFFPRSSSLQILGFSDADWAGCKDSRRSISGHCFF 841 >gb|PNX93391.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 1296 Score = 197 bits (500), Expect(2) = 2e-75 Identities = 95/140 (67%), Positives = 113/140 (80%) Frame = +3 Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FF+G SLISWR KKQ TVSRSSSEAEYRA+S+A ELQWLL+LL DL ++ + PTLYCD Sbjct: 1156 FFMGKSLISWRAKKQATVSRSSSEAEYRALSSATCELQWLLYLLADLKVTLTKTPTLYCD 1215 Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 +QSA+HIA+NPVFHERT+HL+IDCH VREK+ +G++KLL V + Q+ADF TKAL P F Sbjct: 1216 NQSAVHIASNPVFHERTKHLDIDCHLVREKVLQGILKLLPVSTNDQMADFLTKALAPPKF 1275 Query: 636 IPFISKLRMINIYHGQACGG 695 FISKL MINIY Q GG Sbjct: 1276 YEFISKLNMINIYQVQLEGG 1295 Score = 115 bits (288), Expect(2) = 2e-75 Identities = 55/98 (56%), Positives = 69/98 (70%), Gaps = 8/98 (8%) Frame = +2 Query: 11 GGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKGNP 190 G Y D S YRRL+G+LLY+ TRPDI FA QQ+ Q +++PT H++A CRVLRYLK NP Sbjct: 1060 GTPYDDVSGYRRLVGKLLYLNTTRPDIAFATQQLSQFMHAPTNVHYNAACRVLRYLKNNP 1119 Query: 191 GMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280 G G++F R+ LQL G+S DTRKS +GYCFF Sbjct: 1120 
GQGVLFSRDSELQLIGYSDADWAGCMDTRKSTSGYCFF 1157 >dbj|GAU36120.1| hypothetical protein TSUD_374830 [Trifolium subterraneum] Length = 1037 Score = 192 bits (487), Expect(2) = 4e-74 Identities = 90/146 (61%), Positives = 118/146 (80%) Frame = +3 Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FFLGSSL+SW+ KKQ T+S+SSSEAEYRA+S+A EL WLL+LL+DL+I C + P ++CD Sbjct: 874 FFLGSSLVSWKAKKQLTISKSSSEAEYRALSSATCELVWLLYLLKDLHIECTQLPVIFCD 933 Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 +QSALHIA+NPVFHERT+H+EIDCH VREK+QEGL++LL V ++ QLAD TKAL F Sbjct: 934 NQSALHIASNPVFHERTKHIEIDCHLVREKVQEGLLRLLPVSTQDQLADCLTKALPVPKF 993 Query: 636 IPFISKLRMINIYHGQACGGLLKYKN 713 F++KL +++IY ACG +L K+ Sbjct: 994 NHFVTKLGLLDIYQASACGRVLSIKD 1019 Score = 116 bits (290), Expect(2) = 4e-74 Identities = 55/101 (54%), Positives = 72/101 (71%), Gaps = 8/101 (7%) Frame = +2 Query: 2 DDGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLK 181 +D G ++D S YRRLIG+LLY+T TRPDI +A QQ+ Q +++PTVTH A CRV+RYLK Sbjct: 775 NDNGKPFEDVSLYRRLIGKLLYLTNTRPDIAYATQQLSQFLHNPTVTHFKAACRVIRYLK 834 Query: 182 GNPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280 NPG GLMF R+ + + G+S DTR+S +GYCFF Sbjct: 835 HNPGRGLMFYRHSDIHIIGYSNADWAGCLDTRRSTSGYCFF 875 >dbj|GAU41486.1| hypothetical protein TSUD_239620 [Trifolium subterraneum] Length = 794 Score = 186 bits (473), Expect(2) = 4e-74 Identities = 90/140 (64%), Positives = 110/140 (78%) Frame = +3 Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FFLGSSL+SW+ KKQ TVSRSSSEAEYRA+STA EL WLLFL++DL+I C + P +YCD Sbjct: 654 FFLGSSLVSWKAKKQLTVSRSSSEAEYRALSTATCELIWLLFLMKDLSIQCSKQPIIYCD 713 Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 QSA+HIA+NPVFHERT+HLEIDCH VREK+Q+GL++LL + + QLAD TK L F Sbjct: 714 SQSAIHIASNPVFHERTKHLEIDCHLVREKVQQGLLRLLPISTDDQLADCLTKPLAAPKF 773 Query: 636 IPFISKLRMINIYHGQACGG 695 FISKL + +IY + GG Sbjct: 774 NSFISKLGLFDIYEPKLEGG 793 Score = 121 bits (304), Expect(2) = 4e-74 
Identities = 59/101 (58%), Positives = 74/101 (73%), Gaps = 8/101 (7%) Frame = +2 Query: 2 DDGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLK 181 +D G Y+D SSYRRLIGRLLY+T TRPDI+FAVQQ+ Q ++ PT+ H +A CRV+RYLK Sbjct: 555 NDAGKLYEDISSYRRLIGRLLYLTNTRPDISFAVQQLSQFLHKPTMVHFNAACRVVRYLK 614 Query: 182 GNPGMGLMFPRNFILQLSGFSD--------TRKSVTGYCFF 280 NPG GL+F R+ QL GF+D TR+S +GYCFF Sbjct: 615 HNPGRGLLFSRHSDTQLLGFADADWAGCIETRRSTSGYCFF 655 >gb|AJY78065.1| putative polyprotein [Glycine max] Length = 523 Score = 197 bits (500), Expect(2) = 6e-74 Identities = 95/140 (67%), Positives = 111/140 (79%) Frame = +3 Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FF+G SL+SWR KKQ TVSRSSSEAEYRA+S+A ELQWLL+L DL + R PTLYCD Sbjct: 383 FFIGKSLVSWRAKKQATVSRSSSEAEYRALSSAACELQWLLYLFADLRVQLTRTPTLYCD 442 Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 +QSA+HIA+NPVFHERT+HLEIDCH VREKL +G +KLL V + Q+ADF TKAL P F Sbjct: 443 NQSAVHIASNPVFHERTKHLEIDCHLVREKLLKGTLKLLPVSTSDQVADFLTKALAPPKF 502 Query: 636 IPFISKLRMINIYHGQACGG 695 F+SKL MINIYH + GG Sbjct: 503 HDFVSKLSMINIYHDKLEGG 522 Score = 110 bits (276), Expect(2) = 6e-74 Identities = 52/98 (53%), Positives = 66/98 (67%), Gaps = 8/98 (8%) Frame = +2 Query: 11 GGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKGNP 190 G Y D S YRR++G+LLY+ TRPDI FA QQ+ Q + +PT H +A CRVLRYLK NP Sbjct: 287 GTPYADISGYRRIVGKLLYLNTTRPDIAFATQQLSQFMQAPTNVHFNAACRVLRYLKNNP 346 Query: 191 GMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280 G G+ F R +QL G+S D+RKS++GYCFF Sbjct: 347 GQGIFFSRTSEMQLIGYSDADWAGCMDSRKSISGYCFF 384 >gb|KHN24193.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] gb|KHN37451.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 234 Score = 197 bits (500), Expect(2) = 1e-73 Identities = 95/140 (67%), Positives = 111/140 (79%) Frame = +3 Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FF+G SL+SWR KKQ 
TVSRSSSEAEYRA+S+A ELQWLL+L DL + R PTLYCD Sbjct: 95 FFIGKSLVSWRAKKQATVSRSSSEAEYRALSSAACELQWLLYLFADLRVQLTRTPTLYCD 154 Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 +QSA+HIA+NPVFHERT+HLEIDCH VREKL +G +KLL V + Q+ADF TKAL P F Sbjct: 155 NQSAVHIASNPVFHERTKHLEIDCHLVREKLLKGTLKLLPVSTSDQVADFLTKALAPPKF 214 Query: 636 IPFISKLRMINIYHGQACGG 695 F+SKL MINIYH + GG Sbjct: 215 HDFVSKLSMINIYHDKLEGG 234 Score = 109 bits (273), Expect(2) = 1e-73 Identities = 51/95 (53%), Positives = 65/95 (68%), Gaps = 8/95 (8%) Frame = +2 Query: 20 YKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKGNPGMG 199 Y D S YRR++G+LLY+ TRPDI FA QQ+ Q + +PT H +A CRVLRYLK NPG G Sbjct: 2 YADISGYRRIVGKLLYLNTTRPDIAFATQQLSQFMQAPTNVHFNAACRVLRYLKNNPGQG 61 Query: 200 LMFPRNFILQLSGFS--------DTRKSVTGYCFF 280 + F R +QL G+S D+RKS++GYCFF Sbjct: 62 IFFSRTSEMQLIGYSDADWAGCMDSRKSISGYCFF 96 >dbj|GAU39523.1| hypothetical protein TSUD_222930 [Trifolium subterraneum] Length = 1210 Score = 184 bits (467), Expect(2) = 5e-73 Identities = 88/140 (62%), Positives = 110/140 (78%) Frame = +3 Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FF+GSSLISW+ KKQ TVSRSSSEAEYRA+S+ EL WLL L+ DL I C++PP +YCD Sbjct: 1070 FFIGSSLISWKAKKQLTVSRSSSEAEYRALSSTTCELIWLLSLINDLKIQCDKPPVIYCD 1129 Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 QSA+HIA+NPVFHERT+HLEIDCH VREK+Q+G+++LL + ++ QLAD TKAL F Sbjct: 1130 SQSAMHIASNPVFHERTKHLEIDCHLVREKVQQGILRLLPISTQDQLADCLTKALPGPKF 1189 Query: 636 IPFISKLRMINIYHGQACGG 695 ISKL + +IYH + GG Sbjct: 1190 SSIISKLGLKDIYHPKLEGG 1209 Score = 120 bits (301), Expect(2) = 5e-73 Identities = 58/101 (57%), Positives = 72/101 (71%), Gaps = 8/101 (7%) Frame = +2 Query: 2 DDGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLK 181 +D Y D SSYRRL+G+LLY+T TRPDI +A QQ+ Q ++ PT TH++A CRV++YLK Sbjct: 971 NDDAKPYDDISSYRRLVGKLLYLTNTRPDIAYATQQLSQFLHKPTWTHYNAACRVVKYLK 1030 Query: 182 GNPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280 NPG GL+FPR LQL GFS DTR+S 
TGYCFF Sbjct: 1031 QNPGRGLLFPRASDLQLLGFSDADWAGCVDTRRSTTGYCFF 1071 >dbj|GAU51049.1| hypothetical protein TSUD_371270 [Trifolium subterraneum] Length = 1001 Score = 182 bits (462), Expect(2) = 1e-72 Identities = 89/140 (63%), Positives = 111/140 (79%) Frame = +3 Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FF+GSSLISW+ KKQ TVSRSSSEAEYRA+S+ EL WLL L++DL I C++PP +Y D Sbjct: 861 FFIGSSLISWKAKKQLTVSRSSSEAEYRALSSTTCELIWLLSLMKDLKIECDKPPFIYRD 920 Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 QSA+HIA+NPVFHERT+HLEIDCH VREK+Q+GL+KLL + ++ +LAD TKAL F Sbjct: 921 SQSAMHIASNPVFHERTKHLEIDCHLVREKVQQGLLKLLPISTQDKLADCLTKALPGPKF 980 Query: 636 IPFISKLRMINIYHGQACGG 695 FISKL + +IYH + GG Sbjct: 981 NSFISKLGLQDIYHPKLKGG 1000 Score = 120 bits (302), Expect(2) = 1e-72 Identities = 58/101 (57%), Positives = 73/101 (72%), Gaps = 8/101 (7%) Frame = +2 Query: 2 DDGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLK 181 +D Y D SSYRRLIG+LLY+T TRPDI +A QQ+ Q ++ PT TH++A CRV++YLK Sbjct: 762 NDEAKPYADISSYRRLIGKLLYLTNTRPDIAYATQQLSQFLHKPTWTHYNAACRVVKYLK 821 Query: 182 GNPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280 NPG GL+FPR+ LQ+ GFS DTR+S TGYCFF Sbjct: 822 QNPGRGLLFPRSSDLQILGFSDADWAGCVDTRRSTTGYCFF 862 >dbj|GAU46985.1| hypothetical protein TSUD_403190 [Trifolium subterraneum] Length = 1071 Score = 182 bits (463), Expect(2) = 2e-72 Identities = 91/169 (53%), Positives = 121/169 (71%) Frame = +3 Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455 FFLGSSLISW+ KKQ T+S+SS EAEYRA+S++ EL WLL+LL+DL I C + P ++CD Sbjct: 900 FFLGSSLISWKAKKQLTISKSSLEAEYRALSSSTCELIWLLYLLKDLQIECTQLPVIFCD 959 Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635 +QSAL+I++NPVFHE T+H+E+DCH VREK+Q GL++LL + ++ QLAD TKAL F Sbjct: 960 NQSALNISSNPVFHESTKHIELDCHLVREKVQAGLLRLLPISTQDQLADCLTKALPTAKF 1019 Query: 636 IPFISKLRMINIYHGQACGGLLKYKNDDSTGDLSNNSANEDMHKEDSSK 782 FI+KL +++IY ACG LL K S+ SNN + D K Sbjct: 1020 
NHFIAKLGLLDIYQASACGRLLNNKIASSS---SNNYEEASLASNDQVK 1065 Score = 120 bits (300), Expect(2) = 2e-72 Identities = 55/101 (54%), Positives = 74/101 (73%), Gaps = 8/101 (7%) Frame = +2 Query: 2 DDGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLK 181 +D G ++D S YRRL+G+LLY+T TRPDI +A QQ+ Q +++PT+TH+ A CRV+RYLK Sbjct: 801 NDNGKPFEDISLYRRLVGKLLYLTNTRPDIAYATQQLSQFLHNPTITHYKAACRVVRYLK 860 Query: 182 GNPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280 NPG GLMF RN +Q+ G+S DTR+S +GYCFF Sbjct: 861 HNPGRGLMFHRNSDIQIIGYSDADWAGCLDTRRSTSGYCFF 901