BLASTX nr result
ID: Astragalus22_contig00014687
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus22_contig00014687 (801 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|PNY00055.1| Ty3/gypsy retrotransposon protein [Trifolium prat... 388 e-125 dbj|GAU39660.1| hypothetical protein TSUD_60270 [Trifolium subte... 387 e-123 gb|PNX92994.1| Ty3/gypsy retrotransposon protein [Trifolium prat... 390 e-122 dbj|GAU25735.1| hypothetical protein TSUD_216660 [Trifolium subt... 387 e-121 gb|PNY17651.1| retrotransposon-related protein [Trifolium pratense] 388 e-121 gb|PNY17582.1| Ty3/gypsy retrotransposon protein, partial [Trifo... 384 e-121 gb|PNX92911.1| Ty3/gypsy retrotransposon protein [Trifolium prat... 387 e-121 gb|PNX92266.1| retrotransposon-related protein [Trifolium pratense] 385 e-120 gb|PNY17729.1| retrotransposon-related protein [Trifolium pratense] 384 e-120 gb|PNX93486.1| retrotransposon-related protein, partial [Trifoli... 380 e-119 gb|PNX92532.1| retrotransposon-related protein [Trifolium pratense] 380 e-119 dbj|GAU25035.1| hypothetical protein TSUD_155090 [Trifolium subt... 378 e-118 dbj|GAU32562.1| hypothetical protein TSUD_218200 [Trifolium subt... 368 e-117 dbj|GAU27517.1| hypothetical protein TSUD_147110 [Trifolium subt... 357 e-111 dbj|GAU12723.1| hypothetical protein TSUD_122150 [Trifolium subt... 358 e-111 dbj|GAU25025.1| hypothetical protein TSUD_154990 [Trifolium subt... 342 e-110 gb|PNY06838.1| retrotransposon-related protein [Trifolium pratense] 341 e-109 gb|KYP35812.1| Retrovirus-related Pol polyprotein from transposo... 329 e-108 ref|XP_022041420.1| uncharacterized protein LOC110944000 [Helian... 335 e-104 gb|KYP76361.1| Retrovirus-related Pol polyprotein from transposo... 311 e-104 >gb|PNY00055.1| Ty3/gypsy retrotransposon protein [Trifolium pratense] Length = 1005 Score = 388 bits (996), Expect = e-125 Identities = 191/267 (71%), Positives = 220/267 (82%) Frame = -1 Query: 801 VMPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNH 622 VMPFGLTNAPASFQ+LMN +F+PFLRKFVI FFDD+L+YS +LE H HL+ +F TIR++ Sbjct: 228 VMPFGLTNAPASFQSLMNQVFSPFLRKFVIVFFDDLLIYSQSLEDHQVHLQLIFQTIRDN 287 Query: 621 QLFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYY 442 LFL K KC+FA +VEYLGHFI VSTDP KIQAV WP+P NLKQLRGFLGL+GYY Sbjct: 288 HLFLNKSKCNFALPRVEYLGHFITREGVSTDPLKIQAVSSWPIPQNLKQLRGFLGLAGYY 347 Query: 441 RRFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETD 262 RRFVRD+GKIAKPLTDLLKKDSF WS +AT +F +K++L+S+PVL LPDFSK +VETD Sbjct: 348 RRFVRDFGKIAKPLTDLLKKDSFIWSAEATQAFTTLKQALVSAPVLCLPDFSKKFIVETD 407 Query: 261 A*GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYERELLAIVYAVQKWGAYLSHRPFVXX 82 A GKGIG VLMQ HPIAYISKSLGP+QQ+MSVYERELLAIVYAVQKWG+YLSH PF+ Sbjct: 408 ASGKGIGAVLMQNQHPIAYISKSLGPKQQAMSVYERELLAIVYAVQKWGSYLSHAPFI-- 465 Query: 81 XXXXXXXXXXKYPLDQKLNTPFQQVWM 1 K+ LDQKLNTPFQQVW+ Sbjct: 466 --IKTDQKSIKHMLDQKLNTPFQQVWV 490 >dbj|GAU39660.1| hypothetical protein TSUD_60270 [Trifolium subterraneum] Length = 1128 Score = 387 bits (993), Expect = e-123 Identities = 190/267 (71%), Positives = 221/267 (82%) Frame = -1 Query: 801 VMPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNH 622 VMPFG TNAPASFQALMN+IF PFLRKFVI FFDD+L+YS L+ H HLR +F T+RN+ Sbjct: 382 VMPFGFTNAPASFQALMNHIFKPFLRKFVIIFFDDLLIYSRCLDEHENHLRLIFDTVRNN 441 Query: 621 QLFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYY 442 LFL + KC FATSKVEYLGHFI VSTDP+K+QAV +WP+P NLKQLRGFLGL+GYY Sbjct: 442 NLFLNQNKCCFATSKVEYLGHFITAEGVSTDPAKLQAVSEWPLPKNLKQLRGFLGLAGYY 501 Query: 441 RRFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETD 262 RRFV+D+GKIAKPLTD+LKKD F WS+Q+T +F ++K++L+SSPVL+LP F+K +VETD Sbjct: 502 RRFVKDFGKIAKPLTDMLKKDCFIWSDQSTKAFEELKQALVSSPVLSLPGFTKQFIVETD 561 Query: 261 A*GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYERELLAIVYAVQKWGAYLSHRPFVXX 82 A GKGIG VLMQ HPIAYISKSLGPRQQ++SVYERELLAIVYAVQKWGAYLSH PFV Sbjct: 562 ASGKGIGAVLMQNCHPIAYISKSLGPRQQALSVYERELLAIVYAVQKWGAYLSHAPFV-- 619 Query: 81 XXXXXXXXXXKYPLDQKLNTPFQQVWM 1 K+ LDQKLNTPFQQ W+ Sbjct: 620 --IKTDHRSIKHMLDQKLNTPFQQAWV 644 >gb|PNX92994.1| Ty3/gypsy retrotransposon protein [Trifolium pratense] Length = 1476 Score = 390 bits (1001), Expect = e-122 Identities = 187/267 (70%), Positives = 227/267 (85%) Frame = -1 Query: 801 VMPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNH 622 VMPFGLTNAPASFQ+LMN+IF PFLRKFVI FFDDILVYSP+ + H+ HL +F T+RN+ Sbjct: 697 VMPFGLTNAPASFQSLMNHIFHPFLRKFVIIFFDDILVYSPSFQEHLTHLEVIFQTLRNN 756 Query: 621 QLFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYY 442 QLFL+K+KCHFAT++VEYLGHFI VSTDPSKIQAV+ WP+P N+KQLRGFLGL+GYY Sbjct: 757 QLFLRKEKCHFATTRVEYLGHFITKEGVSTDPSKIQAVESWPLPINIKQLRGFLGLAGYY 816 Query: 441 RRFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETD 262 RRFV+++GKIAKPLTDLL+KD+F+WS AT SFN++K +LI++PVL LPDF+K VVETD Sbjct: 817 RRFVKNFGKIAKPLTDLLRKDAFHWSPAATASFNELKNALITAPVLILPDFTKPFVVETD 876 Query: 261 A*GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYERELLAIVYAVQKWGAYLSHRPFVXX 82 A G GIG VLMQ+ HP+AYISK+LG +QQ+MS+YERELLAIVYAVQ+WG YL+H+PFV Sbjct: 877 ASGTGIGAVLMQDRHPVAYISKALGIKQQAMSIYERELLAIVYAVQRWGTYLAHKPFV-- 934 Query: 81 XXXXXXXXXXKYPLDQKLNTPFQQVWM 1 K+ L+Q+LNTPFQQVWM Sbjct: 935 --IKTDQKSIKHMLEQRLNTPFQQVWM 959 >dbj|GAU25735.1| hypothetical protein TSUD_216660 [Trifolium subterraneum] Length = 1417 Score = 387 bits (995), Expect = e-121 Identities = 190/267 (71%), Positives = 220/267 (82%) Frame = -1 Query: 801 VMPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNH 622 VMPFGLTNAPASFQ+LMN +F+ FLRKFVI FFD +L+YS ++E H+ HL+ +F TIR+H Sbjct: 701 VMPFGLTNAPASFQSLMNQVFSSFLRKFVIIFFDVLLIYSKSMEDHIVHLQLIFQTIRDH 760 Query: 621 QLFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYY 442 LFL K KC FA KVEYLGHFI VSTDP+K+QAV WP P NLKQLRGFLGL+GYY Sbjct: 761 NLFLNKSKCSFALPKVEYLGHFITKEGVSTDPAKVQAVNSWPPPQNLKQLRGFLGLAGYY 820 Query: 441 RRFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETD 262 RRFV+D+GK+AKPLTDLLKKDSF WSE AT +F Q+K++L S+PVL LPDFSK VVETD Sbjct: 821 RRFVKDFGKLAKPLTDLLKKDSFVWSESATQAFLQLKQALTSAPVLCLPDFSKPFVVETD 880 Query: 261 A*GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYERELLAIVYAVQKWGAYLSHRPFVXX 82 A GKGIG VLMQEHHP+AYISKSLGP+QQ+MS+YERELLAIVYAVQKWG+YLSH PF+ Sbjct: 881 ASGKGIGAVLMQEHHPVAYISKSLGPKQQAMSIYERELLAIVYAVQKWGSYLSHAPFI-- 938 Query: 81 XXXXXXXXXXKYPLDQKLNTPFQQVWM 1 K+ LDQKLNTPFQQVW+ Sbjct: 939 --IKTDQKSIKHMLDQKLNTPFQQVWV 963 >gb|PNY17651.1| retrotransposon-related protein [Trifolium pratense] Length = 1478 Score = 388 bits (996), Expect = e-121 Identities = 187/267 (70%), Positives = 223/267 (83%) Frame = -1 Query: 801 VMPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNH 622 VMPFGLTNAPASFQALMN+IF PFLRKFVI FFDDIL+YS +++ HV HL QVF+TIR H Sbjct: 701 VMPFGLTNAPASFQALMNHIFKPFLRKFVIVFFDDILIYSSSMDQHVSHLAQVFLTIREH 760 Query: 621 QLFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYY 442 L+L K KC FA +KVEYLGHF+ VSTDP+KI AVKDWP+P NLKQLRGFLGL+GYY Sbjct: 761 MLYLNKTKCQFAATKVEYLGHFLSAQGVSTDPAKISAVKDWPLPKNLKQLRGFLGLAGYY 820 Query: 441 RRFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETD 262 RRFV+D+GKI++PLTD+LKKD+F+W++ A +F+ +K+SLI++PVL LPDF+K +VETD Sbjct: 821 RRFVKDFGKISQPLTDMLKKDNFHWNDLAKHAFSNLKQSLITAPVLQLPDFTKKFIVETD 880 Query: 261 A*GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYERELLAIVYAVQKWGAYLSHRPFVXX 82 A GKGIG VLMQ HPIA+ISKSLGPRQQ+MSVYERELLAI+YAVQKWGAYLSH PF+ Sbjct: 881 ASGKGIGAVLMQNKHPIAFISKSLGPRQQAMSVYERELLAIIYAVQKWGAYLSHAPFI-- 938 Query: 81 XXXXXXXXXXKYPLDQKLNTPFQQVWM 1 K+ L+QKLNTPFQQ W+ Sbjct: 939 --IKTDQKSIKHILEQKLNTPFQQAWV 963 >gb|PNY17582.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense] Length = 1251 Score = 384 bits (986), Expect = e-121 Identities = 186/267 (69%), Positives = 220/267 (82%) Frame = -1 Query: 801 VMPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNH 622 VMPFGLTNAPASFQA MN+IF PFLRKFVI FFDDIL+YS +++ H+ HL QVF+TIR H Sbjct: 498 VMPFGLTNAPASFQAFMNHIFKPFLRKFVIIFFDDILIYSSSMDQHLSHLTQVFLTIREH 557 Query: 621 QLFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYY 442 LFL K KC FA++KVEYLGHF+ VSTDP+KI AVKDWP+P NLKQLRGFLGL+GYY Sbjct: 558 HLFLNKAKCQFASNKVEYLGHFLSAQGVSTDPAKISAVKDWPLPKNLKQLRGFLGLAGYY 617 Query: 441 RRFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETD 262 RRFV+D+G I++PLTDLLKKD+F+W+ A +F+ +K+SL+++PVL LPDF+K VETD Sbjct: 618 RRFVKDFGTISQPLTDLLKKDNFHWNALAKQAFSNLKQSLVTAPVLQLPDFTKKFTVETD 677 Query: 261 A*GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYERELLAIVYAVQKWGAYLSHRPFVXX 82 A GKGIG VLMQ HPIAYISKSLGPRQQ+MSVYERELLAI+YAVQKWGAYLSH PF+ Sbjct: 678 ASGKGIGAVLMQNKHPIAYISKSLGPRQQAMSVYERELLAIIYAVQKWGAYLSHAPFI-- 735 Query: 81 XXXXXXXXXXKYPLDQKLNTPFQQVWM 1 K+ L+QKLNTPFQQ W+ Sbjct: 736 --IKTDQKSIKHILEQKLNTPFQQAWV 760 >gb|PNX92911.1| Ty3/gypsy retrotransposon protein [Trifolium pratense] Length = 1478 Score = 387 bits (993), Expect = e-121 Identities = 188/267 (70%), Positives = 220/267 (82%) Frame = -1 Query: 801 VMPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNH 622 VMPFGLTNAPASFQ+LMN++F PFLRKFVI FFDD+L+YS +LE HV HLR +F T+R++ Sbjct: 701 VMPFGLTNAPASFQSLMNHVFQPFLRKFVIIFFDDLLIYSKSLEDHVCHLRMIFQTVRDN 760 Query: 621 QLFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYY 442 L L K KC FA KVEYLGHFI +STDPSKIQAV WP+P NLKQLRGFLGL+GYY Sbjct: 761 HLLLNKSKCSFALPKVEYLGHFITVDGISTDPSKIQAVSSWPIPQNLKQLRGFLGLAGYY 820 Query: 441 RRFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETD 262 RRFV+D+GKIAKPLTDLLKKD F WSE AT +F +K++L+++PVL LPDF+K VVETD Sbjct: 821 RRFVKDFGKIAKPLTDLLKKDCFIWSENATSAFLNLKQALVTAPVLCLPDFTKKFVVETD 880 Query: 261 A*GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYERELLAIVYAVQKWGAYLSHRPFVXX 82 A G GIG VLMQ+HHP+AYISKSLGP+QQ+MS+YERE+LAIVYAVQKWG+YLSH PFV Sbjct: 881 ASGTGIGAVLMQDHHPVAYISKSLGPKQQAMSIYEREMLAIVYAVQKWGSYLSHAPFV-- 938 Query: 81 XXXXXXXXXXKYPLDQKLNTPFQQVWM 1 K+ LDQKLNTPFQQVW+ Sbjct: 939 --IKTDQKSIKHMLDQKLNTPFQQVWV 963 >gb|PNX92266.1| retrotransposon-related protein [Trifolium pratense] Length = 1479 Score = 385 bits (990), Expect = e-120 Identities = 186/267 (69%), Positives = 222/267 (83%) Frame = -1 Query: 801 VMPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNH 622 VMPFGLTNAPASFQ+LMN++F PFLR+FVI FFDD+LVYS +L+ H+EHL +F IR H Sbjct: 702 VMPFGLTNAPASFQSLMNHLFKPFLRRFVIIFFDDLLVYSKSLQEHIEHLSSIFKLIRVH 761 Query: 621 QLFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYY 442 QLFL ++KC F T +VEYLGHFI VSTDP+KIQAV+DWP P N KQLRGFLGL+GYY Sbjct: 762 QLFLNRKKCSFGTERVEYLGHFITKEGVSTDPAKIQAVRDWPFPKNPKQLRGFLGLAGYY 821 Query: 441 RRFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETD 262 RRFV+D+GKIAKPLTD+LK+DSF WS ++T +F ++K++LIS+P++ LPDFS+ VVETD Sbjct: 822 RRFVKDFGKIAKPLTDMLKRDSFLWSSESTAAFTELKQALISAPLMRLPDFSQKFVVETD 881 Query: 261 A*GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYERELLAIVYAVQKWGAYLSHRPFVXX 82 A GKGIG VLMQ+HHPIAYISKSLGP+QQ+MSVYERELLAIVYAVQKWGAYLSH PF+ Sbjct: 882 ASGKGIGAVLMQQHHPIAYISKSLGPKQQAMSVYERELLAIVYAVQKWGAYLSHAPFI-- 939 Query: 81 XXXXXXXXXXKYPLDQKLNTPFQQVWM 1 K+ LDQKLNT FQQ W+ Sbjct: 940 --IKTDQRSIKHILDQKLNTSFQQAWV 964 >gb|PNY17729.1| retrotransposon-related protein [Trifolium pratense] Length = 1479 Score = 384 bits (987), Expect = e-120 Identities = 191/267 (71%), Positives = 219/267 (82%) Frame = -1 Query: 801 VMPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNH 622 VMPFGLTNAPASFQALMN +F PFLRKFVI FFDD+LVYS +L H HL+ +F TIR + Sbjct: 701 VMPFGLTNAPASFQALMNQVFKPFLRKFVIVFFDDLLVYSHSLVDHHVHLQLIFQTIREN 760 Query: 621 QLFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYY 442 LFL + KC FA +VEYLGHFI VSTDP KIQAV WP+P NLKQLRGFLGL+GYY Sbjct: 761 HLFLNQSKCAFALPRVEYLGHFISREGVSTDPLKIQAVSSWPIPQNLKQLRGFLGLAGYY 820 Query: 441 RRFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETD 262 RRFV+D+GK+AKPLTDLL+KD+FNWS +AT +F +K++L+S+PVL+LPDFSK VVETD Sbjct: 821 RRFVKDFGKLAKPLTDLLRKDNFNWSAEATHAFVTLKQALVSAPVLSLPDFSKRFVVETD 880 Query: 261 A*GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYERELLAIVYAVQKWGAYLSHRPFVXX 82 A GKGIG VLMQEHHPIAYISKSLGP+QQ+MSVYERELLAIVYAVQKWG+YLSH PF Sbjct: 881 ASGKGIGAVLMQEHHPIAYISKSLGPKQQAMSVYERELLAIVYAVQKWGSYLSHAPFT-- 938 Query: 81 XXXXXXXXXXKYPLDQKLNTPFQQVWM 1 K+ LDQKLNTPFQQVW+ Sbjct: 939 --IKTDQKSIKHMLDQKLNTPFQQVWV 963 >gb|PNX93486.1| retrotransposon-related protein, partial [Trifolium pratense] Length = 1414 Score = 380 bits (976), Expect = e-119 Identities = 184/267 (68%), Positives = 222/267 (83%) Frame = -1 Query: 801 VMPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNH 622 VMPFGLTNAPASFQALMN++F PFLRKFVI FFDD+L+YS +L+ H EHL +F IR++ Sbjct: 692 VMPFGLTNAPASFQALMNHLFKPFLRKFVIIFFDDLLIYSKSLQEHTEHLNSIFQLIRHN 751 Query: 621 QLFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYY 442 LFL ++KC FATS+VEYLGHFI VSTDP+KI+AV +WP PT LKQLRGFLGL+GYY Sbjct: 752 NLFLNQKKCTFATSRVEYLGHFITQEGVSTDPAKIEAVGNWPFPTTLKQLRGFLGLAGYY 811 Query: 441 RRFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETD 262 RRFV+D+GKIAKPLTD+LK+D+F WS +T +F Q+K++LIS+P+L+LPDFSK +VETD Sbjct: 812 RRFVKDFGKIAKPLTDMLKRDNFVWSSDSTNAFTQLKQALISAPLLSLPDFSKKFIVETD 871 Query: 261 A*GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYERELLAIVYAVQKWGAYLSHRPFVXX 82 A GKGIG VLMQ+HHPIAYISKSLGP+QQ MSVYERELLAIVYAVQKWGAYL+H PF+ Sbjct: 872 ASGKGIGAVLMQDHHPIAYISKSLGPKQQVMSVYERELLAIVYAVQKWGAYLAHAPFI-- 929 Query: 81 XXXXXXXXXXKYPLDQKLNTPFQQVWM 1 K+ L+Q+LNT FQQ W+ Sbjct: 930 --IKTDQRSIKHILEQRLNTAFQQAWV 954 >gb|PNX92532.1| retrotransposon-related protein [Trifolium pratense] Length = 1472 Score = 380 bits (977), Expect = e-119 Identities = 187/267 (70%), Positives = 220/267 (82%) Frame = -1 Query: 801 VMPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNH 622 VMPFGLTNAPASFQ+LMN +F PFLRKFVI FFDD+L+YS ++E H+ HLR +F TIR + Sbjct: 695 VMPFGLTNAPASFQSLMNQVFQPFLRKFVIIFFDDLLIYSQSIEDHLVHLRLIFHTIRAN 754 Query: 621 QLFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYY 442 LFL KC FA KVEYLGHFI VSTDP+KIQAV WP+P N+KQLRGFLGL+GYY Sbjct: 755 HLFLNXSKCSFALPKVEYLGHFITKEGVSTDPAKIQAVSSWPIPQNVKQLRGFLGLAGYY 814 Query: 441 RRFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETD 262 RRFV+D+GK+AKPLTDLLKK+ F W++ AT +F Q+K++LI++PVL+LP+FSK VVETD Sbjct: 815 RRFVQDFGKLAKPLTDLLKKEGFVWTDNATQAFMQLKQALITAPVLSLPNFSKQFVVETD 874 Query: 261 A*GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYERELLAIVYAVQKWGAYLSHRPFVXX 82 A GKGIG VLMQE HP+AYISKSLGP+QQ+MSVYERELLAIVYAVQKWG+YLSH PFV Sbjct: 875 ASGKGIGAVLMQEQHPVAYISKSLGPKQQAMSVYERELLAIVYAVQKWGSYLSHAPFV-- 932 Query: 81 XXXXXXXXXXKYPLDQKLNTPFQQVWM 1 K+ LDQKLNTPFQQVW+ Sbjct: 933 --IRTDQKSIKHMLDQKLNTPFQQVWV 957 >dbj|GAU25035.1| hypothetical protein TSUD_155090 [Trifolium subterraneum] Length = 1326 Score = 378 bits (970), Expect = e-118 Identities = 185/267 (69%), Positives = 217/267 (81%) Frame = -1 Query: 801 VMPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNH 622 VMPFGLTNAPASFQALMN+IF PFLRKFVI FFDDILVYS +L H+ HL +F TIR Sbjct: 577 VMPFGLTNAPASFQALMNHIFKPFLRKFVIVFFDDILVYSQSLSDHITHLELIFRTIREQ 636 Query: 621 QLFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYY 442 LFL K KCHF T+KVEYLGHFI VSTDPSKI AV +W +PTNLKQLRGFLGL+GYY Sbjct: 637 NLFLNKAKCHFTTNKVEYLGHFITKEGVSTDPSKISAVSEWHLPTNLKQLRGFLGLAGYY 696 Query: 441 RRFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETD 262 RRFV+++GKIA+PLTD+LK+D+F+W++ + +F +K++L S+PVLALPDF+K VVETD Sbjct: 697 RRFVKNFGKIAQPLTDMLKRDNFHWNDSSKFAFETLKQALASAPVLALPDFTKKFVVETD 756 Query: 261 A*GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYERELLAIVYAVQKWGAYLSHRPFVXX 82 A G GIG VLMQE HPIAYISKSLGP+ ++MSVYERELLAI+YAVQKWGAYLSH PF+ Sbjct: 757 ASGTGIGAVLMQEKHPIAYISKSLGPKHRAMSVYERELLAIIYAVQKWGAYLSHAPFI-- 814 Query: 81 XXXXXXXXXXKYPLDQKLNTPFQQVWM 1 K+ LDQKLNTPFQQ W+ Sbjct: 815 --IKTDQKSIKHILDQKLNTPFQQTWV 839 >dbj|GAU32562.1| hypothetical protein TSUD_218200 [Trifolium subterraneum] Length = 991 Score = 368 bits (944), Expect = e-117 Identities = 184/267 (68%), Positives = 214/267 (80%) Frame = -1 Query: 801 VMPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNH 622 VMPFGLTNAPASFQ+LMN++F PFLRKFVI FFDDIL+YS + HV L VF TIR + Sbjct: 368 VMPFGLTNAPASFQSLMNHVFQPFLRKFVIVFFDDILIYSRSHHEHVARLALVFQTIREN 427 Query: 621 QLFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYY 442 LFL K KC FA+ KVEYLGHFI VSTDPSKI V +WP P NLKQLRGFLGL+GYY Sbjct: 428 NLFLNKSKCSFASLKVEYLGHFITKEGVSTDPSKIMVVSNWPQPQNLKQLRGFLGLAGYY 487 Query: 441 RRFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETD 262 RRFV+D+GKIAKPLTDLLKKD+F W+E+ +F+ +K++LI++PVL LPDF+K VVETD Sbjct: 488 RRFVKDFGKIAKPLTDLLKKDNFVWNEEEISAFSSLKQTLITAPVLTLPDFNKKFVVETD 547 Query: 261 A*GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYERELLAIVYAVQKWGAYLSHRPFVXX 82 A GKGIG VLMQ+HHPIAYISKSLGP QQ++SVYERELLAIVYAVQKWGAYLSH PF+ Sbjct: 548 ASGKGIGAVLMQDHHPIAYISKSLGPMQQALSVYERELLAIVYAVQKWGAYLSHAPFI-- 605 Query: 81 XXXXXXXXXXKYPLDQKLNTPFQQVWM 1 K+ L Q+L+T FQQVW+ Sbjct: 606 --IKTDQKSIKHILKQQLHTSFQQVWV 630 >dbj|GAU27517.1| hypothetical protein TSUD_147110 [Trifolium subterraneum] Length = 1224 Score = 357 bits (916), Expect = e-111 Identities = 174/267 (65%), Positives = 218/267 (81%) Frame = -1 Query: 801 VMPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNH 622 VMPFGLTNAPASFQ+LMN +F +LRKFVI FFDDIL+YS +LE HV HL +VF +R H Sbjct: 443 VMPFGLTNAPASFQSLMNILFQQYLRKFVIIFFDDILIYSSSLEDHVLHLDKVFQILREH 502 Query: 621 QLFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYY 442 +LFL+K+KC FATS+VEYLGH I VSTDP+KIQ V WP+P+++KQLRGFLGL+GYY Sbjct: 503 KLFLRKEKCCFATSRVEYLGHVITKEGVSTDPNKIQVVSSWPLPSSVKQLRGFLGLAGYY 562 Query: 441 RRFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETD 262 RRFV+D+GKIAKPL D+LKK +F+WS +AT +F ++K +L S+PVLALP+F++ ++ET+ Sbjct: 563 RRFVKDFGKIAKPLNDMLKKYAFHWSIEATQAFTELKHALTSTPVLALPNFNQPFILETN 622 Query: 261 A*GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYERELLAIVYAVQKWGAYLSHRPFVXX 82 A GKGIG VLMQ HPIAYISK+LGP+QQ+MSVYERELLAIVYA+QKW YL++R F+ Sbjct: 623 ASGKGIGAVLMQNKHPIAYISKALGPKQQAMSVYERELLAIVYAIQKWSTYLAYRHFI-- 680 Query: 81 XXXXXXXXXXKYPLDQKLNTPFQQVWM 1 K+ L+Q+LNTPFQQVWM Sbjct: 681 --IKTDQKSIKFMLEQRLNTPFQQVWM 705 >dbj|GAU12723.1| hypothetical protein TSUD_122150 [Trifolium subterraneum] Length = 1492 Score = 358 bits (920), Expect = e-111 Identities = 182/276 (65%), Positives = 214/276 (77%), Gaps = 9/276 (3%) Frame = -1 Query: 801 VMPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNH 622 VMPFGLTNAPASFQALMN++F P+LRKFVI FFDDILVYS H HL VF TI+++ Sbjct: 705 VMPFGLTNAPASFQALMNHVFAPYLRKFVIIFFDDILVYSQCSADHEIHLAIVFQTIKDN 764 Query: 621 QLFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYY 442 L L + KC FAT+ VEYLGHFI VSTDPSKI AV WP+P NLKQLRGFLGL+GYY Sbjct: 765 GLLLNRSKCRFATTCVEYLGHFITKEGVSTDPSKISAVNSWPIPKNLKQLRGFLGLAGYY 824 Query: 441 RRFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETD 262 RRFV+D+GKIA+PLTDLLK+D+F+W++++T +F +K++L S+PVL LPDFSK VVETD Sbjct: 825 RRFVKDFGKIAQPLTDLLKRDNFHWNQRSTDAFESLKQALTSAPVLQLPDFSKKFVVETD 884 Query: 261 A*GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYE---------RELLAIVYAVQKWGAY 109 A G G+G VLMQ+ HPIAYISKSLGPRQQ++SVYE RELLAI+YAVQKWGAY Sbjct: 885 ASGCGLGAVLMQDKHPIAYISKSLGPRQQALSVYERELLAIIYARELLAIIYAVQKWGAY 944 Query: 108 LSHRPFVXXXXXXXXXXXXKYPLDQKLNTPFQQVWM 1 LSH PF KY L+QKLNTPFQQ W+ Sbjct: 945 LSHAPFT----IKTDQKSIKYILEQKLNTPFQQAWV 976 >dbj|GAU25025.1| hypothetical protein TSUD_154990 [Trifolium subterraneum] Length = 720 Score = 342 bits (877), Expect = e-110 Identities = 170/267 (63%), Positives = 208/267 (77%) Frame = -1 Query: 801 VMPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNH 622 VMPFGLTNAPASFQALMN IF +LR +P++E HV HL +VF +R H Sbjct: 144 VMPFGLTNAPASFQALMNRIFQQYLR-------------NPSIETHVMHLNKVFQVLREH 190 Query: 621 QLFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYY 442 QLFL+K+KC+FAT KVEYLGHFI VSTDP+K+QAV WP+P+++KQLRGFLGL+GYY Sbjct: 191 QLFLRKEKCYFATDKVEYLGHFITKEGVSTDPNKVQAVSSWPLPSSIKQLRGFLGLAGYY 250 Query: 441 RRFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETD 262 RRFVRD+GKIAKPLTD+LKKDSF+WS AT +F +K +LIS+PVLALPDF++ +ETD Sbjct: 251 RRFVRDFGKIAKPLTDMLKKDSFHWSPTATQAFYDLKNALISAPVLALPDFNQPFTLETD 310 Query: 261 A*GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYERELLAIVYAVQKWGAYLSHRPFVXX 82 A GKGIG VLMQ HPIAYISK+LG +QQ+MS+YERELLAIVYA+QKW YL+++ F+ Sbjct: 311 ASGKGIGAVLMQNKHPIAYISKALGLKQQAMSIYERELLAIVYAIQKWSTYLAYKHFI-- 368 Query: 81 XXXXXXXXXXKYPLDQKLNTPFQQVWM 1 K+ L+Q+LNTPFQQVWM Sbjct: 369 --IKTDQKSIKFMLEQRLNTPFQQVWM 393 >gb|PNY06838.1| retrotransposon-related protein [Trifolium pratense] Length = 775 Score = 341 bits (874), Expect = e-109 Identities = 169/269 (62%), Positives = 212/269 (78%), Gaps = 2/269 (0%) Frame = -1 Query: 801 VMPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNH 622 VMPFGLTNAPASFQALMN +F +LRKFVI FFDDILVYS +L HV HL +VF +R++ Sbjct: 45 VMPFGLTNAPASFQALMNRLFQSYLRKFVIIFFDDILVYSSSLTDHVTHLSKVFQVLRDN 104 Query: 621 QLFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYY 442 +LFL+++K AT +VEYLGHFI +STDP+K+Q V WP P+++KQLRGFLGL+GYY Sbjct: 105 RLFLRREKYSSATPRVEYLGHFIAKEGISTDPNKVQVVSFWPFPSSIKQLRGFLGLAGYY 164 Query: 441 RRFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETD 262 RRFV+D+GKIAKPLTD+LKKD F+WS +AT +F ++ +L S+P LALP+F++ ++ETD Sbjct: 165 RRFVKDFGKIAKPLTDMLKKDVFHWSTEATQTFTELNHALTSAPDLALPNFNQPFILETD 224 Query: 261 A*GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYERELLAIVYAVQKWGAYLSH--RPFV 88 A GKGIG VLMQ HPIAYISK+LGP+QQ+MS+YERELLAIVYA QKW YL++ R F+ Sbjct: 225 ASGKGIGAVLMQNKHPIAYISKALGPKQQAMSIYERELLAIVYATQKWSTYLAYTCRHFI 284 Query: 87 XXXXXXXXXXXXKYPLDQKLNTPFQQVWM 1 K+ L+Q LNTPFQQVWM Sbjct: 285 ----IKTDQKSIKFMLEQSLNTPFQQVWM 309 >gb|KYP35812.1| Retrovirus-related Pol polyprotein from transposon 17.6 [Cajanus cajan] Length = 460 Score = 329 bits (843), Expect = e-108 Identities = 158/267 (59%), Positives = 205/267 (76%) Frame = -1 Query: 801 VMPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNH 622 VMPFGLTNAPA+FQ LMN +F FLRKFV+ FFDDIL+YS L+ H+ HL+ V +T+RN+ Sbjct: 158 VMPFGLTNAPATFQGLMNTMFQQFLRKFVLVFFDDILIYSVNLQEHLNHLKLVLLTLRNN 217 Query: 621 QLFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYY 442 LF ++ KC+FA ++VEYLGHFI V+TDPSKI+A+ WP+P +KQLRGFLGL+GYY Sbjct: 218 YLFARRSKCYFAVNRVEYLGHFISGEGVATDPSKIEAMTKWPLPQTIKQLRGFLGLAGYY 277 Query: 441 RRFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETD 262 RRFV+ YG IAKPLTD+LKKD+F W++ A ++F ++KE L ++P+LALPDFSK VVE D Sbjct: 278 RRFVKGYGVIAKPLTDMLKKDNFRWTQDAKLAFQKLKELLSNTPILALPDFSKVFVVEVD 337 Query: 261 A*GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYERELLAIVYAVQKWGAYLSHRPFVXX 82 A G GIG VLMQ+HHPI YIS+ L +QQS+S YE+ELLA+V+A+Q+W YL +R F+ Sbjct: 338 ASGGGIGAVLMQDHHPIFYISRILNLQQQSLSTYEKELLAVVFAIQRWRHYLLNRHFI-- 395 Query: 81 XXXXXXXXXXKYPLDQKLNTPFQQVWM 1 KY LDQ+L T FQ+ W+ Sbjct: 396 --IKTDHYSLKYILDQRLTTDFQKKWL 420 >ref|XP_022041420.1| uncharacterized protein LOC110944000 [Helianthus annuus] Length = 1038 Score = 335 bits (860), Expect = e-104 Identities = 164/267 (61%), Positives = 196/267 (73%) Frame = -1 Query: 801 VMPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNH 622 VMPFGLTNAP++FQ LMN +F +LRK+V+ FFDDILVYSP + H+ HLR V +R H Sbjct: 679 VMPFGLTNAPSTFQGLMNQVFKQYLRKYVLVFFDDILVYSPCWDTHMSHLRDVLQVLRQH 738 Query: 621 QLFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYY 442 QL +K KC F +K+EYLGH I VSTDP+K+ A++ WPVPTN+K+LRGFLGL+GYY Sbjct: 739 QLVAKKSKCEFGATKLEYLGHIISQKGVSTDPAKVSAIQRWPVPTNVKELRGFLGLTGYY 798 Query: 441 RRFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETD 262 RRFV+ YG I KPLT LLKKDSF WS A +F Q+K S+ PVLALPDF +T VVETD Sbjct: 799 RRFVKGYGTITKPLTQLLKKDSFQWSSLAQQAFEQLKISMSEPPVLALPDFKETFVVETD 858 Query: 261 A*GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYERELLAIVYAVQKWGAYLSHRPFVXX 82 A G GIG VLMQ HPIAYISK+L PR ++S YERELLAI+YAVQKW YL+H FV Sbjct: 859 ASGYGIGAVLMQRGHPIAYISKALSPRHMALSTYERELLAIIYAVQKWQPYLAHNHFV-- 916 Query: 81 XXXXXXXXXXKYPLDQKLNTPFQQVWM 1 KY LD K++TPFQQ W+ Sbjct: 917 --IKTDQHSLKYLLDSKISTPFQQKWL 941 >gb|KYP76361.1| Retrovirus-related Pol polyprotein from transposon 297 [Cajanus cajan] Length = 245 Score = 311 bits (797), Expect = e-104 Identities = 147/237 (62%), Positives = 185/237 (78%) Frame = -1 Query: 798 MPFGLTNAPASFQALMNYIFTPFLRKFVIAFFDDILVYSPTLEAHVEHLRQVFITIRNHQ 619 MPFGLTNAPA+FQ LMN +F +LR+F++ FFDDIL+Y+ L+ H+ HL V +T+R + Sbjct: 1 MPFGLTNAPATFQGLMNDVFKEYLRRFLLVFFDDILIYNKDLQHHLLHLHMVLLTMRRNS 60 Query: 618 LFLQKQKCHFATSKVEYLGHFIPHGQVSTDPSKIQAVKDWPVPTNLKQLRGFLGLSGYYR 439 L+ +K KC+F +VEYLGHFI VSTDP+KI VK+WP+PT LKQLRGFLGL GYYR Sbjct: 61 LYAKKSKCYFGVERVEYLGHFITKDGVSTDPTKIMVVKNWPIPTTLKQLRGFLGLVGYYR 120 Query: 438 RFVRDYGKIAKPLTDLLKKDSFNWSEQATVSFNQMKESLISSPVLALPDFSKTSVVETDA 259 RFVR YG IA+PLT++LKK++F WSE+A +F +K+SLI SPVLALPDFSK VVE DA Sbjct: 121 RFVRGYGSIARPLTNMLKKNNFRWSEEARTAFQSLKDSLIQSPVLALPDFSKVFVVEVDA 180 Query: 258 *GKGIGVVLMQEHHPIAYISKSLGPRQQSMSVYERELLAIVYAVQKWGAYLSHRPFV 88 G GIG VLMQ HH IA+IS+ L +QQS+S YE+ELLA+V+AVQKW YL + F+ Sbjct: 181 SGYGIGAVLMQNHHLIAFISRVLNLQQQSLSTYEKELLAVVFAVQKWRHYLLNTHFI 237