BLASTX nr result
ID: Rehmannia30_contig00016559
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia30_contig00016559 (3015 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value

gb|PNX93614.1| retrovirus-related Pol polyprotein from transposo...   654   0.0
gb|KYP61022.1| Retrovirus-related Pol polyprotein from transposo...   645   0.0
gb|PNX74277.1| retrovirus-related Pol polyprotein from transposo...   620   0.0
gb|KYP65734.1| Retrovirus-related Pol polyprotein from transposo...   625   0.0
gb|OMO65653.1| hypothetical protein CCACVL1_21443 [Corchorus cap...   630   0.0
gb|PNY00469.1| retrovirus-related Pol polyprotein from transposo...   614   0.0
dbj|GAU41679.1| hypothetical protein TSUD_272630 [Trifolium subt...   627   0.0
gb|KYP42321.1| Copia protein [Cajanus cajan]                          630   0.0
gb|KYP34298.1| Retrovirus-related Pol polyprotein from transposo...   612   0.0
ref|XP_012486681.1| PREDICTED: LOW QUALITY PROTEIN: retrovirus-r...   600   0.0
gb|KYP55668.1| Retrovirus-related Pol polyprotein from transposo...   615   0.0
gb|PNX93928.1| hypothetical protein L195_g017092, partial [Trifo...   604   0.0
gb|PNX97998.1| retrovirus-related Pol polyprotein from transposo...   603   0.0
gb|KZV53534.1| hypothetical protein F511_42283 [Dorcoceras hygro...   604   0.0
gb|PNX93906.1| hypothetical protein L195_g017068 [Trifolium prat...   607   0.0
gb|KYP34293.1| Retrovirus-related Pol polyprotein from transposo...   611   0.0
gb|PNX93131.1| retrovirus-related Pol polyprotein from transposo...   597   0.0
gb|PNX92076.1| retrovirus-related Pol polyprotein from transposo...
588 0.0 gb|OMP02866.1| Reverse transcriptase, RNA-dependent DNA polymera... 585 0.0 gb|PNY03100.1| retrovirus-related Pol polyprotein from transposo... 582 0.0 >gb|PNX93614.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 1430 Score = 654 bits (1687), Expect = 0.0 Identities = 352/722 (48%), Positives = 452/722 (62%), Gaps = 40/722 (5%) Frame = -3 Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834 CL++A+ H H++KF RA K VFLGY G KG+ LYD+ NHS L SR+V+F+ED FP + Sbjct: 713 CLAFASTLHNHRTKFMPRARKTVFLGYRDGTKGFLLYDISNHSFLVSRNVIFYEDVFPLS 772 Query: 2833 SI---------------------------PVP----SNXXXXXXXXXXXXXXXXXXTSSA 2747 S+ P P + S++ Sbjct: 773 SVNSSHTSSTTTLDNFVLPIDPPNFPSSCPAPLSVSTGTNPLTDHAENSATLVDNQVSNS 832 Query: 2746 PISQPESH----PLRRSSRISKPPAWLSDFITNSVHSSTPMASPSHSAGPDSGDFSLAPT 2579 P P++ P R S+RI K P +L DF H S PS S FS P Sbjct: 833 PAVPPQNSSIPAPTRVSNRIRKIPGYLQDF-----HCSL---LPSQHQSSSSNAFSTYPI 884 Query: 2578 SFNHSSILGAT-YTAFLANLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSL 2402 S + S AT Y F ++S EP +F QACKS W +AM+ EL ALE N TW + L Sbjct: 885 SSSLSYTNCATAYKHFCLSISTTIEPKTFKQACKSDCWKEAMKSELAALELNRTWSIVDL 944 Query: 2401 PPGKRAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVI 2222 P GK IGCKWVYK+K DG++ERYKARLVAKG+ Q GVDY D+FSPVAKL TV+ ++ Sbjct: 945 PTGKNPIGCKWVYKIKHNADGSIERYKARLVAKGYTQMEGVDYFDTFSPVAKLTTVKTLL 1004 Query: 2221 ALATIKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGY----SKAKDGEVCHLQRSLYGLKQ 2054 ALA+IKGW L QLDVNNAFLHG L+E++YM P G S + +VC L +SLYGLKQ Sbjct: 1005 ALASIKGWFLEQLDVNNAFLHGDLNEEVYMSLPPGVIIPNSCSNTPKVCRLHKSLYGLKQ 1064 Query: 2053 ASRQWNAEFCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALK 1874 ASRQW ++ L G++QS DH LF+++ ++++G EI ++K Sbjct: 1065 ASRQWYSKLSSALLSLGYSQSAADHSLFLKKVGSSFTALLVYVDDIVLAGNNSLEITSVK 1124 Query: 1873 RYLDDLFTIKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPG 1694 +LD F IKDLG R+F+G+EIAR+ G +LNQRKY L++L D+G L K + TP P Sbjct: 1125 SFLDKRFQIKDLGNLRFFVGLEIARSKKGILLNQRKYTLELLQDSGNLAAKPSSTPYDPS 1184 
Query: 1693 LKLRAKEGDPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLL 1514 LKL E P DP +RRLIGRLLYL TRPD+T++VQQLSQFV+SP H+ AA +L Sbjct: 1185 LKLHDSESPPYNDPSGYRRLIGRLLYLTTTRPDITFAVQQLSQFVSSPREVHFQAATKVL 1244 Query: 1513 RYLKGSPSVGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTT 1334 RYLK SP+ GLF+S+ SSL L + D+DWA+C TRKS+TG+C+FLG+ L+SWK+KKQ+T Sbjct: 1245 RYLKASPAKGLFFSSSSSLKLSGFSDSDWATCAITRKSITGYCVFLGTSLISWKSKKQST 1304 Query: 1333 VSRSSAEAEYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERT 1154 VSRSS+EAEYRAL S CE+QWL YL DLG+ P ++CDN++AI++ NP FHERT Sbjct: 1305 VSRSSSEAEYRALASLSCELQWLHYLFKDLGIKFDAPAMVYCDNKSAIYLAHNPSFHERT 1364 Query: 1153 KHLEIDCHLVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT* 974 KH+EIDCH+VR +SG +HL V S QLAD TK L +AF L SKLGL D+H+P Sbjct: 1365 KHIEIDCHVVRERIQSGLIHLLPVPSSSQLADVLTKQLSSSAFASLISKLGLLDIHSPAC 1424 Query: 973 GG 968 GG Sbjct: 1425 GG 1426 Score = 143 bits (361), Expect = 7e-31 Identities = 76/216 (35%), Positives = 128/216 (59%), Gaps = 10/216 (4%) Frame = -3 Query: 628 DPLKLYSSDHPGLSLVSSQLTGNNYLSWRRSMLIALGAKTKLGFINGKMEIPKEDSPKYD 449 +P L+ +++P + LV+ L G NY SW RSM IAL +K K+ F++G +E P+ P Y+ Sbjct: 15 NPYYLHPNENPAVVLVTPLLDGKNYHSWLRSMKIALLSKNKMKFVDGTLEQPRVSDPLYE 74 Query: 448 QWRKVDCMVISWILNSISKDLVDAFIYCDSAKDLWDDIAKRFGDCNGPLIYQLERDIANM 269 W + + MV+SWI SIS D+ + I+ D A +W D+ RF + I L+ +I + Sbjct: 75 PWIRCNSMVLSWIQRSISPDIAKSIIWFDHASAVWKDLEFRFSHGDMFKISDLQEEILRL 134 Query: 268 NQGNMSVVEYFTKLKRLWDELACIMPLPACE----------SDTRKLIDERDMNRKLMQF 119 +QG++ + Y+T+LK L +E+ P+ C +D +K E+D +++F Sbjct: 135 HQGSLDISSYYTQLKSLSEEIEIYRPVRDCTCAIPCSCGAVADMKK-YREQDC---VLKF 190 Query: 118 LMGLHESYDQVRNQLLLMDPLPSVDKAYSMALRVEK 11 L GL+E Y VR+Q+++M+PLP + K +S+ L+ E+ Sbjct: 191 LKGLNEQYSHVRSQIMMMEPLPPLHKVFSLVLQQER 226 >gb|KYP61022.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1316 Score = 645 bits (1665), Expect = 0.0 Identities = 342/697 (49%), Positives = 446/697 (63%), Gaps = 15/697 (2%) Frame = -3 Query: 3013 
CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFP-- 2840 CL Y++ H++K RA+ C+FLG+ KGY L++L H LL SR+V+F ED FP Sbjct: 635 CLCYSSTITSHRTKLDPRAHPCIFLGFKPHTKGYLLFNLHTHGLLVSRNVLFHEDHFPSF 694 Query: 2839 -------FAS-IPVPSNXXXXXXXXXXXXXXXXXXTSSAPISQPESHPLRRSSRISKPPA 2684 F+S +P+ N +S S P PLRRS+R +PP Sbjct: 695 TKPHSPSFSSPVPIHYNYVDYPTFPSSSIVESSDPPTSDQHSSPP--PLRRSTRPRRPPT 752 Query: 2683 WLSDF-----ITNSVHSSTPMASPSHSAGPDSGDFSLAPTSFNHSSILGATYTAFLANLS 2519 +L DF T++ HSST + P HS + L SF+H ++ ++S Sbjct: 753 YLQDFHGAFTSTSTAHSSTGIRHPLHSFL----SYDLLSPSFHH----------YVFSIS 798 Query: 2518 NVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMRPDG 2339 +V EP +F++A KS W+ AM E+ ALE NNTWVLT+LPP K AIGC+WVYKVK + DG Sbjct: 799 SVTEPKNFAEASKSDSWLKAMHEEIFALEANNTWVLTTLPPHKTAIGCRWVYKVKHKADG 858 Query: 2338 TVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNAFLH 2159 +++RYKARLVAKG+ Q G+D+ D+FSPVAKL TVRL+++LA I W L QLDVNNAFLH Sbjct: 859 SIDRYKARLVAKGYTQMEGLDFFDTFSPVAKLTTVRLLLSLAAINNWHLKQLDVNNAFLH 918 Query: 2158 GYLDEDIYMLPPEGYSKAKDGEVCHLQRSLYGLKQASRQWNAEFCLKLQQFGFTQSGHDH 1979 G L+E++YM P G + + G+VC LQRSLYGLKQASRQW A L Q G+ S DH Sbjct: 919 GDLNEEVYMQLPPGLTPSFPGQVCRLQRSLYGLKQASRQWYARLSSFLIQHGYVPSPSDH 978 Query: 1978 CLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYFLGMEIAR 1799 LF++ S ++++G L+EI L L F IKDLG +YFLG+E+AR Sbjct: 979 SLFLKCSPATTTAILIYVDDIVLAGNDLTEIHHLTSLLHTTFQIKDLGNLKYFLGLEVAR 1038 Query: 1798 NTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEGDPLVDPERFRRLIGRLL 1619 N G L QRKY+LD+L D GML K TP+ + L A G PL D +RRL+GRL+ Sbjct: 1039 NHTGIHLCQRKYILDLLSDTGMLASKPVSTPMDYSMHLSASSGTPLTDTAAYRRLVGRLI 1098 Query: 1618 YLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLEAYC 1439 YL TRPD+TY+VQQLSQFV++P T+H A +LRYLKG+P G+F S +SS+ L A+ Sbjct: 1099 YLTNTRPDITYAVQQLSQFVSNPTTAHRQALFRILRYLKGTPGSGIFLSVNSSVQLRAFS 1158 Query: 1438 DADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVCEIQWLSY 1259 D+DWA C DTR+S+TGF ++LG L+SWK+KKQ TVSRSS+EAEYRAL + CE+QWLSY Sbjct: 1159 
DSDWAGCPDTRRSITGFAVYLGDSLISWKSKKQITVSRSSSEAEYRALATTTCELQWLSY 1218 Query: 1258 LCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGFLHLGHVS 1079 L D + +P L+CDNQ+A+ I NPVFHERTKH+EIDCH+VR+ +G L L VS Sbjct: 1219 LLKDFHIDPISPSILYCDNQSALQIASNPVFHERTKHIEIDCHIVRDKVSTGLLKLLPVS 1278 Query: 1078 SRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT*GG 968 S QLAD TK L F CSKLG+ ++H+ GG Sbjct: 1279 SSQQLADILTKPLSPFVFRSHCSKLGMLNIHSQLEGG 1315 Score = 100 bits (250), Expect = 1e-17 Identities = 49/146 (33%), Positives = 88/146 (60%), Gaps = 6/146 (4%) Frame = -3 Query: 427 MVISWILNSISKDLVDAFIYCDSAKDLWDDIAKRFGDCNGPLIYQLERDIANMNQGNMSV 248 MV+SW+++S+S + + ++ D A D+W D+ R+ + + L+ + +++ QG++SV Sbjct: 1 MVVSWLVHSVSPSIRQSILWMDQADDIWKDLKTRYSQGDLLRVSDLQLEASSLKQGDLSV 60 Query: 247 VEYFTKLKRLWDELACIMPLPACESDTR------KLIDERDMNRKLMQFLMGLHESYDQV 86 EYFTKL+ LWDEL P P C + +I +R + + MQFL GL++ Y+ V Sbjct: 61 TEYFTKLRILWDELENFRPDPNCTCTIKCACSVLTIIAQRKLEDQAMQFLRGLNDQYNNV 120 Query: 85 RNQLLLMDPLPSVDKAYSMALRVEKQ 8 ++ +LLM+ P + K +S ++ E+Q Sbjct: 121 KSHVLLME--PPISKIFSYVVQQERQ 144 >gb|PNX74277.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 762 Score = 620 bits (1599), Expect = 0.0 Identities = 327/718 (45%), Positives = 433/718 (60%), Gaps = 39/718 (5%) Frame = -3 Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834 CLSYA H++KF +RA K +FLG+ G KGY LYDL +H + SR+VVF+E FP Sbjct: 45 CLSYATTLQAHRTKFDSRARKAIFLGFKDGTKGYILYDLSSHDIFVSRNVVFYETYFPLR 104 Query: 2833 -----------SIPVPSNXXXXXXXXXXXXXXXXXXTSSAPISQPESHPLRRSSRISKPP 2687 S P+PSN + P S + IS P Sbjct: 105 HSQPVHNASDFSKPLPSNSILDDPVSHTHNSLPLPVMFEPDSTSPSSVNIEPDRTISSPA 164 Query: 2686 AWLSDFITNSVHSSTPMASPSHSAG----------------------PDSGDFSLAPTSF 2573 + +++S H +A P + S + +++ ++ Sbjct: 165 SSSHTPLSSSSHDRPNLAPPPYHDNLRRSTRTITRPGYLEDYHCYSVTGSVNNNISHPNY 224 Query: 2572 NHSSILG-----ATYTAFLANLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLT 2408 SS+L Y +F ++S + EP +FSQA K W AM EL AL++N TW + Sbjct: 225 
PLSSVLSYDNCVPEYKSFCCSISAIIEPKTFSQASKLDCWRKAMDAELLALDENKTWSVV 284 Query: 2407 SLPPGKRAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRL 2228 LP GK IGCKWVYK+K +G++ERYKARLVAKG+ Q G+DY D+FSPVAK+ TVR Sbjct: 285 DLPHGKTPIGCKWVYKIKYHANGSIERYKARLVAKGYTQMEGIDYFDTFSPVAKITTVRF 344 Query: 2227 VIALATIKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKA-KDGEVCHLQRSLYGLKQA 2051 ++ALA+IKGW L QLDVNNAFLHG L+E++YM P GYS A +VC L +SLYGLKQA Sbjct: 345 LLALASIKGWDLEQLDVNNAFLHGDLNEEVYMSLPPGYSSAIGSNKVCRLHKSLYGLKQA 404 Query: 2050 SRQWNAEFCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKR 1871 SRQW ++ L FG+ QS DH L+++ + ++++G EI A+K Sbjct: 405 SRQWYSKLSSALISFGYKQSVSDHSLYIKSTDSEFTALLVYVDDIVLAGNSSKEIQAVKH 464 Query: 1870 YLDDLFTIKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGL 1691 +LD F IKDLG RYFLG EIAR+ G +NQRKY L++L D G L K + P P Sbjct: 465 FLDQKFKIKDLGKLRYFLGFEIARSPKGIFVNQRKYTLELLQDTGFLATKPSNIPFNPTT 524 Query: 1690 KLRAKEGDPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLR 1511 KL + +G PL DP +RRLIGRLLYL TRPD+++SVQ LSQFV+ P H+ AA +L+ Sbjct: 525 KLSSTDGAPLKDPSSYRRLIGRLLYLTNTRPDISFSVQHLSQFVSKPLIPHYTAATRILK 584 Query: 1510 YLKGSPSVGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTV 1331 YLK +P+ GLF+ SSL L Y D+DWA C DTRKS+TG+C+F+GS L+SWK+KKQ TV Sbjct: 585 YLKSAPANGLFFPVSSSLKLTGYADSDWARCPDTRKSITGYCVFIGSSLISWKSKKQNTV 644 Query: 1330 SRSSAEAEYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTK 1151 SRSS EAEYRAL S CEIQWL YL D + P +++CD+++AI++ NP FHER+K Sbjct: 645 SRSSTEAEYRALASLTCEIQWLQYLFQDFKMKFSNPASVFCDSRSAIYLAHNPAFHERSK 704 Query: 1150 HLEIDCHLVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT 977 H+EIDCH++R +S +HL + S Q+AD FTK L AF L SKL L +H+PT Sbjct: 705 HIEIDCHVIREKIQSQLIHLLPIPSNSQIADMFTKPLHFPAFFDLLSKLNLCSIHSPT 762 >gb|KYP65734.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 1013 Score = 625 bits (1611), Expect = 0.0 Identities = 336/704 (47%), Positives = 434/704 (61%), Gaps = 22/704 (3%) Frame = -3 Query: 3013 
CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPF- 2837 CL+YA+ H++KF RA K V LGY G KGY LYDL +H SR+V F E +FPF Sbjct: 313 CLAYASTLQAHRTKFQPRAKKSVLLGYKEGVKGYLLYDLHSHEFFMSRNVFFHEFTFPFH 372 Query: 2836 ----ASIPVPSNXXXXXXXXXXXXXXXXXXTSSAPIS-------------QPESHPLRRS 2708 S+ PS +P S P P R S Sbjct: 373 TPSQTSLTQPSPTPITIQTPISSPYDLDNHVPPSPTSSTSIPPEQPHQPLSPAPAPSRHS 432 Query: 2707 SRISKPPAWLSDFITNSVHSSTPMASPSHSAGPDSGDFSLAPTSFNHSSILGATYTAFLA 2528 +R+ +PP++L D+ + + + + S S + P S +L +Y F Sbjct: 433 TRMRQPPSYLKDYHCSLLAPTGRINSFSGISTPHSISSTLT------YDFCSPSYKQFCL 486 Query: 2527 NLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMR 2348 ++S EP +++QA K W+ AM+ EL AL+ N TW + LP GKR IGCKWVYK+K Sbjct: 487 SVSTNFEPHTYTQASKYDCWIMAMKTELAALDMNQTWSIVDLPSGKRPIGCKWVYKIKYL 546 Query: 2347 PDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNA 2168 DG++ERYKARLVAKG+ Q G+DYLD++SPVAKL TVR+++AL IKGW L QLDVNNA Sbjct: 547 SDGSIERYKARLVAKGYSQTEGLDYLDTYSPVAKLTTVRVLLALTAIKGWFLEQLDVNNA 606 Query: 2167 FLHGYLDEDIYMLPPEGYSKAKDG----EVCHLQRSLYGLKQASRQWNAEFCLKLQQFGF 2000 FLHG L E++YM P G S +VC L +S+YGLKQASRQW ++ L G+ Sbjct: 607 FLHGDLHEEVYMTLPPGLSVPSSSNTAPKVCKLHKSIYGLKQASRQWYSKLSSALISMGY 666 Query: 1999 TQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYF 1820 + S DH LF++ S +I++G EID +K L F IKDLG RYF Sbjct: 667 SPSTADHSLFIKSSSSHFTALLVYVDDIILAGNDKPEIDFIKAQLHKCFKIKDLGNLRYF 726 Query: 1819 LGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEGDPLVDPERFR 1640 LG+EIAR+ G +LNQRKY L+IL D G L K + TP P LKL + G P D +R Sbjct: 727 LGLEIARSNKGILLNQRKYTLEILEDVGFLAAKPSSTPFNPSLKLHSDHGSPYNDETAYR 786 Query: 1639 RLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSS 1460 RLIGRLLYL TRPD++Y VQQLSQFV+ P H+ AA +LRYLKGS GLFYS+ +S Sbjct: 787 RLIGRLLYLTTTRPDISYVVQQLSQFVSKPLDIHYQAATRILRYLKGSHGRGLFYSSSAS 846 Query: 1459 LCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVC 1280 L L A+ D+DWASC +RKS+TGFC+FLGS L+SW++KKQ+T+SRSS+EAEYRAL S C Sbjct: 847 
LKLSAFADSDWASCSISRKSITGFCVFLGSSLISWRSKKQSTISRSSSEAEYRALASLTC 906 Query: 1279 EIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGF 1100 E+QWL YL DL S+ P +++CDN++AI++ NP FHERTKH+EIDCH++R +S Sbjct: 907 ELQWLHYLFNDLKTSLNFPTSVFCDNKSAIYLAHNPTFHERTKHIEIDCHVIREKIQSRL 966 Query: 1099 LHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT*GG 968 LHL V S QLAD FTK L +F SKLGL+D+H+ GG Sbjct: 967 LHLLPVPSSSQLADAFTKPLHATSFNSFVSKLGLYDVHSSACGG 1010 >gb|OMO65653.1| hypothetical protein CCACVL1_21443 [Corchorus capsularis] Length = 1245 Score = 630 bits (1624), Expect = 0.0 Identities = 331/692 (47%), Positives = 435/692 (62%), Gaps = 13/692 (1%) Frame = -3 Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834 CL YA KF+ R+ KC+F+GY G KGYR+YDL + SRDV F+E+ FPF Sbjct: 558 CLCYALQKPKPNDKFSPRSSKCIFVGYPNGTKGYRVYDLTTKKIFVSRDVRFYENQFPFE 617 Query: 2833 SIPVPSNXXXXXXXXXXXXXXXXXXTSSAPIS----QPESHP--------LRRSSRISKP 2690 + +N S P + QP+ HP R R Sbjct: 618 NTSTSTNDQTVVPLPALEDTDLSITHDSIPPNPPQEQPQPHPPTNPPNQPSTRPQRTKTR 677 Query: 2689 PAWLSDFITNSVHSSTPMASPSHSAGPDSGD-FSLAPTSFNHSSILGATYTAFLANLSNV 2513 P L D + N+ +S +H A SG +SL+ +F +++ AFLA +S Sbjct: 678 PKRLDDCVCNNSKVDNSPSSLTHEAS--SGTLYSLS--NFISYDNFHSSHKAFLAAISLR 733 Query: 2512 EEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMRPDGTV 2333 +EP SFSQA KS W +AMQ+EL ALE NNTW L +LPP K+ IGCKW++K+K + DGT+ Sbjct: 734 DEPKSFSQAVKSPQWREAMQKELAALENNNTWTLETLPPRKKPIGCKWIFKIKYKSDGTI 793 Query: 2332 ERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNAFLHGY 2153 ERYKAR VAKG++Q G+D+ ++F+PVAKLVTVR ++A+A IK W L QLDVNNAFLHG Sbjct: 794 ERYKARFVAKGYNQIEGMDFHETFAPVAKLVTVRCLLAIAAIKNWELHQLDVNNAFLHGD 853 Query: 2152 LDEDIYMLPPEGYSKAKDGEVCHLQRSLYGLKQASRQWNAEFCLKLQQFGFTQSGHDHCL 1973 LDE++YM P GY D VC +++SLYGLKQASR W A+F L +FGF QS D+ L Sbjct: 854 LDEEVYMSLPPGYGDKNDSRVCRVRKSLYGLKQASRNWFAKFFAALLEFGFIQSTVDYSL 913 Query: 1972 FVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYFLGMEIARNT 1793 F + +II+G I +LK++LD F IKDLG +YFLG+E+AR++ Sbjct: 914 
FTLTTGSSFLVVLVYVDDLIIAGDDSVRIRSLKQHLDSRFHIKDLGPLKYFLGIEVARSS 973 Query: 1792 DGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEGDPLVDPERFRRLIGRLLYL 1613 G L QRKY LDIL + GM K + P+ L G P+ DP ++RRL+GRL+YL Sbjct: 974 SGIFLCQRKYTLDILEECGMTDAKPSAFPMEQKHNLTHDTGPPVQDPMQYRRLVGRLIYL 1033 Query: 1612 NLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLEAYCDA 1433 +TRP+++Y+V LSQF+N P H DAA+ +LRYLK P G+F+S+ SS L + D+ Sbjct: 1034 TITRPEISYAVHILSQFMNDPRQPHLDAALRVLRYLKSCPGQGIFFSSSSSPHLTGFSDS 1093 Query: 1432 DWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVCEIQWLSYLC 1253 DWASC TR+S TG+ LGS +SWKTKKQTTVSRSSAEAEYRA+ + V E+ WL L Sbjct: 1094 DWASCPQTRRSTTGYITMLGSSPISWKTKKQTTVSRSSAEAEYRAMAATVSELLWLRSLL 1153 Query: 1252 ADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGFLHLGHVSSR 1073 LG+ P+ L+CDNQ AIHI NPVFHERTKH+E+DCH +R+ ++ + H+SS+ Sbjct: 1154 QTLGIPHQQPMALFCDNQVAIHIATNPVFHERTKHIELDCHFIRSHIQAKSIQTSHISSK 1213 Query: 1072 LQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT 977 LQLAD FTK+LGR F L KLG+F+LH PT Sbjct: 1214 LQLADIFTKALGRDQFQFLLRKLGIFNLHAPT 1245 Score = 182 bits (463), Expect = 3e-43 Identities = 93/209 (44%), Positives = 132/209 (63%), Gaps = 1/209 (0%) Frame = -3 Query: 625 PLKLYSSDHPGLSLVSSQLTGNNYLSWRRSMLIALGAKTKLGFINGKMEIPKEDSPKYDQ 446 P L SDHPG LVS L G+NY +W R+M AL A+ K GF++G + P+ SP Sbjct: 29 PYLLQPSDHPGAILVSCPLNGDNYPTWARAMTNALRARNKYGFVDGSLAKPEATSPDVST 88 Query: 445 WRKVDCMVISWILNSISKDLVDAFIYCDSAKDLWDDIAKRFGDCNGPLIYQLERDIANMN 266 W K + MVISWI NS+S DL ++ Y D+A+++W D+ +RF N P I QL+RD+A Sbjct: 89 WEKCNSMVISWIFNSLSSDLHNSVAYVDTAREMWLDLEERFSQGNAPRINQLKRDLALTF 148 Query: 265 QGNMSVVEYFTKLKRLWDELACIMPLPACESDTRK-LIDERDMNRKLMQFLMGLHESYDQ 89 Q NMSV Y+TKLK +WDEL +P C K L+ ER+ K+ QF+MGL +S+ Sbjct: 149 QINMSVAAYYTKLKGIWDELQTYSTIPPCTCGAAKELLLERE-REKVHQFIMGLDDSFRS 207 Query: 88 VRNQLLLMDPLPSVDKAYSMALRVEKQRN 2 V + +L ++PLPS+ KAY++ R E++ + Sbjct: 208 VSSHILNIEPLPSLSKAYALVTRAERENS 236 >gb|PNY00469.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 778 Score = 614 bits 
(1583), Expect = 0.0 Identities = 329/736 (44%), Positives = 445/736 (60%), Gaps = 57/736 (7%) Frame = -3 Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834 CLS+A H++KF +RA KCVF+GY G KGY LYDL +H++ SR+VVF+E PF Sbjct: 45 CLSFATTLQAHRTKFDSRARKCVFIGYKDGTKGYILYDLHSHNIFLSRNVVFYEHVLPFK 104 Query: 2833 SIPVPS---NXXXXXXXXXXXXXXXXXXTSSAPISQPESHPLRRSSRISKP--PAWLSDF 2669 S+P P+ N + P+S + + ++ P P S Sbjct: 105 SVPGPTSSHNSPTFPLYDDPLDISHNPCVDTFPLSTGSLDNVSLNPALTPPLVPTLDSSP 164 Query: 2668 ITNSVHSSTPMASPS----HSAGPD----------------------------------- 2606 +T ++++TP A PS HSA Sbjct: 165 LTPPINTATP-APPSFDSAHSAADQPSPNLDSVPVPSEPSIPLPTRVSTRVTRPPSYLQD 223 Query: 2605 ------SGDFSLAPTSFNH--SSIL-----GATYTAFLANLSNVEEPSSFSQACKSADWV 2465 SG + ++ H SS+L Y F ++S+ EP++++QA K W Sbjct: 224 YHCNIKSGCTNQVSSNIVHPLSSVLSYNTCSPAYKLFCCSISSTIEPTTYNQASKFDCWK 283 Query: 2464 DAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEH 2285 AM E+TALE N TW + LP GK IGCKWVYK+K +GT+ERYKARLVAKG+ Q Sbjct: 284 KAMDAEITALELNKTWTVVDLPCGKVPIGCKWVYKIKYHANGTIERYKARLVAKGYTQME 343 Query: 2284 GVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKA 2105 GVDY D+FSPVAK+ TVR+++A+A ++GW L QLDVNNAFLHG L E++YM P GY A Sbjct: 344 GVDYFDTFSPVAKMTTVRVLLAVAAVRGWHLEQLDVNNAFLHGDLHEEVYMSLPPGYD-A 402 Query: 2104 KDGEVCHLQRSLYGLKQASRQWNAEFCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXX 1925 +VC L +SLYGLKQASRQW ++ L G+ S DH L+V+ Sbjct: 403 TPSKVCKLNKSLYGLKQASRQWYSKLSAALISLGYQASQADHSLYVKSHGTSFTALLVYV 462 Query: 1924 XXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILH 1745 ++++GT + EI ++K +LD F IKDLG R+FLG+EIAR++ G LNQRKY L++L Sbjct: 463 DDIVLAGTSIEEIKSVKLFLDQQFKIKDLGPLRFFLGLEIARSSSGIFLNQRKYTLELLE 522 Query: 1744 DAGMLHCKAAITPLPPGLKLRAKEGDPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQ 1565 D G L K A PL P KL A +G P DP +RRLIGRLLYL TRPD++++VQ LSQ Sbjct: 523 DTGFLGSKPATVPLDPHTKLSATDGVPFDDPSGYRRLIGRLLYLTHTRPDISFAVQHLSQ 582 Query: 1564 FVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFC 1385 +V++P H+ AA +LRYLK P+ G+ +S+HS L L 
+ D+DWA C +TR+S+TG+C Sbjct: 583 YVSTPLVPHYQAATRILRYLKSCPAKGVLFSSHSPLQLHGFADSDWACCPNTRRSVTGYC 642 Query: 1384 IFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCD 1205 + LGS L+SWK+KKQ TVSRSS EAEYRAL S CE+QWL YL DL ++ P +++CD Sbjct: 643 VLLGSSLISWKSKKQNTVSRSSTEAEYRALASLTCELQWLQYLFQDLHITFPQSASVYCD 702 Query: 1204 NQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAF 1025 N++AI++ NP FHER+KH+E+DCH++R +S +HL V S+ QLAD FTK L AF Sbjct: 703 NKSAIYLAHNPTFHERSKHIELDCHIIREKLQSKLIHLLSVPSKSQLADVFTKPLHSPAF 762 Query: 1024 LLLCSKLGLFDLHNPT 977 + SKLGL +H+PT Sbjct: 763 SSMLSKLGLCSIHHPT 778 >dbj|GAU41679.1| hypothetical protein TSUD_272630 [Trifolium subterraneum] Length = 1178 Score = 627 bits (1616), Expect = 0.0 Identities = 329/716 (45%), Positives = 442/716 (61%), Gaps = 37/716 (5%) Frame = -3 Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834 CLSYA H++KF +RA K +FLGY G KGY LYDL +H + SR+V+F+E FPF Sbjct: 476 CLSYATTLQAHRTKFVSRARKAIFLGYKDGTKGYILYDLHSHEIFVSRNVIFYETDFPFH 535 Query: 2833 -----------------------------SIPVPSNXXXXXXXXXXXXXXXXXXTSSAPI 2741 ++P+P S PI Sbjct: 536 LSNSVKTDSASPASHLNHTLLYDAEPDPNALPIP---VMHEPDLTLSPIIGPSYNDSTPI 592 Query: 2740 SQPESHP------LRRSSRISKPPAWLSDFITNSVHSSTPMASPSHSAGPDSGDFSLAPT 2579 + PES P LR+SSR+ + P L F H T + + HSA + + L+ Sbjct: 593 NSPESSPIPNPAPLRKSSRVIQRPRHLEGF-----HCETLIGT--HSAASSNTVYPLSSV 645 Query: 2578 -SFNHSSILGATYTAFLANLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSL 2402 S+N+ + Y A ++S + EP +++QA K W +AM EL AL++N TW + L Sbjct: 646 LSYNNCA---PNYHALCCSISAIVEPKTYTQASKFECWRNAMNAELLALDENKTWSVVDL 702 Query: 2401 PPGKRAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVI 2222 P GK +GCKWVYKVK +G++ERYKARLVAKG+ Q GVDY D+FSPVAK+ TVR+++ Sbjct: 703 PNGKVPVGCKWVYKVKYHANGSIERYKARLVAKGYTQLEGVDYFDTFSPVAKITTVRVLL 762 Query: 2221 ALATIKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKAKDG-EVCHLQRSLYGLKQASR 2045 ALA+IKGW L QLDVNNAFLHG L+ED+YM P G++ + +VC L +S+YGLKQASR Sbjct: 763 ALASIKGWHLEQLDVNNAFLHGDLNEDVYMSLPPGFAATNESNKVCKLHKSIYGLKQASR 
822 Query: 2044 QWNAEFCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYL 1865 QW ++ L G+T S DH L+++ + ++++G + EI +K +L Sbjct: 823 QWYSKLSSSLVSLGYTPSQSDHSLYIKSTTNSFTALLVYVDDIVLAGNSIHEIQTVKLFL 882 Query: 1864 DDLFTIKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKL 1685 D F IKDLG RYFL +EIAR+ G +NQRKY L++L D G+L K + P P KL Sbjct: 883 DQKFKIKDLGKLRYFLVLEIARSDTGIFVNQRKYTLELLEDVGLLGTKPSSIPFHPTTKL 942 Query: 1684 RAKEGDPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYL 1505 + +G PL DP +RRLIGRLLYL TRPD+++SVQ LSQFV+ P H++AAMH+L+YL Sbjct: 943 SSTDGAPLDDPSSYRRLIGRLLYLTHTRPDISFSVQHLSQFVSKPLVPHYNAAMHILKYL 1002 Query: 1504 KGSPSVGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSR 1325 K P+ G+F SA SSL + A+ D+DWA C +TRKS+ GFC+ LGS L+SWK+KKQ TVSR Sbjct: 1003 KSDPAKGIFLSASSSLKISAFADSDWARCPETRKSIIGFCVLLGSSLISWKSKKQNTVSR 1062 Query: 1324 SSAEAEYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHL 1145 SS EAEYRAL S CEIQWL Y+ D + P ++CDN++AI++ NP FHER+KH+ Sbjct: 1063 SSTEAEYRALASLTCEIQWLQYIFQDFKIIFSNPAYVFCDNKSAIYLAHNPTFHERSKHI 1122 Query: 1144 EIDCHLVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT 977 E+DCH++R +S +HL V + QLAD FTK L AF SKLGL +H+PT Sbjct: 1123 ELDCHVIREKIQSKLIHLLPVPTTSQLADVFTKPLNHPAFSSFLSKLGLCSIHSPT 1178 Score = 142 bits (359), Expect = 1e-30 Identities = 75/216 (34%), Positives = 126/216 (58%), Gaps = 9/216 (4%) Frame = -3 Query: 628 DPLKLYSSDHPGLSLVSSQLTGNNYLSWRRSMLIALGAKTKLGFINGKMEIPKEDSPKYD 449 +P L+ +++P + LVS L NY +W RSM IAL +K K FI+G + P P Y Sbjct: 13 NPYYLHPNENPAVILVSPPLDHKNYHTWSRSMQIALISKNKDKFIDGTLVKPSPLDPLYS 72 Query: 448 QWRKVDCMVISWILNSISKDLVDAFIYCDSAKDLWDDIAKRFGDCNGPLIYQLERDIANM 269 W + + MV++WI S+S + + ++ DSA LW ++ RF + I L+ ++ + Sbjct: 73 PWIRCNTMVLAWIHRSLSDSIARSVLWIDSAASLWKNLRTRFSQGDIFRISDLQEELYRL 132 Query: 268 NQGNMSVVEYFTKLKRLWDELACIMPLPACES---------DTRKLIDERDMNRKLMQFL 116 QGN+ V +YFTKL+ LWDEL P+P C+ ++ KL E+D +++FL Sbjct: 133 RQGNLDVSDYFTKLQVLWDELENYRPIPLCKCSIACTCGAVESFKLYREQDY---VIRFL 189 Query: 115 
MGLHESYDQVRNQLLLMDPLPSVDKAYSMALRVEKQ 8 GL++ + ++Q++L++PLP VD +SM ++ E++ Sbjct: 190 KGLNDRFSNTKSQIMLINPLPDVDTVFSMLIQQERE 225 >gb|KYP42321.1| Copia protein [Cajanus cajan] Length = 1456 Score = 630 bits (1624), Expect = 0.0 Identities = 334/736 (45%), Positives = 451/736 (61%), Gaps = 57/736 (7%) Frame = -3 Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834 CL++A ++K RA KC+FLGY G KG+ L++L N S L SRDV+F+E FP++ Sbjct: 732 CLAFATTLSSKRTKLDRRASKCIFLGYKNGTKGFLLFNLHNKSFLISRDVLFYEKIFPYS 791 Query: 2833 S-------------------------------------------IPVPSNXXXXXXXXXX 2783 + P+PS+ Sbjct: 792 AHVPSMSASDSLLLDVVKDNDTTIYSDPFPTTTFSHGSPSIPLDTPLPSSETTISTDRPP 851 Query: 2782 XXXXXXXXTSSAPISQPE--------------SHPLRRSSRISKPPAWLSDFITNSVHSS 2645 +A +S PE R S+RI KPP +L ++ ++ SS Sbjct: 852 FSPINTCPIPTATLSTPELPSSNTTNDASQVVMPQTRVSTRIRKPPRYLQEYYCENLASS 911 Query: 2644 TPMASPSHSAGPDSGDFSLAPTSFNHSSILGATYTAFLANLSNVEEPSSFSQACKSADWV 2465 + ++ + +SF + ++T+F ++S EP+SF +A W Sbjct: 912 SAASNCLYPL-----------SSFVTYNNCSPSHTSFCLSISAQHEPTSFKEANSEECWR 960 Query: 2464 DAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEH 2285 AM+ EL ALE+N TW L LP GKR +GCKWVY+VK + DG+VERYKARLVAKGF Q Sbjct: 961 RAMEAELQALEKNQTWSLVRLPEGKRPVGCKWVYRVKYKVDGSVERYKARLVAKGFTQTE 1020 Query: 2284 GVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKA 2105 GVDY ++FSPV KL TVR +++LA W L QLDV+NAFLHG L E++YM PP G+ + Sbjct: 1021 GVDYFETFSPVVKLSTVRFLLSLAAAHNWFLHQLDVDNAFLHGDLFEEVYMKPPPGFKLS 1080 Query: 2104 KDGEVCHLQRSLYGLKQASRQWNAEFCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXX 1925 VC L +SLYGLKQASRQWN + L F QS DH LF+++S Sbjct: 1081 HPRLVCKLHKSLYGLKQASRQWNQKLTEALISLNFIQSSTDHSLFIKKSHSSITALLVYV 1140 Query: 1924 XXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILH 1745 V+++G ++EI A+K YL F IKDLG ++FLG+EIAR+ G +LNQRKY L++L Sbjct: 1141 DDVVLTGNDMAEISAVKAYLHAQFHIKDLGPLKFFLGLEIARSQSGLILNQRKYCLELLS 1200 Query: 1744 DAGMLHCKAAITPLPPGLKLRAKEGDPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQ 1565 + G+ CK TP+ +KL A EG PL DP FRRLIGRLLYL 
TRPD++++VQQLSQ Sbjct: 1201 EHGLTDCKPVSTPIDASVKLYASEGLPLDDPTIFRRLIGRLLYLTNTRPDISFAVQQLSQ 1260 Query: 1564 FVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFC 1385 FV+SP +H+ AA+ +LRYLK SP++GLFY + + ++A+ D+DWASC +TR+S+TGFC Sbjct: 1261 FVDSPRATHFQAALRILRYLKSSPALGLFYPSQTEHRIQAFSDSDWASCPNTRRSVTGFC 1320 Query: 1384 IFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCD 1205 IF GS L+SWK+KKQ+TVSRSS+EAEYRAL S CE+QWL +LC DL ++IPTP +++CD Sbjct: 1321 IFYGSALISWKSKKQSTVSRSSSEAEYRALASVTCELQWLLFLCHDLSINIPTPFSIFCD 1380 Query: 1204 NQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAF 1025 +Q+AI+I +NP FHERTKH+E+DCHL R + G +HL HV S+ QLAD FTK+L F Sbjct: 1381 SQSAIYIAKNPTFHERTKHIEVDCHLTRLKIQQGLIHLFHVPSKSQLADVFTKALYPRNF 1440 Query: 1024 LLLCSKLGLFDLHNPT 977 SKL L D++NPT Sbjct: 1441 TEAVSKLCLIDIYNPT 1456 Score = 114 bits (286), Expect = 6e-22 Identities = 59/183 (32%), Positives = 103/183 (56%), Gaps = 7/183 (3%) Frame = -3 Query: 535 MLIALGAKTKLGFINGKMEIPKEDSPKYDQWRKVDCMVISWILNSISKDLVDAFIYCDSA 356 ML AL +K K FI+G + P P WR+ + V+SW++ S++ + + +Y D+A Sbjct: 1 MLTALESKNKEQFIDGSLPSPPTSDPLRSTWRRCNKTVMSWLIRSMTPSIAQSVLYMDTA 60 Query: 355 KDLWDDIAKRFGDCNGPLIYQLERDIANMNQGNMSVVEYFTKLKRLWDELACIMPLPACE 176 ++W D+ +RF + I L+ + QG+ +V +Y+T LK LW +L + C Sbjct: 61 AEIWKDLCERFSHGDKFRISDLQASVHECKQGDSTVSQYYTHLKTLWKQLEQYRSVLICS 120 Query: 175 SDT-------RKLIDERDMNRKLMQFLMGLHESYDQVRNQLLLMDPLPSVDKAYSMALRV 17 D K+ ER+ + +++FL GL+E Y QVR+ +L+MDP+PS+ K +S+ + Sbjct: 121 CDNPCSCGILLKIKKERE-DDCVIKFLRGLNEEYSQVRSNILMMDPMPSITKTFSLIQQH 179 Query: 16 EKQ 8 E++ Sbjct: 180 ERE 182 >gb|KYP34298.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 1002 Score = 612 bits (1578), Expect = 0.0 Identities = 318/696 (45%), Positives = 437/696 (62%), Gaps = 14/696 (2%) Frame = -3 Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834 CL+YA H ++ KF R +CVFLG+ KG LYDL++ SR V +FE FPF Sbjct: 316 
CLAYATTLHHNRKKFDPRGRRCVFLGFKPQVKGSILYDLNSRETFLSRHVEYFEHIFPFL 375 Query: 2833 ---------SIPVPSNXXXXXXXXXXXXXXXXXXTSSAPISQPESHP-LRRSSRISKPPA 2684 +I +P + +S+P+S P +R+S+R K P+ Sbjct: 376 PTSPLDLTQTISLPRHQPPLPIDTDPTPLSTNTTPTSSPVSVVPPPPFVRKSTRPRKLPS 435 Query: 2683 WLSDFITN--SVHSSTPMASPSHSAGPDSGDFSLAPTSFNHSSILGATYTAFLANLSNVE 2510 +L D+ + H+S ++ P +S +L+P+ AF ++S+++ Sbjct: 436 YLHDYHHTLLTTHNSPTISQPLYSIHNHISYSNLSPSQ-----------KAFSLSISSIK 484 Query: 2509 EPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMRPDGTVE 2330 EP+S+ +A + W A+Q ELTALE+NNTW+LT LPP K+ +GCKWV+K+K DGT+E Sbjct: 485 EPNSYVEAIQDESWKTAIQTELTALEKNNTWILTPLPPNKQVVGCKWVFKLKFNSDGTIE 544 Query: 2329 RYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNAFLHGYL 2150 R+KARLVAKG+ Q +DYLD+FSPV K+ TVR ++A+AT K W + QLDVN FLHG L Sbjct: 545 RHKARLVAKGYTQTETLDYLDTFSPVVKMTTVRTLLAVATAKNWHIHQLDVNTTFLHGDL 604 Query: 2149 DEDIYMLPPEGY--SKAKDGEVCHLQRSLYGLKQASRQWNAEFCLKLQQFGFTQSGHDHC 1976 E++YM PP G S + VC L +SLYGLKQASRQWNA+ L GF QS D+ Sbjct: 605 HEEVYMTPPPGLTVSPHQSNCVCKLVKSLYGLKQASRQWNAKLTSVLIDSGFKQSMADYS 664 Query: 1975 LFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYFLGMEIARN 1796 LF +Q ++++G +EI+ +K LD FTIKDLG +YFLGME+AR+ Sbjct: 665 LFTKQFGAKFTAILVYVDDLVLAGNDPTEINYIKSLLDQKFTIKDLGQLKYFLGMEVARS 724 Query: 1795 TDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEGDPLVDPERFRRLIGRLLY 1616 + G L QRKY LD++ D G+L K +P+ +KL G PL DP ++RRL+GRL+Y Sbjct: 725 STGIALYQRKYALDLIEDTGLLASKPCKSPMDHSVKLHKTVGTPLTDPTQYRRLLGRLIY 784 Query: 1615 LNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLEAYCD 1436 L TR D+++SV LSQF++ P H+ AA+ +L+Y+K +P GLF+ + S L L+ Y D Sbjct: 785 LTNTRADISFSVNHLSQFMDQPTDVHYQAALRILKYVKNAPGKGLFFPSSSDLTLKGYSD 844 Query: 1435 ADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVCEIQWLSYL 1256 +DWASC DTR+S+TGF FLG L+SWK+KKQ TVS+SSAEAEYRAL CE QWLSYL Sbjct: 845 SDWASCSDTRRSVTGFSFFLGPALISWKSKKQATVSKSSAEAEYRALAQSTCEAQWLSYL 904 Query: 1255 CADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGFLHLGHVSS 1076 D GL P+ L+CDNQ+A+HI 
NPVFHERTK++E+DCH+VR + G +HL +S+ Sbjct: 905 LHDFGLHSFHPIVLFCDNQSALHIASNPVFHERTKNIELDCHIVREKLQVGLIHLLPIST 964 Query: 1075 RLQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT*GG 968 QLAD FTK+L F + KLG+FD+H+ GG Sbjct: 965 ADQLADVFTKALSLRPFEQIIFKLGMFDIHSSLRGG 1000 >ref|XP_012486681.1| PREDICTED: LOW QUALITY PROTEIN: retrovirus-related Pol polyprotein from transposon TNT 1-94 [Gossypium raimondii] Length = 683 Score = 600 bits (1548), Expect = 0.0 Identities = 318/689 (46%), Positives = 429/689 (62%), Gaps = 11/689 (1%) Frame = -3 Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834 CLS+A+ H+ KF RA +CVFLGY KGY L D++ ++ SR+V F E FPF Sbjct: 7 CLSFASTLSAHRKKFDPRAKQCVFLGYKPHVKGYILLDIETRAIFVSRNVTFHETIFPFL 66 Query: 2833 -------SIPVPSNXXXXXXXXXXXXXXXXXXTSSAPISQPESHPLR--RSSRISKPPAW 2681 + PV S+ S P + P R R +PP++ Sbjct: 67 QHSLNNPTTPVGLLASDTIYDSPISPPQPSSTDQSSSTSHPPTQPSTSSRPQRNRRPPSY 126 Query: 2680 LSDFITNSVHSSTPMASPSHSAGPDSGDFSLAPTSFNHSSILGATYTAFLANLSNVEEPS 2501 L D+ + ++T HS +L+P + + + A+ EP Sbjct: 127 LQDYQHYQLPAATNHPGTPHSIFNCISYHNLSPQHLHFTLAISASI-----------EPK 175 Query: 2500 SFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMRPDGTVERYK 2321 ++ QA K W +AMQ E+ ALEQNNTW +T+LPPGK GCKWV++VK R DG+ ERYK Sbjct: 176 TYKQASKFTHWNEAMQAEINALEQNNTWTMTTLPPGKTPXGCKWVFRVKHRADGSTERYK 235 Query: 2320 ARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNAFLHGYLDED 2141 ARLVAKG+ Q GVDY D+FSPVAK+ TVRL++ALAT + W + QLDVNNAFLHG L+ED Sbjct: 236 ARLVAKGYTQI-GVDYFDTFSPVAKITTVRLLLALATSRHWHIQQLDVNNAFLHGDLNED 294 Query: 2140 IYMLPPEGYSKAKDGEVCHLQRSLYGLKQASRQWNAEFCLKLQQFGFTQSGHDHCLFVRQ 1961 +YMLPP G+S +VC L +S+YGLKQASRQW ++ L G+ QS DH +F ++ Sbjct: 295 VYMLPPPGFSHDST-KVCKLHKSIYGLKQASRQWFSKLTTALISLGYIQSTADHSMFTKK 353 Query: 1960 SXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYFLGMEIARNTDGSV 1781 +I++GT EI +K++LD F IKDLG +YFLG+E+AR + G Sbjct: 354 HSEDFTVLLIYVDDIILTGTSSPEIMKVKQFLDTTFRIKDLGDLKYFLGLEVARTSQGIH 413 Query: 1780 LNQRKYVLDILHDAGMLHCKAAITPLPPG--LKLRAKEGDPLVDPERFRRLIGRLLYLNL 1607 ++QRKY L+IL ++G + 
CK A TP+ KL + +G+ L D +R+L+G+LLYL Sbjct: 414 ISQRKYALEILQESGFIECKPAKTPMATKSVYKLTSTDGELLSDITSYRQLVGKLLYLTS 473 Query: 1606 TRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLEAYCDADW 1427 TR D+T++VQQLSQF++ P T+H AA +LRYLKG PS GLFY A SS L+A+ D+DW Sbjct: 474 TRLDLTFAVQQLSQFMDKPTTNHLQAAHRVLRYLKGCPSTGLFYPASSSFELKAFSDSDW 533 Query: 1426 ASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVCEIQWLSYLCAD 1247 A C +TR+S+TG+CIF G L+SW+ KKQ TVSRSS+EAEYRAL S VCE+QWL YL D Sbjct: 534 AGCPETRRSITGYCIFFGEALISWRAKKQPTVSRSSSEAEYRALASTVCEVQWLHYLLCD 593 Query: 1246 LGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGFLHLGHVSSRLQ 1067 L + I ++CDN++ I I NP FHERTKH+EIDCH+VR + +HL +S Q Sbjct: 594 LHVPISHATPVFCDNKSTIQIASNPTFHERTKHIEIDCHIVREKLQKDIVHLLPCTSSAQ 653 Query: 1066 LADFFTKSLGRAAFLLLCSKLGLFDLHNP 980 LAD FTK+L F + SKLG+ ++H+P Sbjct: 654 LADLFTKALAAQPFQDMISKLGMLNIHSP 682 >gb|KYP55668.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 1136 Score = 615 bits (1587), Expect = 0.0 Identities = 335/700 (47%), Positives = 437/700 (62%), Gaps = 18/700 (2%) Frame = -3 Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFP-- 2840 CL Y++ H++K RA+ C+FLG+ KGY L +L H LL S++V+F ED FP Sbjct: 456 CLCYSSIITSHRTKLDPRAHPCIFLGFKPHTKGYLLVNLHTHGLLVSQNVIFHEDHFPSF 515 Query: 2839 -------FAS-IPVPSNXXXXXXXXXXXXXXXXXXTSSAPISQPESH----PLRRSSRIS 2696 F+S +P+P N SS P P+ H PLRRS+R Sbjct: 516 TKPNSPSFSSPVPIPYNYADYPSFPSSSIVE-----SSEP-PPPDQHSSPPPLRRSTRPR 569 Query: 2695 KPPAWLSDF----ITNSVHSSTPMASPSHSAGPDSGDFSLAPTSFNHSSILGATYTAFLA 2528 +PP +L DF + HSST + P HS + SF+H ++ Sbjct: 570 RPPTYLQDFHGAFTSTGPHSSTGIRHPLHSFI----SYDRLSPSFHH----------YVF 615 Query: 2527 NLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMR 2348 ++S+V +P +F +A KS W+ AM E++ALE NNTWVLT+LPP K AIGC+WVYKVK + Sbjct: 616 SISSVTKPKNFVEASKSDSWLKAMHEEISALEANNTWVLTTLPPHKTAIGCRWVYKVKHK 675 Query: 2347 PDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNA 2168 DG+++RYKARLVAKG+ Q G+D+ D+FSPVAKL 
TVRL+I+LA I L QLDVNN+ Sbjct: 676 ADGSIDRYKARLVAKGYTQMEGLDFFDTFSPVAKLTTVRLLISLAAIHNCHLKQLDVNNS 735 Query: 2167 FLHGYLDEDIYMLPPEGYSKAKDGEVCHLQRSLYGLKQASRQWNAEFCLKLQQFGFTQSG 1988 FLHG L+E++YM P G + + G+VC LQRSLYGLKQASRQW A L Q G+ S Sbjct: 736 FLHGDLNEEVYMQLPPGITPSFPGQVCRLQRSLYGLKQASRQWYARLSSFLIQHGYVPSP 795 Query: 1987 HDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYFLGME 1808 DH LF++ S ++++G L+EI L L + F IKDLG +YFLG+E Sbjct: 796 SDHSLFLKCSPAITTAILIYVDDIVLAGNDLTEIHHLTSLLHNTFQIKDLGNLKYFLGLE 855 Query: 1807 IARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEGDPLVDPERFRRLIG 1628 +ARN G L QRKY LD+L D GML K TP+ L A G P D +RRL+G Sbjct: 856 VARNHTGIHLCQRKYTLDLLSDTGMLASKPVSTPMDYSTHLSASSGTPFTDTAAYRRLVG 915 Query: 1627 RLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLE 1448 RL+YL TRP + Y+VQQLSQFV++P T+H A +L YLKG+P G+F S +SS+ L Sbjct: 916 RLIYLPNTRPAIAYAVQQLSQFVSNPPTAHRQALFRILCYLKGTPGSGIFLSVNSSVQLR 975 Query: 1447 AYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVCEIQW 1268 A+ D DWA C DTR+S+TGF ++LG L+SWK+KKQ TVSRSS+EAEYRAL + CE+QW Sbjct: 976 AFSDYDWAGCPDTRRSITGFAVYLGDSLISWKSKKQITVSRSSSEAEYRALATTTCELQW 1035 Query: 1267 LSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGFLHLG 1088 LSYL D + + P L+CDNQ A+ I NP+FHERTKH+EIDCH+VR+ +G L L Sbjct: 1036 LSYLLKDFHIDLIRPSILYCDNQFALQIASNPIFHERTKHIEIDCHIVRDKVSTGLLKLL 1095 Query: 1087 HVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT*GG 968 VSS LQLAD TK L F SKLG+ ++H+ GG Sbjct: 1096 PVSSSLQLADILTKPLSPFVFHSHYSKLGMLNIHSQLEGG 1135 >gb|PNX93928.1| hypothetical protein L195_g017092, partial [Trifolium pratense] Length = 865 Score = 604 bits (1558), Expect = 0.0 Identities = 320/721 (44%), Positives = 438/721 (60%), Gaps = 42/721 (5%) Frame = -3 Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFP-- 2840 CL YA HP KF RA + +F+GY TGQKGY++YD + + SRDV F E +FP Sbjct: 153 CLCYATIVHP-THKFDPRAKRGIFVGYPTGQKGYKIYDPETKTFFVSRDVKFCETNFPSI 211 Query: 2839 -----------------FASIPVPSNXXXXXXXXXXXXXXXXXXTS---------SAPIS 2738 +P P++ S ++PI 
Sbjct: 212 PNTSEPNLISSHPSYEAIDDLPSPTSSHHQSQQTDIPSTHEPNSPSHITTETSSAASPIV 271 Query: 2737 QPE---SHP----------LRRSSRISKPPAWLSDFITNSVHSSTPMASPSHSAGPDSGD 2597 +P +H +R+S R PP W +D+ ++ + TP + P SG Sbjct: 272 EPTPLTTHTTDPPTPFIPQVRKSVRDKHPPIWHNDYHMSTQVNKTP-------SEPTSGS 324 Query: 2596 FSLAPTSFNHS-SILGATYTAFLANLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNT 2420 + P S S S + ++ AFLAN++ EP S+ QA W DAM EL ALEQNNT Sbjct: 325 GTRYPLSHYLSYSRISSSNCAFLANITAHREPQSYDQAVHDPLWQDAMNAELEALEQNNT 384 Query: 2419 WVLTSLPPGKRAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLV 2240 W L LP G + IGCKWVYK+K + DGT+ERYKARLVAKG+ Q G+DY ++FSP AK+ Sbjct: 385 WSLVPLPSGHKPIGCKWVYKIKYKSDGTIERYKARLVAKGYTQVEGIDYQETFSPTAKVT 444 Query: 2239 TVRLVIALATIKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKAKDGEVCHLQRSLYGL 2060 T+R ++ +A + W + QLDV NAFLHG L E +YM PP G + + VC L +SLYGL Sbjct: 445 TLRCLLTVAAARNWFIHQLDVQNAFLHGDLHELVYMEPPPGLRRQGENVVCRLNKSLYGL 504 Query: 2059 KQASRQWNAEFCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDA 1880 KQASR W + F +Q+ G+ QS D+ LF + ++++G L E+ Sbjct: 505 KQASRNWFSTFSEVIQKAGYQQSKADYSLFTKSQGTSFTAVLIYVDDILLTGNDLQEMKR 564 Query: 1879 LKRYLDDLFTIKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLP 1700 LK +L F IKDLG +YFLG+E +R+ G ++QRKY LDIL D+G+ + P+ Sbjct: 565 LKEFLLKRFRIKDLGNLKYFLGIEFSRSKKGIFMSQRKYALDILQDSGLTGARPDKFPME 624 Query: 1699 PGLKLRAKEGDPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMH 1520 LKL +G L DP ++RRL+GRL+YL +TRPD+ YSVQ LSQF++ P HWDAA+ Sbjct: 625 QNLKLTPTDGVVLNDPTKYRRLVGRLIYLTVTRPDIVYSVQTLSQFMHEPRKPHWDAALR 684 Query: 1519 LLRYLKGSPSVGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQ 1340 +LRY+KG+P GL +S+ + L L+A+CD+DW C TR+S+TGFC+FLG+ L+SWK+KKQ Sbjct: 685 VLRYIKGTPGQGLLFSSTNDLTLKAFCDSDWGGCHATRRSVTGFCLFLGNSLISWKSKKQ 744 Query: 1339 TTVSRSSAEAEYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHE 1160 VSRSSAE+EYRA+ + E+ WL ++ DL +S TP L+CDNQAA+HI NPVFHE Sbjct: 745 VVVSRSSAESEYRAMANTCLELTWLRFILQDLKVSQNTPTPLFCDNQAALHIAANPVFHE 804 Query: 1159 RTKHLEIDCHLVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNP 980 
RTKH+EIDCH+VR ++G ++ +V +R QLAD FTK+LG+ F+ L SKLGL D+H+P Sbjct: 805 RTKHIEIDCHIVREKLQAGIINPSYVPTRFQLADVFTKALGKDQFVTLRSKLGLHDIHSP 864 Query: 979 T 977 T Sbjct: 865 T 865 >gb|PNX97998.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 964 Score = 603 bits (1554), Expect = 0.0 Identities = 339/720 (47%), Positives = 437/720 (60%), Gaps = 42/720 (5%) Frame = -3 Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834 CL +A N + + KF RA +F+GY QKGYR+YD+ + SRDV FFE FP+ Sbjct: 257 CLCFAKNMNI-QHKFDERAKPGIFVGYPFNQKGYRIYDMHTRKIYVSRDVQFFETVFPYH 315 Query: 2833 SIPVPSNXXXXXXXXXXXXXXXXXXTSSA------------------------------- 2747 + PS S+ Sbjct: 316 DLQTPSFASDISINTQFLDYEVDDTPSNLSPASSIPPGISHHDNTIVTIPNPSVDNPSEI 375 Query: 2746 ---PISQPE-------SHPLRRSS-RISKPPAWLSDFITNSVHSSTPMASPSHSAGPDSG 2600 P+ P+ +HP RR R P L+D + + +++ T S SA P Sbjct: 376 PAIPVEPPQQHSPTAINHPERRYPLRHRTPSVRLTDHVCD-INNVT-----SQSAFPLKN 429 Query: 2599 DFSLAPTSFNHSSILGATYTAFLANLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNT 2420 FSL+ S +H A L N+ +EP+S+SQA KSA+W +AM +E+ ALE NNT Sbjct: 430 YFSLSNLSTSHR--------ALLVNIIENKEPTSYSQAIKSAEWREAMAKEIHALESNNT 481 Query: 2419 WVLTSLPPGKRAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLV 2240 WVL+ LP GK AIGCKWVYK+K DGTVERYKARLVAKG++Q HG+DY ++F+PVAKLV Sbjct: 482 WVLSPLPNGKTAIGCKWVYKIKYHSDGTVERYKARLVAKGYNQVHGIDYHETFAPVAKLV 541 Query: 2239 TVRLVIALATIKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKAKDGEVCHLQRSLYGL 2060 TVRL++++A IK W L QLDVNNAFL G L+E++YM P G+S VC L +S+YGL Sbjct: 542 TVRLLLSIAAIKNWSLHQLDVNNAFLQGDLNEEVYMKLPPGFSHKGQPCVCKLNKSIYGL 601 Query: 2059 KQASRQWNAEFCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDA 1880 KQASRQW ++F L Q GF QS D+ LF +S +II+G I Sbjct: 602 KQASRQWFSKFSTTLIQKGFHQSISDYSLFTFKSNHTTIFVLVYVDDIIITGNNDDAISD 661 Query: 1879 LKRYLDDLFTIKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLP 1700 +K++L F+IKDLG YFLG+E++R+ G L QRKY LDIL DAG+ C+ + P+ Sbjct: 662 IKKFLAQAFSIKDLGNLSYFLGIEVSRSKKGIFLCQRKYTLDILSDAGLTGCRPSEFPME 721 Query: 
1699 PGLKLRAKEGDPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMH 1520 L+LR +G PL DP +RRLIGRLLYL +TRPD+ Y+V LSQF+ SP T+H DAA Sbjct: 722 QHLRLRPNDGSPLPDPTVYRRLIGRLLYLTVTRPDIQYAVNTLSQFMQSPCTTHLDAATR 781 Query: 1519 LLRYLKGSPSVGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQ 1340 +LRYLKGS GLF SA SSL L Y D+DWA C TR+S TG+ LGS +SWKTKKQ Sbjct: 782 VLRYLKGSVGKGLFLSASSSLQLIGYADSDWAGCPTTRRSTTGYFTMLGSNPISWKTKKQ 841 Query: 1339 TTVSRSSAEAEYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHE 1160 T+SRSSAEAEYR+L + E+QWL +L +DL ++ P P+T+ CD+QAAIHI +NPVFHE Sbjct: 842 PTISRSSAEAEYRSLATLASELQWLKFLLSDLDIAHPLPITVHCDSQAAIHIAENPVFHE 901 Query: 1159 RTKHLEIDCHLVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNP 980 RTKH+EIDCH VR KSG L ++ S QLAD FTK LG A+ L KLG+ ++ P Sbjct: 902 RTKHIEIDCHFVREKIKSGLLRPSYLRSFDQLADIFTKPLGGDAYKRLLGKLGVLEISIP 961 Score = 63.9 bits (154), Expect = 2e-06 Identities = 28/71 (39%), Positives = 44/71 (61%) Frame = -3 Query: 217 WDELACIMPLPACESDTRKLIDERDMNRKLMQFLMGLHESYDQVRNQLLLMDPLPSVDKA 38 WDEL I P+ C K I ++ + M+FL G+H+ + VR+Q+LLMDP PS+ + Sbjct: 1 WDELHSIAPINPCICGNAKSIIDQQNQDRAMEFLQGVHDRFSAVRSQILLMDPFPSIQRI 60 Query: 37 YSMALRVEKQR 5 Y++ + EKQ+ Sbjct: 61 YNIVRQEEKQQ 71 >gb|KZV53534.1| hypothetical protein F511_42283 [Dorcoceras hygrometricum] Length = 1012 Score = 604 bits (1557), Expect = 0.0 Identities = 311/684 (45%), Positives = 437/684 (63%), Gaps = 7/684 (1%) Frame = -3 Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834 CL YA+ + KF+ RA KCVFLGY G +GY+L +LD + +L S DV+F E FPF Sbjct: 340 CLCYASTLMSSRHKFSPRAIKCVFLGYPPGYRGYKLLNLDTNEILISCDVIFHEHEFPFQ 399 Query: 2833 SIPVPSNXXXXXXXXXXXXXXXXXXTSSAPISQP---ESHPLRRSSRISKPPAWLSDFIT 2663 + S+ +S I P +S RS RI +PP L ++ Sbjct: 400 NT-YNSDSQPSYIFSDNLLPVHSQLNNSHTIPDPISSKSKQQSRSQRILQPPHHLQNYHC 458 Query: 2662 NSVHSSTPMASPSHSAGPDSGDFSLAPTSFNHSSILGATYTAFLANLSNVEEPSSFSQAC 2483 +HSS+P S SH +F + S L + + N+S++ EP++FSQA Sbjct: 459 Y-MHSSSPSTSTSHPL-----------CNFVNYSKLSPLHRNLVNNISSIVEPTTFSQAV 506 Query: 2482 
KSADWVDAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMRPDGTVERYKARLVAK 2303 +W AM EL ALE N+TW + SLPPGK +GC+WVYK K DG+++RYKARLVAK Sbjct: 507 AIPEWKQAMSDELKALELNHTWSIVSLPPGKSVVGCRWVYKAKFAADGSLQRYKARLVAK 566 Query: 2302 GFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNAFLHGYLDEDIYMLPP 2123 G+ Q+ G+DYL++FSPVAK+VTVR ++ALA +GW L QL V+NAFLHG LDE++YM P Sbjct: 567 GYTQQEGLDYLETFSPVAKMVTVRTLLALAAARGWSLIQLHVHNAFLHGELDEEVYMSLP 626 Query: 2122 EGYSKA----KDGEVCHLQRSLYGLKQASRQWNAEFCLKLQQFGFTQSGHDHCLFVRQSX 1955 GYS VC L +SLYGLKQASRQW A+F L GF+QS D+ LF++ Sbjct: 627 PGYSSEGGPLPPQSVCKLHKSLYGLKQASRQWFAKFSSTLLSVGFSQSHADNSLFIKVRD 686 Query: 1954 XXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYFLGMEIARNTDGSVLN 1775 ++I+ LK +L++ F +KDLG +YFLG+E+AR++ G + Sbjct: 687 NVFLVLLVYVDDIVIATNNEEAASELKSFLNNKFKLKDLGKLKYFLGIEVARSSRGISIC 746 Query: 1774 QRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEGDPLVDPERFRRLIGRLLYLNLTRPD 1595 QR Y ++ L +AG++ C+ TP+ +K+ ++G+ L DP +RRLIGRLLYL +TRPD Sbjct: 747 QRNYAMNFLTEAGLMGCRPRSTPMEANVKITQEDGEILPDPSSYRRLIGRLLYLTVTRPD 806 Query: 1594 VTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLEAYCDADWASCV 1415 + ++V +LSQ+V+ P H +AA+++LRY+KG+ GL+Y ++S L L+ + DADW +C+ Sbjct: 807 LAFAVNKLSQYVSKPRLPHMEAALNILRYVKGTIGQGLYYGSNSDLRLKFFSDADWGACL 866 Query: 1414 DTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVCEIQWLSYLCADLGLS 1235 DTR+S+TG+C+FLG ++SW+ KKQ TVSRSSAEAEYR++ + CEI W+ L DLG+ Sbjct: 867 DTRRSVTGYCVFLGESMISWRAKKQHTVSRSSAEAEYRSMAAATCEILWIRSLLTDLGVK 926 Query: 1234 IPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGFLHLGHVSSRLQLADF 1055 P TL+CD+QAAIHI NPVFHERTKH++IDCH++R + G + L HVSS QLAD Sbjct: 927 CDGPATLFCDSQAAIHIASNPVFHERTKHIDIDCHVIREKVQQGIVKLMHVSSVQQLADL 986 Query: 1054 FTKSLGRAAFLLLCSKLGLFDLHN 983 FTK+L + F L SK+G+ ++H+ Sbjct: 987 FTKALLTSRFRSLLSKMGIHNIHD 1010 >gb|PNX93906.1| hypothetical protein L195_g017068 [Trifolium pratense] Length = 1183 Score = 607 bits (1564), Expect = 0.0 Identities = 319/714 (44%), Positives = 437/714 (61%), Gaps = 35/714 (4%) Frame = -3 Query: 3013 
CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPF- 2837 CL +A HP KF RA + +F+GY TGQKGY++YD + + SRDV F E FP Sbjct: 478 CLCFATIVHP-THKFDPRARRGIFVGYPTGQKGYKIYDPETKNFFVSRDVRFCETDFPSI 536 Query: 2836 --------------------ASIPVPSNXXXXXXXXXXXXXXXXXXTSSAPIS------- 2738 + IP PS+ ++++PI+ Sbjct: 537 PTTSKPNSISYHPPHEALDDSPIPTPSHVPSTHDLNPPPQPPTATPSAASPINDSIPTTS 596 Query: 2737 ---QPESHPL---RRSSRISKPPAWLSDFITNSVHSSTPMASPSHSAGPDSGDFSLAPTS 2576 +P + P+ RRS R PP W D+ H S P + S S P S + P S Sbjct: 597 HTPEPPTSPIPQVRRSLRDKNPPIWHQDY-----HMS-PQVNTSSSV-PTSRSGTRYPLS 649 Query: 2575 FNHS-SILGATYTAFLANLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLP 2399 S S + +T+ FLAN++ +EP S+ QA W AM EL AL+QNNTW L LP Sbjct: 650 HYLSYSRISSTHCTFLANITANKEPQSYDQAVHDPQWQAAMNTELEALQQNNTWNLVPLP 709 Query: 2398 PGKRAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIA 2219 PG + IGCKWVYK+K + DGT+ERYKARLVAKG+ Q G+DY ++FSP AK+ T+R ++ Sbjct: 710 PGHKPIGCKWVYKIKYKSDGTIERYKARLVAKGYTQVEGIDYQETFSPTAKVTTLRCLLT 769 Query: 2218 LATIKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKAKDGEVCHLQRSLYGLKQASRQW 2039 +A + W + QLDV NAFLHG L E +YM PP G + + VC L +SLYGLKQASR W Sbjct: 770 VAASRNWFIHQLDVQNAFLHGDLHELVYMEPPPGLRRQGENVVCRLNKSLYGLKQASRNW 829 Query: 2038 NAEFCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDD 1859 + F +Q+ G+ QS D+ LF + ++++G L E+ LK +L Sbjct: 830 FSTFSKAIQKAGYQQSKADYSLFTKPQGTSFTAVLIYVDDILLTGNDLEEMKRLKEFLLR 889 Query: 1858 LFTIKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRA 1679 F IKDLG +YFLG+E +R+ G ++QRKY LDIL D+G++ + P+ LKL Sbjct: 890 HFRIKDLGDLKYFLGIEFSRSKKGIFMSQRKYALDILQDSGLIGARPDKFPMEQNLKLTP 949 Query: 1678 KEGDPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKG 1499 +G L DP ++RRL+GRL+YL +TRPD+ YSVQ LSQF++ P HWDAA+ +LRY+KG Sbjct: 950 TDGVVLTDPTKYRRLVGRLIYLTVTRPDIVYSVQTLSQFMHEPRKPHWDAALRVLRYIKG 1009 Query: 1498 SPSVGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSS 1319 +P G+ +S + L L+A+CD+DW C TR+S+TGFCIFLG+ +SWK+KKQ TVSRSS Sbjct: 1010 
TPGQGILFSTSNDLSLKAFCDSDWGGCHATRRSVTGFCIFLGNSPISWKSKKQVTVSRSS 1069 Query: 1318 AEAEYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEI 1139 AE+EYRA+ + E+ WL ++ DL ++ P L+CDNQAA+HI NPVFHERTKH+EI Sbjct: 1070 AESEYRAMANTCLELTWLRFILQDLKVTQAAPTPLFCDNQAALHIAANPVFHERTKHIEI 1129 Query: 1138 DCHLVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT 977 DCH+VR ++G + +V +R QLAD FTK+LG+ F+ L +KLGL D+H+PT Sbjct: 1130 DCHIVREKLQAGMISPSYVPTRFQLADVFTKALGKDQFVTLRNKLGLHDIHSPT 1183 Score = 187 bits (476), Expect = 8e-45 Identities = 92/210 (43%), Positives = 136/210 (64%), Gaps = 2/210 (0%) Frame = -3 Query: 628 DPLKLYSSDHPGLSLVSSQLTGNNYLSWRRSMLIALGAKTKLGFINGKMEIPKEDSP--K 455 +P ++ SDHPG LV ++L G NY SW RSM+ AL AK K+GFI+G ++ P E+ + Sbjct: 5 NPYYIHPSDHPGHLLVPTKLNGTNYPSWSRSMVHALTAKNKVGFIDGSIKEPSEEKQPAE 64 Query: 454 YDQWRKVDCMVISWILNSISKDLVDAFIYCDSAKDLWDDIAKRFGDCNGPLIYQLERDIA 275 Y W + + M++SW+ +S+ +DL I+ +A +W D +F N P IYQ+++ +A Sbjct: 65 YALWNRCNSMILSWLTHSVEQDLAKGVIHAKTAYQVWKDFKDQFSQKNIPAIYQIQKSLA 124 Query: 274 NMNQGNMSVVEYFTKLKRLWDELACIMPLPACESDTRKLIDERDMNRKLMQFLMGLHESY 95 +++QG MSV YFTK+K LWDEL LP C K DE+ +LMQFLMGL++SY Sbjct: 125 SLSQGTMSVSTYFTKIKGLWDELESYRTLPTCSQ--MKAHDEQREEDRLMQFLMGLNDSY 182 Query: 94 DQVRNQLLLMDPLPSVDKAYSMALRVEKQR 5 VR+ +L+M PLP+V +AYS+ ++ E QR Sbjct: 183 STVRSNILMMSPLPNVRQAYSLVIQEETQR 212 >gb|KYP34293.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1376 Score = 611 bits (1575), Expect = 0.0 Identities = 331/714 (46%), Positives = 440/714 (61%), Gaps = 32/714 (4%) Frame = -3 Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPF- 2837 CL Y + + ++ K RA+ CVFLG+ KGY YDL ++ SR+V F+E+ FP Sbjct: 671 CLCYVSTSTANRKKLDPRAHPCVFLGFSPTTKGYITYDLHTRAITISRNVSFYENHFPLL 730 Query: 2836 ------ASIPVPS------------------NXXXXXXXXXXXXXXXXXXTSSAPIS--- 2738 ++IPV S + S AP S Sbjct: 731 QSTSSTSNIPVVSPISFGIHSPSHDLISILPDPHQHNVTSPNPATTSHDSISLAPYSTTA 790 Query: 2737 ---QPESHPLRRSSRISKPPAWLSDFITNSVHSSTPMASPSHSAGPDSGDFSLAP-TSFN 2570 P S 
PLRRS+R+ PP++L D+ HS T ++ H L P + Sbjct: 791 DSLPPNSSPLRRSTRLRNPPSYLQDY----HHSLTSTSTNLHPG-------MLYPIEKYI 839 Query: 2569 HSSILGATYTAFLANLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGK 2390 S L + AF++++S V EP S+++A K W+ AM EL AL+ N TW LT LPP K Sbjct: 840 SYSRLSNDFQAFVSSISAVSEPHSYAEAAKHDCWLKAMHAELEALKMNQTWTLTPLPPHK 899 Query: 2389 RAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALAT 2210 +A+GC+W+YK+K DG++ERYKARLVAKG+ Q G+DYL +FSPVAKL TVRL++ALA Sbjct: 900 QAVGCRWIYKIKYNADGSIERYKARLVAKGYTQVEGLDYLATFSPVAKLTTVRLLLALAA 959 Query: 2209 IKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKAKDGEVCHLQRSLYGLKQASRQWNAE 2030 + W L QLDVNNAFLHG L+E++YM P G +VC LQ+SLYGLKQASRQW A+ Sbjct: 960 VFDWHLKQLDVNNAFLHGDLNEEVYMTLPLGMRPEYSNQVCKLQKSLYGLKQASRQWFAK 1019 Query: 2029 FCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFT 1850 L G+ QS DH LF++ S ++++G LSEI + LD F Sbjct: 1020 LSSFLIHHGYHQSASDHSLFMKFSSSSTTALLIYVDDIVLAGNNLSEIQLITGLLDVAFK 1079 Query: 1849 IKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEG 1670 IKDLG +YFLG+E+ARN G L+QRKYVLDIL D GM+ + TP+ +L A G Sbjct: 1080 IKDLGNLKYFLGLEVARNKSGIHLSQRKYVLDILSDCGMMASRPVSTPMDYTSRLSASSG 1139 Query: 1669 DPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPS 1490 PL DP +RRL+GRL+YL TRPD++Y V LSQF+++P T+H A +LRYLK +P Sbjct: 1140 TPLADPSSYRRLLGRLIYLTTTRPDISYVVHHLSQFMSAPSTAHSQAIFRILRYLKQAPG 1199 Query: 1489 VGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEA 1310 GLF+ +SSL L+A+ D+DWA C+DTR+S+TGF ++LG L+SW++KKQ TVSRSS+EA Sbjct: 1200 SGLFFPTNSSLHLKAFSDSDWAGCLDTRRSITGFSVYLGDSLISWRSKKQPTVSRSSSEA 1259 Query: 1309 EYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCH 1130 EYRAL + E+QWL+YL DL + + P L+CDNQ+A+HI N VFHERTKH++IDCH Sbjct: 1260 EYRALATTTSELQWLTYLLHDLHVPVHQPALLYCDNQSALHIAANQVFHERTKHIDIDCH 1319 Query: 1129 LVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT*GG 968 LVR +SG L L V+S QLAD FTKSL + F L SKLG+ +L++ GG Sbjct: 1320 LVREKLQSGLLKLLPVASPHQLADIFTKSLSPSMFTALYSKLGMLNLYSQLEGG 1373 Score = 147 bits (371), Expect = 4e-32 
Identities = 72/201 (35%), Positives = 120/201 (59%), Gaps = 6/201 (2%) Frame = -3 Query: 592 LSLVSSQLTGNNYLSWRRSMLIALGAKTKLGFINGKMEIPKEDSPKYDQWRKVDCMVISW 413 ++LVS L NY SW RSML AL AK K+ F++G P Y W++ + MV+SW Sbjct: 1 MALVSPSLDSTNYHSWSRSMLTALSAKNKVEFVDGSAPQPPSSDRIYSAWKRCNNMVVSW 60 Query: 412 ILNSISKDLVDAFIYCDSAKDLWDDIAKRFGDCNGPLIYQLERDIANMNQGNMSVVEYFT 233 ++ S+S + + ++ DSA+++W D+ R+ + I L+ + +++ QG++SV +YFT Sbjct: 61 LVPSVSFSIRQSILWMDSAEEIWRDLKSRYSQGDLLRISALQLEASSIKQGDLSVTDYFT 120 Query: 232 KLKRLWDELACIMPLPACESDTR------KLIDERDMNRKLMQFLMGLHESYDQVRNQLL 71 +L+ +WDEL P P C + ++ +R + + MQFL GL++ Y VR+ +L Sbjct: 121 QLRIIWDELENFRPDPICVCIVKCICKVSSILAQRKLEDQAMQFLRGLNDQYANVRSHVL 180 Query: 70 LMDPLPSVDKAYSMALRVEKQ 8 LMDPLP ++K +S + E+Q Sbjct: 181 LMDPLPPINKIFSYVAQQERQ 201 >gb|PNX93131.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 982 Score = 597 bits (1540), Expect = 0.0 Identities = 318/704 (45%), Positives = 429/704 (60%), Gaps = 29/704 (4%) Frame = -3 Query: 3010 LSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFAS 2831 L+YA+ +K+K + R KCVFLG G KG L+DLD+ ++ SR+V F+ P+ + Sbjct: 264 LAYASTLDVNKTKLSPRGRKCVFLGQKQGVKGSILFDLDSKNIFLSRNVTHFDHILPYTT 323 Query: 2830 ----------IPVPSNXXXXXXXXXXXXXXXXXXTSSAP----ISQPE---SHPL----- 2717 + S P IS P S PL Sbjct: 324 NTSKLHWHYHSTINCEPFLDIDQSHTSTNPSDTTPSPTPPTNIISDPNPSTSSPLPSSPF 383 Query: 2716 ------RRSSRISKPPAWLSDFITNSVHSSTPMASPSHSAGPDSGDFSLAPTSFNHS-SI 2558 R RI P++LSDF+ S S + S ++ P S HS S Sbjct: 384 PIQPANTRPDRIKHRPSYLSDFV----------CSASDDSAKSSSTGTIYPISSFHSLSQ 433 Query: 2557 LGATYTAFLANLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGKRAIG 2378 L +++ F ++L+ EP ++++ACKS W+ AM EL AL + TW + LPP + IG Sbjct: 434 LSPSHSVFTSSLTQHTEPRTYTEACKSQHWIQAMTSELEALARTGTWKIVDLPPNVKPIG 493 Query: 2377 CKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKGW 2198 KWVYK+K + DGT+ERYKARLVAKG++Q G+D+ D+FSPVAKL TVR+++A+A+IKGW Sbjct: 494 SKWVYKIKHKSDGTIERYKARLVAKGYNQVEGLDFFDTFSPVAKLTTVRMLLAIASIKGW 553 Query: 
2197 PLFQLDVNNAFLHGYLDEDIYMLPPEGYSKAKDGEVCHLQRSLYGLKQASRQWNAEFCLK 2018 L QLDVNNAFLHG L E++YM P+G +K +VC L +SLYGLKQASR+W + Sbjct: 554 FLHQLDVNNAFLHGDLQENVYMSIPDGVQCSKPNQVCKLLKSLYGLKQASRKWYEKLTSL 613 Query: 2017 LQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKDL 1838 L + G+TQS DH LF +I++GT L EI+ +K LD F IKDL Sbjct: 614 LVKEGYTQSSSDHSLFTISQQDNFTALLIYVDDIILAGTSLQEINRIKNILDTHFKIKDL 673 Query: 1837 GYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEGDPLV 1658 G +YFLG+E+A + +G ++QRKY LD+LHD+G+L K A TPL P +KL +G P Sbjct: 674 GVVKYFLGLEVAHSKEGISISQRKYCLDLLHDSGLLGSKPASTPLDPSVKLHHDDGKPFE 733 Query: 1657 DPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGLF 1478 D +RRL+G+LLYL TRPD+ ++ QQLSQF++ P +H+ AA ++RYLK +P +GL Sbjct: 734 DISMYRRLVGKLLYLTNTRPDIAFATQQLSQFLHKPTMTHYKAACRVIRYLKHNPGMGLI 793 Query: 1477 YSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYRA 1298 + ++ + L Y DADWA C+DTR+S TG+C F+GS L+SWK KKQTT+S+SS+EAEYRA Sbjct: 794 FKRNADIQLIGYSDADWAGCLDTRRSTTGYCFFVGSSLISWKAKKQTTISKSSSEAEYRA 853 Query: 1297 LGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVRN 1118 L S CE+ WL YL DL + L+CDNQ+A+HI NPVFHERTKH+EIDCHLVR Sbjct: 854 LSSATCELVWLLYLLKDLHIECSKQPVLFCDNQSALHIASNPVFHERTKHIEIDCHLVRE 913 Query: 1117 LYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLH 986 + G L L VS++ QLADF TKSL F KLGL D++ Sbjct: 914 KVQEGLLRLIPVSTQEQLADFLTKSLPAPKFHDFLCKLGLLDIY 957 >gb|PNX92076.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 720 Score = 588 bits (1516), Expect = 0.0 Identities = 322/697 (46%), Positives = 423/697 (60%), Gaps = 17/697 (2%) Frame = -3 Query: 3010 LSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFP--- 2840 L YA H++K RA K +FLGY +G KGY LYDL + + SR V F E+ P Sbjct: 10 LCYATTLTSHRTKLDPRARKSLFLGYRSGYKGYVLYDLSSREIFISRHVTFHENVLPYPN 69 Query: 2839 ----------FASIPVPSNXXXXXXXXXXXXXXXXXXTSSAPISQPESHP---LRRSSRI 2699 + S S+ +S S S P R S+R Sbjct: 70 STSISTSNWDYISSHTSSDTSIHTSNEIITPPSINLPANSTASSPSTSAPPTLTRCSTRP 129 
Query: 2698 SKPPAWLSDFITNSVHSSTPMASPSHSAGPDSGDFSLAPTSFNHSSILGATYTAFLANLS 2519 P +L D++ N++ HS+ SG S ++F L + + +L+ Sbjct: 130 KHIPPYLKDYVCNAL---------DHSSMKSSG-ISYPMSNFISYQNLSNPHCFYALSLT 179 Query: 2518 NVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMRPDG 2339 EP S+++A K W AMQ EL ALE TW+L LP + IGC+WVYKVK DG Sbjct: 180 THTEPKSYAEAIKFDCWKQAMQVELQALENTGTWILVDLPHHVKPIGCRWVYKVKHHADG 239 Query: 2338 TVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNAFLH 2159 +VERYKARLVAKGF+Q G+DY D+FSPVAKL TVR+VIALA++ W L QLDVNNAFLH Sbjct: 240 SVERYKARLVAKGFNQIEGLDYFDTFSPVAKLTTVRIVIALASVHNWFLHQLDVNNAFLH 299 Query: 2158 GYLDEDIYMLPPEGYSKAKDGEVCHLQRSLYGLKQASRQWNAEFCLKLQQFGFTQSGHDH 1979 G L ED+YMLPP G + +VC L +SLYGLKQASRQW A+ L G+ Q+ DH Sbjct: 300 GDLQEDVYMLPPPGVTN-DPNKVCKLVKSLYGLKQASRQWYAKLTSLLLSHGYKQAHSDH 358 Query: 1978 CLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYFLGMEIAR 1799 LF + VI++G ++E +K L + F IKDLG +YFLG+E+A Sbjct: 359 SLFTKHDASHFTLLLVYVDDVILAGNHMAEFSYVKNLLHNAFKIKDLGQLKYFLGLEVAH 418 Query: 1798 NTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEGDPLVDPERFRRLIGRLL 1619 + G L QRKY LD+L D+G+L K TP LKL A + P D +RRL+GRLL Sbjct: 419 SAKGISLCQRKYCLDLLSDSGLLGAKPVSTPSDASLKLHADDSAPFEDISAYRRLVGRLL 478 Query: 1618 YLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLEAYC 1439 YLN TRPD+T+ QQLSQF++ P +H+ AAM +LRYLK P GLF+ +S+L + + Sbjct: 479 YLNTTRPDITFITQQLSQFLSKPTHTHYSAAMRVLRYLKNCPGRGLFFPRNSTLQILGFS 538 Query: 1438 DADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVCEIQWLSY 1259 DADWA C D+R+S++G C FLG L+SW+TKKQ TV+RSS+EAEYRAL + CE+QWL+Y Sbjct: 539 DADWAGCKDSRRSISGQCFFLGQSLISWRTKKQLTVARSSSEAEYRALAAATCELQWLAY 598 Query: 1258 LCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGFLHLGHVS 1079 L DL ++ P L+CDNQ+A+HI NPVFHERTKH++IDCH+VR ++G + L VS Sbjct: 599 LLQDLHITCPKLPVLYCDNQSALHIAANPVFHERTKHIDIDCHIVREKLQAGLMKLLPVS 658 Query: 1078 SRLQLADFFTKSLGRAAFLLLCSKLGLFDLHN-PT*G 971 S+ Q+ADFFTKSL F +L +KLG+FD++ PT G Sbjct: 659 SKDQIADFFTKSLLPQPFGVLLAKLGMFDIYQAPTCG 695 >gb|OMP02866.1| Reverse 
transcriptase, RNA-dependent DNA polymerase [Corchorus capsularis] Length = 666 Score = 585 bits (1509), Expect = 0.0 Identities = 308/669 (46%), Positives = 412/669 (61%), Gaps = 20/669 (2%) Frame = -3 Query: 2923 QKGYRLYDLDNHSLLTSRDVVFFEDSFPFAS------------IPVPSNXXXXXXXXXXX 2780 QKGYRLYDL N L SRDVVF E+ FPF +P+P N Sbjct: 3 QKGYRLYDLSNQEYLVSRDVVFQENIFPFQQSRTPPTPSQVLPLPIPDNHSFNSLPSTPI 62 Query: 2779 XXXXXXXTSSAP-----ISQP--ESHPLRRSSRISKPPAWLSDFITNSVHSSTPMASPSH 2621 S IS P E PL RS R +PP +L + + V PS Sbjct: 63 ESPNETPIISNDSSLNEISLPSNEDQPLARSQRNRRPPPYLQYYECSKVRRQ-----PSQ 117 Query: 2620 SAGPDSGDFSLAPTS-FNHSSILGATYTAFLANLSNVEEPSSFSQACKSADWVDAMQREL 2444 S+ SG + P S F + L +TY+ F++N++++ EP S+S+A K +W A+ EL Sbjct: 118 SSSTTSGSGTRYPISNFLSTHRLSSTYSTFVSNITSIAEPQSYSEAIKDPNWKAAIDAEL 177 Query: 2443 TALEQNNTWVLTSLPPGKRAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDS 2264 ALE N TW + LPP K +GCKWV+KVK + G++ERYKARLVAKG+ Q+ G+D+ ++ Sbjct: 178 HALEANKTWSIVDLPPHKSPVGCKWVFKVKYKSYGSIERYKARLVAKGYTQQEGIDFHET 237 Query: 2263 FSPVAKLVTVRLVIALATIKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKAKDGEVCH 2084 F+PVAK+ TVR ++A+A+ K WPL+QLDV NA LHG LDE++YM P G + + VC Sbjct: 238 FAPVAKMTTVRCLLAIASTKNWPLYQLDVQNALLHGDLDEEVYMSLPPGVTSKGENSVCK 297 Query: 2083 LQRSLYGLKQASRQWNAEFCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISG 1904 L +SLYGL+QAS QW A+F L +GF QS D+ LF++ S ++I+G Sbjct: 298 LHKSLYGLRQASLQWFAKFSTALLTYGFVQSRSDYSLFIKSSKTDFVAILVYVDDIVITG 357 Query: 1903 TVLSEIDALKRYLDDLFTIKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHC 1724 ID++K L F+IKDLG +YFLG+E+AR+ G L+QRKY L++L + G+ Sbjct: 358 NNSKLIDSVKNALQRQFSIKDLGSLKYFLGLEVARSKQGIYLSQRKYTLELLSETGLAGA 417 Query: 1723 KAAITPLPPGLKLRAKEGDPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYT 1544 + P+ KL A EG+ L DP +RRLIG+L+YL +TRPD+ YSV LSQF+N P Sbjct: 418 RPLYVPMEQNTKLSAHEGELLKDPSPYRRLIGKLIYLTITRPDIMYSVHVLSQFMNQPRH 477 Query: 1543 SHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCL 1364 H+DAA+ L+RYLK SP G+ S+ S L A+ D+DWASC DTR+SLTGFCI LGS Sbjct: 478 
PHFDAALRLVRYLKSSPGQGILLSSLSDFKLRAFSDSDWASCPDTRRSLTGFCILLGSSP 537 Query: 1363 VSWKTKKQTTVSRSSAEAEYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHI 1184 +SWKTKKQ TVS SSAEAEYRA+ EI WL L D G+S TP +L CDN+AA+HI Sbjct: 538 ISWKTKKQQTVSCSSAEAEYRAMAFTCREIVWLQSLLHDFGISQCTPASLHCDNKAALHI 597 Query: 1183 VQNPVFHERTKHLEIDCHLVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKL 1004 NPVFHER+KH+E+DCH +R+ + + ++S+ Q AD FTK LG+ L KL Sbjct: 598 AANPVFHERSKHIEVDCHFIRDKLQQKIIETSYISTTQQPADLFTKPLGKDQLHHLLRKL 657 Query: 1003 GLFDLHNPT 977 + D+H+PT Sbjct: 658 AVHDIHSPT 666 >gb|PNY03100.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 629 Score = 582 bits (1500), Expect = 0.0 Identities = 290/586 (49%), Positives = 390/586 (66%), Gaps = 1/586 (0%) Frame = -3 Query: 2737 QPESHPLRRSSRISKPPAWLSDFITNSVHSSTPMASPSHSAGPDSGDFSLAP-TSFNHSS 2561 QPE PLRRS+R S PP +L++ N + T P SA S P +S+ Sbjct: 36 QPE--PLRRSTRNSHPPPFLTE---NYYCNLTSATLPDSSAATLSSSSCKYPISSYVSYQ 90 Query: 2560 ILGATYTAFLANLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGKRAI 2381 + + + FL NLS + EP+ + +A +W A+ EL+ALE+NNTW L LP K AI Sbjct: 91 NISSAHNHFLFNLSTIPEPTCYEKAVCDENWKTAINAELSALEKNNTWKLVPLPLHKHAI 150 Query: 2380 GCKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKG 2201 GCKWV+K+K+ DGT+ERYKARLVAKG+ Q G+DY+D+FSPV K+ T+R+++A+A + Sbjct: 151 GCKWVFKLKLHADGTIERYKARLVAKGYTQTEGIDYMDTFSPVVKMTTIRVLLAVAAAQN 210 Query: 2200 WPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKAKDGEVCHLQRSLYGLKQASRQWNAEFCL 2021 WPL+QLDVN AFLHG L+E++YM PP G S VC LQRSLYGLKQASRQWN + Sbjct: 211 WPLYQLDVNTAFLHGDLNEEVYMQPPPGLSLPHSNLVCKLQRSLYGLKQASRQWNTKLTE 270 Query: 2020 KLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKD 1841 L G+ QS D+ LF +Q+ +++ GT +EI +K LD+ F+IKD Sbjct: 271 TLTASGYVQSKSDYSLFTKQASSGLTIILVYVDDLVLGGTDSNEIQNIKALLDEKFSIKD 330 Query: 1840 LGYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEGDPL 1661 LGY +YFLG E+AR G L QRKY LD++ DAG+L K TP+ P L+L G + Sbjct: 331 LGYLKYFLGFEVARTQAGISLCQRKYALDLIQDAGLLGAKPCSTPMQPQLQLHKSSGQAI 390 Query: 1660 
VDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGL 1481 +P +RRLIGRLLYL +RP++ Y+V +LSQF++ P H A +H+LRY+K SP GL Sbjct: 391 SEPTSYRRLIGRLLYLTHSRPEIAYAVSKLSQFLDKPTNEHMLAGLHVLRYVKNSPGQGL 450 Query: 1480 FYSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYR 1301 F+ + S L L+ + D+DW +C DTR+S TGFC FLG+ L+SWK+KKQ VSRSS+EAEYR Sbjct: 451 FFDSKSPLTLKGFSDSDWGACPDTRRSTTGFCFFLGNSLISWKSKKQNVVSRSSSEAEYR 510 Query: 1300 ALGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVR 1121 AL CE QWL +L DL +S PTP+ ++CDN++A+HI NPVFHERTKH+E+DCH+VR Sbjct: 511 ALAQTTCEGQWLKFLLQDLHISHPTPIVIYCDNKSALHIAANPVFHERTKHIEMDCHVVR 570 Query: 1120 NLYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHN 983 +SG +HL V ++ Q+AD TKSL F L SKLG+ D+++ Sbjct: 571 EKVQSGLIHLLSVHTKEQVADILTKSLHPGPFHTLQSKLGMIDIYS 616