BLASTX nr result
ID: Chrysanthemum22_contig00005937
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00005937
         (3650 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                          (bits) Value

gb|OTG03795.1| putative ribonuclease H-like domain-containing pr...    962   0.0
gb|OTG03225.1| putative NB-ARC [Helianthus annuus]                     846   0.0
gb|OMO87137.1| Integrase, catalytic core [Corchorus capsularis]        803   0.0
gb|OMO65653.1| hypothetical protein CCACVL1_21443 [Corchorus cap...    758   0.0
ref|XP_022018997.1| uncharacterized protein LOC110919025 [Helian...    711   0.0
gb|OTG09093.1| putative reverse transcriptase, RNA-dependent DNA...    724   0.0
gb|OMO75305.1| Integrase, catalytic core [Corchorus capsularis]        734   0.0
gb|PNY16822.1| retrovirus-related Pol polyprotein from transposo...    701   0.0
gb|OTG24510.1| putative reverse transcriptase, RNA-dependent DNA...    699   0.0
gb|OTG06009.1| putative zinc finger, CCHC-type [Helianthus annuus]     696   0.0
gb|PNX93928.1| hypothetical protein L195_g017092, partial [Trifo...    685   0.0
gb|PNX96222.1| retrovirus-related Pol polyprotein from transposo...    700   0.0
gb|PNX97998.1| retrovirus-related Pol polyprotein from transposo...    685   0.0
gb|PRQ55089.1| putative RNA-directed DNA polymerase [Rosa chinen...    693   0.0
ref|XP_022004406.1| uncharacterized protein LOC110901966 [Helian...    661   0.0
ref|XP_021979664.1| uncharacterized protein LOC110875770 [Helian...    656   0.0
emb|CAN71595.1| hypothetical protein VITISV_010143 [Vitis vinifera]    695   0.0
ref|XP_021975259.1| uncharacterized protein LOC110870383 [Helian...
657 0.0 gb|KYP64168.1| Retrovirus-related Pol polyprotein from transposo... 676 0.0 gb|PNX95363.1| retrovirus-related Pol polyprotein from transposo... 679 0.0 >gb|OTG03795.1| putative ribonuclease H-like domain-containing protein [Helianthus annuus] Length = 1050 Score = 962 bits (2486), Expect = 0.0 Identities = 478/785 (60%), Positives = 580/785 (73%), Gaps = 21/785 (2%) Frame = +2 Query: 5 NKTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYD 184 NKTPYE++ KP+YD ++V GCLAYYRS+ET GDKFE RGRPGVFLGYP GTKGYK++D Sbjct: 278 NKTPYEIVFRKKPDYDRLRVMGCLAYYRSIETNGDKFEFRGRPGVFLGYPQGTKGYKIFD 337 Query: 185 LQHRKMVTSRDVKFLENVFPFAR-----------NPTEEEKIFVLPQKWDEEENTR---- 319 ++H K+ SRDV F+E VFPF + N EE+ + + ++ ++ T+ Sbjct: 338 VEHGKIAVSRDVTFVEKVFPFEKLKTNNSNQDLFNVPEEDYEIIFEEPYNSQKATQMDPH 397 Query: 320 --DIRADQTKSNDNIHEPSSVMAETETQDVHNGA----DFFGPSQEPSEPATSGPNEPLL 481 +I ++ + ++ + +S E + N DF PS T P L Sbjct: 398 GPNIGTEEAPGSGSMADATSPQREGLPIGLDNTRGQPQDFLSPSANDGLDQTPAPTHSLG 457 Query: 482 DITHETVPSNQNDMGSETVSEENRPTNAHTRPVRSRTRPARLDGFEVNLPPSLDHTQSSL 661 D HET E V E T TR R+ ++P+ L F VNLPPS+DHTQ Sbjct: 458 DDAHET---------GERVVNEFLST---TRGKRTISQPSYLKEFHVNLPPSVDHTQPVT 505 Query: 662 HHDSSTVHPLAHFISYDNFTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALE 841 SSTVH LA+++SY+ F+N+HK FLTAITT+NEPK F +A++D W AMK+EIQALE Sbjct: 506 DQSSSTVHSLANYVSYEKFSNSHKVFLTAITTHNEPKSFHEAMQDENWKLAMKKEIQALE 565 Query: 842 ENGTWILEELPKGKRAIDSKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPX 1021 EN TW LE LP+GKRAIDSKWVYK+KYKPNGE+ER+KARLVAKG+TQMEGVDFH+TFAP Sbjct: 566 ENKTWTLEPLPEGKRAIDSKWVYKLKYKPNGEIERHKARLVAKGYTQMEGVDFHDTFAPV 625 Query: 1022 XXXXXXXXXXXXXXXXGWHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKS 1201 W HQLDVNNAFLHGDL+E+VYMKIPQGF K+ + RVC+L+KS Sbjct: 626 AKLVTVRTLLAVAVKKEWLIHQLDVNNAFLHGDLDEEVYMKIPQGFAKRGETRVCRLRKS 685 Query: 1202 LYGLKQASRNWYHKFTGSLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSN 1381 LYGLKQASRNWYHKFT SL +IG+KQ+ ADHSLF + FVA LIYVDDVV+ GND+ Sbjct: 686 LYGLKQASRNWYHKFTSSLVDIGYKQSHADHSLFTFKDGVNFVAILIYVDDVVITGNDAT 745 
Query: 1382 KIQDTKDFLDKRFSIKDLGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSS 1561 KIQ+TK +LD +FSIK+LGPLKYFLGIEVA+T +G+VLSQRKYTLD+LED GM GCRPS Sbjct: 746 KIQETKQYLDNKFSIKNLGPLKYFLGIEVARTVDGLVLSQRKYTLDLLEDTGMLGCRPSP 805 Query: 1562 FPMEQNLKLDTCDKEPRVDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHME 1741 FPMEQ LKLD C + P+VDA QYRRLIGRLLYLQATRPDIAY+VN+LSQFV DPR H+ Sbjct: 806 FPMEQGLKLDNCQESPKVDAQQYRRLIGRLLYLQATRPDIAYSVNLLSQFVSDPREDHLF 865 Query: 1742 AATRVLCYLKGTPGQGILLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWR 1921 AA R+L YLK +PGQG+ LPK GG +L A+CD+DWLGC +TRRSRTGYLLLLGGAPISW+ Sbjct: 866 AAHRILRYLKSSPGQGVFLPKHGGLHLSAFCDADWLGCQLTRRSRTGYLLLLGGAPISWK 925 Query: 1922 TKKQSVVSKSSAEAEYRAMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNP 2101 TKKQSVVS+SSAEAEYR+M++ VSE++WMRWLL++L + T +FCDN A +HIANNP Sbjct: 926 TKKQSVVSRSSAEAEYRSMASTVSEVIWMRWLLTDLQVVQDQATPIFCDNLAVKHIANNP 985 Query: 2102 VFHERTKHVEMDCYFVRERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRN 2281 VFHERTKHVEMDCYF+RERV+S +I P+ I TK QIAD+ TK LGA L LL KLGVR+ Sbjct: 986 VFHERTKHVEMDCYFIRERVESKDIFPLHIDTKQQIADLFTKPLGAQHLQILLHKLGVRD 1045 Query: 2282 LHAPT 2296 LHAPT Sbjct: 1046 LHAPT 1050 >gb|OTG03225.1| putative NB-ARC [Helianthus annuus] Length = 1228 Score = 846 bits (2185), Expect = 0.0 Identities = 435/753 (57%), Positives = 531/753 (70%), Gaps = 1/753 (0%) Frame = +2 Query: 5 NKTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYD 184 NKTPYE L P YDHM+VFGCL Y+R+ +TKGDKFE RGR G+FLGYP GTKGYK+YD Sbjct: 143 NKTPYEALLGIAPTYDHMRVFGCLTYHRNYDTKGDKFEPRGRRGIFLGYPFGTKGYKIYD 202 Query: 185 LQHRKMVTSRDVKFLENVFPFARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIHE 364 L +K+ D + R E +K DE E R Sbjct: 203 LDEKKVENIED--------DWLRGEVHSE------EKGDEIEIGR--------------- 233 Query: 365 PSSVMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHETVPSNQNDMGSETVSE 544 + E D H D E +PA NQND + Sbjct: 234 -AGQHVEIRGVDQHVEHDLGPHDSEVVDPADD----------------NQNDSAQ---LQ 273 Query: 545 ENRPTNAHTRPVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSSTVHPLAHFISYDNFTN 724 P TR R+R +P R + V LPPS+DH + SSTVHPLA+++SY++F Sbjct: 274 
TPPPQTMTTRVPRTRIQPQRYKDYSVQLPPSIDHANPASEQASSTVHPLAYYLSYNSFGA 333 Query: 725 THKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEELPKGKRAIDSKW 904 HKAFL+AI + +EPK+F QA +D +W EAM++EI+AL+ENGTW LE+LP GK+AI SKW Sbjct: 334 NHKAFLSAIDSCHEPKNFVQASQDPKWREAMEQEIKALQENGTWTLEKLPSGKKAIYSKW 393 Query: 905 VYKIKYKPNGEVERYKARLVAKGFTQME-GVDFHETFAPXXXXXXXXXXXXXXXXXGWHT 1081 VYK+K+KP+G+V+RYKARLVAKG+TQME GVD+H+TFAP W Sbjct: 394 VYKVKHKPDGQVDRYKARLVAKGYTQMEKGVDYHDTFAPVAKLVTMRTLLALAVKQDWII 453 Query: 1082 HQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTGSLF 1261 HQLDVNNAFLHGDL+E+VYMK+PQG +++RVC+L+KS+YGLKQASRNWY+KFT SL Sbjct: 454 HQLDVNNAFLHGDLDEEVYMKVPQGLMMNNEDRVCRLRKSMYGLKQASRNWYYKFTQSLV 513 Query: 1262 EIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKDLGP 1441 +G+KQ+ AD SLFI + V+ALIYVDDV++VGN+ +KI+ TK L ++F+IKDLG Sbjct: 514 SMGYKQSVADPSLFIFTEGTVHVSALIYVDDVIIVGNNMDKIKATKTLLHEQFTIKDLGS 573 Query: 1442 LKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPRVDA 1621 LKYFLGIEVA+TKEG+VLSQRKY LDIL D G+ GCRPSSFPMEQ LK D ++EP+VDA Sbjct: 574 LKYFLGIEVARTKEGLVLSQRKYILDILRDMGLEGCRPSSFPMEQTLKPDRAEEEPKVDA 633 Query: 1622 NQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQGILLP 1801 QYRRLIGRLLYLQATRPDI+++VN+LSQFV DPR H +AA R++ YLK T GQGILLP Sbjct: 634 GQYRRLIGRLLYLQATRPDISFSVNLLSQFVADPRQPHYDAAIRIVRYLKTTVGQGILLP 693 Query: 1802 KEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMS 1981 KEGG+NL+ YCDSDW+GCP +RRSRTGY+LL GGAP+ W++KKQSVVS+SSAEAEYRAM+ Sbjct: 694 KEGGSNLVTYCDSDWMGCPFSRRSRTGYMLLFGGAPVYWKSKKQSVVSRSSAEAEYRAMA 753 Query: 1982 NAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFVRERV 2161 VSEILW+RWLL+EL T LFCDN+AARHIANNPVFHERTKHVEMDCYFVRERV Sbjct: 754 TTVSEILWIRWLLNELGAQQSNSTILFCDNEAARHIANNPVFHERTKHVEMDCYFVRERV 813 Query: 2162 DSMEICPMPIATKDQIADVLTKALGANSLHFLL 2260 +S EI PM I + +QIAD+LTK LG L LL Sbjct: 814 ESKEILPMHIESANQIADLLTKPLGGPQLKILL 846 >gb|OMO87137.1| Integrase, catalytic core [Corchorus capsularis] Length = 1257 Score = 803 bits (2074), Expect = 0.0 Identities 
= 420/775 (54%), Positives = 510/775 (65%), Gaps = 10/775 (1%) Frame = +2 Query: 2 DNKTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVY 181 +NKTP+E+L KPEYDH++VFGCL Y +GDKF RG+P VF+GYP G KGY+VY Sbjct: 537 NNKTPFEMLFGKKPEYDHLRVFGCLVYAHDNSKRGDKFSERGKPCVFVGYPNGQKGYRVY 596 Query: 182 DLQHRKMVTSRDVKFLENVFPFARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIH 361 DL+ +K TSRDV F EN++PF N E E T + N + Sbjct: 597 DLKEKKFYTSRDVTFFENIYPFRPNDCYS----------GETELTAGGEYCRPVLNAADN 646 Query: 362 EPSSVMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHETVPSNQNDMGSETVS 541 + VM + + + N AD F + P A DIT V ++Q +E+ + Sbjct: 647 DCEEVMTLPQVKGLGNAADRFIAEEIPETAAG--------DIT-AGVTASQETTVTESTA 697 Query: 542 EENRPTNA----------HTRPVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSSTVHPL 691 EE ++A R R RT+P R DGF+V LPPS Q +L Sbjct: 698 EEPVVSSAVPISGQSRVEVRRSARERTQPKRFDGFDVQLPPSTVPAQPAL---------- 747 Query: 692 AHFISYDNFTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEEL 871 P AVK W EAM++EIQALEENGTW L L Sbjct: 748 -------------------------PSADSSAVKHKHWREAMEKEIQALEENGTWDLVPL 782 Query: 872 PKGKRAIDSKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXX 1051 P+ KRAIDSKWVYK+K+KPNGE+ERYKARLVAKGFTQ+EGVDFHETFAP Sbjct: 783 PQDKRAIDSKWVYKVKFKPNGEIERYKARLVAKGFTQIEGVDFHETFAPVAKLVTVRCLL 842 Query: 1052 XXXXXXGWHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRN 1231 W HQLDVNN FLHGDL E+V+MKIPQGF K + RVCKLKKSLYGL+QASRN Sbjct: 843 AIAAKRRWEVHQLDVNNVFLHGDLEEEVFMKIPQGFAKAGETRVCKLKKSLYGLRQASRN 902 Query: 1232 WYHKFTGSLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLD 1411 WYHKFT +L ++GF+Q+ ADHSLF++ K +TF+ ALIYVDDV+L GN+ +KIQ+ K +L+ Sbjct: 903 WYHKFTKALEDVGFRQSKADHSLFLYDKGETFLTALIYVDDVILAGNNGDKIQEVKSYLN 962 Query: 1412 KRFSIKDLGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLD 1591 +F IKDLGPLKYFLGIE A++ G+VLSQRKY LDILE++GM GC+PS+FPMEQN KL Sbjct: 963 DKFGIKDLGPLKYFLGIEAARSPAGIVLSQRKYALDILEESGMQGCKPSAFPMEQNHKLR 1022 Query: 1592 TCDKEPRVDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLK 1771 P +DA QYRRL+GRLLYL TRPD+ +AVN+LSQFV PR HM+AA RVL YLK Sbjct: 
1023 ADSNGPIIDAAQYRRLVGRLLYLTVTRPDLTFAVNVLSQFVSAPRQEHMDAALRVLRYLK 1082 Query: 1772 GTPGQGILLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKS 1951 PGQG+LL +G L+AYCD+DW GC T+RS TGY + LGG+PISWRTK+Q VVSKS Sbjct: 1083 KAPGQGVLLSAKGDLFLIAYCDADWGGCLTTKRSCTGYFITLGGSPISWRTKRQEVVSKS 1142 Query: 1952 SAEAEYRAMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVE 2131 SAEAEYRAM+ VSE+LW+ WLL++L PT LFCDNQAA HI NPV+HERTKHVE Sbjct: 1143 SAEAEYRAMAVTVSELLWLWWLLTDLQSPQTEPTPLFCDNQAALHITANPVYHERTKHVE 1202 Query: 2132 MDCYFVRERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT 2296 MDCYFVRE+ S EI P I+T Q+AD+ TKALG + L+ KLGV NLHAPT Sbjct: 1203 MDCYFVREKAQSREIAPRKISTCAQLADIFTKALGKDRFESLVFKLGVANLHAPT 1257 Score = 306 bits (783), Expect = 1e-82 Identities = 135/233 (57%), Positives = 187/233 (80%) Frame = +1 Query: 2617 IDYESPYYLHPSDYPRQMHVNDVLSDNNYADWSQEMMNFLFAKNKVGFINGSIKKPEEES 2796 ID SP+YLH SD P Q++V+D+L D NY +W +M N LFAKNK+GF++G+I +P +S Sbjct: 21 IDVMSPFYLHASDNPGQIYVSDLLHDGNYGEWVNDMSNALFAKNKIGFVDGTIPRPGVDS 80 Query: 2797 SNYMPWMRCDAMIKGWLHTAMEKEIRTSVKYAMTAREIWIDLKERFGKVSAPRAYELKRS 2976 N WMRC+AM+KGWL +AM K++R SV+YA TAREIW+DL+ERFGK S PRAYE++R+ Sbjct: 81 PNLQHWMRCNAMVKGWLKSAMGKDVRGSVRYASTAREIWVDLEERFGKGSDPRAYEIRRA 140 Query: 2977 LTSTKQEGTSVSAYYTKLRGIWDEIQSVIPMPRCDCSSCKCDIGKKLQELRDKERLYEFL 3156 +T +QE SVS+YYTKL+G+WDE+QS+ P+ +C C+ CKC+I K+L ++R+KE+LY+FL Sbjct: 141 VTLLRQEKMSVSSYYTKLKGLWDEMQSIFPLLKCVCNGCKCNISKQLVDMREKEQLYDFL 200 Query: 3157 LGLDAEFGTIRTQILAMNPIPSLGKAYHLVAEDEQQRAISGSKRPSSDSVAFQ 3315 +GLD EFG ++TQIL+ P P LG AYHLVAEDEQQ+ IS +++P +++ AFQ Sbjct: 201 MGLDDEFGIVKTQILSTKPTPGLGHAYHLVAEDEQQKQISANRKPIAEAAAFQ 253 >gb|OMO65653.1| hypothetical protein CCACVL1_21443 [Corchorus capsularis] Length = 1245 Score = 758 bits (1956), Expect = 0.0 Identities = 387/767 (50%), Positives = 493/767 (64%), Gaps = 2/767 (0%) Frame = +2 Query: 2 DNKTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVY 181 D KTP+E+L + P YDH+KVFGCL Y DKF R +F+GYP GTKGY+VY Sbjct: 535 
DGKTPFEMLFSKPPAYDHLKVFGCLCYALQKPKPNDKFSPRSSKCIFVGYPNGTKGYRVY 594 Query: 182 DLQHRKMVTSRDVKFLENVFPFARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIH 361 DL +K+ SRDV+F EN FPF ENT T +ND Sbjct: 595 DLTTKKIFVSRDVRFYENQFPF--------------------ENT------STSTNDQTV 628 Query: 362 EPSSVMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHETVPSNQNDMGSETVS 541 P + +T+ L ITH+++P N + Sbjct: 629 VPLPALEDTD-----------------------------LSITHDSIPPNPPQEQPQPHP 659 Query: 542 EENRPTNAHTRPVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSS--TVHPLAHFISYDN 715 N P TRP R++TRP RLD N +D++ SSL H++S T++ L++FISYDN Sbjct: 660 PTNPPNQPSTRPQRTKTRPKRLDDCVCN-NSKVDNSPSSLTHEASSGTLYSLSNFISYDN 718 Query: 716 FTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEELPKGKRAID 895 F ++HKAFL AI+ +EPK F QAVK +W EAM++E+ ALE N TW LE LP K+ I Sbjct: 719 FHSSHKAFLAAISLRDEPKSFSQAVKSPQWREAMQKELAALENNNTWTLETLPPRKKPIG 778 Query: 896 SKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXXXXXXXXGW 1075 KW++KIKYK +G +ERYKAR VAKG+ Q+EG+DFHETFAP W Sbjct: 779 CKWIFKIKYKSDGTIERYKARFVAKGYNQIEGMDFHETFAPVAKLVTVRCLLAIAAIKNW 838 Query: 1076 HTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTGS 1255 HQLDVNNAFLHGDL+E+VYM +P G+G ++D+RVC+++KSLYGLKQASRNW+ KF + Sbjct: 839 ELHQLDVNNAFLHGDLDEEVYMSLPPGYGDKNDSRVCRVRKSLYGLKQASRNWFAKFFAA 898 Query: 1256 LFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKDL 1435 L E GF Q+ D+SLF +F+ L+YVDD+++ G+DS +I+ K LD RF IKDL Sbjct: 899 LLEFGFIQSTVDYSLFTLTTGSSFLVVLVYVDDLIIAGDDSVRIRSLKQHLDSRFHIKDL 958 Query: 1436 GPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPRV 1615 GPLKYFLGIEVA++ G+ L QRKYTLDILE+ GMT +PS+FPMEQ L P Sbjct: 959 GPLKYFLGIEVARSSSGIFLCQRKYTLDILEECGMTDAKPSAFPMEQKHNLTHDTGPPVQ 1018 Query: 1616 DANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQGIL 1795 D QYRRL+GRL+YL TRP+I+YAV+ILSQF+ DPR H++AA RVL YLK PGQGI Sbjct: 1019 DPMQYRRLVGRLIYLTITRPEISYAVHILSQFMNDPRQPHLDAALRVLRYLKSCPGQGIF 1078 Query: 1796 LPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRA 1975 +L + DSDW CP TRRS TGY+ +LG +PISW+TKKQ+ VS+SSAEAEYRA Sbjct: 
1079 FSSSSSPHLTGFSDSDWASCPQTRRSTTGYITMLGSSPISWKTKKQTTVSRSSAEAEYRA 1138 Query: 1976 MSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFVRE 2155 M+ VSE+LW+R LL L + P LFCDNQ A HIA NPVFHERTKH+E+DC+F+R Sbjct: 1139 MAATVSELLWLRSLLQTLGIPHQQPMALFCDNQVAIHIATNPVFHERTKHIELDCHFIRS 1198 Query: 2156 RVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT 2296 + + I I++K Q+AD+ TKALG + FLL KLG+ NLHAPT Sbjct: 1199 HIQAKSIQTSHISSKLQLADIFTKALGRDQFQFLLRKLGIFNLHAPT 1245 Score = 235 bits (600), Expect = 2e-59 Identities = 129/361 (35%), Positives = 196/361 (54%), Gaps = 24/361 (6%) Frame = +1 Query: 2617 IDYESPYYLHPSDYPRQMHVNDVLSDNNYADWSQEMMNFLFAKNKVGFINGSIKKPEEES 2796 +D SPY L PSD+P + V+ L+ +NY W++ M N L A+NK GF++GS+ KPE S Sbjct: 24 MDLSSPYLLQPSDHPGAILVSCPLNGDNYPTWARAMTNALRARNKYGFVDGSLAKPEATS 83 Query: 2797 SNYMPWMRCDAMIKGWLHTAMEKEIRTSVKYAMTAREIWIDLKERFGKVSAPRAYELKRS 2976 + W +C++M+ W+ ++ ++ SV Y TARE+W+DL+ERF + +APR +LKR Sbjct: 84 PDVSTWEKCNSMVISWIFNSLSSDLHNSVAYVDTAREMWLDLEERFSQGNAPRINQLKRD 143 Query: 2977 LTSTKQEGTSVSAYYTKLRGIWDEIQSVIPMPRCDCSSCKCDIGKKLQELRDKERLYEFL 3156 L T Q SV+AYYTKL+GIWDE+Q+ +P C C + K+L R++E++++F+ Sbjct: 144 LALTFQINMSVAAYYTKLKGIWDELQTYSTIPPCTCGA-----AKELLLEREREKVHQFI 198 Query: 3157 LGLDAEFGTIRTQILAMNPIPSLGKAYHLVAEDEQQRAISGSKRPSSDSVAFQAHVPVKR 3336 +GLD F ++ + IL + P+PSL KAY LV E++ ++ ++ P ++ A V Sbjct: 199 MGLDDSFRSVSSHILNIEPLPSLSKAYALVTRAERENSVRSTRPPIVEATALH----VTT 254 Query: 3337 DQNQSQNRTKQKDVKRGNSEPVEQCTVCGKDGHKSEGCFKLIGYP-EWWPGKGKQYKPKP 3513 N +Q+ T + +C C K GH C++L+GYP W GK + K KP Sbjct: 255 SANAAQSHTTRL-----------RCDHCNKTGHTKSHCYELVGYPSHWQKGKTDKDKRKP 303 Query: 3514 SAALVEG---------------EKSPIAGLSDSQYRQFLKFFG--------DKDGAKTED 3624 A + SPIAGL+ QY Q + D G T D Sbjct: 304 HAKAGSSNPKAMFPTCHVAKTIDASPIAGLTSEQYNQLISLLNIEKTNIVDDFSGKTTND 363 Query: 3625 S 3627 S Sbjct: 364 S 364 >ref|XP_022018997.1| uncharacterized protein LOC110919025 [Helianthus annuus] Length = 386 Score = 711 bits (1834), Expect = 0.0 Identities = 354/386 (91%), Positives = 362/386 (93%) Frame = 
+2 Query: 1139 MKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTGSLFEIGFKQTPADHSLFIHRKD 1318 MKIPQGFGKQDDNRVCKLKK LY LKQASRNWY KFT SL EIGFKQTPA++SLFI +++ Sbjct: 1 MKIPQGFGKQDDNRVCKLKKCLYDLKQASRNWYQKFTHSLLEIGFKQTPANYSLFIFKEN 60 Query: 1319 KTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKDLGPLKYFLGIEVAKTKEGMVLS 1498 K FVAALIYVDDVVLV NDS KIQ TKDFLDKRFSIKDLGPLKYFLGIEVAKT EGMVLS Sbjct: 61 KIFVAALIYVDDVVLVRNDSRKIQATKDFLDKRFSIKDLGPLKYFLGIEVAKTNEGMVLS 120 Query: 1499 QRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPRVDANQYRRLIGRLLYLQATRPD 1678 QRKYTLDILED GMTGCRPSSFPMEQNLKLD CDKEPRVDANQYRRLIGRLLYLQATRPD Sbjct: 121 QRKYTLDILEDVGMTGCRPSSFPMEQNLKLDMCDKEPRVDANQYRRLIGRLLYLQATRPD 180 Query: 1679 IAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQGILLPKEGGTNLLAYCDSDWLGCP 1858 IAYAVNILSQFV PR +HMEAATRVL YLKGT GQGIL+PKEG NLLAYCDSDWLGCP Sbjct: 181 IAYAVNILSQFVNYPRQTHMEAATRVLRYLKGTLGQGILIPKEGVANLLAYCDSDWLGCP 240 Query: 1859 MTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMSNAVSEILWMRWLLSELDMA 2038 MTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMSNAVSEILWMRWLL ELDMA Sbjct: 241 MTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMSNAVSEILWMRWLLRELDMA 300 Query: 2039 PVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFVRERVDSMEICPMPIATKDQIADV 2218 PVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYF+RERVDSMEICPM IATKDQIADV Sbjct: 301 PVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFIRERVDSMEICPMSIATKDQIADV 360 Query: 2219 LTKALGANSLHFLLCKLGVRNLHAPT 2296 LTK LGANSL FLLCKLGVRNLHAPT Sbjct: 361 LTKDLGANSLCFLLCKLGVRNLHAPT 386 >gb|OTG09093.1| putative reverse transcriptase, RNA-dependent DNA polymerase, Gag-polypeptide of LTR copia-type [Helianthus annuus] Length = 938 Score = 724 bits (1870), Expect = 0.0 Identities = 369/575 (64%), Positives = 427/575 (74%) Frame = +2 Query: 572 RPVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSSTVHPLAHFISYDNFTNTHKAFLTAI 751 R R+R++PARL + V LPPS+DH + + SST Sbjct: 428 RAKRNRSQPARLSDYHVKLPPSVDHANPAPNEASST------------------------ 463 Query: 752 TTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEELPKGKRAIDSKWVYKIKYKPN 931 P++F QA++D RW EAMK+EI+ALEEN TW L +LP GKRA+DSKWVYKIKYKPN Sbjct: 464 
-----PRNFNQAIQDERWKEAMKKEIRALEENNTWTLVDLPNGKRAVDSKWVYKIKYKPN 518 Query: 932 GEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXXXXXXXXGWHTHQLDVNNAFL 1111 GEVER+KARLVAKGFTQMEGVD+H+TFAP W +QLDVNNAFL Sbjct: 519 GEVERFKARLVAKGFTQMEGVDYHDTFAPVAKLVTVRTLLAVAVKKRWVINQLDVNNAFL 578 Query: 1112 HGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTGSLFEIGFKQTPAD 1291 HGDLNE+VYMK+PQGF K++D RVC+L KSLYGLKQASRNWY KFT SL E+G+KQ AD Sbjct: 579 HGDLNEEVYMKLPQGFAKENDTRVCRLNKSLYGLKQASRNWYQKFTSSLLELGYKQCKAD 638 Query: 1292 HSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKDLGPLKYFLGIEVA 1471 +SLFI ++D FVAALIYVDDV++VGND+ KIQ TK LD+RFSIKDLG LKYFLGIEVA Sbjct: 639 YSLFIFKEDACFVAALIYVDDVIIVGNDARKIQHTKVELDRRFSIKDLGTLKYFLGIEVA 698 Query: 1472 KTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPRVDANQYRRLIGRL 1651 +T EG+VLSQRKY LDILED G+ GC+PS FP EQNLKLD D+EP+VDA++YRRL+GRL Sbjct: 699 RTPEGLVLSQRKYILDILEDCGLQGCKPSPFPFEQNLKLDKNDEEPKVDASRYRRLVGRL 758 Query: 1652 LYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQGILLPKEGGTNLLAY 1831 LYLQATRPDIAY+VN+LSQFV DPR SHM+AA RVL Sbjct: 759 LYLQATRPDIAYSVNVLSQFVADPRQSHMDAAHRVL------------------------ 794 Query: 1832 CDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMSNAVSEILWMR 2011 RRSRTGYLLLLGGAPISW+TKKQ+VVS+SSAEAEYR+M++ VSEILWMR Sbjct: 795 -----------RRSRTGYLLLLGGAPISWKTKKQNVVSRSSAEAEYRSMASTVSEILWMR 843 Query: 2012 WLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFVRERVDSMEICPMPI 2191 WLL EL++ + PT LFCDNQAARHIANNPVFHERTKHVEMDCYFVRERV+S E+ P+ I Sbjct: 844 WLLKELNIYTIEPTPLFCDNQAARHIANNPVFHERTKHVEMDCYFVRERVESQEVQPLRI 903 Query: 2192 ATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT 2296 T QIAD+LTK LG L FLL KLGVRNLHAPT Sbjct: 904 DTSMQIADLLTKGLGTQQLTFLLDKLGVRNLHAPT 938 Score = 469 bits (1208), Expect = e-144 Identities = 219/338 (64%), Positives = 269/338 (79%), Gaps = 1/338 (0%) Frame = +1 Query: 2596 GKTKEGGI-DYESPYYLHPSDYPRQMHVNDVLSDNNYADWSQEMMNFLFAKNKVGFINGS 2772 G KEG D SP Y+H SDYP+QMHVND L+DNNY DWSQEM+NFLFAKNKVGF++G+ Sbjct: 7 GTKKEGSSPDINSPLYIHASDYPKQMHVNDTLTDNNYTDWSQEMLNFLFAKNKVGFVDGT 66 Query: 2773 
IKKPEEESSNYMPWMRCDAMIKGWLHTAMEKEIRTSVKYAMTAREIWIDLKERFGKVSAP 2952 +KKPE+ +++YM WMRCDAM+KGWL TAMEK+IR SVKYA TA EIW DL+ERFGK SAP Sbjct: 67 LKKPEKTATDYMAWMRCDAMVKGWLTTAMEKDIRGSVKYANTASEIWSDLRERFGKASAP 126 Query: 2953 RAYELKRSLTSTKQEGTSVSAYYTKLRGIWDEIQSVIPMPRCDCSSCKCDIGKKLQELRD 3132 RAYELK++L++T Q G+SVSAYYTKLR +WDEI+SV+P PRC C C C +GKK+ ELR+ Sbjct: 127 RAYELKQTLSNTHQSGSSVSAYYTKLRVLWDEIESVLPAPRCTCDKCSCGVGKKMNELRE 186 Query: 3133 KERLYEFLLGLDAEFGTIRTQILAMNPIPSLGKAYHLVAEDEQQRAISGSKRPSSDSVAF 3312 KERLYEFL+GLDA+F I+TQILAMNPIP+LG AYHLVAEDE+QR ISG K+ +++ AF Sbjct: 187 KERLYEFLMGLDADFAVIKTQILAMNPIPTLGNAYHLVAEDERQRMISGEKKTPTENAAF 246 Query: 3313 QAHVPVKRDQNQSQNRTKQKDVKRGNSEPVEQCTVCGKDGHKSEGCFKLIGYPEWWPGKG 3492 +A PV+R+ + SQN+ KD K G + VEQCT CG+ GHK +GCFK+IGYP+WWPG Sbjct: 247 KAFKPVRRENSTSQNKAAPKDQKHG--DMVEQCTHCGRSGHKRDGCFKIIGYPDWWPG-- 302 Query: 3493 KQYKPKPSAALVEGEKSPIAGLSDSQYRQFLKFFGDKD 3606 K KP AA VE + SP+ GL+ QY+ FLK F + D Sbjct: 303 ---KMKPKAAHVETDASPVPGLTKEQYQSFLKHFAEND 337 >gb|OMO75305.1| Integrase, catalytic core [Corchorus capsularis] Length = 1373 Score = 734 bits (1895), Expect = 0.0 Identities = 396/775 (51%), Positives = 487/775 (62%), Gaps = 10/775 (1%) Frame = +2 Query: 2 DNKTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVY 181 +NKTP+E+L KPEYDH++VFGCL Y +GDKF RG+P VF+GYP G KG + Sbjct: 687 NNKTPFEMLFGKKPEYDHLRVFGCLVYAHDNSKRGDKFSERGKPCVFVGYPNGQKGQMIV 746 Query: 182 DLQHRKMVTSRDVKFLENVFPFARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIH 361 +Q ++ +T+ ++ V A N EE + LPQ Sbjct: 747 -IQEKQSLTAGG-EYCRPVLNAANNDCEE--VMTLPQ----------------------- 779 Query: 362 EPSSVMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHETVPSNQNDMGSETVS 541 + + N AD F + P A DIT V ++Q +E+ + Sbjct: 780 ----------VKGLGNAADRFIAEEIPETAAG--------DIT-AGVTASQETTVTESTA 820 Query: 542 EENRPTNAHT----------RPVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSSTVHPL 691 EE ++A R R RT+P R DGF+V LPPS Q +L S+V+PL Sbjct: 821 EEPVVSSAVPISGQSRVEVRRSARERTQPKRFDGFDVQLPPSTVPAQPALPSADSSVYPL 880 Query: 692 
AHFISYDNFTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEEL 871 +H++SYD ++HKAFL IT+++EPKHF QAVK W EA ++EIQALEENGTW L L Sbjct: 881 SHYVSYDRIAHSHKAFLATITSHDEPKHFSQAVKHKHWREAKEKEIQALEENGTWDLVPL 940 Query: 872 PKGKRAIDSKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXX 1051 P+ KRAIDSKWVYK+K+KPNGE+ERYKARLVAKGFTQ+EGVDFHETFAP Sbjct: 941 PQDKRAIDSKWVYKVKFKPNGEIERYKARLVAKGFTQIEGVDFHETFAPVAKLVTVRCLL 1000 Query: 1052 XXXXXXGWHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRN 1231 W HQLD ASRN Sbjct: 1001 AIAAKRRWEVHQLD------------------------------------------ASRN 1018 Query: 1232 WYHKFTGSLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLD 1411 WYHKFT +L ++GF+Q+ ADHSLF++ K +TF+ ALIYVDDV+L GN+ +KIQ+ K +L+ Sbjct: 1019 WYHKFTKALEDVGFRQSKADHSLFLYDKGETFLTALIYVDDVILAGNNGDKIQEIKSYLN 1078 Query: 1412 KRFSIKDLGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLD 1591 +F IKDLGPLKYFLGIEVA++ G+VLSQRKY LDILE++GM GC+PS+FPME N KL Sbjct: 1079 DKFDIKDLGPLKYFLGIEVARSPAGIVLSQRKYVLDILEESGMQGCKPSAFPMEHNHKLR 1138 Query: 1592 TCDKEPRVDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLK 1771 +DA QYRRL+GRLLYL TRPD+ +AVN+LSQFV PR HM+AA RVL YLK Sbjct: 1139 ADSNGTIIDAAQYRRLVGRLLYLTVTRPDLTFAVNVLSQFVSAPRQEHMDAALRVLRYLK 1198 Query: 1772 GTPGQGILLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKS 1951 PGQGILL EG L AYCD+DW GC TRRS TGY + LGG+PISWRTK+Q VVSKS Sbjct: 1199 KAPGQGILLSAEGDLFLTAYCDADWGGCLTTRRSCTGYFITLGGSPISWRTKRQQVVSKS 1258 Query: 1952 SAEAEYRAMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVE 2131 SAEAEYRAM+ VSE+LW+RWLL++L PT LFCDNQAA HI PV+HERTKHVE Sbjct: 1259 SAEAEYRAMAVTVSELLWLRWLLTDLQSPQTEPTPLFCDNQAALHITAKPVYHERTKHVE 1318 Query: 2132 MDCYFVRERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT 2296 MDCYFVRER S EI P I+T Q+AD+ TKALG + L+ KLGV NLHA T Sbjct: 1319 MDCYFVRERAQSREIAPRKISTGAQLADIFTKALGKDRFESLVFKLGVANLHALT 1373 Score = 214 bits (546), Expect = 9e-53 Identities = 124/352 (35%), Positives = 183/352 (51%), Gaps = 15/352 (4%) Frame = +1 Query: 2617 
IDYESPYYLHPSDYPRQMHVNDVLSDNNYADWSQEMMNFLFAKNKVGFINGSIKKPEEES 2796 ID SP+YLH SD P Q++V+D+L D NY +W +M N LFAKNK+GF++G+I +PE +S Sbjct: 21 IDVMSPFYLHASDNPGQIYVSDLLHDGNYGEWVNDMSNALFAKNKIGFVDGTIPRPEVDS 80 Query: 2797 SNYMPWMRCDAMIKGWLHTAMEKEIRTSVKYAMTAREIWIDLKERFGKVSAPRAYELKRS 2976 N WMRC+AM+KGWL +AM K++R SV+YA TAREIW+DL+ERFGK S PRAYE++R+ Sbjct: 81 PNLQHWMRCNAMVKGWLKSAMGKDVRGSVRYASTAREIWVDLEERFGKGSDPRAYEIRRA 140 Query: 2977 LTSTKQEGTSVSAYYTKLRGIWDEIQSVIPMPRCDCSSCKCDIGKKLQELRDKERLYEFL 3156 +T +QE SVS+YYTKL+G + R CS Sbjct: 141 VTLLRQEKMSVSSYYTKLKGFY----------RTRCS----------------------- 167 Query: 3157 LGLDAEFGTIRTQILAMNPIPSLGKAYHLVAEDEQQRAISGSKRPSSDSVAFQAHVPVKR 3336 +F R + + I L K + + + +R + ++R Sbjct: 168 -----QFSHCRNASVLVMHITWLRKMNNKNRSRQTANPLLRRRRSKCKEAKMEGTEDLER 222 Query: 3337 DQNQSQNRTKQKDVKRGN-----SEPVEQCTVCGKDGHKSEGCFKLIGYPEWW------- 3480 NQ N K+ + P +C C K GH + C+++IGYP W Sbjct: 223 KTNQGVNIVKKVGHTKDQCYEIIGYPAARCGHCQKSGHTKDQCYEIIGYPAGWRKNLRDK 282 Query: 3481 ---PGKGKQYKPKPSAALVEGEKSPIAGLSDSQYRQFLKFFGDKDGAKTEDS 3627 G+ ++ P AA VE E + I GL+ +Q + ++F + DG ++ + Sbjct: 283 KEKGGQTVNHRTFPKAAQVESEMTSIPGLTQAQLAKLVQFL-NVDGESSKQT 333 >gb|PNY16822.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 834 Score = 701 bits (1808), Expect = 0.0 Identities = 368/771 (47%), Positives = 482/771 (62%), Gaps = 9/771 (1%) Frame = +2 Query: 11 TPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYDLQ 190 TPYEVL + P YDH+K+FGCL Y + DKF+ R +F+GYP G KG+KVY+ + Sbjct: 99 TPYEVLFRNSPTYDHLKIFGCLCYVSTNTKLRDKFDPRAERCIFVGYPQGQKGWKVYNPK 158 Query: 191 HRKMVTSRDVKFLENVFPFARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIHEPS 370 +K SRDV F EN+ P+ + E LP + +I Q +SN + E Sbjct: 159 TQKFFVSRDVVFYENILPYVVHEKE------LPIE-SPSVVFHEISGQQEESNHDEVEKY 211 Query: 371 SVMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHETVPSNQNDMGSETVSEEN 550 E Q N A+ G ++EP ND+G E E+ Sbjct: 212 E-RQENRGQGDTNPAEMDGQNEEP------------------------NDVGIEVHKEKE 246 Query: 551 
RPTNAHTR-----PVRSRTRPARLDGFEVNL----PPSLDHTQSSLHHDSSTVHPLAHFI 703 H P R+R P L + P S+ TQS H S ++P+ +FI Sbjct: 247 TEVPVHNEMETDLPPRTRQPPGYLQDYHCYTSHKNPISMPKTQS---HSSGKIYPITNFI 303 Query: 704 SYDNFTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEELPKGK 883 S D ++ H+A+L AI EP+ +++AVK W EAM E++ALEENGTW LE P K Sbjct: 304 SNDCYSRRHQAYLAAIHNTKEPQSYREAVKKTEWKEAMAAELKALEENGTWDLELPPTCK 363 Query: 884 RAIDSKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXXXXXX 1063 + + KWVYK+KYK GEVE+YKARLVAKG+TQ+EG DF+ETFAP Sbjct: 364 KIVGCKWVYKVKYKATGEVEKYKARLVAKGYTQVEGEDFNETFAPVAKMTTVRCLLTVAV 423 Query: 1064 XXGWHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHK 1243 GW HQ+DV+NAFLHGDL+E+VYM++P+G+ VC+L+KSLYGLKQASRNWY K Sbjct: 424 AKGWELHQMDVSNAFLHGDLDEEVYMQVPEGYHTPKAGMVCRLRKSLYGLKQASRNWYSK 483 Query: 1244 FTGSLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFS 1423 + +L E GF+++ ADHSLF + ++ F+A L+YVDD+V+ GN S+ + K +L + F Sbjct: 484 LSHALIEYGFQESHADHSLFTYSREGEFMAVLVYVDDLVIAGNYSDTCTNFKQYLRRCFH 543 Query: 1424 IKDLGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDK 1603 +KDLGPLKYFLG+E+A+ G+ + QRKY +DIL++ M +PS+FPMEQN KL Sbjct: 544 MKDLGPLKYFLGLELARGATGLFMCQRKYIMDILDECKMLDSKPSTFPMEQNQKLALDTG 603 Query: 1604 EPRVDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPG 1783 D +YRRL+GRL+YL TRP+I Y+V+ILSQF P+ +H +AA RVL YLK TPG Sbjct: 604 PAYSDPPRYRRLVGRLIYLTITRPEITYSVHILSQFTQSPQQAHWDAAMRVLRYLKFTPG 663 Query: 1784 QGILLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEA 1963 QGI+LPKE L+AYCDSDW CP+TRRS +GYL+ LG APISW+TKKQS VSKSS+EA Sbjct: 664 QGIILPKENDLQLVAYCDSDWASCPLTRRSTSGYLMKLGSAPISWKTKKQSTVSKSSSEA 723 Query: 1964 EYRAMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCY 2143 EYRAM AVSE++W+R LLS L + PT LFCDNQAA H+A NPV+HERTKH+E+DC+ Sbjct: 724 EYRAMGQAVSEVIWLRSLLSSLQVHYKSPTVLFCDNQAAIHLAANPVYHERTKHIEVDCH 783 Query: 2144 FVRERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT 2296 F+R + I ++TK Q AD+ TKALGA L KLG N H PT Sbjct: 784 FIRTHLQKGTISTNYVSTKKQQADIFTKALGAKQFQELTFKLGAHNPHTPT 
834 >gb|OTG24510.1| putative reverse transcriptase, RNA-dependent DNA polymerase, Gag-polypeptide of LTR copia-type [Helianthus annuus] Length = 934 Score = 699 bits (1805), Expect = 0.0 Identities = 346/525 (65%), Positives = 406/525 (77%) Frame = +2 Query: 722 NTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEELPKGKRAIDSK 901 N + L TN++P + D +W AM++EI+ALE+NGTW LEELP+GKR DSK Sbjct: 425 NDYVVSLPPSVTNSQPGSSQANSTDEKWRNAMQQEIKALEKNGTWTLEELPEGKRPTDSK 484 Query: 902 WVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXXXXXXXXGWHT 1081 WVYK K+K +GEVERYKARLVAKGFTQMEGVD+HETFAP W Sbjct: 485 WVYKTKFKSDGEVERYKARLVAKGFTQMEGVDYHETFAPVAKLVTVRTLLAVATKKDWII 544 Query: 1082 HQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTGSLF 1261 HQLDVNNAFLHGDL+E+VYMKIP+GF K+ + RVC+L+KSLYGLKQASRNWY + T L Sbjct: 545 HQLDVNNAFLHGDLDEEVYMKIPKGFEKEGETRVCRLRKSLYGLKQASRNWYKRLTSFLL 604 Query: 1262 EIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKDLGP 1441 + FKQ+ AD+SLF ++K +VA LIYVDDV++VG++S KIQ K LD FSIKDLGP Sbjct: 605 SLNFKQSKADYSLFTYQKAGIYVAILIYVDDVIIVGDNSKKIQQIKQQLDDEFSIKDLGP 664 Query: 1442 LKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPRVDA 1621 LKYFLGIEVAKTK+G+VLSQRKY LDIL+D+GM GCRPS+FP EQ KLD +KE RVDA Sbjct: 665 LKYFLGIEVAKTKDGLVLSQRKYILDILKDSGMLGCRPSAFPFEQGTKLDKGEKEARVDA 724 Query: 1622 NQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQGILLP 1801 QYRRL+GRLLYLQATRPD+ YAVN AA RVL YLKGTPGQGILLP Sbjct: 725 TQYRRLVGRLLYLQATRPDVTYAVN---------------AANRVLRYLKGTPGQGILLP 769 Query: 1802 KEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMS 1981 +EG L YCDSDWLGCP TRRSRTGYLLLLGG+PISW+TKKQSVVS+SSAEAEYRAM+ Sbjct: 770 REGPPVLTGYCDSDWLGCPFTRRSRTGYLLLLGGSPISWKTKKQSVVSRSSAEAEYRAMA 829 Query: 1982 NAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFVRERV 2161 + VSEILW+RWLL ++ + PT LFCDNQAARHIANNPV+HERTKHVEMDC+FVRERV Sbjct: 830 STVSEILWVRWLLKDMQVQITTPTSLFCDNQAARHIANNPVYHERTKHVEMDCFFVRERV 889 Query: 2162 DSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT 2296 ++ EI P PI +K Q+AD+LTK LG L 
LL K+G+R+LHAP+ Sbjct: 890 ETREIEPKPIESKLQLADLLTKGLGTQQLRSLLSKMGIRDLHAPS 934 Score = 417 bits (1071), Expect = e-124 Identities = 197/345 (57%), Positives = 253/345 (73%), Gaps = 1/345 (0%) Frame = +1 Query: 2617 IDYESPYYLHPSDYPRQMHVNDVLSDNNYADWSQEMMNFLFAKNKVGFINGSIKKPEEES 2796 ID+ SPYYLHPSD P+Q VN+VL+D NY DW+QEM NFLFAKNK+ F++G++KKPE S Sbjct: 14 IDFSSPYYLHPSDSPKQPSVNEVLTDGNYNDWAQEMTNFLFAKNKIDFVDGTLKKPETSS 73 Query: 2797 SNYMPWMRCDAMIKGWLHTAMEKEIRTSVKYAMTAREIWIDLKERFGKVSAPRAYELKRS 2976 S Y WMRCDAMIKGWL TAMEK IR SVKYA+T+ EIW DLKERFGK SAPR YELK+ Sbjct: 74 SQYKSWMRCDAMIKGWLTTAMEKSIRDSVKYAVTSSEIWSDLKERFGKESAPRTYELKQK 133 Query: 2977 LTSTKQEGTSVSAYYTKLRGIWDEIQSVIPMPRCDCSSCKCDIGKKLQELRDKERLYEFL 3156 + +T+Q+G++VS YYT+LR +WDE S+ P P C C+ C C++GKK+ E +K++LYEFL Sbjct: 134 IAATRQDGSNVSTYYTRLRSLWDESHSIFPFPCCSCNKCTCELGKKIAEHLEKQQLYEFL 193 Query: 3157 LGLDAEFGTIRTQILAMNPIPSLGKAYHLVAEDEQQRAISGSKRPSSDSVAFQAHVPVKR 3336 +GLD +F IRTQILA P+P+LG AYH+VAEDE+QRAIS R + +S AF+ KR Sbjct: 194 MGLDNDFNVIRTQILATKPVPTLGTAYHMVAEDERQRAISNENRVAPESAAFKTF--QKR 251 Query: 3337 DQNQSQNRTKQKDVKRGNSEPVEQCTVCGKDGHKSEGCFKLIGYPEWWPGKGKQYKPKPS 3516 N + K + S+ +QCT CG++ HK EGCFKL+GYP+WWPGK K K KP Sbjct: 252 HNNFKSPKEKYTTTQEKESKQNDQCTFCGRNSHKREGCFKLVGYPDWWPGK-KDDKAKPK 310 Query: 3517 AALVEGEKSPIAGLSDSQYRQFLKFF-GDKDGAKTEDSGPKANLA 3648 AA V+ SPI G+S+ QY+ F+KFF G + +T+ +AN+A Sbjct: 311 AACVDTGTSPIPGISEEQYQAFVKFFSGSGNNVETKS---EANMA 352 >gb|OTG06009.1| putative zinc finger, CCHC-type [Helianthus annuus] Length = 961 Score = 696 bits (1795), Expect = 0.0 Identities = 353/506 (69%), Positives = 401/506 (79%), Gaps = 6/506 (1%) Frame = +2 Query: 632 PSLDHTQSSLHHDSSTVHPLAHFISYD--NFTNTHKAFLTAITTNNEPKHFK----QAVK 793 P T SLH TV P + + + T K+ T + P K QAVK Sbjct: 13 PKPSPTGPSLHEAFETVEPQREKKAGHGPSISMTMKSVSPHPLTMSNPPPIKSPQRQAVK 72 Query: 794 DVRWVEAMKREIQALEENGTWILEELPKGKRAIDSKWVYKIKYKPNGEVERYKARLVAKG 973 D +WVEAMK+E+QALEENGTW EELP+GKRAIDSKWVYKI+YKPNGE+ERYKARL+AKG Sbjct: 73 
DPKWVEAMKKEVQALEENGTWRPEELPEGKRAIDSKWVYKIQYKPNGEIERYKARLMAKG 132 Query: 974 FTQMEGVDFHETFAPXXXXXXXXXXXXXXXXXGWHTHQLDVNNAFLHGDLNEDVYMKIPQ 1153 F+Q EG+DFHETFAP GWH HQLDVNNAFL GDL+EDVYMKIPQ Sbjct: 133 FSQTEGIDFHETFAPVAKLVTVRTLLAVAVKKGWHIHQLDVNNAFLDGDLHEDVYMKIPQ 192 Query: 1154 GFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTGSLFEIGFKQTPADHSLFIHRKDKTFVA 1333 GF +Q+D VCKLKKSLYGLKQASRNWY KFT SL EIGF+QT DHSLF+ + D+ F+A Sbjct: 193 GFVRQNDQCVCKLKKSLYGLKQASRNWYKKFTKSLLEIGFRQTGVDHSLFLLKTDEVFMA 252 Query: 1334 ALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKDLGPLKYFLGIEVAKTKEGMVLSQRKYT 1513 ALIYVDDV+LV N+ ++Q+ K FLDKRFSIKDLG LK+FLGIEVA+TK+G+VLSQ KY Sbjct: 253 ALIYVDDVILVWNNMEEMQNVKSFLDKRFSIKDLGVLKFFLGIEVARTKKGLVLSQHKYI 312 Query: 1514 LDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPRVDANQYRRLIGRLLYLQATRPDIAYAV 1693 LDILED GMTGCRPS FPM+QNLKLD C+KE +VDANQYRRLIGRLLYLQATRPD+AY V Sbjct: 313 LDILEDTGMTGCRPSQFPMKQNLKLDKCNKEAQVDANQYRRLIGRLLYLQATRPDVAYVV 372 Query: 1694 NILSQFVGDPRHSHMEAATRVLCYLKGTPGQGILLPKEGGTNLLAYCDSDWLGCPMTRRS 1873 NI S+FVGDPR SH+EAA RVL YLKGTP Q ILLPK GGTNL+ YCDSDWLGCP TRRS Sbjct: 373 NIFSKFVGDPRVSHLEAANRVLRYLKGTPRQRILLPKLGGTNLITYCDSDWLGCPFTRRS 432 Query: 1874 RTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMSNAVSEILWMRWLLSELDMAPVGPT 2053 RTGYLLLLGGAPISW++K+QSVVS+SSAEAEYRAM+ +SE MR LL ELD+ GPT Sbjct: 433 RTGYLLLLGGAPISWKSKEQSVVSRSSAEAEYRAMAAVISE---MRRLLKELDIPQEGPT 489 Query: 2054 QLFCDNQAARHIANNPVFHERTKHVE 2131 QLFC+NQAARHIANN VFHERTKH++ Sbjct: 490 QLFCNNQAARHIANNLVFHERTKHIQ 515 >gb|PNX93928.1| hypothetical protein L195_g017092, partial [Trifolium pratense] Length = 865 Score = 685 bits (1767), Expect = 0.0 Identities = 355/764 (46%), Positives = 487/764 (63%) Frame = +2 Query: 5 NKTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYD 184 NK+P+E+L+N P DH++VFGCL Y V KF+ R + G+F+GYP G KGYK+YD Sbjct: 131 NKSPFELLYNKPPSLDHLRVFGCLCYATIVHPT-HKFDPRAKRGIFVGYPTGQKGYKIYD 189 Query: 185 LQHRKMVTSRDVKFLENVFPFARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIHE 364 + + SRDVKF E FP N +E I P ++ D++ Sbjct: 190 
PETKTFFVSRDVKFCETNFPSIPNTSEPNLISSHP---------------SYEAIDDLPS 234 Query: 365 PSSVMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHETVPSNQNDMGSETVSE 544 P+S +++ D+ + + PS +E TS P+++ T T ++ D + + + Sbjct: 235 PTSSHHQSQQTDIPSTHEPNSPSHITTE--TSSAASPIVEPTPLT--THTTDPPTPFIPQ 290 Query: 545 ENRPTNAHTRPVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSSTVHPLAHFISYDNFTN 724 + P+ + ++ ++ T S S T +PL+H++SY ++ Sbjct: 291 VRKSVRDKHPPIWHN---------DYHMSTQVNKTPSEPTSGSGTRYPLSHYLSYSRISS 341 Query: 725 THKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEELPKGKRAIDSKW 904 ++ AFL IT + EP+ + QAV D W +AM E++ALE+N TW L LP G + I KW Sbjct: 342 SNCAFLANITAHREPQSYDQAVHDPLWQDAMNAELEALEQNNTWSLVPLPSGHKPIGCKW 401 Query: 905 VYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXXXXXXXXGWHTH 1084 VYKIKYK +G +ERYKARLVAKG+TQ+EG+D+ ETF+P W H Sbjct: 402 VYKIKYKSDGTIERYKARLVAKGYTQVEGIDYQETFSPTAKVTTLRCLLTVAAARNWFIH 461 Query: 1085 QLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTGSLFE 1264 QLDV NAFLHGDL+E VYM+ P G +Q +N VC+L KSLYGLKQASRNW+ F+ + + Sbjct: 462 QLDVQNAFLHGDLHELVYMEPPPGLRRQGENVVCRLNKSLYGLKQASRNWFSTFSEVIQK 521 Query: 1265 IGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKDLGPL 1444 G++Q+ AD+SLF + +F A LIYVDD++L GND +++ K+FL KRF IKDLG L Sbjct: 522 AGYQQSKADYSLFTKSQGTSFTAVLIYVDDILLTGNDLQEMKRLKEFLLKRFRIKDLGNL 581 Query: 1445 KYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPRVDAN 1624 KYFLGIE +++K+G+ +SQRKY LDIL+D+G+TG RP FPMEQNLKL D D Sbjct: 582 KYFLGIEFSRSKKGIFMSQRKYALDILQDSGLTGARPDKFPMEQNLKLTPTDGVVLNDPT 641 Query: 1625 QYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQGILLPK 1804 +YRRL+GRL+YL TRPDI Y+V LSQF+ +PR H +AA RVL Y+KGTPGQG+L Sbjct: 642 KYRRLVGRLIYLTVTRPDIVYSVQTLSQFMHEPRKPHWDAALRVLRYIKGTPGQGLLFSS 701 Query: 1805 EGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMSN 1984 L A+CDSDW GC TRRS TG+ L LG + ISW++KKQ VVS+SSAE+EYRAM+N Sbjct: 702 TNDLTLKAFCDSDWGGCHATRRSVTGFCLFLGNSLISWKSKKQVVVSRSSAESEYRAMAN 761 Query: 1985 AVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFVRERVD 2164 E+ W+R++L +L ++ PT 
LFCDNQAA HIA NPVFHERTKH+E+DC+ VRE++ Sbjct: 762 TCLELTWLRFILQDLKVSQNTPTPLFCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQ 821 Query: 2165 SMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT 2296 + I P + T+ Q+ADV TKALG + L KLG+ ++H+PT Sbjct: 822 AGIINPSYVPTRFQLADVFTKALGKDQFVTLRSKLGLHDIHSPT 865 >gb|PNX96222.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 1369 Score = 700 bits (1806), Expect = 0.0 Identities = 370/765 (48%), Positives = 481/765 (62%), Gaps = 6/765 (0%) Frame = +2 Query: 11 TPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYDLQ 190 TPYE+L P YDH+KVFGCL Y + + DKF R VFLGYP G KG+KVY+L+ Sbjct: 634 TPYEMLFGKSPNYDHIKVFGCLCYVATTSKQRDKFGPRADRCVFLGYPQGQKGWKVYNLK 693 Query: 191 HRKMVTSRDVKFLENVFPFARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIHEPS 370 R+ + SRDV F ENVFPF K DE E D P Sbjct: 694 TREFIVSRDVVFYENVFPF---------------KIDEREAVTD-------------PPR 725 Query: 371 SVMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHET--VPSNQNDMGSETVSE 544 ++ T + + N G E ++ A ++I E+ V Q + E ++E Sbjct: 726 NLFHNTPPR-IENEISSEGEMAEENDSAAQ------VNIMEESCEVHVPQTEDNEEILNE 778 Query: 545 ENRPTNAHTR---PVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSS-TVHPLAHFISYD 712 +N P R R PA L F + ++S + SS VH + +F+ D Sbjct: 779 KNHHQQTEAMIIMPPRDRKPPAYLQDFHCYAAGEVPPSKSLIFSPSSGKVHSITNFMRND 838 Query: 713 NFTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEELPKGKRAI 892 F+ H+AFL I+ + EP + QAVK V W AM +E++ALEEN TW L+ PKGK+ + Sbjct: 839 CFSQRHQAFLAEISKHEEPTTYSQAVKHVEWRNAMNQELKALEENETWELDFPPKGKKVV 898 Query: 893 DSKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXXXXXXXXG 1072 KWVYKIKYK GE+E+YKARLVAKG+TQ+EG DF+ETFAP Sbjct: 899 GCKWVYKIKYKATGEIEKYKARLVAKGYTQVEGEDFNETFAPVAKMTTVRCMLSVAVAKD 958 Query: 1073 WHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTG 1252 W HQ+DV+NAFLHG+L+E+VYMK P+G+ VC+LKKSLYGL+QASRNWY K + Sbjct: 959 WELHQMDVSNAFLHGELDEEVYMKAPEGYALPKIGMVCRLKKSLYGLRQASRNWYSKLSN 1018 Query: 1253 SLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKD 1432 +L E GF ++ ADHSLF + TF+A LIYVDD+V+ GN++ K 
+L F +KD Sbjct: 1019 ALLEYGFIESHADHSLFTYSHQSTFLAVLIYVDDLVIAGNNTAACTKFKKYLSGCFHMKD 1078 Query: 1433 LGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPR 1612 LGPLKYFLG+E+A+ K G+ + QRKYTLDIL + GM GC+PSSFPMEQN +L EP Sbjct: 1079 LGPLKYFLGLELARGKSGLFICQRKYTLDILNECGMLGCKPSSFPMEQNHRLALASGEPY 1138 Query: 1613 VDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQGI 1792 + ++YRRL+GRL+YL TRP+I YAV+ LSQF+ P+ +H +AA VL YLK +PGQGI Sbjct: 1139 AEPSRYRRLVGRLIYLTITRPEITYAVHTLSQFMQCPQQAHWDAAMHVLRYLKSSPGQGI 1198 Query: 1793 LLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYR 1972 +LP+E L+AY DSDW CP+TRRS +GYLL LG APISW+TKKQS VS+SS+EAEYR Sbjct: 1199 VLPRENELKLVAYSDSDWASCPLTRRSISGYLLKLGAAPISWKTKKQSTVSRSSSEAEYR 1258 Query: 1973 AMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFVR 2152 AM++A SEILW+R LL+ L + PT L+CDNQAA H+A NPV+HERTKH+E+DC+F+R Sbjct: 1259 AMAHATSEILWLRRLLTCLQVDCNSPTTLYCDNQAAMHLAANPVYHERTKHIEVDCHFIR 1318 Query: 2153 ERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLH 2287 E + I + TK Q AD+ TK+LG+ L KLGV N H Sbjct: 1319 EHIQEGTIVTDYVPTKQQQADIFTKSLGSTQFQSLSVKLGVHNPH 1363 Score = 161 bits (407), Expect = 4e-36 Identities = 86/258 (33%), Positives = 143/258 (55%), Gaps = 5/258 (1%) Frame = +1 Query: 2722 MMNFLFAKNKVGFINGSIKKPEEESSNYMPWMRCDAMIKGWLHTAMEKEIRTSVKYAMTA 2901 M L AK K+GFI+G+IKKP +S++Y W R D+M+ W+ + + + S+ + TA Sbjct: 1 MRTALRAKVKLGFIDGTIKKPGAQSADYFNWERADSMVTAWIINSTDPALHGSISHGSTA 60 Query: 2902 REIWIDLKERFGKVSAPRAYELKRSL-TSTKQEGTSVSAYYTKLRGIWDEIQSVIPMPRC 3078 R++W+DL+ERF + + PR ++L R L K++ SV+ +YTK +GI+DE+ + P+P C Sbjct: 61 RDVWLDLEERFAQTNQPRIHQLWRMLCLMQKEDDLSVTEFYTKFKGIYDELNELQPLPEC 120 Query: 3079 DCSSCKCDIGKKLQELRDKERLYEFLLGLD-AEFGTIRTQILAMNPIPSLGKAYHLVAED 3255 C + K+L + + ++++ FL LD +F ++ IL P+PSL K ++ V + Sbjct: 121 SCGA-----SKELMKREEDQKVHLFLGSLDNQQFAHVKATILNTEPLPSLRKTFNTVLRE 175 Query: 3256 EQQRAISG---SKRPSSDSVAFQAHVPVKRDQNQSQNRTKQKDVKRGNSEPVEQCTVCGK 3426 E + S +P + + + S +R K +D + E+C CGK Sbjct: 176 
EARYTAERERISNKPDAGAAFY-----------SSASRQKWRDRSK------EKCDHCGK 218 Query: 3427 DGHKSEGCFKLIGYPEWW 3480 GH GCF++IGYP W Sbjct: 219 TGHLKSGCFEIIGYPPNW 236 >gb|PNX97998.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 964 Score = 685 bits (1767), Expect = 0.0 Identities = 364/775 (46%), Positives = 474/775 (61%), Gaps = 13/775 (1%) Frame = +2 Query: 8 KTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYDL 187 K+P++VL S P Y ++VFGCL + +++ + KF+ R +PG+F+GYP KGY++YD+ Sbjct: 236 KSPHQVLLGSPPSYSSLRVFGCLCFAKNMNIQ-HKFDERAKPGIFVGYPFNQKGYRIYDM 294 Query: 188 QHRKMVTSRDVKFLENVFPF--ARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIH 361 RK+ SRDV+F E VFP+ + P+ I + Q D E D T SN Sbjct: 295 HTRKIYVSRDVQFFETVFPYHDLQTPSFASDISINTQFLDYE-------VDDTPSN---- 343 Query: 362 EPSSVMAETETQDVHNGADFFGPSQEPSEPATS-GPNEPLLDITHETVPSNQNDMGSETV 538 PA+S P D T T+P+ D SE Sbjct: 344 ---------------------------LSPASSIPPGISHHDNTIVTIPNPSVDNPSEIP 376 Query: 539 SEENRPTNAHT----------RPVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSSTVHP 688 + P H+ P+R RT RL DH + S + P Sbjct: 377 AIPVEPPQQHSPTAINHPERRYPLRHRTPSVRL----------TDHVCDINNVTSQSAFP 426 Query: 689 LAHFISYDNFTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEE 868 L ++ S N + +H+A L I N EP + QA+K W EAM +EI ALE N TW+L Sbjct: 427 LKNYFSLSNLSTSHRALLVNIIENKEPTSYSQAIKSAEWREAMAKEIHALESNNTWVLSP 486 Query: 869 LPKGKRAIDSKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXX 1048 LP GK AI KWVYKIKY +G VERYKARLVAKG+ Q+ G+D+HETFAP Sbjct: 487 LPNGKTAIGCKWVYKIKYHSDGTVERYKARLVAKGYNQVHGIDYHETFAPVAKLVTVRLL 546 Query: 1049 XXXXXXXGWHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASR 1228 W HQLDVNNAFL GDLNE+VYMK+P GF + VCKL KS+YGLKQASR Sbjct: 547 LSIAAIKNWSLHQLDVNNAFLQGDLNEEVYMKLPPGFSHKGQPCVCKLNKSIYGLKQASR 606 Query: 1229 NWYHKFTGSLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFL 1408 W+ KF+ +L + GF Q+ +D+SLF + + T + L+YVDD+++ GN+ + I D K FL Sbjct: 607 QWFSKFSTTLIQKGFHQSISDYSLFTFKSNHTTIFVLVYVDDIIITGNNDDAISDIKKFL 666 Query: 1409 
DKRFSIKDLGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKL 1588 + FSIKDLG L YFLGIEV+++K+G+ L QRKYTLDIL DAG+TGCRPS FPMEQ+L+L Sbjct: 667 AQAFSIKDLGNLSYFLGIEVSRSKKGIFLCQRKYTLDILSDAGLTGCRPSEFPMEQHLRL 726 Query: 1589 DTCDKEPRVDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYL 1768 D P D YRRLIGRLLYL TRPDI YAVN LSQF+ P +H++AATRVL YL Sbjct: 727 RPNDGSPLPDPTVYRRLIGRLLYLTVTRPDIQYAVNTLSQFMQSPCTTHLDAATRVLRYL 786 Query: 1769 KGTPGQGILLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSK 1948 KG+ G+G+ L L+ Y DSDW GCP TRRS TGY +LG PISW+TKKQ +S+ Sbjct: 787 KGSVGKGLFLSASSSLQLIGYADSDWAGCPTTRRSTTGYFTMLGSNPISWKTKKQPTISR 846 Query: 1949 SSAEAEYRAMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHV 2128 SSAEAEYR+++ SE+ W+++LLS+LD+A P + CD+QAA HIA NPVFHERTKH+ Sbjct: 847 SSAEAEYRSLATLASELQWLKFLLSDLDIAHPLPITVHCDSQAAIHIAENPVFHERTKHI 906 Query: 2129 EMDCYFVRERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAP 2293 E+DC+FVRE++ S + P + + DQ+AD+ TK LG ++ LL KLGV + P Sbjct: 907 EIDCHFVREKIKSGLLRPSYLRSFDQLADIFTKPLGGDAYKRLLGKLGVLEISIP 961 Score = 93.2 bits (230), Expect = 3e-15 Identities = 61/203 (30%), Positives = 97/203 (47%) Frame = +1 Query: 3040 WDEIQSVIPMPRCDCSSCKCDIGKKLQELRDKERLYEFLLGLDAEFGTIRTQILAMNPIP 3219 WDE+ S+ P+ C C + K I ++ Q+ R EFL G+ F +R+QIL M+P P Sbjct: 1 WDELHSIAPINPCICGNAKSIIDQQNQD-----RAMEFLQGVHDRFSAVRSQILLMDPFP 55 Query: 3220 SLGKAYHLVAEDEQQRAISGSKRPSSDSVAFQAHVPVKRDQNQSQNRTKQKDVKRGNSEP 3399 S+ + Y++V ++E+Q+ I+ P+ +S A QA ++ Q+RT+ K P Sbjct: 56 SIQRIYNIVRQEEKQQEINFRPLPAEESAALQA--------SKVQHRTQGK-------RP 100 Query: 3400 VEQCTVCGKDGHKSEGCFKLIGYPEWWPGKGKQYKPKPSAALVEGEKSPIAGLSDSQYRQ 3579 C C K GH C+++ G+P P K + +P LS +QY++ Sbjct: 101 RPYCENCNKYGHTVATCYQIHGFPNRPPKKSE-----------SSSSTPAQQLSSAQYQK 149 Query: 3580 FLKFFGDKDGAKTEDSGPKANLA 3648 L AK + G NLA Sbjct: 150 LLSLL-----AKEDTMGSSVNLA 167 >gb|PRQ55089.1| putative RNA-directed DNA polymerase [Rosa chinensis] Length = 1285 Score = 693 bits (1789), Expect = 0.0 Identities = 362/781 (46%), Positives = 487/781 (62%), Gaps = 14/781 (1%) 
Frame = +2 Query: 8 KTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYDL 187 KTP+E L + +P Y H++VFGC + + T+ KF+ R VFLGYP G KGYKVY+L Sbjct: 517 KTPFEKLFHKEPSYSHLRVFGCQCFVSTHPTRPSKFDPRSMECVFLGYPHGQKGYKVYNL 576 Query: 188 QHRKMVTSRDVKFLENVFPFARN----PTEEEKIFVLPQKWDEEENTRDIRADQTKSNDN 355 +K + SRDV F EN FPF +N P++ +F + +N Sbjct: 577 TTKKSLVSRDVIFFENAFPFPKNSESFPSQNTDLFPSIPRLAHYDNP------------- 623 Query: 356 IHEPSSVMAETETQDVHNGADFFGPS-QEPSEPATSGPNEPLLD---ITHETVPSNQNDM 523 S+ + H+ P+ Q +E + P++ L ++ + N + + Sbjct: 624 -----SIPKIPPSSPTHHSPPMISPNPQSSAEQYLNSPSKSLSSTDPVSSDITLPNLDTI 678 Query: 524 GSETVSEENRPTNAHTRP---VRSRTRPARLDGFEVN--LPPSLDHTQSSLHHDS-STVH 685 S+ + + P RP R+ P L F ++ LP L + SS + T H Sbjct: 679 SSDHIPSLSPPEQTPPRPRKSTRATKLPTALQDFHIDAALPTRLAPSSSSNEVTTPGTAH 738 Query: 686 PLAHFISYDNFTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILE 865 L+H +SY N ++ H+ F IT EP F QAVKD +W EAM+ E+QAL++N TW L Sbjct: 739 SLSHVLSYANLSSPHRTFTANITLQREPTSFSQAVKDPKWREAMRLEVQALQDNKTWSLV 798 Query: 866 ELPKGKRAIDSKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXX 1045 P KR I KWVYKIKY P+G +ERYKARLVAKG++Q+EG+D+ ETFAP Sbjct: 799 PPPAHKRPIGCKWVYKIKYNPDGTIERYKARLVAKGYSQVEGLDYRETFAPVAKLTTVRV 858 Query: 1046 XXXXXXXXGWHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQAS 1225 WH HQLDVNNAFL+GDL+EDVYM +P GF ++ +++VCKL KSLYGL+QAS Sbjct: 859 LLSLAAQQNWHLHQLDVNNAFLNGDLHEDVYMHLPPGFERKGEHKVCKLHKSLYGLRQAS 918 Query: 1226 RNWYHKFTGSLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDF 1405 + W+ K + +L GFKQ+ +D+S+F+ TF A L+YVDDV+L GN+ + I TK F Sbjct: 919 KQWFLKLSSALKSAGFKQSWSDYSMFVRSHQGTFTALLVYVDDVILAGNNLDDIIRTKSF 978 Query: 1406 LDKRFSIKDLGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLK 1585 L F +KD+G LKYFLG+EVA++K G+ LSQRKY L+ILED G G +PS FP+EQN+ Sbjct: 979 LSSHFKLKDMGQLKYFLGLEVARSKHGIALSQRKYALEILEDTGFLGAKPSRFPLEQNII 1038 Query: 1586 LDTCDKEPRVDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCY 1765 L D DA+QYRRL+GRL+Y TRPD+ YAV+ILSQF+ PR H++AA +VL Y Sbjct: 1039 
LTQEDGRLLEDASQYRRLVGRLIYQTITRPDLVYAVHILSQFMDKPRQPHLDAAHKVLRY 1098 Query: 1766 LKGTPGQGILLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVS 1945 LK TPGQGI LP +G L AYCD+DW C TRRS TGY + LG APISW+TKKQ VS Sbjct: 1099 LKQTPGQGIFLPSKGPLELSAYCDADWARCKDTRRSTTGYCIFLGHAPISWKTKKQRTVS 1158 Query: 1946 KSSAEAEYRAMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKH 2125 +SSAEAEYR+M+ EI W++++L +L++ + P +LFCDN+AA HIA+NPVFHERTKH Sbjct: 1159 RSSAEAEYRSMATTCCEITWLQYILKDLNIQHLQPVKLFCDNKAAIHIASNPVFHERTKH 1218 Query: 2126 VEMDCYFVRERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT*GG 2305 +E+DC+ VRE+V I I TK+Q AD+ TK L + LL KLGV N+H+ G Sbjct: 1219 IEIDCHVVREKVQRGLIQTEHIRTKEQPADIFTKPLSSEQFSLLLGKLGVINIHSNLRGS 1278 Query: 2306 V 2308 + Sbjct: 1279 I 1279 >ref|XP_022004406.1| uncharacterized protein LOC110901966 [Helianthus annuus] Length = 415 Score = 661 bits (1706), Expect = 0.0 Identities = 320/409 (78%), Positives = 357/409 (87%) Frame = +2 Query: 1070 GWHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFT 1249 GWH HQLDVNNAFL+GDL+EDVYMKIP+GF KQD N VCKLKKSLYGLKQASRNWY KFT Sbjct: 7 GWHVHQLDVNNAFLYGDLHEDVYMKIPEGFRKQDTNMVCKLKKSLYGLKQASRNWYQKFT 66 Query: 1250 GSLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIK 1429 SL +IGFKQT A+HSLFI R+ FVAALIYVDDV++VGN NKIQ+ K FLDK+FSIK Sbjct: 67 NSLLDIGFKQTGANHSLFIFREKDIFVAALIYVDDVIIVGNALNKIQEIKLFLDKKFSIK 126 Query: 1430 DLGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEP 1609 DLGPLK+FLGIEVA+T EGMVLSQRKYTLDILE+ GM GCRPS FPMEQNLKL C++EP Sbjct: 127 DLGPLKFFLGIEVARTNEGMVLSQRKYTLDILEETGMMGCRPSPFPMEQNLKLGKCEEEP 186 Query: 1610 RVDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQG 1789 ++D+NQYRRL+G+LLYLQATR DIAYAVN+LSQFVGDPR SHMEAA RVL YLK TPGQG Sbjct: 187 KIDSNQYRRLVGKLLYLQATRLDIAYAVNVLSQFVGDPRKSHMEAANRVLRYLKSTPGQG 246 Query: 1790 ILLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEY 1969 IL+PKEGGT L AYCDSDWLGCP+TRRSR+GY+LL+GGAP+SW++KKQSVVS+SSA+AEY Sbjct: 247 ILIPKEGGTRLTAYCDSDWLGCPITRRSRSGYVLLIGGAPVSWKSKKQSVVSRSSAKAEY 306 
Query: 1970 RAMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFV 2149 R M N VSEILWMRWLLS + P GPT LFCDNQAARHIANNPVFHERTKHVEMDC+FV Sbjct: 307 REMVNTVSEILWMRWLLSLRGVPPTGPTPLFCDNQAARHIANNPVFHERTKHVEMDCHFV 366 Query: 2150 RERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT 2296 RERV+S EI PM I TK IAD+LTK G + LL KLGVR+LHAPT Sbjct: 367 RERVESKEIQPMKIDTKAHIADILTKPSGTHQFKVLLDKLGVRDLHAPT 415 >ref|XP_021979664.1| uncharacterized protein LOC110875770 [Helianthus annuus] Length = 376 Score = 656 bits (1693), Expect = 0.0 Identities = 326/375 (86%), Positives = 333/375 (88%) Frame = +2 Query: 983 MEGVDFHETFAPXXXXXXXXXXXXXXXXXGWHTHQLDVNNAFLHGDLNEDVYMKIPQGFG 1162 MEGVDFHETFAP WH HQLDVNNAFLHGDL+EDVYMKIPQGFG Sbjct: 1 MEGVDFHETFAPVAKLVTVRTLLIVAVKHDWHIHQLDVNNAFLHGDLHEDVYMKIPQGFG 60 Query: 1163 KQDDNRVCKLKKSLYGLKQASRNWYHKFTGSLFEIGFKQTPADHSLFIHRKDKTFVAALI 1342 KQDDNRVCKLKKSLYGLKQASRNWY KFT SL EIGFKQTPADHSLFI +++K FVAALI Sbjct: 61 KQDDNRVCKLKKSLYGLKQASRNWYQKFTHSLLEIGFKQTPADHSLFIFKENKIFVAALI 120 Query: 1343 YVDDVVLVGNDSNKIQDTKDFLDKRFSIKDLGPLKYFLGIEVAKTKEGMVLSQRKYTLDI 1522 YVDDVVLVGNDS KI TKDFLDKRFSIKDLGPLKYFLGIEVAKT EGMVLSQRKYTLDI Sbjct: 121 YVDDVVLVGNDSRKIHATKDFLDKRFSIKDLGPLKYFLGIEVAKTNEGMVLSQRKYTLDI 180 Query: 1523 LEDAGMTGCRPSSFPMEQNLKLDTCDKEPRVDANQYRRLIGRLLYLQATRPDIAYAVNIL 1702 LED GMT CRPSSFPMEQNLKLD CDKE RVDANQYRRLIGRLLYLQATRPDIAYAVNIL Sbjct: 181 LEDVGMTVCRPSSFPMEQNLKLDMCDKETRVDANQYRRLIGRLLYLQATRPDIAYAVNIL 240 Query: 1703 SQFVGDPRHSHMEAATRVLCYLKGTPGQGILLPKEGGTNLLAYCDSDWLGCPMTRRSRTG 1882 QF PR +HMEAATRVL YLKGTPGQGIL+PKEGG NLLAYCDS+WLGCPMTRRSRTG Sbjct: 241 RQFANGPRQTHMEAATRVLRYLKGTPGQGILIPKEGGANLLAYCDSEWLGCPMTRRSRTG 300 Query: 1883 YLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMSNAVSEILWMRWLLSELDMAPVGPTQLF 2062 YLLLLGGAPISWRTKKQSVV KSSAEAEYRAMSNAVSEILWMRWLL ELDMAPVG TQLF Sbjct: 301 YLLLLGGAPISWRTKKQSVVFKSSAEAEYRAMSNAVSEILWMRWLLRELDMAPVGLTQLF 360 Query: 2063 CDNQAARHIANNPVF 2107 CDNQAARHIANNPVF Sbjct: 361 CDNQAARHIANNPVF 375 >emb|CAN71595.1| hypothetical protein VITISV_010143 [Vitis vinifera] 
Length = 1523 Score = 695 bits (1793), Expect = 0.0 Identities = 371/796 (46%), Positives = 482/796 (60%), Gaps = 18/796 (2%) Frame = +2 Query: 8 KTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYDL 187 KTP+E L + P Y H++VFGC + + + KF+ R VF+GYP G KGYKVY L Sbjct: 733 KTPFEKLFHKSPNYSHLRVFGCRCFVSTHPLRPSKFDPRSIESVFIGYPHGQKGYKVYSL 792 Query: 188 QHRKMVTSRDVKFLENVFPFAR-----NPTEEEKIFVLPQKWDEEENTRDIRADQTKSND 352 + +K + SRDV F E FP+ +P+ + LPQ D +++ S Sbjct: 793 KDKKXLISRDVTFFETEFPYQNXLSTTSPSLDTFFPSLPQTPDIDDD----HISFNHSGS 848 Query: 353 NIHEPSSVMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHETVPSNQNDMGSE 532 N+ ++ + Q + + PS P + + P++ + P + Sbjct: 849 NLQPSATSSVDXHPQPTLDNSHSSSHVDPPSSPPSLNTSPPVISQPSPSQPRRSS----- 903 Query: 533 TVSEENRPTNAHTRPVRSRTRPARLDGFEVN-------LPPSLDHTQSSLHHDSSTVHPL 691 RPT P L F + +PPS + S + H S T+H L Sbjct: 904 ------RPTKT----------PTTLQDFHIEAALPSRPVPPS---STSEVAH-SGTIHSL 943 Query: 692 AHFISYDNFTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEEL 871 + +SYD + HKAF IT EP+ F QAV D RW EAM EIQAL+ N TW L L Sbjct: 944 SQVLSYDRLSPMHKAFTVKITLAKEPRSFSQAVLDSRWREAMNTEIQALQANKTWSLVPL 1003 Query: 872 PKGKRAIDSKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXX 1051 P K+ I KWVYKIKY P+G +ERYKARLVAKGF+Q+EG+D+ ETFAP Sbjct: 1004 PSHKKPIGCKWVYKIKYNPDGTIERYKARLVAKGFSQVEGIDYRETFAPVAKLTTVRVLL 1063 Query: 1052 XXXXXXGWHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRN 1231 GWH HQLDVNNAFL+GDL EDVYM++P GFG++ ++RVCKL KSLYGLKQASR Sbjct: 1064 SLASIQGWHLHQLDVNNAFLNGDLYEDVYMQLPPGFGRKGEHRVCKLHKSLYGLKQASRQ 1123 Query: 1232 WYHKFTGSLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLD 1411 W+ K + +L GFKQ+ +D+SLF F L+YVDDV+L GN I +TK FL Sbjct: 1124 WFLKLSSALKAAGFKQSWSDYSLFXRNTQGRFTTLLVYVDDVILAGNSLEDIIETKQFLA 1183 Query: 1412 KRFSIKDLGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLD 1591 F +KD+G L+YFLGIEVA++K+G+VL QRKY L++LEDAG G +PS FP+EQ+L L Sbjct: 1184 SHFKLKDMGQLRYFLGIEVARSKQGIVLCQRKYALELLEDAGFLGAKPSRFPVEQSLTLT 1243 Query: 1592 
TCDKEPRVDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLK 1771 D DA+QYRRL+GRL+YL TRPD+ YAV+ILSQF+ PR H++AA +VL Y+K Sbjct: 1244 RGDGAELKDASQYRRLVGRLIYLTITRPDLVYAVHILSQFMDTPRQPHLDAAYKVLRYVK 1303 Query: 1772 GTPGQGILLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKS 1951 TPGQGI LP G L AYCD+DW C TRRS TGY + G APISW+TKKQ VS+S Sbjct: 1304 QTPGQGIFLPSTGQLELTAYCDADWARCKDTRRSTTGYCIFFGNAPISWKTKKQGTVSRS 1363 Query: 1952 SAEAEYRAMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVE 2131 SAEAEYR+M+ EI W+R LL++L++ +LFCDNQAA HIA+NPVFHERTKH+E Sbjct: 1364 SAEAEYRSMATTCCEITWLRSLLADLNVNHAHAVKLFCDNQAAIHIASNPVFHERTKHIE 1423 Query: 2132 MDCYFVRERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT*GGVL 2311 MDC+ VRE+V + M I T++Q AD+ TK L + LL KLGV N+H G + Sbjct: 1424 MDCHVVREKVQRGLVKTMHIRTQEQPADLFTKPLSSKQFSTLLSKLGVINIHTNLRGSEV 1483 Query: 2312 HMER------IWVFHY 2341 +ER IW+ Y Sbjct: 1484 DVERGSNDSGIWLKSY 1499 Score = 165 bits (418), Expect = 2e-37 Identities = 96/355 (27%), Positives = 176/355 (49%), Gaps = 6/355 (1%) Frame = +1 Query: 2581 KDQSTGKTKEGGIDYESPYYLHPSDYPRQMHVNDVLSDNNYADWSQEMMNFLFAKNKVGF 2760 K + +T E ++ P +LH SD P + V+ L ++NY W Q M L KNK GF Sbjct: 14 KHSNPSRTTEPWENFNHPLFLHHSDQPGAVLVSQPLMEDNYTTWVQSMDMALTIKNKKGF 73 Query: 2761 INGSIKKPEEESSNYMPWMRCDAMIKGWLHTAMEKEIRTSVKYAMTAREIWIDLKERFGK 2940 ++G++ +P + W RC+ ++K WL A+ KEI SV + A+ +W++L+ERF Sbjct: 74 VDGTLNRPTHNPNEQQQWDRCNILVKTWLLGAISKEISNSVIHCKDAKTMWLELQERFSH 133 Query: 2941 VSAPRAYELKRSLTSTKQEGTSVSAYYTKLRGIWDEIQSVIPMPRCDCSSCKCDIGKKLQ 3120 + + + ++ ++ Q +V++++TKL+G+WDE ++ P C C++ +++ Sbjct: 134 TNTVQLFNIENAIHECAQGTGTVTSFFTKLKGLWDEKDALCGFPPCTCAT-----AAEVK 188 Query: 3121 ELRDKERLYEFLLGLDAEFGTIRTQILAMNPIPSLGKAYHLVAEDEQQRAISGSKRPSSD 3300 + ++ +FL+GL + T+R+ I+ M+P+P++ KAY + E+Q S K + Sbjct: 189 TYMETQKTMKFLMGLGDNYATVRSNIIGMDPLPTVNKAYAMALRHEKQAEASNGKVAVPN 248 Query: 3301 SVAFQAHVPVKRDQNQSQNRTKQKDVKRGNSEPVE-----QCTVCGKDGHKSEGCFKLIG 3465 + + + +D N ++ K + N +CT CG GH + C + Sbjct: 249 
EASAFSVRKLDQDPNTTEREVKCEKCNMTNHSTKNCRAHLKCTYCGGKGHTYDYCRRRKN 308 Query: 3466 YPEWWPGKGKQYKPKPSAALVEGEKSPI-AGLSDSQYRQFLKFFGDKDGAKTEDS 3627 G+G+ K +A L EG++ LS S+ +Q + A T S Sbjct: 309 --TMGGGQGRS-KVNHAATLNEGKEDVTNFPLSQSECQQMMGLLSKIKTAATSHS 360 >ref|XP_021975259.1| uncharacterized protein LOC110870383 [Helianthus annuus] Length = 438 Score = 657 bits (1694), Expect = 0.0 Identities = 320/438 (73%), Positives = 360/438 (82%) Frame = +2 Query: 983 MEGVDFHETFAPXXXXXXXXXXXXXXXXXGWHTHQLDVNNAFLHGDLNEDVYMKIPQGFG 1162 MEGVDFH+TFAP GW QLDVNNAFLHGDL+EDVYMK+PQGF Sbjct: 1 MEGVDFHDTFAPVAKLVTVRSLLAIATKRGWAIQQLDVNNAFLHGDLHEDVYMKMPQGFN 60 Query: 1163 KQDDNRVCKLKKSLYGLKQASRNWYHKFTGSLFEIGFKQTPADHSLFIHRKDKTFVAALI 1342 K + +VCKLKKSLYGLKQASRNWY KFT + +I FKQ+ DHSLFI++K +VA LI Sbjct: 61 KGEGTKVCKLKKSLYGLKQASRNWYQKFTSAPHKIEFKQSKVDHSLFIYKKGDAYVATLI 120 Query: 1343 YVDDVVLVGNDSNKIQDTKDFLDKRFSIKDLGPLKYFLGIEVAKTKEGMVLSQRKYTLDI 1522 YVDDV++VGND NKIQ TKD+LDK FSIKDLGPLKYFLGIEVA+TK+G+VLSQRKYTLDI Sbjct: 121 YVDDVIIVGNDLNKIQQTKDYLDKEFSIKDLGPLKYFLGIEVARTKDGLVLSQRKYTLDI 180 Query: 1523 LEDAGMTGCRPSSFPMEQNLKLDTCDKEPRVDANQYRRLIGRLLYLQATRPDIAYAVNIL 1702 LED+GM GCRPS FPMEQ+LKL ++E RVDA QYRRL+GRLLYLQATRPDI Y+VN+L Sbjct: 181 LEDSGMQGCRPSMFPMEQHLKLTKDEEEHRVDARQYRRLVGRLLYLQATRPDITYSVNVL 240 Query: 1703 SQFVGDPRHSHMEAATRVLCYLKGTPGQGILLPKEGGTNLLAYCDSDWLGCPMTRRSRTG 1882 SQFV DPR SHM+A TRVL YLK TPGQGILLPKEGG NL AY D DWLGC +TRRSRTG Sbjct: 241 SQFVSDPRRSHMDAVTRVLRYLKATPGQGILLPKEGGVNLTAYSDLDWLGCQLTRRSRTG 300 Query: 1883 YLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMSNAVSEILWMRWLLSELDMAPVGPTQLF 2062 YLLLLGGA +SW++KKQSVVS+SS EAEYRAM++ VSE+LWMRWLL+EL P PT LF Sbjct: 301 YLLLLGGALVSWKSKKQSVVSRSSTEAEYRAMASTVSEVLWMRWLLTELGAPPDAPTPLF 360 Query: 2063 CDNQAARHIANNPVFHERTKHVEMDCYFVRERVDSMEICPMPIATKDQIADVLTKALGAN 2242 CDNQAARHIANNPVFHERTKHVEMDCYFVRERV+S E+ P PI TK Q+ADVLTKALG Sbjct: 361 CDNQAARHIANNPVFHERTKHVEMDCYFVRERVESQEVRPTPIDTKMQVADVLTKALGTQ 420 Query: 2243 SLHFLLCKLGVRNLHAPT 2296 L+ KLG+ +LHAPT Sbjct: 421 QFRTLINKLGICDLHAPT 438 
>gb|KYP64168.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 967 Score = 676 bits (1743), Expect = 0.0 Identities = 354/762 (46%), Positives = 477/762 (62%), Gaps = 1/762 (0%) Frame = +2 Query: 11 TPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYDLQ 190 TPY++L+ P YDH+++FGCL Y ++ + DKF R +F+GYP KG+KVY+L+ Sbjct: 235 TPYKMLYEKPPSYDHLRIFGCLCYVKNSSKRQDKFMPRSEKCMFIGYPQNKKGWKVYNLE 294 Query: 191 HRKMVTSRDVKFLENVFPFARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIHEPS 370 + SRDV F + F + LP + E T R + S + +HE + Sbjct: 295 THEFFISRDVIFYKEYFSYK-----------LPARIVFENYTSQERDEY--SYNMMHEDN 341 Query: 371 SVMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHETVPSNQNDMGSETVSE-E 547 +++ SQ SE S E + +H + ND E SE E Sbjct: 342 DHVSKDA-------------SQVSSEGEHSENEEEIESNSHNIEVKSSNDEDLENQSEGE 388 Query: 548 NRPTNAHTRPVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSSTVHPLAHFISYDNFTNT 727 NR TR + + VN PS T + + S V+PL++F+SYD+F++ Sbjct: 389 NRAEEKRTRQPLTYLKDYYCHAITVN--PSCMTTNPN--NSSGMVYPLSNFVSYDHFSHR 444 Query: 728 HKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEELPKGKRAIDSKWV 907 H+A+L A+ +++EPK + QA+K W AM +EI+ALE+N W L LP G+ + KWV Sbjct: 445 HRAYLAALGSHDEPKTYAQAMKHPEWRTAMTQEIKALEDNQRWELTHLPPGRETVGCKWV 504 Query: 908 YKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXXXXXXXXGWHTHQ 1087 YKIKYK GE+E+YKARLVAKGFTQ+EG DF+ETFAP W HQ Sbjct: 505 YKIKYKATGEIEKYKARLVAKGFTQIEGEDFNETFAPVAKMTTIRCLLSLTVAKEWELHQ 564 Query: 1088 LDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTGSLFEI 1267 +DV+NAFLHG+LNE+VYM +PQG+G VC+L++SLYGL QASRNWY K + SL E Sbjct: 565 MDVSNAFLHGELNEEVYMVVPQGYGVPTKGMVCRLRESLYGLCQASRNWYTKLSHSLEEY 624 Query: 1268 GFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKDLGPLK 1447 GFK+ DHSLF++ + F+A LIYVDD+V+ N+S K++L F +KDLG LK Sbjct: 625 GFKECDVDHSLFVYSHNSIFIAVLIYVDDLVIASNNSLACAQFKEYLSNCFHMKDLGNLK 684 Query: 1448 YFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPRVDANQ 1627 YFLG+E+A+ +G+ + QRKYTLDIL + GM GC+P+SFP+EQN +L P + +Q Sbjct: 685 
YFLGLELARDSKGLFICQRKYTLDILNECGMLGCKPTSFPVEQNHRLALATGSPFPEPSQ 744 Query: 1628 YRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQGILLPKE 1807 YRRLIG L+YL TRP+I Y+V+ILSQF+ P H +AA RVL YLK +PGQGI+LP Sbjct: 745 YRRLIGCLIYLTITRPEITYSVHILSQFMQAPLQEHWDAAMRVLRYLKSSPGQGIILPNT 804 Query: 1808 GGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMSNA 1987 L+ YCDSDW CP+TR+S +GYL+ LG PISW+TKKQS VS+SS+EAEYRA+++A Sbjct: 805 NDLRLVGYCDSDWASCPLTRKSISGYLMKLGPTPISWKTKKQSTVSRSSSEAEYRAIAHA 864 Query: 1988 VSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFVRERVDS 2167 SEI+W+R LL +L + PT L CDNQAA H+A NPVFHERTKH+E+DC+F+R ++ Sbjct: 865 TSEIIWLRSLLKDLQVDCDSPTFLHCDNQAALHLAANPVFHERTKHIEVDCHFIRNHLEK 924 Query: 2168 MEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAP 2293 I I TK+Q AD+ TK+LG L KLGV N H P Sbjct: 925 GTITTSYIPTKEQQADIFTKSLGRKMFLELTVKLGVHNPHTP 966 >gb|PNX95363.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 1200 Score = 679 bits (1751), Expect = 0.0 Identities = 355/768 (46%), Positives = 479/768 (62%), Gaps = 5/768 (0%) Frame = +2 Query: 8 KTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYDL 187 KTPYE+L +KP Y ++VFGCL Y + GDKF R R +F+GYP G KG++VYD+ Sbjct: 453 KTPYELLFGAKPTYTQIRVFGCLCYALNENRGGDKFASRSRKCIFVGYPYGKKGWEVYDV 512 Query: 188 QHRKMVTSRDVKFLENVFPFARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIHEP 367 + + SR+VKF+ENVFPFA N + +I + W + + D+ NI Sbjct: 513 ETEEYFVSRNVKFVENVFPFAANSGDMREIQIDEWGWPHDSSDDH---DEGNEEQNIVST 569 Query: 368 SS-----VMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHETVPSNQNDMGSE 532 S+ V E + D + ++ QE +E T LD QN +E Sbjct: 570 SNLGSTLVPLENDIVDTESHEEWHIMQQE-TERTTDEDQVESLD-------PQQNIRSNE 621 Query: 533 TVSEENRPTNAHTRPVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSSTVHPLAHFISYD 712 + R TR T AR+ S + L S T +P+AHF++ D Sbjct: 622 LLGRGYRVKKPSTRLNDHVTHTARV---------STSTSSPLLSKSSGTRYPIAHFVNCD 672 Query: 713 NFTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEELPKGKRAI 892 F+ H+ FL AIT +EP F +AVKD +W +AMK EI+ALE+NGTW +E+LP GK+AI Sbjct: 673 
KFSMQHRVFLAAITAEHEPNSFAEAVKDEKWRDAMKEEIRALEDNGTWTIEDLPSGKKAI 732 Query: 893 DSKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXXXXXXXXG 1072 KW+YKIKY +G +ER+KARLV G Q+EGVD++ETFAP Sbjct: 733 GCKWIYKIKYNSDGSIERHKARLVIHGNRQVEGVDYNETFAPTAKMVTVRTFLAVAAAKN 792 Query: 1073 WHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTG 1252 + HQ+DV NAFLHGDL+E+VYMK+P GF KQ ++VC+L+KSLYGLKQA R W+ K + Sbjct: 793 FQLHQMDVRNAFLHGDLDEEVYMKLPPGFDKQPPSKVCRLRKSLYGLKQAPRCWFAKLSE 852 Query: 1253 SLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKD 1432 +L GFKQ+ +D+SLF D T + L+YVDD+V+ GN S+ I + KD+L F +KD Sbjct: 853 ALKAYGFKQSYSDYSLFTLHSDDTEMYVLVYVDDIVISGNHSDAINEFKDYLGHCFHMKD 912 Query: 1433 LGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPR 1612 LG LKYFLGIEVA+ G+ LSQRKY LD++ D G+ G +P++ P+EQN +L + Sbjct: 913 LGKLKYFLGIEVARNGTGIFLSQRKYALDLISDCGLLGAKPANIPIEQNHRLALIEGVNL 972 Query: 1613 VDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQGI 1792 D YR L+GRL+YL T P+++Y V+IL+QF+ +P+ H AA RV+ YLKG PGQGI Sbjct: 973 EDPTGYRSLVGRLIYLTITHPELSYCVHILAQFMQNPKLEHWNAALRVVRYLKGNPGQGI 1032 Query: 1793 LLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYR 1972 LL + L YCD+DW GCP+TRRS T Y ++LG +PISW+TKKQ+ VS+SSAEAEYR Sbjct: 1033 LLSVDCDLRLYGYCDADWAGCPLTRRSLTAYFVMLGNSPISWKTKKQTTVSRSSAEAEYR 1092 Query: 1973 AMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFVR 2152 +M+ A E+ W++ LLS L +A P L+CD+Q+A HIANN V HERTKH+E+DC+FVR Sbjct: 1093 SMAAATCELKWLKELLSSLGVAHSDPMHLYCDSQSALHIANNXVLHERTKHIEVDCHFVR 1152 Query: 2153 ERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT 2296 + + I P + T Q+AD+LTKALG LL KLGV +LH PT Sbjct: 1153 DEIAKGNIQPKYVHTSTQLADILTKALGKRQFDALLSKLGVLHLHTPT 1200 Score = 205 bits (522), Expect = 5e-50 Identities = 108/359 (30%), Positives = 185/359 (51%), Gaps = 37/359 (10%) Frame = +1 Query: 2644 HPSDYPRQMHVNDVLSDNNYADWSQEMMNFLFAKNKVGFINGSIKKPEEESSNYMPWMRC 2823 + SD P + L NY +W++ + L A+ K GF++GSIK+P+E++ + W Sbjct: 1 
YSSDNPGNIITQVQLKGENYDEWARAVRGSLRARRKFGFVDGSIKQPDEDAPDIDDWWTV 60 Query: 2824 DAMIKGWLHTAMEKEIRTSVKYAMTAREIWIDLKERFGKVSAPRAYELKRSLTSTKQEGT 3003 ++MI W+ +E ++R+++ Y A+E+W D+K+R + PR +LK L + +Q G Sbjct: 61 NSMIVSWILNTIEPKLRSTITYKENAQELWDDIKQRLSISNGPRIQQLKSELANCRQNGD 120 Query: 3004 SVSAYYTKLRGIWDEIQSVIPMPRCDCSSCKCDIGKKLQELRDKERLYEFLLGLD-AEFG 3180 S+ Y+ +L+ +WDE+ +P C C+ CKC I L + R++E+L++FL+GLD +F Sbjct: 121 SIVNYFGRLKKLWDELNDFDQIPICTCNGCKCGISTTLNKKREEEKLHQFLMGLDEVQFR 180 Query: 3181 TIRTQILAMNPIPSLGKAYHLVAEDEQQRAISGSKRPSSDSVAFQAHVPVKRDQNQSQNR 3360 T+R+ IL+++P+P+L +AY + ++E+ I+ K D V+F VK ++ + + Sbjct: 181 TVRSNILSLDPLPTLNRAYQMAVQEERVGVIARGKEERGDPVSF----AVKAGRSLGREK 236 Query: 3361 TKQKDVKRGNSEPVEQCTVCGKDGHKSEGCFKLIGYPEWW-------------------- 3480 Q+ + C+ C + GH + CF L+GYP+WW Sbjct: 237 KSQEMSSK-------TCSYCKRSGHDVDSCFPLVGYPDWWGDRPRAEGRDPGHGKAVHRP 289 Query: 3481 ---PGKGKQYKPKPSAALV-------------EGEKSPIAGLSDSQYRQFLKFFGDKDG 3609 GKGK K + A V EGE+ + GLS Q+ LK + G Sbjct: 290 MIGSGKGKGINAKVNVAQVVDVAGATNKDTEGEGEQIGLPGLSPGQWNALLKAINTQKG 348