BLASTX nr result

ID: Chrysanthemum22_contig00005937

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00005937
         (3650 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|OTG03795.1| putative ribonuclease H-like domain-containing pr...   962   0.0  
gb|OTG03225.1| putative NB-ARC [Helianthus annuus]                    846   0.0  
gb|OMO87137.1| Integrase, catalytic core [Corchorus capsularis]       803   0.0  
gb|OMO65653.1| hypothetical protein CCACVL1_21443 [Corchorus cap...   758   0.0  
ref|XP_022018997.1| uncharacterized protein LOC110919025 [Helian...   711   0.0  
gb|OTG09093.1| putative reverse transcriptase, RNA-dependent DNA...   724   0.0  
gb|OMO75305.1| Integrase, catalytic core [Corchorus capsularis]       734   0.0  
gb|PNY16822.1| retrovirus-related Pol polyprotein from transposo...   701   0.0  
gb|OTG24510.1| putative reverse transcriptase, RNA-dependent DNA...   699   0.0  
gb|OTG06009.1| putative zinc finger, CCHC-type [Helianthus annuus]    696   0.0  
gb|PNX93928.1| hypothetical protein L195_g017092, partial [Trifo...   685   0.0  
gb|PNX96222.1| retrovirus-related Pol polyprotein from transposo...   700   0.0  
gb|PNX97998.1| retrovirus-related Pol polyprotein from transposo...   685   0.0  
gb|PRQ55089.1| putative RNA-directed DNA polymerase [Rosa chinen...   693   0.0  
ref|XP_022004406.1| uncharacterized protein LOC110901966 [Helian...   661   0.0  
ref|XP_021979664.1| uncharacterized protein LOC110875770 [Helian...   656   0.0  
emb|CAN71595.1| hypothetical protein VITISV_010143 [Vitis vinifera]   695   0.0  
ref|XP_021975259.1| uncharacterized protein LOC110870383 [Helian...   657   0.0  
gb|KYP64168.1| Retrovirus-related Pol polyprotein from transposo...   676   0.0  
gb|PNX95363.1| retrovirus-related Pol polyprotein from transposo...   679   0.0  

>gb|OTG03795.1| putative ribonuclease H-like domain-containing protein [Helianthus
            annuus]
          Length = 1050

 Score =  962 bits (2486), Expect = 0.0
 Identities = 478/785 (60%), Positives = 580/785 (73%), Gaps = 21/785 (2%)
 Frame = +2

Query: 5    NKTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYD 184
            NKTPYE++   KP+YD ++V GCLAYYRS+ET GDKFE RGRPGVFLGYP GTKGYK++D
Sbjct: 278  NKTPYEIVFRKKPDYDRLRVMGCLAYYRSIETNGDKFEFRGRPGVFLGYPQGTKGYKIFD 337

Query: 185  LQHRKMVTSRDVKFLENVFPFAR-----------NPTEEEKIFVLPQKWDEEENTR---- 319
            ++H K+  SRDV F+E VFPF +           N  EE+   +  + ++ ++ T+    
Sbjct: 338  VEHGKIAVSRDVTFVEKVFPFEKLKTNNSNQDLFNVPEEDYEIIFEEPYNSQKATQMDPH 397

Query: 320  --DIRADQTKSNDNIHEPSSVMAETETQDVHNGA----DFFGPSQEPSEPATSGPNEPLL 481
              +I  ++   + ++ + +S   E     + N      DF  PS       T  P   L 
Sbjct: 398  GPNIGTEEAPGSGSMADATSPQREGLPIGLDNTRGQPQDFLSPSANDGLDQTPAPTHSLG 457

Query: 482  DITHETVPSNQNDMGSETVSEENRPTNAHTRPVRSRTRPARLDGFEVNLPPSLDHTQSSL 661
            D  HET          E V  E   T   TR  R+ ++P+ L  F VNLPPS+DHTQ   
Sbjct: 458  DDAHET---------GERVVNEFLST---TRGKRTISQPSYLKEFHVNLPPSVDHTQPVT 505

Query: 662  HHDSSTVHPLAHFISYDNFTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALE 841
               SSTVH LA+++SY+ F+N+HK FLTAITT+NEPK F +A++D  W  AMK+EIQALE
Sbjct: 506  DQSSSTVHSLANYVSYEKFSNSHKVFLTAITTHNEPKSFHEAMQDENWKLAMKKEIQALE 565

Query: 842  ENGTWILEELPKGKRAIDSKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPX 1021
            EN TW LE LP+GKRAIDSKWVYK+KYKPNGE+ER+KARLVAKG+TQMEGVDFH+TFAP 
Sbjct: 566  ENKTWTLEPLPEGKRAIDSKWVYKLKYKPNGEIERHKARLVAKGYTQMEGVDFHDTFAPV 625

Query: 1022 XXXXXXXXXXXXXXXXGWHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKS 1201
                             W  HQLDVNNAFLHGDL+E+VYMKIPQGF K+ + RVC+L+KS
Sbjct: 626  AKLVTVRTLLAVAVKKEWLIHQLDVNNAFLHGDLDEEVYMKIPQGFAKRGETRVCRLRKS 685

Query: 1202 LYGLKQASRNWYHKFTGSLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSN 1381
            LYGLKQASRNWYHKFT SL +IG+KQ+ ADHSLF  +    FVA LIYVDDVV+ GND+ 
Sbjct: 686  LYGLKQASRNWYHKFTSSLVDIGYKQSHADHSLFTFKDGVNFVAILIYVDDVVITGNDAT 745

Query: 1382 KIQDTKDFLDKRFSIKDLGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSS 1561
            KIQ+TK +LD +FSIK+LGPLKYFLGIEVA+T +G+VLSQRKYTLD+LED GM GCRPS 
Sbjct: 746  KIQETKQYLDNKFSIKNLGPLKYFLGIEVARTVDGLVLSQRKYTLDLLEDTGMLGCRPSP 805

Query: 1562 FPMEQNLKLDTCDKEPRVDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHME 1741
            FPMEQ LKLD C + P+VDA QYRRLIGRLLYLQATRPDIAY+VN+LSQFV DPR  H+ 
Sbjct: 806  FPMEQGLKLDNCQESPKVDAQQYRRLIGRLLYLQATRPDIAYSVNLLSQFVSDPREDHLF 865

Query: 1742 AATRVLCYLKGTPGQGILLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWR 1921
            AA R+L YLK +PGQG+ LPK GG +L A+CD+DWLGC +TRRSRTGYLLLLGGAPISW+
Sbjct: 866  AAHRILRYLKSSPGQGVFLPKHGGLHLSAFCDADWLGCQLTRRSRTGYLLLLGGAPISWK 925

Query: 1922 TKKQSVVSKSSAEAEYRAMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNP 2101
            TKKQSVVS+SSAEAEYR+M++ VSE++WMRWLL++L +     T +FCDN A +HIANNP
Sbjct: 926  TKKQSVVSRSSAEAEYRSMASTVSEVIWMRWLLTDLQVVQDQATPIFCDNLAVKHIANNP 985

Query: 2102 VFHERTKHVEMDCYFVRERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRN 2281
            VFHERTKHVEMDCYF+RERV+S +I P+ I TK QIAD+ TK LGA  L  LL KLGVR+
Sbjct: 986  VFHERTKHVEMDCYFIRERVESKDIFPLHIDTKQQIADLFTKPLGAQHLQILLHKLGVRD 1045

Query: 2282 LHAPT 2296
            LHAPT
Sbjct: 1046 LHAPT 1050


>gb|OTG03225.1| putative NB-ARC [Helianthus annuus]
          Length = 1228

 Score =  846 bits (2185), Expect = 0.0
 Identities = 435/753 (57%), Positives = 531/753 (70%), Gaps = 1/753 (0%)
 Frame = +2

Query: 5    NKTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYD 184
            NKTPYE L    P YDHM+VFGCL Y+R+ +TKGDKFE RGR G+FLGYP GTKGYK+YD
Sbjct: 143  NKTPYEALLGIAPTYDHMRVFGCLTYHRNYDTKGDKFEPRGRRGIFLGYPFGTKGYKIYD 202

Query: 185  LQHRKMVTSRDVKFLENVFPFARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIHE 364
            L  +K+    D         + R     E      +K DE E  R               
Sbjct: 203  LDEKKVENIED--------DWLRGEVHSE------EKGDEIEIGR--------------- 233

Query: 365  PSSVMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHETVPSNQNDMGSETVSE 544
             +    E    D H   D      E  +PA                  NQND       +
Sbjct: 234  -AGQHVEIRGVDQHVEHDLGPHDSEVVDPADD----------------NQNDSAQ---LQ 273

Query: 545  ENRPTNAHTRPVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSSTVHPLAHFISYDNFTN 724
               P    TR  R+R +P R   + V LPPS+DH   +    SSTVHPLA+++SY++F  
Sbjct: 274  TPPPQTMTTRVPRTRIQPQRYKDYSVQLPPSIDHANPASEQASSTVHPLAYYLSYNSFGA 333

Query: 725  THKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEELPKGKRAIDSKW 904
             HKAFL+AI + +EPK+F QA +D +W EAM++EI+AL+ENGTW LE+LP GK+AI SKW
Sbjct: 334  NHKAFLSAIDSCHEPKNFVQASQDPKWREAMEQEIKALQENGTWTLEKLPSGKKAIYSKW 393

Query: 905  VYKIKYKPNGEVERYKARLVAKGFTQME-GVDFHETFAPXXXXXXXXXXXXXXXXXGWHT 1081
            VYK+K+KP+G+V+RYKARLVAKG+TQME GVD+H+TFAP                  W  
Sbjct: 394  VYKVKHKPDGQVDRYKARLVAKGYTQMEKGVDYHDTFAPVAKLVTMRTLLALAVKQDWII 453

Query: 1082 HQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTGSLF 1261
            HQLDVNNAFLHGDL+E+VYMK+PQG    +++RVC+L+KS+YGLKQASRNWY+KFT SL 
Sbjct: 454  HQLDVNNAFLHGDLDEEVYMKVPQGLMMNNEDRVCRLRKSMYGLKQASRNWYYKFTQSLV 513

Query: 1262 EIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKDLGP 1441
             +G+KQ+ AD SLFI  +    V+ALIYVDDV++VGN+ +KI+ TK  L ++F+IKDLG 
Sbjct: 514  SMGYKQSVADPSLFIFTEGTVHVSALIYVDDVIIVGNNMDKIKATKTLLHEQFTIKDLGS 573

Query: 1442 LKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPRVDA 1621
            LKYFLGIEVA+TKEG+VLSQRKY LDIL D G+ GCRPSSFPMEQ LK D  ++EP+VDA
Sbjct: 574  LKYFLGIEVARTKEGLVLSQRKYILDILRDMGLEGCRPSSFPMEQTLKPDRAEEEPKVDA 633

Query: 1622 NQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQGILLP 1801
             QYRRLIGRLLYLQATRPDI+++VN+LSQFV DPR  H +AA R++ YLK T GQGILLP
Sbjct: 634  GQYRRLIGRLLYLQATRPDISFSVNLLSQFVADPRQPHYDAAIRIVRYLKTTVGQGILLP 693

Query: 1802 KEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMS 1981
            KEGG+NL+ YCDSDW+GCP +RRSRTGY+LL GGAP+ W++KKQSVVS+SSAEAEYRAM+
Sbjct: 694  KEGGSNLVTYCDSDWMGCPFSRRSRTGYMLLFGGAPVYWKSKKQSVVSRSSAEAEYRAMA 753

Query: 1982 NAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFVRERV 2161
              VSEILW+RWLL+EL       T LFCDN+AARHIANNPVFHERTKHVEMDCYFVRERV
Sbjct: 754  TTVSEILWIRWLLNELGAQQSNSTILFCDNEAARHIANNPVFHERTKHVEMDCYFVRERV 813

Query: 2162 DSMEICPMPIATKDQIADVLTKALGANSLHFLL 2260
            +S EI PM I + +QIAD+LTK LG   L  LL
Sbjct: 814  ESKEILPMHIESANQIADLLTKPLGGPQLKILL 846


>gb|OMO87137.1| Integrase, catalytic core [Corchorus capsularis]
          Length = 1257

 Score =  803 bits (2074), Expect = 0.0
 Identities = 420/775 (54%), Positives = 510/775 (65%), Gaps = 10/775 (1%)
 Frame = +2

Query: 2    DNKTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVY 181
            +NKTP+E+L   KPEYDH++VFGCL Y      +GDKF  RG+P VF+GYP G KGY+VY
Sbjct: 537  NNKTPFEMLFGKKPEYDHLRVFGCLVYAHDNSKRGDKFSERGKPCVFVGYPNGQKGYRVY 596

Query: 182  DLQHRKMVTSRDVKFLENVFPFARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIH 361
            DL+ +K  TSRDV F EN++PF  N               E E T      +   N   +
Sbjct: 597  DLKEKKFYTSRDVTFFENIYPFRPNDCYS----------GETELTAGGEYCRPVLNAADN 646

Query: 362  EPSSVMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHETVPSNQNDMGSETVS 541
            +   VM   + + + N AD F   + P   A         DIT   V ++Q    +E+ +
Sbjct: 647  DCEEVMTLPQVKGLGNAADRFIAEEIPETAAG--------DIT-AGVTASQETTVTESTA 697

Query: 542  EENRPTNA----------HTRPVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSSTVHPL 691
            EE   ++A            R  R RT+P R DGF+V LPPS    Q +L          
Sbjct: 698  EEPVVSSAVPISGQSRVEVRRSARERTQPKRFDGFDVQLPPSTVPAQPAL---------- 747

Query: 692  AHFISYDNFTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEEL 871
                                     P     AVK   W EAM++EIQALEENGTW L  L
Sbjct: 748  -------------------------PSADSSAVKHKHWREAMEKEIQALEENGTWDLVPL 782

Query: 872  PKGKRAIDSKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXX 1051
            P+ KRAIDSKWVYK+K+KPNGE+ERYKARLVAKGFTQ+EGVDFHETFAP           
Sbjct: 783  PQDKRAIDSKWVYKVKFKPNGEIERYKARLVAKGFTQIEGVDFHETFAPVAKLVTVRCLL 842

Query: 1052 XXXXXXGWHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRN 1231
                   W  HQLDVNN FLHGDL E+V+MKIPQGF K  + RVCKLKKSLYGL+QASRN
Sbjct: 843  AIAAKRRWEVHQLDVNNVFLHGDLEEEVFMKIPQGFAKAGETRVCKLKKSLYGLRQASRN 902

Query: 1232 WYHKFTGSLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLD 1411
            WYHKFT +L ++GF+Q+ ADHSLF++ K +TF+ ALIYVDDV+L GN+ +KIQ+ K +L+
Sbjct: 903  WYHKFTKALEDVGFRQSKADHSLFLYDKGETFLTALIYVDDVILAGNNGDKIQEVKSYLN 962

Query: 1412 KRFSIKDLGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLD 1591
             +F IKDLGPLKYFLGIE A++  G+VLSQRKY LDILE++GM GC+PS+FPMEQN KL 
Sbjct: 963  DKFGIKDLGPLKYFLGIEAARSPAGIVLSQRKYALDILEESGMQGCKPSAFPMEQNHKLR 1022

Query: 1592 TCDKEPRVDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLK 1771
                 P +DA QYRRL+GRLLYL  TRPD+ +AVN+LSQFV  PR  HM+AA RVL YLK
Sbjct: 1023 ADSNGPIIDAAQYRRLVGRLLYLTVTRPDLTFAVNVLSQFVSAPRQEHMDAALRVLRYLK 1082

Query: 1772 GTPGQGILLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKS 1951
              PGQG+LL  +G   L+AYCD+DW GC  T+RS TGY + LGG+PISWRTK+Q VVSKS
Sbjct: 1083 KAPGQGVLLSAKGDLFLIAYCDADWGGCLTTKRSCTGYFITLGGSPISWRTKRQEVVSKS 1142

Query: 1952 SAEAEYRAMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVE 2131
            SAEAEYRAM+  VSE+LW+ WLL++L      PT LFCDNQAA HI  NPV+HERTKHVE
Sbjct: 1143 SAEAEYRAMAVTVSELLWLWWLLTDLQSPQTEPTPLFCDNQAALHITANPVYHERTKHVE 1202

Query: 2132 MDCYFVRERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT 2296
            MDCYFVRE+  S EI P  I+T  Q+AD+ TKALG +    L+ KLGV NLHAPT
Sbjct: 1203 MDCYFVREKAQSREIAPRKISTCAQLADIFTKALGKDRFESLVFKLGVANLHAPT 1257



 Score =  306 bits (783), Expect = 1e-82
 Identities = 135/233 (57%), Positives = 187/233 (80%)
 Frame = +1

Query: 2617 IDYESPYYLHPSDYPRQMHVNDVLSDNNYADWSQEMMNFLFAKNKVGFINGSIKKPEEES 2796
            ID  SP+YLH SD P Q++V+D+L D NY +W  +M N LFAKNK+GF++G+I +P  +S
Sbjct: 21   IDVMSPFYLHASDNPGQIYVSDLLHDGNYGEWVNDMSNALFAKNKIGFVDGTIPRPGVDS 80

Query: 2797 SNYMPWMRCDAMIKGWLHTAMEKEIRTSVKYAMTAREIWIDLKERFGKVSAPRAYELKRS 2976
             N   WMRC+AM+KGWL +AM K++R SV+YA TAREIW+DL+ERFGK S PRAYE++R+
Sbjct: 81   PNLQHWMRCNAMVKGWLKSAMGKDVRGSVRYASTAREIWVDLEERFGKGSDPRAYEIRRA 140

Query: 2977 LTSTKQEGTSVSAYYTKLRGIWDEIQSVIPMPRCDCSSCKCDIGKKLQELRDKERLYEFL 3156
            +T  +QE  SVS+YYTKL+G+WDE+QS+ P+ +C C+ CKC+I K+L ++R+KE+LY+FL
Sbjct: 141  VTLLRQEKMSVSSYYTKLKGLWDEMQSIFPLLKCVCNGCKCNISKQLVDMREKEQLYDFL 200

Query: 3157 LGLDAEFGTIRTQILAMNPIPSLGKAYHLVAEDEQQRAISGSKRPSSDSVAFQ 3315
            +GLD EFG ++TQIL+  P P LG AYHLVAEDEQQ+ IS +++P +++ AFQ
Sbjct: 201  MGLDDEFGIVKTQILSTKPTPGLGHAYHLVAEDEQQKQISANRKPIAEAAAFQ 253


>gb|OMO65653.1| hypothetical protein CCACVL1_21443 [Corchorus capsularis]
          Length = 1245

 Score =  758 bits (1956), Expect = 0.0
 Identities = 387/767 (50%), Positives = 493/767 (64%), Gaps = 2/767 (0%)
 Frame = +2

Query: 2    DNKTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVY 181
            D KTP+E+L +  P YDH+KVFGCL Y        DKF  R    +F+GYP GTKGY+VY
Sbjct: 535  DGKTPFEMLFSKPPAYDHLKVFGCLCYALQKPKPNDKFSPRSSKCIFVGYPNGTKGYRVY 594

Query: 182  DLQHRKMVTSRDVKFLENVFPFARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIH 361
            DL  +K+  SRDV+F EN FPF                    ENT       T +ND   
Sbjct: 595  DLTTKKIFVSRDVRFYENQFPF--------------------ENT------STSTNDQTV 628

Query: 362  EPSSVMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHETVPSNQNDMGSETVS 541
             P   + +T+                             L ITH+++P N      +   
Sbjct: 629  VPLPALEDTD-----------------------------LSITHDSIPPNPPQEQPQPHP 659

Query: 542  EENRPTNAHTRPVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSS--TVHPLAHFISYDN 715
              N P    TRP R++TRP RLD    N    +D++ SSL H++S  T++ L++FISYDN
Sbjct: 660  PTNPPNQPSTRPQRTKTRPKRLDDCVCN-NSKVDNSPSSLTHEASSGTLYSLSNFISYDN 718

Query: 716  FTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEELPKGKRAID 895
            F ++HKAFL AI+  +EPK F QAVK  +W EAM++E+ ALE N TW LE LP  K+ I 
Sbjct: 719  FHSSHKAFLAAISLRDEPKSFSQAVKSPQWREAMQKELAALENNNTWTLETLPPRKKPIG 778

Query: 896  SKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXXXXXXXXGW 1075
             KW++KIKYK +G +ERYKAR VAKG+ Q+EG+DFHETFAP                  W
Sbjct: 779  CKWIFKIKYKSDGTIERYKARFVAKGYNQIEGMDFHETFAPVAKLVTVRCLLAIAAIKNW 838

Query: 1076 HTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTGS 1255
              HQLDVNNAFLHGDL+E+VYM +P G+G ++D+RVC+++KSLYGLKQASRNW+ KF  +
Sbjct: 839  ELHQLDVNNAFLHGDLDEEVYMSLPPGYGDKNDSRVCRVRKSLYGLKQASRNWFAKFFAA 898

Query: 1256 LFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKDL 1435
            L E GF Q+  D+SLF      +F+  L+YVDD+++ G+DS +I+  K  LD RF IKDL
Sbjct: 899  LLEFGFIQSTVDYSLFTLTTGSSFLVVLVYVDDLIIAGDDSVRIRSLKQHLDSRFHIKDL 958

Query: 1436 GPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPRV 1615
            GPLKYFLGIEVA++  G+ L QRKYTLDILE+ GMT  +PS+FPMEQ   L      P  
Sbjct: 959  GPLKYFLGIEVARSSSGIFLCQRKYTLDILEECGMTDAKPSAFPMEQKHNLTHDTGPPVQ 1018

Query: 1616 DANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQGIL 1795
            D  QYRRL+GRL+YL  TRP+I+YAV+ILSQF+ DPR  H++AA RVL YLK  PGQGI 
Sbjct: 1019 DPMQYRRLVGRLIYLTITRPEISYAVHILSQFMNDPRQPHLDAALRVLRYLKSCPGQGIF 1078

Query: 1796 LPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRA 1975
                   +L  + DSDW  CP TRRS TGY+ +LG +PISW+TKKQ+ VS+SSAEAEYRA
Sbjct: 1079 FSSSSSPHLTGFSDSDWASCPQTRRSTTGYITMLGSSPISWKTKKQTTVSRSSAEAEYRA 1138

Query: 1976 MSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFVRE 2155
            M+  VSE+LW+R LL  L +    P  LFCDNQ A HIA NPVFHERTKH+E+DC+F+R 
Sbjct: 1139 MAATVSELLWLRSLLQTLGIPHQQPMALFCDNQVAIHIATNPVFHERTKHIELDCHFIRS 1198

Query: 2156 RVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT 2296
             + +  I    I++K Q+AD+ TKALG +   FLL KLG+ NLHAPT
Sbjct: 1199 HIQAKSIQTSHISSKLQLADIFTKALGRDQFQFLLRKLGIFNLHAPT 1245



 Score =  235 bits (600), Expect = 2e-59
 Identities = 129/361 (35%), Positives = 196/361 (54%), Gaps = 24/361 (6%)
 Frame = +1

Query: 2617 IDYESPYYLHPSDYPRQMHVNDVLSDNNYADWSQEMMNFLFAKNKVGFINGSIKKPEEES 2796
            +D  SPY L PSD+P  + V+  L+ +NY  W++ M N L A+NK GF++GS+ KPE  S
Sbjct: 24   MDLSSPYLLQPSDHPGAILVSCPLNGDNYPTWARAMTNALRARNKYGFVDGSLAKPEATS 83

Query: 2797 SNYMPWMRCDAMIKGWLHTAMEKEIRTSVKYAMTAREIWIDLKERFGKVSAPRAYELKRS 2976
             +   W +C++M+  W+  ++  ++  SV Y  TARE+W+DL+ERF + +APR  +LKR 
Sbjct: 84   PDVSTWEKCNSMVISWIFNSLSSDLHNSVAYVDTAREMWLDLEERFSQGNAPRINQLKRD 143

Query: 2977 LTSTKQEGTSVSAYYTKLRGIWDEIQSVIPMPRCDCSSCKCDIGKKLQELRDKERLYEFL 3156
            L  T Q   SV+AYYTKL+GIWDE+Q+   +P C C +      K+L   R++E++++F+
Sbjct: 144  LALTFQINMSVAAYYTKLKGIWDELQTYSTIPPCTCGA-----AKELLLEREREKVHQFI 198

Query: 3157 LGLDAEFGTIRTQILAMNPIPSLGKAYHLVAEDEQQRAISGSKRPSSDSVAFQAHVPVKR 3336
            +GLD  F ++ + IL + P+PSL KAY LV   E++ ++  ++ P  ++ A      V  
Sbjct: 199  MGLDDSFRSVSSHILNIEPLPSLSKAYALVTRAERENSVRSTRPPIVEATALH----VTT 254

Query: 3337 DQNQSQNRTKQKDVKRGNSEPVEQCTVCGKDGHKSEGCFKLIGYP-EWWPGKGKQYKPKP 3513
              N +Q+ T +            +C  C K GH    C++L+GYP  W  GK  + K KP
Sbjct: 255  SANAAQSHTTRL-----------RCDHCNKTGHTKSHCYELVGYPSHWQKGKTDKDKRKP 303

Query: 3514 SAALVEG---------------EKSPIAGLSDSQYRQFLKFFG--------DKDGAKTED 3624
             A                    + SPIAGL+  QY Q +            D  G  T D
Sbjct: 304  HAKAGSSNPKAMFPTCHVAKTIDASPIAGLTSEQYNQLISLLNIEKTNIVDDFSGKTTND 363

Query: 3625 S 3627
            S
Sbjct: 364  S 364


>ref|XP_022018997.1| uncharacterized protein LOC110919025 [Helianthus annuus]
          Length = 386

 Score =  711 bits (1834), Expect = 0.0
 Identities = 354/386 (91%), Positives = 362/386 (93%)
 Frame = +2

Query: 1139 MKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTGSLFEIGFKQTPADHSLFIHRKD 1318
            MKIPQGFGKQDDNRVCKLKK LY LKQASRNWY KFT SL EIGFKQTPA++SLFI +++
Sbjct: 1    MKIPQGFGKQDDNRVCKLKKCLYDLKQASRNWYQKFTHSLLEIGFKQTPANYSLFIFKEN 60

Query: 1319 KTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKDLGPLKYFLGIEVAKTKEGMVLS 1498
            K FVAALIYVDDVVLV NDS KIQ TKDFLDKRFSIKDLGPLKYFLGIEVAKT EGMVLS
Sbjct: 61   KIFVAALIYVDDVVLVRNDSRKIQATKDFLDKRFSIKDLGPLKYFLGIEVAKTNEGMVLS 120

Query: 1499 QRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPRVDANQYRRLIGRLLYLQATRPD 1678
            QRKYTLDILED GMTGCRPSSFPMEQNLKLD CDKEPRVDANQYRRLIGRLLYLQATRPD
Sbjct: 121  QRKYTLDILEDVGMTGCRPSSFPMEQNLKLDMCDKEPRVDANQYRRLIGRLLYLQATRPD 180

Query: 1679 IAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQGILLPKEGGTNLLAYCDSDWLGCP 1858
            IAYAVNILSQFV  PR +HMEAATRVL YLKGT GQGIL+PKEG  NLLAYCDSDWLGCP
Sbjct: 181  IAYAVNILSQFVNYPRQTHMEAATRVLRYLKGTLGQGILIPKEGVANLLAYCDSDWLGCP 240

Query: 1859 MTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMSNAVSEILWMRWLLSELDMA 2038
            MTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMSNAVSEILWMRWLL ELDMA
Sbjct: 241  MTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMSNAVSEILWMRWLLRELDMA 300

Query: 2039 PVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFVRERVDSMEICPMPIATKDQIADV 2218
            PVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYF+RERVDSMEICPM IATKDQIADV
Sbjct: 301  PVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFIRERVDSMEICPMSIATKDQIADV 360

Query: 2219 LTKALGANSLHFLLCKLGVRNLHAPT 2296
            LTK LGANSL FLLCKLGVRNLHAPT
Sbjct: 361  LTKDLGANSLCFLLCKLGVRNLHAPT 386


>gb|OTG09093.1| putative reverse transcriptase, RNA-dependent DNA polymerase,
            Gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 938

 Score =  724 bits (1870), Expect = 0.0
 Identities = 369/575 (64%), Positives = 427/575 (74%)
 Frame = +2

Query: 572  RPVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSSTVHPLAHFISYDNFTNTHKAFLTAI 751
            R  R+R++PARL  + V LPPS+DH   + +  SST                        
Sbjct: 428  RAKRNRSQPARLSDYHVKLPPSVDHANPAPNEASST------------------------ 463

Query: 752  TTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEELPKGKRAIDSKWVYKIKYKPN 931
                 P++F QA++D RW EAMK+EI+ALEEN TW L +LP GKRA+DSKWVYKIKYKPN
Sbjct: 464  -----PRNFNQAIQDERWKEAMKKEIRALEENNTWTLVDLPNGKRAVDSKWVYKIKYKPN 518

Query: 932  GEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXXXXXXXXGWHTHQLDVNNAFL 1111
            GEVER+KARLVAKGFTQMEGVD+H+TFAP                  W  +QLDVNNAFL
Sbjct: 519  GEVERFKARLVAKGFTQMEGVDYHDTFAPVAKLVTVRTLLAVAVKKRWVINQLDVNNAFL 578

Query: 1112 HGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTGSLFEIGFKQTPAD 1291
            HGDLNE+VYMK+PQGF K++D RVC+L KSLYGLKQASRNWY KFT SL E+G+KQ  AD
Sbjct: 579  HGDLNEEVYMKLPQGFAKENDTRVCRLNKSLYGLKQASRNWYQKFTSSLLELGYKQCKAD 638

Query: 1292 HSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKDLGPLKYFLGIEVA 1471
            +SLFI ++D  FVAALIYVDDV++VGND+ KIQ TK  LD+RFSIKDLG LKYFLGIEVA
Sbjct: 639  YSLFIFKEDACFVAALIYVDDVIIVGNDARKIQHTKVELDRRFSIKDLGTLKYFLGIEVA 698

Query: 1472 KTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPRVDANQYRRLIGRL 1651
            +T EG+VLSQRKY LDILED G+ GC+PS FP EQNLKLD  D+EP+VDA++YRRL+GRL
Sbjct: 699  RTPEGLVLSQRKYILDILEDCGLQGCKPSPFPFEQNLKLDKNDEEPKVDASRYRRLVGRL 758

Query: 1652 LYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQGILLPKEGGTNLLAY 1831
            LYLQATRPDIAY+VN+LSQFV DPR SHM+AA RVL                        
Sbjct: 759  LYLQATRPDIAYSVNVLSQFVADPRQSHMDAAHRVL------------------------ 794

Query: 1832 CDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMSNAVSEILWMR 2011
                       RRSRTGYLLLLGGAPISW+TKKQ+VVS+SSAEAEYR+M++ VSEILWMR
Sbjct: 795  -----------RRSRTGYLLLLGGAPISWKTKKQNVVSRSSAEAEYRSMASTVSEILWMR 843

Query: 2012 WLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFVRERVDSMEICPMPI 2191
            WLL EL++  + PT LFCDNQAARHIANNPVFHERTKHVEMDCYFVRERV+S E+ P+ I
Sbjct: 844  WLLKELNIYTIEPTPLFCDNQAARHIANNPVFHERTKHVEMDCYFVRERVESQEVQPLRI 903

Query: 2192 ATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT 2296
             T  QIAD+LTK LG   L FLL KLGVRNLHAPT
Sbjct: 904  DTSMQIADLLTKGLGTQQLTFLLDKLGVRNLHAPT 938



 Score =  469 bits (1208), Expect = e-144
 Identities = 219/338 (64%), Positives = 269/338 (79%), Gaps = 1/338 (0%)
 Frame = +1

Query: 2596 GKTKEGGI-DYESPYYLHPSDYPRQMHVNDVLSDNNYADWSQEMMNFLFAKNKVGFINGS 2772
            G  KEG   D  SP Y+H SDYP+QMHVND L+DNNY DWSQEM+NFLFAKNKVGF++G+
Sbjct: 7    GTKKEGSSPDINSPLYIHASDYPKQMHVNDTLTDNNYTDWSQEMLNFLFAKNKVGFVDGT 66

Query: 2773 IKKPEEESSNYMPWMRCDAMIKGWLHTAMEKEIRTSVKYAMTAREIWIDLKERFGKVSAP 2952
            +KKPE+ +++YM WMRCDAM+KGWL TAMEK+IR SVKYA TA EIW DL+ERFGK SAP
Sbjct: 67   LKKPEKTATDYMAWMRCDAMVKGWLTTAMEKDIRGSVKYANTASEIWSDLRERFGKASAP 126

Query: 2953 RAYELKRSLTSTKQEGTSVSAYYTKLRGIWDEIQSVIPMPRCDCSSCKCDIGKKLQELRD 3132
            RAYELK++L++T Q G+SVSAYYTKLR +WDEI+SV+P PRC C  C C +GKK+ ELR+
Sbjct: 127  RAYELKQTLSNTHQSGSSVSAYYTKLRVLWDEIESVLPAPRCTCDKCSCGVGKKMNELRE 186

Query: 3133 KERLYEFLLGLDAEFGTIRTQILAMNPIPSLGKAYHLVAEDEQQRAISGSKRPSSDSVAF 3312
            KERLYEFL+GLDA+F  I+TQILAMNPIP+LG AYHLVAEDE+QR ISG K+  +++ AF
Sbjct: 187  KERLYEFLMGLDADFAVIKTQILAMNPIPTLGNAYHLVAEDERQRMISGEKKTPTENAAF 246

Query: 3313 QAHVPVKRDQNQSQNRTKQKDVKRGNSEPVEQCTVCGKDGHKSEGCFKLIGYPEWWPGKG 3492
            +A  PV+R+ + SQN+   KD K G  + VEQCT CG+ GHK +GCFK+IGYP+WWPG  
Sbjct: 247  KAFKPVRRENSTSQNKAAPKDQKHG--DMVEQCTHCGRSGHKRDGCFKIIGYPDWWPG-- 302

Query: 3493 KQYKPKPSAALVEGEKSPIAGLSDSQYRQFLKFFGDKD 3606
               K KP AA VE + SP+ GL+  QY+ FLK F + D
Sbjct: 303  ---KMKPKAAHVETDASPVPGLTKEQYQSFLKHFAEND 337


>gb|OMO75305.1| Integrase, catalytic core [Corchorus capsularis]
          Length = 1373

 Score =  734 bits (1895), Expect = 0.0
 Identities = 396/775 (51%), Positives = 487/775 (62%), Gaps = 10/775 (1%)
 Frame = +2

Query: 2    DNKTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVY 181
            +NKTP+E+L   KPEYDH++VFGCL Y      +GDKF  RG+P VF+GYP G KG  + 
Sbjct: 687  NNKTPFEMLFGKKPEYDHLRVFGCLVYAHDNSKRGDKFSERGKPCVFVGYPNGQKGQMIV 746

Query: 182  DLQHRKMVTSRDVKFLENVFPFARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIH 361
             +Q ++ +T+   ++   V   A N  EE  +  LPQ                       
Sbjct: 747  -IQEKQSLTAGG-EYCRPVLNAANNDCEE--VMTLPQ----------------------- 779

Query: 362  EPSSVMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHETVPSNQNDMGSETVS 541
                       + + N AD F   + P   A         DIT   V ++Q    +E+ +
Sbjct: 780  ----------VKGLGNAADRFIAEEIPETAAG--------DIT-AGVTASQETTVTESTA 820

Query: 542  EENRPTNAHT----------RPVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSSTVHPL 691
            EE   ++A            R  R RT+P R DGF+V LPPS    Q +L    S+V+PL
Sbjct: 821  EEPVVSSAVPISGQSRVEVRRSARERTQPKRFDGFDVQLPPSTVPAQPALPSADSSVYPL 880

Query: 692  AHFISYDNFTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEEL 871
            +H++SYD   ++HKAFL  IT+++EPKHF QAVK   W EA ++EIQALEENGTW L  L
Sbjct: 881  SHYVSYDRIAHSHKAFLATITSHDEPKHFSQAVKHKHWREAKEKEIQALEENGTWDLVPL 940

Query: 872  PKGKRAIDSKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXX 1051
            P+ KRAIDSKWVYK+K+KPNGE+ERYKARLVAKGFTQ+EGVDFHETFAP           
Sbjct: 941  PQDKRAIDSKWVYKVKFKPNGEIERYKARLVAKGFTQIEGVDFHETFAPVAKLVTVRCLL 1000

Query: 1052 XXXXXXGWHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRN 1231
                   W  HQLD                                          ASRN
Sbjct: 1001 AIAAKRRWEVHQLD------------------------------------------ASRN 1018

Query: 1232 WYHKFTGSLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLD 1411
            WYHKFT +L ++GF+Q+ ADHSLF++ K +TF+ ALIYVDDV+L GN+ +KIQ+ K +L+
Sbjct: 1019 WYHKFTKALEDVGFRQSKADHSLFLYDKGETFLTALIYVDDVILAGNNGDKIQEIKSYLN 1078

Query: 1412 KRFSIKDLGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLD 1591
             +F IKDLGPLKYFLGIEVA++  G+VLSQRKY LDILE++GM GC+PS+FPME N KL 
Sbjct: 1079 DKFDIKDLGPLKYFLGIEVARSPAGIVLSQRKYVLDILEESGMQGCKPSAFPMEHNHKLR 1138

Query: 1592 TCDKEPRVDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLK 1771
                   +DA QYRRL+GRLLYL  TRPD+ +AVN+LSQFV  PR  HM+AA RVL YLK
Sbjct: 1139 ADSNGTIIDAAQYRRLVGRLLYLTVTRPDLTFAVNVLSQFVSAPRQEHMDAALRVLRYLK 1198

Query: 1772 GTPGQGILLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKS 1951
              PGQGILL  EG   L AYCD+DW GC  TRRS TGY + LGG+PISWRTK+Q VVSKS
Sbjct: 1199 KAPGQGILLSAEGDLFLTAYCDADWGGCLTTRRSCTGYFITLGGSPISWRTKRQQVVSKS 1258

Query: 1952 SAEAEYRAMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVE 2131
            SAEAEYRAM+  VSE+LW+RWLL++L      PT LFCDNQAA HI   PV+HERTKHVE
Sbjct: 1259 SAEAEYRAMAVTVSELLWLRWLLTDLQSPQTEPTPLFCDNQAALHITAKPVYHERTKHVE 1318

Query: 2132 MDCYFVRERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT 2296
            MDCYFVRER  S EI P  I+T  Q+AD+ TKALG +    L+ KLGV NLHA T
Sbjct: 1319 MDCYFVRERAQSREIAPRKISTGAQLADIFTKALGKDRFESLVFKLGVANLHALT 1373



 Score =  214 bits (546), Expect = 9e-53
 Identities = 124/352 (35%), Positives = 183/352 (51%), Gaps = 15/352 (4%)
 Frame = +1

Query: 2617 IDYESPYYLHPSDYPRQMHVNDVLSDNNYADWSQEMMNFLFAKNKVGFINGSIKKPEEES 2796
            ID  SP+YLH SD P Q++V+D+L D NY +W  +M N LFAKNK+GF++G+I +PE +S
Sbjct: 21   IDVMSPFYLHASDNPGQIYVSDLLHDGNYGEWVNDMSNALFAKNKIGFVDGTIPRPEVDS 80

Query: 2797 SNYMPWMRCDAMIKGWLHTAMEKEIRTSVKYAMTAREIWIDLKERFGKVSAPRAYELKRS 2976
             N   WMRC+AM+KGWL +AM K++R SV+YA TAREIW+DL+ERFGK S PRAYE++R+
Sbjct: 81   PNLQHWMRCNAMVKGWLKSAMGKDVRGSVRYASTAREIWVDLEERFGKGSDPRAYEIRRA 140

Query: 2977 LTSTKQEGTSVSAYYTKLRGIWDEIQSVIPMPRCDCSSCKCDIGKKLQELRDKERLYEFL 3156
            +T  +QE  SVS+YYTKL+G +          R  CS                       
Sbjct: 141  VTLLRQEKMSVSSYYTKLKGFY----------RTRCS----------------------- 167

Query: 3157 LGLDAEFGTIRTQILAMNPIPSLGKAYHLVAEDEQQRAISGSKRPSSDSVAFQAHVPVKR 3336
                 +F   R   + +  I  L K  +     +    +   +R        +    ++R
Sbjct: 168  -----QFSHCRNASVLVMHITWLRKMNNKNRSRQTANPLLRRRRSKCKEAKMEGTEDLER 222

Query: 3337 DQNQSQNRTKQKDVKRGN-----SEPVEQCTVCGKDGHKSEGCFKLIGYPEWW------- 3480
              NQ  N  K+    +         P  +C  C K GH  + C+++IGYP  W       
Sbjct: 223  KTNQGVNIVKKVGHTKDQCYEIIGYPAARCGHCQKSGHTKDQCYEIIGYPAGWRKNLRDK 282

Query: 3481 ---PGKGKQYKPKPSAALVEGEKSPIAGLSDSQYRQFLKFFGDKDGAKTEDS 3627
                G+   ++  P AA VE E + I GL+ +Q  + ++F  + DG  ++ +
Sbjct: 283  KEKGGQTVNHRTFPKAAQVESEMTSIPGLTQAQLAKLVQFL-NVDGESSKQT 333


>gb|PNY16822.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 834

 Score =  701 bits (1808), Expect = 0.0
 Identities = 368/771 (47%), Positives = 482/771 (62%), Gaps = 9/771 (1%)
 Frame = +2

Query: 11   TPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYDLQ 190
            TPYEVL  + P YDH+K+FGCL Y  +     DKF+ R    +F+GYP G KG+KVY+ +
Sbjct: 99   TPYEVLFRNSPTYDHLKIFGCLCYVSTNTKLRDKFDPRAERCIFVGYPQGQKGWKVYNPK 158

Query: 191  HRKMVTSRDVKFLENVFPFARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIHEPS 370
             +K   SRDV F EN+ P+  +  E      LP +        +I   Q +SN +  E  
Sbjct: 159  TQKFFVSRDVVFYENILPYVVHEKE------LPIE-SPSVVFHEISGQQEESNHDEVEKY 211

Query: 371  SVMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHETVPSNQNDMGSETVSEEN 550
                E   Q   N A+  G ++EP                        ND+G E   E+ 
Sbjct: 212  E-RQENRGQGDTNPAEMDGQNEEP------------------------NDVGIEVHKEKE 246

Query: 551  RPTNAHTR-----PVRSRTRPARLDGFEVNL----PPSLDHTQSSLHHDSSTVHPLAHFI 703
                 H       P R+R  P  L  +        P S+  TQS   H S  ++P+ +FI
Sbjct: 247  TEVPVHNEMETDLPPRTRQPPGYLQDYHCYTSHKNPISMPKTQS---HSSGKIYPITNFI 303

Query: 704  SYDNFTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEELPKGK 883
            S D ++  H+A+L AI    EP+ +++AVK   W EAM  E++ALEENGTW LE  P  K
Sbjct: 304  SNDCYSRRHQAYLAAIHNTKEPQSYREAVKKTEWKEAMAAELKALEENGTWDLELPPTCK 363

Query: 884  RAIDSKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXXXXXX 1063
            + +  KWVYK+KYK  GEVE+YKARLVAKG+TQ+EG DF+ETFAP               
Sbjct: 364  KIVGCKWVYKVKYKATGEVEKYKARLVAKGYTQVEGEDFNETFAPVAKMTTVRCLLTVAV 423

Query: 1064 XXGWHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHK 1243
              GW  HQ+DV+NAFLHGDL+E+VYM++P+G+       VC+L+KSLYGLKQASRNWY K
Sbjct: 424  AKGWELHQMDVSNAFLHGDLDEEVYMQVPEGYHTPKAGMVCRLRKSLYGLKQASRNWYSK 483

Query: 1244 FTGSLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFS 1423
             + +L E GF+++ ADHSLF + ++  F+A L+YVDD+V+ GN S+   + K +L + F 
Sbjct: 484  LSHALIEYGFQESHADHSLFTYSREGEFMAVLVYVDDLVIAGNYSDTCTNFKQYLRRCFH 543

Query: 1424 IKDLGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDK 1603
            +KDLGPLKYFLG+E+A+   G+ + QRKY +DIL++  M   +PS+FPMEQN KL     
Sbjct: 544  MKDLGPLKYFLGLELARGATGLFMCQRKYIMDILDECKMLDSKPSTFPMEQNQKLALDTG 603

Query: 1604 EPRVDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPG 1783
                D  +YRRL+GRL+YL  TRP+I Y+V+ILSQF   P+ +H +AA RVL YLK TPG
Sbjct: 604  PAYSDPPRYRRLVGRLIYLTITRPEITYSVHILSQFTQSPQQAHWDAAMRVLRYLKFTPG 663

Query: 1784 QGILLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEA 1963
            QGI+LPKE    L+AYCDSDW  CP+TRRS +GYL+ LG APISW+TKKQS VSKSS+EA
Sbjct: 664  QGIILPKENDLQLVAYCDSDWASCPLTRRSTSGYLMKLGSAPISWKTKKQSTVSKSSSEA 723

Query: 1964 EYRAMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCY 2143
            EYRAM  AVSE++W+R LLS L +    PT LFCDNQAA H+A NPV+HERTKH+E+DC+
Sbjct: 724  EYRAMGQAVSEVIWLRSLLSSLQVHYKSPTVLFCDNQAAIHLAANPVYHERTKHIEVDCH 783

Query: 2144 FVRERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT 2296
            F+R  +    I    ++TK Q AD+ TKALGA     L  KLG  N H PT
Sbjct: 784  FIRTHLQKGTISTNYVSTKKQQADIFTKALGAKQFQELTFKLGAHNPHTPT 834


>gb|OTG24510.1| putative reverse transcriptase, RNA-dependent DNA polymerase,
            Gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 934

 Score =  699 bits (1805), Expect = 0.0
 Identities = 346/525 (65%), Positives = 406/525 (77%)
 Frame = +2

Query: 722  NTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEELPKGKRAIDSK 901
            N +   L    TN++P   +    D +W  AM++EI+ALE+NGTW LEELP+GKR  DSK
Sbjct: 425  NDYVVSLPPSVTNSQPGSSQANSTDEKWRNAMQQEIKALEKNGTWTLEELPEGKRPTDSK 484

Query: 902  WVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXXXXXXXXGWHT 1081
            WVYK K+K +GEVERYKARLVAKGFTQMEGVD+HETFAP                  W  
Sbjct: 485  WVYKTKFKSDGEVERYKARLVAKGFTQMEGVDYHETFAPVAKLVTVRTLLAVATKKDWII 544

Query: 1082 HQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTGSLF 1261
            HQLDVNNAFLHGDL+E+VYMKIP+GF K+ + RVC+L+KSLYGLKQASRNWY + T  L 
Sbjct: 545  HQLDVNNAFLHGDLDEEVYMKIPKGFEKEGETRVCRLRKSLYGLKQASRNWYKRLTSFLL 604

Query: 1262 EIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKDLGP 1441
             + FKQ+ AD+SLF ++K   +VA LIYVDDV++VG++S KIQ  K  LD  FSIKDLGP
Sbjct: 605  SLNFKQSKADYSLFTYQKAGIYVAILIYVDDVIIVGDNSKKIQQIKQQLDDEFSIKDLGP 664

Query: 1442 LKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPRVDA 1621
            LKYFLGIEVAKTK+G+VLSQRKY LDIL+D+GM GCRPS+FP EQ  KLD  +KE RVDA
Sbjct: 665  LKYFLGIEVAKTKDGLVLSQRKYILDILKDSGMLGCRPSAFPFEQGTKLDKGEKEARVDA 724

Query: 1622 NQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQGILLP 1801
             QYRRL+GRLLYLQATRPD+ YAVN               AA RVL YLKGTPGQGILLP
Sbjct: 725  TQYRRLVGRLLYLQATRPDVTYAVN---------------AANRVLRYLKGTPGQGILLP 769

Query: 1802 KEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMS 1981
            +EG   L  YCDSDWLGCP TRRSRTGYLLLLGG+PISW+TKKQSVVS+SSAEAEYRAM+
Sbjct: 770  REGPPVLTGYCDSDWLGCPFTRRSRTGYLLLLGGSPISWKTKKQSVVSRSSAEAEYRAMA 829

Query: 1982 NAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFVRERV 2161
            + VSEILW+RWLL ++ +    PT LFCDNQAARHIANNPV+HERTKHVEMDC+FVRERV
Sbjct: 830  STVSEILWVRWLLKDMQVQITTPTSLFCDNQAARHIANNPVYHERTKHVEMDCFFVRERV 889

Query: 2162 DSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT 2296
            ++ EI P PI +K Q+AD+LTK LG   L  LL K+G+R+LHAP+
Sbjct: 890  ETREIEPKPIESKLQLADLLTKGLGTQQLRSLLSKMGIRDLHAPS 934



 Score =  417 bits (1071), Expect = e-124
 Identities = 197/345 (57%), Positives = 253/345 (73%), Gaps = 1/345 (0%)
 Frame = +1

Query: 2617 IDYESPYYLHPSDYPRQMHVNDVLSDNNYADWSQEMMNFLFAKNKVGFINGSIKKPEEES 2796
            ID+ SPYYLHPSD P+Q  VN+VL+D NY DW+QEM NFLFAKNK+ F++G++KKPE  S
Sbjct: 14   IDFSSPYYLHPSDSPKQPSVNEVLTDGNYNDWAQEMTNFLFAKNKIDFVDGTLKKPETSS 73

Query: 2797 SNYMPWMRCDAMIKGWLHTAMEKEIRTSVKYAMTAREIWIDLKERFGKVSAPRAYELKRS 2976
            S Y  WMRCDAMIKGWL TAMEK IR SVKYA+T+ EIW DLKERFGK SAPR YELK+ 
Sbjct: 74   SQYKSWMRCDAMIKGWLTTAMEKSIRDSVKYAVTSSEIWSDLKERFGKESAPRTYELKQK 133

Query: 2977 LTSTKQEGTSVSAYYTKLRGIWDEIQSVIPMPRCDCSSCKCDIGKKLQELRDKERLYEFL 3156
            + +T+Q+G++VS YYT+LR +WDE  S+ P P C C+ C C++GKK+ E  +K++LYEFL
Sbjct: 134  IAATRQDGSNVSTYYTRLRSLWDESHSIFPFPCCSCNKCTCELGKKIAEHLEKQQLYEFL 193

Query: 3157 LGLDAEFGTIRTQILAMNPIPSLGKAYHLVAEDEQQRAISGSKRPSSDSVAFQAHVPVKR 3336
            +GLD +F  IRTQILA  P+P+LG AYH+VAEDE+QRAIS   R + +S AF+     KR
Sbjct: 194  MGLDNDFNVIRTQILATKPVPTLGTAYHMVAEDERQRAISNENRVAPESAAFKTF--QKR 251

Query: 3337 DQNQSQNRTKQKDVKRGNSEPVEQCTVCGKDGHKSEGCFKLIGYPEWWPGKGKQYKPKPS 3516
              N    + K    +   S+  +QCT CG++ HK EGCFKL+GYP+WWPGK K  K KP 
Sbjct: 252  HNNFKSPKEKYTTTQEKESKQNDQCTFCGRNSHKREGCFKLVGYPDWWPGK-KDDKAKPK 310

Query: 3517 AALVEGEKSPIAGLSDSQYRQFLKFF-GDKDGAKTEDSGPKANLA 3648
            AA V+   SPI G+S+ QY+ F+KFF G  +  +T+    +AN+A
Sbjct: 311  AACVDTGTSPIPGISEEQYQAFVKFFSGSGNNVETKS---EANMA 352


>gb|OTG06009.1| putative zinc finger, CCHC-type [Helianthus annuus]
          Length = 961

 Score =  696 bits (1795), Expect = 0.0
 Identities = 353/506 (69%), Positives = 401/506 (79%), Gaps = 6/506 (1%)
 Frame = +2

Query: 632  PSLDHTQSSLHHDSSTVHPLAHFISYD--NFTNTHKAFLTAITTNNEPKHFK----QAVK 793
            P    T  SLH    TV P     +    + + T K+      T + P   K    QAVK
Sbjct: 13   PKPSPTGPSLHEAFETVEPQREKKAGHGPSISMTMKSVSPHPLTMSNPPPIKSPQRQAVK 72

Query: 794  DVRWVEAMKREIQALEENGTWILEELPKGKRAIDSKWVYKIKYKPNGEVERYKARLVAKG 973
            D +WVEAMK+E+QALEENGTW  EELP+GKRAIDSKWVYKI+YKPNGE+ERYKARL+AKG
Sbjct: 73   DPKWVEAMKKEVQALEENGTWRPEELPEGKRAIDSKWVYKIQYKPNGEIERYKARLMAKG 132

Query: 974  FTQMEGVDFHETFAPXXXXXXXXXXXXXXXXXGWHTHQLDVNNAFLHGDLNEDVYMKIPQ 1153
            F+Q EG+DFHETFAP                 GWH HQLDVNNAFL GDL+EDVYMKIPQ
Sbjct: 133  FSQTEGIDFHETFAPVAKLVTVRTLLAVAVKKGWHIHQLDVNNAFLDGDLHEDVYMKIPQ 192

Query: 1154 GFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTGSLFEIGFKQTPADHSLFIHRKDKTFVA 1333
            GF +Q+D  VCKLKKSLYGLKQASRNWY KFT SL EIGF+QT  DHSLF+ + D+ F+A
Sbjct: 193  GFVRQNDQCVCKLKKSLYGLKQASRNWYKKFTKSLLEIGFRQTGVDHSLFLLKTDEVFMA 252

Query: 1334 ALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKDLGPLKYFLGIEVAKTKEGMVLSQRKYT 1513
            ALIYVDDV+LV N+  ++Q+ K FLDKRFSIKDLG LK+FLGIEVA+TK+G+VLSQ KY 
Sbjct: 253  ALIYVDDVILVWNNMEEMQNVKSFLDKRFSIKDLGVLKFFLGIEVARTKKGLVLSQHKYI 312

Query: 1514 LDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPRVDANQYRRLIGRLLYLQATRPDIAYAV 1693
            LDILED GMTGCRPS FPM+QNLKLD C+KE +VDANQYRRLIGRLLYLQATRPD+AY V
Sbjct: 313  LDILEDTGMTGCRPSQFPMKQNLKLDKCNKEAQVDANQYRRLIGRLLYLQATRPDVAYVV 372

Query: 1694 NILSQFVGDPRHSHMEAATRVLCYLKGTPGQGILLPKEGGTNLLAYCDSDWLGCPMTRRS 1873
            NI S+FVGDPR SH+EAA RVL YLKGTP Q ILLPK GGTNL+ YCDSDWLGCP TRRS
Sbjct: 373  NIFSKFVGDPRVSHLEAANRVLRYLKGTPRQRILLPKLGGTNLITYCDSDWLGCPFTRRS 432

Query: 1874 RTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMSNAVSEILWMRWLLSELDMAPVGPT 2053
            RTGYLLLLGGAPISW++K+QSVVS+SSAEAEYRAM+  +SE   MR LL ELD+   GPT
Sbjct: 433  RTGYLLLLGGAPISWKSKEQSVVSRSSAEAEYRAMAAVISE---MRRLLKELDIPQEGPT 489

Query: 2054 QLFCDNQAARHIANNPVFHERTKHVE 2131
            QLFC+NQAARHIANN VFHERTKH++
Sbjct: 490  QLFCNNQAARHIANNLVFHERTKHIQ 515


>gb|PNX93928.1| hypothetical protein L195_g017092, partial [Trifolium pratense]
          Length = 865

 Score =  685 bits (1767), Expect = 0.0
 Identities = 355/764 (46%), Positives = 487/764 (63%)
 Frame = +2

Query: 5    NKTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYD 184
            NK+P+E+L+N  P  DH++VFGCL Y   V     KF+ R + G+F+GYP G KGYK+YD
Sbjct: 131  NKSPFELLYNKPPSLDHLRVFGCLCYATIVHPT-HKFDPRAKRGIFVGYPTGQKGYKIYD 189

Query: 185  LQHRKMVTSRDVKFLENVFPFARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIHE 364
             + +    SRDVKF E  FP   N +E   I   P                 ++ D++  
Sbjct: 190  PETKTFFVSRDVKFCETNFPSIPNTSEPNLISSHP---------------SYEAIDDLPS 234

Query: 365  PSSVMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHETVPSNQNDMGSETVSE 544
            P+S   +++  D+ +  +   PS   +E  TS    P+++ T  T  ++  D  +  + +
Sbjct: 235  PTSSHHQSQQTDIPSTHEPNSPSHITTE--TSSAASPIVEPTPLT--THTTDPPTPFIPQ 290

Query: 545  ENRPTNAHTRPVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSSTVHPLAHFISYDNFTN 724
              +       P+            + ++   ++ T S     S T +PL+H++SY   ++
Sbjct: 291  VRKSVRDKHPPIWHN---------DYHMSTQVNKTPSEPTSGSGTRYPLSHYLSYSRISS 341

Query: 725  THKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEELPKGKRAIDSKW 904
            ++ AFL  IT + EP+ + QAV D  W +AM  E++ALE+N TW L  LP G + I  KW
Sbjct: 342  SNCAFLANITAHREPQSYDQAVHDPLWQDAMNAELEALEQNNTWSLVPLPSGHKPIGCKW 401

Query: 905  VYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXXXXXXXXGWHTH 1084
            VYKIKYK +G +ERYKARLVAKG+TQ+EG+D+ ETF+P                  W  H
Sbjct: 402  VYKIKYKSDGTIERYKARLVAKGYTQVEGIDYQETFSPTAKVTTLRCLLTVAAARNWFIH 461

Query: 1085 QLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTGSLFE 1264
            QLDV NAFLHGDL+E VYM+ P G  +Q +N VC+L KSLYGLKQASRNW+  F+  + +
Sbjct: 462  QLDVQNAFLHGDLHELVYMEPPPGLRRQGENVVCRLNKSLYGLKQASRNWFSTFSEVIQK 521

Query: 1265 IGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKDLGPL 1444
             G++Q+ AD+SLF   +  +F A LIYVDD++L GND  +++  K+FL KRF IKDLG L
Sbjct: 522  AGYQQSKADYSLFTKSQGTSFTAVLIYVDDILLTGNDLQEMKRLKEFLLKRFRIKDLGNL 581

Query: 1445 KYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPRVDAN 1624
            KYFLGIE +++K+G+ +SQRKY LDIL+D+G+TG RP  FPMEQNLKL   D     D  
Sbjct: 582  KYFLGIEFSRSKKGIFMSQRKYALDILQDSGLTGARPDKFPMEQNLKLTPTDGVVLNDPT 641

Query: 1625 QYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQGILLPK 1804
            +YRRL+GRL+YL  TRPDI Y+V  LSQF+ +PR  H +AA RVL Y+KGTPGQG+L   
Sbjct: 642  KYRRLVGRLIYLTVTRPDIVYSVQTLSQFMHEPRKPHWDAALRVLRYIKGTPGQGLLFSS 701

Query: 1805 EGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMSN 1984
                 L A+CDSDW GC  TRRS TG+ L LG + ISW++KKQ VVS+SSAE+EYRAM+N
Sbjct: 702  TNDLTLKAFCDSDWGGCHATRRSVTGFCLFLGNSLISWKSKKQVVVSRSSAESEYRAMAN 761

Query: 1985 AVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFVRERVD 2164
               E+ W+R++L +L ++   PT LFCDNQAA HIA NPVFHERTKH+E+DC+ VRE++ 
Sbjct: 762  TCLELTWLRFILQDLKVSQNTPTPLFCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQ 821

Query: 2165 SMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT 2296
            +  I P  + T+ Q+ADV TKALG +    L  KLG+ ++H+PT
Sbjct: 822  AGIINPSYVPTRFQLADVFTKALGKDQFVTLRSKLGLHDIHSPT 865


>gb|PNX96222.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 1369

 Score =  700 bits (1806), Expect = 0.0
 Identities = 370/765 (48%), Positives = 481/765 (62%), Gaps = 6/765 (0%)
 Frame = +2

Query: 11   TPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYDLQ 190
            TPYE+L    P YDH+KVFGCL Y  +   + DKF  R    VFLGYP G KG+KVY+L+
Sbjct: 634  TPYEMLFGKSPNYDHIKVFGCLCYVATTSKQRDKFGPRADRCVFLGYPQGQKGWKVYNLK 693

Query: 191  HRKMVTSRDVKFLENVFPFARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIHEPS 370
             R+ + SRDV F ENVFPF               K DE E   D              P 
Sbjct: 694  TREFIVSRDVVFYENVFPF---------------KIDEREAVTD-------------PPR 725

Query: 371  SVMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHET--VPSNQNDMGSETVSE 544
            ++   T  + + N     G   E ++ A        ++I  E+  V   Q +   E ++E
Sbjct: 726  NLFHNTPPR-IENEISSEGEMAEENDSAAQ------VNIMEESCEVHVPQTEDNEEILNE 778

Query: 545  ENRPTNAHTR---PVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSS-TVHPLAHFISYD 712
            +N           P R R  PA L  F       +  ++S +   SS  VH + +F+  D
Sbjct: 779  KNHHQQTEAMIIMPPRDRKPPAYLQDFHCYAAGEVPPSKSLIFSPSSGKVHSITNFMRND 838

Query: 713  NFTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEELPKGKRAI 892
             F+  H+AFL  I+ + EP  + QAVK V W  AM +E++ALEEN TW L+  PKGK+ +
Sbjct: 839  CFSQRHQAFLAEISKHEEPTTYSQAVKHVEWRNAMNQELKALEENETWELDFPPKGKKVV 898

Query: 893  DSKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXXXXXXXXG 1072
              KWVYKIKYK  GE+E+YKARLVAKG+TQ+EG DF+ETFAP                  
Sbjct: 899  GCKWVYKIKYKATGEIEKYKARLVAKGYTQVEGEDFNETFAPVAKMTTVRCMLSVAVAKD 958

Query: 1073 WHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTG 1252
            W  HQ+DV+NAFLHG+L+E+VYMK P+G+       VC+LKKSLYGL+QASRNWY K + 
Sbjct: 959  WELHQMDVSNAFLHGELDEEVYMKAPEGYALPKIGMVCRLKKSLYGLRQASRNWYSKLSN 1018

Query: 1253 SLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKD 1432
            +L E GF ++ ADHSLF +    TF+A LIYVDD+V+ GN++      K +L   F +KD
Sbjct: 1019 ALLEYGFIESHADHSLFTYSHQSTFLAVLIYVDDLVIAGNNTAACTKFKKYLSGCFHMKD 1078

Query: 1433 LGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPR 1612
            LGPLKYFLG+E+A+ K G+ + QRKYTLDIL + GM GC+PSSFPMEQN +L     EP 
Sbjct: 1079 LGPLKYFLGLELARGKSGLFICQRKYTLDILNECGMLGCKPSSFPMEQNHRLALASGEPY 1138

Query: 1613 VDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQGI 1792
             + ++YRRL+GRL+YL  TRP+I YAV+ LSQF+  P+ +H +AA  VL YLK +PGQGI
Sbjct: 1139 AEPSRYRRLVGRLIYLTITRPEITYAVHTLSQFMQCPQQAHWDAAMHVLRYLKSSPGQGI 1198

Query: 1793 LLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYR 1972
            +LP+E    L+AY DSDW  CP+TRRS +GYLL LG APISW+TKKQS VS+SS+EAEYR
Sbjct: 1199 VLPRENELKLVAYSDSDWASCPLTRRSISGYLLKLGAAPISWKTKKQSTVSRSSSEAEYR 1258

Query: 1973 AMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFVR 2152
            AM++A SEILW+R LL+ L +    PT L+CDNQAA H+A NPV+HERTKH+E+DC+F+R
Sbjct: 1259 AMAHATSEILWLRRLLTCLQVDCNSPTTLYCDNQAAMHLAANPVYHERTKHIEVDCHFIR 1318

Query: 2153 ERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLH 2287
            E +    I    + TK Q AD+ TK+LG+     L  KLGV N H
Sbjct: 1319 EHIQEGTIVTDYVPTKQQQADIFTKSLGSTQFQSLSVKLGVHNPH 1363



 Score =  161 bits (407), Expect = 4e-36
 Identities = 86/258 (33%), Positives = 143/258 (55%), Gaps = 5/258 (1%)
 Frame = +1

Query: 2722 MMNFLFAKNKVGFINGSIKKPEEESSNYMPWMRCDAMIKGWLHTAMEKEIRTSVKYAMTA 2901
            M   L AK K+GFI+G+IKKP  +S++Y  W R D+M+  W+  + +  +  S+ +  TA
Sbjct: 1    MRTALRAKVKLGFIDGTIKKPGAQSADYFNWERADSMVTAWIINSTDPALHGSISHGSTA 60

Query: 2902 REIWIDLKERFGKVSAPRAYELKRSL-TSTKQEGTSVSAYYTKLRGIWDEIQSVIPMPRC 3078
            R++W+DL+ERF + + PR ++L R L    K++  SV+ +YTK +GI+DE+  + P+P C
Sbjct: 61   RDVWLDLEERFAQTNQPRIHQLWRMLCLMQKEDDLSVTEFYTKFKGIYDELNELQPLPEC 120

Query: 3079 DCSSCKCDIGKKLQELRDKERLYEFLLGLD-AEFGTIRTQILAMNPIPSLGKAYHLVAED 3255
             C +      K+L +  + ++++ FL  LD  +F  ++  IL   P+PSL K ++ V  +
Sbjct: 121  SCGA-----SKELMKREEDQKVHLFLGSLDNQQFAHVKATILNTEPLPSLRKTFNTVLRE 175

Query: 3256 EQQRAISG---SKRPSSDSVAFQAHVPVKRDQNQSQNRTKQKDVKRGNSEPVEQCTVCGK 3426
            E +        S +P + +  +            S +R K +D  +      E+C  CGK
Sbjct: 176  EARYTAERERISNKPDAGAAFY-----------SSASRQKWRDRSK------EKCDHCGK 218

Query: 3427 DGHKSEGCFKLIGYPEWW 3480
             GH   GCF++IGYP  W
Sbjct: 219  TGHLKSGCFEIIGYPPNW 236


>gb|PNX97998.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Trifolium pratense]
          Length = 964

 Score =  685 bits (1767), Expect = 0.0
 Identities = 364/775 (46%), Positives = 474/775 (61%), Gaps = 13/775 (1%)
 Frame = +2

Query: 8    KTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYDL 187
            K+P++VL  S P Y  ++VFGCL + +++  +  KF+ R +PG+F+GYP   KGY++YD+
Sbjct: 236  KSPHQVLLGSPPSYSSLRVFGCLCFAKNMNIQ-HKFDERAKPGIFVGYPFNQKGYRIYDM 294

Query: 188  QHRKMVTSRDVKFLENVFPF--ARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIH 361
              RK+  SRDV+F E VFP+   + P+    I +  Q  D E        D T SN    
Sbjct: 295  HTRKIYVSRDVQFFETVFPYHDLQTPSFASDISINTQFLDYE-------VDDTPSN---- 343

Query: 362  EPSSVMAETETQDVHNGADFFGPSQEPSEPATS-GPNEPLLDITHETVPSNQNDMGSETV 538
                                         PA+S  P     D T  T+P+   D  SE  
Sbjct: 344  ---------------------------LSPASSIPPGISHHDNTIVTIPNPSVDNPSEIP 376

Query: 539  SEENRPTNAHT----------RPVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSSTVHP 688
            +    P   H+           P+R RT   RL           DH     +  S +  P
Sbjct: 377  AIPVEPPQQHSPTAINHPERRYPLRHRTPSVRL----------TDHVCDINNVTSQSAFP 426

Query: 689  LAHFISYDNFTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEE 868
            L ++ S  N + +H+A L  I  N EP  + QA+K   W EAM +EI ALE N TW+L  
Sbjct: 427  LKNYFSLSNLSTSHRALLVNIIENKEPTSYSQAIKSAEWREAMAKEIHALESNNTWVLSP 486

Query: 869  LPKGKRAIDSKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXX 1048
            LP GK AI  KWVYKIKY  +G VERYKARLVAKG+ Q+ G+D+HETFAP          
Sbjct: 487  LPNGKTAIGCKWVYKIKYHSDGTVERYKARLVAKGYNQVHGIDYHETFAPVAKLVTVRLL 546

Query: 1049 XXXXXXXGWHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASR 1228
                    W  HQLDVNNAFL GDLNE+VYMK+P GF  +    VCKL KS+YGLKQASR
Sbjct: 547  LSIAAIKNWSLHQLDVNNAFLQGDLNEEVYMKLPPGFSHKGQPCVCKLNKSIYGLKQASR 606

Query: 1229 NWYHKFTGSLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFL 1408
             W+ KF+ +L + GF Q+ +D+SLF  + + T +  L+YVDD+++ GN+ + I D K FL
Sbjct: 607  QWFSKFSTTLIQKGFHQSISDYSLFTFKSNHTTIFVLVYVDDIIITGNNDDAISDIKKFL 666

Query: 1409 DKRFSIKDLGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKL 1588
             + FSIKDLG L YFLGIEV+++K+G+ L QRKYTLDIL DAG+TGCRPS FPMEQ+L+L
Sbjct: 667  AQAFSIKDLGNLSYFLGIEVSRSKKGIFLCQRKYTLDILSDAGLTGCRPSEFPMEQHLRL 726

Query: 1589 DTCDKEPRVDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYL 1768
               D  P  D   YRRLIGRLLYL  TRPDI YAVN LSQF+  P  +H++AATRVL YL
Sbjct: 727  RPNDGSPLPDPTVYRRLIGRLLYLTVTRPDIQYAVNTLSQFMQSPCTTHLDAATRVLRYL 786

Query: 1769 KGTPGQGILLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSK 1948
            KG+ G+G+ L       L+ Y DSDW GCP TRRS TGY  +LG  PISW+TKKQ  +S+
Sbjct: 787  KGSVGKGLFLSASSSLQLIGYADSDWAGCPTTRRSTTGYFTMLGSNPISWKTKKQPTISR 846

Query: 1949 SSAEAEYRAMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHV 2128
            SSAEAEYR+++   SE+ W+++LLS+LD+A   P  + CD+QAA HIA NPVFHERTKH+
Sbjct: 847  SSAEAEYRSLATLASELQWLKFLLSDLDIAHPLPITVHCDSQAAIHIAENPVFHERTKHI 906

Query: 2129 EMDCYFVRERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAP 2293
            E+DC+FVRE++ S  + P  + + DQ+AD+ TK LG ++   LL KLGV  +  P
Sbjct: 907  EIDCHFVREKIKSGLLRPSYLRSFDQLADIFTKPLGGDAYKRLLGKLGVLEISIP 961



 Score = 93.2 bits (230), Expect = 3e-15
 Identities = 61/203 (30%), Positives = 97/203 (47%)
 Frame = +1

Query: 3040 WDEIQSVIPMPRCDCSSCKCDIGKKLQELRDKERLYEFLLGLDAEFGTIRTQILAMNPIP 3219
            WDE+ S+ P+  C C + K  I ++ Q+     R  EFL G+   F  +R+QIL M+P P
Sbjct: 1    WDELHSIAPINPCICGNAKSIIDQQNQD-----RAMEFLQGVHDRFSAVRSQILLMDPFP 55

Query: 3220 SLGKAYHLVAEDEQQRAISGSKRPSSDSVAFQAHVPVKRDQNQSQNRTKQKDVKRGNSEP 3399
            S+ + Y++V ++E+Q+ I+    P+ +S A QA        ++ Q+RT+ K        P
Sbjct: 56   SIQRIYNIVRQEEKQQEINFRPLPAEESAALQA--------SKVQHRTQGK-------RP 100

Query: 3400 VEQCTVCGKDGHKSEGCFKLIGYPEWWPGKGKQYKPKPSAALVEGEKSPIAGLSDSQYRQ 3579
               C  C K GH    C+++ G+P   P K +               +P   LS +QY++
Sbjct: 101  RPYCENCNKYGHTVATCYQIHGFPNRPPKKSE-----------SSSSTPAQQLSSAQYQK 149

Query: 3580 FLKFFGDKDGAKTEDSGPKANLA 3648
             L        AK +  G   NLA
Sbjct: 150  LLSLL-----AKEDTMGSSVNLA 167


>gb|PRQ55089.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 1285

 Score =  693 bits (1789), Expect = 0.0
 Identities = 362/781 (46%), Positives = 487/781 (62%), Gaps = 14/781 (1%)
 Frame = +2

Query: 8    KTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYDL 187
            KTP+E L + +P Y H++VFGC  +  +  T+  KF+ R    VFLGYP G KGYKVY+L
Sbjct: 517  KTPFEKLFHKEPSYSHLRVFGCQCFVSTHPTRPSKFDPRSMECVFLGYPHGQKGYKVYNL 576

Query: 188  QHRKMVTSRDVKFLENVFPFARN----PTEEEKIFVLPQKWDEEENTRDIRADQTKSNDN 355
              +K + SRDV F EN FPF +N    P++   +F    +    +N              
Sbjct: 577  TTKKSLVSRDVIFFENAFPFPKNSESFPSQNTDLFPSIPRLAHYDNP------------- 623

Query: 356  IHEPSSVMAETETQDVHNGADFFGPS-QEPSEPATSGPNEPLLD---ITHETVPSNQNDM 523
                 S+     +   H+      P+ Q  +E   + P++ L     ++ +    N + +
Sbjct: 624  -----SIPKIPPSSPTHHSPPMISPNPQSSAEQYLNSPSKSLSSTDPVSSDITLPNLDTI 678

Query: 524  GSETVSEENRPTNAHTRP---VRSRTRPARLDGFEVN--LPPSLDHTQSSLHHDS-STVH 685
             S+ +   + P     RP    R+   P  L  F ++  LP  L  + SS    +  T H
Sbjct: 679  SSDHIPSLSPPEQTPPRPRKSTRATKLPTALQDFHIDAALPTRLAPSSSSNEVTTPGTAH 738

Query: 686  PLAHFISYDNFTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILE 865
             L+H +SY N ++ H+ F   IT   EP  F QAVKD +W EAM+ E+QAL++N TW L 
Sbjct: 739  SLSHVLSYANLSSPHRTFTANITLQREPTSFSQAVKDPKWREAMRLEVQALQDNKTWSLV 798

Query: 866  ELPKGKRAIDSKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXX 1045
              P  KR I  KWVYKIKY P+G +ERYKARLVAKG++Q+EG+D+ ETFAP         
Sbjct: 799  PPPAHKRPIGCKWVYKIKYNPDGTIERYKARLVAKGYSQVEGLDYRETFAPVAKLTTVRV 858

Query: 1046 XXXXXXXXGWHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQAS 1225
                     WH HQLDVNNAFL+GDL+EDVYM +P GF ++ +++VCKL KSLYGL+QAS
Sbjct: 859  LLSLAAQQNWHLHQLDVNNAFLNGDLHEDVYMHLPPGFERKGEHKVCKLHKSLYGLRQAS 918

Query: 1226 RNWYHKFTGSLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDF 1405
            + W+ K + +L   GFKQ+ +D+S+F+     TF A L+YVDDV+L GN+ + I  TK F
Sbjct: 919  KQWFLKLSSALKSAGFKQSWSDYSMFVRSHQGTFTALLVYVDDVILAGNNLDDIIRTKSF 978

Query: 1406 LDKRFSIKDLGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLK 1585
            L   F +KD+G LKYFLG+EVA++K G+ LSQRKY L+ILED G  G +PS FP+EQN+ 
Sbjct: 979  LSSHFKLKDMGQLKYFLGLEVARSKHGIALSQRKYALEILEDTGFLGAKPSRFPLEQNII 1038

Query: 1586 LDTCDKEPRVDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCY 1765
            L   D     DA+QYRRL+GRL+Y   TRPD+ YAV+ILSQF+  PR  H++AA +VL Y
Sbjct: 1039 LTQEDGRLLEDASQYRRLVGRLIYQTITRPDLVYAVHILSQFMDKPRQPHLDAAHKVLRY 1098

Query: 1766 LKGTPGQGILLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVS 1945
            LK TPGQGI LP +G   L AYCD+DW  C  TRRS TGY + LG APISW+TKKQ  VS
Sbjct: 1099 LKQTPGQGIFLPSKGPLELSAYCDADWARCKDTRRSTTGYCIFLGHAPISWKTKKQRTVS 1158

Query: 1946 KSSAEAEYRAMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKH 2125
            +SSAEAEYR+M+    EI W++++L +L++  + P +LFCDN+AA HIA+NPVFHERTKH
Sbjct: 1159 RSSAEAEYRSMATTCCEITWLQYILKDLNIQHLQPVKLFCDNKAAIHIASNPVFHERTKH 1218

Query: 2126 VEMDCYFVRERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT*GG 2305
            +E+DC+ VRE+V    I    I TK+Q AD+ TK L +     LL KLGV N+H+   G 
Sbjct: 1219 IEIDCHVVREKVQRGLIQTEHIRTKEQPADIFTKPLSSEQFSLLLGKLGVINIHSNLRGS 1278

Query: 2306 V 2308
            +
Sbjct: 1279 I 1279


>ref|XP_022004406.1| uncharacterized protein LOC110901966 [Helianthus annuus]
          Length = 415

 Score =  661 bits (1706), Expect = 0.0
 Identities = 320/409 (78%), Positives = 357/409 (87%)
 Frame = +2

Query: 1070 GWHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFT 1249
            GWH HQLDVNNAFL+GDL+EDVYMKIP+GF KQD N VCKLKKSLYGLKQASRNWY KFT
Sbjct: 7    GWHVHQLDVNNAFLYGDLHEDVYMKIPEGFRKQDTNMVCKLKKSLYGLKQASRNWYQKFT 66

Query: 1250 GSLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIK 1429
             SL +IGFKQT A+HSLFI R+   FVAALIYVDDV++VGN  NKIQ+ K FLDK+FSIK
Sbjct: 67   NSLLDIGFKQTGANHSLFIFREKDIFVAALIYVDDVIIVGNALNKIQEIKLFLDKKFSIK 126

Query: 1430 DLGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEP 1609
            DLGPLK+FLGIEVA+T EGMVLSQRKYTLDILE+ GM GCRPS FPMEQNLKL  C++EP
Sbjct: 127  DLGPLKFFLGIEVARTNEGMVLSQRKYTLDILEETGMMGCRPSPFPMEQNLKLGKCEEEP 186

Query: 1610 RVDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQG 1789
            ++D+NQYRRL+G+LLYLQATR DIAYAVN+LSQFVGDPR SHMEAA RVL YLK TPGQG
Sbjct: 187  KIDSNQYRRLVGKLLYLQATRLDIAYAVNVLSQFVGDPRKSHMEAANRVLRYLKSTPGQG 246

Query: 1790 ILLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEY 1969
            IL+PKEGGT L AYCDSDWLGCP+TRRSR+GY+LL+GGAP+SW++KKQSVVS+SSA+AEY
Sbjct: 247  ILIPKEGGTRLTAYCDSDWLGCPITRRSRSGYVLLIGGAPVSWKSKKQSVVSRSSAKAEY 306

Query: 1970 RAMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFV 2149
            R M N VSEILWMRWLLS   + P GPT LFCDNQAARHIANNPVFHERTKHVEMDC+FV
Sbjct: 307  REMVNTVSEILWMRWLLSLRGVPPTGPTPLFCDNQAARHIANNPVFHERTKHVEMDCHFV 366

Query: 2150 RERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT 2296
            RERV+S EI PM I TK  IAD+LTK  G +    LL KLGVR+LHAPT
Sbjct: 367  RERVESKEIQPMKIDTKAHIADILTKPSGTHQFKVLLDKLGVRDLHAPT 415


>ref|XP_021979664.1| uncharacterized protein LOC110875770 [Helianthus annuus]
          Length = 376

 Score =  656 bits (1693), Expect = 0.0
 Identities = 326/375 (86%), Positives = 333/375 (88%)
 Frame = +2

Query: 983  MEGVDFHETFAPXXXXXXXXXXXXXXXXXGWHTHQLDVNNAFLHGDLNEDVYMKIPQGFG 1162
            MEGVDFHETFAP                  WH HQLDVNNAFLHGDL+EDVYMKIPQGFG
Sbjct: 1    MEGVDFHETFAPVAKLVTVRTLLIVAVKHDWHIHQLDVNNAFLHGDLHEDVYMKIPQGFG 60

Query: 1163 KQDDNRVCKLKKSLYGLKQASRNWYHKFTGSLFEIGFKQTPADHSLFIHRKDKTFVAALI 1342
            KQDDNRVCKLKKSLYGLKQASRNWY KFT SL EIGFKQTPADHSLFI +++K FVAALI
Sbjct: 61   KQDDNRVCKLKKSLYGLKQASRNWYQKFTHSLLEIGFKQTPADHSLFIFKENKIFVAALI 120

Query: 1343 YVDDVVLVGNDSNKIQDTKDFLDKRFSIKDLGPLKYFLGIEVAKTKEGMVLSQRKYTLDI 1522
            YVDDVVLVGNDS KI  TKDFLDKRFSIKDLGPLKYFLGIEVAKT EGMVLSQRKYTLDI
Sbjct: 121  YVDDVVLVGNDSRKIHATKDFLDKRFSIKDLGPLKYFLGIEVAKTNEGMVLSQRKYTLDI 180

Query: 1523 LEDAGMTGCRPSSFPMEQNLKLDTCDKEPRVDANQYRRLIGRLLYLQATRPDIAYAVNIL 1702
            LED GMT CRPSSFPMEQNLKLD CDKE RVDANQYRRLIGRLLYLQATRPDIAYAVNIL
Sbjct: 181  LEDVGMTVCRPSSFPMEQNLKLDMCDKETRVDANQYRRLIGRLLYLQATRPDIAYAVNIL 240

Query: 1703 SQFVGDPRHSHMEAATRVLCYLKGTPGQGILLPKEGGTNLLAYCDSDWLGCPMTRRSRTG 1882
             QF   PR +HMEAATRVL YLKGTPGQGIL+PKEGG NLLAYCDS+WLGCPMTRRSRTG
Sbjct: 241  RQFANGPRQTHMEAATRVLRYLKGTPGQGILIPKEGGANLLAYCDSEWLGCPMTRRSRTG 300

Query: 1883 YLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMSNAVSEILWMRWLLSELDMAPVGPTQLF 2062
            YLLLLGGAPISWRTKKQSVV KSSAEAEYRAMSNAVSEILWMRWLL ELDMAPVG TQLF
Sbjct: 301  YLLLLGGAPISWRTKKQSVVFKSSAEAEYRAMSNAVSEILWMRWLLRELDMAPVGLTQLF 360

Query: 2063 CDNQAARHIANNPVF 2107
            CDNQAARHIANNPVF
Sbjct: 361  CDNQAARHIANNPVF 375


>emb|CAN71595.1| hypothetical protein VITISV_010143 [Vitis vinifera]
          Length = 1523

 Score =  695 bits (1793), Expect = 0.0
 Identities = 371/796 (46%), Positives = 482/796 (60%), Gaps = 18/796 (2%)
 Frame = +2

Query: 8    KTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYDL 187
            KTP+E L +  P Y H++VFGC  +  +   +  KF+ R    VF+GYP G KGYKVY L
Sbjct: 733  KTPFEKLFHKSPNYSHLRVFGCRCFVSTHPLRPSKFDPRSIESVFIGYPHGQKGYKVYSL 792

Query: 188  QHRKMVTSRDVKFLENVFPFAR-----NPTEEEKIFVLPQKWDEEENTRDIRADQTKSND 352
            + +K + SRDV F E  FP+       +P+ +     LPQ  D +++          S  
Sbjct: 793  KDKKXLISRDVTFFETEFPYQNXLSTTSPSLDTFFPSLPQTPDIDDD----HISFNHSGS 848

Query: 353  NIHEPSSVMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHETVPSNQNDMGSE 532
            N+   ++   +   Q   + +        PS P +   + P++     + P   +     
Sbjct: 849  NLQPSATSSVDXHPQPTLDNSHSSSHVDPPSSPPSLNTSPPVISQPSPSQPRRSS----- 903

Query: 533  TVSEENRPTNAHTRPVRSRTRPARLDGFEVN-------LPPSLDHTQSSLHHDSSTVHPL 691
                  RPT            P  L  F +        +PPS   + S + H S T+H L
Sbjct: 904  ------RPTKT----------PTTLQDFHIEAALPSRPVPPS---STSEVAH-SGTIHSL 943

Query: 692  AHFISYDNFTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEEL 871
            +  +SYD  +  HKAF   IT   EP+ F QAV D RW EAM  EIQAL+ N TW L  L
Sbjct: 944  SQVLSYDRLSPMHKAFTVKITLAKEPRSFSQAVLDSRWREAMNTEIQALQANKTWSLVPL 1003

Query: 872  PKGKRAIDSKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXX 1051
            P  K+ I  KWVYKIKY P+G +ERYKARLVAKGF+Q+EG+D+ ETFAP           
Sbjct: 1004 PSHKKPIGCKWVYKIKYNPDGTIERYKARLVAKGFSQVEGIDYRETFAPVAKLTTVRVLL 1063

Query: 1052 XXXXXXGWHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRN 1231
                  GWH HQLDVNNAFL+GDL EDVYM++P GFG++ ++RVCKL KSLYGLKQASR 
Sbjct: 1064 SLASIQGWHLHQLDVNNAFLNGDLYEDVYMQLPPGFGRKGEHRVCKLHKSLYGLKQASRQ 1123

Query: 1232 WYHKFTGSLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLD 1411
            W+ K + +L   GFKQ+ +D+SLF       F   L+YVDDV+L GN    I +TK FL 
Sbjct: 1124 WFLKLSSALKAAGFKQSWSDYSLFXRNTQGRFTTLLVYVDDVILAGNSLEDIIETKQFLA 1183

Query: 1412 KRFSIKDLGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLD 1591
              F +KD+G L+YFLGIEVA++K+G+VL QRKY L++LEDAG  G +PS FP+EQ+L L 
Sbjct: 1184 SHFKLKDMGQLRYFLGIEVARSKQGIVLCQRKYALELLEDAGFLGAKPSRFPVEQSLTLT 1243

Query: 1592 TCDKEPRVDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLK 1771
              D     DA+QYRRL+GRL+YL  TRPD+ YAV+ILSQF+  PR  H++AA +VL Y+K
Sbjct: 1244 RGDGAELKDASQYRRLVGRLIYLTITRPDLVYAVHILSQFMDTPRQPHLDAAYKVLRYVK 1303

Query: 1772 GTPGQGILLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKS 1951
             TPGQGI LP  G   L AYCD+DW  C  TRRS TGY +  G APISW+TKKQ  VS+S
Sbjct: 1304 QTPGQGIFLPSTGQLELTAYCDADWARCKDTRRSTTGYCIFFGNAPISWKTKKQGTVSRS 1363

Query: 1952 SAEAEYRAMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVE 2131
            SAEAEYR+M+    EI W+R LL++L++      +LFCDNQAA HIA+NPVFHERTKH+E
Sbjct: 1364 SAEAEYRSMATTCCEITWLRSLLADLNVNHAHAVKLFCDNQAAIHIASNPVFHERTKHIE 1423

Query: 2132 MDCYFVRERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT*GGVL 2311
            MDC+ VRE+V    +  M I T++Q AD+ TK L +     LL KLGV N+H    G  +
Sbjct: 1424 MDCHVVREKVQRGLVKTMHIRTQEQPADLFTKPLSSKQFSTLLSKLGVINIHTNLRGSEV 1483

Query: 2312 HMER------IWVFHY 2341
             +ER      IW+  Y
Sbjct: 1484 DVERGSNDSGIWLKSY 1499



 Score =  165 bits (418), Expect = 2e-37
 Identities = 96/355 (27%), Positives = 176/355 (49%), Gaps = 6/355 (1%)
 Frame = +1

Query: 2581 KDQSTGKTKEGGIDYESPYYLHPSDYPRQMHVNDVLSDNNYADWSQEMMNFLFAKNKVGF 2760
            K  +  +T E   ++  P +LH SD P  + V+  L ++NY  W Q M   L  KNK GF
Sbjct: 14   KHSNPSRTTEPWENFNHPLFLHHSDQPGAVLVSQPLMEDNYTTWVQSMDMALTIKNKKGF 73

Query: 2761 INGSIKKPEEESSNYMPWMRCDAMIKGWLHTAMEKEIRTSVKYAMTAREIWIDLKERFGK 2940
            ++G++ +P    +    W RC+ ++K WL  A+ KEI  SV +   A+ +W++L+ERF  
Sbjct: 74   VDGTLNRPTHNPNEQQQWDRCNILVKTWLLGAISKEISNSVIHCKDAKTMWLELQERFSH 133

Query: 2941 VSAPRAYELKRSLTSTKQEGTSVSAYYTKLRGIWDEIQSVIPMPRCDCSSCKCDIGKKLQ 3120
             +  + + ++ ++    Q   +V++++TKL+G+WDE  ++   P C C++       +++
Sbjct: 134  TNTVQLFNIENAIHECAQGTGTVTSFFTKLKGLWDEKDALCGFPPCTCAT-----AAEVK 188

Query: 3121 ELRDKERLYEFLLGLDAEFGTIRTQILAMNPIPSLGKAYHLVAEDEQQRAISGSKRPSSD 3300
               + ++  +FL+GL   + T+R+ I+ M+P+P++ KAY +    E+Q   S  K    +
Sbjct: 189  TYMETQKTMKFLMGLGDNYATVRSNIIGMDPLPTVNKAYAMALRHEKQAEASNGKVAVPN 248

Query: 3301 SVAFQAHVPVKRDQNQSQNRTKQKDVKRGNSEPVE-----QCTVCGKDGHKSEGCFKLIG 3465
              +  +   + +D N ++   K +     N          +CT CG  GH  + C +   
Sbjct: 249  EASAFSVRKLDQDPNTTEREVKCEKCNMTNHSTKNCRAHLKCTYCGGKGHTYDYCRRRKN 308

Query: 3466 YPEWWPGKGKQYKPKPSAALVEGEKSPI-AGLSDSQYRQFLKFFGDKDGAKTEDS 3627
                  G+G+  K   +A L EG++      LS S+ +Q +        A T  S
Sbjct: 309  --TMGGGQGRS-KVNHAATLNEGKEDVTNFPLSQSECQQMMGLLSKIKTAATSHS 360


>ref|XP_021975259.1| uncharacterized protein LOC110870383 [Helianthus annuus]
          Length = 438

 Score =  657 bits (1694), Expect = 0.0
 Identities = 320/438 (73%), Positives = 360/438 (82%)
 Frame = +2

Query: 983  MEGVDFHETFAPXXXXXXXXXXXXXXXXXGWHTHQLDVNNAFLHGDLNEDVYMKIPQGFG 1162
            MEGVDFH+TFAP                 GW   QLDVNNAFLHGDL+EDVYMK+PQGF 
Sbjct: 1    MEGVDFHDTFAPVAKLVTVRSLLAIATKRGWAIQQLDVNNAFLHGDLHEDVYMKMPQGFN 60

Query: 1163 KQDDNRVCKLKKSLYGLKQASRNWYHKFTGSLFEIGFKQTPADHSLFIHRKDKTFVAALI 1342
            K +  +VCKLKKSLYGLKQASRNWY KFT +  +I FKQ+  DHSLFI++K   +VA LI
Sbjct: 61   KGEGTKVCKLKKSLYGLKQASRNWYQKFTSAPHKIEFKQSKVDHSLFIYKKGDAYVATLI 120

Query: 1343 YVDDVVLVGNDSNKIQDTKDFLDKRFSIKDLGPLKYFLGIEVAKTKEGMVLSQRKYTLDI 1522
            YVDDV++VGND NKIQ TKD+LDK FSIKDLGPLKYFLGIEVA+TK+G+VLSQRKYTLDI
Sbjct: 121  YVDDVIIVGNDLNKIQQTKDYLDKEFSIKDLGPLKYFLGIEVARTKDGLVLSQRKYTLDI 180

Query: 1523 LEDAGMTGCRPSSFPMEQNLKLDTCDKEPRVDANQYRRLIGRLLYLQATRPDIAYAVNIL 1702
            LED+GM GCRPS FPMEQ+LKL   ++E RVDA QYRRL+GRLLYLQATRPDI Y+VN+L
Sbjct: 181  LEDSGMQGCRPSMFPMEQHLKLTKDEEEHRVDARQYRRLVGRLLYLQATRPDITYSVNVL 240

Query: 1703 SQFVGDPRHSHMEAATRVLCYLKGTPGQGILLPKEGGTNLLAYCDSDWLGCPMTRRSRTG 1882
            SQFV DPR SHM+A TRVL YLK TPGQGILLPKEGG NL AY D DWLGC +TRRSRTG
Sbjct: 241  SQFVSDPRRSHMDAVTRVLRYLKATPGQGILLPKEGGVNLTAYSDLDWLGCQLTRRSRTG 300

Query: 1883 YLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMSNAVSEILWMRWLLSELDMAPVGPTQLF 2062
            YLLLLGGA +SW++KKQSVVS+SS EAEYRAM++ VSE+LWMRWLL+EL   P  PT LF
Sbjct: 301  YLLLLGGALVSWKSKKQSVVSRSSTEAEYRAMASTVSEVLWMRWLLTELGAPPDAPTPLF 360

Query: 2063 CDNQAARHIANNPVFHERTKHVEMDCYFVRERVDSMEICPMPIATKDQIADVLTKALGAN 2242
            CDNQAARHIANNPVFHERTKHVEMDCYFVRERV+S E+ P PI TK Q+ADVLTKALG  
Sbjct: 361  CDNQAARHIANNPVFHERTKHVEMDCYFVRERVESQEVRPTPIDTKMQVADVLTKALGTQ 420

Query: 2243 SLHFLLCKLGVRNLHAPT 2296
                L+ KLG+ +LHAPT
Sbjct: 421  QFRTLINKLGICDLHAPT 438


>gb|KYP64168.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 967

 Score =  676 bits (1743), Expect = 0.0
 Identities = 354/762 (46%), Positives = 477/762 (62%), Gaps = 1/762 (0%)
 Frame = +2

Query: 11   TPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYDLQ 190
            TPY++L+   P YDH+++FGCL Y ++   + DKF  R    +F+GYP   KG+KVY+L+
Sbjct: 235  TPYKMLYEKPPSYDHLRIFGCLCYVKNSSKRQDKFMPRSEKCMFIGYPQNKKGWKVYNLE 294

Query: 191  HRKMVTSRDVKFLENVFPFARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIHEPS 370
              +   SRDV F +  F +            LP +   E  T   R +   S + +HE +
Sbjct: 295  THEFFISRDVIFYKEYFSYK-----------LPARIVFENYTSQERDEY--SYNMMHEDN 341

Query: 371  SVMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHETVPSNQNDMGSETVSE-E 547
              +++               SQ  SE   S   E +   +H     + ND   E  SE E
Sbjct: 342  DHVSKDA-------------SQVSSEGEHSENEEEIESNSHNIEVKSSNDEDLENQSEGE 388

Query: 548  NRPTNAHTRPVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSSTVHPLAHFISYDNFTNT 727
            NR     TR   +  +        VN  PS   T  +  + S  V+PL++F+SYD+F++ 
Sbjct: 389  NRAEEKRTRQPLTYLKDYYCHAITVN--PSCMTTNPN--NSSGMVYPLSNFVSYDHFSHR 444

Query: 728  HKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEELPKGKRAIDSKWV 907
            H+A+L A+ +++EPK + QA+K   W  AM +EI+ALE+N  W L  LP G+  +  KWV
Sbjct: 445  HRAYLAALGSHDEPKTYAQAMKHPEWRTAMTQEIKALEDNQRWELTHLPPGRETVGCKWV 504

Query: 908  YKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXXXXXXXXGWHTHQ 1087
            YKIKYK  GE+E+YKARLVAKGFTQ+EG DF+ETFAP                  W  HQ
Sbjct: 505  YKIKYKATGEIEKYKARLVAKGFTQIEGEDFNETFAPVAKMTTIRCLLSLTVAKEWELHQ 564

Query: 1088 LDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTGSLFEI 1267
            +DV+NAFLHG+LNE+VYM +PQG+G      VC+L++SLYGL QASRNWY K + SL E 
Sbjct: 565  MDVSNAFLHGELNEEVYMVVPQGYGVPTKGMVCRLRESLYGLCQASRNWYTKLSHSLEEY 624

Query: 1268 GFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKDLGPLK 1447
            GFK+   DHSLF++  +  F+A LIYVDD+V+  N+S      K++L   F +KDLG LK
Sbjct: 625  GFKECDVDHSLFVYSHNSIFIAVLIYVDDLVIASNNSLACAQFKEYLSNCFHMKDLGNLK 684

Query: 1448 YFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPRVDANQ 1627
            YFLG+E+A+  +G+ + QRKYTLDIL + GM GC+P+SFP+EQN +L      P  + +Q
Sbjct: 685  YFLGLELARDSKGLFICQRKYTLDILNECGMLGCKPTSFPVEQNHRLALATGSPFPEPSQ 744

Query: 1628 YRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQGILLPKE 1807
            YRRLIG L+YL  TRP+I Y+V+ILSQF+  P   H +AA RVL YLK +PGQGI+LP  
Sbjct: 745  YRRLIGCLIYLTITRPEITYSVHILSQFMQAPLQEHWDAAMRVLRYLKSSPGQGIILPNT 804

Query: 1808 GGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYRAMSNA 1987
                L+ YCDSDW  CP+TR+S +GYL+ LG  PISW+TKKQS VS+SS+EAEYRA+++A
Sbjct: 805  NDLRLVGYCDSDWASCPLTRKSISGYLMKLGPTPISWKTKKQSTVSRSSSEAEYRAIAHA 864

Query: 1988 VSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFVRERVDS 2167
             SEI+W+R LL +L +    PT L CDNQAA H+A NPVFHERTKH+E+DC+F+R  ++ 
Sbjct: 865  TSEIIWLRSLLKDLQVDCDSPTFLHCDNQAALHLAANPVFHERTKHIEVDCHFIRNHLEK 924

Query: 2168 MEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAP 2293
              I    I TK+Q AD+ TK+LG      L  KLGV N H P
Sbjct: 925  GTITTSYIPTKEQQADIFTKSLGRKMFLELTVKLGVHNPHTP 966


>gb|PNX95363.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Trifolium pratense]
          Length = 1200

 Score =  679 bits (1751), Expect = 0.0
 Identities = 355/768 (46%), Positives = 479/768 (62%), Gaps = 5/768 (0%)
 Frame = +2

Query: 8    KTPYEVLHNSKPEYDHMKVFGCLAYYRSVETKGDKFEVRGRPGVFLGYPPGTKGYKVYDL 187
            KTPYE+L  +KP Y  ++VFGCL Y  +    GDKF  R R  +F+GYP G KG++VYD+
Sbjct: 453  KTPYELLFGAKPTYTQIRVFGCLCYALNENRGGDKFASRSRKCIFVGYPYGKKGWEVYDV 512

Query: 188  QHRKMVTSRDVKFLENVFPFARNPTEEEKIFVLPQKWDEEENTRDIRADQTKSNDNIHEP 367
            +  +   SR+VKF+ENVFPFA N  +  +I +    W  + +      D+     NI   
Sbjct: 513  ETEEYFVSRNVKFVENVFPFAANSGDMREIQIDEWGWPHDSSDDH---DEGNEEQNIVST 569

Query: 368  SS-----VMAETETQDVHNGADFFGPSQEPSEPATSGPNEPLLDITHETVPSNQNDMGSE 532
            S+     V  E +  D  +  ++    QE +E  T       LD         QN   +E
Sbjct: 570  SNLGSTLVPLENDIVDTESHEEWHIMQQE-TERTTDEDQVESLD-------PQQNIRSNE 621

Query: 533  TVSEENRPTNAHTRPVRSRTRPARLDGFEVNLPPSLDHTQSSLHHDSSTVHPLAHFISYD 712
             +    R     TR     T  AR+         S   +   L   S T +P+AHF++ D
Sbjct: 622  LLGRGYRVKKPSTRLNDHVTHTARV---------STSTSSPLLSKSSGTRYPIAHFVNCD 672

Query: 713  NFTNTHKAFLTAITTNNEPKHFKQAVKDVRWVEAMKREIQALEENGTWILEELPKGKRAI 892
             F+  H+ FL AIT  +EP  F +AVKD +W +AMK EI+ALE+NGTW +E+LP GK+AI
Sbjct: 673  KFSMQHRVFLAAITAEHEPNSFAEAVKDEKWRDAMKEEIRALEDNGTWTIEDLPSGKKAI 732

Query: 893  DSKWVYKIKYKPNGEVERYKARLVAKGFTQMEGVDFHETFAPXXXXXXXXXXXXXXXXXG 1072
              KW+YKIKY  +G +ER+KARLV  G  Q+EGVD++ETFAP                  
Sbjct: 733  GCKWIYKIKYNSDGSIERHKARLVIHGNRQVEGVDYNETFAPTAKMVTVRTFLAVAAAKN 792

Query: 1073 WHTHQLDVNNAFLHGDLNEDVYMKIPQGFGKQDDNRVCKLKKSLYGLKQASRNWYHKFTG 1252
            +  HQ+DV NAFLHGDL+E+VYMK+P GF KQ  ++VC+L+KSLYGLKQA R W+ K + 
Sbjct: 793  FQLHQMDVRNAFLHGDLDEEVYMKLPPGFDKQPPSKVCRLRKSLYGLKQAPRCWFAKLSE 852

Query: 1253 SLFEIGFKQTPADHSLFIHRKDKTFVAALIYVDDVVLVGNDSNKIQDTKDFLDKRFSIKD 1432
            +L   GFKQ+ +D+SLF    D T +  L+YVDD+V+ GN S+ I + KD+L   F +KD
Sbjct: 853  ALKAYGFKQSYSDYSLFTLHSDDTEMYVLVYVDDIVISGNHSDAINEFKDYLGHCFHMKD 912

Query: 1433 LGPLKYFLGIEVAKTKEGMVLSQRKYTLDILEDAGMTGCRPSSFPMEQNLKLDTCDKEPR 1612
            LG LKYFLGIEVA+   G+ LSQRKY LD++ D G+ G +P++ P+EQN +L   +    
Sbjct: 913  LGKLKYFLGIEVARNGTGIFLSQRKYALDLISDCGLLGAKPANIPIEQNHRLALIEGVNL 972

Query: 1613 VDANQYRRLIGRLLYLQATRPDIAYAVNILSQFVGDPRHSHMEAATRVLCYLKGTPGQGI 1792
             D   YR L+GRL+YL  T P+++Y V+IL+QF+ +P+  H  AA RV+ YLKG PGQGI
Sbjct: 973  EDPTGYRSLVGRLIYLTITHPELSYCVHILAQFMQNPKLEHWNAALRVVRYLKGNPGQGI 1032

Query: 1793 LLPKEGGTNLLAYCDSDWLGCPMTRRSRTGYLLLLGGAPISWRTKKQSVVSKSSAEAEYR 1972
            LL  +    L  YCD+DW GCP+TRRS T Y ++LG +PISW+TKKQ+ VS+SSAEAEYR
Sbjct: 1033 LLSVDCDLRLYGYCDADWAGCPLTRRSLTAYFVMLGNSPISWKTKKQTTVSRSSAEAEYR 1092

Query: 1973 AMSNAVSEILWMRWLLSELDMAPVGPTQLFCDNQAARHIANNPVFHERTKHVEMDCYFVR 2152
            +M+ A  E+ W++ LLS L +A   P  L+CD+Q+A HIANN V HERTKH+E+DC+FVR
Sbjct: 1093 SMAAATCELKWLKELLSSLGVAHSDPMHLYCDSQSALHIANNXVLHERTKHIEVDCHFVR 1152

Query: 2153 ERVDSMEICPMPIATKDQIADVLTKALGANSLHFLLCKLGVRNLHAPT 2296
            + +    I P  + T  Q+AD+LTKALG      LL KLGV +LH PT
Sbjct: 1153 DEIAKGNIQPKYVHTSTQLADILTKALGKRQFDALLSKLGVLHLHTPT 1200



 Score =  205 bits (522), Expect = 5e-50
 Identities = 108/359 (30%), Positives = 185/359 (51%), Gaps = 37/359 (10%)
 Frame = +1

Query: 2644 HPSDYPRQMHVNDVLSDNNYADWSQEMMNFLFAKNKVGFINGSIKKPEEESSNYMPWMRC 2823
            + SD P  +     L   NY +W++ +   L A+ K GF++GSIK+P+E++ +   W   
Sbjct: 1    YSSDNPGNIITQVQLKGENYDEWARAVRGSLRARRKFGFVDGSIKQPDEDAPDIDDWWTV 60

Query: 2824 DAMIKGWLHTAMEKEIRTSVKYAMTAREIWIDLKERFGKVSAPRAYELKRSLTSTKQEGT 3003
            ++MI  W+   +E ++R+++ Y   A+E+W D+K+R    + PR  +LK  L + +Q G 
Sbjct: 61   NSMIVSWILNTIEPKLRSTITYKENAQELWDDIKQRLSISNGPRIQQLKSELANCRQNGD 120

Query: 3004 SVSAYYTKLRGIWDEIQSVIPMPRCDCSSCKCDIGKKLQELRDKERLYEFLLGLD-AEFG 3180
            S+  Y+ +L+ +WDE+     +P C C+ CKC I   L + R++E+L++FL+GLD  +F 
Sbjct: 121  SIVNYFGRLKKLWDELNDFDQIPICTCNGCKCGISTTLNKKREEEKLHQFLMGLDEVQFR 180

Query: 3181 TIRTQILAMNPIPSLGKAYHLVAEDEQQRAISGSKRPSSDSVAFQAHVPVKRDQNQSQNR 3360
            T+R+ IL+++P+P+L +AY +  ++E+   I+  K    D V+F     VK  ++  + +
Sbjct: 181  TVRSNILSLDPLPTLNRAYQMAVQEERVGVIARGKEERGDPVSF----AVKAGRSLGREK 236

Query: 3361 TKQKDVKRGNSEPVEQCTVCGKDGHKSEGCFKLIGYPEWW-------------------- 3480
              Q+   +        C+ C + GH  + CF L+GYP+WW                    
Sbjct: 237  KSQEMSSK-------TCSYCKRSGHDVDSCFPLVGYPDWWGDRPRAEGRDPGHGKAVHRP 289

Query: 3481 ---PGKGKQYKPKPSAALV-------------EGEKSPIAGLSDSQYRQFLKFFGDKDG 3609
                GKGK    K + A V             EGE+  + GLS  Q+   LK    + G
Sbjct: 290  MIGSGKGKGINAKVNVAQVVDVAGATNKDTEGEGEQIGLPGLSPGQWNALLKAINTQKG 348

