BLASTX nr result

ID: Rehmannia30_contig00016559 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia30_contig00016559
         (3015 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX93614.1| retrovirus-related Pol polyprotein from transposo...   654   0.0  
gb|KYP61022.1| Retrovirus-related Pol polyprotein from transposo...   645   0.0  
gb|PNX74277.1| retrovirus-related Pol polyprotein from transposo...   620   0.0  
gb|KYP65734.1| Retrovirus-related Pol polyprotein from transposo...   625   0.0  
gb|OMO65653.1| hypothetical protein CCACVL1_21443 [Corchorus cap...   630   0.0  
gb|PNY00469.1| retrovirus-related Pol polyprotein from transposo...   614   0.0  
dbj|GAU41679.1| hypothetical protein TSUD_272630 [Trifolium subt...   627   0.0  
gb|KYP42321.1| Copia protein [Cajanus cajan]                          630   0.0  
gb|KYP34298.1| Retrovirus-related Pol polyprotein from transposo...   612   0.0  
ref|XP_012486681.1| PREDICTED: LOW QUALITY PROTEIN: retrovirus-r...   600   0.0  
gb|KYP55668.1| Retrovirus-related Pol polyprotein from transposo...   615   0.0  
gb|PNX93928.1| hypothetical protein L195_g017092, partial [Trifo...   604   0.0  
gb|PNX97998.1| retrovirus-related Pol polyprotein from transposo...   603   0.0  
gb|KZV53534.1| hypothetical protein F511_42283 [Dorcoceras hygro...   604   0.0  
gb|PNX93906.1| hypothetical protein L195_g017068 [Trifolium prat...   607   0.0  
gb|KYP34293.1| Retrovirus-related Pol polyprotein from transposo...   611   0.0  
gb|PNX93131.1| retrovirus-related Pol polyprotein from transposo...   597   0.0  
gb|PNX92076.1| retrovirus-related Pol polyprotein from transposo...   588   0.0  
gb|OMP02866.1| Reverse transcriptase, RNA-dependent DNA polymera...   585   0.0  
gb|PNY03100.1| retrovirus-related Pol polyprotein from transposo...   582   0.0  

>gb|PNX93614.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 1430

 Score =  654 bits (1687), Expect = 0.0
 Identities = 352/722 (48%), Positives = 452/722 (62%), Gaps = 40/722 (5%)
 Frame = -3

Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834
            CL++A+  H H++KF  RA K VFLGY  G KG+ LYD+ NHS L SR+V+F+ED FP +
Sbjct: 713  CLAFASTLHNHRTKFMPRARKTVFLGYRDGTKGFLLYDISNHSFLVSRNVIFYEDVFPLS 772

Query: 2833 SI---------------------------PVP----SNXXXXXXXXXXXXXXXXXXTSSA 2747
            S+                           P P    +                    S++
Sbjct: 773  SVNSSHTSSTTTLDNFVLPIDPPNFPSSCPAPLSVSTGTNPLTDHAENSATLVDNQVSNS 832

Query: 2746 PISQPESH----PLRRSSRISKPPAWLSDFITNSVHSSTPMASPSHSAGPDSGDFSLAPT 2579
            P   P++     P R S+RI K P +L DF     H S     PS      S  FS  P 
Sbjct: 833  PAVPPQNSSIPAPTRVSNRIRKIPGYLQDF-----HCSL---LPSQHQSSSSNAFSTYPI 884

Query: 2578 SFNHSSILGAT-YTAFLANLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSL 2402
            S + S    AT Y  F  ++S   EP +F QACKS  W +AM+ EL ALE N TW +  L
Sbjct: 885  SSSLSYTNCATAYKHFCLSISTTIEPKTFKQACKSDCWKEAMKSELAALELNRTWSIVDL 944

Query: 2401 PPGKRAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVI 2222
            P GK  IGCKWVYK+K   DG++ERYKARLVAKG+ Q  GVDY D+FSPVAKL TV+ ++
Sbjct: 945  PTGKNPIGCKWVYKIKHNADGSIERYKARLVAKGYTQMEGVDYFDTFSPVAKLTTVKTLL 1004

Query: 2221 ALATIKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGY----SKAKDGEVCHLQRSLYGLKQ 2054
            ALA+IKGW L QLDVNNAFLHG L+E++YM  P G     S +   +VC L +SLYGLKQ
Sbjct: 1005 ALASIKGWFLEQLDVNNAFLHGDLNEEVYMSLPPGVIIPNSCSNTPKVCRLHKSLYGLKQ 1064

Query: 2053 ASRQWNAEFCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALK 1874
            ASRQW ++    L   G++QS  DH LF+++              ++++G    EI ++K
Sbjct: 1065 ASRQWYSKLSSALLSLGYSQSAADHSLFLKKVGSSFTALLVYVDDIVLAGNNSLEITSVK 1124

Query: 1873 RYLDDLFTIKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPG 1694
             +LD  F IKDLG  R+F+G+EIAR+  G +LNQRKY L++L D+G L  K + TP  P 
Sbjct: 1125 SFLDKRFQIKDLGNLRFFVGLEIARSKKGILLNQRKYTLELLQDSGNLAAKPSSTPYDPS 1184

Query: 1693 LKLRAKEGDPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLL 1514
            LKL   E  P  DP  +RRLIGRLLYL  TRPD+T++VQQLSQFV+SP   H+ AA  +L
Sbjct: 1185 LKLHDSESPPYNDPSGYRRLIGRLLYLTTTRPDITFAVQQLSQFVSSPREVHFQAATKVL 1244

Query: 1513 RYLKGSPSVGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTT 1334
            RYLK SP+ GLF+S+ SSL L  + D+DWA+C  TRKS+TG+C+FLG+ L+SWK+KKQ+T
Sbjct: 1245 RYLKASPAKGLFFSSSSSLKLSGFSDSDWATCAITRKSITGYCVFLGTSLISWKSKKQST 1304

Query: 1333 VSRSSAEAEYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERT 1154
            VSRSS+EAEYRAL S  CE+QWL YL  DLG+    P  ++CDN++AI++  NP FHERT
Sbjct: 1305 VSRSSSEAEYRALASLSCELQWLHYLFKDLGIKFDAPAMVYCDNKSAIYLAHNPSFHERT 1364

Query: 1153 KHLEIDCHLVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT* 974
            KH+EIDCH+VR   +SG +HL  V S  QLAD  TK L  +AF  L SKLGL D+H+P  
Sbjct: 1365 KHIEIDCHVVRERIQSGLIHLLPVPSSSQLADVLTKQLSSSAFASLISKLGLLDIHSPAC 1424

Query: 973  GG 968
            GG
Sbjct: 1425 GG 1426



 Score =  143 bits (361), Expect = 7e-31
 Identities = 76/216 (35%), Positives = 128/216 (59%), Gaps = 10/216 (4%)
 Frame = -3

Query: 628 DPLKLYSSDHPGLSLVSSQLTGNNYLSWRRSMLIALGAKTKLGFINGKMEIPKEDSPKYD 449
           +P  L+ +++P + LV+  L G NY SW RSM IAL +K K+ F++G +E P+   P Y+
Sbjct: 15  NPYYLHPNENPAVVLVTPLLDGKNYHSWLRSMKIALLSKNKMKFVDGTLEQPRVSDPLYE 74

Query: 448 QWRKVDCMVISWILNSISKDLVDAFIYCDSAKDLWDDIAKRFGDCNGPLIYQLERDIANM 269
            W + + MV+SWI  SIS D+  + I+ D A  +W D+  RF   +   I  L+ +I  +
Sbjct: 75  PWIRCNSMVLSWIQRSISPDIAKSIIWFDHASAVWKDLEFRFSHGDMFKISDLQEEILRL 134

Query: 268 NQGNMSVVEYFTKLKRLWDELACIMPLPACE----------SDTRKLIDERDMNRKLMQF 119
           +QG++ +  Y+T+LK L +E+    P+  C           +D +K   E+D    +++F
Sbjct: 135 HQGSLDISSYYTQLKSLSEEIEIYRPVRDCTCAIPCSCGAVADMKK-YREQDC---VLKF 190

Query: 118 LMGLHESYDQVRNQLLLMDPLPSVDKAYSMALRVEK 11
           L GL+E Y  VR+Q+++M+PLP + K +S+ L+ E+
Sbjct: 191 LKGLNEQYSHVRSQIMMMEPLPPLHKVFSLVLQQER 226


>gb|KYP61022.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1316

 Score =  645 bits (1665), Expect = 0.0
 Identities = 342/697 (49%), Positives = 446/697 (63%), Gaps = 15/697 (2%)
 Frame = -3

Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFP-- 2840
            CL Y++    H++K   RA+ C+FLG+    KGY L++L  H LL SR+V+F ED FP  
Sbjct: 635  CLCYSSTITSHRTKLDPRAHPCIFLGFKPHTKGYLLFNLHTHGLLVSRNVLFHEDHFPSF 694

Query: 2839 -------FAS-IPVPSNXXXXXXXXXXXXXXXXXXTSSAPISQPESHPLRRSSRISKPPA 2684
                   F+S +P+  N                   +S   S P   PLRRS+R  +PP 
Sbjct: 695  TKPHSPSFSSPVPIHYNYVDYPTFPSSSIVESSDPPTSDQHSSPP--PLRRSTRPRRPPT 752

Query: 2683 WLSDF-----ITNSVHSSTPMASPSHSAGPDSGDFSLAPTSFNHSSILGATYTAFLANLS 2519
            +L DF      T++ HSST +  P HS       + L   SF+H          ++ ++S
Sbjct: 753  YLQDFHGAFTSTSTAHSSTGIRHPLHSFL----SYDLLSPSFHH----------YVFSIS 798

Query: 2518 NVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMRPDG 2339
            +V EP +F++A KS  W+ AM  E+ ALE NNTWVLT+LPP K AIGC+WVYKVK + DG
Sbjct: 799  SVTEPKNFAEASKSDSWLKAMHEEIFALEANNTWVLTTLPPHKTAIGCRWVYKVKHKADG 858

Query: 2338 TVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNAFLH 2159
            +++RYKARLVAKG+ Q  G+D+ D+FSPVAKL TVRL+++LA I  W L QLDVNNAFLH
Sbjct: 859  SIDRYKARLVAKGYTQMEGLDFFDTFSPVAKLTTVRLLLSLAAINNWHLKQLDVNNAFLH 918

Query: 2158 GYLDEDIYMLPPEGYSKAKDGEVCHLQRSLYGLKQASRQWNAEFCLKLQQFGFTQSGHDH 1979
            G L+E++YM  P G + +  G+VC LQRSLYGLKQASRQW A     L Q G+  S  DH
Sbjct: 919  GDLNEEVYMQLPPGLTPSFPGQVCRLQRSLYGLKQASRQWYARLSSFLIQHGYVPSPSDH 978

Query: 1978 CLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYFLGMEIAR 1799
             LF++ S             ++++G  L+EI  L   L   F IKDLG  +YFLG+E+AR
Sbjct: 979  SLFLKCSPATTTAILIYVDDIVLAGNDLTEIHHLTSLLHTTFQIKDLGNLKYFLGLEVAR 1038

Query: 1798 NTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEGDPLVDPERFRRLIGRLL 1619
            N  G  L QRKY+LD+L D GML  K   TP+   + L A  G PL D   +RRL+GRL+
Sbjct: 1039 NHTGIHLCQRKYILDLLSDTGMLASKPVSTPMDYSMHLSASSGTPLTDTAAYRRLVGRLI 1098

Query: 1618 YLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLEAYC 1439
            YL  TRPD+TY+VQQLSQFV++P T+H  A   +LRYLKG+P  G+F S +SS+ L A+ 
Sbjct: 1099 YLTNTRPDITYAVQQLSQFVSNPTTAHRQALFRILRYLKGTPGSGIFLSVNSSVQLRAFS 1158

Query: 1438 DADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVCEIQWLSY 1259
            D+DWA C DTR+S+TGF ++LG  L+SWK+KKQ TVSRSS+EAEYRAL +  CE+QWLSY
Sbjct: 1159 DSDWAGCPDTRRSITGFAVYLGDSLISWKSKKQITVSRSSSEAEYRALATTTCELQWLSY 1218

Query: 1258 LCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGFLHLGHVS 1079
            L  D  +   +P  L+CDNQ+A+ I  NPVFHERTKH+EIDCH+VR+   +G L L  VS
Sbjct: 1219 LLKDFHIDPISPSILYCDNQSALQIASNPVFHERTKHIEIDCHIVRDKVSTGLLKLLPVS 1278

Query: 1078 SRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT*GG 968
            S  QLAD  TK L    F   CSKLG+ ++H+   GG
Sbjct: 1279 SSQQLADILTKPLSPFVFRSHCSKLGMLNIHSQLEGG 1315



 Score =  100 bits (250), Expect = 1e-17
 Identities = 49/146 (33%), Positives = 88/146 (60%), Gaps = 6/146 (4%)
 Frame = -3

Query: 427 MVISWILNSISKDLVDAFIYCDSAKDLWDDIAKRFGDCNGPLIYQLERDIANMNQGNMSV 248
           MV+SW+++S+S  +  + ++ D A D+W D+  R+   +   +  L+ + +++ QG++SV
Sbjct: 1   MVVSWLVHSVSPSIRQSILWMDQADDIWKDLKTRYSQGDLLRVSDLQLEASSLKQGDLSV 60

Query: 247 VEYFTKLKRLWDELACIMPLPACESDTR------KLIDERDMNRKLMQFLMGLHESYDQV 86
            EYFTKL+ LWDEL    P P C    +       +I +R +  + MQFL GL++ Y+ V
Sbjct: 61  TEYFTKLRILWDELENFRPDPNCTCTIKCACSVLTIIAQRKLEDQAMQFLRGLNDQYNNV 120

Query: 85  RNQLLLMDPLPSVDKAYSMALRVEKQ 8
           ++ +LLM+  P + K +S  ++ E+Q
Sbjct: 121 KSHVLLME--PPISKIFSYVVQQERQ 144


>gb|PNX74277.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 762

 Score =  620 bits (1599), Expect = 0.0
 Identities = 327/718 (45%), Positives = 433/718 (60%), Gaps = 39/718 (5%)
 Frame = -3

Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834
            CLSYA     H++KF +RA K +FLG+  G KGY LYDL +H +  SR+VVF+E  FP  
Sbjct: 45   CLSYATTLQAHRTKFDSRARKAIFLGFKDGTKGYILYDLSSHDIFVSRNVVFYETYFPLR 104

Query: 2833 -----------SIPVPSNXXXXXXXXXXXXXXXXXXTSSAPISQPESHPLRRSSRISKPP 2687
                       S P+PSN                        + P S  +     IS P 
Sbjct: 105  HSQPVHNASDFSKPLPSNSILDDPVSHTHNSLPLPVMFEPDSTSPSSVNIEPDRTISSPA 164

Query: 2686 AWLSDFITNSVHSSTPMASPSHSAG----------------------PDSGDFSLAPTSF 2573
            +     +++S H    +A P +                           S + +++  ++
Sbjct: 165  SSSHTPLSSSSHDRPNLAPPPYHDNLRRSTRTITRPGYLEDYHCYSVTGSVNNNISHPNY 224

Query: 2572 NHSSILG-----ATYTAFLANLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLT 2408
              SS+L        Y +F  ++S + EP +FSQA K   W  AM  EL AL++N TW + 
Sbjct: 225  PLSSVLSYDNCVPEYKSFCCSISAIIEPKTFSQASKLDCWRKAMDAELLALDENKTWSVV 284

Query: 2407 SLPPGKRAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRL 2228
             LP GK  IGCKWVYK+K   +G++ERYKARLVAKG+ Q  G+DY D+FSPVAK+ TVR 
Sbjct: 285  DLPHGKTPIGCKWVYKIKYHANGSIERYKARLVAKGYTQMEGIDYFDTFSPVAKITTVRF 344

Query: 2227 VIALATIKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKA-KDGEVCHLQRSLYGLKQA 2051
            ++ALA+IKGW L QLDVNNAFLHG L+E++YM  P GYS A    +VC L +SLYGLKQA
Sbjct: 345  LLALASIKGWDLEQLDVNNAFLHGDLNEEVYMSLPPGYSSAIGSNKVCRLHKSLYGLKQA 404

Query: 2050 SRQWNAEFCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKR 1871
            SRQW ++    L  FG+ QS  DH L+++ +             ++++G    EI A+K 
Sbjct: 405  SRQWYSKLSSALISFGYKQSVSDHSLYIKSTDSEFTALLVYVDDIVLAGNSSKEIQAVKH 464

Query: 1870 YLDDLFTIKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGL 1691
            +LD  F IKDLG  RYFLG EIAR+  G  +NQRKY L++L D G L  K +  P  P  
Sbjct: 465  FLDQKFKIKDLGKLRYFLGFEIARSPKGIFVNQRKYTLELLQDTGFLATKPSNIPFNPTT 524

Query: 1690 KLRAKEGDPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLR 1511
            KL + +G PL DP  +RRLIGRLLYL  TRPD+++SVQ LSQFV+ P   H+ AA  +L+
Sbjct: 525  KLSSTDGAPLKDPSSYRRLIGRLLYLTNTRPDISFSVQHLSQFVSKPLIPHYTAATRILK 584

Query: 1510 YLKGSPSVGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTV 1331
            YLK +P+ GLF+   SSL L  Y D+DWA C DTRKS+TG+C+F+GS L+SWK+KKQ TV
Sbjct: 585  YLKSAPANGLFFPVSSSLKLTGYADSDWARCPDTRKSITGYCVFIGSSLISWKSKKQNTV 644

Query: 1330 SRSSAEAEYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTK 1151
            SRSS EAEYRAL S  CEIQWL YL  D  +    P +++CD+++AI++  NP FHER+K
Sbjct: 645  SRSSTEAEYRALASLTCEIQWLQYLFQDFKMKFSNPASVFCDSRSAIYLAHNPAFHERSK 704

Query: 1150 HLEIDCHLVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT 977
            H+EIDCH++R   +S  +HL  + S  Q+AD FTK L   AF  L SKL L  +H+PT
Sbjct: 705  HIEIDCHVIREKIQSQLIHLLPIPSNSQIADMFTKPLHFPAFFDLLSKLNLCSIHSPT 762


>gb|KYP65734.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Cajanus cajan]
          Length = 1013

 Score =  625 bits (1611), Expect = 0.0
 Identities = 336/704 (47%), Positives = 434/704 (61%), Gaps = 22/704 (3%)
 Frame = -3

Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPF- 2837
            CL+YA+    H++KF  RA K V LGY  G KGY LYDL +H    SR+V F E +FPF 
Sbjct: 313  CLAYASTLQAHRTKFQPRAKKSVLLGYKEGVKGYLLYDLHSHEFFMSRNVFFHEFTFPFH 372

Query: 2836 ----ASIPVPSNXXXXXXXXXXXXXXXXXXTSSAPIS-------------QPESHPLRRS 2708
                 S+  PS                      +P S              P   P R S
Sbjct: 373  TPSQTSLTQPSPTPITIQTPISSPYDLDNHVPPSPTSSTSIPPEQPHQPLSPAPAPSRHS 432

Query: 2707 SRISKPPAWLSDFITNSVHSSTPMASPSHSAGPDSGDFSLAPTSFNHSSILGATYTAFLA 2528
            +R+ +PP++L D+  + +  +  + S S  + P S   +L             +Y  F  
Sbjct: 433  TRMRQPPSYLKDYHCSLLAPTGRINSFSGISTPHSISSTLT------YDFCSPSYKQFCL 486

Query: 2527 NLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMR 2348
            ++S   EP +++QA K   W+ AM+ EL AL+ N TW +  LP GKR IGCKWVYK+K  
Sbjct: 487  SVSTNFEPHTYTQASKYDCWIMAMKTELAALDMNQTWSIVDLPSGKRPIGCKWVYKIKYL 546

Query: 2347 PDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNA 2168
             DG++ERYKARLVAKG+ Q  G+DYLD++SPVAKL TVR+++AL  IKGW L QLDVNNA
Sbjct: 547  SDGSIERYKARLVAKGYSQTEGLDYLDTYSPVAKLTTVRVLLALTAIKGWFLEQLDVNNA 606

Query: 2167 FLHGYLDEDIYMLPPEGYSKAKDG----EVCHLQRSLYGLKQASRQWNAEFCLKLQQFGF 2000
            FLHG L E++YM  P G S         +VC L +S+YGLKQASRQW ++    L   G+
Sbjct: 607  FLHGDLHEEVYMTLPPGLSVPSSSNTAPKVCKLHKSIYGLKQASRQWYSKLSSALISMGY 666

Query: 1999 TQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYF 1820
            + S  DH LF++ S             +I++G    EID +K  L   F IKDLG  RYF
Sbjct: 667  SPSTADHSLFIKSSSSHFTALLVYVDDIILAGNDKPEIDFIKAQLHKCFKIKDLGNLRYF 726

Query: 1819 LGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEGDPLVDPERFR 1640
            LG+EIAR+  G +LNQRKY L+IL D G L  K + TP  P LKL +  G P  D   +R
Sbjct: 727  LGLEIARSNKGILLNQRKYTLEILEDVGFLAAKPSSTPFNPSLKLHSDHGSPYNDETAYR 786

Query: 1639 RLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSS 1460
            RLIGRLLYL  TRPD++Y VQQLSQFV+ P   H+ AA  +LRYLKGS   GLFYS+ +S
Sbjct: 787  RLIGRLLYLTTTRPDISYVVQQLSQFVSKPLDIHYQAATRILRYLKGSHGRGLFYSSSAS 846

Query: 1459 LCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVC 1280
            L L A+ D+DWASC  +RKS+TGFC+FLGS L+SW++KKQ+T+SRSS+EAEYRAL S  C
Sbjct: 847  LKLSAFADSDWASCSISRKSITGFCVFLGSSLISWRSKKQSTISRSSSEAEYRALASLTC 906

Query: 1279 EIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGF 1100
            E+QWL YL  DL  S+  P +++CDN++AI++  NP FHERTKH+EIDCH++R   +S  
Sbjct: 907  ELQWLHYLFNDLKTSLNFPTSVFCDNKSAIYLAHNPTFHERTKHIEIDCHVIREKIQSRL 966

Query: 1099 LHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT*GG 968
            LHL  V S  QLAD FTK L   +F    SKLGL+D+H+   GG
Sbjct: 967  LHLLPVPSSSQLADAFTKPLHATSFNSFVSKLGLYDVHSSACGG 1010


>gb|OMO65653.1| hypothetical protein CCACVL1_21443 [Corchorus capsularis]
          Length = 1245

 Score =  630 bits (1624), Expect = 0.0
 Identities = 331/692 (47%), Positives = 435/692 (62%), Gaps = 13/692 (1%)
 Frame = -3

Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834
            CL YA        KF+ R+ KC+F+GY  G KGYR+YDL    +  SRDV F+E+ FPF 
Sbjct: 558  CLCYALQKPKPNDKFSPRSSKCIFVGYPNGTKGYRVYDLTTKKIFVSRDVRFYENQFPFE 617

Query: 2833 SIPVPSNXXXXXXXXXXXXXXXXXXTSSAPIS----QPESHP--------LRRSSRISKP 2690
            +    +N                    S P +    QP+ HP          R  R    
Sbjct: 618  NTSTSTNDQTVVPLPALEDTDLSITHDSIPPNPPQEQPQPHPPTNPPNQPSTRPQRTKTR 677

Query: 2689 PAWLSDFITNSVHSSTPMASPSHSAGPDSGD-FSLAPTSFNHSSILGATYTAFLANLSNV 2513
            P  L D + N+       +S +H A   SG  +SL+  +F       +++ AFLA +S  
Sbjct: 678  PKRLDDCVCNNSKVDNSPSSLTHEAS--SGTLYSLS--NFISYDNFHSSHKAFLAAISLR 733

Query: 2512 EEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMRPDGTV 2333
            +EP SFSQA KS  W +AMQ+EL ALE NNTW L +LPP K+ IGCKW++K+K + DGT+
Sbjct: 734  DEPKSFSQAVKSPQWREAMQKELAALENNNTWTLETLPPRKKPIGCKWIFKIKYKSDGTI 793

Query: 2332 ERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNAFLHGY 2153
            ERYKAR VAKG++Q  G+D+ ++F+PVAKLVTVR ++A+A IK W L QLDVNNAFLHG 
Sbjct: 794  ERYKARFVAKGYNQIEGMDFHETFAPVAKLVTVRCLLAIAAIKNWELHQLDVNNAFLHGD 853

Query: 2152 LDEDIYMLPPEGYSKAKDGEVCHLQRSLYGLKQASRQWNAEFCLKLQQFGFTQSGHDHCL 1973
            LDE++YM  P GY    D  VC +++SLYGLKQASR W A+F   L +FGF QS  D+ L
Sbjct: 854  LDEEVYMSLPPGYGDKNDSRVCRVRKSLYGLKQASRNWFAKFFAALLEFGFIQSTVDYSL 913

Query: 1972 FVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYFLGMEIARNT 1793
            F   +             +II+G     I +LK++LD  F IKDLG  +YFLG+E+AR++
Sbjct: 914  FTLTTGSSFLVVLVYVDDLIIAGDDSVRIRSLKQHLDSRFHIKDLGPLKYFLGIEVARSS 973

Query: 1792 DGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEGDPLVDPERFRRLIGRLLYL 1613
             G  L QRKY LDIL + GM   K +  P+     L    G P+ DP ++RRL+GRL+YL
Sbjct: 974  SGIFLCQRKYTLDILEECGMTDAKPSAFPMEQKHNLTHDTGPPVQDPMQYRRLVGRLIYL 1033

Query: 1612 NLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLEAYCDA 1433
             +TRP+++Y+V  LSQF+N P   H DAA+ +LRYLK  P  G+F+S+ SS  L  + D+
Sbjct: 1034 TITRPEISYAVHILSQFMNDPRQPHLDAALRVLRYLKSCPGQGIFFSSSSSPHLTGFSDS 1093

Query: 1432 DWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVCEIQWLSYLC 1253
            DWASC  TR+S TG+   LGS  +SWKTKKQTTVSRSSAEAEYRA+ + V E+ WL  L 
Sbjct: 1094 DWASCPQTRRSTTGYITMLGSSPISWKTKKQTTVSRSSAEAEYRAMAATVSELLWLRSLL 1153

Query: 1252 ADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGFLHLGHVSSR 1073
              LG+    P+ L+CDNQ AIHI  NPVFHERTKH+E+DCH +R+  ++  +   H+SS+
Sbjct: 1154 QTLGIPHQQPMALFCDNQVAIHIATNPVFHERTKHIELDCHFIRSHIQAKSIQTSHISSK 1213

Query: 1072 LQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT 977
            LQLAD FTK+LGR  F  L  KLG+F+LH PT
Sbjct: 1214 LQLADIFTKALGRDQFQFLLRKLGIFNLHAPT 1245



 Score =  182 bits (463), Expect = 3e-43
 Identities = 93/209 (44%), Positives = 132/209 (63%), Gaps = 1/209 (0%)
 Frame = -3

Query: 625 PLKLYSSDHPGLSLVSSQLTGNNYLSWRRSMLIALGAKTKLGFINGKMEIPKEDSPKYDQ 446
           P  L  SDHPG  LVS  L G+NY +W R+M  AL A+ K GF++G +  P+  SP    
Sbjct: 29  PYLLQPSDHPGAILVSCPLNGDNYPTWARAMTNALRARNKYGFVDGSLAKPEATSPDVST 88

Query: 445 WRKVDCMVISWILNSISKDLVDAFIYCDSAKDLWDDIAKRFGDCNGPLIYQLERDIANMN 266
           W K + MVISWI NS+S DL ++  Y D+A+++W D+ +RF   N P I QL+RD+A   
Sbjct: 89  WEKCNSMVISWIFNSLSSDLHNSVAYVDTAREMWLDLEERFSQGNAPRINQLKRDLALTF 148

Query: 265 QGNMSVVEYFTKLKRLWDELACIMPLPACESDTRK-LIDERDMNRKLMQFLMGLHESYDQ 89
           Q NMSV  Y+TKLK +WDEL     +P C     K L+ ER+   K+ QF+MGL +S+  
Sbjct: 149 QINMSVAAYYTKLKGIWDELQTYSTIPPCTCGAAKELLLERE-REKVHQFIMGLDDSFRS 207

Query: 88  VRNQLLLMDPLPSVDKAYSMALRVEKQRN 2
           V + +L ++PLPS+ KAY++  R E++ +
Sbjct: 208 VSSHILNIEPLPSLSKAYALVTRAERENS 236


>gb|PNY00469.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 778

 Score =  614 bits (1583), Expect = 0.0
 Identities = 329/736 (44%), Positives = 445/736 (60%), Gaps = 57/736 (7%)
 Frame = -3

Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834
            CLS+A     H++KF +RA KCVF+GY  G KGY LYDL +H++  SR+VVF+E   PF 
Sbjct: 45   CLSFATTLQAHRTKFDSRARKCVFIGYKDGTKGYILYDLHSHNIFLSRNVVFYEHVLPFK 104

Query: 2833 SIPVPS---NXXXXXXXXXXXXXXXXXXTSSAPISQPESHPLRRSSRISKP--PAWLSDF 2669
            S+P P+   N                    + P+S      +  +  ++ P  P   S  
Sbjct: 105  SVPGPTSSHNSPTFPLYDDPLDISHNPCVDTFPLSTGSLDNVSLNPALTPPLVPTLDSSP 164

Query: 2668 ITNSVHSSTPMASPS----HSAGPD----------------------------------- 2606
            +T  ++++TP A PS    HSA                                      
Sbjct: 165  LTPPINTATP-APPSFDSAHSAADQPSPNLDSVPVPSEPSIPLPTRVSTRVTRPPSYLQD 223

Query: 2605 ------SGDFSLAPTSFNH--SSIL-----GATYTAFLANLSNVEEPSSFSQACKSADWV 2465
                  SG  +   ++  H  SS+L        Y  F  ++S+  EP++++QA K   W 
Sbjct: 224  YHCNIKSGCTNQVSSNIVHPLSSVLSYNTCSPAYKLFCCSISSTIEPTTYNQASKFDCWK 283

Query: 2464 DAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEH 2285
             AM  E+TALE N TW +  LP GK  IGCKWVYK+K   +GT+ERYKARLVAKG+ Q  
Sbjct: 284  KAMDAEITALELNKTWTVVDLPCGKVPIGCKWVYKIKYHANGTIERYKARLVAKGYTQME 343

Query: 2284 GVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKA 2105
            GVDY D+FSPVAK+ TVR+++A+A ++GW L QLDVNNAFLHG L E++YM  P GY  A
Sbjct: 344  GVDYFDTFSPVAKMTTVRVLLAVAAVRGWHLEQLDVNNAFLHGDLHEEVYMSLPPGYD-A 402

Query: 2104 KDGEVCHLQRSLYGLKQASRQWNAEFCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXX 1925
               +VC L +SLYGLKQASRQW ++    L   G+  S  DH L+V+             
Sbjct: 403  TPSKVCKLNKSLYGLKQASRQWYSKLSAALISLGYQASQADHSLYVKSHGTSFTALLVYV 462

Query: 1924 XXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILH 1745
              ++++GT + EI ++K +LD  F IKDLG  R+FLG+EIAR++ G  LNQRKY L++L 
Sbjct: 463  DDIVLAGTSIEEIKSVKLFLDQQFKIKDLGPLRFFLGLEIARSSSGIFLNQRKYTLELLE 522

Query: 1744 DAGMLHCKAAITPLPPGLKLRAKEGDPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQ 1565
            D G L  K A  PL P  KL A +G P  DP  +RRLIGRLLYL  TRPD++++VQ LSQ
Sbjct: 523  DTGFLGSKPATVPLDPHTKLSATDGVPFDDPSGYRRLIGRLLYLTHTRPDISFAVQHLSQ 582

Query: 1564 FVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFC 1385
            +V++P   H+ AA  +LRYLK  P+ G+ +S+HS L L  + D+DWA C +TR+S+TG+C
Sbjct: 583  YVSTPLVPHYQAATRILRYLKSCPAKGVLFSSHSPLQLHGFADSDWACCPNTRRSVTGYC 642

Query: 1384 IFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCD 1205
            + LGS L+SWK+KKQ TVSRSS EAEYRAL S  CE+QWL YL  DL ++ P   +++CD
Sbjct: 643  VLLGSSLISWKSKKQNTVSRSSTEAEYRALASLTCELQWLQYLFQDLHITFPQSASVYCD 702

Query: 1204 NQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAF 1025
            N++AI++  NP FHER+KH+E+DCH++R   +S  +HL  V S+ QLAD FTK L   AF
Sbjct: 703  NKSAIYLAHNPTFHERSKHIELDCHIIREKLQSKLIHLLSVPSKSQLADVFTKPLHSPAF 762

Query: 1024 LLLCSKLGLFDLHNPT 977
              + SKLGL  +H+PT
Sbjct: 763  SSMLSKLGLCSIHHPT 778


>dbj|GAU41679.1| hypothetical protein TSUD_272630 [Trifolium subterraneum]
          Length = 1178

 Score =  627 bits (1616), Expect = 0.0
 Identities = 329/716 (45%), Positives = 442/716 (61%), Gaps = 37/716 (5%)
 Frame = -3

Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834
            CLSYA     H++KF +RA K +FLGY  G KGY LYDL +H +  SR+V+F+E  FPF 
Sbjct: 476  CLSYATTLQAHRTKFVSRARKAIFLGYKDGTKGYILYDLHSHEIFVSRNVIFYETDFPFH 535

Query: 2833 -----------------------------SIPVPSNXXXXXXXXXXXXXXXXXXTSSAPI 2741
                                         ++P+P                      S PI
Sbjct: 536  LSNSVKTDSASPASHLNHTLLYDAEPDPNALPIP---VMHEPDLTLSPIIGPSYNDSTPI 592

Query: 2740 SQPESHP------LRRSSRISKPPAWLSDFITNSVHSSTPMASPSHSAGPDSGDFSLAPT 2579
            + PES P      LR+SSR+ + P  L  F     H  T + +  HSA   +  + L+  
Sbjct: 593  NSPESSPIPNPAPLRKSSRVIQRPRHLEGF-----HCETLIGT--HSAASSNTVYPLSSV 645

Query: 2578 -SFNHSSILGATYTAFLANLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSL 2402
             S+N+ +     Y A   ++S + EP +++QA K   W +AM  EL AL++N TW +  L
Sbjct: 646  LSYNNCA---PNYHALCCSISAIVEPKTYTQASKFECWRNAMNAELLALDENKTWSVVDL 702

Query: 2401 PPGKRAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVI 2222
            P GK  +GCKWVYKVK   +G++ERYKARLVAKG+ Q  GVDY D+FSPVAK+ TVR+++
Sbjct: 703  PNGKVPVGCKWVYKVKYHANGSIERYKARLVAKGYTQLEGVDYFDTFSPVAKITTVRVLL 762

Query: 2221 ALATIKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKAKDG-EVCHLQRSLYGLKQASR 2045
            ALA+IKGW L QLDVNNAFLHG L+ED+YM  P G++   +  +VC L +S+YGLKQASR
Sbjct: 763  ALASIKGWHLEQLDVNNAFLHGDLNEDVYMSLPPGFAATNESNKVCKLHKSIYGLKQASR 822

Query: 2044 QWNAEFCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYL 1865
            QW ++    L   G+T S  DH L+++ +             ++++G  + EI  +K +L
Sbjct: 823  QWYSKLSSSLVSLGYTPSQSDHSLYIKSTTNSFTALLVYVDDIVLAGNSIHEIQTVKLFL 882

Query: 1864 DDLFTIKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKL 1685
            D  F IKDLG  RYFL +EIAR+  G  +NQRKY L++L D G+L  K +  P  P  KL
Sbjct: 883  DQKFKIKDLGKLRYFLVLEIARSDTGIFVNQRKYTLELLEDVGLLGTKPSSIPFHPTTKL 942

Query: 1684 RAKEGDPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYL 1505
             + +G PL DP  +RRLIGRLLYL  TRPD+++SVQ LSQFV+ P   H++AAMH+L+YL
Sbjct: 943  SSTDGAPLDDPSSYRRLIGRLLYLTHTRPDISFSVQHLSQFVSKPLVPHYNAAMHILKYL 1002

Query: 1504 KGSPSVGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSR 1325
            K  P+ G+F SA SSL + A+ D+DWA C +TRKS+ GFC+ LGS L+SWK+KKQ TVSR
Sbjct: 1003 KSDPAKGIFLSASSSLKISAFADSDWARCPETRKSIIGFCVLLGSSLISWKSKKQNTVSR 1062

Query: 1324 SSAEAEYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHL 1145
            SS EAEYRAL S  CEIQWL Y+  D  +    P  ++CDN++AI++  NP FHER+KH+
Sbjct: 1063 SSTEAEYRALASLTCEIQWLQYIFQDFKIIFSNPAYVFCDNKSAIYLAHNPTFHERSKHI 1122

Query: 1144 EIDCHLVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT 977
            E+DCH++R   +S  +HL  V +  QLAD FTK L   AF    SKLGL  +H+PT
Sbjct: 1123 ELDCHVIREKIQSKLIHLLPVPTTSQLADVFTKPLNHPAFSSFLSKLGLCSIHSPT 1178



 Score =  142 bits (359), Expect = 1e-30
 Identities = 75/216 (34%), Positives = 126/216 (58%), Gaps = 9/216 (4%)
 Frame = -3

Query: 628 DPLKLYSSDHPGLSLVSSQLTGNNYLSWRRSMLIALGAKTKLGFINGKMEIPKEDSPKYD 449
           +P  L+ +++P + LVS  L   NY +W RSM IAL +K K  FI+G +  P    P Y 
Sbjct: 13  NPYYLHPNENPAVILVSPPLDHKNYHTWSRSMQIALISKNKDKFIDGTLVKPSPLDPLYS 72

Query: 448 QWRKVDCMVISWILNSISKDLVDAFIYCDSAKDLWDDIAKRFGDCNGPLIYQLERDIANM 269
            W + + MV++WI  S+S  +  + ++ DSA  LW ++  RF   +   I  L+ ++  +
Sbjct: 73  PWIRCNTMVLAWIHRSLSDSIARSVLWIDSAASLWKNLRTRFSQGDIFRISDLQEELYRL 132

Query: 268 NQGNMSVVEYFTKLKRLWDELACIMPLPACES---------DTRKLIDERDMNRKLMQFL 116
            QGN+ V +YFTKL+ LWDEL    P+P C+          ++ KL  E+D    +++FL
Sbjct: 133 RQGNLDVSDYFTKLQVLWDELENYRPIPLCKCSIACTCGAVESFKLYREQDY---VIRFL 189

Query: 115 MGLHESYDQVRNQLLLMDPLPSVDKAYSMALRVEKQ 8
            GL++ +   ++Q++L++PLP VD  +SM ++ E++
Sbjct: 190 KGLNDRFSNTKSQIMLINPLPDVDTVFSMLIQQERE 225


>gb|KYP42321.1| Copia protein [Cajanus cajan]
          Length = 1456

 Score =  630 bits (1624), Expect = 0.0
 Identities = 334/736 (45%), Positives = 451/736 (61%), Gaps = 57/736 (7%)
 Frame = -3

Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834
            CL++A      ++K   RA KC+FLGY  G KG+ L++L N S L SRDV+F+E  FP++
Sbjct: 732  CLAFATTLSSKRTKLDRRASKCIFLGYKNGTKGFLLFNLHNKSFLISRDVLFYEKIFPYS 791

Query: 2833 S-------------------------------------------IPVPSNXXXXXXXXXX 2783
            +                                            P+PS+          
Sbjct: 792  AHVPSMSASDSLLLDVVKDNDTTIYSDPFPTTTFSHGSPSIPLDTPLPSSETTISTDRPP 851

Query: 2782 XXXXXXXXTSSAPISQPE--------------SHPLRRSSRISKPPAWLSDFITNSVHSS 2645
                      +A +S PE                  R S+RI KPP +L ++   ++ SS
Sbjct: 852  FSPINTCPIPTATLSTPELPSSNTTNDASQVVMPQTRVSTRIRKPPRYLQEYYCENLASS 911

Query: 2644 TPMASPSHSAGPDSGDFSLAPTSFNHSSILGATYTAFLANLSNVEEPSSFSQACKSADWV 2465
            +  ++  +             +SF   +    ++T+F  ++S   EP+SF +A     W 
Sbjct: 912  SAASNCLYPL-----------SSFVTYNNCSPSHTSFCLSISAQHEPTSFKEANSEECWR 960

Query: 2464 DAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEH 2285
             AM+ EL ALE+N TW L  LP GKR +GCKWVY+VK + DG+VERYKARLVAKGF Q  
Sbjct: 961  RAMEAELQALEKNQTWSLVRLPEGKRPVGCKWVYRVKYKVDGSVERYKARLVAKGFTQTE 1020

Query: 2284 GVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKA 2105
            GVDY ++FSPV KL TVR +++LA    W L QLDV+NAFLHG L E++YM PP G+  +
Sbjct: 1021 GVDYFETFSPVVKLSTVRFLLSLAAAHNWFLHQLDVDNAFLHGDLFEEVYMKPPPGFKLS 1080

Query: 2104 KDGEVCHLQRSLYGLKQASRQWNAEFCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXX 1925
                VC L +SLYGLKQASRQWN +    L    F QS  DH LF+++S           
Sbjct: 1081 HPRLVCKLHKSLYGLKQASRQWNQKLTEALISLNFIQSSTDHSLFIKKSHSSITALLVYV 1140

Query: 1924 XXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILH 1745
              V+++G  ++EI A+K YL   F IKDLG  ++FLG+EIAR+  G +LNQRKY L++L 
Sbjct: 1141 DDVVLTGNDMAEISAVKAYLHAQFHIKDLGPLKFFLGLEIARSQSGLILNQRKYCLELLS 1200

Query: 1744 DAGMLHCKAAITPLPPGLKLRAKEGDPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQ 1565
            + G+  CK   TP+   +KL A EG PL DP  FRRLIGRLLYL  TRPD++++VQQLSQ
Sbjct: 1201 EHGLTDCKPVSTPIDASVKLYASEGLPLDDPTIFRRLIGRLLYLTNTRPDISFAVQQLSQ 1260

Query: 1564 FVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFC 1385
            FV+SP  +H+ AA+ +LRYLK SP++GLFY + +   ++A+ D+DWASC +TR+S+TGFC
Sbjct: 1261 FVDSPRATHFQAALRILRYLKSSPALGLFYPSQTEHRIQAFSDSDWASCPNTRRSVTGFC 1320

Query: 1384 IFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCD 1205
            IF GS L+SWK+KKQ+TVSRSS+EAEYRAL S  CE+QWL +LC DL ++IPTP +++CD
Sbjct: 1321 IFYGSALISWKSKKQSTVSRSSSEAEYRALASVTCELQWLLFLCHDLSINIPTPFSIFCD 1380

Query: 1204 NQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAF 1025
            +Q+AI+I +NP FHERTKH+E+DCHL R   + G +HL HV S+ QLAD FTK+L    F
Sbjct: 1381 SQSAIYIAKNPTFHERTKHIEVDCHLTRLKIQQGLIHLFHVPSKSQLADVFTKALYPRNF 1440

Query: 1024 LLLCSKLGLFDLHNPT 977
                SKL L D++NPT
Sbjct: 1441 TEAVSKLCLIDIYNPT 1456



 Score =  114 bits (286), Expect = 6e-22
 Identities = 59/183 (32%), Positives = 103/183 (56%), Gaps = 7/183 (3%)
 Frame = -3

Query: 535 MLIALGAKTKLGFINGKMEIPKEDSPKYDQWRKVDCMVISWILNSISKDLVDAFIYCDSA 356
           ML AL +K K  FI+G +  P    P    WR+ +  V+SW++ S++  +  + +Y D+A
Sbjct: 1   MLTALESKNKEQFIDGSLPSPPTSDPLRSTWRRCNKTVMSWLIRSMTPSIAQSVLYMDTA 60

Query: 355 KDLWDDIAKRFGDCNGPLIYQLERDIANMNQGNMSVVEYFTKLKRLWDELACIMPLPACE 176
            ++W D+ +RF   +   I  L+  +    QG+ +V +Y+T LK LW +L     +  C 
Sbjct: 61  AEIWKDLCERFSHGDKFRISDLQASVHECKQGDSTVSQYYTHLKTLWKQLEQYRSVLICS 120

Query: 175 SDT-------RKLIDERDMNRKLMQFLMGLHESYDQVRNQLLLMDPLPSVDKAYSMALRV 17
            D         K+  ER+ +  +++FL GL+E Y QVR+ +L+MDP+PS+ K +S+  + 
Sbjct: 121 CDNPCSCGILLKIKKERE-DDCVIKFLRGLNEEYSQVRSNILMMDPMPSITKTFSLIQQH 179

Query: 16  EKQ 8
           E++
Sbjct: 180 ERE 182


>gb|KYP34298.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Cajanus cajan]
          Length = 1002

 Score =  612 bits (1578), Expect = 0.0
 Identities = 318/696 (45%), Positives = 437/696 (62%), Gaps = 14/696 (2%)
 Frame = -3

Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834
            CL+YA   H ++ KF  R  +CVFLG+    KG  LYDL++     SR V +FE  FPF 
Sbjct: 316  CLAYATTLHHNRKKFDPRGRRCVFLGFKPQVKGSILYDLNSRETFLSRHVEYFEHIFPFL 375

Query: 2833 ---------SIPVPSNXXXXXXXXXXXXXXXXXXTSSAPISQPESHP-LRRSSRISKPPA 2684
                     +I +P +                   +S+P+S     P +R+S+R  K P+
Sbjct: 376  PTSPLDLTQTISLPRHQPPLPIDTDPTPLSTNTTPTSSPVSVVPPPPFVRKSTRPRKLPS 435

Query: 2683 WLSDFITN--SVHSSTPMASPSHSAGPDSGDFSLAPTSFNHSSILGATYTAFLANLSNVE 2510
            +L D+     + H+S  ++ P +S        +L+P+             AF  ++S+++
Sbjct: 436  YLHDYHHTLLTTHNSPTISQPLYSIHNHISYSNLSPSQ-----------KAFSLSISSIK 484

Query: 2509 EPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMRPDGTVE 2330
            EP+S+ +A +   W  A+Q ELTALE+NNTW+LT LPP K+ +GCKWV+K+K   DGT+E
Sbjct: 485  EPNSYVEAIQDESWKTAIQTELTALEKNNTWILTPLPPNKQVVGCKWVFKLKFNSDGTIE 544

Query: 2329 RYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNAFLHGYL 2150
            R+KARLVAKG+ Q   +DYLD+FSPV K+ TVR ++A+AT K W + QLDVN  FLHG L
Sbjct: 545  RHKARLVAKGYTQTETLDYLDTFSPVVKMTTVRTLLAVATAKNWHIHQLDVNTTFLHGDL 604

Query: 2149 DEDIYMLPPEGY--SKAKDGEVCHLQRSLYGLKQASRQWNAEFCLKLQQFGFTQSGHDHC 1976
             E++YM PP G   S  +   VC L +SLYGLKQASRQWNA+    L   GF QS  D+ 
Sbjct: 605  HEEVYMTPPPGLTVSPHQSNCVCKLVKSLYGLKQASRQWNAKLTSVLIDSGFKQSMADYS 664

Query: 1975 LFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYFLGMEIARN 1796
            LF +Q              ++++G   +EI+ +K  LD  FTIKDLG  +YFLGME+AR+
Sbjct: 665  LFTKQFGAKFTAILVYVDDLVLAGNDPTEINYIKSLLDQKFTIKDLGQLKYFLGMEVARS 724

Query: 1795 TDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEGDPLVDPERFRRLIGRLLY 1616
            + G  L QRKY LD++ D G+L  K   +P+   +KL    G PL DP ++RRL+GRL+Y
Sbjct: 725  STGIALYQRKYALDLIEDTGLLASKPCKSPMDHSVKLHKTVGTPLTDPTQYRRLLGRLIY 784

Query: 1615 LNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLEAYCD 1436
            L  TR D+++SV  LSQF++ P   H+ AA+ +L+Y+K +P  GLF+ + S L L+ Y D
Sbjct: 785  LTNTRADISFSVNHLSQFMDQPTDVHYQAALRILKYVKNAPGKGLFFPSSSDLTLKGYSD 844

Query: 1435 ADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVCEIQWLSYL 1256
            +DWASC DTR+S+TGF  FLG  L+SWK+KKQ TVS+SSAEAEYRAL    CE QWLSYL
Sbjct: 845  SDWASCSDTRRSVTGFSFFLGPALISWKSKKQATVSKSSAEAEYRALAQSTCEAQWLSYL 904

Query: 1255 CADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGFLHLGHVSS 1076
              D GL    P+ L+CDNQ+A+HI  NPVFHERTK++E+DCH+VR   + G +HL  +S+
Sbjct: 905  LHDFGLHSFHPIVLFCDNQSALHIASNPVFHERTKNIELDCHIVREKLQVGLIHLLPIST 964

Query: 1075 RLQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT*GG 968
              QLAD FTK+L    F  +  KLG+FD+H+   GG
Sbjct: 965  ADQLADVFTKALSLRPFEQIIFKLGMFDIHSSLRGG 1000


>ref|XP_012486681.1| PREDICTED: LOW QUALITY PROTEIN: retrovirus-related Pol polyprotein
            from transposon TNT 1-94 [Gossypium raimondii]
          Length = 683

 Score =  600 bits (1548), Expect = 0.0
 Identities = 318/689 (46%), Positives = 429/689 (62%), Gaps = 11/689 (1%)
 Frame = -3

Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834
            CLS+A+    H+ KF  RA +CVFLGY    KGY L D++  ++  SR+V F E  FPF 
Sbjct: 7    CLSFASTLSAHRKKFDPRAKQCVFLGYKPHVKGYILLDIETRAIFVSRNVTFHETIFPFL 66

Query: 2833 -------SIPVPSNXXXXXXXXXXXXXXXXXXTSSAPISQPESHPLR--RSSRISKPPAW 2681
                   + PV                       S+  S P + P    R  R  +PP++
Sbjct: 67   QHSLNNPTTPVGLLASDTIYDSPISPPQPSSTDQSSSTSHPPTQPSTSSRPQRNRRPPSY 126

Query: 2680 LSDFITNSVHSSTPMASPSHSAGPDSGDFSLAPTSFNHSSILGATYTAFLANLSNVEEPS 2501
            L D+    + ++T      HS        +L+P   + +  + A+            EP 
Sbjct: 127  LQDYQHYQLPAATNHPGTPHSIFNCISYHNLSPQHLHFTLAISASI-----------EPK 175

Query: 2500 SFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMRPDGTVERYK 2321
            ++ QA K   W +AMQ E+ ALEQNNTW +T+LPPGK   GCKWV++VK R DG+ ERYK
Sbjct: 176  TYKQASKFTHWNEAMQAEINALEQNNTWTMTTLPPGKTPXGCKWVFRVKHRADGSTERYK 235

Query: 2320 ARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNAFLHGYLDED 2141
            ARLVAKG+ Q  GVDY D+FSPVAK+ TVRL++ALAT + W + QLDVNNAFLHG L+ED
Sbjct: 236  ARLVAKGYTQI-GVDYFDTFSPVAKITTVRLLLALATSRHWHIQQLDVNNAFLHGDLNED 294

Query: 2140 IYMLPPEGYSKAKDGEVCHLQRSLYGLKQASRQWNAEFCLKLQQFGFTQSGHDHCLFVRQ 1961
            +YMLPP G+S     +VC L +S+YGLKQASRQW ++    L   G+ QS  DH +F ++
Sbjct: 295  VYMLPPPGFSHDST-KVCKLHKSIYGLKQASRQWFSKLTTALISLGYIQSTADHSMFTKK 353

Query: 1960 SXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYFLGMEIARNTDGSV 1781
                          +I++GT   EI  +K++LD  F IKDLG  +YFLG+E+AR + G  
Sbjct: 354  HSEDFTVLLIYVDDIILTGTSSPEIMKVKQFLDTTFRIKDLGDLKYFLGLEVARTSQGIH 413

Query: 1780 LNQRKYVLDILHDAGMLHCKAAITPLPPG--LKLRAKEGDPLVDPERFRRLIGRLLYLNL 1607
            ++QRKY L+IL ++G + CK A TP+      KL + +G+ L D   +R+L+G+LLYL  
Sbjct: 414  ISQRKYALEILQESGFIECKPAKTPMATKSVYKLTSTDGELLSDITSYRQLVGKLLYLTS 473

Query: 1606 TRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLEAYCDADW 1427
            TR D+T++VQQLSQF++ P T+H  AA  +LRYLKG PS GLFY A SS  L+A+ D+DW
Sbjct: 474  TRLDLTFAVQQLSQFMDKPTTNHLQAAHRVLRYLKGCPSTGLFYPASSSFELKAFSDSDW 533

Query: 1426 ASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVCEIQWLSYLCAD 1247
            A C +TR+S+TG+CIF G  L+SW+ KKQ TVSRSS+EAEYRAL S VCE+QWL YL  D
Sbjct: 534  AGCPETRRSITGYCIFFGEALISWRAKKQPTVSRSSSEAEYRALASTVCEVQWLHYLLCD 593

Query: 1246 LGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGFLHLGHVSSRLQ 1067
            L + I     ++CDN++ I I  NP FHERTKH+EIDCH+VR   +   +HL   +S  Q
Sbjct: 594  LHVPISHATPVFCDNKSTIQIASNPTFHERTKHIEIDCHIVREKLQKDIVHLLPCTSSAQ 653

Query: 1066 LADFFTKSLGRAAFLLLCSKLGLFDLHNP 980
            LAD FTK+L    F  + SKLG+ ++H+P
Sbjct: 654  LADLFTKALAAQPFQDMISKLGMLNIHSP 682


>gb|KYP55668.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Cajanus cajan]
          Length = 1136

 Score =  615 bits (1587), Expect = 0.0
 Identities = 335/700 (47%), Positives = 437/700 (62%), Gaps = 18/700 (2%)
 Frame = -3

Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFP-- 2840
            CL Y++    H++K   RA+ C+FLG+    KGY L +L  H LL S++V+F ED FP  
Sbjct: 456  CLCYSSIITSHRTKLDPRAHPCIFLGFKPHTKGYLLVNLHTHGLLVSQNVIFHEDHFPSF 515

Query: 2839 -------FAS-IPVPSNXXXXXXXXXXXXXXXXXXTSSAPISQPESH----PLRRSSRIS 2696
                   F+S +P+P N                   SS P   P+ H    PLRRS+R  
Sbjct: 516  TKPNSPSFSSPVPIPYNYADYPSFPSSSIVE-----SSEP-PPPDQHSSPPPLRRSTRPR 569

Query: 2695 KPPAWLSDF----ITNSVHSSTPMASPSHSAGPDSGDFSLAPTSFNHSSILGATYTAFLA 2528
            +PP +L DF     +   HSST +  P HS       +     SF+H          ++ 
Sbjct: 570  RPPTYLQDFHGAFTSTGPHSSTGIRHPLHSFI----SYDRLSPSFHH----------YVF 615

Query: 2527 NLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMR 2348
            ++S+V +P +F +A KS  W+ AM  E++ALE NNTWVLT+LPP K AIGC+WVYKVK +
Sbjct: 616  SISSVTKPKNFVEASKSDSWLKAMHEEISALEANNTWVLTTLPPHKTAIGCRWVYKVKHK 675

Query: 2347 PDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNA 2168
             DG+++RYKARLVAKG+ Q  G+D+ D+FSPVAKL TVRL+I+LA I    L QLDVNN+
Sbjct: 676  ADGSIDRYKARLVAKGYTQMEGLDFFDTFSPVAKLTTVRLLISLAAIHNCHLKQLDVNNS 735

Query: 2167 FLHGYLDEDIYMLPPEGYSKAKDGEVCHLQRSLYGLKQASRQWNAEFCLKLQQFGFTQSG 1988
            FLHG L+E++YM  P G + +  G+VC LQRSLYGLKQASRQW A     L Q G+  S 
Sbjct: 736  FLHGDLNEEVYMQLPPGITPSFPGQVCRLQRSLYGLKQASRQWYARLSSFLIQHGYVPSP 795

Query: 1987 HDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYFLGME 1808
             DH LF++ S             ++++G  L+EI  L   L + F IKDLG  +YFLG+E
Sbjct: 796  SDHSLFLKCSPAITTAILIYVDDIVLAGNDLTEIHHLTSLLHNTFQIKDLGNLKYFLGLE 855

Query: 1807 IARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEGDPLVDPERFRRLIG 1628
            +ARN  G  L QRKY LD+L D GML  K   TP+     L A  G P  D   +RRL+G
Sbjct: 856  VARNHTGIHLCQRKYTLDLLSDTGMLASKPVSTPMDYSTHLSASSGTPFTDTAAYRRLVG 915

Query: 1627 RLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLE 1448
            RL+YL  TRP + Y+VQQLSQFV++P T+H  A   +L YLKG+P  G+F S +SS+ L 
Sbjct: 916  RLIYLPNTRPAIAYAVQQLSQFVSNPPTAHRQALFRILCYLKGTPGSGIFLSVNSSVQLR 975

Query: 1447 AYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVCEIQW 1268
            A+ D DWA C DTR+S+TGF ++LG  L+SWK+KKQ TVSRSS+EAEYRAL +  CE+QW
Sbjct: 976  AFSDYDWAGCPDTRRSITGFAVYLGDSLISWKSKKQITVSRSSSEAEYRALATTTCELQW 1035

Query: 1267 LSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGFLHLG 1088
            LSYL  D  + +  P  L+CDNQ A+ I  NP+FHERTKH+EIDCH+VR+   +G L L 
Sbjct: 1036 LSYLLKDFHIDLIRPSILYCDNQFALQIASNPIFHERTKHIEIDCHIVRDKVSTGLLKLL 1095

Query: 1087 HVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT*GG 968
             VSS LQLAD  TK L    F    SKLG+ ++H+   GG
Sbjct: 1096 PVSSSLQLADILTKPLSPFVFHSHYSKLGMLNIHSQLEGG 1135


>gb|PNX93928.1| hypothetical protein L195_g017092, partial [Trifolium pratense]
          Length = 865

 Score =  604 bits (1558), Expect = 0.0
 Identities = 320/721 (44%), Positives = 438/721 (60%), Gaps = 42/721 (5%)
 Frame = -3

Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFP-- 2840
            CL YA   HP   KF  RA + +F+GY TGQKGY++YD +  +   SRDV F E +FP  
Sbjct: 153  CLCYATIVHP-THKFDPRAKRGIFVGYPTGQKGYKIYDPETKTFFVSRDVKFCETNFPSI 211

Query: 2839 -----------------FASIPVPSNXXXXXXXXXXXXXXXXXXTS---------SAPIS 2738
                                +P P++                   S         ++PI 
Sbjct: 212  PNTSEPNLISSHPSYEAIDDLPSPTSSHHQSQQTDIPSTHEPNSPSHITTETSSAASPIV 271

Query: 2737 QPE---SHP----------LRRSSRISKPPAWLSDFITNSVHSSTPMASPSHSAGPDSGD 2597
            +P    +H           +R+S R   PP W +D+  ++  + TP       + P SG 
Sbjct: 272  EPTPLTTHTTDPPTPFIPQVRKSVRDKHPPIWHNDYHMSTQVNKTP-------SEPTSGS 324

Query: 2596 FSLAPTSFNHS-SILGATYTAFLANLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNT 2420
             +  P S   S S + ++  AFLAN++   EP S+ QA     W DAM  EL ALEQNNT
Sbjct: 325  GTRYPLSHYLSYSRISSSNCAFLANITAHREPQSYDQAVHDPLWQDAMNAELEALEQNNT 384

Query: 2419 WVLTSLPPGKRAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLV 2240
            W L  LP G + IGCKWVYK+K + DGT+ERYKARLVAKG+ Q  G+DY ++FSP AK+ 
Sbjct: 385  WSLVPLPSGHKPIGCKWVYKIKYKSDGTIERYKARLVAKGYTQVEGIDYQETFSPTAKVT 444

Query: 2239 TVRLVIALATIKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKAKDGEVCHLQRSLYGL 2060
            T+R ++ +A  + W + QLDV NAFLHG L E +YM PP G  +  +  VC L +SLYGL
Sbjct: 445  TLRCLLTVAAARNWFIHQLDVQNAFLHGDLHELVYMEPPPGLRRQGENVVCRLNKSLYGL 504

Query: 2059 KQASRQWNAEFCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDA 1880
            KQASR W + F   +Q+ G+ QS  D+ LF +               ++++G  L E+  
Sbjct: 505  KQASRNWFSTFSEVIQKAGYQQSKADYSLFTKSQGTSFTAVLIYVDDILLTGNDLQEMKR 564

Query: 1879 LKRYLDDLFTIKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLP 1700
            LK +L   F IKDLG  +YFLG+E +R+  G  ++QRKY LDIL D+G+   +    P+ 
Sbjct: 565  LKEFLLKRFRIKDLGNLKYFLGIEFSRSKKGIFMSQRKYALDILQDSGLTGARPDKFPME 624

Query: 1699 PGLKLRAKEGDPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMH 1520
              LKL   +G  L DP ++RRL+GRL+YL +TRPD+ YSVQ LSQF++ P   HWDAA+ 
Sbjct: 625  QNLKLTPTDGVVLNDPTKYRRLVGRLIYLTVTRPDIVYSVQTLSQFMHEPRKPHWDAALR 684

Query: 1519 LLRYLKGSPSVGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQ 1340
            +LRY+KG+P  GL +S+ + L L+A+CD+DW  C  TR+S+TGFC+FLG+ L+SWK+KKQ
Sbjct: 685  VLRYIKGTPGQGLLFSSTNDLTLKAFCDSDWGGCHATRRSVTGFCLFLGNSLISWKSKKQ 744

Query: 1339 TTVSRSSAEAEYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHE 1160
              VSRSSAE+EYRA+ +   E+ WL ++  DL +S  TP  L+CDNQAA+HI  NPVFHE
Sbjct: 745  VVVSRSSAESEYRAMANTCLELTWLRFILQDLKVSQNTPTPLFCDNQAALHIAANPVFHE 804

Query: 1159 RTKHLEIDCHLVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNP 980
            RTKH+EIDCH+VR   ++G ++  +V +R QLAD FTK+LG+  F+ L SKLGL D+H+P
Sbjct: 805  RTKHIEIDCHIVREKLQAGIINPSYVPTRFQLADVFTKALGKDQFVTLRSKLGLHDIHSP 864

Query: 979  T 977
            T
Sbjct: 865  T 865


>gb|PNX97998.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Trifolium pratense]
          Length = 964

 Score =  603 bits (1554), Expect = 0.0
 Identities = 339/720 (47%), Positives = 437/720 (60%), Gaps = 42/720 (5%)
 Frame = -3

Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834
            CL +A N +  + KF  RA   +F+GY   QKGYR+YD+    +  SRDV FFE  FP+ 
Sbjct: 257  CLCFAKNMNI-QHKFDERAKPGIFVGYPFNQKGYRIYDMHTRKIYVSRDVQFFETVFPYH 315

Query: 2833 SIPVPSNXXXXXXXXXXXXXXXXXXTSSA------------------------------- 2747
             +  PS                    S+                                
Sbjct: 316  DLQTPSFASDISINTQFLDYEVDDTPSNLSPASSIPPGISHHDNTIVTIPNPSVDNPSEI 375

Query: 2746 ---PISQPE-------SHPLRRSS-RISKPPAWLSDFITNSVHSSTPMASPSHSAGPDSG 2600
               P+  P+       +HP RR   R   P   L+D + + +++ T     S SA P   
Sbjct: 376  PAIPVEPPQQHSPTAINHPERRYPLRHRTPSVRLTDHVCD-INNVT-----SQSAFPLKN 429

Query: 2599 DFSLAPTSFNHSSILGATYTAFLANLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNT 2420
             FSL+  S +H         A L N+   +EP+S+SQA KSA+W +AM +E+ ALE NNT
Sbjct: 430  YFSLSNLSTSHR--------ALLVNIIENKEPTSYSQAIKSAEWREAMAKEIHALESNNT 481

Query: 2419 WVLTSLPPGKRAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLV 2240
            WVL+ LP GK AIGCKWVYK+K   DGTVERYKARLVAKG++Q HG+DY ++F+PVAKLV
Sbjct: 482  WVLSPLPNGKTAIGCKWVYKIKYHSDGTVERYKARLVAKGYNQVHGIDYHETFAPVAKLV 541

Query: 2239 TVRLVIALATIKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKAKDGEVCHLQRSLYGL 2060
            TVRL++++A IK W L QLDVNNAFL G L+E++YM  P G+S      VC L +S+YGL
Sbjct: 542  TVRLLLSIAAIKNWSLHQLDVNNAFLQGDLNEEVYMKLPPGFSHKGQPCVCKLNKSIYGL 601

Query: 2059 KQASRQWNAEFCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDA 1880
            KQASRQW ++F   L Q GF QS  D+ LF  +S             +II+G     I  
Sbjct: 602  KQASRQWFSKFSTTLIQKGFHQSISDYSLFTFKSNHTTIFVLVYVDDIIITGNNDDAISD 661

Query: 1879 LKRYLDDLFTIKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLP 1700
            +K++L   F+IKDLG   YFLG+E++R+  G  L QRKY LDIL DAG+  C+ +  P+ 
Sbjct: 662  IKKFLAQAFSIKDLGNLSYFLGIEVSRSKKGIFLCQRKYTLDILSDAGLTGCRPSEFPME 721

Query: 1699 PGLKLRAKEGDPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMH 1520
              L+LR  +G PL DP  +RRLIGRLLYL +TRPD+ Y+V  LSQF+ SP T+H DAA  
Sbjct: 722  QHLRLRPNDGSPLPDPTVYRRLIGRLLYLTVTRPDIQYAVNTLSQFMQSPCTTHLDAATR 781

Query: 1519 LLRYLKGSPSVGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQ 1340
            +LRYLKGS   GLF SA SSL L  Y D+DWA C  TR+S TG+   LGS  +SWKTKKQ
Sbjct: 782  VLRYLKGSVGKGLFLSASSSLQLIGYADSDWAGCPTTRRSTTGYFTMLGSNPISWKTKKQ 841

Query: 1339 TTVSRSSAEAEYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHE 1160
             T+SRSSAEAEYR+L +   E+QWL +L +DL ++ P P+T+ CD+QAAIHI +NPVFHE
Sbjct: 842  PTISRSSAEAEYRSLATLASELQWLKFLLSDLDIAHPLPITVHCDSQAAIHIAENPVFHE 901

Query: 1159 RTKHLEIDCHLVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNP 980
            RTKH+EIDCH VR   KSG L   ++ S  QLAD FTK LG  A+  L  KLG+ ++  P
Sbjct: 902  RTKHIEIDCHFVREKIKSGLLRPSYLRSFDQLADIFTKPLGGDAYKRLLGKLGVLEISIP 961



 Score = 63.9 bits (154), Expect = 2e-06
 Identities = 28/71 (39%), Positives = 44/71 (61%)
 Frame = -3

Query: 217 WDELACIMPLPACESDTRKLIDERDMNRKLMQFLMGLHESYDQVRNQLLLMDPLPSVDKA 38
           WDEL  I P+  C     K I ++    + M+FL G+H+ +  VR+Q+LLMDP PS+ + 
Sbjct: 1   WDELHSIAPINPCICGNAKSIIDQQNQDRAMEFLQGVHDRFSAVRSQILLMDPFPSIQRI 60

Query: 37  YSMALRVEKQR 5
           Y++  + EKQ+
Sbjct: 61  YNIVRQEEKQQ 71


>gb|KZV53534.1| hypothetical protein F511_42283 [Dorcoceras hygrometricum]
          Length = 1012

 Score =  604 bits (1557), Expect = 0.0
 Identities = 311/684 (45%), Positives = 437/684 (63%), Gaps = 7/684 (1%)
 Frame = -3

Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFA 2834
            CL YA+     + KF+ RA KCVFLGY  G +GY+L +LD + +L S DV+F E  FPF 
Sbjct: 340  CLCYASTLMSSRHKFSPRAIKCVFLGYPPGYRGYKLLNLDTNEILISCDVIFHEHEFPFQ 399

Query: 2833 SIPVPSNXXXXXXXXXXXXXXXXXXTSSAPISQP---ESHPLRRSSRISKPPAWLSDFIT 2663
            +    S+                   +S  I  P   +S    RS RI +PP  L ++  
Sbjct: 400  NT-YNSDSQPSYIFSDNLLPVHSQLNNSHTIPDPISSKSKQQSRSQRILQPPHHLQNYHC 458

Query: 2662 NSVHSSTPMASPSHSAGPDSGDFSLAPTSFNHSSILGATYTAFLANLSNVEEPSSFSQAC 2483
              +HSS+P  S SH              +F + S L   +   + N+S++ EP++FSQA 
Sbjct: 459  Y-MHSSSPSTSTSHPL-----------CNFVNYSKLSPLHRNLVNNISSIVEPTTFSQAV 506

Query: 2482 KSADWVDAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMRPDGTVERYKARLVAK 2303
               +W  AM  EL ALE N+TW + SLPPGK  +GC+WVYK K   DG+++RYKARLVAK
Sbjct: 507  AIPEWKQAMSDELKALELNHTWSIVSLPPGKSVVGCRWVYKAKFAADGSLQRYKARLVAK 566

Query: 2302 GFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNAFLHGYLDEDIYMLPP 2123
            G+ Q+ G+DYL++FSPVAK+VTVR ++ALA  +GW L QL V+NAFLHG LDE++YM  P
Sbjct: 567  GYTQQEGLDYLETFSPVAKMVTVRTLLALAAARGWSLIQLHVHNAFLHGELDEEVYMSLP 626

Query: 2122 EGYSKA----KDGEVCHLQRSLYGLKQASRQWNAEFCLKLQQFGFTQSGHDHCLFVRQSX 1955
             GYS          VC L +SLYGLKQASRQW A+F   L   GF+QS  D+ LF++   
Sbjct: 627  PGYSSEGGPLPPQSVCKLHKSLYGLKQASRQWFAKFSSTLLSVGFSQSHADNSLFIKVRD 686

Query: 1954 XXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYFLGMEIARNTDGSVLN 1775
                        ++I+         LK +L++ F +KDLG  +YFLG+E+AR++ G  + 
Sbjct: 687  NVFLVLLVYVDDIVIATNNEEAASELKSFLNNKFKLKDLGKLKYFLGIEVARSSRGISIC 746

Query: 1774 QRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEGDPLVDPERFRRLIGRLLYLNLTRPD 1595
            QR Y ++ L +AG++ C+   TP+   +K+  ++G+ L DP  +RRLIGRLLYL +TRPD
Sbjct: 747  QRNYAMNFLTEAGLMGCRPRSTPMEANVKITQEDGEILPDPSSYRRLIGRLLYLTVTRPD 806

Query: 1594 VTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLEAYCDADWASCV 1415
            + ++V +LSQ+V+ P   H +AA+++LRY+KG+   GL+Y ++S L L+ + DADW +C+
Sbjct: 807  LAFAVNKLSQYVSKPRLPHMEAALNILRYVKGTIGQGLYYGSNSDLRLKFFSDADWGACL 866

Query: 1414 DTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVCEIQWLSYLCADLGLS 1235
            DTR+S+TG+C+FLG  ++SW+ KKQ TVSRSSAEAEYR++ +  CEI W+  L  DLG+ 
Sbjct: 867  DTRRSVTGYCVFLGESMISWRAKKQHTVSRSSAEAEYRSMAAATCEILWIRSLLTDLGVK 926

Query: 1234 IPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGFLHLGHVSSRLQLADF 1055
               P TL+CD+QAAIHI  NPVFHERTKH++IDCH++R   + G + L HVSS  QLAD 
Sbjct: 927  CDGPATLFCDSQAAIHIASNPVFHERTKHIDIDCHVIREKVQQGIVKLMHVSSVQQLADL 986

Query: 1054 FTKSLGRAAFLLLCSKLGLFDLHN 983
            FTK+L  + F  L SK+G+ ++H+
Sbjct: 987  FTKALLTSRFRSLLSKMGIHNIHD 1010


>gb|PNX93906.1| hypothetical protein L195_g017068 [Trifolium pratense]
          Length = 1183

 Score =  607 bits (1564), Expect = 0.0
 Identities = 319/714 (44%), Positives = 437/714 (61%), Gaps = 35/714 (4%)
 Frame = -3

Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPF- 2837
            CL +A   HP   KF  RA + +F+GY TGQKGY++YD +  +   SRDV F E  FP  
Sbjct: 478  CLCFATIVHP-THKFDPRARRGIFVGYPTGQKGYKIYDPETKNFFVSRDVRFCETDFPSI 536

Query: 2836 --------------------ASIPVPSNXXXXXXXXXXXXXXXXXXTSSAPIS------- 2738
                                + IP PS+                  ++++PI+       
Sbjct: 537  PTTSKPNSISYHPPHEALDDSPIPTPSHVPSTHDLNPPPQPPTATPSAASPINDSIPTTS 596

Query: 2737 ---QPESHPL---RRSSRISKPPAWLSDFITNSVHSSTPMASPSHSAGPDSGDFSLAPTS 2576
               +P + P+   RRS R   PP W  D+     H S P  + S S  P S   +  P S
Sbjct: 597  HTPEPPTSPIPQVRRSLRDKNPPIWHQDY-----HMS-PQVNTSSSV-PTSRSGTRYPLS 649

Query: 2575 FNHS-SILGATYTAFLANLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLP 2399
               S S + +T+  FLAN++  +EP S+ QA     W  AM  EL AL+QNNTW L  LP
Sbjct: 650  HYLSYSRISSTHCTFLANITANKEPQSYDQAVHDPQWQAAMNTELEALQQNNTWNLVPLP 709

Query: 2398 PGKRAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIA 2219
            PG + IGCKWVYK+K + DGT+ERYKARLVAKG+ Q  G+DY ++FSP AK+ T+R ++ 
Sbjct: 710  PGHKPIGCKWVYKIKYKSDGTIERYKARLVAKGYTQVEGIDYQETFSPTAKVTTLRCLLT 769

Query: 2218 LATIKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKAKDGEVCHLQRSLYGLKQASRQW 2039
            +A  + W + QLDV NAFLHG L E +YM PP G  +  +  VC L +SLYGLKQASR W
Sbjct: 770  VAASRNWFIHQLDVQNAFLHGDLHELVYMEPPPGLRRQGENVVCRLNKSLYGLKQASRNW 829

Query: 2038 NAEFCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDD 1859
             + F   +Q+ G+ QS  D+ LF +               ++++G  L E+  LK +L  
Sbjct: 830  FSTFSKAIQKAGYQQSKADYSLFTKPQGTSFTAVLIYVDDILLTGNDLEEMKRLKEFLLR 889

Query: 1858 LFTIKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRA 1679
             F IKDLG  +YFLG+E +R+  G  ++QRKY LDIL D+G++  +    P+   LKL  
Sbjct: 890  HFRIKDLGDLKYFLGIEFSRSKKGIFMSQRKYALDILQDSGLIGARPDKFPMEQNLKLTP 949

Query: 1678 KEGDPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKG 1499
             +G  L DP ++RRL+GRL+YL +TRPD+ YSVQ LSQF++ P   HWDAA+ +LRY+KG
Sbjct: 950  TDGVVLTDPTKYRRLVGRLIYLTVTRPDIVYSVQTLSQFMHEPRKPHWDAALRVLRYIKG 1009

Query: 1498 SPSVGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSS 1319
            +P  G+ +S  + L L+A+CD+DW  C  TR+S+TGFCIFLG+  +SWK+KKQ TVSRSS
Sbjct: 1010 TPGQGILFSTSNDLSLKAFCDSDWGGCHATRRSVTGFCIFLGNSPISWKSKKQVTVSRSS 1069

Query: 1318 AEAEYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEI 1139
            AE+EYRA+ +   E+ WL ++  DL ++   P  L+CDNQAA+HI  NPVFHERTKH+EI
Sbjct: 1070 AESEYRAMANTCLELTWLRFILQDLKVTQAAPTPLFCDNQAALHIAANPVFHERTKHIEI 1129

Query: 1138 DCHLVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT 977
            DCH+VR   ++G +   +V +R QLAD FTK+LG+  F+ L +KLGL D+H+PT
Sbjct: 1130 DCHIVREKLQAGMISPSYVPTRFQLADVFTKALGKDQFVTLRNKLGLHDIHSPT 1183



 Score =  187 bits (476), Expect = 8e-45
 Identities = 92/210 (43%), Positives = 136/210 (64%), Gaps = 2/210 (0%)
 Frame = -3

Query: 628 DPLKLYSSDHPGLSLVSSQLTGNNYLSWRRSMLIALGAKTKLGFINGKMEIPKEDSP--K 455
           +P  ++ SDHPG  LV ++L G NY SW RSM+ AL AK K+GFI+G ++ P E+    +
Sbjct: 5   NPYYIHPSDHPGHLLVPTKLNGTNYPSWSRSMVHALTAKNKVGFIDGSIKEPSEEKQPAE 64

Query: 454 YDQWRKVDCMVISWILNSISKDLVDAFIYCDSAKDLWDDIAKRFGDCNGPLIYQLERDIA 275
           Y  W + + M++SW+ +S+ +DL    I+  +A  +W D   +F   N P IYQ+++ +A
Sbjct: 65  YALWNRCNSMILSWLTHSVEQDLAKGVIHAKTAYQVWKDFKDQFSQKNIPAIYQIQKSLA 124

Query: 274 NMNQGNMSVVEYFTKLKRLWDELACIMPLPACESDTRKLIDERDMNRKLMQFLMGLHESY 95
           +++QG MSV  YFTK+K LWDEL     LP C     K  DE+    +LMQFLMGL++SY
Sbjct: 125 SLSQGTMSVSTYFTKIKGLWDELESYRTLPTCSQ--MKAHDEQREEDRLMQFLMGLNDSY 182

Query: 94  DQVRNQLLLMDPLPSVDKAYSMALRVEKQR 5
             VR+ +L+M PLP+V +AYS+ ++ E QR
Sbjct: 183 STVRSNILMMSPLPNVRQAYSLVIQEETQR 212


>gb|KYP34293.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1376

 Score =  611 bits (1575), Expect = 0.0
 Identities = 331/714 (46%), Positives = 440/714 (61%), Gaps = 32/714 (4%)
 Frame = -3

Query: 3013 CLSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPF- 2837
            CL Y + +  ++ K   RA+ CVFLG+    KGY  YDL   ++  SR+V F+E+ FP  
Sbjct: 671  CLCYVSTSTANRKKLDPRAHPCVFLGFSPTTKGYITYDLHTRAITISRNVSFYENHFPLL 730

Query: 2836 ------ASIPVPS------------------NXXXXXXXXXXXXXXXXXXTSSAPIS--- 2738
                  ++IPV S                  +                   S AP S   
Sbjct: 731  QSTSSTSNIPVVSPISFGIHSPSHDLISILPDPHQHNVTSPNPATTSHDSISLAPYSTTA 790

Query: 2737 ---QPESHPLRRSSRISKPPAWLSDFITNSVHSSTPMASPSHSAGPDSGDFSLAP-TSFN 2570
                P S PLRRS+R+  PP++L D+     HS T  ++  H          L P   + 
Sbjct: 791  DSLPPNSSPLRRSTRLRNPPSYLQDY----HHSLTSTSTNLHPG-------MLYPIEKYI 839

Query: 2569 HSSILGATYTAFLANLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGK 2390
              S L   + AF++++S V EP S+++A K   W+ AM  EL AL+ N TW LT LPP K
Sbjct: 840  SYSRLSNDFQAFVSSISAVSEPHSYAEAAKHDCWLKAMHAELEALKMNQTWTLTPLPPHK 899

Query: 2389 RAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALAT 2210
            +A+GC+W+YK+K   DG++ERYKARLVAKG+ Q  G+DYL +FSPVAKL TVRL++ALA 
Sbjct: 900  QAVGCRWIYKIKYNADGSIERYKARLVAKGYTQVEGLDYLATFSPVAKLTTVRLLLALAA 959

Query: 2209 IKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKAKDGEVCHLQRSLYGLKQASRQWNAE 2030
            +  W L QLDVNNAFLHG L+E++YM  P G       +VC LQ+SLYGLKQASRQW A+
Sbjct: 960  VFDWHLKQLDVNNAFLHGDLNEEVYMTLPLGMRPEYSNQVCKLQKSLYGLKQASRQWFAK 1019

Query: 2029 FCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFT 1850
                L   G+ QS  DH LF++ S             ++++G  LSEI  +   LD  F 
Sbjct: 1020 LSSFLIHHGYHQSASDHSLFMKFSSSSTTALLIYVDDIVLAGNNLSEIQLITGLLDVAFK 1079

Query: 1849 IKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEG 1670
            IKDLG  +YFLG+E+ARN  G  L+QRKYVLDIL D GM+  +   TP+    +L A  G
Sbjct: 1080 IKDLGNLKYFLGLEVARNKSGIHLSQRKYVLDILSDCGMMASRPVSTPMDYTSRLSASSG 1139

Query: 1669 DPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPS 1490
             PL DP  +RRL+GRL+YL  TRPD++Y V  LSQF+++P T+H  A   +LRYLK +P 
Sbjct: 1140 TPLADPSSYRRLLGRLIYLTTTRPDISYVVHHLSQFMSAPSTAHSQAIFRILRYLKQAPG 1199

Query: 1489 VGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEA 1310
             GLF+  +SSL L+A+ D+DWA C+DTR+S+TGF ++LG  L+SW++KKQ TVSRSS+EA
Sbjct: 1200 SGLFFPTNSSLHLKAFSDSDWAGCLDTRRSITGFSVYLGDSLISWRSKKQPTVSRSSSEA 1259

Query: 1309 EYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCH 1130
            EYRAL +   E+QWL+YL  DL + +  P  L+CDNQ+A+HI  N VFHERTKH++IDCH
Sbjct: 1260 EYRALATTTSELQWLTYLLHDLHVPVHQPALLYCDNQSALHIAANQVFHERTKHIDIDCH 1319

Query: 1129 LVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHNPT*GG 968
            LVR   +SG L L  V+S  QLAD FTKSL  + F  L SKLG+ +L++   GG
Sbjct: 1320 LVREKLQSGLLKLLPVASPHQLADIFTKSLSPSMFTALYSKLGMLNLYSQLEGG 1373



 Score =  147 bits (371), Expect = 4e-32
 Identities = 72/201 (35%), Positives = 120/201 (59%), Gaps = 6/201 (2%)
 Frame = -3

Query: 592 LSLVSSQLTGNNYLSWRRSMLIALGAKTKLGFINGKMEIPKEDSPKYDQWRKVDCMVISW 413
           ++LVS  L   NY SW RSML AL AK K+ F++G    P      Y  W++ + MV+SW
Sbjct: 1   MALVSPSLDSTNYHSWSRSMLTALSAKNKVEFVDGSAPQPPSSDRIYSAWKRCNNMVVSW 60

Query: 412 ILNSISKDLVDAFIYCDSAKDLWDDIAKRFGDCNGPLIYQLERDIANMNQGNMSVVEYFT 233
           ++ S+S  +  + ++ DSA+++W D+  R+   +   I  L+ + +++ QG++SV +YFT
Sbjct: 61  LVPSVSFSIRQSILWMDSAEEIWRDLKSRYSQGDLLRISALQLEASSIKQGDLSVTDYFT 120

Query: 232 KLKRLWDELACIMPLPACESDTR------KLIDERDMNRKLMQFLMGLHESYDQVRNQLL 71
           +L+ +WDEL    P P C    +       ++ +R +  + MQFL GL++ Y  VR+ +L
Sbjct: 121 QLRIIWDELENFRPDPICVCIVKCICKVSSILAQRKLEDQAMQFLRGLNDQYANVRSHVL 180

Query: 70  LMDPLPSVDKAYSMALRVEKQ 8
           LMDPLP ++K +S   + E+Q
Sbjct: 181 LMDPLPPINKIFSYVAQQERQ 201


>gb|PNX93131.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 982

 Score =  597 bits (1540), Expect = 0.0
 Identities = 318/704 (45%), Positives = 429/704 (60%), Gaps = 29/704 (4%)
 Frame = -3

Query: 3010 LSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFPFAS 2831
            L+YA+    +K+K + R  KCVFLG   G KG  L+DLD+ ++  SR+V  F+   P+ +
Sbjct: 264  LAYASTLDVNKTKLSPRGRKCVFLGQKQGVKGSILFDLDSKNIFLSRNVTHFDHILPYTT 323

Query: 2830 ----------IPVPSNXXXXXXXXXXXXXXXXXXTSSAP----ISQPE---SHPL----- 2717
                        +                      S  P    IS P    S PL     
Sbjct: 324  NTSKLHWHYHSTINCEPFLDIDQSHTSTNPSDTTPSPTPPTNIISDPNPSTSSPLPSSPF 383

Query: 2716 ------RRSSRISKPPAWLSDFITNSVHSSTPMASPSHSAGPDSGDFSLAPTSFNHS-SI 2558
                   R  RI   P++LSDF+           S S  +   S   ++ P S  HS S 
Sbjct: 384  PIQPANTRPDRIKHRPSYLSDFV----------CSASDDSAKSSSTGTIYPISSFHSLSQ 433

Query: 2557 LGATYTAFLANLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGKRAIG 2378
            L  +++ F ++L+   EP ++++ACKS  W+ AM  EL AL +  TW +  LPP  + IG
Sbjct: 434  LSPSHSVFTSSLTQHTEPRTYTEACKSQHWIQAMTSELEALARTGTWKIVDLPPNVKPIG 493

Query: 2377 CKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKGW 2198
             KWVYK+K + DGT+ERYKARLVAKG++Q  G+D+ D+FSPVAKL TVR+++A+A+IKGW
Sbjct: 494  SKWVYKIKHKSDGTIERYKARLVAKGYNQVEGLDFFDTFSPVAKLTTVRMLLAIASIKGW 553

Query: 2197 PLFQLDVNNAFLHGYLDEDIYMLPPEGYSKAKDGEVCHLQRSLYGLKQASRQWNAEFCLK 2018
             L QLDVNNAFLHG L E++YM  P+G   +K  +VC L +SLYGLKQASR+W  +    
Sbjct: 554  FLHQLDVNNAFLHGDLQENVYMSIPDGVQCSKPNQVCKLLKSLYGLKQASRKWYEKLTSL 613

Query: 2017 LQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKDL 1838
            L + G+TQS  DH LF                 +I++GT L EI+ +K  LD  F IKDL
Sbjct: 614  LVKEGYTQSSSDHSLFTISQQDNFTALLIYVDDIILAGTSLQEINRIKNILDTHFKIKDL 673

Query: 1837 GYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEGDPLV 1658
            G  +YFLG+E+A + +G  ++QRKY LD+LHD+G+L  K A TPL P +KL   +G P  
Sbjct: 674  GVVKYFLGLEVAHSKEGISISQRKYCLDLLHDSGLLGSKPASTPLDPSVKLHHDDGKPFE 733

Query: 1657 DPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGLF 1478
            D   +RRL+G+LLYL  TRPD+ ++ QQLSQF++ P  +H+ AA  ++RYLK +P +GL 
Sbjct: 734  DISMYRRLVGKLLYLTNTRPDIAFATQQLSQFLHKPTMTHYKAACRVIRYLKHNPGMGLI 793

Query: 1477 YSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYRA 1298
            +  ++ + L  Y DADWA C+DTR+S TG+C F+GS L+SWK KKQTT+S+SS+EAEYRA
Sbjct: 794  FKRNADIQLIGYSDADWAGCLDTRRSTTGYCFFVGSSLISWKAKKQTTISKSSSEAEYRA 853

Query: 1297 LGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVRN 1118
            L S  CE+ WL YL  DL +       L+CDNQ+A+HI  NPVFHERTKH+EIDCHLVR 
Sbjct: 854  LSSATCELVWLLYLLKDLHIECSKQPVLFCDNQSALHIASNPVFHERTKHIEIDCHLVRE 913

Query: 1117 LYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLH 986
              + G L L  VS++ QLADF TKSL    F     KLGL D++
Sbjct: 914  KVQEGLLRLIPVSTQEQLADFLTKSLPAPKFHDFLCKLGLLDIY 957


>gb|PNX92076.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 720

 Score =  588 bits (1516), Expect = 0.0
 Identities = 322/697 (46%), Positives = 423/697 (60%), Gaps = 17/697 (2%)
 Frame = -3

Query: 3010 LSYAANTHPHKSKFAARAYKCVFLGYITGQKGYRLYDLDNHSLLTSRDVVFFEDSFP--- 2840
            L YA     H++K   RA K +FLGY +G KGY LYDL +  +  SR V F E+  P   
Sbjct: 10   LCYATTLTSHRTKLDPRARKSLFLGYRSGYKGYVLYDLSSREIFISRHVTFHENVLPYPN 69

Query: 2839 ----------FASIPVPSNXXXXXXXXXXXXXXXXXXTSSAPISQPESHP---LRRSSRI 2699
                      + S    S+                   +S   S   S P    R S+R 
Sbjct: 70   STSISTSNWDYISSHTSSDTSIHTSNEIITPPSINLPANSTASSPSTSAPPTLTRCSTRP 129

Query: 2698 SKPPAWLSDFITNSVHSSTPMASPSHSAGPDSGDFSLAPTSFNHSSILGATYTAFLANLS 2519
               P +L D++ N++          HS+   SG  S   ++F     L   +  +  +L+
Sbjct: 130  KHIPPYLKDYVCNAL---------DHSSMKSSG-ISYPMSNFISYQNLSNPHCFYALSLT 179

Query: 2518 NVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGKRAIGCKWVYKVKMRPDG 2339
               EP S+++A K   W  AMQ EL ALE   TW+L  LP   + IGC+WVYKVK   DG
Sbjct: 180  THTEPKSYAEAIKFDCWKQAMQVELQALENTGTWILVDLPHHVKPIGCRWVYKVKHHADG 239

Query: 2338 TVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKGWPLFQLDVNNAFLH 2159
            +VERYKARLVAKGF+Q  G+DY D+FSPVAKL TVR+VIALA++  W L QLDVNNAFLH
Sbjct: 240  SVERYKARLVAKGFNQIEGLDYFDTFSPVAKLTTVRIVIALASVHNWFLHQLDVNNAFLH 299

Query: 2158 GYLDEDIYMLPPEGYSKAKDGEVCHLQRSLYGLKQASRQWNAEFCLKLQQFGFTQSGHDH 1979
            G L ED+YMLPP G +     +VC L +SLYGLKQASRQW A+    L   G+ Q+  DH
Sbjct: 300  GDLQEDVYMLPPPGVTN-DPNKVCKLVKSLYGLKQASRQWYAKLTSLLLSHGYKQAHSDH 358

Query: 1978 CLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKDLGYARYFLGMEIAR 1799
             LF +               VI++G  ++E   +K  L + F IKDLG  +YFLG+E+A 
Sbjct: 359  SLFTKHDASHFTLLLVYVDDVILAGNHMAEFSYVKNLLHNAFKIKDLGQLKYFLGLEVAH 418

Query: 1798 NTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEGDPLVDPERFRRLIGRLL 1619
            +  G  L QRKY LD+L D+G+L  K   TP    LKL A +  P  D   +RRL+GRLL
Sbjct: 419  SAKGISLCQRKYCLDLLSDSGLLGAKPVSTPSDASLKLHADDSAPFEDISAYRRLVGRLL 478

Query: 1618 YLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLEAYC 1439
            YLN TRPD+T+  QQLSQF++ P  +H+ AAM +LRYLK  P  GLF+  +S+L +  + 
Sbjct: 479  YLNTTRPDITFITQQLSQFLSKPTHTHYSAAMRVLRYLKNCPGRGLFFPRNSTLQILGFS 538

Query: 1438 DADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYRALGSGVCEIQWLSY 1259
            DADWA C D+R+S++G C FLG  L+SW+TKKQ TV+RSS+EAEYRAL +  CE+QWL+Y
Sbjct: 539  DADWAGCKDSRRSISGQCFFLGQSLISWRTKKQLTVARSSSEAEYRALAAATCELQWLAY 598

Query: 1258 LCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVRNLYKSGFLHLGHVS 1079
            L  DL ++ P    L+CDNQ+A+HI  NPVFHERTKH++IDCH+VR   ++G + L  VS
Sbjct: 599  LLQDLHITCPKLPVLYCDNQSALHIAANPVFHERTKHIDIDCHIVREKLQAGLMKLLPVS 658

Query: 1078 SRLQLADFFTKSLGRAAFLLLCSKLGLFDLHN-PT*G 971
            S+ Q+ADFFTKSL    F +L +KLG+FD++  PT G
Sbjct: 659  SKDQIADFFTKSLLPQPFGVLLAKLGMFDIYQAPTCG 695


>gb|OMP02866.1| Reverse transcriptase, RNA-dependent DNA polymerase [Corchorus
            capsularis]
          Length = 666

 Score =  585 bits (1509), Expect = 0.0
 Identities = 308/669 (46%), Positives = 412/669 (61%), Gaps = 20/669 (2%)
 Frame = -3

Query: 2923 QKGYRLYDLDNHSLLTSRDVVFFEDSFPFAS------------IPVPSNXXXXXXXXXXX 2780
            QKGYRLYDL N   L SRDVVF E+ FPF              +P+P N           
Sbjct: 3    QKGYRLYDLSNQEYLVSRDVVFQENIFPFQQSRTPPTPSQVLPLPIPDNHSFNSLPSTPI 62

Query: 2779 XXXXXXXTSSAP-----ISQP--ESHPLRRSSRISKPPAWLSDFITNSVHSSTPMASPSH 2621
                     S       IS P  E  PL RS R  +PP +L  +  + V        PS 
Sbjct: 63   ESPNETPIISNDSSLNEISLPSNEDQPLARSQRNRRPPPYLQYYECSKVRRQ-----PSQ 117

Query: 2620 SAGPDSGDFSLAPTS-FNHSSILGATYTAFLANLSNVEEPSSFSQACKSADWVDAMQREL 2444
            S+   SG  +  P S F  +  L +TY+ F++N++++ EP S+S+A K  +W  A+  EL
Sbjct: 118  SSSTTSGSGTRYPISNFLSTHRLSSTYSTFVSNITSIAEPQSYSEAIKDPNWKAAIDAEL 177

Query: 2443 TALEQNNTWVLTSLPPGKRAIGCKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDS 2264
             ALE N TW +  LPP K  +GCKWV+KVK +  G++ERYKARLVAKG+ Q+ G+D+ ++
Sbjct: 178  HALEANKTWSIVDLPPHKSPVGCKWVFKVKYKSYGSIERYKARLVAKGYTQQEGIDFHET 237

Query: 2263 FSPVAKLVTVRLVIALATIKGWPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKAKDGEVCH 2084
            F+PVAK+ TVR ++A+A+ K WPL+QLDV NA LHG LDE++YM  P G +   +  VC 
Sbjct: 238  FAPVAKMTTVRCLLAIASTKNWPLYQLDVQNALLHGDLDEEVYMSLPPGVTSKGENSVCK 297

Query: 2083 LQRSLYGLKQASRQWNAEFCLKLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISG 1904
            L +SLYGL+QAS QW A+F   L  +GF QS  D+ LF++ S             ++I+G
Sbjct: 298  LHKSLYGLRQASLQWFAKFSTALLTYGFVQSRSDYSLFIKSSKTDFVAILVYVDDIVITG 357

Query: 1903 TVLSEIDALKRYLDDLFTIKDLGYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHC 1724
                 ID++K  L   F+IKDLG  +YFLG+E+AR+  G  L+QRKY L++L + G+   
Sbjct: 358  NNSKLIDSVKNALQRQFSIKDLGSLKYFLGLEVARSKQGIYLSQRKYTLELLSETGLAGA 417

Query: 1723 KAAITPLPPGLKLRAKEGDPLVDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYT 1544
            +    P+    KL A EG+ L DP  +RRLIG+L+YL +TRPD+ YSV  LSQF+N P  
Sbjct: 418  RPLYVPMEQNTKLSAHEGELLKDPSPYRRLIGKLIYLTITRPDIMYSVHVLSQFMNQPRH 477

Query: 1543 SHWDAAMHLLRYLKGSPSVGLFYSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCL 1364
             H+DAA+ L+RYLK SP  G+  S+ S   L A+ D+DWASC DTR+SLTGFCI LGS  
Sbjct: 478  PHFDAALRLVRYLKSSPGQGILLSSLSDFKLRAFSDSDWASCPDTRRSLTGFCILLGSSP 537

Query: 1363 VSWKTKKQTTVSRSSAEAEYRALGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHI 1184
            +SWKTKKQ TVS SSAEAEYRA+     EI WL  L  D G+S  TP +L CDN+AA+HI
Sbjct: 538  ISWKTKKQQTVSCSSAEAEYRAMAFTCREIVWLQSLLHDFGISQCTPASLHCDNKAALHI 597

Query: 1183 VQNPVFHERTKHLEIDCHLVRNLYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKL 1004
              NPVFHER+KH+E+DCH +R+  +   +   ++S+  Q AD FTK LG+     L  KL
Sbjct: 598  AANPVFHERSKHIEVDCHFIRDKLQQKIIETSYISTTQQPADLFTKPLGKDQLHHLLRKL 657

Query: 1003 GLFDLHNPT 977
             + D+H+PT
Sbjct: 658  AVHDIHSPT 666


>gb|PNY03100.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 629

 Score =  582 bits (1500), Expect = 0.0
 Identities = 290/586 (49%), Positives = 390/586 (66%), Gaps = 1/586 (0%)
 Frame = -3

Query: 2737 QPESHPLRRSSRISKPPAWLSDFITNSVHSSTPMASPSHSAGPDSGDFSLAP-TSFNHSS 2561
            QPE  PLRRS+R S PP +L++   N   + T    P  SA   S      P +S+    
Sbjct: 36   QPE--PLRRSTRNSHPPPFLTE---NYYCNLTSATLPDSSAATLSSSSCKYPISSYVSYQ 90

Query: 2560 ILGATYTAFLANLSNVEEPSSFSQACKSADWVDAMQRELTALEQNNTWVLTSLPPGKRAI 2381
             + + +  FL NLS + EP+ + +A    +W  A+  EL+ALE+NNTW L  LP  K AI
Sbjct: 91   NISSAHNHFLFNLSTIPEPTCYEKAVCDENWKTAINAELSALEKNNTWKLVPLPLHKHAI 150

Query: 2380 GCKWVYKVKMRPDGTVERYKARLVAKGFHQEHGVDYLDSFSPVAKLVTVRLVIALATIKG 2201
            GCKWV+K+K+  DGT+ERYKARLVAKG+ Q  G+DY+D+FSPV K+ T+R+++A+A  + 
Sbjct: 151  GCKWVFKLKLHADGTIERYKARLVAKGYTQTEGIDYMDTFSPVVKMTTIRVLLAVAAAQN 210

Query: 2200 WPLFQLDVNNAFLHGYLDEDIYMLPPEGYSKAKDGEVCHLQRSLYGLKQASRQWNAEFCL 2021
            WPL+QLDVN AFLHG L+E++YM PP G S      VC LQRSLYGLKQASRQWN +   
Sbjct: 211  WPLYQLDVNTAFLHGDLNEEVYMQPPPGLSLPHSNLVCKLQRSLYGLKQASRQWNTKLTE 270

Query: 2020 KLQQFGFTQSGHDHCLFVRQSXXXXXXXXXXXXXVIISGTVLSEIDALKRYLDDLFTIKD 1841
             L   G+ QS  D+ LF +Q+             +++ GT  +EI  +K  LD+ F+IKD
Sbjct: 271  TLTASGYVQSKSDYSLFTKQASSGLTIILVYVDDLVLGGTDSNEIQNIKALLDEKFSIKD 330

Query: 1840 LGYARYFLGMEIARNTDGSVLNQRKYVLDILHDAGMLHCKAAITPLPPGLKLRAKEGDPL 1661
            LGY +YFLG E+AR   G  L QRKY LD++ DAG+L  K   TP+ P L+L    G  +
Sbjct: 331  LGYLKYFLGFEVARTQAGISLCQRKYALDLIQDAGLLGAKPCSTPMQPQLQLHKSSGQAI 390

Query: 1660 VDPERFRRLIGRLLYLNLTRPDVTYSVQQLSQFVNSPYTSHWDAAMHLLRYLKGSPSVGL 1481
             +P  +RRLIGRLLYL  +RP++ Y+V +LSQF++ P   H  A +H+LRY+K SP  GL
Sbjct: 391  SEPTSYRRLIGRLLYLTHSRPEIAYAVSKLSQFLDKPTNEHMLAGLHVLRYVKNSPGQGL 450

Query: 1480 FYSAHSSLCLEAYCDADWASCVDTRKSLTGFCIFLGSCLVSWKTKKQTTVSRSSAEAEYR 1301
            F+ + S L L+ + D+DW +C DTR+S TGFC FLG+ L+SWK+KKQ  VSRSS+EAEYR
Sbjct: 451  FFDSKSPLTLKGFSDSDWGACPDTRRSTTGFCFFLGNSLISWKSKKQNVVSRSSSEAEYR 510

Query: 1300 ALGSGVCEIQWLSYLCADLGLSIPTPVTLWCDNQAAIHIVQNPVFHERTKHLEIDCHLVR 1121
            AL    CE QWL +L  DL +S PTP+ ++CDN++A+HI  NPVFHERTKH+E+DCH+VR
Sbjct: 511  ALAQTTCEGQWLKFLLQDLHISHPTPIVIYCDNKSALHIAANPVFHERTKHIEMDCHVVR 570

Query: 1120 NLYKSGFLHLGHVSSRLQLADFFTKSLGRAAFLLLCSKLGLFDLHN 983
               +SG +HL  V ++ Q+AD  TKSL    F  L SKLG+ D+++
Sbjct: 571  EKVQSGLIHLLSVHTKEQVADILTKSLHPGPFHTLQSKLGMIDIYS 616