BLASTX nr result

ID: Astragalus23_contig00020350 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00020350
         (1229 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX85069.1| retrotransposon-related protein [Trifolium pratense]   503   e-173
gb|PRQ42077.1| putative RNA-directed DNA polymerase [Rosa chinen...   514   e-171
gb|PNX81877.1| hypothetical protein L195_g037902, partial [Trifo...   498   e-170
gb|PNY07765.1| putative copia-type polyprotein [Trifolium pratense]   517   e-169
gb|PRQ52345.1| putative RNA-directed DNA polymerase [Rosa chinen...   516   e-169
ref|XP_018816630.1| PREDICTED: uncharacterized protein LOC108988...   483   e-167
gb|PRQ33949.1| putative RNA-directed DNA polymerase [Rosa chinen...   483   e-167
gb|PNX86749.1| copia-type polyprotein [Trifolium pratense]            482   e-166
gb|PRQ38020.1| putative RNA-directed DNA polymerase [Rosa chinen...   484   e-164
gb|KYP66580.1| Retrovirus-related Pol polyprotein from transposo...   474   e-164
dbj|GAU31929.1| hypothetical protein TSUD_271130, partial [Trifo...   479   e-163
gb|KYP57482.1| Retrovirus-related Pol polyprotein from transposo...   473   e-163
gb|KZV22085.1| retrovirus-related Pol polyprotein from transposo...   478   e-163
gb|PNX87200.1| copia-type polyprotein, partial [Trifolium pratense]   472   e-163
dbj|GAU13002.1| hypothetical protein TSUD_173010 [Trifolium subt...   495   e-162
gb|KZV40714.1| retrovirus-related Pol polyprotein from transposo...   476   e-162
dbj|GAU23220.1| hypothetical protein TSUD_172480 [Trifolium subt...   498   e-162
gb|PNX95204.1| copia-type polyprotein [Trifolium pratense]            498   e-162
dbj|GAU23361.1| hypothetical protein TSUD_334080 [Trifolium subt...   498   e-161
dbj|GAU37106.1| hypothetical protein TSUD_278930 [Trifolium subt...   489   e-161

>gb|PNX85069.1| retrotransposon-related protein [Trifolium pratense]
          Length = 538

 Score =  503 bits (1294), Expect = e-173
 Identities = 238/371 (64%), Positives = 301/371 (81%)
 Frame = +1

Query: 1    KLKKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIY 180
            KLKKALYGL+QAPR+W+SRIE YF+ E F KC SEH+LFV    +  +LI+S+YVDDLI+
Sbjct: 168  KLKKALYGLKQAPRSWFSRIEAYFVKEGFLKCHSEHTLFVKISKEGKILIVSIYVDDLIF 227

Query: 181  TGNNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDEGIFIHQSKYANEVLNRFKMT 360
            TG+++ +  DFK SM  EFDM+DLGKM YFLG+EV+Q ++GI+I Q KYA EVL RF M 
Sbjct: 228  TGDDESMFEDFKNSMMHEFDMSDLGKMRYFLGIEVLQRNDGIYICQKKYALEVLRRFGME 287

Query: 361  EVNAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEKP 540
              N+V NPIVPG K+ K+  G +VD+T +KQ++GSLMYLT TR DLMF V L+SR+M +P
Sbjct: 288  GSNSVHNPIVPGFKICKDKEGVKVDATFFKQVVGSLMYLTTTRPDLMFVVSLISRYMAQP 347

Query: 541  TDLHMQAIKKIMRYLKGSLNHGIHYSRRGREELIGFSDSDYAGDLDDRKSTSGYVFMLGT 720
            T+LH+QA K+++RYLKG+ + GI Y + G E L+ ++DSDYAGDL+DRKSTSGYVF++ +
Sbjct: 348  TELHLQAAKRVLRYLKGTADFGIFYKKGGSENLVAYADSDYAGDLEDRKSTSGYVFLMSS 407

Query: 721  GAISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTVILCDNSSTI 900
            GA+SWSSKKQ +VTLSTTEAEF++AA CA Q +W+RRILD+LGH Q   TV+ CDNSSTI
Sbjct: 408  GAVSWSSKKQPIVTLSTTEAEFVAAAYCASQIVWMRRILDKLGHSQSGSTVMFCDNSSTI 467

Query: 901  KLSKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRLSK 1080
            KLSKNPVLHGR KHIDVRFHFLRDL KE ++ELV+C T +Q+AD+MTKPLKLD FL+L  
Sbjct: 468  KLSKNPVLHGRCKHIDVRFHFLRDLTKEGIVELVFCGTQEQVADVMTKPLKLDIFLKLRS 527

Query: 1081 ELGMRSIEEIN 1113
             LG+  I E+N
Sbjct: 528  LLGVCQIPEVN 538


>gb|PRQ42077.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 1044

 Score =  514 bits (1324), Expect = e-171
 Identities = 247/371 (66%), Positives = 300/371 (80%)
 Frame = +1

Query: 1    KLKKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIY 180
            KL KALYGL+QAPRAWYSRIE YF+ E FE+CP EH+LFV       +LI+SLYVDDLI+
Sbjct: 674  KLNKALYGLKQAPRAWYSRIEAYFIKEGFERCPHEHTLFVKLSEGGKILIVSLYVDDLIF 733

Query: 181  TGNNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDEGIFIHQSKYANEVLNRFKMT 360
            TGN++ +  +FK+SM+ EFDM+DLG M YFLGVEVVQN+ GI+I Q KYA+E+L RF M 
Sbjct: 734  TGNDEYMFEEFKKSMKDEFDMSDLGMMRYFLGVEVVQNEAGIYICQKKYASEILERFGME 793

Query: 361  EVNAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEKP 540
            + N+V+NPIVPG KL K+  G +VD+T+YKQ++GSLMYLTATR DLM+ V L+SRFM  P
Sbjct: 794  KANSVRNPIVPGFKLMKDEGGVKVDATMYKQIVGSLMYLTATRPDLMYVVSLISRFMSSP 853

Query: 541  TDLHMQAIKKIMRYLKGSLNHGIHYSRRGREELIGFSDSDYAGDLDDRKSTSGYVFMLGT 720
            T+LHMQA K+++RYLKG++N G+ Y R G E+L  F+DSDYAGD+DDRKSTSGY+FM   
Sbjct: 854  TELHMQAAKRVLRYLKGTVNLGVLYRRNGEEKLEAFTDSDYAGDVDDRKSTSGYIFMFSD 913

Query: 721  GAISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTVILCDNSSTI 900
            GA+SWSSKKQ VVTLSTTEAEFI+AA CACQ +W+RRI +RLGH Q+  T + CDNSSTI
Sbjct: 914  GAVSWSSKKQPVVTLSTTEAEFIAAAYCACQGVWMRRIFERLGHAQRGCTTVYCDNSSTI 973

Query: 901  KLSKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRLSK 1080
            KLSKNPV+HGRSKHIDVRFHFLR L K+  +ELVYC T DQIAD MTKPLKL+ F +L  
Sbjct: 974  KLSKNPVMHGRSKHIDVRFHFLRQLTKDGNVELVYCNTQDQIADAMTKPLKLEVFEKLRD 1033

Query: 1081 ELGMRSIEEIN 1113
             LGM  +  IN
Sbjct: 1034 LLGMCLVPGIN 1044


>gb|PNX81877.1| hypothetical protein L195_g037902, partial [Trifolium pratense]
          Length = 575

 Score =  498 bits (1282), Expect = e-170
 Identities = 236/358 (65%), Positives = 293/358 (81%)
 Frame = +1

Query: 1    KLKKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIY 180
            KLK+ALYGL+QAPRAWYSRIE YF  E FE+CP EH+LFV       +LI+SLYVDDLI+
Sbjct: 217  KLKRALYGLKQAPRAWYSRIEAYFTKEGFERCPYEHTLFVKQSETGNILIVSLYVDDLIF 276

Query: 181  TGNNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDEGIFIHQSKYANEVLNRFKMT 360
            TGN++ +  +FK+SMEKEF+M+DLGKM YFLGVEV+QN+EGI+I Q KY  ++L RF M 
Sbjct: 277  TGNDENMFKEFKKSMEKEFNMSDLGKMHYFLGVEVIQNEEGIYICQRKYVTDLLERFGME 336

Query: 361  EVNAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEKP 540
            + N  +NPI PG KL K+ +G +VD+T+YKQ++G LMYL ATR DLM+ + L+SRFM  P
Sbjct: 337  KSNLSRNPIAPGCKLIKDENGVKVDATMYKQVVGCLMYLAATRPDLMYVLSLISRFMNCP 396

Query: 541  TDLHMQAIKKIMRYLKGSLNHGIHYSRRGREELIGFSDSDYAGDLDDRKSTSGYVFMLGT 720
            T+LHM A+K+++RYL G++N GI Y R G E+L  ++DSDYAGDLDDRKSTSGYVFML  
Sbjct: 397  TELHMHAVKRVLRYLNGTINLGIMYKRGGNEKLEAYTDSDYAGDLDDRKSTSGYVFMLSA 456

Query: 721  GAISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTVILCDNSSTI 900
            GA+SWSSKKQ VVTLSTTEAEFI+AASCACQ+IW++R+L++LGHIQ     + CDNSSTI
Sbjct: 457  GAVSWSSKKQPVVTLSTTEAEFIAAASCACQSIWMQRVLEKLGHIQVGSITVYCDNSSTI 516

Query: 901  KLSKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRL 1074
            KLSKNPVLHGRSKHIDVRFHFLRDL K+  +ELV+C +  QIAD+MTKPLK + F +L
Sbjct: 517  KLSKNPVLHGRSKHIDVRFHFLRDLTKDGTLELVHCNSQYQIADIMTKPLKFEVFEKL 574


>gb|PNY07765.1| putative copia-type polyprotein [Trifolium pratense]
          Length = 1321

 Score =  517 bits (1331), Expect = e-169
 Identities = 248/371 (66%), Positives = 302/371 (81%)
 Frame = +1

Query: 1    KLKKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIY 180
            KLK+ALYGL+QAPRAWYSRI+ YF    F KCP EH+L++ T      LI+ LYVDDLI+
Sbjct: 951  KLKRALYGLKQAPRAWYSRIDAYFSQTGFHKCPYEHTLYIKTGEKGNFLIVCLYVDDLIF 1010

Query: 181  TGNNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDEGIFIHQSKYANEVLNRFKMT 360
            TGN++ L  +FKQSM KEF+MTDLG M YFLG+EVVQ+  GIFI Q KYA EVL RFKM 
Sbjct: 1011 TGNDECLFKEFKQSMMKEFEMTDLGMMKYFLGIEVVQSAAGIFICQKKYAQEVLERFKMD 1070

Query: 361  EVNAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEKP 540
            + N V+ PIVPG+KL+++  G ++D+T YKQM+GSLMY+TATR DL ++V L+SR+ME P
Sbjct: 1071 DCNPVQIPIVPGTKLTRDVEGTKIDNTYYKQMVGSLMYITATRPDLTYAVSLISRYMESP 1130

Query: 541  TDLHMQAIKKIMRYLKGSLNHGIHYSRRGREELIGFSDSDYAGDLDDRKSTSGYVFMLGT 720
            T+LH Q +KKI+RYLKG++N+G+ Y +    EL+GFSDSDYAGDLDDRKSTSGYVF+L  
Sbjct: 1131 TELHHQVVKKILRYLKGTVNYGLFYKKSEINELVGFSDSDYAGDLDDRKSTSGYVFLLSG 1190

Query: 721  GAISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTVILCDNSSTI 900
             A+SWSSKKQ VVTLSTTEAEFI+AASC CQ IWLRRIL+ + H Q+   ++ CDNSSTI
Sbjct: 1191 AAVSWSSKKQPVVTLSTTEAEFIAAASCVCQGIWLRRILEEVKHTQQGPLMLFCDNSSTI 1250

Query: 901  KLSKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRLSK 1080
            KLSKNPVLHGRSKHIDVRFHFLRDL KEEV++L YC + +QIAD+ TKPLK+DSF++L  
Sbjct: 1251 KLSKNPVLHGRSKHIDVRFHFLRDLTKEEVVKLCYCRSDEQIADIFTKPLKVDSFMKLRA 1310

Query: 1081 ELGMRSIEEIN 1113
             LGM SIEEIN
Sbjct: 1311 LLGMCSIEEIN 1321


>gb|PRQ52345.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 1316

 Score =  516 bits (1330), Expect = e-169
 Identities = 247/371 (66%), Positives = 301/371 (81%)
 Frame = +1

Query: 1    KLKKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIY 180
            KL KALYGL+QAPRAWYSRIE YF+ E FE+CP EH+LFV +     +LI+SLYVDDLI+
Sbjct: 946  KLNKALYGLKQAPRAWYSRIEAYFIKEGFERCPHEHTLFVKSSEGGKILIVSLYVDDLIF 1005

Query: 181  TGNNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDEGIFIHQSKYANEVLNRFKMT 360
            TGN++ +  +FK+SM+ EFDM+DLG M YFLGVEVVQN+ GI+I Q KYA+E+L RF M 
Sbjct: 1006 TGNDEYMFEEFKKSMKDEFDMSDLGMMRYFLGVEVVQNEAGIYICQKKYASEILERFGME 1065

Query: 361  EVNAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEKP 540
            + N+V+NPIVPG KL K+  G +VD+T+YKQ++GSLMYLTATR DLM+ V L+SRFM  P
Sbjct: 1066 KANSVRNPIVPGIKLMKDEGGVKVDATMYKQIVGSLMYLTATRPDLMYVVSLISRFMSSP 1125

Query: 541  TDLHMQAIKKIMRYLKGSLNHGIHYSRRGREELIGFSDSDYAGDLDDRKSTSGYVFMLGT 720
            T+LHMQA K+++RYLKG++N G+ Y R G E+L  F+DSDYAGD+DDRKSTSGY+FM   
Sbjct: 1126 TELHMQAAKRVLRYLKGTINLGVLYRRNGEEKLEAFTDSDYAGDVDDRKSTSGYIFMFSD 1185

Query: 721  GAISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTVILCDNSSTI 900
            GA+SWSSKKQ VVTLSTTEAEFI+AA CACQ +W+RRI +RLGH Q+  T + CDNSSTI
Sbjct: 1186 GAVSWSSKKQPVVTLSTTEAEFIAAAYCACQGVWMRRIFERLGHAQRGCTTVYCDNSSTI 1245

Query: 901  KLSKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRLSK 1080
            KLSKNPV+HGRSKHIDVRFHFLR L K+  +ELVYC T DQIAD MTKPLKL+ F +L  
Sbjct: 1246 KLSKNPVMHGRSKHIDVRFHFLRQLTKDGTVELVYCNTQDQIADAMTKPLKLEVFEKLRD 1305

Query: 1081 ELGMRSIEEIN 1113
             LGM  +  IN
Sbjct: 1306 LLGMCLVPGIN 1316


>ref|XP_018816630.1| PREDICTED: uncharacterized protein LOC108988006 [Juglans regia]
          Length = 372

 Score =  483 bits (1242), Expect = e-167
 Identities = 225/369 (60%), Positives = 296/369 (80%)
 Frame = +1

Query: 7    KKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIYTG 186
            K  ++G +QAPRAWYSRIE YF+ E FE+C  EH+LF+  D  + +LI+SLYVDDLI+TG
Sbjct: 5    KPLVHGTKQAPRAWYSRIEAYFVKEGFERCSCEHTLFITGDGGK-ILIVSLYVDDLIFTG 63

Query: 187  NNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDEGIFIHQSKYANEVLNRFKMTEV 366
            N++ +   FK SM+ EFDMTDLGKM YFLGVEV+QN E I+I Q KYA EVL +F+M + 
Sbjct: 64   NDESMFVKFKNSMKLEFDMTDLGKMKYFLGVEVLQNPESIYISQRKYAKEVLEKFRMEKS 123

Query: 367  NAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEKPTD 546
            N+VKNPIVPG +L K+  G +V++T+YKQ++GSLMYLTATR DLM+ V L+SRFM  PT+
Sbjct: 124  NSVKNPIVPGFRLMKDEEGAKVNATMYKQLVGSLMYLTATRPDLMYVVSLISRFMANPTE 183

Query: 547  LHMQAIKKIMRYLKGSLNHGIHYSRRGREELIGFSDSDYAGDLDDRKSTSGYVFMLGTGA 726
            LH+Q  K+++RYLKG+++ GI Y + G  EL+ ++DSDYA  ++DR+STSGYVF+L  G 
Sbjct: 184  LHLQVAKRVLRYLKGTVDLGIFYRKEGNGELMAYTDSDYARYVNDRRSTSGYVFLLSEGV 243

Query: 727  ISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTVILCDNSSTIKL 906
            +SWSSKKQ VV LSTTEAEF++AASCACQ +W+RR+L++ GH Q + T +LCDNSSTIKL
Sbjct: 244  VSWSSKKQPVVALSTTEAEFVAAASCACQGVWMRRVLEKFGHSQGKCTTVLCDNSSTIKL 303

Query: 907  SKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRLSKEL 1086
            SKNPV+HGRSKHIDVRFHFL DL ++ V+EL +C T +Q+AD+MTKPLKLD FL+L + +
Sbjct: 304  SKNPVMHGRSKHIDVRFHFLCDLTRDGVVELKHCVTQEQVADIMTKPLKLDVFLKLCESM 363

Query: 1087 GMRSIEEIN 1113
            G+  +  +N
Sbjct: 364  GVCVVPRVN 372


>gb|PRQ33949.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 425

 Score =  483 bits (1244), Expect = e-167
 Identities = 232/371 (62%), Positives = 298/371 (80%)
 Frame = +1

Query: 1    KLKKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIY 180
            KL+KALYGL+QAPRAW+SRIE YFL E FEKCPSE +LFV T++   LLI+SLYVDDLIY
Sbjct: 55   KLRKALYGLKQAPRAWFSRIEAYFLREGFEKCPSEQTLFVKTNNRGKLLIVSLYVDDLIY 114

Query: 181  TGNNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDEGIFIHQSKYANEVLNRFKMT 360
            TGN+D ++ +FK SM KEFDM+DLGKM YFLG+EV Q  +GIFI Q KYA +VL RF M 
Sbjct: 115  TGNDDAMMREFKDSMMKEFDMSDLGKMRYFLGLEVQQLQDGIFISQKKYAMDVLRRFGME 174

Query: 361  EVNAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEKP 540
            + NAV NPIVPG K+SK+ SG  VD T YKQ++GSLMYLT+TR DLM++V L++R+M +P
Sbjct: 175  KSNAVLNPIVPGFKISKDESGIEVDGTFYKQLVGSLMYLTSTRPDLMYAVSLIARYMSQP 234

Query: 541  TDLHMQAIKKIMRYLKGSLNHGIHYSRRGREELIGFSDSDYAGDLDDRKSTSGYVFMLGT 720
            T+LH+ A K+++RY+KG++  GI Y +     L  ++DSDYAG L+DRKSTSGY FM+ +
Sbjct: 235  TELHLMAAKRVLRYVKGTVGFGILYKKEASGVLTAYTDSDYAGCLEDRKSTSGYAFMMSS 294

Query: 721  GAISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTVILCDNSSTI 900
            GA++WSS+KQ +VTLSTTEAEF++AA+CACQAIW++R+L  LG    + T + CDNSSTI
Sbjct: 295  GAVAWSSRKQPIVTLSTTEAEFVAAAACACQAIWMKRVLKMLGCEGDKCTTVFCDNSSTI 354

Query: 901  KLSKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRLSK 1080
            KLSKNPV+HGRSKHI VRFHFLRDL KE V++LV+C + +QIAD++TKPLKL+ F +L  
Sbjct: 355  KLSKNPVMHGRSKHIGVRFHFLRDLSKEGVVQLVHCGSQEQIADVLTKPLKLEQFQKLRG 414

Query: 1081 ELGMRSIEEIN 1113
             LG+    ++N
Sbjct: 415  LLGVCESPKVN 425


>gb|PNX86749.1| copia-type polyprotein [Trifolium pratense]
          Length = 395

 Score =  482 bits (1240), Expect = e-166
 Identities = 236/373 (63%), Positives = 293/373 (78%), Gaps = 2/373 (0%)
 Frame = +1

Query: 1    KLKKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIY 180
            KLKKALYGL+QAPRAWYS+IE YF+SE FEKC  EH+LFV   S++ +LI+S+YVDDLIY
Sbjct: 22   KLKKALYGLKQAPRAWYSKIESYFVSEQFEKCSHEHTLFVKYSSNKKVLIVSIYVDDLIY 81

Query: 181  TGNNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDEGIFIHQSKYANEVLNRFKMT 360
            TGN+++++ +FK SM+ +F MTDLGKM YFLGVEV Q +EGIFIHQ KY  E+L RF M 
Sbjct: 82   TGNDNQMMDEFKASMKDKFSMTDLGKMKYFLGVEVNQCEEGIFIHQQKYGTEILQRFGMQ 141

Query: 361  EVNAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEKP 540
            E N V +PIVPG KL K+ +    D+TLYKQMIG LMYL ATR D+ ++VCL +R+ME+P
Sbjct: 142  ECNKVCSPIVPGCKLVKDETALACDATLYKQMIGCLMYLLATRPDMTYAVCLAARYMERP 201

Query: 541  TDLHMQAIKKIMRYLKGSLNHGIHYSRRGREELI--GFSDSDYAGDLDDRKSTSGYVFML 714
            T++H+  +K+I+RYLKG+L  G+ Y  R   + +  G+SDSDYAGD DDRKSTSGYVF L
Sbjct: 202  TEMHVAVVKRILRYLKGTLTLGVLYKCRNGNDFVLQGWSDSDYAGDYDDRKSTSGYVFTL 261

Query: 715  GTGAISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTVILCDNSS 894
            G  AI WSSKKQ +VTLSTTEAEF+SAASCACQ IWL+ IL  L   Q     I CDNSS
Sbjct: 262  GESAICWSSKKQPIVTLSTTEAEFVSAASCACQCIWLKNILSHLLVEQAGCVSINCDNSS 321

Query: 895  TIKLSKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRL 1074
            +IKLSKNP++HGR KHIDVR+HFLRDL ++ VIEL YC + DQ+AD+MTKPLKL+SF RL
Sbjct: 322  SIKLSKNPIMHGRCKHIDVRYHFLRDLSRDGVIELKYCKSQDQLADIMTKPLKLESFCRL 381

Query: 1075 SKELGMRSIEEIN 1113
             + LGM   +++N
Sbjct: 382  REGLGMSIAQDVN 394


>gb|PRQ38020.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 610

 Score =  484 bits (1246), Expect = e-164
 Identities = 232/371 (62%), Positives = 293/371 (78%)
 Frame = +1

Query: 1    KLKKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIY 180
            KLKKALYGL+QAPRAWYSRIE YF+ E FEKC  EH+LF+   +    LI+SLYVDDLI+
Sbjct: 240  KLKKALYGLKQAPRAWYSRIENYFVKEGFEKCDFEHTLFIKMGAGGKCLIVSLYVDDLIF 299

Query: 181  TGNNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDEGIFIHQSKYANEVLNRFKMT 360
            TGN +++   FK+SM +EFDM+DLGKM YFLGVEV+Q  EGI+I Q K+A E+L RF M 
Sbjct: 300  TGNCEKMFVKFKESMMQEFDMSDLGKMRYFLGVEVLQCTEGIYISQKKFAKELLERFGMD 359

Query: 361  EVNAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEKP 540
              N+V NPIVPG K+ K+ +G RVD+T YKQM+GSLMYLT TR DLMF VC+ SRFM  P
Sbjct: 360  GCNSVHNPIVPGVKIGKDENGVRVDATTYKQMVGSLMYLTVTRPDLMFVVCMASRFMANP 419

Query: 541  TDLHMQAIKKIMRYLKGSLNHGIHYSRRGREELIGFSDSDYAGDLDDRKSTSGYVFMLGT 720
            T+LH Q +K+++RY+ G++  GI Y +RG + L  ++DSDYAGD+ DR+STSGYVF L  
Sbjct: 420  TELHFQIVKRVLRYVGGTVELGIFYKKRGDQMLQAYTDSDYAGDVSDRRSTSGYVFSLSG 479

Query: 721  GAISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTVILCDNSSTI 900
            GA+SW SKKQ VVTLSTTEAE+++AASCA Q IW++R+L++LG  Q    +I CDNSSTI
Sbjct: 480  GAVSWMSKKQPVVTLSTTEAEYVAAASCATQGIWMQRVLEKLGLTQNCSVIIKCDNSSTI 539

Query: 901  KLSKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRLSK 1080
            KLSKNPVLHGRSKHIDVRFHFLRDL ++  ++LV+C + DQ+AD+MTKPLKLD F++L +
Sbjct: 540  KLSKNPVLHGRSKHIDVRFHFLRDLTRDGKVKLVHCGSKDQVADIMTKPLKLDEFVKLRE 599

Query: 1081 ELGMRSIEEIN 1113
             LG+  +  IN
Sbjct: 600  LLGVCEVPVIN 610


>gb|KYP66580.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 380

 Score =  474 bits (1219), Expect = e-164
 Identities = 229/377 (60%), Positives = 295/377 (78%)
 Frame = +1

Query: 1    KLKKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIY 180
            KL KALYGL+QAPRAW+SRIE YF++  F+K  +E +LF        +LIIS+YVDDLIY
Sbjct: 4    KLHKALYGLKQAPRAWFSRIESYFVTAGFQKSQNEQTLFFKRSKLGKILIISVYVDDLIY 63

Query: 181  TGNNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDEGIFIHQSKYANEVLNRFKMT 360
            T +++ ++ DFK+SM +EFDMTDLG M +FLG+EV+Q  +GI+I Q KYA E+L RF M 
Sbjct: 64   TRDDELMMEDFKRSMHREFDMTDLGMMRFFLGIEVLQCSDGIYICQKKYALEILRRFGME 123

Query: 361  EVNAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEKP 540
            E N V NPIVPG KL ++  G +V+ T +KQM+GSLMY+T TR DLMF V L+SR+M +P
Sbjct: 124  ESNPVCNPIVPGYKLCRDEEGIKVNETHFKQMVGSLMYITTTRPDLMFVVSLISRYMSQP 183

Query: 541  TDLHMQAIKKIMRYLKGSLNHGIHYSRRGREELIGFSDSDYAGDLDDRKSTSGYVFMLGT 720
            T++H +  K+I+RYLKG+ N+GI Y R G EEL+ ++DSDYAGDL+DRKSTSGYVF++  
Sbjct: 184  TEMHAKVAKRILRYLKGTENYGILYKRGGIEELLAYTDSDYAGDLEDRKSTSGYVFIMSG 243

Query: 721  GAISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTVILCDNSSTI 900
            G++SWSS+KQ +VTLSTTE EFI+AA CA QA+W+RR+L  LG+ Q+  TVI CDN STI
Sbjct: 244  GSVSWSSRKQPIVTLSTTEVEFIAAAGCAYQAVWMRRVLKELGYKQEGSTVIKCDNCSTI 303

Query: 901  KLSKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRLSK 1080
             LSKNPV+HGRSKHIDVRFHFLRDL K   IELV+C T DQ+AD+MTKPLKLDSF +L  
Sbjct: 304  NLSKNPVMHGRSKHIDVRFHFLRDLTKNNEIELVHCGTQDQVADVMTKPLKLDSFQKLRV 363

Query: 1081 ELGMRSIEEIN*IAHCS 1131
            +LGM  + ++N +  C+
Sbjct: 364  QLGMCEVPKLNKVHDCN 380


>dbj|GAU31929.1| hypothetical protein TSUD_271130, partial [Trifolium subterraneum]
          Length = 541

 Score =  479 bits (1233), Expect = e-163
 Identities = 234/367 (63%), Positives = 291/367 (79%), Gaps = 2/367 (0%)
 Frame = +1

Query: 1    KLKKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIY 180
            KLKKALYGL+QAP AWYS+IE YF  E F+KC  EH+LFV   S+  +LI+SLYVDDLI 
Sbjct: 165  KLKKALYGLKQAPMAWYSKIEAYFTVEQFKKCSHEHTLFVKYGSNNKILIVSLYVDDLIC 224

Query: 181  TGNNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDEGIFIHQSKYANEVLNRFKMT 360
            TGN+  +I DFK+SM+K F MTDLGKM YFLGVEV Q+++GIFIHQ KYA E+L RF M 
Sbjct: 225  TGNDLSMIHDFKESMKKNFAMTDLGKMKYFLGVEVTQSEDGIFIHQHKYALEILKRFGME 284

Query: 361  EVNAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEKP 540
              N V +PIVPG KLSK+ +G   D+  +KQM+G LMYL ATR DL +S+CLV+RFME+P
Sbjct: 285  NCNKVCSPIVPGCKLSKDENGSATDAKRFKQMVGCLMYLLATRPDLAYSICLVARFMERP 344

Query: 541  TDLHMQAIKKIMRYLKGSLNHGIHYSRRG--REELIGFSDSDYAGDLDDRKSTSGYVFML 714
            T +H+  +K+IMRYLKG+L  GI Y      R ELIG+SDSDYAGDL+DRK+TSGYVFML
Sbjct: 345  TVMHIAVVKRIMRYLKGTLTDGIMYKHTNDKRLELIGWSDSDYAGDLNDRKNTSGYVFML 404

Query: 715  GTGAISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTVILCDNSS 894
            GTGAISWSSKKQ ++TLSTTEAE+++AA CACQ IWL+ +L+ L        VI CDNSS
Sbjct: 405  GTGAISWSSKKQPILTLSTTEAEYVAAAVCACQCIWLKNVLNHLQITHNNGIVIYCDNSS 464

Query: 895  TIKLSKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRL 1074
            +IKLSKNP++HGR KHIDVRFHF R+L K+ ++EL +C + +Q+ADLMTKPLKL++FL+L
Sbjct: 465  SIKLSKNPIMHGRCKHIDVRFHFPRNLTKDGIVELKHCKSQEQLADLMTKPLKLEAFLKL 524

Query: 1075 SKELGMR 1095
             + LGM+
Sbjct: 525  KEGLGMQ 531


>gb|KYP57482.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 380

 Score =  473 bits (1217), Expect = e-163
 Identities = 226/377 (59%), Positives = 296/377 (78%)
 Frame = +1

Query: 1    KLKKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIY 180
            K+ KALYGL+QAPRAW+S IE YF++  F+K  ++H+LF        +LIIS+YVDDLIY
Sbjct: 4    KIHKALYGLKQAPRAWFSCIESYFVAAGFQKSQNKHTLFFKRSKLGKILIISVYVDDLIY 63

Query: 181  TGNNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDEGIFIHQSKYANEVLNRFKMT 360
            TG+++ ++ DFK+SM  EFDMTDLG M +FLG+EV+Q  +GI+I Q KYA E+L RF M 
Sbjct: 64   TGDDELMMEDFKRSMHGEFDMTDLGMMRFFLGIEVLQCSDGIYICQKKYALEILRRFGME 123

Query: 361  EVNAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEKP 540
            E N + NPIVPG KL ++  G +V+ T +KQM+GSLMY+T TR DLMF V L+SR+M +P
Sbjct: 124  ESNPICNPIVPGYKLCRDEEGIKVNETHFKQMVGSLMYITTTRPDLMFVVSLISRYMSQP 183

Query: 541  TDLHMQAIKKIMRYLKGSLNHGIHYSRRGREELIGFSDSDYAGDLDDRKSTSGYVFMLGT 720
            T++H +  K+I+RYLKG+ N+GI Y R G EEL  ++DSDYAGDL+DRKSTSGYVF++  
Sbjct: 184  TEMHAKVAKRILRYLKGTENYGILYKRGGIEELQAYTDSDYAGDLEDRKSTSGYVFIMSG 243

Query: 721  GAISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTVILCDNSSTI 900
            G+++WSS+KQ +VT+STTEAEFI+AA CACQA+W+RR+L  LG+ Q+  TVI CDN STI
Sbjct: 244  GSVAWSSRKQPIVTISTTEAEFIAAAGCACQAVWMRRVLKELGYKQEGSTVIKCDNYSTI 303

Query: 901  KLSKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRLSK 1080
            KLSKNPV+HG+SKHIDVRFHFLRDL K   IELV+C T DQ+AD+MTKPLK+DSF +   
Sbjct: 304  KLSKNPVMHGKSKHIDVRFHFLRDLTKNNEIELVHCGTQDQVADVMTKPLKMDSFQKHRV 363

Query: 1081 ELGMRSIEEIN*IAHCS 1131
            +LGM  + E+N +  C+
Sbjct: 364  QLGMCEVPELNKVHDCN 380


>gb|KZV22085.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Dorcoceras hygrometricum]
          Length = 536

 Score =  478 bits (1231), Expect = e-163
 Identities = 226/372 (60%), Positives = 295/372 (79%), Gaps = 1/372 (0%)
 Frame = +1

Query: 1    KLKKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIY 180
            KL KALYGL+QAPRAW+SRIE YF+ E F   P+E +LF+        LI+S+YVDDL++
Sbjct: 161  KLHKALYGLKQAPRAWFSRIEAYFIKEGFSNSPNEQTLFIKRFGGN-FLIVSVYVDDLLF 219

Query: 181  TGNNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDE-GIFIHQSKYANEVLNRFKM 357
            TGNN RL+ +FK SM++EFDMTDLGKM YFLG+EVVQN E G+FI Q KYA E+++RF M
Sbjct: 220  TGNNVRLLEEFKCSMKREFDMTDLGKMRYFLGIEVVQNPEKGVFICQRKYAAEMIDRFGM 279

Query: 358  TEVNAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEK 537
               N V NPI PG K+ ++ +G+++DSTLYKQM+GSLMYLTA+R DLMF VCL+SRFM  
Sbjct: 280  QHHNPVYNPIAPGQKIGRDEAGEKIDSTLYKQMVGSLMYLTASRPDLMFVVCLLSRFMAS 339

Query: 538  PTDLHMQAIKKIMRYLKGSLNHGIHYSRRGREELIGFSDSDYAGDLDDRKSTSGYVFMLG 717
            PT LH+   K+++RYLKG+L +GI Y      +L+ ++DSDYAGD+DD KSTSGY FM+ 
Sbjct: 340  PTQLHLAVAKRVLRYLKGTLEYGIWYKHGTMSDLVAYTDSDYAGDMDDSKSTSGYAFMMS 399

Query: 718  TGAISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTVILCDNSST 897
             GA++WSS+KQ +VTLSTTEAE+++A +CACQAIW+RRIL  +GH Q ++ V+LCDN+ST
Sbjct: 400  GGAVAWSSRKQPIVTLSTTEAEYVAAVACACQAIWMRRILKEIGHEQAKEMVVLCDNTST 459

Query: 898  IKLSKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRLS 1077
            IKLSKN ++HGRSKHI VR+HFLRDL K+ +I+L++C T +Q+AD+MTKPLKL SF +  
Sbjct: 460  IKLSKNAIMHGRSKHIRVRYHFLRDLTKQGIIKLIHCNTEEQLADMMTKPLKLMSFQKAR 519

Query: 1078 KELGMRSIEEIN 1113
              LGM S+ E+N
Sbjct: 520  AALGMVSLSELN 531


>gb|PNX87200.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 396

 Score =  472 bits (1215), Expect = e-163
 Identities = 228/362 (62%), Positives = 290/362 (80%)
 Frame = +1

Query: 1    KLKKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIY 180
            KLKKALYGL+QAPRAWY++IE YF  E+FEKCP EH+LFV  D  + +LI+SL VDDLIY
Sbjct: 41   KLKKALYGLKQAPRAWYNKIEAYFGQENFEKCPHEHTLFVKQDEGR-ILIVSLCVDDLIY 99

Query: 181  TGNNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDEGIFIHQSKYANEVLNRFKMT 360
            TGNN  +  DFK SM++ F MTDLG+M YFLGVEV Q   GIFI+Q KYA E+L+RF M 
Sbjct: 100  TGNNTEMFEDFKYSMKRRFAMTDLGQMRYFLGVEVTQEKYGIFINQQKYAKEILSRFGME 159

Query: 361  EVNAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEKP 540
              NAV +PIVPG KLSK+ +G ++D T YKQ++G LMYL ATR DL FS+CL++R+M+KP
Sbjct: 160  MCNAVSSPIVPGCKLSKDENGKQIDVTKYKQIVGCLMYLLATRPDLAFSICLIARYMDKP 219

Query: 541  TDLHMQAIKKIMRYLKGSLNHGIHYSRRGREELIGFSDSDYAGDLDDRKSTSGYVFMLGT 720
            TD+H+ A K+I++YLKG+++ GI Y R    +L G++DSDYAGD+DDR+STSGY+F LG+
Sbjct: 220  TDMHLTAAKRILKYLKGTMSLGIFYKRGNELQLQGWTDSDYAGDIDDRRSTSGYIFKLGS 279

Query: 721  GAISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTVILCDNSSTI 900
             AISWSSKKQ +VTLSTTEAEF++AASC CQA     +L +LG  Q + TVI CDNSS+I
Sbjct: 280  SAISWSSKKQPIVTLSTTEAEFVAAASCVCQA-----VLHQLGKAQGKSTVIFCDNSSSI 334

Query: 901  KLSKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRLSK 1080
            KLSKNP++HGR KHIDVR++F+RDLVK+ ++EL +C T DQIAD+MTKPLKL+SF +  +
Sbjct: 335  KLSKNPIMHGRMKHIDVRYYFIRDLVKDGILELKHCSTCDQIADVMTKPLKLESFTKFRE 394

Query: 1081 EL 1086
             L
Sbjct: 395  ML 396


>dbj|GAU13002.1| hypothetical protein TSUD_173010 [Trifolium subterraneum]
          Length = 1126

 Score =  495 bits (1275), Expect = e-162
 Identities = 238/371 (64%), Positives = 299/371 (80%), Gaps = 1/371 (0%)
 Frame = +1

Query: 1    KLKKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIY 180
            KL+KALYGL+QAPRAWYS+IE YF  E FEKCP EH+LFV  D    +LI+SLYVDDLIY
Sbjct: 758  KLRKALYGLKQAPRAWYSKIESYFNQEKFEKCPHEHTLFVKQDKKGNILIVSLYVDDLIY 817

Query: 181  TGNNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDEGIFIHQSKYANEVLNRFKMT 360
            TGNN+ +  +FKQSM+ +F MTDLG+M +FLGVEV Q D GIF++Q KYA E+L RF M 
Sbjct: 818  TGNNEAMFEEFKQSMKSQFSMTDLGRMRFFLGVEVKQLDSGIFVYQQKYARELLERFHMD 877

Query: 361  EVNAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEKP 540
            + N V +PIVPG+KL ++ +G  VD T Y+Q++G LMYL ATR DL +SVCL++R+ME+P
Sbjct: 878  QCNVVCSPIVPGNKLIRDENGKTVDVTNYRQIVGCLMYLLATRPDLTYSVCLIARYMERP 937

Query: 541  TDLHMQAIKKIMRYLKGSLNHGIHYSRRGREELIGFSDSDYAGDLDDRKSTSGYVFMLGT 720
            T++H+ A K++MRYLKG+L+ GI Y R    +L G+SDSDYAGDLDDRKSTSGYVFMLG+
Sbjct: 938  TEIHLAAAKRVMRYLKGTLDLGILYRRNEEMKLQGWSDSDYAGDLDDRKSTSGYVFMLGS 997

Query: 721  GAISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTV-ILCDNSST 897
              ISWSSKKQA+VTLSTTEAEF++AASCACQ+IWLRR+L++LG  QK+  V I CDNSS+
Sbjct: 998  SIISWSSKKQAIVTLSTTEAEFVAAASCACQSIWLRRVLEQLG--QKQGCVKIHCDNSSS 1055

Query: 898  IKLSKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRLS 1077
            IKLSKNPV+HGR KHIDVR+HF RDL KE V+EL+YC T DQ+AD+MTK LKL++F +  
Sbjct: 1056 IKLSKNPVMHGRCKHIDVRYHFFRDLTKENVVELIYCNTQDQVADVMTKALKLEAFCKFR 1115

Query: 1078 KELGMRSIEEI 1110
              LG+  +  +
Sbjct: 1116 NMLGINDVHNL 1126


>gb|KZV40714.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Dorcoceras hygrometricum]
          Length = 536

 Score =  476 bits (1226), Expect = e-162
 Identities = 227/372 (61%), Positives = 294/372 (79%), Gaps = 1/372 (0%)
 Frame = +1

Query: 1    KLKKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIY 180
            KL KALYGL+QAPRAW+SRIE YF+ E F   P+E +LF+        LI+S+YVDDL++
Sbjct: 161  KLHKALYGLKQAPRAWFSRIEAYFIKEGFSNSPNEQTLFIKRFGGN-FLIVSVYVDDLLF 219

Query: 181  TGNNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDE-GIFIHQSKYANEVLNRFKM 357
            TGNN RL+ +FK SM++EFDMTDLGKM YFLG+EVVQN E G+FI Q KYA EV++RF M
Sbjct: 220  TGNNVRLLEEFKCSMKREFDMTDLGKMRYFLGIEVVQNPEKGVFICQRKYAAEVIDRFGM 279

Query: 358  TEVNAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEK 537
               N V NPI PG K+ ++ +G+++DSTLYKQM+GSLMYLTA+R DLMF VCL+SRFM  
Sbjct: 280  QHHNPVCNPIAPGQKIGRDEAGEKIDSTLYKQMVGSLMYLTASRPDLMFVVCLLSRFMAS 339

Query: 538  PTDLHMQAIKKIMRYLKGSLNHGIHYSRRGREELIGFSDSDYAGDLDDRKSTSGYVFMLG 717
            PT LH+   K+++RYLKG+L  GI Y      +L+ ++DSDYAGD+DD KSTSGY FM+ 
Sbjct: 340  PTQLHLAVAKRVLRYLKGTLECGIWYKHGTISDLVAYTDSDYAGDMDDSKSTSGYAFMMS 399

Query: 718  TGAISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTVILCDNSST 897
             GA++WSS+KQ +VTLSTTEAE+++A +CACQAIW+RRIL  +GH Q ++ V+LCDN+ST
Sbjct: 400  GGAVAWSSRKQPIVTLSTTEAEYVAAVACACQAIWMRRILKEIGHEQAKEMVVLCDNTST 459

Query: 898  IKLSKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRLS 1077
            IKLSKN ++HGRSKHI VR+HFLRDL K+ +I+L++C T +Q+AD+MTKPLKL SF +  
Sbjct: 460  IKLSKNAIMHGRSKHIRVRYHFLRDLTKQGIIKLIHCNTEEQLADMMTKPLKLMSFQKAR 519

Query: 1078 KELGMRSIEEIN 1113
              LGM S+ E+N
Sbjct: 520  AALGMVSLSELN 531


>dbj|GAU23220.1| hypothetical protein TSUD_172480 [Trifolium subterraneum]
          Length = 1323

 Score =  498 bits (1283), Expect = e-162
 Identities = 245/366 (66%), Positives = 300/366 (81%)
 Frame = +1

Query: 1    KLKKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIY 180
            KLKKALYGL+QAPRAWYS+IE YF+ E+F KCP EH+LFV  D D  +LI+SLYVDDLI+
Sbjct: 957  KLKKALYGLKQAPRAWYSKIESYFVQENFVKCPHEHTLFVKQDKDGSILIVSLYVDDLIF 1016

Query: 181  TGNNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDEGIFIHQSKYANEVLNRFKMT 360
            TGNN+ +   FK+SM+ +F MTDLGKM +FLGVEV Q + GIFIHQ KYANEVL RF M+
Sbjct: 1017 TGNNEAMFESFKKSMKSQFAMTDLGKMRFFLGVEVKQLNCGIFIHQQKYANEVLERFNMS 1076

Query: 361  EVNAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEKP 540
            + N V +P+VPG+KL+K+ +G  VD+T Y+QMIG LMYL ATRSDL FSVCL++R+ME+P
Sbjct: 1077 QCNKVCSPMVPGNKLTKDENGKPVDATSYRQMIGCLMYLLATRSDLTFSVCLIARYMERP 1136

Query: 541  TDLHMQAIKKIMRYLKGSLNHGIHYSRRGREELIGFSDSDYAGDLDDRKSTSGYVFMLGT 720
            T++H+ A K+++RYLKGS++ GI Y       L G+SDSDYAGDLDDRK+TSGYVFM+G+
Sbjct: 1137 TEIHLAAAKRVLRYLKGSVDLGILYKANCELTLEGWSDSDYAGDLDDRKNTSGYVFMIGS 1196

Query: 721  GAISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTVILCDNSSTI 900
              ISWSSKKQA+VTLSTTEAE++SAASCACQAIWLRRIL++L   Q   T I CDNSS+I
Sbjct: 1197 SPISWSSKKQAIVTLSTTEAEYVSAASCACQAIWLRRILEQLKQPQ-GCTTIQCDNSSSI 1255

Query: 901  KLSKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRLSK 1080
            KLSKNPV+HGR KHIDVR+HFLRDLVKE VI+L++C T DQ+AD+MTK LKLD F  L  
Sbjct: 1256 KLSKNPVMHGRCKHIDVRYHFLRDLVKENVIKLIHCNTQDQMADIMTKALKLDLFSNLRD 1315

Query: 1081 ELGMRS 1098
             LG+ S
Sbjct: 1316 RLGICS 1321


>gb|PNX95204.1| copia-type polyprotein [Trifolium pratense]
          Length = 1328

 Score =  498 bits (1283), Expect = e-162
 Identities = 242/365 (66%), Positives = 302/365 (82%), Gaps = 1/365 (0%)
 Frame = +1

Query: 1    KLKKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIY 180
            KLKKALYGLRQAPRAWYS+IE YF +E FEKC  EH+LFV    ++ +LI+SLYVDDLIY
Sbjct: 957  KLKKALYGLRQAPRAWYSKIEAYFSNEKFEKCSHEHTLFVK-QVEEKILIVSLYVDDLIY 1015

Query: 181  TGNNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDEGIFIHQSKYANEVLNRFKMT 360
            TGN+D LI DFK SM++ F MTDLGKMSYFLGVEV QND GIFI+Q KYA E+LNRF M 
Sbjct: 1016 TGNDDELIRDFKSSMKRNFAMTDLGKMSYFLGVEVTQNDRGIFINQQKYAAEILNRFNMD 1075

Query: 361  EVNAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEKP 540
              N V +PIVPG+KL K+ +G  VDST +KQM+G LMYL ATR DL +SVCL++R+ME+P
Sbjct: 1076 SCNFVCSPIVPGTKLFKDENGKCVDSTQFKQMVGCLMYLIATRPDLCYSVCLIARYMERP 1135

Query: 541  TDLHMQAIKKIMRYLKGSLNHGIHYSRRGRE-ELIGFSDSDYAGDLDDRKSTSGYVFMLG 717
            T++H+ A K+I+RYLKG++++G+ Y +     +L G++DSDYAGD DDRKSTSGYVF LG
Sbjct: 1136 TEIHLAAAKRILRYLKGTISYGVLYDKGSLNMKLEGWTDSDYAGDSDDRKSTSGYVFKLG 1195

Query: 718  TGAISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTVILCDNSST 897
            +GAISWSSKKQ +VTLSTTEAE+++AAS ACQ +WLRRIL +LG  Q + + + CDNSS+
Sbjct: 1196 SGAISWSSKKQPIVTLSTTEAEYVAAASGACQGVWLRRILQQLGQKQDKPSTMYCDNSSS 1255

Query: 898  IKLSKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRLS 1077
            IKLSKNP+LHGR KHIDVR+HFLRDL K+ V+ELV+C T +QIAD+MTKPLKL+SF++L 
Sbjct: 1256 IKLSKNPILHGRCKHIDVRYHFLRDLTKQGVVELVHCSTDEQIADIMTKPLKLESFVKLR 1315

Query: 1078 KELGM 1092
             +LG+
Sbjct: 1316 SKLGV 1320


>dbj|GAU23361.1| hypothetical protein TSUD_334080 [Trifolium subterraneum]
          Length = 1322

 Score =  498 bits (1281), Expect = e-161
 Identities = 240/370 (64%), Positives = 302/370 (81%)
 Frame = +1

Query: 1    KLKKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIY 180
            KLKKALYGL+QAPRAWYS+IE YF  E FEKCP EH+LFV  + ++ LLI+SLYVDDLIY
Sbjct: 952  KLKKALYGLKQAPRAWYSKIESYFGQEKFEKCPYEHTLFVKRNKEK-LLIVSLYVDDLIY 1010

Query: 181  TGNNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDEGIFIHQSKYANEVLNRFKMT 360
            TGN+  +  +FK SM+K+F MTDLGKM +FLGVEV Q + GIFI+Q KY  E+L+RF M 
Sbjct: 1011 TGNDVEMFNNFKDSMQKKFAMTDLGKMRFFLGVEVTQGEFGIFINQQKYVKEILSRFGME 1070

Query: 361  EVNAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEKP 540
              N V +P+VPG+KL K+  G  VDST YKQM+G LMYL ATR DL FSVCL++R+ME+P
Sbjct: 1071 ACNMVCSPMVPGNKLMKDEEGSAVDSTKYKQMVGCLMYLLATRPDLAFSVCLIARYMERP 1130

Query: 541  TDLHMQAIKKIMRYLKGSLNHGIHYSRRGREELIGFSDSDYAGDLDDRKSTSGYVFMLGT 720
            T++H+ A K+I+RYLKGS+N GI Y R   +EL G++DSDYAGDL+DRKSTSGYVF +G+
Sbjct: 1131 TEMHLAAAKRILRYLKGSMNLGILYKRNTTQELKGWTDSDYAGDLNDRKSTSGYVFKVGS 1190

Query: 721  GAISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTVILCDNSSTI 900
             AISWSSKKQ +VTLSTTEAEF++AASCACQ +WL+RIL +LG  Q + T+I CDN+S+I
Sbjct: 1191 SAISWSSKKQPIVTLSTTEAEFVAAASCACQGVWLKRILHQLGQTQDKSTIIYCDNTSSI 1250

Query: 901  KLSKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRLSK 1080
            KLSKNPV+HGR KHIDVR++FLRDLVK++V+EL +C T +QIAD+MTKPLKLDSF +  +
Sbjct: 1251 KLSKNPVMHGRCKHIDVRYYFLRDLVKDDVLELKHCNTEEQIADIMTKPLKLDSFYKFRE 1310

Query: 1081 ELGMRSIEEI 1110
             LG+  I ++
Sbjct: 1311 MLGVCDIMKL 1320


>dbj|GAU37106.1| hypothetical protein TSUD_278930 [Trifolium subterraneum]
          Length = 1013

 Score =  489 bits (1258), Expect = e-161
 Identities = 236/370 (63%), Positives = 297/370 (80%), Gaps = 2/370 (0%)
 Frame = +1

Query: 1    KLKKALYGLRQAPRAWYSRIEKYFLSEDFEKCPSEHSLFVNTDSDQGLLIISLYVDDLIY 180
            KL+KALYGL+QAPRAWYS+IE YF +E F+KC  E +LFV    +  +LI+SLYVDDLI 
Sbjct: 624  KLRKALYGLKQAPRAWYSKIESYFATEKFKKCSHEPTLFVKYGFNNKILIVSLYVDDLIC 683

Query: 181  TGNNDRLITDFKQSMEKEFDMTDLGKMSYFLGVEVVQNDEGIFIHQSKYANEVLNRFKMT 360
            TGNN  +I DFK+SM+K F MTDLGKM YFLGVEV+Q+D GIFIHQ KYA E+L RF M 
Sbjct: 684  TGNNLDMILDFKESMKKNFAMTDLGKMKYFLGVEVIQSDVGIFIHQHKYAIEILKRFGME 743

Query: 361  EVNAVKNPIVPGSKLSKEGSGDRVDSTLYKQMIGSLMYLTATRSDLMFSVCLVSRFMEKP 540
              N V +PIVPG KL K+ +G   D+T +KQM+G LMYL ATR DL +S+CLV+RFM++P
Sbjct: 744  NCNNVCSPIVPGCKLDKDENGKATDATTFKQMVGCLMYLLATRPDLAYSICLVARFMDRP 803

Query: 541  TDLHMQAIKKIMRYLKGSLNHGIHYSRRGRE--ELIGFSDSDYAGDLDDRKSTSGYVFML 714
            TD+H+ A+K+IMRYLKG+L  GI Y     +  EL+G+SDSDYAGD++DRKSTSGYVFML
Sbjct: 804  TDIHVAAVKRIMRYLKGTLTDGIMYKHTSNKNIELVGWSDSDYAGDVNDRKSTSGYVFML 863

Query: 715  GTGAISWSSKKQAVVTLSTTEAEFISAASCACQAIWLRRILDRLGHIQKEKTVILCDNSS 894
            GTGAI+WSSKKQ +VTLSTTEAE+++AA CACQ+IWL+ +L  +   Q +  VI CDNSS
Sbjct: 864  GTGAIAWSSKKQPIVTLSTTEAEYVAAAVCACQSIWLKAVLSHMKRPQNQAIVIHCDNSS 923

Query: 895  TIKLSKNPVLHGRSKHIDVRFHFLRDLVKEEVIELVYCCTSDQIADLMTKPLKLDSFLRL 1074
            +IKLSKNPV+HGR KHIDVRFHFLR+L K+ ++E+ +C + DQ+ D+MTKPLKL+SFL+L
Sbjct: 924  SIKLSKNPVMHGRCKHIDVRFHFLRNLTKDGMVEMKHCKSQDQLVDIMTKPLKLESFLKL 983

Query: 1075 SKELGMRSIE 1104
             + LGMRS +
Sbjct: 984  KEGLGMRSAQ 993


Top