BLASTX nr result
ID: Rehmannia29_contig00014224
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia29_contig00014224 (2893 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
          149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

gb|PNY00469.1| retrovirus-related Pol polyprotein from transposo...   791   0.0
gb|KYP34293.1| Retrovirus-related Pol polyprotein from transposo...   811   0.0
gb|PRQ55089.1| putative RNA-directed DNA polymerase [Rosa chinen...   790   0.0
gb|PNX92270.1| retrovirus-related Pol polyprotein from transposo...   788   0.0
gb|PNX74277.1| retrovirus-related Pol polyprotein from transposo...   769   0.0
gb|KYP40677.1| Retrovirus-related Pol polyprotein from transposo...   769   0.0
gb|PNX93614.1| retrovirus-related Pol polyprotein from transposo...   786   0.0
gb|PNX93928.1| hypothetical protein L195_g017092, partial [Trifo...   761   0.0
gb|KYP39497.1| Retrovirus-related Pol polyprotein from transposo...   779   0.0
gb|KYP42321.1| Copia protein [Cajanus cajan]                          775   0.0
emb|CAN71595.1| hypothetical protein VITISV_010143 [Vitis vinifera]   776   0.0
dbj|GAU46782.1| hypothetical protein TSUD_351810 [Trifolium subt...   772   0.0
gb|PNY17451.1| retrovirus-related Pol polyprotein from transposo...   768   0.0
gb|PNY16454.1| flavonol sulfotransferase-like protein [Trifolium...   768   0.0
gb|KYP42564.1| Retrovirus-related Pol polyprotein from transposo...   763   0.0
gb|OMO88956.1| Integrase, catalytic core [Corchorus capsularis]       763   0.0
gb|PNX97998.1| retrovirus-related Pol polyprotein from transposo...   747   0.0
dbj|GAU41679.1| hypothetical protein TSUD_272630 [Trifolium subt...
753 0.0 gb|PNX93131.1| retrovirus-related Pol polyprotein from transposo... 746 0.0 gb|KYP43110.1| Retrovirus-related Pol polyprotein from transposo... 755 0.0 >gb|PNY00469.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 778 Score = 791 bits (2044), Expect = 0.0 Identities = 410/787 (52%), Positives = 529/787 (67%), Gaps = 23/787 (2%) Frame = +3 Query: 567 WDHCILTAAHIINRLPAPILHNKTPFEILFNKMPDYSHFKIFGCLAYVSTLTAQRHKFQA 746 W+ C+ A H+INR+P+P+L +KTP+E+LF P H K+FGCL++ +TL A R KF + Sbjct: 2 WNFCVQHAVHVINRIPSPLLKSKTPYELLFKLSPTLLHLKVFGCLSFATTLQAHRTKFDS 61 Query: 747 RASKCVFIGYPLGSKGYKLYDLESHKVLISRHVIFCENIFPFQEIASKSVSPEAPLFPIS 926 RA KCVFIGY G+KGY LYDL SH + +SR+V+F E++ PF+ + + S +P FP+ Sbjct: 62 RARKCVFIGYKDGTKGYILYDLHSHNIFLSRNVVFYEHVLPFKSVPGPTSSHNSPTFPLY 121 Query: 927 D--LSSAQDQSFSHFQNINAQNDIVTTTQII--------------------SDAQNDFAD 1040 D L + + F D V+ + + A F D Sbjct: 122 DDPLDISHNPCVDTFPLSTGSLDNVSLNPALTPPLVPTLDSSPLTPPINTATPAPPSF-D 180 Query: 1041 AQHDLAAQNTTVVQTDPVNSHIEHSNPNPPILRRSSRNRTNPSYLQVYDCKIPPSIQSQH 1220 + H A Q + + + PV S E S P P R S+R PSYLQ Y C I +Q Sbjct: 181 SAHSAADQPSPNLDSVPVPS--EPSIPLP--TRVSTRVTRPPSYLQDYHCNIKSGCTNQV 236 Query: 1221 SAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKEWQDAMKAEIR 1400 S+ + +P+ LS + S +K F IS EP +Y QASK W+ AM AEI Sbjct: 237 SSNIV-----HPLSSVLSYNTCSPAYKLFCCSISSTIEPTTYNQASKFDCWKKAMDAEIT 291 Query: 1401 ALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQEEGLDYFDTF 1580 ALE N+TW + DLP K IGCKWVYK+KY +NG+IERYKARLVAKGYTQ EG+DYFDTF Sbjct: 292 ALELNKTWTVVDLPCGKVPIGCKWVYKIKYHANGTIERYKARLVAKGYTQMEGVDYFDTF 351 Query: 1581 SPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYLQSNDKKVCKL 1760 SPVAK+TTVR+L+A+AA +GW L QLDVNNAFLHGDLHEEVYM PPGY + KVCKL Sbjct: 352 SPVAKMTTVRVLLAVAAVRGWHLEQLDVNNAFLHGDLHEEVYMSLPPGY-DATPSKVCKL 410 Query: 1761 TKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITILCVYVDDIILAGN 1940 KSLYGLKQASRQW+ KL++ L++ G+ S++D SL+ SH S T L VYVDDI+LAG Sbjct: 411 
NKSLYGLKQASRQWYSKLSAALISLGYQASQADHSLYVKSHGTSFTALLVYVDDIVLAGT 470 Query: 1941 DISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYALDLLTETGFLGSK 2120 I +I+++KL LD++F IKDLG L++ LG+E+ARS+ GI + QRKY L+LL +TGFLGSK Sbjct: 471 SIEEIKSVKLFLDQQFKIKDLGPLRFFLGLEIARSSSGIFLNQRKYTLELLEDTGFLGSK 530 Query: 2121 PSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQTLSQYLSKPTTT 2300 P+T P+D + S+ D P D S YRRLIGR+LYLT TRP++SF++Q LSQY+S P Sbjct: 531 PATVPLDPHTKLSATDGVPFDDPSGYRRLIGRLLYLTHTRPDISFAVQHLSQYVSTPLVP 590 Query: 2301 HLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSISGFAVFLGDSLI 2480 H AA RI RY+KS P KG+ +SS SPL L F+D+DWA C TRRS++G+ V LG SLI Sbjct: 591 HYQAATRILRYLKSCPAKGVLFSSHSPLQLHGFADSDWACCPNTRRSVTGYCVLLGSSLI 650 Query: 2481 SWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIKHPQSL-MYTDSRSAYCIS 2657 SWKSKKQ T+SRSS EAEYRA+A TCE+QWL YL QD+HI PQS +Y D++SA ++ Sbjct: 651 SWKSKKQNTVSRSSTEAEYRALASLTCELQWLQYLFQDLHITFPQSASVYCDNKSAIYLA 710 Query: 2658 QNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLPPAQFNYLISKLS 2837 N HER+KHI+LDCH +REKLQ L L+ +PS Q AD+FTKPL F+ ++SKL Sbjct: 711 HNPTFHERSKHIELDCHIIREKLQSKLIHLLSVPSKSQLADVFTKPLHSPAFSSMLSKLG 770 Query: 2838 MLNIHAP 2858 + +IH P Sbjct: 771 LCSIHHP 777 >gb|KYP34293.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1376 Score = 811 bits (2094), Expect = 0.0 Identities = 420/801 (52%), Positives = 543/801 (67%), Gaps = 6/801 (0%) Frame = +3 Query: 486 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 665 G+VERKHQHLL V ALLF SK+P FW + +L A ++INR+ P+L NKTPF+ L+ + Sbjct: 601 GVVERKHQHLLNVTHALLFHSKLPYCFWSYALLHATYLINRITTPLLDNKTPFQKLYGQT 660 Query: 666 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 845 D + ++FGCL YVST TA R K RA CVF+G+ +KGY YDL + + ISR+V Sbjct: 661 CDITELRVFGCLCYVSTSTANRKKLDPRAHPCVFLGFSPTTKGYITYDLHTRAITISRNV 720 Query: 846 IFCENIFPFQEIASKSVSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTTQIISDAQ 1025 F EN FP + S S S + PIS F + +D+++ I+ D Sbjct: 721 SFYENHFPLLQSTS-STSNIPVVSPIS------------FGIHSPSHDLIS---ILPDPH 
764 Query: 1026 NDFADAQHDLAAQNTTVVQTD-----PVNSHIEHSNPNPPILRRSSRNRTNPSYLQVYDC 1190 QH++ + N D P ++ + PN LRRS+R R PSYLQ Y Sbjct: 765 ------QHNVTSPNPATTSHDSISLAPYSTTADSLPPNSSPLRRSTRLRNPPSYLQDYHH 818 Query: 1191 KIPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKE 1370 + + + H G YPI+ ++S +LSN +AF + IS ++EP SYA+A+K Sbjct: 819 SLTSTSTNLHP------GMLYPIEKYISYSRLSNDFQAFVSSISAVSEPHSYAEAAKHDC 872 Query: 1371 WQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQ 1550 W AM AE+ AL+ NQTW +T LP K +GC+W+YK+KY ++GSIERYKARLVAKGYTQ Sbjct: 873 WLKAMHAELEALKMNQTWTLTPLPPHKQAVGCRWIYKIKYNADGSIERYKARLVAKGYTQ 932 Query: 1551 EEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYL 1730 EGLDY TFSPVAK+TTVRLL+ALAA W L QLDVNNAFLHGDL+EEVYM P G Sbjct: 933 VEGLDYLATFSPVAKLTTVRLLLALAAVFDWHLKQLDVNNAFLHGDLNEEVYMTLPLGMR 992 Query: 1731 QSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITILCV 1910 +VCKL KSLYGLKQASRQWF KL+S L+++G+ QS SD SLF S T L + Sbjct: 993 PEYSNQVCKLQKSLYGLKQASRQWFAKLSSFLIHHGYHQSASDHSLFMKFSSSSTTALLI 1052 Query: 1911 YVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYALDL 2090 YVDDI+LAGN++S+I+ I LD F IKDLG LKY LG+EVAR+ GIH+ QRKY LD+ Sbjct: 1053 YVDDIVLAGNNLSEIQLITGLLDVAFKIKDLGNLKYFLGLEVARNKSGIHLSQRKYVLDI 1112 Query: 2091 LTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQTL 2270 L++ G + S+P +TPMD ++ S++ +PL D S YRRL+GR++YLT TRP++S+ + L Sbjct: 1113 LSDCGMMASRPVSTPMDYTSRLSASSGTPLADPSSYRRLLGRLIYLTTTRPDISYVVHHL 1172 Query: 2271 SQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSISG 2450 SQ++S P+T H A RI RY+K P GLF+ + S LHLK FSD+DWA C +TRRSI+G Sbjct: 1173 SQFMSAPSTAHSQAIFRILRYLKQAPGSGLFFPTNSSLHLKAFSDSDWAGCLDTRRSITG 1232 Query: 2451 FAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIK-HPQSLMY 2627 F+V+LGDSLISW+SKKQPT+SRSS+EAEYRA+A TT E+QWLTYLL D+H+ H +L+Y Sbjct: 1233 FSVYLGDSLISWRSKKQPTVSRSSSEAEYRALATTTSELQWLTYLLHDLHVPVHQPALLY 1292 Query: 2628 TDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLPPA 2807 D++SA I+ N HERTKHI +DCH VREKLQ GL KL+ + S Q 
ADIFTK L P+ Sbjct: 1293 CDNQSALHIAANQVFHERTKHIDIDCHLVREKLQSGLLKLLPVASPHQLADIFTKSLSPS 1352 Query: 2808 QFNYLISKLSMLNIHAPLEGG 2870 F L SKL MLN+++ LEGG Sbjct: 1353 MFTALYSKLGMLNLYSQLEGG 1373 >gb|PRQ55089.1| putative RNA-directed DNA polymerase [Rosa chinensis] Length = 1285 Score = 790 bits (2040), Expect = 0.0 Identities = 408/815 (50%), Positives = 552/815 (67%), Gaps = 21/815 (2%) Frame = +3 Query: 486 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 665 G+ ERKH+HLL +ARALLFQ+ +P FW ILTAA++INR P PIL KTPFE LF+K Sbjct: 468 GVAERKHRHLLNMARALLFQANLPKPFWGDAILTAAYLINRTPTPILQGKTPFEKLFHKE 527 Query: 666 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 845 P YSH ++FGC +VST + KF R+ +CVF+GYP G KGYK+Y+L + K L+SR V Sbjct: 528 PSYSHLRVFGCQCFVSTHPTRPSKFDPRSMECVFLGYPHGQKGYKVYNLTTKKSLVSRDV 587 Query: 846 IFCENIFPFQEIASKSVSPEAPLFP-ISDLSSAQDQSFSHFQNINAQNDIVTTTQIISDA 1022 IF EN FPF + + S LFP I L+ + S I + + +IS Sbjct: 588 IFFENAFPFPKNSESFPSQNTDLFPSIPRLAHYDNPSIP---KIPPSSPTHHSPPMISP- 643 Query: 1023 QNDFADAQHDLAAQNTTVVQTDPVNS-------------HIEHSNP---NPPILRRSSRN 1154 N + A+ L + + ++ TDPV+S HI +P PP R+S+R Sbjct: 644 -NPQSSAEQYLNSPSKSLSSTDPVSSDITLPNLDTISSDHIPSLSPPEQTPPRPRKSTRA 702 Query: 1155 RTNPSYLQVY--DCKIPPSIQSQHSAQTIIS-GKQYPIQDFLSVDKLSNKHKAFSAQISQ 1325 P+ LQ + D +P + S+ + + G + + LS LS+ H+ F+A I+ Sbjct: 703 TKLPTALQDFHIDAALPTRLAPSSSSNEVTTPGTAHSLSHVLSYANLSSPHRTFTANITL 762 Query: 1326 ITEPKSYAQASKQKEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGS 1505 EP S++QA K +W++AM+ E++AL+ N+TW + PA K IGCKWVYK+KY +G+ Sbjct: 763 QREPTSFSQAVKDPKWREAMRLEVQALQDNKTWSLVPPPAHKRPIGCKWVYKIKYNPDGT 822 Query: 1506 IERYKARLVAKGYTQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHG 1685 IERYKARLVAKGY+Q EGLDY +TF+PVAK+TTVR+L++LAA + W LHQLDVNNAFL+G Sbjct: 823 IERYKARLVAKGYSQVEGLDYRETFAPVAKLTTVRVLLSLAAQQNWHLHQLDVNNAFLNG 882 Query: 1686 DLHEEVYMLPPPGYLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSS 1865 DLHE+VYM PPG+ + + KVCKL KSLYGL+QAS+QWF KL+S L + GF QS SD S Sbjct: 883 
DLHEDVYMHLPPGFERKGEHKVCKLHKSLYGLRQASKQWFLKLSSALKSAGFKQSWSDYS 942 Query: 1866 LFTMSHDDSITILCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARS 2045 +F SH + T L VYVDD+ILAGN++ I K L F +KD+G LKY LG+EVARS Sbjct: 943 MFVRSHQGTFTALLVYVDDVILAGNNLDDIIRTKSFLSSHFKLKDMGQLKYFLGLEVARS 1002 Query: 2046 TKGIHICQRKYALDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLY 2225 GI + QRKYAL++L +TGFLG+KPS P++ + D L DASQYRRL+GR++Y Sbjct: 1003 KHGIALSQRKYALEILEDTGFLGAKPSRFPLEQNIILTQEDGRLLEDASQYRRLVGRLIY 1062 Query: 2226 LTITRPELSFSIQTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSD 2405 TITRP+L +++ LSQ++ KP HL AAH++ RY+K TP +G+F S PL L + D Sbjct: 1063 QTITRPDLVYAVHILSQFMDKPRQPHLDAAHKVLRYLKQTPGQGIFLPSKGPLELSAYCD 1122 Query: 2406 ADWARCQETRRSISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYL 2585 ADWARC++TRRS +G+ +FLG + ISWK+KKQ T+SRSSAEAEYR++A T CEI WL Y+ Sbjct: 1123 ADWARCKDTRRSTTGYCIFLGHAPISWKTKKQRTVSRSSAEAEYRSMATTCCEITWLQYI 1182 Query: 2586 LQDMHIKHPQSL-MYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPS 2762 L+D++I+H Q + ++ D+++A I+ N HERTKHI++DCH VREK+Q GL + HI + Sbjct: 1183 LKDLNIQHLQPVKLFCDNKAAIHIASNPVFHERTKHIEIDCHVVREKVQRGLIQTEHIRT 1242 Query: 2763 TQQTADIFTKPLPPAQFNYLISKLSMLNIHAPLEG 2867 +Q ADIFTKPL QF+ L+ KL ++NIH+ L G Sbjct: 1243 KEQPADIFTKPLSSEQFSLLLGKLGVINIHSNLRG 1277 >gb|PNX92270.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 1246 Score = 788 bits (2034), Expect = 0.0 Identities = 404/810 (49%), Positives = 544/810 (67%), Gaps = 12/810 (1%) Frame = +3 Query: 486 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 665 G+VERKH+HLL VARALLFQ+ +P +FW ILTAA++INR PILH KTPFEILF+K Sbjct: 437 GVVERKHRHLLNVARALLFQANLPKTFWGDSILTAAYLINRTSTPILHGKTPFEILFHKP 496 Query: 666 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 845 P Y H ++FGCL + S + KF R+ +C+F+GYP G+KGY++YDL + K SR V Sbjct: 497 PTYHHLRVFGCLCFASNHHHKPTKFDTRSIRCIFLGYPYGTKGYRVYDLATGKTFTSRDV 556 Query: 846 IFCENIFPFQEIASKSV--SPEAPLFPISDLSSAQDQSFSHF--QNINAQNDIVTTTQII 1013 IF E+IFP+ A+ S 
+ + PL I S+ ++ F A +D T I+ Sbjct: 557 IFHEHIFPYSSAATMSTPSTHQIPLPNIEPFDSSSHETTPSFPTHTPTAFDDQQPTPPIV 616 Query: 1014 SDAQNDFADAQHDLAAQNTTVVQTDPVNSHI----EHSNPNPPILRRSSRNRTNPSYLQV 1181 + ++Q + + ++ +S E + PP + R PSYL+ Sbjct: 617 TPT----IESQDTIIPTIVSTIEASTSDSTTITPPEAPHDIPPPALMAKRLIRPPSYLRQ 672 Query: 1182 YDCKIP---PSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQ 1352 Y ++ S S HS T G +P+ L+ D+LS H+AF+ IS I EP S+ Q Sbjct: 673 YHVEVSLPTRSSPSSHSVLTAPKGIPHPLSSVLNYDRLSPAHRAFTTSISAIKEPTSFHQ 732 Query: 1353 ASKQKEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLV 1532 A K +W+ AM E+RAL N TW + LP +K+ +GCKWVYK+K+ +G+IERYKARLV Sbjct: 733 AVKDPKWRFAMDEELRALHDNGTWSLQHLPPNKNPVGCKWVYKIKFNPDGTIERYKARLV 792 Query: 1533 AKGYTQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYML 1712 AKGY+Q EG DY +TF+PVAK+ TVRLL+A+A++ W L QLDVNNAFLHGDL EEVYM Sbjct: 793 AKGYSQIEGFDYRETFAPVAKLVTVRLLLAVASSMNWHLRQLDVNNAFLHGDLEEEVYMS 852 Query: 1713 PPPGYLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDS 1892 PPGY + + +VCKL KSLYGLKQASRQWF KL+ L+ + QSKSD SLF D S Sbjct: 853 LPPGYGRKGETRVCKLHKSLYGLKQASRQWFIKLSKVLILADYTQSKSDHSLFVRHRDTS 912 Query: 1893 ITILCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQR 2072 T L +YVDDIILAGN++ +IE IK HL E+F +KDLG LKY LGIEV+RS +GI + QR Sbjct: 913 FTALLIYVDDIILAGNNLQEIERIKAHLMEQFKLKDLGNLKYFLGIEVSRSKQGITLSQR 972 Query: 2073 KYALDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELS 2252 KYAL++L + G+L KP+ +PM+ + D + + S YRRL+GR++YLTITRP+L Sbjct: 973 KYALEILEDMGYLAVKPANSPMEQNLSLNKTDGDCIDEPSSYRRLVGRLIYLTITRPDLV 1032 Query: 2253 FSIQTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQET 2432 +++ LSQ++ KP HL AA R+ RYIK TP +G+ + STS L L F DADWARCQ+T Sbjct: 1033 YAVHILSQFMDKPRIPHLEAAQRVLRYIKKTPGQGILFPSTSTLQLNAFCDADWARCQDT 1092 Query: 2433 RRSISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIKHP 2612 RRS SG+ VF+G+SLISWK+KKQ T+SRSSAEAEYR++A CE+ WL +L D+ I+H Sbjct: 1093 RRSTSGYCVFIGNSLISWKTKKQVTVSRSSAEAEYRSMASVCCEVTWLLSVLHDLGIEHQ 1152 Query: 2613 
QSL-MYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFT 2789 Q + ++ D+++A I+ N HERTKHI++DCH VREK+Q G+ K HI +++Q AD+FT Sbjct: 1153 QPVKLFCDNQAALHIASNPVFHERTKHIEIDCHLVREKVQAGVVKTYHISTSEQPADVFT 1212 Query: 2790 KPLPPAQFNYLISKLSMLNIHAPLEGGCKD 2879 K L QF+ LI+KL M+NI++ L G K+ Sbjct: 1213 KALSVPQFSNLINKLGMINIYSNLRGSVKN 1242 >gb|PNX74277.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 762 Score = 769 bits (1985), Expect = 0.0 Identities = 407/780 (52%), Positives = 520/780 (66%), Gaps = 16/780 (2%) Frame = +3 Query: 567 WDHCILTAAHIINRLPAPILHNKTPFEILFNKMPDYSHFKIFGCLAYVSTLTAQRHKFQA 746 W+ + A HIINRLP+P+L+ K P+E+L+ K P H K+FGCL+Y +TL A R KF + Sbjct: 2 WNFSVQHAVHIINRLPSPLLNLKCPYELLYKKPPSLVHLKVFGCLSYATTLQAHRTKFDS 61 Query: 747 RASKCVFIGYPLGSKGYKLYDLESHKVLISRHVIFCENIFPFQEIASKSVSPEAPLFPIS 926 RA K +F+G+ G+KGY LYDL SH + +SR+V+F E FP + P+ S Sbjct: 62 RARKAIFLGFKDGTKGYILYDLSSHDIFVSRNVVFYETYFPLRH--------SQPVHNAS 113 Query: 927 DLS------SAQDQSFSHFQNINAQNDIVTTTQIISDAQNDFA-DAQHDLAAQNTTVVQT 1085 D S S D SH N + D+ + + + + D + Sbjct: 114 DFSKPLPSNSILDDPVSH-----THNSLPLPVMFEPDSTSPSSVNIEPDRTISSPASSSH 168 Query: 1086 DPVNSHIEHSNPN---PPI---LRRSSRNRTNPSYLQVYDCKIPPSIQSQHSAQTIISGK 1247 P++S H PN PP LRRS+R T P YL+ Y C S IS Sbjct: 169 TPLSSS-SHDRPNLAPPPYHDNLRRSTRTITRPGYLEDYHC-----YSVTGSVNNNISHP 222 Query: 1248 QYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKEWQDAMKAEIRALEKNQTWI 1427 YP+ LS D ++K+F IS I EPK+++QASK W+ AM AE+ AL++N+TW Sbjct: 223 NYPLSSVLSYDNCVPEYKSFCCSISAIIEPKTFSQASKLDCWRKAMDAELLALDENKTWS 282 Query: 1428 MTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQEEGLDYFDTFSPVAKITTV 1607 + DLP K IGCKWVYK+KY +NGSIERYKARLVAKGYTQ EG+DYFDTFSPVAKITTV Sbjct: 283 VVDLPHGKTPIGCKWVYKIKYHANGSIERYKARLVAKGYTQMEGIDYFDTFSPVAKITTV 342 Query: 1608 RLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYLQS-NDKKVCKLTKSLYGLK 1784 R L+ALA+ KGW L QLDVNNAFLHGDL+EEVYM PPGY + KVC+L KSLYGLK Sbjct: 343 RFLLALASIKGWDLEQLDVNNAFLHGDLNEEVYMSLPPGYSSAIGSNKVCRLHKSLYGLK 402 Query: 1785 
QASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITILCVYVDDIILAGNDISKIEAI 1964 QASRQW+ KL+S L+++G+ QS SD SL+ S D T L VYVDDI+LAGN +I+A+ Sbjct: 403 QASRQWYSKLSSALISFGYKQSVSDHSLYIKSTDSEFTALLVYVDDIVLAGNSSKEIQAV 462 Query: 1965 KLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYALDLLTETGFLGSKPSTTPMDC 2144 K LD+KF IKDLG L+Y LG E+ARS KGI + QRKY L+LL +TGFL +KPS P + Sbjct: 463 KHFLDQKFKIKDLGKLRYFLGFEIARSPKGIFVNQRKYTLELLQDTGFLATKPSNIPFNP 522 Query: 2145 KAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQTLSQYLSKPTTTHLAAAHRI 2324 + SS D +PL D S YRRLIGR+LYLT TRP++SFS+Q LSQ++SKP H AA RI Sbjct: 523 TTKLSSTDGAPLKDPSSYRRLIGRLLYLTNTRPDISFSVQHLSQFVSKPLIPHYTAATRI 582 Query: 2325 *RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSISGFAVFLGDSLISWKSKKQP 2504 +Y+KS P GLF+ +S L L ++D+DWARC +TR+SI+G+ VF+G SLISWKSKKQ Sbjct: 583 LKYLKSAPANGLFFPVSSSLKLTGYADSDWARCPDTRKSITGYCVFIGSSLISWKSKKQN 642 Query: 2505 TISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIK--HPQSLMYTDSRSAYCISQNSCHHE 2678 T+SRSS EAEYRA+A TCEIQWL YL QD +K +P S ++ DSRSA ++ N HE Sbjct: 643 TVSRSSTEAEYRALASLTCEIQWLQYLFQDFKMKFSNPAS-VFCDSRSAIYLAHNPAFHE 701 Query: 2679 RTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLPPAQFNYLISKLSMLNIHAP 2858 R+KHI++DCH +REK+Q L L+ IPS Q AD+FTKPL F L+SKL++ +IH+P Sbjct: 702 RSKHIEIDCHVIREKIQSQLIHLLPIPSNSQIADMFTKPLHFPAFFDLLSKLNLCSIHSP 761 >gb|KYP40677.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 810 Score = 769 bits (1985), Expect = 0.0 Identities = 403/793 (50%), Positives = 526/793 (66%), Gaps = 2/793 (0%) Frame = +3 Query: 486 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 665 GIVERKHQH+L V R+LLFQSKVP SFW I A HIINRLP P L+NK+PFE++FN+ Sbjct: 62 GIVERKHQHILNVCRSLLFQSKVPKSFWSFAIKHAVHIINRLPTPFLNNKSPFEMIFNQK 121 Query: 666 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 845 P+ K FGCLAYVSTL+ R+K + RA KC+F+G+ G+KG+ L++L S +L+SRH Sbjct: 122 PNLHDLKTFGCLAYVSTLSGGRNKLEPRAHKCIFLGFKTGTKGFVLFNLHSKSILLSRHA 181 Query: 846 IFCENIFPFQEIASKSVSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTT-QIISDA 1022 IF + +FP+ + + + PIS+ D F + + +D +T 
I +A Sbjct: 182 IFHQTVFPYINVFDNQSNSN--IVPISN-----DNIFQPYDFFPSYSDHLTNNPNSIENA 234 Query: 1023 QNDFADAQHDLAAQNTTVVQTDPVNSHIEHSNPNPPILRRSSRNRTNPSYLQVYDCKIPP 1202 N+F + D + + + IE P R S R+++ P YL Y C Sbjct: 235 SNNFENHTQDPPIEMSQSIP-------IETGQIQPITTRVSLRSKSRPGYLDQYHCYNTV 287 Query: 1203 SIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKEWQDA 1382 S S S YP+ ++ + +AS + W+ A Sbjct: 288 SSDST-------SNSLYPMHLYI------------------------FKEASTKDCWKQA 316 Query: 1383 MKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQEEGL 1562 M AE+ ALE+N+TW + LP K IGCKWVY+ K+++NG++ERYKARLVAKG+TQ EG+ Sbjct: 317 MIAELDALERNRTWSLVTLPPGKKLIGCKWVYRTKHKANGTVERYKARLVAKGFTQTEGI 376 Query: 1563 DYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYLQSND 1742 DYF+TFSPVAKIT++R L+ALA++ WF+HQLD++NAFLHGDL EEVYM PP G + Sbjct: 377 DYFETFSPVAKITSIRFLLALASSHNWFIHQLDIDNAFLHGDLDEEVYMRPPQGLRLPSS 436 Query: 1743 KKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITILCVYVDD 1922 K VCKL KSLYGLKQASR W QKLTS L G+ QS +D SLF SITIL +YVDD Sbjct: 437 KLVCKLEKSLYGLKQASRNWNQKLTSELTLMGYKQSFADHSLFVNFTGSSITILLIYVDD 496 Query: 1923 IILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYALDLLTET 2102 I+L+GND+++I+ +K HL KF IKDLG+LK+ LG+EVARS KGI + QRKY L+L+ ET Sbjct: 497 IVLSGNDMTEIKKVKAHLHNKFHIKDLGSLKFFLGLEVARSKKGILLNQRKYCLELIDET 556 Query: 2103 GFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQTLSQYL 2282 G LG KP+ TP D + + L D + +RRLIGR+LYLT TRP++SFS+Q LSQ++ Sbjct: 557 GLLGCKPAPTPADPAMKLHVDHGDLLHDPTVFRRLIGRLLYLTNTRPDISFSVQQLSQFV 616 Query: 2283 SKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSISGFAVF 2462 SKP H+ AA RI RY+K P GLFYSST+PL ++ FSD+DWA C TRRS++G+ VF Sbjct: 617 SKPREPHMQAALRIVRYLKGAPGLGLFYSSTNPLKIQAFSDSDWATCATTRRSVTGYCVF 676 Query: 2463 LGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIKHPQSL-MYTDSR 2639 +G+SLISWKSKKQ T+SRSS+EAEYRA+A TCE+QWL YL MHIK P ++DS+ Sbjct: 677 IGNSLISWKSKKQSTVSRSSSEAEYRALASLTCELQWLKYLCDSMHIKIPTPFATFSDSQ 736 Query: 2640 SAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLPPAQFNY 2819 
SA IS+N HERTKHI++DCH +R K+QEGL LIH+ S Q AD FTK L P F+ Sbjct: 737 SAIQISKNPTFHERTKHIEVDCHLIRIKIQEGLLHLIHVLSANQLADAFTKALFPKPFHT 796 Query: 2820 LISKLSMLNIHAP 2858 ISKL +LNI+ P Sbjct: 797 AISKLGLLNIYHP 809 >gb|PNX93614.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 1430 Score = 786 bits (2030), Expect = 0.0 Identities = 411/804 (51%), Positives = 543/804 (67%), Gaps = 9/804 (1%) Frame = +3 Query: 486 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 665 GIVERKHQH+L VARAL FQ+ +P +FW IL + H+INRLP P L +K+P+E+LF + Sbjct: 643 GIVERKHQHILNVARALSFQAFLPSNFWHLSILHSVHLINRLPTPFLQHKSPYEVLFQQP 702 Query: 666 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 845 P H + FGCLA+ STL R KF RA K VF+GY G+KG+ LYD+ +H L+SR+V Sbjct: 703 PTLLHLRTFGCLAFASTLHNHRTKFMPRARKTVFLGYRDGTKGFLLYDISNHSFLVSRNV 762 Query: 846 IFCENIFPFQEIASKSVSPEAPL----FPISDLSSAQDQSFSHFQNINAQNDIVTTTQII 1013 IF E++FP + S S L PI + + A + T T + Sbjct: 763 IFYEDVFPLSSVNSSHTSSTTTLDNFVLPIDPPNFPS--------SCPAPLSVSTGTNPL 814 Query: 1014 SDAQNDFADAQHDLAAQNTTVVQTDPVNSHIEHSNPNPPILRRSSRNRTNPSYLQVYDCK 1193 +D + A + + + V P NS I P R S+R R P YLQ + C Sbjct: 815 TDHAENSATLVDNQVSNSPAV---PPQNSSI------PAPTRVSNRIRKIPGYLQDFHCS 865 Query: 1194 IPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKEW 1373 + PS QH + + + YPI LS + +K F IS EPK++ QA K W Sbjct: 866 LLPS---QHQSSSSNAFSTYPISSSLSYTNCATAYKHFCLSISTTIEPKTFKQACKSDCW 922 Query: 1374 QDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQE 1553 ++AMK+E+ ALE N+TW + DLP K+ IGCKWVYK+K+ ++GSIERYKARLVAKGYTQ Sbjct: 923 KEAMKSELAALELNRTWSIVDLPTGKNPIGCKWVYKIKHNADGSIERYKARLVAKGYTQM 982 Query: 1554 EGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYLQ 1733 EG+DYFDTFSPVAK+TTV+ L+ALA+ KGWFL QLDVNNAFLHGDL+EEVYM PPG + Sbjct: 983 EGVDYFDTFSPVAKLTTVKTLLALASIKGWFLEQLDVNNAFLHGDLNEEVYMSLPPGVII 1042 Query: 1734 ----SNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITI 1901 SN KVC+L KSLYGLKQASRQW+ KL+S LL+ G+ QS +D SLF S T Sbjct: 1043 
PNSCSNTPKVCRLHKSLYGLKQASRQWYSKLSSALLSLGYSQSAADHSLFLKKVGSSFTA 1102 Query: 1902 LCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYA 2081 L VYVDDI+LAGN+ +I ++K LD++F IKDLG L++ +G+E+ARS KGI + QRKY Sbjct: 1103 LLVYVDDIVLAGNNSLEITSVKSFLDKRFQIKDLGNLRFFVGLEIARSKKGILLNQRKYT 1162 Query: 2082 LDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSI 2261 L+LL ++G L +KPS+TP D + ++S P D S YRRLIGR+LYLT TRP+++F++ Sbjct: 1163 LELLQDSGNLAAKPSSTPYDPSLKLHDSESPPYNDPSGYRRLIGRLLYLTTTRPDITFAV 1222 Query: 2262 QTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRS 2441 Q LSQ++S P H AA ++ RY+K++P KGLF+SS+S L L FSD+DWA C TR+S Sbjct: 1223 QQLSQFVSSPREVHFQAATKVLRYLKASPAKGLFFSSSSSLKLSGFSDSDWATCAITRKS 1282 Query: 2442 ISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIK-HPQS 2618 I+G+ VFLG SLISWKSKKQ T+SRSS+EAEYRA+A +CE+QWL YL +D+ IK + Sbjct: 1283 ITGYCVFLGTSLISWKSKKQSTVSRSSSEAEYRALASLSCELQWLHYLFKDLGIKFDAPA 1342 Query: 2619 LMYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPL 2798 ++Y D++SA ++ N HERTKHI++DCH VRE++Q GL L+ +PS+ Q AD+ TK L Sbjct: 1343 MVYCDNKSAIYLAHNPSFHERTKHIEIDCHVVRERIQSGLIHLLPVPSSSQLADVLTKQL 1402 Query: 2799 PPAQFNYLISKLSMLNIHAPLEGG 2870 + F LISKL +L+IH+P GG Sbjct: 1403 SSSAFASLISKLGLLDIHSPACGG 1426 >gb|PNX93928.1| hypothetical protein L195_g017092, partial [Trifolium pratense] Length = 865 Score = 761 bits (1964), Expect = 0.0 Identities = 388/795 (48%), Positives = 533/795 (67%), Gaps = 4/795 (0%) Frame = +3 Query: 486 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 665 G+VERKH+H+L VARALLF S +PL FW C+LTA ++INRLP+P+L NK+PFE+L+NK Sbjct: 83 GVVERKHRHILVVARALLFHSHLPLEFWGECVLTAVYLINRLPSPLLSNKSPFELLYNKP 142 Query: 666 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 845 P H ++FGCL Y +T+ HKF RA + +F+GYP G KGYK+YD E+ +SR V Sbjct: 143 PSLDHLRVFGCLCY-ATIVHPTHKFDPRAKRGIFVGYPTGQKGYKIYDPETKTFFVSRDV 201 Query: 846 IFCENIFPFQEIASKS--VSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTTQIISD 1019 FCE FP S+ +S I DL S SH Q+ Q DI +T + Sbjct: 202 
KFCETNFPSIPNTSEPNLISSHPSYEAIDDLPSPTS---SHHQS--QQTDIPSTHE---- 252 Query: 1020 AQNDFADAQHDLAAQNTTVVQTDPVNSHI-EHSNPNPPILRRSSRNRTNPSYLQVYDCKI 1196 N + + ++ + +V+ P+ +H + P P +R+S R++ P + Y + Sbjct: 253 -PNSPSHITTETSSAASPIVEPTPLTTHTTDPPTPFIPQVRKSVRDKHPPIWHNDYH--M 309 Query: 1197 PPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKEWQ 1376 + S T SG +YP+ +LS ++S+ + AF A I+ EP+SY QA WQ Sbjct: 310 STQVNKTPSEPTSGSGTRYPLSHYLSYSRISSSNCAFLANITAHREPQSYDQAVHDPLWQ 369 Query: 1377 DAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQEE 1556 DAM AE+ ALE+N TW + LP+ IGCKWVYK+KY+S+G+IERYKARLVAKGYTQ E Sbjct: 370 DAMNAELEALEQNNTWSLVPLPSGHKPIGCKWVYKIKYKSDGTIERYKARLVAKGYTQVE 429 Query: 1557 GLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYLQS 1736 G+DY +TFSP AK+TT+R L+ +AAA+ WF+HQLDV NAFLHGDLHE VYM PPPG + Sbjct: 430 GIDYQETFSPTAKVTTLRCLLTVAAARNWFIHQLDVQNAFLHGDLHELVYMEPPPGLRRQ 489 Query: 1737 NDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITILCVYV 1916 + VC+L KSLYGLKQASR WF + + G+ QSK+D SLFT S S T + +YV Sbjct: 490 GENVVCRLNKSLYGLKQASRNWFSTFSEVIQKAGYQQSKADYSLFTKSQGTSFTAVLIYV 549 Query: 1917 DDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYALDLLT 2096 DDI+L GND+ +++ +K L ++F IKDLG LKY LGIE +RS KGI + QRKYALD+L Sbjct: 550 DDILLTGNDLQEMKRLKEFLLKRFRIKDLGNLKYFLGIEFSRSKKGIFMSQRKYALDILQ 609 Query: 2097 ETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQTLSQ 2276 ++G G++P PM+ + + D L D ++YRRL+GR++YLT+TRP++ +S+QTLSQ Sbjct: 610 DSGLTGARPDKFPMEQNLKLTPTDGVVLNDPTKYRRLVGRLIYLTVTRPDIVYSVQTLSQ 669 Query: 2277 YLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSISGFA 2456 ++ +P H AA R+ RYIK TP +GL +SST+ L LK F D+DW C TRRS++GF Sbjct: 670 FMHEPRKPHWDAALRVLRYIKGTPGQGLLFSSTNDLTLKAFCDSDWGGCHATRRSVTGFC 729 Query: 2457 VFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHI-KHPQSLMYTD 2633 +FLG+SLISWKSKKQ +SRSSAE+EYRA+A T E+ WL ++LQD+ + ++ + ++ D Sbjct: 730 LFLGNSLISWKSKKQVVVSRSSAESEYRAMANTCLELTWLRFILQDLKVSQNTPTPLFCD 789 Query: 2634 
SRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLPPAQF 2813 +++A I+ N HERTKHI++DCH VREKLQ G+ ++P+ Q AD+FTK L QF Sbjct: 790 NQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIINPSYVPTRFQLADVFTKALGKDQF 849 Query: 2814 NYLISKLSMLNIHAP 2858 L SKL + +IH+P Sbjct: 850 VTLRSKLGLHDIHSP 864 >gb|KYP39497.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1445 Score = 779 bits (2011), Expect = 0.0 Identities = 404/799 (50%), Positives = 528/799 (66%), Gaps = 8/799 (1%) Frame = +3 Query: 486 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 665 GIVERKHQHLL V RALLFQS++P +FW + I A HIINRLP L K+PFE++++ Sbjct: 691 GIVERKHQHLLNVCRALLFQSQIPKTFWSYAIKHAVHIINRLPTRFLKQKSPFEMIYHHK 750 Query: 666 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 845 PD ++FGCLAY STL A R K RASKC+F+GY G+KGY LY+L S +SR+V Sbjct: 751 PDLHDLRVFGCLAYASTLAAGRTKLAPRASKCIFLGYKSGTKGYVLYNLHSRFFSLSRNV 810 Query: 846 IFCENIFPFQEIASKSVSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTTQIISDAQ 1025 IF E +FP+ S F ++ +D + T ++ Sbjct: 811 IFHETVFPY----------------------------SDFHKFSSNSDAILT-----EST 837 Query: 1026 NDFADAQ-HDLAAQNTTVVQT------DPVNSHIEHSNPNPPILRRSSRNRTNPSYLQVY 1184 NDFA + + + TT T P+ ++ P R S+R + P+YL Y Sbjct: 838 NDFAHYELYPITVLETTNTHTPSEQLQQPLTEEMQDIPVQP---RVSTRQKFRPNYLNQY 894 Query: 1185 DCKIPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQ 1364 C +SA + S YP+ +LS +K S+ + AF IS EP +YAQA Q Sbjct: 895 HC---------YSATSSNSACLYPLHSYLSYNKCSSNYTAFCLSISAHVEPSNYAQAITQ 945 Query: 1365 KEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGY 1544 W+ AM E+ AL +N+TW + LP + IGCKWVY++KY+++GS++R+KARLVAKG+ Sbjct: 946 DCWKQAMMTELDALNRNRTWSLVSLPPGRKLIGCKWVYRIKYKADGSVDRHKARLVAKGF 1005 Query: 1545 TQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPG 1724 TQ EG+DYF+TFSPVAK+TTVR L+ALA+ WFLHQLDV+NAFLHGDL EEVYM PP G Sbjct: 1006 TQTEGIDYFETFSPVAKLTTVRFLLALASTNNWFLHQLDVDNAFLHGDLEEEVYMRPPQG 1065 Query: 1725 YLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITIL 1904 + K VCKL 
KSLYGLKQASR W QKL S LL+ G+ QS +D SLF SITIL Sbjct: 1066 LQLPDSKLVCKLEKSLYGLKQASRNWNQKLNSELLHLGYKQSSADHSLFIKKQGSSITIL 1125 Query: 1905 CVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYAL 2084 VYVDD++L+GN+ S+I+ +K HL +KF IKDLG LK+ LG+EVARS KGI + QRKY L Sbjct: 1126 LVYVDDVVLSGNNFSEIQIVKQHLHQKFQIKDLGPLKFFLGLEVARSKKGIILNQRKYCL 1185 Query: 2085 DLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQ 2264 +L+ E+G LGSKP+ TP D + ++ S L D + +RRLIGR+LYLT TRP++SFS+Q Sbjct: 1186 ELIDESGLLGSKPAPTPADPAIKLHADHGSLLNDPTSFRRLIGRLLYLTNTRPDISFSVQ 1245 Query: 2265 TLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSI 2444 LSQ++S+P HL AA RI RY+K P GLFY + + L ++ FSD+DWA C TR+S+ Sbjct: 1246 QLSQFVSQPREPHLQAALRIVRYLKGAPGLGLFYPAENHLRIQAFSDSDWATCSTTRKSV 1305 Query: 2445 SGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIKHPQSL- 2621 +G+ VFLG SL+SWKSKKQ T+SRSS+EAEYRA+A TCE+QWL +L + +K P Sbjct: 1306 TGYCVFLGKSLVSWKSKKQSTVSRSSSEAEYRALASLTCELQWLKFLSDSLFVKIPAPFS 1365 Query: 2622 MYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLP 2801 +++DS+SA I++N HERTKHI++DCH +R KLQE L LIH+PS Q AD FTK L Sbjct: 1366 VFSDSQSAIQIAKNPTFHERTKHIEVDCHLIRIKLQEELLHLIHVPSANQLADAFTKSLY 1425 Query: 2802 PAQFNYLISKLSMLNIHAP 2858 P F + ISKL + NIH P Sbjct: 1426 PKLFLHAISKLGLSNIHGP 1444 >gb|KYP42321.1| Copia protein [Cajanus cajan] Length = 1456 Score = 775 bits (2002), Expect = 0.0 Identities = 395/801 (49%), Positives = 537/801 (67%), Gaps = 10/801 (1%) Frame = +3 Query: 486 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 665 GIVERKHQH+L V RALLFQ+K+P FW + A HIINRLP P+L K+PFE+++N Sbjct: 662 GIVERKHQHILNVCRALLFQAKLPKQFWSFAVKQATHIINRLPTPLLSQKSPFEMIYNCK 721 Query: 666 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 845 PD + K+FGCLA+ +TL+++R K RASKC+F+GY G+KG+ L++L + LISR V Sbjct: 722 PDLTELKVFGCLAFATTLSSKRTKLDRRASKCIFLGYKNGTKGFLLFNLHNKSFLISRDV 781 Query: 846 IFCENIFPFQ-EIASKSVSPEAPLFPISDLSSA------QDQSFSHFQ-NINAQNDIVTT 1001 +F E IFP+ + S S S L + D + +FSH +I + ++ Sbjct: 782 
LFYEKIFPYSAHVPSMSASDSLLLDVVKDNDTTIYSDPFPTTTFSHGSPSIPLDTPLPSS 841 Query: 1002 TQIISDAQNDFADAQH-DLAAQNTTVVQTDPVNSHIEHSNPNPPILRRSSRNRTNPSYLQ 1178 IS + F+ + + + N+ + S P R S+R R P YLQ Sbjct: 842 ETTISTDRPPFSPINTCPIPTATLSTPELPSSNTTNDASQVVMPQTRVSTRIRKPPRYLQ 901 Query: 1179 VYDCKIPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQAS 1358 Y C+ ++ S +A + YP+ F++ + S H +F IS EP S+ +A+ Sbjct: 902 EYYCE---NLASSSAASNCL----YPLSSFVTYNNCSPSHTSFCLSISAQHEPTSFKEAN 954 Query: 1359 KQKEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAK 1538 ++ W+ AM+AE++ALEKNQTW + LP K +GCKWVY+VKY+ +GS+ERYKARLVAK Sbjct: 955 SEECWRRAMEAELQALEKNQTWSLVRLPEGKRPVGCKWVYRVKYKVDGSVERYKARLVAK 1014 Query: 1539 GYTQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPP 1718 G+TQ EG+DYF+TFSPV K++TVR L++LAAA WFLHQLDV+NAFLHGDL EEVYM PP Sbjct: 1015 GFTQTEGVDYFETFSPVVKLSTVRFLLSLAAAHNWFLHQLDVDNAFLHGDLFEEVYMKPP 1074 Query: 1719 PGYLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSIT 1898 PG+ S+ + VCKL KSLYGLKQASRQW QKLT L++ FIQS +D SLF SIT Sbjct: 1075 PGFKLSHPRLVCKLHKSLYGLKQASRQWNQKLTEALISLNFIQSSTDHSLFIKKSHSSIT 1134 Query: 1899 ILCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKY 2078 L VYVDD++L GND+++I A+K +L +F IKDLG LK+ LG+E+ARS G+ + QRKY Sbjct: 1135 ALLVYVDDVVLTGNDMAEISAVKAYLHAQFHIKDLGPLKFFLGLEIARSQSGLILNQRKY 1194 Query: 2079 ALDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFS 2258 L+LL+E G KP +TP+D + +++ PL D + +RRLIGR+LYLT TRP++SF+ Sbjct: 1195 CLELLSEHGLTDCKPVSTPIDASVKLYASEGLPLDDPTIFRRLIGRLLYLTNTRPDISFA 1254 Query: 2259 IQTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRR 2438 +Q LSQ++ P TH AA RI RY+KS+P GLFY S + ++ FSD+DWA C TRR Sbjct: 1255 VQQLSQFVDSPRATHFQAALRILRYLKSSPALGLFYPSQTEHRIQAFSDSDWASCPNTRR 1314 Query: 2439 SISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIKHPQS 2618 S++GF +F G +LISWKSKKQ T+SRSS+EAEYRA+A TCE+QWL +L D+ I P Sbjct: 1315 SVTGFCIFYGSALISWKSKKQSTVSRSSSEAEYRALASVTCELQWLLFLCHDLSINIPTP 1374 Query: 2619 
L-MYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKP 2795 ++ DS+SA I++N HERTKHI++DCH R K+Q+GL L H+PS Q AD+FTK Sbjct: 1375 FSIFCDSQSAIYIAKNPTFHERTKHIEVDCHLTRLKIQQGLIHLFHVPSKSQLADVFTKA 1434 Query: 2796 LPPAQFNYLISKLSMLNIHAP 2858 L P F +SKL +++I+ P Sbjct: 1435 LYPRNFTEAVSKLCLIDIYNP 1455 >emb|CAN71595.1| hypothetical protein VITISV_010143 [Vitis vinifera] Length = 1523 Score = 776 bits (2005), Expect = 0.0 Identities = 401/806 (49%), Positives = 539/806 (66%), Gaps = 12/806 (1%) Frame = +3 Query: 486 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 665 G+VERKH+HLL VARALLFQS +P FW ILTAA++INR P P+L KTPFE LF+K Sbjct: 684 GVVERKHRHLLNVARALLFQSHLPKPFWGDAILTAAYLINRTPTPLLQGKTPFEKLFHKS 743 Query: 666 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 845 P+YSH ++FGC +VST + KF R+ + VFIGYP G KGYK+Y L+ K LISR V Sbjct: 744 PNYSHLRVFGCRCFVSTHPLRPSKFDPRSIESVFIGYPHGQKGYKVYSLKDKKXLISRDV 803 Query: 846 IFCENIFPFQEIASKSVSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTTQIISDAQ 1025 F E FP+Q S + FP + D F + + T+ + Q Sbjct: 804 TFFETEFPYQNXLSTTSPSLDTFFPSLPQTPDIDDDHISFNHSGSNLQPSATSSVDXHPQ 863 Query: 1026 ----NDFADAQHDLAAQNTTVVQTDPVNSHIEHSNPNPPILRRSSRNRTNPSYLQVYDCK 1193 N + + D + ++ + PV S P+P RRSSR P+ LQ + + Sbjct: 864 PTLDNSHSSSHVDPPSSPPSLNTSPPVISQ-----PSPSQPRRSSRPTKTPTTLQDFHIE 918 Query: 1194 -------IPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQ 1352 +PPS S+ + SG + + LS D+LS HKAF+ +I+ EP+S++Q Sbjct: 919 AALPSRPVPPSSTSEVAH----SGTIHSLSQVLSYDRLSPMHKAFTVKITLAKEPRSFSQ 974 Query: 1353 ASKQKEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLV 1532 A W++AM EI+AL+ N+TW + LP+ K IGCKWVYK+KY +G+IERYKARLV Sbjct: 975 AVLDSRWREAMNTEIQALQANKTWSLVPLPSHKKPIGCKWVYKIKYNPDGTIERYKARLV 1034 Query: 1533 AKGYTQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYML 1712 AKG++Q EG+DY +TF+PVAK+TTVR+L++LA+ +GW LHQLDVNNAFL+GDL+E+VYM Sbjct: 1035 AKGFSQVEGIDYRETFAPVAKLTTVRVLLSLASIQGWHLHQLDVNNAFLNGDLYEDVYMQ 1094 Query: 1713 PPPGYLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDS 1892 PPG+ + 
+ +VCKL KSLYGLKQASRQWF KL+S L GF QS SD SLF + Sbjct: 1095 LPPGFGRKGEHRVCKLHKSLYGLKQASRQWFLKLSSALKAAGFKQSWSDYSLFXRNTQGR 1154 Query: 1893 ITILCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQR 2072 T L VYVDD+ILAGN + I K L F +KD+G L+Y LGIEVARS +GI +CQR Sbjct: 1155 FTTLLVYVDDVILAGNSLEDIIETKQFLASHFKLKDMGQLRYFLGIEVARSKQGIVLCQR 1214 Query: 2073 KYALDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELS 2252 KYAL+LL + GFLG+KPS P++ + D + L DASQYRRL+GR++YLTITRP+L Sbjct: 1215 KYALELLEDAGFLGAKPSRFPVEQSLTLTRGDGAELKDASQYRRLVGRLIYLTITRPDLV 1274 Query: 2253 FSIQTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQET 2432 +++ LSQ++ P HL AA+++ RY+K TP +G+F ST L L + DADWARC++T Sbjct: 1275 YAVHILSQFMDTPRQPHLDAAYKVLRYVKQTPGQGIFLPSTGQLELTAYCDADWARCKDT 1334 Query: 2433 RRSISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIKHP 2612 RRS +G+ +F G++ ISWK+KKQ T+SRSSAEAEYR++A T CEI WL LL D+++ H Sbjct: 1335 RRSTTGYCIFFGNAPISWKTKKQGTVSRSSAEAEYRSMATTCCEITWLRSLLADLNVNHA 1394 Query: 2613 QSL-MYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFT 2789 ++ ++ D+++A I+ N HERTKHI++DCH VREK+Q GL K +HI + +Q AD+FT Sbjct: 1395 HAVKLFCDNQAAIHIASNPVFHERTKHIEMDCHVVREKVQRGLVKTMHIRTQEQPADLFT 1454 Query: 2790 KPLPPAQFNYLISKLSMLNIHAPLEG 2867 KPL QF+ L+SKL ++NIH L G Sbjct: 1455 KPLSSKQFSTLLSKLGVINIHTNLRG 1480 >dbj|GAU46782.1| hypothetical protein TSUD_351810 [Trifolium subterraneum] Length = 1512 Score = 772 bits (1994), Expect = 0.0 Identities = 404/815 (49%), Positives = 528/815 (64%), Gaps = 21/815 (2%) Frame = +3 Query: 486 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 665 G VERKHQHLL V R+LLFQSK+P FW + + A +IINR+ P+L NK+P+ +L+NK Sbjct: 691 GRVERKHQHLLNVGRSLLFQSKLPKKFWSYAVSHATYIINRVCTPLLQNKSPYHLLYNKP 750 Query: 666 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 845 PD K+FG L Y STL QR K RA KC+F+GY G KG LYD+ +H + +SR++ Sbjct: 751 PDLEQLKVFGSLCYASTLQNQRTKLDPRARKCIFLGYKSGMKGVILYDIHNHNIFVSRNI 810 Query: 846 IFCENIFPFQEIASKSV-----SPEAPLFPISDLSSAQDQSFSHFQN--------INAQN 986 ++I 
P+ +S S+ SP F S++ S H + + +N Sbjct: 811 THYDHILPYAS-SSYSIPWSYHSPNIDPFITPPTSNSGSSSIPHSTDHIHFNTPMCDQEN 869 Query: 987 DIVTTTQIISD------AQNDFADAQHDLAAQNTTVVQTDPVNSHIEHSNPNPPILRRSS 1148 ++Q SD ND +Q + Q P + S+ + P R+S+ Sbjct: 870 PSQPSSQTPSDLFVPQVTDNDIVSSQPSIPHQPHDTHSPLPTTNLPSPSHNSIPQTRQST 929 Query: 1149 RNRTNPSYLQVYDCKIPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQI 1328 R P +L Y C + S S+ G YPI F S +S+K + ++ I+ Sbjct: 930 RMSVKPKHLSDYVCNL-----SVDSSPPSSPGILYPISSFHSYSNISSKFRNYALSITAS 984 Query: 1329 TEPKSYAQASKQKEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSI 1508 EP+ Y +AS+Q+ W DAM EI+AL+ N+TW PA IGCKWVYKVK++++GS+ Sbjct: 985 VEPRDYKEASQQQCWVDAMNNEIQALQHNKTWCYVTPPAHIKPIGCKWVYKVKHKADGSV 1044 Query: 1509 ERYKARLVAKGYTQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGD 1688 ERYKARLVAKGY Q EGLD+FDTFSPVAKITTVR LIALA+ + W L+Q+DVNNAFLHGD Sbjct: 1045 ERYKARLVAKGYNQVEGLDFFDTFSPVAKITTVRTLIALASIRSWHLNQMDVNNAFLHGD 1104 Query: 1689 LHEEVYMLPPPGYLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSL 1868 L E+VYM P G +VCKL KSLYGLKQASR+W++KLTS LL G+ Q+ SD SL Sbjct: 1105 LQEDVYMEVPQGVNSPKPHQVCKLLKSLYGLKQASRKWYEKLTSLLLKEGYTQASSDHSL 1164 Query: 1869 FTMSHDDSITILCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARST 2048 FT+ H T L VYVDDIILAGN + + IKL +D F IKDLG LKY LGIEVA S Sbjct: 1165 FTLKHGSDFTALLVYVDDIILAGNSLQEFARIKLIMDNAFKIKDLGPLKYFLGIEVAHSK 1224 Query: 2049 KGIHICQRKYALDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYL 2228 +GI ICQRKY LDLL +TG LGSKP+ TP+D + + S D YRRLIG++LYL Sbjct: 1225 QGISICQRKYCLDLLKDTGLLGSKPAPTPLDPSIKLHQDSSPAYDDVGGYRRLIGKLLYL 1284 Query: 2229 TITRPELSFSIQTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDA 2408 T TRP++SF+IQ LSQ+LS PTTTH A R+ RY+K +P +GLF+ SPL L F+DA Sbjct: 1285 TTTRPDISFAIQQLSQFLSSPTTTHFDTACRVVRYLKGSPGRGLFFPRQSPLQLLGFADA 1344 Query: 2409 DWARCQETRRSISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLL 2588 DWA C +TRRS SG+ F+G SLISW++KKQ T+SRSS+EAEYR+++ +CE+QW+ YLL Sbjct: 1345 DWANCADTRRSTSGYCFFIGSSLISWRAKKQNTVSRSSSEAEYRSLSFASCELQWIVYLL 1404 Query: 2589 
QDMHIK-HPQSLMYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPST 2765 +D+ I ++Y D++SA I+ N HERTKH+++DCH VR+K+Q G+FKL+ I + Sbjct: 1405 KDLSIDCERPPVLYCDNQSAIHIASNPVFHERTKHLEIDCHLVRDKVQSGVFKLLPISTK 1464 Query: 2766 QQTADIFTKPLPPAQFNYLISKLSMLNI-HAPLEG 2867 Q AD FTK LPP FN +SKL+MLNI H P G Sbjct: 1465 AQLADFFTKALPPKVFNSFLSKLNMLNIFHVPACG 1499 >gb|PNY17451.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 1425 Score = 768 bits (1983), Expect = 0.0 Identities = 400/823 (48%), Positives = 534/823 (64%), Gaps = 29/823 (3%) Frame = +3 Query: 486 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 665 GIVERKHQH+L ARALLFQS +P FW H + + HIINRLP P L K+P+++L+N + Sbjct: 595 GIVERKHQHILGTARALLFQSHLPKIFWAHAVGHSVHIINRLPTPFLSQKSPYQMLYNCL 654 Query: 666 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 845 PD ++ K+FGCLAY +TL RHK +R+ KCV +G G KG+ L+DL+S +V ISR V Sbjct: 655 PDINNLKVFGCLAYATTLQTNRHKLDSRSRKCVSLGLKTGVKGHILFDLQSREVFISRDV 714 Query: 846 IFCENIFPF-----QEIASKSVSPEAP--------LFPISDLSSAQDQSFSHFQNINAQN 986 +F E+IFPF +I + + ++P LF + S Q + Sbjct: 715 VFFEHIFPFYTKNQHQIDQTNSATQSPILYDDLDMLFTNHSTHHSSSPSLPLLQTATPHS 774 Query: 987 DIVTTTQIISDAQNDFADAQHDLAAQNTTVVQTDPV-----NSHIEHSNPNPPIL----- 1136 + D + HD + V++TD + N+ + S+ + PI+ Sbjct: 775 PTSIPSTHSPDDHSSPPSPTHDHHSPCDPVIETDVMIPTSTNTPLTTSSNSLPIIAPPSI 834 Query: 1137 ---RRSSRNRTNPSYLQVYDCKIPPSIQSQHSAQTIISGKQ--YPIQDFLSVDKLSNKHK 1301 R+S R + PSYLQ Y KI +I S T S Q +PI F+S D LS HK Sbjct: 835 NPVRKSDRVKHPPSYLQDYHTKILGNISHSASDSTHPSSSQCKFPISSFISYDHLSPAHK 894 Query: 1302 AFSAQISQITEPKSYAQASKQKEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYK 1481 ++ IS +TEP SY +A + W+ A+ E+ AL KN TW M LP K IGCKWV+K Sbjct: 895 HYALNISTLTEPSSYEEAMCDENWKSAVNVELTALLKNNTWDMVKLPPHKKAIGCKWVFK 954 Query: 1482 VKYRSNGSIERYKARLVAKGYTQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLD 1661 +K ++G++ER+KARLVAKG+TQ EG+DY DTFSPV K+TTVR +A+AA++ W L QLD Sbjct: 955 LKLHADGTVERHKARLVAKGFTQTEGIDYIDTFSPVVKMTTVRTFMAIAASQNWPLFQLD 1014 Query: 1662 
VNNAFLHGDLHEEVYMLPPPGYLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGF 1841 VN AFLHGDL+EEVYM PPPG ++ VCKL +SLYGLKQASRQW KLT LL+ G+ Sbjct: 1015 VNTAFLHGDLNEEVYMKPPPGLPLAHPDLVCKLQRSLYGLKQASRQWNVKLTETLLSSGY 1074 Query: 1842 IQSKSDSSLFTMSHDDSITILCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYI 2021 IQSK+D SLFT + T + VYVDD++L G DI +I +K LD KF+IKDLG+LKY Sbjct: 1075 IQSKADYSLFTKNTSTGFTAILVYVDDLVLGGTDIDEIHQLKALLDTKFSIKDLGSLKYF 1134 Query: 2022 LGIEVARSTKGIHICQRKYALDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYR 2201 LG EVARS GI +CQRKY LDLL ++G LG+KP+ TPM Q + + ++D + YR Sbjct: 1135 LGFEVARSKTGISLCQRKYTLDLLQDSGLLGTKPTPTPMQPHLQLQKSSGNAISDPTTYR 1194 Query: 2202 RLIGRMLYLTITRPELSFSIQTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSP 2381 RLIGR+LYLT +RPE+S+++ LSQ+L PT H+ A + +Y+K+ P +GLF+SS+S Sbjct: 1195 RLIGRLLYLTHSRPEISYAVSKLSQFLDSPTDAHMLAGLHVLKYLKNNPGQGLFFSSSSS 1254 Query: 2382 LHLKCFSDADWARCQETRRSISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTC 2561 L LK +SD+DW C +TRRS +GF FLG S+ISWKSKKQ +SRSS+EAEYRA+A C Sbjct: 1255 LALKGYSDSDWGACPDTRRSTTGFCFFLGTSIISWKSKKQTVVSRSSSEAEYRALAQAAC 1314 Query: 2562 EIQWLTYLLQDMHIKHPQS-LMYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGL 2738 E QWL YLLQD I H ++Y D++SA I+ N HERTKHI++DCH VR+K+Q + Sbjct: 1315 EGQWLLYLLQDFQIPHDSPIILYCDNKSALHIAANPVFHERTKHIEIDCHVVRDKVQANI 1374 Query: 2739 FKLIHIPSTQQTADIFTKPLPPAQFNYLISKLSMLNIHAPLEG 2867 L+ + S +Q ADIFTK L P F+ L+SKL ++IH+ L G Sbjct: 1375 IHLLPVSSKEQIADIFTKSLHPGPFHTLLSKLGTIDIHSSLRG 1417 >gb|PNY16454.1| flavonol sulfotransferase-like protein [Trifolium pratense] Length = 1475 Score = 768 bits (1984), Expect = 0.0 Identities = 408/824 (49%), Positives = 537/824 (65%), Gaps = 27/824 (3%) Frame = +3 Query: 486 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 665 G+VERKHQHLL ARALLFQS +P FWD+ I + HIINRLP P L+N +P+++L N + Sbjct: 635 GVVERKHQHLLGTARALLFQSSLPKVFWDYAIGHSVHIINRLPTPFLNNMSPYQVLHNAL 694 Query: 666 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 845 PD S+ K+FG L Y +TL+A R K +R+ KC+ +G+ G KG+ L DL+S +V +SR V Sbjct: 695 
PDISNLKVFGSLCYAATLSAHRKKLDSRSRKCLLLGFKFGVKGHILLDLKSREVFVSRDV 754 Query: 846 IFCENIFPFQE------IASKSVSPEAPLF--PISDLSSAQDQSFSHFQNINA------- 980 +F E+IFPFQ+ + S ++PL+ P D + +S S I++ Sbjct: 755 VFFEHIFPFQQQSQDVAVKSHLSHSQSPLYDDPFIDCPHSSPESPSPNDPISSPPPSNSL 814 Query: 981 ----QNDIVTTTQIISDAQNDFADAQHDLAAQN-TTVVQTDPVNSHIEHS-NPNP----P 1130 N I + I++ + A H + N T T + S++ H P+P P Sbjct: 815 PHDIHNSIPNQSPILNSPPH--ASTLHTPSTNNHDTDNPTVSIPSYVSHHPTPSPAMPPP 872 Query: 1131 ILRRSSRNRTNPSYL-QVYDCKIPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAF 1307 R+S+R P YL + Y C S + S +YP+ ++S LS+ H + Sbjct: 873 PTRKSNRITHPPPYLTEHYYCNAAIH-DSTKDTPSSSSKCKYPLSSYISYQHLSSAHHHY 931 Query: 1308 SAQISQITEPKSYAQASKQKEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVK 1487 + IS I+EP Y +A W+ A+ AE+ AL+K TW + LP KH IGCKWV+K+K Sbjct: 932 LSNISTISEPTCYEKAVCDPNWKAAINAELSALDKYNTWKLVPLPKHKHAIGCKWVFKLK 991 Query: 1488 YRSNGSIERYKARLVAKGYTQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVN 1667 +NG+IERYKARLVAKGYTQ EG+DY DTFSPV K+TT+R+ +A+AA + W L+QLDVN Sbjct: 992 LHANGTIERYKARLVAKGYTQTEGIDYMDTFSPVVKMTTIRMFLAIAAIQNWPLYQLDVN 1051 Query: 1668 NAFLHGDLHEEVYMLPPPGYLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQ 1847 AFLHGDL EEVYM PPPG + VCKL +SLYGLKQASRQW KLT LL+ G+ Q Sbjct: 1052 TAFLHGDLDEEVYMKPPPGLDLPSPNLVCKLQRSLYGLKQASRQWNTKLTQTLLSSGYTQ 1111 Query: 1848 SKSDSSLFTMSHDDSITILCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILG 2027 SKSD SLFT T++ VYVDD++L G D +I+ IK LD KF+IKDLGTLKY LG Sbjct: 1112 SKSDYSLFTKQASSGFTVILVYVDDLVLGGTDDKEIQKIKALLDRKFSIKDLGTLKYFLG 1171 Query: 2028 IEVARSTKGIHICQRKYALDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRL 2207 EVAR+ GI +CQRKYALDL+ +TG LG+KP TTPM + Q S + L+D S YRRL Sbjct: 1172 FEVARTQAGISLCQRKYALDLIQDTGLLGAKPCTTPMQPQLQLHSESGTILSDPSTYRRL 1231 Query: 2208 IGRMLYLTITRPELSFSIQTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLH 2387 +GR+LYLT +RPE+++S+ LSQ+LS PT H+ A + +YIK+ P +GLF+++ S L Sbjct: 1232 VGRLLYLTHSRPEIAYSVSKLSQFLSAPTNEHMLAGLHVLKYIKNCPGQGLFFAANSSLK 1291 Query: 2388 LKCFSDADWARCQETRRSISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEI 2567 LK FSD+DWA C +TRRS +G 
FLG+SLISWKSKKQ +SRSS+EAEYRA+A TCE Sbjct: 1292 LKGFSDSDWAACPDTRRSTTGLCFFLGNSLISWKSKKQNVVSRSSSEAEYRALAQATCEA 1351 Query: 2568 QWLTYLLQDMHIKHPQSL-MYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFK 2744 QWL YLL D HI H + +Y D+RSA I+ N HERTKHI+LDCH VREKL GL Sbjct: 1352 QWLKYLLNDFHISHSSPIVLYCDNRSALHIAANPVFHERTKHIELDCHVVREKLLAGLIH 1411 Query: 2745 LIHIPSTQQTADIFTKPLPPAQFNYLISKLSMLNIHAPLEGGCK 2876 L+ + S +Q ADI TK L P F+ L +KL M++I++ L G K Sbjct: 1412 LLPVSSKEQVADILTKSLHPGPFHTLQNKLGMIDIYSSLRGDVK 1455 >gb|KYP42564.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1427 Score = 763 bits (1971), Expect = 0.0 Identities = 399/801 (49%), Positives = 531/801 (66%), Gaps = 4/801 (0%) Frame = +3 Query: 489 IVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKMP 668 +VERKH+H+L + R LLF S VP SFW + + A H+INRLP+P+L+N +P+++L++K P Sbjct: 645 VVERKHRHILNITRTLLFHSNVPKSFWCYAVGHAIHLINRLPSPVLNNSSPYQMLYDKPP 704 Query: 669 DYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHVI 848 K+FG L Y STL R K +A+K V++G G+KG+ + DL + + +SR+V+ Sbjct: 705 TLLDLKVFGSLCYASTLVQGRSKLAPKATKGVYLGVKQGTKGFLVLDLLTRSIFVSRNVV 764 Query: 849 FCENIFPFQEIASKSVSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTTQIISDAQN 1028 F E+IFPF E S ++ S Q+ + F +++ + VTT S Sbjct: 765 FYEHIFPFFEKGSTVITN----------SQQQNDACFDFLYLDSSSHPVTTIDNSSLLDI 814 Query: 1029 DFADAQHDLAAQNTTVVQTDPVNSHIEHSNPNPPILRRSSRNRTNPSYLQVYDCKI---P 1199 D A ++DL D S +P LR+S+R++ +P+YL+ Y C + Sbjct: 815 DSAHYENDL---------NDIDESAHPSETSSPSQLRKSTRHKCSPAYLKDYHCNLLIGV 865 Query: 1200 PSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKEWQD 1379 P + +H +YP+ LS D LS + + I+ EP ++ QA K K W + Sbjct: 866 PPPEDKHI--------RYPLNTVLSYDSLSASYSRYVLSITTHVEPHTFNQAVKNKVWVE 917 Query: 1380 AMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQEEG 1559 AM+AE+ ALE N+TW + LP K IG KWVYK+KY+S+GSIERYKARLV KGYTQ +G Sbjct: 918 AMQAELDALEHNKTWTIMPLPPGKTPIGSKWVYKIKYKSDGSIERYKARLVVKGYTQIQG 977 Query: 1560 LDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYLQSN 1739 
LDYFDTF+PVAK++TVR+L+A+A+ + W LHQLD+NNAFLHGDL E+VYM P G Sbjct: 978 LDYFDTFAPVAKLSTVRMLLAIASCQHWELHQLDINNAFLHGDLLEDVYMEIPQGLNIDK 1037 Query: 1740 DKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITILCVYVD 1919 VCKL KSLYGLKQASRQWF KL+S LL+ + QS+ D SLFT H T++ +YVD Sbjct: 1038 PNHVCKLNKSLYGLKQASRQWFAKLSSFLLSLHYKQSQHDHSLFTKHHGTHFTVILIYVD 1097 Query: 1920 DIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYALDLLTE 2099 D+I+AG D +I IK LD KF IKDLG L+Y LG+E+ARS GI + QRKY LDLL E Sbjct: 1098 DLIIAGTDSEEINHIKQSLDVKFKIKDLGPLRYFLGLEIARSHLGISLSQRKYTLDLLDE 1157 Query: 2100 TGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQTLSQY 2279 T FL KP TP+ + S SP D + YRRLIG++LYL TRP++S+S+Q LSQ+ Sbjct: 1158 TSFLAGKPVLTPIIKGTRLSHTTDSPYEDPAGYRRLIGKLLYLITTRPDISYSVQQLSQF 1217 Query: 2280 LSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSISGFAV 2459 LS P +H AA R+ RY+K P +GLFY + SPL LK FSD+DWA C +TRRS+SG+++ Sbjct: 1218 LSCPQQSHYQAAIRVLRYLKGNPGQGLFYPADSPLQLKAFSDSDWASCPDTRRSLSGYSI 1277 Query: 2460 FLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIKH-PQSLMYTDS 2636 FLG+SLISWK KKQ TISRSS+EAEYRA+A T CEIQWLTYLLQD + +L+Y D+ Sbjct: 1278 FLGNSLISWKCKKQSTISRSSSEAEYRALAATACEIQWLTYLLQDFSVPFTTPALLYCDN 1337 Query: 2637 RSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLPPAQFN 2816 +SA I+ N+ HERTKHI++DCH VREKLQ GLF L+ I S+ Q ADI TKPL P+ F Sbjct: 1338 QSARHIASNAVFHERTKHIEIDCHLVREKLQAGLFHLLPIASSHQLADILTKPLDPSPFQ 1397 Query: 2817 YLISKLSMLNIHAPLEGGCKD 2879 YL+SKL ++NI++P G D Sbjct: 1398 YLLSKLGVINIYSPACRGVLD 1418 >gb|OMO88956.1| Integrase, catalytic core [Corchorus capsularis] Length = 1451 Score = 763 bits (1970), Expect = 0.0 Identities = 401/828 (48%), Positives = 548/828 (66%), Gaps = 30/828 (3%) Frame = +3 Query: 489 IVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKMP 668 IVERKHQH+L VAR+L FQ+ +P+ FW C+L A +INR+P +L N TPF+ LFN+ P Sbjct: 639 IVERKHQHILNVARSLRFQASLPIDFWGECVLHAVFLINRIPTKVLGNVTPFQKLFNESP 698 Query: 669 DYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHVI 848 + K+FG LA+ S 
+ ++KF +R+ K VF+G+ G KGYKLYDL+++K +SR V Sbjct: 699 NIDVLKVFGSLAFASNHSNIKNKFDSRSIKSVFLGFQPGVKGYKLYDLQNNKKNLSRDVT 758 Query: 849 FCENIFPFQEIA--------SKSVSPEAPLFPISDLSSAQD-----------QSFSHFQN 971 F E+I+PF E SK +S E + P SD +A + Q+ SH N Sbjct: 759 FYEHIYPFTEEYAKTDNLQFSKHISTENLVLPNSDNFAAMNDSIPSSDVSTQQNMSHLSN 818 Query: 972 I-----NAQNDIVTTTQIISDAQNDFADAQHDLAAQNTTVVQTDPVNSHI----EHSNPN 1124 + ++ + V I + QN+ H+++ P NS I + SN N Sbjct: 819 VPVASSSSNTEAVLQIPIATVYQNN--PVLHEISEPILQSSNVGPANSTIPNTSQQSNTN 876 Query: 1125 PPILRRSSRNRTNPSYLQVYDCKIPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKA 1304 +RRS+R + P +LQ ++C S HS ++ S D +++KHKA Sbjct: 877 YHNVRRSTRLKFRPPHLQSFECNQVQKT-SPHSLSSVFS-----------YDNITSKHKA 924 Query: 1305 FSAQISQITEPKSYAQASKQKEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKV 1484 F+ I Q TEP++Y +A K ++WQ AM E+ ALEK +TW + DLP K IGCKWVYKV Sbjct: 925 FAVAIDQDTEPRNYKEAIKSQQWQQAMNEELEALEKTKTWKLVDLPHGKQPIGCKWVYKV 984 Query: 1485 KYRSNGSIERYKARLVAKGYTQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDV 1664 K +++GSIERYKARLVAKGYTQ+EG+DY DTFSPVAKI T+R L+ +AA KGW+LHQ DV Sbjct: 985 KRKADGSIERYKARLVAKGYTQQEGVDYLDTFSPVAKIATIRTLLVVAALKGWYLHQCDV 1044 Query: 1665 NNAFLHGDLHEEVYMLPPPGYLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFI 1844 N FLHGDL EEVYM P GYL+ + K VCKL KSLYGLKQASRQW KLT L+ YGF Sbjct: 1045 NTTFLHGDLSEEVYMKLPEGYLEGSTK-VCKLVKSLYGLKQASRQWNLKLTESLIKYGFH 1103 Query: 1845 QSKSDSSLFTMSHDDSITILCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYIL 2024 QS++D +LF D + L VYVDDII+A NDI+++ IK +L + F+IKDLG LK+ L Sbjct: 1104 QSQADHTLFIKFVDKNFIALLVYVDDIIVASNDITEVINIKAYLHDLFSIKDLGELKFFL 1163 Query: 2025 GIEVARSTKGIHICQRKYALDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRR 2204 G+EVARS +GI++CQ+KY +DLL + FL KP++TP+ + + ++ +PL DASQYR+ Sbjct: 1164 GLEVARSKQGINVCQKKYTMDLLKDMNFLVCKPTSTPILPETRLTTESGTPLADASQYRQ 1223 Query: 2205 LIGRMLYLTITRPELSFSIQTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPL 2384 L+G++ YLT TR ++S+++Q L+Q+L KPT+ HL AHR+ RY+K T +GL +SS Sbjct: 1224 LVGKLQYLTTTRLDISYAVQQLAQFLDKPTSDHLQVAHRVLRYLKGTIGQGLLFSSQGIF 1283 Query: 2385 
HLKCFSDADWARCQETRRSISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCE 2564 LK +SD+DW C ++R+SI+G+ +FLGDSL+SWK+KKQ T+SRSS+EAEYRA+A T CE Sbjct: 1284 QLKAYSDSDWGTCLDSRKSITGYCIFLGDSLVSWKTKKQNTVSRSSSEAEYRALATTVCE 1343 Query: 2565 IQWLTYLLQDMHIK-HPQSLMYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLF 2741 IQWL YL++D+ I P + ++ D+ SA I++N HERTKHI +DCH VR KLQEGL Sbjct: 1344 IQWLNYLMKDLQITLEPSTPLFCDNLSAIHIAKNPVFHERTKHIDIDCHVVRTKLQEGLI 1403 Query: 2742 KLIHIPSTQQTADIFTKPLPPAQFNYLISKLSMLNIHAP-LEGGCKDS 2882 KL+ + S Q AD FTK L F SKL + N++ P L G ++S Sbjct: 1404 KLLPVSSKLQLADCFTKVLSSTNFINAFSKLGIQNLYIPSLRGDVRES 1451 >gb|PNX97998.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 964 Score = 747 bits (1928), Expect = 0.0 Identities = 383/797 (48%), Positives = 519/797 (65%), Gaps = 6/797 (0%) Frame = +3 Query: 486 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 665 G+VERKH+HLL VARAL FQ+K+PLSFW C+LTAA++IN+LP PIL K+P ++L Sbjct: 187 GVVERKHRHLLDVARALRFQAKLPLSFWGECVLTAAYLINKLPTPILKYKSPHQVLLGSP 246 Query: 666 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 845 P YS ++FGCL + + Q HKF RA +F+GYP KGY++YD+ + K+ +SR V Sbjct: 247 PSYSSLRVFGCLCFAKNMNIQ-HKFDERAKPGIFVGYPFNQKGYRIYDMHTRKIYVSRDV 305 Query: 846 IFCENIFPFQEIASKSVSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTTQI---IS 1016 F E +FP+ ++ + S + SD+S + F ++ + +++ + I IS Sbjct: 306 QFFETVFPYHDLQTPSFA--------SDISI--NTQFLDYEVDDTPSNLSPASSIPPGIS 355 Query: 1017 DAQNDFADAQHDLAAQNTTVVQTDPVNSHIEHSNP--NPPILRRSSRNRTNPSYLQVYDC 1190 N + + N + + PV +HS N P R R+RT L + C Sbjct: 356 HHDNTIVTIPNP-SVDNPSEIPAIPVEPPQQHSPTAINHPERRYPLRHRTPSVRLTDHVC 414 Query: 1191 KIPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKE 1370 I + S +P++++ S+ LS H+A I + EP SY+QA K E Sbjct: 415 DI----------NNVTSQSAFPLKNYFSLSNLSTSHRALLVNIIENKEPTSYSQAIKSAE 464 Query: 1371 WQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQ 1550 W++AM EI ALE N TW+++ LP K IGCKWVYK+KY S+G++ERYKARLVAKGY Q Sbjct: 465 
WREAMAKEIHALESNNTWVLSPLPNGKTAIGCKWVYKIKYHSDGTVERYKARLVAKGYNQ 524 Query: 1551 EEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYL 1730 G+DY +TF+PVAK+ TVRLL+++AA K W LHQLDVNNAFL GDL+EEVYM PPG+ Sbjct: 525 VHGIDYHETFAPVAKLVTVRLLLSIAAIKNWSLHQLDVNNAFLQGDLNEEVYMKLPPGFS 584 Query: 1731 QSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITILCV 1910 VCKL KS+YGLKQASRQWF K ++ L+ GF QS SD SLFT + + + V Sbjct: 585 HKGQPCVCKLNKSIYGLKQASRQWFSKFSTTLIQKGFHQSISDYSLFTFKSNHTTIFVLV 644 Query: 1911 YVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYALDL 2090 YVDDII+ GN+ I IK L + F+IKDLG L Y LGIEV+RS KGI +CQRKY LD+ Sbjct: 645 YVDDIIITGNNDDAISDIKKFLAQAFSIKDLGNLSYFLGIEVSRSKKGIFLCQRKYTLDI 704 Query: 2091 LTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQTL 2270 L++ G G +PS PM+ + ND SPL D + YRRLIGR+LYLT+TRP++ +++ TL Sbjct: 705 LSDAGLTGCRPSEFPMEQHLRLRPNDGSPLPDPTVYRRLIGRLLYLTVTRPDIQYAVNTL 764 Query: 2271 SQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSISG 2450 SQ++ P TTHL AA R+ RY+K + KGLF S++S L L ++D+DWA C TRRS +G Sbjct: 765 SQFMQSPCTTHLDAATRVLRYLKGSVGKGLFLSASSSLQLIGYADSDWAGCPTTRRSTTG 824 Query: 2451 FAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIKHPQSL-MY 2627 + LG + ISWK+KKQPTISRSSAEAEYR++A E+QWL +LL D+ I HP + ++ Sbjct: 825 YFTMLGSNPISWKTKKQPTISRSSAEAEYRSLATLASELQWLKFLLSDLDIAHPLPITVH 884 Query: 2628 TDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLPPA 2807 DS++A I++N HERTKHI++DCHFVREK++ GL + ++ S Q ADIFTKPL Sbjct: 885 CDSQAAIHIAENPVFHERTKHIEIDCHFVREKIKSGLLRPSYLRSFDQLADIFTKPLGGD 944 Query: 2808 QFNYLISKLSMLNIHAP 2858 + L+ KL +L I P Sbjct: 945 AYKRLLGKLGVLEISIP 961 >dbj|GAU41679.1| hypothetical protein TSUD_272630 [Trifolium subterraneum] Length = 1178 Score = 753 bits (1945), Expect = 0.0 Identities = 399/797 (50%), Positives = 524/797 (65%), Gaps = 6/797 (0%) Frame = +3 Query: 486 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 665 G+VERKHQH+L VA R P+L+ K P+E+L + Sbjct: 432 GVVERKHQHILNVA--------------------------RFSTPLLNFKCPYEMLHKEP 
465 Query: 666 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 845 P H K+FGCL+Y +TL A R KF +RA K +F+GY G+KGY LYDL SH++ +SR+V Sbjct: 466 PSIVHLKVFGCLSYATTLQAHRTKFVSRARKAIFLGYKDGTKGYILYDLHSHEIFVSRNV 525 Query: 846 IFCENIFPF---QEIASKSVSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTTQIIS 1016 IF E FPF + + S SP + L + D + ++ + +T + II Sbjct: 526 IFYETDFPFHLSNSVKTDSASPASHLNHTLLYDAEPDPNALPIPVMHEPD--LTLSPIIG 583 Query: 1017 DAQNDFADAQHDLAAQNTTVVQTDPVNSHIEHSNPNPPILRRSSRNRTNPSYLQVYDCKI 1196 + ND + P+NS PNP LR+SSR P +L+ + C+ Sbjct: 584 PSYND-----------------STPINSPESSPIPNPAPLRKSSRVIQRPRHLEGFHCET 626 Query: 1197 PPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKEWQ 1376 S S+ T+ YP+ LS + + + A IS I EPK+Y QASK + W+ Sbjct: 627 LIGTHSAASSNTV-----YPLSSVLSYNNCAPNYHALCCSISAIVEPKTYTQASKFECWR 681 Query: 1377 DAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQEE 1556 +AM AE+ AL++N+TW + DLP K +GCKWVYKVKY +NGSIERYKARLVAKGYTQ E Sbjct: 682 NAMNAELLALDENKTWSVVDLPNGKVPVGCKWVYKVKYHANGSIERYKARLVAKGYTQLE 741 Query: 1557 GLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYLQS 1736 G+DYFDTFSPVAKITTVR+L+ALA+ KGW L QLDVNNAFLHGDL+E+VYM PPG+ + Sbjct: 742 GVDYFDTFSPVAKITTVRVLLALASIKGWHLEQLDVNNAFLHGDLNEDVYMSLPPGFAAT 801 Query: 1737 ND-KKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITILCVY 1913 N+ KVCKL KS+YGLKQASRQW+ KL+S L++ G+ S+SD SL+ S +S T L VY Sbjct: 802 NESNKVCKLHKSIYGLKQASRQWYSKLSSSLVSLGYTPSQSDHSLYIKSTTNSFTALLVY 861 Query: 1914 VDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYALDLL 2093 VDDI+LAGN I +I+ +KL LD+KF IKDLG L+Y L +E+ARS GI + QRKY L+LL Sbjct: 862 VDDIVLAGNSIHEIQTVKLFLDQKFKIKDLGKLRYFLVLEIARSDTGIFVNQRKYTLELL 921 Query: 2094 TETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQTLS 2273 + G LG+KPS+ P + SS D +PL D S YRRLIGR+LYLT TRP++SFS+Q LS Sbjct: 922 EDVGLLGTKPSSIPFHPTTKLSSTDGAPLDDPSSYRRLIGRLLYLTHTRPDISFSVQHLS 981 Query: 2274 QYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSISGF 2453 Q++SKP H AA I +Y+KS P KG+F S++S L + F+D+DWARC ETR+SI GF Sbjct: 982 
QFVSKPLVPHYNAAMHILKYLKSDPAKGIFLSASSSLKISAFADSDWARCPETRKSIIGF 1041 Query: 2454 AVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHI--KHPQSLMY 2627 V LG SLISWKSKKQ T+SRSS EAEYRA+A TCEIQWL Y+ QD I +P + ++ Sbjct: 1042 CVLLGSSLISWKSKKQNTVSRSSTEAEYRALASLTCEIQWLQYIFQDFKIIFSNP-AYVF 1100 Query: 2628 TDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLPPA 2807 D++SA ++ N HER+KHI+LDCH +REK+Q L L+ +P+T Q AD+FTKPL Sbjct: 1101 CDNKSAIYLAHNPTFHERSKHIELDCHVIREKIQSKLIHLLPVPTTSQLADVFTKPLNHP 1160 Query: 2808 QFNYLISKLSMLNIHAP 2858 F+ +SKL + +IH+P Sbjct: 1161 AFSSFLSKLGLCSIHSP 1177 >gb|PNX93131.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 982 Score = 746 bits (1925), Expect = 0.0 Identities = 391/799 (48%), Positives = 522/799 (65%), Gaps = 10/799 (1%) Frame = +3 Query: 486 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 665 G VERKHQH+L +ARALL+QS +P FW + +L A IIN++ P+L NK+P E+LF+ + Sbjct: 193 GRVERKHQHILNIARALLYQSNLPKYFWSYAVLHATAIINKIVTPVLQNKSPHEMLFHCL 252 Query: 666 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 845 PD + K+FG LAY STL + K R KCVF+G G KG L+DL+S + +SR+V Sbjct: 253 PDLNELKVFGSLAYASTLDVNKTKLSPRGRKCVFLGQKQGVKGSILFDLDSKNIFLSRNV 312 Query: 846 IFCENIFPFQEIASK-------SVSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTT 1004 ++I P+ SK +++ E P D+ DQS + + T Sbjct: 313 THFDHILPYTTNTSKLHWHYHSTINCE----PFLDI----DQSHTSTNPSDTTPSPTPPT 364 Query: 1005 QIISDAQNDFADAQHDLAAQNTTVVQTDPVNSHIEHSNPNP--PILRRSSRNRTNPSYLQ 1178 IISD +P S S+P P P R R + PSYL Sbjct: 365 NIISDP---------------------NPSTSSPLPSSPFPIQPANTRPDRIKHRPSYLS 403 Query: 1179 VYDCKIPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQAS 1358 + C S SA++ +G YPI F S+ +LS H F++ ++Q TEP++Y +A Sbjct: 404 DFVCSA-----SDDSAKSSSTGTIYPISSFHSLSQLSPSHSVFTSSLTQHTEPRTYTEAC 458 Query: 1359 KQKEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAK 1538 K + W AM +E+ AL + TW + DLP + IG KWVYK+K++S+G+IERYKARLVAK Sbjct: 459 KSQHWIQAMTSELEALARTGTWKIVDLPPNVKPIGSKWVYKIKHKSDGTIERYKARLVAK 518 
Query: 1539 GYTQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPP 1718 GY Q EGLD+FDTFSPVAK+TTVR+L+A+A+ KGWFLHQLDVNNAFLHGDL E VYM P Sbjct: 519 GYNQVEGLDFFDTFSPVAKLTTVRMLLAIASIKGWFLHQLDVNNAFLHGDLQENVYMSIP 578 Query: 1719 PGYLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSIT 1898 G S +VCKL KSLYGLKQASR+W++KLTS L+ G+ QS SD SLFT+S D+ T Sbjct: 579 DGVQCSKPNQVCKLLKSLYGLKQASRKWYEKLTSLLVKEGYTQSSSDHSLFTISQQDNFT 638 Query: 1899 ILCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKY 2078 L +YVDDIILAG + +I IK LD F IKDLG +KY LG+EVA S +GI I QRKY Sbjct: 639 ALLIYVDDIILAGTSLQEINRIKNILDTHFKIKDLGVVKYFLGLEVAHSKEGISISQRKY 698 Query: 2079 ALDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFS 2258 LDLL ++G LGSKP++TP+D + +D P D S YRRL+G++LYLT TRP+++F+ Sbjct: 699 CLDLLHDSGLLGSKPASTPLDPSVKLHHDDGKPFEDISMYRRLVGKLLYLTNTRPDIAFA 758 Query: 2259 IQTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRR 2438 Q LSQ+L KPT TH AA R+ RY+K P GL + + + L +SDADWA C +TRR Sbjct: 759 TQQLSQFLHKPTMTHYKAACRVIRYLKHNPGMGLIFKRNADIQLIGYSDADWAGCLDTRR 818 Query: 2439 SISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIK-HPQ 2615 S +G+ F+G SLISWK+KKQ TIS+SS+EAEYRA++ TCE+ WL YLL+D+HI+ Q Sbjct: 819 STTGYCFFVGSSLISWKAKKQTTISKSSSEAEYRALSSATCELVWLLYLLKDLHIECSKQ 878 Query: 2616 SLMYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKP 2795 +++ D++SA I+ N HERTKHI++DCH VREK+QEGL +LI + + +Q AD TK Sbjct: 879 PVLFCDNQSALHIASNPVFHERTKHIEIDCHLVREKVQEGLLRLIPVSTQEQLADFLTKS 938 Query: 2796 LPPAQFNYLISKLSMLNIH 2852 LP +F+ + KL +L+I+ Sbjct: 939 LPAPKFHDFLCKLGLLDIY 957 >gb|KYP43110.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1353 Score = 755 bits (1949), Expect = 0.0 Identities = 386/794 (48%), Positives = 530/794 (66%), Gaps = 3/794 (0%) Frame = +3 Query: 486 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 665 G+VERKHQH+L +ARAL+FQS V FW++ I A H+INRLP L +P+ +L+++ Sbjct: 578 GVVERKHQHILSMARALMFQSNVSKMFWNYAIGHAVHLINRLPTRFLQQNSPYYVLYSEK 637 Query: 
666 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 845 PD+SH K+FGCLA+ STL+ R K + R+ KC+F+GY G+KG+ +YDL++ + ISR V Sbjct: 638 PDFSHLKVFGCLAFASTLSHNRTKLEPRSRKCMFLGYSSGTKGFIMYDLKTRETFISRDV 697 Query: 846 IFCENIFPFQEIASKSVSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTTQIISDAQ 1025 F ENIFP Q+ S S + P+ PI AQ + + I S Sbjct: 698 QFYENIFPLQKDFSIQ-STDGPVVPI------------------AQMPLTSCDPIPSHTH 738 Query: 1026 NDFADAQHDLAAQNTTVVQTDPVNSHIEHSNPNPPILRRSS-RNRTNPSYLQVYDCKIPP 1202 ++ + +H+ ++T+ T+ NS + N P +RR+S R + P YLQ Y C + Sbjct: 739 DNLDETEHE--HNSSTLPMTNSSNSDQPNIEINIPEIRRTSQRVKNRPGYLQDYHCTLAA 796 Query: 1203 SIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKEWQDA 1382 S Q S S +YPI D+L S ++F + IS I EP+SY A W++A Sbjct: 797 SKVDQSS-----STARYPISDYLPYTSYSAVQQSFVSTISSIIEPRSYQDAINHDCWKEA 851 Query: 1383 MKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQEEGL 1562 ++AE+ AL+K +TWI+TDLP +K +GC+WV+KVKY ++GS+ERYKARLVAKG+TQ GL Sbjct: 852 IRAELDALDKQKTWILTDLPPNKRAVGCRWVFKVKYHADGSVERYKARLVAKGFTQIPGL 911 Query: 1563 DYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYLQSND 1742 DY DTFSPV ++TT+R+ +A+AAA W +HQLD+N AFLHGDL EEVYM PPPG + S+ Sbjct: 912 DYIDTFSPVVRMTTIRVFLAIAAASNWSVHQLDINTAFLHGDLVEEVYMKPPPGLILSSP 971 Query: 1743 KKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITILCVYVDD 1922 KVCKL KSLYGLKQ SRQW KLT L +GF+QSKSD SLFT + + VYVDD Sbjct: 972 NKVCKLQKSLYGLKQVSRQWNIKLTETLKLFGFVQSKSDYSLFTKRTNIGFIAILVYVDD 1031 Query: 1923 IILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYALDLLTET 2102 +I++G+D ++I +K LD++F+IKDLG L Y LG+E +RS +GI +CQRKYAL+LL +T Sbjct: 1032 LIISGSDETEIMKVKRLLDKQFSIKDLGQLSYFLGLEFSRSDQGISVCQRKYALELLQDT 1091 Query: 2103 GFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQTLSQYL 2282 G L SKP +TPMD + + +D S YRRL+GR++YLT TRP+L+F++ LSQ++ Sbjct: 1092 GLLASKPCSTPMDHTTRLHHDPLDLYSDPSSYRRLVGRLIYLTHTRPDLAFAVGKLSQFM 1151 Query: 2283 SKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSISGFAVF 2462 +P H AA ++ RY+K+TP KGLF+ S+S L L ++D+DWA C ++RRSISGF F Sbjct: 1152 
HQPNNAHFQAARKVLRYVKATPTKGLFFPSSSDLKLTGYTDSDWATCPDSRRSISGFCFF 1211 Query: 2463 LGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIKH--PQSLMYTDS 2636 LG++L+SWKSKKQ +SRSS+EAEYRA+AL CE QWL LL D ++ P SL + D+ Sbjct: 1212 LGNALVSWKSKKQNVVSRSSSEAEYRALALGVCEAQWLHKLLTDFQLQDLIPISL-FCDN 1270 Query: 2637 RSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLPPAQFN 2816 +SA I+ N HERTKH+++DCH VR+++Q G L I S+ Q ADI TKPL P F Sbjct: 1271 QSALYIAANPVFHERTKHVEIDCHTVRDQVQAGFIHLAPITSSGQLADILTKPLLPKMFQ 1330 Query: 2817 YLISKLSMLNIHAP 2858 + KL + N P Sbjct: 1331 DFVCKLGLSNFTTP 1344