BLASTX nr result
ID: Rehmannia30_contig00015097
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia30_contig00015097 (2596 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|PNY00469.1| retrovirus-related Pol polyprotein from transposo... 791 0.0 gb|KYP34293.1| Retrovirus-related Pol polyprotein from transposo... 811 0.0 gb|PRQ55089.1| putative RNA-directed DNA polymerase [Rosa chinen... 790 0.0 gb|PNX92270.1| retrovirus-related Pol polyprotein from transposo... 788 0.0 gb|PNX74277.1| retrovirus-related Pol polyprotein from transposo... 769 0.0 gb|KYP40677.1| Retrovirus-related Pol polyprotein from transposo... 769 0.0 gb|PNX93614.1| retrovirus-related Pol polyprotein from transposo... 786 0.0 gb|PNX93928.1| hypothetical protein L195_g017092, partial [Trifo... 761 0.0 gb|KYP39497.1| Retrovirus-related Pol polyprotein from transposo... 779 0.0 gb|KYP42321.1| Copia protein [Cajanus cajan] 775 0.0 emb|CAN71595.1| hypothetical protein VITISV_010143 [Vitis vinifera] 776 0.0 dbj|GAU46782.1| hypothetical protein TSUD_351810 [Trifolium subt... 772 0.0 gb|PNY17451.1| retrovirus-related Pol polyprotein from transposo... 768 0.0 gb|PNY16454.1| flavonol sulfotransferase-like protein [Trifolium... 768 0.0 gb|KYP42564.1| Retrovirus-related Pol polyprotein from transposo... 763 0.0 gb|OMO88956.1| Integrase, catalytic core [Corchorus capsularis] 763 0.0 gb|PNX97998.1| retrovirus-related Pol polyprotein from transposo... 747 0.0 dbj|GAU41679.1| hypothetical protein TSUD_272630 [Trifolium subt... 753 0.0 gb|PNX93131.1| retrovirus-related Pol polyprotein from transposo... 746 0.0 gb|KYP43110.1| Retrovirus-related Pol polyprotein from transposo... 755 0.0 >gb|PNY00469.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 778 Score = 791 bits (2044), Expect = 0.0 Identities = 410/787 (52%), Positives = 529/787 (67%), Gaps = 23/787 (2%) Frame = +1 Query: 271 WDHCILTAAHIINRLPAPILHNKTPFEILFNKMPDYSHFKIFGCLAYVSTLTAQRHKFQA 450 W+ C+ A H+INR+P+P+L +KTP+E+LF P H K+FGCL++ +TL A R KF + Sbjct: 2 WNFCVQHAVHVINRIPSPLLKSKTPYELLFKLSPTLLHLKVFGCLSFATTLQAHRTKFDS 61 Query: 451 RASKCVFIGYPLGSKGYKLYDLESHKVLISRHVIFCENIFPFQEIASKSVSPEAPLFPIS 630 RA KCVFIGY G+KGY LYDL SH + +SR+V+F E++ PF+ + + S +P FP+ Sbjct: 62 RARKCVFIGYKDGTKGYILYDLHSHNIFLSRNVVFYEHVLPFKSVPGPTSSHNSPTFPLY 121 Query: 631 D--LSSAQDQSFSHFQNINAQNDIVTTTQII--------------------SDAQNDFAD 744 D L + + F D V+ + + A F D Sbjct: 122 DDPLDISHNPCVDTFPLSTGSLDNVSLNPALTPPLVPTLDSSPLTPPINTATPAPPSF-D 180 Query: 745 AQHDLAAQNTTVVQTDPVNSHIEHSNPNPPILRRSSRNRTNPSYLQVYDCKIPPSIQSQH 924 + H A Q + + + PV S E S P P R S+R PSYLQ Y C I +Q Sbjct: 181 SAHSAADQPSPNLDSVPVPS--EPSIPLP--TRVSTRVTRPPSYLQDYHCNIKSGCTNQV 236 Query: 925 SAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKEWQDAMKAEIR 1104 S+ + +P+ LS + S +K F IS EP +Y QASK W+ AM AEI Sbjct: 237 SSNIV-----HPLSSVLSYNTCSPAYKLFCCSISSTIEPTTYNQASKFDCWKKAMDAEIT 291 Query: 1105 ALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQEEGLDYFDTF 1284 ALE N+TW + DLP K IGCKWVYK+KY +NG+IERYKARLVAKGYTQ EG+DYFDTF Sbjct: 292 ALELNKTWTVVDLPCGKVPIGCKWVYKIKYHANGTIERYKARLVAKGYTQMEGVDYFDTF 351 Query: 1285 SPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYLQSNDKKVCKL 1464 SPVAK+TTVR+L+A+AA +GW L QLDVNNAFLHGDLHEEVYM PPGY + KVCKL Sbjct: 352 SPVAKMTTVRVLLAVAAVRGWHLEQLDVNNAFLHGDLHEEVYMSLPPGY-DATPSKVCKL 410 Query: 1465 TKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITILCVYVDDIILAGN 1644 KSLYGLKQASRQW+ KL++ L++ G+ S++D SL+ SH S T L VYVDDI+LAG Sbjct: 411 NKSLYGLKQASRQWYSKLSAALISLGYQASQADHSLYVKSHGTSFTALLVYVDDIVLAGT 470 Query: 1645 DISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYALDLLTETGFLGSK 1824 I +I+++KL LD++F IKDLG L++ LG+E+ARS+ GI + QRKY L+LL +TGFLGSK Sbjct: 471 SIEEIKSVKLFLDQQFKIKDLGPLRFFLGLEIARSSSGIFLNQRKYTLELLEDTGFLGSK 530 Query: 1825 PSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQTLSQYLSKPTTT 2004 P+T P+D + S+ D P D S YRRLIGR+LYLT TRP++SF++Q LSQY+S P Sbjct: 531 PATVPLDPHTKLSATDGVPFDDPSGYRRLIGRLLYLTHTRPDISFAVQHLSQYVSTPLVP 590 Query: 2005 HLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSISGFAVFLGDSLI 2184 H AA RI RY+KS P KG+ +SS SPL L F+D+DWA C TRRS++G+ V LG SLI Sbjct: 591 HYQAATRILRYLKSCPAKGVLFSSHSPLQLHGFADSDWACCPNTRRSVTGYCVLLGSSLI 650 Query: 2185 SWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIKHPQSL-MYTDSRSAYCIS 2361 SWKSKKQ T+SRSS EAEYRA+A TCE+QWL YL QD+HI PQS +Y D++SA ++ Sbjct: 651 SWKSKKQNTVSRSSTEAEYRALASLTCELQWLQYLFQDLHITFPQSASVYCDNKSAIYLA 710 Query: 2362 QNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLPPAQFNYLISKLS 2541 N HER+KHI+LDCH +REKLQ L L+ +PS Q AD+FTKPL F+ ++SKL Sbjct: 711 HNPTFHERSKHIELDCHIIREKLQSKLIHLLSVPSKSQLADVFTKPLHSPAFSSMLSKLG 770 Query: 2542 MLNIHAP 2562 + +IH P Sbjct: 771 LCSIHHP 777 >gb|KYP34293.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1376 Score = 811 bits (2094), Expect = 0.0 Identities = 420/801 (52%), Positives = 543/801 (67%), Gaps = 6/801 (0%) Frame = +1 Query: 190 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 369 G+VERKHQHLL V ALLF SK+P FW + +L A ++INR+ P+L NKTPF+ L+ + Sbjct: 601 GVVERKHQHLLNVTHALLFHSKLPYCFWSYALLHATYLINRITTPLLDNKTPFQKLYGQT 660 Query: 370 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 549 D + ++FGCL YVST TA R K RA CVF+G+ +KGY YDL + + ISR+V Sbjct: 661 CDITELRVFGCLCYVSTSTANRKKLDPRAHPCVFLGFSPTTKGYITYDLHTRAITISRNV 720 Query: 550 IFCENIFPFQEIASKSVSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTTQIISDAQ 729 F EN FP + S S S + PIS F + +D+++ I+ D Sbjct: 721 SFYENHFPLLQSTS-STSNIPVVSPIS------------FGIHSPSHDLIS---ILPDPH 764 Query: 730 NDFADAQHDLAAQNTTVVQTD-----PVNSHIEHSNPNPPILRRSSRNRTNPSYLQVYDC 894 QH++ + N D P ++ + PN LRRS+R R PSYLQ Y Sbjct: 765 ------QHNVTSPNPATTSHDSISLAPYSTTADSLPPNSSPLRRSTRLRNPPSYLQDYHH 818 Query: 895 KIPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKE 1074 + + + H G YPI+ ++S +LSN +AF + IS ++EP SYA+A+K Sbjct: 819 SLTSTSTNLHP------GMLYPIEKYISYSRLSNDFQAFVSSISAVSEPHSYAEAAKHDC 872 Query: 1075 WQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQ 1254 W AM AE+ AL+ NQTW +T LP K +GC+W+YK+KY ++GSIERYKARLVAKGYTQ Sbjct: 873 WLKAMHAELEALKMNQTWTLTPLPPHKQAVGCRWIYKIKYNADGSIERYKARLVAKGYTQ 932 Query: 1255 EEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYL 1434 EGLDY TFSPVAK+TTVRLL+ALAA W L QLDVNNAFLHGDL+EEVYM P G Sbjct: 933 VEGLDYLATFSPVAKLTTVRLLLALAAVFDWHLKQLDVNNAFLHGDLNEEVYMTLPLGMR 992 Query: 1435 QSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITILCV 1614 +VCKL KSLYGLKQASRQWF KL+S L+++G+ QS SD SLF S T L + Sbjct: 993 PEYSNQVCKLQKSLYGLKQASRQWFAKLSSFLIHHGYHQSASDHSLFMKFSSSSTTALLI 1052 Query: 1615 YVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYALDL 1794 YVDDI+LAGN++S+I+ I LD F IKDLG LKY LG+EVAR+ GIH+ QRKY LD+ Sbjct: 1053 YVDDIVLAGNNLSEIQLITGLLDVAFKIKDLGNLKYFLGLEVARNKSGIHLSQRKYVLDI 1112 Query: 1795 LTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQTL 1974 L++ G + S+P +TPMD ++ S++ +PL D S YRRL+GR++YLT TRP++S+ + L Sbjct: 1113 LSDCGMMASRPVSTPMDYTSRLSASSGTPLADPSSYRRLLGRLIYLTTTRPDISYVVHHL 1172 Query: 1975 SQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSISG 2154 SQ++S P+T H A RI RY+K P GLF+ + S LHLK FSD+DWA C +TRRSI+G Sbjct: 1173 SQFMSAPSTAHSQAIFRILRYLKQAPGSGLFFPTNSSLHLKAFSDSDWAGCLDTRRSITG 1232 Query: 2155 FAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIK-HPQSLMY 2331 F+V+LGDSLISW+SKKQPT+SRSS+EAEYRA+A TT E+QWLTYLL D+H+ H +L+Y Sbjct: 1233 FSVYLGDSLISWRSKKQPTVSRSSSEAEYRALATTTSELQWLTYLLHDLHVPVHQPALLY 1292 Query: 2332 TDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLPPA 2511 D++SA I+ N HERTKHI +DCH VREKLQ GL KL+ + S Q ADIFTK L P+ Sbjct: 1293 CDNQSALHIAANQVFHERTKHIDIDCHLVREKLQSGLLKLLPVASPHQLADIFTKSLSPS 1352 Query: 2512 QFNYLISKLSMLNIHAPLEGG 2574 F L SKL MLN+++ LEGG Sbjct: 1353 MFTALYSKLGMLNLYSQLEGG 1373 >gb|PRQ55089.1| putative RNA-directed DNA polymerase [Rosa chinensis] Length = 1285 Score = 790 bits (2040), Expect = 0.0 Identities = 408/815 (50%), Positives = 552/815 (67%), Gaps = 21/815 (2%) Frame = +1 Query: 190 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 369 G+ ERKH+HLL +ARALLFQ+ +P FW ILTAA++INR P PIL KTPFE LF+K Sbjct: 468 GVAERKHRHLLNMARALLFQANLPKPFWGDAILTAAYLINRTPTPILQGKTPFEKLFHKE 527 Query: 370 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 549 P YSH ++FGC +VST + KF R+ +CVF+GYP G KGYK+Y+L + K L+SR V Sbjct: 528 PSYSHLRVFGCQCFVSTHPTRPSKFDPRSMECVFLGYPHGQKGYKVYNLTTKKSLVSRDV 587 Query: 550 IFCENIFPFQEIASKSVSPEAPLFP-ISDLSSAQDQSFSHFQNINAQNDIVTTTQIISDA 726 IF EN FPF + + S LFP I L+ + S I + + +IS Sbjct: 588 IFFENAFPFPKNSESFPSQNTDLFPSIPRLAHYDNPSIP---KIPPSSPTHHSPPMISP- 643 Query: 727 QNDFADAQHDLAAQNTTVVQTDPVNS-------------HIEHSNP---NPPILRRSSRN 858 N + A+ L + + ++ TDPV+S HI +P PP R+S+R Sbjct: 644 -NPQSSAEQYLNSPSKSLSSTDPVSSDITLPNLDTISSDHIPSLSPPEQTPPRPRKSTRA 702 Query: 859 RTNPSYLQVY--DCKIPPSIQSQHSAQTIIS-GKQYPIQDFLSVDKLSNKHKAFSAQISQ 1029 P+ LQ + D +P + S+ + + G + + LS LS+ H+ F+A I+ Sbjct: 703 TKLPTALQDFHIDAALPTRLAPSSSSNEVTTPGTAHSLSHVLSYANLSSPHRTFTANITL 762 Query: 1030 ITEPKSYAQASKQKEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGS 1209 EP S++QA K +W++AM+ E++AL+ N+TW + PA K IGCKWVYK+KY +G+ Sbjct: 763 QREPTSFSQAVKDPKWREAMRLEVQALQDNKTWSLVPPPAHKRPIGCKWVYKIKYNPDGT 822 Query: 1210 IERYKARLVAKGYTQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHG 1389 IERYKARLVAKGY+Q EGLDY +TF+PVAK+TTVR+L++LAA + W LHQLDVNNAFL+G Sbjct: 823 IERYKARLVAKGYSQVEGLDYRETFAPVAKLTTVRVLLSLAAQQNWHLHQLDVNNAFLNG 882 Query: 1390 DLHEEVYMLPPPGYLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSS 1569 DLHE+VYM PPG+ + + KVCKL KSLYGL+QAS+QWF KL+S L + GF QS SD S Sbjct: 883 DLHEDVYMHLPPGFERKGEHKVCKLHKSLYGLRQASKQWFLKLSSALKSAGFKQSWSDYS 942 Query: 1570 LFTMSHDDSITILCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARS 1749 +F SH + T L VYVDD+ILAGN++ I K L F +KD+G LKY LG+EVARS Sbjct: 943 MFVRSHQGTFTALLVYVDDVILAGNNLDDIIRTKSFLSSHFKLKDMGQLKYFLGLEVARS 1002 Query: 1750 TKGIHICQRKYALDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLY 1929 GI + QRKYAL++L +TGFLG+KPS P++ + D L DASQYRRL+GR++Y Sbjct: 1003 KHGIALSQRKYALEILEDTGFLGAKPSRFPLEQNIILTQEDGRLLEDASQYRRLVGRLIY 1062 Query: 1930 LTITRPELSFSIQTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSD 2109 TITRP+L +++ LSQ++ KP HL AAH++ RY+K TP +G+F S PL L + D Sbjct: 1063 QTITRPDLVYAVHILSQFMDKPRQPHLDAAHKVLRYLKQTPGQGIFLPSKGPLELSAYCD 1122 Query: 2110 ADWARCQETRRSISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYL 2289 ADWARC++TRRS +G+ +FLG + ISWK+KKQ T+SRSSAEAEYR++A T CEI WL Y+ Sbjct: 1123 ADWARCKDTRRSTTGYCIFLGHAPISWKTKKQRTVSRSSAEAEYRSMATTCCEITWLQYI 1182 Query: 2290 LQDMHIKHPQSL-MYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPS 2466 L+D++I+H Q + ++ D+++A I+ N HERTKHI++DCH VREK+Q GL + HI + Sbjct: 1183 LKDLNIQHLQPVKLFCDNKAAIHIASNPVFHERTKHIEIDCHVVREKVQRGLIQTEHIRT 1242 Query: 2467 TQQTADIFTKPLPPAQFNYLISKLSMLNIHAPLEG 2571 +Q ADIFTKPL QF+ L+ KL ++NIH+ L G Sbjct: 1243 KEQPADIFTKPLSSEQFSLLLGKLGVINIHSNLRG 1277 >gb|PNX92270.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 1246 Score = 788 bits (2034), Expect = 0.0 Identities = 404/810 (49%), Positives = 544/810 (67%), Gaps = 12/810 (1%) Frame = +1 Query: 190 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 369 G+VERKH+HLL VARALLFQ+ +P +FW ILTAA++INR PILH KTPFEILF+K Sbjct: 437 GVVERKHRHLLNVARALLFQANLPKTFWGDSILTAAYLINRTSTPILHGKTPFEILFHKP 496 Query: 370 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 549 P Y H ++FGCL + S + KF R+ +C+F+GYP G+KGY++YDL + K SR V Sbjct: 497 PTYHHLRVFGCLCFASNHHHKPTKFDTRSIRCIFLGYPYGTKGYRVYDLATGKTFTSRDV 556 Query: 550 IFCENIFPFQEIASKSV--SPEAPLFPISDLSSAQDQSFSHF--QNINAQNDIVTTTQII 717 IF E+IFP+ A+ S + + PL I S+ ++ F A +D T I+ Sbjct: 557 IFHEHIFPYSSAATMSTPSTHQIPLPNIEPFDSSSHETTPSFPTHTPTAFDDQQPTPPIV 616 Query: 718 SDAQNDFADAQHDLAAQNTTVVQTDPVNSHI----EHSNPNPPILRRSSRNRTNPSYLQV 885 + ++Q + + ++ +S E + PP + R PSYL+ Sbjct: 617 TPT----IESQDTIIPTIVSTIEASTSDSTTITPPEAPHDIPPPALMAKRLIRPPSYLRQ 672 Query: 886 YDCKIP---PSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQ 1056 Y ++ S S HS T G +P+ L+ D+LS H+AF+ IS I EP S+ Q Sbjct: 673 YHVEVSLPTRSSPSSHSVLTAPKGIPHPLSSVLNYDRLSPAHRAFTTSISAIKEPTSFHQ 732 Query: 1057 ASKQKEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLV 1236 A K +W+ AM E+RAL N TW + LP +K+ +GCKWVYK+K+ +G+IERYKARLV Sbjct: 733 AVKDPKWRFAMDEELRALHDNGTWSLQHLPPNKNPVGCKWVYKIKFNPDGTIERYKARLV 792 Query: 1237 AKGYTQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYML 1416 AKGY+Q EG DY +TF+PVAK+ TVRLL+A+A++ W L QLDVNNAFLHGDL EEVYM Sbjct: 793 AKGYSQIEGFDYRETFAPVAKLVTVRLLLAVASSMNWHLRQLDVNNAFLHGDLEEEVYMS 852 Query: 1417 PPPGYLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDS 1596 PPGY + + +VCKL KSLYGLKQASRQWF KL+ L+ + QSKSD SLF D S Sbjct: 853 LPPGYGRKGETRVCKLHKSLYGLKQASRQWFIKLSKVLILADYTQSKSDHSLFVRHRDTS 912 Query: 1597 ITILCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQR 1776 T L +YVDDIILAGN++ +IE IK HL E+F +KDLG LKY LGIEV+RS +GI + QR Sbjct: 913 FTALLIYVDDIILAGNNLQEIERIKAHLMEQFKLKDLGNLKYFLGIEVSRSKQGITLSQR 972 Query: 1777 KYALDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELS 1956 KYAL++L + G+L KP+ +PM+ + D + + S YRRL+GR++YLTITRP+L Sbjct: 973 KYALEILEDMGYLAVKPANSPMEQNLSLNKTDGDCIDEPSSYRRLVGRLIYLTITRPDLV 1032 Query: 1957 FSIQTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQET 2136 +++ LSQ++ KP HL AA R+ RYIK TP +G+ + STS L L F DADWARCQ+T Sbjct: 1033 YAVHILSQFMDKPRIPHLEAAQRVLRYIKKTPGQGILFPSTSTLQLNAFCDADWARCQDT 1092 Query: 2137 RRSISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIKHP 2316 RRS SG+ VF+G+SLISWK+KKQ T+SRSSAEAEYR++A CE+ WL +L D+ I+H Sbjct: 1093 RRSTSGYCVFIGNSLISWKTKKQVTVSRSSAEAEYRSMASVCCEVTWLLSVLHDLGIEHQ 1152 Query: 2317 QSL-MYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFT 2493 Q + ++ D+++A I+ N HERTKHI++DCH VREK+Q G+ K HI +++Q AD+FT Sbjct: 1153 QPVKLFCDNQAALHIASNPVFHERTKHIEIDCHLVREKVQAGVVKTYHISTSEQPADVFT 1212 Query: 2494 KPLPPAQFNYLISKLSMLNIHAPLEGGCKD 2583 K L QF+ LI+KL M+NI++ L G K+ Sbjct: 1213 KALSVPQFSNLINKLGMINIYSNLRGSVKN 1242 >gb|PNX74277.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 762 Score = 769 bits (1985), Expect = 0.0 Identities = 407/780 (52%), Positives = 520/780 (66%), Gaps = 16/780 (2%) Frame = +1 Query: 271 WDHCILTAAHIINRLPAPILHNKTPFEILFNKMPDYSHFKIFGCLAYVSTLTAQRHKFQA 450 W+ + A HIINRLP+P+L+ K P+E+L+ K P H K+FGCL+Y +TL A R KF + Sbjct: 2 WNFSVQHAVHIINRLPSPLLNLKCPYELLYKKPPSLVHLKVFGCLSYATTLQAHRTKFDS 61 Query: 451 RASKCVFIGYPLGSKGYKLYDLESHKVLISRHVIFCENIFPFQEIASKSVSPEAPLFPIS 630 RA K +F+G+ G+KGY LYDL SH + +SR+V+F E FP + P+ S Sbjct: 62 RARKAIFLGFKDGTKGYILYDLSSHDIFVSRNVVFYETYFPLRH--------SQPVHNAS 113 Query: 631 DLS------SAQDQSFSHFQNINAQNDIVTTTQIISDAQNDFA-DAQHDLAAQNTTVVQT 789 D S S D SH N + D+ + + + + D + Sbjct: 114 DFSKPLPSNSILDDPVSH-----THNSLPLPVMFEPDSTSPSSVNIEPDRTISSPASSSH 168 Query: 790 DPVNSHIEHSNPN---PPI---LRRSSRNRTNPSYLQVYDCKIPPSIQSQHSAQTIISGK 951 P++S H PN PP LRRS+R T P YL+ Y C S IS Sbjct: 169 TPLSSS-SHDRPNLAPPPYHDNLRRSTRTITRPGYLEDYHC-----YSVTGSVNNNISHP 222 Query: 952 QYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKEWQDAMKAEIRALEKNQTWI 1131 YP+ LS D ++K+F IS I EPK+++QASK W+ AM AE+ AL++N+TW Sbjct: 223 NYPLSSVLSYDNCVPEYKSFCCSISAIIEPKTFSQASKLDCWRKAMDAELLALDENKTWS 282 Query: 1132 MTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQEEGLDYFDTFSPVAKITTV 1311 + DLP K IGCKWVYK+KY +NGSIERYKARLVAKGYTQ EG+DYFDTFSPVAKITTV Sbjct: 283 VVDLPHGKTPIGCKWVYKIKYHANGSIERYKARLVAKGYTQMEGIDYFDTFSPVAKITTV 342 Query: 1312 RLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYLQS-NDKKVCKLTKSLYGLK 1488 R L+ALA+ KGW L QLDVNNAFLHGDL+EEVYM PPGY + KVC+L KSLYGLK Sbjct: 343 RFLLALASIKGWDLEQLDVNNAFLHGDLNEEVYMSLPPGYSSAIGSNKVCRLHKSLYGLK 402 Query: 1489 QASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITILCVYVDDIILAGNDISKIEAI 1668 QASRQW+ KL+S L+++G+ QS SD SL+ S D T L VYVDDI+LAGN +I+A+ Sbjct: 403 QASRQWYSKLSSALISFGYKQSVSDHSLYIKSTDSEFTALLVYVDDIVLAGNSSKEIQAV 462 Query: 1669 KLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYALDLLTETGFLGSKPSTTPMDC 1848 K LD+KF IKDLG L+Y LG E+ARS KGI + QRKY L+LL +TGFL +KPS P + Sbjct: 463 KHFLDQKFKIKDLGKLRYFLGFEIARSPKGIFVNQRKYTLELLQDTGFLATKPSNIPFNP 522 Query: 1849 KAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQTLSQYLSKPTTTHLAAAHRI 2028 + SS D +PL D S YRRLIGR+LYLT TRP++SFS+Q LSQ++SKP H AA RI Sbjct: 523 TTKLSSTDGAPLKDPSSYRRLIGRLLYLTNTRPDISFSVQHLSQFVSKPLIPHYTAATRI 582 Query: 2029 *RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSISGFAVFLGDSLISWKSKKQP 2208 +Y+KS P GLF+ +S L L ++D+DWARC +TR+SI+G+ VF+G SLISWKSKKQ Sbjct: 583 LKYLKSAPANGLFFPVSSSLKLTGYADSDWARCPDTRKSITGYCVFIGSSLISWKSKKQN 642 Query: 2209 TISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIK--HPQSLMYTDSRSAYCISQNSCHHE 2382 T+SRSS EAEYRA+A TCEIQWL YL QD +K +P S ++ DSRSA ++ N HE Sbjct: 643 TVSRSSTEAEYRALASLTCEIQWLQYLFQDFKMKFSNPAS-VFCDSRSAIYLAHNPAFHE 701 Query: 2383 RTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLPPAQFNYLISKLSMLNIHAP 2562 R+KHI++DCH +REK+Q L L+ IPS Q AD+FTKPL F L+SKL++ +IH+P Sbjct: 702 RSKHIEIDCHVIREKIQSQLIHLLPIPSNSQIADMFTKPLHFPAFFDLLSKLNLCSIHSP 761 >gb|KYP40677.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 810 Score = 769 bits (1985), Expect = 0.0 Identities = 403/793 (50%), Positives = 526/793 (66%), Gaps = 2/793 (0%) Frame = +1 Query: 190 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 369 GIVERKHQH+L V R+LLFQSKVP SFW I A HIINRLP P L+NK+PFE++FN+ Sbjct: 62 GIVERKHQHILNVCRSLLFQSKVPKSFWSFAIKHAVHIINRLPTPFLNNKSPFEMIFNQK 121 Query: 370 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 549 P+ K FGCLAYVSTL+ R+K + RA KC+F+G+ G+KG+ L++L S +L+SRH Sbjct: 122 PNLHDLKTFGCLAYVSTLSGGRNKLEPRAHKCIFLGFKTGTKGFVLFNLHSKSILLSRHA 181 Query: 550 IFCENIFPFQEIASKSVSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTT-QIISDA 726 IF + +FP+ + + + PIS+ D F + + +D +T I +A Sbjct: 182 IFHQTVFPYINVFDNQSNSN--IVPISN-----DNIFQPYDFFPSYSDHLTNNPNSIENA 234 Query: 727 QNDFADAQHDLAAQNTTVVQTDPVNSHIEHSNPNPPILRRSSRNRTNPSYLQVYDCKIPP 906 N+F + D + + + IE P R S R+++ P YL Y C Sbjct: 235 SNNFENHTQDPPIEMSQSIP-------IETGQIQPITTRVSLRSKSRPGYLDQYHCYNTV 287 Query: 907 SIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKEWQDA 1086 S S S YP+ ++ + +AS + W+ A Sbjct: 288 SSDST-------SNSLYPMHLYI------------------------FKEASTKDCWKQA 316 Query: 1087 MKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQEEGL 1266 M AE+ ALE+N+TW + LP K IGCKWVY+ K+++NG++ERYKARLVAKG+TQ EG+ Sbjct: 317 MIAELDALERNRTWSLVTLPPGKKLIGCKWVYRTKHKANGTVERYKARLVAKGFTQTEGI 376 Query: 1267 DYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYLQSND 1446 DYF+TFSPVAKIT++R L+ALA++ WF+HQLD++NAFLHGDL EEVYM PP G + Sbjct: 377 DYFETFSPVAKITSIRFLLALASSHNWFIHQLDIDNAFLHGDLDEEVYMRPPQGLRLPSS 436 Query: 1447 KKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITILCVYVDD 1626 K VCKL KSLYGLKQASR W QKLTS L G+ QS +D SLF SITIL +YVDD Sbjct: 437 KLVCKLEKSLYGLKQASRNWNQKLTSELTLMGYKQSFADHSLFVNFTGSSITILLIYVDD 496 Query: 1627 IILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYALDLLTET 1806 I+L+GND+++I+ +K HL KF IKDLG+LK+ LG+EVARS KGI + QRKY L+L+ ET Sbjct: 497 IVLSGNDMTEIKKVKAHLHNKFHIKDLGSLKFFLGLEVARSKKGILLNQRKYCLELIDET 556 Query: 1807 GFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQTLSQYL 1986 G LG KP+ TP D + + L D + +RRLIGR+LYLT TRP++SFS+Q LSQ++ Sbjct: 557 GLLGCKPAPTPADPAMKLHVDHGDLLHDPTVFRRLIGRLLYLTNTRPDISFSVQQLSQFV 616 Query: 1987 SKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSISGFAVF 2166 SKP H+ AA RI RY+K P GLFYSST+PL ++ FSD+DWA C TRRS++G+ VF Sbjct: 617 SKPREPHMQAALRIVRYLKGAPGLGLFYSSTNPLKIQAFSDSDWATCATTRRSVTGYCVF 676 Query: 2167 LGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIKHPQSL-MYTDSR 2343 +G+SLISWKSKKQ T+SRSS+EAEYRA+A TCE+QWL YL MHIK P ++DS+ Sbjct: 677 IGNSLISWKSKKQSTVSRSSSEAEYRALASLTCELQWLKYLCDSMHIKIPTPFATFSDSQ 736 Query: 2344 SAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLPPAQFNY 2523 SA IS+N HERTKHI++DCH +R K+QEGL LIH+ S Q AD FTK L P F+ Sbjct: 737 SAIQISKNPTFHERTKHIEVDCHLIRIKIQEGLLHLIHVLSANQLADAFTKALFPKPFHT 796 Query: 2524 LISKLSMLNIHAP 2562 ISKL +LNI+ P Sbjct: 797 AISKLGLLNIYHP 809 >gb|PNX93614.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 1430 Score = 786 bits (2030), Expect = 0.0 Identities = 411/804 (51%), Positives = 543/804 (67%), Gaps = 9/804 (1%) Frame = +1 Query: 190 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 369 GIVERKHQH+L VARAL FQ+ +P +FW IL + H+INRLP P L +K+P+E+LF + Sbjct: 643 GIVERKHQHILNVARALSFQAFLPSNFWHLSILHSVHLINRLPTPFLQHKSPYEVLFQQP 702 Query: 370 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 549 P H + FGCLA+ STL R KF RA K VF+GY G+KG+ LYD+ +H L+SR+V Sbjct: 703 PTLLHLRTFGCLAFASTLHNHRTKFMPRARKTVFLGYRDGTKGFLLYDISNHSFLVSRNV 762 Query: 550 IFCENIFPFQEIASKSVSPEAPL----FPISDLSSAQDQSFSHFQNINAQNDIVTTTQII 717 IF E++FP + S S L PI + + A + T T + Sbjct: 763 IFYEDVFPLSSVNSSHTSSTTTLDNFVLPIDPPNFPS--------SCPAPLSVSTGTNPL 814 Query: 718 SDAQNDFADAQHDLAAQNTTVVQTDPVNSHIEHSNPNPPILRRSSRNRTNPSYLQVYDCK 897 +D + A + + + V P NS I P R S+R R P YLQ + C Sbjct: 815 TDHAENSATLVDNQVSNSPAV---PPQNSSI------PAPTRVSNRIRKIPGYLQDFHCS 865 Query: 898 IPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKEW 1077 + PS QH + + + YPI LS + +K F IS EPK++ QA K W Sbjct: 866 LLPS---QHQSSSSNAFSTYPISSSLSYTNCATAYKHFCLSISTTIEPKTFKQACKSDCW 922 Query: 1078 QDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQE 1257 ++AMK+E+ ALE N+TW + DLP K+ IGCKWVYK+K+ ++GSIERYKARLVAKGYTQ Sbjct: 923 KEAMKSELAALELNRTWSIVDLPTGKNPIGCKWVYKIKHNADGSIERYKARLVAKGYTQM 982 Query: 1258 EGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYLQ 1437 EG+DYFDTFSPVAK+TTV+ L+ALA+ KGWFL QLDVNNAFLHGDL+EEVYM PPG + Sbjct: 983 EGVDYFDTFSPVAKLTTVKTLLALASIKGWFLEQLDVNNAFLHGDLNEEVYMSLPPGVII 1042 Query: 1438 ----SNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITI 1605 SN KVC+L KSLYGLKQASRQW+ KL+S LL+ G+ QS +D SLF S T Sbjct: 1043 PNSCSNTPKVCRLHKSLYGLKQASRQWYSKLSSALLSLGYSQSAADHSLFLKKVGSSFTA 1102 Query: 1606 LCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYA 1785 L VYVDDI+LAGN+ +I ++K LD++F IKDLG L++ +G+E+ARS KGI + QRKY Sbjct: 1103 LLVYVDDIVLAGNNSLEITSVKSFLDKRFQIKDLGNLRFFVGLEIARSKKGILLNQRKYT 1162 Query: 1786 LDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSI 1965 L+LL ++G L +KPS+TP D + ++S P D S YRRLIGR+LYLT TRP+++F++ Sbjct: 1163 LELLQDSGNLAAKPSSTPYDPSLKLHDSESPPYNDPSGYRRLIGRLLYLTTTRPDITFAV 1222 Query: 1966 QTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRS 2145 Q LSQ++S P H AA ++ RY+K++P KGLF+SS+S L L FSD+DWA C TR+S Sbjct: 1223 QQLSQFVSSPREVHFQAATKVLRYLKASPAKGLFFSSSSSLKLSGFSDSDWATCAITRKS 1282 Query: 2146 ISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIK-HPQS 2322 I+G+ VFLG SLISWKSKKQ T+SRSS+EAEYRA+A +CE+QWL YL +D+ IK + Sbjct: 1283 ITGYCVFLGTSLISWKSKKQSTVSRSSSEAEYRALASLSCELQWLHYLFKDLGIKFDAPA 1342 Query: 2323 LMYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPL 2502 ++Y D++SA ++ N HERTKHI++DCH VRE++Q GL L+ +PS+ Q AD+ TK L Sbjct: 1343 MVYCDNKSAIYLAHNPSFHERTKHIEIDCHVVRERIQSGLIHLLPVPSSSQLADVLTKQL 1402 Query: 2503 PPAQFNYLISKLSMLNIHAPLEGG 2574 + F LISKL +L+IH+P GG Sbjct: 1403 SSSAFASLISKLGLLDIHSPACGG 1426 >gb|PNX93928.1| hypothetical protein L195_g017092, partial [Trifolium pratense] Length = 865 Score = 761 bits (1964), Expect = 0.0 Identities = 388/795 (48%), Positives = 533/795 (67%), Gaps = 4/795 (0%) Frame = +1 Query: 190 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 369 G+VERKH+H+L VARALLF S +PL FW C+LTA ++INRLP+P+L NK+PFE+L+NK Sbjct: 83 GVVERKHRHILVVARALLFHSHLPLEFWGECVLTAVYLINRLPSPLLSNKSPFELLYNKP 142 Query: 370 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 549 P H ++FGCL Y +T+ HKF RA + +F+GYP G KGYK+YD E+ +SR V Sbjct: 143 PSLDHLRVFGCLCY-ATIVHPTHKFDPRAKRGIFVGYPTGQKGYKIYDPETKTFFVSRDV 201 Query: 550 IFCENIFPFQEIASKS--VSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTTQIISD 723 FCE FP S+ +S I DL S SH Q+ Q DI +T + Sbjct: 202 KFCETNFPSIPNTSEPNLISSHPSYEAIDDLPSPTS---SHHQS--QQTDIPSTHE---- 252 Query: 724 AQNDFADAQHDLAAQNTTVVQTDPVNSHI-EHSNPNPPILRRSSRNRTNPSYLQVYDCKI 900 N + + ++ + +V+ P+ +H + P P +R+S R++ P + Y + Sbjct: 253 -PNSPSHITTETSSAASPIVEPTPLTTHTTDPPTPFIPQVRKSVRDKHPPIWHNDYH--M 309 Query: 901 PPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKEWQ 1080 + S T SG +YP+ +LS ++S+ + AF A I+ EP+SY QA WQ Sbjct: 310 STQVNKTPSEPTSGSGTRYPLSHYLSYSRISSSNCAFLANITAHREPQSYDQAVHDPLWQ 369 Query: 1081 DAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQEE 1260 DAM AE+ ALE+N TW + LP+ IGCKWVYK+KY+S+G+IERYKARLVAKGYTQ E Sbjct: 370 DAMNAELEALEQNNTWSLVPLPSGHKPIGCKWVYKIKYKSDGTIERYKARLVAKGYTQVE 429 Query: 1261 GLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYLQS 1440 G+DY +TFSP AK+TT+R L+ +AAA+ WF+HQLDV NAFLHGDLHE VYM PPPG + Sbjct: 430 GIDYQETFSPTAKVTTLRCLLTVAAARNWFIHQLDVQNAFLHGDLHELVYMEPPPGLRRQ 489 Query: 1441 NDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITILCVYV 1620 + VC+L KSLYGLKQASR WF + + G+ QSK+D SLFT S S T + +YV Sbjct: 490 GENVVCRLNKSLYGLKQASRNWFSTFSEVIQKAGYQQSKADYSLFTKSQGTSFTAVLIYV 549 Query: 1621 DDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYALDLLT 1800 DDI+L GND+ +++ +K L ++F IKDLG LKY LGIE +RS KGI + QRKYALD+L Sbjct: 550 DDILLTGNDLQEMKRLKEFLLKRFRIKDLGNLKYFLGIEFSRSKKGIFMSQRKYALDILQ 609 Query: 1801 ETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQTLSQ 1980 ++G G++P PM+ + + D L D ++YRRL+GR++YLT+TRP++ +S+QTLSQ Sbjct: 610 DSGLTGARPDKFPMEQNLKLTPTDGVVLNDPTKYRRLVGRLIYLTVTRPDIVYSVQTLSQ 669 Query: 1981 YLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSISGFA 2160 ++ +P H AA R+ RYIK TP +GL +SST+ L LK F D+DW C TRRS++GF Sbjct: 670 FMHEPRKPHWDAALRVLRYIKGTPGQGLLFSSTNDLTLKAFCDSDWGGCHATRRSVTGFC 729 Query: 2161 VFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHI-KHPQSLMYTD 2337 +FLG+SLISWKSKKQ +SRSSAE+EYRA+A T E+ WL ++LQD+ + ++ + ++ D Sbjct: 730 LFLGNSLISWKSKKQVVVSRSSAESEYRAMANTCLELTWLRFILQDLKVSQNTPTPLFCD 789 Query: 2338 SRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLPPAQF 2517 +++A I+ N HERTKHI++DCH VREKLQ G+ ++P+ Q AD+FTK L QF Sbjct: 790 NQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIINPSYVPTRFQLADVFTKALGKDQF 849 Query: 2518 NYLISKLSMLNIHAP 2562 L SKL + +IH+P Sbjct: 850 VTLRSKLGLHDIHSP 864 >gb|KYP39497.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1445 Score = 779 bits (2011), Expect = 0.0 Identities = 404/799 (50%), Positives = 528/799 (66%), Gaps = 8/799 (1%) Frame = +1 Query: 190 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 369 GIVERKHQHLL V RALLFQS++P +FW + I A HIINRLP L K+PFE++++ Sbjct: 691 GIVERKHQHLLNVCRALLFQSQIPKTFWSYAIKHAVHIINRLPTRFLKQKSPFEMIYHHK 750 Query: 370 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 549 PD ++FGCLAY STL A R K RASKC+F+GY G+KGY LY+L S +SR+V Sbjct: 751 PDLHDLRVFGCLAYASTLAAGRTKLAPRASKCIFLGYKSGTKGYVLYNLHSRFFSLSRNV 810 Query: 550 IFCENIFPFQEIASKSVSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTTQIISDAQ 729 IF E +FP+ S F ++ +D + T ++ Sbjct: 811 IFHETVFPY----------------------------SDFHKFSSNSDAILT-----EST 837 Query: 730 NDFADAQ-HDLAAQNTTVVQT------DPVNSHIEHSNPNPPILRRSSRNRTNPSYLQVY 888 NDFA + + + TT T P+ ++ P R S+R + P+YL Y Sbjct: 838 NDFAHYELYPITVLETTNTHTPSEQLQQPLTEEMQDIPVQP---RVSTRQKFRPNYLNQY 894 Query: 889 DCKIPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQ 1068 C +SA + S YP+ +LS +K S+ + AF IS EP +YAQA Q Sbjct: 895 HC---------YSATSSNSACLYPLHSYLSYNKCSSNYTAFCLSISAHVEPSNYAQAITQ 945 Query: 1069 KEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGY 1248 W+ AM E+ AL +N+TW + LP + IGCKWVY++KY+++GS++R+KARLVAKG+ Sbjct: 946 DCWKQAMMTELDALNRNRTWSLVSLPPGRKLIGCKWVYRIKYKADGSVDRHKARLVAKGF 1005 Query: 1249 TQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPG 1428 TQ EG+DYF+TFSPVAK+TTVR L+ALA+ WFLHQLDV+NAFLHGDL EEVYM PP G Sbjct: 1006 TQTEGIDYFETFSPVAKLTTVRFLLALASTNNWFLHQLDVDNAFLHGDLEEEVYMRPPQG 1065 Query: 1429 YLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITIL 1608 + K VCKL KSLYGLKQASR W QKL S LL+ G+ QS +D SLF SITIL Sbjct: 1066 LQLPDSKLVCKLEKSLYGLKQASRNWNQKLNSELLHLGYKQSSADHSLFIKKQGSSITIL 1125 Query: 1609 CVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYAL 1788 VYVDD++L+GN+ S+I+ +K HL +KF IKDLG LK+ LG+EVARS KGI + QRKY L Sbjct: 1126 LVYVDDVVLSGNNFSEIQIVKQHLHQKFQIKDLGPLKFFLGLEVARSKKGIILNQRKYCL 1185 Query: 1789 DLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQ 1968 +L+ E+G LGSKP+ TP D + ++ S L D + +RRLIGR+LYLT TRP++SFS+Q Sbjct: 1186 ELIDESGLLGSKPAPTPADPAIKLHADHGSLLNDPTSFRRLIGRLLYLTNTRPDISFSVQ 1245 Query: 1969 TLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSI 2148 LSQ++S+P HL AA RI RY+K P GLFY + + L ++ FSD+DWA C TR+S+ Sbjct: 1246 QLSQFVSQPREPHLQAALRIVRYLKGAPGLGLFYPAENHLRIQAFSDSDWATCSTTRKSV 1305 Query: 2149 SGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIKHPQSL- 2325 +G+ VFLG SL+SWKSKKQ T+SRSS+EAEYRA+A TCE+QWL +L + +K P Sbjct: 1306 TGYCVFLGKSLVSWKSKKQSTVSRSSSEAEYRALASLTCELQWLKFLSDSLFVKIPAPFS 1365 Query: 2326 MYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLP 2505 +++DS+SA I++N HERTKHI++DCH +R KLQE L LIH+PS Q AD FTK L Sbjct: 1366 VFSDSQSAIQIAKNPTFHERTKHIEVDCHLIRIKLQEELLHLIHVPSANQLADAFTKSLY 1425 Query: 2506 PAQFNYLISKLSMLNIHAP 2562 P F + ISKL + NIH P Sbjct: 1426 PKLFLHAISKLGLSNIHGP 1444 >gb|KYP42321.1| Copia protein [Cajanus cajan] Length = 1456 Score = 775 bits (2002), Expect = 0.0 Identities = 395/801 (49%), Positives = 537/801 (67%), Gaps = 10/801 (1%) Frame = +1 Query: 190 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 369 GIVERKHQH+L V RALLFQ+K+P FW + A HIINRLP P+L K+PFE+++N Sbjct: 662 GIVERKHQHILNVCRALLFQAKLPKQFWSFAVKQATHIINRLPTPLLSQKSPFEMIYNCK 721 Query: 370 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 549 PD + K+FGCLA+ +TL+++R K RASKC+F+GY G+KG+ L++L + LISR V Sbjct: 722 PDLTELKVFGCLAFATTLSSKRTKLDRRASKCIFLGYKNGTKGFLLFNLHNKSFLISRDV 781 Query: 550 IFCENIFPFQ-EIASKSVSPEAPLFPISDLSSA------QDQSFSHFQ-NINAQNDIVTT 705 +F E IFP+ + S S S L + D + +FSH +I + ++ Sbjct: 782 LFYEKIFPYSAHVPSMSASDSLLLDVVKDNDTTIYSDPFPTTTFSHGSPSIPLDTPLPSS 841 Query: 706 TQIISDAQNDFADAQH-DLAAQNTTVVQTDPVNSHIEHSNPNPPILRRSSRNRTNPSYLQ 882 IS + F+ + + + N+ + S P R S+R R P YLQ Sbjct: 842 ETTISTDRPPFSPINTCPIPTATLSTPELPSSNTTNDASQVVMPQTRVSTRIRKPPRYLQ 901 Query: 883 VYDCKIPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQAS 1062 Y C+ ++ S +A + YP+ F++ + S H +F IS EP S+ +A+ Sbjct: 902 EYYCE---NLASSSAASNCL----YPLSSFVTYNNCSPSHTSFCLSISAQHEPTSFKEAN 954 Query: 1063 KQKEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAK 1242 ++ W+ AM+AE++ALEKNQTW + LP K +GCKWVY+VKY+ +GS+ERYKARLVAK Sbjct: 955 SEECWRRAMEAELQALEKNQTWSLVRLPEGKRPVGCKWVYRVKYKVDGSVERYKARLVAK 1014 Query: 1243 GYTQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPP 1422 G+TQ EG+DYF+TFSPV K++TVR L++LAAA WFLHQLDV+NAFLHGDL EEVYM PP Sbjct: 1015 GFTQTEGVDYFETFSPVVKLSTVRFLLSLAAAHNWFLHQLDVDNAFLHGDLFEEVYMKPP 1074 Query: 1423 PGYLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSIT 1602 PG+ S+ + VCKL KSLYGLKQASRQW QKLT L++ FIQS +D SLF SIT Sbjct: 1075 PGFKLSHPRLVCKLHKSLYGLKQASRQWNQKLTEALISLNFIQSSTDHSLFIKKSHSSIT 1134 Query: 1603 ILCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKY 1782 L VYVDD++L GND+++I A+K +L +F IKDLG LK+ LG+E+ARS G+ + QRKY Sbjct: 1135 ALLVYVDDVVLTGNDMAEISAVKAYLHAQFHIKDLGPLKFFLGLEIARSQSGLILNQRKY 1194 Query: 1783 ALDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFS 1962 L+LL+E G KP +TP+D + +++ PL D + +RRLIGR+LYLT TRP++SF+ Sbjct: 1195 CLELLSEHGLTDCKPVSTPIDASVKLYASEGLPLDDPTIFRRLIGRLLYLTNTRPDISFA 1254 Query: 1963 IQTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRR 2142 +Q LSQ++ P TH AA RI RY+KS+P GLFY S + ++ FSD+DWA C TRR Sbjct: 1255 VQQLSQFVDSPRATHFQAALRILRYLKSSPALGLFYPSQTEHRIQAFSDSDWASCPNTRR 1314 Query: 2143 SISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIKHPQS 2322 S++GF +F G +LISWKSKKQ T+SRSS+EAEYRA+A TCE+QWL +L D+ I P Sbjct: 1315 SVTGFCIFYGSALISWKSKKQSTVSRSSSEAEYRALASVTCELQWLLFLCHDLSINIPTP 1374 Query: 2323 L-MYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKP 2499 ++ DS+SA I++N HERTKHI++DCH R K+Q+GL L H+PS Q AD+FTK Sbjct: 1375 FSIFCDSQSAIYIAKNPTFHERTKHIEVDCHLTRLKIQQGLIHLFHVPSKSQLADVFTKA 1434 Query: 2500 LPPAQFNYLISKLSMLNIHAP 2562 L P F +SKL +++I+ P Sbjct: 1435 LYPRNFTEAVSKLCLIDIYNP 1455 >emb|CAN71595.1| hypothetical protein VITISV_010143 [Vitis vinifera] Length = 1523 Score = 776 bits (2005), Expect = 0.0 Identities = 401/806 (49%), Positives = 539/806 (66%), Gaps = 12/806 (1%) Frame = +1 Query: 190 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 369 G+VERKH+HLL VARALLFQS +P FW ILTAA++INR P P+L KTPFE LF+K Sbjct: 684 GVVERKHRHLLNVARALLFQSHLPKPFWGDAILTAAYLINRTPTPLLQGKTPFEKLFHKS 743 Query: 370 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 549 P+YSH ++FGC +VST + KF R+ + VFIGYP G KGYK+Y L+ K LISR V Sbjct: 744 PNYSHLRVFGCRCFVSTHPLRPSKFDPRSIESVFIGYPHGQKGYKVYSLKDKKXLISRDV 803 Query: 550 IFCENIFPFQEIASKSVSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTTQIISDAQ 729 F E FP+Q S + FP + D F + + T+ + Q Sbjct: 804 TFFETEFPYQNXLSTTSPSLDTFFPSLPQTPDIDDDHISFNHSGSNLQPSATSSVDXHPQ 863 Query: 730 ----NDFADAQHDLAAQNTTVVQTDPVNSHIEHSNPNPPILRRSSRNRTNPSYLQVYDCK 897 N + + D + ++ + PV S P+P RRSSR P+ LQ + + Sbjct: 864 PTLDNSHSSSHVDPPSSPPSLNTSPPVISQ-----PSPSQPRRSSRPTKTPTTLQDFHIE 918 Query: 898 -------IPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQ 1056 +PPS S+ + SG + + LS D+LS HKAF+ +I+ EP+S++Q Sbjct: 919 AALPSRPVPPSSTSEVAH----SGTIHSLSQVLSYDRLSPMHKAFTVKITLAKEPRSFSQ 974 Query: 1057 ASKQKEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLV 1236 A W++AM EI+AL+ N+TW + LP+ K IGCKWVYK+KY +G+IERYKARLV Sbjct: 975 AVLDSRWREAMNTEIQALQANKTWSLVPLPSHKKPIGCKWVYKIKYNPDGTIERYKARLV 1034 Query: 1237 AKGYTQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYML 1416 AKG++Q EG+DY +TF+PVAK+TTVR+L++LA+ +GW LHQLDVNNAFL+GDL+E+VYM Sbjct: 1035 AKGFSQVEGIDYRETFAPVAKLTTVRVLLSLASIQGWHLHQLDVNNAFLNGDLYEDVYMQ 1094 Query: 1417 PPPGYLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDS 1596 PPG+ + + +VCKL KSLYGLKQASRQWF KL+S L GF QS SD SLF + Sbjct: 1095 LPPGFGRKGEHRVCKLHKSLYGLKQASRQWFLKLSSALKAAGFKQSWSDYSLFXRNTQGR 1154 Query: 1597 ITILCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQR 1776 T L VYVDD+ILAGN + I K L F +KD+G L+Y LGIEVARS +GI +CQR Sbjct: 1155 FTTLLVYVDDVILAGNSLEDIIETKQFLASHFKLKDMGQLRYFLGIEVARSKQGIVLCQR 1214 Query: 1777 KYALDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELS 1956 KYAL+LL + GFLG+KPS P++ + D + L DASQYRRL+GR++YLTITRP+L Sbjct: 1215 KYALELLEDAGFLGAKPSRFPVEQSLTLTRGDGAELKDASQYRRLVGRLIYLTITRPDLV 1274 Query: 1957 FSIQTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQET 2136 +++ LSQ++ P HL AA+++ RY+K TP +G+F ST L L + DADWARC++T Sbjct: 1275 YAVHILSQFMDTPRQPHLDAAYKVLRYVKQTPGQGIFLPSTGQLELTAYCDADWARCKDT 1334 Query: 2137 RRSISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIKHP 2316 RRS +G+ +F G++ ISWK+KKQ T+SRSSAEAEYR++A T CEI WL LL D+++ H Sbjct: 1335 RRSTTGYCIFFGNAPISWKTKKQGTVSRSSAEAEYRSMATTCCEITWLRSLLADLNVNHA 1394 Query: 2317 QSL-MYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFT 2493 ++ ++ D+++A I+ N HERTKHI++DCH VREK+Q GL K +HI + +Q AD+FT Sbjct: 1395 HAVKLFCDNQAAIHIASNPVFHERTKHIEMDCHVVREKVQRGLVKTMHIRTQEQPADLFT 1454 Query: 2494 KPLPPAQFNYLISKLSMLNIHAPLEG 2571 KPL QF+ L+SKL ++NIH L G Sbjct: 1455 KPLSSKQFSTLLSKLGVINIHTNLRG 1480 >dbj|GAU46782.1| hypothetical protein TSUD_351810 [Trifolium subterraneum] Length = 1512 Score = 772 bits (1994), Expect = 0.0 Identities = 404/815 (49%), Positives = 528/815 (64%), Gaps = 21/815 (2%) Frame = +1 Query: 190 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 369 G VERKHQHLL V R+LLFQSK+P FW + + A +IINR+ P+L NK+P+ +L+NK Sbjct: 691 GRVERKHQHLLNVGRSLLFQSKLPKKFWSYAVSHATYIINRVCTPLLQNKSPYHLLYNKP 750 Query: 370 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 549 PD K+FG L Y STL QR K RA KC+F+GY G KG LYD+ +H + +SR++ Sbjct: 751 PDLEQLKVFGSLCYASTLQNQRTKLDPRARKCIFLGYKSGMKGVILYDIHNHNIFVSRNI 810 Query: 550 IFCENIFPFQEIASKSV-----SPEAPLFPISDLSSAQDQSFSHFQN--------INAQN 690 ++I P+ +S S+ SP F S++ S H + + +N Sbjct: 811 THYDHILPYAS-SSYSIPWSYHSPNIDPFITPPTSNSGSSSIPHSTDHIHFNTPMCDQEN 869 Query: 691 DIVTTTQIISD------AQNDFADAQHDLAAQNTTVVQTDPVNSHIEHSNPNPPILRRSS 852 ++Q SD ND +Q + Q P + S+ + P R+S+ Sbjct: 870 PSQPSSQTPSDLFVPQVTDNDIVSSQPSIPHQPHDTHSPLPTTNLPSPSHNSIPQTRQST 929 Query: 853 RNRTNPSYLQVYDCKIPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQI 1032 R P +L Y C + S S+ G YPI F S +S+K + ++ I+ Sbjct: 930 RMSVKPKHLSDYVCNL-----SVDSSPPSSPGILYPISSFHSYSNISSKFRNYALSITAS 984 Query: 1033 TEPKSYAQASKQKEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSI 1212 EP+ Y +AS+Q+ W DAM EI+AL+ N+TW PA IGCKWVYKVK++++GS+ Sbjct: 985 VEPRDYKEASQQQCWVDAMNNEIQALQHNKTWCYVTPPAHIKPIGCKWVYKVKHKADGSV 1044 Query: 1213 ERYKARLVAKGYTQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGD 1392 ERYKARLVAKGY Q EGLD+FDTFSPVAKITTVR LIALA+ + W L+Q+DVNNAFLHGD Sbjct: 1045 ERYKARLVAKGYNQVEGLDFFDTFSPVAKITTVRTLIALASIRSWHLNQMDVNNAFLHGD 1104 Query: 1393 LHEEVYMLPPPGYLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSL 1572 L E+VYM P G +VCKL KSLYGLKQASR+W++KLTS LL G+ Q+ SD SL Sbjct: 1105 LQEDVYMEVPQGVNSPKPHQVCKLLKSLYGLKQASRKWYEKLTSLLLKEGYTQASSDHSL 1164 Query: 1573 FTMSHDDSITILCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARST 1752 FT+ H T L VYVDDIILAGN + + IKL +D F IKDLG LKY LGIEVA S Sbjct: 1165 FTLKHGSDFTALLVYVDDIILAGNSLQEFARIKLIMDNAFKIKDLGPLKYFLGIEVAHSK 1224 Query: 1753 KGIHICQRKYALDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYL 1932 +GI ICQRKY LDLL +TG LGSKP+ TP+D + + S D YRRLIG++LYL Sbjct: 1225 QGISICQRKYCLDLLKDTGLLGSKPAPTPLDPSIKLHQDSSPAYDDVGGYRRLIGKLLYL 1284 Query: 1933 TITRPELSFSIQTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDA 2112 T TRP++SF+IQ LSQ+LS PTTTH A R+ RY+K +P +GLF+ SPL L F+DA Sbjct: 1285 TTTRPDISFAIQQLSQFLSSPTTTHFDTACRVVRYLKGSPGRGLFFPRQSPLQLLGFADA 1344 Query: 2113 DWARCQETRRSISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLL 2292 DWA C +TRRS SG+ F+G SLISW++KKQ T+SRSS+EAEYR+++ +CE+QW+ YLL Sbjct: 1345 DWANCADTRRSTSGYCFFIGSSLISWRAKKQNTVSRSSSEAEYRSLSFASCELQWIVYLL 1404 Query: 2293 QDMHIK-HPQSLMYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPST 2469 +D+ I ++Y D++SA I+ N HERTKH+++DCH VR+K+Q G+FKL+ I + Sbjct: 1405 KDLSIDCERPPVLYCDNQSAIHIASNPVFHERTKHLEIDCHLVRDKVQSGVFKLLPISTK 1464 Query: 2470 QQTADIFTKPLPPAQFNYLISKLSMLNI-HAPLEG 2571 Q AD FTK LPP FN +SKL+MLNI H P G Sbjct: 1465 AQLADFFTKALPPKVFNSFLSKLNMLNIFHVPACG 1499 >gb|PNY17451.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 1425 Score = 768 bits (1983), Expect = 0.0 Identities = 400/823 (48%), Positives = 534/823 (64%), Gaps = 29/823 (3%) Frame = +1 Query: 190 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 369 GIVERKHQH+L ARALLFQS +P FW H + + HIINRLP P L K+P+++L+N + Sbjct: 595 GIVERKHQHILGTARALLFQSHLPKIFWAHAVGHSVHIINRLPTPFLSQKSPYQMLYNCL 654 Query: 370 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 549 PD ++ K+FGCLAY +TL RHK +R+ KCV +G G KG+ L+DL+S +V ISR V Sbjct: 655 PDINNLKVFGCLAYATTLQTNRHKLDSRSRKCVSLGLKTGVKGHILFDLQSREVFISRDV 714 Query: 550 IFCENIFPF-----QEIASKSVSPEAP--------LFPISDLSSAQDQSFSHFQNINAQN 690 +F E+IFPF +I + + ++P LF + S Q + Sbjct: 715 VFFEHIFPFYTKNQHQIDQTNSATQSPILYDDLDMLFTNHSTHHSSSPSLPLLQTATPHS 774 Query: 691 DIVTTTQIISDAQNDFADAQHDLAAQNTTVVQTDPV-----NSHIEHSNPNPPIL----- 840 + D + HD + V++TD + N+ + S+ + PI+ Sbjct: 775 PTSIPSTHSPDDHSSPPSPTHDHHSPCDPVIETDVMIPTSTNTPLTTSSNSLPIIAPPSI 834 Query: 841 ---RRSSRNRTNPSYLQVYDCKIPPSIQSQHSAQTIISGKQ--YPIQDFLSVDKLSNKHK 1005 R+S R + PSYLQ Y KI +I S T S Q +PI F+S D LS HK Sbjct: 835 NPVRKSDRVKHPPSYLQDYHTKILGNISHSASDSTHPSSSQCKFPISSFISYDHLSPAHK 894 Query: 1006 AFSAQISQITEPKSYAQASKQKEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYK 1185 ++ IS +TEP SY +A + W+ A+ E+ AL KN TW M LP K IGCKWV+K Sbjct: 895 HYALNISTLTEPSSYEEAMCDENWKSAVNVELTALLKNNTWDMVKLPPHKKAIGCKWVFK 954 Query: 1186 VKYRSNGSIERYKARLVAKGYTQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLD 1365 +K ++G++ER+KARLVAKG+TQ EG+DY DTFSPV K+TTVR +A+AA++ W L QLD Sbjct: 955 LKLHADGTVERHKARLVAKGFTQTEGIDYIDTFSPVVKMTTVRTFMAIAASQNWPLFQLD 1014 Query: 1366 VNNAFLHGDLHEEVYMLPPPGYLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGF 1545 VN AFLHGDL+EEVYM PPPG ++ VCKL +SLYGLKQASRQW KLT LL+ G+ Sbjct: 1015 VNTAFLHGDLNEEVYMKPPPGLPLAHPDLVCKLQRSLYGLKQASRQWNVKLTETLLSSGY 1074 Query: 1546 IQSKSDSSLFTMSHDDSITILCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYI 1725 IQSK+D SLFT + T + VYVDD++L G DI +I +K LD KF+IKDLG+LKY Sbjct: 1075 IQSKADYSLFTKNTSTGFTAILVYVDDLVLGGTDIDEIHQLKALLDTKFSIKDLGSLKYF 1134 Query: 1726 LGIEVARSTKGIHICQRKYALDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYR 1905 LG EVARS GI +CQRKY LDLL ++G LG+KP+ TPM Q + + ++D + YR Sbjct: 1135 LGFEVARSKTGISLCQRKYTLDLLQDSGLLGTKPTPTPMQPHLQLQKSSGNAISDPTTYR 1194 Query: 1906 RLIGRMLYLTITRPELSFSIQTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSP 2085 RLIGR+LYLT +RPE+S+++ LSQ+L PT H+ A + +Y+K+ P +GLF+SS+S Sbjct: 1195 RLIGRLLYLTHSRPEISYAVSKLSQFLDSPTDAHMLAGLHVLKYLKNNPGQGLFFSSSSS 1254 Query: 2086 LHLKCFSDADWARCQETRRSISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTC 2265 L LK +SD+DW C +TRRS +GF FLG S+ISWKSKKQ +SRSS+EAEYRA+A C Sbjct: 1255 LALKGYSDSDWGACPDTRRSTTGFCFFLGTSIISWKSKKQTVVSRSSSEAEYRALAQAAC 1314 Query: 2266 EIQWLTYLLQDMHIKHPQS-LMYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGL 2442 E QWL YLLQD I H ++Y D++SA I+ N HERTKHI++DCH VR+K+Q + Sbjct: 1315 EGQWLLYLLQDFQIPHDSPIILYCDNKSALHIAANPVFHERTKHIEIDCHVVRDKVQANI 1374 Query: 2443 FKLIHIPSTQQTADIFTKPLPPAQFNYLISKLSMLNIHAPLEG 2571 L+ + S +Q ADIFTK L P F+ L+SKL ++IH+ L G Sbjct: 1375 IHLLPVSSKEQIADIFTKSLHPGPFHTLLSKLGTIDIHSSLRG 1417 >gb|PNY16454.1| flavonol sulfotransferase-like protein [Trifolium pratense] Length = 1475 Score = 768 bits (1984), Expect = 0.0 Identities = 408/824 (49%), Positives = 537/824 (65%), Gaps = 27/824 (3%) Frame = +1 Query: 190 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 369 G+VERKHQHLL ARALLFQS +P FWD+ I + HIINRLP P L+N +P+++L N + Sbjct: 635 GVVERKHQHLLGTARALLFQSSLPKVFWDYAIGHSVHIINRLPTPFLNNMSPYQVLHNAL 694 Query: 370 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 549 PD S+ K+FG L Y +TL+A R K +R+ KC+ +G+ G KG+ L DL+S +V +SR V Sbjct: 695 PDISNLKVFGSLCYAATLSAHRKKLDSRSRKCLLLGFKFGVKGHILLDLKSREVFVSRDV 754 Query: 550 IFCENIFPFQE------IASKSVSPEAPLF--PISDLSSAQDQSFSHFQNINA------- 684 +F E+IFPFQ+ + S ++PL+ P D + +S S I++ Sbjct: 755 VFFEHIFPFQQQSQDVAVKSHLSHSQSPLYDDPFIDCPHSSPESPSPNDPISSPPPSNSL 814 Query: 685 ----QNDIVTTTQIISDAQNDFADAQHDLAAQN-TTVVQTDPVNSHIEHS-NPNP----P 834 N I + I++ + A H + N T T + S++ H P+P P Sbjct: 815 PHDIHNSIPNQSPILNSPPH--ASTLHTPSTNNHDTDNPTVSIPSYVSHHPTPSPAMPPP 872 Query: 835 ILRRSSRNRTNPSYL-QVYDCKIPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAF 1011 R+S+R P YL + Y C S + S +YP+ ++S LS+ H + Sbjct: 873 PTRKSNRITHPPPYLTEHYYCNAAIH-DSTKDTPSSSSKCKYPLSSYISYQHLSSAHHHY 931 Query: 1012 SAQISQITEPKSYAQASKQKEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVK 1191 + IS I+EP Y +A W+ A+ AE+ AL+K TW + LP KH IGCKWV+K+K Sbjct: 932 LSNISTISEPTCYEKAVCDPNWKAAINAELSALDKYNTWKLVPLPKHKHAIGCKWVFKLK 991 Query: 1192 YRSNGSIERYKARLVAKGYTQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVN 1371 +NG+IERYKARLVAKGYTQ EG+DY DTFSPV K+TT+R+ +A+AA + W L+QLDVN Sbjct: 992 LHANGTIERYKARLVAKGYTQTEGIDYMDTFSPVVKMTTIRMFLAIAAIQNWPLYQLDVN 1051 Query: 1372 NAFLHGDLHEEVYMLPPPGYLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQ 1551 AFLHGDL EEVYM PPPG + VCKL +SLYGLKQASRQW KLT LL+ G+ Q Sbjct: 1052 TAFLHGDLDEEVYMKPPPGLDLPSPNLVCKLQRSLYGLKQASRQWNTKLTQTLLSSGYTQ 1111 Query: 1552 SKSDSSLFTMSHDDSITILCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILG 1731 SKSD SLFT T++ VYVDD++L G D +I+ IK LD KF+IKDLGTLKY LG Sbjct: 1112 SKSDYSLFTKQASSGFTVILVYVDDLVLGGTDDKEIQKIKALLDRKFSIKDLGTLKYFLG 1171 Query: 1732 IEVARSTKGIHICQRKYALDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRL 1911 EVAR+ GI +CQRKYALDL+ +TG LG+KP TTPM + Q S + L+D S YRRL Sbjct: 1172 FEVARTQAGISLCQRKYALDLIQDTGLLGAKPCTTPMQPQLQLHSESGTILSDPSTYRRL 1231 Query: 1912 IGRMLYLTITRPELSFSIQTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLH 2091 +GR+LYLT +RPE+++S+ LSQ+LS PT H+ A + +YIK+ P +GLF+++ S L Sbjct: 1232 VGRLLYLTHSRPEIAYSVSKLSQFLSAPTNEHMLAGLHVLKYIKNCPGQGLFFAANSSLK 1291 Query: 2092 LKCFSDADWARCQETRRSISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEI 2271 LK FSD+DWA C +TRRS +G FLG+SLISWKSKKQ +SRSS+EAEYRA+A TCE Sbjct: 1292 LKGFSDSDWAACPDTRRSTTGLCFFLGNSLISWKSKKQNVVSRSSSEAEYRALAQATCEA 1351 Query: 2272 QWLTYLLQDMHIKHPQSL-MYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFK 2448 QWL YLL D HI H + +Y D+RSA I+ N HERTKHI+LDCH VREKL GL Sbjct: 1352 QWLKYLLNDFHISHSSPIVLYCDNRSALHIAANPVFHERTKHIELDCHVVREKLLAGLIH 1411 Query: 2449 LIHIPSTQQTADIFTKPLPPAQFNYLISKLSMLNIHAPLEGGCK 2580 L+ + S +Q ADI TK L P F+ L +KL M++I++ L G K Sbjct: 1412 LLPVSSKEQVADILTKSLHPGPFHTLQNKLGMIDIYSSLRGDVK 1455 >gb|KYP42564.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1427 Score = 763 bits (1971), Expect = 0.0 Identities = 399/801 (49%), Positives = 531/801 (66%), Gaps = 4/801 (0%) Frame = +1 Query: 193 IVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKMP 372 +VERKH+H+L + R LLF S VP SFW + + A H+INRLP+P+L+N +P+++L++K P Sbjct: 645 VVERKHRHILNITRTLLFHSNVPKSFWCYAVGHAIHLINRLPSPVLNNSSPYQMLYDKPP 704 Query: 373 DYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHVI 552 K+FG L Y STL R K +A+K V++G G+KG+ + DL + + +SR+V+ Sbjct: 705 TLLDLKVFGSLCYASTLVQGRSKLAPKATKGVYLGVKQGTKGFLVLDLLTRSIFVSRNVV 764 Query: 553 FCENIFPFQEIASKSVSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTTQIISDAQN 732 F E+IFPF E S ++ S Q+ + F +++ + VTT S Sbjct: 765 FYEHIFPFFEKGSTVITN----------SQQQNDACFDFLYLDSSSHPVTTIDNSSLLDI 814 Query: 733 DFADAQHDLAAQNTTVVQTDPVNSHIEHSNPNPPILRRSSRNRTNPSYLQVYDCKI---P 903 D A ++DL D S +P LR+S+R++ +P+YL+ Y C + Sbjct: 815 DSAHYENDL---------NDIDESAHPSETSSPSQLRKSTRHKCSPAYLKDYHCNLLIGV 865 Query: 904 PSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKEWQD 1083 P + +H +YP+ LS D LS + + I+ EP ++ QA K K W + Sbjct: 866 PPPEDKHI--------RYPLNTVLSYDSLSASYSRYVLSITTHVEPHTFNQAVKNKVWVE 917 Query: 1084 AMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQEEG 1263 AM+AE+ ALE N+TW + LP K IG KWVYK+KY+S+GSIERYKARLV KGYTQ +G Sbjct: 918 AMQAELDALEHNKTWTIMPLPPGKTPIGSKWVYKIKYKSDGSIERYKARLVVKGYTQIQG 977 Query: 1264 LDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYLQSN 1443 LDYFDTF+PVAK++TVR+L+A+A+ + W LHQLD+NNAFLHGDL E+VYM P G Sbjct: 978 LDYFDTFAPVAKLSTVRMLLAIASCQHWELHQLDINNAFLHGDLLEDVYMEIPQGLNIDK 1037 Query: 1444 DKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITILCVYVD 1623 VCKL KSLYGLKQASRQWF KL+S LL+ + QS+ D SLFT H T++ +YVD Sbjct: 1038 PNHVCKLNKSLYGLKQASRQWFAKLSSFLLSLHYKQSQHDHSLFTKHHGTHFTVILIYVD 1097 Query: 1624 DIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYALDLLTE 1803 D+I+AG D +I IK LD KF IKDLG L+Y LG+E+ARS GI + QRKY LDLL E Sbjct: 1098 DLIIAGTDSEEINHIKQSLDVKFKIKDLGPLRYFLGLEIARSHLGISLSQRKYTLDLLDE 1157 Query: 1804 TGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQTLSQY 1983 T FL KP TP+ + S SP D + YRRLIG++LYL TRP++S+S+Q LSQ+ Sbjct: 1158 TSFLAGKPVLTPIIKGTRLSHTTDSPYEDPAGYRRLIGKLLYLITTRPDISYSVQQLSQF 1217 Query: 1984 LSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSISGFAV 2163 LS P +H AA R+ RY+K P +GLFY + SPL LK FSD+DWA C +TRRS+SG+++ Sbjct: 1218 LSCPQQSHYQAAIRVLRYLKGNPGQGLFYPADSPLQLKAFSDSDWASCPDTRRSLSGYSI 1277 Query: 2164 FLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIKH-PQSLMYTDS 2340 FLG+SLISWK KKQ TISRSS+EAEYRA+A T CEIQWLTYLLQD + +L+Y D+ Sbjct: 1278 FLGNSLISWKCKKQSTISRSSSEAEYRALAATACEIQWLTYLLQDFSVPFTTPALLYCDN 1337 Query: 2341 RSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLPPAQFN 2520 +SA I+ N+ HERTKHI++DCH VREKLQ GLF L+ I S+ Q ADI TKPL P+ F Sbjct: 1338 QSARHIASNAVFHERTKHIEIDCHLVREKLQAGLFHLLPIASSHQLADILTKPLDPSPFQ 1397 Query: 2521 YLISKLSMLNIHAPLEGGCKD 2583 YL+SKL ++NI++P G D Sbjct: 1398 YLLSKLGVINIYSPACRGVLD 1418 >gb|OMO88956.1| Integrase, catalytic core [Corchorus capsularis] Length = 1451 Score = 763 bits (1970), Expect = 0.0 Identities = 401/828 (48%), Positives = 548/828 (66%), Gaps = 30/828 (3%) Frame = +1 Query: 193 IVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKMP 372 IVERKHQH+L VAR+L FQ+ +P+ FW C+L A +INR+P +L N TPF+ LFN+ P Sbjct: 639 IVERKHQHILNVARSLRFQASLPIDFWGECVLHAVFLINRIPTKVLGNVTPFQKLFNESP 698 Query: 373 DYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHVI 552 + K+FG LA+ S + ++KF +R+ K VF+G+ G KGYKLYDL+++K +SR V Sbjct: 699 NIDVLKVFGSLAFASNHSNIKNKFDSRSIKSVFLGFQPGVKGYKLYDLQNNKKNLSRDVT 758 Query: 553 FCENIFPFQEIA--------SKSVSPEAPLFPISDLSSAQD-----------QSFSHFQN 675 F E+I+PF E SK +S E + P SD +A + Q+ SH N Sbjct: 759 FYEHIYPFTEEYAKTDNLQFSKHISTENLVLPNSDNFAAMNDSIPSSDVSTQQNMSHLSN 818 Query: 676 I-----NAQNDIVTTTQIISDAQNDFADAQHDLAAQNTTVVQTDPVNSHI----EHSNPN 828 + ++ + V I + QN+ H+++ P NS I + SN N Sbjct: 819 VPVASSSSNTEAVLQIPIATVYQNN--PVLHEISEPILQSSNVGPANSTIPNTSQQSNTN 876 Query: 829 PPILRRSSRNRTNPSYLQVYDCKIPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKA 1008 +RRS+R + P +LQ ++C S HS ++ S D +++KHKA Sbjct: 877 YHNVRRSTRLKFRPPHLQSFECNQVQKT-SPHSLSSVFS-----------YDNITSKHKA 924 Query: 1009 FSAQISQITEPKSYAQASKQKEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKV 1188 F+ I Q TEP++Y +A K ++WQ AM E+ ALEK +TW + DLP K IGCKWVYKV Sbjct: 925 FAVAIDQDTEPRNYKEAIKSQQWQQAMNEELEALEKTKTWKLVDLPHGKQPIGCKWVYKV 984 Query: 1189 KYRSNGSIERYKARLVAKGYTQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDV 1368 K +++GSIERYKARLVAKGYTQ+EG+DY DTFSPVAKI T+R L+ +AA KGW+LHQ DV Sbjct: 985 KRKADGSIERYKARLVAKGYTQQEGVDYLDTFSPVAKIATIRTLLVVAALKGWYLHQCDV 1044 Query: 1369 NNAFLHGDLHEEVYMLPPPGYLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFI 1548 N FLHGDL EEVYM P GYL+ + K VCKL KSLYGLKQASRQW KLT L+ YGF Sbjct: 1045 NTTFLHGDLSEEVYMKLPEGYLEGSTK-VCKLVKSLYGLKQASRQWNLKLTESLIKYGFH 1103 Query: 1549 QSKSDSSLFTMSHDDSITILCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYIL 1728 QS++D +LF D + L VYVDDII+A NDI+++ IK +L + F+IKDLG LK+ L Sbjct: 1104 QSQADHTLFIKFVDKNFIALLVYVDDIIVASNDITEVINIKAYLHDLFSIKDLGELKFFL 1163 Query: 1729 GIEVARSTKGIHICQRKYALDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRR 1908 G+EVARS +GI++CQ+KY +DLL + FL KP++TP+ + + ++ +PL DASQYR+ Sbjct: 1164 GLEVARSKQGINVCQKKYTMDLLKDMNFLVCKPTSTPILPETRLTTESGTPLADASQYRQ 1223 Query: 1909 LIGRMLYLTITRPELSFSIQTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPL 2088 L+G++ YLT TR ++S+++Q L+Q+L KPT+ HL AHR+ RY+K T +GL +SS Sbjct: 1224 LVGKLQYLTTTRLDISYAVQQLAQFLDKPTSDHLQVAHRVLRYLKGTIGQGLLFSSQGIF 1283 Query: 2089 HLKCFSDADWARCQETRRSISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCE 2268 LK +SD+DW C ++R+SI+G+ +FLGDSL+SWK+KKQ T+SRSS+EAEYRA+A T CE Sbjct: 1284 QLKAYSDSDWGTCLDSRKSITGYCIFLGDSLVSWKTKKQNTVSRSSSEAEYRALATTVCE 1343 Query: 2269 IQWLTYLLQDMHIK-HPQSLMYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLF 2445 IQWL YL++D+ I P + ++ D+ SA I++N HERTKHI +DCH VR KLQEGL Sbjct: 1344 IQWLNYLMKDLQITLEPSTPLFCDNLSAIHIAKNPVFHERTKHIDIDCHVVRTKLQEGLI 1403 Query: 2446 KLIHIPSTQQTADIFTKPLPPAQFNYLISKLSMLNIHAP-LEGGCKDS 2586 KL+ + S Q AD FTK L F SKL + N++ P L G ++S Sbjct: 1404 KLLPVSSKLQLADCFTKVLSSTNFINAFSKLGIQNLYIPSLRGDVRES 1451 >gb|PNX97998.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 964 Score = 747 bits (1928), Expect = 0.0 Identities = 383/797 (48%), Positives = 519/797 (65%), Gaps = 6/797 (0%) Frame = +1 Query: 190 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 369 G+VERKH+HLL VARAL FQ+K+PLSFW C+LTAA++IN+LP PIL K+P ++L Sbjct: 187 GVVERKHRHLLDVARALRFQAKLPLSFWGECVLTAAYLINKLPTPILKYKSPHQVLLGSP 246 Query: 370 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 549 P YS ++FGCL + + Q HKF RA +F+GYP KGY++YD+ + K+ +SR V Sbjct: 247 PSYSSLRVFGCLCFAKNMNIQ-HKFDERAKPGIFVGYPFNQKGYRIYDMHTRKIYVSRDV 305 Query: 550 IFCENIFPFQEIASKSVSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTTQI---IS 720 F E +FP+ ++ + S + SD+S + F ++ + +++ + I IS Sbjct: 306 QFFETVFPYHDLQTPSFA--------SDISI--NTQFLDYEVDDTPSNLSPASSIPPGIS 355 Query: 721 DAQNDFADAQHDLAAQNTTVVQTDPVNSHIEHSNP--NPPILRRSSRNRTNPSYLQVYDC 894 N + + N + + PV +HS N P R R+RT L + C Sbjct: 356 HHDNTIVTIPNP-SVDNPSEIPAIPVEPPQQHSPTAINHPERRYPLRHRTPSVRLTDHVC 414 Query: 895 KIPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKE 1074 I + S +P++++ S+ LS H+A I + EP SY+QA K E Sbjct: 415 DI----------NNVTSQSAFPLKNYFSLSNLSTSHRALLVNIIENKEPTSYSQAIKSAE 464 Query: 1075 WQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQ 1254 W++AM EI ALE N TW+++ LP K IGCKWVYK+KY S+G++ERYKARLVAKGY Q Sbjct: 465 WREAMAKEIHALESNNTWVLSPLPNGKTAIGCKWVYKIKYHSDGTVERYKARLVAKGYNQ 524 Query: 1255 EEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYL 1434 G+DY +TF+PVAK+ TVRLL+++AA K W LHQLDVNNAFL GDL+EEVYM PPG+ Sbjct: 525 VHGIDYHETFAPVAKLVTVRLLLSIAAIKNWSLHQLDVNNAFLQGDLNEEVYMKLPPGFS 584 Query: 1435 QSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITILCV 1614 VCKL KS+YGLKQASRQWF K ++ L+ GF QS SD SLFT + + + V Sbjct: 585 HKGQPCVCKLNKSIYGLKQASRQWFSKFSTTLIQKGFHQSISDYSLFTFKSNHTTIFVLV 644 Query: 1615 YVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYALDL 1794 YVDDII+ GN+ I IK L + F+IKDLG L Y LGIEV+RS KGI +CQRKY LD+ Sbjct: 645 YVDDIIITGNNDDAISDIKKFLAQAFSIKDLGNLSYFLGIEVSRSKKGIFLCQRKYTLDI 704 Query: 1795 LTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQTL 1974 L++ G G +PS PM+ + ND SPL D + YRRLIGR+LYLT+TRP++ +++ TL Sbjct: 705 LSDAGLTGCRPSEFPMEQHLRLRPNDGSPLPDPTVYRRLIGRLLYLTVTRPDIQYAVNTL 764 Query: 1975 SQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSISG 2154 SQ++ P TTHL AA R+ RY+K + KGLF S++S L L ++D+DWA C TRRS +G Sbjct: 765 SQFMQSPCTTHLDAATRVLRYLKGSVGKGLFLSASSSLQLIGYADSDWAGCPTTRRSTTG 824 Query: 2155 FAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIKHPQSL-MY 2331 + LG + ISWK+KKQPTISRSSAEAEYR++A E+QWL +LL D+ I HP + ++ Sbjct: 825 YFTMLGSNPISWKTKKQPTISRSSAEAEYRSLATLASELQWLKFLLSDLDIAHPLPITVH 884 Query: 2332 TDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLPPA 2511 DS++A I++N HERTKHI++DCHFVREK++ GL + ++ S Q ADIFTKPL Sbjct: 885 CDSQAAIHIAENPVFHERTKHIEIDCHFVREKIKSGLLRPSYLRSFDQLADIFTKPLGGD 944 Query: 2512 QFNYLISKLSMLNIHAP 2562 + L+ KL +L I P Sbjct: 945 AYKRLLGKLGVLEISIP 961 >dbj|GAU41679.1| hypothetical protein TSUD_272630 [Trifolium subterraneum] Length = 1178 Score = 753 bits (1945), Expect = 0.0 Identities = 399/797 (50%), Positives = 524/797 (65%), Gaps = 6/797 (0%) Frame = +1 Query: 190 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 369 G+VERKHQH+L VA R P+L+ K P+E+L + Sbjct: 432 GVVERKHQHILNVA--------------------------RFSTPLLNFKCPYEMLHKEP 465 Query: 370 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 549 P H K+FGCL+Y +TL A R KF +RA K +F+GY G+KGY LYDL SH++ +SR+V Sbjct: 466 PSIVHLKVFGCLSYATTLQAHRTKFVSRARKAIFLGYKDGTKGYILYDLHSHEIFVSRNV 525 Query: 550 IFCENIFPF---QEIASKSVSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTTQIIS 720 IF E FPF + + S SP + L + D + ++ + +T + II Sbjct: 526 IFYETDFPFHLSNSVKTDSASPASHLNHTLLYDAEPDPNALPIPVMHEPD--LTLSPIIG 583 Query: 721 DAQNDFADAQHDLAAQNTTVVQTDPVNSHIEHSNPNPPILRRSSRNRTNPSYLQVYDCKI 900 + ND + P+NS PNP LR+SSR P +L+ + C+ Sbjct: 584 PSYND-----------------STPINSPESSPIPNPAPLRKSSRVIQRPRHLEGFHCET 626 Query: 901 PPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKEWQ 1080 S S+ T+ YP+ LS + + + A IS I EPK+Y QASK + W+ Sbjct: 627 LIGTHSAASSNTV-----YPLSSVLSYNNCAPNYHALCCSISAIVEPKTYTQASKFECWR 681 Query: 1081 DAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQEE 1260 +AM AE+ AL++N+TW + DLP K +GCKWVYKVKY +NGSIERYKARLVAKGYTQ E Sbjct: 682 NAMNAELLALDENKTWSVVDLPNGKVPVGCKWVYKVKYHANGSIERYKARLVAKGYTQLE 741 Query: 1261 GLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYLQS 1440 G+DYFDTFSPVAKITTVR+L+ALA+ KGW L QLDVNNAFLHGDL+E+VYM PPG+ + Sbjct: 742 GVDYFDTFSPVAKITTVRVLLALASIKGWHLEQLDVNNAFLHGDLNEDVYMSLPPGFAAT 801 Query: 1441 ND-KKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITILCVY 1617 N+ KVCKL KS+YGLKQASRQW+ KL+S L++ G+ S+SD SL+ S +S T L VY Sbjct: 802 NESNKVCKLHKSIYGLKQASRQWYSKLSSSLVSLGYTPSQSDHSLYIKSTTNSFTALLVY 861 Query: 1618 VDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYALDLL 1797 VDDI+LAGN I +I+ +KL LD+KF IKDLG L+Y L +E+ARS GI + QRKY L+LL Sbjct: 862 VDDIVLAGNSIHEIQTVKLFLDQKFKIKDLGKLRYFLVLEIARSDTGIFVNQRKYTLELL 921 Query: 1798 TETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQTLS 1977 + G LG+KPS+ P + SS D +PL D S YRRLIGR+LYLT TRP++SFS+Q LS Sbjct: 922 EDVGLLGTKPSSIPFHPTTKLSSTDGAPLDDPSSYRRLIGRLLYLTHTRPDISFSVQHLS 981 Query: 1978 QYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSISGF 2157 Q++SKP H AA I +Y+KS P KG+F S++S L + F+D+DWARC ETR+SI GF Sbjct: 982 QFVSKPLVPHYNAAMHILKYLKSDPAKGIFLSASSSLKISAFADSDWARCPETRKSIIGF 1041 Query: 2158 AVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHI--KHPQSLMY 2331 V LG SLISWKSKKQ T+SRSS EAEYRA+A TCEIQWL Y+ QD I +P + ++ Sbjct: 1042 CVLLGSSLISWKSKKQNTVSRSSTEAEYRALASLTCEIQWLQYIFQDFKIIFSNP-AYVF 1100 Query: 2332 TDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLPPA 2511 D++SA ++ N HER+KHI+LDCH +REK+Q L L+ +P+T Q AD+FTKPL Sbjct: 1101 CDNKSAIYLAHNPTFHERSKHIELDCHVIREKIQSKLIHLLPVPTTSQLADVFTKPLNHP 1160 Query: 2512 QFNYLISKLSMLNIHAP 2562 F+ +SKL + +IH+P Sbjct: 1161 AFSSFLSKLGLCSIHSP 1177 >gb|PNX93131.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 982 Score = 746 bits (1925), Expect = 0.0 Identities = 391/799 (48%), Positives = 522/799 (65%), Gaps = 10/799 (1%) Frame = +1 Query: 190 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 369 G VERKHQH+L +ARALL+QS +P FW + +L A IIN++ P+L NK+P E+LF+ + Sbjct: 193 GRVERKHQHILNIARALLYQSNLPKYFWSYAVLHATAIINKIVTPVLQNKSPHEMLFHCL 252 Query: 370 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 549 PD + K+FG LAY STL + K R KCVF+G G KG L+DL+S + +SR+V Sbjct: 253 PDLNELKVFGSLAYASTLDVNKTKLSPRGRKCVFLGQKQGVKGSILFDLDSKNIFLSRNV 312 Query: 550 IFCENIFPFQEIASK-------SVSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTT 708 ++I P+ SK +++ E P D+ DQS + + T Sbjct: 313 THFDHILPYTTNTSKLHWHYHSTINCE----PFLDI----DQSHTSTNPSDTTPSPTPPT 364 Query: 709 QIISDAQNDFADAQHDLAAQNTTVVQTDPVNSHIEHSNPNP--PILRRSSRNRTNPSYLQ 882 IISD +P S S+P P P R R + PSYL Sbjct: 365 NIISDP---------------------NPSTSSPLPSSPFPIQPANTRPDRIKHRPSYLS 403 Query: 883 VYDCKIPPSIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQAS 1062 + C S SA++ +G YPI F S+ +LS H F++ ++Q TEP++Y +A Sbjct: 404 DFVCSA-----SDDSAKSSSTGTIYPISSFHSLSQLSPSHSVFTSSLTQHTEPRTYTEAC 458 Query: 1063 KQKEWQDAMKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAK 1242 K + W AM +E+ AL + TW + DLP + IG KWVYK+K++S+G+IERYKARLVAK Sbjct: 459 KSQHWIQAMTSELEALARTGTWKIVDLPPNVKPIGSKWVYKIKHKSDGTIERYKARLVAK 518 Query: 1243 GYTQEEGLDYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPP 1422 GY Q EGLD+FDTFSPVAK+TTVR+L+A+A+ KGWFLHQLDVNNAFLHGDL E VYM P Sbjct: 519 GYNQVEGLDFFDTFSPVAKLTTVRMLLAIASIKGWFLHQLDVNNAFLHGDLQENVYMSIP 578 Query: 1423 PGYLQSNDKKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSIT 1602 G S +VCKL KSLYGLKQASR+W++KLTS L+ G+ QS SD SLFT+S D+ T Sbjct: 579 DGVQCSKPNQVCKLLKSLYGLKQASRKWYEKLTSLLVKEGYTQSSSDHSLFTISQQDNFT 638 Query: 1603 ILCVYVDDIILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKY 1782 L +YVDDIILAG + +I IK LD F IKDLG +KY LG+EVA S +GI I QRKY Sbjct: 639 ALLIYVDDIILAGTSLQEINRIKNILDTHFKIKDLGVVKYFLGLEVAHSKEGISISQRKY 698 Query: 1783 ALDLLTETGFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFS 1962 LDLL ++G LGSKP++TP+D + +D P D S YRRL+G++LYLT TRP+++F+ Sbjct: 699 CLDLLHDSGLLGSKPASTPLDPSVKLHHDDGKPFEDISMYRRLVGKLLYLTNTRPDIAFA 758 Query: 1963 IQTLSQYLSKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRR 2142 Q LSQ+L KPT TH AA R+ RY+K P GL + + + L +SDADWA C +TRR Sbjct: 759 TQQLSQFLHKPTMTHYKAACRVIRYLKHNPGMGLIFKRNADIQLIGYSDADWAGCLDTRR 818 Query: 2143 SISGFAVFLGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIK-HPQ 2319 S +G+ F+G SLISWK+KKQ TIS+SS+EAEYRA++ TCE+ WL YLL+D+HI+ Q Sbjct: 819 STTGYCFFVGSSLISWKAKKQTTISKSSSEAEYRALSSATCELVWLLYLLKDLHIECSKQ 878 Query: 2320 SLMYTDSRSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKP 2499 +++ D++SA I+ N HERTKHI++DCH VREK+QEGL +LI + + +Q AD TK Sbjct: 879 PVLFCDNQSALHIASNPVFHERTKHIEIDCHLVREKVQEGLLRLIPVSTQEQLADFLTKS 938 Query: 2500 LPPAQFNYLISKLSMLNIH 2556 LP +F+ + KL +L+I+ Sbjct: 939 LPAPKFHDFLCKLGLLDIY 957 >gb|KYP43110.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1353 Score = 755 bits (1949), Expect = 0.0 Identities = 386/794 (48%), Positives = 530/794 (66%), Gaps = 3/794 (0%) Frame = +1 Query: 190 GIVERKHQHLLQVARALLFQSKVPLSFWDHCILTAAHIINRLPAPILHNKTPFEILFNKM 369 G+VERKHQH+L +ARAL+FQS V FW++ I A H+INRLP L +P+ +L+++ Sbjct: 578 GVVERKHQHILSMARALMFQSNVSKMFWNYAIGHAVHLINRLPTRFLQQNSPYYVLYSEK 637 Query: 370 PDYSHFKIFGCLAYVSTLTAQRHKFQARASKCVFIGYPLGSKGYKLYDLESHKVLISRHV 549 PD+SH K+FGCLA+ STL+ R K + R+ KC+F+GY G+KG+ +YDL++ + ISR V Sbjct: 638 PDFSHLKVFGCLAFASTLSHNRTKLEPRSRKCMFLGYSSGTKGFIMYDLKTRETFISRDV 697 Query: 550 IFCENIFPFQEIASKSVSPEAPLFPISDLSSAQDQSFSHFQNINAQNDIVTTTQIISDAQ 729 F ENIFP Q+ S S + P+ PI AQ + + I S Sbjct: 698 QFYENIFPLQKDFSIQ-STDGPVVPI------------------AQMPLTSCDPIPSHTH 738 Query: 730 NDFADAQHDLAAQNTTVVQTDPVNSHIEHSNPNPPILRRSS-RNRTNPSYLQVYDCKIPP 906 ++ + +H+ ++T+ T+ NS + N P +RR+S R + P YLQ Y C + Sbjct: 739 DNLDETEHE--HNSSTLPMTNSSNSDQPNIEINIPEIRRTSQRVKNRPGYLQDYHCTLAA 796 Query: 907 SIQSQHSAQTIISGKQYPIQDFLSVDKLSNKHKAFSAQISQITEPKSYAQASKQKEWQDA 1086 S Q S S +YPI D+L S ++F + IS I EP+SY A W++A Sbjct: 797 SKVDQSS-----STARYPISDYLPYTSYSAVQQSFVSTISSIIEPRSYQDAINHDCWKEA 851 Query: 1087 MKAEIRALEKNQTWIMTDLPADKHCIGCKWVYKVKYRSNGSIERYKARLVAKGYTQEEGL 1266 ++AE+ AL+K +TWI+TDLP +K +GC+WV+KVKY ++GS+ERYKARLVAKG+TQ GL Sbjct: 852 IRAELDALDKQKTWILTDLPPNKRAVGCRWVFKVKYHADGSVERYKARLVAKGFTQIPGL 911 Query: 1267 DYFDTFSPVAKITTVRLLIALAAAKGWFLHQLDVNNAFLHGDLHEEVYMLPPPGYLQSND 1446 DY DTFSPV ++TT+R+ +A+AAA W +HQLD+N AFLHGDL EEVYM PPPG + S+ Sbjct: 912 DYIDTFSPVVRMTTIRVFLAIAAASNWSVHQLDINTAFLHGDLVEEVYMKPPPGLILSSP 971 Query: 1447 KKVCKLTKSLYGLKQASRQWFQKLTSCLLNYGFIQSKSDSSLFTMSHDDSITILCVYVDD 1626 KVCKL KSLYGLKQ SRQW KLT L +GF+QSKSD SLFT + + VYVDD Sbjct: 972 NKVCKLQKSLYGLKQVSRQWNIKLTETLKLFGFVQSKSDYSLFTKRTNIGFIAILVYVDD 1031 Query: 1627 IILAGNDISKIEAIKLHLDEKFTIKDLGTLKYILGIEVARSTKGIHICQRKYALDLLTET 1806 +I++G+D ++I +K LD++F+IKDLG L Y LG+E +RS +GI +CQRKYAL+LL +T Sbjct: 1032 LIISGSDETEIMKVKRLLDKQFSIKDLGQLSYFLGLEFSRSDQGISVCQRKYALELLQDT 1091 Query: 1807 GFLGSKPSTTPMDCKAQFSSNDSSPLTDASQYRRLIGRMLYLTITRPELSFSIQTLSQYL 1986 G L SKP +TPMD + + +D S YRRL+GR++YLT TRP+L+F++ LSQ++ Sbjct: 1092 GLLASKPCSTPMDHTTRLHHDPLDLYSDPSSYRRLVGRLIYLTHTRPDLAFAVGKLSQFM 1151 Query: 1987 SKPTTTHLAAAHRI*RYIKSTPVKGLFYSSTSPLHLKCFSDADWARCQETRRSISGFAVF 2166 +P H AA ++ RY+K+TP KGLF+ S+S L L ++D+DWA C ++RRSISGF F Sbjct: 1152 HQPNNAHFQAARKVLRYVKATPTKGLFFPSSSDLKLTGYTDSDWATCPDSRRSISGFCFF 1211 Query: 2167 LGDSLISWKSKKQPTISRSSAEAEYRAIALTTCEIQWLTYLLQDMHIKH--PQSLMYTDS 2340 LG++L+SWKSKKQ +SRSS+EAEYRA+AL CE QWL LL D ++ P SL + D+ Sbjct: 1212 LGNALVSWKSKKQNVVSRSSSEAEYRALALGVCEAQWLHKLLTDFQLQDLIPISL-FCDN 1270 Query: 2341 RSAYCISQNSCHHERTKHIQLDCHFVREKLQEGLFKLIHIPSTQQTADIFTKPLPPAQFN 2520 +SA I+ N HERTKH+++DCH VR+++Q G L I S+ Q ADI TKPL P F Sbjct: 1271 QSALYIAANPVFHERTKHVEIDCHTVRDQVQAGFIHLAPITSSGQLADILTKPLLPKMFQ 1330 Query: 2521 YLISKLSMLNIHAP 2562 + KL + N P Sbjct: 1331 DFVCKLGLSNFTTP 1344