BLASTX nr result
ID: Glycyrrhiza29_contig00030893
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza29_contig00030893 (2153 letters) Database: ./nr 115,041,592 sequences; 42,171,959,267 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value GAU46782.1 hypothetical protein TSUD_351810 [Trifolium subterran... 722 0.0 ABD32333.1 polyprotein-like, putative [Medicago truncatula] 692 0.0 GAU31820.1 hypothetical protein TSUD_58210 [Trifolium subterraneum] 701 0.0 GAU38852.1 hypothetical protein TSUD_154140 [Trifolium subterran... 688 0.0 GAU15708.1 hypothetical protein TSUD_307180 [Trifolium subterran... 682 0.0 AJY78065.1 putative polyprotein [Glycine max] 649 0.0 GAU47513.1 hypothetical protein TSUD_138850 [Trifolium subterran... 676 0.0 GAU31202.1 hypothetical protein TSUD_210590 [Trifolium subterran... 659 0.0 GAU51775.1 hypothetical protein TSUD_415620 [Trifolium subterran... 663 0.0 GAU50785.1 hypothetical protein TSUD_192210 [Trifolium subterran... 662 0.0 KHN05285.1 Retrovirus-related Pol polyprotein from transposon TN... 666 0.0 KYP36109.1 Retrovirus-related Pol polyprotein from transposon TN... 643 0.0 KYP61022.1 Retrovirus-related Pol polyprotein from transposon TN... 637 0.0 KYP65734.1 Retrovirus-related Pol polyprotein from transposon TN... 627 0.0 KYP34298.1 Retrovirus-related Pol polyprotein from transposon TN... 626 0.0 KYP34293.1 Retrovirus-related Pol polyprotein from transposon TN... 637 0.0 KYP55668.1 Retrovirus-related Pol polyprotein from transposon TN... 620 0.0 GAU41679.1 hypothetical protein TSUD_272630 [Trifolium subterran... 620 0.0 KYP38774.1 Retrovirus-related Pol polyprotein from transposon TN... 605 0.0 KYP42564.1 Retrovirus-related Pol polyprotein from transposon TN... 625 0.0 >GAU46782.1 hypothetical protein TSUD_351810 [Trifolium subterraneum] Length = 1512 Score = 722 bits (1864), Expect = 0.0 Identities = 355/614 (57%), Positives = 449/614 (73%), Gaps = 5/614 (0%) Frame = +1 Query: 178 SSFPNPPNTPATSQDTPIDTNHLPDXXXXXXXXXXNLRHSTRPHKPPSHLNDYICNHISS 357 SS P+ P+ P + +P+ T +LP R STR P HL+DY+CN Sbjct: 893 SSQPSIPHQPHDTH-SPLPTTNLPSPSHNSIP---QTRQSTRMSVKPKHLSDYVCNLSVD 948 Query: 358 INPPR-----YPIHAYVSYASLSQPHRAYTMSLDAVPEPSSFCEASKHTCWVEAMQQELT 522 +PP YPI ++ SY+++S R Y +S+ A EP + EAS+ CWV+AM E+ Sbjct: 949 SSPPSSPGILYPISSFHSYSNISSKFRNYALSITASVEPRDYKEASQQQCWVDAMNNEIQ 1008 Query: 523 ALEKNQTWIIVDKPANVIPIGCKWVYKVKRKADGSLERYKARLVAKGYTQTEGIDYFDTF 702 AL+ N+TW V PA++ PIGCKWVYKVK KADGS+ERYKARLVAKGY Q EG+D+FDTF Sbjct: 1009 ALQHNKTWCYVTPPAHIKPIGCKWVYKVKHKADGSVERYKARLVAKGYNQVEGLDFFDTF 1068 Query: 703 SPVAKMTTVRTLIALAAIKGWHIHQLDVNNAFLHGQLQEKVYTSIPQGVQISGSNKVCKL 882 SPVAK+TTVRTLIALA+I+ WH++Q+DVNNAFLHG LQE VY +PQGV ++VCKL Sbjct: 1069 SPVAKITTVRTLIALASIRSWHLNQMDVNNAFLHGDLQEDVYMEVPQGVNSPKPHQVCKL 1128 Query: 883 LKSLYGLKQASRKWYERLTNLLLGLGFKQAHSDHSLFTKITQSSYIALLIYIDDIVLVGD 1062 LKSLYGLKQASRKWYE+LT+LLL G+ QA SDHSLFT S + ALL+Y+DDI+L G+ Sbjct: 1129 LKSLYGLKQASRKWYEKLTSLLLKEGYTQASSDHSLFTLKHGSDFTALLVYVDDIILAGN 1188 Query: 1063 CLSQLQQIKTVLDHNXXXXXXXXXXXXXXXEAAHSPRGISLCQRQYCLNLLTDTGVLAAK 1242 L + +IK ++D+ E AHS +GIS+CQR+YCL+LL DTG+L +K Sbjct: 1189 SLQEFARIKLIMDNAFKIKDLGPLKYFLGIEVAHSKQGISICQRKYCLDLLKDTGLLGSK 1248 Query: 1243 PASTPMAPNSHLHQDDGPLFPDVECYRRLVGRLLYLTTTRPDITFAVQQLSQFLSKPTMT 1422 PA TP+ P+ LHQD P + DV YRRL+G+LLYLTTTRPDI+FA+QQLSQFLS PT T Sbjct: 1249 PAPTPLDPSIKLHQDSSPAYDDVGGYRRLIGKLLYLTTTRPDISFAIQQLSQFLSSPTTT 1308 Query: 1423 HYKAAQRILHYLKGSPGQGLFFSRSSSLDLQGYTDSNWAGCPDSRRSISGYCFFLGESLI 1602 H+ A R++ YLKGSPG+GLFF R S L L G+ D++WA C D+RRS SGYCFF+G SLI Sbjct: 1309 HFDTACRVVRYLKGSPGRGLFFPRQSPLQLLGFADADWANCADTRRSTSGYCFFIGSSLI 1368 Query: 1603 AWRSKKQTTVACSSSEAEYRALAFATCELQWIQFLLDDLGIVHSKQPVLYCDNKSALHIA 1782 +WR+KKQ TV+ SSSEAEYR+L+FA+CELQWI +LL DL I + PVLYCDN+SA+HIA Sbjct: 1369 SWRAKKQNTVSRSSSEAEYRSLSFASCELQWIVYLLKDLSIDCERPPVLYCDNQSAIHIA 1428 Query: 1783 ANPVFHERTKHLEIDCHVVRDKVNSGLMKLQPMPSAHQVADLFTKALLPQSFSHLVTKLQ 1962 +NPVFHERTKHLEIDCH+VRDKV SG+ KL P+ + Q+AD FTKAL P+ F+ ++KL Sbjct: 1429 SNPVFHERTKHLEIDCHLVRDKVQSGVFKLLPISTKAQLADFFTKALPPKVFNSFLSKLN 1488 Query: 1963 MLNIYQSLDCAGII 2004 MLNI+ C ++ Sbjct: 1489 MLNIFHVPACGRLL 1502 >ABD32333.1 polyprotein-like, putative [Medicago truncatula] Length = 635 Score = 692 bits (1787), Expect = 0.0 Identities = 346/614 (56%), Positives = 430/614 (70%), Gaps = 8/614 (1%) Frame = +1 Query: 187 PNPPNTPATSQDTP--IDTNHLPDXXXXXXXXXXNLRHSTRPHKPPSHLNDYICN----H 348 P PP+ P +S P + LR S+R K PS+L DYICN Sbjct: 2 PEPPSFPLSSTILPNLSSDQTIHSTPRSTFNPSSTLRVSSRTKKSPSYLQDYICNPSTNS 61 Query: 349 ISSINPP--RYPIHAYVSYASLSQPHRAYTMSLDAVPEPSSFCEASKHTCWVEAMQQELT 522 +SS N YP+ ++S+ LS + +SL + EP S+ EA K CW +AMQ EL Sbjct: 62 VSSANKSCILYPLSNFISHKHLSNSQHTFALSLVSHIEPKSYAEAIKSDCWKQAMQLELN 121 Query: 523 ALEKNQTWIIVDKPANVIPIGCKWVYKVKRKADGSLERYKARLVAKGYTQTEGIDYFDTF 702 AL++ TW +VD P+ V PIGCKWV ++K DGS+ERYKARLVAKGY Q EG+DYFDTF Sbjct: 122 ALDQTGTWTVVDIPSQVKPIGCKWVCRIKYNDDGSIERYKARLVAKGYNQIEGLDYFDTF 181 Query: 703 SPVAKMTTVRTLIALAAIKGWHIHQLDVNNAFLHGQLQEKVYTSIPQGVQISGSNKVCKL 882 SPVAK+T VR +IALA+I W +HQLDVNNAFLHG LQE VY IP G+ N+VCKL Sbjct: 182 SPVAKITIVRLVIALASINHWFLHQLDVNNAFLHGDLQENVYKKIPPGLSTFKPNQVCKL 241 Query: 883 LKSLYGLKQASRKWYERLTNLLLGLGFKQAHSDHSLFTKITQSSYIALLIYIDDIVLVGD 1062 KSLYGLKQASRKWYE+LT LL+ +KQA SD SLFTK+T ++ +L+Y+DDI+L G+ Sbjct: 242 SKSLYGLKQASRKWYEKLTTLLISNDYKQAASDASLFTKLTSETFTIILVYVDDIILAGN 301 Query: 1063 CLSQLQQIKTVLDHNXXXXXXXXXXXXXXXEAAHSPRGISLCQRQYCLNLLTDTGVLAAK 1242 L++ IK L E AHS GISLCQR+YCL+LL D+G+L +K Sbjct: 302 SLTEFHIIKNALHQAFKIKDLGILKYFLGLEVAHSHSGISLCQRKYCLDLLNDSGLLGSK 361 Query: 1243 PASTPMAPNSHLHQDDGPLFPDVECYRRLVGRLLYLTTTRPDITFAVQQLSQFLSKPTMT 1422 P STP P+ LH D LFPD+ YRRL+GRL+YL TTRPDITF QQLSQFLSKPT T Sbjct: 362 PVSTPSDPSIKLHNDTSLLFPDISAYRRLIGRLIYLNTTRPDITFITQQLSQFLSKPTQT 421 Query: 1423 HYKAAQRILHYLKGSPGQGLFFSRSSSLDLQGYTDSNWAGCPDSRRSISGYCFFLGESLI 1602 HY A R+L YLKGSPG+G+FF R+S L +QGYTD++W GC D+RRSISG+CFFLG+SLI Sbjct: 422 HYHATLRVLKYLKGSPGKGIFFPRASGLHIQGYTDADWVGCKDTRRSISGHCFFLGQSLI 481 Query: 1603 AWRSKKQTTVACSSSEAEYRALAFATCELQWIQFLLDDLGIVHSKQPVLYCDNKSALHIA 1782 WR+KKQ TV+ SSSEAEYRA+A ATCE+QW+ +LL DL + + PVLYCDN+SA+HIA Sbjct: 482 CWRTKKQPTVSKSSSEAEYRAMASATCEMQWLLYLLRDLQVQCVQLPVLYCDNQSAMHIA 541 Query: 1783 ANPVFHERTKHLEIDCHVVRDKVNSGLMKLQPMPSAHQVADLFTKALLPQSFSHLVTKLQ 1962 +NPVFHERTKHLEIDCH+VR+K+ +G+ KL P+ + Q+ D FTKAL Q FS L++KL Sbjct: 542 SNPVFHERTKHLEIDCHIVREKLQAGIFKLLPVTTHDQIGDSFTKALYLQPFSLLLSKLG 601 Query: 1963 MLNIYQSLDCAGII 2004 ML+IY C GI+ Sbjct: 602 MLDIYHPPTCGGIL 615 >GAU31820.1 hypothetical protein TSUD_58210 [Trifolium subterraneum] Length = 1409 Score = 701 bits (1810), Expect = 0.0 Identities = 360/683 (52%), Positives = 464/683 (67%), Gaps = 13/683 (1%) Frame = +1 Query: 1 RNVVFYENIFPYKCNSSRGSVAYLKRWDTGXXXXXXXXXXXXXXXXXXXXXXLSLNGQPS 180 R+V F+E+I PY NSS + WD Sbjct: 753 RHVSFHEHILPYTSNSSSET----HNWDY------------------------------- 777 Query: 181 SFPNPPNT---PATSQDTPIDTNHLP----DXXXXXXXXXXNLRHSTRPHKPPSHLNDYI 339 FP+ P P+++ DTP+ N +P R STR PS+L DY+ Sbjct: 778 -FPSSPTNNILPSSTSDTPLHIN-IPTAPSSAPCHTPISHPTNRISTRSKTKPSYLQDYV 835 Query: 340 CNHISSINPPR------YPIHAYVSYASLSQPHRAYTMSLDAVPEPSSFCEASKHTCWVE 501 C H S+ + P YPI Y+SY++LS H A+ +SL + EP ++ EASK CW + Sbjct: 836 C-HASTASLPSCSQDKLYPISDYMSYSNLSSNHCAFALSLMSHSEPKTYDEASKFECWNQ 894 Query: 502 AMQQELTALEKNQTWIIVDKPANVIPIGCKWVYKVKRKADGSLERYKARLVAKGYTQTEG 681 AM+ EL ALEK TW++VD P V PIGC+WVY++K ADGS+ERYKARL+AKGY Q EG Sbjct: 895 AMRVELEALEKTGTWLLVDLPPTVKPIGCRWVYRIKYNADGSIERYKARLIAKGYNQIEG 954 Query: 682 IDYFDTFSPVAKMTTVRTLIALAAIKGWHIHQLDVNNAFLHGQLQEKVYTSIPQGVQISG 861 +DYFDT+SPVAK+TTVRT+IALA+I W IHQLDVNNAFLHG+LQE VY +P GV S Sbjct: 955 LDYFDTYSPVAKLTTVRTVIALASINNWIIHQLDVNNAFLHGELQEDVYMIVPPGVTCSK 1014 Query: 862 SNKVCKLLKSLYGLKQASRKWYERLTNLLLGLGFKQAHSDHSLFTKITQSSYIALLIYID 1041 N+VCKL+KSLYGLKQASR+WYERLT L + QA SDHSLF K S+ LL+Y+D Sbjct: 1015 PNQVCKLVKSLYGLKQASRRWYERLTAFLQQHHYIQATSDHSLFLKKNGSTITILLVYVD 1074 Query: 1042 DIVLVGDCLSQLQQIKTVLDHNXXXXXXXXXXXXXXXEAAHSPRGISLCQRQYCLNLLTD 1221 D+++ G+ ++ +Q IK L + E AHS GISLCQR+YCL+LL D Sbjct: 1075 DVIVAGNSMTDIQAIKNALHESFKIKDLGILKYFLGIEVAHSKEGISLCQRKYCLDLLDD 1134 Query: 1222 TGVLAAKPASTPMAPNSHLHQDDGPLFPDVECYRRLVGRLLYLTTTRPDITFAVQQLSQF 1401 +G++ +KPASTP P++ LHQD +PD+ YRRLVGRLLYL TRPDITF+ QQLSQF Sbjct: 1135 SGMIESKPASTPSDPSTKLHQDSSAPYPDIPSYRRLVGRLLYLNATRPDITFSTQQLSQF 1194 Query: 1402 LSKPTMTHYKAAQRILHYLKGSPGQGLFFSRSSSLDLQGYTDSNWAGCPDSRRSISGYCF 1581 LSKPTM H+KAA R+L YLK PG+G+ R S + LQG++D++WAGC D+RRSISG CF Sbjct: 1195 LSKPTMAHFKAATRVLRYLKTCPGRGIMMPRDSIIHLQGFSDADWAGCIDTRRSISGQCF 1254 Query: 1582 FLGESLIAWRSKKQTTVACSSSEAEYRALAFATCELQWIQFLLDDLGIVHSKQPVLYCDN 1761 LG+SLI+WR+KKQ TV+ SSSEAEY+ALA ATCE+QW+ +LL+DL + K PVL+CDN Sbjct: 1255 LLGKSLISWRTKKQLTVSRSSSEAEYKALAAATCEMQWLLYLLNDLQVQSIKLPVLFCDN 1314 Query: 1762 KSALHIAANPVFHERTKHLEIDCHVVRDKVNSGLMKLQPMPSAHQVADLFTKALLPQSFS 1941 +SALHIAANPVFH+RTKHLEIDCH+VR+++NSG+MK P+ + +Q+AD FTK LLPQ F Sbjct: 1315 QSALHIAANPVFHKRTKHLEIDCHIVRERLNSGMMKFLPVSTKNQLADFFTKPLLPQPFH 1374 Query: 1942 HLVTKLQMLNIYQSLDCAGIIQY 2010 L++KL+M +IY+ C G++ + Sbjct: 1375 ILLSKLEMKDIYKPPTCGGLLNH 1397 >GAU38852.1 hypothetical protein TSUD_154140 [Trifolium subterraneum] Length = 1494 Score = 688 bits (1775), Expect = 0.0 Identities = 349/657 (53%), Positives = 452/657 (68%), Gaps = 4/657 (0%) Frame = +1 Query: 1 RNVVFYENIFPYKCNSSRGSVAYLKRWDTGXXXXXXXXXXXXXXXXXXXXXXLSLNGQPS 180 RNVVF E IFPY ++S+ + Y++ LS N Sbjct: 793 RNVVFEETIFPYPVSNSKTAWEYVEPTPN----THPSTEPTKTRNSQETTDDLSTNHDHD 848 Query: 181 SFPNPPNTPA----TSQDTPIDTNHLPDXXXXXXXXXXNLRHSTRPHKPPSHLNDYICNH 348 S P + P+ TS TN P R STR + P HL DY CN Sbjct: 849 SIDLPLDQPSDRTTTSTHDQKFTNSSP-------------RRSTRIKQTPLHLMDYQCNA 895 Query: 349 ISSINPPRYPIHAYVSYASLSQPHRAYTMSLDAVPEPSSFCEASKHTCWVEAMQQELTAL 528 I+ P YPI +++S+ +LS+ + + +SL A EP+++ EASKH CWV+AM+ ELTAL Sbjct: 896 ITHKTP--YPISSFISHNNLSKSYSTFCLSLLADTEPTTYAEASKHECWVKAMKNELTAL 953 Query: 529 EKNQTWIIVDKPANVIPIGCKWVYKVKRKADGSLERYKARLVAKGYTQTEGIDYFDTFSP 708 N+TWII D P V PIG KWVYK+KRKADG+++RYKARLVAKGY Q EG+D+ TFSP Sbjct: 954 ANNKTWIITDLPEGVKPIGSKWVYKIKRKADGTIDRYKARLVAKGYNQIEGVDFSQTFSP 1013 Query: 709 VAKMTTVRTLIALAAIKGWHIHQLDVNNAFLHGQLQEKVYTSIPQGVQISGSNKVCKLLK 888 VAKMTT+RT++A+A+IK WHIHQLDV+NAFLHG L E VY ++PQ + + S +VCKL K Sbjct: 1014 VAKMTTIRTVLAIASIKNWHIHQLDVDNAFLHGDLDENVYMTVPQRFEGATSRQVCKLQK 1073 Query: 889 SLYGLKQASRKWYERLTNLLLGLGFKQAHSDHSLFTKITQSSYIALLIYIDDIVLVGDCL 1068 LYGL+QASR+WYE+L++ L+ +G+K SD +LFTK T +S+ LL+Y+DDIVL G+CL Sbjct: 1074 FLYGLRQASRQWYEKLSHFLITIGYKHMPSDPTLFTKTTSASFTTLLVYVDDIVLSGNCL 1133 Query: 1069 SQLQQIKTVLDHNXXXXXXXXXXXXXXXEAAHSPRGISLCQRQYCLNLLTDTGVLAAKPA 1248 ++++ K+ L E AHS +GI+LCQR+YCL+LL DTG L KP+ Sbjct: 1134 AEIESTKSQLHQAFGIKDIGVLKFFLGLEVAHSQQGITLCQRKYCLDLLNDTGNLGCKPS 1193 Query: 1249 STPMAPNSHLHQDDGPLFPDVECYRRLVGRLLYLTTTRPDITFAVQQLSQFLSKPTMTHY 1428 S PM P++ LH DD ++ YR LVG+LLYLT+TRPDI F VQQLSQFL PT H+ Sbjct: 1194 SIPMDPSNRLHHDDSEPHSNITEYRALVGKLLYLTSTRPDIAFPVQQLSQFLDAPTTAHF 1253 Query: 1429 KAAQRILHYLKGSPGQGLFFSRSSSLDLQGYTDSNWAGCPDSRRSISGYCFFLGESLIAW 1608 KAA ++L YLKG+PG GLFF R++SL L G++D++W GCPDSRRSI+GYCFF+G+SLI W Sbjct: 1254 KAAHKVLRYLKGNPGTGLFFPRNASLQLMGFSDADWGGCPDSRRSITGYCFFIGQSLICW 1313 Query: 1609 RSKKQTTVACSSSEAEYRALAFATCELQWIQFLLDDLGIVHSKQPVLYCDNKSALHIAAN 1788 +SKKQ TV+ SSSEAEYRALA ATCELQW+ +LL DL + K VLYCD++SALHIA+N Sbjct: 1314 KSKKQLTVSKSSSEAEYRALASATCELQWLSYLLKDLQVHIDKANVLYCDSQSALHIASN 1373 Query: 1789 PVFHERTKHLEIDCHVVRDKVNSGLMKLQPMPSAHQVADLFTKALLPQSFSHLVTKL 1959 PVFHERTKHL+IDCH+VR+K+ +GLMKL P+ +Q AD+ TKAL P +F L +KL Sbjct: 1374 PVFHERTKHLDIDCHIVREKLQAGLMKLLPISGYNQTADILTKALHPANFHRLFSKL 1430 >GAU15708.1 hypothetical protein TSUD_307180 [Trifolium subterraneum] Length = 1433 Score = 682 bits (1759), Expect = 0.0 Identities = 336/565 (59%), Positives = 416/565 (73%) Frame = +1 Query: 286 LRHSTRPHKPPSHLNDYICNHISSINPPRYPIHAYVSYASLSQPHRAYTMSLDAVPEPSS 465 LR STR K P HL DY CN+I ++ +YPI Y S+ LS +YT+SL EPSS Sbjct: 859 LRKSTRITKLPPHLLDYECNNI--VHSTKYPISKYTSHNHLSSKQLSYTLSLLTETEPSS 916 Query: 466 FCEASKHTCWVEAMQQELTALEKNQTWIIVDKPANVIPIGCKWVYKVKRKADGSLERYKA 645 + EA KH WV+AM EL ALE+N+TW IV P PIG KWVYKVKRKADGS+ERYKA Sbjct: 917 YSEACKHDHWVKAMNAELQALEQNKTWSIVSLPVGAKPIGSKWVYKVKRKADGSIERYKA 976 Query: 646 RLVAKGYTQTEGIDYFDTFSPVAKMTTVRTLIALAAIKGWHIHQLDVNNAFLHGQLQEKV 825 RLVAKGY Q EGIDYF+TFSPVAKMTT+R ++A+A+IK W +HQLDVNNAFLHG+L E V Sbjct: 977 RLVAKGYNQVEGIDYFETFSPVAKMTTIRVILAIASIKNWFVHQLDVNNAFLHGELCEDV 1036 Query: 826 YTSIPQGVQISGSNKVCKLLKSLYGLKQASRKWYERLTNLLLGLGFKQAHSDHSLFTKIT 1005 Y IPQG+ ++KVCKL KSLYGLKQASRKWYE+L+ L+ F QA SD +LF K T Sbjct: 1037 YMKIPQGLDGFSADKVCKLTKSLYGLKQASRKWYEKLSQFLISHQFTQAPSDPTLFVKKT 1096 Query: 1006 QSSYIALLIYIDDIVLVGDCLSQLQQIKTVLDHNXXXXXXXXXXXXXXXEAAHSPRGISL 1185 ++ ALL+Y+DDIVL GD ++++ IK L+H E AHS +GI+L Sbjct: 1097 SENFTALLVYVDDIVLTGDSMNEITNIKNDLNHTFGIKDLGVLKFFLGLEVAHSLKGITL 1156 Query: 1186 CQRQYCLNLLTDTGVLAAKPASTPMAPNSHLHQDDGPLFPDVECYRRLVGRLLYLTTTRP 1365 QRQYCL+LL +TG L KP+S PM P+ LH DD + D+ YR LVG+LLYLT TRP Sbjct: 1157 SQRQYCLDLLAETGDLGCKPSSIPMDPSLKLHHDDSTPYNDITGYRTLVGKLLYLTNTRP 1216 Query: 1366 DITFAVQQLSQFLSKPTMTHYKAAQRILHYLKGSPGQGLFFSRSSSLDLQGYTDSNWAGC 1545 DI F VQQL QFL PT+ HYKAA ++L YLKG PG GL+F RSS L G+TD++W GC Sbjct: 1217 DIAFPVQQLCQFLDCPTILHYKAAHKVLRYLKGCPGTGLYFPRSSDAHLTGFTDADWGGC 1276 Query: 1546 PDSRRSISGYCFFLGESLIAWRSKKQTTVACSSSEAEYRALAFATCELQWIQFLLDDLGI 1725 D+RRSI+GYCFFLG SLI W+SKKQ T++ SSSEAEYRALA TCELQW+ +L DL + Sbjct: 1277 VDTRRSITGYCFFLGSSLICWKSKKQQTISRSSSEAEYRALASGTCELQWLTYLFRDLQV 1336 Query: 1726 VHSKQPVLYCDNKSALHIAANPVFHERTKHLEIDCHVVRDKVNSGLMKLQPMPSAHQVAD 1905 +++P+LYCD++SA+HIA+NPVFHERTKHL+IDCHVVR+++ SGLMKL P+ Q+AD Sbjct: 1337 TLTQKPLLYCDSQSAIHIASNPVFHERTKHLDIDCHVVRERLQSGLMKLLPVSGFLQLAD 1396 Query: 1906 LFTKALLPQSFSHLVTKLQMLNIYQ 1980 + TKAL P +F L+TKL +L+IY+ Sbjct: 1397 IMTKALHPANFHRLLTKLGLLDIYR 1421 >AJY78065.1 putative polyprotein [Glycine max] Length = 523 Score = 649 bits (1673), Expect = 0.0 Identities = 315/515 (61%), Positives = 389/515 (75%) Frame = +1 Query: 433 MSLDAVPEPSSFCEASKHTCWVEAMQQELTALEKNQTWIIVDKPANVIPIGCKWVYKVKR 612 MS+ EP S+ EASKH WV AM++EL AL KN TW IV+ P + PIGCKWVYKVK Sbjct: 1 MSITHCTEPQSYEEASKHEHWVTAMKEELNALAKNCTWKIVELPPHTKPIGCKWVYKVKH 60 Query: 613 KADGSLERYKARLVAKGYTQTEGIDYFDTFSPVAKMTTVRTLIALAAIKGWHIHQLDVNN 792 KA+G +ERYKARLVAKGY Q EGIDYF+TFSPVAK+TTVRTL+A+AAIK WH+HQLDVNN Sbjct: 61 KANGQIERYKARLVAKGYNQVEGIDYFETFSPVAKITTVRTLLAVAAIKNWHLHQLDVNN 120 Query: 793 AFLHGQLQEKVYTSIPQGVQISGSNKVCKLLKSLYGLKQASRKWYERLTNLLLGLGFKQA 972 AFLHG LQE VY IP GV + N VCKL KSLYGLKQASRKWYE+LTNLLL G+ Q+ Sbjct: 121 AFLHGDLQEDVYMKIPDGVTCAKPNSVCKLQKSLYGLKQASRKWYEKLTNLLLKEGYIQS 180 Query: 973 HSDHSLFTKITQSSYIALLIYIDDIVLVGDCLSQLQQIKTVLDHNXXXXXXXXXXXXXXX 1152 SD+SLFT +++ ALL+Y+DDI+L GD + + +IK VLD Sbjct: 181 ISDYSLFTLTKGNTFTALLVYVDDIILAGDSIDEFDRIKNVLDLAFKIKNLGKLKYFLGL 240 Query: 1153 EAAHSPRGISLCQRQYCLNLLTDTGVLAAKPASTPMAPNSHLHQDDGPLFPDVECYRRLV 1332 E AHS GI++ QR+YCL+LL D+G+L KPASTP+ + LH G + D+ YRR+V Sbjct: 241 EVAHSRLGITISQRKYCLDLLKDSGLLGCKPASTPLDTSIKLHSAAGTPYADISGYRRIV 300 Query: 1333 GRLLYLTTTRPDITFAVQQLSQFLSKPTMTHYKAAQRILHYLKGSPGQGLFFSRSSSLDL 1512 G+LLYL TTRPDI FA QQLSQF+ PT H+ AA R+L YLK +PGQG+FFSR+S + L Sbjct: 301 GKLLYLNTTRPDIAFATQQLSQFMQAPTNVHFNAACRVLRYLKNNPGQGIFFSRTSEMQL 360 Query: 1513 QGYTDSNWAGCPDSRRSISGYCFFLGESLIAWRSKKQTTVACSSSEAEYRALAFATCELQ 1692 GY+D++WAGC DSR+SISGYCFF+G+SL++WR+KKQ TV+ SSSEAEYRAL+ A CELQ Sbjct: 361 IGYSDADWAGCMDSRKSISGYCFFIGKSLVSWRAKKQATVSRSSSEAEYRALSSAACELQ 420 Query: 1693 WIQFLLDDLGIVHSKQPVLYCDNKSALHIAANPVFHERTKHLEIDCHVVRDKVNSGLMKL 1872 W+ +L DL + ++ P LYCDN+SA+HIA+NPVFHERTKHLEIDCH+VR+K+ G +KL Sbjct: 421 WLLYLFADLRVQLTRTPTLYCDNQSAVHIASNPVFHERTKHLEIDCHLVREKLLKGTLKL 480 Query: 1873 QPMPSAHQVADLFTKALLPQSFSHLVTKLQMLNIY 1977 P+ ++ QVAD TKAL P F V+KL M+NIY Sbjct: 481 LPVSTSDQVADFLTKALAPPKFHDFVSKLSMINIY 515 >GAU47513.1 hypothetical protein TSUD_138850 [Trifolium subterraneum] Length = 1469 Score = 676 bits (1745), Expect = 0.0 Identities = 336/612 (54%), Positives = 439/612 (71%), Gaps = 7/612 (1%) Frame = +1 Query: 166 NGQPSSFPNPPNTPATSQDTPIDTNHLPDXXXXXXXXXXNLRH-STRPHKPPSHLNDYIC 342 N Q ++ + N PA + +NHL N H STR PS+L+DY+C Sbjct: 857 NTQCTNHNDALNQPAQPLSPLLISNHLDQTPALPTSSDSNPTHRSTRLKHAPSYLSDYVC 916 Query: 343 NHIS------SINPPRYPIHAYVSYASLSQPHRAYTMSLDAVPEPSSFCEASKHTCWVEA 504 N + S + YPI Y S +LS H A+T+SL EP S+ EA K CW +A Sbjct: 917 NQSTTSPGTQSSSGSLYPISDYHSLKNLSSIHHAFTVSLTHNTEPKSYLEACKFECWQKA 976 Query: 505 MQQELTALEKNQTWIIVDKPANVIPIGCKWVYKVKRKADGSLERYKARLVAKGYTQTEGI 684 M EL AL K TW+IVD P+++ PIG KWVYKVK KADG++ERYKARLVAKGY Q EG+ Sbjct: 977 MDDELEALTKTGTWVIVDLPSHIKPIGSKWVYKVKYKADGTIERYKARLVAKGYNQVEGL 1036 Query: 685 DYFDTFSPVAKMTTVRTLIALAAIKGWHIHQLDVNNAFLHGQLQEKVYTSIPQGVQISGS 864 D+FDTFSPVAK+TTVR L+A+A+IKGW +HQLDVNNAFLHG+LQE VY +IP+GV Sbjct: 1037 DFFDTFSPVAKLTTVRLLLAIASIKGWFLHQLDVNNAFLHGELQEDVYMAIPEGVTTLKQ 1096 Query: 865 NKVCKLLKSLYGLKQASRKWYERLTNLLLGLGFKQAHSDHSLFTKITQSSYIALLIYIDD 1044 N+VCKL KSLYGLKQASRKWYE+LT+LL+ G+ Q+ +D+SLFT S + ALLIY+DD Sbjct: 1097 NQVCKLQKSLYGLKQASRKWYEKLTSLLISEGYSQSTADYSLFTLHNASHFTALLIYVDD 1156 Query: 1045 IVLVGDCLSQLQQIKTVLDHNXXXXXXXXXXXXXXXEAAHSPRGISLCQRQYCLNLLTDT 1224 I+L G L+++ +IK +LD + E A S GI++ QR+YCL+LL D+ Sbjct: 1157 IILAGTDLNEITRIKKILDTHFKIKDLGVLKYFLGLEVAQSREGITISQRKYCLDLLKDS 1216 Query: 1225 GVLAAKPASTPMAPNSHLHQDDGPLFPDVECYRRLVGRLLYLTTTRPDITFAVQQLSQFL 1404 G+L +KPASTP+ P LH DD + +V YRRL+G+LLYL TRPDI+FA QQLSQFL Sbjct: 1217 GLLGSKPASTPLDPAVKLHIDDSKPYENVSLYRRLIGKLLYLCNTRPDISFATQQLSQFL 1276 Query: 1405 SKPTMTHYKAAQRILHYLKGSPGQGLFFSRSSSLDLQGYTDSNWAGCPDSRRSISGYCFF 1584 KP++ HY AA R++ YLK +PG+GL F R S + L G++DS+WAGC D+R+S SGYCFF Sbjct: 1277 HKPSVNHYHAACRVIRYLKHNPGRGLLFPRKSDIQLLGFSDSDWAGCLDTRKSTSGYCFF 1336 Query: 1585 LGESLIAWRSKKQTTVACSSSEAEYRALAFATCELQWIQFLLDDLGIVHSKQPVLYCDNK 1764 LG SLI+W++KKQTT++ SSSEAEYRAL+ ATCEL W+ +L++DL + SK PV+YCD++ Sbjct: 1337 LGSSLISWKAKKQTTISRSSSEAEYRALSSATCELIWLTYLMNDLKVQCSKLPVIYCDSQ 1396 Query: 1765 SALHIAANPVFHERTKHLEIDCHVVRDKVNSGLMKLQPMPSAHQVADLFTKALLPQSFSH 1944 SALHIA+NPVFHERTKHLEIDCH+VR+KV G+++L P+P+ Q+AD TK+L F+ Sbjct: 1397 SALHIASNPVFHERTKHLEIDCHLVREKVQQGILRLLPIPTEEQLADCLTKSLAAPKFNE 1456 Query: 1945 LVTKLQMLNIYQ 1980 L++KL +++IYQ Sbjct: 1457 LISKLGLIDIYQ 1468 >GAU31202.1 hypothetical protein TSUD_210590 [Trifolium subterraneum] Length = 1059 Score = 659 bits (1699), Expect = 0.0 Identities = 332/616 (53%), Positives = 427/616 (69%), Gaps = 15/616 (2%) Frame = +1 Query: 178 SSFPN--PPNTPATSQ--------DTPIDTNHLPDXXXXXXXXXXNLRHSTRPHKPPSHL 327 +S PN PP +P T+ DTPI T + P R PS+L Sbjct: 453 TSIPNTIPPLSPTTTDTIQNVSNNDTPIPTYNKP----------------IRSRNAPSYL 496 Query: 328 NDYICNH-----ISSINPPRYPIHAYVSYASLSQPHRAYTMSLDAVPEPSSFCEASKHTC 492 +DY+C H ++SI YP+ Y + +LS H AYT++L EP S+ EASK C Sbjct: 497 SDYVCYHSDTSSLASITGTPYPLLGYHTLNNLSLSHHAYTVALTHNTEPKSYLEASKFEC 556 Query: 493 WVEAMQQELTALEKNQTWIIVDKPANVIPIGCKWVYKVKRKADGSLERYKARLVAKGYTQ 672 W +AM EL AL K TW IVD P + PIG KWVYKVK KADGS+ER+KARLVAKGY Q Sbjct: 557 WQKAMNDELDALAKTGTWKIVDLPPLIKPIGSKWVYKVKYKADGSIERHKARLVAKGYNQ 616 Query: 673 TEGIDYFDTFSPVAKMTTVRTLIALAAIKGWHIHQLDVNNAFLHGQLQEKVYTSIPQGVQ 852 EG+D+FDTFSPVAK+T VR L+A AA + WH+HQLDVNNAFLHG L+E VY +IP GV Sbjct: 617 VEGLDFFDTFSPVAKLTIVRLLLATAATQNWHLHQLDVNNAFLHGDLEEDVYMAIPDGVV 676 Query: 853 ISGSNKVCKLLKSLYGLKQASRKWYERLTNLLLGLGFKQAHSDHSLFTKITQSSYIALLI 1032 N+VCKLLKSLYGLKQA+RKWYE+LT+LLL G+ Q+ +D+SLFT + + + LL+ Sbjct: 677 SPKPNQVCKLLKSLYGLKQANRKWYEKLTSLLLQEGYTQSTADYSLFTLQSATDFTTLLV 736 Query: 1033 YIDDIVLVGDCLSQLQQIKTVLDHNXXXXXXXXXXXXXXXEAAHSPRGISLCQRQYCLNL 1212 Y+DD++L G L + +IK++LD E A S GI++ QR+YCL+L Sbjct: 737 YVDDVILAGTSLLEFTRIKSILDARFKIKDLGILKYFLGLEVAQSREGITVSQRKYCLDL 796 Query: 1213 LTDTGVLAAKPASTPMAPNSHLHQDDGPLFPDVECYRRLVGRLLYLTTTRPDITFAVQQL 1392 L D+G+L +KP TP+ P LH D G L+ D+ YRRL+GRLLYLT TRPDI+FA+QQL Sbjct: 797 LKDSGLLGSKPVVTPLDPAIKLHNDAGKLYEDISAYRRLIGRLLYLTNTRPDISFAIQQL 856 Query: 1393 SQFLSKPTMTHYKAAQRILHYLKGSPGQGLFFSRSSSLDLQGYTDSNWAGCPDSRRSISG 1572 SQFLSKPTM HY AA R++ YLK +PG+GLFF R L L G+TD++WA C D+RRS +G Sbjct: 857 SQFLSKPTMVHYNAACRVVRYLKHNPGRGLFFPRHFDLQLLGFTDADWARCIDTRRSTTG 916 Query: 1573 YCFFLGESLIAWRSKKQTTVACSSSEAEYRALAFATCELQWIQFLLDDLGIVHSKQPVLY 1752 YCFFLG SL++W++KKQ TV+ SSSEAEYRAL+ ATCEL W+ FL+ DL I SK PV+Y Sbjct: 917 YCFFLGSSLVSWKAKKQLTVSRSSSEAEYRALSTATCELIWLTFLMKDLNIHCSKPPVIY 976 Query: 1753 CDNKSALHIAANPVFHERTKHLEIDCHVVRDKVNSGLMKLQPMPSAHQVADLFTKALLPQ 1932 CD++SA+HIA+NPVFHERTKHLEI+CH VR+K+ GL++L P+ + Q+AD TK L Sbjct: 977 CDSQSAMHIASNPVFHERTKHLEIECHFVREKLQQGLLRLLPISTEDQLADCLTKPLAAP 1036 Query: 1933 SFSHLVTKLQMLNIYQ 1980 F+ ++KL +L+IY+ Sbjct: 1037 KFNSFISKLGLLDIYE 1052 >GAU51775.1 hypothetical protein TSUD_415620 [Trifolium subterraneum] Length = 1234 Score = 663 bits (1711), Expect = 0.0 Identities = 339/606 (55%), Positives = 425/606 (70%), Gaps = 6/606 (0%) Frame = +1 Query: 178 SSFPNPPNTPATSQDTP-IDTNHLPDXXXXXXXXXXNLRHSTRPHKPPSHLNDYICNHIS 354 S PN P P +P + TN P R PS+L DY+CN + Sbjct: 632 SHTPNSPIIPPNQCSSPDVTTNISPPAM-----------RPIRDKHAPSYLADYVCNQAT 680 Query: 355 -----SINPPRYPIHAYVSYASLSQPHRAYTMSLDAVPEPSSFCEASKHTCWVEAMQQEL 519 S YPI Y S A+LSQ H AYT+S+ EPSS+ EA K CW +AM EL Sbjct: 681 TPANLSSQGICYPISEYHSLANLSQTHHAYTLSVTHTTEPSSYSEACKFDCWQKAMNAEL 740 Query: 520 TALEKNQTWIIVDKPANVIPIGCKWVYKVKRKADGSLERYKARLVAKGYTQTEGIDYFDT 699 AL K TW+IVD P PIG KWVYKVK KADG++ER+KARLVAKGY Q EG+D+FDT Sbjct: 741 EALTKTGTWVIVDLPPLAKPIGSKWVYKVKYKADGTIERHKARLVAKGYNQVEGLDFFDT 800 Query: 700 FSPVAKMTTVRTLIALAAIKGWHIHQLDVNNAFLHGQLQEKVYTSIPQGVQISGSNKVCK 879 FSPVAK+TTVRTL+A+A+IK WH+HQLDVNNAFLHG L+E VY +IP GV + +VCK Sbjct: 801 FSPVAKLTTVRTLLAIASIKQWHLHQLDVNNAFLHGDLEEDVYMTIPDGVPHTKPGQVCK 860 Query: 880 LLKSLYGLKQASRKWYERLTNLLLGLGFKQAHSDHSLFTKITQSSYIALLIYIDDIVLVG 1059 LLKSLYGLKQASR WYE+LT+LL+ G+ Q+ +D+SLFT S++ ALL+Y+DDI+L G Sbjct: 861 LLKSLYGLKQASRMWYEKLTSLLIHEGYHQSTADYSLFTLQQGSAFTALLVYVDDIILAG 920 Query: 1060 DCLSQLQQIKTVLDHNXXXXXXXXXXXXXXXEAAHSPRGISLCQRQYCLNLLTDTGVLAA 1239 ++ +IK +LD E A S GI++ QR+YCL+LL D+G+L + Sbjct: 921 TSTTEFDRIKGILDAQFKIKDLGTLKYFLGLEVAQSREGINVSQRKYCLDLLKDSGLLGS 980 Query: 1240 KPASTPMAPNSHLHQDDGPLFPDVECYRRLVGRLLYLTTTRPDITFAVQQLSQFLSKPTM 1419 KPA+TP+ P LHQD+G + DV YRRL+G+LLYLT TRPDI +A QQLSQFL KPT+ Sbjct: 981 KPATTPLDPAIKLHQDEGKPYADVSQYRRLIGKLLYLTNTRPDIAYATQQLSQFLHKPTV 1040 Query: 1420 THYKAAQRILHYLKGSPGQGLFFSRSSSLDLQGYTDSNWAGCPDSRRSISGYCFFLGESL 1599 THY AA RI+ YLK SPG+GL FSR + L G++D++WAGC D+RRS SG+CFF+G SL Sbjct: 1041 THYNAACRIIRYLKHSPGKGLTFSRHCDIQLLGFSDADWAGCIDTRRSTSGHCFFIGTSL 1100 Query: 1600 IAWRSKKQTTVACSSSEAEYRALAFATCELQWIQFLLDDLGIVHSKQPVLYCDNKSALHI 1779 I+W++KKQTTV+ SSSEAEYRAL+ ATCEL W+ FLL DL I SK PVLYCD++SA+HI Sbjct: 1101 ISWKAKKQTTVSRSSSEAEYRALSSATCELIWLLFLLKDLQIECSKPPVLYCDSQSAMHI 1160 Query: 1780 AANPVFHERTKHLEIDCHVVRDKVNSGLMKLQPMPSAHQVADLFTKALLPQSFSHLVTKL 1959 A+NPVFHERTKHLEIDCH+VR+KV GL++L P+ + Q+AD TKAL FS + KL Sbjct: 1161 ASNPVFHERTKHLEIDCHLVREKVQQGLLRLLPISTEDQLADCLTKALPAPKFSSFIHKL 1220 Query: 1960 QMLNIY 1977 + +IY Sbjct: 1221 GLRDIY 1226 >GAU50785.1 hypothetical protein TSUD_192210 [Trifolium subterraneum] Length = 1214 Score = 662 bits (1708), Expect = 0.0 Identities = 343/671 (51%), Positives = 442/671 (65%), Gaps = 11/671 (1%) Frame = +1 Query: 1 RNVVFYENIFPYKCNSSRGSVAYLKRWDTGXXXXXXXXXXXXXXXXXXXXXXLSLNGQPS 180 RNV +E+IFP + ++ + Y + +SL+ Sbjct: 549 RNVTHHEHIFPDQSSTPKVPWTYHTDSLSSPNPYINTPLSNSHDPTPPIDGDISLDNNRH 608 Query: 181 SFPNPPNTPATSQDTPIDTNHLPDXXXXXXXXXXNLRHSTRP---HKPPSHLNDYICNH- 348 +PP++ + +P N + N +TRP + P HL+DY+CN+ Sbjct: 609 QSLSPPHSSVLTSPSPT-YNDISPSSTTSTLPTDNSNTNTRPIRQRRAPLHLSDYVCNNS 667 Query: 349 -------ISSINPPRYPIHAYVSYASLSQPHRAYTMSLDAVPEPSSFCEASKHTCWVEAM 507 I S N +YP+ ++ S LS H+AY+MS+ EP S+ EASKH W+ AM Sbjct: 668 FSTSNEPIISGNTSKYPLSSFHSLTQLSPSHKAYSMSITHCTEPQSYEEASKHENWLIAM 727 Query: 508 QQELTALEKNQTWIIVDKPANVIPIGCKWVYKVKRKADGSLERYKARLVAKGYTQTEGID 687 + EL AL KN TW +V+ P ++ PIGC+WVYKVK KADG++ERYKARLVAKGY Q EGID Sbjct: 728 KTELDALAKNCTWTLVELPPHIKPIGCRWVYKVKHKADGTIERYKARLVAKGYNQVEGID 787 Query: 688 YFDTFSPVAKMTTVRTLIALAAIKGWHIHQLDVNNAFLHGQLQEKVYTSIPQGVQISGSN 867 YF+TFSPVAK+TTVRTL+A+AAIK WH+HQLDVNNAFLHG LQE VY +P GVQ N Sbjct: 788 YFETFSPVAKLTTVRTLLAIAAIKNWHLHQLDVNNAFLHGDLQEDVYMKVPDGVQCDKPN 847 Query: 868 KVCKLLKSLYGLKQASRKWYERLTNLLLGLGFKQAHSDHSLFTKITQSSYIALLIYIDDI 1047 VCKL KSLYGLKQASRKWYE+LT LL+ G+ QA SD+SLFT + ALL+Y+DDI Sbjct: 848 LVCKLQKSLYGLKQASRKWYEKLTALLIIEGYTQAASDYSLFTLAKGDDFTALLVYVDDI 907 Query: 1048 VLVGDCLSQLQQIKTVLDHNXXXXXXXXXXXXXXXEAAHSPRGISLCQRQYCLNLLTDTG 1227 +L G+ +S+ +IK VLD E AHS GI++ QR+YCL++L D+G Sbjct: 908 ILAGNSISEFDRIKAVLDAAFKIKNLGQLKYFLGLEVAHSKSGITISQRKYCLDMLKDSG 967 Query: 1228 VLAAKPASTPMAPNSHLHQDDGPLFPDVECYRRLVGRLLYLTTTRPDITFAVQQLSQFLS 1407 +L +KPA TPM + LH + G + DV YRR+VG+LLYL TTRPDI FA QQLSQF+ Sbjct: 968 LLGSKPAMTPMDTSIKLHSNAGIPYDDVSSYRRMVGKLLYLNTTRPDIAFATQQLSQFMH 1027 Query: 1408 KPTMTHYKAAQRILHYLKGSPGQGLFFSRSSSLDLQGYTDSNWAGCPDSRRSISGYCFFL 1587 PT TH+ AA R+L YLK +PGQG+ FSR S L L GY+D++WAGC D+RRSI+GYCFF+ Sbjct: 1028 APTTTHFTAACRVLRYLKNNPGQGVLFSRDSELQLIGYSDADWAGCMDTRRSITGYCFFI 1087 Query: 1588 GESLIAWRSKKQTTVACSSSEAEYRALAFATCELQWIQFLLDDLGIVHSKQPVLYCDNKS 1767 G+SL++WR+KKQ TV+ SSSEAEYRAL+ AT L + K P LYCDN+S Sbjct: 1088 GKSLVSWRAKKQVTVSRSSSEAEYRALSSATY-----------LRVKLQKTPTLYCDNQS 1136 Query: 1768 ALHIAANPVFHERTKHLEIDCHVVRDKVNSGLMKLQPMPSAHQVADLFTKALLPQSFSHL 1947 A+HIA+NPVFHERTKHL+IDCH+VR+KV G++KL P+ + Q+AD TKAL P F Sbjct: 1137 AVHIASNPVFHERTKHLDIDCHLVREKVMQGILKLLPVSTHDQMADFLTKALAPPKFHAF 1196 Query: 1948 VTKLQMLNIYQ 1980 V+KL M+NIYQ Sbjct: 1197 VSKLNMINIYQ 1207 >KHN05285.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 1346 Score = 666 bits (1718), Expect = 0.0 Identities = 344/653 (52%), Positives = 440/653 (67%), Gaps = 12/653 (1%) Frame = +1 Query: 1 RNVVFYENIFPYKCNSSRGSVAYLKRWDTGXXXXXXXXXXXXXXXXXXXXXXLSLNGQPS 180 R+V +E+IFPY+ +S + Y T +SL+ Sbjct: 706 RDVTHHEHIFPYQSSSPKTPWEYHSISPT------------PNDSDITLDSDISLDINAE 753 Query: 181 SFPNPPNT---PATSQDTPI-DTNHLPDXXXXXXXXXXNLRHSTRPHKPPSHLNDYICNH 348 P+PP++ P S DT I DT+ R + P HL+DY+C++ Sbjct: 754 QSPSPPHSSLSPNISNDTVISDTSTSTPPPKDHNDSPLLHSKPIRQRRAPLHLSDYVCHN 813 Query: 349 IS--------SINPPRYPIHAYVSYASLSQPHRAYTMSLDAVPEPSSFCEASKHTCWVEA 504 S S +YP+ ++ S LS H+A++MS+ EP S+ EASKH WV A Sbjct: 814 TSPTSHESLTSGTKSKYPLSSFHSLTLLSPSHKAFSMSITHCTEPQSYEEASKHEHWVTA 873 Query: 505 MQQELTALEKNQTWIIVDKPANVIPIGCKWVYKVKRKADGSLERYKARLVAKGYTQTEGI 684 M++EL AL KN TW IV+ P + PIGCKWVYKVK KA+G +ERYKARLVAKGY Q EGI Sbjct: 874 MKEELNALAKNCTWKIVELPPHTKPIGCKWVYKVKHKANGQIERYKARLVAKGYNQVEGI 933 Query: 685 DYFDTFSPVAKMTTVRTLIALAAIKGWHIHQLDVNNAFLHGQLQEKVYTSIPQGVQISGS 864 DYF+TFSPVAK+TTVRTL+A+AAIK WH+HQLDVNNAFLHG LQE VY IP GV + Sbjct: 934 DYFETFSPVAKITTVRTLLAVAAIKNWHLHQLDVNNAFLHGDLQEDVYMKIPDGVTCAKP 993 Query: 865 NKVCKLLKSLYGLKQASRKWYERLTNLLLGLGFKQAHSDHSLFTKITQSSYIALLIYIDD 1044 N VCKL KSLYGLKQASRKWYE+LTNLLL G+ Q+ SD+SLFT +++ ALL+Y+DD Sbjct: 994 NSVCKLQKSLYGLKQASRKWYEKLTNLLLKEGYIQSISDYSLFTLTKGNTFTALLVYVDD 1053 Query: 1045 IVLVGDCLSQLQQIKTVLDHNXXXXXXXXXXXXXXXEAAHSPRGISLCQRQYCLNLLTDT 1224 I+L GD + + +IK VLD E AHS GI++ QR+YCL+LL D+ Sbjct: 1054 IILAGDSIDEFDRIKNVLDLAFKIKNLGKLKYFLGLEVAHSRLGITISQRKYCLDLLKDS 1113 Query: 1225 GVLAAKPASTPMAPNSHLHQDDGPLFPDVECYRRLVGRLLYLTTTRPDITFAVQQLSQFL 1404 G+L KPASTP+ + LH G + D+ YRR+VG+LLYL TTRPDI FA QQLSQF+ Sbjct: 1114 GLLGCKPASTPLDTSIKLHSAAGTPYADISGYRRIVGKLLYLNTTRPDIAFATQQLSQFM 1173 Query: 1405 SKPTMTHYKAAQRILHYLKGSPGQGLFFSRSSSLDLQGYTDSNWAGCPDSRRSISGYCFF 1584 PT H+ AA R+L YLK +PGQG+FFSR+S + L GY+D++WAGC DSR+SISGYCFF Sbjct: 1174 QAPTNVHFNAACRVLRYLKNNPGQGIFFSRTSEMQLIGYSDADWAGCMDSRKSISGYCFF 1233 Query: 1585 LGESLIAWRSKKQTTVACSSSEAEYRALAFATCELQWIQFLLDDLGIVHSKQPVLYCDNK 1764 +G+SL++WR+KKQ TV+ SSSEAEYRAL+ A CELQW+ +L DL + ++ P LYCDN+ Sbjct: 1234 IGKSLVSWRAKKQATVSRSSSEAEYRALSSAACELQWLLYLFADLRVQLTRTPTLYCDNQ 1293 Query: 1765 SALHIAANPVFHERTKHLEIDCHVVRDKVNSGLMKLQPMPSAHQVADLFTKAL 1923 SA+HIA+NPVFHERTKHLEIDCH+VR+K+ G +KL P+ ++ QVAD TKAL Sbjct: 1294 SAVHIASNPVFHERTKHLEIDCHLVREKLLKGTLKLLPVSTSDQVADFLTKAL 1346 >KYP36109.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1115 Score = 643 bits (1659), Expect = 0.0 Identities = 344/679 (50%), Positives = 441/679 (64%), Gaps = 12/679 (1%) Frame = +1 Query: 1 RNVVFYENIFPYKCNSSRGSVAYLKRWDTGXXXXXXXXXXXXXXXXXXXXXXLSLNGQPS 180 RNV+F+E FP+ ++S + ++ + L+++ P Sbjct: 465 RNVIFHETSFPF--HTSAPTSPFITQ------------------------PSLTISPNPD 498 Query: 181 SFPNPPNTPATSQDTPIDTNHLPDXXXXXXXXXXNLRHSTRPHKPPSHLNDYICNHISSI 360 PP +P DT N PD + P P S +D I ++ +S Sbjct: 499 ILTTPPTSP---NDTSPSQNPNPDASASP----------SSPAAPTSPFHDSISSYPTST 545 Query: 361 NPPRYPIHAYVSYASLSQPHRA------------YTMSLDAVPEPSSFCEASKHTCWVEA 504 + P H S A + P R+ Y L ++ S + ++ CW EA Sbjct: 546 DFDIVPHHRSSSPA-IPPPRRSERDIHPPPYLSEYQYKLPSLKSVQSSTRSIQNPCWREA 604 Query: 505 MQQELTALEKNQTWIIVDKPANVIPIGCKWVYKVKRKADGSLERYKARLVAKGYTQTEGI 684 M EL ALE N TW +V+ P++V PIGCKWV+K+KR +GS+ERYKARLVAKGY+Q EGI Sbjct: 605 MTCELKALELNHTWDVVETPSHVRPIGCKWVFKIKRLPNGSIERYKARLVAKGYSQIEGI 664 Query: 685 DYFDTFSPVAKMTTVRTLIALAAIKGWHIHQLDVNNAFLHGQLQEKVYTSIPQGVQISGS 864 DYF+TFSPV KMTTVR ++ALA+I WHI QLDV+NAFLHG L E VY +PQG+ S Sbjct: 665 DYFETFSPVVKMTTVRVVLALASINQWHIQQLDVSNAFLHGDLLEDVYMELPQGLTGYSS 724 Query: 865 NKVCKLLKSLYGLKQASRKWYERLTNLLLGLGFKQAHSDHSLFTKITQSSYIALLIYIDD 1044 + CKL KSLYGLKQ+SRKWYE+L+NLLL G+KQAHSDHSLFTK S++ ALLIY+DD Sbjct: 725 SHSCKLRKSLYGLKQSSRKWYEKLSNLLLSNGYKQAHSDHSLFTKRHGSAFTALLIYVDD 784 Query: 1045 IVLVGDCLSQLQQIKTVLDHNXXXXXXXXXXXXXXXEAAHSPRGISLCQRQYCLNLLTDT 1224 IVL G+ +++ +IK +L N E AHS +GISLCQR+YCL+LL+D+ Sbjct: 785 IVLTGNSAAEITRIKHILHSNFHVKDLGQLKYFLGIEVAHSSKGISLCQRKYCLDLLSDS 844 Query: 1225 GVLAAKPASTPMAPNSHLHQDDGPLFPDVECYRRLVGRLLYLTTTRPDITFAVQQLSQFL 1404 G+L KP+STPM LH D D YRRLVGRL+YLT TRPDI F+ QQLSQF+ Sbjct: 845 GMLGCKPSSTPMDSTLRLHDDASGYLDDPLPYRRLVGRLVYLTNTRPDIAFSTQQLSQFM 904 Query: 1405 SKPTMTHYKAAQRILHYLKGSPGQGLFFSRSSSLDLQGYTDSNWAGCPDSRRSISGYCFF 1584 SKPT H+ AA R+L YLK PG GLFF R + + G++D++WA C SRRSI+GYCFF Sbjct: 905 SKPTNAHHAAAMRVLRYLKSCPGTGLFFPRVCPIQVSGFSDADWATCVASRRSITGYCFF 964 Query: 1585 LGESLIAWRSKKQTTVACSSSEAEYRALAFATCELQWIQFLLDDLGIVHSKQPVLYCDNK 1764 +G +LI+W++KKQTTV+ SSSEAEYRALA ATCELQWI +LL DL I S+ +LYCDN Sbjct: 965 IGNALISWKTKKQTTVSRSSSEAEYRALASATCELQWIIYLLRDLHISLSQISLLYCDNT 1024 Query: 1765 SALHIAANPVFHERTKHLEIDCHVVRDKVNSGLMKLQPMPSAHQVADLFTKALLPQSFSH 1944 SALHIAANPVFHERTKHL+IDCH+VR+K +GLMKL P+PSA Q+AD+FTKAL P +F Sbjct: 1025 SALHIAANPVFHERTKHLDIDCHIVREKTQAGLMKLLPVPSAKQLADIFTKALPPHAFKL 1084 Query: 1945 LVTKLQMLNIYQSLDCAGI 2001 ++KLQ+ NI+ C G+ Sbjct: 1085 NLSKLQLQNIFAPPACGGL 1103 >KYP61022.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1316 Score = 637 bits (1644), Expect = 0.0 Identities = 315/573 (54%), Positives = 411/573 (71%), Gaps = 5/573 (0%) Frame = +1 Query: 286 LRHSTRPHKPPSHLNDYICNHIS-----SINPPRYPIHAYVSYASLSQPHRAYTMSLDAV 450 LR STRP +PP++L D+ S S R+P+H+++SY LS Y S+ +V Sbjct: 741 LRRSTRPRRPPTYLQDFHGAFTSTSTAHSSTGIRHPLHSFLSYDLLSPSFHHYVFSISSV 800 Query: 451 PEPSSFCEASKHTCWVEAMQQELTALEKNQTWIIVDKPANVIPIGCKWVYKVKRKADGSL 630 EP +F EASK W++AM +E+ ALE N TW++ P + IGC+WVYKVK KADGS+ Sbjct: 801 TEPKNFAEASKSDSWLKAMHEEIFALEANNTWVLTTLPPHKTAIGCRWVYKVKHKADGSI 860 Query: 631 ERYKARLVAKGYTQTEGIDYFDTFSPVAKMTTVRTLIALAAIKGWHIHQLDVNNAFLHGQ 810 +RYKARLVAKGYTQ EG+D+FDTFSPVAK+TTVR L++LAAI WH+ QLDVNNAFLHG Sbjct: 861 DRYKARLVAKGYTQMEGLDFFDTFSPVAKLTTVRLLLSLAAINNWHLKQLDVNNAFLHGD 920 Query: 811 LQEKVYTSIPQGVQISGSNKVCKLLKSLYGLKQASRKWYERLTNLLLGLGFKQAHSDHSL 990 L E+VY +P G+ S +VC+L +SLYGLKQASR+WY RL++ L+ G+ + SDHSL Sbjct: 921 LNEEVYMQLPPGLTPSFPGQVCRLQRSLYGLKQASRQWYARLSSFLIQHGYVPSPSDHSL 980 Query: 991 FTKITQSSYIALLIYIDDIVLVGDCLSQLQQIKTVLDHNXXXXXXXXXXXXXXXEAAHSP 1170 F K + ++ A+LIY+DDIVL G+ L+++ + ++L E A + Sbjct: 981 FLKCSPATTTAILIYVDDIVLAGNDLTEIHHLTSLLHTTFQIKDLGNLKYFLGLEVARNH 1040 Query: 1171 RGISLCQRQYCLNLLTDTGVLAAKPASTPMAPNSHLHQDDGPLFPDVECYRRLVGRLLYL 1350 GI LCQR+Y L+LL+DTG+LA+KP STPM + HL G D YRRLVGRL+YL Sbjct: 1041 TGIHLCQRKYILDLLSDTGMLASKPVSTPMDYSMHLSASSGTPLTDTAAYRRLVGRLIYL 1100 Query: 1351 TTTRPDITFAVQQLSQFLSKPTMTHYKAAQRILHYLKGSPGQGLFFSRSSSLDLQGYTDS 1530 T TRPDIT+AVQQLSQF+S PT H +A RIL YLKG+PG G+F S +SS+ L+ ++DS Sbjct: 1101 TNTRPDITYAVQQLSQFVSNPTTAHRQALFRILRYLKGTPGSGIFLSVNSSVQLRAFSDS 1160 Query: 1531 NWAGCPDSRRSISGYCFFLGESLIAWRSKKQTTVACSSSEAEYRALAFATCELQWIQFLL 1710 +WAGCPD+RRSI+G+ +LG+SLI+W+SKKQ TV+ SSSEAEYRALA TCELQW+ +LL Sbjct: 1161 DWAGCPDTRRSITGFAVYLGDSLISWKSKKQITVSRSSSEAEYRALATTTCELQWLSYLL 1220 Query: 1711 DDLGIVHSKQPVLYCDNKSALHIAANPVFHERTKHLEIDCHVVRDKVNSGLMKLQPMPSA 1890 D I +LYCDN+SAL IA+NPVFHERTKH+EIDCH+VRDKV++GL+KL P+ S+ Sbjct: 1221 KDFHIDPISPSILYCDNQSALQIASNPVFHERTKHIEIDCHIVRDKVSTGLLKLLPVSSS 1280 Query: 1891 HQVADLFTKALLPQSFSHLVTKLQMLNIYQSLD 1989 Q+AD+ TK L P F +KL MLNI+ L+ Sbjct: 1281 QQLADILTKPLSPFVFRSHCSKLGMLNIHSQLE 1313 >KYP65734.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 1013 Score = 627 bits (1616), Expect = 0.0 Identities = 318/616 (51%), Positives = 422/616 (68%), Gaps = 13/616 (2%) Frame = +1 Query: 196 PNTPATSQDTPIDTNHLPDXXXXXXXXXXNLRHSTRPHKPPSHLNDYIC---------NH 348 P +P +S P + H P RHSTR +PPS+L DY C N Sbjct: 404 PPSPTSSTSIPPEQPHQP-----LSPAPAPSRHSTRMRQPPSYLKDYHCSLLAPTGRINS 458 Query: 349 ISSINPPRYPIHAYVSYASLSQPHRAYTMSLDAVPEPSSFCEASKHTCWVEAMQQELTAL 528 S I+ P + I + ++Y S ++ + +S+ EP ++ +ASK+ CW+ AM+ EL AL Sbjct: 459 FSGISTP-HSISSTLTYDFCSPSYKQFCLSVSTNFEPHTYTQASKYDCWIMAMKTELAAL 517 Query: 529 EKNQTWIIVDKPANVIPIGCKWVYKVKRKADGSLERYKARLVAKGYTQTEGIDYFDTFSP 708 + NQTW IVD P+ PIGCKWVYK+K +DGS+ERYKARLVAKGY+QTEG+DY DT+SP Sbjct: 518 DMNQTWSIVDLPSGKRPIGCKWVYKIKYLSDGSIERYKARLVAKGYSQTEGLDYLDTYSP 577 Query: 709 VAKMTTVRTLIALAAIKGWHIHQLDVNNAFLHGQLQEKVYTSIPQGVQISGSN----KVC 876 VAK+TTVR L+AL AIKGW + QLDVNNAFLHG L E+VY ++P G+ + S+ KVC Sbjct: 578 VAKLTTVRVLLALTAIKGWFLEQLDVNNAFLHGDLHEEVYMTLPPGLSVPSSSNTAPKVC 637 Query: 877 KLLKSLYGLKQASRKWYERLTNLLLGLGFKQAHSDHSLFTKITQSSYIALLIYIDDIVLV 1056 KL KS+YGLKQASR+WY +L++ L+ +G+ + +DHSLF K + S + ALL+Y+DDI+L Sbjct: 638 KLHKSIYGLKQASRQWYSKLSSALISMGYSPSTADHSLFIKSSSSHFTALLVYVDDIILA 697 Query: 1057 GDCLSQLQQIKTVLDHNXXXXXXXXXXXXXXXEAAHSPRGISLCQRQYCLNLLTDTGVLA 1236 G+ ++ IK L E A S +GI L QR+Y L +L D G LA Sbjct: 698 GNDKPEIDFIKAQLHKCFKIKDLGNLRYFLGLEIARSNKGILLNQRKYTLEILEDVGFLA 757 Query: 1237 AKPASTPMAPNSHLHQDDGPLFPDVECYRRLVGRLLYLTTTRPDITFAVQQLSQFLSKPT 1416 AKP+STP P+ LH D G + D YRRL+GRLLYLTTTRPDI++ VQQLSQF+SKP Sbjct: 758 AKPSSTPFNPSLKLHSDHGSPYNDETAYRRLIGRLLYLTTTRPDISYVVQQLSQFVSKPL 817 Query: 1417 MTHYKAAQRILHYLKGSPGQGLFFSRSSSLDLQGYTDSNWAGCPDSRRSISGYCFFLGES 1596 HY+AA RIL YLKGS G+GLF+S S+SL L + DS+WA C SR+SI+G+C FLG S Sbjct: 818 DIHYQAATRILRYLKGSHGRGLFYSSSASLKLSAFADSDWASCSISRKSITGFCVFLGSS 877 Query: 1597 LIAWRSKKQTTVACSSSEAEYRALAFATCELQWIQFLLDDLGIVHSKQPVLYCDNKSALH 1776 LI+WRSKKQ+T++ SSSEAEYRALA TCELQW+ +L +DL + ++CDNKSA++ Sbjct: 878 LISWRSKKQSTISRSSSEAEYRALASLTCELQWLHYLFNDLKTSLNFPTSVFCDNKSAIY 937 Query: 1777 IAANPVFHERTKHLEIDCHVVRDKVNSGLMKLQPMPSAHQVADLFTKALLPQSFSHLVTK 1956 +A NP FHERTKH+EIDCHV+R+K+ S L+ L P+PS+ Q+AD FTK L SF+ V+K Sbjct: 938 LAHNPTFHERTKHIEIDCHVIREKIQSRLLHLLPVPSSSQLADAFTKPLHATSFNSFVSK 997 Query: 1957 LQMLNIYQSLDCAGII 2004 L + +++ S C G++ Sbjct: 998 LGLYDVHSSA-CGGLL 1012 >KYP34298.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 1002 Score = 626 bits (1615), Expect = 0.0 Identities = 320/615 (52%), Positives = 434/615 (70%), Gaps = 10/615 (1%) Frame = +1 Query: 172 QPSSFP-NPPNTPATSQDTPIDTNHLPDXXXXXXXXXXN-LRHSTRPHKPPSHLNDY--- 336 Q S P + P P + TP+ TN P +R STRP K PS+L+DY Sbjct: 384 QTISLPRHQPPLPIDTDPTPLSTNTTPTSSPVSVVPPPPFVRKSTRPRKLPSYLHDYHHT 443 Query: 337 -ICNHIS-SINPPRYPIHAYVSYASLSQPHRAYTMSLDAVPEPSSFCEASKHTCWVEAMQ 510 + H S +I+ P Y IH ++SY++LS +A+++S+ ++ EP+S+ EA + W A+Q Sbjct: 444 LLTTHNSPTISQPLYSIHNHISYSNLSPSQKAFSLSISSIKEPNSYVEAIQDESWKTAIQ 503 Query: 511 QELTALEKNQTWIIVDKPANVIPIGCKWVYKVKRKADGSLERYKARLVAKGYTQTEGIDY 690 ELTALEKN TWI+ P N +GCKWV+K+K +DG++ER+KARLVAKGYTQTE +DY Sbjct: 504 TELTALEKNNTWILTPLPPNKQVVGCKWVFKLKFNSDGTIERHKARLVAKGYTQTETLDY 563 Query: 691 FDTFSPVAKMTTVRTLIALAAIKGWHIHQLDVNNAFLHGQLQEKVYTSIPQGVQISG--S 864 DTFSPV KMTTVRTL+A+A K WHIHQLDVN FLHG L E+VY + P G+ +S S Sbjct: 564 LDTFSPVVKMTTVRTLLAVATAKNWHIHQLDVNTTFLHGDLHEEVYMTPPPGLTVSPHQS 623 Query: 865 NKVCKLLKSLYGLKQASRKWYERLTNLLLGLGFKQAHSDHSLFTKITQSSYIALLIYIDD 1044 N VCKL+KSLYGLKQASR+W +LT++L+ GFKQ+ +D+SLFTK + + A+L+Y+DD Sbjct: 624 NCVCKLVKSLYGLKQASRQWNAKLTSVLIDSGFKQSMADYSLFTKQFGAKFTAILVYVDD 683 Query: 1045 IVLVGDCLSQLQQIKTVLDHNXXXXXXXXXXXXXXXEAAHSPRGISLCQRQYCLNLLTDT 1224 +VL G+ +++ IK++LD E A S GI+L QR+Y L+L+ DT Sbjct: 684 LVLAGNDPTEINYIKSLLDQKFTIKDLGQLKYFLGMEVARSSTGIALYQRKYALDLIEDT 743 Query: 1225 GVLAAKPASTPMAPNSHLHQDDGPLFPDVECYRRLVGRLLYLTTTRPDITFAVQQLSQFL 1404 G+LA+KP +PM + LH+ G D YRRL+GRL+YLT TR DI+F+V LSQF+ Sbjct: 744 GLLASKPCKSPMDHSVKLHKTVGTPLTDPTQYRRLLGRLIYLTNTRADISFSVNHLSQFM 803 Query: 1405 SKPTMTHYKAAQRILHYLKGSPGQGLFFSRSSSLDLQGYTDSNWAGCPDSRRSISGYCFF 1584 +PT HY+AA RIL Y+K +PG+GLFF SS L L+GY+DS+WA C D+RRS++G+ FF Sbjct: 804 DQPTDVHYQAALRILKYVKNAPGKGLFFPSSSDLTLKGYSDSDWASCSDTRRSVTGFSFF 863 Query: 1585 LGESLIAWRSKKQTTVACSSSEAEYRALAFATCELQWIQFLLDDLGIVHSKQP-VLYCDN 1761 LG +LI+W+SKKQ TV+ SS+EAEYRALA +TCE QW+ +LL D G+ HS P VL+CDN Sbjct: 864 LGPALISWKSKKQATVSKSSAEAEYRALAQSTCEAQWLSYLLHDFGL-HSFHPIVLFCDN 922 Query: 1762 KSALHIAANPVFHERTKHLEIDCHVVRDKVNSGLMKLQPMPSAHQVADLFTKALLPQSFS 1941 +SALHIA+NPVFHERTK++E+DCH+VR+K+ GL+ L P+ +A Q+AD+FTKAL + F Sbjct: 923 QSALHIASNPVFHERTKNIELDCHIVREKLQVGLIHLLPISTADQLADVFTKALSLRPFE 982 Query: 1942 HLVTKLQMLNIYQSL 1986 ++ KL M +I+ SL Sbjct: 983 QIIFKLGMFDIHSSL 997 >KYP34293.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1376 Score = 637 bits (1643), Expect = 0.0 Identities = 334/668 (50%), Positives = 439/668 (65%), Gaps = 5/668 (0%) Frame = +1 Query: 1 RNVVFYENIFPY-KCNSSRGSVAYLKRWDTGXXXXXXXXXXXXXXXXXXXXXXLSLNGQP 177 RNV FYEN FP + SS ++ + G +S+ P Sbjct: 718 RNVSFYENHFPLLQSTSSTSNIPVVSPISFGIHSPSHDL--------------ISILPDP 763 Query: 178 SSFPNPPNTPATSQDTPIDTNHLPDXXXXXXXXXXNLRHSTRPHKPPSHLNDY---ICNH 348 PAT+ I LR STR PPS+L DY + + Sbjct: 764 HQHNVTSPNPATTSHDSISLAPYSTTADSLPPNSSPLRRSTRLRNPPSYLQDYHHSLTST 823 Query: 349 ISSINPPR-YPIHAYVSYASLSQPHRAYTMSLDAVPEPSSFCEASKHTCWVEAMQQELTA 525 ++++P YPI Y+SY+ LS +A+ S+ AV EP S+ EA+KH CW++AM EL A Sbjct: 824 STNLHPGMLYPIEKYISYSRLSNDFQAFVSSISAVSEPHSYAEAAKHDCWLKAMHAELEA 883 Query: 526 LEKNQTWIIVDKPANVIPIGCKWVYKVKRKADGSLERYKARLVAKGYTQTEGIDYFDTFS 705 L+ NQTW + P + +GC+W+YK+K ADGS+ERYKARLVAKGYTQ EG+DY TFS Sbjct: 884 LKMNQTWTLTPLPPHKQAVGCRWIYKIKYNADGSIERYKARLVAKGYTQVEGLDYLATFS 943 Query: 706 PVAKMTTVRTLIALAAIKGWHIHQLDVNNAFLHGQLQEKVYTSIPQGVQISGSNKVCKLL 885 PVAK+TTVR L+ALAA+ WH+ QLDVNNAFLHG L E+VY ++P G++ SN+VCKL Sbjct: 944 PVAKLTTVRLLLALAAVFDWHLKQLDVNNAFLHGDLNEEVYMTLPLGMRPEYSNQVCKLQ 1003 Query: 886 KSLYGLKQASRKWYERLTNLLLGLGFKQAHSDHSLFTKITQSSYIALLIYIDDIVLVGDC 1065 KSLYGLKQASR+W+ +L++ L+ G+ Q+ SDHSLF K + SS ALLIY+DDIVL G+ Sbjct: 1004 KSLYGLKQASRQWFAKLSSFLIHHGYHQSASDHSLFMKFSSSSTTALLIYVDDIVLAGNN 1063 Query: 1066 LSQLQQIKTVLDHNXXXXXXXXXXXXXXXEAAHSPRGISLCQRQYCLNLLTDTGVLAAKP 1245 LS++Q I +LD E A + GI L QR+Y L++L+D G++A++P Sbjct: 1064 LSEIQLITGLLDVAFKIKDLGNLKYFLGLEVARNKSGIHLSQRKYVLDILSDCGMMASRP 1123 Query: 1246 ASTPMAPNSHLHQDDGPLFPDVECYRRLVGRLLYLTTTRPDITFAVQQLSQFLSKPTMTH 1425 STPM S L G D YRRL+GRL+YLTTTRPDI++ V LSQF+S P+ H Sbjct: 1124 VSTPMDYTSRLSASSGTPLADPSSYRRLLGRLIYLTTTRPDISYVVHHLSQFMSAPSTAH 1183 Query: 1426 YKAAQRILHYLKGSPGQGLFFSRSSSLDLQGYTDSNWAGCPDSRRSISGYCFFLGESLIA 1605 +A RIL YLK +PG GLFF +SSL L+ ++DS+WAGC D+RRSI+G+ +LG+SLI+ Sbjct: 1184 SQAIFRILRYLKQAPGSGLFFPTNSSLHLKAFSDSDWAGCLDTRRSITGFSVYLGDSLIS 1243 Query: 1606 WRSKKQTTVACSSSEAEYRALAFATCELQWIQFLLDDLGIVHSKQPVLYCDNKSALHIAA 1785 WRSKKQ TV+ SSSEAEYRALA T ELQW+ +LL DL + + +LYCDN+SALHIAA Sbjct: 1244 WRSKKQPTVSRSSSEAEYRALATTTSELQWLTYLLHDLHVPVHQPALLYCDNQSALHIAA 1303 Query: 1786 NPVFHERTKHLEIDCHVVRDKVNSGLMKLQPMPSAHQVADLFTKALLPQSFSHLVTKLQM 1965 N VFHERTKH++IDCH+VR+K+ SGL+KL P+ S HQ+AD+FTK+L P F+ L +KL M Sbjct: 1304 NQVFHERTKHIDIDCHLVREKLQSGLLKLLPVASPHQLADIFTKSLSPSMFTALYSKLGM 1363 Query: 1966 LNIYQSLD 1989 LN+Y L+ Sbjct: 1364 LNLYSQLE 1371 >KYP55668.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 1136 Score = 620 bits (1598), Expect = 0.0 Identities = 307/572 (53%), Positives = 408/572 (71%), Gaps = 4/572 (0%) Frame = +1 Query: 286 LRHSTRPHKPPSHLNDYICNHIS----SINPPRYPIHAYVSYASLSQPHRAYTMSLDAVP 453 LR STRP +PP++L D+ S S R+P+H+++SY LS Y S+ +V Sbjct: 562 LRRSTRPRRPPTYLQDFHGAFTSTGPHSSTGIRHPLHSFISYDRLSPSFHHYVFSISSVT 621 Query: 454 EPSSFCEASKHTCWVEAMQQELTALEKNQTWIIVDKPANVIPIGCKWVYKVKRKADGSLE 633 +P +F EASK W++AM +E++ALE N TW++ P + IGC+WVYKVK KADGS++ Sbjct: 622 KPKNFVEASKSDSWLKAMHEEISALEANNTWVLTTLPPHKTAIGCRWVYKVKHKADGSID 681 Query: 634 RYKARLVAKGYTQTEGIDYFDTFSPVAKMTTVRTLIALAAIKGWHIHQLDVNNAFLHGQL 813 RYKARLVAKGYTQ EG+D+FDTFSPVAK+TTVR LI+LAAI H+ QLDVNN+FLHG L Sbjct: 682 RYKARLVAKGYTQMEGLDFFDTFSPVAKLTTVRLLISLAAIHNCHLKQLDVNNSFLHGDL 741 Query: 814 QEKVYTSIPQGVQISGSNKVCKLLKSLYGLKQASRKWYERLTNLLLGLGFKQAHSDHSLF 993 E+VY +P G+ S +VC+L +SLYGLKQASR+WY RL++ L+ G+ + SDHSLF Sbjct: 742 NEEVYMQLPPGITPSFPGQVCRLQRSLYGLKQASRQWYARLSSFLIQHGYVPSPSDHSLF 801 Query: 994 TKITQSSYIALLIYIDDIVLVGDCLSQLQQIKTVLDHNXXXXXXXXXXXXXXXEAAHSPR 1173 K + + A+LIY+DDIVL G+ L+++ + ++L + E A + Sbjct: 802 LKCSPAITTAILIYVDDIVLAGNDLTEIHHLTSLLHNTFQIKDLGNLKYFLGLEVARNHT 861 Query: 1174 GISLCQRQYCLNLLTDTGVLAAKPASTPMAPNSHLHQDDGPLFPDVECYRRLVGRLLYLT 1353 GI LCQR+Y L+LL+DTG+LA+KP STPM ++HL G F D YRRLVGRL+YL Sbjct: 862 GIHLCQRKYTLDLLSDTGMLASKPVSTPMDYSTHLSASSGTPFTDTAAYRRLVGRLIYLP 921 Query: 1354 TTRPDITFAVQQLSQFLSKPTMTHYKAAQRILHYLKGSPGQGLFFSRSSSLDLQGYTDSN 1533 TRP I +AVQQLSQF+S P H +A RIL YLKG+PG G+F S +SS+ L+ ++D + Sbjct: 922 NTRPAIAYAVQQLSQFVSNPPTAHRQALFRILCYLKGTPGSGIFLSVNSSVQLRAFSDYD 981 Query: 1534 WAGCPDSRRSISGYCFFLGESLIAWRSKKQTTVACSSSEAEYRALAFATCELQWIQFLLD 1713 WAGCPD+RRSI+G+ +LG+SLI+W+SKKQ TV+ SSSEAEYRALA TCELQW+ +LL Sbjct: 982 WAGCPDTRRSITGFAVYLGDSLISWKSKKQITVSRSSSEAEYRALATTTCELQWLSYLLK 1041 Query: 1714 DLGIVHSKQPVLYCDNKSALHIAANPVFHERTKHLEIDCHVVRDKVNSGLMKLQPMPSAH 1893 D I + +LYCDN+ AL IA+NP+FHERTKH+EIDCH+VRDKV++GL+KL P+ S+ Sbjct: 1042 DFHIDLIRPSILYCDNQFALQIASNPIFHERTKHIEIDCHIVRDKVSTGLLKLLPVSSSL 1101 Query: 1894 QVADLFTKALLPQSFSHLVTKLQMLNIYQSLD 1989 Q+AD+ TK L P F +KL MLNI+ L+ Sbjct: 1102 QLADILTKPLSPFVFHSHYSKLGMLNIHSQLE 1133 >GAU41679.1 hypothetical protein TSUD_272630 [Trifolium subterraneum] Length = 1178 Score = 620 bits (1599), Expect = 0.0 Identities = 315/668 (47%), Positives = 432/668 (64%), Gaps = 9/668 (1%) Frame = +1 Query: 1 RNVVFYENIFPYKCNSSRGSVAYLKRWDTGXXXXXXXXXXXXXXXXXXXXXXLSLNGQPS 180 RNV+FYE FP+ ++S + D+ + + +P Sbjct: 523 RNVIFYETDFPFHLSNS-------VKTDSASPASHLNHTLLYDAEPDPNALPIPVMHEPD 575 Query: 181 SFPNPPNTPATSQDTPI---DTNHLPDXXXXXXXXXXNLRHSTRPHKPPSHLNDYICN-- 345 +P P+ + TPI +++ +P+ LR S+R + P HL + C Sbjct: 576 LTLSPIIGPSYNDSTPINSPESSPIPNPAP--------LRKSSRVIQRPRHLEGFHCETL 627 Query: 346 ---HISSINPPRYPIHAYVSYASLSQPHRAYTMSLDAVPEPSSFCEASKHTCWVEAMQQE 516 H ++ + YP+ + +SY + + + A S+ A+ EP ++ +ASK CW AM E Sbjct: 628 IGTHSAASSNTVYPLSSVLSYNNCAPNYHALCCSISAIVEPKTYTQASKFECWRNAMNAE 687 Query: 517 LTALEKNQTWIIVDKPANVIPIGCKWVYKVKRKADGSLERYKARLVAKGYTQTEGIDYFD 696 L AL++N+TW +VD P +P+GCKWVYKVK A+GS+ERYKARLVAKGYTQ EG+DYFD Sbjct: 688 LLALDENKTWSVVDLPNGKVPVGCKWVYKVKYHANGSIERYKARLVAKGYTQLEGVDYFD 747 Query: 697 TFSPVAKMTTVRTLIALAAIKGWHIHQLDVNNAFLHGQLQEKVYTSIPQGVQISG-SNKV 873 TFSPVAK+TTVR L+ALA+IKGWH+ QLDVNNAFLHG L E VY S+P G + SNKV Sbjct: 748 TFSPVAKITTVRVLLALASIKGWHLEQLDVNNAFLHGDLNEDVYMSLPPGFAATNESNKV 807 Query: 874 CKLLKSLYGLKQASRKWYERLTNLLLGLGFKQAHSDHSLFTKITQSSYIALLIYIDDIVL 1053 CKL KS+YGLKQASR+WY +L++ L+ LG+ + SDHSL+ K T +S+ ALL+Y+DDIVL Sbjct: 808 CKLHKSIYGLKQASRQWYSKLSSSLVSLGYTPSQSDHSLYIKSTTNSFTALLVYVDDIVL 867 Query: 1054 VGDCLSQLQQIKTVLDHNXXXXXXXXXXXXXXXEAAHSPRGISLCQRQYCLNLLTDTGVL 1233 G+ + ++Q +K LD E A S GI + QR+Y L LL D G+L Sbjct: 868 AGNSIHEIQTVKLFLDQKFKIKDLGKLRYFLVLEIARSDTGIFVNQRKYTLELLEDVGLL 927 Query: 1234 AAKPASTPMAPNSHLHQDDGPLFPDVECYRRLVGRLLYLTTTRPDITFAVQQLSQFLSKP 1413 KP+S P P + L DG D YRRL+GRLLYLT TRPDI+F+VQ LSQF+SKP Sbjct: 928 GTKPSSIPFHPTTKLSSTDGAPLDDPSSYRRLIGRLLYLTHTRPDISFSVQHLSQFVSKP 987 Query: 1414 TMTHYKAAQRILHYLKGSPGQGLFFSRSSSLDLQGYTDSNWAGCPDSRRSISGYCFFLGE 1593 + HY AA IL YLK P +G+F S SSSL + + DS+WA CP++R+SI G+C LG Sbjct: 988 LVPHYNAAMHILKYLKSDPAKGIFLSASSSLKISAFADSDWARCPETRKSIIGFCVLLGS 1047 Query: 1594 SLIAWRSKKQTTVACSSSEAEYRALAFATCELQWIQFLLDDLGIVHSKQPVLYCDNKSAL 1773 SLI+W+SKKQ TV+ SS+EAEYRALA TCE+QW+Q++ D I+ S ++CDNKSA+ Sbjct: 1048 SLISWKSKKQNTVSRSSTEAEYRALASLTCEIQWLQYIFQDFKIIFSNPAYVFCDNKSAI 1107 Query: 1774 HIAANPVFHERTKHLEIDCHVVRDKVNSGLMKLQPMPSAHQVADLFTKALLPQSFSHLVT 1953 ++A NP FHER+KH+E+DCHV+R+K+ S L+ L P+P+ Q+AD+FTK L +FS ++ Sbjct: 1108 YLAHNPTFHERSKHIELDCHVIREKIQSKLIHLLPVPTTSQLADVFTKPLNHPAFSSFLS 1167 Query: 1954 KLQMLNIY 1977 KL + +I+ Sbjct: 1168 KLGLCSIH 1175 >KYP38774.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 727 Score = 605 bits (1559), Expect = 0.0 Identities = 319/677 (47%), Positives = 432/677 (63%), Gaps = 11/677 (1%) Frame = +1 Query: 1 RNVVFYENIFPYKCNSSRGSVAYLKRWDTGXXXXXXXXXXXXXXXXXXXXXXLSLNGQPS 180 +NV+FYE++FP+ S D ++N + Sbjct: 64 KNVLFYEDVFPFHNKVSAS--------DLDHDMINTNNFYFLFADIHVSTHNTNINVISN 115 Query: 181 SFPNPPNTPATSQDTPIDTNHLPDXXXXXXXXXXNLRHSTRPHKPPSHLNDYICN----H 348 N N SQ P++ N PD ++ STR P ++L DY CN H Sbjct: 116 DHNNNINHHDESQ-VPLECNS-PDATPTI------IKRSTRTKYPLAYLTDYHCNLLTGH 167 Query: 349 ISSI-NPPRYPIHAYVSYASLSQPHRAYTMSLDAVPEPSSFCE-ASKHTCWVEAMQQELT 522 +SI RYP+ +SY++LS H +T+ + ++ EP ++ E A K W++AMQ EL Sbjct: 168 DTSIFKDARYPLSTVLSYSNLSPTHSHFTLIISSLIEPKNYLELAIKSEHWIKAMQNELD 227 Query: 523 ALEKNQTWIIVDKPA-----NVIPIGCKWVYKVKRKADGSLERYKARLVAKGYTQTEGID 687 AL N+TW +V+ P +PIG KWVYK+K K+DG++ERYKARLVAKGY Q EG+D Sbjct: 228 ALRLNRTWSLVNLPTVNLPTGKVPIGSKWVYKIKYKSDGTIERYKARLVAKGYNQIEGLD 287 Query: 688 YFDTFSPVAKMTTVRTLIALAAIKGWHIHQLDVNNAFLHGQLQEKVYTSIPQGVQISGSN 867 YFDTF+PVAK+TTVR L+A+A+ + W +HQLD+NNAFLHG L E+VY IPQG+ N Sbjct: 288 YFDTFAPVAKITTVRMLLAIASCQKWELHQLDINNAFLHGDLNEEVYMQIPQGLTSPKPN 347 Query: 868 KVCKLLKSLYGLKQASRKWYERLTNLLLGLGFKQAHSDHSLFTKITQSSYIALLIYIDDI 1047 VCKL KSLYGLKQAS++W+ +L+N L + +KQ+ +DHSLFTK + + +L+Y+DD+ Sbjct: 348 LVCKLHKSLYGLKQASKQWFAKLSNFLQTMNYKQSPNDHSLFTKHINNHFTLILVYVDDL 407 Query: 1048 VLVGDCLSQLQQIKTVLDHNXXXXXXXXXXXXXXXEAAHSPRGISLCQRQYCLNLLTDTG 1227 ++ G S++ IK +LD E A S GI L QR+Y L+LL+DTG Sbjct: 408 IIGGTDSSEIVSIKGILDAQFKIKDLGHLKYFLGLEVARSHLGIHLSQRKYALDLLSDTG 467 Query: 1228 VLAAKPASTPMAPNSHLHQDDGPLFPDVECYRRLVGRLLYLTTTRPDITFAVQQLSQFLS 1407 LAAKP TPM + Q DG + D YRRLVG+LLYLTTTRPDI+FAVQQLSQ +S Sbjct: 468 FLAAKPVPTPMVKTNKASQSDGTPYADPAGYRRLVGKLLYLTTTRPDISFAVQQLSQHIS 527 Query: 1408 KPTMTHYKAAQRILHYLKGSPGQGLFFSRSSSLDLQGYTDSNWAGCPDSRRSISGYCFFL 1587 P+ HY AA R+L YLKG+PGQGLF+ S L+ ++DS+W CP S RS+SGYC FL Sbjct: 528 SPSTAHYMAATRVLRYLKGNPGQGLFYPAQSPSQLKAFSDSDWGSCPTSCRSVSGYCIFL 587 Query: 1588 GESLIAWRSKKQTTVACSSSEAEYRALAFATCELQWIQFLLDDLGIVHSKQPVLYCDNKS 1767 G+SLI+W+ KKQ+T++ SSSEAEYRALA CE+QW+ +LL D I + +LYCDN+S Sbjct: 588 GDSLISWKCKKQSTISRSSSEAEYRALANTVCEIQWLTYLLHDFVIPFTSHALLYCDNQS 647 Query: 1768 ALHIAANPVFHERTKHLEIDCHVVRDKVNSGLMKLQPMPSAHQVADLFTKALLPQSFSHL 1947 A +IA NPVFHERTKH+++DCH++R+K+ + L L P+ S Q+AD+ TK L P F++L Sbjct: 648 ARYIATNPVFHERTKHIDLDCHIIREKLQAKLFHLLPIRSTEQLADILTKPLEPTPFNYL 707 Query: 1948 VTKLQMLNIYQSLDCAG 1998 ++KL +LNIY S C G Sbjct: 708 LSKLGVLNIY-SPACGG 723 >KYP42564.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1427 Score = 625 bits (1613), Expect = 0.0 Identities = 328/677 (48%), Positives = 434/677 (64%), Gaps = 6/677 (0%) Frame = +1 Query: 1 RNVVFYENIFPYKCNSSRGSVAYLKRWDTGXXXXXXXXXXXXXXXXXXXXXXLSLNGQPS 180 RNVVFYE+IFP+ +GS L+ + Sbjct: 761 RNVVFYEHIFPF---FEKGSTVITNSQQQNDACFDFLYLDSSSHPVTTIDNSSLLDIDSA 817 Query: 181 SFPNPPNTPATSQDTPIDTNHLPDXXXXXXXXXXNLRHSTRPHKPPSHLNDYICNHISSI 360 + N N S P +T+ LR STR P++L DY CN + + Sbjct: 818 HYENDLNDIDESAH-PSETSS-----------PSQLRKSTRHKCSPAYLKDYHCNLLIGV 865 Query: 361 NPP-----RYPIHAYVSYASLSQPHRAYTMSLDAVPEPSSFCEASKHTCWVEAMQQELTA 525 PP RYP++ +SY SLS + Y +S+ EP +F +A K+ WVEAMQ EL A Sbjct: 866 PPPEDKHIRYPLNTVLSYDSLSASYSRYVLSITTHVEPHTFNQAVKNKVWVEAMQAELDA 925 Query: 526 LEKNQTWIIVDKPANVIPIGCKWVYKVKRKADGSLERYKARLVAKGYTQTEGIDYFDTFS 705 LE N+TW I+ P PIG KWVYK+K K+DGS+ERYKARLV KGYTQ +G+DYFDTF+ Sbjct: 926 LEHNKTWTIMPLPPGKTPIGSKWVYKIKYKSDGSIERYKARLVVKGYTQIQGLDYFDTFA 985 Query: 706 PVAKMTTVRTLIALAAIKGWHIHQLDVNNAFLHGQLQEKVYTSIPQGVQISGSNKVCKLL 885 PVAK++TVR L+A+A+ + W +HQLD+NNAFLHG L E VY IPQG+ I N VCKL Sbjct: 986 PVAKLSTVRMLLAIASCQHWELHQLDINNAFLHGDLLEDVYMEIPQGLNIDKPNHVCKLN 1045 Query: 886 KSLYGLKQASRKWYERLTNLLLGLGFKQAHSDHSLFTKITQSSYIALLIYIDDIVLVGDC 1065 KSLYGLKQASR+W+ +L++ LL L +KQ+ DHSLFTK + + +LIY+DD+++ G Sbjct: 1046 KSLYGLKQASRQWFAKLSSFLLSLHYKQSQHDHSLFTKHHGTHFTVILIYVDDLIIAGTD 1105 Query: 1066 LSQLQQIKTVLDHNXXXXXXXXXXXXXXXEAAHSPRGISLCQRQYCLNLLTDTGVLAAKP 1245 ++ IK LD E A S GISL QR+Y L+LL +T LA KP Sbjct: 1106 SEEINHIKQSLDVKFKIKDLGPLRYFLGLEIARSHLGISLSQRKYTLDLLDETSFLAGKP 1165 Query: 1246 ASTPMAPNSHL-HQDDGPLFPDVECYRRLVGRLLYLTTTRPDITFAVQQLSQFLSKPTMT 1422 TP+ + L H D P + D YRRL+G+LLYL TTRPDI+++VQQLSQFLS P + Sbjct: 1166 VLTPIIKGTRLSHTTDSP-YEDPAGYRRLIGKLLYLITTRPDISYSVQQLSQFLSCPQQS 1224 Query: 1423 HYKAAQRILHYLKGSPGQGLFFSRSSSLDLQGYTDSNWAGCPDSRRSISGYCFFLGESLI 1602 HY+AA R+L YLKG+PGQGLF+ S L L+ ++DS+WA CPD+RRS+SGY FLG SLI Sbjct: 1225 HYQAAIRVLRYLKGNPGQGLFYPADSPLQLKAFSDSDWASCPDTRRSLSGYSIFLGNSLI 1284 Query: 1603 AWRSKKQTTVACSSSEAEYRALAFATCELQWIQFLLDDLGIVHSKQPVLYCDNKSALHIA 1782 +W+ KKQ+T++ SSSEAEYRALA CE+QW+ +LL D + + +LYCDN+SA HIA Sbjct: 1285 SWKCKKQSTISRSSSEAEYRALAATACEIQWLTYLLQDFSVPFTTPALLYCDNQSARHIA 1344 Query: 1783 ANPVFHERTKHLEIDCHVVRDKVNSGLMKLQPMPSAHQVADLFTKALLPQSFSHLVTKLQ 1962 +N VFHERTKH+EIDCH+VR+K+ +GL L P+ S+HQ+AD+ TK L P F +L++KL Sbjct: 1345 SNAVFHERTKHIEIDCHLVREKLQAGLFHLLPIASSHQLADILTKPLDPSPFQYLLSKLG 1404 Query: 1963 MLNIYQSLDCAGIIQYN 2013 ++NIY S C G++ +N Sbjct: 1405 VINIY-SPACRGVLDHN 1420