BLASTX nr result
ID: Glycyrrhiza29_contig00015764
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza29_contig00015764 (1062 letters) Database: ./nr 115,041,592 sequences; 42,171,959,267 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value GAU22483.1 hypothetical protein TSUD_296020 [Trifolium subterran... 175 1e-75 GAU45885.1 hypothetical protein TSUD_401090, partial [Trifolium ... 166 2e-75 GAU50364.1 hypothetical protein TSUD_409370 [Trifolium subterran... 169 2e-74 GAU14396.1 hypothetical protein TSUD_249360 [Trifolium subterran... 167 2e-73 GAU42656.1 hypothetical protein TSUD_398610 [Trifolium subterran... 162 5e-73 GAU21273.1 hypothetical protein TSUD_286830 [Trifolium subterran... 163 2e-71 GAU51593.1 hypothetical protein TSUD_12600 [Trifolium subterraneum] 157 2e-71 GAU43816.1 hypothetical protein TSUD_248050 [Trifolium subterran... 151 6e-71 GAU33337.1 hypothetical protein TSUD_166020 [Trifolium subterran... 157 8e-71 GAU40490.1 hypothetical protein TSUD_189710 [Trifolium subterran... 155 1e-70 KYP46096.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan] 174 2e-69 GAU46774.1 hypothetical protein TSUD_402850 [Trifolium subterran... 163 4e-69 GAU32903.1 hypothetical protein TSUD_152630 [Trifolium subterran... 159 3e-68 GAU21183.1 hypothetical protein TSUD_11000 [Trifolium subterraneum] 160 8e-68 GAU48515.1 hypothetical protein TSUD_244350 [Trifolium subterran... 162 1e-67 GAU28506.1 hypothetical protein TSUD_156630 [Trifolium subterran... 158 2e-67 GAU24087.1 hypothetical protein TSUD_388800 [Trifolium subterran... 158 8e-67 GAU17884.1 hypothetical protein TSUD_330100 [Trifolium subterran... 145 2e-66 KYP44439.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus ca... 194 3e-66 GAU20577.1 hypothetical protein TSUD_33240 [Trifolium subterraneum] 153 5e-66 >GAU22483.1 hypothetical protein TSUD_296020 [Trifolium subterraneum] Length = 1115 Score = 175 bits (444), Expect(2) = 1e-75 Identities = 79/165 (47%), Positives = 109/165 (66%) Frame = +1 Query: 7 NKLSVSIVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNH 186 N ++ +VN+YAPCD GK +W+ LG L+ + RWCV G+FNS+++ ER G Sbjct: 95 NGVNFRVVNIYAPCDARGKADLWQRLGGLIQADSEARWCVCGDFNSVRNGGERCGRGGGV 154 Query: 187 RSEEIELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWG 366 E++ F +FI LIDLPL GR+FTW + DG +MSRLDRFL+S W TW N Q Sbjct: 155 VDAEVDRFNEFILNSELIDLPLHGRRFTWSRSDGSSMSRLDRFLLSGSWCTTWPNCIQVA 214 Query: 367 MQRSVSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKRM 501 + R +SDHC +IL+E + +WGP+PFRMM+CW + GY FV+ +M Sbjct: 215 ILRGLSDHCPLILREHEEDWGPRPFRMMKCWRDFPGYNSFVRDQM 259 Score = 137 bits (346), Expect(2) = 1e-75 Identities = 70/183 (38%), Positives = 100/183 (54%), Gaps = 1/183 (0%) Frame = +2 Query: 515 VRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVDGLNEVE-V 691 + GW G+ WH H N+ +KI EAKE + LDL+GE L + + + Sbjct: 264 IDGWGGFVLKEKFKLLKNSLRVWHLSHAKNIGSKILEAKERLLGLDLRGENANLTDDDLI 323 Query: 692 GDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEILGLDFDG 871 +RR + I S + C +LWQ SR KWL EGDANSKFFH R++ N I+ LD +G Sbjct: 324 IERREVTSTIFSLSRIECSMLWQNSRTKWLLEGDANSKFFHALANSRKRKNLIVLLDVNG 383 Query: 872 SFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSEEEVKAMV 1051 +EGV +R I +F S F ++ +RP GN++ T+ +E+ L F E+E K V Sbjct: 384 IQVEGVGNIRESIFNHFSSQFKSQRISRPDVGNLDFKTISESEATVLVEEFGEDETKQAV 443 Query: 1052 WNC 1060 W+C Sbjct: 444 WDC 446 >GAU45885.1 hypothetical protein TSUD_401090, partial [Trifolium subterraneum] Length = 751 Score = 166 bits (419), Expect(2) = 2e-75 Identities = 74/158 (46%), Positives = 103/158 (65%) Frame = +1 Query: 25 IVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEIE 204 ++N+YAPCDLG K+R+W L + S G R CV G+FN+++ ER+ + Sbjct: 60 VMNIYAPCDLGAKQRLWNSLSVRLQSLAGRRVCVCGDFNAVRCQEERRSSRVGPSQADHI 119 Query: 205 LFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRSVS 384 F FIE L+DLPL GRKFTWY+ DG +MSRLDRFL+SE+W W N Q R +S Sbjct: 120 PFNSFIEDNNLVDLPLGGRKFTWYRGDGLSMSRLDRFLLSEEWCLAWPNCLQVAQLRGLS 179 Query: 385 DHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498 DHC ++L + NWGP+P RM++CW++ GY+ FVK++ Sbjct: 180 DHCPLMLVASEENWGPRPLRMLKCWKDIPGYDLFVKEK 217 Score = 146 bits (368), Expect(2) = 2e-75 Identities = 75/190 (39%), Positives = 102/190 (53%) Frame = +2 Query: 491 KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670 K++W+ L V GW G+ WH H NL ++I K + LD KGE + Sbjct: 215 KEKWNSLHVDGWGGFVLKEKLKLIKVALKEWHLSHAQNLPSRIDSLKTRLSNLDNKGEEE 274 Query: 671 GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850 L+ EV D R + S ++ I WQ+SR+ WLKEGDANSK+FH + RR+ N I Sbjct: 275 DLSVDEVVDMRGITFEFHSLSRLHASISWQQSRLLWLKEGDANSKYFHSVLASRRRGNTI 334 Query: 851 LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030 L DG +EGVNP+R+ + +F SHF + RP N+ L + +LT PF Sbjct: 335 STLQADGVTLEGVNPIRQAVFTHFASHFKASNVERPGVDNLQFKRLSWLDIGSLTRPFLV 394 Query: 1031 EEVKAMVWNC 1060 EEVKA VW+C Sbjct: 395 EEVKAAVWDC 404 >GAU50364.1 hypothetical protein TSUD_409370 [Trifolium subterraneum] Length = 546 Score = 169 bits (428), Expect(2) = 2e-74 Identities = 76/158 (48%), Positives = 104/158 (65%) Frame = +1 Query: 25 IVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEIE 204 + NVYAPC LG K+ +W L + G R CV G+FN+++SI ER+ S + Sbjct: 34 LANVYAPCGLGAKQSLWNSLLGRILLLNGERVCVCGDFNAVRSIEERRSARAGSHSSDHI 93 Query: 205 LFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRSVS 384 F FI+ LIDLPL GRKFTWYK DG AMSR+DRFL+SE+W TW N Q R +S Sbjct: 94 PFNRFIDDAVLIDLPLSGRKFTWYKGDGLAMSRIDRFLLSEEWCLTWPNCVQVAQLRGLS 153 Query: 385 DHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498 DHC ++L+ ++ NWGP+P RM++CW++ GY+ FV+ + Sbjct: 154 DHCPLVLEVEEENWGPRPSRMLKCWKDIPGYQQFVRDK 191 Score = 140 bits (352), Expect(2) = 2e-74 Identities = 70/190 (36%), Positives = 98/190 (51%) Frame = +2 Query: 491 KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670 + +W +++ GW G+ WH H NL +I+ K + L++KGE Sbjct: 189 RDKWKAMQIVGWGGFVLKEKFKMIRLALKEWHAAHSQNLPGRIESLKVRLAALEVKGEAA 248 Query: 671 GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850 L+E E+ + I+ S + I WQ+SR WLKEGDANSK+FH V RR+ N + Sbjct: 249 VLSEAELEELHGLTTEIHSLSRRSASICWQQSRSLWLKEGDANSKYFHSVVASRRRGNAV 308 Query: 851 LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030 + DG EGV P+R+ + +F SHF ARP N+ L E +LT PFS Sbjct: 309 SFIQVDGVTTEGVQPIRQAVFEHFASHFKESHVARPGVDNLQFKRLTLLEGGSLTKPFSL 368 Query: 1031 EEVKAMVWNC 1060 EEVK VW+C Sbjct: 369 EEVKTAVWDC 378 >GAU14396.1 hypothetical protein TSUD_249360 [Trifolium subterraneum] Length = 845 Score = 167 bits (422), Expect(2) = 2e-73 Identities = 74/155 (47%), Positives = 100/155 (64%) Frame = +1 Query: 25 IVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEIE 204 + NVYAPCD K+ +W+ L + G R CV G+FN+ + ER+ V RS + Sbjct: 244 LFNVYAPCDDNAKQVLWDSLSGKLQQLAGKRVCVCGDFNAARGAEERRSVRLGFRSIDHG 303 Query: 205 LFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRSVS 384 F FIE GL+DLPL GR++TW+K DG++MSR+DRFL+SEDW TW N Q R +S Sbjct: 304 PFNQFIEANGLVDLPLSGRRYTWFKGDGRSMSRIDRFLLSEDWCLTWPNCIQVAQLRGLS 363 Query: 385 DHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFV 489 DHC IL + NWGP+P RM++CW +T G++ FV Sbjct: 364 DHCPFILSMDEENWGPRPVRMLKCWHDTPGFKQFV 398 Score = 139 bits (349), Expect(2) = 2e-73 Identities = 68/188 (36%), Positives = 103/188 (54%) Frame = +2 Query: 497 EWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVDGL 676 +W L+V GW G+ WH NL ++ + + + LD KGE + L Sbjct: 401 KWRSLEVDGWGGFVLKEKLKLIKMALKDWHGTQARNLPGRLNDLRNRLSVLDSKGEEEVL 460 Query: 677 NEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEILG 856 + E+ + R +I+ S MN I WQ+SR++WL+EGDANSK+FH + RR+ N Sbjct: 461 TDEELVELRTITYDIHALSRMNTSISWQQSRLQWLREGDANSKYFHSVLVSRRRQNAFSV 520 Query: 857 LDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSEEE 1036 + DG +EGV +R+ + +F SHF + + RP+ ++ P+L AE L PFS EE Sbjct: 521 IMVDGERVEGVQAVRQALFSHFSSHFRSCNMPRPTVEELHFPSLSFAEGAGLVKPFSVEE 580 Query: 1037 VKAMVWNC 1060 VKA +W+C Sbjct: 581 VKAAIWDC 588 >GAU42656.1 hypothetical protein TSUD_398610 [Trifolium subterraneum] Length = 1707 Score = 162 bits (409), Expect(2) = 5e-73 Identities = 71/158 (44%), Positives = 104/158 (65%) Frame = +1 Query: 25 IVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEIE 204 + NVY+PCD G K+ +W+ L S G R CV G+FN++ + ER+ + RS + Sbjct: 980 VANVYSPCDDGAKQGLWDSLLVRFQSLGRERVCVCGDFNAVTHVDERRSIGGALRSTDYI 1039 Query: 205 LFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRSVS 384 F FI+ L+DLPL GRKFTWY+ DG++MSRLDRF++SE+W TW N Q R +S Sbjct: 1040 PFNRFIDDNNLVDLPLRGRKFTWYRGDGQSMSRLDRFMLSEEWCLTWPNCEQVAKLRGLS 1099 Query: 385 DHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498 DHC ++L + +WGP+P RM++CW++ GY FV+++ Sbjct: 1100 DHCPLVLSANEEDWGPRPLRMLKCWKDVPGYNLFVREK 1137 Score = 142 bits (358), Expect(2) = 5e-73 Identities = 72/190 (37%), Positives = 105/190 (55%) Frame = +2 Query: 491 KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670 +++W +V GW GY WHK H NL ++I K V LD KGE + Sbjct: 1135 REKWKSFQVDGWGGYAALKE----------WHKAHVQNLPSRIDSLKYRVSELDQKGEEE 1184 Query: 671 GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850 L+ EV + A+I+ S ++ I WQ+SR WLK+GD NSK+FH + RR+ N I Sbjct: 1185 VLSGDEVAELHGATADIHSLSRLHASISWQQSRSLWLKDGDVNSKYFHSILAGRRRRNAI 1244 Query: 851 LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030 + G +EGV+ +R+ + +F SHF N + RP N+ L +ES++LT PF+E Sbjct: 1245 STIQVGGVALEGVSSIRQAVFSHFASHFKNSNVGRPGVDNLQFKRLDHSESSSLTKPFTE 1304 Query: 1031 EEVKAMVWNC 1060 EVK+ VW+C Sbjct: 1305 NEVKSAVWDC 1314 >GAU21273.1 hypothetical protein TSUD_286830 [Trifolium subterraneum] Length = 1449 Score = 163 bits (412), Expect(2) = 2e-71 Identities = 76/156 (48%), Positives = 100/156 (64%) Frame = +1 Query: 31 NVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEIELF 210 NVYAPCD K+R+W+ L + + S G R CV G+FN++KS+ E + + S + F Sbjct: 435 NVYAPCDARAKQRLWDSLSSRIQSLGRQRVCVCGDFNAVKSLDETRSLRGAQNSSDFLAF 494 Query: 211 CDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRSVSDH 390 FIE L+DLPL GRK TWYK DG +MSRLDRFL+SEDW TW Q R VSDH Sbjct: 495 NLFIEDNTLVDLPLSGRKLTWYKGDGLSMSRLDRFLLSEDWCLTWPYCKQEARMRGVSDH 554 Query: 391 CAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498 C +IL + +WGP+P RM++CW+ GY FV+ + Sbjct: 555 CPLILSANEEDWGPRPSRMLKCWKLVPGYNLFVRDK 590 Score = 136 bits (342), Expect(2) = 2e-71 Identities = 68/190 (35%), Positives = 100/190 (52%) Frame = +2 Query: 491 KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670 + +W+ V GW GY WH H NL ++I K LD KGE D Sbjct: 588 RDKWNSFLVNGWGGYVLKEKFKMIKVALKEWHMTHTKNLPSRIDSLKVRQSCLDQKGEED 647 Query: 671 GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850 L+E E+ + ++I+ S ++ + WQ+SR WLKEGD NSKFFH + RR+ N I Sbjct: 648 VLSEAELEELHGVTSDIHTLSRLHASVCWQQSRSLWLKEGDVNSKFFHSVLASRRRGNAI 707 Query: 851 LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030 + DG +EGV+ +R+ + +F +HF + RP ++ TL E + L PF+ Sbjct: 708 SSIVVDGVPLEGVSSVRQAVVSHFAAHFKTSNVVRPRVDDLIFNTLNQVECSNLIKPFTR 767 Query: 1031 EEVKAMVWNC 1060 +EVKA VW+C Sbjct: 768 DEVKAAVWDC 777 >GAU51593.1 hypothetical protein TSUD_12600 [Trifolium subterraneum] Length = 851 Score = 157 bits (396), Expect(2) = 2e-71 Identities = 74/161 (45%), Positives = 102/161 (63%), Gaps = 3/161 (1%) Frame = +1 Query: 25 IVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEIE 204 + NVYAPC+ G+ +W L ++ WCV+G+FN+++ ER G + N + Sbjct: 34 LANVYAPCEASGRALLWRALEGKISHFANMAWCVVGDFNAVRGSEERSGRSNNPIQNSVV 93 Query: 205 LFCDF---IEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQR 375 + DF I+ LIDLPL GRKFTWY+ DG MSRLDRFL+SE W+ + N Q + R Sbjct: 94 EYSDFNSFIDNNFLIDLPLGGRKFTWYRGDGITMSRLDRFLLSESWISRFPNSIQEALPR 153 Query: 376 SVSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498 ++SDHC V L ++NWGPKP RM++CW + GY DFVK+R Sbjct: 154 TLSDHCPVQLSIDELNWGPKPQRMLKCWVDIQGYHDFVKER 194 Score = 142 bits (358), Expect(2) = 2e-71 Identities = 70/190 (36%), Positives = 101/190 (53%) Frame = +2 Query: 491 KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670 K+ WS +V GW G+ WH +H NL+ KI++A + D+ GE Sbjct: 192 KERWSSFQVHGWSGHILKTKLKFIKAELRNWHFNHTANLDGKIRDATTRLEEFDIIGETR 251 Query: 671 GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850 L+ E + ANI FS++ +LWQKSR+ WL+EGDANSKFF + RR++N I Sbjct: 252 RLDTNEELELHSAQANIVSFSNLQASMLWQKSRVNWLREGDANSKFFQGLMSSRRQSNTI 311 Query: 851 LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030 + L G +EGV +R E+ +F +HF + RP +N T+ S L PF Sbjct: 312 ISLQAGGRVVEGVEEVRWEVFQHFRNHFRKQTVTRPDMQGLNFKTISEDNSAELVKPFLL 371 Query: 1031 EEVKAMVWNC 1060 +E+KA VW+C Sbjct: 372 DEIKAAVWDC 381 >GAU43816.1 hypothetical protein TSUD_248050 [Trifolium subterraneum] Length = 1355 Score = 151 bits (381), Expect(2) = 6e-71 Identities = 72/165 (43%), Positives = 101/165 (61%) Frame = +1 Query: 4 ENKLSVSIVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACN 183 + ++ + NVYAP D G+ +W L + + WCVL +FN ++ ER A + Sbjct: 354 KQNINFCLANVYAPYDYTGRPILWNNLESKILHFSQAAWCVLRDFNVVRYADERVSRASH 413 Query: 184 HRSEEIELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQW 363 +E F +FI+ LIDL L GRKFTWY+ DGK+MSRLDRFL S+ WL + N Q Sbjct: 414 SALDEFVAFNNFIDSTLLIDLTLCGRKFTWYRGDGKSMSRLDRFLFSDVWLAEFPNCIQA 473 Query: 364 GMQRSVSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498 + RS+SDHC + L NWGPKP RM++CW + GYE+FV+++ Sbjct: 474 ALPRSLSDHCPIQLSIDVQNWGPKPLRMLKCWADIAGYEEFVEEK 518 Score = 146 bits (368), Expect(2) = 6e-71 Identities = 72/190 (37%), Positives = 104/190 (54%) Frame = +2 Query: 491 KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670 +++W +V GW G+ WH +H NL+ KI+ AK + LDL GE Sbjct: 516 EEKWHSFQVHGWSGHILKSKLKFIKSELKSWHLNHTANLDNKIQVAKTRLEELDLSGEER 575 Query: 671 GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850 L EVE + ANI+ FS + WQKSR+ WLKEGDANSKFFH + RR++N I Sbjct: 576 WLFEVEEVEMCSLQANISAFSKSQASMHWQKSRVSWLKEGDANSKFFHGIMSSRRQSNSI 635 Query: 851 LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030 + L +G +EGVN +R+ I +F HF ++ RP + ++ A+ L+ PF Sbjct: 636 VSLSSNGRTVEGVNEIRQVIFQHFSQHFRRKNHNRPDISGLVFNSISEADGEFLSRPFLL 695 Query: 1031 EEVKAMVWNC 1060 +E+K VW+C Sbjct: 696 DEIKKAVWDC 705 >GAU33337.1 hypothetical protein TSUD_166020 [Trifolium subterraneum] Length = 1227 Score = 157 bits (398), Expect(2) = 8e-71 Identities = 73/161 (45%), Positives = 103/161 (63%), Gaps = 3/161 (1%) Frame = +1 Query: 25 IVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEIE 204 + NVYAPC+ G+ +W+ L ++ WCV+G+FN+++ E+ G + N + Sbjct: 420 LANVYAPCEASGRALIWQALEGKISHFVNMAWCVVGDFNAVRGSEEQSGRSSNPIQNSVV 479 Query: 205 LFCDF---IEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQR 375 + DF I+ LIDLPL GRKFTWY+ DG MSRLDRFL+SE W+ + N Q + R Sbjct: 480 EYSDFNSFIDNNFLIDLPLGGRKFTWYRGDGITMSRLDRFLLSESWISRFPNCIQEALPR 539 Query: 376 SVSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498 ++SDHC V L ++NWGPKP RM++CW + GY DFVK+R Sbjct: 540 TLSDHCPVQLSIDELNWGPKPHRMLKCWVDIQGYHDFVKER 580 Score = 139 bits (350), Expect(2) = 8e-71 Identities = 70/190 (36%), Positives = 102/190 (53%) Frame = +2 Query: 491 KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670 K+ WS +V GW G+ WH +H NL+ KI++AK + D+ GE Sbjct: 578 KERWSSFQVHGWSGHILKTKLKFIKAELRNWHLNHTANLDGKIRDAKNRLEEFDVIGETR 637 Query: 671 GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850 L+ E + ANI FS + +LWQKSR+ WLKEG+ANSKFF + RR++N I Sbjct: 638 RLDTNEELELHSVQANIVSFSKLQASMLWQKSRVNWLKEGNANSKFFQGLMSSRRQSNTI 697 Query: 851 LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030 + L G +EGV +R E+ +F +HF + +RP +N ++ S L PF Sbjct: 698 ISLQAVGRVVEGVKEVRWEVFQHFCNHFRKQTVSRPYMQGLNFKSISKDNSAELVKPFLL 757 Query: 1031 EEVKAMVWNC 1060 +E+KA VW+C Sbjct: 758 DEIKAAVWDC 767 >GAU40490.1 hypothetical protein TSUD_189710 [Trifolium subterraneum] Length = 1087 Score = 155 bits (393), Expect(2) = 1e-70 Identities = 72/159 (45%), Positives = 100/159 (62%) Frame = +1 Query: 22 SIVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEI 201 S+ NVYAPCD G K+R+W+ + G R CV G FN++++I ER+ S + Sbjct: 105 SVTNVYAPCDDGEKQRLWDLFSARIQLLVGRRVCVCGAFNAVRTIDERRFARGGSNSLDH 164 Query: 202 ELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRSV 381 F FI+ LIDLPL GRKFTW+K DG +MSR+DRFL+SE+W W N Q R + Sbjct: 165 IPFNRFIDDNNLIDLPLSGRKFTWFKGDGFSMSRIDRFLLSEEWCLAWPNCRQVARLRGL 224 Query: 382 SDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498 SDHC ++L + +WGP+P RM++CW + GY FV+ + Sbjct: 225 SDHCPIVLSANEEDWGPRPSRMLKCWRDVPGYNVFVRDK 263 Score = 140 bits (353), Expect(2) = 1e-70 Identities = 71/190 (37%), Positives = 104/190 (54%) Frame = +2 Query: 491 KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670 + +W+ L+V W GY WH H NL ++I+ K + LD KGE Sbjct: 261 RDKWNSLQVDSWGGYVLKEKLKMIKAALKEWHSVHVQNLPSRIESLKARLTDLDQKGEDG 320 Query: 671 GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850 L+E E+ + ++I+ S MN I WQ+SR WLKEGDANSK+FH + RR+ N I Sbjct: 321 VLSEDEIVELHEVSSDIHSLSRMNASICWQQSRSLWLKEGDANSKYFHSVLAGRRRRNAI 380 Query: 851 LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030 + + +EGV+P+R+ + +F SHF + RP + L E ++LT PFSE Sbjct: 381 SVIQVEEVTLEGVDPIRQAVFSHFTSHFKATNVERPGVVTLQFKRLNQLERSSLTKPFSE 440 Query: 1031 EEVKAMVWNC 1060 EVK++VW+C Sbjct: 441 AEVKSVVWDC 450 >KYP46096.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan] Length = 729 Score = 174 bits (440), Expect(2) = 2e-69 Identities = 76/166 (45%), Positives = 113/166 (68%) Frame = +1 Query: 1 GENKLSVSIVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVAC 180 G +K+ IVN+Y+PCDL GK+ +WEE+ + S G GRWC+ G+FN+++ SERKGV Sbjct: 18 GYDKIPCFIVNIYSPCDLRGKKNLWEEIHKIKNSYGSGRWCICGDFNTVRLKSERKGVHT 77 Query: 181 NHRSEEIELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQ 360 +E+ + FIE LIDLPL G K+TW++P+ SR+DRFL+S++WL W + SQ Sbjct: 78 RREEKEMLCYNQFIEDVELIDLPLGGGKYTWFRPNRIIASRIDRFLVSQEWLTQWPHCSQ 137 Query: 361 WGMQRSVSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498 +QR VSDH ++LK+ ++WGPKPFR + CW + + FV+++ Sbjct: 138 KALQRDVSDHRPILLKDIRLDWGPKPFRSLNCWFDDPSFLGFVEEK 183 Score = 118 bits (295), Expect(2) = 2e-69 Identities = 65/192 (33%), Positives = 94/192 (48%), Gaps = 2/192 (1%) Frame = +2 Query: 491 KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670 +++W G V GW + W+K FGN+ T+I+E K + LD E Sbjct: 181 EEKWKGFSVTGWGAFILKEKLKHLKKSIKEWNKQAFGNIHTQIEEVKRNINSLDSIVETR 240 Query: 671 GLNEVEVGDRRLCIANINRFSSMNCR--ILWQKSRMKWLKEGDANSKFFHRCVQRRRKAN 844 LNE +V DRR N+ + +N + +L QKSR+KW +EGD+NS FFH CV +RRK N Sbjct: 241 SLNERKVSDRRNL--NVKLWDLLNKKESLLLQKSRLKWAREGDSNSSFFHMCVNKRRKMN 298 Query: 845 EILGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPF 1024 EI+GLD +G ++ I L T + +LT PF Sbjct: 299 EIIGLDVNGKWL---------------------------LDGIQFQQLNTHQCRSLTRPF 331 Query: 1025 SEEEVKAMVWNC 1060 + EE++ VW+C Sbjct: 332 TAEEIREAVWSC 343 >GAU46774.1 hypothetical protein TSUD_402850 [Trifolium subterraneum] Length = 908 Score = 163 bits (413), Expect(2) = 4e-69 Identities = 76/158 (48%), Positives = 99/158 (62%) Frame = +1 Query: 25 IVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEIE 204 + NVYAPCDLG K+ +W L + + G R CV G+FN+++ I ER+ RS +I Sbjct: 454 LANVYAPCDLGAKQVLWASLSDQIQLLGRRRMCVSGDFNAVRCIEERRSPRTVTRSTDIL 513 Query: 205 LFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRSVS 384 F FI+ IDLPL GRKFTWYK DG MSRLDRFL+SEDW W N R +S Sbjct: 514 PFNQFIDEMFFIDLPLSGRKFTWYKGDGHTMSRLDRFLLSEDWCLAWPNCVHVAQLRGLS 573 Query: 385 DHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498 DHC +IL + NWGP+ RM++CW + GY +FV+ + Sbjct: 574 DHCPLILSADEENWGPRSSRMLKCWTDVPGYVNFVRDK 611 Score = 127 bits (320), Expect(2) = 4e-69 Identities = 65/190 (34%), Positives = 98/190 (51%) Frame = +2 Query: 491 KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670 + +W+ +V GW G+ WH+ H N+ +I K + LD KGE Sbjct: 609 RDKWNSFQVNGWGGFVLKEKLKMIKLALKEWHEAHVRNIPRRIDSLKVRLSDLDSKGEEA 668 Query: 671 GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850 L++ EV + +I+ S +N I Q+SR +WL+EGDAN+K+FH + RR+ N I Sbjct: 669 SLSDEEVQELHGITLDIHSLSRLNASICRQQSRSRWLREGDANTKYFHSVLTNRRRGNTI 728 Query: 851 LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030 L +G GV+P+R+ + +F HF + RP N++ L E +L FS Sbjct: 729 SSLQVNGVTTRGVHPIRQAVFTHFADHFKVNNVDRPRVENLHFRRLNPLECGSLIKAFSL 788 Query: 1031 EEVKAMVWNC 1060 EEVKA VW+C Sbjct: 789 EEVKAAVWDC 798 >GAU32903.1 hypothetical protein TSUD_152630 [Trifolium subterraneum] Length = 1715 Score = 159 bits (401), Expect(2) = 3e-68 Identities = 77/165 (46%), Positives = 107/165 (64%) Frame = +1 Query: 4 ENKLSVSIVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACN 183 ++ L+ + NVYAPCD G+ +W EL + WCVLG+FN+I+S ER + Sbjct: 762 KDDLAFCLANVYAPCDARGRSLLWRELDVKLLQIPLSVWCVLGDFNAIRSRDERVSRGSS 821 Query: 184 HRSEEIELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQW 363 E+ F +FI+ LIDLPL GR FTWY DG +MSRLDRFLIS+ W+F++ + Q Sbjct: 822 G-VEDYMAFKNFIDRNALIDLPLGGRSFTWYSGDGLSMSRLDRFLISDSWVFSFPHCVQM 880 Query: 364 GMQRSVSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498 + RS+SDHC ++L +WGPKPFRMM+CW + GY +FVK++ Sbjct: 881 ALPRSLSDHCPIMLSVDVQDWGPKPFRMMKCWADISGYAEFVKQK 925 Score = 129 bits (324), Expect(2) = 3e-68 Identities = 65/190 (34%), Positives = 96/190 (50%) Frame = +2 Query: 491 KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670 K++W ++ GW G+ WH+ H NL+ KI+ K + LD+ E Sbjct: 923 KQKWQSFQIHGWSGHILKTKLKLLKAELRSWHQIHTANLDGKIQRDKSRLEELDICKEGR 982 Query: 671 GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850 GL+ VE + +I S + + WQKSR+ WL+EGDANSKFFH + RR+AN I Sbjct: 983 GLDVVEEAELLSLPVDILALSKLQASMYWQKSRVTWLREGDANSKFFHGVMSSRRRANSI 1042 Query: 851 LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030 L G +E V +R + ++ +HF + RP + +L + LT PF Sbjct: 1043 GALIHAGRTVESVPEVRHIVYQHYSNHFRKQMHYRPDISRLEFRSLSSLHGAELTKPFLM 1102 Query: 1031 EEVKAMVWNC 1060 EE+KA VW+C Sbjct: 1103 EEIKAAVWDC 1112 >GAU21183.1 hypothetical protein TSUD_11000 [Trifolium subterraneum] Length = 482 Score = 160 bits (404), Expect(2) = 8e-68 Identities = 79/157 (50%), Positives = 100/157 (63%), Gaps = 2/157 (1%) Frame = +1 Query: 25 IVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVA--CNHRSEE 198 + NVYAPCD GK+ +W+ LG + + WCV G+FNSI+S ERKG N S Sbjct: 171 VANVYAPCDGNGKQLLWDRLGARLLNSDVS-WCVCGDFNSIRSDEERKGRGGVVNFAS-- 227 Query: 199 IELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRS 378 F FIE L DLPL GR+FTWY+ DG +MSRLDRFL+SE W W N Q + R Sbjct: 228 ---FNSFIEDAALSDLPLCGRQFTWYRGDGVSMSRLDRFLLSEVWCQNWPNCFQLALPRG 284 Query: 379 VSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFV 489 +SDHC ++L + NWGPKPFRM+R W + GY++FV Sbjct: 285 LSDHCPIVLSVDEENWGPKPFRMLRSWSDMPGYKEFV 321 Score = 127 bits (318), Expect(2) = 8e-68 Identities = 62/147 (42%), Positives = 84/147 (57%) Frame = +2 Query: 494 KEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVDG 673 ++W V GW G+ WH H N+E +IKE KE ++ LD+KGE G Sbjct: 323 EKWRSFNVSGWGGFVLKEKLKLLKGSLKEWHLKHGRNIEGRIKETKERMHDLDVKGEGVG 382 Query: 674 LNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEIL 853 LN+ E + R+ + SS+NCR WQKSR+ WLK+GDANSKFFH + RR+ N I Sbjct: 383 LNDEEREELRVLSTQVLSLSSLNCRNQWQKSRLVWLKDGDANSKFFHDVMSSRRRGNAIH 442 Query: 854 GLDFDGSFIEGVNPLRREIRGYFESHF 934 L +G +EGV+ +R I +FE HF Sbjct: 443 NLVVEGHQVEGVSGMRNAIFNHFEKHF 469 >GAU48515.1 hypothetical protein TSUD_244350 [Trifolium subterraneum] Length = 1633 Score = 162 bits (410), Expect(2) = 1e-67 Identities = 75/158 (47%), Positives = 102/158 (64%) Frame = +1 Query: 25 IVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEIE 204 + NVYAPCD G K +W L + S G R CV G+FN++K + ER+ RS + Sbjct: 802 VANVYAPCDDGAKLVLWGSLSARIQSLGRQRLCVYGDFNAVKLVDERRSSRGESRSLDHI 861 Query: 205 LFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRSVS 384 F FI+ LIDLPL GRKFTW+K DG +MSRLDRFL+SE+W TW N +Q R +S Sbjct: 862 PFNSFIDDNNLIDLPLSGRKFTWFKGDGLSMSRLDRFLLSEEWCLTWPNCTQTASLRGLS 921 Query: 385 DHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498 DHC+++L + +WG +P RM++CW + GY+ FVK + Sbjct: 922 DHCSLVLSANEDDWGSRPSRMLKCWRDVPGYKGFVKDK 959 Score = 124 bits (311), Expect(2) = 1e-67 Identities = 62/162 (38%), Positives = 90/162 (55%) Frame = +2 Query: 491 KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670 K +W+ +V GW G+ WH H NL ++I+ K + LDLKGE + Sbjct: 957 KDKWNSFQVDGWGGFVLKEKLRMIKTALKDWHTAHAQNLPSRIESLKARLSTLDLKGEEE 1016 Query: 671 GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850 L+E E+ + A+I+ S M+ I WQ+SR WLKEGDANSK+FH + RR+ N I Sbjct: 1017 ALSEDEINELHGISADIHSLSRMHASISWQQSRSLWLKEGDANSKYFHSVLAGRRRGNTI 1076 Query: 851 LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNIN 976 + DG +EGV P+R+ + +F SHF + RP GN++ Sbjct: 1077 SVIHADGVTLEGVLPIRQAVFSHFASHFKAINMERPRVGNLH 1118 >GAU28506.1 hypothetical protein TSUD_156630 [Trifolium subterraneum] Length = 1091 Score = 158 bits (399), Expect(2) = 2e-67 Identities = 72/159 (45%), Positives = 103/159 (64%) Frame = +1 Query: 22 SIVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEI 201 S+ NVYAPCD G K+++W+ L + + G R CV G+FN+++S+ ER+ V+ +S + Sbjct: 105 SVANVYAPCDPGAKQQLWDSLSERIQALGRSRVCVCGDFNAVRSLEERRSVSGRSQSLDH 164 Query: 202 ELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRSV 381 F FI+ LIDLPL GRKFTW+K D +MSRLDRFL+S +W TW N +Q R + Sbjct: 165 ISFNRFIDDNNLIDLPLCGRKFTWFKGDDLSMSRLDRFLLSGEWCLTWPNCTQVARMRGL 224 Query: 382 SDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498 S H +IL WGP+P RM++CW++ GY F+K + Sbjct: 225 SHHYPLILAVNVEEWGPRPSRMLKCWKDVPGYNTFIKDK 263 Score = 127 bits (319), Expect(2) = 2e-67 Identities = 65/190 (34%), Positives = 100/190 (52%) Frame = +2 Query: 491 KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670 K +W+ +V GW G+ WHK H NL + I+ ++ + LD K Sbjct: 261 KDKWNSFQVVGWGGFVLKEKFKMIKMALKDWHKTHTQNLPSGIESLQDRLAALDEKEGDV 320 Query: 671 GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850 L++VE+ + +I+ S +N I WQ+SR +WL EGDANSK+FH + RR+ N I Sbjct: 321 VLSDVEIAELHGVTLDIHSLSRLNASICWQQSRSRWLSEGDANSKYFHSVLANRRRGNAI 380 Query: 851 LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030 L + +EGV+P+++ + + SHF + RP ++N L E ++L FS Sbjct: 381 SSLQVGNATVEGVDPIKQAVVCHLASHFKVVNVERPGVDSLNFKRLHPPEVSSLIKSFSL 440 Query: 1031 EEVKAMVWNC 1060 EVKA VW+C Sbjct: 441 AEVKAAVWDC 450 >GAU24087.1 hypothetical protein TSUD_388800 [Trifolium subterraneum] Length = 1985 Score = 158 bits (400), Expect(2) = 8e-67 Identities = 76/162 (46%), Positives = 104/162 (64%), Gaps = 6/162 (3%) Frame = +1 Query: 25 IVNVYAPCDLGGKRRVWEELGNLMASKGG---GRWCVLGNFNSIKSISERKGVACN---H 186 IVNVYA C+L KR +W N++ SK G G WCVLG+FNS++ +ER+GV N Sbjct: 857 IVNVYAKCNLRNKRTLW---ANILMSKSGFGEGLWCVLGDFNSVRDSNERRGVVGNVDGQ 913 Query: 187 RSEEIELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWG 366 RS E+ F F+ L+D+PLIGR+FTW+ P+G +MSRLDR LIS DW WG + W Sbjct: 914 RSSEMVAFDLFLNNLDLVDMPLIGRRFTWFHPNGVSMSRLDRILISSDWADVWGTPNVWA 973 Query: 367 MQRSVSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVK 492 M R V+DHC ++L+ +WGP+PFR W E +++ +K Sbjct: 974 MDRDVADHCPLVLRYSLADWGPRPFRFSNFWLEHREFKEVIK 1015 Score = 125 bits (313), Expect(2) = 8e-67 Identities = 60/190 (31%), Positives = 97/190 (51%) Frame = +2 Query: 491 KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670 K W GW G+ W + +G E K K +++ LDLK E Sbjct: 1015 KTAWDAHVAEGWMGFILKERLKVLKGVVKEWSRRTYGEAEAKKKRLIKDILALDLKSETT 1074 Query: 671 GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850 GL + EV +R++ ++ +++Q+SR KWLKEGD NS++FH C++ R++ N + Sbjct: 1075 GLLQGEVVERKILFDDLWITLKSMDAMIFQRSRSKWLKEGDTNSQYFHNCIKARKRRNNM 1134 Query: 851 LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030 + L ++EG + +R E+ +F +HF+N +W RP+ I P L A LT F+ Sbjct: 1135 VALRTRNGWVEGPSLVREEVVSFFRNHFSNEEWHRPTLNGIEFPRLSLARVEELTAMFTL 1194 Query: 1031 EEVKAMVWNC 1060 EE+ +V C Sbjct: 1195 EEISEVVRGC 1204 >GAU17884.1 hypothetical protein TSUD_330100 [Trifolium subterraneum] Length = 558 Score = 145 bits (367), Expect(2) = 2e-66 Identities = 72/190 (37%), Positives = 102/190 (53%) Frame = +2 Query: 491 KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670 K W+ +V GW GY WH H NL ++I ++ + LD KG + Sbjct: 154 KDRWNSYQVDGWGGYVLKEKFKMIKMALKDWHMTHTQNLPSRIMSLQDRIAELDEKGVEE 213 Query: 671 GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850 L+ E+ D ++++ S +N I WQ+SR +WLKEGDANSK+FH + RR+ N I Sbjct: 214 DLSGAEIDDLHGATSDLHTLSRLNASICWQQSRSRWLKEGDANSKYFHSVLANRRRGNAI 273 Query: 851 LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030 L+ G +EGV P+R+ I +F SHF D RP +++ L E+ L PFS Sbjct: 274 SSLEVGGVTVEGVAPIRQAIVCHFASHFKAVDVVRPGVNSLSFKRLHPTEAGNLIKPFSL 333 Query: 1031 EEVKAMVWNC 1060 EEVKA VW+C Sbjct: 334 EEVKAAVWDC 343 Score = 136 bits (342), Expect(2) = 2e-66 Identities = 65/131 (49%), Positives = 84/131 (64%) Frame = +1 Query: 106 GGGRWCVLGNFNSIKSISERKGVACNHRSEEIELFCDFIEVCGLIDLPLIGRKFTWYKPD 285 G R CV G+FN+++ I ER+ +S + F FIE LIDLPL GRKFTWYK D Sbjct: 26 GQSRVCVCGDFNAVRRIEERRSGRGRPQSLDHHSFNRFIEDNTLIDLPLSGRKFTWYKGD 85 Query: 286 GKAMSRLDRFLISEDWLFTWGNLSQWGMQRSVSDHCAVILKEKDINWGPKPFRMMRCWEE 465 G +MSRLDRFLIS +W W + +Q R +SDHC +IL +WGP+P RM++CW++ Sbjct: 86 GLSMSRLDRFLISPEWCLAWPDCTQTARMRGLSDHCPLILASNVEDWGPRPSRMLKCWKD 145 Query: 466 TVGYEDFVKKR 498 GY FVK R Sbjct: 146 VPGYNIFVKDR 156 >KYP44439.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan] Length = 1142 Score = 194 bits (494), Expect(2) = 3e-66 Identities = 90/184 (48%), Positives = 118/184 (64%) Frame = +1 Query: 1 GENKLSVSIVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVAC 180 GE + +VNVYA C K+ +W +L +L S+G G+WC +G+FNSIK ERKG + Sbjct: 60 GEENVDCWVVNVYASCSHELKKHLWGKLQSLKQSRGDGKWCFIGDFNSIKHADERKGTSV 119 Query: 181 NHRSEEIELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQ 360 R EEIE F DFI+ LID+PL+GRKFTWY+PDG SRLDR L++ WL W N Sbjct: 120 ILRREEIECFVDFIDNLSLIDMPLLGRKFTWYRPDGSCKSRLDRCLVTTGWLDQWSNACL 179 Query: 361 WGMQRSVSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKRMVWAKSEGVERVHV 540 W + R VSD+CA++LK +D+NWGPKPFR + W GY FV+K K E ++ + Sbjct: 180 WALNRGVSDYCAIVLKSEDVNWGPKPFRFLNSWRHEPGYAYFVRKEWFVLK-EKLKTIRS 238 Query: 541 KRKI 552 K KI Sbjct: 239 KLKI 242 Score = 87.0 bits (214), Expect(2) = 3e-66 Identities = 49/160 (30%), Positives = 74/160 (46%) Frame = +2 Query: 581 WHKDHFGNLETKIKEAKEEVYRLDLKGEVDGLNEVEVGDRRLCIANINRFSSMNCRILWQ 760 W+K+ FG+L KI + K+ + + D+K E GL+ EV R+ +A +L+Q Sbjct: 243 WNKEVFGDLNLKISKVKDGIKQCDIKDEESGLSPSEVVQRKEYMAQWQMLMQKKDTLLFQ 302 Query: 761 KSRMKWLKEGDANSKFFHRCVQRRRKANEILGLDFDGSFIEGVNPLRREIRGYFESHFNN 940 KSR+KWL+EGDAN+K+FH C+ +R K N Sbjct: 303 KSRLKWLQEGDANTKYFHGCINKRLKLNH------------------------------- 331 Query: 941 RDWARPSFGNINIPTLGTAESNALTNPFSEEEVKAMVWNC 1060 RP + L + + L PF+ +EVK VW+C Sbjct: 332 ----RPVLNGLVFKRLNLDQVDVLIKPFTLQEVKEAVWDC 367 >GAU20577.1 hypothetical protein TSUD_33240 [Trifolium subterraneum] Length = 1732 Score = 153 bits (386), Expect(2) = 5e-66 Identities = 74/165 (44%), Positives = 104/165 (63%) Frame = +1 Query: 4 ENKLSVSIVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACN 183 ++ L++ + NVYAPCD G+ +W EL + WCVLG+FN+I+S ER Sbjct: 778 KDDLALCLANVYAPCDARGRSLLWRELDAKLLQIPLSVWCVLGDFNAIRSRDERVSRG-G 836 Query: 184 HRSEEIELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQW 363 E+ F +FI+ LIDLPL GR FTWY DG +MS LDRFLIS+ W+ ++ N Q Sbjct: 837 SGVEDYMAFNNFIDRNALIDLPLGGRSFTWYSGDGLSMSHLDRFLISDSWVSSFPNCVQM 896 Query: 364 GMQRSVSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498 + RS+SDHC ++L +WGPKPFRM++CW + GY +F K++ Sbjct: 897 ALPRSLSDHCPIMLSVGVQDWGPKPFRMLKCWADISGYAEFFKQK 941 Score = 127 bits (320), Expect(2) = 5e-66 Identities = 64/190 (33%), Positives = 95/190 (50%) Frame = +2 Query: 491 KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670 K++W ++ GW G+ WH+ H NL+ KI+ AK + LD+ E Sbjct: 939 KQKWQSFQIHGWSGHILKTKLKLLKAELRSWHQIHTANLDGKIQRAKSRLEELDICKEGR 998 Query: 671 GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850 GL+ E + +I S + + WQKSR+ WL++GDANSKFFH + RR+AN I Sbjct: 999 GLDVAEEAELMSLPVDILALSKLQASMYWQKSRVTWLRDGDANSKFFHGVMSSRRRANSI 1058 Query: 851 LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030 L +G +E V +R ++ +HF + RP + +L LT PF Sbjct: 1059 GALVHEGRTVESVPEVRHIAYQHYSNHFRKQLHYRPDISRLEFRSLSLLHGAELTKPFLL 1118 Query: 1031 EEVKAMVWNC 1060 EE+KA VW+C Sbjct: 1119 EEIKAAVWDC 1128