BLASTX nr result
ID: Glycyrrhiza32_contig00026986
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza32_contig00026986 (1652 letters)

Database: ./nr
115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

Sequences producing significant alignments:  Score (bits)  E Value

KHN30886.1 Putative ribonuclease H protein, partial [Glycine soja]  255  2e-76
KHN41375.1 Putative ribonuclease H protein, partial [Glycine soja]  237  1e-69
KHN28363.1 Putative ribonuclease H protein, partial [Glycine soja]  237  6e-69
XP_019447203.1 PREDICTED: uncharacterized protein LOC109350421 [...  235  7e-68
KHN20323.1 Putative ribonuclease H protein, partial [Glycine soja]  234  1e-67
XP_019418409.1 PREDICTED: uncharacterized protein LOC109329191 [...  243  9e-67
GAU38148.1 hypothetical protein TSUD_395930 [Trifolium subterran...  229  7e-65
GAU25119.1 hypothetical protein TSUD_274080 [Trifolium subterran...  228  1e-61
GAU48210.1 hypothetical protein TSUD_404970 [Trifolium subterran...  230  1e-61
GAU34179.1 hypothetical protein TSUD_162800 [Trifolium subterran...  225  2e-61
KYP61726.1 Putative ribonuclease H protein At1g65750 family [Caj...  217  5e-60
GAU24087.1 hypothetical protein TSUD_388800 [Trifolium subterran...  225  7e-60
GAU26515.1 hypothetical protein TSUD_361480 [Trifolium subterran...  222  9e-60
KYP54863.1 Putative ribonuclease H protein At1g65750 family [Caj...  218  1e-59
GAU29820.1 hypothetical protein TSUD_223660 [Trifolium subterran...  219  1e-59
KYP53060.1 hypothetical protein KK1_025062 [Cajanus cajan]  209  1e-59
GAU40143.1 hypothetical protein TSUD_163120 [Trifolium subterran...  211  3e-59
GAU50085.1 hypothetical protein TSUD_371690 [Trifolium subterran...  212  4e-59
GAU20019.1 hypothetical protein TSUD_273540 [Trifolium subterran...
213 4e-59 KYP44439.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus ca... 222 5e-59 >KHN30886.1 Putative ribonuclease H protein, partial [Glycine soja] Length = 373 Score = 255 bits (652), Expect = 2e-76 Identities = 132/363 (36%), Positives = 186/363 (51%), Gaps = 4/363 (1%) Frame = +3 Query: 12 GGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHRFA---S 182 GGLG+KNLE+FN DH+A+W LL+F+YG N + D + S Sbjct: 1 GGLGVKNLEVFNISLLAKWKWRCIHDHNALWRDLLAFRYG-NLIAKQTCSLDRSWGTKDS 59 Query: 183 IWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVAEN 362 IWWRDL L+E D +F A+ VGDG+ FW + WLG LKD FP LF ++ Sbjct: 60 IWWRDLMLLEKDLSQNQNFFQRAVSCDVGDGQSILFWYNKWLGSEPLKDAFPELFAISSQ 119 Query: 363 KEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKLEG 542 + ++ G+W D W W+ W+R L EEE L I+ V + D W W L Sbjct: 120 QLVSVGNAGSWRRDQWTWDLTWKRQLNPNEEESLHSLETILVDVHLVAESHDRWKWSLHN 179 Query: 543 SQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTREQL 722 S++F+V S Y+ +++ N + +W V SK+A+F WRLL DRLPT++ L Sbjct: 180 SKLFTVSSCYSFAMSLVNQTQMNSDILDILSIVWKVPVPSKVALFCWRLLLDRLPTKDNL 239 Query: 723 ICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVNHFI 902 I R ++ N C C EN HLFF C+FS +W + SWIG+ V GV HF Sbjct: 240 IRRNVVINN--SRCSLCDSCDENVVHLFFHCDFSKCIWKEILSWIGIVDVIAVGGVQHFW 297 Query: 903 QHGDFFKGKKLR-RTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWGWFVN 1079 ++ K R + + W+A +W +W +RN IF+ D I QIK + W WF+ Sbjct: 298 EYDRLLKYNTSRNKVPFMFWLATLWIIWQVRNNSIFKEEEKDIPKTINQIKHICWAWFMG 357 Query: 1080 RAG 1088 + G Sbjct: 358 KVG 360 >KHN41375.1 Putative ribonuclease H protein, partial [Glycine soja] Length = 363 Score = 237 bits (604), Expect = 1e-69 Identities = 122/338 (36%), Positives = 173/338 (51%), Gaps = 4/338 (1%) Frame = +3 Query: 87 DHDAVWVGLLSFKYGQNFTISSHNTADHRFA---SIWWRDLHLIELDRGVQPMWFSDALC 257 DH+A+W LL+F+YG N + D + SIWWRDL L+E D +F A+ Sbjct: 12 DHNALWRDLLAFRYG-NLIAKQTCSLDRSWGTKDSIWWRDLMLLEKDLSQNQNFFQRAVS 70 Query: 258 RKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVAENKEANIAEMGAWHGDIWRWEWRWRRP 437 VGDG+ FW + WLG LKD 
FP LF ++ + ++ G+W D W W W+R Sbjct: 71 CDVGDGQSILFWYNKWLGSEPLKDAFPELFAISSQQLVSVGNAGSWRRDQWTWGLTWKRQ 130 Query: 438 LFVWEEELLADFLNIMAPVQIQKNVVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQG 617 L EEE L I+ V + D W W L S++F+V S Y+ +++ N Sbjct: 131 LNPNEEESLHSLETILVDVHLVAESHDRWKWSLHNSKLFTVSSCYSFAMSLVNQTQMNSD 190 Query: 618 LDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCS 797 + +W V SK+A+F WRLL DRLPT++ LI R ++ N C C EN Sbjct: 191 ILDILSIVWKVPVPSKVALFCWRLLLDRLPTKDNLIRRNVVINN--SRCSLCDSCDENVV 248 Query: 798 HLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVNHFIQHGDFFKGKKLR-RTRNLIWMAVVW 974 HLFF C+FS +W + SWIG+ V GV HF ++ K R + + W+A +W Sbjct: 249 HLFFHCDFSKCIWKEILSWIGIVDVIAVGGVQHFWEYDRLLKYNTSRNKVPFMFWLATLW 308 Query: 975 SLWGMRNKIIFQGLVADFTSVIAQIKMLSWGWFVNRAG 1088 +W +RN IF+ D I QIK + W WF+ + G Sbjct: 309 IIWQVRNNSIFKEEEKDIPKTINQIKHICWAWFMGKVG 346 >KHN28363.1 Putative ribonuclease H protein, partial [Glycine soja] Length = 417 Score = 237 bits (604), Expect = 6e-69 Identities = 125/340 (36%), Positives = 175/340 (51%), Gaps = 4/340 (1%) Frame = +3 Query: 3 KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHRFA- 179 K+ GGLG+KNLE+FN DH+A+W LL+F+YG N + D + Sbjct: 73 KKEGGLGVKNLEVFNISLLAKWKWRCIHDHNALWRDLLAFRYG-NLIAKQTCSLDRSWGT 131 Query: 180 --SIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLV 353 SIWWRDL L+E D +F A+ VGDG+ FW + WLG LKD FP LF + Sbjct: 132 KDSIWWRDLMLLEKDLSQNQNFFQRAVSCDVGDGQSILFWYNKWLGSEPLKDAFPELFAI 191 Query: 354 AENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWK 533 + + ++ G+W D W W W+R L EEE L I+ V + D W W Sbjct: 192 SSQQLVSVGNAGSWRRDQWTWGLTWKRQLNPNEEESLHSLETILVDVHLVAESHDRWKWS 251 Query: 534 LEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTR 713 L S++F+V S Y+ +++ N + +W V SK+A+F WRLL DRLPT+ Sbjct: 252 LHNSKLFTVSSCYSFAMSLVNQTQMNSDILDILSIVWKVPVPSKVALFCWRLLLDRLPTK 311 Query: 714 EQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVN 893 + LI R ++ N C C EN HLFF C+FS +W V SWIG+ V GV Sbjct: 312 
DNLIRRNVVINN--SRCSLCDSCDENVVHLFFHCDFSNCIWKEVLSWIGIVDVIAVGGVQ 369 Query: 894 HFIQHGDFFKGKKLR-RTRNLIWMAVVWSLWGMRNKIIFQ 1010 HF ++ K R + + W+A +W +W +RN IF+ Sbjct: 370 HFWEYDRLLKYNTSRNKVPFMFWLATLWIIWQVRNNSIFK 409 >XP_019447203.1 PREDICTED: uncharacterized protein LOC109350421 [Lupinus angustifolius] Length = 456 Score = 235 bits (600), Expect = 7e-68 Identities = 130/374 (34%), Positives = 187/374 (50%), Gaps = 4/374 (1%) Frame = +3 Query: 3 KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHRF-- 176 KR G G+KNL LFN + +++WV +L YG + F Sbjct: 8 KRGRGFGVKNLGLFNLALLGKWRWRMLSSSESLWVKVLRSIYGVEAVVRGGLVDVECFKK 67 Query: 177 ASIWWRDLHLI-ELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLV 353 S WWRDL + D G WF++ + R+VG G+ T FW D W+G LK+CF RLF V Sbjct: 68 GSSWWRDLGCVCNRDNGFNKGWFNEGVRRRVGSGQSTLFWRDIWVGGECLKNCFERLFQV 127 Query: 354 AENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWK 533 NK+A I+ M W +W W WRR LF+WE++ + D LN + V++ + D WLW Sbjct: 128 TLNKDACISSMDEWRNGVWCWLLNWRRSLFLWEQDEVNDLLNKVEEVRLVQGNEDGWLWV 187 Query: 534 LEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTR 713 + + +SV++AY +L RN+ +K LW+ V SK+ F WRL +PTR Sbjct: 188 HDKNGTYSVRNAYKVLQNEVRNDNYLH-----YKRLWASKVPSKLKCFAWRLFVGGVPTR 242 Query: 714 EQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVN 893 L RGII + C FC E+ HLFFTC+ SY VW + S G+ + + + Sbjct: 243 MNLARRGIIGSLPSTLCAFCGELEESSDHLFFTCSLSYSVWQKLYSLFGIYSILPSSTGS 302 Query: 894 HFIQHGDFF-KGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWGW 1070 +F+ H F + K + IW +WSLW +RNKIIF+ + I I L + Sbjct: 303 NFLSHWHLFGEAKNFHQQWMTIWFVTIWSLWLVRNKIIFEESSFNLDENIIIIFSLPHHF 362 Query: 1071 FVNRAGRSSEISLV 1112 F R + S ++ Sbjct: 363 FFARFNKESSFEVL 376 >KHN20323.1 Putative ribonuclease H protein, partial [Glycine soja] Length = 417 Score = 234 bits (596), Expect = 1e-67 Identities = 123/340 (36%), Positives = 174/340 (51%), Gaps = 4/340 (1%) Frame = +3 Query: 3 KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHRFA- 179 K+ GGLG+KNLE+FN DH+A+W LL+F+YG N + D + 
Sbjct: 73 KKEGGLGVKNLEVFNISLLAKWKWRCIHDHNALWRDLLAFRYG-NLIAKQTCSLDRSWGT 131 Query: 180 --SIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLV 353 SIWWRDL L+E D +F A+ VGDG+ FW + WLG LKD FP LF + Sbjct: 132 KDSIWWRDLMLLEKDLSQNQNFFQRAVSCDVGDGQSILFWYNKWLGSEPLKDAFPELFAI 191 Query: 354 AENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWK 533 + + ++ +W D W W W+R L EEE L I+ V + D W W Sbjct: 192 SSQQLESVGNASSWRRDQWTWGLTWKRQLNPNEEESLHSLETILVDVHLVAESHDRWKWS 251 Query: 534 LEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTR 713 L S++F+V S Y+ +++ N + +W V SK+A+F WRLL DRLPT+ Sbjct: 252 LHNSKLFTVSSCYSFAMSLVNQTQMNSDILDILSIVWKVPVPSKVALFCWRLLLDRLPTK 311 Query: 714 EQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVN 893 + LI R ++ N C C EN HLFF C+FS +W + SWIG+ V GV Sbjct: 312 DNLIRRNVVINN--SRCSLCDSCDENVVHLFFHCDFSKCIWKEILSWIGIVDVIAVGGVQ 369 Query: 894 HFIQHGDFFKGKKLR-RTRNLIWMAVVWSLWGMRNKIIFQ 1010 HF ++ K R + + W+A +W +W +RN IF+ Sbjct: 370 HFWEYDRLLKYNTSRNKVPFMFWLATLWIIWQVRNNSIFK 409 >XP_019418409.1 PREDICTED: uncharacterized protein LOC109329191 [Lupinus angustifolius] Length = 953 Score = 243 bits (619), Expect = 9e-67 Identities = 131/358 (36%), Positives = 185/358 (51%), Gaps = 4/358 (1%) Frame = +3 Query: 3 KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHRF-- 176 K GGLG+KNL LFN + +++WV +L YG + F Sbjct: 567 KEEGGLGVKNLGLFNLALLGKWRWHMLSSSESLWVKVLRSIYGVEAVVRGGLVDVECFKK 626 Query: 177 ASIWWRDLH-LIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLV 353 S WWRDL L D G WF++ + R+VG G+ T FW D W+G LK+CF RLF V Sbjct: 627 GSSWWRDLGCLCNRDNGFNKGWFNEGVRRRVGSGQSTLFWRDIWVGGECLKNCFERLFQV 686 Query: 354 AENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWK 533 NK+A I+ MG W +W W WRR LF+WE++ + D LN + V++ + D WLW Sbjct: 687 TLNKDACISSMGEWRNGVWCWLLNWRRSLFLWEQDEVNDLLNKVEEVRLVQGNEDGWLWV 746 Query: 534 LEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTR 713 + + +SV++AY +L RN+ +K LW+ V SK+ F WRL +PT Sbjct: 747 
HDKNGTYSVRNAYKVLQNEVRNDNYLH-----YKRLWASKVPSKLKCFAWRLFVGGVPTW 801 Query: 714 EQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVN 893 L RGII + C FC E+ HLFFTC+ SY VW + S G+ + + + Sbjct: 802 MNLARRGIIGSLPSTLCAFCGELEESSDHLFFTCSLSYSVWQKLYSLFGIYSILPSSTGS 861 Query: 894 HFIQHGDFF-KGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSW 1064 +F+ H F + KK + IW +WSLW +RNKIIF+ + V+ I + SW Sbjct: 862 NFLSHWHLFGEAKKFHQQWMTIWFVTIWSLWLVRNKIIFEESSFNVDEVMFIINLHSW 919 >GAU38148.1 hypothetical protein TSUD_395930 [Trifolium subterraneum] Length = 503 Score = 229 bits (583), Expect = 7e-65 Identities = 134/371 (36%), Positives = 194/371 (52%), Gaps = 8/371 (2%) Frame = +3 Query: 3 KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTAD---HR 173 K+ GGLG+++L L N T VW ++ +YG++ I N D R Sbjct: 126 KKEGGLGVRDLRLVNISLLAKWRWKLLTTECEVWKEVVGARYGRD-VIGKVNLGDIDVTR 184 Query: 174 FASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLV 353 S WWRDL L++ D WFS A+ ++VG G+ T FW++ W+G L+ FPRLF + Sbjct: 185 TGSCWWRDLCLLDSD----VRWFSSAVGKRVGRGDSTMFWNEIWIGDQPLRQRFPRLFGM 240 Query: 354 AENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNV-VDSWLW 530 + + I MG+ +W WE +WRR F WEE+ FL+I+ VQ V D WLW Sbjct: 241 STQQNEVICNMGSLVNGLWHWELQWRRNFFTWEEDQYNHFLDII--VQFAPTVQQDRWLW 298 Query: 531 KLEGSQVFSVKSAY----NMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQD 698 +G Q ++ SAY N L+T + N D FK LW C SK++ F+W+L+ D Sbjct: 299 LGDGVQGYTANSAYSLVVNKLVTPSVCDPIN---DLVFKILWKCGAPSKVSAFSWQLMLD 355 Query: 699 RLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFH 878 RL T++ L+ R II A+H G+CVFC A E+ SHLF C+ VW + W+G+ + Sbjct: 356 RLQTKDNLMKRRIIQAHH-GNCVFCNLAQESASHLFLHCDRVAKVWYDLMRWLGLTVILP 414 Query: 879 NDGVNHFIQHGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKML 1058 ++ V+ KK R LIW A +W +W +RN +F V V Q+K+ Sbjct: 415 HNIVSSLAILVTCANNKKERAGLCLIWNAYMWVIWTVRNVCVFNNGVFMEEEVADQVKLE 474 Query: 1059 SWGWFVNRAGR 1091 SW WF+ R + Sbjct: 475 SWKWFIGRVAK 485 >GAU25119.1 hypothetical protein TSUD_274080 [Trifolium subterraneum] Length = 937 Score = 
228 bits (582), Expect = 1e-61 Identities = 136/367 (37%), Positives = 193/367 (52%), Gaps = 10/367 (2%) Frame = +3 Query: 12 GGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTA-----DHRF 176 GGLG++++ N AVW +L +YG+N + HN Sbjct: 560 GGLGVRDVGKVNLSLLIKWRWKLLQKDAAVWKDVLVARYGEN---ARHNVLWIGCPIPSS 616 Query: 177 ASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVA 356 AS WWRDL I+L + WF+ + R+VG G+ T+FW D W+G L + FPRLF ++ Sbjct: 617 ASCWWRDLCRIDLTE--EGSWFAKNISRRVGRGDTTRFWKDCWVGQVPLCESFPRLFSIS 674 Query: 357 ENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKL 536 KEA ++E+ + WEW WRR LFVWEEELL + ++P+ + D W W L Sbjct: 675 LQKEALVSEIRVGGEGVSWWEWGWRRSLFVWEEELLLGLQDFISPMAFSTD-DDVWYWGL 733 Query: 537 EGSQVFSVKSAYNMLLTVQRNNGENQGLD-RTFKWLWSCDVSSKIAVFTWRLLQDRLPTR 713 E VF+VKSAY +L + + + R +W SK+ F+W+LL++R+PTR Sbjct: 734 EDGGVFTVKSAYLLLGRMFASFSMFNVCELRVLNSIWRSPAPSKVIAFSWKLLRNRIPTR 793 Query: 714 EQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVN 893 + L RGI+ A CV C E HLF C+F++ VWS + W+GV V N Sbjct: 794 DCLSRRGILAAGGSRECVHCQGREETALHLFLFCDFAFRVWSAIFQWLGVVIVM---PPN 850 Query: 894 HFIQHGDFFKG----KKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLS 1061 FI D F G K + LIW VW++W RN+I+F V D +SVI +IK+LS Sbjct: 851 LFILF-DCFVGAAGCNKRAKGFLLIWHTTVWAIWRSRNEILFANGVLDPSSVIDEIKLLS 909 Query: 1062 WGWFVNR 1082 W W ++R Sbjct: 910 WRWGLSR 916 >GAU48210.1 hypothetical protein TSUD_404970 [Trifolium subterraneum] Length = 1653 Score = 230 bits (587), Expect = 1e-61 Identities = 131/370 (35%), Positives = 183/370 (49%), Gaps = 2/370 (0%) Frame = +3 Query: 3 KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFT--ISSHNTADHRF 176 K GGLG+KNL LFN T+ A+W LL F+YG T + + + Sbjct: 1270 KDQGGLGVKNLNLFNIALLNKWKWRFLTEDGALWAELLRFRYGHLPTQLMGGASFSIGAK 1329 Query: 177 ASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVA 356 +S WW+D+ I + +G + WF + VG+G + FW+ W G + FP LF Sbjct: 1330 SSTWWKDV--IGMGKGAEFDWFKSNMRACVGNGVNIGFWNFKWFGNHPFSEIFPNLFAKE 1387 Query: 357 
ENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKL 536 E +IAE +G+ + W+W PL E + +A+ ++ +Q DSW W L Sbjct: 1388 ERPNVSIAERLGGNGEAFVRHWQWSDPLSDSEHQQVAELTELLRGFSLQPGHQDSWRWIL 1447 Query: 537 EGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTRE 716 E + +FSVKS YN L+ + + + LW DV SK+ F WRLL RLP R Sbjct: 1448 ETTGLFSVKSYYNALVKSRLIVELDSNVLTAINQLWKNDVPSKVLFFGWRLLLQRLPIRI 1507 Query: 717 QLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVNH 896 L RGI+ CVFC E+C HLFF C+F VW V +WIG +G +H Sbjct: 1508 ALNHRGILTNPQDLPCVFCSVFYEDCVHLFFHCSFVNCVWEAVYNWIGKDYHAGAEGWSH 1567 Query: 897 FIQHGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWGWFV 1076 F GD + R R+LIW+A W+LW +RN +IF G +S++ IK +S W Sbjct: 1568 FKVFGDMVNSTNIERVRHLIWLATTWNLWKLRNNVIFNGATPSASSLLNDIKAISCAWVS 1627 Query: 1077 NRAGRSSEIS 1106 R G S IS Sbjct: 1628 GRYGHKSCIS 1637 >GAU34179.1 hypothetical protein TSUD_162800 [Trifolium subterraneum] Length = 757 Score = 225 bits (574), Expect = 2e-61 Identities = 127/369 (34%), Positives = 194/369 (52%), Gaps = 9/369 (2%) Frame = +3 Query: 3 KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDA-VWVGLLSFKYGQNF---TISSHNTADH 170 K+ GGLGI++L+ N D +W +L KYG + + S + + Sbjct: 375 KKNGGLGIRDLKAVNLSLLMKWRWRLLNSEDTGLWKEVLVAKYGGHILHNVVWSLGSPPY 434 Query: 171 RFASIWWRDLHLIELDRGVQPM-WFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLF 347 R AS+WW+D++ +L V W ++ + R +G+G T+FWSD W+G L FPRLF Sbjct: 435 R-ASLWWKDIN--DLQACVNSKNWVAEMVTRFLGNGSRTRFWSDNWIGDVLLCSKFPRLF 491 Query: 348 LVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWL 527 ++ KEA ++EM G+ W + WRR LF+WEEE ++ L+++ V + D W Sbjct: 492 SLSLQKEATVSEMMVVEGETKSWNFLWRRSLFLWEEERVSQLLSLLENVSLSLEE-DKWH 550 Query: 528 WKLEGSQVFSVKSAYNMLLTVQRNNGENQGLD----RTFKWLWSCDVSSKIAVFTWRLLQ 695 W L+ FSVKSAY+ LL N + L + F +W K+ VF+WRLL Sbjct: 551 WALDPDGCFSVKSAYDSLL---ENLDTSPNLSPYEAKIFSNIWDSPAPLKVVVFSWRLLH 607 Query: 696 DRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVF 875 DR+PT+E LI RG++ GSCV+C E+ +HLF C + VW + W+GV V Sbjct: 608 
DRVPTKENLIVRGVLPRESSGSCVWCGDIRESSAHLFLHCKVALVVWYEIFRWLGVVIVI 667 Query: 876 HNDGVNHFIQHGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKM 1055 + F D + KK ++ L+W +V+W++W RN IF + +D ++ K+ Sbjct: 668 PPNLFTLFDYFSDSARSKKSKKGFLLVWHSVIWTIWKARNNQIFNNVTSDPFELVESAKV 727 Query: 1056 LSWGWFVNR 1082 LSW W +R Sbjct: 728 LSWRWSADR 736 >KYP61726.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 554 Score = 217 bits (553), Expect = 5e-60 Identities = 120/362 (33%), Positives = 180/362 (49%), Gaps = 6/362 (1%) Frame = +3 Query: 3 KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHRFAS 182 K GGLGI +L FN + W +++ YG+ D +S Sbjct: 193 KEHGGLGILDLRAFNLAILEKWRWHLLVEKGRFWHKVVTSIYGEG---CFQGVGDKVQSS 249 Query: 183 IWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVAEN 362 WW DL I+ WFS + VGDG++T FW D W G L + + RLF +A + Sbjct: 250 KWWVDLWTIDFAPYASFDWFSSRCTKVVGDGQNTFFWKDGWSGQGPLCNRYSRLFSIASD 309 Query: 363 KEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKLEG 542 K+ ++A M W + W W WRR LF WE +LL+ + ++ + D W WK Sbjct: 310 KDVSVANMVLWRDGGFEWIWSWRRSLFQWELDLLSQLAADLGSTVLKNDCCDRWCWKDSN 369 Query: 543 SQVFSVKSAYNMLLTVQRNNGENQGLDRTF---KWLWSCDVSSKIAVFTWRLLQDRLPTR 713 ++++VKSAY ++ N G+ F K+LWS V SK++ F W+ L +R+P+ Sbjct: 370 DEIYNVKSAYKAVI--------NDGIYANFLLHKFLWSSCVPSKVSGFAWKALLNRIPSN 421 Query: 714 EQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHN---D 884 LI R ++D + G C + +EN SHL F C ++Y VW + W GV+ V HN + Sbjct: 422 CNLIKRKVLDISASG-CAWYGEDLENTSHLLFGCYYAYSVWLSIFDWFGVSTVLHNSCHE 480 Query: 885 GVNHFIQHGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSW 1064 HFI K+R +++W+A +WSLW RN +IF+ V T ++ IK+ SW Sbjct: 481 NFAHFIGIPRCSGRDKMR--WSVVWLATIWSLWLARNNVIFKDKVVAITDLVELIKIRSW 538 Query: 1065 GW 1070 W Sbjct: 539 NW 540 >GAU24087.1 hypothetical protein TSUD_388800 [Trifolium subterraneum] Length = 1985 Score = 225 bits (574), Expect = 7e-60 Identities = 120/372 (32%), Positives = 197/372 (52%), Gaps = 7/372 (1%) Frame = +3 Query: 3 
KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHR--- 173 K+ GGL I++L N ++ + VW ++ KYG + ++ D R Sbjct: 1608 KKEGGLSIRDLRTVNLSLLAKWRWKLLSEEEEVWKNVIIAKYGIHMLGNAR--LDERDIG 1665 Query: 174 -FASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFL 350 +S+WWRDL LD+GV WF+ + +G G KFW + W+G SL+ FPRLF Sbjct: 1666 SMSSLWWRDL--CRLDKGVG--WFNHFARKYLGCGNSIKFWKEVWVGGQSLELQFPRLFG 1721 Query: 351 VAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLW 530 ++ ++ + E+G+W +WRW RWRR LFVWEE+L+++ ++ + I + D W+W Sbjct: 1722 ISVQQDDMVREVGSWVNGVWRWGLRWRRVLFVWEEDLVSELELVLNNISITEE-EDRWVW 1780 Query: 531 KLEGSQVFSVKSAY---NMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDR 701 +L F+VKS Y + LLT + + ++ +W V SK++ W+L DR Sbjct: 1781 RLNVGDGFTVKSLYEALDPLLTPRCLVSSFESF--AYRSIWKSAVPSKVSALAWQLFLDR 1838 Query: 702 LPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHN 881 +PT+ L RGI+ +H SCV C E HLF C+++ +W V W+GV V Sbjct: 1839 IPTKVNLYKRGILRMDH-ASCVLCGEEAETARHLFLHCDYAAGIWYAVCRWLGVFAVLPA 1897 Query: 882 DGVNHFIQHGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLS 1061 D + + + KK+R+ ++WMA +W +W +RN+ +F+ + T + ++ LS Sbjct: 1898 DVMMSYGLLVGCGRNKKIRKGFAIVWMAFIWVIWKVRNERVFKNATVEVTDAVDMVQRLS 1957 Query: 1062 WGWFVNRAGRSS 1097 W W++N+ SS Sbjct: 1958 WQWYLNKMASSS 1969 >GAU26515.1 hypothetical protein TSUD_361480 [Trifolium subterraneum] Length = 873 Score = 222 bits (566), Expect = 9e-60 Identities = 125/360 (34%), Positives = 179/360 (49%), Gaps = 3/360 (0%) Frame = +3 Query: 12 GGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSH--NTADHRFASI 185 GGLG++++ N A W LL KYG+ H + AS Sbjct: 496 GGLGVRDVGKVNLSLLIKWRWRLLQPEGAFWKELLVAKYGEMVRQKLHWNDCPIPSRASS 555 Query: 186 WWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVAENK 365 WW+D+ E+D + WF+ + R+VG G+ +FW D W G S L D FPRLF +A +K Sbjct: 556 WWKDI--CEIDVCEEGSWFAQHVFRRVGKGDSIRFWKDCWFGNSPLCDLFPRLFSIATHK 613 Query: 366 EANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKLEGS 545 EA + E+ + W W WRR LFVWE+ELL + P+ + D W W+LE Sbjct: 614 
EALVNEVRVVTEGLNLWNWEWRRRLFVWEQELLVSLTETL-PLLVLSGEEDVWYWRLEDG 672 Query: 546 QVFSVKSAYNMLLTVQRNNGENQGLD-RTFKWLWSCDVSSKIAVFTWRLLQDRLPTREQL 722 VF+VKS Y +L +V + + R F +W SK+ VF W+LL++R+PT+ L Sbjct: 673 GVFTVKSVYTLLGSVFATDAVWSPPELRVFDQIWKSPAPSKVIVFPWKLLRNRIPTKANL 732 Query: 723 ICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVNHFI 902 RGI +CV C + E+ SHLF CNF+ VW+ + WIGV V + F Sbjct: 733 ALRGIQVVGGSLNCVHCVGSGEDASHLFMYCNFAAQVWNSIFRWIGVTIVIPPNIFLLFD 792 Query: 903 QHGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWGWFVNR 1082 K+ + +LIW +W +W RN I F D + +IK+LSW W ++R Sbjct: 793 CMRGAAPNNKIAKGFSLIWHTTLWVIWKSRNSISFGSGTIDLGQAVGEIKLLSWRWDLSR 852 >KYP54863.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 648 Score = 218 bits (556), Expect = 1e-59 Identities = 128/399 (32%), Positives = 194/399 (48%), Gaps = 4/399 (1%) Frame = +3 Query: 3 KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHRFAS 182 K GGLGI +L FN + W +++ YG+ D +S Sbjct: 256 KEHGGLGILDLRAFNLALLGKWRWRLLVEKGRFWHRVVTSIYGEG---CFQGVGDKVQSS 312 Query: 183 IWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVAEN 362 WW DL I+ WFS + VGDG +T FW D W G L + + RLF +A + Sbjct: 313 KWWVDLWTIDSTPYTSFDWFSSRCTKVVGDGRNTFFWKDGWSGQGPLCNRYSRLFSIASD 372 Query: 363 KEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKLEG 542 K+ ++A M W + W W WRR LF WE +LL+ + + ++ + D W WK Sbjct: 373 KDVSVANMVLWRDGGFEWIWSWRRSLFQWELDLLSQLAADLGSIVLKNDCCDRWCWKDSN 432 Query: 543 SQVFSVKSAYNMLLTVQRNNGENQGLDRTF---KWLWSCDVSSKIAVFTWRLLQDRLPTR 713 +++VKSAY ++ N G+ F K+LWS V SK++ F W+ L +R+P++ Sbjct: 433 DGIYNVKSAYKAVI--------NGGIYADFLLHKFLWSSCVPSKVSGFAWKALLNRIPSK 484 Query: 714 EQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVN 893 LI R +++ + G C +C +EN SHL F C ++Y VW +W GV+ V HN Sbjct: 485 CNLIKRKVLNISASG-CAWCGEDLENTSHLLFGCYYAYFVWLSNFAWFGVSTVIHNSCHE 543 Query: 894 HFIQHGDFFKGKKLRRTR-NLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWGW 1070 +F F + R R +++W+A +WSLW RN +IF+ V ++ IK+ SW W Sbjct: 544 
NFAHFNGFPRCSGRDRMRWSVVWLATIWSLWLARNDVIFKDKVVAIKDLVELIKLRSWNW 603 Query: 1071 FVNRAGRSSEISLVGLTFSRRGWAFFHNCEPLWIIFTFG 1187 ++ + S T S+RG+ PL TFG Sbjct: 604 I-----KTKDKSF--FTHSQRGFL------PLVFALTFG 629 >GAU29820.1 hypothetical protein TSUD_223660 [Trifolium subterraneum] Length = 672 Score = 219 bits (557), Expect = 1e-59 Identities = 115/366 (31%), Positives = 181/366 (49%), Gaps = 3/366 (0%) Frame = +3 Query: 3 KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSH--NTADHRF 176 K+ GGLGI+NL L N + VW ++ KYG+ ++ N +F Sbjct: 295 KKEGGLGIRNLRLVNLSLLTKWRWRLLSGEGEVWKDIIVAKYGERVMGNARLDNIVYLQF 354 Query: 177 ASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVA 356 S WWRDL ++ D G WF+ + +KVG G FW D W G SL+ FPRLF ++ Sbjct: 355 GSAWWRDLCNLDKDEG----WFNQVVLKKVGMGNSILFWKDVWAGDQSLEHRFPRLFGIS 410 Query: 357 ENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKL 536 + + MG+W WRWE WRR FVWE EL+ + ++ + + VD W+WK Sbjct: 411 IQQNEVVRNMGSWVNVEWRWELLWRRQFFVWENELVRELGEVLNIFPLSEE-VDRWVWKP 469 Query: 537 EGSQVFSVKSAYNMLLTVQRNNGENQGLDR-TFKWLWSCDVSSKIAVFTWRLLQDRLPTR 713 ++ FSVKS Y+ L + L+ +F +W C V SK++ W+L DR+PT+ Sbjct: 470 NEAEGFSVKSLYDWLDSTLVTRAILTPLEAFSFCSIWKCVVPSKVSALAWQLFLDRIPTK 529 Query: 714 EQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVN 893 + L CR I + C C E H+F C+F+ VW + W+GV + D + Sbjct: 530 DNL-CRRRIIRSEDAVCDMCGGVSETSRHVFMHCDFAAQVWYAICRWLGVVVLLPPDVMT 588 Query: 894 HFIQHGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWGWF 1073 + KK+++ +++W+A +W +W RN +F + + I+ +SW WF Sbjct: 589 MYGSLVGCGSNKKIKKGFSIVWLAFIWVMWRSRNDKVFNNVAGVVEDALNHIQRISWQWF 648 Query: 1074 VNRAGR 1091 ++ + Sbjct: 649 LSNTAK 654 >KYP53060.1 hypothetical protein KK1_025062 [Cajanus cajan] Length = 323 Score = 209 bits (533), Expect = 1e-59 Identities = 112/307 (36%), Positives = 162/307 (52%), Gaps = 6/307 (1%) Frame = +3 Query: 180 SIWWRDLHLIELDRGVQPMWFSDALC-RKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVA 356 S WW DL I++ G+ WFS +C R +G+G +T FW D+W + + RLF + Sbjct: 5 
SRWWLDLWSIDVCDGISWDWFSTIMCVRVLGNGRNTSFWKDSWCTTTPFCVRYGRLFSIT 64 Query: 357 ENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKL 536 N EA +A+M G W WRWRRPLF WE E L ++ + Q+Q+ DSW WK Sbjct: 65 INSEATVADMFFGRGGGVEWNWRWRRPLFQWELEQLDLLVSDLRGFQVQEYTHDSWRWKA 124 Query: 537 EGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTRE 716 + +SVKSAY++++ N +++W V K++ F WR+L DR P++ Sbjct: 125 DSDGKYSVKSAYHVIV-----NDSLFAEIPLHRFIWCRLVPYKVSCFVWRVLLDRFPSKF 179 Query: 717 QLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVNH 896 L+ R ++ N SCV+C +E SHLFF C F+YHVW + W G V +N Sbjct: 180 NLVKRHVL-INSDSSCVWCQYRMETSSHLFFECYFAYHVWMLSLEWCGFTSVL----LNS 234 Query: 897 FIQHGDFFKG-----KKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLS 1061 FI H D F G K+R +IW+ V+WS+W RN +IF V V+ +K+ + Sbjct: 235 FIAHFDQFLGLPLCPSKMRYRWAVIWLTVIWSIWLARNALIFSDKVLSTLDVLELVKLRT 294 Query: 1062 WGWFVNR 1082 W W R Sbjct: 295 WKWLKAR 301 >GAU40143.1 hypothetical protein TSUD_163120 [Trifolium subterraneum] Length = 419 Score = 211 bits (538), Expect = 3e-59 Identities = 129/362 (35%), Positives = 182/362 (50%), Gaps = 5/362 (1%) Frame = +3 Query: 12 GGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYG--QNFTISSHNTADHRFASI 185 GGLG++++ N A W +L KYG F + A S+ Sbjct: 42 GGLGVRDVAKVNLSLLIKWRWRLLQSGYAFWKEVLVAKYGIMARFKVHWIGHALPNRVSL 101 Query: 186 WWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVAENK 365 WW+D+ I++ WF+ +CRK+G+G T+FW D W+G L D FPRLF ++ N+ Sbjct: 102 WWKDICGIDIRE--DGSWFARNMCRKLGNGNSTRFWLDRWIGSLPLSDQFPRLFSLSLNQ 159 Query: 366 EANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQ-KNVVDSWLWKLEG 542 + + E G W RWRR LFVWEEELL +++ PV + D W W+LE Sbjct: 160 QGMVREFRDVRGGEDGWVMRWRRRLFVWEEELLQRLQDLL-PVDVPWSEAEDRWSWRLEE 218 Query: 543 SQVFSVKSAYNMLLTV--QRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTRE 716 FSV S Y L +V Q ++ Q L F +W V SK+ FTW+LL++R+PTR Sbjct: 219 DGSFSVSSMYWYLGSVFSQASSFNAQEL-WVFGKIWKSPVPSKVIAFTWKLLRNRIPTRC 277 Query: 717 QLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVNH 896 L RG I G CV C E+ +HLF C+F+ +W+ + 
W+G+ V + Sbjct: 278 NLASRG-IQLIGGLDCVHCVGREESGTHLFMFCDFAGQIWNAIFRWLGLVLVIPPNFFLL 336 Query: 897 FIQHGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWGWFV 1076 F KK+R+ LIW +W LW RN I+F V D VI IK+LSW W + Sbjct: 337 FECFTGAAANKKIRKGYALIWHTTIWMLWKSRNDIMFSNGVIDVEKVIDDIKLLSWRWGL 396 Query: 1077 NR 1082 +R Sbjct: 397 SR 398 >GAU50085.1 hypothetical protein TSUD_371690 [Trifolium subterraneum] Length = 438 Score = 212 bits (539), Expect = 4e-59 Identities = 121/365 (33%), Positives = 184/365 (50%), Gaps = 5/365 (1%) Frame = +3 Query: 3 KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHRF-- 176 K GGLG++++ L N + +W +L KYG N ++ + D R Sbjct: 57 KSKGGLGVRDVRLANLSLLAKWRWRLLLPGNPLWKEVLVAKYG-NHILNRVDWRDIRIPT 115 Query: 177 -ASIWWRDLHLIELDRGVQPM-WFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFL 350 AS WW+D+ LD+ V W ++++ RKVG+G T FW W+G + L FP LF Sbjct: 116 LASKWWKDI--CTLDKVVDNHNWLAESMIRKVGNGTSTSFWCSNWIGEAPLSVTFPLLFS 173 Query: 351 VAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLW 530 ++ +K + G+ WRW + WRR LF WEE+L+ I+ PV + V D W W Sbjct: 174 LSNHKNGMVRNFCDHVGENWRWSFSWRRDLFQWEEDLVVRLREILEPV-VLSLVEDFWSW 232 Query: 531 KLEGSQVFSVKSAYNMLL-TVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLP 707 KL+ FSVKSAY L+ + R++ + + F +W SK+ F+W+LL DR+P Sbjct: 233 KLDPEGKFSVKSAYTFLVEELTRDDDLEEAMATVFDQIWDSPAPSKVIAFSWQLLSDRIP 292 Query: 708 TREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDG 887 TR L RG++ + CV C VE+ +HLF C + VW V W+GV + Sbjct: 293 TRRNLEIRGLLGLDMPWECVGCVGRVESTTHLFLHCPSAMMVWYEVFRWLGVVLIIPPSM 352 Query: 888 VNHFIQHGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWG 1067 F + KK+RR +IW A +W +W +NK +F ++ +IK++SW Sbjct: 353 EVLFEVLRGSVRIKKIRRGYLMIWHATLWCIWKAQNKALFANGTFIPKEIVEEIKVVSWK 412 Query: 1068 WFVNR 1082 W + R Sbjct: 413 WCLAR 417 >GAU20019.1 hypothetical protein TSUD_273540 [Trifolium subterraneum] Length = 504 Score = 213 bits (543), Expect = 4e-59 Identities = 119/357 (33%), Positives = 189/357 (52%), Gaps = 4/357 (1%) Frame = +3 Query: 12 
GGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHRFASIWW 191 GGLG++ ++ FN + D++W LL KYGQ+ + S WW Sbjct: 131 GGLGVRRVKDFNYALLGKWVWRCFAEGDSLWCQLLKAKYGQDS--AGRVRFSEGVGSSWW 188 Query: 192 RDLHLIELDRG-VQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVAENKE 368 R L+ + RG + P W SD + RK+GDG T FW+D+WL V L F RL+ +A+NK Sbjct: 189 RALNFVWSGRGLIDPRWLSDNIVRKIGDGRSTAFWADSWLEVGPLARVFGRLYDLADNKH 248 Query: 369 ANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKLEGSQ 548 ++A+M + W+WRR LFVWEEEL+A + ++A +Q + D W+W L SQ Sbjct: 249 ISVADMFQAGWALNGNGWKWRRRLFVWEEELVAQCVGVLANFVLQGDATDRWVWNLHPSQ 308 Query: 549 VFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTREQLIC 728 +SV+SAY+ L + ++ +LW V K+ +F WR+ +RLPT++ L+ Sbjct: 309 SYSVRSAYSYLTA-----SDGSSMEDFASFLWVKSVPLKVNIFIWRIFLNRLPTKDNLLR 363 Query: 729 RGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVNHFIQH 908 RG+I+ + C +A E+ HLF C+ VW +V +W+G++ H H Q Sbjct: 364 RGVIEVHQELCSTNCGKA-EDAVHLFIQCDVYSQVWHLVLNWLGLSTALHVSLGGHTEQF 422 Query: 909 GDFFKGKKLRRTRNL---IWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWGW 1070 G + +RNL IW++V++ +W RN IFQ +++ +IK+ ++ W Sbjct: 423 AGL--GGNSKTSRNLFTIIWVSVLFVIWKDRNDRIFQMGNDSGVTLLERIKLQTYWW 477 >KYP44439.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan] Length = 1142 Score = 222 bits (566), Expect = 5e-59 Identities = 116/355 (32%), Positives = 182/355 (51%), Gaps = 2/355 (0%) Frame = +3 Query: 12 GGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSH-NTADHRFASIW 188 GGLG+K+L FN + +++WV ++ Y I+SH S W Sbjct: 774 GGLGMKDLSAFNLSLLGKWHWRMLVEKNSLWVRVIRSLYD----IASHLPNGSGAKGSRW 829 Query: 189 WRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVAENKE 368 W DL+ IE V W S C+ +G+G TKFW D W+G L F RL+ +A NK Sbjct: 830 WVDLNRIEEGDLVSNEWMSSNCCKVIGNGVDTKFWLDKWVGHGILAHTFSRLYQIAINKN 889 Query: 369 ANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKLEGSQ 548 +IAEM W G + +W+W WRR L VWE++LL N + + + D WLW + Sbjct: 890 VSIAEMFEWEGGVVKWKWSWRRRLLVWEQQLLNTLANFINGTKFIISDEDKWLWIAAPER 949 Query: 549 
VFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTREQLIC 728 V++V SAY +L N + F+W+W+ +K++ FTWR++ +R+PT++ L Sbjct: 950 VYTVSSAYKVL-----RNDIIFASNVIFRWIWTSIAPTKVSAFTWRVILNRIPTKDNLFR 1004 Query: 729 RGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVNHFIQ- 905 RG++ A C C E SHLFF C S+ +W +W+G+ + HN V + Q Sbjct: 1005 RGVLQATQ-LECGLCRNKEETTSHLFFECEVSFQLWMACFNWLGLNSIMHNCCVQNLEQF 1063 Query: 906 HGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWGW 1070 +G + K + LI + V+W++W RN +IF + + ++ +++ SW W Sbjct: 1064 YGLRYCSVKYQNCWILIRLPVIWTIWLARNDLIFSSKIIHVSEMLNMVQLRSWRW 1118