BLASTX nr result
ID: Glycyrrhiza36_contig00005589
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza36_contig00005589 (2399 letters) Database: ./nr 115,041,592 sequences; 42,171,959,267 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value GAU48210.1 hypothetical protein TSUD_404970 [Trifolium subterran... 550 e-173 GAU25119.1 hypothetical protein TSUD_274080 [Trifolium subterran... 502 e-161 XP_019418409.1 PREDICTED: uncharacterized protein LOC109329191 [... 503 e-161 GAU37021.1 hypothetical protein TSUD_207270 [Trifolium subterran... 486 e-160 GAU34179.1 hypothetical protein TSUD_162800 [Trifolium subterran... 489 e-159 GAU26515.1 hypothetical protein TSUD_361480 [Trifolium subterran... 484 e-155 KYP54863.1 Putative ribonuclease H protein At1g65750 family [Caj... 470 e-153 GAU18134.1 hypothetical protein TSUD_248350 [Trifolium subterran... 468 e-151 KYP34591.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan] 472 e-151 GAU50434.1 hypothetical protein TSUD_134890, partial [Trifolium ... 466 e-150 KYP40876.1 Putative ribonuclease H protein At1g65750 family [Caj... 466 e-150 GAU46725.1 hypothetical protein TSUD_100170 [Trifolium subterran... 466 e-149 GAU24087.1 hypothetical protein TSUD_388800 [Trifolium subterran... 487 e-148 KYP63901.1 Putative ribonuclease H protein At1g65750 family [Caj... 456 e-148 KYP50779.1 Transposon TX1 uncharacterized [Cajanus cajan] 471 e-148 GAU43915.1 hypothetical protein TSUD_88880 [Trifolium subterraneum] 457 e-147 GAU17363.1 hypothetical protein TSUD_232390 [Trifolium subterran... 451 e-144 GAU43110.1 hypothetical protein TSUD_373050 [Trifolium subterran... 461 e-144 KYP44439.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus ca... 461 e-144 GAU27776.1 hypothetical protein TSUD_215870 [Trifolium subterran... 448 e-143 >GAU48210.1 hypothetical protein TSUD_404970 [Trifolium subterraneum] Length = 1653 Score = 550 bits (1416), Expect = e-173 Identities = 282/619 (45%), Positives = 381/619 (61%), Gaps = 2/619 (0%) Frame = -3 Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218 GF++ W KW++ACIF SS+S+L+NGSPT+DF+ +RGLRQGDPL+PFLFLIAAEGLTGL+ Sbjct: 1021 GFSEEWLKWLRACIFESSMSILINGSPTEDFKVERGLRQGDPLSPFLFLIAAEGLTGLMK 1080 Query: 2217 SAIERGILQGYKVSEDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNFL 2038 A+E G +GY+V+ +I F ILQFADDTI++ + W+N+ + + RSFELVSGL++NF+ Sbjct: 1081 RAVELGKFKGYQVNNNIQFQILQFADDTILMGEGVWDNIQTINILLRSFELVSGLKINFV 1140 Query: 2037 KSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLAA 1858 KS +YGIN++ L S FLSC + P KFLG+PV ANPRR +TW+PVV AM K+L+ Sbjct: 1141 KSKIYGINVDDRLLVAGSAFLSCRVDVFPFKFLGIPVGANPRRRETWKPVVDAMTKRLST 1200 Query: 1857 WKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVSW 1678 WK RHLS GGRVTLINSVL S+ LYFFSF+KAP ++R L IQR+FLWGG E++K+ W Sbjct: 1201 WKSRHLSFGGRVTLINSVLTSLPLYFFSFFKAPCCILRLLERIQRSFLWGGGLEDKKLCW 1260 Query: 1677 VCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFT--IS 1504 V D++CL K GGLG+KNL LFN LT+ A+W LL F+YG T + Sbjct: 1261 VKWDQICLSKDQGGLGVKNLNLFNIALLNKWKWRFLTEDGALWAELLRFRYGHLPTQLMG 1320 Query: 1503 SHNTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKD 1324 + + +S WW+D +I + +G + WF + VG+G + FW+ W G + Sbjct: 1321 GASFSIGAKSSTWWKD--VIGMGKGAEFDWFKSNMRACVGNGVNIGFWNFKWFGNHPFSE 1378 Query: 1323 CFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKN 1144 FP LF E +IAE +G+ + W+W PL E + +A+ ++ +Q Sbjct: 1379 IFPNLFAKEERPNVSIAERLGGNGEAFVRHWQWSDPLSDSEHQQVAELTELLRGFSLQPG 1438 Query: 1143 VVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRL 964 DSW W LE + +FSVKS YN L+ + + + LW DV SK+ F WRL Sbjct: 1439 HQDSWRWILETTGLFSVKSYYNALVKSRLIVELDSNVLTAINQLWKNDVPSKVLFFGWRL 1498 Query: 963 LQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAG 784 L RLP R L RGI+ CVFC E+C HLFF C+F VW V +WIG Sbjct: 1499 LLQRLPIRIALNHRGILTNPQDLPCVFCSVFYEDCVHLFFHCSFVNCVWEAVYNWIGKDY 1558 Query: 783 VFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVITQI 604 +G +HF GD ++ R R+LIW+A W+LW +RN +IF G +S++ I Sbjct: 1559 HAGAEGWSHFKVFGDMVNSTNIERVRHLIWLATTWNLWKLRNNVIFNGATPSASSLLNDI 1618 Query: 603 KMLSWGWFVNRAGRSSEIS 547 K +S W R G S IS Sbjct: 1619 KAISCAWVSGRYGHKSCIS 1637 >GAU25119.1 hypothetical protein TSUD_274080 [Trifolium subterraneum] Length = 937 Score = 502 bits (1293), Expect = e-161 Identities = 259/619 (41%), Positives = 373/619 (60%), Gaps = 10/619 (1%) Frame = -3 Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218 GF + W +W++AC+F+ ++SVLVNGSPT + +RGL+QGDPLAPFLFL+ EG GL+ Sbjct: 307 GFGETWVEWIRACVFAGNLSVLVNGSPTTEINIQRGLKQGDPLAPFLFLLVVEGFAGLMR 366 Query: 2217 SAIERGILQGYKV-SEDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041 S +++ + +G+ V +E + + LQ+ADDT+ + +AS NLW +KAI R FEL SGLRVN Sbjct: 367 SVVDKNLFKGFSVGTEGLQISHLQYADDTLCIGEASMENLWTLKAILRGFELASGLRVNI 426 Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861 KS + G+N+ +F+ A +FL+C G +P +LGLPV ANPRR TWEPV+ ++RK+L Sbjct: 427 WKSYLIGVNVPNNFMENACHFLNCKRGVLPFSYLGLPVGANPRRSSTWEPVLDSLRKRLR 486 Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681 AW +++S+GGR+ LINS+LNS+ ++F SF K P V++++ IQR FLWGG + K+S Sbjct: 487 AWGNKYVSLGGRIVLINSILNSIPIFFLSFLKLPAAVLKSITRIQREFLWGGVKGGSKIS 546 Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501 WV +VC P+ GGLG++++ N L AVW +L +YG+N + Sbjct: 547 WVKWKEVCKPRSQGGLGVRDVGKVNLSLLIKWRWKLLQKDAAVWKDVLVARYGEN---AR 603 Query: 1500 HNT-----ADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVS 1336 HN AS WWRDL I+L + WF+ + R+VG G+ T+FW D W+G Sbjct: 604 HNVLWIGCPIPSSASCWWRDLCRIDLTE--EGSWFAKNISRRVGRGDTTRFWKDCWVGQV 661 Query: 1335 SLKDCFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQ 1156 L + FPRLF ++ KEA ++E+ + WEW WRR LFVWEEELL + ++P+ Sbjct: 662 PLCESFPRLFSISLQKEALVSEIRVGGEGVSWWEWGWRRSLFVWEEELLLGLQDFISPMA 721 Query: 1155 IQKNVVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLD-RTFKWLWSCDVSSKIAV 979 + D W W LE VF+VKSAY +L + + + R +W SK+ Sbjct: 722 FSTD-DDVWYWGLEDGGVFTVKSAYLLLGRMFASFSMFNVCELRVLNSIWRSPAPSKVIA 780 Query: 978 FTWRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSW 799 F+W+LL++R+PTR+ L RGI+ A CV C E HLF C+F++ VWS + W Sbjct: 781 FSWKLLRNRIPTRDCLSRRGILAAGGSRECVHCQGREETALHLFLFCDFAFRVWSAIFQW 840 Query: 798 IGVAGVFHNDGVNHFIQHGDFF*GKHLRRTRN---LIWMAVVWSLWGMRNKIIFQGLVAD 628 +GV V N FI F + LIW VW++W RN+I+F V D Sbjct: 841 LGVVIVM---PPNLFILFDCFVGAAGCNKRAKGFLLIWHTTVWAIWRSRNEILFANGVLD 897 Query: 627 FTSVITQIKMLSWGWFVNR 571 +SVI +IK+LSW W ++R Sbjct: 898 PSSVIDEIKLLSWRWGLSR 916 >XP_019418409.1 PREDICTED: uncharacterized protein LOC109329191 [Lupinus angustifolius] Length = 953 Score = 503 bits (1294), Expect = e-161 Identities = 260/608 (42%), Positives = 362/608 (59%), Gaps = 5/608 (0%) Frame = -3 Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218 GF WR W+K+C+ S+S S+LVNGSPT +F RGLRQGDP+APFLFLI AEGL G++ Sbjct: 317 GFCFKWRNWIKSCLQSNSFSILVNGSPTSEFRMARGLRQGDPIAPFLFLIVAEGLGGIMR 376 Query: 2217 SAIERGILQGYKVSED-ISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041 SA+ + I GY V D I + LQ+ADDT+++ + S +N+ +K+I + FELVSGL++NF Sbjct: 377 SAVSKKIFTGYSVGRDEIVISHLQYADDTLLIGENSADNIMVLKSILKCFELVSGLKINF 436 Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861 KS+ GI + SF+ VA N L C +G+IP KFLG+PV ANP+R TW V+ ++KL+ Sbjct: 437 HKSSFIGIKADPSFVQVAVNRLLCGVGSIPFKFLGIPVGANPKRLSTWSLVIDTFKRKLS 496 Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681 W+ + LS GGRVTL+ SVL+S+ +Y+FSF+KAP +I L IQR FLWG E N+ + Sbjct: 497 RWQQKLLSFGGRVTLLKSVLSSLPIYYFSFFKAPVSIIHELERIQRRFLWGRGEVNKGIH 556 Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501 WV +VC K GGLG+KNL LFN L+ +++WV +L YG + Sbjct: 557 WVRWKEVCRSKEEGGLGVKNLGLFNLALLGKWRWHMLSSSESLWVKVLRSIYGVEAVVRG 616 Query: 1500 H--NTADHRFASIWWRDLH-LIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSL 1330 + + S WWRDL L D G WF++ + R+VG G+ T FW D W+G L Sbjct: 617 GLVDVECFKKGSSWWRDLGCLCNRDNGFNKGWFNEGVRRRVGSGQSTLFWRDIWVGGECL 676 Query: 1329 KDCFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQ 1150 K+CF RLFQV NK+A I+ MG W +W W WRR LF+WE++ + D LN + V++ Sbjct: 677 KNCFERLFQVTLNKDACISSMGEWRNGVWCWLLNWRRSLFLWEQDEVNDLLNKVEEVRLV 736 Query: 1149 KNVVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTW 970 + D WLW + + +SV++AY +L RN+ +K LW+ V SK+ F W Sbjct: 737 QGNEDGWLWVHDKNGTYSVRNAYKVLQNEVRNDNYLH-----YKRLWASKVPSKLKCFAW 791 Query: 969 RLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGV 790 RL +PT L RGII + C FC E+ HLFFTC+ SY VW + S G+ Sbjct: 792 RLFVGGVPTWMNLARRGIIGSLPSTLCAFCGELEESSDHLFFTCSLSYSVWQKLYSLFGI 851 Query: 789 AGVFHNDGVNHFIQHGDFF-*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVI 613 + + ++F+ H F K + IW +WSLW +RNKIIF+ + V+ Sbjct: 852 YSILPSSTGSNFLSHWHLFGEAKKFHQQWMTIWFVTIWSLWLVRNKIIFEESSFNVDEVM 911 Query: 612 TQIKMLSW 589 I + SW Sbjct: 912 FIINLHSW 919 >GAU37021.1 hypothetical protein TSUD_207270 [Trifolium subterraneum] Length = 596 Score = 486 bits (1251), Expect = e-160 Identities = 253/539 (46%), Positives = 334/539 (61%), Gaps = 4/539 (0%) Frame = -3 Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218 GF + W KWM+ CI SS+SVLVNGS T DF +GLRQGDPL+PFLFLI AEGLTG+V Sbjct: 38 GFAEGWLKWMRTCICQSSMSVLVNGSSTKDFNVFKGLRQGDPLSPFLFLIVAEGLTGMVR 97 Query: 2217 SAIERGILQGYKVSEDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNFL 2038 A+E G +GYKVSE I F ILQFADD I++ + SW+NLW +K + R FE+VSGL++NF Sbjct: 98 RAVELGKFKGYKVSESIQFQILQFADDMILMGENSWDNLWTIKTVLRGFEMVSGLKINFN 157 Query: 2037 KSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLAA 1858 KS +YGIN+E FL S FLSC IP KFLG+PV ANPRR +TW PVV+AM + + Sbjct: 158 KSKLYGINVEEDFLEAGSTFLSCRSDVIPFKFLGIPVGANPRRKETWRPVVEAMSTRFSR 217 Query: 1857 WKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVSW 1678 W G HL+ GGR+TLINSVL S+ LYFFSF+KA V+ LV+IQRNFLWGG E +K+ W Sbjct: 218 WSGSHLTYGGRITLINSVLASLPLYFFSFFKASICVLNQLVSIQRNFLWGGGLEEKKMCW 277 Query: 1677 VCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYG--QNFTIS 1504 V D VCLP+ LGGLG+KNL+LFN + D +AVW +L +YG +F ++ Sbjct: 278 VKWDHVCLPRDLGGLGVKNLKLFNIALLSKWKWRCVNDSEAVWKEVLRHRYGHLSSFILN 337 Query: 1503 SHNTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKD 1324 + + SIWW+D+ I G WF VG+G + FW++ WLG + L D Sbjct: 338 GVPISSNFKTSIWWKDMVNIGETFGYD--WFQSNTRIIVGNGNNIAFWTNRWLGNNVLSD 395 Query: 1323 CFPRLFQVAENKEANIAE--MGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQ 1150 FP LF K+A +A+ + G IWRWEWR R L EE LA+ ++ + Sbjct: 396 LFPNLFDKEAFKDAKVADRVINNNDGTIWRWEWRGR--LTEAEELDLAELQVLLTGFSLN 453 Query: 1149 KNVVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTW 970 D W W + F++KS YN+L+ V + L + LW D+ SK+ +F W Sbjct: 454 PTCCDRWKWIPDSVGDFTIKSCYNVLIHVGNTVVLSPHLLVAIRKLWKNDLPSKVGIFGW 513 Query: 969 RLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIG 793 RLL ++LPTR L R I++ + CVFC R E+ +HLFF ++W ++ W+G Sbjct: 514 RLLLEKLPTRAALAHRNILNTDDELLCVFCSRVREDSNHLFFNRVHMKYLWRRIHDWLG 572 >GAU34179.1 hypothetical protein TSUD_162800 [Trifolium subterraneum] Length = 757 Score = 489 bits (1260), Expect = e-159 Identities = 254/616 (41%), Positives = 369/616 (59%), Gaps = 7/616 (1%) Frame = -3 Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218 GF ++W WMKACIF ++SVLVNG P + +RGL+QGDPLAPFLFL+ AEG G + Sbjct: 125 GFCEVWIGWMKACIFGGNLSVLVNGCPMGEINIQRGLKQGDPLAPFLFLLVAEGFGGAMR 184 Query: 2217 SAIERGILQGYKVSED-ISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041 A+E + +G+ +S D S + LQ+ADDT+ + +AS NLW +KAI R FEL SGLRVNF Sbjct: 185 RAVEINLFKGFNISRDGPSISHLQYADDTLCIGEASIENLWTMKAILRGFELASGLRVNF 244 Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861 KS + G+N+ F+ +A FL+C G +P K+LGLPV ANPRR TWEP+V ++RKKL Sbjct: 245 WKSCLIGVNVRDDFMELACTFLNCIQGFVPFKYLGLPVGANPRRLSTWEPLVASLRKKLN 304 Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681 +W +H+SI GR+ LINSVLNS+ +++ SF K P +V++ ++ IQR FLWGG + +S Sbjct: 305 SWGHKHVSIEGRLVLINSVLNSIPIFYLSFMKMPVQVLKKVIRIQREFLWGGVNGGRNLS 364 Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHD-AVWVGLLSFKYGQNF--- 1513 W+ R VC K+ GGLGI++L+ N L D +W +L KYG + Sbjct: 365 WIKRRVVCQGKKNGGLGIRDLKAVNLSLLMKWRWRLLNSEDTGLWKEVLVAKYGGHILHN 424 Query: 1512 TISSHNTADHRFASIWWRDLHLIELDRGVQPM-WFSDALCRKVGDGEHTKFWSDTWLGVS 1336 + S + +R AS+WW+D++ +L V W ++ + R +G+G T+FWSD W+G Sbjct: 425 VVWSLGSPPYR-ASLWWKDIN--DLQACVNSKNWVAEMVTRFLGNGSRTRFWSDNWIGDV 481 Query: 1335 SLKDCFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQ 1156 L FPRLF ++ KEA ++EM G+ W + WRR LF+WEEE ++ L+++ V Sbjct: 482 LLCSKFPRLFSLSLQKEATVSEMMVVEGETKSWNFLWRRSLFLWEEERVSQLLSLLENVS 541 Query: 1155 IQKNVVDSWLWKLEGSQVFSVKSAYNMLL-TVQRNNGENQGLDRTFKWLWSCDVSSKIAV 979 + D W W L+ FSVKSAY+ LL + + + + F +W K+ V Sbjct: 542 LSLE-EDKWHWALDPDGCFSVKSAYDSLLENLDTSPNLSPYEAKIFSNIWDSPAPLKVVV 600 Query: 978 FTWRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSW 799 F+WRLL DR+PT+E LI RG++ GSCV+C E+ +HLF C + VW + W Sbjct: 601 FSWRLLHDRVPTKENLIVRGVLPRESSGSCVWCGDIRESSAHLFLHCKVALVVWYEIFRW 660 Query: 798 IGVAGVFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTS 619 +GV V + F D K ++ L+W +V+W++W RN IF + +D Sbjct: 661 LGVVIVIPPNLFTLFDYFSDSARSKKSKKGFLLVWHSVIWTIWKARNNQIFNNVTSDPFE 720 Query: 618 VITQIKMLSWGWFVNR 571 ++ K+LSW W +R Sbjct: 721 LVESAKVLSWRWSADR 736 >GAU26515.1 hypothetical protein TSUD_361480 [Trifolium subterraneum] Length = 873 Score = 484 bits (1246), Expect = e-155 Identities = 248/612 (40%), Positives = 360/612 (58%), Gaps = 4/612 (0%) Frame = -3 Query: 2394 FNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVDS 2215 F + W +W++AC+F+ ++SVLVNGSPT + +RGL+QGDPLAPFLFL+ AE GL+ + Sbjct: 244 FCNKWVEWIRACVFAGNLSVLVNGSPTTEINIQRGLKQGDPLAPFLFLLVAEDFVGLMRN 303 Query: 2214 AIERGILQGYKV-SEDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNFL 2038 A+ + +G+ + SE + + LQ+ADDT+ + D + NLW +KAI R FEL SGL+VNF Sbjct: 304 AVALNLFKGFSIGSEGLVISHLQYADDTLCIGDDTLKNLWTLKAILRGFELASGLKVNFW 363 Query: 2037 KSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLAA 1858 KS++ G+N+ ++ A NFL+C G IP +LGLPV ANPRRC TW+P+V+ +RK+L A Sbjct: 364 KSSLIGVNVSNDYMVNACNFLNCKRGVIPFMYLGLPVGANPRRCSTWDPLVERLRKRLRA 423 Query: 1857 WKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVSW 1678 W R++S+GGR+ LIN VLN++ +++ S +K P VI+ ++ IQR FLWGG + +K+SW Sbjct: 424 WGNRYVSLGGRIVLINFVLNAIPIFYLSLFKMPVLVIKKIIRIQREFLWGGVKGGRKISW 483 Query: 1677 VCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISSH 1498 V +VC P+ GGLG++++ N L A W LL KYG+ H Sbjct: 484 VKWKEVCKPRCQGGLGVRDVGKVNLSLLIKWRWRLLQPEGAFWKELLVAKYGEMVRQKLH 543 Query: 1497 --NTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKD 1324 + AS WW+D + E+D + WF+ + R+VG G+ +FW D W G S L D Sbjct: 544 WNDCPIPSRASSWWKD--ICEIDVCEEGSWFAQHVFRRVGKGDSIRFWKDCWFGNSPLCD 601 Query: 1323 CFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKN 1144 FPRLF +A +KEA + E+ + W W WRR LFVWE+ELL + P+ + Sbjct: 602 LFPRLFSIATHKEALVNEVRVVTEGLNLWNWEWRRRLFVWEQELLVSLTETL-PLLVLSG 660 Query: 1143 VVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLD-RTFKWLWSCDVSSKIAVFTWR 967 D W W+LE VF+VKS Y +L +V + + R F +W SK+ VF W+ Sbjct: 661 EEDVWYWRLEDGGVFTVKSVYTLLGSVFATDAVWSPPELRVFDQIWKSPAPSKVIVFPWK 720 Query: 966 LLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVA 787 LL++R+PT+ L RGI +CV C + E+ SHLF CNF+ VW+ + WIGV Sbjct: 721 LLRNRIPTKANLALRGIQVVGGSLNCVHCVGSGEDASHLFMYCNFAAQVWNSIFRWIGVT 780 Query: 786 GVFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVITQ 607 V + F + + +LIW +W +W RN I F D + + Sbjct: 781 IVIPPNIFLLFDCMRGAAPNNKIAKGFSLIWHTTLWVIWKSRNSISFGSGTIDLGQAVGE 840 Query: 606 IKMLSWGWFVNR 571 IK+LSW W ++R Sbjct: 841 IKLLSWRWDLSR 852 >KYP54863.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 648 Score = 470 bits (1210), Expect = e-153 Identities = 250/649 (38%), Positives = 370/649 (57%), Gaps = 5/649 (0%) Frame = -3 Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218 GF WRKW+ C+ ++ I VL+NGSPT +F +GLRQGD LAPFLFLI AEGL L+ Sbjct: 6 GFPLKWRKWIAECVSTTRIFVLLNGSPTGEFGVGKGLRQGDLLAPFLFLIVAEGLNALMS 65 Query: 2217 SAIERGILQGYKVS-EDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041 +E + GY V + +S + LQ+ADDT+++ AS +N+WA+K+I + FELV+GL+VNF Sbjct: 66 KVVECHVFSGYSVGHQSVSVSHLQYADDTLIIGGASSHNVWAIKSILQIFELVAGLKVNF 125 Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861 KS ++G NI LN+ + FL+C +G++P +LGLP+ ANPR +TWEPV+ ++K+L+ Sbjct: 126 HKSKLFGFNINIEVLNLMAQFLNCKVGSLPFCYLGLPLGANPRCIKTWEPVISKVKKRLS 185 Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681 WK LS GGR L+ SVLNS+ +Y+ SF+KAP+ +I L ++ + FLWGG E ++K++ Sbjct: 186 KWKSSTLSFGGRSVLLKSVLNSIPIYYLSFFKAPQGIISKLESLFKLFLWGGDENHRKIA 245 Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501 WV +VC K GGLGI +L FN L + W +++ YG+ Sbjct: 246 WVAWQEVCRGKEHGGLGILDLRAFNLALLGKWRWRLLVEKGRFWHRVVTSIYGEG---CF 302 Query: 1500 HNTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDC 1321 D +S WW DL I+ WFS + VGDG +T FW D W G L + Sbjct: 303 QGVGDKVQSSKWWVDLWTIDSTPYTSFDWFSSRCTKVVGDGRNTFFWKDGWSGQGPLCNR 362 Query: 1320 FPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNV 1141 + RLF +A +K+ ++A M W + W W WRR LF WE +LL+ + + ++ + Sbjct: 363 YSRLFSIASDKDVSVANMVLWRDGGFEWIWSWRRSLFQWELDLLSQLAADLGSIVLKNDC 422 Query: 1140 VDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTF---KWLWSCDVSSKIAVFTW 970 D W WK +++VKSAY ++ N G+ F K+LWS V SK++ F W Sbjct: 423 CDRWCWKDSNDGIYNVKSAYKAVI--------NGGIYADFLLHKFLWSSCVPSKVSGFAW 474 Query: 969 RLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGV 790 + L +R+P++ LI R +++ + G C +C +EN SHL F C ++Y VW +W GV Sbjct: 475 KALLNRIPSKCNLIKRKVLNISASG-CAWCGEDLENTSHLLFGCYYAYFVWLSNFAWFGV 533 Query: 789 AGVFHNDGVNHFIQHGDFF*GKHLRRTR-NLIWMAVVWSLWGMRNKIIFQGLVADFTSVI 613 + V HN +F F R R +++W+A +WSLW RN +IF+ V ++ Sbjct: 534 STVIHNSCHENFAHFNGFPRCSGRDRMRWSVVWLATIWSLWLARNDVIFKDKVVAIKDLV 593 Query: 612 TQIKMLSWGWFVNRAGRSSEISLVGLTFSRRGWAFFHNCEPLWIIFTFG 466 IK+ SW W ++ + S T S+RG+ PL TFG Sbjct: 594 ELIKLRSWNWI-----KTKDKSF--FTHSQRGFL------PLVFALTFG 629 >GAU18134.1 hypothetical protein TSUD_248350 [Trifolium subterraneum] Length = 694 Score = 468 bits (1204), Expect = e-151 Identities = 247/616 (40%), Positives = 361/616 (58%), Gaps = 7/616 (1%) Frame = -3 Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218 GF++ WR W+KAC+F+ S+SVLVNGSPT+ + +GL+QGDPLAPFLF++ AEGL L+ Sbjct: 66 GFDEKWRSWIKACVFAGSLSVLVNGSPTEQIDISKGLKQGDPLAPFLFILVAEGLGALMK 125 Query: 2217 SAIERGILQGYKVSEDISF-TILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041 A++ G +G ++S + + LQ+ADDT+ + +A NLW KAI R FEL+SGL+VNF Sbjct: 126 KAVDVGYFKGIQISTTGTIMSHLQYADDTLFVGEACVENLWTTKAILRWFELISGLKVNF 185 Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861 KS +YGIN+ +F++ A++FL C +G +P +LGLPV ANPRR TW PV++ ++K+LA Sbjct: 186 FKSKLYGINVGDNFISSAASFLKCKVGKLPFIYLGLPVGANPRRLVTWNPVIEVLQKRLA 245 Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681 +WK +++S+GGRV L+NSVL ++ +++ S +K P V + +V +QR FLWGG + K+ Sbjct: 246 SWKNKYVSLGGRVVLLNSVLAAIPIFYLSLFKMPVGVWKKIVNLQRRFLWGGVAGSSKIP 305 Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKY--GQNFTI 1507 WV VC PK+ GGLG+K+L + N L++ ++W +L +Y G++ Sbjct: 306 WVNWRDVCRPKKEGGLGVKDLRIMNISLLAKWKWRLLSEGKSIWKNVLEDRYRGGESGVG 365 Query: 1506 SSHNTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLK 1327 AS WW DL + + G + +K+G+G T+FW D+W+G LK Sbjct: 366 WMSKVWVSSKASPWWNDLMTMGVVAGEDRL--HGIFFKKIGNGGDTRFWHDSWVGTQPLK 423 Query: 1326 DCFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQK 1147 + FPRLF ++ KE ++ E+G G RW+ +WRR LFVWEEEL +N++ P+Q+ Sbjct: 424 ELFPRLFLISVQKECSVFEVGG--GVSGRWDLKWRRNLFVWEEELRELLVNVLTPIQL-I 480 Query: 1146 NVVDSWLWKLEGSQVFSVKSAYNMLL-TVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTW 970 N D W +FSV S Y L + + L R +LW SK+ VF+W Sbjct: 481 NKEDEWRCHYFNGSLFSVSSLYKYLSGIIIPPISRDPELVRDLGFLWESLAPSKVIVFSW 540 Query: 969 RLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGV 790 +LL RLPT+ L RGI++ + CV C E HLF C F+ +WS V W G Sbjct: 541 QLLLSRLPTKANLAIRGIVEHDSNSFCVLCPMNTECEGHLFGWCAFASRIWSRVFDWFGW 600 Query: 789 AGVFHNDGVNHFIQHGDFF*GK-HLRRTRNL--IWMAVVWSLWGMRNKIIFQGLVADFTS 619 GV D F F G+ RR + L +W VVW++W RN +IF V Sbjct: 601 GGVVPRDPREIF---QSFCRGRPGGRRIKGLLAVWHVVVWAIWRARNDLIFNSKVPVLED 657 Query: 618 VITQIKMLSWGWFVNR 571 V+ I LSW W + + Sbjct: 658 VLHSIMSLSWKWLLEK 673 >KYP34591.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan] Length = 817 Score = 472 bits (1214), Expect = e-151 Identities = 247/607 (40%), Positives = 363/607 (59%), Gaps = 5/607 (0%) Frame = -3 Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218 GF++ W +WM+AC+ S+S LVNGSPT + RGL+QGDPLAP LFLIAAEGL L+ Sbjct: 220 GFHERWVRWMEACVCGGSLSTLVNGSPTAEVSLGRGLKQGDPLAPSLFLIAAEGLRLLMS 279 Query: 2217 SAIERGILQGYKVS-EDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041 A++ + +G + E ++LQFADDT+++ +A+ NLW +KAI R FEL+SG+++NF Sbjct: 280 RALDMNLFKGLHIGGEGPPISLLQFADDTLIIGEATMQNLWCLKAILRGFELISGMKINF 339 Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861 KS + GI+ F N+A+ FL C +G +P K LGLP+ ANPR+ TW+P++ ++RK+L+ Sbjct: 340 HKSCVVGIHSGADFTNLAAAFLHCKVGQLPFKHLGLPLGANPRKLYTWKPMLDSLRKRLS 399 Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681 +WK +HLSIGGRVTLINSVLN++ ++F SF+KAP VI+ +VAIQR+FLW G ++ K+ Sbjct: 400 SWKYKHLSIGGRVTLINSVLNAIPIHFLSFFKAPNLVIKEIVAIQRDFLWRGVKDGSKIP 459 Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501 WV + VC K GGLGIK++ LFN + +W +L +YG+ + S+ Sbjct: 460 WVKWETVCKSKVEGGLGIKDVRLFNWALLGKWVWKCMQSPGMLWAKVLHHRYGRIESFSN 519 Query: 1500 HNTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDC 1321 + D R S+WW+D+ + L +G W + + R +GDG T+FW D W+G L + Sbjct: 520 CSNVDRR-TSLWWKDIVWV-LHQG--NCWLDEKIERCIGDGTMTRFWEDKWIGGLRLLEV 575 Query: 1320 FPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNV 1141 FPRLF A + + +A+ G W G W W+ +WRR FV E + L+++ +Q+ + Sbjct: 576 FPRLFSFALDPLSVVADNGTWEGSTWVWQVKWRREPFVHEGRSVNILLDMLKGLQVISSR 635 Query: 1140 VDSWLWKLEGSQVFSVKSAYNMLLTVQRNNG----ENQGLDRTFKWLWSCDVSSKIAVFT 973 D W W + VFSVKSAY L +QR+ G + K LW C K VF Sbjct: 636 QDYWRWIYDKDGVFSVKSAY---LWLQRSVGGELRNSSDFQLVIKRLWKCKAPIKCLVFC 692 Query: 972 WRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIG 793 W++L + P + L RG+ N+ C FC VEN HLF C +++ W V W+ Sbjct: 693 WQVLLNAFPCKSLLQVRGVELENN--LCSFCSLFVENPLHLFLMCPMAFNTWLAVAKWLE 750 Query: 792 VAGVFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVI 613 V VF N +H++ + + + ++W++V+WSLW RN IIFQ D V+ Sbjct: 751 VTVVFPNSIFSHYLYWTNLGIYEKHSQLLRVVWVSVIWSLWLHRNVIIFQQGTIDAKEVL 810 Query: 612 TQIKMLS 592 IK+ S Sbjct: 811 DNIKLRS 817 >GAU50434.1 hypothetical protein TSUD_134890, partial [Trifolium subterraneum] Length = 712 Score = 466 bits (1199), Expect = e-150 Identities = 245/577 (42%), Positives = 332/577 (57%), Gaps = 4/577 (0%) Frame = -3 Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218 GF W KWM+ACIF+SS+ VLVNGSP +DF ++GLRQGDPL+PFLFLI AE LT L+ Sbjct: 166 GFAPRWLKWMRACIFNSSMPVLVNGSPMEDFVVEKGLRQGDPLSPFLFLIIAERLTRLMQ 225 Query: 2217 SAIERGILQGYKVSEDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNFL 2038 A++ G G+KV +D+ F LQFADDT+++ + +W NLW++K + RSFELVSGL+VNF Sbjct: 226 KAVDNGNYHGFKVRDDLQFHTLQFADDTVLVGEGNWENLWSLKTVLRSFELVSGLKVNFF 285 Query: 2037 KSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLAA 1858 KS +YGIN++ +FL+ AS+FL C + +IP +FLG+PV ANPRR TW PVV+AM+K+L A Sbjct: 286 KSKLYGINLDDNFLSAASSFLHCEVDSIPFRFLGIPVGANPRRKITWNPVVEAMKKRLNA 345 Query: 1857 WKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVSW 1678 W R+LSIGGRVTLINS PK V WGG + +K+ W Sbjct: 346 WNCRNLSIGGRVTLINS--------------HPKEV-----------SWGGCSDIKKICW 380 Query: 1677 VCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYG---QNFTI 1507 V D +CLPK GGLGIKNL FN L DH+ +W LL +YG NF Sbjct: 381 VSWDTICLPKDKGGLGIKNLNCFNQALLCKWKWRGLCDHNTLWTKLLEHRYGSLADNFL- 439 Query: 1506 SSHNTADHRFASIWWRDLHLIELDRGVQ-PMWFSDALCRKVGDGEHTKFWSDTWLGVSSL 1330 T D + S+WWRD+ +I G++ WF + +G+G +FW +TW G L Sbjct: 440 -RDTTRDVKGQSLWWRDIMMI---GGIENDAWFRFNVRNVLGNGTCIRFWHETWHGPVCL 495 Query: 1329 KDCFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQ 1150 KD FP+L+ + EA I ++G W W W +W L E + + N++ +Q Sbjct: 496 KDLFPQLYCKSPQAEAIIYDVGKWVNQQWVWNLQWSTNLTSTEHDAACELANLLTGIQPS 555 Query: 1149 KNVVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTW 970 D W L + +FSVKS Y L + + + + + LW DV SK+++F W Sbjct: 556 LECADRRRWGLTQTGMFSVKSTYEFLQSREVVVAIEDNVVKALQLLWLNDVPSKVSIFGW 615 Query: 969 RLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGV 790 RLL RLPTR L + II H SC+FC E SHL F C FS +W + W+ V Sbjct: 616 RLLLSRLPTRMALARKNIIVNLHELSCIFCGEEQEELSHLLFNCPFSQELWKRIFKWMNV 675 Query: 789 AGVFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVW 679 + ++G HF G K + R++IW+A W Sbjct: 676 DFISFDEGWKHFFAFGALLENKKFEKARHVIWLATTW 712 >KYP40876.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 751 Score = 466 bits (1199), Expect = e-150 Identities = 249/635 (39%), Positives = 361/635 (56%), Gaps = 5/635 (0%) Frame = -3 Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218 GF++ W +WM+ C+ S+S LVNGSPT + RGL+QGDPLAP LFLIA EGL L+ Sbjct: 127 GFDERWVRWMEGCVCGGSLSALVNGSPTGEVAIGRGLKQGDPLAPSLFLIAVEGLRLLMT 186 Query: 2217 SAIERGILQGYKVS-EDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041 A++ + +G + E ++LQFADDT+++ +A+ NLW +KAI R FEL+SG+++NF Sbjct: 187 RALDMNLFKGLHLGGEGPLISLLQFADDTLIIGEATMQNLWCLKAILRCFELISGMKINF 246 Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861 KS++ GI+ F +A++FL C +G +P K LGLP+ ANPR+ TW P++ +R +L+ Sbjct: 247 HKSSVVGIHSGVDFTELAASFLHCKVGQLPFKHLGLPLGANPRKLATWRPILDGLRNRLS 306 Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681 +WK R+LSIGGRVTLIN+VLN+M ++F SF+KAP VI+ +VAIQR FLW G E+ K+ Sbjct: 307 SWKHRYLSIGGRVTLINAVLNAMPIHFLSFFKAPNSVIKEIVAIQRGFLWRGVEDGSKIP 366 Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501 WV + VC K GGLGIK++ LFN + +W +L +YG+ + S Sbjct: 367 WVKWETVCKSKDEGGLGIKDVRLFNWALLGKWVWRCMLYPSTMWAKVLQGRYGRIESFSK 426 Query: 1500 HNTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDC 1321 + D R S WW+D+ + L +G W + + R +GDG T+FW D W+G L D Sbjct: 427 TSNVDRR-DSWWWKDIVWV-LQQG--NFWLDEKIDRCIGDGTSTRFWEDKWIGGLRLLDV 482 Query: 1320 FPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNV 1141 FPRL+ A + + + G W G W W+ +WRR FV E + L ++ +QI + Sbjct: 483 FPRLYSFAFDPLSMVGHNGNWEGSTWLWQVKWRREPFVHEVGSVNSLLEMLQGLQIFSSK 542 Query: 1140 VDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTF----KWLWSCDVSSKIAVFT 973 D W W + VFSVKSAY+ L QR+ G F K LW C K VF Sbjct: 543 QDQWRWICDKDGVFSVKSAYSWL---QRSLGGELSYSSDFHLVIKSLWKCKAPIKCLVFC 599 Query: 972 WRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIG 793 W++ + P + L RG+ N+ C C +E+ HLF C +++ W V W+ Sbjct: 600 WQVFMNAFPCKSLLQVRGVELENN--LCSLCSFFIEDPLHLFLMCPMAFNTWLSVAKWLE 657 Query: 792 VAGVFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVI 613 V V N + ++ + K + ++W++V+WSLW RN IIFQ V D V+ Sbjct: 658 VEVVLPNSLTSLYLYWTNLGIYKKSTQCFKVVWVSVIWSLWLHRNGIIFQQGVMDCKEVL 717 Query: 612 TQIKMLSWGWFVNRAGRSSEISLVGLTFSRRGWAF 508 IK+ SW W + S+ G +FS W F Sbjct: 718 DNIKLRSWKWI--------KSSVPGCSFSYSSWYF 744 >GAU46725.1 hypothetical protein TSUD_100170 [Trifolium subterraneum] Length = 776 Score = 466 bits (1199), Expect = e-149 Identities = 244/618 (39%), Positives = 360/618 (58%), Gaps = 5/618 (0%) Frame = -3 Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218 GF WR WMKAC++ ++SVLVNGSPT + RGL+QGDPLAP LFL+ AEGL GL Sbjct: 151 GFCPKWRAWMKACVWGGNVSVLVNGSPTQEIPIMRGLKQGDPLAPLLFLLVAEGLGGL-- 208 Query: 2217 SAIERGILQGYKVSEDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNFL 2038 A+E + + V + ++LQ+ADDT+ + +A+ NLW +KA+ R FE+ SGL+VNF Sbjct: 209 RAVEINRFRPFLVGDGAPVSLLQYADDTLCIGEATVENLWVMKAVLRGFEMTSGLKVNFW 268 Query: 2037 KSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLAA 1858 KS + G+N+ FL +AS+FL+C IG P K+LGLPV AN R+ TWEP++ +R ++++ Sbjct: 269 KSCVIGVNVSEEFLGMASDFLNCRIGKTPFKYLGLPVGANSRKMSTWEPMLDTIRGRISS 328 Query: 1857 WKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVSW 1678 W +++S+GGR+ LIN+VLN++ +++ S+ K P +V R LV IQRNFLWGG K W Sbjct: 329 WSCKYVSLGGRIVLINAVLNAIPIFYLSYMKMPTKVWRQLVKIQRNFLWGGLSNRSKTCW 388 Query: 1677 VCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLT-DHDAVWVGLLSFKYGQNFTISS 1501 V D +C PK GLGI++L L N L+ + + VW ++ +YG + I + Sbjct: 389 VKWDDICRPKNEVGLGIRDLRLVNTSLLAKWRWKILSHEEEEVWKQIVKARYGSD-VIGN 447 Query: 1500 H--NTAD-HRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSL 1330 AD R S WWRD+ +E + WFS A+ +KVG G+ T FW++ W+G SL Sbjct: 448 RCLGAADIPRSTSNWWRDICNLEGEFS----WFSSAVGKKVGRGDSTSFWNEIWIGDQSL 503 Query: 1329 KDCFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQ 1150 + FPRLF ++ K+ I +G+ W+WE WRR F WEE+ +F++++AP Sbjct: 504 RQRFPRLFGISLQKQEVIQNLGSLTEGRWQWELLWRRDRFQWEEDQYREFIDVIAPFAPV 563 Query: 1149 KNVVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDR-TFKWLWSCDVSSKIAVFT 973 N D WLW +G Q F+VKSAY L + N + ++ FK LW C SK+ F Sbjct: 564 DN-HDRWLWLGDGIQGFTVKSAYMRLENLVNNRRILEPVENFVFKRLWKCAAPSKVHAFV 622 Query: 972 WRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIG 793 W+LL DR+ T+ L ++ ++ +CV C +E HLF C++ VW + W+G Sbjct: 623 WQLLLDRVQTKANLFKCKMLHSDQ-QTCVLCDGKIETAVHLFLHCDWVAKVWYEITRWLG 681 Query: 792 VAGVFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVI 613 + + F K ++ LIW +W +W RN IF + A V+ Sbjct: 682 FTLIIPPNLAISFAMWATCVSNKKEKKGICLIWNVFMWVVWKTRNGCIFNNMAAICEEVV 741 Query: 612 TQIKMLSWGWFVNRAGRS 559 QIK++SW WF+ R ++ Sbjct: 742 EQIKVMSWQWFIGRMAKA 759 >GAU24087.1 hypothetical protein TSUD_388800 [Trifolium subterraneum] Length = 1985 Score = 487 bits (1254), Expect = e-148 Identities = 243/622 (39%), Positives = 376/622 (60%), Gaps = 8/622 (1%) Frame = -3 Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218 GF WR WM+AC+ + ++SVLVNGSPT++ +RGL+QGDPLAP LFLI AEGL L+ Sbjct: 1358 GFGTKWRNWMRACVCAGNMSVLVNGSPTEEISIRRGLKQGDPLAPLLFLIVAEGLGALMR 1417 Query: 2217 SAIERGILQGYKVSED-ISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041 SA+ERG + + V + +ILQ+ADDT+ + +A+ NLWA+KA+ R FEL SGL+VNF Sbjct: 1418 SAVERGRFKPFVVGRGALPVSILQYADDTLCIGEATTENLWALKAMLRGFELASGLKVNF 1477 Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861 KS + G+N+ FL AS FL+C IG +P +LGLPV ANPRR TW+P+V+ ++++L Sbjct: 1478 WKSCIMGVNVSQDFLLAASGFLNCRIGCLPFMYLGLPVGANPRRYSTWQPMVEGIKRRLR 1537 Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681 +W +++S+GGR+ +IN+VL+S+ ++F S+ K P V + +V +QRNFLWGG + +++ Sbjct: 1538 SWGNKYISLGGRIVMINAVLSSIPIFFLSYMKMPLMVWKEIVTLQRNFLWGGLSKRRRIC 1597 Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501 WV ++C PK+ GGL I++L N L++ + VW ++ KYG + Sbjct: 1598 WVKWAEICKPKKEGGLSIRDLRTVNLSLLAKWRWKLLSEEEEVWKNVIIAKYG--IHMLG 1655 Query: 1500 HNTADHR----FASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSS 1333 + D R +S+WWRD L LD+GV WF+ + +G G KFW + W+G S Sbjct: 1656 NARLDERDIGSMSSLWWRD--LCRLDKGVG--WFNHFARKYLGCGNSIKFWKEVWVGGQS 1711 Query: 1332 LKDCFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQI 1153 L+ FPRLF ++ ++ + E+G+W +WRW RWRR LFVWEE+L+++ ++ + I Sbjct: 1712 LELQFPRLFGISVQQDDMVREVGSWVNGVWRWGLRWRRVLFVWEEDLVSELELVLNNISI 1771 Query: 1152 QKNVVDSWLWKLEGSQVFSVKSAY---NMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIA 982 + D W+W+L F+VKS Y + LLT + + ++ +W V SK++ Sbjct: 1772 TEE-EDRWVWRLNVGDGFTVKSLYEALDPLLTPRCLVSSFESF--AYRSIWKSAVPSKVS 1828 Query: 981 VFTWRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNS 802 W+L DR+PT+ L RGI+ +H SCV C E HLF C+++ +W V Sbjct: 1829 ALAWQLFLDRIPTKVNLYKRGILRMDH-ASCVLCGEEAETARHLFLHCDYAAGIWYAVCR 1887 Query: 801 WIGVAGVFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFT 622 W+GV V D + + K +R+ ++WMA +W +W +RN+ +F+ + T Sbjct: 1888 WLGVFAVLPADVMMSYGLLVGCGRNKKIRKGFAIVWMAFIWVIWKVRNERVFKNATVEVT 1947 Query: 621 SVITQIKMLSWGWFVNRAGRSS 556 + ++ LSW W++N+ SS Sbjct: 1948 DAVDMVQRLSWQWYLNKMASSS 1969 >KYP63901.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 616 Score = 456 bits (1173), Expect = e-148 Identities = 244/626 (38%), Positives = 357/626 (57%), Gaps = 5/626 (0%) Frame = -3 Query: 2370 MKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVDSAIERGILQ 2191 M+ C+ S+S LVNGSPT + RGL+QGDPLAP LFLIA EGL L+ A++ + + Sbjct: 1 MEGCVCGGSLSALVNGSPTVEVTIGRGLKQGDPLAPSLFLIAVEGLRLLMTRALDMNLFK 60 Query: 2190 GYKVS-EDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNFLKSNMYGIN 2014 G ++ E ++LQFADDT+++ +A+ NLW +KAI R FEL+SG+R+NF KS++ GI+ Sbjct: 61 GLQLGGEGPLISLLQFADDTLIIGEATMQNLWCLKAILRCFELISGMRINFHKSSVVGIH 120 Query: 2013 IEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLAAWKGRHLSI 1834 F +A++FL C +G +P K LGLP+ ANPR+ TW P++ +RK+L++WK R+LSI Sbjct: 121 SGEDFTELAASFLHCKLGQLPFKHLGLPLGANPRKLATWRPILDGLRKRLSSWKHRYLSI 180 Query: 1833 GGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVSWVCRDKVCL 1654 GGRVTLIN+VLN+M ++F SF+KAP VI+ +VAIQR+FLW G ++ K+ WV + VC Sbjct: 181 GGRVTLINAVLNAMPIHFLSFFKAPNSVIKEIVAIQRDFLWRGVKDGSKIPWVKWETVCK 240 Query: 1653 PKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISSHNTADHRFA 1474 K GGLGIK++ LFN + +W +L +YG + D R Sbjct: 241 SKDKGGLGIKDVRLFNWALLGKWVWRCMISPRTIWAKVLQGRYGCIESFPKTPNVDKR-D 299 Query: 1473 SIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFQVAE 1294 S WW+D+ + + +G W + + R +GDG T+FW D W+G L D FPRL+ A Sbjct: 300 SWWWKDIVWV-IQQG--NYWLDEKIERCIGDGSSTRFWEDKWIGGLRLLDVFPRLYSFAF 356 Query: 1293 NKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKLE 1114 + + + G W G W W+ +WRR FV EE + + ++ +QI + D W W + Sbjct: 357 DPLSMVGHNGNWEGSTWLWQVKWRREPFVHEEGSVNTLIEMLQEIQIFSSKQDQWRWICD 416 Query: 1113 GSQVFSVKSAYNMLLTVQRNNGENQGLDRTF----KWLWSCDVSSKIAVFTWRLLQDRLP 946 VFSVKSAY+ L Q + G F K LW C K VF W++ + P Sbjct: 417 KDGVFSVKSAYSWL---QHSMGGELSYSSDFILVTKSLWKCKAPIKCLVFCWQVFMNAFP 473 Query: 945 TREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDG 766 + L RG+ N+ C C +E+ HLF C ++++W V +W+ V V N Sbjct: 474 CKSLLQVRGVEVENN--LCSLCSLFIEDPIHLFLMCPMAFNIWLSVANWLEVEVVLPNSL 531 Query: 765 VNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVITQIKMLSWG 586 + ++ + K ++ ++W++V+WSLW RN IIFQ V D V+ IKM SW Sbjct: 532 TSLYLYWTNLGIYKKSKQCFKVVWVSVIWSLWLHRNGIIFQQGVMDCKEVLDNIKMRSWK 591 Query: 585 WFVNRAGRSSEISLVGLTFSRRGWAF 508 W + S+ G +FS W F Sbjct: 592 WI--------KSSVPGCSFSYSNWYF 609 >KYP50779.1 Transposon TX1 uncharacterized [Cajanus cajan] Length = 1102 Score = 471 bits (1211), Expect = e-148 Identities = 249/635 (39%), Positives = 364/635 (57%), Gaps = 5/635 (0%) Frame = -3 Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218 GF++ W +WM+ C+ S+S LVNGSPT + RGL+QGDPLAP LFLIA EGL L+ Sbjct: 478 GFDERWVRWMEGCVCGGSLSALVNGSPTVEVTIGRGLKQGDPLAPSLFLIAVEGLRLLMT 537 Query: 2217 SAIERGILQGYKVS-EDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041 A++ + +G ++ E ++LQFADDT+++ +A+ NLW +KAI R FEL+SG+R+NF Sbjct: 538 RALDMNLFKGLQLGGEGPLISLLQFADDTLIIGEATMQNLWCLKAILRCFELISGMRINF 597 Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861 KS++ GI+ F +A++FL C +G +P K LGLP+ ANPR+ TW P++ +RK+L+ Sbjct: 598 HKSSVVGIHSGEDFTELAASFLHCKLGQLPFKHLGLPLGANPRKLATWRPILDGLRKRLS 657 Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681 +WK R+LSIGGRVTLIN+VLN+M ++F SF+KAP VI+ +VAIQR+FLW G ++ K+ Sbjct: 658 SWKHRYLSIGGRVTLINAVLNAMPIHFLSFFKAPNSVIKEIVAIQRDFLWRGVKDGSKIP 717 Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501 WV + VC K GGLGIK++ LFN + +W +L +YG + Sbjct: 718 WVKWETVCKSKDKGGLGIKDVRLFNWALLGKWVWRCMISPRTIWAKVLQGRYGCIESFPK 777 Query: 1500 HNTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDC 1321 D R S WW+D+ + L +G W + + R +GDG T+FW D W+G L D Sbjct: 778 TPNVDKR-DSWWWKDIVWV-LQQG--NYWLDEKIERCIGDGSSTRFWEDKWIGGLRLLDV 833 Query: 1320 FPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNV 1141 FPRL+ A + + + G W G W W+ +WRR FV EE + + ++ +QI + Sbjct: 834 FPRLYSFAFDPLSMVGHNGNWEGSTWLWQIKWRRETFVHEEGSVNTLIEMLQEIQIFSSK 893 Query: 1140 VDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTF----KWLWSCDVSSKIAVFT 973 D W W + VFSVKSAY+ L Q + G F K LW C K VF Sbjct: 894 QDQWRWICDKDGVFSVKSAYSWL---QHSMGGELSYSSDFILVTKSLWKCKAPIKCLVFC 950 Query: 972 WRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIG 793 W++ + P + L RG+ N+ C C +E+ HLF C ++++W V +W+ Sbjct: 951 WQVFMNAFPCKSLLQVRGVEVENN--LCSLCSLFIEDPIHLFLLCPMAFNIWLSVANWLE 1008 Query: 792 VAGVFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVI 613 V V N + ++ + K ++ ++W++V+WSLW RN IIFQ V D V+ Sbjct: 1009 VEVVLPNSLTSLYLYWTNLGIYKKSKQCFKVVWVSVIWSLWLHRNGIIFQQGVMDCKEVL 1068 Query: 612 TQIKMLSWGWFVNRAGRSSEISLVGLTFSRRGWAF 508 IKM SW W + S+ G +FS W F Sbjct: 1069 DNIKMRSWKWI--------KSSVPGCSFSYSNWYF 1095 >GAU43915.1 hypothetical protein TSUD_88880 [Trifolium subterraneum] Length = 691 Score = 457 bits (1175), Expect = e-147 Identities = 246/610 (40%), Positives = 358/610 (58%), Gaps = 5/610 (0%) Frame = -3 Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218 GF LWRKW+K C+ +++ SVLVNGSP+D+F +RGLRQGDPL+PFLFL+AAEGL L++ Sbjct: 66 GFPTLWRKWIKECVCTATASVLVNGSPSDEFPLERGLRQGDPLSPFLFLLAAEGLHVLME 125 Query: 2217 SAIERGILQGYKV--SEDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVN 2044 + + I GY+V S IS + LQFADDT+++ W N+ A++A+ FE +SGL+VN Sbjct: 126 AMEVQNIFTGYRVGNSAPISVSHLQFADDTLLMGTKCWANVRALRAVLVLFETMSGLKVN 185 Query: 2043 FLKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKL 1864 F KS + G+NI S+L A++ L C +G IP +LGLP+ +PRR WEPV+ ++K+L Sbjct: 186 FNKSMLVGVNISDSWLGEAASGLGCRVGKIPFLYLGLPIGGDPRRLSFWEPVLTRLKKRL 245 Query: 1863 AAWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKV 1684 + WK R LS GGR+ L+ SVL S+ +Y SF+KAP I ++ +I WGG E+ +K+ Sbjct: 246 SGWKSRFLSFGGRLVLLKSVLTSLPVYALSFFKAPSGTISSIESILIKIFWGGCEDFRKI 305 Query: 1683 SWVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYG-QNFTI 1507 SWV +CL K GGLG++ L FN L D + +W +L+ +YG + ++ Sbjct: 306 SWVYWKTICLQKEYGGLGVRKLREFNLALLGKWCWRMLVDREGLWFRVLAARYGVERGSL 365 Query: 1506 SSHNTADHRFASIWWRDL-HLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSL 1330 + T S WWR++ H+ + WF + R+VGDG T FW+D W+ S L Sbjct: 366 CAGGTR----GSSWWREVAHIRDGGGEAAGGWFGGNISRQVGDGSDTFFWTDPWVDGSPL 421 Query: 1329 KDCFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQ 1150 + F RLF +A NK ++A+M I W W RPL WEEELL + ++ + +Q Sbjct: 422 SERFGRLFDLAVNKSDSVADMFQLGWGIGGDAWVWGRPLRAWEEELLGECQTLLLTISLQ 481 Query: 1149 KNVVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTW 970 + +D WLW+L+ ++++ AY LLT Q + LD +W V K+++ W Sbjct: 482 AHSLDRWLWRLDVDGGYTIQGAY-QLLTAQ----DAVPLDAATGLIWHPQVPLKVSILAW 536 Query: 969 RLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGV 790 RLL DRLPT+ L RGI+ A CV AVE+ H+F +C+ +WS+V SW+G Sbjct: 537 RLLLDRLPTKVNLSYRGILPAG-DSLCVSGCGAVESAQHVFLSCSTFGSLWSLVRSWVGS 595 Query: 789 AGVFHNDGVNHFIQHGDFF*GKHLRRT-RNLIWMAVVWSLWGMRNKIIFQGLVADFTSVI 613 A V +HFIQ G RR+ LIW+A VW +W RN +F+G ++ Sbjct: 596 ASVTAQTLSDHFIQFTTSAGGTRARRSFMQLIWLACVWVVWTERNHRLFRGSANSSLHML 655 Query: 612 TQIKMLSWGW 583 +IK S+ W Sbjct: 656 DKIKTFSFRW 665 >GAU17363.1 hypothetical protein TSUD_232390 [Trifolium subterraneum] Length = 693 Score = 451 bits (1159), Expect = e-144 Identities = 246/615 (40%), Positives = 359/615 (58%), Gaps = 11/615 (1%) Frame = -3 Query: 2394 FNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVDS 2215 F +WR W+ C+ S++ SVLVNG PTD+F +RGLRQGDPL+PFL+L+AAEGL ++ S Sbjct: 68 FPRVWRGWIMECVSSATASVLVNGCPTDEFSLERGLRQGDPLSPFLYLLAAEGLHIMMTS 127 Query: 2214 AIERGILQGYKV--SEDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041 A+ + Y + + ++S + LQFADDT+++ SW N+ +KA+ FE +SGL+VNF Sbjct: 128 AVSNHLFMPYNIGNANEVSVSHLQFADDTLLIGAKSWANIRTLKAVLILFESISGLKVNF 187 Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861 KS ++G+N+ S+L+ A++ L C G +P +LGLP+ + R+ Q W P+V +R +L+ Sbjct: 188 HKSILFGVNVNISWLHAAASVLRCKHGRLPFLYLGLPIGGDSRKLQFWSPLVNRIRDRLS 247 Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681 WK ++LSIGGR+ L+ SVL+S+ +YF SF+KAP +I L +I FLWGGSEEN+K+S Sbjct: 248 GWKCKNLSIGGRLILLKSVLSSIPVYFLSFFKAPSGIISALESIFCQFLWGGSEENRKLS 307 Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501 W+ D +CL + GGLG++ L+ FN L D+ W +L KYGQ + Sbjct: 308 WIKWDTICLQREHGGLGVRRLKEFNISLLGKWVWRLLEAGDSFWCEVLRAKYGQ---MGG 364 Query: 1500 HNTADHRFASIWWRDLHLIELDRG-VQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKD 1324 S WWR L+ I G + W D RKVGDG +T FW++ WL L Sbjct: 365 RVCFSEGVGSSWWRTLNHIRDGVGLMDSRWLKDNNIRKVGDGRNTLFWTEPWLEDCPLDR 424 Query: 1323 CFPRLFQVAENKEANIAEMGAWHGDIWRWE---WRWRRPLFVWEEELLADFLNIMAPVQI 1153 F RLF +AENK +A+M HG W + W+WRR L WEEEL+ D + ++ V + Sbjct: 425 SFSRLFDLAENKFITVADM---HGLGWGVDGEAWKWRRRLRAWEEELVLDCVERLSNVVL 481 Query: 1152 QKNVVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFT 973 Q NV D W+WKL S ++V+SAY L N N+G +R +LW + K+ +F Sbjct: 482 QVNVHDRWVWKLHPSHCYTVRSAYAFLTATDIN--LNEGFNR---FLWLKSIPLKVNIFV 536 Query: 972 WRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIG 793 WRL +RLPTR+ L RGI+DA+ C R +E+ HLFF C +W+ V+ W+ Sbjct: 537 WRLFLNRLPTRDNLFRRGILDASMLACATSCGR-MEDVDHLFFQCPVYSRLWASVSKWME 595 Query: 792 VAGVFHNDGVNHFIQHGDFF*G-----KHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVAD 628 V FH I H + F G K +IW+AV++ +W RN IF+ Sbjct: 596 VETAFHGT----LILHSNQFCGLGGSSKSYNTLLIIIWVAVLFIIWKGRNHHIFKAGQDS 651 Query: 627 FTSVITQIKMLSWGW 583 +++ ++K S+ W Sbjct: 652 LEAMVEKVKFQSYCW 666 >GAU43110.1 hypothetical protein TSUD_373050 [Trifolium subterraneum] Length = 1099 Score = 461 bits (1185), Expect = e-144 Identities = 236/615 (38%), Positives = 359/615 (58%), Gaps = 6/615 (0%) Frame = -3 Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218 GF + W W + C+F+ ++SVLVNGSPT + +RGL+QGDPLAPFLFL+ EG +G++ Sbjct: 468 GFCEKWIGWTRGCVFAGNLSVLVNGSPTPEINIQRGLKQGDPLAPFLFLLVVEGFSGVMR 527 Query: 2217 SAIERGILQGYKVSED-ISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041 A+E + +G+ + + + LQ+ADDT+ + +AS NLW++KAI R FE+VSGL+VNF Sbjct: 528 RAVELNLFKGFNIGRGLVEISHLQYADDTLCIGEASVENLWSLKAILRGFEMVSGLKVNF 587 Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861 KS + GIN+ +FL +A+ FL+C +G+IP K+LGLPV ANP+ TWEP++ +RK+L Sbjct: 588 WKSGLMGINVSPTFLTMAATFLNCRLGSIPFKYLGLPVGANPKNGSTWEPLLDHLRKRLN 647 Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681 +W+ +H+S GGR+ ++N+VLN++ +++ S K P V + +V +QR FLWGG + K+ Sbjct: 648 SWRNKHISFGGRIVMLNAVLNAIPIFYLSLLKMPVNVWKQVVRLQRVFLWGGVKGGNKIK 707 Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501 WV VC K GGLG++++ + N L +W +L KYG++ + Sbjct: 708 WVKWSVVCRAKNKGGLGVRDVRIVNLSLLAKWRWRLLLPGRPLWKEILVAKYGEHI-LHR 766 Query: 1500 HNTADHRF---ASIWWRDLHLIELDRGVQPM-WFSDALCRKVGDGEHTKFWSDTWLGVSS 1333 + +D+R AS WW+D + +D+ V+ W + + RKVG+G T FWS W+G + Sbjct: 767 VDWSDYRIPSSASKWWKD--ICSIDKVVEDKNWLVEEVGRKVGNGNSTSFWSTKWIGDAP 824 Query: 1332 LKDCFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQI 1153 L FPRLF ++ +K+ + + GD RW + WRR LF WE + L ++ Sbjct: 825 LSVIFPRLFSLSNHKDCMVRDFYEDDGDNERWRFSWRRELFQWEVDRLTRLKELLVSFVF 884 Query: 1152 QKNVVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGE-NQGLDRTFKWLWSCDVSSKIAVF 976 + DSW+W+ + VFSVKSAYN+L+ R+ E + F+ +W SK+ F Sbjct: 885 SSD-DDSWIWRPDPDGVFSVKSAYNLLIEELRSGEELEEEAALIFEQIWESPAPSKVIAF 943 Query: 975 TWRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWI 796 +W+LL DR+PTR L RG++ + CV C +VE +HLF C + VW V WI Sbjct: 944 SWQLLYDRIPTRRNLEVRGLLGLDSPWECVGCVGSVETTTHLFLHCPSALMVWYEVFRWI 1003 Query: 795 GVAGVFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSV 616 GV V + F K R +IW A +W +W RN IF + Sbjct: 1004 GVIIVTPPSMMILFEVLRGSARNKKTRLGFLMIWHATIWCIWRARNNSIFANGSFSPKVI 1063 Query: 615 ITQIKMLSWGWFVNR 571 + +IK+LSW W ++R Sbjct: 1064 VEEIKVLSWKWCLSR 1078 >KYP44439.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan] Length = 1142 Score = 461 bits (1187), Expect = e-144 Identities = 233/608 (38%), Positives = 354/608 (58%), Gaps = 3/608 (0%) Frame = -3 Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218 GF +W W+ C+ +S +S+LVNGSPT++F + LRQGDPLAPFLFLI EGL L + Sbjct: 521 GFPTIWCTWIAECLKTSRMSILVNGSPTEEFGVSKELRQGDPLAPFLFLIVEEGLFMLFN 580 Query: 2217 SAIERGILQGYKVSED-ISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041 A + +G V +D + ILQ+ADDT+++ AS++N+W +K+I R FEL SGL+VNF Sbjct: 581 KASQLERFKGCLVGKDKVPVDILQYADDTLIMGHASYSNIWTIKSILRLFELASGLKVNF 640 Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861 KS G NIE +L + ++ L +G+ P +LGLP+ AN R TW PV++ ++K+L+ Sbjct: 641 SKSTFMGYNIESQWLQIMASVLHFRVGSTPFSYLGLPIGANHRISSTWHPVIEKVKKRLS 700 Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681 WK LS GGR+ L+ SVL+S+ +YF SF KAPK +I ++ ++ ++FLWG ++N+K++ Sbjct: 701 RWKCTTLSFGGRIALLKSVLHSIPIYFLSFLKAPKGIISSIESLFKSFLWGADQDNRKIN 760 Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501 WV D VC K GGLG+K+L FN L + +++WV ++ Y I+S Sbjct: 761 WVAWDVVCRDKIHGGLGMKDLSAFNLSLLGKWHWRMLVEKNSLWVRVIRSLY----DIAS 816 Query: 1500 H-NTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKD 1324 H S WW DL+ IE V W S C+ +G+G TKFW D W+G L Sbjct: 817 HLPNGSGAKGSRWWVDLNRIEEGDLVSNEWMSSNCCKVIGNGVDTKFWLDKWVGHGILAH 876 Query: 1323 CFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKN 1144 F RL+Q+A NK +IAEM W G + +W+W WRR L VWE++LL N + + + Sbjct: 877 TFSRLYQIAINKNVSIAEMFEWEGGVVKWKWSWRRRLLVWEQQLLNTLANFINGTKFIIS 936 Query: 1143 VVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRL 964 D WLW +V++V SAY +L N + F+W+W+ +K++ FTWR+ Sbjct: 937 DEDKWLWIAAPERVYTVSSAYKVL-----RNDIIFASNVIFRWIWTSIAPTKVSAFTWRV 991 Query: 963 LQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAG 784 + +R+PT++ L RG++ A C C E SHLFF C S+ +W +W+G+ Sbjct: 992 ILNRIPTKDNLFRRGVLQATQ-LECGLCRNKEETTSHLFFECEVSFQLWMACFNWLGLNS 1050 Query: 783 VFHNDGVNHFIQ-HGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVITQ 607 + HN V + Q +G + + LI + V+W++W RN +IF + + ++ Sbjct: 1051 IMHNCCVQNLEQFYGLRYCSVKYQNCWILIRLPVIWTIWLARNDLIFSSKIIHVSEMLNM 1110 Query: 606 IKMLSWGW 583 +++ SW W Sbjct: 1111 VQLRSWRW 1118 >GAU27776.1 hypothetical protein TSUD_215870 [Trifolium subterraneum] Length = 714 Score = 448 bits (1152), Expect = e-143 Identities = 215/475 (45%), Positives = 306/475 (64%), Gaps = 2/475 (0%) Frame = -3 Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218 GF + W +WM+ACIF S+S+L+NGSPT+DF+ RGLRQGDPL+PFLFLI EGL G++ Sbjct: 223 GFTEGWLRWMRACIFEISMSILINGSPTEDFKVGRGLRQGDPLSPFLFLIVVEGLAGMMR 282 Query: 2217 SAIERGILQGYKVSEDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNFL 2038 A+E G +G+ V++++ F +LQFADDTI++ +++W NLW++K + R FELVSG+R+NF+ Sbjct: 283 RAVEIGRFKGFHVNDNLQFQMLQFADDTILMGNSTWENLWSIKVLLRGFELVSGMRINFV 342 Query: 2037 KSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLAA 1858 KSN+YG+N++ +FL S+FLSC IP KFLG+PV ANPRR +TW+PVV+AM +L+ Sbjct: 343 KSNLYGVNVDANFLEAGSSFLSCRSDVIPFKFLGIPVGANPRRRETWKPVVEAMTNRLST 402 Query: 1857 WKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVSW 1678 W R LS GGR+TLIN+VL SM LYFFSF+KAP +++ LV IQRNFLWGG +K+ W Sbjct: 403 WSSRQLSFGGRITLINTVLASMPLYFFSFFKAPVCILKLLVRIQRNFLWGGGLVEKKLCW 462 Query: 1677 VCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFT--IS 1504 + D++CLPK GGLG+KNLELFN L + D +W GLL F+YG T ++ Sbjct: 463 IKWDQICLPKNRGGLGVKNLELFNIALLSKWKWRLLDEGDTIWAGLLRFRYGHLSTKILT 522 Query: 1503 SHNTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKD 1324 + SIWWRD ++ + + + +WF+ + VG+G + FW W G +SL D Sbjct: 523 GETSQIGAKDSIWWRD--IMSIGKSINGLWFNSNVRCCVGNGNNIGFWKFKWHGNTSLGD 580 Query: 1323 CFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKN 1144 FP LF K+ I+E W+G+ W W+WR L EE+ + +++ ++ Sbjct: 581 LFPDLFAKEAFKDVLISERLRWNGNTADWNWQWREELVETEEQQFSVLKDLLVGIRSDPT 640 Query: 1143 VVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAV 979 D+W W FSVKS Y++L++ + + LW DV SK A+ Sbjct: 641 RPDTWRWVPGTIGNFSVKSCYDVLISYYYLEAPDSNVLTALHKLWKTDVPSKTAI 695