BLASTX nr result

ID: Glycyrrhiza36_contig00005589 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza36_contig00005589
         (2399 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

GAU48210.1 hypothetical protein TSUD_404970 [Trifolium subterran...   550   e-173
GAU25119.1 hypothetical protein TSUD_274080 [Trifolium subterran...   502   e-161
XP_019418409.1 PREDICTED: uncharacterized protein LOC109329191 [...   503   e-161
GAU37021.1 hypothetical protein TSUD_207270 [Trifolium subterran...   486   e-160
GAU34179.1 hypothetical protein TSUD_162800 [Trifolium subterran...   489   e-159
GAU26515.1 hypothetical protein TSUD_361480 [Trifolium subterran...   484   e-155
KYP54863.1 Putative ribonuclease H protein At1g65750 family [Caj...   470   e-153
GAU18134.1 hypothetical protein TSUD_248350 [Trifolium subterran...   468   e-151
KYP34591.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan]       472   e-151
GAU50434.1 hypothetical protein TSUD_134890, partial [Trifolium ...   466   e-150
KYP40876.1 Putative ribonuclease H protein At1g65750 family [Caj...   466   e-150
GAU46725.1 hypothetical protein TSUD_100170 [Trifolium subterran...   466   e-149
GAU24087.1 hypothetical protein TSUD_388800 [Trifolium subterran...   487   e-148
KYP63901.1 Putative ribonuclease H protein At1g65750 family [Caj...   456   e-148
KYP50779.1 Transposon TX1 uncharacterized [Cajanus cajan]             471   e-148
GAU43915.1 hypothetical protein TSUD_88880 [Trifolium subterraneum]   457   e-147
GAU17363.1 hypothetical protein TSUD_232390 [Trifolium subterran...   451   e-144
GAU43110.1 hypothetical protein TSUD_373050 [Trifolium subterran...   461   e-144
KYP44439.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus ca...   461   e-144
GAU27776.1 hypothetical protein TSUD_215870 [Trifolium subterran...   448   e-143

>GAU48210.1 hypothetical protein TSUD_404970 [Trifolium subterraneum]
          Length = 1653

 Score =  550 bits (1416), Expect = e-173
 Identities = 282/619 (45%), Positives = 381/619 (61%), Gaps = 2/619 (0%)
 Frame = -3

Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218
            GF++ W KW++ACIF SS+S+L+NGSPT+DF+ +RGLRQGDPL+PFLFLIAAEGLTGL+ 
Sbjct: 1021 GFSEEWLKWLRACIFESSMSILINGSPTEDFKVERGLRQGDPLSPFLFLIAAEGLTGLMK 1080

Query: 2217 SAIERGILQGYKVSEDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNFL 2038
             A+E G  +GY+V+ +I F ILQFADDTI++ +  W+N+  +  + RSFELVSGL++NF+
Sbjct: 1081 RAVELGKFKGYQVNNNIQFQILQFADDTILMGEGVWDNIQTINILLRSFELVSGLKINFV 1140

Query: 2037 KSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLAA 1858
            KS +YGIN++   L   S FLSC +   P KFLG+PV ANPRR +TW+PVV AM K+L+ 
Sbjct: 1141 KSKIYGINVDDRLLVAGSAFLSCRVDVFPFKFLGIPVGANPRRRETWKPVVDAMTKRLST 1200

Query: 1857 WKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVSW 1678
            WK RHLS GGRVTLINSVL S+ LYFFSF+KAP  ++R L  IQR+FLWGG  E++K+ W
Sbjct: 1201 WKSRHLSFGGRVTLINSVLTSLPLYFFSFFKAPCCILRLLERIQRSFLWGGGLEDKKLCW 1260

Query: 1677 VCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFT--IS 1504
            V  D++CL K  GGLG+KNL LFN           LT+  A+W  LL F+YG   T  + 
Sbjct: 1261 VKWDQICLSKDQGGLGVKNLNLFNIALLNKWKWRFLTEDGALWAELLRFRYGHLPTQLMG 1320

Query: 1503 SHNTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKD 1324
              + +    +S WW+D  +I + +G +  WF   +   VG+G +  FW+  W G     +
Sbjct: 1321 GASFSIGAKSSTWWKD--VIGMGKGAEFDWFKSNMRACVGNGVNIGFWNFKWFGNHPFSE 1378

Query: 1323 CFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKN 1144
             FP LF   E    +IAE    +G+ +   W+W  PL   E + +A+   ++    +Q  
Sbjct: 1379 IFPNLFAKEERPNVSIAERLGGNGEAFVRHWQWSDPLSDSEHQQVAELTELLRGFSLQPG 1438

Query: 1143 VVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRL 964
              DSW W LE + +FSVKS YN L+  +     +  +      LW  DV SK+  F WRL
Sbjct: 1439 HQDSWRWILETTGLFSVKSYYNALVKSRLIVELDSNVLTAINQLWKNDVPSKVLFFGWRL 1498

Query: 963  LQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAG 784
            L  RLP R  L  RGI+       CVFC    E+C HLFF C+F   VW  V +WIG   
Sbjct: 1499 LLQRLPIRIALNHRGILTNPQDLPCVFCSVFYEDCVHLFFHCSFVNCVWEAVYNWIGKDY 1558

Query: 783  VFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVITQI 604
                +G +HF   GD     ++ R R+LIW+A  W+LW +RN +IF G     +S++  I
Sbjct: 1559 HAGAEGWSHFKVFGDMVNSTNIERVRHLIWLATTWNLWKLRNNVIFNGATPSASSLLNDI 1618

Query: 603  KMLSWGWFVNRAGRSSEIS 547
            K +S  W   R G  S IS
Sbjct: 1619 KAISCAWVSGRYGHKSCIS 1637


>GAU25119.1 hypothetical protein TSUD_274080 [Trifolium subterraneum]
          Length = 937

 Score =  502 bits (1293), Expect = e-161
 Identities = 259/619 (41%), Positives = 373/619 (60%), Gaps = 10/619 (1%)
 Frame = -3

Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218
            GF + W +W++AC+F+ ++SVLVNGSPT +   +RGL+QGDPLAPFLFL+  EG  GL+ 
Sbjct: 307  GFGETWVEWIRACVFAGNLSVLVNGSPTTEINIQRGLKQGDPLAPFLFLLVVEGFAGLMR 366

Query: 2217 SAIERGILQGYKV-SEDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041
            S +++ + +G+ V +E +  + LQ+ADDT+ + +AS  NLW +KAI R FEL SGLRVN 
Sbjct: 367  SVVDKNLFKGFSVGTEGLQISHLQYADDTLCIGEASMENLWTLKAILRGFELASGLRVNI 426

Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861
             KS + G+N+  +F+  A +FL+C  G +P  +LGLPV ANPRR  TWEPV+ ++RK+L 
Sbjct: 427  WKSYLIGVNVPNNFMENACHFLNCKRGVLPFSYLGLPVGANPRRSSTWEPVLDSLRKRLR 486

Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681
            AW  +++S+GGR+ LINS+LNS+ ++F SF K P  V++++  IQR FLWGG +   K+S
Sbjct: 487  AWGNKYVSLGGRIVLINSILNSIPIFFLSFLKLPAAVLKSITRIQREFLWGGVKGGSKIS 546

Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501
            WV   +VC P+  GGLG++++   N           L    AVW  +L  +YG+N   + 
Sbjct: 547  WVKWKEVCKPRSQGGLGVRDVGKVNLSLLIKWRWKLLQKDAAVWKDVLVARYGEN---AR 603

Query: 1500 HNT-----ADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVS 1336
            HN           AS WWRDL  I+L    +  WF+  + R+VG G+ T+FW D W+G  
Sbjct: 604  HNVLWIGCPIPSSASCWWRDLCRIDLTE--EGSWFAKNISRRVGRGDTTRFWKDCWVGQV 661

Query: 1335 SLKDCFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQ 1156
             L + FPRLF ++  KEA ++E+      +  WEW WRR LFVWEEELL    + ++P+ 
Sbjct: 662  PLCESFPRLFSISLQKEALVSEIRVGGEGVSWWEWGWRRSLFVWEEELLLGLQDFISPMA 721

Query: 1155 IQKNVVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLD-RTFKWLWSCDVSSKIAV 979
               +  D W W LE   VF+VKSAY +L  +  +       + R    +W     SK+  
Sbjct: 722  FSTD-DDVWYWGLEDGGVFTVKSAYLLLGRMFASFSMFNVCELRVLNSIWRSPAPSKVIA 780

Query: 978  FTWRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSW 799
            F+W+LL++R+PTR+ L  RGI+ A     CV C    E   HLF  C+F++ VWS +  W
Sbjct: 781  FSWKLLRNRIPTRDCLSRRGILAAGGSRECVHCQGREETALHLFLFCDFAFRVWSAIFQW 840

Query: 798  IGVAGVFHNDGVNHFIQHGDFF*GKHLRRTRN---LIWMAVVWSLWGMRNKIIFQGLVAD 628
            +GV  V      N FI    F       +      LIW   VW++W  RN+I+F   V D
Sbjct: 841  LGVVIVM---PPNLFILFDCFVGAAGCNKRAKGFLLIWHTTVWAIWRSRNEILFANGVLD 897

Query: 627  FTSVITQIKMLSWGWFVNR 571
             +SVI +IK+LSW W ++R
Sbjct: 898  PSSVIDEIKLLSWRWGLSR 916


>XP_019418409.1 PREDICTED: uncharacterized protein LOC109329191 [Lupinus
            angustifolius]
          Length = 953

 Score =  503 bits (1294), Expect = e-161
 Identities = 260/608 (42%), Positives = 362/608 (59%), Gaps = 5/608 (0%)
 Frame = -3

Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218
            GF   WR W+K+C+ S+S S+LVNGSPT +F   RGLRQGDP+APFLFLI AEGL G++ 
Sbjct: 317  GFCFKWRNWIKSCLQSNSFSILVNGSPTSEFRMARGLRQGDPIAPFLFLIVAEGLGGIMR 376

Query: 2217 SAIERGILQGYKVSED-ISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041
            SA+ + I  GY V  D I  + LQ+ADDT+++ + S +N+  +K+I + FELVSGL++NF
Sbjct: 377  SAVSKKIFTGYSVGRDEIVISHLQYADDTLLIGENSADNIMVLKSILKCFELVSGLKINF 436

Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861
             KS+  GI  + SF+ VA N L C +G+IP KFLG+PV ANP+R  TW  V+   ++KL+
Sbjct: 437  HKSSFIGIKADPSFVQVAVNRLLCGVGSIPFKFLGIPVGANPKRLSTWSLVIDTFKRKLS 496

Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681
             W+ + LS GGRVTL+ SVL+S+ +Y+FSF+KAP  +I  L  IQR FLWG  E N+ + 
Sbjct: 497  RWQQKLLSFGGRVTLLKSVLSSLPIYYFSFFKAPVSIIHELERIQRRFLWGRGEVNKGIH 556

Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501
            WV   +VC  K  GGLG+KNL LFN           L+  +++WV +L   YG    +  
Sbjct: 557  WVRWKEVCRSKEEGGLGVKNLGLFNLALLGKWRWHMLSSSESLWVKVLRSIYGVEAVVRG 616

Query: 1500 H--NTADHRFASIWWRDLH-LIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSL 1330
               +    +  S WWRDL  L   D G    WF++ + R+VG G+ T FW D W+G   L
Sbjct: 617  GLVDVECFKKGSSWWRDLGCLCNRDNGFNKGWFNEGVRRRVGSGQSTLFWRDIWVGGECL 676

Query: 1329 KDCFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQ 1150
            K+CF RLFQV  NK+A I+ MG W   +W W   WRR LF+WE++ + D LN +  V++ 
Sbjct: 677  KNCFERLFQVTLNKDACISSMGEWRNGVWCWLLNWRRSLFLWEQDEVNDLLNKVEEVRLV 736

Query: 1149 KNVVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTW 970
            +   D WLW  + +  +SV++AY +L    RN+         +K LW+  V SK+  F W
Sbjct: 737  QGNEDGWLWVHDKNGTYSVRNAYKVLQNEVRNDNYLH-----YKRLWASKVPSKLKCFAW 791

Query: 969  RLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGV 790
            RL    +PT   L  RGII +     C FC    E+  HLFFTC+ SY VW  + S  G+
Sbjct: 792  RLFVGGVPTWMNLARRGIIGSLPSTLCAFCGELEESSDHLFFTCSLSYSVWQKLYSLFGI 851

Query: 789  AGVFHNDGVNHFIQHGDFF-*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVI 613
              +  +   ++F+ H   F   K   +    IW   +WSLW +RNKIIF+    +   V+
Sbjct: 852  YSILPSSTGSNFLSHWHLFGEAKKFHQQWMTIWFVTIWSLWLVRNKIIFEESSFNVDEVM 911

Query: 612  TQIKMLSW 589
              I + SW
Sbjct: 912  FIINLHSW 919


>GAU37021.1 hypothetical protein TSUD_207270 [Trifolium subterraneum]
          Length = 596

 Score =  486 bits (1251), Expect = e-160
 Identities = 253/539 (46%), Positives = 334/539 (61%), Gaps = 4/539 (0%)
 Frame = -3

Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218
            GF + W KWM+ CI  SS+SVLVNGS T DF   +GLRQGDPL+PFLFLI AEGLTG+V 
Sbjct: 38   GFAEGWLKWMRTCICQSSMSVLVNGSSTKDFNVFKGLRQGDPLSPFLFLIVAEGLTGMVR 97

Query: 2217 SAIERGILQGYKVSEDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNFL 2038
             A+E G  +GYKVSE I F ILQFADD I++ + SW+NLW +K + R FE+VSGL++NF 
Sbjct: 98   RAVELGKFKGYKVSESIQFQILQFADDMILMGENSWDNLWTIKTVLRGFEMVSGLKINFN 157

Query: 2037 KSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLAA 1858
            KS +YGIN+E  FL   S FLSC    IP KFLG+PV ANPRR +TW PVV+AM  + + 
Sbjct: 158  KSKLYGINVEEDFLEAGSTFLSCRSDVIPFKFLGIPVGANPRRKETWRPVVEAMSTRFSR 217

Query: 1857 WKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVSW 1678
            W G HL+ GGR+TLINSVL S+ LYFFSF+KA   V+  LV+IQRNFLWGG  E +K+ W
Sbjct: 218  WSGSHLTYGGRITLINSVLASLPLYFFSFFKASICVLNQLVSIQRNFLWGGGLEEKKMCW 277

Query: 1677 VCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYG--QNFTIS 1504
            V  D VCLP+ LGGLG+KNL+LFN           + D +AVW  +L  +YG   +F ++
Sbjct: 278  VKWDHVCLPRDLGGLGVKNLKLFNIALLSKWKWRCVNDSEAVWKEVLRHRYGHLSSFILN 337

Query: 1503 SHNTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKD 1324
                + +   SIWW+D+  I    G    WF       VG+G +  FW++ WLG + L D
Sbjct: 338  GVPISSNFKTSIWWKDMVNIGETFGYD--WFQSNTRIIVGNGNNIAFWTNRWLGNNVLSD 395

Query: 1323 CFPRLFQVAENKEANIAE--MGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQ 1150
             FP LF     K+A +A+  +    G IWRWEWR R  L   EE  LA+   ++    + 
Sbjct: 396  LFPNLFDKEAFKDAKVADRVINNNDGTIWRWEWRGR--LTEAEELDLAELQVLLTGFSLN 453

Query: 1149 KNVVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTW 970
                D W W  +    F++KS YN+L+ V      +  L    + LW  D+ SK+ +F W
Sbjct: 454  PTCCDRWKWIPDSVGDFTIKSCYNVLIHVGNTVVLSPHLLVAIRKLWKNDLPSKVGIFGW 513

Query: 969  RLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIG 793
            RLL ++LPTR  L  R I++ +    CVFC R  E+ +HLFF      ++W  ++ W+G
Sbjct: 514  RLLLEKLPTRAALAHRNILNTDDELLCVFCSRVREDSNHLFFNRVHMKYLWRRIHDWLG 572


>GAU34179.1 hypothetical protein TSUD_162800 [Trifolium subterraneum]
          Length = 757

 Score =  489 bits (1260), Expect = e-159
 Identities = 254/616 (41%), Positives = 369/616 (59%), Gaps = 7/616 (1%)
 Frame = -3

Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218
            GF ++W  WMKACIF  ++SVLVNG P  +   +RGL+QGDPLAPFLFL+ AEG  G + 
Sbjct: 125  GFCEVWIGWMKACIFGGNLSVLVNGCPMGEINIQRGLKQGDPLAPFLFLLVAEGFGGAMR 184

Query: 2217 SAIERGILQGYKVSED-ISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041
             A+E  + +G+ +S D  S + LQ+ADDT+ + +AS  NLW +KAI R FEL SGLRVNF
Sbjct: 185  RAVEINLFKGFNISRDGPSISHLQYADDTLCIGEASIENLWTMKAILRGFELASGLRVNF 244

Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861
             KS + G+N+   F+ +A  FL+C  G +P K+LGLPV ANPRR  TWEP+V ++RKKL 
Sbjct: 245  WKSCLIGVNVRDDFMELACTFLNCIQGFVPFKYLGLPVGANPRRLSTWEPLVASLRKKLN 304

Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681
            +W  +H+SI GR+ LINSVLNS+ +++ SF K P +V++ ++ IQR FLWGG    + +S
Sbjct: 305  SWGHKHVSIEGRLVLINSVLNSIPIFYLSFMKMPVQVLKKVIRIQREFLWGGVNGGRNLS 364

Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHD-AVWVGLLSFKYGQNF--- 1513
            W+ R  VC  K+ GGLGI++L+  N           L   D  +W  +L  KYG +    
Sbjct: 365  WIKRRVVCQGKKNGGLGIRDLKAVNLSLLMKWRWRLLNSEDTGLWKEVLVAKYGGHILHN 424

Query: 1512 TISSHNTADHRFASIWWRDLHLIELDRGVQPM-WFSDALCRKVGDGEHTKFWSDTWLGVS 1336
             + S  +  +R AS+WW+D++  +L   V    W ++ + R +G+G  T+FWSD W+G  
Sbjct: 425  VVWSLGSPPYR-ASLWWKDIN--DLQACVNSKNWVAEMVTRFLGNGSRTRFWSDNWIGDV 481

Query: 1335 SLKDCFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQ 1156
             L   FPRLF ++  KEA ++EM    G+   W + WRR LF+WEEE ++  L+++  V 
Sbjct: 482  LLCSKFPRLFSLSLQKEATVSEMMVVEGETKSWNFLWRRSLFLWEEERVSQLLSLLENVS 541

Query: 1155 IQKNVVDSWLWKLEGSQVFSVKSAYNMLL-TVQRNNGENQGLDRTFKWLWSCDVSSKIAV 979
            +     D W W L+    FSVKSAY+ LL  +  +   +    + F  +W      K+ V
Sbjct: 542  LSLE-EDKWHWALDPDGCFSVKSAYDSLLENLDTSPNLSPYEAKIFSNIWDSPAPLKVVV 600

Query: 978  FTWRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSW 799
            F+WRLL DR+PT+E LI RG++     GSCV+C    E+ +HLF  C  +  VW  +  W
Sbjct: 601  FSWRLLHDRVPTKENLIVRGVLPRESSGSCVWCGDIRESSAHLFLHCKVALVVWYEIFRW 660

Query: 798  IGVAGVFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTS 619
            +GV  V   +    F    D    K  ++   L+W +V+W++W  RN  IF  + +D   
Sbjct: 661  LGVVIVIPPNLFTLFDYFSDSARSKKSKKGFLLVWHSVIWTIWKARNNQIFNNVTSDPFE 720

Query: 618  VITQIKMLSWGWFVNR 571
            ++   K+LSW W  +R
Sbjct: 721  LVESAKVLSWRWSADR 736


>GAU26515.1 hypothetical protein TSUD_361480 [Trifolium subterraneum]
          Length = 873

 Score =  484 bits (1246), Expect = e-155
 Identities = 248/612 (40%), Positives = 360/612 (58%), Gaps = 4/612 (0%)
 Frame = -3

Query: 2394 FNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVDS 2215
            F + W +W++AC+F+ ++SVLVNGSPT +   +RGL+QGDPLAPFLFL+ AE   GL+ +
Sbjct: 244  FCNKWVEWIRACVFAGNLSVLVNGSPTTEINIQRGLKQGDPLAPFLFLLVAEDFVGLMRN 303

Query: 2214 AIERGILQGYKV-SEDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNFL 2038
            A+   + +G+ + SE +  + LQ+ADDT+ + D +  NLW +KAI R FEL SGL+VNF 
Sbjct: 304  AVALNLFKGFSIGSEGLVISHLQYADDTLCIGDDTLKNLWTLKAILRGFELASGLKVNFW 363

Query: 2037 KSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLAA 1858
            KS++ G+N+   ++  A NFL+C  G IP  +LGLPV ANPRRC TW+P+V+ +RK+L A
Sbjct: 364  KSSLIGVNVSNDYMVNACNFLNCKRGVIPFMYLGLPVGANPRRCSTWDPLVERLRKRLRA 423

Query: 1857 WKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVSW 1678
            W  R++S+GGR+ LIN VLN++ +++ S +K P  VI+ ++ IQR FLWGG +  +K+SW
Sbjct: 424  WGNRYVSLGGRIVLINFVLNAIPIFYLSLFKMPVLVIKKIIRIQREFLWGGVKGGRKISW 483

Query: 1677 VCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISSH 1498
            V   +VC P+  GGLG++++   N           L    A W  LL  KYG+      H
Sbjct: 484  VKWKEVCKPRCQGGLGVRDVGKVNLSLLIKWRWRLLQPEGAFWKELLVAKYGEMVRQKLH 543

Query: 1497 --NTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKD 1324
              +      AS WW+D  + E+D   +  WF+  + R+VG G+  +FW D W G S L D
Sbjct: 544  WNDCPIPSRASSWWKD--ICEIDVCEEGSWFAQHVFRRVGKGDSIRFWKDCWFGNSPLCD 601

Query: 1323 CFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKN 1144
             FPRLF +A +KEA + E+      +  W W WRR LFVWE+ELL      + P+ +   
Sbjct: 602  LFPRLFSIATHKEALVNEVRVVTEGLNLWNWEWRRRLFVWEQELLVSLTETL-PLLVLSG 660

Query: 1143 VVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLD-RTFKWLWSCDVSSKIAVFTWR 967
              D W W+LE   VF+VKS Y +L +V   +      + R F  +W     SK+ VF W+
Sbjct: 661  EEDVWYWRLEDGGVFTVKSVYTLLGSVFATDAVWSPPELRVFDQIWKSPAPSKVIVFPWK 720

Query: 966  LLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVA 787
            LL++R+PT+  L  RGI       +CV C  + E+ SHLF  CNF+  VW+ +  WIGV 
Sbjct: 721  LLRNRIPTKANLALRGIQVVGGSLNCVHCVGSGEDASHLFMYCNFAAQVWNSIFRWIGVT 780

Query: 786  GVFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVITQ 607
             V   +    F           + +  +LIW   +W +W  RN I F     D    + +
Sbjct: 781  IVIPPNIFLLFDCMRGAAPNNKIAKGFSLIWHTTLWVIWKSRNSISFGSGTIDLGQAVGE 840

Query: 606  IKMLSWGWFVNR 571
            IK+LSW W ++R
Sbjct: 841  IKLLSWRWDLSR 852


>KYP54863.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 648

 Score =  470 bits (1210), Expect = e-153
 Identities = 250/649 (38%), Positives = 370/649 (57%), Gaps = 5/649 (0%)
 Frame = -3

Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218
            GF   WRKW+  C+ ++ I VL+NGSPT +F   +GLRQGD LAPFLFLI AEGL  L+ 
Sbjct: 6    GFPLKWRKWIAECVSTTRIFVLLNGSPTGEFGVGKGLRQGDLLAPFLFLIVAEGLNALMS 65

Query: 2217 SAIERGILQGYKVS-EDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041
              +E  +  GY V  + +S + LQ+ADDT+++  AS +N+WA+K+I + FELV+GL+VNF
Sbjct: 66   KVVECHVFSGYSVGHQSVSVSHLQYADDTLIIGGASSHNVWAIKSILQIFELVAGLKVNF 125

Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861
             KS ++G NI    LN+ + FL+C +G++P  +LGLP+ ANPR  +TWEPV+  ++K+L+
Sbjct: 126  HKSKLFGFNINIEVLNLMAQFLNCKVGSLPFCYLGLPLGANPRCIKTWEPVISKVKKRLS 185

Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681
             WK   LS GGR  L+ SVLNS+ +Y+ SF+KAP+ +I  L ++ + FLWGG E ++K++
Sbjct: 186  KWKSSTLSFGGRSVLLKSVLNSIPIYYLSFFKAPQGIISKLESLFKLFLWGGDENHRKIA 245

Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501
            WV   +VC  K  GGLGI +L  FN           L +    W  +++  YG+      
Sbjct: 246  WVAWQEVCRGKEHGGLGILDLRAFNLALLGKWRWRLLVEKGRFWHRVVTSIYGEG---CF 302

Query: 1500 HNTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDC 1321
                D   +S WW DL  I+        WFS    + VGDG +T FW D W G   L + 
Sbjct: 303  QGVGDKVQSSKWWVDLWTIDSTPYTSFDWFSSRCTKVVGDGRNTFFWKDGWSGQGPLCNR 362

Query: 1320 FPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNV 1141
            + RLF +A +K+ ++A M  W    + W W WRR LF WE +LL+     +  + ++ + 
Sbjct: 363  YSRLFSIASDKDVSVANMVLWRDGGFEWIWSWRRSLFQWELDLLSQLAADLGSIVLKNDC 422

Query: 1140 VDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTF---KWLWSCDVSSKIAVFTW 970
             D W WK     +++VKSAY  ++        N G+   F   K+LWS  V SK++ F W
Sbjct: 423  CDRWCWKDSNDGIYNVKSAYKAVI--------NGGIYADFLLHKFLWSSCVPSKVSGFAW 474

Query: 969  RLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGV 790
            + L +R+P++  LI R +++ +  G C +C   +EN SHL F C ++Y VW    +W GV
Sbjct: 475  KALLNRIPSKCNLIKRKVLNISASG-CAWCGEDLENTSHLLFGCYYAYFVWLSNFAWFGV 533

Query: 789  AGVFHNDGVNHFIQHGDFF*GKHLRRTR-NLIWMAVVWSLWGMRNKIIFQGLVADFTSVI 613
            + V HN    +F     F       R R +++W+A +WSLW  RN +IF+  V     ++
Sbjct: 534  STVIHNSCHENFAHFNGFPRCSGRDRMRWSVVWLATIWSLWLARNDVIFKDKVVAIKDLV 593

Query: 612  TQIKMLSWGWFVNRAGRSSEISLVGLTFSRRGWAFFHNCEPLWIIFTFG 466
              IK+ SW W      ++ + S    T S+RG+       PL    TFG
Sbjct: 594  ELIKLRSWNWI-----KTKDKSF--FTHSQRGFL------PLVFALTFG 629


>GAU18134.1 hypothetical protein TSUD_248350 [Trifolium subterraneum]
          Length = 694

 Score =  468 bits (1204), Expect = e-151
 Identities = 247/616 (40%), Positives = 361/616 (58%), Gaps = 7/616 (1%)
 Frame = -3

Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218
            GF++ WR W+KAC+F+ S+SVLVNGSPT+  +  +GL+QGDPLAPFLF++ AEGL  L+ 
Sbjct: 66   GFDEKWRSWIKACVFAGSLSVLVNGSPTEQIDISKGLKQGDPLAPFLFILVAEGLGALMK 125

Query: 2217 SAIERGILQGYKVSEDISF-TILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041
             A++ G  +G ++S   +  + LQ+ADDT+ + +A   NLW  KAI R FEL+SGL+VNF
Sbjct: 126  KAVDVGYFKGIQISTTGTIMSHLQYADDTLFVGEACVENLWTTKAILRWFELISGLKVNF 185

Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861
             KS +YGIN+  +F++ A++FL C +G +P  +LGLPV ANPRR  TW PV++ ++K+LA
Sbjct: 186  FKSKLYGINVGDNFISSAASFLKCKVGKLPFIYLGLPVGANPRRLVTWNPVIEVLQKRLA 245

Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681
            +WK +++S+GGRV L+NSVL ++ +++ S +K P  V + +V +QR FLWGG   + K+ 
Sbjct: 246  SWKNKYVSLGGRVVLLNSVLAAIPIFYLSLFKMPVGVWKKIVNLQRRFLWGGVAGSSKIP 305

Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKY--GQNFTI 1507
            WV    VC PK+ GGLG+K+L + N           L++  ++W  +L  +Y  G++   
Sbjct: 306  WVNWRDVCRPKKEGGLGVKDLRIMNISLLAKWKWRLLSEGKSIWKNVLEDRYRGGESGVG 365

Query: 1506 SSHNTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLK 1327
                      AS WW DL  + +  G   +       +K+G+G  T+FW D+W+G   LK
Sbjct: 366  WMSKVWVSSKASPWWNDLMTMGVVAGEDRL--HGIFFKKIGNGGDTRFWHDSWVGTQPLK 423

Query: 1326 DCFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQK 1147
            + FPRLF ++  KE ++ E+G   G   RW+ +WRR LFVWEEEL    +N++ P+Q+  
Sbjct: 424  ELFPRLFLISVQKECSVFEVGG--GVSGRWDLKWRRNLFVWEEELRELLVNVLTPIQL-I 480

Query: 1146 NVVDSWLWKLEGSQVFSVKSAYNMLL-TVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTW 970
            N  D W        +FSV S Y  L   +      +  L R   +LW     SK+ VF+W
Sbjct: 481  NKEDEWRCHYFNGSLFSVSSLYKYLSGIIIPPISRDPELVRDLGFLWESLAPSKVIVFSW 540

Query: 969  RLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGV 790
            +LL  RLPT+  L  RGI++ +    CV C    E   HLF  C F+  +WS V  W G 
Sbjct: 541  QLLLSRLPTKANLAIRGIVEHDSNSFCVLCPMNTECEGHLFGWCAFASRIWSRVFDWFGW 600

Query: 789  AGVFHNDGVNHFIQHGDFF*GK-HLRRTRNL--IWMAVVWSLWGMRNKIIFQGLVADFTS 619
             GV   D    F     F  G+   RR + L  +W  VVW++W  RN +IF   V     
Sbjct: 601  GGVVPRDPREIF---QSFCRGRPGGRRIKGLLAVWHVVVWAIWRARNDLIFNSKVPVLED 657

Query: 618  VITQIKMLSWGWFVNR 571
            V+  I  LSW W + +
Sbjct: 658  VLHSIMSLSWKWLLEK 673


>KYP34591.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan]
          Length = 817

 Score =  472 bits (1214), Expect = e-151
 Identities = 247/607 (40%), Positives = 363/607 (59%), Gaps = 5/607 (0%)
 Frame = -3

Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218
            GF++ W +WM+AC+   S+S LVNGSPT +    RGL+QGDPLAP LFLIAAEGL  L+ 
Sbjct: 220  GFHERWVRWMEACVCGGSLSTLVNGSPTAEVSLGRGLKQGDPLAPSLFLIAAEGLRLLMS 279

Query: 2217 SAIERGILQGYKVS-EDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041
             A++  + +G  +  E    ++LQFADDT+++ +A+  NLW +KAI R FEL+SG+++NF
Sbjct: 280  RALDMNLFKGLHIGGEGPPISLLQFADDTLIIGEATMQNLWCLKAILRGFELISGMKINF 339

Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861
             KS + GI+    F N+A+ FL C +G +P K LGLP+ ANPR+  TW+P++ ++RK+L+
Sbjct: 340  HKSCVVGIHSGADFTNLAAAFLHCKVGQLPFKHLGLPLGANPRKLYTWKPMLDSLRKRLS 399

Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681
            +WK +HLSIGGRVTLINSVLN++ ++F SF+KAP  VI+ +VAIQR+FLW G ++  K+ 
Sbjct: 400  SWKYKHLSIGGRVTLINSVLNAIPIHFLSFFKAPNLVIKEIVAIQRDFLWRGVKDGSKIP 459

Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501
            WV  + VC  K  GGLGIK++ LFN           +     +W  +L  +YG+  + S+
Sbjct: 460  WVKWETVCKSKVEGGLGIKDVRLFNWALLGKWVWKCMQSPGMLWAKVLHHRYGRIESFSN 519

Query: 1500 HNTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDC 1321
             +  D R  S+WW+D+  + L +G    W  + + R +GDG  T+FW D W+G   L + 
Sbjct: 520  CSNVDRR-TSLWWKDIVWV-LHQG--NCWLDEKIERCIGDGTMTRFWEDKWIGGLRLLEV 575

Query: 1320 FPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNV 1141
            FPRLF  A +  + +A+ G W G  W W+ +WRR  FV E   +   L+++  +Q+  + 
Sbjct: 576  FPRLFSFALDPLSVVADNGTWEGSTWVWQVKWRREPFVHEGRSVNILLDMLKGLQVISSR 635

Query: 1140 VDSWLWKLEGSQVFSVKSAYNMLLTVQRNNG----ENQGLDRTFKWLWSCDVSSKIAVFT 973
             D W W  +   VFSVKSAY   L +QR+ G     +       K LW C    K  VF 
Sbjct: 636  QDYWRWIYDKDGVFSVKSAY---LWLQRSVGGELRNSSDFQLVIKRLWKCKAPIKCLVFC 692

Query: 972  WRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIG 793
            W++L +  P +  L  RG+   N+   C FC   VEN  HLF  C  +++ W  V  W+ 
Sbjct: 693  WQVLLNAFPCKSLLQVRGVELENN--LCSFCSLFVENPLHLFLMCPMAFNTWLAVAKWLE 750

Query: 792  VAGVFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVI 613
            V  VF N   +H++   +    +   +   ++W++V+WSLW  RN IIFQ    D   V+
Sbjct: 751  VTVVFPNSIFSHYLYWTNLGIYEKHSQLLRVVWVSVIWSLWLHRNVIIFQQGTIDAKEVL 810

Query: 612  TQIKMLS 592
              IK+ S
Sbjct: 811  DNIKLRS 817


>GAU50434.1 hypothetical protein TSUD_134890, partial [Trifolium subterraneum]
          Length = 712

 Score =  466 bits (1199), Expect = e-150
 Identities = 245/577 (42%), Positives = 332/577 (57%), Gaps = 4/577 (0%)
 Frame = -3

Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218
            GF   W KWM+ACIF+SS+ VLVNGSP +DF  ++GLRQGDPL+PFLFLI AE LT L+ 
Sbjct: 166  GFAPRWLKWMRACIFNSSMPVLVNGSPMEDFVVEKGLRQGDPLSPFLFLIIAERLTRLMQ 225

Query: 2217 SAIERGILQGYKVSEDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNFL 2038
             A++ G   G+KV +D+ F  LQFADDT+++ + +W NLW++K + RSFELVSGL+VNF 
Sbjct: 226  KAVDNGNYHGFKVRDDLQFHTLQFADDTVLVGEGNWENLWSLKTVLRSFELVSGLKVNFF 285

Query: 2037 KSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLAA 1858
            KS +YGIN++ +FL+ AS+FL C + +IP +FLG+PV ANPRR  TW PVV+AM+K+L A
Sbjct: 286  KSKLYGINLDDNFLSAASSFLHCEVDSIPFRFLGIPVGANPRRKITWNPVVEAMKKRLNA 345

Query: 1857 WKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVSW 1678
            W  R+LSIGGRVTLINS               PK V            WGG  + +K+ W
Sbjct: 346  WNCRNLSIGGRVTLINS--------------HPKEV-----------SWGGCSDIKKICW 380

Query: 1677 VCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYG---QNFTI 1507
            V  D +CLPK  GGLGIKNL  FN           L DH+ +W  LL  +YG    NF  
Sbjct: 381  VSWDTICLPKDKGGLGIKNLNCFNQALLCKWKWRGLCDHNTLWTKLLEHRYGSLADNFL- 439

Query: 1506 SSHNTADHRFASIWWRDLHLIELDRGVQ-PMWFSDALCRKVGDGEHTKFWSDTWLGVSSL 1330
                T D +  S+WWRD+ +I    G++   WF   +   +G+G   +FW +TW G   L
Sbjct: 440  -RDTTRDVKGQSLWWRDIMMI---GGIENDAWFRFNVRNVLGNGTCIRFWHETWHGPVCL 495

Query: 1329 KDCFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQ 1150
            KD FP+L+  +   EA I ++G W    W W  +W   L   E +   +  N++  +Q  
Sbjct: 496  KDLFPQLYCKSPQAEAIIYDVGKWVNQQWVWNLQWSTNLTSTEHDAACELANLLTGIQPS 555

Query: 1149 KNVVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTW 970
                D   W L  + +FSVKS Y  L + +        + +  + LW  DV SK+++F W
Sbjct: 556  LECADRRRWGLTQTGMFSVKSTYEFLQSREVVVAIEDNVVKALQLLWLNDVPSKVSIFGW 615

Query: 969  RLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGV 790
            RLL  RLPTR  L  + II   H  SC+FC    E  SHL F C FS  +W  +  W+ V
Sbjct: 616  RLLLSRLPTRMALARKNIIVNLHELSCIFCGEEQEELSHLLFNCPFSQELWKRIFKWMNV 675

Query: 789  AGVFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVW 679
              +  ++G  HF   G     K   + R++IW+A  W
Sbjct: 676  DFISFDEGWKHFFAFGALLENKKFEKARHVIWLATTW 712


>KYP40876.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 751

 Score =  466 bits (1199), Expect = e-150
 Identities = 249/635 (39%), Positives = 361/635 (56%), Gaps = 5/635 (0%)
 Frame = -3

Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218
            GF++ W +WM+ C+   S+S LVNGSPT +    RGL+QGDPLAP LFLIA EGL  L+ 
Sbjct: 127  GFDERWVRWMEGCVCGGSLSALVNGSPTGEVAIGRGLKQGDPLAPSLFLIAVEGLRLLMT 186

Query: 2217 SAIERGILQGYKVS-EDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041
             A++  + +G  +  E    ++LQFADDT+++ +A+  NLW +KAI R FEL+SG+++NF
Sbjct: 187  RALDMNLFKGLHLGGEGPLISLLQFADDTLIIGEATMQNLWCLKAILRCFELISGMKINF 246

Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861
             KS++ GI+    F  +A++FL C +G +P K LGLP+ ANPR+  TW P++  +R +L+
Sbjct: 247  HKSSVVGIHSGVDFTELAASFLHCKVGQLPFKHLGLPLGANPRKLATWRPILDGLRNRLS 306

Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681
            +WK R+LSIGGRVTLIN+VLN+M ++F SF+KAP  VI+ +VAIQR FLW G E+  K+ 
Sbjct: 307  SWKHRYLSIGGRVTLINAVLNAMPIHFLSFFKAPNSVIKEIVAIQRGFLWRGVEDGSKIP 366

Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501
            WV  + VC  K  GGLGIK++ LFN           +     +W  +L  +YG+  + S 
Sbjct: 367  WVKWETVCKSKDEGGLGIKDVRLFNWALLGKWVWRCMLYPSTMWAKVLQGRYGRIESFSK 426

Query: 1500 HNTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDC 1321
             +  D R  S WW+D+  + L +G    W  + + R +GDG  T+FW D W+G   L D 
Sbjct: 427  TSNVDRR-DSWWWKDIVWV-LQQG--NFWLDEKIDRCIGDGTSTRFWEDKWIGGLRLLDV 482

Query: 1320 FPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNV 1141
            FPRL+  A +  + +   G W G  W W+ +WRR  FV E   +   L ++  +QI  + 
Sbjct: 483  FPRLYSFAFDPLSMVGHNGNWEGSTWLWQVKWRREPFVHEVGSVNSLLEMLQGLQIFSSK 542

Query: 1140 VDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTF----KWLWSCDVSSKIAVFT 973
             D W W  +   VFSVKSAY+ L   QR+ G        F    K LW C    K  VF 
Sbjct: 543  QDQWRWICDKDGVFSVKSAYSWL---QRSLGGELSYSSDFHLVIKSLWKCKAPIKCLVFC 599

Query: 972  WRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIG 793
            W++  +  P +  L  RG+   N+   C  C   +E+  HLF  C  +++ W  V  W+ 
Sbjct: 600  WQVFMNAFPCKSLLQVRGVELENN--LCSLCSFFIEDPLHLFLMCPMAFNTWLSVAKWLE 657

Query: 792  VAGVFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVI 613
            V  V  N   + ++   +    K   +   ++W++V+WSLW  RN IIFQ  V D   V+
Sbjct: 658  VEVVLPNSLTSLYLYWTNLGIYKKSTQCFKVVWVSVIWSLWLHRNGIIFQQGVMDCKEVL 717

Query: 612  TQIKMLSWGWFVNRAGRSSEISLVGLTFSRRGWAF 508
              IK+ SW W         + S+ G +FS   W F
Sbjct: 718  DNIKLRSWKWI--------KSSVPGCSFSYSSWYF 744


>GAU46725.1 hypothetical protein TSUD_100170 [Trifolium subterraneum]
          Length = 776

 Score =  466 bits (1199), Expect = e-149
 Identities = 244/618 (39%), Positives = 360/618 (58%), Gaps = 5/618 (0%)
 Frame = -3

Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218
            GF   WR WMKAC++  ++SVLVNGSPT +    RGL+QGDPLAP LFL+ AEGL GL  
Sbjct: 151  GFCPKWRAWMKACVWGGNVSVLVNGSPTQEIPIMRGLKQGDPLAPLLFLLVAEGLGGL-- 208

Query: 2217 SAIERGILQGYKVSEDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNFL 2038
             A+E    + + V +    ++LQ+ADDT+ + +A+  NLW +KA+ R FE+ SGL+VNF 
Sbjct: 209  RAVEINRFRPFLVGDGAPVSLLQYADDTLCIGEATVENLWVMKAVLRGFEMTSGLKVNFW 268

Query: 2037 KSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLAA 1858
            KS + G+N+   FL +AS+FL+C IG  P K+LGLPV AN R+  TWEP++  +R ++++
Sbjct: 269  KSCVIGVNVSEEFLGMASDFLNCRIGKTPFKYLGLPVGANSRKMSTWEPMLDTIRGRISS 328

Query: 1857 WKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVSW 1678
            W  +++S+GGR+ LIN+VLN++ +++ S+ K P +V R LV IQRNFLWGG     K  W
Sbjct: 329  WSCKYVSLGGRIVLINAVLNAIPIFYLSYMKMPTKVWRQLVKIQRNFLWGGLSNRSKTCW 388

Query: 1677 VCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLT-DHDAVWVGLLSFKYGQNFTISS 1501
            V  D +C PK   GLGI++L L N           L+ + + VW  ++  +YG +  I +
Sbjct: 389  VKWDDICRPKNEVGLGIRDLRLVNTSLLAKWRWKILSHEEEEVWKQIVKARYGSD-VIGN 447

Query: 1500 H--NTAD-HRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSL 1330
                 AD  R  S WWRD+  +E +      WFS A+ +KVG G+ T FW++ W+G  SL
Sbjct: 448  RCLGAADIPRSTSNWWRDICNLEGEFS----WFSSAVGKKVGRGDSTSFWNEIWIGDQSL 503

Query: 1329 KDCFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQ 1150
            +  FPRLF ++  K+  I  +G+     W+WE  WRR  F WEE+   +F++++AP    
Sbjct: 504  RQRFPRLFGISLQKQEVIQNLGSLTEGRWQWELLWRRDRFQWEEDQYREFIDVIAPFAPV 563

Query: 1149 KNVVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDR-TFKWLWSCDVSSKIAVFT 973
             N  D WLW  +G Q F+VKSAY  L  +  N    + ++   FK LW C   SK+  F 
Sbjct: 564  DN-HDRWLWLGDGIQGFTVKSAYMRLENLVNNRRILEPVENFVFKRLWKCAAPSKVHAFV 622

Query: 972  WRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIG 793
            W+LL DR+ T+  L    ++ ++   +CV C   +E   HLF  C++   VW  +  W+G
Sbjct: 623  WQLLLDRVQTKANLFKCKMLHSDQ-QTCVLCDGKIETAVHLFLHCDWVAKVWYEITRWLG 681

Query: 792  VAGVFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVI 613
               +   +    F         K  ++   LIW   +W +W  RN  IF  + A    V+
Sbjct: 682  FTLIIPPNLAISFAMWATCVSNKKEKKGICLIWNVFMWVVWKTRNGCIFNNMAAICEEVV 741

Query: 612  TQIKMLSWGWFVNRAGRS 559
             QIK++SW WF+ R  ++
Sbjct: 742  EQIKVMSWQWFIGRMAKA 759


>GAU24087.1 hypothetical protein TSUD_388800 [Trifolium subterraneum]
          Length = 1985

 Score =  487 bits (1254), Expect = e-148
 Identities = 243/622 (39%), Positives = 376/622 (60%), Gaps = 8/622 (1%)
 Frame = -3

Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218
            GF   WR WM+AC+ + ++SVLVNGSPT++   +RGL+QGDPLAP LFLI AEGL  L+ 
Sbjct: 1358 GFGTKWRNWMRACVCAGNMSVLVNGSPTEEISIRRGLKQGDPLAPLLFLIVAEGLGALMR 1417

Query: 2217 SAIERGILQGYKVSED-ISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041
            SA+ERG  + + V    +  +ILQ+ADDT+ + +A+  NLWA+KA+ R FEL SGL+VNF
Sbjct: 1418 SAVERGRFKPFVVGRGALPVSILQYADDTLCIGEATTENLWALKAMLRGFELASGLKVNF 1477

Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861
             KS + G+N+   FL  AS FL+C IG +P  +LGLPV ANPRR  TW+P+V+ ++++L 
Sbjct: 1478 WKSCIMGVNVSQDFLLAASGFLNCRIGCLPFMYLGLPVGANPRRYSTWQPMVEGIKRRLR 1537

Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681
            +W  +++S+GGR+ +IN+VL+S+ ++F S+ K P  V + +V +QRNFLWGG  + +++ 
Sbjct: 1538 SWGNKYISLGGRIVMINAVLSSIPIFFLSYMKMPLMVWKEIVTLQRNFLWGGLSKRRRIC 1597

Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501
            WV   ++C PK+ GGL I++L   N           L++ + VW  ++  KYG    +  
Sbjct: 1598 WVKWAEICKPKKEGGLSIRDLRTVNLSLLAKWRWKLLSEEEEVWKNVIIAKYG--IHMLG 1655

Query: 1500 HNTADHR----FASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSS 1333
            +   D R     +S+WWRD  L  LD+GV   WF+    + +G G   KFW + W+G  S
Sbjct: 1656 NARLDERDIGSMSSLWWRD--LCRLDKGVG--WFNHFARKYLGCGNSIKFWKEVWVGGQS 1711

Query: 1332 LKDCFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQI 1153
            L+  FPRLF ++  ++  + E+G+W   +WRW  RWRR LFVWEE+L+++   ++  + I
Sbjct: 1712 LELQFPRLFGISVQQDDMVREVGSWVNGVWRWGLRWRRVLFVWEEDLVSELELVLNNISI 1771

Query: 1152 QKNVVDSWLWKLEGSQVFSVKSAY---NMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIA 982
             +   D W+W+L     F+VKS Y   + LLT +      +     ++ +W   V SK++
Sbjct: 1772 TEE-EDRWVWRLNVGDGFTVKSLYEALDPLLTPRCLVSSFESF--AYRSIWKSAVPSKVS 1828

Query: 981  VFTWRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNS 802
               W+L  DR+PT+  L  RGI+  +H  SCV C    E   HLF  C+++  +W  V  
Sbjct: 1829 ALAWQLFLDRIPTKVNLYKRGILRMDH-ASCVLCGEEAETARHLFLHCDYAAGIWYAVCR 1887

Query: 801  WIGVAGVFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFT 622
            W+GV  V   D +  +         K +R+   ++WMA +W +W +RN+ +F+    + T
Sbjct: 1888 WLGVFAVLPADVMMSYGLLVGCGRNKKIRKGFAIVWMAFIWVIWKVRNERVFKNATVEVT 1947

Query: 621  SVITQIKMLSWGWFVNRAGRSS 556
              +  ++ LSW W++N+   SS
Sbjct: 1948 DAVDMVQRLSWQWYLNKMASSS 1969


>KYP63901.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 616

 Score =  456 bits (1173), Expect = e-148
 Identities = 244/626 (38%), Positives = 357/626 (57%), Gaps = 5/626 (0%)
 Frame = -3

Query: 2370 MKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVDSAIERGILQ 2191
            M+ C+   S+S LVNGSPT +    RGL+QGDPLAP LFLIA EGL  L+  A++  + +
Sbjct: 1    MEGCVCGGSLSALVNGSPTVEVTIGRGLKQGDPLAPSLFLIAVEGLRLLMTRALDMNLFK 60

Query: 2190 GYKVS-EDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNFLKSNMYGIN 2014
            G ++  E    ++LQFADDT+++ +A+  NLW +KAI R FEL+SG+R+NF KS++ GI+
Sbjct: 61   GLQLGGEGPLISLLQFADDTLIIGEATMQNLWCLKAILRCFELISGMRINFHKSSVVGIH 120

Query: 2013 IEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLAAWKGRHLSI 1834
                F  +A++FL C +G +P K LGLP+ ANPR+  TW P++  +RK+L++WK R+LSI
Sbjct: 121  SGEDFTELAASFLHCKLGQLPFKHLGLPLGANPRKLATWRPILDGLRKRLSSWKHRYLSI 180

Query: 1833 GGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVSWVCRDKVCL 1654
            GGRVTLIN+VLN+M ++F SF+KAP  VI+ +VAIQR+FLW G ++  K+ WV  + VC 
Sbjct: 181  GGRVTLINAVLNAMPIHFLSFFKAPNSVIKEIVAIQRDFLWRGVKDGSKIPWVKWETVCK 240

Query: 1653 PKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISSHNTADHRFA 1474
             K  GGLGIK++ LFN           +     +W  +L  +YG   +       D R  
Sbjct: 241  SKDKGGLGIKDVRLFNWALLGKWVWRCMISPRTIWAKVLQGRYGCIESFPKTPNVDKR-D 299

Query: 1473 SIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFQVAE 1294
            S WW+D+  + + +G    W  + + R +GDG  T+FW D W+G   L D FPRL+  A 
Sbjct: 300  SWWWKDIVWV-IQQG--NYWLDEKIERCIGDGSSTRFWEDKWIGGLRLLDVFPRLYSFAF 356

Query: 1293 NKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKLE 1114
            +  + +   G W G  W W+ +WRR  FV EE  +   + ++  +QI  +  D W W  +
Sbjct: 357  DPLSMVGHNGNWEGSTWLWQVKWRREPFVHEEGSVNTLIEMLQEIQIFSSKQDQWRWICD 416

Query: 1113 GSQVFSVKSAYNMLLTVQRNNGENQGLDRTF----KWLWSCDVSSKIAVFTWRLLQDRLP 946
               VFSVKSAY+ L   Q + G        F    K LW C    K  VF W++  +  P
Sbjct: 417  KDGVFSVKSAYSWL---QHSMGGELSYSSDFILVTKSLWKCKAPIKCLVFCWQVFMNAFP 473

Query: 945  TREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDG 766
             +  L  RG+   N+   C  C   +E+  HLF  C  ++++W  V +W+ V  V  N  
Sbjct: 474  CKSLLQVRGVEVENN--LCSLCSLFIEDPIHLFLMCPMAFNIWLSVANWLEVEVVLPNSL 531

Query: 765  VNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVITQIKMLSWG 586
             + ++   +    K  ++   ++W++V+WSLW  RN IIFQ  V D   V+  IKM SW 
Sbjct: 532  TSLYLYWTNLGIYKKSKQCFKVVWVSVIWSLWLHRNGIIFQQGVMDCKEVLDNIKMRSWK 591

Query: 585  WFVNRAGRSSEISLVGLTFSRRGWAF 508
            W         + S+ G +FS   W F
Sbjct: 592  WI--------KSSVPGCSFSYSNWYF 609


>KYP50779.1 Transposon TX1 uncharacterized [Cajanus cajan]
          Length = 1102

 Score =  471 bits (1211), Expect = e-148
 Identities = 249/635 (39%), Positives = 364/635 (57%), Gaps = 5/635 (0%)
 Frame = -3

Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218
            GF++ W +WM+ C+   S+S LVNGSPT +    RGL+QGDPLAP LFLIA EGL  L+ 
Sbjct: 478  GFDERWVRWMEGCVCGGSLSALVNGSPTVEVTIGRGLKQGDPLAPSLFLIAVEGLRLLMT 537

Query: 2217 SAIERGILQGYKVS-EDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041
             A++  + +G ++  E    ++LQFADDT+++ +A+  NLW +KAI R FEL+SG+R+NF
Sbjct: 538  RALDMNLFKGLQLGGEGPLISLLQFADDTLIIGEATMQNLWCLKAILRCFELISGMRINF 597

Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861
             KS++ GI+    F  +A++FL C +G +P K LGLP+ ANPR+  TW P++  +RK+L+
Sbjct: 598  HKSSVVGIHSGEDFTELAASFLHCKLGQLPFKHLGLPLGANPRKLATWRPILDGLRKRLS 657

Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681
            +WK R+LSIGGRVTLIN+VLN+M ++F SF+KAP  VI+ +VAIQR+FLW G ++  K+ 
Sbjct: 658  SWKHRYLSIGGRVTLINAVLNAMPIHFLSFFKAPNSVIKEIVAIQRDFLWRGVKDGSKIP 717

Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501
            WV  + VC  K  GGLGIK++ LFN           +     +W  +L  +YG   +   
Sbjct: 718  WVKWETVCKSKDKGGLGIKDVRLFNWALLGKWVWRCMISPRTIWAKVLQGRYGCIESFPK 777

Query: 1500 HNTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDC 1321
                D R  S WW+D+  + L +G    W  + + R +GDG  T+FW D W+G   L D 
Sbjct: 778  TPNVDKR-DSWWWKDIVWV-LQQG--NYWLDEKIERCIGDGSSTRFWEDKWIGGLRLLDV 833

Query: 1320 FPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNV 1141
            FPRL+  A +  + +   G W G  W W+ +WRR  FV EE  +   + ++  +QI  + 
Sbjct: 834  FPRLYSFAFDPLSMVGHNGNWEGSTWLWQIKWRRETFVHEEGSVNTLIEMLQEIQIFSSK 893

Query: 1140 VDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTF----KWLWSCDVSSKIAVFT 973
             D W W  +   VFSVKSAY+ L   Q + G        F    K LW C    K  VF 
Sbjct: 894  QDQWRWICDKDGVFSVKSAYSWL---QHSMGGELSYSSDFILVTKSLWKCKAPIKCLVFC 950

Query: 972  WRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIG 793
            W++  +  P +  L  RG+   N+   C  C   +E+  HLF  C  ++++W  V +W+ 
Sbjct: 951  WQVFMNAFPCKSLLQVRGVEVENN--LCSLCSLFIEDPIHLFLLCPMAFNIWLSVANWLE 1008

Query: 792  VAGVFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVI 613
            V  V  N   + ++   +    K  ++   ++W++V+WSLW  RN IIFQ  V D   V+
Sbjct: 1009 VEVVLPNSLTSLYLYWTNLGIYKKSKQCFKVVWVSVIWSLWLHRNGIIFQQGVMDCKEVL 1068

Query: 612  TQIKMLSWGWFVNRAGRSSEISLVGLTFSRRGWAF 508
              IKM SW W         + S+ G +FS   W F
Sbjct: 1069 DNIKMRSWKWI--------KSSVPGCSFSYSNWYF 1095


>GAU43915.1 hypothetical protein TSUD_88880 [Trifolium subterraneum]
          Length = 691

 Score =  457 bits (1175), Expect = e-147
 Identities = 246/610 (40%), Positives = 358/610 (58%), Gaps = 5/610 (0%)
 Frame = -3

Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218
            GF  LWRKW+K C+ +++ SVLVNGSP+D+F  +RGLRQGDPL+PFLFL+AAEGL  L++
Sbjct: 66   GFPTLWRKWIKECVCTATASVLVNGSPSDEFPLERGLRQGDPLSPFLFLLAAEGLHVLME 125

Query: 2217 SAIERGILQGYKV--SEDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVN 2044
            +   + I  GY+V  S  IS + LQFADDT+++    W N+ A++A+   FE +SGL+VN
Sbjct: 126  AMEVQNIFTGYRVGNSAPISVSHLQFADDTLLMGTKCWANVRALRAVLVLFETMSGLKVN 185

Query: 2043 FLKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKL 1864
            F KS + G+NI  S+L  A++ L C +G IP  +LGLP+  +PRR   WEPV+  ++K+L
Sbjct: 186  FNKSMLVGVNISDSWLGEAASGLGCRVGKIPFLYLGLPIGGDPRRLSFWEPVLTRLKKRL 245

Query: 1863 AAWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKV 1684
            + WK R LS GGR+ L+ SVL S+ +Y  SF+KAP   I ++ +I     WGG E+ +K+
Sbjct: 246  SGWKSRFLSFGGRLVLLKSVLTSLPVYALSFFKAPSGTISSIESILIKIFWGGCEDFRKI 305

Query: 1683 SWVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYG-QNFTI 1507
            SWV    +CL K  GGLG++ L  FN           L D + +W  +L+ +YG +  ++
Sbjct: 306  SWVYWKTICLQKEYGGLGVRKLREFNLALLGKWCWRMLVDREGLWFRVLAARYGVERGSL 365

Query: 1506 SSHNTADHRFASIWWRDL-HLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSL 1330
             +  T      S WWR++ H+ +        WF   + R+VGDG  T FW+D W+  S L
Sbjct: 366  CAGGTR----GSSWWREVAHIRDGGGEAAGGWFGGNISRQVGDGSDTFFWTDPWVDGSPL 421

Query: 1329 KDCFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQ 1150
             + F RLF +A NK  ++A+M      I    W W RPL  WEEELL +   ++  + +Q
Sbjct: 422  SERFGRLFDLAVNKSDSVADMFQLGWGIGGDAWVWGRPLRAWEEELLGECQTLLLTISLQ 481

Query: 1149 KNVVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTW 970
             + +D WLW+L+    ++++ AY  LLT Q    +   LD     +W   V  K+++  W
Sbjct: 482  AHSLDRWLWRLDVDGGYTIQGAY-QLLTAQ----DAVPLDAATGLIWHPQVPLKVSILAW 536

Query: 969  RLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGV 790
            RLL DRLPT+  L  RGI+ A     CV    AVE+  H+F +C+    +WS+V SW+G 
Sbjct: 537  RLLLDRLPTKVNLSYRGILPAG-DSLCVSGCGAVESAQHVFLSCSTFGSLWSLVRSWVGS 595

Query: 789  AGVFHNDGVNHFIQHGDFF*GKHLRRT-RNLIWMAVVWSLWGMRNKIIFQGLVADFTSVI 613
            A V      +HFIQ      G   RR+   LIW+A VW +W  RN  +F+G       ++
Sbjct: 596  ASVTAQTLSDHFIQFTTSAGGTRARRSFMQLIWLACVWVVWTERNHRLFRGSANSSLHML 655

Query: 612  TQIKMLSWGW 583
             +IK  S+ W
Sbjct: 656  DKIKTFSFRW 665


>GAU17363.1 hypothetical protein TSUD_232390 [Trifolium subterraneum]
          Length = 693

 Score =  451 bits (1159), Expect = e-144
 Identities = 246/615 (40%), Positives = 359/615 (58%), Gaps = 11/615 (1%)
 Frame = -3

Query: 2394 FNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVDS 2215
            F  +WR W+  C+ S++ SVLVNG PTD+F  +RGLRQGDPL+PFL+L+AAEGL  ++ S
Sbjct: 68   FPRVWRGWIMECVSSATASVLVNGCPTDEFSLERGLRQGDPLSPFLYLLAAEGLHIMMTS 127

Query: 2214 AIERGILQGYKV--SEDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041
            A+   +   Y +  + ++S + LQFADDT+++   SW N+  +KA+   FE +SGL+VNF
Sbjct: 128  AVSNHLFMPYNIGNANEVSVSHLQFADDTLLIGAKSWANIRTLKAVLILFESISGLKVNF 187

Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861
             KS ++G+N+  S+L+ A++ L C  G +P  +LGLP+  + R+ Q W P+V  +R +L+
Sbjct: 188  HKSILFGVNVNISWLHAAASVLRCKHGRLPFLYLGLPIGGDSRKLQFWSPLVNRIRDRLS 247

Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681
             WK ++LSIGGR+ L+ SVL+S+ +YF SF+KAP  +I  L +I   FLWGGSEEN+K+S
Sbjct: 248  GWKCKNLSIGGRLILLKSVLSSIPVYFLSFFKAPSGIISALESIFCQFLWGGSEENRKLS 307

Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501
            W+  D +CL +  GGLG++ L+ FN           L   D+ W  +L  KYGQ   +  
Sbjct: 308  WIKWDTICLQREHGGLGVRRLKEFNISLLGKWVWRLLEAGDSFWCEVLRAKYGQ---MGG 364

Query: 1500 HNTADHRFASIWWRDLHLIELDRG-VQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKD 1324
                     S WWR L+ I    G +   W  D   RKVGDG +T FW++ WL    L  
Sbjct: 365  RVCFSEGVGSSWWRTLNHIRDGVGLMDSRWLKDNNIRKVGDGRNTLFWTEPWLEDCPLDR 424

Query: 1323 CFPRLFQVAENKEANIAEMGAWHGDIWRWE---WRWRRPLFVWEEELLADFLNIMAPVQI 1153
             F RLF +AENK   +A+M   HG  W  +   W+WRR L  WEEEL+ D +  ++ V +
Sbjct: 425  SFSRLFDLAENKFITVADM---HGLGWGVDGEAWKWRRRLRAWEEELVLDCVERLSNVVL 481

Query: 1152 QKNVVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFT 973
            Q NV D W+WKL  S  ++V+SAY  L     N   N+G +R   +LW   +  K+ +F 
Sbjct: 482  QVNVHDRWVWKLHPSHCYTVRSAYAFLTATDIN--LNEGFNR---FLWLKSIPLKVNIFV 536

Query: 972  WRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIG 793
            WRL  +RLPTR+ L  RGI+DA+       C R +E+  HLFF C     +W+ V+ W+ 
Sbjct: 537  WRLFLNRLPTRDNLFRRGILDASMLACATSCGR-MEDVDHLFFQCPVYSRLWASVSKWME 595

Query: 792  VAGVFHNDGVNHFIQHGDFF*G-----KHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVAD 628
            V   FH       I H + F G     K       +IW+AV++ +W  RN  IF+     
Sbjct: 596  VETAFHGT----LILHSNQFCGLGGSSKSYNTLLIIIWVAVLFIIWKGRNHHIFKAGQDS 651

Query: 627  FTSVITQIKMLSWGW 583
              +++ ++K  S+ W
Sbjct: 652  LEAMVEKVKFQSYCW 666


>GAU43110.1 hypothetical protein TSUD_373050 [Trifolium subterraneum]
          Length = 1099

 Score =  461 bits (1185), Expect = e-144
 Identities = 236/615 (38%), Positives = 359/615 (58%), Gaps = 6/615 (0%)
 Frame = -3

Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218
            GF + W  W + C+F+ ++SVLVNGSPT +   +RGL+QGDPLAPFLFL+  EG +G++ 
Sbjct: 468  GFCEKWIGWTRGCVFAGNLSVLVNGSPTPEINIQRGLKQGDPLAPFLFLLVVEGFSGVMR 527

Query: 2217 SAIERGILQGYKVSED-ISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041
             A+E  + +G+ +    +  + LQ+ADDT+ + +AS  NLW++KAI R FE+VSGL+VNF
Sbjct: 528  RAVELNLFKGFNIGRGLVEISHLQYADDTLCIGEASVENLWSLKAILRGFEMVSGLKVNF 587

Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861
             KS + GIN+  +FL +A+ FL+C +G+IP K+LGLPV ANP+   TWEP++  +RK+L 
Sbjct: 588  WKSGLMGINVSPTFLTMAATFLNCRLGSIPFKYLGLPVGANPKNGSTWEPLLDHLRKRLN 647

Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681
            +W+ +H+S GGR+ ++N+VLN++ +++ S  K P  V + +V +QR FLWGG +   K+ 
Sbjct: 648  SWRNKHISFGGRIVMLNAVLNAIPIFYLSLLKMPVNVWKQVVRLQRVFLWGGVKGGNKIK 707

Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501
            WV    VC  K  GGLG++++ + N           L     +W  +L  KYG++  +  
Sbjct: 708  WVKWSVVCRAKNKGGLGVRDVRIVNLSLLAKWRWRLLLPGRPLWKEILVAKYGEHI-LHR 766

Query: 1500 HNTADHRF---ASIWWRDLHLIELDRGVQPM-WFSDALCRKVGDGEHTKFWSDTWLGVSS 1333
             + +D+R    AS WW+D  +  +D+ V+   W  + + RKVG+G  T FWS  W+G + 
Sbjct: 767  VDWSDYRIPSSASKWWKD--ICSIDKVVEDKNWLVEEVGRKVGNGNSTSFWSTKWIGDAP 824

Query: 1332 LKDCFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQI 1153
            L   FPRLF ++ +K+  + +     GD  RW + WRR LF WE + L     ++     
Sbjct: 825  LSVIFPRLFSLSNHKDCMVRDFYEDDGDNERWRFSWRRELFQWEVDRLTRLKELLVSFVF 884

Query: 1152 QKNVVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGE-NQGLDRTFKWLWSCDVSSKIAVF 976
              +  DSW+W+ +   VFSVKSAYN+L+   R+  E  +     F+ +W     SK+  F
Sbjct: 885  SSD-DDSWIWRPDPDGVFSVKSAYNLLIEELRSGEELEEEAALIFEQIWESPAPSKVIAF 943

Query: 975  TWRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWI 796
            +W+LL DR+PTR  L  RG++  +    CV C  +VE  +HLF  C  +  VW  V  WI
Sbjct: 944  SWQLLYDRIPTRRNLEVRGLLGLDSPWECVGCVGSVETTTHLFLHCPSALMVWYEVFRWI 1003

Query: 795  GVAGVFHNDGVNHFIQHGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSV 616
            GV  V     +  F         K  R    +IW A +W +W  RN  IF         +
Sbjct: 1004 GVIIVTPPSMMILFEVLRGSARNKKTRLGFLMIWHATIWCIWRARNNSIFANGSFSPKVI 1063

Query: 615  ITQIKMLSWGWFVNR 571
            + +IK+LSW W ++R
Sbjct: 1064 VEEIKVLSWKWCLSR 1078


>KYP44439.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan]
          Length = 1142

 Score =  461 bits (1187), Expect = e-144
 Identities = 233/608 (38%), Positives = 354/608 (58%), Gaps = 3/608 (0%)
 Frame = -3

Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218
            GF  +W  W+  C+ +S +S+LVNGSPT++F   + LRQGDPLAPFLFLI  EGL  L +
Sbjct: 521  GFPTIWCTWIAECLKTSRMSILVNGSPTEEFGVSKELRQGDPLAPFLFLIVEEGLFMLFN 580

Query: 2217 SAIERGILQGYKVSED-ISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNF 2041
             A +    +G  V +D +   ILQ+ADDT+++  AS++N+W +K+I R FEL SGL+VNF
Sbjct: 581  KASQLERFKGCLVGKDKVPVDILQYADDTLIMGHASYSNIWTIKSILRLFELASGLKVNF 640

Query: 2040 LKSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLA 1861
             KS   G NIE  +L + ++ L   +G+ P  +LGLP+ AN R   TW PV++ ++K+L+
Sbjct: 641  SKSTFMGYNIESQWLQIMASVLHFRVGSTPFSYLGLPIGANHRISSTWHPVIEKVKKRLS 700

Query: 1860 AWKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVS 1681
             WK   LS GGR+ L+ SVL+S+ +YF SF KAPK +I ++ ++ ++FLWG  ++N+K++
Sbjct: 701  RWKCTTLSFGGRIALLKSVLHSIPIYFLSFLKAPKGIISSIESLFKSFLWGADQDNRKIN 760

Query: 1680 WVCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFTISS 1501
            WV  D VC  K  GGLG+K+L  FN           L + +++WV ++   Y     I+S
Sbjct: 761  WVAWDVVCRDKIHGGLGMKDLSAFNLSLLGKWHWRMLVEKNSLWVRVIRSLY----DIAS 816

Query: 1500 H-NTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKD 1324
            H         S WW DL+ IE    V   W S   C+ +G+G  TKFW D W+G   L  
Sbjct: 817  HLPNGSGAKGSRWWVDLNRIEEGDLVSNEWMSSNCCKVIGNGVDTKFWLDKWVGHGILAH 876

Query: 1323 CFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKN 1144
             F RL+Q+A NK  +IAEM  W G + +W+W WRR L VWE++LL    N +   +   +
Sbjct: 877  TFSRLYQIAINKNVSIAEMFEWEGGVVKWKWSWRRRLLVWEQQLLNTLANFINGTKFIIS 936

Query: 1143 VVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRL 964
              D WLW     +V++V SAY +L      N      +  F+W+W+    +K++ FTWR+
Sbjct: 937  DEDKWLWIAAPERVYTVSSAYKVL-----RNDIIFASNVIFRWIWTSIAPTKVSAFTWRV 991

Query: 963  LQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAG 784
            + +R+PT++ L  RG++ A     C  C    E  SHLFF C  S+ +W    +W+G+  
Sbjct: 992  ILNRIPTKDNLFRRGVLQATQ-LECGLCRNKEETTSHLFFECEVSFQLWMACFNWLGLNS 1050

Query: 783  VFHNDGVNHFIQ-HGDFF*GKHLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVITQ 607
            + HN  V +  Q +G  +     +    LI + V+W++W  RN +IF   +   + ++  
Sbjct: 1051 IMHNCCVQNLEQFYGLRYCSVKYQNCWILIRLPVIWTIWLARNDLIFSSKIIHVSEMLNM 1110

Query: 606  IKMLSWGW 583
            +++ SW W
Sbjct: 1111 VQLRSWRW 1118


>GAU27776.1 hypothetical protein TSUD_215870 [Trifolium subterraneum]
          Length = 714

 Score =  448 bits (1152), Expect = e-143
 Identities = 215/475 (45%), Positives = 306/475 (64%), Gaps = 2/475 (0%)
 Frame = -3

Query: 2397 GFNDLWRKWMKACIFSSSISVLVNGSPTDDFEPKRGLRQGDPLAPFLFLIAAEGLTGLVD 2218
            GF + W +WM+ACIF  S+S+L+NGSPT+DF+  RGLRQGDPL+PFLFLI  EGL G++ 
Sbjct: 223  GFTEGWLRWMRACIFEISMSILINGSPTEDFKVGRGLRQGDPLSPFLFLIVVEGLAGMMR 282

Query: 2217 SAIERGILQGYKVSEDISFTILQFADDTIMLCDASWNNLWAVKAIFRSFELVSGLRVNFL 2038
             A+E G  +G+ V++++ F +LQFADDTI++ +++W NLW++K + R FELVSG+R+NF+
Sbjct: 283  RAVEIGRFKGFHVNDNLQFQMLQFADDTILMGNSTWENLWSIKVLLRGFELVSGMRINFV 342

Query: 2037 KSNMYGINIEGSFLNVASNFLSCCIGNIPLKFLGLPVAANPRRCQTWEPVVKAMRKKLAA 1858
            KSN+YG+N++ +FL   S+FLSC    IP KFLG+PV ANPRR +TW+PVV+AM  +L+ 
Sbjct: 343  KSNLYGVNVDANFLEAGSSFLSCRSDVIPFKFLGIPVGANPRRRETWKPVVEAMTNRLST 402

Query: 1857 WKGRHLSIGGRVTLINSVLNSMSLYFFSFYKAPKRVIRTLVAIQRNFLWGGSEENQKVSW 1678
            W  R LS GGR+TLIN+VL SM LYFFSF+KAP  +++ LV IQRNFLWGG    +K+ W
Sbjct: 403  WSSRQLSFGGRITLINTVLASMPLYFFSFFKAPVCILKLLVRIQRNFLWGGGLVEKKLCW 462

Query: 1677 VCRDKVCLPKRLGGLGIKNLELFNXXXXXXXXXXXLTDHDAVWVGLLSFKYGQNFT--IS 1504
            +  D++CLPK  GGLG+KNLELFN           L + D +W GLL F+YG   T  ++
Sbjct: 463  IKWDQICLPKNRGGLGVKNLELFNIALLSKWKWRLLDEGDTIWAGLLRFRYGHLSTKILT 522

Query: 1503 SHNTADHRFASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKD 1324
               +      SIWWRD  ++ + + +  +WF+  +   VG+G +  FW   W G +SL D
Sbjct: 523  GETSQIGAKDSIWWRD--IMSIGKSINGLWFNSNVRCCVGNGNNIGFWKFKWHGNTSLGD 580

Query: 1323 CFPRLFQVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKN 1144
             FP LF     K+  I+E   W+G+   W W+WR  L   EE+  +   +++  ++    
Sbjct: 581  LFPDLFAKEAFKDVLISERLRWNGNTADWNWQWREELVETEEQQFSVLKDLLVGIRSDPT 640

Query: 1143 VVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAV 979
              D+W W       FSVKS Y++L++       +  +      LW  DV SK A+
Sbjct: 641  RPDTWRWVPGTIGNFSVKSCYDVLISYYYLEAPDSNVLTALHKLWKTDVPSKTAI 695


Top