BLASTX nr result

ID: Glycyrrhiza32_contig00026986 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza32_contig00026986
         (1652 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

KHN30886.1 Putative ribonuclease H protein, partial [Glycine soja]    255   2e-76
KHN41375.1 Putative ribonuclease H protein, partial [Glycine soja]    237   1e-69
KHN28363.1 Putative ribonuclease H protein, partial [Glycine soja]    237   6e-69
XP_019447203.1 PREDICTED: uncharacterized protein LOC109350421 [...   235   7e-68
KHN20323.1 Putative ribonuclease H protein, partial [Glycine soja]    234   1e-67
XP_019418409.1 PREDICTED: uncharacterized protein LOC109329191 [...   243   9e-67
GAU38148.1 hypothetical protein TSUD_395930 [Trifolium subterran...   229   7e-65
GAU25119.1 hypothetical protein TSUD_274080 [Trifolium subterran...   228   1e-61
GAU48210.1 hypothetical protein TSUD_404970 [Trifolium subterran...   230   1e-61
GAU34179.1 hypothetical protein TSUD_162800 [Trifolium subterran...   225   2e-61
KYP61726.1 Putative ribonuclease H protein At1g65750 family [Caj...   217   5e-60
GAU24087.1 hypothetical protein TSUD_388800 [Trifolium subterran...   225   7e-60
GAU26515.1 hypothetical protein TSUD_361480 [Trifolium subterran...   222   9e-60
KYP54863.1 Putative ribonuclease H protein At1g65750 family [Caj...   218   1e-59
GAU29820.1 hypothetical protein TSUD_223660 [Trifolium subterran...   219   1e-59
KYP53060.1 hypothetical protein KK1_025062 [Cajanus cajan]            209   1e-59
GAU40143.1 hypothetical protein TSUD_163120 [Trifolium subterran...   211   3e-59
GAU50085.1 hypothetical protein TSUD_371690 [Trifolium subterran...   212   4e-59
GAU20019.1 hypothetical protein TSUD_273540 [Trifolium subterran...   213   4e-59
KYP44439.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus ca...   222   5e-59

>KHN30886.1 Putative ribonuclease H protein, partial [Glycine soja]
          Length = 373

 Score =  255 bits (652), Expect = 2e-76
 Identities = 132/363 (36%), Positives = 186/363 (51%), Gaps = 4/363 (1%)
 Frame = +3

Query: 12   GGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHRFA---S 182
            GGLG+KNLE+FN             DH+A+W  LL+F+YG N       + D  +    S
Sbjct: 1    GGLGVKNLEVFNISLLAKWKWRCIHDHNALWRDLLAFRYG-NLIAKQTCSLDRSWGTKDS 59

Query: 183  IWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVAEN 362
            IWWRDL L+E D      +F  A+   VGDG+   FW + WLG   LKD FP LF ++  
Sbjct: 60   IWWRDLMLLEKDLSQNQNFFQRAVSCDVGDGQSILFWYNKWLGSEPLKDAFPELFAISSQ 119

Query: 363  KEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKLEG 542
            +  ++   G+W  D W W+  W+R L   EEE L     I+  V +     D W W L  
Sbjct: 120  QLVSVGNAGSWRRDQWTWDLTWKRQLNPNEEESLHSLETILVDVHLVAESHDRWKWSLHN 179

Query: 543  SQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTREQL 722
            S++F+V S Y+  +++      N  +      +W   V SK+A+F WRLL DRLPT++ L
Sbjct: 180  SKLFTVSSCYSFAMSLVNQTQMNSDILDILSIVWKVPVPSKVALFCWRLLLDRLPTKDNL 239

Query: 723  ICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVNHFI 902
            I R ++  N    C  C    EN  HLFF C+FS  +W  + SWIG+  V    GV HF 
Sbjct: 240  IRRNVVINN--SRCSLCDSCDENVVHLFFHCDFSKCIWKEILSWIGIVDVIAVGGVQHFW 297

Query: 903  QHGDFFKGKKLR-RTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWGWFVN 1079
            ++    K    R +   + W+A +W +W +RN  IF+    D    I QIK + W WF+ 
Sbjct: 298  EYDRLLKYNTSRNKVPFMFWLATLWIIWQVRNNSIFKEEEKDIPKTINQIKHICWAWFMG 357

Query: 1080 RAG 1088
            + G
Sbjct: 358  KVG 360


>KHN41375.1 Putative ribonuclease H protein, partial [Glycine soja]
          Length = 363

 Score =  237 bits (604), Expect = 1e-69
 Identities = 122/338 (36%), Positives = 173/338 (51%), Gaps = 4/338 (1%)
 Frame = +3

Query: 87   DHDAVWVGLLSFKYGQNFTISSHNTADHRFA---SIWWRDLHLIELDRGVQPMWFSDALC 257
            DH+A+W  LL+F+YG N       + D  +    SIWWRDL L+E D      +F  A+ 
Sbjct: 12   DHNALWRDLLAFRYG-NLIAKQTCSLDRSWGTKDSIWWRDLMLLEKDLSQNQNFFQRAVS 70

Query: 258  RKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVAENKEANIAEMGAWHGDIWRWEWRWRRP 437
              VGDG+   FW + WLG   LKD FP LF ++  +  ++   G+W  D W W   W+R 
Sbjct: 71   CDVGDGQSILFWYNKWLGSEPLKDAFPELFAISSQQLVSVGNAGSWRRDQWTWGLTWKRQ 130

Query: 438  LFVWEEELLADFLNIMAPVQIQKNVVDSWLWKLEGSQVFSVKSAYNMLLTVQRNNGENQG 617
            L   EEE L     I+  V +     D W W L  S++F+V S Y+  +++      N  
Sbjct: 131  LNPNEEESLHSLETILVDVHLVAESHDRWKWSLHNSKLFTVSSCYSFAMSLVNQTQMNSD 190

Query: 618  LDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTREQLICRGIIDANHGGSCVFCFRAVENCS 797
            +      +W   V SK+A+F WRLL DRLPT++ LI R ++  N    C  C    EN  
Sbjct: 191  ILDILSIVWKVPVPSKVALFCWRLLLDRLPTKDNLIRRNVVINN--SRCSLCDSCDENVV 248

Query: 798  HLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVNHFIQHGDFFKGKKLR-RTRNLIWMAVVW 974
            HLFF C+FS  +W  + SWIG+  V    GV HF ++    K    R +   + W+A +W
Sbjct: 249  HLFFHCDFSKCIWKEILSWIGIVDVIAVGGVQHFWEYDRLLKYNTSRNKVPFMFWLATLW 308

Query: 975  SLWGMRNKIIFQGLVADFTSVIAQIKMLSWGWFVNRAG 1088
             +W +RN  IF+    D    I QIK + W WF+ + G
Sbjct: 309  IIWQVRNNSIFKEEEKDIPKTINQIKHICWAWFMGKVG 346


>KHN28363.1 Putative ribonuclease H protein, partial [Glycine soja]
          Length = 417

 Score =  237 bits (604), Expect = 6e-69
 Identities = 125/340 (36%), Positives = 175/340 (51%), Gaps = 4/340 (1%)
 Frame = +3

Query: 3    KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHRFA- 179
            K+ GGLG+KNLE+FN             DH+A+W  LL+F+YG N       + D  +  
Sbjct: 73   KKEGGLGVKNLEVFNISLLAKWKWRCIHDHNALWRDLLAFRYG-NLIAKQTCSLDRSWGT 131

Query: 180  --SIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLV 353
              SIWWRDL L+E D      +F  A+   VGDG+   FW + WLG   LKD FP LF +
Sbjct: 132  KDSIWWRDLMLLEKDLSQNQNFFQRAVSCDVGDGQSILFWYNKWLGSEPLKDAFPELFAI 191

Query: 354  AENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWK 533
            +  +  ++   G+W  D W W   W+R L   EEE L     I+  V +     D W W 
Sbjct: 192  SSQQLVSVGNAGSWRRDQWTWGLTWKRQLNPNEEESLHSLETILVDVHLVAESHDRWKWS 251

Query: 534  LEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTR 713
            L  S++F+V S Y+  +++      N  +      +W   V SK+A+F WRLL DRLPT+
Sbjct: 252  LHNSKLFTVSSCYSFAMSLVNQTQMNSDILDILSIVWKVPVPSKVALFCWRLLLDRLPTK 311

Query: 714  EQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVN 893
            + LI R ++  N    C  C    EN  HLFF C+FS  +W  V SWIG+  V    GV 
Sbjct: 312  DNLIRRNVVINN--SRCSLCDSCDENVVHLFFHCDFSNCIWKEVLSWIGIVDVIAVGGVQ 369

Query: 894  HFIQHGDFFKGKKLR-RTRNLIWMAVVWSLWGMRNKIIFQ 1010
            HF ++    K    R +   + W+A +W +W +RN  IF+
Sbjct: 370  HFWEYDRLLKYNTSRNKVPFMFWLATLWIIWQVRNNSIFK 409


>XP_019447203.1 PREDICTED: uncharacterized protein LOC109350421 [Lupinus
            angustifolius]
          Length = 456

 Score =  235 bits (600), Expect = 7e-68
 Identities = 130/374 (34%), Positives = 187/374 (50%), Gaps = 4/374 (1%)
 Frame = +3

Query: 3    KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHRF-- 176
            KR  G G+KNL LFN            +  +++WV +L   YG    +         F  
Sbjct: 8    KRGRGFGVKNLGLFNLALLGKWRWRMLSSSESLWVKVLRSIYGVEAVVRGGLVDVECFKK 67

Query: 177  ASIWWRDLHLI-ELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLV 353
             S WWRDL  +   D G    WF++ + R+VG G+ T FW D W+G   LK+CF RLF V
Sbjct: 68   GSSWWRDLGCVCNRDNGFNKGWFNEGVRRRVGSGQSTLFWRDIWVGGECLKNCFERLFQV 127

Query: 354  AENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWK 533
              NK+A I+ M  W   +W W   WRR LF+WE++ + D LN +  V++ +   D WLW 
Sbjct: 128  TLNKDACISSMDEWRNGVWCWLLNWRRSLFLWEQDEVNDLLNKVEEVRLVQGNEDGWLWV 187

Query: 534  LEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTR 713
             + +  +SV++AY +L    RN+         +K LW+  V SK+  F WRL    +PTR
Sbjct: 188  HDKNGTYSVRNAYKVLQNEVRNDNYLH-----YKRLWASKVPSKLKCFAWRLFVGGVPTR 242

Query: 714  EQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVN 893
              L  RGII +     C FC    E+  HLFFTC+ SY VW  + S  G+  +  +   +
Sbjct: 243  MNLARRGIIGSLPSTLCAFCGELEESSDHLFFTCSLSYSVWQKLYSLFGIYSILPSSTGS 302

Query: 894  HFIQHGDFF-KGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWGW 1070
            +F+ H   F + K   +    IW   +WSLW +RNKIIF+    +    I  I  L   +
Sbjct: 303  NFLSHWHLFGEAKNFHQQWMTIWFVTIWSLWLVRNKIIFEESSFNLDENIIIIFSLPHHF 362

Query: 1071 FVNRAGRSSEISLV 1112
            F  R  + S   ++
Sbjct: 363  FFARFNKESSFEVL 376


>KHN20323.1 Putative ribonuclease H protein, partial [Glycine soja]
          Length = 417

 Score =  234 bits (596), Expect = 1e-67
 Identities = 123/340 (36%), Positives = 174/340 (51%), Gaps = 4/340 (1%)
 Frame = +3

Query: 3    KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHRFA- 179
            K+ GGLG+KNLE+FN             DH+A+W  LL+F+YG N       + D  +  
Sbjct: 73   KKEGGLGVKNLEVFNISLLAKWKWRCIHDHNALWRDLLAFRYG-NLIAKQTCSLDRSWGT 131

Query: 180  --SIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLV 353
              SIWWRDL L+E D      +F  A+   VGDG+   FW + WLG   LKD FP LF +
Sbjct: 132  KDSIWWRDLMLLEKDLSQNQNFFQRAVSCDVGDGQSILFWYNKWLGSEPLKDAFPELFAI 191

Query: 354  AENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWK 533
            +  +  ++    +W  D W W   W+R L   EEE L     I+  V +     D W W 
Sbjct: 192  SSQQLESVGNASSWRRDQWTWGLTWKRQLNPNEEESLHSLETILVDVHLVAESHDRWKWS 251

Query: 534  LEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTR 713
            L  S++F+V S Y+  +++      N  +      +W   V SK+A+F WRLL DRLPT+
Sbjct: 252  LHNSKLFTVSSCYSFAMSLVNQTQMNSDILDILSIVWKVPVPSKVALFCWRLLLDRLPTK 311

Query: 714  EQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVN 893
            + LI R ++  N    C  C    EN  HLFF C+FS  +W  + SWIG+  V    GV 
Sbjct: 312  DNLIRRNVVINN--SRCSLCDSCDENVVHLFFHCDFSKCIWKEILSWIGIVDVIAVGGVQ 369

Query: 894  HFIQHGDFFKGKKLR-RTRNLIWMAVVWSLWGMRNKIIFQ 1010
            HF ++    K    R +   + W+A +W +W +RN  IF+
Sbjct: 370  HFWEYDRLLKYNTSRNKVPFMFWLATLWIIWQVRNNSIFK 409


>XP_019418409.1 PREDICTED: uncharacterized protein LOC109329191 [Lupinus
            angustifolius]
          Length = 953

 Score =  243 bits (619), Expect = 9e-67
 Identities = 131/358 (36%), Positives = 185/358 (51%), Gaps = 4/358 (1%)
 Frame = +3

Query: 3    KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHRF-- 176
            K  GGLG+KNL LFN            +  +++WV +L   YG    +         F  
Sbjct: 567  KEEGGLGVKNLGLFNLALLGKWRWHMLSSSESLWVKVLRSIYGVEAVVRGGLVDVECFKK 626

Query: 177  ASIWWRDLH-LIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLV 353
             S WWRDL  L   D G    WF++ + R+VG G+ T FW D W+G   LK+CF RLF V
Sbjct: 627  GSSWWRDLGCLCNRDNGFNKGWFNEGVRRRVGSGQSTLFWRDIWVGGECLKNCFERLFQV 686

Query: 354  AENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWK 533
              NK+A I+ MG W   +W W   WRR LF+WE++ + D LN +  V++ +   D WLW 
Sbjct: 687  TLNKDACISSMGEWRNGVWCWLLNWRRSLFLWEQDEVNDLLNKVEEVRLVQGNEDGWLWV 746

Query: 534  LEGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTR 713
             + +  +SV++AY +L    RN+         +K LW+  V SK+  F WRL    +PT 
Sbjct: 747  HDKNGTYSVRNAYKVLQNEVRNDNYLH-----YKRLWASKVPSKLKCFAWRLFVGGVPTW 801

Query: 714  EQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVN 893
              L  RGII +     C FC    E+  HLFFTC+ SY VW  + S  G+  +  +   +
Sbjct: 802  MNLARRGIIGSLPSTLCAFCGELEESSDHLFFTCSLSYSVWQKLYSLFGIYSILPSSTGS 861

Query: 894  HFIQHGDFF-KGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSW 1064
            +F+ H   F + KK  +    IW   +WSLW +RNKIIF+    +   V+  I + SW
Sbjct: 862  NFLSHWHLFGEAKKFHQQWMTIWFVTIWSLWLVRNKIIFEESSFNVDEVMFIINLHSW 919


>GAU38148.1 hypothetical protein TSUD_395930 [Trifolium subterraneum]
          Length = 503

 Score =  229 bits (583), Expect = 7e-65
 Identities = 134/371 (36%), Positives = 194/371 (52%), Gaps = 8/371 (2%)
 Frame = +3

Query: 3    KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTAD---HR 173
            K+ GGLG+++L L N            T    VW  ++  +YG++  I   N  D    R
Sbjct: 126  KKEGGLGVRDLRLVNISLLAKWRWKLLTTECEVWKEVVGARYGRD-VIGKVNLGDIDVTR 184

Query: 174  FASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLV 353
              S WWRDL L++ D      WFS A+ ++VG G+ T FW++ W+G   L+  FPRLF +
Sbjct: 185  TGSCWWRDLCLLDSD----VRWFSSAVGKRVGRGDSTMFWNEIWIGDQPLRQRFPRLFGM 240

Query: 354  AENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNV-VDSWLW 530
            +  +   I  MG+    +W WE +WRR  F WEE+    FL+I+  VQ    V  D WLW
Sbjct: 241  STQQNEVICNMGSLVNGLWHWELQWRRNFFTWEEDQYNHFLDII--VQFAPTVQQDRWLW 298

Query: 531  KLEGSQVFSVKSAY----NMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQD 698
              +G Q ++  SAY    N L+T    +  N   D  FK LW C   SK++ F+W+L+ D
Sbjct: 299  LGDGVQGYTANSAYSLVVNKLVTPSVCDPIN---DLVFKILWKCGAPSKVSAFSWQLMLD 355

Query: 699  RLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFH 878
            RL T++ L+ R II A+H G+CVFC  A E+ SHLF  C+    VW  +  W+G+  +  
Sbjct: 356  RLQTKDNLMKRRIIQAHH-GNCVFCNLAQESASHLFLHCDRVAKVWYDLMRWLGLTVILP 414

Query: 879  NDGVNHFIQHGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKML 1058
            ++ V+           KK R    LIW A +W +W +RN  +F   V     V  Q+K+ 
Sbjct: 415  HNIVSSLAILVTCANNKKERAGLCLIWNAYMWVIWTVRNVCVFNNGVFMEEEVADQVKLE 474

Query: 1059 SWGWFVNRAGR 1091
            SW WF+ R  +
Sbjct: 475  SWKWFIGRVAK 485


>GAU25119.1 hypothetical protein TSUD_274080 [Trifolium subterraneum]
          Length = 937

 Score =  228 bits (582), Expect = 1e-61
 Identities = 136/367 (37%), Positives = 193/367 (52%), Gaps = 10/367 (2%)
 Frame = +3

Query: 12   GGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTA-----DHRF 176
            GGLG++++   N                AVW  +L  +YG+N   + HN           
Sbjct: 560  GGLGVRDVGKVNLSLLIKWRWKLLQKDAAVWKDVLVARYGEN---ARHNVLWIGCPIPSS 616

Query: 177  ASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVA 356
            AS WWRDL  I+L    +  WF+  + R+VG G+ T+FW D W+G   L + FPRLF ++
Sbjct: 617  ASCWWRDLCRIDLTE--EGSWFAKNISRRVGRGDTTRFWKDCWVGQVPLCESFPRLFSIS 674

Query: 357  ENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKL 536
              KEA ++E+      +  WEW WRR LFVWEEELL    + ++P+    +  D W W L
Sbjct: 675  LQKEALVSEIRVGGEGVSWWEWGWRRSLFVWEEELLLGLQDFISPMAFSTD-DDVWYWGL 733

Query: 537  EGSQVFSVKSAYNMLLTVQRNNGENQGLD-RTFKWLWSCDVSSKIAVFTWRLLQDRLPTR 713
            E   VF+VKSAY +L  +  +       + R    +W     SK+  F+W+LL++R+PTR
Sbjct: 734  EDGGVFTVKSAYLLLGRMFASFSMFNVCELRVLNSIWRSPAPSKVIAFSWKLLRNRIPTR 793

Query: 714  EQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVN 893
            + L  RGI+ A     CV C    E   HLF  C+F++ VWS +  W+GV  V      N
Sbjct: 794  DCLSRRGILAAGGSRECVHCQGREETALHLFLFCDFAFRVWSAIFQWLGVVIVM---PPN 850

Query: 894  HFIQHGDFFKG----KKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLS 1061
             FI   D F G     K  +   LIW   VW++W  RN+I+F   V D +SVI +IK+LS
Sbjct: 851  LFILF-DCFVGAAGCNKRAKGFLLIWHTTVWAIWRSRNEILFANGVLDPSSVIDEIKLLS 909

Query: 1062 WGWFVNR 1082
            W W ++R
Sbjct: 910  WRWGLSR 916


>GAU48210.1 hypothetical protein TSUD_404970 [Trifolium subterraneum]
          Length = 1653

 Score =  230 bits (587), Expect = 1e-61
 Identities = 131/370 (35%), Positives = 183/370 (49%), Gaps = 2/370 (0%)
 Frame = +3

Query: 3    KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFT--ISSHNTADHRF 176
            K  GGLG+KNL LFN            T+  A+W  LL F+YG   T  +   + +    
Sbjct: 1270 KDQGGLGVKNLNLFNIALLNKWKWRFLTEDGALWAELLRFRYGHLPTQLMGGASFSIGAK 1329

Query: 177  ASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVA 356
            +S WW+D+  I + +G +  WF   +   VG+G +  FW+  W G     + FP LF   
Sbjct: 1330 SSTWWKDV--IGMGKGAEFDWFKSNMRACVGNGVNIGFWNFKWFGNHPFSEIFPNLFAKE 1387

Query: 357  ENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKL 536
            E    +IAE    +G+ +   W+W  PL   E + +A+   ++    +Q    DSW W L
Sbjct: 1388 ERPNVSIAERLGGNGEAFVRHWQWSDPLSDSEHQQVAELTELLRGFSLQPGHQDSWRWIL 1447

Query: 537  EGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTRE 716
            E + +FSVKS YN L+  +     +  +      LW  DV SK+  F WRLL  RLP R 
Sbjct: 1448 ETTGLFSVKSYYNALVKSRLIVELDSNVLTAINQLWKNDVPSKVLFFGWRLLLQRLPIRI 1507

Query: 717  QLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVNH 896
             L  RGI+       CVFC    E+C HLFF C+F   VW  V +WIG       +G +H
Sbjct: 1508 ALNHRGILTNPQDLPCVFCSVFYEDCVHLFFHCSFVNCVWEAVYNWIGKDYHAGAEGWSH 1567

Query: 897  FIQHGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWGWFV 1076
            F   GD      + R R+LIW+A  W+LW +RN +IF G     +S++  IK +S  W  
Sbjct: 1568 FKVFGDMVNSTNIERVRHLIWLATTWNLWKLRNNVIFNGATPSASSLLNDIKAISCAWVS 1627

Query: 1077 NRAGRSSEIS 1106
             R G  S IS
Sbjct: 1628 GRYGHKSCIS 1637


>GAU34179.1 hypothetical protein TSUD_162800 [Trifolium subterraneum]
          Length = 757

 Score =  225 bits (574), Expect = 2e-61
 Identities = 127/369 (34%), Positives = 194/369 (52%), Gaps = 9/369 (2%)
 Frame = +3

Query: 3    KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDA-VWVGLLSFKYGQNF---TISSHNTADH 170
            K+ GGLGI++L+  N               D  +W  +L  KYG +     + S  +  +
Sbjct: 375  KKNGGLGIRDLKAVNLSLLMKWRWRLLNSEDTGLWKEVLVAKYGGHILHNVVWSLGSPPY 434

Query: 171  RFASIWWRDLHLIELDRGVQPM-WFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLF 347
            R AS+WW+D++  +L   V    W ++ + R +G+G  T+FWSD W+G   L   FPRLF
Sbjct: 435  R-ASLWWKDIN--DLQACVNSKNWVAEMVTRFLGNGSRTRFWSDNWIGDVLLCSKFPRLF 491

Query: 348  LVAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWL 527
             ++  KEA ++EM    G+   W + WRR LF+WEEE ++  L+++  V +     D W 
Sbjct: 492  SLSLQKEATVSEMMVVEGETKSWNFLWRRSLFLWEEERVSQLLSLLENVSLSLEE-DKWH 550

Query: 528  WKLEGSQVFSVKSAYNMLLTVQRNNGENQGLD----RTFKWLWSCDVSSKIAVFTWRLLQ 695
            W L+    FSVKSAY+ LL    N   +  L     + F  +W      K+ VF+WRLL 
Sbjct: 551  WALDPDGCFSVKSAYDSLL---ENLDTSPNLSPYEAKIFSNIWDSPAPLKVVVFSWRLLH 607

Query: 696  DRLPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVF 875
            DR+PT+E LI RG++     GSCV+C    E+ +HLF  C  +  VW  +  W+GV  V 
Sbjct: 608  DRVPTKENLIVRGVLPRESSGSCVWCGDIRESSAHLFLHCKVALVVWYEIFRWLGVVIVI 667

Query: 876  HNDGVNHFIQHGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKM 1055
              +    F    D  + KK ++   L+W +V+W++W  RN  IF  + +D   ++   K+
Sbjct: 668  PPNLFTLFDYFSDSARSKKSKKGFLLVWHSVIWTIWKARNNQIFNNVTSDPFELVESAKV 727

Query: 1056 LSWGWFVNR 1082
            LSW W  +R
Sbjct: 728  LSWRWSADR 736


>KYP61726.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 554

 Score =  217 bits (553), Expect = 5e-60
 Identities = 120/362 (33%), Positives = 180/362 (49%), Gaps = 6/362 (1%)
 Frame = +3

Query: 3    KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHRFAS 182
            K  GGLGI +L  FN             +    W  +++  YG+          D   +S
Sbjct: 193  KEHGGLGILDLRAFNLAILEKWRWHLLVEKGRFWHKVVTSIYGEG---CFQGVGDKVQSS 249

Query: 183  IWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVAEN 362
             WW DL  I+        WFS    + VGDG++T FW D W G   L + + RLF +A +
Sbjct: 250  KWWVDLWTIDFAPYASFDWFSSRCTKVVGDGQNTFFWKDGWSGQGPLCNRYSRLFSIASD 309

Query: 363  KEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKLEG 542
            K+ ++A M  W    + W W WRR LF WE +LL+     +    ++ +  D W WK   
Sbjct: 310  KDVSVANMVLWRDGGFEWIWSWRRSLFQWELDLLSQLAADLGSTVLKNDCCDRWCWKDSN 369

Query: 543  SQVFSVKSAYNMLLTVQRNNGENQGLDRTF---KWLWSCDVSSKIAVFTWRLLQDRLPTR 713
             ++++VKSAY  ++        N G+   F   K+LWS  V SK++ F W+ L +R+P+ 
Sbjct: 370  DEIYNVKSAYKAVI--------NDGIYANFLLHKFLWSSCVPSKVSGFAWKALLNRIPSN 421

Query: 714  EQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHN---D 884
              LI R ++D +  G C +    +EN SHL F C ++Y VW  +  W GV+ V HN   +
Sbjct: 422  CNLIKRKVLDISASG-CAWYGEDLENTSHLLFGCYYAYSVWLSIFDWFGVSTVLHNSCHE 480

Query: 885  GVNHFIQHGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSW 1064
               HFI         K+R   +++W+A +WSLW  RN +IF+  V   T ++  IK+ SW
Sbjct: 481  NFAHFIGIPRCSGRDKMR--WSVVWLATIWSLWLARNNVIFKDKVVAITDLVELIKIRSW 538

Query: 1065 GW 1070
             W
Sbjct: 539  NW 540


>GAU24087.1 hypothetical protein TSUD_388800 [Trifolium subterraneum]
          Length = 1985

 Score =  225 bits (574), Expect = 7e-60
 Identities = 120/372 (32%), Positives = 197/372 (52%), Gaps = 7/372 (1%)
 Frame = +3

Query: 3    KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHR--- 173
            K+ GGL I++L   N            ++ + VW  ++  KYG +   ++    D R   
Sbjct: 1608 KKEGGLSIRDLRTVNLSLLAKWRWKLLSEEEEVWKNVIIAKYGIHMLGNAR--LDERDIG 1665

Query: 174  -FASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFL 350
              +S+WWRDL    LD+GV   WF+    + +G G   KFW + W+G  SL+  FPRLF 
Sbjct: 1666 SMSSLWWRDL--CRLDKGVG--WFNHFARKYLGCGNSIKFWKEVWVGGQSLELQFPRLFG 1721

Query: 351  VAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLW 530
            ++  ++  + E+G+W   +WRW  RWRR LFVWEE+L+++   ++  + I +   D W+W
Sbjct: 1722 ISVQQDDMVREVGSWVNGVWRWGLRWRRVLFVWEEDLVSELELVLNNISITEE-EDRWVW 1780

Query: 531  KLEGSQVFSVKSAY---NMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDR 701
            +L     F+VKS Y   + LLT +      +     ++ +W   V SK++   W+L  DR
Sbjct: 1781 RLNVGDGFTVKSLYEALDPLLTPRCLVSSFESF--AYRSIWKSAVPSKVSALAWQLFLDR 1838

Query: 702  LPTREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHN 881
            +PT+  L  RGI+  +H  SCV C    E   HLF  C+++  +W  V  W+GV  V   
Sbjct: 1839 IPTKVNLYKRGILRMDH-ASCVLCGEEAETARHLFLHCDYAAGIWYAVCRWLGVFAVLPA 1897

Query: 882  DGVNHFIQHGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLS 1061
            D +  +       + KK+R+   ++WMA +W +W +RN+ +F+    + T  +  ++ LS
Sbjct: 1898 DVMMSYGLLVGCGRNKKIRKGFAIVWMAFIWVIWKVRNERVFKNATVEVTDAVDMVQRLS 1957

Query: 1062 WGWFVNRAGRSS 1097
            W W++N+   SS
Sbjct: 1958 WQWYLNKMASSS 1969


>GAU26515.1 hypothetical protein TSUD_361480 [Trifolium subterraneum]
          Length = 873

 Score =  222 bits (566), Expect = 9e-60
 Identities = 125/360 (34%), Positives = 179/360 (49%), Gaps = 3/360 (0%)
 Frame = +3

Query: 12   GGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSH--NTADHRFASI 185
            GGLG++++   N                A W  LL  KYG+      H  +      AS 
Sbjct: 496  GGLGVRDVGKVNLSLLIKWRWRLLQPEGAFWKELLVAKYGEMVRQKLHWNDCPIPSRASS 555

Query: 186  WWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVAENK 365
            WW+D+   E+D   +  WF+  + R+VG G+  +FW D W G S L D FPRLF +A +K
Sbjct: 556  WWKDI--CEIDVCEEGSWFAQHVFRRVGKGDSIRFWKDCWFGNSPLCDLFPRLFSIATHK 613

Query: 366  EANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKLEGS 545
            EA + E+      +  W W WRR LFVWE+ELL      + P+ +     D W W+LE  
Sbjct: 614  EALVNEVRVVTEGLNLWNWEWRRRLFVWEQELLVSLTETL-PLLVLSGEEDVWYWRLEDG 672

Query: 546  QVFSVKSAYNMLLTVQRNNGENQGLD-RTFKWLWSCDVSSKIAVFTWRLLQDRLPTREQL 722
             VF+VKS Y +L +V   +      + R F  +W     SK+ VF W+LL++R+PT+  L
Sbjct: 673  GVFTVKSVYTLLGSVFATDAVWSPPELRVFDQIWKSPAPSKVIVFPWKLLRNRIPTKANL 732

Query: 723  ICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVNHFI 902
              RGI       +CV C  + E+ SHLF  CNF+  VW+ +  WIGV  V   +    F 
Sbjct: 733  ALRGIQVVGGSLNCVHCVGSGEDASHLFMYCNFAAQVWNSIFRWIGVTIVIPPNIFLLFD 792

Query: 903  QHGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWGWFVNR 1082
                     K+ +  +LIW   +W +W  RN I F     D    + +IK+LSW W ++R
Sbjct: 793  CMRGAAPNNKIAKGFSLIWHTTLWVIWKSRNSISFGSGTIDLGQAVGEIKLLSWRWDLSR 852


>KYP54863.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 648

 Score =  218 bits (556), Expect = 1e-59
 Identities = 128/399 (32%), Positives = 194/399 (48%), Gaps = 4/399 (1%)
 Frame = +3

Query: 3    KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHRFAS 182
            K  GGLGI +L  FN             +    W  +++  YG+          D   +S
Sbjct: 256  KEHGGLGILDLRAFNLALLGKWRWRLLVEKGRFWHRVVTSIYGEG---CFQGVGDKVQSS 312

Query: 183  IWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVAEN 362
             WW DL  I+        WFS    + VGDG +T FW D W G   L + + RLF +A +
Sbjct: 313  KWWVDLWTIDSTPYTSFDWFSSRCTKVVGDGRNTFFWKDGWSGQGPLCNRYSRLFSIASD 372

Query: 363  KEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKLEG 542
            K+ ++A M  W    + W W WRR LF WE +LL+     +  + ++ +  D W WK   
Sbjct: 373  KDVSVANMVLWRDGGFEWIWSWRRSLFQWELDLLSQLAADLGSIVLKNDCCDRWCWKDSN 432

Query: 543  SQVFSVKSAYNMLLTVQRNNGENQGLDRTF---KWLWSCDVSSKIAVFTWRLLQDRLPTR 713
              +++VKSAY  ++        N G+   F   K+LWS  V SK++ F W+ L +R+P++
Sbjct: 433  DGIYNVKSAYKAVI--------NGGIYADFLLHKFLWSSCVPSKVSGFAWKALLNRIPSK 484

Query: 714  EQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVN 893
              LI R +++ +  G C +C   +EN SHL F C ++Y VW    +W GV+ V HN    
Sbjct: 485  CNLIKRKVLNISASG-CAWCGEDLENTSHLLFGCYYAYFVWLSNFAWFGVSTVIHNSCHE 543

Query: 894  HFIQHGDFFKGKKLRRTR-NLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWGW 1070
            +F     F +     R R +++W+A +WSLW  RN +IF+  V     ++  IK+ SW W
Sbjct: 544  NFAHFNGFPRCSGRDRMRWSVVWLATIWSLWLARNDVIFKDKVVAIKDLVELIKLRSWNW 603

Query: 1071 FVNRAGRSSEISLVGLTFSRRGWAFFHNCEPLWIIFTFG 1187
                  ++ + S    T S+RG+       PL    TFG
Sbjct: 604  I-----KTKDKSF--FTHSQRGFL------PLVFALTFG 629


>GAU29820.1 hypothetical protein TSUD_223660 [Trifolium subterraneum]
          Length = 672

 Score =  219 bits (557), Expect = 1e-59
 Identities = 115/366 (31%), Positives = 181/366 (49%), Gaps = 3/366 (0%)
 Frame = +3

Query: 3    KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSH--NTADHRF 176
            K+ GGLGI+NL L N            +    VW  ++  KYG+    ++   N    +F
Sbjct: 295  KKEGGLGIRNLRLVNLSLLTKWRWRLLSGEGEVWKDIIVAKYGERVMGNARLDNIVYLQF 354

Query: 177  ASIWWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVA 356
             S WWRDL  ++ D G    WF+  + +KVG G    FW D W G  SL+  FPRLF ++
Sbjct: 355  GSAWWRDLCNLDKDEG----WFNQVVLKKVGMGNSILFWKDVWAGDQSLEHRFPRLFGIS 410

Query: 357  ENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKL 536
              +   +  MG+W    WRWE  WRR  FVWE EL+ +   ++    + +  VD W+WK 
Sbjct: 411  IQQNEVVRNMGSWVNVEWRWELLWRRQFFVWENELVRELGEVLNIFPLSEE-VDRWVWKP 469

Query: 537  EGSQVFSVKSAYNMLLTVQRNNGENQGLDR-TFKWLWSCDVSSKIAVFTWRLLQDRLPTR 713
              ++ FSVKS Y+ L +          L+  +F  +W C V SK++   W+L  DR+PT+
Sbjct: 470  NEAEGFSVKSLYDWLDSTLVTRAILTPLEAFSFCSIWKCVVPSKVSALAWQLFLDRIPTK 529

Query: 714  EQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVN 893
            + L CR  I  +    C  C    E   H+F  C+F+  VW  +  W+GV  +   D + 
Sbjct: 530  DNL-CRRRIIRSEDAVCDMCGGVSETSRHVFMHCDFAAQVWYAICRWLGVVVLLPPDVMT 588

Query: 894  HFIQHGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWGWF 1073
             +         KK+++  +++W+A +W +W  RN  +F  +       +  I+ +SW WF
Sbjct: 589  MYGSLVGCGSNKKIKKGFSIVWLAFIWVMWRSRNDKVFNNVAGVVEDALNHIQRISWQWF 648

Query: 1074 VNRAGR 1091
            ++   +
Sbjct: 649  LSNTAK 654


>KYP53060.1 hypothetical protein KK1_025062 [Cajanus cajan]
          Length = 323

 Score =  209 bits (533), Expect = 1e-59
 Identities = 112/307 (36%), Positives = 162/307 (52%), Gaps = 6/307 (1%)
 Frame = +3

Query: 180  SIWWRDLHLIELDRGVQPMWFSDALC-RKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVA 356
            S WW DL  I++  G+   WFS  +C R +G+G +T FW D+W   +     + RLF + 
Sbjct: 5    SRWWLDLWSIDVCDGISWDWFSTIMCVRVLGNGRNTSFWKDSWCTTTPFCVRYGRLFSIT 64

Query: 357  ENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKL 536
             N EA +A+M    G    W WRWRRPLF WE E L   ++ +   Q+Q+   DSW WK 
Sbjct: 65   INSEATVADMFFGRGGGVEWNWRWRRPLFQWELEQLDLLVSDLRGFQVQEYTHDSWRWKA 124

Query: 537  EGSQVFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTRE 716
            +    +SVKSAY++++     N          +++W   V  K++ F WR+L DR P++ 
Sbjct: 125  DSDGKYSVKSAYHVIV-----NDSLFAEIPLHRFIWCRLVPYKVSCFVWRVLLDRFPSKF 179

Query: 717  QLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVNH 896
             L+ R ++  N   SCV+C   +E  SHLFF C F+YHVW +   W G   V     +N 
Sbjct: 180  NLVKRHVL-INSDSSCVWCQYRMETSSHLFFECYFAYHVWMLSLEWCGFTSVL----LNS 234

Query: 897  FIQHGDFFKG-----KKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLS 1061
            FI H D F G      K+R    +IW+ V+WS+W  RN +IF   V     V+  +K+ +
Sbjct: 235  FIAHFDQFLGLPLCPSKMRYRWAVIWLTVIWSIWLARNALIFSDKVLSTLDVLELVKLRT 294

Query: 1062 WGWFVNR 1082
            W W   R
Sbjct: 295  WKWLKAR 301


>GAU40143.1 hypothetical protein TSUD_163120 [Trifolium subterraneum]
          Length = 419

 Score =  211 bits (538), Expect = 3e-59
 Identities = 129/362 (35%), Positives = 182/362 (50%), Gaps = 5/362 (1%)
 Frame = +3

Query: 12   GGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYG--QNFTISSHNTADHRFASI 185
            GGLG++++   N                A W  +L  KYG    F +     A     S+
Sbjct: 42   GGLGVRDVAKVNLSLLIKWRWRLLQSGYAFWKEVLVAKYGIMARFKVHWIGHALPNRVSL 101

Query: 186  WWRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVAENK 365
            WW+D+  I++       WF+  +CRK+G+G  T+FW D W+G   L D FPRLF ++ N+
Sbjct: 102  WWKDICGIDIRE--DGSWFARNMCRKLGNGNSTRFWLDRWIGSLPLSDQFPRLFSLSLNQ 159

Query: 366  EANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQ-KNVVDSWLWKLEG 542
            +  + E     G    W  RWRR LFVWEEELL    +++ PV +      D W W+LE 
Sbjct: 160  QGMVREFRDVRGGEDGWVMRWRRRLFVWEEELLQRLQDLL-PVDVPWSEAEDRWSWRLEE 218

Query: 543  SQVFSVKSAYNMLLTV--QRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTRE 716
               FSV S Y  L +V  Q ++   Q L   F  +W   V SK+  FTW+LL++R+PTR 
Sbjct: 219  DGSFSVSSMYWYLGSVFSQASSFNAQEL-WVFGKIWKSPVPSKVIAFTWKLLRNRIPTRC 277

Query: 717  QLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVNH 896
             L  RG I    G  CV C    E+ +HLF  C+F+  +W+ +  W+G+  V   +    
Sbjct: 278  NLASRG-IQLIGGLDCVHCVGREESGTHLFMFCDFAGQIWNAIFRWLGLVLVIPPNFFLL 336

Query: 897  FIQHGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWGWFV 1076
            F         KK+R+   LIW   +W LW  RN I+F   V D   VI  IK+LSW W +
Sbjct: 337  FECFTGAAANKKIRKGYALIWHTTIWMLWKSRNDIMFSNGVIDVEKVIDDIKLLSWRWGL 396

Query: 1077 NR 1082
            +R
Sbjct: 397  SR 398


>GAU50085.1 hypothetical protein TSUD_371690 [Trifolium subterraneum]
          Length = 438

 Score =  212 bits (539), Expect = 4e-59
 Identities = 121/365 (33%), Positives = 184/365 (50%), Gaps = 5/365 (1%)
 Frame = +3

Query: 3    KRLGGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHRF-- 176
            K  GGLG++++ L N               + +W  +L  KYG N  ++  +  D R   
Sbjct: 57   KSKGGLGVRDVRLANLSLLAKWRWRLLLPGNPLWKEVLVAKYG-NHILNRVDWRDIRIPT 115

Query: 177  -ASIWWRDLHLIELDRGVQPM-WFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFL 350
             AS WW+D+    LD+ V    W ++++ RKVG+G  T FW   W+G + L   FP LF 
Sbjct: 116  LASKWWKDI--CTLDKVVDNHNWLAESMIRKVGNGTSTSFWCSNWIGEAPLSVTFPLLFS 173

Query: 351  VAENKEANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLW 530
            ++ +K   +       G+ WRW + WRR LF WEE+L+     I+ PV +   V D W W
Sbjct: 174  LSNHKNGMVRNFCDHVGENWRWSFSWRRDLFQWEEDLVVRLREILEPV-VLSLVEDFWSW 232

Query: 531  KLEGSQVFSVKSAYNMLL-TVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLP 707
            KL+    FSVKSAY  L+  + R++   + +   F  +W     SK+  F+W+LL DR+P
Sbjct: 233  KLDPEGKFSVKSAYTFLVEELTRDDDLEEAMATVFDQIWDSPAPSKVIAFSWQLLSDRIP 292

Query: 708  TREQLICRGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDG 887
            TR  L  RG++  +    CV C   VE+ +HLF  C  +  VW  V  W+GV  +     
Sbjct: 293  TRRNLEIRGLLGLDMPWECVGCVGRVESTTHLFLHCPSAMMVWYEVFRWLGVVLIIPPSM 352

Query: 888  VNHFIQHGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWG 1067
               F       + KK+RR   +IW A +W +W  +NK +F         ++ +IK++SW 
Sbjct: 353  EVLFEVLRGSVRIKKIRRGYLMIWHATLWCIWKAQNKALFANGTFIPKEIVEEIKVVSWK 412

Query: 1068 WFVNR 1082
            W + R
Sbjct: 413  WCLAR 417


>GAU20019.1 hypothetical protein TSUD_273540 [Trifolium subterraneum]
          Length = 504

 Score =  213 bits (543), Expect = 4e-59
 Identities = 119/357 (33%), Positives = 189/357 (52%), Gaps = 4/357 (1%)
 Frame = +3

Query: 12   GGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSHNTADHRFASIWW 191
            GGLG++ ++ FN             + D++W  LL  KYGQ+   +          S WW
Sbjct: 131  GGLGVRRVKDFNYALLGKWVWRCFAEGDSLWCQLLKAKYGQDS--AGRVRFSEGVGSSWW 188

Query: 192  RDLHLIELDRG-VQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVAENKE 368
            R L+ +   RG + P W SD + RK+GDG  T FW+D+WL V  L   F RL+ +A+NK 
Sbjct: 189  RALNFVWSGRGLIDPRWLSDNIVRKIGDGRSTAFWADSWLEVGPLARVFGRLYDLADNKH 248

Query: 369  ANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKLEGSQ 548
             ++A+M      +    W+WRR LFVWEEEL+A  + ++A   +Q +  D W+W L  SQ
Sbjct: 249  ISVADMFQAGWALNGNGWKWRRRLFVWEEELVAQCVGVLANFVLQGDATDRWVWNLHPSQ 308

Query: 549  VFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTREQLIC 728
             +SV+SAY+ L        +   ++    +LW   V  K+ +F WR+  +RLPT++ L+ 
Sbjct: 309  SYSVRSAYSYLTA-----SDGSSMEDFASFLWVKSVPLKVNIFIWRIFLNRLPTKDNLLR 363

Query: 729  RGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVNHFIQH 908
            RG+I+ +       C +A E+  HLF  C+    VW +V +W+G++   H     H  Q 
Sbjct: 364  RGVIEVHQELCSTNCGKA-EDAVHLFIQCDVYSQVWHLVLNWLGLSTALHVSLGGHTEQF 422

Query: 909  GDFFKGKKLRRTRNL---IWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWGW 1070
                 G   + +RNL   IW++V++ +W  RN  IFQ       +++ +IK+ ++ W
Sbjct: 423  AGL--GGNSKTSRNLFTIIWVSVLFVIWKDRNDRIFQMGNDSGVTLLERIKLQTYWW 477


>KYP44439.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan]
          Length = 1142

 Score =  222 bits (566), Expect = 5e-59
 Identities = 116/355 (32%), Positives = 182/355 (51%), Gaps = 2/355 (0%)
 Frame = +3

Query: 12   GGLGIKNLELFNXXXXXXXXXXXXTDHDAVWVGLLSFKYGQNFTISSH-NTADHRFASIW 188
            GGLG+K+L  FN             + +++WV ++   Y     I+SH         S W
Sbjct: 774  GGLGMKDLSAFNLSLLGKWHWRMLVEKNSLWVRVIRSLYD----IASHLPNGSGAKGSRW 829

Query: 189  WRDLHLIELDRGVQPMWFSDALCRKVGDGEHTKFWSDTWLGVSSLKDCFPRLFLVAENKE 368
            W DL+ IE    V   W S   C+ +G+G  TKFW D W+G   L   F RL+ +A NK 
Sbjct: 830  WVDLNRIEEGDLVSNEWMSSNCCKVIGNGVDTKFWLDKWVGHGILAHTFSRLYQIAINKN 889

Query: 369  ANIAEMGAWHGDIWRWEWRWRRPLFVWEEELLADFLNIMAPVQIQKNVVDSWLWKLEGSQ 548
             +IAEM  W G + +W+W WRR L VWE++LL    N +   +   +  D WLW     +
Sbjct: 890  VSIAEMFEWEGGVVKWKWSWRRRLLVWEQQLLNTLANFINGTKFIISDEDKWLWIAAPER 949

Query: 549  VFSVKSAYNMLLTVQRNNGENQGLDRTFKWLWSCDVSSKIAVFTWRLLQDRLPTREQLIC 728
            V++V SAY +L      N      +  F+W+W+    +K++ FTWR++ +R+PT++ L  
Sbjct: 950  VYTVSSAYKVL-----RNDIIFASNVIFRWIWTSIAPTKVSAFTWRVILNRIPTKDNLFR 1004

Query: 729  RGIIDANHGGSCVFCFRAVENCSHLFFTCNFSYHVWSVVNSWIGVAGVFHNDGVNHFIQ- 905
            RG++ A     C  C    E  SHLFF C  S+ +W    +W+G+  + HN  V +  Q 
Sbjct: 1005 RGVLQATQ-LECGLCRNKEETTSHLFFECEVSFQLWMACFNWLGLNSIMHNCCVQNLEQF 1063

Query: 906  HGDFFKGKKLRRTRNLIWMAVVWSLWGMRNKIIFQGLVADFTSVIAQIKMLSWGW 1070
            +G  +   K +    LI + V+W++W  RN +IF   +   + ++  +++ SW W
Sbjct: 1064 YGLRYCSVKYQNCWILIRLPVIWTIWLARNDLIFSSKIIHVSEMLNMVQLRSWRW 1118


Top