BLASTX nr result

ID: Glycyrrhiza35_contig00008217 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza35_contig00008217
         (902 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

GAU10400.1 hypothetical protein TSUD_423410, partial [Trifolium ...   169   2e-47
GAU33259.1 hypothetical protein TSUD_333820 [Trifolium subterran...   150   4e-40
GAU12283.1 hypothetical protein TSUD_141910 [Trifolium subterran...   155   6e-39
GAU17063.1 hypothetical protein TSUD_105620 [Trifolium subterran...   150   9e-39
GAU50297.1 hypothetical protein TSUD_288310 [Trifolium subterran...   138   1e-33
GAU35042.1 hypothetical protein TSUD_30080 [Trifolium subterraneum]   131   1e-30
AFK48593.1 unknown [Lotus japonicus]                                  120   1e-28
GAU47648.1 hypothetical protein TSUD_27720 [Trifolium subterraneum]   123   4e-28
KYP74203.1 Putative ribonuclease H protein At1g65750 family [Caj...   116   2e-25
KYP35629.1 Putative ribonuclease H protein At1g65750 family [Caj...   116   3e-25
GAU24540.1 hypothetical protein TSUD_156530 [Trifolium subterran...   113   3e-24
GAU22350.1 hypothetical protein TSUD_106780 [Trifolium subterran...   112   4e-24
GAU48830.1 hypothetical protein TSUD_190600 [Trifolium subterran...   103   2e-22
GAU30014.1 hypothetical protein TSUD_160990 [Trifolium subterran...   100   5e-22
ABN09044.1 Ribonuclease H [Medicago truncatula]                       101   7e-22
GAU11845.1 hypothetical protein TSUD_75960 [Trifolium subterraneum]   103   1e-21
KYP32780.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan]       103   4e-21
KYP56001.1 Putative ribonuclease H protein At1g65750 family, par...   101   8e-21
GAU34195.1 hypothetical protein TSUD_162960 [Trifolium subterran...    93   2e-19
GAU23925.1 hypothetical protein TSUD_181140 [Trifolium subterran...    93   2e-19

>GAU10400.1 hypothetical protein TSUD_423410, partial [Trifolium subterraneum]
          Length = 284

 Score =  169 bits (429), Expect = 2e-47
 Identities = 94/267 (35%), Positives = 132/267 (49%)
 Frame = -1

Query: 860 GEETLMHCVRDCTLSKQIWGALGMTSMEFQQAQDFFTWQQRALDGSEGPRFVAGLWYAWC 681
           GEET++HC+RDC +S++IW +LG  +  F    D   W +    G   P F+AGLW+ W 
Sbjct: 13  GEETILHCLRDCPISRRIWNSLGFQNSSFFSCSDLELWLRNNSIGLNAPTFLAGLWWNWR 72

Query: 680 ARNAKCIGGEDIPLQKILRDVKIME*AIARGFRFQHHTERTTRWVTSHQSREDVVVLNVD 501
           ARN  C+G   I   K++ +V  +   I   F  + HT+   RW++ H  + D VVLNVD
Sbjct: 73  ARNIFCVGNASIHSFKVVAEVSKLVALIVFCFPARVHTDTPRRWISWHPCKTDCVVLNVD 132

Query: 500 GSADXXXXXXXXXXXXXXXXGAWMXXXXXXXXXXXXXXXXXXXXXFGLQTAWDYSVRRLV 321
           GS                  G W+                      GL+ A +    RL 
Sbjct: 133 GSCLGDPGRAGFGGLFRKGDGEWIRGFSGYLGVTNIMLAELMAVYHGLKIAREAGYNRLF 192

Query: 320 CYSDSSMALKLVRSSVNFRHYYAAWIGGVTELLRRDWVVDLVHTLREGNGSADVLAKLKA 141
           CYSDS   L L+    N  H YAA I  + +LL  +W V L H+LREGN  AD LAKL +
Sbjct: 193 CYSDSKTVLDLLSKERNSFHCYAAIIANIQDLLVLEWDVSLKHSLREGNFCADFLAKLGS 252

Query: 140 SQTSKLIVVHSPPVELGAWLMADAMRI 60
           +   K  +  SPP ++   L++DA+R+
Sbjct: 253 ANDEKFFIWESPPPDMQDLLLSDALRV 279


>GAU33259.1 hypothetical protein TSUD_333820 [Trifolium subterraneum]
          Length = 284

 Score =  150 bits (379), Expect = 4e-40
 Identities = 90/267 (33%), Positives = 126/267 (47%)
 Frame = -1

Query: 860 GEETLMHCVRDCTLSKQIWGALGMTSMEFQQAQDFFTWQQRALDGSEGPRFVAGLWYAWC 681
           GEET++HC+RDC +S++IW +LG  +  F    D   W +    G   P F+AGLW+ W 
Sbjct: 19  GEETILHCLRDCPISRRIWNSLGFQNSSFFSCSDLELWLRNNSIGLNAPTFLAGLWWNWR 78

Query: 680 ARNAKCIGGEDIPLQKILRDVKIME*AIARGFRFQHHTERTTRWVTSHQSREDVVVLNVD 501
           ARN  C+G   I   K++ +V  +   I   F     T+      T    + D VVLNVD
Sbjct: 79  ARNICCVGNASIHSFKVVAEVSKLVALIVSCFPAWVRTD------TPRVCKTDCVVLNVD 132

Query: 500 GSADXXXXXXXXXXXXXXXXGAWMXXXXXXXXXXXXXXXXXXXXXFGLQTAWDYSVRRLV 321
           GS                  G W+                      GL+ A +    RL 
Sbjct: 133 GSCLGDPGRAGFGGLFRKGDGEWIRGSSGYLGVTNITLAELMAVYHGLKIAREAGYNRLF 192

Query: 320 CYSDSSMALKLVRSSVNFRHYYAAWIGGVTELLRRDWVVDLVHTLREGNGSADVLAKLKA 141
           CYSDS   L L+    N  H YAA I  + +LL  +W V L H++REGN  AD LAKL +
Sbjct: 193 CYSDSKTVLDLLSKERNSFHCYAAIIANIQDLLVLEWDVSLKHSVREGNFCADFLAKLGS 252

Query: 140 SQTSKLIVVHSPPVELGAWLMADAMRI 60
           +   K  +  SPP ++   L++DA+R+
Sbjct: 253 ANDEKFSIWESPPPDMQDLLLSDALRV 279


>GAU12283.1 hypothetical protein TSUD_141910 [Trifolium subterraneum]
          Length = 1049

 Score =  155 bits (393), Expect = 6e-39
 Identities = 89/263 (33%), Positives = 133/263 (50%), Gaps = 1/263 (0%)
 Frame = -1

Query: 854  ETLMHCVRDCTLSKQIWGALGMTSMEFQQAQDFFTWQQRALDGSEGPRFVAGLWYAWCAR 675
            ET +HC+RDC  ++ IW ++G +++ F Q  D + W +  L  S    F+A +W+ W AR
Sbjct: 782  ETTLHCLRDCDFAQSIWKSIGFSNLNFFQGDDPYVWIRNGLHCSSMFLFMATIWWIWRAR 841

Query: 674  NAKCIGGEDIPLQKILRDVKIME*AIA-RGFRFQHHTERTTRWVTSHQSREDVVVLNVDG 498
            NA C+  E I    +   ++IM+ A+        HH   T++ V  +      ++LNVDG
Sbjct: 842  NALCLNSESILFYSL--KLRIMDYALLIENCHSNHHDTSTSKLVKWNALGGTGMILNVDG 899

Query: 497  SADXXXXXXXXXXXXXXXXGAWMXXXXXXXXXXXXXXXXXXXXXFGLQTAWDYSVRRLVC 318
            S+                 GAW+                      GL  AWD +++ L C
Sbjct: 900  SSLGNPGISGFGGLIHNADGAWVLGFFGNLGVNNILHAELRAIYKGLLLAWDLNIKDLWC 959

Query: 317  YSDSSMALKLVRSSVNFRHYYAAWIGGVTELLRRDWVVDLVHTLREGNGSADVLAKLKAS 138
            YSDS MA+KL+  SV+  H+YAA +  + ++LRRDW V ++HT REGN  AD LAK  A+
Sbjct: 960  YSDSEMAIKLISESVDQWHHYAAILNNIQDILRRDWQVLILHTFREGNAYADYLAKHGAN 1019

Query: 137  QTSKLIVVHSPPVELGAWLMADA 69
                   + +PP  L   L+ADA
Sbjct: 1020 NNKVFSSIATPPAGLNLSLLADA 1042


>GAU17063.1 hypothetical protein TSUD_105620 [Trifolium subterraneum]
          Length = 440

 Score =  150 bits (380), Expect = 9e-39
 Identities = 85/265 (32%), Positives = 125/265 (47%), Gaps = 2/265 (0%)
 Frame = -1

Query: 857 EETLMHCVRDCTLSKQIWGALGMTSMEFQQAQDFFTWQQRALDGSEGPRFVAGLWYAWCA 678
           EET+MHC+RDC   K +W  +G T   F    + + W ++  D      F+A LW+ W A
Sbjct: 172 EETIMHCLRDCEFVKHLWKTIGFTDQTFFHGDNLYAWLRKGCDSPSMFMFLAALWWIWRA 231

Query: 677 RNAKCIGGEDIPLQKILRDVKIME*AIARGFRFQHHT--ERTTRWVTSHQSREDVVVLNV 504
           RN  C+  E +    I R ++     + + +  Q  T   R  RW  +H   +  ++LNV
Sbjct: 232 RNKLCLANELVSPFTISRCIEDYALLVKKCYSQQKSTLANRLVRW-NAHDGTD--MILNV 288

Query: 503 DGSADXXXXXXXXXXXXXXXXGAWMXXXXXXXXXXXXXXXXXXXXXFGLQTAWDYSVRRL 324
           DGS+                 GAW+                      GL  AWD  ++ L
Sbjct: 289 DGSSIGNPEIYGFGGLIRNSHGAWIRGFAGNIGFSNILHAELLAVYHGLVLAWDMDIKDL 348

Query: 323 VCYSDSSMALKLVRSSVNFRHYYAAWIGGVTELLRRDWVVDLVHTLREGNGSADVLAKLK 144
           +CYSDS  A+KL+   +N  H++AA +  + ++L RDW V + HTLREGN  AD LAK  
Sbjct: 349 ICYSDSKTAIKLIGDPINEWHHFAAILQNIKDILARDWRVTVAHTLREGNACADYLAKFG 408

Query: 143 ASQTSKLIVVHSPPVELGAWLMADA 69
           A        + +PP  +   L+ADA
Sbjct: 409 AQNIKVFSTMTTPPDGMNLLLLADA 433


>GAU50297.1 hypothetical protein TSUD_288310 [Trifolium subterraneum]
          Length = 545

 Score =  138 bits (348), Expect = 1e-33
 Identities = 85/265 (32%), Positives = 122/265 (46%), Gaps = 2/265 (0%)
 Frame = -1

Query: 857  EETLMHCVRDCTLSKQIWGALGMTSMEFQQAQDFFTWQQRALDGSEGPRFVAGLWYAWCA 678
            EE+ +HC+R+C   K+ W A+G     F Q  +   W + ++DG     F+A +W+ WCA
Sbjct: 277  EESTLHCLRNCEFIKRFWKAIGFLGQTFFQGDNLNDWLRNSIDGPSSFLFMAAVWWIWCA 336

Query: 677  RNAKCIGGEDIPLQKILRDVKIME*AIARGFRFQHHTERTT--RWVTSHQSREDVVVLNV 504
            RN  C+  E I    +  + + +   +   F  Q+ +   T  RW  +H      ++LNV
Sbjct: 337  RNQLCMDNEAISYFTLRTNTENLAQLLRMCFIKQNISSTATMVRW-NAHGGIG--MILNV 393

Query: 503  DGSADXXXXXXXXXXXXXXXXGAWMXXXXXXXXXXXXXXXXXXXXXFGLQTAWDYSVRRL 324
            DGS+                 GAW+                      GL  AW+  ++ L
Sbjct: 394  DGSSIGNPGISGFGGLIRNSDGAWVHGFAGNIGHLNILQAELLAIYHGLVLAWELDIKDL 453

Query: 323  VCYSDSSMALKLVRSSVNFRHYYAAWIGGVTELLRRDWVVDLVHTLREGNGSADVLAKLK 144
             CYSDS  ALKL+   VN  H YAA I  + + L R+W V LVH LREGN  AD+L K  
Sbjct: 454  CCYSDSKTALKLIYDHVNEWHQYAAIIYNIKDFLSRNWRVRLVHMLREGNNCADILDKFG 513

Query: 143  ASQTSKLIVVHSPPVELGAWLMADA 69
            A        +  PP  +   L+ADA
Sbjct: 514  ARNPKAYCSIAVPPDGMSLLLLADA 538


>GAU35042.1 hypothetical protein TSUD_30080 [Trifolium subterraneum]
          Length = 724

 Score =  131 bits (329), Expect = 1e-30
 Identities = 80/260 (30%), Positives = 117/260 (45%), Gaps = 1/260 (0%)
 Frame = -1

Query: 836  VRDCTLSKQIWGALGMTSMEFQQAQDFFTWQQRALDGSEGPRFVAGLWYAWCARNAKCIG 657
            + DC     IW +LG T   F Q  D  +W +  L  S    F+A +W+ W  RNA C+ 
Sbjct: 463  IHDCNFVYTIWKSLGFTDRNFFQEVDSSSWLRNGLSCSSMFLFMAAIWWIWRTRNALCLD 522

Query: 656  GEDIPLQKILRDVKIME*AIA-RGFRFQHHTERTTRWVTSHQSREDVVVLNVDGSADXXX 480
             E IP   +   ++I++ A+  +   F H      + V  +      ++LNVDGS     
Sbjct: 523  NELIPQFSL--KMRIVDYALLLKNCHFNHQVTTLPKIVRWNALGGTSMILNVDGSTIGNP 580

Query: 479  XXXXXXXXXXXXXGAWMXXXXXXXXXXXXXXXXXXXXXFGLQTAWDYSVRRLVCYSDSSM 300
                         GAW+                      GL  AW+ +++ L+CYSDS+ 
Sbjct: 581  GISGFGGLIRNADGAWIHGFFGNLGVTNILHAELMAILKGLLLAWELNIKDLLCYSDSAT 640

Query: 299  ALKLVRSSVNFRHYYAAWIGGVTELLRRDWVVDLVHTLREGNGSADVLAKLKASQTSKLI 120
            A+KL+   V+  H+YAA +  + ++L RDW V + HT REGN  AD LAK  A       
Sbjct: 641  AIKLITEPVDVWHHYAAILNNIKDILNRDWQVSIFHTFREGNACADYLAKHGAHNNIVFT 700

Query: 119  VVHSPPVELGAWLMADAMRI 60
             +  PP  L   L+AD   I
Sbjct: 701  TIAIPPAGLNLHLLADVSGI 720


>AFK48593.1 unknown [Lotus japonicus]
          Length = 272

 Score =  120 bits (300), Expect = 1e-28
 Identities = 79/239 (33%), Positives = 107/239 (44%), Gaps = 4/239 (1%)
 Frame = -1

Query: 761 DFFTWQQRALDGSEGPRFVAGLWYAWCARNAKCIGGEDIPLQKILRDVKIME*AIARGFR 582
           D   W +  L     P  ++ LW+ W  RN  C+ G+ IP Q +  D+  M   IAR + 
Sbjct: 34  DSKAWLREVLK-ENSPLVMSTLWWVWRLRNVWCMEGKLIPWQVLRGDILAMFDDIARCYA 92

Query: 581 FQ----HHTERTTRWVTSHQSREDVVVLNVDGSADXXXXXXXXXXXXXXXXGAWMXXXXX 414
                  HT R  RW        D VVLNVDGS                  G W+     
Sbjct: 93  VDVDAPMHTPRLVRWTVG---LADCVVLNVDGSVHGTPQRGGFGGCFRTIHGNWLRGFFG 149

Query: 413 XXXXXXXXXXXXXXXXFGLQTAWDYSVRRLVCYSDSSMALKLVRSSVNFRHYYAAWIGGV 234
                            GL  AW+   R + C SDS  A+ LV+S+ +  H YAA +  +
Sbjct: 150 YLDECCILHLELLGMFHGLSLAWEQGYRIVECQSDSQDAVTLVKSTPSSCHRYAALVWDI 209

Query: 233 TELLRRDWVVDLVHTLREGNGSADVLAKLKASQTSKLIVVHSPPVELGAWLMADAMRIA 57
            +L  RDW+V+L HTLREGN  AD+L K  A Q   L++  +P   LG  L+ADA  ++
Sbjct: 210 KDLQSRDWIVELRHTLREGNACADLLVKHGADQNDDLVITENPIAGLGVLLLADARGVS 268


>GAU47648.1 hypothetical protein TSUD_27720 [Trifolium subterraneum]
          Length = 521

 Score =  123 bits (308), Expect = 4e-28
 Identities = 78/237 (32%), Positives = 110/237 (46%), Gaps = 1/237 (0%)
 Frame = -1

Query: 776 FQQAQDFFTWQQRALDGSEGPRFVAGLWYAWCARNAKCIGGEDIPLQKILRDVKIME*A- 600
           F +  +F+ W    LD      F A +W+ WC RN  C+  E I  Q  LR ++I + A 
Sbjct: 280 FFEDDEFYVWLWNGLDSPSKLLFTAAIWWIWCTRNNLCMNNESIS-QVSLR-MRIEDYAH 337

Query: 599 IARGFRFQHHTERTTRWVTSHQSREDVVVLNVDGSADXXXXXXXXXXXXXXXXGAWMXXX 420
           + R   F   T   T+ V  +      ++LNVDGS+                 GAW    
Sbjct: 338 LLRACLFNQITMSNTKLVKWNALGSPDMILNVDGSSIGNPGVSGFGGLIHNSKGAWAHGF 397

Query: 419 XXXXXXXXXXXXXXXXXXFGLQTAWDYSVRRLVCYSDSSMALKLVRSSVNFRHYYAAWIG 240
                              GL  AW  +++ L CYSDS  A+KL+   V+  H+YAA + 
Sbjct: 398 VGNIGFSNILHAELMALYHGLLLAWQLNIKELWCYSDSETAIKLITEPVDEWHHYAAILL 457

Query: 239 GVTELLRRDWVVDLVHTLREGNGSADVLAKLKASQTSKLIVVHSPPVELGAWLMADA 69
            + ++L R+W V++ HT REGN  AD LAKL A     L V+ +PP  L   L+ADA
Sbjct: 458 NIKDILAREWRVNIAHTFREGNACADYLAKLGACNNEALSVMTNPPASLNLLLLADA 514


>KYP74203.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 611

 Score =  116 bits (290), Expect = 2e-25
 Identities = 78/263 (29%), Positives = 117/263 (44%), Gaps = 1/263 (0%)
 Frame = -1

Query: 854  ETLMHCVRDCTLSKQIWGALGMTSMEFQQAQDFFTWQQRALDGSEGPRFVAGLWYAWCAR 675
            E+++HC+RDC  SK +W   G +++ F    +  TW QR ++ ++  +F+A +W+ W  R
Sbjct: 342  ESVLHCMRDCKYSKALWLQFGFSNINFFN-HEVRTWIQRGVNSAQCSKFLAVMWWTWRWR 400

Query: 674  NAKCIGGEDIPLQKILRDVKIME*AIARGFRFQHHTERTTRWVTSHQSREDVVVLNVDGS 495
            N   +GGE+  ++++   +          F  +       + VT  +    +V +NVDGS
Sbjct: 401  NMDVLGGEEWNMEQVRYKINSTLCDWNNAFAGEQVHSHEVQEVTWCKPTPPMVKVNVDGS 460

Query: 494  ADXXXXXXXXXXXXXXXXGAWMXXXXXXXXXXXXXXXXXXXXXFGLQTAWDYSVRRLVCY 315
                              G W                       GL  AWD   R + C 
Sbjct: 461  YKSGDLCVGFGGVIRDHEGTWKTGFSGRHQTTSVLLAELLGLKHGLLLAWDAGFREVQCE 520

Query: 314  SDSSMALKLVRSS-VNFRHYYAAWIGGVTELLRRDWVVDLVHTLREGNGSADVLAKLKAS 138
            +DS  A +LV  + V   H +   I  + ELLRRDW V L H LR+GN  AD+LAK+  S
Sbjct: 521  TDSLEAFRLVSMAHVPKFHNFGVVIQQIRELLRRDWRVALSHILRQGNACADILAKMGGS 580

Query: 137  QTSKLIVVHSPPVELGAWLMADA 69
                 +V  SPP  L   L  DA
Sbjct: 581  LDRPFVVFGSPPHSLCEALRDDA 603


>KYP35629.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 1411

 Score =  116 bits (290), Expect = 3e-25
 Identities = 78/263 (29%), Positives = 117/263 (44%), Gaps = 1/263 (0%)
 Frame = -1

Query: 854  ETLMHCVRDCTLSKQIWGALGMTSMEFQQAQDFFTWQQRALDGSEGPRFVAGLWYAWCAR 675
            E+++HC+RDC  SK +W   G +++ F    +  TW QR ++ ++  +F+A +W+ W  R
Sbjct: 1142 ESVLHCMRDCKYSKALWLQFGFSNINFFN-HEVRTWIQRGVNSAQCSKFLAVMWWTWRWR 1200

Query: 674  NAKCIGGEDIPLQKILRDVKIME*AIARGFRFQHHTERTTRWVTSHQSREDVVVLNVDGS 495
            N   +GGE+  ++++   +          F  +       + VT  +    +V +NVDGS
Sbjct: 1201 NMDVLGGEEWNMEQVRYKINSTLCDWNNAFAGEQVHSHEVQEVTWCKPTPPMVKVNVDGS 1260

Query: 494  ADXXXXXXXXXXXXXXXXGAWMXXXXXXXXXXXXXXXXXXXXXFGLQTAWDYSVRRLVCY 315
                              G W                       GL  AWD   R + C 
Sbjct: 1261 YKSGDLCVGFGGVIRDHEGTWKTGFSGRHQTTSVLLAELLGLKHGLLLAWDAGFREVQCE 1320

Query: 314  SDSSMALKLVRSS-VNFRHYYAAWIGGVTELLRRDWVVDLVHTLREGNGSADVLAKLKAS 138
            +DS  A +LV  + V   H +   I  + ELLRRDW V L H LR+GN  AD+LAK+  S
Sbjct: 1321 TDSLEAFRLVSMAHVPKFHNFGVVIQQIRELLRRDWRVALSHILRQGNACADILAKMGGS 1380

Query: 137  QTSKLIVVHSPPVELGAWLMADA 69
                 +V  SPP  L   L  DA
Sbjct: 1381 LDRPFVVFGSPPHSLCEALRDDA 1403


>GAU24540.1 hypothetical protein TSUD_156530 [Trifolium subterraneum]
          Length = 1147

 Score =  113 bits (282), Expect = 3e-24
 Identities = 84/275 (30%), Positives = 116/275 (42%), Gaps = 10/275 (3%)
 Frame = -1

Query: 854  ETLMHCVRDCTLSKQIWGALGMTSMEFQQAQ-DFFTWQQRALDGSEGPRFVAGLWYAWCA 678
            ET++HC+  CT +  IW A G+  +       D F W  R +  S G      +W+ WC+
Sbjct: 875  ETIVHCLFACTDAIGIWRACGLEHVLPPSTDVDLFCWC-RDVGKSHGCIIFIIMWFVWCS 933

Query: 677  RNAKCIGGEDIPLQKILRDVKIME*AIARGFRFQHHTERTT---------RWVTSHQSRE 525
            RN          +  ++  V  M       F      E TT         R V   +  E
Sbjct: 934  RNDAIFNNNKAIVHNLVAKVHYMLSFCTAAF------ENTTSGSGGNSEHRLVVWPRPDE 987

Query: 524  DVVVLNVDGSADXXXXXXXXXXXXXXXXGAWMXXXXXXXXXXXXXXXXXXXXXFGLQTAW 345
              V LNVDGS                  GA++                      GL   W
Sbjct: 988  GTVCLNVDGSMLGSLQTAGFGGLIRNSFGAFLKGFYGTASQSSVLYAEIMAILHGLHLCW 1047

Query: 344  DYSVRRLVCYSDSSMALKLVRSSVNFRHYYAAWIGGVTELLRRDWVVDLVHTLREGNGSA 165
            +   R +VCYSDS  A+ L++  V+  H +A  I  + +LLRRDW + + H LREGN  A
Sbjct: 1048 NNGYRSIVCYSDSLQAVSLIKDGVSHFHTFANEIYTIHQLLRRDWTIVIEHILREGNACA 1107

Query: 164  DVLAKLKASQTSKLIVVHSPPVELGAWLMADAMRI 60
            D+LAK  +S  S +++V SPP E    L ADA  I
Sbjct: 1108 DILAKKGSSTNSPIVIVESPPPEPSNALSADARGI 1142


>GAU22350.1 hypothetical protein TSUD_106780 [Trifolium subterraneum]
          Length = 1200

 Score =  112 bits (281), Expect = 4e-24
 Identities = 63/215 (29%), Positives = 101/215 (46%), Gaps = 1/215 (0%)
 Frame = -1

Query: 854  ETLMHCVRDCTLSKQIWGALGMTSMEFQQAQDFFTWQQRALDGSEGPRFVAGLWYAWCAR 675
            ET +HC+RDC    +IW ++G     F Q  D + W    L+      F+AG+W+ W AR
Sbjct: 988  ETTIHCLRDCDFVNRIWKSIGFLDQNFFQGVDVYAWLHNGLNSPTMMLFIAGIWWIWRAR 1047

Query: 674  NAKCIGGEDIPLQKILRDVKIME*A-IARGFRFQHHTERTTRWVTSHQSREDVVVLNVDG 498
            NA C+  E +      + ++IM+ A + +   F  H   TT++V  +      ++LNVDG
Sbjct: 1048 NAMCLDSEMVSFWS--QKLRIMDYALLLKNCYFSTHEISTTKFVKWNALGGTGLILNVDG 1105

Query: 497  SADXXXXXXXXXXXXXXXXGAWMXXXXXXXXXXXXXXXXXXXXXFGLQTAWDYSVRRLVC 318
            S+                 GAW+                      GL  AW+ +++ L C
Sbjct: 1106 SSIGNPGISGFGGLIRNADGAWIHGFFGNLGVTNILHPELMAIYKGLLLAWELNIKELWC 1165

Query: 317  YSDSSMALKLVRSSVNFRHYYAAWIGGVTELLRRD 213
            YSDS MA+KL+    +  H+YAA +  + ++L R+
Sbjct: 1166 YSDSKMAIKLITDPTDVWHHYAAILNNIKDILDRE 1200


>GAU48830.1 hypothetical protein TSUD_190600 [Trifolium subterraneum]
          Length = 298

 Score =  103 bits (258), Expect = 2e-22
 Identities = 71/238 (29%), Positives = 104/238 (43%), Gaps = 3/238 (1%)
 Frame = -1

Query: 773 QQAQDFFTWQQRALDGSEGPRFVAGLWYAWCARNAKCIGGEDIPLQKILRDVKIME*AIA 594
           Q A +F   Q + +  + GP F   LW  WC RN      +      I+  +  +  +  
Sbjct: 55  QGADNFI--QYKNMGKNHGPLFFIVLWVIWCVRNEFVFNNQRESTHIIMGKIYSLLHSCE 112

Query: 593 RGFRFQHHTERTT---RWVTSHQSREDVVVLNVDGSADXXXXXXXXXXXXXXXXGAWMXX 423
             F   H +  TT   R VT  +  E  V LNVDGS                  G ++  
Sbjct: 113 AVFTPPHSSMATTAKPRLVTWTKPAEGTVCLNVDGSLLKATNTAGYGGLIRDSNGVFLSG 172

Query: 422 XXXXXXXXXXXXXXXXXXXFGLQTAWDYSVRRLVCYSDSSMALKLVRSSVNFRHYYAAWI 243
                               GLQ  W+   RR+ C+SDS   + L+R  V+  H ++  +
Sbjct: 173 FYGTATVQSILFAELMAVLHGLQICWESGFRRITCFSDSLQIVNLIRDGVSAHHRFSNEV 232

Query: 242 GGVTELLRRDWVVDLVHTLREGNGSADVLAKLKASQTSKLIVVHSPPVELGAWLMADA 69
             + +LL +DW V + HT REGN  ADVLAK+ A+  S L+ + +PP +L   L+ADA
Sbjct: 233 FIIHQLLAKDWEVVIGHTFREGNACADVLAKMGAASDSTLVTISTPPCDLSMPLLADA 290


>GAU30014.1 hypothetical protein TSUD_160990 [Trifolium subterraneum]
          Length = 168

 Score = 99.8 bits (247), Expect = 5e-22
 Identities = 48/102 (47%), Positives = 67/102 (65%)
 Frame = -1

Query: 362 GLQTAWDYSVRRLVCYSDSSMALKLVRSSVNFRHYYAAWIGGVTELLRRDWVVDLVHTLR 183
           GL   W    R++ CYSDS  A+ L+R  V+  H YA  I  +  LLRRDW+V ++HTLR
Sbjct: 63  GLDLCWVNGYRKIECYSDSLQAVALIRDGVSPHHQYANEIQSIRHLLRRDWIVAVIHTLR 122

Query: 182 EGNGSADVLAKLKASQTSKLIVVHSPPVELGAWLMADAMRIA 57
           EGN  ADVLAK+ +S++S  +V+  PP +L + L ADA+ +A
Sbjct: 123 EGNACADVLAKMGSSESSAQVVLDEPPPQLSSALHADALGVA 164


>ABN09044.1 Ribonuclease H [Medicago truncatula]
          Length = 235

 Score =  101 bits (251), Expect = 7e-22
 Identities = 67/217 (30%), Positives = 103/217 (47%), Gaps = 4/217 (1%)
 Frame = -1

Query: 698 LWYAWCARNAKCIGGEDIP--LQKILRDVKIME*AIARGFRF--QHHTERTTRWVTSHQS 531
           +W  WC+RN KCI  EDI   +Q+I   V      I + F     H  ++  R V+  + 
Sbjct: 16  IWKIWCSRN-KCIF-EDIKHSIQEIGAQVLSSLHHILKAFAHPTSHSVQQPARIVSWQRP 73

Query: 530 REDVVVLNVDGSADXXXXXXXXXXXXXXXXGAWMXXXXXXXXXXXXXXXXXXXXXFGLQT 351
             + V LNVDG+                   +++                      GL+ 
Sbjct: 74  SMNSVALNVDGNVFLDSNLGSFGGLIRDHTSSFLHGFFGKNSRPCILHVEISGLYHGLKL 133

Query: 350 AWDYSVRRLVCYSDSSMALKLVRSSVNFRHYYAAWIGGVTELLRRDWVVDLVHTLREGNG 171
            WD  ++ +VC+SDS+  + LV+  +N  H Y   I  + +LLRRDWVV L HTL EGN 
Sbjct: 134 CWDIGIKHVVCHSDSTTVVDLVQKDLNVHHKYGNLIMAIKKLLRRDWVVSLRHTLCEGNA 193

Query: 170 SADVLAKLKASQTSKLIVVHSPPVELGAWLMADAMRI 60
           +AD LAK  A   + L++++  P ++   L+ADA+ +
Sbjct: 194 AADFLAKKGALSDTSLVILNEAPPDIAFVLLADAVGV 230


>GAU11845.1 hypothetical protein TSUD_75960 [Trifolium subterraneum]
          Length = 386

 Score =  103 bits (257), Expect = 1e-21
 Identities = 62/216 (28%), Positives = 99/216 (45%), Gaps = 1/216 (0%)
 Frame = -1

Query: 854 ETLMHCVRDCTLSKQIWGALGMTSMEFQQAQDFFTWQQRALDGSEGPRFVAGLWYAWCAR 675
           ET +HC+RDC     IW +LG T   F Q  D  +W +  L  S    F+A +W+ W  R
Sbjct: 173 ETDLHCLRDCDFVYTIWKSLGFTDHNFFQEVDSSSWLRNGLSCSSMFLFMAAIWWIWRTR 232

Query: 674 NAKCIGGEDIPLQKILRDVKIME*A-IARGFRFQHHTERTTRWVTSHQSREDVVVLNVDG 498
           NA C+  E +P  +    ++I++ A + +   F +      + V  +      ++LNVD 
Sbjct: 233 NALCLDNELVP--QFSLKMRIVDYALLLKNCHFNYQVTTLPKIVRWNALGGTSMILNVDR 290

Query: 497 SADXXXXXXXXXXXXXXXXGAWMXXXXXXXXXXXXXXXXXXXXXFGLQTAWDYSVRRLVC 318
           S+                 GAW+                      GL  AW+ +++ L C
Sbjct: 291 SSIGNPGISGFGGLICNAYGAWIHGFFGNLGVTNILHAELMAILKGLLLAWELNIKDLSC 350

Query: 317 YSDSSMALKLVRSSVNFRHYYAAWIGGVTELLRRDW 210
           YSDS+ A+KL+   V+  H+YAA +  + ++L RDW
Sbjct: 351 YSDSATAIKLITEPVDVWHHYAAILNNIKDILNRDW 386


>KYP32780.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan]
          Length = 1123

 Score =  103 bits (258), Expect = 4e-21
 Identities = 71/264 (26%), Positives = 110/264 (41%), Gaps = 1/264 (0%)
 Frame = -1

Query: 857  EETLMHCVRDCTLSKQIWGALGMTSME-FQQAQDFFTWQQRALDGSEGPRFVAGLWYAWC 681
            EET+MHC RDC   +++W  L   S + F Q  +F  W    +    G  F++ +W  W 
Sbjct: 855  EETVMHCFRDCHEVQEVWRILQFVSCDTFYQIDNFKMWVNHGIKLG-GALFLSTIWEIWL 913

Query: 680  ARNAKCIGGEDIPLQKILRDVKIME*AIARGFRFQHHTERTTRWVTSHQSREDVVVLNVD 501
             RN     G      ++    K +  A+   F          +WV      E+ V+LN D
Sbjct: 914  GRNRLVFEGSKTKAWQVALAAKSLSEAMTNVFLNHEVNSNLPKWVGWSAPSENCVILNTD 973

Query: 500  GSADXXXXXXXXXXXXXXXXGAWMXXXXXXXXXXXXXXXXXXXXXFGLQTAWDYSVRRLV 321
            GS                  GAW+                      GL+ A    + R+ 
Sbjct: 974  GSV--MEDKAGFGGVLRSSNGAWIHGFCGNVDGYEIIGVELLGILQGLRIAQRLGLSRVY 1031

Query: 320  CYSDSSMALKLVRSSVNFRHYYAAWIGGVTELLRRDWVVDLVHTLREGNGSADVLAKLKA 141
            C ++S  A+K ++  V+  H+Y+  +  + +LL +DW V + H LRE N  AD  AKL  
Sbjct: 1032 CQTNSLAAVKWIQGGVSHMHHYSNLVQEIHKLLDKDWAVSISHVLRECNKCADYFAKLGL 1091

Query: 140  SQTSKLIVVHSPPVELGAWLMADA 69
            +   +L     PP+++   L ADA
Sbjct: 1092 NCPDRLTNFMEPPLDVIPLLQADA 1115


>KYP56001.1 Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 414

 Score =  101 bits (252), Expect = 8e-21
 Identities = 70/264 (26%), Positives = 108/264 (40%), Gaps = 1/264 (0%)
 Frame = -1

Query: 857 EETLMHCVRDCTLSKQIWGALGMTSME-FQQAQDFFTWQQRALDGSEGPRFVAGLWYAWC 681
           EET+MHC RDC   +++W  L   S + F Q  +F  W    +    G  F++ +W  W 
Sbjct: 146 EETVMHCFRDCHEVQEVWKILQFVSCDTFYQIDNFKMWVNHGIKLG-GALFLSTIWEIWL 204

Query: 680 ARNAKCIGGEDIPLQKILRDVKIME*AIARGFRFQHHTERTTRWVTSHQSREDVVVLNVD 501
             N     G      ++    K    A+   F          +WV      E+ V+LN D
Sbjct: 205 GWNRLVFEGSKTKAWQVALAAKSFSEAMTNVFLNHEVNSNLPKWVGWSAPSENCVILNTD 264

Query: 500 GSADXXXXXXXXXXXXXXXXGAWMXXXXXXXXXXXXXXXXXXXXXFGLQTAWDYSVRRLV 321
           GS                  G W+                      GL+ A    + R+ 
Sbjct: 265 GSV--MEDKAGFGGVLRSSDGVWIHGFYGNVDGSDIIGVELLGILQGLRIAQRLGLSRVY 322

Query: 320 CYSDSSMALKLVRSSVNFRHYYAAWIGGVTELLRRDWVVDLVHTLREGNGSADVLAKLKA 141
           C +DS +A+K ++  V+  H+Y+  +  + +LL +DW V + H LRE N  AD  AKL  
Sbjct: 323 CQTDSLVAVKWIQGGVSHMHHYSNLVQEIHKLLDKDWAVSISHVLRECNKCADYFAKLGL 382

Query: 140 SQTSKLIVVHSPPVELGAWLMADA 69
           +   +L     PP+++   L ADA
Sbjct: 383 NCPDRLTNFMEPPLDVIPMLQADA 406


>GAU34195.1 hypothetical protein TSUD_162960 [Trifolium subterraneum]
          Length = 168

 Score = 93.2 bits (230), Expect = 2e-19
 Identities = 56/150 (37%), Positives = 71/150 (47%)
 Frame = -1

Query: 518 VVLNVDGSADXXXXXXXXXXXXXXXXGAWMXXXXXXXXXXXXXXXXXXXXXFGLQTAWDY 339
           ++LNVDGS+                 GAW+                      GL  AW+ 
Sbjct: 12  MILNVDGSSIGNPGISGFGGLIRNSDGAWVHGFAGNIGHSNILHAELLAIYHGLVLAWEL 71

Query: 338 SVRRLVCYSDSSMALKLVRSSVNFRHYYAAWIGGVTELLRRDWVVDLVHTLREGNGSADV 159
            ++ L CYSDS  ALKL+   VN  H+YAA I  + + L R+W V LVHTLREGN  AD 
Sbjct: 72  DIKDLCCYSDSKTALKLIYDHVNEWHHYAAIIYNIKDFLSRNWRVRLVHTLREGNNCADF 131

Query: 158 LAKLKASQTSKLIVVHSPPVELGAWLMADA 69
           LAK  A        +  PP E+   L+ADA
Sbjct: 132 LAKFGARNPEAYSSIAVPPDEMNLLLLADA 161


>GAU23925.1 hypothetical protein TSUD_181140 [Trifolium subterraneum]
          Length = 166

 Score = 92.8 bits (229), Expect = 2e-19
 Identities = 57/156 (36%), Positives = 79/156 (50%), Gaps = 1/156 (0%)
 Frame = -1

Query: 533 SREDVVVLNVDGSADXXXXXXXXXXXXXXXXGAWMXXXXXXXXXXXXXXXXXXXXXF-GL 357
           SR + ++LNVDGS+                 GAW+                     + GL
Sbjct: 4   SRGNGMILNVDGSSIGNPGVSGFGGLIRNADGAWVHGFFGNLGVTNNIIHAKLMAIYKGL 63

Query: 356 QTAWDYSVRRLVCYSDSSMALKLVRSSVNFRHYYAAWIGGVTELLRRDWVVDLVHTLREG 177
             AWD +++ L CYS+S MA+KL+   V+  H+YAA +  + ELL RDW V ++HT RE 
Sbjct: 64  LLAWDLNIKDLWCYSNSKMAIKLITELVDEWHHYAAILNNINELLNRDWRVLILHTFRES 123

Query: 176 NGSADVLAKLKASQTSKLIVVHSPPVELGAWLMADA 69
           N  AD LAK  A+ T   + +  PPV L   L+ADA
Sbjct: 124 NACADYLAKHGANNTDVFVSIAIPPVGLNLSLLADA 159