BLASTX nr result

ID: Glycyrrhiza28_contig00019010 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza28_contig00019010
         (835 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

GAU50297.1 hypothetical protein TSUD_288310 [Trifolium subterran...   189   6e-53
GAU10400.1 hypothetical protein TSUD_423410, partial [Trifolium ...   182   9e-53
GAU33259.1 hypothetical protein TSUD_333820 [Trifolium subterran...   171   2e-48
GAU17063.1 hypothetical protein TSUD_105620 [Trifolium subterran...   170   2e-46
GAU12283.1 hypothetical protein TSUD_141910 [Trifolium subterran...   166   5e-43
GAU35042.1 hypothetical protein TSUD_30080 [Trifolium subterraneum]   162   1e-41
GAU28886.1 hypothetical protein TSUD_293380 [Trifolium subterran...   147   5e-39
GAU41508.1 hypothetical protein TSUD_302460 [Trifolium subterran...   154   1e-38
GAU40444.1 hypothetical protein TSUD_397630 [Trifolium subterran...   144   3e-38
GAU47648.1 hypothetical protein TSUD_27720 [Trifolium subterraneum]   149   8e-38
ABE80156.1 Ribonuclease H [Medicago truncatula]                       147   1e-37
GAU50246.1 hypothetical protein TSUD_188980 [Trifolium subterran...   145   7e-37
ABD28627.2 RNA-directed DNA polymerase (Reverse transcriptase); ...   147   4e-36
GAU48831.1 hypothetical protein TSUD_190610 [Trifolium subterran...   137   2e-35
GAU49954.1 hypothetical protein TSUD_180180 [Trifolium subterran...   144   3e-35
GAU22350.1 hypothetical protein TSUD_106780 [Trifolium subterran...   144   4e-35
KYP61721.1 Putative ribonuclease H protein At1g65750 family [Caj...   134   5e-35
GAU34195.1 hypothetical protein TSUD_162960 [Trifolium subterran...   132   6e-35
KYP64035.1 Putative ribonuclease H protein At1g65750 family [Caj...   132   2e-34
GAU48830.1 hypothetical protein TSUD_190600 [Trifolium subterran...   135   2e-34

>GAU50297.1 hypothetical protein TSUD_288310 [Trifolium subterraneum]
          Length = 545

 Score =  189 bits (481), Expect = 6e-53
 Identities = 102/264 (38%), Positives = 142/264 (53%)
 Frame = -2

Query: 795  PRCGAVEEYVLHCVRDCNKPREMWRQLGLGDQAFFQYDDFPAWQLATSAGPHSSVFLAGL 616
            PRC   EE  LHC+R+C   +  W+ +G   Q FFQ D+   W   +  GP S +F+A +
Sbjct: 271  PRCNIEEESTLHCLRNCEFIKRFWKAIGFLGQTFFQGDNLNDWLRNSIDGPSSFLFMAAV 330

Query: 615  WTAWCMRNSQYLRGEDTPMHKVIRDVKLLVAGIEMASRATNQNMRPTPHVVPWHQSCLPG 436
            W  WC RN   +  E      +  + + L   + M      QN+  T  +V W+     G
Sbjct: 331  WWIWCARNQLCMDNEAISYFTLRTNTENLAQLLRMCF--IKQNISSTATMVRWNAHGGIG 388

Query: 435  DVLNVDGSSMGNLGPAGFGGLLRMGDGEWRGGFYGSVGPTDNLRVEIAALRFGLEFVWEA 256
             +LNVDGSS+GN G +GFGGL+R  DG W  GF G++G  + L+ E+ A+  GL   WE 
Sbjct: 389  MILNVDGSSIGNPGISGFGGLIRNSDGAWVHGFAGNIGHLNILQAELLAIYHGLVLAWEL 448

Query: 255  GERQLICYSDSTMALDCALGLVPATHRDAGLVHGIQELLAREWEVRLIHTLREGNNSADA 76
              + L CYSDS  AL      V   H+ A +++ I++ L+R W VRL+H LREGNN AD 
Sbjct: 449  DIKDLCCYSDSKTALKLIYDHVNEWHQYAAIIYNIKDFLSRNWRVRLVHMLREGNNCADI 508

Query: 75   LAKLGAMQ*ARMVVLSVPSAMLSL 4
            L K GA        ++VP   +SL
Sbjct: 509  LDKFGARNPKAYCSIAVPPDGMSL 532


>GAU10400.1 hypothetical protein TSUD_423410, partial [Trifolium subterraneum]
          Length = 284

 Score =  182 bits (462), Expect = 9e-53
 Identities = 105/257 (40%), Positives = 135/257 (52%)
 Frame = -2

Query: 792 RCGAVEEYVLHCVRDCNKPREMWRQLGLGDQAFFQYDDFPAWQLATSAGPHSSVFLAGLW 613
           RC + EE +LHC+RDC   R +W  LG  + +FF   D   W    S G ++  FLAGLW
Sbjct: 9   RCSSGEETILHCLRDCPISRRIWNSLGFQNSSFFSCSDLELWLRNNSIGLNAPTFLAGLW 68

Query: 612 TAWCMRNSQYLRGEDTPMHKVIRDVKLLVAGIEMASRATNQNMRPTPHVVPWHQSCLPGD 433
             W  RN   +        KV+ +V  LVA I     A      P    + WH       
Sbjct: 69  WNWRARNIFCVGNASIHSFKVVAEVSKLVALIVFCFPARVHTDTPR-RWISWHPCKTDCV 127

Query: 432 VLNVDGSSMGNLGPAGFGGLLRMGDGEWRGGFYGSVGPTDNLRVEIAALRFGLEFVWEAG 253
           VLNVDGS +G+ G AGFGGL R GDGEW  GF G +G T+ +  E+ A+  GL+   EAG
Sbjct: 128 VLNVDGSCLGDPGRAGFGGLFRKGDGEWIRGFSGYLGVTNIMLAELMAVYHGLKIAREAG 187

Query: 252 ERQLICYSDSTMALDCALGLVPATHRDAGLVHGIQELLAREWEVRLIHTLREGNNSADAL 73
             +L CYSDS   LD       + H  A ++  IQ+LL  EW+V L H+LREGN  AD L
Sbjct: 188 YNRLFCYSDSKTVLDLLSKERNSFHCYAAIIANIQDLLVLEWDVSLKHSLREGNFCADFL 247

Query: 72  AKLGAMQ*ARMVVLSVP 22
           AKLG+    +  +   P
Sbjct: 248 AKLGSANDEKFFIWESP 264


>GAU33259.1 hypothetical protein TSUD_333820 [Trifolium subterraneum]
          Length = 284

 Score =  171 bits (433), Expect = 2e-48
 Identities = 103/245 (42%), Positives = 132/245 (53%)
 Frame = -2

Query: 792 RCGAVEEYVLHCVRDCNKPREMWRQLGLGDQAFFQYDDFPAWQLATSAGPHSSVFLAGLW 613
           RC + EE +LHC+RDC   R +W  LG  + +FF   D   W    S G ++  FLAGLW
Sbjct: 15  RCSSGEETILHCLRDCPISRRIWNSLGFQNSSFFSCSDLELWLRNNSIGLNAPTFLAGLW 74

Query: 612 TAWCMRNSQYLRGEDTPMHKVIRDVKLLVAGIEMASRATNQNMRPTPHVVPWHQSCLPGD 433
             W  RN   +        KV+ +V  LVA I     A  +    TP V      C+   
Sbjct: 75  WNWRARNICCVGNASIHSFKVVAEVSKLVALIVSCFPAWVRT--DTPRVCK--TDCV--- 127

Query: 432 VLNVDGSSMGNLGPAGFGGLLRMGDGEWRGGFYGSVGPTDNLRVEIAALRFGLEFVWEAG 253
           VLNVDGS +G+ G AGFGGL R GDGEW  G  G +G T+    E+ A+  GL+   EAG
Sbjct: 128 VLNVDGSCLGDPGRAGFGGLFRKGDGEWIRGSSGYLGVTNITLAELMAVYHGLKIAREAG 187

Query: 252 ERQLICYSDSTMALDCALGLVPATHRDAGLVHGIQELLAREWEVRLIHTLREGNNSADAL 73
             +L CYSDS   LD       + H  A ++  IQ+LL  EW+V L H++REGN  AD L
Sbjct: 188 YNRLFCYSDSKTVLDLLSKERNSFHCYAAIIANIQDLLVLEWDVSLKHSVREGNFCADFL 247

Query: 72  AKLGA 58
           AKLG+
Sbjct: 248 AKLGS 252


>GAU17063.1 hypothetical protein TSUD_105620 [Trifolium subterraneum]
          Length = 440

 Score =  170 bits (431), Expect = 2e-46
 Identities = 98/273 (35%), Positives = 138/273 (50%), Gaps = 3/273 (1%)
 Frame = -2

Query: 813 LVIQTGPRCGAVEEYVLHCVRDCNKPREMWRQLGLGDQAFFQYDDFPAWQLATSAGPHSS 634
           L +   PR    EE ++HC+RDC   + +W+ +G  DQ FF  D+  AW       P   
Sbjct: 160 LSVHVCPRSDIAEETIMHCLRDCEFVKHLWKTIGFTDQTFFHGDNLYAWLRKGCDSPSMF 219

Query: 633 VFLAGLWTAWCMRNSQYLRGE---DTPMHKVIRDVKLLVAGIEMASRATNQNMRPTPHVV 463
           +FLA LW  W  RN   L  E      + + I D  LLV       ++T  N      +V
Sbjct: 220 MFLAALWWIWRARNKLCLANELVSPFTISRCIEDYALLVKKCYSQQKSTLAN-----RLV 274

Query: 462 PWHQSCLPGDVLNVDGSSMGNLGPAGFGGLLRMGDGEWRGGFYGSVGPTDNLRVEIAALR 283
            W+       +LNVDGSS+GN    GFGGL+R   G W  GF G++G ++ L  E+ A+ 
Sbjct: 275 RWNAHDGTDMILNVDGSSIGNPEIYGFGGLIRNSHGAWIRGFAGNIGFSNILHAELLAVY 334

Query: 282 FGLEFVWEAGERQLICYSDSTMALDCALGLVPATHRDAGLVHGIQELLAREWEVRLIHTL 103
            GL   W+   + LICYSDS  A+      +   H  A ++  I+++LAR+W V + HTL
Sbjct: 335 HGLVLAWDMDIKDLICYSDSKTAIKLIGDPINEWHHFAAILQNIKDILARDWRVTVAHTL 394

Query: 102 REGNNSADALAKLGAMQ*ARMVVLSVPSAMLSL 4
           REGN  AD LAK GA        ++ P   ++L
Sbjct: 395 REGNACADYLAKFGAQNIKVFSTMTTPPDGMNL 427


>GAU12283.1 hypothetical protein TSUD_141910 [Trifolium subterraneum]
          Length = 1049

 Score =  166 bits (421), Expect = 5e-43
 Identities = 98/268 (36%), Positives = 141/268 (52%), Gaps = 6/268 (2%)
 Frame = -2

Query: 789  CGAVEEYVLHCVRDCNKPREMWRQLGLGDQAFFQYDDFPAWQLATSAGPHSS---VFLAG 619
            C    E  LHC+RDC+  + +W+ +G  +  FFQ DD   W      G H S   +F+A 
Sbjct: 777  CNTHLETTLHCLRDCDFAQSIWKSIGFSNLNFFQGDDPYVW---IRNGLHCSSMFLFMAT 833

Query: 618  LWTAWCMRNSQYLRGEDTPMHKV---IRDVKLLVAGIEMASRATNQNMRPTPHVVPWHQS 448
            +W  W  RN+  L  E    + +   I D  LL+         +N +   T  +V W+  
Sbjct: 834  IWWIWRARNALCLNSESILFYSLKLRIMDYALLIENCH-----SNHHDTSTSKLVKWNAL 888

Query: 447  CLPGDVLNVDGSSMGNLGPAGFGGLLRMGDGEWRGGFYGSVGPTDNLRVEIAALRFGLEF 268
               G +LNVDGSS+GN G +GFGGL+   DG W  GF+G++G  + L  E+ A+  GL  
Sbjct: 889  GGTGMILNVDGSSLGNPGISGFGGLIHNADGAWVLGFFGNLGVNNILHAELRAIYKGLLL 948

Query: 267  VWEAGERQLICYSDSTMALDCALGLVPATHRDAGLVHGIQELLAREWEVRLIHTLREGNN 88
             W+   + L CYSDS MA+      V   H  A +++ IQ++L R+W+V ++HT REGN 
Sbjct: 949  AWDLNIKDLWCYSDSEMAIKLISESVDQWHHYAAILNNIQDILRRDWQVLILHTFREGNA 1008

Query: 87   SADALAKLGAMQ*ARMVVLSVPSAMLSL 4
             AD LAK GA        ++ P A L+L
Sbjct: 1009 YADYLAKHGANNNKVFSSIATPPAGLNL 1036


>GAU35042.1 hypothetical protein TSUD_30080 [Trifolium subterraneum]
          Length = 724

 Score =  162 bits (409), Expect = 1e-41
 Identities = 91/254 (35%), Positives = 133/254 (52%), Gaps = 3/254 (1%)
 Frame = -2

Query: 756  VRDCNKPREMWRQLGLGDQAFFQYDDFPAWQLATSAGPHSSVFLAGLWTAWCMRNSQYLR 577
            + DCN    +W+ LG  D+ FFQ  D  +W     +     +F+A +W  W  RN+  L 
Sbjct: 463  IHDCNFVYTIWKSLGFTDRNFFQEVDSSSWLRNGLSCSSMFLFMAAIWWIWRTRNALCLD 522

Query: 576  GEDTPMHKV---IRDVKLLVAGIEMASRATNQNMRPTPHVVPWHQSCLPGDVLNVDGSSM 406
             E  P   +   I D  LL+          N  +   P +V W+       +LNVDGS++
Sbjct: 523  NELIPQFSLKMRIVDYALLLKNCHF-----NHQVTTLPKIVRWNALGGTSMILNVDGSTI 577

Query: 405  GNLGPAGFGGLLRMGDGEWRGGFYGSVGPTDNLRVEIAALRFGLEFVWEAGERQLICYSD 226
            GN G +GFGGL+R  DG W  GF+G++G T+ L  E+ A+  GL   WE   + L+CYSD
Sbjct: 578  GNPGISGFGGLIRNADGAWIHGFFGNLGVTNILHAELMAILKGLLLAWELNIKDLLCYSD 637

Query: 225  STMALDCALGLVPATHRDAGLVHGIQELLAREWEVRLIHTLREGNNSADALAKLGAMQ*A 46
            S  A+      V   H  A +++ I+++L R+W+V + HT REGN  AD LAK GA    
Sbjct: 638  SATAIKLITEPVDVWHHYAAILNNIKDILNRDWQVSIFHTFREGNACADYLAKHGAHNNI 697

Query: 45   RMVVLSVPSAMLSL 4
                +++P A L+L
Sbjct: 698  VFTTIAIPPAGLNL 711


>GAU28886.1 hypothetical protein TSUD_293380 [Trifolium subterraneum]
          Length = 286

 Score =  147 bits (370), Expect = 5e-39
 Identities = 91/261 (34%), Positives = 135/261 (51%), Gaps = 3/261 (1%)
 Frame = -2

Query: 795 PRCGAVEEYVLHCVRDCNKPREMWRQLGLGDQAFFQYDDFPAWQLATSAGPHSSV--FLA 622
           PR G  EE  LHCVRDC+  R +W  LG     FF   D   W    S G  +    F A
Sbjct: 8   PRYGTHEETFLHCVRDCDLSRPIWHHLGFITPDFFSLSDAHEWLKFGSTGSQAFAFSFSA 67

Query: 621 GLWTAWCMRNSQYLRGEDTPMHKVIRDVKLLVAGIEMASRATNQNMRPTPHVVPWHQSCL 442
           G+W AW  RN   L+ E   ++++  +++ ++A I     + +       H + W+ +  
Sbjct: 68  GVWWAWRHRNLMCLQNETWSINRLSFNIQSMIATITSCFSSRSTTTSEEIH-IKWNNNNF 126

Query: 441 PGDVLNVDGSSMGNLGPAGFGGLLRMGDGEWRGGFYGSV-GPTDNLRVEIAALRFGLEFV 265
           PG +LNVDGS +G+   AGFGG++R   G +  GF G + G +D L  E+ A+  GL   
Sbjct: 127 PGVILNVDGSCLGSPVRAGFGGVIRNESGFYLSGFSGFIQGSSDILLAELFAIYKGLTLA 186

Query: 264 WEAGERQLICYSDSTMALDCALGLVPATHRDAGLVHGIQELLAREWEVRLIHTLREGNNS 85
                 +L+CYSDS   ++   G     H    L+  I+EL+++   + L HTLREGNN 
Sbjct: 187 KNMAIDELVCYSDSLHCINLIKGPSIKYHVYVVLIQDIKELMSQS-NITLCHTLREGNNC 245

Query: 84  ADALAKLGAMQ*ARMVVLSVP 22
           A+ LAKLGA   + + + + P
Sbjct: 246 ANFLAKLGASSDSDLTIHASP 266


>GAU41508.1 hypothetical protein TSUD_302460 [Trifolium subterraneum]
          Length = 1075

 Score =  154 bits (388), Expect = 1e-38
 Identities = 89/258 (34%), Positives = 136/258 (52%), Gaps = 1/258 (0%)
 Frame = -2

Query: 792  RCGAVEEYVLHCVRDCNKPREMWRQLGLGDQAFFQYDDFPAWQLATSAGPHSSVFLAGLW 613
            RCG  EE  LHCVRDC+  R +W  +G  D  FF + D   W    S G  +  F A +W
Sbjct: 800  RCGLQEETFLHCVRDCDFSRNIWHHIGFNDPTFFSFTDAREWLKVGSTGSQAYTFSASVW 859

Query: 612  TAWCMRNSQYLRGEDTPMHKVIRDVKLLVAGIEMASRATNQNMRPTPHVVPWHQSCLPGD 433
             AW  RN   L  E   ++++  ++  +V  I  +S ++N +      ++ W+       
Sbjct: 860  WAWRHRNLMCLNNESWSLNRLSFNIHSMV-DIFTSSFSSNSDGTSVSRLIKWNNDNFSCV 918

Query: 432  VLNVDGSSMGNLGPAGFGGLLRMGDGEWRGGFYGSV-GPTDNLRVEIAALRFGLEFVWEA 256
            +LNVDGS +G+   AG+GG++R   G +  GF G +   +D L  E+ A+  GL    + 
Sbjct: 919  ILNVDGSCLGSPVRAGYGGIIRNDSGFYLSGFSGFIRESSDILLAELYAIYQGLTLAKDL 978

Query: 255  GERQLICYSDSTMALDCALGLVPATHRDAGLVHGIQELLAREWEVRLIHTLREGNNSADA 76
               +L+CYSDS + ++   G +   H  A L+  I+EL+++   V L HT REGN  AD 
Sbjct: 979  VIDELVCYSDSLLCINLIKGPIVKYHVYAVLIQDIKELISQS-NVTLCHTFREGNQCADF 1037

Query: 75   LAKLGAMQ*ARMVVLSVP 22
            LAKLGA   A +++ + P
Sbjct: 1038 LAKLGASSDADLIIHASP 1055


>GAU40444.1 hypothetical protein TSUD_397630 [Trifolium subterraneum]
          Length = 275

 Score =  144 bits (364), Expect = 3e-38
 Identities = 83/253 (32%), Positives = 132/253 (52%), Gaps = 1/253 (0%)
 Frame = -2

Query: 777 EEYVLHCVRDCNKPREMWRQLGLGDQAFFQYDDFPAWQLATSAGPHSSVFLAGLWTAWCM 598
           +E  LHCVRDC+  R +W  +G  D  FF + D   W+   S G  +  F AG+W AW  
Sbjct: 5   KETFLHCVRDCDFSRNIWHHIGFNDPTFFSFTDAREWRKVGSTGSQAYTFSAGVWWAWRH 64

Query: 597 RNSQYLRGEDTPMHKVIRDVKLLVAGIEMASRATNQNMRPTPHVVPWHQSCLPGDVLNVD 418
           RN   L  +   ++++  +++ +V  I  +S ++N +      ++ W+       +LNVD
Sbjct: 65  RNWMCLNNDSWSLNRLSFNIQSMV-DIFTSSFSSNSDGTAVSRLIKWNNDNFSCVILNVD 123

Query: 417 GSSMGNLGPAGFGGLLRMGDGEWRGGFYGSV-GPTDNLRVEIAALRFGLEFVWEAGERQL 241
           GS + +   AG+GG++R   G +  GF G +   +D L  E+ A+  GL    +    +L
Sbjct: 124 GSCLDSPVRAGYGGIIRNDSGFYLSGFSGFIRESSDILLAELYAIYQGLTLAKDLAIDEL 183

Query: 240 ICYSDSTMALDCALGLVPATHRDAGLVHGIQELLAREWEVRLIHTLREGNNSADALAKLG 61
           +CYSDS + ++   G +   H  A L+  I EL+ +   V L HT REGN  AD L KLG
Sbjct: 184 VCYSDSLLCINLIKGPIVKYHVYAVLIQDINELICQS-NVTLCHTFREGNQCADFLTKLG 242

Query: 60  AMQ*ARMVVLSVP 22
           A   A +++ + P
Sbjct: 243 ASSDADLIIHASP 255


>GAU47648.1 hypothetical protein TSUD_27720 [Trifolium subterraneum]
          Length = 521

 Score =  149 bits (375), Expect = 8e-38
 Identities = 84/231 (36%), Positives = 122/231 (52%)
 Frame = -2

Query: 696 FFQYDDFPAWQLATSAGPHSSVFLAGLWTAWCMRNSQYLRGEDTPMHKVIRDVKLLVAGI 517
           FF+ D+F  W       P   +F A +W  WC RN+  +  E   + +V   +++     
Sbjct: 280 FFEDDEFYVWLWNGLDSPSKLLFTAAIWWIWCTRNNLCMNNES--ISQVSLRMRIEDYAH 337

Query: 516 EMASRATNQNMRPTPHVVPWHQSCLPGDVLNVDGSSMGNLGPAGFGGLLRMGDGEWRGGF 337
            + +   NQ       +V W+    P  +LNVDGSS+GN G +GFGGL+    G W  GF
Sbjct: 338 LLRACLFNQITMSNTKLVKWNALGSPDMILNVDGSSIGNPGVSGFGGLIHNSKGAWAHGF 397

Query: 336 YGSVGPTDNLRVEIAALRFGLEFVWEAGERQLICYSDSTMALDCALGLVPATHRDAGLVH 157
            G++G ++ L  E+ AL  GL   W+   ++L CYSDS  A+      V   H  A ++ 
Sbjct: 398 VGNIGFSNILHAELMALYHGLLLAWQLNIKELWCYSDSETAIKLITEPVDEWHHYAAILL 457

Query: 156 GIQELLAREWEVRLIHTLREGNNSADALAKLGAMQ*ARMVVLSVPSAMLSL 4
            I+++LAREW V + HT REGN  AD LAKLGA     + V++ P A L+L
Sbjct: 458 NIKDILAREWRVNIAHTFREGNACADYLAKLGACNNEALSVMTNPPASLNL 508


>ABE80156.1 Ribonuclease H [Medicago truncatula]
          Length = 438

 Score =  147 bits (370), Expect = 1e-37
 Identities = 89/265 (33%), Positives = 131/265 (49%), Gaps = 1/265 (0%)
 Frame = -2

Query: 801 TGPRCGAVEEYVLHCVRDCNKPREMWRQLGLGDQAFFQYDDFPAWQLATSAGPHSSVFLA 622
           T  RCG  +E  LHCVRDC+  R +W+++G     FF       W     +     +F  
Sbjct: 160 TCARCGEEDETFLHCVRDCHFSRSIWQKIGFTGNDFFTATSAHDWFKIGMSSSLPDIFFG 219

Query: 621 GLWTAWCMRNSQYLRGEDTPMHKVIRDVKLLVAGIEMASRATNQNMRPTPHVVPWHQSCL 442
           GLW AW  RN   L  E   + ++  ++      I+ A   + +N+  +   V W+    
Sbjct: 220 GLWWAWRHRNLMCLNNETMSLFRLCNNIVSAATYIKSAF-DSEENVNHSDRFVKWNNRNH 278

Query: 441 PGDVLNVDGSSMGNLGPAGFGGLLRMGDGEWRGGFYGSV-GPTDNLRVEIAALRFGLEFV 265
              +LNVDGS +G     G+GG+LR   G +  GF G +   TD L+ E+ A+   L  V
Sbjct: 279 HDHILNVDGSCLGTPSRTGYGGILRNSAGLFISGFSGFIPNSTDILQAELTAIHQSLHMV 338

Query: 264 WEAGERQLICYSDSTMALDCALGLVPATHRDAGLVHGIQELLAREWEVRLIHTLREGNNS 85
            ++    ++CYSDS +A++  +   P  H  A L+  I++LL+    + L HTLREGN  
Sbjct: 339 IDSNMNDVMCYSDSLLAVNLIMNDTPRYHTYAVLIQNIKDLLSVR-NITLHHTLREGNQC 397

Query: 84  ADALAKLGAMQ*ARMVVLSVPSAML 10
           AD  AKLGA     +VV   P A L
Sbjct: 398 ADFFAKLGANSDVHLVVHQSPPADL 422


>GAU50246.1 hypothetical protein TSUD_188980 [Trifolium subterraneum]
          Length = 458

 Score =  145 bits (366), Expect = 7e-37
 Identities = 89/259 (34%), Positives = 132/259 (50%), Gaps = 1/259 (0%)
 Frame = -2

Query: 795 PRCGAVEEYVLHCVRDCNKPREMWRQLGLGDQAFFQYDDFPAWQLATSAGPHSSVFLAGL 616
           PRC   EE  LHCVRDC+  R +W +LG     FF   D   W    S G  +  F   +
Sbjct: 182 PRCCTHEETFLHCVRDCDLSRPIWHRLGFITPDFFSSSDAHEWLKFGSTGSQAFAFSTSV 241

Query: 615 WTAWCMRNSQYLRGEDTPMHKVIRDVKLLVAGIEMASRATNQNMRPTPHVVPWHQSCLPG 436
           W AW  +N   L+ E   ++++  +++ + A I     + +       H V W+ +  PG
Sbjct: 242 WWAWRHQNLMCLQNETWSINRLSFNIQSMSATITSCFSSRSTTTSEEIH-VKWNNNNFPG 300

Query: 435 DVLNVDGSSMGNLGPAGFGGLLRMGDGEWRGGFYGSV-GPTDNLRVEIAALRFGLEFVWE 259
            +LNVDGS +G+   AGFGG++R   G +  GF G + G +D L  E+  +   L     
Sbjct: 301 VILNVDGSCLGSPVRAGFGGVIRNESGFYLSGFSGFIQGSSDILLAELFVIYKSLTLAKN 360

Query: 258 AGERQLICYSDSTMALDCALGLVPATHRDAGLVHGIQELLAREWEVRLIHTLREGNNSAD 79
               +L+CYSDS   ++   G     H  A L+  I+EL+++   + L HTLREGNN AD
Sbjct: 361 MAIDELVCYSDSLHCINLIKGPSIKYHVYAVLIQDIKELMSQS-NITLCHTLREGNNCAD 419

Query: 78  ALAKLGAMQ*ARMVVLSVP 22
            LAKLGA   + + + + P
Sbjct: 420 FLAKLGASSDSDLTIHASP 438


>ABD28627.2 RNA-directed DNA polymerase (Reverse transcriptase); Ribonuclease H
            [Medicago truncatula]
          Length = 1296

 Score =  147 bits (370), Expect = 4e-36
 Identities = 90/249 (36%), Positives = 121/249 (48%), Gaps = 1/249 (0%)
 Frame = -2

Query: 801  TGPRCGAVEEYVLHCVRDCNKPREMWRQLGLGDQAFFQYDDFPAWQLATSAGPHSSVFLA 622
            T  RCG  +E  LHCVRDC     +W +LG    AFF       W    S+G  +  FLA
Sbjct: 1017 TCSRCGENDESFLHCVRDCKHSAAIWHKLGFVTAAFFSVSSVQDWIRNFSSGSRAITFLA 1076

Query: 621  GLWTAWCMRNSQYLRGEDTPMHKVIRDVKLLVAGIEMASRATNQNMRPTPHVVPWHQSCL 442
            GLW +W  RN   L  E  P+ ++   +   +  I  A   +N     +  +V W+Q   
Sbjct: 1077 GLWWSWRHRNLMCLSNETWPLTRISFRINDSINAIRSAFVKSNVIQPDSARMVKWNQGNH 1136

Query: 441  PGDVLNVDGSSMGNLGPAGFGGLLRMGDGEWRGGFYGSVG-PTDNLRVEIAALRFGLEFV 265
               +LNVDGS +G    AGFGG+ R   G +  G+ G +   TD L  E+ AL  GL   
Sbjct: 1137 QCHILNVDGSCLGTPIRAGFGGIFRNNVGGYLSGYSGFISESTDILLAELTALHQGLIMA 1196

Query: 264  WEAGERQLICYSDSTMALDCALGLVPATHRDAGLVHGIQELLAREWEVRLIHTLREGNNS 85
             E G  +L CYSDS + ++         H  A L+  I++LL+      + H  REGN  
Sbjct: 1197 AEMGIEELACYSDSLLTINLITRTTSKYHTYAVLIQDIKDLLSAH-NFSVYHCFREGNQC 1255

Query: 84   ADALAKLGA 58
            AD LAKLGA
Sbjct: 1256 ADYLAKLGA 1264


>GAU48831.1 hypothetical protein TSUD_190610 [Trifolium subterraneum]
          Length = 259

 Score =  137 bits (344), Expect = 2e-35
 Identities = 75/142 (52%), Positives = 92/142 (64%), Gaps = 2/142 (1%)
 Frame = -2

Query: 477 TPHVVPWHQSCLPGDVL--NVDGSSMGNLGPAGFGGLLRMGDGEWRGGFYGSVGPTDNLR 304
           TP +V WH    P +V+  NVDGSS+GN GP+GFGGLLR   G W  GF GS G T N+ 
Sbjct: 88  TPRLVSWHPP--PENVIKVNVDGSSIGNQGPSGFGGLLRKTFGGWITGFAGSCGFTSNIN 145

Query: 303 VEIAALRFGLEFVWEAGERQLICYSDSTMALDCALGLVPATHRDAGLVHGIQELLAREWE 124
            E+  +  GL+  W  G R +IC SDS  AL      VP+TH  A LV+ IQ L+ +EW+
Sbjct: 146 AELQVILHGLDIAWNHGFRNVICESDSQTALKLIQEGVPSTHPYAPLVNYIQSLIHKEWK 205

Query: 123 VRLIHTLREGNNSADALAKLGA 58
           + L+HTLREGN SAD LAKLGA
Sbjct: 206 LFLVHTLREGNASADWLAKLGA 227


>GAU49954.1 hypothetical protein TSUD_180180 [Trifolium subterraneum]
          Length = 968

 Score =  144 bits (363), Expect = 3e-35
 Identities = 85/246 (34%), Positives = 125/246 (50%), Gaps = 1/246 (0%)
 Frame = -2

Query: 792  RCGAVEEYVLHCVRDCNKPREMWRQLGLGDQAFFQYDDFPAWQLATSAGPHSSVFLAGLW 613
            RCG  +E  LHC+RDC+  R +W  +G  +  FF   D   W    + G  S +F AG+W
Sbjct: 693  RCGLQDESFLHCIRDCDFSRSLWHHIGFTNPNFFSNMDVYDWLKMGATGTQSLIFSAGVW 752

Query: 612  TAWCMRNSQYLRGEDTPMHKVIRDVKLLVAGIEMASRATNQNMRPTPHVVPWHQSCLPGD 433
             +W  RN   L  E   + ++  +++ +V   +      + N+      + W  +     
Sbjct: 753  WSWRHRNLMSLNNETWTLSRLSFNIRSMVETFKNCCTPVS-NVGSVDRFIKWKNNNFSCT 811

Query: 432  VLNVDGSSMGNLGPAGFGGLLRMGDGEWRGGFYGSV-GPTDNLRVEIAALRFGLEFVWEA 256
            +LNVDGS +G+   AGFGG++R   G +  GF G + G +D L  E+ A+  GL      
Sbjct: 812  ILNVDGSCLGSPARAGFGGIIRNTFGYYLAGFSGYIQGSSDILYAELYAIYKGLLLAKNM 871

Query: 255  GERQLICYSDSTMALDCALGLVPATHRDAGLVHGIQELLAREWEVRLIHTLREGNNSADA 76
            G  +L+CYSDS   ++   G     H  A L+  I+EL++    V L HTLREGN  AD 
Sbjct: 872  GIDELVCYSDSLHCINLIKGPQVKYHIHAVLIQDIKELISLN-NVSLCHTLREGNQCADF 930

Query: 75   LAKLGA 58
             AKLGA
Sbjct: 931  FAKLGA 936


>GAU22350.1 hypothetical protein TSUD_106780 [Trifolium subterraneum]
          Length = 1200

 Score =  144 bits (362), Expect = 4e-35
 Identities = 82/225 (36%), Positives = 116/225 (51%), Gaps = 3/225 (1%)
 Frame = -2

Query: 795  PRCGAVEEYVLHCVRDCNKPREMWRQLGLGDQAFFQYDDFPAWQLATSAGPHSSVFLAGL 616
            PRC    E  +HC+RDC+    +W+ +G  DQ FFQ  D  AW       P   +F+AG+
Sbjct: 981  PRCSNHAETTIHCLRDCDFVNRIWKSIGFLDQNFFQGVDVYAWLHNGLNSPTMMLFIAGI 1040

Query: 615  WTAWCMRNSQYLRGEDTPMHKV---IRDVKLLVAGIEMASRATNQNMRPTPHVVPWHQSC 445
            W  W  RN+  L  E          I D  LL+     ++   +     T   V W+   
Sbjct: 1041 WWIWRARNAMCLDSEMVSFWSQKLRIMDYALLLKNCYFSTHEIS-----TTKFVKWNALG 1095

Query: 444  LPGDVLNVDGSSMGNLGPAGFGGLLRMGDGEWRGGFYGSVGPTDNLRVEIAALRFGLEFV 265
              G +LNVDGSS+GN G +GFGGL+R  DG W  GF+G++G T+ L  E+ A+  GL   
Sbjct: 1096 GTGLILNVDGSSIGNPGISGFGGLIRNADGAWIHGFFGNLGVTNILHPELMAIYKGLLLA 1155

Query: 264  WEAGERQLICYSDSTMALDCALGLVPATHRDAGLVHGIQELLARE 130
            WE   ++L CYSDS MA+          H  A +++ I+++L RE
Sbjct: 1156 WELNIKELWCYSDSKMAIKLITDPTDVWHHYAAILNNIKDILDRE 1200


>KYP61721.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 219

 Score =  134 bits (338), Expect = 5e-35
 Identities = 72/204 (35%), Positives = 108/204 (52%)
 Frame = -2

Query: 618 LWTAWCMRNSQYLRGEDTPMHKVIRDVKLLVAGIEMASRATNQNMRPTPHVVPWHQSCLP 439
           +W  WC RN       D  +  ++     L+     A  + + + RP P +V W    + 
Sbjct: 1   MWFIWCHRNRHIFDQVDWNLTSILAQANALLQFSVSAFTSIDCSHRPLPRLVHWIHPLVD 60

Query: 438 GDVLNVDGSSMGNLGPAGFGGLLRMGDGEWRGGFYGSVGPTDNLRVEIAALRFGLEFVWE 259
              LNVDGS +G  G  G+GGL +  +G++  GFYG +G    L+ EI AL  GL   W+
Sbjct: 61  SVALNVDGSRIGTPGRGGYGGLCQNHEGQFLFGFYGFLGEASVLQTEILALLHGLHLCWD 120

Query: 258 AGERQLICYSDSTMALDCALGLVPATHRDAGLVHGIQELLAREWEVRLIHTLREGNNSAD 79
            G R+++CYSDST+ +    G +   HR    +  I +LL  +W   ++HTL EGN+ AD
Sbjct: 121 KGFRKIVCYSDSTLVVSLLQGPILMFHRYGNQLMEIHQLLNCDWTCTVVHTLCEGNSCAD 180

Query: 78  ALAKLGAMQ*ARMVVLSVPSAMLS 7
           ALA++GA+   R+V+L      LS
Sbjct: 181 ALARMGALGNDRVVILQEHPMTLS 204


>GAU34195.1 hypothetical protein TSUD_162960 [Trifolium subterraneum]
          Length = 168

 Score =  132 bits (333), Expect = 6e-35
 Identities = 69/155 (44%), Positives = 93/155 (60%)
 Frame = -2

Query: 468 VVPWHQSCLPGDVLNVDGSSMGNLGPAGFGGLLRMGDGEWRGGFYGSVGPTDNLRVEIAA 289
           +V W+     G +LNVDGSS+GN G +GFGGL+R  DG W  GF G++G ++ L  E+ A
Sbjct: 1   MVRWNAHGGIGMILNVDGSSIGNPGISGFGGLIRNSDGAWVHGFAGNIGHSNILHAELLA 60

Query: 288 LRFGLEFVWEAGERQLICYSDSTMALDCALGLVPATHRDAGLVHGIQELLAREWEVRLIH 109
           +  GL   WE   + L CYSDS  AL      V   H  A +++ I++ L+R W VRL+H
Sbjct: 61  IYHGLVLAWELDIKDLCCYSDSKTALKLIYDHVNEWHHYAAIIYNIKDFLSRNWRVRLVH 120

Query: 108 TLREGNNSADALAKLGAMQ*ARMVVLSVPSAMLSL 4
           TLREGNN AD LAK GA        ++VP   ++L
Sbjct: 121 TLREGNNCADFLAKFGARNPEAYSSIAVPPDEMNL 155


>KYP64035.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 190

 Score =  132 bits (332), Expect = 2e-34
 Identities = 70/189 (37%), Positives = 100/189 (52%)
 Frame = -2

Query: 618 LWTAWCMRNSQYLRGEDTPMHKVIRDVKLLVAGIEMASRATNQNMRPTPHVVPWHQSCLP 439
           +W  WC RN       D  +  ++  V  L+     A  + + + RP P +V W    L 
Sbjct: 1   MWFIWCHRNRLIFDQVDWNLTSILAQVNALLQISVSAFTSIDCSHRPLPRLVHWIHPPLD 60

Query: 438 GDVLNVDGSSMGNLGPAGFGGLLRMGDGEWRGGFYGSVGPTDNLRVEIAALRFGLEFVWE 259
              LNVDGS +G LG  GFGGL R  +G++  GFYG +G    L+  I AL +GL   W+
Sbjct: 61  SVALNVDGSRIGTLGRGGFGGLCRNHEGQFLFGFYGFLGEVSVLQTVILALLYGLRLCWD 120

Query: 258 AGERQLICYSDSTMALDCALGLVPATHRDAGLVHGIQELLAREWEVRLIHTLREGNNSAD 79
              R++ICYSDST+ +    G +P  HR    +  I +LL  +W   ++HTLREGN+ AD
Sbjct: 121 KWFRKIICYSDSTLVVSLLQGPIPMFHRYENQLMEIHQLLNCDWTCTVVHTLREGNSCAD 180

Query: 78  ALAKLGAMQ 52
           A    G+ +
Sbjct: 181 AFGSNGSFR 189


>GAU48830.1 hypothetical protein TSUD_190600 [Trifolium subterraneum]
          Length = 298

 Score =  135 bits (340), Expect = 2e-34
 Identities = 76/215 (35%), Positives = 112/215 (52%), Gaps = 2/215 (0%)
 Frame = -2

Query: 642 HSSVFLAGLWTAWCMRNSQYLRGEDTPMHKVIRDVKLLVAGIEMASRATNQNMRPT--PH 469
           H  +F   LW  WC+RN      +    H ++  +  L+   E      + +M  T  P 
Sbjct: 70  HGPLFFIVLWVIWCVRNEFVFNNQRESTHIIMGKIYSLLHSCEAVFTPPHSSMATTAKPR 129

Query: 468 VVPWHQSCLPGDVLNVDGSSMGNLGPAGFGGLLRMGDGEWRGGFYGSVGPTDNLRVEIAA 289
           +V W +       LNVDGS +     AG+GGL+R  +G +  GFYG+      L  E+ A
Sbjct: 130 LVTWTKPAEGTVCLNVDGSLLKATNTAGYGGLIRDSNGVFLSGFYGTATVQSILFAELMA 189

Query: 288 LRFGLEFVWEAGERQLICYSDSTMALDCALGLVPATHRDAGLVHGIQELLAREWEVRLIH 109
           +  GL+  WE+G R++ C+SDS   ++     V A HR +  V  I +LLA++WEV + H
Sbjct: 190 VLHGLQICWESGFRRITCFSDSLQIVNLIRDGVSAHHRFSNEVFIIHQLLAKDWEVVIGH 249

Query: 108 TLREGNNSADALAKLGAMQ*ARMVVLSVPSAMLSL 4
           T REGN  AD LAK+GA   + +V +S P   LS+
Sbjct: 250 TFREGNACADVLAKMGAASDSTLVTISTPPCDLSM 284

