BLASTX nr result

ID: Glycyrrhiza30_contig00018859 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza30_contig00018859
         (854 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

GAU50297.1 hypothetical protein TSUD_288310 [Trifolium subterran...   166   7e-44
GAU23316.1 hypothetical protein TSUD_237700 [Trifolium subterran...   154   3e-40
GAU11845.1 hypothetical protein TSUD_75960 [Trifolium subterraneum]   152   4e-40
ABE80156.1 Ribonuclease H [Medicago truncatula]                       152   1e-39
GAU17063.1 hypothetical protein TSUD_105620 [Trifolium subterran...   151   3e-39
GAU18899.1 hypothetical protein TSUD_228890 [Trifolium subterran...   147   2e-36
GAU22350.1 hypothetical protein TSUD_106780 [Trifolium subterran...   146   8e-36
GAU20604.1 hypothetical protein TSUD_33400 [Trifolium subterraneum]   142   2e-34
GAU48622.1 hypothetical protein TSUD_133530 [Trifolium subterran...   137   2e-34
GAU47648.1 hypothetical protein TSUD_27720 [Trifolium subterraneum]   138   7e-34
GAU49954.1 hypothetical protein TSUD_180180 [Trifolium subterran...   140   7e-34
GAU35042.1 hypothetical protein TSUD_30080 [Trifolium subterraneum]   140   7e-34
ABO80459.1 RNA-directed DNA polymerase (Reverse transcriptase); ...   138   3e-33
GAU10400.1 hypothetical protein TSUD_423410, partial [Trifolium ...   128   7e-32
GAU33259.1 hypothetical protein TSUD_333820 [Trifolium subterran...   117   8e-28
XP_016178564.1 PREDICTED: uncharacterized protein LOC107621028 [...   114   4e-25
GAU41733.1 hypothetical protein TSUD_349940 [Trifolium subterran...   110   5e-25
KYP32426.1 Putative ribonuclease H protein At1g65750 family [Caj...   112   2e-24
GAU43217.1 hypothetical protein TSUD_301040 [Trifolium subterran...   112   3e-24
GAU20193.1 hypothetical protein TSUD_352570 [Trifolium subterran...   104   5e-24

>GAU50297.1 hypothetical protein TSUD_288310 [Trifolium subterraneum]
          Length = 545

 Score =  166 bits (419), Expect = 7e-44
 Identities = 99/316 (31%), Positives = 149/316 (47%), Gaps = 33/316 (10%)
 Frame = +2

Query: 2    DSLTWD-SLDGNYSVKEGFRWLTKQHAISLPAAN---WNWIWRLQVPKKVKFLLWSE*HG 169
            D  TW  +L+G Y+ KEG+ WL +  + S  A +   WN +W +  P+K+KF +WS  H 
Sbjct: 193  DRYTWKGNLNGLYTAKEGYYWLNR-FSFSDTATDDISWNSVWHIPAPEKIKFFIWSALHN 251

Query: 170  SNPVMKNKHKRGISQSGACSRCGYEEESFLHAVHDYPMVILFWQALGFTDHYRFFSELAV 349
            + P       RG+ Q+  C RC  EEES LH + +   +  FW+A+GF     F  +   
Sbjct: 252  ALPTKSMLSHRGLLQANLCPRCNIEEESTLHCLRNCEFIKRFWKAIGFLGQTFFQGDNLN 311

Query: 350  GWSL*FQASSLHSKSLVLMED*KWNVHCPGVSQCLQLGPDRWVTWH*SKEEIT------- 508
             W      +S+   S  L     W + C     C+      + T   + E +        
Sbjct: 312  DW----LRNSIDGPSSFLFMAAVWWIWCARNQLCMDNEAISYFTLRTNTENLAQLLRMCF 367

Query: 509  ----------------------VLNVDVARWASPGRVGFGGVLRDGEGNWIQGCYGNIGK 622
                                  +LNVD +   +PG  GFGG++R+ +G W+ G  GNIG 
Sbjct: 368  IKQNISSTATMVRWNAHGGIGMILNVDGSSIGNPGISGFGGLIRNSDGAWVHGFAGNIGH 427

Query: 623  FDCLKAELAGLLHGLDMAWSKGASHLVCYSDSTIALELVVVLSPPFHSYTSLIREVQEHL 802
             + L+AEL  + HGL +AW      L CYSDS  AL+L+      +H Y ++I  +++ L
Sbjct: 428  LNILQAELLAIYHGLVLAWELDIKDLCCYSDSKTALKLIYDHVNEWHQYAAIIYNIKDFL 487

Query: 803  HRPWHVRLLNTW*EGN 850
             R W VRL++   EGN
Sbjct: 488  SRNWRVRLVHMLREGN 503


>GAU23316.1 hypothetical protein TSUD_237700 [Trifolium subterraneum]
          Length = 418

 Score =  154 bits (388), Expect = 3e-40
 Identities = 101/292 (34%), Positives = 153/292 (52%), Gaps = 9/292 (3%)
 Frame = +2

Query: 2   DSLTWDS-LDGNYSVKEGFRWL-TKQHAISL--PAANWNWIWRLQVPKKVKFLLWSE*HG 169
           ++  W S  +G+Y+ K GF WL + Q+ ++   P+ +W+WIW+LQ+P+K+KF  W   H 
Sbjct: 90  NAFIWTSNKNGSYTTKSGFNWLFSLQNPVTPHNPSFSWSWIWKLQLPEKIKFFFWLVCHN 149

Query: 170 SNPVMKNKHKRGISQSGACSRCGYEEESFLHAVHDYPMVILFWQALGFTDHYRFFSEL-A 346
           S P +     R ++ S  C+RCG  EE+FLH V D    I  W  +GF D+  FFS + A
Sbjct: 150 SVPTLSLLDHRKMNLSATCARCGLREETFLHCVRDCDFSISIWHHIGF-DNPDFFSSMDA 208

Query: 347 VGWSL*FQASSLHSKSLVLMED*KWNVHCPGVSQCLQ---LGPDRWVTWH*SKEEITVLN 517
             W    +  S  SK+ +      W+     +  CL       DR V W+ +     +LN
Sbjct: 209 HDW---LKWGSTGSKAFIFSAGVWWSWRNHNL-MCLNNETWTVDRVVKWNNNNFSGVILN 264

Query: 518 VDVARWASPGRVGFGGVLRDGEGNWIQGCYGNI-GKFDCLKAELAGLLHGLDMAWSKGAS 694
           VD +   SP R GFGG+ R+  G ++ G  G I G  D + AE   + HGL +A     +
Sbjct: 265 VDESCLGSPIRSGFGGIFRNDSGFYLSGFSGFIQGSSDIMLAEPYAIYHGLSLAEDMEIN 324

Query: 695 HLVCYSDSTIALELVVVLSPPFHSYTSLIREVQEHLHRPWHVRLLNTW*EGN 850
             VCYSDS   + L+  L+  +H +  LI++++E L    +V L +T  EGN
Sbjct: 325 EFVCYSDSLHRINLITGLTLKYHVHAVLIQDIKEFLSNR-NVSLCHTLGEGN 375


>GAU11845.1 hypothetical protein TSUD_75960 [Trifolium subterraneum]
          Length = 386

 Score =  152 bits (385), Expect = 4e-40
 Identities = 93/303 (30%), Positives = 143/303 (47%), Gaps = 32/303 (10%)
 Frame = +2

Query: 2   DSLTWD-SLDGNYSVKEGFRWLTKQHAISLPAA--NWNWIWRLQVPKKVKFLLWSE*HGS 172
           D  TW  +L G Y+ ++G+ WL +    + P +  +W+W+W L  P+K+KF +W+  H S
Sbjct: 88  DCYTWKGNLQGIYTARDGYHWLNRYAFSANPTSVVSWSWLWHLPAPEKIKFFIWTLLHNS 147

Query: 173 NPVMKNKHKRGISQSGACSRCGYEEESFLHAVHDYPMVILFWQALGFTDHYRFFSELAVG 352
            P       RGI     C RC    E+ LH + D   V   W++LGFTDH  FF E+   
Sbjct: 148 LPTRDMLTHRGIIHGNMCPRCNIHVETDLHCLRDCDFVYTIWKSLGFTDH-NFFQEVD-- 204

Query: 353 WSL*FQASSLHSKSLVLMED*KWNVHCPGVSQCL--QLGPD------------------- 469
            S  +  + L   S+ L     W +     + CL  +L P                    
Sbjct: 205 -SSSWLRNGLSCSSMFLFMAAIWWIWRTRNALCLDNELVPQFSLKMRIVDYALLLKNCHF 263

Query: 470 --------RWVTWH*SKEEITVLNVDVARWASPGRVGFGGVLRDGEGNWIQGCYGNIGKF 625
                   + V W+       +LNVD +   +PG  GFGG++ +  G WI G +GN+G  
Sbjct: 264 NYQVTTLPKIVRWNALGGTSMILNVDRSSIGNPGISGFGGLICNAYGAWIHGFFGNLGVT 323

Query: 626 DCLKAELAGLLHGLDMAWSKGASHLVCYSDSTIALELVVVLSPPFHSYTSLIREVQEHLH 805
           + L AEL  +L GL +AW      L CYSDS  A++L+      +H Y +++  +++ L+
Sbjct: 324 NILHAELMAILKGLLLAWELNIKDLSCYSDSATAIKLITEPVDVWHHYAAILNNIKDILN 383

Query: 806 RPW 814
           R W
Sbjct: 384 RDW 386


>ABE80156.1 Ribonuclease H [Medicago truncatula]
          Length = 438

 Score =  152 bits (385), Expect = 1e-39
 Identities = 99/314 (31%), Positives = 154/314 (49%), Gaps = 31/314 (9%)
 Frame = +2

Query: 2    DSLTWD-SLDGNYSVKEGFRWLTKQHAIS---LPAANWNWIWRLQVPKKVKFLLWSE*HG 169
            D+L W  + +G YS K GF WL      +   +P  +W+WIW LQVP+K KFL+W     
Sbjct: 83   DALIWSQNKNGTYSTKSGFHWLLTFRVPATDIIPHPSWSWIWNLQVPEKYKFLIWLACQN 142

Query: 170  SNPVMKNKHKRGISQSGACSRCGYEEESFLHAVHDYPMVILFWQALGFTDHYRFFSELAV 349
              P +   H+R I+ S  C+RCG E+E+FLH V D       WQ +GFT +  F +  A 
Sbjct: 143  VVPTLSLLHRRNIAPSPTCARCGEEDETFLHCVRDCHFSRSIWQKIGFTGNDFFTATSAH 202

Query: 350  GWSL*FQASSL------------HSKSLVLMED*KWNVH--CPGVSQCLQL--------- 460
             W     +SSL              ++L+ + +   ++   C  +               
Sbjct: 203  DWFKIGMSSSLPDIFFGGLWWAWRHRNLMCLNNETMSLFRLCNNIVSAATYIKSAFDSEE 262

Query: 461  ---GPDRWVTWH*SKEEITVLNVDVARWASPGRVGFGGVLRDGEGNWIQGCYGNI-GKFD 628
                 DR+V W+       +LNVD +   +P R G+GG+LR+  G +I G  G I    D
Sbjct: 263  NVNHSDRFVKWNNRNHHDHILNVDGSCLGTPSRTGYGGILRNSAGLFISGFSGFIPNSTD 322

Query: 629  CLKAELAGLLHGLDMAWSKGASHLVCYSDSTIALELVVVLSPPFHSYTSLIREVQEHLHR 808
             L+AEL  +   L M      + ++CYSDS +A+ L++  +P +H+Y  LI+ +++ L  
Sbjct: 323  ILQAELTAIHQSLHMVIDSNMNDVMCYSDSLLAVNLIMNDTPRYHTYAVLIQNIKD-LLS 381

Query: 809  PWHVRLLNTW*EGN 850
              ++ L +T  EGN
Sbjct: 382  VRNITLHHTLREGN 395


>GAU17063.1 hypothetical protein TSUD_105620 [Trifolium subterraneum]
          Length = 440

 Score =  151 bits (382), Expect = 3e-39
 Identities = 90/315 (28%), Positives = 148/315 (46%), Gaps = 32/315 (10%)
 Frame = +2

Query: 2    DSLTWD-SLDGNYSVKEGFRWLTKQHAI--SLPAANWNWIWRLQVPKKVKFLLWSE*HGS 172
            D  TW  +L+G Y+ ++G+ WL +  +   S+   +W+W+W ++ P+K+KF LW+  H +
Sbjct: 88   DRYTWKGNLNGIYTARDGYHWLNRIESTNNSIEDISWSWLWHIEAPEKIKFFLWTALHNA 147

Query: 173  NPVMKNKHKRGISQSGACSRCGYEEESFLHAVHDYPMVILFWQALGFTDHYRFFSELAVG 352
             P       R +     C R    EE+ +H + D   V   W+ +GFTD   F  +    
Sbjct: 148  LPTRAMLSHRRLLSVHVCPRSDIAEETIMHCLRDCEFVKHLWKTIGFTDQTFFHGDNLYA 207

Query: 353  WSL*FQASSLHSKSLVLMED*KW------NVHCPG--------VSQCLQ----------- 457
            W          S S+ +     W      N  C          +S+C++           
Sbjct: 208  WL----RKGCDSPSMFMFLAALWWIWRARNKLCLANELVSPFTISRCIEDYALLVKKCYS 263

Query: 458  ----LGPDRWVTWH*SKEEITVLNVDVARWASPGRVGFGGVLRDGEGNWIQGCYGNIGKF 625
                   +R V W+       +LNVD +   +P   GFGG++R+  G WI+G  GNIG  
Sbjct: 264  QQKSTLANRLVRWNAHDGTDMILNVDGSSIGNPEIYGFGGLIRNSHGAWIRGFAGNIGFS 323

Query: 626  DCLKAELAGLLHGLDMAWSKGASHLVCYSDSTIALELVVVLSPPFHSYTSLIREVQEHLH 805
            + L AEL  + HGL +AW      L+CYSDS  A++L+      +H + ++++ +++ L 
Sbjct: 324  NILHAELLAVYHGLVLAWDMDIKDLICYSDSKTAIKLIGDPINEWHHFAAILQNIKDILA 383

Query: 806  RPWHVRLLNTW*EGN 850
            R W V + +T  EGN
Sbjct: 384  RDWRVTVAHTLREGN 398


>GAU18899.1 hypothetical protein TSUD_228890 [Trifolium subterraneum]
          Length = 1098

 Score =  147 bits (372), Expect = 2e-36
 Identities = 94/314 (29%), Positives = 151/314 (48%), Gaps = 31/314 (9%)
 Frame = +2

Query: 2    DSLTWD-SLDGNYSVKEGFRWL-TKQHAISLPAANWNWIWRLQVPKKVKFLLWSE*HGSN 175
            D   W+ + +G Y+ K G+ WL ++Q  +  P  +W+WIW++  P+K+K L W   H + 
Sbjct: 746  DVFVWNHNKNGVYTAKSGYDWLLSRQEQVIGPFHSWSWIWKITGPEKLKILFWLACHEAV 805

Query: 176  PVMKNKHKRGISQSGACSRCGYEEESFLHAVHDYPMVILFWQALGFTDHYRFFSELAVGW 355
            P +   H R I+ S  C RC    E+FLH V D       W  LGFT+   F S +A  W
Sbjct: 806  PTLAMLHHRNIASSPICPRCSNHNETFLHCVRDCIHSKTVWDQLGFTNSSFFDSTMAHEW 865

Query: 356  SL*FQASSLHSKSLVLMED*KW-----NVHCPG-------------------VSQCLQLG 463
                + S    + L+ +    W     N  C G                   +  C    
Sbjct: 866  ---LKHSYFSPRRLLFLAGVWWIWRHRNNMCLGDETWSAVRLSCNIYSMVEDLKNCFPSS 922

Query: 464  ----PDRWVTWH*SKEEITVLNVDVARWASPGRVGFGGVLRDGEGNWIQGCYGNI-GKFD 628
                  R + W+ +     V+N D +   +P R GFG ++R+ +G +I G  G+I    D
Sbjct: 923  TTEETSRCIKWNFTNFTGVVINTDGSCSGTPARTGFGCIIRNNDGRYITGASGHITNSSD 982

Query: 629  CLKAELAGLLHGLDMAWSKGASHLVCYSDSTIALELVVVLSPPFHSYTSLIREVQEHLHR 808
             L AEL+G+ HGL +A S G +  +CY+DS I+  L+  +S P+H Y  LI+ +++ + +
Sbjct: 983  ILLAELSGIYHGLQLAISLGITDFICYTDSLISCNLIQGVSSPYHIYGVLIQNIKDSMQQ 1042

Query: 809  PWHVRLLNTW*EGN 850
              ++ + +T  EGN
Sbjct: 1043 S-NIIICHTLREGN 1055


>GAU22350.1 hypothetical protein TSUD_106780 [Trifolium subterraneum]
          Length = 1200

 Score =  146 bits (368), Expect = 8e-36
 Identities = 83/301 (27%), Positives = 141/301 (46%), Gaps = 32/301 (10%)
 Frame = +2

Query: 2    DSLTWD-SLDGNYSVKEGFRWLTKQH--AISLPAANWNWIWRLQVPKKVKFLLWSE*HGS 172
            D  TW  +L+G Y+ ++G+ WL +    A ++  A+W+W+W +  P+K+KF  W+  H S
Sbjct: 903  DCYTWKGNLNGIYTARDGYAWLNRHSFSATTISVASWSWLWHVSAPEKLKFFFWTMLHNS 962

Query: 173  NPVMKNKHKRGISQSGACSRCGYEEESFLHAVHDYPMVILFWQALGFTDHYRFFSELAVG 352
             P       RGI     C RC    E+ +H + D   V   W+++GF D   F       
Sbjct: 963  LPTRDMLAHRGIITRNLCPRCSNHAETTIHCLRDCDFVNRIWKSIGFLDQNFFQGVDVYA 1022

Query: 353  WSL*FQASSLHSKSLVLMED*KWNVHCPGVSQCL-------------------------- 454
            W      + L+S +++L     W +     + CL                          
Sbjct: 1023 WL----HNGLNSPTMMLFIAGIWWIWRARNAMCLDSEMVSFWSQKLRIMDYALLLKNCYF 1078

Query: 455  ---QLGPDRWVTWH*SKEEITVLNVDVARWASPGRVGFGGVLRDGEGNWIQGCYGNIGKF 625
               ++   ++V W+       +LNVD +   +PG  GFGG++R+ +G WI G +GN+G  
Sbjct: 1079 STHEISTTKFVKWNALGGTGLILNVDGSSIGNPGISGFGGLIRNADGAWIHGFFGNLGVT 1138

Query: 626  DCLKAELAGLLHGLDMAWSKGASHLVCYSDSTIALELVVVLSPPFHSYTSLIREVQEHLH 805
            + L  EL  +  GL +AW      L CYSDS +A++L+   +  +H Y +++  +++ L 
Sbjct: 1139 NILHPELMAIYKGLLLAWELNIKELWCYSDSKMAIKLITDPTDVWHHYAAILNNIKDILD 1198

Query: 806  R 808
            R
Sbjct: 1199 R 1199


>GAU20604.1 hypothetical protein TSUD_33400 [Trifolium subterraneum]
          Length = 1174

 Score =  142 bits (358), Expect = 2e-34
 Identities = 93/312 (29%), Positives = 154/312 (49%), Gaps = 29/312 (9%)
 Frame = +2

Query: 2    DSLTWD-SLDGNYSVKEGFRWLTKQ-HAISLPAANWNWIWRLQVPKKVKFLLWSE*HGSN 175
            D   W  + +G Y  K G+ WL  Q   +++P  +W+WIW++  P+K+KFL W   H + 
Sbjct: 821  DVFVWTHNKNGVYITKSGYDWLLAQTDQVTIPLNSWSWIWQIAGPEKLKFLFWLSCHDAV 880

Query: 176  PVMKNKHKRGISQSGACSRCGYEEESFLHAVHDYPMVILFWQALGFTDHYRFFSELAVGW 355
            P +   H R I+    C+RCG + E+FLH V D       W  LGFT    F       W
Sbjct: 881  PTLSMLHHRNIASCPICTRCGQQIETFLHCVRDCIFSRPVWIRLGFTSRIFFDITSVHDW 940

Query: 356  --SL*FQASSL----------HSKSLVLMED*KW-------NVHCPG-------VSQCLQ 457
              S  F+                ++L+ + +  W       N++  G        S   +
Sbjct: 941  LKSAYFRPHRFVFMAGVWWLWRHRNLMCLSNETWSFVRISCNIYSMGDTLKTCFPSSSPE 1000

Query: 458  LGPDRWVTWH*SKEEITVLNVDVARWASPGRVGFGGVLRDGEGNWIQGCYGN-IGKFDCL 634
               +R V W+ +     +LN+D +   +P R GFG ++R+  G++I G  G+ IG  D L
Sbjct: 1001 ETSNRTVKWNSTDFTGFILNMDGSCSGTPIRCGFGCIIRNNVGSYIAGASGHIIGSSDIL 1060

Query: 635  KAELAGLLHGLDMAWSKGASHLVCYSDSTIALELVVVLSPPFHSYTSLIREVQEHLHRPW 814
             AEL+G+ HGL +A S G + L+CY+DS ++  L+      +H Y  LI+ +++++ +  
Sbjct: 1061 LAELSGIFHGLKLASSLGITDLICYTDSLLSCNLIQGPYSHYHIYGVLIQNIKDYMQQS- 1119

Query: 815  HVRLLNTW*EGN 850
            ++ + +T  EGN
Sbjct: 1120 NINICHTLREGN 1131


>GAU48622.1 hypothetical protein TSUD_133530 [Trifolium subterraneum]
          Length = 350

 Score =  137 bits (344), Expect = 2e-34
 Identities = 84/290 (28%), Positives = 130/290 (44%), Gaps = 7/290 (2%)
 Frame = +2

Query: 2   DSLTWD-SLDGNYSVKEGFRWLTKQHAISLPA--ANWNWIWRLQVPKKVKFLLWSE*HGS 172
           D  TW  +L+G Y+ ++G+ WL +      P    +W+W+W +  P+K+KF LW+  H +
Sbjct: 61  DCFTWKGNLNGLYTARDGYHWLNRNDFSENPTNVVSWSWLWHIPAPEKIKFFLWTALHKA 120

Query: 173 NPVMKNKHKRGISQSGACSRCGYEEESFLHAVHDYPMVILFWQALGFTDHYRFFSELAVG 352
            P       RGI    +C RC    E+ +H + D       W+++GFT            
Sbjct: 121 LPTKAMLSHRGILHDSSCPRCNNNVETTIHCLRDCDFAKNIWKSIGFTKS---------- 170

Query: 353 WSL*FQASSLHSKSLVLMED*KW---NVHCPGVSQCLQLGPD-RWVTWH*SKEEITVLNV 520
                          +  ED +W   ++  P  S  L   P+ + V W+       +LNV
Sbjct: 171 ---------------IFFEDAEWFGQSLQAPVYSSDLITTPNTKLVKWNALGSSGMILNV 215

Query: 521 DVARWASPGRVGFGGVLRDGEGNWIQGCYGNIGKFDCLKAELAGLLHGLDMAWSKGASHL 700
           D +   +PG   +GG++R+ EG W  G  GNIG  + L  EL  L  GL +AW      L
Sbjct: 216 DGSSIGNPGVSRYGGLIRNSEGAWAHGFAGNIGFSNILHPELMALYRGLLLAWQLNIKEL 275

Query: 701 VCYSDSTIALELVVVLSPPFHSYTSLIREVQEHLHRPWHVRLLNTW*EGN 850
            CYSDS  A++L                 +++ L R W V + +T+ EGN
Sbjct: 276 WCYSDSEAAIKL-----------------IKDILAREWRVNIAHTFREGN 308


>GAU47648.1 hypothetical protein TSUD_27720 [Trifolium subterraneum]
          Length = 521

 Score =  138 bits (348), Expect = 7e-34
 Identities = 83/294 (28%), Positives = 137/294 (46%), Gaps = 11/294 (3%)
 Frame = +2

Query: 2    DSLTWD-SLDGNYSVKEGFRWLTKQHAISLPA--ANWNWIWRLQVPKKVKFLLWSE*HGS 172
            D   W  +L+G Y+  +G+ WL +      P    +W+W+W +  P+K+KF LW+  H +
Sbjct: 195  DYFIWKGNLNGLYTAHDGYHWLNRNDFSENPTNVVSWSWLWHIPAPEKIKFFLWTALHKA 254

Query: 173  NPVMKNKHKRGISQSGACSRCG----YEEESFLHAVHDYPMVILFWQALGFTDHYRFFSE 340
             P       RGI    +C RC     +E++ F          +  W  L       F + 
Sbjct: 255  LPTKAMLSHRGILHDSSCPRCNKSIFFEDDEFY---------VWLWNGLDSPSKLLFTAA 305

Query: 341  LAVGW----SL*FQASSLHSKSLVLMED*KWNVHCPGVSQCLQLGPDRWVTWH*SKEEIT 508
            +   W    +L     S+   SL +  +   ++    +   + +   + V W+       
Sbjct: 306  IWWIWCTRNNLCMNNESISQVSLRMRIEDYAHLLRACLFNQITMSNTKLVKWNALGSPDM 365

Query: 509  VLNVDVARWASPGRVGFGGVLRDGEGNWIQGCYGNIGKFDCLKAELAGLLHGLDMAWSKG 688
            +LNVD +   +PG  GFGG++ + +G W  G  GNIG  + L AEL  L HGL +AW   
Sbjct: 366  ILNVDGSSIGNPGVSGFGGLIHNSKGAWAHGFVGNIGFSNILHAELMALYHGLLLAWQLN 425

Query: 689  ASHLVCYSDSTIALELVVVLSPPFHSYTSLIREVQEHLHRPWHVRLLNTW*EGN 850
               L CYSDS  A++L+      +H Y +++  +++ L R W V + +T+ EGN
Sbjct: 426  IKELWCYSDSETAIKLITEPVDEWHHYAAILLNIKDILAREWRVNIAHTFREGN 479


>GAU49954.1 hypothetical protein TSUD_180180 [Trifolium subterraneum]
          Length = 968

 Score =  140 bits (353), Expect = 7e-34
 Identities = 94/284 (33%), Positives = 141/284 (49%), Gaps = 30/284 (10%)
 Frame = +2

Query: 89   PAANWNWIWRLQVPKKVKFLLWSE*HGSNPVMKNKHKRGISQSGACSRCGYEEESFLHAV 268
            P+ +W WIW+L +P+K+KF LW   H S P +   + R ++ S  C RCG ++ESFLH +
Sbjct: 646  PSQSWTWIWKLHLPEKIKFFLWLACHNSVPTLSLLNHRKMNPSTTCVRCGLQDESFLHCI 705

Query: 269  HDYPMVILFWQALGFTDHYRFFSELAV-GW---------SL*FQAS---SLHSKSLVLME 409
             D       W  +GFT+   FFS + V  W         SL F A    S   ++L+ + 
Sbjct: 706  RDCDFSRSLWHHIGFTNP-NFFSNMDVYDWLKMGATGTQSLIFSAGVWWSWRHRNLMSLN 764

Query: 410  D*KWNVH----------------CPGVSQCLQLGPDRWVTWH*SKEEITVLNVDVARWAS 541
            +  W +                 C  VS    +  DR++ W  +    T+LNVD +   S
Sbjct: 765  NETWTLSRLSFNIRSMVETFKNCCTPVSNVGSV--DRFIKWKNNNFSCTILNVDGSCLGS 822

Query: 542  PGRVGFGGVLRDGEGNWIQGCYGNI-GKFDCLKAELAGLLHGLDMAWSKGASHLVCYSDS 718
            P R GFGG++R+  G ++ G  G I G  D L AEL  +  GL +A + G   LVCYSDS
Sbjct: 823  PARAGFGGIIRNTFGYYLAGFSGYIQGSSDILYAELYAIYKGLLLAKNMGIDELVCYSDS 882

Query: 719  TIALELVVVLSPPFHSYTSLIREVQEHLHRPWHVRLLNTW*EGN 850
               + L+      +H +  LI++++E +    +V L +T  EGN
Sbjct: 883  LHCINLIKGPQVKYHIHAVLIQDIKELISLN-NVSLCHTLREGN 925


>GAU35042.1 hypothetical protein TSUD_30080 [Trifolium subterraneum]
          Length = 724

 Score =  140 bits (352), Expect = 7e-34
 Identities = 92/315 (29%), Positives = 145/315 (46%), Gaps = 32/315 (10%)
 Frame = +2

Query: 2    DSLTWD-SLDGNYSVKEGFRWLTKQHAISLPAA--NWNWIWRLQVPKKVKFLLWSE*HGS 172
            D  TW  +L G Y+  +G+ WL +    + P +  +W+W+W L  P+K+KF +W+  H S
Sbjct: 391  DCYTWKGNLQGIYTAWDGYHWLNRNAFSANPTSVVSWSWLWHLPAPEKIKFFIWTLLHNS 450

Query: 173  NPVMKNKHKRGISQSGACSRCGYEEESFLHAVHDYPMVILFWQALGFTDHYRFFSELAVG 352
                     RGI                   +HD   V   W++LGFTD   FF E+   
Sbjct: 451  LATRDMLTHRGI-------------------IHDCNFVYTIWKSLGFTDR-NFFQEVD-- 488

Query: 353  WSL*FQASSLHSKSLVLMED*KWNVHCPGVSQCL--QLGPD------------------- 469
             S  +  + L   S+ L     W +     + CL  +L P                    
Sbjct: 489  -SSSWLRNGLSCSSMFLFMAAIWWIWRTRNALCLDNELIPQFSLKMRIVDYALLLKNCHF 547

Query: 470  --------RWVTWH*SKEEITVLNVDVARWASPGRVGFGGVLRDGEGNWIQGCYGNIGKF 625
                    + V W+       +LNVD +   +PG  GFGG++R+ +G WI G +GN+G  
Sbjct: 548  NHQVTTLPKIVRWNALGGTSMILNVDGSTIGNPGISGFGGLIRNADGAWIHGFFGNLGVT 607

Query: 626  DCLKAELAGLLHGLDMAWSKGASHLVCYSDSTIALELVVVLSPPFHSYTSLIREVQEHLH 805
            + L AEL  +L GL +AW      L+CYSDS  A++L+      +H Y +++  +++ L+
Sbjct: 608  NILHAELMAILKGLLLAWELNIKDLLCYSDSATAIKLITEPVDVWHHYAAILNNIKDILN 667

Query: 806  RPWHVRLLNTW*EGN 850
            R W V + +T+ EGN
Sbjct: 668  RDWQVSIFHTFREGN 682


>ABO80459.1 RNA-directed DNA polymerase (Reverse transcriptase); Ribonuclease H
            [Medicago truncatula]
          Length = 869

 Score =  138 bits (348), Expect = 3e-33
 Identities = 83/289 (28%), Positives = 135/289 (46%), Gaps = 30/289 (10%)
 Frame = +2

Query: 26   DGNYSVKEGFRWLTKQHAISLPAANWNWIWRLQVPKKVKFLLWSE*HGSNPVMKNKHKRG 205
            +G YS K G++WL           +W+WI + ++ +K KFL+W   H S P     H R 
Sbjct: 526  NGTYSAKSGYQWLLSLSGNDNNTHSWSWILKKKISEKYKFLIWLACHDSLPTAALLHHRQ 585

Query: 206  ISQSGACSRCGYEEESFLHAVHDYPMVILFWQALGFTDHYRFFSELAVGWSL*FQASSLH 385
            I  S  C+RCG  +ES  H + D P   + W  +GF++ Y F       W    ++  + 
Sbjct: 586  IIASATCARCGVSDESVFHCIRDCPFSKIIWHHIGFSEPYFFAVTDIEIWC---KSGLIG 642

Query: 386  SKSLVLMED*KW-----NVHC-------------------PGVSQCL-----QLGPDRWV 478
            SK+++      W     N  C                     ++ C       +  DR+V
Sbjct: 643  SKAILFAAGLWWIWRSRNARCMSEESMLLQRLAANITYFVDDINSCFFQPLPVMVSDRYV 702

Query: 479  TWH*SKEEITVLNVDVARWASPGRVGFGGVLRDGEGNWIQGCYGNI-GKFDCLKAELAGL 655
             W+ S    T+LNVD +   SP R GFGG++R+  G ++ G  G +    D L AEL  +
Sbjct: 703  KWNNSNFNCTILNVDGSCIGSPIRAGFGGLIRNSVGFYLSGFLGFLPSSSDILLAELTAI 762

Query: 656  LHGLDMAWSKGASHLVCYSDSTIALELVVVLSPPFHSYTSLIREVQEHL 802
              G++ A   G + +  YSDS +++ L+   S  FH + +LI+++++ L
Sbjct: 763  YDGINTAIDMGITDMAVYSDSLLSINLITTTSSKFHIHAALIQDIRDKL 811


>GAU10400.1 hypothetical protein TSUD_423410, partial [Trifolium subterraneum]
          Length = 284

 Score =  128 bits (322), Expect = 7e-32
 Identities = 82/243 (33%), Positives = 119/243 (48%), Gaps = 27/243 (11%)
 Frame = +2

Query: 206 ISQSGACSRCGYEEESFLHAVHDYPMVILFWQALGFTDHYRFFSELAVGWSL*FQASSLH 385
           +S S  C+RC   EE+ LH + D P+    W +LGF +   FFS   +   L   +  L+
Sbjct: 1   LSSSDICTRCSSGEETILHCLRDCPISRRIWNSLGFQNS-SFFSCSDLELWLRNNSIGLN 59

Query: 386 SKSLVLMED*KW---NVHCPG------------VSQCLQL------------GPDRWVTW 484
           + + +      W   N+ C G            VS+ + L             P RW++W
Sbjct: 60  APTFLAGLWWNWRARNIFCVGNASIHSFKVVAEVSKLVALIVFCFPARVHTDTPRRWISW 119

Query: 485 H*SKEEITVLNVDVARWASPGRVGFGGVLRDGEGNWIQGCYGNIGKFDCLKAELAGLLHG 664
           H  K +  VLNVD +    PGR GFGG+ R G+G WI+G  G +G  + + AEL  + HG
Sbjct: 120 HPCKTDCVVLNVDGSCLGDPGRAGFGGLFRKGDGEWIRGFSGYLGVTNIMLAELMAVYHG 179

Query: 665 LDMAWSKGASHLVCYSDSTIALELVVVLSPPFHSYTSLIREVQEHLHRPWHVRLLNTW*E 844
           L +A   G + L CYSDS   L+L+      FH Y ++I  +Q+ L   W V L ++  E
Sbjct: 180 LKIAREAGYNRLFCYSDSKTVLDLLSKERNSFHCYAAIIANIQDLLVLEWDVSLKHSLRE 239

Query: 845 GNF 853
           GNF
Sbjct: 240 GNF 242


>GAU33259.1 hypothetical protein TSUD_333820 [Trifolium subterraneum]
          Length = 284

 Score =  117 bits (294), Expect = 8e-28
 Identities = 80/241 (33%), Positives = 115/241 (47%), Gaps = 21/241 (8%)
 Frame = +2

Query: 194 HKRGISQSGACSRCGYEEESFLHAVHDYPMVILFWQALGFTDHYRFFSELAVGWSL*FQA 373
           H R +S S  C+RC   EE+ LH + D P+    W +LGF +   FFS   +   L   +
Sbjct: 3   HHRNLSSSDICTRCSSGEETILHCLRDCPISRRIWNSLGFQNS-SFFSCSDLELWLRNNS 61

Query: 374 SSLHSKSLVLMED*KW---NVHCPG------------VSQCLQLGPDRWVTW------H* 490
             L++ + +      W   N+ C G            VS+ + L    +  W        
Sbjct: 62  IGLNAPTFLAGLWWNWRARNICCVGNASIHSFKVVAEVSKLVALIVSCFPAWVRTDTPRV 121

Query: 491 SKEEITVLNVDVARWASPGRVGFGGVLRDGEGNWIQGCYGNIGKFDCLKAELAGLLHGLD 670
            K +  VLNVD +    PGR GFGG+ R G+G WI+G  G +G  +   AEL  + HGL 
Sbjct: 122 CKTDCVVLNVDGSCLGDPGRAGFGGLFRKGDGEWIRGSSGYLGVTNITLAELMAVYHGLK 181

Query: 671 MAWSKGASHLVCYSDSTIALELVVVLSPPFHSYTSLIREVQEHLHRPWHVRLLNTW*EGN 850
           +A   G + L CYSDS   L+L+      FH Y ++I  +Q+ L   W V L ++  EGN
Sbjct: 182 IAREAGYNRLFCYSDSKTVLDLLSKERNSFHCYAAIIANIQDLLVLEWDVSLKHSVREGN 241

Query: 851 F 853
           F
Sbjct: 242 F 242


>XP_016178564.1 PREDICTED: uncharacterized protein LOC107621028 [Arachis ipaensis]
          Length = 570

 Score =  114 bits (286), Expect = 4e-25
 Identities = 79/282 (28%), Positives = 118/282 (41%), Gaps = 25/282 (8%)
 Frame = +2

Query: 35   YSVKEGFRWLTKQHAISLPAANWNWIWRLQVPKKVKFLLWSE*HGSNPVMKNKHKRGISQ 214
            YS + G+ WL K+        NW W+WRL +P+K KFL+W   H + P+ + +  RG+  
Sbjct: 267  YSARSGYSWLAKRKFDWNEHDNWLWVWRLHIPEKYKFLIWLSLHNAIPMAEFRLSRGLDL 326

Query: 215  SGACSRCGYEEESFLHAVHDYPMVILFWQALGF-----------------TDHYRFFSEL 343
            S  C RC    ES LH + + P     W  LG                   D + FFS +
Sbjct: 327  SSTCHRCQNGSESILHCLRECPSAKEVWNLLGLYSDNSNLHDWLYRGARSGDVFLFFSSI 386

Query: 344  AVGWS------L*FQASSLHSKSLVLMED*KWNVHCPGVSQCLQLGPDRWVTWH*SKEEI 505
               W            S   SK + L+       H           P  ++ W       
Sbjct: 387  WWIWKSKNHDLFNIDDSWSASKVVSLIRSSVREFHSIFAMHQSLSHPSLYLHWVPLPVHS 446

Query: 506  TVLNVDVARWASPGRVGFGGVLRDGEGNWIQGCYGNIGKFDCLKAELAGLLHGLDMAWSK 685
              LN D + +AS    GFG ++R+ +G W++GC G +       AEL  +  GL +AW  
Sbjct: 447  VELNCDASWFASFDYAGFGCIIRNPDGCWLKGCTGKVEVCSVFFAELYAIWRGLLLAWES 506

Query: 686  GASHLVCYSDSTIALELV--VVLSPPFHSYTSLIREVQEHLH 805
            G   ++C ++   AL LV   +L      +  L + +QEH H
Sbjct: 507  GFHEVICETNCLEALFLVNQRMLGKDIPEW-DLAKHIQEHYH 547


>GAU41733.1 hypothetical protein TSUD_349940 [Trifolium subterraneum]
          Length = 283

 Score =  110 bits (275), Expect = 5e-25
 Identities = 77/273 (28%), Positives = 122/273 (44%), Gaps = 8/273 (2%)
 Frame = +2

Query: 2   DSLTW-DSLDGNYSVKEGFRWLTKQHAISLPAANWNWIW--RLQVPKKVKFLLWSE*HGS 172
           D  TW D+L+  Y+ ++G+ WL +      P  +   +     +  KK+KF LW+  H +
Sbjct: 22  DCFTWKDNLNELYTARDGYHWLNRNDFSENPIMSSLGLGCGASRHRKKIKFFLWTALHKA 81

Query: 173 NPVMKNKHKRGISQSGACSRCGYEEESFLHAVHDYPMVILFWQALGFTDHYRFFSELAVG 352
            P       RGI    +C RC    E+ +H + D       W+ +GFT    F  +    
Sbjct: 82  IPTKAVLSHRGILHDSSCPRCNNNVETTIHCLRDCDFAKSVWKYIGFTKSIFFQDDEFYV 141

Query: 353 WSL*FQASSLHSKSLVLMED*KWNVHCPGVSQCLQLGPDRWVTWH*SKE-----EITVLN 517
           W      + L S S +L     W +       C+ L    +   H + E        +LN
Sbjct: 142 WL----RNGLDSPSKLLFTAAIWWI-------CVFL-QSYYDAEHLACEMEFGSSDMILN 189

Query: 518 VDVARWASPGRVGFGGVLRDGEGNWIQGCYGNIGKFDCLKAELAGLLHGLDMAWSKGASH 697
           VD +   +PG  GFGG++R+ EG W  G  GNI   + L  EL  L HG+ +AW      
Sbjct: 190 VDESSIGNPGVSGFGGLIRNSEGAWAHGFAGNICFSNILHVELMALYHGILLAWQLNIKE 249

Query: 698 LVCYSDSTIALELVVVLSPPFHSYTSLIREVQE 796
           L CYSD   A++L++     +H Y +++  + +
Sbjct: 250 LWCYSDFETAIKLIIEPVDEWHHYAAILLNIND 282


>KYP32426.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 572

 Score =  112 bits (280), Expect = 2e-24
 Identities = 80/273 (29%), Positives = 123/273 (45%), Gaps = 5/273 (1%)
 Frame = +2

Query: 29   GNYSVKEGFRWL-TKQHAISLPAANWNWIWRLQVPKKVKFLLWSE*HGSNPVMKNKHKRG 205
            G Y+V  GFR+L +  H +  P   W  +W+L V KK+KFLLW   H S P    +  R 
Sbjct: 316  GCYTVASGFRFLLSPDHGLVDPI--WKRVWKLHVLKKIKFLLWQGLHSSIPTNHFRSLRH 373

Query: 206  ISQSGACSRCGYEEESFLHAVHDYPMVILFWQALGFTDHYRFFSELAVGWSL*FQASSLH 385
            +   G+C RC   +E+ LHA+ D P     W  +G T    F       W      +SL 
Sbjct: 374  LDSVGSCPRCSCPQETILHAIRDCPHSQEVWLQVGNTPSQIFSLMDCFTWFKGVITNSLW 433

Query: 386  SKSLVLMED*KWNVHCPGVSQCLQLGPDRWVTWH*SKEEITVLNVDVARWASPGRV-GFG 562
             +++      KW+           L    W+           +NVD   W    R+ G G
Sbjct: 434  KENV------KWS-----------LPKYPWIK----------VNVD-GSWLGQSRIMGVG 465

Query: 563  GVLRDGEGNWIQGCYGNIGKFDCLKAELAGLLHGLDMAWSKGASHLVCYSDSTIALELV- 739
            GV+RD  G W  G   +    D L+AE+  L  GL + W+ G  +++C SD   A++ V 
Sbjct: 466  GVVRDAVGRWKGGFARSFEDGDSLRAEILALAEGLSLCWNAGFRYIICESDCIGAVKAVQ 525

Query: 740  --VVLSPPFHSYTSLIREVQEHLHRPWHVRLLN 832
               +     H ++ +I  V++ + R W V+++N
Sbjct: 526  GPTLDRDNIHKHSDIIGAVKDLVARDWSVKIVN 558


>GAU43217.1 hypothetical protein TSUD_301040 [Trifolium subterraneum]
          Length = 565

 Score =  112 bits (279), Expect = 3e-24
 Identities = 81/290 (27%), Positives = 126/290 (43%), Gaps = 12/290 (4%)
 Frame = +2

Query: 2    DSLTWDSLD-GNYSVKEGFRWLTKQHAISLPAANWNWIWRLQVPKKVKFLLWSE*HGSNP 178
            D  TW + + G YS K+ + WL     I+     W+WIW+L +P  ++F LW   H S P
Sbjct: 279  DIWTWQNDNTGIYSTKDAYIWLLDPMHIN-NLTGWHWIWQLCIPANIQFFLWQLVHESIP 337

Query: 179  VMKNKHKRGISQSGACSRCGYEEESFLHAVHDYPMVILFWQALGF-TDHYRFFSELAVGW 355
                 H R +  +  C RC    E+  H +      +  W   G     Y   S   + W
Sbjct: 338  TRAFLHHRHVCSTDLCPRCSAAAETIDHCLFLCADSVSVWNMCGLHAIPYSLQSTDRISW 397

Query: 356  ----------SL*FQASSLHSKSLVLMED*KWNVHCPGVSQCLQLGPDRWVTWH*SKEEI 505
                      S+    + +H  SLVL     +N      S    + P R ++W      I
Sbjct: 398  NDLVFNHHRESVETIVAKIH--SLVLACSAAFN---SPSSMGESVVPQRRMSWSRPAAGI 452

Query: 506  TVLNVDVARWASPGRVGFGGVLRDGEGNWIQGCYGNIGKFDCLKAELAGLLHGLDMAWSK 685
              LNVD +  +S    G+GG+LRD  G++I G YG     + L AE+  +  GL + W  
Sbjct: 453  MCLNVDGSLLSSKNSAGYGGLLRDNHGDFIWGYYGVAAAQNILYAEIMAIYQGLKLCWEN 512

Query: 686  GASHLVCYSDSTIALELVVVLSPPFHSYTSLIREVQEHLHRPWHVRLLNT 835
            G   ++C SDS +A+ L+     P H + + I  +++ L     V + +T
Sbjct: 513  GYRKVLCCSDSLLAVNLIRQGGTPHHRFANEIHNIRKMLENDREVVITHT 562


>GAU20193.1 hypothetical protein TSUD_352570 [Trifolium subterraneum]
          Length = 171

 Score =  104 bits (260), Expect = 5e-24
 Identities = 53/128 (41%), Positives = 79/128 (61%)
 Frame = +2

Query: 470 RWVTWH*SKEEITVLNVDVARWASPGRVGFGGVLRDGEGNWIQGCYGNIGKFDCLKAELA 649
           RW++WH S++E  VLNVD +   + GR GFGG++R G+G+WI G  G +G  D   AEL 
Sbjct: 2   RWISWHPSRQEGYVLNVDGSCLGASGRTGFGGLIRMGDGSWIVGFSGYLGLKDNTFAELM 61

Query: 650 GLLHGLDMAWSKGASHLVCYSDSTIALELVVVLSPPFHSYTSLIREVQEHLHRPWHVRLL 829
            + HGL +A   G S ++CYSDS   L+L++     +H Y ++I  +Q+ L   W+V L 
Sbjct: 62  AIYHGLRIARDLGFSSILCYSDSQTILDLILKGHSIYHCYAAVITNIQDMLKFNWNVTLS 121

Query: 830 NTW*EGNF 853
           ++  E NF
Sbjct: 122 HSLREENF 129


Top