BLASTX nr result

ID: Astragalus24_contig00021774 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00021774
         (1234 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU29496.1| hypothetical protein TSUD_360410 [Trifolium subt...   102   1e-19
dbj|GAU34029.1| hypothetical protein TSUD_16170 [Trifolium subte...    92   2e-16
dbj|GAU43110.1| hypothetical protein TSUD_373050 [Trifolium subt...    90   2e-15
gb|PNX95738.1| ribonuclease H, partial [Trifolium pratense]            90   2e-15
dbj|GAU25690.1| hypothetical protein TSUD_266170 [Trifolium subt...    89   3e-15
gb|PNX79781.1| ribonuclease H, partial [Trifolium pratense]            87   4e-15
dbj|GAU50085.1| hypothetical protein TSUD_371690 [Trifolium subt...    87   7e-15
dbj|GAU25119.1| hypothetical protein TSUD_274080 [Trifolium subt...    87   2e-14
dbj|GAU32685.1| hypothetical protein TSUD_145590 [Trifolium subt...    85   2e-14
gb|PNY09827.1| ribonuclease H [Trifolium pratense]                     86   3e-14
dbj|GAU34179.1| hypothetical protein TSUD_162800 [Trifolium subt...    86   3e-14
dbj|GAU32048.1| hypothetical protein TSUD_214030 [Trifolium subt...    82   5e-14
dbj|GAU38652.1| hypothetical protein TSUD_276920 [Trifolium subt...    84   9e-14
dbj|GAU20198.1| hypothetical protein TSUD_352620 [Trifolium subt...    83   2e-13
dbj|GAU26515.1| hypothetical protein TSUD_361480 [Trifolium subt...    83   2e-13
dbj|GAU44350.1| hypothetical protein TSUD_129240 [Trifolium subt...    82   3e-13
dbj|GAU29855.1| hypothetical protein TSUD_379450 [Trifolium subt...    79   5e-12
gb|PNX78535.1| ribonuclease H, partial [Trifolium pratense]            77   2e-11
dbj|GAU20924.1| hypothetical protein TSUD_24870 [Trifolium subte...    75   3e-11
dbj|GAU43007.1| hypothetical protein TSUD_187280 [Trifolium subt...    77   4e-11

>dbj|GAU29496.1| hypothetical protein TSUD_360410 [Trifolium subterraneum]
          Length = 1301

 Score =  102 bits (254), Expect = 1e-19
 Identities = 77/279 (27%), Positives = 123/279 (44%), Gaps = 3/279 (1%)
 Frame = -2

Query: 1008 KRCSCARYGDRITNYLDLNKIRPPYLASLWWKDIYRVGGGLAPDKDWFFDNVSKKISDGD 829
            K    A+YG+ I N +D + +R P  AS+WW+DI  +   +   KDWF +++ +K+ +G+
Sbjct: 982  KEVLVAKYGEFILNKVDWSGVRIPSTASMWWRDISSIDK-VVSSKDWFAESIVRKVGNGN 1040

Query: 828  SXXXXXXXXXXXXLSP*GVSPRLFSILNQKDCRAVEMGSRGDLRWVWGLS*RMEFFVWEE 649
            S                 V PRLFS+ N  D    + G   + RW+W  S R + F WEE
Sbjct: 1041 STSFWSTIWIGDDPLS-VVFPRLFSLSNNNDRMVKDFGEYREGRWIWSFSWRRDLFQWEE 1099

Query: 648  WLWLLGDLTILFSPXXXXXXXXXXXXXXXNEGVCSIKXXXXXXXXXXXXXXLRSPLNDYV 469
             L  +  L  L  P                 GV S+               +     +  
Sbjct: 1100 DL--VAQLRELLDPVVLSLEEDWWRWRPETNGVFSVNSSYKLLVDELESEEVLEEA-EIT 1156

Query: 468  LLDKQWQSEFHIYKVLGFTRCLFLNRPPTRTNLFWRGVITDIADTKRVFCGEDQESKSRV 289
            +  + W S     KV+ F+  L  ++ PTR NL  R ++      + V C  + ES   +
Sbjct: 1157 VFGQIWDSPAPS-KVIAFSWQLLYDQIPTRKNLEARDMVLADMPWECVGCVGNVESSLHL 1215

Query: 288  FLKCGFSTRLWYLVFKW*AVLLVV---LSLIFKPYSHSS 181
            FL C  +  +WY VFKW  +++V+   L L+F+ +  S+
Sbjct: 1216 FLHCPSAMLVWYEVFKWLGLVIVIPPSLFLLFEIFRGSA 1254


>dbj|GAU34029.1| hypothetical protein TSUD_16170 [Trifolium subterraneum]
          Length = 865

 Score = 92.4 bits (228), Expect = 2e-16
 Identities = 80/265 (30%), Positives = 114/265 (43%), Gaps = 2/265 (0%)
 Frame = -2

Query: 1008 KRCSCARYGDRITNYLDLNKIRPPYLASLWWKDIYRVGGGLAPDKDWFFDNVSKKISDGD 829
            K    A+YG+ I N +D +  R P LAS WWKDI  +   +  +K+W  ++V +K+ DG+
Sbjct: 529  KEVLVAKYGNHILNMVDWSGYRVPSLASKWWKDINSL-DKVVENKNWIVESVGRKVGDGN 587

Query: 828  SXXXXXXXXXXXXLSP*GVSPRLFSILNQKDCRAVEMGSRGDLRWVWGLS*RMEFFVWEE 649
            S                 V PRLFS+ N K     E   +      W  S R + F WEE
Sbjct: 588  STCFWSSLWIGEAPLS-VVFPRLFSLSNHKTSMVREFYDQQGESRRWSFSRRRDLFQWEE 646

Query: 648  WLWLLGDLTILFSPXXXXXXXXXXXXXXXNEGVCSIKXXXXXXXXXXXXXXLRSPLNDYV 469
             L +   L  L  P                EG  S+K                  L++ V
Sbjct: 647  DLVI--RLKELLDPVTCSLEEDLWVWKSDPEGKFSVKSTYNLLVKELQGG---DELDEEV 701

Query: 468  LL--DKQWQSEFHIYKVLGFTRCLFLNRPPTRTNLFWRGVITDIADTKRVFCGEDQESKS 295
             L  D   +S     KV+ F+  L  +R PTR NL  RG++      + V C  + ES S
Sbjct: 702  ALVFDHLCESPAP-SKVVAFSWQLLYDRIPTRRNLEARGLLVLDTPWECVGCVGNVESSS 760

Query: 294  RVFLKCGFSTRLWYLVFKW*AVLLV 220
             +FL C  +  +WY VF+W  V++V
Sbjct: 761  HLFLHCPSAMMVWYDVFRWLGVIIV 785


>dbj|GAU43110.1| hypothetical protein TSUD_373050 [Trifolium subterraneum]
          Length = 1099

 Score = 90.1 bits (222), Expect = 2e-15
 Identities = 73/264 (27%), Positives = 114/264 (43%), Gaps = 1/264 (0%)
 Frame = -2

Query: 1008 KRCSCARYGDRITNYLDLNKIRPPYLASLWWKDIYRVGGGLAPDKDWFFDNVSKKISDGD 829
            K    A+YG+ I + +D +  R P  AS WWKDI  +   +  DK+W  + V +K+ +G+
Sbjct: 752  KEILVAKYGEHILHRVDWSDYRIPSSASKWWKDICSI-DKVVEDKNWLVEEVGRKVGNGN 810

Query: 828  SXXXXXXXXXXXXLSP*GVSPRLFSILNQKDCRAVEMGSRGDLRWVWGLS*RMEFFVWE- 652
            S                 + PRLFS+ N KDC   +          W  S R E F WE 
Sbjct: 811  STSFWSTKWIGDAPLS-VIFPRLFSLSNHKDCMVRDFYEDDGDNERWRFSWRRELFQWEV 869

Query: 651  EWLWLLGDLTILFSPXXXXXXXXXXXXXXXNEGVCSIKXXXXXXXXXXXXXXLRSPLNDY 472
            + L  L +L + F                  +GV S+K                      
Sbjct: 870  DRLTRLKELLVSF---VFSSDDDSWIWRPDPDGVFSVKSAYNLLIEELRSGEELEE-EAA 925

Query: 471  VLLDKQWQSEFHIYKVLGFTRCLFLNRPPTRTNLFWRGVITDIADTKRVFCGEDQESKSR 292
            ++ ++ W+S     KV+ F+  L  +R PTR NL  RG++   +  + V C    E+ + 
Sbjct: 926  LIFEQIWESPAP-SKVIAFSWQLLYDRIPTRRNLEVRGLLGLDSPWECVGCVGSVETTTH 984

Query: 291  VFLKCGFSTRLWYLVFKW*AVLLV 220
            +FL C  +  +WY VF+W  V++V
Sbjct: 985  LFLHCPSALMVWYEVFRWIGVIIV 1008


>gb|PNX95738.1| ribonuclease H, partial [Trifolium pratense]
          Length = 1375

 Score = 90.1 bits (222), Expect = 2e-15
 Identities = 70/270 (25%), Positives = 113/270 (41%)
 Frame = -2

Query: 1008 KRCSCARYGDRITNYLDLNKIRPPYLASLWWKDIYRVGGGLAPDKDWFFDNVSKKISDGD 829
            K     +YG+ I + LDL + R P  AS WWKD+  +   +   K W  +++S+K+ +G+
Sbjct: 1028 KEILITKYGETIKSCLDLERGRYPSNASRWWKDLCSL-DSVVESKKWLAESISRKVGNGE 1086

Query: 828  SXXXXXXXXXXXXLSP*GVSPRLFSILNQKDCRAVEMGSRGDLRWVWGLS*RMEFFVWEE 649
            +                 + PRLFSI   K     ++G   +  W W L  R   F WE+
Sbjct: 1087 TTSFWLDKWVGNSTLA-SLFPRLFSISIDKQSMVSDLGECVNGAWQWNLIWRRRIFEWEK 1145

Query: 648  WLWLLGDLTILFSPXXXXXXXXXXXXXXXNEGVCSIKXXXXXXXXXXXXXXLRSPLNDYV 469
               L+  L  L                    G  S++              +   + +  
Sbjct: 1146 --ELVEQLFQLLHTAVLSENSDCWVWKPGEGGSFSVR-SAYLVLEEELSVQVNCDVQETR 1202

Query: 468  LLDKQWQSEFHIYKVLGFTRCLFLNRPPTRTNLFWRGVITDIADTKRVFCGEDQESKSRV 289
             L + W S     KV+ F+  L LN  PTR NL  RGV+          C    ES   +
Sbjct: 1203 TLHQLWSSPAP-SKVIAFSWKLLLNSIPTRQNLAHRGVLQQTDSKLCAICVGVDESSVHL 1261

Query: 288  FLKCGFSTRLWYLVFKW*AVLLVVLSLIFK 199
            FL C F++ +WY +F+W  +++V+ + +F+
Sbjct: 1262 FLHCDFASCIWYEIFRWLGLVIVLPANLFQ 1291


>dbj|GAU25690.1| hypothetical protein TSUD_266170 [Trifolium subterraneum]
          Length = 496

 Score = 88.6 bits (218), Expect = 3e-15
 Identities = 70/259 (27%), Positives = 114/259 (44%)
 Frame = -2

Query: 993 ARYGDRITNYLDLNKIRPPYLASLWWKDIYRVGGGLAPDKDWFFDNVSKKISDGDSXXXX 814
           A+YG+ I + +D ++ R P  AS WWKDI  +   +   K+W  ++V +K+ +G+S    
Sbjct: 154 AKYGNHILHKVDWSEFRIPSFASNWWKDICTLDK-VVESKNWLVESVVRKVGNGNSTFFW 212

Query: 813 XXXXXXXXLSP*GVSPRLFSILNQKDCRAVEMGSRGDLRWVWGLS*RMEFFVWEEWLWLL 634
                        V PRLFS+ N K+   V+   + +  W W  S R   F WE  L LL
Sbjct: 213 STTWVGEAPLL-EVFPRLFSLSNHKNNMVVDFRDQQEEVWSWSFSWRRHLFQWE--LELL 269

Query: 633 GDLTILFSPXXXXXXXXXXXXXXXNEGVCSIKXXXXXXXXXXXXXXLRSPLNDYVLLDKQ 454
             L ++  P                +GV S+               +   +   V+ D+ 
Sbjct: 270 EHLRMVLEPVVMSLEEDMWRWKPDPDGVFSVNSAYNLLVDDLEEEDVLEEVVA-VVFDQI 328

Query: 453 WQSEFHIYKVLGFTRCLFLNRPPTRTNLFWRGVITDIADTKRVFCGEDQESKSRVFLKCG 274
           W+S     KV+ F+  L  +R P+R NL  RG++      + V C    ES   +FL C 
Sbjct: 329 WESPAPS-KVIAFSWQLLYDRIPSRCNLEARGLLGTDVPWECVGCVGCAESSIHLFLHCP 387

Query: 273 FSTRLWYLVFKW*AVLLVV 217
               +W  +F+W  +++V+
Sbjct: 388 SVMMVWSDIFRWIDLVVVI 406


>gb|PNX79781.1| ribonuclease H, partial [Trifolium pratense]
          Length = 384

 Score = 87.4 bits (215), Expect = 4e-15
 Identities = 73/273 (26%), Positives = 119/273 (43%), Gaps = 1/273 (0%)
 Frame = -2

Query: 990 RYGDRITNYLDLNKIRPPYLASLWWKDIYRVGGGLAPDKDWFFDNVSKKISDGDSXXXXX 811
           +YG+ +   +DL +   P+ +S WW+DI  +G  L  D +WF ++V +K+ +G+      
Sbjct: 75  KYGNAVIGKVDLGEECKPWFSSSWWRDICSIGTNL--DHNWFSEHVVRKMGNGEQ-TSFW 131

Query: 810 XXXXXXXLSP*GVSPRLFSILNQKDCRAVEMGSRGDLRWVWGLS*RMEFFVWEEWLWLLG 631
                  +S     PRLFSI  QK+     + +  D   VW    R   FVWE  + LL 
Sbjct: 132 EDIWVGEVSLRDHFPRLFSISMQKEASVASLRNLND-AVVWNFIWRRRLFVWE--VTLLD 188

Query: 630 DLTILFSPXXXXXXXXXXXXXXXNEGVCSIKXXXXXXXXXXXXXXLRSPLNDYVLLDKQW 451
           +L ++ +P                    +++                 P  + +     W
Sbjct: 189 ELLLILNPITLSSVTDCWGWRPEKGEEFTVR-STYGLVLNLIIPRDAMPNEERLAFKAIW 247

Query: 450 QSEFHIYKVLGFTRCLFLNRPPTRTNLFWRGVITDIADTKRVFCGEDQESKSRVFLKCGF 271
           +      KV GF   L  +R PTR NL+   +I +  D + VFCG+  E+ + +FL C  
Sbjct: 248 KGPTP-SKVSGFAWMLLHDRVPTRVNLYKCRIIQEDGDQRCVFCGQCAETVTHLFLYCSG 306

Query: 270 STRLWYLVFKW*AVLLVVLSLIF-KPYSHSSIM 175
            T++W+ V  W       L L F  P+S SS++
Sbjct: 307 ITQVWHRVCAW-------LGLHFLLPHSISSLL 332


>dbj|GAU50085.1| hypothetical protein TSUD_371690 [Trifolium subterraneum]
          Length = 438

 Score = 87.0 bits (214), Expect = 7e-15
 Identities = 75/265 (28%), Positives = 111/265 (41%), Gaps = 1/265 (0%)
 Frame = -2

Query: 1008 KRCSCARYGDRITNYLDLNKIRPPYLASLWWKDIYRVGGGLAPDKDWFFDNVSKKISDGD 829
            K    A+YG+ I N +D   IR P LAS WWKDI  +   +  + +W  +++ +K+ +G 
Sbjct: 91   KEVLVAKYGNHILNRVDWRDIRIPTLASKWWKDICTL-DKVVDNHNWLAESMIRKVGNGT 149

Query: 828  SXXXXXXXXXXXXLSP*GVS-PRLFSILNQKDCRAVEMGSRGDLRWVWGLS*RMEFFVWE 652
            S             +P  V+ P LFS+ N K+             W W  S R + F WE
Sbjct: 150  S--TSFWCSNWIGEAPLSVTFPLLFSLSNHKNGMVRNFCDHVGENWRWSFSWRRDLFQWE 207

Query: 651  EWLWLLGDLTILFSPXXXXXXXXXXXXXXXNEGVCSIKXXXXXXXXXXXXXXLRSPLNDY 472
            E   L+  L  +  P                EG  S+K                      
Sbjct: 208  E--DLVVRLREILEPVVLSLVEDFWSWKLDPEGKFSVKSAYTFLVEELTRDDDLEEAMAT 265

Query: 471  VLLDKQWQSEFHIYKVLGFTRCLFLNRPPTRTNLFWRGVITDIADTKRVFCGEDQESKSR 292
            V  D+ W S     KV+ F+  L  +R PTR NL  RG++      + V C    ES + 
Sbjct: 266  V-FDQIWDSPAP-SKVIAFSWQLLSDRIPTRRNLEIRGLLGLDMPWECVGCVGRVESTTH 323

Query: 291  VFLKCGFSTRLWYLVFKW*AVLLVV 217
            +FL C  +  +WY VF+W  V+L++
Sbjct: 324  LFLHCPSAMMVWYEVFRWLGVVLII 348


>dbj|GAU25119.1| hypothetical protein TSUD_274080 [Trifolium subterraneum]
          Length = 937

 Score = 86.7 bits (213), Expect = 2e-14
 Identities = 77/266 (28%), Positives = 117/266 (43%), Gaps = 2/266 (0%)
 Frame = -2

Query: 993  ARYGDRITNYLDLNKIRPPYLASLWWKDIYRVGGGLAPDKDWFFDNVSKKISDGDSXXXX 814
            ARYG+   + +       P  AS WW+D+ R+   L  +  WF  N+S+++  GD+    
Sbjct: 596  ARYGENARHNVLWIGCPIPSSASCWWRDLCRID--LTEEGSWFAKNISRRVGRGDTTRFW 653

Query: 813  XXXXXXXXLSP*GVSPRLFSILNQKDCRAVEM--GSRGDLRWVWGLS*RMEFFVWEEWLW 640
                           PRLFSI  QK+    E+  G  G   W WG   R   FVWEE L 
Sbjct: 654  KDCWVGQVPLCESF-PRLFSISLQKEALVSEIRVGGEGVSWWEWGW--RRSLFVWEEEL- 709

Query: 639  LLGDLTILFSPXXXXXXXXXXXXXXXNEGVCSIKXXXXXXXXXXXXXXLRSPLNDYVLLD 460
            LLG L    SP               + GV ++K              + + + +  +L+
Sbjct: 710  LLG-LQDFISPMAFSTDDDVWYWGLEDGGVFTVKSAYLLLGRMFASFSMFN-VCELRVLN 767

Query: 459  KQWQSEFHIYKVLGFTRCLFLNRPPTRTNLFWRGVITDIADTKRVFCGEDQESKSRVFLK 280
              W+S     KV+ F+  L  NR PTR  L  RG++      + V C   +E+   +FL 
Sbjct: 768  SIWRSPAPS-KVIAFSWKLLRNRIPTRDCLSRRGILAAGGSRECVHCQGREETALHLFLF 826

Query: 279  CGFSTRLWYLVFKW*AVLLVVLSLIF 202
            C F+ R+W  +F+W  V++V+   +F
Sbjct: 827  CDFAFRVWSAIFQWLGVVIVMPPNLF 852


>dbj|GAU32685.1| hypothetical protein TSUD_145590 [Trifolium subterraneum]
          Length = 339

 Score = 84.7 bits (208), Expect = 2e-14
 Identities = 65/237 (27%), Positives = 102/237 (43%)
 Frame = -2

Query: 930 ASLWWKDIYRVGGGLAPDKDWFFDNVSKKISDGDSXXXXXXXXXXXXLSP*GVSPRLFSI 751
           AS WW+++  +G       DWF + V+KK+ +G               S      RLF +
Sbjct: 99  ASCWWRNVSLLGDPEDAISDWFLEGVAKKVGNGRLTSFWFDPWLDGVPSM-SRFQRLFKV 157

Query: 750 LNQKDCRAVEMGSRGDLRWVWGLS*RMEFFVWEEWLWLLGDLTILFSPXXXXXXXXXXXX 571
                C   +MGS  + +W+W    R + FVWE  L LL  L  + +             
Sbjct: 158 SAHSSCMVGDMGSWVEGQWIWVFRWRRDLFVWE--LNLLESLHEILNHSTISTVEDSWFW 215

Query: 570 XXXNEGVCSIKXXXXXXXXXXXXXXLRSPLNDYVLLDKQWQSEFHIYKVLGFTRCLFLNR 391
                G  S+K              + S + +  LL K W++ +    V  F+  L  +R
Sbjct: 216 MHDPSGHYSVKSAFLALSCSTANEVIFS-VEEKRLLPKVWKT-WAPSNVAVFSWQLLQDR 273

Query: 390 PPTRTNLFWRGVITDIADTKRVFCGEDQESKSRVFLKCGFSTRLWYLVFKW*AVLLV 220
            PTR NL+ RGVI D++ +  V CG + E+   +F  C   +++WY + +W  V LV
Sbjct: 274 LPTRQNLWKRGVIGDVSASTCVLCGLEPETADHLFGSCNQISQIWYGILRWLGVELV 330


>gb|PNY09827.1| ribonuclease H [Trifolium pratense]
          Length = 958

 Score = 86.3 bits (212), Expect = 3e-14
 Identities = 78/280 (27%), Positives = 117/280 (41%), Gaps = 10/280 (3%)
 Frame = -2

Query: 1008 KRCSCARYGDRITNYLDLNKIRPPYLASLWWKDIYRVGGGLAPDKDWFFDNVSKKISDGD 829
            K    A+YG  I  + D    R P  AS WWKDI  +   +A  K+WF  ++ +K+ +G 
Sbjct: 611  KEVLVAKYGSHIVQFADWCNYRIPSSASNWWKDICALDTVVA-SKNWFAASLERKMGNGR 669

Query: 828  SXXXXXXXXXXXXLSP*GVS-PRLFSILNQKDCRAVEMGSRGDLRWVWGLS*RMEFFVWE 652
            S              P  ++ PRLFSI  QKD    E     ++   W    R   F  E
Sbjct: 670  STHFWLSTWIGDV--PLSIAFPRLFSISTQKDGMVEEFYRSNEIGCKWVFPWRRNLFQRE 727

Query: 651  EWLWLLGDLTILFSPXXXXXXXXXXXXXXXNEGVCSIKXXXXXXXXXXXXXXLRSPLNDY 472
              L L+  L  L  P                EG  S+                    +D 
Sbjct: 728  --LPLVDRLLELLDPVSLSLEEDLWRWLPNPEGTFSVNSSYNYLVKELRD-------SDV 778

Query: 471  VLLDKQ------WQSEFHIYKVLGFTRCLFLNRPPTRTNLFWRGVITDIADTKRVFCGED 310
            ++L+K       W+S     KV+ F+  L  +R PTR NL +RG++   A    V C   
Sbjct: 779  LVLEKTAVFNQIWESPAPS-KVIAFSWQLLYDRIPTRCNLDYRGILAPDAPRDCVGCVGM 837

Query: 309  QESKSRVFLKCGFSTRLWYLVFKW*AVLLVV---LSLIFK 199
             ES + +F+ C  +  +WY +F+W  V++V+   LS +F+
Sbjct: 838  TESSTHLFVHCSNAISVWYAIFRWIGVVIVIPPNLSTLFE 877


>dbj|GAU34179.1| hypothetical protein TSUD_162800 [Trifolium subterraneum]
          Length = 757

 Score = 85.9 bits (211), Expect = 3e-14
 Identities = 73/276 (26%), Positives = 115/276 (41%)
 Frame = -2

Query: 1008 KRCSCARYGDRITNYLDLNKIRPPYLASLWWKDIYRVGGGLAPDKDWFFDNVSKKISDGD 829
            K    A+YG  I + +  +   PPY ASLWWKDI  +       K+W  + V++ + +G 
Sbjct: 410  KEVLVAKYGGHILHNVVWSLGSPPYRASLWWKDINDLQA-CVNSKNWVAEMVTRFLGNG- 467

Query: 828  SXXXXXXXXXXXXLSP*GVSPRLFSILNQKDCRAVEMGSRGDLRWVWGLS*RMEFFVWEE 649
            S            +      PRLFS+  QK+    EM         W    R   F+WEE
Sbjct: 468  SRTRFWSDNWIGDVLLCSKFPRLFSLSLQKEATVSEMMVVEGETKSWNFLWRRSLFLWEE 527

Query: 648  WLWLLGDLTILFSPXXXXXXXXXXXXXXXNEGVCSIKXXXXXXXXXXXXXXLRSPLNDYV 469
                +  L  L                   +G  S+K                SP  +  
Sbjct: 528  ER--VSQLLSLLENVSLSLEEDKWHWALDPDGCFSVKSAYDSLLENLDTSPNLSPY-EAK 584

Query: 468  LLDKQWQSEFHIYKVLGFTRCLFLNRPPTRTNLFWRGVITDIADTKRVFCGEDQESKSRV 289
            +    W S   + KV+ F+  L  +R PT+ NL  RGV+   +    V+CG+ +ES + +
Sbjct: 585  IFSNIWDSPAPL-KVVVFSWRLLHDRVPTKENLIVRGVLPRESSGSCVWCGDIRESSAHL 643

Query: 288  FLKCGFSTRLWYLVFKW*AVLLVVLSLIFKPYSHSS 181
            FL C  +  +WY +F+W  V++V+   +F  + + S
Sbjct: 644  FLHCKVALVVWYEIFRWLGVVIVIPPNLFTLFDYFS 679


>dbj|GAU32048.1| hypothetical protein TSUD_214030 [Trifolium subterraneum]
          Length = 274

 Score = 82.4 bits (202), Expect = 5e-14
 Identities = 71/248 (28%), Positives = 105/248 (42%), Gaps = 2/248 (0%)
 Frame = -2

Query: 939 PYLASLWWKDIYRVGGGLAPDKDWFFDNVSKKISDGDSXXXXXXXXXXXXLSP*GVSPRL 760
           P +ASLWWKDI  +       K+W  D  S+ I +G +            L      PRL
Sbjct: 5   PRVASLWWKDICDLEA-CVDSKNWVEDMFSRSIGNGATTRFWCDKWLGDSLLSVKF-PRL 62

Query: 759 FSILNQKDCRAVEMGSRGDLRWVWGLS*RMEFFVWEEWL--WLLGDLTILFSPXXXXXXX 586
           FS+   K+    E+   G+    W  S R   F+WEE     LL DL  +          
Sbjct: 63  FSLSLNKEETVNELVVVGESTISWNFSWRRNLFLWEEDCVSLLLADLESV----NLSRDE 118

Query: 585 XXXXXXXXNEGVCSIKXXXXXXXXXXXXXXLRSPLNDYVLLDKQWQSEFHIYKVLGFTRC 406
                    EG  S+K                 P    ++    W+S     KV+ F+  
Sbjct: 119 DKWRWVLDPEGCFSVKSAYDSLSKEIVVGSSLRPFES-LIFKNIWESPAPS-KVIIFSWQ 176

Query: 405 LFLNRPPTRTNLFWRGVITDIADTKRVFCGEDQESKSRVFLKCGFSTRLWYLVFKW*AVL 226
           LF +R PT  NL  RGV+T  +    V+CG+ +ES + +FL C  +  +WY +FKW  V+
Sbjct: 177 LFYDRVPTMENLLLRGVLTSNSGGNCVWCGDIRESSTHLFLHCKVALVVWYEIFKWLGVV 236

Query: 225 LVVLSLIF 202
           +++   +F
Sbjct: 237 IIMPPNLF 244


>dbj|GAU38652.1| hypothetical protein TSUD_276920 [Trifolium subterraneum]
          Length = 719

 Score = 84.3 bits (207), Expect = 9e-14
 Identities = 75/267 (28%), Positives = 114/267 (42%), Gaps = 5/267 (1%)
 Frame = -2

Query: 987  YGDRITNYLDLNKIRPPYLASLWWKDIYRVGGGLAPDKDWFFDNVSKKISDGDSXXXXXX 808
            YGD+I + +D +    PY AS WWKDI  +   +   K+W  +++S+++ DG S      
Sbjct: 446  YGDQILHRVDWSSFSIPYKASNWWKDICSL-ENVVEAKNWLAESISRRVGDGSSTSFWAT 504

Query: 807  XXXXXXLSP*GVSPRLFSILNQKDCRAVEM-GSRGDLRWVWGLS*RMEFFVWEEW----L 643
                       V PRLFS+ NQ+     ++   RG  R  W  S R   F WEE     L
Sbjct: 505  RWIGEAPLA-VVFPRLFSLSNQQGATVKDLCEQRGGTR-TWNFSWRRNLFQWEEGIVTRL 562

Query: 642  WLLGDLTILFSPXXXXXXXXXXXXXXXNEGVCSIKXXXXXXXXXXXXXXLRSPLNDYVLL 463
              L DL IL                   +G  S+                R  ++   + 
Sbjct: 563  QELVDLVIL------SREEDRWWWRPDPDGEFSVN-----SSYKFLIGEFRLDVDAARIF 611

Query: 462  DKQWQSEFHIYKVLGFTRCLFLNRPPTRTNLFWRGVITDIADTKRVFCGEDQESKSRVFL 283
             + W+S     KV+ F+  L  +R PTR NL  RG++      + + C    ES   +FL
Sbjct: 612  GQLWESPAP-SKVIAFSWQLLYDRIPTRRNLEVRGILGRDTPWECLGCVGMVESSLHLFL 670

Query: 282  KCGFSTRLWYLVFKW*AVLLVVLSLIF 202
             C  +  +WY VF+W  V++V+   +F
Sbjct: 671  HCLSAMMMWYEVFRWLGVIIVIPPSLF 697


>dbj|GAU20198.1| hypothetical protein TSUD_352620 [Trifolium subterraneum]
          Length = 678

 Score = 83.2 bits (204), Expect = 2e-13
 Identities = 72/265 (27%), Positives = 106/265 (40%), Gaps = 1/265 (0%)
 Frame = -2

Query: 1008 KRCSCARYGDRITNYLDLNKIRPPYLASLWWKDIYRVGGGLAPDKDWFFDNVSKKISDGD 829
            K    A+YG  I N +    I  PY AS WWKDI  +       K+W  + VS+ + +G 
Sbjct: 410  KEVLVAKYGSHILNNVSWINISNPYFASSWWKDIGDLES-YVDSKNWLTEAVSRSLGNG- 467

Query: 828  SXXXXXXXXXXXXLSP*GVSPRLFSILNQKDCRAVEMGSRGDLRWV-WGLS*RMEFFVWE 652
                                PRLFS+  Q+D    E+  R +   + W L+ R   F WE
Sbjct: 468  MLTRFWSDVWIGDAKLCSKFPRLFSLSLQRDACVSEVVVREEEETLSWNLTWRRRLFQWE 527

Query: 651  EWLWLLGDLTILFSPXXXXXXXXXXXXXXXNEGVCSIKXXXXXXXXXXXXXXLRSPLNDY 472
            E    L  L                      +GV S+K                 P    
Sbjct: 528  EDS--LNQLVASLGSVRLSNVDDKWRWSCDPDGVFSVKSTYDSISKELTVGPTL-PQFQA 584

Query: 471  VLLDKQWQSEFHIYKVLGFTRCLFLNRPPTRTNLFWRGVITDIADTKRVFCGEDQESKSR 292
            ++  K W S     KV+ F+  L  +R PT+ NL  RG++   +    V+C    ES S 
Sbjct: 585  LIFKKIWDSPTPS-KVIVFSWQLLHDRVPTKVNLRLRGILPIESSCNCVWCPNIGESASH 643

Query: 291  VFLKCGFSTRLWYLVFKW*AVLLVV 217
            +FL C  +  +WY VF+W  V++++
Sbjct: 644  LFLHCKVALDIWYEVFRWLGVIIII 668


>dbj|GAU26515.1| hypothetical protein TSUD_361480 [Trifolium subterraneum]
          Length = 873

 Score = 83.2 bits (204), Expect = 2e-13
 Identities = 74/270 (27%), Positives = 110/270 (40%), Gaps = 1/270 (0%)
 Frame = -2

Query: 1008 KRCSCARYGDRITNYLDLNKIRPPYLASLWWKDIYRVGGGLAPDKDWFFDNVSKKISDGD 829
            K    A+YG+ +   L  N    P  AS WWKDI  +   +  +  WF  +V +++  GD
Sbjct: 527  KELLVAKYGEMVRQKLHWNDCPIPSRASSWWKDICEID--VCEEGSWFAQHVFRRVGKGD 584

Query: 828  SXXXXXXXXXXXXLSP*-GVSPRLFSILNQKDCRAVEMGSRGDLRWVWGLS*RMEFFVWE 652
            S             SP   + PRLFSI   K+    E+    +   +W    R   FVWE
Sbjct: 585  SIRFWKDCWFGN--SPLCDLFPRLFSIATHKEALVNEVRVVTEGLNLWNWEWRRRLFVWE 642

Query: 651  EWLWLLGDLTILFSPXXXXXXXXXXXXXXXNEGVCSIKXXXXXXXXXXXXXXLRSPLNDY 472
            + L  L  LT                    + GV ++K              + SP  + 
Sbjct: 643  QEL--LVSLTETLPLLVLSGEEDVWYWRLEDGGVFTVKSVYTLLGSVFATDAVWSP-PEL 699

Query: 471  VLLDKQWQSEFHIYKVLGFTRCLFLNRPPTRTNLFWRGVITDIADTKRVFCGEDQESKSR 292
             + D+ W+S     KV+ F   L  NR PT+ NL  RG+         V C    E  S 
Sbjct: 700  RVFDQIWKSPAPS-KVIVFPWKLLRNRIPTKANLALRGIQVVGGSLNCVHCVGSGEDASH 758

Query: 291  VFLKCGFSTRLWYLVFKW*AVLLVVLSLIF 202
            +F+ C F+ ++W  +F+W  V +V+   IF
Sbjct: 759  LFMYCNFAAQVWNSIFRWIGVTIVIPPNIF 788


>dbj|GAU44350.1| hypothetical protein TSUD_129240 [Trifolium subterraneum]
          Length = 388

 Score = 81.6 bits (200), Expect = 3e-13
 Identities = 73/266 (27%), Positives = 112/266 (42%), Gaps = 2/266 (0%)
 Frame = -2

Query: 1008 KRCSCARYGDRITNYLDLNKIRPPYLASLWWKDIYRVGGGLAPDKDWFFDNVSKKISDGD 829
            K+   A+YG+ I N +  + IR P LAS WWKD+  +   +   K+W  +++ +K+ +G 
Sbjct: 65   KQVLVAKYGNHILNRVIWSDIRIPSLASKWWKDVCSLDK-VVESKNWLGESIVRKVGNGF 123

Query: 828  SXXXXXXXXXXXXLSP*GVSPRLFSILNQKDCRAVEMGSRGDLRWVWGLS*RMEFFVWEE 649
            S                 V PRL+S+   KD    +   +    W W  S R   F WEE
Sbjct: 124  STYFWSSNWIGEAPLL-EVFPRLYSLSIHKDSMVRDFYVQEGGGWRWSFSWRRNLFQWEE 182

Query: 648  WLWLLGDLTILFSPXXXXXXXXXXXXXXXNEGVCSIKXXXXXXXXXXXXXXLRSPLNDYV 469
             L  +  L  +  P                EG  S+K                  L + V
Sbjct: 183  DL--VTRLREMVEPVPLSLEEDYWVWSPDPEGKFSVKSAYNFLGDELRVG---EDLEEEV 237

Query: 468  LL--DKQWQSEFHIYKVLGFTRCLFLNRPPTRTNLFWRGVITDIADTKRVFCGEDQESKS 295
             L  D  W S     KV+ F+  L  +R P+R NL  RG++      + V C    ES +
Sbjct: 238  ALVFDNIWGSPAPS-KVIAFSWQLLYDRIPSRRNLEARGLLCLDMPWECVGCVGSVESTT 296

Query: 294  RVFLKCGFSTRLWYLVFKW*AVLLVV 217
             +FL C  + ++W  VF+W  V++V+
Sbjct: 297  HLFLHCPSAMKVWQEVFRWLGVVIVI 322


>dbj|GAU29855.1| hypothetical protein TSUD_379450 [Trifolium subterraneum]
          Length = 498

 Score = 78.6 bits (192), Expect = 5e-12
 Identities = 76/254 (29%), Positives = 103/254 (40%), Gaps = 4/254 (1%)
 Frame = -2

Query: 960 DLNKIRPPYLASLWWKDIYRVG--GGLAPDKDWFFDNVSKKISDGDSXXXXXXXXXXXXL 787
           DL  +    L    W+ +  VG   GL   KD FF  +    + GD+            L
Sbjct: 192 DLRIMNISLLTKWKWRLLSEVGLIDGLDRTKDLFFKRIG---NGGDTRFWHDTWVGAQPL 248

Query: 786 SP*GVSPRLFSILNQKDCRAVEMGSRGDLRWVWGLS*RMEFFVWEEWLWLLGDLTILFSP 607
               V PRLF I +QK+C  +E+G      W W    R   FVWEE L  + D TIL +P
Sbjct: 249 KE--VFPRLFLISSQKECSVLEVGRWVSEVWEWNCKWRRSLFVWEEELADMLD-TIL-TP 304

Query: 606 XXXXXXXXXXXXXXXNEGVCSIKXXXXXXXXXXXXXXLRSP--LNDYVLLDKQWQSEFHI 433
                            G  S+                 +P  + D   L K W      
Sbjct: 305 IQLSHSNDEWRCHHATGGRFSVSSLYCFLSGSILPPISLNPDFVRDLGFLWKSWAPS--- 361

Query: 432 YKVLGFTRCLFLNRPPTRTNLFWRGVITDIADTKRVFCGEDQESKSRVFLKCGFSTRLWY 253
            KV+ F+  L   R PTR NL  RG+  +  D+  V C  + ES+  +F  C F++ LW 
Sbjct: 362 -KVVVFSWQLLRRRLPTRENLGKRGICDNGTDSNCVLCPLELESEGHLFCGCAFASTLWT 420

Query: 252 LVFKW*AVLLVVLS 211
            +FKW  + LVV S
Sbjct: 421 KIFKWFGLGLVVPS 434


>gb|PNX78535.1| ribonuclease H, partial [Trifolium pratense]
          Length = 564

 Score = 77.0 bits (188), Expect = 2e-11
 Identities = 71/267 (26%), Positives = 105/267 (39%)
 Frame = -2

Query: 993  ARYGDRITNYLDLNKIRPPYLASLWWKDIYRVGGGLAPDKDWFFDNVSKKISDGDSXXXX 814
            A+YG+ I           P+ ASLWWKDI R+       K+W  + V + + DG      
Sbjct: 282  AKYGNHILMKAVWPSGTIPHNASLWWKDICRLED-CVESKNWVEELVVRSLGDGACTGFW 340

Query: 813  XXXXXXXXLSP*GVSPRLFSILNQKDCRAVEMGSRGDLRWVWGLS*RMEFFVWEEWLWLL 634
                    L      PRLFS+  QK     E+      R  W    R   F+WEE    +
Sbjct: 341  YDKWMGGDLLC-TKFPRLFSLSLQKQATVRELVEVDGDRKTWNFLWRRSLFIWEEES--V 397

Query: 633  GDLTILFSPXXXXXXXXXXXXXXXNEGVCSIKXXXXXXXXXXXXXXLRSPLNDYVLLDKQ 454
              L  L                   +GV S+K                 P  +  +  K 
Sbjct: 398  NQLLALLENANFSNLADNWRWVGDPDGVFSVKSAYETLMKDMVIGPSLLPF-EAKIFSKI 456

Query: 453  WQSEFHIYKVLGFTRCLFLNRPPTRTNLFWRGVITDIADTKRVFCGEDQESKSRVFLKCG 274
            W+S     KV+ F+  L  +R PT+ NL  RG++     +  V+C   +ES   +FL C 
Sbjct: 457  WESPTPS-KVIVFSWQLLYDRVPTKANLILRGILPIENGSNCVWCDNIRESALHLFLHCK 515

Query: 273  FSTRLWYLVFKW*AVLLVVLSLIFKPY 193
             +  +W  +FKW  V++V+   +F  Y
Sbjct: 516  VAIDVWNAIFKWLGVVIVLPPNLFYLY 542


>dbj|GAU20924.1| hypothetical protein TSUD_24870 [Trifolium subterraneum]
          Length = 332

 Score = 75.1 bits (183), Expect = 3e-11
 Identities = 63/235 (26%), Positives = 96/235 (40%), Gaps = 6/235 (2%)
 Frame = -2

Query: 927 SLWWKDIYRVGGGLAPDKDWFFDNVSKKISDGDSXXXXXXXXXXXXLSP*GVSP------ 766
           S WW+++  +G       DWF + VSKK+ +G                  G +P      
Sbjct: 109 SCWWRNVSLLGDPDDAISDWFSEGVSKKVGNGHMTSFWFEPWL-------GGTPLRTQFQ 161

Query: 765 RLFSILNQKDCRAVEMGSRGDLRWVWGLS*RMEFFVWEEWLWLLGDLTILFSPXXXXXXX 586
           RLF +  Q      EM    + +W+WGL  R + FVWE  L L+  +  +          
Sbjct: 162 RLFLVSTQSTSTVREMDMWVEGQWLWGLRWRRDLFVWE--LNLMESINQIMDRSTISTTN 219

Query: 585 XXXXXXXXNEGVCSIKXXXXXXXXXXXXXXLRSPLNDYVLLDKQWQSEFHIYKVLGFTRC 406
                     G  S+K              + S + +  LL K W++ +   KV  F+  
Sbjct: 220 DYWCWKHDPSGCYSVKSAFLALSRSTTDEVIFS-VEEQRLLPKVWKT-WAPSKVAVFSWQ 277

Query: 405 LFLNRPPTRTNLFWRGVITDIADTKRVFCGEDQESKSRVFLKCGFSTRLWYLVFK 241
           L  +R PTR NL+ RGVI D + +  V CG   ES   +F  C   + +WY + +
Sbjct: 278 LLQDRLPTRQNLWHRGVIGDASASMCVLCGLGPESADHLFGSCNQISPIWYSILR 332


>dbj|GAU43007.1| hypothetical protein TSUD_187280 [Trifolium subterraneum]
          Length = 1892

 Score = 76.6 bits (187), Expect = 4e-11
 Identities = 69/252 (27%), Positives = 99/252 (39%), Gaps = 1/252 (0%)
 Frame = -2

Query: 1008 KRCSCARYGDRITNYLDLNKIRPPYLASLWWKDIYRVGGGLAPDKDWFFDNVSKKISDGD 829
            K    A+YG  I    +L  +  P +AS WWKDI      L  D +WF + V + + +G 
Sbjct: 1601 KEVVIAKYGQYIIGNGNLGNVTIPRVASTWWKDIC----SLDKDSNWFAEAVEQSVGNGH 1656

Query: 828  SXXXXXXXXXXXXLSP*GVSPRLFSILNQKDCRAVEMGSRGDLRWVWGLS*RMEFFVWEE 649
                          S     PR++SI NQKD     MG     RW W  + R   F WEE
Sbjct: 1657 -LTSFWSDIWIGDQSLQQRFPRMYSISNQKDSSIFNMGRWDGDRWRWDFNWRRNLFAWEE 1715

Query: 648  WLWL-LGDLTILFSPXXXXXXXXXXXXXXXNEGVCSIKXXXXXXXXXXXXXXLRSPLNDY 472
             + L L D+   F P                E   S+K              +  P  ++
Sbjct: 1716 PMKLELMDVLNQFRP---SDREDRWLWSENKEDGFSVKTCYDRLQYMFCERRVLEPSEEF 1772

Query: 471  VLLDKQWQSEFHIYKVLGFTRCLFLNRPPTRTNLFWRGVITDIADTKRVFCGEDQESKSR 292
            V   K W+      KV  F+  L  +R  T+ NL+ R ++     T  V C    E+   
Sbjct: 1773 VFA-KLWKCGAPT-KVCAFSWQLLWDRLQTKENLYKRRILQQ-QQTMCVLCNAAVETNRH 1829

Query: 291  VFLKCGFSTRLW 256
            +FL C F+ ++W
Sbjct: 1830 LFLHCDFAAKVW 1841


Top