BLASTX nr result

ID: Glycyrrhiza29_contig00015764 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza29_contig00015764
         (1062 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

GAU22483.1 hypothetical protein TSUD_296020 [Trifolium subterran...   175   1e-75
GAU45885.1 hypothetical protein TSUD_401090, partial [Trifolium ...   166   2e-75
GAU50364.1 hypothetical protein TSUD_409370 [Trifolium subterran...   169   2e-74
GAU14396.1 hypothetical protein TSUD_249360 [Trifolium subterran...   167   2e-73
GAU42656.1 hypothetical protein TSUD_398610 [Trifolium subterran...   162   5e-73
GAU21273.1 hypothetical protein TSUD_286830 [Trifolium subterran...   163   2e-71
GAU51593.1 hypothetical protein TSUD_12600 [Trifolium subterraneum]   157   2e-71
GAU43816.1 hypothetical protein TSUD_248050 [Trifolium subterran...   151   6e-71
GAU33337.1 hypothetical protein TSUD_166020 [Trifolium subterran...   157   8e-71
GAU40490.1 hypothetical protein TSUD_189710 [Trifolium subterran...   155   1e-70
KYP46096.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan]       174   2e-69
GAU46774.1 hypothetical protein TSUD_402850 [Trifolium subterran...   163   4e-69
GAU32903.1 hypothetical protein TSUD_152630 [Trifolium subterran...   159   3e-68
GAU21183.1 hypothetical protein TSUD_11000 [Trifolium subterraneum]   160   8e-68
GAU48515.1 hypothetical protein TSUD_244350 [Trifolium subterran...   162   1e-67
GAU28506.1 hypothetical protein TSUD_156630 [Trifolium subterran...   158   2e-67
GAU24087.1 hypothetical protein TSUD_388800 [Trifolium subterran...   158   8e-67
GAU17884.1 hypothetical protein TSUD_330100 [Trifolium subterran...   145   2e-66
KYP44439.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus ca...   194   3e-66
GAU20577.1 hypothetical protein TSUD_33240 [Trifolium subterraneum]   153   5e-66

>GAU22483.1 hypothetical protein TSUD_296020 [Trifolium subterraneum]
          Length = 1115

 Score =  175 bits (444), Expect(2) = 1e-75
 Identities = 79/165 (47%), Positives = 109/165 (66%)
 Frame = +1

Query: 7   NKLSVSIVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNH 186
           N ++  +VN+YAPCD  GK  +W+ LG L+ +    RWCV G+FNS+++  ER G     
Sbjct: 95  NGVNFRVVNIYAPCDARGKADLWQRLGGLIQADSEARWCVCGDFNSVRNGGERCGRGGGV 154

Query: 187 RSEEIELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWG 366
              E++ F +FI    LIDLPL GR+FTW + DG +MSRLDRFL+S  W  TW N  Q  
Sbjct: 155 VDAEVDRFNEFILNSELIDLPLHGRRFTWSRSDGSSMSRLDRFLLSGSWCTTWPNCIQVA 214

Query: 367 MQRSVSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKRM 501
           + R +SDHC +IL+E + +WGP+PFRMM+CW +  GY  FV+ +M
Sbjct: 215 ILRGLSDHCPLILREHEEDWGPRPFRMMKCWRDFPGYNSFVRDQM 259



 Score =  137 bits (346), Expect(2) = 1e-75
 Identities = 70/183 (38%), Positives = 100/183 (54%), Gaps = 1/183 (0%)
 Frame = +2

Query: 515  VRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVDGLNEVE-V 691
            + GW G+               WH  H  N+ +KI EAKE +  LDL+GE   L + + +
Sbjct: 264  IDGWGGFVLKEKFKLLKNSLRVWHLSHAKNIGSKILEAKERLLGLDLRGENANLTDDDLI 323

Query: 692  GDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEILGLDFDG 871
             +RR   + I   S + C +LWQ SR KWL EGDANSKFFH     R++ N I+ LD +G
Sbjct: 324  IERREVTSTIFSLSRIECSMLWQNSRTKWLLEGDANSKFFHALANSRKRKNLIVLLDVNG 383

Query: 872  SFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSEEEVKAMV 1051
              +EGV  +R  I  +F S F ++  +RP  GN++  T+  +E+  L   F E+E K  V
Sbjct: 384  IQVEGVGNIRESIFNHFSSQFKSQRISRPDVGNLDFKTISESEATVLVEEFGEDETKQAV 443

Query: 1052 WNC 1060
            W+C
Sbjct: 444  WDC 446


>GAU45885.1 hypothetical protein TSUD_401090, partial [Trifolium subterraneum]
          Length = 751

 Score =  166 bits (419), Expect(2) = 2e-75
 Identities = 74/158 (46%), Positives = 103/158 (65%)
 Frame = +1

Query: 25  IVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEIE 204
           ++N+YAPCDLG K+R+W  L   + S  G R CV G+FN+++   ER+         +  
Sbjct: 60  VMNIYAPCDLGAKQRLWNSLSVRLQSLAGRRVCVCGDFNAVRCQEERRSSRVGPSQADHI 119

Query: 205 LFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRSVS 384
            F  FIE   L+DLPL GRKFTWY+ DG +MSRLDRFL+SE+W   W N  Q    R +S
Sbjct: 120 PFNSFIEDNNLVDLPLGGRKFTWYRGDGLSMSRLDRFLLSEEWCLAWPNCLQVAQLRGLS 179

Query: 385 DHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498
           DHC ++L   + NWGP+P RM++CW++  GY+ FVK++
Sbjct: 180 DHCPLMLVASEENWGPRPLRMLKCWKDIPGYDLFVKEK 217



 Score =  146 bits (368), Expect(2) = 2e-75
 Identities = 75/190 (39%), Positives = 102/190 (53%)
 Frame = +2

Query: 491  KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670
            K++W+ L V GW G+               WH  H  NL ++I   K  +  LD KGE +
Sbjct: 215  KEKWNSLHVDGWGGFVLKEKLKLIKVALKEWHLSHAQNLPSRIDSLKTRLSNLDNKGEEE 274

Query: 671  GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850
             L+  EV D R      +  S ++  I WQ+SR+ WLKEGDANSK+FH  +  RR+ N I
Sbjct: 275  DLSVDEVVDMRGITFEFHSLSRLHASISWQQSRLLWLKEGDANSKYFHSVLASRRRGNTI 334

Query: 851  LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030
              L  DG  +EGVNP+R+ +  +F SHF   +  RP   N+    L   +  +LT PF  
Sbjct: 335  STLQADGVTLEGVNPIRQAVFTHFASHFKASNVERPGVDNLQFKRLSWLDIGSLTRPFLV 394

Query: 1031 EEVKAMVWNC 1060
            EEVKA VW+C
Sbjct: 395  EEVKAAVWDC 404


>GAU50364.1 hypothetical protein TSUD_409370 [Trifolium subterraneum]
          Length = 546

 Score =  169 bits (428), Expect(2) = 2e-74
 Identities = 76/158 (48%), Positives = 104/158 (65%)
 Frame = +1

Query: 25  IVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEIE 204
           + NVYAPC LG K+ +W  L   +    G R CV G+FN+++SI ER+       S +  
Sbjct: 34  LANVYAPCGLGAKQSLWNSLLGRILLLNGERVCVCGDFNAVRSIEERRSARAGSHSSDHI 93

Query: 205 LFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRSVS 384
            F  FI+   LIDLPL GRKFTWYK DG AMSR+DRFL+SE+W  TW N  Q    R +S
Sbjct: 94  PFNRFIDDAVLIDLPLSGRKFTWYKGDGLAMSRIDRFLLSEEWCLTWPNCVQVAQLRGLS 153

Query: 385 DHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498
           DHC ++L+ ++ NWGP+P RM++CW++  GY+ FV+ +
Sbjct: 154 DHCPLVLEVEEENWGPRPSRMLKCWKDIPGYQQFVRDK 191



 Score =  140 bits (352), Expect(2) = 2e-74
 Identities = 70/190 (36%), Positives = 98/190 (51%)
 Frame = +2

Query: 491  KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670
            + +W  +++ GW G+               WH  H  NL  +I+  K  +  L++KGE  
Sbjct: 189  RDKWKAMQIVGWGGFVLKEKFKMIRLALKEWHAAHSQNLPGRIESLKVRLAALEVKGEAA 248

Query: 671  GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850
             L+E E+ +       I+  S  +  I WQ+SR  WLKEGDANSK+FH  V  RR+ N +
Sbjct: 249  VLSEAELEELHGLTTEIHSLSRRSASICWQQSRSLWLKEGDANSKYFHSVVASRRRGNAV 308

Query: 851  LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030
              +  DG   EGV P+R+ +  +F SHF     ARP   N+    L   E  +LT PFS 
Sbjct: 309  SFIQVDGVTTEGVQPIRQAVFEHFASHFKESHVARPGVDNLQFKRLTLLEGGSLTKPFSL 368

Query: 1031 EEVKAMVWNC 1060
            EEVK  VW+C
Sbjct: 369  EEVKTAVWDC 378


>GAU14396.1 hypothetical protein TSUD_249360 [Trifolium subterraneum]
          Length = 845

 Score =  167 bits (422), Expect(2) = 2e-73
 Identities = 74/155 (47%), Positives = 100/155 (64%)
 Frame = +1

Query: 25  IVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEIE 204
           + NVYAPCD   K+ +W+ L   +    G R CV G+FN+ +   ER+ V    RS +  
Sbjct: 244 LFNVYAPCDDNAKQVLWDSLSGKLQQLAGKRVCVCGDFNAARGAEERRSVRLGFRSIDHG 303

Query: 205 LFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRSVS 384
            F  FIE  GL+DLPL GR++TW+K DG++MSR+DRFL+SEDW  TW N  Q    R +S
Sbjct: 304 PFNQFIEANGLVDLPLSGRRYTWFKGDGRSMSRIDRFLLSEDWCLTWPNCIQVAQLRGLS 363

Query: 385 DHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFV 489
           DHC  IL   + NWGP+P RM++CW +T G++ FV
Sbjct: 364 DHCPFILSMDEENWGPRPVRMLKCWHDTPGFKQFV 398



 Score =  139 bits (349), Expect(2) = 2e-73
 Identities = 68/188 (36%), Positives = 103/188 (54%)
 Frame = +2

Query: 497  EWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVDGL 676
            +W  L+V GW G+               WH     NL  ++ + +  +  LD KGE + L
Sbjct: 401  KWRSLEVDGWGGFVLKEKLKLIKMALKDWHGTQARNLPGRLNDLRNRLSVLDSKGEEEVL 460

Query: 677  NEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEILG 856
             + E+ + R    +I+  S MN  I WQ+SR++WL+EGDANSK+FH  +  RR+ N    
Sbjct: 461  TDEELVELRTITYDIHALSRMNTSISWQQSRLQWLREGDANSKYFHSVLVSRRRQNAFSV 520

Query: 857  LDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSEEE 1036
            +  DG  +EGV  +R+ +  +F SHF + +  RP+   ++ P+L  AE   L  PFS EE
Sbjct: 521  IMVDGERVEGVQAVRQALFSHFSSHFRSCNMPRPTVEELHFPSLSFAEGAGLVKPFSVEE 580

Query: 1037 VKAMVWNC 1060
            VKA +W+C
Sbjct: 581  VKAAIWDC 588


>GAU42656.1 hypothetical protein TSUD_398610 [Trifolium subterraneum]
          Length = 1707

 Score =  162 bits (409), Expect(2) = 5e-73
 Identities = 71/158 (44%), Positives = 104/158 (65%)
 Frame = +1

Query: 25   IVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEIE 204
            + NVY+PCD G K+ +W+ L     S G  R CV G+FN++  + ER+ +    RS +  
Sbjct: 980  VANVYSPCDDGAKQGLWDSLLVRFQSLGRERVCVCGDFNAVTHVDERRSIGGALRSTDYI 1039

Query: 205  LFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRSVS 384
             F  FI+   L+DLPL GRKFTWY+ DG++MSRLDRF++SE+W  TW N  Q    R +S
Sbjct: 1040 PFNRFIDDNNLVDLPLRGRKFTWYRGDGQSMSRLDRFMLSEEWCLTWPNCEQVAKLRGLS 1099

Query: 385  DHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498
            DHC ++L   + +WGP+P RM++CW++  GY  FV+++
Sbjct: 1100 DHCPLVLSANEEDWGPRPLRMLKCWKDVPGYNLFVREK 1137



 Score =  142 bits (358), Expect(2) = 5e-73
 Identities = 72/190 (37%), Positives = 105/190 (55%)
 Frame = +2

Query: 491  KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670
            +++W   +V GW GY               WHK H  NL ++I   K  V  LD KGE +
Sbjct: 1135 REKWKSFQVDGWGGYAALKE----------WHKAHVQNLPSRIDSLKYRVSELDQKGEEE 1184

Query: 671  GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850
             L+  EV +     A+I+  S ++  I WQ+SR  WLK+GD NSK+FH  +  RR+ N I
Sbjct: 1185 VLSGDEVAELHGATADIHSLSRLHASISWQQSRSLWLKDGDVNSKYFHSILAGRRRRNAI 1244

Query: 851  LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030
              +   G  +EGV+ +R+ +  +F SHF N +  RP   N+    L  +ES++LT PF+E
Sbjct: 1245 STIQVGGVALEGVSSIRQAVFSHFASHFKNSNVGRPGVDNLQFKRLDHSESSSLTKPFTE 1304

Query: 1031 EEVKAMVWNC 1060
             EVK+ VW+C
Sbjct: 1305 NEVKSAVWDC 1314


>GAU21273.1 hypothetical protein TSUD_286830 [Trifolium subterraneum]
          Length = 1449

 Score =  163 bits (412), Expect(2) = 2e-71
 Identities = 76/156 (48%), Positives = 100/156 (64%)
 Frame = +1

Query: 31  NVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEIELF 210
           NVYAPCD   K+R+W+ L + + S G  R CV G+FN++KS+ E + +     S +   F
Sbjct: 435 NVYAPCDARAKQRLWDSLSSRIQSLGRQRVCVCGDFNAVKSLDETRSLRGAQNSSDFLAF 494

Query: 211 CDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRSVSDH 390
             FIE   L+DLPL GRK TWYK DG +MSRLDRFL+SEDW  TW    Q    R VSDH
Sbjct: 495 NLFIEDNTLVDLPLSGRKLTWYKGDGLSMSRLDRFLLSEDWCLTWPYCKQEARMRGVSDH 554

Query: 391 CAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498
           C +IL   + +WGP+P RM++CW+   GY  FV+ +
Sbjct: 555 CPLILSANEEDWGPRPSRMLKCWKLVPGYNLFVRDK 590



 Score =  136 bits (342), Expect(2) = 2e-71
 Identities = 68/190 (35%), Positives = 100/190 (52%)
 Frame = +2

Query: 491  KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670
            + +W+   V GW GY               WH  H  NL ++I   K     LD KGE D
Sbjct: 588  RDKWNSFLVNGWGGYVLKEKFKMIKVALKEWHMTHTKNLPSRIDSLKVRQSCLDQKGEED 647

Query: 671  GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850
             L+E E+ +     ++I+  S ++  + WQ+SR  WLKEGD NSKFFH  +  RR+ N I
Sbjct: 648  VLSEAELEELHGVTSDIHTLSRLHASVCWQQSRSLWLKEGDVNSKFFHSVLASRRRGNAI 707

Query: 851  LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030
              +  DG  +EGV+ +R+ +  +F +HF   +  RP   ++   TL   E + L  PF+ 
Sbjct: 708  SSIVVDGVPLEGVSSVRQAVVSHFAAHFKTSNVVRPRVDDLIFNTLNQVECSNLIKPFTR 767

Query: 1031 EEVKAMVWNC 1060
            +EVKA VW+C
Sbjct: 768  DEVKAAVWDC 777


>GAU51593.1 hypothetical protein TSUD_12600 [Trifolium subterraneum]
          Length = 851

 Score =  157 bits (396), Expect(2) = 2e-71
 Identities = 74/161 (45%), Positives = 102/161 (63%), Gaps = 3/161 (1%)
 Frame = +1

Query: 25  IVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEIE 204
           + NVYAPC+  G+  +W  L   ++      WCV+G+FN+++   ER G + N     + 
Sbjct: 34  LANVYAPCEASGRALLWRALEGKISHFANMAWCVVGDFNAVRGSEERSGRSNNPIQNSVV 93

Query: 205 LFCDF---IEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQR 375
            + DF   I+   LIDLPL GRKFTWY+ DG  MSRLDRFL+SE W+  + N  Q  + R
Sbjct: 94  EYSDFNSFIDNNFLIDLPLGGRKFTWYRGDGITMSRLDRFLLSESWISRFPNSIQEALPR 153

Query: 376 SVSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498
           ++SDHC V L   ++NWGPKP RM++CW +  GY DFVK+R
Sbjct: 154 TLSDHCPVQLSIDELNWGPKPQRMLKCWVDIQGYHDFVKER 194



 Score =  142 bits (358), Expect(2) = 2e-71
 Identities = 70/190 (36%), Positives = 101/190 (53%)
 Frame = +2

Query: 491  KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670
            K+ WS  +V GW G+               WH +H  NL+ KI++A   +   D+ GE  
Sbjct: 192  KERWSSFQVHGWSGHILKTKLKFIKAELRNWHFNHTANLDGKIRDATTRLEEFDIIGETR 251

Query: 671  GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850
             L+  E  +     ANI  FS++   +LWQKSR+ WL+EGDANSKFF   +  RR++N I
Sbjct: 252  RLDTNEELELHSAQANIVSFSNLQASMLWQKSRVNWLREGDANSKFFQGLMSSRRQSNTI 311

Query: 851  LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030
            + L   G  +EGV  +R E+  +F +HF  +   RP    +N  T+    S  L  PF  
Sbjct: 312  ISLQAGGRVVEGVEEVRWEVFQHFRNHFRKQTVTRPDMQGLNFKTISEDNSAELVKPFLL 371

Query: 1031 EEVKAMVWNC 1060
            +E+KA VW+C
Sbjct: 372  DEIKAAVWDC 381


>GAU43816.1 hypothetical protein TSUD_248050 [Trifolium subterraneum]
          Length = 1355

 Score =  151 bits (381), Expect(2) = 6e-71
 Identities = 72/165 (43%), Positives = 101/165 (61%)
 Frame = +1

Query: 4   ENKLSVSIVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACN 183
           +  ++  + NVYAP D  G+  +W  L + +       WCVL +FN ++   ER   A +
Sbjct: 354 KQNINFCLANVYAPYDYTGRPILWNNLESKILHFSQAAWCVLRDFNVVRYADERVSRASH 413

Query: 184 HRSEEIELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQW 363
              +E   F +FI+   LIDL L GRKFTWY+ DGK+MSRLDRFL S+ WL  + N  Q 
Sbjct: 414 SALDEFVAFNNFIDSTLLIDLTLCGRKFTWYRGDGKSMSRLDRFLFSDVWLAEFPNCIQA 473

Query: 364 GMQRSVSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498
            + RS+SDHC + L     NWGPKP RM++CW +  GYE+FV+++
Sbjct: 474 ALPRSLSDHCPIQLSIDVQNWGPKPLRMLKCWADIAGYEEFVEEK 518



 Score =  146 bits (368), Expect(2) = 6e-71
 Identities = 72/190 (37%), Positives = 104/190 (54%)
 Frame = +2

Query: 491  KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670
            +++W   +V GW G+               WH +H  NL+ KI+ AK  +  LDL GE  
Sbjct: 516  EEKWHSFQVHGWSGHILKSKLKFIKSELKSWHLNHTANLDNKIQVAKTRLEELDLSGEER 575

Query: 671  GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850
             L EVE  +     ANI+ FS     + WQKSR+ WLKEGDANSKFFH  +  RR++N I
Sbjct: 576  WLFEVEEVEMCSLQANISAFSKSQASMHWQKSRVSWLKEGDANSKFFHGIMSSRRQSNSI 635

Query: 851  LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030
            + L  +G  +EGVN +R+ I  +F  HF  ++  RP    +   ++  A+   L+ PF  
Sbjct: 636  VSLSSNGRTVEGVNEIRQVIFQHFSQHFRRKNHNRPDISGLVFNSISEADGEFLSRPFLL 695

Query: 1031 EEVKAMVWNC 1060
            +E+K  VW+C
Sbjct: 696  DEIKKAVWDC 705


>GAU33337.1 hypothetical protein TSUD_166020 [Trifolium subterraneum]
          Length = 1227

 Score =  157 bits (398), Expect(2) = 8e-71
 Identities = 73/161 (45%), Positives = 103/161 (63%), Gaps = 3/161 (1%)
 Frame = +1

Query: 25  IVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEIE 204
           + NVYAPC+  G+  +W+ L   ++      WCV+G+FN+++   E+ G + N     + 
Sbjct: 420 LANVYAPCEASGRALIWQALEGKISHFVNMAWCVVGDFNAVRGSEEQSGRSSNPIQNSVV 479

Query: 205 LFCDF---IEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQR 375
            + DF   I+   LIDLPL GRKFTWY+ DG  MSRLDRFL+SE W+  + N  Q  + R
Sbjct: 480 EYSDFNSFIDNNFLIDLPLGGRKFTWYRGDGITMSRLDRFLLSESWISRFPNCIQEALPR 539

Query: 376 SVSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498
           ++SDHC V L   ++NWGPKP RM++CW +  GY DFVK+R
Sbjct: 540 TLSDHCPVQLSIDELNWGPKPHRMLKCWVDIQGYHDFVKER 580



 Score =  139 bits (350), Expect(2) = 8e-71
 Identities = 70/190 (36%), Positives = 102/190 (53%)
 Frame = +2

Query: 491  KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670
            K+ WS  +V GW G+               WH +H  NL+ KI++AK  +   D+ GE  
Sbjct: 578  KERWSSFQVHGWSGHILKTKLKFIKAELRNWHLNHTANLDGKIRDAKNRLEEFDVIGETR 637

Query: 671  GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850
             L+  E  +     ANI  FS +   +LWQKSR+ WLKEG+ANSKFF   +  RR++N I
Sbjct: 638  RLDTNEELELHSVQANIVSFSKLQASMLWQKSRVNWLKEGNANSKFFQGLMSSRRQSNTI 697

Query: 851  LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030
            + L   G  +EGV  +R E+  +F +HF  +  +RP    +N  ++    S  L  PF  
Sbjct: 698  ISLQAVGRVVEGVKEVRWEVFQHFCNHFRKQTVSRPYMQGLNFKSISKDNSAELVKPFLL 757

Query: 1031 EEVKAMVWNC 1060
            +E+KA VW+C
Sbjct: 758  DEIKAAVWDC 767


>GAU40490.1 hypothetical protein TSUD_189710 [Trifolium subterraneum]
          Length = 1087

 Score =  155 bits (393), Expect(2) = 1e-70
 Identities = 72/159 (45%), Positives = 100/159 (62%)
 Frame = +1

Query: 22  SIVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEI 201
           S+ NVYAPCD G K+R+W+     +    G R CV G FN++++I ER+       S + 
Sbjct: 105 SVTNVYAPCDDGEKQRLWDLFSARIQLLVGRRVCVCGAFNAVRTIDERRFARGGSNSLDH 164

Query: 202 ELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRSV 381
             F  FI+   LIDLPL GRKFTW+K DG +MSR+DRFL+SE+W   W N  Q    R +
Sbjct: 165 IPFNRFIDDNNLIDLPLSGRKFTWFKGDGFSMSRIDRFLLSEEWCLAWPNCRQVARLRGL 224

Query: 382 SDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498
           SDHC ++L   + +WGP+P RM++CW +  GY  FV+ +
Sbjct: 225 SDHCPIVLSANEEDWGPRPSRMLKCWRDVPGYNVFVRDK 263



 Score =  140 bits (353), Expect(2) = 1e-70
 Identities = 71/190 (37%), Positives = 104/190 (54%)
 Frame = +2

Query: 491  KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670
            + +W+ L+V  W GY               WH  H  NL ++I+  K  +  LD KGE  
Sbjct: 261  RDKWNSLQVDSWGGYVLKEKLKMIKAALKEWHSVHVQNLPSRIESLKARLTDLDQKGEDG 320

Query: 671  GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850
             L+E E+ +     ++I+  S MN  I WQ+SR  WLKEGDANSK+FH  +  RR+ N I
Sbjct: 321  VLSEDEIVELHEVSSDIHSLSRMNASICWQQSRSLWLKEGDANSKYFHSVLAGRRRRNAI 380

Query: 851  LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030
              +  +   +EGV+P+R+ +  +F SHF   +  RP    +    L   E ++LT PFSE
Sbjct: 381  SVIQVEEVTLEGVDPIRQAVFSHFTSHFKATNVERPGVVTLQFKRLNQLERSSLTKPFSE 440

Query: 1031 EEVKAMVWNC 1060
             EVK++VW+C
Sbjct: 441  AEVKSVVWDC 450


>KYP46096.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan]
          Length = 729

 Score =  174 bits (440), Expect(2) = 2e-69
 Identities = 76/166 (45%), Positives = 113/166 (68%)
 Frame = +1

Query: 1   GENKLSVSIVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVAC 180
           G +K+   IVN+Y+PCDL GK+ +WEE+  +  S G GRWC+ G+FN+++  SERKGV  
Sbjct: 18  GYDKIPCFIVNIYSPCDLRGKKNLWEEIHKIKNSYGSGRWCICGDFNTVRLKSERKGVHT 77

Query: 181 NHRSEEIELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQ 360
               +E+  +  FIE   LIDLPL G K+TW++P+    SR+DRFL+S++WL  W + SQ
Sbjct: 78  RREEKEMLCYNQFIEDVELIDLPLGGGKYTWFRPNRIIASRIDRFLVSQEWLTQWPHCSQ 137

Query: 361 WGMQRSVSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498
             +QR VSDH  ++LK+  ++WGPKPFR + CW +   +  FV+++
Sbjct: 138 KALQRDVSDHRPILLKDIRLDWGPKPFRSLNCWFDDPSFLGFVEEK 183



 Score =  118 bits (295), Expect(2) = 2e-69
 Identities = 65/192 (33%), Positives = 94/192 (48%), Gaps = 2/192 (1%)
 Frame = +2

Query: 491  KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670
            +++W G  V GW  +               W+K  FGN+ T+I+E K  +  LD   E  
Sbjct: 181  EEKWKGFSVTGWGAFILKEKLKHLKKSIKEWNKQAFGNIHTQIEEVKRNINSLDSIVETR 240

Query: 671  GLNEVEVGDRRLCIANINRFSSMNCR--ILWQKSRMKWLKEGDANSKFFHRCVQRRRKAN 844
             LNE +V DRR    N+  +  +N +  +L QKSR+KW +EGD+NS FFH CV +RRK N
Sbjct: 241  SLNERKVSDRRNL--NVKLWDLLNKKESLLLQKSRLKWAREGDSNSSFFHMCVNKRRKMN 298

Query: 845  EILGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPF 1024
            EI+GLD +G ++                              I    L T +  +LT PF
Sbjct: 299  EIIGLDVNGKWL---------------------------LDGIQFQQLNTHQCRSLTRPF 331

Query: 1025 SEEEVKAMVWNC 1060
            + EE++  VW+C
Sbjct: 332  TAEEIREAVWSC 343


>GAU46774.1 hypothetical protein TSUD_402850 [Trifolium subterraneum]
          Length = 908

 Score =  163 bits (413), Expect(2) = 4e-69
 Identities = 76/158 (48%), Positives = 99/158 (62%)
 Frame = +1

Query: 25  IVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEIE 204
           + NVYAPCDLG K+ +W  L + +   G  R CV G+FN+++ I ER+      RS +I 
Sbjct: 454 LANVYAPCDLGAKQVLWASLSDQIQLLGRRRMCVSGDFNAVRCIEERRSPRTVTRSTDIL 513

Query: 205 LFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRSVS 384
            F  FI+    IDLPL GRKFTWYK DG  MSRLDRFL+SEDW   W N       R +S
Sbjct: 514 PFNQFIDEMFFIDLPLSGRKFTWYKGDGHTMSRLDRFLLSEDWCLAWPNCVHVAQLRGLS 573

Query: 385 DHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498
           DHC +IL   + NWGP+  RM++CW +  GY +FV+ +
Sbjct: 574 DHCPLILSADEENWGPRSSRMLKCWTDVPGYVNFVRDK 611



 Score =  127 bits (320), Expect(2) = 4e-69
 Identities = 65/190 (34%), Positives = 98/190 (51%)
 Frame = +2

Query: 491  KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670
            + +W+  +V GW G+               WH+ H  N+  +I   K  +  LD KGE  
Sbjct: 609  RDKWNSFQVNGWGGFVLKEKLKMIKLALKEWHEAHVRNIPRRIDSLKVRLSDLDSKGEEA 668

Query: 671  GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850
             L++ EV +      +I+  S +N  I  Q+SR +WL+EGDAN+K+FH  +  RR+ N I
Sbjct: 669  SLSDEEVQELHGITLDIHSLSRLNASICRQQSRSRWLREGDANTKYFHSVLTNRRRGNTI 728

Query: 851  LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030
              L  +G    GV+P+R+ +  +F  HF   +  RP   N++   L   E  +L   FS 
Sbjct: 729  SSLQVNGVTTRGVHPIRQAVFTHFADHFKVNNVDRPRVENLHFRRLNPLECGSLIKAFSL 788

Query: 1031 EEVKAMVWNC 1060
            EEVKA VW+C
Sbjct: 789  EEVKAAVWDC 798


>GAU32903.1 hypothetical protein TSUD_152630 [Trifolium subterraneum]
          Length = 1715

 Score =  159 bits (401), Expect(2) = 3e-68
 Identities = 77/165 (46%), Positives = 107/165 (64%)
 Frame = +1

Query: 4    ENKLSVSIVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACN 183
            ++ L+  + NVYAPCD  G+  +W EL   +       WCVLG+FN+I+S  ER     +
Sbjct: 762  KDDLAFCLANVYAPCDARGRSLLWRELDVKLLQIPLSVWCVLGDFNAIRSRDERVSRGSS 821

Query: 184  HRSEEIELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQW 363
               E+   F +FI+   LIDLPL GR FTWY  DG +MSRLDRFLIS+ W+F++ +  Q 
Sbjct: 822  G-VEDYMAFKNFIDRNALIDLPLGGRSFTWYSGDGLSMSRLDRFLISDSWVFSFPHCVQM 880

Query: 364  GMQRSVSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498
             + RS+SDHC ++L     +WGPKPFRMM+CW +  GY +FVK++
Sbjct: 881  ALPRSLSDHCPIMLSVDVQDWGPKPFRMMKCWADISGYAEFVKQK 925



 Score =  129 bits (324), Expect(2) = 3e-68
 Identities = 65/190 (34%), Positives = 96/190 (50%)
 Frame = +2

Query: 491  KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670
            K++W   ++ GW G+               WH+ H  NL+ KI+  K  +  LD+  E  
Sbjct: 923  KQKWQSFQIHGWSGHILKTKLKLLKAELRSWHQIHTANLDGKIQRDKSRLEELDICKEGR 982

Query: 671  GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850
            GL+ VE  +      +I   S +   + WQKSR+ WL+EGDANSKFFH  +  RR+AN I
Sbjct: 983  GLDVVEEAELLSLPVDILALSKLQASMYWQKSRVTWLREGDANSKFFHGVMSSRRRANSI 1042

Query: 851  LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030
              L   G  +E V  +R  +  ++ +HF  +   RP    +   +L +     LT PF  
Sbjct: 1043 GALIHAGRTVESVPEVRHIVYQHYSNHFRKQMHYRPDISRLEFRSLSSLHGAELTKPFLM 1102

Query: 1031 EEVKAMVWNC 1060
            EE+KA VW+C
Sbjct: 1103 EEIKAAVWDC 1112


>GAU21183.1 hypothetical protein TSUD_11000 [Trifolium subterraneum]
          Length = 482

 Score =  160 bits (404), Expect(2) = 8e-68
 Identities = 79/157 (50%), Positives = 100/157 (63%), Gaps = 2/157 (1%)
 Frame = +1

Query: 25  IVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVA--CNHRSEE 198
           + NVYAPCD  GK+ +W+ LG  + +     WCV G+FNSI+S  ERKG     N  S  
Sbjct: 171 VANVYAPCDGNGKQLLWDRLGARLLNSDVS-WCVCGDFNSIRSDEERKGRGGVVNFAS-- 227

Query: 199 IELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRS 378
              F  FIE   L DLPL GR+FTWY+ DG +MSRLDRFL+SE W   W N  Q  + R 
Sbjct: 228 ---FNSFIEDAALSDLPLCGRQFTWYRGDGVSMSRLDRFLLSEVWCQNWPNCFQLALPRG 284

Query: 379 VSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFV 489
           +SDHC ++L   + NWGPKPFRM+R W +  GY++FV
Sbjct: 285 LSDHCPIVLSVDEENWGPKPFRMLRSWSDMPGYKEFV 321



 Score =  127 bits (318), Expect(2) = 8e-68
 Identities = 62/147 (42%), Positives = 84/147 (57%)
 Frame = +2

Query: 494 KEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVDG 673
           ++W    V GW G+               WH  H  N+E +IKE KE ++ LD+KGE  G
Sbjct: 323 EKWRSFNVSGWGGFVLKEKLKLLKGSLKEWHLKHGRNIEGRIKETKERMHDLDVKGEGVG 382

Query: 674 LNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEIL 853
           LN+ E  + R+    +   SS+NCR  WQKSR+ WLK+GDANSKFFH  +  RR+ N I 
Sbjct: 383 LNDEEREELRVLSTQVLSLSSLNCRNQWQKSRLVWLKDGDANSKFFHDVMSSRRRGNAIH 442

Query: 854 GLDFDGSFIEGVNPLRREIRGYFESHF 934
            L  +G  +EGV+ +R  I  +FE HF
Sbjct: 443 NLVVEGHQVEGVSGMRNAIFNHFEKHF 469


>GAU48515.1 hypothetical protein TSUD_244350 [Trifolium subterraneum]
          Length = 1633

 Score =  162 bits (410), Expect(2) = 1e-67
 Identities = 75/158 (47%), Positives = 102/158 (64%)
 Frame = +1

Query: 25   IVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEIE 204
            + NVYAPCD G K  +W  L   + S G  R CV G+FN++K + ER+      RS +  
Sbjct: 802  VANVYAPCDDGAKLVLWGSLSARIQSLGRQRLCVYGDFNAVKLVDERRSSRGESRSLDHI 861

Query: 205  LFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRSVS 384
             F  FI+   LIDLPL GRKFTW+K DG +MSRLDRFL+SE+W  TW N +Q    R +S
Sbjct: 862  PFNSFIDDNNLIDLPLSGRKFTWFKGDGLSMSRLDRFLLSEEWCLTWPNCTQTASLRGLS 921

Query: 385  DHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498
            DHC+++L   + +WG +P RM++CW +  GY+ FVK +
Sbjct: 922  DHCSLVLSANEDDWGSRPSRMLKCWRDVPGYKGFVKDK 959



 Score =  124 bits (311), Expect(2) = 1e-67
 Identities = 62/162 (38%), Positives = 90/162 (55%)
 Frame = +2

Query: 491  KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670
            K +W+  +V GW G+               WH  H  NL ++I+  K  +  LDLKGE +
Sbjct: 957  KDKWNSFQVDGWGGFVLKEKLRMIKTALKDWHTAHAQNLPSRIESLKARLSTLDLKGEEE 1016

Query: 671  GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850
             L+E E+ +     A+I+  S M+  I WQ+SR  WLKEGDANSK+FH  +  RR+ N I
Sbjct: 1017 ALSEDEINELHGISADIHSLSRMHASISWQQSRSLWLKEGDANSKYFHSVLAGRRRGNTI 1076

Query: 851  LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNIN 976
              +  DG  +EGV P+R+ +  +F SHF   +  RP  GN++
Sbjct: 1077 SVIHADGVTLEGVLPIRQAVFSHFASHFKAINMERPRVGNLH 1118


>GAU28506.1 hypothetical protein TSUD_156630 [Trifolium subterraneum]
          Length = 1091

 Score =  158 bits (399), Expect(2) = 2e-67
 Identities = 72/159 (45%), Positives = 103/159 (64%)
 Frame = +1

Query: 22  SIVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACNHRSEEI 201
           S+ NVYAPCD G K+++W+ L   + + G  R CV G+FN+++S+ ER+ V+   +S + 
Sbjct: 105 SVANVYAPCDPGAKQQLWDSLSERIQALGRSRVCVCGDFNAVRSLEERRSVSGRSQSLDH 164

Query: 202 ELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWGMQRSV 381
             F  FI+   LIDLPL GRKFTW+K D  +MSRLDRFL+S +W  TW N +Q    R +
Sbjct: 165 ISFNRFIDDNNLIDLPLCGRKFTWFKGDDLSMSRLDRFLLSGEWCLTWPNCTQVARMRGL 224

Query: 382 SDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498
           S H  +IL      WGP+P RM++CW++  GY  F+K +
Sbjct: 225 SHHYPLILAVNVEEWGPRPSRMLKCWKDVPGYNTFIKDK 263



 Score =  127 bits (319), Expect(2) = 2e-67
 Identities = 65/190 (34%), Positives = 100/190 (52%)
 Frame = +2

Query: 491  KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670
            K +W+  +V GW G+               WHK H  NL + I+  ++ +  LD K    
Sbjct: 261  KDKWNSFQVVGWGGFVLKEKFKMIKMALKDWHKTHTQNLPSGIESLQDRLAALDEKEGDV 320

Query: 671  GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850
             L++VE+ +      +I+  S +N  I WQ+SR +WL EGDANSK+FH  +  RR+ N I
Sbjct: 321  VLSDVEIAELHGVTLDIHSLSRLNASICWQQSRSRWLSEGDANSKYFHSVLANRRRGNAI 380

Query: 851  LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030
              L    + +EGV+P+++ +  +  SHF   +  RP   ++N   L   E ++L   FS 
Sbjct: 381  SSLQVGNATVEGVDPIKQAVVCHLASHFKVVNVERPGVDSLNFKRLHPPEVSSLIKSFSL 440

Query: 1031 EEVKAMVWNC 1060
             EVKA VW+C
Sbjct: 441  AEVKAAVWDC 450


>GAU24087.1 hypothetical protein TSUD_388800 [Trifolium subterraneum]
          Length = 1985

 Score =  158 bits (400), Expect(2) = 8e-67
 Identities = 76/162 (46%), Positives = 104/162 (64%), Gaps = 6/162 (3%)
 Frame = +1

Query: 25   IVNVYAPCDLGGKRRVWEELGNLMASKGG---GRWCVLGNFNSIKSISERKGVACN---H 186
            IVNVYA C+L  KR +W    N++ SK G   G WCVLG+FNS++  +ER+GV  N    
Sbjct: 857  IVNVYAKCNLRNKRTLW---ANILMSKSGFGEGLWCVLGDFNSVRDSNERRGVVGNVDGQ 913

Query: 187  RSEEIELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQWG 366
            RS E+  F  F+    L+D+PLIGR+FTW+ P+G +MSRLDR LIS DW   WG  + W 
Sbjct: 914  RSSEMVAFDLFLNNLDLVDMPLIGRRFTWFHPNGVSMSRLDRILISSDWADVWGTPNVWA 973

Query: 367  MQRSVSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVK 492
            M R V+DHC ++L+    +WGP+PFR    W E   +++ +K
Sbjct: 974  MDRDVADHCPLVLRYSLADWGPRPFRFSNFWLEHREFKEVIK 1015



 Score =  125 bits (313), Expect(2) = 8e-67
 Identities = 60/190 (31%), Positives = 97/190 (51%)
 Frame = +2

Query: 491  KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670
            K  W      GW G+               W +  +G  E K K   +++  LDLK E  
Sbjct: 1015 KTAWDAHVAEGWMGFILKERLKVLKGVVKEWSRRTYGEAEAKKKRLIKDILALDLKSETT 1074

Query: 671  GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850
            GL + EV +R++   ++         +++Q+SR KWLKEGD NS++FH C++ R++ N +
Sbjct: 1075 GLLQGEVVERKILFDDLWITLKSMDAMIFQRSRSKWLKEGDTNSQYFHNCIKARKRRNNM 1134

Query: 851  LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030
            + L     ++EG + +R E+  +F +HF+N +W RP+   I  P L  A    LT  F+ 
Sbjct: 1135 VALRTRNGWVEGPSLVREEVVSFFRNHFSNEEWHRPTLNGIEFPRLSLARVEELTAMFTL 1194

Query: 1031 EEVKAMVWNC 1060
            EE+  +V  C
Sbjct: 1195 EEISEVVRGC 1204


>GAU17884.1 hypothetical protein TSUD_330100 [Trifolium subterraneum]
          Length = 558

 Score =  145 bits (367), Expect(2) = 2e-66
 Identities = 72/190 (37%), Positives = 102/190 (53%)
 Frame = +2

Query: 491  KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670
            K  W+  +V GW GY               WH  H  NL ++I   ++ +  LD KG  +
Sbjct: 154  KDRWNSYQVDGWGGYVLKEKFKMIKMALKDWHMTHTQNLPSRIMSLQDRIAELDEKGVEE 213

Query: 671  GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850
             L+  E+ D     ++++  S +N  I WQ+SR +WLKEGDANSK+FH  +  RR+ N I
Sbjct: 214  DLSGAEIDDLHGATSDLHTLSRLNASICWQQSRSRWLKEGDANSKYFHSVLANRRRGNAI 273

Query: 851  LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030
              L+  G  +EGV P+R+ I  +F SHF   D  RP   +++   L   E+  L  PFS 
Sbjct: 274  SSLEVGGVTVEGVAPIRQAIVCHFASHFKAVDVVRPGVNSLSFKRLHPTEAGNLIKPFSL 333

Query: 1031 EEVKAMVWNC 1060
            EEVKA VW+C
Sbjct: 334  EEVKAAVWDC 343



 Score =  136 bits (342), Expect(2) = 2e-66
 Identities = 65/131 (49%), Positives = 84/131 (64%)
 Frame = +1

Query: 106 GGGRWCVLGNFNSIKSISERKGVACNHRSEEIELFCDFIEVCGLIDLPLIGRKFTWYKPD 285
           G  R CV G+FN+++ I ER+      +S +   F  FIE   LIDLPL GRKFTWYK D
Sbjct: 26  GQSRVCVCGDFNAVRRIEERRSGRGRPQSLDHHSFNRFIEDNTLIDLPLSGRKFTWYKGD 85

Query: 286 GKAMSRLDRFLISEDWLFTWGNLSQWGMQRSVSDHCAVILKEKDINWGPKPFRMMRCWEE 465
           G +MSRLDRFLIS +W   W + +Q    R +SDHC +IL     +WGP+P RM++CW++
Sbjct: 86  GLSMSRLDRFLISPEWCLAWPDCTQTARMRGLSDHCPLILASNVEDWGPRPSRMLKCWKD 145

Query: 466 TVGYEDFVKKR 498
             GY  FVK R
Sbjct: 146 VPGYNIFVKDR 156


>KYP44439.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan]
          Length = 1142

 Score =  194 bits (494), Expect(2) = 3e-66
 Identities = 90/184 (48%), Positives = 118/184 (64%)
 Frame = +1

Query: 1   GENKLSVSIVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVAC 180
           GE  +   +VNVYA C    K+ +W +L +L  S+G G+WC +G+FNSIK   ERKG + 
Sbjct: 60  GEENVDCWVVNVYASCSHELKKHLWGKLQSLKQSRGDGKWCFIGDFNSIKHADERKGTSV 119

Query: 181 NHRSEEIELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQ 360
             R EEIE F DFI+   LID+PL+GRKFTWY+PDG   SRLDR L++  WL  W N   
Sbjct: 120 ILRREEIECFVDFIDNLSLIDMPLLGRKFTWYRPDGSCKSRLDRCLVTTGWLDQWSNACL 179

Query: 361 WGMQRSVSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKRMVWAKSEGVERVHV 540
           W + R VSD+CA++LK +D+NWGPKPFR +  W    GY  FV+K     K E ++ +  
Sbjct: 180 WALNRGVSDYCAIVLKSEDVNWGPKPFRFLNSWRHEPGYAYFVRKEWFVLK-EKLKTIRS 238

Query: 541 KRKI 552
           K KI
Sbjct: 239 KLKI 242



 Score = 87.0 bits (214), Expect(2) = 3e-66
 Identities = 49/160 (30%), Positives = 74/160 (46%)
 Frame = +2

Query: 581  WHKDHFGNLETKIKEAKEEVYRLDLKGEVDGLNEVEVGDRRLCIANINRFSSMNCRILWQ 760
            W+K+ FG+L  KI + K+ + + D+K E  GL+  EV  R+  +A           +L+Q
Sbjct: 243  WNKEVFGDLNLKISKVKDGIKQCDIKDEESGLSPSEVVQRKEYMAQWQMLMQKKDTLLFQ 302

Query: 761  KSRMKWLKEGDANSKFFHRCVQRRRKANEILGLDFDGSFIEGVNPLRREIRGYFESHFNN 940
            KSR+KWL+EGDAN+K+FH C+ +R K N                                
Sbjct: 303  KSRLKWLQEGDANTKYFHGCINKRLKLNH------------------------------- 331

Query: 941  RDWARPSFGNINIPTLGTAESNALTNPFSEEEVKAMVWNC 1060
                RP    +    L   + + L  PF+ +EVK  VW+C
Sbjct: 332  ----RPVLNGLVFKRLNLDQVDVLIKPFTLQEVKEAVWDC 367


>GAU20577.1 hypothetical protein TSUD_33240 [Trifolium subterraneum]
          Length = 1732

 Score =  153 bits (386), Expect(2) = 5e-66
 Identities = 74/165 (44%), Positives = 104/165 (63%)
 Frame = +1

Query: 4    ENKLSVSIVNVYAPCDLGGKRRVWEELGNLMASKGGGRWCVLGNFNSIKSISERKGVACN 183
            ++ L++ + NVYAPCD  G+  +W EL   +       WCVLG+FN+I+S  ER      
Sbjct: 778  KDDLALCLANVYAPCDARGRSLLWRELDAKLLQIPLSVWCVLGDFNAIRSRDERVSRG-G 836

Query: 184  HRSEEIELFCDFIEVCGLIDLPLIGRKFTWYKPDGKAMSRLDRFLISEDWLFTWGNLSQW 363
               E+   F +FI+   LIDLPL GR FTWY  DG +MS LDRFLIS+ W+ ++ N  Q 
Sbjct: 837  SGVEDYMAFNNFIDRNALIDLPLGGRSFTWYSGDGLSMSHLDRFLISDSWVSSFPNCVQM 896

Query: 364  GMQRSVSDHCAVILKEKDINWGPKPFRMMRCWEETVGYEDFVKKR 498
             + RS+SDHC ++L     +WGPKPFRM++CW +  GY +F K++
Sbjct: 897  ALPRSLSDHCPIMLSVGVQDWGPKPFRMLKCWADISGYAEFFKQK 941



 Score =  127 bits (320), Expect(2) = 5e-66
 Identities = 64/190 (33%), Positives = 95/190 (50%)
 Frame = +2

Query: 491  KKEWSGLKVRGWKGYXXXXXXXXXXXXXXXWHKDHFGNLETKIKEAKEEVYRLDLKGEVD 670
            K++W   ++ GW G+               WH+ H  NL+ KI+ AK  +  LD+  E  
Sbjct: 939  KQKWQSFQIHGWSGHILKTKLKLLKAELRSWHQIHTANLDGKIQRAKSRLEELDICKEGR 998

Query: 671  GLNEVEVGDRRLCIANINRFSSMNCRILWQKSRMKWLKEGDANSKFFHRCVQRRRKANEI 850
            GL+  E  +      +I   S +   + WQKSR+ WL++GDANSKFFH  +  RR+AN I
Sbjct: 999  GLDVAEEAELMSLPVDILALSKLQASMYWQKSRVTWLRDGDANSKFFHGVMSSRRRANSI 1058

Query: 851  LGLDFDGSFIEGVNPLRREIRGYFESHFNNRDWARPSFGNINIPTLGTAESNALTNPFSE 1030
              L  +G  +E V  +R     ++ +HF  +   RP    +   +L       LT PF  
Sbjct: 1059 GALVHEGRTVESVPEVRHIAYQHYSNHFRKQLHYRPDISRLEFRSLSLLHGAELTKPFLL 1118

Query: 1031 EEVKAMVWNC 1060
            EE+KA VW+C
Sbjct: 1119 EEIKAAVWDC 1128


Top