BLASTX nr result

ID: Astragalus24_contig00015605

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00015605
         (1051 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU46782.1| hypothetical protein TSUD_351810 [Trifolium subt...   215   1e-82
gb|KHN31995.1| Copia protein, partial [Glycine soja]                  222   1e-78
dbj|GAU31202.1| hypothetical protein TSUD_210590 [Trifolium subt...   193   1e-78
dbj|GAU41219.1| hypothetical protein TSUD_128950 [Trifolium subt...   199   1e-78
gb|PNX58721.1| putative copia-type protein, partial [Trifolium p...   212   5e-78
gb|PNX92373.1| retrovirus-related Pol polyprotein from transposo...   208   1e-76
gb|PNX87108.1| retrovirus-related Pol polyprotein from transposo...   207   3e-76
gb|PNX86354.1| retrovirus-related Pol polyprotein from transposo...   207   3e-76
dbj|GAU32754.1| hypothetical protein TSUD_323220 [Trifolium subt...   193   6e-76
gb|PNX92076.1| retrovirus-related Pol polyprotein from transposo...   201   8e-76
dbj|GAU51097.1| hypothetical protein TSUD_185270 [Trifolium subt...   189   1e-75
gb|PNX94461.1| retrovirus-related Pol polyprotein from transposo...   203   1e-75
gb|PNX93391.1| retrovirus-related Pol polyprotein from transposo...   197   2e-75
dbj|GAU36120.1| hypothetical protein TSUD_374830 [Trifolium subt...   192   4e-74
dbj|GAU41486.1| hypothetical protein TSUD_239620 [Trifolium subt...   186   4e-74
gb|AJY78065.1| putative polyprotein [Glycine max]                     197   6e-74
gb|KHN24193.1| Retrovirus-related Pol polyprotein from transposo...   197   1e-73
dbj|GAU39523.1| hypothetical protein TSUD_222930 [Trifolium subt...   184   5e-73
dbj|GAU51049.1| hypothetical protein TSUD_371270 [Trifolium subt...   182   1e-72
dbj|GAU46985.1| hypothetical protein TSUD_403190 [Trifolium subt...   182   2e-72

>dbj|GAU46782.1| hypothetical protein TSUD_351810 [Trifolium subterraneum]
          Length = 1512

 Score =  215 bits (547), Expect(2) = 1e-82
 Identities = 101/148 (68%), Positives = 124/148 (83%)
 Frame = +3

Query: 276  FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
            FF+GSSLISWR KKQ TVSRSSSEAEYR++S A  ELQW+++LL+DL+I CERPP LYCD
Sbjct: 1361 FFIGSSLISWRAKKQNTVSRSSSEAEYRSLSFASCELQWIVYLLKDLSIDCERPPVLYCD 1420

Query: 456  HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
            +QSA+HIA+NPVFHERT+HLEIDCH VR+K+Q G+ KLL + +K QLADFFTKAL P  F
Sbjct: 1421 NQSAIHIASNPVFHERTKHLEIDCHLVRDKVQSGVFKLLPISTKAQLADFFTKALPPKVF 1480

Query: 636  IPFISKLRMINIYHGQACGGLLKYKNDD 719
              F+SKL M+NI+H  ACG LL  +++D
Sbjct: 1481 NSFLSKLNMLNIFHVPACGRLLNEEDND 1508



 Score =  121 bits (304), Expect(2) = 1e-82
 Identities = 58/100 (58%), Positives = 70/100 (70%), Gaps = 8/100 (8%)
 Frame = +2

Query: 5    DGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKG 184
            D   AY D   YRRLIG+LLY+T TRPDI+FA+QQ+ Q ++SPT TH    CRV+RYLKG
Sbjct: 1263 DSSPAYDDVGGYRRLIGKLLYLTTTRPDISFAIQQLSQFLSSPTTTHFDTACRVVRYLKG 1322

Query: 185  NPGMGLMFPRNFILQLSGF--------SDTRKSVTGYCFF 280
            +PG GL FPR   LQL GF        +DTR+S +GYCFF
Sbjct: 1323 SPGRGLFFPRQSPLQLLGFADADWANCADTRRSTSGYCFF 1362


>gb|KHN31995.1| Copia protein, partial [Glycine soja]
          Length = 224

 Score =  222 bits (566), Expect(2) = 1e-78
 Identities = 105/139 (75%), Positives = 121/139 (87%)
 Frame = +3

Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
           FFLG+SLISWR KKQQTVSRSSSEAEYRA+ST   ELQWLL+LL DL+I+C R P LYCD
Sbjct: 86  FFLGASLISWRAKKQQTVSRSSSEAEYRALSTTACELQWLLYLLHDLHITCTRAPALYCD 145

Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
           +QSALHIAANP+FHERT+HLEIDCHFVR K+QEG+++LL + SK QLADFFTK L P +F
Sbjct: 146 NQSALHIAANPMFHERTKHLEIDCHFVRNKIQEGVLRLLPISSKEQLADFFTKVLPPPSF 205

Query: 636 IPFISKLRMINIYHGQACG 692
           +PFISKL MI+IYH  ACG
Sbjct: 206 VPFISKLGMIDIYHAPACG 224



 Score =  101 bits (251), Expect(2) = 1e-78
 Identities = 48/87 (55%), Positives = 64/87 (73%), Gaps = 8/87 (9%)
 Frame = +2

Query: 44  RLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKGNPGMGLMFPRNFI 223
           RLIG+LLY+  TRP+ITFA QQ+ Q ++ PT+TH++A  RV+ YLKG+PG GL FPR   
Sbjct: 1   RLIGKLLYLNNTRPNITFATQQLSQFLSKPTMTHYNAAYRVVIYLKGSPGQGLFFPRKSE 60

Query: 224 LQLSGFS--------DTRKSVTGYCFF 280
           +QL GFS        D+R+S++GYCFF
Sbjct: 61  IQLLGFSNAYWAGCLDSRRSISGYCFF 87


>dbj|GAU31202.1| hypothetical protein TSUD_210590 [Trifolium subterraneum]
          Length = 1059

 Score =  193 bits (490), Expect(2) = 1e-78
 Identities = 92/140 (65%), Positives = 113/140 (80%)
 Frame = +3

Query: 276  FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
            FFLGSSL+SW+ KKQ TVSRSSSEAEYRA+STA  EL WL FL++DLNI C +PP +YCD
Sbjct: 919  FFLGSSLVSWKAKKQLTVSRSSSEAEYRALSTATCELIWLTFLMKDLNIHCSKPPVIYCD 978

Query: 456  HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
             QSA+HIA+NPVFHERT+HLEI+CHFVREKLQ+GL++LL + ++ QLAD  TK L    F
Sbjct: 979  SQSAMHIASNPVFHERTKHLEIECHFVREKLQQGLLRLLPISTEDQLADCLTKPLAAPKF 1038

Query: 636  IPFISKLRMINIYHGQACGG 695
              FISKL +++IY  +  GG
Sbjct: 1039 NSFISKLGLLDIYEPKLEGG 1058



 Score =  130 bits (326), Expect(2) = 1e-78
 Identities = 61/101 (60%), Positives = 77/101 (76%), Gaps = 8/101 (7%)
 Frame = +2

Query: 2    DDGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLK 181
            +D G  Y+D S+YRRLIGRLLY+T TRPDI+FA+QQ+ Q ++ PT+ H++A CRV+RYLK
Sbjct: 820  NDAGKLYEDISAYRRLIGRLLYLTNTRPDISFAIQQLSQFLSKPTMVHYNAACRVVRYLK 879

Query: 182  GNPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280
             NPG GL FPR+F LQL GF+        DTR+S TGYCFF
Sbjct: 880  HNPGRGLFFPRHFDLQLLGFTDADWARCIDTRRSTTGYCFF 920


>dbj|GAU41219.1| hypothetical protein TSUD_128950 [Trifolium subterraneum]
          Length = 539

 Score =  199 bits (505), Expect(2) = 1e-78
 Identities = 94/136 (69%), Positives = 113/136 (83%)
 Frame = +3

Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
           FFLG SL+SW+ KKQ TVSRSSSEA+YRA+STA  EL WLLFLLRDLN +C +PP LYCD
Sbjct: 399 FFLGMSLVSWKAKKQVTVSRSSSEADYRALSTATCELIWLLFLLRDLNTTCSKPPVLYCD 458

Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
            QSA+HIA+NPVFHERT+HLEIDCH VREK+Q+GL+KLL + ++ QLADF TKAL    F
Sbjct: 459 SQSAMHIASNPVFHERTKHLEIDCHLVREKVQQGLLKLLPISTQEQLADFLTKALPSPKF 518

Query: 636 IPFISKLRMINIYHGQ 683
             F+SKL M++IYH +
Sbjct: 519 NSFVSKLGMLDIYHSK 534



 Score =  124 bits (311), Expect(2) = 1e-78
 Identities = 59/100 (59%), Positives = 73/100 (73%), Gaps = 8/100 (8%)
 Frame = +2

Query: 5   DGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKG 184
           D G  Y D SSYRRLIG+LLY+T TRPDI+FA QQ+ Q ++ PTVTH+ A CRV+RYLK 
Sbjct: 301 DNGETYADISSYRRLIGKLLYLTNTRPDISFATQQLSQFLHKPTVTHYKAACRVVRYLKH 360

Query: 185 NPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280
           +PG GLM PRN  +Q+ G+S        DTR+S +GYCFF
Sbjct: 361 SPGKGLMLPRNSEIQILGYSDADWAGCLDTRRSTSGYCFF 400


>gb|PNX58721.1| putative copia-type protein, partial [Trifolium pratense]
          Length = 277

 Score =  212 bits (540), Expect(2) = 5e-78
 Identities = 100/134 (74%), Positives = 118/134 (88%)
 Frame = +3

Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
           FFLG+SLISWR KKQ TVSRSSSEAEYRA+S A  ELQWL++LL+DL ++C +PP LYCD
Sbjct: 143 FFLGNSLISWRAKKQHTVSRSSSEAEYRALSFASCELQWLVYLLKDLQVNCIKPPVLYCD 202

Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
           +QSALHIAANPVFHERT+HLEIDCHFVREKLQ+G+ KLL + +K QLADFFTKAL P +F
Sbjct: 203 NQSALHIAANPVFHERTKHLEIDCHFVREKLQQGIFKLLPIHTKAQLADFFTKALPPKSF 262

Query: 636 IPFISKLRMINIYH 677
           + FISKL M++IYH
Sbjct: 263 LSFISKLNMLDIYH 276



 Score =  108 bits (271), Expect(2) = 5e-78
 Identities = 51/100 (51%), Positives = 67/100 (67%), Gaps = 8/100 (8%)
 Frame = +2

Query: 5   DGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKG 184
           D    Y D + YRRL+G+LLY+T TRPDI F  QQ+ Q +++PT TH+   CRV+RYLK 
Sbjct: 45  DTASPYADIAGYRRLVGKLLYLTTTRPDIAFVTQQLSQFLSAPTQTHYDTACRVVRYLKN 104

Query: 185 NPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280
           +PG GL+F R+  L L GF+        DTR+S +GYCFF
Sbjct: 105 SPGRGLLFRRDSQLHLLGFTDADWAGCLDTRRSTSGYCFF 144


>gb|PNX92373.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Trifolium pratense]
          Length = 1125

 Score =  208 bits (529), Expect(2) = 1e-76
 Identities = 98/150 (65%), Positives = 123/150 (82%)
 Frame = +3

Query: 276  FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
            FFLG SL+SWRTKKQ TV+RSSSEAEYRA+++A  ELQWLL+LL+DL + C + P +YCD
Sbjct: 976  FFLGQSLVSWRTKKQFTVARSSSEAEYRALASATCELQWLLYLLQDLGVPCSKLPVIYCD 1035

Query: 456  HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
            +QSALHIAANPVFHERT+HL+IDCH VREK+  G+MKLL V SK Q+ADFFTKAL P  F
Sbjct: 1036 NQSALHIAANPVFHERTKHLDIDCHIVREKMLAGVMKLLPVSSKDQIADFFTKALLPQPF 1095

Query: 636  IPFISKLRMINIYHGQACGGLLKYKNDDST 725
               ++KL M++IYH   CG +L++K +D+T
Sbjct: 1096 GILLAKLGMVDIYHPPTCGRVLEHKTEDNT 1125



 Score =  108 bits (270), Expect(2) = 1e-76
 Identities = 53/101 (52%), Positives = 71/101 (70%), Gaps = 8/101 (7%)
 Frame = +2

Query: 2    DDGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLK 181
            +D G  ++D S+YRRL+GRLLY+T TRPDIT+  QQ+ Q ++ PT  H++A  RVL+YLK
Sbjct: 877  NDIGPIFEDVSAYRRLVGRLLYLTTTRPDITYVTQQLSQFLSRPTQMHYNAALRVLKYLK 936

Query: 182  GNPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280
             +PG GL FPR   LQL GFS        D+R+S++G CFF
Sbjct: 937  TSPGRGLFFPRASQLQLLGFSDADWAGCKDSRRSISGQCFF 977


>gb|PNX87108.1| retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Trifolium pratense]
          Length = 517

 Score =  207 bits (526), Expect(2) = 3e-76
 Identities = 100/148 (67%), Positives = 118/148 (79%)
 Frame = +3

Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
           FFLG SLISWRTKKQ TVSRSSSEAEYRA++ A  ELQW+L+LL+D+ I C + P +YCD
Sbjct: 359 FFLGKSLISWRTKKQLTVSRSSSEAEYRALAAATCELQWILYLLKDIQIQCSKLPVIYCD 418

Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
           +QSALHIAANPVFHERT+HLEIDCH VREKLQ G+MKLL V S+ Q+ADFFTKAL P  F
Sbjct: 419 NQSALHIAANPVFHERTKHLEIDCHLVREKLQAGVMKLLPVTSQNQVADFFTKALLPQPF 478

Query: 636 IPFISKLRMINIYHGQACGGLLKYKNDD 719
              +SKL +++IY    CGGLL    +D
Sbjct: 479 NTLMSKLNLLDIYQPSPCGGLLHSNIED 506



 Score =  108 bits (270), Expect(2) = 3e-76
 Identities = 51/100 (51%), Positives = 67/100 (67%), Gaps = 8/100 (8%)
 Frame = +2

Query: 5   DGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKG 184
           D G  Y D  +YRRLIGRL+Y+  TRPDIT+  QQ+ Q ++ PT  H++A  RVL+YLK 
Sbjct: 261 DNGTPYDDIPAYRRLIGRLIYLNTTRPDITYVTQQLSQFLSKPTTNHYNAAIRVLKYLKN 320

Query: 185 NPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280
           +PG GL FPR+  L + GFS        DTR+S++G CFF
Sbjct: 321 SPGRGLFFPRDSSLHILGFSDADWAGCVDTRRSISGQCFF 360


>gb|PNX86354.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Trifolium pratense]
          Length = 512

 Score =  207 bits (526), Expect(2) = 3e-76
 Identities = 102/154 (66%), Positives = 119/154 (77%)
 Frame = +3

Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
           FFLG SLISWRTKKQ TVSRSSSEAEYRA++ A  ELQW+L+LL+D+ I C + P +YCD
Sbjct: 359 FFLGKSLISWRTKKQLTVSRSSSEAEYRALAAATCELQWILYLLKDIQIQCSKLPVIYCD 418

Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
           +QSALHIAANPVFHERT+HLEIDCH VREKLQ G+MKLL V S+ Q+ADFFTKAL P  F
Sbjct: 419 NQSALHIAANPVFHERTKHLEIDCHLVREKLQAGVMKLLPVTSQNQVADFFTKALLPQPF 478

Query: 636 IPFISKLRMINIYHGQACGGLLKYKNDDSTGDLS 737
              +SKL + +IY    CGGLL    +D    LS
Sbjct: 479 NTLMSKLNLQDIYQPSPCGGLLHSNIEDKDKSLS 512



 Score =  108 bits (270), Expect(2) = 3e-76
 Identities = 51/100 (51%), Positives = 67/100 (67%), Gaps = 8/100 (8%)
 Frame = +2

Query: 5   DGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKG 184
           D G  Y D  +YRRLIGRL+Y+  TRPDIT+  QQ+ Q ++ PT  H++A  RVL+YLK 
Sbjct: 261 DNGTPYDDIPAYRRLIGRLIYLNTTRPDITYVTQQLSQFLSKPTTNHYNAAIRVLKYLKN 320

Query: 185 NPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280
           +PG GL FPR+  L + GFS        DTR+S++G CFF
Sbjct: 321 SPGRGLFFPRDSSLHILGFSDADWAGCVDTRRSISGQCFF 360


>dbj|GAU32754.1| hypothetical protein TSUD_323220 [Trifolium subterraneum]
          Length = 1095

 Score =  193 bits (491), Expect(2) = 6e-76
 Identities = 97/177 (54%), Positives = 124/177 (70%)
 Frame = +3

Query: 276  FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
            FFLGSSLISW+ KKQ T+S+SSSEAEYRA+S++  EL WLL+LL+DL I C + P ++CD
Sbjct: 587  FFLGSSLISWKAKKQLTISKSSSEAEYRALSSSTCELIWLLYLLKDLQIECTQLPVIFCD 646

Query: 456  HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
            +QSALHIA+NPVFHERT+H+EIDCH VREK+Q GL++LL + ++ QL D  TKAL    F
Sbjct: 647  NQSALHIASNPVFHERTKHIEIDCHLVREKVQAGLLRLLPISTQDQLTDCLTKALPTAKF 706

Query: 636  IPFISKLRMINIYHGQACGGLLKYKNDDSTGDLSNNSANEDMHKEDSSKCDKAKQSE 806
              FI+KL +++IY   ACG LL  K   S    SNN     +  +D  K D     E
Sbjct: 707  NHFIAKLGLLDIYQASACGRLLNIKIASSP---SNNHEEASLANDDQVKSDLYNMQE 760



 Score =  120 bits (302), Expect(2) = 6e-76
 Identities = 55/101 (54%), Positives = 74/101 (73%), Gaps = 8/101 (7%)
 Frame = +2

Query: 2   DDGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLK 181
           +D G  ++D S YRRL+G+LLY+T TRPDI +A QQ+ Q +++PT+TH+ A CRV+RYLK
Sbjct: 488 NDNGKPFEDISLYRRLVGKLLYLTNTRPDIAYATQQLSQFLHNPTITHYKAACRVVRYLK 547

Query: 182 GNPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280
            NPG GLMF RN  +Q+ G+S        DTR+S +GYCFF
Sbjct: 548 HNPGRGLMFHRNLDIQIIGYSDADWAGCLDTRRSTSGYCFF 588


>gb|PNX92076.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Trifolium pratense]
          Length = 720

 Score =  201 bits (510), Expect(2) = 8e-76
 Identities = 97/143 (67%), Positives = 116/143 (81%)
 Frame = +3

Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
           FFLG SLISWRTKKQ TV+RSSSEAEYRA++ A  ELQWL +LL+DL+I+C + P LYCD
Sbjct: 557 FFLGQSLISWRTKKQLTVARSSSEAEYRALAAATCELQWLAYLLQDLHITCPKLPVLYCD 616

Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
           +QSALHIAANPVFHERT+H++IDCH VREKLQ GLMKLL V SK Q+ADFFTK+L P  F
Sbjct: 617 NQSALHIAANPVFHERTKHIDIDCHIVREKLQAGLMKLLPVSSKDQIADFFTKSLLPQPF 676

Query: 636 IPFISKLRMINIYHGQACGGLLK 704
              ++KL M +IY    CG ++K
Sbjct: 677 GVLLAKLGMFDIYQAPTCGRVIK 699



 Score =  113 bits (282), Expect(2) = 8e-76
 Identities = 55/100 (55%), Positives = 69/100 (69%), Gaps = 8/100 (8%)
 Frame = +2

Query: 5   DGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKG 184
           D    ++D S+YRRL+GRLLY+  TRPDITF  QQ+ Q ++ PT TH+SA  RVLRYLK 
Sbjct: 459 DDSAPFEDISAYRRLVGRLLYLNTTRPDITFITQQLSQFLSKPTHTHYSAAMRVLRYLKN 518

Query: 185 NPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280
            PG GL FPRN  LQ+ GFS        D+R+S++G CFF
Sbjct: 519 CPGRGLFFPRNSTLQILGFSDADWAGCKDSRRSISGQCFF 558


>dbj|GAU51097.1| hypothetical protein TSUD_185270 [Trifolium subterraneum]
          Length = 1179

 Score =  189 bits (480), Expect(2) = 1e-75
 Identities = 87/133 (65%), Positives = 109/133 (81%)
 Frame = +3

Query: 276  FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
            FFLGSSLISW+ KKQ+T++RSSS+AEY A+++A  ELQWLL+LL DLN+ C RPP LYCD
Sbjct: 1046 FFLGSSLISWKAKKQETIARSSSKAEYIALTSATCELQWLLYLLEDLNVKCSRPPVLYCD 1105

Query: 456  HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
             QSA+HIA+NPVFHERT+HLEIDCH +REKLQ+G++KLL + +  Q+ADF TK L    F
Sbjct: 1106 SQSAIHIASNPVFHERTKHLEIDCHLIREKLQKGILKLLSISTNEQVADFLTKPLVSPKF 1165

Query: 636  IPFISKLRMINIY 674
               +SKL MINI+
Sbjct: 1166 KYLLSKLNMINIF 1178



 Score =  124 bits (311), Expect(2) = 1e-75
 Identities = 58/100 (58%), Positives = 74/100 (74%), Gaps = 8/100 (8%)
 Frame = +2

Query: 5    DGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKG 184
            D G  ++D + YRRLIGRLLY+T TRPDI+ A QQ+ Q + +PT+TH++A CR+LRYLK 
Sbjct: 948  DNGALFEDITQYRRLIGRLLYLTTTRPDISLATQQLSQFLQAPTITHYNAACRILRYLKQ 1007

Query: 185  NPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280
             PG+GLMFPR+  LQL GF+        D+RKS TGYCFF
Sbjct: 1008 EPGLGLMFPRDSELQLLGFADADWAGCVDSRKSTTGYCFF 1047


>gb|PNX94461.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 1000

 Score =  203 bits (517), Expect(2) = 1e-75
 Identities = 99/159 (62%), Positives = 126/159 (79%)
 Frame = +3

Query: 276  FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
            FFLG SLISWRTKKQ TV+RSSSEAEYRA+++A  ELQWL++LL+DL+++C + P LYCD
Sbjct: 840  FFLGQSLISWRTKKQITVARSSSEAEYRALASATCELQWLVYLLQDLHVTCSKLPVLYCD 899

Query: 456  HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
            +QSALHIAANPVFHERT+HL+IDCH VREKLQ GLMKLL V SK Q+ADFFTK L P  F
Sbjct: 900  NQSALHIAANPVFHERTKHLDIDCHVVREKLQAGLMKLLPVSSKDQIADFFTKTLLPQPF 959

Query: 636  IPFISKLRMINIYHGQACGGLLKYKNDDSTGDLSNNSAN 752
               ++KL M++IY    CG +L+  + +++ D   N+ +
Sbjct: 960  GILLAKLGMVDIYQAPPCGRVLEPIHTEASSDNKLNTTH 998



 Score =  110 bits (274), Expect(2) = 1e-75
 Identities = 52/100 (52%), Positives = 70/100 (70%), Gaps = 8/100 (8%)
 Frame = +2

Query: 5    DGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKG 184
            D    ++D S+YRRL+GRLLY+  TRPDITF  QQ+ Q ++ PT TH++A  RVL+YLK 
Sbjct: 742  DNSAPFEDISAYRRLVGRLLYLNTTRPDITFITQQLSQFLSKPTHTHYTAALRVLKYLKN 801

Query: 185  NPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280
             PG GL FPR+  LQ+ GFS        D+R+S++G+CFF
Sbjct: 802  CPGRGLFFPRSSSLQILGFSDADWAGCKDSRRSISGHCFF 841


>gb|PNX93391.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 1296

 Score =  197 bits (500), Expect(2) = 2e-75
 Identities = 95/140 (67%), Positives = 113/140 (80%)
 Frame = +3

Query: 276  FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
            FF+G SLISWR KKQ TVSRSSSEAEYRA+S+A  ELQWLL+LL DL ++  + PTLYCD
Sbjct: 1156 FFMGKSLISWRAKKQATVSRSSSEAEYRALSSATCELQWLLYLLADLKVTLTKTPTLYCD 1215

Query: 456  HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
            +QSA+HIA+NPVFHERT+HL+IDCH VREK+ +G++KLL V +  Q+ADF TKAL P  F
Sbjct: 1216 NQSAVHIASNPVFHERTKHLDIDCHLVREKVLQGILKLLPVSTNDQMADFLTKALAPPKF 1275

Query: 636  IPFISKLRMINIYHGQACGG 695
              FISKL MINIY  Q  GG
Sbjct: 1276 YEFISKLNMINIYQVQLEGG 1295



 Score =  115 bits (288), Expect(2) = 2e-75
 Identities = 55/98 (56%), Positives = 69/98 (70%), Gaps = 8/98 (8%)
 Frame = +2

Query: 11   GGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKGNP 190
            G  Y D S YRRL+G+LLY+  TRPDI FA QQ+ Q +++PT  H++A CRVLRYLK NP
Sbjct: 1060 GTPYDDVSGYRRLVGKLLYLNTTRPDIAFATQQLSQFMHAPTNVHYNAACRVLRYLKNNP 1119

Query: 191  GMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280
            G G++F R+  LQL G+S        DTRKS +GYCFF
Sbjct: 1120 GQGVLFSRDSELQLIGYSDADWAGCMDTRKSTSGYCFF 1157


>dbj|GAU36120.1| hypothetical protein TSUD_374830 [Trifolium subterraneum]
          Length = 1037

 Score =  192 bits (487), Expect(2) = 4e-74
 Identities = 90/146 (61%), Positives = 118/146 (80%)
 Frame = +3

Query: 276  FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
            FFLGSSL+SW+ KKQ T+S+SSSEAEYRA+S+A  EL WLL+LL+DL+I C + P ++CD
Sbjct: 874  FFLGSSLVSWKAKKQLTISKSSSEAEYRALSSATCELVWLLYLLKDLHIECTQLPVIFCD 933

Query: 456  HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
            +QSALHIA+NPVFHERT+H+EIDCH VREK+QEGL++LL V ++ QLAD  TKAL    F
Sbjct: 934  NQSALHIASNPVFHERTKHIEIDCHLVREKVQEGLLRLLPVSTQDQLADCLTKALPVPKF 993

Query: 636  IPFISKLRMINIYHGQACGGLLKYKN 713
              F++KL +++IY   ACG +L  K+
Sbjct: 994  NHFVTKLGLLDIYQASACGRVLSIKD 1019



 Score =  116 bits (290), Expect(2) = 4e-74
 Identities = 55/101 (54%), Positives = 72/101 (71%), Gaps = 8/101 (7%)
 Frame = +2

Query: 2    DDGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLK 181
            +D G  ++D S YRRLIG+LLY+T TRPDI +A QQ+ Q +++PTVTH  A CRV+RYLK
Sbjct: 775  NDNGKPFEDVSLYRRLIGKLLYLTNTRPDIAYATQQLSQFLHNPTVTHFKAACRVIRYLK 834

Query: 182  GNPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280
             NPG GLMF R+  + + G+S        DTR+S +GYCFF
Sbjct: 835  HNPGRGLMFYRHSDIHIIGYSNADWAGCLDTRRSTSGYCFF 875


>dbj|GAU41486.1| hypothetical protein TSUD_239620 [Trifolium subterraneum]
          Length = 794

 Score =  186 bits (473), Expect(2) = 4e-74
 Identities = 90/140 (64%), Positives = 110/140 (78%)
 Frame = +3

Query: 276  FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
            FFLGSSL+SW+ KKQ TVSRSSSEAEYRA+STA  EL WLLFL++DL+I C + P +YCD
Sbjct: 654  FFLGSSLVSWKAKKQLTVSRSSSEAEYRALSTATCELIWLLFLMKDLSIQCSKQPIIYCD 713

Query: 456  HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
             QSA+HIA+NPVFHERT+HLEIDCH VREK+Q+GL++LL + +  QLAD  TK L    F
Sbjct: 714  SQSAIHIASNPVFHERTKHLEIDCHLVREKVQQGLLRLLPISTDDQLADCLTKPLAAPKF 773

Query: 636  IPFISKLRMINIYHGQACGG 695
              FISKL + +IY  +  GG
Sbjct: 774  NSFISKLGLFDIYEPKLEGG 793



 Score =  121 bits (304), Expect(2) = 4e-74
 Identities = 59/101 (58%), Positives = 74/101 (73%), Gaps = 8/101 (7%)
 Frame = +2

Query: 2   DDGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLK 181
           +D G  Y+D SSYRRLIGRLLY+T TRPDI+FAVQQ+ Q ++ PT+ H +A CRV+RYLK
Sbjct: 555 NDAGKLYEDISSYRRLIGRLLYLTNTRPDISFAVQQLSQFLHKPTMVHFNAACRVVRYLK 614

Query: 182 GNPGMGLMFPRNFILQLSGFSD--------TRKSVTGYCFF 280
            NPG GL+F R+   QL GF+D        TR+S +GYCFF
Sbjct: 615 HNPGRGLLFSRHSDTQLLGFADADWAGCIETRRSTSGYCFF 655


>gb|AJY78065.1| putative polyprotein [Glycine max]
          Length = 523

 Score =  197 bits (500), Expect(2) = 6e-74
 Identities = 95/140 (67%), Positives = 111/140 (79%)
 Frame = +3

Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
           FF+G SL+SWR KKQ TVSRSSSEAEYRA+S+A  ELQWLL+L  DL +   R PTLYCD
Sbjct: 383 FFIGKSLVSWRAKKQATVSRSSSEAEYRALSSAACELQWLLYLFADLRVQLTRTPTLYCD 442

Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
           +QSA+HIA+NPVFHERT+HLEIDCH VREKL +G +KLL V +  Q+ADF TKAL P  F
Sbjct: 443 NQSAVHIASNPVFHERTKHLEIDCHLVREKLLKGTLKLLPVSTSDQVADFLTKALAPPKF 502

Query: 636 IPFISKLRMINIYHGQACGG 695
             F+SKL MINIYH +  GG
Sbjct: 503 HDFVSKLSMINIYHDKLEGG 522



 Score =  110 bits (276), Expect(2) = 6e-74
 Identities = 52/98 (53%), Positives = 66/98 (67%), Gaps = 8/98 (8%)
 Frame = +2

Query: 11  GGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKGNP 190
           G  Y D S YRR++G+LLY+  TRPDI FA QQ+ Q + +PT  H +A CRVLRYLK NP
Sbjct: 287 GTPYADISGYRRIVGKLLYLNTTRPDIAFATQQLSQFMQAPTNVHFNAACRVLRYLKNNP 346

Query: 191 GMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280
           G G+ F R   +QL G+S        D+RKS++GYCFF
Sbjct: 347 GQGIFFSRTSEMQLIGYSDADWAGCMDSRKSISGYCFF 384


>gb|KHN24193.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja]
 gb|KHN37451.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja]
          Length = 234

 Score =  197 bits (500), Expect(2) = 1e-73
 Identities = 95/140 (67%), Positives = 111/140 (79%)
 Frame = +3

Query: 276 FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
           FF+G SL+SWR KKQ TVSRSSSEAEYRA+S+A  ELQWLL+L  DL +   R PTLYCD
Sbjct: 95  FFIGKSLVSWRAKKQATVSRSSSEAEYRALSSAACELQWLLYLFADLRVQLTRTPTLYCD 154

Query: 456 HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
           +QSA+HIA+NPVFHERT+HLEIDCH VREKL +G +KLL V +  Q+ADF TKAL P  F
Sbjct: 155 NQSAVHIASNPVFHERTKHLEIDCHLVREKLLKGTLKLLPVSTSDQVADFLTKALAPPKF 214

Query: 636 IPFISKLRMINIYHGQACGG 695
             F+SKL MINIYH +  GG
Sbjct: 215 HDFVSKLSMINIYHDKLEGG 234



 Score =  109 bits (273), Expect(2) = 1e-73
 Identities = 51/95 (53%), Positives = 65/95 (68%), Gaps = 8/95 (8%)
 Frame = +2

Query: 20  YKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLKGNPGMG 199
           Y D S YRR++G+LLY+  TRPDI FA QQ+ Q + +PT  H +A CRVLRYLK NPG G
Sbjct: 2   YADISGYRRIVGKLLYLNTTRPDIAFATQQLSQFMQAPTNVHFNAACRVLRYLKNNPGQG 61

Query: 200 LMFPRNFILQLSGFS--------DTRKSVTGYCFF 280
           + F R   +QL G+S        D+RKS++GYCFF
Sbjct: 62  IFFSRTSEMQLIGYSDADWAGCMDSRKSISGYCFF 96


>dbj|GAU39523.1| hypothetical protein TSUD_222930 [Trifolium subterraneum]
          Length = 1210

 Score =  184 bits (467), Expect(2) = 5e-73
 Identities = 88/140 (62%), Positives = 110/140 (78%)
 Frame = +3

Query: 276  FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
            FF+GSSLISW+ KKQ TVSRSSSEAEYRA+S+   EL WLL L+ DL I C++PP +YCD
Sbjct: 1070 FFIGSSLISWKAKKQLTVSRSSSEAEYRALSSTTCELIWLLSLINDLKIQCDKPPVIYCD 1129

Query: 456  HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
             QSA+HIA+NPVFHERT+HLEIDCH VREK+Q+G+++LL + ++ QLAD  TKAL    F
Sbjct: 1130 SQSAMHIASNPVFHERTKHLEIDCHLVREKVQQGILRLLPISTQDQLADCLTKALPGPKF 1189

Query: 636  IPFISKLRMINIYHGQACGG 695
               ISKL + +IYH +  GG
Sbjct: 1190 SSIISKLGLKDIYHPKLEGG 1209



 Score =  120 bits (301), Expect(2) = 5e-73
 Identities = 58/101 (57%), Positives = 72/101 (71%), Gaps = 8/101 (7%)
 Frame = +2

Query: 2    DDGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLK 181
            +D    Y D SSYRRL+G+LLY+T TRPDI +A QQ+ Q ++ PT TH++A CRV++YLK
Sbjct: 971  NDDAKPYDDISSYRRLVGKLLYLTNTRPDIAYATQQLSQFLHKPTWTHYNAACRVVKYLK 1030

Query: 182  GNPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280
             NPG GL+FPR   LQL GFS        DTR+S TGYCFF
Sbjct: 1031 QNPGRGLLFPRASDLQLLGFSDADWAGCVDTRRSTTGYCFF 1071


>dbj|GAU51049.1| hypothetical protein TSUD_371270 [Trifolium subterraneum]
          Length = 1001

 Score =  182 bits (462), Expect(2) = 1e-72
 Identities = 89/140 (63%), Positives = 111/140 (79%)
 Frame = +3

Query: 276  FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
            FF+GSSLISW+ KKQ TVSRSSSEAEYRA+S+   EL WLL L++DL I C++PP +Y D
Sbjct: 861  FFIGSSLISWKAKKQLTVSRSSSEAEYRALSSTTCELIWLLSLMKDLKIECDKPPFIYRD 920

Query: 456  HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
             QSA+HIA+NPVFHERT+HLEIDCH VREK+Q+GL+KLL + ++ +LAD  TKAL    F
Sbjct: 921  SQSAMHIASNPVFHERTKHLEIDCHLVREKVQQGLLKLLPISTQDKLADCLTKALPGPKF 980

Query: 636  IPFISKLRMINIYHGQACGG 695
              FISKL + +IYH +  GG
Sbjct: 981  NSFISKLGLQDIYHPKLKGG 1000



 Score =  120 bits (302), Expect(2) = 1e-72
 Identities = 58/101 (57%), Positives = 73/101 (72%), Gaps = 8/101 (7%)
 Frame = +2

Query: 2    DDGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLK 181
            +D    Y D SSYRRLIG+LLY+T TRPDI +A QQ+ Q ++ PT TH++A CRV++YLK
Sbjct: 762  NDEAKPYADISSYRRLIGKLLYLTNTRPDIAYATQQLSQFLHKPTWTHYNAACRVVKYLK 821

Query: 182  GNPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280
             NPG GL+FPR+  LQ+ GFS        DTR+S TGYCFF
Sbjct: 822  QNPGRGLLFPRSSDLQILGFSDADWAGCVDTRRSTTGYCFF 862


>dbj|GAU46985.1| hypothetical protein TSUD_403190 [Trifolium subterraneum]
          Length = 1071

 Score =  182 bits (463), Expect(2) = 2e-72
 Identities = 91/169 (53%), Positives = 121/169 (71%)
 Frame = +3

Query: 276  FFLGSSLISWRTKKQQTVSRSSSEAEYRAMSTAVRELQWLLFLLRDLNISCERPPTLYCD 455
            FFLGSSLISW+ KKQ T+S+SS EAEYRA+S++  EL WLL+LL+DL I C + P ++CD
Sbjct: 900  FFLGSSLISWKAKKQLTISKSSLEAEYRALSSSTCELIWLLYLLKDLQIECTQLPVIFCD 959

Query: 456  HQSALHIAANPVFHERTEHLEIDCHFVREKLQEGLMKLLLVPSKGQLADFFTKALTPTNF 635
            +QSAL+I++NPVFHE T+H+E+DCH VREK+Q GL++LL + ++ QLAD  TKAL    F
Sbjct: 960  NQSALNISSNPVFHESTKHIELDCHLVREKVQAGLLRLLPISTQDQLADCLTKALPTAKF 1019

Query: 636  IPFISKLRMINIYHGQACGGLLKYKNDDSTGDLSNNSANEDMHKEDSSK 782
              FI+KL +++IY   ACG LL  K   S+   SNN     +   D  K
Sbjct: 1020 NHFIAKLGLLDIYQASACGRLLNNKIASSS---SNNYEEASLASNDQVK 1065



 Score =  120 bits (300), Expect(2) = 2e-72
 Identities = 55/101 (54%), Positives = 74/101 (73%), Gaps = 8/101 (7%)
 Frame = +2

Query: 2    DDGGGAYKDTSSYRRLIGRLLYITKTRPDITFAVQQIRQHVNSPTVTHHSAVCRVLRYLK 181
            +D G  ++D S YRRL+G+LLY+T TRPDI +A QQ+ Q +++PT+TH+ A CRV+RYLK
Sbjct: 801  NDNGKPFEDISLYRRLVGKLLYLTNTRPDIAYATQQLSQFLHNPTITHYKAACRVVRYLK 860

Query: 182  GNPGMGLMFPRNFILQLSGFS--------DTRKSVTGYCFF 280
             NPG GLMF RN  +Q+ G+S        DTR+S +GYCFF
Sbjct: 861  HNPGRGLMFHRNSDIQIIGYSDADWAGCLDTRRSTSGYCFF 901

