BLASTX nr result

ID: Astragalus22_contig00019161

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00019161
         (714 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX96089.1| copia-type polyprotein, partial [Trifolium pratense]   297   1e-90
dbj|GAU51028.1| hypothetical protein TSUD_291040 [Trifolium subt...   291   7e-88
dbj|GAU16533.1| hypothetical protein TSUD_167640 [Trifolium subt...   287   2e-86
dbj|GAU23238.1| hypothetical protein TSUD_172660 [Trifolium subt...   282   1e-84
gb|PNX94522.1| copia-type polyprotein [Trifolium pratense]            283   1e-84
gb|PNX60580.1| retrovirus-related Pol polyprotein from transposo...   261   5e-84
dbj|GAU32111.1| hypothetical protein TSUD_357950 [Trifolium subt...   281   6e-84
gb|KHN02838.1| Retrovirus-related Pol polyprotein from transposo...   256   3e-81
gb|KHN36591.1| Retrovirus-related Pol polyprotein from transposo...   258   3e-81
gb|PNX55375.1| retrovirus-related Pol polyprotein from transposo...   254   7e-81
gb|KHN39047.1| Retrovirus-related Pol polyprotein from transposo...   254   8e-81
gb|PNX72392.1| copia-type polyprotein, partial [Trifolium pratense]   268   2e-80
gb|PNX99755.1| copia-type polyprotein, partial [Trifolium pratense]   270   6e-80
dbj|GAU42405.1| hypothetical protein TSUD_324600 [Trifolium subt...   268   3e-79
gb|PNY00066.1| retrovirus-related Pol polyprotein from transposo...   248   4e-79
gb|PRQ17740.1| putative RNA-directed DNA polymerase [Rosa chinen...   265   7e-78
gb|PNX93789.1| copia-type polyprotein [Trifolium pratense]            265   8e-78
gb|PNX99782.1| copia-type polyprotein [Trifolium pratense]            260   1e-77
dbj|GAU32260.1| hypothetical protein TSUD_53880 [Trifolium subte...   263   2e-77
gb|PNX90684.1| retrovirus-related Pol polyprotein from transposo...   245   8e-77

>gb|PNX96089.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 1062

 Score =  297 bits (761), Expect = 1e-90
 Identities = 147/237 (62%), Positives = 178/237 (75%), Gaps = 1/237 (0%)
 Frame = -3

Query: 712 KMKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKV 533
           KMKIHGE V+   +VEKILRSMT +FNY+VCAIEESN V  LS+DEL  SL+VHEQ+MK 
Sbjct: 170 KMKIHGERVEPVTIVEKILRSMTPKFNYVVCAIEESNDVTALSVDELQSSLIVHEQRMKG 229

Query: 532 HKEEDQVLKVSNENRTXXXXXXXXXXXXXXXXRQFS-HKESIECYKCHKLGHYQSECPKL 356
            +EEDQ+LKV N  RT                 + S +K++IECY CHKLGH+QS+CP  
Sbjct: 230 QREEDQILKVINAGRTNNRGRGRGGFRGGRGRGRQSFNKDNIECYHCHKLGHFQSDCPAW 289

Query: 355 EENASYAEFDEDEEVLLMAYNTDIVVDSEGKIWFLDSGCSNHMCGVKEWFFNLDTKFRTT 176
           +E A+YAEFDE EE+LLMA++     D   K+WFLDSGC NHMCG K+WFFNLD +FR +
Sbjct: 290 DEKANYAEFDEGEEMLLMAHSEKGSYDK--KVWFLDSGCRNHMCGTKDWFFNLDEQFRIS 347

Query: 175 VRLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHRGLTVIFE 5
           V+LGDNS M V GKGNVKL++ G+ QVIT VYYIP LKNNLLSIGQLQ +GLTV+F+
Sbjct: 348 VKLGDNSRMMVVGKGNVKLRIGGITQVITNVYYIPELKNNLLSIGQLQEKGLTVVFK 404


>dbj|GAU51028.1| hypothetical protein TSUD_291040 [Trifolium subterraneum]
          Length = 1182

 Score =  291 bits (746), Expect = 7e-88
 Identities = 144/237 (60%), Positives = 177/237 (74%)
 Frame = -3

Query: 712 KMKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKV 533
           KMK+ GE ++  +VVEKILRSM  +FNY+VCAIEESN VD LSID L GSLLVHEQKMK 
Sbjct: 142 KMKMQGEAMEHSIVVEKILRSMARKFNYVVCAIEESNDVDTLSIDGLQGSLLVHEQKMKP 201

Query: 532 HKEEDQVLKVSNENRTXXXXXXXXXXXXXXXXRQFSHKESIECYKCHKLGHYQSECPKLE 353
            KEEDQ LK+++ N                  ++  HK++IECYKCH+ GH+Q ECP  E
Sbjct: 202 AKEEDQALKITHGNGNTTRGRGRGGRTTQGRGKRL-HKDNIECYKCHRFGHFQYECPNNE 260

Query: 352 ENASYAEFDEDEEVLLMAYNTDIVVDSEGKIWFLDSGCSNHMCGVKEWFFNLDTKFRTTV 173
           + A YA+++E+EEVLLMA++       + KIW+LDSGCSNHMCGV E+FF+LDT FR TV
Sbjct: 261 DYAHYADYNENEEVLLMAFDKPSPSSVKNKIWYLDSGCSNHMCGVNEFFFDLDTNFRETV 320

Query: 172 RLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHRGLTVIFEN 2
           RLGDNS M V GKGNVKLQ++G+ Q+IT VYYIP LKNNLLSIGQLQ + LT +F+N
Sbjct: 321 RLGDNSQMNVMGKGNVKLQMNGITQIITAVYYIPELKNNLLSIGQLQKKDLTFVFKN 377


>dbj|GAU16533.1| hypothetical protein TSUD_167640 [Trifolium subterraneum]
          Length = 1103

 Score =  287 bits (734), Expect = 2e-86
 Identities = 142/239 (59%), Positives = 178/239 (74%), Gaps = 2/239 (0%)
 Frame = -3

Query: 712 KMKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKV 533
           KMKI GE V++  VVEKILRSMT +FNY+VCAIEESN V+ LSIDEL GSLLVHEQKMK 
Sbjct: 142 KMKIQGEAVEQSTVVEKILRSMTSKFNYVVCAIEESNNVETLSIDELQGSLLVHEQKMKP 201

Query: 532 HKEEDQVLKVSNENRTXXXXXXXXXXXXXXXXRQFSHKESIECYKCHKLGHYQSECPKLE 353
            KEEDQ LKV++ +R                 ++ + KE++ECYKCHK GH+Q EC   E
Sbjct: 202 LKEEDQALKVTHGDRNSTRGRGRGAKGNQGRGKRIN-KENVECYKCHKFGHFQYECQNSE 260

Query: 352 EN--ASYAEFDEDEEVLLMAYNTDIVVDSEGKIWFLDSGCSNHMCGVKEWFFNLDTKFRT 179
               A+YA+FD+ EEVLLMA+ +    +   K+W++DSGCSNHMCG+KEWF +LD  FR 
Sbjct: 261 GGGYANYADFDDSEEVLLMAHESTSSNNPRAKVWYIDSGCSNHMCGIKEWFHDLDESFRE 320

Query: 178 TVRLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHRGLTVIFEN 2
           +VRLGD+S M V GKGNVKLQ++G+VQ+ITGVY+IP LKNNLLS+GQLQ + LT + +N
Sbjct: 321 SVRLGDDSQMSVMGKGNVKLQMNGIVQIITGVYFIPKLKNNLLSLGQLQEKNLTFVIKN 379


>dbj|GAU23238.1| hypothetical protein TSUD_172660 [Trifolium subterraneum]
          Length = 1132

 Score =  282 bits (722), Expect = 1e-84
 Identities = 140/237 (59%), Positives = 177/237 (74%), Gaps = 2/237 (0%)
 Frame = -3

Query: 712 KMKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKV 533
           KMKI GE +++ +VVEKILRSMT +FNY+VCAIEE+N V+ LSIDEL GSLLVHEQKMK 
Sbjct: 142 KMKIQGEVMEQNIVVEKILRSMTSKFNYVVCAIEEANNVETLSIDELQGSLLVHEQKMKP 201

Query: 532 HKEEDQVLKVSNENRTXXXXXXXXXXXXXXXXRQFSHKESIECYKCHKLGHYQSECPKLE 353
            KEEDQ LK ++ +R                 ++ + KE++ECYKCHK GH+Q EC   E
Sbjct: 202 LKEEDQALKATHGDRNSGRGRGRGAKGNQGRGKRIN-KENVECYKCHKFGHFQYECQNSE 260

Query: 352 EN--ASYAEFDEDEEVLLMAYNTDIVVDSEGKIWFLDSGCSNHMCGVKEWFFNLDTKFRT 179
               A+YA+FD+ EEVLLMA+ +    +   K+W++DSGCSNHMCG+KEWF +LD  FR 
Sbjct: 261 GGGYANYADFDDSEEVLLMAHESTSSNNPRAKVWYIDSGCSNHMCGIKEWFHDLDDSFRE 320

Query: 178 TVRLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHRGLTVIF 8
           +VRLGD+S M V GKGNVKLQ++G+VQVITGVY+IP LKNNLLS+GQLQ + LT I+
Sbjct: 321 SVRLGDDSQMSVMGKGNVKLQMNGIVQVITGVYFIPKLKNNLLSLGQLQEKNLTFIW 377


>gb|PNX94522.1| copia-type polyprotein [Trifolium pratense]
          Length = 1172

 Score =  283 bits (723), Expect = 1e-84
 Identities = 143/236 (60%), Positives = 178/236 (75%)
 Frame = -3

Query: 712 KMKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKV 533
           KMKI GEN++E +VVEKILRSMTE+FNY+VCAIEESN V+ LSIDEL GSLLVHE+KMK 
Sbjct: 142 KMKIQGENMEESIVVEKILRSMTEKFNYVVCAIEESNNVETLSIDELQGSLLVHERKMKP 201

Query: 532 HKEEDQVLKVSNENRTXXXXXXXXXXXXXXXXRQFSHKESIECYKCHKLGHYQSECPKLE 353
            K+EDQ LKV+  +R                 ++ + KE+IECYKCHKLGH+Q ECP + 
Sbjct: 202 IKQEDQALKVTYGDRNAGRGRGRGAKGGQGRGKRIN-KETIECYKCHKLGHFQYECPNVG 260

Query: 352 ENASYAEFDEDEEVLLMAYNTDIVVDSEGKIWFLDSGCSNHMCGVKEWFFNLDTKFRTTV 173
           + A+YA+   +EEVLLMA++      ++ +IW+LDSGC NHMCGVKEWF +LD  F+ TV
Sbjct: 261 DYANYAD---NEEVLLMAFDKSHQESTKKQIWYLDSGCINHMCGVKEWFHDLDMNFKETV 317

Query: 172 RLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHRGLTVIFE 5
           RL DNS M V GKGNVKLQ++G  Q+IT VYYIP LKNNLLSIGQLQ + LT++F+
Sbjct: 318 RLRDNSQMSVVGKGNVKLQLNGFTQIITDVYYIPELKNNLLSIGQLQLKDLTIVFK 373


>gb|PNX60580.1| retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Trifolium pratense]
          Length = 305

 Score =  261 bits (668), Expect = 5e-84
 Identities = 133/243 (54%), Positives = 173/243 (71%), Gaps = 6/243 (2%)
 Frame = -3

Query: 712 KMKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKV 533
           KMK+HGE + +  VVEKILRS+T RFNY+ C+IEESN V   ++D+L  SLLVHE +MK 
Sbjct: 47  KMKMHGEAMTQGRVVEKILRSLTSRFNYIACSIEESNDVITWTVDQLQSSLLVHEHRMKG 106

Query: 532 HKEEDQVLKVSNENRTXXXXXXXXXXXXXXXXRQFSHKESIECYKCHKLGHYQSECPKLE 353
            K+E+QVLK+SN                    R   +KE++ECYKCHKLGH+QSECP  E
Sbjct: 107 QKDEEQVLKMSNNGGRGRGRGGRDGSRGRGRGRGRFNKENVECYKCHKLGHFQSECPNWE 166

Query: 352 E-NASYAEFDEDE--EVLLMAYNTDIVV---DSEGKIWFLDSGCSNHMCGVKEWFFNLDT 191
           E NA+YAEF+ DE  E+LL+A  T  +    DS+ +IWFLDSGCSNHM G K+W F+ D 
Sbjct: 167 EDNANYAEFEFDEAGEILLVAQETKEIESSNDSKYEIWFLDSGCSNHMVGNKDWLFDYDD 226

Query: 190 KFRTTVRLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHRGLTVI 11
            F+ +V+LGD+S M V GKGN++L + G VQ++T VYY+P LKNNLLSIGQLQ + LT++
Sbjct: 227 SFKDSVKLGDDSKMAVVGKGNLRLYIVGYVQILTNVYYLPGLKNNLLSIGQLQQKNLTIV 286

Query: 10  FEN 2
           F+N
Sbjct: 287 FKN 289


>dbj|GAU32111.1| hypothetical protein TSUD_357950 [Trifolium subterraneum]
          Length = 1193

 Score =  281 bits (718), Expect = 6e-84
 Identities = 139/237 (58%), Positives = 175/237 (73%)
 Frame = -3

Query: 712 KMKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKV 533
           KMK+ GE ++  +VVEKILRSMT +FNY+VCAIEESN V+ L ID L GSLLVHEQKMK 
Sbjct: 183 KMKMQGETMEHSIVVEKILRSMTRKFNYVVCAIEESNDVETLPIDGLQGSLLVHEQKMKP 242

Query: 532 HKEEDQVLKVSNENRTXXXXXXXXXXXXXXXXRQFSHKESIECYKCHKLGHYQSECPKLE 353
            KEEDQ LK+++ +                  ++ + K++IECYKCH+ GH+Q ECP  E
Sbjct: 243 AKEEDQALKITHGSGNSTRGRGRGGRTNQGRGKRLN-KDNIECYKCHRFGHFQYECPNNE 301

Query: 352 ENASYAEFDEDEEVLLMAYNTDIVVDSEGKIWFLDSGCSNHMCGVKEWFFNLDTKFRTTV 173
           + A YA+++E+EEVLLMA++       + KIW+LD GCSNHMCGVK +FF+LDT FR TV
Sbjct: 302 DYAHYADYNENEEVLLMAFDKPSTSSVKSKIWYLDLGCSNHMCGVKVFFFDLDTSFRETV 361

Query: 172 RLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHRGLTVIFEN 2
           RLGDNS M V GKGNVKLQ++G+ Q+IT VYYIP LKNNLLSIGQLQ + LT + +N
Sbjct: 362 RLGDNSQMNVMGKGNVKLQMNGITQIITVVYYIPELKNNLLSIGQLQKKNLTFVLKN 418


>gb|KHN02838.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja]
          Length = 342

 Score =  256 bits (653), Expect = 3e-81
 Identities = 126/235 (53%), Positives = 167/235 (71%), Gaps = 2/235 (0%)
 Frame = -3

Query: 712 KMKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKV 533
           KMK HGE++ E ++  KILRSM  +F+Y+VC+IEESN +DV++IDEL  SLLVHEQ+M+ 
Sbjct: 127 KMKAHGESMSETVITAKILRSMISKFDYVVCSIEESNNLDVMTIDELQSSLLVHEQRMRS 186

Query: 532 HKEEDQVLKVSNENRTXXXXXXXXXXXXXXXXRQFSHKESIECYKCHKLGHYQSECPKLE 353
             EE+QVLK+S+E++                      +  IEC+KCHKLGHYQ ECP  E
Sbjct: 187 RGEEEQVLKISHEDKASRG------------------RAVIECFKCHKLGHYQYECPDWE 228

Query: 352 ENASYAEFDE--DEEVLLMAYNTDIVVDSEGKIWFLDSGCSNHMCGVKEWFFNLDTKFRT 179
           +NA+Y E ++  DEE+LLM+Y  ++  D   ++WFLDSGCSNHM G KEWF  LD  F  
Sbjct: 229 KNANYVELEKEKDEELLLMSY-VELEQDKMEEVWFLDSGCSNHMTGNKEWFSELDESFSQ 287

Query: 178 TVRLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHRGLTV 14
           TV+LG+N+ M V GKG +++QV+G  Q I+GVYY+P LKNNLLSIGQLQ +GLT+
Sbjct: 288 TVKLGNNTRMVVVGKGIIRMQVNGFTQAISGVYYVPELKNNLLSIGQLQEKGLTI 342


>gb|KHN36591.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja]
          Length = 430

 Score =  258 bits (660), Expect = 3e-81
 Identities = 128/243 (52%), Positives = 173/243 (71%), Gaps = 6/243 (2%)
 Frame = -3

Query: 712 KMKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKV 533
           KMK HGE++ E ++  KILRSM  +F+Y+VC+IEESN +D+++IDEL  SLLVHEQ+M+ 
Sbjct: 134 KMKAHGESMSETVITAKILRSMISKFDYVVCSIEESNNLDMMTIDELQSSLLVHEQRMRS 193

Query: 532 HKEEDQVLKVSNENRTXXXXXXXXXXXXXXXXR----QFSHKESIECYKCHKLGHYQSEC 365
             EE+QVLK+S+E++                 R    Q  +K  IEC+KCHKLGHYQ EC
Sbjct: 194 RGEEEQVLKISHEDKASRGRGRGRGNGSFRGGRGRGRQSFNKAVIECFKCHKLGHYQYEC 253

Query: 364 PKLEENASYAEFDE--DEEVLLMAYNTDIVVDSEGKIWFLDSGCSNHMCGVKEWFFNLDT 191
           P  E+NA+Y E ++  DEE+LLM+Y  ++  D   ++WFLDSGCSNHM G KEWF  LD 
Sbjct: 254 PDWEKNANYVELEKEKDEELLLMSY-VELEQDKMEEVWFLDSGCSNHMTGNKEWFSELDE 312

Query: 190 KFRTTVRLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHRGLTVI 11
            F  TV+LG+N+ M V GKG +++QV+G  Q I+GVYY+P LKNNLLSIGQLQ +GLT++
Sbjct: 313 SFSQTVKLGNNTRMVVVGKGIIRMQVNGFTQAISGVYYVPELKNNLLSIGQLQEKGLTIL 372

Query: 10  FEN 2
            ++
Sbjct: 373 IQH 375


>gb|PNX55375.1| retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Trifolium pratense]
          Length = 327

 Score =  254 bits (649), Expect = 7e-81
 Identities = 132/240 (55%), Positives = 171/240 (71%), Gaps = 5/240 (2%)
 Frame = -3

Query: 712 KMKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKV 533
           KM   GE + + +VVEK+LRSM+E+FNY+VC+IEESN V  L+IDEL  SLLVHE++MK 
Sbjct: 47  KMTASGETMTQTIVVEKVLRSMSEKFNYVVCSIEESNDVTTLTIDELQSSLLVHEKRMKP 106

Query: 532 H--KEEDQVLKVSNENRTXXXXXXXXXXXXXXXXRQFSHKESIECYKCHKLGHYQSECPK 359
              K+E+Q LKVS   R                  +  +++ +ECY+CHKLGHYQSECP 
Sbjct: 107 TQVKDEEQALKVSY-GRGGGRGRGRNSSRGGRGRGRQQNRDLVECYRCHKLGHYQSECPT 165

Query: 358 LEENASYAEFDEDEEVLLMAYNT---DIVVDSEGKIWFLDSGCSNHMCGVKEWFFNLDTK 188
            EE A+YAEFDE  EVLLMA+      +  + + +IWFLDSGC+NHM   KEW F  D++
Sbjct: 166 WEE-ANYAEFDEHGEVLLMAHEKLKEPVKSELKDEIWFLDSGCNNHMVVKKEWLFEFDSE 224

Query: 187 FRTTVRLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHRGLTVIF 8
           FR TV+LGDNS M+V  KGN++LQ++G+VQVIT VYY+P LKNNLLSIGQLQ + LT++F
Sbjct: 225 FRETVKLGDNSRMQVMIKGNLRLQIEGIVQVITSVYYLPDLKNNLLSIGQLQQKNLTIVF 284


>gb|KHN39047.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja]
          Length = 342

 Score =  254 bits (650), Expect = 8e-81
 Identities = 125/235 (53%), Positives = 167/235 (71%), Gaps = 2/235 (0%)
 Frame = -3

Query: 712 KMKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKV 533
           KMK HGE++ E ++  KILRSM  +F+Y+VC+IEESN +D+++IDEL  SLLVHEQ+M+ 
Sbjct: 127 KMKAHGESMSETVITAKILRSMISKFDYVVCSIEESNNLDMMTIDELQSSLLVHEQRMRS 186

Query: 532 HKEEDQVLKVSNENRTXXXXXXXXXXXXXXXXRQFSHKESIECYKCHKLGHYQSECPKLE 353
             EE+QVLK+S+E++                      +  IEC+KCHKLGHYQ ECP  E
Sbjct: 187 RGEEEQVLKISHEDKASRG------------------RAVIECFKCHKLGHYQYECPDWE 228

Query: 352 ENASYAEFDE--DEEVLLMAYNTDIVVDSEGKIWFLDSGCSNHMCGVKEWFFNLDTKFRT 179
           +NA+Y E ++  DEE+LLM+Y  ++  D   ++WFLDSGCSNHM G KEWF  LD  F  
Sbjct: 229 KNANYVELEKEKDEELLLMSY-VELEQDKMEEVWFLDSGCSNHMTGNKEWFSELDESFSQ 287

Query: 178 TVRLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHRGLTV 14
           TV+LG+N+ M V GKG +++QV+G  Q I+GVYY+P LKNNLLSIGQLQ +GLT+
Sbjct: 288 TVKLGNNTRMVVVGKGIIRMQVNGFTQAISGVYYVPELKNNLLSIGQLQEKGLTI 342


>gb|PNX72392.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 886

 Score =  268 bits (684), Expect = 2e-80
 Identities = 134/241 (55%), Positives = 175/241 (72%), Gaps = 5/241 (2%)
 Frame = -3

Query: 712 KMKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKV 533
           K+  HGEN+ +  V+EK+LRSM+ +FNY+VCAIEES+ V  +SIDE+  SL+VHE++MK 
Sbjct: 142 KITAHGENLTQASVIEKVLRSMSSKFNYVVCAIEESHDVTTMSIDEIQSSLIVHEKRMKA 201

Query: 532 H--KEEDQVLKVSNENRTXXXXXXXXXXXXXXXXRQFSHKESIECYKCHKLGHYQSECPK 359
           +  KEE+Q LKVSN  R                  +   KE+IEC++CHKLGHYQSECP 
Sbjct: 202 NLDKEEEQALKVSNYGRGANRGRGGRSSSRGRGRGRQISKENIECFRCHKLGHYQSECPN 261

Query: 358 LEE-NASYAEFDEDEEVLLMAYNTDIVVDSEGK--IWFLDSGCSNHMCGVKEWFFNLDTK 188
            E+ NA++AEFD+ EE+LLMA  TD   +S  K  +W+LDSGCSNHM G KEW F+ D  
Sbjct: 262 WEDANANFAEFDDKEEILLMAQGTD---ESNNKKVVWYLDSGCSNHMVGNKEWLFDFDDS 318

Query: 187 FRTTVRLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHRGLTVIF 8
           FR +V+LGD+S M V GKGN+KL ++G+VQVIT VY++P LKNNLLSIGQLQ + +T+IF
Sbjct: 319 FRESVKLGDDSRMAVMGKGNLKLNINGMVQVITDVYFLPGLKNNLLSIGQLQQKNVTIIF 378

Query: 7   E 5
           E
Sbjct: 379 E 379


>gb|PNX99755.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 1209

 Score =  270 bits (690), Expect = 6e-80
 Identities = 135/243 (55%), Positives = 168/243 (69%), Gaps = 8/243 (3%)
 Frame = -3

Query: 712 KMKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKV 533
           KM  HGE + +  +VEKILRSMT +F Y+VC+IEES+ V  +SIDEL  SLLVHE +MK+
Sbjct: 118 KMTSHGERLTDGNIVEKILRSMTSKFEYVVCSIEESHDVTTMSIDELQSSLLVHEGRMKI 177

Query: 532 HK--EEDQVLKVSNEN------RTXXXXXXXXXXXXXXXXRQFSHKESIECYKCHKLGHY 377
           HK  EE+Q LKVSN N       +                   + KE +ECY+CHKLGHY
Sbjct: 178 HKVKEEEQALKVSNSNLGRGSANSRGRGRTSSRGRGRGRSAASTSKEFVECYRCHKLGHY 237

Query: 376 QSECPKLEENASYAEFDEDEEVLLMAYNTDIVVDSEGKIWFLDSGCSNHMCGVKEWFFNL 197
           Q+ECP  EENA++AEFDE EE+L+MA N   V  +  ++WFLDSGCSNHM G KEW F+ 
Sbjct: 238 QNECPTWEENANFAEFDEHEEMLMMAQNQSNV--NTNQVWFLDSGCSNHMIGTKEWLFDF 295

Query: 196 DTKFRTTVRLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHRGLT 17
           D  FR TV+LGDNS M V GKGNVK+ + G + VIT VYY+P L+NNLLSIGQLQ + LT
Sbjct: 296 DDTFRETVKLGDNSTMSVMGKGNVKISLQGKISVITDVYYLPNLRNNLLSIGQLQQKNLT 355

Query: 16  VIF 8
           ++F
Sbjct: 356 IVF 358


>dbj|GAU42405.1| hypothetical protein TSUD_324600 [Trifolium subterraneum]
          Length = 1302

 Score =  268 bits (686), Expect = 3e-79
 Identities = 134/239 (56%), Positives = 171/239 (71%), Gaps = 2/239 (0%)
 Frame = -3

Query: 712 KMKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKV 533
           KMKI GE V++  VVEKILRSMT +FNY+VCAIEESN V+ LSIDEL GSLLVHEQKMK 
Sbjct: 142 KMKIQGEAVEQSTVVEKILRSMTSKFNYVVCAIEESNNVETLSIDELQGSLLVHEQKMKP 201

Query: 532 HKEEDQVLKVSNENRTXXXXXXXXXXXXXXXXRQFSHKESIECYKCHKLGHYQSECPKLE 353
             EEDQ LK ++ +R                 ++ + KE++ECYKCHK GH+Q EC   E
Sbjct: 202 LNEEDQALKATHGDRNSTRGRGRGAKGNQGRGKRIN-KENVECYKCHKFGHFQYECQNSE 260

Query: 352 EN--ASYAEFDEDEEVLLMAYNTDIVVDSEGKIWFLDSGCSNHMCGVKEWFFNLDTKFRT 179
               A+YA+FD+ EEVLLMA+ +    +   K+W++DSGC NHMCG+KE F +LD  FR 
Sbjct: 261 GGGYANYADFDDSEEVLLMAHESTSSNNPRAKVWYIDSGCRNHMCGIKERFHDLDESFRE 320

Query: 178 TVRLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHRGLTVIFEN 2
           +VRLGD+S M V  KGNVKL ++G+VQ+ITGV++IP LKNNLLS+GQ Q + LT + +N
Sbjct: 321 SVRLGDDSQMSVMEKGNVKLHINGIVQIITGVHFIPKLKNNLLSLGQFQEKSLTFVIKN 379


>gb|PNY00066.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Trifolium pratense]
          Length = 276

 Score =  248 bits (633), Expect = 4e-79
 Identities = 128/248 (51%), Positives = 165/248 (66%), Gaps = 11/248 (4%)
 Frame = -3

Query: 712 KMKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKV 533
           KM  HGE V + M+VEKILRS+T +FNY+ C+IEESN    ++IDEL  SLLV EQ+MK 
Sbjct: 20  KMSSHGETVTQSMIVEKILRSLTSKFNYVACSIEESNDTTAMTIDELQSSLLVQEQRMKN 79

Query: 532 HKEE-DQVLKVSNENRTXXXXXXXXXXXXXXXXRQFS-----HKESIECYKCHKLGHYQS 371
            KEE +Q+LKVSN  R                 R        +K+++ECYKCHKLGHYQ+
Sbjct: 80  QKEEQEQILKVSNGGRGYSGRGENSYGRDRGRGRGRGRSVKINKDAVECYKCHKLGHYQA 139

Query: 370 ECPKL-EENASYAEFDEDEEVLLMAYNTDIVVDSEG----KIWFLDSGCSNHMCGVKEWF 206
           +CP   E+N +YA+FDE++E+LLM   T I    E     ++WFLDSGCS HM G K W 
Sbjct: 140 DCPSWKEDNVNYAQFDEEQEILLMEQETVIKEPQESGEKLELWFLDSGCSKHMVGNKNWL 199

Query: 205 FNLDTKFRTTVRLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHR 26
           F  D  F+ +V+LGD+S M V GKGN+KL ++   Q++T VYY P LKNNL+SIGQLQ +
Sbjct: 200 FVYDDTFKDSVKLGDDSKMSVEGKGNLKLHIESFTQILTNVYYSPELKNNLISIGQLQQK 259

Query: 25  GLTVIFEN 2
            LTVIF+N
Sbjct: 260 NLTVIFKN 267


>gb|PRQ17740.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 1302

 Score =  265 bits (676), Expect = 7e-78
 Identities = 132/239 (55%), Positives = 173/239 (72%), Gaps = 2/239 (0%)
 Frame = -3

Query: 712 KMKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKV 533
           KMK H EN+ E ++VEKILRSMT +F+Y+VC+IEESN +  ++IDEL  SLLVHEQ+M+ 
Sbjct: 145 KMKAHSENMAELVIVEKILRSMTAKFDYVVCSIEESNNLTTMTIDELQSSLLVHEQRMRG 204

Query: 532 HKEEDQVLKVSNENRTXXXXXXXXXXXXXXXXR--QFSHKESIECYKCHKLGHYQSECPK 359
           H  E+Q LKV+ E+RT                R  Q  +K  +ECYKCHKLGH+Q ECP+
Sbjct: 205 HGGEEQALKVTYEDRTSGRGRGRGGFRGRGRGRSRQQFNKALVECYKCHKLGHFQYECPE 264

Query: 358 LEENASYAEFDEDEEVLLMAYNTDIVVDSEGKIWFLDSGCSNHMCGVKEWFFNLDTKFRT 179
            E+ A+YAE DE EE+LLMAY  ++      ++WFLDSGCSNHM G K+WF +L+ +FR 
Sbjct: 265 WEKGANYAELDEKEEMLLMAY-VELNNSKMEEVWFLDSGCSNHMSGNKKWFIDLNEQFRQ 323

Query: 178 TVRLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHRGLTVIFEN 2
           +V+LG+NS M V GKGNV+LQ +G+ QV T VYYIP LKNNLLSIGQLQ +G+ ++ +N
Sbjct: 324 SVKLGNNSKMAVMGKGNVRLQANGVTQVFTDVYYIPELKNNLLSIGQLQEKGVAILIQN 382


>gb|PNX93789.1| copia-type polyprotein [Trifolium pratense]
          Length = 1347

 Score =  265 bits (676), Expect = 8e-78
 Identities = 131/240 (54%), Positives = 179/240 (74%), Gaps = 3/240 (1%)
 Frame = -3

Query: 712 KMKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKV 533
           KMK HGE++ + ++ EKILRSM  +F+Y+VC+IEESN +D ++IDEL  SLLVHEQ+M  
Sbjct: 147 KMKAHGESMSQTIITEKILRSMISKFDYVVCSIEESNNLDTMTIDELQSSLLVHEQRMTS 206

Query: 532 HKEEDQVLKVSNENR--TXXXXXXXXXXXXXXXXRQFSHKESIECYKCHKLGHYQSECPK 359
           H+EE+QVLK+S+E+R                   RQ  +K  IEC++CH+LGHYQ ECP 
Sbjct: 207 HREEEQVLKISHEDRYGRGRGRGMFRGGRGRGRGRQPYNKALIECFRCHQLGHYQYECPD 266

Query: 358 LEENASYAEFDE-DEEVLLMAYNTDIVVDSEGKIWFLDSGCSNHMCGVKEWFFNLDTKFR 182
            E+ A+YAEF+E +EE+LLM+Y  +I  D + ++WFLDSGCSNHM G K+WF +LD  FR
Sbjct: 267 WEQKANYAEFEEKEEEILLMSY-VEIKHDEKEEMWFLDSGCSNHMSGNKKWFSDLDESFR 325

Query: 181 TTVRLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHRGLTVIFEN 2
            TV+LG++S M V GKGNV+++V+G  QVI+ VYYIP LKNNLLSIGQLQ +GL+++ ++
Sbjct: 326 HTVKLGNDSRMAVIGKGNVRMRVNGFTQVISNVYYIPELKNNLLSIGQLQDKGLSILIQH 385


>gb|PNX99782.1| copia-type polyprotein [Trifolium pratense]
          Length = 912

 Score =  260 bits (665), Expect = 1e-77
 Identities = 135/243 (55%), Positives = 170/243 (69%), Gaps = 6/243 (2%)
 Frame = -3

Query: 712 KMKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKV 533
           +M  HGE +++ MVVEKILRSM E+FNY+VC+IEESN V  LSIDEL  SLLVHEQ+M+ 
Sbjct: 143 RMTAHGERIEQVMVVEKILRSMHEKFNYVVCSIEESNDVTTLSIDELQSSLLVHEQRMRG 202

Query: 532 HKE------EDQVLKVSNENRTXXXXXXXXXXXXXXXXRQFSHKESIECYKCHKLGHYQS 371
            K+      +DQ LK+SN  R                   +  KESIEC+KCHKLGHY++
Sbjct: 203 QKDYQKDHSDDQALKMSNSGRGGGRSASRGLGRG------WQSKESIECFKCHKLGHYRN 256

Query: 370 ECPKLEENASYAEFDEDEEVLLMAYNTDIVVDSEGKIWFLDSGCSNHMCGVKEWFFNLDT 191
           ECP  E N  YAE  E+EE+LLMAY+       +G +W++DSGCSNHM G KEWFF+ D 
Sbjct: 257 ECPDWEAN--YAEHREEEEMLLMAYSYTKEDFIKG-MWYIDSGCSNHMTGTKEWFFDFDD 313

Query: 190 KFRTTVRLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHRGLTVI 11
           KFR +V+LG++S M V G+GNVKL +DG + VIT VYY+P L NNLLS+GQLQ RGLT +
Sbjct: 314 KFRESVKLGNDSKMTVMGRGNVKLNMDGKIHVITNVYYLPGLSNNLLSVGQLQQRGLTTV 373

Query: 10  FEN 2
           F+N
Sbjct: 374 FKN 376


>dbj|GAU32260.1| hypothetical protein TSUD_53880 [Trifolium subterraneum]
          Length = 1172

 Score =  263 bits (671), Expect = 2e-77
 Identities = 132/245 (53%), Positives = 170/245 (69%), Gaps = 8/245 (3%)
 Frame = -3

Query: 712 KMKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKV 533
           KM  HGE + +  +VE  LRS+T RFNY+VC+IE+SN V  +S+DEL  SLLV EQ+MK 
Sbjct: 120 KMSAHGETMTQGTIVEIFLRSLTSRFNYVVCSIEQSNDVTTMSVDELQSSLLVQEQRMKN 179

Query: 532 HKEEDQVLKVSNENRTXXXXXXXXXXXXXXXXRQFS-----HKESIECYKCHKLGHYQSE 368
            K+E+Q+LKVS   R                  +       +K+ IECYKCHKLGH+QSE
Sbjct: 180 QKDEEQILKVSYGGRGSSRGEGSNRGRGNRGRGRGGRAGKFNKDMIECYKCHKLGHFQSE 239

Query: 367 CPKLEE-NASYAEFDEDEEVLLMAYNT--DIVVDSEGKIWFLDSGCSNHMCGVKEWFFNL 197
           CP  EE NA+YA+FDE+EE+LLMA  T  D  +D   ++WFLDSGCSNHM G K W F+ 
Sbjct: 240 CPSWEEDNANYAQFDEEEEILLMAQETKEDGKIDVNHELWFLDSGCSNHMVGNKSWLFDY 299

Query: 196 DTKFRTTVRLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHRGLT 17
           D  F+ +V+LGD+S M V GKGN+KL ++G VQ++T VYY+P LKNNLLSIGQLQ + LT
Sbjct: 300 DDTFKDSVKLGDDSRMAVVGKGNLKLHIEGYVQILTNVYYLPGLKNNLLSIGQLQQKNLT 359

Query: 16  VIFEN 2
           +IF+N
Sbjct: 360 IIFKN 364


>gb|PNX90684.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Trifolium pratense]
          Length = 372

 Score =  245 bits (626), Expect = 8e-77
 Identities = 123/246 (50%), Positives = 169/246 (68%), Gaps = 10/246 (4%)
 Frame = -3

Query: 709 MKIHGENVQERMVVEKILRSMTERFNYMVCAIEESNKVDVLSIDELHGSLLVHEQKMKVH 530
           MK HGE +++ +VVEKILRSM  +F+Y+V AIEESN ++ ++IDEL  SLLVHEQ+M  H
Sbjct: 1   MKAHGERMEQLVVVEKILRSMNRQFDYVVAAIEESNDLNTMTIDELQSSLLVHEQRMNSH 60

Query: 529 KEEDQVLKVSNENRTXXXXXXXXXXXXXXXXR----------QFSHKESIECYKCHKLGH 380
             E+QVLKV+ ++ +                           Q  ++E IECYKCHKLGH
Sbjct: 61  IREEQVLKVTLDDNSERNNNDRFGGRGRGRGGFRGRGRGRGRQQFNRELIECYKCHKLGH 120

Query: 379 YQSECPKLEENASYAEFDEDEEVLLMAYNTDIVVDSEGKIWFLDSGCSNHMCGVKEWFFN 200
           +Q ECP  E NA YAE +E EE+LLMA+      +   ++WFLDSGCSNHM G K+WF +
Sbjct: 121 FQYECPDWERNAHYAELNESEEILLMAHAEH--EEKSVELWFLDSGCSNHMTGNKKWFTD 178

Query: 199 LDTKFRTTVRLGDNSLMKVTGKGNVKLQVDGLVQVITGVYYIPTLKNNLLSIGQLQHRGL 20
           +D +++ +V+LG+N  M V G+GNVKL V+G++QVIT VYY+P LKNNL+SIGQL  +G+
Sbjct: 179 IDEQYQQSVKLGNNFKMAVVGRGNVKLHVNGIMQVITNVYYVPELKNNLISIGQLIEKGV 238

Query: 19  TVIFEN 2
           +V+ +N
Sbjct: 239 SVLIQN 244

