BLASTX nr result

ID: Astragalus23_contig00019362 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00019362
         (1015 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KYP40819.1| Retrovirus-related Pol polyprotein from transposo...   306   1e-96
gb|KYP35468.1| Retrovirus-related Pol polyprotein from transposo...   246   5e-74
gb|PNX95204.1| copia-type polyprotein [Trifolium pratense]            257   1e-73
dbj|GAU43011.1| hypothetical protein TSUD_28300 [Trifolium subte...   242   5e-72
dbj|GAU32260.1| hypothetical protein TSUD_53880 [Trifolium subte...   251   1e-71
dbj|GAU16533.1| hypothetical protein TSUD_167640 [Trifolium subt...   249   3e-71
gb|PNX77239.1| copia-type polyprotein, partial [Trifolium pratense]   246   3e-71
gb|PNX74620.1| putative LRR receptor-like protein kinase, partia...   245   5e-71
gb|PNY01730.1| copia-type polyprotein, partial [Trifolium pratense]   245   1e-70
gb|KYP38784.1| Retrovirus-related Pol polyprotein from transposo...   238   2e-70
gb|PNX72392.1| copia-type polyprotein, partial [Trifolium pratense]   244   2e-70
gb|PNX65055.1| retrotransposon-related protein, partial [Trifoli...   229   3e-70
gb|PNX83704.1| copia-type polyprotein, partial [Trifolium pratense]   239   3e-70
gb|PNX77752.1| copia-type polyprotein, partial [Trifolium pratense]   241   4e-70
gb|PNX74679.1| copia-type polyprotein, partial [Trifolium pratense]   240   5e-70
dbj|GAU37106.1| hypothetical protein TSUD_278930 [Trifolium subt...   244   8e-70
gb|PNX99782.1| copia-type polyprotein [Trifolium pratense]            243   1e-69
dbj|GAU36022.1| hypothetical protein TSUD_211600 [Trifolium subt...   245   2e-69
gb|PNX90684.1| retrovirus-related Pol polyprotein from transposo...   230   3e-69
emb|CBI37296.3| unnamed protein product, partial [Vitis vinifera]     245   3e-69

>gb|KYP40819.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 582

 Score =  306 bits (785), Expect = 1e-96
 Identities = 143/210 (68%), Positives = 172/210 (81%), Gaps = 4/210 (1%)
 Frame = -1

Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836
            DSGCSTHMCG K+WF  LD  FRE VKLGDGR L+V+G+GNVKL ++GRIQ+ITGVYYIP
Sbjct: 294  DSGCSTHMCGVKRWFIDLDEQFREVVKLGDGRTLSVMGRGNVKLCVEGRIQIITGVYYIP 353

Query: 835  NLR----SIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668
            NL     SIGQLQQK LK+ F+DDKC+V HKEKGL+MTS+MA+NRMF +KA PV  +  C
Sbjct: 354  NLMNNLLSIGQLQQKKLKIIFDDDKCRVYHKEKGLLMTSYMANNRMFPIKAKPVITEAAC 413

Query: 667  LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488
            +  ++ ++T++WH+RYGHLSY GL  LN K MV+GLPELKE  KVC EC +GKQHRD +S
Sbjct: 414  IQTSNVDSTEMWHKRYGHLSYGGLKLLNQKAMVKGLPELKEMDKVCPECAVGKQHRDAIS 473

Query: 487  KQSTWRASRKLELIHSDICGPINPASNSNK 398
            KQSTWRA+R+LELIHSDICGP  P SNSN+
Sbjct: 474  KQSTWRATRRLELIHSDICGPSTPTSNSNR 503


>gb|KYP35468.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 508

 Score =  246 bits (628), Expect = 5e-74
 Identities = 115/174 (66%), Positives = 142/174 (81%), Gaps = 4/174 (2%)
 Frame = -1

Query: 907 IGKGNVKLRIKGRIQVITGVYYIPNLR----SIGQLQQKNLKVEFEDDKCKVIHKEKGLI 740
           +G+GNVKL ++GRIQ+ITGVYYIPNL     SIGQLQQK LK+ F+DDKC+V HKEKGL+
Sbjct: 1   MGRGNVKLCVEGRIQIITGVYYIPNLMNNLLSIGQLQQKKLKIIFDDDKCRVYHKEKGLL 60

Query: 739 MTSFMASNRMFAVKATPVKIDTNCLHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGL 560
           MTS+MA+NRMF +KA  V  +  C+  ++ ++T++WH+RYGHLSY GL  LN K MV+GL
Sbjct: 61  MTSYMANNRMFPIKAKLVITEAACIQTSNVDSTEMWHKRYGHLSYGGLKLLNQKAMVKGL 120

Query: 559 PELKETTKVCAECQIGKQHRDTMSKQSTWRASRKLELIHSDICGPINPASNSNK 398
           PELKE  KVC EC +GKQHRD +SKQSTWRA+R+LELIHSDICGP  P SNSN+
Sbjct: 121 PELKEMDKVCPECAVGKQHRDAISKQSTWRATRRLELIHSDICGPSTPTSNSNR 174


>gb|PNX95204.1| copia-type polyprotein [Trifolium pratense]
          Length = 1328

 Score =  257 bits (657), Expect = 1e-73
 Identities = 128/208 (61%), Positives = 157/208 (75%), Gaps = 4/208 (1%)
 Frame = -1

Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836
            DSGCS HM G K+W    D SFRESVKLGD  K++V+GKG +KL I G  QVI+ VYY+P
Sbjct: 301  DSGCSNHMIGNKEWLFDFDDSFRESVKLGDDSKMHVMGKGKLKLYIGGITQVISEVYYLP 360

Query: 835  ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668
                NL SIGQLQQKNL + F++D CKV H+E+GLIM++ M++NRMF +KAT +     C
Sbjct: 361  GLKNNLLSIGQLQQKNLTIVFKNDICKVFHEERGLIMSTPMSANRMFVIKATVLV--PMC 418

Query: 667  LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488
            L   +E  +QLWH+RYGHLSYKGL+ L  KEMVRGLP LKE + VC++C  GKQHR+ + 
Sbjct: 419  LQTTNEIDSQLWHKRYGHLSYKGLNTLVKKEMVRGLPALKEASDVCSDCLFGKQHREVIP 478

Query: 487  KQSTWRASRKLELIHSDICGPINPASNS 404
            K+  WRA+ KLELIHSDICGPINPASNS
Sbjct: 479  KKVNWRATHKLELIHSDICGPINPASNS 506


>dbj|GAU43011.1| hypothetical protein TSUD_28300 [Trifolium subterraneum]
          Length = 538

 Score =  242 bits (617), Expect = 5e-72
 Identities = 115/211 (54%), Positives = 151/211 (71%), Gaps = 4/211 (1%)
 Frame = -1

Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836
            DSGCS HM G K W    D ++R+SVKLGD  K+ V+GKGNVKL I GR+ VI+ VYYIP
Sbjct: 293  DSGCSNHMIGNKDWMYEFDETYRDSVKLGDDSKMQVMGKGNVKLSINGRVHVISSVYYIP 352

Query: 835  ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668
                NL SIGQ+QQKN+ + F +D CK  H EKGL+ ++ M++NRM+ +KA  + +   C
Sbjct: 353  GLKTNLLSIGQIQQKNVTIVFNEDTCKAYHDEKGLLFSTHMSANRMYVIKA--LVVTPRC 410

Query: 667  LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488
            L  A ++ +QLWH RYGHLS KGL+ L  K+MV+GLP LK+ ++ CA+C  GKQHR+ + 
Sbjct: 411  LQAAKKDVSQLWHNRYGHLSIKGLNTLTNKDMVKGLPALKDLSEKCADCLTGKQHREKIP 470

Query: 487  KQSTWRASRKLELIHSDICGPINPASNSNKS 395
            KQ+ WRA+  L+L+HSDICGPINP SN   S
Sbjct: 471  KQAKWRAT--LKLVHSDICGPINPTSNGGNS 499


>dbj|GAU32260.1| hypothetical protein TSUD_53880 [Trifolium subterraneum]
          Length = 1172

 Score =  251 bits (640), Expect = 1e-71
 Identities = 125/233 (53%), Positives = 157/233 (67%), Gaps = 8/233 (3%)
 Frame = -1

Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836
            DSGCS HM G K W    D +F++SVKLGD  ++ V+GKGN+KL I+G +Q++T VYY+P
Sbjct: 282  DSGCSNHMVGNKSWLFDYDDTFKDSVKLGDDSRMAVVGKGNLKLHIEGYVQILTNVYYLP 341

Query: 835  ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668
                NL SIGQLQQKNL + F++D CKV H EKGLIMTS M+ NRM+ +KA  V   + C
Sbjct: 342  GLKNNLLSIGQLQQKNLTIIFKNDTCKVYHDEKGLIMTSHMSMNRMYVIKAPVVIPQSQC 401

Query: 667  LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488
              I   +   LWH+RYGHLS+KG++ L  K MV GLP+LKE T+ C  C  GKQ R+ + 
Sbjct: 402  FKITKNDEDTLWHKRYGHLSFKGINVLVQKNMVIGLPKLKEPTEKCTHCMKGKQQRENVP 461

Query: 487  KQSTWRASRKLELIHSDICGPINPASNSNK----SDVGDHARFTHGCLHEAKA 341
            K+S WRAS KLELIHSDICGPINP SN  K    +   D +R T  C    K+
Sbjct: 462  KKSHWRASHKLELIHSDICGPINPESNGKKRYFITFTDDMSRKTWTCFISEKS 514


>dbj|GAU16533.1| hypothetical protein TSUD_167640 [Trifolium subterraneum]
          Length = 1103

 Score =  249 bits (637), Expect = 3e-71
 Identities = 120/210 (57%), Positives = 157/210 (74%), Gaps = 4/210 (1%)
 Frame = -1

Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836
            DSGCS HMCG K+WF  LD SFRESV+LGD  +++V+GKGNVKL++ G +Q+ITGVY+IP
Sbjct: 297  DSGCSNHMCGIKEWFHDLDESFRESVRLGDDSQMSVMGKGNVKLQMNGIVQIITGVYFIP 356

Query: 835  ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668
                NL S+GQLQ+KNL    +++ CKV H++KG+IM S MASN++F + A   ++   C
Sbjct: 357  KLKNNLLSLGQLQEKNLTFVIKNNWCKVYHRDKGMIMCSQMASNKLFPIMAEAKQV---C 413

Query: 667  LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488
            L    E+ TQLWH RYGHL+ KGL  L  K MV GLP+ +E+  VCA+C  GKQHR+++ 
Sbjct: 414  LQANIEDITQLWHCRYGHLNIKGLQNLQQKNMVIGLPKFEESNHVCADCLRGKQHRESIP 473

Query: 487  KQSTWRASRKLELIHSDICGPINPASNSNK 398
            K S W++S++LELIHSDICGPI P SNSNK
Sbjct: 474  KTSNWKSSKRLELIHSDICGPITPVSNSNK 503


>gb|PNX77239.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 803

 Score =  246 bits (627), Expect = 3e-71
 Identities = 127/240 (52%), Positives = 163/240 (67%), Gaps = 8/240 (3%)
 Frame = -1

Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836
            DSGCS HM GTK W   LD +FRESVKLG+  K+ V+GKGN++L I GRI +IT VYY+P
Sbjct: 59   DSGCSNHMIGTKDWLFDLDETFRESVKLGNDSKMAVMGKGNLRLDIGGRIIIITDVYYLP 118

Query: 835  ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668
                NL SIGQLQQK L + F+++ C++ H++KGLI+++ M  NRM+ V+AT   I  NC
Sbjct: 119  GLGNNLLSIGQLQQKGLTIVFKNNVCQLFHEDKGLILSTEMTMNRMYIVRATV--IIPNC 176

Query: 667  LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488
            L +   E T+LWH+RY HLS KGL  LN K MV+GLPEL++T + C +C  GKQHR+ M 
Sbjct: 177  LQVTKAEETELWHKRYAHLSIKGLRVLNKKHMVKGLPELRDTEEKCTDCLSGKQHRENMP 236

Query: 487  KQSTWRASRKLELIHSDICGPINPASNSNK----SDVGDHARFTHGCLHEAKACL*LVVK 320
            KQ+ WRAS  LELIHSDICGPI P SN       +   D +R T   + + K+C   V K
Sbjct: 237  KQANWRASEILELIHSDICGPITPKSNGGNRYFLTFTDDFSRKTWTYIIQEKSCALSVFK 296


>gb|PNX74620.1| putative LRR receptor-like protein kinase, partial [Trifolium
            pratense]
          Length = 814

 Score =  245 bits (626), Expect = 5e-71
 Identities = 119/207 (57%), Positives = 153/207 (73%), Gaps = 4/207 (1%)
 Frame = -1

Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836
            DSGC+ HM G K W   LD SFRESVKLG+  K+ V+GK NVKL I+GRI VIT VYY+P
Sbjct: 21   DSGCNNHMVGNKDWLFELDESFRESVKLGNDSKMAVMGKCNVKLNIEGRIHVITDVYYLP 80

Query: 835  ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668
                NL SIGQLQQK + + F+++ C++ H+EKGLI+++ M +N+M+ + A PV I  NC
Sbjct: 81   GLSNNLLSIGQLQQKGITIIFKNNTCQLFHEEKGLIISTAMTTNKMYIINA-PV-ITPNC 138

Query: 667  LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488
            L +  +E T LWH+RY HLS KGL  L  K MV+GLPELK+  + C++C  GKQHRD + 
Sbjct: 139  LQMTKDEETDLWHKRYAHLSLKGLKVLTGKNMVKGLPELKDNEEKCSDCLSGKQHRDNIP 198

Query: 487  KQSTWRASRKLELIHSDICGPINPASN 407
            KQ+ WRAS+KLEL+HSDICGP+NP SN
Sbjct: 199  KQTNWRASQKLELVHSDICGPLNPKSN 225


>gb|PNY01730.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 861

 Score =  245 bits (625), Expect = 1e-70
 Identities = 117/207 (56%), Positives = 150/207 (72%), Gaps = 4/207 (1%)
 Frame = -1

Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836
            DSGCS HM G K WF   D ++R+SVKLGD  ++NV+GKGNVKL I GR+ ++T VY+IP
Sbjct: 293  DSGCSYHMAGNKDWFYDFDENYRDSVKLGDDSRMNVMGKGNVKLSINGRVHILTDVYFIP 352

Query: 835  ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668
                NL SIGQ+QQKN  + F++D CKV H+EKGL+  ++M++NRM+ VKA  +     C
Sbjct: 353  GLKTNLLSIGQIQQKNTTIVFKNDICKVYHREKGLLFATYMSTNRMYVVKAEVIA--PRC 410

Query: 667  LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488
            L  +    +QLWH RY HLS KGL+ L  K+MV+GLP LKE  + C  C +GKQH+D + 
Sbjct: 411  LQASKIVNSQLWHNRYCHLSIKGLNILVIKDMVKGLPALKELDENCVNCLVGKQHKDAIP 470

Query: 487  KQSTWRASRKLELIHSDICGPINPASN 407
            KQ+TWRAS KLEL+HSDICGPINP SN
Sbjct: 471  KQATWRASLKLELVHSDICGPINPKSN 497


>gb|KYP38784.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Cajanus cajan]
          Length = 560

 Score =  238 bits (608), Expect = 2e-70
 Identities = 112/210 (53%), Positives = 146/210 (69%), Gaps = 4/210 (1%)
 Frame = -1

Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836
            DSGCS HMCG K  F  L+  FR+ VKLG+  K++V+GKGNV+L++ G   V+T V+Y+P
Sbjct: 271  DSGCSNHMCGDKALFYNLNEDFRQIVKLGNNSKMSVLGKGNVRLKVNGFTHVVTEVFYVP 330

Query: 835  ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668
                NL SIGQLQ+K L +  ++  CK+ H EKGL++ + M +NRMF + A P+     C
Sbjct: 331  ELKNNLLSIGQLQEKGLSILIQNGSCKIYHPEKGLVIQTEMTTNRMFVLTAVPMPQKPTC 390

Query: 667  LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488
            LH   +E   LWH RYGHLS+KGL  L  K+MV GLP LK  T VC  C IGKQHRD + 
Sbjct: 391  LHTTVQEIAHLWHCRYGHLSHKGLRTLQYKKMVHGLPPLKSPTDVCTVCMIGKQHRDPIP 450

Query: 487  KQSTWRASRKLELIHSDICGPINPASNSNK 398
            K++ WRA++KL+LIH+DICGPI+P SNS K
Sbjct: 451  KKANWRATQKLQLIHADICGPISPTSNSKK 480


>gb|PNX72392.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 886

 Score =  244 bits (624), Expect = 2e-70
 Identities = 119/207 (57%), Positives = 153/207 (73%), Gaps = 4/207 (1%)
 Frame = -1

Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836
            DSGCS HM G K+W    D SFRESVKLGD  ++ V+GKGN+KL I G +QVIT VY++P
Sbjct: 298  DSGCSNHMVGNKEWLFDFDDSFRESVKLGDDSRMAVMGKGNLKLNINGMVQVITDVYFLP 357

Query: 835  ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668
                NL SIGQLQQKN+ + FE D+CKV H + GLI+TS M++NRMF ++A+   I   C
Sbjct: 358  GLKNNLLSIGQLQQKNVTIIFEKDQCKVFHDKWGLIITSDMSANRMFIIQASI--ISPMC 415

Query: 667  LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488
            L I+ +  + LWH RY HLS+KGL+ L  K+MV+GLP L+ET +VC++C  GKQ R+ + 
Sbjct: 416  LKISKDSQSHLWHCRYAHLSFKGLNTLVKKDMVKGLPTLQETDEVCSDCATGKQSREAIP 475

Query: 487  KQSTWRASRKLELIHSDICGPINPASN 407
            K + WRAS KL+L+HSDICGPINPASN
Sbjct: 476  KSNNWRASEKLQLVHSDICGPINPASN 502


>gb|PNX65055.1| retrotransposon-related protein, partial [Trifolium pratense]
          Length = 281

 Score =  229 bits (585), Expect = 3e-70
 Identities = 115/207 (55%), Positives = 144/207 (69%), Gaps = 4/207 (1%)
 Frame = -1

Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836
            DSGC+ HM G K W    D SF++SVKLG+  K++V+GKGNVKL I G+I VIT VYY+P
Sbjct: 75   DSGCNNHMIGQKDWLYDFDSSFKDSVKLGNDTKMSVMGKGNVKLFINGKIHVITNVYYLP 134

Query: 835  ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668
                NL S+GQLQ+K + V F+D+ CK  H E GLI T+ M +NRMF + A PV I   C
Sbjct: 135  GLTTNLLSVGQLQEKKVTVVFKDNMCKGYHDENGLIFTTQMTANRMFLISA-PV-IMPMC 192

Query: 667  LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488
            +  + +E TQLWH RYGHLS  GL  L   EMV GLPEL++    C +C  GKQ R+ + 
Sbjct: 193  MQFSKQERTQLWHNRYGHLSVNGLKLLTKLEMVNGLPELEDMEGRCTDCLSGKQQREAIP 252

Query: 487  KQSTWRASRKLELIHSDICGPINPASN 407
            KQ+ WRA+ KL+LIHSDICGPINP+SN
Sbjct: 253  KQAKWRATEKLQLIHSDICGPINPSSN 279


>gb|PNX83704.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 598

 Score =  239 bits (609), Expect = 3e-70
 Identities = 121/212 (57%), Positives = 152/212 (71%), Gaps = 6/212 (2%)
 Frame = -1

Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836
            DSGCS HM G K W    DG+F++SVKLGD  K+ V GKGN+KL I+G  QV+T VYY+P
Sbjct: 315  DSGCSNHMVGNKNWLFEYDGTFKDSVKLGDDSKMAVEGKGNLKLHIEGFTQVLTNVYYLP 374

Query: 835  ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668
                NL SIGQLQQKNL V F++D CKV H+EKGLIM + M  NRMF +KA PV I  +C
Sbjct: 375  GLKNNLLSIGQLQQKNLTVIFKNDTCKVFHEEKGLIMATHMTMNRMFVIKA-PV-IVPHC 432

Query: 667  LHIA--DEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDT 494
            ++ +  + +   LWH+RYGHLS+KG++ L  K MV GLP+LK+  + C+ C  GKQ R  
Sbjct: 433  MNTSSNNHDNANLWHQRYGHLSFKGMNVLAHKGMVIGLPKLKQPDEECSNCMKGKQQRKN 492

Query: 493  MSKQSTWRASRKLELIHSDICGPINPASNSNK 398
            + K+S+WRAS KLEL+HSDICGPINP SN  K
Sbjct: 493  VPKKSSWRASTKLELVHSDICGPINPESNGRK 524


>gb|PNX77752.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 736

 Score =  241 bits (616), Expect = 4e-70
 Identities = 120/210 (57%), Positives = 149/210 (70%), Gaps = 4/210 (1%)
 Frame = -1

Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836
            DSGCS HM G   W    D +F++SVKLGD  K+ V GKGN+KL IKG +Q +T VYY+P
Sbjct: 70   DSGCSNHMVGNNNWLFDYDDTFKDSVKLGDDSKMVVEGKGNLKLYIKGFVQTLTSVYYLP 129

Query: 835  ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668
                NL SIGQLQQKNL + F+DD CKV H EKGLIM + M  NRMF +KA PV I   C
Sbjct: 130  GLKNNLLSIGQLQQKNLTIIFKDDTCKVFHDEKGLIMATTMTFNRMFVIKA-PV-IVPQC 187

Query: 667  LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488
            + I+  + + LWH+RYGHLS+KG++ L  K+MV GLP+LKE+ + C  C  GKQ + +  
Sbjct: 188  MKISGVDDSTLWHQRYGHLSFKGMNVLTQKQMVIGLPKLKESNEKCTNCMKGKQQKQSAP 247

Query: 487  KQSTWRASRKLELIHSDICGPINPASNSNK 398
            K+S+WRAS KLEL+HSDICGPINP SN  K
Sbjct: 248  KKSSWRASTKLELVHSDICGPINPESNGKK 277


>gb|PNX74679.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 694

 Score =  240 bits (613), Expect = 5e-70
 Identities = 119/220 (54%), Positives = 155/220 (70%), Gaps = 4/220 (1%)
 Frame = -1

Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836
            DSGCS HM G K+W    DGSF ESVKLG+  K+ ++GKGNVKL+I G++ VIT VYYIP
Sbjct: 284  DSGCSNHMTGKKEWLHDFDGSFTESVKLGNDSKMAIMGKGNVKLKIAGKVHVITDVYYIP 343

Query: 835  ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668
                NL S+ QLQQKNL V F++D C+VIH EKGLI+T+ +++NRM+ + A PV I  N 
Sbjct: 344  GLSSNLLSVDQLQQKNLTVVFKNDLCQVIHNEKGLILTTQISANRMYMIFA-PV-ILPNY 401

Query: 667  LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488
            + ++  +   LWH +Y HL++KGL  LN K MV+GLPELK+    C +C  GKQHRD++ 
Sbjct: 402  MQMSKTDEYHLWHNKYAHLNFKGLKVLNKKHMVKGLPELKDIEDKCGDCLSGKQHRDSIP 461

Query: 487  KQSTWRASRKLELIHSDICGPINPASNSNKSDVGDHARFT 368
            K+STWRAS +LEL+H+DICGPI P SN      G   + T
Sbjct: 462  KKSTWRASTRLELVHTDICGPIKPESNGGNISQGIKRKLT 501


>dbj|GAU37106.1| hypothetical protein TSUD_278930 [Trifolium subterraneum]
          Length = 1013

 Score =  244 bits (624), Expect = 8e-70
 Identities = 114/207 (55%), Positives = 148/207 (71%), Gaps = 4/207 (1%)
 Frame = -1

Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836
            DSGCS HM G K W    DGS+R+SVKLGD  K+NV+GKGN+K  I G+I VIT V+YIP
Sbjct: 296  DSGCSNHMVGNKDWLYEFDGSYRDSVKLGDDSKMNVMGKGNIKFSIAGKIHVITNVFYIP 355

Query: 835  ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668
                NL SIGQ+QQK++ + F++D CK+ H +KGL+ T+ M+ NRM+ V AT +    NC
Sbjct: 356  GLKSNLLSIGQIQQKHVTIVFKNDLCKIYHDDKGLLFTTHMSPNRMYVVNATVIV--PNC 413

Query: 667  LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488
            L +  ++ +QLWH+RYGHLS KGL+ L  K+MV GLP L +  K C +C  GKQH D M 
Sbjct: 414  LQVTKKDWSQLWHQRYGHLSIKGLNTLAKKDMVNGLPVLDDLDKHCGDCLTGKQHMDAMP 473

Query: 487  KQSTWRASRKLELIHSDICGPINPASN 407
            K++ WRA  KLEL+HSD+CGP+NP SN
Sbjct: 474  KRAVWRAKLKLELVHSDLCGPLNPTSN 500


>gb|PNX99782.1| copia-type polyprotein [Trifolium pratense]
          Length = 912

 Score =  243 bits (620), Expect = 1e-69
 Identities = 115/207 (55%), Positives = 151/207 (72%), Gaps = 4/207 (1%)
 Frame = -1

Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836
            DSGCS HM GTK+WF   D  FRESVKLG+  K+ V+G+GNVKL + G+I VIT VYY+P
Sbjct: 294  DSGCSNHMTGTKEWFFDFDDKFRESVKLGNDSKMTVMGRGNVKLNMDGKIHVITNVYYLP 353

Query: 835  ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668
                NL S+GQLQQ+ L   F+++ C++ H+EKGLI+T+ M  N+M+ VKA+ +    NC
Sbjct: 354  GLSNNLLSVGQLQQRGLTTVFKNNMCQLFHEEKGLILTTKMTFNKMYIVKASMIL--PNC 411

Query: 667  LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488
            L     E T LWH+RY HLS++GL  L  K+MV+GLP LKE+   C +C +GKQHR ++ 
Sbjct: 412  LQATALEETTLWHQRYAHLSFQGLKTLITKQMVKGLPNLKESGDKCTDCLVGKQHRSSIP 471

Query: 487  KQSTWRASRKLELIHSDICGPINPASN 407
            K++ WRAS+KLEL+HSDICGPINP SN
Sbjct: 472  KEANWRASKKLELVHSDICGPINPQSN 498


>dbj|GAU36022.1| hypothetical protein TSUD_211600 [Trifolium subterraneum]
          Length = 1423

 Score =  245 bits (626), Expect = 2e-69
 Identities = 119/220 (54%), Positives = 153/220 (69%), Gaps = 4/220 (1%)
 Frame = -1

Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836
            DSGCS HM G K W   LD S+R++VKLGD  K+NV+GKGNVKL I G+I VITGV+YIP
Sbjct: 298  DSGCSNHMVGNKDWLYELDESYRDTVKLGDDSKMNVMGKGNVKLSIDGKIHVITGVFYIP 357

Query: 835  ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668
                NL SIGQ+QQKN+ + F++D CK+ H +KGL+ T+ M+ NRM+ V A  +      
Sbjct: 358  GLKSNLLSIGQIQQKNVTIVFKNDICKIYHDDKGLLFTTQMSPNRMYVVNANVIM--PKR 415

Query: 667  LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488
            L +  ++ +QLWH RYGHLS KGL+ L  KEMV+GLP L+E  + C +C  GKQHRD + 
Sbjct: 416  LQVTKKDWSQLWHNRYGHLSIKGLNTLARKEMVKGLPPLEELNEHCVDCLTGKQHRDAIP 475

Query: 487  KQSTWRASRKLELIHSDICGPINPASNSNKSDVGDHARFT 368
            KQ+ WRA  KLEL+H DICGP+NP SN   S+ G   + T
Sbjct: 476  KQAVWRAKLKLELVHLDICGPLNPISNGGNSNQGIKRQLT 515


>gb|PNX90684.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 372

 Score =  230 bits (586), Expect = 3e-69
 Identities = 106/210 (50%), Positives = 150/210 (71%), Gaps = 4/210 (1%)
 Frame = -1

Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836
            DSGCS HM G KKWF+ +D  +++SVKLG+  K+ V+G+GNVKL + G +QVIT VYY+P
Sbjct: 162  DSGCSNHMTGNKKWFTDIDEQYQQSVKLGNNFKMAVVGRGNVKLHVNGIMQVITNVYYVP 221

Query: 835  ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668
                NL SIGQL +K + V  ++ +C+  H ++GL + + M++NRMF   ATP+   ++C
Sbjct: 222  ELKNNLISIGQLIEKGVSVLIQNGECRCYHSKEGLFLQTKMSANRMFVFHATPMSQFSSC 281

Query: 667  LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488
               A E+ T LWH RYGHL+Y GL  L +K+MV GLPE++  +K+C +C +GKQ R  + 
Sbjct: 282  FKTASEDETHLWHCRYGHLNYNGLKTLQSKKMVTGLPEIRTPSKLCNDCVMGKQQRKPIP 341

Query: 487  KQSTWRASRKLELIHSDICGPINPASNSNK 398
            K+S WRA+ KL+LIHSDICGPI P+++S K
Sbjct: 342  KKSLWRATHKLQLIHSDICGPITPSTSSGK 371


>emb|CBI37296.3| unnamed protein product, partial [Vitis vinifera]
          Length = 3048

 Score =  245 bits (625), Expect = 3e-69
 Identities = 112/210 (53%), Positives = 152/210 (72%), Gaps = 4/210 (1%)
 Frame = -1

Query: 1015 DSGCSTHMCGTKKWFSYLDGSFRESVKLGDGRKLNVIGKGNVKLRIKGRIQVITGVYYIP 836
            DSGCS HMCG K +FS  DG+FR+SVKLG+   ++V+GKGNV+L++    Q+ITGV+Y+P
Sbjct: 323  DSGCSNHMCGKKDYFSDFDGTFRDSVKLGNNTSMSVLGKGNVRLKVNEMTQIITGVFYVP 382

Query: 835  ----NLRSIGQLQQKNLKVEFEDDKCKVIHKEKGLIMTSFMASNRMFAVKATPVKIDTNC 668
                NL SIGQLQ+K L + F+  KCKV H +KGLIM + M+SNRMF + A    I + C
Sbjct: 383  ELKNNLLSIGQLQEKGLTILFQHGKCKVFHSQKGLIMDTKMSSNRMFMLYALSQPISSTC 442

Query: 667  LHIADEETTQLWHRRYGHLSYKGLSPLNAKEMVRGLPELKETTKVCAECQIGKQHRDTMS 488
             +   E+  QLWH RYGHLS++GL  L  ++MV GLP+ +  +K+C +C +GKQHR ++ 
Sbjct: 443  FNTVTEDILQLWHCRYGHLSFQGLKTLQQRKMVNGLPQFQPPSKLCKDCLVGKQHRSSIP 502

Query: 487  KQSTWRASRKLELIHSDICGPINPASNSNK 398
            K+S WRA+  L+L+H+DICGPINP SNS K
Sbjct: 503  KKSNWRAAEILQLVHADICGPINPISNSKK 532

