BLASTX nr result

ID: Astragalus22_contig00034596 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00034596
         (1033 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI37296.3| unnamed protein product, partial [Vitis vinifera]      76   4e-24
gb|PNX66220.1| copia-type polyprotein, partial [Trifolium pratense]    73   6e-24
gb|KYP50278.1| Retrovirus-related Pol polyprotein from transposo...    80   4e-23
gb|PNX99782.1| copia-type polyprotein [Trifolium pratense]             77   1e-22
dbj|GAU26746.1| hypothetical protein TSUD_317440 [Trifolium subt...   107   7e-22
gb|PRQ17740.1| putative RNA-directed DNA polymerase [Rosa chinen...   103   2e-20
gb|PRQ42077.1| putative RNA-directed DNA polymerase [Rosa chinen...   102   6e-20
gb|PRQ52345.1| putative RNA-directed DNA polymerase [Rosa chinen...    97   2e-18
gb|PNX93789.1| copia-type polyprotein [Trifolium pratense]             97   2e-18
gb|KYP44586.1| Retrovirus-related Pol polyprotein from transposo...    96   8e-18
ref|XP_017188308.1| PREDICTED: uncharacterized protein LOC108173...    92   6e-17
gb|PNY15642.1| copia-type polyprotein [Trifolium pratense]             92   9e-17
dbj|GAU41840.1| hypothetical protein TSUD_177510 [Trifolium subt...    92   2e-16
gb|PNX89974.1| copia-type polyprotein, partial [Trifolium pratense]    91   2e-16
gb|PNX95763.1| retrotransposon-related protein, partial [Trifoli...    87   6e-15
gb|PNX94698.1| copia-type polyprotein [Trifolium pratense]             87   9e-15
gb|PNX77752.1| copia-type polyprotein, partial [Trifolium pratense]    86   1e-14
gb|PNX91151.1| copia-type polyprotein, partial [Trifolium pratense]    85   2e-14
gb|PNX96091.1| retrotransposon-related protein [Trifolium pratense]    86   2e-14
gb|KYP37051.1| Retrovirus-related Pol polyprotein from transposo...    85   3e-14

>emb|CBI37296.3| unnamed protein product, partial [Vitis vinifera]
          Length = 3048

 Score = 76.3 bits (186), Expect(2) = 4e-24
 Identities = 38/71 (53%), Positives = 46/71 (64%)
 Frame = +3

Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413
           Q  + ER+NRTI+NMVR +LS KKLPK FW +AV W V V  RSPT A+       AW  
Sbjct: 613 QNGVAERKNRTIMNMVRSMLSAKKLPKTFWPEAVNWTVHVLNRSPTFAVQNKTPEEAWGK 672

Query: 414 LKSSVHFFQNF 446
           LK SV +F+ F
Sbjct: 673 LKPSVDYFRVF 683



 Score = 65.1 bits (157), Expect(2) = 4e-24
 Identities = 35/80 (43%), Positives = 51/80 (63%), Gaps = 8/80 (10%)
 Frame = +1

Query: 433 FFRTFRCVAHLHVHGAQRKKLDNS--------VSEESKSYMLYDPIAKRILINRDVKFVR 588
           +FR F C++H+HV  ++R KLD+         VSEESK+Y LYDPI+++I+I+RDV F  
Sbjct: 679 YFRVFGCLSHVHVPDSKRTKLDDKSFSCVLLGVSEESKAYRLYDPISQKIIISRDVVFEE 738

Query: 589 MKHGSGEKKAQIAKFRLLIW 648
            K+   +KK + A    L W
Sbjct: 739 DKNWDWDKKYEEAIVCDLEW 758


>gb|PNX66220.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 268

 Score = 73.2 bits (178), Expect(2) = 6e-24
 Identities = 32/71 (45%), Positives = 46/71 (64%)
 Frame = +3

Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413
           Q  + ER+NRT+LN+VR ++  + +PK FW +A+KWA +V  RSPT ++       AW G
Sbjct: 138 QNGVSERKNRTLLNIVRSMIHARSVPKRFWPEAIKWATYVMNRSPTLSVKDMTPEEAWSG 197

Query: 414 LKSSVHFFQNF 446
            K SVH F+ F
Sbjct: 198 RKPSVHHFKVF 208



 Score = 67.4 bits (163), Expect(2) = 6e-24
 Identities = 33/61 (54%), Positives = 42/61 (68%), Gaps = 8/61 (13%)
 Frame = +1

Query: 436 FRTFRCVAHLHVHGAQRKKLDNS--------VSEESKSYMLYDPIAKRILINRDVKFVRM 591
           F+ F CVAH+H+H +QRKKLD+         VSEESK+Y LYDPI  +I+I+RDV F   
Sbjct: 205 FKVFGCVAHVHIHDSQRKKLDDKSKKCILLGVSEESKAYKLYDPIENKIIISRDVIFEES 264

Query: 592 K 594
           K
Sbjct: 265 K 265


>gb|KYP50278.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 180

 Score = 79.7 bits (195), Expect(2) = 4e-23
 Identities = 37/71 (52%), Positives = 47/71 (66%)
 Frame = +3

Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413
           Q  + ER+NRTI+N+VR +LSEKKLPK FW +AV W  +V  RSPT A+       AW G
Sbjct: 39  QNGVAERKNRTIMNLVRTLLSEKKLPKSFWPEAVNWVAYVLNRSPTLAVKNQMPEEAWSG 98

Query: 414 LKSSVHFFQNF 446
           +K SV  F+ F
Sbjct: 99  VKPSVEHFRVF 109



 Score = 58.2 bits (139), Expect(2) = 4e-23
 Identities = 29/57 (50%), Positives = 38/57 (66%), Gaps = 8/57 (14%)
 Frame = +1

Query: 436 FRTFRCVAHLHVHGAQRKKLDNS--------VSEESKSYMLYDPIAKRILINRDVKF 582
           FR F CV H+HV  A+R KL+N         +SEESK Y LY+PI ++I+I+RDV F
Sbjct: 106 FRVFGCVTHVHVPDARRTKLENKSCKCVLLGMSEESKGYRLYNPITRKIVISRDVVF 162


>gb|PNX99782.1| copia-type polyprotein [Trifolium pratense]
          Length = 912

 Score = 76.6 bits (187), Expect(2) = 1e-22
 Identities = 34/71 (47%), Positives = 47/71 (66%)
 Frame = +3

Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413
           Q  + ER+NRT+LNMVR ++S + +PK+FW +A KWA +V  R PT A+       AW G
Sbjct: 582 QNGVSERKNRTVLNMVRSMISARSVPKKFWPEAAKWATYVMNRCPTHAVKNVTPEEAWSG 641

Query: 414 LKSSVHFFQNF 446
           +K SVH F+ F
Sbjct: 642 IKPSVHHFRVF 652



 Score = 59.3 bits (142), Expect(2) = 1e-22
 Identities = 30/68 (44%), Positives = 42/68 (61%), Gaps = 8/68 (11%)
 Frame = +1

Query: 436 FRTFRCVAHLHVHGAQRKKLDNS--------VSEESKSYMLYDPIAKRILINRDVKFVRM 591
           FR F C+AH H+    RKKLDN         VSEESK+Y LY+PI ++I+++R V F  +
Sbjct: 649 FRVFGCLAHAHIPDVHRKKLDNKSIACVFLGVSEESKAYKLYNPIERKIIVSRVVVFEEL 708

Query: 592 KHGSGEKK 615
           K  +  K+
Sbjct: 709 KGWNWNKQ 716


>dbj|GAU26746.1| hypothetical protein TSUD_317440 [Trifolium subterraneum]
          Length = 1608

 Score =  107 bits (268), Expect = 7e-22
 Identities = 93/315 (29%), Positives = 148/315 (46%), Gaps = 21/315 (6%)
 Frame = +3

Query: 150  GGEQLWAKVE---RNQGVHLCHIFLLPYGGIQKRLVERENRTILNMVRCILSEKKLPKEF 320
            GGE +  + E   ++QG+  C      Y   Q  + ER+NRTI+N VR +L+E+++P+ F
Sbjct: 566  GGEFISNEFEEFCKDQGI--CRQLTASYTPQQNGVAERKNRTIMNAVRAVLNERQVPRVF 623

Query: 321  WSDAVKWAVFVEKRSPTTALHXXXXXXAWCGLKSSVHFFQNF*VCSSSPRTWCTKKEARQ 500
            W +AVKW V V+ RSPT+A+       AW G++ SV +F+ F  C +       K+    
Sbjct: 624  WPEAVKWCVHVQNRSPTSAVDHITPEEAWTGVRPSVDYFRIF-GCVAHAHVPDQKRSKLD 682

Query: 501  *CQ*RVKIL--------YAL*SYC-KKDLDQQGCKICEDETWKWREEGSNSKIQVADLEE 653
                R   L        Y L     KK +  +     ED++W W       K+ V D EE
Sbjct: 683  DKSKRCVFLGVSDESKAYKLFDPIEKKVIVNRDVVFEEDKSWDWGRTEEECKVDVLDWEE 742

Query: 654  KEEEGSVVGTSAGNQTRNAGANVQEQNASANEN---TNEDSLVEQGRALVEGRIKRKPAY 824
             EE+G   GT+   +  +   N     +S N+    TNE + VEQ       R +R+P +
Sbjct: 743  NEEDGEDHGTAQNEEENSGDINQGASPSSLNKTGSPTNETNDVEQEFLERAARSRRRPGW 802

Query: 825  L----QDMSLSK*SWMSSNT*KTLALLWSLKQKILSPLRML*EVKNGK--KAIDLEIEAI 986
            L    +D +LS+         +   ++    +    P       K+ K  +A+++E++AI
Sbjct: 803  LVDFEEDPNLSE---------EESLMVMMTAENGSDPYLFEEAFKSAKWREAMNMEMKAI 853

Query: 987  EKLKRHMGTDNLPKG 1031
            EK K  + TD  P+G
Sbjct: 854  EKNKTWVLTD-APRG 867


>gb|PRQ17740.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 1302

 Score =  103 bits (257), Expect = 2e-20
 Identities = 79/266 (29%), Positives = 119/266 (44%), Gaps = 13/266 (4%)
 Frame = +3

Query: 234  QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413
            Q  + ER+NRTI+NMVR +LSEK++PK FW +AV W++ +  RSPT A+       AW G
Sbjct: 590  QNGVAERKNRTIMNMVRSMLSEKQVPKVFWPEAVNWSIHILNRSPTLAVKDMTPEEAWSG 649

Query: 414  LKSSVHFFQNF*VCSSSPRTWCTKKEARQ*CQ*RVKILYAL*SYCKKDLDQQGCKIC--- 584
            +K  VH+F+ F   +        +K+        V +  +  S   +  D    +I    
Sbjct: 650  IKPVVHYFKVFGCIAHVHIPEAKRKKLDNKSYKCVLLGVSEESKAYRLYDPISERIVVSR 709

Query: 585  -----EDETWKWREEGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQN---AS 740
                 EDE+W+W       ++ V D  + EEE             N  A  +E+N    S
Sbjct: 710  DVVFEEDESWEWGRTAEEVRLDVLDWSDGEEE------------ENESAQSEEENEYAQS 757

Query: 741  ANENTNEDSLVEQGRALVEGRIKRKPAYLQDMSLSK*SWMSSNT*KTLALLWSLKQKILS 920
              EN N D   E+   L EGR + +P ++QD    +       T   +     +   +  
Sbjct: 758  EEENVNNDDAEEEEEILEEGRTRMQPVWMQDYVSGEGLSEEEETNNVV-----MFTSVTD 812

Query: 921  PLRML*EVKNG--KKAIDLEIEAIEK 992
            P       K+   K A+D EIEAIE+
Sbjct: 813  PATFEEAFKSAKWKAAMDQEIEAIER 838


>gb|PRQ42077.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 1044

 Score =  102 bits (253), Expect = 6e-20
 Identities = 77/263 (29%), Positives = 124/263 (47%), Gaps = 10/263 (3%)
 Frame = +3

Query: 234  QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413
            Q  + ER+NRTI+NMVR +LSEK++PK FW +AV W++ +  RSPT A+       AW G
Sbjct: 311  QNGVAERKNRTIMNMVRSMLSEKQVPKVFWPEAVNWSIHILNRSPTLAVKDMTPEEAWSG 370

Query: 414  LKSSVHFFQNF*VCSSSPRTWCTKKEARQ*CQ*RVKILYAL*SYCKKDLDQQGCKIC--- 584
            +K +VH+F+ F   +        +K+        V +  +  S   +  D    +I    
Sbjct: 371  IKPAVHYFRVFGCIAHVHIPEAKRKKLDDKSYKCVLLGVSKESKAYRLYDPISERIVVSR 430

Query: 585  -----EDETWKWREEGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQNASANE 749
                 EDE W W       ++ V D  + EEE +    SA ++  N  A  +E+N + N+
Sbjct: 431  DVVFEEDENWDWGRTAEEVRLDVLDWSDGEEEEN---ESAQSEEENEFAQSEEENVN-ND 486

Query: 750  NTNEDSLVEQGRALVEGRIKRKPAYLQDMSLSK*SWMSSNT*KTLALLWSLKQKILSPLR 929
            +  E+ ++E      EGR + +P ++QD    +       T   +   +     +  P  
Sbjct: 487  DAEEEEILE------EGRTRMQPVWMQDYVSGEGLSEEEETNNIVMFTY-----VTDPTT 535

Query: 930  ML*EVKNG--KKAIDLEIEAIEK 992
                 K+   K A+D EIEAIE+
Sbjct: 536  FEEAFKSAKWKAAMDQEIEAIER 558


>gb|PRQ52345.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 1316

 Score = 97.4 bits (241), Expect = 2e-18
 Identities = 76/263 (28%), Positives = 121/263 (46%), Gaps = 10/263 (3%)
 Frame = +3

Query: 234  QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413
            Q  + ER+NRTI+NMVR +LSEK++PK FW +AV W++ +  RSPT A+       AW G
Sbjct: 592  QNGVAERKNRTIMNMVRSMLSEKQVPKVFWPEAVNWSIHILNRSPTLAVKDMTPEEAWSG 651

Query: 414  LKSSVHFFQNF*VCSSSPRTWCTKKEARQ*CQ*RVKILYAL*SYCKKDLDQQGCKIC--- 584
            +K +VH+F+ F   +        +K+        V +  +  S   +  D    +I    
Sbjct: 652  IKPAVHYFRVFGCIAHVHIPEAKRKKLDDKSYKCVLLGVSKESKAYRLYDPISERIVVSR 711

Query: 585  -----EDETWKWREEGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQNASANE 749
                 EDE+W W       ++ V D  + EEE             N  A  +E+N + N+
Sbjct: 712  DVVFEEDESWDWGRTAEEERLDVLDWSDGEEE------------ENESAQSEEENVN-ND 758

Query: 750  NTNEDSLVEQGRALVEGRIKRKPAYLQDMSLSK*SWMSSNT*KTLALLWSLKQKILSPLR 929
            +  E+ ++E      EGR + +P ++QD  +S            + +  S    +  P  
Sbjct: 759  DAEEEEILE------EGRTRLQPIWMQDY-VSGEGLSEEEETNNIVMFTS----VTDPTT 807

Query: 930  ML*EVKNG--KKAIDLEIEAIEK 992
                 K+   K A+D EIEAIE+
Sbjct: 808  FEEAFKSAKWKAAMDQEIEAIER 830


>gb|PNX93789.1| copia-type polyprotein [Trifolium pratense]
          Length = 1347

 Score = 97.4 bits (241), Expect = 2e-18
 Identities = 66/221 (29%), Positives = 104/221 (47%), Gaps = 21/221 (9%)
 Frame = +3

Query: 234  QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413
            Q  + ER+NRTI+NMVR +L EKK+PK FW +AVKW+V +  R PT A+       AW G
Sbjct: 593  QNGVAERKNRTIMNMVRSMLVEKKVPKMFWPEAVKWSVHILNRCPTLAVQNKTPEEAWSG 652

Query: 414  LKSSVHFFQNF*VCSSSPRTWCTKKEARQ*CQ*RVKIL--------YAL*S-YCKKDLDQ 566
            +K ++++F+ F  C +       K+        +  +L        Y L     KK +  
Sbjct: 653  IKPTINYFRVF-GCVAHAHIPDQKRSKLDDKSKKCVLLGVSDESKAYKLYDPVSKKIIIS 711

Query: 567  QGCKICEDETWKWREEGSNSKIQVA--------DLEEKEEEGSVVGTSAGNQTRN----A 710
            +     ED  W W       ++ V         D+EE  E    VG +   +  N     
Sbjct: 712  KDVIFEEDVCWNWDNNKDERRVDVLEWKNDYENDIEEAIEGNEEVGNNGNEEEHNNGNEG 771

Query: 711  GANVQEQNASANENTNEDSLVEQGRALVEGRIKRKPAYLQD 833
            G N    N+  + +++ +S  ++   +VEGR++R P+YL D
Sbjct: 772  GNNDDTTNSIESNSSSSESHEDESPNMVEGRVRRPPSYLAD 812


>gb|KYP44586.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 783

 Score = 95.5 bits (236), Expect = 8e-18
 Identities = 62/210 (29%), Positives = 96/210 (45%), Gaps = 10/210 (4%)
 Frame = +3

Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413
           Q  + ER+N+TI+NMVR +L EK +PK FW +AV WAV V  RSPT A+       AW G
Sbjct: 36  QNGVAERKNQTIINMVRSVLGEKGVPKAFWPEAVMWAVHVLNRSPTLAVKDITPEEAWSG 95

Query: 414 LKSSVHFFQNF*VCS-----SSPRTWCTKKEARQ*C-----Q*RVKILYAL*SYCKKDLD 563
           +K SV +F+ F         +  R+    K  +  C         K         KK L 
Sbjct: 96  IKPSVSYFRIFGCIGYAYVHNQQRSKLDDKSTK--CVLLGVSEESKAYKLYDPVKKKILI 153

Query: 564 QQGCKICEDETWKWREEGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQNASA 743
            +  K  ED  W W E  ++S +   +L   EE   V      +++ +  A       ++
Sbjct: 154 SRDVKFQEDAAWDWSEAKNSSILDTGELNPLEESSQVAEEKTQDKSNDTAATSSATTTNS 213

Query: 744 NENTNEDSLVEQGRALVEGRIKRKPAYLQD 833
           + N  + S+     +  +GR +R P +++D
Sbjct: 214 SSNVPDYSVPAASESHDQGRTRRPPTWMRD 243


>ref|XP_017188308.1| PREDICTED: uncharacterized protein LOC108173558 [Malus domestica]
          Length = 497

 Score = 92.4 bits (228), Expect = 6e-17
 Identities = 74/214 (34%), Positives = 107/214 (50%), Gaps = 14/214 (6%)
 Frame = +3

Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413
           Q  + ER+NRTI+NMVR +L EKK+PK FW +AV W V V  RSPT A+       AW G
Sbjct: 285 QNGVAERKNRTIMNMVRSMLIEKKIPKTFWPEAVNWTVHVLNRSPTIAVKSKTPEEAWQG 344

Query: 414 LKSSVHFFQNF*VCS--SSPRTWCTKKEARQ*CQ*RVKILY---AL*SYCKKDLDQQGCK 578
           LK SV  F+ F   S    P     K EA+      +K ++   +  S   +  D    K
Sbjct: 345 LKPSVEHFRVFGCISHVHIPDNKRVKLEAKS-----LKCIFLGVSDESKAYRLFDPISSK 399

Query: 579 IC--------EDETWKWREEGSNSKIQVADLE-EKEEEGSVVGTSAGNQTRNAGANVQEQ 731
           I         ED+ W W E   + +  +ADLE E  EE S    +  N +  A   ++E 
Sbjct: 400 IIVSRDVVFEEDQEWSWDE--VHKQTILADLEWEVNEEASTEEENNENGSETA-EELEEH 456

Query: 732 NASANENTNEDSLVEQGRALVEGRIKRKPAYLQD 833
            ++++ +  ED+      +L EGR +R PA+++D
Sbjct: 457 GSNSSGSFEEDT--SNVTSLPEGRTRRPPAWMRD 488


>gb|PNY15642.1| copia-type polyprotein [Trifolium pratense]
          Length = 822

 Score = 92.4 bits (228), Expect = 9e-17
 Identities = 89/282 (31%), Positives = 132/282 (46%), Gaps = 16/282 (5%)
 Frame = +3

Query: 234  QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413
            Q  + ER+NRTILNMVR +L+ +++PK FW +AVKWA +V  RSPT A+       AW G
Sbjct: 80   QNGVSERKNRTILNMVRSMLAAREVPKNFWPEAVKWATYVMNRSPTFAVQDMTPEEAWSG 139

Query: 414  LKSSVHFFQNF*VCS--SSPRTWCTKKEARQ*CQ*RVKIL----------YAL*SYCKKD 557
            +K SVH F+ F   +    P     K + +      VK +          Y L +  +K 
Sbjct: 140  VKPSVHHFRVFGCLAHVHVPDVQRKKLDGKS-----VKCIHLGLSEESKAYKLYNPNEKK 194

Query: 558  LDQQGCKICEDET-WKWREEGSNSKI---QVADLEEKEEEGSVVGTSAGNQTRNAGANVQ 725
            +      I E++  W W+++   S I     +D E   +E   V    GNQ+ +A  ++ 
Sbjct: 195  IIVSRDVIFEEQKGWNWKKKNYKSPIIHDTESDSEVAAQENHPVAPE-GNQSDDAEIDMD 253

Query: 726  EQNASANENTNEDSLVEQGRALVEGRIKRKPAYLQDMSLSK*SWMSSNT*KTLALLWSLK 905
             Q   A++  NEDS  + G  L   R +R P YL+D   +       N  +  AL  S +
Sbjct: 254  TQ---ASDTENEDSDNDNGNNL-PPRTRRPPGYLEDYDTTT-GEEQENMIQHFALFSSKE 308

Query: 906  QKILSPLRML*EVKNGKKAIDLEIEAIEKLKRHMGTDNLPKG 1031
                       ++   KKA++ EIE+I K      T  LPKG
Sbjct: 309  DP--ESYEDAIKIDVWKKAMESEIESINKNDTWELT-TLPKG 347


>dbj|GAU41840.1| hypothetical protein TSUD_177510 [Trifolium subterraneum]
          Length = 936

 Score = 91.7 bits (226), Expect = 2e-16
 Identities = 85/292 (29%), Positives = 139/292 (47%), Gaps = 17/292 (5%)
 Frame = +3

Query: 207  IFLLP-YGGIQKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALH 383
            ++ +P Y   Q  + ER+NRTIL+MVR ++S + +PK FW  AV WA +V+ RSPT  + 
Sbjct: 350  VYYIPAYTPQQNGVSERKNRTILDMVRSLISARNVPKRFWPGAVNWATYVKNRSPTHVVQ 409

Query: 384  XXXXXXAWCGLKSSVHFFQNF*VCS--SSPRTWCTKKEARQ*CQ*RVKILYAL*SYC--- 548
                  AW G+K SVH F+ F   +    P     K + +      + +     +Y    
Sbjct: 410  DITLEEAWSGVKPSVHNFRIFGCVAHVHIPDVNRKKLDGKSIMCILLGVSEESKTYKLYN 469

Query: 549  ---KKDLDQQGCKICEDETWKW-REEGSNSKIQVADLEE---KEEEGSVVGTSAGNQTRN 707
               KK +  +     E ++W W ++E S++K Q  D+E+    ++ G +     G+ T N
Sbjct: 470  PSEKKIIISRDVVFEESKSWNWNKQETSSAKGQTIDIEDNDANDDTGQIEAKVTGSNTDN 529

Query: 708  AGANVQEQNASANENTNEDSLVEQG--RALVEGRIKRKPAYLQDMSLSK*SWMSSNT*KT 881
              ++       ANE+  +D  V       ++  R +R P YL+D   ++      +  + 
Sbjct: 530  TESH---DGNEANEDIQQDGHVSDSSDSEVLTPRTRRPPNYLRDYVTNQEQENEVDVMQN 586

Query: 882  LALLWSLKQKILSPLRML*EVKNG--KKAIDLEIEAIEKLKRHMGTDNLPKG 1031
             A L+S K+    P      VK+   KKA++ EIE I+K      TD LP+G
Sbjct: 587  FA-LFSFKE---DPNSYEEAVKHDVWKKAMESEIEVIKKNDTWELTD-LPQG 633


>gb|PNX89974.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 415

 Score = 90.5 bits (223), Expect = 2e-16
 Identities = 82/276 (29%), Positives = 118/276 (42%), Gaps = 10/276 (3%)
 Frame = +3

Query: 234  QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413
            Q  + ER+N+TI+NMVRC+LSEK LPK  W +AV WAV +  RSPT A+       AW  
Sbjct: 47   QNGVAERKNQTIMNMVRCMLSEKHLPKFLWGEAVNWAVHILNRSPTLAVKDKTPEEAWSD 106

Query: 414  LKSSVHFFQNF*VCS--SSPRTWCTKKEARQ*CQ*RVKILYAL*SY------CKKDLDQQ 569
            +K +VH+ + F   +    P     K +A+      + I     +Y       K+ +  +
Sbjct: 107  IKPAVHYLKVFGCVAHVHIPEAKRKKLDAKSFRCVMLGISDESKAYRLFDPTTKRIVISK 166

Query: 570  GCKICEDETWKWREEGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQNASANE 749
                 E+E W W       K  + D  E EEE            RN      E      E
Sbjct: 167  DVIFEENECWDWERSPEQMKPDLLDWGESEEE------------RN------ENTEEVRE 208

Query: 750  NTNEDSLVEQGRALVEGRIKRKPAYLQDMSLSK*SWMSSNT*KTLALLWSLKQKILSPLR 929
                 S +    A +EGR++R P ++QD +  +    S    + L +          P  
Sbjct: 209  GMGSSSSLSSEEAPIEGRVRRPPGWMQDYTSGE--EFSEEEIQNLVMFTVAS----DPTT 262

Query: 930  ML*EVKNGK--KAIDLEIEAIEKLKRHMGTDNLPKG 1031
                VK+ K   A++ E+EAIEK      TD LP G
Sbjct: 263  FEEAVKSEKWRNAMNNEMEAIEKNNTWELTD-LPTG 297


>gb|PNX95763.1| retrotransposon-related protein, partial [Trifolium pratense]
          Length = 1327

 Score = 87.0 bits (214), Expect = 6e-15
 Identities = 59/208 (28%), Positives = 92/208 (44%), Gaps = 8/208 (3%)
 Frame = +3

Query: 234  QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413
            Q  + ER+NRTI+NMVR +LSEK++PKEFW++A  W++ +  R PTTAL       AW G
Sbjct: 591  QNGVAERKNRTIMNMVRSMLSEKQMPKEFWAEAANWSIHILNRCPTTALENMTPQEAWTG 650

Query: 414  LKSSVHFFQNF*VCS--SSPRTWCTKKEARQ*CQ*RVKILYAL*SY------CKKDLDQQ 569
             K  V  F+ F   +    P     K + +      + +     +Y       KK    +
Sbjct: 651  CKPRVDHFRIFGCLAHVHVPDQKRIKLDDKSKTHIFLGVSKESKAYKLFDPITKKITISR 710

Query: 570  GCKICEDETWKWREEGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQNASANE 749
              K  E+  WKW++     +  V DLE+K  + +       +   N  +N   Q    + 
Sbjct: 711  DVKFEENACWKWKQSKGEVQSDVLDLEDKNSDANKELELEEDSDSNNTSNTISQTGGNSS 770

Query: 750  NTNEDSLVEQGRALVEGRIKRKPAYLQD 833
             T+             GR +R P ++ D
Sbjct: 771  TTSSGGSEPNSPT---GRFRRAPGWMSD 795


>gb|PNX94698.1| copia-type polyprotein [Trifolium pratense]
          Length = 1324

 Score = 86.7 bits (213), Expect = 9e-15
 Identities = 66/214 (30%), Positives = 100/214 (46%), Gaps = 14/214 (6%)
 Frame = +3

Query: 234  QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413
            Q  + ER+NRT+LNMVR +L+ + +PK+FW +AVKWA +V  RSPT ++       AW G
Sbjct: 589  QNGVSERKNRTLLNMVRSMLAGRNVPKKFWPEAVKWATYVLNRSPTLSVKDSTPEEAWSG 648

Query: 414  LKSSVHFFQNF*VCS--SSPRTWCTKKEARQ*CQ*RVKILYAL*SY------CKKDLDQQ 569
            LK SVH F+ F   +    P    TK +A+      + +     +Y       KK +  +
Sbjct: 649  LKPSVHHFKIFGCLAYVHVPDAKRTKLDAKSLKCVHLGVSEESKAYKLYDPVNKKIIVSR 708

Query: 570  GCKICEDETWKWREE------GSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQ 731
                 E   W W ++       SN+   ++D  + EEE    G + GN   N  ++  + 
Sbjct: 709  DVVFEEGTEWNWNDKKKAAASSSNNNDLISDETDIEEEAK-NGVNTGN---NESSSEYDS 764

Query: 732  NASANENTNEDSLVEQGRALVEGRIKRKPAYLQD 833
                N+   E+ L          R K+KP YL D
Sbjct: 765  EQEGNDYETEEEL--------PPRPKKKPGYLND 790


>gb|PNX77752.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 736

 Score = 86.3 bits (212), Expect = 1e-14
 Identities = 63/211 (29%), Positives = 94/211 (44%), Gaps = 11/211 (5%)
 Frame = +3

Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413
           Q  + E +NRTI+NMVRC+LSEK +PK FW +AV WA  +  RSPT A+       AW G
Sbjct: 358 QNGVSESKNRTIVNMVRCMLSEKNVPKNFWPEAVNWAAHILNRSPTFAVKDITPEEAWSG 417

Query: 414 LKSSVHFFQNF*VCS--SSPRTWCTKKEARQ*CQ*RVKI-----LYAL*SYCKKDLD-QQ 569
           +K SV  F+ F   +    P     K + +      + I      Y L    KK +   +
Sbjct: 418 IKPSVSHFKVFGCIAYVHVPDNLRKKLDDKSTVCIHLGISEESKAYKLYDPIKKRIAVSK 477

Query: 570 GCKICEDETWKWRE---EGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQNAS 740
             K  E + W W +   E SN   Q+ D ++ E   +     A N      +N   ++  
Sbjct: 478 DVKFDESKQWNWNDKNTENSNKNKQIIDCDDIETPSTSNQNEASNDAEEQASNSHSEDMD 537

Query: 741 ANENTNEDSLVEQGRALVEGRIKRKPAYLQD 833
                +E+        L + R+ ++P YL D
Sbjct: 538 LVVTDSEEEDGNDENPLGK-RVSKRPDYLND 567


>gb|PNX91151.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 430

 Score = 84.7 bits (208), Expect = 2e-14
 Identities = 62/208 (29%), Positives = 94/208 (45%), Gaps = 8/208 (3%)
 Frame = +3

Query: 234 QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413
           Q  + ER+NRTI+NMVRC+LS+KK+PK+FW ++VKWAV+V  RSPT  +       AW  
Sbjct: 193 QNGVSERKNRTIMNMVRCMLSDKKVPKKFWPESVKWAVYVLNRSPTLLVKDITPEEAWSN 252

Query: 414 LKSSVHFFQNF*VCSSSPRTWCTKKEARQ*CQ*RVKILYAL*SYCKKDLDQQGCKIC--- 584
           +K SV  F+ F   +        +K+        V +  +  S   K  +    +I    
Sbjct: 253 MKPSVKHFKVFGCLAFVHVPDAQRKKLDDKSIKCVHLGVSEESKAYKLYNPADRRIIVSR 312

Query: 585 -----EDETWKWREEGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQNASANE 749
                E + W W   G NS+ Q    +  E E         ++   AG NV+    +   
Sbjct: 313 DVVFDESKGWNW---GENSQAQATQYDNSENE-----VYETDEEPAAGENVEADPQNITV 364

Query: 750 NTNEDSLVEQGRALVEGRIKRKPAYLQD 833
             +E     +    +  R+ R+P YL D
Sbjct: 365 PDSESDEQYESEEELPPRVIRRPGYLND 392


>gb|PNX96091.1| retrotransposon-related protein [Trifolium pratense]
          Length = 1326

 Score = 85.5 bits (210), Expect = 2e-14
 Identities = 78/277 (28%), Positives = 130/277 (46%), Gaps = 11/277 (3%)
 Frame = +3

Query: 234  QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413
            Q  + ER+NRT++NMVR +L +K +PK FW +AV W ++V  R PT A+       AW G
Sbjct: 589  QNGVAERKNRTVMNMVRSLLFDKNIPKTFWPEAVNWTIYVLNRCPTLAVKDVTPEEAWSG 648

Query: 414  LKSSVHFFQNF*VCS--SSPRTWCTKKEARQ*CQ*RVKILYAL*SY------CKKDLDQQ 569
            +K SV+ F+ F   +    P    TK ++R      + +      Y       KK +  +
Sbjct: 649  VKPSVNHFRVFGCIAHVHVPEAKRTKLDSRSITCVLLGVSEESKGYRFFDPVSKKIVVSR 708

Query: 570  GCKICEDETWKWREEGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQNASANE 749
                 ED+ W W E+ + + ++  D E +E           ++  N G NV+++     E
Sbjct: 709  DVIFEEDKQWDWEEKQTVADLEWNDGENEERVSENDNEERVSENDNQG-NVEKEREVIRE 767

Query: 750  NTNEDSLVEQGRALV-EGRIKRKPAYLQDMSLSK*SWMSSNT*KTLALLWSLKQKILSPL 926
              ++ +   +G  +V E R +R P ++ D    +     S     +AL+ S     + PL
Sbjct: 768  EEHDSN---EGEEIVKEYRERRPPGWMSDFESGE---GLSEDEAHMALMVS-----IDPL 816

Query: 927  RML*EVK--NGKKAIDLEIEAIEKLKRHMGTDNLPKG 1031
                 VK  N + A++ EI++IEK +    T+ LP G
Sbjct: 817  CFEEAVKSENWRLAMEKEIKSIEKNQTWTLTE-LPAG 852


>gb|KYP37051.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1079

 Score = 85.1 bits (209), Expect = 3e-14
 Identities = 68/261 (26%), Positives = 114/261 (43%), Gaps = 8/261 (3%)
 Frame = +3

Query: 234  QKRLVERENRTILNMVRCILSEKKLPKEFWSDAVKWAVFVEKRSPTTALHXXXXXXAWCG 413
            Q  + ER+NRTI+N VR +L EK++ K FW + V+W V ++ RSPTTA+        W G
Sbjct: 572  QNGVAERKNRTIMNAVRAVLHEKQVSKSFWPEVVRWCVHIQNRSPTTAIDHGTLEEVWSG 631

Query: 414  LKSSVHFFQNF*VCS--SSPRTWCTKKEARQ------*CQ*RVKILYAL*SYCKKDLDQQ 569
            +K  V +F+ F   +    P    +K + +             K         KK +  +
Sbjct: 632  IKPRVDYFRTFGCVAHVHIPDQRRSKLDDKSHTCVLLGVSDEAKAYKLFDPISKKVIVSR 691

Query: 570  GCKICEDETWKWREEGSNSKIQVADLEEKEEEGSVVGTSAGNQTRNAGANVQEQNASANE 749
                 ED+ W W +  + S     D+E ++EEGS    S        G+N   +      
Sbjct: 692  DVVFEEDKGWNWHKGTTESTPLALDMEGQDEEGSDDVDSTPQLVATRGSNNNSEPPEPVS 751

Query: 750  NTNEDSLVEQGRALVEGRIKRKPAYLQDMSLSK*SWMSSNT*KTLALLWSLKQKILSPLR 929
            N+       +GRA    R +R+P ++ D   +    + +     LA+L +   +      
Sbjct: 752  NSESIVPPVEGRAT---RTQRQPLWMTDYETN----LFAEEESLLAML-TTNSEDPQTFE 803

Query: 930  ML*EVKNGKKAIDLEIEAIEK 992
                 +  K+A+D E++AIE+
Sbjct: 804  EASTSQKWKEAMDTEMKAIER 824


Top