BLASTX nr result

ID: Chrysanthemum21_contig00005537 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00005537
         (1597 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|OTF86030.1| putative ribonuclease H-like domain-containing pr...   331   1e-96
ref|XP_022032681.1| uncharacterized protein LOC110933783 [Helian...   303   2e-92
ref|XP_021974902.1| uncharacterized protein LOC110870015 [Helian...   286   5e-89
ref|XP_022019013.1| uncharacterized protein LOC110919044 [Helian...   286   1e-88
ref|XP_022000771.1| uncharacterized protein LOC110898294 [Helian...   275   5e-86
gb|OTG29887.1| putative gag-polypeptide of LTR copia-type [Helia...   271   2e-83
ref|XP_022012651.1| uncharacterized protein LOC110912271 [Helian...   278   6e-83
ref|XP_022011939.1| uncharacterized protein LOC110911621 [Helian...   263   2e-79
ref|XP_021996287.1| uncharacterized protein LOC110893489 [Helian...   264   6e-78
ref|XP_021992301.1| uncharacterized protein LOC110889107 [Helian...   257   4e-77
ref|XP_021994452.1| uncharacterized protein LOC110891101 [Helian...   255   2e-76
gb|OTG35332.1| putative gag-polypeptide of LTR copia-type [Helia...   256   6e-76
gb|OTG08964.1| putative gag-polypeptide of LTR copia-type [Helia...   258   3e-75
ref|XP_020537368.1| uncharacterized protein LOC110010150 [Jatrop...   251   5e-75
ref|XP_021974743.1| uncharacterized protein LOC110869838 [Helian...   245   1e-73
ref|XP_019158475.1| PREDICTED: uncharacterized protein LOC109155...   258   3e-72
gb|OTG08979.1| putative ribonuclease H-like domain, GAG-pre-inte...   255   7e-72
gb|OTG25356.1| putative gag-polypeptide of LTR copia-type [Helia...   228   7e-68
ref|XP_020535493.1| uncharacterized protein LOC110009599 [Jatrop...   229   1e-67
ref|XP_020535814.1| uncharacterized protein LOC110009728, partia...   223   1e-65

>gb|OTF86030.1| putative ribonuclease H-like domain-containing protein [Helianthus
            annuus]
          Length = 1532

 Score =  331 bits (848), Expect = 1e-96
 Identities = 157/209 (75%), Positives = 181/209 (86%), Gaps = 2/209 (0%)
 Frame = +3

Query: 975  QHDLVVFSWLIQNIEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQ 1154
            Q DLVVFSWLIQNIEP +AGNLTE+PTAK+LWDAL +TYSSG+DKLQTFNLHVKANE++Q
Sbjct: 85   QDDLVVFSWLIQNIEPVLAGNLTEFPTAKSLWDALVVTYSSGRDKLQTFNLHVKANEIKQ 144

Query: 1155 NDKSLEEFWIELQGVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQFLNGLDRKFESVK 1334
            NDKSLE+FWI LQGVWGEIDRIDPNPMKC EDI+ Y +IRSEQKLFQFLN LDRK++ +K
Sbjct: 145  NDKSLEDFWIILQGVWGEIDRIDPNPMKCPEDIRTYLRIRSEQKLFQFLNALDRKYDPIK 204

Query: 1335 REVLRVDPLPTVEAAYATIRKEAAHQNILGATNNEHQGIATGLVAGETGGTGLTTKSFRK 1514
            RE+LR+DPLP+ EAAYAT+RKEAAHQNILG T ++ QGIA GL A  T G GL TK  R+
Sbjct: 205  REILRLDPLPSAEAAYATVRKEAAHQNILGTTVDDTQGIAAGLSATGTEGLGLVTKGHRR 264

Query: 1515 YDGKK--STTKEDKSHLKCEVCGMNRHTK 1595
            +DGKK  +  KEDK+HLKC+ CGM RHTK
Sbjct: 265  FDGKKNGAPNKEDKTHLKCDHCGMTRHTK 293


>ref|XP_022032681.1| uncharacterized protein LOC110933783 [Helianthus annuus]
          Length = 592

 Score =  303 bits (776), Expect = 2e-92
 Identities = 149/209 (71%), Positives = 174/209 (83%), Gaps = 2/209 (0%)
 Frame = +3

Query: 975  QHDLVVFSWLIQNIEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQ 1154
            Q DLVVFSWLIQNIEP +A NLTE+PTAK+LWDAL +TYSSG+DKLQTFNLHVKAN+++Q
Sbjct: 34   QDDLVVFSWLIQNIEPALASNLTEFPTAKSLWDALVVTYSSGRDKLQTFNLHVKANDIKQ 93

Query: 1155 NDKSLEEFWIELQGVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQFLNGLDRKFESVK 1334
            ND SLEEFWI LQG+WGEIDR      KC EDI+ Y+KIRSEQKLFQFLN LDRK++ +K
Sbjct: 94   NDTSLEEFWITLQGIWGEIDR------KCPEDIQTYSKIRSEQKLFQFLNALDRKYDPIK 147

Query: 1335 REVLRVDPLPTVEAAYATIRKEAAHQNILGATNNEHQGIATGLVAGETGGTGLTTKSFRK 1514
            RE+LR+DPLP+ EAAYA +RKEAAHQNILGAT +E QGI  GLVA E  G GL +K  R+
Sbjct: 148  RELLRLDPLPSAEAAYAAVRKEAAHQNILGATLSETQGIGAGLVATEKEGLGLISKG-RR 206

Query: 1515 YDGKKS--TTKEDKSHLKCEVCGMNRHTK 1595
            +DGKK+    KEDKSHLKC+ CGM +HTK
Sbjct: 207  FDGKKNGPPVKEDKSHLKCDHCGMTKHTK 235


>ref|XP_021974902.1| uncharacterized protein LOC110870015 [Helianthus annuus]
          Length = 325

 Score =  286 bits (731), Expect = 5e-89
 Identities = 144/212 (67%), Positives = 166/212 (78%), Gaps = 5/212 (2%)
 Frame = +3

Query: 975  QHDLVVFSWLIQNIEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQ 1154
            Q DLVVFSWLIQNIEP +AGNLTE+PTAKALWDAL +TYSSGKDKLQTF+LHVKANEL+Q
Sbjct: 36   QDDLVVFSWLIQNIEPGLAGNLTEFPTAKALWDALVVTYSSGKDKLQTFDLHVKANELKQ 95

Query: 1155 NDKSLEEFWIELQGVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQFLNGLDRKFESVK 1334
            N  +LE+FWI+LQG+WGEIDR D NPM C  DI  Y  IRSEQKLFQFLN LDRKF+ VK
Sbjct: 96   NGSALEDFWIKLQGIWGEIDRRDLNPMTCSADIATYKNIRSEQKLFQFLNALDRKFDPVK 155

Query: 1335 REVLRVDPLPTVEAAYATIRKEAAHQNILGATNNEHQ-GIATGLVAG---ETGGTGLTTK 1502
            RE+LR DPLP+ E AYA +RKE AHQ ILG  +   Q G+A+GL+ G   E  G GL TK
Sbjct: 156  REILRWDPLPSAEQAYAAVRKEMAHQGILGTISETSQSGVASGLIVGGTNEIDGQGLITK 215

Query: 1503 SFRKYD-GKKSTTKEDKSHLKCEVCGMNRHTK 1595
              R+ D   KS+++ DKS LKC  CGMN+HTK
Sbjct: 216  GQRRSDFTGKSSSRIDKSKLKCSHCGMNKHTK 247


>ref|XP_022019013.1| uncharacterized protein LOC110919044 [Helianthus annuus]
          Length = 379

 Score =  286 bits (733), Expect = 1e-88
 Identities = 144/212 (67%), Positives = 167/212 (78%), Gaps = 5/212 (2%)
 Frame = +3

Query: 975  QHDLVVFSWLIQNIEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQ 1154
            Q DL VFSWLIQNIEP +AGNLTE+PTAKALWDAL +TYSSGKDKLQTF+LHVKANEL+Q
Sbjct: 93   QDDLFVFSWLIQNIEPGLAGNLTEFPTAKALWDALVVTYSSGKDKLQTFDLHVKANELKQ 152

Query: 1155 NDKSLEEFWIELQGVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQFLNGLDRKFESVK 1334
            N  +LE+FWI+LQG+WGEIDR+DPNPM C  D+  Y  IRSEQKLFQFLN LDRKF+ VK
Sbjct: 153  NGSALEDFWIKLQGIWGEIDRMDPNPMTCSADVATYNNIRSEQKLFQFLNALDRKFDLVK 212

Query: 1335 REVLRVDPLPTVEAAYATIRKEAAHQNILGATNNEHQ-GIATGLVAG---ETGGTGLTTK 1502
            RE+L  DPLP+ E AYAT+RKE AHQ ILG  +   Q G+A GLVAG   ET G GL TK
Sbjct: 213  REILWWDPLPSAEQAYATVRKEMAHQGILGTISETSQSGVAAGLVAGGTTETDGQGLITK 272

Query: 1503 SFRKYD-GKKSTTKEDKSHLKCEVCGMNRHTK 1595
              R+ +   KS+++ DKS LKC  CG N+HTK
Sbjct: 273  GQRRSNFTGKSSSRIDKSKLKCSHCGKNKHTK 304


>ref|XP_022000771.1| uncharacterized protein LOC110898294 [Helianthus annuus]
          Length = 247

 Score =  275 bits (703), Expect = 5e-86
 Identities = 134/208 (64%), Positives = 165/208 (79%), Gaps = 1/208 (0%)
 Frame = +3

Query: 975  QHDLVVFSWLIQNIEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQ 1154
            Q DL+VFSWLIQNIEP++A NLTE+PTAK LWDAL ITYSSGKDKLQTF+LHVKANE +Q
Sbjct: 34   QDDLIVFSWLIQNIEPSLASNLTEFPTAKTLWDALTITYSSGKDKLQTFDLHVKANEFKQ 93

Query: 1155 NDKSLEEFWIELQGVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQFLNGLDRKFESVK 1334
            N   LEEFWI +QG+WGEI R DPNP+ C  DI  Y K+RSE KLFQFLN LDRK++S+K
Sbjct: 94   NGVPLEEFWIIMQGIWGEIKRRDPNPIACPADIATYNKVRSEYKLFQFLNALDRKYDSLK 153

Query: 1335 REVLRVDPLPTVEAAYATIRKEAAHQNILGATNNEHQGIATGLVA-GETGGTGLTTKSFR 1511
            RE+L+ DPLP+VE AYA +RKE  HQ+I G   N H+G+ +GL + G+T G GL ++S R
Sbjct: 154  REILQWDPLPSVEVAYAVVRKETTHQSIFG---NPHKGVGSGLNSHGDTDGLGLVSRS-R 209

Query: 1512 KYDGKKSTTKEDKSHLKCEVCGMNRHTK 1595
            + D K S+++ DKS L+CE CGM +HTK
Sbjct: 210  RSDQKPSSSRIDKSKLRCEHCGMAKHTK 237


>gb|OTG29887.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 325

 Score =  271 bits (693), Expect = 2e-83
 Identities = 133/212 (62%), Positives = 160/212 (75%), Gaps = 5/212 (2%)
 Frame = +3

Query: 975  QHDLVVFSWLIQNIEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQ 1154
            Q DL+VFSWLIQNIEP +A NLTE+PTAK+LWDAL +TYSSGKDKLQTF+LHVKAN ++Q
Sbjct: 36   QDDLIVFSWLIQNIEPALASNLTEFPTAKSLWDALVVTYSSGKDKLQTFDLHVKANGIKQ 95

Query: 1155 NDKSLEEFWIELQGVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQFLNGLDRKFESVK 1334
            N   LE+FWI +QG+WGEIDR DPNPM C  DI  Y KIRSEQKLFQFLN LDR+++++K
Sbjct: 96   NGSPLEDFWIIMQGIWGEIDRRDPNPMTCTVDIATYNKIRSEQKLFQFLNALDRQYDTIK 155

Query: 1335 REVLRVDPLPTVEAAYATIRKEAAHQNILGATNNEHQGIATGLVAG---ETGGTGLTTK- 1502
            RE+LR DPLP+ E AYA +RKE AHQ ILG   + H  +A GLVA    ET   G  +K 
Sbjct: 156  REILRWDPLPSAEGAYAAVRKEMAHQGILGTATSSHNNVAAGLVANGSHETESLGFLSKG 215

Query: 1503 -SFRKYDGKKSTTKEDKSHLKCEVCGMNRHTK 1595
             S +K     S+++ DKS LKC  CGM +HTK
Sbjct: 216  RSGQKNPNSGSSSQIDKSKLKCLHCGMLKHTK 247


>ref|XP_022012651.1| uncharacterized protein LOC110912271 [Helianthus annuus]
          Length = 565

 Score =  278 bits (710), Expect = 6e-83
 Identities = 137/210 (65%), Positives = 167/210 (79%), Gaps = 3/210 (1%)
 Frame = +3

Query: 975  QHDLVVFSWLIQNIEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQ 1154
            Q DL+VFSWLIQNIEP++A NLTE+PTAK LWDAL +TYSSGKDKLQTF+LHVK NE +Q
Sbjct: 34   QDDLIVFSWLIQNIEPSLASNLTEFPTAKTLWDALTVTYSSGKDKLQTFDLHVKVNEFKQ 93

Query: 1155 NDKSLEEFWIELQGVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQFLNGLDRKFESVK 1334
            +   LEEFWI +QG+WGEI+R DPNPMKC  DI  Y K+RSE KLFQFLN LDRK++S+K
Sbjct: 94   SGLPLEEFWIVMQGIWGEIERRDPNPMKCPTDIATYNKVRSEYKLFQFLNALDRKYDSLK 153

Query: 1335 REVLRVDPLPTVEAAYATIRKEAAHQNILGATNNEHQGIATGL-VAGETGGTGLTTKSFR 1511
            RE+LR DPLP+ EAAYA +RKE AHQ+I G   N HQG+A+GL   GE+ G GL T+  R
Sbjct: 154  REILRWDPLPSAEAAYAVVRKETAHQSIFG---NVHQGVASGLNSTGESDGLGLVTRG-R 209

Query: 1512 KYDGK--KSTTKEDKSHLKCEVCGMNRHTK 1595
            + D K  +S+++ DKS LKC+ CGM +HTK
Sbjct: 210  RSDQKSNQSSSRIDKSKLKCDHCGMAKHTK 239


>ref|XP_022011939.1| uncharacterized protein LOC110911621 [Helianthus annuus]
 gb|OTF95102.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 394

 Score =  263 bits (673), Expect = 2e-79
 Identities = 136/215 (63%), Positives = 158/215 (73%), Gaps = 8/215 (3%)
 Frame = +3

Query: 975  QHDLVVFSWLIQNIEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQ 1154
            Q DLVV SWLIQNIEP +A NLTE+PTAK LWDAL +TYSSGKDKLQTF+LHVKAN ++Q
Sbjct: 91   QDDLVVISWLIQNIEPALASNLTEFPTAKTLWDALVVTYSSGKDKLQTFDLHVKANGIKQ 150

Query: 1155 NDKSLEEFWIELQGVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQFLNGLDRKFESVK 1334
            N   LE+FWI +QGVWGEI+R DPNPM C EDI  Y KIRSEQKLFQFLN LDR++++VK
Sbjct: 151  NGSPLEDFWIIMQGVWGEIERRDPNPMTCPEDITTYNKIRSEQKLFQFLNALDRQYDTVK 210

Query: 1335 REVLRVDPLPTVEAAYATIRKEAAHQNILGA---TNNEHQGIATGLVA-GETGGTGLTTK 1502
            RE+LR DPLP+ E AYA +RKE AHQ ILG    T+    G+A GL A G     GL   
Sbjct: 211  REILRWDPLPSAEGAYAAVRKEMAHQGILGITIDTSYNPNGVAAGLNANGSRESEGLGFL 270

Query: 1503 SFRKYDGKKSTT----KEDKSHLKCEVCGMNRHTK 1595
            S  + D K S T    + DKS LKC  CGM++HTK
Sbjct: 271  SRGRVDQKSSNTGSSFRIDKSKLKCGHCGMSKHTK 305


>ref|XP_021996287.1| uncharacterized protein LOC110893489 [Helianthus annuus]
          Length = 548

 Score =  264 bits (675), Expect = 6e-78
 Identities = 130/209 (62%), Positives = 160/209 (76%), Gaps = 2/209 (0%)
 Frame = +3

Query: 975  QHDLVVFSWLIQNIEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQ 1154
            Q DL+VFSWLIQNIEP++  NLTE+PTA  LWDAL++TYSSGKDKLQTF+LHVKANE +Q
Sbjct: 84   QDDLIVFSWLIQNIEPSLTSNLTEFPTANTLWDALSVTYSSGKDKLQTFDLHVKANEFKQ 143

Query: 1155 NDKSLEEFWIELQGVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQFLNGLDRKFESVK 1334
            N   LEEFWI +QG+ GEI R DPNPMKC +DI  Y K+RSE KLFQ LN LDRK++S+K
Sbjct: 144  NGLPLEEFWIVMQGIRGEIKRRDPNPMKCSDDIATYNKVRSENKLFQLLNALDRKYDSLK 203

Query: 1335 REVLRVDPLPTVEAAYATIRKEAAHQNILGATNNEHQGIATGLVA-GETGGTGLTTKS-F 1508
            RE+LR DPLPT EAAYA +RKE AHQ+I G   N  QG+ +GL + G + G GL ++S +
Sbjct: 204  REILRWDPLPTTEAAYAAVRKETAHQSIFG---NTQQGVGSGLNSLGSSDGLGLVSRSRW 260

Query: 1509 RKYDGKKSTTKEDKSHLKCEVCGMNRHTK 1595
                   S+++ DKS LKC+ CGM +HTK
Sbjct: 261  SDQKSNPSSSRIDKSKLKCDHCGMAKHTK 289


>ref|XP_021992301.1| uncharacterized protein LOC110889107 [Helianthus annuus]
 gb|OTG06567.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 380

 Score =  257 bits (656), Expect = 4e-77
 Identities = 129/214 (60%), Positives = 157/214 (73%), Gaps = 7/214 (3%)
 Frame = +3

Query: 975  QHDLVVFSWLIQNIEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQ 1154
            Q DLVVFSWLIQNIEP +A NLTE+PTAK LWDAL  TYSSGKDKLQTF+LHVK+N ++Q
Sbjct: 88   QDDLVVFSWLIQNIEPALASNLTEFPTAKTLWDALVTTYSSGKDKLQTFDLHVKSNGIKQ 147

Query: 1155 NDKSLEEFWIELQGVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQFLNGLDRKFESVK 1334
            N   LE+FWI +QGVWGEI+R DPNPM C  DI  Y K+RSEQKLFQFLN LDR+++ +K
Sbjct: 148  NGSPLEDFWIIMQGVWGEIERRDPNPMTCAADIATYNKLRSEQKLFQFLNALDRQYDPIK 207

Query: 1335 REVLRVDPLPTVEAAYATIRKEAAHQNILGATNNEH--QGIATGL---VAGETGGTGLTT 1499
            RE+LR DPLP+ E AYA +RK  AHQ ILG T+N     G+A GL    + E    G  T
Sbjct: 208  REILRWDPLPSAEGAYAAVRKVMAHQGILGTTDNSSSPSGVAAGLNTNRSSEPESLGFLT 267

Query: 1500 K--SFRKYDGKKSTTKEDKSHLKCEVCGMNRHTK 1595
            K  + +K     S+++ DK+ LKC+ CG N+HTK
Sbjct: 268  KGRTNQKNSTLGSSSRIDKTKLKCDHCGKNKHTK 301


>ref|XP_021994452.1| uncharacterized protein LOC110891101 [Helianthus annuus]
          Length = 386

 Score =  255 bits (651), Expect = 2e-76
 Identities = 127/212 (59%), Positives = 154/212 (72%), Gaps = 5/212 (2%)
 Frame = +3

Query: 975  QHDLVVFSWLIQNIEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQ 1154
            Q DL+VFSWLIQNIEP IA NLTE+PT+K LW+AL  TYSSGKDKLQ F+LHVKAN L+Q
Sbjct: 86   QDDLIVFSWLIQNIEPAIASNLTEFPTSKTLWEALQTTYSSGKDKLQIFDLHVKANSLKQ 145

Query: 1155 NDKSLEEFWIELQGVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQFLNGLDRKFESVK 1334
             +  +E+ WI LQG+WGEIDR +PNPM C  DI  Y ++RSEQKLFQFLN LD +F++VK
Sbjct: 146  KEVPVEDLWINLQGIWGEIDRREPNPMTCTTDINTYNRLRSEQKLFQFLNALDHRFDTVK 205

Query: 1335 REVLRVDPLPTVEAAYATIRKEAAHQNILGATNNEHQ--GIATGLVA---GETGGTGLTT 1499
            RE+LR +PLPT E AYATIRKE  HQ ILGA  +E Q  GIA+GL      +T G GL +
Sbjct: 206  REILRGEPLPTAEPAYATIRKETTHQIILGAGTSETQIHGIASGLATTNLQQTDGLGLIS 265

Query: 1500 KSFRKYDGKKSTTKEDKSHLKCEVCGMNRHTK 1595
            K   + +       + K+ LKC+ CG  RHTK
Sbjct: 266  KGNCRSEKTTGNKNDPKAKLKCDHCGKPRHTK 297


>gb|OTG35332.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 465

 Score =  256 bits (655), Expect = 6e-76
 Identities = 124/214 (57%), Positives = 160/214 (74%), Gaps = 7/214 (3%)
 Frame = +3

Query: 975  QHDLVVFSWLIQNIEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQ 1154
            Q+DL+VFSWLIQNIEP +A NLTE+PTAK LWDAL +TYSSGKDKLQTF+LHVK+N ++Q
Sbjct: 168  QNDLIVFSWLIQNIEPALASNLTEFPTAKTLWDALVVTYSSGKDKLQTFDLHVKSNGIKQ 227

Query: 1155 NDKSLEEFWIELQGVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQFLNGLDRKFESVK 1334
            N  SLE+FWI +QG+WGE +R DPNPM C  DI  Y KIRSEQKLF+FLN LDR+++++K
Sbjct: 228  NGSSLEDFWINMQGIWGETERRDPNPMTCITDIATYNKIRSEQKLFKFLNALDRQYDTIK 287

Query: 1335 REVLRVDPLPTVEAAYATIRKEAAHQNILGAT---NNEHQGIATGLVAGETGGTGL---- 1493
             E+LR DPLP+ E AYA +RKE AHQ ILG T   ++ + G+A GL A  +   GL    
Sbjct: 288  MEILRWDPLPSAEGAYAAVRKEMAHQGILGTTADNSSLNNGVAAGLAANGSKEVGLGFLS 347

Query: 1494 TTKSFRKYDGKKSTTKEDKSHLKCEVCGMNRHTK 1595
              ++ ++     S+ + DK+ LKC+ CGM +HTK
Sbjct: 348  KGRTGQRNFNSGSSPRIDKTKLKCDHCGMMKHTK 381


>gb|OTG08964.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 571

 Score =  258 bits (658), Expect = 3e-75
 Identities = 129/214 (60%), Positives = 157/214 (73%), Gaps = 7/214 (3%)
 Frame = +3

Query: 975  QHDLVVFSWLIQNIEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQ 1154
            Q DLVVFSWLIQNIEP +A NLTE+PTAK LWDAL  TYSSGKDKLQTF+LHVK+N ++Q
Sbjct: 88   QDDLVVFSWLIQNIEPALASNLTEFPTAKTLWDALVTTYSSGKDKLQTFDLHVKSNGIKQ 147

Query: 1155 NDKSLEEFWIELQGVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQFLNGLDRKFESVK 1334
            N   LE+FWI +QGVWGEI+R DPNPM C  DI  Y K+RSEQKLFQFLN LDR+++ +K
Sbjct: 148  NGSPLEDFWIIMQGVWGEIERRDPNPMTCAADIATYNKLRSEQKLFQFLNALDRQYDPIK 207

Query: 1335 REVLRVDPLPTVEAAYATIRKEAAHQNILGATNNEH--QGIATGL---VAGETGGTGLTT 1499
            RE+LR DPLP+ E AYA +RKE AHQ ILG  +N     G+A GL    + E    G  T
Sbjct: 208  REILRWDPLPSAEGAYAAVRKEMAHQGILGTNDNSSSPSGVAAGLNTNRSSEPESLGFLT 267

Query: 1500 K--SFRKYDGKKSTTKEDKSHLKCEVCGMNRHTK 1595
            K  + +K     S+++ DK+ LKC+ CG N+HTK
Sbjct: 268  KGRTNQKNSTLGSSSRIDKTKLKCDHCGKNKHTK 301


>ref|XP_020537368.1| uncharacterized protein LOC110010150 [Jatropha curcas]
          Length = 384

 Score =  251 bits (642), Expect = 5e-75
 Identities = 121/217 (55%), Positives = 156/217 (71%), Gaps = 10/217 (4%)
 Frame = +3

Query: 975  QHDLVVFSWLIQNIEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQ 1154
            Q DL+VFSWLIQN+EP +A NLTEY TAK LW+AL ITYSSGKDKLQ F+LH +AN ++Q
Sbjct: 88   QEDLIVFSWLIQNMEPQLANNLTEYSTAKDLWNALVITYSSGKDKLQIFDLHTRANSMKQ 147

Query: 1155 NDKSLEEFWIELQGVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQFLNGLDRKFESVK 1334
               +LEEFW+ +QG+WGE+DR +PNPM C  DI  Y K++ EQKLFQFLNG+D  ++ +K
Sbjct: 148  GSSTLEEFWLTMQGIWGEMDRREPNPMTCSTDIATYNKVKQEQKLFQFLNGIDHLYDQIK 207

Query: 1335 REVLRVDPLPTVEAAYATIRKEAAHQNILGATNNE--HQGIATGLV------AGETGGTG 1490
            RE+LR + LP+ EAAYA++RKEAA  NI+G  N E   QGI  G V      A E  G G
Sbjct: 208  REILRSEHLPSAEAAYASVRKEAARLNIMGPANRESLSQGIGDGFVIIGKKEASEATGVG 267

Query: 1491 LTTKSFRKYDGKK--STTKEDKSHLKCEVCGMNRHTK 1595
            L  +  R+ + +   S+++ DKS LKC  CGM++HTK
Sbjct: 268  LVARGQRRSEPRNDGSSSRPDKSRLKCSYCGMSKHTK 304


>ref|XP_021974743.1| uncharacterized protein LOC110869838 [Helianthus annuus]
          Length = 291

 Score =  245 bits (625), Expect = 1e-73
 Identities = 129/213 (60%), Positives = 151/213 (70%), Gaps = 6/213 (2%)
 Frame = +3

Query: 975  QHDLVVFSWLIQNIEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQ 1154
            Q DLVVFSWLIQNIEP +AGNLTE+PTAKALWDAL +TYSSGKDKLQTF+LHVKANEL+Q
Sbjct: 19   QDDLVVFSWLIQNIEPGLAGNLTEFPTAKALWDALVVTYSSGKDKLQTFDLHVKANELKQ 78

Query: 1155 NDKSLEEFWIELQGVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQFLNGLDRKFESVK 1334
            N  +LE+FWI+LQG+WGEIDR DPNPM C  DI  Y  I              RKF  VK
Sbjct: 79   NGSALEDFWIKLQGIWGEIDRRDPNPMTCSVDIATYNNI--------------RKFHPVK 124

Query: 1335 REVLRVDPLPTVEAAYATIRKEAAHQNILGATNNEHQ-GIATGLVAGET---GGTGLTTK 1502
            RE+LR DPLP+ E AYA +R E A+Q I G  +   Q G+  GL+AG T    G GL TK
Sbjct: 125  REILRRDPLPSAEQAYAAVRNEMAYQGICGTISETSQSGVTAGLIAGRTTEIDGHGLITK 184

Query: 1503 SFRK--YDGKKSTTKEDKSHLKCEVCGMNRHTK 1595
              R+  + G KS+++ DKS LKC  CGMN+HTK
Sbjct: 185  GQRRSGFTG-KSSSRIDKSKLKCSHCGMNKHTK 216


>ref|XP_019158475.1| PREDICTED: uncharacterized protein LOC109155214 [Ipomoea nil]
          Length = 975

 Score =  258 bits (658), Expect = 3e-72
 Identities = 127/217 (58%), Positives = 159/217 (73%), Gaps = 10/217 (4%)
 Frame = +3

Query: 975  QHDLVVFSWLIQNIEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQ 1154
            Q DLVVFSWLIQNI+P +A NLTE+PTAK+LWDAL +TYSSGKDKLQTF+LHVK NE++Q
Sbjct: 97   QDDLVVFSWLIQNIKPALASNLTEFPTAKSLWDALVVTYSSGKDKLQTFDLHVKTNEIKQ 156

Query: 1155 NDKSLEEFWIELQGVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQFLNGLDRKFESVK 1334
            N   LE+F I +QG+WGEI+R DPNPM C  DI  Y K+R+EQKLFQFLN +DR+++ +K
Sbjct: 157  NGAPLEDFGILMQGIWGEIERRDPNPMTCAADIATYNKLRAEQKLFQFLNAIDRQYDPIK 216

Query: 1335 REVLRVDPLPTVEAAYATIRKEAAHQNILGATN---NEHQGIATGLVA---GETGGTGLT 1496
            RE+LR DPL + E AYA +R E AHQNILGA +   +  QG+A GL      E  G GL 
Sbjct: 217  REILRWDPLTSAEGAYAAVRNETAHQNILGAVSAITSSQQGVAAGLTVTGPSEAEGLGLI 276

Query: 1497 TKSFRKYD----GKKSTTKEDKSHLKCEVCGMNRHTK 1595
            +K  R+ D       S+++ DKS L C  CGM++HTK
Sbjct: 277  SKGQRRSDQTGRTNGSSSRPDKSQLNCSHCGMSKHTK 313


>gb|OTG08979.1| putative ribonuclease H-like domain, GAG-pre-integrase domain,
            Gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 851

 Score =  255 bits (651), Expect = 7e-72
 Identities = 127/212 (59%), Positives = 154/212 (72%), Gaps = 5/212 (2%)
 Frame = +3

Query: 975  QHDLVVFSWLIQNIEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQ 1154
            Q DL+VFSWLIQNIEP IA NLTE+PT+K LW+AL  TYSSGKDKLQ F+LHVKAN L+Q
Sbjct: 86   QDDLIVFSWLIQNIEPAIASNLTEFPTSKTLWEALQTTYSSGKDKLQIFDLHVKANSLKQ 145

Query: 1155 NDKSLEEFWIELQGVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQFLNGLDRKFESVK 1334
             +  +E+ WI LQG+WGEIDR +PNPM C  DI  Y ++RSEQKLFQFLN LD +F++VK
Sbjct: 146  KEVPVEDLWINLQGIWGEIDRREPNPMTCTTDINTYNRLRSEQKLFQFLNALDHRFDTVK 205

Query: 1335 REVLRVDPLPTVEAAYATIRKEAAHQNILGATNNEHQ--GIATGLVA---GETGGTGLTT 1499
            RE+LR +PLPT E AYATIRKE  HQ ILGA  +E Q  GIA+GL      +T G GL +
Sbjct: 206  REILRGEPLPTAEPAYATIRKETTHQIILGAGTSETQIHGIASGLATTNLQQTDGLGLIS 265

Query: 1500 KSFRKYDGKKSTTKEDKSHLKCEVCGMNRHTK 1595
            K   + +       + K+ LKC+ CG  RHTK
Sbjct: 266  KGNCRSEKTTGNKNDPKAKLKCDHCGKPRHTK 297


>gb|OTG25356.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 252

 Score =  228 bits (582), Expect = 7e-68
 Identities = 108/155 (69%), Positives = 128/155 (82%), Gaps = 1/155 (0%)
 Frame = +3

Query: 975  QHDLVVFSWLIQNIEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQ 1154
            Q DLVVFSWLIQNIE ++A NLTE+PTAK LWDAL +TYSSGKDKLQTF+LH+K+N +++
Sbjct: 89   QDDLVVFSWLIQNIERSLASNLTEFPTAKTLWDALVVTYSSGKDKLQTFDLHLKSNSIKE 148

Query: 1155 NDKSLEEFWIELQGVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQ-FLNGLDRKFESV 1331
            N   LE+FWI LQGVWGEI+R DPNPM C  DI  Y K+RSEQKLFQ FLN LDR+F+ +
Sbjct: 149  NGSPLEDFWIVLQGVWGEIERRDPNPMTCAVDIATYNKLRSEQKLFQFFLNALDRQFDPI 208

Query: 1332 KREVLRVDPLPTVEAAYATIRKEAAHQNILGATNN 1436
            KRE+LR DPLP+VE AYA +RKE AHQ ILG  +N
Sbjct: 209  KREILRWDPLPSVEGAYAAVRKEMAHQGILGTNDN 243


>ref|XP_020535493.1| uncharacterized protein LOC110009599 [Jatropha curcas]
          Length = 284

 Score =  229 bits (583), Expect = 1e-67
 Identities = 111/204 (54%), Positives = 144/204 (70%), Gaps = 10/204 (4%)
 Frame = +3

Query: 1014 IEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQNDKSLEEFWIELQ 1193
            +EP +A NLTEY TAK LW+AL ITYSSGKDKLQ F+LH +AN ++Q   +LEEFW+ +Q
Sbjct: 1    MEPQLANNLTEYSTAKDLWNALVITYSSGKDKLQIFDLHTRANSMKQGSSTLEEFWLTMQ 60

Query: 1194 GVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQFLNGLDRKFESVKREVLRVDPLPTVE 1373
            G+WGEIDR +PNPM C  DI  Y K++ EQKLFQFLNG+D  ++ +KRE+LR + LP+ E
Sbjct: 61   GIWGEIDRREPNPMTCSTDIATYNKVKQEQKLFQFLNGIDHLYDQIKREILRSEHLPSAE 120

Query: 1374 AAYATIRKEAAHQNILGATNNE--HQGIATGLV------AGETGGTGLTTKSFRKYDGKK 1529
            AAYA++RKEAA  NI+G  N E   QGI  G V      A E  G GL  +  R+ + + 
Sbjct: 121  AAYASVRKEAARLNIMGPANRESLSQGIGDGFVIIGKKEASEATGVGLVARGQRRSEPRN 180

Query: 1530 --STTKEDKSHLKCEVCGMNRHTK 1595
              S+++ DKS LKC  CGM++HTK
Sbjct: 181  DGSSSRPDKSRLKCSYCGMSKHTK 204


>ref|XP_020535814.1| uncharacterized protein LOC110009728, partial [Jatropha curcas]
          Length = 267

 Score =  223 bits (568), Expect = 1e-65
 Identities = 107/180 (59%), Positives = 132/180 (73%), Gaps = 8/180 (4%)
 Frame = +3

Query: 975  QHDLVVFSWLIQNIEPTIAGNLTEYPTAKALWDALAITYSSGKDKLQTFNLHVKANELRQ 1154
            Q DL+VFSWLIQN+EP +A NLTEY TAK LW+AL ITYSSGKDKLQ F+LH +AN ++Q
Sbjct: 88   QEDLIVFSWLIQNMEPQLANNLTEYSTAKDLWNALVITYSSGKDKLQIFDLHTRANSMKQ 147

Query: 1155 NDKSLEEFWIELQGVWGEIDRIDPNPMKCQEDIKIYAKIRSEQKLFQFLNGLDRKFESVK 1334
               +LEEFW+ +QG+WGEIDR +PNPM C  DI  Y K++ EQKLFQFLNG+D  ++ +K
Sbjct: 148  GSSTLEEFWLTMQGIWGEIDRREPNPMTCSTDIATYNKVKQEQKLFQFLNGIDHLYDQIK 207

Query: 1335 REVLRVDPLPTVEAAYATIRKEAAHQNILGATNNE--HQGIATGLV------AGETGGTG 1490
            RE+LR + LP+ EAAYA++RKEAA  NI+G  N E   QGI  G V      A E  G G
Sbjct: 208  REILRSEHLPSAEAAYASVRKEAARLNIMGPANRESLSQGIGDGFVIIGKKEASEATGVG 267

