BLASTX nr result

ID: Chrysanthemum22_contig00041062

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00041062
         (852 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|OTF86030.1| putative ribonuclease H-like domain-containing pr...   363   e-112
ref|XP_022032681.1| uncharacterized protein LOC110933783 [Helian...   327   e-105
ref|XP_021974902.1| uncharacterized protein LOC110870015 [Helian...   301   2e-98
ref|XP_022012651.1| uncharacterized protein LOC110912271 [Helian...   308   3e-98
ref|XP_021992301.1| uncharacterized protein LOC110889107 [Helian...   301   7e-98
gb|OTG08964.1| putative gag-polypeptide of LTR copia-type [Helia...   306   2e-97
gb|OTG29887.1| putative gag-polypeptide of LTR copia-type [Helia...   297   4e-97
gb|OTG35332.1| putative gag-polypeptide of LTR copia-type [Helia...   297   4e-95
ref|XP_021994452.1| uncharacterized protein LOC110891101 [Helian...   294   5e-95
ref|XP_022019013.1| uncharacterized protein LOC110919044 [Helian...   291   6e-94
ref|XP_022011939.1| uncharacterized protein LOC110911621 [Helian...   291   1e-93
ref|XP_019158475.1| PREDICTED: uncharacterized protein LOC109155...   304   8e-93
ref|XP_021996287.1| uncharacterized protein LOC110893489 [Helian...   291   5e-92
gb|OTG08979.1| putative ribonuclease H-like domain, GAG-pre-inte...   294   4e-90
ref|XP_020537368.1| uncharacterized protein LOC110010150 [Jatrop...   281   8e-90
ref|XP_022000771.1| uncharacterized protein LOC110898294 [Helian...   263   7e-85
ref|XP_020535493.1| uncharacterized protein LOC110009599 [Jatrop...   260   4e-83
ref|XP_021974743.1| uncharacterized protein LOC110869838 [Helian...   251   2e-79
gb|OTG25356.1| putative gag-polypeptide of LTR copia-type [Helia...   228   7e-71
ref|XP_022026592.1| uncharacterized protein LOC110927245 [Helian...   230   4e-65

>gb|OTF86030.1| putative ribonuclease H-like domain-containing protein [Helianthus
           annuus]
          Length = 1532

 Score =  363 bits (933), Expect = e-112
 Identities = 185/292 (63%), Positives = 216/292 (73%), Gaps = 13/292 (4%)
 Frame = -3

Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671
           DL+VFSW+IQNIEP++AGNLTE+PTAK LWDAL  TYSSG+DKLQ FNL+VK NE+KQN+
Sbjct: 87  DLVVFSWLIQNIEPVLAGNLTEFPTAKSLWDALVVTYSSGRDKLQTFNLHVKANEIKQND 146

Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491
           K LEDFWI LQGVWGEIDRIDPNPM C +DI+TY R RSEQ LFQFL+ LDRK+DPIKRE
Sbjct: 147 KSLEDFWIILQGVWGEIDRIDPNPMKCPEDIRTYLRIRSEQKLFQFLNALDRKYDPIKRE 206

Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGT--NETQGIATGLNVGFGDTEGVGLAIKGYRR 317
           ILRLD LPSAE AYATVRKEAAH +ILGT  ++TQGIA GL+     TEG+GL  KG+RR
Sbjct: 207 ILRLDPLPSAEAAYATVRKEAAHQNILGTTVDDTQGIAAGLSA--TGTEGLGLVTKGHRR 264

Query: 316 NDGKKPFV--KEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHK--------KGTKK 167
            DGKK     KEDK+HL CD C MT HTKAQCF+IVGYP+WW+DGHK        KGT  
Sbjct: 265 FDGKKNGAPNKEDKTHLKCDHCGMTRHTKAQCFKIVGYPDWWSDGHKKSKTTGPEKGTAA 324

Query: 166 KVFPTSQATTSKEGTEKGFGGLAAAGNSKGEGSFAV-TGKKERERDFISHTY 14
                 +    +     GFGG+AAA   + +  F+V TG     +  I H+Y
Sbjct: 325 AAIGDQEGAAREGRNPTGFGGVAAAAIGETDDVFSVTTGTGVERKVSIPHSY 376


>ref|XP_022032681.1| uncharacterized protein LOC110933783 [Helianthus annuus]
          Length = 592

 Score =  327 bits (837), Expect = e-105
 Identities = 172/280 (61%), Positives = 203/280 (72%), Gaps = 15/280 (5%)
 Frame = -3

Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671
           DL+VFSW+IQNIEP +A NLTE+PTAK LWDAL  TYSSG+DKLQ FNL+VK N++KQN+
Sbjct: 36  DLVVFSWLIQNIEPALASNLTEFPTAKSLWDALVVTYSSGRDKLQTFNLHVKANDIKQND 95

Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491
             LE+FWI LQG+WGEIDR       C +DI+TYS+ RSEQ LFQFL+ LDRK+DPIKRE
Sbjct: 96  TSLEEFWITLQGIWGEIDR------KCPEDIQTYSKIRSEQKLFQFLNALDRKYDPIKRE 149

Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGT--NETQGIATGLNVGFGDTEGVGLAIKGYRR 317
           +LRLD LPSAE AYA VRKEAAH +ILG   +ETQGI  GL     + EG+GL  KG RR
Sbjct: 150 LLRLDPLPSAEAAYAAVRKEAAHQNILGATLSETQGIGAGLVA--TEKEGLGLISKG-RR 206

Query: 316 NDGKK--PFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTKKKVFPTSQA 143
            DGKK  P VKEDKSHL CD C MT HTK  CFR+VGYPEWW+DGHKKGTK       +A
Sbjct: 207 FDGKKNGPPVKEDKSHLKCDHCGMTKHTKEHCFRLVGYPEWWSDGHKKGTKTAGAEKGKA 266

Query: 142 -----------TTSKEGTEKGFGGLAAAGNSKGEGSFAVT 56
                      T+  +  + GF GLAAA + + EG F++T
Sbjct: 267 SAAVGNNHAANTSDGDRNDTGFEGLAAAADGE-EGVFSMT 305


>ref|XP_021974902.1| uncharacterized protein LOC110870015 [Helianthus annuus]
          Length = 325

 Score =  301 bits (770), Expect = 2e-98
 Identities = 153/228 (67%), Positives = 173/228 (75%), Gaps = 5/228 (2%)
 Frame = -3

Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671
           DL+VFSW+IQNIEP +AGNLTE+PTAK LWDAL  TYSSGKDKLQ F+L+VK NELKQN 
Sbjct: 38  DLVVFSWLIQNIEPGLAGNLTEFPTAKALWDALVVTYSSGKDKLQTFDLHVKANELKQNG 97

Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491
             LEDFWI LQG+WGEIDR D NPMTC+ DI TY   RSEQ LFQFL+ LDRKFDP+KRE
Sbjct: 98  SALEDFWIKLQGIWGEIDRRDLNPMTCSADIATYKNIRSEQKLFQFLNALDRKFDPVKRE 157

Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGT-NET--QGIATGLNV-GFGDTEGVGLAIKGY 323
           ILR D LPSAE AYA VRKE AH  ILGT +ET   G+A+GL V G  + +G GL  KG 
Sbjct: 158 ILRWDPLPSAEQAYAAVRKEMAHQGILGTISETSQSGVASGLIVGGTNEIDGQGLITKGQ 217

Query: 322 RRND-GKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHK 182
           RR+D   K   + DKS L C  C M  HTK QCFRIVG+P+WW+D HK
Sbjct: 218 RRSDFTGKSSSRIDKSKLKCSHCGMNKHTKDQCFRIVGFPDWWSDNHK 265


>ref|XP_022012651.1| uncharacterized protein LOC110912271 [Helianthus annuus]
          Length = 565

 Score =  308 bits (789), Expect = 3e-98
 Identities = 156/244 (63%), Positives = 183/244 (75%), Gaps = 2/244 (0%)
 Frame = -3

Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671
           DLIVFSW+IQNIEP +A NLTE+PTAK LWDALT TYSSGKDKLQ F+L+VK NE KQ+ 
Sbjct: 36  DLIVFSWLIQNIEPSLASNLTEFPTAKTLWDALTVTYSSGKDKLQTFDLHVKVNEFKQSG 95

Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491
            PLE+FWI +QG+WGEI+R DPNPM C  DI TY++ RSE  LFQFL+ LDRK+D +KRE
Sbjct: 96  LPLEEFWIVMQGIWGEIERRDPNPMKCPTDIATYNKVRSEYKLFQFLNALDRKYDSLKRE 155

Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGTNETQGIATGLNVGFGDTEGVGLAIKGYRRND 311
           ILR D LPSAE AYA VRKE AH SI G N  QG+A+GLN   G+++G+GL  +G RR+D
Sbjct: 156 ILRWDPLPSAEAAYAVVRKETAHQSIFG-NVHQGVASGLN-STGESDGLGLVTRG-RRSD 212

Query: 310 GK--KPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTKKKVFPTSQATT 137
            K  +   + DKS L CD C M  HTK QCF++VGYPEWW DGHKKG K      SQ TT
Sbjct: 213 QKSNQSSSRIDKSKLKCDHCGMAKHTKEQCFKLVGYPEWWADGHKKG-KASAAVGSQGTT 271

Query: 136 SKEG 125
           S  G
Sbjct: 272 SSGG 275


>ref|XP_021992301.1| uncharacterized protein LOC110889107 [Helianthus annuus]
 gb|OTG06567.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 380

 Score =  301 bits (771), Expect = 7e-98
 Identities = 150/255 (58%), Positives = 179/255 (70%), Gaps = 7/255 (2%)
 Frame = -3

Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671
           DL+VFSW+IQNIEP +A NLTE+PTAK LWDAL TTYSSGKDKLQ F+L+VK N +KQN 
Sbjct: 90  DLVVFSWLIQNIEPALASNLTEFPTAKTLWDALVTTYSSGKDKLQTFDLHVKSNGIKQNG 149

Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491
            PLEDFWI +QGVWGEI+R DPNPMTCA DI TY++ RSEQ LFQFL+ LDR++DPIKRE
Sbjct: 150 SPLEDFWIIMQGVWGEIERRDPNPMTCAADIATYNKLRSEQKLFQFLNALDRQYDPIKRE 209

Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGTNET----QGIATGLNVG-FGDTEGVGLAIKG 326
           ILR D LPSAE AYA VRK  AH  ILGT +      G+A GLN     + E +G   KG
Sbjct: 210 ILRWDPLPSAEGAYAAVRKVMAHQGILGTTDNSSSPSGVAAGLNTNRSSEPESLGFLTKG 269

Query: 325 --YRRNDGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTKKKVFPT 152
              ++N       + DK+ L CD C    HTK+QCF +VGYPEWW DGHKKG K+     
Sbjct: 270 RTNQKNSTLGSSSRIDKTKLKCDHCGKNKHTKSQCFELVGYPEWWNDGHKKGNKEGGKAA 329

Query: 151 SQATTSKEGTEKGFG 107
           +    ++E   +G G
Sbjct: 330 ATIGKTEEPEHRGGG 344


>gb|OTG08964.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 571

 Score =  306 bits (784), Expect = 2e-97
 Identities = 152/255 (59%), Positives = 181/255 (70%), Gaps = 7/255 (2%)
 Frame = -3

Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671
           DL+VFSW+IQNIEP +A NLTE+PTAK LWDAL TTYSSGKDKLQ F+L+VK N +KQN 
Sbjct: 90  DLVVFSWLIQNIEPALASNLTEFPTAKTLWDALVTTYSSGKDKLQTFDLHVKSNGIKQNG 149

Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491
            PLEDFWI +QGVWGEI+R DPNPMTCA DI TY++ RSEQ LFQFL+ LDR++DPIKRE
Sbjct: 150 SPLEDFWIIMQGVWGEIERRDPNPMTCAADIATYNKLRSEQKLFQFLNALDRQYDPIKRE 209

Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGTNET----QGIATGLNVG-FGDTEGVGLAIKG 326
           ILR D LPSAE AYA VRKE AH  ILGTN+      G+A GLN     + E +G   KG
Sbjct: 210 ILRWDPLPSAEGAYAAVRKEMAHQGILGTNDNSSSPSGVAAGLNTNRSSEPESLGFLTKG 269

Query: 325 --YRRNDGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTKKKVFPT 152
              ++N       + DK+ L CD C    HTK+QCF +VGYPEWW DGHKKG K+     
Sbjct: 270 RTNQKNSTLGSSSRIDKTKLKCDHCGKNKHTKSQCFELVGYPEWWNDGHKKGNKEGGKAA 329

Query: 151 SQATTSKEGTEKGFG 107
           +    ++E   +G G
Sbjct: 330 ATIGKTEEPEHRGGG 344


>gb|OTG29887.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 325

 Score =  297 bits (761), Expect = 4e-97
 Identities = 155/267 (58%), Positives = 182/267 (68%), Gaps = 12/267 (4%)
 Frame = -3

Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671
           DLIVFSW+IQNIEP +A NLTE+PTAK LWDAL  TYSSGKDKLQ F+L+VK N +KQN 
Sbjct: 38  DLIVFSWLIQNIEPALASNLTEFPTAKSLWDALVVTYSSGKDKLQTFDLHVKANGIKQNG 97

Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491
            PLEDFWI +QG+WGEIDR DPNPMTC  DI TY++ RSEQ LFQFL+ LDR++D IKRE
Sbjct: 98  SPLEDFWIIMQGIWGEIDRRDPNPMTCTVDIATYNKIRSEQKLFQFLNALDRQYDTIKRE 157

Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILG--TNETQGIATGLNV-GFGDTEGVGLAIKGY- 323
           ILR D LPSAE AYA VRKE AH  ILG  T+    +A GL   G  +TE +G   KG  
Sbjct: 158 ILRWDPLPSAEGAYAAVRKEMAHQGILGTATSSHNNVAAGLVANGSHETESLGFLSKGRS 217

Query: 322 -RRNDGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTK---KKVFP 155
            ++N       + DKS L C  C M  HTK QCF++VGYPEWW DGHKK  K   K    
Sbjct: 218 GQKNPNSGSSSQIDKSKLKCLHCGMLKHTKDQCFKLVGYPEWWNDGHKKRNKEGGKAAAA 277

Query: 154 TSQATTSKEGTEK----GFGGLAAAGN 86
                 +  G ++    GFGG+A AG+
Sbjct: 278 IGDTKNNSAGNDQQNSGGFGGVAFAGD 304


>gb|OTG35332.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 465

 Score =  297 bits (760), Expect = 4e-95
 Identities = 154/284 (54%), Positives = 182/284 (64%), Gaps = 18/284 (6%)
 Frame = -3

Query: 850  DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671
            DLIVFSW+IQNIEP +A NLTE+PTAK LWDAL  TYSSGKDKLQ F+L+VK N +KQN 
Sbjct: 170  DLIVFSWLIQNIEPALASNLTEFPTAKTLWDALVVTYSSGKDKLQTFDLHVKSNGIKQNG 229

Query: 670  KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491
              LEDFWI +QG+WGE +R DPNPMTC  DI TY++ RSEQ LF+FL+ LDR++D IK E
Sbjct: 230  SSLEDFWINMQGIWGETERRDPNPMTCITDIATYNKIRSEQKLFKFLNALDRQYDTIKME 289

Query: 490  ILRLDTLPSAETAYATVRKEAAHHSILGTNE-----TQGIATGLNVGFGDTEGVGLAIKG 326
            ILR D LPSAE AYA VRKE AH  ILGT         G+A GL        G+G   KG
Sbjct: 290  ILRWDPLPSAEGAYAAVRKEMAHQGILGTTADNSSLNNGVAAGLAANGSKEVGLGFLSKG 349

Query: 325  Y--RRNDGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTKKKVFPT 152
               +RN       + DK+ L CD C M  HTK QCFR+VGYPEWW DGHK+G K+     
Sbjct: 350  RTGQRNFNSGSSPRIDKTKLKCDHCGMMKHTKDQCFRLVGYPEWWNDGHKRGNKEGKAVA 409

Query: 151  SQATTSKEGTEK-----------GFGGLAAAGNSKGEGSFAVTG 53
            +   T   G  +           GFGG+A AGN   + S  + G
Sbjct: 410  AIGNTEGIGENQPRNGNDQSRLSGFGGVAFAGNKNTQTSEEIDG 453


>ref|XP_021994452.1| uncharacterized protein LOC110891101 [Helianthus annuus]
          Length = 386

 Score =  294 bits (753), Expect = 5e-95
 Identities = 154/272 (56%), Positives = 186/272 (68%), Gaps = 11/272 (4%)
 Frame = -3

Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671
           DLIVFSW+IQNIEP IA NLTE+PT+K LW+AL TTYSSGKDKLQIF+L+VK N LKQ E
Sbjct: 88  DLIVFSWLIQNIEPAIASNLTEFPTSKTLWEALQTTYSSGKDKLQIFDLHVKANSLKQKE 147

Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491
            P+ED WI LQG+WGEIDR +PNPMTC  DI TY+R RSEQ LFQFL+ LD +FD +KRE
Sbjct: 148 VPVEDLWINLQGIWGEIDRREPNPMTCTTDINTYNRLRSEQKLFQFLNALDHRFDTVKRE 207

Query: 490 ILRLDTLPSAETAYATVRKEAAHHSIL--GTNETQ--GIATGL-NVGFGDTEGVGLAIKG 326
           ILR + LP+AE AYAT+RKE  H  IL  GT+ETQ  GIA+GL       T+G+GL  KG
Sbjct: 208 ILRGEPLPTAEPAYATIRKETTHQIILGAGTSETQIHGIASGLATTNLQQTDGLGLISKG 267

Query: 325 YRRNDGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTKK--KVFPT 152
             R++       + K+ L CD C    HTK QCF +VGYP+WW  G K+  K   K   +
Sbjct: 268 NCRSEKTTGNKNDPKAKLKCDHCGKPRHTKDQCFHLVGYPDWWEIGPKRNNKDEGKRDTS 327

Query: 151 SQATTSKEGTEKGFGGLAAAGN----SKGEGS 68
           +    S  G  +GFGG+ +  N    S G GS
Sbjct: 328 TTGQGSNAGGREGFGGVVSGDNKENTSDGHGS 359


>ref|XP_022019013.1| uncharacterized protein LOC110919044 [Helianthus annuus]
          Length = 379

 Score =  291 bits (745), Expect = 6e-94
 Identities = 150/228 (65%), Positives = 169/228 (74%), Gaps = 5/228 (2%)
 Frame = -3

Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671
           DL VFSW+IQNIEP +AGNLTE+PTAK LWDAL  TYSSGKDKLQ F+L+VK NELKQN 
Sbjct: 95  DLFVFSWLIQNIEPGLAGNLTEFPTAKALWDALVVTYSSGKDKLQTFDLHVKANELKQNG 154

Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491
             LEDFWI LQG+WGEIDR+DPNPMTC+ D+ TY+  RSEQ LFQFL+ LDRKFD +KRE
Sbjct: 155 SALEDFWIKLQGIWGEIDRMDPNPMTCSADVATYNNIRSEQKLFQFLNALDRKFDLVKRE 214

Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGT-NET--QGIATGLNV-GFGDTEGVGLAIKGY 323
           IL  D LPSAE AYATVRKE AH  ILGT +ET   G+A GL   G  +T+G GL  KG 
Sbjct: 215 ILWWDPLPSAEQAYATVRKEMAHQGILGTISETSQSGVAAGLVAGGTTETDGQGLITKGQ 274

Query: 322 RR-NDGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHK 182
           RR N   K   + DKS L C  C    HTK QCFRIVG+  WW+D HK
Sbjct: 275 RRSNFTGKSSSRIDKSKLKCSHCGKNKHTKDQCFRIVGFLNWWSDNHK 322


>ref|XP_022011939.1| uncharacterized protein LOC110911621 [Helianthus annuus]
 gb|OTF95102.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 394

 Score =  291 bits (744), Expect = 1e-93
 Identities = 151/285 (52%), Positives = 188/285 (65%), Gaps = 26/285 (9%)
 Frame = -3

Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671
           DL+V SW+IQNIEP +A NLTE+PTAK LWDAL  TYSSGKDKLQ F+L+VK N +KQN 
Sbjct: 93  DLVVISWLIQNIEPALASNLTEFPTAKTLWDALVVTYSSGKDKLQTFDLHVKANGIKQNG 152

Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491
            PLEDFWI +QGVWGEI+R DPNPMTC +DI TY++ RSEQ LFQFL+ LDR++D +KRE
Sbjct: 153 SPLEDFWIIMQGVWGEIERRDPNPMTCPEDITTYNKIRSEQKLFQFLNALDRQYDTVKRE 212

Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILG-----TNETQGIATGLNV-GFGDTEGVGLAIK 329
           ILR D LPSAE AYA VRKE AH  ILG     +    G+A GLN  G  ++EG+G   +
Sbjct: 213 ILRWDPLPSAEGAYAAVRKEMAHQGILGITIDTSYNPNGVAAGLNANGSRESEGLGFLSR 272

Query: 328 GY--RRNDGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTKKKVFP 155
           G   +++       + DKS L C  C M+ HTK QCF++VGYPEWW D HK     K+  
Sbjct: 273 GRVDQKSSNTGSSFRIDKSKLKCGHCGMSKHTKDQCFQLVGYPEWWNDNHKTQKGGKIST 332

Query: 154 TSQATTSKEGTEK------------------GFGGLAAAGNSKGE 74
            +  +++  G +K                  GFGG+ AAGN  G+
Sbjct: 333 AAGRSSAAIGNQKATSSGGKGQEDTDGGGASGFGGM-AAGNYIGQ 376


>ref|XP_019158475.1| PREDICTED: uncharacterized protein LOC109155214 [Ipomoea nil]
          Length = 975

 Score =  304 bits (778), Expect = 8e-93
 Identities = 165/311 (53%), Positives = 201/311 (64%), Gaps = 28/311 (9%)
 Frame = -3

Query: 850  DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671
            DL+VFSW+IQNI+P +A NLTE+PTAK LWDAL  TYSSGKDKLQ F+L+VK NE+KQN 
Sbjct: 99   DLVVFSWLIQNIKPALASNLTEFPTAKSLWDALVVTYSSGKDKLQTFDLHVKTNEIKQNG 158

Query: 670  KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491
             PLEDF I +QG+WGEI+R DPNPMTCA DI TY++ R+EQ LFQFL+ +DR++DPIKRE
Sbjct: 159  APLEDFGILMQGIWGEIERRDPNPMTCAADIATYNKLRAEQKLFQFLNAIDRQYDPIKRE 218

Query: 490  ILRLDTLPSAETAYATVRKEAAHHSILG-----TNETQGIATGLNV-GFGDTEGVGLAIK 329
            ILR D L SAE AYA VR E AH +ILG     T+  QG+A GL V G  + EG+GL  K
Sbjct: 219  ILRWDPLTSAEGAYAAVRNETAHQNILGAVSAITSSQQGVAAGLTVTGPSEAEGLGLISK 278

Query: 328  GYRRND----GKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHK---KGTK 170
            G RR+D          + DKS L C  C M+ HTK QCF+IVGYPEWW DGHK   K T+
Sbjct: 279  GQRRSDQTGRTNGSSSRPDKSQLNCSHCGMSKHTKEQCFKIVGYPEWWNDGHKQSGKTTR 338

Query: 169  KKVFPTSQATTSKEGT-----------EKGFGGLAAA----GNSKGEGSFAVTGKKERER 35
                  + A  + + T           E GFGG+AA        KG G F+         
Sbjct: 339  SNGGRAAAAVRNNDTTINIGDGQGNIREGGFGGMAAVKRDDETGKGFGDFS------SSP 392

Query: 34   DFISHTYQPQK 2
             F++  Y PQ+
Sbjct: 393  SFLNPKYFPQR 403


>ref|XP_021996287.1| uncharacterized protein LOC110893489 [Helianthus annuus]
          Length = 548

 Score =  291 bits (746), Expect = 5e-92
 Identities = 154/275 (56%), Positives = 186/275 (67%), Gaps = 7/275 (2%)
 Frame = -3

Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671
           DLIVFSW+IQNIEP +  NLTE+PTA  LWDAL+ TYSSGKDKLQ F+L+VK NE KQN 
Sbjct: 86  DLIVFSWLIQNIEPSLTSNLTEFPTANTLWDALSVTYSSGKDKLQTFDLHVKANEFKQNG 145

Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491
            PLE+FWI +QG+ GEI R DPNPM C+DDI TY++ RSE  LFQ L+ LDRK+D +KRE
Sbjct: 146 LPLEEFWIVMQGIRGEIKRRDPNPMKCSDDIATYNKVRSENKLFQLLNALDRKYDSLKRE 205

Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGTNETQGIATGLNVGFGDTEGVGLAIKGYRRND 311
           ILR D LP+ E AYA VRKE AH SI G N  QG+ +GLN   G ++G+GL  +    + 
Sbjct: 206 ILRWDPLPTTEAAYAAVRKETAHQSIFG-NTQQGVGSGLN-SLGSSDGLGLVSRSRWSDQ 263

Query: 310 GKKPFVKE-DKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTKKKVFPTSQATTS 134
              P     DKS L CD C M  HTK QCF+IVGYP+WW DGHKKG +      +Q TTS
Sbjct: 264 KSNPSSSRIDKSKLKCDHCGMAKHTKEQCFKIVGYPDWWADGHKKG-RAAAAVGNQETTS 322

Query: 133 KEGT----EKGFGGLAA--AGNSKGEGSFAVTGKK 47
             G+    +K  GGL     GNS G    A+ G++
Sbjct: 323 SGGSSGEHQKLAGGLDGIDKGNSGGCCFAALQGEE 357


>gb|OTG08979.1| putative ribonuclease H-like domain, GAG-pre-integrase domain,
           Gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 851

 Score =  294 bits (753), Expect = 4e-90
 Identities = 154/272 (56%), Positives = 186/272 (68%), Gaps = 11/272 (4%)
 Frame = -3

Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671
           DLIVFSW+IQNIEP IA NLTE+PT+K LW+AL TTYSSGKDKLQIF+L+VK N LKQ E
Sbjct: 88  DLIVFSWLIQNIEPAIASNLTEFPTSKTLWEALQTTYSSGKDKLQIFDLHVKANSLKQKE 147

Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491
            P+ED WI LQG+WGEIDR +PNPMTC  DI TY+R RSEQ LFQFL+ LD +FD +KRE
Sbjct: 148 VPVEDLWINLQGIWGEIDRREPNPMTCTTDINTYNRLRSEQKLFQFLNALDHRFDTVKRE 207

Query: 490 ILRLDTLPSAETAYATVRKEAAHHSIL--GTNETQ--GIATGL-NVGFGDTEGVGLAIKG 326
           ILR + LP+AE AYAT+RKE  H  IL  GT+ETQ  GIA+GL       T+G+GL  KG
Sbjct: 208 ILRGEPLPTAEPAYATIRKETTHQIILGAGTSETQIHGIASGLATTNLQQTDGLGLISKG 267

Query: 325 YRRNDGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTKK--KVFPT 152
             R++       + K+ L CD C    HTK QCF +VGYP+WW  G K+  K   K   +
Sbjct: 268 NCRSEKTTGNKNDPKAKLKCDHCGKPRHTKDQCFHLVGYPDWWEIGPKRNNKDEGKRDTS 327

Query: 151 SQATTSKEGTEKGFGGLAAAGN----SKGEGS 68
           +    S  G  +GFGG+ +  N    S G GS
Sbjct: 328 TTGQGSNAGGREGFGGVVSGDNKENTSDGHGS 359


>ref|XP_020537368.1| uncharacterized protein LOC110010150 [Jatropha curcas]
          Length = 384

 Score =  281 bits (718), Expect = 8e-90
 Identities = 147/299 (49%), Positives = 195/299 (65%), Gaps = 22/299 (7%)
 Frame = -3

Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671
           DLIVFSW+IQN+EP +A NLTEY TAK LW+AL  TYSSGKDKLQIF+L+ + N +KQ  
Sbjct: 90  DLIVFSWLIQNMEPQLANNLTEYSTAKDLWNALVITYSSGKDKLQIFDLHTRANSMKQGS 149

Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491
             LE+FW+ +QG+WGE+DR +PNPMTC+ DI TY++ + EQ LFQFL+G+D  +D IKRE
Sbjct: 150 STLEEFWLTMQGIWGEMDRREPNPMTCSTDIATYNKVKQEQKLFQFLNGIDHLYDQIKRE 209

Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGTNETQGIATGLNVGF--------GDTEGVGLA 335
           ILR + LPSAE AYA+VRKEAA  +I+G    + ++ G+  GF         +  GVGL 
Sbjct: 210 ILRSEHLPSAEAAYASVRKEAARLNIMGPANRESLSQGIGDGFVIIGKKEASEATGVGLV 269

Query: 334 IKGYR----RNDGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHK-KGTK 170
            +G R    RNDG     + DKS L C  C M+ HTK QCF ++GYPEWW + H+ KGTK
Sbjct: 270 ARGQRRSEPRNDGSSS--RPDKSRLKCSYCGMSKHTKDQCFELIGYPEWWNENHRNKGTK 327

Query: 169 K--------KVFPTSQATTSKEG-TEKGFGGLAAAGNSKGEGSFAVTGKKERERDFISH 20
                     +   S    S+ G  E+G  G+ AA N + +G+  + G +ERE  ++ H
Sbjct: 328 TSKAAAAVGNLEAASSGGDSRGGQNERGAMGMVAAQNERADGT--LEGYQEREDYWMWH 384


>ref|XP_022000771.1| uncharacterized protein LOC110898294 [Helianthus annuus]
          Length = 247

 Score =  263 bits (673), Expect = 7e-85
 Identities = 131/211 (62%), Positives = 156/211 (73%)
 Frame = -3

Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671
           DLIVFSW+IQNIEP +A NLTE+PTAK LWDALT TYSSGKDKLQ F+L+VK NE KQN 
Sbjct: 36  DLIVFSWLIQNIEPSLASNLTEFPTAKTLWDALTITYSSGKDKLQTFDLHVKANEFKQNG 95

Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491
            PLE+FWI +QG+WGEI R DPNP+ C  DI TY++ RSE  LFQFL+ LDRK+D +KRE
Sbjct: 96  VPLEEFWIIMQGIWGEIKRRDPNPIACPADIATYNKVRSEYKLFQFLNALDRKYDSLKRE 155

Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGTNETQGIATGLNVGFGDTEGVGLAIKGYRRND 311
           IL+ D LPS E AYA VRKE  H SI G N  +G+ +GLN   GDT+G+GL  +  RR+D
Sbjct: 156 ILQWDPLPSVEVAYAVVRKETTHQSIFG-NPHKGVGSGLN-SHGDTDGLGLVSRS-RRSD 212

Query: 310 GKKPFVKEDKSHLTCDECKMTGHTKAQCFRI 218
            K    + DKS L C+ C M  HTK QCF +
Sbjct: 213 QKPSSSRIDKSKLRCEHCGMAKHTKDQCFSV 243


>ref|XP_020535493.1| uncharacterized protein LOC110009599 [Jatropha curcas]
          Length = 284

 Score =  260 bits (665), Expect = 4e-83
 Identities = 138/288 (47%), Positives = 184/288 (63%), Gaps = 22/288 (7%)
 Frame = -3

Query: 817 IEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNEKPLEDFWIALQ 638
           +EP +A NLTEY TAK LW+AL  TYSSGKDKLQIF+L+ + N +KQ    LE+FW+ +Q
Sbjct: 1   MEPQLANNLTEYSTAKDLWNALVITYSSGKDKLQIFDLHTRANSMKQGSSTLEEFWLTMQ 60

Query: 637 GVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKREILRLDTLPSAE 458
           G+WGEIDR +PNPMTC+ DI TY++ + EQ LFQFL+G+D  +D IKREILR + LPSAE
Sbjct: 61  GIWGEIDRREPNPMTCSTDIATYNKVKQEQKLFQFLNGIDHLYDQIKREILRSEHLPSAE 120

Query: 457 TAYATVRKEAAHHSILGTNETQGIATGLNVGF--------GDTEGVGLAIKGYR----RN 314
            AYA+VRKEAA  +I+G    + ++ G+  GF         +  GVGL  +G R    RN
Sbjct: 121 AAYASVRKEAARLNIMGPANRESLSQGIGDGFVIIGKKEASEATGVGLVARGQRRSEPRN 180

Query: 313 DGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHK-KGTKK--------KV 161
           DG     + DKS L C  C M+ HTK QCF ++GYPEWW + H+ KGTK          +
Sbjct: 181 DGSSS--RPDKSRLKCSYCGMSKHTKDQCFELIGYPEWWNENHRNKGTKTSKAAAAVGNL 238

Query: 160 FPTSQATTSKEG-TEKGFGGLAAAGNSKGEGSFAVTGKKERERDFISH 20
              S    S+ G  E+G  G+ AA N + +G+  + G +ERE  ++ H
Sbjct: 239 EAASSGGDSRGGQNERGAMGMVAAQNERADGT--LEGYQEREDYWMWH 284


>ref|XP_021974743.1| uncharacterized protein LOC110869838 [Helianthus annuus]
          Length = 291

 Score =  251 bits (641), Expect = 2e-79
 Identities = 134/228 (58%), Positives = 154/228 (67%), Gaps = 5/228 (2%)
 Frame = -3

Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671
           DL+VFSW+IQNIEP +AGNLTE+PTAK LWDAL  TYSSGKDKLQ F+L+VK NELKQN 
Sbjct: 21  DLVVFSWLIQNIEPGLAGNLTEFPTAKALWDALVVTYSSGKDKLQTFDLHVKANELKQNG 80

Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491
             LEDFWI LQG+WGEIDR DPNPMTC+ DI TY+                RKF P+KRE
Sbjct: 81  SALEDFWIKLQGIWGEIDRRDPNPMTCSVDIATYNNI--------------RKFHPVKRE 126

Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGT-NET--QGIATGLNVG-FGDTEGVGLAIKGY 323
           ILR D LPSAE AYA VR E A+  I GT +ET   G+  GL  G   + +G GL  KG 
Sbjct: 127 ILRRDPLPSAEQAYAAVRNEMAYQGICGTISETSQSGVTAGLIAGRTTEIDGHGLITKGQ 186

Query: 322 RRND-GKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHK 182
           RR+    K   + DKS L C  C M  HTK QCFRI G+P+ W+D HK
Sbjct: 187 RRSGFTGKSSSRIDKSKLKCSHCGMNKHTKDQCFRIAGFPDCWSDNHK 234


>gb|OTG25356.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 252

 Score =  228 bits (581), Expect = 7e-71
 Identities = 110/152 (72%), Positives = 125/152 (82%), Gaps = 1/152 (0%)
 Frame = -3

Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671
           DL+VFSW+IQNIE  +A NLTE+PTAK LWDAL  TYSSGKDKLQ F+L++K N +K+N 
Sbjct: 91  DLVVFSWLIQNIERSLASNLTEFPTAKTLWDALVVTYSSGKDKLQTFDLHLKSNSIKENG 150

Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQ-FLHGLDRKFDPIKR 494
            PLEDFWI LQGVWGEI+R DPNPMTCA DI TY++ RSEQ LFQ FL+ LDR+FDPIKR
Sbjct: 151 SPLEDFWIVLQGVWGEIERRDPNPMTCAVDIATYNKLRSEQKLFQFFLNALDRQFDPIKR 210

Query: 493 EILRLDTLPSAETAYATVRKEAAHHSILGTNE 398
           EILR D LPS E AYA VRKE AH  ILGTN+
Sbjct: 211 EILRWDPLPSVEGAYAAVRKEMAHQGILGTND 242


>ref|XP_022026592.1| uncharacterized protein LOC110927245 [Helianthus annuus]
          Length = 1084

 Score =  230 bits (586), Expect = 4e-65
 Identities = 119/265 (44%), Positives = 161/265 (60%), Gaps = 5/265 (1%)
 Frame = -3

Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671
           D  VF+WIIQN+E  +  N+++YPTAK LWD L TTY  G D LQ+F+L+ +   L+Q  
Sbjct: 91  DQCVFTWIIQNLESNLVNNVSQYPTAKALWDGLATTYGFGTDSLQVFDLHKRAKSLRQGS 150

Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491
             LED W  LQ +W  IDR DPNPM   +DI+ Y++   EQ L+Q L  LD K +P+KR+
Sbjct: 151 DTLEDLWNKLQSIWMSIDRRDPNPMKDPEDIQMYNKKTQEQRLYQLLTALDDKMEPVKRD 210

Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGTNETQGIATGLNVGFGDTEGVGLAIKGYRRND 311
           IL+ D LP+ E AYAT+R+E A  +IL +  +   +T +        G+GLA K + +  
Sbjct: 211 ILKKDPLPTVEMAYATIRREDARMNILRSGPSDNESTEI--------GMGLAAKDWSQRT 262

Query: 310 GKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKK---GTKKKVFPTSQAT 140
             +P  KEDKS L C  C+M  HTK QCF+IVGYPEWW DG K+   G   K  P +   
Sbjct: 263 KFRPRDKEDKSKLFCTYCQMKRHTKDQCFKIVGYPEWWGDGQKQKNSGADGKGTPAAGGG 322

Query: 139 TS--KEGTEKGFGGLAAAGNSKGEG 71
            +  ++G+  GFGGLAA  +    G
Sbjct: 323 VAPVEKGSSGGFGGLAATTDDTSIG 347

