BLASTX nr result
ID: Chrysanthemum22_contig00041062
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00041062
         (852 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|OTF86030.1| putative ribonuclease H-like domain-containing pr...   363   e-112
ref|XP_022032681.1| uncharacterized protein LOC110933783 [Helian...   327   e-105
ref|XP_021974902.1| uncharacterized protein LOC110870015 [Helian...   301   2e-98
ref|XP_022012651.1| uncharacterized protein LOC110912271 [Helian...   308   3e-98
ref|XP_021992301.1| uncharacterized protein LOC110889107 [Helian...   301   7e-98
gb|OTG08964.1| putative gag-polypeptide of LTR copia-type [Helia...   306   2e-97
gb|OTG29887.1| putative gag-polypeptide of LTR copia-type [Helia...   297   4e-97
gb|OTG35332.1| putative gag-polypeptide of LTR copia-type [Helia...   297   4e-95
ref|XP_021994452.1| uncharacterized protein LOC110891101 [Helian...   294   5e-95
ref|XP_022019013.1| uncharacterized protein LOC110919044 [Helian...   291   6e-94
ref|XP_022011939.1| uncharacterized protein LOC110911621 [Helian...   291   1e-93
ref|XP_019158475.1| PREDICTED: uncharacterized protein LOC109155...   304   8e-93
ref|XP_021996287.1| uncharacterized protein LOC110893489 [Helian...   291   5e-92
gb|OTG08979.1| putative ribonuclease H-like domain, GAG-pre-inte...   294   4e-90
ref|XP_020537368.1| uncharacterized protein LOC110010150 [Jatrop...   281   8e-90
ref|XP_022000771.1| uncharacterized protein LOC110898294 [Helian...   263   7e-85
ref|XP_020535493.1| uncharacterized protein LOC110009599 [Jatrop...
260 4e-83 ref|XP_021974743.1| uncharacterized protein LOC110869838 [Helian... 251 2e-79 gb|OTG25356.1| putative gag-polypeptide of LTR copia-type [Helia... 228 7e-71 ref|XP_022026592.1| uncharacterized protein LOC110927245 [Helian... 230 4e-65 >gb|OTF86030.1| putative ribonuclease H-like domain-containing protein [Helianthus annuus] Length = 1532 Score = 363 bits (933), Expect = e-112 Identities = 185/292 (63%), Positives = 216/292 (73%), Gaps = 13/292 (4%) Frame = -3 Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671 DL+VFSW+IQNIEP++AGNLTE+PTAK LWDAL TYSSG+DKLQ FNL+VK NE+KQN+ Sbjct: 87 DLVVFSWLIQNIEPVLAGNLTEFPTAKSLWDALVVTYSSGRDKLQTFNLHVKANEIKQND 146 Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491 K LEDFWI LQGVWGEIDRIDPNPM C +DI+TY R RSEQ LFQFL+ LDRK+DPIKRE Sbjct: 147 KSLEDFWIILQGVWGEIDRIDPNPMKCPEDIRTYLRIRSEQKLFQFLNALDRKYDPIKRE 206 Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGT--NETQGIATGLNVGFGDTEGVGLAIKGYRR 317 ILRLD LPSAE AYATVRKEAAH +ILGT ++TQGIA GL+ TEG+GL KG+RR Sbjct: 207 ILRLDPLPSAEAAYATVRKEAAHQNILGTTVDDTQGIAAGLSA--TGTEGLGLVTKGHRR 264 Query: 316 NDGKKPFV--KEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHK--------KGTKK 167 DGKK KEDK+HL CD C MT HTKAQCF+IVGYP+WW+DGHK KGT Sbjct: 265 FDGKKNGAPNKEDKTHLKCDHCGMTRHTKAQCFKIVGYPDWWSDGHKKSKTTGPEKGTAA 324 Query: 166 KVFPTSQATTSKEGTEKGFGGLAAAGNSKGEGSFAV-TGKKERERDFISHTY 14 + + GFGG+AAA + + F+V TG + I H+Y Sbjct: 325 AAIGDQEGAAREGRNPTGFGGVAAAAIGETDDVFSVTTGTGVERKVSIPHSY 376 >ref|XP_022032681.1| uncharacterized protein LOC110933783 [Helianthus annuus] Length = 592 Score = 327 bits (837), Expect = e-105 Identities = 172/280 (61%), Positives = 203/280 (72%), Gaps = 15/280 (5%) Frame = -3 Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671 DL+VFSW+IQNIEP +A NLTE+PTAK LWDAL TYSSG+DKLQ FNL+VK N++KQN+ Sbjct: 36 DLVVFSWLIQNIEPALASNLTEFPTAKSLWDALVVTYSSGRDKLQTFNLHVKANDIKQND 95 Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491 LE+FWI LQG+WGEIDR C +DI+TYS+ 
RSEQ LFQFL+ LDRK+DPIKRE Sbjct: 96 TSLEEFWITLQGIWGEIDR------KCPEDIQTYSKIRSEQKLFQFLNALDRKYDPIKRE 149 Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGT--NETQGIATGLNVGFGDTEGVGLAIKGYRR 317 +LRLD LPSAE AYA VRKEAAH +ILG +ETQGI GL + EG+GL KG RR Sbjct: 150 LLRLDPLPSAEAAYAAVRKEAAHQNILGATLSETQGIGAGLVA--TEKEGLGLISKG-RR 206 Query: 316 NDGKK--PFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTKKKVFPTSQA 143 DGKK P VKEDKSHL CD C MT HTK CFR+VGYPEWW+DGHKKGTK +A Sbjct: 207 FDGKKNGPPVKEDKSHLKCDHCGMTKHTKEHCFRLVGYPEWWSDGHKKGTKTAGAEKGKA 266 Query: 142 -----------TTSKEGTEKGFGGLAAAGNSKGEGSFAVT 56 T+ + + GF GLAAA + + EG F++T Sbjct: 267 SAAVGNNHAANTSDGDRNDTGFEGLAAAADGE-EGVFSMT 305 >ref|XP_021974902.1| uncharacterized protein LOC110870015 [Helianthus annuus] Length = 325 Score = 301 bits (770), Expect = 2e-98 Identities = 153/228 (67%), Positives = 173/228 (75%), Gaps = 5/228 (2%) Frame = -3 Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671 DL+VFSW+IQNIEP +AGNLTE+PTAK LWDAL TYSSGKDKLQ F+L+VK NELKQN Sbjct: 38 DLVVFSWLIQNIEPGLAGNLTEFPTAKALWDALVVTYSSGKDKLQTFDLHVKANELKQNG 97 Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491 LEDFWI LQG+WGEIDR D NPMTC+ DI TY RSEQ LFQFL+ LDRKFDP+KRE Sbjct: 98 SALEDFWIKLQGIWGEIDRRDLNPMTCSADIATYKNIRSEQKLFQFLNALDRKFDPVKRE 157 Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGT-NET--QGIATGLNV-GFGDTEGVGLAIKGY 323 ILR D LPSAE AYA VRKE AH ILGT +ET G+A+GL V G + +G GL KG Sbjct: 158 ILRWDPLPSAEQAYAAVRKEMAHQGILGTISETSQSGVASGLIVGGTNEIDGQGLITKGQ 217 Query: 322 RRND-GKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHK 182 RR+D K + DKS L C C M HTK QCFRIVG+P+WW+D HK Sbjct: 218 RRSDFTGKSSSRIDKSKLKCSHCGMNKHTKDQCFRIVGFPDWWSDNHK 265 >ref|XP_022012651.1| uncharacterized protein LOC110912271 [Helianthus annuus] Length = 565 Score = 308 bits (789), Expect = 3e-98 Identities = 156/244 (63%), Positives = 183/244 (75%), Gaps = 2/244 (0%) Frame = -3 Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671 DLIVFSW+IQNIEP +A NLTE+PTAK LWDALT TYSSGKDKLQ F+L+VK 
NE KQ+ Sbjct: 36 DLIVFSWLIQNIEPSLASNLTEFPTAKTLWDALTVTYSSGKDKLQTFDLHVKVNEFKQSG 95 Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491 PLE+FWI +QG+WGEI+R DPNPM C DI TY++ RSE LFQFL+ LDRK+D +KRE Sbjct: 96 LPLEEFWIVMQGIWGEIERRDPNPMKCPTDIATYNKVRSEYKLFQFLNALDRKYDSLKRE 155 Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGTNETQGIATGLNVGFGDTEGVGLAIKGYRRND 311 ILR D LPSAE AYA VRKE AH SI G N QG+A+GLN G+++G+GL +G RR+D Sbjct: 156 ILRWDPLPSAEAAYAVVRKETAHQSIFG-NVHQGVASGLN-STGESDGLGLVTRG-RRSD 212 Query: 310 GK--KPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTKKKVFPTSQATT 137 K + + DKS L CD C M HTK QCF++VGYPEWW DGHKKG K SQ TT Sbjct: 213 QKSNQSSSRIDKSKLKCDHCGMAKHTKEQCFKLVGYPEWWADGHKKG-KASAAVGSQGTT 271 Query: 136 SKEG 125 S G Sbjct: 272 SSGG 275 >ref|XP_021992301.1| uncharacterized protein LOC110889107 [Helianthus annuus] gb|OTG06567.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus] Length = 380 Score = 301 bits (771), Expect = 7e-98 Identities = 150/255 (58%), Positives = 179/255 (70%), Gaps = 7/255 (2%) Frame = -3 Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671 DL+VFSW+IQNIEP +A NLTE+PTAK LWDAL TTYSSGKDKLQ F+L+VK N +KQN Sbjct: 90 DLVVFSWLIQNIEPALASNLTEFPTAKTLWDALVTTYSSGKDKLQTFDLHVKSNGIKQNG 149 Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491 PLEDFWI +QGVWGEI+R DPNPMTCA DI TY++ RSEQ LFQFL+ LDR++DPIKRE Sbjct: 150 SPLEDFWIIMQGVWGEIERRDPNPMTCAADIATYNKLRSEQKLFQFLNALDRQYDPIKRE 209 Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGTNET----QGIATGLNVG-FGDTEGVGLAIKG 326 ILR D LPSAE AYA VRK AH ILGT + G+A GLN + E +G KG Sbjct: 210 ILRWDPLPSAEGAYAAVRKVMAHQGILGTTDNSSSPSGVAAGLNTNRSSEPESLGFLTKG 269 Query: 325 --YRRNDGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTKKKVFPT 152 ++N + DK+ L CD C HTK+QCF +VGYPEWW DGHKKG K+ Sbjct: 270 RTNQKNSTLGSSSRIDKTKLKCDHCGKNKHTKSQCFELVGYPEWWNDGHKKGNKEGGKAA 329 Query: 151 SQATTSKEGTEKGFG 107 + ++E +G G Sbjct: 330 ATIGKTEEPEHRGGG 344 >gb|OTG08964.1| putative gag-polypeptide of LTR copia-type 
[Helianthus annuus] Length = 571 Score = 306 bits (784), Expect = 2e-97 Identities = 152/255 (59%), Positives = 181/255 (70%), Gaps = 7/255 (2%) Frame = -3 Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671 DL+VFSW+IQNIEP +A NLTE+PTAK LWDAL TTYSSGKDKLQ F+L+VK N +KQN Sbjct: 90 DLVVFSWLIQNIEPALASNLTEFPTAKTLWDALVTTYSSGKDKLQTFDLHVKSNGIKQNG 149 Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491 PLEDFWI +QGVWGEI+R DPNPMTCA DI TY++ RSEQ LFQFL+ LDR++DPIKRE Sbjct: 150 SPLEDFWIIMQGVWGEIERRDPNPMTCAADIATYNKLRSEQKLFQFLNALDRQYDPIKRE 209 Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGTNET----QGIATGLNVG-FGDTEGVGLAIKG 326 ILR D LPSAE AYA VRKE AH ILGTN+ G+A GLN + E +G KG Sbjct: 210 ILRWDPLPSAEGAYAAVRKEMAHQGILGTNDNSSSPSGVAAGLNTNRSSEPESLGFLTKG 269 Query: 325 --YRRNDGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTKKKVFPT 152 ++N + DK+ L CD C HTK+QCF +VGYPEWW DGHKKG K+ Sbjct: 270 RTNQKNSTLGSSSRIDKTKLKCDHCGKNKHTKSQCFELVGYPEWWNDGHKKGNKEGGKAA 329 Query: 151 SQATTSKEGTEKGFG 107 + ++E +G G Sbjct: 330 ATIGKTEEPEHRGGG 344 >gb|OTG29887.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus] Length = 325 Score = 297 bits (761), Expect = 4e-97 Identities = 155/267 (58%), Positives = 182/267 (68%), Gaps = 12/267 (4%) Frame = -3 Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671 DLIVFSW+IQNIEP +A NLTE+PTAK LWDAL TYSSGKDKLQ F+L+VK N +KQN Sbjct: 38 DLIVFSWLIQNIEPALASNLTEFPTAKSLWDALVVTYSSGKDKLQTFDLHVKANGIKQNG 97 Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491 PLEDFWI +QG+WGEIDR DPNPMTC DI TY++ RSEQ LFQFL+ LDR++D IKRE Sbjct: 98 SPLEDFWIIMQGIWGEIDRRDPNPMTCTVDIATYNKIRSEQKLFQFLNALDRQYDTIKRE 157 Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILG--TNETQGIATGLNV-GFGDTEGVGLAIKGY- 323 ILR D LPSAE AYA VRKE AH ILG T+ +A GL G +TE +G KG Sbjct: 158 ILRWDPLPSAEGAYAAVRKEMAHQGILGTATSSHNNVAAGLVANGSHETESLGFLSKGRS 217 Query: 322 -RRNDGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTK---KKVFP 155 ++N + DKS L C C M HTK QCF++VGYPEWW 
DGHKK K K Sbjct: 218 GQKNPNSGSSSQIDKSKLKCLHCGMLKHTKDQCFKLVGYPEWWNDGHKKRNKEGGKAAAA 277 Query: 154 TSQATTSKEGTEK----GFGGLAAAGN 86 + G ++ GFGG+A AG+ Sbjct: 278 IGDTKNNSAGNDQQNSGGFGGVAFAGD 304 >gb|OTG35332.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus] Length = 465 Score = 297 bits (760), Expect = 4e-95 Identities = 154/284 (54%), Positives = 182/284 (64%), Gaps = 18/284 (6%) Frame = -3 Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671 DLIVFSW+IQNIEP +A NLTE+PTAK LWDAL TYSSGKDKLQ F+L+VK N +KQN Sbjct: 170 DLIVFSWLIQNIEPALASNLTEFPTAKTLWDALVVTYSSGKDKLQTFDLHVKSNGIKQNG 229 Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491 LEDFWI +QG+WGE +R DPNPMTC DI TY++ RSEQ LF+FL+ LDR++D IK E Sbjct: 230 SSLEDFWINMQGIWGETERRDPNPMTCITDIATYNKIRSEQKLFKFLNALDRQYDTIKME 289 Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGTNE-----TQGIATGLNVGFGDTEGVGLAIKG 326 ILR D LPSAE AYA VRKE AH ILGT G+A GL G+G KG Sbjct: 290 ILRWDPLPSAEGAYAAVRKEMAHQGILGTTADNSSLNNGVAAGLAANGSKEVGLGFLSKG 349 Query: 325 Y--RRNDGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTKKKVFPT 152 +RN + DK+ L CD C M HTK QCFR+VGYPEWW DGHK+G K+ Sbjct: 350 RTGQRNFNSGSSPRIDKTKLKCDHCGMMKHTKDQCFRLVGYPEWWNDGHKRGNKEGKAVA 409 Query: 151 SQATTSKEGTEK-----------GFGGLAAAGNSKGEGSFAVTG 53 + T G + GFGG+A AGN + S + G Sbjct: 410 AIGNTEGIGENQPRNGNDQSRLSGFGGVAFAGNKNTQTSEEIDG 453 >ref|XP_021994452.1| uncharacterized protein LOC110891101 [Helianthus annuus] Length = 386 Score = 294 bits (753), Expect = 5e-95 Identities = 154/272 (56%), Positives = 186/272 (68%), Gaps = 11/272 (4%) Frame = -3 Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671 DLIVFSW+IQNIEP IA NLTE+PT+K LW+AL TTYSSGKDKLQIF+L+VK N LKQ E Sbjct: 88 DLIVFSWLIQNIEPAIASNLTEFPTSKTLWEALQTTYSSGKDKLQIFDLHVKANSLKQKE 147 Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491 P+ED WI LQG+WGEIDR +PNPMTC DI TY+R RSEQ LFQFL+ LD +FD +KRE Sbjct: 148 VPVEDLWINLQGIWGEIDRREPNPMTCTTDINTYNRLRSEQKLFQFLNALDHRFDTVKRE 
207 Query: 490 ILRLDTLPSAETAYATVRKEAAHHSIL--GTNETQ--GIATGL-NVGFGDTEGVGLAIKG 326 ILR + LP+AE AYAT+RKE H IL GT+ETQ GIA+GL T+G+GL KG Sbjct: 208 ILRGEPLPTAEPAYATIRKETTHQIILGAGTSETQIHGIASGLATTNLQQTDGLGLISKG 267 Query: 325 YRRNDGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTKK--KVFPT 152 R++ + K+ L CD C HTK QCF +VGYP+WW G K+ K K + Sbjct: 268 NCRSEKTTGNKNDPKAKLKCDHCGKPRHTKDQCFHLVGYPDWWEIGPKRNNKDEGKRDTS 327 Query: 151 SQATTSKEGTEKGFGGLAAAGN----SKGEGS 68 + S G +GFGG+ + N S G GS Sbjct: 328 TTGQGSNAGGREGFGGVVSGDNKENTSDGHGS 359 >ref|XP_022019013.1| uncharacterized protein LOC110919044 [Helianthus annuus] Length = 379 Score = 291 bits (745), Expect = 6e-94 Identities = 150/228 (65%), Positives = 169/228 (74%), Gaps = 5/228 (2%) Frame = -3 Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671 DL VFSW+IQNIEP +AGNLTE+PTAK LWDAL TYSSGKDKLQ F+L+VK NELKQN Sbjct: 95 DLFVFSWLIQNIEPGLAGNLTEFPTAKALWDALVVTYSSGKDKLQTFDLHVKANELKQNG 154 Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491 LEDFWI LQG+WGEIDR+DPNPMTC+ D+ TY+ RSEQ LFQFL+ LDRKFD +KRE Sbjct: 155 SALEDFWIKLQGIWGEIDRMDPNPMTCSADVATYNNIRSEQKLFQFLNALDRKFDLVKRE 214 Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGT-NET--QGIATGLNV-GFGDTEGVGLAIKGY 323 IL D LPSAE AYATVRKE AH ILGT +ET G+A GL G +T+G GL KG Sbjct: 215 ILWWDPLPSAEQAYATVRKEMAHQGILGTISETSQSGVAAGLVAGGTTETDGQGLITKGQ 274 Query: 322 RR-NDGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHK 182 RR N K + DKS L C C HTK QCFRIVG+ WW+D HK Sbjct: 275 RRSNFTGKSSSRIDKSKLKCSHCGKNKHTKDQCFRIVGFLNWWSDNHK 322 >ref|XP_022011939.1| uncharacterized protein LOC110911621 [Helianthus annuus] gb|OTF95102.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus] Length = 394 Score = 291 bits (744), Expect = 1e-93 Identities = 151/285 (52%), Positives = 188/285 (65%), Gaps = 26/285 (9%) Frame = -3 Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671 DL+V SW+IQNIEP +A NLTE+PTAK LWDAL TYSSGKDKLQ F+L+VK N +KQN Sbjct: 93 
DLVVISWLIQNIEPALASNLTEFPTAKTLWDALVVTYSSGKDKLQTFDLHVKANGIKQNG 152 Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491 PLEDFWI +QGVWGEI+R DPNPMTC +DI TY++ RSEQ LFQFL+ LDR++D +KRE Sbjct: 153 SPLEDFWIIMQGVWGEIERRDPNPMTCPEDITTYNKIRSEQKLFQFLNALDRQYDTVKRE 212 Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILG-----TNETQGIATGLNV-GFGDTEGVGLAIK 329 ILR D LPSAE AYA VRKE AH ILG + G+A GLN G ++EG+G + Sbjct: 213 ILRWDPLPSAEGAYAAVRKEMAHQGILGITIDTSYNPNGVAAGLNANGSRESEGLGFLSR 272 Query: 328 GY--RRNDGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTKKKVFP 155 G +++ + DKS L C C M+ HTK QCF++VGYPEWW D HK K+ Sbjct: 273 GRVDQKSSNTGSSFRIDKSKLKCGHCGMSKHTKDQCFQLVGYPEWWNDNHKTQKGGKIST 332 Query: 154 TSQATTSKEGTEK------------------GFGGLAAAGNSKGE 74 + +++ G +K GFGG+ AAGN G+ Sbjct: 333 AAGRSSAAIGNQKATSSGGKGQEDTDGGGASGFGGM-AAGNYIGQ 376 >ref|XP_019158475.1| PREDICTED: uncharacterized protein LOC109155214 [Ipomoea nil] Length = 975 Score = 304 bits (778), Expect = 8e-93 Identities = 165/311 (53%), Positives = 201/311 (64%), Gaps = 28/311 (9%) Frame = -3 Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671 DL+VFSW+IQNI+P +A NLTE+PTAK LWDAL TYSSGKDKLQ F+L+VK NE+KQN Sbjct: 99 DLVVFSWLIQNIKPALASNLTEFPTAKSLWDALVVTYSSGKDKLQTFDLHVKTNEIKQNG 158 Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491 PLEDF I +QG+WGEI+R DPNPMTCA DI TY++ R+EQ LFQFL+ +DR++DPIKRE Sbjct: 159 APLEDFGILMQGIWGEIERRDPNPMTCAADIATYNKLRAEQKLFQFLNAIDRQYDPIKRE 218 Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILG-----TNETQGIATGLNV-GFGDTEGVGLAIK 329 ILR D L SAE AYA VR E AH +ILG T+ QG+A GL V G + EG+GL K Sbjct: 219 ILRWDPLTSAEGAYAAVRNETAHQNILGAVSAITSSQQGVAAGLTVTGPSEAEGLGLISK 278 Query: 328 GYRRND----GKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHK---KGTK 170 G RR+D + DKS L C C M+ HTK QCF+IVGYPEWW DGHK K T+ Sbjct: 279 GQRRSDQTGRTNGSSSRPDKSQLNCSHCGMSKHTKEQCFKIVGYPEWWNDGHKQSGKTTR 338 Query: 169 KKVFPTSQATTSKEGT-----------EKGFGGLAAA----GNSKGEGSFAVTGKKERER 35 + A + + T E GFGG+AA KG G F+ Sbjct: 339 
SNGGRAAAAVRNNDTTINIGDGQGNIREGGFGGMAAVKRDDETGKGFGDFS------SSP 392 Query: 34 DFISHTYQPQK 2 F++ Y PQ+ Sbjct: 393 SFLNPKYFPQR 403 >ref|XP_021996287.1| uncharacterized protein LOC110893489 [Helianthus annuus] Length = 548 Score = 291 bits (746), Expect = 5e-92 Identities = 154/275 (56%), Positives = 186/275 (67%), Gaps = 7/275 (2%) Frame = -3 Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671 DLIVFSW+IQNIEP + NLTE+PTA LWDAL+ TYSSGKDKLQ F+L+VK NE KQN Sbjct: 86 DLIVFSWLIQNIEPSLTSNLTEFPTANTLWDALSVTYSSGKDKLQTFDLHVKANEFKQNG 145 Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491 PLE+FWI +QG+ GEI R DPNPM C+DDI TY++ RSE LFQ L+ LDRK+D +KRE Sbjct: 146 LPLEEFWIVMQGIRGEIKRRDPNPMKCSDDIATYNKVRSENKLFQLLNALDRKYDSLKRE 205 Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGTNETQGIATGLNVGFGDTEGVGLAIKGYRRND 311 ILR D LP+ E AYA VRKE AH SI G N QG+ +GLN G ++G+GL + + Sbjct: 206 ILRWDPLPTTEAAYAAVRKETAHQSIFG-NTQQGVGSGLN-SLGSSDGLGLVSRSRWSDQ 263 Query: 310 GKKPFVKE-DKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTKKKVFPTSQATTS 134 P DKS L CD C M HTK QCF+IVGYP+WW DGHKKG + +Q TTS Sbjct: 264 KSNPSSSRIDKSKLKCDHCGMAKHTKEQCFKIVGYPDWWADGHKKG-RAAAAVGNQETTS 322 Query: 133 KEGT----EKGFGGLAA--AGNSKGEGSFAVTGKK 47 G+ +K GGL GNS G A+ G++ Sbjct: 323 SGGSSGEHQKLAGGLDGIDKGNSGGCCFAALQGEE 357 >gb|OTG08979.1| putative ribonuclease H-like domain, GAG-pre-integrase domain, Gag-polypeptide of LTR copia-type [Helianthus annuus] Length = 851 Score = 294 bits (753), Expect = 4e-90 Identities = 154/272 (56%), Positives = 186/272 (68%), Gaps = 11/272 (4%) Frame = -3 Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671 DLIVFSW+IQNIEP IA NLTE+PT+K LW+AL TTYSSGKDKLQIF+L+VK N LKQ E Sbjct: 88 DLIVFSWLIQNIEPAIASNLTEFPTSKTLWEALQTTYSSGKDKLQIFDLHVKANSLKQKE 147 Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491 P+ED WI LQG+WGEIDR +PNPMTC DI TY+R RSEQ LFQFL+ LD +FD +KRE Sbjct: 148 VPVEDLWINLQGIWGEIDRREPNPMTCTTDINTYNRLRSEQKLFQFLNALDHRFDTVKRE 207 Query: 490 
ILRLDTLPSAETAYATVRKEAAHHSIL--GTNETQ--GIATGL-NVGFGDTEGVGLAIKG 326 ILR + LP+AE AYAT+RKE H IL GT+ETQ GIA+GL T+G+GL KG Sbjct: 208 ILRGEPLPTAEPAYATIRKETTHQIILGAGTSETQIHGIASGLATTNLQQTDGLGLISKG 267 Query: 325 YRRNDGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKKGTKK--KVFPT 152 R++ + K+ L CD C HTK QCF +VGYP+WW G K+ K K + Sbjct: 268 NCRSEKTTGNKNDPKAKLKCDHCGKPRHTKDQCFHLVGYPDWWEIGPKRNNKDEGKRDTS 327 Query: 151 SQATTSKEGTEKGFGGLAAAGN----SKGEGS 68 + S G +GFGG+ + N S G GS Sbjct: 328 TTGQGSNAGGREGFGGVVSGDNKENTSDGHGS 359 >ref|XP_020537368.1| uncharacterized protein LOC110010150 [Jatropha curcas] Length = 384 Score = 281 bits (718), Expect = 8e-90 Identities = 147/299 (49%), Positives = 195/299 (65%), Gaps = 22/299 (7%) Frame = -3 Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671 DLIVFSW+IQN+EP +A NLTEY TAK LW+AL TYSSGKDKLQIF+L+ + N +KQ Sbjct: 90 DLIVFSWLIQNMEPQLANNLTEYSTAKDLWNALVITYSSGKDKLQIFDLHTRANSMKQGS 149 Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491 LE+FW+ +QG+WGE+DR +PNPMTC+ DI TY++ + EQ LFQFL+G+D +D IKRE Sbjct: 150 STLEEFWLTMQGIWGEMDRREPNPMTCSTDIATYNKVKQEQKLFQFLNGIDHLYDQIKRE 209 Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGTNETQGIATGLNVGF--------GDTEGVGLA 335 ILR + LPSAE AYA+VRKEAA +I+G + ++ G+ GF + GVGL Sbjct: 210 ILRSEHLPSAEAAYASVRKEAARLNIMGPANRESLSQGIGDGFVIIGKKEASEATGVGLV 269 Query: 334 IKGYR----RNDGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHK-KGTK 170 +G R RNDG + DKS L C C M+ HTK QCF ++GYPEWW + H+ KGTK Sbjct: 270 ARGQRRSEPRNDGSSS--RPDKSRLKCSYCGMSKHTKDQCFELIGYPEWWNENHRNKGTK 327 Query: 169 K--------KVFPTSQATTSKEG-TEKGFGGLAAAGNSKGEGSFAVTGKKERERDFISH 20 + S S+ G E+G G+ AA N + +G+ + G +ERE ++ H Sbjct: 328 TSKAAAAVGNLEAASSGGDSRGGQNERGAMGMVAAQNERADGT--LEGYQEREDYWMWH 384 >ref|XP_022000771.1| uncharacterized protein LOC110898294 [Helianthus annuus] Length = 247 Score = 263 bits (673), Expect = 7e-85 Identities = 131/211 (62%), Positives = 156/211 (73%) Frame = -3 Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 
671 DLIVFSW+IQNIEP +A NLTE+PTAK LWDALT TYSSGKDKLQ F+L+VK NE KQN Sbjct: 36 DLIVFSWLIQNIEPSLASNLTEFPTAKTLWDALTITYSSGKDKLQTFDLHVKANEFKQNG 95 Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491 PLE+FWI +QG+WGEI R DPNP+ C DI TY++ RSE LFQFL+ LDRK+D +KRE Sbjct: 96 VPLEEFWIIMQGIWGEIKRRDPNPIACPADIATYNKVRSEYKLFQFLNALDRKYDSLKRE 155 Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGTNETQGIATGLNVGFGDTEGVGLAIKGYRRND 311 IL+ D LPS E AYA VRKE H SI G N +G+ +GLN GDT+G+GL + RR+D Sbjct: 156 ILQWDPLPSVEVAYAVVRKETTHQSIFG-NPHKGVGSGLN-SHGDTDGLGLVSRS-RRSD 212 Query: 310 GKKPFVKEDKSHLTCDECKMTGHTKAQCFRI 218 K + DKS L C+ C M HTK QCF + Sbjct: 213 QKPSSSRIDKSKLRCEHCGMAKHTKDQCFSV 243 >ref|XP_020535493.1| uncharacterized protein LOC110009599 [Jatropha curcas] Length = 284 Score = 260 bits (665), Expect = 4e-83 Identities = 138/288 (47%), Positives = 184/288 (63%), Gaps = 22/288 (7%) Frame = -3 Query: 817 IEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNEKPLEDFWIALQ 638 +EP +A NLTEY TAK LW+AL TYSSGKDKLQIF+L+ + N +KQ LE+FW+ +Q Sbjct: 1 MEPQLANNLTEYSTAKDLWNALVITYSSGKDKLQIFDLHTRANSMKQGSSTLEEFWLTMQ 60 Query: 637 GVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKREILRLDTLPSAE 458 G+WGEIDR +PNPMTC+ DI TY++ + EQ LFQFL+G+D +D IKREILR + LPSAE Sbjct: 61 GIWGEIDRREPNPMTCSTDIATYNKVKQEQKLFQFLNGIDHLYDQIKREILRSEHLPSAE 120 Query: 457 TAYATVRKEAAHHSILGTNETQGIATGLNVGF--------GDTEGVGLAIKGYR----RN 314 AYA+VRKEAA +I+G + ++ G+ GF + GVGL +G R RN Sbjct: 121 AAYASVRKEAARLNIMGPANRESLSQGIGDGFVIIGKKEASEATGVGLVARGQRRSEPRN 180 Query: 313 DGKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHK-KGTKK--------KV 161 DG + DKS L C C M+ HTK QCF ++GYPEWW + H+ KGTK + Sbjct: 181 DGSSS--RPDKSRLKCSYCGMSKHTKDQCFELIGYPEWWNENHRNKGTKTSKAAAAVGNL 238 Query: 160 FPTSQATTSKEG-TEKGFGGLAAAGNSKGEGSFAVTGKKERERDFISH 20 S S+ G E+G G+ AA N + +G+ + G +ERE ++ H Sbjct: 239 EAASSGGDSRGGQNERGAMGMVAAQNERADGT--LEGYQEREDYWMWH 284 >ref|XP_021974743.1| uncharacterized protein LOC110869838 [Helianthus annuus] Length = 291 Score = 251 bits (641), Expect = 2e-79 
Identities = 134/228 (58%), Positives = 154/228 (67%), Gaps = 5/228 (2%) Frame = -3 Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671 DL+VFSW+IQNIEP +AGNLTE+PTAK LWDAL TYSSGKDKLQ F+L+VK NELKQN Sbjct: 21 DLVVFSWLIQNIEPGLAGNLTEFPTAKALWDALVVTYSSGKDKLQTFDLHVKANELKQNG 80 Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491 LEDFWI LQG+WGEIDR DPNPMTC+ DI TY+ RKF P+KRE Sbjct: 81 SALEDFWIKLQGIWGEIDRRDPNPMTCSVDIATYNNI--------------RKFHPVKRE 126 Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGT-NET--QGIATGLNVG-FGDTEGVGLAIKGY 323 ILR D LPSAE AYA VR E A+ I GT +ET G+ GL G + +G GL KG Sbjct: 127 ILRRDPLPSAEQAYAAVRNEMAYQGICGTISETSQSGVTAGLIAGRTTEIDGHGLITKGQ 186 Query: 322 RRND-GKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHK 182 RR+ K + DKS L C C M HTK QCFRI G+P+ W+D HK Sbjct: 187 RRSGFTGKSSSRIDKSKLKCSHCGMNKHTKDQCFRIAGFPDCWSDNHK 234 >gb|OTG25356.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus] Length = 252 Score = 228 bits (581), Expect = 7e-71 Identities = 110/152 (72%), Positives = 125/152 (82%), Gaps = 1/152 (0%) Frame = -3 Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671 DL+VFSW+IQNIE +A NLTE+PTAK LWDAL TYSSGKDKLQ F+L++K N +K+N Sbjct: 91 DLVVFSWLIQNIERSLASNLTEFPTAKTLWDALVVTYSSGKDKLQTFDLHLKSNSIKENG 150 Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQ-FLHGLDRKFDPIKR 494 PLEDFWI LQGVWGEI+R DPNPMTCA DI TY++ RSEQ LFQ FL+ LDR+FDPIKR Sbjct: 151 SPLEDFWIVLQGVWGEIERRDPNPMTCAVDIATYNKLRSEQKLFQFFLNALDRQFDPIKR 210 Query: 493 EILRLDTLPSAETAYATVRKEAAHHSILGTNE 398 EILR D LPS E AYA VRKE AH ILGTN+ Sbjct: 211 EILRWDPLPSVEGAYAAVRKEMAHQGILGTND 242 >ref|XP_022026592.1| uncharacterized protein LOC110927245 [Helianthus annuus] Length = 1084 Score = 230 bits (586), Expect = 4e-65 Identities = 119/265 (44%), Positives = 161/265 (60%), Gaps = 5/265 (1%) Frame = -3 Query: 850 DLIVFSWIIQNIEPIIAGNLTEYPTAKMLWDALTTTYSSGKDKLQIFNLYVKENELKQNE 671 D VF+WIIQN+E + N+++YPTAK LWD L TTY G D LQ+F+L+ + L+Q Sbjct: 91 
DQCVFTWIIQNLESNLVNNVSQYPTAKALWDGLATTYGFGTDSLQVFDLHKRAKSLRQGS 150 Query: 670 KPLEDFWIALQGVWGEIDRIDPNPMTCADDIKTYSRFRSEQNLFQFLHGLDRKFDPIKRE 491 LED W LQ +W IDR DPNPM +DI+ Y++ EQ L+Q L LD K +P+KR+ Sbjct: 151 DTLEDLWNKLQSIWMSIDRRDPNPMKDPEDIQMYNKKTQEQRLYQLLTALDDKMEPVKRD 210 Query: 490 ILRLDTLPSAETAYATVRKEAAHHSILGTNETQGIATGLNVGFGDTEGVGLAIKGYRRND 311 IL+ D LP+ E AYAT+R+E A +IL + + +T + G+GLA K + + Sbjct: 211 ILKKDPLPTVEMAYATIRREDARMNILRSGPSDNESTEI--------GMGLAAKDWSQRT 262 Query: 310 GKKPFVKEDKSHLTCDECKMTGHTKAQCFRIVGYPEWWTDGHKK---GTKKKVFPTSQAT 140 +P KEDKS L C C+M HTK QCF+IVGYPEWW DG K+ G K P + Sbjct: 263 KFRPRDKEDKSKLFCTYCQMKRHTKDQCFKIVGYPEWWGDGQKQKNSGADGKGTPAAGGG 322 Query: 139 TS--KEGTEKGFGGLAAAGNSKGEG 71 + ++G+ GFGGLAA + G Sbjct: 323 VAPVEKGSSGGFGGLAATTDDTSIG 347
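
Note on reading the per-hit statistics programmatically: this report writes some E-values without a leading digit (for example "Expect = e-112"), which most numeric parsers reject. The following is a minimal Python sketch, not part of the BLAST output, that extracts the bit score, raw score, E-value and identity fraction from a "Score = ... Expect = ... Identities = ..." line as it appears above. The names HSP_STATS, parse_evalue and parse_hsp_stats are hypothetical helpers introduced only for this illustration, and the pattern assumes the legacy text layout shown in this record.

```python
import re

# Hypothetical pattern for the per-hit statistics line of a legacy BLAST text
# report, e.g. "Score = 363 bits (933), Expect = e-112 Identities = 185/292 (63%)".
HSP_STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)"
)


def parse_evalue(text):
    """Convert an E-value string to float, accepting the bare 'e-112' form."""
    text = text.rstrip(",")
    if text.startswith("e"):
        # Legacy reports drop the leading "1" for very small values.
        text = "1" + text
    return float(text)


def parse_hsp_stats(text):
    """Return a dict of alignment statistics, or None if no stats line is found."""
    match = HSP_STATS.search(text)
    if match is None:
        return None
    ident = int(match["ident"])
    length = int(match["length"])
    return {
        "bits": float(match["bits"]),
        "raw_score": int(match["raw"]),
        "evalue": parse_evalue(match["evalue"]),
        "identities": ident,
        "align_length": length,
        # Percent identity recomputed from the fraction, not the rounded value.
        "pct_identity": 100.0 * ident / length,
    }


if __name__ == "__main__":
    line = ("Score = 363 bits (933), Expect = e-112 "
            "Identities = 185/292 (63%), Positives = 216/292 (73%)")
    print(parse_hsp_stats(line))
```

For whole reports, an established parser is usually a safer choice than a hand-written pattern; this sketch only illustrates the E-value normalization and the identity calculation.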