BLASTX nr result
ID: Chrysanthemum22_contig00027820
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Chrysanthemum22_contig00027820 (1345 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|OTF86030.1| putative ribonuclease H-like domain-containing pr... 407 e-125 ref|XP_022032681.1| uncharacterized protein LOC110933783 [Helian... 380 e-123 ref|XP_021992301.1| uncharacterized protein LOC110889107 [Helian... 369 e-122 gb|OTG08964.1| putative gag-polypeptide of LTR copia-type [Helia... 370 e-119 gb|OTG35332.1| putative gag-polypeptide of LTR copia-type [Helia... 363 e-118 ref|XP_022011939.1| uncharacterized protein LOC110911621 [Helian... 359 e-118 gb|OTG29887.1| putative gag-polypeptide of LTR copia-type [Helia... 353 e-116 ref|XP_021974902.1| uncharacterized protein LOC110870015 [Helian... 353 e-116 ref|XP_021994452.1| uncharacterized protein LOC110891101 [Helian... 345 e-112 ref|XP_021996287.1| uncharacterized protein LOC110893489 [Helian... 350 e-112 ref|XP_019158475.1| PREDICTED: uncharacterized protein LOC109155... 360 e-111 ref|XP_022012651.1| uncharacterized protein LOC110912271 [Helian... 343 e-109 gb|OTG08979.1| putative ribonuclease H-like domain, GAG-pre-inte... 345 e-107 ref|XP_020537368.1| uncharacterized protein LOC110010150 [Jatrop... 329 e-106 ref|XP_022019013.1| uncharacterized protein LOC110919044 [Helian... 325 e-104 ref|XP_021974743.1| uncharacterized protein LOC110869838 [Helian... 294 9e-94 ref|XP_022000771.1| uncharacterized protein LOC110898294 [Helian... 291 2e-93 gb|OTG25356.1| putative gag-polypeptide of LTR copia-type [Helia... 285 5e-91 ref|XP_022019596.1| uncharacterized protein LOC110919639 [Helian... 295 3e-87 ref|XP_020535814.1| uncharacterized protein LOC110009728, partia... 268 5e-84 >gb|OTF86030.1| putative ribonuclease H-like domain-containing protein [Helianthus annuus] Length = 1532 Score = 407 bits (1045), Expect = e-125 Identities = 196/291 (67%), Positives = 228/291 (78%), Gaps = 5/291 (1%) Frame = -2 Query: 978 LWTRMIRVAIGGKSKALLSHLTKDPPEPTHESYETWEQEDLIVFSWLIQNIEPTIAGNLT 799 LWTRMIRVAIGGKSK LL HLT +PP+ E+YE WEQ+DL+VFSWLIQNIEP +AGNLT Sbjct: 48 LWTRMIRVAIGGKSKNLLKHLTSNPPKQDDETYEQWEQDDLVVFSWLIQNIEPVLAGNLT 107 Query: 798 EYPTAKTLWDALTTTYSSGKDKLQVFNLHVKANELKQEGKTLEEFWIALQGVWGEIERID 619 E+PTAK+LWDAL TYSSG+DKLQ FNLHVKANE+KQ K+LE+FWI LQGVWGEI+RID Sbjct: 108 EFPTAKSLWDALVVTYSSGRDKLQTFNLHVKANEIKQNDKSLEDFWIILQGVWGEIDRID 167 Query: 618 PNPMTCTEDIKTYSKVRSEQKLFQFLHGLNRQFDPIKREILRLETLPTAETAYGTVRKEA 439 PNPM C EDI+TY ++RSEQKLFQFL+ L+R++DPIKREILRL+ LP+AE AY TVRKEA Sbjct: 168 PNPMKCPEDIRTYLRIRSEQKLFQFLNALDRKYDPIKREILRLDPLPSAEAAYATVRKEA 227 Query: 438 AHQNILGATSSETQGVAAGLHVRSGEIEGAGLAVKGYR-----GNKPFNKEDKSHLKCDE 274 AHQNILG T +TQG+AAGL EG GL KG+R N NKEDK+HLKCD Sbjct: 228 AHQNILGTTVDDTQGIAAGLSATG--TEGLGLVTKGHRRFDGKKNGAPNKEDKTHLKCDH 285 Query: 273 CKMTGHTKAQCFRIVGYPEWWSDGHKKGARNPKSEKERSGAPTAQASTSRD 121 C MT HTKAQCF+IVGYP+WWSDGHKK + + A Q +R+ Sbjct: 286 CGMTRHTKAQCFKIVGYPDWWSDGHKKSKTTGPEKGTAAAAIGDQEGAARE 336 >ref|XP_022032681.1| uncharacterized protein LOC110933783 [Helianthus annuus] Length = 592 Score = 380 bits (977), Expect = e-123 Identities = 187/276 (67%), Positives = 218/276 (78%), Gaps = 4/276 (1%) Frame = -2 Query: 966 MIRVAIGGKSKALLSHLTKDPPEPTHESYETWEQEDLIVFSWLIQNIEPTIAGNLTEYPT 787 MIRVAIGGKSKALL+HL +PPE E++E WEQ+DL+VFSWLIQNIEP +A NLTE+PT Sbjct: 1 MIRVAIGGKSKALLNHLNSNPPEKNSETFEQWEQDDLVVFSWLIQNIEPALASNLTEFPT 60 Query: 786 AKTLWDALTTTYSSGKDKLQVFNLHVKANELKQEGKTLEEFWIALQGVWGEIERIDPNPM 607 AK+LWDAL TYSSG+DKLQ FNLHVKAN++KQ +LEEFWI LQG+WGEI+R Sbjct: 61 AKSLWDALVVTYSSGRDKLQTFNLHVKANDIKQNDTSLEEFWITLQGIWGEIDR------ 114 Query: 606 TCTEDIKTYSKVRSEQKLFQFLHGLNRQFDPIKREILRLETLPTAETAYGTVRKEAAHQN 427 C EDI+TYSK+RSEQKLFQFL+ L+R++DPIKRE+LRL+ LP+AE AY VRKEAAHQN Sbjct: 115 KCPEDIQTYSKIRSEQKLFQFLNALDRKYDPIKRELLRLDPLPSAEAAYAAVRKEAAHQN 174 Query: 426 ILGATSSETQGVAAGLHVRSGEIEGAGLAVKGYR----GNKPFNKEDKSHLKCDECKMTG 259 ILGAT SETQG+ AGL + E EG GL KG R N P KEDKSHLKCD C MT Sbjct: 175 ILGATLSETQGIGAGL--VATEKEGLGLISKGRRFDGKKNGPPVKEDKSHLKCDHCGMTK 232 Query: 258 HTKAQCFRIVGYPEWWSDGHKKGARNPKSEKERSGA 151 HTK CFR+VGYPEWWSDGHKKG + +EK ++ A Sbjct: 233 HTKEHCFRLVGYPEWWSDGHKKGTKTAGAEKGKASA 268 >ref|XP_021992301.1| uncharacterized protein LOC110889107 [Helianthus annuus] gb|OTG06567.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus] Length = 380 Score = 369 bits (947), Expect = e-122 Identities = 184/273 (67%), Positives = 208/273 (76%), Gaps = 10/273 (3%) Frame = -2 Query: 978 LWTRMIRVAIGGKSKALLSHLTKDP--PEPTHESYETWEQEDLIVFSWLIQNIEPTIAGN 805 LW+RMIRVAIGGKSK LLSHLT DP PEP YE WEQ+DL+VFSWLIQNIEP +A N Sbjct: 49 LWSRMIRVAIGGKSKHLLSHLTGDPKPPEPNDTQYEQWEQDDLVVFSWLIQNIEPALASN 108 Query: 804 LTEYPTAKTLWDALTTTYSSGKDKLQVFNLHVKANELKQEGKTLEEFWIALQGVWGEIER 625 LTE+PTAKTLWDAL TTYSSGKDKLQ F+LHVK+N +KQ G LE+FWI +QGVWGEIER Sbjct: 109 LTEFPTAKTLWDALVTTYSSGKDKLQTFDLHVKSNGIKQNGSPLEDFWIIMQGVWGEIER 168 Query: 624 IDPNPMTCTEDIKTYSKVRSEQKLFQFLHGLNRQFDPIKREILRLETLPTAETAYGTVRK 445 DPNPMTC DI TY+K+RSEQKLFQFL+ L+RQ+DPIKREILR + LP+AE AY VRK Sbjct: 169 RDPNPMTCAADIATYNKLRSEQKLFQFLNALDRQYDPIKREILRWDPLPSAEGAYAAVRK 228 Query: 444 EAAHQNILGAT--SSETQGVAAGLHV-RSGEIEGAGLAVKGYRGNK-----PFNKEDKSH 289 AHQ ILG T SS GVAAGL+ RS E E G KG K ++ DK+ Sbjct: 229 VMAHQGILGTTDNSSSPSGVAAGLNTNRSSEPESLGFLTKGRTNQKNSTLGSSSRIDKTK 288 Query: 288 LKCDECKMTGHTKAQCFRIVGYPEWWSDGHKKG 190 LKCD C HTK+QCF +VGYPEWW+DGHKKG Sbjct: 289 LKCDHCGKNKHTKSQCFELVGYPEWWNDGHKKG 321 >gb|OTG08964.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus] Length = 571 Score = 370 bits (949), Expect = e-119 Identities = 184/273 (67%), Positives = 208/273 (76%), Gaps = 10/273 (3%) Frame = -2 Query: 978 LWTRMIRVAIGGKSKALLSHLTKDP--PEPTHESYETWEQEDLIVFSWLIQNIEPTIAGN 805 LW+RMIRVAIGGKSK LLSHLT DP PEP YE WEQ+DL+VFSWLIQNIEP +A N Sbjct: 49 LWSRMIRVAIGGKSKHLLSHLTGDPKPPEPNDTQYEQWEQDDLVVFSWLIQNIEPALASN 108 Query: 804 LTEYPTAKTLWDALTTTYSSGKDKLQVFNLHVKANELKQEGKTLEEFWIALQGVWGEIER 625 LTE+PTAKTLWDAL TTYSSGKDKLQ F+LHVK+N +KQ G LE+FWI +QGVWGEIER Sbjct: 109 LTEFPTAKTLWDALVTTYSSGKDKLQTFDLHVKSNGIKQNGSPLEDFWIIMQGVWGEIER 168 Query: 624 IDPNPMTCTEDIKTYSKVRSEQKLFQFLHGLNRQFDPIKREILRLETLPTAETAYGTVRK 445 DPNPMTC DI TY+K+RSEQKLFQFL+ L+RQ+DPIKREILR + LP+AE AY VRK Sbjct: 169 RDPNPMTCAADIATYNKLRSEQKLFQFLNALDRQYDPIKREILRWDPLPSAEGAYAAVRK 228 Query: 444 EAAHQNILGA--TSSETQGVAAGLHV-RSGEIEGAGLAVKGYRGNK-----PFNKEDKSH 289 E AHQ ILG SS GVAAGL+ RS E E G KG K ++ DK+ Sbjct: 229 EMAHQGILGTNDNSSSPSGVAAGLNTNRSSEPESLGFLTKGRTNQKNSTLGSSSRIDKTK 288 Query: 288 LKCDECKMTGHTKAQCFRIVGYPEWWSDGHKKG 190 LKCD C HTK+QCF +VGYPEWW+DGHKKG Sbjct: 289 LKCDHCGKNKHTKSQCFELVGYPEWWNDGHKKG 321 >gb|OTG35332.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus] Length = 465 Score = 363 bits (933), Expect = e-118 Identities = 180/279 (64%), Positives = 208/279 (74%), Gaps = 10/279 (3%) Frame = -2 Query: 978 LWTRMIRVAIGGKSKALLSHLTKDP--PEPTHESYETWEQEDLIVFSWLIQNIEPTIAGN 805 LW RMIRVAIGGKSKALL+HL+ DP PE T YE WEQ DLIVFSWLIQNIEP +A N Sbjct: 129 LWARMIRVAIGGKSKALLNHLSGDPKPPESTAAEYEQWEQNDLIVFSWLIQNIEPALASN 188 Query: 804 LTEYPTAKTLWDALTTTYSSGKDKLQVFNLHVKANELKQEGKTLEEFWIALQGVWGEIER 625 LTE+PTAKTLWDAL TYSSGKDKLQ F+LHVK+N +KQ G +LE+FWI +QG+WGE ER Sbjct: 189 LTEFPTAKTLWDALVVTYSSGKDKLQTFDLHVKSNGIKQNGSSLEDFWINMQGIWGETER 248 Query: 624 IDPNPMTCTEDIKTYSKVRSEQKLFQFLHGLNRQFDPIKREILRLETLPTAETAYGTVRK 445 DPNPMTC DI TY+K+RSEQKLF+FL+ L+RQ+D IK EILR + LP+AE AY VRK Sbjct: 249 RDPNPMTCITDIATYNKIRSEQKLFKFLNALDRQYDTIKMEILRWDPLPSAEGAYAAVRK 308 Query: 444 EAAHQNILGAT---SSETQGVAAGLHVRSGEIEGAGLAVKGYRGNKPFN-----KEDKSH 289 E AHQ ILG T SS GVAAGL + G G KG G + FN + DK+ Sbjct: 309 EMAHQGILGTTADNSSLNNGVAAGLAANGSKEVGLGFLSKGRTGQRNFNSGSSPRIDKTK 368 Query: 288 LKCDECKMTGHTKAQCFRIVGYPEWWSDGHKKGARNPKS 172 LKCD C M HTK QCFR+VGYPEWW+DGHK+G + K+ Sbjct: 369 LKCDHCGMMKHTKDQCFRLVGYPEWWNDGHKRGNKEGKA 407 >ref|XP_022011939.1| uncharacterized protein LOC110911621 [Helianthus annuus] gb|OTF95102.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus] Length = 394 Score = 359 bits (921), Expect = e-118 Identities = 186/297 (62%), Positives = 215/297 (72%), Gaps = 13/297 (4%) Frame = -2 Query: 978 LWTRMIRVAIGGKSKALLSHLTKDP--PEPTHESYETWEQEDLIVFSWLIQNIEPTIAGN 805 LW RMIRVAIGGKSKALLSHLT P P+P ES++ WEQ+DL+V SWLIQNIEP +A N Sbjct: 52 LWARMIRVAIGGKSKALLSHLTGKPAPPKPNDESFDQWEQDDLVVISWLIQNIEPALASN 111 Query: 804 LTEYPTAKTLWDALTTTYSSGKDKLQVFNLHVKANELKQEGKTLEEFWIALQGVWGEIER 625 LTE+PTAKTLWDAL TYSSGKDKLQ F+LHVKAN +KQ G LE+FWI +QGVWGEIER Sbjct: 112 LTEFPTAKTLWDALVVTYSSGKDKLQTFDLHVKANGIKQNGSPLEDFWIIMQGVWGEIER 171 Query: 624 IDPNPMTCTEDIKTYSKVRSEQKLFQFLHGLNRQFDPIKREILRLETLPTAETAYGTVRK 445 DPNPMTC EDI TY+K+RSEQKLFQFL+ L+RQ+D +KREILR + LP+AE AY VRK Sbjct: 172 RDPNPMTCPEDITTYNKIRSEQKLFQFLNALDRQYDTVKREILRWDPLPSAEGAYAAVRK 231 Query: 444 EAAHQNILGA---TSSETQGVAAGLHVR-SGEIEGAGLAVKGYRGNKPFN-----KEDKS 292 E AHQ ILG TS GVAAGL+ S E EG G +G K N + DKS Sbjct: 232 EMAHQGILGITIDTSYNPNGVAAGLNANGSRESEGLGFLSRGRVDQKSSNTGSSFRIDKS 291 Query: 291 HLKCDECKMTGHTKAQCFRIVGYPEWWSDGHK--KGARNPKSEKERSGAPTAQASTS 127 LKC C M+ HTK QCF++VGYPEWW+D HK KG + + S A Q +TS Sbjct: 292 KLKCGHCGMSKHTKDQCFQLVGYPEWWNDNHKTQKGGKISTAAGRSSAAIGNQKATS 348 >gb|OTG29887.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus] Length = 325 Score = 353 bits (907), Expect = e-116 Identities = 180/275 (65%), Positives = 206/275 (74%), Gaps = 8/275 (2%) Frame = -2 Query: 966 MIRVAIGGKSKALLSHLTKDP--PEPTHESYETWEQEDLIVFSWLIQNIEPTIAGNLTEY 793 MIRVAIGGKSK LLSHL+ +P P+PT E YE WEQ+DLIVFSWLIQNIEP +A NLTE+ Sbjct: 1 MIRVAIGGKSKPLLSHLSGNPAPPDPTDERYEQWEQDDLIVFSWLIQNIEPALASNLTEF 60 Query: 792 PTAKTLWDALTTTYSSGKDKLQVFNLHVKANELKQEGKTLEEFWIALQGVWGEIERIDPN 613 PTAK+LWDAL TYSSGKDKLQ F+LHVKAN +KQ G LE+FWI +QG+WGEI+R DPN Sbjct: 61 PTAKSLWDALVVTYSSGKDKLQTFDLHVKANGIKQNGSPLEDFWIIMQGIWGEIDRRDPN 120 Query: 612 PMTCTEDIKTYSKVRSEQKLFQFLHGLNRQFDPIKREILRLETLPTAETAYGTVRKEAAH 433 PMTCT DI TY+K+RSEQKLFQFL+ L+RQ+D IKREILR + LP+AE AY VRKE AH Sbjct: 121 PMTCTVDIATYNKIRSEQKLFQFLNALDRQYDTIKREILRWDPLPSAEGAYAAVRKEMAH 180 Query: 432 QNILGATSSETQGVAAGLHVR-SGEIEGAGLAVKGYRGNKPFN-----KEDKSHLKCDEC 271 Q ILG +S VAAGL S E E G KG G K N + DKS LKC C Sbjct: 181 QGILGTATSSHNNVAAGLVANGSHETESLGFLSKGRSGQKNPNSGSSSQIDKSKLKCLHC 240 Query: 270 KMTGHTKAQCFRIVGYPEWWSDGHKKGARNPKSEK 166 M HTK QCF++VGYPEWW+DGHKK RN + K Sbjct: 241 GMLKHTKDQCFKLVGYPEWWNDGHKK--RNKEGGK 273 >ref|XP_021974902.1| uncharacterized protein LOC110870015 [Helianthus annuus] Length = 325 Score = 353 bits (905), Expect = e-116 Identities = 182/294 (61%), Positives = 215/294 (73%), Gaps = 8/294 (2%) Frame = -2 Query: 966 MIRVAIGGKSKALLSHLTKDPP--EPTHESYETWEQEDLIVFSWLIQNIEPTIAGNLTEY 793 MI+VA+GGKSK LLSH+T P P+ E YE WEQ+DL+VFSWLIQNIEP +AGNLTE+ Sbjct: 1 MIQVALGGKSKNLLSHITGKPAPLNPSDEQYEQWEQDDLVVFSWLIQNIEPGLAGNLTEF 60 Query: 792 PTAKTLWDALTTTYSSGKDKLQVFNLHVKANELKQEGKTLEEFWIALQGVWGEIERIDPN 613 PTAK LWDAL TYSSGKDKLQ F+LHVKANELKQ G LE+FWI LQG+WGEI+R D N Sbjct: 61 PTAKALWDALVVTYSSGKDKLQTFDLHVKANELKQNGSALEDFWIKLQGIWGEIDRRDLN 120 Query: 612 PMTCTEDIKTYSKVRSEQKLFQFLHGLNRQFDPIKREILRLETLPTAETAYGTVRKEAAH 433 PMTC+ DI TY +RSEQKLFQFL+ L+R+FDP+KREILR + LP+AE AY VRKE AH Sbjct: 121 PMTCSADIATYKNIRSEQKLFQFLNALDRKFDPVKREILRWDPLPSAEQAYAAVRKEMAH 180 Query: 432 QNILGATSSETQ-GVAAGLHV-RSGEIEGAGLAVKGYRGN----KPFNKEDKSHLKCDEC 271 Q ILG S +Q GVA+GL V + EI+G GL KG R + K ++ DKS LKC C Sbjct: 181 QGILGTISETSQSGVASGLIVGGTNEIDGQGLITKGQRRSDFTGKSSSRIDKSKLKCSHC 240 Query: 270 KMTGHTKAQCFRIVGYPEWWSDGHKKGARNPKSEKERSGAPTAQASTSRDNIEK 109 M HTK QCFRIVG+P+WWSD HK +NP E + A +T D+ EK Sbjct: 241 GMNKHTKDQCFRIVGFPDWWSDNHK--TKNPNQEVKVVIAIGNNKATINDSDEK 292 >ref|XP_021994452.1| uncharacterized protein LOC110891101 [Helianthus annuus] Length = 386 Score = 345 bits (886), Expect = e-112 Identities = 182/315 (57%), Positives = 225/315 (71%), Gaps = 9/315 (2%) Frame = -2 Query: 1044 SDNKTTATMSGNSDLALQLANP---LWTRMIRVAIGGKSKALLSHLTKDPPEPTHESYET 874 ++ T ++S + + +QL + LW+ MIR AIGGKSK LL HL PPE T Y++ Sbjct: 24 TNTSTKHSVSDSLRINIQLNSQNFGLWSHMIRAAIGGKSKNLLYHLDSKPPESTDARYDS 83 Query: 873 WEQEDLIVFSWLIQNIEPTIAGNLTEYPTAKTLWDALTTTYSSGKDKLQVFNLHVKANEL 694 WEQ+DLIVFSWLIQNIEP IA NLTE+PT+KTLW+AL TTYSSGKDKLQ+F+LHVKAN L Sbjct: 84 WEQDDLIVFSWLIQNIEPAIASNLTEFPTSKTLWEALQTTYSSGKDKLQIFDLHVKANSL 143 Query: 693 KQEGKTLEEFWIALQGVWGEIERIDPNPMTCTEDIKTYSKVRSEQKLFQFLHGLNRQFDP 514 KQ+ +E+ WI LQG+WGEI+R +PNPMTCT DI TY+++RSEQKLFQFL+ L+ +FD Sbjct: 144 KQKEVPVEDLWINLQGIWGEIDRREPNPMTCTTDINTYNRLRSEQKLFQFLNALDHRFDT 203 Query: 513 IKREILRLETLPTAETAYGTVRKEAAHQNILGATSSETQ--GVAAGLHVRS-GEIEGAGL 343 +KREILR E LPTAE AY T+RKE HQ ILGA +SETQ G+A+GL + + +G GL Sbjct: 204 VKREILRGEPLPTAEPAYATIRKETTHQIILGAGTSETQIHGIASGLATTNLQQTDGLGL 263 Query: 342 AVKG-YRGNKPF-NKED-KSHLKCDECKMTGHTKAQCFRIVGYPEWWSDGHKKGARNPKS 172 KG R K NK D K+ LKCD C HTK QCF +VGYP+WW G K RN K Sbjct: 264 ISKGNCRSEKTTGNKNDPKAKLKCDHCGKPRHTKDQCFHLVGYPDWWEIGPK---RNNKD 320 Query: 171 EKERSGAPTAQASTS 127 E +R + T Q S + Sbjct: 321 EGKRDTSTTGQGSNA 335 >ref|XP_021996287.1| uncharacterized protein LOC110893489 [Helianthus annuus] Length = 548 Score = 350 bits (897), Expect = e-112 Identities = 175/295 (59%), Positives = 212/295 (71%), Gaps = 4/295 (1%) Frame = -2 Query: 1062 PSKNPPSDNKTTATMSGNSDLALQLANPLWTRMIRVAIGGKSKALLSHLTKDPPEPTHES 883 P NP K + ++ + PLW RMIRVAIGGK KALL+HLT++PPEP +E Sbjct: 29 PKPNPSDSLKISLNLTSQN-------YPLWARMIRVAIGGKLKALLNHLTQNPPEPINEQ 81 Query: 882 YETWEQEDLIVFSWLIQNIEPTIAGNLTEYPTAKTLWDALTTTYSSGKDKLQVFNLHVKA 703 WEQ+DLIVFSWLIQNIEP++ NLTE+PTA TLWDAL+ TYSSGKDKLQ F+LHVKA Sbjct: 82 ---WEQDDLIVFSWLIQNIEPSLTSNLTEFPTANTLWDALSVTYSSGKDKLQTFDLHVKA 138 Query: 702 NELKQEGKTLEEFWIALQGVWGEIERIDPNPMTCTEDIKTYSKVRSEQKLFQFLHGLNRQ 523 NE KQ G LEEFWI +QG+ GEI+R DPNPM C++DI TY+KVRSE KLFQ L+ L+R+ Sbjct: 139 NEFKQNGLPLEEFWIVMQGIRGEIKRRDPNPMKCSDDIATYNKVRSENKLFQLLNALDRK 198 Query: 522 FDPIKREILRLETLPTAETAYGTVRKEAAHQNILGATSSETQGVAAGLHVRSGEIEGAGL 343 +D +KREILR + LPT E AY VRKE AHQ+I G T QGV +GL+ G +G GL Sbjct: 199 YDSLKREILRWDPLPTTEAAYAAVRKETAHQSIFGNTQ---QGVGSGLN-SLGSSDGLGL 254 Query: 342 AVKG----YRGNKPFNKEDKSHLKCDECKMTGHTKAQCFRIVGYPEWWSDGHKKG 190 + + N ++ DKS LKCD C M HTK QCF+IVGYP+WW+DGHKKG Sbjct: 255 VSRSRWSDQKSNPSSSRIDKSKLKCDHCGMAKHTKEQCFKIVGYPDWWADGHKKG 309 >ref|XP_019158475.1| PREDICTED: uncharacterized protein LOC109155214 [Ipomoea nil] Length = 975 Score = 360 bits (923), Expect = e-111 Identities = 181/297 (60%), Positives = 214/297 (72%), Gaps = 13/297 (4%) Frame = -2 Query: 978 LWTRMIRVAIGGKSKALLSHLTKDP--PEPTHESYETWEQEDLIVFSWLIQNIEPTIAGN 805 LW RMIRVAIGGKSK LLSHL+ +P P+P E Y WEQ+DL+VFSWLIQNI+P +A N Sbjct: 58 LWARMIRVAIGGKSKTLLSHLSGNPAPPDPKDEKYVQWEQDDLVVFSWLIQNIKPALASN 117 Query: 804 LTEYPTAKTLWDALTTTYSSGKDKLQVFNLHVKANELKQEGKTLEEFWIALQGVWGEIER 625 LTE+PTAK+LWDAL TYSSGKDKLQ F+LHVK NE+KQ G LE+F I +QG+WGEIER Sbjct: 118 LTEFPTAKSLWDALVVTYSSGKDKLQTFDLHVKTNEIKQNGAPLEDFGILMQGIWGEIER 177 Query: 624 IDPNPMTCTEDIKTYSKVRSEQKLFQFLHGLNRQFDPIKREILRLETLPTAETAYGTVRK 445 DPNPMTC DI TY+K+R+EQKLFQFL+ ++RQ+DPIKREILR + L +AE AY VR Sbjct: 178 RDPNPMTCAADIATYNKLRAEQKLFQFLNAIDRQYDPIKREILRWDPLTSAEGAYAAVRN 237 Query: 444 EAAHQNILGATS---SETQGVAAGLHVRS-GEIEGAGLAVKGY-------RGNKPFNKED 298 E AHQNILGA S S QGVAAGL V E EG GL KG R N ++ D Sbjct: 238 ETAHQNILGAVSAITSSQQGVAAGLTVTGPSEAEGLGLISKGQRRSDQTGRTNGSSSRPD 297 Query: 297 KSHLKCDECKMTGHTKAQCFRIVGYPEWWSDGHKKGARNPKSEKERSGAPTAQASTS 127 KS L C C M+ HTK QCF+IVGYPEWW+DGHK+ + +S R+ A T+ Sbjct: 298 KSQLNCSHCGMSKHTKEQCFKIVGYPEWWNDGHKQSGKTTRSNGGRAAAAVRNNDTT 354 >ref|XP_022012651.1| uncharacterized protein LOC110912271 [Helianthus annuus] Length = 565 Score = 343 bits (881), Expect = e-109 Identities = 172/277 (62%), Positives = 208/277 (75%), Gaps = 4/277 (1%) Frame = -2 Query: 945 GKSKALLSHLTKDPPEPTHESYETWEQEDLIVFSWLIQNIEPTIAGNLTEYPTAKTLWDA 766 G+SKA+L+HLT++PPEP E WEQ+DLIVFSWLIQNIEP++A NLTE+PTAKTLWDA Sbjct: 11 GESKAVLNHLTQNPPEPIDEQ---WEQDDLIVFSWLIQNIEPSLASNLTEFPTAKTLWDA 67 Query: 765 LTTTYSSGKDKLQVFNLHVKANELKQEGKTLEEFWIALQGVWGEIERIDPNPMTCTEDIK 586 LT TYSSGKDKLQ F+LHVK NE KQ G LEEFWI +QG+WGEIER DPNPM C DI Sbjct: 68 LTVTYSSGKDKLQTFDLHVKVNEFKQSGLPLEEFWIVMQGIWGEIERRDPNPMKCPTDIA 127 Query: 585 TYSKVRSEQKLFQFLHGLNRQFDPIKREILRLETLPTAETAYGTVRKEAAHQNILGATSS 406 TY+KVRSE KLFQFL+ L+R++D +KREILR + LP+AE AY VRKE AHQ+I G + Sbjct: 128 TYNKVRSEYKLFQFLNALDRKYDSLKREILRWDPLPSAEAAYAVVRKETAHQSIFG---N 184 Query: 405 ETQGVAAGLHVRSGEIEGAGLAVKGYRGNKPFNKE----DKSHLKCDECKMTGHTKAQCF 238 QGVA+GL+ +GE +G GL +G R ++ N+ DKS LKCD C M HTK QCF Sbjct: 185 VHQGVASGLN-STGESDGLGLVTRGRRSDQKSNQSSSRIDKSKLKCDHCGMAKHTKEQCF 243 Query: 237 RIVGYPEWWSDGHKKGARNPKSEKERSGAPTAQASTS 127 ++VGYPEWW+DGHKKG + S A +Q +TS Sbjct: 244 KLVGYPEWWADGHKKG--------KASAAVGSQGTTS 272 >gb|OTG08979.1| putative ribonuclease H-like domain, GAG-pre-integrase domain, Gag-polypeptide of LTR copia-type [Helianthus annuus] Length = 851 Score = 345 bits (886), Expect = e-107 Identities = 182/315 (57%), Positives = 225/315 (71%), Gaps = 9/315 (2%) Frame = -2 Query: 1044 SDNKTTATMSGNSDLALQLANP---LWTRMIRVAIGGKSKALLSHLTKDPPEPTHESYET 874 ++ T ++S + + +QL + LW+ MIR AIGGKSK LL HL PPE T Y++ Sbjct: 24 TNTSTKHSVSDSLRINIQLNSQNFGLWSHMIRAAIGGKSKNLLYHLDSKPPESTDARYDS 83 Query: 873 WEQEDLIVFSWLIQNIEPTIAGNLTEYPTAKTLWDALTTTYSSGKDKLQVFNLHVKANEL 694 WEQ+DLIVFSWLIQNIEP IA NLTE+PT+KTLW+AL TTYSSGKDKLQ+F+LHVKAN L Sbjct: 84 WEQDDLIVFSWLIQNIEPAIASNLTEFPTSKTLWEALQTTYSSGKDKLQIFDLHVKANSL 143 Query: 693 KQEGKTLEEFWIALQGVWGEIERIDPNPMTCTEDIKTYSKVRSEQKLFQFLHGLNRQFDP 514 KQ+ +E+ WI LQG+WGEI+R +PNPMTCT DI TY+++RSEQKLFQFL+ L+ +FD Sbjct: 144 KQKEVPVEDLWINLQGIWGEIDRREPNPMTCTTDINTYNRLRSEQKLFQFLNALDHRFDT 203 Query: 513 IKREILRLETLPTAETAYGTVRKEAAHQNILGATSSETQ--GVAAGLHVRS-GEIEGAGL 343 +KREILR E LPTAE AY T+RKE HQ ILGA +SETQ G+A+GL + + +G GL Sbjct: 204 VKREILRGEPLPTAEPAYATIRKETTHQIILGAGTSETQIHGIASGLATTNLQQTDGLGL 263 Query: 342 AVKG-YRGNKPF-NKED-KSHLKCDECKMTGHTKAQCFRIVGYPEWWSDGHKKGARNPKS 172 KG R K NK D K+ LKCD C HTK QCF +VGYP+WW G K RN K Sbjct: 264 ISKGNCRSEKTTGNKNDPKAKLKCDHCGKPRHTKDQCFHLVGYPDWWEIGPK---RNNKD 320 Query: 171 EKERSGAPTAQASTS 127 E +R + T Q S + Sbjct: 321 EGKRDTSTTGQGSNA 335 >ref|XP_020537368.1| uncharacterized protein LOC110010150 [Jatropha curcas] Length = 384 Score = 329 bits (843), Expect = e-106 Identities = 163/315 (51%), Positives = 220/315 (69%), Gaps = 16/315 (5%) Frame = -2 Query: 1023 TMSGNSDLALQLANP---LWTRMIRVAIGGKSKALLSHLTK--DPPEPTHESYETWEQED 859 T+S N + ++L + +W+RM++VAIGGKSK LL+H+T+ PP +E WEQED Sbjct: 31 TISDNLTINVKLNSQNYAIWSRMMKVAIGGKSKKLLNHITEAATPPSAGDPHFEKWEQED 90 Query: 858 LIVFSWLIQNIEPTIAGNLTEYPTAKTLWDALTTTYSSGKDKLQVFNLHVKANELKQEGK 679 LIVFSWLIQN+EP +A NLTEY TAK LW+AL TYSSGKDKLQ+F+LH +AN +KQ Sbjct: 91 LIVFSWLIQNMEPQLANNLTEYSTAKDLWNALVITYSSGKDKLQIFDLHTRANSMKQGSS 150 Query: 678 TLEEFWIALQGVWGEIERIDPNPMTCTEDIKTYSKVRSEQKLFQFLHGLNRQFDPIKREI 499 TLEEFW+ +QG+WGE++R +PNPMTC+ DI TY+KV+ EQKLFQFL+G++ +D IKREI Sbjct: 151 TLEEFWLTMQGIWGEMDRREPNPMTCSTDIATYNKVKQEQKLFQFLNGIDHLYDQIKREI 210 Query: 498 LRLETLPTAETAYGTVRKEAAHQNILGATSSE--TQGVAAGLHV----RSGEIEGAGLAV 337 LR E LP+AE AY +VRKEAA NI+G + E +QG+ G + + E G GL Sbjct: 211 LRSEHLPSAEAAYASVRKEAARLNIMGPANRESLSQGIGDGFVIIGKKEASEATGVGLVA 270 Query: 336 KGYRGNKPFN-----KEDKSHLKCDECKMTGHTKAQCFRIVGYPEWWSDGHKKGARNPKS 172 +G R ++P N + DKS LKC C M+ HTK QCF ++GYPEWW++ H+ + K+ Sbjct: 271 RGQRRSEPRNDGSSSRPDKSRLKCSYCGMSKHTKDQCFELIGYPEWWNENHRN--KGTKT 328 Query: 171 EKERSGAPTAQASTS 127 K + +A++S Sbjct: 329 SKAAAAVGNLEAASS 343 >ref|XP_022019013.1| uncharacterized protein LOC110919044 [Helianthus annuus] Length = 379 Score = 325 bits (833), Expect = e-104 Identities = 169/282 (59%), Positives = 198/282 (70%), Gaps = 7/282 (2%) Frame = -2 Query: 933 ALLSHLTKDPPEPTHESYETWEQEDLIVFSWLIQNIEPTIAGNLTEYPTAKTLWDALTTT 754 +L L PP P+ E YE WEQ+DL VFSWLIQNIEP +AGNLTE+PTAK LWDAL T Sbjct: 71 SLFHFLKPAPPNPSDEQYEQWEQDDLFVFSWLIQNIEPGLAGNLTEFPTAKALWDALVVT 130 Query: 753 YSSGKDKLQVFNLHVKANELKQEGKTLEEFWIALQGVWGEIERIDPNPMTCTEDIKTYSK 574 YSSGKDKLQ F+LHVKANELKQ G LE+FWI LQG+WGEI+R+DPNPMTC+ D+ TY+ Sbjct: 131 YSSGKDKLQTFDLHVKANELKQNGSALEDFWIKLQGIWGEIDRMDPNPMTCSADVATYNN 190 Query: 573 VRSEQKLFQFLHGLNRQFDPIKREILRLETLPTAETAYGTVRKEAAHQNILGATSSETQ- 397 +RSEQKLFQFL+ L+R+FD +KREIL + LP+AE AY TVRKE AHQ ILG S +Q Sbjct: 191 IRSEQKLFQFLNALDRKFDLVKREILWWDPLPSAEQAYATVRKEMAHQGILGTISETSQS 250 Query: 396 GVAAGLHVRSG--EIEGAGLAVKGYRGN----KPFNKEDKSHLKCDECKMTGHTKAQCFR 235 GVAAGL V G E +G GL KG R + K ++ DKS LKC C HTK QCFR Sbjct: 251 GVAAGL-VAGGTTETDGQGLITKGQRRSNFTGKSSSRIDKSKLKCSHCGKNKHTKDQCFR 309 Query: 234 IVGYPEWWSDGHKKGARNPKSEKERSGAPTAQASTSRDNIEK 109 IVG+ WWSD HK NP E + A +T D+ EK Sbjct: 310 IVGFLNWWSDNHK--TENPNQEGKVVIAIGNNKATINDSDEK 349 >ref|XP_021974743.1| uncharacterized protein LOC110869838 [Helianthus annuus] Length = 291 Score = 294 bits (752), Expect = 9e-94 Identities = 154/272 (56%), Positives = 182/272 (66%), Gaps = 6/272 (2%) Frame = -2 Query: 906 PPEPTHESYETWEQEDLIVFSWLIQNIEPTIAGNLTEYPTAKTLWDALTTTYSSGKDKLQ 727 PP P+ E YE WEQ+DL+VFSWLIQNIEP +AGNLTE+PTAK LWDAL TYSSGKDKLQ Sbjct: 6 PPNPSDEQYEQWEQDDLVVFSWLIQNIEPGLAGNLTEFPTAKALWDALVVTYSSGKDKLQ 65 Query: 726 VFNLHVKANELKQEGKTLEEFWIALQGVWGEIERIDPNPMTCTEDIKTYSKVRSEQKLFQ 547 F+LHVKANELKQ G LE+FWI LQG+WGEI+R DPNPMTC+ DI TY+ + Sbjct: 66 TFDLHVKANELKQNGSALEDFWIKLQGIWGEIDRRDPNPMTCSVDIATYNNI-------- 117 Query: 546 FLHGLNRQFDPIKREILRLETLPTAETAYGTVRKEAAHQNILGATSSETQ-GVAAGLHV- 373 R+F P+KREILR + LP+AE AY VR E A+Q I G S +Q GV AGL Sbjct: 118 ------RKFHPVKREILRRDPLPSAEQAYAAVRNEMAYQGICGTISETSQSGVTAGLIAG 171 Query: 372 RSGEIEGAGLAVKGYRGN----KPFNKEDKSHLKCDECKMTGHTKAQCFRIVGYPEWWSD 205 R+ EI+G GL KG R + K ++ DKS LKC C M HTK QCFRI G+P+ WSD Sbjct: 172 RTTEIDGHGLITKGQRRSGFTGKSSSRIDKSKLKCSHCGMNKHTKDQCFRIAGFPDCWSD 231 Query: 204 GHKKGARNPKSEKERSGAPTAQASTSRDNIEK 109 HK +NP E + A +T DN EK Sbjct: 232 NHK--TKNPNQEGKVVIAIGNNKATINDNDEK 261 >ref|XP_022000771.1| uncharacterized protein LOC110898294 [Helianthus annuus] Length = 247 Score = 291 bits (746), Expect = 2e-93 Identities = 146/240 (60%), Positives = 181/240 (75%), Gaps = 2/240 (0%) Frame = -2 Query: 945 GKSKALLSHLTKDPPEPTHESYETWEQEDLIVFSWLIQNIEPTIAGNLTEYPTAKTLWDA 766 GKSKALL+HL+++PPEP E WEQ+DLIVFSWLIQNIEP++A NLTE+PTAKTLWDA Sbjct: 11 GKSKALLNHLSQNPPEPIDEQ---WEQDDLIVFSWLIQNIEPSLASNLTEFPTAKTLWDA 67 Query: 765 LTTTYSSGKDKLQVFNLHVKANELKQEGKTLEEFWIALQGVWGEIERIDPNPMTCTEDIK 586 LT TYSSGKDKLQ F+LHVKANE KQ G LEEFWI +QG+WGEI+R DPNP+ C DI Sbjct: 68 LTITYSSGKDKLQTFDLHVKANEFKQNGVPLEEFWIIMQGIWGEIKRRDPNPIACPADIA 127 Query: 585 TYSKVRSEQKLFQFLHGLNRQFDPIKREILRLETLPTAETAYGTVRKEAAHQNILGATSS 406 TY+KVRSE KLFQFL+ L+R++D +KREIL+ + LP+ E AY VRKE HQ+I G + Sbjct: 128 TYNKVRSEYKLFQFLNALDRKYDSLKREILQWDPLPSVEVAYAVVRKETTHQSIFG---N 184 Query: 405 ETQGVAAGLHVRSGEIEGAGLAVKGYRGN-KPFNKE-DKSHLKCDECKMTGHTKAQCFRI 232 +GV +GL+ G+ +G GL + R + KP + DKS L+C+ C M HTK QCF + Sbjct: 185 PHKGVGSGLN-SHGDTDGLGLVSRSRRSDQKPSSSRIDKSKLRCEHCGMAKHTKDQCFSV 243 >gb|OTG25356.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus] Length = 252 Score = 285 bits (730), Expect = 5e-91 Identities = 142/202 (70%), Positives = 163/202 (80%), Gaps = 3/202 (1%) Frame = -2 Query: 978 LWTRMIRVAIGGKSKALLSHLTKDP--PEPTHESYETWEQEDLIVFSWLIQNIEPTIAGN 805 LW+RMIRVAIG KSK LLSHLT DP PEPT YE WEQ+DL+VFSWLIQNIE ++A N Sbjct: 50 LWSRMIRVAIGDKSKHLLSHLTGDPKPPEPTDNQYEQWEQDDLVVFSWLIQNIERSLASN 109 Query: 804 LTEYPTAKTLWDALTTTYSSGKDKLQVFNLHVKANELKQEGKTLEEFWIALQGVWGEIER 625 LTE+PTAKTLWDAL TYSSGKDKLQ F+LH+K+N +K+ G LE+FWI LQGVWGEIER Sbjct: 110 LTEFPTAKTLWDALVVTYSSGKDKLQTFDLHLKSNSIKENGSPLEDFWIVLQGVWGEIER 169 Query: 624 IDPNPMTCTEDIKTYSKVRSEQKLFQ-FLHGLNRQFDPIKREILRLETLPTAETAYGTVR 448 DPNPMTC DI TY+K+RSEQKLFQ FL+ L+RQFDPIKREILR + LP+ E AY VR Sbjct: 170 RDPNPMTCAVDIATYNKLRSEQKLFQFFLNALDRQFDPIKREILRWDPLPSVEGAYAAVR 229 Query: 447 KEAAHQNILGATSSETQGVAAG 382 KE AHQ ILG T+ T + +G Sbjct: 230 KEMAHQGILG-TNDNTSFMISG 250 >ref|XP_022019596.1| uncharacterized protein LOC110919639 [Helianthus annuus] Length = 918 Score = 295 bits (754), Expect = 3e-87 Identities = 151/226 (66%), Positives = 172/226 (76%), Gaps = 6/226 (2%) Frame = -2 Query: 975 WTRMIRVAIGGKSKALLSHLTKDP--PEPTHESYETWEQEDLIVFSWLIQNIEPTIAGNL 802 W RMIRVAIGGKSK+LL HL+ +P PEPT E YE WEQ+DL+VFSWLIQNIEP +A NL Sbjct: 52 WARMIRVAIGGKSKSLLGHLSGNPAPPEPTDEKYEQWEQDDLVVFSWLIQNIEPALASNL 111 Query: 801 TEYPTAKTLWDALTTTYSSGKDKLQVFNLHVKANELKQEGKTLEEFWIALQGVWGEIERI 622 TE+PTAKTLWDAL TYSSGKDKLQ F+ HVKAN ++Q G LE+FWI LQG+WGEIER Sbjct: 112 TEFPTAKTLWDALVVTYSSGKDKLQTFDPHVKANGIEQNGSPLEDFWIVLQGIWGEIERR 171 Query: 621 DPNPMTCTEDIKTYSKVRSEQKLFQFLHGLNRQFDPIKREILRLETLPTAETAYGTVRKE 442 DPNPMTCT DI TY+K++SEQKLFQFL+ L+RQ++ IKREILR + LP+AE AY V KE Sbjct: 172 DPNPMTCTVDITTYNKLQSEQKLFQFLNALDRQYNTIKREILRWDPLPSAEGAYEAVWKE 231 Query: 441 AAHQNILGAT---SSETQGVAAGLHV-RSGEIEGAGLAVKGYRGNK 316 AH ILG T S GVA GL RS E G GL KG G K Sbjct: 232 MAHWGILGTTIDNPSSQNGVAVGLVANRSNESGGLGLLSKGRTGQK 277 >ref|XP_020535814.1| uncharacterized protein LOC110009728, partial [Jatropha curcas] Length = 267 Score = 268 bits (685), Expect = 5e-84 Identities = 134/237 (56%), Positives = 174/237 (73%), Gaps = 11/237 (4%) Frame = -2 Query: 1023 TMSGNSDLALQLANP---LWTRMIRVAIGGKSKALLSHLTK--DPPEPTHESYETWEQED 859 T+S N + ++L + +W+RM++VAIGGKSK LL+H+T+ PP +E WEQED Sbjct: 31 TISDNLTINVKLNSQNYAIWSRMMKVAIGGKSKKLLNHITEAATPPSAGDPHFEKWEQED 90 Query: 858 LIVFSWLIQNIEPTIAGNLTEYPTAKTLWDALTTTYSSGKDKLQVFNLHVKANELKQEGK 679 LIVFSWLIQN+EP +A NLTEY TAK LW+AL TYSSGKDKLQ+F+LH +AN +KQ Sbjct: 91 LIVFSWLIQNMEPQLANNLTEYSTAKDLWNALVITYSSGKDKLQIFDLHTRANSMKQGSS 150 Query: 678 TLEEFWIALQGVWGEIERIDPNPMTCTEDIKTYSKVRSEQKLFQFLHGLNRQFDPIKREI 499 TLEEFW+ +QG+WGEI+R +PNPMTC+ DI TY+KV+ EQKLFQFL+G++ +D IKREI Sbjct: 151 TLEEFWLTMQGIWGEIDRREPNPMTCSTDIATYNKVKQEQKLFQFLNGIDHLYDQIKREI 210 Query: 498 LRLETLPTAETAYGTVRKEAAHQNILGATSSE--TQGVAAGLHV----RSGEIEGAG 346 LR E LP+AE AY +VRKEAA NI+G + E +QG+ G + + E G G Sbjct: 211 LRSEHLPSAEAAYASVRKEAARLNIMGPANRESLSQGIGDGFVIIGKKEASEATGVG 267