BLASTX nr result
ID: Chrysanthemum22_contig00020368
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00020368
         (1063 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|KVI12279.1| protein of unknown function DUF292, eukaryotic [C...   256   9e-81
ref|XP_022034988.1| uncharacterized protein LOC110936882 isoform...   249   1e-71
gb|KVI01222.1| protein of unknown function DUF292, eukaryotic, p...   249   2e-71
ref|XP_022027195.1| uncharacterized protein LOC110928490 [Helian...   223   3e-62
gb|PLY80901.1| hypothetical protein LSAT_8X88000 [Lactuca sativa]     207   3e-56
ref|XP_023769805.1| uncharacterized protein LOC111918365 [Lactuc...   207   3e-56
gb|PLY76949.1| hypothetical protein LSAT_7X39321 [Lactuca sativa]     202   3e-55
ref|XP_023729811.1| uncharacterized protein F59B2.12-like [Lactu...   202   4e-55
ref|XP_022034989.1| uncharacterized protein LOC110936882 isoform...   194   6e-52
gb|OWM66105.1| hypothetical protein CDL15_Pgr015532 [Punica gran...   173   4e-44
ref|XP_010090010.1| uncharacterized protein LOC21409863 [Morus n...   172   7e-44
ref|XP_022842914.1| uncharacterized protein LOC111366373 isoform...   172   9e-44
ref|XP_022842912.1| uncharacterized protein LOC111366373 isoform...   172   9e-44
ref|XP_022842913.1| uncharacterized protein LOC111366373 isoform...   170   4e-43
ref|XP_017222640.1| PREDICTED: filaggrin-like [Daucus carota sub...   169   8e-43
ref|XP_018819031.1| PREDICTED: uncharacterized protein LOC108989...   169   8e-43
ref|XP_018819030.1| PREDICTED: uncharacterized protein LOC108989...
169 8e-43 ref|XP_022897495.1| dentin sialophosphoprotein-like isoform X1 [... 168 1e-42 dbj|GAV62190.1| Ist1 domain-containing protein [Cephalotus folli... 168 2e-42 ref|XP_012076797.1| uncharacterized protein LOC105637790 [Jatrop... 168 2e-42 >gb|KVI12279.1| protein of unknown function DUF292, eukaryotic [Cynara cardunculus var. scolymus] Length = 999 Score = 256 bits (654), Expect(2) = 9e-81 Identities = 140/249 (56%), Positives = 172/249 (69%), Gaps = 3/249 (1%) Frame = -1 Query: 1063 DIPELANIQKQFKAKYGKDFVSAAVELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 884 DI EL++I+K F KYGK+F+SAA ELRPDCGVSR+LVEKLSAVAPD+QTK+KVLS VAK Sbjct: 94 DISELSDIRKHFTRKYGKEFISAATELRPDCGVSRLLVEKLSAVAPDLQTKIKVLSDVAK 153 Query: 883 EHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLETTDPPKVQPSYVNAVPSHEEKRNAP 704 EHNI WD TSFE ESKP +DLLNGPS+ EK + T DPPK Q S + V S +EK++AP Sbjct: 154 EHNIKWDPTSFEEKESKPTSDLLNGPSSFEKIGMTTVDPPKTQASNSDVVHSRKEKQDAP 213 Query: 703 INYSEQNKRFTLGSQNSTPTDN-DVETSS-ATHGDMSSSGTTFERMEKRENWNMEFKDXX 530 I++++QN+++TL ++N+T TDN VETSS ATH DM RENWNMEFKD Sbjct: 214 IDFAQQNRKYTLDTRNTTSTDNVGVETSSAATHADM------------RENWNMEFKDAT 261 Query: 529 XXXXXXXXXXXXXXXXXXXXAQFSKEEKVAKQRPAGSHVSNVRDE-THVSAPSGSIDEGR 353 AQFS++EK+ KQ P GSHVS++RDE + VSAPSG E Sbjct: 262 SAAQAAAESAERAAMAARAAAQFSRDEKIVKQYPTGSHVSDIRDEASRVSAPSGFTGEHH 321 Query: 352 FKDSYERSS 326 +D E SS Sbjct: 322 SRDLNESSS 330 Score = 74.3 bits (181), Expect(2) = 9e-81 Identities = 42/115 (36%), Positives = 64/115 (55%), Gaps = 30/115 (26%) Frame = -2 Query: 255 RRPKKVNQHVDQNEYDTSHRATERY-----GGNTSSSRPTAFKSNDDKLEDGKFVSDVHM 91 R+PK +NQ D++E++TS +ATER+ GGN+ SS PT+F+SN+D +ED K V++ HM Sbjct: 333 RKPKMLNQKTDRSEHNTSKKATERFAGDSHGGNSRSSMPTSFRSNNDSIEDEKLVNNFHM 392 Query: 90 EDEY-------------------------YEDFKTEVGSSQKERSEYENSYYFAD 1 D Y +E+ K E+ S +K+ SE E+ FA+ Sbjct: 393 ADGYFEESLNQDQDPDPDPPHPKMTTSEGFEESKAELASGKKDSSESEDINCFAE 447 >ref|XP_022034988.1| uncharacterized protein LOC110936882 isoform X1 [Helianthus annuus] gb|OTG28547.1| putative vacuolar protein sorting-associated 
protein Ist1 [Helianthus annuus] Length = 928 Score = 249 bits (636), Expect = 1e-71 Identities = 133/242 (54%), Positives = 165/242 (68%) Frame = -1 Query: 1063 DIPELANIQKQFKAKYGKDFVSAAVELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 884 DI EL +++KQF AKYGKDFVSAA+EL PDCGVSRMLVEKLSAVAPD+QTK+KVLSAVAK Sbjct: 112 DISELYDVRKQFTAKYGKDFVSAAIELHPDCGVSRMLVEKLSAVAPDLQTKIKVLSAVAK 171 Query: 883 EHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLETTDPPKVQPSYVNAVPSHEEKRNAP 704 +HNIDWD TSFE ESKP +DLLNGP++ E AS+ + PK+QP V SHEEK++AP Sbjct: 172 DHNIDWDPTSFEEKESKPSSDLLNGPASFENASMANVELPKIQP-----VHSHEEKQSAP 226 Query: 703 INYSEQNKRFTLGSQNSTPTDNDVETSSATHGDMSSSGTTFERMEKRENWNMEFKDXXXX 524 +++SEQN+++TL +QN T T++ VET SSSG T + ME ++NWNMEFKD Sbjct: 227 VDFSEQNRKYTLNTQNVTSTNSGVET--------SSSGVTHDWMETKQNWNMEFKDATSA 278 Query: 523 XXXXXXXXXXXXXXXXXXAQFSKEEKVAKQRPAGSHVSNVRDETHVSAPSGSIDEGRFKD 344 AQFS +EK+A + P G HV N RDE P GS G + Sbjct: 279 AQAAAESAERAAVAARAAAQFSSQEKIANRPPTGPHVFNSRDE----YPHGSTSSGYHGE 334 Query: 343 SY 338 S+ Sbjct: 335 SF 336 Score = 48.1 bits (113), Expect(2) = 3e-06 Identities = 31/98 (31%), Positives = 48/98 (48%), Gaps = 15/98 (15%) Frame = -2 Query: 252 RPKKVNQHVDQNEYDTSHRATERYGGNTSSSRPTAFKSNDDKLEDGKFVSDVHMEDEYYE 73 +PK NQ + Q+++ TS +ATE + +SN+D +E+ V++ HM D YYE Sbjct: 402 KPKTSNQDIYQSQHGTSQKATETFD-----------RSNNDSIENETLVNNFHMTDGYYE 450 Query: 72 ---------------DFKTEVGSSQKERSEYENSYYFA 4 + KTE+ S +K E EN +FA Sbjct: 451 NSLHEDQEGSSSPERESKTELVSDRKGSIENENINFFA 488 Score = 32.7 bits (73), Expect(2) = 3e-06 Identities = 19/36 (52%), Positives = 23/36 (63%), Gaps = 1/36 (2%) Frame = -1 Query: 427 AGSHVSNVRDE-THVSAPSGSIDEGRFKDSYERSSD 323 AG HVSN R E +HVSAPSG E DS ++S+ Sbjct: 372 AGPHVSNSRYEPSHVSAPSGYSGESSSDDSKPKTSN 407 >gb|KVI01222.1| protein of unknown function DUF292, eukaryotic, partial [Cynara cardunculus var. 
scolymus] Length = 988 Score = 249 bits (636), Expect = 2e-71 Identities = 139/260 (53%), Positives = 165/260 (63%), Gaps = 15/260 (5%) Frame = -1 Query: 1063 DIPELANIQKQFKAKYGKDFVSAAVELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 884 DIPEL +++KQF AKYGK+FVSAA+ELRPDCGVSRMLVEKLSA+APDVQTK+KVL+AVAK Sbjct: 172 DIPELLDVRKQFTAKYGKEFVSAAIELRPDCGVSRMLVEKLSAIAPDVQTKMKVLTAVAK 231 Query: 883 EHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLETTDPPKVQPSYVNAVPSHEEKRNAP 704 EHNI+WD TSFE ESKPP+DLLNGP+N EK S DPPK+Q V VP HEE NAP Sbjct: 232 EHNINWDPTSFEEKESKPPDDLLNGPANFEKISTINVDPPKIQTPNVQNVPIHEENPNAP 291 Query: 703 INYSEQNKRFTLGSQNSTPTDNDVETSSATHGDMSSSGTTFERMEK-------------- 566 N+SEQN+R+TL +Q S ++S T DM SSG T E M Sbjct: 292 FNFSEQNRRYTLDTQKS--------SASTTREDMRSSGMTSETMNMRQSFNRNHYDSSSG 343 Query: 565 RENWNMEFKDXXXXXXXXXXXXXXXXXXXXXXAQFSKEEKVAKQRPAGSHVSNVRDE-TH 389 RENWNMEFKD A S + K+ +Q S+ S++R E T Sbjct: 344 RENWNMEFKDATSAAQAAAESAERASMAARAAAHLSSQGKITRQYSTESYDSHIRHERTQ 403 Query: 388 VSAPSGSIDEGRFKDSYERS 329 VS+ S D+ FKDSY RS Sbjct: 404 VSSTSEFPDQHHFKDSYNRS 423 >ref|XP_022027195.1| uncharacterized protein LOC110928490 [Helianthus annuus] gb|OTG30096.1| putative vacuolar protein sorting-associated protein Ist1 [Helianthus annuus] Length = 868 Score = 223 bits (568), Expect = 3e-62 Identities = 126/259 (48%), Positives = 157/259 (60%), Gaps = 13/259 (5%) Frame = -1 Query: 1063 DIPELANIQKQFKAKYGKDFVSAAVELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 884 DIPELA+++K F AKYGK+FVSAA+ELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK Sbjct: 112 DIPELADVKKNFTAKYGKEFVSAAIELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 171 Query: 883 EHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLETTDPPKVQPSYVNAVPSHEEKRNAP 704 E+N++WD TSFE ESKPP+DLLNGP+ IEK S + D PK+Q SY V SHEE N Sbjct: 172 EYNVNWDPTSFEEKESKPPDDLLNGPTYIEKPSTISVDSPKIQTSYAQNVQSHEENPNGR 231 Query: 703 INYSEQNKRFTLGSQNSTPTDNDVETSSATHGDMSSSGTTFERMEKR------------- 563 +++++QN+RFTL +QN V T+ +TH DM SG M R Sbjct: 232 VDFAQQNRRFTLDAQN-------VPTADSTHDDMGPSGLGSGTMNTRHSFNSSNNSSFSK 284 Query: 562 
ENWNMEFKDXXXXXXXXXXXXXXXXXXXXXXAQFSKEEKVAKQRPAGSHVSNVRDETHVS 383 E WNMEFKD A S + K+ +Q + S+ S + + Sbjct: 285 ETWNMEFKDATSAAQAAAESAERASIAARAAAMLSSQGKITRQYSSESYSS----QQNAR 340 Query: 382 APSGSIDEGRFKDSYERSS 326 S + FKD+Y +S Sbjct: 341 VSSEYTGQHSFKDTYNNTS 359 >gb|PLY80901.1| hypothetical protein LSAT_8X88000 [Lactuca sativa] Length = 909 Score = 207 bits (526), Expect = 3e-56 Identities = 116/230 (50%), Positives = 146/230 (63%), Gaps = 11/230 (4%) Frame = -1 Query: 1063 DIPELANIQKQFKAKYGKDFVSAAVELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 884 DI EL +I+KQF +KYGK+FVSAA+ELRPD GVSRMLVEKLSAVAPD+QTKVKVL+A+AK Sbjct: 112 DISELVDIKKQFTSKYGKEFVSAALELRPDSGVSRMLVEKLSAVAPDIQTKVKVLTAIAK 171 Query: 883 EHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLETTDPPKVQPSYVNAVPSHEEKRNAP 704 EHNI+W+ TSFE ESKPPNDLLNGP++ E AS+ +P K+QPS VNAV SHE+K P Sbjct: 172 EHNINWEPTSFEEKESKPPNDLLNGPNSFENASMANANPSKIQPSNVNAVHSHEKKAGPP 231 Query: 703 INYSEQNKRFTLGSQNSTPTDNDVETSSATHGDMSSSGTTFERMEKR---------ENWN 551 ++++EQN+++ + DN T SSSG T E+ME R +NWN Sbjct: 232 LDFAEQNRKYIV--------DNGAAT--------SSSGITSEKMEMRGDDDFSSGGQNWN 275 Query: 550 MEFKDXXXXXXXXXXXXXXXXXXXXXXAQFSKEEKVA--KQRPAGSHVSN 407 M FKD AQF+ EK+ + P S + N Sbjct: 276 MGFKDATSAAEAAAESAERAAMAARAAAQFASHEKITTRDETPRKSKIIN 325 >ref|XP_023769805.1| uncharacterized protein LOC111918365 [Lactuca sativa] Length = 915 Score = 207 bits (526), Expect = 3e-56 Identities = 116/230 (50%), Positives = 146/230 (63%), Gaps = 11/230 (4%) Frame = -1 Query: 1063 DIPELANIQKQFKAKYGKDFVSAAVELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 884 DI EL +I+KQF +KYGK+FVSAA+ELRPD GVSRMLVEKLSAVAPD+QTKVKVL+A+AK Sbjct: 118 DISELVDIKKQFTSKYGKEFVSAALELRPDSGVSRMLVEKLSAVAPDIQTKVKVLTAIAK 177 Query: 883 EHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLETTDPPKVQPSYVNAVPSHEEKRNAP 704 EHNI+W+ TSFE ESKPPNDLLNGP++ E AS+ +P K+QPS VNAV SHE+K P Sbjct: 178 EHNINWEPTSFEEKESKPPNDLLNGPNSFENASMANANPSKIQPSNVNAVHSHEKKAGPP 237 Query: 703 INYSEQNKRFTLGSQNSTPTDNDVETSSATHGDMSSSGTTFERMEKR---------ENWN 551 ++++EQN+++ + DN T SSSG T E+ME R +NWN Sbjct: 238 
LDFAEQNRKYIV--------DNGAAT--------SSSGITSEKMEMRGDDDFSSGGQNWN 281 Query: 550 MEFKDXXXXXXXXXXXXXXXXXXXXXXAQFSKEEKVA--KQRPAGSHVSN 407 M FKD AQF+ EK+ + P S + N Sbjct: 282 MGFKDATSAAEAAAESAERAAMAARAAAQFASHEKITTRDETPRKSKIIN 331 >gb|PLY76949.1| hypothetical protein LSAT_7X39321 [Lactuca sativa] Length = 737 Score = 202 bits (514), Expect = 3e-55 Identities = 115/252 (45%), Positives = 147/252 (58%), Gaps = 5/252 (1%) Frame = -1 Query: 1063 DIPELANIQKQFKAKYGKDFVSAAVELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 884 DIPEL + +K F AKYGK+F SAA+ELRPD GV+RM+VEKLSAVAPD+QTK+KVLSAVAK Sbjct: 77 DIPELVDARKNFTAKYGKEFASAALELRPDSGVNRMMVEKLSAVAPDIQTKLKVLSAVAK 136 Query: 883 EHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLETTDPPKVQPSYVNAVPSHEEKRNAP 704 EHN+DWD+T FE TESKP +DLLNG N E AS+ D PK+Q S + V SH +K N Sbjct: 137 EHNVDWDSTLFEETESKPKDDLLNGSVNFENASMMNVDSPKIQTSNIQNVQSHMQKLNVT 196 Query: 703 INYSEQNKRFTLGSQNSTPTDNDVETSSATHGDMSS-----SGTTFERMEKRENWNMEFK 539 ++++QN+R+TLGSQN T ND S M+ + T + NWNMEFK Sbjct: 197 DDFTQQNRRYTLGSQNI--TSNDTNPSGMASEMMNKRHSFHTNTNNASSGRENNWNMEFK 254 Query: 538 DXXXXXXXXXXXXXXXXXXXXXXAQFSKEEKVAKQRPAGSHVSNVRDETHVSAPSGSIDE 359 D A+ S + K+ + VS S + ++ Sbjct: 255 DATSAAQAAAESAERASMAARAAAELSSKGKIGSPK--------------VSTASEAPNQ 300 Query: 358 GRFKDSYERSSD 323 +FKDSY S D Sbjct: 301 HQFKDSYNNSFD 312 >ref|XP_023729811.1| uncharacterized protein F59B2.12-like [Lactuca sativa] Length = 772 Score = 202 bits (514), Expect = 4e-55 Identities = 115/252 (45%), Positives = 147/252 (58%), Gaps = 5/252 (1%) Frame = -1 Query: 1063 DIPELANIQKQFKAKYGKDFVSAAVELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 884 DIPEL + +K F AKYGK+F SAA+ELRPD GV+RM+VEKLSAVAPD+QTK+KVLSAVAK Sbjct: 112 DIPELVDARKNFTAKYGKEFASAALELRPDSGVNRMMVEKLSAVAPDIQTKLKVLSAVAK 171 Query: 883 EHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLETTDPPKVQPSYVNAVPSHEEKRNAP 704 EHN+DWD+T FE TESKP +DLLNG N E AS+ D PK+Q S + V SH +K N Sbjct: 172 EHNVDWDSTLFEETESKPKDDLLNGSVNFENASMMNVDSPKIQTSNIQNVQSHMQKLNVT 231 Query: 703 INYSEQNKRFTLGSQNSTPTDNDVETSSATHGDMSS-----SGTTFERMEKRENWNMEFK 539 
++++QN+R+TLGSQN T ND S M+ + T + NWNMEFK Sbjct: 232 DDFTQQNRRYTLGSQNI--TSNDTNPSGMASEMMNKRHSFHTNTNNASSGRENNWNMEFK 289 Query: 538 DXXXXXXXXXXXXXXXXXXXXXXAQFSKEEKVAKQRPAGSHVSNVRDETHVSAPSGSIDE 359 D A+ S + K+ + VS S + ++ Sbjct: 290 DATSAAQAAAESAERASMAARAAAELSSKGKIGSPK--------------VSTASEAPNQ 335 Query: 358 GRFKDSYERSSD 323 +FKDSY S D Sbjct: 336 HQFKDSYNNSFD 347 >ref|XP_022034989.1| uncharacterized protein LOC110936882 isoform X2 [Helianthus annuus] Length = 782 Score = 194 bits (492), Expect = 6e-52 Identities = 106/207 (51%), Positives = 134/207 (64%) Frame = -1 Query: 958 MLVEKLSAVAPDVQTKVKVLSAVAKEHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLE 779 MLVEKLSAVAPD+QTK+KVLSAVAK+HNIDWD TSFE ESKP +DLLNGP++ E AS+ Sbjct: 1 MLVEKLSAVAPDLQTKIKVLSAVAKDHNIDWDPTSFEEKESKPSSDLLNGPASFENASMA 60 Query: 778 TTDPPKVQPSYVNAVPSHEEKRNAPINYSEQNKRFTLGSQNSTPTDNDVETSSATHGDMS 599 + PK+QP V SHEEK++AP+++SEQN+++TL +QN T T++ VET S Sbjct: 61 NVELPKIQP-----VHSHEEKQSAPVDFSEQNRKYTLNTQNVTSTNSGVET--------S 107 Query: 598 SSGTTFERMEKRENWNMEFKDXXXXXXXXXXXXXXXXXXXXXXAQFSKEEKVAKQRPAGS 419 SSG T + ME ++NWNMEFKD AQFS +EK+A + P G Sbjct: 108 SSGVTHDWMETKQNWNMEFKDATSAAQAAAESAERAAVAARAAAQFSSQEKIANRPPTGP 167 Query: 418 HVSNVRDETHVSAPSGSIDEGRFKDSY 338 HV N RDE P GS G +S+ Sbjct: 168 HVFNSRDE----YPHGSTSSGYHGESF 190 Score = 48.1 bits (113), Expect(2) = 3e-06 Identities = 31/98 (31%), Positives = 48/98 (48%), Gaps = 15/98 (15%) Frame = -2 Query: 252 RPKKVNQHVDQNEYDTSHRATERYGGNTSSSRPTAFKSNDDKLEDGKFVSDVHMEDEYYE 73 +PK NQ + Q+++ TS +ATE + +SN+D +E+ V++ HM D YYE Sbjct: 256 KPKTSNQDIYQSQHGTSQKATETFD-----------RSNNDSIENETLVNNFHMTDGYYE 304 Query: 72 ---------------DFKTEVGSSQKERSEYENSYYFA 4 + KTE+ S +K E EN +FA Sbjct: 305 NSLHEDQEGSSSPERESKTELVSDRKGSIENENINFFA 342 Score = 32.7 bits (73), Expect(2) = 3e-06 Identities = 19/36 (52%), Positives = 23/36 (63%), Gaps = 1/36 (2%) Frame = -1 Query: 427 AGSHVSNVRDE-THVSAPSGSIDEGRFKDSYERSSD 323 AG HVSN R E +HVSAPSG E DS ++S+ Sbjct: 226 AGPHVSNSRYEPSHVSAPSGYSGESSSDDSKPKTSN 261 >gb|OWM66105.1| 
hypothetical protein CDL15_Pgr015532 [Punica granatum] gb|PKI49080.1| hypothetical protein CRG98_030532 [Punica granatum] Length = 1280 Score = 173 bits (438), Expect = 4e-44 Identities = 108/273 (39%), Positives = 143/273 (52%), Gaps = 27/273 (9%) Frame = -1 Query: 1063 DIPELANIQKQFKAKYGKDFVSAAVELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 884 DIPELA+++K F AKYGK+FVSAA+ELRPD GV+R ++EKLSA APD QTK+K+L+A+AK Sbjct: 112 DIPELADVRKHFTAKYGKEFVSAAIELRPDGGVNRTMIEKLSAKAPDGQTKLKILTAIAK 171 Query: 883 EHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLETTDPPKVQPSY-------VNAVPSH 725 EHNI WD SF +S P DLLNGP+ A+ + VQ Y ++ P + Sbjct: 172 EHNIKWDPKSFGEKDSNPREDLLNGPTTFGNANNMNVESSNVQAHYYGRGTPDIHNPPQN 231 Query: 724 EEKRNAPINYSEQNKRFTLGSQNSTPTDNDVETSSATHGDMSSSGTTFERME-------- 569 E K+ APINY+ N R + GSQN P DV S+A SG +RME Sbjct: 232 EVKQEAPINYNGNNIRSSFGSQNVNPA--DVNASAAPSPHWKPSGNGMDRMESGNLHSED 289 Query: 568 ------KRENWNMEFKDXXXXXXXXXXXXXXXXXXXXXXAQFSKEEKVAKQRPAGSHVSN 407 R++WNMEF+D A+ S K Q GS S+ Sbjct: 290 QSPNFANRQDWNMEFEDATAAAQAAAESAERASMAARAAAKLSSRGKAMNQYSTGSQGSS 349 Query: 406 ------VRDETHVSAPSGSIDEGRFKDSYERSS 326 R + S S + G+ +++ERSS Sbjct: 350 AYARDGTRRHSRSSFDSEAFAGGQMGNNFERSS 382 >ref|XP_010090010.1| uncharacterized protein LOC21409863 [Morus notabilis] gb|EXB38807.1| hypothetical protein L484_027240 [Morus notabilis] Length = 1100 Score = 172 bits (436), Expect = 7e-44 Identities = 95/202 (47%), Positives = 126/202 (62%), Gaps = 26/202 (12%) Frame = -1 Query: 1063 DIPELANIQKQFKAKYGKDFVSAAVELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 884 D+PEL +I+K AKYGK+FV+ A+ELRPDCGV+RMLVEKLSA APD QTK+K+L+A+A+ Sbjct: 112 DVPELMDIRKYLTAKYGKEFVTTAIELRPDCGVNRMLVEKLSAKAPDGQTKLKILTAIAE 171 Query: 883 EHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLETTDPPKVQ---------PSYVNAVP 731 EHN+ WD F +S PP DLLNGP+ E A+ ++ P P V A P Sbjct: 172 EHNVKWDPDLFSGNDSMPPQDLLNGPNTFEAANKIHSEAPSGPAEPIHDDRGPPNVQAPP 231 Query: 730 SHEEKRNAPINYSEQNKRFTLGSQNSTPTD--NDVETSSAT-HGDMSSSGTTFERME--- 569 H EK++ + ++E N+R + GSQNS T + T+SAT H D+ SSG+ E +E Sbjct: 232 
RHSEKQDEYVKFNEHNRRMSSGSQNSASTGVATTMATTSATFHPDLRSSGSGTEWVEYKQ 291 Query: 568 -----------KRENWNMEFKD 536 R+NWNMEFKD Sbjct: 292 SYLGSENAFPAGRQNWNMEFKD 313 >ref|XP_022842914.1| uncharacterized protein LOC111366373 isoform X3 [Olea europaea var. sylvestris] Length = 1217 Score = 172 bits (435), Expect = 9e-44 Identities = 95/230 (41%), Positives = 133/230 (57%), Gaps = 10/230 (4%) Frame = -1 Query: 1063 DIPELANIQKQFKAKYGKDFVSAAVELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 884 D+PEL +I+K AKYGKDF +AA+ELRP+CGVSRMLVEKLS +APD QTK+K+LSA+A+ Sbjct: 124 DVPELLDIRKHLTAKYGKDFTTAAIELRPECGVSRMLVEKLSPMAPDGQTKIKILSAIAE 183 Query: 883 EHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLETTDPPKVQPSYVNAVPSHEEKRNAP 704 EHN+ WD SF + PP+DLLNGPS IEK S +PP + A P + ++P Sbjct: 184 EHNVKWDPNSFGEKDGMPPSDLLNGPSTIEKNSKIYAEPPLFE-----ATPVQNKMHSSP 238 Query: 703 INYSEQNKRFTLGSQNSTPTDND-----VETSSATHGD-----MSSSGTTFERMEKRENW 554 +N++EQ+ R +LG+QNST + + +++ GD + G F ++ W Sbjct: 239 LNFAEQDPRSSLGTQNSTASQSSGVGSRLKSEVRPPGDERVQSIQEDGNAF----SKQRW 294 Query: 553 NMEFKDXXXXXXXXXXXXXXXXXXXXXXAQFSKEEKVAKQRPAGSHVSNV 404 NMEFKD A+ S +++ KQ SH +V Sbjct: 295 NMEFKDATSAAQAAAESAELASMAARAAAELSSPDRIMKQYSTESHKYDV 344 >ref|XP_022842912.1| uncharacterized protein LOC111366373 isoform X1 [Olea europaea var. 
sylvestris] Length = 1219 Score = 172 bits (435), Expect = 9e-44 Identities = 94/229 (41%), Positives = 132/229 (57%), Gaps = 9/229 (3%) Frame = -1 Query: 1063 DIPELANIQKQFKAKYGKDFVSAAVELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 884 D+PEL +I+K AKYGKDF +AA+ELRP+CGVSRMLVEKLS +APD QTK+K+LSA+A+ Sbjct: 124 DVPELLDIRKHLTAKYGKDFTTAAIELRPECGVSRMLVEKLSPMAPDGQTKIKILSAIAE 183 Query: 883 EHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLETTDPPKVQPSYVNAVPSHEEKRNAP 704 EHN+ WD SF + PP+DLLNGPS IEK S +PP + A P + ++P Sbjct: 184 EHNVKWDPNSFGEKDGMPPSDLLNGPSTIEKNSKIYAEPPLFE-----ATPVQNKMHSSP 238 Query: 703 INYSEQNKRFTLGSQNSTPTDNDVETSSATHGDMSSSGTTFERME---------KRENWN 551 +N++EQ+ R +LG+QNST + + S ++ ER++ ++ WN Sbjct: 239 LNFAEQDPRSSLGTQNSTASQSS-GVGSRLKSEVRPPAVGDERVQSIQEDGNAFSKQRWN 297 Query: 550 MEFKDXXXXXXXXXXXXXXXXXXXXXXAQFSKEEKVAKQRPAGSHVSNV 404 MEFKD A+ S +++ KQ SH +V Sbjct: 298 MEFKDATSAAQAAAESAELASMAARAAAELSSPDRIMKQYSTESHKYDV 346 >ref|XP_022842913.1| uncharacterized protein LOC111366373 isoform X2 [Olea europaea var. sylvestris] Length = 1218 Score = 170 bits (430), Expect = 4e-43 Identities = 96/231 (41%), Positives = 131/231 (56%), Gaps = 11/231 (4%) Frame = -1 Query: 1063 DIPELANIQKQFKAKYGKDFVSAAVELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 884 D+PEL +I+K AKYGKDF +AA+ELRP+CGVSRMLVEKLS +APD QTK+K+LSA+A+ Sbjct: 124 DVPELLDIRKHLTAKYGKDFTTAAIELRPECGVSRMLVEKLSPMAPDGQTKIKILSAIAE 183 Query: 883 EHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLETTDPPKVQPSYVNAVPSHEEKRNAP 704 EHN+ WD SF + PP+DLLNGPS IEK S +PP + A P + ++P Sbjct: 184 EHNVKWDPNSFGEKDGMPPSDLLNGPSTIEKNSKIYAEPPLFE-----ATPVQNKMHSSP 238 Query: 703 INYSEQNKRFTLGSQNSTPTDNDVETSSATH------GD-----MSSSGTTFERMEKREN 557 +N++EQ+ R +LG+QNST + + S GD + G F ++ Sbjct: 239 LNFAEQDPRSSLGTQNSTASQSSGVGSRLKSEVRPPVGDERVQSIQEDGNAF----SKQR 294 Query: 556 WNMEFKDXXXXXXXXXXXXXXXXXXXXXXAQFSKEEKVAKQRPAGSHVSNV 404 WNMEFKD A+ S +++ KQ SH +V Sbjct: 295 WNMEFKDATSAAQAAAESAELASMAARAAAELSSPDRIMKQYSTESHKYDV 345 >ref|XP_017222640.1| PREDICTED: filaggrin-like [Daucus carota subsp. 
sativus] gb|KZM83950.1| hypothetical protein DCAR_028628 [Daucus carota subsp. sativus] Length = 1089 Score = 169 bits (428), Expect = 8e-43 Identities = 98/229 (42%), Positives = 132/229 (57%), Gaps = 11/229 (4%) Frame = -1 Query: 1063 DIPELANIQKQFKAKYGKDFVSAAVELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 884 D+PEL +++K F AKYGK+FV+ A+ELRP+CGV RMLVEKLSAVAPD Q K K+L+A+A+ Sbjct: 112 DVPELLDVKKHFTAKYGKEFVTTALELRPNCGVGRMLVEKLSAVAPDGQAKFKILNAIAE 171 Query: 883 EHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLETTDPPKVQPSYVNAVPSHEEKRNAP 704 E NI+WD+ SFE E+KP NDLLNGPS EKA + K+ S V A SH ++ P Sbjct: 172 ERNIEWDSKSFEEKETKPTNDLLNGPSTFEKAGEMAVEATKIGVSDVQATSSH-DRHTRP 230 Query: 703 INYSEQNKRFTLGSQNSTPTDN-DVETSSATHGDMSSSGT-------TFERME---KREN 557 +N +E N ++ P D+ T+ TH D SG +F R E +R++ Sbjct: 231 LNSTETNTVSSVDVHTVLPVDHGGRNTNDITHSDPRHSGNETKVGSHSFARDENYSRRQD 290 Query: 556 WNMEFKDXXXXXXXXXXXXXXXXXXXXXXAQFSKEEKVAKQRPAGSHVS 410 WNMEFKD A+FS+ E ++ P+ S S Sbjct: 291 WNMEFKDARSAAQAAAESAERASMAARAAAEFSRREDDSRHLPSESRNS 339 >ref|XP_018819031.1| PREDICTED: uncharacterized protein LOC108989762 isoform X2 [Juglans regia] Length = 1112 Score = 169 bits (428), Expect = 8e-43 Identities = 99/235 (42%), Positives = 126/235 (53%), Gaps = 16/235 (6%) Frame = -1 Query: 1063 DIPELANIQKQFKAKYGKDFVSAAVELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 884 D+PEL +I+KQF AKYGKDFVSAA+ELRPDCGV RMLVEKLSA APD+QTK+K+LS +A+ Sbjct: 114 DVPELMDIRKQFTAKYGKDFVSAAIELRPDCGVGRMLVEKLSAKAPDIQTKIKILSTIAE 173 Query: 883 EHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLETTDPPKVQPSYVNAVPSHEEKRNAP 704 EHN+ WD S E +++PP D+LNGP+ EKAS +P +V PSH++K Sbjct: 174 EHNVKWDPNSLEEQDTRPPEDILNGPNTFEKASKIYVEP------HVQVPPSHDDKGPPN 227 Query: 703 INYSE--QNKRFTLGSQNSTPTDNDVETSSATHGDMSSSGTTFERMEK------------ 566 + S +N +G++ T DM SSG E E Sbjct: 228 VRSSPHLRNPDSDIGAKGGA-------TFGTFQADMGSSGNETEETESRHSYSGSGNALS 280 Query: 565 --RENWNMEFKDXXXXXXXXXXXXXXXXXXXXXXAQFSKEEKVAKQRPAGSHVSN 407 R+NWNMEFKD A+ S KV KQ SH S+ Sbjct: 281 MGRQNWNMEFKDATAAAQAAAESAERASMAARAAAELSSRAKVIKQYSMKSHKSS 335 >ref|XP_018819030.1| 
PREDICTED: uncharacterized protein LOC108989762 isoform X1 [Juglans regia] Length = 1186 Score = 169 bits (428), Expect = 8e-43 Identities = 99/235 (42%), Positives = 126/235 (53%), Gaps = 16/235 (6%) Frame = -1 Query: 1063 DIPELANIQKQFKAKYGKDFVSAAVELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 884 D+PEL +I+KQF AKYGKDFVSAA+ELRPDCGV RMLVEKLSA APD+QTK+K+LS +A+ Sbjct: 114 DVPELMDIRKQFTAKYGKDFVSAAIELRPDCGVGRMLVEKLSAKAPDIQTKIKILSTIAE 173 Query: 883 EHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLETTDPPKVQPSYVNAVPSHEEKRNAP 704 EHN+ WD S E +++PP D+LNGP+ EKAS +P +V PSH++K Sbjct: 174 EHNVKWDPNSLEEQDTRPPEDILNGPNTFEKASKIYVEP------HVQVPPSHDDKGPPN 227 Query: 703 INYSE--QNKRFTLGSQNSTPTDNDVETSSATHGDMSSSGTTFERMEK------------ 566 + S +N +G++ T DM SSG E E Sbjct: 228 VRSSPHLRNPDSDIGAKGGA-------TFGTFQADMGSSGNETEETESRHSYSGSGNALS 280 Query: 565 --RENWNMEFKDXXXXXXXXXXXXXXXXXXXXXXAQFSKEEKVAKQRPAGSHVSN 407 R+NWNMEFKD A+ S KV KQ SH S+ Sbjct: 281 MGRQNWNMEFKDATAAAQAAAESAERASMAARAAAELSSRAKVIKQYSMKSHKSS 335 >ref|XP_022897495.1| dentin sialophosphoprotein-like isoform X1 [Olea europaea var. 
sylvestris] Length = 1219 Score = 168 bits (426), Expect = 1e-42 Identities = 90/234 (38%), Positives = 134/234 (57%), Gaps = 9/234 (3%) Frame = -1 Query: 1063 DIPELANIQKQFKAKYGKDFVSAAVELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 884 D+PEL +++K F AKYGK+F +AA+ELRP+CGVSRMLVEKLSA+APD QTK+K+LSA+A+ Sbjct: 124 DVPELLDVRKHFTAKYGKEFTTAAIELRPECGVSRMLVEKLSAIAPDGQTKIKILSAIAE 183 Query: 883 EHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLETTDPPKVQPSYVNAVPSHEEKRNAP 704 EHN+ WD SF + P NDLL+GPS IE +S P + S + P + + ++ Sbjct: 184 EHNVKWDPNSFGEKDGTPHNDLLSGPSTIENSSKMYAGAPLFEASQSQSPPVNNKTHSSL 243 Query: 703 INYSEQNKRFTLGSQNSTPTDNDVETSSATHGDMSSSGTTFERME---------KRENWN 551 +N+SEQ+ R ++ +QNST + SS + ++ ER++ ++ WN Sbjct: 244 LNFSEQDPRSSVETQNST-SSQSFGVSSTLNSEVRPPAVRDERVQSIHEDANAFSKQRWN 302 Query: 550 MEFKDXXXXXXXXXXXXXXXXXXXXXXAQFSKEEKVAKQRPAGSHVSNVRDETH 389 M FKD A+ S + ++ +Q SH+S+V H Sbjct: 303 MGFKDATSAAQAAAESAELASMAARAAAELSSQGRITRQYSTESHMSDVHISRH 356 >dbj|GAV62190.1| Ist1 domain-containing protein [Cephalotus follicularis] Length = 1092 Score = 168 bits (425), Expect = 2e-42 Identities = 95/203 (46%), Positives = 117/203 (57%), Gaps = 27/203 (13%) Frame = -1 Query: 1063 DIPELANIQKQFKAKYGKDFVSAAVELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 884 D+PEL + +K F AKYGK+F SAAVELRPDCGVSRMLVEKLSA APD TK+K+LSA+A Sbjct: 112 DLPELMDARKHFTAKYGKEFASAAVELRPDCGVSRMLVEKLSANAPDGPTKIKILSAIAD 171 Query: 883 EHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLETTDPPKVQ---------PSYVNAVP 731 EHNI W+ SF +SKPP DLLNGP+ KAS DPP VQ P + P Sbjct: 172 EHNIKWEPKSFGEKDSKPPEDLLNGPNTFGKASQMLVDPPNVQSPSNFYDKGPPHGQVPP 231 Query: 730 SHEEKRNAPINYSEQNKRFTLGSQNSTPTD---NDVETSSATHGDMSSSGTTFERMEKR- 563 + E + P+N +E + R SQ S TD N +S H ++ SG+ E ME Sbjct: 232 KYNEMHDVPVNLNEHHARSAQYSQTSAATDVGVNKTMSSGTYHPEVRYSGSGNEGMEMEF 291 Query: 562 --------------ENWNMEFKD 536 ++WNM FKD Sbjct: 292 MQSHTGGGNSSLGGQSWNMGFKD 314 >ref|XP_012076797.1| uncharacterized protein LOC105637790 [Jatropha curcas] gb|KDP33748.1| hypothetical protein JCGZ_07319 [Jatropha curcas] Length = 1138 Score = 168 bits 
(425), Expect = 2e-42 Identities = 111/276 (40%), Positives = 142/276 (51%), Gaps = 30/276 (10%) Frame = -1 Query: 1063 DIPELANIQKQFKAKYGKDFVSAAVELRPDCGVSRMLVEKLSAVAPDVQTKVKVLSAVAK 884 DIPEL +++K F AKYGK+FVSAAVELRPDCGVSR+LVEKLSA APD TK+KVLSA+A+ Sbjct: 112 DIPELMDVRKHFTAKYGKEFVSAAVELRPDCGVSRLLVEKLSAKAPDGPTKIKVLSAIAE 171 Query: 883 EHNIDWDATSFEMTESKPPNDLLNGPSNIEKASLETTDPPKVQPSY---------VNAVP 731 EH++ WD TSF E KPP DLLNGPS ++ S DPP VQ + + A Sbjct: 172 EHDVKWDPTSFGEKEMKPPEDLLNGPSTFQQVSKMHVDPPNVQELHNIVEKEHPNIRAPS 231 Query: 730 SHEEKRNAPINYSEQNKRFTLGSQNSTPT---DNDVETSSATHGDMSSSGTTFERME--- 569 EK AP+N N + QN + T N ++H D GT E ME Sbjct: 232 KQYEKPGAPVNSHGSNSISSSHFQNVSSTAAATNKAIQFDSSHYDPRPLGTGSEEMEFRH 291 Query: 568 -----------KRENWNMEFKDXXXXXXXXXXXXXXXXXXXXXXAQFSKEEKVAKQRPAG 422 R++WNMEFKD A+ S + ++++Q Sbjct: 292 SHAVEQSGFSAGRQSWNMEFKDATTAAQAAAESAERASMAARAAAELSSQGRMSRQHSTE 351 Query: 421 SHVSNV---RDE-THVSAPSGSIDEGRFKDSYERSS 326 S+ S+ RDE H A S E KD+ +S Sbjct: 352 SNKSSAFRPRDEGLHNYASSRLQSEHLAKDAVNNTS 387