BLASTX nr result
ID: Chrysanthemum21_contig00007652
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00007652
         (2028 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

gb|KVI07947.1| AT hook, DNA-binding motif-containing protein [Cy...   631   0.0
gb|OMO89342.1| hypothetical protein CCACVL1_07901 [Corchorus cap...   563   0.0
gb|EOX93901.1| DNA binding protein, putative isoform 1 [Theobrom...   558   0.0
ref|XP_017969458.1| PREDICTED: uncharacterized protein LOC186127...   555   0.0
ref|XP_022846709.1| uncharacterized protein LOC111369428 isoform...   555   0.0
gb|OMO95816.1| hypothetical protein COLO4_15658 [Corchorus olito...   556   0.0
ref|XP_017969459.1| PREDICTED: uncharacterized protein LOC186127...   551   0.0
ref|XP_017969462.1| PREDICTED: uncharacterized protein LOC186127...   551   0.0
gb|OTG38382.1| putative transducin/WD40 repeat-like superfamily ...   551   0.0
ref|XP_022846707.1| uncharacterized protein LOC111369428 isoform...   555   0.0
ref|XP_022765155.1| uncharacterized protein LOC111310200 isoform...   541   0.0
ref|XP_023920540.1| uncharacterized protein LOC112032069 isoform...   544   0.0
ref|XP_021986505.1| uncharacterized protein LOC110882925 isoform...   551   0.0
ref|XP_017969461.1| PREDICTED: uncharacterized protein LOC186127...   547   0.0
ref|XP_022765154.1| uncharacterized protein LOC111310200 isoform...   541   0.0
ref|XP_022765150.1| uncharacterized protein LOC111310200 isoform...   541   e-180
ref|XP_022765149.1| uncharacterized protein LOC111310200 isoform...
541 e-180 ref|XP_021300958.1| uncharacterized protein LOC110429313 [Herran... 543 e-180 emb|CDP15391.1| unnamed protein product [Coffea canephora] 546 e-180 ref|XP_023920539.1| uncharacterized protein LOC112032069 isoform... 544 e-180 >gb|KVI07947.1| AT hook, DNA-binding motif-containing protein [Cynara cardunculus var. scolymus] Length = 1062 Score = 631 bits (1628), Expect = 0.0 Identities = 338/590 (57%), Positives = 396/590 (67%), Gaps = 56/590 (9%) Frame = +3 Query: 27 KVVPEKASSRKRKGYEKQTSVKSAL-TSKSRLAKCKLSSES------------------- 146 ++V +K S RKRK +++ KS L S S L KCKL +S Sbjct: 493 ELVAKKDSGRKRKAHDEGHPEKSVLIASTSTLTKCKLRLKSVETATDLHLPSQKCGTSLL 552 Query: 147 ---------QNPMGS-SATGYSVPLETDPDSCDISEDVSLPRLVMCLAHNGKVAWDIKWR 296 Q+PM S V LETD DS I EDV+LPRLV+CLAHNGKVAWD+KWR Sbjct: 553 NADTSSGCGQDPMRSIEDKADPVLLETDMDSRCIPEDVALPRLVLCLAHNGKVAWDVKWR 612 Query: 297 PFDTCANHKHRMGYLGVLLGSGALEVWEVPLPHAAEAMFSACQKDGTDPRFIKLKPVFRC 476 P DT N KHRMGYL VLLG+GALEVWEVP PHA E MFSAC+K+GTDPRFIKL+PVFRC Sbjct: 613 PSDTYFNSKHRMGYLAVLLGNGALEVWEVPAPHAVEVMFSACRKEGTDPRFIKLEPVFRC 672 Query: 477 AMLKCGDRQSIPLTMEWSTASPHDLILAGCHDGMVALWKFSANSPLKDTRPLIFFSADIL 656 +MLKCGDRQSIPLT+EWST+SPHDLILAGCHDG+VALWKFSA+ PLKDTRPL+ F+AD + Sbjct: 673 SMLKCGDRQSIPLTLEWSTSSPHDLILAGCHDGVVALWKFSADGPLKDTRPLLRFTADTV 732 Query: 657 PIRALAWAPVASDFESANIIVTAGHKGVKFWDIRDPFRPLWDVNPVQRVIYGVDWLSDPR 836 PIRALAWAPV SD ESANIIVTAGHKG KFWD+RDPFRPLWDVNP QR+IYG+DW DPR Sbjct: 733 PIRALAWAPVPSDSESANIIVTAGHKGAKFWDLRDPFRPLWDVNPAQRIIYGLDWHPDPR 792 Query: 837 CVVLSFDDGEIRIISLSKAACDVPVTGKPFVGTQLQGLNSYHCXXXXXXXXXXXRLTGMV 1016 CVVLSFDDGEI+IISLSKAACDVPVTG PFV Q +SYHC RLTGMV Sbjct: 793 CVVLSFDDGEIQIISLSKAACDVPVTGAPFVAAQRHASHSYHCSSSSIWSVQVSRLTGMV 852 Query: 1017 AYCCSDGRVLRFQLTTKSVEKDPRRNREPHYLCGALTTEGSTLNVFSPLPN--------- 1169 AYCCSDG+V+ FQLT K+VEKDP RNREPHYLCGA++ E S L + SPLP+ Sbjct: 853 AYCCSDGKVVHFQLTMKAVEKDPSRNREPHYLCGAMSMEESGLTILSPLPDVPFLMKKSS 912 Query: 1170 -ELGETPRSRRGHLTISNQEKRAREQGLKSQTPEDQAIPQAREQGLRCYTPEEQPLALCY 1346 E G+TPR+RRG+ 
++SNQEKRA+EQ LK C +QPLA+CY Sbjct: 913 KEWGDTPRTRRGYRSLSNQEKRAKEQMLK-----------------EC----QQPLAVCY 951 Query: 1347 --NNDPGVDSGS----------DDTMVAQXXXXXXXXXXXXXXXXAHAKQAFTSTD--EV 1484 N+D S ++ + ++ +K D E+ Sbjct: 952 DGNSDSETQQSSSSKKGKRDDEEEELPSKIVGKRDDEDEDEEEQELASKIVGRREDEEEL 1011 Query: 1485 PD--VETINEKEVFPSKMVAMHRVRWNMNKGSERWLCYAGAAGIVRCQEM 1628 P V +++E PSK+V M+ VRWN NKGSERWLCY GAAGI+RCQ + Sbjct: 1012 PSKIVGKRDDEEELPSKIVGMYGVRWNTNKGSERWLCYGGAAGILRCQHI 1061 >gb|OMO89342.1| hypothetical protein CCACVL1_07901 [Corchorus capsularis] Length = 983 Score = 563 bits (1452), Expect = 0.0 Identities = 294/556 (52%), Positives = 369/556 (66%), Gaps = 23/556 (4%) Frame = +3 Query: 36 PEKASSRKRKGYEKQTSVKSA---LTSKSRLAK--CKLSSESQNPMGSSATGYSVPLETD 200 P K K+KG ++ S A ++ KSR K + S S M S+ S LE Sbjct: 435 PIKNHHEKQKGDKEVASAPDATPKISMKSRNLKRNAREISNSDESMVSNNIQDSNSLEVG 494 Query: 201 PDSCDISEDVSLPRLVMCLAHNGKVAWDIKWRPFD-TCANHKHRMGYLGVLLGSGALEVW 377 P S I D++LPR V+CLAHNGKVAWD+KWRP+D + RMGYL VLLG+G+LEVW Sbjct: 495 PGSSSIPADMALPRAVLCLAHNGKVAWDVKWRPYDINVSKCNQRMGYLAVLLGNGSLEVW 554 Query: 378 EVPLPHAAEAMFSACQKDGTDPRFIKLKPVFRCAMLKCGDRQSIPLTMEWSTASPHDLIL 557 EVPLPH ++S+ K GTDPRF+KL+PVF+C+ LKCGD QSIPLT+EWST+ PHD +L Sbjct: 555 EVPLPHMVRTVYSSSAKQGTDPRFVKLEPVFKCSKLKCGDIQSIPLTVEWSTSPPHDYLL 614 Query: 558 AGCHDGMVALWKFSANSPLKDTRPLIFFSADILPIRALAWAPVASDFESANIIVTAGHKG 737 AGCHDGMVALWKFSA++ KDTRPL+ FSAD +PIR++AWAP SD ES N+I+TAGH G Sbjct: 615 AGCHDGMVALWKFSASASPKDTRPLLCFSADTVPIRSVAWAPSGSDMESTNVILTAGHGG 674 Query: 738 VKFWDIRDPFRPLWDVNPVQRVIYGVDWLSDPRCVVLSFDDGEIRIISLSKAACDVPVTG 917 +KFWDIRDPF PLWDV+P + IY +DWL +PRCV+LSFDDG ++++SLS+A DVPVTG Sbjct: 675 LKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKLLSLSQAVSDVPVTG 734 Query: 918 KPFVGTQLQGLNSYHCXXXXXXXXXXXRLTGMVAYCCSDGRVLRFQLTTKSVEKDPRRNR 1097 KPF GT+ QGL+ Y+C RLTGMVAYC +DG V FQLT+K+V+KD RNR Sbjct: 735 KPFTGTKQQGLHLYNCSSFAIWHIQVSRLTGMVAYCGADGTVSHFQLTSKAVDKDFSRNR 794 Query: 1098 EPHYLCGALTTEGSTLNVFSPLPN----------ELGETPRSRRGHLTISNQEKRAREQG 1247 PH+LCG+LT E S + + 
+PLP+ + GE PRS R LT +NQ K A+++ Sbjct: 795 APHFLCGSLTEEESAIIINTPLPDIPLTMKKSTGDYGEGPRSMRAFLTETNQAKNAKDKK 854 Query: 1248 LKSQTPEDQAIPQAREQGLRCYTPEEQPLALCYNN----DPGVDSGSDDTMVAQXXXXXX 1415 K QT C ++Q LALCY + DPGV+S S++T+ A Sbjct: 855 AKVQT---------------C---DKQTLALCYGDDPDPDPGVESDSEETLAALKCKKKQ 896 Query: 1416 XXXXXXXXXXAHAKQAFTSTDEVPDV---ETINEKEVFPSKMVAMHRVRWNMNKGSERWL 1586 + + +E + ET NE EVFP KMVAMHRVRWNMNKGSERWL Sbjct: 897 KSQSERNKKADNDQALAIRIEEATNTQKEETGNEIEVFPGKMVAMHRVRWNMNKGSERWL 956 Query: 1587 CYAGAAGIVRCQEMSL 1634 CY GAAGIVRCQE+ + Sbjct: 957 CYGGAAGIVRCQEIKV 972 >gb|EOX93901.1| DNA binding protein, putative isoform 1 [Theobroma cacao] Length = 868 Score = 558 bits (1438), Expect = 0.0 Identities = 281/507 (55%), Positives = 348/507 (68%), Gaps = 18/507 (3%) Frame = +3 Query: 162 SSATGYSVPLETDPDS--CDISEDVSLPRLVMCLAHNGKVAWDIKWRPFD-TCANHKHRM 332 S G S+P + ++ I D+ LPR V+CLAHNGKVAWD+KW+P+D RM Sbjct: 367 SETPGSSIPRDNSSETPGSSIPRDIELPRTVLCLAHNGKVAWDVKWQPYDINDCECNQRM 426 Query: 333 GYLGVLLGSGALEVWEVPLPHAAEAMFSACQKDGTDPRFIKLKPVFRCAMLKCGDRQSIP 512 GYL VLLG+G+LEVWEVPLPH ++S+ K GTDPRF+KL+PVF+C+ LKCGD QSIP Sbjct: 427 GYLAVLLGNGSLEVWEVPLPHMISIVYSSSPKQGTDPRFVKLEPVFKCSKLKCGDVQSIP 486 Query: 513 LTMEWSTASPHDLILAGCHDGMVALWKFSANSPLKDTRPLIFFSADILPIRALAWAPVAS 692 LT+EWST+ PH+ +LAGCHDGMVALWKFSA+ DTRPL+ FSAD +PIR++AWAP S Sbjct: 487 LTVEWSTSPPHNYLLAGCHDGMVALWKFSASGSPTDTRPLLCFSADTVPIRSVAWAPSGS 546 Query: 693 DFESANIIVTAGHKGVKFWDIRDPFRPLWDVNPVQRVIYGVDWLSDPRCVVLSFDDGEIR 872 D ESAN+++TAGH G+KFWDIRDPF PLWDV+P + IY +DWL +PRCV+LSFDDG ++ Sbjct: 547 DMESANVVLTAGHGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMK 606 Query: 873 IISLSKAACDVPVTGKPFVGTQLQGLNSYHCXXXXXXXXXXXRLTGMVAYCCSDGRVLRF 1052 ++SL +AACDVPVTGKPF GT+ QGL+ Y+C RLTGMVAYC +DG V RF Sbjct: 607 MLSLIQAACDVPVTGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGNVTRF 666 Query: 1053 QLTTKSVEKDPRRNREPHYLCGALTTEGSTLNVFSPLP----------NELGETPRSRRG 1202 QLT+K+V+KD RNR PH++CG+LT E S + V +PLP N+ GE PRS R Sbjct: 667 
QLTSKAVDKDFSRNRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGEGPRSMRA 726 Query: 1203 HLTISNQEKRAREQGLKSQTPEDQAIPQAREQGLRCYTPEEQPLALCYNNDPGVDSGSDD 1382 LT SNQ K A++ K++ P TP++Q LALCY NDPGV+S S++ Sbjct: 727 FLTESNQAKNAKDN--KAKVP----------------TPDKQTLALCYGNDPGVESESEE 768 Query: 1383 TM-VAQXXXXXXXXXXXXXXXXAHAKQAFTSTDEVP----DVETINEKEVFPSKMVAMHR 1547 T+ +A A QA P E NE EVFP K+VAMHR Sbjct: 769 TLTLAALKGKIKQKSKSDRMKKAGDDQALAVRINEPANTQKEEAGNEIEVFPPKIVAMHR 828 Query: 1548 VRWNMNKGSERWLCYAGAAGIVRCQEM 1628 VRWNMNKGSERWLCY GAAGIVRCQE+ Sbjct: 829 VRWNMNKGSERWLCYGGAAGIVRCQEI 855 >ref|XP_017969458.1| PREDICTED: uncharacterized protein LOC18612763 isoform X1 [Theobroma cacao] Length = 877 Score = 555 bits (1431), Expect = 0.0 Identities = 279/507 (55%), Positives = 349/507 (68%), Gaps = 18/507 (3%) Frame = +3 Query: 162 SSATGYSVPLETDPDS--CDISEDVSLPRLVMCLAHNGKVAWDIKWRPFD-TCANHKHRM 332 S G S+P + ++ I D+ LPR V+CLAHNGKVAWD+KW+P+D RM Sbjct: 376 SETPGSSIPRDNSSETPGSSIPRDIELPRTVLCLAHNGKVAWDVKWQPYDINDCECNQRM 435 Query: 333 GYLGVLLGSGALEVWEVPLPHAAEAMFSACQKDGTDPRFIKLKPVFRCAMLKCGDRQSIP 512 GYL VLLG+G+LEVWEVPLPH ++S+ K GTDPRF+KL+PVF+C+ LKCGD QSIP Sbjct: 436 GYLAVLLGNGSLEVWEVPLPHMISIVYSSSPKQGTDPRFVKLEPVFKCSKLKCGDVQSIP 495 Query: 513 LTMEWSTASPHDLILAGCHDGMVALWKFSANSPLKDTRPLIFFSADILPIRALAWAPVAS 692 LT+EWST+ P++ +LAGCHDGMVALWKFSA+ DTRPL+ FSAD +PIR++AWAP S Sbjct: 496 LTVEWSTSPPYNYLLAGCHDGMVALWKFSASGSPTDTRPLLCFSADTVPIRSVAWAPSGS 555 Query: 693 DFESANIIVTAGHKGVKFWDIRDPFRPLWDVNPVQRVIYGVDWLSDPRCVVLSFDDGEIR 872 D ESAN+++TAGH G+KFWDIRDPF PLWDV+P + IY +DWL +PRCV+LSFDDG ++ Sbjct: 556 DMESANVVLTAGHGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMK 615 Query: 873 IISLSKAACDVPVTGKPFVGTQLQGLNSYHCXXXXXXXXXXXRLTGMVAYCCSDGRVLRF 1052 ++SL +AACDVPVTGKPF GT+ QGL+ Y+C RLTGMVAYC +DG V RF Sbjct: 616 MLSLIQAACDVPVTGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGNVTRF 675 Query: 1053 QLTTKSVEKDPRRNREPHYLCGALTTEGSTLNVFSPLP----------NELGETPRSRRG 1202 QLT+K+V+KD RNR PH++CG+LT E S + V +PLP N+ GE+PRS R Sbjct: 676 
QLTSKAVDKDFSRNRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGESPRSMRA 735 Query: 1203 HLTISNQEKRAREQGLKSQTPEDQAIPQAREQGLRCYTPEEQPLALCYNNDPGVDSGSDD 1382 LT SNQ K A++ K++ P TP+++ LALCY NDPGV+S S++ Sbjct: 736 FLTESNQAKNAKDN--KAKVP----------------TPDKRTLALCYGNDPGVESESEE 777 Query: 1383 TM-VAQXXXXXXXXXXXXXXXXAHAKQAFTSTDEVP----DVETINEKEVFPSKMVAMHR 1547 T+ +A A QA P E NE EVFP K+VAMHR Sbjct: 778 TLTLAALKGKIKQKSKSDRMKKAGDDQALAVRINEPTNTQKEEAGNEIEVFPPKIVAMHR 837 Query: 1548 VRWNMNKGSERWLCYAGAAGIVRCQEM 1628 VRWNMNKGSERWLCY GAAGIVRCQE+ Sbjct: 838 VRWNMNKGSERWLCYGGAAGIVRCQEI 864 >ref|XP_022846709.1| uncharacterized protein LOC111369428 isoform X3 [Olea europaea var. sylvestris] Length = 902 Score = 555 bits (1430), Expect = 0.0 Identities = 283/494 (57%), Positives = 339/494 (68%), Gaps = 21/494 (4%) Frame = +3 Query: 216 ISEDVSLPRLVMCLAHNGKVAWDIKWRPFDTCANHK-HRMGYLGVLLGSGALEVWEVPLP 392 IS DV+LPR+++CLAHNGKVAWD+KWRPF + HRMGYL VLLG+GALEVWEVP P Sbjct: 425 ISNDVALPRMMLCLAHNGKVAWDVKWRPFSARDSESMHRMGYLAVLLGNGALEVWEVPFP 484 Query: 393 HAAEAMFSACQKDGTDPRFIKLKPVFRCAMLKCGDRQSIPLTMEWSTASPHDLILAGCHD 572 E ++ ACQK+GTDPRFIKL PVFRC+ LKCGDRQSIPLT+EWST++ HD+ILAGCHD Sbjct: 485 RTVELIYPACQKEGTDPRFIKLAPVFRCSTLKCGDRQSIPLTVEWSTSALHDMILAGCHD 544 Query: 573 GMVALWKFSANSPLKDTRPLIFFSADILPIRALAWAPVASDFESANIIVTAGHKGVKFWD 752 G+VALWKFS DTRPL+ FSAD +PIRALAWAPV SD E AN+IVTAGHKG+KFWD Sbjct: 545 GVVALWKFSTTGSSTDTRPLLCFSADTVPIRALAWAPVQSDSEGANVIVTAGHKGLKFWD 604 Query: 753 IRDPFRPLWDVNPVQRVIYGVDWLSDPRCVVLSFDDGEIRIISLSKAACDVPVTGKPFVG 932 IRDPF PLWD+NPVQ VIY +DWL DPRC++ S DDG I ++SL KAA D+P+TGKPF G Sbjct: 605 IRDPFHPLWDLNPVQGVIYSLDWLPDPRCIIGSGDDGSIWVLSLVKAAHDIPITGKPFSG 664 Query: 933 TQLQGLNSYHCXXXXXXXXXXXRLTGMVAYCCSDGRVLRFQLTTKSVEKDPRRNREPHYL 1112 TQ QGL+SYHC RLTGMVAYC DG LRFQLTT++VEKDP RNR H+L Sbjct: 665 TQRQGLHSYHCASFPIWSIQVSRLTGMVAYCGEDGTALRFQLTTRAVEKDPSRNRPLHFL 724 Query: 1113 CGALTTEGSTLNVFSPLPN-------ELG----ETPRSRRGHLTISNQEKRAREQGLKSQ 1259 G+L E STL V SPL N LG + PR + G L + NQE+ +EQ Sbjct: 725 
SGSLMEEESTLTVVSPLTNTPVPMKKSLGVSADDAPRIKSGPLALLNQEEVEKEQ----- 779 Query: 1260 TPEDQAIPQAREQGLRCYTPEEQPLALCYNNDPGVD-SGSDDTMVAQXXXXXXXXXXXXX 1436 AI C +E+ LALCY NDPG + ++ ++AQ Sbjct: 780 ----TAI---------CQASDEKTLALCYGNDPGTELEPNNPCIIAQQSKPAAKSKKSDK 826 Query: 1437 XXXAHAKQAFTSTDEVPDV--------ETINEKEVFPSKMVAMHRVRWNMNKGSERWLCY 1592 QA S D++ + + +E EVFP K+VAMH+VRWNMNKGSERWLCY Sbjct: 827 NKLKADPQAQISGDDMGNFPGERSDKGQMRHEIEVFPPKIVAMHKVRWNMNKGSERWLCY 886 Query: 1593 AGAAGIVRCQEMSL 1634 GAAG++RCQE+ + Sbjct: 887 GGAAGLIRCQEVDM 900 >gb|OMO95816.1| hypothetical protein COLO4_15658 [Corchorus olitorius] Length = 1008 Score = 556 bits (1434), Expect = 0.0 Identities = 277/498 (55%), Positives = 347/498 (69%), Gaps = 16/498 (3%) Frame = +3 Query: 189 LETDP-DSCDISEDVSLPRLVMCLAHNGKVAWDIKWRPFD-TCANHKHRMGYLGVLLGSG 362 LE P S I D++LPR V+CLAHNGKVAWD+KWRP+D + RMGYL VLLG+G Sbjct: 518 LEVGPGSSSSIPADMALPRGVLCLAHNGKVAWDVKWRPYDINISKCNQRMGYLAVLLGNG 577 Query: 363 ALEVWEVPLPHAAEAMFSACQKDGTDPRFIKLKPVFRCAMLKCGDRQSIPLTMEWSTASP 542 +LEVWEVPLPH ++S+ K GTDPRF+KL+PVF+C+ LKCGD QSIPLT+EWST+ P Sbjct: 578 SLEVWEVPLPHMIRTVYSSSAKQGTDPRFVKLEPVFKCSKLKCGDIQSIPLTVEWSTSPP 637 Query: 543 HDLILAGCHDGMVALWKFSANSPLKDTRPLIFFSADILPIRALAWAPVASDFESANIIVT 722 HD +LAGCHDGMVALWKFSA++ KDTRPL+ FSAD +PIR++AWAP SD ES N+I+T Sbjct: 638 HDYLLAGCHDGMVALWKFSASASPKDTRPLLCFSADTVPIRSVAWAPSGSDMESTNVILT 697 Query: 723 AGHKGVKFWDIRDPFRPLWDVNPVQRVIYGVDWLSDPRCVVLSFDDGEIRIISLSKAACD 902 AGH G+KFWDIRDPF PLWDV+P + IY +DWL +PRCV+LSFDDG ++++SLS+A D Sbjct: 698 AGHGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKLLSLSQAVSD 757 Query: 903 VPVTGKPFVGTQLQGLNSYHCXXXXXXXXXXXRLTGMVAYCCSDGRVLRFQLTTKSVEKD 1082 VPVTGKPF GT+ QGL+ Y+C RLTGMVAYC +DG V FQLT+K+V+KD Sbjct: 758 VPVTGKPFTGTKQQGLHLYNCSSFAIWNIQVSRLTGMVAYCGADGTVSHFQLTSKAVDKD 817 Query: 1083 PRRNREPHYLCGALTTEGSTLNVFSPLP----------NELGETPRSRRGHLTISNQEKR 1232 RNR PH++CG+L E S + + +PLP ++ GE PRS R LT +NQ K Sbjct: 818 FSRNRAPHFVCGSLIEEESVITINTPLPDIPLTMKKSTSDYGEGPRSMRAFLTETNQAKN 877 Query: 1233 
AREQGLKSQTPEDQAIPQAREQGLRCYTPEEQPLALCYNNDPGVDSGSDDTMVA----QX 1400 A+++ K Q T ++Q LALCY +DPGV+S S++T+ A + Sbjct: 878 AKDKKAKVQ------------------TSDKQTLALCYGDDPGVESDSEETLAALKCKKK 919 Query: 1401 XXXXXXXXXXXXXXXAHAKQAFTSTDEVPDVETINEKEVFPSKMVAMHRVRWNMNKGSER 1580 A A + +T+ ET NE EVFP+KMVAMHRVRWNMNKGSER Sbjct: 920 QNSQSERNKKADNDQALAIRIEEATNNTQKEETGNEIEVFPAKMVAMHRVRWNMNKGSER 979 Query: 1581 WLCYAGAAGIVRCQEMSL 1634 WLCY GAAGIVRCQE+ + Sbjct: 980 WLCYGGAAGIVRCQEIKV 997 >ref|XP_017969459.1| PREDICTED: uncharacterized protein LOC18612763 isoform X2 [Theobroma cacao] ref|XP_017969460.1| PREDICTED: uncharacterized protein LOC18612763 isoform X2 [Theobroma cacao] Length = 869 Score = 551 bits (1419), Expect = 0.0 Identities = 279/508 (54%), Positives = 349/508 (68%), Gaps = 19/508 (3%) Frame = +3 Query: 162 SSATGYSVPLETDPDS--CDISEDVSLPRLVMCLAHNGKVAWDIKWRPFD-TCANHKHRM 332 S G S+P + ++ I D+ LPR V+CLAHNGKVAWD+KW+P+D RM Sbjct: 367 SETPGSSIPRDNSSETPGSSIPRDIELPRTVLCLAHNGKVAWDVKWQPYDINDCECNQRM 426 Query: 333 GYLGVLLGSGALEV-WEVPLPHAAEAMFSACQKDGTDPRFIKLKPVFRCAMLKCGDRQSI 509 GYL VLLG+G+LEV WEVPLPH ++S+ K GTDPRF+KL+PVF+C+ LKCGD QSI Sbjct: 427 GYLAVLLGNGSLEVRWEVPLPHMISIVYSSSPKQGTDPRFVKLEPVFKCSKLKCGDVQSI 486 Query: 510 PLTMEWSTASPHDLILAGCHDGMVALWKFSANSPLKDTRPLIFFSADILPIRALAWAPVA 689 PLT+EWST+ P++ +LAGCHDGMVALWKFSA+ DTRPL+ FSAD +PIR++AWAP Sbjct: 487 PLTVEWSTSPPYNYLLAGCHDGMVALWKFSASGSPTDTRPLLCFSADTVPIRSVAWAPSG 546 Query: 690 SDFESANIIVTAGHKGVKFWDIRDPFRPLWDVNPVQRVIYGVDWLSDPRCVVLSFDDGEI 869 SD ESAN+++TAGH G+KFWDIRDPF PLWDV+P + IY +DWL +PRCV+LSFDDG + Sbjct: 547 SDMESANVVLTAGHGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTM 606 Query: 870 RIISLSKAACDVPVTGKPFVGTQLQGLNSYHCXXXXXXXXXXXRLTGMVAYCCSDGRVLR 1049 +++SL +AACDVPVTGKPF GT+ QGL+ Y+C RLTGMVAYC +DG V R Sbjct: 607 KMLSLIQAACDVPVTGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGNVTR 666 Query: 1050 FQLTTKSVEKDPRRNREPHYLCGALTTEGSTLNVFSPLP----------NELGETPRSRR 1199 FQLT+K+V+KD RNR PH++CG+LT E S + V +PLP N+ GE+PRS R Sbjct: 667 
FQLTSKAVDKDFSRNRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGESPRSMR 726 Query: 1200 GHLTISNQEKRAREQGLKSQTPEDQAIPQAREQGLRCYTPEEQPLALCYNNDPGVDSGSD 1379 LT SNQ K A++ K++ P TP+++ LALCY NDPGV+S S+ Sbjct: 727 AFLTESNQAKNAKDN--KAKVP----------------TPDKRTLALCYGNDPGVESESE 768 Query: 1380 DTM-VAQXXXXXXXXXXXXXXXXAHAKQAFTSTDEVP----DVETINEKEVFPSKMVAMH 1544 +T+ +A A QA P E NE EVFP K+VAMH Sbjct: 769 ETLTLAALKGKIKQKSKSDRMKKAGDDQALAVRINEPTNTQKEEAGNEIEVFPPKIVAMH 828 Query: 1545 RVRWNMNKGSERWLCYAGAAGIVRCQEM 1628 RVRWNMNKGSERWLCY GAAGIVRCQE+ Sbjct: 829 RVRWNMNKGSERWLCYGGAAGIVRCQEI 856 >ref|XP_017969462.1| PREDICTED: uncharacterized protein LOC18612763 isoform X4 [Theobroma cacao] Length = 878 Score = 551 bits (1419), Expect = 0.0 Identities = 279/508 (54%), Positives = 349/508 (68%), Gaps = 19/508 (3%) Frame = +3 Query: 162 SSATGYSVPLETDPDS--CDISEDVSLPRLVMCLAHNGKVAWDIKWRPFD-TCANHKHRM 332 S G S+P + ++ I D+ LPR V+CLAHNGKVAWD+KW+P+D RM Sbjct: 376 SETPGSSIPRDNSSETPGSSIPRDIELPRTVLCLAHNGKVAWDVKWQPYDINDCECNQRM 435 Query: 333 GYLGVLLGSGALEV-WEVPLPHAAEAMFSACQKDGTDPRFIKLKPVFRCAMLKCGDRQSI 509 GYL VLLG+G+LEV WEVPLPH ++S+ K GTDPRF+KL+PVF+C+ LKCGD QSI Sbjct: 436 GYLAVLLGNGSLEVRWEVPLPHMISIVYSSSPKQGTDPRFVKLEPVFKCSKLKCGDVQSI 495 Query: 510 PLTMEWSTASPHDLILAGCHDGMVALWKFSANSPLKDTRPLIFFSADILPIRALAWAPVA 689 PLT+EWST+ P++ +LAGCHDGMVALWKFSA+ DTRPL+ FSAD +PIR++AWAP Sbjct: 496 PLTVEWSTSPPYNYLLAGCHDGMVALWKFSASGSPTDTRPLLCFSADTVPIRSVAWAPSG 555 Query: 690 SDFESANIIVTAGHKGVKFWDIRDPFRPLWDVNPVQRVIYGVDWLSDPRCVVLSFDDGEI 869 SD ESAN+++TAGH G+KFWDIRDPF PLWDV+P + IY +DWL +PRCV+LSFDDG + Sbjct: 556 SDMESANVVLTAGHGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTM 615 Query: 870 RIISLSKAACDVPVTGKPFVGTQLQGLNSYHCXXXXXXXXXXXRLTGMVAYCCSDGRVLR 1049 +++SL +AACDVPVTGKPF GT+ QGL+ Y+C RLTGMVAYC +DG V R Sbjct: 616 KMLSLIQAACDVPVTGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGNVTR 675 Query: 1050 FQLTTKSVEKDPRRNREPHYLCGALTTEGSTLNVFSPLP----------NELGETPRSRR 1199 FQLT+K+V+KD RNR PH++CG+LT E S + V +PLP N+ GE+PRS R Sbjct: 676 
FQLTSKAVDKDFSRNRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGESPRSMR 735 Query: 1200 GHLTISNQEKRAREQGLKSQTPEDQAIPQAREQGLRCYTPEEQPLALCYNNDPGVDSGSD 1379 LT SNQ K A++ K++ P TP+++ LALCY NDPGV+S S+ Sbjct: 736 AFLTESNQAKNAKDN--KAKVP----------------TPDKRTLALCYGNDPGVESESE 777 Query: 1380 DTM-VAQXXXXXXXXXXXXXXXXAHAKQAFTSTDEVP----DVETINEKEVFPSKMVAMH 1544 +T+ +A A QA P E NE EVFP K+VAMH Sbjct: 778 ETLTLAALKGKIKQKSKSDRMKKAGDDQALAVRINEPTNTQKEEAGNEIEVFPPKIVAMH 837 Query: 1545 RVRWNMNKGSERWLCYAGAAGIVRCQEM 1628 RVRWNMNKGSERWLCY GAAGIVRCQE+ Sbjct: 838 RVRWNMNKGSERWLCYGGAAGIVRCQEI 865 >gb|OTG38382.1| putative transducin/WD40 repeat-like superfamily protein [Helianthus annuus] Length = 922 Score = 551 bits (1421), Expect = 0.0 Identities = 304/545 (55%), Positives = 358/545 (65%), Gaps = 19/545 (3%) Frame = +3 Query: 51 SRKRKGYEKQTSVKSALT-SKSRLAKCKLSSESQN---PMGSSATGYSVPLETDPDSCDI 218 +R+ K + K S KS T SKS+ + S+S N S A G P+ + I Sbjct: 413 TRRPKKHRKDQSPKSTRTPSKSKSTRTPSESKSTNCQTESESQAKGVDPPIISQSRESPI 472 Query: 219 SEDVSLPRLVMCLAHNGKVAWDIKWRPFDTCANHKHRMGYLGVLLGSGALEVWEVPLPHA 398 ED LPRLVM LAHNGKVAWD+KWRP D KH MGYL VLLG+GALEVWEVPLP Sbjct: 473 DEDDVLPRLVMGLAHNGKVAWDVKWRPSDFRHVSKHVMGYLAVLLGNGALEVWEVPLPRV 532 Query: 399 AEAMFSACQKDGTDPRFIKLKPVFRCAMLKCGDRQSIPLTMEWSTASPHDLILAGCHDGM 578 +A+FS CQK+G DPRF+KLKPVF C+MLKCGDRQSIPLT+EWST++PHDLILAGCHDG+ Sbjct: 533 TKAIFS-CQKEGKDPRFLKLKPVFLCSMLKCGDRQSIPLTLEWSTSAPHDLILAGCHDGV 591 Query: 579 VALWKFSANSPLKDTRPLIFFSADILPIRALAWAPVASDFESANIIVTAGHKGVKFWDIR 758 VALWKFSA+S KDTRPL+ FSAD +PIRAL WAP+ASD ESANII T HKG+KFWDIR Sbjct: 592 VALWKFSASSSSKDTRPLLCFSADTVPIRALTWAPLASDPESANIIATGSHKGLKFWDIR 651 Query: 759 DPFRPLWDVNPVQRVIYGVDWLSDPRCVVLSFDDGEIRIISLSKAACDVPVTGKPFVGTQ 938 DPF PLWD+ P Q+V+ ++WL DPRCVVLSFDDGEI+IISL KAA DVPVTG P Sbjct: 652 DPFHPLWDI-PYQKVVNSLEWLPDPRCVVLSFDDGEIKIISLLKAASDVPVTGMPCDKKP 710 Query: 939 LQGLNSYHCXXXXXXXXXXXRLTGMVAYCCSDGRVLRFQLTTKSVEKDPRRNREPHYLCG 1118 L G SY+C LTGMVAYCCSDG+V+ FQLT K+VEKDP RNREPHYLCG Sbjct: 711 
LHGSYSYYCSSSSIWSVQVSSLTGMVAYCCSDGKVINFQLTIKAVEKDPHRNREPHYLCG 770 Query: 1119 ALTTE-GSTLNVFSPLPN----------ELGETPRSRRGHLTISNQEKRAREQGLKSQTP 1265 +LT E S+L V SPLP+ E G+TP++ RG + SNQEKRA+ Q K QTP Sbjct: 771 SLTEEEDSSLTVLSPLPDVPIPMKRSSTEWGDTPKTSRGVKSRSNQEKRAKGQISKFQTP 830 Query: 1266 EDQAIPQAREQGLRCYTPEEQPLALCYNNDPGVDSGSDDTMVAQXXXXXXXXXXXXXXXX 1445 P+ T + NND ++ D+ Sbjct: 831 -----PKPSSGDPLVNTQKN-------NNDTSREAQIDN--------------------- 857 Query: 1446 AHAKQAFTSTDEVPDVETINEKE----VFPSKMVAMHRVRWNMNKGSERWLCYAGAAGIV 1613 QA D+ VE N KE V+P K+VAMHRVRWNMNKGSER +CY GAAGI+ Sbjct: 858 -QTSQALVCIDDDNKVEETNVKEEETDVYPPKIVAMHRVRWNMNKGSERLVCYGGAAGIL 916 Query: 1614 RCQEM 1628 RCQ++ Sbjct: 917 RCQKI 921 >ref|XP_022846707.1| uncharacterized protein LOC111369428 isoform X1 [Olea europaea var. sylvestris] ref|XP_022846708.1| uncharacterized protein LOC111369428 isoform X2 [Olea europaea var. sylvestris] Length = 1056 Score = 555 bits (1430), Expect = 0.0 Identities = 283/494 (57%), Positives = 339/494 (68%), Gaps = 21/494 (4%) Frame = +3 Query: 216 ISEDVSLPRLVMCLAHNGKVAWDIKWRPFDTCANHK-HRMGYLGVLLGSGALEVWEVPLP 392 IS DV+LPR+++CLAHNGKVAWD+KWRPF + HRMGYL VLLG+GALEVWEVP P Sbjct: 579 ISNDVALPRMMLCLAHNGKVAWDVKWRPFSARDSESMHRMGYLAVLLGNGALEVWEVPFP 638 Query: 393 HAAEAMFSACQKDGTDPRFIKLKPVFRCAMLKCGDRQSIPLTMEWSTASPHDLILAGCHD 572 E ++ ACQK+GTDPRFIKL PVFRC+ LKCGDRQSIPLT+EWST++ HD+ILAGCHD Sbjct: 639 RTVELIYPACQKEGTDPRFIKLAPVFRCSTLKCGDRQSIPLTVEWSTSALHDMILAGCHD 698 Query: 573 GMVALWKFSANSPLKDTRPLIFFSADILPIRALAWAPVASDFESANIIVTAGHKGVKFWD 752 G+VALWKFS DTRPL+ FSAD +PIRALAWAPV SD E AN+IVTAGHKG+KFWD Sbjct: 699 GVVALWKFSTTGSSTDTRPLLCFSADTVPIRALAWAPVQSDSEGANVIVTAGHKGLKFWD 758 Query: 753 IRDPFRPLWDVNPVQRVIYGVDWLSDPRCVVLSFDDGEIRIISLSKAACDVPVTGKPFVG 932 IRDPF PLWD+NPVQ VIY +DWL DPRC++ S DDG I ++SL KAA D+P+TGKPF G Sbjct: 759 IRDPFHPLWDLNPVQGVIYSLDWLPDPRCIIGSGDDGSIWVLSLVKAAHDIPITGKPFSG 818 Query: 933 TQLQGLNSYHCXXXXXXXXXXXRLTGMVAYCCSDGRVLRFQLTTKSVEKDPRRNREPHYL 1112 TQ QGL+SYHC RLTGMVAYC DG LRFQLTT++VEKDP RNR H+L Sbjct: 
819 TQRQGLHSYHCASFPIWSIQVSRLTGMVAYCGEDGTALRFQLTTRAVEKDPSRNRPLHFL 878 Query: 1113 CGALTTEGSTLNVFSPLPN-------ELG----ETPRSRRGHLTISNQEKRAREQGLKSQ 1259 G+L E STL V SPL N LG + PR + G L + NQE+ +EQ Sbjct: 879 SGSLMEEESTLTVVSPLTNTPVPMKKSLGVSADDAPRIKSGPLALLNQEEVEKEQ----- 933 Query: 1260 TPEDQAIPQAREQGLRCYTPEEQPLALCYNNDPGVD-SGSDDTMVAQXXXXXXXXXXXXX 1436 AI C +E+ LALCY NDPG + ++ ++AQ Sbjct: 934 ----TAI---------CQASDEKTLALCYGNDPGTELEPNNPCIIAQQSKPAAKSKKSDK 980 Query: 1437 XXXAHAKQAFTSTDEVPDV--------ETINEKEVFPSKMVAMHRVRWNMNKGSERWLCY 1592 QA S D++ + + +E EVFP K+VAMH+VRWNMNKGSERWLCY Sbjct: 981 NKLKADPQAQISGDDMGNFPGERSDKGQMRHEIEVFPPKIVAMHKVRWNMNKGSERWLCY 1040 Query: 1593 AGAAGIVRCQEMSL 1634 GAAG++RCQE+ + Sbjct: 1041 GGAAGLIRCQEVDM 1054 >ref|XP_022765155.1| uncharacterized protein LOC111310200 isoform X7 [Durio zibethinus] ref|XP_022765156.1| uncharacterized protein LOC111310200 isoform X7 [Durio zibethinus] ref|XP_022765157.1| uncharacterized protein LOC111310200 isoform X7 [Durio zibethinus] Length = 646 Score = 541 bits (1394), Expect = 0.0 Identities = 277/494 (56%), Positives = 336/494 (68%), Gaps = 15/494 (3%) Frame = +3 Query: 192 ETDPDSCDISEDVSLPRLVMCLAHNGKVAWDIKWRPFD-TCANHKHRMGYLGVLLGSGAL 368 E P S I D++LPR V+CLAHNGKVAWD+KW+P+D + RMGYL VLLG+G+L Sbjct: 162 EVSPGS-SIPGDIALPRAVLCLAHNGKVAWDVKWKPYDINDSKFNQRMGYLAVLLGNGSL 220 Query: 369 EVWEVPLPHAAEAMFSACQKDGTDPRFIKLKPVFRCAMLKCGDRQSIPLTMEWSTASPHD 548 EVWEVPLP+ ++S K GTDPRF+KL+PV +C+ LKCGD QSIPLT+EWST+SPHD Sbjct: 221 EVWEVPLPNMIRNVYSLSPKQGTDPRFVKLEPVLKCSKLKCGDIQSIPLTVEWSTSSPHD 280 Query: 549 LILAGCHDGMVALWKFSANSPLKDTRPLIFFSADILPIRALAWAPVASDFESANIIVTAG 728 +LAGCHDGMVALWKFSA+ KDTRPL+ FSAD +PIR++AWAP SD ES N+I+TAG Sbjct: 281 YLLAGCHDGMVALWKFSASDTPKDTRPLLCFSADSVPIRSVAWAPSGSDMESRNVILTAG 340 Query: 729 HKGVKFWDIRDPFRPLWDVNPVQRVIYGVDWLSDPRCVVLSFDDGEIRIISLSKAACDVP 908 H G+KFWDIRDPF PLWDV+P + IY +DWL +PRCV+LSFDDG ++++SL AACDVP Sbjct: 341 HGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKMLSLVHAACDVP 400 Query: 909 
VTGKPFVGTQLQGLNSYHCXXXXXXXXXXXRLTGMVAYCCSDGRVLRFQLTTKSVEKDPR 1088 VTGKPF GT+ QGL+ Y+C RLTGMVAYC +DG V RFQLT+K+V+KD Sbjct: 401 VTGKPFTGTKQQGLHLYNCSSFAIWSVQVSRLTGMVAYCGADGTVTRFQLTSKAVDKDFS 460 Query: 1089 RNREPHYLCGALTTEGSTLNVFSPLP----------NELGETPRSRRGHLTISNQEKRAR 1238 R+R PH+ CG+LT E S + V +PLP N+ GE PRS R LT SNQ K A+ Sbjct: 461 RHRTPHFPCGSLTEEESAVIVNTPLPDIPVTLKKPSNDYGEGPRSMRAFLTESNQGKNAK 520 Query: 1239 EQGLKSQTPEDQAIPQAREQGLRCYTPEEQPLALCYNNDPGVDSGSDDTMVAQXXXXXXX 1418 ++ K +Q LAL Y NDPGV+S S++T+ A Sbjct: 521 DRKAK------------------VAASNKQTLALYYGNDPGVESESEETLAA---LQSKR 559 Query: 1419 XXXXXXXXXAHAKQAFTSTDEVP----DVETINEKEVFPSKMVAMHRVRWNMNKGSERWL 1586 A Q E P ET NE EVFP K+VAMHRVRWNMNKGSERWL Sbjct: 560 KQKSNGKKKADDDQVLAIRIEEPTNTQKEETGNEIEVFPPKIVAMHRVRWNMNKGSERWL 619 Query: 1587 CYAGAAGIVRCQEM 1628 CY GAAGIVRCQE+ Sbjct: 620 CYGGAAGIVRCQEI 633 >ref|XP_023920540.1| uncharacterized protein LOC112032069 isoform X2 [Quercus suber] Length = 733 Score = 544 bits (1402), Expect = 0.0 Identities = 271/504 (53%), Positives = 347/504 (68%), Gaps = 8/504 (1%) Frame = +3 Query: 147 QNPMGSSATGYSVPLETDPDSCDISEDVSLPRLVMCLAHNGKVAWDIKWRPFDTCANH-K 323 Q P S L T S IS+DV+LPR+V+CLAHNGKVAWD+KWRP + C + K Sbjct: 227 QEPAASDNVSDKGSLGTSSASSLISKDVALPRVVLCLAHNGKVAWDVKWRPSNACQSKCK 286 Query: 324 HRMGYLGVLLGSGALEVWEVPLPHAAEAMFSACQKDGTDPRFIKLKPVFRCAMLKCGDRQ 503 HRMGYL VLLG+G+LEVWEVPLP + ++S+ ++GTDPRF+KL+PVFR ++LKCG Q Sbjct: 287 HRMGYLAVLLGNGSLEVWEVPLPRTMKVIYSSVHQEGTDPRFVKLEPVFRGSLLKCGGIQ 346 Query: 504 SIPLTMEWSTASPHDLILAGCHDGMVALWKFSANSPLKDTRPLIFFSADILPIRALAWAP 683 SIPLT+EWS + PHD +LAGCHDG VALWKFSA+ +DTRPL+ FSAD +PIRALAWAP Sbjct: 347 SIPLTVEWSASPPHDYLLAGCHDGTVALWKFSASCSSEDTRPLLCFSADTVPIRALAWAP 406 Query: 684 VASDFESANIIVTAGHKGVKFWDIRDPFRPLWDVNPVQRVIYGVDWLSDPRCVVLSFDDG 863 + SD ESAN+IVTAGH G+KFWD+RDP+RPLWD++PV R+IY +DWLS+PRCV+LSFDDG Sbjct: 407 LESDPESANVIVTAGHGGLKFWDLRDPYRPLWDLHPVPRIIYSLDWLSNPRCVILSFDDG 466 Query: 864 EIRIISLSKAACDVPVTGKPFVGTQLQGLNSYHCXXXXXXXXXXXRLTGMVAYCCSDGRV 1043 +RI+SL KAA 
DVPVTGKPF GT+ QGL+SY+C R+TGM AYC +DG V Sbjct: 467 TMRILSLLKAAYDVPVTGKPFGGTKQQGLHSYYCSSFAIWSVQVSRITGMAAYCTADGTV 526 Query: 1044 LRFQLTTKSVEKDPRRNREPHYLCGALTTEGSTLNVFSPLPNELGETPRSRRGHLTISNQ 1223 LRFQLT+K+V+KDP RNR PH+LCG+LT E S + + +P+PN TP + L Sbjct: 527 LRFQLTSKAVDKDPSRNRTPHFLCGSLTEEESLITINTPVPN----TPFPLKKSLNKGGD 582 Query: 1224 EKRAREQGLKSQTPEDQAIPQAREQGLRCYTPEEQPLALCYNNDPGVDSGSDDTMVAQXX 1403 + ++ + E Q + +A ++ + + + LALCY +DPG +SG+++ + Sbjct: 583 TPLS----MREFSSEPQHVKRANDKMAKSPSTDATTLALCYGDDPGTESGTEEALTRPKS 638 Query: 1404 XXXXXXXXXXXXXXAHAKQAFTSTDEVPDV-ETINEK------EVFPSKMVAMHRVRWNM 1562 +E P+ E N K EVFP K+VAM RVRWNM Sbjct: 639 KKRPNSRSSNKKNPEDDLALVCRDEEPPNTQEKENGKAEARTIEVFPPKIVAMRRVRWNM 698 Query: 1563 NKGSERWLCYAGAAGIVRCQEMSL 1634 NKGSERWLCY G AG+VRCQE+ L Sbjct: 699 NKGSERWLCYGGEAGVVRCQEIVL 722 >ref|XP_021986505.1| uncharacterized protein LOC110882925 isoform X1 [Helianthus annuus] ref|XP_021986511.1| uncharacterized protein LOC110882925 isoform X1 [Helianthus annuus] Length = 980 Score = 551 bits (1421), Expect = 0.0 Identities = 304/545 (55%), Positives = 358/545 (65%), Gaps = 19/545 (3%) Frame = +3 Query: 51 SRKRKGYEKQTSVKSALT-SKSRLAKCKLSSESQN---PMGSSATGYSVPLETDPDSCDI 218 +R+ K + K S KS T SKS+ + S+S N S A G P+ + I Sbjct: 471 TRRPKKHRKDQSPKSTRTPSKSKSTRTPSESKSTNCQTESESQAKGVDPPIISQSRESPI 530 Query: 219 SEDVSLPRLVMCLAHNGKVAWDIKWRPFDTCANHKHRMGYLGVLLGSGALEVWEVPLPHA 398 ED LPRLVM LAHNGKVAWD+KWRP D KH MGYL VLLG+GALEVWEVPLP Sbjct: 531 DEDDVLPRLVMGLAHNGKVAWDVKWRPSDFRHVSKHVMGYLAVLLGNGALEVWEVPLPRV 590 Query: 399 AEAMFSACQKDGTDPRFIKLKPVFRCAMLKCGDRQSIPLTMEWSTASPHDLILAGCHDGM 578 +A+FS CQK+G DPRF+KLKPVF C+MLKCGDRQSIPLT+EWST++PHDLILAGCHDG+ Sbjct: 591 TKAIFS-CQKEGKDPRFLKLKPVFLCSMLKCGDRQSIPLTLEWSTSAPHDLILAGCHDGV 649 Query: 579 VALWKFSANSPLKDTRPLIFFSADILPIRALAWAPVASDFESANIIVTAGHKGVKFWDIR 758 VALWKFSA+S KDTRPL+ FSAD +PIRAL WAP+ASD ESANII T HKG+KFWDIR Sbjct: 650 VALWKFSASSSSKDTRPLLCFSADTVPIRALTWAPLASDPESANIIATGSHKGLKFWDIR 709 Query: 759 
DPFRPLWDVNPVQRVIYGVDWLSDPRCVVLSFDDGEIRIISLSKAACDVPVTGKPFVGTQ 938 DPF PLWD+ P Q+V+ ++WL DPRCVVLSFDDGEI+IISL KAA DVPVTG P Sbjct: 710 DPFHPLWDI-PYQKVVNSLEWLPDPRCVVLSFDDGEIKIISLLKAASDVPVTGMPCDKKP 768 Query: 939 LQGLNSYHCXXXXXXXXXXXRLTGMVAYCCSDGRVLRFQLTTKSVEKDPRRNREPHYLCG 1118 L G SY+C LTGMVAYCCSDG+V+ FQLT K+VEKDP RNREPHYLCG Sbjct: 769 LHGSYSYYCSSSSIWSVQVSSLTGMVAYCCSDGKVINFQLTIKAVEKDPHRNREPHYLCG 828 Query: 1119 ALTTE-GSTLNVFSPLPN----------ELGETPRSRRGHLTISNQEKRAREQGLKSQTP 1265 +LT E S+L V SPLP+ E G+TP++ RG + SNQEKRA+ Q K QTP Sbjct: 829 SLTEEEDSSLTVLSPLPDVPIPMKRSSTEWGDTPKTSRGVKSRSNQEKRAKGQISKFQTP 888 Query: 1266 EDQAIPQAREQGLRCYTPEEQPLALCYNNDPGVDSGSDDTMVAQXXXXXXXXXXXXXXXX 1445 P+ T + NND ++ D+ Sbjct: 889 -----PKPSSGDPLVNTQKN-------NNDTSREAQIDN--------------------- 915 Query: 1446 AHAKQAFTSTDEVPDVETINEKE----VFPSKMVAMHRVRWNMNKGSERWLCYAGAAGIV 1613 QA D+ VE N KE V+P K+VAMHRVRWNMNKGSER +CY GAAGI+ Sbjct: 916 -QTSQALVCIDDDNKVEETNVKEEETDVYPPKIVAMHRVRWNMNKGSERLVCYGGAAGIL 974 Query: 1614 RCQEM 1628 RCQ++ Sbjct: 975 RCQKI 979 >ref|XP_017969461.1| PREDICTED: uncharacterized protein LOC18612763 isoform X3 [Theobroma cacao] Length = 865 Score = 547 bits (1410), Expect = 0.0 Identities = 275/488 (56%), Positives = 341/488 (69%), Gaps = 17/488 (3%) Frame = +3 Query: 216 ISEDVSLPRLVMCLAHNGKVAWDIKWRPFD-TCANHKHRMGYLGVLLGSGALEV-WEVPL 389 I D+ LPR V+CLAHNGKVAWD+KW+P+D RMGYL VLLG+G+LEV WEVPL Sbjct: 383 IPRDIELPRTVLCLAHNGKVAWDVKWQPYDINDCECNQRMGYLAVLLGNGSLEVRWEVPL 442 Query: 390 PHAAEAMFSACQKDGTDPRFIKLKPVFRCAMLKCGDRQSIPLTMEWSTASPHDLILAGCH 569 PH ++S+ K GTDPRF+KL+PVF+C+ LKCGD QSIPLT+EWST+ P++ +LAGCH Sbjct: 443 PHMISIVYSSSPKQGTDPRFVKLEPVFKCSKLKCGDVQSIPLTVEWSTSPPYNYLLAGCH 502 Query: 570 DGMVALWKFSANSPLKDTRPLIFFSADILPIRALAWAPVASDFESANIIVTAGHKGVKFW 749 DGMVALWKFSA+ DTRPL+ FSAD +PIR++AWAP SD ESAN+++TAGH G+KFW Sbjct: 503 DGMVALWKFSASGSPTDTRPLLCFSADTVPIRSVAWAPSGSDMESANVVLTAGHGGLKFW 562 Query: 750 DIRDPFRPLWDVNPVQRVIYGVDWLSDPRCVVLSFDDGEIRIISLSKAACDVPVTGKPFV 929 DIRDPF PLWDV+P + IY +DWL +PRCV+LSFDDG 
++++SL +AACDVPVTGKPF Sbjct: 563 DIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKMLSLIQAACDVPVTGKPFT 622 Query: 930 GTQLQGLNSYHCXXXXXXXXXXXRLTGMVAYCCSDGRVLRFQLTTKSVEKDPRRNREPHY 1109 GT+ QGL+ Y+C RLTGMVAYC +DG V RFQLT+K+V+KD RNR PH+ Sbjct: 623 GTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGNVTRFQLTSKAVDKDFSRNRAPHF 682 Query: 1110 LCGALTTEGSTLNVFSPLP----------NELGETPRSRRGHLTISNQEKRAREQGLKSQ 1259 +CG+LT E S + V +PLP N+ GE+PRS R LT SNQ K A++ K++ Sbjct: 683 VCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGESPRSMRAFLTESNQAKNAKDN--KAK 740 Query: 1260 TPEDQAIPQAREQGLRCYTPEEQPLALCYNNDPGVDSGSDDTM-VAQXXXXXXXXXXXXX 1436 P TP+++ LALCY NDPGV+S S++T+ +A Sbjct: 741 VP----------------TPDKRTLALCYGNDPGVESESEETLTLAALKGKIKQKSKSDR 784 Query: 1437 XXXAHAKQAFTSTDEVP----DVETINEKEVFPSKMVAMHRVRWNMNKGSERWLCYAGAA 1604 A QA P E NE EVFP K+VAMHRVRWNMNKGSERWLCY GAA Sbjct: 785 MKKAGDDQALAVRINEPTNTQKEEAGNEIEVFPPKIVAMHRVRWNMNKGSERWLCYGGAA 844 Query: 1605 GIVRCQEM 1628 GIVRCQE+ Sbjct: 845 GIVRCQEI 852 >ref|XP_022765154.1| uncharacterized protein LOC111310200 isoform X6 [Durio zibethinus] Length = 731 Score = 541 bits (1394), Expect = 0.0 Identities = 277/494 (56%), Positives = 336/494 (68%), Gaps = 15/494 (3%) Frame = +3 Query: 192 ETDPDSCDISEDVSLPRLVMCLAHNGKVAWDIKWRPFD-TCANHKHRMGYLGVLLGSGAL 368 E P S I D++LPR V+CLAHNGKVAWD+KW+P+D + RMGYL VLLG+G+L Sbjct: 247 EVSPGS-SIPGDIALPRAVLCLAHNGKVAWDVKWKPYDINDSKFNQRMGYLAVLLGNGSL 305 Query: 369 EVWEVPLPHAAEAMFSACQKDGTDPRFIKLKPVFRCAMLKCGDRQSIPLTMEWSTASPHD 548 EVWEVPLP+ ++S K GTDPRF+KL+PV +C+ LKCGD QSIPLT+EWST+SPHD Sbjct: 306 EVWEVPLPNMIRNVYSLSPKQGTDPRFVKLEPVLKCSKLKCGDIQSIPLTVEWSTSSPHD 365 Query: 549 LILAGCHDGMVALWKFSANSPLKDTRPLIFFSADILPIRALAWAPVASDFESANIIVTAG 728 +LAGCHDGMVALWKFSA+ KDTRPL+ FSAD +PIR++AWAP SD ES N+I+TAG Sbjct: 366 YLLAGCHDGMVALWKFSASDTPKDTRPLLCFSADSVPIRSVAWAPSGSDMESRNVILTAG 425 Query: 729 HKGVKFWDIRDPFRPLWDVNPVQRVIYGVDWLSDPRCVVLSFDDGEIRIISLSKAACDVP 908 H G+KFWDIRDPF PLWDV+P + IY +DWL +PRCV+LSFDDG ++++SL AACDVP Sbjct: 426 HGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKMLSLVHAACDVP 
485 Query: 909 VTGKPFVGTQLQGLNSYHCXXXXXXXXXXXRLTGMVAYCCSDGRVLRFQLTTKSVEKDPR 1088 VTGKPF GT+ QGL+ Y+C RLTGMVAYC +DG V RFQLT+K+V+KD Sbjct: 486 VTGKPFTGTKQQGLHLYNCSSFAIWSVQVSRLTGMVAYCGADGTVTRFQLTSKAVDKDFS 545 Query: 1089 RNREPHYLCGALTTEGSTLNVFSPLP----------NELGETPRSRRGHLTISNQEKRAR 1238 R+R PH+ CG+LT E S + V +PLP N+ GE PRS R LT SNQ K A+ Sbjct: 546 RHRTPHFPCGSLTEEESAVIVNTPLPDIPVTLKKPSNDYGEGPRSMRAFLTESNQGKNAK 605 Query: 1239 EQGLKSQTPEDQAIPQAREQGLRCYTPEEQPLALCYNNDPGVDSGSDDTMVAQXXXXXXX 1418 ++ K +Q LAL Y NDPGV+S S++T+ A Sbjct: 606 DRKAK------------------VAASNKQTLALYYGNDPGVESESEETLAA---LQSKR 644 Query: 1419 XXXXXXXXXAHAKQAFTSTDEVP----DVETINEKEVFPSKMVAMHRVRWNMNKGSERWL 1586 A Q E P ET NE EVFP K+VAMHRVRWNMNKGSERWL Sbjct: 645 KQKSNGKKKADDDQVLAIRIEEPTNTQKEETGNEIEVFPPKIVAMHRVRWNMNKGSERWL 704 Query: 1587 CYAGAAGIVRCQEM 1628 CY GAAGIVRCQE+ Sbjct: 705 CYGGAAGIVRCQEI 718 >ref|XP_022765150.1| uncharacterized protein LOC111310200 isoform X2 [Durio zibethinus] Length = 782 Score = 541 bits (1394), Expect = e-180 Identities = 277/494 (56%), Positives = 336/494 (68%), Gaps = 15/494 (3%) Frame = +3 Query: 192 ETDPDSCDISEDVSLPRLVMCLAHNGKVAWDIKWRPFD-TCANHKHRMGYLGVLLGSGAL 368 E P S I D++LPR V+CLAHNGKVAWD+KW+P+D + RMGYL VLLG+G+L Sbjct: 298 EVSPGS-SIPGDIALPRAVLCLAHNGKVAWDVKWKPYDINDSKFNQRMGYLAVLLGNGSL 356 Query: 369 EVWEVPLPHAAEAMFSACQKDGTDPRFIKLKPVFRCAMLKCGDRQSIPLTMEWSTASPHD 548 EVWEVPLP+ ++S K GTDPRF+KL+PV +C+ LKCGD QSIPLT+EWST+SPHD Sbjct: 357 EVWEVPLPNMIRNVYSLSPKQGTDPRFVKLEPVLKCSKLKCGDIQSIPLTVEWSTSSPHD 416 Query: 549 LILAGCHDGMVALWKFSANSPLKDTRPLIFFSADILPIRALAWAPVASDFESANIIVTAG 728 +LAGCHDGMVALWKFSA+ KDTRPL+ FSAD +PIR++AWAP SD ES N+I+TAG Sbjct: 417 YLLAGCHDGMVALWKFSASDTPKDTRPLLCFSADSVPIRSVAWAPSGSDMESRNVILTAG 476 Query: 729 HKGVKFWDIRDPFRPLWDVNPVQRVIYGVDWLSDPRCVVLSFDDGEIRIISLSKAACDVP 908 H G+KFWDIRDPF PLWDV+P + IY +DWL +PRCV+LSFDDG ++++SL AACDVP Sbjct: 477 HGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKMLSLVHAACDVP 536 Query: 909 VTGKPFVGTQLQGLNSYHCXXXXXXXXXXXRLTGMVAYCCSDGRVLRFQLTTKSVEKDPR 
1088 VTGKPF GT+ QGL+ Y+C RLTGMVAYC +DG V RFQLT+K+V+KD Sbjct: 537 VTGKPFTGTKQQGLHLYNCSSFAIWSVQVSRLTGMVAYCGADGTVTRFQLTSKAVDKDFS 596 Query: 1089 RNREPHYLCGALTTEGSTLNVFSPLP----------NELGETPRSRRGHLTISNQEKRAR 1238 R+R PH+ CG+LT E S + V +PLP N+ GE PRS R LT SNQ K A+ Sbjct: 597 RHRTPHFPCGSLTEEESAVIVNTPLPDIPVTLKKPSNDYGEGPRSMRAFLTESNQGKNAK 656 Query: 1239 EQGLKSQTPEDQAIPQAREQGLRCYTPEEQPLALCYNNDPGVDSGSDDTMVAQXXXXXXX 1418 ++ K +Q LAL Y NDPGV+S S++T+ A Sbjct: 657 DRKAK------------------VAASNKQTLALYYGNDPGVESESEETLAA---LQSKR 695 Query: 1419 XXXXXXXXXAHAKQAFTSTDEVP----DVETINEKEVFPSKMVAMHRVRWNMNKGSERWL 1586 A Q E P ET NE EVFP K+VAMHRVRWNMNKGSERWL Sbjct: 696 KQKSNGKKKADDDQVLAIRIEEPTNTQKEETGNEIEVFPPKIVAMHRVRWNMNKGSERWL 755 Query: 1587 CYAGAAGIVRCQEM 1628 CY GAAGIVRCQE+ Sbjct: 756 CYGGAAGIVRCQEI 769 >ref|XP_022765149.1| uncharacterized protein LOC111310200 isoform X1 [Durio zibethinus] Length = 783 Score = 541 bits (1394), Expect = e-180 Identities = 277/494 (56%), Positives = 336/494 (68%), Gaps = 15/494 (3%) Frame = +3 Query: 192 ETDPDSCDISEDVSLPRLVMCLAHNGKVAWDIKWRPFD-TCANHKHRMGYLGVLLGSGAL 368 E P S I D++LPR V+CLAHNGKVAWD+KW+P+D + RMGYL VLLG+G+L Sbjct: 299 EVSPGS-SIPGDIALPRAVLCLAHNGKVAWDVKWKPYDINDSKFNQRMGYLAVLLGNGSL 357 Query: 369 EVWEVPLPHAAEAMFSACQKDGTDPRFIKLKPVFRCAMLKCGDRQSIPLTMEWSTASPHD 548 EVWEVPLP+ ++S K GTDPRF+KL+PV +C+ LKCGD QSIPLT+EWST+SPHD Sbjct: 358 EVWEVPLPNMIRNVYSLSPKQGTDPRFVKLEPVLKCSKLKCGDIQSIPLTVEWSTSSPHD 417 Query: 549 LILAGCHDGMVALWKFSANSPLKDTRPLIFFSADILPIRALAWAPVASDFESANIIVTAG 728 +LAGCHDGMVALWKFSA+ KDTRPL+ FSAD +PIR++AWAP SD ES N+I+TAG Sbjct: 418 YLLAGCHDGMVALWKFSASDTPKDTRPLLCFSADSVPIRSVAWAPSGSDMESRNVILTAG 477 Query: 729 HKGVKFWDIRDPFRPLWDVNPVQRVIYGVDWLSDPRCVVLSFDDGEIRIISLSKAACDVP 908 H G+KFWDIRDPF PLWDV+P + IY +DWL +PRCV+LSFDDG ++++SL AACDVP Sbjct: 478 HGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKMLSLVHAACDVP 537 Query: 909 VTGKPFVGTQLQGLNSYHCXXXXXXXXXXXRLTGMVAYCCSDGRVLRFQLTTKSVEKDPR 1088 VTGKPF GT+ QGL+ Y+C RLTGMVAYC +DG V RFQLT+K+V+KD Sbjct: 538 
VTGKPFTGTKQQGLHLYNCSSFAIWSVQVSRLTGMVAYCGADGTVTRFQLTSKAVDKDFS 597 Query: 1089 RNREPHYLCGALTTEGSTLNVFSPLP----------NELGETPRSRRGHLTISNQEKRAR 1238 R+R PH+ CG+LT E S + V +PLP N+ GE PRS R LT SNQ K A+ Sbjct: 598 RHRTPHFPCGSLTEEESAVIVNTPLPDIPVTLKKPSNDYGEGPRSMRAFLTESNQGKNAK 657 Query: 1239 EQGLKSQTPEDQAIPQAREQGLRCYTPEEQPLALCYNNDPGVDSGSDDTMVAQXXXXXXX 1418 ++ K +Q LAL Y NDPGV+S S++T+ A Sbjct: 658 DRKAK------------------VAASNKQTLALYYGNDPGVESESEETLAA---LQSKR 696 Query: 1419 XXXXXXXXXAHAKQAFTSTDEVP----DVETINEKEVFPSKMVAMHRVRWNMNKGSERWL 1586 A Q E P ET NE EVFP K+VAMHRVRWNMNKGSERWL Sbjct: 697 KQKSNGKKKADDDQVLAIRIEEPTNTQKEETGNEIEVFPPKIVAMHRVRWNMNKGSERWL 756 Query: 1587 CYAGAAGIVRCQEM 1628 CY GAAGIVRCQE+ Sbjct: 757 CYGGAAGIVRCQEI 770 >ref|XP_021300958.1| uncharacterized protein LOC110429313 [Herrania umbratica] Length = 856 Score = 543 bits (1400), Expect = e-180 Identities = 276/496 (55%), Positives = 341/496 (68%), Gaps = 16/496 (3%) Frame = +3 Query: 189 LETDPDSCDISEDVSLPRLVMCLAHNGKVAWDIKWRPFD-TCANHKHRMGYLGVLLGSGA 365 LET S I D+ LPR V+CLAHNGKVAWD+KW+P+D HRMGYL VLLG+G+ Sbjct: 368 LETPGSS--IPRDIELPRTVLCLAHNGKVAWDVKWQPYDINGCECNHRMGYLAVLLGNGS 425 Query: 366 LEVWEVPLPHAAEAMFSACQKDGTDPRFIKLKPVFRCAMLKCGDRQSIPLTMEWSTASPH 545 LEVWEVPLP ++S+ K GTDPRF+KL+PVF+C+ LKCGD QSIPLT+EWST+ PH Sbjct: 426 LEVWEVPLPSMIRIVYSSSPKQGTDPRFVKLEPVFKCSKLKCGDVQSIPLTVEWSTSPPH 485 Query: 546 DLILAGCHDGMVALWKFSANSPLKDTRPLIFFSADILPIRALAWAPVASDFESANIIVTA 725 + +LAGCHDG VALWKFSA+ DTRPL+ FSAD +PIR++AWAP SD ESAN+++TA Sbjct: 486 NYLLAGCHDGKVALWKFSASGSPTDTRPLLCFSADTVPIRSVAWAPSGSDMESANVVLTA 545 Query: 726 GHKGVKFWDIRDPFRPLWDVNPVQRVIYGVDWLSDPRCVVLSFDDGEIRIISLSKAACDV 905 GH G+KFWDIRDPF PLWDV+P + IY +DWL +PRCV+LSFDDG ++++SL +AACDV Sbjct: 546 GHGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKMLSLIQAACDV 605 Query: 906 PVTGKPFVGTQLQGLNSYHCXXXXXXXXXXXRLTGMVAYCCSDGRVLRFQLTTKSVEKDP 1085 PVTGKPF GT+ QGL+ Y+C RLTGMVAYC +DG V RFQLT+K+V+KD Sbjct: 606 PVTGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGNVTRFQLTSKAVDKDF 665 Query: 1086 
RRNREPHYLCGALTTEGSTLNVFSPLP----------NELGETPRSRRGHLTISNQEKRA 1235 RNR PH++CG+LT E S + V +PLP N+ GE PRS R LT SNQ K A Sbjct: 666 SRNRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGEGPRSMRAFLTESNQAKNA 725 Query: 1236 REQGLKSQTPEDQAIPQAREQGLRCYTPEEQPLALCYNNDPGVDSGSDDTM-VAQXXXXX 1412 +++ K++ P TP+++ ALCY ND GV+S S++T+ +A Sbjct: 726 KDK--KAKVP----------------TPDKRTFALCYGNDRGVESESEETLTLAALKGKI 767 Query: 1413 XXXXXXXXXXXAHAKQAFTSTDEVP----DVETINEKEVFPSKMVAMHRVRWNMNKGSER 1580 A QA P E E EVFP K+VAMHRVRWNMNKGSER Sbjct: 768 KQKSKSDRTKKAGDDQALAVRINEPRNTQKEEAGYEIEVFPPKIVAMHRVRWNMNKGSER 827 Query: 1581 WLCYAGAAGIVRCQEM 1628 WLCY GAAGIVRCQE+ Sbjct: 828 WLCYGGAAGIVRCQEI 843 >emb|CDP15391.1| unnamed protein product [Coffea canephora] Length = 942 Score = 546 bits (1406), Expect = e-180 Identities = 286/547 (52%), Positives = 351/547 (64%), Gaps = 12/547 (2%) Frame = +3 Query: 30 VVPEKASSRKRKGYEKQTSVKSALTSKSRLAKCKLSSESQNPMGSSATGYSVPLETDPDS 209 +VP K K KG + K A ++ LA P + + + + Sbjct: 425 IVPSKRIRLKDKGSTRVQVSKDAQEAELSLAN--------PPTEENLALNMITCDFGSAN 476 Query: 210 CDISEDVSLPRLVMCLAHNGKVAWDIKWRPFDTCANHKHRMGYLGVLLGSGALEVWEVPL 389 C I DV+LPR+V CLAHNG+VAWD KWRP D + K RMGYL VLLG GALEVWEVP Sbjct: 477 CSIPNDVALPRMVFCLAHNGEVAWDAKWRPCDV--SDKQRMGYLAVLLGDGALEVWEVPF 534 Query: 390 PHAAEAMFSACQKDGTDPRFIKLKPVFRCAMLKCGDRQSIPLTMEWSTASPHDLILAGCH 569 P + ++SA QK+GTDPRFI+L+PVFRC +K G RQSIPLT+EWS +SPHD+ILAGCH Sbjct: 535 PRTMKVIYSASQKEGTDPRFIRLRPVFRCPTIKRGGRQSIPLTLEWSASSPHDMILAGCH 594 Query: 570 DGMVALWKFSANSPLKDTRPLIFFSADILPIRALAWAPVASDFESANIIVTAGHKGVKFW 749 DG+VALWKF A L++TRPL+ FSAD + IRAL W PV+S ESANIIVTAGH+G+KFW Sbjct: 595 DGVVALWKFCATGSLQETRPLLCFSADTVTIRALTWVPVSSYSESANIIVTAGHRGLKFW 654 Query: 750 DIRDPFRPLWDVNPVQRVIYGVDWLSDPRCVVLSFDDGEIRIISLSKAACDVPVTGKPFV 929 D+RDPFRPLWD P QRVIY +DWL DPRC+++SFDDG +RI+SL KAA D PVTGKPF Sbjct: 655 DLRDPFRPLWDFYPFQRVIYSLDWLPDPRCIIVSFDDGALRILSLLKAANDAPVTGKPFE 714 Query: 930 GTQLQGLNSYHCXXXXXXXXXXXRLTGMVAYCCSDGRVLRFQLTTKSVEKDPRRNREPHY 1109 G Q +G +SY C RLTGMVAYC +DG LRFQLTT++VEKDP 
RNR PH+ Sbjct: 715 GAQQKGFHSYLCSPFQIWSVHTSRLTGMVAYCGADGTALRFQLTTRAVEKDPLRNRAPHF 774 Query: 1110 LCGALTTEGSTLNVFSPLPN----------ELGETPRSRRGHLTISNQEKRAREQGLKSQ 1259 LCGALT E STL +F+ LPN E GE PR+ RG++++SNQEKRA+++ +K + Sbjct: 775 LCGALTEENSTLTMFTSLPNTPFPMRKSLREWGEAPRTVRGYISVSNQEKRAKQKVVKVR 834 Query: 1260 TPEDQAIPQAREQGLRCYTPEEQPLALCYNNDPGVDSGSDDTMVAQXXXXXXXXXXXXXX 1439 + EE+ ALC D + G D V + Sbjct: 835 S-------------------EEKHKALCKRGDLDSEFGPDCMAVTE--TREAGKVKTSSN 873 Query: 1440 XXAHAKQAFTSTDEVPDV--ETINEKEVFPSKMVAMHRVRWNMNKGSERWLCYAGAAGIV 1613 A + D PD+ + E EVFPSK VAMHRVRWN NKGSE WLCY GAAG+V Sbjct: 874 SEADQRPIMVGEDN-PDIMRGEVEEVEVFPSKTVAMHRVRWNTNKGSENWLCYGGAAGVV 932 Query: 1614 RCQEMSL 1634 R QE+ + Sbjct: 933 RFQEIDM 939 >ref|XP_023920539.1| uncharacterized protein LOC112032069 isoform X1 [Quercus suber] gb|POF00175.1| general transcription factor 3c polypeptide 2 [Quercus suber] Length = 908 Score = 544 bits (1402), Expect = e-180 Identities = 271/504 (53%), Positives = 347/504 (68%), Gaps = 8/504 (1%) Frame = +3 Query: 147 QNPMGSSATGYSVPLETDPDSCDISEDVSLPRLVMCLAHNGKVAWDIKWRPFDTCANH-K 323 Q P S L T S IS+DV+LPR+V+CLAHNGKVAWD+KWRP + C + K Sbjct: 402 QEPAASDNVSDKGSLGTSSASSLISKDVALPRVVLCLAHNGKVAWDVKWRPSNACQSKCK 461 Query: 324 HRMGYLGVLLGSGALEVWEVPLPHAAEAMFSACQKDGTDPRFIKLKPVFRCAMLKCGDRQ 503 HRMGYL VLLG+G+LEVWEVPLP + ++S+ ++GTDPRF+KL+PVFR ++LKCG Q Sbjct: 462 HRMGYLAVLLGNGSLEVWEVPLPRTMKVIYSSVHQEGTDPRFVKLEPVFRGSLLKCGGIQ 521 Query: 504 SIPLTMEWSTASPHDLILAGCHDGMVALWKFSANSPLKDTRPLIFFSADILPIRALAWAP 683 SIPLT+EWS + PHD +LAGCHDG VALWKFSA+ +DTRPL+ FSAD +PIRALAWAP Sbjct: 522 SIPLTVEWSASPPHDYLLAGCHDGTVALWKFSASCSSEDTRPLLCFSADTVPIRALAWAP 581 Query: 684 VASDFESANIIVTAGHKGVKFWDIRDPFRPLWDVNPVQRVIYGVDWLSDPRCVVLSFDDG 863 + SD ESAN+IVTAGH G+KFWD+RDP+RPLWD++PV R+IY +DWLS+PRCV+LSFDDG Sbjct: 582 LESDPESANVIVTAGHGGLKFWDLRDPYRPLWDLHPVPRIIYSLDWLSNPRCVILSFDDG 641 Query: 864 EIRIISLSKAACDVPVTGKPFVGTQLQGLNSYHCXXXXXXXXXXXRLTGMVAYCCSDGRV 1043 +RI+SL KAA DVPVTGKPF GT+ QGL+SY+C R+TGM AYC +DG V Sbjct: 642 
TMRILSLLKAAYDVPVTGKPFGGTKQQGLHSYYCSSFAIWSVQVSRITGMAAYCTADGTV 701 Query: 1044 LRFQLTTKSVEKDPRRNREPHYLCGALTTEGSTLNVFSPLPNELGETPRSRRGHLTISNQ 1223 LRFQLT+K+V+KDP RNR PH+LCG+LT E S + + +P+PN TP + L Sbjct: 702 LRFQLTSKAVDKDPSRNRTPHFLCGSLTEEESLITINTPVPN----TPFPLKKSLNKGGD 757 Query: 1224 EKRAREQGLKSQTPEDQAIPQAREQGLRCYTPEEQPLALCYNNDPGVDSGSDDTMVAQXX 1403 + ++ + E Q + +A ++ + + + LALCY +DPG +SG+++ + Sbjct: 758 TPLS----MREFSSEPQHVKRANDKMAKSPSTDATTLALCYGDDPGTESGTEEALTRPKS 813 Query: 1404 XXXXXXXXXXXXXXAHAKQAFTSTDEVPDV-ETINEK------EVFPSKMVAMHRVRWNM 1562 +E P+ E N K EVFP K+VAM RVRWNM Sbjct: 814 KKRPNSRSSNKKNPEDDLALVCRDEEPPNTQEKENGKAEARTIEVFPPKIVAMRRVRWNM 873 Query: 1563 NKGSERWLCYAGAAGIVRCQEMSL 1634 NKGSERWLCY G AG+VRCQE+ L Sbjct: 874 NKGSERWLCYGGEAGVVRCQEIVL 897