BLASTX nr result
ID: Chrysanthemum21_contig00019102
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00019102
         (1347 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_021969279.1| uncharacterized protein LOC110864507 [Helian...   504   e-166
ref|XP_023751980.1| uncharacterized protein LOC111900339 isoform...   483   e-158
ref|XP_022025534.1| uncharacterized protein LOC110926090 isoform...   453   e-146
ref|XP_023729140.1| uncharacterized protein LOC111876795 isoform...   451   e-145
ref|XP_022025533.1| uncharacterized protein LOC110926090 isoform...   449   e-145
ref|XP_022017820.1| uncharacterized protein LOC110917638 [Helian...   448   e-144
gb|KVH97278.1| Glycosyl transferase, family 1 [Cynara cardunculu...   437   e-141
gb|KVH91086.1| hypothetical protein Ccrd_006880 [Cynara carduncu...   423   e-136
ref|XP_023922256.1| uncharacterized protein LOC112033706 isoform...   395   e-125
ref|XP_023922254.1| uncharacterized protein LOC112033706 isoform...   395   e-124
ref|XP_023922253.1| uncharacterized protein LOC112033706 isoform...   395   e-124
gb|EOX95825.1| Glycosyl transferase family 1 protein isoform 2 [...   383   e-123
dbj|GAV75395.1| Glycos_transf_1 domain-containing protein [Cepha...   391   e-123
ref|XP_021644116.1| uncharacterized protein LOC110638026 [Hevea ...   388   e-121
gb|OAY62220.1| hypothetical protein MANES_01G251000 [Manihot esc...   379   e-120
gb|EOX95824.1| Glycosyl transferase family 1 protein isoform 1 [...   383   e-120
ref|XP_021683395.1| uncharacterized protein LOC110667009 isoform...
379 e-120 ref|XP_007051667.2| PREDICTED: uncharacterized protein LOC186140... 382 e-119 ref|XP_022757484.1| uncharacterized protein LOC111304797 [Durio ... 381 e-119 ref|XP_012083283.1| uncharacterized protein LOC105642906 [Jatrop... 380 e-118 >ref|XP_021969279.1| uncharacterized protein LOC110864507 [Helianthus annuus] gb|OTG22022.1| putative glycosyl transferase, family 1 [Helianthus annuus] Length = 1024 Score = 504 bits (1298), Expect = e-166 Identities = 265/397 (66%), Positives = 312/397 (78%), Gaps = 8/397 (2%) Frame = -1 Query: 1167 MGSVSP-MLPIKSRTDTKIHNSTKPKSKFVFLKKIDYLQWVSALAVFIFFMFLVQLFLPL 991 MGS++P +LP+K + K + +P+S+ + LKKIDYLQW+SALAVFIFFMFL QLFLPL Sbjct: 1 MGSLNPPVLPLKRDSLLK-SSPQRPRSRILILKKIDYLQWISALAVFIFFMFLFQLFLPL 59 Query: 990 DKV--DIRKDEVGQFEFDFLVDEIVSGLDFGEDLKFVVGPSKFLMKFQRDGNVNVSRSG- 820 V D K + + D + +S LDFGED+KFV P++ KF R+ + NVS G Sbjct: 60 STVNDDFFKQNIDEGLQDLFKE--ISTLDFGEDVKFV--PTRLSTKFLREKSSNVSFGGS 115 Query: 819 ---VRFGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREIGYEFEVYSLEDGPAHSIWKTM 649 RFGNRKV+L +VFADLL D Q+LMV++AA LR IGYEFEVYSLEDGP H IW+T+ Sbjct: 116 RTVTRFGNRKVQLALVFADLLDDPYQILMVTLAAALRGIGYEFEVYSLEDGPVHHIWRTI 175 Query: 648 GVPVNIIEAKD-TGITIDWLNYDGVLVNSLAAKDVISSLLQEPFRSVPLIWTIHEKTLAT 472 GVPV+I++A D + I IDWLNYDGVLVNSLAAKDV+S LLQEPF+SVPLIWT+HEK+LAT Sbjct: 176 GVPVHIMDASDKSAIMIDWLNYDGVLVNSLAAKDVVSCLLQEPFKSVPLIWTVHEKSLAT 235 Query: 471 RAAKYVSSGQVDMINDWKAIFNRATVVVFPNYALPMFYAAFDAGNYFVVPGCPSGACKIN 292 RAA+YVS+GQ ++I+DWKAIFNRATVVVFPNYALPMFY+AFDAGNYFVVPG PS CK++ Sbjct: 236 RAARYVSNGQAELIDDWKAIFNRATVVVFPNYALPMFYSAFDAGNYFVVPGTPSEICKVD 295 Query: 291 NSTIIHEENLRVNMNIGVEDFVVGIVGSEFLYKGIWLEHAXXXXXXXXXXXXXXVGDSSS 112 TI HEEN+RVNM+IG+ DFVVGIVGSEFLYKGIWLEHA GDSSS Sbjct: 296 KPTIFHEENVRVNMDIGLNDFVVGIVGSEFLYKGIWLEHALVLKALSPLLAKFPDGDSSS 355 Query: 111 RRLKIIVLSQDSTGNYSAAMEEIASNLNYPRGTVKHV 1 RLKII+LSQD T NYSAAMEEI SNLNYPRGTVKHV Sbjct: 356 PRLKIIILSQDLTDNYSAAMEEITSNLNYPRGTVKHV 392 >ref|XP_023751980.1| uncharacterized protein LOC111900339 isoform X1 [Lactuca sativa] 
gb|PLY94459.1| hypothetical protein LSAT_3X138461 [Lactuca sativa] Length = 1027 Score = 483 bits (1242), Expect = e-158 Identities = 263/407 (64%), Positives = 316/407 (77%), Gaps = 18/407 (4%) Frame = -1 Query: 1167 MGSVSPMLPIKSRTDTKI--HNST---KPKSKFVFL---KKIDYLQWVSALAVFIFFMFL 1012 MGS+SP+LP+K + KI +NS+ +P+S+F L KKIDYLQWVSALAVFIFFMFL Sbjct: 1 MGSLSPVLPLKRDSLLKISPNNSSHLQRPRSRFARLIGFKKIDYLQWVSALAVFIFFMFL 60 Query: 1011 VQLFLPLDKVDIRKDEV--GQFEFDFL-VDEIVSGLDFGEDLKFVVGPSKFLMKFQRDGN 841 VQ+FLPL VD + + + DF+ + + + LDFGED+KF+ P+K +MKF+R+ Sbjct: 61 VQMFLPLSMVDKADGDFLKREADSDFINLLKQIGDLDFGEDVKFM--PTKLMMKFRREEM 118 Query: 840 VNVSRSG----VRFGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREIGYEFEVYSLEDGP 673 N S G RF NRK +L +VFADLLVD QQ+LMV+VAA LR IGYE EVYSLE+GP Sbjct: 119 NNASFGGSRTLARFPNRKPQLALVFADLLVDPQQILMVTVAAALRAIGYEIEVYSLENGP 178 Query: 672 AHSIWKTMGVPVNIIEAKD-TGITIDWLNYDGVLVNSLAAKDVISSLLQEPFRSVPLIWT 496 HSIWK++GVPVNI++A + T ITIDWLNYDGV+VNSL AKDV+S LL EPF+SVPLIW+ Sbjct: 179 VHSIWKSIGVPVNIMDANNKTDITIDWLNYDGVVVNSLEAKDVVSCLLHEPFKSVPLIWS 238 Query: 495 IHEKTLATRAAKYVSSGQVDMINDWKAIFNRATVVVFPNYALPMFYAAFDAGNYFVVPGC 316 IHEK+LATR A YVSSG+V++I+DWKAIFNRATVVVFPNYALPMFYAAFD GNYFV+PG Sbjct: 239 IHEKSLATRVASYVSSGKVEIIDDWKAIFNRATVVVFPNYALPMFYAAFDDGNYFVIPGS 298 Query: 315 PSGACKINNSTIIHEEN-LRVN-MNIGVEDFVVGIVGSEFLYKGIWLEHAXXXXXXXXXX 142 PS ACKI +STIIHE N LRVN MNIGV+DFVV IVGSEFLYKGIWLEHA Sbjct: 299 PSKACKIEDSTIIHEGNHLRVNMMNIGVDDFVVAIVGSEFLYKGIWLEHALVLRALFPLL 358 Query: 141 XXXXVGDSSSRRLKIIVLSQDSTGNYSAAMEEIASNLNYPRGTVKHV 1 + D+ S RLKI++ + D TGNYS+A+EEIA NLNYPRG+V HV Sbjct: 359 AEFQISDNFSPRLKILIFTHDLTGNYSSAIEEIALNLNYPRGSVSHV 405 >ref|XP_022025534.1| uncharacterized protein LOC110926090 isoform X2 [Helianthus annuus] gb|OTF87476.1| putative glycosyl transferase, family 1 [Helianthus annuus] Length = 1036 Score = 453 bits (1165), Expect = e-146 Identities = 234/394 (59%), Positives = 286/394 (72%), Gaps = 11/394 (2%) Frame = -1 Query: 1152 PMLPIKSRTDTKIHNSTKPKSKF---VFLKKIDYLQWVSALAVFIFFMFLVQLFLPLDKV 982 P+L 
SR + + +P+S+F + LKK+DYLQW+ A+AVF FMF+ Q+ LPL + Sbjct: 15 PLLKSLSRNERNSSFANRPRSRFARFMVLKKLDYLQWICAVAVFFLFMFVFQMLLPLSTL 74 Query: 981 D-------IRKDEVGQFEFDFLVDEIVSGLDFGEDLKFVVGPSKFLMKFQRDGNVNVSRS 823 + I+KD+ + +F L EI SGLDFGE +KF P++ L+KFQ D N NV Sbjct: 75 EKASGGFLIQKDDNFEGDFKSLFQEI-SGLDFGEGVKFE--PTRLLLKFQEDNNKNVKNL 131 Query: 822 GVRFGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREIGYEFEVYSLEDGPAHSIWKTMGV 643 G+RK +L VFADL VD QQVLMVSVA LR IGYE EVYSLEDGP H++WK +GV Sbjct: 132 SFG-GSRKPQLAFVFADLFVDPQQVLMVSVAVALRAIGYEIEVYSLEDGPVHTVWKNIGV 190 Query: 642 PVNIIEAK-DTGITIDWLNYDGVLVNSLAAKDVISSLLQEPFRSVPLIWTIHEKTLATRA 466 PVNI+EA D+ I IDWL YD VLVNSL AKD +S LLQEPF+S+PLIWT+HEKTLATR Sbjct: 191 PVNIMEASGDSKIIIDWLIYDAVLVNSLEAKDAVSGLLQEPFKSLPLIWTVHEKTLATRY 250 Query: 465 AKYVSSGQVDMINDWKAIFNRATVVVFPNYALPMFYAAFDAGNYFVVPGCPSGACKINNS 286 YVS GQ +I+DWK +FNRATVVVFPN+ LPM+YAAFDAGNYFV+PG PS ACK+NNS Sbjct: 251 KNYVSDGQFQLIDDWKTVFNRATVVVFPNHVLPMYYAAFDAGNYFVIPGSPSEACKLNNS 310 Query: 285 TIIHEENLRVNMNIGVEDFVVGIVGSEFLYKGIWLEHAXXXXXXXXXXXXXXVGDSSSRR 106 ++HEE+LRVNMN DFV+ I GS+FLYKG+W+EHA V D S+R Sbjct: 311 IVVHEESLRVNMNFTARDFVIAITGSQFLYKGLWVEHALVLQALSPLLAEFPVDDRLSQR 370 Query: 105 LKIIVLSQDSTGNYSAAMEEIASNLNYPRGTVKH 4 L+II+L QD TGNYSAA+EEIASNLNYPRGTV + Sbjct: 371 LRIIILRQDLTGNYSAAIEEIASNLNYPRGTVNY 404 >ref|XP_023729140.1| uncharacterized protein LOC111876795 isoform X1 [Lactuca sativa] gb|PLY77511.1| hypothetical protein LSAT_4X34080 [Lactuca sativa] Length = 1042 Score = 451 bits (1160), Expect = e-145 Identities = 235/402 (58%), Positives = 291/402 (72%), Gaps = 19/402 (4%) Frame = -1 Query: 1152 PMLPIKSRTDTKIHNSTKPKSKF---VFLKKIDYLQWVSALAVFIFFMFLVQLFLPLDKV 982 P+L SR + + +P+SKF + LKK+DYLQW+ A+AVFIFFMF+ Q+FLPL V Sbjct: 15 PLLKSSSRNERNNSFAQRPRSKFARFMVLKKLDYLQWICAVAVFIFFMFVFQMFLPLSSV 74 Query: 981 DI--------RKDEVGQFEFDFLVDEIVSGLDFGEDLKFVVGPSKFLMKFQRDG------ 844 + ++D G +FL + + GLDFGE +KF P+K L+KF R+ Sbjct: 75 EKDSGDFLKQKEDNFGDDLTNFLKE--IGGLDFGEGVKFE--PTKLLLKFHRENRGVNNV 130 Query: 843 
NVNVSRSGVRFGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREIGYEFEVYSLEDGPAHS 664 + SR VRFG+RK +L VFADLLVD QQ+LM++VA LR IGYE +VYSLE+GP H+ Sbjct: 131 SFGTSRKVVRFGHRKPQLAFVFADLLVDPQQLLMLTVATALRTIGYEIQVYSLEEGPVHT 190 Query: 663 IWKTMGVPVNIIEA-KDTGITIDWLNYDGVLVNSLAAKDVISSLLQEPFRSVPLIWTIHE 487 +WK +GV VNI+EA +D IDWLNYD +LVNSL AK+ IS LLQEPF+S+PLIWTIHE Sbjct: 191 VWKNIGVHVNILEASEDKKFIIDWLNYDAILVNSLEAKEAISGLLQEPFKSLPLIWTIHE 250 Query: 486 KTLATRAAKYVSSGQVDMINDWKAIFNRATVVVFPNYALPMFYAAFDAGNYFVVPGCPSG 307 KTLATR Y+S+G++ +I+DWKA+FNRATVVVFPNYALPMFYA FDAGNYFV+PG PS Sbjct: 251 KTLATRYKNYISNGKIQLIDDWKAVFNRATVVVFPNYALPMFYAPFDAGNYFVIPGSPSN 310 Query: 306 ACKINNSTIIHEENLRVNMNIGVEDFVVGIVGSEFLYKGIWLEHA-XXXXXXXXXXXXXX 130 ACK++NST + EENLRVNMNIG DFV+ I GS+FLYKG+WLEHA Sbjct: 311 ACKLDNSTTVLEENLRVNMNIGAHDFVITITGSQFLYKGLWLEHALVLQALSPLLAQFPV 370 Query: 129 VGDSSSRRLKIIVLSQDSTGNYSAAMEEIASNLNYPRGTVKH 4 DSSS LKII+L+QD T NYS+A+EEIASNLNYP GTV H Sbjct: 371 DDDSSSPHLKIIILNQDITRNYSSAIEEIASNLNYPSGTVNH 412 >ref|XP_022025533.1| uncharacterized protein LOC110926090 isoform X1 [Helianthus annuus] Length = 1039 Score = 449 bits (1155), Expect = e-145 Identities = 235/397 (59%), Positives = 287/397 (72%), Gaps = 14/397 (3%) Frame = -1 Query: 1152 PMLPIKSRTDTKIHNSTKPKSKF---VFLKKIDYLQWVSALAVFIFFMFLVQLFLPLDKV 982 P+L SR + + +P+S+F + LKK+DYLQW+ A+AVF FMF+ Q+ LPL + Sbjct: 15 PLLKSLSRNERNSSFANRPRSRFARFMVLKKLDYLQWICAVAVFFLFMFVFQMLLPLSTL 74 Query: 981 D-------IRKDEVGQFEFDFLVDEIVSGLDFGEDLKFVVGPSKFLMKFQRDGNVNVSRS 823 + I+KD+ + +F L EI SGLDFGE +KF P++ L+KFQ D N NV Sbjct: 75 EKASGGFLIQKDDNFEGDFKSLFQEI-SGLDFGEGVKFE--PTRLLLKFQEDNNKNVKNL 131 Query: 822 GVRFGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREIGYEFEVYSLEDGPAHSIWKTMGV 643 G+RK +L VFADL VD QQVLMVSVA LR IGYE EVYSLEDGP H++WK +GV Sbjct: 132 SFG-GSRKPQLAFVFADLFVDPQQVLMVSVAVALRAIGYEIEVYSLEDGPVHTVWKNIGV 190 Query: 642 PVNIIEAK-DTGITIDWLNYDGVLVNSLAAKDVIS---SLLQEPFRSVPLIWTIHEKTLA 475 PVNI+EA D+ I IDWL YD VLVNSL AKD +S SLLQEPF+S+PLIWT+HEKTLA Sbjct: 191 
PVNIMEASGDSKIIIDWLIYDAVLVNSLEAKDAVSGYCSLLQEPFKSLPLIWTVHEKTLA 250 Query: 474 TRAAKYVSSGQVDMINDWKAIFNRATVVVFPNYALPMFYAAFDAGNYFVVPGCPSGACKI 295 TR YVS GQ +I+DWK +FNRATVVVFPN+ LPM+YAAFDAGNYFV+PG PS ACK+ Sbjct: 251 TRYKNYVSDGQFQLIDDWKTVFNRATVVVFPNHVLPMYYAAFDAGNYFVIPGSPSEACKL 310 Query: 294 NNSTIIHEENLRVNMNIGVEDFVVGIVGSEFLYKGIWLEHAXXXXXXXXXXXXXXVGDSS 115 NNS ++HEE+LRVNMN DFV+ I GS+FLYKG+W+EHA V D Sbjct: 311 NNSIVVHEESLRVNMNFTARDFVIAITGSQFLYKGLWVEHALVLQALSPLLAEFPVDDRL 370 Query: 114 SRRLKIIVLSQDSTGNYSAAMEEIASNLNYPRGTVKH 4 S+RL+II+L QD TGNYSAA+EEIASNLNYPRGTV + Sbjct: 371 SQRLRIIILRQDLTGNYSAAIEEIASNLNYPRGTVNY 407 >ref|XP_022017820.1| uncharacterized protein LOC110917638 [Helianthus annuus] gb|OTF92614.1| putative glycosyl transferase family 1 protein [Helianthus annuus] Length = 1048 Score = 448 bits (1152), Expect = e-144 Identities = 236/407 (57%), Positives = 289/407 (71%), Gaps = 24/407 (5%) Frame = -1 Query: 1152 PMLPIKSRTDTKIHNSTKPKSKF---VFLKKIDYLQWVSALAVFIFFMFLVQLFLPLDKV 982 P+L SR + +P+S+F + +KK+DYLQW+ A+AVFIFF+F+ Q+FLPL + Sbjct: 15 PLLKSLSRNERNSSFGQRPRSRFARFMVVKKLDYLQWICAVAVFIFFVFVFQMFLPLSTM 74 Query: 981 DI-------RKDEVGQFEFDFLVDEIVSGLDFGEDLKFVVGPSKFLMKFQRDGNVNV--- 832 + +KD+ E L E+ SGLDFGE +KF P++ L+KFQR GN + Sbjct: 75 EKAGEGFLKQKDDTFDGELKNLFQEL-SGLDFGEGVKFE--PTRLLLKFQR-GNKDFNDF 130 Query: 831 ----------SRSGVRFGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREIGYEFEVYSLE 682 SR VRFGNRK +L VFADLLVD QQVLMVSVAA LR IGYE EVYSLE Sbjct: 131 NNFNNPSFEGSRKVVRFGNRKPQLAFVFADLLVDPQQVLMVSVAAALRSIGYEIEVYSLE 190 Query: 681 DGPAHSIWKTMGVPVNIIEA-KDTGITIDWLNYDGVLVNSLAAKDVISSLLQEPFRSVPL 505 DGP H++WK +GVPVNI+EA +T I IDWLNYD +LVNSL AKD IS LLQEPF+S+PL Sbjct: 191 DGPVHAVWKNIGVPVNIVEADNNTKIIIDWLNYDAILVNSLEAKDAISGLLQEPFKSLPL 250 Query: 504 IWTIHEKTLATRAAKYVSSGQVDMINDWKAIFNRATVVVFPNYALPMFYAAFDAGNYFVV 325 IWT+HEK LATR KYVS Q + +DWK +F+RA+VVVFPN+ LPM+YAAFDAGNYFV+ Sbjct: 251 IWTVHEKALATRLKKYVSDDQHPLFDDWKTVFHRASVVVFPNHVLPMYYAAFDAGNYFVI 310 Query: 324 PGCPSGACKINNSTIIHEENLRVNMNIGVEDFVVGIVGSEFLYKGIWLEHAXXXXXXXXX 
145 PG PS ACK++NS I+ EENLR NMNIG D V+ I GS+FLYKG+W+EHA Sbjct: 311 PGFPSNACKLDNSMIVFEENLRGNMNIGAHDLVIAITGSQFLYKGLWVEHALVLQALSPL 370 Query: 144 XXXXXVGDSSSRRLKIIVLSQDSTGNYSAAMEEIASNLNYPRGTVKH 4 DS+S+ L+II LSQD +GNYSAA+EEIASNLNYP GTV H Sbjct: 371 LAEFNADDSTSQHLRIIFLSQDLSGNYSAAIEEIASNLNYPNGTVNH 417 >gb|KVH97278.1| Glycosyl transferase, family 1 [Cynara cardunculus var. scolymus] Length = 978 Score = 437 bits (1125), Expect = e-141 Identities = 240/430 (55%), Positives = 297/430 (69%), Gaps = 51/430 (11%) Frame = -1 Query: 1140 IKSRTDTKIHNS--TKPKSKF---VFLKKIDYLQWVSALAVFIFFMFLVQLFLPLDKVDI 976 +KS + + +NS +P+S+F + LKK+DYLQW+ A+AVFIFFM + Q+FLPL V+ Sbjct: 17 LKSSSRNERNNSFVQRPRSRFTRFMVLKKLDYLQWICAVAVFIFFMLVFQMFLPLSTVEK 76 Query: 975 --------RKDEVGQFEFDFLVDEIVSGLDFGEDLKFVVGPSKFLMKFQRDGN--VNV-- 832 ++D G +FL + + GLDFGE +KF P+K L+KFQRD NV Sbjct: 77 DGGDFLKQKEDNFGGELKNFLKE--IGGLDFGEGVKFE--PTKLLLKFQRDNRDVYNVAF 132 Query: 831 --SRSGVRFGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREIGYEFEVYSLEDGPAHSIW 658 SR VRFG+RK +L VFADLLVD QQ+LMV+VAA L+ IGYE EVYSLEDGP HS+W Sbjct: 133 GGSRRVVRFGHRKPQLAFVFADLLVDPQQLLMVTVAAALKAIGYEIEVYSLEDGPVHSVW 192 Query: 657 KTMGVPVNIIEAK-DTGITIDWLNYDGVLVNSLAAKDVIS-------------------- 541 + +GVPVNI+EA D I +DWLNYD +LV SL AKDV+S Sbjct: 193 ENVGVPVNIVEAGGDPKIVVDWLNYDAILVTSLEAKDVVSGITKTARHLYSPNNLRQCGH 252 Query: 540 -----------SLLQEPFRSVPLIWTIHEKTLATRAAKYVSSGQVDMINDWKAIFNRATV 394 SLLQEPF+S+P+IW +HEKTLATR YVS+GQV++I+DWKA+FNRATV Sbjct: 253 KLHKLNKFSCYSLLQEPFKSIPVIWIVHEKTLATRFKNYVSNGQVELIDDWKAVFNRATV 312 Query: 393 VVFPNYALPMFYAAFDAGNYFVVPGCPSGACKINNSTIIHEENLRVNMNIGVEDFVVGIV 214 VVFPN+ LPMFYAAFDAGNYFV+PG PSGACK++NST + +E+LRVNMNIG DFV+ I Sbjct: 313 VVFPNHVLPMFYAAFDAGNYFVIPGSPSGACKLDNSTNVLQESLRVNMNIGDRDFVIAIT 372 Query: 213 GSEFLYKGIWLEHAXXXXXXXXXXXXXXVGDSSSRRLKIIVLSQDSTGNYSAAMEEIASN 34 GS+FLYKG+WLEHA V DS S RL+II+LSQD TGNYS A++ IASN Sbjct: 373 GSQFLYKGLWLEHALVLQALSPLLAEFPVDDSLSPRLRIIILSQDLTGNYSEAIKGIASN 432 Query: 33 LNYPRGTVKH 4 LNYP GTV H Sbjct: 433 LNYPSGTVNH 442 >gb|KVH91086.1| 
hypothetical protein Ccrd_006880 [Cynara cardunculus var. scolymus] Length = 903 Score = 423 bits (1088), Expect = e-136 Identities = 239/416 (57%), Positives = 282/416 (67%), Gaps = 28/416 (6%) Frame = -1 Query: 1167 MGSVSPMLPIKSRTDTKI--------HNS--TKPKSKF---VFLKKIDYLQWVSALAVFI 1027 MGS+S +LPIK K+ +NS +P+S+F + KKIDYLQW+SA+AVFI Sbjct: 1 MGSLSLVLPIKRDPSFKVSPRNEKNNNNSYVQRPRSRFGRFMVFKKIDYLQWISAIAVFI 60 Query: 1026 FFMFLVQLFLPLDKVDI---------RKDEVGQFEFDFLVDEIVSGLDFGEDLKFVVGPS 874 FFMFL QLFLPL V+ +D+ L+ EI GLDFGED+KFV P+ Sbjct: 61 FFMFLFQLFLPLSMVEKTDGDFLKGREEDDGSGGNLKNLLKEI-GGLDFGEDVKFV--PT 117 Query: 873 KFLMKFQRDGNV--NV----SRSGVRFGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREI 712 KFL+KFQR+ V NV SR+ +RFGNRK +L +VFADLLVD QQ++MV+VA LR + Sbjct: 118 KFLIKFQREKGVVNNVTFDGSRTVMRFGNRKPQLALVFADLLVDPQQIMMVTVAVALRAV 177 Query: 711 GYEFEVYSLEDGPAHSIWKTMGVPVNIIEAKDTGITIDWLNYDGVLVNSLAAKDVISSLL 532 GYE E+YSLEDGP IWKT+GVPVNI Sbjct: 178 GYELEIYSLEDGPVRDIWKTIGVPVNI--------------------------------- 204 Query: 531 QEPFRSVPLIWTIHEKTLATRAAKYVSSGQVDMINDWKAIFNRATVVVFPNYALPMFYAA 352 EPF+SVPLIW +HEK LATRA +Y+ GQV++I++WK IFNRATVVVFPNYALPMFYAA Sbjct: 205 -EPFKSVPLIWAVHEKALATRATRYIWGGQVELIDEWKTIFNRATVVVFPNYALPMFYAA 263 Query: 351 FDAGNYFVVPGCPSGACKINNSTIIHEENLRVNMNIGVEDFVVGIVGSEFLYKGIWLEHA 172 FDAGNYFVVPG SGACKI+NSTII+EENLR NMNI ++FVV IVGSEFLY GIWLEHA Sbjct: 264 FDAGNYFVVPGSTSGACKIDNSTIIYEENLRENMNISNDEFVVAIVGSEFLYNGIWLEHA 323 Query: 171 XXXXXXXXXXXXXXVGDSSSRRLKIIVLSQDSTGNYSAAMEEIASNLNYPRGTVKH 4 VGDS S LKI++LS+D TGNYSAAMEEIASNLNYPRGTV H Sbjct: 324 LVLQALLPLLTKFRVGDSLSPHLKIVILSRDLTGNYSAAMEEIASNLNYPRGTVNH 379 >ref|XP_023922256.1| uncharacterized protein LOC112033706 isoform X3 [Quercus suber] Length = 895 Score = 395 bits (1015), Expect = e-125 Identities = 209/416 (50%), Positives = 281/416 (67%), Gaps = 28/416 (6%) Frame = -1 Query: 1167 MGSVSPMLPIK--------SRTDTKIHN-STKPKSKF---VFLKKIDYLQWVSALAVFIF 1024 MGS+ +P+K S T+ H S +P+S+F + KK+DYLQW+ +AVF+F Sbjct: 1 
MGSLETGIPLKRDNRFRSFSSVRTERHPFSQRPRSRFSRFLLFKKLDYLQWICTVAVFLF 60 Query: 1023 FMFLVQLFLPLDKVDIRKD---------EVGQFEFDFLVDEIVSGLDFGEDLKFVVGPSK 871 F+ L Q+FLP + K EV +F FL + V LDFGED++F PSK Sbjct: 61 FVVLFQMFLP---GSVEKSGNSSLQDNVEVSSGDFKFLKEMGV--LDFGEDIRFE--PSK 113 Query: 870 FLMKFQRDGNVNV------SRSGVRFGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREIG 709 L KFQR+ + +R+ RF RK +L MVFADLLVDSQ++LMV+VA L+EIG Sbjct: 114 LLDKFQREAREAILYSPAFNRTKQRFSYRKPQLAMVFADLLVDSQKLLMVTVAVALQEIG 173 Query: 708 YEFEVYSLEDGPAHSIWKTMGVPVNIIEAKD-TGITIDWLNYDGVLVNSLAAKDVISSLL 532 YEF+VYSLEDGP H +W+T+G+PV II+A D TGI +DWLNYDG+LVNS A+ V S + Sbjct: 174 YEFQVYSLEDGPVHDVWRTIGIPVTIIQAFDKTGIFVDWLNYDGILVNSFEARGVFSCFV 233 Query: 531 QEPFRSVPLIWTIHEKTLATRAAKYVSSGQVDMINDWKAIFNRATVVVFPNYALPMFYAA 352 QEPF+S+PLIWTIHE++LATR+ KY+SSG ++++NDWK IFNR++VVVFPNY LPM Y+ Sbjct: 234 QEPFKSLPLIWTIHERSLATRSRKYISSGHINLLNDWKRIFNRSSVVVFPNYILPMIYST 293 Query: 351 FDAGNYFVVPGCPSGACKINNSTIIHEENLRVNMNIGVEDFVVGIVGSEFLYKGIWLEHA 172 FD GN+FV+PG P+ A + ++ + ++NLRV M +ED V+ IVGS+F+Y+G+WLEHA Sbjct: 294 FDVGNFFVIPGTPAEAWEADSVMALRKDNLRVKMGYELEDAVIAIVGSQFMYRGLWLEHA 353 Query: 171 XXXXXXXXXXXXXXVGDSSSRRLKIIVLSQDSTGNYSAAMEEIASNLNYPRGTVKH 4 + ++S+ LKI++LS DST NY EEIA NL YP G VKH Sbjct: 354 IILQALLPVLSDFPLDNNSNSNLKIVILSGDSTSNYGVVFEEIAINLTYPSGIVKH 409 >ref|XP_023922254.1| uncharacterized protein LOC112033706 isoform X2 [Quercus suber] gb|POE98166.1| hypothetical protein CFP56_57409 [Quercus suber] Length = 1033 Score = 395 bits (1015), Expect = e-124 Identities = 209/416 (50%), Positives = 281/416 (67%), Gaps = 28/416 (6%) Frame = -1 Query: 1167 MGSVSPMLPIK--------SRTDTKIHN-STKPKSKF---VFLKKIDYLQWVSALAVFIF 1024 MGS+ +P+K S T+ H S +P+S+F + KK+DYLQW+ +AVF+F Sbjct: 1 MGSLETGIPLKRDNRFRSFSSVRTERHPFSQRPRSRFSRFLLFKKLDYLQWICTVAVFLF 60 Query: 1023 FMFLVQLFLPLDKVDIRKD---------EVGQFEFDFLVDEIVSGLDFGEDLKFVVGPSK 871 F+ L Q+FLP + K EV +F FL + V LDFGED++F PSK Sbjct: 61 FVVLFQMFLP---GSVEKSGNSSLQDNVEVSSGDFKFLKEMGV--LDFGEDIRFE--PSK 113 Query: 870 
FLMKFQRDGNVNV------SRSGVRFGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREIG 709 L KFQR+ + +R+ RF RK +L MVFADLLVDSQ++LMV+VA L+EIG Sbjct: 114 LLDKFQREAREAILYSPAFNRTKQRFSYRKPQLAMVFADLLVDSQKLLMVTVAVALQEIG 173 Query: 708 YEFEVYSLEDGPAHSIWKTMGVPVNIIEAKD-TGITIDWLNYDGVLVNSLAAKDVISSLL 532 YEF+VYSLEDGP H +W+T+G+PV II+A D TGI +DWLNYDG+LVNS A+ V S + Sbjct: 174 YEFQVYSLEDGPVHDVWRTIGIPVTIIQAFDKTGIFVDWLNYDGILVNSFEARGVFSCFV 233 Query: 531 QEPFRSVPLIWTIHEKTLATRAAKYVSSGQVDMINDWKAIFNRATVVVFPNYALPMFYAA 352 QEPF+S+PLIWTIHE++LATR+ KY+SSG ++++NDWK IFNR++VVVFPNY LPM Y+ Sbjct: 234 QEPFKSLPLIWTIHERSLATRSRKYISSGHINLLNDWKRIFNRSSVVVFPNYILPMIYST 293 Query: 351 FDAGNYFVVPGCPSGACKINNSTIIHEENLRVNMNIGVEDFVVGIVGSEFLYKGIWLEHA 172 FD GN+FV+PG P+ A + ++ + ++NLRV M +ED V+ IVGS+F+Y+G+WLEHA Sbjct: 294 FDVGNFFVIPGTPAEAWEADSVMALRKDNLRVKMGYELEDAVIAIVGSQFMYRGLWLEHA 353 Query: 171 XXXXXXXXXXXXXXVGDSSSRRLKIIVLSQDSTGNYSAAMEEIASNLNYPRGTVKH 4 + ++S+ LKI++LS DST NY EEIA NL YP G VKH Sbjct: 354 IILQALLPVLSDFPLDNNSNSNLKIVILSGDSTSNYGVVFEEIAINLTYPSGIVKH 409 >ref|XP_023922253.1| uncharacterized protein LOC112033706 isoform X1 [Quercus suber] gb|POE98165.1| hypothetical protein CFP56_57409 [Quercus suber] Length = 1055 Score = 395 bits (1015), Expect = e-124 Identities = 209/416 (50%), Positives = 281/416 (67%), Gaps = 28/416 (6%) Frame = -1 Query: 1167 MGSVSPMLPIK--------SRTDTKIHN-STKPKSKF---VFLKKIDYLQWVSALAVFIF 1024 MGS+ +P+K S T+ H S +P+S+F + KK+DYLQW+ +AVF+F Sbjct: 1 MGSLETGIPLKRDNRFRSFSSVRTERHPFSQRPRSRFSRFLLFKKLDYLQWICTVAVFLF 60 Query: 1023 FMFLVQLFLPLDKVDIRKD---------EVGQFEFDFLVDEIVSGLDFGEDLKFVVGPSK 871 F+ L Q+FLP + K EV +F FL + V LDFGED++F PSK Sbjct: 61 FVVLFQMFLP---GSVEKSGNSSLQDNVEVSSGDFKFLKEMGV--LDFGEDIRFE--PSK 113 Query: 870 FLMKFQRDGNVNV------SRSGVRFGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREIG 709 L KFQR+ + +R+ RF RK +L MVFADLLVDSQ++LMV+VA L+EIG Sbjct: 114 LLDKFQREAREAILYSPAFNRTKQRFSYRKPQLAMVFADLLVDSQKLLMVTVAVALQEIG 173 Query: 708 YEFEVYSLEDGPAHSIWKTMGVPVNIIEAKD-TGITIDWLNYDGVLVNSLAAKDVISSLL 532 YEF+VYSLEDGP H +W+T+G+PV II+A D TGI 
+DWLNYDG+LVNS A+ V S + Sbjct: 174 YEFQVYSLEDGPVHDVWRTIGIPVTIIQAFDKTGIFVDWLNYDGILVNSFEARGVFSCFV 233 Query: 531 QEPFRSVPLIWTIHEKTLATRAAKYVSSGQVDMINDWKAIFNRATVVVFPNYALPMFYAA 352 QEPF+S+PLIWTIHE++LATR+ KY+SSG ++++NDWK IFNR++VVVFPNY LPM Y+ Sbjct: 234 QEPFKSLPLIWTIHERSLATRSRKYISSGHINLLNDWKRIFNRSSVVVFPNYILPMIYST 293 Query: 351 FDAGNYFVVPGCPSGACKINNSTIIHEENLRVNMNIGVEDFVVGIVGSEFLYKGIWLEHA 172 FD GN+FV+PG P+ A + ++ + ++NLRV M +ED V+ IVGS+F+Y+G+WLEHA Sbjct: 294 FDVGNFFVIPGTPAEAWEADSVMALRKDNLRVKMGYELEDAVIAIVGSQFMYRGLWLEHA 353 Query: 171 XXXXXXXXXXXXXXVGDSSSRRLKIIVLSQDSTGNYSAAMEEIASNLNYPRGTVKH 4 + ++S+ LKI++LS DST NY EEIA NL YP G VKH Sbjct: 354 IILQALLPVLSDFPLDNNSNSNLKIVILSGDSTSNYGVVFEEIAINLTYPSGIVKH 409 >gb|EOX95825.1| Glycosyl transferase family 1 protein isoform 2 [Theobroma cacao] Length = 686 Score = 383 bits (984), Expect = e-123 Identities = 193/406 (47%), Positives = 272/406 (66%), Gaps = 17/406 (4%) Frame = -1 Query: 1167 MGSVSPMLPIKSRTDTKIHNS--TKPKSKF---VFLKKIDYLQWVSALAVFIFFMFLVQL 1003 MGS+ + +K N +P+S+F + KK+DYLQW+ + VF+FF+ Q+ Sbjct: 1 MGSLESGISLKRAGSRNERNPFLNRPRSRFSRFLLFKKLDYLQWICTVVVFLFFVVFFQM 60 Query: 1002 FLPLDKVDIRKDEVGQFEFDFLVDEI-----VSGLDFGEDLKFVVGPSKFLMKFQRDGNV 838 +LP +D +D + + D + E+ + GLDFGED++ + P K L KFQR+ V Sbjct: 61 YLPGSVMDKSQDSFLE-DKDLVYGELRYLKEMGGLDFGEDIR--LEPRKLLEKFQRENKV 117 Query: 837 -------NVSRSGVRFGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREIGYEFEVYSLED 679 +RS RF RK +L +VFADLLVD QQ+LMV++A LREIGY +VYSLED Sbjct: 118 LNLESSSGFNRSQHRFQYRKPQLALVFADLLVDPQQLLMVTIATALREIGYAIQVYSLED 177 Query: 678 GPAHSIWKTMGVPVNIIEAKDTGITIDWLNYDGVLVNSLAAKDVISSLLQEPFRSVPLIW 499 GP H++W+++GVPV++++ I +DWLNYDG+LV+SL AK V SS +QEPF+S+PLIW Sbjct: 178 GPVHNVWQSIGVPVSVLQVNSNEIGVDWLNYDGILVSSLEAKGVFSSFMQEPFKSIPLIW 237 Query: 498 TIHEKTLATRAAKYVSSGQVDMINDWKAIFNRATVVVFPNYALPMFYAAFDAGNYFVVPG 319 TIHE+TLA R+ ++ SSGQ++++N+WK +F+RATVVVFPNYALPM Y+AFD GNY+V+PG Sbjct: 238 TIHERTLAVRSRQFTSSGQIELVNNWKKVFSRATVVVFPNYALPMIYSAFDTGNYYVIPG 297 Query: 318 
CPSGACKINNSTIIHEENLRVNMNIGVEDFVVGIVGSEFLYKGIWLEHAXXXXXXXXXXX 139 P+ A K N+ ++++N RV M G ++ ++ IVGS+F+Y+G+WLEHA Sbjct: 298 SPAEAWKGENAMNLYKDNQRVKMGYGPDEVLIAIVGSQFMYRGLWLEHAIVLQALLPLFT 357 Query: 138 XXXVGDSSSRRLKIIVLSQDSTGNYSAAMEEIASNLNYPRGTVKHV 1 +S+ KII+LS DST NYS A+E I NL YP G VKHV Sbjct: 358 DFSSDTNSNSHPKIIILSGDSTSNYSMAVERITHNLKYPSGVVKHV 403 >dbj|GAV75395.1| Glycos_transf_1 domain-containing protein [Cephalotus follicularis] Length = 1023 Score = 391 bits (1005), Expect = e-123 Identities = 205/400 (51%), Positives = 270/400 (67%), Gaps = 11/400 (2%) Frame = -1 Query: 1167 MGSVSPMLPIKSRTDTKIHNSTK-PKSKF---VFLKKIDYLQWVSALAVFIFFMFLVQLF 1000 MGS+ +P+K + ++T+ P S+F + KK+DYLQW+S + VF+FF+ +F Sbjct: 1 MGSLESGVPLKRESLFGSSSATRRPGSRFCRFLLFKKLDYLQWISTVLVFLFFLVWFPMF 60 Query: 999 LPLDKVDIRKDEVGQFEFDFLVD-EIVSGLDFGEDLKFVVGPSKFLMKFQRDG---NVNV 832 LP +D K ++ L+ + + G DFGED+ F PS L KF R+ N++ Sbjct: 61 LPGLVMD--KSHANDVDYGNLMHLKEIGGFDFGEDIVFE--PSMLLEKFHREAVEFNLSA 116 Query: 831 SRSGVR--FGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREIGYEFEVYSLEDGPAHSIW 658 S +G R FG RK +L +VF DLLVD QQ+LMV+VA+ L+EIGYE ++YS EDGP H +W Sbjct: 117 SFNGTRRRFGYRKPQLALVFPDLLVDPQQLLMVTVASALQEIGYEIQIYSFEDGPVHEVW 176 Query: 657 KTMGVPVNIIEAKDT-GITIDWLNYDGVLVNSLAAKDVISSLLQEPFRSVPLIWTIHEKT 481 K MG+PV I++ I +DWLNYDG++VNSL A + S L+QEPF+SVPLIWTIHEK Sbjct: 177 KNMGIPVTIVQTSHKMEIVVDWLNYDGIIVNSLEATGIFSRLMQEPFKSVPLIWTIHEKA 236 Query: 480 LATRAAKYVSSGQVDMINDWKAIFNRATVVVFPNYALPMFYAAFDAGNYFVVPGCPSGAC 301 LA +Y S GQ+ ++NDWK +FNRATVVVFPNYALP+ Y+AFDAGNY+V+PG P A Sbjct: 237 LALCLREYNSRGQIALVNDWKKVFNRATVVVFPNYALPIIYSAFDAGNYYVIPGSPVEAW 296 Query: 300 KINNSTIIHEENLRVNMNIGVEDFVVGIVGSEFLYKGIWLEHAXXXXXXXXXXXXXXVGD 121 K N T +H+++LRV M G EDFV+ IVGS+FLYKG+WLEHA Sbjct: 297 KANTITELHKDDLRVKMGYGPEDFVIAIVGSQFLYKGLWLEHALVLQALLPLFADFSFEG 356 Query: 120 SSSRRLKIIVLSQDSTGNYSAAMEEIASNLNYPRGTVKHV 1 +SS LK++VLS DSTGNYS A+E IA NL YPRGTVK + Sbjct: 357 NSSSHLKVLVLSGDSTGNYSVAVEAIARNLKYPRGTVKFI 396 >ref|XP_021644116.1| uncharacterized protein LOC110638026 [Hevea 
brasiliensis] Length = 1036 Score = 388 bits (996), Expect = e-121 Identities = 199/410 (48%), Positives = 280/410 (68%), Gaps = 21/410 (5%) Frame = -1 Query: 1167 MGSVSPMLPIKSRTDTKIHNSTKPK------------SKFVFLKKIDYLQWVSALAVFIF 1024 MGS+ LP+K + + ++++ + S+F+ KK+DYLQW+ +AVF+F Sbjct: 1 MGSLESALPLKRESLLRSSSASRSERYPFLLRPRSRFSRFLLSKKLDYLQWICTVAVFLF 60 Query: 1023 FMFLVQLFLPLDKVDIRKDEVGQFEF---DFLVDEIVSGLDFGEDLKFVVGPSKFLMKFQ 853 F+FL Q FLP ++ +D Q + D L + + LDFGED+K + PSK + KFQ Sbjct: 61 FVFLFQTFLPGSVIEKSQDWRKQLDMVYGDLLYLKDMGTLDFGEDIK--LEPSKLMEKFQ 118 Query: 852 R-----DGNVNVSRSGVRFGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREIGYEFEVYS 688 + D + + +R+ RFG RK +L +VFADLLVD QQ+LMV+VA L+EIGY +V+S Sbjct: 119 KEAREVDPSSSFNRTQHRFGYRKPQLALVFADLLVDPQQLLMVTVATALQEIGYTTQVFS 178 Query: 687 LEDGPAHSIWKTMGVPVNIIEAKDT-GITIDWLNYDGVLVNSLAAKDVISSLLQEPFRSV 511 LEDGPAH IWK++GVPV I ++ I +DWL YDG+LVNSL K V S +QEPF+S+ Sbjct: 179 LEDGPAHDIWKSIGVPVTIFQSNHRMEIAVDWLIYDGILVNSLETKVVFSCFMQEPFKSI 238 Query: 510 PLIWTIHEKTLATRAAKYVSSGQVDMINDWKAIFNRATVVVFPNYALPMFYAAFDAGNYF 331 PLIWTIHE+TLA R+ +Y +GQ++++NDWK +FNRATVVVFPN LP+ Y+AFDAGNY+ Sbjct: 239 PLIWTIHERTLAVRSRQYTVNGQIELVNDWKRVFNRATVVVFPNLVLPIMYSAFDAGNYY 298 Query: 330 VVPGCPSGACKINNSTIIHEENLRVNMNIGVEDFVVGIVGSEFLYKGIWLEHAXXXXXXX 151 V+PG P+ A + ++ ++++N+R+ M G +D V+ IVGS+FLY+G+WLEHA Sbjct: 299 VIPGSPAQAWEADDVVALYKDNVRLKMGYGPDDVVITIVGSQFLYRGLWLEHALILQALL 358 Query: 150 XXXXXXXVGDSSSRRLKIIVLSQDSTGNYSAAMEEIASNLNYPRGTVKHV 1 GD+S+ LKIIVLS +S+ NYS A+E IA NL+YPRG VKH+ Sbjct: 359 PLFSDIPFGDNSNFHLKIIVLSGNSSSNYSVAVEAIAVNLHYPRGAVKHI 408 >gb|OAY62220.1| hypothetical protein MANES_01G251000 [Manihot esculenta] Length = 817 Score = 379 bits (973), Expect = e-120 Identities = 196/411 (47%), Positives = 275/411 (66%), Gaps = 22/411 (5%) Frame = -1 Query: 1167 MGSVSPMLPIKSRTDTKIHN---------STKPKSKF---VFLKKIDYLQWVSALAVFIF 1024 MGS+ LP+K + + + S +P+S+F + +K+DYLQW+ +AVF+F Sbjct: 1 MGSLETALPLKRESLLRSSSAGRTERYPFSQRPRSRFSRFLLFRKLDYLQWICTVAVFLF 60 Query: 1023 
FMFLVQLFLPLDKVDIRKD---EVGQFEFDFLVDEIVSGLDFGEDLKFVVGPSKFLMKFQ 853 + Q+FLP ++ +D E+ D L + LDFGED+KF PSK + KF+ Sbjct: 61 VVISFQMFLPGSVIEKSQDSWKELDMVSGDLLSLKETGTLDFGEDIKFE--PSKLIEKFE 118 Query: 852 RDG------NVNVSRSGVRFGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREIGYEFEVY 691 ++ + N S + RFG +K +L +VFADLLVD QQ+LMV+VA L+EIGY +V+ Sbjct: 119 KEARDVNNLSFNFSVTQRRFGYKKPQLALVFADLLVDPQQLLMVTVATALQEIGYITQVF 178 Query: 690 SLEDGPAHSIWKTMGVPVNIIEAKDT-GITIDWLNYDGVLVNSLAAKDVISSLLQEPFRS 514 S+EDGPAH IWK++GVPV I ++K I +DWL YDG+LV+SL K V+S +QEPF+S Sbjct: 179 SIEDGPAHEIWKSIGVPVTIFQSKHRMEIAVDWLMYDGILVSSLETKVVLSCFMQEPFKS 238 Query: 513 VPLIWTIHEKTLATRAAKYVSSGQVDMINDWKAIFNRATVVVFPNYALPMFYAAFDAGNY 334 +PLIWTIHEK LA R+ KY +GQ+++ NDWK +FNRATVVVFPN+ LPM Y++FDAGNY Sbjct: 239 LPLIWTIHEKALAVRSRKYTENGQIELANDWKRVFNRATVVVFPNHVLPMMYSSFDAGNY 298 Query: 333 FVVPGCPSGACKINNSTIIHEENLRVNMNIGVEDFVVGIVGSEFLYKGIWLEHAXXXXXX 154 +V+PG P+ A + + ++++N+RV M G +D ++ IVGS+FLY+G+WLEHA Sbjct: 299 YVIPGSPAQAWEADALVALYKDNVRVKMGYGPDDIIITIVGSQFLYRGLWLEHALILQAL 358 Query: 153 XXXXXXXXVGDSSSRRLKIIVLSQDSTGNYSAAMEEIASNLNYPRGTVKHV 1 D+S RLKIIVLS +ST NY+ A+E IA NL+YPRG VKH+ Sbjct: 359 LPLFSKFPFDDNSISRLKIIVLSGNSTSNYTMAVEAIAVNLHYPRGAVKHI 409 >gb|EOX95824.1| Glycosyl transferase family 1 protein isoform 1 [Theobroma cacao] Length = 1026 Score = 383 bits (984), Expect = e-120 Identities = 193/406 (47%), Positives = 272/406 (66%), Gaps = 17/406 (4%) Frame = -1 Query: 1167 MGSVSPMLPIKSRTDTKIHNS--TKPKSKF---VFLKKIDYLQWVSALAVFIFFMFLVQL 1003 MGS+ + +K N +P+S+F + KK+DYLQW+ + VF+FF+ Q+ Sbjct: 1 MGSLESGISLKRAGSRNERNPFLNRPRSRFSRFLLFKKLDYLQWICTVVVFLFFVVFFQM 60 Query: 1002 FLPLDKVDIRKDEVGQFEFDFLVDEI-----VSGLDFGEDLKFVVGPSKFLMKFQRDGNV 838 +LP +D +D + + D + E+ + GLDFGED++ + P K L KFQR+ V Sbjct: 61 YLPGSVMDKSQDSFLE-DKDLVYGELRYLKEMGGLDFGEDIR--LEPRKLLEKFQRENKV 117 Query: 837 -------NVSRSGVRFGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREIGYEFEVYSLED 679 +RS RF RK +L +VFADLLVD QQ+LMV++A LREIGY +VYSLED Sbjct: 118 LNLESSSGFNRSQHRFQYRKPQLALVFADLLVDPQQLLMVTIATALREIGYAIQVYSLED 177 
Query: 678 GPAHSIWKTMGVPVNIIEAKDTGITIDWLNYDGVLVNSLAAKDVISSLLQEPFRSVPLIW 499 GP H++W+++GVPV++++ I +DWLNYDG+LV+SL AK V SS +QEPF+S+PLIW Sbjct: 178 GPVHNVWQSIGVPVSVLQVNSNEIGVDWLNYDGILVSSLEAKGVFSSFMQEPFKSIPLIW 237 Query: 498 TIHEKTLATRAAKYVSSGQVDMINDWKAIFNRATVVVFPNYALPMFYAAFDAGNYFVVPG 319 TIHE+TLA R+ ++ SSGQ++++N+WK +F+RATVVVFPNYALPM Y+AFD GNY+V+PG Sbjct: 238 TIHERTLAVRSRQFTSSGQIELVNNWKKVFSRATVVVFPNYALPMIYSAFDTGNYYVIPG 297 Query: 318 CPSGACKINNSTIIHEENLRVNMNIGVEDFVVGIVGSEFLYKGIWLEHAXXXXXXXXXXX 139 P+ A K N+ ++++N RV M G ++ ++ IVGS+F+Y+G+WLEHA Sbjct: 298 SPAEAWKGENAMNLYKDNQRVKMGYGPDEVLIAIVGSQFMYRGLWLEHAIVLQALLPLFT 357 Query: 138 XXXVGDSSSRRLKIIVLSQDSTGNYSAAMEEIASNLNYPRGTVKHV 1 +S+ KII+LS DST NYS A+E I NL YP G VKHV Sbjct: 358 DFSSDTNSNSHPKIIILSGDSTSNYSMAVERITHNLKYPSGVVKHV 403 >ref|XP_021683395.1| uncharacterized protein LOC110667009 isoform X2 [Hevea brasiliensis] Length = 863 Score = 379 bits (973), Expect = e-120 Identities = 195/406 (48%), Positives = 273/406 (67%), Gaps = 17/406 (4%) Frame = -1 Query: 1167 MGSVSPMLPIKSRTDTKIHNSTKPK------------SKFVFLKKIDYLQWVSALAVFIF 1024 MGS+ +P+K + + ++ + + S+F+ KK++ QW+ A+AVF F Sbjct: 1 MGSLDTGVPLKRESLLRSSSAARSERYPVWLRYRSRFSRFLLFKKLNNFQWICAMAVFFF 60 Query: 1023 FMFLVQLFLPLDKVDIRKD---EVGQFEFDFLVDEIVSGLDFGEDLKFVVGPSKFLMKFQ 853 F+ L ++FLP ++ +D E+ D L + + LDFGED+KF PSK + KFQ Sbjct: 61 FLILFEMFLPGFVIEKSQDSWKEMDMVSGDLLPLKEMGILDFGEDIKFE--PSKLMEKFQ 118 Query: 852 RDGN-VNVSRSGVRFGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREIGYEFEVYSLEDG 676 ++ VN+S + RFG K +L +VFADLLV+ QQ+LMV+VA L+EIGY +V+S+EDG Sbjct: 119 KEAREVNLSSTQHRFGYGKPQLALVFADLLVNPQQLLMVTVATALQEIGYTIQVFSVEDG 178 Query: 675 PAHSIWKTMGVPVNIIEAKD-TGITIDWLNYDGVLVNSLAAKDVISSLLQEPFRSVPLIW 499 PAH IWK++GVPV I ++K T I +DWL +DG+LVNSL KDVIS +QEPF+S+PLIW Sbjct: 179 PAHDIWKSIGVPVTIFQSKHKTEIAVDWLIFDGILVNSLETKDVISCFMQEPFKSLPLIW 238 Query: 498 TIHEKTLATRAAKYVSSGQVDMINDWKAIFNRATVVVFPNYALPMFYAAFDAGNYFVVPG 319 TIHE+TLA R+ +Y +GQ++++NDWK +FNR TVVVFPN LPM Y+AFDAGNY+V+PG Sbjct: 239 
TIHERTLAVRSRQYTENGQIELLNDWKRVFNRPTVVVFPNPVLPMMYSAFDAGNYYVIPG 298 Query: 318 CPSGACKINNSTIIHEENLRVNMNIGVEDFVVGIVGSEFLYKGIWLEHAXXXXXXXXXXX 139 P+ A K + +++N+RV M G +D V+ IVGS+FLY+G+WLEHA Sbjct: 299 SPAQAWKADAMVAFYKDNVRVKMGYGPDDVVITIVGSQFLYRGLWLEHALILRTLLPLFS 358 Query: 138 XXXVGDSSSRRLKIIVLSQDSTGNYSAAMEEIASNLNYPRGTVKHV 1 D+S+ LKIIVLS ++ NYSA +E IA L YPRG VKH+ Sbjct: 359 DFPFDDNSNSHLKIIVLSGNTISNYSAVVEAIAVKLRYPRGAVKHI 404 >ref|XP_007051667.2| PREDICTED: uncharacterized protein LOC18614048 [Theobroma cacao] Length = 1026 Score = 382 bits (981), Expect = e-119 Identities = 192/406 (47%), Positives = 272/406 (66%), Gaps = 17/406 (4%) Frame = -1 Query: 1167 MGSVSPMLPIKSRTDTKIHNS--TKPKSKF---VFLKKIDYLQWVSALAVFIFFMFLVQL 1003 MGS+ + +K N +P+S+F + KK+DYLQW+ + VF+FF+ Q+ Sbjct: 1 MGSLESGISLKRAGSRNERNPFLNRPRSRFSRFLLFKKLDYLQWICTVVVFLFFVVFFQM 60 Query: 1002 FLPLDKVDIRKDEVGQFEFDFLVDEI-----VSGLDFGEDLKFVVGPSKFLMKFQRDGNV 838 +LP +D +D + + D + E+ + GLDFGED++ + P K L KFQR+ V Sbjct: 61 YLPGSVMDKSQDSFLE-DKDLVYGELRYLKEMGGLDFGEDIR--LEPRKLLEKFQRENKV 117 Query: 837 -------NVSRSGVRFGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREIGYEFEVYSLED 679 +RS RF RK +L +VFADLLVD QQ+LMV++A LREIGY +VYSLED Sbjct: 118 LNLESSSGFNRSQHRFQYRKPQLALVFADLLVDPQQLLMVTIATALREIGYAIQVYSLED 177 Query: 678 GPAHSIWKTMGVPVNIIEAKDTGITIDWLNYDGVLVNSLAAKDVISSLLQEPFRSVPLIW 499 GP H++W+++GVPV++++ I +DWLNYDG+LV+SL AK V SS +QEPF+S+PLIW Sbjct: 178 GPVHNVWQSIGVPVSVLQVNSNEIGVDWLNYDGILVSSLEAKGVFSSFMQEPFKSIPLIW 237 Query: 498 TIHEKTLATRAAKYVSSGQVDMINDWKAIFNRATVVVFPNYALPMFYAAFDAGNYFVVPG 319 TIHE+TLA R+ ++ SSGQ++++N+WK +F+RATVVVFPNYALPM Y+AFD GNY+V+PG Sbjct: 238 TIHERTLAVRSRQFTSSGQIELVNNWKKVFSRATVVVFPNYALPMIYSAFDTGNYYVIPG 297 Query: 318 CPSGACKINNSTIIHEENLRVNMNIGVEDFVVGIVGSEFLYKGIWLEHAXXXXXXXXXXX 139 P+ A K N+ ++++N R+ M G ++ ++ IVGS+F+Y+G+WLEHA Sbjct: 298 SPAEAWKGENAMNLYKDNQRMKMGYGPDEVLIAIVGSQFMYRGLWLEHAIVLQALLPLFT 357 Query: 138 XXXVGDSSSRRLKIIVLSQDSTGNYSAAMEEIASNLNYPRGTVKHV 1 +S+ KII+LS DST NYS A+E I NL YP G VKHV Sbjct: 358 
DFSSDTNSNSHPKIIILSGDSTSNYSMAVERITHNLKYPSGVVKHV 403 >ref|XP_022757484.1| uncharacterized protein LOC111304797 [Durio zibethinus] Length = 1026 Score = 381 bits (979), Expect = e-119 Identities = 201/407 (49%), Positives = 271/407 (66%), Gaps = 18/407 (4%) Frame = -1 Query: 1167 MGSVSPMLPIK---SRTDTKIHNSTKPKSKFV---FLKKIDYLQWVSALAVFIFFMFLVQ 1006 MGS+ + +K SRT+ S +P+S+F KK+DY+QW+ + VF+FF+ Q Sbjct: 1 MGSLESGISLKRAGSRTERNPFLS-RPRSRFSRFWLFKKLDYIQWICTVVVFLFFVVFFQ 59 Query: 1005 LFLPLDKVDIRKDEVGQFEFDFLVDEI-----VSGLDFGEDLKFVVGPSKFLMKFQRDGN 841 +FLP +D +D D + E+ + GLDFGED++ + P K L KFQR+ Sbjct: 60 MFLPGSVMDKSQDSYLDNN-DLVFGELRYLKEIGGLDFGEDIR--LEPCKLLEKFQRENK 116 Query: 840 -VNV------SRSGVRFGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREIGYEFEVYSLE 682 VN+ +RS RF RK +L +VFADLLVD QQ+LMV+VA LREIGYE +VYSLE Sbjct: 117 EVNLKSPSGFNRSQHRFQYRKPQLALVFADLLVDPQQLLMVTVATALREIGYEIQVYSLE 176 Query: 681 DGPAHSIWKTMGVPVNIIEAKDTGITIDWLNYDGVLVNSLAAKDVISSLLQEPFRSVPLI 502 DGP H++W+++GVPV I++ I +DWLNYDG+L++SL AK V SS +Q+PF+S+PLI Sbjct: 177 DGPVHNVWQSIGVPVTILKVNPNEIGVDWLNYDGILISSLEAKSVFSSFMQDPFKSIPLI 236 Query: 501 WTIHEKTLATRAAKYVSSGQVDMINDWKAIFNRATVVVFPNYALPMFYAAFDAGNYFVVP 322 WTIHE+ LA R+ KY SSGQ++++NDWK +FNRATVVVFPNY LPM Y+AFDAGNY+V+P Sbjct: 237 WTIHERALAFRSRKYTSSGQIELVNDWKKVFNRATVVVFPNYLLPMIYSAFDAGNYYVIP 296 Query: 321 GCPSGACKINNSTIIHEENLRVNMNIGVEDFVVGIVGSEFLYKGIWLEHAXXXXXXXXXX 142 G P K N + ++N R+ M G ++ ++ IVGS+F+Y+G+WLEHA Sbjct: 297 GSPVEVWKGENVMNLFKDNQRMKMGYGPKEVLIAIVGSQFMYRGLWLEHALILQALLPLF 356 Query: 141 XXXXVGDSSSRRLKIIVLSQDSTGNYSAAMEEIASNLNYPRGTVKHV 1 +SS+ KIIVLS DS NYS A+E IA NL YP G VKHV Sbjct: 357 ADFSSDNSSNSHPKIIVLSSDSISNYSMAVERIALNLRYPSGVVKHV 403 >ref|XP_012083283.1| uncharacterized protein LOC105642906 [Jatropha curcas] gb|KDP28542.1| hypothetical protein JCGZ_14313 [Jatropha curcas] Length = 1033 Score = 380 bits (975), Expect = e-118 Identities = 197/408 (48%), Positives = 273/408 (66%), Gaps = 19/408 (4%) Frame = -1 Query: 1167 MGSVSPMLPIKSRTDTKIHNSTKPK----------SKFVFLKKIDYLQWVSALAVFIFFM 1018 
MGS+ +LP+K + + ++ + S+F+ KK+DYLQW+ +AVF+FF+ Sbjct: 1 MGSLETVLPLKRESLLRSSSAGRHSFMQRQPRSRFSRFLLFKKLDYLQWICTVAVFLFFV 60 Query: 1017 FLVQLFLPLDKVDIRKD---EVGQFEFDFLVDEIVSGLDFGEDLKFVVGPSKFLMKFQRD 847 L Q+FLP ++ +D EV D + + + DFGED+KF PSK L KFQ++ Sbjct: 61 VLFQMFLPGSVIEKSEDSWKEVENVSGDLMYLKEIGTWDFGEDIKFE--PSKILQKFQKE 118 Query: 846 -GNVNVS----RSGVRFGNRKVKLGMVFADLLVDSQQVLMVSVAAGLREIGYEFEVYSLE 682 VN S R+ +RFG +K +L +VFADL D QQ+LMV+VA L+EIGY +V+S++ Sbjct: 119 VREVNFSSSFNRTQLRFGYKKPQLALVFADLSADPQQLLMVTVATALQEIGYSIQVFSIQ 178 Query: 681 DGPAHSIWKTMGVPVNIIEAKDT-GITIDWLNYDGVLVNSLAAKDVISSLLQEPFRSVPL 505 DGP + IWK++GVPV I + I +DWL YDG+LVNSL K + S +QEPF+S+PL Sbjct: 179 DGPVNGIWKSIGVPVTIFQRNHKMEIAVDWLIYDGILVNSLETKAIFSCFMQEPFKSIPL 238 Query: 504 IWTIHEKTLATRAAKYVSSGQVDMINDWKAIFNRATVVVFPNYALPMFYAAFDAGNYFVV 325 IWTIHE+TLA R+ +Y S GQ ++++DWK +FNRATVVVFPNYALPM Y+AFDAGNY+V+ Sbjct: 239 IWTIHERTLAIRSRQYASDGQTELVSDWKRVFNRATVVVFPNYALPMMYSAFDAGNYYVI 298 Query: 324 PGCPSGACKINNSTIIHEENLRVNMNIGVEDFVVGIVGSEFLYKGIWLEHAXXXXXXXXX 145 PG P+ A + + ++++N+R+ M G +D V+ IVG +FLY+G+WLEHA Sbjct: 299 PGSPAEAWEA-DVMALYKDNVRLKMGYGPDDVVIAIVGGQFLYRGLWLEHALILQALLPA 357 Query: 144 XXXXXVGDSSSRRLKIIVLSQDSTGNYSAAMEEIASNLNYPRGTVKHV 1 D+S+ LKIIVLS +ST NYS A+E IA NLNYPRG VKHV Sbjct: 358 FQDFPFDDNSNSHLKIIVLSGNSTSNYSVAVETIAVNLNYPRGAVKHV 405