BLASTX nr result
ID: Chrysanthemum21_contig00010273
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00010273 (2380 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_022001050.1| heparanase-like protein 3 isoform X2 [Helian... 634 0.0
ref|XP_022001049.1| heparanase-like protein 3 isoform X1 [Helian... 634 0.0
ref|XP_021987587.1| hydroxyproline O-galactosyltransferase HPGT3... 605 0.0
gb|KVH90278.1| Glycoside hydrolase, family 79, partial [Cynara c... 608 0.0
gb|PLY63760.1| hypothetical protein LSAT_6X20321 [Lactuca sativa] 602 0.0
ref|XP_023747021.1| heparanase-like protein 3 isoform X2 [Lactuc... 602 0.0
ref|XP_023747020.1| heparanase-like protein 3 isoform X1 [Lactuc... 597 0.0
ref|XP_023761470.1| hydroxyproline O-galactosyltransferase HPGT3... 587 0.0
ref|XP_022001051.1| heparanase-like protein 3 isoform X3 [Helian... 583 0.0
ref|XP_018836253.1| PREDICTED: hydroxyproline O-galactosyltransf... 554 0.0
ref|XP_023747023.1| hydroxyproline O-galactosyltransferase HPGT3... 551 0.0
ref|XP_023761469.1| heparanase-like protein 3 [Lactuca sativa] 553 0.0
gb|PLY87253.1| hypothetical protein LSAT_1X43741 [Lactuca sativa] 558 0.0
ref|XP_002515480.2| PREDICTED: probable beta-1,3-galactosyltrans... 546 0.0
gb|KVH99807.1| protein of unknown function DUF4094 [Cynara cardu... 546 0.0
ref|XP_023872627.1| hydroxyproline O-galactosyltransferase HPGT3... 543 0.0
ref|XP_018822215.1| PREDICTED: hydroxyproline O-galactosyltransf... 542 0.0
ref|XP_004498753.1| PREDICTED: probable beta-1,3-galactosyltrans...
542 0.0 ref|XP_008445237.1| PREDICTED: hydroxyproline O-galactosyltransf... 542 0.0 ref|XP_007161194.1| hypothetical protein PHAVU_001G049900g [Phas... 541 0.0 >ref|XP_022001050.1| heparanase-like protein 3 isoform X2 [Helianthus annuus] Length = 406 Score = 634 bits (1634), Expect = 0.0 Identities = 307/385 (79%), Positives = 338/385 (87%) Frame = +2 Query: 2 AKIIFGLNALSQRQVNIDGSVVGPWNSSNAEALMKYTVDKGFTIHGWELGNELSANGIGA 181 AK+IFGLNAL QR+V +G+V+GPWNSS+AEALMKYTVDKGFTIHGWELGNELS GIGA Sbjct: 25 AKVIFGLNALYQREVKTNGAVIGPWNSSDAEALMKYTVDKGFTIHGWELGNELSGRGIGA 84 Query: 182 KITADQYALDTVALQNLVQKIYNTFQDKPLVLGPGGFFDYNWFDEFVRKSNDSLQVITQH 361 I ADQYA D V+LQNLVQKIY F+ KPLVLGPGGFFD NWF+EFV KS DSLQVITQH Sbjct: 85 SIGADQYASDMVSLQNLVQKIYKAFEVKPLVLGPGGFFDANWFNEFVNKSKDSLQVITQH 144 Query: 362 IYNLGPGVDKHLIEKILNPSYLDGGSQHFRDLQNILKNSGTSTVAWVGEAGGAYNSGRNQ 541 IYNLGPGVD HL+EKILNPSYLDGGSQ FRD+QNILK S STVAWVGEAGGAYNSG N Sbjct: 145 IYNLGPGVDNHLVEKILNPSYLDGGSQPFRDVQNILKKSKASTVAWVGEAGGAYNSGHNH 204 Query: 542 VTNAFVFSFWYLDQLGMASSYDTKTYCRQTLIGGNYGLLNTDTFVPNPDYYSALLWHRLM 721 VTN+FVFSFWYLDQLGMA+SYDTKTYCRQTLIGGNYGLLNTDT+VPNPDYYSALLWHRLM Sbjct: 205 VTNSFVFSFWYLDQLGMAASYDTKTYCRQTLIGGNYGLLNTDTYVPNPDYYSALLWHRLM 264 Query: 722 GRRVLSTRFHGTKKIRSYAHCSKHSDGITLLLINLASLAKTEVGLTVENVTILAASKQQE 901 GRRVLST F GT+ IRSYAHCSK S+G+T+LLINL + KTEVG+T+EN TI ASK+ Sbjct: 265 GRRVLSTSFQGTRMIRSYAHCSKRSNGMTILLINLDGITKTEVGVTIENETIAVASKR-- 322 Query: 902 HQTQKTHSSKLGEKELTREEYHLTAKNGNLNSHTILLNGNELTVNSTGSIPSLDPVQVNM 1081 Q ++THSSKL KE TREEYHLTAK+GNLNSHT+LLNG ELTVNSTG IPSLDPV+ N+ Sbjct: 323 -QIKQTHSSKLRNKEFTREEYHLTAKDGNLNSHTVLLNGKELTVNSTGIIPSLDPVEANL 381 Query: 1082 SSPINVAPFSIVFVHIPNINVPACT 1156 PI VAP+SIVFVHIP I++ ACT Sbjct: 382 RDPITVAPYSIVFVHIPGIHIQACT 406 >ref|XP_022001049.1| heparanase-like protein 3 isoform X1 [Helianthus annuus] gb|OTG01545.1| putative glycoside hydrolase, family 79 [Helianthus annuus] Length = 552 Score = 634 bits (1634), Expect = 0.0 Identities = 307/385 (79%), Positives = 338/385 (87%) Frame = +2 Query: 
2 AKIIFGLNALSQRQVNIDGSVVGPWNSSNAEALMKYTVDKGFTIHGWELGNELSANGIGA 181 AK+IFGLNAL QR+V +G+V+GPWNSS+AEALMKYTVDKGFTIHGWELGNELS GIGA Sbjct: 171 AKVIFGLNALYQREVKTNGAVIGPWNSSDAEALMKYTVDKGFTIHGWELGNELSGRGIGA 230 Query: 182 KITADQYALDTVALQNLVQKIYNTFQDKPLVLGPGGFFDYNWFDEFVRKSNDSLQVITQH 361 I ADQYA D V+LQNLVQKIY F+ KPLVLGPGGFFD NWF+EFV KS DSLQVITQH Sbjct: 231 SIGADQYASDMVSLQNLVQKIYKAFEVKPLVLGPGGFFDANWFNEFVNKSKDSLQVITQH 290 Query: 362 IYNLGPGVDKHLIEKILNPSYLDGGSQHFRDLQNILKNSGTSTVAWVGEAGGAYNSGRNQ 541 IYNLGPGVD HL+EKILNPSYLDGGSQ FRD+QNILK S STVAWVGEAGGAYNSG N Sbjct: 291 IYNLGPGVDNHLVEKILNPSYLDGGSQPFRDVQNILKKSKASTVAWVGEAGGAYNSGHNH 350 Query: 542 VTNAFVFSFWYLDQLGMASSYDTKTYCRQTLIGGNYGLLNTDTFVPNPDYYSALLWHRLM 721 VTN+FVFSFWYLDQLGMA+SYDTKTYCRQTLIGGNYGLLNTDT+VPNPDYYSALLWHRLM Sbjct: 351 VTNSFVFSFWYLDQLGMAASYDTKTYCRQTLIGGNYGLLNTDTYVPNPDYYSALLWHRLM 410 Query: 722 GRRVLSTRFHGTKKIRSYAHCSKHSDGITLLLINLASLAKTEVGLTVENVTILAASKQQE 901 GRRVLST F GT+ IRSYAHCSK S+G+T+LLINL + KTEVG+T+EN TI ASK+ Sbjct: 411 GRRVLSTSFQGTRMIRSYAHCSKRSNGMTILLINLDGITKTEVGVTIENETIAVASKR-- 468 Query: 902 HQTQKTHSSKLGEKELTREEYHLTAKNGNLNSHTILLNGNELTVNSTGSIPSLDPVQVNM 1081 Q ++THSSKL KE TREEYHLTAK+GNLNSHT+LLNG ELTVNSTG IPSLDPV+ N+ Sbjct: 469 -QIKQTHSSKLRNKEFTREEYHLTAKDGNLNSHTVLLNGKELTVNSTGIIPSLDPVEANL 527 Query: 1082 SSPINVAPFSIVFVHIPNINVPACT 1156 PI VAP+SIVFVHIP I++ ACT Sbjct: 528 RDPITVAPYSIVFVHIPGIHIQACT 552 >ref|XP_021987587.1| hydroxyproline O-galactosyltransferase HPGT3-like [Helianthus annuus] gb|OTG38522.1| putative galactosyltransferase family protein [Helianthus annuus] Length = 355 Score = 605 bits (1559), Expect = 0.0 Identities = 302/354 (85%), Positives = 318/354 (89%), Gaps = 1/354 (0%) Frame = -3 Query: 2324 NNSSLYYKEGYQLPTXXXXXXXXXXXXXXXXXXXXXXXXXS-CLAWLYIAGRLWQDAENR 2148 +NS YYKEG L T C+AWLYIAGRLWQDAENR Sbjct: 2 DNSPPYYKEGLPLSTTLSKSEKQRSRSSSRSSVPSIFFAFFSCVAWLYIAGRLWQDAENR 61 Query: 2147 MLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEMEITLAKSQGYLKDQLKQPG 1968 MLL+NLLM+N+AERPKVLTVEDKL+VLGCKDLER+IVEAEMEITLAKSQG+LKD+LKQPG 
Sbjct: 62 MLLANLLMQNSAERPKVLTVEDKLVVLGCKDLERKIVEAEMEITLAKSQGFLKDRLKQPG 121 Query: 1967 LSSNKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEERGIVIRFVIGRSPNRGDS 1788 LSS+KKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEERGIVIRFVIGRSPNRGDS Sbjct: 122 LSSSKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEERGIVIRFVIGRSPNRGDS 181 Query: 1787 LDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQNWDAEFYVKVDNNIALDLE 1608 LDRNID ENR+TKDFLILDGHEEADEESPKKAKFFFSTAVQNWDAEFYVKVDNNIALDLE Sbjct: 182 LDRNIDEENRTTKDFLILDGHEEADEESPKKAKFFFSTAVQNWDAEFYVKVDNNIALDLE 241 Query: 1607 GLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKFGDEKSYFRHASGSLVILSK 1428 GLIELLESRRGQD VYIGCMKSGEVVAE GKPWYEPDWWKFGDEKSYFRHA+GSL+ILSK Sbjct: 242 GLIELLESRRGQDSVYIGCMKSGEVVAEEGKPWYEPDWWKFGDEKSYFRHAAGSLIILSK 301 Query: 1427 NFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVCCSSIRQDKVCSLA 1266 NFAQYININSASLKSYAHEDT+IGSWMMGI+ATYIDE+RVCCSS RQDKVCSLA Sbjct: 302 NFAQYININSASLKSYAHEDTSIGSWMMGIKATYIDESRVCCSSSRQDKVCSLA 355 >gb|KVH90278.1| Glycoside hydrolase, family 79, partial [Cynara cardunculus var. scolymus] Length = 601 Score = 608 bits (1569), Expect = 0.0 Identities = 301/398 (75%), Positives = 336/398 (84%), Gaps = 13/398 (3%) Frame = +2 Query: 2 AKIIFGLNALSQRQVNIDGSVVGPWNSSNAEALMKYTVDKGFTIHGWELGN--------- 154 AK+IFGLNALSQR V++DGSVVGPWNSSNAEALMKYTVDKGFTIHGWELGN Sbjct: 205 AKVIFGLNALSQRHVSMDGSVVGPWNSSNAEALMKYTVDKGFTIHGWELGNSFGLKTNNC 264 Query: 155 ----ELSANGIGAKITADQYALDTVALQNLVQKIYNTFQDKPLVLGPGGFFDYNWFDEFV 322 ELS NGIGA+I ADQYA DT++LQNLVQ IY +F+ KP+VLGPGGFFD NWF E+V Sbjct: 265 IVGNELSGNGIGARIMADQYASDTISLQNLVQNIYKSFEVKPIVLGPGGFFDANWFTEYV 324 Query: 323 RKSNDSLQVITQHIYNLGPGVDKHLIEKILNPSYLDGGSQHFRDLQNILKNSGTSTVAWV 502 KSN SLQ ITQHIYNLGPGVD L+ KIL+PS LDGGSQ RDLQ ILK G ST+AWV Sbjct: 325 MKSNGSLQAITQHIYNLGPGVDNDLVNKILDPSCLDGGSQPLRDLQKILKEFGNSTIAWV 384 Query: 503 GEAGGAYNSGRNQVTNAFVFSFWYLDQLGMASSYDTKTYCRQTLIGGNYGLLNTDTFVPN 682 GEAGGAYNSG ++V+N F+FSFWYLDQLGMASSYDTKTYCRQ+LIGGNYGLLNT TFVPN Sbjct: 385 GEAGGAYNSGHDRVSNTFIFSFWYLDQLGMASSYDTKTYCRQSLIGGNYGLLNTVTFVPN 444 Query: 683 
PDYYSALLWHRLMGRRVLSTRFHGTKKIRSYAHCSKHSDGITLLLINLASLAKTEVGLTV 862 PDYYSALLWHRLMGR VL T F GTKKIRSYAHCSKHSDGITLLLINL S T +GL+V Sbjct: 445 PDYYSALLWHRLMGRHVLLTSFDGTKKIRSYAHCSKHSDGITLLLINLDSYTTTAIGLSV 504 Query: 863 ENVTILAASKQQEHQTQKTHSSKLGEKELTREEYHLTAKNGNLNSHTILLNGNELTVNST 1042 ENVT++ AS Q + QTQ+T S + E TREEYHLTAK+GNLNS T+LLNG EL+VNST Sbjct: 505 ENVTMITASNQLK-QTQRTQSFQSSSNEFTREEYHLTAKDGNLNSQTVLLNGKELSVNST 563 Query: 1043 GSIPSLDPVQVNMSSPINVAPFSIVFVHIPNINVPACT 1156 G IPSLDPV+VN+S+PI VAPFSIVFVH+P+I++PACT Sbjct: 564 GIIPSLDPVEVNISNPIIVAPFSIVFVHMPDIHLPACT 601 >gb|PLY63760.1| hypothetical protein LSAT_6X20321 [Lactuca sativa] Length = 538 Score = 602 bits (1551), Expect = 0.0 Identities = 299/387 (77%), Positives = 334/387 (86%), Gaps = 3/387 (0%) Frame = +2 Query: 2 AKIIFGLNALSQRQVNIDGSVVGPWNSSNAEALMKYTVDKGFTIHGWELGNELSA-NGIG 178 AK+IFGLNALS R N+D V+GPWNS+NAEALMKYT+DKGFTIHGWELGNELS N IG Sbjct: 153 AKVIFGLNALSHRNTNMD--VIGPWNSTNAEALMKYTIDKGFTIHGWELGNELSGFNAIG 210 Query: 179 AKITADQYALDTVALQNLVQKIYNTFQDKPLVLGPGGFFDYNWFDEFVRKSNDSLQVITQ 358 A I ADQYA DT++LQNLVQK+Y F KP+VLGPGGFFD NWF E+V KSN+SLQV+TQ Sbjct: 211 ASIKADQYASDTISLQNLVQKMYKNFAIKPIVLGPGGFFDENWFTEYVAKSNNSLQVLTQ 270 Query: 359 HIYNLGPGVDKHLIEKILNPSYLDGGSQHFRDLQNILKNSGTSTVAWVGEAGGAYNSGRN 538 HIYNLGPGVD HLI+KILNP YLDGGSQ F+D+QNILK SG+ TVAWVGEAGGAYNSGRN Sbjct: 271 HIYNLGPGVDTHLIKKILNPWYLDGGSQSFKDVQNILKESGSETVAWVGEAGGAYNSGRN 330 Query: 539 QVTNAFVFSFWYLDQLGMASSYDTKTYCRQTLIGGNYGLLNTDTFVPNPDYYSALLWHRL 718 +V+NAFVFSFWYLDQ+GMAS YDTKTYCRQTLIGGNYGLLNT TFVPNPDYYSALLWHRL Sbjct: 331 RVSNAFVFSFWYLDQMGMASLYDTKTYCRQTLIGGNYGLLNTATFVPNPDYYSALLWHRL 390 Query: 719 MGRRVLSTRFHGTKKIRSYAHCSKHSDGITLLLINLASLAKTEVGLTVENVTILAASKQQ 898 MGRRVLS F+G KKIRSYAHC+KHSDG+TLLLINL K +VG+++ENVTI+ AS Sbjct: 391 MGRRVLSASFNGIKKIRSYAHCAKHSDGLTLLLINLDGSIKAKVGVSIENVTIIMASTPD 450 Query: 899 EHQTQKTHSSKLGEKEL-TREEYHLTAKNGNLNSHTILLNGNELTVNS-TGSIPSLDPVQ 1072 TQ+T SS+ + EL REEYHLTAKNG LNS+TILLNG EL+VNS TG IPSLDP+Q Sbjct: 451 
LKPTQETKSSRNPKGELFRREEYHLTAKNGKLNSNTILLNGKELSVNSTTGIIPSLDPIQ 510 Query: 1073 VNMSSPINVAPFSIVFVHIPNINVPAC 1153 V + SPI VAPFSIVFVHIPNI+VPAC Sbjct: 511 VKLGSPIMVAPFSIVFVHIPNIHVPAC 537 >ref|XP_023747021.1| heparanase-like protein 3 isoform X2 [Lactuca sativa] Length = 556 Score = 602 bits (1551), Expect = 0.0 Identities = 299/387 (77%), Positives = 334/387 (86%), Gaps = 3/387 (0%) Frame = +2 Query: 2 AKIIFGLNALSQRQVNIDGSVVGPWNSSNAEALMKYTVDKGFTIHGWELGNELSA-NGIG 178 AK+IFGLNALS R N+D V+GPWNS+NAEALMKYT+DKGFTIHGWELGNELS N IG Sbjct: 171 AKVIFGLNALSHRNTNMD--VIGPWNSTNAEALMKYTIDKGFTIHGWELGNELSGFNAIG 228 Query: 179 AKITADQYALDTVALQNLVQKIYNTFQDKPLVLGPGGFFDYNWFDEFVRKSNDSLQVITQ 358 A I ADQYA DT++LQNLVQK+Y F KP+VLGPGGFFD NWF E+V KSN+SLQV+TQ Sbjct: 229 ASIKADQYASDTISLQNLVQKMYKNFAIKPIVLGPGGFFDENWFTEYVAKSNNSLQVLTQ 288 Query: 359 HIYNLGPGVDKHLIEKILNPSYLDGGSQHFRDLQNILKNSGTSTVAWVGEAGGAYNSGRN 538 HIYNLGPGVD HLI+KILNP YLDGGSQ F+D+QNILK SG+ TVAWVGEAGGAYNSGRN Sbjct: 289 HIYNLGPGVDTHLIKKILNPWYLDGGSQSFKDVQNILKESGSETVAWVGEAGGAYNSGRN 348 Query: 539 QVTNAFVFSFWYLDQLGMASSYDTKTYCRQTLIGGNYGLLNTDTFVPNPDYYSALLWHRL 718 +V+NAFVFSFWYLDQ+GMAS YDTKTYCRQTLIGGNYGLLNT TFVPNPDYYSALLWHRL Sbjct: 349 RVSNAFVFSFWYLDQMGMASLYDTKTYCRQTLIGGNYGLLNTATFVPNPDYYSALLWHRL 408 Query: 719 MGRRVLSTRFHGTKKIRSYAHCSKHSDGITLLLINLASLAKTEVGLTVENVTILAASKQQ 898 MGRRVLS F+G KKIRSYAHC+KHSDG+TLLLINL K +VG+++ENVTI+ AS Sbjct: 409 MGRRVLSASFNGIKKIRSYAHCAKHSDGLTLLLINLDGSIKAKVGVSIENVTIIMASTPD 468 Query: 899 EHQTQKTHSSKLGEKEL-TREEYHLTAKNGNLNSHTILLNGNELTVNS-TGSIPSLDPVQ 1072 TQ+T SS+ + EL REEYHLTAKNG LNS+TILLNG EL+VNS TG IPSLDP+Q Sbjct: 469 LKPTQETKSSRNPKGELFRREEYHLTAKNGKLNSNTILLNGKELSVNSTTGIIPSLDPIQ 528 Query: 1073 VNMSSPINVAPFSIVFVHIPNINVPAC 1153 V + SPI VAPFSIVFVHIPNI+VPAC Sbjct: 529 VKLGSPIMVAPFSIVFVHIPNIHVPAC 555 >ref|XP_023747020.1| heparanase-like protein 3 isoform X1 [Lactuca sativa] Length = 557 Score = 597 bits (1539), Expect = 0.0 Identities = 299/388 (77%), Positives = 334/388 (86%), Gaps = 4/388 (1%) Frame = +2 Query: 2 
AKIIFGLNALSQRQVNIDGSVVGPWNSSNAEALMKYTVDKGFTIHGWELGNELSA-NGIG 178 AK+IFGLNALS R N+D V+GPWNS+NAEALMKYT+DKGFTIHGWELGNELS N IG Sbjct: 171 AKVIFGLNALSHRNTNMD--VIGPWNSTNAEALMKYTIDKGFTIHGWELGNELSGFNAIG 228 Query: 179 AKITADQYALDTVALQNLVQKIYNTFQDKPLVLGPGGFFDYNWFDEFVRKSNDSLQVITQ 358 A I ADQYA DT++LQNLVQK+Y F KP+VLGPGGFFD NWF E+V KSN+SLQV+TQ Sbjct: 229 ASIKADQYASDTISLQNLVQKMYKNFAIKPIVLGPGGFFDENWFTEYVAKSNNSLQVLTQ 288 Query: 359 HIYNLGPGVDKHLIEKILNPSYLDGGSQHFRDLQNILKNSGTSTVAWVGEAGGAYNSGRN 538 HIYNLGPGVD HLI+KILNP YLDGGSQ F+D+QNILK SG+ TVAWVGEAGGAYNSGRN Sbjct: 289 HIYNLGPGVDTHLIKKILNPWYLDGGSQSFKDVQNILKESGSETVAWVGEAGGAYNSGRN 348 Query: 539 QVTNAFVFSFWYLDQLGMASSYDTKTYCRQTLIGGNYGLLNTDTFVPNPDYYSALLWHRL 718 +V+NAFVFSFWYLDQ+GMAS YDTKTYCRQTLIGGNYGLLNT TFVPNPDYYSALLWHRL Sbjct: 349 RVSNAFVFSFWYLDQMGMASLYDTKTYCRQTLIGGNYGLLNTATFVPNPDYYSALLWHRL 408 Query: 719 MGRRVLSTRFHGTKKIRSYAHCSKHS-DGITLLLINLASLAKTEVGLTVENVTILAASKQ 895 MGRRVLS F+G KKIRSYAHC+KHS DG+TLLLINL K +VG+++ENVTI+ AS Sbjct: 409 MGRRVLSASFNGIKKIRSYAHCAKHSQDGLTLLLINLDGSIKAKVGVSIENVTIIMASTP 468 Query: 896 QEHQTQKTHSSKLGEKEL-TREEYHLTAKNGNLNSHTILLNGNELTVNS-TGSIPSLDPV 1069 TQ+T SS+ + EL REEYHLTAKNG LNS+TILLNG EL+VNS TG IPSLDP+ Sbjct: 469 DLKPTQETKSSRNPKGELFRREEYHLTAKNGKLNSNTILLNGKELSVNSTTGIIPSLDPI 528 Query: 1070 QVNMSSPINVAPFSIVFVHIPNINVPAC 1153 QV + SPI VAPFSIVFVHIPNI+VPAC Sbjct: 529 QVKLGSPIMVAPFSIVFVHIPNIHVPAC 556 >ref|XP_023761470.1| hydroxyproline O-galactosyltransferase HPGT3-like isoform X1 [Lactuca sativa] ref|XP_023761472.1| hydroxyproline O-galactosyltransferase HPGT3-like isoform X1 [Lactuca sativa] ref|XP_023761473.1| hydroxyproline O-galactosyltransferase HPGT3-like isoform X1 [Lactuca sativa] gb|PLY87263.1| hypothetical protein LSAT_1X43720 [Lactuca sativa] Length = 356 Score = 587 bits (1513), Expect = 0.0 Identities = 291/356 (81%), Positives = 310/356 (87%), Gaps = 1/356 (0%) Frame = -3 Query: 2330 MENNSSLYYKEGYQLPTXXXXXXXXXXXXXXXXXXXXXXXXXS-CLAWLYIAGRLWQDAE 2154 M+NN Y+KEG LPT CLAWLYIAGRLWQDAE Sbjct: 1 
MDNNGPPYHKEGSPLPTTISKTEKQRSRSSSRSSVPSIFFAFFSCLAWLYIAGRLWQDAE 60 Query: 2153 NRMLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEMEITLAKSQGYLKDQLKQ 1974 NR +LS+LLMKN+AERPKVLTVE+KLMVLGCKDLERRIVE+EMEI+LAKSQG+LKDQLKQ Sbjct: 61 NRKVLSHLLMKNSAERPKVLTVEEKLMVLGCKDLERRIVESEMEISLAKSQGFLKDQLKQ 120 Query: 1973 PGLSSNKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEERGIVIRFVIGRSPNRG 1794 PG SS+KKLLAVIGVYTGFGSRLNR VFRGSWMP GDSLKKLEERGI+IRFVIGRSPNRG Sbjct: 121 PGFSSSKKLLAVIGVYTGFGSRLNRKVFRGSWMPTGDSLKKLEERGIIIRFVIGRSPNRG 180 Query: 1793 DSLDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQNWDAEFYVKVDNNIALD 1614 DSLDRNID ENR+TKDFLILD HEEADEES KKAKFFFSTAVQNWDAEFY+KVDNNI LD Sbjct: 181 DSLDRNIDEENRTTKDFLILDNHEEADEESSKKAKFFFSTAVQNWDAEFYIKVDNNIGLD 240 Query: 1613 LEGLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKFGDEKSYFRHASGSLVIL 1434 LEGLIELLESR GQD VYIGCMKSGEVV+E GKPWYEPDWWKFGDEKSYFRHASGSL+I+ Sbjct: 241 LEGLIELLESRHGQDSVYIGCMKSGEVVSEVGKPWYEPDWWKFGDEKSYFRHASGSLLII 300 Query: 1433 SKNFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVCCSSIRQDKVCSLA 1266 SK FAQYININSASLK+YAHEDT+IGSWMMGIQATYIDENRVCCS +QDKVCSLA Sbjct: 301 SKRFAQYININSASLKTYAHEDTSIGSWMMGIQATYIDENRVCCSGSQQDKVCSLA 356 >ref|XP_022001051.1| heparanase-like protein 3 isoform X3 [Helianthus annuus] Length = 349 Score = 583 bits (1504), Expect = 0.0 Identities = 283/352 (80%), Positives = 308/352 (87%) Frame = +2 Query: 101 MKYTVDKGFTIHGWELGNELSANGIGAKITADQYALDTVALQNLVQKIYNTFQDKPLVLG 280 MKYTVDKGFTIHGWELGNELS GIGA I ADQYA D V+LQNLVQKIY F+ KPLVLG Sbjct: 1 MKYTVDKGFTIHGWELGNELSGRGIGASIGADQYASDMVSLQNLVQKIYKAFEVKPLVLG 60 Query: 281 PGGFFDYNWFDEFVRKSNDSLQVITQHIYNLGPGVDKHLIEKILNPSYLDGGSQHFRDLQ 460 PGGFFD NWF+EFV KS DSLQVITQHIYNLGPGVD HL+EKILNPSYLDGGSQ FRD+Q Sbjct: 61 PGGFFDANWFNEFVNKSKDSLQVITQHIYNLGPGVDNHLVEKILNPSYLDGGSQPFRDVQ 120 Query: 461 NILKNSGTSTVAWVGEAGGAYNSGRNQVTNAFVFSFWYLDQLGMASSYDTKTYCRQTLIG 640 NILK S STVAWVGEAGGAYNSG N VTN+FVFSFWYLDQLGMA+SYDTKTYCRQTLIG Sbjct: 121 NILKKSKASTVAWVGEAGGAYNSGHNHVTNSFVFSFWYLDQLGMAASYDTKTYCRQTLIG 180 Query: 641 
GNYGLLNTDTFVPNPDYYSALLWHRLMGRRVLSTRFHGTKKIRSYAHCSKHSDGITLLLI 820 GNYGLLNTDT+VPNPDYYSALLWHRLMGRRVLST F GT+ IRSYAHCSK S+G+T+LLI Sbjct: 181 GNYGLLNTDTYVPNPDYYSALLWHRLMGRRVLSTSFQGTRMIRSYAHCSKRSNGMTILLI 240 Query: 821 NLASLAKTEVGLTVENVTILAASKQQEHQTQKTHSSKLGEKELTREEYHLTAKNGNLNSH 1000 NL + KTEVG+T+EN TI ASK+ Q ++THSSKL KE TREEYHLTAK+GNLNSH Sbjct: 241 NLDGITKTEVGVTIENETIAVASKR---QIKQTHSSKLRNKEFTREEYHLTAKDGNLNSH 297 Query: 1001 TILLNGNELTVNSTGSIPSLDPVQVNMSSPINVAPFSIVFVHIPNINVPACT 1156 T+LLNG ELTVNSTG IPSLDPV+ N+ PI VAP+SIVFVHIP I++ ACT Sbjct: 298 TVLLNGKELTVNSTGIIPSLDPVEANLRDPITVAPYSIVFVHIPGIHIQACT 349 >ref|XP_018836253.1| PREDICTED: hydroxyproline O-galactosyltransferase HPGT3-like [Juglans regia] Length = 344 Score = 554 bits (1427), Expect = 0.0 Identities = 265/312 (84%), Positives = 292/312 (93%) Frame = -3 Query: 2201 CLAWLYIAGRLWQDAENRMLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEME 2022 CLAWLY+AGRLW+DAENR LL+NLL KNA +RPKVLTVEDKLMVLGC+DLERRIVEAEM+ Sbjct: 33 CLAWLYVAGRLWEDAENRKLLANLLYKNALQRPKVLTVEDKLMVLGCRDLERRIVEAEMD 92 Query: 2021 ITLAKSQGYLKDQLKQPGLSSNKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEE 1842 +TLAKSQGYLKD+L+Q G SS +KLLAVIGVYTGFGSRL RNVFRGSWMPKGD+L+KLEE Sbjct: 93 LTLAKSQGYLKDKLQQSGSSSGQKLLAVIGVYTGFGSRLKRNVFRGSWMPKGDALRKLEE 152 Query: 1841 RGIVIRFVIGRSPNRGDSLDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQN 1662 RG+VIRFVIGRS NRGDSLDRNID E RSTKDFLIL+GHEEA EE PKKAKFFFSTAVQN Sbjct: 153 RGVVIRFVIGRSANRGDSLDRNIDEEYRSTKDFLILEGHEEAQEELPKKAKFFFSTAVQN 212 Query: 1661 WDAEFYVKVDNNIALDLEGLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKFG 1482 WDAEFYVKVD++I LDLEGLI LL+ RRGQDG YIGCMKSG+V+++ GK WYEPDWWKFG Sbjct: 213 WDAEFYVKVDDSIDLDLEGLIGLLDRRRGQDGAYIGCMKSGDVISDEGKSWYEPDWWKFG 272 Query: 1481 DEKSYFRHASGSLVILSKNFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVCC 1302 DEKSYFRHASGSL+ILSKN AQYININSASLKSYAH+D ++GSWMMG+QATYIDENR+CC Sbjct: 273 DEKSYFRHASGSLLILSKNLAQYININSASLKSYAHDDVSVGSWMMGLQATYIDENRLCC 332 Query: 1301 SSIRQDKVCSLA 1266 SSIRQDKVCSLA Sbjct: 333 SSIRQDKVCSLA 344 >ref|XP_023747023.1| hydroxyproline 
O-galactosyltransferase HPGT3-like [Lactuca sativa] gb|PLY63773.1| hypothetical protein LSAT_6X20300 [Lactuca sativa] Length = 355 Score = 551 bits (1420), Expect = 0.0 Identities = 266/312 (85%), Positives = 289/312 (92%) Frame = -3 Query: 2201 CLAWLYIAGRLWQDAENRMLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEME 2022 CLAWLYIAGRLWQDAENR LL+NLL+KN+++RPKVLTVEDKLMVLGCKDLERRIVE EME Sbjct: 47 CLAWLYIAGRLWQDAENRTLLANLLIKNSSQRPKVLTVEDKLMVLGCKDLERRIVETEME 106 Query: 2021 ITLAKSQGYLKDQLKQPGLSSNKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEE 1842 ITLAKSQG+L +QLK SSNKK LAVIG+YTGFG++L RN FRGSWMP+GDSLKKLEE Sbjct: 107 ITLAKSQGFLSNQLKS---SSNKKFLAVIGIYTGFGNKLRRNSFRGSWMPEGDSLKKLEE 163 Query: 1841 RGIVIRFVIGRSPNRGDSLDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQN 1662 RGIVIRF+IGRSPNRGDSLDRNID ENR+TKDFLILD HEEA+EESPKKAKFFFSTAVQN Sbjct: 164 RGIVIRFIIGRSPNRGDSLDRNIDEENRTTKDFLILDAHEEAEEESPKKAKFFFSTAVQN 223 Query: 1661 WDAEFYVKVDNNIALDLEGLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKFG 1482 WDAEFYVKVDNNI LDLEGLIELLESR+GQD +YIGCMKSGEVV+E GK WYEPDWWKFG Sbjct: 224 WDAEFYVKVDNNINLDLEGLIELLESRQGQDSLYIGCMKSGEVVSEEGKQWYEPDWWKFG 283 Query: 1481 DEKSYFRHASGSLVILSKNFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVCC 1302 D KSYFRHA GSL ILS+NFAQYININS SLK+YAHEDT++GSWMMGIQATYID+ RVCC Sbjct: 284 DAKSYFRHAGGSLYILSRNFAQYININSVSLKTYAHEDTSVGSWMMGIQATYIDDTRVCC 343 Query: 1301 SSIRQDKVCSLA 1266 + RQDKVCSLA Sbjct: 344 GTSRQDKVCSLA 355 >ref|XP_023761469.1| heparanase-like protein 3 [Lactuca sativa] Length = 396 Score = 553 bits (1424), Expect = 0.0 Identities = 272/392 (69%), Positives = 317/392 (80%), Gaps = 8/392 (2%) Frame = +2 Query: 2 AKIIFGLNALSQRQVNIDGSVVGPWNSSNAEALMKYTVDKGFTIHGWELG--NELSANGI 175 AK++FGLNAL+ RQ+ DG+ G W+ SNAEAL++YTV+ G+ I+GWELG NELS GI Sbjct: 4 AKVVFGLNALTGRQIGYDGTTFGSWDLSNAEALIRYTVNNGYVIYGWELGSGNELSGRGI 63 Query: 176 GAKITADQYALDTVALQNLVQKIYNTFQDKPLVLGPGGFFDYNWFDEFVRKSNDSLQVIT 355 G + A QYA DT++LQNLVQKIYN Q+KP+VLGPGGFFD NWF+ +V +++ SLQVIT Sbjct: 64 GTSVAAKQYASDTISLQNLVQKIYNGSQEKPIVLGPGGFFDANWFNVYVTEASGSLQVIT 123 Query: 356 
QHIYNLGPGVDKHLIEKILNPSYLDGGSQHFRDLQNILKNSGTSTVAWVGEAGGAYNSGR 535 QHIYNLGPGVD HL+EKILNPSYLDGGSQ FRDLQNILK S TSTVAWVGEAGGAYNSGR Sbjct: 124 QHIYNLGPGVDAHLVEKILNPSYLDGGSQPFRDLQNILKKSRTSTVAWVGEAGGAYNSGR 183 Query: 536 NQVTNAFVFSFWYLDQLGMASSYDTKTYCRQTLIGGNYGLLNTDTFVPNPDYYSALLWHR 715 N VTNAFVF FWYLDQLGMAS+Y+T TYCRQTLIGGNYGLLNT TFVPNPDYY ALLWHR Sbjct: 184 NLVTNAFVFGFWYLDQLGMASTYNTTTYCRQTLIGGNYGLLNTTTFVPNPDYYGALLWHR 243 Query: 716 LMGRRVLSTRFHGTKKIRSYAHCSKHSDGITLLLINLASLAKTEVGLTVENV--TILAAS 889 LMGR VLST F GT KIRSYAHCSK S GITLLLINL + T VG++ N I + Sbjct: 244 LMGRHVLSTNFSGTNKIRSYAHCSKTSTGITLLLINLDGIKTTNVGISFINTIKIITQTT 303 Query: 890 KQQEHQTQKTHSSKLGE----KELTREEYHLTAKNGNLNSHTILLNGNELTVNSTGSIPS 1057 K++ + ++T SK+ E+ REEYHLTAKNG+L+S +LLNG EL VNS+G IPS Sbjct: 304 KKEPKEQKRTKFSKMRRNPKVNEVIREEYHLTAKNGDLHSQIVLLNGKELIVNSSGIIPS 363 Query: 1058 LDPVQVNMSSPINVAPFSIVFVHIPNINVPAC 1153 L+P++ N SSPINVAP+SIVFVHIP++ PAC Sbjct: 364 LNPIKQNFSSPINVAPYSIVFVHIPSVRFPAC 395 >gb|PLY87253.1| hypothetical protein LSAT_1X43741 [Lactuca sativa] Length = 549 Score = 558 bits (1437), Expect = 0.0 Identities = 272/390 (69%), Positives = 317/390 (81%), Gaps = 6/390 (1%) Frame = +2 Query: 2 AKIIFGLNALSQRQVNIDGSVVGPWNSSNAEALMKYTVDKGFTIHGWELGNELSANGIGA 181 AK++FGLNAL+ RQ+ DG+ G W+ SNAEAL++YTV+ G+ I+GWELGNELS GIG Sbjct: 159 AKVVFGLNALTGRQIGYDGTTFGSWDLSNAEALIRYTVNNGYVIYGWELGNELSGRGIGT 218 Query: 182 KITADQYALDTVALQNLVQKIYNTFQDKPLVLGPGGFFDYNWFDEFVRKSNDSLQVITQH 361 + A QYA DT++LQNLVQKIYN Q+KP+VLGPGGFFD NWF+ +V +++ SLQVITQH Sbjct: 219 SVAAKQYASDTISLQNLVQKIYNGSQEKPIVLGPGGFFDANWFNVYVTEASGSLQVITQH 278 Query: 362 IYNLGPGVDKHLIEKILNPSYLDGGSQHFRDLQNILKNSGTSTVAWVGEAGGAYNSGRNQ 541 IYNLGPGVD HL+EKILNPSYLDGGSQ FRDLQNILK S TSTVAWVGEAGGAYNSGRN Sbjct: 279 IYNLGPGVDAHLVEKILNPSYLDGGSQPFRDLQNILKKSRTSTVAWVGEAGGAYNSGRNL 338 Query: 542 VTNAFVFSFWYLDQLGMASSYDTKTYCRQTLIGGNYGLLNTDTFVPNPDYYSALLWHRLM 721 VTNAFVF FWYLDQLGMAS+Y+T TYCRQTLIGGNYGLLNT TFVPNPDYY ALLWHRLM Sbjct: 339 VTNAFVFGFWYLDQLGMASTYNTTTYCRQTLIGGNYGLLNTTTFVPNPDYYGALLWHRLM 398 
Query: 722 GRRVLSTRFHGTKKIRSYAHCSKHSDGITLLLINLASLAKTEVGLTVENV--TILAASKQ 895 GR VLST F GT KIRSYAHCSK S GITLLLINL + T VG++ N I +K+ Sbjct: 399 GRHVLSTNFSGTNKIRSYAHCSKTSTGITLLLINLDGIKTTNVGISFINTIKIITQTTKK 458 Query: 896 QEHQTQKTHSSKLGE----KELTREEYHLTAKNGNLNSHTILLNGNELTVNSTGSIPSLD 1063 + + ++T SK+ E+ REEYHLTAKNG+L+S +LLNG EL VNS+G IPSL+ Sbjct: 459 EPKEQKRTKFSKMRRNPKVNEVIREEYHLTAKNGDLHSQIVLLNGKELIVNSSGIIPSLN 518 Query: 1064 PVQVNMSSPINVAPFSIVFVHIPNINVPAC 1153 P++ N SSPINVAP+SIVFVHIP++ PAC Sbjct: 519 PIKQNFSSPINVAPYSIVFVHIPSVRFPAC 548 >ref|XP_002515480.2| PREDICTED: probable beta-1,3-galactosyltransferase 9 [Ricinus communis] Length = 346 Score = 546 bits (1408), Expect = 0.0 Identities = 265/313 (84%), Positives = 287/313 (91%), Gaps = 1/313 (0%) Frame = -3 Query: 2201 CLAWLYIAGRLWQDAENRMLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEME 2022 CLAWLY+AGRLWQDAENRMLLSNLL N+A+RP+VLTVEDKL VLGCKDLERRIVEAEME Sbjct: 34 CLAWLYVAGRLWQDAENRMLLSNLLKLNSAQRPRVLTVEDKLAVLGCKDLERRIVEAEME 93 Query: 2021 ITLAKSQGYLKDQLKQPGLSSN-KKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLE 1845 +TLAKSQGYLK+QL G SS+ KKLLAVIGVYTGFGSRL RNVFRGSWMP+GD+LKKLE Sbjct: 94 LTLAKSQGYLKNQLPHSGSSSSGKKLLAVIGVYTGFGSRLKRNVFRGSWMPRGDALKKLE 153 Query: 1844 ERGIVIRFVIGRSPNRGDSLDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQ 1665 ERG+VIRFVIGRS NRGDSLDRNID EN STKDFLILDGHEEA EE PKKAKFFFSTAVQ Sbjct: 154 ERGVVIRFVIGRSANRGDSLDRNIDEENSSTKDFLILDGHEEAQEEIPKKAKFFFSTAVQ 213 Query: 1664 NWDAEFYVKVDNNIALDLEGLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKF 1485 WDAEFYVKVD+NI LDLEGLI LLE RRGQD Y+GCMKSG+V+ E GK WYEPDWWKF Sbjct: 214 KWDAEFYVKVDDNINLDLEGLIGLLERRRGQDSAYVGCMKSGDVITEEGKQWYEPDWWKF 273 Query: 1484 GDEKSYFRHASGSLVILSKNFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVC 1305 GDEKSYFRHASGSL ILSKN AQYININSASLK YAH+DT++GSWMMG+QATYID+NR+C Sbjct: 274 GDEKSYFRHASGSLFILSKNLAQYININSASLKMYAHDDTSVGSWMMGLQATYIDDNRLC 333 Query: 1304 CSSIRQDKVCSLA 1266 CSSI+QDKVCS+A Sbjct: 334 CSSIKQDKVCSVA 346 >gb|KVH99807.1| protein of unknown function DUF4094 [Cynara cardunculus var. 
scolymus] Length = 338 Score = 546 bits (1407), Expect = 0.0 Identities = 284/353 (80%), Positives = 294/353 (83%), Gaps = 1/353 (0%) Frame = -3 Query: 2321 NSSLYYKEGYQLPTXXXXXXXXXXXXXXXXXXXXXXXXXS-CLAWLYIAGRLWQDAENRM 2145 NS YYKEG LPT CLAWLYIAGRLWQDAENRM Sbjct: 3 NSPPYYKEGLPLPTTISKTEKQRSRSSSRSSIPSIFFAFFSCLAWLYIAGRLWQDAENRM 62 Query: 2144 LLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEMEITLAKSQGYLKDQLKQPGL 1965 LLSNLLMKN+AERPKVLTVEDKLMVLGCKDLERRIVEAEMEITLAKSQG+L DQLKQPG Sbjct: 63 LLSNLLMKNSAERPKVLTVEDKLMVLGCKDLERRIVEAEMEITLAKSQGFLTDQLKQPGN 122 Query: 1964 SSNKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEERGIVIRFVIGRSPNRGDSL 1785 SS KKLLAVIGVYTGFGSRLNRNVFRGSWMP G+SLKKLEERGIVIRFVIGRSPNRGDSL Sbjct: 123 SSQKKLLAVIGVYTGFGSRLNRNVFRGSWMPTGNSLKKLEERGIVIRFVIGRSPNRGDSL 182 Query: 1784 DRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQNWDAEFYVKVDNNIALDLEG 1605 DRNID ENR+TKDFLILDGHEEADEESPKKAKFFFSTA+QNWDAEFYVKVDNNIALDLEG Sbjct: 183 DRNIDEENRATKDFLILDGHEEADEESPKKAKFFFSTAIQNWDAEFYVKVDNNIALDLEG 242 Query: 1604 LIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKFGDEKSYFRHASGSLVILSKN 1425 LIELLESRRGQD VY+GCMKSGEVVAE YFRHASGSL+ILSKN Sbjct: 243 LIELLESRRGQDSVYLGCMKSGEVVAE-----------------EYFRHASGSLLILSKN 285 Query: 1424 FAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVCCSSIRQDKVCSLA 1266 FAQYININSASLK+YAHEDT+IGSWMMGIQATYIDENR CCS QDKVCSL+ Sbjct: 286 FAQYININSASLKTYAHEDTSIGSWMMGIQATYIDENRACCSGSVQDKVCSLS 338 >ref|XP_023872627.1| hydroxyproline O-galactosyltransferase HPGT3-like [Quercus suber] gb|POF23808.1| hydroxyproline o-galactosyltransferase hpgt2 [Quercus suber] Length = 344 Score = 543 bits (1398), Expect = 0.0 Identities = 257/312 (82%), Positives = 287/312 (91%) Frame = -3 Query: 2201 CLAWLYIAGRLWQDAENRMLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEME 2022 CLAWLY+AGRLWQDAENR +L+NLL KN+ +RPK+LTVEDKL VLGC+DLERRIVEAEME Sbjct: 33 CLAWLYVAGRLWQDAENRKVLTNLLYKNSLQRPKILTVEDKLSVLGCRDLERRIVEAEME 92 Query: 2021 ITLAKSQGYLKDQLKQPGLSSNKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEE 1842 +TLAKSQGYL QL+Q G SS KKLLAVIGVYTGFGSRL RNVFRGSWMPKGD+L+KLEE Sbjct: 
93 LTLAKSQGYLNKQLQQSGSSSGKKLLAVIGVYTGFGSRLKRNVFRGSWMPKGDALRKLEE 152 Query: 1841 RGIVIRFVIGRSPNRGDSLDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQN 1662 RG+VIRFVIGRS NRGDSLDRNI+ ENRSTKDFLIL+GHEEA EE PKKAKFF STAVQ Sbjct: 153 RGVVIRFVIGRSANRGDSLDRNINEENRSTKDFLILEGHEEAQEELPKKAKFFLSTAVQK 212 Query: 1661 WDAEFYVKVDNNIALDLEGLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKFG 1482 WDA+F+VKVD+NI LDLE LI LLE RRGQDG YIGCMKSG+V++E GKPWYEPDWWKFG Sbjct: 213 WDADFFVKVDDNIDLDLEALIGLLERRRGQDGAYIGCMKSGDVISEEGKPWYEPDWWKFG 272 Query: 1481 DEKSYFRHASGSLVILSKNFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVCC 1302 DEKSYFRHA +L+ILSKN AQY+NINSASLK+YAH+DT++GSWMMG+QATYID+NR+CC Sbjct: 273 DEKSYFRHAGTALIILSKNLAQYVNINSASLKTYAHDDTSVGSWMMGLQATYIDDNRLCC 332 Query: 1301 SSIRQDKVCSLA 1266 SSIRQDKVCSLA Sbjct: 333 SSIRQDKVCSLA 344 >ref|XP_018822215.1| PREDICTED: hydroxyproline O-galactosyltransferase HPGT3-like [Juglans regia] Length = 344 Score = 542 bits (1397), Expect = 0.0 Identities = 258/312 (82%), Positives = 287/312 (91%) Frame = -3 Query: 2201 CLAWLYIAGRLWQDAENRMLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEME 2022 CLAWLY+AGRLWQDAENR LLSNLL KN+ +RPKVLTVEDKL VLGC+DLERRIVEAEME Sbjct: 33 CLAWLYVAGRLWQDAENRKLLSNLLYKNSLQRPKVLTVEDKLTVLGCRDLERRIVEAEME 92 Query: 2021 ITLAKSQGYLKDQLKQPGLSSNKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEE 1842 +TLAKSQGYL +QL+Q SS ++LLAVIG+YTGFGS L RNVFRGSWMPKGD+L+KLEE Sbjct: 93 LTLAKSQGYLNNQLQQSKSSSGRRLLAVIGLYTGFGSHLKRNVFRGSWMPKGDALRKLEE 152 Query: 1841 RGIVIRFVIGRSPNRGDSLDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQN 1662 RG+VIRFVIGRS NRGDSLDRNID ENR+TKDFLIL+GHEEA EE PKK K+FFSTAVQ Sbjct: 153 RGVVIRFVIGRSANRGDSLDRNIDKENRTTKDFLILEGHEEAQEELPKKVKYFFSTAVQK 212 Query: 1661 WDAEFYVKVDNNIALDLEGLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKFG 1482 WDAEFYVKVD+NI LDLEGLI LL+ RRGQDG YIGCMKSG+V++E GKPWYEPDWWKFG Sbjct: 213 WDAEFYVKVDDNIDLDLEGLIGLLDRRRGQDGAYIGCMKSGDVISEEGKPWYEPDWWKFG 272 Query: 1481 DEKSYFRHASGSLVILSKNFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVCC 1302 DEKSYFRHA+GSL+ILSKN AQYININSASLK+YAH+D ++GSWMMGIQATY D+NR+CC Sbjct: 273 
DEKSYFRHAAGSLLILSKNLAQYININSASLKTYAHDDVSMGSWMMGIQATYTDDNRLCC 332 Query: 1301 SSIRQDKVCSLA 1266 SSIRQDKVCSLA Sbjct: 333 SSIRQDKVCSLA 344 >ref|XP_004498753.1| PREDICTED: probable beta-1,3-galactosyltransferase 10 [Cicer arietinum] Length = 344 Score = 542 bits (1397), Expect = 0.0 Identities = 258/312 (82%), Positives = 290/312 (92%) Frame = -3 Query: 2201 CLAWLYIAGRLWQDAENRMLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEME 2022 C+AWLY+AGRLWQDAENR LL++LL KN+ +RPKVLTVEDKLMVLGC+DLERRIV+AEME Sbjct: 33 CVAWLYVAGRLWQDAENRNLLTSLLKKNSEQRPKVLTVEDKLMVLGCRDLERRIVDAEME 92 Query: 2021 ITLAKSQGYLKDQLKQPGLSSNKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEE 1842 +TLAKSQGYLK Q +Q G SS+++LLAVIGVYTGFGSRL RN FRGSWMP+GD+LKKLEE Sbjct: 93 LTLAKSQGYLKGQRQQTGSSSDRRLLAVIGVYTGFGSRLKRNEFRGSWMPRGDALKKLEE 152 Query: 1841 RGIVIRFVIGRSPNRGDSLDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQN 1662 RG+VIRFVIGRS NRGDSLDRNID ENRSTKDFLILD HEEA EE PKKAK FFSTAVQN Sbjct: 153 RGVVIRFVIGRSANRGDSLDRNIDEENRSTKDFLILDSHEEAQEELPKKAKIFFSTAVQN 212 Query: 1661 WDAEFYVKVDNNIALDLEGLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKFG 1482 WDA+FYVKVD++I +DLEGLIELLE RRGQDG YIGCMKSG+V++E GK WYEPDWWKFG Sbjct: 213 WDADFYVKVDDSIGIDLEGLIELLEHRRGQDGAYIGCMKSGDVISEEGKLWYEPDWWKFG 272 Query: 1481 DEKSYFRHASGSLVILSKNFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVCC 1302 DEKSYFRHA+GSLVILSKN AQYININS SLK+YA++DT++GSWMMGIQ+TYID+NR+CC Sbjct: 273 DEKSYFRHAAGSLVILSKNLAQYININSVSLKTYAYDDTSLGSWMMGIQSTYIDDNRLCC 332 Query: 1301 SSIRQDKVCSLA 1266 SSIRQDKVCSLA Sbjct: 333 SSIRQDKVCSLA 344 >ref|XP_008445237.1| PREDICTED: hydroxyproline O-galactosyltransferase HPGT3-like [Cucumis melo] Length = 346 Score = 542 bits (1397), Expect = 0.0 Identities = 258/313 (82%), Positives = 290/313 (92%), Gaps = 2/313 (0%) Frame = -3 Query: 2201 CLAWLYIAGRLWQDAENRMLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEME 2022 CLAWLY+AGRLWQDAENR LLS LL KNA++RP +L+VEDKL VLGCKDLERRIVE EM+ Sbjct: 33 CLAWLYVAGRLWQDAENRKLLSTLLQKNASQRPVILSVEDKLQVLGCKDLERRIVEVEMD 92 Query: 2021 
ITLAKSQGYLKDQLKQPGLSSN--KKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKL 1848 +TLAKSQGYLK+QL+Q G SSN +KLLAVIGVYTGFGSRL RNVFRGSWMPKGD+LKKL Sbjct: 93 LTLAKSQGYLKNQLRQSGSSSNPGRKLLAVIGVYTGFGSRLRRNVFRGSWMPKGDALKKL 152 Query: 1847 EERGIVIRFVIGRSPNRGDSLDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAV 1668 EERG++IRFVIGRS NRGDSLDRNID EN STKDFLIL+GHEEADEE PKKAKFFFSTAV Sbjct: 153 EERGVIIRFVIGRSANRGDSLDRNIDKENHSTKDFLILEGHEEADEELPKKAKFFFSTAV 212 Query: 1667 QNWDAEFYVKVDNNIALDLEGLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWK 1488 QNWDAEFYVKVD++I LDLEGLI LLE RRGQDG Y+GCMKSG+V+AE GK WYEP+WWK Sbjct: 213 QNWDAEFYVKVDDHIDLDLEGLIGLLEHRRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWK 272 Query: 1487 FGDEKSYFRHASGSLVILSKNFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRV 1308 FGDEKSYFRHASG+L+ILSKN AQYININSASLK+YAH+D ++GSWM+G+QAT+ID+NR+ Sbjct: 273 FGDEKSYFRHASGALIILSKNLAQYININSASLKTYAHDDISVGSWMIGLQATHIDDNRL 332 Query: 1307 CCSSIRQDKVCSL 1269 CCSSIRQDKVCS+ Sbjct: 333 CCSSIRQDKVCSV 345 >ref|XP_007161194.1| hypothetical protein PHAVU_001G049900g [Phaseolus vulgaris] gb|ESW33188.1| hypothetical protein PHAVU_001G049900g [Phaseolus vulgaris] Length = 342 Score = 541 bits (1394), Expect = 0.0 Identities = 259/312 (83%), Positives = 292/312 (93%) Frame = -3 Query: 2201 CLAWLYIAGRLWQDAENRMLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEME 2022 C+AWLY+AGRLWQDAENR LL++LL KN+A+RPKVLTVEDKLMVLGC+DLERRIVEAEME Sbjct: 32 CVAWLYVAGRLWQDAENRNLLASLLKKNSAQRPKVLTVEDKLMVLGCRDLERRIVEAEME 91 Query: 2021 ITLAKSQGYLKDQLKQPGLSSNKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEE 1842 +TLAKSQGYLK Q ++ G SS+++LLAVIGVYTGFGSRL RNVFRGSWMP+GD+LKKLEE Sbjct: 92 LTLAKSQGYLKGQGQKSG-SSDRRLLAVIGVYTGFGSRLKRNVFRGSWMPRGDALKKLEE 150 Query: 1841 RGIVIRFVIGRSPNRGDSLDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQN 1662 RG+VIRFVIGRS NRGDSLDRNID ENRSTKDFLIL+GHEEA EE PKK K FFSTAVQN Sbjct: 151 RGVVIRFVIGRSANRGDSLDRNIDEENRSTKDFLILEGHEEAQEELPKKVKTFFSTAVQN 210 Query: 1661 WDAEFYVKVDNNIALDLEGLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKFG 1482 WDA+FYVKVD+NI +DLEGLIELLE RRGQDG YIGCMKSG+V++E GKPWYEPDWWKFG Sbjct: 211 
WDADFYVKVDDNIDIDLEGLIELLEHRRGQDGAYIGCMKSGDVISEDGKPWYEPDWWKFG 270 Query: 1481 DEKSYFRHASGSLVILSKNFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVCC 1302 DEKSYFRHA GSLVI+SKN AQYININSASLK+YA +DT++GSWMMGIQATYID++R+CC Sbjct: 271 DEKSYFRHAGGSLVIISKNLAQYININSASLKTYAFDDTSLGSWMMGIQATYIDDSRLCC 330 Query: 1301 SSIRQDKVCSLA 1266 SS+RQ+KVCSLA Sbjct: 331 SSVRQEKVCSLA 342