BLASTX nr result

ID: Chrysanthemum21_contig00010273

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00010273
         (2380 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_022001050.1| heparanase-like protein 3 isoform X2 [Helian...   634   0.0  
ref|XP_022001049.1| heparanase-like protein 3 isoform X1 [Helian...   634   0.0  
ref|XP_021987587.1| hydroxyproline O-galactosyltransferase HPGT3...   605   0.0  
gb|KVH90278.1| Glycoside hydrolase, family 79, partial [Cynara c...   608   0.0  
gb|PLY63760.1| hypothetical protein LSAT_6X20321 [Lactuca sativa]     602   0.0  
ref|XP_023747021.1| heparanase-like protein 3 isoform X2 [Lactuc...   602   0.0  
ref|XP_023747020.1| heparanase-like protein 3 isoform X1 [Lactuc...   597   0.0  
ref|XP_023761470.1| hydroxyproline O-galactosyltransferase HPGT3...   587   0.0  
ref|XP_022001051.1| heparanase-like protein 3 isoform X3 [Helian...   583   0.0  
ref|XP_018836253.1| PREDICTED: hydroxyproline O-galactosyltransf...   554   0.0  
ref|XP_023747023.1| hydroxyproline O-galactosyltransferase HPGT3...   551   0.0  
ref|XP_023761469.1| heparanase-like protein 3 [Lactuca sativa]        553   0.0  
gb|PLY87253.1| hypothetical protein LSAT_1X43741 [Lactuca sativa]     558   0.0  
ref|XP_002515480.2| PREDICTED: probable beta-1,3-galactosyltrans...   546   0.0  
gb|KVH99807.1| protein of unknown function DUF4094 [Cynara cardu...   546   0.0  
ref|XP_023872627.1| hydroxyproline O-galactosyltransferase HPGT3...   543   0.0  
ref|XP_018822215.1| PREDICTED: hydroxyproline O-galactosyltransf...   542   0.0  
ref|XP_004498753.1| PREDICTED: probable beta-1,3-galactosyltrans...   542   0.0  
ref|XP_008445237.1| PREDICTED: hydroxyproline O-galactosyltransf...   542   0.0  
ref|XP_007161194.1| hypothetical protein PHAVU_001G049900g [Phas...   541   0.0  

>ref|XP_022001050.1| heparanase-like protein 3 isoform X2 [Helianthus annuus]
          Length = 406

 Score =  634 bits (1634), Expect = 0.0
 Identities = 307/385 (79%), Positives = 338/385 (87%)
 Frame = +2

Query: 2    AKIIFGLNALSQRQVNIDGSVVGPWNSSNAEALMKYTVDKGFTIHGWELGNELSANGIGA 181
            AK+IFGLNAL QR+V  +G+V+GPWNSS+AEALMKYTVDKGFTIHGWELGNELS  GIGA
Sbjct: 25   AKVIFGLNALYQREVKTNGAVIGPWNSSDAEALMKYTVDKGFTIHGWELGNELSGRGIGA 84

Query: 182  KITADQYALDTVALQNLVQKIYNTFQDKPLVLGPGGFFDYNWFDEFVRKSNDSLQVITQH 361
             I ADQYA D V+LQNLVQKIY  F+ KPLVLGPGGFFD NWF+EFV KS DSLQVITQH
Sbjct: 85   SIGADQYASDMVSLQNLVQKIYKAFEVKPLVLGPGGFFDANWFNEFVNKSKDSLQVITQH 144

Query: 362  IYNLGPGVDKHLIEKILNPSYLDGGSQHFRDLQNILKNSGTSTVAWVGEAGGAYNSGRNQ 541
            IYNLGPGVD HL+EKILNPSYLDGGSQ FRD+QNILK S  STVAWVGEAGGAYNSG N 
Sbjct: 145  IYNLGPGVDNHLVEKILNPSYLDGGSQPFRDVQNILKKSKASTVAWVGEAGGAYNSGHNH 204

Query: 542  VTNAFVFSFWYLDQLGMASSYDTKTYCRQTLIGGNYGLLNTDTFVPNPDYYSALLWHRLM 721
            VTN+FVFSFWYLDQLGMA+SYDTKTYCRQTLIGGNYGLLNTDT+VPNPDYYSALLWHRLM
Sbjct: 205  VTNSFVFSFWYLDQLGMAASYDTKTYCRQTLIGGNYGLLNTDTYVPNPDYYSALLWHRLM 264

Query: 722  GRRVLSTRFHGTKKIRSYAHCSKHSDGITLLLINLASLAKTEVGLTVENVTILAASKQQE 901
            GRRVLST F GT+ IRSYAHCSK S+G+T+LLINL  + KTEVG+T+EN TI  ASK+  
Sbjct: 265  GRRVLSTSFQGTRMIRSYAHCSKRSNGMTILLINLDGITKTEVGVTIENETIAVASKR-- 322

Query: 902  HQTQKTHSSKLGEKELTREEYHLTAKNGNLNSHTILLNGNELTVNSTGSIPSLDPVQVNM 1081
             Q ++THSSKL  KE TREEYHLTAK+GNLNSHT+LLNG ELTVNSTG IPSLDPV+ N+
Sbjct: 323  -QIKQTHSSKLRNKEFTREEYHLTAKDGNLNSHTVLLNGKELTVNSTGIIPSLDPVEANL 381

Query: 1082 SSPINVAPFSIVFVHIPNINVPACT 1156
              PI VAP+SIVFVHIP I++ ACT
Sbjct: 382  RDPITVAPYSIVFVHIPGIHIQACT 406


>ref|XP_022001049.1| heparanase-like protein 3 isoform X1 [Helianthus annuus]
 gb|OTG01545.1| putative glycoside hydrolase, family 79 [Helianthus annuus]
          Length = 552

 Score =  634 bits (1634), Expect = 0.0
 Identities = 307/385 (79%), Positives = 338/385 (87%)
 Frame = +2

Query: 2    AKIIFGLNALSQRQVNIDGSVVGPWNSSNAEALMKYTVDKGFTIHGWELGNELSANGIGA 181
            AK+IFGLNAL QR+V  +G+V+GPWNSS+AEALMKYTVDKGFTIHGWELGNELS  GIGA
Sbjct: 171  AKVIFGLNALYQREVKTNGAVIGPWNSSDAEALMKYTVDKGFTIHGWELGNELSGRGIGA 230

Query: 182  KITADQYALDTVALQNLVQKIYNTFQDKPLVLGPGGFFDYNWFDEFVRKSNDSLQVITQH 361
             I ADQYA D V+LQNLVQKIY  F+ KPLVLGPGGFFD NWF+EFV KS DSLQVITQH
Sbjct: 231  SIGADQYASDMVSLQNLVQKIYKAFEVKPLVLGPGGFFDANWFNEFVNKSKDSLQVITQH 290

Query: 362  IYNLGPGVDKHLIEKILNPSYLDGGSQHFRDLQNILKNSGTSTVAWVGEAGGAYNSGRNQ 541
            IYNLGPGVD HL+EKILNPSYLDGGSQ FRD+QNILK S  STVAWVGEAGGAYNSG N 
Sbjct: 291  IYNLGPGVDNHLVEKILNPSYLDGGSQPFRDVQNILKKSKASTVAWVGEAGGAYNSGHNH 350

Query: 542  VTNAFVFSFWYLDQLGMASSYDTKTYCRQTLIGGNYGLLNTDTFVPNPDYYSALLWHRLM 721
            VTN+FVFSFWYLDQLGMA+SYDTKTYCRQTLIGGNYGLLNTDT+VPNPDYYSALLWHRLM
Sbjct: 351  VTNSFVFSFWYLDQLGMAASYDTKTYCRQTLIGGNYGLLNTDTYVPNPDYYSALLWHRLM 410

Query: 722  GRRVLSTRFHGTKKIRSYAHCSKHSDGITLLLINLASLAKTEVGLTVENVTILAASKQQE 901
            GRRVLST F GT+ IRSYAHCSK S+G+T+LLINL  + KTEVG+T+EN TI  ASK+  
Sbjct: 411  GRRVLSTSFQGTRMIRSYAHCSKRSNGMTILLINLDGITKTEVGVTIENETIAVASKR-- 468

Query: 902  HQTQKTHSSKLGEKELTREEYHLTAKNGNLNSHTILLNGNELTVNSTGSIPSLDPVQVNM 1081
             Q ++THSSKL  KE TREEYHLTAK+GNLNSHT+LLNG ELTVNSTG IPSLDPV+ N+
Sbjct: 469  -QIKQTHSSKLRNKEFTREEYHLTAKDGNLNSHTVLLNGKELTVNSTGIIPSLDPVEANL 527

Query: 1082 SSPINVAPFSIVFVHIPNINVPACT 1156
              PI VAP+SIVFVHIP I++ ACT
Sbjct: 528  RDPITVAPYSIVFVHIPGIHIQACT 552


>ref|XP_021987587.1| hydroxyproline O-galactosyltransferase HPGT3-like [Helianthus annuus]
 gb|OTG38522.1| putative galactosyltransferase family protein [Helianthus annuus]
          Length = 355

 Score =  605 bits (1559), Expect = 0.0
 Identities = 302/354 (85%), Positives = 318/354 (89%), Gaps = 1/354 (0%)
 Frame = -3

Query: 2324 NNSSLYYKEGYQLPTXXXXXXXXXXXXXXXXXXXXXXXXXS-CLAWLYIAGRLWQDAENR 2148
            +NS  YYKEG  L T                           C+AWLYIAGRLWQDAENR
Sbjct: 2    DNSPPYYKEGLPLSTTLSKSEKQRSRSSSRSSVPSIFFAFFSCVAWLYIAGRLWQDAENR 61

Query: 2147 MLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEMEITLAKSQGYLKDQLKQPG 1968
            MLL+NLLM+N+AERPKVLTVEDKL+VLGCKDLER+IVEAEMEITLAKSQG+LKD+LKQPG
Sbjct: 62   MLLANLLMQNSAERPKVLTVEDKLVVLGCKDLERKIVEAEMEITLAKSQGFLKDRLKQPG 121

Query: 1967 LSSNKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEERGIVIRFVIGRSPNRGDS 1788
            LSS+KKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEERGIVIRFVIGRSPNRGDS
Sbjct: 122  LSSSKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEERGIVIRFVIGRSPNRGDS 181

Query: 1787 LDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQNWDAEFYVKVDNNIALDLE 1608
            LDRNID ENR+TKDFLILDGHEEADEESPKKAKFFFSTAVQNWDAEFYVKVDNNIALDLE
Sbjct: 182  LDRNIDEENRTTKDFLILDGHEEADEESPKKAKFFFSTAVQNWDAEFYVKVDNNIALDLE 241

Query: 1607 GLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKFGDEKSYFRHASGSLVILSK 1428
            GLIELLESRRGQD VYIGCMKSGEVVAE GKPWYEPDWWKFGDEKSYFRHA+GSL+ILSK
Sbjct: 242  GLIELLESRRGQDSVYIGCMKSGEVVAEEGKPWYEPDWWKFGDEKSYFRHAAGSLIILSK 301

Query: 1427 NFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVCCSSIRQDKVCSLA 1266
            NFAQYININSASLKSYAHEDT+IGSWMMGI+ATYIDE+RVCCSS RQDKVCSLA
Sbjct: 302  NFAQYININSASLKSYAHEDTSIGSWMMGIKATYIDESRVCCSSSRQDKVCSLA 355


>gb|KVH90278.1| Glycoside hydrolase, family 79, partial [Cynara cardunculus var.
            scolymus]
          Length = 601

 Score =  608 bits (1569), Expect = 0.0
 Identities = 301/398 (75%), Positives = 336/398 (84%), Gaps = 13/398 (3%)
 Frame = +2

Query: 2    AKIIFGLNALSQRQVNIDGSVVGPWNSSNAEALMKYTVDKGFTIHGWELGN--------- 154
            AK+IFGLNALSQR V++DGSVVGPWNSSNAEALMKYTVDKGFTIHGWELGN         
Sbjct: 205  AKVIFGLNALSQRHVSMDGSVVGPWNSSNAEALMKYTVDKGFTIHGWELGNSFGLKTNNC 264

Query: 155  ----ELSANGIGAKITADQYALDTVALQNLVQKIYNTFQDKPLVLGPGGFFDYNWFDEFV 322
                ELS NGIGA+I ADQYA DT++LQNLVQ IY +F+ KP+VLGPGGFFD NWF E+V
Sbjct: 265  IVGNELSGNGIGARIMADQYASDTISLQNLVQNIYKSFEVKPIVLGPGGFFDANWFTEYV 324

Query: 323  RKSNDSLQVITQHIYNLGPGVDKHLIEKILNPSYLDGGSQHFRDLQNILKNSGTSTVAWV 502
             KSN SLQ ITQHIYNLGPGVD  L+ KIL+PS LDGGSQ  RDLQ ILK  G ST+AWV
Sbjct: 325  MKSNGSLQAITQHIYNLGPGVDNDLVNKILDPSCLDGGSQPLRDLQKILKEFGNSTIAWV 384

Query: 503  GEAGGAYNSGRNQVTNAFVFSFWYLDQLGMASSYDTKTYCRQTLIGGNYGLLNTDTFVPN 682
            GEAGGAYNSG ++V+N F+FSFWYLDQLGMASSYDTKTYCRQ+LIGGNYGLLNT TFVPN
Sbjct: 385  GEAGGAYNSGHDRVSNTFIFSFWYLDQLGMASSYDTKTYCRQSLIGGNYGLLNTVTFVPN 444

Query: 683  PDYYSALLWHRLMGRRVLSTRFHGTKKIRSYAHCSKHSDGITLLLINLASLAKTEVGLTV 862
            PDYYSALLWHRLMGR VL T F GTKKIRSYAHCSKHSDGITLLLINL S   T +GL+V
Sbjct: 445  PDYYSALLWHRLMGRHVLLTSFDGTKKIRSYAHCSKHSDGITLLLINLDSYTTTAIGLSV 504

Query: 863  ENVTILAASKQQEHQTQKTHSSKLGEKELTREEYHLTAKNGNLNSHTILLNGNELTVNST 1042
            ENVT++ AS Q + QTQ+T S +    E TREEYHLTAK+GNLNS T+LLNG EL+VNST
Sbjct: 505  ENVTMITASNQLK-QTQRTQSFQSSSNEFTREEYHLTAKDGNLNSQTVLLNGKELSVNST 563

Query: 1043 GSIPSLDPVQVNMSSPINVAPFSIVFVHIPNINVPACT 1156
            G IPSLDPV+VN+S+PI VAPFSIVFVH+P+I++PACT
Sbjct: 564  GIIPSLDPVEVNISNPIIVAPFSIVFVHMPDIHLPACT 601


>gb|PLY63760.1| hypothetical protein LSAT_6X20321 [Lactuca sativa]
          Length = 538

 Score =  602 bits (1551), Expect = 0.0
 Identities = 299/387 (77%), Positives = 334/387 (86%), Gaps = 3/387 (0%)
 Frame = +2

Query: 2    AKIIFGLNALSQRQVNIDGSVVGPWNSSNAEALMKYTVDKGFTIHGWELGNELSA-NGIG 178
            AK+IFGLNALS R  N+D  V+GPWNS+NAEALMKYT+DKGFTIHGWELGNELS  N IG
Sbjct: 153  AKVIFGLNALSHRNTNMD--VIGPWNSTNAEALMKYTIDKGFTIHGWELGNELSGFNAIG 210

Query: 179  AKITADQYALDTVALQNLVQKIYNTFQDKPLVLGPGGFFDYNWFDEFVRKSNDSLQVITQ 358
            A I ADQYA DT++LQNLVQK+Y  F  KP+VLGPGGFFD NWF E+V KSN+SLQV+TQ
Sbjct: 211  ASIKADQYASDTISLQNLVQKMYKNFAIKPIVLGPGGFFDENWFTEYVAKSNNSLQVLTQ 270

Query: 359  HIYNLGPGVDKHLIEKILNPSYLDGGSQHFRDLQNILKNSGTSTVAWVGEAGGAYNSGRN 538
            HIYNLGPGVD HLI+KILNP YLDGGSQ F+D+QNILK SG+ TVAWVGEAGGAYNSGRN
Sbjct: 271  HIYNLGPGVDTHLIKKILNPWYLDGGSQSFKDVQNILKESGSETVAWVGEAGGAYNSGRN 330

Query: 539  QVTNAFVFSFWYLDQLGMASSYDTKTYCRQTLIGGNYGLLNTDTFVPNPDYYSALLWHRL 718
            +V+NAFVFSFWYLDQ+GMAS YDTKTYCRQTLIGGNYGLLNT TFVPNPDYYSALLWHRL
Sbjct: 331  RVSNAFVFSFWYLDQMGMASLYDTKTYCRQTLIGGNYGLLNTATFVPNPDYYSALLWHRL 390

Query: 719  MGRRVLSTRFHGTKKIRSYAHCSKHSDGITLLLINLASLAKTEVGLTVENVTILAASKQQ 898
            MGRRVLS  F+G KKIRSYAHC+KHSDG+TLLLINL    K +VG+++ENVTI+ AS   
Sbjct: 391  MGRRVLSASFNGIKKIRSYAHCAKHSDGLTLLLINLDGSIKAKVGVSIENVTIIMASTPD 450

Query: 899  EHQTQKTHSSKLGEKEL-TREEYHLTAKNGNLNSHTILLNGNELTVNS-TGSIPSLDPVQ 1072
               TQ+T SS+  + EL  REEYHLTAKNG LNS+TILLNG EL+VNS TG IPSLDP+Q
Sbjct: 451  LKPTQETKSSRNPKGELFRREEYHLTAKNGKLNSNTILLNGKELSVNSTTGIIPSLDPIQ 510

Query: 1073 VNMSSPINVAPFSIVFVHIPNINVPAC 1153
            V + SPI VAPFSIVFVHIPNI+VPAC
Sbjct: 511  VKLGSPIMVAPFSIVFVHIPNIHVPAC 537


>ref|XP_023747021.1| heparanase-like protein 3 isoform X2 [Lactuca sativa]
          Length = 556

 Score =  602 bits (1551), Expect = 0.0
 Identities = 299/387 (77%), Positives = 334/387 (86%), Gaps = 3/387 (0%)
 Frame = +2

Query: 2    AKIIFGLNALSQRQVNIDGSVVGPWNSSNAEALMKYTVDKGFTIHGWELGNELSA-NGIG 178
            AK+IFGLNALS R  N+D  V+GPWNS+NAEALMKYT+DKGFTIHGWELGNELS  N IG
Sbjct: 171  AKVIFGLNALSHRNTNMD--VIGPWNSTNAEALMKYTIDKGFTIHGWELGNELSGFNAIG 228

Query: 179  AKITADQYALDTVALQNLVQKIYNTFQDKPLVLGPGGFFDYNWFDEFVRKSNDSLQVITQ 358
            A I ADQYA DT++LQNLVQK+Y  F  KP+VLGPGGFFD NWF E+V KSN+SLQV+TQ
Sbjct: 229  ASIKADQYASDTISLQNLVQKMYKNFAIKPIVLGPGGFFDENWFTEYVAKSNNSLQVLTQ 288

Query: 359  HIYNLGPGVDKHLIEKILNPSYLDGGSQHFRDLQNILKNSGTSTVAWVGEAGGAYNSGRN 538
            HIYNLGPGVD HLI+KILNP YLDGGSQ F+D+QNILK SG+ TVAWVGEAGGAYNSGRN
Sbjct: 289  HIYNLGPGVDTHLIKKILNPWYLDGGSQSFKDVQNILKESGSETVAWVGEAGGAYNSGRN 348

Query: 539  QVTNAFVFSFWYLDQLGMASSYDTKTYCRQTLIGGNYGLLNTDTFVPNPDYYSALLWHRL 718
            +V+NAFVFSFWYLDQ+GMAS YDTKTYCRQTLIGGNYGLLNT TFVPNPDYYSALLWHRL
Sbjct: 349  RVSNAFVFSFWYLDQMGMASLYDTKTYCRQTLIGGNYGLLNTATFVPNPDYYSALLWHRL 408

Query: 719  MGRRVLSTRFHGTKKIRSYAHCSKHSDGITLLLINLASLAKTEVGLTVENVTILAASKQQ 898
            MGRRVLS  F+G KKIRSYAHC+KHSDG+TLLLINL    K +VG+++ENVTI+ AS   
Sbjct: 409  MGRRVLSASFNGIKKIRSYAHCAKHSDGLTLLLINLDGSIKAKVGVSIENVTIIMASTPD 468

Query: 899  EHQTQKTHSSKLGEKEL-TREEYHLTAKNGNLNSHTILLNGNELTVNS-TGSIPSLDPVQ 1072
               TQ+T SS+  + EL  REEYHLTAKNG LNS+TILLNG EL+VNS TG IPSLDP+Q
Sbjct: 469  LKPTQETKSSRNPKGELFRREEYHLTAKNGKLNSNTILLNGKELSVNSTTGIIPSLDPIQ 528

Query: 1073 VNMSSPINVAPFSIVFVHIPNINVPAC 1153
            V + SPI VAPFSIVFVHIPNI+VPAC
Sbjct: 529  VKLGSPIMVAPFSIVFVHIPNIHVPAC 555


>ref|XP_023747020.1| heparanase-like protein 3 isoform X1 [Lactuca sativa]
          Length = 557

 Score =  597 bits (1539), Expect = 0.0
 Identities = 299/388 (77%), Positives = 334/388 (86%), Gaps = 4/388 (1%)
 Frame = +2

Query: 2    AKIIFGLNALSQRQVNIDGSVVGPWNSSNAEALMKYTVDKGFTIHGWELGNELSA-NGIG 178
            AK+IFGLNALS R  N+D  V+GPWNS+NAEALMKYT+DKGFTIHGWELGNELS  N IG
Sbjct: 171  AKVIFGLNALSHRNTNMD--VIGPWNSTNAEALMKYTIDKGFTIHGWELGNELSGFNAIG 228

Query: 179  AKITADQYALDTVALQNLVQKIYNTFQDKPLVLGPGGFFDYNWFDEFVRKSNDSLQVITQ 358
            A I ADQYA DT++LQNLVQK+Y  F  KP+VLGPGGFFD NWF E+V KSN+SLQV+TQ
Sbjct: 229  ASIKADQYASDTISLQNLVQKMYKNFAIKPIVLGPGGFFDENWFTEYVAKSNNSLQVLTQ 288

Query: 359  HIYNLGPGVDKHLIEKILNPSYLDGGSQHFRDLQNILKNSGTSTVAWVGEAGGAYNSGRN 538
            HIYNLGPGVD HLI+KILNP YLDGGSQ F+D+QNILK SG+ TVAWVGEAGGAYNSGRN
Sbjct: 289  HIYNLGPGVDTHLIKKILNPWYLDGGSQSFKDVQNILKESGSETVAWVGEAGGAYNSGRN 348

Query: 539  QVTNAFVFSFWYLDQLGMASSYDTKTYCRQTLIGGNYGLLNTDTFVPNPDYYSALLWHRL 718
            +V+NAFVFSFWYLDQ+GMAS YDTKTYCRQTLIGGNYGLLNT TFVPNPDYYSALLWHRL
Sbjct: 349  RVSNAFVFSFWYLDQMGMASLYDTKTYCRQTLIGGNYGLLNTATFVPNPDYYSALLWHRL 408

Query: 719  MGRRVLSTRFHGTKKIRSYAHCSKHS-DGITLLLINLASLAKTEVGLTVENVTILAASKQ 895
            MGRRVLS  F+G KKIRSYAHC+KHS DG+TLLLINL    K +VG+++ENVTI+ AS  
Sbjct: 409  MGRRVLSASFNGIKKIRSYAHCAKHSQDGLTLLLINLDGSIKAKVGVSIENVTIIMASTP 468

Query: 896  QEHQTQKTHSSKLGEKEL-TREEYHLTAKNGNLNSHTILLNGNELTVNS-TGSIPSLDPV 1069
                TQ+T SS+  + EL  REEYHLTAKNG LNS+TILLNG EL+VNS TG IPSLDP+
Sbjct: 469  DLKPTQETKSSRNPKGELFRREEYHLTAKNGKLNSNTILLNGKELSVNSTTGIIPSLDPI 528

Query: 1070 QVNMSSPINVAPFSIVFVHIPNINVPAC 1153
            QV + SPI VAPFSIVFVHIPNI+VPAC
Sbjct: 529  QVKLGSPIMVAPFSIVFVHIPNIHVPAC 556


>ref|XP_023761470.1| hydroxyproline O-galactosyltransferase HPGT3-like isoform X1 [Lactuca
            sativa]
 ref|XP_023761472.1| hydroxyproline O-galactosyltransferase HPGT3-like isoform X1 [Lactuca
            sativa]
 ref|XP_023761473.1| hydroxyproline O-galactosyltransferase HPGT3-like isoform X1 [Lactuca
            sativa]
 gb|PLY87263.1| hypothetical protein LSAT_1X43720 [Lactuca sativa]
          Length = 356

 Score =  587 bits (1513), Expect = 0.0
 Identities = 291/356 (81%), Positives = 310/356 (87%), Gaps = 1/356 (0%)
 Frame = -3

Query: 2330 MENNSSLYYKEGYQLPTXXXXXXXXXXXXXXXXXXXXXXXXXS-CLAWLYIAGRLWQDAE 2154
            M+NN   Y+KEG  LPT                           CLAWLYIAGRLWQDAE
Sbjct: 1    MDNNGPPYHKEGSPLPTTISKTEKQRSRSSSRSSVPSIFFAFFSCLAWLYIAGRLWQDAE 60

Query: 2153 NRMLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEMEITLAKSQGYLKDQLKQ 1974
            NR +LS+LLMKN+AERPKVLTVE+KLMVLGCKDLERRIVE+EMEI+LAKSQG+LKDQLKQ
Sbjct: 61   NRKVLSHLLMKNSAERPKVLTVEEKLMVLGCKDLERRIVESEMEISLAKSQGFLKDQLKQ 120

Query: 1973 PGLSSNKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEERGIVIRFVIGRSPNRG 1794
            PG SS+KKLLAVIGVYTGFGSRLNR VFRGSWMP GDSLKKLEERGI+IRFVIGRSPNRG
Sbjct: 121  PGFSSSKKLLAVIGVYTGFGSRLNRKVFRGSWMPTGDSLKKLEERGIIIRFVIGRSPNRG 180

Query: 1793 DSLDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQNWDAEFYVKVDNNIALD 1614
            DSLDRNID ENR+TKDFLILD HEEADEES KKAKFFFSTAVQNWDAEFY+KVDNNI LD
Sbjct: 181  DSLDRNIDEENRTTKDFLILDNHEEADEESSKKAKFFFSTAVQNWDAEFYIKVDNNIGLD 240

Query: 1613 LEGLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKFGDEKSYFRHASGSLVIL 1434
            LEGLIELLESR GQD VYIGCMKSGEVV+E GKPWYEPDWWKFGDEKSYFRHASGSL+I+
Sbjct: 241  LEGLIELLESRHGQDSVYIGCMKSGEVVSEVGKPWYEPDWWKFGDEKSYFRHASGSLLII 300

Query: 1433 SKNFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVCCSSIRQDKVCSLA 1266
            SK FAQYININSASLK+YAHEDT+IGSWMMGIQATYIDENRVCCS  +QDKVCSLA
Sbjct: 301  SKRFAQYININSASLKTYAHEDTSIGSWMMGIQATYIDENRVCCSGSQQDKVCSLA 356


>ref|XP_022001051.1| heparanase-like protein 3 isoform X3 [Helianthus annuus]
          Length = 349

 Score =  583 bits (1504), Expect = 0.0
 Identities = 283/352 (80%), Positives = 308/352 (87%)
 Frame = +2

Query: 101  MKYTVDKGFTIHGWELGNELSANGIGAKITADQYALDTVALQNLVQKIYNTFQDKPLVLG 280
            MKYTVDKGFTIHGWELGNELS  GIGA I ADQYA D V+LQNLVQKIY  F+ KPLVLG
Sbjct: 1    MKYTVDKGFTIHGWELGNELSGRGIGASIGADQYASDMVSLQNLVQKIYKAFEVKPLVLG 60

Query: 281  PGGFFDYNWFDEFVRKSNDSLQVITQHIYNLGPGVDKHLIEKILNPSYLDGGSQHFRDLQ 460
            PGGFFD NWF+EFV KS DSLQVITQHIYNLGPGVD HL+EKILNPSYLDGGSQ FRD+Q
Sbjct: 61   PGGFFDANWFNEFVNKSKDSLQVITQHIYNLGPGVDNHLVEKILNPSYLDGGSQPFRDVQ 120

Query: 461  NILKNSGTSTVAWVGEAGGAYNSGRNQVTNAFVFSFWYLDQLGMASSYDTKTYCRQTLIG 640
            NILK S  STVAWVGEAGGAYNSG N VTN+FVFSFWYLDQLGMA+SYDTKTYCRQTLIG
Sbjct: 121  NILKKSKASTVAWVGEAGGAYNSGHNHVTNSFVFSFWYLDQLGMAASYDTKTYCRQTLIG 180

Query: 641  GNYGLLNTDTFVPNPDYYSALLWHRLMGRRVLSTRFHGTKKIRSYAHCSKHSDGITLLLI 820
            GNYGLLNTDT+VPNPDYYSALLWHRLMGRRVLST F GT+ IRSYAHCSK S+G+T+LLI
Sbjct: 181  GNYGLLNTDTYVPNPDYYSALLWHRLMGRRVLSTSFQGTRMIRSYAHCSKRSNGMTILLI 240

Query: 821  NLASLAKTEVGLTVENVTILAASKQQEHQTQKTHSSKLGEKELTREEYHLTAKNGNLNSH 1000
            NL  + KTEVG+T+EN TI  ASK+   Q ++THSSKL  KE TREEYHLTAK+GNLNSH
Sbjct: 241  NLDGITKTEVGVTIENETIAVASKR---QIKQTHSSKLRNKEFTREEYHLTAKDGNLNSH 297

Query: 1001 TILLNGNELTVNSTGSIPSLDPVQVNMSSPINVAPFSIVFVHIPNINVPACT 1156
            T+LLNG ELTVNSTG IPSLDPV+ N+  PI VAP+SIVFVHIP I++ ACT
Sbjct: 298  TVLLNGKELTVNSTGIIPSLDPVEANLRDPITVAPYSIVFVHIPGIHIQACT 349


>ref|XP_018836253.1| PREDICTED: hydroxyproline O-galactosyltransferase HPGT3-like [Juglans
            regia]
          Length = 344

 Score =  554 bits (1427), Expect = 0.0
 Identities = 265/312 (84%), Positives = 292/312 (93%)
 Frame = -3

Query: 2201 CLAWLYIAGRLWQDAENRMLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEME 2022
            CLAWLY+AGRLW+DAENR LL+NLL KNA +RPKVLTVEDKLMVLGC+DLERRIVEAEM+
Sbjct: 33   CLAWLYVAGRLWEDAENRKLLANLLYKNALQRPKVLTVEDKLMVLGCRDLERRIVEAEMD 92

Query: 2021 ITLAKSQGYLKDQLKQPGLSSNKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEE 1842
            +TLAKSQGYLKD+L+Q G SS +KLLAVIGVYTGFGSRL RNVFRGSWMPKGD+L+KLEE
Sbjct: 93   LTLAKSQGYLKDKLQQSGSSSGQKLLAVIGVYTGFGSRLKRNVFRGSWMPKGDALRKLEE 152

Query: 1841 RGIVIRFVIGRSPNRGDSLDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQN 1662
            RG+VIRFVIGRS NRGDSLDRNID E RSTKDFLIL+GHEEA EE PKKAKFFFSTAVQN
Sbjct: 153  RGVVIRFVIGRSANRGDSLDRNIDEEYRSTKDFLILEGHEEAQEELPKKAKFFFSTAVQN 212

Query: 1661 WDAEFYVKVDNNIALDLEGLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKFG 1482
            WDAEFYVKVD++I LDLEGLI LL+ RRGQDG YIGCMKSG+V+++ GK WYEPDWWKFG
Sbjct: 213  WDAEFYVKVDDSIDLDLEGLIGLLDRRRGQDGAYIGCMKSGDVISDEGKSWYEPDWWKFG 272

Query: 1481 DEKSYFRHASGSLVILSKNFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVCC 1302
            DEKSYFRHASGSL+ILSKN AQYININSASLKSYAH+D ++GSWMMG+QATYIDENR+CC
Sbjct: 273  DEKSYFRHASGSLLILSKNLAQYININSASLKSYAHDDVSVGSWMMGLQATYIDENRLCC 332

Query: 1301 SSIRQDKVCSLA 1266
            SSIRQDKVCSLA
Sbjct: 333  SSIRQDKVCSLA 344


>ref|XP_023747023.1| hydroxyproline O-galactosyltransferase HPGT3-like [Lactuca sativa]
 gb|PLY63773.1| hypothetical protein LSAT_6X20300 [Lactuca sativa]
          Length = 355

 Score =  551 bits (1420), Expect = 0.0
 Identities = 266/312 (85%), Positives = 289/312 (92%)
 Frame = -3

Query: 2201 CLAWLYIAGRLWQDAENRMLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEME 2022
            CLAWLYIAGRLWQDAENR LL+NLL+KN+++RPKVLTVEDKLMVLGCKDLERRIVE EME
Sbjct: 47   CLAWLYIAGRLWQDAENRTLLANLLIKNSSQRPKVLTVEDKLMVLGCKDLERRIVETEME 106

Query: 2021 ITLAKSQGYLKDQLKQPGLSSNKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEE 1842
            ITLAKSQG+L +QLK    SSNKK LAVIG+YTGFG++L RN FRGSWMP+GDSLKKLEE
Sbjct: 107  ITLAKSQGFLSNQLKS---SSNKKFLAVIGIYTGFGNKLRRNSFRGSWMPEGDSLKKLEE 163

Query: 1841 RGIVIRFVIGRSPNRGDSLDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQN 1662
            RGIVIRF+IGRSPNRGDSLDRNID ENR+TKDFLILD HEEA+EESPKKAKFFFSTAVQN
Sbjct: 164  RGIVIRFIIGRSPNRGDSLDRNIDEENRTTKDFLILDAHEEAEEESPKKAKFFFSTAVQN 223

Query: 1661 WDAEFYVKVDNNIALDLEGLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKFG 1482
            WDAEFYVKVDNNI LDLEGLIELLESR+GQD +YIGCMKSGEVV+E GK WYEPDWWKFG
Sbjct: 224  WDAEFYVKVDNNINLDLEGLIELLESRQGQDSLYIGCMKSGEVVSEEGKQWYEPDWWKFG 283

Query: 1481 DEKSYFRHASGSLVILSKNFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVCC 1302
            D KSYFRHA GSL ILS+NFAQYININS SLK+YAHEDT++GSWMMGIQATYID+ RVCC
Sbjct: 284  DAKSYFRHAGGSLYILSRNFAQYININSVSLKTYAHEDTSVGSWMMGIQATYIDDTRVCC 343

Query: 1301 SSIRQDKVCSLA 1266
             + RQDKVCSLA
Sbjct: 344  GTSRQDKVCSLA 355


>ref|XP_023761469.1| heparanase-like protein 3 [Lactuca sativa]
          Length = 396

 Score =  553 bits (1424), Expect = 0.0
 Identities = 272/392 (69%), Positives = 317/392 (80%), Gaps = 8/392 (2%)
 Frame = +2

Query: 2    AKIIFGLNALSQRQVNIDGSVVGPWNSSNAEALMKYTVDKGFTIHGWELG--NELSANGI 175
            AK++FGLNAL+ RQ+  DG+  G W+ SNAEAL++YTV+ G+ I+GWELG  NELS  GI
Sbjct: 4    AKVVFGLNALTGRQIGYDGTTFGSWDLSNAEALIRYTVNNGYVIYGWELGSGNELSGRGI 63

Query: 176  GAKITADQYALDTVALQNLVQKIYNTFQDKPLVLGPGGFFDYNWFDEFVRKSNDSLQVIT 355
            G  + A QYA DT++LQNLVQKIYN  Q+KP+VLGPGGFFD NWF+ +V +++ SLQVIT
Sbjct: 64   GTSVAAKQYASDTISLQNLVQKIYNGSQEKPIVLGPGGFFDANWFNVYVTEASGSLQVIT 123

Query: 356  QHIYNLGPGVDKHLIEKILNPSYLDGGSQHFRDLQNILKNSGTSTVAWVGEAGGAYNSGR 535
            QHIYNLGPGVD HL+EKILNPSYLDGGSQ FRDLQNILK S TSTVAWVGEAGGAYNSGR
Sbjct: 124  QHIYNLGPGVDAHLVEKILNPSYLDGGSQPFRDLQNILKKSRTSTVAWVGEAGGAYNSGR 183

Query: 536  NQVTNAFVFSFWYLDQLGMASSYDTKTYCRQTLIGGNYGLLNTDTFVPNPDYYSALLWHR 715
            N VTNAFVF FWYLDQLGMAS+Y+T TYCRQTLIGGNYGLLNT TFVPNPDYY ALLWHR
Sbjct: 184  NLVTNAFVFGFWYLDQLGMASTYNTTTYCRQTLIGGNYGLLNTTTFVPNPDYYGALLWHR 243

Query: 716  LMGRRVLSTRFHGTKKIRSYAHCSKHSDGITLLLINLASLAKTEVGLTVENV--TILAAS 889
            LMGR VLST F GT KIRSYAHCSK S GITLLLINL  +  T VG++  N    I   +
Sbjct: 244  LMGRHVLSTNFSGTNKIRSYAHCSKTSTGITLLLINLDGIKTTNVGISFINTIKIITQTT 303

Query: 890  KQQEHQTQKTHSSKLGE----KELTREEYHLTAKNGNLNSHTILLNGNELTVNSTGSIPS 1057
            K++  + ++T  SK+       E+ REEYHLTAKNG+L+S  +LLNG EL VNS+G IPS
Sbjct: 304  KKEPKEQKRTKFSKMRRNPKVNEVIREEYHLTAKNGDLHSQIVLLNGKELIVNSSGIIPS 363

Query: 1058 LDPVQVNMSSPINVAPFSIVFVHIPNINVPAC 1153
            L+P++ N SSPINVAP+SIVFVHIP++  PAC
Sbjct: 364  LNPIKQNFSSPINVAPYSIVFVHIPSVRFPAC 395


>gb|PLY87253.1| hypothetical protein LSAT_1X43741 [Lactuca sativa]
          Length = 549

 Score =  558 bits (1437), Expect = 0.0
 Identities = 272/390 (69%), Positives = 317/390 (81%), Gaps = 6/390 (1%)
 Frame = +2

Query: 2    AKIIFGLNALSQRQVNIDGSVVGPWNSSNAEALMKYTVDKGFTIHGWELGNELSANGIGA 181
            AK++FGLNAL+ RQ+  DG+  G W+ SNAEAL++YTV+ G+ I+GWELGNELS  GIG 
Sbjct: 159  AKVVFGLNALTGRQIGYDGTTFGSWDLSNAEALIRYTVNNGYVIYGWELGNELSGRGIGT 218

Query: 182  KITADQYALDTVALQNLVQKIYNTFQDKPLVLGPGGFFDYNWFDEFVRKSNDSLQVITQH 361
             + A QYA DT++LQNLVQKIYN  Q+KP+VLGPGGFFD NWF+ +V +++ SLQVITQH
Sbjct: 219  SVAAKQYASDTISLQNLVQKIYNGSQEKPIVLGPGGFFDANWFNVYVTEASGSLQVITQH 278

Query: 362  IYNLGPGVDKHLIEKILNPSYLDGGSQHFRDLQNILKNSGTSTVAWVGEAGGAYNSGRNQ 541
            IYNLGPGVD HL+EKILNPSYLDGGSQ FRDLQNILK S TSTVAWVGEAGGAYNSGRN 
Sbjct: 279  IYNLGPGVDAHLVEKILNPSYLDGGSQPFRDLQNILKKSRTSTVAWVGEAGGAYNSGRNL 338

Query: 542  VTNAFVFSFWYLDQLGMASSYDTKTYCRQTLIGGNYGLLNTDTFVPNPDYYSALLWHRLM 721
            VTNAFVF FWYLDQLGMAS+Y+T TYCRQTLIGGNYGLLNT TFVPNPDYY ALLWHRLM
Sbjct: 339  VTNAFVFGFWYLDQLGMASTYNTTTYCRQTLIGGNYGLLNTTTFVPNPDYYGALLWHRLM 398

Query: 722  GRRVLSTRFHGTKKIRSYAHCSKHSDGITLLLINLASLAKTEVGLTVENV--TILAASKQ 895
            GR VLST F GT KIRSYAHCSK S GITLLLINL  +  T VG++  N    I   +K+
Sbjct: 399  GRHVLSTNFSGTNKIRSYAHCSKTSTGITLLLINLDGIKTTNVGISFINTIKIITQTTKK 458

Query: 896  QEHQTQKTHSSKLGE----KELTREEYHLTAKNGNLNSHTILLNGNELTVNSTGSIPSLD 1063
            +  + ++T  SK+       E+ REEYHLTAKNG+L+S  +LLNG EL VNS+G IPSL+
Sbjct: 459  EPKEQKRTKFSKMRRNPKVNEVIREEYHLTAKNGDLHSQIVLLNGKELIVNSSGIIPSLN 518

Query: 1064 PVQVNMSSPINVAPFSIVFVHIPNINVPAC 1153
            P++ N SSPINVAP+SIVFVHIP++  PAC
Sbjct: 519  PIKQNFSSPINVAPYSIVFVHIPSVRFPAC 548


>ref|XP_002515480.2| PREDICTED: probable beta-1,3-galactosyltransferase 9 [Ricinus
            communis]
          Length = 346

 Score =  546 bits (1408), Expect = 0.0
 Identities = 265/313 (84%), Positives = 287/313 (91%), Gaps = 1/313 (0%)
 Frame = -3

Query: 2201 CLAWLYIAGRLWQDAENRMLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEME 2022
            CLAWLY+AGRLWQDAENRMLLSNLL  N+A+RP+VLTVEDKL VLGCKDLERRIVEAEME
Sbjct: 34   CLAWLYVAGRLWQDAENRMLLSNLLKLNSAQRPRVLTVEDKLAVLGCKDLERRIVEAEME 93

Query: 2021 ITLAKSQGYLKDQLKQPGLSSN-KKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLE 1845
            +TLAKSQGYLK+QL   G SS+ KKLLAVIGVYTGFGSRL RNVFRGSWMP+GD+LKKLE
Sbjct: 94   LTLAKSQGYLKNQLPHSGSSSSGKKLLAVIGVYTGFGSRLKRNVFRGSWMPRGDALKKLE 153

Query: 1844 ERGIVIRFVIGRSPNRGDSLDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQ 1665
            ERG+VIRFVIGRS NRGDSLDRNID EN STKDFLILDGHEEA EE PKKAKFFFSTAVQ
Sbjct: 154  ERGVVIRFVIGRSANRGDSLDRNIDEENSSTKDFLILDGHEEAQEEIPKKAKFFFSTAVQ 213

Query: 1664 NWDAEFYVKVDNNIALDLEGLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKF 1485
             WDAEFYVKVD+NI LDLEGLI LLE RRGQD  Y+GCMKSG+V+ E GK WYEPDWWKF
Sbjct: 214  KWDAEFYVKVDDNINLDLEGLIGLLERRRGQDSAYVGCMKSGDVITEEGKQWYEPDWWKF 273

Query: 1484 GDEKSYFRHASGSLVILSKNFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVC 1305
            GDEKSYFRHASGSL ILSKN AQYININSASLK YAH+DT++GSWMMG+QATYID+NR+C
Sbjct: 274  GDEKSYFRHASGSLFILSKNLAQYININSASLKMYAHDDTSVGSWMMGLQATYIDDNRLC 333

Query: 1304 CSSIRQDKVCSLA 1266
            CSSI+QDKVCS+A
Sbjct: 334  CSSIKQDKVCSVA 346


>gb|KVH99807.1| protein of unknown function DUF4094 [Cynara cardunculus var.
            scolymus]
          Length = 338

 Score =  546 bits (1407), Expect = 0.0
 Identities = 284/353 (80%), Positives = 294/353 (83%), Gaps = 1/353 (0%)
 Frame = -3

Query: 2321 NSSLYYKEGYQLPTXXXXXXXXXXXXXXXXXXXXXXXXXS-CLAWLYIAGRLWQDAENRM 2145
            NS  YYKEG  LPT                           CLAWLYIAGRLWQDAENRM
Sbjct: 3    NSPPYYKEGLPLPTTISKTEKQRSRSSSRSSIPSIFFAFFSCLAWLYIAGRLWQDAENRM 62

Query: 2144 LLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEMEITLAKSQGYLKDQLKQPGL 1965
            LLSNLLMKN+AERPKVLTVEDKLMVLGCKDLERRIVEAEMEITLAKSQG+L DQLKQPG 
Sbjct: 63   LLSNLLMKNSAERPKVLTVEDKLMVLGCKDLERRIVEAEMEITLAKSQGFLTDQLKQPGN 122

Query: 1964 SSNKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEERGIVIRFVIGRSPNRGDSL 1785
            SS KKLLAVIGVYTGFGSRLNRNVFRGSWMP G+SLKKLEERGIVIRFVIGRSPNRGDSL
Sbjct: 123  SSQKKLLAVIGVYTGFGSRLNRNVFRGSWMPTGNSLKKLEERGIVIRFVIGRSPNRGDSL 182

Query: 1784 DRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQNWDAEFYVKVDNNIALDLEG 1605
            DRNID ENR+TKDFLILDGHEEADEESPKKAKFFFSTA+QNWDAEFYVKVDNNIALDLEG
Sbjct: 183  DRNIDEENRATKDFLILDGHEEADEESPKKAKFFFSTAIQNWDAEFYVKVDNNIALDLEG 242

Query: 1604 LIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKFGDEKSYFRHASGSLVILSKN 1425
            LIELLESRRGQD VY+GCMKSGEVVAE                  YFRHASGSL+ILSKN
Sbjct: 243  LIELLESRRGQDSVYLGCMKSGEVVAE-----------------EYFRHASGSLLILSKN 285

Query: 1424 FAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVCCSSIRQDKVCSLA 1266
            FAQYININSASLK+YAHEDT+IGSWMMGIQATYIDENR CCS   QDKVCSL+
Sbjct: 286  FAQYININSASLKTYAHEDTSIGSWMMGIQATYIDENRACCSGSVQDKVCSLS 338


>ref|XP_023872627.1| hydroxyproline O-galactosyltransferase HPGT3-like [Quercus suber]
 gb|POF23808.1| hydroxyproline o-galactosyltransferase hpgt2 [Quercus suber]
          Length = 344

 Score =  543 bits (1398), Expect = 0.0
 Identities = 257/312 (82%), Positives = 287/312 (91%)
 Frame = -3

Query: 2201 CLAWLYIAGRLWQDAENRMLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEME 2022
            CLAWLY+AGRLWQDAENR +L+NLL KN+ +RPK+LTVEDKL VLGC+DLERRIVEAEME
Sbjct: 33   CLAWLYVAGRLWQDAENRKVLTNLLYKNSLQRPKILTVEDKLSVLGCRDLERRIVEAEME 92

Query: 2021 ITLAKSQGYLKDQLKQPGLSSNKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEE 1842
            +TLAKSQGYL  QL+Q G SS KKLLAVIGVYTGFGSRL RNVFRGSWMPKGD+L+KLEE
Sbjct: 93   LTLAKSQGYLNKQLQQSGSSSGKKLLAVIGVYTGFGSRLKRNVFRGSWMPKGDALRKLEE 152

Query: 1841 RGIVIRFVIGRSPNRGDSLDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQN 1662
            RG+VIRFVIGRS NRGDSLDRNI+ ENRSTKDFLIL+GHEEA EE PKKAKFF STAVQ 
Sbjct: 153  RGVVIRFVIGRSANRGDSLDRNINEENRSTKDFLILEGHEEAQEELPKKAKFFLSTAVQK 212

Query: 1661 WDAEFYVKVDNNIALDLEGLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKFG 1482
            WDA+F+VKVD+NI LDLE LI LLE RRGQDG YIGCMKSG+V++E GKPWYEPDWWKFG
Sbjct: 213  WDADFFVKVDDNIDLDLEALIGLLERRRGQDGAYIGCMKSGDVISEEGKPWYEPDWWKFG 272

Query: 1481 DEKSYFRHASGSLVILSKNFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVCC 1302
            DEKSYFRHA  +L+ILSKN AQY+NINSASLK+YAH+DT++GSWMMG+QATYID+NR+CC
Sbjct: 273  DEKSYFRHAGTALIILSKNLAQYVNINSASLKTYAHDDTSVGSWMMGLQATYIDDNRLCC 332

Query: 1301 SSIRQDKVCSLA 1266
            SSIRQDKVCSLA
Sbjct: 333  SSIRQDKVCSLA 344


>ref|XP_018822215.1| PREDICTED: hydroxyproline O-galactosyltransferase HPGT3-like [Juglans
            regia]
          Length = 344

 Score =  542 bits (1397), Expect = 0.0
 Identities = 258/312 (82%), Positives = 287/312 (91%)
 Frame = -3

Query: 2201 CLAWLYIAGRLWQDAENRMLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEME 2022
            CLAWLY+AGRLWQDAENR LLSNLL KN+ +RPKVLTVEDKL VLGC+DLERRIVEAEME
Sbjct: 33   CLAWLYVAGRLWQDAENRKLLSNLLYKNSLQRPKVLTVEDKLTVLGCRDLERRIVEAEME 92

Query: 2021 ITLAKSQGYLKDQLKQPGLSSNKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEE 1842
            +TLAKSQGYL +QL+Q   SS ++LLAVIG+YTGFGS L RNVFRGSWMPKGD+L+KLEE
Sbjct: 93   LTLAKSQGYLNNQLQQSKSSSGRRLLAVIGLYTGFGSHLKRNVFRGSWMPKGDALRKLEE 152

Query: 1841 RGIVIRFVIGRSPNRGDSLDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQN 1662
            RG+VIRFVIGRS NRGDSLDRNID ENR+TKDFLIL+GHEEA EE PKK K+FFSTAVQ 
Sbjct: 153  RGVVIRFVIGRSANRGDSLDRNIDKENRTTKDFLILEGHEEAQEELPKKVKYFFSTAVQK 212

Query: 1661 WDAEFYVKVDNNIALDLEGLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKFG 1482
            WDAEFYVKVD+NI LDLEGLI LL+ RRGQDG YIGCMKSG+V++E GKPWYEPDWWKFG
Sbjct: 213  WDAEFYVKVDDNIDLDLEGLIGLLDRRRGQDGAYIGCMKSGDVISEEGKPWYEPDWWKFG 272

Query: 1481 DEKSYFRHASGSLVILSKNFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVCC 1302
            DEKSYFRHA+GSL+ILSKN AQYININSASLK+YAH+D ++GSWMMGIQATY D+NR+CC
Sbjct: 273  DEKSYFRHAAGSLLILSKNLAQYININSASLKTYAHDDVSMGSWMMGIQATYTDDNRLCC 332

Query: 1301 SSIRQDKVCSLA 1266
            SSIRQDKVCSLA
Sbjct: 333  SSIRQDKVCSLA 344


>ref|XP_004498753.1| PREDICTED: probable beta-1,3-galactosyltransferase 10 [Cicer
            arietinum]
          Length = 344

 Score =  542 bits (1397), Expect = 0.0
 Identities = 258/312 (82%), Positives = 290/312 (92%)
 Frame = -3

Query: 2201 CLAWLYIAGRLWQDAENRMLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEME 2022
            C+AWLY+AGRLWQDAENR LL++LL KN+ +RPKVLTVEDKLMVLGC+DLERRIV+AEME
Sbjct: 33   CVAWLYVAGRLWQDAENRNLLTSLLKKNSEQRPKVLTVEDKLMVLGCRDLERRIVDAEME 92

Query: 2021 ITLAKSQGYLKDQLKQPGLSSNKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEE 1842
            +TLAKSQGYLK Q +Q G SS+++LLAVIGVYTGFGSRL RN FRGSWMP+GD+LKKLEE
Sbjct: 93   LTLAKSQGYLKGQRQQTGSSSDRRLLAVIGVYTGFGSRLKRNEFRGSWMPRGDALKKLEE 152

Query: 1841 RGIVIRFVIGRSPNRGDSLDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQN 1662
            RG+VIRFVIGRS NRGDSLDRNID ENRSTKDFLILD HEEA EE PKKAK FFSTAVQN
Sbjct: 153  RGVVIRFVIGRSANRGDSLDRNIDEENRSTKDFLILDSHEEAQEELPKKAKIFFSTAVQN 212

Query: 1661 WDAEFYVKVDNNIALDLEGLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKFG 1482
            WDA+FYVKVD++I +DLEGLIELLE RRGQDG YIGCMKSG+V++E GK WYEPDWWKFG
Sbjct: 213  WDADFYVKVDDSIGIDLEGLIELLEHRRGQDGAYIGCMKSGDVISEEGKLWYEPDWWKFG 272

Query: 1481 DEKSYFRHASGSLVILSKNFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVCC 1302
            DEKSYFRHA+GSLVILSKN AQYININS SLK+YA++DT++GSWMMGIQ+TYID+NR+CC
Sbjct: 273  DEKSYFRHAAGSLVILSKNLAQYININSVSLKTYAYDDTSLGSWMMGIQSTYIDDNRLCC 332

Query: 1301 SSIRQDKVCSLA 1266
            SSIRQDKVCSLA
Sbjct: 333  SSIRQDKVCSLA 344


>ref|XP_008445237.1| PREDICTED: hydroxyproline O-galactosyltransferase HPGT3-like [Cucumis
            melo]
          Length = 346

 Score =  542 bits (1397), Expect = 0.0
 Identities = 258/313 (82%), Positives = 290/313 (92%), Gaps = 2/313 (0%)
 Frame = -3

Query: 2201 CLAWLYIAGRLWQDAENRMLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEME 2022
            CLAWLY+AGRLWQDAENR LLS LL KNA++RP +L+VEDKL VLGCKDLERRIVE EM+
Sbjct: 33   CLAWLYVAGRLWQDAENRKLLSTLLQKNASQRPVILSVEDKLQVLGCKDLERRIVEVEMD 92

Query: 2021 ITLAKSQGYLKDQLKQPGLSSN--KKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKL 1848
            +TLAKSQGYLK+QL+Q G SSN  +KLLAVIGVYTGFGSRL RNVFRGSWMPKGD+LKKL
Sbjct: 93   LTLAKSQGYLKNQLRQSGSSSNPGRKLLAVIGVYTGFGSRLRRNVFRGSWMPKGDALKKL 152

Query: 1847 EERGIVIRFVIGRSPNRGDSLDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAV 1668
            EERG++IRFVIGRS NRGDSLDRNID EN STKDFLIL+GHEEADEE PKKAKFFFSTAV
Sbjct: 153  EERGVIIRFVIGRSANRGDSLDRNIDKENHSTKDFLILEGHEEADEELPKKAKFFFSTAV 212

Query: 1667 QNWDAEFYVKVDNNIALDLEGLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWK 1488
            QNWDAEFYVKVD++I LDLEGLI LLE RRGQDG Y+GCMKSG+V+AE GK WYEP+WWK
Sbjct: 213  QNWDAEFYVKVDDHIDLDLEGLIGLLEHRRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWK 272

Query: 1487 FGDEKSYFRHASGSLVILSKNFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRV 1308
            FGDEKSYFRHASG+L+ILSKN AQYININSASLK+YAH+D ++GSWM+G+QAT+ID+NR+
Sbjct: 273  FGDEKSYFRHASGALIILSKNLAQYININSASLKTYAHDDISVGSWMIGLQATHIDDNRL 332

Query: 1307 CCSSIRQDKVCSL 1269
            CCSSIRQDKVCS+
Sbjct: 333  CCSSIRQDKVCSV 345


>ref|XP_007161194.1| hypothetical protein PHAVU_001G049900g [Phaseolus vulgaris]
 gb|ESW33188.1| hypothetical protein PHAVU_001G049900g [Phaseolus vulgaris]
          Length = 342

 Score =  541 bits (1394), Expect = 0.0
 Identities = 259/312 (83%), Positives = 292/312 (93%)
 Frame = -3

Query: 2201 CLAWLYIAGRLWQDAENRMLLSNLLMKNAAERPKVLTVEDKLMVLGCKDLERRIVEAEME 2022
            C+AWLY+AGRLWQDAENR LL++LL KN+A+RPKVLTVEDKLMVLGC+DLERRIVEAEME
Sbjct: 32   CVAWLYVAGRLWQDAENRNLLASLLKKNSAQRPKVLTVEDKLMVLGCRDLERRIVEAEME 91

Query: 2021 ITLAKSQGYLKDQLKQPGLSSNKKLLAVIGVYTGFGSRLNRNVFRGSWMPKGDSLKKLEE 1842
            +TLAKSQGYLK Q ++ G SS+++LLAVIGVYTGFGSRL RNVFRGSWMP+GD+LKKLEE
Sbjct: 92   LTLAKSQGYLKGQGQKSG-SSDRRLLAVIGVYTGFGSRLKRNVFRGSWMPRGDALKKLEE 150

Query: 1841 RGIVIRFVIGRSPNRGDSLDRNIDAENRSTKDFLILDGHEEADEESPKKAKFFFSTAVQN 1662
            RG+VIRFVIGRS NRGDSLDRNID ENRSTKDFLIL+GHEEA EE PKK K FFSTAVQN
Sbjct: 151  RGVVIRFVIGRSANRGDSLDRNIDEENRSTKDFLILEGHEEAQEELPKKVKTFFSTAVQN 210

Query: 1661 WDAEFYVKVDNNIALDLEGLIELLESRRGQDGVYIGCMKSGEVVAEAGKPWYEPDWWKFG 1482
            WDA+FYVKVD+NI +DLEGLIELLE RRGQDG YIGCMKSG+V++E GKPWYEPDWWKFG
Sbjct: 211  WDADFYVKVDDNIDIDLEGLIELLEHRRGQDGAYIGCMKSGDVISEDGKPWYEPDWWKFG 270

Query: 1481 DEKSYFRHASGSLVILSKNFAQYININSASLKSYAHEDTTIGSWMMGIQATYIDENRVCC 1302
            DEKSYFRHA GSLVI+SKN AQYININSASLK+YA +DT++GSWMMGIQATYID++R+CC
Sbjct: 271  DEKSYFRHAGGSLVIISKNLAQYININSASLKTYAFDDTSLGSWMMGIQATYIDDSRLCC 330

Query: 1301 SSIRQDKVCSLA 1266
            SS+RQ+KVCSLA
Sbjct: 331  SSVRQEKVCSLA 342

