BLASTX nr result

ID: Chrysanthemum21_contig00001878 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00001878
         (2440 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_023734341.1| hydroxyproline O-galactosyltransferase GALT3...  1054   0.0  
gb|KVH89696.1| Concanavalin A-like lectin/glucanase superfamily ...  1037   0.0  
ref|XP_021990360.1| hydroxyproline O-galactosyltransferase GALT3...  1015   0.0  
ref|XP_017230832.1| PREDICTED: hydroxyproline O-galactosyltransf...   840   0.0  
ref|XP_023910680.1| hydroxyproline O-galactosyltransferase GALT3...   827   0.0  
dbj|GAY35602.1| hypothetical protein CUMW_017250 [Citrus unshiu]      827   0.0  
dbj|GAY35604.1| hypothetical protein CUMW_017260 [Citrus unshiu]      824   0.0  
ref|XP_006419292.1| hydroxyproline O-galactosyltransferase GALT3...   821   0.0  
gb|EOY06836.1| Beta-1,3-galactosyltransferase 16 isoform 1 [Theo...   821   0.0  
ref|XP_007035910.2| PREDICTED: hydroxyproline O-galactosyltransf...   820   0.0  
ref|XP_006488779.1| PREDICTED: probable beta-1,3-galactosyltrans...   820   0.0  
gb|KDO72085.1| hypothetical protein CISIN_1g006036mg [Citrus sin...   819   0.0  
ref|XP_019180413.1| PREDICTED: hydroxyproline O-galactosyltransf...   816   0.0  
ref|XP_024046479.1| hydroxyproline O-galactosyltransferase GALT3...   814   0.0  
ref|XP_006488778.1| PREDICTED: probable beta-1,3-galactosyltrans...   813   0.0  
ref|XP_018814246.1| PREDICTED: hydroxyproline O-galactosyltransf...   811   0.0  
gb|PNT16634.1| hypothetical protein POPTR_010G151600v3 [Populus ...   808   0.0  
ref|XP_021642914.1| hydroxyproline O-galactosyltransferase GALT3...   808   0.0  
ref|XP_010276521.1| PREDICTED: hydroxyproline O-galactosyltransf...   808   0.0  
ref|XP_006355721.1| PREDICTED: probable beta-1,3-galactosyltrans...   806   0.0  

>ref|XP_023734341.1| hydroxyproline O-galactosyltransferase GALT3 [Lactuca sativa]
 ref|XP_023734342.1| hydroxyproline O-galactosyltransferase GALT3 [Lactuca sativa]
 ref|XP_023734343.1| hydroxyproline O-galactosyltransferase GALT3 [Lactuca sativa]
 gb|PLY73382.1| hypothetical protein LSAT_6X69101 [Lactuca sativa]
          Length = 628

 Score = 1054 bits (2725), Expect = 0.0
 Identities = 528/640 (82%), Positives = 565/640 (88%), Gaps = 7/640 (1%)
 Frame = -2

Query: 2232 MKKWTXXXXXXXXXXXXLVSYSFIQKQPRKQSSYEFFHPNEEIANLSSVSVVTKVEKLQ- 2056
            MKKWT            LVSYSFIQKQP KQSSYEFFHPN+E  N S+ S   K EKLQ 
Sbjct: 1    MKKWTGGVLIIGLACILLVSYSFIQKQPEKQSSYEFFHPNQEHTNFSN-SEQPKGEKLQT 59

Query: 2055 --KRPHLVKVNGLDYLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDSLPETAQGIKEAALA 1882
              KRP LV ++GL+YLFNST+MPKEEE KPLLAWGQMRLLLSRSDSLP TAQGIKEAA+A
Sbjct: 60   FVKRPRLVNLDGLNYLFNSTHMPKEEEGKPLLAWGQMRLLLSRSDSLPMTAQGIKEAAVA 119

Query: 1881 WKELRSIISKNRASKFVHNKYERNCSYSVSGVNNSTLFSSNGSIILEIPCGLIHDSSITI 1702
            WKEL+S I +NRASKFVHNK ERNCS+SV  +NN+T FS        IPCGLI DSSIT+
Sbjct: 120  WKELQSTIDENRASKFVHNK-ERNCSFSVISMNNATSFS--------IPCGLIEDSSITV 170

Query: 1701 IAIPDGKQDGFEIEFVGSEGKEEPNPPVILHYNVVLPGDNFTKDPVIAQNTWTYDFGWGK 1522
            IAIP G QDGF+IEFVG EGKEE +  VIL YNVVL G+NFTK+P IAQNTWTY+FGWGK
Sbjct: 171  IAIPKGNQDGFQIEFVGLEGKEEAD--VILSYNVVLSGNNFTKEPFIAQNTWTYEFGWGK 228

Query: 1521 EERCPNHGSPNNAKVDGLVKCNEQLTRSTMDENSNSRHLSVN---KSTNASDVNDH-NHV 1354
            EERCP+HGSPNNAKVDGLVKCNEQL  S+++ENS S+ L VN   KS+N SDV+ H NH+
Sbjct: 229  EERCPSHGSPNNAKVDGLVKCNEQLMGSSLEENSTSKQLIVNNKNKSSNGSDVSGHDNHM 288

Query: 1353 GSNFPFIDGSLFTATIWAGEEGFHGTVNGRHETSFAYREKLEPWLVSGVRVKGGLHIISA 1174
             SNFPF+DGSLFTATIW G EGFH TVNGRHETSFAYREKLEPWLVSGVRVKGGLHIIS 
Sbjct: 289  VSNFPFLDGSLFTATIWVGVEGFHATVNGRHETSFAYREKLEPWLVSGVRVKGGLHIIST 348

Query: 1173 LAKGLPVSENLDMAMDLEDLKAPLISNKSKRLVLLIGVFSSGNNFERRMALRRSWMKYEA 994
            LAKGLPVSENLDMA+DLE+LKAPLISNKSKRL+LLIGVFSSGNNFERRMALRRSWMKYEA
Sbjct: 349  LAKGLPVSENLDMAIDLEELKAPLISNKSKRLLLLIGVFSSGNNFERRMALRRSWMKYEA 408

Query: 993  VRSGVVAVRFFIGLHKNKQVNFELWKEAQAYQDVQLMPFVDYYSLLTLKTIAICIMGTKI 814
            VRSGVVAVRFFIGLHKNK+VNFELW+EAQAYQDVQLMPFVDYYSLLTLKTIAICIMGTKI
Sbjct: 409  VRSGVVAVRFFIGLHKNKEVNFELWREAQAYQDVQLMPFVDYYSLLTLKTIAICIMGTKI 468

Query: 813  FPAKYIMKTDDDAFVRVDEILISLKSKNPDGLLYGMVSLESKPQRDKDNKWYISPEEWPH 634
            FPAKYIMKTDDDAFVRVDEIL SLK+K  DGLLYG VSL+SKPQRDKDNKWYIS EEWPH
Sbjct: 469  FPAKYIMKTDDDAFVRVDEILASLKTKTSDGLLYGQVSLDSKPQRDKDNKWYISTEEWPH 528

Query: 633  ESYPPWAHGPGYVISRDIAKFIVRGHQERNLKLFKLEDVAMGIWIEQFKKHVREVQYIND 454
            ESYPPWAHGPGYVISRDIAKFIVRGH ER+LKLFKLEDVAMGIWIE+FKKHVREVQY ND
Sbjct: 529  ESYPPWAHGPGYVISRDIAKFIVRGHHERSLKLFKLEDVAMGIWIEEFKKHVREVQYEND 588

Query: 453  DRFYNAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD 334
            +RFYNAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD
Sbjct: 589  ERFYNAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD 628


>gb|KVH89696.1| Concanavalin A-like lectin/glucanase superfamily [Cynara cardunculus
            var. scolymus]
          Length = 617

 Score = 1037 bits (2682), Expect = 0.0
 Identities = 511/636 (80%), Positives = 551/636 (86%), Gaps = 3/636 (0%)
 Frame = -2

Query: 2232 MKKWTXXXXXXXXXXXXLVSYSFIQKQPRKQSSYEFFHPNEEIANLSSVSVVTKVEKLQK 2053
            MKKWT            L+SYSF+QKQPRKQSSYEFFHPNEE  NLS+ SV    E L+ 
Sbjct: 1    MKKWTGGVLIIGLACILLISYSFVQKQPRKQSSYEFFHPNEERGNLSN-SVQKTGEMLRT 59

Query: 2052 ---RPHLVKVNGLDYLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDSLPETAQGIKEAALA 1882
               RPHL+ V+GLDYLFN T+MPKEE+ KPLLAWGQMRLLLSRSDSLPETAQGIKEA++A
Sbjct: 60   SVGRPHLINVDGLDYLFNLTHMPKEEDTKPLLAWGQMRLLLSRSDSLPETAQGIKEASVA 119

Query: 1881 WKELRSIISKNRASKFVHNKYERNCSYSVSGVNNSTLFSSNGSIILEIPCGLIHDSSITI 1702
            WKEL S I +++ASKFV NK ERNCSYSVS +NNS LFSSNGS+IL +PCGLI DSSIT+
Sbjct: 120  WKELLSKIDESKASKFVDNKEERNCSYSVSLMNNSALFSSNGSVILGMPCGLIEDSSITV 179

Query: 1701 IAIPDGKQDGFEIEFVGSEGKEEPNPPVILHYNVVLPGDNFTKDPVIAQNTWTYDFGWGK 1522
            IAIPD KQDGF IE VGS GKEE +PPVILHYNVVLPGDNFTK+PVI QNTWTY+FGWGK
Sbjct: 180  IAIPDKKQDGFRIELVGSIGKEEQDPPVILHYNVVLPGDNFTKEPVIVQNTWTYEFGWGK 239

Query: 1521 EERCPNHGSPNNAKVDGLVKCNEQLTRSTMDENSNSRHLSVNKSTNASDVNDHNHVGSNF 1342
            EERCPN GS +NAKVDGLVKCNEQLT S+++ENSN +HLSVNKSTNASDV DHNH+ SNF
Sbjct: 240  EERCPNFGSSSNAKVDGLVKCNEQLTGSSLEENSNPKHLSVNKSTNASDVGDHNHMSSNF 299

Query: 1341 PFIDGSLFTATIWAGEEGFHGTVNGRHETSFAYREKLEPWLVSGVRVKGGLHIISALAKG 1162
            PF+DGS FTATIW G EGFH TVNGRHETSF YREKLEPWLVSGVRV GGL IISALAKG
Sbjct: 300  PFLDGSPFTATIWVGAEGFHATVNGRHETSFVYREKLEPWLVSGVRVTGGLRIISALAKG 359

Query: 1161 LPVSENLDMAMDLEDLKAPLISNKSKRLVLLIGVFSSGNNFERRMALRRSWMKYEAVRSG 982
            LPVSE+LD+A DLE LKAP ISNKSKRLVLLIGVFSSGNNFERRMALRRSWMKYEAVRSG
Sbjct: 360  LPVSEDLDVATDLEYLKAPSISNKSKRLVLLIGVFSSGNNFERRMALRRSWMKYEAVRSG 419

Query: 981  VVAVRFFIGLHKNKQVNFELWKEAQAYQDVQLMPFVDYYSLLTLKTIAICIMGTKIFPAK 802
            VVAVRFFIGLHKNKQVNFELW+EAQ YQDVQLMPF                  TKI PAK
Sbjct: 420  VVAVRFFIGLHKNKQVNFELWREAQTYQDVQLMPF------------------TKILPAK 461

Query: 801  YIMKTDDDAFVRVDEILISLKSKNPDGLLYGMVSLESKPQRDKDNKWYISPEEWPHESYP 622
            YIMKTDDDAFVRVDEIL SLK+K  DGLLYG+VSL+SKPQRDK+NKWYIS EEW H+SYP
Sbjct: 462  YIMKTDDDAFVRVDEILASLKTKTSDGLLYGLVSLDSKPQRDKENKWYISTEEWSHDSYP 521

Query: 621  PWAHGPGYVISRDIAKFIVRGHQERNLKLFKLEDVAMGIWIEQFKKHVREVQYINDDRFY 442
            PWAHGPGYV+SRDIAKFIVRGHQER+LKLFKLEDVAMGIW+EQF+KH  EVQYINDDRFY
Sbjct: 522  PWAHGPGYVVSRDIAKFIVRGHQERSLKLFKLEDVAMGIWVEQFQKHGHEVQYINDDRFY 581

Query: 441  NAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD 334
            NAGCEPNYILAHYQNPRMVLCLWEK+QKEHKPDCCD
Sbjct: 582  NAGCEPNYILAHYQNPRMVLCLWEKLQKEHKPDCCD 617


>ref|XP_021990360.1| hydroxyproline O-galactosyltransferase GALT3 [Helianthus annuus]
 gb|OTG13102.1| putative galactosyltransferase family protein [Helianthus annuus]
          Length = 620

 Score = 1015 bits (2624), Expect = 0.0
 Identities = 492/635 (77%), Positives = 553/635 (87%), Gaps = 2/635 (0%)
 Frame = -2

Query: 2232 MKKWTXXXXXXXXXXXXLVSYSFIQ-KQPRKQSSYEFFHPNEE-IANLSSVSVVTKVEKL 2059
            MKKWT            L SYSFIQ KQP+KQS+Y+FFHP+++ IAN S   V  K+E L
Sbjct: 1    MKKWTGGVLIIGLTCVLLFSYSFIQQKQPQKQSAYQFFHPDDDKIANFSK-PVGEKLETL 59

Query: 2058 QKRPHLVKVNGLDYLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDSLPETAQGIKEAALAW 1879
              +PHLV V+GLDYLFNST+  K EE KPLLAWGQMRLLLSRSDSLPETAQGIKEAA AW
Sbjct: 60   TIKPHLVDVDGLDYLFNSTHTLKREELKPLLAWGQMRLLLSRSDSLPETAQGIKEAAAAW 119

Query: 1878 KELRSIISKNRASKFVHNKYERNCSYSVSGVNNSTLFSSNGSIILEIPCGLIHDSSITII 1699
            K L+S I + R SKF H+  ++NCS+SVS VN S+L +SN ++IL IPCGLI DSS+T+I
Sbjct: 120  KVLKSEIDETRVSKFDHSTMDKNCSFSVSAVNGSSLVASNDTVILAIPCGLIVDSSVTLI 179

Query: 1698 AIPDGKQDGFEIEFVGSEGKEEPNPPVILHYNVVLPGDNFTKDPVIAQNTWTYDFGWGKE 1519
            AIPDG QDGF+IEFVGS+GK+E +PPV+LHYNVVLPGDNFTK+PVI QNTWTY+ GWGKE
Sbjct: 180  AIPDGNQDGFQIEFVGSQGKDETDPPVVLHYNVVLPGDNFTKEPVIVQNTWTYESGWGKE 239

Query: 1518 ERCPNHGSPNNAKVDGLVKCNEQLTRSTMDENSNSRHLSVNKSTNASDVNDHNHVGSNFP 1339
            ERCPNHGS NN KVDGLVKCNEQLTR++ +E SN              V+DHNH+GSNFP
Sbjct: 240  ERCPNHGSSNNVKVDGLVKCNEQLTRTSTEEKSN--------------VSDHNHMGSNFP 285

Query: 1338 FIDGSLFTATIWAGEEGFHGTVNGRHETSFAYREKLEPWLVSGVRVKGGLHIISALAKGL 1159
            F DGSLF ATIWAG+EGF+ T+NGRHETSFAYREKLEPWLVSGVRVKGGL IIS LA GL
Sbjct: 286  FSDGSLFAATIWAGKEGFYATINGRHETSFAYREKLEPWLVSGVRVKGGLRIISTLATGL 345

Query: 1158 PVSENLDMAMDLEDLKAPLISNKSKRLVLLIGVFSSGNNFERRMALRRSWMKYEAVRSGV 979
            PVSE++DM M++E LKAPLI  +SKRL+LLIGVFSSGNNF+RRMALRRSWM+Y+AVRSGV
Sbjct: 346  PVSEDIDMVMNVESLKAPLIPKQSKRLLLLIGVFSSGNNFKRRMALRRSWMQYDAVRSGV 405

Query: 978  VAVRFFIGLHKNKQVNFELWKEAQAYQDVQLMPFVDYYSLLTLKTIAICIMGTKIFPAKY 799
            VAVRFFIGLHKNKQVN+ELWKEAQAY+DVQLMPFVDYYSLLTLKTIAICIMGTKI PAKY
Sbjct: 406  VAVRFFIGLHKNKQVNYELWKEAQAYEDVQLMPFVDYYSLLTLKTIAICIMGTKILPAKY 465

Query: 798  IMKTDDDAFVRVDEILISLKSKNPDGLLYGMVSLESKPQRDKDNKWYISPEEWPHESYPP 619
            IMKTDDDAFVRVDEIL SLK+KNPDGLLYG VSL+SKPQRDKDNKWYIS EEWP +SYPP
Sbjct: 466  IMKTDDDAFVRVDEILASLKTKNPDGLLYGQVSLDSKPQRDKDNKWYISTEEWPEDSYPP 525

Query: 618  WAHGPGYVISRDIAKFIVRGHQERNLKLFKLEDVAMGIWIEQFKKHVREVQYINDDRFYN 439
            WAHGPGYVISRDIAKFIV+GHQ+RNLKLFKLEDVA+GIWI+QFKK +REVQY+ND+RF+N
Sbjct: 526  WAHGPGYVISRDIAKFIVQGHQQRNLKLFKLEDVAVGIWIQQFKKQIREVQYVNDERFHN 585

Query: 438  AGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD 334
            +GCEPNYILAHYQNPRMVLCLWEK+QKEHKPDCCD
Sbjct: 586  SGCEPNYILAHYQNPRMVLCLWEKLQKEHKPDCCD 620


>ref|XP_017230832.1| PREDICTED: hydroxyproline O-galactosyltransferase GALT3 [Daucus
            carota subsp. sativus]
 gb|KZN09875.1| hypothetical protein DCAR_002531 [Daucus carota subsp. sativus]
          Length = 634

 Score =  840 bits (2171), Expect = 0.0
 Identities = 416/642 (64%), Positives = 501/642 (78%), Gaps = 9/642 (1%)
 Frame = -2

Query: 2232 MKKWTXXXXXXXXXXXXLVSYSFIQKQPRKQSSYEFF--HPNEEIANLSSVSVVTKVEKL 2059
            MK W+            L+ YSF+ KQP KQS+Y+FF  H +++       S   K E +
Sbjct: 1    MKNWSGGVLIVGLGLILLLRYSFVGKQPHKQSAYDFFNSHLSKDSTESGDSSNTVKTETV 60

Query: 2058 Q---KRPHLVKVNGLDYLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDSLPETAQGIKEAA 1888
            Q   KRPH V V GLD L+    M  EEE+K LL W QMR+LLSRSD+LPET QGIKEAA
Sbjct: 61   QNPEKRPHFVDVEGLDDLYAFRNM-SEEESKVLLVWSQMRMLLSRSDALPETFQGIKEAA 119

Query: 1887 LAWKELRSIISKNRASKFVHN-KYERNCSYSV---SGVNNSTLFSSNGSIILEIPCGLIH 1720
            ++WKEL S+I K+++S+   N + ++ C Y V   SG+N ST   S    ILEIPCGL+ 
Sbjct: 120  VSWKELLSLIEKDKSSQLNDNIQNDKKCPYYVGMPSGLNTST---SGSGYILEIPCGLVE 176

Query: 1719 DSSITIIAIPDGKQDGFEIEFVGSEGKEEPNPPVILHYNVVLPGDNFTKDPVIAQNTWTY 1540
            DSS+T+I IP+  Q  F IE V S+  EE NPP++LH+NV LPG+N TK+P+I QNTWT 
Sbjct: 177  DSSVTLIGIPNRGQGNFTIELVASKFPEEQNPPIVLHFNVFLPGENLTKEPIIVQNTWTN 236

Query: 1539 DFGWGKEERCPNHGSPNNAKVDGLVKCNEQLTRSTMDENSNSRHLSVNKSTNASDVNDHN 1360
            + GWGKEERCPNH S N   VDGL KCNE++  S  +E +++ +LSVN+S+N S  +   
Sbjct: 237  ETGWGKEERCPNHHSINTTNVDGLAKCNEEVATSAEEEIAHASNLSVNQSSNVS--SGSA 294

Query: 1359 HVGSNFPFIDGSLFTATIWAGEEGFHGTVNGRHETSFAYREKLEPWLVSGVRVKGGLHII 1180
            HV +NFPF +GS FTAT+W G EGFH +VNGRHETSF YREKLEPWL++GVR+ G +  +
Sbjct: 295  HVSANFPFSEGSPFTATLWTGVEGFHMSVNGRHETSFEYREKLEPWLINGVRLLGDVEPV 354

Query: 1179 SALAKGLPVSENLDMAMDLEDLKAPLISNKSKRLVLLIGVFSSGNNFERRMALRRSWMKY 1000
            SA+AKGLPVSE+LD+ +D+E LKAP+ + K  RLVLLIGVFSS NNF RRMALRRSWM+Y
Sbjct: 355  SAIAKGLPVSEDLDLIVDVEHLKAPVTAKK--RLVLLIGVFSSCNNFNRRMALRRSWMQY 412

Query: 999  EAVRSGVVAVRFFIGLHKNKQVNFELWKEAQAYQDVQLMPFVDYYSLLTLKTIAICIMGT 820
            +AVRSG VAVRFF GLHKN QVNF+LWKE+QAY D+QLMPFVDYYSL++LKTIAIC MGT
Sbjct: 413  DAVRSGEVAVRFFTGLHKNIQVNFQLWKESQAYGDMQLMPFVDYYSLISLKTIAICTMGT 472

Query: 819  KIFPAKYIMKTDDDAFVRVDEILISLKSKNPDGLLYGMVSLESKPQRDKDNKWYISPEEW 640
            KI PAKYIMKTDDDAFVR+DE+L SLK K  DGLLYG++S ESKPQRD +NKW+IS EEW
Sbjct: 473  KILPAKYIMKTDDDAFVRIDEVLSSLKQKASDGLLYGLISFESKPQRDAENKWFISTEEW 532

Query: 639  PHESYPPWAHGPGYVISRDIAKFIVRGHQERNLKLFKLEDVAMGIWIEQFKKHVREVQYI 460
            PHESYPPWAHGPGY+ISRDIAKFIV+ HQ+R LKLFKLEDV+MGIWIE+FK+   EVQYI
Sbjct: 533  PHESYPPWAHGPGYIISRDIAKFIVQAHQKRELKLFKLEDVSMGIWIEKFKERGHEVQYI 592

Query: 459  NDDRFYNAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD 334
            +D+RFYNAGCEPNYILAHYQNPRMVLCLWEK+QKEHKPDCC+
Sbjct: 593  SDERFYNAGCEPNYILAHYQNPRMVLCLWEKLQKEHKPDCCE 634


>ref|XP_023910680.1| hydroxyproline O-galactosyltransferase GALT3 [Quercus suber]
 gb|POF12422.1| hydroxyproline o-galactosyltransferase galt3 [Quercus suber]
          Length = 635

 Score =  827 bits (2136), Expect = 0.0
 Identities = 407/643 (63%), Positives = 492/643 (76%), Gaps = 10/643 (1%)
 Frame = -2

Query: 2232 MKKWTXXXXXXXXXXXXLVSYSFIQKQPRKQSSYEFF--HPNEEIANLSSVSV------V 2077
            MKKW+            ++ YS I  QP+KQS+Y+FF  HP  +     S ++      V
Sbjct: 1    MKKWSGGMCIVALAVILVLRYSLIGIQPQKQSAYDFFRNHPTSDPQMKDSGTIKSSKMEV 60

Query: 2076 TKVEKLQKRPHLVKVNGLDYLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDSLPETAQGIK 1897
             KV KL K+P L  + GL  L+ S  +  EEE+K LL W  MR+LLSRSD+ PET  G+K
Sbjct: 61   KKVSKLLKKPPLKDIEGLSDLYASKNI-SEEESKALLVWAHMRMLLSRSDTFPETVDGVK 119

Query: 1896 EAALAWKELRSIISKNRASKFVH--NKYERNCSYSVSGVNNSTLFSSNGSIILEIPCGLI 1723
            EA++AWK L S I K +ASKF +  N  ++NC YSVS ++ + +   +  +ILE+PCGL+
Sbjct: 120  EASIAWKGLLSTIEKEKASKFSYSNNSDDKNCPYSVSTLDKTAM---SDGVILELPCGLV 176

Query: 1722 HDSSITIIAIPDGKQDGFEIEFVGSEGKEEPNPPVILHYNVVLPGDNFTKDPVIAQNTWT 1543
             DSSIT++ IP+G+   F+IE VGS+   EP PP+ LH+NV LPGDN T++P I QNTWT
Sbjct: 177  GDSSITLVGIPNGQNGSFQIELVGSQLSGEPTPPINLHFNVSLPGDNMTEEPFIVQNTWT 236

Query: 1542 YDFGWGKEERCPNHGSPNNAKVDGLVKCNEQLTRSTMDENSNSRHLSVNKSTNASDVNDH 1363
             + GWGKEERCP HGS N  KVDGLV CNEQ+ RSTM++N N  H + +  TN S  + H
Sbjct: 237  SEAGWGKEERCPAHGSANIIKVDGLVLCNEQIARSTMEDNLNVSHPTSDMFTNVSRESAH 296

Query: 1362 NHVGSNFPFIDGSLFTATIWAGEEGFHGTVNGRHETSFAYREKLEPWLVSGVRVKGGLHI 1183
              V  +FPF++G+ FTAT+W G EGFH T+NGRHETSFAYREKLEPW V+ V+V GGL +
Sbjct: 297  GSV--SFPFVEGNPFTATLWVGLEGFHMTINGRHETSFAYREKLEPWSVNRVKVAGGLDL 354

Query: 1182 ISALAKGLPVSENLDMAMDLEDLKAPLISNKSKRLVLLIGVFSSGNNFERRMALRRSWMK 1003
            +SALAKGLPVSE+ D+ ++LE LKAP +S K  RL+LLIG+FSSGNNFERRMALRRSWM+
Sbjct: 355  LSALAKGLPVSEDNDLVVELEHLKAPSVSRK--RLILLIGIFSSGNNFERRMALRRSWMQ 412

Query: 1002 YEAVRSGVVAVRFFIGLHKNKQVNFELWKEAQAYQDVQLMPFVDYYSLLTLKTIAICIMG 823
            YEAVRSG VAVRFFIGLHKN QVN+ELW+EAQAY D+QLMPFVDYYSL+ LKT+AICI G
Sbjct: 413  YEAVRSGDVAVRFFIGLHKNNQVNYELWREAQAYGDIQLMPFVDYYSLIALKTVAICIFG 472

Query: 822  TKIFPAKYIMKTDDDAFVRVDEILISLKSKNPDGLLYGMVSLESKPQRDKDNKWYISPEE 643
            TKI PAK+IMKTDDDAFVR+DE+L SLK K P+GLLYG++S ES P RDKD+KWYIS EE
Sbjct: 473  TKILPAKFIMKTDDDAFVRIDEVLSSLKGKAPNGLLYGLISFESAPHRDKDSKWYISTEE 532

Query: 642  WPHESYPPWAHGPGYVISRDIAKFIVRGHQERNLKLFKLEDVAMGIWIEQFKKHVREVQY 463
            WP  SYPPWAHGPGY+ISRDIAKFIVRGH ER LKLFKLEDVAMGIWIEQFKK  +EV Y
Sbjct: 533  WPPASYPPWAHGPGYIISRDIAKFIVRGHHERELKLFKLEDVAMGIWIEQFKKSGQEVHY 592

Query: 462  INDDRFYNAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD 334
            INDDRF+NAGCEPNY+LAHYQ PR VLCLWE + KEH+  CC+
Sbjct: 593  INDDRFHNAGCEPNYVLAHYQGPRKVLCLWETLHKEHRALCCE 635


>dbj|GAY35602.1| hypothetical protein CUMW_017250 [Citrus unshiu]
          Length = 686

 Score =  827 bits (2135), Expect = 0.0
 Identities = 407/662 (61%), Positives = 503/662 (75%), Gaps = 16/662 (2%)
 Frame = -2

Query: 2235 QMKKWTXXXXXXXXXXXXLVSYSFI------------QKQPRKQSSYEFF--HPNEE--I 2104
            +M+ W+            ++SYSF+            QKQ  KQS+ +FF  HP+ +  +
Sbjct: 12   KMRNWSGGLLIMALAIILVMSYSFMGTQTQTQHRTQTQKQKHKQSANDFFRNHPSNDSDM 71

Query: 2103 ANLSSVSVVTKVEKLQKRPHLVKVNGLDYLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDS 1924
                 V  V K +KL ++PH++ V GL  L++   M   E+++PLL WG MRLLLSRSD+
Sbjct: 72   KGSQGVKEVKKTQKLFEKPHIINVQGLGDLYSLKNM-LGEDSRPLLVWGHMRLLLSRSDA 130

Query: 1923 LPETAQGIKEAALAWKELRSIISKNRASKFVHNKYERNCSYSVSGVNNSTLFSSNGSIIL 1744
            LPETAQG+KEAA+AWK+L S+I + +ASKF   K   NC   VS ++ S    S+G +I+
Sbjct: 131  LPETAQGVKEAAIAWKDLLSVIEEEKASKFSRRK---NCPPFVSNLSKSL---SSGRLII 184

Query: 1743 EIPCGLIHDSSITIIAIPDGKQDGFEIEFVGSEGKEEPNPPVILHYNVVLPGDNFTKDPV 1564
            E+PCGL+ DSSIT++ IPDG+   F+IE +GS+   E NPP+ILHYNV LPGDN T++P 
Sbjct: 185  EVPCGLVEDSSITLVGIPDGRYGSFQIELIGSQLSGESNPPIILHYNVSLPGDNMTEEPF 244

Query: 1563 IAQNTWTYDFGWGKEERCPNHGSPNNAKVDGLVKCNEQLTRSTMDENSNSRHLSVNKSTN 1384
            I QN+WT + GWGKEERCP HGS N  KVD LV CNEQ+ R +++EN N+ H + +    
Sbjct: 245  IIQNSWTNELGWGKEERCPAHGSSNTLKVDELVLCNEQVLRRSVEENQNTSHPTPSSYML 304

Query: 1383 ASDVNDHNHVGSNFPFIDGSLFTATIWAGEEGFHGTVNGRHETSFAYREKLEPWLVSGVR 1204
            A+     +H  SNFPF+DG+ FT TIW G +GFH TVNGRHETS AYREKLEPW V+GV+
Sbjct: 305  ANASRVGSHETSNFPFVDGNPFTTTIWVGLDGFHMTVNGRHETSLAYREKLEPWSVTGVK 364

Query: 1203 VKGGLHIISALAKGLPVSENLDMAMDLEDLKAPLISNKSKRLVLLIGVFSSGNNFERRMA 1024
            V GG+ + SA A+GLPVSE+ D  +D+E LKAPLIS K  RLV+LIGVFS+GNNFERRMA
Sbjct: 365  VAGGVDLFSAFAEGLPVSEDFDFIVDVEHLKAPLISRK--RLVMLIGVFSTGNNFERRMA 422

Query: 1023 LRRSWMKYEAVRSGVVAVRFFIGLHKNKQVNFELWKEAQAYQDVQLMPFVDYYSLLTLKT 844
            LRRSWM+Y AVRSG VAVRFFIGLHKN+QVNFELWKEAQAY D+Q+MPFVDYYSL++LKT
Sbjct: 423  LRRSWMQYPAVRSGDVAVRFFIGLHKNRQVNFELWKEAQAYGDIQIMPFVDYYSLISLKT 482

Query: 843  IAICIMGTKIFPAKYIMKTDDDAFVRVDEILISLKSKNPDGLLYGMVSLESKPQRDKDNK 664
            IAICI GTKI PAKYIMKTDDDAFVR+DE+L +LK K  +GLL+G++S +S PQRDKD+K
Sbjct: 483  IAICIFGTKILPAKYIMKTDDDAFVRIDEVLSNLKEKPSNGLLFGLISYDSSPQRDKDSK 542

Query: 663  WYISPEEWPHESYPPWAHGPGYVISRDIAKFIVRGHQERNLKLFKLEDVAMGIWIEQFKK 484
            WYIS EEWPH SYPPWAHGPGY+ISRDIAKFIV+GHQER+LKLFKLEDVAMGIWIEQFK 
Sbjct: 543  WYISNEEWPHSSYPPWAHGPGYIISRDIAKFIVQGHQERDLKLFKLEDVAMGIWIEQFKN 602

Query: 483  HVREVQYINDDRFYNAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD*DNILHNVYI 304
              +EV Y++DDRFYNAGCE +YILAHYQ PRMVLCLWEK+QK+H+  CC+  N    +Y 
Sbjct: 603  TGQEVHYMSDDRFYNAGCESDYILAHYQGPRMVLCLWEKLQKDHRAFCCELMNPTQQLYE 662

Query: 303  ML 298
            M+
Sbjct: 663  MV 664


>dbj|GAY35604.1| hypothetical protein CUMW_017260 [Citrus unshiu]
          Length = 652

 Score =  824 bits (2129), Expect = 0.0
 Identities = 404/650 (62%), Positives = 498/650 (76%), Gaps = 16/650 (2%)
 Frame = -2

Query: 2235 QMKKWTXXXXXXXXXXXXLVSYSFI------------QKQPRKQSSYEFF--HPNEE--I 2104
            +M+ W+            ++SYSF+            QKQ  KQS+ +FF  HP+ +  +
Sbjct: 12   KMRNWSGGLLIMALAIILVMSYSFMGTQTQTQHRTQTQKQKHKQSANDFFRNHPSNDSDM 71

Query: 2103 ANLSSVSVVTKVEKLQKRPHLVKVNGLDYLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDS 1924
                 V  V K +KL ++PH++ V GL  L++   M   E+++PLL WG MRLLLSRSD+
Sbjct: 72   KGSQGVKEVKKTQKLFEKPHIINVQGLGDLYSLKNM-LGEDSRPLLVWGHMRLLLSRSDA 130

Query: 1923 LPETAQGIKEAALAWKELRSIISKNRASKFVHNKYERNCSYSVSGVNNSTLFSSNGSIIL 1744
            LPETAQG+KEAA+AWK+L S+I + +ASKF   K   NC   VS ++ S    S+G +I+
Sbjct: 131  LPETAQGVKEAAIAWKDLLSVIEEEKASKFSRRK---NCPPFVSNLSKSL---SSGRLII 184

Query: 1743 EIPCGLIHDSSITIIAIPDGKQDGFEIEFVGSEGKEEPNPPVILHYNVVLPGDNFTKDPV 1564
            E+PCGL+ DSSIT++ IPDG+   F+IE +GS+   E NPP+ILHYNV LPGDN T++P 
Sbjct: 185  EVPCGLVEDSSITLVGIPDGRYGSFQIELIGSQLSGESNPPIILHYNVSLPGDNMTEEPF 244

Query: 1563 IAQNTWTYDFGWGKEERCPNHGSPNNAKVDGLVKCNEQLTRSTMDENSNSRHLSVNKSTN 1384
            I QN+WT + GWGKEERCP HGS N  KVD LV CNEQ+ R +++EN N+ H + +    
Sbjct: 245  IIQNSWTNELGWGKEERCPAHGSSNTLKVDELVLCNEQVLRRSVEENQNTSHPTPSSYML 304

Query: 1383 ASDVNDHNHVGSNFPFIDGSLFTATIWAGEEGFHGTVNGRHETSFAYREKLEPWLVSGVR 1204
            A+     +H  SNFPF+DG+ FT TIW G +GFH TVNGRHETS AYREKLEPW V+GV+
Sbjct: 305  ANASRVGSHETSNFPFVDGNPFTTTIWVGLDGFHMTVNGRHETSLAYREKLEPWSVTGVK 364

Query: 1203 VKGGLHIISALAKGLPVSENLDMAMDLEDLKAPLISNKSKRLVLLIGVFSSGNNFERRMA 1024
            V GG+ + SA A+GLPVSE+ D  +D+E LKAPLIS K  RLV+LIGVFS+GNNFERRMA
Sbjct: 365  VAGGVDLFSAFAEGLPVSEDFDFIVDVEHLKAPLISRK--RLVMLIGVFSTGNNFERRMA 422

Query: 1023 LRRSWMKYEAVRSGVVAVRFFIGLHKNKQVNFELWKEAQAYQDVQLMPFVDYYSLLTLKT 844
            LRRSWM+Y AVRSG VAVRFFIGLHKN+QVNFELWKEAQAY D+Q+MPFVDYYSL++LKT
Sbjct: 423  LRRSWMQYPAVRSGDVAVRFFIGLHKNRQVNFELWKEAQAYGDIQIMPFVDYYSLISLKT 482

Query: 843  IAICIMGTKIFPAKYIMKTDDDAFVRVDEILISLKSKNPDGLLYGMVSLESKPQRDKDNK 664
            IAICI GTKI PAKYIMKTDDDAFVR+DE+L +LK K  +GLL+G++S +S PQRDKD+K
Sbjct: 483  IAICIFGTKILPAKYIMKTDDDAFVRIDEVLSNLKEKPSNGLLFGLISYDSSPQRDKDSK 542

Query: 663  WYISPEEWPHESYPPWAHGPGYVISRDIAKFIVRGHQERNLKLFKLEDVAMGIWIEQFKK 484
            WYIS EEWPH SYPPWAHGPGY+ISRDIAKFIV+GHQER+LKLFKLEDVAMGIWIEQFK 
Sbjct: 543  WYISNEEWPHSSYPPWAHGPGYIISRDIAKFIVQGHQERDLKLFKLEDVAMGIWIEQFKN 602

Query: 483  HVREVQYINDDRFYNAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD 334
              +EV Y++DDRFYNAGCE +YILAHYQ PRMVLCLWEK+QK+H+  CC+
Sbjct: 603  TGQEVHYMSDDRFYNAGCESDYILAHYQGPRMVLCLWEKLQKDHRAFCCE 652


>ref|XP_006419292.1| hydroxyproline O-galactosyltransferase GALT3 isoform X2 [Citrus
            clementina]
 gb|ESR32532.1| hypothetical protein CICLE_v10004515mg [Citrus clementina]
          Length = 652

 Score =  821 bits (2121), Expect = 0.0
 Identities = 403/650 (62%), Positives = 496/650 (76%), Gaps = 16/650 (2%)
 Frame = -2

Query: 2235 QMKKWTXXXXXXXXXXXXLVSYSFI------------QKQPRKQSSYEFF--HPNEE--I 2104
            +M+ W+            ++SYSF+            QKQ  KQS+ +FF  HP+ +  +
Sbjct: 12   KMRNWSGGLLIMALAIILVMSYSFMGTQTQTQHRTQTQKQKHKQSANDFFRNHPSNDSDM 71

Query: 2103 ANLSSVSVVTKVEKLQKRPHLVKVNGLDYLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDS 1924
                 V  V K +KL ++PH++ V GL  L++   M   E+++PLL WG MRLLLSRSD+
Sbjct: 72   KGSQGVKEVKKTQKLFEKPHIINVQGLGDLYSLKNM-LGEDSRPLLVWGHMRLLLSRSDA 130

Query: 1923 LPETAQGIKEAALAWKELRSIISKNRASKFVHNKYERNCSYSVSGVNNSTLFSSNGSIIL 1744
            LPETAQG+KEAA+AWK+L S+I + +ASKF   K   NC   VS ++ S    S+G +I+
Sbjct: 131  LPETAQGVKEAAIAWKDLLSVIEEEKASKFSRRK---NCPPFVSNLSKSL---SSGRLII 184

Query: 1743 EIPCGLIHDSSITIIAIPDGKQDGFEIEFVGSEGKEEPNPPVILHYNVVLPGDNFTKDPV 1564
            E+PCGL+ DSSIT++ IPDG+   F+IE +GS+   E NPP+ILHYNV LPGDN T++P 
Sbjct: 185  EVPCGLVEDSSITLVGIPDGRYGSFQIELIGSQLSGESNPPIILHYNVSLPGDNMTEEPF 244

Query: 1563 IAQNTWTYDFGWGKEERCPNHGSPNNAKVDGLVKCNEQLTRSTMDENSNSRHLSVNKSTN 1384
            I QN+WT + GWGKEERCP HGS N  KVD LV CNEQ+ R +++EN N+ H + +    
Sbjct: 245  IIQNSWTNELGWGKEERCPAHGSSNTLKVDELVLCNEQVLRRSVEENQNTSHPTPSSDIL 304

Query: 1383 ASDVNDHNHVGSNFPFIDGSLFTATIWAGEEGFHGTVNGRHETSFAYREKLEPWLVSGVR 1204
            A+      H  SNFPF+DG+ FT TIW G +GFH TVNGRHETS AYREKLEPW V+GV+
Sbjct: 305  ANASRVGAHETSNFPFVDGNPFTTTIWVGLDGFHMTVNGRHETSLAYREKLEPWSVTGVK 364

Query: 1203 VKGGLHIISALAKGLPVSENLDMAMDLEDLKAPLISNKSKRLVLLIGVFSSGNNFERRMA 1024
            V GG+ + SA A+GLPVSE+ D  +D+E LKAPLIS K  RLV+LIGVFS+GNNFERRMA
Sbjct: 365  VAGGVDLFSAFAEGLPVSEDFDFIVDVEHLKAPLISRK--RLVMLIGVFSTGNNFERRMA 422

Query: 1023 LRRSWMKYEAVRSGVVAVRFFIGLHKNKQVNFELWKEAQAYQDVQLMPFVDYYSLLTLKT 844
            LRRSWM+Y AVRSG VAV FFIGLHKN+QVNFELWKEAQAY D+Q+MPFVDYYSL++LKT
Sbjct: 423  LRRSWMQYPAVRSGDVAVLFFIGLHKNRQVNFELWKEAQAYGDIQIMPFVDYYSLISLKT 482

Query: 843  IAICIMGTKIFPAKYIMKTDDDAFVRVDEILISLKSKNPDGLLYGMVSLESKPQRDKDNK 664
            IAICI GTKI PAKYIMKTDDDAFVR+DE+L +LK K  +GLL+G++S +S PQRDKD+K
Sbjct: 483  IAICIFGTKILPAKYIMKTDDDAFVRIDEVLSNLKEKPSNGLLFGLISYDSSPQRDKDSK 542

Query: 663  WYISPEEWPHESYPPWAHGPGYVISRDIAKFIVRGHQERNLKLFKLEDVAMGIWIEQFKK 484
            WYIS EEWPH SYPPWAHGPGY+ISRDIAKFIV+GHQER+LKLFKLEDVAMGIWIEQFK 
Sbjct: 543  WYISNEEWPHSSYPPWAHGPGYIISRDIAKFIVQGHQERDLKLFKLEDVAMGIWIEQFKN 602

Query: 483  HVREVQYINDDRFYNAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD 334
              +EV Y++DDRFYNAGCE +YILAHYQ PRMVLCLWEK+QK+H+  CC+
Sbjct: 603  TGQEVHYMSDDRFYNAGCESDYILAHYQGPRMVLCLWEKLQKDHRAFCCE 652


>gb|EOY06836.1| Beta-1,3-galactosyltransferase 16 isoform 1 [Theobroma cacao]
          Length = 643

 Score =  821 bits (2120), Expect = 0.0
 Identities = 412/641 (64%), Positives = 486/641 (75%), Gaps = 7/641 (1%)
 Frame = -2

Query: 2235 QMKKWTXXXXXXXXXXXXLVSYSFIQKQPRKQSSYEFF--HP-----NEEIANLSSVSVV 2077
            +MKKW             + SYS  + QP+KQS+Y+FF  HP      +E  ++ S  V 
Sbjct: 12   RMKKWYGGVLIVVLAIILVFSYSLRETQPKKQSAYDFFNNHPPKDSHTKENDSIKSPKVE 71

Query: 2076 TKVEKLQKRPHLVKVNGLDYLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDSLPETAQGIK 1897
             K   L K+P L+ V GL+ L+  T +   EE+K LL W  MRLLLSRSD+LPET QGIK
Sbjct: 72   VKKLALIKKPKLINVEGLNDLYAPTNI--SEESKALLLWPHMRLLLSRSDALPETGQGIK 129

Query: 1896 EAALAWKELRSIISKNRASKFVHNKYERNCSYSVSGVNNSTLFSSNGSIILEIPCGLIHD 1717
            EAA+AWKEL ++I + + +       E+NC +SVS ++  TLFS  G  ILE+PCGL+ D
Sbjct: 130  EAAIAWKELLAVIEEEKTTSHNIRLKEKNCPFSVSNLDK-TLFS--GGNILELPCGLVED 186

Query: 1716 SSITIIAIPDGKQDGFEIEFVGSEGKEEPNPPVILHYNVVLPGDNFTKDPVIAQNTWTYD 1537
            SSIT+I IPDG+   FEIE  GS    EP P VILHYNV + GDN T++P I QNTWT +
Sbjct: 187  SSITVIGIPDGRYRSFEIELAGSNFSGEPQPSVILHYNVSVAGDNMTEEPFIVQNTWTNE 246

Query: 1536 FGWGKEERCPNHGSPNNAKVDGLVKCNEQLTRSTMDENSNSRHLSVNKSTNASDVNDHNH 1357
             GWGKEERCP H S NN KVD L  CNEQL RS M+EN N    S N  TNAS     +H
Sbjct: 247  LGWGKEERCPAHVSSNNLKVDRLGLCNEQLVRSLMEENQNVSLSSGNALTNASQAR--SH 304

Query: 1356 VGSNFPFIDGSLFTATIWAGEEGFHGTVNGRHETSFAYREKLEPWLVSGVRVKGGLHIIS 1177
              +NFPFI+G+ FTAT+W G EGFH TVNGRHETSFAYREKLEPW VSGV+V GGL ++S
Sbjct: 305  ASANFPFIEGNPFTATLWVGLEGFHMTVNGRHETSFAYREKLEPWSVSGVKVAGGLDLLS 364

Query: 1176 ALAKGLPVSENLDMAMDLEDLKAPLISNKSKRLVLLIGVFSSGNNFERRMALRRSWMKYE 997
            A AKGLPV E+ D+ ++ + LKAP +S K  RL++L+GVFS+GNNFERRMALRRSWM+++
Sbjct: 365  AFAKGLPVPEDHDLIVNSKLLKAPAVSRK--RLLMLVGVFSTGNNFERRMALRRSWMQFQ 422

Query: 996  AVRSGVVAVRFFIGLHKNKQVNFELWKEAQAYQDVQLMPFVDYYSLLTLKTIAICIMGTK 817
            AVRSG VAVRFFIGL+KN+QVNFELWKEAQAY D+Q MPFVDYYSL++LKTIAICI+GTK
Sbjct: 423  AVRSGDVAVRFFIGLNKNRQVNFELWKEAQAYGDIQFMPFVDYYSLISLKTIAICILGTK 482

Query: 816  IFPAKYIMKTDDDAFVRVDEILISLKSKNPDGLLYGMVSLESKPQRDKDNKWYISPEEWP 637
            I PAKYIMKTDDDAFVR+DE+L SLK K  DGLLYG ++ +S P RDKD+KWYIS EEWP
Sbjct: 483  ILPAKYIMKTDDDAFVRIDEVLSSLKEKASDGLLYGRIAFDSSPHRDKDSKWYISNEEWP 542

Query: 636  HESYPPWAHGPGYVISRDIAKFIVRGHQERNLKLFKLEDVAMGIWIEQFKKHVREVQYIN 457
            H SYPPWAHGPGY+ISRDIAKFIVRGHQER LKLFKLEDVAMGIWIE+FK   REV YI 
Sbjct: 543  HSSYPPWAHGPGYIISRDIAKFIVRGHQERELKLFKLEDVAMGIWIEEFKNSGREVHYIT 602

Query: 456  DDRFYNAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD 334
            D+RFYNAGCE NYILAHYQ PRMVLCLWEK+QKEH+  CC+
Sbjct: 603  DERFYNAGCESNYILAHYQGPRMVLCLWEKLQKEHQAHCCE 643


>ref|XP_007035910.2| PREDICTED: hydroxyproline O-galactosyltransferase GALT3 [Theobroma
            cacao]
 ref|XP_017975894.1| PREDICTED: hydroxyproline O-galactosyltransferase GALT3 [Theobroma
            cacao]
          Length = 643

 Score =  820 bits (2119), Expect = 0.0
 Identities = 410/641 (63%), Positives = 486/641 (75%), Gaps = 7/641 (1%)
 Frame = -2

Query: 2235 QMKKWTXXXXXXXXXXXXLVSYSFIQKQPRKQSSYEFF--HP-----NEEIANLSSVSVV 2077
            +MKKW             + SYS  + QP+KQS+Y+FF  HP      +E  ++ S  V 
Sbjct: 12   RMKKWYGGVLIVVLAIILVFSYSLRETQPKKQSAYDFFNNHPPKDSHTKENDSIKSPKVE 71

Query: 2076 TKVEKLQKRPHLVKVNGLDYLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDSLPETAQGIK 1897
             K   L K+P L+ V GL+ L+  T +   E++K LL W  MRLLLSRSD+LPET QGIK
Sbjct: 72   VKKLALIKKPKLINVEGLNDLYAPTNI--SEKSKALLLWPHMRLLLSRSDALPETGQGIK 129

Query: 1896 EAALAWKELRSIISKNRASKFVHNKYERNCSYSVSGVNNSTLFSSNGSIILEIPCGLIHD 1717
            EA +AWKEL ++I + + +       E+NC +SVS ++  TLFS  G  ILE+PCGL+ D
Sbjct: 130  EATIAWKELLAVIEEEKTTSHNIRLKEKNCPFSVSNLDK-TLFS--GGNILELPCGLVED 186

Query: 1716 SSITIIAIPDGKQDGFEIEFVGSEGKEEPNPPVILHYNVVLPGDNFTKDPVIAQNTWTYD 1537
            SSIT+I IPDG+   FEIE  GS    EP P VILHYNV + GDN T++P I QNTWT +
Sbjct: 187  SSITVIGIPDGRYRSFEIELAGSNFSGEPQPSVILHYNVSVAGDNMTEEPFIVQNTWTNE 246

Query: 1536 FGWGKEERCPNHGSPNNAKVDGLVKCNEQLTRSTMDENSNSRHLSVNKSTNASDVNDHNH 1357
             GWGKEERCP H S NN KVDGL  CNEQL RS M+EN N    S N  TNAS     +H
Sbjct: 247  LGWGKEERCPAHVSSNNLKVDGLGLCNEQLVRSLMEENQNVSLSSGNALTNASQAR--SH 304

Query: 1356 VGSNFPFIDGSLFTATIWAGEEGFHGTVNGRHETSFAYREKLEPWLVSGVRVKGGLHIIS 1177
              +NFPFI+G+ FTAT+W G EGFH TVNGRHETSFAYREKLEPW VSGV+V GGL ++S
Sbjct: 305  ASANFPFIEGNPFTATLWVGLEGFHMTVNGRHETSFAYREKLEPWSVSGVKVAGGLDLLS 364

Query: 1176 ALAKGLPVSENLDMAMDLEDLKAPLISNKSKRLVLLIGVFSSGNNFERRMALRRSWMKYE 997
            A AKGLPV E+ D+ ++ + LKAP +S K  RL++L+GVFS+GNNFERRMALRRSWM+++
Sbjct: 365  AFAKGLPVPEDHDLIVNSKLLKAPAVSRK--RLLMLVGVFSTGNNFERRMALRRSWMQFQ 422

Query: 996  AVRSGVVAVRFFIGLHKNKQVNFELWKEAQAYQDVQLMPFVDYYSLLTLKTIAICIMGTK 817
            AVRSG VAVRFFIGL+KN+QVNFELWKEAQAY D+Q MPFVDYYSL++LKTIAICI+GTK
Sbjct: 423  AVRSGDVAVRFFIGLNKNRQVNFELWKEAQAYGDIQFMPFVDYYSLISLKTIAICILGTK 482

Query: 816  IFPAKYIMKTDDDAFVRVDEILISLKSKNPDGLLYGMVSLESKPQRDKDNKWYISPEEWP 637
            I PAKYIMKTDDDAFVR+DE+L SLK K  DGLLYG ++ +S P RDKD+KWYIS EEWP
Sbjct: 483  ILPAKYIMKTDDDAFVRIDEVLSSLKEKASDGLLYGRIAFDSSPHRDKDSKWYISNEEWP 542

Query: 636  HESYPPWAHGPGYVISRDIAKFIVRGHQERNLKLFKLEDVAMGIWIEQFKKHVREVQYIN 457
            H SYPPWAHGPGY+ISRDIAKFIVRGHQER LKLFKLEDVAMGIWIE+FK   REV Y+ 
Sbjct: 543  HSSYPPWAHGPGYIISRDIAKFIVRGHQERELKLFKLEDVAMGIWIEEFKNSGREVHYVT 602

Query: 456  DDRFYNAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD 334
            D+RFYNAGCE NYILAHYQ PRMVLCLWEK+QKEH+  CC+
Sbjct: 603  DERFYNAGCESNYILAHYQGPRMVLCLWEKLQKEHQAHCCE 643


>ref|XP_006488779.1| PREDICTED: probable beta-1,3-galactosyltransferase 16 isoform X2
            [Citrus sinensis]
          Length = 652

 Score =  820 bits (2118), Expect = 0.0
 Identities = 402/650 (61%), Positives = 496/650 (76%), Gaps = 16/650 (2%)
 Frame = -2

Query: 2235 QMKKWTXXXXXXXXXXXXLVSYSFI------------QKQPRKQSSYEFF--HPNEE--I 2104
            +M+ W+            ++SYSF+            QKQ  KQS+ +FF  HP+ +  +
Sbjct: 12   KMRNWSGGLLIMALAIILVMSYSFMGTQTQTQHRTQTQKQKHKQSANDFFRNHPSNDSDM 71

Query: 2103 ANLSSVSVVTKVEKLQKRPHLVKVNGLDYLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDS 1924
                 V  V K +KL ++PH++ V GL  L++   M   E+++PLL WG MRLLLSRSD+
Sbjct: 72   KGSQGVKEVKKTQKLFEKPHIINVQGLGDLYSLKNM-LGEDSRPLLVWGHMRLLLSRSDA 130

Query: 1923 LPETAQGIKEAALAWKELRSIISKNRASKFVHNKYERNCSYSVSGVNNSTLFSSNGSIIL 1744
            LPETAQG+KEAA+AWK+L S+I + +ASKF   K   NC   VS ++ S    S+G +I+
Sbjct: 131  LPETAQGVKEAAIAWKDLLSVIEEEKASKFSRRK---NCPPFVSNLSKSL---SSGRLII 184

Query: 1743 EIPCGLIHDSSITIIAIPDGKQDGFEIEFVGSEGKEEPNPPVILHYNVVLPGDNFTKDPV 1564
            E+PCGL+ DSSIT++ IPDG+   F+IE +GS+   E NPP+ILHYNV LPGDN T++P 
Sbjct: 185  EVPCGLVEDSSITLVGIPDGRYGSFQIELIGSQLSGESNPPIILHYNVSLPGDNMTEEPF 244

Query: 1563 IAQNTWTYDFGWGKEERCPNHGSPNNAKVDGLVKCNEQLTRSTMDENSNSRHLSVNKSTN 1384
            I QN+WT + GWGKEERCP HGS N  KVD LV CNEQ+ R +++EN N+ H + +    
Sbjct: 245  IIQNSWTNELGWGKEERCPAHGSSNTLKVDELVLCNEQVLRRSVEENQNTSHPTPSSDIL 304

Query: 1383 ASDVNDHNHVGSNFPFIDGSLFTATIWAGEEGFHGTVNGRHETSFAYREKLEPWLVSGVR 1204
            A+      H  SNFPF+DG+ FT TIW G +GFH TVNGRHETS AYREKLEPW V+GV+
Sbjct: 305  ANASRVGAHETSNFPFVDGNPFTTTIWVGLDGFHMTVNGRHETSLAYREKLEPWSVTGVK 364

Query: 1203 VKGGLHIISALAKGLPVSENLDMAMDLEDLKAPLISNKSKRLVLLIGVFSSGNNFERRMA 1024
            V GG+ + SA A+GLPVSE+ D  +D+E LKAPLIS K  RLV+LIGVFS+GNNFERRMA
Sbjct: 365  VAGGVDLFSAFAEGLPVSEDFDFIVDVEHLKAPLISRK--RLVMLIGVFSTGNNFERRMA 422

Query: 1023 LRRSWMKYEAVRSGVVAVRFFIGLHKNKQVNFELWKEAQAYQDVQLMPFVDYYSLLTLKT 844
            LRRSWM+Y AVRSG +AV FFIGLHKN+QVNFELWKEAQAY D+Q+MPFVDYYSL++LKT
Sbjct: 423  LRRSWMQYPAVRSGDLAVLFFIGLHKNRQVNFELWKEAQAYGDIQIMPFVDYYSLISLKT 482

Query: 843  IAICIMGTKIFPAKYIMKTDDDAFVRVDEILISLKSKNPDGLLYGMVSLESKPQRDKDNK 664
            IAICI GTKI PAKYIMKTDDDAFVR+DE+L +LK K  +GLL+G++S +S PQRDKD+K
Sbjct: 483  IAICIFGTKILPAKYIMKTDDDAFVRIDEVLSNLKEKPSNGLLFGLISYDSSPQRDKDSK 542

Query: 663  WYISPEEWPHESYPPWAHGPGYVISRDIAKFIVRGHQERNLKLFKLEDVAMGIWIEQFKK 484
            WYIS EEWPH SYPPWAHGPGY+ISRDIAKFIV+GHQER+LKLFKLEDVAMGIWIEQFK 
Sbjct: 543  WYISNEEWPHSSYPPWAHGPGYIISRDIAKFIVQGHQERDLKLFKLEDVAMGIWIEQFKN 602

Query: 483  HVREVQYINDDRFYNAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD 334
              +EV Y++DDRFYNAGCE +YILAHYQ PRMVLCLWEK+QK+H+  CC+
Sbjct: 603  TGQEVHYMSDDRFYNAGCESDYILAHYQGPRMVLCLWEKLQKDHRAFCCE 652


>gb|KDO72085.1| hypothetical protein CISIN_1g006036mg [Citrus sinensis]
          Length = 663

 Score =  819 bits (2116), Expect = 0.0
 Identities = 406/661 (61%), Positives = 501/661 (75%), Gaps = 27/661 (4%)
 Frame = -2

Query: 2235 QMKKWTXXXXXXXXXXXXLVSYSFI------------QKQPRKQSSYEFF--HPNEE--I 2104
            +M+ W+            ++SYSF+            QKQ  KQS+ +FF  HP+ +  +
Sbjct: 12   KMRNWSGGLLIMALAIILVMSYSFMGTQTQTQHRTQTQKQKHKQSANDFFRNHPSNDSDM 71

Query: 2103 ANLSSVSVVTKVEKLQKRPHLVKVNGLDYLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDS 1924
                 V  V K +KL ++PH++ V GL  L++   M   E+++PLL WG MRLLLSRSD+
Sbjct: 72   KGSQGVKEVKKTQKLFEKPHIINVQGLGDLYSLKNM-LGEDSRPLLVWGHMRLLLSRSDA 130

Query: 1923 LPETAQGIKEAALAWKELRSIISKNRASKFVHNKYERNCSYSVSGVNNSTLFSSNGSIIL 1744
            LPETAQG+KEAA+AWK+L S+I + +ASKF   K   NC   VS ++ S    S+G +I+
Sbjct: 131  LPETAQGVKEAAIAWKDLLSVIEEEKASKFSRRK---NCPPFVSNLSKSL---SSGRLII 184

Query: 1743 EIPCGLIHDSSITIIAIPDGKQDGFEIEFVGSEGKEEPNPPVILHYNVVLPGDNFTKDPV 1564
            E+PCGL+ DSSIT++ IPDG+   F+IE +GS+   E NPP+ILHYNV LPGDN T++P 
Sbjct: 185  EVPCGLVEDSSITLVGIPDGRYGSFQIELIGSQLSGESNPPIILHYNVSLPGDNMTEEPF 244

Query: 1563 IAQNTWTYDFGWGKEERCPNHGSPNNAKVDGLVKCNEQLTRSTMDENSNSRH------LS 1402
            I QN+WT + GWGKEERCP HGS N  KVD LV CNEQ+ R +++EN N+ H      + 
Sbjct: 245  IIQNSWTNELGWGKEERCPAHGSSNTLKVDELVLCNEQVLRRSVEENQNTSHPTPSSDML 304

Query: 1401 VNKSTNASDVNDHN-----HVGSNFPFIDGSLFTATIWAGEEGFHGTVNGRHETSFAYRE 1237
             N  T +SD+  +      H  SNFPF+DG+ FT TIW G +GFH TVNGRHETS AYRE
Sbjct: 305  ANAPTPSSDMLANASRVGAHETSNFPFVDGNPFTTTIWVGLDGFHMTVNGRHETSLAYRE 364

Query: 1236 KLEPWLVSGVRVKGGLHIISALAKGLPVSENLDMAMDLEDLKAPLISNKSKRLVLLIGVF 1057
            KLEPW V+GV+V GG+ + SA A+GLPVSE+ D  +D+E LKAPLIS K  RLV+LIGVF
Sbjct: 365  KLEPWSVTGVKVAGGVDLFSAFAEGLPVSEDFDFIVDVEHLKAPLISRK--RLVMLIGVF 422

Query: 1056 SSGNNFERRMALRRSWMKYEAVRSGVVAVRFFIGLHKNKQVNFELWKEAQAYQDVQLMPF 877
            S+GNNFERRMALRRSWM+Y AVRSG +AVRFFIGLHKN+QVNFELWKEAQAY D+Q+MPF
Sbjct: 423  STGNNFERRMALRRSWMQYPAVRSGDLAVRFFIGLHKNRQVNFELWKEAQAYGDIQIMPF 482

Query: 876  VDYYSLLTLKTIAICIMGTKIFPAKYIMKTDDDAFVRVDEILISLKSKNPDGLLYGMVSL 697
            VDYYSL++LKTIAICI GTKI PAKYIMKTDDDAFVR+DE+L +LK K  +GLL+G++S 
Sbjct: 483  VDYYSLISLKTIAICIFGTKILPAKYIMKTDDDAFVRIDEVLSNLKEKPSNGLLFGLMSY 542

Query: 696  ESKPQRDKDNKWYISPEEWPHESYPPWAHGPGYVISRDIAKFIVRGHQERNLKLFKLEDV 517
            +S PQRDKD+KWYIS EEWPH SYPPWAHGPGY+ISRDIAKFIV+GHQER+LKLFKLEDV
Sbjct: 543  DSSPQRDKDSKWYISNEEWPHSSYPPWAHGPGYIISRDIAKFIVQGHQERDLKLFKLEDV 602

Query: 516  AMGIWIEQFKKHVREVQYINDDRFYNAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCC 337
            AMGIWIEQFK   +EV Y++DDRFYNAGCE +YILAHYQ PRMVLCLWEK+QK+H+  CC
Sbjct: 603  AMGIWIEQFKNTGQEVHYMSDDRFYNAGCESDYILAHYQGPRMVLCLWEKLQKDHRAFCC 662

Query: 336  D 334
            +
Sbjct: 663  E 663


>ref|XP_019180413.1| PREDICTED: hydroxyproline O-galactosyltransferase GALT3 [Ipomoea nil]
          Length = 639

 Score =  816 bits (2109), Expect = 0.0
 Identities = 413/647 (63%), Positives = 498/647 (76%), Gaps = 14/647 (2%)
 Frame = -2

Query: 2232 MKKWTXXXXXXXXXXXXLVSYSFIQKQPRKQSSYEFF--HPN--------EEIANLSSVS 2083
            MKKW             L+ Y  ++K  RKQS+++FF  HP         ++I++     
Sbjct: 1    MKKWAGILLILGLAVILLIRYGLMEKPLRKQSAFDFFNNHPPSVNVEDYLKDISDGDGAE 60

Query: 2082 VVTKVEK----LQKRPHLVKVNGLDYLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDSLPE 1915
            +  K +      +++P  V  +GL  L++ + + KEE    LL W QMR LLSRSD+LPE
Sbjct: 61   LPPKKDVDYSGFKEKPRFVDFDGLGDLYSLSNVSKEESGA-LLVWSQMRALLSRSDALPE 119

Query: 1914 TAQGIKEAALAWKELRSIISKNRASKFVHNKYERNCSYSVSGVNNSTLFSSNGSIILEIP 1735
            TAQGIKEAA+AWKEL S I K++A   + +K +++C YSVS + N+TL S +GSI LEIP
Sbjct: 120  TAQGIKEAAVAWKELLSTIRKDKALNVLDDKEDKDCPYSVS-LFNATL-SRDGSI-LEIP 176

Query: 1734 CGLIHDSSITIIAIPDGKQDGFEIEFVGSEGKEEPNPPVILHYNVVLPGDNFTKDPVIAQ 1555
            CGLI DSSIT+I IPD +++ F+I  VGS+  E+P  P++LHYNVVLPG N TKDP+I Q
Sbjct: 177  CGLIEDSSITVIGIPDSEKESFQINLVGSQLPEDPKSPIVLHYNVVLPGANLTKDPIITQ 236

Query: 1554 NTWTYDFGWGKEERCPNHGSPNNAKVDGLVKCNEQLTRSTMDENSNSRHLSVNKSTNASD 1375
            NTWT   GWGKEE+CP+HG  +  KVDGL +CN ++ RS  ++ SN+  L   K TNAS 
Sbjct: 237  NTWTNASGWGKEEKCPDHGFSDTLKVDGLARCNTKIIRSNREDTSNASLLESVKLTNAS- 295

Query: 1374 VNDHNHVGSNFPFIDGSLFTATIWAGEEGFHGTVNGRHETSFAYREKLEPWLVSGVRVKG 1195
             N   H  +NFPF++G  FTAT+W G EGFH TVNGRHETSFAYREKLEPWLV+ VRV+G
Sbjct: 296  -NGTAHASANFPFVEGYPFTATLWVGVEGFHMTVNGRHETSFAYREKLEPWLVNEVRVEG 354

Query: 1194 GLHIISALAKGLPVSENLDMAMDLEDLKAPLISNKSKRLVLLIGVFSSGNNFERRMALRR 1015
             L IIS LAKGLPVS++ D+A D+E LKAP IS K KRL LLIGVFS+GNNFERRMALRR
Sbjct: 355  SLGIISTLAKGLPVSQDPDLA-DIEHLKAPPISLK-KRLTLLIGVFSTGNNFERRMALRR 412

Query: 1014 SWMKYEAVRSGVVAVRFFIGLHKNKQVNFELWKEAQAYQDVQLMPFVDYYSLLTLKTIAI 835
            SWM+YEAVRSG VAVRFFIGLHK++QVNFELW+EAQ Y DVQLMPFVDYYSLL+LKT+AI
Sbjct: 413  SWMQYEAVRSGQVAVRFFIGLHKSRQVNFELWREAQIYGDVQLMPFVDYYSLLSLKTVAI 472

Query: 834  CIMGTKIFPAKYIMKTDDDAFVRVDEILISLKSKNPDGLLYGMVSLESKPQRDKDNKWYI 655
            CI+G KI PAKYIMKTDDDAFVR+DE+L SLK K PDGLLYG VS ES P RDK+NKWYI
Sbjct: 473  CILGVKILPAKYIMKTDDDAFVRIDEVLTSLKGKGPDGLLYGRVSFESSPHRDKENKWYI 532

Query: 654  SPEEWPHESYPPWAHGPGYVISRDIAKFIVRGHQERNLKLFKLEDVAMGIWIEQFKKHVR 475
            S EEWPH SYPPWAHGPGY+ISRDIAKFIV+ HQERNL LFKLEDVA+GIWI +FK+   
Sbjct: 533  STEEWPHSSYPPWAHGPGYIISRDIAKFIVQSHQERNLILFKLEDVAVGIWINEFKRKGH 592

Query: 474  EVQYINDDRFYNAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD 334
            +V+YINDDRFYNAGC+ +YILAHYQNPRMVLCLWEK+QKEH+P+CC+
Sbjct: 593  KVRYINDDRFYNAGCDTDYILAHYQNPRMVLCLWEKLQKEHQPNCCE 639


>ref|XP_024046479.1| hydroxyproline O-galactosyltransferase GALT3 isoform X1 [Citrus
            clementina]
          Length = 660

 Score =  814 bits (2102), Expect = 0.0
 Identities = 403/658 (61%), Positives = 496/658 (75%), Gaps = 24/658 (3%)
 Frame = -2

Query: 2235 QMKKWTXXXXXXXXXXXXLVSYSFI------------QKQPRKQSSYEFF--HPNEE--I 2104
            +M+ W+            ++SYSF+            QKQ  KQS+ +FF  HP+ +  +
Sbjct: 12   KMRNWSGGLLIMALAIILVMSYSFMGTQTQTQHRTQTQKQKHKQSANDFFRNHPSNDSDM 71

Query: 2103 ANLSSVSVVTKVEKLQKRPHLVKVNGLDYLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDS 1924
                 V  V K +KL ++PH++ V GL  L++   M   E+++PLL WG MRLLLSRSD+
Sbjct: 72   KGSQGVKEVKKTQKLFEKPHIINVQGLGDLYSLKNM-LGEDSRPLLVWGHMRLLLSRSDA 130

Query: 1923 LPETAQGIKEAALAWKELRSIISKNRASKFVHNKYERNCSYSVSGVNNSTLFSSNGSIIL 1744
            LPETAQG+KEAA+AWK+L S+I + +ASKF   K   NC   VS ++ S    S+G +I+
Sbjct: 131  LPETAQGVKEAAIAWKDLLSVIEEEKASKFSRRK---NCPPFVSNLSKSL---SSGRLII 184

Query: 1743 EIPCGLIHDSSITIIAIPDGKQDGFEIEFVGSEGKEEPNPPVILHYNVVLPGDNFTKDPV 1564
            E+PCGL+ DSSIT++ IPDG+   F+IE +GS+   E NPP+ILHYNV LPGDN T++P 
Sbjct: 185  EVPCGLVEDSSITLVGIPDGRYGSFQIELIGSQLSGESNPPIILHYNVSLPGDNMTEEPF 244

Query: 1563 IAQNTWTYDFGWGKEERCPNHGSPNNAKVDGLVKCNEQLTRSTMDENSNSRHLSVNKSTN 1384
            I QN+WT + GWGKEERCP HGS N  KVD LV CNEQ+ R +++EN N+ H + +    
Sbjct: 245  IIQNSWTNELGWGKEERCPAHGSSNTLKVDELVLCNEQVLRRSVEENQNTSHPTPSSDIL 304

Query: 1383 ASDVNDHNHVGSNFPFIDGSLFTATIWAGEEGFHGTVNGRHETSFAYREKLEPWLVSGVR 1204
            A+      H  SNFPF+DG+ FT TIW G +GFH TVNGRHETS AYREKLEPW V+GV+
Sbjct: 305  ANASRVGAHETSNFPFVDGNPFTTTIWVGLDGFHMTVNGRHETSLAYREKLEPWSVTGVK 364

Query: 1203 VKGGLHIISALAKGLPVSENLDMAMDLEDLKAPLISNKSKRLVLLIGVFSSGNNFERRMA 1024
            V GG+ + SA A+GLPVSE+ D  +D+E LKAPLIS K  RLV+LIGVFS+GNNFERRMA
Sbjct: 365  VAGGVDLFSAFAEGLPVSEDFDFIVDVEHLKAPLISRK--RLVMLIGVFSTGNNFERRMA 422

Query: 1023 LRRSWMKYEAVRSGVVAVRFFIGL--------HKNKQVNFELWKEAQAYQDVQLMPFVDY 868
            LRRSWM+Y AVRSG VAV FFIGL        HKN+QVNFELWKEAQAY D+Q+MPFVDY
Sbjct: 423  LRRSWMQYPAVRSGDVAVLFFIGLLTLLQFLQHKNRQVNFELWKEAQAYGDIQIMPFVDY 482

Query: 867  YSLLTLKTIAICIMGTKIFPAKYIMKTDDDAFVRVDEILISLKSKNPDGLLYGMVSLESK 688
            YSL++LKTIAICI GTKI PAKYIMKTDDDAFVR+DE+L +LK K  +GLL+G++S +S 
Sbjct: 483  YSLISLKTIAICIFGTKILPAKYIMKTDDDAFVRIDEVLSNLKEKPSNGLLFGLISYDSS 542

Query: 687  PQRDKDNKWYISPEEWPHESYPPWAHGPGYVISRDIAKFIVRGHQERNLKLFKLEDVAMG 508
            PQRDKD+KWYIS EEWPH SYPPWAHGPGY+ISRDIAKFIV+GHQER+LKLFKLEDVAMG
Sbjct: 543  PQRDKDSKWYISNEEWPHSSYPPWAHGPGYIISRDIAKFIVQGHQERDLKLFKLEDVAMG 602

Query: 507  IWIEQFKKHVREVQYINDDRFYNAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD 334
            IWIEQFK   +EV Y++DDRFYNAGCE +YILAHYQ PRMVLCLWEK+QK+H+  CC+
Sbjct: 603  IWIEQFKNTGQEVHYMSDDRFYNAGCESDYILAHYQGPRMVLCLWEKLQKDHRAFCCE 660


>ref|XP_006488778.1| PREDICTED: probable beta-1,3-galactosyltransferase 16 isoform X1
            [Citrus sinensis]
          Length = 660

 Score =  813 bits (2099), Expect = 0.0
 Identities = 402/658 (61%), Positives = 496/658 (75%), Gaps = 24/658 (3%)
 Frame = -2

Query: 2235 QMKKWTXXXXXXXXXXXXLVSYSFI------------QKQPRKQSSYEFF--HPNEE--I 2104
            +M+ W+            ++SYSF+            QKQ  KQS+ +FF  HP+ +  +
Sbjct: 12   KMRNWSGGLLIMALAIILVMSYSFMGTQTQTQHRTQTQKQKHKQSANDFFRNHPSNDSDM 71

Query: 2103 ANLSSVSVVTKVEKLQKRPHLVKVNGLDYLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDS 1924
                 V  V K +KL ++PH++ V GL  L++   M   E+++PLL WG MRLLLSRSD+
Sbjct: 72   KGSQGVKEVKKTQKLFEKPHIINVQGLGDLYSLKNM-LGEDSRPLLVWGHMRLLLSRSDA 130

Query: 1923 LPETAQGIKEAALAWKELRSIISKNRASKFVHNKYERNCSYSVSGVNNSTLFSSNGSIIL 1744
            LPETAQG+KEAA+AWK+L S+I + +ASKF   K   NC   VS ++ S    S+G +I+
Sbjct: 131  LPETAQGVKEAAIAWKDLLSVIEEEKASKFSRRK---NCPPFVSNLSKSL---SSGRLII 184

Query: 1743 EIPCGLIHDSSITIIAIPDGKQDGFEIEFVGSEGKEEPNPPVILHYNVVLPGDNFTKDPV 1564
            E+PCGL+ DSSIT++ IPDG+   F+IE +GS+   E NPP+ILHYNV LPGDN T++P 
Sbjct: 185  EVPCGLVEDSSITLVGIPDGRYGSFQIELIGSQLSGESNPPIILHYNVSLPGDNMTEEPF 244

Query: 1563 IAQNTWTYDFGWGKEERCPNHGSPNNAKVDGLVKCNEQLTRSTMDENSNSRHLSVNKSTN 1384
            I QN+WT + GWGKEERCP HGS N  KVD LV CNEQ+ R +++EN N+ H + +    
Sbjct: 245  IIQNSWTNELGWGKEERCPAHGSSNTLKVDELVLCNEQVLRRSVEENQNTSHPTPSSDIL 304

Query: 1383 ASDVNDHNHVGSNFPFIDGSLFTATIWAGEEGFHGTVNGRHETSFAYREKLEPWLVSGVR 1204
            A+      H  SNFPF+DG+ FT TIW G +GFH TVNGRHETS AYREKLEPW V+GV+
Sbjct: 305  ANASRVGAHETSNFPFVDGNPFTTTIWVGLDGFHMTVNGRHETSLAYREKLEPWSVTGVK 364

Query: 1203 VKGGLHIISALAKGLPVSENLDMAMDLEDLKAPLISNKSKRLVLLIGVFSSGNNFERRMA 1024
            V GG+ + SA A+GLPVSE+ D  +D+E LKAPLIS K  RLV+LIGVFS+GNNFERRMA
Sbjct: 365  VAGGVDLFSAFAEGLPVSEDFDFIVDVEHLKAPLISRK--RLVMLIGVFSTGNNFERRMA 422

Query: 1023 LRRSWMKYEAVRSGVVAVRFFIGL--------HKNKQVNFELWKEAQAYQDVQLMPFVDY 868
            LRRSWM+Y AVRSG +AV FFIGL        HKN+QVNFELWKEAQAY D+Q+MPFVDY
Sbjct: 423  LRRSWMQYPAVRSGDLAVLFFIGLLTLLQFLQHKNRQVNFELWKEAQAYGDIQIMPFVDY 482

Query: 867  YSLLTLKTIAICIMGTKIFPAKYIMKTDDDAFVRVDEILISLKSKNPDGLLYGMVSLESK 688
            YSL++LKTIAICI GTKI PAKYIMKTDDDAFVR+DE+L +LK K  +GLL+G++S +S 
Sbjct: 483  YSLISLKTIAICIFGTKILPAKYIMKTDDDAFVRIDEVLSNLKEKPSNGLLFGLISYDSS 542

Query: 687  PQRDKDNKWYISPEEWPHESYPPWAHGPGYVISRDIAKFIVRGHQERNLKLFKLEDVAMG 508
            PQRDKD+KWYIS EEWPH SYPPWAHGPGY+ISRDIAKFIV+GHQER+LKLFKLEDVAMG
Sbjct: 543  PQRDKDSKWYISNEEWPHSSYPPWAHGPGYIISRDIAKFIVQGHQERDLKLFKLEDVAMG 602

Query: 507  IWIEQFKKHVREVQYINDDRFYNAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD 334
            IWIEQFK   +EV Y++DDRFYNAGCE +YILAHYQ PRMVLCLWEK+QK+H+  CC+
Sbjct: 603  IWIEQFKNTGQEVHYMSDDRFYNAGCESDYILAHYQGPRMVLCLWEKLQKDHRAFCCE 660


>ref|XP_018814246.1| PREDICTED: hydroxyproline O-galactosyltransferase GALT3 isoform X1
            [Juglans regia]
 ref|XP_018814247.1| PREDICTED: hydroxyproline O-galactosyltransferase GALT3 isoform X1
            [Juglans regia]
 ref|XP_018814249.1| PREDICTED: hydroxyproline O-galactosyltransferase GALT3 isoform X1
            [Juglans regia]
          Length = 635

 Score =  811 bits (2094), Expect = 0.0
 Identities = 401/643 (62%), Positives = 485/643 (75%), Gaps = 10/643 (1%)
 Frame = -2

Query: 2232 MKKWTXXXXXXXXXXXXLVSYSFIQKQPRKQSSYEFF--HPNEEIANLSSVSV------V 2077
            MKKW             ++ YS +  QP+KQS+Y FF  HP  E     S S+      V
Sbjct: 1    MKKWFGGMFVLALVMILVLRYSLMGIQPKKQSAYSFFKNHPANESQKKDSGSIRSSEMQV 60

Query: 2076 TKVEKLQKRPHLVKVNGLDYLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDSLPETAQGIK 1897
             KV K   +  LV + GL  L++S  +  E+E+K LL W  +R LLSRSD+LP TA+G+K
Sbjct: 61   KKVAKPSIKTPLVNIEGLSDLYSSKNL-SEKESKALLVWAHLRTLLSRSDALPGTAEGVK 119

Query: 1896 EAALAWKELRSIISKNRASKF--VHNKYERNCSYSVSGVNNSTLFSSNGSIILEIPCGLI 1723
            EA++AW +L S I K ++SK+   +   +RNC YSVS ++ + L   NG +ILEIPCGL+
Sbjct: 120  EASIAWNDLSSTIEKEKSSKYSNTNGSKDRNCPYSVSILDQTAL---NGVVILEIPCGLV 176

Query: 1722 HDSSITIIAIPDGKQDGFEIEFVGSEGKEEPNPPVILHYNVVLPGDNFTKDPVIAQNTWT 1543
             DSSIT++ IPDG    F+IE VGS+   EP PP+ILH+NV LPGDN T++P I QNTWT
Sbjct: 177  EDSSITLVGIPDGHHGSFQIELVGSQLSAEPTPPIILHFNVSLPGDNMTEEPFIVQNTWT 236

Query: 1542 YDFGWGKEERCPNHGSPNNAKVDGLVKCNEQLTRSTMDENSNSRHLSVNKSTNASDVNDH 1363
             + GWGKEE+CP   S N  KVDGLV CNEQ+ R+ ++ENSN+ H S +   + S     
Sbjct: 237  SEAGWGKEEKCPARRSANIVKVDGLVLCNEQIVRNAVEENSNASHPSSDMLNSVS--RGV 294

Query: 1362 NHVGSNFPFIDGSLFTATIWAGEEGFHGTVNGRHETSFAYREKLEPWLVSGVRVKGGLHI 1183
             H  ++FPF++G+ FTAT+W G EGFH TV+GRHETSFAYREKLEPW VS V V GGL +
Sbjct: 295  AHGSASFPFVEGNPFTATLWVGIEGFHMTVSGRHETSFAYREKLEPWSVSRVNVAGGLDL 354

Query: 1182 ISALAKGLPVSENLDMAMDLEDLKAPLISNKSKRLVLLIGVFSSGNNFERRMALRRSWMK 1003
            +SA AKGLPVSE+ D+ +D+E LKAP +S K  R V+L+GVFS+GNNFERRMALRRSWM+
Sbjct: 355  LSAFAKGLPVSEDNDLVIDVEHLKAPSVSRK--RCVMLVGVFSTGNNFERRMALRRSWMQ 412

Query: 1002 YEAVRSGVVAVRFFIGLHKNKQVNFELWKEAQAYQDVQLMPFVDYYSLLTLKTIAICIMG 823
            YEAVRSG VAVRFF+GLHKN QVNFELW+EAQAY DVQLMPFVDYYSL+ LKTIAICIMG
Sbjct: 413  YEAVRSGDVAVRFFVGLHKNNQVNFELWREAQAYGDVQLMPFVDYYSLIALKTIAICIMG 472

Query: 822  TKIFPAKYIMKTDDDAFVRVDEILISLKSKNPDGLLYGMVSLESKPQRDKDNKWYISPEE 643
            TK+ PAKYIMKTDDDAFVR+DE+L SLK K  +GLLYG++S ES P RDKD+KWYIS EE
Sbjct: 473  TKVLPAKYIMKTDDDAFVRIDEVLSSLKGKAVNGLLYGLISFESAPHRDKDSKWYISTEE 532

Query: 642  WPHESYPPWAHGPGYVISRDIAKFIVRGHQERNLKLFKLEDVAMGIWIEQFKKHVREVQY 463
            WPH SYPPWAHGPGY+ISRDIAKFIVRGHQER LKLFKLEDVAMGIWIEQ+K   +EV Y
Sbjct: 533  WPHASYPPWAHGPGYIISRDIAKFIVRGHQERGLKLFKLEDVAMGIWIEQYKNSGQEVHY 592

Query: 462  INDDRFYNAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD 334
            INDDRF+NAGCE +Y+LAHYQ PR VLCLWE +QKEH+  CC+
Sbjct: 593  INDDRFFNAGCEQDYVLAHYQGPRKVLCLWEMLQKEHRAICCE 635


>gb|PNT16634.1| hypothetical protein POPTR_010G151600v3 [Populus trichocarpa]
          Length = 646

 Score =  808 bits (2088), Expect = 0.0
 Identities = 395/644 (61%), Positives = 492/644 (76%), Gaps = 10/644 (1%)
 Frame = -2

Query: 2235 QMKKWTXXXXXXXXXXXXLVSYSFIQKQPRKQSSYEFF--------HPNEEIANLSSVSV 2080
            +MKKW+            + SYS +  + +K+ SY+FF        H  +     S    
Sbjct: 12   KMKKWSGGVVIIALAIILVFSYSLMGTRTQKKQSYDFFRNHPAGDSHLKDNHPAKSPQLE 71

Query: 2079 VTKVEKLQKRPHLVKVNGLDYLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDSLPETAQGI 1900
            + K  K  K+PH + V GL  L+    + K+E +  L+ W QMRLLLSRSD+LPET QGI
Sbjct: 72   LKKATKSSKKPHYINVEGLSDLYAQNNISKDE-SNALVVWFQMRLLLSRSDALPETNQGI 130

Query: 1899 KEAALAWKELRSIISKNRASKF--VHNKYERNCSYSVSGVNNSTLFSSNGSIILEIPCGL 1726
            +EA++AWK+L S I +N+A++   ++   ++NC YSVS ++ +T   S+G  IL+IPCGL
Sbjct: 131  REASIAWKDLLSKIKENKAAQLSNINKTEDKNCPYSVSTIDLTT---SSGETILDIPCGL 187

Query: 1725 IHDSSITIIAIPDGKQDGFEIEFVGSEGKEEPNPPVILHYNVVLPGDNFTKDPVIAQNTW 1546
              DSSI+++ IPDG    F+I+ +GS+   E NPP+IL YNV LPGDN T++P + QNTW
Sbjct: 188  AEDSSISVLGIPDGHSRSFQIQLLGSQLPVESNPPIILQYNVSLPGDNMTEEPFVVQNTW 247

Query: 1545 TYDFGWGKEERCPNHGSPNNAKVDGLVKCNEQLTRSTMDENSNSRHLSVNKSTNASDVND 1366
            T ++GWGKEERCP+H S N  KVDGLV CNE++ RSTM+EN N+  +  + S N S    
Sbjct: 248  TKEYGWGKEERCPSHRSVNIPKVDGLVLCNEKVVRSTMEENGNASSVG-DVSANVSQGIA 306

Query: 1365 HNHVGSNFPFIDGSLFTATIWAGEEGFHGTVNGRHETSFAYREKLEPWLVSGVRVKGGLH 1186
            H    +NFPF++G+ FTAT+W G EGFH TVNGRHETSF YREKLEPWLVSGV+V GG+ 
Sbjct: 307  HER--ANFPFVEGNAFTATLWVGLEGFHMTVNGRHETSFVYREKLEPWLVSGVKVTGGVD 364

Query: 1185 IISALAKGLPVSENLDMAMDLEDLKAPLISNKSKRLVLLIGVFSSGNNFERRMALRRSWM 1006
            I+SALA+GLPV E+ D+ +D+E LKAPL++ K  RLV+LIG+FS+GNNFERRMALRRSWM
Sbjct: 365  ILSALARGLPVPEDNDLVVDVEHLKAPLVTRK--RLVMLIGIFSTGNNFERRMALRRSWM 422

Query: 1005 KYEAVRSGVVAVRFFIGLHKNKQVNFELWKEAQAYQDVQLMPFVDYYSLLTLKTIAICIM 826
            +YEA RSG VAVRFFIGLHKN QVN ELWKEA  Y D+QLMPFVDYYSL++LKTIAICIM
Sbjct: 423  QYEAARSGDVAVRFFIGLHKNSQVNLELWKEALVYGDIQLMPFVDYYSLISLKTIAICIM 482

Query: 825  GTKIFPAKYIMKTDDDAFVRVDEILISLKSKNPDGLLYGMVSLESKPQRDKDNKWYISPE 646
            GTKI PAKYIMKTDDDAFVR+D++L SLK K  +GLLYG +SL+S P RD+D+KWYIS E
Sbjct: 483  GTKILPAKYIMKTDDDAFVRIDQVLTSLKEKPSNGLLYGRISLDSSPHRDRDSKWYISNE 542

Query: 645  EWPHESYPPWAHGPGYVISRDIAKFIVRGHQERNLKLFKLEDVAMGIWIEQFKKHVREVQ 466
            EWPH++YPPWAHGPGY+ISRDIAKFIVRGHQER+LKLFKLEDVAMGIWIEQFK   +EV 
Sbjct: 543  EWPHDAYPPWAHGPGYIISRDIAKFIVRGHQERDLKLFKLEDVAMGIWIEQFKNSGQEVH 602

Query: 465  YINDDRFYNAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD 334
            Y+ DDRFYNAGCE +YILAHYQ+PR+VLCLWEK+QKEH+P CC+
Sbjct: 603  YMTDDRFYNAGCETDYILAHYQSPRLVLCLWEKLQKEHQPACCE 646


>ref|XP_021642914.1| hydroxyproline O-galactosyltransferase GALT3 isoform X1 [Hevea
            brasiliensis]
          Length = 650

 Score =  808 bits (2088), Expect = 0.0
 Identities = 400/647 (61%), Positives = 495/647 (76%), Gaps = 13/647 (2%)
 Frame = -2

Query: 2235 QMKKWTXXXXXXXXXXXXLVSYSFIQKQP-RKQSSYEFF--HP--NEEIANLSSVSV--- 2080
            +MKKW+            + SYS ++ QP +KQS+Y+FF  HP  +  + + S +     
Sbjct: 12   RMKKWSGGMVIIALAVILVFSYSLLRTQPQKKQSAYDFFRNHPANDSHVTDTSRIRPPQV 71

Query: 2079 -VTKVEKLQKRPHLVKVNGLDYLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDSLPETAQG 1903
             V K  KL K+ H +   GL+ L+    + KEE +K LL W QMRLLLSRSD+LPETAQG
Sbjct: 72   EVGKATKLSKKLHFINFEGLNDLYAPNNISKEE-SKALLVWSQMRLLLSRSDALPETAQG 130

Query: 1902 IKEAALAWKELRSIISKNRASK--FVHNKYERNCSYSVSGVNNSTLFSSNGSIILEIPCG 1729
            IKEA++AWK+L SII + +A+K   +  + ++NC YS++ ++  T  SSNG++  +IPCG
Sbjct: 131  IKEASIAWKDLSSIIEEEKAAKSHIIDKQEDKNCPYSINAIDIMT--SSNGTVF-DIPCG 187

Query: 1728 LIHDSSITIIAIPDGKQDGFEIEFVGSEGKEEPNPPVILHYNVVLPGDNFTKDPVIAQNT 1549
            L+ DSSITI+ IP+G    F++E  GS+ + E N P+ILHY V LPGDN T +P I QNT
Sbjct: 188  LVEDSSITIVGIPNGHNGSFQVELEGSQLRGEQNHPIILHYRVSLPGDNMTDEPFIVQNT 247

Query: 1548 WTYDFGWGKEERCPNHGSPNNAK--VDGLVKCNEQLTRSTMDENSNSRHLSVNKSTNASD 1375
            W+ + GWGKEE+CP HGS NN K  VDGLV CNEQ+ RST++E  N+ H   +   N S 
Sbjct: 248  WSNEHGWGKEEKCPAHGSANNPKPKVDGLVLCNEQIVRSTVEETLNASHPGRDILANVSQ 307

Query: 1374 VNDHNHVGSNFPFIDGSLFTATIWAGEEGFHGTVNGRHETSFAYREKLEPWLVSGVRVKG 1195
             + +    +NFPF  G+ FTAT W G EGFH TVNGRHETSFAYREKLEPW VSGV+V G
Sbjct: 308  GSAY--ASANFPFSKGNPFTATFWVGSEGFHMTVNGRHETSFAYREKLEPWAVSGVKVDG 365

Query: 1194 GLHIISALAKGLPVSENLDMAMDLEDLKAPLISNKSKRLVLLIGVFSSGNNFERRMALRR 1015
            G  ++S LAKGLPVSE+ D+ +D+E LKAP+   K KRL +L+GVFS+GNNFERRMALRR
Sbjct: 366  GFDMLSVLAKGLPVSEDHDLVVDVELLKAPV--TKKKRLAMLVGVFSTGNNFERRMALRR 423

Query: 1014 SWMKYEAVRSGVVAVRFFIGLHKNKQVNFELWKEAQAYQDVQLMPFVDYYSLLTLKTIAI 835
            SWM+YEAV SG VAVRFFIGLHKN QVNFELWKEAQAY DVQLMPFVDYYSL++LKTI I
Sbjct: 424  SWMQYEAVHSGDVAVRFFIGLHKNSQVNFELWKEAQAYGDVQLMPFVDYYSLISLKTIGI 483

Query: 834  CIMGTKIFPAKYIMKTDDDAFVRVDEILISLKSKNPDGLLYGMVSLESKPQRDKDNKWYI 655
            CIMGTKI PAKYIMKTDDDAFVR+DE+L SLK K  +GLLYG++S +S P R+KD+KWYI
Sbjct: 484  CIMGTKILPAKYIMKTDDDAFVRIDEVLTSLKGKASNGLLYGLMSFDSSPHREKDSKWYI 543

Query: 654  SPEEWPHESYPPWAHGPGYVISRDIAKFIVRGHQERNLKLFKLEDVAMGIWIEQFKKHVR 475
            S EEWPH SYPPWAHGPGY++SR++AKFI +GHQER+LKLFKLEDVAMGIWIEQFKK  +
Sbjct: 544  SNEEWPHSSYPPWAHGPGYIVSRNVAKFIAQGHQERDLKLFKLEDVAMGIWIEQFKKSGQ 603

Query: 474  EVQYINDDRFYNAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD 334
            EV YI+D+RF+N GCE NYILAHYQ+PR+VLCLWEK+QKEH+P+CC+
Sbjct: 604  EVHYISDERFHNTGCESNYILAHYQSPRLVLCLWEKLQKEHQPNCCE 650


>ref|XP_010276521.1| PREDICTED: hydroxyproline O-galactosyltransferase GALT3 [Nelumbo
            nucifera]
 ref|XP_010276522.1| PREDICTED: hydroxyproline O-galactosyltransferase GALT3 [Nelumbo
            nucifera]
          Length = 635

 Score =  808 bits (2086), Expect = 0.0
 Identities = 397/645 (61%), Positives = 489/645 (75%), Gaps = 12/645 (1%)
 Frame = -2

Query: 2232 MKKWTXXXXXXXXXXXXLVSYSFIQKQPRKQSSYEFF--HPNEEIA----NLSSVSVVTK 2071
            MKKWT            ++ YS +Q QP+KQS+Y+FF  HP+         ++  S + +
Sbjct: 1    MKKWTGGTLIISLAMILILRYSLMQDQPKKQSAYDFFWNHPSNNSRLGHNGVTRTSQLPQ 60

Query: 2070 VEKLQKRPHLVKVNGLDYLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDSLPETAQGIKEA 1891
                 ++P+L+ V+GL+ L++   +  EE++K +L W QMR LLSRSDSL ETAQGIKEA
Sbjct: 61   ERNRARKPYLINVDGLNDLYSLKNI-SEEDSKVVLVWAQMRSLLSRSDSLSETAQGIKEA 119

Query: 1890 ALAWKELRSIISKNRASKFVHNKYE------RNCSYSVSGVNNSTLFSSNGSIILEIPCG 1729
            ++AWK+L + I   +AS+  ++  +      +NC +SV G+ NST+  S    ILE PCG
Sbjct: 120  SVAWKDLLAAIEDEKASRIRNSNIQGNDEKDKNCPFSV-GMLNSTM--SIYGTILEFPCG 176

Query: 1728 LIHDSSITIIAIPDGKQDGFEIEFVGSEGKEEPNPPVILHYNVVLPGDNFTKDPVIAQNT 1549
            L   SSIT++ IPDG+   F+IE +GS    E  PP++LHYNV LPGD  T+DPVI QNT
Sbjct: 177  LADSSSITLVGIPDGRHGSFQIELIGSLLPGESKPPIVLHYNVSLPGDKMTEDPVIIQNT 236

Query: 1548 WTYDFGWGKEERCPNHGSPNNAKVDGLVKCNEQLTRSTMDENSNSRHLSVNKSTNASDVN 1369
            WT + GWGK+ERCP  GS +N KVDGL+ CNEQ+  + + EN N    S   +T+     
Sbjct: 237  WTKELGWGKDERCPARGSSSNIKVDGLISCNEQVMGTVLKENLNGSQPSSKTNTSGGST- 295

Query: 1368 DHNHVGSNFPFIDGSLFTATIWAGEEGFHGTVNGRHETSFAYREKLEPWLVSGVRVKGGL 1189
               H+  NFPF++G+ FTAT+W G EGFH TVNGRHETSFAYREKLEPWLVSGV+V GGL
Sbjct: 296  ---HITFNFPFVEGNPFTATLWVGPEGFHMTVNGRHETSFAYREKLEPWLVSGVKVGGGL 352

Query: 1188 HIISALAKGLPVSENLDMAMDLEDLKAPLISNKSKRLVLLIGVFSSGNNFERRMALRRSW 1009
            HI+SALA GLPVSE++D+ +D + LKAP +S K  RL +LIGVFS+GNNFERRMALRRSW
Sbjct: 353  HILSALANGLPVSEDMDLIIDAKQLKAPPVSRK--RLTMLIGVFSTGNNFERRMALRRSW 410

Query: 1008 MKYEAVRSGVVAVRFFIGLHKNKQVNFELWKEAQAYQDVQLMPFVDYYSLLTLKTIAICI 829
            M+Y+AVRSG VAVRFFIGL KNKQVN ELWKE+Q Y D+QLMPFVDYY+L+TLKT+AICI
Sbjct: 411  MQYKAVRSGDVAVRFFIGLQKNKQVNIELWKESQMYGDIQLMPFVDYYNLITLKTVAICI 470

Query: 828  MGTKIFPAKYIMKTDDDAFVRVDEILISLKSKNPDGLLYGMVSLESKPQRDKDNKWYISP 649
            MG KI PAKYIMK DDDAFVR+DE+L SLK K  +GLLYG++S +SKP RD+D+KWYIS 
Sbjct: 471  MGIKILPAKYIMKMDDDAFVRIDEVLSSLKGKVSNGLLYGLISFDSKPHRDRDSKWYIST 530

Query: 648  EEWPHESYPPWAHGPGYVISRDIAKFIVRGHQERNLKLFKLEDVAMGIWIEQFKKHVREV 469
            EEWPH SYPPWAHGPGY+ISRDIAKFIV+GHQER+LKLFKLEDVAMGIWIEQFKK  +EV
Sbjct: 531  EEWPHASYPPWAHGPGYIISRDIAKFIVQGHQERDLKLFKLEDVAMGIWIEQFKKSGKEV 590

Query: 468  QYINDDRFYNAGCEPNYILAHYQNPRMVLCLWEKMQKEHKPDCCD 334
             Y++DDRFYNAGCE NYILAHYQ PRMVLCLWEK+Q EH+P CC+
Sbjct: 591  HYVSDDRFYNAGCESNYILAHYQGPRMVLCLWEKLQLEHEPTCCE 635


>ref|XP_006355721.1| PREDICTED: probable beta-1,3-galactosyltransferase 16 [Solanum
            tuberosum]
          Length = 654

 Score =  806 bits (2083), Expect = 0.0
 Identities = 398/622 (63%), Positives = 491/622 (78%), Gaps = 8/622 (1%)
 Frame = -2

Query: 2175 SYSFIQKQPRKQSSYEFF--HPN------EEIANLSSVSVVTKVEKLQKRPHLVKVNGLD 2020
            S S +++ P+KQS Y FF  HP+      +E A LS +  V  V   +++PHL+ V GL+
Sbjct: 45   SDSSVEESPKKQSVYGFFNDHPDINEGSKDENAKLSDLKPVELVS-FKEKPHLIDVEGLN 103

Query: 2019 YLFNSTYMPKEEEAKPLLAWGQMRLLLSRSDSLPETAQGIKEAALAWKELRSIISKNRAS 1840
             L++       EE+K LLAWG+MRLLLSRSD L  TAQG+KEAA++WK+L S I K   S
Sbjct: 104  DLYSLNNF-STEESKALLAWGKMRLLLSRSDGLNGTAQGVKEAAISWKDLVSFIEK---S 159

Query: 1839 KFVHNKYERNCSYSVSGVNNSTLFSSNGSIILEIPCGLIHDSSITIIAIPDGKQDGFEIE 1660
            K    K +++C YSV+  N +TL   +    L IPCGL+ DSSIT+I IPD KQ+GF+IE
Sbjct: 160  KVQDEKEKQDCPYSVTAFNTATLKDGSS---LRIPCGLVEDSSITVIGIPDAKQEGFQIE 216

Query: 1659 FVGSEGKEEPNPPVILHYNVVLPGDNFTKDPVIAQNTWTYDFGWGKEERCPNHGSPNNAK 1480
             VGS+  EE NPP++L+YNV+LPG+N TKDP+I QNTWT + GWGK E+CP+HGS +  K
Sbjct: 217  LVGSKLPEETNPPIVLNYNVILPGENLTKDPLITQNTWTNESGWGKVEKCPDHGSTDVIK 276

Query: 1479 VDGLVKCNEQLTRSTMDENSNSRHLSVNKSTNASDVNDHNHVGSNFPFIDGSLFTATIWA 1300
            VDGLVKCN ++ R+ ++E +N  + S  KS++ S  N   +  +N+PF++G+ FTAT+WA
Sbjct: 277  VDGLVKCNAKIFRNNVEETANMTNTSNPKSSDVS--NSSAYGTANYPFLEGNPFTATLWA 334

Query: 1299 GEEGFHGTVNGRHETSFAYREKLEPWLVSGVRVKGGLHIISALAKGLPVSENLDMAMDLE 1120
            G EGFH TVNGRHETSFAYREKLEPWLVSGV V GG+  IS LAKGLPVS + ++  D+E
Sbjct: 335  GIEGFHMTVNGRHETSFAYREKLEPWLVSGVNVIGGVDTISILAKGLPVSNDFNLGDDVE 394

Query: 1119 DLKAPLISNKSKRLVLLIGVFSSGNNFERRMALRRSWMKYEAVRSGVVAVRFFIGLHKNK 940
             LKAPL   + KRLV+LIG+FS+GNNFERRMALR+SWM+YEAVRSG VAVRFFIGL KN+
Sbjct: 395  HLKAPL--TRKKRLVMLIGIFSTGNNFERRMALRKSWMQYEAVRSGEVAVRFFIGLDKNR 452

Query: 939  QVNFELWKEAQAYQDVQLMPFVDYYSLLTLKTIAICIMGTKIFPAKYIMKTDDDAFVRVD 760
            QVNFELWKEAQAY D+QL+PFVDYYSLLTLKTIAICIMG KI PAKY+MKTDDDAFVR+D
Sbjct: 453  QVNFELWKEAQAYGDIQLLPFVDYYSLLTLKTIAICIMGVKILPAKYVMKTDDDAFVRID 512

Query: 759  EILISLKSKNPDGLLYGMVSLESKPQRDKDNKWYISPEEWPHESYPPWAHGPGYVISRDI 580
            E+L SLK K+P+GLLYG +S ES P RDK+NKWYISPEE+P  SYPPWAHGPGY+ISRDI
Sbjct: 513  EVLSSLKGKDPNGLLYGGISFESAPHRDKENKWYISPEEYPPASYPPWAHGPGYIISRDI 572

Query: 579  AKFIVRGHQERNLKLFKLEDVAMGIWIEQFKKHVREVQYINDDRFYNAGCEPNYILAHYQ 400
            AKFIV+GHQE  L LFKLEDVA+GIWIE+F++   ++QY+ND+RFYNAGCE  YILAHYQ
Sbjct: 573  AKFIVQGHQEMELMLFKLEDVAVGIWIEEFRRKGHKIQYVNDERFYNAGCESGYILAHYQ 632

Query: 399  NPRMVLCLWEKMQKEHKPDCCD 334
            N RMVLCLWEK+QKEH+P+CC+
Sbjct: 633  NSRMVLCLWEKLQKEHEPNCCE 654

