BLASTX nr result

ID: Papaver32_contig00032245 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver32_contig00032245
         (2496 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_010276521.1 PREDICTED: hydroxyproline O-galactosyltransferase...   846   0.0  
XP_007224303.1 hypothetical protein PRUPE_ppa019770mg [Prunus pe...   836   0.0  
XP_008223439.1 PREDICTED: hydroxyproline O-galactosyltransferase...   835   0.0  
XP_012482246.1 PREDICTED: probable beta-1,3-galactosyltransferas...   830   0.0  
XP_011011142.1 PREDICTED: probable beta-1,3-galactosyltransferas...   827   0.0  
KJB28788.1 hypothetical protein B456_005G069600 [Gossypium raimo...   827   0.0  
XP_018814246.1 PREDICTED: hydroxyproline O-galactosyltransferase...   820   0.0  
XP_017631926.1 PREDICTED: hydroxyproline O-galactosyltransferase...   820   0.0  
XP_016742176.1 PREDICTED: hydroxyproline O-galactosyltransferase...   816   0.0  
XP_015867148.1 PREDICTED: probable beta-1,3-galactosyltransferas...   816   0.0  
XP_002274418.1 PREDICTED: hydroxyproline O-galactosyltransferase...   815   0.0  
OAY56191.1 hypothetical protein MANES_03G209300 [Manihot esculenta]   811   0.0  
XP_007035910.2 PREDICTED: hydroxyproline O-galactosyltransferase...   807   0.0  
XP_015900118.1 PREDICTED: probable beta-1,3-galactosyltransferas...   808   0.0  
EOY06836.1 Beta-1,3-galactosyltransferase 16 isoform 1 [Theobrom...   806   0.0  
OMO58551.1 hypothetical protein COLO4_34527 [Corchorus olitorius]     805   0.0  
XP_009372069.1 PREDICTED: hydroxyproline O-galactosyltransferase...   806   0.0  
OMO53423.1 hypothetical protein CCACVL1_28661 [Corchorus capsula...   801   0.0  
KJB28790.1 hypothetical protein B456_005G069600 [Gossypium raimo...   800   0.0  
XP_008360226.1 PREDICTED: hydroxyproline O-galactosyltransferase...   796   0.0  

>XP_010276521.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT3 [Nelumbo
            nucifera] XP_010276522.1 PREDICTED: hydroxyproline
            O-galactosyltransferase GALT3 [Nelumbo nucifera]
          Length = 635

 Score =  846 bits (2185), Expect = 0.0
 Identities = 426/640 (66%), Positives = 508/640 (79%), Gaps = 10/640 (1%)
 Frame = -3

Query: 2167 MRKWXXXXXXXXXXXXXXIRYSLVTKDSVNKHSAAYDFFNNHPTNAS------ITIKSSR 2006
            M+KW              +RYSL+ +D   K SA YDFF NHP+N S      +T  S  
Sbjct: 1    MKKWTGGTLIISLAMILILRYSLM-QDQPKKQSA-YDFFWNHPSNNSRLGHNGVTRTSQL 58

Query: 2005 PRLIPNDAKRADVINVYGLSELFSPTDISKEEAKVLDVWIHMRTILSRSDYLPETTQGII 1826
            P+   N A++  +INV GL++L+S  +IS+E++KV+ VW  MR++LSRSD L ET QGI 
Sbjct: 59   PQE-RNRARKPYLINVDGLNDLYSLKNISEEDSKVVLVWAQMRSLLSRSDSLSETAQGIK 117

Query: 1825 EASIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDSVLRYGSSLQFPCG 1646
            EAS+AWK+LL+ + ++KASR+  NS    + E +KNC FSV M N ++  YG+ L+FPCG
Sbjct: 118  EASVAWKDLLAAIEDEKASRIR-NSNIQGNDEKDKNCPFSVGMLNSTMSIYGTILEFPCG 176

Query: 1645 LVEDSSITVVGFPNGLNRSFQIELIGSKLKDEIQPPVVLHYEVSLPRDELKEDPVITQNA 1466
            L + SSIT+VG P+G + SFQIELIGS L  E +PP+VLHY VSLP D++ EDPVI QN 
Sbjct: 177  LADSSSITLVGIPDGRHGSFQIELIGSLLPGESKPPIVLHYNVSLPGDKMTEDPVIIQNT 236

Query: 1465 WTAGSGWGKDEKCPAHHSISNLEVDGLVLCNEQV---VIEDNINRSQPNGEKLTNVSHGS 1295
            WT   GWGKDE+CPA  S SN++VDGL+ CNEQV   V+++N+N SQP+ +  TN S GS
Sbjct: 237  WTKELGWGKDERCPARGSSSNIKVDGLISCNEQVMGTVLKENLNGSQPSSK--TNTSGGS 294

Query: 1294 SHTSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSFAYREKLEPWLVSDIKVTGGLDL 1115
            +H ++NFPFVEGNPFTATLWVG EGFHMTVNGRHETSFAYREKLEPWLVS +KV GGL +
Sbjct: 295  THITFNFPFVEGNPFTATLWVGPEGFHMTVNGRHETSFAYREKLEPWLVSGVKVGGGLHI 354

Query: 1114 LSALANGLPVSEDLDLI-DIERLKAPLLPTKRIIMLIGVFSIGNNFERRMALRRSWMQYE 938
            LSALANGLPVSED+DLI D ++LKAP +  KR+ MLIGVFS GNNFERRMALRRSWMQY+
Sbjct: 355  LSALANGLPVSEDMDLIIDAKQLKAPPVSRKRLTMLIGVFSTGNNFERRMALRRSWMQYK 414

Query: 937  VVLSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLMPFVDYYDLITLKTIAICLMGTK 758
             V SG VAVRFF+GL KN  VN ELW+E+Q YGDIQLMPFVDYY+LITLKT+AIC+MG K
Sbjct: 415  AVRSGDVAVRFFIGLQKNKQVNIELWKESQMYGDIQLMPFVDYYNLITLKTVAICIMGIK 474

Query: 757  ILPAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLISFDSEPIRDRDSKWYISDEEWP 578
            ILPAKY+MK DDDAFVRIDE+L+ LK K SNGLLYGLISFDS+P RDRDSKWYIS EEWP
Sbjct: 475  ILPAKYIMKMDDDAFVRIDEVLSSLKGKVSNGLLYGLISFDSKPHRDRDSKWYISTEEWP 534

Query: 577  HSLYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLEDVAMGIWIDQFKKNGQEVHYMT 398
            H+ YPPWAHGPGY++SRDIAKFIVQGHQER LKLFKLEDVAMGIWI+QFKK+G+EVHY++
Sbjct: 535  HASYPPWAHGPGYIISRDIAKFIVQGHQERDLKLFKLEDVAMGIWIEQFKKSGKEVHYVS 594

Query: 397  DDRFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAVCC 278
            DDRF  +GCE +YILAHYQ P MVLCLWEKLQ EHE  CC
Sbjct: 595  DDRFYNAGCESNYILAHYQGPRMVLCLWEKLQLEHEPTCC 634


>XP_007224303.1 hypothetical protein PRUPE_ppa019770mg [Prunus persica] ONI27911.1
            hypothetical protein PRUPE_1G110700 [Prunus persica]
            ONI27912.1 hypothetical protein PRUPE_1G110700 [Prunus
            persica] ONI27913.1 hypothetical protein PRUPE_1G110700
            [Prunus persica]
          Length = 634

 Score =  836 bits (2159), Expect = 0.0
 Identities = 411/638 (64%), Positives = 496/638 (77%), Gaps = 8/638 (1%)
 Frame = -3

Query: 2167 MRKWXXXXXXXXXXXXXXIRYSLVTKDSVNKHS---AAYDFFNNHPTNAS-ITIKSSRPR 2000
            M+KW               RY  + K    K S   +A DFF NHPTN S IT    + +
Sbjct: 1    MKKWSGGLFIIALAMILVFRYCSIVKIEPPKQSRKQSASDFFGNHPTNDSFITSSEIKVK 60

Query: 1999 LIPNDAKRADVINVYGLSELFSPTDISKEEAKVLDVWIHMRTILSRSDYLPETTQGIIEA 1820
                  K+   I V G SELF+  DI KE ++ L VW HMR +LSRSD LPET QG+ EA
Sbjct: 61   KEAESYKKPHFIEVDGPSELFASHDIFKEGSRALLVWPHMRPLLSRSDSLPETAQGVKEA 120

Query: 1819 SIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDSVLRYGSSLQFPCGLV 1640
            S+AWK+LLS + + KAS+L     S S+ +++KNC FSVS  +  V R G  L+ PCGLV
Sbjct: 121  SLAWKDLLSAIEKDKASKL-----SKSNSQEDKNCPFSVSTLDKIVSRDGVILEIPCGLV 175

Query: 1639 EDSSITVVGFPNGLNRSFQIELIGSKLKDEIQPPVVLHYEVSLPRDELKEDPVITQNAWT 1460
            +DSSI++VG P+G +RSFQI+L+GS+L  E +PP++LHY VSLP D + E+P + QN WT
Sbjct: 176  DDSSISLVGIPDGHSRSFQIQLLGSQLAGEPEPPIILHYNVSLPGDNMTEEPFVVQNTWT 235

Query: 1459 AGSGWGKDEKCPAHHSISNLEVDGLVLCNEQVV---IEDNINRSQPNGEKLTNVSHGSSH 1289
               GWGK+E+CP+H S +NL+VDGLVLCNEQ V   +E+N+N SQP+ + LTNVS G ++
Sbjct: 236  HELGWGKEERCPSHRSANNLKVDGLVLCNEQAVRSSLEENLNMSQPSSDMLTNVSRGGAY 295

Query: 1288 TSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSFAYREKLEPWLVSDIKVTGGLDLLS 1109
             S NFPFVEGNPFTATLWVG+EGFHMTVNGRHETSFAYREKLEPW V+ +KV GGLDLLS
Sbjct: 296  GSANFPFVEGNPFTATLWVGLEGFHMTVNGRHETSFAYREKLEPWSVTKVKVAGGLDLLS 355

Query: 1108 ALANGLPVSEDLDLI-DIERLKAPLLPTKRIIMLIGVFSIGNNFERRMALRRSWMQYEVV 932
            ALA GLPVSED DL+ D+E LKAP    KR++ML+GVFS GNNFERRMALRR+WMQYE V
Sbjct: 356  ALAKGLPVSEDHDLVVDVEHLKAPATLKKRLLMLVGVFSTGNNFERRMALRRAWMQYEAV 415

Query: 931  LSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLMPFVDYYDLITLKTIAICLMGTKIL 752
             SG VAVRFF+GLHKNS VN ELWREA+ YGDIQLMPFVDYY LI+LKTIAIC+ GTKIL
Sbjct: 416  RSGDVAVRFFIGLHKNSQVNIELWREAEAYGDIQLMPFVDYYSLISLKTIAICIFGTKIL 475

Query: 751  PAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLISFDSEPIRDRDSKWYISDEEWPHS 572
            PAKY+MKTDDDAFVRIDE+++ LK K +NGLLYGLI+F+S P R++ SKWYI ++EWPH+
Sbjct: 476  PAKYIMKTDDDAFVRIDEVISSLKGKATNGLLYGLIAFESAPDREKGSKWYIDNKEWPHA 535

Query: 571  LYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLEDVAMGIWIDQFKKNGQEVHYMTDD 392
            LYPPWAHGPGY++SRDIAKFIV+GHQE  LKLFKLEDVAMGIWI+QFK +G EV+Y+TDD
Sbjct: 536  LYPPWAHGPGYIISRDIAKFIVRGHQESDLKLFKLEDVAMGIWIEQFKNSGHEVNYVTDD 595

Query: 391  RFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAVCC 278
            RF  +GCE +YILAHYQ+P +VLCLWEKLQK+HE VCC
Sbjct: 596  RFYSAGCESNYILAHYQSPRLVLCLWEKLQKKHEPVCC 633


>XP_008223439.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT3 [Prunus mume]
            XP_008223440.1 PREDICTED: hydroxyproline
            O-galactosyltransferase GALT3 [Prunus mume]
          Length = 634

 Score =  835 bits (2158), Expect = 0.0
 Identities = 412/638 (64%), Positives = 495/638 (77%), Gaps = 8/638 (1%)
 Frame = -3

Query: 2167 MRKWXXXXXXXXXXXXXXIRYSLVTKDSVNKHS---AAYDFFNNHPTNAS-ITIKSSRPR 2000
            M+KW               RY  + K    K S   +A DFF NHPTN S IT    + +
Sbjct: 1    MKKWSGGLFIIALAMILVFRYCSIVKIEPPKQSRKQSASDFFGNHPTNDSFITSSEIKVK 60

Query: 1999 LIPNDAKRADVINVYGLSELFSPTDISKEEAKVLDVWIHMRTILSRSDYLPETTQGIIEA 1820
                  K+   I V G +ELFS  DI KE ++ L VW HMR +LSRSD LPET QG+ EA
Sbjct: 61   KEAESYKKPHFIEVDGPNELFSSHDIFKEGSRALLVWPHMRPLLSRSDALPETAQGVKEA 120

Query: 1819 SIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDSVLRYGSSLQFPCGLV 1640
            S+AWK+LLS + + KAS+L     S S  +++KNC FSVS  +  V R G  L+ PCGLV
Sbjct: 121  SMAWKDLLSAIDKDKASKL-----SKSDRQEDKNCPFSVSTLDKIVSRDGVILEIPCGLV 175

Query: 1639 EDSSITVVGFPNGLNRSFQIELIGSKLKDEIQPPVVLHYEVSLPRDELKEDPVITQNAWT 1460
            +DSSI++VG P+G +RSFQI+L+GS+L  E +PP++LHY VSLP D + E+P + QN WT
Sbjct: 176  DDSSISLVGIPDGHSRSFQIQLLGSQLAGEPEPPIILHYNVSLPGDNMTEEPFVVQNIWT 235

Query: 1459 AGSGWGKDEKCPAHHSISNLEVDGLVLCNEQVV---IEDNINRSQPNGEKLTNVSHGSSH 1289
               GWGK+E+CP+H S +NL+VDGLVLCNEQ V   +E+N+N SQP+ E LTNVS G ++
Sbjct: 236  HELGWGKEERCPSHGSANNLKVDGLVLCNEQAVRSSLEENLNMSQPSSEMLTNVSRGGAY 295

Query: 1288 TSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSFAYREKLEPWLVSDIKVTGGLDLLS 1109
             S NFPFVEGNPFTATLWVG+EGFHMTVNGRHETSFAYREKLEPW V+ +KV GGLDLLS
Sbjct: 296  GSANFPFVEGNPFTATLWVGLEGFHMTVNGRHETSFAYREKLEPWSVTKVKVAGGLDLLS 355

Query: 1108 ALANGLPVSEDLDLI-DIERLKAPLLPTKRIIMLIGVFSIGNNFERRMALRRSWMQYEVV 932
            ALA GLPVSED DL+ D+E LKAP    KR++ML+GVFS GNNFERRMALRR+WMQYE V
Sbjct: 356  ALAKGLPVSEDHDLVVDVEHLKAPATLKKRLLMLVGVFSTGNNFERRMALRRAWMQYEAV 415

Query: 931  LSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLMPFVDYYDLITLKTIAICLMGTKIL 752
             SG VAVRFF+GLHKNS VN ELWREA+ YGDIQLMPFVDYY LI+LKTIAIC+ GTKIL
Sbjct: 416  RSGDVAVRFFIGLHKNSQVNIELWREAEAYGDIQLMPFVDYYSLISLKTIAICIFGTKIL 475

Query: 751  PAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLISFDSEPIRDRDSKWYISDEEWPHS 572
            PAKY+MKTDDDAFVRIDE+++ LK + +NGLLYGLI+F+S P R++ SKWYI ++EWPH+
Sbjct: 476  PAKYIMKTDDDAFVRIDEVISSLKGRATNGLLYGLIAFESAPDREKGSKWYIDNKEWPHA 535

Query: 571  LYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLEDVAMGIWIDQFKKNGQEVHYMTDD 392
            LYPPWAHGPGY++SRDIAKFIV+GHQE  LKLFKLEDVAMGIWI+QFK +G EV+Y+TDD
Sbjct: 536  LYPPWAHGPGYIISRDIAKFIVRGHQESNLKLFKLEDVAMGIWIEQFKNSGHEVNYVTDD 595

Query: 391  RFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAVCC 278
            RF  +GCE +YILAHYQ+P +VLCLWEKLQKEHE VCC
Sbjct: 596  RFYSAGCESNYILAHYQSPRLVLCLWEKLQKEHEPVCC 633


>XP_012482246.1 PREDICTED: probable beta-1,3-galactosyltransferase 16 [Gossypium
            raimondii] KJB28791.1 hypothetical protein
            B456_005G069600 [Gossypium raimondii]
          Length = 650

 Score =  830 bits (2145), Expect = 0.0
 Identities = 412/646 (63%), Positives = 495/646 (76%), Gaps = 16/646 (2%)
 Frame = -3

Query: 2167 MRKWXXXXXXXXXXXXXXIRYSL--VTKDSVNKHSAAYDFFNNHPT-------NASIT-- 2021
            M+KW                YSL    +    K  +AYDFFNNHP        N S    
Sbjct: 13   MKKWYGGVLILVLAIVMVFSYSLRETQRPQPKKKQSAYDFFNNHPPIDSHRKGNDSFKLP 72

Query: 2020 -IKSSRPRLIPNDAKRADVINVYGLSELFSPTDISKEEAKVLDVWIHMRTILSRSDYLPE 1844
             +++ +P LI    ++  +INV GL EL++P ++S++E+ VL +W H+  +LSRSD LPE
Sbjct: 73   KVEAKKPSLI----QKPKLINVEGLDELYAPRNVSEQESNVLLLWPHLHLLLSRSDALPE 128

Query: 1843 TTQGIIEASIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDSVLRYGSS 1664
            T QGI EA+IAWKELL+ + E+K ++L  N          KNC FSVS  ++++   G+ 
Sbjct: 129  TGQGIKEAAIAWKELLALIEEEKTTKLSNNIRLKE-----KNCPFSVSSPDNALFSGGNI 183

Query: 1663 LQFPCGLVEDSSITVVGFPNGLNRSFQIELIGSKLKDEIQPPVVLHYEVSLPRDELKEDP 1484
            L+ PCGLVEDSSIT++G PNG  RSF+I+L+GS   +E +PP+VLHY VS+  D + E+P
Sbjct: 184  LELPCGLVEDSSITLIGTPNGSYRSFEIDLVGSNFSEEPKPPIVLHYNVSVAGDNMTEEP 243

Query: 1483 VITQNAWTAGSGWGKDEKCPAHHSISNLEVDGLVLCNEQVV---IEDNINRSQPNGEKLT 1313
             I QN WT   GWGK+EKCP+H S +NL+VDGL LCNEQ+V   +E+N N S  +G+  T
Sbjct: 244  FIAQNTWTNELGWGKEEKCPSHVSSNNLKVDGLGLCNEQLVRSTMEENQNVSVSSGDAST 303

Query: 1312 NVSHGSSHTSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSFAYREKLEPWLVSDIKV 1133
            N S  SSH S NFPFVEGNPFTATLWVG+EGFHMTVNGRHETSFAYREKLEPW VS +KV
Sbjct: 304  NASQESSHASANFPFVEGNPFTATLWVGLEGFHMTVNGRHETSFAYREKLEPWSVSGVKV 363

Query: 1132 TGGLDLLSALANGLPVSEDLDLIDIER-LKAPLLPTKRIIMLIGVFSIGNNFERRMALRR 956
             GGLDLLSA A GLPV ED DLID  + LKAP++  KR++ML+GVFS GNNFERRMALRR
Sbjct: 364  VGGLDLLSAFAKGLPVPEDHDLIDNSKILKAPVITRKRLVMLVGVFSTGNNFERRMALRR 423

Query: 955  SWMQYEVVLSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLMPFVDYYDLITLKTIAI 776
            SWMQ+E V SG VAVRFF+GL+KN  VN ELW+EAQ YGDIQ MPFVDYY LI+LKTIAI
Sbjct: 424  SWMQFEAVRSGDVAVRFFIGLNKNLQVNFELWKEAQAYGDIQFMPFVDYYSLISLKTIAI 483

Query: 775  CLMGTKILPAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLISFDSEPIRDRDSKWYI 596
            C+MGTKILPAKY+MKTDDDAFVRIDE+L+ LKEKPSNGLLYGLI FDS P R++DSKWYI
Sbjct: 484  CIMGTKILPAKYIMKTDDDAFVRIDEVLSSLKEKPSNGLLYGLIEFDSSPHREKDSKWYI 543

Query: 595  SDEEWPHSLYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLEDVAMGIWIDQFKKNGQ 416
            SDEEWPHS YPPWAHGPGY+LSRD+AKFIVQGH+ER LKLFKLEDVAMGIWI++FK++G+
Sbjct: 544  SDEEWPHSSYPPWAHGPGYILSRDVAKFIVQGHKERELKLFKLEDVAMGIWIEEFKRSGR 603

Query: 415  EVHYMTDDRFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAVCC 278
            EVHY+TDDRF  +GCE +YILAHYQ P MVLCLWEKLQKEH+A CC
Sbjct: 604  EVHYITDDRFYNAGCESNYILAHYQGPRMVLCLWEKLQKEHQAYCC 649


>XP_011011142.1 PREDICTED: probable beta-1,3-galactosyltransferase 16 [Populus
            euphratica] XP_011011148.1 PREDICTED: probable
            beta-1,3-galactosyltransferase 16 [Populus euphratica]
            XP_011011154.1 PREDICTED: probable
            beta-1,3-galactosyltransferase 16 [Populus euphratica]
            XP_011011160.1 PREDICTED: probable
            beta-1,3-galactosyltransferase 16 [Populus euphratica]
            XP_011011170.1 PREDICTED: probable
            beta-1,3-galactosyltransferase 16 [Populus euphratica]
            XP_011011177.1 PREDICTED: probable
            beta-1,3-galactosyltransferase 16 [Populus euphratica]
          Length = 646

 Score =  827 bits (2136), Expect = 0.0
 Identities = 411/641 (64%), Positives = 491/641 (76%), Gaps = 11/641 (1%)
 Frame = -3

Query: 2167 MRKWXXXXXXXXXXXXXXIRYSLVTKDSVNKHSAAYDFFNNHPTNASITIKSSRPRLIPN 1988
            M+KW                YSL+   +  K S+ YDFF NHP + S  ++ + P   P 
Sbjct: 13   MKKWSGGVVIIALAIILVFSYSLMGTRTQKKQSS-YDFFRNHPADDS-HLEDNHPAKSPQ 70

Query: 1987 --------DAKRADVINVYGLSELFSPTDISKEEAKVLDVWIHMRTILSRSDYLPETTQG 1832
                     +K+   INV GLS+L++  +IS++E+  L VW  MR +LSRSD LPET+QG
Sbjct: 71   LELKKATKSSKKPHYINVEGLSDLYAQNNISRDESNALVVWFQMRLLLSRSDALPETSQG 130

Query: 1831 IIEASIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDSVLRYGSSLQFP 1652
            I EASIAWK+LLS + E KA++L     S+ +  ++KNC +SVS  + +     + L  P
Sbjct: 131  IREASIAWKDLLSKIKENKAAQL-----SNINKTEDKNCPYSVSTIDLTTSSGETILDIP 185

Query: 1651 CGLVEDSSITVVGFPNGLNRSFQIELIGSKLKDEIQPPVVLHYEVSLPRDELKEDPVITQ 1472
            CGL EDSSI+V+G P+G +RSFQIEL+GS+L  E +PP+VL Y VSLP D + E+P + Q
Sbjct: 186  CGLAEDSSISVLGIPDGHSRSFQIELLGSQLPVESKPPIVLQYNVSLPGDNMTEEPFVVQ 245

Query: 1471 NAWTAGSGWGKDEKCPAHHSISNLEVDGLVLCNEQVV---IEDNINRSQPNGEKLTNVSH 1301
            N WT   GWGK+E+CP+H S++  +VDGLVLCNE+VV   +E+N N S   G+   NVS 
Sbjct: 246  NTWTKEHGWGKEERCPSHRSVNIPKVDGLVLCNEKVVRSTMEENGNASFV-GDVSANVSQ 304

Query: 1300 GSSHTSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSFAYREKLEPWLVSDIKVTGGL 1121
            G +H   NFPFVEGN FTATLWVG+EGFHMTVNGRHETSF YREKLEPWLVS +KVTGG+
Sbjct: 305  GIAHERANFPFVEGNAFTATLWVGLEGFHMTVNGRHETSFVYREKLEPWLVSGVKVTGGV 364

Query: 1120 DLLSALANGLPVSEDLDLIDIERLKAPLLPTKRIIMLIGVFSIGNNFERRMALRRSWMQY 941
            D+LSALA GLPVSED DL+D+E LKAPL+  KR++MLIG+FS GNNFERRMALRRSWMQY
Sbjct: 365  DILSALARGLPVSEDNDLVDVEHLKAPLVTRKRLVMLIGIFSTGNNFERRMALRRSWMQY 424

Query: 940  EVVLSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLMPFVDYYDLITLKTIAICLMGT 761
            E   SG VAVRFF+GLHKNS VN ELW+EA  YGDIQLMPFVDYY LI+LKTIAIC+MGT
Sbjct: 425  EAARSGDVAVRFFIGLHKNSQVNLELWKEALVYGDIQLMPFVDYYSLISLKTIAICIMGT 484

Query: 760  KILPAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLISFDSEPIRDRDSKWYISDEEW 581
            KILPAKY+MKTDDDAFVRID++L  LKEKPSNGLLYG ISFDS P RDRDSKWYIS+EEW
Sbjct: 485  KILPAKYIMKTDDDAFVRIDQVLTSLKEKPSNGLLYGRISFDSSPHRDRDSKWYISNEEW 544

Query: 580  PHSLYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLEDVAMGIWIDQFKKNGQEVHYM 401
            PH  YPPWAHGPGY++SRDIAKFIV+GHQER LKLFKLEDVAMGIWI+QFK +GQEVHYM
Sbjct: 545  PHDAYPPWAHGPGYIISRDIAKFIVRGHQERDLKLFKLEDVAMGIWIEQFKNSGQEVHYM 604

Query: 400  TDDRFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAVCC 278
            TDDRF  +GCE  YILAHYQ+P +VLCLWEKLQKEH+  CC
Sbjct: 605  TDDRFYNAGCETDYILAHYQSPRLVLCLWEKLQKEHQPACC 645


>KJB28788.1 hypothetical protein B456_005G069600 [Gossypium raimondii]
          Length = 649

 Score =  827 bits (2136), Expect = 0.0
 Identities = 411/645 (63%), Positives = 494/645 (76%), Gaps = 16/645 (2%)
 Frame = -3

Query: 2167 MRKWXXXXXXXXXXXXXXIRYSL--VTKDSVNKHSAAYDFFNNHPT-------NASIT-- 2021
            M+KW                YSL    +    K  +AYDFFNNHP        N S    
Sbjct: 13   MKKWYGGVLILVLAIVMVFSYSLRETQRPQPKKKQSAYDFFNNHPPIDSHRKGNDSFKLP 72

Query: 2020 -IKSSRPRLIPNDAKRADVINVYGLSELFSPTDISKEEAKVLDVWIHMRTILSRSDYLPE 1844
             +++ +P LI    ++  +INV GL EL++P ++S++E+ VL +W H+  +LSRSD LPE
Sbjct: 73   KVEAKKPSLI----QKPKLINVEGLDELYAPRNVSEQESNVLLLWPHLHLLLSRSDALPE 128

Query: 1843 TTQGIIEASIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDSVLRYGSS 1664
            T QGI EA+IAWKELL+ + E+K ++L  N          KNC FSVS  ++++   G+ 
Sbjct: 129  TGQGIKEAAIAWKELLALIEEEKTTKLSNNIRLKE-----KNCPFSVSSPDNALFSGGNI 183

Query: 1663 LQFPCGLVEDSSITVVGFPNGLNRSFQIELIGSKLKDEIQPPVVLHYEVSLPRDELKEDP 1484
            L+ PCGLVEDSSIT++G PNG  RSF+I+L+GS   +E +PP+VLHY VS+  D + E+P
Sbjct: 184  LELPCGLVEDSSITLIGTPNGSYRSFEIDLVGSNFSEEPKPPIVLHYNVSVAGDNMTEEP 243

Query: 1483 VITQNAWTAGSGWGKDEKCPAHHSISNLEVDGLVLCNEQVV---IEDNINRSQPNGEKLT 1313
             I QN WT   GWGK+EKCP+H S +NL+VDGL LCNEQ+V   +E+N N S  +G+  T
Sbjct: 244  FIAQNTWTNELGWGKEEKCPSHVSSNNLKVDGLGLCNEQLVRSTMEENQNVSVSSGDAST 303

Query: 1312 NVSHGSSHTSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSFAYREKLEPWLVSDIKV 1133
            N S  SSH S NFPFVEGNPFTATLWVG+EGFHMTVNGRHETSFAYREKLEPW VS +KV
Sbjct: 304  NASQESSHASANFPFVEGNPFTATLWVGLEGFHMTVNGRHETSFAYREKLEPWSVSGVKV 363

Query: 1132 TGGLDLLSALANGLPVSEDLDLIDIER-LKAPLLPTKRIIMLIGVFSIGNNFERRMALRR 956
             GGLDLLSA A GLPV ED DLID  + LKAP++  KR++ML+GVFS GNNFERRMALRR
Sbjct: 364  VGGLDLLSAFAKGLPVPEDHDLIDNSKILKAPVITRKRLVMLVGVFSTGNNFERRMALRR 423

Query: 955  SWMQYEVVLSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLMPFVDYYDLITLKTIAI 776
            SWMQ+E V SG VAVRFF+GL+KN  VN ELW+EAQ YGDIQ MPFVDYY LI+LKTIAI
Sbjct: 424  SWMQFEAVRSGDVAVRFFIGLNKNLQVNFELWKEAQAYGDIQFMPFVDYYSLISLKTIAI 483

Query: 775  CLMGTKILPAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLISFDSEPIRDRDSKWYI 596
            C+MGTKILPAKY+MKTDDDAFVRIDE+L+ LKEKPSNGLLYGLI FDS P R++DSKWYI
Sbjct: 484  CIMGTKILPAKYIMKTDDDAFVRIDEVLSSLKEKPSNGLLYGLIEFDSSPHREKDSKWYI 543

Query: 595  SDEEWPHSLYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLEDVAMGIWIDQFKKNGQ 416
            SDEEWPHS YPPWAHGPGY+LSRD+AKFIVQGH+ER LKLFKLEDVAMGIWI++FK++G+
Sbjct: 544  SDEEWPHSSYPPWAHGPGYILSRDVAKFIVQGHKERELKLFKLEDVAMGIWIEEFKRSGR 603

Query: 415  EVHYMTDDRFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAVC 281
            EVHY+TDDRF  +GCE +YILAHYQ P MVLCLWEKLQKEH+A C
Sbjct: 604  EVHYITDDRFYNAGCESNYILAHYQGPRMVLCLWEKLQKEHQAYC 648


>XP_018814246.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT3 isoform X1
            [Juglans regia] XP_018814247.1 PREDICTED: hydroxyproline
            O-galactosyltransferase GALT3 isoform X1 [Juglans regia]
            XP_018814249.1 PREDICTED: hydroxyproline
            O-galactosyltransferase GALT3 isoform X1 [Juglans regia]
          Length = 635

 Score =  820 bits (2117), Expect = 0.0
 Identities = 403/641 (62%), Positives = 493/641 (76%), Gaps = 11/641 (1%)
 Frame = -3

Query: 2167 MRKWXXXXXXXXXXXXXXIRYSLVTKDSVNKHSAAYDFFNNHPTNASI-----TIKSSRP 2003
            M+KW              +RYSL+      K  +AY FF NHP N S      +I+SS  
Sbjct: 1    MKKWFGGMFVLALVMILVLRYSLMGIQP--KKQSAYSFFKNHPANESQKKDSGSIRSSEM 58

Query: 2002 RL--IPNDAKRADVINVYGLSELFSPTDISKEEAKVLDVWIHMRTILSRSDYLPETTQGI 1829
            ++  +   + +  ++N+ GLS+L+S  ++S++E+K L VW H+RT+LSRSD LP T +G+
Sbjct: 59   QVKKVAKPSIKTPLVNIEGLSDLYSSKNLSEKESKALLVWAHLRTLLSRSDALPGTAEGV 118

Query: 1828 IEASIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDSVLRYGSSLQFPC 1649
             EASIAW +L ST+ ++K+S+    +GS      ++NC +SVS+ + + L     L+ PC
Sbjct: 119  KEASIAWNDLSSTIEKEKSSKYSNTNGSK-----DRNCPYSVSILDQTALNGVVILEIPC 173

Query: 1648 GLVEDSSITVVGFPNGLNRSFQIELIGSKLKDEIQPPVVLHYEVSLPRDELKEDPVITQN 1469
            GLVEDSSIT+VG P+G + SFQIEL+GS+L  E  PP++LH+ VSLP D + E+P I QN
Sbjct: 174  GLVEDSSITLVGIPDGHHGSFQIELVGSQLSAEPTPPIILHFNVSLPGDNMTEEPFIVQN 233

Query: 1468 AWTAGSGWGKDEKCPAHHSISNLEVDGLVLCNEQVV---IEDNINRSQPNGEKLTNVSHG 1298
             WT+ +GWGK+EKCPA  S + ++VDGLVLCNEQ+V   +E+N N S P+ + L +VS G
Sbjct: 234  TWTSEAGWGKEEKCPARRSANIVKVDGLVLCNEQIVRNAVEENSNASHPSSDMLNSVSRG 293

Query: 1297 SSHTSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSFAYREKLEPWLVSDIKVTGGLD 1118
             +H S +FPFVEGNPFTATLWVGIEGFHMTV+GRHETSFAYREKLEPW VS + V GGLD
Sbjct: 294  VAHGSASFPFVEGNPFTATLWVGIEGFHMTVSGRHETSFAYREKLEPWSVSRVNVAGGLD 353

Query: 1117 LLSALANGLPVSEDLDL-IDIERLKAPLLPTKRIIMLIGVFSIGNNFERRMALRRSWMQY 941
            LLSA A GLPVSED DL ID+E LKAP +  KR +ML+GVFS GNNFERRMALRRSWMQY
Sbjct: 354  LLSAFAKGLPVSEDNDLVIDVEHLKAPSVSRKRCVMLVGVFSTGNNFERRMALRRSWMQY 413

Query: 940  EVVLSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLMPFVDYYDLITLKTIAICLMGT 761
            E V SG VAVRFFVGLHKN+ VN ELWREAQ YGD+QLMPFVDYY LI LKTIAIC+MGT
Sbjct: 414  EAVRSGDVAVRFFVGLHKNNQVNFELWREAQAYGDVQLMPFVDYYSLIALKTIAICIMGT 473

Query: 760  KILPAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLISFDSEPIRDRDSKWYISDEEW 581
            K+LPAKY+MKTDDDAFVRIDE+L+ LK K  NGLLYGLISF+S P RD+DSKWYIS EEW
Sbjct: 474  KVLPAKYIMKTDDDAFVRIDEVLSSLKGKAVNGLLYGLISFESAPHRDKDSKWYISTEEW 533

Query: 580  PHSLYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLEDVAMGIWIDQFKKNGQEVHYM 401
            PH+ YPPWAHGPGY++SRDIAKFIV+GHQER LKLFKLEDVAMGIWI+Q+K +GQEVHY+
Sbjct: 534  PHASYPPWAHGPGYIISRDIAKFIVRGHQERGLKLFKLEDVAMGIWIEQYKNSGQEVHYI 593

Query: 400  TDDRFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAVCC 278
             DDRF  +GCE+ Y+LAHYQ P  VLCLWE LQKEH A+CC
Sbjct: 594  NDDRFFNAGCEQDYVLAHYQGPRKVLCLWEMLQKEHRAICC 634


>XP_017631926.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT3 [Gossypium
            arboreum] KHG08744.1 putative
            beta-1,3-galactosyltransferase 16 -like protein
            [Gossypium arboreum]
          Length = 650

 Score =  820 bits (2118), Expect = 0.0
 Identities = 407/646 (63%), Positives = 492/646 (76%), Gaps = 16/646 (2%)
 Frame = -3

Query: 2167 MRKWXXXXXXXXXXXXXXIRYSL--VTKDSVNKHSAAYDFFNNHPT-------NASIT-- 2021
            M+KW                YSL    +    K  +AYDFFNNHP        N S    
Sbjct: 13   MKKWYGGVLILVLAIVMVFSYSLRETQRPQPKKKQSAYDFFNNHPPIDSHRKGNDSFKLP 72

Query: 2020 -IKSSRPRLIPNDAKRADVINVYGLSELFSPTDISKEEAKVLDVWIHMRTILSRSDYLPE 1844
             +++ +P LI    ++  +INV GL EL++P ++S++E+ VL +W H+  +LSRSD LPE
Sbjct: 73   KVEAKKPSLI----QKPKLINVEGLDELYAPRNVSEQESNVLLLWPHLHLLLSRSDALPE 128

Query: 1843 TTQGIIEASIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDSVLRYGSS 1664
            T QGI EA+ AWKELL+ + E+K ++L  N          KNC FSV   ++++   G+ 
Sbjct: 129  TGQGIKEAAKAWKELLALIEEEKTTKLSNNIRLKE-----KNCPFSVCSPDNALFSGGNI 183

Query: 1663 LQFPCGLVEDSSITVVGFPNGLNRSFQIELIGSKLKDEIQPPVVLHYEVSLPRDELKEDP 1484
            L+ PCGLVEDSSIT++G PNG  RSF+I+L+GS   +E +PP+VLHY VS+  D + E+P
Sbjct: 184  LELPCGLVEDSSITLIGTPNGSYRSFEIDLVGSNFSEEPKPPIVLHYNVSVAGDNMTEEP 243

Query: 1483 VITQNAWTAGSGWGKDEKCPAHHSISNLEVDGLVLCNEQVV---IEDNINRSQPNGEKLT 1313
             I QN WT   GWGK+EKCP+H S +NL+VDGL LCNEQ+V   +E+N N S  +G+  T
Sbjct: 244  FIAQNTWTNELGWGKEEKCPSHVSSNNLKVDGLGLCNEQLVRSTMEENQNVSVSSGDAAT 303

Query: 1312 NVSHGSSHTSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSFAYREKLEPWLVSDIKV 1133
            N S  SSH S NFPFVEGNPFTATLWVG+EGFHMTVNGRHETSFAYREKLEPW VS +KV
Sbjct: 304  NASQQSSHASANFPFVEGNPFTATLWVGLEGFHMTVNGRHETSFAYREKLEPWSVSGVKV 363

Query: 1132 TGGLDLLSALANGLPVSEDLDLI-DIERLKAPLLPTKRIIMLIGVFSIGNNFERRMALRR 956
             GGLDLLSA A GLPV ED DLI + + LKAP++  KR++ML+GVFS GNNFERRMALRR
Sbjct: 364  VGGLDLLSAFAKGLPVPEDHDLIVNSKILKAPVITRKRLVMLVGVFSTGNNFERRMALRR 423

Query: 955  SWMQYEVVLSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLMPFVDYYDLITLKTIAI 776
            SWMQ+E V SG VAVRFF+GL+KN  VN E W+EAQ YGDIQ MPFVDYY LI+LKTIAI
Sbjct: 424  SWMQFEAVRSGDVAVRFFIGLNKNLQVNFEQWKEAQAYGDIQFMPFVDYYSLISLKTIAI 483

Query: 775  CLMGTKILPAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLISFDSEPIRDRDSKWYI 596
            C+MGTKILPAKY+MKTDDDAFVRIDE+L+ LKEKPSNGLLYGLI FDS P R++DSKWYI
Sbjct: 484  CIMGTKILPAKYIMKTDDDAFVRIDEVLSSLKEKPSNGLLYGLIEFDSSPHREKDSKWYI 543

Query: 595  SDEEWPHSLYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLEDVAMGIWIDQFKKNGQ 416
            SDEEWPHS YPPWAHGPGY++SRD+AKFIVQGH+ER LKLFKLEDVAMGIWI++FK++G+
Sbjct: 544  SDEEWPHSSYPPWAHGPGYIISRDVAKFIVQGHKERELKLFKLEDVAMGIWIEEFKRSGR 603

Query: 415  EVHYMTDDRFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAVCC 278
            EVHY+TDDRF  +GCE +YILAHYQ P MVLCLWEKLQKEH+A CC
Sbjct: 604  EVHYITDDRFYNAGCESNYILAHYQGPRMVLCLWEKLQKEHQAYCC 649


>XP_016742176.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT3-like isoform
            X1 [Gossypium hirsutum]
          Length = 650

 Score =  816 bits (2109), Expect = 0.0
 Identities = 405/646 (62%), Positives = 492/646 (76%), Gaps = 16/646 (2%)
 Frame = -3

Query: 2167 MRKWXXXXXXXXXXXXXXIRYSL--VTKDSVNKHSAAYDFFNNHPT-------NASIT-- 2021
            M+KW                YSL    +    K  +AYDFFNNHP        N S    
Sbjct: 13   MKKWYGGVLILVLAIVMVFSYSLRETQRPQPKKKQSAYDFFNNHPPIDSHRKGNDSFKLP 72

Query: 2020 -IKSSRPRLIPNDAKRADVINVYGLSELFSPTDISKEEAKVLDVWIHMRTILSRSDYLPE 1844
             +++ +P LI    ++  +INV GL +L++P ++S++E+ VL +W H+  +LSRSD LPE
Sbjct: 73   KVEAKKPSLI----QKPKLINVEGLDKLYAPRNVSEQESNVLLLWPHLHLLLSRSDALPE 128

Query: 1843 TTQGIIEASIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDSVLRYGSS 1664
            T QGI EA+ AWKELL+ + E+K ++L  N          KNC FSV   ++++   G+ 
Sbjct: 129  TGQGIKEAAKAWKELLALIEEEKTTKLSNNIRLKE-----KNCPFSVCSPDNALFSGGNI 183

Query: 1663 LQFPCGLVEDSSITVVGFPNGLNRSFQIELIGSKLKDEIQPPVVLHYEVSLPRDELKEDP 1484
            L+ PCGLVEDSSIT++G PNG  RSF+I+L+GS   +E +PP+VLHY VS+  D + E+P
Sbjct: 184  LELPCGLVEDSSITLIGTPNGSYRSFEIDLVGSNFSEEPKPPIVLHYNVSVAGDNMTEEP 243

Query: 1483 VITQNAWTAGSGWGKDEKCPAHHSISNLEVDGLVLCNEQVV---IEDNINRSQPNGEKLT 1313
             I QN WT   GWGK+EKCP+H S +NL+VDGL LCNEQ+V   +E+N N S  +G+  T
Sbjct: 244  FIAQNTWTNELGWGKEEKCPSHVSSNNLKVDGLGLCNEQLVRSTMEENQNVSVSSGDAST 303

Query: 1312 NVSHGSSHTSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSFAYREKLEPWLVSDIKV 1133
            N S  SSH S NFPFVEGNPFTATLWVG+EGFHMTVNGRHETSFAYREKLEPW VS +KV
Sbjct: 304  NASQQSSHASANFPFVEGNPFTATLWVGLEGFHMTVNGRHETSFAYREKLEPWSVSGVKV 363

Query: 1132 TGGLDLLSALANGLPVSEDLDLI-DIERLKAPLLPTKRIIMLIGVFSIGNNFERRMALRR 956
             GGLDLLSA A GLPV ED DLI + + LKAP++  KR++ML+GVFS GNNFERRMALRR
Sbjct: 364  VGGLDLLSAFAKGLPVPEDHDLIVNSKILKAPVITRKRLVMLVGVFSTGNNFERRMALRR 423

Query: 955  SWMQYEVVLSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLMPFVDYYDLITLKTIAI 776
            SWMQ+E V SG VAV+FF+GL+KN  VN ELW+EAQ YGDIQ MPFVDYY LI+LKTIAI
Sbjct: 424  SWMQFEAVRSGDVAVQFFIGLNKNLQVNFELWKEAQAYGDIQFMPFVDYYSLISLKTIAI 483

Query: 775  CLMGTKILPAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLISFDSEPIRDRDSKWYI 596
            C+MGTK LPAKY+MKTDDDAFVRIDE+L+ LKEKPSNGLLYGLI FDS P R++DSKWYI
Sbjct: 484  CIMGTKSLPAKYIMKTDDDAFVRIDEVLSSLKEKPSNGLLYGLIEFDSSPHREKDSKWYI 543

Query: 595  SDEEWPHSLYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLEDVAMGIWIDQFKKNGQ 416
            SDEEWPHS YPPWAHGPGY++SRD+AKFIVQGH+ER LKLFKLEDVAMGIWI++FK++G+
Sbjct: 544  SDEEWPHSSYPPWAHGPGYIISRDVAKFIVQGHKERELKLFKLEDVAMGIWIEEFKRSGR 603

Query: 415  EVHYMTDDRFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAVCC 278
            EVHY+TDDRF  +GCE +YILAHYQ P MVLCLWEKLQKEH+A CC
Sbjct: 604  EVHYITDDRFYNAGCESNYILAHYQGPRMVLCLWEKLQKEHQAYCC 649


>XP_015867148.1 PREDICTED: probable beta-1,3-galactosyltransferase 16 [Ziziphus
            jujuba]
          Length = 646

 Score =  816 bits (2107), Expect = 0.0
 Identities = 400/652 (61%), Positives = 501/652 (76%), Gaps = 22/652 (3%)
 Frame = -3

Query: 2167 MRKWXXXXXXXXXXXXXXIRYSLVTKDSVNKHSAAYDFFNNHPTNASIT----------- 2021
            M+KW               RYSLV +    +  +AY FF+ HP N ++T           
Sbjct: 1    MKKWSGGVMILALAMILIFRYSLVGRQP--QKQSAYSFFHYHPENDALTKDDSEFIMPSE 58

Query: 2020 -----IKSSRPRLIPND--AKRADVINVYGLSELFSPTDISKEEAKVLDVWIHMRTILSR 1862
                 +   +P+L P+   +K+  ++NV GL +L++  ++S++++K L VW +MR +LSR
Sbjct: 59   IRVKTVPKPKPKLKPSTKPSKKPHLVNVQGLDQLYNSHNMSQQDSKALLVWAYMRPLLSR 118

Query: 1861 SDYLPETTQGIIEASIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDSV 1682
            SD LPET QG+ EAS+AWK+L+S + E+KAS       SSS+  +NKNC  SV+  + SV
Sbjct: 119  SDALPETAQGVKEASVAWKDLVSIIEEEKASDF-----SSSNGPENKNCPSSVNTLDKSV 173

Query: 1681 LRYGSSLQFPCGLVEDSSITVVGFPNGLNRSFQIELIGSKLKDEIQPPVVLHYEVSLPRD 1502
            L  G+ LQFPCGL+EDSSI+++G P+G + SF IELIGS L  + +PP++LHY VSLP +
Sbjct: 174  LGDGAILQFPCGLIEDSSISMLGIPDGPSGSFLIELIGSTLSGDSEPPIILHYNVSLPGN 233

Query: 1501 ELKEDPVITQNAWTAGSGWGKDEKCPAHHSISNLEVDGLVLCNEQVV---IEDNINRSQP 1331
             + E+P I Q+ WT   GWGK+E+CPAH S ++L+VDGLVLCNEQVV    E+N+N S+P
Sbjct: 234  NMTEEPFIVQSTWTKELGWGKEERCPAHRSANSLKVDGLVLCNEQVVRSTSEENLNTSRP 293

Query: 1330 NGEKLTNVSHGSSHTSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSFAYREKLEPWL 1151
            + + LTNVS GS H + +FPFVEG PFTATLWVG+EGFH+TVNGRHETSFAYREKLEPW 
Sbjct: 294  SNDMLTNVSKGSDHGTVSFPFVEGTPFTATLWVGLEGFHVTVNGRHETSFAYREKLEPWS 353

Query: 1150 VSDIKVTGGLDLLSALANGLPVSEDLDLI-DIERLKAPLLPTKRIIMLIGVFSIGNNFER 974
            V  +KV+GGL+LLSALA  LPVSED DL+ D+E LKAP +P KR++ML+GVFS GNNFER
Sbjct: 354  VGKVKVSGGLNLLSALAKDLPVSEDHDLVVDVELLKAPSIPKKRLVMLVGVFSSGNNFER 413

Query: 973  RMALRRSWMQYEVVLSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLMPFVDYYDLIT 794
            RMALRRSWMQYE V SG VAVRFF+GLHKN  VN ELWREAQ YGDIQLMPFVDYY LI+
Sbjct: 414  RMALRRSWMQYEPVRSGDVAVRFFIGLHKNKQVNYELWREAQAYGDIQLMPFVDYYSLIS 473

Query: 793  LKTIAICLMGTKILPAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLISFDSEPIRDR 614
            LKTIAIC+MGTK+LPAK++MKTDDDAFVRIDE+L+ LKEK +NGLLYGLISF+S P RD+
Sbjct: 474  LKTIAICIMGTKVLPAKFIMKTDDDAFVRIDEVLSSLKEKSTNGLLYGLISFESSPQRDK 533

Query: 613  DSKWYISDEEWPHSLYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLEDVAMGIWIDQ 434
            DSKWYIS++EWPH+ YPPWAHGPGY++SRDIAKFIVQGHQER L+LFKLEDVAMGIWI++
Sbjct: 534  DSKWYISNKEWPHASYPPWAHGPGYIISRDIAKFIVQGHQERDLQLFKLEDVAMGIWIEE 593

Query: 433  FKKNGQEVHYMTDDRFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAVCC 278
             + +GQEVHY  D+RF  +GC+ +YILAHYQ+P +VLCLWEKL KE    CC
Sbjct: 594  LRNSGQEVHYTNDERFFNAGCQSNYILAHYQSPRLVLCLWEKLLKEKRPNCC 645


>XP_002274418.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT3 [Vitis
            vinifera] CBI25973.3 unnamed protein product, partial
            [Vitis vinifera]
          Length = 637

 Score =  815 bits (2104), Expect = 0.0
 Identities = 403/639 (63%), Positives = 486/639 (76%), Gaps = 9/639 (1%)
 Frame = -3

Query: 2167 MRKWXXXXXXXXXXXXXXIRYSLVTKDSVNKHSAAYDFFNNHPTNASI-----TIKSSRP 2003
            MRKW              ++Y+L+      +    + FF NHP N S      ++ S + 
Sbjct: 1    MRKWYGGVLIIALAVILLLQYTLMGNRP--QKQPPHRFFGNHPANTSKLKDSDSVSSVKE 58

Query: 2002 RLIPNDAKRADVINVYGLSELFSPTDISKEEAKVLDVWIHMRTILSRSDYLPETTQGIIE 1823
            + + N  K+A +I+V GL +L++  +ISKE++K L VW HM  +L RSD LPET QGI E
Sbjct: 59   KKVLNHRKKAHLIDVEGLDDLYALNNISKEDSKALLVWAHMYPLLCRSDALPETAQGIKE 118

Query: 1822 ASIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDSVLRYGSSLQFPCGL 1643
            AS AWK+L S + E KAS+   N+ S +   + K+C FSVS  + +V   G  L+FPCGL
Sbjct: 119  ASSAWKDLWSAIEEDKASKFN-NTQSENGNPEAKDCPFSVSTFDKTVYSSGCILEFPCGL 177

Query: 1642 VEDSSITVVGFPNGLNRSFQIELIGSKLKDEIQPPVVLHYEVSLPRDELKEDPVITQNAW 1463
            VEDSSITV+G P+G N SFQ+EL+G +L  E +PP++LHY VSLP D+L E+PVI QN W
Sbjct: 178  VEDSSITVIGIPDGRNGSFQVELVGLQLPGEREPPILLHYNVSLPGDKLTEEPVIVQNTW 237

Query: 1462 TAGSGWGKDEKCPAHHSISNLEVDGLVLCNEQVV---IEDNINRSQPNGEKLTNVSHGSS 1292
            T  +GWGK+E+C AH S +  +VDGLVLCN+ VV   +E+N+N + PN + LTNVS G +
Sbjct: 238  TNETGWGKEERCHAHASTNIQKVDGLVLCNQLVVRSTVEENLNMTHPNSDMLTNVSSGRA 297

Query: 1291 HTSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSFAYREKLEPWLVSDIKVTGGLDLL 1112
            H S NFPF EGNPFTATLWVG EGFHMTVNGRHETSF YREKLEPWLVS +KV GGL+LL
Sbjct: 298  HVSANFPFAEGNPFTATLWVGSEGFHMTVNGRHETSFTYREKLEPWLVSGVKVAGGLELL 357

Query: 1111 SALANGLPVSEDLDL-IDIERLKAPLLPTKRIIMLIGVFSIGNNFERRMALRRSWMQYEV 935
            SA A  LPVSEDLDL +D+E LKAP +  KR++ML+GVFS GNNFERRMALRR+WMQYE 
Sbjct: 358  SAFAKDLPVSEDLDLAVDVEHLKAPPVSRKRLVMLVGVFSTGNNFERRMALRRTWMQYEA 417

Query: 934  VLSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLMPFVDYYDLITLKTIAICLMGTKI 755
            V SG VAVRFF+GLHKN  VN ELWREAQ YGDIQLMPFVDYY LI+LKTIA C+MGTKI
Sbjct: 418  VRSGDVAVRFFIGLHKNRQVNLELWREAQAYGDIQLMPFVDYYSLISLKTIATCIMGTKI 477

Query: 754  LPAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLISFDSEPIRDRDSKWYISDEEWPH 575
            LPAKY+MKTDDDAFVRIDE+L+ LK KPSNGLLYGLISFDS P RD+DSKW+IS EEWP 
Sbjct: 478  LPAKYVMKTDDDAFVRIDEVLSSLKGKPSNGLLYGLISFDSAPHRDKDSKWHISAEEWPR 537

Query: 574  SLYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLEDVAMGIWIDQFKKNGQEVHYMTD 395
              YPPWAHGPGY++SRDIAKFIVQGHQER L+LFKLEDVAMGIWID+FK   Q+V+Y++D
Sbjct: 538  DTYPPWAHGPGYIISRDIAKFIVQGHQERDLQLFKLEDVAMGIWIDEFKNKDQQVNYISD 597

Query: 394  DRFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAVCC 278
            +RF  +GCE +YILAHYQ P  VLCLWE LQKE + +CC
Sbjct: 598  ERFYNTGCESNYILAHYQGPRKVLCLWEMLQKEQKPICC 636


>OAY56191.1 hypothetical protein MANES_03G209300 [Manihot esculenta]
          Length = 652

 Score =  811 bits (2096), Expect = 0.0
 Identities = 405/644 (62%), Positives = 479/644 (74%), Gaps = 14/644 (2%)
 Frame = -3

Query: 2167 MRKWXXXXXXXXXXXXXXIRYSLVT--KDSVNKHSAAYDFFNNHPTNASITIKSSRPRL- 1997
            M+KW                YSL+   +    K   AYDFF NHP N S    +S  R  
Sbjct: 13   MKKWSGGMMIVALAVILVFSYSLLKTQRSQPQKKQTAYDFFRNHPINDSHVKDTSYARSP 72

Query: 1996 ------IPNDAKRADVINVYGLSELFSPTDISKEEAKVLDVWIHMRTILSRSDYLPETTQ 1835
                  +   +K+   +NV GL++L++P + SKEE+K L VW  MR +LSRSD LPET +
Sbjct: 73   QVEVDKVAKSSKKIHFVNVEGLNDLYAPNNFSKEESKALLVWSQMRLLLSRSDALPETAK 132

Query: 1834 GIIEASIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDSVLRYGSSLQF 1655
            GI EASIAWK+LLS + E KA++  +   +     +NKNC +SV+  N      G +   
Sbjct: 133  GIKEASIAWKDLLSMIEEDKATKSSIIDKT-----ENKNCPYSVNAINIMASSNGPTFDI 187

Query: 1654 PCGLVEDSSITVVGFPNGLNRSFQIELIGSKLKDEIQPPVVLHYEVSLPRDELKEDPVIT 1475
            PCGLVEDSSIT+VG PN  N SFQ+EL GS+L  E  PP++LHY VSLP D + E+P I 
Sbjct: 188  PCGLVEDSSITIVGIPNEHNGSFQLELEGSQLLGEQNPPIILHYRVSLPGDNITEEPFIV 247

Query: 1474 QNAWTAGSGWGKDEKCPAHHS-ISNLEVDGLVLCNEQVV---IEDNINRSQPNGEKLTNV 1307
            QN WT   GWGK+EKCPAH S I   +VDGLVLCNEQ+V   +E+ +N S P+ + L NV
Sbjct: 248  QNTWTNEHGWGKEEKCPAHGSNIPKPKVDGLVLCNEQIVRSTVEETLNASLPSRDILANV 307

Query: 1306 SHGSSHTSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSFAYREKLEPWLVSDIKVTG 1127
            S GS+H S NFPF E NPFTATLWVG EGFHMTVNGRHETSFAYREKLEPW VS +KV G
Sbjct: 308  SQGSAHASANFPFSEANPFTATLWVGSEGFHMTVNGRHETSFAYREKLEPWAVSGVKVDG 367

Query: 1126 GLDLLSALANGLPVSEDLDL-IDIERLKAPLLPTKRIIMLIGVFSIGNNFERRMALRRSW 950
            GLD+LS LA GLPVSED DL ID E L+AP+   KR+ +L+GVFS GNNFERRMALRRSW
Sbjct: 368  GLDILSVLAKGLPVSEDHDLVIDAELLRAPVTKKKRLALLVGVFSTGNNFERRMALRRSW 427

Query: 949  MQYEVVLSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLMPFVDYYDLITLKTIAICL 770
            MQYE V SG VAVRFF+GLHKN  VN ELW+EAQ YGD+QLMPFVDYY LI+LKTIAIC+
Sbjct: 428  MQYEAVHSGDVAVRFFIGLHKNRQVNFELWKEAQAYGDVQLMPFVDYYSLISLKTIAICI 487

Query: 769  MGTKILPAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLISFDSEPIRDRDSKWYISD 590
            MGTKILPAKY+MKTDDDAFVRIDE+L  LK K S+GLLYGL+SFDS P R++DSKWYIS+
Sbjct: 488  MGTKILPAKYIMKTDDDAFVRIDEVLTSLKGKASDGLLYGLMSFDSSPHREKDSKWYISN 547

Query: 589  EEWPHSLYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLEDVAMGIWIDQFKKNGQEV 410
            EEWPHS YPPWAHGPGY++SR+IAKFI QGHQER  KLFKLEDVAMGIWI++ KK GQEV
Sbjct: 548  EEWPHSSYPPWAHGPGYIVSRNIAKFIAQGHQERDFKLFKLEDVAMGIWIEELKKRGQEV 607

Query: 409  HYMTDDRFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAVCC 278
            HY++D+RF  +GCE +YILAHYQ+P +VLCLWEKLQKEH+  CC
Sbjct: 608  HYVSDERFHNAGCESNYILAHYQSPRLVLCLWEKLQKEHQPNCC 651


>XP_007035910.2 PREDICTED: hydroxyproline O-galactosyltransferase GALT3 [Theobroma
            cacao] XP_017975894.1 PREDICTED: hydroxyproline
            O-galactosyltransferase GALT3 [Theobroma cacao]
          Length = 643

 Score =  807 bits (2085), Expect = 0.0
 Identities = 411/653 (62%), Positives = 492/653 (75%), Gaps = 10/653 (1%)
 Frame = -3

Query: 2206 MVKYSCSFS*MRLMRKWXXXXXXXXXXXXXXIRYSLVTKDSVNKHSAAYDFFNNHPTNAS 2027
            M   S +F   R M+KW                YSL  +++  K  +AYDFFNNHP   S
Sbjct: 1    MKSLSLAFGLFR-MKKWYGGVLIVVLAIILVFSYSL--RETQPKKQSAYDFFNNHPPKDS 57

Query: 2026 IT-----IKSSRPRLIPNDA-KRADVINVYGLSELFSPTDISKEEAKVLDVWIHMRTILS 1865
             T     IKS +  +      K+  +INV GL++L++PT+IS E++K L +W HMR +LS
Sbjct: 58   HTKENDSIKSPKVEVKKLALIKKPKLINVEGLNDLYAPTNIS-EKSKALLLWPHMRLLLS 116

Query: 1864 RSDYLPETTQGIIEASIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDS 1685
            RSD LPET QGI EA+IAWKELL+ + E+K +   +           KNC FSVS  + +
Sbjct: 117  RSDALPETGQGIKEATIAWKELLAVIEEEKTTSHNIRL-------KEKNCPFSVSNLDKT 169

Query: 1684 VLRYGSSLQFPCGLVEDSSITVVGFPNGLNRSFQIELIGSKLKDEIQPPVVLHYEVSLPR 1505
            +   G+ L+ PCGLVEDSSITV+G P+G  RSF+IEL GS    E QP V+LHY VS+  
Sbjct: 170  LFSGGNILELPCGLVEDSSITVIGIPDGRYRSFEIELAGSNFSGEPQPSVILHYNVSVAG 229

Query: 1504 DELKEDPVITQNAWTAGSGWGKDEKCPAHHSISNLEVDGLVLCNEQVV---IEDNINRSQ 1334
            D + E+P I QN WT   GWGK+E+CPAH S +NL+VDGL LCNEQ+V   +E+N N S 
Sbjct: 230  DNMTEEPFIVQNTWTNELGWGKEERCPAHVSSNNLKVDGLGLCNEQLVRSLMEENQNVSL 289

Query: 1333 PNGEKLTNVSHGSSHTSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSFAYREKLEPW 1154
             +G  LTN S   SH S NFPF+EGNPFTATLWVG+EGFHMTVNGRHETSFAYREKLEPW
Sbjct: 290  SSGNALTNASQARSHASANFPFIEGNPFTATLWVGLEGFHMTVNGRHETSFAYREKLEPW 349

Query: 1153 LVSDIKVTGGLDLLSALANGLPVSEDLDLI-DIERLKAPLLPTKRIIMLIGVFSIGNNFE 977
             VS +KV GGLDLLSA A GLPV ED DLI + + LKAP +  KR++ML+GVFS GNNFE
Sbjct: 350  SVSGVKVAGGLDLLSAFAKGLPVPEDHDLIVNSKLLKAPAVSRKRLLMLVGVFSTGNNFE 409

Query: 976  RRMALRRSWMQYEVVLSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLMPFVDYYDLI 797
            RRMALRRSWMQ++ V SG VAVRFF+GL+KN  VN ELW+EAQ YGDIQ MPFVDYY LI
Sbjct: 410  RRMALRRSWMQFQAVRSGDVAVRFFIGLNKNRQVNFELWKEAQAYGDIQFMPFVDYYSLI 469

Query: 796  TLKTIAICLMGTKILPAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLISFDSEPIRD 617
            +LKTIAIC++GTKILPAKY+MKTDDDAFVRIDE+L+ LKEK S+GLLYG I+FDS P RD
Sbjct: 470  SLKTIAICILGTKILPAKYIMKTDDDAFVRIDEVLSSLKEKASDGLLYGRIAFDSSPHRD 529

Query: 616  RDSKWYISDEEWPHSLYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLEDVAMGIWID 437
            +DSKWYIS+EEWPHS YPPWAHGPGY++SRDIAKFIV+GHQER LKLFKLEDVAMGIWI+
Sbjct: 530  KDSKWYISNEEWPHSSYPPWAHGPGYIISRDIAKFIVRGHQERELKLFKLEDVAMGIWIE 589

Query: 436  QFKKNGQEVHYMTDDRFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAVCC 278
            +FK +G+EVHY+TD+RF  +GCE +YILAHYQ P MVLCLWEKLQKEH+A CC
Sbjct: 590  EFKNSGREVHYVTDERFYNAGCESNYILAHYQGPRMVLCLWEKLQKEHQAHCC 642


>XP_015900118.1 PREDICTED: probable beta-1,3-galactosyltransferase 16 [Ziziphus
            jujuba] XP_015900119.1 PREDICTED: probable
            beta-1,3-galactosyltransferase 16 [Ziziphus jujuba]
            XP_015900120.1 PREDICTED: probable
            beta-1,3-galactosyltransferase 16 [Ziziphus jujuba]
            XP_015900121.1 PREDICTED: probable
            beta-1,3-galactosyltransferase 16 [Ziziphus jujuba]
            XP_015900122.1 PREDICTED: probable
            beta-1,3-galactosyltransferase 16 [Ziziphus jujuba]
            XP_015900123.1 PREDICTED: probable
            beta-1,3-galactosyltransferase 16 [Ziziphus jujuba]
          Length = 656

 Score =  808 bits (2086), Expect = 0.0
 Identities = 400/662 (60%), Positives = 501/662 (75%), Gaps = 32/662 (4%)
 Frame = -3

Query: 2167 MRKWXXXXXXXXXXXXXXIRYSLVTKDSVNKHSAAYDFFNNHPTNASIT----------- 2021
            M+KW               RYSLV +    +  +AY FF+ HP N ++T           
Sbjct: 1    MKKWSGGVMILALAMILIFRYSLVGRQP--QKQSAYSFFHYHPENDALTKDDSEFIMPSE 58

Query: 2020 -----IKSSRPRLIPND--AKRADVINVYGLSELFSPTDISKEEAKVLDVWIHMRTILSR 1862
                 +   +P+L P+   +K+  ++NV GL +L++  ++S++++K L VW +MR +LSR
Sbjct: 59   IRVKTVPKPKPKLKPSTKPSKKPHLVNVQGLDQLYNSHNMSQQDSKALLVWAYMRPLLSR 118

Query: 1861 SDYLPETTQGIIEASIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDSV 1682
            SD LPET QG+ EAS+AWK+L+S + E+KAS       SSS+  +NKNC  SV+  + SV
Sbjct: 119  SDALPETAQGVKEASVAWKDLVSIIEEEKASDF-----SSSNGPENKNCPSSVNTLDKSV 173

Query: 1681 LRYGSSLQFPCGLVEDSSITVVGFP----------NGLNRSFQIELIGSKLKDEIQPPVV 1532
            L  G+ LQFPCGL+EDSSI+++G P          +G + SF IELIGS L  + +PP++
Sbjct: 174  LGDGAILQFPCGLIEDSSISMLGIPTSSISMLGIRDGPSGSFLIELIGSTLSGDSEPPII 233

Query: 1531 LHYEVSLPRDELKEDPVITQNAWTAGSGWGKDEKCPAHHSISNLEVDGLVLCNEQVV--- 1361
            LHY VSLP + + E+P I Q+ WT   GWGK+E+CPAH S ++L+VDGLVLCNEQVV   
Sbjct: 234  LHYNVSLPGNNMTEEPFIVQSTWTKELGWGKEERCPAHRSANSLKVDGLVLCNEQVVRST 293

Query: 1360 IEDNINRSQPNGEKLTNVSHGSSHTSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSF 1181
             E+N+N S+P+ + LTNVS GS H + +FPFVEG PFTATLWVG+EGFH+TVNGRHETSF
Sbjct: 294  SEENLNTSRPSNDMLTNVSKGSDHGTVSFPFVEGTPFTATLWVGLEGFHVTVNGRHETSF 353

Query: 1180 AYREKLEPWLVSDIKVTGGLDLLSALANGLPVSEDLDLI-DIERLKAPLLPTKRIIMLIG 1004
            AYREKLEPW V  +KV+GGL+LLSALA  LPVSED DL+ D+E LKAP +P KR++ML+G
Sbjct: 354  AYREKLEPWSVGKVKVSGGLNLLSALAKDLPVSEDHDLVVDVELLKAPSIPKKRLVMLVG 413

Query: 1003 VFSIGNNFERRMALRRSWMQYEVVLSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLM 824
            VFS GNNFERRMALRRSWMQYE V SG VAVRFF+GLHKN  VN ELWREAQ YGDIQLM
Sbjct: 414  VFSSGNNFERRMALRRSWMQYEPVRSGDVAVRFFIGLHKNKQVNYELWREAQAYGDIQLM 473

Query: 823  PFVDYYDLITLKTIAICLMGTKILPAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLI 644
            PFVDYY LI+LKTIAIC+MGTK+LPAK++MKTDDDAFVRIDE+L+ LKEK +NGLLYGLI
Sbjct: 474  PFVDYYSLISLKTIAICIMGTKVLPAKFIMKTDDDAFVRIDEVLSSLKEKSTNGLLYGLI 533

Query: 643  SFDSEPIRDRDSKWYISDEEWPHSLYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLE 464
            SF+S P RD+DSKWYIS++EWPH+ YPPWAHGPGY++SRDIAKFIVQGHQER L+LFKLE
Sbjct: 534  SFESSPQRDKDSKWYISNKEWPHASYPPWAHGPGYIISRDIAKFIVQGHQERDLQLFKLE 593

Query: 463  DVAMGIWIDQFKKNGQEVHYMTDDRFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAV 284
            DVAMGIWI++ + +GQEVHY  D+RF  +GC+ +YILAHYQ+P +VLCLWEKL KE    
Sbjct: 594  DVAMGIWIEELRNSGQEVHYTNDERFFNAGCQSNYILAHYQSPRLVLCLWEKLLKEKRPN 653

Query: 283  CC 278
            CC
Sbjct: 654  CC 655


>EOY06836.1 Beta-1,3-galactosyltransferase 16 isoform 1 [Theobroma cacao]
          Length = 643

 Score =  806 bits (2081), Expect = 0.0
 Identities = 411/653 (62%), Positives = 491/653 (75%), Gaps = 10/653 (1%)
 Frame = -3

Query: 2206 MVKYSCSFS*MRLMRKWXXXXXXXXXXXXXXIRYSLVTKDSVNKHSAAYDFFNNHPTNAS 2027
            M   S +F   R M+KW                YSL  +++  K  +AYDFFNNHP   S
Sbjct: 1    MKSLSLAFGLFR-MKKWYGGVLIVVLAIILVFSYSL--RETQPKKQSAYDFFNNHPPKDS 57

Query: 2026 IT-----IKSSRPRLIPNDA-KRADVINVYGLSELFSPTDISKEEAKVLDVWIHMRTILS 1865
             T     IKS +  +      K+  +INV GL++L++PT+IS EE+K L +W HMR +LS
Sbjct: 58   HTKENDSIKSPKVEVKKLALIKKPKLINVEGLNDLYAPTNIS-EESKALLLWPHMRLLLS 116

Query: 1864 RSDYLPETTQGIIEASIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDS 1685
            RSD LPET QGI EA+IAWKELL+ + E+K +   +           KNC FSVS  + +
Sbjct: 117  RSDALPETGQGIKEAAIAWKELLAVIEEEKTTSHNIRL-------KEKNCPFSVSNLDKT 169

Query: 1684 VLRYGSSLQFPCGLVEDSSITVVGFPNGLNRSFQIELIGSKLKDEIQPPVVLHYEVSLPR 1505
            +   G+ L+ PCGLVEDSSITV+G P+G  RSF+IEL GS    E QP V+LHY VS+  
Sbjct: 170  LFSGGNILELPCGLVEDSSITVIGIPDGRYRSFEIELAGSNFSGEPQPSVILHYNVSVAG 229

Query: 1504 DELKEDPVITQNAWTAGSGWGKDEKCPAHHSISNLEVDGLVLCNEQVV---IEDNINRSQ 1334
            D + E+P I QN WT   GWGK+E+CPAH S +NL+VD L LCNEQ+V   +E+N N S 
Sbjct: 230  DNMTEEPFIVQNTWTNELGWGKEERCPAHVSSNNLKVDRLGLCNEQLVRSLMEENQNVSL 289

Query: 1333 PNGEKLTNVSHGSSHTSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSFAYREKLEPW 1154
             +G  LTN S   SH S NFPF+EGNPFTATLWVG+EGFHMTVNGRHETSFAYREKLEPW
Sbjct: 290  SSGNALTNASQARSHASANFPFIEGNPFTATLWVGLEGFHMTVNGRHETSFAYREKLEPW 349

Query: 1153 LVSDIKVTGGLDLLSALANGLPVSEDLDLI-DIERLKAPLLPTKRIIMLIGVFSIGNNFE 977
             VS +KV GGLDLLSA A GLPV ED DLI + + LKAP +  KR++ML+GVFS GNNFE
Sbjct: 350  SVSGVKVAGGLDLLSAFAKGLPVPEDHDLIVNSKLLKAPAVSRKRLLMLVGVFSTGNNFE 409

Query: 976  RRMALRRSWMQYEVVLSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLMPFVDYYDLI 797
            RRMALRRSWMQ++ V SG VAVRFF+GL+KN  VN ELW+EAQ YGDIQ MPFVDYY LI
Sbjct: 410  RRMALRRSWMQFQAVRSGDVAVRFFIGLNKNRQVNFELWKEAQAYGDIQFMPFVDYYSLI 469

Query: 796  TLKTIAICLMGTKILPAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLISFDSEPIRD 617
            +LKTIAIC++GTKILPAKY+MKTDDDAFVRIDE+L+ LKEK S+GLLYG I+FDS P RD
Sbjct: 470  SLKTIAICILGTKILPAKYIMKTDDDAFVRIDEVLSSLKEKASDGLLYGRIAFDSSPHRD 529

Query: 616  RDSKWYISDEEWPHSLYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLEDVAMGIWID 437
            +DSKWYIS+EEWPHS YPPWAHGPGY++SRDIAKFIV+GHQER LKLFKLEDVAMGIWI+
Sbjct: 530  KDSKWYISNEEWPHSSYPPWAHGPGYIISRDIAKFIVRGHQERELKLFKLEDVAMGIWIE 589

Query: 436  QFKKNGQEVHYMTDDRFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAVCC 278
            +FK +G+EVHY+TD+RF  +GCE +YILAHYQ P MVLCLWEKLQKEH+A CC
Sbjct: 590  EFKNSGREVHYITDERFYNAGCESNYILAHYQGPRMVLCLWEKLQKEHQAHCC 642


>OMO58551.1 hypothetical protein COLO4_34527 [Corchorus olitorius]
          Length = 634

 Score =  805 bits (2080), Expect = 0.0
 Identities = 404/641 (63%), Positives = 482/641 (75%), Gaps = 11/641 (1%)
 Frame = -3

Query: 2167 MRKWXXXXXXXXXXXXXXIRYSLVTKDSVNKHSAAYDFFNNHPT-----NASITIKSSRP 2003
            M+KW                YSL  +++  +  +AYDFF NHP      N + +I S + 
Sbjct: 1    MKKWYGGVLILFLAITLVFSYSLRVRETRPRKQSAYDFFRNHPAKDSPRNENDSIGSPKL 60

Query: 2002 RLIPNDAKRADVINVYGLSELF--SPTDISKEEAKVLDVWIHMRTILSRSDYLPETTQGI 1829
             LI    K+ ++INV GL +L+  SP +IS+EE+K L +W H+R ILSRSD LPET QGI
Sbjct: 61   PLI----KKPELINVEGLDDLYGPSPGNISEEESKALILWPHIRLILSRSDALPETGQGI 116

Query: 1828 IEASIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDSVLRYGSSLQFPC 1649
             EA+IAWKELL+ + + K ++L  N G        KNC  S+S  + +V   G  LQ PC
Sbjct: 117  KEAAIAWKELLTVIEQDKTTQLSPNIGRLKM----KNCPSSISNLDKTVFNGGDILQLPC 172

Query: 1648 GLVEDSSITVVGFPNGLNRSFQIELIGSKLKDEIQPPVVLHYEVSLPRDELKEDPVITQN 1469
            GLVEDSSITV+G PNG NRSF IEL GS    E +PP++LHY VS+  D + ++P I QN
Sbjct: 173  GLVEDSSITVIGIPNGRNRSFVIELAGSNFSGETKPPIILHYNVSVGGDNMTQEPFIVQN 232

Query: 1468 AWTAGSGWGKDEKCPAHHSISNLEVDGLVLCNEQVV---IEDNINRSQPNGEKLTNVSHG 1298
             WT   GWGK E+CPAH S +N +VDGL LCNEQ+V    EDN N S  +G+  +N   G
Sbjct: 233  TWTNELGWGKAERCPAHLSSNNFKVDGLGLCNEQLVRSTAEDNQNVSLSSGDSSSNAPQG 292

Query: 1297 SSHTSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSFAYREKLEPWLVSDIKVTGGLD 1118
            SSH S NFPFVEGNPF ATLWVG+EGFHMTVNGRHETSFAYREKLEPW VS +K+ GGLD
Sbjct: 293  SSHASANFPFVEGNPFIATLWVGLEGFHMTVNGRHETSFAYREKLEPWSVSRVKLAGGLD 352

Query: 1117 LLSALANGLPVSEDLDLI-DIERLKAPLLPTKRIIMLIGVFSIGNNFERRMALRRSWMQY 941
            LLSA A  LPV ED DLI + + LKAP++  KR++ML+GVFS GNNFERRMALRRSWMQ+
Sbjct: 353  LLSAFAKALPVPEDHDLIANSKLLKAPVISRKRLLMLVGVFSTGNNFERRMALRRSWMQF 412

Query: 940  EVVLSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLMPFVDYYDLITLKTIAICLMGT 761
            E V SG VAVRFF+GL+KN  VN ELW+EAQ YGDIQ MPFVDYY LI+LKTIAIC+MGT
Sbjct: 413  EAVRSGDVAVRFFIGLNKNREVNFELWKEAQAYGDIQFMPFVDYYSLISLKTIAICIMGT 472

Query: 760  KILPAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLISFDSEPIRDRDSKWYISDEEW 581
            KILPAKY+MKTDDDAFVRIDE+++ LK K S+ LLYGLISFDS P RD+DSKWYIS+EEW
Sbjct: 473  KILPAKYIMKTDDDAFVRIDEVISSLKGKASDSLLYGLISFDSSPHRDKDSKWYISEEEW 532

Query: 580  PHSLYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLEDVAMGIWIDQFKKNGQEVHYM 401
            PHS YPPWAHGPGY++SRDIAKFIV+GHQER LKLFKLEDVAMGIWI++FKK G++VHYM
Sbjct: 533  PHSAYPPWAHGPGYIVSRDIAKFIVRGHQERELKLFKLEDVAMGIWIEEFKKGGKQVHYM 592

Query: 400  TDDRFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAVCC 278
            +D+RF   GCE +YILAHYQ P M+LCLWEKLQKEH+  CC
Sbjct: 593  SDERFYNVGCESNYILAHYQGPRMLLCLWEKLQKEHQPNCC 633


>XP_009372069.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT3 [Pyrus x
            bretschneideri]
          Length = 645

 Score =  806 bits (2081), Expect = 0.0
 Identities = 400/639 (62%), Positives = 489/639 (76%), Gaps = 9/639 (1%)
 Frame = -3

Query: 2167 MRKWXXXXXXXXXXXXXXIRYSLVTK----DSVNKHSAAYDFFNNHPT-NASITIKSSRP 2003
            M+KW               RYS + K        K SAA +FF NHPT N S+ + S + 
Sbjct: 15   MKKWSGGLLIVLLAMILVFRYSSIVKIEPPTQAPKQSAA-EFFGNHPTTNDSVVVDSEKK 73

Query: 2002 RLIPNDAKRADVINVYGLSELFSPTDISKEEAKVLDVWIHMRTILSRSDYLPETTQGIIE 1823
               P   K+   + V GL +LF+  DI KE ++ L VW HMRT+LSRSD LPET +G+ E
Sbjct: 74   GEKPY--KKHHFVEVDGLDDLFASHDIFKEGSRALLVWPHMRTLLSRSDALPETAKGVKE 131

Query: 1822 ASIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDSVLRYGSSLQFPCGL 1643
            AS+AWK+LLS + + +AS+L     + S  ++ KNC FSVS  +    RY + L  PCGL
Sbjct: 132  ASVAWKDLLSAIDKDRASKL-----NKSDNDEVKNCPFSVSTFDKIESRYENILDIPCGL 186

Query: 1642 VEDSSITVVGFPNGLNRSFQIELIGSKLKDEIQPPVVLHYEVSLPRDELKEDPVITQNAW 1463
            ++DSSI++VG PNG +RSFQI+L+GS+L  E +PP++LHY VSLP D + ++P + QN W
Sbjct: 187  IDDSSISLVGIPNGHSRSFQIQLLGSQLLGESEPPIILHYNVSLPGDNMTQEPFVVQNTW 246

Query: 1462 TAGSGWGKDEKCPAHHSISNLEVDGLVLCNEQVV---IEDNINRSQPNGEKLTNVSHGSS 1292
            T   GWGK+E+CP+H S SNL+VDGLVLCNEQ V    E+N+N SQP+G+ LTNVS G +
Sbjct: 247  THELGWGKEERCPSHWSPSNLKVDGLVLCNEQAVRSSSEENLNVSQPSGDMLTNVS-GGA 305

Query: 1291 HTSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSFAYREKLEPWLVSDIKVTGGLDLL 1112
            +   NFPFVEGNPFTAT WVG+EGFH+TVNGRHETSFAYREKLEPW VS ++V GGLDLL
Sbjct: 306  YEGSNFPFVEGNPFTATFWVGLEGFHLTVNGRHETSFAYREKLEPWSVSKVRVAGGLDLL 365

Query: 1111 SALANGLPVSEDLDL-IDIERLKAPLLPTKRIIMLIGVFSIGNNFERRMALRRSWMQYEV 935
            SALA GLPVSED DL +D+E L+AP    KR++ML+GVFS GNNFERRMALRR+WMQY+ 
Sbjct: 366  SALAKGLPVSEDHDLAVDVEHLRAPPTSKKRLLMLVGVFSTGNNFERRMALRRAWMQYKA 425

Query: 934  VLSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLMPFVDYYDLITLKTIAICLMGTKI 755
            V SG VAVRFF+GLHKNS VN ELWREA+ YGDIQLMPFVDYY LI+LKTIAI + GTKI
Sbjct: 426  VRSGDVAVRFFIGLHKNSQVNIELWREAEAYGDIQLMPFVDYYSLISLKTIAISIFGTKI 485

Query: 754  LPAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLISFDSEPIRDRDSKWYISDEEWPH 575
             PAKY+MKTDDDAFVRIDE+++ LK KP+ GLLYG ISF+S P RD+ SKW+I + EWP+
Sbjct: 486  HPAKYIMKTDDDAFVRIDEVISSLKGKPTKGLLYGRISFESSPDRDKGSKWFIDNREWPY 545

Query: 574  SLYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLEDVAMGIWIDQFKKNGQEVHYMTD 395
            ++YPPWAHGPGY++SRDIAKFIV+ HQE  LKLFKLEDVAMGIWI QFK  GQEV+Y+TD
Sbjct: 546  AMYPPWAHGPGYIISRDIAKFIVRSHQEGDLKLFKLEDVAMGIWIQQFKYRGQEVNYVTD 605

Query: 394  DRFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAVCC 278
            DRF  +GCE +YILAHYQ+P +VLCLWEKLQKEHEA+CC
Sbjct: 606  DRFYNAGCEANYILAHYQSPRLVLCLWEKLQKEHEAICC 644


>OMO53423.1 hypothetical protein CCACVL1_28661 [Corchorus capsularis]
          Length = 632

 Score =  801 bits (2068), Expect = 0.0
 Identities = 401/640 (62%), Positives = 480/640 (75%), Gaps = 10/640 (1%)
 Frame = -3

Query: 2167 MRKWXXXXXXXXXXXXXXIRYSLVTKDSVNKHSAAYDFFNNHPTNASITIKSSRPRL--- 1997
            M+KW                YSL  +++  +  +AYDFF NHP   S + ++    +   
Sbjct: 1    MKKWYGGVLILLLAITLVFSYSLRVRETRPRKQSAYDFFRNHPAKDSPSPRNENDSIGSP 60

Query: 1996 -IPNDAKRADVINVYGLSELF--SPTDISKEEAKVLDVWIHMRTILSRSDYLPETTQGII 1826
             +P   K+ ++INV  L +L+  SP +IS+EE+K L +W HMR ILSRSD LP+T QGI 
Sbjct: 61   KLPL-TKKPELINVERLDDLYGPSPRNISEEESKALILWPHMRLILSRSDALPQTGQGIK 119

Query: 1825 EASIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDSVLRYGSSLQFPCG 1646
            EA+IAWKELL+ + + K ++L  N          KNC  S+S  + +V   G  LQ PCG
Sbjct: 120  EAAIAWKELLAVIEQDKTTQLSPNIRLKM-----KNCPSSISNLDKTVFNGGDILQLPCG 174

Query: 1645 LVEDSSITVVGFPNGLNRSFQIELIGSKLKDEIQPPVVLHYEVSLPRDELKEDPVITQNA 1466
            LVEDSSITV+G PNG NRSF IEL GS    E +PP++LHY VS+  D + ++P I QN 
Sbjct: 175  LVEDSSITVIGIPNGRNRSFDIELAGSNFSGETKPPIILHYNVSVAGDNMTQEPFIVQNT 234

Query: 1465 WTAGSGWGKDEKCPAHHSISNLEVDGLVLCNEQVV---IEDNINRSQPNGEKLTNVSHGS 1295
            WT   GWGK E+CPAH S SN +VDGL LCNEQ+V    EDN N S   G+  +N   GS
Sbjct: 235  WTNELGWGKAERCPAHLSSSNFKVDGLGLCNEQLVRSTAEDNQNVS---GDSSSNAPQGS 291

Query: 1294 SHTSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSFAYREKLEPWLVSDIKVTGGLDL 1115
            SH S NFPFVEGNPF ATLWVG+EGFHMTVNGRHETSFAYREKLEPW VS +K+ GGLDL
Sbjct: 292  SHASANFPFVEGNPFIATLWVGLEGFHMTVNGRHETSFAYREKLEPWSVSRVKLAGGLDL 351

Query: 1114 LSALANGLPVSEDLDLI-DIERLKAPLLPTKRIIMLIGVFSIGNNFERRMALRRSWMQYE 938
            LSA A GLPV ED DL+ + + LKAP++  KR++ML+GVFS GNNFERRMALRRSWMQ+E
Sbjct: 352  LSAFAKGLPVPEDHDLVVNSKLLKAPVISRKRLLMLVGVFSTGNNFERRMALRRSWMQFE 411

Query: 937  VVLSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLMPFVDYYDLITLKTIAICLMGTK 758
             V SG VAVRFF+GL+KN  VN ELW+EAQ YGDIQ MPFVDYY LI+LKTIAIC+MGTK
Sbjct: 412  AVRSGDVAVRFFIGLNKNREVNFELWKEAQAYGDIQFMPFVDYYSLISLKTIAICIMGTK 471

Query: 757  ILPAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLISFDSEPIRDRDSKWYISDEEWP 578
            ILPAKY+MKTDDDAFVRIDE+++ LK K S+GLLYGLISFDS P RD+DSKWYIS+EEWP
Sbjct: 472  ILPAKYIMKTDDDAFVRIDEVISSLKGKASDGLLYGLISFDSSPHRDKDSKWYISEEEWP 531

Query: 577  HSLYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLEDVAMGIWIDQFKKNGQEVHYMT 398
            HS YPPWAHGPGY++SRDIAKFIV+GHQER LKLFKLEDVAMGIWI++FKK G++VHYM+
Sbjct: 532  HSAYPPWAHGPGYIVSRDIAKFIVRGHQERELKLFKLEDVAMGIWIEEFKKGGKQVHYMS 591

Query: 397  DDRFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAVCC 278
            D+RF   GCE +YILAHYQ P M+LCLWEKLQKEH+  CC
Sbjct: 592  DERFYNVGCESNYILAHYQGPRMLLCLWEKLQKEHQPNCC 631


>KJB28790.1 hypothetical protein B456_005G069600 [Gossypium raimondii]
          Length = 636

 Score =  800 bits (2066), Expect = 0.0
 Identities = 401/646 (62%), Positives = 483/646 (74%), Gaps = 16/646 (2%)
 Frame = -3

Query: 2167 MRKWXXXXXXXXXXXXXXIRYSL--VTKDSVNKHSAAYDFFNNHPT-------NASIT-- 2021
            M+KW                YSL    +    K  +AYDFFNNHP        N S    
Sbjct: 13   MKKWYGGVLILVLAIVMVFSYSLRETQRPQPKKKQSAYDFFNNHPPIDSHRKGNDSFKLP 72

Query: 2020 -IKSSRPRLIPNDAKRADVINVYGLSELFSPTDISKEEAKVLDVWIHMRTILSRSDYLPE 1844
             +++ +P LI    ++  +INV GL EL++P ++S++E+ VL +W H+  +LSRSD LPE
Sbjct: 73   KVEAKKPSLI----QKPKLINVEGLDELYAPRNVSEQESNVLLLWPHLHLLLSRSDALPE 128

Query: 1843 TTQGIIEASIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDSVLRYGSS 1664
            T QGI EA+IAWKELL+ + E+K ++L  N          KNC FSVS  ++++   G+ 
Sbjct: 129  TGQGIKEAAIAWKELLALIEEEKTTKLSNNIRLKE-----KNCPFSVSSPDNALFSGGNI 183

Query: 1663 LQFPCGLVEDSSITVVGFPNGLNRSFQIELIGSKLKDEIQPPVVLHYEVSLPRDELKEDP 1484
            L+ PCGLVEDSSIT++G PNG  RSF+I+L+GS   +E +PP+VLHY VS+  D + E+P
Sbjct: 184  LELPCGLVEDSSITLIGTPNGSYRSFEIDLVGSNFSEEPKPPIVLHYNVSVAGDNMTEEP 243

Query: 1483 VITQNAWTAGSGWGKDEKCPAHHSISNLEVDGLVLCNEQVV---IEDNINRSQPNGEKLT 1313
             I QN WT   GWGK+EKCP+H S +NL+VDGL LCNEQ+V   +E+N N S  +G+  T
Sbjct: 244  FIAQNTWTNELGWGKEEKCPSHVSSNNLKVDGLGLCNEQLVRSTMEENQNVSVSSGDAST 303

Query: 1312 NVSHGSSHTSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSFAYREKLEPWLVSDIKV 1133
            N S  SSH S NFPFVEGNPFTATLWVG+EGFHMTVNGRHETSFAYREKLEPW VS +KV
Sbjct: 304  NASQESSHASANFPFVEGNPFTATLWVGLEGFHMTVNGRHETSFAYREKLEPWSVSGVKV 363

Query: 1132 TGGLDLLSALANGLPVSEDLDLIDIER-LKAPLLPTKRIIMLIGVFSIGNNFERRMALRR 956
             GGLDLLSA A GLPV ED DLID  + LKAP++  KR++ML+GVFS GNNFERRMALRR
Sbjct: 364  VGGLDLLSAFAKGLPVPEDHDLIDNSKILKAPVITRKRLVMLVGVFSTGNNFERRMALRR 423

Query: 955  SWMQYEVVLSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLMPFVDYYDLITLKTIAI 776
            SWMQ+E               +KN  VN ELW+EAQ YGDIQ MPFVDYY LI+LKTIAI
Sbjct: 424  SWMQFEA--------------NKNLQVNFELWKEAQAYGDIQFMPFVDYYSLISLKTIAI 469

Query: 775  CLMGTKILPAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLISFDSEPIRDRDSKWYI 596
            C+MGTKILPAKY+MKTDDDAFVRIDE+L+ LKEKPSNGLLYGLI FDS P R++DSKWYI
Sbjct: 470  CIMGTKILPAKYIMKTDDDAFVRIDEVLSSLKEKPSNGLLYGLIEFDSSPHREKDSKWYI 529

Query: 595  SDEEWPHSLYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLEDVAMGIWIDQFKKNGQ 416
            SDEEWPHS YPPWAHGPGY+LSRD+AKFIVQGH+ER LKLFKLEDVAMGIWI++FK++G+
Sbjct: 530  SDEEWPHSSYPPWAHGPGYILSRDVAKFIVQGHKERELKLFKLEDVAMGIWIEEFKRSGR 589

Query: 415  EVHYMTDDRFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAVCC 278
            EVHY+TDDRF  +GCE +YILAHYQ P MVLCLWEKLQKEH+A CC
Sbjct: 590  EVHYITDDRFYNAGCESNYILAHYQGPRMVLCLWEKLQKEHQAYCC 635


>XP_008360226.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT3-like [Malus
            domestica]
          Length = 645

 Score =  796 bits (2056), Expect = 0.0
 Identities = 398/639 (62%), Positives = 486/639 (76%), Gaps = 9/639 (1%)
 Frame = -3

Query: 2167 MRKWXXXXXXXXXXXXXXIRYSLVTK----DSVNKHSAAYDFFNNHPT-NASITIKSSRP 2003
            M+KW               RYS + K        K SAA  FF NHPT N S+ + S + 
Sbjct: 15   MKKWSGGLLIVVLAMILVFRYSSIVKIEPPTQAPKQSAAX-FFGNHPTTNDSVIVDSEKK 73

Query: 2002 RLIPNDAKRADVINVYGLSELFSPTDISKEEAKVLDVWIHMRTILSRSDYLPETTQGIIE 1823
               P   K++  + V GL +LF+  DI KE ++ L VW HMRT+LSRSD LPET +G+ E
Sbjct: 74   GEKPY--KKSHFVEVDGLDDLFASHDIFKEGSRALLVWPHMRTLLSRSDALPETAKGVKE 131

Query: 1822 ASIAWKELLSTLGEQKASRLMVNSGSSSHVEDNKNCSFSVSMENDSVLRYGSSLQFPCGL 1643
            AS+AWK+LLS + + KAS+L     + S  E++KNC FSVS  +    RY + L  PCGL
Sbjct: 132  ASVAWKDLLSAIDKDKASKL-----NKSDNEEDKNCPFSVSTFDKIESRYENILDIPCGL 186

Query: 1642 VEDSSITVVGFPNGLNRSFQIELIGSKLKDEIQPPVVLHYEVSLPRDELKEDPVITQNAW 1463
            ++DSSI++VG PNG +RSFQI+L+GS+L  E +PP+VLHY VSLP D + + P + QN W
Sbjct: 187  IDDSSISLVGIPNGHSRSFQIQLLGSQLLGESEPPIVLHYNVSLPGDNMTQXPFVVQNTW 246

Query: 1462 TAGSGWGKDEKCPAHHSISNLEVDGLVLCNEQVV---IEDNINRSQPNGEKLTNVSHGSS 1292
            T   GWGK+E+CP+H S SNL+VDGLVLCNEQ V    E+++N S+P+ + LTNVS G +
Sbjct: 247  THELGWGKEERCPSHRSPSNLKVDGLVLCNEQAVRSSSEESLNVSRPSRDMLTNVS-GGA 305

Query: 1291 HTSYNFPFVEGNPFTATLWVGIEGFHMTVNGRHETSFAYREKLEPWLVSDIKVTGGLDLL 1112
            +   NFPFVEGNPFTAT WVG+EGFH+TVNGRHETSFAYREKLEPW VS ++V GGLDLL
Sbjct: 306  YEGSNFPFVEGNPFTATFWVGLEGFHLTVNGRHETSFAYREKLEPWSVSKVRVAGGLDLL 365

Query: 1111 SALANGLPVSEDLDLI-DIERLKAPLLPTKRIIMLIGVFSIGNNFERRMALRRSWMQYEV 935
            SALA GLPVSED DL+ D+E L+AP    +R++ML+GVFS GNNFERRMALRR+WMQY+ 
Sbjct: 366  SALAKGLPVSEDHDLVVDVEHLRAPPTSKRRLLMLVGVFSTGNNFERRMALRRAWMQYKA 425

Query: 934  VLSGQVAVRFFVGLHKNSIVNAELWREAQTYGDIQLMPFVDYYDLITLKTIAICLMGTKI 755
            V SG VAVRFF+GLHKNS VN ELWREA+ YGDIQLMPFVDYY LI+LKTIAI + GTKI
Sbjct: 426  VRSGDVAVRFFIGLHKNSQVNIELWREAEAYGDIQLMPFVDYYSLISLKTIAISIFGTKI 485

Query: 754  LPAKYMMKTDDDAFVRIDEILAGLKEKPSNGLLYGLISFDSEPIRDRDSKWYISDEEWPH 575
             PAKY+MKTDDDAFVRIDE+++ LK KP+ GLLYGLISF+S P RD+ SKW+I + EWP+
Sbjct: 486  HPAKYIMKTDDDAFVRIDEVISSLKGKPTKGLLYGLISFESSPDRDKGSKWFIDNREWPY 545

Query: 574  SLYPPWAHGPGYVLSRDIAKFIVQGHQERYLKLFKLEDVAMGIWIDQFKKNGQEVHYMTD 395
            ++YPPWAHGPGY++SRDIAKFIV+ HQE  LKLFKLEDVAMGIWI Q K  GQEV+Y+TD
Sbjct: 546  AMYPPWAHGPGYIISRDIAKFIVRSHQEGDLKLFKLEDVAMGIWIQQXKYRGQEVNYVTD 605

Query: 394  DRFKISGCEESYILAHYQNPSMVLCLWEKLQKEHEAVCC 278
            DRF  +GCE +YILAHYQ+P +VLCLWEKLQKE EA CC
Sbjct: 606  DRFYNAGCEANYILAHYQSPRLVLCLWEKLQKEXEAXCC 644

