BLASTX nr result

ID: Catharanthus22_contig00017128 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00017128
         (937 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006356301.1| PREDICTED: probable glucuronoxylan glucurono...   446   e-123
ref|XP_004237727.1| PREDICTED: probable beta-1,4-xylosyltransfer...   442   e-121
ref|XP_006434366.1| hypothetical protein CICLE_v10001104mg [Citr...   426   e-117
ref|XP_006434365.1| hypothetical protein CICLE_v10001104mg [Citr...   426   e-117
ref|XP_006434364.1| hypothetical protein CICLE_v10001104mg [Citr...   426   e-117
ref|XP_006434363.1| hypothetical protein CICLE_v10001104mg [Citr...   426   e-117
ref|XP_006472912.1| PREDICTED: probable glycosyltransferase At5g...   422   e-115
ref|XP_004237728.1| PREDICTED: probable beta-1,4-xylosyltransfer...   419   e-115
ref|XP_002300942.1| exostosin family protein [Populus trichocarp...   418   e-114
ref|XP_002307477.2| exostosin family protein [Populus trichocarp...   410   e-112
ref|XP_002265632.1| PREDICTED: probable glucuronosyltransferase ...   409   e-112
ref|XP_006416298.1| hypothetical protein EUTSA_v10007584mg [Eutr...   409   e-111
gb|AAF87908.1|AC015447_18 Hypothetical protein [Arabidopsis thal...   409   e-111
ref|NP_973879.1| Exostosin family protein [Arabidopsis thaliana]...   409   e-111
ref|NP_564141.1| Exostosin family protein [Arabidopsis thaliana]...   409   e-111
ref|XP_002893165.1| exostosin family protein [Arabidopsis lyrata...   407   e-111
gb|AAL75891.1| At1g21480/F24J8_23 [Arabidopsis thaliana]              407   e-111
ref|XP_004148819.1| PREDICTED: probable glucuronoxylan glucurono...   405   e-111
gb|EOY16596.1| Exostosin family protein isoform 2 [Theobroma cacao]   404   e-110
gb|EOY16595.1| Exostosin family protein isoform 1 [Theobroma cacao]   404   e-110

>ref|XP_006356301.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
           [Solanum tuberosum]
          Length = 462

 Score =  446 bits (1147), Expect = e-123
 Identities = 216/280 (77%), Positives = 245/280 (87%), Gaps = 2/280 (0%)
 Frame = +2

Query: 104 TAAPCTRTHQIGALALVIVTFFITRVFHQXXXXXXXXX-NDARYPSVGTENDAV-VQYSG 277
           T  PCTRTHQIGALALVI+TFF+TR+F Q             +Y S   +ND V    SG
Sbjct: 17  TVTPCTRTHQIGALALVIITFFVTRIFDQSLNSSSFSTPTSGQYRS---KNDVVRFSDSG 73

Query: 278 GLFFWPQRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISSEACIKGQWGTQVKIHKLL 457
           G  FWPQRG+GTHLSLKIYVY+ENEI+GLK LL+GRDGKIS ++C+KGQWGTQVKIH++L
Sbjct: 74  GSIFWPQRGYGTHLSLKIYVYDENEIDGLKHLLYGRDGKISPDSCVKGQWGTQVKIHRML 133

Query: 458 LQSRFRTKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVKVLSQMPYYRLSGGRNHIF 637
           LQSRFRT++K+EAD FFVP Y KCVR+MGGLNDKEINQTYVKVLSQMPY+RLSGGRNHIF
Sbjct: 134 LQSRFRTRKKEEADLFFVPAYPKCVRVMGGLNDKEINQTYVKVLSQMPYFRLSGGRNHIF 193

Query: 638 VFPSGAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDDGMTTRGP 817
           VFPSGAGAHLFKSW TYLNRSIILTPEGDRTDKRDTSAFNTWKDIIIPGN+DDGMTT G 
Sbjct: 194 VFPSGAGAHLFKSWVTYLNRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNIDDGMTTHGS 253

Query: 818 RLVEPLPLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
           R+VEPLPLSKRK+LAN+LGRAQGKVGRL+L++L+KQ+P+K
Sbjct: 254 RIVEPLPLSKRKHLANYLGRAQGKVGRLRLIELSKQYPDK 293


>ref|XP_004237727.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10-like isoform
           1 [Solanum lycopersicum]
          Length = 462

 Score =  442 bits (1136), Expect = e-121
 Identities = 217/293 (74%), Positives = 250/293 (85%), Gaps = 5/293 (1%)
 Frame = +2

Query: 74  SGKSSRYPGA-TAAPCTRTHQIGALALVIVTFFITRVFHQXXXXXXXXXNDARYPSVG-- 244
           S K+  +P   T  PCTRTHQIGALALVI+TFF+TR+F Q         +    P+ G  
Sbjct: 6   SSKTRPFPSHHTVTPCTRTHQIGALALVIITFFLTRIFDQSLNS-----SSFSIPTSGQY 60

Query: 245 -TENDAV-VQYSGGLFFWPQRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISSEACIK 418
            ++ND +    SGG  FWPQRG+GTHLSLKIYVY+ENEI+GLK LL+GRD KIS ++C+K
Sbjct: 61  RSKNDVIRFSDSGGSIFWPQRGYGTHLSLKIYVYDENEIDGLKHLLYGRDRKISPDSCVK 120

Query: 419 GQWGTQVKIHKLLLQSRFRTKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVKVLSQM 598
           GQWGTQVKIH++LLQSRFRT++K+EAD FFVP Y KCVR+MGGLNDKEINQTYVKVLSQM
Sbjct: 121 GQWGTQVKIHRMLLQSRFRTRKKEEADLFFVPAYPKCVRVMGGLNDKEINQTYVKVLSQM 180

Query: 599 PYYRLSGGRNHIFVFPSGAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTWKDIII 778
           PY+RLSGGRNHIFVFPSGAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTWKDIII
Sbjct: 181 PYFRLSGGRNHIFVFPSGAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTWKDIII 240

Query: 779 PGNVDDGMTTRGPRLVEPLPLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
           PGN+DDGMTT G R+VE LPLSKRK+LAN+LGRAQGKVGRL+L++L+KQ+P K
Sbjct: 241 PGNIDDGMTTHGSRIVESLPLSKRKHLANYLGRAQGKVGRLRLIELSKQYPEK 293


>ref|XP_006434366.1| hypothetical protein CICLE_v10001104mg [Citrus clementina]
           gi|557536488|gb|ESR47606.1| hypothetical protein
           CICLE_v10001104mg [Citrus clementina]
          Length = 310

 Score =  426 bits (1094), Expect = e-117
 Identities = 205/287 (71%), Positives = 235/287 (81%)
 Frame = +2

Query: 77  GKSSRYPGATAAPCTRTHQIGALALVIVTFFITRVFHQXXXXXXXXXNDARYPSVGTEND 256
           G S         PCTRTHQIGAL LV  TFF+TR+F Q         +      +   N 
Sbjct: 2   GSSHNNKSRIFTPCTRTHQIGALLLVTTTFFLTRLFDQSFSPCHS--SPVNQDGISHRNH 59

Query: 257 AVVQYSGGLFFWPQRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISSEACIKGQWGTQ 436
             +   G    WP+RG+G+HLSLKIYVY+ENE++GLKLLL+GRDG IS+++C+KGQWGTQ
Sbjct: 60  VHISDGGRSISWPERGYGSHLSLKIYVYDENELDGLKLLLYGRDGAISADSCVKGQWGTQ 119

Query: 437 VKIHKLLLQSRFRTKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVKVLSQMPYYRLS 616
           VKIH+LLLQSRFRTK+K+EAD FFVP Y KCVRMMGGLNDKEINQTYVKVL QMPY+R S
Sbjct: 120 VKIHRLLLQSRFRTKKKEEADLFFVPAYAKCVRMMGGLNDKEINQTYVKVLRQMPYFRRS 179

Query: 617 GGRNHIFVFPSGAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDD 796
           GGR+HIFVFPSGAGAHLFKSWAT++NRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDD
Sbjct: 180 GGRDHIFVFPSGAGAHLFKSWATFINRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDD 239

Query: 797 GMTTRGPRLVEPLPLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
           GMT RG  LV+PLPLSKRKYLAN+LGRAQGKVGRLQL++LA Q+P++
Sbjct: 240 GMTKRGLTLVQPLPLSKRKYLANYLGRAQGKVGRLQLIELANQYPDQ 286


>ref|XP_006434365.1| hypothetical protein CICLE_v10001104mg [Citrus clementina]
           gi|557536487|gb|ESR47605.1| hypothetical protein
           CICLE_v10001104mg [Citrus clementina]
          Length = 351

 Score =  426 bits (1094), Expect = e-117
 Identities = 205/287 (71%), Positives = 235/287 (81%)
 Frame = +2

Query: 77  GKSSRYPGATAAPCTRTHQIGALALVIVTFFITRVFHQXXXXXXXXXNDARYPSVGTEND 256
           G S         PCTRTHQIGAL LV  TFF+TR+F Q         +      +   N 
Sbjct: 2   GSSHNNKSRIFTPCTRTHQIGALLLVTTTFFLTRLFDQSFSPCHS--SPVNQDGISHRNH 59

Query: 257 AVVQYSGGLFFWPQRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISSEACIKGQWGTQ 436
             +   G    WP+RG+G+HLSLKIYVY+ENE++GLKLLL+GRDG IS+++C+KGQWGTQ
Sbjct: 60  VHISDGGRSISWPERGYGSHLSLKIYVYDENELDGLKLLLYGRDGAISADSCVKGQWGTQ 119

Query: 437 VKIHKLLLQSRFRTKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVKVLSQMPYYRLS 616
           VKIH+LLLQSRFRTK+K+EAD FFVP Y KCVRMMGGLNDKEINQTYVKVL QMPY+R S
Sbjct: 120 VKIHRLLLQSRFRTKKKEEADLFFVPAYAKCVRMMGGLNDKEINQTYVKVLRQMPYFRRS 179

Query: 617 GGRNHIFVFPSGAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDD 796
           GGR+HIFVFPSGAGAHLFKSWAT++NRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDD
Sbjct: 180 GGRDHIFVFPSGAGAHLFKSWATFINRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDD 239

Query: 797 GMTTRGPRLVEPLPLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
           GMT RG  LV+PLPLSKRKYLAN+LGRAQGKVGRLQL++LA Q+P++
Sbjct: 240 GMTKRGLTLVQPLPLSKRKYLANYLGRAQGKVGRLQLIELANQYPDQ 286


>ref|XP_006434364.1| hypothetical protein CICLE_v10001104mg [Citrus clementina]
           gi|557536486|gb|ESR47604.1| hypothetical protein
           CICLE_v10001104mg [Citrus clementina]
          Length = 287

 Score =  426 bits (1094), Expect = e-117
 Identities = 205/287 (71%), Positives = 235/287 (81%)
 Frame = +2

Query: 77  GKSSRYPGATAAPCTRTHQIGALALVIVTFFITRVFHQXXXXXXXXXNDARYPSVGTEND 256
           G S         PCTRTHQIGAL LV  TFF+TR+F Q         +      +   N 
Sbjct: 2   GSSHNNKSRIFTPCTRTHQIGALLLVTTTFFLTRLFDQSFSPCHS--SPVNQDGISHRNH 59

Query: 257 AVVQYSGGLFFWPQRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISSEACIKGQWGTQ 436
             +   G    WP+RG+G+HLSLKIYVY+ENE++GLKLLL+GRDG IS+++C+KGQWGTQ
Sbjct: 60  VHISDGGRSISWPERGYGSHLSLKIYVYDENELDGLKLLLYGRDGAISADSCVKGQWGTQ 119

Query: 437 VKIHKLLLQSRFRTKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVKVLSQMPYYRLS 616
           VKIH+LLLQSRFRTK+K+EAD FFVP Y KCVRMMGGLNDKEINQTYVKVL QMPY+R S
Sbjct: 120 VKIHRLLLQSRFRTKKKEEADLFFVPAYAKCVRMMGGLNDKEINQTYVKVLRQMPYFRRS 179

Query: 617 GGRNHIFVFPSGAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDD 796
           GGR+HIFVFPSGAGAHLFKSWAT++NRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDD
Sbjct: 180 GGRDHIFVFPSGAGAHLFKSWATFINRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDD 239

Query: 797 GMTTRGPRLVEPLPLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
           GMT RG  LV+PLPLSKRKYLAN+LGRAQGKVGRLQL++LA Q+P++
Sbjct: 240 GMTKRGLTLVQPLPLSKRKYLANYLGRAQGKVGRLQLIELANQYPDQ 286


>ref|XP_006434363.1| hypothetical protein CICLE_v10001104mg [Citrus clementina]
           gi|557536485|gb|ESR47603.1| hypothetical protein
           CICLE_v10001104mg [Citrus clementina]
          Length = 455

 Score =  426 bits (1094), Expect = e-117
 Identities = 205/287 (71%), Positives = 235/287 (81%)
 Frame = +2

Query: 77  GKSSRYPGATAAPCTRTHQIGALALVIVTFFITRVFHQXXXXXXXXXNDARYPSVGTEND 256
           G S         PCTRTHQIGAL LV  TFF+TR+F Q         +      +   N 
Sbjct: 2   GSSHNNKSRIFTPCTRTHQIGALLLVTTTFFLTRLFDQSFSPCHS--SPVNQDGISHRNH 59

Query: 257 AVVQYSGGLFFWPQRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISSEACIKGQWGTQ 436
             +   G    WP+RG+G+HLSLKIYVY+ENE++GLKLLL+GRDG IS+++C+KGQWGTQ
Sbjct: 60  VHISDGGRSISWPERGYGSHLSLKIYVYDENELDGLKLLLYGRDGAISADSCVKGQWGTQ 119

Query: 437 VKIHKLLLQSRFRTKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVKVLSQMPYYRLS 616
           VKIH+LLLQSRFRTK+K+EAD FFVP Y KCVRMMGGLNDKEINQTYVKVL QMPY+R S
Sbjct: 120 VKIHRLLLQSRFRTKKKEEADLFFVPAYAKCVRMMGGLNDKEINQTYVKVLRQMPYFRRS 179

Query: 617 GGRNHIFVFPSGAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDD 796
           GGR+HIFVFPSGAGAHLFKSWAT++NRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDD
Sbjct: 180 GGRDHIFVFPSGAGAHLFKSWATFINRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDD 239

Query: 797 GMTTRGPRLVEPLPLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
           GMT RG  LV+PLPLSKRKYLAN+LGRAQGKVGRLQL++LA Q+P++
Sbjct: 240 GMTKRGLTLVQPLPLSKRKYLANYLGRAQGKVGRLQLIELANQYPDQ 286


>ref|XP_006472912.1| PREDICTED: probable glycosyltransferase At5g25310-like isoform X1
           [Citrus sinensis]
          Length = 455

 Score =  422 bits (1084), Expect = e-115
 Identities = 203/287 (70%), Positives = 232/287 (80%)
 Frame = +2

Query: 77  GKSSRYPGATAAPCTRTHQIGALALVIVTFFITRVFHQXXXXXXXXXNDARYPSVGTEND 256
           G S         PCTRTHQIGAL L+  TFF+TR+F Q         +      +   N 
Sbjct: 2   GSSHNNKSRIFTPCTRTHQIGALLLITTTFFLTRLFDQSFSPCHS--SPVNQDGISHRNH 59

Query: 257 AVVQYSGGLFFWPQRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISSEACIKGQWGTQ 436
             +   G    WP+RG+G+HLSLKIYVY+ENEI+GLKLLL+GRDG IS+++C+KGQWGTQ
Sbjct: 60  VHISDGGRSISWPERGYGSHLSLKIYVYDENEIDGLKLLLYGRDGAISADSCVKGQWGTQ 119

Query: 437 VKIHKLLLQSRFRTKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVKVLSQMPYYRLS 616
           VKIH+LLLQSRFRTK+K+EAD FFVP Y KCVRMMGGLNDKEINQTYVKV   MPY+R S
Sbjct: 120 VKIHRLLLQSRFRTKKKEEADLFFVPAYAKCVRMMGGLNDKEINQTYVKVYPPMPYFRRS 179

Query: 617 GGRNHIFVFPSGAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDD 796
           GGR+HIFVFPSG GAHLFKSWAT++NRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDD
Sbjct: 180 GGRDHIFVFPSGGGAHLFKSWATFINRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDD 239

Query: 797 GMTTRGPRLVEPLPLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
           GMT RG  LV+PLPLSKRKYLAN+LGRAQGKVGRLQL++LA Q+P+K
Sbjct: 240 GMTKRGLTLVQPLPLSKRKYLANYLGRAQGKVGRLQLIELANQYPDK 286


>ref|XP_004237728.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10-like isoform
           2 [Solanum lycopersicum]
          Length = 453

 Score =  419 bits (1078), Expect = e-115
 Identities = 210/293 (71%), Positives = 241/293 (82%), Gaps = 5/293 (1%)
 Frame = +2

Query: 74  SGKSSRYPGA-TAAPCTRTHQIGALALVIVTFFITRVFHQXXXXXXXXXNDARYPSVG-- 244
           S K+  +P   T  PCTRTHQIGALALVI+TFF+TR+F Q         +    P+ G  
Sbjct: 6   SSKTRPFPSHHTVTPCTRTHQIGALALVIITFFLTRIFDQSLNS-----SSFSIPTSGQY 60

Query: 245 -TENDAV-VQYSGGLFFWPQRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISSEACIK 418
            ++ND +    SGG  FWPQRG+GTHLSLKIYVY+ENEI+GLK LL+GRD KIS ++C+K
Sbjct: 61  RSKNDVIRFSDSGGSIFWPQRGYGTHLSLKIYVYDENEIDGLKHLLYGRDRKISPDSCVK 120

Query: 419 GQWGTQVKIHKLLLQSRFRTKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVKVLSQM 598
           GQWGTQ         SRFRT++K+EAD FFVP Y KCVR+MGGLNDKEINQTYVKVLSQM
Sbjct: 121 GQWGTQ---------SRFRTRKKEEADLFFVPAYPKCVRVMGGLNDKEINQTYVKVLSQM 171

Query: 599 PYYRLSGGRNHIFVFPSGAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTWKDIII 778
           PY+RLSGGRNHIFVFPSGAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTWKDIII
Sbjct: 172 PYFRLSGGRNHIFVFPSGAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTWKDIII 231

Query: 779 PGNVDDGMTTRGPRLVEPLPLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
           PGN+DDGMTT G R+VE LPLSKRK+LAN+LGRAQGKVGRL+L++L+KQ+P K
Sbjct: 232 PGNIDDGMTTHGSRIVESLPLSKRKHLANYLGRAQGKVGRLRLIELSKQYPEK 284


>ref|XP_002300942.1| exostosin family protein [Populus trichocarpa]
           gi|222842668|gb|EEE80215.1| exostosin family protein
           [Populus trichocarpa]
          Length = 460

 Score =  418 bits (1074), Expect = e-114
 Identities = 204/280 (72%), Positives = 236/280 (84%), Gaps = 2/280 (0%)
 Frame = +2

Query: 104 TAAPCTRTHQIGALALVIVTFFITRVFHQXXXXXXXXX--NDARYPSVGTENDAVVQYSG 277
           T+ PCTRTHQIGAL L+  TFF+TR+F Q           ND   P+V   +D      G
Sbjct: 18  TSPPCTRTHQIGALLLIATTFFLTRLFDQAFTTCPPSSLNNDHSSPNVVHVSD------G 71

Query: 278 GLFFWPQRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISSEACIKGQWGTQVKIHKLL 457
           G   WPQRG+G+HLSLKIYVYEE+EI+GLK LL GRDGKIS++AC+KGQWGTQVKIH LL
Sbjct: 72  GSLSWPQRGYGSHLSLKIYVYEEDEIDGLKELLRGRDGKISADACLKGQWGTQVKIHGLL 131

Query: 458 LQSRFRTKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVKVLSQMPYYRLSGGRNHIF 637
           L+SRFRT++K+EAD FFVP Y+KCVRMMGGLNDKEIN TYVKVLSQMPY+R SGGR+HIF
Sbjct: 132 LESRFRTRKKEEADLFFVPAYVKCVRMMGGLNDKEINHTYVKVLSQMPYFRRSGGRDHIF 191

Query: 638 VFPSGAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDDGMTTRGP 817
           VFPSGAGAHLF+SWATY+NRSIILT E DRTDK+DTSAFNTWKDIIIPGNV+DGMT R  
Sbjct: 192 VFPSGAGAHLFRSWATYINRSIILTTEADRTDKKDTSAFNTWKDIIIPGNVEDGMTKRRI 251

Query: 818 RLVEPLPLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
            +V+PLPLSKRKYLAN+LGRAQGKVGRL+L++LAKQ+P+K
Sbjct: 252 AMVQPLPLSKRKYLANYLGRAQGKVGRLKLIELAKQYPDK 291


>ref|XP_002307477.2| exostosin family protein [Populus trichocarpa]
           gi|550339425|gb|EEE94473.2| exostosin family protein
           [Populus trichocarpa]
          Length = 460

 Score =  410 bits (1054), Expect = e-112
 Identities = 199/276 (72%), Positives = 230/276 (83%), Gaps = 1/276 (0%)
 Frame = +2

Query: 113 PCTRTHQIGALALVIVTFFITRVF-HQXXXXXXXXXNDARYPSVGTENDAVVQYSGGLFF 289
           PCTR+HQIGAL LV  TFF+TR+F H          N        T  + V    GG   
Sbjct: 21  PCTRSHQIGALLLVASTFFLTRLFDHPFSTCPPSSLNHDH-----TSQNVVHFSDGGSLS 75

Query: 290 WPQRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISSEACIKGQWGTQVKIHKLLLQSR 469
           WPQ G+GTHLSLKIYVYEE+EI+GLK LL GR+GKIS++AC+KGQWGTQVKIH+LLLQSR
Sbjct: 76  WPQMGYGTHLSLKIYVYEEDEIDGLKELLRGREGKISADACVKGQWGTQVKIHRLLLQSR 135

Query: 470 FRTKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVKVLSQMPYYRLSGGRNHIFVFPS 649
           FRT++K EA+ FFVP Y KCVRMMGGLNDKEIN TYVK LSQMPY+R SGGR+HIFVFPS
Sbjct: 136 FRTRKKGEANLFFVPAYAKCVRMMGGLNDKEINHTYVKALSQMPYFRRSGGRDHIFVFPS 195

Query: 650 GAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDDGMTTRGPRLVE 829
           GAGAHLF+SWATY+NRSIIL+PEGDRTDK+DTS+FNTWKDIIIPGNV+DGMT RG  + +
Sbjct: 196 GAGAHLFRSWATYINRSIILSPEGDRTDKKDTSSFNTWKDIIIPGNVEDGMTKRGAAMAQ 255

Query: 830 PLPLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
           PLPLSKRKYLAN+LGRAQGKVGRL+L++LAKQ+P+K
Sbjct: 256 PLPLSKRKYLANYLGRAQGKVGRLKLIELAKQYPDK 291


>ref|XP_002265632.1| PREDICTED: probable glucuronosyltransferase Os03g0107900 [Vitis
           vinifera] gi|302142755|emb|CBI19958.3| unnamed protein
           product [Vitis vinifera]
          Length = 459

 Score =  409 bits (1051), Expect = e-112
 Identities = 200/274 (72%), Positives = 229/274 (83%)
 Frame = +2

Query: 116 CTRTHQIGALALVIVTFFITRVFHQXXXXXXXXXNDARYPSVGTENDAVVQYSGGLFFWP 295
           CTRTHQI ALAL+IVTFF+TR+  +         + A+ P   T    +    GG   WP
Sbjct: 20  CTRTHQIAALALIIVTFFLTRLLDRSFSPCASQASVAQLPGSRT---VLRVNGGGSLSWP 76

Query: 296 QRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISSEACIKGQWGTQVKIHKLLLQSRFR 475
           +RG+G+ LSLKIYVYEE+EI+GLK LL+GRDG I +E C+ GQWGTQVKIH+LLL+SRFR
Sbjct: 77  ERGYGSQLSLKIYVYEEDEIDGLKSLLYGRDGSIPTEVCVTGQWGTQVKIHRLLLKSRFR 136

Query: 476 TKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVKVLSQMPYYRLSGGRNHIFVFPSGA 655
           T+RK+EAD FFVPTYIKCVRM GGLNDKEI+Q YVKVLSQMPY+RLSGGRNHIFVFPSGA
Sbjct: 137 TRRKEEADLFFVPTYIKCVRMKGGLNDKEIDQMYVKVLSQMPYFRLSGGRNHIFVFPSGA 196

Query: 656 GAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDDGMTTRGPRLVEPL 835
           G HLFKSWATYLNRSIILTPEGDRTDK+DTSAFNTWKDIIIPGNV D MTT G   V+PL
Sbjct: 197 GPHLFKSWATYLNRSIILTPEGDRTDKKDTSAFNTWKDIIIPGNVADEMTTNGATFVQPL 256

Query: 836 PLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
           PLSKRK+LANFLGRAQ K+GRLQL++LAKQ+P+K
Sbjct: 257 PLSKRKFLANFLGRAQRKLGRLQLIELAKQYPDK 290


>ref|XP_006416298.1| hypothetical protein EUTSA_v10007584mg [Eutrema salsugineum]
           gi|557094069|gb|ESQ34651.1| hypothetical protein
           EUTSA_v10007584mg [Eutrema salsugineum]
          Length = 460

 Score =  409 bits (1050), Expect = e-111
 Identities = 196/296 (66%), Positives = 243/296 (82%), Gaps = 4/296 (1%)
 Frame = +2

Query: 62  MAGISGKSSRYPGAT--AAPCTRTHQIGALALVIVTFFITRVFHQXXXXXXXXXNDARYP 235
           MA ++ K   +   +  A PCTRTHQIGAL LV+ TFF+TR+F Q         +++  P
Sbjct: 1   MASLTSKPRNFGAYSHYATPCTRTHQIGALFLVVSTFFVTRLFDQWFSE-----SNSVTP 55

Query: 236 SVGTE--NDAVVQYSGGLFFWPQRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISSEA 409
           ++     + + +    G+  WP+RG+G+HLSLKIYVY+ENEI+GLK L++GRDG + + A
Sbjct: 56  AIDLRRTSSSGITDDNGIPRWPERGYGSHLSLKIYVYDENEIDGLKELMYGRDGSVKTTA 115

Query: 410 CIKGQWGTQVKIHKLLLQSRFRTKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVKVL 589
           C+KGQWG+QVKIHKLLL+S+FRT +KDEAD FFVP Y+KCVRM+GGLNDKEINQTYVKVL
Sbjct: 116 CLKGQWGSQVKIHKLLLESKFRTSKKDEADLFFVPAYVKCVRMLGGLNDKEINQTYVKVL 175

Query: 590 SQMPYYRLSGGRNHIFVFPSGAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTWKD 769
           SQMPY+R SGGR+HIFVFPSGAGAHLF+SW+T++NRSIILTPEGDRTDK+DT+AFNTWKD
Sbjct: 176 SQMPYFRRSGGRDHIFVFPSGAGAHLFRSWSTFINRSIILTPEGDRTDKKDTTAFNTWKD 235

Query: 770 IIIPGNVDDGMTTRGPRLVEPLPLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
           IIIPGNVDD MT  G   V+PLPLSKRKYLAN+LGRAQGK GRL+L+DL+KQ+P+K
Sbjct: 236 IIIPGNVDDAMTKNGQPDVQPLPLSKRKYLANYLGRAQGKAGRLKLIDLSKQYPDK 291


>gb|AAF87908.1|AC015447_18 Hypothetical protein [Arabidopsis thaliana]
          Length = 414

 Score =  409 bits (1050), Expect = e-111
 Identities = 199/298 (66%), Positives = 242/298 (81%), Gaps = 6/298 (2%)
 Frame = +2

Query: 62  MAGISGKSSRYPGAT---AAPCTRTHQIGALALVIVTFFITRVFHQXXXXXXXXXNDARY 232
           MA ++    R  GA    A PCTRTHQIGAL LV+ TFF+TR+F Q         +++  
Sbjct: 1   MASLTSNKPRNFGAYSHYATPCTRTHQIGALFLVVSTFFVTRLFDQWFSE-----SNSVT 55

Query: 233 PSVG---TENDAVVQYSGGLFFWPQRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISS 403
           P +    T +   ++   G+  WP+RG+G+HLSLKIYVY+ENEI+GLK LL+GRDG + +
Sbjct: 56  PVIDLRRTSSSYGIKTDNGIIRWPERGYGSHLSLKIYVYDENEIDGLKELLYGRDGSVKT 115

Query: 404 EACIKGQWGTQVKIHKLLLQSRFRTKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVK 583
            AC+KGQWG+QVKIHKLLL+S+FRT +KDEAD FFVP Y+KCVRM+GGLNDKEINQTYVK
Sbjct: 116 TACLKGQWGSQVKIHKLLLESKFRTIKKDEADLFFVPAYVKCVRMLGGLNDKEINQTYVK 175

Query: 584 VLSQMPYYRLSGGRNHIFVFPSGAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTW 763
           VLSQMPY+R SGGR+HIFVFPSGAGAHLF+SW+T++NRSIILTPE DRTDK+DT+AFN+W
Sbjct: 176 VLSQMPYFRRSGGRDHIFVFPSGAGAHLFRSWSTFINRSIILTPEADRTDKKDTTAFNSW 235

Query: 764 KDIIIPGNVDDGMTTRGPRLVEPLPLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
           KDIIIPGNVDD MT  G   V+PLPLSKRKYLAN+LGRAQGK GRL+L+DL+KQFP+K
Sbjct: 236 KDIIIPGNVDDAMTKNGQPDVQPLPLSKRKYLANYLGRAQGKAGRLKLIDLSKQFPDK 293


>ref|NP_973879.1| Exostosin family protein [Arabidopsis thaliana]
           gi|332191986|gb|AEE30107.1| Exostosin family protein
           [Arabidopsis thaliana]
          Length = 410

 Score =  409 bits (1050), Expect = e-111
 Identities = 199/298 (66%), Positives = 242/298 (81%), Gaps = 6/298 (2%)
 Frame = +2

Query: 62  MAGISGKSSRYPGAT---AAPCTRTHQIGALALVIVTFFITRVFHQXXXXXXXXXNDARY 232
           MA ++    R  GA    A PCTRTHQIGAL LV+ TFF+TR+F Q         +++  
Sbjct: 1   MASLTSNKPRNFGAYSHYATPCTRTHQIGALFLVVSTFFVTRLFDQWFSE-----SNSVT 55

Query: 233 PSVG---TENDAVVQYSGGLFFWPQRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISS 403
           P +    T +   ++   G+  WP+RG+G+HLSLKIYVY+ENEI+GLK LL+GRDG + +
Sbjct: 56  PVIDLRRTSSSYGIKTDNGIIRWPERGYGSHLSLKIYVYDENEIDGLKELLYGRDGSVKT 115

Query: 404 EACIKGQWGTQVKIHKLLLQSRFRTKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVK 583
            AC+KGQWG+QVKIHKLLL+S+FRT +KDEAD FFVP Y+KCVRM+GGLNDKEINQTYVK
Sbjct: 116 TACLKGQWGSQVKIHKLLLESKFRTIKKDEADLFFVPAYVKCVRMLGGLNDKEINQTYVK 175

Query: 584 VLSQMPYYRLSGGRNHIFVFPSGAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTW 763
           VLSQMPY+R SGGR+HIFVFPSGAGAHLF+SW+T++NRSIILTPE DRTDK+DT+AFN+W
Sbjct: 176 VLSQMPYFRRSGGRDHIFVFPSGAGAHLFRSWSTFINRSIILTPEADRTDKKDTTAFNSW 235

Query: 764 KDIIIPGNVDDGMTTRGPRLVEPLPLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
           KDIIIPGNVDD MT  G   V+PLPLSKRKYLAN+LGRAQGK GRL+L+DL+KQFP+K
Sbjct: 236 KDIIIPGNVDDAMTKNGQPDVQPLPLSKRKYLANYLGRAQGKAGRLKLIDLSKQFPDK 293


>ref|NP_564141.1| Exostosin family protein [Arabidopsis thaliana]
           gi|332191985|gb|AEE30106.1| Exostosin family protein
           [Arabidopsis thaliana]
          Length = 462

 Score =  409 bits (1050), Expect = e-111
 Identities = 199/298 (66%), Positives = 242/298 (81%), Gaps = 6/298 (2%)
 Frame = +2

Query: 62  MAGISGKSSRYPGAT---AAPCTRTHQIGALALVIVTFFITRVFHQXXXXXXXXXNDARY 232
           MA ++    R  GA    A PCTRTHQIGAL LV+ TFF+TR+F Q         +++  
Sbjct: 1   MASLTSNKPRNFGAYSHYATPCTRTHQIGALFLVVSTFFVTRLFDQWFSE-----SNSVT 55

Query: 233 PSVG---TENDAVVQYSGGLFFWPQRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISS 403
           P +    T +   ++   G+  WP+RG+G+HLSLKIYVY+ENEI+GLK LL+GRDG + +
Sbjct: 56  PVIDLRRTSSSYGIKTDNGIIRWPERGYGSHLSLKIYVYDENEIDGLKELLYGRDGSVKT 115

Query: 404 EACIKGQWGTQVKIHKLLLQSRFRTKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVK 583
            AC+KGQWG+QVKIHKLLL+S+FRT +KDEAD FFVP Y+KCVRM+GGLNDKEINQTYVK
Sbjct: 116 TACLKGQWGSQVKIHKLLLESKFRTIKKDEADLFFVPAYVKCVRMLGGLNDKEINQTYVK 175

Query: 584 VLSQMPYYRLSGGRNHIFVFPSGAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTW 763
           VLSQMPY+R SGGR+HIFVFPSGAGAHLF+SW+T++NRSIILTPE DRTDK+DT+AFN+W
Sbjct: 176 VLSQMPYFRRSGGRDHIFVFPSGAGAHLFRSWSTFINRSIILTPEADRTDKKDTTAFNSW 235

Query: 764 KDIIIPGNVDDGMTTRGPRLVEPLPLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
           KDIIIPGNVDD MT  G   V+PLPLSKRKYLAN+LGRAQGK GRL+L+DL+KQFP+K
Sbjct: 236 KDIIIPGNVDDAMTKNGQPDVQPLPLSKRKYLANYLGRAQGKAGRLKLIDLSKQFPDK 293


>ref|XP_002893165.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
           gi|297339007|gb|EFH69424.1| exostosin family protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score =  407 bits (1047), Expect = e-111
 Identities = 199/298 (66%), Positives = 241/298 (80%), Gaps = 6/298 (2%)
 Frame = +2

Query: 62  MAGISGKSSRYPGAT---AAPCTRTHQIGALALVIVTFFITRVFHQXXXXXXXXXNDARY 232
           MA ++    R  GA    A PCTRTHQIGAL LV+ TFF+TR+F Q         +++  
Sbjct: 1   MASLTSNKPRNFGAYSHYATPCTRTHQIGALFLVVSTFFVTRLFDQWFSE-----SNSVS 55

Query: 233 PSVG---TENDAVVQYSGGLFFWPQRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISS 403
           P +    T +   +    G+  WP+RG+G+HLSLKIYVY+ENEI+GLK LL+GRDG + +
Sbjct: 56  PVIDLRRTSSSYGITTYNGILRWPERGYGSHLSLKIYVYDENEIDGLKELLYGRDGSVKT 115

Query: 404 EACIKGQWGTQVKIHKLLLQSRFRTKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVK 583
            AC+KGQWG+QVKIHKLLL+S+FRT +KDEAD FFVP Y+KCVRM+GGLNDKEINQTYVK
Sbjct: 116 TACLKGQWGSQVKIHKLLLESKFRTIKKDEADLFFVPAYVKCVRMLGGLNDKEINQTYVK 175

Query: 584 VLSQMPYYRLSGGRNHIFVFPSGAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTW 763
           VLSQMPY+R SGGR+HIFVFPSGAGAHLF+SW+T++NRSIILTPE DRTDK+DT+AFNTW
Sbjct: 176 VLSQMPYFRRSGGRDHIFVFPSGAGAHLFRSWSTFINRSIILTPEADRTDKKDTTAFNTW 235

Query: 764 KDIIIPGNVDDGMTTRGPRLVEPLPLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
           KDIIIPGNVDD MT  G   V+PLPLSKRKYLAN+LGRAQGK GRL+L+DL+KQ+P+K
Sbjct: 236 KDIIIPGNVDDAMTKNGQPDVQPLPLSKRKYLANYLGRAQGKAGRLKLIDLSKQYPDK 293


>gb|AAL75891.1| At1g21480/F24J8_23 [Arabidopsis thaliana]
          Length = 462

 Score =  407 bits (1045), Expect = e-111
 Identities = 198/298 (66%), Positives = 241/298 (80%), Gaps = 6/298 (2%)
 Frame = +2

Query: 62  MAGISGKSSRYPGAT---AAPCTRTHQIGALALVIVTFFITRVFHQXXXXXXXXXNDARY 232
           MA ++    R  GA    A PCTRTHQIG L LV+ TFF+TR+F Q         +++  
Sbjct: 1   MASLTSNKPRNFGAYSHYATPCTRTHQIGPLFLVVSTFFVTRLFDQWFSE-----SNSVT 55

Query: 233 PSVG---TENDAVVQYSGGLFFWPQRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISS 403
           P +    T +   ++   G+  WP+RG+G+HLSLKIYVY+ENEI+GLK LL+GRDG + +
Sbjct: 56  PVIDLRRTSSSYGIKTDNGIIRWPERGYGSHLSLKIYVYDENEIDGLKELLYGRDGSVKT 115

Query: 404 EACIKGQWGTQVKIHKLLLQSRFRTKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVK 583
            AC+KGQWG+QVKIHKLLL+S+FRT +KDEAD FFVP Y+KCVRM+GGLNDKEINQTYVK
Sbjct: 116 TACLKGQWGSQVKIHKLLLESKFRTIKKDEADLFFVPAYVKCVRMLGGLNDKEINQTYVK 175

Query: 584 VLSQMPYYRLSGGRNHIFVFPSGAGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTW 763
           VLSQMPY+R SGGR+HIFVFPSGAGAHLF+SW+T++NRSIILTPE DRTDK+DT+AFN+W
Sbjct: 176 VLSQMPYFRRSGGRDHIFVFPSGAGAHLFRSWSTFINRSIILTPEADRTDKKDTTAFNSW 235

Query: 764 KDIIIPGNVDDGMTTRGPRLVEPLPLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
           KDIIIPGNVDD MT  G   V+PLPLSKRKYLAN+LGRAQGK GRL+L+DL+KQFP+K
Sbjct: 236 KDIIIPGNVDDAMTKNGQPDVQPLPLSKRKYLANYLGRAQGKAGRLKLIDLSKQFPDK 293


>ref|XP_004148819.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
           [Cucumis sativus] gi|449524512|ref|XP_004169266.1|
           PREDICTED: probable glucuronoxylan
           glucuronosyltransferase F8H-like [Cucumis sativus]
          Length = 458

 Score =  405 bits (1042), Expect = e-111
 Identities = 192/274 (70%), Positives = 229/274 (83%)
 Frame = +2

Query: 116 CTRTHQIGALALVIVTFFITRVFHQXXXXXXXXXNDARYPSVGTENDAVVQYSGGLFFWP 295
           CTR+HQIGAL LV  TFF+TR F +         +   +      + A+     G   WP
Sbjct: 20  CTRSHQIGALLLVCTTFFLTRAFDRLLVPF----SPNSFSGFRQSHYALQSNHDGSISWP 75

Query: 296 QRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISSEACIKGQWGTQVKIHKLLLQSRFR 475
            RG+G+HLSLKIYVY+E EI+GLK L++GRDGKI++ AC+KGQWGTQVKIH+LLLQSRFR
Sbjct: 76  DRGYGSHLSLKIYVYDETEIQGLKALMYGRDGKITAAACLKGQWGTQVKIHRLLLQSRFR 135

Query: 476 TKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVKVLSQMPYYRLSGGRNHIFVFPSGA 655
           T+ K+EADFFFVP Y+KCVRM+GGLNDKEIN+ Y++VL QMPY+RLSGGR+HIFVFPSGA
Sbjct: 136 TRNKEEADFFFVPAYVKCVRMLGGLNDKEINEAYIQVLGQMPYFRLSGGRDHIFVFPSGA 195

Query: 656 GAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDDGMTTRGPRLVEPL 835
           GAHLFKSWATY+NRSIILTPEGDRTDK+D SAFNTWKDIIIPGNVDDGMT+ G ++V+PL
Sbjct: 196 GAHLFKSWATYINRSIILTPEGDRTDKKDFSAFNTWKDIIIPGNVDDGMTSPGAKIVQPL 255

Query: 836 PLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
           PLSKRK+LAN+LGR QGKVGRL+L++LAKQFP K
Sbjct: 256 PLSKRKHLANYLGRDQGKVGRLKLIELAKQFPEK 289


>gb|EOY16596.1| Exostosin family protein isoform 2 [Theobroma cacao]
          Length = 410

 Score =  404 bits (1039), Expect = e-110
 Identities = 193/275 (70%), Positives = 228/275 (82%)
 Frame = +2

Query: 113 PCTRTHQIGALALVIVTFFITRVFHQXXXXXXXXXNDARYPSVGTENDAVVQYSGGLFFW 292
           PCTR HQIGAL L+  TFF TR+F Q          D    S   +        GG   W
Sbjct: 17  PCTRAHQIGALLLIAATFFFTRLFDQSFPPPCNLITDRS--SRDADLHVAKTNGGGRPLW 74

Query: 293 PQRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISSEACIKGQWGTQVKIHKLLLQSRF 472
           P+RG+G+HLSLKIYVY+ENEI+GLK LL+GRDG +S+ AC+KGQWG+QVKIH+LLL+SRF
Sbjct: 75  PERGYGSHLSLKIYVYDENEIDGLKDLLYGRDGTVSTNACLKGQWGSQVKIHRLLLESRF 134

Query: 473 RTKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVKVLSQMPYYRLSGGRNHIFVFPSG 652
           RT++K+EAD FFVP+Y+KCVRM+GGLNDKEINQTYVKVLSQMPY+R SGGR+HIFVFPSG
Sbjct: 135 RTRKKEEADLFFVPSYVKCVRMLGGLNDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSG 194

Query: 653 AGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDDGMTTRGPRLVEP 832
           AGAHLF+SWATY+N SIILTPEGDRTDK+DTSAFNTWKDIIIPGNVDDGMT  G  +V+P
Sbjct: 195 AGAHLFRSWATYINLSIILTPEGDRTDKKDTSAFNTWKDIIIPGNVDDGMTKTGATVVQP 254

Query: 833 LPLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
           LPLSKRKYLAN+LGRAQ K GRL+L++L+KQ+ +K
Sbjct: 255 LPLSKRKYLANYLGRAQKKAGRLKLIELSKQYGDK 289


>gb|EOY16595.1| Exostosin family protein isoform 1 [Theobroma cacao]
          Length = 458

 Score =  404 bits (1039), Expect = e-110
 Identities = 193/275 (70%), Positives = 228/275 (82%)
 Frame = +2

Query: 113 PCTRTHQIGALALVIVTFFITRVFHQXXXXXXXXXNDARYPSVGTENDAVVQYSGGLFFW 292
           PCTR HQIGAL L+  TFF TR+F Q          D    S   +        GG   W
Sbjct: 17  PCTRAHQIGALLLIAATFFFTRLFDQSFPPPCNLITDRS--SRDADLHVAKTNGGGRPLW 74

Query: 293 PQRGFGTHLSLKIYVYEENEIEGLKLLLHGRDGKISSEACIKGQWGTQVKIHKLLLQSRF 472
           P+RG+G+HLSLKIYVY+ENEI+GLK LL+GRDG +S+ AC+KGQWG+QVKIH+LLL+SRF
Sbjct: 75  PERGYGSHLSLKIYVYDENEIDGLKDLLYGRDGTVSTNACLKGQWGSQVKIHRLLLESRF 134

Query: 473 RTKRKDEADFFFVPTYIKCVRMMGGLNDKEINQTYVKVLSQMPYYRLSGGRNHIFVFPSG 652
           RT++K+EAD FFVP+Y+KCVRM+GGLNDKEINQTYVKVLSQMPY+R SGGR+HIFVFPSG
Sbjct: 135 RTRKKEEADLFFVPSYVKCVRMLGGLNDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSG 194

Query: 653 AGAHLFKSWATYLNRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNVDDGMTTRGPRLVEP 832
           AGAHLF+SWATY+N SIILTPEGDRTDK+DTSAFNTWKDIIIPGNVDDGMT  G  +V+P
Sbjct: 195 AGAHLFRSWATYINLSIILTPEGDRTDKKDTSAFNTWKDIIIPGNVDDGMTKTGATVVQP 254

Query: 833 LPLSKRKYLANFLGRAQGKVGRLQLMDLAKQFPNK 937
           LPLSKRKYLAN+LGRAQ K GRL+L++L+KQ+ +K
Sbjct: 255 LPLSKRKYLANYLGRAQKKAGRLKLIELSKQYGDK 289

