BLASTX nr result

ID: Rehmannia26_contig00015582 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia26_contig00015582
         (1587 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI39073.3| unnamed protein product [Vitis vinifera]              617   e-174
ref|XP_006443446.1| hypothetical protein CICLE_v10020045mg [Citr...   593   e-167
gb|EOY10573.1| Exostosin family protein isoform 2 [Theobroma cacao]   582   e-163
gb|EOY10572.1| Exostosin family protein isoform 1 [Theobroma cacao]   582   e-163
ref|XP_004296269.1| PREDICTED: probable glycosyltransferase At5g...   582   e-163
ref|XP_004493174.1| PREDICTED: probable glycosyltransferase At5g...   575   e-161
ref|XP_006604241.1| PREDICTED: probable glycosyltransferase At5g...   570   e-160
ref|XP_006346547.1| PREDICTED: probable glycosyltransferase At5g...   570   e-160
ref|XP_006604240.1| PREDICTED: probable glycosyltransferase At5g...   568   e-159
ref|XP_002265438.2| PREDICTED: probable glycosyltransferase At5g...   563   e-158
ref|XP_006443447.1| hypothetical protein CICLE_v10020045mg [Citr...   562   e-157
ref|XP_006443448.1| hypothetical protein CICLE_v10020045mg [Citr...   560   e-157
ref|XP_003624636.1| Exostosin-like protein [Medicago truncatula]...   553   e-154
ref|XP_002525728.1| catalytic, putative [Ricinus communis] gi|22...   553   e-154
ref|XP_006826373.1| hypothetical protein AMTR_s00004p00132530 [A...   546   e-153
gb|EOY10570.1| Exostosin family protein [Theobroma cacao]             542   e-151
ref|XP_006604242.1| PREDICTED: probable glycosyltransferase At5g...   540   e-151
gb|EXB31256.1| putative glycosyltransferase [Morus notabilis]         535   e-149
gb|EMJ05978.1| hypothetical protein PRUPE_ppa015806mg [Prunus pe...   532   e-148
gb|EOY10571.1| Exostosin family protein [Theobroma cacao]             525   e-146

>emb|CBI39073.3| unnamed protein product [Vitis vinifera]
          Length = 467

 Score =  617 bits (1592), Expect = e-174
 Identities = 321/480 (66%), Positives = 362/480 (75%), Gaps = 3/480 (0%)
 Frame = +2

Query: 80   SFDLLHNFLTRRRDVNYHTRTLKSLFFLMPTTLALATSLFILLYISSTSNLFFIYPH--H 253
            SF +L++F  RR   ++        FF +PT LAL TSLFIL YISSTSNLF  +P   H
Sbjct: 4    SFFILYHFSGRRFSDSFRG------FFFIPTILALITSLFILFYISSTSNLF-THPQETH 56

Query: 254  LQLTHPTSGASINHEKPTKFFSVSPPKIVGFRKIPRFAKKDGFLGHGEDQSGDFKLHMGS 433
            LQ+     G+S          + SPP      +  R   +   L  G            +
Sbjct: 57   LQVLKSALGSS----------AFSPPS----HQFMRVPAETPHLSRG--------FEFNT 94

Query: 434  HGSGMYFSDKELFHDRDVFFENYKEMNKSFKIYVYPHRQDDPFANVLLPVDFEPGGNYAS 613
             G  +      + HDR++F ENYKEMN+SFKIY YPH++DDPFAN LLPVDFEPGGNYAS
Sbjct: 95   KGRFVLLWCTIIRHDRNLFVENYKEMNRSFKIYCYPHKRDDPFANALLPVDFEPGGNYAS 154

Query: 614  ESYFKKVLASSHFITKDPSTADLFFLPFSIARLRHDPRVGINGIQDFIRDYIFNVSHVYS 793
            ESYFKKVL  SHFITKDPS ADLFFLPFSIARLRHDPRVG+ GIQDFIRDYIFN+S  Y 
Sbjct: 155  ESYFKKVLMKSHFITKDPSKADLFFLPFSIARLRHDPRVGVGGIQDFIRDYIFNISQNYP 214

Query: 794  YWNRSGGADHFYVACHSIGRSAMEKAVGVKLNAIQIVCXXXXXXXXXVAHKDASLPQIWP 973
            YWN++GGADHFYVACHSIGRSAMEKA  VKLNAIQ+VC         +AHKDASLPQIWP
Sbjct: 215  YWNQTGGADHFYVACHSIGRSAMEKADEVKLNAIQVVCSSSYFLSGYIAHKDASLPQIWP 274

Query: 974  RQGDPPNLA-NERSKLAFFAGSINSPVREKLLQVWENDSEISVHFGHLPTSYSEELLRSK 1150
            RQGDPP+LA +ER KLAFFAGSINSPVRE+LLQVW NDSEISVHFG L T Y++ELL SK
Sbjct: 275  RQGDPPDLALSERKKLAFFAGSINSPVRERLLQVWRNDSEISVHFGRLTTPYADELLGSK 334

Query: 1151 FCLHVKGFEVNTARIGDALYYGCVPVIIANHYDLPFQDILNWKSFSIVVATLDIPFLKKI 1330
            FCLHVKGFE+NTARI D+LYYGCVPVIIANHYDLPF DILNWKSFSIVVATLDIP LK++
Sbjct: 335  FCLHVKGFEINTARIADSLYYGCVPVIIANHYDLPFADILNWKSFSIVVATLDIPLLKQV 394

Query: 1331 LQGISVEEYSILQNNVLKVRKHFQWHLSPVDYDAFYMVMYELWLRRSSSRIKA*VFVDNF 1510
            L+GIS+ EY +LQ+NVLKVR HFQWH+SPVDYDAFYMVMYELWLRRSS R+   V   NF
Sbjct: 395  LKGISLNEYLMLQSNVLKVRNHFQWHVSPVDYDAFYMVMYELWLRRSSVRVPLIVLNANF 454


>ref|XP_006443446.1| hypothetical protein CICLE_v10020045mg [Citrus clementina]
            gi|568850886|ref|XP_006479128.1| PREDICTED: probable
            glycosyltransferase At5g03795-like isoform X1 [Citrus
            sinensis] gi|557545708|gb|ESR56686.1| hypothetical
            protein CICLE_v10020045mg [Citrus clementina]
          Length = 465

 Score =  593 bits (1530), Expect = e-167
 Identities = 300/453 (66%), Positives = 349/453 (77%), Gaps = 3/453 (0%)
 Frame = +2

Query: 137  RTLKSLFFLMPTTLALATSLFILLYISSTSNLFFIYP--HHLQLTHPTSGASINHEKPTK 310
            R L   FF +PTTLAL ++LFIL YIS+TS+LFF +   HH     P    +     P K
Sbjct: 15   RGLVKTFFFIPTTLALLSTLFILFYISTTSHLFFNHHQRHHQHQLTPFILKNNPLPPPLK 74

Query: 311  FFSVSPPKIVGFRKIPRFAKKDGFLGHGEDQSGDFKLHMGSHGSGMYFSDKELFHDRDVF 490
                S P +V    +   +  DG + +    +    + M ++G+ M   +KE+FHDRD+F
Sbjct: 75   ----SSPVLVSLLNVSNNSHGDGRVRNQRSVN----VPMEANGNSM---NKEVFHDRDIF 123

Query: 491  FENYKEMNKSFKIYVYPHRQDDPFANVLLPVDFEPGGNYASESYFKKVLASSHFITKDPS 670
             E+YK+MN+SF++YVYPHR++DPFANVLLPVDFEP GNYASESYFKKV   SHF+TKDPS
Sbjct: 124  LEDYKQMNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKVFMKSHFVTKDPS 183

Query: 671  TADLFFLPFSIARLRHDPRVGINGIQDFIRDYIFNVSHVYSYWNRSGGADHFYVACHSIG 850
             ADLFFLPFSIAR+RHD R+G  GI DFI  YIFN+S  Y YWNR+GGADHFYVACHSIG
Sbjct: 184  KADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 243

Query: 851  RSAMEKAVGVKLNAIQIVCXXXXXXXXXVAHKDASLPQIWPRQGDPPNL-ANERSKLAFF 1027
            RSAMEKA  VKLNAIQ+VC         +AHKD SLPQIWPRQ DPP L +++R+KLAFF
Sbjct: 244  RSAMEKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFF 303

Query: 1028 AGSINSPVREKLLQVWENDSEISVHFGHLPTSYSEELLRSKFCLHVKGFEVNTARIGDAL 1207
            AG++NSPVREKLLQVW NDSEI  H G L T Y++ LL SKFCLHVKGFEVNTARI D+L
Sbjct: 304  AGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADGLLGSKFCLHVKGFEVNTARIADSL 363

Query: 1208 YYGCVPVIIANHYDLPFQDILNWKSFSIVVATLDIPFLKKILQGISVEEYSILQNNVLKV 1387
            YYGCVPVIIANHYDLPF DILNWKSFSIVVATLDIP LKKIL+GIS EEY +LQNNVLKV
Sbjct: 364  YYGCVPVIIANHYDLPFADILNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQNNVLKV 423

Query: 1388 RKHFQWHLSPVDYDAFYMVMYELWLRRSSSRIK 1486
            RKHFQWH+ P DYDAFYMVMY+LWLRRSS R++
Sbjct: 424  RKHFQWHVFPSDYDAFYMVMYDLWLRRSSVRVQ 456


>gb|EOY10573.1| Exostosin family protein isoform 2 [Theobroma cacao]
          Length = 496

 Score =  582 bits (1501), Expect = e-163
 Identities = 306/498 (61%), Positives = 366/498 (73%), Gaps = 20/498 (4%)
 Frame = +2

Query: 50   SLTNTTMATSSFDLLHNFLTRRRDVNYHTRTLKSLFFLMPTTLALATSLFILLYISSTSN 229
            SL++  MA SS  L + F  RR      + + KS FF+ P TLAL +++FIL YI +TS 
Sbjct: 9    SLSSKAMARSSLPLYY-FSPRR----VSSPSSKSFFFV-PATLALISTIFILFYIFTTST 62

Query: 230  LFFIYPHHLQLTHPTSGASINHEKPTKFFSVSPPKIVGFRKIPRFA-KKDGF-------- 382
            LF  + H           ++  ++P   F  SP      + +P F+   +GF        
Sbjct: 63   LFTSHHHR---------HTLYLKQPLGSFPSSPLT----QNVPSFSLHNNGFKNGTFDLP 109

Query: 383  -------LGHGEDQSGD---FKLHMGSHGSGMYFSDKELFHDRDVFFENYKEMNKSFKIY 532
                   +G GED +      + H GS G+  + ++ E+FHD D+F E+YKEMN SFKIY
Sbjct: 110  KRPPLKAVGGGEDATMSQVTSRPHFGSEGN--FVNNLEVFHDGDIFLEDYKEMNNSFKIY 167

Query: 533  VYPHRQDDPFANVLLPVDFEPGGNYASESYFKKVLASSHFITKDPSTADLFFLPFSIARL 712
            VYP +++DPFA+ LLPVDFEPGGNYASESYFKK L  SHFITKDP+ ADLFFLPFSIARL
Sbjct: 168  VYPVKRNDPFAHALLPVDFEPGGNYASESYFKKALMKSHFITKDPTKADLFFLPFSIARL 227

Query: 713  RHDPRVGINGIQDFIRDYIFNVSHVYSYWNRSGGADHFYVACHSIGRSAMEKAVGVKLNA 892
            RHD R+G  GIQDFIRDYIFN+S  Y YWNRSGGADHFYVACHSIGRS M KA  +KLNA
Sbjct: 228  RHDRRIGTGGIQDFIRDYIFNISQKYPYWNRSGGADHFYVACHSIGRSVMAKARELKLNA 287

Query: 893  IQIVCXXXXXXXXXVAHKDASLPQIWPRQGDPPNLA-NERSKLAFFAGSINSPVREKLLQ 1069
            IQIVC         +AHKDASLPQ+WPR GDPPNLA ++R+KL+FFAGSINSPVREKLL+
Sbjct: 288  IQIVCSSSYFLSGYIAHKDASLPQVWPRTGDPPNLASSKRNKLSFFAGSINSPVREKLLK 347

Query: 1070 VWENDSEISVHFGHLPTSYSEELLRSKFCLHVKGFEVNTARIGDALYYGCVPVIIANHYD 1249
             W NDSEI+ H+G L T Y++ELL SKFCLHVKGFEVNTARI D+LYYGCVP+IIAN+YD
Sbjct: 348  FWRNDSEIAAHYGRLKTPYADELLSSKFCLHVKGFEVNTARIADSLYYGCVPIIIANYYD 407

Query: 1250 LPFQDILNWKSFSIVVATLDIPFLKKILQGISVEEYSILQNNVLKVRKHFQWHLSPVDYD 1429
            LPF DILNWKSFSIVV T+DIP LK+IL+GI+ +EY  LQ NVLKVRKHFQWH+ P+D+D
Sbjct: 408  LPFADILNWKSFSIVVVTVDIPSLKQILRGITSDEYLSLQRNVLKVRKHFQWHVPPIDFD 467

Query: 1430 AFYMVMYELWLRRSSSRI 1483
            AFYMVMYELWLRRSS+RI
Sbjct: 468  AFYMVMYELWLRRSSARI 485


>gb|EOY10572.1| Exostosin family protein isoform 1 [Theobroma cacao]
          Length = 492

 Score =  582 bits (1501), Expect = e-163
 Identities = 306/498 (61%), Positives = 366/498 (73%), Gaps = 20/498 (4%)
 Frame = +2

Query: 50   SLTNTTMATSSFDLLHNFLTRRRDVNYHTRTLKSLFFLMPTTLALATSLFILLYISSTSN 229
            SL++  MA SS  L + F  RR      + + KS FF+ P TLAL +++FIL YI +TS 
Sbjct: 9    SLSSKAMARSSLPLYY-FSPRR----VSSPSSKSFFFV-PATLALISTIFILFYIFTTST 62

Query: 230  LFFIYPHHLQLTHPTSGASINHEKPTKFFSVSPPKIVGFRKIPRFA-KKDGF-------- 382
            LF  + H           ++  ++P   F  SP      + +P F+   +GF        
Sbjct: 63   LFTSHHHR---------HTLYLKQPLGSFPSSPLT----QNVPSFSLHNNGFKNGTFDLP 109

Query: 383  -------LGHGEDQSGD---FKLHMGSHGSGMYFSDKELFHDRDVFFENYKEMNKSFKIY 532
                   +G GED +      + H GS G+  + ++ E+FHD D+F E+YKEMN SFKIY
Sbjct: 110  KRPPLKAVGGGEDATMSQVTSRPHFGSEGN--FVNNLEVFHDGDIFLEDYKEMNNSFKIY 167

Query: 533  VYPHRQDDPFANVLLPVDFEPGGNYASESYFKKVLASSHFITKDPSTADLFFLPFSIARL 712
            VYP +++DPFA+ LLPVDFEPGGNYASESYFKK L  SHFITKDP+ ADLFFLPFSIARL
Sbjct: 168  VYPVKRNDPFAHALLPVDFEPGGNYASESYFKKALMKSHFITKDPTKADLFFLPFSIARL 227

Query: 713  RHDPRVGINGIQDFIRDYIFNVSHVYSYWNRSGGADHFYVACHSIGRSAMEKAVGVKLNA 892
            RHD R+G  GIQDFIRDYIFN+S  Y YWNRSGGADHFYVACHSIGRS M KA  +KLNA
Sbjct: 228  RHDRRIGTGGIQDFIRDYIFNISQKYPYWNRSGGADHFYVACHSIGRSVMAKARELKLNA 287

Query: 893  IQIVCXXXXXXXXXVAHKDASLPQIWPRQGDPPNLA-NERSKLAFFAGSINSPVREKLLQ 1069
            IQIVC         +AHKDASLPQ+WPR GDPPNLA ++R+KL+FFAGSINSPVREKLL+
Sbjct: 288  IQIVCSSSYFLSGYIAHKDASLPQVWPRTGDPPNLASSKRNKLSFFAGSINSPVREKLLK 347

Query: 1070 VWENDSEISVHFGHLPTSYSEELLRSKFCLHVKGFEVNTARIGDALYYGCVPVIIANHYD 1249
             W NDSEI+ H+G L T Y++ELL SKFCLHVKGFEVNTARI D+LYYGCVP+IIAN+YD
Sbjct: 348  FWRNDSEIAAHYGRLKTPYADELLSSKFCLHVKGFEVNTARIADSLYYGCVPIIIANYYD 407

Query: 1250 LPFQDILNWKSFSIVVATLDIPFLKKILQGISVEEYSILQNNVLKVRKHFQWHLSPVDYD 1429
            LPF DILNWKSFSIVV T+DIP LK+IL+GI+ +EY  LQ NVLKVRKHFQWH+ P+D+D
Sbjct: 408  LPFADILNWKSFSIVVVTVDIPSLKQILRGITSDEYLSLQRNVLKVRKHFQWHVPPIDFD 467

Query: 1430 AFYMVMYELWLRRSSSRI 1483
            AFYMVMYELWLRRSS+RI
Sbjct: 468  AFYMVMYELWLRRSSARI 485


>ref|XP_004296269.1| PREDICTED: probable glycosyltransferase At5g03795-like [Fragaria
            vesca subsp. vesca]
          Length = 439

 Score =  582 bits (1501), Expect = e-163
 Identities = 295/442 (66%), Positives = 332/442 (75%), Gaps = 1/442 (0%)
 Frame = +2

Query: 155  FFLMPTTLALATSLFILLYISSTSNLFFIYPHHLQLTHPTSGASINHEKPTKFFSVSPPK 334
            FF +PTTLALATSL IL YIS+TSNLF   PHH  L + +S A   H  P +     PP 
Sbjct: 18   FFFIPTTLALATSLLILFYISTTSNLF---PHHPPLPNLSSFAP--HLYPFQSQRSLPPN 72

Query: 335  IVGFRKIPRFAKKDGFLGHGEDQSGDFKLHMGSHGSGMYFSDKELFHDRDVFFENYKEMN 514
                                            S  +G Y ++ E+FHD  +F ++YKEM 
Sbjct: 73   --------------------------------SAPNGNYDNNNEVFHDTHIFVQDYKEMK 100

Query: 515  KSFKIYVYPHRQDDPFANVLLPVDFEPGGNYASESYFKKVLASSHFITKDPSTADLFFLP 694
            +SFKIYVYPHR+DDPFAN LLPVDFEP GNYASESYFKKVL  SHFIT DP+ A LFFLP
Sbjct: 101  RSFKIYVYPHRKDDPFANALLPVDFEPAGNYASESYFKKVLMESHFITNDPTQAQLFFLP 160

Query: 695  FSIARLRHDPRVGINGIQDFIRDYIFNVSHVYSYWNRSGGADHFYVACHSIGRSAMEKAV 874
            FSIARLRHDPRVG+ GIQDFIRDY+FN+SH Y YWNR+GGADHFYVACHSIGRSAMEKA 
Sbjct: 161  FSIARLRHDPRVGVGGIQDFIRDYMFNISHKYEYWNRTGGADHFYVACHSIGRSAMEKAT 220

Query: 875  GVKLNAIQIVCXXXXXXXXXVAHKDASLPQIWPRQGDPPN-LANERSKLAFFAGSINSPV 1051
             VK NAIQ+VC         +AHKDA LPQIWPR+ DPPN L++ R+KLAFFAG INSPV
Sbjct: 221  QVKFNAIQLVCSSSYFLSGYIAHKDACLPQIWPRKQDPPNLLSSNRTKLAFFAGGINSPV 280

Query: 1052 REKLLQVWENDSEISVHFGHLPTSYSEELLRSKFCLHVKGFEVNTARIGDALYYGCVPVI 1231
            RE+LLQVW NDSEI V+FG L TSY++ LL S FCLHVKGFEVNTARI D+LYYGCVPVI
Sbjct: 281  RERLLQVWRNDSEIFVNFGRLKTSYADALLGSMFCLHVKGFEVNTARIADSLYYGCVPVI 340

Query: 1232 IANHYDLPFQDILNWKSFSIVVATLDIPFLKKILQGISVEEYSILQNNVLKVRKHFQWHL 1411
            IAN+YDLPF DILNWKSFS+VVATLDIP LK IL+GI  +EY  L+NNV KVR  FQWHL
Sbjct: 341  IANYYDLPFADILNWKSFSVVVATLDIPLLKNILKGIRSDEYMRLRNNVFKVRNQFQWHL 400

Query: 1412 SPVDYDAFYMVMYELWLRRSSS 1477
            SP+DYDAF+MVMYELWLRRS S
Sbjct: 401  SPIDYDAFHMVMYELWLRRSFS 422


>ref|XP_004493174.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cicer
            arietinum]
          Length = 472

 Score =  575 bits (1482), Expect = e-161
 Identities = 292/453 (64%), Positives = 338/453 (74%), Gaps = 8/453 (1%)
 Frame = +2

Query: 140  TLKSLFFLMPTTLALA--TSLFILLYISSTSNLFFIYP--HHLQLTHP--TSGASINHEK 301
            + ++ FF +PTTLAL   TSL IL Y+ +TS +F  +   HHLQ T    TS +S+    
Sbjct: 18   SFRNFFFFIPTTLALLFLTSLSILFYVYTTSIIFINHHQHHHLQSTSQYFTSLSSLP--- 74

Query: 302  PTKFFSVSPPKIVGFRKIPRFAKKDGF-LGHGEDQSGDFKLHMGSHGSGMYFSDKELFHD 478
                  +  P          F K   F LGHG        L   S+ +     +  LFHD
Sbjct: 75   -----VLLSPTTTLHNNASEFTKFQTFQLGHGLPPQSQRGLPSQSNSTRKLEKNNNLFHD 129

Query: 479  RDVFFENYKEMNKSFKIYVYPHRQDDPFANVLLPVDFEPGGNYASESYFKKVLASSHFIT 658
            RD+F E+YKEMN+SFKIYVYPHR+DDPFANVLLP+  EPGGNYASESYFKKVL  SHFIT
Sbjct: 130  RDLFLEDYKEMNRSFKIYVYPHREDDPFANVLLPMKHEPGGNYASESYFKKVLMKSHFIT 189

Query: 659  KDPSTADLFFLPFSIARLRHDPRVGINGIQDFIRDYIFNVSHVYSYWNRSGGADHFYVAC 838
             DP+ ADLFF+PFSIA LRHDPRVG+ GIQDFIRDY+ N+ H Y YWNR+GGADHFYVAC
Sbjct: 190  NDPTEADLFFMPFSIASLRHDPRVGVEGIQDFIRDYVQNIVHKYPYWNRTGGADHFYVAC 249

Query: 839  HSIGRSAMEKAVGVKLNAIQIVCXXXXXXXXXVAHKDASLPQIWPRQGDPPNL-ANERSK 1015
            HSIGRSAMEKA  VK NAIQ+VC         +AHKD  LPQIWPR+ +PPNL ++ R K
Sbjct: 250  HSIGRSAMEKAPDVKFNAIQVVCSSSYFLTGYIAHKDTCLPQIWPRKQNPPNLVSSNRKK 309

Query: 1016 LAFFAGSINSPVREKLLQVWENDSEISVHFGHLPTSYSEELLRSKFCLHVKGFEVNTARI 1195
            LAFFAG +NSPVR KLL+ W+NDSEI VH G L T Y++ELL SKFCLHVKGFEVNTARI
Sbjct: 310  LAFFAGGVNSPVRIKLLETWKNDSEIFVHHGRLKTPYADELLGSKFCLHVKGFEVNTARI 369

Query: 1196 GDALYYGCVPVIIANHYDLPFQDILNWKSFSIVVATLDIPFLKKILQGISVEEYSILQNN 1375
            GD+LYYGCVPVIIAN+YDLPF D+LNWKSFS+VV TLDIP LKKIL+GIS +EY +LQ N
Sbjct: 370  GDSLYYGCVPVIIANYYDLPFADVLNWKSFSVVVTTLDIPLLKKILKGISSDEYLMLQRN 429

Query: 1376 VLKVRKHFQWHLSPVDYDAFYMVMYELWLRRSS 1474
            VLKVRKHFQWH  P+D+DAFYMV+YELWLRRSS
Sbjct: 430  VLKVRKHFQWHSPPIDFDAFYMVVYELWLRRSS 462


>ref|XP_006604241.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X2
            [Glycine max]
          Length = 489

 Score =  570 bits (1469), Expect = e-160
 Identities = 295/464 (63%), Positives = 339/464 (73%), Gaps = 12/464 (2%)
 Frame = +2

Query: 155  FFLMPTTLALATSLFILLYISSTSNLFFIYPHHLQLTHPTSGASINHEKPTKFFSVSPPK 334
            FF +PTTLAL TS FIL YI STSN+F  + HH          S +H KP   FS +P  
Sbjct: 36   FFFIPTTLALFTSFFILFYIYSTSNIFTHHNHH---------PSTSHFKPHPPFSTTPFI 86

Query: 335  IVGFRKIPRFAKKDGF---------LGHGEDQSGDFKLHMGSHGSGM-YFSDKELFHDRD 484
                  +P F               LG+G        L +    S    F + ++FHDRD
Sbjct: 87   ATTPHFVPVFNHNASESTKSPPTFQLGYGLGPQSQRGLPLPPQFSSKGKFENNDVFHDRD 146

Query: 485  VFFENYKEMNKSFKIYVYPHRQDDPFANVLLPVDFEPGGNYASESYFKKVLASSHFITKD 664
            VF E+YKEMN+S KIYVYPHR+DDPFANVLLPV+ EPGGNY SESYFKKVL  SHFITKD
Sbjct: 147  VFLEDYKEMNRSLKIYVYPHREDDPFANVLLPVESEPGGNYTSESYFKKVLMKSHFITKD 206

Query: 665  PSTADLFFLPFSIARLRHDPRVGINGIQDFIRDYIFNVSHVYSYWNRSGGADHFYVACHS 844
            P  ADLFFLPFS+ARL HD RVG+ GIQDFIRDYI N+SH Y YWN +GGADHFYVACHS
Sbjct: 207  PPEADLFFLPFSMARLWHDRRVGVGGIQDFIRDYIHNISHRYPYWNNTGGADHFYVACHS 266

Query: 845  IGRSAMEKAVGVKLNAIQIVCXXXXXXXXXVAHKDASLPQIWPRQGDPPNL-ANERSKLA 1021
            IGRSAM+KA   K NAIQ+VC          AHKDA LPQIWPR+G+PPNL +++R +LA
Sbjct: 267  IGRSAMDKAPDEKFNAIQVVCSSSYFLTGYFAHKDACLPQIWPRKGNPPNLVSSKRKRLA 326

Query: 1022 FFAGSINSPVREKLLQVWENDSEISVHFGHLPTSYSEELLRSKFCLHVKGFEVNTARIGD 1201
            FFAG +NSPVR KLL+ W+NDSEI VH G L T Y++ELL SKFCLHVKGFEVNTARIGD
Sbjct: 327  FFAGGVNSPVRVKLLETWKNDSEIFVHHGRLKTPYADELLGSKFCLHVKGFEVNTARIGD 386

Query: 1202 ALYYGCVPVIIANHYDLPFQDILNWKSFSIVVATLDIPFLKKILQG-ISVEEYSILQNNV 1378
            +LYYGCVPVIIAN+YDLPF D+LNWKSFS+VV TLDIP LKKIL+  IS  +Y +LQ+NV
Sbjct: 387  SLYYGCVPVIIANYYDLPFADVLNWKSFSVVVTTLDIPLLKKILKDIISSNKYLMLQSNV 446

Query: 1379 LKVRKHFQWHLSPVDYDAFYMVMYELWLRRSSSRIKA*VFVDNF 1510
            LKVRKHFQWH  P D+DAFYMVMYELWLRRSS +     +VD+F
Sbjct: 447  LKVRKHFQWHSPPQDFDAFYMVMYELWLRRSSIKN---TWVDSF 487


>ref|XP_006346547.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X1
            [Solanum tuberosum]
          Length = 452

 Score =  570 bits (1468), Expect = e-160
 Identities = 287/448 (64%), Positives = 337/448 (75%), Gaps = 3/448 (0%)
 Frame = +2

Query: 152  LFFLMPTTLALATSLFILLYISSTSNLFFIYPHHLQLTHPTSGASINHEKPTKFFSVSPP 331
            +F L+PT L++ + LFIL YIS TSN FFI+ H   LT                F++S  
Sbjct: 47   MFILIPTGLSVVSCLFILFYISFTSN-FFIHSHQTHLT----------------FNIS-- 87

Query: 332  KIVGFRKIPRFAKKDGFLGHGEDQSGDFKLHMGSHG--SGMYFSDKELFHDRDVFFENYK 505
                            F+G+         +H  +H   +G + +D ++FHDRD F +NYK
Sbjct: 88   ----------------FVGNP-------MIHTHTHVQFNGNHVNDNDVFHDRDAFVDNYK 124

Query: 506  EMNKSFKIYVYPHRQDDPFANVLLPVDFEPGGNYASESYFKKVLASSHFITKDPSTADLF 685
            EMN+S KIYVYPH++DDPF+NVLL VDFEPGGNYASESYFKKVL  SHFIT+DPS ADLF
Sbjct: 125  EMNRSLKIYVYPHQKDDPFSNVLLAVDFEPGGNYASESYFKKVLKMSHFITRDPSNADLF 184

Query: 686  FLPFSIARLRHDPRVGINGIQDFIRDYIFNVSHVYSYWNRSGGADHFYVACHSIGRSAME 865
            FLPFSIARLRHDPRVGINGI+DFI+ YIFN+SH Y YWN + GADHFYVACHSIGR AME
Sbjct: 185  FLPFSIARLRHDPRVGINGIKDFIKSYIFNISHEYPYWNLTNGADHFYVACHSIGRFAME 244

Query: 866  KAVGVKLNAIQIVCXXXXXXXXXVAHKDASLPQIWPRQGDPPNLA-NERSKLAFFAGSIN 1042
            K V VK+N IQ+VC         + HKDASLPQIWPR G  P+ A  +R KL FFAGS+N
Sbjct: 245  KVVDVKINVIQVVCTSSYFVSAYIPHKDASLPQIWPRLGGNPDFAPYKRKKLGFFAGSLN 304

Query: 1043 SPVREKLLQVWENDSEISVHFGHLPTSYSEELLRSKFCLHVKGFEVNTARIGDALYYGCV 1222
            SPVREKLL+ W NDS+I VH G L  SY+EELL SKFCLHVKGFEVNTARI DAL+YGCV
Sbjct: 305  SPVREKLLEWWGNDSDIFVHSGRLERSYTEELLGSKFCLHVKGFEVNTARIVDALFYGCV 364

Query: 1223 PVIIANHYDLPFQDILNWKSFSIVVATLDIPFLKKILQGISVEEYSILQNNVLKVRKHFQ 1402
            PVIIANHYDLPF DIL+WK FS++VATLDIP LKKILQGI+ +EY +LQ+NVLKVR+HFQ
Sbjct: 365  PVIIANHYDLPFADILDWKHFSVIVATLDIPLLKKILQGITQQEYLVLQSNVLKVREHFQ 424

Query: 1403 WHLSPVDYDAFYMVMYELWLRRSSSRIK 1486
            WH+SP+D+DAFYMVMYELWLRRSS R++
Sbjct: 425  WHVSPIDFDAFYMVMYELWLRRSSLRLQ 452


>ref|XP_006604240.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X1
            [Glycine max]
          Length = 500

 Score =  568 bits (1464), Expect = e-159
 Identities = 297/476 (62%), Positives = 342/476 (71%), Gaps = 24/476 (5%)
 Frame = +2

Query: 155  FFLMPTTLALATSLFILLYISSTSNLFFIYPHHLQLTHPTSGASINHEKPTKFFSVSPPK 334
            FF +PTTLAL TS FIL YI STSN+F  + HH          S +H KP   FS +P  
Sbjct: 36   FFFIPTTLALFTSFFILFYIYSTSNIFTHHNHH---------PSTSHFKPHPPFSTTPFI 86

Query: 335  IVGFRKIPRFAKKDGF---------LGHGEDQSGDFKLHMGS-------------HGSGM 448
                  +P F               LG+G        L +               +GSG 
Sbjct: 87   ATTPHFVPVFNHNASESTKSPPTFQLGYGLGPQSQRGLPLPPQFSSKVCRECCVFYGSGK 146

Query: 449  YFSDKELFHDRDVFFENYKEMNKSFKIYVYPHRQDDPFANVLLPVDFEPGGNYASESYFK 628
             F + ++FHDRDVF E+YKEMN+S KIYVYPHR+DDPFANVLLPV+ EPGGNY SESYFK
Sbjct: 147  -FENNDVFHDRDVFLEDYKEMNRSLKIYVYPHREDDPFANVLLPVESEPGGNYTSESYFK 205

Query: 629  KVLASSHFITKDPSTADLFFLPFSIARLRHDPRVGINGIQDFIRDYIFNVSHVYSYWNRS 808
            KVL  SHFITKDP  ADLFFLPFS+ARL HD RVG+ GIQDFIRDYI N+SH Y YWN +
Sbjct: 206  KVLMKSHFITKDPPEADLFFLPFSMARLWHDRRVGVGGIQDFIRDYIHNISHRYPYWNNT 265

Query: 809  GGADHFYVACHSIGRSAMEKAVGVKLNAIQIVCXXXXXXXXXVAHKDASLPQIWPRQGDP 988
            GGADHFYVACHSIGRSAM+KA   K NAIQ+VC          AHKDA LPQIWPR+G+P
Sbjct: 266  GGADHFYVACHSIGRSAMDKAPDEKFNAIQVVCSSSYFLTGYFAHKDACLPQIWPRKGNP 325

Query: 989  PNL-ANERSKLAFFAGSINSPVREKLLQVWENDSEISVHFGHLPTSYSEELLRSKFCLHV 1165
            PNL +++R +LAFFAG +NSPVR KLL+ W+NDSEI VH G L T Y++ELL SKFCLHV
Sbjct: 326  PNLVSSKRKRLAFFAGGVNSPVRVKLLETWKNDSEIFVHHGRLKTPYADELLGSKFCLHV 385

Query: 1166 KGFEVNTARIGDALYYGCVPVIIANHYDLPFQDILNWKSFSIVVATLDIPFLKKILQG-I 1342
            KGFEVNTARIGD+LYYGCVPVIIAN+YDLPF D+LNWKSFS+VV TLDIP LKKIL+  I
Sbjct: 386  KGFEVNTARIGDSLYYGCVPVIIANYYDLPFADVLNWKSFSVVVTTLDIPLLKKILKDII 445

Query: 1343 SVEEYSILQNNVLKVRKHFQWHLSPVDYDAFYMVMYELWLRRSSSRIKA*VFVDNF 1510
            S  +Y +LQ+NVLKVRKHFQWH  P D+DAFYMVMYELWLRRSS +     +VD+F
Sbjct: 446  SSNKYLMLQSNVLKVRKHFQWHSPPQDFDAFYMVMYELWLRRSSIKN---TWVDSF 498


>ref|XP_002265438.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
            vinifera]
          Length = 336

 Score =  563 bits (1451), Expect = e-158
 Identities = 268/326 (82%), Positives = 292/326 (89%), Gaps = 1/326 (0%)
 Frame = +2

Query: 509  MNKSFKIYVYPHRQDDPFANVLLPVDFEPGGNYASESYFKKVLASSHFITKDPSTADLFF 688
            MN+SFKIY YPH++DDPFAN LLPVDFEPGGNYASESYFKKVL  SHFITKDPS ADLFF
Sbjct: 1    MNRSFKIYCYPHKRDDPFANALLPVDFEPGGNYASESYFKKVLMKSHFITKDPSKADLFF 60

Query: 689  LPFSIARLRHDPRVGINGIQDFIRDYIFNVSHVYSYWNRSGGADHFYVACHSIGRSAMEK 868
            LPFSIARLRHDPRVG+ GIQDFIRDYIFN+S  Y YWN++GGADHFYVACHSIGRSAMEK
Sbjct: 61   LPFSIARLRHDPRVGVGGIQDFIRDYIFNISQNYPYWNQTGGADHFYVACHSIGRSAMEK 120

Query: 869  AVGVKLNAIQIVCXXXXXXXXXVAHKDASLPQIWPRQGDPPNLA-NERSKLAFFAGSINS 1045
            A  VKLNAIQ+VC         +AHKDASLPQIWPRQGDPP+LA +ER KLAFFAGSINS
Sbjct: 121  ADEVKLNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPDLALSERKKLAFFAGSINS 180

Query: 1046 PVREKLLQVWENDSEISVHFGHLPTSYSEELLRSKFCLHVKGFEVNTARIGDALYYGCVP 1225
            PVRE+LLQVW NDSEISVHFG L T Y++ELL SKFCLHVKGFE+NTARI D+LYYGCVP
Sbjct: 181  PVRERLLQVWRNDSEISVHFGRLTTPYADELLGSKFCLHVKGFEINTARIADSLYYGCVP 240

Query: 1226 VIIANHYDLPFQDILNWKSFSIVVATLDIPFLKKILQGISVEEYSILQNNVLKVRKHFQW 1405
            VIIANHYDLPF DILNWKSFSIVVATLDIP LK++L+GIS+ EY +LQ+NVLKVR HFQW
Sbjct: 241  VIIANHYDLPFADILNWKSFSIVVATLDIPLLKQVLKGISLNEYLMLQSNVLKVRNHFQW 300

Query: 1406 HLSPVDYDAFYMVMYELWLRRSSSRI 1483
            H+SPVDYDAFYMVMYELWLRRSS R+
Sbjct: 301  HVSPVDYDAFYMVMYELWLRRSSVRV 326


>ref|XP_006443447.1| hypothetical protein CICLE_v10020045mg [Citrus clementina]
            gi|568850888|ref|XP_006479129.1| PREDICTED: probable
            glycosyltransferase At5g03795-like isoform X2 [Citrus
            sinensis] gi|557545709|gb|ESR56687.1| hypothetical
            protein CICLE_v10020045mg [Citrus clementina]
          Length = 374

 Score =  562 bits (1448), Expect = e-157
 Identities = 270/359 (75%), Positives = 302/359 (84%), Gaps = 5/359 (1%)
 Frame = +2

Query: 425  MGSHGSGMYFS----DKELFHDRDVFFENYKEMNKSFKIYVYPHRQDDPFANVLLPVDFE 592
            M    SG Y S    +KE+FHDRD+F E+YK+MN+SF++YVYPHR++DPFANVLLPVDFE
Sbjct: 7    MNLETSGFYASGNSMNKEVFHDRDIFLEDYKQMNRSFRVYVYPHRRNDPFANVLLPVDFE 66

Query: 593  PGGNYASESYFKKVLASSHFITKDPSTADLFFLPFSIARLRHDPRVGINGIQDFIRDYIF 772
            P GNYASESYFKKV   SHF+TKDPS ADLFFLPFSIAR+RHD R+G  GI DFI  YIF
Sbjct: 67   PRGNYASESYFKKVFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIF 126

Query: 773  NVSHVYSYWNRSGGADHFYVACHSIGRSAMEKAVGVKLNAIQIVCXXXXXXXXXVAHKDA 952
            N+S  Y YWNR+GGADHFYVACHSIGRSAMEKA  VKLNAIQ+VC         +AHKD 
Sbjct: 127  NISQKYPYWNRTGGADHFYVACHSIGRSAMEKAWEVKLNAIQVVCSSSYFISGHIAHKDV 186

Query: 953  SLPQIWPRQGDPPNL-ANERSKLAFFAGSINSPVREKLLQVWENDSEISVHFGHLPTSYS 1129
            SLPQIWPRQ DPP L +++R+KLAFFAG++NSPVREKLLQVW NDSEI  H G L T Y+
Sbjct: 187  SLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYA 246

Query: 1130 EELLRSKFCLHVKGFEVNTARIGDALYYGCVPVIIANHYDLPFQDILNWKSFSIVVATLD 1309
            + LL SKFCLHVKGFEVNTARI D+LYYGCVPVIIANHYDLPF DILNWKSFSIVVATLD
Sbjct: 247  DGLLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADILNWKSFSIVVATLD 306

Query: 1310 IPFLKKILQGISVEEYSILQNNVLKVRKHFQWHLSPVDYDAFYMVMYELWLRRSSSRIK 1486
            IP LKKIL+GIS EEY +LQNNVLKVRKHFQWH+ P DYDAFYMVMY+LWLRRSS R++
Sbjct: 307  IPLLKKILKGISSEEYLLLQNNVLKVRKHFQWHVFPSDYDAFYMVMYDLWLRRSSVRVQ 365


>ref|XP_006443448.1| hypothetical protein CICLE_v10020045mg [Citrus clementina]
            gi|557545710|gb|ESR56688.1| hypothetical protein
            CICLE_v10020045mg [Citrus clementina]
          Length = 354

 Score =  560 bits (1443), Expect = e-157
 Identities = 265/344 (77%), Positives = 297/344 (86%), Gaps = 1/344 (0%)
 Frame = +2

Query: 458  DKELFHDRDVFFENYKEMNKSFKIYVYPHRQDDPFANVLLPVDFEPGGNYASESYFKKVL 637
            +KE+FHDRD+F E+YK+MN+SF++YVYPHR++DPFANVLLPVDFEP GNYASESYFKKV 
Sbjct: 2    NKEVFHDRDIFLEDYKQMNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKVF 61

Query: 638  ASSHFITKDPSTADLFFLPFSIARLRHDPRVGINGIQDFIRDYIFNVSHVYSYWNRSGGA 817
              SHF+TKDPS ADLFFLPFSIAR+RHD R+G  GI DFI  YIFN+S  Y YWNR+GGA
Sbjct: 62   MKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGA 121

Query: 818  DHFYVACHSIGRSAMEKAVGVKLNAIQIVCXXXXXXXXXVAHKDASLPQIWPRQGDPPNL 997
            DHFYVACHSIGRSAMEKA  VKLNAIQ+VC         +AHKD SLPQIWPRQ DPP L
Sbjct: 122  DHFYVACHSIGRSAMEKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL 181

Query: 998  -ANERSKLAFFAGSINSPVREKLLQVWENDSEISVHFGHLPTSYSEELLRSKFCLHVKGF 1174
             +++R+KLAFFAG++NSPVREKLLQVW NDSEI  H G L T Y++ LL SKFCLHVKGF
Sbjct: 182  GSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADGLLGSKFCLHVKGF 241

Query: 1175 EVNTARIGDALYYGCVPVIIANHYDLPFQDILNWKSFSIVVATLDIPFLKKILQGISVEE 1354
            EVNTARI D+LYYGCVPVIIANHYDLPF DILNWKSFSIVVATLDIP LKKIL+GIS EE
Sbjct: 242  EVNTARIADSLYYGCVPVIIANHYDLPFADILNWKSFSIVVATLDIPLLKKILKGISSEE 301

Query: 1355 YSILQNNVLKVRKHFQWHLSPVDYDAFYMVMYELWLRRSSSRIK 1486
            Y +LQNNVLKVRKHFQWH+ P DYDAFYMVMY+LWLRRSS R++
Sbjct: 302  YLLLQNNVLKVRKHFQWHVFPSDYDAFYMVMYDLWLRRSSVRVQ 345


>ref|XP_003624636.1| Exostosin-like protein [Medicago truncatula]
            gi|87162615|gb|ABD28410.1| Exostosin-like [Medicago
            truncatula] gi|116831751|gb|ABK28848.1| exostosin-like
            protein [Medicago truncatula] gi|355499651|gb|AES80854.1|
            Exostosin-like protein [Medicago truncatula]
          Length = 486

 Score =  553 bits (1424), Expect = e-154
 Identities = 285/464 (61%), Positives = 330/464 (71%), Gaps = 19/464 (4%)
 Frame = +2

Query: 140  TLKSLFFLMPTTLALATSLFILLYISSTSNLFFIYPHHLQLTHPTSGASINHEKPTKFFS 319
            + KS FF +PTTLAL TSL IL Y+  TS    I+ HH Q  +  S           F  
Sbjct: 18   SFKSFFFFIPTTLALLTSLSILFYVYYTS---IIFTHHHQHNNQQSTLINFKSSSPNFIL 74

Query: 320  VSP-PKIVG--FRKIPRFAKKDGF-LGHG-----------EDQSGDFKLHMGS--HGSGM 448
             SP P +          F K   F LGHG           +  S     H  S   GS  
Sbjct: 75   PSPTPHLTNTLHNNHSEFIKSHTFQLGHGLGPQSQRGLPPQSSSNGQNKHENSVFDGSRK 134

Query: 449  YFSDKELFHDRDVFFENYKEMNKSFKIYVYPHRQDDPFANVLLPVDFEPGGNYASESYFK 628
            +  +  +FHDRD+F E+YKEMN+SFKIYVYPH++DDPFANVLLPV  EP GNYASESYFK
Sbjct: 135  FKENNNVFHDRDIFLEDYKEMNRSFKIYVYPHKKDDPFANVLLPVKTEPSGNYASESYFK 194

Query: 629  KVLASSHFITKDPSTADLFFLPFSIARLRHDPRVGINGIQDFIRDYIFNVSHVYSYWNRS 808
            K L  SHFITKDP+ ADLFF+PFSIA LRHD RVG+ GIQDFIRDY+ N+ H Y YWNR+
Sbjct: 195  KALMKSHFITKDPTKADLFFMPFSIASLRHDRRVGVGGIQDFIRDYVQNMIHKYPYWNRT 254

Query: 809  GGADHFYVACHSIGRSAMEKAVGVKLNAIQIVCXXXXXXXXXVAHKDASLPQIWPRQGDP 988
             GADHFYVACHSIGRSAM+KA  VK NAIQ+VC         +AHKDA LPQIWPR  +P
Sbjct: 255  NGADHFYVACHSIGRSAMDKAPDVKFNAIQVVCSSSYFLSGYIAHKDACLPQIWPRNENP 314

Query: 989  PNL-ANERSKLAFFAGSINSPVREKLLQVWENDSEISVHFGHLPTSYSEELLRSKFCLHV 1165
            PNL ++ R KLAFFAG +NSPVR  L++ W+ND+EI VH G L T Y +ELL SKFC HV
Sbjct: 315  PNLVSSNRKKLAFFAGEVNSPVRINLVETWKNDTEIFVHNGRLKTPYGDELLGSKFCFHV 374

Query: 1166 KGFEVNTARIGDALYYGCVPVIIANHYDLPFQDILNWKSFSIVVATLDIPFLKKILQGI- 1342
            +G+EVNTARIGD+LYYGCVPVIIAN+YDLPF D+LNWKSFS+VV TLDIP LKKIL+GI 
Sbjct: 375  RGYEVNTARIGDSLYYGCVPVIIANYYDLPFADVLNWKSFSVVVTTLDIPLLKKILKGIV 434

Query: 1343 SVEEYSILQNNVLKVRKHFQWHLSPVDYDAFYMVMYELWLRRSS 1474
            +  EY +LQ NVLKVR+HFQWH  P+D+DAFYMVMYELWLRRSS
Sbjct: 435  NSGEYLMLQKNVLKVREHFQWHSPPIDFDAFYMVMYELWLRRSS 478


>ref|XP_002525728.1| catalytic, putative [Ricinus communis] gi|223535028|gb|EEF36711.1|
            catalytic, putative [Ricinus communis]
          Length = 336

 Score =  553 bits (1424), Expect = e-154
 Identities = 260/326 (79%), Positives = 289/326 (88%), Gaps = 1/326 (0%)
 Frame = +2

Query: 509  MNKSFKIYVYPHRQDDPFANVLLPVDFEPGGNYASESYFKKVLASSHFITKDPSTADLFF 688
            MN+SFKIYVYPHRQ+DPFANVLLPVDFEPGGNYASESYFKKVL  SHFITKDP+ ADLFF
Sbjct: 1    MNRSFKIYVYPHRQNDPFANVLLPVDFEPGGNYASESYFKKVLMKSHFITKDPTKADLFF 60

Query: 689  LPFSIARLRHDPRVGINGIQDFIRDYIFNVSHVYSYWNRSGGADHFYVACHSIGRSAMEK 868
            LPFSIARLRHDPR+G+ GIQDFIR Y++N+S  Y YWNR+GG DHFYVACHSIGR+AMEK
Sbjct: 61   LPFSIARLRHDPRIGVEGIQDFIRAYVYNISQKYPYWNRTGGTDHFYVACHSIGRTAMEK 120

Query: 869  AVGVKLNAIQIVCXXXXXXXXXVAHKDASLPQIWPRQGDPPNLA-NERSKLAFFAGSINS 1045
            A  VK NAIQ+VC         +AHKDASLPQ+WPRQGDPPNLA +ER KLAFFAGSINS
Sbjct: 121  AEEVKFNAIQVVCSSSYYLSGYIAHKDASLPQVWPRQGDPPNLASSERQKLAFFAGSINS 180

Query: 1046 PVREKLLQVWENDSEISVHFGHLPTSYSEELLRSKFCLHVKGFEVNTARIGDALYYGCVP 1225
            PVRE+LLQVW NDSEI VH+G L TSY++ELL SKFCLHVKGFEVNTARI D+LYYGCVP
Sbjct: 181  PVRERLLQVWRNDSEIYVHYGRLNTSYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240

Query: 1226 VIIANHYDLPFQDILNWKSFSIVVATLDIPFLKKILQGISVEEYSILQNNVLKVRKHFQW 1405
            +IIANHYDLPF DILNW+SFS+VVATLDI +LKKILQG+S + Y +LQ+NVLKVRKHFQW
Sbjct: 241  IIIANHYDLPFTDILNWESFSVVVATLDILYLKKILQGVSSDRYVMLQSNVLKVRKHFQW 300

Query: 1406 HLSPVDYDAFYMVMYELWLRRSSSRI 1483
            H  PVDYDAF+MVMYELWLRRSS R+
Sbjct: 301  HFPPVDYDAFHMVMYELWLRRSSVRV 326


>ref|XP_006826373.1| hypothetical protein AMTR_s00004p00132530 [Amborella trichopoda]
            gi|548830687|gb|ERM93610.1| hypothetical protein
            AMTR_s00004p00132530 [Amborella trichopoda]
          Length = 453

 Score =  546 bits (1407), Expect = e-153
 Identities = 285/449 (63%), Positives = 330/449 (73%), Gaps = 2/449 (0%)
 Frame = +2

Query: 143  LKSLFFLMPTTLALATSLFILLYISSTSNLFFIYPHHLQLTHPTSGASINHEKPTKFFSV 322
            L  +F+ +PT LAL TSL I+  I+ TSN    Y   L L  P  G+ +  +K   F  +
Sbjct: 11   LPKIFYFIPTILALVTSLCIIYCINLTSN----YTGFL-LGKPYIGSFLI-QKRIPFLQI 64

Query: 323  SPPKIVGFRKIPRFAKKDGFLGHGEDQS-GDFKLHMGSHGSGMYFSDKELFHDRDVFFEN 499
             P  I    K+P         G+ E  S G+  L++G   +     +  +FHD+ VF E+
Sbjct: 65   -PNSIDIKTKVPLPDS-----GNSERLSEGELDLNIGKENN----INNGVFHDKMVFLED 114

Query: 500  YKEMNKSFKIYVYPHRQDDPFANVLLPVDFEPGGNYASESYFKKVLASSHFITKDPSTAD 679
            YK MNKS KIYVYPH +DD FANVLLPVDF+PGGNYASESYFKK L  SHFITKDP  A 
Sbjct: 115  YKAMNKSLKIYVYPHSKDDSFANVLLPVDFKPGGNYASESYFKKCLMKSHFITKDPKEAH 174

Query: 680  LFFLPFSIARLRHDPRVGINGIQDFIRDYIFNVSHVYSYWNRSGGADHFYVACHSIGRSA 859
            LFFLPFSIA LRHDPRVG++GIQDF+R YI+N+S  Y YWNRSGGADHFYVACHSIGRSA
Sbjct: 175  LFFLPFSIASLRHDPRVGVHGIQDFVRSYIYNISQAYPYWNRSGGADHFYVACHSIGRSA 234

Query: 860  MEKAVGVKLNAIQIVCXXXXXXXXXVAHKDASLPQIWPRQGDPPNL-ANERSKLAFFAGS 1036
            MEKAV VK NAIQ+VC         VAHKDAS+PQIWPR+GDPP   + +R KLAFFAGS
Sbjct: 235  MEKAVDVKFNAIQVVCSASYYLSGYVAHKDASMPQIWPREGDPPKAGSTKRDKLAFFAGS 294

Query: 1037 INSPVREKLLQVWENDSEISVHFGHLPTSYSEELLRSKFCLHVKGFEVNTARIGDALYYG 1216
             NSPVR+ LL+ W NDSEISVHFG+L   YS+ L  SKFCLHVKGFEVNTARI DAL+YG
Sbjct: 295  NNSPVRQNLLEHWRNDSEISVHFGNLSIPYSKALSHSKFCLHVKGFEVNTARIADALFYG 354

Query: 1217 CVPVIIANHYDLPFQDILNWKSFSIVVATLDIPFLKKILQGISVEEYSILQNNVLKVRKH 1396
            CVP++IANHYDLPF DIL+WK FS+VVATLDIP LK+IL  IS E+Y  LQ NVL+VRKH
Sbjct: 355  CVPIVIANHYDLPFTDILDWKKFSLVVATLDIPLLKEILHEISFEDYEELQRNVLEVRKH 414

Query: 1397 FQWHLSPVDYDAFYMVMYELWLRRSSSRI 1483
            FQWH  P +YDAFYMVMYELWLRR  +RI
Sbjct: 415  FQWHKVPENYDAFYMVMYELWLRRGLARI 443


>gb|EOY10570.1| Exostosin family protein [Theobroma cacao]
          Length = 478

 Score =  542 bits (1396), Expect = e-151
 Identities = 275/465 (59%), Positives = 340/465 (73%), Gaps = 13/465 (2%)
 Frame = +2

Query: 131  HTRTLKSLFFLMPTTLALATSLFILLYISSTSNLFFIYPHHLQLTHPTSGASINHE---- 298
            +T T K  FFL+P +LA  + L I +YI STS +F   P       P + +SI  +    
Sbjct: 16   YTPTFKIFFFLLPISLAFTSFLLIFIYIYSTSRVF-TNPQASPYLEPATNSSIFEQLFSF 74

Query: 299  ----KPTKFFSVSPPKIVGFRKIPR---FAKKDGF-LGHGEDQSGDFKLHMGSHGSGMYF 454
                + T  FS+       F  +PR   +AK++ + +G G D  G F        SG   
Sbjct: 75   STDNEETIPFSIDNTAEDLFFDLPRTASYAKQNQWSIGLG-DLFGLF--------SGYNM 125

Query: 455  SDKELFHDRDVFFENYKEMNKSFKIYVYPHRQDDPFANVLLPVDFEPGGNYASESYFKKV 634
            S+ E++HD D+F E+YKEMNKSFKI+VYPH+ DDPFANVLLPVDF+P G+YASE YFKK 
Sbjct: 126  SNTEIYHDTDIFLEDYKEMNKSFKIFVYPHKPDDPFANVLLPVDFDPKGHYASELYFKKA 185

Query: 635  LASSHFITKDPSTADLFFLPFSIARLRHDPRVGINGIQDFIRDYIFNVSHVYSYWNRSGG 814
            L +SHFITKDP+ AD F++PFSIA +RHDPR+G  G+QDFI+DYI N+SH Y YWNR+GG
Sbjct: 186  LVNSHFITKDPNEADFFYMPFSIADMRHDPRIGPEGLQDFIKDYISNISHKYPYWNRTGG 245

Query: 815  ADHFYVACHSIGRSAMEKAVGVKLNAIQIVCXXXXXXXXXVAHKDASLPQIWPRQGDPPN 994
            ADHF+VACHSIGR AM+KAV  K N+IQ+VC           HKD S+PQIWP++ DP  
Sbjct: 246  ADHFHVACHSIGRIAMDKAVEAKENSIQVVCSSTYFAAGYFPHKDVSMPQIWPKEQDPKK 305

Query: 995  L-ANERSKLAFFAGSINSPVREKLLQVWENDSEISVHFGHLPTSYSEELLRSKFCLHVKG 1171
            L +++R++LAFFAG +NSPVR  LL+ W ND+E+  HFG L T   E+ LRSKFCLHVKG
Sbjct: 306  LVSSKRNQLAFFAGQVNSPVRAALLKHWRNDTEVYAHFGRLETDDGEQQLRSKFCLHVKG 365

Query: 1172 FEVNTARIGDALYYGCVPVIIANHYDLPFQDILNWKSFSIVVATLDIPFLKKILQGISVE 1351
            FEVNTAR+ DAL+YGCVPVI+ANHYDLPF DILNWKSFS+VV  +DIP LKKILQGIS+E
Sbjct: 366  FEVNTARVTDALHYGCVPVILANHYDLPFADILNWKSFSVVVHYMDIPVLKKILQGISLE 425

Query: 1352 EYSILQNNVLKVRKHFQWHLSPVDYDAFYMVMYELWLRRSSSRIK 1486
            EYS LQ+NVLKVRKHF+W++ PVDYDAFYM MYELWLRRSS R++
Sbjct: 426  EYSWLQSNVLKVRKHFKWNVPPVDYDAFYMAMYELWLRRSSVRVR 470


>ref|XP_006604242.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X3
            [Glycine max]
          Length = 475

 Score =  540 bits (1390), Expect = e-151
 Identities = 262/361 (72%), Positives = 299/361 (82%), Gaps = 2/361 (0%)
 Frame = +2

Query: 434  HGSGMYFSDKELFHDRDVFFENYKEMNKSFKIYVYPHRQDDPFANVLLPVDFEPGGNYAS 613
            +GSG  F + ++FHDRDVF E+YKEMN+S KIYVYPHR+DDPFANVLLPV+ EPGGNY S
Sbjct: 117  YGSGK-FENNDVFHDRDVFLEDYKEMNRSLKIYVYPHREDDPFANVLLPVESEPGGNYTS 175

Query: 614  ESYFKKVLASSHFITKDPSTADLFFLPFSIARLRHDPRVGINGIQDFIRDYIFNVSHVYS 793
            ESYFKKVL  SHFITKDP  ADLFFLPFS+ARL HD RVG+ GIQDFIRDYI N+SH Y 
Sbjct: 176  ESYFKKVLMKSHFITKDPPEADLFFLPFSMARLWHDRRVGVGGIQDFIRDYIHNISHRYP 235

Query: 794  YWNRSGGADHFYVACHSIGRSAMEKAVGVKLNAIQIVCXXXXXXXXXVAHKDASLPQIWP 973
            YWN +GGADHFYVACHSIGRSAM+KA   K NAIQ+VC          AHKDA LPQIWP
Sbjct: 236  YWNNTGGADHFYVACHSIGRSAMDKAPDEKFNAIQVVCSSSYFLTGYFAHKDACLPQIWP 295

Query: 974  RQGDPPNL-ANERSKLAFFAGSINSPVREKLLQVWENDSEISVHFGHLPTSYSEELLRSK 1150
            R+G+PPNL +++R +LAFFAG +NSPVR KLL+ W+NDSEI VH G L T Y++ELL SK
Sbjct: 296  RKGNPPNLVSSKRKRLAFFAGGVNSPVRVKLLETWKNDSEIFVHHGRLKTPYADELLGSK 355

Query: 1151 FCLHVKGFEVNTARIGDALYYGCVPVIIANHYDLPFQDILNWKSFSIVVATLDIPFLKKI 1330
            FCLHVKGFEVNTARIGD+LYYGCVPVIIAN+YDLPF D+LNWKSFS+VV TLDIP LKKI
Sbjct: 356  FCLHVKGFEVNTARIGDSLYYGCVPVIIANYYDLPFADVLNWKSFSVVVTTLDIPLLKKI 415

Query: 1331 LQG-ISVEEYSILQNNVLKVRKHFQWHLSPVDYDAFYMVMYELWLRRSSSRIKA*VFVDN 1507
            L+  IS  +Y +LQ+NVLKVRKHFQWH  P D+DAFYMVMYELWLRRSS +     +VD+
Sbjct: 416  LKDIISSNKYLMLQSNVLKVRKHFQWHSPPQDFDAFYMVMYELWLRRSSIKN---TWVDS 472

Query: 1508 F 1510
            F
Sbjct: 473  F 473


>gb|EXB31256.1| putative glycosyltransferase [Morus notabilis]
          Length = 462

 Score =  535 bits (1379), Expect = e-149
 Identities = 255/350 (72%), Positives = 292/350 (83%), Gaps = 1/350 (0%)
 Frame = +2

Query: 437  GSGMYFSDKELFHDRDVFFENYKEMNKSFKIYVYPHRQDDPFANVLLPVDFEPGGNYASE 616
            GS  + +  E+FHDR +F E+Y+EM +SFKIYVYPHR+DDPFANVLLPVD +PGGNYASE
Sbjct: 105  GSAEHVNKYEVFHDRHIFQEDYEEMKRSFKIYVYPHRRDDPFANVLLPVDSKPGGNYASE 164

Query: 617  SYFKKVLASSHFITKDPSTADLFFLPFSIARLRHDPRVGINGIQDFIRDYIFNVSHVYSY 796
             YFK  L+ S F+T+DP+ ADLFFLPFSIARLRHDPRV + GI +F+RDYI NV   Y Y
Sbjct: 165  GYFKMALSKSRFVTEDPNKADLFFLPFSIARLRHDPRVSVGGIPEFVRDYISNVRRKYPY 224

Query: 797  WNRSGGADHFYVACHSIGRSAMEKAVGVKLNAIQIVCXXXXXXXXXVAHKDASLPQIWPR 976
            WNR+GGADHFYVACHSIGRSAMEKA  VKLNAIQIVC         ++HKDA LPQIWPR
Sbjct: 225  WNRTGGADHFYVACHSIGRSAMEKATEVKLNAIQIVCSSSYFVGSYISHKDACLPQIWPR 284

Query: 977  QGDPPN-LANERSKLAFFAGSINSPVREKLLQVWENDSEISVHFGHLPTSYSEELLRSKF 1153
            +GDPPN L++ R+KLAFFAG++NSPVR++L+QVW NDSEI VH G L T Y++ELL SKF
Sbjct: 285  EGDPPNLLSSNRTKLAFFAGAMNSPVRKQLVQVWRNDSEIFVHHGRLKTPYADELLGSKF 344

Query: 1154 CLHVKGFEVNTARIGDALYYGCVPVIIANHYDLPFQDILNWKSFSIVVATLDIPFLKKIL 1333
            CLH KGFEVNTARI D+LYYGCVPVI+AN+YDLPF DILNWKSFS+VVAT DIP LKKIL
Sbjct: 345  CLHAKGFEVNTARIADSLYYGCVPVILANYYDLPFIDILNWKSFSVVVATQDIPLLKKIL 404

Query: 1334 QGISVEEYSILQNNVLKVRKHFQWHLSPVDYDAFYMVMYELWLRRSSSRI 1483
            +GIS +EY  LQ NVLKVRKHF WH SP DYDAFYMVMYELWLRRS  R+
Sbjct: 405  RGISSDEYLRLQRNVLKVRKHFLWHPSPRDYDAFYMVMYELWLRRSLLRV 454


>gb|EMJ05978.1| hypothetical protein PRUPE_ppa015806mg [Prunus persica]
          Length = 331

 Score =  532 bits (1371), Expect = e-148
 Identities = 255/326 (78%), Positives = 280/326 (85%), Gaps = 1/326 (0%)
 Frame = +2

Query: 509  MNKSFKIYVYPHRQDDPFANVLLPVDFEPGGNYASESYFKKVLASSHFITKDPSTADLFF 688
            M +SFKIYVYPHRQDD FAN LLPVD EPGGNYASES+FKKVL  S FIT DP+ ADLFF
Sbjct: 1    MKRSFKIYVYPHRQDDSFANALLPVDSEPGGNYASESFFKKVLMKSRFITNDPTKADLFF 60

Query: 689  LPFSIARLRHDPRVGINGIQDFIRDYIFNVSHVYSYWNRSGGADHFYVACHSIGRSAMEK 868
            LPFSIARLRHDPRVG+ GIQDFIRDYIFNVS  Y YWNR+GGADHFYVACHSIGRSAMEK
Sbjct: 61   LPFSIARLRHDPRVGVGGIQDFIRDYIFNVSQKYQYWNRTGGADHFYVACHSIGRSAMEK 120

Query: 869  AVGVKLNAIQIVCXXXXXXXXXVAHKDASLPQIWPRQGDPPNL-ANERSKLAFFAGSINS 1045
            A  VK NAIQ+VC         + HKDA LPQIWPR+ +P +L ++ R+KLAFFAG INS
Sbjct: 121  ASEVKFNAIQVVCSSSYFLPGYIPHKDACLPQIWPRKEEPHDLLSSNRTKLAFFAGGINS 180

Query: 1046 PVREKLLQVWENDSEISVHFGHLPTSYSEELLRSKFCLHVKGFEVNTARIGDALYYGCVP 1225
            PVREKLLQVW NDSEI  HFG L T Y++ELL SKFCLHVKGFEVNTAR+ D+LYYGCVP
Sbjct: 181  PVREKLLQVWRNDSEIFAHFGRLTTPYADELLGSKFCLHVKGFEVNTARVADSLYYGCVP 240

Query: 1226 VIIANHYDLPFQDILNWKSFSIVVATLDIPFLKKILQGISVEEYSILQNNVLKVRKHFQW 1405
            VIIAN+YDLPF DILNWKSFS++VATLDIP LKKIL+GIS EEY+ LQ+NVLKVRKHFQW
Sbjct: 241  VIIANYYDLPFADILNWKSFSVIVATLDIPLLKKILKGISSEEYTRLQSNVLKVRKHFQW 300

Query: 1406 HLSPVDYDAFYMVMYELWLRRSSSRI 1483
            HLSP+DYDAFYMVMYELWLRRS S +
Sbjct: 301  HLSPIDYDAFYMVMYELWLRRSFSTV 326


>gb|EOY10571.1| Exostosin family protein [Theobroma cacao]
          Length = 465

 Score =  525 bits (1351), Expect = e-146
 Identities = 259/447 (57%), Positives = 319/447 (71%), Gaps = 3/447 (0%)
 Frame = +2

Query: 155  FFLMPTTLALATSLFILLYISSTSNLFFIYP--HHLQLTHPTSGASINHEKPTKFFSVSP 328
            FF +P +LA++T L I LYI  T++  F  P  +H Q + P S   +    P      + 
Sbjct: 24   FFFLPLSLAISTFLVIFLYIWCTNSNLFTDPQNNHYQESSPKSSL-LQQMIPFSLEKAAE 82

Query: 329  PKIVGFRKIPRFAKKDGFLGHGEDQSGDFKLHMGSHGSGMYFSDKELFHDRDVFFENYKE 508
                  R  P         G+    +  F L+      G Y ++ EL+HD D F ++YKE
Sbjct: 83   DMFYSSRSAPLSK------GNQWSMANPFGLY------GNYVNNTELYHDEDFFLQDYKE 130

Query: 509  MNKSFKIYVYPHRQDDPFANVLLPVDFEPGGNYASESYFKKVLASSHFITKDPSTADLFF 688
            MN+S K++VYPH +DDPFA+VLLPVD++P G+YASE YFKKVL+ SHFITK+PS ADLFF
Sbjct: 131  MNRSLKVFVYPHSRDDPFASVLLPVDYDPKGHYASELYFKKVLSKSHFITKNPSEADLFF 190

Query: 689  LPFSIARLRHDPRVGINGIQDFIRDYIFNVSHVYSYWNRSGGADHFYVACHSIGRSAMEK 868
            LPFSI  +RHDPR+G  G+QDFI+DYIFN+SH Y YWNR+ GADHFYVACHSIGR AM+K
Sbjct: 191  LPFSIVEMRHDPRIGPEGMQDFIKDYIFNISHKYPYWNRTDGADHFYVACHSIGRFAMDK 250

Query: 869  AVGVKLNAIQIVCXXXXXXXXXVAHKDASLPQIWPRQGDPPNLA-NERSKLAFFAGSINS 1045
                K N IQ+VC         + HKDAS+PQIWPRQ DPPN A ++R +LAFFAG+INS
Sbjct: 251  VFSAKFNVIQVVCSSSYFVAGYIPHKDASMPQIWPRQRDPPNSASSKRKQLAFFAGTINS 310

Query: 1046 PVREKLLQVWENDSEISVHFGHLPTSYSEELLRSKFCLHVKGFEVNTARIGDALYYGCVP 1225
            P R  L+Q W ND++I  HF  L T  +++LL SKFCLHVKGFEVNTAR+ DA+YYGCVP
Sbjct: 311  PARLALIQAWGNDTDIFAHFERLRTPDADQLLGSKFCLHVKGFEVNTARVADAIYYGCVP 370

Query: 1226 VIIANHYDLPFQDILNWKSFSIVVATLDIPFLKKILQGISVEEYSILQNNVLKVRKHFQW 1405
            VI+ANHYDLPF DI+NWKSFS+VV  +DIP LK ILQ IS+EEYS+LQ+N LKVRKHFQW
Sbjct: 371  VILANHYDLPFGDIINWKSFSVVVHYMDIPVLKNILQRISLEEYSLLQSNTLKVRKHFQW 430

Query: 1406 HLSPVDYDAFYMVMYELWLRRSSSRIK 1486
            +  P DYDAFY  MYELWLRRSS R++
Sbjct: 431  NDPPTDYDAFYTTMYELWLRRSSVRVR 457

