BLASTX nr result

ID: Akebia23_contig00024476 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00024476
         (1894 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI39073.3| unnamed protein product [Vitis vinifera]              655   0.0  
ref|XP_006443446.1| hypothetical protein CICLE_v10020045mg [Citr...   644   0.0  
ref|XP_004493174.1| PREDICTED: probable glycosyltransferase At5g...   628   e-177
ref|XP_006604241.1| PREDICTED: probable glycosyltransferase At5g...   625   e-176
ref|XP_007030071.1| Exostosin family protein isoform 2 [Theobrom...   623   e-176
ref|XP_007030070.1| Exostosin family protein isoform 1 [Theobrom...   623   e-176
ref|XP_006604240.1| PREDICTED: probable glycosyltransferase At5g...   619   e-174
ref|XP_006443447.1| hypothetical protein CICLE_v10020045mg [Citr...   597   e-168
ref|XP_006443448.1| hypothetical protein CICLE_v10020045mg [Citr...   595   e-167
ref|XP_003624636.1| Exostosin-like protein [Medicago truncatula]...   595   e-167
ref|XP_006604242.1| PREDICTED: probable glycosyltransferase At5g...   588   e-165
ref|XP_004296269.1| PREDICTED: probable glycosyltransferase At5g...   588   e-165
ref|XP_002265438.2| PREDICTED: probable glycosyltransferase At5g...   586   e-164
ref|XP_002525728.1| catalytic, putative [Ricinus communis] gi|22...   575   e-161
ref|XP_006826373.1| hypothetical protein AMTR_s00004p00132530 [A...   573   e-161
gb|EXB31256.1| putative glycosyltransferase [Morus notabilis]         572   e-160
ref|XP_007030069.1| Exostosin family protein [Theobroma cacao] g...   568   e-159
ref|XP_006346547.1| PREDICTED: probable glycosyltransferase At5g...   555   e-155
ref|XP_007204779.1| hypothetical protein PRUPE_ppa015806mg [Prun...   555   e-155
ref|XP_007030066.1| Exostosin family protein [Theobroma cacao] g...   551   e-154

>emb|CBI39073.3| unnamed protein product [Vitis vinifera]
          Length = 467

 Score =  655 bits (1691), Expect = 0.0
 Identities = 327/478 (68%), Positives = 378/478 (79%), Gaps = 3/478 (0%)
 Frame = +2

Query: 26   MVCSSSLFYYFSHRRLFDSFRTFFFIPTILALITSLFILIYISSTSKLFFVHQHHQSHLL 205
            M  S  + Y+FS RR  DSFR FFFIPTILALITSLFIL YISSTS LF   Q     +L
Sbjct: 1    MARSFFILYHFSGRRFSDSFRGFFFIPTILALITSLFILFYISSTSNLFTHPQETHLQVL 60

Query: 206  PKXXXXXXXXXXXTHQIIQSPSQNPPKTTFYEXXXXXXXXXXXXXXXXXXGTDENRRYWE 385
             K           +HQ ++ P++ P  +  +E                            
Sbjct: 61   -KSALGSSAFSPPSHQFMRVPAETPHLSRGFE---------------------------- 91

Query: 386  TGRPIGSNGKYVN---KEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFE 556
                  + G++V      + HD+++F+E+YKEMNRSFKIY YPH+ DDP+ANALLPV+FE
Sbjct: 92   ----FNTKGRFVLLWCTIIRHDRNLFVENYKEMNRSFKIYCYPHKRDDPFANALLPVDFE 147

Query: 557  PGGNYASESYFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYIS 736
            PGGNYASESYFKKVLMKSHFITKDPS+ADLFFLPFSIARLRHDPRVGV GI DFI+ YI 
Sbjct: 148  PGGNYASESYFKKVLMKSHFITKDPSKADLFFLPFSIARLRHDPRVGVGGIQDFIRDYIF 207

Query: 737  NISQNYPYWNRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDA 916
            NISQNYPYWN++ G DHFYVACHSIGRSAMEKA EVK NAIQVVCSSSYFLSGYIAHKDA
Sbjct: 208  NISQNYPYWNQTGGADHFYVACHSIGRSAMEKADEVKLNAIQVVCSSSYFLSGYIAHKDA 267

Query: 917  SLPQIWPRQGDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYS 1096
            SLPQIWPRQGDPP++A+S+RKKLAFFAG+INSPVR +LL+ W+NDSEI VHFGRLTTPY+
Sbjct: 268  SLPQIWPRQGDPPDLALSERKKLAFFAGSINSPVRERLLQVWRNDSEISVHFGRLTTPYA 327

Query: 1097 DELLGSKFCLHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLD 1276
            DELLGSKFCLHVKGFE+NTARI D++YYGCVPVIIANHYDLPFADILNWKSFS+VVATLD
Sbjct: 328  DELLGSKFCLHVKGFEINTARIADSLYYGCVPVIIANHYDLPFADILNWKSFSIVVATLD 387

Query: 1277 IPLLKKILREVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRV 1450
            IPLLK++L+ +S +EY  LQ NVLKV+ HF+W++SP+DYDAFYMV+YELWLRRSS+RV
Sbjct: 388  IPLLKQVLKGISLNEYLMLQSNVLKVRNHFQWHVSPVDYDAFYMVMYELWLRRSSVRV 445


>ref|XP_006443446.1| hypothetical protein CICLE_v10020045mg [Citrus clementina]
            gi|568850886|ref|XP_006479128.1| PREDICTED: probable
            glycosyltransferase At5g03795-like isoform X1 [Citrus
            sinensis] gi|557545708|gb|ESR56686.1| hypothetical
            protein CICLE_v10020045mg [Citrus clementina]
          Length = 465

 Score =  644 bits (1662), Expect = 0.0
 Identities = 320/475 (67%), Positives = 367/475 (77%)
 Frame = +2

Query: 26   MVCSSSLFYYFSHRRLFDSFRTFFFIPTILALITSLFILIYISSTSKLFFVHQHHQSHLL 205
            M  +SSL  YFS  R     +TFFFIPT LAL+++LFIL YIS+TS LFF   HHQ H  
Sbjct: 1    MANNSSLILYFSRNR--GLVKTFFFIPTTLALLSTLFILFYISTTSHLFF--NHHQRH-- 54

Query: 206  PKXXXXXXXXXXXTHQIIQSPSQNPPKTTFYEXXXXXXXXXXXXXXXXXXGTDENRRYWE 385
                          HQ+     +N P     +                  G   N+R   
Sbjct: 55   ------------HQHQLTPFILKNNPLPPPLKSSPVLVSLLNVSNNSHGDGRVRNQR--S 100

Query: 386  TGRPIGSNGKYVNKEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPGG 565
               P+ +NG  +NKEVFHD+DIFLEDYK+MNRSF++YVYPHR +DP+AN LLPV+FEP G
Sbjct: 101  VNVPMEANGNSMNKEVFHDRDIFLEDYKQMNRSFRVYVYPHRRNDPFANVLLPVDFEPRG 160

Query: 566  NYASESYFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNIS 745
            NYASESYFKKV MKSHF+TKDPS+ADLFFLPFSIAR+RHD R+G +GIPDFI  YI NIS
Sbjct: 161  NYASESYFKKVFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNIS 220

Query: 746  QNYPYWNRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASLP 925
            Q YPYWNR+ G DHFYVACHSIGRSAMEKA EVK NAIQVVCSSSYF+SG+IAHKD SLP
Sbjct: 221  QKYPYWNRTGGADHFYVACHSIGRSAMEKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLP 280

Query: 926  QIWPRQGDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDEL 1105
            QIWPRQ DPP +  SKR KLAFFAGA+NSPVR KLL+ W+NDSEI+ H GRL TPY+D L
Sbjct: 281  QIWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADGL 340

Query: 1106 LGSKFCLHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIPL 1285
            LGSKFCLHVKGFEVNTARI D++YYGCVPVIIANHYDLPFADILNWKSFS+VVATLDIPL
Sbjct: 341  LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADILNWKSFSIVVATLDIPL 400

Query: 1286 LKKILREVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRV 1450
            LKKIL+ +SS+EY  LQ NVLKV+KHF+W++ P DYDAFYMV+Y+LWLRRSS+RV
Sbjct: 401  LKKILKGISSEEYLLLQNNVLKVRKHFQWHVFPSDYDAFYMVMYDLWLRRSSVRV 455


>ref|XP_004493174.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cicer
            arietinum]
          Length = 472

 Score =  628 bits (1620), Expect = e-177
 Identities = 323/486 (66%), Positives = 364/486 (74%), Gaps = 10/486 (2%)
 Frame = +2

Query: 26   MVCSSSLFYYFSHRRLFDSFRTFFF-IPTILALI--TSLFILIYISSTSKLFFVHQHHQS 196
            MVC SSL  Y SH  +  SFR FFF IPT LAL+  TSL IL Y+ +TS +F    HHQ 
Sbjct: 1    MVCPSSLNQY-SHLHVAASFRNFFFFIPTTLALLFLTSLSILFYVYTTSIIFI--NHHQH 57

Query: 197  HLLPKXXXXXXXXXXXTHQIIQSPSQNP----PKTTFYEXXXXXXXXXXXXXXXXXXGTD 364
            H L             T Q   S S  P    P TT +                      
Sbjct: 58   HHLQS-----------TSQYFTSLSSLPVLLSPTTTLHNNASEFTKFQTFQLGHGLPPQS 106

Query: 365  ENRRYWETGRPIGSNGKYV---NKEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANA 535
            +       G P  SN       N  +FHD+D+FLEDYKEMNRSFKIYVYPHREDDP+AN 
Sbjct: 107  QR------GLPSQSNSTRKLEKNNNLFHDRDLFLEDYKEMNRSFKIYVYPHREDDPFANV 160

Query: 536  LLPVNFEPGGNYASESYFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPD 715
            LLP+  EPGGNYASESYFKKVLMKSHFIT DP+EADLFF+PFSIA LRHDPRVGV+GI D
Sbjct: 161  LLPMKHEPGGNYASESYFKKVLMKSHFITNDPTEADLFFMPFSIASLRHDPRVGVEGIQD 220

Query: 716  FIKTYISNISQNYPYWNRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSG 895
            FI+ Y+ NI   YPYWNR+ G DHFYVACHSIGRSAMEKA +VKFNAIQVVCSSSYFL+G
Sbjct: 221  FIRDYVQNIVHKYPYWNRTGGADHFYVACHSIGRSAMEKAPDVKFNAIQVVCSSSYFLTG 280

Query: 896  YIAHKDASLPQIWPRQGDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFG 1075
            YIAHKD  LPQIWPR+ +PPN+  S RKKLAFFAG +NSPVR KLLE WKNDSEIFVH G
Sbjct: 281  YIAHKDTCLPQIWPRKQNPPNLVSSNRKKLAFFAGGVNSPVRIKLLETWKNDSEIFVHHG 340

Query: 1076 RLTTPYSDELLGSKFCLHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFS 1255
            RL TPY+DELLGSKFCLHVKGFEVNTARIGD++YYGCVPVIIAN+YDLPFAD+LNWKSFS
Sbjct: 341  RLKTPYADELLGSKFCLHVKGFEVNTARIGDSLYYGCVPVIIANYYDLPFADVLNWKSFS 400

Query: 1256 LVVATLDIPLLKKILREVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRR 1435
            +VV TLDIPLLKKIL+ +SSDEY  LQRNVLKV+KHF+W+  P+D+DAFYMV+YELWLRR
Sbjct: 401  VVVTTLDIPLLKKILKGISSDEYLMLQRNVLKVRKHFQWHSPPIDFDAFYMVVYELWLRR 460

Query: 1436 SSMRVA 1453
            SS+ ++
Sbjct: 461  SSIIIS 466


>ref|XP_006604241.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X2
            [Glycine max]
          Length = 489

 Score =  625 bits (1611), Expect = e-176
 Identities = 312/463 (67%), Positives = 352/463 (76%), Gaps = 7/463 (1%)
 Frame = +2

Query: 80   SFRTFFFIPTILALITSLFILIYISSTSKLFFVHQHHQS--HLLPKXXXXXXXXXXXTHQ 253
            SFR FFFIPT LAL TS FIL YI STS +F  H HH S  H  P            T  
Sbjct: 32   SFRGFFFIPTTLALFTSFFILFYIYSTSNIFTHHNHHPSTSHFKPHPPFSTTPFIATTPH 91

Query: 254  IIQSPSQNPPKTTFYEXXXXXXXXXXXXXXXXXXGTDENRRYWETGRPI----GSNGKYV 421
             +   + N  ++T                     G    R     G P+     S GK+ 
Sbjct: 92   FVPVFNHNASEST---------KSPPTFQLGYGLGPQSQR-----GLPLPPQFSSKGKFE 137

Query: 422  NKEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASESYFKKVL 601
            N +VFHD+D+FLEDYKEMNRS KIYVYPHREDDP+AN LLPV  EPGGNY SESYFKKVL
Sbjct: 138  NNDVFHDRDVFLEDYKEMNRSLKIYVYPHREDDPFANVLLPVESEPGGNYTSESYFKKVL 197

Query: 602  MKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYWNRSSGT 781
            MKSHFITKDP EADLFFLPFS+ARL HD RVGV GI DFI+ YI NIS  YPYWN + G 
Sbjct: 198  MKSHFITKDPPEADLFFLPFSMARLWHDRRVGVGGIQDFIRDYIHNISHRYPYWNNTGGA 257

Query: 782  DHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPNV 961
            DHFYVACHSIGRSAM+KA + KFNAIQVVCSSSYFL+GY AHKDA LPQIWPR+G+PPN+
Sbjct: 258  DHFYVACHSIGRSAMDKAPDEKFNAIQVVCSSSYFLTGYFAHKDACLPQIWPRKGNPPNL 317

Query: 962  AISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFCLHVKGF 1141
              SKRK+LAFFAG +NSPVR KLLE WKNDSEIFVH GRL TPY+DELLGSKFCLHVKGF
Sbjct: 318  VSSKRKRLAFFAGGVNSPVRVKLLETWKNDSEIFVHHGRLKTPYADELLGSKFCLHVKGF 377

Query: 1142 EVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILRE-VSSD 1318
            EVNTARIGD++YYGCVPVIIAN+YDLPFAD+LNWKSFS+VV TLDIPLLKKIL++ +SS+
Sbjct: 378  EVNTARIGDSLYYGCVPVIIANYYDLPFADVLNWKSFSVVVTTLDIPLLKKILKDIISSN 437

Query: 1319 EYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMR 1447
            +Y  LQ NVLKV+KHF+W+  P D+DAFYMV+YELWLRRSS++
Sbjct: 438  KYLMLQSNVLKVRKHFQWHSPPQDFDAFYMVMYELWLRRSSIK 480


>ref|XP_007030071.1| Exostosin family protein isoform 2 [Theobroma cacao]
            gi|508718676|gb|EOY10573.1| Exostosin family protein
            isoform 2 [Theobroma cacao]
          Length = 496

 Score =  623 bits (1607), Expect = e-176
 Identities = 314/482 (65%), Positives = 368/482 (76%), Gaps = 6/482 (1%)
 Frame = +2

Query: 23   SMVCSSSLFYYFSHRRLFD-SFRTFFFIPTILALITSLFILIYISSTSKLFFVHQHHQSH 199
            +M  SS   YYFS RR+   S ++FFF+P  LALI+++FIL YI +TS LF  H H  + 
Sbjct: 14   AMARSSLPLYYFSPRRVSSPSSKSFFFVPATLALISTIFILFYIFTTSTLFTSHHHRHTL 73

Query: 200  LLPKXXXXXXXXXXXTHQIIQSP-SQNPPKTTFYEXXXXXXXXXXXXXXXXXX--GTDEN 370
             L +                 SP +QN P  + +                     G ++ 
Sbjct: 74   YLKQPLGSFP----------SSPLTQNVPSFSLHNNGFKNGTFDLPKRPPLKAVGGGEDA 123

Query: 371  RRYWETGRP-IGSNGKYVNK-EVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLP 544
                 T RP  GS G +VN  EVFHD DIFLEDYKEMN SFKIYVYP + +DP+A+ALLP
Sbjct: 124  TMSQVTSRPHFGSEGNFVNNLEVFHDGDIFLEDYKEMNNSFKIYVYPVKRNDPFAHALLP 183

Query: 545  VNFEPGGNYASESYFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIK 724
            V+FEPGGNYASESYFKK LMKSHFITKDP++ADLFFLPFSIARLRHD R+G  GI DFI+
Sbjct: 184  VDFEPGGNYASESYFKKALMKSHFITKDPTKADLFFLPFSIARLRHDRRIGTGGIQDFIR 243

Query: 725  TYISNISQNYPYWNRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIA 904
             YI NISQ YPYWNRS G DHFYVACHSIGRS M KA E+K NAIQ+VCSSSYFLSGYIA
Sbjct: 244  DYIFNISQKYPYWNRSGGADHFYVACHSIGRSVMAKARELKLNAIQIVCSSSYFLSGYIA 303

Query: 905  HKDASLPQIWPRQGDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLT 1084
            HKDASLPQ+WPR GDPPN+A SKR KL+FFAG+INSPVR KLL+ W+NDSEI  H+GRL 
Sbjct: 304  HKDASLPQVWPRTGDPPNLASSKRNKLSFFAGSINSPVREKLLKFWRNDSEIAAHYGRLK 363

Query: 1085 TPYSDELLGSKFCLHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVV 1264
            TPY+DELL SKFCLHVKGFEVNTARI D++YYGCVP+IIAN+YDLPFADILNWKSFS+VV
Sbjct: 364  TPYADELLSSKFCLHVKGFEVNTARIADSLYYGCVPIIIANYYDLPFADILNWKSFSIVV 423

Query: 1265 ATLDIPLLKKILREVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSM 1444
             T+DIP LK+ILR ++SDEY +LQRNVLKV+KHF+W++ P+D+DAFYMV+YELWLRRSS 
Sbjct: 424  VTVDIPSLKQILRGITSDEYLSLQRNVLKVRKHFQWHVPPIDFDAFYMVMYELWLRRSSA 483

Query: 1445 RV 1450
            R+
Sbjct: 484  RI 485


>ref|XP_007030070.1| Exostosin family protein isoform 1 [Theobroma cacao]
            gi|508718675|gb|EOY10572.1| Exostosin family protein
            isoform 1 [Theobroma cacao]
          Length = 492

 Score =  623 bits (1607), Expect = e-176
 Identities = 314/482 (65%), Positives = 368/482 (76%), Gaps = 6/482 (1%)
 Frame = +2

Query: 23   SMVCSSSLFYYFSHRRLFD-SFRTFFFIPTILALITSLFILIYISSTSKLFFVHQHHQSH 199
            +M  SS   YYFS RR+   S ++FFF+P  LALI+++FIL YI +TS LF  H H  + 
Sbjct: 14   AMARSSLPLYYFSPRRVSSPSSKSFFFVPATLALISTIFILFYIFTTSTLFTSHHHRHTL 73

Query: 200  LLPKXXXXXXXXXXXTHQIIQSP-SQNPPKTTFYEXXXXXXXXXXXXXXXXXX--GTDEN 370
             L +                 SP +QN P  + +                     G ++ 
Sbjct: 74   YLKQPLGSFP----------SSPLTQNVPSFSLHNNGFKNGTFDLPKRPPLKAVGGGEDA 123

Query: 371  RRYWETGRP-IGSNGKYVNK-EVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLP 544
                 T RP  GS G +VN  EVFHD DIFLEDYKEMN SFKIYVYP + +DP+A+ALLP
Sbjct: 124  TMSQVTSRPHFGSEGNFVNNLEVFHDGDIFLEDYKEMNNSFKIYVYPVKRNDPFAHALLP 183

Query: 545  VNFEPGGNYASESYFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIK 724
            V+FEPGGNYASESYFKK LMKSHFITKDP++ADLFFLPFSIARLRHD R+G  GI DFI+
Sbjct: 184  VDFEPGGNYASESYFKKALMKSHFITKDPTKADLFFLPFSIARLRHDRRIGTGGIQDFIR 243

Query: 725  TYISNISQNYPYWNRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIA 904
             YI NISQ YPYWNRS G DHFYVACHSIGRS M KA E+K NAIQ+VCSSSYFLSGYIA
Sbjct: 244  DYIFNISQKYPYWNRSGGADHFYVACHSIGRSVMAKARELKLNAIQIVCSSSYFLSGYIA 303

Query: 905  HKDASLPQIWPRQGDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLT 1084
            HKDASLPQ+WPR GDPPN+A SKR KL+FFAG+INSPVR KLL+ W+NDSEI  H+GRL 
Sbjct: 304  HKDASLPQVWPRTGDPPNLASSKRNKLSFFAGSINSPVREKLLKFWRNDSEIAAHYGRLK 363

Query: 1085 TPYSDELLGSKFCLHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVV 1264
            TPY+DELL SKFCLHVKGFEVNTARI D++YYGCVP+IIAN+YDLPFADILNWKSFS+VV
Sbjct: 364  TPYADELLSSKFCLHVKGFEVNTARIADSLYYGCVPIIIANYYDLPFADILNWKSFSIVV 423

Query: 1265 ATLDIPLLKKILREVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSM 1444
             T+DIP LK+ILR ++SDEY +LQRNVLKV+KHF+W++ P+D+DAFYMV+YELWLRRSS 
Sbjct: 424  VTVDIPSLKQILRGITSDEYLSLQRNVLKVRKHFQWHVPPIDFDAFYMVMYELWLRRSSA 483

Query: 1445 RV 1450
            R+
Sbjct: 484  RI 485


>ref|XP_006604240.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X1
            [Glycine max]
          Length = 500

 Score =  619 bits (1595), Expect = e-174
 Identities = 307/460 (66%), Positives = 350/460 (76%), Gaps = 4/460 (0%)
 Frame = +2

Query: 80   SFRTFFFIPTILALITSLFILIYISSTSKLFFVHQHHQS--HLLPKXXXXXXXXXXXTHQ 253
            SFR FFFIPT LAL TS FIL YI STS +F  H HH S  H  P            T  
Sbjct: 32   SFRGFFFIPTTLALFTSFFILFYIYSTSNIFTHHNHHPSTSHFKPHPPFSTTPFIATTPH 91

Query: 254  IIQSPSQNPPKTTFYEXXXXXXXXXXXXXXXXXXGTDE-NRRYWETGRPIGSNGKYVNKE 430
             +   + N  ++T                        + + +          +GK+ N +
Sbjct: 92   FVPVFNHNASESTKSPPTFQLGYGLGPQSQRGLPLPPQFSSKVCRECCVFYGSGKFENND 151

Query: 431  VFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASESYFKKVLMKS 610
            VFHD+D+FLEDYKEMNRS KIYVYPHREDDP+AN LLPV  EPGGNY SESYFKKVLMKS
Sbjct: 152  VFHDRDVFLEDYKEMNRSLKIYVYPHREDDPFANVLLPVESEPGGNYTSESYFKKVLMKS 211

Query: 611  HFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYWNRSSGTDHF 790
            HFITKDP EADLFFLPFS+ARL HD RVGV GI DFI+ YI NIS  YPYWN + G DHF
Sbjct: 212  HFITKDPPEADLFFLPFSMARLWHDRRVGVGGIQDFIRDYIHNISHRYPYWNNTGGADHF 271

Query: 791  YVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPNVAIS 970
            YVACHSIGRSAM+KA + KFNAIQVVCSSSYFL+GY AHKDA LPQIWPR+G+PPN+  S
Sbjct: 272  YVACHSIGRSAMDKAPDEKFNAIQVVCSSSYFLTGYFAHKDACLPQIWPRKGNPPNLVSS 331

Query: 971  KRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFCLHVKGFEVN 1150
            KRK+LAFFAG +NSPVR KLLE WKNDSEIFVH GRL TPY+DELLGSKFCLHVKGFEVN
Sbjct: 332  KRKRLAFFAGGVNSPVRVKLLETWKNDSEIFVHHGRLKTPYADELLGSKFCLHVKGFEVN 391

Query: 1151 TARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILRE-VSSDEYT 1327
            TARIGD++YYGCVPVIIAN+YDLPFAD+LNWKSFS+VV TLDIPLLKKIL++ +SS++Y 
Sbjct: 392  TARIGDSLYYGCVPVIIANYYDLPFADVLNWKSFSVVVTTLDIPLLKKILKDIISSNKYL 451

Query: 1328 TLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMR 1447
             LQ NVLKV+KHF+W+  P D+DAFYMV+YELWLRRSS++
Sbjct: 452  MLQSNVLKVRKHFQWHSPPQDFDAFYMVMYELWLRRSSIK 491


>ref|XP_006443447.1| hypothetical protein CICLE_v10020045mg [Citrus clementina]
            gi|568850888|ref|XP_006479129.1| PREDICTED: probable
            glycosyltransferase At5g03795-like isoform X2 [Citrus
            sinensis] gi|557545709|gb|ESR56687.1| hypothetical
            protein CICLE_v10020045mg [Citrus clementina]
          Length = 374

 Score =  597 bits (1540), Expect = e-168
 Identities = 277/349 (79%), Positives = 314/349 (89%)
 Frame = +2

Query: 404  SNGKYVNKEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASES 583
            ++G  +NKEVFHD+DIFLEDYK+MNRSF++YVYPHR +DP+AN LLPV+FEP GNYASES
Sbjct: 16   ASGNSMNKEVFHDRDIFLEDYKQMNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASES 75

Query: 584  YFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYW 763
            YFKKV MKSHF+TKDPS+ADLFFLPFSIAR+RHD R+G +GIPDFI  YI NISQ YPYW
Sbjct: 76   YFKKVFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYW 135

Query: 764  NRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQ 943
            NR+ G DHFYVACHSIGRSAMEKA EVK NAIQVVCSSSYF+SG+IAHKD SLPQIWPRQ
Sbjct: 136  NRTGGADHFYVACHSIGRSAMEKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ 195

Query: 944  GDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFC 1123
             DPP +  SKR KLAFFAGA+NSPVR KLL+ W+NDSEI+ H GRL TPY+D LLGSKFC
Sbjct: 196  EDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADGLLGSKFC 255

Query: 1124 LHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILR 1303
            LHVKGFEVNTARI D++YYGCVPVIIANHYDLPFADILNWKSFS+VVATLDIPLLKKIL+
Sbjct: 256  LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADILNWKSFSIVVATLDIPLLKKILK 315

Query: 1304 EVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRV 1450
             +SS+EY  LQ NVLKV+KHF+W++ P DYDAFYMV+Y+LWLRRSS+RV
Sbjct: 316  GISSEEYLLLQNNVLKVRKHFQWHVFPSDYDAFYMVMYDLWLRRSSVRV 364


>ref|XP_006443448.1| hypothetical protein CICLE_v10020045mg [Citrus clementina]
            gi|557545710|gb|ESR56688.1| hypothetical protein
            CICLE_v10020045mg [Citrus clementina]
          Length = 354

 Score =  595 bits (1534), Expect = e-167
 Identities = 276/344 (80%), Positives = 311/344 (90%)
 Frame = +2

Query: 419  VNKEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASESYFKKV 598
            +NKEVFHD+DIFLEDYK+MNRSF++YVYPHR +DP+AN LLPV+FEP GNYASESYFKKV
Sbjct: 1    MNKEVFHDRDIFLEDYKQMNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKV 60

Query: 599  LMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYWNRSSG 778
             MKSHF+TKDPS+ADLFFLPFSIAR+RHD R+G +GIPDFI  YI NISQ YPYWNR+ G
Sbjct: 61   FMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGG 120

Query: 779  TDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPN 958
             DHFYVACHSIGRSAMEKA EVK NAIQVVCSSSYF+SG+IAHKD SLPQIWPRQ DPP 
Sbjct: 121  ADHFYVACHSIGRSAMEKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK 180

Query: 959  VAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFCLHVKG 1138
            +  SKR KLAFFAGA+NSPVR KLL+ W+NDSEI+ H GRL TPY+D LLGSKFCLHVKG
Sbjct: 181  LGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADGLLGSKFCLHVKG 240

Query: 1139 FEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILREVSSD 1318
            FEVNTARI D++YYGCVPVIIANHYDLPFADILNWKSFS+VVATLDIPLLKKIL+ +SS+
Sbjct: 241  FEVNTARIADSLYYGCVPVIIANHYDLPFADILNWKSFSIVVATLDIPLLKKILKGISSE 300

Query: 1319 EYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRV 1450
            EY  LQ NVLKV+KHF+W++ P DYDAFYMV+Y+LWLRRSS+RV
Sbjct: 301  EYLLLQNNVLKVRKHFQWHVFPSDYDAFYMVMYDLWLRRSSVRV 344


>ref|XP_003624636.1| Exostosin-like protein [Medicago truncatula]
            gi|87162615|gb|ABD28410.1| Exostosin-like [Medicago
            truncatula] gi|116831751|gb|ABK28848.1| exostosin-like
            protein [Medicago truncatula] gi|355499651|gb|AES80854.1|
            Exostosin-like protein [Medicago truncatula]
          Length = 486

 Score =  595 bits (1533), Expect = e-167
 Identities = 302/479 (63%), Positives = 355/479 (74%), Gaps = 9/479 (1%)
 Frame = +2

Query: 41   SLFYYFSHRRLFDSFRTFFF-IPTILALITSLFILIYISSTSKLFFVHQHH---QSHLLP 208
            S  Y +SH  +  SF++FFF IPT LAL+TSL IL Y+  TS +F  H  H   QS L+ 
Sbjct: 5    SSLYQYSHTHVASSFKSFFFFIPTTLALLTSLSILFYVYYTSIIFTHHHQHNNQQSTLIN 64

Query: 209  -KXXXXXXXXXXXTHQIIQSPSQNPP---KTTFYEXXXXXXXXXXXXXXXXXXGTDENRR 376
             K           T  +  +   N     K+  ++                     +N+ 
Sbjct: 65   FKSSSPNFILPSPTPHLTNTLHNNHSEFIKSHTFQLGHGLGPQSQRGLPPQSSSNGQNKH 124

Query: 377  YWETGRPIGSNGKYVNKEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFE 556
              E     GS     N  VFHD+DIFLEDYKEMNRSFKIYVYPH++DDP+AN LLPV  E
Sbjct: 125  --ENSVFDGSRKFKENNNVFHDRDIFLEDYKEMNRSFKIYVYPHKKDDPFANVLLPVKTE 182

Query: 557  PGGNYASESYFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYIS 736
            P GNYASESYFKK LMKSHFITKDP++ADLFF+PFSIA LRHD RVGV GI DFI+ Y+ 
Sbjct: 183  PSGNYASESYFKKALMKSHFITKDPTKADLFFMPFSIASLRHDRRVGVGGIQDFIRDYVQ 242

Query: 737  NISQNYPYWNRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDA 916
            N+   YPYWNR++G DHFYVACHSIGRSAM+KA +VKFNAIQVVCSSSYFLSGYIAHKDA
Sbjct: 243  NMIHKYPYWNRTNGADHFYVACHSIGRSAMDKAPDVKFNAIQVVCSSSYFLSGYIAHKDA 302

Query: 917  SLPQIWPRQGDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYS 1096
             LPQIWPR  +PPN+  S RKKLAFFAG +NSPVR  L+E WKND+EIFVH GRL TPY 
Sbjct: 303  CLPQIWPRNENPPNLVSSNRKKLAFFAGEVNSPVRINLVETWKNDTEIFVHNGRLKTPYG 362

Query: 1097 DELLGSKFCLHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLD 1276
            DELLGSKFC HV+G+EVNTARIGD++YYGCVPVIIAN+YDLPFAD+LNWKSFS+VV TLD
Sbjct: 363  DELLGSKFCFHVRGYEVNTARIGDSLYYGCVPVIIANYYDLPFADVLNWKSFSVVVTTLD 422

Query: 1277 IPLLKKILRE-VSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRV 1450
            IPLLKKIL+  V+S EY  LQ+NVLKV++HF+W+  P+D+DAFYMV+YELWLRRSS+ +
Sbjct: 423  IPLLKKILKGIVNSGEYLMLQKNVLKVREHFQWHSPPIDFDAFYMVMYELWLRRSSIPI 481


>ref|XP_006604242.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X3
            [Glycine max]
          Length = 475

 Score =  588 bits (1516), Expect = e-165
 Identities = 275/348 (79%), Positives = 310/348 (89%), Gaps = 1/348 (0%)
 Frame = +2

Query: 407  NGKYVNKEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASESY 586
            +GK+ N +VFHD+D+FLEDYKEMNRS KIYVYPHREDDP+AN LLPV  EPGGNY SESY
Sbjct: 119  SGKFENNDVFHDRDVFLEDYKEMNRSLKIYVYPHREDDPFANVLLPVESEPGGNYTSESY 178

Query: 587  FKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYWN 766
            FKKVLMKSHFITKDP EADLFFLPFS+ARL HD RVGV GI DFI+ YI NIS  YPYWN
Sbjct: 179  FKKVLMKSHFITKDPPEADLFFLPFSMARLWHDRRVGVGGIQDFIRDYIHNISHRYPYWN 238

Query: 767  RSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQG 946
             + G DHFYVACHSIGRSAM+KA + KFNAIQVVCSSSYFL+GY AHKDA LPQIWPR+G
Sbjct: 239  NTGGADHFYVACHSIGRSAMDKAPDEKFNAIQVVCSSSYFLTGYFAHKDACLPQIWPRKG 298

Query: 947  DPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFCL 1126
            +PPN+  SKRK+LAFFAG +NSPVR KLLE WKNDSEIFVH GRL TPY+DELLGSKFCL
Sbjct: 299  NPPNLVSSKRKRLAFFAGGVNSPVRVKLLETWKNDSEIFVHHGRLKTPYADELLGSKFCL 358

Query: 1127 HVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILRE 1306
            HVKGFEVNTARIGD++YYGCVPVIIAN+YDLPFAD+LNWKSFS+VV TLDIPLLKKIL++
Sbjct: 359  HVKGFEVNTARIGDSLYYGCVPVIIANYYDLPFADVLNWKSFSVVVTTLDIPLLKKILKD 418

Query: 1307 -VSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMR 1447
             +SS++Y  LQ NVLKV+KHF+W+  P D+DAFYMV+YELWLRRSS++
Sbjct: 419  IISSNKYLMLQSNVLKVRKHFQWHSPPQDFDAFYMVMYELWLRRSSIK 466


>ref|XP_004296269.1| PREDICTED: probable glycosyltransferase At5g03795-like [Fragaria
            vesca subsp. vesca]
          Length = 439

 Score =  588 bits (1515), Expect = e-165
 Identities = 305/475 (64%), Positives = 343/475 (72%), Gaps = 7/475 (1%)
 Frame = +2

Query: 35   SSSLFYYFSHRRLFDSFRTFFFIPTILALITSLFILIYISSTSKLFFVH------QHHQS 196
            +S +  Y S  RL    R+FFFIPT LAL TSL IL YIS+TS LF  H           
Sbjct: 2    ASLVLLYLSQWRLP---RSFFFIPTTLALATSLLILFYISTTSNLFPHHPPLPNLSSFAP 58

Query: 197  HLLPKXXXXXXXXXXXTHQIIQSPSQNPPKTTFYEXXXXXXXXXXXXXXXXXXGTDENRR 376
            HL P                 QS    PP +                             
Sbjct: 59   HLYP----------------FQSQRSLPPNSA---------------------------- 74

Query: 377  YWETGRPIGSNGKYVNK-EVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNF 553
                      NG Y N  EVFHD  IF++DYKEM RSFKIYVYPHR+DDP+ANALLPV+F
Sbjct: 75   ---------PNGNYDNNNEVFHDTHIFVQDYKEMKRSFKIYVYPHRKDDPFANALLPVDF 125

Query: 554  EPGGNYASESYFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYI 733
            EP GNYASESYFKKVLM+SHFIT DP++A LFFLPFSIARLRHDPRVGV GI DFI+ Y+
Sbjct: 126  EPAGNYASESYFKKVLMESHFITNDPTQAQLFFLPFSIARLRHDPRVGVGGIQDFIRDYM 185

Query: 734  SNISQNYPYWNRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKD 913
             NIS  Y YWNR+ G DHFYVACHSIGRSAMEKAT+VKFNAIQ+VCSSSYFLSGYIAHKD
Sbjct: 186  FNISHKYEYWNRTGGADHFYVACHSIGRSAMEKATQVKFNAIQLVCSSSYFLSGYIAHKD 245

Query: 914  ASLPQIWPRQGDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPY 1093
            A LPQIWPR+ DPPN+  S R KLAFFAG INSPVR +LL+ W+NDSEIFV+FGRL T Y
Sbjct: 246  ACLPQIWPRKQDPPNLLSSNRTKLAFFAGGINSPVRERLLQVWRNDSEIFVNFGRLKTSY 305

Query: 1094 SDELLGSKFCLHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATL 1273
            +D LLGS FCLHVKGFEVNTARI D++YYGCVPVIIAN+YDLPFADILNWKSFS+VVATL
Sbjct: 306  ADALLGSMFCLHVKGFEVNTARIADSLYYGCVPVIIANYYDLPFADILNWKSFSVVVATL 365

Query: 1274 DIPLLKKILREVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRS 1438
            DIPLLK IL+ + SDEY  L+ NV KV+  F+W+LSP+DYDAF+MV+YELWLRRS
Sbjct: 366  DIPLLKNILKGIRSDEYMRLRNNVFKVRNQFQWHLSPIDYDAFHMVMYELWLRRS 420


>ref|XP_002265438.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
            vinifera]
          Length = 336

 Score =  586 bits (1510), Expect = e-164
 Identities = 274/326 (84%), Positives = 306/326 (93%)
 Frame = +2

Query: 473  MNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASESYFKKVLMKSHFITKDPSEADLFF 652
            MNRSFKIY YPH+ DDP+ANALLPV+FEPGGNYASESYFKKVLMKSHFITKDPS+ADLFF
Sbjct: 1    MNRSFKIYCYPHKRDDPFANALLPVDFEPGGNYASESYFKKVLMKSHFITKDPSKADLFF 60

Query: 653  LPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYWNRSSGTDHFYVACHSIGRSAMEK 832
            LPFSIARLRHDPRVGV GI DFI+ YI NISQNYPYWN++ G DHFYVACHSIGRSAMEK
Sbjct: 61   LPFSIARLRHDPRVGVGGIQDFIRDYIFNISQNYPYWNQTGGADHFYVACHSIGRSAMEK 120

Query: 833  ATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPNVAISKRKKLAFFAGAINS 1012
            A EVK NAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPP++A+S+RKKLAFFAG+INS
Sbjct: 121  ADEVKLNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPDLALSERKKLAFFAGSINS 180

Query: 1013 PVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFCLHVKGFEVNTARIGDAIYYGCVP 1192
            PVR +LL+ W+NDSEI VHFGRLTTPY+DELLGSKFCLHVKGFE+NTARI D++YYGCVP
Sbjct: 181  PVRERLLQVWRNDSEISVHFGRLTTPYADELLGSKFCLHVKGFEINTARIADSLYYGCVP 240

Query: 1193 VIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILREVSSDEYTTLQRNVLKVQKHFRW 1372
            VIIANHYDLPFADILNWKSFS+VVATLDIPLLK++L+ +S +EY  LQ NVLKV+ HF+W
Sbjct: 241  VIIANHYDLPFADILNWKSFSIVVATLDIPLLKQVLKGISLNEYLMLQSNVLKVRNHFQW 300

Query: 1373 NLSPMDYDAFYMVIYELWLRRSSMRV 1450
            ++SP+DYDAFYMV+YELWLRRSS+RV
Sbjct: 301  HVSPVDYDAFYMVMYELWLRRSSVRV 326


>ref|XP_002525728.1| catalytic, putative [Ricinus communis] gi|223535028|gb|EEF36711.1|
            catalytic, putative [Ricinus communis]
          Length = 336

 Score =  575 bits (1481), Expect = e-161
 Identities = 266/326 (81%), Positives = 303/326 (92%)
 Frame = +2

Query: 473  MNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASESYFKKVLMKSHFITKDPSEADLFF 652
            MNRSFKIYVYPHR++DP+AN LLPV+FEPGGNYASESYFKKVLMKSHFITKDP++ADLFF
Sbjct: 1    MNRSFKIYVYPHRQNDPFANVLLPVDFEPGGNYASESYFKKVLMKSHFITKDPTKADLFF 60

Query: 653  LPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYWNRSSGTDHFYVACHSIGRSAMEK 832
            LPFSIARLRHDPR+GV+GI DFI+ Y+ NISQ YPYWNR+ GTDHFYVACHSIGR+AMEK
Sbjct: 61   LPFSIARLRHDPRIGVEGIQDFIRAYVYNISQKYPYWNRTGGTDHFYVACHSIGRTAMEK 120

Query: 833  ATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPNVAISKRKKLAFFAGAINS 1012
            A EVKFNAIQVVCSSSY+LSGYIAHKDASLPQ+WPRQGDPPN+A S+R+KLAFFAG+INS
Sbjct: 121  AEEVKFNAIQVVCSSSYYLSGYIAHKDASLPQVWPRQGDPPNLASSERQKLAFFAGSINS 180

Query: 1013 PVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFCLHVKGFEVNTARIGDAIYYGCVP 1192
            PVR +LL+ W+NDSEI+VH+GRL T Y+DELLGSKFCLHVKGFEVNTARI D++YYGCVP
Sbjct: 181  PVRERLLQVWRNDSEIYVHYGRLNTSYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240

Query: 1193 VIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILREVSSDEYTTLQRNVLKVQKHFRW 1372
            +IIANHYDLPF DILNW+SFS+VVATLDI  LKKIL+ VSSD Y  LQ NVLKV+KHF+W
Sbjct: 241  IIIANHYDLPFTDILNWESFSVVVATLDILYLKKILQGVSSDRYVMLQSNVLKVRKHFQW 300

Query: 1373 NLSPMDYDAFYMVIYELWLRRSSMRV 1450
            +  P+DYDAF+MV+YELWLRRSS+RV
Sbjct: 301  HFPPVDYDAFHMVMYELWLRRSSVRV 326


>ref|XP_006826373.1| hypothetical protein AMTR_s00004p00132530 [Amborella trichopoda]
            gi|548830687|gb|ERM93610.1| hypothetical protein
            AMTR_s00004p00132530 [Amborella trichopoda]
          Length = 453

 Score =  573 bits (1477), Expect = e-161
 Identities = 294/472 (62%), Positives = 339/472 (71%)
 Frame = +2

Query: 59   SHRRLFDSFRTFFFIPTILALITSLFILIYISSTSKLFFVHQHHQSHLLPKXXXXXXXXX 238
            SH  L    + F+FIPTILAL+TSL I+  I+ TS       ++   LL K         
Sbjct: 4    SHPGLNGLPKIFYFIPTILALVTSLCIIYCINLTS-------NYTGFLLGKPYIGSFLIQ 56

Query: 239  XXTHQIIQSPSQNPPKTTFYEXXXXXXXXXXXXXXXXXXGTDENRRYWETGRPIGSNGKY 418
                  +Q P+    KT                      G  E     E    IG     
Sbjct: 57   KRI-PFLQIPNSIDIKTKV---------------PLPDSGNSERLSEGELDLNIGKENN- 99

Query: 419  VNKEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASESYFKKV 598
            +N  VFHDK +FLEDYK MN+S KIYVYPH +DD +AN LLPV+F+PGGNYASESYFKK 
Sbjct: 100  INNGVFHDKMVFLEDYKAMNKSLKIYVYPHSKDDSFANVLLPVDFKPGGNYASESYFKKC 159

Query: 599  LMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYWNRSSG 778
            LMKSHFITKDP EA LFFLPFSIA LRHDPRVGV GI DF+++YI NISQ YPYWNRS G
Sbjct: 160  LMKSHFITKDPKEAHLFFLPFSIASLRHDPRVGVHGIQDFVRSYIYNISQAYPYWNRSGG 219

Query: 779  TDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPN 958
             DHFYVACHSIGRSAMEKA +VKFNAIQVVCS+SY+LSGY+AHKDAS+PQIWPR+GDPP 
Sbjct: 220  ADHFYVACHSIGRSAMEKAVDVKFNAIQVVCSASYYLSGYVAHKDASMPQIWPREGDPPK 279

Query: 959  VAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFCLHVKG 1138
               +KR KLAFFAG+ NSPVR  LLE W+NDSEI VHFG L+ PYS  L  SKFCLHVKG
Sbjct: 280  AGSTKRDKLAFFAGSNNSPVRQNLLEHWRNDSEISVHFGNLSIPYSKALSHSKFCLHVKG 339

Query: 1139 FEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILREVSSD 1318
            FEVNTARI DA++YGCVP++IANHYDLPF DIL+WK FSLVVATLDIPLLK+IL E+S +
Sbjct: 340  FEVNTARIADALFYGCVPIVIANHYDLPFTDILDWKKFSLVVATLDIPLLKEILHEISFE 399

Query: 1319 EYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRVA*RESVNL 1474
            +Y  LQRNVL+V+KHF+W+  P +YDAFYMV+YELWLRR   R+   ES  L
Sbjct: 400  DYEELQRNVLEVRKHFQWHKVPENYDAFYMVMYELWLRRGLARIPVPESNQL 451


>gb|EXB31256.1| putative glycosyltransferase [Morus notabilis]
          Length = 462

 Score =  572 bits (1474), Expect = e-160
 Identities = 267/349 (76%), Positives = 306/349 (87%), Gaps = 1/349 (0%)
 Frame = +2

Query: 407  NGKYVNK-EVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASES 583
            + ++VNK EVFHD+ IF EDY+EM RSFKIYVYPHR DDP+AN LLPV+ +PGGNYASE 
Sbjct: 106  SAEHVNKYEVFHDRHIFQEDYEEMKRSFKIYVYPHRRDDPFANVLLPVDSKPGGNYASEG 165

Query: 584  YFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYW 763
            YFK  L KS F+T+DP++ADLFFLPFSIARLRHDPRV V GIP+F++ YISN+ + YPYW
Sbjct: 166  YFKMALSKSRFVTEDPNKADLFFLPFSIARLRHDPRVSVGGIPEFVRDYISNVRRKYPYW 225

Query: 764  NRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQ 943
            NR+ G DHFYVACHSIGRSAMEKATEVK NAIQ+VCSSSYF+  YI+HKDA LPQIWPR+
Sbjct: 226  NRTGGADHFYVACHSIGRSAMEKATEVKLNAIQIVCSSSYFVGSYISHKDACLPQIWPRE 285

Query: 944  GDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFC 1123
            GDPPN+  S R KLAFFAGA+NSPVR +L++ W+NDSEIFVH GRL TPY+DELLGSKFC
Sbjct: 286  GDPPNLLSSNRTKLAFFAGAMNSPVRKQLVQVWRNDSEIFVHHGRLKTPYADELLGSKFC 345

Query: 1124 LHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILR 1303
            LH KGFEVNTARI D++YYGCVPVI+AN+YDLPF DILNWKSFS+VVAT DIPLLKKILR
Sbjct: 346  LHAKGFEVNTARIADSLYYGCVPVILANYYDLPFIDILNWKSFSVVVATQDIPLLKKILR 405

Query: 1304 EVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRV 1450
             +SSDEY  LQRNVLKV+KHF W+ SP DYDAFYMV+YELWLRRS +RV
Sbjct: 406  GISSDEYLRLQRNVLKVRKHFLWHPSPRDYDAFYMVMYELWLRRSLLRV 454


>ref|XP_007030069.1| Exostosin family protein [Theobroma cacao]
            gi|508718674|gb|EOY10571.1| Exostosin family protein
            [Theobroma cacao]
          Length = 465

 Score =  568 bits (1463), Expect = e-159
 Identities = 282/479 (58%), Positives = 335/479 (69%), Gaps = 4/479 (0%)
 Frame = +2

Query: 26   MVCSSSLFYYFSHRRLFDSFR-TFFFIPTILALITSLFILIYISSTSKLFFV--HQHHQS 196
            M  SSSL Y+ S  R   SF  +FFF+P  LA+ T L I +YI  T+   F     +H  
Sbjct: 1    MAKSSSLCYHISQHRFSTSFGGSFFFLPLSLAISTFLVIFLYIWCTNSNLFTDPQNNHYQ 60

Query: 197  HLLPKXXXXXXXXXXXTHQIIQSPSQNPPKTTFYEXXXXXXXXXXXXXXXXXXGTDENRR 376
               PK             Q+I    +   +  FY                          
Sbjct: 61   ESSPKSSLL--------QQMIPFSLEKAAEDMFYSSRSAPL---------------SKGN 97

Query: 377  YWETGRPIGSNGKYVNK-EVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNF 553
             W    P G  G YVN  E++HD+D FL+DYKEMNRS K++VYPH  DDP+A+ LLPV++
Sbjct: 98   QWSMANPFGLYGNYVNNTELYHDEDFFLQDYKEMNRSLKVFVYPHSRDDPFASVLLPVDY 157

Query: 554  EPGGNYASESYFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYI 733
            +P G+YASE YFKKVL KSHFITK+PSEADLFFLPFSI  +RHDPR+G +G+ DFIK YI
Sbjct: 158  DPKGHYASELYFKKVLSKSHFITKNPSEADLFFLPFSIVEMRHDPRIGPEGMQDFIKDYI 217

Query: 734  SNISQNYPYWNRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKD 913
             NIS  YPYWNR+ G DHFYVACHSIGR AM+K    KFN IQVVCSSSYF++GYI HKD
Sbjct: 218  FNISHKYPYWNRTDGADHFYVACHSIGRFAMDKVFSAKFNVIQVVCSSSYFVAGYIPHKD 277

Query: 914  ASLPQIWPRQGDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPY 1093
            AS+PQIWPRQ DPPN A SKRK+LAFFAG INSP R  L++ W ND++IF HF RL TP 
Sbjct: 278  ASMPQIWPRQRDPPNSASSKRKQLAFFAGTINSPARLALIQAWGNDTDIFAHFERLRTPD 337

Query: 1094 SDELLGSKFCLHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATL 1273
            +D+LLGSKFCLHVKGFEVNTAR+ DAIYYGCVPVI+ANHYDLPF DI+NWKSFS+VV  +
Sbjct: 338  ADQLLGSKFCLHVKGFEVNTARVADAIYYGCVPVILANHYDLPFGDIINWKSFSVVVHYM 397

Query: 1274 DIPLLKKILREVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRV 1450
            DIP+LK IL+ +S +EY+ LQ N LKV+KHF+WN  P DYDAFY  +YELWLRRSS+RV
Sbjct: 398  DIPVLKNILQRISLEEYSLLQSNTLKVRKHFQWNDPPTDYDAFYTTMYELWLRRSSVRV 456


>ref|XP_006346547.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X1
            [Solanum tuberosum]
          Length = 452

 Score =  555 bits (1429), Expect = e-155
 Identities = 260/349 (74%), Positives = 303/349 (86%), Gaps = 1/349 (0%)
 Frame = +2

Query: 407  NGKYVN-KEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASES 583
            NG +VN  +VFHD+D F+++YKEMNRS KIYVYPH++DDP++N LL V+FEPGGNYASES
Sbjct: 103  NGNHVNDNDVFHDRDAFVDNYKEMNRSLKIYVYPHQKDDPFSNVLLAVDFEPGGNYASES 162

Query: 584  YFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYW 763
            YFKKVL  SHFIT+DPS ADLFFLPFSIARLRHDPRVG+ GI DFIK+YI NIS  YPYW
Sbjct: 163  YFKKVLKMSHFITRDPSNADLFFLPFSIARLRHDPRVGINGIKDFIKSYIFNISHEYPYW 222

Query: 764  NRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQ 943
            N ++G DHFYVACHSIGR AMEK  +VK N IQVVC+SSYF+S YI HKDASLPQIWPR 
Sbjct: 223  NLTNGADHFYVACHSIGRFAMEKVVDVKINVIQVVCTSSYFVSAYIPHKDASLPQIWPRL 282

Query: 944  GDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFC 1123
            G  P+ A  KRKKL FFAG++NSPVR KLLE W NDS+IFVH GRL   Y++ELLGSKFC
Sbjct: 283  GGNPDFAPYKRKKLGFFAGSLNSPVREKLLEWWGNDSDIFVHSGRLERSYTEELLGSKFC 342

Query: 1124 LHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILR 1303
            LHVKGFEVNTARI DA++YGCVPVIIANHYDLPFADIL+WK FS++VATLDIPLLKKIL+
Sbjct: 343  LHVKGFEVNTARIVDALFYGCVPVIIANHYDLPFADILDWKHFSVIVATLDIPLLKKILQ 402

Query: 1304 EVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRV 1450
             ++  EY  LQ NVLKV++HF+W++SP+D+DAFYMV+YELWLRRSS+R+
Sbjct: 403  GITQQEYLVLQSNVLKVREHFQWHVSPIDFDAFYMVMYELWLRRSSLRL 451


>ref|XP_007204779.1| hypothetical protein PRUPE_ppa015806mg [Prunus persica]
            gi|462400310|gb|EMJ05978.1| hypothetical protein
            PRUPE_ppa015806mg [Prunus persica]
          Length = 331

 Score =  555 bits (1429), Expect = e-155
 Identities = 261/322 (81%), Positives = 292/322 (90%)
 Frame = +2

Query: 473  MNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASESYFKKVLMKSHFITKDPSEADLFF 652
            M RSFKIYVYPHR+DD +ANALLPV+ EPGGNYASES+FKKVLMKS FIT DP++ADLFF
Sbjct: 1    MKRSFKIYVYPHRQDDSFANALLPVDSEPGGNYASESFFKKVLMKSRFITNDPTKADLFF 60

Query: 653  LPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYWNRSSGTDHFYVACHSIGRSAMEK 832
            LPFSIARLRHDPRVGV GI DFI+ YI N+SQ Y YWNR+ G DHFYVACHSIGRSAMEK
Sbjct: 61   LPFSIARLRHDPRVGVGGIQDFIRDYIFNVSQKYQYWNRTGGADHFYVACHSIGRSAMEK 120

Query: 833  ATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPNVAISKRKKLAFFAGAINS 1012
            A+EVKFNAIQVVCSSSYFL GYI HKDA LPQIWPR+ +P ++  S R KLAFFAG INS
Sbjct: 121  ASEVKFNAIQVVCSSSYFLPGYIPHKDACLPQIWPRKEEPHDLLSSNRTKLAFFAGGINS 180

Query: 1013 PVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFCLHVKGFEVNTARIGDAIYYGCVP 1192
            PVR KLL+ W+NDSEIF HFGRLTTPY+DELLGSKFCLHVKGFEVNTAR+ D++YYGCVP
Sbjct: 181  PVREKLLQVWRNDSEIFAHFGRLTTPYADELLGSKFCLHVKGFEVNTARVADSLYYGCVP 240

Query: 1193 VIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILREVSSDEYTTLQRNVLKVQKHFRW 1372
            VIIAN+YDLPFADILNWKSFS++VATLDIPLLKKIL+ +SS+EYT LQ NVLKV+KHF+W
Sbjct: 241  VIIANYYDLPFADILNWKSFSVIVATLDIPLLKKILKGISSEEYTRLQSNVLKVRKHFQW 300

Query: 1373 NLSPMDYDAFYMVIYELWLRRS 1438
            +LSP+DYDAFYMV+YELWLRRS
Sbjct: 301  HLSPIDYDAFYMVMYELWLRRS 322


>ref|XP_007030066.1| Exostosin family protein [Theobroma cacao]
            gi|508718671|gb|EOY10568.1| Exostosin family protein
            [Theobroma cacao]
          Length = 473

 Score =  551 bits (1420), Expect = e-154
 Identities = 270/476 (56%), Positives = 342/476 (71%), Gaps = 4/476 (0%)
 Frame = +2

Query: 35   SSSLFYYFSHRRLFDSFRTFF-FIPTILALITSLFILIYISSTSKLFFVHQHHQSHLLPK 211
            SSS  Y  S  R   +F+ FF F+P  LAL T L I IYIS+T  +   + H Q+ L  +
Sbjct: 4    SSSFLYQVSQHRFPATFKGFFYFLPISLALTTLLLIFIYISTTGDV--TNNHAQTTLYLE 61

Query: 212  XXXXXXXXXXXTHQIIQS-PSQNPPKTTFYEXXXXXXXXXXXXXXXXXXGTDENRRYWET 388
                         Q I + P +N      +                           W  
Sbjct: 62   TLPGTASVSSLVDQTIPTIPFENNDNDDLFADPSRMARLA-------------RANQWFL 108

Query: 389  GRPIG-SNGKYVN-KEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPG 562
            G   G +NG Y N +EV+HD D+FLEDYK+MN+S KIYVYPH +DDP+AN LLP + +  
Sbjct: 109  GNLFGLTNGNYTNNQEVYHDGDLFLEDYKQMNKSLKIYVYPHSKDDPFANVLLPPDSDSK 168

Query: 563  GNYASESYFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNI 742
            GNYASE  FKK LMKSHFITKDP+EADLF++PFSI+ +R DPR+ V GIPDF+K+YISNI
Sbjct: 169  GNYASELMFKKALMKSHFITKDPNEADLFYMPFSISPMRTDPRIDVHGIPDFVKSYISNI 228

Query: 743  SQNYPYWNRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASL 922
            ++ YPYWNR+ G DHFYVACHSIG+ A +KA   + N IQ+VCSS+YF S Y+ HKDAS+
Sbjct: 229  TRKYPYWNRTGGADHFYVACHSIGKIAFDKAFVARLNVIQLVCSSTYFPSSYLPHKDASM 288

Query: 923  PQIWPRQGDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDE 1102
            PQ+WPRQGDPPN+  S+RK+LAFFAGA+NSPVR  LL+ W ND+EIF HFGRL TPYS++
Sbjct: 289  PQVWPRQGDPPNLLTSERKRLAFFAGAVNSPVRIALLKVWANDTEIFAHFGRLRTPYSEQ 348

Query: 1103 LLGSKFCLHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIP 1282
            LLGSKFC+HVKG+EVNTAR+ DA++YGCVPVI+ANHYDLPF DILNWKSF++VV  +DIP
Sbjct: 349  LLGSKFCIHVKGYEVNTARVADALFYGCVPVILANHYDLPFTDILNWKSFAVVVHHIDIP 408

Query: 1283 LLKKILREVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRV 1450
            +LKKIL+ +S++EY+ LQ N +KV+KHF+WN+ P+D+DAF+M +YELW RRS +RV
Sbjct: 409  VLKKILQGISNEEYSMLQSNAVKVRKHFQWNVPPLDFDAFHMSLYELWKRRSVVRV 464


Top