BLASTX nr result
ID: Akebia23_contig00024476
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia23_contig00024476 (1894 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI39073.3| unnamed protein product [Vitis vinifera] 655 0.0 ref|XP_006443446.1| hypothetical protein CICLE_v10020045mg [Citr... 644 0.0 ref|XP_004493174.1| PREDICTED: probable glycosyltransferase At5g... 628 e-177 ref|XP_006604241.1| PREDICTED: probable glycosyltransferase At5g... 625 e-176 ref|XP_007030071.1| Exostosin family protein isoform 2 [Theobrom... 623 e-176 ref|XP_007030070.1| Exostosin family protein isoform 1 [Theobrom... 623 e-176 ref|XP_006604240.1| PREDICTED: probable glycosyltransferase At5g... 619 e-174 ref|XP_006443447.1| hypothetical protein CICLE_v10020045mg [Citr... 597 e-168 ref|XP_006443448.1| hypothetical protein CICLE_v10020045mg [Citr... 595 e-167 ref|XP_003624636.1| Exostosin-like protein [Medicago truncatula]... 595 e-167 ref|XP_006604242.1| PREDICTED: probable glycosyltransferase At5g... 588 e-165 ref|XP_004296269.1| PREDICTED: probable glycosyltransferase At5g... 588 e-165 ref|XP_002265438.2| PREDICTED: probable glycosyltransferase At5g... 586 e-164 ref|XP_002525728.1| catalytic, putative [Ricinus communis] gi|22... 575 e-161 ref|XP_006826373.1| hypothetical protein AMTR_s00004p00132530 [A... 573 e-161 gb|EXB31256.1| putative glycosyltransferase [Morus notabilis] 572 e-160 ref|XP_007030069.1| Exostosin family protein [Theobroma cacao] g... 568 e-159 ref|XP_006346547.1| PREDICTED: probable glycosyltransferase At5g... 555 e-155 ref|XP_007204779.1| hypothetical protein PRUPE_ppa015806mg [Prun... 555 e-155 ref|XP_007030066.1| Exostosin family protein [Theobroma cacao] g... 551 e-154 >emb|CBI39073.3| unnamed protein product [Vitis vinifera] Length = 467 Score = 655 bits (1691), Expect = 0.0 Identities = 327/478 (68%), Positives = 378/478 (79%), Gaps = 3/478 (0%) Frame = +2 Query: 26 MVCSSSLFYYFSHRRLFDSFRTFFFIPTILALITSLFILIYISSTSKLFFVHQHHQSHLL 205 M S + Y+FS RR DSFR FFFIPTILALITSLFIL YISSTS LF Q +L Sbjct: 1 MARSFFILYHFSGRRFSDSFRGFFFIPTILALITSLFILFYISSTSNLFTHPQETHLQVL 60 Query: 206 PKXXXXXXXXXXXTHQIIQSPSQNPPKTTFYEXXXXXXXXXXXXXXXXXXGTDENRRYWE 385 K +HQ ++ P++ P + +E Sbjct: 61 -KSALGSSAFSPPSHQFMRVPAETPHLSRGFE---------------------------- 91 Query: 386 TGRPIGSNGKYVN---KEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFE 556 + G++V + HD+++F+E+YKEMNRSFKIY YPH+ DDP+ANALLPV+FE Sbjct: 92 ----FNTKGRFVLLWCTIIRHDRNLFVENYKEMNRSFKIYCYPHKRDDPFANALLPVDFE 147 Query: 557 PGGNYASESYFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYIS 736 PGGNYASESYFKKVLMKSHFITKDPS+ADLFFLPFSIARLRHDPRVGV GI DFI+ YI Sbjct: 148 PGGNYASESYFKKVLMKSHFITKDPSKADLFFLPFSIARLRHDPRVGVGGIQDFIRDYIF 207 Query: 737 NISQNYPYWNRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDA 916 NISQNYPYWN++ G DHFYVACHSIGRSAMEKA EVK NAIQVVCSSSYFLSGYIAHKDA Sbjct: 208 NISQNYPYWNQTGGADHFYVACHSIGRSAMEKADEVKLNAIQVVCSSSYFLSGYIAHKDA 267 Query: 917 SLPQIWPRQGDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYS 1096 SLPQIWPRQGDPP++A+S+RKKLAFFAG+INSPVR +LL+ W+NDSEI VHFGRLTTPY+ Sbjct: 268 SLPQIWPRQGDPPDLALSERKKLAFFAGSINSPVRERLLQVWRNDSEISVHFGRLTTPYA 327 Query: 1097 DELLGSKFCLHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLD 1276 DELLGSKFCLHVKGFE+NTARI D++YYGCVPVIIANHYDLPFADILNWKSFS+VVATLD Sbjct: 328 DELLGSKFCLHVKGFEINTARIADSLYYGCVPVIIANHYDLPFADILNWKSFSIVVATLD 387 Query: 1277 IPLLKKILREVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRV 1450 IPLLK++L+ +S +EY LQ NVLKV+ HF+W++SP+DYDAFYMV+YELWLRRSS+RV Sbjct: 388 IPLLKQVLKGISLNEYLMLQSNVLKVRNHFQWHVSPVDYDAFYMVMYELWLRRSSVRV 445 >ref|XP_006443446.1| hypothetical protein CICLE_v10020045mg [Citrus clementina] gi|568850886|ref|XP_006479128.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X1 [Citrus sinensis] gi|557545708|gb|ESR56686.1| hypothetical protein CICLE_v10020045mg [Citrus clementina] Length = 465 Score = 644 bits (1662), Expect = 0.0 Identities = 320/475 (67%), Positives = 367/475 (77%) Frame = +2 Query: 26 MVCSSSLFYYFSHRRLFDSFRTFFFIPTILALITSLFILIYISSTSKLFFVHQHHQSHLL 205 M +SSL YFS R +TFFFIPT LAL+++LFIL YIS+TS LFF HHQ H Sbjct: 1 MANNSSLILYFSRNR--GLVKTFFFIPTTLALLSTLFILFYISTTSHLFF--NHHQRH-- 54 Query: 206 PKXXXXXXXXXXXTHQIIQSPSQNPPKTTFYEXXXXXXXXXXXXXXXXXXGTDENRRYWE 385 HQ+ +N P + G N+R Sbjct: 55 ------------HQHQLTPFILKNNPLPPPLKSSPVLVSLLNVSNNSHGDGRVRNQR--S 100 Query: 386 TGRPIGSNGKYVNKEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPGG 565 P+ +NG +NKEVFHD+DIFLEDYK+MNRSF++YVYPHR +DP+AN LLPV+FEP G Sbjct: 101 VNVPMEANGNSMNKEVFHDRDIFLEDYKQMNRSFRVYVYPHRRNDPFANVLLPVDFEPRG 160 Query: 566 NYASESYFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNIS 745 NYASESYFKKV MKSHF+TKDPS+ADLFFLPFSIAR+RHD R+G +GIPDFI YI NIS Sbjct: 161 NYASESYFKKVFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNIS 220 Query: 746 QNYPYWNRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASLP 925 Q YPYWNR+ G DHFYVACHSIGRSAMEKA EVK NAIQVVCSSSYF+SG+IAHKD SLP Sbjct: 221 QKYPYWNRTGGADHFYVACHSIGRSAMEKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLP 280 Query: 926 QIWPRQGDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDEL 1105 QIWPRQ DPP + SKR KLAFFAGA+NSPVR KLL+ W+NDSEI+ H GRL TPY+D L Sbjct: 281 QIWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADGL 340 Query: 1106 LGSKFCLHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIPL 1285 LGSKFCLHVKGFEVNTARI D++YYGCVPVIIANHYDLPFADILNWKSFS+VVATLDIPL Sbjct: 341 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADILNWKSFSIVVATLDIPL 400 Query: 1286 LKKILREVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRV 1450 LKKIL+ +SS+EY LQ NVLKV+KHF+W++ P DYDAFYMV+Y+LWLRRSS+RV Sbjct: 401 LKKILKGISSEEYLLLQNNVLKVRKHFQWHVFPSDYDAFYMVMYDLWLRRSSVRV 455 >ref|XP_004493174.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cicer arietinum] Length = 472 Score = 628 bits (1620), Expect = e-177 Identities = 323/486 (66%), Positives = 364/486 (74%), Gaps = 10/486 (2%) Frame = +2 Query: 26 MVCSSSLFYYFSHRRLFDSFRTFFF-IPTILALI--TSLFILIYISSTSKLFFVHQHHQS 196 MVC SSL Y SH + SFR FFF IPT LAL+ TSL IL Y+ +TS +F HHQ Sbjct: 1 MVCPSSLNQY-SHLHVAASFRNFFFFIPTTLALLFLTSLSILFYVYTTSIIFI--NHHQH 57 Query: 197 HLLPKXXXXXXXXXXXTHQIIQSPSQNP----PKTTFYEXXXXXXXXXXXXXXXXXXGTD 364 H L T Q S S P P TT + Sbjct: 58 HHLQS-----------TSQYFTSLSSLPVLLSPTTTLHNNASEFTKFQTFQLGHGLPPQS 106 Query: 365 ENRRYWETGRPIGSNGKYV---NKEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANA 535 + G P SN N +FHD+D+FLEDYKEMNRSFKIYVYPHREDDP+AN Sbjct: 107 QR------GLPSQSNSTRKLEKNNNLFHDRDLFLEDYKEMNRSFKIYVYPHREDDPFANV 160 Query: 536 LLPVNFEPGGNYASESYFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPD 715 LLP+ EPGGNYASESYFKKVLMKSHFIT DP+EADLFF+PFSIA LRHDPRVGV+GI D Sbjct: 161 LLPMKHEPGGNYASESYFKKVLMKSHFITNDPTEADLFFMPFSIASLRHDPRVGVEGIQD 220 Query: 716 FIKTYISNISQNYPYWNRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSG 895 FI+ Y+ NI YPYWNR+ G DHFYVACHSIGRSAMEKA +VKFNAIQVVCSSSYFL+G Sbjct: 221 FIRDYVQNIVHKYPYWNRTGGADHFYVACHSIGRSAMEKAPDVKFNAIQVVCSSSYFLTG 280 Query: 896 YIAHKDASLPQIWPRQGDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFG 1075 YIAHKD LPQIWPR+ +PPN+ S RKKLAFFAG +NSPVR KLLE WKNDSEIFVH G Sbjct: 281 YIAHKDTCLPQIWPRKQNPPNLVSSNRKKLAFFAGGVNSPVRIKLLETWKNDSEIFVHHG 340 Query: 1076 RLTTPYSDELLGSKFCLHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFS 1255 RL TPY+DELLGSKFCLHVKGFEVNTARIGD++YYGCVPVIIAN+YDLPFAD+LNWKSFS Sbjct: 341 RLKTPYADELLGSKFCLHVKGFEVNTARIGDSLYYGCVPVIIANYYDLPFADVLNWKSFS 400 Query: 1256 LVVATLDIPLLKKILREVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRR 1435 +VV TLDIPLLKKIL+ +SSDEY LQRNVLKV+KHF+W+ P+D+DAFYMV+YELWLRR Sbjct: 401 VVVTTLDIPLLKKILKGISSDEYLMLQRNVLKVRKHFQWHSPPIDFDAFYMVVYELWLRR 460 Query: 1436 SSMRVA 1453 SS+ ++ Sbjct: 461 SSIIIS 466 >ref|XP_006604241.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X2 [Glycine max] Length = 489 Score = 625 bits (1611), Expect = e-176 Identities = 312/463 (67%), Positives = 352/463 (76%), Gaps = 7/463 (1%) Frame = +2 Query: 80 SFRTFFFIPTILALITSLFILIYISSTSKLFFVHQHHQS--HLLPKXXXXXXXXXXXTHQ 253 SFR FFFIPT LAL TS FIL YI STS +F H HH S H P T Sbjct: 32 SFRGFFFIPTTLALFTSFFILFYIYSTSNIFTHHNHHPSTSHFKPHPPFSTTPFIATTPH 91 Query: 254 IIQSPSQNPPKTTFYEXXXXXXXXXXXXXXXXXXGTDENRRYWETGRPI----GSNGKYV 421 + + N ++T G R G P+ S GK+ Sbjct: 92 FVPVFNHNASEST---------KSPPTFQLGYGLGPQSQR-----GLPLPPQFSSKGKFE 137 Query: 422 NKEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASESYFKKVL 601 N +VFHD+D+FLEDYKEMNRS KIYVYPHREDDP+AN LLPV EPGGNY SESYFKKVL Sbjct: 138 NNDVFHDRDVFLEDYKEMNRSLKIYVYPHREDDPFANVLLPVESEPGGNYTSESYFKKVL 197 Query: 602 MKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYWNRSSGT 781 MKSHFITKDP EADLFFLPFS+ARL HD RVGV GI DFI+ YI NIS YPYWN + G Sbjct: 198 MKSHFITKDPPEADLFFLPFSMARLWHDRRVGVGGIQDFIRDYIHNISHRYPYWNNTGGA 257 Query: 782 DHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPNV 961 DHFYVACHSIGRSAM+KA + KFNAIQVVCSSSYFL+GY AHKDA LPQIWPR+G+PPN+ Sbjct: 258 DHFYVACHSIGRSAMDKAPDEKFNAIQVVCSSSYFLTGYFAHKDACLPQIWPRKGNPPNL 317 Query: 962 AISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFCLHVKGF 1141 SKRK+LAFFAG +NSPVR KLLE WKNDSEIFVH GRL TPY+DELLGSKFCLHVKGF Sbjct: 318 VSSKRKRLAFFAGGVNSPVRVKLLETWKNDSEIFVHHGRLKTPYADELLGSKFCLHVKGF 377 Query: 1142 EVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILRE-VSSD 1318 EVNTARIGD++YYGCVPVIIAN+YDLPFAD+LNWKSFS+VV TLDIPLLKKIL++ +SS+ Sbjct: 378 EVNTARIGDSLYYGCVPVIIANYYDLPFADVLNWKSFSVVVTTLDIPLLKKILKDIISSN 437 Query: 1319 EYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMR 1447 +Y LQ NVLKV+KHF+W+ P D+DAFYMV+YELWLRRSS++ Sbjct: 438 KYLMLQSNVLKVRKHFQWHSPPQDFDAFYMVMYELWLRRSSIK 480 >ref|XP_007030071.1| Exostosin family protein isoform 2 [Theobroma cacao] gi|508718676|gb|EOY10573.1| Exostosin family protein isoform 2 [Theobroma cacao] Length = 496 Score = 623 bits (1607), Expect = e-176 Identities = 314/482 (65%), Positives = 368/482 (76%), Gaps = 6/482 (1%) Frame = +2 Query: 23 SMVCSSSLFYYFSHRRLFD-SFRTFFFIPTILALITSLFILIYISSTSKLFFVHQHHQSH 199 +M SS YYFS RR+ S ++FFF+P LALI+++FIL YI +TS LF H H + Sbjct: 14 AMARSSLPLYYFSPRRVSSPSSKSFFFVPATLALISTIFILFYIFTTSTLFTSHHHRHTL 73 Query: 200 LLPKXXXXXXXXXXXTHQIIQSP-SQNPPKTTFYEXXXXXXXXXXXXXXXXXX--GTDEN 370 L + SP +QN P + + G ++ Sbjct: 74 YLKQPLGSFP----------SSPLTQNVPSFSLHNNGFKNGTFDLPKRPPLKAVGGGEDA 123 Query: 371 RRYWETGRP-IGSNGKYVNK-EVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLP 544 T RP GS G +VN EVFHD DIFLEDYKEMN SFKIYVYP + +DP+A+ALLP Sbjct: 124 TMSQVTSRPHFGSEGNFVNNLEVFHDGDIFLEDYKEMNNSFKIYVYPVKRNDPFAHALLP 183 Query: 545 VNFEPGGNYASESYFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIK 724 V+FEPGGNYASESYFKK LMKSHFITKDP++ADLFFLPFSIARLRHD R+G GI DFI+ Sbjct: 184 VDFEPGGNYASESYFKKALMKSHFITKDPTKADLFFLPFSIARLRHDRRIGTGGIQDFIR 243 Query: 725 TYISNISQNYPYWNRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIA 904 YI NISQ YPYWNRS G DHFYVACHSIGRS M KA E+K NAIQ+VCSSSYFLSGYIA Sbjct: 244 DYIFNISQKYPYWNRSGGADHFYVACHSIGRSVMAKARELKLNAIQIVCSSSYFLSGYIA 303 Query: 905 HKDASLPQIWPRQGDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLT 1084 HKDASLPQ+WPR GDPPN+A SKR KL+FFAG+INSPVR KLL+ W+NDSEI H+GRL Sbjct: 304 HKDASLPQVWPRTGDPPNLASSKRNKLSFFAGSINSPVREKLLKFWRNDSEIAAHYGRLK 363 Query: 1085 TPYSDELLGSKFCLHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVV 1264 TPY+DELL SKFCLHVKGFEVNTARI D++YYGCVP+IIAN+YDLPFADILNWKSFS+VV Sbjct: 364 TPYADELLSSKFCLHVKGFEVNTARIADSLYYGCVPIIIANYYDLPFADILNWKSFSIVV 423 Query: 1265 ATLDIPLLKKILREVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSM 1444 T+DIP LK+ILR ++SDEY +LQRNVLKV+KHF+W++ P+D+DAFYMV+YELWLRRSS Sbjct: 424 VTVDIPSLKQILRGITSDEYLSLQRNVLKVRKHFQWHVPPIDFDAFYMVMYELWLRRSSA 483 Query: 1445 RV 1450 R+ Sbjct: 484 RI 485 >ref|XP_007030070.1| Exostosin family protein isoform 1 [Theobroma cacao] gi|508718675|gb|EOY10572.1| Exostosin family protein isoform 1 [Theobroma cacao] Length = 492 Score = 623 bits (1607), Expect = e-176 Identities = 314/482 (65%), Positives = 368/482 (76%), Gaps = 6/482 (1%) Frame = +2 Query: 23 SMVCSSSLFYYFSHRRLFD-SFRTFFFIPTILALITSLFILIYISSTSKLFFVHQHHQSH 199 +M SS YYFS RR+ S ++FFF+P LALI+++FIL YI +TS LF H H + Sbjct: 14 AMARSSLPLYYFSPRRVSSPSSKSFFFVPATLALISTIFILFYIFTTSTLFTSHHHRHTL 73 Query: 200 LLPKXXXXXXXXXXXTHQIIQSP-SQNPPKTTFYEXXXXXXXXXXXXXXXXXX--GTDEN 370 L + SP +QN P + + G ++ Sbjct: 74 YLKQPLGSFP----------SSPLTQNVPSFSLHNNGFKNGTFDLPKRPPLKAVGGGEDA 123 Query: 371 RRYWETGRP-IGSNGKYVNK-EVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLP 544 T RP GS G +VN EVFHD DIFLEDYKEMN SFKIYVYP + +DP+A+ALLP Sbjct: 124 TMSQVTSRPHFGSEGNFVNNLEVFHDGDIFLEDYKEMNNSFKIYVYPVKRNDPFAHALLP 183 Query: 545 VNFEPGGNYASESYFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIK 724 V+FEPGGNYASESYFKK LMKSHFITKDP++ADLFFLPFSIARLRHD R+G GI DFI+ Sbjct: 184 VDFEPGGNYASESYFKKALMKSHFITKDPTKADLFFLPFSIARLRHDRRIGTGGIQDFIR 243 Query: 725 TYISNISQNYPYWNRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIA 904 YI NISQ YPYWNRS G DHFYVACHSIGRS M KA E+K NAIQ+VCSSSYFLSGYIA Sbjct: 244 DYIFNISQKYPYWNRSGGADHFYVACHSIGRSVMAKARELKLNAIQIVCSSSYFLSGYIA 303 Query: 905 HKDASLPQIWPRQGDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLT 1084 HKDASLPQ+WPR GDPPN+A SKR KL+FFAG+INSPVR KLL+ W+NDSEI H+GRL Sbjct: 304 HKDASLPQVWPRTGDPPNLASSKRNKLSFFAGSINSPVREKLLKFWRNDSEIAAHYGRLK 363 Query: 1085 TPYSDELLGSKFCLHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVV 1264 TPY+DELL SKFCLHVKGFEVNTARI D++YYGCVP+IIAN+YDLPFADILNWKSFS+VV Sbjct: 364 TPYADELLSSKFCLHVKGFEVNTARIADSLYYGCVPIIIANYYDLPFADILNWKSFSIVV 423 Query: 1265 ATLDIPLLKKILREVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSM 1444 T+DIP LK+ILR ++SDEY +LQRNVLKV+KHF+W++ P+D+DAFYMV+YELWLRRSS Sbjct: 424 VTVDIPSLKQILRGITSDEYLSLQRNVLKVRKHFQWHVPPIDFDAFYMVMYELWLRRSSA 483 Query: 1445 RV 1450 R+ Sbjct: 484 RI 485 >ref|XP_006604240.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X1 [Glycine max] Length = 500 Score = 619 bits (1595), Expect = e-174 Identities = 307/460 (66%), Positives = 350/460 (76%), Gaps = 4/460 (0%) Frame = +2 Query: 80 SFRTFFFIPTILALITSLFILIYISSTSKLFFVHQHHQS--HLLPKXXXXXXXXXXXTHQ 253 SFR FFFIPT LAL TS FIL YI STS +F H HH S H P T Sbjct: 32 SFRGFFFIPTTLALFTSFFILFYIYSTSNIFTHHNHHPSTSHFKPHPPFSTTPFIATTPH 91 Query: 254 IIQSPSQNPPKTTFYEXXXXXXXXXXXXXXXXXXGTDE-NRRYWETGRPIGSNGKYVNKE 430 + + N ++T + + + +GK+ N + Sbjct: 92 FVPVFNHNASESTKSPPTFQLGYGLGPQSQRGLPLPPQFSSKVCRECCVFYGSGKFENND 151 Query: 431 VFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASESYFKKVLMKS 610 VFHD+D+FLEDYKEMNRS KIYVYPHREDDP+AN LLPV EPGGNY SESYFKKVLMKS Sbjct: 152 VFHDRDVFLEDYKEMNRSLKIYVYPHREDDPFANVLLPVESEPGGNYTSESYFKKVLMKS 211 Query: 611 HFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYWNRSSGTDHF 790 HFITKDP EADLFFLPFS+ARL HD RVGV GI DFI+ YI NIS YPYWN + G DHF Sbjct: 212 HFITKDPPEADLFFLPFSMARLWHDRRVGVGGIQDFIRDYIHNISHRYPYWNNTGGADHF 271 Query: 791 YVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPNVAIS 970 YVACHSIGRSAM+KA + KFNAIQVVCSSSYFL+GY AHKDA LPQIWPR+G+PPN+ S Sbjct: 272 YVACHSIGRSAMDKAPDEKFNAIQVVCSSSYFLTGYFAHKDACLPQIWPRKGNPPNLVSS 331 Query: 971 KRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFCLHVKGFEVN 1150 KRK+LAFFAG +NSPVR KLLE WKNDSEIFVH GRL TPY+DELLGSKFCLHVKGFEVN Sbjct: 332 KRKRLAFFAGGVNSPVRVKLLETWKNDSEIFVHHGRLKTPYADELLGSKFCLHVKGFEVN 391 Query: 1151 TARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILRE-VSSDEYT 1327 TARIGD++YYGCVPVIIAN+YDLPFAD+LNWKSFS+VV TLDIPLLKKIL++ +SS++Y Sbjct: 392 TARIGDSLYYGCVPVIIANYYDLPFADVLNWKSFSVVVTTLDIPLLKKILKDIISSNKYL 451 Query: 1328 TLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMR 1447 LQ NVLKV+KHF+W+ P D+DAFYMV+YELWLRRSS++ Sbjct: 452 MLQSNVLKVRKHFQWHSPPQDFDAFYMVMYELWLRRSSIK 491 >ref|XP_006443447.1| hypothetical protein CICLE_v10020045mg [Citrus clementina] gi|568850888|ref|XP_006479129.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X2 [Citrus sinensis] gi|557545709|gb|ESR56687.1| hypothetical protein CICLE_v10020045mg [Citrus clementina] Length = 374 Score = 597 bits (1540), Expect = e-168 Identities = 277/349 (79%), Positives = 314/349 (89%) Frame = +2 Query: 404 SNGKYVNKEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASES 583 ++G +NKEVFHD+DIFLEDYK+MNRSF++YVYPHR +DP+AN LLPV+FEP GNYASES Sbjct: 16 ASGNSMNKEVFHDRDIFLEDYKQMNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASES 75 Query: 584 YFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYW 763 YFKKV MKSHF+TKDPS+ADLFFLPFSIAR+RHD R+G +GIPDFI YI NISQ YPYW Sbjct: 76 YFKKVFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYW 135 Query: 764 NRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQ 943 NR+ G DHFYVACHSIGRSAMEKA EVK NAIQVVCSSSYF+SG+IAHKD SLPQIWPRQ Sbjct: 136 NRTGGADHFYVACHSIGRSAMEKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ 195 Query: 944 GDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFC 1123 DPP + SKR KLAFFAGA+NSPVR KLL+ W+NDSEI+ H GRL TPY+D LLGSKFC Sbjct: 196 EDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADGLLGSKFC 255 Query: 1124 LHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILR 1303 LHVKGFEVNTARI D++YYGCVPVIIANHYDLPFADILNWKSFS+VVATLDIPLLKKIL+ Sbjct: 256 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADILNWKSFSIVVATLDIPLLKKILK 315 Query: 1304 EVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRV 1450 +SS+EY LQ NVLKV+KHF+W++ P DYDAFYMV+Y+LWLRRSS+RV Sbjct: 316 GISSEEYLLLQNNVLKVRKHFQWHVFPSDYDAFYMVMYDLWLRRSSVRV 364 >ref|XP_006443448.1| hypothetical protein CICLE_v10020045mg [Citrus clementina] gi|557545710|gb|ESR56688.1| hypothetical protein CICLE_v10020045mg [Citrus clementina] Length = 354 Score = 595 bits (1534), Expect = e-167 Identities = 276/344 (80%), Positives = 311/344 (90%) Frame = +2 Query: 419 VNKEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASESYFKKV 598 +NKEVFHD+DIFLEDYK+MNRSF++YVYPHR +DP+AN LLPV+FEP GNYASESYFKKV Sbjct: 1 MNKEVFHDRDIFLEDYKQMNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKV 60 Query: 599 LMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYWNRSSG 778 MKSHF+TKDPS+ADLFFLPFSIAR+RHD R+G +GIPDFI YI NISQ YPYWNR+ G Sbjct: 61 FMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGG 120 Query: 779 TDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPN 958 DHFYVACHSIGRSAMEKA EVK NAIQVVCSSSYF+SG+IAHKD SLPQIWPRQ DPP Sbjct: 121 ADHFYVACHSIGRSAMEKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK 180 Query: 959 VAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFCLHVKG 1138 + SKR KLAFFAGA+NSPVR KLL+ W+NDSEI+ H GRL TPY+D LLGSKFCLHVKG Sbjct: 181 LGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADGLLGSKFCLHVKG 240 Query: 1139 FEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILREVSSD 1318 FEVNTARI D++YYGCVPVIIANHYDLPFADILNWKSFS+VVATLDIPLLKKIL+ +SS+ Sbjct: 241 FEVNTARIADSLYYGCVPVIIANHYDLPFADILNWKSFSIVVATLDIPLLKKILKGISSE 300 Query: 1319 EYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRV 1450 EY LQ NVLKV+KHF+W++ P DYDAFYMV+Y+LWLRRSS+RV Sbjct: 301 EYLLLQNNVLKVRKHFQWHVFPSDYDAFYMVMYDLWLRRSSVRV 344 >ref|XP_003624636.1| Exostosin-like protein [Medicago truncatula] gi|87162615|gb|ABD28410.1| Exostosin-like [Medicago truncatula] gi|116831751|gb|ABK28848.1| exostosin-like protein [Medicago truncatula] gi|355499651|gb|AES80854.1| Exostosin-like protein [Medicago truncatula] Length = 486 Score = 595 bits (1533), Expect = e-167 Identities = 302/479 (63%), Positives = 355/479 (74%), Gaps = 9/479 (1%) Frame = +2 Query: 41 SLFYYFSHRRLFDSFRTFFF-IPTILALITSLFILIYISSTSKLFFVHQHH---QSHLLP 208 S Y +SH + SF++FFF IPT LAL+TSL IL Y+ TS +F H H QS L+ Sbjct: 5 SSLYQYSHTHVASSFKSFFFFIPTTLALLTSLSILFYVYYTSIIFTHHHQHNNQQSTLIN 64 Query: 209 -KXXXXXXXXXXXTHQIIQSPSQNPP---KTTFYEXXXXXXXXXXXXXXXXXXGTDENRR 376 K T + + N K+ ++ +N+ Sbjct: 65 FKSSSPNFILPSPTPHLTNTLHNNHSEFIKSHTFQLGHGLGPQSQRGLPPQSSSNGQNKH 124 Query: 377 YWETGRPIGSNGKYVNKEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFE 556 E GS N VFHD+DIFLEDYKEMNRSFKIYVYPH++DDP+AN LLPV E Sbjct: 125 --ENSVFDGSRKFKENNNVFHDRDIFLEDYKEMNRSFKIYVYPHKKDDPFANVLLPVKTE 182 Query: 557 PGGNYASESYFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYIS 736 P GNYASESYFKK LMKSHFITKDP++ADLFF+PFSIA LRHD RVGV GI DFI+ Y+ Sbjct: 183 PSGNYASESYFKKALMKSHFITKDPTKADLFFMPFSIASLRHDRRVGVGGIQDFIRDYVQ 242 Query: 737 NISQNYPYWNRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDA 916 N+ YPYWNR++G DHFYVACHSIGRSAM+KA +VKFNAIQVVCSSSYFLSGYIAHKDA Sbjct: 243 NMIHKYPYWNRTNGADHFYVACHSIGRSAMDKAPDVKFNAIQVVCSSSYFLSGYIAHKDA 302 Query: 917 SLPQIWPRQGDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYS 1096 LPQIWPR +PPN+ S RKKLAFFAG +NSPVR L+E WKND+EIFVH GRL TPY Sbjct: 303 CLPQIWPRNENPPNLVSSNRKKLAFFAGEVNSPVRINLVETWKNDTEIFVHNGRLKTPYG 362 Query: 1097 DELLGSKFCLHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLD 1276 DELLGSKFC HV+G+EVNTARIGD++YYGCVPVIIAN+YDLPFAD+LNWKSFS+VV TLD Sbjct: 363 DELLGSKFCFHVRGYEVNTARIGDSLYYGCVPVIIANYYDLPFADVLNWKSFSVVVTTLD 422 Query: 1277 IPLLKKILRE-VSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRV 1450 IPLLKKIL+ V+S EY LQ+NVLKV++HF+W+ P+D+DAFYMV+YELWLRRSS+ + Sbjct: 423 IPLLKKILKGIVNSGEYLMLQKNVLKVREHFQWHSPPIDFDAFYMVMYELWLRRSSIPI 481 >ref|XP_006604242.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X3 [Glycine max] Length = 475 Score = 588 bits (1516), Expect = e-165 Identities = 275/348 (79%), Positives = 310/348 (89%), Gaps = 1/348 (0%) Frame = +2 Query: 407 NGKYVNKEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASESY 586 +GK+ N +VFHD+D+FLEDYKEMNRS KIYVYPHREDDP+AN LLPV EPGGNY SESY Sbjct: 119 SGKFENNDVFHDRDVFLEDYKEMNRSLKIYVYPHREDDPFANVLLPVESEPGGNYTSESY 178 Query: 587 FKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYWN 766 FKKVLMKSHFITKDP EADLFFLPFS+ARL HD RVGV GI DFI+ YI NIS YPYWN Sbjct: 179 FKKVLMKSHFITKDPPEADLFFLPFSMARLWHDRRVGVGGIQDFIRDYIHNISHRYPYWN 238 Query: 767 RSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQG 946 + G DHFYVACHSIGRSAM+KA + KFNAIQVVCSSSYFL+GY AHKDA LPQIWPR+G Sbjct: 239 NTGGADHFYVACHSIGRSAMDKAPDEKFNAIQVVCSSSYFLTGYFAHKDACLPQIWPRKG 298 Query: 947 DPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFCL 1126 +PPN+ SKRK+LAFFAG +NSPVR KLLE WKNDSEIFVH GRL TPY+DELLGSKFCL Sbjct: 299 NPPNLVSSKRKRLAFFAGGVNSPVRVKLLETWKNDSEIFVHHGRLKTPYADELLGSKFCL 358 Query: 1127 HVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILRE 1306 HVKGFEVNTARIGD++YYGCVPVIIAN+YDLPFAD+LNWKSFS+VV TLDIPLLKKIL++ Sbjct: 359 HVKGFEVNTARIGDSLYYGCVPVIIANYYDLPFADVLNWKSFSVVVTTLDIPLLKKILKD 418 Query: 1307 -VSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMR 1447 +SS++Y LQ NVLKV+KHF+W+ P D+DAFYMV+YELWLRRSS++ Sbjct: 419 IISSNKYLMLQSNVLKVRKHFQWHSPPQDFDAFYMVMYELWLRRSSIK 466 >ref|XP_004296269.1| PREDICTED: probable glycosyltransferase At5g03795-like [Fragaria vesca subsp. vesca] Length = 439 Score = 588 bits (1515), Expect = e-165 Identities = 305/475 (64%), Positives = 343/475 (72%), Gaps = 7/475 (1%) Frame = +2 Query: 35 SSSLFYYFSHRRLFDSFRTFFFIPTILALITSLFILIYISSTSKLFFVH------QHHQS 196 +S + Y S RL R+FFFIPT LAL TSL IL YIS+TS LF H Sbjct: 2 ASLVLLYLSQWRLP---RSFFFIPTTLALATSLLILFYISTTSNLFPHHPPLPNLSSFAP 58 Query: 197 HLLPKXXXXXXXXXXXTHQIIQSPSQNPPKTTFYEXXXXXXXXXXXXXXXXXXGTDENRR 376 HL P QS PP + Sbjct: 59 HLYP----------------FQSQRSLPPNSA---------------------------- 74 Query: 377 YWETGRPIGSNGKYVNK-EVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNF 553 NG Y N EVFHD IF++DYKEM RSFKIYVYPHR+DDP+ANALLPV+F Sbjct: 75 ---------PNGNYDNNNEVFHDTHIFVQDYKEMKRSFKIYVYPHRKDDPFANALLPVDF 125 Query: 554 EPGGNYASESYFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYI 733 EP GNYASESYFKKVLM+SHFIT DP++A LFFLPFSIARLRHDPRVGV GI DFI+ Y+ Sbjct: 126 EPAGNYASESYFKKVLMESHFITNDPTQAQLFFLPFSIARLRHDPRVGVGGIQDFIRDYM 185 Query: 734 SNISQNYPYWNRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKD 913 NIS Y YWNR+ G DHFYVACHSIGRSAMEKAT+VKFNAIQ+VCSSSYFLSGYIAHKD Sbjct: 186 FNISHKYEYWNRTGGADHFYVACHSIGRSAMEKATQVKFNAIQLVCSSSYFLSGYIAHKD 245 Query: 914 ASLPQIWPRQGDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPY 1093 A LPQIWPR+ DPPN+ S R KLAFFAG INSPVR +LL+ W+NDSEIFV+FGRL T Y Sbjct: 246 ACLPQIWPRKQDPPNLLSSNRTKLAFFAGGINSPVRERLLQVWRNDSEIFVNFGRLKTSY 305 Query: 1094 SDELLGSKFCLHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATL 1273 +D LLGS FCLHVKGFEVNTARI D++YYGCVPVIIAN+YDLPFADILNWKSFS+VVATL Sbjct: 306 ADALLGSMFCLHVKGFEVNTARIADSLYYGCVPVIIANYYDLPFADILNWKSFSVVVATL 365 Query: 1274 DIPLLKKILREVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRS 1438 DIPLLK IL+ + SDEY L+ NV KV+ F+W+LSP+DYDAF+MV+YELWLRRS Sbjct: 366 DIPLLKNILKGIRSDEYMRLRNNVFKVRNQFQWHLSPIDYDAFHMVMYELWLRRS 420 >ref|XP_002265438.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis vinifera] Length = 336 Score = 586 bits (1510), Expect = e-164 Identities = 274/326 (84%), Positives = 306/326 (93%) Frame = +2 Query: 473 MNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASESYFKKVLMKSHFITKDPSEADLFF 652 MNRSFKIY YPH+ DDP+ANALLPV+FEPGGNYASESYFKKVLMKSHFITKDPS+ADLFF Sbjct: 1 MNRSFKIYCYPHKRDDPFANALLPVDFEPGGNYASESYFKKVLMKSHFITKDPSKADLFF 60 Query: 653 LPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYWNRSSGTDHFYVACHSIGRSAMEK 832 LPFSIARLRHDPRVGV GI DFI+ YI NISQNYPYWN++ G DHFYVACHSIGRSAMEK Sbjct: 61 LPFSIARLRHDPRVGVGGIQDFIRDYIFNISQNYPYWNQTGGADHFYVACHSIGRSAMEK 120 Query: 833 ATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPNVAISKRKKLAFFAGAINS 1012 A EVK NAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPP++A+S+RKKLAFFAG+INS Sbjct: 121 ADEVKLNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPDLALSERKKLAFFAGSINS 180 Query: 1013 PVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFCLHVKGFEVNTARIGDAIYYGCVP 1192 PVR +LL+ W+NDSEI VHFGRLTTPY+DELLGSKFCLHVKGFE+NTARI D++YYGCVP Sbjct: 181 PVRERLLQVWRNDSEISVHFGRLTTPYADELLGSKFCLHVKGFEINTARIADSLYYGCVP 240 Query: 1193 VIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILREVSSDEYTTLQRNVLKVQKHFRW 1372 VIIANHYDLPFADILNWKSFS+VVATLDIPLLK++L+ +S +EY LQ NVLKV+ HF+W Sbjct: 241 VIIANHYDLPFADILNWKSFSIVVATLDIPLLKQVLKGISLNEYLMLQSNVLKVRNHFQW 300 Query: 1373 NLSPMDYDAFYMVIYELWLRRSSMRV 1450 ++SP+DYDAFYMV+YELWLRRSS+RV Sbjct: 301 HVSPVDYDAFYMVMYELWLRRSSVRV 326 >ref|XP_002525728.1| catalytic, putative [Ricinus communis] gi|223535028|gb|EEF36711.1| catalytic, putative [Ricinus communis] Length = 336 Score = 575 bits (1481), Expect = e-161 Identities = 266/326 (81%), Positives = 303/326 (92%) Frame = +2 Query: 473 MNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASESYFKKVLMKSHFITKDPSEADLFF 652 MNRSFKIYVYPHR++DP+AN LLPV+FEPGGNYASESYFKKVLMKSHFITKDP++ADLFF Sbjct: 1 MNRSFKIYVYPHRQNDPFANVLLPVDFEPGGNYASESYFKKVLMKSHFITKDPTKADLFF 60 Query: 653 LPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYWNRSSGTDHFYVACHSIGRSAMEK 832 LPFSIARLRHDPR+GV+GI DFI+ Y+ NISQ YPYWNR+ GTDHFYVACHSIGR+AMEK Sbjct: 61 LPFSIARLRHDPRIGVEGIQDFIRAYVYNISQKYPYWNRTGGTDHFYVACHSIGRTAMEK 120 Query: 833 ATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPNVAISKRKKLAFFAGAINS 1012 A EVKFNAIQVVCSSSY+LSGYIAHKDASLPQ+WPRQGDPPN+A S+R+KLAFFAG+INS Sbjct: 121 AEEVKFNAIQVVCSSSYYLSGYIAHKDASLPQVWPRQGDPPNLASSERQKLAFFAGSINS 180 Query: 1013 PVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFCLHVKGFEVNTARIGDAIYYGCVP 1192 PVR +LL+ W+NDSEI+VH+GRL T Y+DELLGSKFCLHVKGFEVNTARI D++YYGCVP Sbjct: 181 PVRERLLQVWRNDSEIYVHYGRLNTSYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240 Query: 1193 VIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILREVSSDEYTTLQRNVLKVQKHFRW 1372 +IIANHYDLPF DILNW+SFS+VVATLDI LKKIL+ VSSD Y LQ NVLKV+KHF+W Sbjct: 241 IIIANHYDLPFTDILNWESFSVVVATLDILYLKKILQGVSSDRYVMLQSNVLKVRKHFQW 300 Query: 1373 NLSPMDYDAFYMVIYELWLRRSSMRV 1450 + P+DYDAF+MV+YELWLRRSS+RV Sbjct: 301 HFPPVDYDAFHMVMYELWLRRSSVRV 326 >ref|XP_006826373.1| hypothetical protein AMTR_s00004p00132530 [Amborella trichopoda] gi|548830687|gb|ERM93610.1| hypothetical protein AMTR_s00004p00132530 [Amborella trichopoda] Length = 453 Score = 573 bits (1477), Expect = e-161 Identities = 294/472 (62%), Positives = 339/472 (71%) Frame = +2 Query: 59 SHRRLFDSFRTFFFIPTILALITSLFILIYISSTSKLFFVHQHHQSHLLPKXXXXXXXXX 238 SH L + F+FIPTILAL+TSL I+ I+ TS ++ LL K Sbjct: 4 SHPGLNGLPKIFYFIPTILALVTSLCIIYCINLTS-------NYTGFLLGKPYIGSFLIQ 56 Query: 239 XXTHQIIQSPSQNPPKTTFYEXXXXXXXXXXXXXXXXXXGTDENRRYWETGRPIGSNGKY 418 +Q P+ KT G E E IG Sbjct: 57 KRI-PFLQIPNSIDIKTKV---------------PLPDSGNSERLSEGELDLNIGKENN- 99 Query: 419 VNKEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASESYFKKV 598 +N VFHDK +FLEDYK MN+S KIYVYPH +DD +AN LLPV+F+PGGNYASESYFKK Sbjct: 100 INNGVFHDKMVFLEDYKAMNKSLKIYVYPHSKDDSFANVLLPVDFKPGGNYASESYFKKC 159 Query: 599 LMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYWNRSSG 778 LMKSHFITKDP EA LFFLPFSIA LRHDPRVGV GI DF+++YI NISQ YPYWNRS G Sbjct: 160 LMKSHFITKDPKEAHLFFLPFSIASLRHDPRVGVHGIQDFVRSYIYNISQAYPYWNRSGG 219 Query: 779 TDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPN 958 DHFYVACHSIGRSAMEKA +VKFNAIQVVCS+SY+LSGY+AHKDAS+PQIWPR+GDPP Sbjct: 220 ADHFYVACHSIGRSAMEKAVDVKFNAIQVVCSASYYLSGYVAHKDASMPQIWPREGDPPK 279 Query: 959 VAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFCLHVKG 1138 +KR KLAFFAG+ NSPVR LLE W+NDSEI VHFG L+ PYS L SKFCLHVKG Sbjct: 280 AGSTKRDKLAFFAGSNNSPVRQNLLEHWRNDSEISVHFGNLSIPYSKALSHSKFCLHVKG 339 Query: 1139 FEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILREVSSD 1318 FEVNTARI DA++YGCVP++IANHYDLPF DIL+WK FSLVVATLDIPLLK+IL E+S + Sbjct: 340 FEVNTARIADALFYGCVPIVIANHYDLPFTDILDWKKFSLVVATLDIPLLKEILHEISFE 399 Query: 1319 EYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRVA*RESVNL 1474 +Y LQRNVL+V+KHF+W+ P +YDAFYMV+YELWLRR R+ ES L Sbjct: 400 DYEELQRNVLEVRKHFQWHKVPENYDAFYMVMYELWLRRGLARIPVPESNQL 451 >gb|EXB31256.1| putative glycosyltransferase [Morus notabilis] Length = 462 Score = 572 bits (1474), Expect = e-160 Identities = 267/349 (76%), Positives = 306/349 (87%), Gaps = 1/349 (0%) Frame = +2 Query: 407 NGKYVNK-EVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASES 583 + ++VNK EVFHD+ IF EDY+EM RSFKIYVYPHR DDP+AN LLPV+ +PGGNYASE Sbjct: 106 SAEHVNKYEVFHDRHIFQEDYEEMKRSFKIYVYPHRRDDPFANVLLPVDSKPGGNYASEG 165 Query: 584 YFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYW 763 YFK L KS F+T+DP++ADLFFLPFSIARLRHDPRV V GIP+F++ YISN+ + YPYW Sbjct: 166 YFKMALSKSRFVTEDPNKADLFFLPFSIARLRHDPRVSVGGIPEFVRDYISNVRRKYPYW 225 Query: 764 NRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQ 943 NR+ G DHFYVACHSIGRSAMEKATEVK NAIQ+VCSSSYF+ YI+HKDA LPQIWPR+ Sbjct: 226 NRTGGADHFYVACHSIGRSAMEKATEVKLNAIQIVCSSSYFVGSYISHKDACLPQIWPRE 285 Query: 944 GDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFC 1123 GDPPN+ S R KLAFFAGA+NSPVR +L++ W+NDSEIFVH GRL TPY+DELLGSKFC Sbjct: 286 GDPPNLLSSNRTKLAFFAGAMNSPVRKQLVQVWRNDSEIFVHHGRLKTPYADELLGSKFC 345 Query: 1124 LHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILR 1303 LH KGFEVNTARI D++YYGCVPVI+AN+YDLPF DILNWKSFS+VVAT DIPLLKKILR Sbjct: 346 LHAKGFEVNTARIADSLYYGCVPVILANYYDLPFIDILNWKSFSVVVATQDIPLLKKILR 405 Query: 1304 EVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRV 1450 +SSDEY LQRNVLKV+KHF W+ SP DYDAFYMV+YELWLRRS +RV Sbjct: 406 GISSDEYLRLQRNVLKVRKHFLWHPSPRDYDAFYMVMYELWLRRSLLRV 454 >ref|XP_007030069.1| Exostosin family protein [Theobroma cacao] gi|508718674|gb|EOY10571.1| Exostosin family protein [Theobroma cacao] Length = 465 Score = 568 bits (1463), Expect = e-159 Identities = 282/479 (58%), Positives = 335/479 (69%), Gaps = 4/479 (0%) Frame = +2 Query: 26 MVCSSSLFYYFSHRRLFDSFR-TFFFIPTILALITSLFILIYISSTSKLFFV--HQHHQS 196 M SSSL Y+ S R SF +FFF+P LA+ T L I +YI T+ F +H Sbjct: 1 MAKSSSLCYHISQHRFSTSFGGSFFFLPLSLAISTFLVIFLYIWCTNSNLFTDPQNNHYQ 60 Query: 197 HLLPKXXXXXXXXXXXTHQIIQSPSQNPPKTTFYEXXXXXXXXXXXXXXXXXXGTDENRR 376 PK Q+I + + FY Sbjct: 61 ESSPKSSLL--------QQMIPFSLEKAAEDMFYSSRSAPL---------------SKGN 97 Query: 377 YWETGRPIGSNGKYVNK-EVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNF 553 W P G G YVN E++HD+D FL+DYKEMNRS K++VYPH DDP+A+ LLPV++ Sbjct: 98 QWSMANPFGLYGNYVNNTELYHDEDFFLQDYKEMNRSLKVFVYPHSRDDPFASVLLPVDY 157 Query: 554 EPGGNYASESYFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYI 733 +P G+YASE YFKKVL KSHFITK+PSEADLFFLPFSI +RHDPR+G +G+ DFIK YI Sbjct: 158 DPKGHYASELYFKKVLSKSHFITKNPSEADLFFLPFSIVEMRHDPRIGPEGMQDFIKDYI 217 Query: 734 SNISQNYPYWNRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKD 913 NIS YPYWNR+ G DHFYVACHSIGR AM+K KFN IQVVCSSSYF++GYI HKD Sbjct: 218 FNISHKYPYWNRTDGADHFYVACHSIGRFAMDKVFSAKFNVIQVVCSSSYFVAGYIPHKD 277 Query: 914 ASLPQIWPRQGDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPY 1093 AS+PQIWPRQ DPPN A SKRK+LAFFAG INSP R L++ W ND++IF HF RL TP Sbjct: 278 ASMPQIWPRQRDPPNSASSKRKQLAFFAGTINSPARLALIQAWGNDTDIFAHFERLRTPD 337 Query: 1094 SDELLGSKFCLHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATL 1273 +D+LLGSKFCLHVKGFEVNTAR+ DAIYYGCVPVI+ANHYDLPF DI+NWKSFS+VV + Sbjct: 338 ADQLLGSKFCLHVKGFEVNTARVADAIYYGCVPVILANHYDLPFGDIINWKSFSVVVHYM 397 Query: 1274 DIPLLKKILREVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRV 1450 DIP+LK IL+ +S +EY+ LQ N LKV+KHF+WN P DYDAFY +YELWLRRSS+RV Sbjct: 398 DIPVLKNILQRISLEEYSLLQSNTLKVRKHFQWNDPPTDYDAFYTTMYELWLRRSSVRV 456 >ref|XP_006346547.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X1 [Solanum tuberosum] Length = 452 Score = 555 bits (1429), Expect = e-155 Identities = 260/349 (74%), Positives = 303/349 (86%), Gaps = 1/349 (0%) Frame = +2 Query: 407 NGKYVN-KEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASES 583 NG +VN +VFHD+D F+++YKEMNRS KIYVYPH++DDP++N LL V+FEPGGNYASES Sbjct: 103 NGNHVNDNDVFHDRDAFVDNYKEMNRSLKIYVYPHQKDDPFSNVLLAVDFEPGGNYASES 162 Query: 584 YFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYW 763 YFKKVL SHFIT+DPS ADLFFLPFSIARLRHDPRVG+ GI DFIK+YI NIS YPYW Sbjct: 163 YFKKVLKMSHFITRDPSNADLFFLPFSIARLRHDPRVGINGIKDFIKSYIFNISHEYPYW 222 Query: 764 NRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQ 943 N ++G DHFYVACHSIGR AMEK +VK N IQVVC+SSYF+S YI HKDASLPQIWPR Sbjct: 223 NLTNGADHFYVACHSIGRFAMEKVVDVKINVIQVVCTSSYFVSAYIPHKDASLPQIWPRL 282 Query: 944 GDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFC 1123 G P+ A KRKKL FFAG++NSPVR KLLE W NDS+IFVH GRL Y++ELLGSKFC Sbjct: 283 GGNPDFAPYKRKKLGFFAGSLNSPVREKLLEWWGNDSDIFVHSGRLERSYTEELLGSKFC 342 Query: 1124 LHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILR 1303 LHVKGFEVNTARI DA++YGCVPVIIANHYDLPFADIL+WK FS++VATLDIPLLKKIL+ Sbjct: 343 LHVKGFEVNTARIVDALFYGCVPVIIANHYDLPFADILDWKHFSVIVATLDIPLLKKILQ 402 Query: 1304 EVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRV 1450 ++ EY LQ NVLKV++HF+W++SP+D+DAFYMV+YELWLRRSS+R+ Sbjct: 403 GITQQEYLVLQSNVLKVREHFQWHVSPIDFDAFYMVMYELWLRRSSLRL 451 >ref|XP_007204779.1| hypothetical protein PRUPE_ppa015806mg [Prunus persica] gi|462400310|gb|EMJ05978.1| hypothetical protein PRUPE_ppa015806mg [Prunus persica] Length = 331 Score = 555 bits (1429), Expect = e-155 Identities = 261/322 (81%), Positives = 292/322 (90%) Frame = +2 Query: 473 MNRSFKIYVYPHREDDPYANALLPVNFEPGGNYASESYFKKVLMKSHFITKDPSEADLFF 652 M RSFKIYVYPHR+DD +ANALLPV+ EPGGNYASES+FKKVLMKS FIT DP++ADLFF Sbjct: 1 MKRSFKIYVYPHRQDDSFANALLPVDSEPGGNYASESFFKKVLMKSRFITNDPTKADLFF 60 Query: 653 LPFSIARLRHDPRVGVQGIPDFIKTYISNISQNYPYWNRSSGTDHFYVACHSIGRSAMEK 832 LPFSIARLRHDPRVGV GI DFI+ YI N+SQ Y YWNR+ G DHFYVACHSIGRSAMEK Sbjct: 61 LPFSIARLRHDPRVGVGGIQDFIRDYIFNVSQKYQYWNRTGGADHFYVACHSIGRSAMEK 120 Query: 833 ATEVKFNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPNVAISKRKKLAFFAGAINS 1012 A+EVKFNAIQVVCSSSYFL GYI HKDA LPQIWPR+ +P ++ S R KLAFFAG INS Sbjct: 121 ASEVKFNAIQVVCSSSYFLPGYIPHKDACLPQIWPRKEEPHDLLSSNRTKLAFFAGGINS 180 Query: 1013 PVRTKLLEEWKNDSEIFVHFGRLTTPYSDELLGSKFCLHVKGFEVNTARIGDAIYYGCVP 1192 PVR KLL+ W+NDSEIF HFGRLTTPY+DELLGSKFCLHVKGFEVNTAR+ D++YYGCVP Sbjct: 181 PVREKLLQVWRNDSEIFAHFGRLTTPYADELLGSKFCLHVKGFEVNTARVADSLYYGCVP 240 Query: 1193 VIIANHYDLPFADILNWKSFSLVVATLDIPLLKKILREVSSDEYTTLQRNVLKVQKHFRW 1372 VIIAN+YDLPFADILNWKSFS++VATLDIPLLKKIL+ +SS+EYT LQ NVLKV+KHF+W Sbjct: 241 VIIANYYDLPFADILNWKSFSVIVATLDIPLLKKILKGISSEEYTRLQSNVLKVRKHFQW 300 Query: 1373 NLSPMDYDAFYMVIYELWLRRS 1438 +LSP+DYDAFYMV+YELWLRRS Sbjct: 301 HLSPIDYDAFYMVMYELWLRRS 322 >ref|XP_007030066.1| Exostosin family protein [Theobroma cacao] gi|508718671|gb|EOY10568.1| Exostosin family protein [Theobroma cacao] Length = 473 Score = 551 bits (1420), Expect = e-154 Identities = 270/476 (56%), Positives = 342/476 (71%), Gaps = 4/476 (0%) Frame = +2 Query: 35 SSSLFYYFSHRRLFDSFRTFF-FIPTILALITSLFILIYISSTSKLFFVHQHHQSHLLPK 211 SSS Y S R +F+ FF F+P LAL T L I IYIS+T + + H Q+ L + Sbjct: 4 SSSFLYQVSQHRFPATFKGFFYFLPISLALTTLLLIFIYISTTGDV--TNNHAQTTLYLE 61 Query: 212 XXXXXXXXXXXTHQIIQS-PSQNPPKTTFYEXXXXXXXXXXXXXXXXXXGTDENRRYWET 388 Q I + P +N + W Sbjct: 62 TLPGTASVSSLVDQTIPTIPFENNDNDDLFADPSRMARLA-------------RANQWFL 108 Query: 389 GRPIG-SNGKYVN-KEVFHDKDIFLEDYKEMNRSFKIYVYPHREDDPYANALLPVNFEPG 562 G G +NG Y N +EV+HD D+FLEDYK+MN+S KIYVYPH +DDP+AN LLP + + Sbjct: 109 GNLFGLTNGNYTNNQEVYHDGDLFLEDYKQMNKSLKIYVYPHSKDDPFANVLLPPDSDSK 168 Query: 563 GNYASESYFKKVLMKSHFITKDPSEADLFFLPFSIARLRHDPRVGVQGIPDFIKTYISNI 742 GNYASE FKK LMKSHFITKDP+EADLF++PFSI+ +R DPR+ V GIPDF+K+YISNI Sbjct: 169 GNYASELMFKKALMKSHFITKDPNEADLFYMPFSISPMRTDPRIDVHGIPDFVKSYISNI 228 Query: 743 SQNYPYWNRSSGTDHFYVACHSIGRSAMEKATEVKFNAIQVVCSSSYFLSGYIAHKDASL 922 ++ YPYWNR+ G DHFYVACHSIG+ A +KA + N IQ+VCSS+YF S Y+ HKDAS+ Sbjct: 229 TRKYPYWNRTGGADHFYVACHSIGKIAFDKAFVARLNVIQLVCSSTYFPSSYLPHKDASM 288 Query: 923 PQIWPRQGDPPNVAISKRKKLAFFAGAINSPVRTKLLEEWKNDSEIFVHFGRLTTPYSDE 1102 PQ+WPRQGDPPN+ S+RK+LAFFAGA+NSPVR LL+ W ND+EIF HFGRL TPYS++ Sbjct: 289 PQVWPRQGDPPNLLTSERKRLAFFAGAVNSPVRIALLKVWANDTEIFAHFGRLRTPYSEQ 348 Query: 1103 LLGSKFCLHVKGFEVNTARIGDAIYYGCVPVIIANHYDLPFADILNWKSFSLVVATLDIP 1282 LLGSKFC+HVKG+EVNTAR+ DA++YGCVPVI+ANHYDLPF DILNWKSF++VV +DIP Sbjct: 349 LLGSKFCIHVKGYEVNTARVADALFYGCVPVILANHYDLPFTDILNWKSFAVVVHHIDIP 408 Query: 1283 LLKKILREVSSDEYTTLQRNVLKVQKHFRWNLSPMDYDAFYMVIYELWLRRSSMRV 1450 +LKKIL+ +S++EY+ LQ N +KV+KHF+WN+ P+D+DAF+M +YELW RRS +RV Sbjct: 409 VLKKILQGISNEEYSMLQSNAVKVRKHFQWNVPPLDFDAFHMSLYELWKRRSVVRV 464