BLASTX nr result
ID: Cocculus23_contig00015049
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00015049 (2274 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_002283936.2| PREDICTED: uncharacterized protein LOC100268...   642   0.0
ref|XP_007204617.1| hypothetical protein PRUPE_ppa002387mg [Prun...   638   e-180
ref|XP_007012125.1| Exostosin family protein, putative isoform 2...   638   e-180
ref|XP_002281263.1| PREDICTED: probable glycosyltransferase At5g...   638   e-180
ref|XP_006476045.1| PREDICTED: probable glycosyltransferase At5g...   635   e-179
ref|XP_006451253.1| hypothetical protein CICLE_v10007651mg [Citr...   632   e-178
ref|XP_006450684.1| hypothetical protein CICLE_v10007698mg [Citr...   630   e-177
ref|XP_006476044.1| PREDICTED: probable glycosyltransferase At5g...   629   e-177
ref|XP_007012124.1| Exostosin family protein, putative isoform 1...   628   e-177
ref|XP_006353481.1| PREDICTED: probable glycosyltransferase At5g...   627   e-177
ref|XP_006476046.1| PREDICTED: probable glycosyltransferase At5g...   626   e-176
ref|XP_002309547.2| hypothetical protein POPTR_0006s25540g [Popu...   626   e-176
ref|XP_004287457.1| PREDICTED: probable glycosyltransferase At3g...   625   e-176
ref|XP_007225154.1| hypothetical protein PRUPE_ppa002395mg [Prun...   623   e-175
ref|XP_002324801.2| hypothetical protein POPTR_0018s00290g [Popu...   622   e-175
ref|NP_197468.2| Exostosin family protein [Arabidopsis thaliana]...   619   e-174
gb|EXB59796.1| putative glycosyltransferase [Morus notabilis]         619   e-174
ref|XP_004251626.1| PREDICTED: probable glycosyltransferase At5g...   619   e-174
ref|XP_002324438.2| hypothetical protein POPTR_0018s09250g [Popu...
617 e-174 ref|XP_006287301.1| hypothetical protein CARUB_v10000494mg [Caps... 614 e-173 >ref|XP_002283936.2| PREDICTED: uncharacterized protein LOC100268163 [Vitis vinifera] Length = 738 Score = 642 bits (1655), Expect = 0.0 Identities = 345/663 (52%), Positives = 431/663 (65%), Gaps = 20/663 (3%) Frame = +3 Query: 60 STAEDSVSKSAIIGISSDIKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGFSP 239 S A D S + + D++ +++T +E+ + ++ E F+ Sbjct: 78 SNASDLTDTSLFVEVVEDVEKSNVTVEFGDDNGTEGTDEDIEDGLALEREDLENIVEFNE 137 Query: 240 ESRSPV--------------GASEQANLMKPAN--KYSP-EKVVDLGKNFRIENVEDPHN 368 + P G K N K P +KVVD+ +E V + N Sbjct: 138 DDNGPKEKGGDTENFASESKGMDHVVEFTKDNNISKGLPFKKVVDMDGISALEYVNNQEN 197 Query: 369 GSVSEKTREPENANVQEQFHKTEN-GSSLDIIRKEDTNLRLDGTESGHIXXXXXXXXXXX 545 S +K E + K N G S D I K D +L S Sbjct: 198 SSDLKKDSEMRHIGSAVHIVKPPNEGISTDNIVKADASLTPSTPGSLGTTFKSHLLASPG 257 Query: 546 XXXXTDPAMPIKLDGNSSTSLKFVDSNISSVAKQLQDKISKDEKPESLKRFPAHSNVSSI 725 + K+ N + S ++ISSV K ++ +SKDE L+ A N +S Sbjct: 258 VDSLFNTTYIEKMASNGNASNHLTATDISSVGKPEKEILSKDENLLVLQSDLADLNNNSA 317 Query: 726 ITSNPNKDQWMK--PPSSVMSISEMNLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQN 899 +TSNP + + PP SV SI +MN L+R+ ASS + +PRW++PRD+E+L+AK QIQN Sbjct: 318 MTSNPGRKKMQSEMPPKSVTSIYDMNRRLVRHRASSRAMRPRWASPRDQEMLAAKLQIQN 377 Query: 900 AAITEKGDQELFPPLFRNVSVFKRSYELMERTLKVYVYKEGAKPIFHRPITKGIYASEGW 1079 A K D EL PLFRNVS+FKRSYELMER LKVYVYK+G KPIFH+PI KG+YASEGW Sbjct: 378 APRV-KNDPELHAPLFRNVSMFKRSYELMERILKVYVYKDGEKPIFHQPILKGLYASEGW 436 Query: 1080 FMKHMEGHKQFVVKSPRRAHLFYLPFSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAK 1259 FMK ME +K FVVK PR+A LFY+PFSSRMLE LYV NSH+ NL QYLK Y + I+AK Sbjct: 437 FMKLMERNKHFVVKDPRQAQLFYMPFSSRMLEYKLYVRNSHNRTNLRQYLKQYSEKIAAK 496 Query: 1260 YPFWNRTGGADHFLVACHDWAPTETRHAHMSNTIRALCNADVHESFVIGKDVSLPETLVI 1439 Y FWNRTGGADHFLVACHDWAP ETRH HM I+ALCNADV F IG+DVSLPET V Sbjct: 497 YRFWNRTGGADHFLVACHDWAPYETRH-HMEQCIKALCNADVTAGFKIGRDVSLPETYVR 555 Query: 1440 SPKEPAKDPGGKPPSQRQILAFFAGNMHGYLRPILLQHWENKDPDMKIFGRMGRGTRSKM 1619 S + P +D GGKPPS+R 
ILAF+AGNMHGYLRPILL++W++KDPDMKI+G M G SKM Sbjct: 556 SARNPLRDLGGKPPSERHILAFYAGNMHGYLRPILLKYWKDKDPDMKIYGPMPPGVASKM 615 Query: 1620 NYIQHMKSSKYCICAKGFEVNSPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVA 1799 NYIQHMKSSK+CIC KG+EVNSPRVVE+I YECVPVIISDN+VPPFF+VLDW AF++ +A Sbjct: 616 NYIQHMKSSKFCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFDVLDWGAFSIILA 675 Query: 1800 EKDIPNLKNILLSIPQKRYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQI 1979 EKDIPNLK++LLSIP +YL M L ++K+Q+HF+WH+KP+KYD+FHM LHSIWYNRVFQ+ Sbjct: 676 EKDIPNLKDVLLSIPNDKYLQMQLGVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNRVFQV 735 Query: 1980 KAK 1988 K + Sbjct: 736 KPR 738 >ref|XP_007204617.1| hypothetical protein PRUPE_ppa002387mg [Prunus persica] gi|462400148|gb|EMJ05816.1| hypothetical protein PRUPE_ppa002387mg [Prunus persica] Length = 678 Score = 638 bits (1646), Expect = e-180 Identities = 330/649 (50%), Positives = 431/649 (66%), Gaps = 5/649 (0%) Frame = +3 Query: 57 SSTAEDSVSKSAIIGISSDIKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGFS 236 SS V + SSD + S+ G+ + S + E H K + Sbjct: 66 SSAKSVMVRNPLTVHSSSDFIDVSMFSGVEKAAGNSGLGGEIGHDRGRKGKDVHKEIDLI 125 Query: 237 PESRSPVGASEQANLMKPANKYSPEKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQ 416 E + + + E VVD + + ++E+ NGSV +K + Sbjct: 126 LEEKGIDNTFANTIHRNVDHNFPSENVVDTNGSLALVSIENQENGSVQDKANVAKYGFPL 185 Query: 417 EQFHKTENGSSLDIIRKEDTNL---RLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLD 587 E+ +S + KE++NL + DG ++G P+ P+ L Sbjct: 186 ERIVLPNYETSTENTLKENSNLTAKKSDGVKTGF------------------PSSPLILP 227 Query: 588 GNSSTSLKFVDSNISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNPNKDQWMK-- 761 +S + ++++ S S K S S++ +NP + + Sbjct: 228 AAASLA-NATNASVGST---------------SFKSDVVTSKNGSVVMTNPGRKKMKSEL 271 Query: 762 PPSSVMSISEMNLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPP 941 PP S+ SI EMN +L+R+ ASS S +PRWS+ RD++IL+ K QI++ + D+EL+ P Sbjct: 272 PPKSITSIYEMNHILVRHRASSRSLRPRWSSVRDQDILAVKSQIEHPPVAIN-DRELYAP 330 Query: 942 LFRNVSVFKRSYELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVK 1121 LFRNVS+FKRSYELMERTLK+Y+YK+G KPIFH+PI KG+YASEGWFMK M+G+K+FVVK Sbjct: 331 
LFRNVSMFKRSYELMERTLKIYIYKDGNKPIFHQPILKGLYASEGWFMKLMQGYKRFVVK 390 Query: 1122 SPRRAHLFYLPFSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFL 1301 PR+AHLFY+PFSSRMLE +LYV NSH+ NL Q+LK+Y + I+AKYP+WNRTGGADHFL Sbjct: 391 DPRKAHLFYMPFSSRMLEYSLYVRNSHNRTNLRQFLKEYSEKIAAKYPYWNRTGGADHFL 450 Query: 1302 VACHDWAPTETRHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPP 1481 VACHDWAP ETRH HM ++ALCNADV F IG+DVSLPET V S + P +D GGKPP Sbjct: 451 VACHDWAPYETRH-HMERCMKALCNADVTGGFKIGRDVSLPETYVRSARNPLRDLGGKPP 509 Query: 1482 SQRQILAFFAGNMHGYLRPILLQHWENKDPDMKIFGRMGRGTRSKMNYIQHMKSSKYCIC 1661 SQRQILAF+AGNMHGYLRPILL++W+++DPDMKIFG M G SKMNYIQHMKSSKYCIC Sbjct: 510 SQRQILAFYAGNMHGYLRPILLEYWKDRDPDMKIFGPMPPGVASKMNYIQHMKSSKYCIC 569 Query: 1662 AKGFEVNSPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSI 1841 KG+EVNSPRVVE+I YECVPVIISDN+VPPFFEVL+W AF+V +AE+DIPNLK ILLSI Sbjct: 570 PKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLNWGAFSVILAERDIPNLKEILLSI 629 Query: 1842 PQKRYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988 P+++YL M ++K+Q+HF+WH++P+KYD+FHM LHSIWYNRVFQIK + Sbjct: 630 PEEKYLQMQRGVRKVQKHFLWHARPLKYDLFHMTLHSIWYNRVFQIKIR 678 >ref|XP_007012125.1| Exostosin family protein, putative isoform 2 [Theobroma cacao] gi|508782488|gb|EOY29744.1| Exostosin family protein, putative isoform 2 [Theobroma cacao] Length = 788 Score = 638 bits (1645), Expect = e-180 Identities = 318/562 (56%), Positives = 410/562 (72%) Frame = +3 Query: 303 SPEKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFHKTENGSSLDIIRKEDTNL 482 S E+ VDL KN ++ E N +V+E+ + E + + N S+ +I T+ Sbjct: 232 STEQFVDLNKNSTVDYAES-FNKTVAEEASKTEESFSLKNDTIDVNTSNNNIGNGNFTS- 289 Query: 483 RLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKFVDSNISSVAKQLQDKI 662 + T S T+ + ++ N T + V+S+ SS+ + + Sbjct: 290 SAESTGSSDTGLGSPLPALTPTNSSTNKTLENDVETNIQTPVVSVNSSTSSLEQHVTPSF 349 Query: 663 SKDEKPESLKRFPAHSNVSSIITSNPNKDQWMKPPSSVMSISEMNLVLLRNHASSGSEKP 842 K+EK E +K S+ +S T+ P + + P ++ +I++MN + ++ S S+ P Sbjct: 350 DKNEKVEEIKNNFTTSSDNSSPTNTPKVGKKPEMPPALTTIADMNNLFYQSRVSYYSKTP 409 Query: 843 
RWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRSYELMERTLKVYVYKEG 1022 RWS+ D+ +L+A+ QI+NA I K D L+ PLFRNVS+FKRSYELME TLKVYVY+EG Sbjct: 410 RWSSGADQVLLNARSQIENAPIV-KNDPRLYAPLFRNVSMFKRSYELMESTLKVYVYQEG 468 Query: 1023 AKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLPFSSRMLEATLYVPNSH 1202 +PI H PI KGIYASEGWFMK +E +K+FV K+PR AHLFYLPFSSRMLE TLYVP+SH Sbjct: 469 KRPIVHTPILKGIYASEGWFMKQLEANKKFVTKNPREAHLFYLPFSSRMLEETLYVPDSH 528 Query: 1203 SHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTETRHAHMSNTIRALCNAD 1382 +HKNL++YLK+Y+ +I+AKYPFWNRT GADHFLVACHDWAP+ETR HM+N IRALCN+D Sbjct: 529 NHKNLIEYLKNYVGIIAAKYPFWNRTEGADHFLVACHDWAPSETR-KHMANCIRALCNSD 587 Query: 1383 VHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAGNMHGYLRPILLQHWEN 1562 + E ++ GKDVSLPET V +P++P +D GGKPPS+R ILAFFAG+MHGYLRPILL+ W N Sbjct: 588 IREGYIFGKDVSLPETYVRNPQKPLRDLGGKPPSKRSILAFFAGSMHGYLRPILLEQWGN 647 Query: 1563 KDPDMKIFGRMGRGTRSKMNYIQHMKSSKYCICAKGFEVNSPRVVESILYECVPVIISDN 1742 KDPDMKIFG+M + KMNYIQHMKSSKYC+C +G+EVNSPRVVE+I Y CVPVIISDN Sbjct: 648 KDPDMKIFGKM-PNVKGKMNYIQHMKSSKYCLCPRGYEVNSPRVVEAIFYGCVPVIISDN 706 Query: 1743 YVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHLRIKKLQQHFMWHSKPVK 1922 +VPPFFEVL+WE+FAVFV EKDIPNLK ILLSIP+KR+ M LR+KK+QQHF+WH +P K Sbjct: 707 FVPPFFEVLNWESFAVFVLEKDIPNLKKILLSIPEKRFRQMQLRVKKIQQHFLWHPRPEK 766 Query: 1923 YDIFHMILHSIWYNRVFQIKAK 1988 YDIFHMILHS+WYNRVFQ+K + Sbjct: 767 YDIFHMILHSVWYNRVFQMKPR 788 >ref|XP_002281263.1| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera] Length = 675 Score = 638 bits (1645), Expect = e-180 Identities = 341/627 (54%), Positives = 439/627 (70%), Gaps = 8/627 (1%) Frame = +3 Query: 132 TDGIVRTTSASNMEE-ETKHATDVKSNAAEQSNGFSPESRSPVGASEQANLMKPANKYSP 308 +D + + + NM + +++DV + SN + E + ++ A++M A S Sbjct: 61 SDSLSKLGTMGNMTTAQGLNSSDVHAMHGIDSNAETMEGNNEGPKNDFASVMNGALDKSF 120 Query: 309 EKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFHKTENGSSLDIIRKEDTNLRL 488 D KN +E V + N S + + E++ E N SSL I+++D L Sbjct: 121 GLDED-NKNVTVEKVNNSGNRSALKNASKHESSLYLENITADSN-SSLGKIQEDDMALLS 178 Query: 489 
DGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKFVDSNI-------SSVAKQ 647 +E + PA+P + +++TSL +D + SSV + Sbjct: 179 QRSERSGVGLISPL-----------PALPQIISSSNTTSLTNLDPHPITLPPERSSVEED 227 Query: 648 LQDKISKDEKPESLKRFPAHSNVSSIITSNPNKDQWMKPPSSVMSISEMNLVLLRNHASS 827 ++KDEK E+ ++ SN SSI S P + + P+ V +ISEMN +L+++ ASS Sbjct: 228 AAHTLNKDEKAETSQKDLTLSNRSSI--SVPALETRPELPA-VTTISEMNDLLVQSRASS 284 Query: 828 GSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRSYELMERTLKVY 1007 S KPRWS+ D+E+L AK QI+NA I K D L L+RNVSVFKRSYELME TLKVY Sbjct: 285 RSMKPRWSSAVDKELLYAKSQIENAPII-KNDPGLHASLYRNVSVFKRSYELMENTLKVY 343 Query: 1008 VYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLPFSSRMLEATLY 1187 Y+EG +P+FH+P KGIYASEGWFMK M+ +K+FV K+ R+AHLFYLPFSS MLE LY Sbjct: 344 TYREGERPVFHQPPIKGIYASEGWFMKLMQANKKFVTKNGRKAHLFYLPFSSLMLEEALY 403 Query: 1188 VPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTETRHAHMSNTIRA 1367 VPNSHS KNL QYLK+Y+DMI AKYPFWNRTGGADHFLVACHDWAP+ET M+N+IRA Sbjct: 404 VPNSHSRKNLEQYLKNYLDMIGAKYPFWNRTGGADHFLVACHDWAPSETLKL-MANSIRA 462 Query: 1368 LCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAGNMHGYLRPILL 1547 LCN+D+ E F +GKDVSLPET V P+ P + GGKPPSQR+ILAFFAG+MHGY+RPILL Sbjct: 463 LCNSDIREGFKLGKDVSLPETCVRIPQNPLRQLGGKPPSQRRILAFFAGSMHGYVRPILL 522 Query: 1548 QHWENKDPDMKIFGRMGRGTRSKMNYIQHMKSSKYCICAKGFEVNSPRVVESILYECVPV 1727 ++WENKDPDMKI+GRM + + MNYIQHMKSSKYCICAKG+EVNSPRVVE+I YECVPV Sbjct: 523 KYWENKDPDMKIYGRMPKAKKGTMNYIQHMKSSKYCICAKGYEVNSPRVVEAIFYECVPV 582 Query: 1728 IISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHLRIKKLQQHFMWH 1907 IISDN+VPPFF VL+WE+FAVF+ EKDIPNLK+ILLSIP+K YL + +R+K++QQHF+WH Sbjct: 583 IISDNFVPPFFGVLNWESFAVFILEKDIPNLKSILLSIPEKSYLEIQMRVKQVQQHFLWH 642 Query: 1908 SKPVKYDIFHMILHSIWYNRVFQIKAK 1988 +KPVKYD+FHMILHS+WYNRV QI+ + Sbjct: 643 AKPVKYDVFHMILHSVWYNRVLQIRVR 669 >ref|XP_006476045.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X2 [Citrus sinensis] Length = 663 Score = 635 bits (1639), Expect = e-179 Identities = 346/649 (53%), Positives = 429/649 (66%), Gaps = 24/649 (3%) Frame 
= +3 Query: 114 IKNTSLTDGIVRTTSASNMEEETKHATDVKS--NAAEQSNGFSPESRSPVGASEQANLM- 284 ++N SL G +E +++ A+D + N+ N + + +E ANL Sbjct: 55 VENNSLVTG--------GLESKSEIASDAVNGLNSTGTHNVHEMANDTRTSKAEDANLQA 106 Query: 285 ----------KPANKYSPEKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFHKT 434 +P N EK+ L KN ++ V++ N EK RE E + +Q Sbjct: 107 DFDDGEDIHEEPTN----EKLEGLNKNSTVDTVQNAGNVPGPEKGRESEQSFIQRN---- 158 Query: 435 ENGSSLDIIRKEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKF 614 DI+ G +SG + D + I L G + ++ Sbjct: 159 ------DIM----------GGDSGGVGLSPIPVSPVM-----DLSSNITLQGANISTPIT 197 Query: 615 VDSNISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNPNKDQWMKPPSSVMSISEM 794 + SN SS K + K EKP N S + NK + P+ V++I+EM Sbjct: 198 IHSNSSSTDKDATPALDKIEKPAQSSLNTLGENSSGVDVPKENKKPEIPTPA-VITIAEM 256 Query: 795 NLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRS 974 +LL+N AS S +PRWS+ D+E+L A+ QI+NA + K D EL+ PL+RNVS FKRS Sbjct: 257 KNMLLQNRASYRSMRPRWSSAVDQEMLYARSQIENAPLL-KNDHELYAPLYRNVSRFKRS 315 Query: 975 YELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLP 1154 YELME TLKVYVYKEG +PI H P+ KGIYASEGWFMK +E +KQFV K R+AHLFYLP Sbjct: 316 YELMEETLKVYVYKEGQRPILHEPVLKGIYASEGWFMKQLEANKQFVTKDSRKAHLFYLP 375 Query: 1155 FSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTET 1334 FSSRMLE TLYV NSH+HKNL+QYL++Y+++ISAK+ FWNRT GADHFLVACHDWAP ET Sbjct: 376 FSSRMLEETLYVQNSHNHKNLIQYLRNYVNLISAKHNFWNRTEGADHFLVACHDWAPAET 435 Query: 1335 RHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAG 1514 R M+N IRALCN+DV E FV GKDV+LPET V+SP+ P + GGKP SQR ILAFFAG Sbjct: 436 R-IIMANCIRALCNSDVKEGFVFGKDVALPETYVLSPQNPLRAIGGKPASQRSILAFFAG 494 Query: 1515 NMHGYLRPILLQHWENKDPDMKIFGRM----------GRGTR-SKMNYIQHMKSSKYCIC 1661 MHGYLRPILL HWENKDPDMKIFG+M G+G R KM+YIQHMKSSKYCIC Sbjct: 495 RMHGYLRPILLHHWENKDPDMKIFGQMPMVKGKGKGKGKGKRKGKMDYIQHMKSSKYCIC 554 Query: 1662 AKGFEVNSPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSI 1841 AKG+EVNSPRVVE+I YECVPVIISDN+VPPFFE+L+WE+FAVFV EKDIPNLKNILLSI Sbjct: 555 
AKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEILNWESFAVFVLEKDIPNLKNILLSI 614 Query: 1842 PQKRYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988 +KRY M +R+KK+QQHF+WH +PVKYDIFHM+LHSIWYNRVF +A+ Sbjct: 615 SEKRYRRMQMRVKKVQQHFLWHPQPVKYDIFHMLLHSIWYNRVFLARAR 663 >ref|XP_006451253.1| hypothetical protein CICLE_v10007651mg [Citrus clementina] gi|568883066|ref|XP_006494321.1| PREDICTED: probable glycosyltransferase At5g03795-like [Citrus sinensis] gi|557554479|gb|ESR64493.1| hypothetical protein CICLE_v10007651mg [Citrus clementina] Length = 677 Score = 632 bits (1631), Expect = e-178 Identities = 334/629 (53%), Positives = 427/629 (67%), Gaps = 1/629 (0%) Frame = +3 Query: 105 SSDIKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGFSPESRSPVGASEQANLM 284 +SD+ + S+ G + S +T + ++ + +NG E + Q N Sbjct: 80 ASDLMSDSVFKGSLEDDEDSKFGSDTGDDSGLREVDGDTNNGIVSEGKG------QDN-- 131 Query: 285 KPANKYSPEKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFHKTENGSSLDIIR 464 P + +V D + ENV+D ++ S E R EN+ E + + L I Sbjct: 132 -PIELVTDREVDD---DSVAENVKDLNDLSELEIERIGENSATVEPAGEAKQSLPLKQIV 187 Query: 465 KEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKFVDSNISSVAK 644 + + + DG H P I T LK +SN SS A+ Sbjct: 188 QPNLEIVSDGVPEQHTSQSIANIGGEKTLSIVSPLTNI-------THLKTEESNASSAAR 240 Query: 645 QLQDKISKDEKPESLKRFPAHSNVSSIITS-NPNKDQWMKPPSSVMSISEMNLVLLRNHA 821 + K + S+ N+S++I S K + PP +V SI EMN +L+R+H Sbjct: 241 SA---VPKSDIATSV-------NISALIGSPGKKKMRCNMPPKTVTSIFEMNDILMRHHR 290 Query: 822 SSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRSYELMERTLK 1001 SS + +PRWS+ RD+E+L+AK +I+ A+++ DQEL PLFRNVS+FKRSYELM+RTLK Sbjct: 291 SSRAMRPRWSSVRDKEVLAAKTEIEKASVSVS-DQELHAPLFRNVSMFKRSYELMDRTLK 349 Query: 1002 VYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLPFSSRMLEAT 1181 VYVY++G KPIFH+PI KG+YASEGWFMK MEG+K F VK PR+AHLFY+PFSSRMLE Sbjct: 350 VYVYRDGKKPIFHQPILKGLYASEGWFMKLMEGNKHFAVKDPRKAHLFYMPFSSRMLEYA 409 Query: 1182 LYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTETRHAHMSNTI 1361 LYV NSH+ NL QYLK+Y + I+AKY +WNRTGGADHFLVACHDWAP ETRH HM + I Sbjct: 410 
LYVRNSHNRTNLRQYLKEYAESIAAKYRYWNRTGGADHFLVACHDWAPYETRH-HMEHCI 468 Query: 1362 RALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAGNMHGYLRPI 1541 +ALCNADV F +G+DVSLPET V S + P +D GGKPPSQR ILAF+AGN+HGYLRPI Sbjct: 469 KALCNADVTAGFKLGRDVSLPETYVRSARNPLRDLGGKPPSQRHILAFYAGNLHGYLRPI 528 Query: 1542 LLQHWENKDPDMKIFGRMGRGTRSKMNYIQHMKSSKYCICAKGFEVNSPRVVESILYECV 1721 LL++W++KDPDMKIFG M G SKMNYIQHMKSSKYCIC KG+EVNSPRVVESI YECV Sbjct: 529 LLKYWKDKDPDMKIFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVESIFYECV 588 Query: 1722 PVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHLRIKKLQQHFM 1901 PVIISDN+VPPF+EVL+WEAF+V +AE++IPNLK+ILLSIP+K+Y M ++KLQ+HF+ Sbjct: 589 PVIISDNFVPPFYEVLNWEAFSVIIAEENIPNLKDILLSIPEKKYFEMQFAVRKLQRHFL 648 Query: 1902 WHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988 WH+KP KYD+FHM LHSIWYNRV+QIK + Sbjct: 649 WHAKPEKYDLFHMTLHSIWYNRVYQIKPR 677 >ref|XP_006450684.1| hypothetical protein CICLE_v10007698mg [Citrus clementina] gi|557553910|gb|ESR63924.1| hypothetical protein CICLE_v10007698mg [Citrus clementina] Length = 652 Score = 630 bits (1624), Expect = e-177 Identities = 345/637 (54%), Positives = 423/637 (66%), Gaps = 12/637 (1%) Frame = +3 Query: 114 IKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGFSPESRSPVGASEQANLMKPA 293 ++N SL G S S + +T A + S + + ++R+ +E ANL Sbjct: 55 VENNSLVTG--GPESKSEIASDT--ANGLNSTGTHNVHEMANDTRT--SKAEDANLQDDF 108 Query: 294 -----NKYSP--EKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFHKTENGSSL 452 N P EK+ +L KN ++ V++ NG EK RE E + +Q G+ L Sbjct: 109 YDGEDNHEEPMTEKLEELNKNSTVDTVQNAGNGPGPEKGRESEQSFIQRN---DSGGAGL 165 Query: 453 DIIRKEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKFVDSNIS 632 I D + I L G + ++ + SN S Sbjct: 166 SPIPVSPV---------------------------MDLSSNITLQGANISTPITIHSNSS 198 Query: 633 SVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNPNKDQWMKPPSSVMSISEMNLVLLR 812 S K + K EKP N S + NK + P+ V++I+EM +LL+ Sbjct: 199 STDKDATPALDKIEKPAQSSLNTLGENSSGVDVPKENKKPEIPTPA-VITIAEMKNMLLQ 257 Query: 813 NHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRSYELMER 992 N AS S PR S+ D+E+L A+ QI+NA + K D EL+ 
PL+RNVS FKRSYELME Sbjct: 258 NRASYRSMSPRLSSAVDQEMLYARSQIENAPLL-KNDHELYAPLYRNVSRFKRSYELMEE 316 Query: 993 TLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLPFSSRML 1172 TLKVYVYKEG +PI H P+ KGIYASEGWFMK +E +KQFV K R+AHLFYLPFSSRML Sbjct: 317 TLKVYVYKEGQRPILHEPVLKGIYASEGWFMKQLEANKQFVTKDSRKAHLFYLPFSSRML 376 Query: 1173 EATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTETRHAHMS 1352 E TLYV NSH+HKNL+QYL++Y+++ISAK+ FWNRT GADHFLVACHDWAP ETR M+ Sbjct: 377 EETLYVQNSHNHKNLIQYLRNYVNLISAKHNFWNRTEGADHFLVACHDWAPAETR-IIMA 435 Query: 1353 NTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAGNMHGYL 1532 N IRALCN+DV + FV GKDVSLPET V+SP+ P GGKP SQR ILAFFAG+MHGYL Sbjct: 436 NCIRALCNSDVKQGFVFGKDVSLPETNVLSPQNPLWAIGGKPASQRSILAFFAGSMHGYL 495 Query: 1533 RPILLQHWENKDPDMKIFGRM----GRGTR-SKMNYIQHMKSSKYCICAKGFEVNSPRVV 1697 RPILL HWENKDPDMKIFG+M GRG R KM+YIQHMKSSKYCICAKG+EV+SPRVV Sbjct: 496 RPILLHHWENKDPDMKIFGQMPKAKGRGKRKGKMDYIQHMKSSKYCICAKGYEVHSPRVV 555 Query: 1698 ESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHLRI 1877 E+I YECVPVIISDN+VPPFFE+L+WE+FAVFV EKDIPNLKNILLSI +KRY M + + Sbjct: 556 EAIFYECVPVIISDNFVPPFFEILNWESFAVFVLEKDIPNLKNILLSISEKRYRKMQMMV 615 Query: 1878 KKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988 KK+QQHF+WH +PVKYDIFHMILHSIWYNRVF +A+ Sbjct: 616 KKVQQHFLWHPRPVKYDIFHMILHSIWYNRVFLARAR 652 >ref|XP_006476044.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X1 [Citrus sinensis] Length = 670 Score = 629 bits (1621), Expect = e-177 Identities = 346/656 (52%), Positives = 429/656 (65%), Gaps = 31/656 (4%) Frame = +3 Query: 114 IKNTSLTDGIVRTTSASNMEEETKHATDVKS--NAAEQSNGFSPESRSPVGASEQANLM- 284 ++N SL G +E +++ A+D + N+ N + + +E ANL Sbjct: 55 VENNSLVTG--------GLESKSEIASDAVNGLNSTGTHNVHEMANDTRTSKAEDANLQA 106 Query: 285 ----------KPANKYSPEKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFHKT 434 +P N EK+ L KN ++ V++ N EK RE E + +Q Sbjct: 107 DFDDGEDIHEEPTN----EKLEGLNKNSTVDTVQNAGNVPGPEKGRESEQSFIQRN---- 158 Query: 435 ENGSSLDIIRKEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKF 614 
DI+ G +SG + D + I L G + ++ Sbjct: 159 ------DIM----------GGDSGGVGLSPIPVSPVM-----DLSSNITLQGANISTPIT 197 Query: 615 VDSNISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNPNKDQWMKPPSSVMSISEM 794 + SN SS K + K EKP N S + NK + P+ V++I+EM Sbjct: 198 IHSNSSSTDKDATPALDKIEKPAQSSLNTLGENSSGVDVPKENKKPEIPTPA-VITIAEM 256 Query: 795 NLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKR- 971 +LL+N AS S +PRWS+ D+E+L A+ QI+NA + K D EL+ PL+RNVS FKR Sbjct: 257 KNMLLQNRASYRSMRPRWSSAVDQEMLYARSQIENAPLL-KNDHELYAPLYRNVSRFKRF 315 Query: 972 ------SYELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRR 1133 SYELME TLKVYVYKEG +PI H P+ KGIYASEGWFMK +E +KQFV K R+ Sbjct: 316 YNAICRSYELMEETLKVYVYKEGQRPILHEPVLKGIYASEGWFMKQLEANKQFVTKDSRK 375 Query: 1134 AHLFYLPFSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACH 1313 AHLFYLPFSSRMLE TLYV NSH+HKNL+QYL++Y+++ISAK+ FWNRT GADHFLVACH Sbjct: 376 AHLFYLPFSSRMLEETLYVQNSHNHKNLIQYLRNYVNLISAKHNFWNRTEGADHFLVACH 435 Query: 1314 DWAPTETRHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQ 1493 DWAP ETR M+N IRALCN+DV E FV GKDV+LPET V+SP+ P + GGKP SQR Sbjct: 436 DWAPAETRII-MANCIRALCNSDVKEGFVFGKDVALPETYVLSPQNPLRAIGGKPASQRS 494 Query: 1494 ILAFFAGNMHGYLRPILLQHWENKDPDMKIFGRM----------GRGTRS-KMNYIQHMK 1640 ILAFFAG MHGYLRPILL HWENKDPDMKIFG+M G+G R KM+YIQHMK Sbjct: 495 ILAFFAGRMHGYLRPILLHHWENKDPDMKIFGQMPMVKGKGKGKGKGKRKGKMDYIQHMK 554 Query: 1641 SSKYCICAKGFEVNSPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNL 1820 SSKYCICAKG+EVNSPRVVE+I YECVPVIISDN+VPPFFE+L+WE+FAVFV EKDIPNL Sbjct: 555 SSKYCICAKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEILNWESFAVFVLEKDIPNL 614 Query: 1821 KNILLSIPQKRYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988 KNILLSI +KRY M +R+KK+QQHF+WH +PVKYDIFHM+LHSIWYNRVF +A+ Sbjct: 615 KNILLSISEKRYRRMQMRVKKVQQHFLWHPQPVKYDIFHMLLHSIWYNRVFLARAR 670 >ref|XP_007012124.1| Exostosin family protein, putative isoform 1 [Theobroma cacao] gi|508782487|gb|EOY29743.1| Exostosin family protein, putative isoform 1 [Theobroma cacao] Length = 802 Score = 628 bits (1620), Expect = e-177 Identities = 
318/576 (55%), Positives = 410/576 (71%), Gaps = 14/576 (2%) Frame = +3 Query: 303 SPEKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFHKTENGSSLDIIRKEDTNL 482 S E+ VDL KN ++ E N +V+E+ + E + + N S+ +I T+ Sbjct: 232 STEQFVDLNKNSTVDYAES-FNKTVAEEASKTEESFSLKNDTIDVNTSNNNIGNGNFTS- 289 Query: 483 RLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKFVDSNISSVAKQLQDKI 662 + T S T+ + ++ N T + V+S+ SS+ + + Sbjct: 290 SAESTGSSDTGLGSPLPALTPTNSSTNKTLENDVETNIQTPVVSVNSSTSSLEQHVTPSF 349 Query: 663 SKDEKPESLKRFPAHSNVSSIITSNPNKDQWMKPPSSVMSISEMNLVLLRNHASSGSEKP 842 K+EK E +K S+ +S T+ P + + P ++ +I++MN + ++ S S+ P Sbjct: 350 DKNEKVEEIKNNFTTSSDNSSPTNTPKVGKKPEMPPALTTIADMNNLFYQSRVSYYSKTP 409 Query: 843 RWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFK--------------RSYE 980 RWS+ D+ +L+A+ QI+NA I K D L+ PLFRNVS+FK RSYE Sbjct: 410 RWSSGADQVLLNARSQIENAPIV-KNDPRLYAPLFRNVSMFKSQVHNVYTICIINFRSYE 468 Query: 981 LMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLPFS 1160 LME TLKVYVY+EG +PI H PI KGIYASEGWFMK +E +K+FV K+PR AHLFYLPFS Sbjct: 469 LMESTLKVYVYQEGKRPIVHTPILKGIYASEGWFMKQLEANKKFVTKNPREAHLFYLPFS 528 Query: 1161 SRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTETRH 1340 SRMLE TLYVP+SH+HKNL++YLK+Y+ +I+AKYPFWNRT GADHFLVACHDWAP+ETR Sbjct: 529 SRMLEETLYVPDSHNHKNLIEYLKNYVGIIAAKYPFWNRTEGADHFLVACHDWAPSETRK 588 Query: 1341 AHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAGNM 1520 HM+N IRALCN+D+ E ++ GKDVSLPET V +P++P +D GGKPPS+R ILAFFAG+M Sbjct: 589 -HMANCIRALCNSDIREGYIFGKDVSLPETYVRNPQKPLRDLGGKPPSKRSILAFFAGSM 647 Query: 1521 HGYLRPILLQHWENKDPDMKIFGRMGRGTRSKMNYIQHMKSSKYCICAKGFEVNSPRVVE 1700 HGYLRPILL+ W NKDPDMKIFG+M + KMNYIQHMKSSKYC+C +G+EVNSPRVVE Sbjct: 648 HGYLRPILLEQWGNKDPDMKIFGKMPN-VKGKMNYIQHMKSSKYCLCPRGYEVNSPRVVE 706 Query: 1701 SILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHLRIK 1880 +I Y CVPVIISDN+VPPFFEVL+WE+FAVFV EKDIPNLK ILLSIP+KR+ M LR+K Sbjct: 707 AIFYGCVPVIISDNFVPPFFEVLNWESFAVFVLEKDIPNLKKILLSIPEKRFRQMQLRVK 766 Query: 1881 KLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988 K+QQHF+WH +P 
KYDIFHMILHS+WYNRVFQ+K + Sbjct: 767 KIQQHFLWHPRPEKYDIFHMILHSVWYNRVFQMKPR 802 >ref|XP_006353481.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X1 [Solanum tuberosum] gi|565373856|ref|XP_006353482.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X2 [Solanum tuberosum] Length = 674 Score = 627 bits (1617), Expect = e-177 Identities = 330/638 (51%), Positives = 428/638 (67%), Gaps = 10/638 (1%) Frame = +3 Query: 105 SSDIKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGFSPESRSPVGASEQANLM 284 SS +++T + + T S+ + H N +G ES + + + + Sbjct: 65 SSVVESTKVGESFSGTLSSFDDVHMLAHRLKTVDNGDVSEDGEIDESVN------EKDEV 118 Query: 285 KPANKYSPEKVVDLGKNF----------RIENVEDPHNGSVSEKTREPENANVQEQFHKT 434 KP + +S K ++ +F + V D + ++K E EQ KT Sbjct: 119 KPHSNHSVVKTMENDSDFVEDAILENDNLFDEVVDMDEETTTQKNNESRRDLSLEQVVKT 178 Query: 435 ENGSSLDIIRKEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKF 614 S D + N L+ T++ + T+ + + N +L Sbjct: 179 NGELSADSELDANRNSVLNDTKAASV---------------TNSSSVVA--SNQLDNLPL 221 Query: 615 VDSNISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNPNKDQWMKPPSSVMSISEM 794 V + + + S + + L P H N S + ++ K + M PP +V SIS+M Sbjct: 222 VTIGEINFIRTTSNNSSTGDLTQLL---PNHGNHSLVQSTVKKKMRCMLPPKTVTSISQM 278 Query: 795 NLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRS 974 +L+R+ A S + +PRWS+ RD+EIL+A+ QI+NA + + D+EL+ P FRN+S+FKRS Sbjct: 279 ERLLVRHRARSRAMRPRWSSERDKEILAARLQIENAPLL-RNDRELYAPAFRNMSMFKRS 337 Query: 975 YELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLP 1154 YELMER LKVYVYKEG KPIFH+PI KG+YASEGWFMK MEG+ +FVVK PR+AHLFYLP Sbjct: 338 YELMERILKVYVYKEGEKPIFHQPIMKGLYASEGWFMKLMEGNNRFVVKDPRKAHLFYLP 397 Query: 1155 FSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTET 1334 FSSRMLE +LYV NSH+ NL QYLKDY + I+AKY FWNRTGGADHFLVACHDWAP ET Sbjct: 398 FSSRMLEHSLYVHNSHNRTNLRQYLKDYSEKIAAKYRFWNRTGGADHFLVACHDWAPYET 457 Query: 1335 RHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAG 1514 RH HM + I+ALCNADV F IG+DVSLPET V S + P +D GGKPPSQR++LAF+AG Sbjct: 458 
RH-HMEHCIKALCNADVTLGFKIGRDVSLPETYVRSARNPLRDLGGKPPSQRKVLAFYAG 516 Query: 1515 NMHGYLRPILLQHWENKDPDMKIFGRMGRGTRSKMNYIQHMKSSKYCICAKGFEVNSPRV 1694 NMHGYLRPILL+HW++KDPDM+IFG M G SKMNYIQHMKSSK+CIC KG+EVNSPRV Sbjct: 517 NMHGYLRPILLEHWKDKDPDMEIFGPMPSGVASKMNYIQHMKSSKFCICPKGYEVNSPRV 576 Query: 1695 VESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHLR 1874 VE+I YECVPVIISDN+VPPFF VL+W+ F++ +AEKDIPNLK+ILLSIP+ +YL M L Sbjct: 577 VEAIFYECVPVIISDNFVPPFFGVLNWDTFSLILAEKDIPNLKSILLSIPENKYLEMQLA 636 Query: 1875 IKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988 ++K+Q+HF+WH+KPVKYD+FHM LHSIWYNRVFQ KA+ Sbjct: 637 VRKVQRHFLWHAKPVKYDLFHMTLHSIWYNRVFQTKAR 674 >ref|XP_006476046.1| PREDICTED: probable glycosyltransferase At5g03795-like [Citrus sinensis] Length = 653 Score = 626 bits (1615), Expect = e-176 Identities = 343/639 (53%), Positives = 425/639 (66%), Gaps = 14/639 (2%) Frame = +3 Query: 114 IKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGFSPESRSPVGASEQANLMKPA 293 ++N SL G S S + +T A + S + + ++R+ +E ANL Sbjct: 55 VENNSLVTG--GPESKSEIASDT--ANGLNSTGTHNVHEMANDTRT--SKAEDANLQDDF 108 Query: 294 -----NKYSP--EKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFHKTENGSSL 452 N P EK+ +L KN ++ V++ NG EK RE E + +Q S + Sbjct: 109 YDGEDNHEEPMTEKLEELNKNSTVDTVQNAGNGPGPEKGRESEQSFIQRNDSGGAGLSPI 168 Query: 453 DIIRKED--TNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKFVDSN 626 + D +N+ L G N ST +DSN Sbjct: 169 PVSPVMDLSSNITLQGA-------------------------------NISTPPITIDSN 197 Query: 627 ISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNPNKDQWMKPPSSVMSISEMNLVL 806 SS+ + K EKP N S + NK + P+ V++I+EM +L Sbjct: 198 TSSMDMDATPALVKIEKPAQSSLNTLGENSSGVDVPKENKKPEIPTPA-VITIAEMKNML 256 Query: 807 LRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRSYELM 986 L+N AS S +PR S+ D+E+L A+ QI+NA + K D EL+ PL+R+VS FKRSYELM Sbjct: 257 LQNRASYRSMRPRLSSAVDQEMLYARSQIENAPLL-KNDHELYAPLYRSVSRFKRSYELM 315 Query: 987 ERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLPFSSR 1166 E TLKVYVYKEG +PI H P+ KGIYASEGWFMK +E +KQFV + R+AHLFYLPFSSR Sbjct: 316 
EETLKVYVYKEGQRPILHEPVLKGIYASEGWFMKQLEANKQFVTRDSRKAHLFYLPFSSR 375 Query: 1167 MLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTETRHAH 1346 MLE TLYV NSH+HK+L+QYL++Y++MISAK+ FWNRT GADHFLVACHDWAP ETR Sbjct: 376 MLEETLYVQNSHNHKDLIQYLRNYVNMISAKHNFWNRTEGADHFLVACHDWAPAETR-II 434 Query: 1347 MSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAGNMHG 1526 M+N IRALCN+DV + FV GKDVSLPET V+SP+ P GGKP SQR ILAFFAG+MHG Sbjct: 435 MANCIRALCNSDVKQGFVFGKDVSLPETNVLSPQNPLWAIGGKPASQRSILAFFAGSMHG 494 Query: 1527 YLRPILLQHWENKDPDMKIFGRM----GRGTR-SKMNYIQHMKSSKYCICAKGFEVNSPR 1691 YLRPILL HWENKDPDMKIFG+M GRG R K +YIQHMKSSKYCICAKG+EV+SPR Sbjct: 495 YLRPILLHHWENKDPDMKIFGQMPKAKGRGKRKGKTDYIQHMKSSKYCICAKGYEVHSPR 554 Query: 1692 VVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHL 1871 VVE+I YECVPVIISDN+VPPFFE+L+WE+FAVFV E+DIPNLKNILLSI +KRYL M + Sbjct: 555 VVEAIFYECVPVIISDNFVPPFFEILNWESFAVFVLERDIPNLKNILLSISEKRYLKMQM 614 Query: 1872 RIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988 +KK+QQHF+WH +PVKYDIFHMILHSIWYNRVF +A+ Sbjct: 615 MVKKVQQHFLWHPRPVKYDIFHMILHSIWYNRVFLARAR 653 >ref|XP_002309547.2| hypothetical protein POPTR_0006s25540g [Populus trichocarpa] gi|550337072|gb|EEE93070.2| hypothetical protein POPTR_0006s25540g [Populus trichocarpa] Length = 705 Score = 626 bits (1615), Expect = e-176 Identities = 336/651 (51%), Positives = 430/651 (66%), Gaps = 26/651 (3%) Frame = +3 Query: 114 IKNTSLTDGIVRTTSASNMEEET--KHATDVKSNAAEQSNGFSPESRSPVGASEQANLMK 287 + NT+ ++G+ T + + +ET H T+ +N +N PE G +E + + Sbjct: 70 VSNTTQSNGLNTTAISPDRAQETDNSHGTETPANV---NNDVVPERSR--GLNESSLIDS 124 Query: 288 PANKYSPEKVVDLGKNFRIENVEDPHNGSVSE---------------KTREPENANVQEQ 422 + SPE++VD N + HNG VSE K +PE V E Sbjct: 125 RGKESSPEQLVDTNTN----STSYVHNGVVSEGISGLNKSSGIDNHGKESKPEQL-VMEP 179 Query: 423 FHKTENGSS-----LDIIRKEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLD 587 + NGS+ + R++ T++ G + P + ++++ Sbjct: 180 VNSLGNGSAPQETERSLSREDVTSI---SENIGASDARIAPIAPELLPVDSPPNITLQMN 236 Query: 588 
GNSSTSLKFV--DSNISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNPN-KDQWM 758 ST V +SN S V K + D K K+ + + +TS P K + Sbjct: 237 AEPSTIAHIVPIESNTSKVDKDAAPSLENDGKTGDQKKDLTLLHNNPSVTSFPEVKKEPQ 296 Query: 759 KPPSSVMSISEMNLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFP 938 P V+SISEM + L+ +S S +PRW + D+E+L+AK QIQNA I E D L+ Sbjct: 297 TPSLEVVSISEMKNLQLQRWSSPNSRRPRWPSVVDQELLNAKSQIQNAPIVEN-DPVLYA 355 Query: 939 PLFRNVSVFKRSYELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVV 1118 PL+ N+S+FK+SYELME LKVY+YKEG PIFH+P+ GIYASEGWFMK +EG+K+FV Sbjct: 356 PLYWNISMFKKSYELMEDILKVYIYKEGEMPIFHQPLLNGIYASEGWFMKLLEGNKKFVT 415 Query: 1119 KSPRRAHLFYLPFSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHF 1298 K ++AHLFYLPFSSR LE LYVPNSHSHKNL++YLK Y+DMIS KYPFWNRT GADHF Sbjct: 416 KDSKKAHLFYLPFSSRYLEIRLYVPNSHSHKNLIEYLKKYLDMISEKYPFWNRTQGADHF 475 Query: 1299 LVACHDWAPTETRHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKP 1478 L ACHDWAP+ETR HM+N IRALCN+D E FV GKD SLPET V++ + P +D GG Sbjct: 476 LAACHDWAPSETRQ-HMANCIRALCNSDAKEDFVYGKDASLPETYVLTQENPLRDLGGNR 534 Query: 1479 PSQRQILAFFAGNMHGYLRPILLQHWENKDPDMKIFGRMGR-GTRSKMNYIQHMKSSKYC 1655 S+R ILAFFAG+MHGYLRPILLQHWENKDPDMKIFGR+ + R KMNY ++MKSSKYC Sbjct: 535 ASKRSILAFFAGSMHGYLRPILLQHWENKDPDMKIFGRLPKVKGRGKMNYARYMKSSKYC 594 Query: 1656 ICAKGFEVNSPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILL 1835 ICAKG+EVNSPRVVE+I YECVPVIISDN+VPPF EVL+WE+FAVFV EKDIPNLK ILL Sbjct: 595 ICAKGYEVNSPRVVEAIFYECVPVIISDNFVPPFLEVLNWESFAVFVLEKDIPNLKKILL 654 Query: 1836 SIPQKRYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988 SIP K+Y M +R+K++QQHF+WH++PVKYD+FHMILHSIWYNRVFQ++ + Sbjct: 655 SIPAKKYRRMQMRVKRVQQHFLWHARPVKYDVFHMILHSIWYNRVFQMQPR 705 >ref|XP_004287457.1| PREDICTED: probable glycosyltransferase At3g07620-like [Fragaria vesca subsp. 
vesca] Length = 686 Score = 625 bits (1613), Expect = e-176 Identities = 336/657 (51%), Positives = 436/657 (66%), Gaps = 16/657 (2%) Frame = +3 Query: 60 STAEDSVSKSAIIGISSDIKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGFSP 239 S+A+ + ++ + SSD+ + G+ + S++ ET V + + GF Sbjct: 65 SSAKSVMVRNPLTVNSSDLIDAPRFGGVEKYADNSSLGGET-----VDKSEPNEKEGFK- 118 Query: 240 ESRSPVGASEQANLMKPA------NKYSPEKVVDLGKNFRIENVEDPHNGSVSEKTRE-- 395 E S + E N + A + VD + + ++ NGS KT E Sbjct: 119 EIDSVLEEKEMDNTFEHAADRNVDENFPSGNGVDTDASLTLVSISKEENGSNLVKTNEAS 178 Query: 396 ---PENANVQEQFHKTENGSSLDIIRKEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDP 566 PE + + TEN +++ + +G ++ TD Sbjct: 179 YDFPEPTVLSKDEVSTENTLEVNMTMAAKHS---EGVKTIFPSSPLILPATASFTHQTDV 235 Query: 567 AMPIKLDGNSSTSL--KFVDSNISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNP 740 L N+S+S+ F++S+I ++ K +SL R ++P Sbjct: 236 TYVSYLVSNASSSVGSAFLESDIVTI------------KNDSLTR------------TSP 271 Query: 741 NKDQWMK---PPSSVMSISEMNLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAIT 911 K + MK PP S+ SI EMNL L+R+HA + +PRWS+ RD++IL+ K QIQ+ + Sbjct: 272 GK-KMMKCNMPPKSITSIDEMNLTLVRHHAKPRALRPRWSSVRDQDILAVKSQIQHPPVA 330 Query: 912 EKGDQELFPPLFRNVSVFKRSYELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKH 1091 K D+EL+ PL+RNVS+FKRSYELMERTLKVY+YKEG KPIFH+PI KG+YASEGWFMK Sbjct: 331 -KNDRELYAPLYRNVSMFKRSYELMERTLKVYIYKEGNKPIFHQPIMKGLYASEGWFMKL 389 Query: 1092 MEGHKQFVVKSPRRAHLFYLPFSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFW 1271 MEG K+FVVK PR+AHLFY+PFSSRMLE TLYV NSH+ L QYLK+Y + I+AKYPFW Sbjct: 390 MEGDKRFVVKDPRKAHLFYMPFSSRMLEFTLYVRNSHNRTKLRQYLKEYSETIAAKYPFW 449 Query: 1272 NRTGGADHFLVACHDWAPTETRHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKE 1451 NRTGGADHFLVACHDWAP ETRH HM I+ALCNADV + F IG+D+SLPET V S + Sbjct: 450 NRTGGADHFLVACHDWAPYETRH-HMERCIKALCNADVTQGFKIGRDISLPETYVRSARN 508 Query: 1452 PAKDPGGKPPSQRQILAFFAGNMHGYLRPILLQHWENKDPDMKIFGRMGRGTRSKMNYIQ 1631 P +D GGK S+RQ+L F+AGNMHGYLRPILL++W++KDPDMKIFG M G SKMNYI+ Sbjct: 509 PLRDLGGKRASERQVLTFYAGNMHGYLRPILLKYWKDKDPDMKIFGPMPPGVASKMNYIE 568 Query: 1632 
HMKSSKYCICAKGFEVNSPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDI 1811 HMKSSKYC+C KG+EVNSPRVVE+I YEC+PVIISDN+VPPFFEVL+WEAF++ +AEKDI Sbjct: 569 HMKSSKYCLCPKGYEVNSPRVVEAIFYECIPVIISDNFVPPFFEVLNWEAFSLILAEKDI 628 Query: 1812 PNLKNILLSIPQKRYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIK 1982 PNLKNILLSIP ++YL M L +K++Q+HF+WH KP+KYD+FHM LHSIWYNR+FQIK Sbjct: 629 PNLKNILLSIPDEKYLQMQLAVKRVQKHFLWHPKPLKYDLFHMTLHSIWYNRLFQIK 685 >ref|XP_007225154.1| hypothetical protein PRUPE_ppa002395mg [Prunus persica] gi|462422090|gb|EMJ26353.1| hypothetical protein PRUPE_ppa002395mg [Prunus persica] Length = 678 Score = 623 bits (1606), Expect = e-175 Identities = 334/640 (52%), Positives = 431/640 (67%), Gaps = 4/640 (0%) Frame = +3 Query: 75 SVSKSAIIG---ISSDIKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGFSPES 245 S S S I+G +S+D+ NT T A + + ++D E SN Sbjct: 63 SPSNSEIVGNLSLSNDLNNTG--------TYAIHEKASNTRSSDSVLEGHEGSN------ 108 Query: 246 RSPVGASEQANLMKPANKYSPEKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQF 425 + +E + K A S +V + +EN++ E REPE ++V+++ Sbjct: 109 -RALEINEDEDDGKDA---SSGNLVKQNRTIIVENIKPLETNFAQEGGREPEVSSVEKK- 163 Query: 426 HKTENGSSLDIIRKEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTS 605 + T+N I E+ + + + +G + T PA+ + N Sbjct: 164 NTTDNTYLEGRIGNENNTVDVVNSTAG-LPVSSPAPPMMNSSPSTAPAI---FETNVGAP 219 Query: 606 LKFVDSNISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNPN-KDQWMKPPSSVMS 782 +K VDSN++SV K K E E L + +S +T P K + P V S Sbjct: 220 IKSVDSNVTSVEKDRTTPSEKTENSEQLHSDLNQTEHNSSMTRVPEVKIEPEVPILDVYS 279 Query: 783 ISEMNLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSV 962 IS+MN +LL++ AS S +WS+P D+E+ QI+NA I K D L+ L+RN+SV Sbjct: 280 ISDMNNLLLQSRASYNSMLAQWSSPADQELQYVASQIENAPII-KSDPTLYALLYRNLSV 338 Query: 963 FKRSYELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHL 1142 FKRSYELME TLKVYVY+EG +PI H P KGIYASEGWFMK +E K+FV K+P++AHL Sbjct: 339 FKRSYELMEDTLKVYVYREGERPILHSPFLKGIYASEGWFMKQLEADKKFVTKNPQKAHL 398 Query: 1143 FYLPFSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWA 1322 +YLPFSSR LE LYVPNSHSHKNL+QYLKDY+DMI+ 
K+PFWNRTGGADHFLVACHDWA Sbjct: 399 YYLPFSSRTLEERLYVPNSHSHKNLIQYLKDYVDMIAVKHPFWNRTGGADHFLVACHDWA 458 Query: 1323 PTETRHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILA 1502 P+ET+ +M+ IRALCN+D+ E FV GKDVSLPET + + K P +D GG PS+R ILA Sbjct: 459 PSETK-KYMATCIRALCNSDIKEGFVFGKDVSLPETYIKNDKNPLRDLGGNRPSKRSILA 517 Query: 1503 FFAGNMHGYLRPILLQHWENKDPDMKIFGRMGRGTRSKMNYIQHMKSSKYCICAKGFEVN 1682 FFAG+MHGYLRPILLQHWE+KDPDMKIFG++ + + NY+++M+SSKYCICAKG+EVN Sbjct: 518 FFAGSMHGYLRPILLQHWEDKDPDMKIFGKLPK-VKGNKNYVRYMQSSKYCICAKGYEVN 576 Query: 1683 SPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLM 1862 SPRVVE+I YECVPVIISDN+VPPFFEVL+WE+FAVFV EKDIPNLKNILLSIP+K+YL Sbjct: 577 SPRVVEAIFYECVPVIISDNFVPPFFEVLNWESFAVFVLEKDIPNLKNILLSIPKKKYLQ 636 Query: 1863 MHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIK 1982 M +R+KK+Q+HF+WH+KP KYDIFHMILHSIWYNR+ Q+K Sbjct: 637 MQMRVKKVQKHFLWHAKPEKYDIFHMILHSIWYNRLHQLK 676 >ref|XP_002324801.2| hypothetical protein POPTR_0018s00290g [Populus trichocarpa] gi|550317697|gb|EEF03366.2| hypothetical protein POPTR_0018s00290g [Populus trichocarpa] Length = 707 Score = 622 bits (1603), Expect = e-175 Identities = 329/643 (51%), Positives = 426/643 (66%), Gaps = 21/643 (3%) Frame = +3 Query: 114 IKNTSLTDGIVRTTSASNMEEETKHATDVKSNA-----AEQSNGFSPESRSPVGASEQA- 275 + N + ++G+ +A E H T+ +N +E S G + S E + Sbjct: 70 LSNVTQSNGL--NYAAGGQETGDNHGTETPANVNNGVVSEGSRGMNESSLVDSRGEESSL 127 Query: 276 ---------NLMKPANKYSPEKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFH 428 + + N E + L K+ I+N H S + +N N + + Sbjct: 128 DELVDTNTNSTLYVNNDVGSEGIKGLNKSLGIDN----HGRESSPEQLLDQNENSTLELN 183 Query: 429 KTENGS-SLDIIR---KEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNS 596 + NGS S++ R +E+ + T + T+ A+P + ++ Sbjct: 184 HSGNGSASIETDRSLFRENITSTSENTGTSQAGITPIAPALPPVDSPTNIAIPRNAEPST 243 Query: 597 STSLKFVDSNISSVAKQLQDKISKDEKP-ESLKRFPAHSNVSSIITSNPNKDQWMKPPSS 773 + V+SN S K + D K E L + N +S+ + K + P + Sbjct: 244 LAPVVPVESNTSKTDKDASHGLENDGKAGEQLNNSTSLQNNTSVTSVREVKKEPHTPSPA 303 Query: 774 
VMSISEMNLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRN 953 V+SISEMN + L++ +S S +PRW + D+E+L+AK QIQ A + E D L+ PL+RN Sbjct: 304 VISISEMNNLQLQSWSSPISRRPRWPSAVDQELLNAKSQIQKAPLVES-DSMLYAPLYRN 362 Query: 954 VSVFKRSYELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRR 1133 +S+FK+SYELME LKVY+YKEG +PI H+ KGIYASEGWFMK +E +K+FV K P++ Sbjct: 363 ISMFKKSYELMEDILKVYIYKEGERPILHQAPLKGIYASEGWFMKLLETNKKFVTKDPKK 422 Query: 1134 AHLFYLPFSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACH 1313 +HLFYLPFSSR LE LYVPNSHSHKNL+QYLK+Y+DMISAKYPFWNRT GADHFLVACH Sbjct: 423 SHLFYLPFSSRNLEVNLYVPNSHSHKNLIQYLKNYLDMISAKYPFWNRTRGADHFLVACH 482 Query: 1314 DWAPTETRHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQ 1493 DWAPTETR HM+N IRALCN+D FV GKD +LPET V +P+ +D GGKP S+R Sbjct: 483 DWAPTETRQ-HMANCIRALCNSDAKGGFVFGKDAALPETTVRTPQNLLRDLGGKPASKRS 541 Query: 1494 ILAFFAGNMHGYLRPILLQHWENKDPDMKIFGRMGR-GTRSKMNYIQHMKSSKYCICAKG 1670 ILAFFAG+MHGYLRPILLQHW NKDPD+K+FG++ + R KMNY Q+MKSSKYCICAKG Sbjct: 542 ILAFFAGSMHGYLRPILLQHWGNKDPDVKVFGKLPKVKGRGKMNYPQYMKSSKYCICAKG 601 Query: 1671 FEVNSPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQK 1850 FEVNSPRVVE+I YECVPVIISDN+VPPFFEVL+WE+FAVFV EKDIPNLKNILLSIP+ Sbjct: 602 FEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLNWESFAVFVLEKDIPNLKNILLSIPEN 661 Query: 1851 RYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQI 1979 +Y M +R+KK+QQHF+WH++PVKYDIFHMILHS+WYNRVFQ+ Sbjct: 662 KYREMQMRVKKVQQHFLWHARPVKYDIFHMILHSVWYNRVFQV 704 >ref|NP_197468.2| Exostosin family protein [Arabidopsis thaliana] gi|332005353|gb|AED92736.1| Exostosin family protein [Arabidopsis thaliana] gi|591401784|gb|AHL38619.1| glycosyltransferase, partial [Arabidopsis thaliana] Length = 610 Score = 619 bits (1597), Expect = e-174 Identities = 292/433 (67%), Positives = 358/433 (82%), Gaps = 3/433 (0%) Frame = +3 Query: 699 PAHSNVSSIITSNPNKDQWMK---PPSSVMSISEMNLVLLRNHASSGSEKPRWSTPRDEE 869 PA N S +++ +K + M+ PP SV +I EMN +L R+ +S + +PRWS+ RDEE Sbjct: 180 PASGNSSLLVSKKVSKKKKMRCDLPPKSVTTIDEMNRILARHRRTSRAMRPRWSSRRDEE 239 Query: 870 
ILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRSYELMERTLKVYVYKEGAKPIFHRPI 1049 IL+A+++I+NA + K ++EL+PP+FRNVS+FKRSYELMER LKVYVYKEG +PIFH PI Sbjct: 240 ILTARKEIENAPVA-KLERELYPPIFRNVSLFKRSYELMERILKVYVYKEGNRPIFHTPI 298 Query: 1050 TKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLPFSSRMLEATLYVPNSHSHKNLVQYL 1229 KG+YASEGWFMK MEG+KQ+ VK PR+AHL+Y+PFS+RMLE TLYV NSH+ NL Q+L Sbjct: 299 LKGLYASEGWFMKLMEGNKQYTVKDPRKAHLYYMPFSARMLEYTLYVRNSHNRTNLRQFL 358 Query: 1230 KDYIDMISAKYPFWNRTGGADHFLVACHDWAPTETRHAHMSNTIRALCNADVHESFVIGK 1409 K+Y + IS+KYPF+NRT GADHFLVACHDWAP ETRH HM + I+ALCNADV F IG+ Sbjct: 359 KEYTEHISSKYPFFNRTDGADHFLVACHDWAPYETRH-HMEHCIKALCNADVTAGFKIGR 417 Query: 1410 DVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAGNMHGYLRPILLQHWENKDPDMKIFG 1589 D+SLPET V + K P +D GGKPPSQR+ LAF+AG+MHGYLR ILLQHW++KDPDMKIFG Sbjct: 418 DISLPETYVRAAKNPLRDLGGKPPSQRRTLAFYAGSMHGYLRQILLQHWKDKDPDMKIFG 477 Query: 1590 RMGRGTRSKMNYIQHMKSSKYCICAKGFEVNSPRVVESILYECVPVIISDNYVPPFFEVL 1769 RM G SKMNYI+ MKSSKYCIC KG+EVNSPRVVESI YECVPVIISDN+VPPFFEVL Sbjct: 478 RMPFGVASKMNYIEQMKSSKYCICPKGYEVNSPRVVESIFYECVPVIISDNFVPPFFEVL 537 Query: 1770 DWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILH 1949 DW AF+V VAEKDIP LK+ILLSIP+ +Y+ M + ++K Q+HF+WH+KP KYD+FHM+LH Sbjct: 538 DWSAFSVIVAEKDIPRLKDILLSIPEDKYVKMQMAVRKAQRHFLWHAKPEKYDLFHMVLH 597 Query: 1950 SIWYNRVFQIKAK 1988 SIWYNRVFQ K + Sbjct: 598 SIWYNRVFQAKRR 610 >gb|EXB59796.1| putative glycosyltransferase [Morus notabilis] Length = 669 Score = 619 bits (1596), Expect = e-174 Identities = 307/514 (59%), Positives = 385/514 (74%) Frame = +3 Query: 447 SLDIIRKEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKFVDSN 626 S + IR E+ +LRL ++ G P+ P+ ++ + F ++ Sbjct: 185 STENIRTENIDLRLKKSDGG-------------LDSPFQPS-PLASSADALVNASFSTTS 230 Query: 627 ISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNPNKDQWMKPPSSVMSISEMNLVL 806 SSV++Q I+ + HS +++ T K + PP S+ + EMN +L Sbjct: 231 TSSVSEQSGLLITNN-----------HSAIAT--TPGVKKMRCNMPPKSITTFQEMNQIL 277 Query: 807 LRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRSYELM 986 +R+ A S S +PRWS+ 
RD+EIL+ K QI+NA + DQEL+ PLFRNVS+FKRSYELM Sbjct: 278 VRHRAKSRSLRPRWSSVRDKEILAMKPQIENAPLA-MNDQELYAPLFRNVSMFKRSYELM 336 Query: 987 ERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLPFSSR 1166 ERTLKVYVYK+G KPIFH+PI KG+YASEGWFMK ME ++++VVK PRRAHLFY+PFSSR Sbjct: 337 ERTLKVYVYKDGDKPIFHQPIMKGLYASEGWFMKLMERNRRYVVKDPRRAHLFYMPFSSR 396 Query: 1167 MLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTETRHAH 1346 MLE LYV NSH+ NL QYLK+Y + ++AKYP+WNRTGGADHFLVACHDWAP ETRH H Sbjct: 397 MLEHVLYVRNSHNRTNLRQYLKEYSEKLAAKYPYWNRTGGADHFLVACHDWAPYETRH-H 455 Query: 1347 MSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAGNMHG 1526 M ++ALCNADV F IG+DVS PET V S + P +D GGKPPS+R +LAF+AGN+HG Sbjct: 456 MERCMKALCNADVTSGFKIGRDVSFPETYVRSARNPLRDLGGKPPSRRHVLAFYAGNIHG 515 Query: 1527 YLRPILLQHWENKDPDMKIFGRMGRGTRSKMNYIQHMKSSKYCICAKGFEVNSPRVVESI 1706 YLRPILL++W++KDPDMKIFG M G +KMNYIQHMKSSKYCIC KG+EVNSPRVVESI Sbjct: 516 YLRPILLKYWKDKDPDMKIFGPMPPGVANKMNYIQHMKSSKYCICPKGYEVNSPRVVESI 575 Query: 1707 LYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHLRIKKL 1886 YECVPVIISDN+VPPFFEVL+WEAF++ +AEKDIP LK ILLSIP+++YL M L ++K Sbjct: 576 FYECVPVIISDNFVPPFFEVLNWEAFSIVLAEKDIPKLKEILLSIPKEKYLEMQLAVRKA 635 Query: 1887 QQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988 Q+HF+WH+KP+KYD+FHM LHSIWYNRVFQIK + Sbjct: 636 QKHFLWHAKPMKYDLFHMTLHSIWYNRVFQIKPR 669 >ref|XP_004251626.1| PREDICTED: probable glycosyltransferase At5g03795-like [Solanum lycopersicum] Length = 674 Score = 619 bits (1596), Expect = e-174 Identities = 332/659 (50%), Positives = 439/659 (66%), Gaps = 14/659 (2%) Frame = +3 Query: 54 ASSTAEDSVSKSAIIGISSDIKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGF 233 + S + S S + SS +++T + +G T S+ + H N+ +G Sbjct: 48 SESNTQLSEKVSLLSKESSVVESTKVGEGFSGTLSSFDDVHMLAHRLKTVDNSDVSEDGE 107 Query: 234 SPESRSPVGASEQANLMKPANKYSPEKVVDLGKNF----RIEN------VEDPHNGSVSE 383 ES + + + +KP + +S K ++ +F IEN + D + + Sbjct: 108 IDESVN------EKDEVKPHSNHSVVKTMENDSDFVEDATIENDNLFDEMVDMDEETTMQ 161 Query: 384 
KTREPENANVQEQFHKTENGSSLDIIRKEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTD 563 K E + EQ KT + S D + N L+ T++ ++ Sbjct: 162 KNNESKWDLSIEQVVKTTDELSADSDLDANRNTVLNDTKAANVTNSSSVEASNHLDNLPL 221 Query: 564 PAMP----IKLDGNSSTSLKFVDSNISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIIT 731 A+ I+ GN+S++ N++ + P + N S +++ Sbjct: 222 VAIGEINFIRTTGNNSST-----GNLTQL-------------------LPNNGNHSLVLS 257 Query: 732 SNPNKDQWMKPPSSVMSISEMNLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAIT 911 + K + M PP +V +IS+M +L+R+ A S + +PRWS+ RD+EIL+A+ QI+NA + Sbjct: 258 TVKKKMRCMLPPKTVTTISQMERLLVRHRARSRAMRPRWSSERDKEILAARLQIENAPLI 317 Query: 912 EKGDQELFPPLFRNVSVFKRSYELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKH 1091 + D+E++ P FRN+S+FKRSYELMER L+VYVYKEG KPIFH+PI KG+YASEGWFMK Sbjct: 318 -RNDREIYAPAFRNMSMFKRSYELMERILRVYVYKEGEKPIFHQPIMKGLYASEGWFMKL 376 Query: 1092 MEGHKQFVVKSPRRAHLFYLPFSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFW 1271 MEG+ +FVVK PR+AHLFYLPFSSRMLE +LYV NSH+ NL QYLKDY + I+AKY FW Sbjct: 377 MEGNNKFVVKDPRKAHLFYLPFSSRMLEHSLYVRNSHNRTNLRQYLKDYSEKIAAKYRFW 436 Query: 1272 NRTGGADHFLVACHDWAPTETRHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKE 1451 NRTGGADHFLVACHDWAP ETRH HM + I+ALCNADV F IG+DVSL ET V S + Sbjct: 437 NRTGGADHFLVACHDWAPYETRH-HMEHCIKALCNADVTLGFKIGRDVSLAETYVRSARN 495 Query: 1452 PAKDPGGKPPSQRQILAFFAGNMHGYLRPILLQHWENKDPDMKIFGRMGRGTRSKMNYIQ 1631 P +D GGKP SQR++LAF+AGNMHGYLRPILL+HW++KDPDM+IFG M G SKMNYIQ Sbjct: 496 PLRDLGGKPASQRKVLAFYAGNMHGYLRPILLEHWKDKDPDMEIFGPMPSGVASKMNYIQ 555 Query: 1632 HMKSSKYCICAKGFEVNSPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDI 1811 HMKSSK+CIC KG+EVNSPRVVE+I YECVPVIISDN+VPPFF VL+W+ F++ +AEKDI Sbjct: 556 HMKSSKFCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFGVLNWDTFSLILAEKDI 615 Query: 1812 PNLKNILLSIPQKRYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988 PNLK+ILLSIP+K+YL M L I+K+Q+HF+WH+KPVKYD+FHM LHSIWYNRVFQ KA+ Sbjct: 616 PNLKSILLSIPEKKYLDMQLAIRKVQRHFLWHAKPVKYDLFHMTLHSIWYNRVFQTKAR 674 >ref|XP_002324438.2| hypothetical protein POPTR_0018s09250g [Populus trichocarpa] gi|550318376|gb|EEF03003.2| hypothetical protein POPTR_0018s09250g 
[Populus trichocarpa] Length = 682 Score = 617 bits (1590), Expect = e-174 Identities = 326/630 (51%), Positives = 414/630 (65%), Gaps = 3/630 (0%) Frame = +3 Query: 108 SDIKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGFSPESRSPVGASEQANLMK 287 S + N DG++ SN E H K N + + FS SE+ ++ Sbjct: 81 SSLNNYFKFDGVLENADDSNGGVEEGHDDGTKKNTEDTDHDFS---------SEEGDMEV 131 Query: 288 PANKYSPEKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFHKTENGSSLDIIRK 467 + E DL +F E+V+D H S + E+ V + ++ L+ K Sbjct: 132 LDDVIQLEVDRDLEDDFPSEDVKDRHETFASGGVKTEESNPVLKLANEARFNLPLERNVK 191 Query: 468 EDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDG-NSSTSLKFVDSNISSVAK 644 D ++ D + + +P+ SST ++ SN SS Sbjct: 192 SDHDIPTDNVLQQN------KSQAHKEFEHVNSTLPVDSQAVASSTKATYLKSNGSSSIG 245 Query: 645 QLQDKISKDEKPESLKRFPAHSNVSSIITSNPNKDQWM--KPPSSVMSISEMNLVLLRNH 818 P +LK A + S++ + P K + PP SV I EMN +L+R+ Sbjct: 246 -----------PAALKSDSAAAKNYSVVLAKPGKKKMRCEMPPKSVTLIDEMNSILVRHR 294 Query: 819 ASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRSYELMERTL 998 SS S +PRWS+ RD+EIL+A+ QI++A D++L+ PLFRNVS FKRSYELMERTL Sbjct: 295 RSSRSMRPRWSSARDQEILAARSQIESAPAVVH-DRDLYAPLFRNVSKFKRSYELMERTL 353 Query: 999 KVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLPFSSRMLEA 1178 K+Y+YK+G KPIFH PI KG+YASEGWFMK M+G+K FVVK PR+AHLFY+PFSSRMLE Sbjct: 354 KIYIYKDGKKPIFHLPILKGLYASEGWFMKLMQGNKHFVVKDPRKAHLFYMPFSSRMLEY 413 Query: 1179 TLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTETRHAHMSNT 1358 TLYV NSH+ NL Y+K Y + I+AKY FWNRTGGADHFLVACHDWAP ETRH HM + Sbjct: 414 TLYVRNSHNRTNLRLYMKRYAESIAAKYSFWNRTGGADHFLVACHDWAPYETRH-HMEHC 472 Query: 1359 IRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAGNMHGYLRP 1538 I+ALCNADV F IG+DVS PET V S + P +D GGKPPSQR ILAF+AGNMHGYLRP Sbjct: 473 IKALCNADVTAGFKIGRDVSFPETYVRSARNPLRDLGGKPPSQRNILAFYAGNMHGYLRP 532 Query: 1539 ILLQHWENKDPDMKIFGRMGRGTRSKMNYIQHMKSSKYCICAKGFEVNSPRVVESILYEC 1718 ILL++W++KDPDMKIFG M G SKMNYI HM+ SKYCIC KG+EVNSPRVVE+I YEC Sbjct: 533 ILLKYWKDKDPDMKIFGPMPPGVASKMNYIHHMQRSKYCICPKGYEVNSPRVVEAIFYEC 592 Query: 1719 
VPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHLRIKKLQQHF 1898 VPVIISDN+VPPFF+VLDW AF++ +AEKDI NLK ILLSIP+++YL M L ++K Q+HF Sbjct: 593 VPVIISDNFVPPFFDVLDWGAFSLILAEKDISNLKEILLSIPKEKYLQMQLGVRKAQRHF 652 Query: 1899 MWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988 +WH+ P+KYD+F+M LHSIWYNRV+QIK + Sbjct: 653 LWHASPMKYDLFYMTLHSIWYNRVYQIKPR 682 >ref|XP_006287301.1| hypothetical protein CARUB_v10000494mg [Capsella rubella] gi|482556007|gb|EOA20199.1| hypothetical protein CARUB_v10000494mg [Capsella rubella] Length = 613 Score = 614 bits (1583), Expect = e-173 Identities = 298/481 (61%), Positives = 371/481 (77%), Gaps = 8/481 (1%) Frame = +3 Query: 570 MPIKLDGNSSTSLKFVDSNISSVAKQLQDKISKDEKPESLK-----RFPAHSNVSSIITS 734 M +K STS SV Q + K S SL + P N S +++ Sbjct: 135 MNVKQSAEMSTSKYGYQVQDVSVESQKKVKTSMLSASSSLAASSVGKLPVSGNSSLLVSK 194 Query: 735 NPNKDQWMK---PPSSVMSISEMNLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAA 905 +K + M+ PP +V +I EMN +L R+ +S + +PRWS+ RDEEIL+A+++I+NA Sbjct: 195 QVSKKKKMRCNLPPKTVTTIEEMNRILARHRRTSRAMRPRWSSRRDEEILAARKEIENAP 254 Query: 906 ITEKGDQELFPPLFRNVSVFKRSYELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFM 1085 + K ++EL+PP++RNVS+FKRSYELMERTLKVYVYKEG +PIFH PI KG+YASEGWFM Sbjct: 255 VA-KLERELYPPIYRNVSMFKRSYELMERTLKVYVYKEGNRPIFHTPILKGLYASEGWFM 313 Query: 1086 KHMEGHKQFVVKSPRRAHLFYLPFSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYP 1265 K ME KQ+ VK PRRAHL+Y+PFS+RMLE TLYV NSH+ NL Q+LK+Y + IS+KYP Sbjct: 314 KLMEESKQYTVKDPRRAHLYYMPFSARMLEFTLYVRNSHNRTNLRQFLKEYTEHISSKYP 373 Query: 1266 FWNRTGGADHFLVACHDWAPTETRHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISP 1445 F+NRT GADHFLVACHDWAP ETRH HM + I+ALCNADV F IG+D+SLPET V + Sbjct: 374 FFNRTDGADHFLVACHDWAPYETRH-HMEHCIKALCNADVTAGFKIGRDISLPETYVRAA 432 Query: 1446 KEPAKDPGGKPPSQRQILAFFAGNMHGYLRPILLQHWENKDPDMKIFGRMGRGTRSKMNY 1625 K P +D GGKPPSQR+ LAF+AG+MHGYLR ILLQHW++KDP+MKIFGRM G SKMNY Sbjct: 433 KNPQRDLGGKPPSQRRTLAFYAGSMHGYLRAILLQHWKDKDPEMKIFGRMPLGVASKMNY 492 Query: 1626 IQHMKSSKYCICAKGFEVNSPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEK 1805 I+ MKSSKYCIC KG+EVNSPRVVESI 
YECVPVIISDN+VPPFFEVLDW AF+V +AEK Sbjct: 493 IEQMKSSKYCICPKGYEVNSPRVVESIFYECVPVIISDNFVPPFFEVLDWSAFSVIIAEK 552 Query: 1806 DIPNLKNILLSIPQKRYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKA 1985 DIP LK+IL SIP+++Y+ M + ++K Q+HF+WH+KP +YD+FHM+LHSIWYNRVFQ K Sbjct: 553 DIPRLKDILSSIPEEKYVKMQMAVRKAQRHFLWHAKPQRYDLFHMVLHSIWYNRVFQAKR 612 Query: 1986 K 1988 + Sbjct: 613 R 613