BLASTX nr result
ID: Mentha28_contig00034281
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00034281
         (801 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EYU22083.1| hypothetical protein MIMGU_mgv1a026874mg, partial...  363   4e-98
ref|XP_006357028.1| PREDICTED: exostosin-1c-like [Solanum tubero...  286   5e-75
ref|XP_004244466.1| PREDICTED: exostosin-1b-like [Solanum lycope...  285   1e-74
ref|XP_002523753.1| transferase, transferring glycosyl groups, p...  258   2e-66
ref|XP_007222990.1| hypothetical protein PRUPE_ppa008992mg [Prun...  256   7e-66
ref|NP_565236.1| glycosyltransferase family protein 64 [Arabidop...  253   6e-65
ref|NP_001077854.1| glycosyltransferase family protein 64 [Arabi...  253   6e-65
gb|AAM61012.1| unknown [Arabidopsis thaliana]                        253   6e-65
ref|XP_004291290.1| PREDICTED: exostosin-like 3-like isoform 1 [...  251   3e-64
ref|XP_006404811.1| hypothetical protein EUTSA_v10000547mg, part...  250   4e-64
emb|CAN68561.1| hypothetical protein VITISV_033101 [Vitis vinifera]  249   8e-64
gb|EXC32770.1| Exostosin-like 3 [Morus notabilis]                    248   2e-63
ref|XP_007011523.1| Nucleotide-diphospho-sugar transferases supe...  247   4e-63
ref|XP_002280907.1| PREDICTED: exostosin-1-like [Vitis vinifera]     247   4e-63
ref|XP_006850947.1| hypothetical protein AMTR_s00025p00194100 [A...  245   2e-62
ref|XP_002889300.1| glycosyltransferase family protein 47 [Arabi...  245   2e-62
ref|XP_007159619.1| hypothetical protein PHAVU_002G252600g [Phas...  244   3e-62
ref|XP_006299931.1| hypothetical protein CARUB_v10016140mg, part...  243   8e-62
ref|XP_006483629.1| PREDICTED: exostosin-like 3-like [Citrus sin... 
242 1e-61 ref|XP_006450102.1| hypothetical protein CICLE_v10010680mg [Citr... 242 1e-61 >gb|EYU22083.1| hypothetical protein MIMGU_mgv1a026874mg, partial [Mimulus guttatus] Length = 305 Score = 363 bits (932), Expect = 4e-98 Identities = 172/221 (77%), Positives = 196/221 (88%) Frame = +3 Query: 3 AVLRAPSSSLNHRFYPWEAIGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFARS 182 AVLRAPSSSLNHRF+PWE I TR VLICDDD+EPDP+SV+FAF+VWRSDPDRAVGFFARS Sbjct: 84 AVLRAPSSSLNHRFHPWEIIKTRGVLICDDDVEPDPNSVSFAFAVWRSDPDRAVGFFARS 143 Query: 183 HAYDVASRAWIYVMERAKYSIVLTKFMIVNVEYLERYSCDERYAEGRKIVDEMNNCEDIL 362 HAYDVAS+ WIY MER KYSI+LTKFMIV++++LE+YSCDE+YA+ RKIVDEMNNCEDIL Sbjct: 144 HAYDVASKTWIYTMERGKYSIILTKFMIVSIKHLEKYSCDEQYADARKIVDEMNNCEDIL 203 Query: 363 MNFVVAEESQKXXXXXXXXXXXXXRDYGDARNDGVSAEMREVGLSSRRGEHRKRRGDCIT 542 MNFVVAEE++K RDYGDARNDGVS ++EVGLSSRRGEHRKRRG+CI Sbjct: 204 MNFVVAEENRK-GPVLVGAKDGGVRDYGDARNDGVSEGVKEVGLSSRRGEHRKRRGNCIR 262 Query: 543 QFHRVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCDHQD 665 +FHRVLG+MPLKYSYGK+VDDVGEQGLC+ GKLVLCD+QD Sbjct: 263 EFHRVLGKMPLKYSYGKLVDDVGEQGLCEMGGKLVLCDNQD 303 >ref|XP_006357028.1| PREDICTED: exostosin-1c-like [Solanum tuberosum] Length = 333 Score = 286 bits (733), Expect = 5e-75 Identities = 134/217 (61%), Positives = 171/217 (78%) Frame = +3 Query: 12 RAPSSSLNHRFYPWEAIGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFARSHAY 191 R PSSSLN RF+P +I TR+VLICDDDIEPD +S+ FAF++W+S+PDR +GFF RSH Y Sbjct: 117 RQPSSSLNLRFHPHTSITTRSVLICDDDIEPDTNSINFAFNIWKSNPDRLIGFFVRSHNY 176 Query: 192 DVASRAWIYVMERAKYSIVLTKFMIVNVEYLERYSCDERYAEGRKIVDEMNNCEDILMNF 371 D+ ++WIY ME KYSI+LTKFMI+N YL +Y+C++ Y++ R IVDE NNCEDILMNF Sbjct: 177 DLTHKSWIYTMETQKYSIMLTKFMILNFHYLHQYTCNKEYSKLRLIVDEKNNCEDILMNF 236 Query: 372 VVAEESQKXXXXXXXXXXXXXRDYGDARNDGVSAEMREVGLSSRRGEHRKRRGDCITQFH 551 V+AE+ +K RD+GDARN+G + REVGLSSR+GEHRKRRG+CI +FH Sbjct: 237 VIAEQVKK---GPMLVGAKKVRDWGDARNEGEGMKEREVGLSSRKGEHRKRRGECIGEFH 293 Query: 552 RVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCDHQ 662 R+L +MPLKYSYGK+++ +GEQGLC+K GKLV CD Q Sbjct: 294 
RLLEKMPLKYSYGKVMEAIGEQGLCEKGGKLVYCDKQ 330 >ref|XP_004244466.1| PREDICTED: exostosin-1b-like [Solanum lycopersicum] Length = 332 Score = 285 bits (730), Expect = 1e-74 Identities = 133/217 (61%), Positives = 171/217 (78%) Frame = +3 Query: 12 RAPSSSLNHRFYPWEAIGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFARSHAY 191 R PSSSLN RF+P +I TR+VLICDDDIEPD +S+ FAF++W+S+PDR +GFF RSH Y Sbjct: 117 RQPSSSLNLRFHPHTSITTRSVLICDDDIEPDTNSINFAFNIWKSNPDRLIGFFVRSHNY 176 Query: 192 DVASRAWIYVMERAKYSIVLTKFMIVNVEYLERYSCDERYAEGRKIVDEMNNCEDILMNF 371 D+ ++WIY ME KYSI+LTKFMI+N YL +Y+C++ Y++ + IVDE NNCEDILMNF Sbjct: 177 DITHKSWIYTMETQKYSIMLTKFMILNFNYLYQYTCNKEYSKLKLIVDEKNNCEDILMNF 236 Query: 372 VVAEESQKXXXXXXXXXXXXXRDYGDARNDGVSAEMREVGLSSRRGEHRKRRGDCITQFH 551 V+AE+ +K RD+GDARN+G + REVGLSSR+GEHRKRRG+CI +FH Sbjct: 237 VIAEQVKK---GPIMVGAKKVRDWGDARNEGEGMKEREVGLSSRKGEHRKRRGECIGEFH 293 Query: 552 RVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCDHQ 662 R+L +MPLKYSYGK+++ +GEQGLC+K GKLV CD Q Sbjct: 294 RLLEKMPLKYSYGKVMEAIGEQGLCEKGGKLVYCDKQ 330 >ref|XP_002523753.1| transferase, transferring glycosyl groups, putative [Ricinus communis] gi|223537057|gb|EEF38693.1| transferase, transferring glycosyl groups, putative [Ricinus communis] Length = 349 Score = 258 bits (658), Expect = 2e-66 Identities = 133/233 (57%), Positives = 162/233 (69%), Gaps = 13/233 (5%) Frame = +3 Query: 3 AVLRAPSSSLNHRFYPWEAIGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFARS 182 +++R PSSSLN RF P +IGT+AVLICDDD+E DP S FAF +WR +PDR +GFF RS Sbjct: 120 SLIRQPSSSLNDRFLPRSSIGTQAVLICDDDVEVDPKSFQFAFRIWRLNPDRLIGFFVRS 179 Query: 183 HAYDVASRAWIYVMERAKYSIVLTKFMIVNVEYLERYSC--DERYAEGRKIVDEMNNCED 356 H D+ +R WIY + KYSIVLTKFMI+ +YL YSC E RKIVD M NCED Sbjct: 180 HDLDLLARKWIYTVHPDKYSIVLTKFMILKSQYLFEYSCKGGPNMGEMRKIVDRMQNCED 239 Query: 357 ILMNFVVAEESQKXXXXXXXXXXXXXRDYGDARN----------DGVSAEMREVGLSSRR 506 ILMNFVVA+ K RD+GDARN D ++++R VGLSSR Sbjct: 240 ILMNFVVAD---KANIGPILVGAEKVRDWGDARNEDNDVQFGLKDMEASKVRAVGLSSRV 296 Query: 507 GEHRKRRGDCITQFHRVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCD-HQ 662 
GEHRKRRG+CI +FH++LGRMPL+YSYGK+++ VGEQGLC K GKL+ CD HQ Sbjct: 297 GEHRKRRGECIREFHKLLGRMPLRYSYGKVINSVGEQGLCMKGGKLIFCDRHQ 349 >ref|XP_007222990.1| hypothetical protein PRUPE_ppa008992mg [Prunus persica] gi|462419926|gb|EMJ24189.1| hypothetical protein PRUPE_ppa008992mg [Prunus persica] Length = 311 Score = 256 bits (654), Expect = 7e-66 Identities = 132/230 (57%), Positives = 161/230 (70%), Gaps = 12/230 (5%) Frame = +3 Query: 3 AVLRAPSSSLNHRFYPWEAIGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFARS 182 +V+R S SLN+RF P +I TRAVLICDDD+E DP S FAF +W S+PDR VGFF RS Sbjct: 83 SVIRQTSDSLNNRFLPRPSIKTRAVLICDDDVEVDPKSFEFAFKMWGSNPDRLVGFFVRS 142 Query: 183 HAYDVASRAWIYVMERAKYSIVLTKFMIVNVEYLERYSC--DERYAEGRKIVDEMNNCED 356 H D++ + WIY + KYSI+LTKFM++ EYL RYSC A R+IVD+MNNCED Sbjct: 143 HDIDLSKKEWIYTIHPDKYSIMLTKFMLLKSEYLFRYSCAGGPVMAHMRRIVDKMNNCED 202 Query: 357 ILMNFVVAEESQKXXXXXXXXXXXXXRDYGDARND----------GVSAEMREVGLSSRR 506 ILMNFVVA+E RD+GDARND + E+ +VGLSSR+ Sbjct: 203 ILMNFVVADEVNS---GPILVGAERVRDWGDARNDHDDDDGNGRHRLIGEVAQVGLSSRK 259 Query: 507 GEHRKRRGDCITQFHRVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCD 656 G+HRKRRG+CI +FHRVLGRMPL++SYGK+V+ VGEQGLC K GKLV CD Sbjct: 260 GKHRKRRGECIGEFHRVLGRMPLRFSYGKVVNSVGEQGLCQKGGKLVFCD 309 >ref|NP_565236.1| glycosyltransferase family protein 64 [Arabidopsis thaliana] gi|12324977|gb|AAG52433.1|AC018848_4 hypothetical protein; 16105-17094 [Arabidopsis thaliana] gi|89000939|gb|ABD59059.1| At1g80290 [Arabidopsis thaliana] gi|222424350|dbj|BAH20131.1| AT1G80290 [Arabidopsis thaliana] gi|332198262|gb|AEE36383.1| glycosyltransferase family protein 64 [Arabidopsis thaliana] Length = 329 Score = 253 bits (646), Expect = 6e-65 Identities = 127/220 (57%), Positives = 157/220 (71%), Gaps = 2/220 (0%) Frame = +3 Query: 3 AVLRAPSSSLNHRFYPWEAIGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFARS 182 ++++ SSSLN RF P ++ TRAVLICDDD+E D S+ FAFSVW+S+PDR VG F RS Sbjct: 111 SLIQQSSSSLNARFLPRSSVDTRAVLICDDDVEIDQRSLEFAFSVWKSNPDRLVGTFVRS 170 Query: 183 
HAYDVASRAWIYVMERAKYSIVLTKFMIVNVEYLERYSC--DERYAEGRKIVDEMNNCED 356 H +D+ + WIY + KYSIVLTKFM++ +YL YSC E R IVD+M NCED Sbjct: 171 HGFDLQGKEWIYTVHPDKYSIVLTKFMMMKQDYLFEYSCKGGVEMEEMRMIVDQMRNCED 230 Query: 357 ILMNFVVAEESQKXXXXXXXXXXXXXRDYGDARNDGVSAEMREVGLSSRRGEHRKRRGDC 536 ILMNFV A+ + RD+GDARN+ V +R+VGLSSRR EHRKRRG+C Sbjct: 231 ILMNFVAAD---RLRAGPIMVGAERVRDWGDARNEEVEERVRDVGLSSRRVEHRKRRGNC 287 Query: 537 ITQFHRVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCD 656 I +FHRV+G+MPL YSYGK+V+ VGEQGLC KAGKLV CD Sbjct: 288 IREFHRVMGKMPLMYSYGKVVNSVGEQGLCRKAGKLVFCD 327 >ref|NP_001077854.1| glycosyltransferase family protein 64 [Arabidopsis thaliana] gi|332198263|gb|AEE36384.1| glycosyltransferase family protein 64 [Arabidopsis thaliana] Length = 337 Score = 253 bits (646), Expect = 6e-65 Identities = 127/220 (57%), Positives = 157/220 (71%), Gaps = 2/220 (0%) Frame = +3 Query: 3 AVLRAPSSSLNHRFYPWEAIGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFARS 182 ++++ SSSLN RF P ++ TRAVLICDDD+E D S+ FAFSVW+S+PDR VG F RS Sbjct: 119 SLIQQSSSSLNARFLPRSSVDTRAVLICDDDVEIDQRSLEFAFSVWKSNPDRLVGTFVRS 178 Query: 183 HAYDVASRAWIYVMERAKYSIVLTKFMIVNVEYLERYSC--DERYAEGRKIVDEMNNCED 356 H +D+ + WIY + KYSIVLTKFM++ +YL YSC E R IVD+M NCED Sbjct: 179 HGFDLQGKEWIYTVHPDKYSIVLTKFMMMKQDYLFEYSCKGGVEMEEMRMIVDQMRNCED 238 Query: 357 ILMNFVVAEESQKXXXXXXXXXXXXXRDYGDARNDGVSAEMREVGLSSRRGEHRKRRGDC 536 ILMNFV A+ + RD+GDARN+ V +R+VGLSSRR EHRKRRG+C Sbjct: 239 ILMNFVAAD---RLRAGPIMVGAERVRDWGDARNEEVEERVRDVGLSSRRVEHRKRRGNC 295 Query: 537 ITQFHRVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCD 656 I +FHRV+G+MPL YSYGK+V+ VGEQGLC KAGKLV CD Sbjct: 296 IREFHRVMGKMPLMYSYGKVVNSVGEQGLCRKAGKLVFCD 335 >gb|AAM61012.1| unknown [Arabidopsis thaliana] Length = 329 Score = 253 bits (646), Expect = 6e-65 Identities = 127/220 (57%), Positives = 157/220 (71%), Gaps = 2/220 (0%) Frame = +3 Query: 3 AVLRAPSSSLNHRFYPWEAIGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFARS 182 ++++ SSSLN RF P ++ TRAVLICDDD+E D S+ FAFSVW+S+PDR VG F RS Sbjct: 111 SLIQQSSSSLNARFLPRSSVDTRAVLICDDDVEIDQRSLEFAFSVWKSNPDRLVGTFVRS 170 
Query: 183 HAYDVASRAWIYVMERAKYSIVLTKFMIVNVEYLERYSC--DERYAEGRKIVDEMNNCED 356 H +D+ + WIY + KYSIVLTKFM++ +YL YSC E R IVD+M NCED Sbjct: 171 HGFDLQGKEWIYTVHPDKYSIVLTKFMMMKQDYLFEYSCKGGVEMEEMRMIVDQMRNCED 230 Query: 357 ILMNFVVAEESQKXXXXXXXXXXXXXRDYGDARNDGVSAEMREVGLSSRRGEHRKRRGDC 536 ILMNFV A+ + RD+GDARN+ V +R+VGLSSRR EHRKRRG+C Sbjct: 231 ILMNFVAAD---RLRAGPIMVGAERVRDWGDARNEEVEERVRDVGLSSRRVEHRKRRGNC 287 Query: 537 ITQFHRVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCD 656 I +FHRV+G+MPL YSYGK+V+ VGEQGLC KAGKLV CD Sbjct: 288 IREFHRVMGKMPLMYSYGKVVNSVGEQGLCRKAGKLVFCD 327 >ref|XP_004291290.1| PREDICTED: exostosin-like 3-like isoform 1 [Fragaria vesca subsp. vesca] gi|470110032|ref|XP_004291291.1| PREDICTED: exostosin-like 3-like isoform 2 [Fragaria vesca subsp. vesca] Length = 342 Score = 251 bits (640), Expect = 3e-64 Identities = 128/224 (57%), Positives = 158/224 (70%), Gaps = 12/224 (5%) Frame = +3 Query: 21 SSSLNHRFYPWEAIGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFARSHAYDVA 200 S+SLN+RF P I TRAVL+CDDD+E DP S FAF +W ++PDR +GFF RSH D++ Sbjct: 120 SNSLNNRFLPRPLIKTRAVLVCDDDVEVDPKSFEFAFRMWGANPDRLIGFFVRSHDIDLS 179 Query: 201 SRAWIYVMERAKYSIVLTKFMIVNVEYLERYSC--DERYAEGRKIVDEMNNCEDILMNFV 374 + WIY + KYSI+LTKFMI +YL +YSC A RKIVD+M+NCEDILMNFV Sbjct: 180 KKEWIYTVHPDKYSIMLTKFMIFKSQYLFQYSCAGGTVMANMRKIVDKMHNCEDILMNFV 239 Query: 375 VAEESQKXXXXXXXXXXXXXRDYGDARND----------GVSAEMREVGLSSRRGEHRKR 524 +A+E RD+GDARND G++ E+ +VGLSSR+ +HRKR Sbjct: 240 IADE---VNAGPILVGAERVRDWGDARNDHDNRDGEGRRGLNGEVAQVGLSSRKEKHRKR 296 Query: 525 RGDCITQFHRVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCD 656 RG+CI QFHRVLGRMPL+YSYGK+V+ VGEQGLC K GKLVLCD Sbjct: 297 RGECIGQFHRVLGRMPLRYSYGKVVNSVGEQGLCQKGGKLVLCD 340 >ref|XP_006404811.1| hypothetical protein EUTSA_v10000547mg, partial [Eutrema salsugineum] gi|557105939|gb|ESQ46264.1| hypothetical protein EUTSA_v10000547mg, partial [Eutrema salsugineum] Length = 302 Score = 250 bits (639), Expect = 4e-64 Identities = 127/222 (57%), Positives = 158/222 (71%), Gaps = 4/222 (1%) Frame = +3 Query: 3 
AVLRAPSSSLNHRFYPWEAIGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFARS 182 ++++ PSSSLN RF P ++ TRAVL+CDDD+E D S+ FAFSVW+S+P+R VG F RS Sbjct: 82 SLVQQPSSSLNSRFLPRPSVETRAVLVCDDDVEIDRKSLEFAFSVWKSNPERLVGLFVRS 141 Query: 183 HAYDVASRAWIYVMERAKYSIVLTKFMIVNVEYLERYSCD--ERYAEGRKIVDEMNNCED 356 H +D+ + WIY + KYSI+LTKFM++ +YL YSC E RKIVD+M NCED Sbjct: 142 HGFDLQEKDWIYTVHPDKYSILLTKFMMMKQDYLFEYSCSGGVEMEEMRKIVDQMRNCED 201 Query: 357 ILMNFVVAEESQKXXXXXXXXXXXXXRDYGDARNDGVSAE--MREVGLSSRRGEHRKRRG 530 ILMNFV A+ K RD+GDARN+ E +REVGLSSRR EHRKRRG Sbjct: 202 ILMNFVAAD---KLKAGPIMVGAERVRDWGDARNEEEELENGVREVGLSSRRVEHRKRRG 258 Query: 531 DCITQFHRVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCD 656 CI +FHRV+G+MPL YSYGK+V+ VGEQGLC K+GKLV CD Sbjct: 259 KCIREFHRVMGKMPLMYSYGKVVNSVGEQGLCRKSGKLVFCD 300 >emb|CAN68561.1| hypothetical protein VITISV_033101 [Vitis vinifera] Length = 328 Score = 249 bits (636), Expect = 8e-64 Identities = 125/221 (56%), Positives = 152/221 (68%), Gaps = 3/221 (1%) Frame = +3 Query: 3 AVLRAPSSSLNHRFYPWEAIGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFARS 182 +++R S SLN RF P I TR V+ICDDD+E DP S+ FAF VW ++P R +G FAR+ Sbjct: 108 SLVRQASDSLNARFLPRPFIXTRGVIICDDDVEVDPKSIEFAFRVWAANPHRLIGLFARA 167 Query: 183 HAYDVASRAWIYVMERAKYSIVLTKFMIVNVEYLERYSCD--ERYAEGRKIVDEMNNCED 356 H D++ R WIY + KYSIVLTKFM++ EYL +YSC+ R E R+ VD NCED Sbjct: 168 HDLDLSRREWIYTVHPDKYSIVLTKFMVLKTEYLYKYSCEGGARMMEARRAVDMAQNCED 227 Query: 357 ILMNFVVAEESQKXXXXXXXXXXXXXRDYGDARND-GVSAEMREVGLSSRRGEHRKRRGD 533 ILMNFVVAEE RD+GD RND G+ + REVGLSSRRGEHRKRRG Sbjct: 228 ILMNFVVAEEGN---AGPMLVGAEKVRDWGDGRNDEGLGLKEREVGLSSRRGEHRKRRGG 284 Query: 534 CITQFHRVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCD 656 CI FHR+LGRMPL+Y YGK+V+ + E+GLC K GKLV CD Sbjct: 285 CIGDFHRILGRMPLRYGYGKVVNSIAEEGLCMKDGKLVFCD 325 >gb|EXC32770.1| Exostosin-like 3 [Morus notabilis] Length = 340 Score = 248 bits (632), Expect = 2e-63 Identities = 129/229 (56%), Positives = 154/229 (67%), Gaps = 11/229 (4%) Frame = +3 Query: 3 AVLRAPSSSLNHRFYPWEAIGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFARS 182 +++R S SLN RF P 
+I T AVLICDDD+E D S AFAF VW S+PDR +GFFARS Sbjct: 113 SLIRQSSPSLNARFLPRPSIATVAVLICDDDVEIDAKSFAFAFRVWESNPDRLIGFFARS 172 Query: 183 HAYDVASRAWIYVMERAKYSIVLTKFMIVNVEYLERYSC--DERYAEGRKIVDEMNNCED 356 H D+ + WIY + KYSIVLTKFMI+ YL YSC A R +VD NCED Sbjct: 173 HDIDLTRKKWIYTIHPDKYSIVLTKFMILKNRYLFEYSCGGGSAMARARSVVDRARNCED 232 Query: 357 ILMNFVVAEESQKXXXXXXXXXXXXXRDYGDARND---------GVSAEMREVGLSSRRG 509 ILMNFVVAEE+ RD+GDARN+ G+S + +VGLSSRR Sbjct: 233 ILMNFVVAEET---GAGPVLVGANWARDWGDARNEDIGDGDGRRGLSGTVAQVGLSSRRA 289 Query: 510 EHRKRRGDCITQFHRVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCD 656 EHRKRRG+CI++FHRVLG+M L+YSYGK+V+ VGEQGLC K GKLV CD Sbjct: 290 EHRKRRGECISEFHRVLGKMALRYSYGKVVNSVGEQGLCQKGGKLVFCD 338 >ref|XP_007011523.1| Nucleotide-diphospho-sugar transferases superfamily protein, putative [Theobroma cacao] gi|508781886|gb|EOY29142.1| Nucleotide-diphospho-sugar transferases superfamily protein, putative [Theobroma cacao] Length = 339 Score = 247 bits (630), Expect = 4e-63 Identities = 125/221 (56%), Positives = 151/221 (68%), Gaps = 8/221 (3%) Frame = +3 Query: 21 SSSLNHRFYPWEAIGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFARSHAYDVA 200 SSSLN RF P +I TRAVL+CDDD+E DP +V FAF +WR +P+R +G F RSH D+ Sbjct: 122 SSSLNARFLPRSSIRTRAVLVCDDDVEVDPKTVEFAFRMWRWNPERLIGIFVRSHDIDMT 181 Query: 201 SRAWIYVMERAKYSIVLTKFMIVNVEYLERYSCD--ERYAEGRKIVDEMNNCEDILMNFV 374 + WIY + KYS+VLTKFM++ EYL +YSC+ E +++VDEM NCEDILMNFV Sbjct: 182 RKEWIYTVHPDKYSVVLTKFMMMKTEYLFKYSCEGGAPMREMKRMVDEMRNCEDILMNFV 241 Query: 375 VAEESQKXXXXXXXXXXXXXRDYGDARNDGVSAE------MREVGLSSRRGEHRKRRGDC 536 VAEE+ RD+GD RN+G + MREVGLSSRR EHRKRRG C Sbjct: 242 VAEETN---AGPLMVEAARARDWGDPRNEGEDGDGGGIRVMREVGLSSRRAEHRKRRGHC 298 Query: 537 ITQFHRVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCDH 659 I +FHRVLGRMPL+YSY K+V V EQGLC K LV CDH Sbjct: 299 INEFHRVLGRMPLRYSYAKLVSSVAEQGLCRKGANLVPCDH 339 >ref|XP_002280907.1| PREDICTED: exostosin-1-like [Vitis vinifera] Length = 328 Score = 247 bits (630), Expect = 4e-63 Identities = 125/221 (56%), Positives = 151/221 (68%), Gaps = 3/221 (1%) 
Frame = +3 Query: 3 AVLRAPSSSLNHRFYPWEAIGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFARS 182 +++R S SLN RF P I TR V+ICDDD+E DP S+ FAF VW ++P R +G FAR+ Sbjct: 108 SLVRQASDSLNARFLPRPFITTRGVIICDDDVEVDPKSIEFAFRVWAANPHRLIGLFARA 167 Query: 183 HAYDVASRAWIYVMERAKYSIVLTKFMIVNVEYLERYSCD--ERYAEGRKIVDEMNNCED 356 H D++ R WIY + KYSIVLTKFM++ EYL +YSC+ R E RK VD NCED Sbjct: 168 HDLDLSRREWIYTVHPDKYSIVLTKFMVLKTEYLYKYSCEGGARMMEARKAVDMAQNCED 227 Query: 357 ILMNFVVAEESQKXXXXXXXXXXXXXRDYGDARND-GVSAEMREVGLSSRRGEHRKRRGD 533 ILMNFVVAEE RD+GD RND G+ + REVGLSSRRGEHRKRRG Sbjct: 228 ILMNFVVAEEGN---AGPMLVGAEKVRDWGDGRNDEGLGLKEREVGLSSRRGEHRKRRGG 284 Query: 534 CITQFHRVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCD 656 CI FHR+L RMPL+Y YGK+V+ + E+GLC K GKLV CD Sbjct: 285 CIGDFHRILRRMPLRYGYGKVVNSIAEEGLCMKDGKLVFCD 325 >ref|XP_006850947.1| hypothetical protein AMTR_s00025p00194100 [Amborella trichopoda] gi|548854618|gb|ERN12528.1| hypothetical protein AMTR_s00025p00194100 [Amborella trichopoda] Length = 329 Score = 245 bits (625), Expect = 2e-62 Identities = 124/220 (56%), Positives = 159/220 (72%) Frame = +3 Query: 3 AVLRAPSSSLNHRFYPWEAIGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFARS 182 +V+R P SLN RF P I TRAV CDDD+ DP+SV AF VW+ +P++ VGFFARS Sbjct: 114 SVIRDPHLSLNLRFSPRGIIRTRAVATCDDDVLVDPASVHLAFRVWQRNPEKLVGFFARS 173 Query: 183 HAYDVASRAWIYVMERAKYSIVLTKFMIVNVEYLERYSCDERYAEGRKIVDEMNNCEDIL 362 H+YD++S++WIY ++ AK+S+VLTKFMI+++EYL YSC + A R +VDE NCEDIL Sbjct: 174 HSYDISSKSWIYTVDPAKFSMVLTKFMILSIEYLFEYSC-KIPARARGVVDEWRNCEDIL 232 Query: 363 MNFVVAEESQKXXXXXXXXXXXXXRDYGDARNDGVSAEMREVGLSSRRGEHRKRRGDCIT 542 MNFVV+ ++ RD+GDARN+G REVGLS R EHRKRRG CI Sbjct: 233 MNFVVSSGARS----GPILVEGKARDWGDARNEG--GVGREVGLSMRNVEHRKRRGLCIE 286 Query: 543 QFHRVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCDHQ 662 +FHR+LGRMPL+YSYG++V VGEQG+C+K G LVLCD + Sbjct: 287 EFHRILGRMPLRYSYGRVVGGVGEQGMCEKGGDLVLCDQE 326 >ref|XP_002889300.1| glycosyltransferase family protein 47 [Arabidopsis lyrata subsp. 
lyrata] gi|297335141|gb|EFH65559.1| glycosyltransferase family protein 47 [Arabidopsis lyrata subsp. lyrata] Length = 385 Score = 245 bits (625), Expect = 2e-62 Identities = 126/223 (56%), Positives = 154/223 (69%), Gaps = 5/223 (2%) Frame = +3 Query: 3 AVLRAPSSSLNHRFYPWEAIGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFARS 182 ++++ SSSLN RF P ++ TRAVLICDDD+E D S+ FAFSVW+S+PDR VG F RS Sbjct: 164 SLIQQSSSSLNARFLPRSSVHTRAVLICDDDVEIDRRSLEFAFSVWKSNPDRLVGTFVRS 223 Query: 183 HAYDVASRAWIYVMERAKYSIVLTKFMIVNVEYLERYSC--DERYAEGRKIVDEMNNCED 356 H +D+ + WIY + KYSIVLTKFM++ +YL YSC E R IVD M NCED Sbjct: 224 HGFDLQGKEWIYTVHPDKYSIVLTKFMMMKQDYLYEYSCKGGVEMEEMRMIVDRMRNCED 283 Query: 357 ILMNFVVAEESQKXXXXXXXXXXXXXRDYGDARNDG---VSAEMREVGLSSRRGEHRKRR 527 IL+NFV A+ + RD+GDARN+ V +R+ GLSSRR EHRKRR Sbjct: 284 ILLNFVAAD---RLRAGPIMVGAERVRDWGDARNEEEQVVDERVRDAGLSSRRVEHRKRR 340 Query: 528 GDCITQFHRVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCD 656 G CI +FHRV+G+MPL YSYGK+VD VGEQGLC KAGKLV CD Sbjct: 341 GKCIREFHRVMGKMPLMYSYGKVVDSVGEQGLCRKAGKLVFCD 383 >ref|XP_007159619.1| hypothetical protein PHAVU_002G252600g [Phaseolus vulgaris] gi|561033034|gb|ESW31613.1| hypothetical protein PHAVU_002G252600g [Phaseolus vulgaris] Length = 324 Score = 244 bits (622), Expect = 3e-62 Identities = 128/222 (57%), Positives = 153/222 (68%), Gaps = 3/222 (1%) Frame = +3 Query: 3 AVLRAPSSSLNHRFYPWEA-IGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFAR 179 ++L PS+SLN+RF P + I T +VLICDDD+E D S+ FAF VW +P+R VG FAR Sbjct: 107 SLLSQPSASLNNRFLPRPSHISTDSVLICDDDVEVDSKSLEFAFRVWEKNPNRLVGLFAR 166 Query: 180 SHAYDVASRAWIYVMERAKYSIVLTKFMIVNVEYLERYSC--DERYAEGRKIVDEMNNCE 353 SH D+ R WIY + ++SIVLTKFM + YL Y+C R R IVD + NCE Sbjct: 167 SHDLDLNRREWIYTVHPDRFSIVLTKFMFLRARYLYLYTCVGGARMERARGIVDAVRNCE 226 Query: 354 DILMNFVVAEESQKXXXXXXXXXXXXXRDYGDARNDGVSAEMREVGLSSRRGEHRKRRGD 533 D+LMNFV+AEE+Q DYGDARNDG E VGLSSR+GEHRKRRG Sbjct: 227 DLLMNFVMAEEAQ---VGPLLVGAKRVMDYGDARNDG--EEGVRVGLSSRKGEHRKRRGW 281 Query: 534 CITQFHRVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCDH 659 CI 
+FHRVLGRMPL+YSYGK+VDDVGEQGLC K GK+V CDH Sbjct: 282 CIGEFHRVLGRMPLRYSYGKIVDDVGEQGLCYKGGKIVFCDH 323 >ref|XP_006299931.1| hypothetical protein CARUB_v10016140mg, partial [Capsella rubella] gi|482568640|gb|EOA32829.1| hypothetical protein CARUB_v10016140mg, partial [Capsella rubella] Length = 317 Score = 243 bits (619), Expect = 8e-62 Identities = 125/221 (56%), Positives = 155/221 (70%), Gaps = 3/221 (1%) Frame = +3 Query: 3 AVLRAPSSSLNHRFYPWEAIGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFARS 182 ++++ SSSLN RF P ++ TRAVLICDDD+E D S+ AFSVW+S+PDR VG F RS Sbjct: 98 SLIQQSSSSLNARFLPRSSVDTRAVLICDDDVEIDRRSLELAFSVWKSNPDRLVGPFVRS 157 Query: 183 HAYDVASRAWIYVMERAKYSIVLTKFMIVNVEYLERYSCDE--RYAEGRKIVDEMNNCED 356 H +D+ + WIY + KYSIVLTKFM++ +YL YSC E R +V+ M NCED Sbjct: 158 HGFDLQGKEWIYTVHPDKYSIVLTKFMVMKQDYLFDYSCKGGLEMKEMRMVVERMRNCED 217 Query: 357 ILMNFVVAEESQKXXXXXXXXXXXXXRDYGDARN-DGVSAEMREVGLSSRRGEHRKRRGD 533 ILMNFVVA+ K RD+GDARN + V +R+VGLSSRR EHRKRRG Sbjct: 218 ILMNFVVAD---KLRAGPIMVGAERLRDWGDARNEEEVGEGVRDVGLSSRRVEHRKRRGK 274 Query: 534 CITQFHRVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCD 656 CI +FHR++G+MPL YSYGK+V+ VGEQGLC KAGKLV CD Sbjct: 275 CIREFHRIMGKMPLLYSYGKVVNSVGEQGLCRKAGKLVFCD 315 >ref|XP_006483629.1| PREDICTED: exostosin-like 3-like [Citrus sinensis] Length = 352 Score = 242 bits (618), Expect = 1e-61 Identities = 126/227 (55%), Positives = 154/227 (67%), Gaps = 9/227 (3%) Frame = +3 Query: 3 AVLRAPSSSLNHRFYPWEAIGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFARS 182 +++R PSSSLN RF P +I T AVLICDDD+E D S+ FAF +W+S+ +R +G FARS Sbjct: 114 SLIRQPSSSLNARFLPRSSIRTHAVLICDDDVEMDQKSLEFAFRIWQSNANRLIGVFARS 173 Query: 183 HAYDVASRAWIYVMERAKYSIVLTKFMIVNVEYLERYSCDERYAEG--RKIVDEMNNCED 356 H D+ ++ WIY + KYSIVLTK M + YL YSC A G R+IVDEM NCED Sbjct: 174 HDVDLVNKEWIYTVHPDKYSIVLTKLMFLKSSYLFEYSCGGGAAMGEMRRIVDEMRNCED 233 Query: 357 ILMNFVVAEESQKXXXXXXXXXXXXXRDYGDARND-------GVSAEMREVGLSSRRGEH 515 ILMNFVVA+ + RD+GDARND G + + VGLSSR+ EH Sbjct: 234 ILMNFVVAD---RINAGPLMVGAERVRDWGDARNDRDDGGARGSRSRVSAVGLSSRKMEH 290 Query: 516 
RKRRGDCITQFHRVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCD 656 RKRRG CI +FHRVLGRMPL+YSYGK+V+ VGEQGLC+ GKLV CD Sbjct: 291 RKRRGKCIREFHRVLGRMPLRYSYGKVVNSVGEQGLCENGGKLVFCD 337 >ref|XP_006450102.1| hypothetical protein CICLE_v10010680mg [Citrus clementina] gi|557553328|gb|ESR63342.1| hypothetical protein CICLE_v10010680mg [Citrus clementina] Length = 339 Score = 242 bits (618), Expect = 1e-61 Identities = 126/227 (55%), Positives = 154/227 (67%), Gaps = 9/227 (3%) Frame = +3 Query: 3 AVLRAPSSSLNHRFYPWEAIGTRAVLICDDDIEPDPSSVAFAFSVWRSDPDRAVGFFARS 182 +++R PSSSLN RF P +I T AVLICDDD+E D S+ FAF +W+S+ +R +G FARS Sbjct: 114 SLIRQPSSSLNARFLPRSSIRTHAVLICDDDVEMDQKSLEFAFRIWQSNANRLIGVFARS 173 Query: 183 HAYDVASRAWIYVMERAKYSIVLTKFMIVNVEYLERYSCDERYAEG--RKIVDEMNNCED 356 H D+ ++ WIY + KYSIVLTK M + YL YSC A G R+IVDEM NCED Sbjct: 174 HDVDLVNKEWIYTVHPDKYSIVLTKLMFLKSSYLFEYSCGGGAAMGEMRRIVDEMRNCED 233 Query: 357 ILMNFVVAEESQKXXXXXXXXXXXXXRDYGDARND-------GVSAEMREVGLSSRRGEH 515 ILMNFVVA+ + RD+GDARND G + + VGLSSR+ EH Sbjct: 234 ILMNFVVAD---RINAGPLMVGAERVRDWGDARNDRDDGGARGSRSRVSAVGLSSRKMEH 290 Query: 516 RKRRGDCITQFHRVLGRMPLKYSYGKMVDDVGEQGLCDKAGKLVLCD 656 RKRRG CI +FHRVLGRMPL+YSYGK+V+ VGEQGLC+ GKLV CD Sbjct: 291 RKRRGKCIREFHRVLGRMPLRYSYGKVVNSVGEQGLCENGGKLVFCD 337