BLASTX nr result
ID: Perilla23_contig00013079
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00013079
         (1361 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011075726.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   651   0.0
ref|XP_012851011.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   645   0.0
ref|XP_012847878.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   640   0.0
ref|XP_011075723.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   639   e-180
gb|EYU26066.1| hypothetical protein MIMGU_mgv1a018021mg, partial...   632   e-178
ref|XP_011075727.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   627   e-177
ref|XP_010249387.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   580   e-162
gb|KHG21108.1| Heparan-alpha-glucosaminide N-acetyltransferase [...   580   e-162
ref|XP_002275105.2| PREDICTED: heparan-alpha-glucosaminide N-ace...   573   e-161
ref|XP_012460483.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   570   e-160
ref|XP_007048603.1| Uncharacterized protein isoform 1 [Theobroma...   570   e-159
ref|XP_010657686.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   566   e-158
ref|XP_010657690.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   565   e-158
ref|XP_012460482.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   563   e-158
gb|KOM28487.1| hypothetical protein LR48_Vigan549s004200 [Vigna ...   563   e-157
ref|XP_008222296.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   562   e-157
ref|XP_002303734.2| hypothetical protein POPTR_0003s15750g [Popu...   562   e-157
ref|XP_011020932.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   561   e-157
gb|KHN24949.1| Heparan-alpha-glucosaminide N-acetyltransferase [...
561 e-157 ref|XP_003532336.1| PREDICTED: heparan-alpha-glucosaminide N-ace... 561 e-157 >ref|XP_011075726.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like isoform X1 [Sesamum indicum] Length = 462 Score = 651 bits (1680), Expect = 0.0 Identities = 312/426 (73%), Positives = 355/426 (83%) Frame = -3 Query: 1359 ETAYLKSNTPSPRLRQSNFGGGNASSKCTPPSGRLVSLDVFRGLTVVLMIIVDDAGGFIP 1180 E+A+ K N SP LR + G NASS+ T S RLVSLDVFRGLTVVLMI+VDDAGG +P Sbjct: 35 ESAFQKMNPSSPSLRSGSDTGANASSRSTSASVRLVSLDVFRGLTVVLMILVDDAGGILP 94 Query: 1179 SINHSPWNGLTLADFVMPFFLFMVGVSLGLVYKNMVCRVAATKKAIFRAGKLLILGIFLQ 1000 +INHSPWNGLTLAD VMPFFLFMVGVSL LVYKNM R AA+KKAI RA KLLILG+FLQ Sbjct: 95 TINHSPWNGLTLADVVMPFFLFMVGVSLALVYKNMSSRAAASKKAILRALKLLILGVFLQ 154 Query: 999 GGYFHSLNNLTYGVDLEQIRWMGILQRIAIAYWFAAMCEIWLRNDEKVDSRLSLLKKYKW 820 GGYFH +NNLTYGVD++ IRWMGILQRIA+AY AAMCEIWLRNDEKV S SLLKKY+W Sbjct: 155 GGYFHGINNLTYGVDIDLIRWMGILQRIAVAYLVAAMCEIWLRNDEKVSSGSSLLKKYQW 214 Query: 819 QWXXXXXXXXXXXXXXXXXXVPDWEYHIPAGASSGGEVFRVKCGVRGDTGPACNAAGMID 640 W VPDWEY IP GA + ++F VKCGV GDTGPACNAAGMID Sbjct: 215 HWILMSMLSTMYLLLLYGLYVPDWEYKIPIGAPAEAKIFMVKCGVHGDTGPACNAAGMID 274 Query: 639 RAVLGIQRLYRKPIYARTEQCSINSPDYGPLPPDAPSWCRAPFDPEGILSTVMAIVTCLL 460 R +LG+Q LYR+PIYART+QCSINSPDYGPLPP+APSWC+AP+DPEGILSTVMAIVTCL+ Sbjct: 275 RMILGVQHLYRRPIYARTKQCSINSPDYGPLPPNAPSWCQAPYDPEGILSTVMAIVTCLI 334 Query: 459 GLQYGHVIVHYKDHKKRLSLWSYPSTGFMLLGLLCHILGMHINKALYSVSYTCVTAGVAG 280 GLQ+GH+IVH+KDH+ RL LW PS+ F++LG+LC + GMHINKALYS SYTCVT GVAG Sbjct: 335 GLQFGHIIVHFKDHRNRLLLWIAPSSAFIVLGVLCDVFGMHINKALYSFSYTCVTGGVAG 394 Query: 279 ILLATIYLVVDVHGWRRFTALLEWMGMHALLIYILVACNILPLILQGFYWKHPHNNILTL 100 +LLATIYLVVDV+G RR+ +LEWMGM+ALLIYILVACNILPLILQGFYW+ P NNIL+L Sbjct: 395 LLLATIYLVVDVYGCRRYALVLEWMGMNALLIYILVACNILPLILQGFYWRDPRNNILSL 454 Query: 99 IGIGKR 82 +GIGK+ Sbjct: 455 VGIGKQ 460 >ref|XP_012851011.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Erythranthe guttatus] Length = 456 Score = 645 bits (1664), Expect = 0.0 
Identities = 316/421 (75%), Positives = 352/421 (83%), Gaps = 1/421 (0%) Frame = -3 Query: 1341 SNTPSPRLRQSNFGGGNASSKCTPPSGRLVSLDVFRGLTVVLMIIVDDAGGFIPSINHSP 1162 S++PS RLR ASS T PS RLVSLDVFRGLTVVLMIIVDDAGG IPSINHSP Sbjct: 42 SSSPSHRLRPK------ASS--TQPSSRLVSLDVFRGLTVVLMIIVDDAGGIIPSINHSP 93 Query: 1161 WNGLTLADFVMPFFLFMVGVSLGLVYKNMVCRVAATKKAIFRAGKLLILGIFLQGGYFHS 982 WNGLTLADFVMPFFLFMVGVSLGLVYKNM CR A++KAIFR K LILG+FLQGGYFH Sbjct: 94 WNGLTLADFVMPFFLFMVGVSLGLVYKNMPCRATASRKAIFRTVKFLILGVFLQGGYFHG 153 Query: 981 LNNLTYGVDLEQIRWMGILQRIAIAYWFAAMCEIWLRNDEKVDSRLSLLKKYKWQWXXXX 802 +NNLTYGVD+ QIRWMGILQRIAIAY AMCEIWLRNDEKV S LSLLKKY+W W Sbjct: 154 INNLTYGVDMGQIRWMGILQRIAIAYLVGAMCEIWLRNDEKVGSGLSLLKKYQWHWVMVF 213 Query: 801 XXXXXXXXXXXXXXVPDWEYHIPAGASSG-GEVFRVKCGVRGDTGPACNAAGMIDRAVLG 625 VPDW Y +P ASS E+ +VKCGVRG+TGPACNAAGMIDR +LG Sbjct: 214 MLTTVYLVLLYGLYVPDWSYQVPLSASSEVRELVKVKCGVRGNTGPACNAAGMIDRVILG 273 Query: 624 IQRLYRKPIYARTEQCSINSPDYGPLPPDAPSWCRAPFDPEGILSTVMAIVTCLLGLQYG 445 IQ LYRKPIYART+QCSINSPDYGPLPP+APSWC+APFDPEG+LSTVMA+ TCL+G+QYG Sbjct: 274 IQHLYRKPIYARTQQCSINSPDYGPLPPNAPSWCQAPFDPEGLLSTVMALATCLIGVQYG 333 Query: 444 HVIVHYKDHKKRLSLWSYPSTGFMLLGLLCHILGMHINKALYSVSYTCVTAGVAGILLAT 265 HVIVH+KDHK RL W PS+GF++LG+LC+I GMHINKALYS SY CVT GVAGILLA+ Sbjct: 334 HVIVHFKDHKYRLLQWLVPSSGFIILGVLCNIFGMHINKALYSFSYMCVTTGVAGILLAS 393 Query: 264 IYLVVDVHGWRRFTALLEWMGMHALLIYILVACNILPLILQGFYWKHPHNNILTLIGIGK 85 IYLVVDV+G+RRFT +LEWMGM+AL+IY+LVACNILPLILQGFYW HP NNIL+LIG+GK Sbjct: 394 IYLVVDVYGYRRFTMVLEWMGMNALVIYVLVACNILPLILQGFYWNHPGNNILSLIGVGK 453 Query: 84 R 82 + Sbjct: 454 Q 454 >ref|XP_012847878.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Erythranthe guttatus] gi|604316187|gb|EYU28584.1| hypothetical protein MIMGU_mgv1a006117mg [Erythranthe guttata] Length = 456 Score = 640 bits (1652), Expect = 0.0 Identities = 311/420 (74%), Positives = 346/420 (82%) Frame = -3 Query: 1341 SNTPSPRLRQSNFGGGNASSKCTPPSGRLVSLDVFRGLTVVLMIIVDDAGGFIPSINHSP 1162 S++PS RLR CT PS 
RLVSLDVFRGLTVVLMIIVDDAGG IPSINHSP Sbjct: 46 SSSPSHRLRPK---------ACTQPSSRLVSLDVFRGLTVVLMIIVDDAGGIIPSINHSP 96 Query: 1161 WNGLTLADFVMPFFLFMVGVSLGLVYKNMVCRVAATKKAIFRAGKLLILGIFLQGGYFHS 982 WNGLTLADFVMPFFLFMVGVSLGLVYKNM CR A++KAIFR K LILG+FLQGGYFH Sbjct: 97 WNGLTLADFVMPFFLFMVGVSLGLVYKNMPCRTTASRKAIFRTAKFLILGVFLQGGYFHG 156 Query: 981 LNNLTYGVDLEQIRWMGILQRIAIAYWFAAMCEIWLRNDEKVDSRLSLLKKYKWQWXXXX 802 +NNLTYGVD+ QIRWMGILQRIAIAY AMCEIWLRND+KV S LSLLKKY+W W Sbjct: 157 INNLTYGVDMGQIRWMGILQRIAIAYLVGAMCEIWLRNDDKVGSGLSLLKKYQWHWVMVF 216 Query: 801 XXXXXXXXXXXXXXVPDWEYHIPAGASSGGEVFRVKCGVRGDTGPACNAAGMIDRAVLGI 622 VPDW Y +P AS EV +VKCGVRG+TGPACNAAGMIDR +LG+ Sbjct: 217 MLTTVYLVLLYGLYVPDWSYQVPLSASF--EVKKVKCGVRGNTGPACNAAGMIDRMILGV 274 Query: 621 QRLYRKPIYARTEQCSINSPDYGPLPPDAPSWCRAPFDPEGILSTVMAIVTCLLGLQYGH 442 Q LYRKPIYART+QCS NSPDYGPLPP+APSWC+APFDPEG+LSTVMA+ TCL+G+QYGH Sbjct: 275 QHLYRKPIYARTQQCSTNSPDYGPLPPNAPSWCQAPFDPEGLLSTVMALATCLIGVQYGH 334 Query: 441 VIVHYKDHKKRLSLWSYPSTGFMLLGLLCHILGMHINKALYSVSYTCVTAGVAGILLATI 262 VIVH+KDHK RL W PS GF++LG+LC+I GMHINKALYS SY CVT GVAGILLA+I Sbjct: 335 VIVHFKDHKYRLLQWLVPSLGFIILGVLCNIFGMHINKALYSFSYMCVTTGVAGILLASI 394 Query: 261 YLVVDVHGWRRFTALLEWMGMHALLIYILVACNILPLILQGFYWKHPHNNILTLIGIGKR 82 YLVVDV+G+RR T +LEWMGM+AL+IY+LVACNILPLILQGFYW HP NNIL+LIGIGK+ Sbjct: 395 YLVVDVYGYRRCTMVLEWMGMNALVIYVLVACNILPLILQGFYWNHPGNNILSLIGIGKQ 454 >ref|XP_011075723.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Sesamum indicum] gi|747058757|ref|XP_011075724.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Sesamum indicum] gi|747058759|ref|XP_011075725.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Sesamum indicum] Length = 464 Score = 639 bits (1648), Expect = e-180 Identities = 312/428 (72%), Positives = 352/428 (82%), Gaps = 2/428 (0%) Frame = -3 Query: 1359 ETAYLKSNTPSPRLRQSNFG-GGNASSKCTPPSGRLVSLDVFRGLTVVLMIIVDDAGGFI 1183 E+A+ K SP LR + N +S+ T PS RL SLDVFRGLTV LMI+VDDAGG + Sbjct: 35 
ESAFQKIYPSSPPLRPAGSDTAANGASRGTSPSARLASLDVFRGLTVALMILVDDAGGIL 94 Query: 1182 PSINHSPWNGLTLADFVMPFFLFMVGVSLGLVYKNMVCRVAATKKAIFRAGKLLILGIFL 1003 P+INHSPWNGLTLAD VMPFFLFMVGVSL LVYKNM R AA+KKAIFRA KLLILG+FL Sbjct: 95 PTINHSPWNGLTLADVVMPFFLFMVGVSLALVYKNMSWRGAASKKAIFRALKLLILGVFL 154 Query: 1002 QGGYFHSLNNLTYGVDLEQIRWMGILQRIAIAYWFAAMCEIWLRNDEKVDSRLSLLKKYK 823 QGGYFH +NNLTYGVD++ IRWMGILQRIA+AY AAMCEIWLRNDEKV S LSLLKKYK Sbjct: 155 QGGYFHGINNLTYGVDMDLIRWMGILQRIAVAYLVAAMCEIWLRNDEKVSSGLSLLKKYK 214 Query: 822 WQWXXXXXXXXXXXXXXXXXXVPDWEYHIPAGA-SSGGEVFRVKCGVRGDTGPACNAAGM 646 W W VPDWEY IP GA S ++F VKCGV GDTGPACNAAGM Sbjct: 215 WHWIMMLVLTTIYMLLLYGLFVPDWEYQIPLGAPSEEAKIFMVKCGVHGDTGPACNAAGM 274 Query: 645 IDRAVLGIQRLYRKPIYARTEQCSINSPDYGPLPPDAPSWCRAPFDPEGILSTVMAIVTC 466 IDR +LG+Q LYR+PIYART+QCSINSPDYGPLPP+APSWC+APFDPEGILST+MAIVTC Sbjct: 275 IDRMILGVQHLYRRPIYARTQQCSINSPDYGPLPPNAPSWCQAPFDPEGILSTMMAIVTC 334 Query: 465 LLGLQYGHVIVHYKDHKKRLSLWSYPSTGFMLLGLLCHILGMHINKALYSVSYTCVTAGV 286 L+GLQ+GHV+VH+KDH+ RL LW PS+ F++LGLLC I GMHINKALYS SYTCVT+G+ Sbjct: 335 LIGLQFGHVVVHFKDHRNRLLLWLAPSSAFIVLGLLCDIFGMHINKALYSFSYTCVTSGL 394 Query: 285 AGILLATIYLVVDVHGWRRFTALLEWMGMHALLIYILVACNILPLILQGFYWKHPHNNIL 106 AG LLATIYLVVDV+G RRF +LEWMGM+ALLIYILVACNILPLILQGFYW+ P NNIL Sbjct: 395 AGFLLATIYLVVDVYGCRRFALVLEWMGMNALLIYILVACNILPLILQGFYWRDPRNNIL 454 Query: 105 TLIGIGKR 82 +L+GI K+ Sbjct: 455 SLVGIRKQ 462 >gb|EYU26066.1| hypothetical protein MIMGU_mgv1a018021mg, partial [Erythranthe guttata] Length = 430 Score = 632 bits (1630), Expect = e-178 Identities = 310/412 (75%), Positives = 343/412 (83%), Gaps = 1/412 (0%) Frame = -3 Query: 1341 SNTPSPRLRQSNFGGGNASSKCTPPSGRLVSLDVFRGLTVVLMIIVDDAGGFIPSINHSP 1162 S++PS RLR ASS T PS RLVSLDVFRGLTVVLMIIVDDAGG IPSINHSP Sbjct: 27 SSSPSHRLRPK------ASS--TQPSSRLVSLDVFRGLTVVLMIIVDDAGGIIPSINHSP 78 Query: 1161 WNGLTLADFVMPFFLFMVGVSLGLVYKNMVCRVAATKKAIFRAGKLLILGIFLQGGYFHS 982 WNGLTLADFVMPFFLFMVGVSLGLVYKNM CR A++KAIFR K LILG+FLQGGYFH Sbjct: 79 
WNGLTLADFVMPFFLFMVGVSLGLVYKNMPCRATASRKAIFRTVKFLILGVFLQGGYFHG 138 Query: 981 LNNLTYGVDLEQIRWMGILQRIAIAYWFAAMCEIWLRNDEKVDSRLSLLKKYKWQWXXXX 802 +NNLTYGVD+ QIRWMGILQRIAIAY AMCEIWLRNDEKV S LSLLKKY+W W Sbjct: 139 INNLTYGVDMGQIRWMGILQRIAIAYLVGAMCEIWLRNDEKVGSGLSLLKKYQWHWVMVF 198 Query: 801 XXXXXXXXXXXXXXVPDWEYHIPAGASSG-GEVFRVKCGVRGDTGPACNAAGMIDRAVLG 625 VPDW Y +P ASS E+ +VKCGVRG+TGPACNAAGMIDR +LG Sbjct: 199 MLTTVYLVLLYGLYVPDWSYQVPLSASSEVRELVKVKCGVRGNTGPACNAAGMIDRVILG 258 Query: 624 IQRLYRKPIYARTEQCSINSPDYGPLPPDAPSWCRAPFDPEGILSTVMAIVTCLLGLQYG 445 IQ LYRKPIYART+QCSINSPDYGPLPP+APSWC+APFDPEG+LSTVMA+ TCL+G+QYG Sbjct: 259 IQHLYRKPIYARTQQCSINSPDYGPLPPNAPSWCQAPFDPEGLLSTVMALATCLIGVQYG 318 Query: 444 HVIVHYKDHKKRLSLWSYPSTGFMLLGLLCHILGMHINKALYSVSYTCVTAGVAGILLAT 265 HVIVH+KDHK RL W PS+GF++LG+LC+I GMHINKALYS SY CVT GVAGILLA+ Sbjct: 319 HVIVHFKDHKYRLLQWLVPSSGFIILGVLCNIFGMHINKALYSFSYMCVTTGVAGILLAS 378 Query: 264 IYLVVDVHGWRRFTALLEWMGMHALLIYILVACNILPLILQGFYWKHPHNNI 109 IYLVVDV+G+RRFT +LEWMGM+AL+IY+LVACNILPLILQGFYW HP NNI Sbjct: 379 IYLVVDVYGYRRFTMVLEWMGMNALVIYVLVACNILPLILQGFYWNHPGNNI 430 >ref|XP_011075727.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like isoform X2 [Sesamum indicum] Length = 452 Score = 627 bits (1617), Expect = e-177 Identities = 303/426 (71%), Positives = 346/426 (81%) Frame = -3 Query: 1359 ETAYLKSNTPSPRLRQSNFGGGNASSKCTPPSGRLVSLDVFRGLTVVLMIIVDDAGGFIP 1180 E+A+ K N SP LR + G NASS+ T S RLVSLDVFRGLTVVLMI+VDDAGG +P Sbjct: 35 ESAFQKMNPSSPSLRSGSDTGANASSRSTSASVRLVSLDVFRGLTVVLMILVDDAGGILP 94 Query: 1179 SINHSPWNGLTLADFVMPFFLFMVGVSLGLVYKNMVCRVAATKKAIFRAGKLLILGIFLQ 1000 +INHSPWNGLTLAD VMPFFLFM NM R AA+KKAI RA KLLILG+FLQ Sbjct: 95 TINHSPWNGLTLADVVMPFFLFM----------NMSSRAAASKKAILRALKLLILGVFLQ 144 Query: 999 GGYFHSLNNLTYGVDLEQIRWMGILQRIAIAYWFAAMCEIWLRNDEKVDSRLSLLKKYKW 820 GGYFH +NNLTYGVD++ IRWMGILQRIA+AY AAMCEIWLRNDEKV S SLLKKY+W Sbjct: 145 GGYFHGINNLTYGVDIDLIRWMGILQRIAVAYLVAAMCEIWLRNDEKVSSGSSLLKKYQW 204 Query: 819 
QWXXXXXXXXXXXXXXXXXXVPDWEYHIPAGASSGGEVFRVKCGVRGDTGPACNAAGMID 640 W VPDWEY IP GA + ++F VKCGV GDTGPACNAAGMID Sbjct: 205 HWILMSMLSTMYLLLLYGLYVPDWEYKIPIGAPAEAKIFMVKCGVHGDTGPACNAAGMID 264 Query: 639 RAVLGIQRLYRKPIYARTEQCSINSPDYGPLPPDAPSWCRAPFDPEGILSTVMAIVTCLL 460 R +LG+Q LYR+PIYART+QCSINSPDYGPLPP+APSWC+AP+DPEGILSTVMAIVTCL+ Sbjct: 265 RMILGVQHLYRRPIYARTKQCSINSPDYGPLPPNAPSWCQAPYDPEGILSTVMAIVTCLI 324 Query: 459 GLQYGHVIVHYKDHKKRLSLWSYPSTGFMLLGLLCHILGMHINKALYSVSYTCVTAGVAG 280 GLQ+GH+IVH+KDH+ RL LW PS+ F++LG+LC + GMHINKALYS SYTCVT GVAG Sbjct: 325 GLQFGHIIVHFKDHRNRLLLWIAPSSAFIVLGVLCDVFGMHINKALYSFSYTCVTGGVAG 384 Query: 279 ILLATIYLVVDVHGWRRFTALLEWMGMHALLIYILVACNILPLILQGFYWKHPHNNILTL 100 +LLATIYLVVDV+G RR+ +LEWMGM+ALLIYILVACNILPLILQGFYW+ P NNIL+L Sbjct: 385 LLLATIYLVVDVYGCRRYALVLEWMGMNALLIYILVACNILPLILQGFYWRDPRNNILSL 444 Query: 99 IGIGKR 82 +GIGK+ Sbjct: 445 VGIGKQ 450 >ref|XP_010249387.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Nelumbo nucifera] Length = 462 Score = 580 bits (1495), Expect = e-162 Identities = 271/405 (66%), Positives = 325/405 (80%), Gaps = 1/405 (0%) Frame = -3 Query: 1299 GGNASSKCTPPSGRLVSLDVFRGLTVVLMIIVDDAGGFIPSINHSPWNGLTLADFVMPFF 1120 GG + SK RLVSLDVFRG+TV LMI+VDDAGG PSINHSPW+G+TLADFVMPFF Sbjct: 62 GGESVSK-----RRLVSLDVFRGITVALMILVDDAGGMFPSINHSPWDGVTLADFVMPFF 116 Query: 1119 LFMVGVSLGLVYKNMVCRVAATKKAIFRAGKLLILGIFLQGGYFHSLNNLTYGVDLEQIR 940 LF+VG+SL L YK + CR+ ATKKA+ RA KL +G+ LQGGYFH LNNLTYGVD+ QIR Sbjct: 117 LFIVGLSLALAYKKLPCRIDATKKAVLRALKLFAVGLVLQGGYFHGLNNLTYGVDINQIR 176 Query: 939 WMGILQRIAIAYWFAAMCEIWLRNDEKVDSRLSLLKKYKWQWXXXXXXXXXXXXXXXXXX 760 WMG+LQRIAIAY +A+CEIW++ D+ +DS LSLLKKY++QW Sbjct: 177 WMGVLQRIAIAYLLSALCEIWIKGDDNIDSGLSLLKKYRYQWMVAFVLTTTYLTLLYGLY 236 Query: 759 VPDWEYHIPAGASSGG-EVFRVKCGVRGDTGPACNAAGMIDRAVLGIQRLYRKPIYARTE 583 +PDWEY I G+SS ++F VKCGVRGDTGPACNA GMIDR +LG+Q LY++PIYART+ Sbjct: 237 IPDWEYQIFNGSSSAAPKIFSVKCGVRGDTGPACNAVGMIDRKILGLQHLYKRPIYARTK 296 Query: 582 
QCSINSPDYGPLPPDAPSWCRAPFDPEGILSTVMAIVTCLLGLQYGHVIVHYKDHKKRLS 403 QCSINSPDYGPLPPDAPSWC+APFDPEG+LS+VMAIVTCL+GL YGH+IVH+KDHK+R+ Sbjct: 297 QCSINSPDYGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLHYGHIIVHFKDHKERIF 356 Query: 402 LWSYPSTGFMLLGLLCHILGMHINKALYSVSYTCVTAGVAGILLATIYLVVDVHGWRRFT 223 W P++G ++ G + GMH+NKALY+ SYTCVTAG AGIL Y++VDV+G+RR T Sbjct: 357 QWMIPASGLVVTGFVLDFFGMHLNKALYTFSYTCVTAGAAGILFTGTYVLVDVYGYRRPT 416 Query: 222 ALLEWMGMHALLIYILVACNILPLILQGFYWKHPHNNILTLIGIG 88 +LEWMGMHAL+IYIL ACNILP++LQGFYWK P NNIL LIGIG Sbjct: 417 MILEWMGMHALMIYILAACNILPVLLQGFYWKKPENNILRLIGIG 461 >gb|KHG21108.1| Heparan-alpha-glucosaminide N-acetyltransferase [Gossypium arboreum] Length = 481 Score = 580 bits (1494), Expect = e-162 Identities = 271/397 (68%), Positives = 324/397 (81%), Gaps = 1/397 (0%) Frame = -3 Query: 1272 PPSGRLVSLDVFRGLTVVLMIIVDDAGGFIPSINHSPWNGLTLADFVMPFFLFMVGVSLG 1093 PP RL+SLDVFRGLTVVLMI+VDD GG +P+INHSPWNGLTLAD+VMPFFLF+VGVSLG Sbjct: 85 PPQHRLISLDVFRGLTVVLMILVDDVGGILPAINHSPWNGLTLADYVMPFFLFIVGVSLG 144 Query: 1092 LVYKNMVCRVAATKKAIFRAGKLLILGIFLQGGYFHSLNNLTYGVDLEQIRWMGILQRIA 913 L YK + CRV AT+KAI RA KLLILGIFLQGG+FH LNNLTYG+D++Q+R MGILQRIA Sbjct: 145 LTYKRVSCRVNATRKAILRALKLLILGIFLQGGFFHGLNNLTYGMDIQQMRLMGILQRIA 204 Query: 912 IAYWFAAMCEIWLRNDEKVDSRLSLLKKYKWQWXXXXXXXXXXXXXXXXXXVPDWEYHIP 733 IAY AA+CEIWL+ D+ V S+L+LL+KY++QW VP W+Y IP Sbjct: 205 IAYLVAALCEIWLKGDDHVTSQLALLRKYQFQWLAASVLTVIYISLLYGLYVPTWQYQIP 264 Query: 732 -AGASSGGEVFRVKCGVRGDTGPACNAAGMIDRAVLGIQRLYRKPIYARTEQCSINSPDY 556 A +SS + F VKCGVRGDTGPACN GMIDR +LGIQ LYRKP++ RT+QCSINSPDY Sbjct: 265 DATSSSAPKTFSVKCGVRGDTGPACNVVGMIDRKILGIQHLYRKPVFERTKQCSINSPDY 324 Query: 555 GPLPPDAPSWCRAPFDPEGILSTVMAIVTCLLGLQYGHVIVHYKDHKKRLSLWSYPSTGF 376 GPLP DAP+WC+APF+PEG+LS+VMA+VTCL+GL YGH+IVH+KDH R+ LW PS+ F Sbjct: 325 GPLPSDAPAWCQAPFEPEGLLSSVMAMVTCLVGLHYGHIIVHFKDHADRIRLWLIPSSAF 384 Query: 375 MLLGLLCHILGMHINKALYSVSYTCVTAGVAGILLATIYLVVDVHGWRRFTALLEWMGMH 196 ++LGL I GMHINKALY+ SY CVTA AG L A IYL+VD+HG+RR T +LEWMG H Sbjct: 385 
LVLGLALDIFGMHINKALYTFSYMCVTASAAGFLFAGIYLLVDIHGYRRMTLVLEWMGKH 444 Query: 195 ALLIYILVACNILPLILQGFYWKHPHNNILTLIGIGK 85 AL+IYIL ACNI+P+++QGFYWK P NNIL+LIGIG+ Sbjct: 445 ALVIYILAACNIIPIVIQGFYWKQPQNNILSLIGIGR 481 >ref|XP_002275105.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform X5 [Vitis vinifera] gi|296085565|emb|CBI29297.3| unnamed protein product [Vitis vinifera] Length = 444 Score = 573 bits (1478), Expect = e-161 Identities = 277/415 (66%), Positives = 325/415 (78%), Gaps = 1/415 (0%) Frame = -3 Query: 1329 SPRLRQSNFGGGNASSKCTPPSGRLVSLDVFRGLTVVLMIIVDDAGGFIPSINHSPWNGL 1150 SPR S GGGNAS + RLVSLDVFRGLTV +MI+VDDAGG +P+INHSPWNGL Sbjct: 35 SPRSDGSGRGGGNASKR------RLVSLDVFRGLTVAIMILVDDAGGILPAINHSPWNGL 88 Query: 1149 TLADFVMPFFLFMVGVSLGLVYKNMVCRVAATKKAIFRAGKLLILGIFLQGGYFHSLNNL 970 TLADFVMPFFLF+VGVSL L YKN+ ATK A+ RA KLL+ G+FLQGGYFH LNNL Sbjct: 89 TLADFVMPFFLFIVGVSLALAYKNLSSGYLATKMAVVRALKLLVFGLFLQGGYFHGLNNL 148 Query: 969 TYGVDLEQIRWMGILQRIAIAYWFAAMCEIWLRNDEKVDSRLSLLKKYKWQWXXXXXXXX 790 TYGVD+EQIR GILQRIA+AY+ AA+CEIWL+ D V S SLLKKY++QW Sbjct: 149 TYGVDIEQIRLAGILQRIAVAYFLAAVCEIWLKGDSNVKSGSSLLKKYQFQWAVVLVLTV 208 Query: 789 XXXXXXXXXXVPDWEYHIPAGASSGG-EVFRVKCGVRGDTGPACNAAGMIDRAVLGIQRL 613 VPDWEY IP+ SS ++F+VKCGVR DTGPACNA GMIDR VLGIQ L Sbjct: 209 AYCSLLYGLYVPDWEYSIPSETSSSALKIFKVKCGVRSDTGPACNAVGMIDRNVLGIQHL 268 Query: 612 YRKPIYARTEQCSINSPDYGPLPPDAPSWCRAPFDPEGILSTVMAIVTCLLGLQYGHVIV 433 Y++PIYAR +QCSINSPDYGPLPP+AP+WC+APFDPEG+LS+VMAIVTCL+GL YGH+IV Sbjct: 269 YKRPIYARMKQCSINSPDYGPLPPNAPTWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIV 328 Query: 432 HYKDHKKRLSLWSYPSTGFMLLGLLCHILGMHINKALYSVSYTCVTAGVAGILLATIYLV 253 H+KDHK R+ W PS+ ++LG GMH+NKALY++SY CVTAG AGIL A IYL+ Sbjct: 329 HFKDHKDRILHWIVPSSCLLVLGFALDFFGMHVNKALYTLSYMCVTAGAAGILFAGIYLM 388 Query: 252 VDVHGWRRFTALLEWMGMHALLIYILVACNILPLILQGFYWKHPHNNILTLIGIG 88 VD++G+RR T ++EWMGMHAL+IYIL ACNILP+ LQGFYW+ P NNI LIGIG Sbjct: 389 VDMYGYRRPTIVMEWMGMHALMIYILAACNILPVFLQGFYWRRPQNNIFRLIGIG 443 >ref|XP_012460483.1| PREDICTED: 
heparan-alpha-glucosaminide N-acetyltransferase isoform X2 [Gossypium raimondii] gi|763808708|gb|KJB75610.1| hypothetical protein B456_012G048100 [Gossypium raimondii] Length = 481 Score = 570 bits (1470), Expect = e-160 Identities = 267/397 (67%), Positives = 323/397 (81%), Gaps = 1/397 (0%) Frame = -3 Query: 1272 PPSGRLVSLDVFRGLTVVLMIIVDDAGGFIPSINHSPWNGLTLADFVMPFFLFMVGVSLG 1093 PP RL+SLDVFRGLTVVLMI+VDD GG +P+INHSPWNGLTLAD+VMPFFLF+VGVSLG Sbjct: 85 PPQHRLISLDVFRGLTVVLMILVDDVGGILPAINHSPWNGLTLADYVMPFFLFIVGVSLG 144 Query: 1092 LVYKNMVCRVAATKKAIFRAGKLLILGIFLQGGYFHSLNNLTYGVDLEQIRWMGILQRIA 913 L YK + CRV AT+KAI RA KLLILGIFLQGG+FH LNNLTYG+D++Q+R MGILQRIA Sbjct: 145 LTYKRVSCRVTATRKAILRALKLLILGIFLQGGFFHGLNNLTYGMDIQQMRLMGILQRIA 204 Query: 912 IAYWFAAMCEIWLRNDEKVDSRLSLLKKYKWQWXXXXXXXXXXXXXXXXXXVPDWEYHIP 733 IAY AA+CEIWL+ D+ V S+L+LL+KY++Q VP W+Y IP Sbjct: 205 IAYLVAALCEIWLKGDDHVTSQLALLRKYQFQLLAASVLTVIYIFLLYGLYVPTWQYQIP 264 Query: 732 -AGASSGGEVFRVKCGVRGDTGPACNAAGMIDRAVLGIQRLYRKPIYARTEQCSINSPDY 556 A +SS + F VKCGVRGDTGPACN GMIDR +LGIQ LYRKP++ RT+QCSINSPDY Sbjct: 265 DATSSSAPKTFSVKCGVRGDTGPACNVVGMIDRKILGIQHLYRKPVFERTKQCSINSPDY 324 Query: 555 GPLPPDAPSWCRAPFDPEGILSTVMAIVTCLLGLQYGHVIVHYKDHKKRLSLWSYPSTGF 376 GPLP DAP+WC+APF+PEG+LS+VMA+VTC +GL YGH+IVH+KDH R+ LW PS+ F Sbjct: 325 GPLPSDAPAWCQAPFEPEGLLSSVMAMVTCFVGLHYGHIIVHFKDHADRIRLWLIPSSAF 384 Query: 375 MLLGLLCHILGMHINKALYSVSYTCVTAGVAGILLATIYLVVDVHGWRRFTALLEWMGMH 196 ++LGL I GMHINKALY+ SY CVTAG AG L A IYL+VD++G++R T +LEWMG H Sbjct: 385 LVLGLALDIFGMHINKALYTFSYMCVTAGAAGFLFAGIYLLVDIYGYQRMTLVLEWMGKH 444 Query: 195 ALLIYILVACNILPLILQGFYWKHPHNNILTLIGIGK 85 AL+IYIL ACN++P+++QGFYWK P NNIL+LIGIG+ Sbjct: 445 ALVIYILAACNLIPVVIQGFYWKQPPNNILSLIGIGR 481 >ref|XP_007048603.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508700864|gb|EOX92760.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 480 Score = 570 bits (1468), Expect = e-159 Identities = 268/392 (68%), Positives = 318/392 (81%), Gaps = 1/392 (0%) Frame = -3 Query: 
1260 RLVSLDVFRGLTVVLMIIVDDAGGFIPSINHSPWNGLTLADFVMPFFLFMVGVSLGLVYK 1081 RLVSLDVFRGLT+VLMI+VDD GG +P+INHSPWNGLTLAD+VMPFFLF+VGVSLGL YK Sbjct: 88 RLVSLDVFRGLTIVLMILVDDVGGLLPAINHSPWNGLTLADYVMPFFLFIVGVSLGLTYK 147 Query: 1080 NMVCRVAATKKAIFRAGKLLILGIFLQGGYFHSLNNLTYGVDLEQIRWMGILQRIAIAYW 901 + CRV AT+KAI RA KLL+LG+FLQGG+FH LNNLTYGVD++Q+R MGILQRIAIAY Sbjct: 148 RLSCRVTATRKAILRALKLLVLGLFLQGGFFHGLNNLTYGVDIQQMRLMGILQRIAIAYL 207 Query: 900 FAAMCEIWLRNDEKVDSRLSLLKKYKWQWXXXXXXXXXXXXXXXXXXVPDWEYHIP-AGA 724 AA+CEIWL+ D V S L+LLKK+++QW VPDWEY IP A + Sbjct: 208 VAAICEIWLKGDHHVKSELNLLKKHRFQWVAALALTIIYISLLYGLYVPDWEYQIPVATS 267 Query: 723 SSGGEVFRVKCGVRGDTGPACNAAGMIDRAVLGIQRLYRKPIYARTEQCSINSPDYGPLP 544 SS + F VKCGVRGDTGPACN GMIDR +LGI+ LYRKP++ RT+QCSINSPDYGPLP Sbjct: 268 SSAPKFFSVKCGVRGDTGPACNVVGMIDRKILGIKHLYRKPVFERTKQCSINSPDYGPLP 327 Query: 543 PDAPSWCRAPFDPEGILSTVMAIVTCLLGLQYGHVIVHYKDHKKRLSLWSYPSTGFMLLG 364 DAPSWC+APFDPEG+LS+VMA+VTCL+GL YG +IVH+KDH+ R+ LW S+G ++LG Sbjct: 328 SDAPSWCQAPFDPEGLLSSVMAMVTCLVGLHYGQIIVHFKDHRDRIRLWLISSSGLLVLG 387 Query: 363 LLCHILGMHINKALYSVSYTCVTAGVAGILLATIYLVVDVHGWRRFTALLEWMGMHALLI 184 L GMH+NKALY+ SY CVTAG AG L A IYL+VD+ G+RR T +LEWMG HAL+I Sbjct: 388 LALDFFGMHVNKALYTFSYMCVTAGAAGFLFAGIYLLVDICGYRRMTLVLEWMGKHALMI 447 Query: 183 YILVACNILPLILQGFYWKHPHNNILTLIGIG 88 YIL ACNI+P+I+QGFYWK P NNIL+LIGIG Sbjct: 448 YILAACNIVPIIIQGFYWKQPQNNILSLIGIG 479 >ref|XP_010657686.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform X3 [Vitis vinifera] Length = 452 Score = 566 bits (1459), Expect = e-158 Identities = 277/423 (65%), Positives = 325/423 (76%), Gaps = 9/423 (2%) Frame = -3 Query: 1329 SPRLRQSNFGGGNASSKCTPPSGRLVSLDVFRGLTVVL--------MIIVDDAGGFIPSI 1174 SPR S GGGNAS + RLVSLDVFRGLTV + MI+VDDAGG +P+I Sbjct: 35 SPRSDGSGRGGGNASKR------RLVSLDVFRGLTVAIFNLNLSQIMILVDDAGGILPAI 88 Query: 1173 NHSPWNGLTLADFVMPFFLFMVGVSLGLVYKNMVCRVAATKKAIFRAGKLLILGIFLQGG 994 NHSPWNGLTLADFVMPFFLF+VGVSL L YKN+ ATK A+ RA KLL+ G+FLQGG Sbjct: 89 
NHSPWNGLTLADFVMPFFLFIVGVSLALAYKNLSSGYLATKMAVVRALKLLVFGLFLQGG 148 Query: 993 YFHSLNNLTYGVDLEQIRWMGILQRIAIAYWFAAMCEIWLRNDEKVDSRLSLLKKYKWQW 814 YFH LNNLTYGVD+EQIR GILQRIA+AY+ AA+CEIWL+ D V S SLLKKY++QW Sbjct: 149 YFHGLNNLTYGVDIEQIRLAGILQRIAVAYFLAAVCEIWLKGDSNVKSGSSLLKKYQFQW 208 Query: 813 XXXXXXXXXXXXXXXXXXVPDWEYHIPAGASSGG-EVFRVKCGVRGDTGPACNAAGMIDR 637 VPDWEY IP+ SS ++F+VKCGVR DTGPACNA GMIDR Sbjct: 209 AVVLVLTVAYCSLLYGLYVPDWEYSIPSETSSSALKIFKVKCGVRSDTGPACNAVGMIDR 268 Query: 636 AVLGIQRLYRKPIYARTEQCSINSPDYGPLPPDAPSWCRAPFDPEGILSTVMAIVTCLLG 457 VLGIQ LY++PIYAR +QCSINSPDYGPLPP+AP+WC+APFDPEG+LS+VMAIVTCL+G Sbjct: 269 NVLGIQHLYKRPIYARMKQCSINSPDYGPLPPNAPTWCQAPFDPEGLLSSVMAIVTCLVG 328 Query: 456 LQYGHVIVHYKDHKKRLSLWSYPSTGFMLLGLLCHILGMHINKALYSVSYTCVTAGVAGI 277 L YGH+IVH+KDHK R+ W PS+ ++LG GMH+NKALY++SY CVTAG AGI Sbjct: 329 LHYGHIIVHFKDHKDRILHWIVPSSCLLVLGFALDFFGMHVNKALYTLSYMCVTAGAAGI 388 Query: 276 LLATIYLVVDVHGWRRFTALLEWMGMHALLIYILVACNILPLILQGFYWKHPHNNILTLI 97 L A IYL+VD++G+RR T ++EWMGMHAL+IYIL ACNILP+ LQGFYW+ P NNI LI Sbjct: 389 LFAGIYLMVDMYGYRRPTIVMEWMGMHALMIYILAACNILPVFLQGFYWRRPQNNIFRLI 448 Query: 96 GIG 88 GIG Sbjct: 449 GIG 451 >ref|XP_010657690.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform X4 [Vitis vinifera] Length = 447 Score = 565 bits (1456), Expect = e-158 Identities = 272/409 (66%), Positives = 321/409 (78%), Gaps = 1/409 (0%) Frame = -3 Query: 1329 SPRLRQSNFGGGNASSKCTPPSGRLVSLDVFRGLTVVLMIIVDDAGGFIPSINHSPWNGL 1150 SPR S GGGNAS + RLVSLDVFRGLTV +MI+VDDAGG +P+INHSPWNGL Sbjct: 35 SPRSDGSGRGGGNASKR------RLVSLDVFRGLTVAIMILVDDAGGILPAINHSPWNGL 88 Query: 1149 TLADFVMPFFLFMVGVSLGLVYKNMVCRVAATKKAIFRAGKLLILGIFLQGGYFHSLNNL 970 TLADFVMPFFLF+VGVSL L YKN+ ATK A+ RA KLL+ G+FLQGGYFH LNNL Sbjct: 89 TLADFVMPFFLFIVGVSLALAYKNLSSGYLATKMAVVRALKLLVFGLFLQGGYFHGLNNL 148 Query: 969 TYGVDLEQIRWMGILQRIAIAYWFAAMCEIWLRNDEKVDSRLSLLKKYKWQWXXXXXXXX 790 TYGVD+EQIR GILQRIA+AY+ AA+CEIWL+ D V S SLLKKY++QW Sbjct: 149 TYGVDIEQIRLAGILQRIAVAYFLAAVCEIWLKGDSNVKSGSSLLKKYQFQWAVVLVLTV 208 
Query: 789 XXXXXXXXXXVPDWEYHIPAGASSGG-EVFRVKCGVRGDTGPACNAAGMIDRAVLGIQRL 613 VPDWEY IP+ SS ++F+VKCGVR DTGPACNA GMIDR VLGIQ L Sbjct: 209 AYCSLLYGLYVPDWEYSIPSETSSSALKIFKVKCGVRSDTGPACNAVGMIDRNVLGIQHL 268 Query: 612 YRKPIYARTEQCSINSPDYGPLPPDAPSWCRAPFDPEGILSTVMAIVTCLLGLQYGHVIV 433 Y++PIYAR +QCSINSPDYGPLPP+AP+WC+APFDPEG+LS+VMAIVTCL+GL YGH+IV Sbjct: 269 YKRPIYARMKQCSINSPDYGPLPPNAPTWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIV 328 Query: 432 HYKDHKKRLSLWSYPSTGFMLLGLLCHILGMHINKALYSVSYTCVTAGVAGILLATIYLV 253 H+KDHK R+ W PS+ ++LG GMH+NKALY++SY CVTAG AGIL A IYL+ Sbjct: 329 HFKDHKDRILHWIVPSSCLLVLGFALDFFGMHVNKALYTLSYMCVTAGAAGILFAGIYLM 388 Query: 252 VDVHGWRRFTALLEWMGMHALLIYILVACNILPLILQGFYWKHPHNNIL 106 VD++G+RR T ++EWMGMHAL+IYIL ACNILP+ LQGFYW+ P NNI+ Sbjct: 389 VDMYGYRRPTIVMEWMGMHALMIYILAACNILPVFLQGFYWRRPQNNIV 437 >ref|XP_012460482.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform X1 [Gossypium raimondii] Length = 491 Score = 563 bits (1452), Expect = e-158 Identities = 267/407 (65%), Positives = 324/407 (79%), Gaps = 11/407 (2%) Frame = -3 Query: 1272 PPSGRLVSLDVFRGLTVVLMIIVDDAGGFIPSINHSPWNGLTLADFVMPFFLFMVGVSLG 1093 PP RL+SLDVFRGLTVVLMI+VDD GG +P+INHSPWNGLTLAD+VMPFFLF+VGVSLG Sbjct: 85 PPQHRLISLDVFRGLTVVLMILVDDVGGILPAINHSPWNGLTLADYVMPFFLFIVGVSLG 144 Query: 1092 LVYKNMVCRVAATKKAIFRAGKLLILGIFLQGGYFHSLNNLTYGVDLEQIRWMGILQRIA 913 L YK + CRV AT+KAI RA KLLILGIFLQGG+FH LNNLTYG+D++Q+R MGILQRIA Sbjct: 145 LTYKRVSCRVTATRKAILRALKLLILGIFLQGGFFHGLNNLTYGMDIQQMRLMGILQRIA 204 Query: 912 IAYWFAAMCEIWLRNDEKVDSRLSLLKKYKWQWXXXXXXXXXXXXXXXXXXVPDWEYHIP 733 IAY AA+CEIWL+ D+ V S+L+LL+KY++Q VP W+Y IP Sbjct: 205 IAYLVAALCEIWLKGDDHVTSQLALLRKYQFQLLAASVLTVIYIFLLYGLYVPTWQYQIP 264 Query: 732 -AGASSGGEVF----------RVKCGVRGDTGPACNAAGMIDRAVLGIQRLYRKPIYART 586 A +SS + F +VKCGVRGDTGPACN GMIDR +LGIQ LYRKP++ RT Sbjct: 265 DATSSSAPKTFSLACLCPFHSKVKCGVRGDTGPACNVVGMIDRKILGIQHLYRKPVFERT 324 Query: 585 EQCSINSPDYGPLPPDAPSWCRAPFDPEGILSTVMAIVTCLLGLQYGHVIVHYKDHKKRL 406 +QCSINSPDYGPLP DAP+WC+APF+PEG+LS+VMA+VTC +GL 
YGH+IVH+KDH R+ Sbjct: 325 KQCSINSPDYGPLPSDAPAWCQAPFEPEGLLSSVMAMVTCFVGLHYGHIIVHFKDHADRI 384 Query: 405 SLWSYPSTGFMLLGLLCHILGMHINKALYSVSYTCVTAGVAGILLATIYLVVDVHGWRRF 226 LW PS+ F++LGL I GMHINKALY+ SY CVTAG AG L A IYL+VD++G++R Sbjct: 385 RLWLIPSSAFLVLGLALDIFGMHINKALYTFSYMCVTAGAAGFLFAGIYLLVDIYGYQRM 444 Query: 225 TALLEWMGMHALLIYILVACNILPLILQGFYWKHPHNNILTLIGIGK 85 T +LEWMG HAL+IYIL ACN++P+++QGFYWK P NNIL+LIGIG+ Sbjct: 445 TLVLEWMGKHALVIYILAACNLIPVVIQGFYWKQPPNNILSLIGIGR 491 >gb|KOM28487.1| hypothetical protein LR48_Vigan549s004200 [Vigna angularis] Length = 460 Score = 563 bits (1451), Expect = e-157 Identities = 264/395 (66%), Positives = 313/395 (79%) Frame = -3 Query: 1272 PPSGRLVSLDVFRGLTVVLMIIVDDAGGFIPSINHSPWNGLTLADFVMPFFLFMVGVSLG 1093 P RLVSLDVFRGLTV LMI+VDDAGG IP++NHSPWNGLTLAD+VMPFFLF+VGVSL Sbjct: 65 PKPPRLVSLDVFRGLTVALMILVDDAGGLIPALNHSPWNGLTLADYVMPFFLFIVGVSLA 124 Query: 1092 LVYKNMVCRVAATKKAIFRAGKLLILGIFLQGGYFHSLNNLTYGVDLEQIRWMGILQRIA 913 L YK + CRV A++KA RA KLL+LG+FLQGGYFH +N+LTYGVD++QIRWMGILQRIA Sbjct: 125 LTYKKLSCRVDASRKAGLRALKLLVLGLFLQGGYFHRVNDLTYGVDIKQIRWMGILQRIA 184 Query: 912 IAYWFAAMCEIWLRNDEKVDSRLSLLKKYKWQWXXXXXXXXXXXXXXXXXXVPDWEYHIP 733 +AY AA+CEIWL++D+ V+S SLL+KY++QW VPDWEY I Sbjct: 185 LAYLVAALCEIWLKSDDTVNSGPSLLRKYRYQWVVALIISFVYLCLLYGLYVPDWEYQIQ 244 Query: 732 AGASSGGEVFRVKCGVRGDTGPACNAAGMIDRAVLGIQRLYRKPIYARTEQCSINSPDYG 553 SS + F VKCGVRGDTGPACNA GMIDR + GIQ LYR+PIYART +CSINSP+YG Sbjct: 245 TEPSSEPKTFSVKCGVRGDTGPACNAVGMIDRTLFGIQHLYRRPIYARTPECSINSPNYG 304 Query: 552 PLPPDAPSWCRAPFDPEGILSTVMAIVTCLLGLQYGHVIVHYKDHKKRLSLWSYPSTGFM 373 PLPP AP+WC+APFDPEG+LS+VMAIVTCL+GL YGH+IVH+KDH+ R W+ P++ + Sbjct: 305 PLPPGAPAWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKDHRVRTIYWTIPTSCLV 364 Query: 372 LLGLLCHILGMHINKALYSVSYTCVTAGVAGILLATIYLVVDVHGWRRFTALLEWMGMHA 193 + GL + GMHINK LYS SYTCVTAG AGIL IYL+VDV G+RR T LEWMGMHA Sbjct: 365 VFGLALDLFGMHINKVLYSFSYTCVTAGTAGILFVGIYLMVDVCGYRRLTIFLEWMGMHA 424 Query: 192 LLIYILVACNILPLILQGFYWKHPHNNILTLIGIG 88 L+IYIL ACN+ P+ LQGFYW P NNIL L+G+G 
Sbjct: 425 LMIYILAACNVFPIFLQGFYWGSPRNNILKLVGVG 459 >ref|XP_008222296.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Prunus mume] Length = 486 Score = 562 bits (1449), Expect = e-157 Identities = 261/392 (66%), Positives = 313/392 (79%) Frame = -3 Query: 1260 RLVSLDVFRGLTVVLMIIVDDAGGFIPSINHSPWNGLTLADFVMPFFLFMVGVSLGLVYK 1081 RLVSLDVFRG TV +MI+VDD GG +P+INHSPWNGLTLAD VMPFFLFMVGVSL L YK Sbjct: 95 RLVSLDVFRGFTVAIMILVDDVGGILPAINHSPWNGLTLADLVMPFFLFMVGVSLSLTYK 154 Query: 1080 NMVCRVAATKKAIFRAGKLLILGIFLQGGYFHSLNNLTYGVDLEQIRWMGILQRIAIAYW 901 M CR AT+K + R KLL LG+FLQGGYFH + +LT+GVD+EQ+RWMGILQRIAIAY+ Sbjct: 155 KMSCRTVATRKTVLRTLKLLALGLFLQGGYFHGIKDLTFGVDIEQMRWMGILQRIAIAYF 214 Query: 900 FAAMCEIWLRNDEKVDSRLSLLKKYKWQWXXXXXXXXXXXXXXXXXXVPDWEYHIPAGAS 721 AA+CEIWL+ D+ V+S SLL+KY++QW VPDWEY IP +S Sbjct: 215 VAALCEIWLKGDDNVNSGRSLLRKYRFQWSAALIITVLYLSLLYGLHVPDWEYQIPGVSS 274 Query: 720 SGGEVFRVKCGVRGDTGPACNAAGMIDRAVLGIQRLYRKPIYARTEQCSINSPDYGPLPP 541 S + F VKCGVRGDTGPACNA GMIDR +LG++ LYR+PIYARTEQCSINSPD GPLP Sbjct: 275 SAPKTFSVKCGVRGDTGPACNAVGMIDRKILGLRHLYRRPIYARTEQCSINSPDNGPLPA 334 Query: 540 DAPSWCRAPFDPEGILSTVMAIVTCLLGLQYGHVIVHYKDHKKRLSLWSYPSTGFMLLGL 361 DAPSWC+APFDPEG+LS++MAIVTCL+GL YGH+IVH+K H+ R+ WS S+ ++LGL Sbjct: 335 DAPSWCQAPFDPEGLLSSMMAIVTCLVGLHYGHIIVHFKSHRDRILRWSISSSSLIILGL 394 Query: 360 LCHILGMHINKALYSVSYTCVTAGVAGILLATIYLVVDVHGWRRFTALLEWMGMHALLIY 181 +LGMHINK LY+ SY C+TAG AGIL IYL+VDV G+RR T ++EWMGMHAL+I+ Sbjct: 395 ALDLLGMHINKPLYTFSYMCITAGSAGILFTAIYLMVDVCGYRRPTIVMEWMGMHALMIF 454 Query: 180 ILVACNILPLILQGFYWKHPHNNILTLIGIGK 85 +LVACN+LP+I+ GFYW P NNIL+LIGIGK Sbjct: 455 VLVACNLLPVIIHGFYWGKPQNNILSLIGIGK 486 >ref|XP_002303734.2| hypothetical protein POPTR_0003s15750g [Populus trichocarpa] gi|550343268|gb|EEE78713.2| hypothetical protein POPTR_0003s15750g [Populus trichocarpa] Length = 464 Score = 562 bits (1449), Expect = e-157 Identities = 268/393 (68%), Positives = 314/393 (79%), Gaps = 1/393 (0%) Frame = -3 Query: 1260 
RLVSLDVFRGLTVVLMIIVDDAGGFIPSINHSPWNGLTLADFVMPFFLFMVGVSLGLVYK 1081 RLVSLDVFRGLTV LMI+VDDAGG +P+INHSPWNGLTLAD VMPFFLFMVGVSLGL YK Sbjct: 72 RLVSLDVFRGLTVALMILVDDAGGVLPAINHSPWNGLTLADVVMPFFLFMVGVSLGLTYK 131 Query: 1080 NMVCRVAATKKAIFRAGKLLILGIFLQGGYFHSLNNLTYGVDLEQIRWMGILQRIAIAYW 901 + + AT+KAI RA KLL++G+FLQGG+ H LN+LT+GVD+ QIRWMGILQRIAI Y Sbjct: 132 KLPSKAVATRKAILRALKLLVIGLFLQGGFLHGLNDLTFGVDMVQIRWMGILQRIAIGYL 191 Query: 900 FAAMCEIWLRNDEKVDSRLSLLKKYKWQWXXXXXXXXXXXXXXXXXXVPDWEYHIPAGAS 721 AMCEIWL+ D V S LS+L+KY+ QW VPDWEY IP AS Sbjct: 192 IGAMCEIWLKGDNHVASGLSMLRKYQLQWGAVVVLVSLYLSLLYGLYVPDWEYEIPVAAS 251 Query: 720 SGG-EVFRVKCGVRGDTGPACNAAGMIDRAVLGIQRLYRKPIYARTEQCSINSPDYGPLP 544 S ++FRVKCGVRG TG ACNA GMIDR VLGIQ LYRKPIYART+ CSINSPDYGPLP Sbjct: 252 SSSPKIFRVKCGVRGTTGSACNAVGMIDRTVLGIQHLYRKPIYARTKACSINSPDYGPLP 311 Query: 543 PDAPSWCRAPFDPEGILSTVMAIVTCLLGLQYGHVIVHYKDHKKRLSLWSYPSTGFMLLG 364 PDAPSWC+APFDPEG+LS+VMAIVTCL+GL YGH+IVH+K+HK R+ W PST F++LG Sbjct: 312 PDAPSWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKEHKDRILHWMVPSTCFVVLG 371 Query: 363 LLCHILGMHINKALYSVSYTCVTAGVAGILLATIYLVVDVHGWRRFTALLEWMGMHALLI 184 L+ + GMH+NKALY+ SY CVTAG AGI+ IY++VDV G+RR T +LEWMGMHAL+I Sbjct: 372 LVLDLSGMHVNKALYTFSYMCVTAGAAGIVFTGIYMLVDVCGFRRPTLVLEWMGMHALMI 431 Query: 183 YILVACNILPLILQGFYWKHPHNNILTLIGIGK 85 +IL N+LP+++QGFYWK P NNIL LIGIG+ Sbjct: 432 FILATSNVLPVVMQGFYWKQPGNNILRLIGIGR 464 >ref|XP_011020932.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Populus euphratica] Length = 463 Score = 561 bits (1446), Expect = e-157 Identities = 268/393 (68%), Positives = 314/393 (79%), Gaps = 1/393 (0%) Frame = -3 Query: 1260 RLVSLDVFRGLTVVLMIIVDDAGGFIPSINHSPWNGLTLADFVMPFFLFMVGVSLGLVYK 1081 RLVSLDVFRGLTV LMI+VDDAGG +P+INHSPWNGLTLAD VMPFFLFMVGVSLGL YK Sbjct: 71 RLVSLDVFRGLTVALMILVDDAGGVLPAINHSPWNGLTLADVVMPFFLFMVGVSLGLTYK 130 Query: 1080 NMVCRVAATKKAIFRAGKLLILGIFLQGGYFHSLNNLTYGVDLEQIRWMGILQRIAIAYW 901 + + AT+KAI RA KLL++G+FLQGG+ H LN+LT+GVD+ QIRWMGILQRIAI Y Sbjct: 131 
KLPSKAVATRKAILRALKLLLIGLFLQGGFLHGLNDLTFGVDMVQIRWMGILQRIAIGYL 190 Query: 900 FAAMCEIWLRNDEKVDSRLSLLKKYKWQWXXXXXXXXXXXXXXXXXXVPDWEYHIPAGAS 721 AMCEIWL+ D V S LS+L+KY+ QW VPDWEY IP AS Sbjct: 191 IGAMCEIWLKGDNHVASGLSMLRKYQLQWGVVVVLVSLYLSLLYGLYVPDWEYQIPVAAS 250 Query: 720 SGG-EVFRVKCGVRGDTGPACNAAGMIDRAVLGIQRLYRKPIYARTEQCSINSPDYGPLP 544 S ++FRVKCGVRG TG ACNA GMIDR VLGIQ LYRKPIYART+ CSINSPDYGPLP Sbjct: 251 SSSPKIFRVKCGVRGTTGSACNAVGMIDRTVLGIQHLYRKPIYARTKACSINSPDYGPLP 310 Query: 543 PDAPSWCRAPFDPEGILSTVMAIVTCLLGLQYGHVIVHYKDHKKRLSLWSYPSTGFMLLG 364 P+APSWC+APFDPEG+LS+VMAIVTCL+GL YGH+IVH+K+HK R+ W PST F++LG Sbjct: 311 PEAPSWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKEHKDRILHWMVPSTCFVVLG 370 Query: 363 LLCHILGMHINKALYSVSYTCVTAGVAGILLATIYLVVDVHGWRRFTALLEWMGMHALLI 184 L+ + GMH+NKALY+ SY CVTAG AGI+ IY++VDV G+RR T +LEWMGMHAL+I Sbjct: 371 LVLDLSGMHVNKALYTFSYMCVTAGAAGIVFTGIYMLVDVCGFRRPTLVLEWMGMHALMI 430 Query: 183 YILVACNILPLILQGFYWKHPHNNILTLIGIGK 85 +IL N+LP++LQGFYWK P NNIL LIGIG+ Sbjct: 431 FILATSNVLPVVLQGFYWKQPGNNILRLIGIGR 463 >gb|KHN24949.1| Heparan-alpha-glucosaminide N-acetyltransferase [Glycine soja] Length = 463 Score = 561 bits (1446), Expect = e-157 Identities = 263/395 (66%), Positives = 313/395 (79%) Frame = -3 Query: 1272 PPSGRLVSLDVFRGLTVVLMIIVDDAGGFIPSINHSPWNGLTLADFVMPFFLFMVGVSLG 1093 P S RLVSLDVFRGLTV LMI+VDDAGG IP++NHSPWNGLTLAD+VMPFFLF+VGVSL Sbjct: 68 PKSPRLVSLDVFRGLTVALMILVDDAGGLIPALNHSPWNGLTLADYVMPFFLFIVGVSLA 127 Query: 1092 LVYKNMVCRVAATKKAIFRAGKLLILGIFLQGGYFHSLNNLTYGVDLEQIRWMGILQRIA 913 L YK + C V A++KA RA KLL+LG+FLQGGYFH +N+LTYGVDL+QIRWMGILQRI Sbjct: 128 LTYKKLSCGVDASRKASLRALKLLVLGLFLQGGYFHRVNDLTYGVDLKQIRWMGILQRIG 187 Query: 912 IAYWFAAMCEIWLRNDEKVDSRLSLLKKYKWQWXXXXXXXXXXXXXXXXXXVPDWEYHIP 733 +AY AA+CEIWL++D+ V+S SLL+KY++QW VPDW Y I Sbjct: 188 VAYLVAALCEIWLKSDDTVNSGPSLLRKYRYQWAVALILSFLYLCLLYGLYVPDWVYQIQ 247 Query: 732 AGASSGGEVFRVKCGVRGDTGPACNAAGMIDRAVLGIQRLYRKPIYARTEQCSINSPDYG 553 SS + F VKCGVRG+TGPACNA GMIDR +LGI LY++PIYAR +CSINSP+YG Sbjct: 248 
TEPSSEPKTFSVKCGVRGNTGPACNAVGMIDRTILGIHHLYQRPIYARMPECSINSPNYG 307 Query: 552 PLPPDAPSWCRAPFDPEGILSTVMAIVTCLLGLQYGHVIVHYKDHKKRLSLWSYPSTGFM 373 PLPPDAP+WC+APFDPEG+LS+VMAIVTCL+GL YGH+IVH+KDH+ R+ W P++ + Sbjct: 308 PLPPDAPAWCQAPFDPEGLLSSVMAIVTCLIGLHYGHIIVHFKDHRVRIIYWMIPTSCLV 367 Query: 372 LLGLLCHILGMHINKALYSVSYTCVTAGVAGILLATIYLVVDVHGWRRFTALLEWMGMHA 193 + GL + GMHINK LYS+SYTCVTAG AGIL IYL+VDV G RR T +LEWMGMHA Sbjct: 368 VFGLALDLFGMHINKVLYSLSYTCVTAGAAGILFVGIYLMVDVCGCRRMTLVLEWMGMHA 427 Query: 192 LLIYILVACNILPLILQGFYWKHPHNNILTLIGIG 88 L+IYIL ACN+ P+ LQGFYW PHNNIL LIG+G Sbjct: 428 LMIYILAACNVFPIFLQGFYWGSPHNNILKLIGVG 462 >ref|XP_003532336.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Glycine max] gi|947098319|gb|KRH46904.1| hypothetical protein GLYMA_08G363600 [Glycine max] Length = 463 Score = 561 bits (1446), Expect = e-157 Identities = 263/395 (66%), Positives = 313/395 (79%) Frame = -3 Query: 1272 PPSGRLVSLDVFRGLTVVLMIIVDDAGGFIPSINHSPWNGLTLADFVMPFFLFMVGVSLG 1093 P S RLVSLDVFRGLTV LMI+VDDAGG IP++NHSPWNGLTLAD+VMPFFLF+VGVSL Sbjct: 68 PKSPRLVSLDVFRGLTVALMILVDDAGGLIPALNHSPWNGLTLADYVMPFFLFIVGVSLA 127 Query: 1092 LVYKNMVCRVAATKKAIFRAGKLLILGIFLQGGYFHSLNNLTYGVDLEQIRWMGILQRIA 913 L YK + C V A++KA RA KLL+LG+FLQGGYFH +N+LTYGVDL+QIRWMGILQRI Sbjct: 128 LTYKKLSCGVDASRKASLRALKLLVLGLFLQGGYFHRVNDLTYGVDLKQIRWMGILQRIG 187 Query: 912 IAYWFAAMCEIWLRNDEKVDSRLSLLKKYKWQWXXXXXXXXXXXXXXXXXXVPDWEYHIP 733 +AY AA+CEIWL++D+ V+S SLL+KY++QW VPDW Y I Sbjct: 188 VAYLVAALCEIWLKSDDTVNSGPSLLRKYRYQWAVALILSFLYLCLLYGLYVPDWVYQIQ 247 Query: 732 AGASSGGEVFRVKCGVRGDTGPACNAAGMIDRAVLGIQRLYRKPIYARTEQCSINSPDYG 553 SS + F VKCGVRG+TGPACNA GMIDR +LGI LY++PIYAR +CSINSP+YG Sbjct: 248 TEPSSEPKTFSVKCGVRGNTGPACNAVGMIDRTILGIHHLYQRPIYARMPECSINSPNYG 307 Query: 552 PLPPDAPSWCRAPFDPEGILSTVMAIVTCLLGLQYGHVIVHYKDHKKRLSLWSYPSTGFM 373 PLPPDAP+WC+APFDPEG+LS+VMAIVTCL+GL YGH+IVH+KDH+ R+ W P++ + Sbjct: 308 PLPPDAPAWCQAPFDPEGLLSSVMAIVTCLIGLHYGHIIVHFKDHRVRIIYWMIPTSCLV 367 Query: 372 
LLGLLCHILGMHINKALYSVSYTCVTAGVAGILLATIYLVVDVHGWRRFTALLEWMGMHA 193 + GL + GMHINK LYS+SYTCVTAG AGIL IYL+VDV G RR T +LEWMGMHA Sbjct: 368 VFGLALDLFGMHINKVLYSLSYTCVTAGAAGILFVGIYLMVDVCGCRRMTLVLEWMGMHA 427 Query: 192 LLIYILVACNILPLILQGFYWKHPHNNILTLIGIG 88 L+IYIL ACN+ P+ LQGFYW PHNNIL LIG+G Sbjct: 428 LMIYILAACNVFPIFLQGFYWGSPHNNILKLIGVG 462