BLASTX nr result
ID: Mentha25_contig00026938
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00026938
         (1751 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EYU29888.1| hypothetical protein MIMGU_mgv1a005303mg [Mimulus...   808   0.0
ref|XP_006354406.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   769   0.0
ref|XP_007203948.1| hypothetical protein PRUPE_ppa018154mg [Prun...   765   0.0
ref|XP_002268831.2| PREDICTED: heparan-alpha-glucosaminide N-ace...   760   0.0
ref|XP_002519467.1| conserved hypothetical protein [Ricinus comm...   749   0.0
ref|XP_007144374.1| hypothetical protein PHAVU_007G150900g [Phas...   748   0.0
ref|XP_006381387.1| hypothetical protein POPTR_0006s12440g [Popu...   748   0.0
ref|XP_004246913.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   747   0.0
emb|CAN64496.1| hypothetical protein VITISV_004036 [Vitis vinifera]   747   0.0
ref|XP_006428456.1| hypothetical protein CICLE_v10011557mg [Citr...   746   0.0
ref|XP_004306260.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   743   0.0
ref|XP_004515935.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   740   0.0
ref|XP_004494901.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   739   0.0
ref|XP_007027449.1| Uncharacterized protein TCM_022283 [Theobrom...   737   0.0
ref|XP_006577710.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   735   0.0
ref|XP_004494902.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   718   0.0
ref|XP_006595362.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   717   0.0
ref|XP_006836541.1| hypothetical protein AMTR_s00131p00031270 [A...   716   0.0
ref|XP_006428455.1| hypothetical protein CICLE_v10011557mg [Citr...
712 0.0 ref|XP_003576108.1| PREDICTED: heparan-alpha-glucosaminide N-ace... 708 0.0 >gb|EYU29888.1| hypothetical protein MIMGU_mgv1a005303mg [Mimulus guttatus] Length = 490 Score = 808 bits (2088), Expect = 0.0 Identities = 400/505 (79%), Positives = 425/505 (84%), Gaps = 13/505 (2%) Frame = +1 Query: 139 MEDPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETER--KSMIASELQAEETEQ- 309 MEDP+KLEEG A+K D+ IS E KS+ SE AEE + Sbjct: 1 MEDPRKLEEGLHAAKK---------------NDDDISNENNEHNKSVTKSEPVAEEKREE 45 Query: 310 -----KTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFI 474 K K KRVATLDAFRGLTIVLM+LVDDAGGAY+RIDHSPWNGCTLADFVMPFFLFI Sbjct: 46 EPPLVKQKAKRVATLDAFRGLTIVLMVLVDDAGGAYSRIDHSPWNGCTLADFVMPFFLFI 105 Query: 475 VGVAIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCG 654 VGVAIALALKRIPKV +A+RKVILRTLKLLFWGILLQGGYSHAP DL+YG+DMKLIRWCG Sbjct: 106 VGVAIALALKRIPKVSYAVRKVILRTLKLLFWGILLQGGYSHAPYDLAYGVDMKLIRWCG 165 Query: 655 ILQRIALVYFVVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALY 834 ILQRIALVYF+VALIEI TTK+RPT+L+P HFSIF+AYKWQW+GGF AFVIYMVTTF+LY Sbjct: 166 ILQRIALVYFIVALIEIATTKLRPTSLDPGHFSIFTAYKWQWVGGFVAFVIYMVTTFSLY 225 Query: 835 VPDWSF-----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRAC 999 VPDWSF K EKF V CGMRGHLGPACNAVGYVDRQAWGINHLY+ PVW RL+AC Sbjct: 226 VPDWSFVAKDDSDKLEKFTVICGMRGHLGPACNAVGYVDRQAWGINHLYNQPVWSRLKAC 285 Query: 1000 TFSSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHW 1179 TFSSP SG RKDAP WCRAPFEPEGLLSSISAI+SGTIGIHYGHVLIHFKGHAERLK W Sbjct: 286 TFSSPDSGPFRKDAPTWCRAPFEPEGLLSSISAIISGTIGIHYGHVLIHFKGHAERLKQW 345 Query: 1180 VSMXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFM 1359 VSM HFTDAIPINKQLYSFSYVCFTAGAAGIVFS FYLLIDVWG R PF+ Sbjct: 346 VSMALVLLIVAVILHFTDAIPINKQLYSFSYVCFTAGAAGIVFSLFYLLIDVWGMRKPFL 405 Query: 1360 FLEWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLY 1539 FLEWIGMNAMLVFVMAAQGIFA FINGWY+KN DNNLVNWIQ+HVF DVWKSE+VGTLLY Sbjct: 406 FLEWIGMNAMLVFVMAAQGIFAGFINGWYFKNPDNNLVNWIQQHVFFDVWKSEKVGTLLY 465 Query: 1540 VIFAEITFWAVVSGILHKLGIYWKL 1614 
VIFAEITFWAVVSGILHKL IYWKL Sbjct: 466 VIFAEITFWAVVSGILHKLRIYWKL 490 >ref|XP_006354406.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Solanum tuberosum] Length = 494 Score = 770 bits (1987), Expect = 0.0 Identities = 379/498 (76%), Positives = 415/498 (83%), Gaps = 6/498 (1%) Frame = +1 Query: 139 MEDPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETEQKT- 315 MEDPKKLEEGFG KN D + S E E K++ + LQ EE EQ T Sbjct: 1 MEDPKKLEEGFGNQKNNISEENIDTNRIDNNQD-LASHENELKTL-SQPLQKEEEEQPTI 58 Query: 316 -KTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 492 K KRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA Sbjct: 59 KKGKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 118 Query: 493 LALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQRIA 672 LALKR+PKV AIRKV LRTLKLLFWGI+LQGGYSHAP DL+YG+DMK+IRWCGILQRIA Sbjct: 119 LALKRVPKVSAAIRKVTLRTLKLLFWGIILQGGYSHAPYDLAYGVDMKVIRWCGILQRIA 178 Query: 673 LVYFVVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYVPDWSF 852 LVY +VALIEILTTK+RPTTL P HFSIF+AYKW LGGF AFV+YM T + LYVPDW+F Sbjct: 179 LVYLIVALIEILTTKLRPTTLTPGHFSIFTAYKW--LGGFVAFVVYMTTLYGLYVPDWNF 236 Query: 853 -EHK---SEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTFSSPSS 1020 EH S+++ VKCGMRGHLGPACNAVGYVDRQ WGINHLY+ PVW R +ACT S P + Sbjct: 237 LEHDGDTSQRYTVKCGMRGHLGPACNAVGYVDRQVWGINHLYNQPVWARSKACTLSYPET 296 Query: 1021 GKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMXXXX 1200 G R DAP WCRAPFEPEGLLSSISAI+SGTIGIHYGHVLIHFKGH ERLK W+SM Sbjct: 297 GPFRDDAPTWCRAPFEPEGLLSSISAIMSGTIGIHYGHVLIHFKGHGERLKQWISMGFGL 356 Query: 1201 XXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEWIGM 1380 HF+DAIP+NKQLYSFSYVCFTAGAAGIVFS FY+LIDV G R+PF++LEWIGM Sbjct: 357 LIIAFILHFSDAIPLNKQLYSFSYVCFTAGAAGIVFSGFYILIDVLGMRIPFLWLEWIGM 416 Query: 1381 NAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFAEIT 1560 NAML+FVM AQGIFA FINGWY+KN DNNLVNWIQ HVF DVWKS+R+GTLLYVIFAEIT Sbjct: 417 NAMLIFVMGAQGIFAGFINGWYFKNEDNNLVNWIQHHVFFDVWKSQRLGTLLYVIFAEIT 476 Query: 1561 
FWAVVSGILHKLGIYWKL 1614 FWAV++GILH+LGIYWKL Sbjct: 477 FWAVLAGILHRLGIYWKL 494 >ref|XP_007203948.1| hypothetical protein PRUPE_ppa018154mg [Prunus persica] gi|462399479|gb|EMJ05147.1| hypothetical protein PRUPE_ppa018154mg [Prunus persica] Length = 508 Score = 765 bits (1975), Expect = 0.0 Identities = 372/506 (73%), Positives = 414/506 (81%), Gaps = 15/506 (2%) Frame = +1 Query: 142 EDPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETEQ---- 309 +D KKLEEG +K+ + I + E K+ A E +Q Sbjct: 3 DDAKKLEEGRLHNKDDLISERVNTTSDHNGRGDAIDHDHEEKNKAAVAAAPLEADQIKGE 62 Query: 310 -------KTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFL 468 K K+KRVATLDAFRGLTIV+MILVDDAGGAYARIDHSPWNGCTLADFVMPFFL Sbjct: 63 EQPVLVVKQKSKRVATLDAFRGLTIVVMILVDDAGGAYARIDHSPWNGCTLADFVMPFFL 122 Query: 469 FIVGVAIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRW 648 FIVGVAIALALK+IPK+ AI+K+ILRTLKL+FWGI+LQGGYSHAP DLSYG+DMK IRW Sbjct: 123 FIVGVAIALALKKIPKINDAIKKIILRTLKLMFWGIILQGGYSHAPADLSYGVDMKQIRW 182 Query: 649 CGILQRIALVYFVVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFA 828 GILQRIALVYFVVALIE LTTK RPT LEP H SIF+AYKWQW+GGF AF+IYM+TTF+ Sbjct: 183 FGILQRIALVYFVVALIETLTTKFRPTVLEPGHLSIFTAYKWQWIGGFLAFLIYMITTFS 242 Query: 829 LYVPDWSF----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRA 996 LYVPDWSF +H+S+K++VKCGMRGHLGPACNAVGYVDRQ WGINHLY+ PVW+RL+A Sbjct: 243 LYVPDWSFVVDNDHRSKKYLVKCGMRGHLGPACNAVGYVDRQVWGINHLYTQPVWRRLKA 302 Query: 997 CTFSSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKH 1176 CT SSPS G LR+ AP+WCR PFEPEGLLSSISAI+SGTIGIHYGHVLIHFKGH+ERLK Sbjct: 303 CTLSSPSDGPLREGAPSWCRGPFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHSERLKQ 362 Query: 1177 WVSMXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPF 1356 WVSM HFTDAIPINKQLYSFSYVCFTAGAAG+VFS FYLLIDVWG R PF Sbjct: 363 WVSMGFILIVIAIILHFTDAIPINKQLYSFSYVCFTAGAAGLVFSGFYLLIDVWGYRTPF 422 Query: 1357 MFLEWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLL 1536 +FLEWIGMNAMLVFVMAAQGIFAAF+NGWYYK+ DN+LV+WIQEHVF +VW SER+GTLL Sbjct: 423 
LFLEWIGMNAMLVFVMAAQGIFAAFVNGWYYKSPDNSLVHWIQEHVFINVWHSERLGTLL 482 Query: 1537 YVIFAEITFWAVVSGILHKLGIYWKL 1614 YVIF EI FW VV+GILHK IYWKL Sbjct: 483 YVIFGEILFWGVVAGILHKFRIYWKL 508 >ref|XP_002268831.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Vitis vinifera] gi|297739972|emb|CBI30154.3| unnamed protein product [Vitis vinifera] Length = 489 Score = 760 bits (1962), Expect = 0.0 Identities = 369/495 (74%), Positives = 412/495 (83%), Gaps = 5/495 (1%) Frame = +1 Query: 145 DPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETEQ-KTKT 321 D K++EEG G D + E+ + E + EE K K+ Sbjct: 4 DAKRVEEGLG---------HVHKEDISEKADKIEKDESSATPAQSVEQKGEEQPLIKQKS 54 Query: 322 KRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL 501 KRVATLDAFRGLTIVLMILVDDAGG+YARIDHSPWNGCTLADFVMPFFLFIVGVA+ALAL Sbjct: 55 KRVATLDAFRGLTIVLMILVDDAGGSYARIDHSPWNGCTLADFVMPFFLFIVGVAVALAL 114 Query: 502 KRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQRIALVY 681 K+IP++ A++K+ LRTLKLLFWGILLQGGYSHAPDDLSYG+DMK IRW GILQRIA+VY Sbjct: 115 KKIPRISLAVKKISLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHIRWFGILQRIAVVY 174 Query: 682 FVVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYVPDWSF--- 852 FVVALIE LTTK RPT ++ HFSI SAYKWQW+GGF AF+IYM+TT+ALYVPDWSF Sbjct: 175 FVVALIETLTTKRRPTVIDSGHFSILSAYKWQWIGGFVAFLIYMITTYALYVPDWSFVID 234 Query: 853 -EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTFSSPSSGKL 1029 +H+++++ VKCGMRGHLGPACNAVGYVDRQ WGINHLYS PVW RL+ACT SSP+SG Sbjct: 235 QDHEAKRYTVKCGMRGHLGPACNAVGYVDRQVWGINHLYSQPVWTRLKACTLSSPNSGPF 294 Query: 1030 RKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMXXXXXXX 1209 R+DAP+WC APFEPEGLLS+ISAI+SGTIGIHYGHVLIHFKGHAERLK WVSM Sbjct: 295 REDAPSWCYAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHAERLKQWVSMGIVLLIV 354 Query: 1210 XXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEWIGMNAM 1389 HFTDAIPINKQLYSFSYVCFTAGAAGIV SAFYL+IDVWG R PF+FLEWIGMNAM Sbjct: 355 AIILHFTDAIPINKQLYSFSYVCFTAGAAGIVLSAFYLVIDVWGFRTPFLFLEWIGMNAM 414 Query: 1390 
LVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFAEITFWA 1569 LVFVMAAQGIFAAFINGWY+++SDN+LV+WIQ HVF DVW SER+GTLLYVIFAEITFWA Sbjct: 415 LVFVMAAQGIFAAFINGWYFESSDNSLVHWIQRHVFIDVWHSERLGTLLYVIFAEITFWA 474 Query: 1570 VVSGILHKLGIYWKL 1614 VVSGILHKL IYWKL Sbjct: 475 VVSGILHKLHIYWKL 489 >ref|XP_002519467.1| conserved hypothetical protein [Ricinus communis] gi|223541330|gb|EEF42881.1| conserved hypothetical protein [Ricinus communis] Length = 519 Score = 749 bits (1934), Expect = 0.0 Identities = 364/506 (71%), Positives = 414/506 (81%), Gaps = 14/506 (2%) Frame = +1 Query: 139 MEDPKKLEEGFG----ASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETE 306 MEDP+KLEEG A++N G VI + S + E + E+ + Sbjct: 14 MEDPRKLEEGLAHAKVANENQQEQHLSEKLDKTHDGGGVIPEKELTSSTVLVEQEGEQLQ 73 Query: 307 Q------KTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFL 468 Q K KTKRVATLDAFRGLT+VLMILVD+AG +YARIDHSPWNGCTLADFVMPFFL Sbjct: 74 QPEQLPVKQKTKRVATLDAFRGLTVVLMILVDNAGESYARIDHSPWNGCTLADFVMPFFL 133 Query: 469 FIVGVAIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRW 648 FIVGVAIALALKRIP+ A++K+ LRTLKLLFWGILLQGGYSHAP DLSYG+DMKLIRW Sbjct: 134 FIVGVAIALALKRIPRKRDAVKKISLRTLKLLFWGILLQGGYSHAPVDLSYGVDMKLIRW 193 Query: 649 CGILQRIALVYFVVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFA 828 CGILQRIALVY VALIE LT K R T L+P+HFSIF+AY+WQW+GGF AF+IYM+TT+A Sbjct: 194 CGILQRIALVYMFVALIETLTIKERQTVLQPNHFSIFTAYRWQWIGGFIAFLIYMITTYA 253 Query: 829 LYVPDWSF----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRA 996 LYVPDWSF +++ ++ VKCGMRGHLGPACNAVGYVDR+ WGINHLY PVW RL+A Sbjct: 254 LYVPDWSFTAYDDNRPTRYTVKCGMRGHLGPACNAVGYVDREVWGINHLYQYPVWSRLKA 313 Query: 997 CTFSSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKH 1176 CTFSSP++G LR DAP+WC APFEPEGLLS+ISAI+SGTIGIHYGHVLIHFKGH+ERLK Sbjct: 314 CTFSSPATGPLRADAPSWCLAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSERLKQ 373 Query: 1177 WVSMXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPF 1356 WVSM HFTDAIPINKQLYSFSYVCFTAGAAGIVFS FY+LIDV G R+PF Sbjct: 374 
WVSMGLGLFLIAIILHFTDAIPINKQLYSFSYVCFTAGAAGIVFSGFYILIDVLGLRIPF 433 Query: 1357 MFLEWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLL 1536 +FLEWIGMNAMLV+VMAAQGIF FINGW+YK+++N LV WIQEHVF+ VW SE++G LL Sbjct: 434 LFLEWIGMNAMLVYVMAAQGIFEGFINGWFYKSNNNTLVYWIQEHVFDKVWNSEKLGNLL 493 Query: 1537 YVIFAEITFWAVVSGILHKLGIYWKL 1614 YVIFA+ITFWAVVSGILH+LGIYWKL Sbjct: 494 YVIFAQITFWAVVSGILHRLGIYWKL 519 >ref|XP_007144374.1| hypothetical protein PHAVU_007G150900g [Phaseolus vulgaris] gi|561017564|gb|ESW16368.1| hypothetical protein PHAVU_007G150900g [Phaseolus vulgaris] Length = 500 Score = 748 bits (1932), Expect = 0.0 Identities = 360/501 (71%), Positives = 406/501 (81%), Gaps = 9/501 (1%) Frame = +1 Query: 139 MEDPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETEQ--- 309 M++ K++EEG ++ GD++ R + E + EQ Sbjct: 1 MDEAKRMEEGLSSTPQNGELKQEIEKTNGD-GDSIEHDRDTRSTTQEGESTRQIVEQEQP 59 Query: 310 --KTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGV 483 K KTKRVATLDAFRGLTIVLM+LVDDAGGAY +IDHSPWNGCTLADFVMPFFLFIVGV Sbjct: 60 LVKQKTKRVATLDAFRGLTIVLMVLVDDAGGAYPQIDHSPWNGCTLADFVMPFFLFIVGV 119 Query: 484 AIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQ 663 AIALALKRIP+V A++K+ILRTLKLLFWG+LLQGGYSHAPDDLSYG+DM+ IRWCGILQ Sbjct: 120 AIALALKRIPRVKDAVKKIILRTLKLLFWGVLLQGGYSHAPDDLSYGVDMRFIRWCGILQ 179 Query: 664 RIALVYFVVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYVPD 843 RIALVY VVALIE T K+RP TL+P H SIFSAY+WQW GGF AFVIYMVTTF+LYVPD Sbjct: 180 RIALVYCVVALIETYTNKLRPYTLKPGHLSIFSAYRWQWFGGFVAFVIYMVTTFSLYVPD 239 Query: 844 WSF----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTFSS 1011 WSF ++ +++ V+CG+RGHLGPACNAVGYVDRQ WG+NHLYS PVW R ACT SS Sbjct: 240 WSFVDYNSYEPKRYTVQCGIRGHLGPACNAVGYVDRQVWGVNHLYSQPVWTRSSACTLSS 299 Query: 1012 PSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMX 1191 P+ G RK+AP+WCRAPFEPEGLLSSISAI+SG IGIHYGHVLIHFKGH+ERLK W+S+ Sbjct: 300 PAEGHFRKNAPSWCRAPFEPEGLLSSISAILSGIIGIHYGHVLIHFKGHSERLKQWLSLG 359 Query: 1192 
XXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEW 1371 HFTDAIPINKQLYSFSYVCFTAGAAGIVFS FYLLID+W R PF+ LEW Sbjct: 360 FFLLIIGIILHFTDAIPINKQLYSFSYVCFTAGAAGIVFSVFYLLIDIWDLRTPFLLLEW 419 Query: 1372 IGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFA 1551 IGMNAMLVFVMAAQGIFAAF+NGWYYK+ DN+LVNWIQ HVF +VW SER+GTLLYVIFA Sbjct: 420 IGMNAMLVFVMAAQGIFAAFVNGWYYKDPDNSLVNWIQNHVFINVWHSERLGTLLYVIFA 479 Query: 1552 EITFWAVVSGILHKLGIYWKL 1614 EITFW VV+GI HKLGIYWKL Sbjct: 480 EITFWGVVAGIFHKLGIYWKL 500 >ref|XP_006381387.1| hypothetical protein POPTR_0006s12440g [Populus trichocarpa] gi|550336090|gb|ERP59184.1| hypothetical protein POPTR_0006s12440g [Populus trichocarpa] Length = 502 Score = 748 bits (1931), Expect = 0.0 Identities = 368/503 (73%), Positives = 406/503 (80%), Gaps = 11/503 (2%) Frame = +1 Query: 139 MEDPKKLEEGFGASKNXXXXXXXXXXXXXXXG--DNVISTETERKSMIASELQAEETEQ- 309 MEDPK++EEG G + G D E E + ++ E ++ Sbjct: 1 MEDPKRMEEGLGHTALVANIDDENIHLSEKEGKTDGGDDNEKEERRVVHDHQAEREGDRQ 60 Query: 310 ---KTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVG 480 K K+KRVATLDAFRGLTIVLMILVDDAGG Y RIDHSPWNGCTLADFVMPFFLFIVG Sbjct: 61 PVVKQKSKRVATLDAFRGLTIVLMILVDDAGGVYPRIDHSPWNGCTLADFVMPFFLFIVG 120 Query: 481 VAIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGIL 660 VAIALA KRIPK A++K+ILRTLKLLFWG+LLQGGYSHAP DL+YG+DMKLIRW GIL Sbjct: 121 VAIALAFKRIPKRRDAVKKIILRTLKLLFWGVLLQGGYSHAPSDLAYGVDMKLIRWFGIL 180 Query: 661 Q-RIALVYFVVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYV 837 Q RIALVY VVALIE L K R T +EP HF+IF+AY+WQW+ GF +FVIYMVTTFALYV Sbjct: 181 QQRIALVYMVVALIEALIPKNRQT-IEPDHFTIFTAYRWQWIAGFISFVIYMVTTFALYV 239 Query: 838 PDWSF----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTF 1005 PDWSF +H+ ++ V+CGMRGHLGPACNAVGYVDR+ WGINHLY PVW RL+ACT Sbjct: 240 PDWSFTVDEDHERRRYTVECGMRGHLGPACNAVGYVDREVWGINHLYQYPVWSRLKACTL 299 Query: 1006 SSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVS 1185 SSP SG RKDAP+WCRAPFEPEGLLSSISAI+SGTIGIHYGHVLIHFKGHAERL+ WVS Sbjct: 300 
SSPGSGPFRKDAPSWCRAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHAERLRQWVS 359 Query: 1186 MXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFL 1365 M HFTDAIPINKQLYSFSYVCFTAGAAGIVFS FY+LIDVWG R PF+FL Sbjct: 360 MGVILLIVAIILHFTDAIPINKQLYSFSYVCFTAGAAGIVFSGFYVLIDVWGLRPPFLFL 419 Query: 1366 EWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVI 1545 EWIGMNAMLV+VMAAQGIF FINGWYYK+ DN LV WIQ+HVFNDVW SERVGTLLYVI Sbjct: 420 EWIGMNAMLVYVMAAQGIFEGFINGWYYKSPDNTLVYWIQDHVFNDVWHSERVGTLLYVI 479 Query: 1546 FAEITFWAVVSGILHKLGIYWKL 1614 FA+ITFWAVVSG+LHKLGIYWKL Sbjct: 480 FAQITFWAVVSGVLHKLGIYWKL 502 >ref|XP_004246913.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Solanum lycopersicum] Length = 494 Score = 747 bits (1929), Expect = 0.0 Identities = 365/498 (73%), Positives = 408/498 (81%), Gaps = 6/498 (1%) Frame = +1 Query: 139 MEDPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETEQKT- 315 MEDPKKLEEGF KN D ++S E ++K +++ LQ +E E+ Sbjct: 1 MEDPKKLEEGFSNQKNNISEENIDTNRIDNNQD-LVSHENDQK-ILSQPLQKKEEEEPII 58 Query: 316 -KTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 492 K KRVATLDAFRGLTIVLMILVDDAGGAYA IDHSPWNGCTLADFVMPFFLFIVGVAIA Sbjct: 59 KKGKRVATLDAFRGLTIVLMILVDDAGGAYACIDHSPWNGCTLADFVMPFFLFIVGVAIA 118 Query: 493 LALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQRIA 672 LALKR+PKV AI+KV LRTLKLLFWGI+LQGGYSHAP DL+YG+DMK+IRWCGILQRIA Sbjct: 119 LALKRVPKVSAAIKKVTLRTLKLLFWGIILQGGYSHAPYDLAYGVDMKVIRWCGILQRIA 178 Query: 673 LVYFVVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYVPDWSF 852 LVY +VALIEILTTK+RPTTL P HFSIF+AYKW LGGF AFV+Y T + LYVPDW+F Sbjct: 179 LVYLIVALIEILTTKLRPTTLTPGHFSIFTAYKW--LGGFVAFVVYTTTIYGLYVPDWNF 236 Query: 853 ----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTFSSPSS 1020 S+++ VKCGMRGHLGPACNAVGYVDRQ WGINHLY+ PVW R + CT S P + Sbjct: 237 LVHDGDTSQRYTVKCGMRGHLGPACNAVGYVDRQVWGINHLYNQPVWARSKVCTLSYPET 296 Query: 1021 GKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMXXXX 1200 G R DAP+WCRAPFEPEGLLSSISAI+SGTIGIHYGHVLIHFKGH ERLK W+SM 
Sbjct: 297 GPFRDDAPSWCRAPFEPEGLLSSISAIMSGTIGIHYGHVLIHFKGHGERLKQWISMGLGL 356 Query: 1201 XXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEWIGM 1380 HF+DAIP+NKQLYSFSYVCFTAG AGIVFS Y+LIDV R+PF++LEWIGM Sbjct: 357 LITAFILHFSDAIPLNKQLYSFSYVCFTAGNAGIVFSGLYILIDVLAMRIPFLWLEWIGM 416 Query: 1381 NAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFAEIT 1560 NAML+FVM AQGIFA FINGWY+KN DNNLVNWIQ HVF DVWKS+R+GTLLYVIFAEIT Sbjct: 417 NAMLIFVMGAQGIFAGFINGWYFKNEDNNLVNWIQHHVFFDVWKSQRLGTLLYVIFAEIT 476 Query: 1561 FWAVVSGILHKLGIYWKL 1614 FWAV++GILH+LGIYWKL Sbjct: 477 FWAVLAGILHRLGIYWKL 494 >emb|CAN64496.1| hypothetical protein VITISV_004036 [Vitis vinifera] Length = 511 Score = 747 bits (1928), Expect = 0.0 Identities = 369/517 (71%), Positives = 412/517 (79%), Gaps = 27/517 (5%) Frame = +1 Query: 145 DPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETEQ-KTKT 321 D K++EEG G D + E+ + E + EE K K+ Sbjct: 4 DAKRVEEGLG---------HVHKEDISEKADKIEKDESSATPAQSVEQKGEEQPLIKQKS 54 Query: 322 KRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL 501 KRVATLDAFRGLTIVLMILVDDAGG+YARIDHSPWNGCTLADFVMPFFLFIVGVA+ALAL Sbjct: 55 KRVATLDAFRGLTIVLMILVDDAGGSYARIDHSPWNGCTLADFVMPFFLFIVGVAVALAL 114 Query: 502 KRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQ------ 663 K+IP++ A++K+ LRTLKLLFWGILLQGGYSHAPDDLSYG+DMK IRW GILQ Sbjct: 115 KKIPRISLAVKKISLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHIRWFGILQVFPLPL 174 Query: 664 ----------------RIALVYFVVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFA 795 RIA+VYFVVALIE LTTK RPT ++ HFSI SAYKWQW+GGF Sbjct: 175 FTGKSIPSSSLSGFLQRIAVVYFVVALIETLTTKRRPTVIDSGHFSILSAYKWQWIGGFV 234 Query: 796 AFVIYMVTTFALYVPDWSF----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHL 963 AF+IYM+TT+ALYVPDWSF +H+++++ VKCGMRGHLGPACNAVGYVDRQ WGINHL Sbjct: 235 AFLIYMITTYALYVPDWSFVIDQDHEAKRYTVKCGMRGHLGPACNAVGYVDRQVWGINHL 294 Query: 964 YSDPVWKRLRACTFSSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLI 1143 YS PVW RL+ACT SSP+SG R+DAP+WC APFEPEGLLS+ISAI+SGTIGIHYGHVLI Sbjct: 295 
YSQPVWTRLKACTLSSPNSGPFREDAPSWCYAPFEPEGLLSTISAILSGTIGIHYGHVLI 354 Query: 1144 HFKGHAERLKHWVSMXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYL 1323 HFKGHAERLK WVSM HFTDAIPINKQLYSFSYVCFTAGAAGIV SAFYL Sbjct: 355 HFKGHAERLKQWVSMGIVLLIVAIILHFTDAIPINKQLYSFSYVCFTAGAAGIVXSAFYL 414 Query: 1324 LIDVWGKRMPFMFLEWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFND 1503 +IDVWG R PF+FLEWIGMNAMLVFVMAAQGIFAAFINGWY+++SDN+LV+WIQ HVF D Sbjct: 415 VIDVWGFRTPFLFLEWIGMNAMLVFVMAAQGIFAAFINGWYFESSDNSLVHWIQRHVFID 474 Query: 1504 VWKSERVGTLLYVIFAEITFWAVVSGILHKLGIYWKL 1614 VW SER+GTLLYVIFAEITFWAVVSGILHKL IYWKL Sbjct: 475 VWHSERLGTLLYVIFAEITFWAVVSGILHKLHIYWKL 511 >ref|XP_006428456.1| hypothetical protein CICLE_v10011557mg [Citrus clementina] gi|568877271|ref|XP_006491663.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like isoform X1 [Citrus sinensis] gi|557530513|gb|ESR41696.1| hypothetical protein CICLE_v10011557mg [Citrus clementina] Length = 495 Score = 746 bits (1926), Expect = 0.0 Identities = 352/448 (78%), Positives = 394/448 (87%), Gaps = 4/448 (0%) Frame = +1 Query: 283 ELQAEETEQKTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPF 462 ELQ ++ Q+ K+KRVATLDAFRGLT+VLMILVDDAGGAYARIDHSPWNGCTLADFVMPF Sbjct: 49 ELQLQQLLQQ-KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPF 107 Query: 463 FLFIVGVAIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLI 642 FLFIVGVAIALALK++PK+ A++K+I RTLKLLFWGI+LQGGYSHAPD LSYG+DMK I Sbjct: 108 FLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHI 167 Query: 643 RWCGILQRIALVYFVVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTT 822 RWCGILQRIALVY VVALIE LTTK RP LEP H SIF+AY+WQW+GGF AFVIY++TT Sbjct: 168 RWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITT 227 Query: 823 FALYVPDWSF----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRL 990 ++LYVP+WSF +H +K+IVKCGMRGHLGPACNAVGYVDR+ WGINHLYSDPVW RL Sbjct: 228 YSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRL 287 Query: 991 RACTFSSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERL 1170 ACT SSP+SG 
LR+DAP+WCRAPFEPEGLLS+ISAI+SGTIGIHYGHVLIHFKGH+ RL Sbjct: 288 EACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARL 347 Query: 1171 KHWVSMXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRM 1350 KHWVSM HFT+AIPINKQLYSFSYVCFTAGAAGIVFSA Y+L+DVW R Sbjct: 348 KHWVSMGFGLLIIATILHFTNAIPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWELRT 407 Query: 1351 PFMFLEWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGT 1530 PF+FL+WIGMNAMLVFV+ AQGI A F+NGWYYKN DN LVNWIQ H+F VW SER+GT Sbjct: 408 PFLFLKWIGMNAMLVFVLGAQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGT 467 Query: 1531 LLYVIFAEITFWAVVSGILHKLGIYWKL 1614 LLYVIFAEITFW VV+GILH+LGIYWKL Sbjct: 468 LLYVIFAEITFWGVVAGILHRLGIYWKL 495 >ref|XP_004306260.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Fragaria vesca subsp. vesca] Length = 509 Score = 743 bits (1918), Expect = 0.0 Identities = 363/507 (71%), Positives = 405/507 (79%), Gaps = 17/507 (3%) Frame = +1 Query: 145 DPKKLEEGFGAS------------KNXXXXXXXXXXXXXXXGDNVISTETERKSMIASEL 288 D K+LEEGFG + K +STE + Sbjct: 5 DAKRLEEGFGHNPVPEVYEEDEHKKEVLTSNSTGDAIDRVDEKKAVSTEPREVEQVKGAE 64 Query: 289 QAEETEQKTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFL 468 + + + K K+KRVATLDAFRGLTIV+MILVDDAGGAYARIDHSPWNGCTLADFVMPFFL Sbjct: 65 EEQPLQVKQKSKRVATLDAFRGLTIVVMILVDDAGGAYARIDHSPWNGCTLADFVMPFFL 124 Query: 469 FIVGVAIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRW 648 FIVGVAIALALKRIPK A++K+ILRT+KLLFWGI+LQGGYS APD L+YG+DMK IRW Sbjct: 125 FIVGVAIALALKRIPKTSDAVKKIILRTIKLLFWGIILQGGYSQAPDTLAYGVDMKKIRW 184 Query: 649 CGILQRIALVYFVVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFA 828 GILQRIALVY VVALIE TTK+RPT L+ SIF+AYKW +GGF AF++YM+TTF+ Sbjct: 185 FGILQRIALVYCVVALIETFTTKLRPTVLKSGPVSIFTAYKW--IGGFVAFLVYMITTFS 242 Query: 829 LYVPDWSF-----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLR 993 LY+PDWSF + ++K++VKCGMRGHLGPACNAVGYVDRQ WGINHLY+ PVW RL+ Sbjct: 243 LYIPDWSFVKHYDDGSTKKYLVKCGMRGHLGPACNAVGYVDRQVWGINHLYNQPVWIRLK 302 Query: 994 ACTFSSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLK 1173 
ACT SSPS+G LRK AP+WCRAPFEPEGLLSSISAI+SGTIGIHYGH+LIHFKGHAERLK Sbjct: 303 ACTLSSPSTGPLRKGAPSWCRAPFEPEGLLSSISAILSGTIGIHYGHILIHFKGHAERLK 362 Query: 1174 HWVSMXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMP 1353 WVSM HFTDAIPINKQLYSFSYVCFTAGAAG+VFS FYLLIDVWG R P Sbjct: 363 QWVSMGLVLMIIAIILHFTDAIPINKQLYSFSYVCFTAGAAGLVFSGFYLLIDVWGYRTP 422 Query: 1354 FMFLEWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTL 1533 F+FLEWIGMNAMLVFVMAAQGIFA F+NGWYY+ D LV WIQEHVFN+VW SER+GTL Sbjct: 423 FLFLEWIGMNAMLVFVMAAQGIFAGFVNGWYYETQDKTLVYWIQEHVFNNVWHSERLGTL 482 Query: 1534 LYVIFAEITFWAVVSGILHKLGIYWKL 1614 LYVIFAEITFWAVVSGILHKLGIYWKL Sbjct: 483 LYVIFAEITFWAVVSGILHKLGIYWKL 509 >ref|XP_004515935.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Cicer arietinum] Length = 480 Score = 740 bits (1911), Expect = 0.0 Identities = 363/496 (73%), Positives = 404/496 (81%), Gaps = 4/496 (0%) Frame = +1 Query: 139 MEDPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETEQKTK 318 M++ K++EEG + N G + I + S Q +E + K K Sbjct: 1 MDEAKRMEEGINSPHNG--------------GGDSIEHDNNDTMKGESVHQPKEPDGKHK 46 Query: 319 TKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALA 498 TKRVATLDAFRGLTIVLMILVDDAG AY RIDHSPWNGCTLADFVMPFFLFIVGVAIALA Sbjct: 47 TKRVATLDAFRGLTIVLMILVDDAGEAYPRIDHSPWNGCTLADFVMPFFLFIVGVAIALA 106 Query: 499 LKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQRIALV 678 LKRIPKV A++K+ILRTLKLLFWG+LLQGGYSHAPDDLSYGIDMK IRWCGILQRIALV Sbjct: 107 LKRIPKVKVAVKKIILRTLKLLFWGLLLQGGYSHAPDDLSYGIDMKFIRWCGILQRIALV 166 Query: 679 YFVVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYVPDWSF-- 852 Y VVALIE T K+RPTTL P + SIF++Y+W GGF AFVIYMVTTF+LYVPDWSF Sbjct: 167 YCVVALIETFTIKLRPTTLSPGYLSIFTSYRW--FGGFVAFVIYMVTTFSLYVPDWSFVD 224 Query: 853 --EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTFSSPSSGK 1026 + +++ V CGMRGHLGPACNAVGYVDRQ W +NHLYS PVW RL+ACTFSSP+ G Sbjct: 225 YNSSELKRYTVICGMRGHLGPACNAVGYVDRQIWRVNHLYSQPVWNRLKACTFSSPAEGH 284 Query: 1027 
LRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMXXXXXX 1206 LRKDAP WCRAPFEPEGLLS+ISAI+SGTIGIHYGHVLIHFKGH+ERLK W+SM Sbjct: 285 LRKDAPNWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSERLKQWLSMGFVLFI 344 Query: 1207 XXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEWIGMNA 1386 HFTDAIPINKQLYS SYVCFTAGAAGIVFS FY+LIDVWG R PF+FLEWIGMNA Sbjct: 345 LGIILHFTDAIPINKQLYSISYVCFTAGAAGIVFSIFYILIDVWGLRTPFLFLEWIGMNA 404 Query: 1387 MLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFAEITFW 1566 MLVFVMAAQG+FAAF+NGWYYK+ +N+LV+WIQ HVF +VW SER+GTLLYVIFAEITFW Sbjct: 405 MLVFVMAAQGLFAAFVNGWYYKDPNNSLVHWIQNHVFINVWHSERLGTLLYVIFAEITFW 464 Query: 1567 AVVSGILHKLGIYWKL 1614 VV+GILHKL IYWKL Sbjct: 465 GVVAGILHKLQIYWKL 480 >ref|XP_004494901.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Cicer arietinum] Length = 484 Score = 739 bits (1908), Expect = 0.0 Identities = 356/497 (71%), Positives = 406/497 (81%), Gaps = 5/497 (1%) Frame = +1 Query: 139 MEDPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETEQKTK 318 M++ K++EEG + + D + + E+ S + K K Sbjct: 1 MDEAKRMEEGHNLALDDDAKDDLKKQQTNIEHDRDVKLDHEQPSQV-----------KQK 49 Query: 319 TKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALA 498 TKRVATLDAFRGLTIVLMILVDDAGG Y RIDHSPWNGCTLADFVMPFFLFIVGVAIALA Sbjct: 50 TKRVATLDAFRGLTIVLMILVDDAGGVYPRIDHSPWNGCTLADFVMPFFLFIVGVAIALA 109 Query: 499 LKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQRIALV 678 LKRIPK+ +A++K++LRTLKLLFWGILLQGGYSHAPD+L YG++MK IRWCGILQRIALV Sbjct: 110 LKRIPKIKYAMKKIMLRTLKLLFWGILLQGGYSHAPDELIYGVNMKFIRWCGILQRIALV 169 Query: 679 YFVVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYVPDWSFE- 855 Y +VALIE TTK+RPTTL P +IF+AYKW GGF AF+IYM+TTF LYVPDWSF Sbjct: 170 YCIVALIETFTTKLRPTTLSPQRLAIFTAYKW--FGGFMAFLIYMITTFTLYVPDWSFVD 227 Query: 856 ----HKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTFSSPSSG 1023 H S+++ V CGMRGHLGPACNAVG+VDRQ WGINHLYS PVW+RL+ACTF SP G Sbjct: 228 QVKGHGSKRYTVICGMRGHLGPACNAVGHVDRQVWGINHLYSQPVWRRLKACTFDSPGEG 287 Query: 1024 
KLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMXXXXX 1203 KLR+DAP+WC APFEPEGLLSSISAI+SGTIGIHYGHVLIHFKGH+ERLK WVSM Sbjct: 288 KLREDAPSWCLAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHSERLKQWVSMGFVLL 347 Query: 1204 XXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEWIGMN 1383 HFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFY+LIDVWG R PF+FLEWIGMN Sbjct: 348 IMAIILHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYILIDVWGLRTPFLFLEWIGMN 407 Query: 1384 AMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFAEITF 1563 AMLVFVMAA+GIFAAF+NGWYY++ +N+LV+WI++HVF +VW SERVGTLLYVIFAEITF Sbjct: 408 AMLVFVMAAEGIFAAFVNGWYYEDPNNSLVHWIKKHVFVNVWNSERVGTLLYVIFAEITF 467 Query: 1564 WAVVSGILHKLGIYWKL 1614 W +V+G+LHKL IYWKL Sbjct: 468 WGMVAGLLHKLKIYWKL 484 >ref|XP_007027449.1| Uncharacterized protein TCM_022283 [Theobroma cacao] gi|508716054|gb|EOY07951.1| Uncharacterized protein TCM_022283 [Theobroma cacao] Length = 504 Score = 737 bits (1903), Expect = 0.0 Identities = 359/504 (71%), Positives = 408/504 (80%), Gaps = 12/504 (2%) Frame = +1 Query: 139 MEDPKKLEEGF----GASKNXXXXXXXXXXXXXXXG--DNVISTETERKSMIASELQAEE 300 M DP K+EEG GA G D + + + ER + E Q E Sbjct: 1 MADPGKMEEGLAHEEGAPDQERKDKRDGKVEKEEDGAADRIGNDKEERVATTHVEQQNLE 60 Query: 301 TEQ--KTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFI 474 + K KTKR+ATLDAFRGLT+VLMILVDDAGGAY RIDHSPWNGCTLADFVMPFFLFI Sbjct: 61 EQPLVKQKTKRIATLDAFRGLTVVLMILVDDAGGAYPRIDHSPWNGCTLADFVMPFFLFI 120 Query: 475 VGVAIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCG 654 VGVAIALALK++PK+ AI+K+ LRTLKLLFWG+LLQGGYSHAP DL+YG+DMK IRWCG Sbjct: 121 VGVAIALALKKVPKIKDAIKKISLRTLKLLFWGVLLQGGYSHAPADLAYGVDMKQIRWCG 180 Query: 655 ILQRIALVYFVVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALY 834 ILQRIALVYF+VALIE LT K RPT LEP H SIF+AY+WQW+GGF AFVIYM+TT++LY Sbjct: 181 ILQRIALVYFIVALIETLTRKRRPTVLEPGHLSIFTAYRWQWIGGFVAFVIYMITTYSLY 240 Query: 835 VPDWSF----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACT 1002 VP WSF + ++ ++ VKCGMRGHLGPACNAVGYVDR+ WGINHLYS PVW+RL+ACT Sbjct: 241 
VPHWSFVVDNDDEATRYTVKCGMRGHLGPACNAVGYVDREVWGINHLYSSPVWQRLKACT 300 Query: 1003 FSSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWV 1182 SSP SG R++AP+WCRAPFEPEGLLSSI AI+SGT+GIHYGHVLIHFKGH ERLK WV Sbjct: 301 LSSPGSGPFRENAPSWCRAPFEPEGLLSSILAILSGTMGIHYGHVLIHFKGHFERLKQWV 360 Query: 1183 SMXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMF 1362 SM HFTDAIPINKQLYSFSYVCFTA AAGIVFSAFY+LIDVWG R PF+F Sbjct: 361 SMALGLLIVAIILHFTDAIPINKQLYSFSYVCFTAAAAGIVFSAFYVLIDVWGFRTPFLF 420 Query: 1363 LEWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYV 1542 LEWIGMNAMLV+V+ AQGI AAF+NGWYY++S+N LV WIQ+HVF +VW SER+GTLLYV Sbjct: 421 LEWIGMNAMLVYVLGAQGILAAFVNGWYYESSNNTLVYWIQKHVFINVWHSERLGTLLYV 480 Query: 1543 IFAEITFWAVVSGILHKLGIYWKL 1614 IFAEI F+ V+SGILHKLGIYWKL Sbjct: 481 IFAEIAFYGVLSGILHKLGIYWKL 504 >ref|XP_006577710.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Glycine max] Length = 506 Score = 735 bits (1898), Expect = 0.0 Identities = 361/506 (71%), Positives = 409/506 (80%), Gaps = 15/506 (2%) Frame = +1 Query: 142 EDPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASEL-------QAEE 300 EDPK++EEG ++ N N S K +A + Q E Sbjct: 3 EDPKRMEEGLNSALNGDGNKDDLKKRATIKTSNGGSIFEHDKDTMAKPVAEGESVQQIAE 62 Query: 301 TEQ---KTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLF 471 EQ K KTKRVATLDAFRGLTIVLMILVDDAG AY RIDHSPWNGCTLADFVMPFFLF Sbjct: 63 QEQPPVKQKTKRVATLDAFRGLTIVLMILVDDAGEAYPRIDHSPWNGCTLADFVMPFFLF 122 Query: 472 IVGVAIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWC 651 IVGVAIALALKRI K+ +++K+ILRTLKLLFWGI+LQGGYSHAPDDL YG++MK IRWC Sbjct: 123 IVGVAIALALKRISKIKHSVKKIILRTLKLLFWGIILQGGYSHAPDDLEYGVNMKFIRWC 182 Query: 652 GILQRIALVYFVVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFAL 831 GILQRIALVY VVALIE TTK+RPTTL H SIF+AYKW GGF AF+IYM+TTF+L Sbjct: 183 GILQRIALVYCVVALIETFTTKLRPTTLASGHLSIFAAYKW--FGGFVAFLIYMITTFSL 240 Query: 832 YVPDWSF-EH----KSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRA 996 YVPDWSF +H + +++ V CGMRGHLGPACNAVG+VDRQ WG+NHLYS PVW+RL+A Sbjct: 241 
YVPDWSFVDHFNGDEPKRYTVICGMRGHLGPACNAVGHVDRQVWGVNHLYSQPVWRRLKA 300 Query: 997 CTFSSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKH 1176 CTFSSP SG R DAP+WC APFEPEGLLSSISAI+SGTIGIHYGHVLIHFKGH+ERLK Sbjct: 301 CTFSSPGSGPFRDDAPSWCLAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHSERLKQ 360 Query: 1177 WVSMXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPF 1356 WVSM HFTDA+PINKQLYSFSYVCFTAGAAGIVFS FY+LIDVWG R PF Sbjct: 361 WVSMGFVLLIIAIILHFTDALPINKQLYSFSYVCFTAGAAGIVFSGFYILIDVWGLRTPF 420 Query: 1357 MFLEWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLL 1536 +FLEWIGMNAMLVFVMAA+GIFAAF+NGWYY++ ++LV+WI++HVF +VW SERVGT+L Sbjct: 421 LFLEWIGMNAMLVFVMAAEGIFAAFVNGWYYEDPRSSLVHWIKKHVFVNVWHSERVGTIL 480 Query: 1537 YVIFAEITFWAVVSGILHKLGIYWKL 1614 YVIFAEITFW+VV+G+LHKLGIYWKL Sbjct: 481 YVIFAEITFWSVVAGVLHKLGIYWKL 506 >ref|XP_004494902.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Cicer arietinum] Length = 478 Score = 718 bits (1853), Expect = 0.0 Identities = 346/497 (69%), Positives = 405/497 (81%), Gaps = 5/497 (1%) Frame = +1 Query: 139 MEDPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETEQKTK 318 ME+ K++EEGF + + D + + ET + ++L E +Q+ K Sbjct: 1 MEETKRMEEGFNSPLDVK--------------DELKNQETNIEYDKDTKL---EQDQQIK 43 Query: 319 TKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALA 498 TKRVATLDAFRGLTIV+MILVD AGGA+ RIDH+PWNGCTLADFVMPFFLFIVGVAIALA Sbjct: 44 TKRVATLDAFRGLTIVMMILVDKAGGAFPRIDHAPWNGCTLADFVMPFFLFIVGVAIALA 103 Query: 499 LKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQRIALV 678 LKRI + +A++K++LRTLKLLFWGILLQGGYSHAPDDLSYG++MK IRWCGILQRIALV Sbjct: 104 LKRIHNIKYAVKKIMLRTLKLLFWGILLQGGYSHAPDDLSYGVNMKFIRWCGILQRIALV 163 Query: 679 YFVVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYVPDWSF-E 855 Y +VALIE TTK+RPTTL P +IF+AYKW GGF AF +YM+TTF LYVPDWSF + Sbjct: 164 YCIVALIETFTTKLRPTTLTPQRLAIFTAYKW--FGGFVAFFVYMITTFTLYVPDWSFVD 221 Query: 856 H----KSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTFSSPSSG 1023 H + ++ V CGMRGHLGPACNAVG+VDRQ WG+NH YS PVW+ L+ CTF+SP G 
Sbjct: 222 HINGDEPRRYTVICGMRGHLGPACNAVGHVDRQVWGVNHFYSHPVWRHLKECTFNSPGEG 281 Query: 1024 KLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMXXXXX 1203 R+DAP+WCRAPFEPEGLLSSISAI+SGTIGIHYGHVLIHFKGH+ERLK WVSM Sbjct: 282 PFREDAPSWCRAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHSERLKQWVSMGFVLL 341 Query: 1204 XXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEWIGMN 1383 HFT+AIPINKQLYS SYVC TAGAAGIVFS+ Y+LIDVWG R PF+FLEWIGMN Sbjct: 342 TIAIILHFTNAIPINKQLYSISYVCLTAGAAGIVFSSLYILIDVWGIRTPFLFLEWIGMN 401 Query: 1384 AMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFAEITF 1563 +MLVFVMAA+GIFAAF+NGWYY++ +N+LV+WI++HVF +VW SERVGTLLYVIFAEITF Sbjct: 402 SMLVFVMAAEGIFAAFVNGWYYEDPNNSLVHWIKKHVFVNVWNSERVGTLLYVIFAEITF 461 Query: 1564 WAVVSGILHKLGIYWKL 1614 W +VSG+LHKLGIYWKL Sbjct: 462 WGIVSGVLHKLGIYWKL 478 >ref|XP_006595362.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Glycine max] Length = 482 Score = 717 bits (1852), Expect = 0.0 Identities = 339/420 (80%), Positives = 370/420 (88%), Gaps = 4/420 (0%) Frame = +1 Query: 367 LMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKRIPKVPFAIRKVIL 546 LM+LVDDAGGAY RIDHSPWNGCTLADFVMPFFLFIVGVAIALALKRIPKV +A++K+IL Sbjct: 65 LMVLVDDAGGAYPRIDHSPWNGCTLADFVMPFFLFIVGVAIALALKRIPKVKYAVKKIIL 124 Query: 547 RTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQRIALVYFVVALIEILTTKIRP 726 RTLKLLFWGILLQGGYSHAPDDLSYG+DM+ IRWCGILQRIALVY VVALIE TTK+RP Sbjct: 125 RTLKLLFWGILLQGGYSHAPDDLSYGVDMRFIRWCGILQRIALVYCVVALIETYTTKLRP 184 Query: 727 TTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYVPDWSF----EHKSEKFIVKCGMR 894 +TL+P H SIF+AY+W LGGF AFVIYMVT F+LYVPDWSF K +++ V+CGMR Sbjct: 185 STLKPGHLSIFTAYRW--LGGFVAFVIYMVTIFSLYVPDWSFVDYNSDKPKRYTVECGMR 242 Query: 895 GHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTFSSPSSGKLRKDAPAWCRAPFEPE 1074 GHLGPACNAVGYVDRQ WG+NHLYS PVW RL+ACT SSP+ G LRK+APAWCRAPFEPE Sbjct: 243 GHLGPACNAVGYVDRQVWGVNHLYSQPVWTRLKACTLSSPAEGPLRKNAPAWCRAPFEPE 302 Query: 1075 GLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMXXXXXXXXXXXHFTDAIPINKQ 1254 G LSS+ AI+SGTIGIHYGHVLIHFKGH ERLK W+SM 
HFTDAIPINKQ Sbjct: 303 GFLSSVLAILSGTIGIHYGHVLIHFKGHFERLKQWLSMGFVLLTLGLILHFTDAIPINKQ 362 Query: 1255 LYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEWIGMNAMLVFVMAAQGIFAAFI 1434 LYSFSYVCFTAGAAGIVFS FYLLIDVWG R PF+FLEWIGMNAMLVFVMAAQGIFAAF+ Sbjct: 363 LYSFSYVCFTAGAAGIVFSVFYLLIDVWGLRTPFLFLEWIGMNAMLVFVMAAQGIFAAFV 422 Query: 1435 NGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFAEITFWAVVSGILHKLGIYWKL 1614 NGWYYK+ DN+LV WIQ HVF +VW SER+GTLLYVIFAEITFW VV+GILHKLGIYWKL Sbjct: 423 NGWYYKDPDNSLVYWIQNHVFTNVWHSERLGTLLYVIFAEITFWGVVAGILHKLGIYWKL 482 >ref|XP_006836541.1| hypothetical protein AMTR_s00131p00031270 [Amborella trichopoda] gi|548839080|gb|ERM99394.1| hypothetical protein AMTR_s00131p00031270 [Amborella trichopoda] Length = 485 Score = 716 bits (1847), Expect = 0.0 Identities = 347/474 (73%), Positives = 391/474 (82%), Gaps = 13/474 (2%) Frame = +1 Query: 232 GDNVISTETERKSMIAS-----ELQAEETEQKTKTKRVATLDAFRGLTIVLMILVDDAGG 396 G+ V+ E K A+ EL +E +KT KRVATLDAFRGLTIV+MILVDDAGG Sbjct: 12 GEEVVVIHEEEKIEEATKEEKEELLQQEDGKKTSKKRVATLDAFRGLTIVVMILVDDAGG 71 Query: 397 AYARI-DHSPWNGCTLADFVMPFFLFIVGVAIALALKRIPKVPFAIRKVILRTLKLLFWG 573 AY +I DHSPWNGC LADFVMPFFLFIVGVAIALALK+IPKV A++KVILRTLKLLFWG Sbjct: 72 AYEQILDHSPWNGCRLADFVMPFFLFIVGVAIALALKKIPKVGDAVKKVILRTLKLLFWG 131 Query: 574 ILLQGGYSHAPDDLSYGIDMKLIRWCGILQRIALVYFVVALIEILTTKIRPTTLEPSHFS 753 I+LQGGYSHAPDDLSYG+DMK IRWCGILQRIALVY VVA+IEI TTKIRPT L S S Sbjct: 132 IILQGGYSHAPDDLSYGVDMKHIRWCGILQRIALVYLVVAMIEIATTKIRPTMLGSSPLS 191 Query: 754 IFSAYKWQWLGGFAAFVIYMVTTFALYVPDWSF-------EHKSEKFIVKCGMRGHLGPA 912 IF+AY+WQW GGF AF+IY++TT++LYVPDWS+ E+ + F VKCGMR HLGPA Sbjct: 192 IFNAYRWQWFGGFIAFLIYIITTYSLYVPDWSYVLHHQNNENNEKIFTVKCGMRAHLGPA 251 Query: 913 CNAVGYVDRQAWGINHLYSDPVWKRLRACTFSSPSSGKLRKDAPAWCRAPFEPEGLLSSI 1092 CNAVG+VDRQ WGINHLYS PVW+RL+ACT SSP SG LRKDA +WC AP+EPEGLLSSI Sbjct: 252 CNAVGHVDRQVWGINHLYSQPVWQRLKACTTSSPKSGPLRKDAASWCLAPYEPEGLLSSI 311 Query: 1093 SAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMXXXXXXXXXXXHFTDAIPINKQLYSFSY 1272 SAI+SGTIGIHYGHVLIHFK H ERLKHW+SM HFTDA+P+NKQLYS 
SY Sbjct: 312 SAILSGTIGIHYGHVLIHFKSHLERLKHWLSMGITLFIIGIILHFTDAMPLNKQLYSISY 371 Query: 1273 VCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEWIGMNAMLVFVMAAQGIFAAFINGWYYK 1452 VCFTAGAAGI+FS Y+LIDVW R+ FMFLEWIGMNAML+FV+ AQGIF AF+NGWYY+ Sbjct: 372 VCFTAGAAGILFSVLYMLIDVWRARIVFMFLEWIGMNAMLIFVLGAQGIFPAFVNGWYYE 431 Query: 1453 NSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFAEITFWAVVSGILHKLGIYWKL 1614 + +N LVNWIQ+HVF DVW SE +GTLLYVIFAEI FW VV+GILHKL IYWKL Sbjct: 432 DPENTLVNWIQKHVFVDVWNSENLGTLLYVIFAEIVFWGVVAGILHKLRIYWKL 485 >ref|XP_006428455.1| hypothetical protein CICLE_v10011557mg [Citrus clementina] gi|568877275|ref|XP_006491665.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like isoform X3 [Citrus sinensis] gi|557530512|gb|ESR41695.1| hypothetical protein CICLE_v10011557mg [Citrus clementina] Length = 419 Score = 712 bits (1839), Expect = 0.0 Identities = 332/419 (79%), Positives = 369/419 (88%), Gaps = 4/419 (0%) Frame = +1 Query: 370 MILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKRIPKVPFAIRKVILR 549 MILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALK++PK+ A++K+I R Sbjct: 1 MILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFR 60 Query: 550 TLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQRIALVYFVVALIEILTTKIRPT 729 TLKLLFWGI+LQGGYSHAPD LSYG+DMK IRWCGILQRIALVY VVALIE LTTK RP Sbjct: 61 TLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN 120 Query: 730 TLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYVPDWSF----EHKSEKFIVKCGMRG 897 LEP H SIF+AY+WQW+GGF AFVIY++TT++LYVP+WSF +H +K+IVKCGMRG Sbjct: 121 VLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRG 180 Query: 898 HLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTFSSPSSGKLRKDAPAWCRAPFEPEG 1077 HLGPACNAVGYVDR+ WGINHLYSDPVW RL ACT SSP+SG LR+DAP+WCRAPFEPEG Sbjct: 181 HLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEG 240 Query: 1078 LLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMXXXXXXXXXXXHFTDAIPINKQL 1257 LLS+ISAI+SGTIGIHYGHVLIHFKGH+ RLKHWVSM HFT+AIPINKQL Sbjct: 241 LLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIATILHFTNAIPINKQL 300 Query: 1258 
YSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEWIGMNAMLVFVMAAQGIFAAFIN 1437 YSFSYVCFTAGAAGIVFSA Y+L+DVW R PF+FL+WIGMNAMLVFV+ AQGI A F+N Sbjct: 301 YSFSYVCFTAGAAGIVFSALYVLMDVWELRTPFLFLKWIGMNAMLVFVLGAQGILAGFVN 360 Query: 1438 GWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFAEITFWAVVSGILHKLGIYWKL 1614 GWYYKN DN LVNWIQ H+F VW SER+GTLLYVIFAEITFW VV+GILH+LGIYWKL Sbjct: 361 GWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWGVVAGILHRLGIYWKL 419 >ref|XP_003576108.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Brachypodium distachyon] Length = 498 Score = 708 bits (1827), Expect = 0.0 Identities = 328/450 (72%), Positives = 376/450 (83%), Gaps = 7/450 (1%) Frame = +1 Query: 286 LQAEETEQKTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFF 465 L E QK K+ RVA LDAFRGLTIV+MILVDDAG +Y R+DHSPWNGCTLADFVMPFF Sbjct: 49 LAVVEEPQKKKSTRVAALDAFRGLTIVVMILVDDAGSSYERMDHSPWNGCTLADFVMPFF 108 Query: 466 LFIVGVAIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIR 645 LFIVGVAIA A+KR+P + A++KV +RTLK++FWG+LLQGGYSHAPDDL+YG+DMK+IR Sbjct: 109 LFIVGVAIAFAMKRVPNMGAAVKKVSVRTLKMIFWGLLLQGGYSHAPDDLAYGVDMKMIR 168 Query: 646 WCGILQRIALVYFVVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTF 825 WCGILQRIALVYF VALIE+ TTK+RPTT+ ++IF AY+WQWLG F VIYM+TTF Sbjct: 169 WCGILQRIALVYFAVALIEVFTTKVRPTTVRSGPYAIFDAYRWQWLGAFIVLVIYMITTF 228 Query: 826 ALYVPDWSFEHKSE-------KFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWK 984 +LYVPDWSF + ++ +F V+CG+RGHL PACNAVG++DRQ WGINHLYS PVW Sbjct: 229 SLYVPDWSFVYHNDGDINDGKRFTVQCGVRGHLDPACNAVGFIDRQVWGINHLYSQPVWI 288 Query: 985 RLRACTFSSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAE 1164 R + CTFSSP +GKLR DAPAWC PFEPEGLLSSIS+I+SGTIGIHYGHVLIHFK H E Sbjct: 289 RTKDCTFSSPETGKLRDDAPAWCLGPFEPEGLLSSISSIISGTIGIHYGHVLIHFKTHKE 348 Query: 1165 RLKHWVSMXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGK 1344 RL HW+SM HFT+AIPINKQLYSFSY+CFT GAAGIV SAFY LIDVWG Sbjct: 349 RLTHWLSMGFALLLLGILLHFTNAIPINKQLYSFSYICFTGGAAGIVLSAFYALIDVWGL 408 Query: 1345 RMPFMFLEWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERV 1524 
R+PF+FLEWIGMNAMLVFV+AAQGIFAAF+NGWYY++ D LVNWIQ+HVF +VW SE + Sbjct: 409 RVPFLFLEWIGMNAMLVFVLAAQGIFAAFMNGWYYESQDKTLVNWIQQHVFVNVWHSENL 468 Query: 1525 GTLLYVIFAEITFWAVVSGILHKLGIYWKL 1614 G LLYVIF EI FW VVSGILHKLGIYWKL Sbjct: 469 GNLLYVIFGEILFWGVVSGILHKLGIYWKL 498