BLASTX nr result
ID: Mentha29_contig00031781
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00031781
         (2008 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gb|EYU29888.1| hypothetical protein MIMGU_mgv1a005303mg [Mimulus...   809   0.0
ref|XP_006354406.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   770   0.0
ref|XP_007203948.1| hypothetical protein PRUPE_ppa018154mg [Prun...   764   0.0
ref|XP_002268831.2| PREDICTED: heparan-alpha-glucosaminide N-ace...   759   0.0
ref|XP_002519467.1| conserved hypothetical protein [Ricinus comm...   749   0.0
ref|XP_007144374.1| hypothetical protein PHAVU_007G150900g [Phas...   748   0.0
ref|XP_006381387.1| hypothetical protein POPTR_0006s12440g [Popu...   748   0.0
ref|XP_004246913.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   748   0.0
emb|CAN64496.1| hypothetical protein VITISV_004036 [Vitis vinifera]   746   0.0
ref|XP_006428456.1| hypothetical protein CICLE_v10011557mg [Citr...   746   0.0
ref|XP_004306260.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   743   0.0
ref|XP_004515935.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   740   0.0
ref|XP_004494901.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   739   0.0
ref|XP_007027449.1| Uncharacterized protein TCM_022283 [Theobrom...   738   0.0
ref|XP_006577710.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   735   0.0
ref|XP_004494902.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   718   0.0
ref|XP_006595362.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   717   0.0
ref|XP_006836541.1| hypothetical protein AMTR_s00131p00031270 [A...   715   0.0
ref|XP_006428455.1| hypothetical protein CICLE_v10011557mg [Citr...
712 0.0 ref|XP_003576108.1| PREDICTED: heparan-alpha-glucosaminide N-ace... 707 0.0 >gb|EYU29888.1| hypothetical protein MIMGU_mgv1a005303mg [Mimulus guttatus] Length = 490 Score = 809 bits (2089), Expect = 0.0 Identities = 401/505 (79%), Positives = 425/505 (84%), Gaps = 13/505 (2%) Frame = +1 Query: 394 MEDPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETER--KSMIASELQAEETEQ- 564 MEDP+KLEEG A+K D+ IS E KS+ SE AEE + Sbjct: 1 MEDPRKLEEGLHAAKK---------------NDDDISNENNEHNKSVTKSEPVAEEKREE 45 Query: 565 -----KTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFI 729 K K KRVATLDAFRGLTIVLM+LVDDAGGAY+RIDHSPWNGCTLADFVMPFFLFI Sbjct: 46 EPPLVKQKAKRVATLDAFRGLTIVLMVLVDDAGGAYSRIDHSPWNGCTLADFVMPFFLFI 105 Query: 730 VGVAIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCG 909 VGVAIALALKRIPKV +A+RKVILRTLKLLFWGILLQGGYSHAP DL+YG+DMKLIRWCG Sbjct: 106 VGVAIALALKRIPKVSYAVRKVILRTLKLLFWGILLQGGYSHAPYDLAYGVDMKLIRWCG 165 Query: 910 ILQRIALVYFIVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALY 1089 ILQRIALVYFIVALIEI TTK+RPT+L+P HFSIF+AYKWQW+GGF AFVIYMVTTF+LY Sbjct: 166 ILQRIALVYFIVALIEIATTKLRPTSLDPGHFSIFTAYKWQWVGGFVAFVIYMVTTFSLY 225 Query: 1090 VPDWSF-----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRAC 1254 VPDWSF K EKF V CGMRGHLGPACNAVGYVDRQAWGINHLY+ PVW RL+AC Sbjct: 226 VPDWSFVAKDDSDKLEKFTVICGMRGHLGPACNAVGYVDRQAWGINHLYNQPVWSRLKAC 285 Query: 1255 TFSSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHW 1434 TFSSP SG RKDAP WCRAPFEPEGLLSSISAI+SGTIGIHYGHVLIHFKGHAERLK W Sbjct: 286 TFSSPDSGPFRKDAPTWCRAPFEPEGLLSSISAIISGTIGIHYGHVLIHFKGHAERLKQW 345 Query: 1435 VSMXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFM 1614 VSM HFTDAIPINKQLYSFSYVCFTAGAAGIVFS FYLLIDVWG R PF+ Sbjct: 346 VSMALVLLIVAVILHFTDAIPINKQLYSFSYVCFTAGAAGIVFSLFYLLIDVWGMRKPFL 405 Query: 1615 FLEWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLY 1794 FLEWIGMNAMLVFVMAAQGIFA FINGWY+KN DNNLVNWIQ+HVF DVWKSE+VGTLLY Sbjct: 406 FLEWIGMNAMLVFVMAAQGIFAGFINGWYFKNPDNNLVNWIQQHVFFDVWKSEKVGTLLY 465 Query: 1795 VIFAEITFWAVVSGILHKLGIYWKL 
1869 VIFAEITFWAVVSGILHKL IYWKL Sbjct: 466 VIFAEITFWAVVSGILHKLRIYWKL 490 >ref|XP_006354406.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Solanum tuberosum] Length = 494 Score = 770 bits (1988), Expect = 0.0 Identities = 380/498 (76%), Positives = 415/498 (83%), Gaps = 6/498 (1%) Frame = +1 Query: 394 MEDPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETEQKT- 570 MEDPKKLEEGFG KN D + S E E K++ + LQ EE EQ T Sbjct: 1 MEDPKKLEEGFGNQKNNISEENIDTNRIDNNQD-LASHENELKTL-SQPLQKEEEEQPTI 58 Query: 571 -KTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 747 K KRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA Sbjct: 59 KKGKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 118 Query: 748 LALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQRIA 927 LALKR+PKV AIRKV LRTLKLLFWGI+LQGGYSHAP DL+YG+DMK+IRWCGILQRIA Sbjct: 119 LALKRVPKVSAAIRKVTLRTLKLLFWGIILQGGYSHAPYDLAYGVDMKVIRWCGILQRIA 178 Query: 928 LVYFIVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYVPDWSF 1107 LVY IVALIEILTTK+RPTTL P HFSIF+AYKW LGGF AFV+YM T + LYVPDW+F Sbjct: 179 LVYLIVALIEILTTKLRPTTLTPGHFSIFTAYKW--LGGFVAFVVYMTTLYGLYVPDWNF 236 Query: 1108 -EHK---SEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTFSSPSS 1275 EH S+++ VKCGMRGHLGPACNAVGYVDRQ WGINHLY+ PVW R +ACT S P + Sbjct: 237 LEHDGDTSQRYTVKCGMRGHLGPACNAVGYVDRQVWGINHLYNQPVWARSKACTLSYPET 296 Query: 1276 GKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMXXXX 1455 G R DAP WCRAPFEPEGLLSSISAI+SGTIGIHYGHVLIHFKGH ERLK W+SM Sbjct: 297 GPFRDDAPTWCRAPFEPEGLLSSISAIMSGTIGIHYGHVLIHFKGHGERLKQWISMGFGL 356 Query: 1456 XXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEWIGM 1635 HF+DAIP+NKQLYSFSYVCFTAGAAGIVFS FY+LIDV G R+PF++LEWIGM Sbjct: 357 LIIAFILHFSDAIPLNKQLYSFSYVCFTAGAAGIVFSGFYILIDVLGMRIPFLWLEWIGM 416 Query: 1636 NAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFAEIT 1815 NAML+FVM AQGIFA FINGWY+KN DNNLVNWIQ HVF DVWKS+R+GTLLYVIFAEIT Sbjct: 417 NAMLIFVMGAQGIFAGFINGWYFKNEDNNLVNWIQHHVFFDVWKSQRLGTLLYVIFAEIT 476 Query: 1816 
FWAVVSGILHKLGIYWKL 1869 FWAV++GILH+LGIYWKL Sbjct: 477 FWAVLAGILHRLGIYWKL 494 >ref|XP_007203948.1| hypothetical protein PRUPE_ppa018154mg [Prunus persica] gi|462399479|gb|EMJ05147.1| hypothetical protein PRUPE_ppa018154mg [Prunus persica] Length = 508 Score = 764 bits (1974), Expect = 0.0 Identities = 371/506 (73%), Positives = 414/506 (81%), Gaps = 15/506 (2%) Frame = +1 Query: 397 EDPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETEQ---- 564 +D KKLEEG +K+ + I + E K+ A E +Q Sbjct: 3 DDAKKLEEGRLHNKDDLISERVNTTSDHNGRGDAIDHDHEEKNKAAVAAAPLEADQIKGE 62 Query: 565 -------KTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFL 723 K K+KRVATLDAFRGLTIV+MILVDDAGGAYARIDHSPWNGCTLADFVMPFFL Sbjct: 63 EQPVLVVKQKSKRVATLDAFRGLTIVVMILVDDAGGAYARIDHSPWNGCTLADFVMPFFL 122 Query: 724 FIVGVAIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRW 903 FIVGVAIALALK+IPK+ AI+K+ILRTLKL+FWGI+LQGGYSHAP DLSYG+DMK IRW Sbjct: 123 FIVGVAIALALKKIPKINDAIKKIILRTLKLMFWGIILQGGYSHAPADLSYGVDMKQIRW 182 Query: 904 CGILQRIALVYFIVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFA 1083 GILQRIALVYF+VALIE LTTK RPT LEP H SIF+AYKWQW+GGF AF+IYM+TTF+ Sbjct: 183 FGILQRIALVYFVVALIETLTTKFRPTVLEPGHLSIFTAYKWQWIGGFLAFLIYMITTFS 242 Query: 1084 LYVPDWSF----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRA 1251 LYVPDWSF +H+S+K++VKCGMRGHLGPACNAVGYVDRQ WGINHLY+ PVW+RL+A Sbjct: 243 LYVPDWSFVVDNDHRSKKYLVKCGMRGHLGPACNAVGYVDRQVWGINHLYTQPVWRRLKA 302 Query: 1252 CTFSSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKH 1431 CT SSPS G LR+ AP+WCR PFEPEGLLSSISAI+SGTIGIHYGHVLIHFKGH+ERLK Sbjct: 303 CTLSSPSDGPLREGAPSWCRGPFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHSERLKQ 362 Query: 1432 WVSMXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPF 1611 WVSM HFTDAIPINKQLYSFSYVCFTAGAAG+VFS FYLLIDVWG R PF Sbjct: 363 WVSMGFILIVIAIILHFTDAIPINKQLYSFSYVCFTAGAAGLVFSGFYLLIDVWGYRTPF 422 Query: 1612 MFLEWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLL 1791 +FLEWIGMNAMLVFVMAAQGIFAAF+NGWYYK+ DN+LV+WIQEHVF +VW SER+GTLL Sbjct: 423 
LFLEWIGMNAMLVFVMAAQGIFAAFVNGWYYKSPDNSLVHWIQEHVFINVWHSERLGTLL 482 Query: 1792 YVIFAEITFWAVVSGILHKLGIYWKL 1869 YVIF EI FW VV+GILHK IYWKL Sbjct: 483 YVIFGEILFWGVVAGILHKFRIYWKL 508 >ref|XP_002268831.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Vitis vinifera] gi|297739972|emb|CBI30154.3| unnamed protein product [Vitis vinifera] Length = 489 Score = 759 bits (1961), Expect = 0.0 Identities = 368/495 (74%), Positives = 412/495 (83%), Gaps = 5/495 (1%) Frame = +1 Query: 400 DPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETEQ-KTKT 576 D K++EEG G D + E+ + E + EE K K+ Sbjct: 4 DAKRVEEGLG---------HVHKEDISEKADKIEKDESSATPAQSVEQKGEEQPLIKQKS 54 Query: 577 KRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL 756 KRVATLDAFRGLTIVLMILVDDAGG+YARIDHSPWNGCTLADFVMPFFLFIVGVA+ALAL Sbjct: 55 KRVATLDAFRGLTIVLMILVDDAGGSYARIDHSPWNGCTLADFVMPFFLFIVGVAVALAL 114 Query: 757 KRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQRIALVY 936 K+IP++ A++K+ LRTLKLLFWGILLQGGYSHAPDDLSYG+DMK IRW GILQRIA+VY Sbjct: 115 KKIPRISLAVKKISLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHIRWFGILQRIAVVY 174 Query: 937 FIVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYVPDWSF--- 1107 F+VALIE LTTK RPT ++ HFSI SAYKWQW+GGF AF+IYM+TT+ALYVPDWSF Sbjct: 175 FVVALIETLTTKRRPTVIDSGHFSILSAYKWQWIGGFVAFLIYMITTYALYVPDWSFVID 234 Query: 1108 -EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTFSSPSSGKL 1284 +H+++++ VKCGMRGHLGPACNAVGYVDRQ WGINHLYS PVW RL+ACT SSP+SG Sbjct: 235 QDHEAKRYTVKCGMRGHLGPACNAVGYVDRQVWGINHLYSQPVWTRLKACTLSSPNSGPF 294 Query: 1285 RKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMXXXXXXX 1464 R+DAP+WC APFEPEGLLS+ISAI+SGTIGIHYGHVLIHFKGHAERLK WVSM Sbjct: 295 REDAPSWCYAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHAERLKQWVSMGIVLLIV 354 Query: 1465 XXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEWIGMNAM 1644 HFTDAIPINKQLYSFSYVCFTAGAAGIV SAFYL+IDVWG R PF+FLEWIGMNAM Sbjct: 355 AIILHFTDAIPINKQLYSFSYVCFTAGAAGIVLSAFYLVIDVWGFRTPFLFLEWIGMNAM 414 Query: 1645 
LVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFAEITFWA 1824 LVFVMAAQGIFAAFINGWY+++SDN+LV+WIQ HVF DVW SER+GTLLYVIFAEITFWA Sbjct: 415 LVFVMAAQGIFAAFINGWYFESSDNSLVHWIQRHVFIDVWHSERLGTLLYVIFAEITFWA 474 Query: 1825 VVSGILHKLGIYWKL 1869 VVSGILHKL IYWKL Sbjct: 475 VVSGILHKLHIYWKL 489 >ref|XP_002519467.1| conserved hypothetical protein [Ricinus communis] gi|223541330|gb|EEF42881.1| conserved hypothetical protein [Ricinus communis] Length = 519 Score = 749 bits (1935), Expect = 0.0 Identities = 364/506 (71%), Positives = 414/506 (81%), Gaps = 14/506 (2%) Frame = +1 Query: 394 MEDPKKLEEGFG----ASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETE 561 MEDP+KLEEG A++N G VI + S + E + E+ + Sbjct: 14 MEDPRKLEEGLAHAKVANENQQEQHLSEKLDKTHDGGGVIPEKELTSSTVLVEQEGEQLQ 73 Query: 562 Q------KTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFL 723 Q K KTKRVATLDAFRGLT+VLMILVD+AG +YARIDHSPWNGCTLADFVMPFFL Sbjct: 74 QPEQLPVKQKTKRVATLDAFRGLTVVLMILVDNAGESYARIDHSPWNGCTLADFVMPFFL 133 Query: 724 FIVGVAIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRW 903 FIVGVAIALALKRIP+ A++K+ LRTLKLLFWGILLQGGYSHAP DLSYG+DMKLIRW Sbjct: 134 FIVGVAIALALKRIPRKRDAVKKISLRTLKLLFWGILLQGGYSHAPVDLSYGVDMKLIRW 193 Query: 904 CGILQRIALVYFIVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFA 1083 CGILQRIALVY VALIE LT K R T L+P+HFSIF+AY+WQW+GGF AF+IYM+TT+A Sbjct: 194 CGILQRIALVYMFVALIETLTIKERQTVLQPNHFSIFTAYRWQWIGGFIAFLIYMITTYA 253 Query: 1084 LYVPDWSF----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRA 1251 LYVPDWSF +++ ++ VKCGMRGHLGPACNAVGYVDR+ WGINHLY PVW RL+A Sbjct: 254 LYVPDWSFTAYDDNRPTRYTVKCGMRGHLGPACNAVGYVDREVWGINHLYQYPVWSRLKA 313 Query: 1252 CTFSSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKH 1431 CTFSSP++G LR DAP+WC APFEPEGLLS+ISAI+SGTIGIHYGHVLIHFKGH+ERLK Sbjct: 314 CTFSSPATGPLRADAPSWCLAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSERLKQ 373 Query: 1432 WVSMXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPF 1611 WVSM HFTDAIPINKQLYSFSYVCFTAGAAGIVFS FY+LIDV G R+PF Sbjct: 374 
WVSMGLGLFLIAIILHFTDAIPINKQLYSFSYVCFTAGAAGIVFSGFYILIDVLGLRIPF 433 Query: 1612 MFLEWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLL 1791 +FLEWIGMNAMLV+VMAAQGIF FINGW+YK+++N LV WIQEHVF+ VW SE++G LL Sbjct: 434 LFLEWIGMNAMLVYVMAAQGIFEGFINGWFYKSNNNTLVYWIQEHVFDKVWNSEKLGNLL 493 Query: 1792 YVIFAEITFWAVVSGILHKLGIYWKL 1869 YVIFA+ITFWAVVSGILH+LGIYWKL Sbjct: 494 YVIFAQITFWAVVSGILHRLGIYWKL 519 >ref|XP_007144374.1| hypothetical protein PHAVU_007G150900g [Phaseolus vulgaris] gi|561017564|gb|ESW16368.1| hypothetical protein PHAVU_007G150900g [Phaseolus vulgaris] Length = 500 Score = 748 bits (1931), Expect = 0.0 Identities = 359/501 (71%), Positives = 406/501 (81%), Gaps = 9/501 (1%) Frame = +1 Query: 394 MEDPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETEQ--- 564 M++ K++EEG ++ GD++ R + E + EQ Sbjct: 1 MDEAKRMEEGLSSTPQNGELKQEIEKTNGD-GDSIEHDRDTRSTTQEGESTRQIVEQEQP 59 Query: 565 --KTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGV 738 K KTKRVATLDAFRGLTIVLM+LVDDAGGAY +IDHSPWNGCTLADFVMPFFLFIVGV Sbjct: 60 LVKQKTKRVATLDAFRGLTIVLMVLVDDAGGAYPQIDHSPWNGCTLADFVMPFFLFIVGV 119 Query: 739 AIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQ 918 AIALALKRIP+V A++K+ILRTLKLLFWG+LLQGGYSHAPDDLSYG+DM+ IRWCGILQ Sbjct: 120 AIALALKRIPRVKDAVKKIILRTLKLLFWGVLLQGGYSHAPDDLSYGVDMRFIRWCGILQ 179 Query: 919 RIALVYFIVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYVPD 1098 RIALVY +VALIE T K+RP TL+P H SIFSAY+WQW GGF AFVIYMVTTF+LYVPD Sbjct: 180 RIALVYCVVALIETYTNKLRPYTLKPGHLSIFSAYRWQWFGGFVAFVIYMVTTFSLYVPD 239 Query: 1099 WSF----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTFSS 1266 WSF ++ +++ V+CG+RGHLGPACNAVGYVDRQ WG+NHLYS PVW R ACT SS Sbjct: 240 WSFVDYNSYEPKRYTVQCGIRGHLGPACNAVGYVDRQVWGVNHLYSQPVWTRSSACTLSS 299 Query: 1267 PSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMX 1446 P+ G RK+AP+WCRAPFEPEGLLSSISAI+SG IGIHYGHVLIHFKGH+ERLK W+S+ Sbjct: 300 PAEGHFRKNAPSWCRAPFEPEGLLSSISAILSGIIGIHYGHVLIHFKGHSERLKQWLSLG 359 Query: 1447 
XXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEW 1626 HFTDAIPINKQLYSFSYVCFTAGAAGIVFS FYLLID+W R PF+ LEW Sbjct: 360 FFLLIIGIILHFTDAIPINKQLYSFSYVCFTAGAAGIVFSVFYLLIDIWDLRTPFLLLEW 419 Query: 1627 IGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFA 1806 IGMNAMLVFVMAAQGIFAAF+NGWYYK+ DN+LVNWIQ HVF +VW SER+GTLLYVIFA Sbjct: 420 IGMNAMLVFVMAAQGIFAAFVNGWYYKDPDNSLVNWIQNHVFINVWHSERLGTLLYVIFA 479 Query: 1807 EITFWAVVSGILHKLGIYWKL 1869 EITFW VV+GI HKLGIYWKL Sbjct: 480 EITFWGVVAGIFHKLGIYWKL 500 >ref|XP_006381387.1| hypothetical protein POPTR_0006s12440g [Populus trichocarpa] gi|550336090|gb|ERP59184.1| hypothetical protein POPTR_0006s12440g [Populus trichocarpa] Length = 502 Score = 748 bits (1930), Expect = 0.0 Identities = 367/503 (72%), Positives = 406/503 (80%), Gaps = 11/503 (2%) Frame = +1 Query: 394 MEDPKKLEEGFGASKNXXXXXXXXXXXXXXXG--DNVISTETERKSMIASELQAEETEQ- 564 MEDPK++EEG G + G D E E + ++ E ++ Sbjct: 1 MEDPKRMEEGLGHTALVANIDDENIHLSEKEGKTDGGDDNEKEERRVVHDHQAEREGDRQ 60 Query: 565 ---KTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVG 735 K K+KRVATLDAFRGLTIVLMILVDDAGG Y RIDHSPWNGCTLADFVMPFFLFIVG Sbjct: 61 PVVKQKSKRVATLDAFRGLTIVLMILVDDAGGVYPRIDHSPWNGCTLADFVMPFFLFIVG 120 Query: 736 VAIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGIL 915 VAIALA KRIPK A++K+ILRTLKLLFWG+LLQGGYSHAP DL+YG+DMKLIRW GIL Sbjct: 121 VAIALAFKRIPKRRDAVKKIILRTLKLLFWGVLLQGGYSHAPSDLAYGVDMKLIRWFGIL 180 Query: 916 Q-RIALVYFIVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYV 1092 Q RIALVY +VALIE L K R T +EP HF+IF+AY+WQW+ GF +FVIYMVTTFALYV Sbjct: 181 QQRIALVYMVVALIEALIPKNRQT-IEPDHFTIFTAYRWQWIAGFISFVIYMVTTFALYV 239 Query: 1093 PDWSF----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTF 1260 PDWSF +H+ ++ V+CGMRGHLGPACNAVGYVDR+ WGINHLY PVW RL+ACT Sbjct: 240 PDWSFTVDEDHERRRYTVECGMRGHLGPACNAVGYVDREVWGINHLYQYPVWSRLKACTL 299 Query: 1261 SSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVS 1440 SSP SG RKDAP+WCRAPFEPEGLLSSISAI+SGTIGIHYGHVLIHFKGHAERL+ WVS Sbjct: 300 
SSPGSGPFRKDAPSWCRAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHAERLRQWVS 359 Query: 1441 MXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFL 1620 M HFTDAIPINKQLYSFSYVCFTAGAAGIVFS FY+LIDVWG R PF+FL Sbjct: 360 MGVILLIVAIILHFTDAIPINKQLYSFSYVCFTAGAAGIVFSGFYVLIDVWGLRPPFLFL 419 Query: 1621 EWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVI 1800 EWIGMNAMLV+VMAAQGIF FINGWYYK+ DN LV WIQ+HVFNDVW SERVGTLLYVI Sbjct: 420 EWIGMNAMLVYVMAAQGIFEGFINGWYYKSPDNTLVYWIQDHVFNDVWHSERVGTLLYVI 479 Query: 1801 FAEITFWAVVSGILHKLGIYWKL 1869 FA+ITFWAVVSG+LHKLGIYWKL Sbjct: 480 FAQITFWAVVSGVLHKLGIYWKL 502 >ref|XP_004246913.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Solanum lycopersicum] Length = 494 Score = 748 bits (1930), Expect = 0.0 Identities = 366/498 (73%), Positives = 408/498 (81%), Gaps = 6/498 (1%) Frame = +1 Query: 394 MEDPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETEQKT- 570 MEDPKKLEEGF KN D ++S E ++K +++ LQ +E E+ Sbjct: 1 MEDPKKLEEGFSNQKNNISEENIDTNRIDNNQD-LVSHENDQK-ILSQPLQKKEEEEPII 58 Query: 571 -KTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 747 K KRVATLDAFRGLTIVLMILVDDAGGAYA IDHSPWNGCTLADFVMPFFLFIVGVAIA Sbjct: 59 KKGKRVATLDAFRGLTIVLMILVDDAGGAYACIDHSPWNGCTLADFVMPFFLFIVGVAIA 118 Query: 748 LALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQRIA 927 LALKR+PKV AI+KV LRTLKLLFWGI+LQGGYSHAP DL+YG+DMK+IRWCGILQRIA Sbjct: 119 LALKRVPKVSAAIKKVTLRTLKLLFWGIILQGGYSHAPYDLAYGVDMKVIRWCGILQRIA 178 Query: 928 LVYFIVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYVPDWSF 1107 LVY IVALIEILTTK+RPTTL P HFSIF+AYKW LGGF AFV+Y T + LYVPDW+F Sbjct: 179 LVYLIVALIEILTTKLRPTTLTPGHFSIFTAYKW--LGGFVAFVVYTTTIYGLYVPDWNF 236 Query: 1108 ----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTFSSPSS 1275 S+++ VKCGMRGHLGPACNAVGYVDRQ WGINHLY+ PVW R + CT S P + Sbjct: 237 LVHDGDTSQRYTVKCGMRGHLGPACNAVGYVDRQVWGINHLYNQPVWARSKVCTLSYPET 296 Query: 1276 GKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMXXXX 1455 G R DAP+WCRAPFEPEGLLSSISAI+SGTIGIHYGHVLIHFKGH ERLK W+SM 
Sbjct: 297 GPFRDDAPSWCRAPFEPEGLLSSISAIMSGTIGIHYGHVLIHFKGHGERLKQWISMGLGL 356 Query: 1456 XXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEWIGM 1635 HF+DAIP+NKQLYSFSYVCFTAG AGIVFS Y+LIDV R+PF++LEWIGM Sbjct: 357 LITAFILHFSDAIPLNKQLYSFSYVCFTAGNAGIVFSGLYILIDVLAMRIPFLWLEWIGM 416 Query: 1636 NAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFAEIT 1815 NAML+FVM AQGIFA FINGWY+KN DNNLVNWIQ HVF DVWKS+R+GTLLYVIFAEIT Sbjct: 417 NAMLIFVMGAQGIFAGFINGWYFKNEDNNLVNWIQHHVFFDVWKSQRLGTLLYVIFAEIT 476 Query: 1816 FWAVVSGILHKLGIYWKL 1869 FWAV++GILH+LGIYWKL Sbjct: 477 FWAVLAGILHRLGIYWKL 494 >emb|CAN64496.1| hypothetical protein VITISV_004036 [Vitis vinifera] Length = 511 Score = 746 bits (1927), Expect = 0.0 Identities = 368/517 (71%), Positives = 412/517 (79%), Gaps = 27/517 (5%) Frame = +1 Query: 400 DPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETEQ-KTKT 576 D K++EEG G D + E+ + E + EE K K+ Sbjct: 4 DAKRVEEGLG---------HVHKEDISEKADKIEKDESSATPAQSVEQKGEEQPLIKQKS 54 Query: 577 KRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL 756 KRVATLDAFRGLTIVLMILVDDAGG+YARIDHSPWNGCTLADFVMPFFLFIVGVA+ALAL Sbjct: 55 KRVATLDAFRGLTIVLMILVDDAGGSYARIDHSPWNGCTLADFVMPFFLFIVGVAVALAL 114 Query: 757 KRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQ------ 918 K+IP++ A++K+ LRTLKLLFWGILLQGGYSHAPDDLSYG+DMK IRW GILQ Sbjct: 115 KKIPRISLAVKKISLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHIRWFGILQVFPLPL 174 Query: 919 ----------------RIALVYFIVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFA 1050 RIA+VYF+VALIE LTTK RPT ++ HFSI SAYKWQW+GGF Sbjct: 175 FTGKSIPSSSLSGFLQRIAVVYFVVALIETLTTKRRPTVIDSGHFSILSAYKWQWIGGFV 234 Query: 1051 AFVIYMVTTFALYVPDWSF----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHL 1218 AF+IYM+TT+ALYVPDWSF +H+++++ VKCGMRGHLGPACNAVGYVDRQ WGINHL Sbjct: 235 AFLIYMITTYALYVPDWSFVIDQDHEAKRYTVKCGMRGHLGPACNAVGYVDRQVWGINHL 294 Query: 1219 YSDPVWKRLRACTFSSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLI 1398 YS PVW RL+ACT SSP+SG R+DAP+WC APFEPEGLLS+ISAI+SGTIGIHYGHVLI Sbjct: 295 
YSQPVWTRLKACTLSSPNSGPFREDAPSWCYAPFEPEGLLSTISAILSGTIGIHYGHVLI 354 Query: 1399 HFKGHAERLKHWVSMXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYL 1578 HFKGHAERLK WVSM HFTDAIPINKQLYSFSYVCFTAGAAGIV SAFYL Sbjct: 355 HFKGHAERLKQWVSMGIVLLIVAIILHFTDAIPINKQLYSFSYVCFTAGAAGIVXSAFYL 414 Query: 1579 LIDVWGKRMPFMFLEWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFND 1758 +IDVWG R PF+FLEWIGMNAMLVFVMAAQGIFAAFINGWY+++SDN+LV+WIQ HVF D Sbjct: 415 VIDVWGFRTPFLFLEWIGMNAMLVFVMAAQGIFAAFINGWYFESSDNSLVHWIQRHVFID 474 Query: 1759 VWKSERVGTLLYVIFAEITFWAVVSGILHKLGIYWKL 1869 VW SER+GTLLYVIFAEITFWAVVSGILHKL IYWKL Sbjct: 475 VWHSERLGTLLYVIFAEITFWAVVSGILHKLHIYWKL 511 >ref|XP_006428456.1| hypothetical protein CICLE_v10011557mg [Citrus clementina] gi|568877271|ref|XP_006491663.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like isoform X1 [Citrus sinensis] gi|557530513|gb|ESR41696.1| hypothetical protein CICLE_v10011557mg [Citrus clementina] Length = 495 Score = 746 bits (1925), Expect = 0.0 Identities = 351/448 (78%), Positives = 394/448 (87%), Gaps = 4/448 (0%) Frame = +1 Query: 538 ELQAEETEQKTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPF 717 ELQ ++ Q+ K+KRVATLDAFRGLT+VLMILVDDAGGAYARIDHSPWNGCTLADFVMPF Sbjct: 49 ELQLQQLLQQ-KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPF 107 Query: 718 FLFIVGVAIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLI 897 FLFIVGVAIALALK++PK+ A++K+I RTLKLLFWGI+LQGGYSHAPD LSYG+DMK I Sbjct: 108 FLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHI 167 Query: 898 RWCGILQRIALVYFIVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTT 1077 RWCGILQRIALVY +VALIE LTTK RP LEP H SIF+AY+WQW+GGF AFVIY++TT Sbjct: 168 RWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITT 227 Query: 1078 FALYVPDWSF----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRL 1245 ++LYVP+WSF +H +K+IVKCGMRGHLGPACNAVGYVDR+ WGINHLYSDPVW RL Sbjct: 228 YSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRL 287 Query: 1246 RACTFSSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERL 1425 ACT SSP+SG 
LR+DAP+WCRAPFEPEGLLS+ISAI+SGTIGIHYGHVLIHFKGH+ RL Sbjct: 288 EACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARL 347 Query: 1426 KHWVSMXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRM 1605 KHWVSM HFT+AIPINKQLYSFSYVCFTAGAAGIVFSA Y+L+DVW R Sbjct: 348 KHWVSMGFGLLIIATILHFTNAIPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWELRT 407 Query: 1606 PFMFLEWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGT 1785 PF+FL+WIGMNAMLVFV+ AQGI A F+NGWYYKN DN LVNWIQ H+F VW SER+GT Sbjct: 408 PFLFLKWIGMNAMLVFVLGAQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGT 467 Query: 1786 LLYVIFAEITFWAVVSGILHKLGIYWKL 1869 LLYVIFAEITFW VV+GILH+LGIYWKL Sbjct: 468 LLYVIFAEITFWGVVAGILHRLGIYWKL 495 >ref|XP_004306260.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Fragaria vesca subsp. vesca] Length = 509 Score = 743 bits (1917), Expect = 0.0 Identities = 362/507 (71%), Positives = 405/507 (79%), Gaps = 17/507 (3%) Frame = +1 Query: 400 DPKKLEEGFGAS------------KNXXXXXXXXXXXXXXXGDNVISTETERKSMIASEL 543 D K+LEEGFG + K +STE + Sbjct: 5 DAKRLEEGFGHNPVPEVYEEDEHKKEVLTSNSTGDAIDRVDEKKAVSTEPREVEQVKGAE 64 Query: 544 QAEETEQKTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFL 723 + + + K K+KRVATLDAFRGLTIV+MILVDDAGGAYARIDHSPWNGCTLADFVMPFFL Sbjct: 65 EEQPLQVKQKSKRVATLDAFRGLTIVVMILVDDAGGAYARIDHSPWNGCTLADFVMPFFL 124 Query: 724 FIVGVAIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRW 903 FIVGVAIALALKRIPK A++K+ILRT+KLLFWGI+LQGGYS APD L+YG+DMK IRW Sbjct: 125 FIVGVAIALALKRIPKTSDAVKKIILRTIKLLFWGIILQGGYSQAPDTLAYGVDMKKIRW 184 Query: 904 CGILQRIALVYFIVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFA 1083 GILQRIALVY +VALIE TTK+RPT L+ SIF+AYKW +GGF AF++YM+TTF+ Sbjct: 185 FGILQRIALVYCVVALIETFTTKLRPTVLKSGPVSIFTAYKW--IGGFVAFLVYMITTFS 242 Query: 1084 LYVPDWSF-----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLR 1248 LY+PDWSF + ++K++VKCGMRGHLGPACNAVGYVDRQ WGINHLY+ PVW RL+ Sbjct: 243 LYIPDWSFVKHYDDGSTKKYLVKCGMRGHLGPACNAVGYVDRQVWGINHLYNQPVWIRLK 302 Query: 1249 ACTFSSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLK 
1428 ACT SSPS+G LRK AP+WCRAPFEPEGLLSSISAI+SGTIGIHYGH+LIHFKGHAERLK Sbjct: 303 ACTLSSPSTGPLRKGAPSWCRAPFEPEGLLSSISAILSGTIGIHYGHILIHFKGHAERLK 362 Query: 1429 HWVSMXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMP 1608 WVSM HFTDAIPINKQLYSFSYVCFTAGAAG+VFS FYLLIDVWG R P Sbjct: 363 QWVSMGLVLMIIAIILHFTDAIPINKQLYSFSYVCFTAGAAGLVFSGFYLLIDVWGYRTP 422 Query: 1609 FMFLEWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTL 1788 F+FLEWIGMNAMLVFVMAAQGIFA F+NGWYY+ D LV WIQEHVFN+VW SER+GTL Sbjct: 423 FLFLEWIGMNAMLVFVMAAQGIFAGFVNGWYYETQDKTLVYWIQEHVFNNVWHSERLGTL 482 Query: 1789 LYVIFAEITFWAVVSGILHKLGIYWKL 1869 LYVIFAEITFWAVVSGILHKLGIYWKL Sbjct: 483 LYVIFAEITFWAVVSGILHKLGIYWKL 509 >ref|XP_004515935.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Cicer arietinum] Length = 480 Score = 740 bits (1910), Expect = 0.0 Identities = 362/496 (72%), Positives = 404/496 (81%), Gaps = 4/496 (0%) Frame = +1 Query: 394 MEDPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETEQKTK 573 M++ K++EEG + N G + I + S Q +E + K K Sbjct: 1 MDEAKRMEEGINSPHNG--------------GGDSIEHDNNDTMKGESVHQPKEPDGKHK 46 Query: 574 TKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALA 753 TKRVATLDAFRGLTIVLMILVDDAG AY RIDHSPWNGCTLADFVMPFFLFIVGVAIALA Sbjct: 47 TKRVATLDAFRGLTIVLMILVDDAGEAYPRIDHSPWNGCTLADFVMPFFLFIVGVAIALA 106 Query: 754 LKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQRIALV 933 LKRIPKV A++K+ILRTLKLLFWG+LLQGGYSHAPDDLSYGIDMK IRWCGILQRIALV Sbjct: 107 LKRIPKVKVAVKKIILRTLKLLFWGLLLQGGYSHAPDDLSYGIDMKFIRWCGILQRIALV 166 Query: 934 YFIVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYVPDWSF-- 1107 Y +VALIE T K+RPTTL P + SIF++Y+W GGF AFVIYMVTTF+LYVPDWSF Sbjct: 167 YCVVALIETFTIKLRPTTLSPGYLSIFTSYRW--FGGFVAFVIYMVTTFSLYVPDWSFVD 224 Query: 1108 --EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTFSSPSSGK 1281 + +++ V CGMRGHLGPACNAVGYVDRQ W +NHLYS PVW RL+ACTFSSP+ G Sbjct: 225 YNSSELKRYTVICGMRGHLGPACNAVGYVDRQIWRVNHLYSQPVWNRLKACTFSSPAEGH 284 Query: 1282 
LRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMXXXXXX 1461 LRKDAP WCRAPFEPEGLLS+ISAI+SGTIGIHYGHVLIHFKGH+ERLK W+SM Sbjct: 285 LRKDAPNWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSERLKQWLSMGFVLFI 344 Query: 1462 XXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEWIGMNA 1641 HFTDAIPINKQLYS SYVCFTAGAAGIVFS FY+LIDVWG R PF+FLEWIGMNA Sbjct: 345 LGIILHFTDAIPINKQLYSISYVCFTAGAAGIVFSIFYILIDVWGLRTPFLFLEWIGMNA 404 Query: 1642 MLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFAEITFW 1821 MLVFVMAAQG+FAAF+NGWYYK+ +N+LV+WIQ HVF +VW SER+GTLLYVIFAEITFW Sbjct: 405 MLVFVMAAQGLFAAFVNGWYYKDPNNSLVHWIQNHVFINVWHSERLGTLLYVIFAEITFW 464 Query: 1822 AVVSGILHKLGIYWKL 1869 VV+GILHKL IYWKL Sbjct: 465 GVVAGILHKLQIYWKL 480 >ref|XP_004494901.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Cicer arietinum] Length = 484 Score = 739 bits (1909), Expect = 0.0 Identities = 357/497 (71%), Positives = 406/497 (81%), Gaps = 5/497 (1%) Frame = +1 Query: 394 MEDPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETEQKTK 573 M++ K++EEG + + D + + E+ S + K K Sbjct: 1 MDEAKRMEEGHNLALDDDAKDDLKKQQTNIEHDRDVKLDHEQPSQV-----------KQK 49 Query: 574 TKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALA 753 TKRVATLDAFRGLTIVLMILVDDAGG Y RIDHSPWNGCTLADFVMPFFLFIVGVAIALA Sbjct: 50 TKRVATLDAFRGLTIVLMILVDDAGGVYPRIDHSPWNGCTLADFVMPFFLFIVGVAIALA 109 Query: 754 LKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQRIALV 933 LKRIPK+ +A++K++LRTLKLLFWGILLQGGYSHAPD+L YG++MK IRWCGILQRIALV Sbjct: 110 LKRIPKIKYAMKKIMLRTLKLLFWGILLQGGYSHAPDELIYGVNMKFIRWCGILQRIALV 169 Query: 934 YFIVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYVPDWSFE- 1110 Y IVALIE TTK+RPTTL P +IF+AYKW GGF AF+IYM+TTF LYVPDWSF Sbjct: 170 YCIVALIETFTTKLRPTTLSPQRLAIFTAYKW--FGGFMAFLIYMITTFTLYVPDWSFVD 227 Query: 1111 ----HKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTFSSPSSG 1278 H S+++ V CGMRGHLGPACNAVG+VDRQ WGINHLYS PVW+RL+ACTF SP G Sbjct: 228 QVKGHGSKRYTVICGMRGHLGPACNAVGHVDRQVWGINHLYSQPVWRRLKACTFDSPGEG 287 Query: 1279 
KLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMXXXXX 1458 KLR+DAP+WC APFEPEGLLSSISAI+SGTIGIHYGHVLIHFKGH+ERLK WVSM Sbjct: 288 KLREDAPSWCLAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHSERLKQWVSMGFVLL 347 Query: 1459 XXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEWIGMN 1638 HFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFY+LIDVWG R PF+FLEWIGMN Sbjct: 348 IMAIILHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYILIDVWGLRTPFLFLEWIGMN 407 Query: 1639 AMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFAEITF 1818 AMLVFVMAA+GIFAAF+NGWYY++ +N+LV+WI++HVF +VW SERVGTLLYVIFAEITF Sbjct: 408 AMLVFVMAAEGIFAAFVNGWYYEDPNNSLVHWIKKHVFVNVWNSERVGTLLYVIFAEITF 467 Query: 1819 WAVVSGILHKLGIYWKL 1869 W +V+G+LHKL IYWKL Sbjct: 468 WGMVAGLLHKLKIYWKL 484 >ref|XP_007027449.1| Uncharacterized protein TCM_022283 [Theobroma cacao] gi|508716054|gb|EOY07951.1| Uncharacterized protein TCM_022283 [Theobroma cacao] Length = 504 Score = 738 bits (1904), Expect = 0.0 Identities = 360/504 (71%), Positives = 408/504 (80%), Gaps = 12/504 (2%) Frame = +1 Query: 394 MEDPKKLEEGF----GASKNXXXXXXXXXXXXXXXG--DNVISTETERKSMIASELQAEE 555 M DP K+EEG GA G D + + + ER + E Q E Sbjct: 1 MADPGKMEEGLAHEEGAPDQERKDKRDGKVEKEEDGAADRIGNDKEERVATTHVEQQNLE 60 Query: 556 TEQ--KTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFI 729 + K KTKR+ATLDAFRGLT+VLMILVDDAGGAY RIDHSPWNGCTLADFVMPFFLFI Sbjct: 61 EQPLVKQKTKRIATLDAFRGLTVVLMILVDDAGGAYPRIDHSPWNGCTLADFVMPFFLFI 120 Query: 730 VGVAIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCG 909 VGVAIALALK++PK+ AI+K+ LRTLKLLFWG+LLQGGYSHAP DL+YG+DMK IRWCG Sbjct: 121 VGVAIALALKKVPKIKDAIKKISLRTLKLLFWGVLLQGGYSHAPADLAYGVDMKQIRWCG 180 Query: 910 ILQRIALVYFIVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALY 1089 ILQRIALVYFIVALIE LT K RPT LEP H SIF+AY+WQW+GGF AFVIYM+TT++LY Sbjct: 181 ILQRIALVYFIVALIETLTRKRRPTVLEPGHLSIFTAYRWQWIGGFVAFVIYMITTYSLY 240 Query: 1090 VPDWSF----EHKSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACT 1257 VP WSF + ++ ++ VKCGMRGHLGPACNAVGYVDR+ WGINHLYS PVW+RL+ACT Sbjct: 241 
VPHWSFVVDNDDEATRYTVKCGMRGHLGPACNAVGYVDREVWGINHLYSSPVWQRLKACT 300 Query: 1258 FSSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWV 1437 SSP SG R++AP+WCRAPFEPEGLLSSI AI+SGT+GIHYGHVLIHFKGH ERLK WV Sbjct: 301 LSSPGSGPFRENAPSWCRAPFEPEGLLSSILAILSGTMGIHYGHVLIHFKGHFERLKQWV 360 Query: 1438 SMXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMF 1617 SM HFTDAIPINKQLYSFSYVCFTA AAGIVFSAFY+LIDVWG R PF+F Sbjct: 361 SMALGLLIVAIILHFTDAIPINKQLYSFSYVCFTAAAAGIVFSAFYVLIDVWGFRTPFLF 420 Query: 1618 LEWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYV 1797 LEWIGMNAMLV+V+ AQGI AAF+NGWYY++S+N LV WIQ+HVF +VW SER+GTLLYV Sbjct: 421 LEWIGMNAMLVYVLGAQGILAAFVNGWYYESSNNTLVYWIQKHVFINVWHSERLGTLLYV 480 Query: 1798 IFAEITFWAVVSGILHKLGIYWKL 1869 IFAEI F+ V+SGILHKLGIYWKL Sbjct: 481 IFAEIAFYGVLSGILHKLGIYWKL 504 >ref|XP_006577710.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Glycine max] Length = 506 Score = 735 bits (1897), Expect = 0.0 Identities = 360/506 (71%), Positives = 409/506 (80%), Gaps = 15/506 (2%) Frame = +1 Query: 397 EDPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASEL-------QAEE 555 EDPK++EEG ++ N N S K +A + Q E Sbjct: 3 EDPKRMEEGLNSALNGDGNKDDLKKRATIKTSNGGSIFEHDKDTMAKPVAEGESVQQIAE 62 Query: 556 TEQ---KTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLF 726 EQ K KTKRVATLDAFRGLTIVLMILVDDAG AY RIDHSPWNGCTLADFVMPFFLF Sbjct: 63 QEQPPVKQKTKRVATLDAFRGLTIVLMILVDDAGEAYPRIDHSPWNGCTLADFVMPFFLF 122 Query: 727 IVGVAIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWC 906 IVGVAIALALKRI K+ +++K+ILRTLKLLFWGI+LQGGYSHAPDDL YG++MK IRWC Sbjct: 123 IVGVAIALALKRISKIKHSVKKIILRTLKLLFWGIILQGGYSHAPDDLEYGVNMKFIRWC 182 Query: 907 GILQRIALVYFIVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFAL 1086 GILQRIALVY +VALIE TTK+RPTTL H SIF+AYKW GGF AF+IYM+TTF+L Sbjct: 183 GILQRIALVYCVVALIETFTTKLRPTTLASGHLSIFAAYKW--FGGFVAFLIYMITTFSL 240 Query: 1087 YVPDWSF-EH----KSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRA 1251 YVPDWSF +H + +++ V CGMRGHLGPACNAVG+VDRQ WG+NHLYS PVW+RL+A Sbjct: 241 
YVPDWSFVDHFNGDEPKRYTVICGMRGHLGPACNAVGHVDRQVWGVNHLYSQPVWRRLKA 300 Query: 1252 CTFSSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKH 1431 CTFSSP SG R DAP+WC APFEPEGLLSSISAI+SGTIGIHYGHVLIHFKGH+ERLK Sbjct: 301 CTFSSPGSGPFRDDAPSWCLAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHSERLKQ 360 Query: 1432 WVSMXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPF 1611 WVSM HFTDA+PINKQLYSFSYVCFTAGAAGIVFS FY+LIDVWG R PF Sbjct: 361 WVSMGFVLLIIAIILHFTDALPINKQLYSFSYVCFTAGAAGIVFSGFYILIDVWGLRTPF 420 Query: 1612 MFLEWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLL 1791 +FLEWIGMNAMLVFVMAA+GIFAAF+NGWYY++ ++LV+WI++HVF +VW SERVGT+L Sbjct: 421 LFLEWIGMNAMLVFVMAAEGIFAAFVNGWYYEDPRSSLVHWIKKHVFVNVWHSERVGTIL 480 Query: 1792 YVIFAEITFWAVVSGILHKLGIYWKL 1869 YVIFAEITFW+VV+G+LHKLGIYWKL Sbjct: 481 YVIFAEITFWSVVAGVLHKLGIYWKL 506 >ref|XP_004494902.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Cicer arietinum] Length = 478 Score = 718 bits (1854), Expect = 0.0 Identities = 347/497 (69%), Positives = 405/497 (81%), Gaps = 5/497 (1%) Frame = +1 Query: 394 MEDPKKLEEGFGASKNXXXXXXXXXXXXXXXGDNVISTETERKSMIASELQAEETEQKTK 573 ME+ K++EEGF + + D + + ET + ++L E +Q+ K Sbjct: 1 MEETKRMEEGFNSPLDVK--------------DELKNQETNIEYDKDTKL---EQDQQIK 43 Query: 574 TKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALA 753 TKRVATLDAFRGLTIV+MILVD AGGA+ RIDH+PWNGCTLADFVMPFFLFIVGVAIALA Sbjct: 44 TKRVATLDAFRGLTIVMMILVDKAGGAFPRIDHAPWNGCTLADFVMPFFLFIVGVAIALA 103 Query: 754 LKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQRIALV 933 LKRI + +A++K++LRTLKLLFWGILLQGGYSHAPDDLSYG++MK IRWCGILQRIALV Sbjct: 104 LKRIHNIKYAVKKIMLRTLKLLFWGILLQGGYSHAPDDLSYGVNMKFIRWCGILQRIALV 163 Query: 934 YFIVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYVPDWSF-E 1110 Y IVALIE TTK+RPTTL P +IF+AYKW GGF AF +YM+TTF LYVPDWSF + Sbjct: 164 YCIVALIETFTTKLRPTTLTPQRLAIFTAYKW--FGGFVAFFVYMITTFTLYVPDWSFVD 221 Query: 1111 H----KSEKFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTFSSPSSG 1278 H + ++ V CGMRGHLGPACNAVG+VDRQ WG+NH YS PVW+ L+ 
CTF+SP G Sbjct: 222 HINGDEPRRYTVICGMRGHLGPACNAVGHVDRQVWGVNHFYSHPVWRHLKECTFNSPGEG 281 Query: 1279 KLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMXXXXX 1458 R+DAP+WCRAPFEPEGLLSSISAI+SGTIGIHYGHVLIHFKGH+ERLK WVSM Sbjct: 282 PFREDAPSWCRAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHSERLKQWVSMGFVLL 341 Query: 1459 XXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEWIGMN 1638 HFT+AIPINKQLYS SYVC TAGAAGIVFS+ Y+LIDVWG R PF+FLEWIGMN Sbjct: 342 TIAIILHFTNAIPINKQLYSISYVCLTAGAAGIVFSSLYILIDVWGIRTPFLFLEWIGMN 401 Query: 1639 AMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFAEITF 1818 +MLVFVMAA+GIFAAF+NGWYY++ +N+LV+WI++HVF +VW SERVGTLLYVIFAEITF Sbjct: 402 SMLVFVMAAEGIFAAFVNGWYYEDPNNSLVHWIKKHVFVNVWNSERVGTLLYVIFAEITF 461 Query: 1819 WAVVSGILHKLGIYWKL 1869 W +VSG+LHKLGIYWKL Sbjct: 462 WGIVSGVLHKLGIYWKL 478 >ref|XP_006595362.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Glycine max] Length = 482 Score = 717 bits (1851), Expect = 0.0 Identities = 338/420 (80%), Positives = 370/420 (88%), Gaps = 4/420 (0%) Frame = +1 Query: 622 LMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKRIPKVPFAIRKVIL 801 LM+LVDDAGGAY RIDHSPWNGCTLADFVMPFFLFIVGVAIALALKRIPKV +A++K+IL Sbjct: 65 LMVLVDDAGGAYPRIDHSPWNGCTLADFVMPFFLFIVGVAIALALKRIPKVKYAVKKIIL 124 Query: 802 RTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQRIALVYFIVALIEILTTKIRP 981 RTLKLLFWGILLQGGYSHAPDDLSYG+DM+ IRWCGILQRIALVY +VALIE TTK+RP Sbjct: 125 RTLKLLFWGILLQGGYSHAPDDLSYGVDMRFIRWCGILQRIALVYCVVALIETYTTKLRP 184 Query: 982 TTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYVPDWSF----EHKSEKFIVKCGMR 1149 +TL+P H SIF+AY+W LGGF AFVIYMVT F+LYVPDWSF K +++ V+CGMR Sbjct: 185 STLKPGHLSIFTAYRW--LGGFVAFVIYMVTIFSLYVPDWSFVDYNSDKPKRYTVECGMR 242 Query: 1150 GHLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTFSSPSSGKLRKDAPAWCRAPFEPE 1329 GHLGPACNAVGYVDRQ WG+NHLYS PVW RL+ACT SSP+ G LRK+APAWCRAPFEPE Sbjct: 243 GHLGPACNAVGYVDRQVWGVNHLYSQPVWTRLKACTLSSPAEGPLRKNAPAWCRAPFEPE 302 Query: 1330 GLLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMXXXXXXXXXXXHFTDAIPINKQ 1509 G LSS+ AI+SGTIGIHYGHVLIHFKGH ERLK 
W+SM HFTDAIPINKQ Sbjct: 303 GFLSSVLAILSGTIGIHYGHVLIHFKGHFERLKQWLSMGFVLLTLGLILHFTDAIPINKQ 362 Query: 1510 LYSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEWIGMNAMLVFVMAAQGIFAAFI 1689 LYSFSYVCFTAGAAGIVFS FYLLIDVWG R PF+FLEWIGMNAMLVFVMAAQGIFAAF+ Sbjct: 363 LYSFSYVCFTAGAAGIVFSVFYLLIDVWGLRTPFLFLEWIGMNAMLVFVMAAQGIFAAFV 422 Query: 1690 NGWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFAEITFWAVVSGILHKLGIYWKL 1869 NGWYYK+ DN+LV WIQ HVF +VW SER+GTLLYVIFAEITFW VV+GILHKLGIYWKL Sbjct: 423 NGWYYKDPDNSLVYWIQNHVFTNVWHSERLGTLLYVIFAEITFWGVVAGILHKLGIYWKL 482 >ref|XP_006836541.1| hypothetical protein AMTR_s00131p00031270 [Amborella trichopoda] gi|548839080|gb|ERM99394.1| hypothetical protein AMTR_s00131p00031270 [Amborella trichopoda] Length = 485 Score = 715 bits (1846), Expect = 0.0 Identities = 346/474 (72%), Positives = 391/474 (82%), Gaps = 13/474 (2%) Frame = +1 Query: 487 GDNVISTETERKSMIAS-----ELQAEETEQKTKTKRVATLDAFRGLTIVLMILVDDAGG 651 G+ V+ E K A+ EL +E +KT KRVATLDAFRGLTIV+MILVDDAGG Sbjct: 12 GEEVVVIHEEEKIEEATKEEKEELLQQEDGKKTSKKRVATLDAFRGLTIVVMILVDDAGG 71 Query: 652 AYARI-DHSPWNGCTLADFVMPFFLFIVGVAIALALKRIPKVPFAIRKVILRTLKLLFWG 828 AY +I DHSPWNGC LADFVMPFFLFIVGVAIALALK+IPKV A++KVILRTLKLLFWG Sbjct: 72 AYEQILDHSPWNGCRLADFVMPFFLFIVGVAIALALKKIPKVGDAVKKVILRTLKLLFWG 131 Query: 829 ILLQGGYSHAPDDLSYGIDMKLIRWCGILQRIALVYFIVALIEILTTKIRPTTLEPSHFS 1008 I+LQGGYSHAPDDLSYG+DMK IRWCGILQRIALVY +VA+IEI TTKIRPT L S S Sbjct: 132 IILQGGYSHAPDDLSYGVDMKHIRWCGILQRIALVYLVVAMIEIATTKIRPTMLGSSPLS 191 Query: 1009 IFSAYKWQWLGGFAAFVIYMVTTFALYVPDWSF-------EHKSEKFIVKCGMRGHLGPA 1167 IF+AY+WQW GGF AF+IY++TT++LYVPDWS+ E+ + F VKCGMR HLGPA Sbjct: 192 IFNAYRWQWFGGFIAFLIYIITTYSLYVPDWSYVLHHQNNENNEKIFTVKCGMRAHLGPA 251 Query: 1168 CNAVGYVDRQAWGINHLYSDPVWKRLRACTFSSPSSGKLRKDAPAWCRAPFEPEGLLSSI 1347 CNAVG+VDRQ WGINHLYS PVW+RL+ACT SSP SG LRKDA +WC AP+EPEGLLSSI Sbjct: 252 CNAVGHVDRQVWGINHLYSQPVWQRLKACTTSSPKSGPLRKDAASWCLAPYEPEGLLSSI 311 Query: 1348 SAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMXXXXXXXXXXXHFTDAIPINKQLYSFSY 1527 SAI+SGTIGIHYGHVLIHFK H ERLKHW+SM 
HFTDA+P+NKQLYS SY Sbjct: 312 SAILSGTIGIHYGHVLIHFKSHLERLKHWLSMGITLFIIGIILHFTDAMPLNKQLYSISY 371 Query: 1528 VCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEWIGMNAMLVFVMAAQGIFAAFINGWYYK 1707 VCFTAGAAGI+FS Y+LIDVW R+ FMFLEWIGMNAML+FV+ AQGIF AF+NGWYY+ Sbjct: 372 VCFTAGAAGILFSVLYMLIDVWRARIVFMFLEWIGMNAMLIFVLGAQGIFPAFVNGWYYE 431 Query: 1708 NSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFAEITFWAVVSGILHKLGIYWKL 1869 + +N LVNWIQ+HVF DVW SE +GTLLYVIFAEI FW VV+GILHKL IYWKL Sbjct: 432 DPENTLVNWIQKHVFVDVWNSENLGTLLYVIFAEIVFWGVVAGILHKLRIYWKL 485 >ref|XP_006428455.1| hypothetical protein CICLE_v10011557mg [Citrus clementina] gi|568877275|ref|XP_006491665.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like isoform X3 [Citrus sinensis] gi|557530512|gb|ESR41695.1| hypothetical protein CICLE_v10011557mg [Citrus clementina] Length = 419 Score = 712 bits (1838), Expect = 0.0 Identities = 331/419 (78%), Positives = 369/419 (88%), Gaps = 4/419 (0%) Frame = +1 Query: 625 MILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKRIPKVPFAIRKVILR 804 MILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALK++PK+ A++K+I R Sbjct: 1 MILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFR 60 Query: 805 TLKLLFWGILLQGGYSHAPDDLSYGIDMKLIRWCGILQRIALVYFIVALIEILTTKIRPT 984 TLKLLFWGI+LQGGYSHAPD LSYG+DMK IRWCGILQRIALVY +VALIE LTTK RP Sbjct: 61 TLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN 120 Query: 985 TLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTFALYVPDWSF----EHKSEKFIVKCGMRG 1152 LEP H SIF+AY+WQW+GGF AFVIY++TT++LYVP+WSF +H +K+IVKCGMRG Sbjct: 121 VLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRG 180 Query: 1153 HLGPACNAVGYVDRQAWGINHLYSDPVWKRLRACTFSSPSSGKLRKDAPAWCRAPFEPEG 1332 HLGPACNAVGYVDR+ WGINHLYSDPVW RL ACT SSP+SG LR+DAP+WCRAPFEPEG Sbjct: 181 HLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEG 240 Query: 1333 LLSSISAIVSGTIGIHYGHVLIHFKGHAERLKHWVSMXXXXXXXXXXXHFTDAIPINKQL 1512 LLS+ISAI+SGTIGIHYGHVLIHFKGH+ RLKHWVSM HFT+AIPINKQL Sbjct: 241 LLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIATILHFTNAIPINKQL 300 Query: 1513 
YSFSYVCFTAGAAGIVFSAFYLLIDVWGKRMPFMFLEWIGMNAMLVFVMAAQGIFAAFIN 1692 YSFSYVCFTAGAAGIVFSA Y+L+DVW R PF+FL+WIGMNAMLVFV+ AQGI A F+N Sbjct: 301 YSFSYVCFTAGAAGIVFSALYVLMDVWELRTPFLFLKWIGMNAMLVFVLGAQGILAGFVN 360 Query: 1693 GWYYKNSDNNLVNWIQEHVFNDVWKSERVGTLLYVIFAEITFWAVVSGILHKLGIYWKL 1869 GWYYKN DN LVNWIQ H+F VW SER+GTLLYVIFAEITFW VV+GILH+LGIYWKL Sbjct: 361 GWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWGVVAGILHRLGIYWKL 419 >ref|XP_003576108.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Brachypodium distachyon] Length = 498 Score = 707 bits (1826), Expect = 0.0 Identities = 328/450 (72%), Positives = 376/450 (83%), Gaps = 7/450 (1%) Frame = +1 Query: 541 LQAEETEQKTKTKRVATLDAFRGLTIVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFF 720 L E QK K+ RVA LDAFRGLTIV+MILVDDAG +Y R+DHSPWNGCTLADFVMPFF Sbjct: 49 LAVVEEPQKKKSTRVAALDAFRGLTIVVMILVDDAGSSYERMDHSPWNGCTLADFVMPFF 108 Query: 721 LFIVGVAIALALKRIPKVPFAIRKVILRTLKLLFWGILLQGGYSHAPDDLSYGIDMKLIR 900 LFIVGVAIA A+KR+P + A++KV +RTLK++FWG+LLQGGYSHAPDDL+YG+DMK+IR Sbjct: 109 LFIVGVAIAFAMKRVPNMGAAVKKVSVRTLKMIFWGLLLQGGYSHAPDDLAYGVDMKMIR 168 Query: 901 WCGILQRIALVYFIVALIEILTTKIRPTTLEPSHFSIFSAYKWQWLGGFAAFVIYMVTTF 1080 WCGILQRIALVYF VALIE+ TTK+RPTT+ ++IF AY+WQWLG F VIYM+TTF Sbjct: 169 WCGILQRIALVYFAVALIEVFTTKVRPTTVRSGPYAIFDAYRWQWLGAFIVLVIYMITTF 228 Query: 1081 ALYVPDWSFEHKSE-------KFIVKCGMRGHLGPACNAVGYVDRQAWGINHLYSDPVWK 1239 +LYVPDWSF + ++ +F V+CG+RGHL PACNAVG++DRQ WGINHLYS PVW Sbjct: 229 SLYVPDWSFVYHNDGDINDGKRFTVQCGVRGHLDPACNAVGFIDRQVWGINHLYSQPVWI 288 Query: 1240 RLRACTFSSPSSGKLRKDAPAWCRAPFEPEGLLSSISAIVSGTIGIHYGHVLIHFKGHAE 1419 R + CTFSSP +GKLR DAPAWC PFEPEGLLSSIS+I+SGTIGIHYGHVLIHFK H E Sbjct: 289 RTKDCTFSSPETGKLRDDAPAWCLGPFEPEGLLSSISSIISGTIGIHYGHVLIHFKTHKE 348 Query: 1420 RLKHWVSMXXXXXXXXXXXHFTDAIPINKQLYSFSYVCFTAGAAGIVFSAFYLLIDVWGK 1599 RL HW+SM HFT+AIPINKQLYSFSY+CFT GAAGIV SAFY LIDVWG Sbjct: 349 RLTHWLSMGFALLLLGILLHFTNAIPINKQLYSFSYICFTGGAAGIVLSAFYALIDVWGL 408 Query: 1600 RMPFMFLEWIGMNAMLVFVMAAQGIFAAFINGWYYKNSDNNLVNWIQEHVFNDVWKSERV 1779 
R+PF+FLEWIGMNAMLVFV+AAQGIFAAF+NGWYY++ D LVNWIQ+HVF +VW SE + Sbjct: 409 RVPFLFLEWIGMNAMLVFVLAAQGIFAAFMNGWYYESQDKTLVNWIQQHVFVNVWHSENL 468 Query: 1780 GTLLYVIFAEITFWAVVSGILHKLGIYWKL 1869 G LLYVIF EI FW VVSGILHKLGIYWKL Sbjct: 469 GNLLYVIFGEILFWGVVSGILHKLGIYWKL 498