BLASTX nr result
ID: Forsythia23_contig00008104
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00008104
         (1554 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_011088776.1| PREDICTED: uncharacterized protein LOC105169...   669   0.0
gb|EYU37964.1| hypothetical protein MIMGU_mgv1a006778mg [Erythra...   661   0.0
ref|XP_007199106.1| hypothetical protein PRUPE_ppa020666mg [Prun...   602   e-169
ref|XP_011046664.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   592   e-166
ref|XP_008377732.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   589   e-165
emb|CDP14550.1| unnamed protein product [Coffea canephora]            588   e-165
ref|XP_002312951.1| hypothetical protein POPTR_0009s13900g [Popu...   587   e-165
ref|XP_012081798.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   585   e-164
ref|XP_004292175.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   582   e-163
ref|XP_011045893.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   580   e-162
ref|XP_004248650.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   580   e-162
ref|XP_012081797.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   578   e-162
gb|KDP29672.1| hypothetical protein JCGZ_18834 [Jatropha curcas]      578   e-162
ref|XP_002306188.2| hypothetical protein POPTR_0004s18250g [Popu...   578   e-162
ref|XP_009779584.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   578   e-162
ref|XP_010107656.1| hypothetical protein L484_008373 [Morus nota...   572   e-160
ref|XP_008377733.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   570   e-160
ref|XP_009782767.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   570   e-159
ref|XP_006360902.1| PREDICTED: heparan-alpha-glucosaminide N-ace...
568 e-159 gb|KHN19992.1| Heparan-alpha-glucosaminide N-acetyltransferase [... 562 e-157 >ref|XP_011088776.1| PREDICTED: uncharacterized protein LOC105169921 [Sesamum indicum] Length = 754 Score = 669 bits (1726), Expect = 0.0 Identities = 330/422 (78%), Positives = 360/422 (85%), Gaps = 1/422 (0%) Frame = -2 Query: 1496 MAEIEPLLQSTGAGEVQPAEHEEQADKS-TSTKSKAARLASLDVFRGLCVFLMMLVDYAG 1320 MAEIEPLLQ G E +P E EEQA+ S T K KAAR+ASLDVFRGLCVFLMMLVDYAG Sbjct: 1 MAEIEPLLQRFG-DEHKPPESEEQANSSNTIAKRKAARVASLDVFRGLCVFLMMLVDYAG 59 Query: 1319 SIFPIITHVPWNGVHLADFVMPFFLFVAGVSVAIVYKKVSNRVDATWKAVLRALELFLLG 1140 SIFPII H PWNG+HLADFVMPFFLFVAGVSVAIVYKK S+RV+ATWKA+ RALELF+LG Sbjct: 60 SIFPIIAHAPWNGIHLADFVMPFFLFVAGVSVAIVYKKNSDRVEATWKALFRALELFILG 119 Query: 1139 VFLQGGYFHGINSLTYGVDIERIRLLGILQRIAIGYIVAALCEIWLPCQRWRHDGFPKIY 960 VFLQGGYFHG+ SLTYGVDIER+RLLGILQRIAIGYIVAALCEIWLPCQRWR GF + Y Sbjct: 120 VFLQGGYFHGVTSLTYGVDIERMRLLGILQRIAIGYIVAALCEIWLPCQRWRGVGFLRNY 179 Query: 959 ISHWCVVLSLCTIYLGLLYGLYVPNWQFRVVQSPSSLVTANNSHVYTVKCSVRGDLGPAC 780 WCV L L IYLG YGLYVP+WQ+ VVQ+ SS++T NNS VY VKCSVRGDL PAC Sbjct: 180 NWQWCVALLLAVIYLGFSYGLYVPDWQY-VVQADSSMLTVNNSRVYMVKCSVRGDLSPAC 238 Query: 779 NSAGMIDRYVLGIDHLYAKPVYRNLKECNISSHGQVPQTAPSWCHAPFDPEGILSSLTAA 600 N+AGM+DRY+LG+DHLYAKPVYRNLKECN+SSHGQVPQT+PSWCHAPFDPEG+LSSLTAA Sbjct: 239 NAAGMVDRYILGVDHLYAKPVYRNLKECNLSSHGQVPQTSPSWCHAPFDPEGVLSSLTAA 298 Query: 599 ASCIIGLQYGHILVQLQDHKERLRNWSIXXXXXXXXXXXXXXXGIPLNKSLYTISYLLVT 420 +CIIGLQYGHIL QLQ HKERL NWSI GIPLNKSLYTISYL+VT Sbjct: 299 VTCIIGLQYGHILTQLQQHKERLWNWSIFSFSLMGLGLFLVFLGIPLNKSLYTISYLMVT 358 Query: 419 SACAGITFCILYALIDVYGWRWLTCVLEWMGKHSLSIFILITSNIIVVTVQGFYLKSPDN 240 SA AGITFCILY L+D YGWR LTCV EWMGKHSLSIFILITSNIIV+ QGFYL++P+N Sbjct: 359 SASAGITFCILYTLVDAYGWRCLTCVWEWMGKHSLSIFILITSNIIVIIAQGFYLRAPEN 418 Query: 239 NI 234 NI Sbjct: 419 NI 420 >gb|EYU37964.1| hypothetical protein MIMGU_mgv1a006778mg [Erythranthe guttata] Length = 432 Score = 661 bits (1706), Expect = 0.0 Identities = 321/433 (74%), Positives = 365/433 
(84%), Gaps = 1/433 (0%) Frame = -2 Query: 1496 MAEIEPLLQSTGAGEVQPAEHEEQADKSTSTKS-KAARLASLDVFRGLCVFLMMLVDYAG 1320 MAE+EPL++S A E +P E +E A++S + KAAR+ASLDVFRGLCVFLMMLVDYAG Sbjct: 1 MAEMEPLMRSHAADEEKPMELDELANRSGGVANRKAARVASLDVFRGLCVFLMMLVDYAG 60 Query: 1319 SIFPIITHVPWNGVHLADFVMPFFLFVAGVSVAIVYKKVSNRVDATWKAVLRALELFLLG 1140 SIFP I+H PWNGVHLAD VMPFFLF AGVS+ IVYKKVS+RV+ATWKA+LR L+LF+LG Sbjct: 61 SIFPAISHAPWNGVHLADLVMPFFLFAAGVSIVIVYKKVSDRVEATWKAILRGLKLFMLG 120 Query: 1139 VFLQGGYFHGINSLTYGVDIERIRLLGILQRIAIGYIVAALCEIWLPCQRWRHDGFPKIY 960 VFLQGGYFHG+ SLTYGVD+E+IR LGILQRIA+GY+VAA+CEIWLP QR R DGF + Y Sbjct: 121 VFLQGGYFHGVTSLTYGVDVEKIRFLGILQRIAVGYVVAAMCEIWLPWQRRRGDGFRRNY 180 Query: 959 ISHWCVVLSLCTIYLGLLYGLYVPNWQFRVVQSPSSLVTANNSHVYTVKCSVRGDLGPAC 780 W V L L IYLG LYGLYVP+WQ+ VVQS SSLVTAN+++VY VKCSVRGDL PAC Sbjct: 181 HLQWFVALFLSVIYLGFLYGLYVPDWQY-VVQSDSSLVTANSTNVYEVKCSVRGDLSPAC 239 Query: 779 NSAGMIDRYVLGIDHLYAKPVYRNLKECNISSHGQVPQTAPSWCHAPFDPEGILSSLTAA 600 NSAGMIDRY+LG++HLYAKPVYRNLKECNISS G VPQ +PSWCH PFDPEGILSS+TAA Sbjct: 240 NSAGMIDRYILGVNHLYAKPVYRNLKECNISSQGHVPQNSPSWCHTPFDPEGILSSITAA 299 Query: 599 ASCIIGLQYGHILVQLQDHKERLRNWSIXXXXXXXXXXXXXXXGIPLNKSLYTISYLLVT 420 +CIIGLQYGHIL+Q Q HKERL NWS+ GIPLNKSLYTISYLLVT Sbjct: 300 VTCIIGLQYGHILIQSQHHKERLWNWSLFSVSLMGLGLLLTFFGIPLNKSLYTISYLLVT 359 Query: 419 SACAGITFCILYALIDVYGWRWLTCVLEWMGKHSLSIFILITSNIIVVTVQGFYLKSPDN 240 +A AGITFC LY L+DVYGWRWLT VLEWMGKHSLSIFIL+TSNI+V+T QGFYLKSP N Sbjct: 360 TASAGITFCTLYILVDVYGWRWLTFVLEWMGKHSLSIFILVTSNIVVITAQGFYLKSPHN 419 Query: 239 NIVHWIVSLFVHK 201 NIVHWI++ FV+K Sbjct: 420 NIVHWIITRFVNK 432 >ref|XP_007199106.1| hypothetical protein PRUPE_ppa020666mg [Prunus persica] gi|462394506|gb|EMJ00305.1| hypothetical protein PRUPE_ppa020666mg [Prunus persica] Length = 417 Score = 602 bits (1551), Expect = e-169 Identities = 281/407 (69%), Positives = 335/407 (82%) Frame = -2 Query: 1421 DKSTSTKSKAARLASLDVFRGLCVFLMMLVDYAGSIFPIITHVPWNGVHLADFVMPFFLF 1242 D + +K R+ASLDVFRGLCVFLMMLVDY GSIFPII H PWNG+HLADFVMPFFLF Sbjct: 
12 DHPATICAKPPRIASLDVFRGLCVFLMMLVDYGGSIFPIIAHSPWNGLHLADFVMPFFLF 71 Query: 1241 VAGVSVAIVYKKVSNRVDATWKAVLRALELFLLGVFLQGGYFHGINSLTYGVDIERIRLL 1062 +AGVS+A+VYKKV+NR +ATWKAV +AL+LFLLGV LQGGYFHG+ SLT+GVDIERIR Sbjct: 72 IAGVSLALVYKKVTNRAEATWKAVFKALKLFLLGVLLQGGYFHGVTSLTFGVDIERIRWF 131 Query: 1061 GILQRIAIGYIVAALCEIWLPCQRWRHDGFPKIYISHWCVVLSLCTIYLGLLYGLYVPNW 882 GILQRIA+GYIVAALCEIWL Q W GF K Y HWCV+ SL IY GLLYGLYVP+W Sbjct: 132 GILQRIALGYIVAALCEIWLSRQTWDEVGFFKSYYWHWCVIFSLSAIYAGLLYGLYVPDW 191 Query: 881 QFRVVQSPSSLVTANNSHVYTVKCSVRGDLGPACNSAGMIDRYVLGIDHLYAKPVYRNLK 702 +F+ + +P+S+ +++S VY VKCSVRGDLGPACNSAGMIDR++LG+DHLY KPVYRNLK Sbjct: 192 EFKAL-TPTSMRPSSDSFVYLVKCSVRGDLGPACNSAGMIDRFILGVDHLYLKPVYRNLK 250 Query: 701 ECNISSHGQVPQTAPSWCHAPFDPEGILSSLTAAASCIIGLQYGHILVQLQDHKERLRNW 522 ECN+S+ G+VP+++PSWCHAPFDPEGILSSLTAA +CIIGLQYGHIL ++DHK RL W Sbjct: 251 ECNLSADGEVPESSPSWCHAPFDPEGILSSLTAAVTCIIGLQYGHILAHIEDHKGRLNAW 310 Query: 521 SIXXXXXXXXXXXXXXXGIPLNKSLYTISYLLVTSACAGITFCILYALIDVYGWRWLTCV 342 S+ GIP+NKSLYTISY+L+TSA AGITFC LY LIDVYG+R +T V Sbjct: 311 SLFSVSIFVLGSFLAFIGIPVNKSLYTISYMLITSASAGITFCALYLLIDVYGYRCITSV 370 Query: 341 LEWMGKHSLSIFILITSNIIVVTVQGFYLKSPDNNIVHWIVSLFVHK 201 LEWMG HSLSIF+L+TSN+ ++ +QG Y P+NNIVHW+++ F+HK Sbjct: 371 LEWMGIHSLSIFVLVTSNLAIIAIQGLYWSDPENNIVHWVITRFLHK 417 >ref|XP_011046664.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Populus euphratica] Length = 419 Score = 592 bits (1527), Expect = e-166 Identities = 279/407 (68%), Positives = 332/407 (81%) Frame = -2 Query: 1421 DKSTSTKSKAARLASLDVFRGLCVFLMMLVDYAGSIFPIITHVPWNGVHLADFVMPFFLF 1242 ++ T K R+ASLDVFRGLCVFLMMLVDY G+I PII H PWNG+HLADFVMPFFLF Sbjct: 13 EEQPRTSKKTPRVASLDVFRGLCVFLMMLVDYGGAIVPIIAHSPWNGLHLADFVMPFFLF 72 Query: 1241 VAGVSVAIVYKKVSNRVDATWKAVLRALELFLLGVFLQGGYFHGINSLTYGVDIERIRLL 1062 +AGVS+A+VYK+V+NR++AT KAVLRA+ELFLLGV LQGGYFHGIN LTYGVD++RIR L Sbjct: 73 IAGVSLALVYKRVTNRIEATRKAVLRAVELFLLGVILQGGYFHGINYLTYGVDMKRIRWL 132 Query: 1061 
GILQRIAIGYIVAALCEIWLPCQRWRHDGFPKIYISHWCVVLSLCTIYLGLLYGLYVPNW 882 GILQRI++GYI AALCEIWL C+ R F K Y HW SL IYLGLLYGLYVP+W Sbjct: 133 GILQRISVGYIFAALCEIWLSCRSRRDVSFLKSYYWHWGAAFSLSAIYLGLLYGLYVPDW 192 Query: 881 QFRVVQSPSSLVTANNSHVYTVKCSVRGDLGPACNSAGMIDRYVLGIDHLYAKPVYRNLK 702 QF + + SS+ AN+S+VY VKCSVRGDLGPACNSAGMIDRYVLGIDHLY KPVYRNLK Sbjct: 193 QFEMANATSSVFPANHSYVYMVKCSVRGDLGPACNSAGMIDRYVLGIDHLYKKPVYRNLK 252 Query: 701 ECNISSHGQVPQTAPSWCHAPFDPEGILSSLTAAASCIIGLQYGHILVQLQDHKERLRNW 522 ECN+S++GQVP++APSWCHAPFDPEG+LSS+TAA +CIIGLQYGH L LQDHK+RL+NW Sbjct: 253 ECNMSTNGQVPESAPSWCHAPFDPEGVLSSITAAVTCIIGLQYGHSLAHLQDHKQRLQNW 312 Query: 521 SIXXXXXXXXXXXXXXXGIPLNKSLYTISYLLVTSACAGITFCILYALIDVYGWRWLTCV 342 + G P+NKSLYT SY+L+T A AGIT+ +Y L+DVYG+R LT Sbjct: 313 ILFSLSLLLIGLLLAVVGDPVNKSLYTFSYMLITCASAGITYSAIYLLVDVYGYRCLTFA 372 Query: 341 LEWMGKHSLSIFILITSNIIVVTVQGFYLKSPDNNIVHWIVSLFVHK 201 LEWMGKHSLSIF+LITSN++V+ +QGFY +P+NN++HWIV+ FV + Sbjct: 373 LEWMGKHSLSIFVLITSNLVVIAIQGFYWTAPENNLIHWIVTRFVRR 419 >ref|XP_008377732.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like isoform X1 [Malus domestica] Length = 417 Score = 589 bits (1519), Expect = e-165 Identities = 287/432 (66%), Positives = 333/432 (77%), Gaps = 1/432 (0%) Frame = -2 Query: 1496 MAEIEPLLQS-TGAGEVQPAEHEEQADKSTSTKSKAARLASLDVFRGLCVFLMMLVDYAG 1320 MA+ PLL + G G P K R+ASLDVFRGLCVFLMMLVDY G Sbjct: 1 MADYSPLLTAYDGPGTASP---------------KPPRVASLDVFRGLCVFLMMLVDYGG 45 Query: 1319 SIFPIITHVPWNGVHLADFVMPFFLFVAGVSVAIVYKKVSNRVDATWKAVLRALELFLLG 1140 SI PII H PWNG+HLADFVMPFFLF+AGVS+A+VYK+V+NRV+ATWKAV +A++LFLLG Sbjct: 46 SILPIIAHSPWNGLHLADFVMPFFLFIAGVSLALVYKRVTNRVEATWKAVFKAVKLFLLG 105 Query: 1139 VFLQGGYFHGINSLTYGVDIERIRLLGILQRIAIGYIVAALCEIWLPCQRWRHDGFPKIY 960 V LQGGYFHG+ SLTYGVDIERIR GILQRIAIGYI AALCEIWL Q GF + Y Sbjct: 106 VLLQGGYFHGVASLTYGVDIERIRWFGILQRIAIGYIAAALCEIWLSRQTLGEVGFFRTY 165 Query: 959 ISHWCVVLSLCTIYLGLLYGLYVPNWQFRVVQSPSSLVTANNSHVYTVKCSVRGDLGPAC 780 HWCV+ SL IY GLLYGLYVP+W+F+ +PSSL +N + Y VKCSVRGDLGPAC Sbjct: 166 
YWHWCVIFSLSAIYAGLLYGLYVPDWEFK-ASTPSSLPPSNATTTYVVKCSVRGDLGPAC 224 Query: 779 NSAGMIDRYVLGIDHLYAKPVYRNLKECNISSHGQVPQTAPSWCHAPFDPEGILSSLTAA 600 NSA MIDRY+LG DHLY KPVYRNLKECN+S+ G+VP+++PSWCH PFDPEGILSSLTAA Sbjct: 225 NSARMIDRYILGFDHLYLKPVYRNLKECNVSADGRVPESSPSWCHTPFDPEGILSSLTAA 284 Query: 599 ASCIIGLQYGHILVQLQDHKERLRNWSIXXXXXXXXXXXXXXXGIPLNKSLYTISYLLVT 420 +CIIGLQYGHIL +QDHKERL W GIP+NKSLYTISY+L+T Sbjct: 285 VTCIIGLQYGHILAHIQDHKERLNIWFFSSVLMFVLGLFLAFIGIPVNKSLYTISYMLIT 344 Query: 419 SACAGITFCILYALIDVYGWRWLTCVLEWMGKHSLSIFILITSNIIVVTVQGFYLKSPDN 240 SA AGITFC LY L+DVYG+R +T VLEWMG HSL+IF+++TSN+ V+ +QGFYL P N Sbjct: 345 SASAGITFCTLYLLVDVYGYRCMTYVLEWMGIHSLTIFVVVTSNLAVIAIQGFYLADPQN 404 Query: 239 NIVHWIVSLFVH 204 NIVHWI++ FVH Sbjct: 405 NIVHWIITRFVH 416 >emb|CDP14550.1| unnamed protein product [Coffea canephora] Length = 426 Score = 588 bits (1516), Expect = e-165 Identities = 293/442 (66%), Positives = 338/442 (76%), Gaps = 10/442 (2%) Frame = -2 Query: 1496 MAEIEPLL----QSTGAGEVQPAEHEE------QADKSTSTKSKAARLASLDVFRGLCVF 1347 MA+ EPLL + A V A+ E +A K R+ASLDVFRGL VF Sbjct: 1 MADFEPLLLRRRDADAAVAVAVADLGEDKQDVVEAAKDNKITPPRPRVASLDVFRGLSVF 60 Query: 1346 LMMLVDYAGSIFPIITHVPWNGVHLADFVMPFFLFVAGVSVAIVYKKVSNRVDATWKAVL 1167 LMMLVDYAGSIFPII H PWNG+HLADFVMPFFLFVAGVS+AIVYKKV +R+ A+WK VL Sbjct: 61 LMMLVDYAGSIFPIIAHSPWNGLHLADFVMPFFLFVAGVSLAIVYKKVPDRIQASWKVVL 120 Query: 1166 RALELFLLGVFLQGGYFHGINSLTYGVDIERIRLLGILQRIAIGYIVAALCEIWLPCQRW 987 RAL+LF LG+ LQGGY HG+ S+TYGVDIER+R+LGILQRIAIGY+VAALCEIWLP +RW Sbjct: 121 RALKLFFLGILLQGGYLHGVTSMTYGVDIERLRILGILQRIAIGYLVAALCEIWLPRRRW 180 Query: 986 RHDGFPKIYISHWCVVLSLCTIYLGLLYGLYVPNWQFRVVQSPSSLVTANNSHVYTVKCS 807 R +GFP Y+ HW +VLSL +Y+GLL+GLYVP+W+F ++ANN +Y VKCS Sbjct: 181 RKEGFPGNYLCHWFIVLSLVAVYVGLLHGLYVPDWKF---------ISANNGDIYEVKCS 231 Query: 806 VRGDLGPACNSAGMIDRYVLGIDHLYAKPVYRNLKECNISSHGQVPQTAPSWCHAPFDPE 627 VRGDL P CNSAGMIDRY+LGI HLY KPVYRNLKECN +S PSWC APF+PE Sbjct: 232 VRGDLQPGCNSAGMIDRYILGIQHLYNKPVYRNLKECNTNS-------VPSWCLAPFEPE 284 Query: 626 
GILSSLTAAASCIIGLQYGHILVQLQDHKERLRNWSIXXXXXXXXXXXXXXXGIPLNKSL 447 GILSS+TAA SCI+GLQ GHILV QDHKERL NWS+ GIPLNKSL Sbjct: 285 GILSSITAAVSCILGLQSGHILVHFQDHKERLYNWSLLSFSFLALGLLLSFIGIPLNKSL 344 Query: 446 YTISYLLVTSACAGITFCILYALIDVYGWRWLTCVLEWMGKHSLSIFILITSNIIVVTVQ 267 YTISYLLVTSA AGITFC+LY L+DV GWR LTCVLEWMGKHSLSIFIL+TSNI V+ +Q Sbjct: 345 YTISYLLVTSATAGITFCLLYVLVDVCGWRRLTCVLEWMGKHSLSIFILVTSNIAVIMIQ 404 Query: 266 GFYLKSPDNNIVHWIVSLFVHK 201 GFY ++P+NNIVHWI++ HK Sbjct: 405 GFYWRAPENNIVHWIITHVAHK 426 >ref|XP_002312951.1| hypothetical protein POPTR_0009s13900g [Populus trichocarpa] gi|222849359|gb|EEE86906.1| hypothetical protein POPTR_0009s13900g [Populus trichocarpa] Length = 419 Score = 587 bits (1514), Expect = e-165 Identities = 276/407 (67%), Positives = 329/407 (80%) Frame = -2 Query: 1421 DKSTSTKSKAARLASLDVFRGLCVFLMMLVDYAGSIFPIITHVPWNGVHLADFVMPFFLF 1242 ++ T K R ASLDVFRGLCVFLMMLVDY G+I PII H PWNG+HLAD VMPFFLF Sbjct: 13 EEQLHTSKKPPRAASLDVFRGLCVFLMMLVDYGGAIIPIIAHSPWNGLHLADSVMPFFLF 72 Query: 1241 VAGVSVAIVYKKVSNRVDATWKAVLRALELFLLGVFLQGGYFHGINSLTYGVDIERIRLL 1062 +AGVS+A+VYKKV NR++ATWKAVL+A++LFLLGV +QGGYFHGINSLTYGVD++RIR L Sbjct: 73 IAGVSLALVYKKVPNRIEATWKAVLKAIKLFLLGVVIQGGYFHGINSLTYGVDMKRIRWL 132 Query: 1061 GILQRIAIGYIVAALCEIWLPCQRWRHDGFPKIYISHWCVVLSLCTIYLGLLYGLYVPNW 882 GILQ+I++GYIVAALCEIWL C+ R F K Y HWCV SL IYLGLLYGLYVP+W Sbjct: 133 GILQKISVGYIVAALCEIWLSCRTRRGVSFLKSYYWHWCVAFSLSAIYLGLLYGLYVPDW 192 Query: 881 QFRVVQSPSSLVTANNSHVYTVKCSVRGDLGPACNSAGMIDRYVLGIDHLYAKPVYRNLK 702 QF + + SS+ N+S+VY VKCS+RGDLGPACNSAGMIDRY+LGIDHLY KPVYRNLK Sbjct: 193 QFEMSNATSSVFPTNHSNVYMVKCSLRGDLGPACNSAGMIDRYILGIDHLYKKPVYRNLK 252 Query: 701 ECNISSHGQVPQTAPSWCHAPFDPEGILSSLTAAASCIIGLQYGHILVQLQDHKERLRNW 522 ECN+S+ GQVP + SWCHAPFDPEG+LSSLTAA +CIIGLQYGH+L LQDHK R+ NW Sbjct: 253 ECNMSTDGQVPDNSASWCHAPFDPEGVLSSLTAAVTCIIGLQYGHLLAHLQDHKGRMENW 312 Query: 521 SIXXXXXXXXXXXXXXXGIPLNKSLYTISYLLVTSACAGITFCILYALIDVYGWRWLTCV 342 ++ G P+NKSLYT SY+L+TSA AGIT+ LY L+DVY +R LT V Sbjct: 313 
TLFSFSLLVVGLLLVVIGDPVNKSLYTFSYMLITSASAGITYSALYLLVDVYDYRCLTFV 372 Query: 341 LEWMGKHSLSIFILITSNIIVVTVQGFYLKSPDNNIVHWIVSLFVHK 201 LEWMGKHSLSIF+L++SN+ V+T+QGF +P+NN++HWIVS FV + Sbjct: 373 LEWMGKHSLSIFVLVSSNLAVITIQGFCWAAPENNMIHWIVSRFVRR 419 >ref|XP_012081798.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like isoform X2 [Jatropha curcas] Length = 416 Score = 585 bits (1509), Expect = e-164 Identities = 277/406 (68%), Positives = 332/406 (81%) Frame = -2 Query: 1418 KSTSTKSKAARLASLDVFRGLCVFLMMLVDYAGSIFPIITHVPWNGVHLADFVMPFFLFV 1239 +S + +K R+ASLDVFRGLCVFLMM+VDY GSIFPII H PWNG+ LADFVMPFFLF+ Sbjct: 12 QSPLSDNKPTRVASLDVFRGLCVFLMMIVDYLGSIFPIIAHSPWNGLRLADFVMPFFLFI 71 Query: 1238 AGVSVAIVYKKVSNRVDATWKAVLRALELFLLGVFLQGGYFHGINSLTYGVDIERIRLLG 1059 AGVS+A+VYKKVS+RVDATWKAVL+A +LF LGVFLQGGYFHGINSL YGVDIERIR LG Sbjct: 72 AGVSLALVYKKVSDRVDATWKAVLKAAKLFFLGVFLQGGYFHGINSLAYGVDIERIRWLG 131 Query: 1058 ILQRIAIGYIVAALCEIWLPCQRWRHDGFPKIYISHWCVVLSLCTIYLGLLYGLYVPNWQ 879 ILQRI+IGYIVAALCEIWL + R GF K Y HW + SLC IY GLL+GLYVP+WQ Sbjct: 132 ILQRISIGYIVAALCEIWLSSRPIREIGFFKPYYWHWVLAFSLCAIYTGLLHGLYVPDWQ 191 Query: 878 FRVVQSPSSLVTANNSHVYTVKCSVRGDLGPACNSAGMIDRYVLGIDHLYAKPVYRNLKE 699 F + S SS++ N S+VY V CSVRGDLGPACNSAGMIDRYVLGIDHLY KPVYRNLKE Sbjct: 192 FEISNSTSSVLPNNGSYVYLVSCSVRGDLGPACNSAGMIDRYVLGIDHLYTKPVYRNLKE 251 Query: 698 CNISSHGQVPQTAPSWCHAPFDPEGILSSLTAAASCIIGLQYGHILVQLQDHKERLRNWS 519 CN+ ++GQV + +PSWCHAP+DPEG++SSLTAA +CIIGLQ+GH+L ++DHK R+ WS Sbjct: 252 CNM-TNGQVSENSPSWCHAPYDPEGLISSLTAAVTCIIGLQFGHVLAHVKDHKGRVECWS 310 Query: 518 IXXXXXXXXXXXXXXXGIPLNKSLYTISYLLVTSACAGITFCILYALIDVYGWRWLTCVL 339 GIP+NKSLYTISY+L+TSA AGITF +LY ++DVYG+RW++ L Sbjct: 311 FFSFSLLLLGSSLAFVGIPVNKSLYTISYMLITSALAGITFSVLYLVVDVYGYRWVSLPL 370 Query: 338 EWMGKHSLSIFILITSNIIVVTVQGFYLKSPDNNIVHWIVSLFVHK 201 EWMG+HSLSIF+L+TSN+I++ +QGFY P+NNI+H IV+ FVH+ Sbjct: 371 EWMGRHSLSIFVLLTSNLIIIAIQGFYWSKPENNIIHQIVASFVHR 416 >ref|XP_004292175.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like isoform X1 [Fragaria 
vesca subsp. vesca] Length = 419 Score = 582 bits (1500), Expect = e-163 Identities = 273/399 (68%), Positives = 326/399 (81%) Frame = -2 Query: 1397 KAARLASLDVFRGLCVFLMMLVDYAGSIFPIITHVPWNGVHLADFVMPFFLFVAGVSVAI 1218 K R+ASLDVFRGLCVFLMM+VDY GSI P I H PW G+HLADFVMPFFLF+AGVS+A+ Sbjct: 22 KPPRVASLDVFRGLCVFLMMVVDYGGSIVPAIAHSPWTGLHLADFVMPFFLFIAGVSLAL 81 Query: 1217 VYKKVSNRVDATWKAVLRALELFLLGVFLQGGYFHGINSLTYGVDIERIRLLGILQRIAI 1038 VYK+VSNRV+ATWKAV RA++LFLLGV LQGGYFHG+ SLT+GVDIERIR GILQRIAI Sbjct: 82 VYKRVSNRVEATWKAVFRAVKLFLLGVLLQGGYFHGVASLTFGVDIERIRWFGILQRIAI 141 Query: 1037 GYIVAALCEIWLPCQRWRHDGFPKIYISHWCVVLSLCTIYLGLLYGLYVPNWQFRVVQSP 858 GY+VAALCEIWL + GF + Y HWC + L IY GLLYGLYVP+W+F+ +P Sbjct: 142 GYMVAALCEIWLSRRTSSEVGFFRSYYWHWCAIFLLSAIYSGLLYGLYVPDWEFK-ASTP 200 Query: 857 SSLVTANNSHVYTVKCSVRGDLGPACNSAGMIDRYVLGIDHLYAKPVYRNLKECNISSHG 678 + L +N+SHVY VKCS+RGDLGP CNSAGMIDRY++G+DHLY+KPVYRNLKECN+S+ G Sbjct: 201 TYLTPSNDSHVYVVKCSMRGDLGPGCNSAGMIDRYIVGVDHLYSKPVYRNLKECNMSTGG 260 Query: 677 QVPQTAPSWCHAPFDPEGILSSLTAAASCIIGLQYGHILVQLQDHKERLRNWSIXXXXXX 498 ++P+++PSWCH PFDPEGILS+LTAA +CIIGLQYGHIL +QDHK RL WS+ Sbjct: 261 RIPESSPSWCHTPFDPEGILSTLTAAVTCIIGLQYGHILAHIQDHKGRLNIWSLFSVSMF 320 Query: 497 XXXXXXXXXGIPLNKSLYTISYLLVTSACAGITFCILYALIDVYGWRWLTCVLEWMGKHS 318 G+P+NKSLYTISYLL+TSA AG+TFC LY LIDVYG+R +T VLEWMG HS Sbjct: 321 VLGSFLAFIGVPVNKSLYTISYLLITSASAGMTFCALYLLIDVYGYRCITFVLEWMGIHS 380 Query: 317 LSIFILITSNIIVVTVQGFYLKSPDNNIVHWIVSLFVHK 201 LSIFI++TSN+ V+ +QGFY P+NNIVHWI++ FVHK Sbjct: 381 LSIFIVVTSNLAVIAIQGFYWTHPENNIVHWIITPFVHK 419 >ref|XP_011045893.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like isoform X1 [Populus euphratica] gi|743792967|ref|XP_011045901.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like isoform X2 [Populus euphratica] Length = 428 Score = 580 bits (1494), Expect = e-162 Identities = 270/407 (66%), Positives = 327/407 (80%) Frame = -2 Query: 1421 DKSTSTKSKAARLASLDVFRGLCVFLMMLVDYAGSIFPIITHVPWNGVHLADFVMPFFLF 1242 ++ T K R ASLDVFRGLCV 
LMMLVDY G+IFPII H PWNG+HLAD VMPFFLF Sbjct: 22 EEQLHTSKKPQRAASLDVFRGLCVLLMMLVDYGGAIFPIIAHSPWNGLHLADSVMPFFLF 81 Query: 1241 VAGVSVAIVYKKVSNRVDATWKAVLRALELFLLGVFLQGGYFHGINSLTYGVDIERIRLL 1062 +AGVS+A+VYKKV NR++ATWKAV++A++LFLLGV +QGGYFHGINSLTYGVD++RIR L Sbjct: 82 IAGVSLALVYKKVPNRIEATWKAVIKAIKLFLLGVVIQGGYFHGINSLTYGVDMKRIRWL 141 Query: 1061 GILQRIAIGYIVAALCEIWLPCQRWRHDGFPKIYISHWCVVLSLCTIYLGLLYGLYVPNW 882 GILQ+I++GYIVAALCEIWL C+ R F K Y HWCV SL IYLGLLYGLYVP+W Sbjct: 142 GILQKISVGYIVAALCEIWLSCRTRREVSFLKSYYWHWCVAFSLSAIYLGLLYGLYVPDW 201 Query: 881 QFRVVQSPSSLVTANNSHVYTVKCSVRGDLGPACNSAGMIDRYVLGIDHLYAKPVYRNLK 702 QF + ++ SS+ N+S++Y VKCS+RG+LGPACNSAGMIDRY+LGIDHLY KPVYRNLK Sbjct: 202 QFEMSKATSSVFPTNHSYIYMVKCSLRGNLGPACNSAGMIDRYILGIDHLYKKPVYRNLK 261 Query: 701 ECNISSHGQVPQTAPSWCHAPFDPEGILSSLTAAASCIIGLQYGHILVQLQDHKERLRNW 522 ECN+S+ G VP + SWCHAPFDPEG+LSSLTAA +CIIGLQYGH+L LQDHK R+ W Sbjct: 262 ECNMSTDGHVPDNSASWCHAPFDPEGVLSSLTAAVTCIIGLQYGHLLAHLQDHKGRMEKW 321 Query: 521 SIXXXXXXXXXXXXXXXGIPLNKSLYTISYLLVTSACAGITFCILYALIDVYGWRWLTCV 342 ++ G P+NKSLYT SY+L+TSA AGIT+ LY L+DVY +R LT V Sbjct: 322 TLFSFSLLVVGLLLAVIGDPVNKSLYTFSYMLITSASAGITYSALYLLVDVYEYRCLTFV 381 Query: 341 LEWMGKHSLSIFILITSNIIVVTVQGFYLKSPDNNIVHWIVSLFVHK 201 LEWMGKHSLSIF+L++SN+ V+T+QGF +P+NN++HW VS FV + Sbjct: 382 LEWMGKHSLSIFVLVSSNLAVITIQGFCWAAPENNMIHWFVSRFVRR 428 >ref|XP_004248650.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform X1 [Solanum lycopersicum] gi|723736060|ref|XP_010327400.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform X1 [Solanum lycopersicum] gi|723736063|ref|XP_010327401.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform X1 [Solanum lycopersicum] gi|723736066|ref|XP_010327402.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform X1 [Solanum lycopersicum] gi|723736069|ref|XP_010327403.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform X1 [Solanum lycopersicum] gi|723736074|ref|XP_010327404.1| PREDICTED: 
heparan-alpha-glucosaminide N-acetyltransferase isoform X1 [Solanum lycopersicum] Length = 420 Score = 580 bits (1494), Expect = e-162 Identities = 289/432 (66%), Positives = 329/432 (76%) Frame = -2 Query: 1496 MAEIEPLLQSTGAGEVQPAEHEEQADKSTSTKSKAARLASLDVFRGLCVFLMMLVDYAGS 1317 MAE EPLL S GEV AE E +A T++K R+ SLDVFRGLCVFLM+LVDYAGS Sbjct: 1 MAENEPLLGSNNGGEVVLAERESEA-----TQTKTTRIVSLDVFRGLCVFLMILVDYAGS 55 Query: 1316 IFPIITHVPWNGVHLADFVMPFFLFVAGVSVAIVYKKVSNRVDATWKAVLRALELFLLGV 1137 +FP I H PWNGV LADFVMPFFLFV GVSVAIV K V +R AT K V+R L+LF+LG+ Sbjct: 56 VFPSIAHSPWNGVRLADFVMPFFLFVVGVSVAIVNKIVLDRTGATMKVVIRTLKLFILGI 115 Query: 1136 FLQGGYFHGINSLTYGVDIERIRLLGILQRIAIGYIVAALCEIWLPCQRWRHDGFPKIYI 957 FLQGGY HGI LTYGVDIERIR +GILQRIA+GYIVAALCE+WLPCQ + + YI Sbjct: 116 FLQGGYLHGITGLTYGVDIERIRWMGILQRIAVGYIVAALCEVWLPCQEMKRFALFRNYI 175 Query: 956 SHWCVVLSLCTIYLGLLYGLYVPNWQFRVVQSPSSLVTANNSHVYTVKCSVRGDLGPACN 777 W ++ L I+ GLLYGLYVP+WQF V QS S +Y VKCSVRGDLGPACN Sbjct: 176 CQWFIMFLLSAIHCGLLYGLYVPDWQFSVSQS-------TGSTIYEVKCSVRGDLGPACN 228 Query: 776 SAGMIDRYVLGIDHLYAKPVYRNLKECNISSHGQVPQTAPSWCHAPFDPEGILSSLTAAA 597 SAGMIDRY+LG+DHLY KPVYRN+KECN S+ V ++ PSWCHA FDPEGI+SSLTAAA Sbjct: 229 SAGMIDRYILGLDHLYTKPVYRNMKECNGSNRDTVSESMPSWCHATFDPEGIVSSLTAAA 288 Query: 596 SCIIGLQYGHILVQLQDHKERLRNWSIXXXXXXXXXXXXXXXGIPLNKSLYTISYLLVTS 417 + IIGLQYGHILVQ QDHK RL NWSI G+PLNKSLYTISY+LVTS Sbjct: 289 TSIIGLQYGHILVQFQDHKGRLYNWSILSLSLLVVGLFLDFIGMPLNKSLYTISYMLVTS 348 Query: 416 ACAGITFCILYALIDVYGWRWLTCVLEWMGKHSLSIFILITSNIIVVTVQGFYLKSPDNN 237 A GITFC+LY L+D+YGWR L VLEWMGKHSLSIFILITSNI V+ +QGFY + P+NN Sbjct: 349 AAGGITFCLLYLLVDIYGWRRLMFVLEWMGKHSLSIFILITSNIAVIFIQGFYWRDPENN 408 Query: 236 IVHWIVSLFVHK 201 I+ WIV+ FV K Sbjct: 409 IIRWIVTRFVQK 420 >ref|XP_012081797.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like isoform X1 [Jatropha curcas] Length = 418 Score = 578 bits (1491), Expect = e-162 Identities = 278/421 (66%), Positives = 337/421 (80%) Frame = -2 Query: 1463 
GAGEVQPAEHEEQADKSTSTKSKAARLASLDVFRGLCVFLMMLVDYAGSIFPIITHVPWN 1284 G G ++ E E ++++ TS RLASLDVFRG+ + LMM+VDY GSIFPII H PWN Sbjct: 5 GYGLLKIDEGELKSNRRTS------RLASLDVFRGISILLMMIVDYLGSIFPIIAHSPWN 58 Query: 1283 GVHLADFVMPFFLFVAGVSVAIVYKKVSNRVDATWKAVLRALELFLLGVFLQGGYFHGIN 1104 G+ LADFVMPFFLF+AGVS+A+VYKKVS+RVDATWKAVL+A +LF LGVFLQGGYFHGIN Sbjct: 59 GLRLADFVMPFFLFIAGVSLALVYKKVSDRVDATWKAVLKAAKLFFLGVFLQGGYFHGIN 118 Query: 1103 SLTYGVDIERIRLLGILQRIAIGYIVAALCEIWLPCQRWRHDGFPKIYISHWCVVLSLCT 924 SL YGVDIERIR LGILQRI+IGYIVAALCEIWL + R GF K Y HW + SLC Sbjct: 119 SLAYGVDIERIRWLGILQRISIGYIVAALCEIWLSSRPIREIGFFKPYYWHWVLAFSLCA 178 Query: 923 IYLGLLYGLYVPNWQFRVVQSPSSLVTANNSHVYTVKCSVRGDLGPACNSAGMIDRYVLG 744 IY GLL+GLYVP+WQF + S SS++ N S+VY V CSVRGDLGPACNSAGMIDRYVLG Sbjct: 179 IYTGLLHGLYVPDWQFEISNSTSSVLPNNGSYVYLVSCSVRGDLGPACNSAGMIDRYVLG 238 Query: 743 IDHLYAKPVYRNLKECNISSHGQVPQTAPSWCHAPFDPEGILSSLTAAASCIIGLQYGHI 564 IDHLY KPVYRNLKECN+ ++GQV + +PSWCHAP+DPEG++SSLTAA +CIIGLQ+GH+ Sbjct: 239 IDHLYTKPVYRNLKECNM-TNGQVSENSPSWCHAPYDPEGLISSLTAAVTCIIGLQFGHV 297 Query: 563 LVQLQDHKERLRNWSIXXXXXXXXXXXXXXXGIPLNKSLYTISYLLVTSACAGITFCILY 384 L ++DHK R+ WS GIP+NKSLYTISY+L+TSA AGITF +LY Sbjct: 298 LAHVKDHKGRVECWSFFSFSLLLLGSSLAFVGIPVNKSLYTISYMLITSALAGITFSVLY 357 Query: 383 ALIDVYGWRWLTCVLEWMGKHSLSIFILITSNIIVVTVQGFYLKSPDNNIVHWIVSLFVH 204 ++DVYG+RW++ LEWMG+HSLSIF+L+TSN+I++ +QGFY P+NNI+H IV+ FVH Sbjct: 358 LVVDVYGYRWVSLPLEWMGRHSLSIFVLLTSNLIIIAIQGFYWSKPENNIIHQIVASFVH 417 Query: 203 K 201 + Sbjct: 418 R 418 >gb|KDP29672.1| hypothetical protein JCGZ_18834 [Jatropha curcas] Length = 416 Score = 578 bits (1491), Expect = e-162 Identities = 278/421 (66%), Positives = 337/421 (80%) Frame = -2 Query: 1463 GAGEVQPAEHEEQADKSTSTKSKAARLASLDVFRGLCVFLMMLVDYAGSIFPIITHVPWN 1284 G G ++ E E ++++ TS RLASLDVFRG+ + LMM+VDY GSIFPII H PWN Sbjct: 3 GYGLLKIDEGELKSNRRTS------RLASLDVFRGISILLMMIVDYLGSIFPIIAHSPWN 56 Query: 1283 GVHLADFVMPFFLFVAGVSVAIVYKKVSNRVDATWKAVLRALELFLLGVFLQGGYFHGIN 1104 G+ LADFVMPFFLF+AGVS+A+VYKKVS+RVDATWKAVL+A +LF 
LGVFLQGGYFHGIN Sbjct: 57 GLRLADFVMPFFLFIAGVSLALVYKKVSDRVDATWKAVLKAAKLFFLGVFLQGGYFHGIN 116 Query: 1103 SLTYGVDIERIRLLGILQRIAIGYIVAALCEIWLPCQRWRHDGFPKIYISHWCVVLSLCT 924 SL YGVDIERIR LGILQRI+IGYIVAALCEIWL + R GF K Y HW + SLC Sbjct: 117 SLAYGVDIERIRWLGILQRISIGYIVAALCEIWLSSRPIREIGFFKPYYWHWVLAFSLCA 176 Query: 923 IYLGLLYGLYVPNWQFRVVQSPSSLVTANNSHVYTVKCSVRGDLGPACNSAGMIDRYVLG 744 IY GLL+GLYVP+WQF + S SS++ N S+VY V CSVRGDLGPACNSAGMIDRYVLG Sbjct: 177 IYTGLLHGLYVPDWQFEISNSTSSVLPNNGSYVYLVSCSVRGDLGPACNSAGMIDRYVLG 236 Query: 743 IDHLYAKPVYRNLKECNISSHGQVPQTAPSWCHAPFDPEGILSSLTAAASCIIGLQYGHI 564 IDHLY KPVYRNLKECN+ ++GQV + +PSWCHAP+DPEG++SSLTAA +CIIGLQ+GH+ Sbjct: 237 IDHLYTKPVYRNLKECNM-TNGQVSENSPSWCHAPYDPEGLISSLTAAVTCIIGLQFGHV 295 Query: 563 LVQLQDHKERLRNWSIXXXXXXXXXXXXXXXGIPLNKSLYTISYLLVTSACAGITFCILY 384 L ++DHK R+ WS GIP+NKSLYTISY+L+TSA AGITF +LY Sbjct: 296 LAHVKDHKGRVECWSFFSFSLLLLGSSLAFVGIPVNKSLYTISYMLITSALAGITFSVLY 355 Query: 383 ALIDVYGWRWLTCVLEWMGKHSLSIFILITSNIIVVTVQGFYLKSPDNNIVHWIVSLFVH 204 ++DVYG+RW++ LEWMG+HSLSIF+L+TSN+I++ +QGFY P+NNI+H IV+ FVH Sbjct: 356 LVVDVYGYRWVSLPLEWMGRHSLSIFVLLTSNLIIIAIQGFYWSKPENNIIHQIVASFVH 415 Query: 203 K 201 + Sbjct: 416 R 416 >ref|XP_002306188.2| hypothetical protein POPTR_0004s18250g [Populus trichocarpa] gi|550341311|gb|EEE86699.2| hypothetical protein POPTR_0004s18250g [Populus trichocarpa] Length = 422 Score = 578 bits (1491), Expect = e-162 Identities = 277/410 (67%), Positives = 327/410 (79%), Gaps = 3/410 (0%) Frame = -2 Query: 1421 DKSTSTKSKAARLASLDVFRGLCVFLMMLVDYAGSIFPIITHVPWNGVHLADFVMPFFLF 1242 ++ T K R+ASLDVFRGLCVFLMMLVDY G+I PII H PWNG+HLADFVMPFFLF Sbjct: 13 EEQPRTSKKTPRVASLDVFRGLCVFLMMLVDYGGAIVPIIAHSPWNGLHLADFVMPFFLF 72 Query: 1241 VAGVSVAIVYKKVSNRVDATWKAVLRALELFLLGVFLQGGYFHGINSLTYGVDIERIRLL 1062 AGVS+A+VYK+V NR++AT KAVLRA+ELFLLGV LQGGYFHGIN LTYGVD++RIR L Sbjct: 73 TAGVSLALVYKRVPNRIEATRKAVLRAVELFLLGVILQGGYFHGINFLTYGVDMKRIRWL 132 Query: 1061 GILQRIAIGYIVAALCEIWLPCQRWRHDGFPKIYISHWCVVLSLCTIYLGLLYGLYVPNW 882 GILQRI+IGYI AALCEIWL C+ R F K 
Y HW SL IYLGLLYGLYVP+W Sbjct: 133 GILQRISIGYIFAALCEIWLSCRSRRDVSFLKSYYWHWGAAFSLSAIYLGLLYGLYVPDW 192 Query: 881 QFRVVQSPSSLVTANNSHVY---TVKCSVRGDLGPACNSAGMIDRYVLGIDHLYAKPVYR 711 QF + + SS+ N+S+VY VKCSVRGDLGPACNSAGMIDRYVLGIDHLY KPVYR Sbjct: 193 QFEMSNATSSVFPTNHSYVYMLTQVKCSVRGDLGPACNSAGMIDRYVLGIDHLYKKPVYR 252 Query: 710 NLKECNISSHGQVPQTAPSWCHAPFDPEGILSSLTAAASCIIGLQYGHILVQLQDHKERL 531 NLKECN+S++GQVP++APSWCHAPFDPEG+LSS+TAA +CIIGLQYGH L LQDHK+R+ Sbjct: 253 NLKECNMSTNGQVPESAPSWCHAPFDPEGVLSSITAAVACIIGLQYGHSLAHLQDHKQRM 312 Query: 530 RNWSIXXXXXXXXXXXXXXXGIPLNKSLYTISYLLVTSACAGITFCILYALIDVYGWRWL 351 +NW + G P+NKSLYT Y+L+T A AGIT+ +Y L+DVYG+R L Sbjct: 313 QNWILFSLSLLLVGLLLAVVGDPVNKSLYTFGYMLITCASAGITYSAIYLLVDVYGYRCL 372 Query: 350 TCVLEWMGKHSLSIFILITSNIIVVTVQGFYLKSPDNNIVHWIVSLFVHK 201 T LEWMGKHSLSIF+LITSN+ V+ +QGFY K+P+NN++ WIV+ FV + Sbjct: 373 TFALEWMGKHSLSIFVLITSNLAVIAIQGFYWKAPENNLIQWIVTRFVRR 422 >ref|XP_009779584.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Nicotiana sylvestris] gi|698588868|ref|XP_009779585.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Nicotiana sylvestris] Length = 419 Score = 578 bits (1490), Expect = e-162 Identities = 292/432 (67%), Positives = 333/432 (77%) Frame = -2 Query: 1496 MAEIEPLLQSTGAGEVQPAEHEEQADKSTSTKSKAARLASLDVFRGLCVFLMMLVDYAGS 1317 MAE +PLL+S V+ +E E+ KST++ AR+ SLDVFRGLCVFLMMLVDYAGS Sbjct: 1 MAENQPLLRSDDNEVVRESEGTER--KSTAS----ARVVSLDVFRGLCVFLMMLVDYAGS 54 Query: 1316 IFPIITHVPWNGVHLADFVMPFFLFVAGVSVAIVYKKVSNRVDATWKAVLRALELFLLGV 1137 +FP I H PWNGV LADFVMPFFLFV GVS+AIV K V +R AT K V+R L+LFLLGV Sbjct: 55 VFPSIAHSPWNGVRLADFVMPFFLFVVGVSLAIVNKIVVDRTRATLKVVIRTLKLFLLGV 114 Query: 1136 FLQGGYFHGINSLTYGVDIERIRLLGILQRIAIGYIVAALCEIWLPCQRWRHDGFPKIYI 957 FLQGGY HGI LTYGVDIE+IR +GILQRIA+GYIVAALCEIW PCQ + YI Sbjct: 115 FLQGGYLHGITGLTYGVDIEKIRWMGILQRIAVGYIVAALCEIWFPCQGMKRVTLLSNYI 174 Query: 956 SHWCVVLSLCTIYLGLLYGLYVPNWQFRVVQSPSSLVTANNSHVYTVKCSVRGDLGPACN 777 WC+V L I+ GLLYGLYVP+WQFR +QS S +Y VKCSVRGDLGPACN Sbjct: 
175 WQWCIVFLLSAIHGGLLYGLYVPDWQFRALQS-------TGSSIYEVKCSVRGDLGPACN 227 Query: 776 SAGMIDRYVLGIDHLYAKPVYRNLKECNISSHGQVPQTAPSWCHAPFDPEGILSSLTAAA 597 SAGMIDRY+LG+DHLYAKPVYRN+KEC S++ + T PSWCHAPFDPEGILSSLTAAA Sbjct: 228 SAGMIDRYILGMDHLYAKPVYRNMKECYGSNNSRASTTTPSWCHAPFDPEGILSSLTAAA 287 Query: 596 SCIIGLQYGHILVQLQDHKERLRNWSIXXXXXXXXXXXXXXXGIPLNKSLYTISYLLVTS 417 +CIIGLQYGHILV+ QDHKERL +WS+ G+PLNKSLYTISYLLVTS Sbjct: 288 ACIIGLQYGHILVKFQDHKERLCSWSVLSLSLLVVGLFLAFIGVPLNKSLYTISYLLVTS 347 Query: 416 ACAGITFCILYALIDVYGWRWLTCVLEWMGKHSLSIFILITSNIIVVTVQGFYLKSPDNN 237 A AGITFC+LY L+D+YGWR L VLEWMGKHSLSIFILITSNI V+ +QGFY + P NN Sbjct: 348 AAAGITFCLLYVLVDIYGWRRLMFVLEWMGKHSLSIFILITSNIAVILIQGFYWRDPRNN 407 Query: 236 IVHWIVSLFVHK 201 IV W+V+ FV K Sbjct: 408 IVRWVVTKFVQK 419 >ref|XP_010107656.1| hypothetical protein L484_008373 [Morus notabilis] gi|587929407|gb|EXC16567.1| hypothetical protein L484_008373 [Morus notabilis] Length = 411 Score = 572 bits (1474), Expect = e-160 Identities = 277/393 (70%), Positives = 323/393 (82%) Frame = -2 Query: 1409 STKSKAARLASLDVFRGLCVFLMMLVDYAGSIFPIITHVPWNGVHLADFVMPFFLFVAGV 1230 +T ++ R+ASLDVFRGLC+FLMM+VDY SIFP+ITH PWNGVHLADFVMPFFLF+AGV Sbjct: 15 ATNRRSPRVASLDVFRGLCIFLMMVVDYGASIFPVITHSPWNGVHLADFVMPFFLFIAGV 74 Query: 1229 SVAIVYKKVSNRVDATWKAVLRALELFLLGVFLQGGYFHGINSLTYGVDIERIRLLGILQ 1050 S A+VYKKV +R++AT KAVLRAL+LF LGV LQGGYFHG++S+TYGVD+ERIR LGILQ Sbjct: 75 SPALVYKKVPDRLEATRKAVLRALKLFFLGVILQGGYFHGVSSMTYGVDVERIRWLGILQ 134 Query: 1049 RIAIGYIVAALCEIWLPCQRWRHDGFPKIYISHWCVVLSLCTIYLGLLYGLYVPNWQFRV 870 RI+IGYIVAALCEIWL Q GF K Y SH CV SL IY GLLYGLYVP+WQF+V Sbjct: 135 RISIGYIVAALCEIWLSHQTGWEIGFFKSYYSHLCVAFSLSAIYAGLLYGLYVPDWQFKV 194 Query: 869 VQSPSSLVTANNSHVYTVKCSVRGDLGPACNSAGMIDRYVLGIDHLYAKPVYRNLKECNI 690 + SSL +N+S VY VKCSVRGDLGPACNSAGMIDRYVLGI HLY KPVY+NLKECN+ Sbjct: 195 SPATSSL-PSNDSSVYMVKCSVRGDLGPACNSAGMIDRYVLGIGHLYTKPVYKNLKECNM 253 Query: 689 SSHGQVPQTAPSWCHAPFDPEGILSSLTAAASCIIGLQYGHILVQLQDHKERLRNWSIXX 510 +++G+VP+++PSWCHAPFDPEGILSSLTAA +CIIGLQYGH+L QLQDHK RL 
+WS+ Sbjct: 254 TTNGEVPESSPSWCHAPFDPEGILSSLTAAVTCIIGLQYGHVLAQLQDHKRRLESWSLFS 313 Query: 509 XXXXXXXXXXXXXGIPLNKSLYTISYLLVTSACAGITFCILYALIDVYGWRWLTCVLEWM 330 GIPLNKSLYTISY+L TSA AGITFCILY L+DVYG+R LT VLEWM Sbjct: 314 VSIFGIGLFLAFIGIPLNKSLYTISYMLTTSASAGITFCILYLLVDVYGFRSLTFVLEWM 373 Query: 329 GKHSLSIFILITSNIIVVTVQGFYLKSPDNNIV 231 G HSLSIF+L++SN+ ++ +QG Y NNIV Sbjct: 374 GMHSLSIFVLVSSNLAIIAIQGLYFHDRKNNIV 406 >ref|XP_008377733.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like isoform X2 [Malus domestica] Length = 410 Score = 570 bits (1470), Expect = e-160 Identities = 280/432 (64%), Positives = 326/432 (75%), Gaps = 1/432 (0%) Frame = -2 Query: 1496 MAEIEPLLQS-TGAGEVQPAEHEEQADKSTSTKSKAARLASLDVFRGLCVFLMMLVDYAG 1320 MA+ PLL + G G P K R+ASLDVFRGLCVFLMMLVDY G Sbjct: 1 MADYSPLLTAYDGPGTASP---------------KPPRVASLDVFRGLCVFLMMLVDYGG 45 Query: 1319 SIFPIITHVPWNGVHLADFVMPFFLFVAGVSVAIVYKKVSNRVDATWKAVLRALELFLLG 1140 SI PII H PWNG+HLADFVMPFFLF+AGVS+A+VYK+V+NRV+ATWKAV +A++LFLLG Sbjct: 46 SILPIIAHSPWNGLHLADFVMPFFLFIAGVSLALVYKRVTNRVEATWKAVFKAVKLFLLG 105 Query: 1139 VFLQGGYFHGINSLTYGVDIERIRLLGILQRIAIGYIVAALCEIWLPCQRWRHDGFPKIY 960 V LQGGYFHG+ SLTYGVDIERIR GILQRIAIGYI AALCEIWL Q GF + Y Sbjct: 106 VLLQGGYFHGVASLTYGVDIERIRWFGILQRIAIGYIAAALCEIWLSRQTLGEVGFFRTY 165 Query: 959 ISHWCVVLSLCTIYLGLLYGLYVPNWQFRVVQSPSSLVTANNSHVYTVKCSVRGDLGPAC 780 HWCV+ SL IY GLLYGLYVP+W+F+ +PSSL +N + Y VKCSVRGDLGPAC Sbjct: 166 YWHWCVIFSLSAIYAGLLYGLYVPDWEFK-ASTPSSLPPSNATTTYVVKCSVRGDLGPAC 224 Query: 779 NSAGMIDRYVLGIDHLYAKPVYRNLKECNISSHGQVPQTAPSWCHAPFDPEGILSSLTAA 600 NSA MIDRY+LG DHLY KPVYRNLKECN+S+ G+VP+++PSWCH PFDPE AA Sbjct: 225 NSARMIDRYILGFDHLYLKPVYRNLKECNVSADGRVPESSPSWCHTPFDPE-------AA 277 Query: 599 ASCIIGLQYGHILVQLQDHKERLRNWSIXXXXXXXXXXXXXXXGIPLNKSLYTISYLLVT 420 +CIIGLQYGHIL +QDHKERL W GIP+NKSLYTISY+L+T Sbjct: 278 VTCIIGLQYGHILAHIQDHKERLNIWFFSSVLMFVLGLFLAFIGIPVNKSLYTISYMLIT 337 Query: 419 SACAGITFCILYALIDVYGWRWLTCVLEWMGKHSLSIFILITSNIIVVTVQGFYLKSPDN 240 SA AGITFC LY L+DVYG+R +T VLEWMG 
HSL+IF+++TSN+ V+ +QGFYL P N Sbjct: 338 SASAGITFCTLYLLVDVYGYRCMTYVLEWMGIHSLTIFVVVTSNLAVIAIQGFYLADPQN 397 Query: 239 NIVHWIVSLFVH 204 NIVHWI++ FVH Sbjct: 398 NIVHWIITRFVH 409 >ref|XP_009782767.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like isoform X2 [Nicotiana sylvestris] Length = 428 Score = 570 bits (1469), Expect = e-159 Identities = 281/433 (64%), Positives = 326/433 (75%) Frame = -2 Query: 1499 SMAEIEPLLQSTGAGEVQPAEHEEQADKSTSTKSKAARLASLDVFRGLCVFLMMLVDYAG 1320 +MAE PLL + Q E A + K+K AR+ASLDVFRG+CV LMMLVDY G Sbjct: 3 TMAEDHPLLPNRAMEIEQTESGGEAAAATKKKKAKPARVASLDVFRGVCVLLMMLVDYGG 62 Query: 1319 SIFPIITHVPWNGVHLADFVMPFFLFVAGVSVAIVYKKVSNRVDATWKAVLRALELFLLG 1140 SIFP I H PWNGVHLADFVMPFFLF++GVS+AI YKKV +R AT KAV R L+L LLG Sbjct: 63 SIFPSIAHSPWNGVHLADFVMPFFLFISGVSLAIAYKKVLDRKGATLKAVFRTLKLLLLG 122 Query: 1139 VFLQGGYFHGINSLTYGVDIERIRLLGILQRIAIGYIVAALCEIWLPCQRWRHDGFPKIY 960 VFLQGGY HGI LTYGVDIE+IR LGILQRIA+GYIV ALCEIWLP QR + Y Sbjct: 123 VFLQGGYLHGITGLTYGVDIEKIRWLGILQRIAVGYIVTALCEIWLPRQRIKKRSLFSNY 182 Query: 959 ISHWCVVLSLCTIYLGLLYGLYVPNWQFRVVQSPSSLVTANNSHVYTVKCSVRGDLGPAC 780 I HWCV LC ++ LLYGLYVP+W+F V ++P + ++Y VKCSVRGDL PAC Sbjct: 183 IWHWCVAFYLCAVHTWLLYGLYVPDWEFTVSRTP-------DLNIYKVKCSVRGDLEPAC 235 Query: 779 NSAGMIDRYVLGIDHLYAKPVYRNLKECNISSHGQVPQTAPSWCHAPFDPEGILSSLTAA 600 N+AGMIDRY+LGIDHLY KPVYRNLKEC + ++PQ+ PSWCHAPF+PEGIL S+TAA Sbjct: 236 NTAGMIDRYILGIDHLYTKPVYRNLKECKGFNDDKIPQSFPSWCHAPFEPEGILGSVTAA 295 Query: 599 ASCIIGLQYGHILVQLQDHKERLRNWSIXXXXXXXXXXXXXXXGIPLNKSLYTISYLLVT 420 +CIIGLQ+GHILVQ QDHKERL NWSI G+PLNKSLYTISYLLVT Sbjct: 296 VACIIGLQFGHILVQFQDHKERLYNWSILSFPLLFLGFFLAVTGVPLNKSLYTISYLLVT 355 Query: 419 SACAGITFCILYALIDVYGWRWLTCVLEWMGKHSLSIFILITSNIIVVTVQGFYLKSPDN 240 SA AGITFC+LY L+D+YGWR L VLEWMGKHSL IFI+I SN+ V+ +QGFY + P + Sbjct: 356 SAAAGITFCLLYVLVDMYGWRRLMFVLEWMGKHSLGIFIVIISNVAVILIQGFYWRDPHS 415 Query: 239 NIVHWIVSLFVHK 201 NIV WIV+ +VHK Sbjct: 416 NIVRWIVTRYVHK 428 >ref|XP_006360902.1| PREDICTED: heparan-alpha-glucosaminide 
N-acetyltransferase-like [Solanum tuberosum] Length = 424 Score = 568 bits (1465), Expect = e-159 Identities = 286/432 (66%), Positives = 327/432 (75%) Frame = -2 Query: 1496 MAEIEPLLQSTGAGEVQPAEHEEQADKSTSTKSKAARLASLDVFRGLCVFLMMLVDYAGS 1317 MAE EPLL S EV AE E +A + T + ++R+ SLDVFRGLCVFLM+LVDYAGS Sbjct: 1 MAENEPLLGSNNGEEVVLAERESEATQR-KTATPSSRVISLDVFRGLCVFLMLLVDYAGS 59 Query: 1316 IFPIITHVPWNGVHLADFVMPFFLFVAGVSVAIVYKKVSNRVDATWKAVLRALELFLLGV 1137 +FP I H PWNGV LADFVMPFFLFV GVS+AIV K V +R AT K V+R L+LF+LG+ Sbjct: 60 VFPSIAHSPWNGVRLADFVMPFFLFVVGVSLAIVNKIVLDRTGATLKFVIRTLKLFILGI 119 Query: 1136 FLQGGYFHGINSLTYGVDIERIRLLGILQRIAIGYIVAALCEIWLPCQRWRHDGFPKIYI 957 FLQGGY HGI LTYGVDIERIR +GILQRIA+GYIVAALCEIWLP Q + + YI Sbjct: 120 FLQGGYLHGITGLTYGVDIERIRWMGILQRIAVGYIVAALCEIWLPTQEMKRVTLFRNYI 179 Query: 956 SHWCVVLSLCTIYLGLLYGLYVPNWQFRVVQSPSSLVTANNSHVYTVKCSVRGDLGPACN 777 WC++ L I+ GLLYGLYVP+WQF V QS S +Y VKCSVRGDLGPACN Sbjct: 180 CQWCIMFLLSAIHCGLLYGLYVPDWQFSVSQS-------TGSTIYEVKCSVRGDLGPACN 232 Query: 776 SAGMIDRYVLGIDHLYAKPVYRNLKECNISSHGQVPQTAPSWCHAPFDPEGILSSLTAAA 597 SA M+DRY+LGIDHLY KPVYRN+KECN S+ V ++ PSWCHA FDPEGI+SSLTAAA Sbjct: 233 SAAMVDRYILGIDHLYTKPVYRNMKECNGSNRETVSESMPSWCHAAFDPEGIVSSLTAAA 292 Query: 596 SCIIGLQYGHILVQLQDHKERLRNWSIXXXXXXXXXXXXXXXGIPLNKSLYTISYLLVTS 417 + IIGLQYGHILVQ QDHK RL NWSI G+PLNKSLYTISY+LVTS Sbjct: 293 TSIIGLQYGHILVQFQDHKGRLYNWSILSLSLLAVGLFLDFVGMPLNKSLYTISYMLVTS 352 Query: 416 ACAGITFCILYALIDVYGWRWLTCVLEWMGKHSLSIFILITSNIIVVTVQGFYLKSPDNN 237 AGITFC+LY L+D+YGWR L VLEW+GKHSLSIFILITSNI V+ +QGFY + P NN Sbjct: 353 GAAGITFCLLYLLVDIYGWRRLMFVLEWIGKHSLSIFILITSNIAVIFIQGFYWRDPQNN 412 Query: 236 IVHWIVSLFVHK 201 IV WIV+ FV K Sbjct: 413 IVRWIVTRFVQK 424 >gb|KHN19992.1| Heparan-alpha-glucosaminide N-acetyltransferase [Glycine soja] Length = 416 Score = 562 bits (1448), Expect = e-157 Identities = 267/404 (66%), Positives = 314/404 (77%) Frame = -2 Query: 1415 STSTKSKAARLASLDVFRGLCVFLMMLVDYAGSIFPIITHVPWNGVHLADFVMPFFLFVA 1236 S T+ + R+ASLDVFRGL VFLM+ VDYA SIFPII H 
PWNG+HLADFVMPFFLF+A Sbjct: 12 SEPTQFQNTRIASLDVFRGLSVFLMIFVDYAASIFPIIAHAPWNGIHLADFVMPFFLFIA 71 Query: 1235 GVSVAIVYKKVSNRVDATWKAVLRALELFLLGVFLQGGYFHGINSLTYGVDIERIRLLGI 1056 G+S+A+VYK+ +R ATWKA RAL LF LG+ LQGGYFHG+ SLT+GVDI+RIR LGI Sbjct: 72 GISLALVYKRRPHRTQATWKAFARALNLFALGILLQGGYFHGVTSLTFGVDIQRIRWLGI 131 Query: 1055 LQRIAIGYIVAALCEIWLPCQRWRHDGFPKIYISHWCVVLSLCTIYLGLLYGLYVPNWQF 876 LQRI+IGYIVAALCEIWLP RW+ GF K Y W V + L +Y GLLYGLYVP+WQF Sbjct: 132 LQRISIGYIVAALCEIWLPAPRWKELGFVKSYYWQWFVAVILLALYSGLLYGLYVPDWQF 191 Query: 875 RVVQSPSSLVTANNSHVYTVKCSVRGDLGPACNSAGMIDRYVLGIDHLYAKPVYRNLKEC 696 V S SSL +Y V CSVRGDLGPACNSAGMIDRY+LG+DHLY KPVYRNLK C Sbjct: 192 DVSASTSSLPPIGGGDIYMVNCSVRGDLGPACNSAGMIDRYILGLDHLYRKPVYRNLKGC 251 Query: 695 NISSHGQVPQTAPSWCHAPFDPEGILSSLTAAASCIIGLQYGHILVQLQDHKERLRNWSI 516 N+S+ GQV ++PSWCHAPFDPEGILSS+TAA SCIIGLQYGH+L LQDHK RL NW Sbjct: 252 NMSAKGQVSDSSPSWCHAPFDPEGILSSITAAVSCIIGLQYGHVLAHLQDHKGRLYNWMC 311 Query: 515 XXXXXXXXXXXXXXXGIPLNKSLYTISYLLVTSACAGITFCILYALIDVYGWRWLTCVLE 336 GIPLNKSLYT+SY+L+TSA +G+TF LY L+DV+G R LT +LE Sbjct: 312 FSLSFLALGLFLALIGIPLNKSLYTVSYMLLTSAASGLTFIALYFLVDVHGHRRLTALLE 371 Query: 335 WMGKHSLSIFILITSNIIVVTVQGFYLKSPDNNIVHWIVSLFVH 204 WMGKHSLSIF++++SN+ V+ VQGFY P+NNI++WIV+ F H Sbjct: 372 WMGKHSLSIFVIVSSNLAVIAVQGFYWTKPENNIINWIVTRFDH 415