BLASTX nr result

ID: Zanthoxylum22_contig00004668 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zanthoxylum22_contig00004668
         (1402 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006433416.1| hypothetical protein CICLE_v10001045mg [Citr...   744   0.0  
gb|KDO56210.1| hypothetical protein CISIN_1g012127mg [Citrus sin...   739   0.0  
ref|XP_006472095.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   737   0.0  
gb|KDO56215.1| hypothetical protein CISIN_1g012127mg [Citrus sin...   733   0.0  
ref|XP_006472096.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   654   0.0  
ref|XP_006433415.1| hypothetical protein CICLE_v10001045mg [Citr...   651   0.0  
gb|KDO56216.1| hypothetical protein CISIN_1g012127mg [Citrus sin...   648   0.0  
gb|KDO56217.1| hypothetical protein CISIN_1g012127mg [Citrus sin...   645   0.0  
ref|XP_007031006.1| Heparan-alpha-glucosaminide N-acetyltransfer...   624   e-176
ref|XP_012089000.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   619   e-174
gb|KHF97415.1| Heparan-alpha-glucosaminide N-acetyltransferase [...   616   e-173
ref|XP_008246429.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   615   e-173
ref|XP_012434169.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   614   e-173
ref|XP_004302388.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   613   e-172
ref|XP_007205146.1| hypothetical protein PRUPE_ppa005302mg [Prun...   612   e-172
ref|XP_009376612.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   608   e-171
ref|XP_010097938.1| hypothetical protein L484_000992 [Morus nota...   608   e-171
ref|XP_008370425.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   605   e-170
ref|XP_011087426.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   598   e-168
ref|XP_010025234.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   596   e-167

>ref|XP_006433416.1| hypothetical protein CICLE_v10001045mg [Citrus clementina]
            gi|568836113|ref|XP_006472093.1| PREDICTED:
            heparan-alpha-glucosaminide N-acetyltransferase-like
            isoform X1 [Citrus sinensis]
            gi|568836115|ref|XP_006472094.1| PREDICTED:
            heparan-alpha-glucosaminide N-acetyltransferase-like
            isoform X2 [Citrus sinensis] gi|557535538|gb|ESR46656.1|
            hypothetical protein CICLE_v10001045mg [Citrus
            clementina]
          Length = 470

 Score =  744 bits (1922), Expect = 0.0
 Identities = 355/426 (83%), Positives = 376/426 (88%)
 Frame = +3

Query: 123  MGEIKPEITHHHPLLKSEPDISEQQEKPHFNTQRLASLDIFRGLTVALMILVDHAGGDWP 302
            M EIK E THHHPL+ SEPD+S+QQEK H  TQRLASLDIFRGL VALMILVDHAGGDWP
Sbjct: 1    MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60

Query: 303  EITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLLFWGILLQ 482
            EI+HAPWNGCNLADFVMPFFLFIVGVAIALALKR+P + DAV KVI RTLKLLFWGILLQ
Sbjct: 61   EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ 120

Query: 483  GGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLGQFSIFRA 662
            GGFSHAPDELTYGVDVRMIR CGVLQRIALSYLLVS+VEI+TKDVQDKD+S+GQFSIFR 
Sbjct: 121  GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGQFSIFRL 180

Query: 663  YCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKLDPPCNAV 842
            YCWHWLMAACVL+VYLAL+YGTYVPDWQF IINKDSAD GKV+NVTCGVRGKL+PPCNAV
Sbjct: 181  YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRGKLNPPCNAV 240

Query: 843  GYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXXXXXXXXX 1022
            GYIDR VLGINH+YHRPAWRRSKACTQDSP++GPLRKDAPSWC APFEPEG         
Sbjct: 241  GYIDRKVLGINHMYHRPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300

Query: 1023 XTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYTLSYVCVT 1202
             T             KGHLARL+QWVTMG +LLIFGLTLHFT AIPLNKQLYTLSYVCVT
Sbjct: 301  STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360

Query: 1203 SGAAALVFSAIYALVDVWNFKFVFLPLAWIGMNAMFVYVMAAEGIFACFINGWYYADPQN 1382
            SGAAALVFSAIYALVD+WN K+ FLPLAWIGMNAM VYVMAAEGIFA FINGWYY DP N
Sbjct: 361  SGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHN 420

Query: 1383 TLIYWI 1400
            TL YWI
Sbjct: 421  TLPYWI 426


>gb|KDO56210.1| hypothetical protein CISIN_1g012127mg [Citrus sinensis]
          Length = 470

 Score =  739 bits (1907), Expect = 0.0
 Identities = 352/426 (82%), Positives = 374/426 (87%)
 Frame = +3

Query: 123  MGEIKPEITHHHPLLKSEPDISEQQEKPHFNTQRLASLDIFRGLTVALMILVDHAGGDWP 302
            M EIK E THHHPL+ SEPD+S+QQEK H  TQRLASLDIFRGL VALMILVDHAGGDWP
Sbjct: 1    MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60

Query: 303  EITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLLFWGILLQ 482
            EI+HAPWNGCNLADFVMPFFLFIVGVAIALALKR+P + DAV KVI RTLKLLFWGILLQ
Sbjct: 61   EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ 120

Query: 483  GGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLGQFSIFRA 662
            GGFSHAPDELTYGVDVRMIR CGVLQRIALSYLLVS+VEI+TKDVQDKD+S+G+FSIFR 
Sbjct: 121  GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL 180

Query: 663  YCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKLDPPCNAV 842
            YCWHWLMAACVL+VYLAL+YGTYVPDWQF IINKDSAD GKV+NVTCGVR KL+PPCNAV
Sbjct: 181  YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240

Query: 843  GYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXXXXXXXXX 1022
            GYIDR VLGINH+YH PAWRRSKACTQDSP++GPLRKDAPSWC APFEPEG         
Sbjct: 241  GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300

Query: 1023 XTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYTLSYVCVT 1202
             T             KGHLARL+QWVTMG +LLIFGLTLHFT AIPLNKQLYTLSYVCVT
Sbjct: 301  STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360

Query: 1203 SGAAALVFSAIYALVDVWNFKFVFLPLAWIGMNAMFVYVMAAEGIFACFINGWYYADPQN 1382
            SGAAALVFSAIYALVD+WN K+ FLPLAWIGMNAM VYVMAAEGIFA FINGWYY DP N
Sbjct: 361  SGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHN 420

Query: 1383 TLIYWI 1400
            TL YWI
Sbjct: 421  TLPYWI 426


>ref|XP_006472095.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
            isoform X3 [Citrus sinensis]
          Length = 456

 Score =  737 bits (1903), Expect = 0.0
 Identities = 352/422 (83%), Positives = 373/422 (88%)
 Frame = +3

Query: 123  MGEIKPEITHHHPLLKSEPDISEQQEKPHFNTQRLASLDIFRGLTVALMILVDHAGGDWP 302
            M EIK E THHHPL+ SEPD+S+QQEK H  TQRLASLDIFRGL VALMILVDHAGGDWP
Sbjct: 1    MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60

Query: 303  EITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLLFWGILLQ 482
            EI+HAPWNGCNLADFVMPFFLFIVGVAIALALKR+P + DAV KVI RTLKLLFWGILLQ
Sbjct: 61   EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ 120

Query: 483  GGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLGQFSIFRA 662
            GGFSHAPDELTYGVDVRMIR CGVLQRIALSYLLVS+VEI+TKDVQDKD+S+GQFSIFR 
Sbjct: 121  GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGQFSIFRL 180

Query: 663  YCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKLDPPCNAV 842
            YCWHWLMAACVL+VYLAL+YGTYVPDWQF IINKDSAD GKV+NVTCGVRGKL+PPCNAV
Sbjct: 181  YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRGKLNPPCNAV 240

Query: 843  GYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXXXXXXXXX 1022
            GYIDR VLGINH+YHRPAWRRSKACTQDSP++GPLRKDAPSWC APFEPEG         
Sbjct: 241  GYIDRKVLGINHMYHRPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300

Query: 1023 XTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYTLSYVCVT 1202
             T             KGHLARL+QWVTMG +LLIFGLTLHFT AIPLNKQLYTLSYVCVT
Sbjct: 301  STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360

Query: 1203 SGAAALVFSAIYALVDVWNFKFVFLPLAWIGMNAMFVYVMAAEGIFACFINGWYYADPQN 1382
            SGAAALVFSAIYALVD+WN K+ FLPLAWIGMNAM VYVMAAEGIFA FINGWYY DP N
Sbjct: 361  SGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHN 420

Query: 1383 TL 1388
            TL
Sbjct: 421  TL 422


>gb|KDO56215.1| hypothetical protein CISIN_1g012127mg [Citrus sinensis]
          Length = 463

 Score =  733 bits (1892), Expect = 0.0
 Identities = 349/426 (81%), Positives = 374/426 (87%)
 Frame = +3

Query: 123  MGEIKPEITHHHPLLKSEPDISEQQEKPHFNTQRLASLDIFRGLTVALMILVDHAGGDWP 302
            M EIK E THHHPL+ SEPD+S+QQEK H  TQRLASLDIFRGL VALMILVDHAGGDWP
Sbjct: 1    MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60

Query: 303  EITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLLFWGILLQ 482
            EI+HAPWNGCNLADFVMPFFLFIVGVAIALALKR+P + DAV KVI RTLKLLFWGILLQ
Sbjct: 61   EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ 120

Query: 483  GGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLGQFSIFRA 662
            GGFSHAPDELTYGVDVRMIR CGVLQRIALSYLLVS+VEI+TKDVQDKD+S+G+FSIFR 
Sbjct: 121  GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL 180

Query: 663  YCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKLDPPCNAV 842
            YCWHWLMAACVL+VYLAL+YGTYVPDWQF IINKDSAD GKV+NVTCGVR KL+PPCNAV
Sbjct: 181  YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240

Query: 843  GYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXXXXXXXXX 1022
            GYIDR VLGINH+YH PAWRRSKACTQDSP++GPLRKDAPSWC APFEPEG         
Sbjct: 241  GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300

Query: 1023 XTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYTLSYVCVT 1202
             T             KGHLARL+QWVTMG +LLIFGLTLHFT AIPLNKQLYTLSYVCVT
Sbjct: 301  STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360

Query: 1203 SGAAALVFSAIYALVDVWNFKFVFLPLAWIGMNAMFVYVMAAEGIFACFINGWYYADPQN 1382
            SGAAALVFSAIYALVD+WN K+ FLPLAWIGMNAM VYVMAAEGIFA FINGWYY DP N
Sbjct: 361  SGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHN 420

Query: 1383 TLIYWI 1400
            TL+ ++
Sbjct: 421  TLVCFL 426


>ref|XP_006472096.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
            isoform X4 [Citrus sinensis]
          Length = 384

 Score =  654 bits (1686), Expect = 0.0
 Identities = 314/376 (83%), Positives = 333/376 (88%)
 Frame = +3

Query: 123  MGEIKPEITHHHPLLKSEPDISEQQEKPHFNTQRLASLDIFRGLTVALMILVDHAGGDWP 302
            M EIK E THHHPL+ SEPD+S+QQEK H  TQRLASLDIFRGL VALMILVDHAGGDWP
Sbjct: 1    MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60

Query: 303  EITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLLFWGILLQ 482
            EI+HAPWNGCNLADFVMPFFLFIVGVAIALALKR+P + DAV KVI RTLKLLFWGILLQ
Sbjct: 61   EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ 120

Query: 483  GGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLGQFSIFRA 662
            GGFSHAPDELTYGVDVRMIR CGVLQRIALSYLLVS+VEI+TKDVQDKD+S+GQFSIFR 
Sbjct: 121  GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGQFSIFRL 180

Query: 663  YCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKLDPPCNAV 842
            YCWHWLMAACVL+VYLAL+YGTYVPDWQF IINKDSAD GKV+NVTCGVRGKL+PPCNAV
Sbjct: 181  YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRGKLNPPCNAV 240

Query: 843  GYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXXXXXXXXX 1022
            GYIDR VLGINH+YHRPAWRRSKACTQDSP++GPLRKDAPSWC APFEPEG         
Sbjct: 241  GYIDRKVLGINHMYHRPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300

Query: 1023 XTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYTLSYVCVT 1202
             T             KGHLARL+QWVTMG +LLIFGLTLHFT AIPLNKQLYTLSYVCVT
Sbjct: 301  STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360

Query: 1203 SGAAALVFSAIYALVD 1250
            SGAAALVFSAIYALVD
Sbjct: 361  SGAAALVFSAIYALVD 376


>ref|XP_006433415.1| hypothetical protein CICLE_v10001045mg [Citrus clementina]
            gi|557535537|gb|ESR46655.1| hypothetical protein
            CICLE_v10001045mg [Citrus clementina]
          Length = 375

 Score =  651 bits (1680), Expect = 0.0
 Identities = 313/375 (83%), Positives = 332/375 (88%)
 Frame = +3

Query: 123  MGEIKPEITHHHPLLKSEPDISEQQEKPHFNTQRLASLDIFRGLTVALMILVDHAGGDWP 302
            M EIK E THHHPL+ SEPD+S+QQEK H  TQRLASLDIFRGL VALMILVDHAGGDWP
Sbjct: 1    MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60

Query: 303  EITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLLFWGILLQ 482
            EI+HAPWNGCNLADFVMPFFLFIVGVAIALALKR+P + DAV KVI RTLKLLFWGILLQ
Sbjct: 61   EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ 120

Query: 483  GGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLGQFSIFRA 662
            GGFSHAPDELTYGVDVRMIR CGVLQRIALSYLLVS+VEI+TKDVQDKD+S+GQFSIFR 
Sbjct: 121  GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGQFSIFRL 180

Query: 663  YCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKLDPPCNAV 842
            YCWHWLMAACVL+VYLAL+YGTYVPDWQF IINKDSAD GKV+NVTCGVRGKL+PPCNAV
Sbjct: 181  YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRGKLNPPCNAV 240

Query: 843  GYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXXXXXXXXX 1022
            GYIDR VLGINH+YHRPAWRRSKACTQDSP++GPLRKDAPSWC APFEPEG         
Sbjct: 241  GYIDRKVLGINHMYHRPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300

Query: 1023 XTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYTLSYVCVT 1202
             T             KGHLARL+QWVTMG +LLIFGLTLHFT AIPLNKQLYTLSYVCVT
Sbjct: 301  STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360

Query: 1203 SGAAALVFSAIYALV 1247
            SGAAALVFSAIYALV
Sbjct: 361  SGAAALVFSAIYALV 375


>gb|KDO56216.1| hypothetical protein CISIN_1g012127mg [Citrus sinensis]
          Length = 384

 Score =  648 bits (1671), Expect = 0.0
 Identities = 311/376 (82%), Positives = 331/376 (88%)
 Frame = +3

Query: 123  MGEIKPEITHHHPLLKSEPDISEQQEKPHFNTQRLASLDIFRGLTVALMILVDHAGGDWP 302
            M EIK E THHHPL+ SEPD+S+QQEK H  TQRLASLDIFRGL VALMILVDHAGGDWP
Sbjct: 1    MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60

Query: 303  EITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLLFWGILLQ 482
            EI+HAPWNGCNLADFVMPFFLFIVGVAIALALKR+P + DAV KVI RTLKLLFWGILLQ
Sbjct: 61   EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ 120

Query: 483  GGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLGQFSIFRA 662
            GGFSHAPDELTYGVDVRMIR CGVLQRIALSYLLVS+VEI+TKDVQDKD+S+G+FSIFR 
Sbjct: 121  GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL 180

Query: 663  YCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKLDPPCNAV 842
            YCWHWLMAACVL+VYLAL+YGTYVPDWQF IINKDSAD GKV+NVTCGVR KL+PPCNAV
Sbjct: 181  YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240

Query: 843  GYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXXXXXXXXX 1022
            GYIDR VLGINH+YH PAWRRSKACTQDSP++GPLRKDAPSWC APFEPEG         
Sbjct: 241  GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300

Query: 1023 XTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYTLSYVCVT 1202
             T             KGHLARL+QWVTMG +LLIFGLTLHFT AIPLNKQLYTLSYVCVT
Sbjct: 301  STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360

Query: 1203 SGAAALVFSAIYALVD 1250
            SGAAALVFSAIYALVD
Sbjct: 361  SGAAALVFSAIYALVD 376


>gb|KDO56217.1| hypothetical protein CISIN_1g012127mg [Citrus sinensis]
          Length = 375

 Score =  645 bits (1665), Expect = 0.0
 Identities = 310/375 (82%), Positives = 330/375 (88%)
 Frame = +3

Query: 123  MGEIKPEITHHHPLLKSEPDISEQQEKPHFNTQRLASLDIFRGLTVALMILVDHAGGDWP 302
            M EIK E THHHPL+ SEPD+S+QQEK H  TQRLASLDIFRGL VALMILVDHAGGDWP
Sbjct: 1    MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60

Query: 303  EITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLLFWGILLQ 482
            EI+HAPWNGCNLADFVMPFFLFIVGVAIALALKR+P + DAV KVI RTLKLLFWGILLQ
Sbjct: 61   EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ 120

Query: 483  GGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLGQFSIFRA 662
            GGFSHAPDELTYGVDVRMIR CGVLQRIALSYLLVS+VEI+TKDVQDKD+S+G+FSIFR 
Sbjct: 121  GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL 180

Query: 663  YCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKLDPPCNAV 842
            YCWHWLMAACVL+VYLAL+YGTYVPDWQF IINKDSAD GKV+NVTCGVR KL+PPCNAV
Sbjct: 181  YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240

Query: 843  GYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXXXXXXXXX 1022
            GYIDR VLGINH+YH PAWRRSKACTQDSP++GPLRKDAPSWC APFEPEG         
Sbjct: 241  GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300

Query: 1023 XTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYTLSYVCVT 1202
             T             KGHLARL+QWVTMG +LLIFGLTLHFT AIPLNKQLYTLSYVCVT
Sbjct: 301  STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360

Query: 1203 SGAAALVFSAIYALV 1247
            SGAAALVFSAIYALV
Sbjct: 361  SGAAALVFSAIYALV 375


>ref|XP_007031006.1| Heparan-alpha-glucosaminide N-acetyltransferase [Theobroma cacao]
            gi|508719611|gb|EOY11508.1| Heparan-alpha-glucosaminide
            N-acetyltransferase [Theobroma cacao]
          Length = 466

 Score =  624 bits (1609), Expect = e-176
 Identities = 297/426 (69%), Positives = 341/426 (80%)
 Frame = +3

Query: 123  MGEIKPEITHHHPLLKSEPDISEQQEKPHFNTQRLASLDIFRGLTVALMILVDHAGGDWP 302
            M EIK E    H L  + P   +  +KP+  TQR+ASLDIFRGLTVALMILVD AGG+WP
Sbjct: 1    MAEIKAEPAQRHTL--AIPMADDSAQKPN-KTQRVASLDIFRGLTVALMILVDDAGGEWP 57

Query: 303  EITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLLFWGILLQ 482
             I HAPW+GCNLADFVMPFFLFIVG+AI LALKR+P +G A+ KV  RTLKLLFWG+LLQ
Sbjct: 58   VIGHAPWHGCNLADFVMPFFLFIVGMAIPLALKRIPGKGKAIQKVGFRTLKLLFWGLLLQ 117

Query: 483  GGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLGQFSIFRA 662
            GG+SHAPD+LTYGVD++MIR+CG+LQRIA +YL+V++ EI+ KD Q KD S G FS+FR 
Sbjct: 118  GGYSHAPDKLTYGVDMKMIRFCGILQRIAFAYLVVALAEIFLKDAQPKDVSAGHFSVFRL 177

Query: 663  YCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKLDPPCNAV 842
            YCWHWL+ AC+LI+YLAL+YGTYVPDWQF + NKDSAD GKV+ V C VRGKLDPPCNAV
Sbjct: 178  YCWHWLVGACILIMYLALLYGTYVPDWQFTVQNKDSADYGKVFTVACNVRGKLDPPCNAV 237

Query: 843  GYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXXXXXXXXX 1022
            GYIDR+VLGINH+Y RPAWRRS+ACT +SPY+GP +  APSWC APFEPEG         
Sbjct: 238  GYIDREVLGINHMYQRPAWRRSRACTVNSPYEGPFKDAAPSWCHAPFEPEGILSSISAVL 297

Query: 1023 XTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYTLSYVCVT 1202
             T            +KGH  RLRQW+ MG++LLI G+ LHFT AIPLNKQLYT SYVCVT
Sbjct: 298  STIIGVHFGHVLVHLKGHSERLRQWIMMGIALLILGIVLHFT-AIPLNKQLYTFSYVCVT 356

Query: 1203 SGAAALVFSAIYALVDVWNFKFVFLPLAWIGMNAMFVYVMAAEGIFACFINGWYYADPQN 1382
            SGAAALVFSAIY LVD+W+ K VFLPL WIGMNAM VYVMAAEGIFA FINGWYY DP N
Sbjct: 357  SGAAALVFSAIYILVDIWDLKLVFLPLKWIGMNAMLVYVMAAEGIFAGFINGWYYQDPHN 416

Query: 1383 TLIYWI 1400
            TL+YWI
Sbjct: 417  TLVYWI 422


>ref|XP_012089000.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Jatropha
            curcas] gi|643708557|gb|KDP23473.1| hypothetical protein
            JCGZ_23306 [Jatropha curcas]
          Length = 468

 Score =  619 bits (1597), Expect = e-174
 Identities = 291/426 (68%), Positives = 337/426 (79%)
 Frame = +3

Query: 123  MGEIKPEITHHHPLLKSEPDISEQQEKPHFNTQRLASLDIFRGLTVALMILVDHAGGDWP 302
            M EIK +  H H L+ +E DIS+Q  KP     R+ASLDIFRGLTVALMILVD AGG+WP
Sbjct: 1    MAEIKTDTAHEHHLIVAEVDISDQ--KPPQPPGRVASLDIFRGLTVALMILVDDAGGEWP 58

Query: 303  EITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLLFWGILLQ 482
             I HAPWNGCNLADFVMPFFLFIVG+AI LALKR+  +  AV KVI RTLKLLFWG+LLQ
Sbjct: 59   MIGHAPWNGCNLADFVMPFFLFIVGMAIPLALKRITSRSQAVKKVIFRTLKLLFWGLLLQ 118

Query: 483  GGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLGQFSIFRA 662
            GGFSHAPD+LTYGVD++ IRWCG+LQRIA +YL++S+VEI+TKD   KD   G+FS+FR 
Sbjct: 119  GGFSHAPDKLTYGVDMKEIRWCGILQRIAFAYLIMSLVEIFTKDTNPKDLPPGRFSMFRL 178

Query: 663  YCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKLDPPCNAV 842
            YCWHWL+ ACVL++YLA+I+GT VPDWQF I +KDS D GKV+NVTCGVRGKLDPPCNAV
Sbjct: 179  YCWHWLVGACVLVIYLAVIHGTRVPDWQFTIHDKDSPDFGKVFNVTCGVRGKLDPPCNAV 238

Query: 843  GYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXXXXXXXXX 1022
            GYID+ VLG+ H+Y RPAWRRSKACT++SPY+GP + DAPSWC APFEPEG         
Sbjct: 239  GYIDKKVLGLAHMYQRPAWRRSKACTENSPYEGPFQSDAPSWCRAPFEPEGIISSISAIL 298

Query: 1023 XTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYTLSYVCVT 1202
             T            +K + +RL+ W+ +G +LLI G  LHFT  +PLNKQLYT SYVCVT
Sbjct: 299  STIIGIHFGHILVHLKDNSSRLKHWILLGFALLIVGFVLHFTHVVPLNKQLYTFSYVCVT 358

Query: 1203 SGAAALVFSAIYALVDVWNFKFVFLPLAWIGMNAMFVYVMAAEGIFACFINGWYYADPQN 1382
            SGAAALVFSAIY LVD+W  KF+FLP  WIGMNAM VYVMAA GIFA F+NGWYY DP N
Sbjct: 359  SGAAALVFSAIYILVDIWGLKFLFLPFQWIGMNAMLVYVMAAAGIFAGFMNGWYYDDPHN 418

Query: 1383 TLIYWI 1400
            TLIYWI
Sbjct: 419  TLIYWI 424


>gb|KHF97415.1| Heparan-alpha-glucosaminide N-acetyltransferase [Gossypium arboreum]
          Length = 467

 Score =  616 bits (1589), Expect = e-173
 Identities = 288/426 (67%), Positives = 341/426 (80%)
 Frame = +3

Query: 123  MGEIKPEITHHHPLLKSEPDISEQQEKPHFNTQRLASLDIFRGLTVALMILVDHAGGDWP 302
            M EIK +   +H ++  E +IS Q  KPH  T R+ASLDIFRGLTVALMILVD AGG+WP
Sbjct: 1    MAEIKVDPVDNHTMVIPETEISAQ--KPH-RTLRVASLDIFRGLTVALMILVDDAGGEWP 57

Query: 303  EITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLLFWGILLQ 482
             I HAPW+GCNLADFVMPFFLFIVG+AI LALKR+P +G AV KV+ RTLKLLFWG+LLQ
Sbjct: 58   MIGHAPWHGCNLADFVMPFFLFIVGMAIPLALKRIPSKGKAVKKVVFRTLKLLFWGLLLQ 117

Query: 483  GGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLGQFSIFRA 662
            GGFSHAPD+LTYGVD++MIR+CG+LQRIA +YL+V++ EI+ KD Q  D + G  S+FR 
Sbjct: 118  GGFSHAPDKLTYGVDMQMIRFCGILQRIAFAYLVVALAEIFLKDAQSNDIAAGYCSVFRL 177

Query: 663  YCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKLDPPCNAV 842
            YCWHWL+ AC+LI+YLA++YGTYVPDWQF + +K+SAD G+V+ V C VRGKL+PPCNAV
Sbjct: 178  YCWHWLLGACILILYLAMLYGTYVPDWQFAVHDKESADYGRVFTVDCNVRGKLNPPCNAV 237

Query: 843  GYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXXXXXXXXX 1022
            GYIDR++LGINH+Y +PAWRR+KACT +SPY+GP   DAPSWC APFEPEG         
Sbjct: 238  GYIDREILGINHMYLKPAWRRAKACTVNSPYEGPFINDAPSWCHAPFEPEGILSSISAVL 297

Query: 1023 XTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYTLSYVCVT 1202
             T            +KGH  RL+QW+ MG +LLI G+ LHFT AIPLNKQLYT SYVCVT
Sbjct: 298  STIIGVHFGHVLVHLKGHSERLKQWIMMGFALLILGIILHFTNAIPLNKQLYTFSYVCVT 357

Query: 1203 SGAAALVFSAIYALVDVWNFKFVFLPLAWIGMNAMFVYVMAAEGIFACFINGWYYADPQN 1382
            SGAAALVFSAIY LVD+W  K++F+PL WIGMNAM VYVMAAEGIFA FINGWYY DP N
Sbjct: 358  SGAAALVFSAIYILVDIWGLKYMFVPLKWIGMNAMLVYVMAAEGIFAGFINGWYYEDPHN 417

Query: 1383 TLIYWI 1400
            TL+YWI
Sbjct: 418  TLVYWI 423


>ref|XP_008246429.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Prunus
            mume]
          Length = 468

 Score =  615 bits (1585), Expect = e-173
 Identities = 291/428 (67%), Positives = 342/428 (79%), Gaps = 2/428 (0%)
 Frame = +3

Query: 123  MGEIKPEI-THHHPLLKSEPDISEQQ-EKPHFNTQRLASLDIFRGLTVALMILVDHAGGD 296
            M EIK +  THHH L  SEPDIS+ +  KP    +R+ASLDIFRGLTV+LMILVD AGG+
Sbjct: 1    MAEIKADTATHHHHLNVSEPDISDHKPSKP----KRVASLDIFRGLTVSLMILVDDAGGE 56

Query: 297  WPEITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLLFWGIL 476
            WP I HAPWNGCNLADFVMPFFLFIVG+AIAL+LKR+P Q  AV +VILRT KLLFWG+L
Sbjct: 57   WPVIGHAPWNGCNLADFVMPFFLFIVGMAIALSLKRIPDQLLAVKRVILRTFKLLFWGLL 116

Query: 477  LQGGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLGQFSIF 656
            LQGGFSHAPD+LTYGVD++ IRWCG+LQRIAL+YL+V+++EI+++  Q K+ +   FSIF
Sbjct: 117  LQGGFSHAPDKLTYGVDMKEIRWCGILQRIALAYLVVALIEIFSRGAQTKNMAPSGFSIF 176

Query: 657  RAYCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKLDPPCN 836
            + Y WHWL+AACVL +Y A+IYGTYVPDWQF +++++S D GK + V CGVRGKLDPPCN
Sbjct: 177  KLYYWHWLVAACVLTIYFAVIYGTYVPDWQFTVLDRESIDYGKSFTVACGVRGKLDPPCN 236

Query: 837  AVGYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXXXXXXX 1016
            AVGYIDR+VLGINH+Y RPAW+RSKACT++SPY GP R DAPSWC APFEPEG       
Sbjct: 237  AVGYIDREVLGINHMYPRPAWKRSKACTENSPYAGPFRHDAPSWCHAPFEPEGIVSSISA 296

Query: 1017 XXXTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYTLSYVC 1196
               T            MKGH ARL+ W+  G +LL+ G+ LHFT AIP NKQLYT SYVC
Sbjct: 297  ILSTIIGVHFGHVLIQMKGHPARLKHWIPAGCALLVLGIILHFTHAIPSNKQLYTFSYVC 356

Query: 1197 VTSGAAALVFSAIYALVDVWNFKFVFLPLAWIGMNAMFVYVMAAEGIFACFINGWYYADP 1376
            +TSGAAALVFSA Y LVD+W  K++FLPL WIGMNAM VYVMAAEGIFA FINGWYY DP
Sbjct: 357  ITSGAAALVFSASYILVDIWGIKYLFLPLEWIGMNAMLVYVMAAEGIFAGFINGWYYEDP 416

Query: 1377 QNTLIYWI 1400
             NTL+YWI
Sbjct: 417  HNTLVYWI 424


>ref|XP_012434169.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform X1
            [Gossypium raimondii] gi|823197103|ref|XP_012434170.1|
            PREDICTED: heparan-alpha-glucosaminide
            N-acetyltransferase isoform X1 [Gossypium raimondii]
            gi|823197106|ref|XP_012434171.1| PREDICTED:
            heparan-alpha-glucosaminide N-acetyltransferase isoform
            X1 [Gossypium raimondii] gi|763778185|gb|KJB45308.1|
            hypothetical protein B456_007G300200 [Gossypium
            raimondii]
          Length = 467

 Score =  614 bits (1584), Expect = e-173
 Identities = 287/426 (67%), Positives = 340/426 (79%)
 Frame = +3

Query: 123  MGEIKPEITHHHPLLKSEPDISEQQEKPHFNTQRLASLDIFRGLTVALMILVDHAGGDWP 302
            M EIK +    H ++  E +IS Q  KPH  T R+ASLDIFRGLTVALMILVD AGG+WP
Sbjct: 1    MAEIKVDPVDDHTMVIPETEISAQ--KPH-RTLRVASLDIFRGLTVALMILVDDAGGEWP 57

Query: 303  EITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLLFWGILLQ 482
             I HAPW+GCNLADFVMPFFLFIVG+AI LALKR+P +G AV KV+ RTLKLLFWG+LLQ
Sbjct: 58   MIGHAPWHGCNLADFVMPFFLFIVGMAIPLALKRIPSKGKAVKKVVFRTLKLLFWGLLLQ 117

Query: 483  GGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLGQFSIFRA 662
            GGFSHAPD+LTYGVD++MIR+CG+LQRIA +YL+V++ EI+ KD Q  D + G  S+FR 
Sbjct: 118  GGFSHAPDKLTYGVDMQMIRFCGILQRIAFAYLVVALAEIFLKDAQSNDIAAGYCSVFRL 177

Query: 663  YCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKLDPPCNAV 842
            YCWHWL+ AC+LI+YLA++YG YVPDWQF + +K+SA  G+V+ V C VRGKL+PPCNAV
Sbjct: 178  YCWHWLLGACILILYLAMLYGIYVPDWQFAVHDKESAVYGRVFTVDCNVRGKLNPPCNAV 237

Query: 843  GYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXXXXXXXXX 1022
            GYIDR++LGINHLY +PAWRR+KACT++SPY+GP + DAPSWC APFEPEG         
Sbjct: 238  GYIDREILGINHLYLKPAWRRAKACTENSPYEGPFKNDAPSWCHAPFEPEGILSSISAVL 297

Query: 1023 XTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYTLSYVCVT 1202
             T            +KGH  RL+QW+ MG +LLI G+ LHFT AIPLNKQLYT SYVCVT
Sbjct: 298  STIIGVHFGHVLVHLKGHSERLKQWIMMGFALLILGIVLHFTHAIPLNKQLYTFSYVCVT 357

Query: 1203 SGAAALVFSAIYALVDVWNFKFVFLPLAWIGMNAMFVYVMAAEGIFACFINGWYYADPQN 1382
            SGAAALVFSAIY LVD+W  K++F+PL WIGMNAM VYVMAAEGIFA FINGWYY DP N
Sbjct: 358  SGAAALVFSAIYILVDIWGLKYMFVPLKWIGMNAMLVYVMAAEGIFAGFINGWYYEDPHN 417

Query: 1383 TLIYWI 1400
            TL+YWI
Sbjct: 418  TLVYWI 423


>ref|XP_004302388.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Fragaria
            vesca subsp. vesca]
          Length = 467

 Score =  613 bits (1580), Expect = e-172
 Identities = 292/427 (68%), Positives = 339/427 (79%), Gaps = 1/427 (0%)
 Frame = +3

Query: 123  MGEIKPEITHHHPLLKSEPDISEQQ-EKPHFNTQRLASLDIFRGLTVALMILVDHAGGDW 299
            M EIKP+  H   L+ SEPDIS+Q+  KP    +RLASLDIFRGLTV+LMILVD AGG+W
Sbjct: 1    MAEIKPDTAHQQYLIVSEPDISDQKPSKP----KRLASLDIFRGLTVSLMILVDDAGGEW 56

Query: 300  PEITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLLFWGILL 479
            P I HAPWNGCNLADFVMPFFLFIVG++IAL+LKR+P Q  AV KVILRTLKLLFWG+LL
Sbjct: 57   PMIGHAPWNGCNLADFVMPFFLFIVGMSIALSLKRIPDQLLAVKKVILRTLKLLFWGLLL 116

Query: 480  QGGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLGQFSIFR 659
            QGGFSHAPD+LTYGVD++ IRWCG+LQRIAL+YL+V+++EI+ +  Q KD + G+ SIF+
Sbjct: 117  QGGFSHAPDKLTYGVDMKEIRWCGILQRIALAYLVVALIEIFLRCAQRKDVAPGRLSIFK 176

Query: 660  AYCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKLDPPCNA 839
             Y W WL+AACVL  Y A+IYGTYVPDWQF + ++DS+D GK + V CG RGKLDPPCNA
Sbjct: 177  LYYWQWLVAACVLTTYFAVIYGTYVPDWQFTVHDRDSSDYGKSFTVFCGARGKLDPPCNA 236

Query: 840  VGYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXXXXXXXX 1019
            VGYIDR VLGINH+Y  PAW+RSKACT++SPY+GP R DAPSWC APFEPEG        
Sbjct: 237  VGYIDRQVLGINHMYQHPAWKRSKACTENSPYEGPFRNDAPSWCRAPFEPEGIVSSISAI 296

Query: 1020 XXTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYTLSYVCV 1199
              T            MKGH ARL+ W+ MG +LLI G  LHFT AIP NKQLYT SYVC+
Sbjct: 297  LSTIIGVHFGHVLIHMKGHQARLKHWILMGFALLILGNVLHFTHAIPSNKQLYTFSYVCI 356

Query: 1200 TSGAAALVFSAIYALVDVWNFKFVFLPLAWIGMNAMFVYVMAAEGIFACFINGWYYADPQ 1379
            TSGAAALVFSA Y +VD+++ KF+FLPL WIGMNAM VYVMAAEGIFA F NGWYY DP 
Sbjct: 357  TSGAAALVFSAFYIMVDIYDKKFMFLPLEWIGMNAMLVYVMAAEGIFAGFFNGWYYKDPH 416

Query: 1380 NTLIYWI 1400
            NTLIYWI
Sbjct: 417  NTLIYWI 423


>ref|XP_007205146.1| hypothetical protein PRUPE_ppa005302mg [Prunus persica]
            gi|462400788|gb|EMJ06345.1| hypothetical protein
            PRUPE_ppa005302mg [Prunus persica]
          Length = 468

 Score =  612 bits (1578), Expect = e-172
 Identities = 290/428 (67%), Positives = 341/428 (79%), Gaps = 2/428 (0%)
 Frame = +3

Query: 123  MGEIKPEIT-HHHPLLKSEPDISEQQ-EKPHFNTQRLASLDIFRGLTVALMILVDHAGGD 296
            M EIK + T HHH L  SEPDIS+ +  KP    +R+ASLDIFRGLTV+LMILVD AGG+
Sbjct: 1    MAEIKADTTTHHHHLNLSEPDISDHKPSKP----KRVASLDIFRGLTVSLMILVDDAGGE 56

Query: 297  WPEITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLLFWGIL 476
            WP I HAPWNGCNLADFVMPFFLFIVG+AIAL+LKR+P Q  AV +VILRTLKLLFWG+L
Sbjct: 57   WPVIGHAPWNGCNLADFVMPFFLFIVGMAIALSLKRIPDQLLAVKRVILRTLKLLFWGVL 116

Query: 477  LQGGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLGQFSIF 656
            LQGGF H PD+LTYGVD++ IRWCG+LQRIAL+YL+V+++EI+++  Q K+ +  +FSIF
Sbjct: 117  LQGGFLHDPDKLTYGVDMKEIRWCGILQRIALAYLVVALIEIFSRGAQTKNMAPSRFSIF 176

Query: 657  RAYCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKLDPPCN 836
            + Y WHWL AACVL +Y A+IYGTYVPDWQF ++ ++S D GK + V CGVRGKLDPPCN
Sbjct: 177  KLYYWHWLAAACVLTIYFAVIYGTYVPDWQFTVLYRESIDYGKSFTVACGVRGKLDPPCN 236

Query: 837  AVGYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXXXXXXX 1016
            AVGYIDR+VLGINH+Y RPAW+RSKACT++SPY GP R DAPSWC APFEPEG       
Sbjct: 237  AVGYIDREVLGINHMYPRPAWKRSKACTENSPYAGPFRHDAPSWCHAPFEPEGIVSSISA 296

Query: 1017 XXXTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYTLSYVC 1196
               T            MKGH ARL+ W+ +G +LL+ G+ LHFT AIP NKQLYT SYVC
Sbjct: 297  ILSTIIGVHFGHVLIQMKGHPARLKHWIPVGCALLVLGIILHFTHAIPSNKQLYTFSYVC 356

Query: 1197 VTSGAAALVFSAIYALVDVWNFKFVFLPLAWIGMNAMFVYVMAAEGIFACFINGWYYADP 1376
            +TSGAAALV+SA Y LVD+W  K++FLPL WIGMNAM VYVMAAEGIFA FINGWYY DP
Sbjct: 357  ITSGAAALVYSAFYILVDIWGIKYLFLPLEWIGMNAMLVYVMAAEGIFAGFINGWYYEDP 416

Query: 1377 QNTLIYWI 1400
             NTLIYWI
Sbjct: 417  HNTLIYWI 424


>ref|XP_009376612.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Pyrus x
            bretschneideri]
          Length = 467

 Score =  608 bits (1569), Expect = e-171
 Identities = 280/426 (65%), Positives = 338/426 (79%)
 Frame = +3

Query: 123  MGEIKPEITHHHPLLKSEPDISEQQEKPHFNTQRLASLDIFRGLTVALMILVDHAGGDWP 302
            M EIK + +HH  L+ SEPD+S+ +      ++RLASLDIFRGLTV+LMILVD AGG+WP
Sbjct: 1    MAEIKADTSHHQSLIVSEPDVSDPKPS---KSKRLASLDIFRGLTVSLMILVDDAGGEWP 57

Query: 303  EITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLLFWGILLQ 482
             I HAPWNGCNLADFVMPFFLFIVG++IAL+LKR+P Q  AV KV LRTLKLLFWG+LLQ
Sbjct: 58   VIGHAPWNGCNLADFVMPFFLFIVGMSIALSLKRIPDQFVAVKKVTLRTLKLLFWGLLLQ 117

Query: 483  GGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLGQFSIFRA 662
            GG+SHAPD+LTYGVDV+ IRWCG+LQRIAL+YL+V+++EI ++  + KD + G FSIF+ 
Sbjct: 118  GGYSHAPDKLTYGVDVKEIRWCGILQRIALAYLVVALIEIVSRGAETKDMAPGTFSIFKL 177

Query: 663  YCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKLDPPCNAV 842
            Y WHWL+AACVL++Y A+IYG YVPDWQF + +++  D GK Y VTCGVRGKLDPPCNAV
Sbjct: 178  YYWHWLVAACVLVIYFAVIYGAYVPDWQFTVQDRERTDYGKSYTVTCGVRGKLDPPCNAV 237

Query: 843  GYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXXXXXXXXX 1022
            GYIDR+VLGI+H+Y RPAW+RS+ACT++SPY GP R DAPSWC APFEPEG         
Sbjct: 238  GYIDREVLGISHMYQRPAWKRSRACTENSPYAGPFRNDAPSWCRAPFEPEGIVSSISAIL 297

Query: 1023 XTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYTLSYVCVT 1202
                          M+GH ARL+ WV  G +LL  G+ LHF+ AIP NKQLYT SYVC+T
Sbjct: 298  SAIIGVHFGHVLIHMQGHPARLKHWVPTGCALLALGIILHFSHAIPSNKQLYTFSYVCIT 357

Query: 1203 SGAAALVFSAIYALVDVWNFKFVFLPLAWIGMNAMFVYVMAAEGIFACFINGWYYADPQN 1382
            SGAAALVFSA Y +VD+W+ +++FLPL WIGMNAM VYVMAAEGI A F+NGWYY DP N
Sbjct: 358  SGAAALVFSAFYLMVDIWSIRYLFLPLEWIGMNAMLVYVMAAEGILAGFVNGWYYKDPHN 417

Query: 1383 TLIYWI 1400
            TL+YWI
Sbjct: 418  TLVYWI 423


>ref|XP_010097938.1| hypothetical protein L484_000992 [Morus notabilis]
            gi|587884413|gb|EXB73306.1| hypothetical protein
            L484_000992 [Morus notabilis]
          Length = 467

 Score =  608 bits (1568), Expect = e-171
 Identities = 287/427 (67%), Positives = 340/427 (79%), Gaps = 1/427 (0%)
 Frame = +3

Query: 123  MGEIKPEITHHHPLLKSEPDISEQQ-EKPHFNTQRLASLDIFRGLTVALMILVDHAGGDW 299
            M EIK + TH H +   E D+  Q+ +KP    +R+ASLDIFRGLTVALMILVD AGG+W
Sbjct: 1    MAEIKADTTHQHNVSVPEDDVPVQKLQKP----KRVASLDIFRGLTVALMILVDDAGGEW 56

Query: 300  PEITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLLFWGILL 479
            P I HAPWNGCNLADFVMPFFLFIVG+AIALALKR+P Q  A+ KVILRTLKLLFWG+LL
Sbjct: 57   PMIGHAPWNGCNLADFVMPFFLFIVGMAIALALKRIPDQLGAIKKVILRTLKLLFWGLLL 116

Query: 480  QGGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLGQFSIFR 659
            QGGFSHAPDELTYGVD++ IRWCG+LQRIAL+YL+V+++EI+++D Q +D   G FSIF+
Sbjct: 117  QGGFSHAPDELTYGVDMKEIRWCGILQRIALAYLVVAVLEIFSRDSQVEDLPPGWFSIFK 176

Query: 660  AYCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKLDPPCNA 839
             Y WHW++ A VL+VYLA +YGTYVPDWQFI+ NKDS + GK +NV CGVRG LDPPCNA
Sbjct: 177  LYFWHWIVGAGVLVVYLAALYGTYVPDWQFIVQNKDSINYGKSFNVACGVRGNLDPPCNA 236

Query: 840  VGYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXXXXXXXX 1019
            VGYIDR VLG++H+Y  PAWRRS+ACT++SPY+GP RKDAPSWC  PFEPEG        
Sbjct: 237  VGYIDRKVLGLSHMYRHPAWRRSEACTENSPYEGPFRKDAPSWCYGPFEPEGILSSISAI 296

Query: 1020 XXTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYTLSYVCV 1199
              T            +KGH  RL+ W++MG  L++ G+ LHFT AIPLNKQLYTLSY+CV
Sbjct: 297  LSTIIGLHFGHVLIHLKGHQERLKHWISMGSVLVVSGIILHFTDAIPLNKQLYTLSYICV 356

Query: 1200 TSGAAALVFSAIYALVDVWNFKFVFLPLAWIGMNAMFVYVMAAEGIFACFINGWYYADPQ 1379
            TSGAAALVFS+ Y +VD+ NF++VFLPL WIGMNAM VYVMAA GIFA F+NGWYY DP 
Sbjct: 357  TSGAAALVFSSFYIMVDISNFRYVFLPLEWIGMNAMLVYVMAAAGIFAGFVNGWYYDDPH 416

Query: 1380 NTLIYWI 1400
            N LIYWI
Sbjct: 417  NALIYWI 423


>ref|XP_008370425.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Malus
            domestica]
          Length = 467

 Score =  605 bits (1559), Expect = e-170
 Identities = 277/426 (65%), Positives = 336/426 (78%)
 Frame = +3

Query: 123  MGEIKPEITHHHPLLKSEPDISEQQEKPHFNTQRLASLDIFRGLTVALMILVDHAGGDWP 302
            M EIK + +HH  L+ SEPD+S+ +      ++RLASLDIFRGLTV+LMILVD AGG+WP
Sbjct: 1    MAEIKADTSHHQSLIVSEPDVSDPKPS---KSKRLASLDIFRGLTVSLMILVDDAGGEWP 57

Query: 303  EITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLLFWGILLQ 482
             I HAPWNGCNLADFVMPFFLFIVG++IAL+LKR+P Q  AV KVILRTLKLLFWG+LLQ
Sbjct: 58   VIGHAPWNGCNLADFVMPFFLFIVGMSIALSLKRIPDQFVAVKKVILRTLKLLFWGLLLQ 117

Query: 483  GGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLGQFSIFRA 662
            GG+SHAPD+LTYGVD++ +RWCG+LQRIAL+YL+V+++EI ++  + KD + G FSIF+ 
Sbjct: 118  GGYSHAPDKLTYGVDMKELRWCGILQRIALAYLVVALIEIVSRGAETKDMAPGTFSIFKL 177

Query: 663  YCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKLDPPCNAV 842
            Y WHWL+A CVL++Y A+IYGTYVPDWQF + +++  D GK Y V CG RGKLDPPCNAV
Sbjct: 178  YYWHWLVAGCVLVIYFAVIYGTYVPDWQFTVQDRERTDYGKSYTVACGXRGKLDPPCNAV 237

Query: 843  GYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXXXXXXXXX 1022
            GYIDR+VLGI+H+Y RPAW+RSKACT++SPY GP R DAPSWC APFEPEG         
Sbjct: 238  GYIDREVLGISHMYQRPAWKRSKACTENSPYAGPFRNDAPSWCRAPFEPEGIVSSISAIL 297

Query: 1023 XTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYTLSYVCVT 1202
                          M+GH ARL+ WV  G +LL  G+ LHF+ AIP NKQLYT SYVC+T
Sbjct: 298  SAIIGVHFGHVLIHMQGHPARLKHWVPTGCALLALGIILHFSHAIPSNKQLYTFSYVCIT 357

Query: 1203 SGAAALVFSAIYALVDVWNFKFVFLPLAWIGMNAMFVYVMAAEGIFACFINGWYYADPQN 1382
            SG AALVFSA Y +VD+W+ +++FLPL WIGMNAM VYVMAAEGI A F+NGWYY DP N
Sbjct: 358  SGVAALVFSAFYLMVDIWSIRYLFLPLEWIGMNAMLVYVMAAEGILAGFVNGWYYKDPHN 417

Query: 1383 TLIYWI 1400
            TL+YWI
Sbjct: 418  TLVYWI 423


>ref|XP_011087426.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform X1
            [Sesamum indicum]
          Length = 477

 Score =  598 bits (1543), Expect = e-168
 Identities = 289/433 (66%), Positives = 333/433 (76%), Gaps = 7/433 (1%)
 Frame = +3

Query: 123  MGEIKPEITHH-HPLLKSE-PDISEQQE---KPH--FNTQRLASLDIFRGLTVALMILVD 281
            M +IK ++ HH H +L +E P + E++E   KP      +R+ASLDIFRGLTVALMILVD
Sbjct: 1    MTDIKEDVDHHRHLILPTEAPKLQEEEEEKAKPQQISKPKRVASLDIFRGLTVALMILVD 60

Query: 282  HAGGDWPEITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLL 461
             AGG+WP I HAPWNGCNLADFVMPFFLFIVG+AI LA KR+  +  A+ KVILRTLKLL
Sbjct: 61   DAGGEWPVIGHAPWNGCNLADFVMPFFLFIVGMAIPLAFKRIQDRITAISKVILRTLKLL 120

Query: 462  FWGILLQGGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLG 641
            FWG+LLQGGFSHAPD+LTYGVD++ IRWCG+LQRIAL+YL+VS VEI T++   K    G
Sbjct: 121  FWGLLLQGGFSHAPDKLTYGVDMKKIRWCGILQRIALAYLVVSFVEIATRNSLTKGLLPG 180

Query: 642  QFSIFRAYCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKL 821
            +FS+F+ Y WHW++ ACVL VYL  +YGTYVPDWQF + N DS+D GKV  V C VRGKL
Sbjct: 181  KFSVFKLYFWHWMVGACVLTVYLGTLYGTYVPDWQFTVQNTDSSDFGKVLTVACNVRGKL 240

Query: 822  DPPCNAVGYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXX 1001
            DPPCNAVGYIDR VLGINH+Y  PAW+RSKACT  SP++GP R DAP WC APFEPEG  
Sbjct: 241  DPPCNAVGYIDRKVLGINHMYPHPAWKRSKACTNSSPHEGPFRADAPPWCWAPFEPEGIL 300

Query: 1002 XXXXXXXXTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYT 1181
                    T            MKGH +RL+ W  MGL LLI G+ LHFT AIPLNKQLYT
Sbjct: 301  SSISAILSTIFGVHFGHVLVHMKGHSSRLQHWTIMGLGLLILGIILHFTDAIPLNKQLYT 360

Query: 1182 LSYVCVTSGAAALVFSAIYALVDVWNFKFVFLPLAWIGMNAMFVYVMAAEGIFACFINGW 1361
            LSYVCVTSG AALVFSA Y +VD+WNFK+ FLPL WIGMNAM VYVMAAEGIFA F+NGW
Sbjct: 361  LSYVCVTSGVAALVFSAFYIMVDIWNFKYPFLPLEWIGMNAMLVYVMAAEGIFAGFVNGW 420

Query: 1362 YYADPQNTLIYWI 1400
            YY DP NTLI+WI
Sbjct: 421  YYDDPHNTLIHWI 433


>ref|XP_010025234.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
            [Eucalyptus grandis]
          Length = 470

 Score =  596 bits (1537), Expect = e-167
 Identities = 281/427 (65%), Positives = 328/427 (76%), Gaps = 1/427 (0%)
 Frame = +3

Query: 123  MGEIKPEIT-HHHPLLKSEPDISEQQEKPHFNTQRLASLDIFRGLTVALMILVDHAGGDW 299
            M EIK +    HH L    PD     +KP    +R+ASLDIFRGLTVALMILVD AGG+W
Sbjct: 1    MAEIKAQAAPRHHELPLVMPDAGASGDKPR-KPKRVASLDIFRGLTVALMILVDDAGGEW 59

Query: 300  PEITHAPWNGCNLADFVMPFFLFIVGVAIALALKRVPKQGDAVMKVILRTLKLLFWGILL 479
            P I HAPWNGCNLADFVMPFFLFIVG+AI LALKR+P +  A+ KVILRTLKLLFWG+LL
Sbjct: 60   PMIGHAPWNGCNLADFVMPFFLFIVGMAIPLALKRIPDRFAAIKKVILRTLKLLFWGLLL 119

Query: 480  QGGFSHAPDELTYGVDVRMIRWCGVLQRIALSYLLVSMVEIYTKDVQDKDRSLGQFSIFR 659
            QGG+SHAPD+L+YGVD++MIRWCG+LQRIAL+YL+V++VEI+TK+ Q+   S   FSIFR
Sbjct: 120  QGGYSHAPDKLSYGVDMKMIRWCGILQRIALAYLVVALVEIFTKNPQENYESSSSFSIFR 179

Query: 660  AYCWHWLMAACVLIVYLALIYGTYVPDWQFIIINKDSADCGKVYNVTCGVRGKLDPPCNA 839
             YCWHWL  ACV+ VYLA++YG YVPDWQF + ++D+ + GK + VTCG RG LDPPCNA
Sbjct: 180  KYCWHWLFGACVVTVYLAILYGMYVPDWQFTVQDRDNPNYGKFFRVTCGTRGNLDPPCNA 239

Query: 840  VGYIDRDVLGINHLYHRPAWRRSKACTQDSPYDGPLRKDAPSWCLAPFEPEGXXXXXXXX 1019
            VGY+DR+VLGINH+Y  PAW+RSK CT +SPY+G +R DAP+WC APFEPEG        
Sbjct: 240  VGYVDREVLGINHMYQHPAWKRSKDCTVNSPYEGLIRNDAPAWCYAPFEPEGILSSISAI 299

Query: 1020 XXTXXXXXXXXXXXXMKGHLARLRQWVTMGLSLLIFGLTLHFTQAIPLNKQLYTLSYVCV 1199
                           ++ H  RL+ WV MGLSLLI G+ LHFT AIPLNKQLYTLSYVCV
Sbjct: 300  LSAIIGVHCGHVLLHLENHSTRLKHWVLMGLSLLILGIILHFTHAIPLNKQLYTLSYVCV 359

Query: 1200 TSGAAALVFSAIYALVDVWNFKFVFLPLAWIGMNAMFVYVMAAEGIFACFINGWYYADPQ 1379
            T+GAAALVFSA Y LVDVW     FLPL WIGMNAM VYVMAAEGIFA FINGWYY DP 
Sbjct: 360  TAGAAALVFSAFYVLVDVWGLHLPFLPLKWIGMNAMLVYVMAAEGIFAAFINGWYYDDPH 419

Query: 1380 NTLIYWI 1400
            N L++WI
Sbjct: 420  NNLVHWI 426

