BLASTX nr result
ID: Coptis24_contig00005292
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00005292
         (1580 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

sp|Q9AR73.1|HQGT_RAUSE RecName: Full=Hydroquinone glucosyltransf...   477   e-132
dbj|BAG80556.1| UDP-glucose:glucosyltransferase [Lycium barbarum]     474   e-131
ref|XP_002303638.1| predicted protein [Populus trichocarpa] gi|2...   468   e-129
gb|ACB56923.1| glycosyltransferase UGT72B11 [Hieracium pilosella]     462   e-127
emb|CBI34463.3| unnamed protein product [Vitis vinifera]              460   e-127

>sp|Q9AR73.1|HQGT_RAUSE RecName: Full=Hydroquinone glucosyltransferase; AltName: Full=Arbutin synthase
 gi|13508844|emb|CAC35167.1| arbutin synthase [Rauvolfia serpentina]
          Length = 470

 Score = 477 bits (1227), Expect = e-132
 Identities = 240/464 (51%), Positives = 312/464 (67%), Gaps = 6/464 (1%)
 Frame = +3

Query: 21   QQTPHIIIHPSPGMGHLIPLAEFAKRLVLLHDFTVTLVIPIENNFPTESMKLFLIVLPEG 200
            + TPHI + P+PGMGHLIPL EFAKRLVL H+F VT +IP + P ++ K FL LP G
Sbjct: 2    EHTPHIAMVPTPGMGHLIPLVEFAKRLVLRHNFGVTFIIPTDGPLP-KAQKSFLDALPAG 60

Query: 201  IDYIFLPSVNFDDLPHDTRAEARICLLMNRSIPLLRHTFMNITSTNHVVALVVDLFGSEA 380
            ++Y+ LP V+FDDLP D R E RICL + RS+P +R + +T + ALVVDLFG++A
Sbjct: 61   VNYVLLPPVSFDDLPADVRIETRICLTITRSLPFVRDAVKTLLATTKLAALVVDLFGTDA 120

Query: 381  FEVAREFKVPPYMFFPASAMSXXXXXXXXXXDETYDCEFRDIPEPIKLPGCVPFKGIDVM 560
            F+VA EFKV PY+F+P +AM D+ CE+RD+PEP+++PGC+P G D +
Sbjct: 121  FDVAIEFKVSPYIFYPTTAMCLSLFFHLPKLDQMVSCEYRDVPEPLQIPGCIPIHGKDFL 180

Query: 561  DGAKDKTNVAYKMVLHISTLYSSAEGILINTFGSLENETLKALNQ--LGKPPIFPVGPLI 734
            D A+D+ N AYK +LH + Y AEGI++NTF LE LKAL + GKPP++P+GPLI
Sbjct: 181  DPAQDRKNDAYKCLLHQAKRYRLAEGIMVNTFNDLEPGPLKALQEEDQGKPPVYPIGPLI 240

Query: 735  QLKITS--DDSECIKFLNEQPRXXXXXXXXXXXXTXXXXXXXXXXXXXXXXXXRFLWVIR 908
            + +S DD EC+K+L++QPR RFLWV+R
Sbjct: 241  RADSSSKVDDCECLKWLDDQPRGSVLFISFGSGGAVSHNQFIELALGLEMSEQRFLWVVR 300

Query: 909  SPSDA-TNATYFE-QKIEDPFDFLPNGFLDRTKGLGLVVPSWAPQIQVLSHVSTGGFVTH 1082
            SP+D NATYF Q D +LP GFL+RTKG L+VPSWAPQ ++LSH STGGF+TH
Sbjct: 301  SPNDKIANATYFSIQNQNDALAYLPEGFLERTKGRCLLVPSWAPQTEILSHGSTGGFLTH 360

Query: 1083 CGWNSILESIMHGVPLIAWPLYAEQKMNAEMITAGLKIALRPKVDEKFIVDREEVARTVK 1262
            CGWNSILES+++GVPLIAWPLYAEQKMNA M+T GLK+ALRPK E ++ R E+A VK
Sbjct: 361  CGWNSILESVVNGVPLIAWPLYAEQKMNAVMLTEGLKVALRPKAGENGLIGRVEIANAVK 420

Query: 1263 CLMREEVGNRIRERINELKDAGFMLLNEDGSSTKALSEVANMWK 1394
            LM E G + R + +LKDA L++DGSSTKAL+E+A W+
Sbjct: 421  GLMEGEEGKKFRSTMKDLKDAASRALSDDGSSTKALAELACKWE 464

>dbj|BAG80556.1| UDP-glucose:glucosyltransferase [Lycium barbarum]
          Length = 476

 Score = 474 bits (1220), Expect = e-131
 Identities = 243/465 (52%), Positives = 313/465 (67%), Gaps = 8/465 (1%)
 Frame = +3

Query: 27   TPHIIIHPSPGMGHLIPLAEFAKRLVLLHDFTVTLVIPIENNFPTESMKLFLIVLPEGID 206
            TPHI I PSPGMGHLIPL EF+KRL+ H F+VTL++P + + + K++L LP +D
Sbjct: 8    TPHIAILPSPGMGHLIPLVEFSKRLIQNHHFSVTLILPTDGPV-SNAQKIYLNSLPCSMD 66

Query: 207  YIFLPSVNFDDLPHDTRAEARICLLMNRSIPLLRHTFMNITSTNHVVALVVDLFGSEAFE 386
            Y LP VNFDDLP DT+ E RI L + RS+P LR F + T VALVVDLFG++AF+
Sbjct: 67   YHLLPPVNFDDLPLDTKMETRISLTVTRSLPSLREVFKTLVETKKTVALVVDLFGTDAFD 126

Query: 387  VAREFKVPPYMFFPASAMSXXXXXXXXXXDETYDCEFRDIPEPIKLPGCVPFKGIDVMDG 566
            VA +FKV PY+F+P++AM+ DET CE+ D+P+P+++PGC+P G D++D
Sbjct: 127  VANDFKVSPYIFYPSTAMALSLFLYLPKLDETVSCEYTDLPDPVQIPGCIPIHGKDLLDP 186

Query: 567  AKDKTNVAYKMVLHISTLYSSAEGILINTFGSLENETLKALNQL--GKPPIFPVGPLIQL 740
            +D+ N AYK VLH S Y AEGI+ N+F LE +KAL + GKPP++PVGPLIQ+
Sbjct: 187  VQDRKNEAYKWVLHHSKRYRMAEGIVANSFKELEGGAIKALQEEEPGKPPVYPVGPLIQM 246

Query: 741  KITS----DDSECIKFLNEQPRXXXXXXXXXXXXTXXXXXXXXXXXXXXXXXXRFLWVIR 908
            S D SEC+ +L+EQPR T RFLWVIR
Sbjct: 247  DSGSGSKADRSECLTWLDEQPRGSVLYISFGSGGTLSHEQMIELASGLEMSEQRFLWVIR 306

Query: 909  SPSDA-TNATYFE-QKIEDPFDFLPNGFLDRTKGLGLVVPSWAPQIQVLSHVSTGGFVTH 1082
            +P+D +ATYF Q +P DFLP GFL++TKGLGLVVP+WAPQ Q+L H ST GF+TH
Sbjct: 307  TPNDKMASATYFNVQDSTNPLDFLPKGFLEKTKGLGLVVPNWAPQAQILGHGSTSGFLTH 366

Query: 1083 CGWNSILESIMHGVPLIAWPLYAEQKMNAEMITAGLKIALRPKVDEKFIVDREEVARTVK 1262
            CGWNS LES++HGVP IAWPLYAEQKMNA M++ +K+ALRPK +E IV R E+A+ VK
Sbjct: 367  CGWNSTLESVVHGVPFIAWPLYAEQKMNAVMLSEDIKVALRPKANENGIVGRLEIAKVVK 426

Query: 1263 CLMREEVGNRIRERINELKDAGFMLLNEDGSSTKALSEVANMWKK 1397
            LM E G +R R+ +LKDA +L+EDGSSTKAL+E+A KK
Sbjct: 427  GLMEGEEGKVVRSRMRDLKDAAAKVLSEDGSSTKALAELATKLKK 471

>ref|XP_002303638.1| predicted protein [Populus trichocarpa]
 gi|222841070|gb|EEE78617.1| predicted protein [Populus trichocarpa]
          Length = 476

 Score = 468 bits (1204), Expect = e-129
 Identities = 241/466 (51%), Positives = 315/466 (67%), Gaps = 8/466 (1%)
 Frame = +3

Query: 21   QQTPHIIIHPSPGMGHLIPLAEFAKRLVLLHDFTVTLVIPIENNFPTESMKLFLIVLPEG 200
            + +P ++I PSPGMGHLIP E AK+LV H+F+VT +IP + + P + + L LP+G
Sbjct: 8    EASPQVVIVPSPGMGHLIPFVELAKKLVHQHNFSVTFIIPNDGS-PMKPHRQLLQALPKG 66

Query: 201  IDYIFLPSVNFDDLPHDTRAEARICLLMNRSIPLLRHTFMNITSTNHVVALVVDLFGSEA 380
            + +FLP VNFDDLP D E RI L + RS+ LR + +T + VVALVVD FG A
Sbjct: 67   VSSVFLPPVNFDDLPPDVLMETRITLSLTRSLDALRDSLKTLTDSTKVVALVVDFFGPFA 126

Query: 381  FEVAREFKVPPYMFFPASAMSXXXXXXXXXXDETYDCEFRDIPEPIKLPGCVPFKGIDVM 560
            FE+A+EF V P++FFP SAM DETY E++D+ EP++LPGCVP +G D++
Sbjct: 127  FEIAKEFDVLPFVFFPTSAMLLSLSFHLPRLDETYSGEYKDMTEPVRLPGCVPVQGRDLV 186

Query: 561  DGAKDKTNVAYKMVLHISTLYSSAEGILINTFGSLENETLKAL---NQLGKPPIFPVGPL 731
            D +DK + AYK +LH+ LY+SA GI+IN+F LE KAL N +GKPP++PVGPL
Sbjct: 187  DPVQDKKDDAYKWILHLCKLYNSAAGIMINSFIDLEPGAFKALMEENNIGKPPVYPVGPL 246

Query: 732  IQLKITSDD---SECIKFLNEQPRXXXXXXXXXXXXTXXXXXXXXXXXXXXXXXXRFLWV 902
            Q+ TS D SEC+ +L++QP+ T RFLWV
Sbjct: 247  TQIGSTSGDVGESECLNWLDKQPKGSVLFVSFGSGGTLSHAQLNELSLGLEMSRQRFLWV 306

Query: 903  IRSPSD-ATNATYFE-QKIEDPFDFLPNGFLDRTKGLGLVVPSWAPQIQVLSHVSTGGFV 1076
            +RSP D ATNATYF + +DP FLP GFLDRTKG+GLVVPSWAPQIQVLSH STGGF+
Sbjct: 307  VRSPHDEATNATYFGIRSSDDPLAFLPEGFLDRTKGVGLVVPSWAPQIQVLSHSSTGGFL 366

Query: 1077 THCGWNSILESIMHGVPLIAWPLYAEQKMNAEMITAGLKIALRPKVDEKFIVDREEVART 1256
            THCGWNSILESI++GVPLIAWPLYAEQ+MN+ ++ GLK+ALR KV+E +V +E++A
Sbjct: 367  THCGWNSILESIVNGVPLIAWPLYAEQRMNSVLLADGLKVALRVKVNENGLVMKEDIANY 426

Query: 1257 VKCLMREEVGNRIRERINELKDAGFMLLNEDGSSTKALSEVANMWK 1394
            + + E G I+ ++NELK A L+EDGSSTK+L+EVA +WK
Sbjct: 427  ARSIFEGEEGKSIKSKMNELKSAATRALSEDGSSTKSLAEVARIWK 472

>gb|ACB56923.1| glycosyltransferase UGT72B11 [Hieracium pilosella]
          Length = 466

 Score = 462 bits (1188), Expect = e-127
 Identities = 238/462 (51%), Positives = 306/462 (66%), Gaps = 6/462 (1%)
 Frame = +3

Query: 27   TPHIIIHPSPGMGHLIPLAEFAKRLVLLHDFTVTLVIPIENNFP-TESMKLFLIVLPEGI 203
            TPHI I PSPGMGHLIPL EFAKRL H+ + +IP N+ P ++S FL LP+G+
Sbjct: 4    TPHIAIVPSPGMGHLIPLVEFAKRLNTNHNISAIFIIP--NDGPLSKSQIAFLDSLPDGL 61

Query: 204  DYIFLPSVNFDDLPHDTRAEARICLLMNRSIPLLRHTFMNITSTNHVVALVVDLFGSEAF 383
            Y+ LP VNFDDLP DT E RI L++ RS+P LR F ++ + H+VAL +DLFG++AF
Sbjct: 62   SYLILPPVNFDDLPKDTLMETRISLMVTRSVPSLRQVFKSLVAEKHMVALFIDLFGTDAF 121

Query: 384  EVAREFKVPPYMFFPASAMSXXXXXXXXXXDETYDCEFRDIPEPIKLPGCVPFKGIDVMD 563
            +VA EF V PY+FFP++AM D+ CE+RD+PEP+++PGC+P +G D++D
Sbjct: 122  DVAIEFGVSPYVFFPSTAMVLSMFLNLPRLDQEVSCEYRDLPEPVQIPGCIPVRGEDLLD 181

Query: 564  GAKDKTNVAYKMVLHISTLYSSAEGILINTFGSLENETLKAL--NQLGKPPIFPVGPLIQ 737
            +D+ N AYK VLH + Y AEGI +N+F LE LK L + GKP ++PVGPLIQ
Sbjct: 182  PVQDRKNDAYKWVLHNAKRYRMAEGIAVNSFQELEGGALKVLLEEEPGKPRVYPVGPLIQ 241

Query: 738  LKITSD--DSECIKFLNEQPRXXXXXXXXXXXXTXXXXXXXXXXXXXXXXXXRFLWVIRS 911
            +SD S+C+++L+ QP T RFLWV+RS
Sbjct: 242  SGSSSDLDGSDCLRWLDSQPCGSVLYISFGSGGTLSSTQLNELAMGLELSEQRFLWVVRS 301

Query: 912  PSDATNATYFEQK-IEDPFDFLPNGFLDRTKGLGLVVPSWAPQIQVLSHVSTGGFVTHCG 1088
            P+D NATYF+ DP FLP GFL+RTK G VVPSWAPQ Q+LSH STGGF+THCG
Sbjct: 302  PNDQPNATYFDSHGHNDPLGFLPKGFLERTKNTGFVVPSWAPQAQILSHSSTGGFLTHCG 361

Query: 1089 WNSILESIMHGVPLIAWPLYAEQKMNAEMITAGLKIALRPKVDEKFIVDREEVARTVKCL 1268
            WNSILE+++HGVP+IAWPLYAEQKMNA +T GLK+ALRPKV + IV R E+AR VK L
Sbjct: 362  WNSILETVVHGVPVIAWPLYAEQKMNAVSLTEGLKVALRPKVGDNGIVGRLEIARVVKGL 421

Query: 1269 MREEVGNRIRERINELKDAGFMLLNEDGSSTKALSEVANMWK 1394
            + E G IR RI +LKDA +L +DG STK L ++A+ K
Sbjct: 422  LEGEEGKGIRSRIRDLKDAAANVLGKDGCSTKTLDQLASKLK 463

>emb|CBI34463.3| unnamed protein product [Vitis vinifera]
          Length = 468

 Score = 460 bits (1184), Expect = e-127
 Identities = 235/462 (50%), Positives = 310/462 (67%), Gaps = 6/462 (1%)
 Frame = +3

Query: 27   TPHIIIHPSPGMGHLIPLAEFAKRLVLLHDFTVTLVIPIENNFPTESMKLFLIVLPEGID 206
            TPHI I P+PGMGHLIPL EFA+RLVL H+F+VT +IP + + P K L LP I+
Sbjct: 5    TPHIAIVPNPGMGHLIPLIEFARRLVLHHNFSVTFLIPTDGS-PVTPQKSVLKALPTSIN 63

Query: 207  YIFLPSVNFDDLPHDTRAEARICLLMNRSIPLLRHTFMNITSTNHVVALVVDLFGSEAFE 386
            Y+FLP V FDDLP D R E RI L M RS+P LR + +T + +VALVVDLFG++AF+
Sbjct: 64   YVFLPPVAFDDLPEDVRIETRISLSMTRSVPALRDSLRTLTESTRLVALVVDLFGTDAFD 123

Query: 387  VAREFKVPPYMFFPASAMSXXXXXXXXXXDETYDCEFRDIPEPIKLPGCVPFKGIDVMDG 566
            VA EF +PPY+FFP +AM D+ + CE+RD+PEP+K PGCVP +G D++D
Sbjct: 124  VANEFGIPPYIFFPTTAMVLSLIFHVPELDQKFSCEYRDLPEPVKFPGCVPVQGRDLIDP 183

Query: 567  AKDKTNVAYKMVLHISTLYSSAEGILINTFGSLENETLKALNQL--GKPPIFPVGPLIQL 740
            +D+ N AYK V+H + Y + GI++N+F LE KAL ++ PP++PVGPL +
Sbjct: 184  LQDRKNEAYKWVVHHAKRYKTGPGIIVNSFMDLEPGAFKALKEIEPDYPPVYPVGPLTRS 243

Query: 741  KITS--DDSECIKFLNEQPRXXXXXXXXXXXXTXXXXXXXXXXXXXXXXXXRFLWVIRSP 914
            T+ D SEC+ +L+ QP T RFLWV++SP
Sbjct: 244  GSTNGDDGSECLTWLDHQPSGSVLFVSFGSGGTLSQEQITELALGLEMSGQRFLWVVKSP 303

Query: 915  SD-ATNATYFE-QKIEDPFDFLPNGFLDRTKGLGLVVPSWAPQIQVLSHVSTGGFVTHCG 1088
            + A NA++F Q I+DPFDFLP GFLDRT+GLGLVV SWAPQ+QVLSH STGGF+THCG
Sbjct: 304  HETAANASFFSAQTIKDPFDFLPKGFLDRTQGLGLVVSSWAPQVQVLSHGSTGGFLTHCG 363

Query: 1089 WNSILESIMHGVPLIAWPLYAEQKMNAEMITAGLKIALRPKVDEKFIVDREEVARTVKCL 1268
            WNS LE+I+ GVP+IAWPL+AEQ+MNA ++ LK A+ + +V REE+A+TVK L
Sbjct: 364  WNSTLETIVQGVPIIAWPLFAEQRMNATLLANDLKAAVTLN-NNNGLVSREEIAKTVKSL 422

Query: 1269 MREEVGNRIRERINELKDAGFMLLNEDGSSTKALSEVANMWK 1394
            + E G IR +I +LKDA M L++DGSST++L+EVA +WK
Sbjct: 423  IEGEKGKMIRNKIKDLKDAATMALSQDGSSTRSLAEVAQIWK 464
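
The per-hit statistics in the report above (Score, Expect, Identities) follow a fixed textual pattern, so they can be pulled out with a regular expression. The following is a minimal sketch, assuming legacy BLAST text output like this report; the names `HIT_STATS`, `parse_evalue`, and `parse_hit_stats` are illustrative and not part of BLAST. Note that this BLAST version abbreviates an E-value of 1e-132 as `e-132`, which the sketch expands before converting to a float.

```python
import re

# Statistics printed for each alignment in legacy BLAST text output.
# \s* between fields tolerates both line breaks and flattened single spaces.
HIT_STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s*"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)"
)


def parse_evalue(text):
    """Convert a BLAST E-value string to float; 'e-132' means 1e-132."""
    if text.startswith(("e", "E")):
        text = "1" + text
    return float(text)


def parse_hit_stats(report):
    """Return one dict of statistics per alignment found in the report text."""
    hits = []
    for m in HIT_STATS.finditer(report):
        identities = int(m.group("ident"))
        length = int(m.group("length"))
        hits.append({
            "bits": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "evalue": parse_evalue(m.group("evalue")),
            "identities": identities,
            "align_length": length,
            # Unrounded percent identity; BLAST itself prints an integer.
            "percent_identity": 100.0 * identities / length,
        })
    return hits


if __name__ == "__main__":
    # Statistics line of the first hit, copied from the report.
    sample = (
        "Score = 477 bits (1227), Expect = e-132 "
        "Identities = 240/464 (51%), Positives = 312/464 (67%), "
        "Gaps = 6/464 (1%) Frame = +3"
    )
    for hit in parse_hit_stats(sample):
        print(hit)
```

The sketch does not handle multi-HSP variants such as `Expect(2) =`; those would need an extended pattern.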