BLASTX nr result
ID: Coptis24_contig00001051
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis24_contig00001051 (1869 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002320190.1| predicted protein [Populus trichocarpa] gi|2... 645 0.0 emb|CBI34463.3| unnamed protein product [Vitis vinifera] 638 e-180 ref|XP_002301402.1| predicted protein [Populus trichocarpa] gi|2... 634 e-179 dbj|BAG80556.1| UDP-glucose:glucosyltransferase [Lycium barbarum] 634 e-179 sp|Q9AR73.1|HQGT_RAUSE RecName: Full=Hydroquinone glucosyltransf... 619 e-175 >ref|XP_002320190.1| predicted protein [Populus trichocarpa] gi|222860963|gb|EEE98505.1| predicted protein [Populus trichocarpa] Length = 478 Score = 645 bits (1665), Expect = 0.0 Identities = 311/470 (66%), Positives = 374/470 (79%), Gaps = 2/470 (0%) Frame = +3 Query: 141 MGEIQQNPHIIVLPTPGMGHLIPLIEFAKRLVLRHDFSFTFTIPSDGSPSEAQKTVLGSL 320 M E PH+ +LP+PGMGHLIPL+E AKRLV +H+ S TF IP+DGSPS+AQ++VLGSL Sbjct: 1 MAETDSPPHVAILPSPGMGHLIPLVELAKRLVHQHNLSVTFIIPTDGSPSKAQRSVLGSL 60 Query: 321 PDSIDYIFLPPVNFDDLSGDVKIEXXXXXXXXXXXXXXXEVLKEMV-DNKRVVALVVDLF 497 P +I +FLPPVN DL DVKIE +VL +V RVVALVVDLF Sbjct: 61 PSTIHSVFLPPVNLSDLPEDVKIETLISLTVARSLPSLRDVLSSLVASGTRVVALVVDLF 120 Query: 498 GTDAFDVAKEFKLPSYIFYPSTANALSLFLELPKLDEMYSCEYRDLLEPVKLPGCVPLHG 677 GTDAFDVA+EFK YIFYP+ A ALSLF LPKLDEM SCEY ++ EPV++PGC+P+HG Sbjct: 121 GTDAFDVAREFKASPYIFYPAPAMALSLFFYLPKLDEMVSCEYSEMQEPVEIPGCLPIHG 180 Query: 678 RDFLDPLQDRKNEAYTWLLHHSNRYKLAEGILVNSFVDLEPDTFEALKEEKPDRPPIYPV 857 + LDP +DRKN+AY WLLHHS RY+LAEG++VNSF+DLE +AL+E +P +PP+YPV Sbjct: 181 GELLDPTRDRKNDAYKWLLHHSKRYRLAEGVMVNSFIDLERGALKALQEVEPGKPPVYPV 240 Query: 858 GPLIQSGSTNKGADGSDCLKWLDEQPHGSVLFISFGSGGTLSKEQLNELAIGLEMSEQRF 1037 GPL+ S G +GS+CLKWLD+QP GSVLF+SFGSGGTLS +Q+ ELA+GLEMSEQRF Sbjct: 241 GPLVNMDSNTSGVEGSECLKWLDDQPLGSVLFVSFGSGGTLSFDQITELALGLEMSEQRF 300 Query: 1038 LWVVRSPSEKAANASYFSAQSAKDPFDFLPKGFVERTKGLGLVVPSWAPQVQVLSHGSTG 1217 LWV R P++K ANA+YFS + KDPFDFLPKGF++RTKG GLVVPSWAPQ QVLSHGSTG Sbjct: 301 LWVARVPNDKVANATYFSVDNHKDPFDFLPKGFLDRTKGRGLVVPSWAPQAQVLSHGSTG 360 Query: 1218 GFLSHCGWNSTLESIVNGVPLIAWPLFAEQKMNAVMLDY-MKVALRPKFDENGIVRRDEI 1394 GFL+HCGWNSTLES+VN VPLI WPL+AEQKMNA ML ++VALRPK ENG++ R+EI Sbjct: 361 GFLTHCGWNSTLESVVNAVPLIVWPLYAEQKMNAWMLTKDVEVALRPKASENGLIGREEI 420 Query: 1395 AKVVKGLMEGEAGKKLRNKMKDLKSAAGTVLTEDGSSMKSLCEVALKWKN 1544 A +V+GLMEGE GK++RN+MKDLK AA VL+E GSS K+L EVA KWKN Sbjct: 421 ANIVRGLMEGEEGKRVRNRMKDLKDAAAEVLSEAGSSTKALSEVARKWKN 470 >emb|CBI34463.3| unnamed protein product [Vitis vinifera] Length = 468 Score = 638 bits (1646), Expect = e-180 Identities = 310/461 (67%), Positives = 369/461 (80%) Frame = +3 Query: 162 PHIIVLPTPGMGHLIPLIEFAKRLVLRHDFSFTFTIPSDGSPSEAQKTVLGSLPDSIDYI 341 PHI ++P PGMGHLIPLIEFA+RLVL H+FS TF IP+DGSP QK+VL +LP SI+Y+ Sbjct: 6 PHIAIVPNPGMGHLIPLIEFARRLVLHHNFSVTFLIPTDGSPVTPQKSVLKALPTSINYV 65 Query: 342 FLPPVNFDDLSGDVKIEXXXXXXXXXXXXXXXEVLKEMVDNKRVVALVVDLFGTDAFDVA 521 FLPPV FDDL DV+IE + L+ + ++ R+VALVVDLFGTDAFDVA Sbjct: 66 FLPPVAFDDLPEDVRIETRISLSMTRSVPALRDSLRTLTESTRLVALVVDLFGTDAFDVA 125 Query: 522 KEFKLPSYIFYPSTANALSLFLELPKLDEMYSCEYRDLLEPVKLPGCVPLHGRDFLDPLQ 701 EF +P YIF+P+TA LSL +P+LD+ +SCEYRDL EPVK PGCVP+ GRD +DPLQ Sbjct: 126 NEFGIPPYIFFPTTAMVLSLIFHVPELDQKFSCEYRDLPEPVKFPGCVPVQGRDLIDPLQ 185 Query: 702 DRKNEAYTWLLHHSNRYKLAEGILVNSFVDLEPDTFEALKEEKPDRPPIYPVGPLIQSGS 881 DRKNEAY W++HH+ RYK GI+VNSF+DLEP F+ALKE +PD PP+YPVGPL +SGS Sbjct: 186 DRKNEAYKWVVHHAKRYKTGPGIIVNSFMDLEPGAFKALKEIEPDYPPVYPVGPLTRSGS 245 Query: 882 TNKGADGSDCLKWLDEQPHGSVLFISFGSGGTLSKEQLNELAIGLEMSEQRFLWVVRSPS 1061 TN G DGS+CL WLD QP GSVLF+SFGSGGTLS+EQ+ ELA+GLEMS QRFLWVV+SP Sbjct: 246 TN-GDDGSECLTWLDHQPSGSVLFVSFGSGGTLSQEQITELALGLEMSGQRFLWVVKSPH 304 Query: 1062 EKAANASYFSAQSAKDPFDFLPKGFVERTKGLGLVVPSWAPQVQVLSHGSTGGFLSHCGW 1241 E AANAS+FSAQ+ KDPFDFLPKGF++RT+GLGLVV SWAPQVQVLSHGSTGGFL+HCGW Sbjct: 305 ETAANASFFSAQTIKDPFDFLPKGFLDRTQGLGLVVSSWAPQVQVLSHGSTGGFLTHCGW 364 Query: 1242 NSTLESIVNGVPLIAWPLFAEQKMNAVMLDYMKVALRPKFDENGIVRRDEIAKVVKGLME 1421 NSTLE+IV GVP+IAWPLFAEQ+MNA +L A + NG+V R+EIAK VK L+E Sbjct: 365 NSTLETIVQGVPIIAWPLFAEQRMNATLLANDLKAAVTLNNNNGLVSREEIAKTVKSLIE 424 Query: 1422 GEAGKKLRNKMKDLKSAAGTVLTEDGSSMKSLCEVALKWKN 1544 GE GK +RNK+KDLK AA L++DGSS +SL EVA WKN Sbjct: 425 GEKGKMIRNKIKDLKDAATMALSQDGSSTRSLAEVAQIWKN 465 >ref|XP_002301402.1| predicted protein [Populus trichocarpa] gi|222843128|gb|EEE80675.1| predicted protein [Populus trichocarpa] Length = 469 Score = 634 bits (1635), Expect = e-179 Identities = 308/471 (65%), Positives = 375/471 (79%), Gaps = 2/471 (0%) Frame = +3 Query: 141 MGEIQQNPHIIVLPTPGMGHLIPLIEFAKRLVLRHDFSFTFTIPSDGSPSEAQKTVLGSL 320 M + H+ +LP+PGMGHLIPL+E AKRLV +H+FS TF IP+DGS S+AQ++VLGSL Sbjct: 1 MAQTDAPAHVAILPSPGMGHLIPLVELAKRLVHQHNFSITFVIPTDGSTSKAQRSVLGSL 60 Query: 321 PDSIDYIFLPPVNFDDLSGDVKIEXXXXXXXXXXXXXXXEVLKEMVDN-KRVVALVVDLF 497 P +I +FLP VN DL DVKIE +V + +VD RVVALVVDLF Sbjct: 61 PSAIHSVFLPQVNLSDLPEDVKIETTISHTVARSLPSLRDVFRSLVDGGARVVALVVDLF 120 Query: 498 GTDAFDVAKEFKLPSYIFYPSTANALSLFLELPKLDEMYSCEYRDLLEPVKLPGCVPLHG 677 GTDAFDVA+EF + YIF+PSTA ALSLF LPKLDEM SCEYR++ EPVK+PGC+P+HG Sbjct: 121 GTDAFDVAREFNVSPYIFFPSTAMALSLFFHLPKLDEMVSCEYREMQEPVKIPGCLPIHG 180 Query: 678 RDFLDPLQDRKNEAYTWLLHHSNRYKLAEGILVNSFVDLEPDTFEALKEEKPDRPPIYPV 857 + LDP QDRKN+AY WLL+H+NRY++AEG++VNSF+DLE +AL+E +P +P +YPV Sbjct: 181 GELLDPTQDRKNDAYKWLLYHTNRYRMAEGVMVNSFMDLEKGALKALQEVEPGKPTVYPV 240 Query: 858 GPLIQSGSTNKGADGSDCLKWLDEQPHGSVLFISFGSGGTLSKEQLNELAIGLEMSEQRF 1037 GPL+ S+ G +GS+CL+WLD+QPHGSVLF+SFGSGGTLS +Q+ ELA+GLEMSEQRF Sbjct: 241 GPLVNMDSS-AGVEGSECLRWLDDQPHGSVLFVSFGSGGTLSLDQITELALGLEMSEQRF 299 Query: 1038 LWVVRSPSEKAANASYFSAQSAKDPFDFLPKGFVERTKGLGLVVPSWAPQVQVLSHGSTG 1217 LWVVRSP++K +NA++FS S KDPFDFLPKGF +RTKG GL VPSWAPQ QVL HGSTG Sbjct: 300 LWVVRSPNDKVSNATFFSVDSHKDPFDFLPKGFSDRTKGRGLAVPSWAPQPQVLGHGSTG 359 Query: 1218 GFLSHCGWNSTLESIVNGVPLIAWPLFAEQKMNAVMLDY-MKVALRPKFDENGIVRRDEI 1394 GFL+HCGWNSTLES+VNGVPLI WPL+AEQKMNA ML +KVALRPK ENG++ R+EI Sbjct: 360 GFLTHCGWNSTLESVVNGVPLIVWPLYAEQKMNAWMLTKDIKVALRPKASENGLIGREEI 419 Query: 1395 AKVVKGLMEGEAGKKLRNKMKDLKSAAGTVLTEDGSSMKSLCEVALKWKNQ 1547 A V+GLMEGE GK++RN+MKDLK AA VL+EDG SL E+A KWKNQ Sbjct: 420 ANAVRGLMEGEEGKRVRNRMKDLKEAAARVLSEDG----SLSELAHKWKNQ 466 >dbj|BAG80556.1| UDP-glucose:glucosyltransferase [Lycium barbarum] Length = 476 Score = 634 bits (1634), Expect = e-179 Identities = 312/468 (66%), Positives = 376/468 (80%), Gaps = 3/468 (0%) Frame = +3 Query: 162 PHIIVLPTPGMGHLIPLIEFAKRLVLRHDFSFTFTIPSDGSPSEAQKTVLGSLPDSIDYI 341 PHI +LP+PGMGHLIPL+EF+KRL+ H FS T +P+DG S AQK L SLP S+DY Sbjct: 9 PHIAILPSPGMGHLIPLVEFSKRLIQNHHFSVTLILPTDGPVSNAQKIYLNSLPCSMDYH 68 Query: 342 FLPPVNFDDLSGDVKIEXXXXXXXXXXXXXXXEVLKEMVDNKRVVALVVDLFGTDAFDVA 521 LPPVNFDDL D K+E EV K +V+ K+ VALVVDLFGTDAFDVA Sbjct: 69 LLPPVNFDDLPLDTKMETRISLTVTRSLPSLREVFKTLVETKKTVALVVDLFGTDAFDVA 128 Query: 522 KEFKLPSYIFYPSTANALSLFLELPKLDEMYSCEYRDLLEPVKLPGCVPLHGRDFLDPLQ 701 +FK+ YIFYPSTA ALSLFL LPKLDE SCEY DL +PV++PGC+P+HG+D LDP+Q Sbjct: 129 NDFKVSPYIFYPSTAMALSLFLYLPKLDETVSCEYTDLPDPVQIPGCIPIHGKDLLDPVQ 188 Query: 702 DRKNEAYTWLLHHSNRYKLAEGILVNSFVDLEPDTFEALKEEKPDRPPIYPVGPLIQ--S 875 DRKNEAY W+LHHS RY++AEGI+ NSF +LE +AL+EE+P +PP+YPVGPLIQ S Sbjct: 189 DRKNEAYKWVLHHSKRYRMAEGIVANSFKELEGGAIKALQEEEPGKPPVYPVGPLIQMDS 248 Query: 876 GSTNKGADGSDCLKWLDEQPHGSVLFISFGSGGTLSKEQLNELAIGLEMSEQRFLWVVRS 1055 GS +K AD S+CL WLDEQP GSVL+ISFGSGGTLS EQ+ ELA GLEMSEQRFLWV+R+ Sbjct: 249 GSGSK-ADRSECLTWLDEQPRGSVLYISFGSGGTLSHEQMIELASGLEMSEQRFLWVIRT 307 Query: 1056 PSEKAANASYFSAQSAKDPFDFLPKGFVERTKGLGLVVPSWAPQVQVLSHGSTGGFLSHC 1235 P++K A+A+YF+ Q + +P DFLPKGF+E+TKGLGLVVP+WAPQ Q+L HGST GFL+HC Sbjct: 308 PNDKMASATYFNVQDSTNPLDFLPKGFLEKTKGLGLVVPNWAPQAQILGHGSTSGFLTHC 367 Query: 1236 GWNSTLESIVNGVPLIAWPLFAEQKMNAVML-DYMKVALRPKFDENGIVRRDEIAKVVKG 1412 GWNSTLES+V+GVP IAWPL+AEQKMNAVML + +KVALRPK +ENGIV R EIAKVVKG Sbjct: 368 GWNSTLESVVHGVPFIAWPLYAEQKMNAVMLSEDIKVALRPKANENGIVGRLEIAKVVKG 427 Query: 1413 LMEGEAGKKLRNKMKDLKSAAGTVLTEDGSSMKSLCEVALKWKNQCSS 1556 LMEGE GK +R++M+DLK AA VL+EDGSS K+L E+A K K + S+ Sbjct: 428 LMEGEEGKVVRSRMRDLKDAAAKVLSEDGSSTKALAELATKLKKKVSN 475 >sp|Q9AR73.1|HQGT_RAUSE RecName: Full=Hydroquinone glucosyltransferase; AltName: Full=Arbutin synthase gi|13508844|emb|CAC35167.1| arbutin synthase [Rauvolfia serpentina] Length = 470 Score = 619 bits (1597), Expect = e-175 Identities = 298/470 (63%), Positives = 371/470 (78%), Gaps = 1/470 (0%) Frame = +3 Query: 150 IQQNPHIIVLPTPGMGHLIPLIEFAKRLVLRHDFSFTFTIPSDGSPSEAQKTVLGSLPDS 329 ++ PHI ++PTPGMGHLIPL+EFAKRLVLRH+F TF IP+DG +AQK+ L +LP Sbjct: 1 MEHTPHIAMVPTPGMGHLIPLVEFAKRLVLRHNFGVTFIIPTDGPLPKAQKSFLDALPAG 60 Query: 330 IDYIFLPPVNFDDLSGDVKIEXXXXXXXXXXXXXXXEVLKEMVDNKRVVALVVDLFGTDA 509 ++Y+ LPPV+FDDL DV+IE + +K ++ ++ ALVVDLFGTDA Sbjct: 61 VNYVLLPPVSFDDLPADVRIETRICLTITRSLPFVRDAVKTLLATTKLAALVVDLFGTDA 120 Query: 510 FDVAKEFKLPSYIFYPSTANALSLFLELPKLDEMYSCEYRDLLEPVKLPGCVPLHGRDFL 689 FDVA EFK+ YIFYP+TA LSLF LPKLD+M SCEYRD+ EP+++PGC+P+HG+DFL Sbjct: 121 FDVAIEFKVSPYIFYPTTAMCLSLFFHLPKLDQMVSCEYRDVPEPLQIPGCIPIHGKDFL 180 Query: 690 DPLQDRKNEAYTWLLHHSNRYKLAEGILVNSFVDLEPDTFEALKEEKPDRPPIYPVGPLI 869 DP QDRKN+AY LLH + RY+LAEGI+VN+F DLEP +AL+EE +PP+YP+GPLI Sbjct: 181 DPAQDRKNDAYKCLLHQAKRYRLAEGIMVNTFNDLEPGPLKALQEEDQGKPPVYPIGPLI 240 Query: 870 QSGSTNKGADGSDCLKWLDEQPHGSVLFISFGSGGTLSKEQLNELAIGLEMSEQRFLWVV 1049 ++ S++K D +CLKWLD+QP GSVLFISFGSGG +S Q ELA+GLEMSEQRFLWVV Sbjct: 241 RADSSSK-VDDCECLKWLDDQPRGSVLFISFGSGGAVSHNQFIELALGLEMSEQRFLWVV 299 Query: 1050 RSPSEKAANASYFSAQSAKDPFDFLPKGFVERTKGLGLVVPSWAPQVQVLSHGSTGGFLS 1229 RSP++K ANA+YFS Q+ D +LP+GF+ERTKG L+VPSWAPQ ++LSHGSTGGFL+ Sbjct: 300 RSPNDKIANATYFSIQNQNDALAYLPEGFLERTKGRCLLVPSWAPQTEILSHGSTGGFLT 359 Query: 1230 HCGWNSTLESIVNGVPLIAWPLFAEQKMNAVML-DYMKVALRPKFDENGIVRRDEIAKVV 1406 HCGWNS LES+VNGVPLIAWPL+AEQKMNAVML + +KVALRPK ENG++ R EIA V Sbjct: 360 HCGWNSILESVVNGVPLIAWPLYAEQKMNAVMLTEGLKVALRPKAGENGLIGRVEIANAV 419 Query: 1407 KGLMEGEAGKKLRNKMKDLKSAAGTVLTEDGSSMKSLCEVALKWKNQCSS 1556 KGLMEGE GKK R+ MKDLK AA L++DGSS K+L E+A KW+N+ SS Sbjct: 420 KGLMEGEEGKKFRSTMKDLKDAASRALSDDGSSTKALAELACKWENKISS 469