BLASTX nr result

ID: Coptis25_contig00002608

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00002608
         (1791 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002320190.1| predicted protein [Populus trichocarpa] gi|2...   645   0.0  
emb|CBI34463.3| unnamed protein product [Vitis vinifera]              638   e-180
ref|XP_002301402.1| predicted protein [Populus trichocarpa] gi|2...   634   e-179
dbj|BAG80556.1| UDP-glucose:glucosyltransferase [Lycium barbarum]     634   e-179
sp|Q9AR73.1|HQGT_RAUSE RecName: Full=Hydroquinone glucosyltransf...   619   e-175

>ref|XP_002320190.1| predicted protein [Populus trichocarpa] gi|222860963|gb|EEE98505.1|
            predicted protein [Populus trichocarpa]
          Length = 478

 Score =  645 bits (1665), Expect = 0.0
 Identities = 311/470 (66%), Positives = 374/470 (79%), Gaps = 2/470 (0%)
 Frame = +3

Query: 129  MGEIQQNPHIIVLPTPGMGHLIPLIEFAKRLVLRHDFSFTFTIPSDGSPSEAQKTVLGSL 308
            M E    PH+ +LP+PGMGHLIPL+E AKRLV +H+ S TF IP+DGSPS+AQ++VLGSL
Sbjct: 1    MAETDSPPHVAILPSPGMGHLIPLVELAKRLVHQHNLSVTFIIPTDGSPSKAQRSVLGSL 60

Query: 309  PDSIDYIFLPPVNFDDLSGDVKIEXXXXXXXXXXXXXXXEVLKEMV-DNKRVVALVVDLF 485
            P +I  +FLPPVN  DL  DVKIE               +VL  +V    RVVALVVDLF
Sbjct: 61   PSTIHSVFLPPVNLSDLPEDVKIETLISLTVARSLPSLRDVLSSLVASGTRVVALVVDLF 120

Query: 486  GTDAFDVAKEFKLPSYIFYPSTANALSLFLELPKLDEMYSCEYRDLLEPVKLPGCVPLHG 665
            GTDAFDVA+EFK   YIFYP+ A ALSLF  LPKLDEM SCEY ++ EPV++PGC+P+HG
Sbjct: 121  GTDAFDVAREFKASPYIFYPAPAMALSLFFYLPKLDEMVSCEYSEMQEPVEIPGCLPIHG 180

Query: 666  RDFLDPLQDRKNEAYTWLLHHSNRYKLAEGILVNSFVDLEPDTFEALKEEKPDRPPIYPV 845
             + LDP +DRKN+AY WLLHHS RY+LAEG++VNSF+DLE    +AL+E +P +PP+YPV
Sbjct: 181  GELLDPTRDRKNDAYKWLLHHSKRYRLAEGVMVNSFIDLERGALKALQEVEPGKPPVYPV 240

Query: 846  GPLIQSGSTNKGADGSDCLKWLDEQPHGSVLFISFGSGGTLSKEQLNELAIGLEMSEQRF 1025
            GPL+   S   G +GS+CLKWLD+QP GSVLF+SFGSGGTLS +Q+ ELA+GLEMSEQRF
Sbjct: 241  GPLVNMDSNTSGVEGSECLKWLDDQPLGSVLFVSFGSGGTLSFDQITELALGLEMSEQRF 300

Query: 1026 LWVVRSPSEKAANASYFSAQSAKDPFDFLPKGFVERTKGLGLVVPSWAPQVQVLSHGSTG 1205
            LWV R P++K ANA+YFS  + KDPFDFLPKGF++RTKG GLVVPSWAPQ QVLSHGSTG
Sbjct: 301  LWVARVPNDKVANATYFSVDNHKDPFDFLPKGFLDRTKGRGLVVPSWAPQAQVLSHGSTG 360

Query: 1206 GFLSHCGWNSTLESIVNGVPLIAWPLFAEQKMNAVMLDY-MKVALRPKFDENGIVRRDEI 1382
            GFL+HCGWNSTLES+VN VPLI WPL+AEQKMNA ML   ++VALRPK  ENG++ R+EI
Sbjct: 361  GFLTHCGWNSTLESVVNAVPLIVWPLYAEQKMNAWMLTKDVEVALRPKASENGLIGREEI 420

Query: 1383 AKVVKGLMEGEAGKKLRNKMKDLKSAAGTVLTEDGSSMKSLCEVALKWKN 1532
            A +V+GLMEGE GK++RN+MKDLK AA  VL+E GSS K+L EVA KWKN
Sbjct: 421  ANIVRGLMEGEEGKRVRNRMKDLKDAAAEVLSEAGSSTKALSEVARKWKN 470


>emb|CBI34463.3| unnamed protein product [Vitis vinifera]
          Length = 468

 Score =  638 bits (1646), Expect = e-180
 Identities = 310/461 (67%), Positives = 369/461 (80%), Gaps = 1/461 (0%)
 Frame = +3

Query: 150  PHIIVLPTPGMGHLIPLIEFAKRLVLRHDFSFTFTIPSDGSPSEAQKTVLGSLPDSIDYI 329
            PHI ++P PGMGHLIPLIEFA+RLVL H+FS TF IP+DGSP   QK+VL +LP SI+Y+
Sbjct: 6    PHIAIVPNPGMGHLIPLIEFARRLVLHHNFSVTFLIPTDGSPVTPQKSVLKALPTSINYV 65

Query: 330  FLPPVNFDDLSGDVKIEXXXXXXXXXXXXXXXEVLKEMVDNKRVVALVVDLFGTDAFDVA 509
            FLPPV FDDL  DV+IE               + L+ + ++ R+VALVVDLFGTDAFDVA
Sbjct: 66   FLPPVAFDDLPEDVRIETRISLSMTRSVPALRDSLRTLTESTRLVALVVDLFGTDAFDVA 125

Query: 510  KEFKLPSYIFYPSTANALSLFLELPKLDEMYSCEYRDLLEPVKLPGCVPLHGRDFLDPLQ 689
             EF +P YIF+P+TA  LSL   +P+LD+ +SCEYRDL EPVK PGCVP+ GRD +DPLQ
Sbjct: 126  NEFGIPPYIFFPTTAMVLSLIFHVPELDQKFSCEYRDLPEPVKFPGCVPVQGRDLIDPLQ 185

Query: 690  DRKNEAYTWLLHHSNRYKLAEGILVNSFVDLEPDTFEALKEEKPDRPPIYPVGPLIQSGS 869
            DRKNEAY W++HH+ RYK   GI+VNSF+DLEP  F+ALKE +PD PP+YPVGPL +SGS
Sbjct: 186  DRKNEAYKWVVHHAKRYKTGPGIIVNSFMDLEPGAFKALKEIEPDYPPVYPVGPLTRSGS 245

Query: 870  TNKGADGSDCLKWLDEQPHGSVLFISFGSGGTLSKEQLNELAIGLEMSEQRFLWVVRSPS 1049
            TN G DGS+CL WLD QP GSVLF+SFGSGGTLS+EQ+ ELA+GLEMS QRFLWVV+SP 
Sbjct: 246  TN-GDDGSECLTWLDHQPSGSVLFVSFGSGGTLSQEQITELALGLEMSGQRFLWVVKSPH 304

Query: 1050 EKAANASYFSAQSAKDPFDFLPKGFVERTKGLGLVVPSWAPQVQVLSHGSTGGFLSHCGW 1229
            E AANAS+FSAQ+ KDPFDFLPKGF++RT+GLGLVV SWAPQVQVLSHGSTGGFL+HCGW
Sbjct: 305  ETAANASFFSAQTIKDPFDFLPKGFLDRTQGLGLVVSSWAPQVQVLSHGSTGGFLTHCGW 364

Query: 1230 NSTLESIVNGVPLIAWPLFAEQKMNAVMLDYMKVALRPKFDENGIVRRDEIAKVVKGLME 1409
            NSTLE+IV GVP+IAWPLFAEQ+MNA +L     A     + NG+V R+EIAK VK L+E
Sbjct: 365  NSTLETIVQGVPIIAWPLFAEQRMNATLLANDLKAAVTLNNNNGLVSREEIAKTVKSLIE 424

Query: 1410 GEAGKKLRNKMKDLKSAAGTVLTEDGSSMKSLCEVALKWKN 1532
            GE GK +RNK+KDLK AA   L++DGSS +SL EVA  WKN
Sbjct: 425  GEKGKMIRNKIKDLKDAATMALSQDGSSTRSLAEVAQIWKN 465


>ref|XP_002301402.1| predicted protein [Populus trichocarpa] gi|222843128|gb|EEE80675.1|
            predicted protein [Populus trichocarpa]
          Length = 469

 Score =  634 bits (1635), Expect = e-179
 Identities = 308/471 (65%), Positives = 375/471 (79%), Gaps = 2/471 (0%)
 Frame = +3

Query: 129  MGEIQQNPHIIVLPTPGMGHLIPLIEFAKRLVLRHDFSFTFTIPSDGSPSEAQKTVLGSL 308
            M +     H+ +LP+PGMGHLIPL+E AKRLV +H+FS TF IP+DGS S+AQ++VLGSL
Sbjct: 1    MAQTDAPAHVAILPSPGMGHLIPLVELAKRLVHQHNFSITFVIPTDGSTSKAQRSVLGSL 60

Query: 309  PDSIDYIFLPPVNFDDLSGDVKIEXXXXXXXXXXXXXXXEVLKEMVDN-KRVVALVVDLF 485
            P +I  +FLP VN  DL  DVKIE               +V + +VD   RVVALVVDLF
Sbjct: 61   PSAIHSVFLPQVNLSDLPEDVKIETTISHTVARSLPSLRDVFRSLVDGGARVVALVVDLF 120

Query: 486  GTDAFDVAKEFKLPSYIFYPSTANALSLFLELPKLDEMYSCEYRDLLEPVKLPGCVPLHG 665
            GTDAFDVA+EF +  YIF+PSTA ALSLF  LPKLDEM SCEYR++ EPVK+PGC+P+HG
Sbjct: 121  GTDAFDVAREFNVSPYIFFPSTAMALSLFFHLPKLDEMVSCEYREMQEPVKIPGCLPIHG 180

Query: 666  RDFLDPLQDRKNEAYTWLLHHSNRYKLAEGILVNSFVDLEPDTFEALKEEKPDRPPIYPV 845
             + LDP QDRKN+AY WLL+H+NRY++AEG++VNSF+DLE    +AL+E +P +P +YPV
Sbjct: 181  GELLDPTQDRKNDAYKWLLYHTNRYRMAEGVMVNSFMDLEKGALKALQEVEPGKPTVYPV 240

Query: 846  GPLIQSGSTNKGADGSDCLKWLDEQPHGSVLFISFGSGGTLSKEQLNELAIGLEMSEQRF 1025
            GPL+   S+  G +GS+CL+WLD+QPHGSVLF+SFGSGGTLS +Q+ ELA+GLEMSEQRF
Sbjct: 241  GPLVNMDSS-AGVEGSECLRWLDDQPHGSVLFVSFGSGGTLSLDQITELALGLEMSEQRF 299

Query: 1026 LWVVRSPSEKAANASYFSAQSAKDPFDFLPKGFVERTKGLGLVVPSWAPQVQVLSHGSTG 1205
            LWVVRSP++K +NA++FS  S KDPFDFLPKGF +RTKG GL VPSWAPQ QVL HGSTG
Sbjct: 300  LWVVRSPNDKVSNATFFSVDSHKDPFDFLPKGFSDRTKGRGLAVPSWAPQPQVLGHGSTG 359

Query: 1206 GFLSHCGWNSTLESIVNGVPLIAWPLFAEQKMNAVMLDY-MKVALRPKFDENGIVRRDEI 1382
            GFL+HCGWNSTLES+VNGVPLI WPL+AEQKMNA ML   +KVALRPK  ENG++ R+EI
Sbjct: 360  GFLTHCGWNSTLESVVNGVPLIVWPLYAEQKMNAWMLTKDIKVALRPKASENGLIGREEI 419

Query: 1383 AKVVKGLMEGEAGKKLRNKMKDLKSAAGTVLTEDGSSMKSLCEVALKWKNQ 1535
            A  V+GLMEGE GK++RN+MKDLK AA  VL+EDG    SL E+A KWKNQ
Sbjct: 420  ANAVRGLMEGEEGKRVRNRMKDLKEAAARVLSEDG----SLSELAHKWKNQ 466


>dbj|BAG80556.1| UDP-glucose:glucosyltransferase [Lycium barbarum]
          Length = 476

 Score =  634 bits (1634), Expect = e-179
 Identities = 312/468 (66%), Positives = 376/468 (80%), Gaps = 3/468 (0%)
 Frame = +3

Query: 150  PHIIVLPTPGMGHLIPLIEFAKRLVLRHDFSFTFTIPSDGSPSEAQKTVLGSLPDSIDYI 329
            PHI +LP+PGMGHLIPL+EF+KRL+  H FS T  +P+DG  S AQK  L SLP S+DY 
Sbjct: 9    PHIAILPSPGMGHLIPLVEFSKRLIQNHHFSVTLILPTDGPVSNAQKIYLNSLPCSMDYH 68

Query: 330  FLPPVNFDDLSGDVKIEXXXXXXXXXXXXXXXEVLKEMVDNKRVVALVVDLFGTDAFDVA 509
             LPPVNFDDL  D K+E               EV K +V+ K+ VALVVDLFGTDAFDVA
Sbjct: 69   LLPPVNFDDLPLDTKMETRISLTVTRSLPSLREVFKTLVETKKTVALVVDLFGTDAFDVA 128

Query: 510  KEFKLPSYIFYPSTANALSLFLELPKLDEMYSCEYRDLLEPVKLPGCVPLHGRDFLDPLQ 689
             +FK+  YIFYPSTA ALSLFL LPKLDE  SCEY DL +PV++PGC+P+HG+D LDP+Q
Sbjct: 129  NDFKVSPYIFYPSTAMALSLFLYLPKLDETVSCEYTDLPDPVQIPGCIPIHGKDLLDPVQ 188

Query: 690  DRKNEAYTWLLHHSNRYKLAEGILVNSFVDLEPDTFEALKEEKPDRPPIYPVGPLIQ--S 863
            DRKNEAY W+LHHS RY++AEGI+ NSF +LE    +AL+EE+P +PP+YPVGPLIQ  S
Sbjct: 189  DRKNEAYKWVLHHSKRYRMAEGIVANSFKELEGGAIKALQEEEPGKPPVYPVGPLIQMDS 248

Query: 864  GSTNKGADGSDCLKWLDEQPHGSVLFISFGSGGTLSKEQLNELAIGLEMSEQRFLWVVRS 1043
            GS +K AD S+CL WLDEQP GSVL+ISFGSGGTLS EQ+ ELA GLEMSEQRFLWV+R+
Sbjct: 249  GSGSK-ADRSECLTWLDEQPRGSVLYISFGSGGTLSHEQMIELASGLEMSEQRFLWVIRT 307

Query: 1044 PSEKAANASYFSAQSAKDPFDFLPKGFVERTKGLGLVVPSWAPQVQVLSHGSTGGFLSHC 1223
            P++K A+A+YF+ Q + +P DFLPKGF+E+TKGLGLVVP+WAPQ Q+L HGST GFL+HC
Sbjct: 308  PNDKMASATYFNVQDSTNPLDFLPKGFLEKTKGLGLVVPNWAPQAQILGHGSTSGFLTHC 367

Query: 1224 GWNSTLESIVNGVPLIAWPLFAEQKMNAVML-DYMKVALRPKFDENGIVRRDEIAKVVKG 1400
            GWNSTLES+V+GVP IAWPL+AEQKMNAVML + +KVALRPK +ENGIV R EIAKVVKG
Sbjct: 368  GWNSTLESVVHGVPFIAWPLYAEQKMNAVMLSEDIKVALRPKANENGIVGRLEIAKVVKG 427

Query: 1401 LMEGEAGKKLRNKMKDLKSAAGTVLTEDGSSMKSLCEVALKWKNQCSS 1544
            LMEGE GK +R++M+DLK AA  VL+EDGSS K+L E+A K K + S+
Sbjct: 428  LMEGEEGKVVRSRMRDLKDAAAKVLSEDGSSTKALAELATKLKKKVSN 475


>sp|Q9AR73.1|HQGT_RAUSE RecName: Full=Hydroquinone glucosyltransferase; AltName: Full=Arbutin
            synthase gi|13508844|emb|CAC35167.1| arbutin synthase
            [Rauvolfia serpentina]
          Length = 470

 Score =  619 bits (1597), Expect = e-175
 Identities = 298/470 (63%), Positives = 371/470 (78%), Gaps = 1/470 (0%)
 Frame = +3

Query: 138  IQQNPHIIVLPTPGMGHLIPLIEFAKRLVLRHDFSFTFTIPSDGSPSEAQKTVLGSLPDS 317
            ++  PHI ++PTPGMGHLIPL+EFAKRLVLRH+F  TF IP+DG   +AQK+ L +LP  
Sbjct: 1    MEHTPHIAMVPTPGMGHLIPLVEFAKRLVLRHNFGVTFIIPTDGPLPKAQKSFLDALPAG 60

Query: 318  IDYIFLPPVNFDDLSGDVKIEXXXXXXXXXXXXXXXEVLKEMVDNKRVVALVVDLFGTDA 497
            ++Y+ LPPV+FDDL  DV+IE               + +K ++   ++ ALVVDLFGTDA
Sbjct: 61   VNYVLLPPVSFDDLPADVRIETRICLTITRSLPFVRDAVKTLLATTKLAALVVDLFGTDA 120

Query: 498  FDVAKEFKLPSYIFYPSTANALSLFLELPKLDEMYSCEYRDLLEPVKLPGCVPLHGRDFL 677
            FDVA EFK+  YIFYP+TA  LSLF  LPKLD+M SCEYRD+ EP+++PGC+P+HG+DFL
Sbjct: 121  FDVAIEFKVSPYIFYPTTAMCLSLFFHLPKLDQMVSCEYRDVPEPLQIPGCIPIHGKDFL 180

Query: 678  DPLQDRKNEAYTWLLHHSNRYKLAEGILVNSFVDLEPDTFEALKEEKPDRPPIYPVGPLI 857
            DP QDRKN+AY  LLH + RY+LAEGI+VN+F DLEP   +AL+EE   +PP+YP+GPLI
Sbjct: 181  DPAQDRKNDAYKCLLHQAKRYRLAEGIMVNTFNDLEPGPLKALQEEDQGKPPVYPIGPLI 240

Query: 858  QSGSTNKGADGSDCLKWLDEQPHGSVLFISFGSGGTLSKEQLNELAIGLEMSEQRFLWVV 1037
            ++ S++K  D  +CLKWLD+QP GSVLFISFGSGG +S  Q  ELA+GLEMSEQRFLWVV
Sbjct: 241  RADSSSK-VDDCECLKWLDDQPRGSVLFISFGSGGAVSHNQFIELALGLEMSEQRFLWVV 299

Query: 1038 RSPSEKAANASYFSAQSAKDPFDFLPKGFVERTKGLGLVVPSWAPQVQVLSHGSTGGFLS 1217
            RSP++K ANA+YFS Q+  D   +LP+GF+ERTKG  L+VPSWAPQ ++LSHGSTGGFL+
Sbjct: 300  RSPNDKIANATYFSIQNQNDALAYLPEGFLERTKGRCLLVPSWAPQTEILSHGSTGGFLT 359

Query: 1218 HCGWNSTLESIVNGVPLIAWPLFAEQKMNAVML-DYMKVALRPKFDENGIVRRDEIAKVV 1394
            HCGWNS LES+VNGVPLIAWPL+AEQKMNAVML + +KVALRPK  ENG++ R EIA  V
Sbjct: 360  HCGWNSILESVVNGVPLIAWPLYAEQKMNAVMLTEGLKVALRPKAGENGLIGRVEIANAV 419

Query: 1395 KGLMEGEAGKKLRNKMKDLKSAAGTVLTEDGSSMKSLCEVALKWKNQCSS 1544
            KGLMEGE GKK R+ MKDLK AA   L++DGSS K+L E+A KW+N+ SS
Sbjct: 420  KGLMEGEEGKKFRSTMKDLKDAASRALSDDGSSTKALAELACKWENKISS 469

