BLASTX nr result
ID: Coptis21_contig00003796
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis21_contig00003796 (1573 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|AAR06921.1| UDP-glycosyltransferase 89B2 [Stevia rebaudiana] 568 e-159 ref|XP_002514595.1| UDP-glucosyltransferase, putative [Ricinus c... 546 e-153 ref|XP_002318584.1| predicted protein [Populus trichocarpa] gi|2... 533 e-149 ref|XP_002887513.1| UDP-glucoronosyl/UDP-glucosyl transferase fa... 520 e-145 gb|AFJ53023.1| UDP-glycosyltransferase 1 [Linum usitatissimum] 516 e-144 >gb|AAR06921.1| UDP-glycosyltransferase 89B2 [Stevia rebaudiana] Length = 468 Score = 568 bits (1465), Expect = e-159 Identities = 275/473 (58%), Positives = 350/473 (73%), Gaps = 2/473 (0%) Frame = +2 Query: 29 MPTNATTTGAHILVFPFPAQGHMLPLLDFTHQLALRNLTITILVTPKNVPLLTPLLAKHP 208 MP + G+HILVFP+PAQGHML LLD THQLA+RNLTITILVTPKN+P ++PLLA HP Sbjct: 1 MPISDINAGSHILVFPYPAQGHMLTLLDLTHQLAIRNLTITILVTPKNLPTISPLLAAHP 60 Query: 209 S-IHTLVLPFPKDPYIPSGVENVKDLPASYFPAMICTMGKLYKPTLEWFQSHPSPPTVIL 385 + + L+LP P P IPSG+ENVKDLP F AM+ +G LY P +WF++ P+PP I+ Sbjct: 61 TTVSALLLPLPPHPAIPSGIENVKDLPNDAFKAMMVALGDLYNPLRDWFRNQPNPPVAII 120 Query: 386 SDFFLGWTHHLACQLGIRRIVFSPSGVGAFTLCFLTIWRTCPKRVNPNDEYEIISFPEVP 565 SDFFLGWTHHLA +LGIRR FSPSG A ++ F ++WR PKR++ +E E I FP++P Sbjct: 121 SDFFLGWTHHLAVELGIRRYTFSPSGALALSVIF-SLWRYQPKRIDVENEKEAIKFPKIP 179 Query: 566 NSPSYPWWQLGPLYRTYKKGDPVSEFIKKCSLANVDSWGIVFNSFSELESGYLDYLKQSC 745 NSP YPWWQL P+YR+Y +GDP SEFIK LA++ SWGIV NSF+ELE Y+D+LK Sbjct: 180 NSPEYPWWQLSPIYRSYVEGDPDSEFIKDGFLADIASWGIVINSFTELEQVYVDHLKHEL 239 Query: 746 
GHDRIWAIGPLIGHVDKNKSNGPESILVPGNEILSWLDTCGDHSVVYVCFGSQAVLTNKQ 925 GHD+++A+GPL+ DK G S N++LSWLDTC D +VVYVCFGSQ VLTN Q Sbjct: 240 GHDQVFAVGPLLPPGDKTSGRGGSS----SNDVLSWLDTCADRTVVYVCFGSQMVLTNGQ 295 Query: 926 MEELALGLEQSGVKFIWPVKEPTIGHVSSEYGMIPDGFEDRAAGRGLVYKGWAPQALILK 1105 ME +ALGLE+S VKF+W VKEPT+GH ++ YG +P GFEDR +GRGLV +GW PQ IL Sbjct: 296 MEVVALGLEKSRVKFVWSVKEPTVGHEAANYGRVPPGFEDRVSGRGLVIRGWVPQVAILS 355 Query: 1106 HRAVGAFMTHCGWNSVLESLIAGVSMLTWPMNADQFMNAKLLVDQVGVAVRVCEGEKTIP 1285 H +VG F+THCGWNSV+E++ A V MLTWPM+ADQF NA LL ++ V ++VCEG +P Sbjct: 356 HDSVGVFLTHCGWNSVMEAVAAEVLMLTWPMSADQFSNATLL-HELKVGIKVCEGSNIVP 414 Query: 1286 NATELAKFLAKSVCDNGFENERVRAKELSKAALGAV-KGGSSFKDLDSLVEDI 1441 N+ ELA+ +KS+ D ER R KE +K+A AV GSS +L+ LV+++ Sbjct: 415 NSDELAELFSKSLSDE-TRLERKRVKEFAKSAKEAVGPKGSSVGELERLVDNL 466 >ref|XP_002514595.1| UDP-glucosyltransferase, putative [Ricinus communis] gi|223546199|gb|EEF47701.1| UDP-glucosyltransferase, putative [Ricinus communis] Length = 472 Score = 546 bits (1406), Expect = e-153 Identities = 274/472 (58%), Positives = 341/472 (72%), Gaps = 6/472 (1%) Frame = +2 Query: 59 HILVFPFPAQGHMLPLLDFTHQLALRNLTITILVTPKNVPLLTPLLAKHPSIHTLVLPFP 238 HILVFPFPAQGHM+PLLD T +LA+ LTITILVTPKN+ L PLL+ HPSI TLV PFP Sbjct: 11 HILVFPFPAQGHMIPLLDLTRKLAVHGLTITILVTPKNLSFLHPLLSTHPSIETLVFPFP 70 Query: 239 KDPYIPSGVENVKDLPASYFPAMICTMGKLYKPTLEWFQSHPSPPTVILSDFFLGWTHHL 418 P IPSGVEN KDLPA P +I +G LY P L WF SHPSPP I+SD FLGWT +L Sbjct: 71 AHPLIPSGVENNKDLPAECTPVLIRALGGLYDPLLHWFISHPSPPVAIISDMFLGWTQNL 130 Query: 419 ACQLGIRRIVFSPSGVGAFTLCFLTIWRTCPKRVNPNDEYEIISFPEVPNSPSYPWWQLG 598 A QL IRRIVFSPSG A ++ + ++WR P+R ++ E++SF +PN P+YPW Q+ Sbjct: 131 ASQLNIRRIVFSPSGAMALSIIY-SLWRDMPRR----NQNEVVSFSRIPNCPNYPWRQIS 185 Query: 599 PLYRTYKKGDPVSEFIKKCSLANVDSWGIVFNSFSELESGYLDYLKQSCGHDRIWAIGPL 778 P+YR+Y + D EFIK AN+ SWG+V NSF+ELE YLDY K+ G D +WA+GPL Sbjct: 186 PIYRSYIENDTNWEFIKDSFRANLVSWGLVVNSFTELEEIYLDYFKKELGSDHVWAVGPL 245 Query: 779 I-GHVD----KNKSNGPESILVPGNEILSWLDTCGDHSVVYVCFGSQAVLTNKQMEELAL 943 + H 
D +++ GP S VP +++++WLDTC DH VVYVCFGSQ LT Q+EELAL Sbjct: 246 LPPHHDSISRQSERGGPSS--VPVHDVMAWLDTCEDHRVVYVCFGSQTWLTKDQIEELAL 303 Query: 944 GLEQSGVKFIWPVKEPTIGHVSSEYGMIPDGFEDRAAGRGLVYKGWAPQALILKHRAVGA 1123 LE S V FIW VKE H++ +Y +IP GFEDR AGRGLV +GW PQ LIL H AVGA Sbjct: 304 SLEMSKVNFIWCVKE----HINGKYSVIPSGFEDRVAGRGLVIRGWVPQVLILSHPAVGA 359 Query: 1124 FMTHCGWNSVLESLIAGVSMLTWPMNADQFMNAKLLVDQVGVAVRVCEGEKTIPNATELA 1303 F+THCGWNSVLE L+A V ML WPM ADQF+NA+LLVD++ VAVRVCEG KT+PN+ ELA Sbjct: 360 FLTHCGWNSVLEGLVAAVPMLAWPMGADQFVNARLLVDELQVAVRVCEGAKTVPNSDELA 419 Query: 1304 KFLAKSVCDNGFENERVRAKELSKAALGAVKG-GSSFKDLDSLVEDIFALNM 1456 + + +SV +N E E +AK+L + A+ +K G S KD D LV+++F L + Sbjct: 420 RVIMESVSENRVERE--QAKKLRRVAMDTIKDRGRSMKDFDGLVKNLFRLKV 469 >ref|XP_002318584.1| predicted protein [Populus trichocarpa] gi|222859257|gb|EEE96804.1| predicted protein [Populus trichocarpa] Length = 472 Score = 533 bits (1374), Expect = e-149 Identities = 260/476 (54%), Positives = 341/476 (71%), Gaps = 2/476 (0%) Frame = +2 Query: 47 TTGAHILVFPFPAQGHMLPLLDFTHQLALRNLTITILVTPKNVPLLTPLLAKHPSIHTLV 226 + GAH+L+FPFPAQGH++PLLD H L +R LTITILVTPKN+P+L PLL+K+ +I+TLV Sbjct: 2 SAGAHVLLFPFPAQGHLIPLLDLAHHLVIRGLTITILVTPKNLPILNPLLSKNSTINTLV 61 Query: 227 LPFPKDPYIPSGVENVKDLPASYFP-AMICTMGKLYKPTLEWFQSHPSPPTVILSDFFLG 403 LPFP P IP G+EN+KDLP + P +MI +G+LY+P L WF+SHPSPP I+SD FLG Sbjct: 62 LPFPNYPSIPLGIENLKDLPPNIRPTSMIHALGELYQPLLSWFRSHPSPPVAIISDMFLG 121 Query: 404 WTHHLACQLGIRRIVFSPSGVGAFTLCFLTIWRTCPKRVNPNDEYEIISFPEVPNSPSYP 583 WTH LACQLG+RR VFSPSG A + ++W+ P P D+ E+ SF ++P+ P YP Sbjct: 122 WTHRLACQLGVRRFVFSPSGAMALATMY-SLWQEMPNA--PKDQNELFSFSKIPSCPKYP 178 Query: 584 WWQLGPLYRTYKKGDPVSEFIKKCSLANVDSWGIVFNSFSELESGYLDYLKQSCGHDRIW 763 W Q+ +YR+Y +GDPVSEF K+ AN+ SWG++ NS + LE Y ++L++ GHDR+W Sbjct: 179 WLQISTIYRSYVEGDPVSEFTKEGMEANIASWGLIVNSLTLLEGIYFEHLRKQLGHDRVW 238 Query: 764 AIGPLIGHVDKNKSNGPESILVPGNEILSWLDTCGDHSVVYVCFGSQAVLTNKQMEELAL 943 A+GP++ +K P V +++ +WLDTC DH VVYVC+G+Q VLT QME +A Sbjct: 239 
AVGPILP--EKTIDMTPPERGVSMHDLKTWLDTCEDHKVVYVCYGTQVVLTKYQMEAVAS 296 Query: 944 GLEQSGVKFIWPVKEPTIGHVSSEYGMIPDGFEDRAAGRGLVYKGWAPQALILKHRAVGA 1123 GLE+SGV FIW VK+P+ HV Y MIP GFEDR AGRGL+ +GWAPQ IL HRAVGA Sbjct: 297 GLEKSGVHFIWCVKQPSKEHVGEGYSMIPSGFEDRVAGRGLIIRGWAPQVWILSHRAVGA 356 Query: 1124 FMTHCGWNSVLESLIAGVSMLTWPMNADQFMNAKLLVDQVGVAVRVCEGEKTIPNATELA 1303 F+THCGWNS+LE ++AGV ML PM ADQF+ A LLV+ + VA RVC+G + N+ +LA Sbjct: 357 FLTHCGWNSILEGIVAGVPMLACPMAADQFVGATLLVEDLKVAKRVCDGANLVSNSAKLA 416 Query: 1304 KFLAKSVCDNGFENERVRAKELSKAALGAVK-GGSSFKDLDSLVEDIFALNMMNHK 1468 + L +SV D + E+ RAKEL AAL A+K GSS K L++ V+ + L M K Sbjct: 417 RTLMESVSDES-QVEKERAKELRMAALDAIKEDGSSDKHLNAFVKHVVGLGMETDK 471 >ref|XP_002887513.1| UDP-glucoronosyl/UDP-glucosyl transferase family protein [Arabidopsis lyrata subsp. lyrata] gi|297333354|gb|EFH63772.1| UDP-glucoronosyl/UDP-glucosyl transferase family protein [Arabidopsis lyrata subsp. lyrata] Length = 473 Score = 520 bits (1340), Expect = e-145 Identities = 266/480 (55%), Positives = 337/480 (70%), Gaps = 5/480 (1%) Frame = +2 Query: 38 NATTTGAHILVFPFPAQGHMLPLLDFTHQLALRN---LTITILVTPKNVPLLTPLLAKHP 208 N T H+L+FPFPAQGHM+PLLDFTH+LALR LTIT+LVTPKN+P L+PLL+ Sbjct: 7 NNKPTKTHVLIFPFPAQGHMIPLLDFTHRLALRGGAALTITVLVTPKNLPFLSPLLSAVS 66 Query: 209 SIHTLVLPFPKDPYIPSGVENVKDLPASYFPAMICTMGKLYKPTLEWFQSHPSPPTVILS 388 +I TL+LPFP P IPSGVENV+DLP S FP MI +G L+ P L W SHPSPP I+S Sbjct: 67 NIETLILPFPSHPSIPSGVENVQDLPPSGFPLMIHALGNLHAPLLSWITSHPSPPVAIVS 126 Query: 389 DFFLGWTHHLACQLGIRRIVFSPSGVGAFTLCFL-TIWRTCPKRVNPNDEYEIISFPEVP 565 DFFLGWT++L GI R FSPS A T C L T+W P ++N +D+ EI+ FP++P Sbjct: 127 DFFLGWTNNL----GIPRFDFSPSA--AITCCILNTLWIEMPTKINEDDDNEILQFPKIP 180 Query: 566 NSPSYPWWQLGPLYRTYKKGDPVSEFIKKCSLANVDSWGIVFNSFSELESGYLDYLKQSC 745 N P YP+ Q+ LYR+Y GDP EFI+ N SWG+V NSF+ +E YL++LK+ Sbjct: 181 NCPKYPFNQISSLYRSYVHGDPAWEFIRDSFRDNAASWGLVVNSFTAMEGVYLEHLKREM 240 Query: 746 GHDRIWAIGPLIGHVDKNKSNGPESILVPGNEILSWLDTCGDHSVVYVCFGSQAVLTNKQ 925 GHD +WA+GP++ D N+ GP S+ V + ++SWLD D VVYVCFGSQ VLT +Q 
Sbjct: 241 GHDCVWAVGPILPLSDGNRG-GPTSVSV--DHVMSWLDAREDDHVVYVCFGSQTVLTKEQ 297 Query: 926 MEELALGLEQSGVKFIWPVKEPTIGHVSSEYGMIPDGFEDRAAGRGLVYKGWAPQALILK 1105 LA GLE+SGV FIW VKEP G S G I DGF+DR AGRGLV +GWAPQ +L+ Sbjct: 298 TLALASGLEKSGVHFIWAVKEPVEGE--SPRGNILDGFDDRVAGRGLVIRGWAPQVAVLR 355 Query: 1106 HRAVGAFMTHCGWNSVLESLIAGVSMLTWPMNADQFMNAKLLVDQVGVAVRVCEGEKTIP 1285 HRAVGAF+THCGWNSV+E+++AGV MLTWPM ADQ+ +A L+VD++ V VR CEG T+P Sbjct: 356 HRAVGAFLTHCGWNSVIEAVVAGVLMLTWPMRADQYTDASLVVDELKVGVRACEGPDTVP 415 Query: 1286 NATELAKFLAKSVCDNGFENERVRAKELSKAALGAV-KGGSSFKDLDSLVEDIFALNMMN 1462 + ELA+ A SV G + ER++A EL KAAL A+ + GSS KDLD ++ + L + N Sbjct: 416 DPDELARVFADSV--TGKQTERIKAVELRKAALDAIQERGSSVKDLDGFIQHVVNLRLNN 473 >gb|AFJ53023.1| UDP-glycosyltransferase 1 [Linum usitatissimum] Length = 475 Score = 516 bits (1328), Expect = e-144 Identities = 267/479 (55%), Positives = 342/479 (71%), Gaps = 6/479 (1%) Frame = +2 Query: 35 TNATTTGAHILVFPFPAQGHMLPLLDFTHQLALRN-LTITILVTPKNVPLLTPLLAKHPS 211 T A T HIL+FP+PAQGH++P+LDF H LALR L ITILVTPKN+PLL PLL++HPS Sbjct: 2 TVAAITLPHILIFPYPAQGHLIPILDFAHYLALRRQLHITILVTPKNLPLLQPLLSRHPS 61 Query: 212 IHTLVLPFPKDPYIPSGVENVKDLPASYFPA----MICTMGKLYKPTLEWFQSHPSPPTV 379 I L LPFP P+IP GVEN KDLP S + + + L P L WFQ+ PSPP+V Sbjct: 62 IQPLTLPFPDTPHIPPGVENTKDLPPSLTKSSHVSFMYALAGLRSPLLNWFQTTPSPPSV 121 Query: 380 ILSDFFLGWTHHLACQLGIRRIVFSPSGVGAFTLCFLTIWRTCPKRVNPNDEYEIISFPE 559 I+SD FLGWTHHLA LGI RIVFSPS A ++ + +WR P+ P E I+FP+ Sbjct: 122 IISDMFLGWTHHLATDLGIPRIVFSPSAAFALSVIY-HLWRNMPQL--PESPDESITFPD 178 Query: 560 VPNSPSYPWWQLGPLYRTYKKGDPVSEFIKKCSLANVDSWGIVFNSFSELESGYLDYLKQ 739 +PNSPS+ QL P+YR+Y GDP+SEF+K LA++DSWGI FNSF+ LES YLDYLK Sbjct: 179 LPNSPSWIKSQLSPIYRSYVPGDPLSEFVKDGFLADIDSWGIAFNSFAGLESKYLDYLKI 238 Query: 740 SCGHDRIWAIGPLIGHVDKNKSNGPESILVPGNEILSWLDTCGDHSVVYVCFGSQAVLTN 919 GHDR+WA+GPL+ ++ ++ + V ++ +WLDTC + VVYVCFGS+AVLT Sbjct: 239 ELGHDRVWAVGPLLSPPSESVASRGGTSSVSVADLEAWLDTCQEGKVVYVCFGSEAVLTV 298 Query: 920 KQMEELALGLEQSGVKFIWPVKEPTIGHVSSEYGMIPDGFEDRAAGRGLVYKGWAPQALI 
1099 Q ELA GLE+SGV+F+W VK+ V E IP+GFEDR AGRG+V +GWAPQ +I Sbjct: 299 DQSNELASGLEKSGVQFVWRVKD-----VEGERPSIPEGFEDRVAGRGVVIRGWAPQVMI 353 Query: 1100 LKHRAVGAFMTHCGWNSVLESLIAGVSMLTWPMNADQFMNAKLLVDQVGVAVRVCEGEKT 1279 L HRAVGAF+THCGWNSVLE ++AGV+ML WPM ADQF +A LLV+++ +AVRVCEG++ Sbjct: 354 LSHRAVGAFLTHCGWNSVLEGIVAGVAMLAWPMGADQFTDATLLVEELKMAVRVCEGKEA 413 Query: 1280 IPNATELAKFLAKSVCDNGFENERVRAKELSKAALGAV-KGGSSFKDLDSLVEDIFALN 1453 +P++ +A L + + ++ ER AKELS AA AV +GGSS KD++SLVE + LN Sbjct: 414 VPDSEVVASQLRELMEED--REERKVAKELSLAAKEAVGEGGSSVKDMESLVEQLVQLN 470
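
The per-hit statistics in the report above (Length, Score, Expect, Identities, Positives, Gaps, Frame) follow a fixed textual pattern, so they can be pulled out of the flattened text with a regular expression. The following is a minimal sketch, not the method used to produce this record: the names `REPORT`, `HIT_RE`, `parse_evalue`, and `parse_hits` are hypothetical, and `REPORT` holds only two hit headers copied from the report. It also assumes that an E-value printed without a mantissa, such as `e-159`, means `1e-159`.

```python
import re

# Two hit headers copied from the report above, whitespace-flattened as shown.
REPORT = (
    ">gb|AAR06921.1| UDP-glycosyltransferase 89B2 [Stevia rebaudiana] "
    "Length = 468 Score = 568 bits (1465), Expect = e-159 "
    "Identities = 275/473 (58%), Positives = 350/473 (73%), "
    "Gaps = 2/473 (0%) Frame = +2 "
    ">gb|AFJ53023.1| UDP-glycosyltransferase 1 [Linum usitatissimum] "
    "Length = 475 Score = 516 bits (1328), Expect = e-144 "
    "Identities = 267/479 (55%), Positives = 342/479 (71%), "
    "Gaps = 6/479 (1%) Frame = +2"
)

# Hypothetical pattern for one hit header; the Gaps field is treated as optional.
HIT_RE = re.compile(
    r">(?P<defline>\S+.*?)\s+Length = (?P<length>\d+)\s+"
    r"Score = (?P<bits>[\d.]+) bits \((?P<raw>\d+)\),\s+"
    r"Expect = (?P<evalue>\S+)\s+"
    r"Identities = (?P<ident>\d+)/(?P<aln>\d+) \(\d+%\),\s+"
    r"Positives = (?P<pos>\d+)/\d+ \(\d+%\)"
    r"(?:,\s+Gaps = (?P<gaps>\d+)/\d+ \(\d+%\))?\s+"
    r"Frame = (?P<frame>[+-]\d)",
    re.S,
)


def parse_evalue(text):
    """Convert an E-value string to float; 'e-159' is read as '1e-159' (assumption)."""
    return float("1" + text if text.startswith("e") else text)


def parse_hits(report):
    """Return one dict per hit header found in a flattened report string."""
    hits = []
    for m in HIT_RE.finditer(report):
        hits.append({
            "accession": m.group("defline").split()[0],
            "defline": m.group("defline"),
            "length": int(m.group("length")),
            "bits": float(m.group("bits")),
            "evalue": parse_evalue(m.group("evalue")),
            "identities": int(m.group("ident")),
            "positives": int(m.group("pos")),
            "aligned": int(m.group("aln")),
            "gaps": int(m.group("gaps") or 0),
            "frame": m.group("frame"),
        })
    return hits


hits = parse_hits(REPORT)
for hit in hits:
    print(hit["accession"], hit["bits"], hit["evalue"], hit["identities"], hit["aligned"])
```

For anything beyond a quick inspection, a maintained BLAST parser run on a structured output format (XML or tabular) is likely a safer choice than a hand-written pattern like this one.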