BLASTX nr result
ID: Cephaelis21_contig00007855
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00007855 (1513 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002303861.1| predicted protein [Populus trichocarpa] gi|2... 545 e-152 ref|XP_002527371.1| UDP-glucosyltransferase, putative [Ricinus c... 525 e-146 ref|NP_199780.1| UDP-glycosyltransferase-like protein [Arabidops... 519 e-145 ref|XP_002865752.1| UDP-glucoronosyl/UDP-glucosyl transferase fa... 506 e-141 gb|AFJ53030.1| UDP-glycosyltransferase 1 [Linum usitatissimum] 484 e-134 >ref|XP_002303861.1| predicted protein [Populus trichocarpa] gi|222841293|gb|EEE78840.1| predicted protein [Populus trichocarpa] Length = 473 Score = 545 bits (1403), Expect = e-152 Identities = 269/472 (56%), Positives = 349/472 (73%), Gaps = 3/472 (0%) Frame = +2 Query: 32 MEKSNTLHVVMFPWLAMGHFIPFLQLSKHLARRGHKVSFVSTPRNIQRLPKIPQDIASLI 211 ME N L VVMFPWLA GH IPFLQLSK LA +GHK+ FVSTPRN+ RLPKIP+ ++S I Sbjct: 1 MEDGNRLQVVMFPWLATGHLIPFLQLSKLLAEKGHKIFFVSTPRNLNRLPKIPKQLSSEI 60 Query: 212 QLVSIPLPKTENLREQAESSMDVPREEQRFLKIAFDLLQSPIAAFLENATPKPDWIIFDY 391 LVS P P NL AESS DVP +Q+ LK FDLL+ P+ FLE++ KPDWI +DY Sbjct: 61 ILVSFPFPHVPNLPSCAESSTDVPYTKQQLLKKGFDLLEPPLTTFLESS--KPDWIFYDY 118 Query: 392 ASHWLPQIAARNGTSSAYFSLFNAATLAFIGPPSILLNEGDGRSTAESFTIVPKWIPFPS 571 ASHWLP +AAR G S A+FSLF AA L++IGPPS L+ GD RS AE FT+VPKWIPF S Sbjct: 119 ASHWLPSVAARLGISCAFFSLFTAACLSYIGPPSALMTIGDPRSKAEDFTVVPKWIPFES 178 Query: 572 SIVYRLHELTKYFEDSLGNESVTSDVIRFATSIKESDLVVIRTSVEFEPEWFKLV-CELY 748 +V+RLHE+TKY E + +E+ SD+IRF + SD+V+IR+S EFEPEWF L+ +LY Sbjct: 179 DLVFRLHEVTKYVEKTEEDETGPSDLIRFGFAAGGSDVVIIRSSPEFEPEWFNLLHDQLY 238 Query: 749 NKPVVSLGVLPPSLE--EEDESGTDEKWFKIKGWLDEQSLSKVVYVALGTEATLSENEIQ 922 KP++ +G LPP +E EED++ +W IK WLD+Q + VVYVA+GTEA+LS E++ Sbjct: 239 KKPIIPVGFLPPIVEHNEEDDNIDGHEWSNIKEWLDKQKVHSVVYVAIGTEASLSGEELK 298 Query: 923 DLALGLEQSELPFFWVLRKPPTSVKDVSEMLPEGFMERINASGRGMVYTEWVPQVKILSH 1102 +LALGLE S LPFFWVL K P S K+ +MLP+GF ER+ RG+++ W PQVKILSH Sbjct: 299 ELALGLENSTLPFFWVLNKIPGSTKNALDMLPDGFQERV--KNRGIIHGGWAPQVKILSH 356 Query: 1103 PAVGGFLTHCGWNSVIEALSFGRVLILFPVTNEQGLNARLLQGRKVGVEIPREAENGLFT 1282 +VGGF+THCGWNS+IE L+FGRVLIL P+ NEQGLN+RLL G+K+G+EIPR+ ++G FT Sbjct: 357 DSVGGFMTHCGWNSIIEGLTFGRVLILLPILNEQGLNSRLLHGKKLGLEIPRKEQDGSFT 416 Query: 1283 STAMADTLKSAMISEEGEPMRANAREMTSLFGNWNRNQGYIASFIRLLEAEK 1438 ++A+++++AM+ + G R ARE+ LFG+ +RN ++AS + L K Sbjct: 417 WASVAESMRTAMVDDSGVSWRNRAREIRYLFGDVDRNNCFVASLVNYLTENK 468 >ref|XP_002527371.1| UDP-glucosyltransferase, putative [Ricinus communis] gi|223533290|gb|EEF35043.1| UDP-glucosyltransferase, putative [Ricinus communis] Length = 470 Score = 525 bits (1351), Expect = e-146 Identities = 256/470 (54%), Positives = 343/470 (72%), Gaps = 1/470 (0%) Frame = +2 Query: 32 MEKSNTLHVVMFPWLAMGHFIPFLQLSKHLARRGHKVSFVSTPRNIQRLPKIPQDIASLI 211 M++++ LHV +FPWLAMGH IPFL+ S LA++GH VSF+STP N+ RLPKIP ++S I Sbjct: 1 MKRTSKLHVAVFPWLAMGHLIPFLRFSNLLAQKGHLVSFISTPGNLHRLPKIPPQLSSHI 60 Query: 212 QLVSIPLPKTENLREQAESSMDVPREEQRFLKIAFDLLQSPIAAFLENATPKPDWIIFDY 391 L+S+PLP L AE++ DVP +Q+ LK AFDLL+SP+A FLE T KPDW+I+DY Sbjct: 61 SLISLPLPSVPGLPSNAETTTDVPYTKQQLLKKAFDLLESPLATFLE--TKKPDWVIYDY 118 Query: 392 ASHWLPQIAARNGTSSAYFSLFNAATLAFIGPPSILLNEGDGRSTAESFTIVPKWIPFPS 571 ASHWLP IA++ G SSA+FSLF AATL+FIGPPS+ +N GD R TAE FTIVP+W+PF S Sbjct: 119 ASHWLPSIASKVGISSAFFSLFTAATLSFIGPPSLTMNGGDLRLTAEDFTIVPRWVPFES 178 Query: 572 SIVYRLHELTKYFEDSLGNESVTSDVIRFATSIKESDLVVIRTSVEFEPEWFKLVCELYN 751 +I Y +HE+TKY E + +E+ +D +RFA + +D+V+IR+S EFEPEWF L ++ Sbjct: 179 NIKYCIHEVTKYIEKTEEDETGPNDTVRFAFASGGADVVIIRSSPEFEPEWFDLYSKMSE 238 Query: 752 KPVVSLGVLPP-SLEEEDESGTDEKWFKIKGWLDEQSLSKVVYVALGTEATLSENEIQDL 928 KP++ LG LPP +EEED+ + W I WLD++ VVYVALGTEA L+ E+++L Sbjct: 239 KPIIPLGFLPPLEVEEEDDDIDVKGWADIIEWLDKKEAESVVYVALGTEAALTRQEVREL 298 Query: 929 ALGLEQSELPFFWVLRKPPTSVKDVSEMLPEGFMERINASGRGMVYTEWVPQVKILSHPA 1108 ALGLE+S PF WVL+ PP + ++ EML +G+ ER+ RGM+Y WVPQVKILSH + Sbjct: 299 ALGLEKSRSPFIWVLKNPPGTTQNALEMLQDGYEERV--KDRGMIYCGWVPQVKILSHES 356 Query: 1109 VGGFLTHCGWNSVIEALSFGRVLILFPVTNEQGLNARLLQGRKVGVEIPREAENGLFTST 1288 VGGFLTHCGWNSV+E LSFGRVLILFPV N+QGLNARLL G+K+G+E+PR +G FTS Sbjct: 357 VGGFLTHCGWNSVVEGLSFGRVLILFPVLNDQGLNARLLHGKKIGLEVPRNESDGAFTSD 416 Query: 1289 AMADTLKSAMISEEGEPMRANAREMTSLFGNWNRNQGYIASFIRLLEAEK 1438 ++A+ ++ A + + + A+EM +LFG+ +RN + LE + Sbjct: 417 SVAELVRKAKVDDPAD----LAKEMRNLFGDRDRNNRLAEGVVHYLEENR 462 >ref|NP_199780.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana] gi|75264223|sp|Q9LTA3.1|U91C1_ARATH RecName: Full=UDP-glycosyltransferase 91C1 gi|8978266|dbj|BAA98157.1| anthocyanidin-3-glucoside rhamnosyltransferase-like [Arabidopsis thaliana] gi|26449402|dbj|BAC41828.1| putative anthocyanidin-3-glucoside rhamnosyltransferase [Arabidopsis thaliana] gi|28951061|gb|AAO63454.1| At5g49690 [Arabidopsis thaliana] gi|332008462|gb|AED95845.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana] Length = 460 Score = 519 bits (1336), Expect = e-145 Identities = 257/467 (55%), Positives = 332/467 (71%) Frame = +2 Query: 35 EKSNTLHVVMFPWLAMGHFIPFLQLSKHLARRGHKVSFVSTPRNIQRLPKIPQDIASLIQ 214 ++ +HV MFPWLAMGH +PFL+LSK LA++GHK+SF+STPRNI+RLPK+ ++AS I Sbjct: 4 KREEVMHVAMFPWLAMGHLLPFLRLSKLLAQKGHKISFISTPRNIERLPKLQSNLASSIT 63 Query: 215 LVSIPLPKTENLREQAESSMDVPREEQRFLKIAFDLLQSPIAAFLENATPKPDWIIFDYA 394 VS PLP L +ESSMDVP +Q+ LK AFDLLQ P+ FL ++P DWII+DYA Sbjct: 64 FVSFPLPPISGLPPSSESSMDVPYNKQQSLKAAFDLLQPPLKEFLRRSSP--DWIIYDYA 121 Query: 395 SHWLPQIAARNGTSSAYFSLFNAATLAFIGPPSILLNEGDGRSTAESFTIVPKWIPFPSS 574 SHWLP IAA G S A+FSLFNAATL F+GP S L+ E RST E FT+VP W+PF S+ Sbjct: 122 SHWLPSIAAELGISKAFFSLFNAATLCFMGPSSSLIEEI--RSTPEDFTVVPPWVPFKSN 179 Query: 575 IVYRLHELTKYFEDSLGNESVTSDVIRFATSIKESDLVVIRTSVEFEPEWFKLVCELYNK 754 IV+R HE+T+Y E + + + SD +RF SI ESD V +R+ EFEPEWF L+ +LY K Sbjct: 180 IVFRYHEVTRYVEKTEEDVTGVSDSVRFGYSIDESDAVFVRSCPEFEPEWFGLLKDLYRK 239 Query: 755 PVVSLGVLPPSLEEEDESGTDEKWFKIKGWLDEQSLSKVVYVALGTEATLSENEIQDLAL 934 PV +G LPP +E++D D W +IK WLD+Q L+ VVYV+LGTEA+L E+ +LAL Sbjct: 240 PVFPIGFLPPVIEDDD--AVDTTWVRIKKWLDKQRLNSVVYVSLGTEASLRHEEVTELAL 297 Query: 935 GLEQSELPFFWVLRKPPTSVKDVSEMLPEGFMERINASGRGMVYTEWVPQVKILSHPAVG 1114 GLE+SE PFFWVLR P +P+GF R+ GRGMV+ WVPQVKILSH +VG Sbjct: 298 GLEKSETPFFWVLRNEPK--------IPDGFKTRVK--GRGMVHVGWVPQVKILSHESVG 347 Query: 1115 GFLTHCGWNSVIEALSFGRVLILFPVTNEQGLNARLLQGRKVGVEIPREAENGLFTSTAM 1294 GFLTHCGWNSV+E L FG+V I FPV NEQGLN RLL G+ +GVE+ R+ +G F S ++ Sbjct: 348 GFLTHCGWNSVVEGLGFGKVPIFFPVLNEQGLNTRLLHGKGLGVEVSRDERDGSFDSDSV 407 Query: 1295 ADTLKSAMISEEGEPMRANAREMTSLFGNWNRNQGYIASFIRLLEAE 1435 AD+++ MI + GE +RA A+ M LFGN + N Y+ +R + ++ Sbjct: 408 ADSIRLVMIDDAGEEIRAKAKVMKDLFGNMDENIRYVDELVRFMRSK 454 >ref|XP_002865752.1| UDP-glucoronosyl/UDP-glucosyl transferase family protein [Arabidopsis lyrata subsp. lyrata] gi|297311587|gb|EFH42011.1| UDP-glucoronosyl/UDP-glucosyl transferase family protein [Arabidopsis lyrata subsp. lyrata] Length = 515 Score = 506 bits (1303), Expect = e-141 Identities = 249/469 (53%), Positives = 330/469 (70%) Frame = +2 Query: 35 EKSNTLHVVMFPWLAMGHFIPFLQLSKHLARRGHKVSFVSTPRNIQRLPKIPQDIASLIQ 214 +K +H+ MFPWLAMGH +PFL+LSK LA++GHK+SF+STPRNI RLPK+P +++S I Sbjct: 4 KKEEVMHIAMFPWLAMGHLLPFLRLSKLLAQKGHKISFISTPRNILRLPKLPSNLSSSIT 63 Query: 215 LVSIPLPKTENLREQAESSMDVPREEQRFLKIAFDLLQSPIAAFLENATPKPDWIIFDYA 394 VS PLP L +ESSMDVP +Q+ LK AFDLLQ P+ FL ++P DWII+DYA Sbjct: 64 FVSFPLPSISGLPPSSESSMDVPYNKQQSLKAAFDLLQPPLTEFLRLSSP--DWIIYDYA 121 Query: 395 SHWLPQIAARNGTSSAYFSLFNAATLAFIGPPSILLNEGDGRSTAESFTIVPKWIPFPSS 574 SHWLP IA G S A+FSLFNAATL F+GP S L+ E RST E FT+VP W+PF S+ Sbjct: 122 SHWLPSIAKELGISKAFFSLFNAATLCFMGPSSSLIEES--RSTPEDFTVVPPWVPFKST 179 Query: 575 IVYRLHELTKYFEDSLGNESVTSDVIRFATSIKESDLVVIRTSVEFEPEWFKLVCELYNK 754 IV+R HE+++Y E + + + SD +RF +I SD V +R+ EFEPEWF L+ +LY K Sbjct: 180 IVFRYHEVSRYVEKTDEDVTGVSDSVRFGYTIDGSDAVFVRSCPEFEPEWFSLLQDLYRK 239 Query: 755 PVVSLGVLPPSLEEEDESGTDEKWFKIKGWLDEQSLSKVVYVALGTEATLSENEIQDLAL 934 PV +G LPP +E++D+ D W +IK WLD+Q ++ VVYV+LGTEA+L E+ +LAL Sbjct: 240 PVFPIGFLPPVIEDDDD---DTTWVRIKEWLDKQRVNSVVYVSLGTEASLRREELTELAL 296 Query: 935 GLEQSELPFFWVLRKPPTSVKDVSEMLPEGFMERINASGRGMVYTEWVPQVKILSHPAVG 1114 GLE+SE PFFWVLR P +P+GF ER+ GRGMV+ WVPQVKILSH +VG Sbjct: 297 GLEKSETPFFWVLRNEP--------QIPDGFEERVK--GRGMVHVGWVPQVKILSHESVG 346 Query: 1115 GFLTHCGWNSVIEALSFGRVLILFPVTNEQGLNARLLQGRKVGVEIPREAENGLFTSTAM 1294 GFLTHCGWNSV+E + FG+V I PV NEQGLN RLLQG+ +GVE+ R+ +G F S ++ Sbjct: 347 GFLTHCGWNSVVEGIGFGKVPIFLPVLNEQGLNTRLLQGKGLGVEVLRDERDGSFGSDSV 406 Query: 1295 ADTLKSAMISEEGEPMRANAREMTSLFGNWNRNQGYIASFIRLLEAEKT 1441 AD+++ MI + GE +R + M LFGN + N Y+ + + +++ Sbjct: 407 ADSVRLVMIDDAGEEIREKVKLMKGLFGNMDENIRYVDELVGFMRNDES 455 >gb|AFJ53030.1| UDP-glycosyltransferase 1 [Linum usitatissimum] Length = 472 Score = 484 bits (1246), Expect = e-134 Identities = 251/473 (53%), Positives = 325/473 (68%), Gaps = 7/473 (1%) Frame = +2 Query: 41 SNTLHVVMFPWLAMGHFIPFLQLSKHLARRGHKVSFVSTPRNIQRLPKIPQDIASLIQLV 220 S + +V+FPWLAMGH IPFL SK LA+ GH + FVSTP+N+ RLPK+P ++S I V Sbjct: 9 SGKMEIVVFPWLAMGHLIPFLHFSKLLAQNGHNIHFVSTPKNLSRLPKLPLRLSSQITFV 68 Query: 221 SIPLPKTENLREQAESSMDVPREEQRFLKIAFDLLQSPIAAFLENATPKPDWIIFDYASH 400 PLP NL AESSMDVP Q+ LK AFD L+ P+ FL KPDW+I+DYASH Sbjct: 69 PFPLPPVPNLPPDAESSMDVPYNNQQLLKKAFDSLRPPLTDFLRQL--KPDWVIYDYASH 126 Query: 401 WLPQIAAR---NGTSSAYFSLFNAATLAFIGPPSILLNEGDGRSTAESFTIVPKWIPFP- 568 WLP AA G A+FSLF A TL F+GPP GD R AE FT+VP WIP Sbjct: 127 WLPSAAADAGGGGIGCAFFSLFTATTLCFVGPPG-----GDSRRNAEDFTVVPDWIPIEI 181 Query: 569 -SSIVYRLHELTKYFEDSLGNESVTSDVIRFATSIKESDLVVIRTSVEFEPEWFKLVCEL 745 S+I YRLHE++KY E + + S SD IRFA +++ES+ +++R+S EFEPEWF+L+ ++ Sbjct: 182 KSNIAYRLHEVSKYVEKTDEDTSGPSDQIRFAVAMEESNALLVRSSREFEPEWFELLGQM 241 Query: 746 YN-KPVVSLGVLPPSLEEED-ESGTDEKWFKIKGWLDEQSLSKVVYVALGTEATLSENEI 919 Y K ++ +G LPP + D E D W +I+ WLD+Q ++ VVYVALGTEA L+ +EI Sbjct: 242 YKEKTIIPVGFLPPPIAANDKEDQNDAVWREIRDWLDKQRVNTVVYVALGTEAALTRDEI 301 Query: 920 QDLALGLEQSELPFFWVLRKPPTSVKDVSEMLPEGFMERINASGRGMVYTEWVPQVKILS 1099 +LA GLE+S LPFFW LR S + MLP GF ER+ GRG+VY EWVPQV+ILS Sbjct: 302 AELASGLEKSALPFFWALRDHSVSGR---MMLPGGFEERVK--GRGIVYREWVPQVRILS 356 Query: 1100 HPAVGGFLTHCGWNSVIEALSFGRVLILFPVTNEQGLNARLLQGRKVGVEIPREAENGLF 1279 H +VGGFLTHCG+NSV+E L+FGRVLILFPV N+QGLNARLL+G+K+G+EIPRE ++G F Sbjct: 357 HDSVGGFLTHCGYNSVVEGLAFGRVLILFPVINDQGLNARLLEGKKLGIEIPREEKDGSF 416 Query: 1280 TSTAMADTLKSAMISEEGEPMRANAREMTSLFGNWNRNQGYIASFIRLLEAEK 1438 TS A+A+T+K+A++ E GE R + LFG +N + + +R L K Sbjct: 417 TSDAVAETVKAAVVGESGEGWRRAVKGAKGLFGGREKNGEMVDALVRYLTENK 469