BLASTX nr result
ID: Cornus23_contig00021545
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00021545
         (992 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits)   Value

ref|XP_002305226.2| hypothetical protein POPTR_0004s08200g [Popu...   360   1e-96
ref|XP_011043909.1| PREDICTED: crocetin glucosyltransferase, chl...   357   8e-96
ref|XP_002263532.1| PREDICTED: crocetin glucosyltransferase, chl...   357   1e-95
ref|XP_011021104.1| PREDICTED: crocetin glucosyltransferase, chl...   355   4e-95
gb|ABQ02257.1| O-glucosyltransferase 2 [Vitis labrusca]               355   4e-95
emb|CDP20003.1| unnamed protein product [Coffea canephora]            353   9e-95
emb|CDP21504.1| unnamed protein product [Coffea canephora]            353   9e-95
gb|AKA44583.1| UGTPg37 [Panax ginseng]                                353   1e-94
emb|CAN73416.1| hypothetical protein VITISV_017052 [Vitis vinifera]   353   1e-94
gb|ABQ02256.1| O-glucosyltransferase 1 [Vitis labrusca] gi|28149...   353   2e-94
ref|XP_006373420.1| hypothetical protein POPTR_0017s13620g [Popu...   352   3e-94
emb|CBI39412.3| unnamed protein product [Vitis vinifera]              351   6e-94
ref|XP_006365463.1| PREDICTED: anthocyanidin 3-O-glucoside 5-O-g...   350   1e-93
ref|XP_002263497.1| PREDICTED: crocetin glucosyltransferase, chl...   348   3e-93
ref|XP_004239848.1| PREDICTED: crocetin glucosyltransferase, chl...   346   1e-92
ref|XP_009778639.1| PREDICTED: crocetin glucosyltransferase, chl...   344   5e-92
ref|XP_009604418.1| PREDICTED: crocetin glucosyltransferase, chl...   344   5e-92
ref|XP_007041388.1| UDP-Glycosyltransferase superfamily protein ... 
343 1e-91 emb|CDP20005.1| unnamed protein product [Coffea canephora] 340 8e-91 emb|CDP21497.1| unnamed protein product [Coffea canephora] 340 1e-90 >ref|XP_002305226.2| hypothetical protein POPTR_0004s08200g [Populus trichocarpa] gi|550340586|gb|EEE85737.2| hypothetical protein POPTR_0004s08200g [Populus trichocarpa] Length = 496 Score = 360 bits (924), Expect = 1e-96 Identities = 186/346 (53%), Positives = 253/346 (73%), Gaps = 16/346 (4%) Frame = +2 Query: 2 LLQLAQRLTRAGA-RVTFATTIHGLRQIRTLPTYPGLSYASFSNGNDDGTK--SSDDGTM 172 +LQLA+ L +AGA RVTFATT+HGL QI+T P+ GL +ASFS+G DDG K ++ + Sbjct: 20 MLQLAKNLRQAGAARVTFATTVHGLTQIKTFPSLDGLYFASFSDGFDDGIKHTTNSQDML 79 Query: 173 AKLKRVGSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFA 352 ++LKR GS+TLT L+ T S + PV+FLIYTL+LPWAA+VAR M +PSAFL I SATS A Sbjct: 80 SELKRAGSQTLTKLIMTFSKNRHPVSFLIYTLILPWAADVARYMSIPSAFLYIQSATSLA 139 Query: 353 IYQQFFNRHDGLYHHDNNKIN-PSMSLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQE 529 + FFNRH G+Y N+ N P S+++P LP F +++PSFLLP P+SS + P FQ+ Sbjct: 140 LCHHFFNRHGGVYDLYNSSENKPPSSIQVPGLPPFETEDIPSFLLPNGPHSS-LNPVFQQ 198 Query: 530 HIQTLEKDPNPYILFNTFHPLEEQSIKAIDNMNVIPIGPLIP---------SNKSYDCDL 682 HIQ LE++P+P++L N+F LEE+ I AI N++ IPIGPLIP S+ S CDL Sbjct: 199 HIQVLEQEPSPWVLLNSFDCLEEEVIAAIGNISPIPIGPLIPFALLDKNHQSDTSCGCDL 258 Query: 683 FESSGDYLQWLDTKPERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVRSPESE- 859 FE S +Y+QWL++KP+ SV+Y+SFG+ VL K +EEML GLIG+ RPFLW++RS +++ Sbjct: 259 FEKSTEYIQWLNSKPKTSVIYISFGSVAVLQKNQMEEMLLGLIGTCRPFLWIIRSSDNKD 318 Query: 860 -EVKGLIDCDLG-EEGLIVPWCSQVEVLSHRSVGSFATHCGWNSTV 991 E + ++ + E+GLIVPWCSQ+EVL+H S+G + HCGWNST+ Sbjct: 319 TEFEEMVREKVNKEKGLIVPWCSQMEVLAHESIGCYMMHCGWNSTM 364 >ref|XP_011043909.1| PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Populus euphratica] Length = 498 Score = 357 bits (916), Expect = 8e-96 Identities = 187/346 (54%), Positives = 250/346 (72%), Gaps = 16/346 (4%) Frame = +2 Query: 2 LLQLAQRLTRAGA-RVTFATTIHGLRQIRTLPTYPGLSYASFSNGNDDGTK--SSDDGTM 172 +LQLA+ L +AGA RVTFATT+HGL QI+T 
P+ GL YASFS+G DDG K ++ + Sbjct: 20 MLQLAKNLRQAGAARVTFATTVHGLTQIKTFPSLDGLYYASFSDGFDDGIKHATNSQDML 79 Query: 173 AKLKRVGSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFA 352 ++LKR GS+TLT L+ T S ++ PV+FLIYTL+LPWAA+VAR M +PSA L I SATS A Sbjct: 80 SELKRAGSQTLTELIMTFSKNSHPVSFLIYTLILPWAADVARYMSIPSALLYIQSATSLA 139 Query: 353 IYQQFFNRHDGLYHHDNNKIN-PSMSLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQE 529 + FFNRH G+Y N+ N P S+++P LP +++PSFLLP P+SS + P FQ Sbjct: 140 LCHHFFNRHGGVYDLYNSSENKPPSSIQVPGLPPLETEDIPSFLLPNGPHSS-LNPVFQH 198 Query: 530 HIQTLEKDPNPYILFNTFHPLEEQSIKAIDNMNVIPIGPLIP---------SNKSYDCDL 682 HIQ LE++P+P++L NTF LEE+ I AI N++ IPIGPLIP S+ S CDL Sbjct: 199 HIQVLEQEPSPWVLLNTFACLEEEVIAAIGNISPIPIGPLIPFSLLDKNHQSDTSCGCDL 258 Query: 683 FESSGDYLQWLDTKPERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVRSPESE- 859 FE S +Y+QWL++KP+RSV+Y+SFG+ VL K +EE+L GLIG+ RPFLW++RS +++ Sbjct: 259 FEKSTEYIQWLNSKPKRSVIYISFGSIAVLQKDQMEEILLGLIGTCRPFLWIIRSSDNKD 318 Query: 860 -EVKGLIDCDLGEE-GLIVPWCSQVEVLSHRSVGSFATHCGWNSTV 991 E ++ + EE GLIVPWCSQ+EVL+H S+G HCGWNST+ Sbjct: 319 TEFDEMVREKVNEEKGLIVPWCSQMEVLAHESIGCCMMHCGWNSTM 364 >ref|XP_002263532.1| PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Vitis vinifera] Length = 447 Score = 357 bits (915), Expect = 1e-95 Identities = 187/332 (56%), Positives = 239/332 (71%), Gaps = 3/332 (0%) Frame = +2 Query: 5 LQLAQRLTRAGARVTFATTIHGLRQIRTLPTYPGLSYASFSNGNDDGTKSSDDGTMAKLK 184 L LA L R G RVTFAT + GLR+I TLPT PGL +ASFS+G DDG S+ +M ++K Sbjct: 21 LHLAMLLLRLGVRVTFATFVSGLRRIATLPTIPGLHFASFSDGYDDGNNSNY--SMEEMK 78 Query: 185 RVGSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFAIYQQ 364 RVGS++L+NLL +LS++ PVT+LIY LLPWAA VAR +PSAFL SAT A+Y + Sbjct: 79 RVGSQSLSNLLLSLSNERGPVTYLIYGFLLPWAATVAREHGIPSAFLSTQSATVIAVYHR 138 Query: 365 FFNRHDGLYHHDNNKINPSM--SLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQEHIQ 538 + HDGL+ N ++ S+ SLELP LP Y++LPS LLPT+P++S ++PSFQEH+Q Sbjct: 139 YLKAHDGLF---NTELGSSLNISLELPGLPPLKYEDLPSILLPTSPHAS-VVPSFQEHVQ 194 Query: 539 
TLEKDPNPYILFNTFHPLEEQSIKAI-DNMNVIPIGPLIPSNKSYDCDLFESSGDYLQWL 715 LE+DPN +L NTF+ LEE IKA+ D MNV+ IGPL+ + S CDLFE S DYL WL Sbjct: 195 NLEQDPNTCLLINTFNALEEDVIKALGDFMNVVAIGPLMQLDSSISCDLFERSKDYLPWL 254 Query: 716 DTKPERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVRSPESEEVKGLIDCDLGE 895 ++KPE SV+YVSFG+ L K +EE+ GL+ S RPFLWV+RS ESE + + E Sbjct: 255 NSKPEGSVIYVSFGSLATLQKNQMEEIFHGLMESHRPFLWVIRSIESELEEKMNSSLSEE 314 Query: 896 EGLIVPWCSQVEVLSHRSVGSFATHCGWNSTV 991 +GLIV WCSQVEVL H++VG F THCGWNST+ Sbjct: 315 QGLIVQWCSQVEVLCHQAVGCFLTHCGWNSTM 346 >ref|XP_011021104.1| PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Populus euphratica] Length = 461 Score = 355 bits (910), Expect = 4e-95 Identities = 189/346 (54%), Positives = 240/346 (69%), Gaps = 16/346 (4%) Frame = +2 Query: 2 LLQLAQRLTRAGA-RVTFATTIHGLRQIRTLPTYPGLSYASFSNGNDDGTKSSDDG--TM 172 + QL + L AGA RVTFATT HGL Q+ P+ L YASFS+G DDG K ++D Sbjct: 20 MFQLGKCLIHAGAGRVTFATTAHGLTQVEAFPSLENLHYASFSDGFDDGIKPTNDPHRIR 79 Query: 173 AKLKRVGSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFA 352 A+LKRVGS+TLT LL +LS + PV++LIYTLLLPWAA+VAR M +PSAFLCI S T+FA Sbjct: 80 AELKRVGSQTLTELLLSLSKEGNPVSYLIYTLLLPWAADVARDMSIPSAFLCILSTTAFA 139 Query: 353 IYQQFFNRHDGLYHHDNNKINPSMSLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQEH 532 + FF DG+Y D+N P S+E+P LPLFT ++PSFLLP +P++S +IP FQ H Sbjct: 140 LCYCFFKERDGVY--DSNDNGPPSSIEMPGLPLFTSKDMPSFLLPNDPHASILIPLFQHH 197 Query: 533 IQTLEKDPNPYILFNTFHPLEEQSIKAIDNMNVIPIGPLI---------PSNKSYDCDLF 685 IQ LEKD NP +L NT LEE++I+ I N+N IPIGPL+ ++ S DLF Sbjct: 198 IQALEKDSNPCVLLNTSDCLEEEAIRRISNLNPIPIGPLVSYAFLDENNSTDSSCGIDLF 257 Query: 686 ESSGDYLQWLDTKPERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVR----SPE 853 E S +Y QWL++KP+ SVVYVSFG+ VL K +E++L GL G+ RPFLWVVR S + Sbjct: 258 EKSTEYSQWLNSKPKGSVVYVSFGSLAVLQKNQMEKILLGLTGTCRPFLWVVRPSAGSDD 317 Query: 854 SEEVKGLIDCDLGEEGLIVPWCSQVEVLSHRSVGSFATHCGWNSTV 991 E + + D EEGLIVPWCSQ+EVL+H S+G F HCGWNST+ Sbjct: 318 REFEEKIRDKVNEEEGLIVPWCSQMEVLAHESIGCFMMHCGWNSTL 363 >gb|ABQ02257.1| O-glucosyltransferase 2 
[Vitis labrusca] Length = 447 Score = 355 bits (910), Expect = 4e-95 Identities = 186/331 (56%), Positives = 241/331 (72%), Gaps = 2/331 (0%) Frame = +2 Query: 5 LQLAQRLTRAGARVTFATTIHGLRQIRTLPTYPGLSYASFSNGNDDGTKSSDDGTMAKLK 184 L LA+ L R G RVTFAT + GLR+I TLPT PGL +ASFS+G DDG S+ +M ++K Sbjct: 21 LHLAKLLLRLGVRVTFATFVSGLRRIATLPTIPGLHFASFSDGYDDGNNSNY--SMEEMK 78 Query: 185 RVGSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFAIYQQ 364 RVGS++L+NLL +LS++ PVT+LIY LLPWAA VAR +PSAFL SAT+ A+Y + Sbjct: 79 RVGSQSLSNLLLSLSNERGPVTYLIYGFLLPWAATVAREHGIPSAFLSTQSATAIAVYHR 138 Query: 365 FFNRHDGLYHHD-NNKINPSMSLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQEHIQT 541 +F HDGL++ + N +N +SLELP LP Y++LPS LLPT+P++ ++PSFQE IQ Sbjct: 139 YFKAHDGLFNTELGNSLN--ISLELPGLPPLKYEDLPSILLPTSPHAW-VVPSFQELIQN 195 Query: 542 LEKDPNPYILFNTFHPLEEQSIKAI-DNMNVIPIGPLIPSNKSYDCDLFESSGDYLQWLD 718 LE+DPNP +L NTF+ LEE IKA+ D MNV+ IGPL+ + S CDLF S DY WL+ Sbjct: 196 LEQDPNPCVLINTFNALEEDVIKALGDFMNVVAIGPLMQLDSSISCDLFGRSKDYHPWLN 255 Query: 719 TKPERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVRSPESEEVKGLIDCDLGEE 898 +KPE SV+YVSFG+ L K+ +EE+ GL+ S RPFLWV+RS ESE + + E+ Sbjct: 256 SKPEGSVIYVSFGSLATLQKKQMEEIFHGLMESHRPFLWVIRSMESELEEKMNSSLSEEQ 315 Query: 899 GLIVPWCSQVEVLSHRSVGSFATHCGWNSTV 991 GLIV WCSQVEVL H++VG F THCGWNST+ Sbjct: 316 GLIVQWCSQVEVLCHQAVGCFLTHCGWNSTM 346 >emb|CDP20003.1| unnamed protein product [Coffea canephora] Length = 461 Score = 353 bits (907), Expect = 9e-95 Identities = 188/343 (54%), Positives = 240/343 (69%), Gaps = 14/343 (4%) Frame = +2 Query: 5 LQLAQRLTRAGARVTFATTIHGLRQIRTLPTYPGLSYASFSNGNDD--GTKSSDDGTMAK 178 LQLA+ L R GA+VTFATT++G +IR LP LS+ASFS+G DD K+ D + + Sbjct: 21 LQLAKNLARTGAQVTFATTVYGFSRIRNLPASGCLSFASFSDGYDDEKSQKNRDFTSFSS 80 Query: 179 -LKRVGSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFAI 355 KR G + LT L+ T S + RPVTFLIYT++LPW AEVAR MH+PSAFL I SAT+FAI Sbjct: 81 DTKRFGYKDLTKLIQTTSKEGRPVTFLIYTVMLPWVAEVAREMHIPSAFLAIQSATTFAI 140 Query: 356 YQQFFNRHDGLYHHDNNKINPSMSLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQEHI 535 Y ++FN 
HDG Y S+S++LPDLPLF ++LP+FLLP + + + +P F EHI Sbjct: 141 YHRYFNSHDGFYDGVREVECSSISIKLPDLPLFEKEDLPTFLLPNDQFFAFTVPFFHEHI 200 Query: 536 QTLEKDPNPYILFNTFHPLEEQSIKAIDNMNVIPIGPLIPS---------NKSYDCDLFE 688 + LE+D P +L NTF+ LEE SIKA+D MN+I IGPLIPS +KS DLF+ Sbjct: 201 KILEQDSKPCVLVNTFNELEESSIKAVDGMNLISIGPLIPSAFSDRNDLTDKSIGGDLFD 260 Query: 689 S-SGDYLQWLDTKPERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVRSP-ESEE 862 + S +LQWLD KPERSV+YVSFG+ V L K E+L GL +GR +L V++S E EE Sbjct: 261 TPSKGFLQWLDPKPERSVIYVSFGSLVALKKAEKIEILHGLEEAGRAYLLVLQSDNEEEE 320 Query: 863 VKGLIDCDLGEEGLIVPWCSQVEVLSHRSVGSFATHCGWNSTV 991 VK +I+ EEG+IVPWCSQ+EVL HRS+G F THCGWNST+ Sbjct: 321 VKAMIENASSEEGMIVPWCSQMEVLCHRSIGCFITHCGWNSTL 363 >emb|CDP21504.1| unnamed protein product [Coffea canephora] Length = 461 Score = 353 bits (907), Expect = 9e-95 Identities = 188/343 (54%), Positives = 240/343 (69%), Gaps = 14/343 (4%) Frame = +2 Query: 5 LQLAQRLTRAGARVTFATTIHGLRQIRTLPTYPGLSYASFSNGNDD--GTKSSDDGTMAK 178 LQLA+ L R GA+VTFATT++G +IR LP LS+ASFS+G DD K+ D + + Sbjct: 21 LQLAKNLARTGAQVTFATTVYGFSRIRNLPASGCLSFASFSDGYDDEKSQKNRDFTSFSS 80 Query: 179 -LKRVGSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFAI 355 KR G + LT L+ T S + RPVTFLIYT++LPW AEVAR MH+PSAFL I SAT+FAI Sbjct: 81 DTKRFGYKDLTKLIQTTSKEGRPVTFLIYTVMLPWVAEVAREMHIPSAFLAIQSATTFAI 140 Query: 356 YQQFFNRHDGLYHHDNNKINPSMSLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQEHI 535 Y ++FN HDG Y S+S++LPDLPLF ++LP+FLLP + + + +P F EHI Sbjct: 141 YHRYFNSHDGFYDGVREVECSSISIKLPDLPLFEKEDLPTFLLPNDQFFAFTVPFFHEHI 200 Query: 536 QTLEKDPNPYILFNTFHPLEEQSIKAIDNMNVIPIGPLIPS---------NKSYDCDLFE 688 + LE+D P +L NTF+ LEE SIKA+D MN+I IGPLIPS +KS DLF+ Sbjct: 201 KILEQDSKPCVLVNTFNELEESSIKAVDGMNLISIGPLIPSAFSDRNDLTDKSIGGDLFD 260 Query: 689 S-SGDYLQWLDTKPERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVRSP-ESEE 862 + S +LQWLD KPERSV+YVSFG+ V L K E+L GL +GR +L V++S E EE Sbjct: 261 TPSKGFLQWLDPKPERSVIYVSFGSLVALKKAEKIEILHGLEEAGRAYLLVLQSDNEEEE 320 Query: 863 VKGLIDCDLGEEGLIVPWCSQVEVLSHRSVGSFATHCGWNSTV 991 VK +I+ EEG+IVPWCSQ+EVL HRS+G F 
THCGWNST+ Sbjct: 321 VKAMIENASSEEGMIVPWCSQLEVLCHRSIGCFITHCGWNSTL 363 >gb|AKA44583.1| UGTPg37 [Panax ginseng] Length = 454 Score = 353 bits (906), Expect = 1e-94 Identities = 182/340 (53%), Positives = 237/340 (69%), Gaps = 11/340 (3%) Frame = +2 Query: 5 LQLAQRLTRAGARVTFATTIHGLRQIRTLPTYPGLSYASFSNGNDDGTKSSDDGTMAKLK 184 LQLA+ LTR+GA VT+AT G+ ++ LPT GLSYA+FS+GN+ + MA L+ Sbjct: 22 LQLAKILTRSGANVTYATA--GIGRLNALPTIDGLSYATFSDGNEHNATLPANDYMAMLR 79 Query: 185 RVGSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFAIYQQ 364 RVG ++LT L+H LS PVTF++YT+LLPW AEVAR MH+PSAFL I A +FAI+ + Sbjct: 80 RVGPQSLTKLVHDLSTKGTPVTFIVYTVLLPWVAEVARDMHLPSAFLFIQCAIAFAIFHR 139 Query: 365 FFNRHDGLYHHDNNKINPSMSLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQEHIQTL 544 FFN DGL H ++INP++S++LP LPLFT E+P FL P N + S M P+FQEHIQTL Sbjct: 140 FFNSQDGL-HSGVHEINPNVSVKLPGLPLFTCKEIPDFLFPHNQFYSPMAPAFQEHIQTL 198 Query: 545 EKDPNPYILFNTFHPLEEQSIKAIDNMNVIPIGPLIPS---------NKSYDCDLFESSG 697 EK+PNP +L NTF+ LE IK+ NM ++ IGPL+PS +KS+ LF++ Sbjct: 199 EKEPNPCVLVNTFNALEGDIIKSFPNMKLMAIGPLLPSAFSDGNDLNDKSFGGTLFQNPN 258 Query: 698 DYLQWLDTKPERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVRSPESEEVKGL- 874 +YL WLD+KP+RSV+Y SFG+ + L + EE+L GL RPFLWV+R E K + Sbjct: 259 NYLTWLDSKPDRSVIYASFGSLMQLKETQKEEVLHGLRICNRPFLWVIRDINEEVAKSMK 318 Query: 875 IDCDLGEE-GLIVPWCSQVEVLSHRSVGSFATHCGWNSTV 991 +D + +E G IVPWCSQVEVL HRS+G F THCGWNSTV Sbjct: 319 LDNGITDELGFIVPWCSQVEVLCHRSIGCFVTHCGWNSTV 358 >emb|CAN73416.1| hypothetical protein VITISV_017052 [Vitis vinifera] Length = 453 Score = 353 bits (906), Expect = 1e-94 Identities = 185/331 (55%), Positives = 239/331 (72%), Gaps = 2/331 (0%) Frame = +2 Query: 5 LQLAQRLTRAGARVTFATTIHGLRQIRTLPTYPGLSYASFSNGNDDGTKSSDDGTMAKLK 184 L LA+ L R G RVTFAT + GLR+I TLPT PGL +ASFS+G DDG S+ +M ++K Sbjct: 21 LHLAKLLLRVGVRVTFATFVSGLRRIATLPTIPGLHFASFSDGYDDGNNSNY--SMEEMK 78 Query: 185 RVGSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFAIYQQ 364 RVGS++L++LL +LS++ PVT+LIY LL WAA VAR +PSAFL SAT A+Y + Sbjct: 79 
RVGSQSLSSLLLSLSNERGPVTYLIYGFLLSWAATVAREHGIPSAFLSTQSATVIAVYHR 138 Query: 365 FFNRHDGLYHHD-NNKINPSMSLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQEHIQT 541 +F HDGL++ + N +N +SLELP LP Y++LPS LLPT+ ++S +PS QEHIQ Sbjct: 139 YFKAHDGLFNTELGNSLN--ISLELPGLPPLKYEDLPSILLPTSRHAS-FVPSLQEHIQN 195 Query: 542 LEKDPNPYILFNTFHPLEEQSIKAI-DNMNVIPIGPLIPSNKSYDCDLFESSGDYLQWLD 718 LE+DPNP +L NTF+ LEE IKA+ D MNV+ IGPL+ + S CDLFE S DYL WL+ Sbjct: 196 LEQDPNPCVLINTFNALEEDVIKALGDFMNVVAIGPLVQLDSSISCDLFERSKDYLPWLN 255 Query: 719 TKPERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVRSPESEEVKGLIDCDLGEE 898 +KPE SV+YVSFG+ L K+ +EE+ GL+ S RPFLWV+RS ESE + + E+ Sbjct: 256 SKPEGSVIYVSFGSLATLQKKQMEEIFHGLMESHRPFLWVIRSIESELEEKMNSSLSEEQ 315 Query: 899 GLIVPWCSQVEVLSHRSVGSFATHCGWNSTV 991 GLIV WC QVEVL H++VG F THCGWNST+ Sbjct: 316 GLIVQWCFQVEVLCHQAVGCFLTHCGWNSTM 346 >gb|ABQ02256.1| O-glucosyltransferase 1 [Vitis labrusca] gi|281494522|gb|ADA72017.1| O-glucosyltransferase [Vitis amurensis] Length = 448 Score = 353 bits (905), Expect = 2e-94 Identities = 188/328 (57%), Positives = 238/328 (72%), Gaps = 3/328 (0%) Frame = +2 Query: 11 LAQRLTRAGARVTFATTIHGLRQIRTLPTYPGLSYASFSNGNDDGTKSSDDGTMAKLKRV 190 L + L R G RVTF T G RQI TLPT PGL +AS S+G DDG +S+ +M ++KRV Sbjct: 23 LVKLLLRLGVRVTFTTFASGFRQIATLPTLPGLHFASVSDGYDDGNRSNF--SMDEMKRV 80 Query: 191 GSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFAIYQQFF 370 GS++L+NLL +LS++ PVTFLIY L+LPWAA VAR +PSAFL SAT A+Y ++F Sbjct: 81 GSQSLSNLLLSLSNERGPVTFLIYGLVLPWAATVAREHGIPSAFLSTQSATVIAVYHRYF 140 Query: 371 NRHDGLYHHD-NNKINPSMSLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQEHIQTLE 547 HDGL++ + N +N +SLELP LP Y++LPS LLP NPY+S ++P FQEHIQ LE Sbjct: 141 KAHDGLFNTELGNPLN--ISLELPGLPPLKYEDLPSILLPGNPYAS-VLPCFQEHIQNLE 197 Query: 548 KDPNPYILFNTFHPLEEQSIKAIDN-MNVIPIGPLIPSNKSYDCDLFESSGDYLQWLDTK 724 +DPNP +L NTF LEE IKA+ + MNV+ IGPL+ + S CDLFE S DYL WL++K Sbjct: 198 QDPNPCVLVNTFDALEEDVIKALGHYMNVVAIGPLMQLDSSISCDLFERSEDYLPWLNSK 257 Query: 725 PERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVRSPESEEVKGLIDCDLGEE-G 901 P+ SV+YVSFG+ VL K+ +EE+ 
GL+ S RPFLWV RS ES EV+ + + L EE G Sbjct: 258 PDGSVIYVSFGSLAVLQKKQMEEIFHGLMESHRPFLWVTRSTES-EVEEMTNNSLSEEQG 316 Query: 902 LIVPWCSQVEVLSHRSVGSFATHCGWNS 985 LIV WCSQVEVL H++VG F THCGWNS Sbjct: 317 LIVQWCSQVEVLCHQAVGCFLTHCGWNS 344 >ref|XP_006373420.1| hypothetical protein POPTR_0017s13620g [Populus trichocarpa] gi|550320242|gb|ERP51217.1| hypothetical protein POPTR_0017s13620g [Populus trichocarpa] Length = 460 Score = 352 bits (902), Expect = 3e-94 Identities = 185/345 (53%), Positives = 239/345 (69%), Gaps = 15/345 (4%) Frame = +2 Query: 2 LLQLAQRLTRAGA-RVTFATTIHGLRQIRTLPTYPGLSYASFSNGNDDGTKSSDDG--TM 172 + QL + L AGA RVTFATT HGL Q+ P+ L YASFS+G DDG K ++D M Sbjct: 20 MFQLGKCLIHAGAGRVTFATTAHGLTQVEAFPSLENLHYASFSDGFDDGIKPTNDPHRIM 79 Query: 173 AKLKRVGSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFA 352 A+LKRVGS+TLT LL +LS + PV++LIYTLLLPWAA++AR M +PSAFLCI S T+FA Sbjct: 80 AELKRVGSQTLTELLLSLSKEGNPVSYLIYTLLLPWAADIARDMSIPSAFLCILSTTAFA 139 Query: 353 IYQQFFNRHDGLYHHDNNKINPSMSLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQEH 532 + FF DG+Y D+N P S+E+P LPLFT ++PSFLLP +P++S +IP FQ H Sbjct: 140 LCYCFFEERDGVY--DSNDNRPPSSIEMPGLPLFTSKDMPSFLLPNDPHASTLIPIFQHH 197 Query: 533 IQTLEKDPNPYILFNTFHPLEEQSIKAIDNMNVIPIGPLI---------PSNKSYDCDLF 685 IQ LEKD NP +L NT +EE++I+ I N+N IPIGPL+ ++ S DLF Sbjct: 198 IQALEKDSNPCVLLNTSDCVEEEAIRLISNLNPIPIGPLVSYAFLDENNSTDSSCGIDLF 257 Query: 686 ESSGDYLQWLDTKPERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVR---SPES 856 E S +Y QWL++KPE SVVYVSFG+ VL + +E++L GL + RPFLWV+R S + Sbjct: 258 EKSAEYSQWLNSKPEGSVVYVSFGSLAVLQRNQMEKILLGLTSNCRPFLWVIRPSGSNDR 317 Query: 857 EEVKGLIDCDLGEEGLIVPWCSQVEVLSHRSVGSFATHCGWNSTV 991 E + + D E GLIVPWCSQ+EVL+H S+G F HCGWNST+ Sbjct: 318 EFEEKIRDKVNEEVGLIVPWCSQMEVLTHESIGCFMMHCGWNSTL 362 >emb|CBI39412.3| unnamed protein product [Vitis vinifera] Length = 646 Score = 351 bits (900), Expect = 6e-94 Identities = 184/328 (56%), Positives = 235/328 (71%), Gaps = 1/328 (0%) Frame = +2 Query: 11 LAQRLTRAGARVTFATTIHGLRQIRTLPTYPGLSYASFSNGNDDGTKSSDDGTMAKLKRV 190 L + L R G 
RVTF T G R+I TLPT PGL +AS S+G DDG S+ +M ++KRV Sbjct: 222 LVKLLLRLGVRVTFTTFASGFRRIATLPTLPGLHFASVSDGYDDGNHSNF--SMDEMKRV 279 Query: 191 GSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFAIYQQFF 370 GS++L+NLL +LS++ PVTFLIY L+LPWAA VAR +PSAFL SAT A+Y ++F Sbjct: 280 GSQSLSNLLLSLSNERGPVTFLIYGLVLPWAATVAREHGIPSAFLSTQSATVIAVYHRYF 339 Query: 371 NRHDGLYHHDNNKINPSMSLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQEHIQTLEK 550 HDGL+ + I ++SLELP LP Y++LPS LLP NPY+S ++P FQEHIQ LE+ Sbjct: 340 KAHDGLFKTELG-IPLNISLELPGLPPLKYEDLPSILLPGNPYAS-VLPCFQEHIQNLEQ 397 Query: 551 DPNPYILFNTFHPLEEQSIKAIDN-MNVIPIGPLIPSNKSYDCDLFESSGDYLQWLDTKP 727 DPNP +L NTF LEE IKA+ + MNV+ IGPL+ + S CDLFE S DYL WL++KP Sbjct: 398 DPNPCVLVNTFDALEEDVIKALGHYMNVVAIGPLMQLDSSISCDLFERSKDYLPWLNSKP 457 Query: 728 ERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVRSPESEEVKGLIDCDLGEEGLI 907 + SV+YVSFG+ VL K+ +EE+ GL+ S RPFLWV+RS ESE + + E+GLI Sbjct: 458 DGSVIYVSFGSLAVLQKKQMEEIFHGLMESHRPFLWVIRSTESEVEEMTNNSMSEEQGLI 517 Query: 908 VPWCSQVEVLSHRSVGSFATHCGWNSTV 991 V WCSQVEVL H++VG F THCGWNST+ Sbjct: 518 VQWCSQVEVLCHQAVGCFLTHCGWNSTM 545 >ref|XP_006365463.1| PREDICTED: anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 1-like [Solanum tuberosum] Length = 458 Score = 350 bits (897), Expect = 1e-93 Identities = 177/340 (52%), Positives = 234/340 (68%), Gaps = 11/340 (3%) Frame = +2 Query: 5 LQLAQRLTRAGARVTFATTIHGLRQIRTLPTYPGLSYASFSNGNDDGTKSSDDGTMAK-L 181 LQLA+ L+RAG R TF TT++G R++ LP+ GL YAS S+GNDDG D G K L Sbjct: 21 LQLAKNLSRAGTRCTFVTTVNGFRKLNNLPSIDGLFYASISDGNDDGAAKMDFGDYLKQL 80 Query: 182 KRVGSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFAIYQ 361 KRVGSE L L+ L+ D PVT L+YT L W AEVAR +++PSAFL I SAT+FAIY Sbjct: 81 KRVGSENLKKLIDELAGDGHPVTCLVYTFLWAWVAEVAREINLPSAFLAIQSATAFAIYH 140 Query: 362 QFFNRHDGLYHHDNNKINPSMSLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQEHIQT 541 F+ ++ + ++I S ++LP+LPLF+ D++PSFLL +PYSS MIP +EHIQ Sbjct: 141 HLFSINNNGVYSSTSEIELSFPIKLPELPLFSRDDIPSFLLQNDPYSSFMIPVMREHIQN 200 Query: 542 LEKDPNPYILFNTFHPLEEQSIKAIDNMNVIPIGPLIPS---------NKSYDCDLFESS 694 
LE DPNP +L NTF LEE+S+K +D + + IGPLIPS +KS+ CDLFE S Sbjct: 201 LEHDPNPRVLINTFDKLEEKSLKILDKIGICSIGPLIPSAFLNGNELEDKSFGCDLFEKS 260 Query: 695 GDYLQWLDTKPERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVRSPESEEVKGL 874 Y QWLD+KPE SVVYV+FG+ ++ + EE+LQ L+ S PFLWV+RS + ++ K Sbjct: 261 ETYCQWLDSKPEGSVVYVAFGSVAMVKEEQKEEVLQSLMESEMPFLWVIRSSKEDDKKKN 320 Query: 875 IDC-DLGEEGLIVPWCSQVEVLSHRSVGSFATHCGWNSTV 991 + L +G+IVPWCSQ+EVL H+S+G F +HCGWNST+ Sbjct: 321 DEIYGLNGKGMIVPWCSQMEVLFHKSIGCFVSHCGWNSTL 360 >ref|XP_002263497.1| PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Vitis vinifera] Length = 448 Score = 348 bits (894), Expect = 3e-93 Identities = 183/326 (56%), Positives = 233/326 (71%), Gaps = 1/326 (0%) Frame = +2 Query: 11 LAQRLTRAGARVTFATTIHGLRQIRTLPTYPGLSYASFSNGNDDGTKSSDDGTMAKLKRV 190 L + L R G RVTF T G R+I TLPT PGL +AS S+G DDG S+ +M ++KRV Sbjct: 23 LVKLLLRLGVRVTFTTFASGFRRIATLPTLPGLHFASVSDGYDDGNHSNF--SMDEMKRV 80 Query: 191 GSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFAIYQQFF 370 GS++L+NLL +LS++ PVTFLIY L+LPWAA VAR +PSAFL SAT A+Y ++F Sbjct: 81 GSQSLSNLLLSLSNERGPVTFLIYGLVLPWAATVAREHGIPSAFLSTQSATVIAVYHRYF 140 Query: 371 NRHDGLYHHDNNKINPSMSLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQEHIQTLEK 550 HDGL+ + I ++SLELP LP Y++LPS LLP NPY+S ++P FQEHIQ LE+ Sbjct: 141 KAHDGLFKTELG-IPLNISLELPGLPPLKYEDLPSILLPGNPYAS-VLPCFQEHIQNLEQ 198 Query: 551 DPNPYILFNTFHPLEEQSIKAIDN-MNVIPIGPLIPSNKSYDCDLFESSGDYLQWLDTKP 727 DPNP +L NTF LEE IKA+ + MNV+ IGPL+ + S CDLFE S DYL WL++KP Sbjct: 199 DPNPCVLVNTFDALEEDVIKALGHYMNVVAIGPLMQLDSSISCDLFERSKDYLPWLNSKP 258 Query: 728 ERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVRSPESEEVKGLIDCDLGEEGLI 907 + SV+YVSFG+ VL K+ +EE+ GL+ S RPFLWV+RS ESE + + E+GLI Sbjct: 259 DGSVIYVSFGSLAVLQKKQMEEIFHGLMESHRPFLWVIRSTESEVEEMTNNSMSEEQGLI 318 Query: 908 VPWCSQVEVLSHRSVGSFATHCGWNS 985 V WCSQVEVL H++VG F THCGWNS Sbjct: 319 VQWCSQVEVLCHQAVGCFLTHCGWNS 344 >ref|XP_004239848.1| PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Solanum lycopersicum] Length = 458 Score = 346 bits (888), 
Expect = 1e-92 Identities = 176/340 (51%), Positives = 232/340 (68%), Gaps = 11/340 (3%) Frame = +2 Query: 5 LQLAQRLTRAGARVTFATTIHGLRQIRTLPTYPGLSYASFSNGNDDGTKSSD-DGTMAKL 181 LQLA+ L+RAG R TF TT++G ++ LP+ GL YAS S+GNDDGT D M +L Sbjct: 21 LQLAKNLSRAGVRCTFVTTVNGFSKLNNLPSIDGLFYASISDGNDDGTAKMDFSDYMKQL 80 Query: 182 KRVGSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFAIYQ 361 KRVGSE L L+ + D PVT L+YT + PW AEVAR +++PSAFL I SAT+FAIY Sbjct: 81 KRVGSENLKKLIDRYAGDGHPVTCLVYTFIWPWVAEVAREINLPSAFLVIQSATAFAIYH 140 Query: 362 QFFNRHDGLYHHDNNKINPSMSLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQEHIQT 541 F+ ++ + N+IN S ++LP+LPL D++PSFLL +PYSS MIP +EHIQ Sbjct: 141 HLFSINNNGVYSSTNEINLSFPIKLPELPLLFRDDIPSFLLQNDPYSSFMIPVMREHIQN 200 Query: 542 LEKDPNPYILFNTFHPLEEQSIKAIDNMNVIPIGPLIPS---------NKSYDCDLFESS 694 LE D NP +L NTF+ LEE+S+K ID + + IGPLIPS +KS+ CDLFE S Sbjct: 201 LEHDTNPRVLINTFNKLEEKSLKIIDKIGIYSIGPLIPSAFLDGIELEDKSFGCDLFEKS 260 Query: 695 GDYLQWLDTKPERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVRSPESEEVKGL 874 Y QWLD+K E SVVYV+FG+ + + EE+LQGL+ S PFLWV+RS + ++ K Sbjct: 261 ETYCQWLDSKLEGSVVYVAFGSIATVKEEQKEEVLQGLLESEMPFLWVIRSSKEDDKKKN 320 Query: 875 IDC-DLGEEGLIVPWCSQVEVLSHRSVGSFATHCGWNSTV 991 + L +G+IVPWCSQ+EVL H+S+G F +HCGWNST+ Sbjct: 321 DEIYGLNGKGMIVPWCSQMEVLFHKSIGCFVSHCGWNSTL 360 >ref|XP_009778639.1| PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Nicotiana sylvestris] Length = 465 Score = 344 bits (883), Expect = 5e-92 Identities = 175/349 (50%), Positives = 236/349 (67%), Gaps = 20/349 (5%) Frame = +2 Query: 5 LQLAQRLTRAGARVTFATTIHGLRQIRTLPTYPGLSYASFSNGNDDGTKSSDDGT--MAK 178 LQ+A+ L RAGAR TF TT++GL+++ LPT L Y+SFS+G DD S+ D M Sbjct: 21 LQMAKNLARAGARATFVTTVYGLKRMNNLPTQERLFYSSFSDGYDDDWISNTDHNDYMNN 80 Query: 179 LKRVGSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFAIY 358 LK GS+ L NLL SD+ PVTFL+YT+LLPW A VAR +HVPSAFL I T+FAIY Sbjct: 81 LKYEGSKNLKNLLRKFSDEGHPVTFLVYTILLPWVAVVARELHVPSAFLVIQCGTAFAIY 140 Query: 359 QQFFNRHDGLYHHDNNKIN--PSMSLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQEH 532 FN +G+Y + I 
PS ++E P+LPLF+ +++P+ +LP P+SS MIP +EH Sbjct: 141 NHLFNSINGVYSSSVSDITVTPSFAIEFPELPLFSSNDIPTIVLPNAPHSSVMIPIMREH 200 Query: 533 IQTLEKDPNPYILFNTFHPLEEQSIKAIDNMNVIPIGPLIPS---------NKSYDCDLF 685 IQ LEKDPN +L N+F LEE+S++ +D + + +GPL+PS +KS+ C+LF Sbjct: 201 IQNLEKDPNSCVLINSFDALEEKSMRIVDKLRIFSVGPLVPSAFSDGNDPKDKSFGCELF 260 Query: 686 ES-SGDYLQWLDTKPERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVRSPESEE 862 E+ +Y QWLD+KPE SV+YVSFG+ VL K EE+L GL+ S RPFLWV+R + E Sbjct: 261 ENREKNYRQWLDSKPEGSVIYVSFGSIAVLEKEQKEEILHGLLESERPFLWVMRKGKEEV 320 Query: 863 VKG------LIDCDLGEEGLIVPWCSQVEVLSHRSVGSFATHCGWNSTV 991 +G D L ++G+I+PWC+Q+EVL H+SVG F THCGWNST+ Sbjct: 321 EEGNNYKNEFDDILLNKKGIIIPWCAQMEVLFHKSVGCFVTHCGWNSTL 369 >ref|XP_009604418.1| PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Nicotiana tomentosiformis] Length = 465 Score = 344 bits (883), Expect = 5e-92 Identities = 176/349 (50%), Positives = 234/349 (67%), Gaps = 20/349 (5%) Frame = +2 Query: 5 LQLAQRLTRAGARVTFATTIHGLRQIRTLPTYPGLSYASFSNGNDDGTKSSDDGT--MAK 178 LQ+A+ L RAGAR TF TT++GL+++ LPT L Y+SFS+G DD S+ D M Sbjct: 21 LQMAKNLARAGARATFITTVYGLKRMNNLPTQERLFYSSFSDGYDDDWISNTDHNDYMNN 80 Query: 179 LKRVGSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFAIY 358 LK GS+ L N+L SD+ PVTFL+YT+LLPW A VAR +HVPSAFL I T+FAIY Sbjct: 81 LKHEGSKNLKNILRKFSDEGHPVTFLVYTILLPWVAVVARDIHVPSAFLVIQCGTAFAIY 140 Query: 359 QQFFNRHDGLYHHDNNKIN--PSMSLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQEH 532 FN +G+Y + I PS +E P LPLF+ +++P+ +LP +P+SS MIP +EH Sbjct: 141 NHLFNSINGVYSSSVSDITVTPSFPIEFPGLPLFSCNDIPTIVLPNDPHSSVMIPIMREH 200 Query: 533 IQTLEKDPNPYILFNTFHPLEEQSIKAIDNMNVIPIGPLIPS---------NKSYDCDLF 685 IQ LE DPN +L NTF LEE+S++ +D M + +GPL+PS +KS+ C+LF Sbjct: 201 IQNLENDPNSCVLINTFDTLEEKSMRIVDKMRIFSVGPLVPSAFSDGNDPKDKSFGCELF 260 Query: 686 ES-SGDYLQWLDTKPERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVRSPESEE 862 E+ +Y +WLD+KP+ SVVYVSFG+ VL K EE+L GL+ S RPFLWV+R + E Sbjct: 261 ENPEKNYRRWLDSKPKGSVVYVSFGSIAVLKKEQKEEILHGLLESERPFLWVMRKGKEEV 320 Query: 863 
VKG------LIDCDLGEEGLIVPWCSQVEVLSHRSVGSFATHCGWNSTV 991 KG D L E+GLI+PWC+Q+EVL H+S+G F THCGWNST+ Sbjct: 321 EKGNNYKNEYDDSLLNEKGLIIPWCAQMEVLFHKSIGCFVTHCGWNSTL 369 >ref|XP_007041388.1| UDP-Glycosyltransferase superfamily protein [Theobroma cacao] gi|508705323|gb|EOX97219.1| UDP-Glycosyltransferase superfamily protein [Theobroma cacao] Length = 468 Score = 343 bits (880), Expect = 1e-91 Identities = 183/341 (53%), Positives = 239/341 (70%), Gaps = 13/341 (3%) Frame = +2 Query: 5 LQLAQRLTRAGARVTFATTIHGLRQIRTLPTYPGLSYASFSNGNDDGTKSSD--DGTMAK 178 LQLA+RL +AGARVTFATT G R+I++ P+ GL+YA FS+G DDGT SD + M+K Sbjct: 31 LQLAKRLIQAGARVTFATTTSGQRKIKSFPSLEGLAYAFFSDGFDDGTSPSDKQEDIMSK 90 Query: 179 LKRVGSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFAIY 358 L+ +GS+TLTNLL +LS + PV+FLIY+LLL W A+VAR + +PSA LC HS +FAIY Sbjct: 91 LEHIGSQTLTNLLLSLSGEGHPVSFLIYSLLLSWVADVARDLSIPSALLCNHSGAAFAIY 150 Query: 359 QQFFNRHDGLYHHDNNKIN-PSMSLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQEHI 535 + N G Y ++KIN P + LP F + +LPSFLLP +P+ S + SFQ+HI Sbjct: 151 HHYLNSQTGAY---DSKINCPPSFINFEGLPPFKWKDLPSFLLPYSPH-SFVTTSFQKHI 206 Query: 536 QTLEKDPNPYILFNTFHPLEEQSIKAI---DNMNVIPIGPLIPSNKSYDCDLFESSGD-- 700 + LEKDPN +L NTF LEE +IK + N+N+I IGPL+PS+K CDLFE+S Sbjct: 207 RVLEKDPNSCVLINTFDELEEYAIKTLAHDSNINLITIGPLVPSDKFVGCDLFENSSHDY 266 Query: 701 YLQWLDTKPERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVR-SPESEEVKG-- 871 Y WLD+KP+ SVVY+SFG+ VL + +EE+ G++ SG FLWV+R S + EE +G Sbjct: 267 YTHWLDSKPDCSVVYISFGSLAVLPRNQMEEIFHGIVDSGYTFLWVIRPSKDGEEEEGFE 326 Query: 872 --LIDCDLGEEGLIVPWCSQVEVLSHRSVGSFATHCGWNST 988 + D E+GLIVPWCSQVEVL+HR+VG F THCGWNST Sbjct: 327 NAIKDKIKEEQGLIVPWCSQVEVLNHRAVGCFVTHCGWNST 367 >emb|CDP20005.1| unnamed protein product [Coffea canephora] Length = 463 Score = 340 bits (873), Expect = 8e-91 Identities = 181/345 (52%), Positives = 240/345 (69%), Gaps = 16/345 (4%) Frame = +2 Query: 5 LQLAQRLTRAGARVTFATTIHGLRQI-RTLPTYPGLSYASFSNGNDD--GTKSSDDGTM- 172 LQLA+ L R GARVTFATT+HG I + LP Y GLSYA+FS+G DD +K D G Sbjct: 22 
LQLAKSLARNGARVTFATTVHGFSCINKALPRYNGLSYATFSDGCDDEESSKRRDRGRFF 81 Query: 173 AKLKRVGSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFA 352 A LK G++T+ L+ TLS++ RPVT LIYT+LLPW AEVA M +PS F I AT+FA Sbjct: 82 ADLKHFGTQTVRELIKTLSEEGRPVTCLIYTILLPWVAEVAFEMEIPSVFFVIQCATAFA 141 Query: 353 IYQQFFNRHDGLYHHDNNKINPSMSLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQEH 532 IY ++FN DG+Y +I+PS+S++LP+LPLF +LP+ ++P+NPY ++ +P F EH Sbjct: 142 IYLRYFNSQDGVYD-GVREIDPSISIQLPNLPLFLSTDLPTIIMPSNPYFASTVPVFHEH 200 Query: 533 IQTLEKDPNPYILFNTFHPLEEQSIKAIDNMNVIPIGPLIPS---------NKSYDCDLF 685 I+ LE+D +L NTF+ LE+ S++AI NMNVIPIGPLIPS +KS DLF Sbjct: 201 IKILEQDTKACVLVNTFNDLEQASLRAITNMNVIPIGPLIPSAFSDGTDLTDKSVGGDLF 260 Query: 686 ES-SGDYLQWLDTKPERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVRSPESE- 859 +S DY++WLD KPERSVVYVSFG+ L K E+ GL +G +L V+R ++E Sbjct: 261 DSPKQDYIRWLDLKPERSVVYVSFGSLATLNKEQKIEIFHGLEEAGWDYLMVIRKSDNED 320 Query: 860 -EVKGLIDCDLGEEGLIVPWCSQVEVLSHRSVGSFATHCGWNSTV 991 EVK +++ L +G+IVPWCSQ+EVL H+S+G F THCGWNST+ Sbjct: 321 QEVKEMMENGLSGKGIIVPWCSQMEVLCHKSIGCFLTHCGWNSTL 365 >emb|CDP21497.1| unnamed protein product [Coffea canephora] Length = 464 Score = 340 bits (871), Expect = 1e-90 Identities = 179/344 (52%), Positives = 238/344 (69%), Gaps = 15/344 (4%) Frame = +2 Query: 5 LQLAQRLTRAGARVTFATTIHGLRQIRTLPTYPGLSYASFSNGNDD--GTKSSDDGT-MA 175 LQLA+ L RAGA+VTFATT++GL +I+ P GLS+ASFS+G DD K+ D ++ Sbjct: 23 LQLAKNLARAGAQVTFATTVYGLSRIKNRPASNGLSFASFSDGYDDEKSMKNRDFACFLS 82 Query: 176 KLKRVGSETLTNLLHTLSDDNRPVTFLIYTLLLPWAAEVARGMHVPSAFLCIHSATSFAI 355 +K GS+ LT L+ S++ RPVTF IYT+LLPW AE+A M+VPSAFL I ATSFA+ Sbjct: 83 DVKCFGSKDLTKLIQASSNEGRPVTFAIYTILLPWVAELASEMNVPSAFLVIQCATSFAL 142 Query: 356 YQQFFNRHDGLYHHDNNKINPSMSLELPDLPLFTYDELPSFLLPTNPYSSAMIPSFQEHI 535 Y ++FN HDG+Y S+S++LPDL LF ++LP+F P +P +++PSF EHI Sbjct: 143 YHRYFNSHDGIYDGVREVDYSSISIKLPDLSLFQKEDLPTFFFPNDPLFPSVVPSFHEHI 202 Query: 536 QTLEKDPNPYILFNTFHPLEEQSIKAIDNMNVIPIGPLIP---------SNKSYDCDLFE 688 + LE++ +L NTF+ LEE SIKA+D MN+IPIGPLIP S+KS +LF+ Sbjct: 203 
KILEQESTACVLVNTFNELEEASIKAVDGMNLIPIGPLIPSAFCDGYDSSDKSVGGNLFD 262 Query: 689 -SSGDYLQWLDTKPERSVVYVSFGTFVVLTKRHVEEMLQGLIGSGRPFLWVVR--SPESE 859 DYLQWLD+KPE SVVY SFG+ + L K E+L GL +GR +L V+R + + E Sbjct: 263 IPENDYLQWLDSKPESSVVYASFGSLLSLKKEEKMEILHGLKEAGRSYLLVLRADNEQEE 322 Query: 860 EVKGLIDCDLGEEGLIVPWCSQVEVLSHRSVGSFATHCGWNSTV 991 EVK +++ EEG+IVPWCSQ+EVL HRS+G F THCGWNST+ Sbjct: 323 EVKAVVENISSEEGMIVPWCSQMEVLCHRSIGCFLTHCGWNSTL 366