BLASTX nr result
ID: Coptis23_contig00030331
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00030331
         (799 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002306046.1| predicted protein [Populus trichocarpa] gi|2...   228   2e-57
ref|XP_002324085.1| predicted protein [Populus trichocarpa] gi|2...   218   2e-54
ref|XP_002532899.1| UDP-glucosyltransferase, putative [Ricinus c...   214   2e-53
ref|XP_004135442.1| PREDICTED: anthocyanidin 5,3-O-glucosyltrans...   211   1e-52
ref|XP_002438250.1| hypothetical protein SORBIDRAFT_10g010590 [S...   202   7e-50

>ref|XP_002306046.1| predicted protein [Populus trichocarpa]
 gi|222849010|gb|EEE86557.1| predicted protein [Populus trichocarpa]
          Length = 461

 Score = 228 bits (580), Expect = 2e-57
 Identities = 121/262 (46%), Positives = 159/262 (60%), Gaps = 2/262 (0%)
 Frame = +2

Query: 20  IPKAWIPSILLDF-TTLFATQFFENGKMMKESNGILVNTFESFEPESLGALSEGKXXXXX 196
           +PK+WIP  LL     +  T F E+ + + ES+GILVNTFESFE ESL  L++ +
Sbjct: 177 MPKSWIPPPLLKKGNNILKTSFIEDSRKVAESSGILVNTFESFEQESLRKLNDCQLLLER 236

Query: 197 XXXXXXXXXXX-CEFERGSSPLSWLDDQPDSSVVYVSFGSRTAMSKEQISELGIGLVRSG 373
                       C+FE+    L+WLDDQP  SVVYVSFGSRTA+S++Q+ ELG GLVRSG
Sbjct: 237 LPSVVAIGPLPPCDFEKSQLQLTWLDDQPAGSVVYVSFGSRTALSRDQVRELGEGLVRSG 296

Query: 374 CRFVWXXXXXXXXXXXXXXXXXXXXXXXFIEKVKEKGLALKTWVDQGEVLSHKSVGGFVS 553
            RF+W                        +E++KEKGL ++ WV+Q +VLSH +VGGF S
Sbjct: 297 SRFIWVVKDKKVDREDNEGLEGVIGDE-LMERMKEKGLVVRNWVNQEDVLSHPAVGGFFS 355

Query: 554 HCGWNSVSEAAWNGVPMLAWPQGGDQKINAEVVRKSGLGIWNESXXXXXXXXXXXXXXXX 733
           HCGWNSV EAAW+GV +LAWPQ GDQK+NA++V + GLG W +S
Sbjct: 356 HCGWNSVMEAAWHGVKILAWPQHGDQKVNADIVERIGLGTWVKSWGWGEEMIVNRAEIAE 415

Query: 734 XXXXLMGSEGLRLQAGKVREEA 799
               +MG+E LR+QA  ++EEA
Sbjct: 416 KIGEIMGNESLRIQALGIKEEA 437

>ref|XP_002324085.1| predicted protein [Populus trichocarpa]
 gi|222867087|gb|EEF04218.1| predicted protein [Populus trichocarpa]
          Length = 344

 Score = 218 bits (554), Expect = 2e-54
 Identities = 115/261 (44%), Positives = 156/261 (59%), Gaps = 1/261 (0%)
 Frame = +2

Query: 20  IPKAWIPSILLDFTT-LFATQFFENGKMMKESNGILVNTFESFEPESLGALSEGKXXXXX 196
           +PK+ +P  LL  +   F   F E+G+ + ES GIL+NTF SFE ESL  +++G+
Sbjct: 61  MPKSLLPPPLLQKSNNFFKNSFIEDGRKVTESCGILLNTFVSFELESLRKINDGQVLERP 120

Query: 197 XXXXXXXXXXXCEFERGSSPLSWLDDQPDSSVVYVSFGSRTAMSKEQISELGIGLVRSGC 376
                      C  E+    L+WLDDQP  SV+YVSFGSRTA++++QI ELG GL++SG
Sbjct: 121 PSVVAIGPFPPCNSEKSQLQLTWLDDQPAGSVLYVSFGSRTALARDQIRELGEGLIKSGS 180

Query: 377 RFVWXXXXXXXXXXXXXXXXXXXXXXXFIEKVKEKGLALKTWVDQGEVLSHKSVGGFVSH 556
           RFVW                        +E+VKEKGL +K W++Q  +LSH++VGGF+SH
Sbjct: 181 RFVWMVKDKKVDKEDSEELEEVIGYE-LMERVKEKGLIVKDWLNQDGILSHRAVGGFLSH 239

Query: 557 CGWNSVSEAAWNGVPMLAWPQGGDQKINAEVVRKSGLGIWNESXXXXXXXXXXXXXXXXX 736
           CGWNSV EAAW+GV +LAWPQ GDQKINA++V + GLG W +S
Sbjct: 240 CGWNSVMEAAWHGVRILAWPQNGDQKINADIVERIGLGTWVKSWGWSGEMLVKGAEIAER 299

Query: 737 XXXLMGSEGLRLQAGKVREEA 799
               MG+E LR+QA  ++E+A
Sbjct: 300 IRESMGNESLRIQALGIKEDA 320

>ref|XP_002532899.1| UDP-glucosyltransferase, putative [Ricinus communis]
 gi|223527333|gb|EEF29479.1| UDP-glucosyltransferase, putative [Ricinus communis]
          Length = 462

 Score = 214 bits (545), Expect = 2e-53
 Identities = 116/261 (44%), Positives = 156/261 (59%), Gaps = 1/261 (0%)
 Frame = +2

Query: 20  IPKAWIPSILL-DFTTLFATQFFENGKMMKESNGILVNTFESFEPESLGALSEGKXXXXX 196
           IP++WIP  LL D   L  T F +NGK M ES+GILVNTF+S E E L  L+ GK
Sbjct: 180 IPRSWIPPPLLQDTNNLLKTYFIKNGKKMAESSGILVNTFDSIEHEVLEQLNAGKVIENL 239

Query: 197 XXXXXXXXXXXCEFERGSSPLSWLDDQPDSSVVYVSFGSRTAMSKEQISELGIGLVRSGC 376
                      CE E   + L+WLD Q + SV++VSFGSRTA+S+ Q++ELG GLVRSG
Sbjct: 240 PPVIAIGSLASCESETKQA-LAWLDSQQNGSVLFVSFGSRTAISRAQLTELGEGLVRSGI 298

Query: 377 RFVWXXXXXXXXXXXXXXXXXXXXXXXFIEKVKEKGLALKTWVDQGEVLSHKSVGGFVSH 556
           RF+W                        IE++KE+GL +K+W++Q +VL H ++GGF+SH
Sbjct: 299 RFLWIVKDKKVDKEDEEDLSQVIGNR-LIERLKERGLVVKSWLNQEDVLRHSAIGGFLSH 357

Query: 557 CGWNSVSEAAWNGVPMLAWPQGGDQKINAEVVRKSGLGIWNESXXXXXXXXXXXXXXXXX 736
           CGWNSV+EA  +G+P+LAWPQ GDQKINA++V +  LG W +S
Sbjct: 358 CGWNSVTEAVQHGIPILAWPQHGDQKINADIVERIVLGTWEKSWGWGGEVVVKGNDIAEM 417

Query: 737 XXXLMGSEGLRLQAGKVREEA 799
              +MG++ LR  A ++REEA
Sbjct: 418 IKEMMGNDLLRAHAVQIREEA 438

>ref|XP_004135442.1| PREDICTED: anthocyanidin 5,3-O-glucosyltransferase-like [Cucumis sativus]
 gi|449530181|ref|XP_004172074.1| PREDICTED: anthocyanidin 5,3-O-glucosyltransferase-like [Cucumis sativus]
          Length = 458

 Score = 211 bits (538), Expect = 1e-52
 Identities = 114/262 (43%), Positives = 148/262 (56%), Gaps = 2/262 (0%)
 Frame = +2

Query: 20  IPKAWIPSILLDFTTLFATQFFENGKMMKESNGILVNTFESFEPESLGALSEGKXXXXXX 199
           IPK  +P  LL   ++F   F ++G+ +KE NGIL+N  +  E ++L AL+ GK
Sbjct: 173 IPKTSLPPPLLINNSIFGKIFAQDGQRIKELNGILINAMDGIEGDTLTALNTGKVLNGVP 232

Query: 200 XXXXXXXXXXCEFER--GSSPLSWLDDQPDSSVVYVSFGSRTAMSKEQISELGIGLVRSG 373
                     C+FE     SP+ WLD+ P  SVV+ SFGSRTA S++QI E+G GLV SG
Sbjct: 233 PVIPIGPFLPCDFENPDAKSPIKWLDNLPPRSVVFASFGSRTATSRDQIKEIGSGLVSSG 292

Query: 374 CRFVWXXXXXXXXXXXXXXXXXXXXXXXFIEKVKEKGLALKTWVDQGEVLSHKSVGGFVS 553
            RFVW                        ++K+KEKG+ LK WV+Q E+L H++VGGF+
Sbjct: 293 YRFVWVVKDKVVDKEDKEGLEDIMGEE-LMKKLKEKGMVLKEWVNQQEILGHRAVGGFIC 351

Query: 554 HCGWNSVSEAAWNGVPMLAWPQGGDQKINAEVVRKSGLGIWNESXXXXXXXXXXXXXXXX 733
           HCGWNSV EAA NGVP+L WPQ GDQ INAE++ K GLG+W E
Sbjct: 352 HCGWNSVMEAALNGVPILGWPQIGDQMINAELIAKKGLGMWVEEWGWGQKCLVKGEEVGG 411

Query: 734 XXXXLMGSEGLRLQAGKVREEA 799
               +M SE LR QA K R+EA
Sbjct: 412 RIKEMMESEALRKQAAKFRDEA 433

>ref|XP_002438250.1| hypothetical protein SORBIDRAFT_10g010590 [Sorghum bicolor]
 gi|241916473|gb|EER89617.1| hypothetical protein SORBIDRAFT_10g010590 [Sorghum bicolor]
          Length = 487

 Score = 202 bits (514), Expect = 7e-50
 Identities = 104/229 (45%), Positives = 140/229 (61%), Gaps = 2/229 (0%)
 Frame = +2

Query: 2   PGLSKKIPKAWIPSILLDFTTLFATQFFENGKMMKESNGILVNTFESFEPESLGALSEGK 181
           PG+ ++IP++++P  LLD   LF  QF +NG+ +  ++G LVNTF++ EP +L AL +GK
Sbjct: 193 PGV-RRIPQSYLPQPLLDLNKLFTKQFIDNGREIINADGFLVNTFDALEPVALAALRDGK 251

Query: 182 XXXXXXXXXXXXXXXXCEFER--GSSPLSWLDDQPDSSVVYVSFGSRTAMSKEQISELGI 355
                            E E   GS P++WLD+QP  SVVYV+FG+R A+S EQI E+
Sbjct: 252 VVAGFPPVYAIGPLRSKEEEATTGSPPVAWLDEQPARSVVYVAFGNRNAVSLEQIREIAA 311

Query: 356 GLVRSGCRFVWXXXXXXXXXXXXXXXXXXXXXXXFIEKVKEKGLALKTWVDQGEVLSHKS 535
           GL  SGCRF+W                       F+E+V+ +GL  K WVDQ  VL H S
Sbjct: 312 GLEASGCRFLWVLKTTTVDRDDTAELTDDVLGEGFLERVQGRGLVTKAWVDQEAVLKHAS 371

Query: 536 VGGFVSHCGWNSVSEAAWNGVPMLAWPQGGDQKINAEVVRKSGLGIWNE 682
           VG F+SH GWNSV+EAA  GVP+LAWP+GGD ++NA VV   G+G+W E
Sbjct: 372 VGLFLSHSGWNSVTEAAAAGVPLLAWPRGGDHRVNATVVVSGGVGVWME 420