BLASTX nr result
ID: Coptis21_contig00014545
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00014545
         (2447 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_002273935.1| PREDICTED: histone-lysine N-methyltransferas...   711   0.0
ref|XP_002278728.1| PREDICTED: histone-lysine N-methyltransferas...   684   0.0
ref|XP_002525581.1| histone-lysine n-methyltransferase, suvh, pu...   682   0.0
ref|XP_002303967.1| SET domain protein [Populus trichocarpa] gi|...   680   0.0
ref|XP_002336307.1| SET domain protein [Populus trichocarpa] gi|...   676   0.0

>ref|XP_002273935.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-9 specific SUVH3 [Vitis vinifera]
          Length = 716

 Score = 711 bits (1836), Expect = 0.0
 Identities = 375/685 (54%), Positives = 452/685 (65%), Gaps = 29/685 (4%)
 Frame = +1

Query: 1    PPFGPVPNGYNPFYPFYNTTDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSSNK 180
            PPFGP P G+ PFYPF S+N
Sbjct: 47   PPFGPFPPGFTPFYPF----------SVAQGPQSSPELNQHKTPTGATNHETPISASANL 96

Query: 181  FKNK-----VVDGGREESGDNGW-----SDNMRK-----FDEMQSGHRVNLARSPSLVNV 315
            F+ VV+G E S + G + NM FD+ + A + S
Sbjct: 97   FRTPPHFPGVVNGDAETSREYGVQFLNENSNMGVKQDGFFDDPKRAAPHLRASNSSRKKA 156

Query: 316  SSSVGEGTSNGVQKTKSHKK--VKRNDEIRFFAPADGDVESVGEVRMKFDALRRRFSQLE 489
            S S V K K V R D ++ DG+ E V V M FDALRRR SQ+E
Sbjct: 157  KKSKDVDISLTVDNEKGSSKNFVMRFDSLQL---DGNREMVNYVLMTFDALRRRLSQIE 213

Query: 490  DAKETSSGN-KRADLRASTALMNNGIRANTRKRIGAVPGIEIGDIFFFRIEMCLVGLHAP 666
            +AKE+ G KRADL+A+ LM+ G+R N RKRIG PG+E+GDIFFFR+EMCL GLHA
Sbjct: 214  EAKESPGGGIKRADLKAANILMSKGVRTNMRKRIGVTPGVEVGDIFFFRMEMCLAGLHAQ 273

Query: 667  SMAGIDYMNLKFDREEEPVAVSIVSSGGYEDDVEDKDVLIYSGQGGVSKRKDKKEKEAGD 846
            SMAGIDYM +K EEEPVAVSIVSSGGY+DD +D DVLIYSGQGG RKDK + D
Sbjct: 274  SMAGIDYMFVKGGLEEEPVAVSIVSSGGYDDDADDADVLIYSGQGGNVNRKDK---QVAD 330

Query: 847  QKLERGNLALERSLHRGNEVRVIRGMRDLVNVTGKIYVYDGLYRVHESWTEKGKSGCNVF 1026
            QKLERGNLAL+RS HR NEVRVIRG++D+VN K+YVYDGLY + ESWTEKGKSGCN+F
Sbjct: 331  QKLERGNLALDRSFHRANEVRVIRGVKDVVNPLSKVYVYDGLYTIQESWTEKGKSGCNMF 390

Query: 1027 KYKLLRIPGQPEAFVIWKSVQQWRDNVSSRPGLILPDLTSGAEKLPVSLVNDVDYEKGPA 1206
            KYKL+RIPGQP AF WKS+Q+W++ SSR GLILPDLTSGAE +PVSLVNDVD EKGPA
Sbjct: 391  KYKLVRIPGQPGAFAHWKSIQKWKEGFSSRIGLILPDLTSGAESIPVSLVNDVDDEKGPA 450

Query: 1207 HFTYSRTLKYSKPIKSMGPSVGCSCRSTCLPGDPHCSCVMKNGGSLPHTANGVLVVRKHV 1386
            HFTY TL+YSK PS GC+C++ CLPGD +CSC+ KNGG P+T+NG+LV R+ +
Sbjct: 451  HFTYFPTLRYSKSFNLKHPSFGCNCQNACLPGDLNCSCIRKNGGDFPYTSNGILVARRPL 510

Query: 1387 IHECGPLCKCYPNCRNQVSQNGLRVRLEVFKTNDRGWGLRSWDPIRAGTFICEYAXXXXX 1566
            +HECGP C C PNC+N++SQ GL+VRLEVFKTN+RGWGLRSWDPIR GTFICEYA
Sbjct: 511  VHECGPTCPCIPNCKNRMSQTGLKVRLEVFKTNNRGWGLRSWDPIRTGTFICEYAGEVLD 570

Query: 1567 XXXXXXXXXXXXXNEYVFDSTCDIDDSFEWNYAPELLDE----ERHEARGPQLSLVISAK 1734
            NEY+FD+T D++F+WN+ P LLDE E +E L+ISAK
Sbjct: 571  KVKVYQERDEGESNEYLFDTTHVYDNAFKWNHEPGLLDEEPSAEPNEYYDIPSPLIISAK 630

Query: 1735 HSGNVARFMNHSCSPNVFWQPVLYDHSDESFPHIMFYAMKHIPPMTELTYDYGLCGTYFR 1914
            + GNVARFMNHSCSPNVFWQPVLY+H++ESF HI F+A+KHIPPMTELTYDYG+ +
Sbjct: 631  YVGNVARFMNHSCSPNVFWQPVLYEHNNESFLHIAFFAIKHIPPMTELTYDYGMLQSENY 690

Query: 1915 NMQ-------RRVCLCGSYKCRGFF 1968
            +Q ++ CLCGS CRG++
Sbjct: 691  EVQSNHTPNGKKKCLCGSSNCRGYY 715

>ref|XP_002278728.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-9 specific SUVH1 [Vitis vinifera]
          Length = 737

 Score = 684 bits (1765), Expect = 0.0
 Identities = 347/614 (56%), Positives = 430/614 (70%), Gaps = 27/614 (4%)
 Frame = +1

Query: 211  EESGDNGWSDNMRKFDEMQSGHRVNLARSPSLVNVSSSVGEGTSNGVQKTKSHKKVKRND 390
            EE+ DN +S+ + + S +++ + S K+KS K+ ++
Sbjct: 146  EEADDNEYSETPNQNAQYLSSFSMHVTDAERTSKAQRS----------KSKSQKRGRKGQ 195

Query: 391  EIRFFAP-------------------------ADGDVESVGEVRMKFDALRRRFSQLEDA 495
            E+ F +P ADGD ESVG + M +D LRRR +Q+ED
Sbjct: 196  EVNFSSPEVDVELIISNILNSCNLMAFDTFRRADGDKESVGYILMVYDLLRRRITQIEDG 255

Query: 496  KETSSG-NKRADLRASTALMNNGIRANTRKRIGAVPGIEIGDIFFFRIEMCLVGLHAPSM 672
            KE + G +R DLR+ T LMN GIR N +KRIG VPG+E+GDIFFFR+EMCLVGLHAP M
Sbjct: 256  KEATPGVTRRPDLRSGTILMNKGIRTNIKKRIGLVPGVEVGDIFFFRMEMCLVGLHAPCM 315

Query: 673  AGIDYMNLKFDREEEPVAVSIVSSGGYEDDVEDKDVLIYSGQGGVSKRKDKKEKEAGDQK 852
            AGIDYM LK EEEPVAVSIVSSGGYED+VED DVLIYSGQGG RKDK + DQK
Sbjct: 316  AGIDYMGLKISLEEEPVAVSIVSSGGYEDNVEDGDVLIYSGQGGNIYRKDK---QIIDQK 372

Query: 853  LERGNLALERSLHRGNEVRVIRGMRDLVNVTGKIYVYDGLYRVHESWTEKGKSGCNVFKY 1032
            LERGNLALE+SLHRGNEVRVIRG+RD+VN TGK+YVYDGLY++ ESW EKGK+GCNVFKY
Sbjct: 373  LERGNLALEKSLHRGNEVRVIRGLRDVVNPTGKVYVYDGLYKIQESWVEKGKAGCNVFKY 432

Query: 1033 KLLRIPGQPEAFVIWKSVQQWRDNVSSRPGLILPDLTSGAEKLPVSLVNDVDYEKGPAHF 1212
            KL+R+PGQPEAF+ WKS+QQW++ +SSR G+ILPDLTSGAE LPVSLVNDVD EKGPA+F
Sbjct: 433  KLVRLPGQPEAFITWKSIQQWKEGLSSRAGVILPDLTSGAENLPVSLVNDVDDEKGPAYF 492

Query: 1213 TYSRTLKYSKPIKSMGPSVGCSCRSTCLPGDPHCSCVMKNGGSLPHTANGVLVVRKHVIH 1392
            TY +L+YSKP+ PS C+C+ CLPG+ +CSC+ KNGG +P+ GVLV K +I+
Sbjct: 493  TYFPSLRYSKPVNLTEPSFSCNCQGGCLPGNSNCSCIKKNGGYIPYNVAGVLVNNKSLIY 552

Query: 1393 ECGPLCKCYPNCRNQVSQNGLRVRLEVFKTNDRGWGLRSWDPIRAGTFICEYAXXXXXXX 1572
            ECGP C C NCRN++SQ GL+VRLEVFKT D+GWGLRSWDPIRAG FICEYA
Sbjct: 553  ECGPCCSCPINCRNRISQAGLKVRLEVFKTKDKGWGLRSWDPIRAGAFICEYA-GEVIND 611

Query: 1573 XXXXXXXXXXXNEYVFDSTCDIDDSFEWNYAP-ELLDEERHEARGPQLSLVISAKHSGNV 1749
            ++Y+FD+T Y P +L + ++A L+ISAK+ GNV
Sbjct: 612  CKVEELGSESEDDYIFDAT--------RTYQPLGVLPGDSNKAHQVPFPLIISAKNVGNV 663

Query: 1750 ARFMNHSCSPNVFWQPVLYDHSDESFPHIMFYAMKHIPPMTELTYDYGLCGTYFRNMQRR 1929
            ARFMNHSCSPNVFWQPVL + + ES+ HI F+A++HIPPMTELTYDYG+ + + +++
Sbjct: 664  ARFMNHSCSPNVFWQPVLRESNSESYLHIAFFAIRHIPPMTELTYDYGITQSGKADERKK 723

Query: 1930 VCLCGSYKCRGFFH 1971
            CLCGS KCRG F+
Sbjct: 724  RCLCGSLKCRGHFY 737

>ref|XP_002525581.1| histone-lysine n-methyltransferase, suvh, putative [Ricinus communis] gi|223535160|gb|EEF36840.1| histone-lysine n-methyltransferase, suvh, putative [Ricinus communis]
          Length = 681

 Score = 682 bits (1761), Expect = 0.0
 Identities = 343/579 (59%), Positives = 422/579 (72%), Gaps = 33/579 (5%)
 Frame = +1

Query: 331  EGTSNGVQK----------TKSHKKVKR----------NDEIRFFAPA---DGDVESVGE 441
            EGTS+G K + S K+ K+ N+ + P+ DGD V
Sbjct: 108  EGTSDGRPKRPVGRPRNSTSSSQKRAKKDLDFTLSVVDNNFVAGITPSQREDGDRGVVIN 167

Query: 442  VRMKFDALRRRFSQLEDAKETSSGN-KRADLRASTALMNNGIRANTRKRIGAVPGIEIGD 618
            + M+FDALRRR SQLED+KE +G KRADL+A LM+ G+R+N RKRIGAVPG+EIGD
Sbjct: 168  IMMRFDALRRRLSQLEDSKEAPTGLIKRADLKAGNVLMSKGVRSNMRKRIGAVPGVEIGD 227

Query: 619  IFFFRIEMCLVGLHAPSMAGIDYMNLKFDREEEPVAVSIVSSGGYEDDVEDKDVLIYSGQ 798
            IFFFR+EMC++GLH+ SMAGIDYM ++ D +E+P+AVSIVSSGGY+D+ ED+DVLIYSGQ
Sbjct: 228  IFFFRMEMCVIGLHSQSMAGIDYMIVRGDIDEDPLAVSIVSSGGYDDEAEDRDVLIYSGQ 287

Query: 799  GGVSKRKDKKEKEAGDQKLERGNLALERSLHRGNEVRVIRGMRDLVNVTGKIYVYDGLYR 978
            GG + +KEA DQKLERGNLALERSLHR NEVRVIRGM+D ++ K+Y+YDGLYR
Sbjct: 288  GG---NANSNKKEAADQKLERGNLALERSLHRANEVRVIRGMKDTLSQAAKVYMYDGLYR 344

Query: 979  VHESWTEKGKSGCNVFKYKLLRIPGQPEAFVIWKSVQQWRDNVSSRPGLILPDLTSGAEK 1158
            + ESW +KGKSGCN+FKYKL+R+PGQP AF +WKS+QQW++ +S+R GLILPDLTSGAE
Sbjct: 345  IQESWVDKGKSGCNIFKYKLVRVPGQPGAFSVWKSIQQWKEGISTRVGLILPDLTSGAET 404

Query: 1159 LPVSLVNDVDYEKGPAHFTYSRTLKYSKPIKSMGPSVGCSCRSTCLPGDPHCSCVMKNGG 1338
            LPVSLVNDVD EKGPA+FTY T+KY K K PS GC+CR+ C PGD CSC+ KNGG
Sbjct: 405  LPVSLVNDVDEEKGPAYFTYFPTVKYIKSFKLTEPSYGCNCRNACSPGDLDCSCIRKNGG 464

Query: 1339 SLPHTANGVLVVRKHVIHECGPLCKCYPNCRNQVSQNGLRVRLEVFKTNDRGWGLRSWDP 1518
            P+TANGVLV R+ ++HECGP C C PNC+N+VSQ GL+VRLEVFKT DRGWGLRSWDP
Sbjct: 465  DFPYTANGVLVSRRPLVHECGPTCPCIPNCKNRVSQTGLKVRLEVFKTKDRGWGLRSWDP 524

Query: 1519 IRAGTFICEYAXXXXXXXXXXXXXXXXXXNEYVFDSTCDIDDSFEWNYAPELLDE---ER 1689
            IR+GTFICEYA +EYVFD+T + + F+WN P L++E +
Sbjct: 525  IRSGTFICEYA--GEVIEKVKGKQDGEGEDEYVFDTT-RVYEPFKWNCEPGLVEEGDNDI 581

Query: 1690 HEARGPQLSLVISAKHSGNVARFMNHSCSPNVFWQPVLYDHSDESFPHIMFYAMKHIPPM 1869
            E L+ISA++ GNVARFMNHSC+PNVFWQPV Y+H+ ES+ HI F+A++HIPPM
Sbjct: 582  TEECNIPSPLIISARNVGNVARFMNHSCNPNVFWQPVAYEHNSESYVHIAFFAVRHIPPM 641

Query: 1870 TELTYDYGLC------GTYFRNMQRRVCLCGSYKCRGFF 1968
            TELTYDYG+ G R+ CLCGS KCRG F
Sbjct: 642  TELTYDYGISRSDEAEGNNNVQHGRKKCLCGSQKCRGSF 680

>ref|XP_002303967.1| SET domain protein [Populus trichocarpa] gi|222841399|gb|EEE78946.1| SET domain protein [Populus trichocarpa]
          Length = 653

 Score = 680 bits (1754), Expect = 0.0
 Identities = 341/533 (63%), Positives = 405/533 (75%), Gaps = 15/533 (2%)
 Frame = +1

Query: 415  DGDVESVGEVRMKFDALRRRFSQLEDAKETSSGN-KRADLRASTALMNNGIRANTRKRIG 591
            DG+ E V +RM+FDALRRR SQLEDAKE+ G +RADL+A LM +R N RKRIG
Sbjct: 130  DGNGEVVHSIRMRFDALRRRLSQLEDAKESPVGIIRRADLKAGNILMTKQVRTNMRKRIG 189

Query: 592  AVPGIEIGDIFFFRIEMCLVGLHAPSMAGIDYMNLKFDREEEPVAVSIVSSGGYEDDVED 771
            AVPG+EIGDIFFFRIEMCL+GLHAPSMAGIDYM+L+ D EEEP+AVSIVSSG YED+ ED
Sbjct: 190  AVPGVEIGDIFFFRIEMCLLGLHAPSMAGIDYMSLRNDLEEEPLAVSIVSSGYYEDNAED 249

Query: 772  KDVLIYSGQGGVSKRKDKKEKEAGDQKLERGNLALERSLHRGNEVRVIRGMRDLVNVTGK 951
            KDVLIYSGQGG + K+K A DQKLERGNLALERSL RGNEVRVIRGM+D VN K
Sbjct: 250  KDVLIYSGQGGAAN----KDKGATDQKLERGNLALERSLRRGNEVRVIRGMKDSVNQASK 305

Query: 952  IYVYDGLYRVHESWTEKGKSGCNVFKYKLLRIPGQPEAFVIWKSVQQWRDNVSSRPGLIL 1131
            +YVYDGLYRV ESW EK KSGCN+FKYKL+RIPGQP+AF +WKS+++W++ +SSR GLIL
Sbjct: 306  VYVYDGLYRVQESWVEKAKSGCNIFKYKLVRIPGQPDAFGVWKSIEKWKEGLSSRAGLIL 365

Query: 1132 PDLTSGAEKLPVSLVNDVDYEKGPAHFTYSRTLKYSKPIKSMGPSVGCSCRSTCLPGDPH 1311
            PDLTSGAE VSL+NDVD EKGPA+FTY T+KYSK K P+ GC+C + C PG+ +
Sbjct: 366  PDLTSGAESTAVSLLNDVDEEKGPAYFTYVSTVKYSKSFKLTQPAYGCNCPNACQPGNLN 425

Query: 1312 CSCVMKNGGSLPHTANGVLVVRKHVIHECGPLCKCYPNCRNQVSQNGLRVRLEVFKTNDR 1491
            CSC+ KN G+ P+TANGVLV R +I ECGP C C+PNC+N+VSQ GL+VRLEVFKT DR
Sbjct: 426  CSCIRKNEGNFPYTANGVLVCRAPMIDECGPTCPCFPNCKNRVSQTGLKVRLEVFKTKDR 485

Query: 1492 GWGLRSWDPIRAGTFICEYAXXXXXXXXXXXXXXXXXXNEYVFDSTCDIDDSFEWNYAPE 1671
            GWGLRSWDPIRAGTFICEYA ++YVFD T + +SF WNY P
Sbjct: 486  GWGLRSWDPIRAGTFICEYA--GEVVEKVSQPGEEGDGDDYVFD-TSRVYESFRWNYEPG 542

Query: 1672 LLDEER-----HEARGPQLSLVISAKHSGNVARFMNHSCSPNVFWQPVLYDHSDESFPHI 1836
            L++E+ E + P LVIS+++ GNVARFMNH C PNVFWQP++Y+H+ ESF HI
Sbjct: 543  LVEEDSSIEAIEEPKVPS-PLVISSRNVGNVARFMNHGCYPNVFWQPIMYEHNSESFIHI 601

Query: 1837 MFYAMKHIPPMTELTYDYGLC---------GTYFRNMQRRVCLCGSYKCRGFF 1968
            F+AM+HIPPMTELTYDYG G+ R RR CLCG+ +CRG+F
Sbjct: 602  GFFAMRHIPPMTELTYDYGKSCVGEAEADGGSTPRG--RRKCLCGAPRCRGYF 652

>ref|XP_002336307.1| SET domain protein [Populus trichocarpa] gi|222834460|gb|EEE72937.1| SET domain protein [Populus trichocarpa]
          Length = 669

 Score = 676 bits (1743), Expect = 0.0
 Identities = 335/532 (62%), Positives = 404/532 (75%), Gaps = 14/532 (2%)
 Frame = +1

Query: 415  DGDVESVGEVRMKFDALRRRFSQLEDAKETSSGN-KRADLRASTALMNNGIRANTRKRIG 591
            DG+ E V ++M+FDALRRR SQLEDAKE+ +G +RADL+A LM +R N RKRIG
Sbjct: 147  DGNREVVHSIQMRFDALRRRLSQLEDAKESPAGIIRRADLKAGNILMTKQVRTNMRKRIG 206

Query: 592  AVPGIEIGDIFFFRIEMCLVGLHAPSMAGIDYMNLKFDREEEPVAVSIVSSGGYEDDVED 771
            VPG+EIGDIFFFR+EMCL+GLHAPSMAGIDYM+++ D EEEP+AVSIVSSG Y+DD ED
Sbjct: 207  TVPGVEIGDIFFFRMEMCLLGLHAPSMAGIDYMSVRNDLEEEPLAVSIVSSGYYDDDAED 266

Query: 772  KDVLIYSGQGGVSKRKDKKEKEAGDQKLERGNLALERSLHRGNEVRVIRGMRDLVNVTGK 951
            KDVLIYSGQGG + K+K A DQKLERGNLALERSL RGNEVRVIRGM+D VN K
Sbjct: 267  KDVLIYSGQGGAAN----KDKGATDQKLERGNLALERSLRRGNEVRVIRGMKDSVNQASK 322

Query: 952  IYVYDGLYRVHESWTEKGKSGCNVFKYKLLRIPGQPEAFVIWKSVQQWRDNVSSRPGLIL 1131
            +YVYDGL+R+ ESW EK KSGCN+FKYKL+RIPGQP+AF +WKS+++WR+ +SSR GLIL
Sbjct: 323  VYVYDGLFRIQESWVEKAKSGCNIFKYKLVRIPGQPDAFGVWKSIEKWREGLSSRAGLIL 382

Query: 1132 PDLTSGAEKLPVSLVNDVDYEKGPAHFTYSRTLKYSKPIKSMGPSVGCSCRSTCLPGDPH 1311
            PDLTSGAE +PV+LVNDVD EKGPA+FTY T+KYSK K P+ GC+CR+ C PG+ +
Sbjct: 383  PDLTSGAESVPVALVNDVDEEKGPAYFTYVSTVKYSKSFKLTQPAYGCNCRNACQPGNLN 442

Query: 1312 CSCVMKNGGSLPHTANGVLVVRKHVIHECGPLCKCYPNCRNQVSQNGLRVRLEVFKTNDR 1491
            CSC+ KN G+ P+TANGVLV R +IHECGP C C+PNC+N+ SQ GL+ RLEVFKT DR
Sbjct: 443  CSCIRKNEGNFPYTANGVLVCRAPMIHECGPTCPCFPNCKNRASQTGLKARLEVFKTKDR 502

Query: 1492 GWGLRSWDPIRAGTFICEYAXXXXXXXXXXXXXXXXXXNEYVFDSTCDIDDSFEWNYAPE 1671
            GWGLRSWD RAGTFICEYA + YVFD T + +SF+WNY P
Sbjct: 503  GWGLRSWDSFRAGTFICEYA---GEVIEKVSQVGEGEGDGYVFD-TSHVYESFKWNYEPG 558

Query: 1672 LLDE----ERHEARGPQLSLVISAKHSGNVARFMNHSCSPNVFWQPVLYDHSDESFPHIM 1839
            L++E E E LVIS+K+ GNVARFMNHSC PNVFWQP++Y++++ESF HI
Sbjct: 559  LVEEDGSIEAIEEPNVPSPLVISSKNVGNVARFMNHSCYPNVFWQPIMYENNNESFIHIA 618

Query: 1840 FYAMKHIPPMTELTYDYGLC---------GTYFRNMQRRVCLCGSYKCRGFF 1968
            F+AM+HIPPMTELT+DYG G+ R RR CLCG+ CRG+F
Sbjct: 619  FFAMRHIPPMTELTFDYGKSCSGEAAADGGSTSRG--RRKCLCGAPICRGYF 668