BLASTX nr result
ID: Atractylodes22_contig00004553
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Atractylodes22_contig00004553 (1789 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|ACY78690.1| trans-cinnamate 4-monooxygenase [Cynara carduncul... 865 0.0 gb|ADO16181.1| cytochrome P450 mono-oxygenase [Artemisia annua] 862 0.0 gb|ACJ37399.1| trans-cinnamate 4-monooxygenase [Echinacea angust... 850 0.0 gb|ACF74449.1| trans-cinnamate 4-monooxygenase [Echinacea angust... 850 0.0 sp|Q04468.1|TCMO_HELTU RecName: Full=Trans-cinnamate 4-monooxyge... 849 0.0 >gb|ACY78690.1| trans-cinnamate 4-monooxygenase [Cynara cardunculus var. scolymus] Length = 505 Score = 865 bits (2235), Expect = 0.0 Identities = 435/491 (88%), Positives = 441/491 (89%) Frame = -1 Query: 1675 FVAILGAIFISKLRGKRFKLPPGPIPVPIFGNWLQVGDDLNHRNLTDLAKKFGQIFLLRM 1496 FVAILGAIFISKLRGKRFKLPPGP PVPIFGNWLQVGDDLNHRNLTDLAKKFGQIFLLRM Sbjct: 14 FVAILGAIFISKLRGKRFKLPPGPFPVPIFGNWLQVGDDLNHRNLTDLAKKFGQIFLLRM 73 Query: 1495 GQRNLVVVSSPDLAKEVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFTVYGEHWRKMRRIM 1316 GQRNLVVVSSPDLAKEVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFTVYGEHWRKMRRIM Sbjct: 74 GQRNLVVVSSPDLAKEVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFTVYGEHWRKMRRIM 133 Query: 1315 TVPFFTNKVVQQYRFGWXXXXXXXXXXXXXXXXXATEGIVIRRRLQLMMYNNMFRIMFDR 1136 TVPFFTNKVVQQYRFG ATEGIVIRRRLQLMMYNNMFRIMFDR Sbjct: 134 TVPFFTNKVVQQYRFGREAEAAAVVHDVKKNPAAATEGIVIRRRLQLMMYNNMFRIMFDR 193 Query: 1135 RFESEDDPLFLKLKALNGERSRLAQSFDYNYGDFIPILRPFLKGYLKICKEVKEKRLQLF 956 RFESEDDPLFLKLKALNGERSRLAQSFDYNYGDFIPILRPFLKGYLK+CKEVKEKRLQLF Sbjct: 194 RFESEDDPLFLKLKALNGERSRLAQSFDYNYGDFIPILRPFLKGYLKMCKEVKEKRLQLF 253 Query: 955 
KDYFVDERKKLGSTKSMDNNQLKCAIDHILEAQDKGEINEDNVLYIVENINVAAIETTLW 776 KDYFVDERKK+GSTKSMDNNQ+KCAIDHILEAQDKGEINEDNVLYIVENINVAAIETTLW Sbjct: 254 KDYFVDERKKIGSTKSMDNNQIKCAIDHILEAQDKGEINEDNVLYIVENINVAAIETTLW 313 Query: 775 SIEWGIAELVNHPDIQAKLRHELDTKLGPGVQVTEPDIQKLPYLQAVIKETLRLRMAIPL 596 SIEWGIAELVNHP+IQAKLRHELDTKLGPGVQVTEPDIQKLPYLQAV+KETLRLRMAIPL Sbjct: 314 SIEWGIAELVNHPEIQAKLRHELDTKLGPGVQVTEPDIQKLPYLQAVVKETLRLRMAIPL 373 Query: 595 LVPHMNLHDAKLGSYDIPAESKILVNAWWLANNPEQWKKXXXXXXXXXXXXESKVEANGN 416 LVPHMNLHDAKLGSYDIPAESKILVNAWWLANNPEQWKK ES VEANGN Sbjct: 374 LVPHMNLHDAKLGSYDIPAESKILVNAWWLANNPEQWKKPEEFRPERFFEEESHVEANGN 433 Query: 415 DFRYLPFGVGRRSCPXXXXXXXXXXXXXGRLVQNFELLPPPGESKVDTSEKGGQFSLHIL 236 DFRYLPFGVGRRSCP GRLVQNFELLPPPG SK+D EKGGQFSLHI Sbjct: 434 DFRYLPFGVGRRSCPGIILALPILGITIGRLVQNFELLPPPGMSKIDVKEKGGQFSLHIX 493 Query: 235 KHSTIVAKPRA 203 HSTIVAKPRA Sbjct: 494 NHSTIVAKPRA 504 >gb|ADO16181.1| cytochrome P450 mono-oxygenase [Artemisia annua] Length = 505 Score = 862 bits (2227), Expect = 0.0 Identities = 429/490 (87%), Positives = 444/490 (90%) Frame = -1 Query: 1672 VAILGAIFISKLRGKRFKLPPGPIPVPIFGNWLQVGDDLNHRNLTDLAKKFGQIFLLRMG 1493 VAILGAIFISKLRGKRFKLPPGPIPVPIFGNWLQVGDDLNHRNLTDLAKKFG+IFLLRMG Sbjct: 15 VAILGAIFISKLRGKRFKLPPGPIPVPIFGNWLQVGDDLNHRNLTDLAKKFGEIFLLRMG 74 Query: 1492 QRNLVVVSSPDLAKEVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFTVYGEHWRKMRRIMT 1313 QRNLVVVSSPDLAK+VLHTQGVEFGSRTRNVVFDIFTGKGQDMVFTVYGEHWRKMRRIMT Sbjct: 75 QRNLVVVSSPDLAKDVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFTVYGEHWRKMRRIMT 134 Query: 1312 VPFFTNKVVQQYRFGWXXXXXXXXXXXXXXXXXATEGIVIRRRLQLMMYNNMFRIMFDRR 1133 VPFFTNKVVQQYRFGW ATEG V+RRRLQLMMYNNMFRIMFDRR Sbjct: 135 VPFFTNKVVQQYRFGWEAEAAAVVEDVKKNPASATEGTVLRRRLQLMMYNNMFRIMFDRR 194 Query: 1132 FESEDDPLFLKLKALNGERSRLAQSFDYNYGDFIPILRPFLKGYLKICKEVKEKRLQLFK 953 FESEDDPLFLKLKALNGERSRLAQSF+YNYGDFIPILRPFL+GYLK+CKEVK+KRLQLFK Sbjct: 195 FESEDDPLFLKLKALNGERSRLAQSFEYNYGDFIPILRPFLRGYLKLCKEVKDKRLQLFK 254 Query: 952 DYFVDERKKLGSTKSMDNNQLKCAIDHILEAQDKGEINEDNVLYIVENINVAAIETTLWS 773 
DYFVDERKKLGSTKS+DNNQ+KCAIDHILEA+DKGEINEDNVLYIVENINVAAIETTLWS Sbjct: 255 DYFVDERKKLGSTKSLDNNQIKCAIDHILEAKDKGEINEDNVLYIVENINVAAIETTLWS 314 Query: 772 IEWGIAELVNHPDIQAKLRHELDTKLGPGVQVTEPDIQKLPYLQAVIKETLRLRMAIPLL 593 IEWGIAELVNHP+IQAKLRHELDTKLGPGVQVTEPDIQ LPYLQAVIKETLRLRMAIPLL Sbjct: 315 IEWGIAELVNHPEIQAKLRHELDTKLGPGVQVTEPDIQNLPYLQAVIKETLRLRMAIPLL 374 Query: 592 VPHMNLHDAKLGSYDIPAESKILVNAWWLANNPEQWKKXXXXXXXXXXXXESKVEANGND 413 VPHMNLHDAKLG +DIPAESKILVNAWWLANNP+QWKK ESKVEANGND Sbjct: 375 VPHMNLHDAKLGGFDIPAESKILVNAWWLANNPDQWKKPEEFRPERFLEEESKVEANGND 434 Query: 412 FRYLPFGVGRRSCPXXXXXXXXXXXXXGRLVQNFELLPPPGESKVDTSEKGGQFSLHILK 233 FRYLPFGVGRRSCP GRLVQNFELLPPPG SK+DTSEKGGQFSLHILK Sbjct: 435 FRYLPFGVGRRSCPGIILALPILGITIGRLVQNFELLPPPGVSKIDTSEKGGQFSLHILK 494 Query: 232 HSTIVAKPRA 203 HSTIVAKPR+ Sbjct: 495 HSTIVAKPRS 504 >gb|ACJ37399.1| trans-cinnamate 4-monooxygenase [Echinacea angustifolia] Length = 505 Score = 850 bits (2197), Expect = 0.0 Identities = 423/490 (86%), Positives = 439/490 (89%) Frame = -1 Query: 1675 FVAILGAIFISKLRGKRFKLPPGPIPVPIFGNWLQVGDDLNHRNLTDLAKKFGQIFLLRM 1496 F AI+ +IFISKLRGKRFKLPPGP+PVPIFGNWLQVGDDLNHRNLTDLAKKFG+I LLRM Sbjct: 14 FAAIIASIFISKLRGKRFKLPPGPLPVPIFGNWLQVGDDLNHRNLTDLAKKFGEILLLRM 73 Query: 1495 GQRNLVVVSSPDLAKEVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFTVYGEHWRKMRRIM 1316 GQRNLVVVSSP+LAKEVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFTVYGEHWRKMRRIM Sbjct: 74 GQRNLVVVSSPNLAKEVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFTVYGEHWRKMRRIM 133 Query: 1315 TVPFFTNKVVQQYRFGWXXXXXXXXXXXXXXXXXATEGIVIRRRLQLMMYNNMFRIMFDR 1136 TVPFFTNKVVQQY GW ATEG+VIRRRLQLMMYNNMFRIMFDR Sbjct: 134 TVPFFTNKVVQQYHSGWEAEAAAVVEDVRKNPKAATEGVVIRRRLQLMMYNNMFRIMFDR 193 Query: 1135 RFESEDDPLFLKLKALNGERSRLAQSFDYNYGDFIPILRPFLKGYLKICKEVKEKRLQLF 956 RFESEDDPLFLKLKALNGERSRLAQSF+YNYGDFIPILRPFLKGYLK+CKEVKEKR QLF Sbjct: 194 RFESEDDPLFLKLKALNGERSRLAQSFEYNYGDFIPILRPFLKGYLKLCKEVKEKRFQLF 253 Query: 955 KDYFVDERKKLGSTKSMDNNQLKCAIDHILEAQDKGEINEDNVLYIVENINVAAIETTLW 776 KDYFVDERKKLGSTKSMDNNQLKCAIDHIL+A+DKGEINEDNVLYIVENINVAAIETTLW Sbjct: 254 
KDYFVDERKKLGSTKSMDNNQLKCAIDHILDAKDKGEINEDNVLYIVENINVAAIETTLW 313 Query: 775 SIEWGIAELVNHPDIQAKLRHELDTKLGPGVQVTEPDIQKLPYLQAVIKETLRLRMAIPL 596 SIEWGIAELVNHP+IQAKLRHELDTKLGPGVQVTEPD+ KLPYLQAVIKETLRLRMAIPL Sbjct: 314 SIEWGIAELVNHPEIQAKLRHELDTKLGPGVQVTEPDLDKLPYLQAVIKETLRLRMAIPL 373 Query: 595 LVPHMNLHDAKLGSYDIPAESKILVNAWWLANNPEQWKKXXXXXXXXXXXXESKVEANGN 416 LVPHMNLHDAKLG YDIPAESKILVNAWWLANNP+QWK+ ESKVEANGN Sbjct: 374 LVPHMNLHDAKLGGYDIPAESKILVNAWWLANNPDQWKEPEVFRPERFLEEESKVEANGN 433 Query: 415 DFRYLPFGVGRRSCPXXXXXXXXXXXXXGRLVQNFELLPPPGESKVDTSEKGGQFSLHIL 236 DFRYLPFGVGRRSCP GRLVQNFELLPPPG+SKV T+EKGGQFSLHIL Sbjct: 434 DFRYLPFGVGRRSCPGIILALPILGITIGRLVQNFELLPPPGQSKVGTTEKGGQFSLHIL 493 Query: 235 KHSTIVAKPR 206 KHSTIVAKPR Sbjct: 494 KHSTIVAKPR 503 >gb|ACF74449.1| trans-cinnamate 4-monooxygenase [Echinacea angustifolia] Length = 505 Score = 850 bits (2197), Expect = 0.0 Identities = 417/491 (84%), Positives = 443/491 (90%) Frame = -1 Query: 1675 FVAILGAIFISKLRGKRFKLPPGPIPVPIFGNWLQVGDDLNHRNLTDLAKKFGQIFLLRM 1496 F AI+GAI +SKLRGK+FKLPPGPIPVPIFGNWLQVGDDLNHRNLTDLAKKFGQIFLLRM Sbjct: 14 FAAIIGAIVVSKLRGKKFKLPPGPIPVPIFGNWLQVGDDLNHRNLTDLAKKFGQIFLLRM 73 Query: 1495 GQRNLVVVSSPDLAKEVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFTVYGEHWRKMRRIM 1316 GQRNLVVVSSP+LAK+VLHTQGVEFGSRTRNVVFDIFTGKGQDMVFTVYGEHWRKMRRIM Sbjct: 74 GQRNLVVVSSPELAKDVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFTVYGEHWRKMRRIM 133 Query: 1315 TVPFFTNKVVQQYRFGWXXXXXXXXXXXXXXXXXATEGIVIRRRLQLMMYNNMFRIMFDR 1136 TVPFFTNKVVQQYR+GW ATEG+VIRRRLQLMMYNNMFRIMFDR Sbjct: 134 TVPFFTNKVVQQYRYGWEAEAAAVVEDVKKNPAAATEGVVIRRRLQLMMYNNMFRIMFDR 193 Query: 1135 RFESEDDPLFLKLKALNGERSRLAQSFDYNYGDFIPILRPFLKGYLKICKEVKEKRLQLF 956 RFESE+DPLFLKLKALNGERSRLAQSF+YNYGDFIPILRPFL+ YLK+CKEVKEKR+QLF Sbjct: 194 RFESEEDPLFLKLKALNGERSRLAQSFEYNYGDFIPILRPFLRNYLKLCKEVKEKRIQLF 253 Query: 955 KDYFVDERKKLGSTKSMDNNQLKCAIDHILEAQDKGEINEDNVLYIVENINVAAIETTLW 776 KDYFVDERKKLGSTK MD+NQLKCAIDHILEA+DKGEINEDNVLYIVENINVAAIETTLW Sbjct: 254 KDYFVDERKKLGSTKKMDDNQLKCAIDHILEAKDKGEINEDNVLYIVENINVAAIETTLW 313 Query: 775 
SIEWGIAELVNHPDIQAKLRHELDTKLGPGVQVTEPDIQKLPYLQAVIKETLRLRMAIPL 596 SIEWGIAELVNHP+IQAKLRHELDTKLGPGVQ+TEPD+Q LPYLQAVIKETLRLRMAIPL Sbjct: 314 SIEWGIAELVNHPEIQAKLRHELDTKLGPGVQITEPDVQNLPYLQAVIKETLRLRMAIPL 373 Query: 595 LVPHMNLHDAKLGSYDIPAESKILVNAWWLANNPEQWKKXXXXXXXXXXXXESKVEANGN 416 LVPHMNLHDAKLG +DIPAESKILVNAWWLANNP+QWKK E+KV+ANGN Sbjct: 374 LVPHMNLHDAKLGGFDIPAESKILVNAWWLANNPDQWKKPEEFRPERFLEEEAKVDANGN 433 Query: 415 DFRYLPFGVGRRSCPXXXXXXXXXXXXXGRLVQNFELLPPPGESKVDTSEKGGQFSLHIL 236 DFRYLPFGVGRRSCP GRLVQNFELLPPPG+SK+DT+EKGGQFSLHIL Sbjct: 434 DFRYLPFGVGRRSCPGIILALPILGITIGRLVQNFELLPPPGQSKIDTAEKGGQFSLHIL 493 Query: 235 KHSTIVAKPRA 203 KHST+VAKPR+ Sbjct: 494 KHSTVVAKPRS 504 >sp|Q04468.1|TCMO_HELTU RecName: Full=Trans-cinnamate 4-monooxygenase; AltName: Full=Cinnamic acid 4-hydroxylase; Short=C4H; Short=CA4H; AltName: Full=Cytochrome P450 73; AltName: Full=Cytochrome P450C4H gi|18859|emb|CAA78982.1| trans-cinnamate 4-monooxygenase [Helianthus tuberosus] Length = 505 Score = 849 bits (2193), Expect = 0.0 Identities = 417/491 (84%), Positives = 441/491 (89%) Frame = -1 Query: 1675 FVAILGAIFISKLRGKRFKLPPGPIPVPIFGNWLQVGDDLNHRNLTDLAKKFGQIFLLRM 1496 F AI+GAI ISKLRGK+FKLPPGPIPVPIFGNWLQVGDDLNHRNLTDLAK+FG+I LLRM Sbjct: 14 FAAIIGAILISKLRGKKFKLPPGPIPVPIFGNWLQVGDDLNHRNLTDLAKRFGEILLLRM 73 Query: 1495 GQRNLVVVSSPDLAKEVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFTVYGEHWRKMRRIM 1316 GQRNLVVVSSP+LAKEVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFTVYGEHWRKMRRIM Sbjct: 74 GQRNLVVVSSPELAKEVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFTVYGEHWRKMRRIM 133 Query: 1315 TVPFFTNKVVQQYRFGWXXXXXXXXXXXXXXXXXATEGIVIRRRLQLMMYNNMFRIMFDR 1136 TVPFFTNKVVQQYR+GW ATEGIVIRRRLQLMMYNNMFRIMFDR Sbjct: 134 TVPFFTNKVVQQYRYGWEAEAAAVVDDVKKNPAAATEGIVIRRRLQLMMYNNMFRIMFDR 193 Query: 1135 RFESEDDPLFLKLKALNGERSRLAQSFDYNYGDFIPILRPFLKGYLKICKEVKEKRLQLF 956 RFESEDDPLFLKLKALNGERSRLAQSF+YNYGDFIPILRPFL+ YLK+CKEVK+KR+QLF Sbjct: 194 RFESEDDPLFLKLKALNGERSRLAQSFEYNYGDFIPILRPFLRNYLKLCKEVKDKRIQLF 253 Query: 955 KDYFVDERKKLGSTKSMDNNQLKCAIDHILEAQDKGEINEDNVLYIVENINVAAIETTLW 776 
KDYFVDERKK+GSTK MDNNQLKCAIDHILEA++KGEINEDNVLYIVENINVAAIETTLW Sbjct: 254 KDYFVDERKKIGSTKKMDNNQLKCAIDHILEAKEKGEINEDNVLYIVENINVAAIETTLW 313 Query: 775 SIEWGIAELVNHPDIQAKLRHELDTKLGPGVQVTEPDIQKLPYLQAVIKETLRLRMAIPL 596 SIEWGIAELVNHP+IQAKLRHELDTKLGPGVQ+TEPD+Q LPYLQAV+KETLRLRMAIPL Sbjct: 314 SIEWGIAELVNHPEIQAKLRHELDTKLGPGVQITEPDVQNLPYLQAVVKETLRLRMAIPL 373 Query: 595 LVPHMNLHDAKLGSYDIPAESKILVNAWWLANNPEQWKKXXXXXXXXXXXXESKVEANGN 416 LVPHMNLHDAKLG +DIPAESKILVNAWWLANNP+QWKK E+KVEANGN Sbjct: 374 LVPHMNLHDAKLGGFDIPAESKILVNAWWLANNPDQWKKPEEFRPERFLEEEAKVEANGN 433 Query: 415 DFRYLPFGVGRRSCPXXXXXXXXXXXXXGRLVQNFELLPPPGESKVDTSEKGGQFSLHIL 236 DFRYLPFGVGRRSCP GRLVQNFELLPPPG+SK+DT EKGGQFSLHIL Sbjct: 434 DFRYLPFGVGRRSCPGIILALPILGITIGRLVQNFELLPPPGQSKIDTDEKGGQFSLHIL 493 Query: 235 KHSTIVAKPRA 203 KHSTIVAKPR+ Sbjct: 494 KHSTIVAKPRS 504
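
The figures in the report above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the BLAST output: the names `parse_stats`, `floor_percent`, `percentages_consistent`, and `aligned_residues` are hypothetical helpers, and the rule that percentages are rounded down is an assumption that merely agrees with every Identities/Positives pair shown in this report. The residue count relies on the query being translated (Frame = -1), so three query nucleotides correspond to one aligned residue.

```python
import re

# Pattern for the per-hit statistics line of a legacy BLAST text report.
STATS_RE = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), "
    r"Positives = (\d+)/(\d+) \((\d+)%\)"
)

# Statistics lines for the first two hits, copied from the report above.
SAMPLE = (
    "Identities = 435/491 (88%), Positives = 441/491 (89%) "
    "Identities = 429/490 (87%), Positives = 444/490 (90%)"
)


def parse_stats(text):
    """Return one dict per Identities/Positives line found in `text`."""
    hits = []
    for m in STATS_RE.finditer(text):
        ident, length, ident_pct, pos, _pos_len, pos_pct = map(int, m.groups())
        hits.append({
            "identities": ident,
            "positives": pos,
            "length": length,
            "identity_pct": ident_pct,
            "positive_pct": pos_pct,
        })
    return hits


def floor_percent(count, length):
    """Integer percentage, rounded down (assumed rounding rule)."""
    return (100 * count) // length


def percentages_consistent(hit):
    """True if the printed percentages match the printed counts."""
    return (
        floor_percent(hit["identities"], hit["length"]) == hit["identity_pct"]
        and floor_percent(hit["positives"], hit["length"]) == hit["positive_pct"]
    )


def aligned_residues(q_start, q_end):
    """Residues covered by a translated query span (inclusive nucleotide coordinates)."""
    return (abs(q_start - q_end) + 1) // 3


for hit in parse_stats(SAMPLE):
    print(hit, percentages_consistent(hit))

# Query span of the first hit: nucleotides 1675 down to 203.
print(aligned_residues(1675, 203))
```

For production parsing of BLAST reports, an established parser is likely a better choice than a hand-written regular expression; the sketch only shows how the counts, percentages, and query coordinates relate to one another.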