BLASTX nr result

ID: Coptis25_contig00033694 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00033694
         (953 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002324212.1| predicted protein [Populus trichocarpa] gi|2...   202   1e-49
ref|XP_002516521.1| hypothetical protein RCOM_0800710 [Ricinus c...   198   1e-48
ref|NP_001235323.1| uncharacterized protein LOC100527896 [Glycin...   197   3e-48
ref|NP_180148.1| polyketide cyclase/dehydrase domain protein [Ar...   196   5e-48
ref|XP_002878882.1| hypothetical protein ARALYDRAFT_901237 [Arab...   193   4e-47

>ref|XP_002324212.1| predicted protein [Populus trichocarpa] gi|222865646|gb|EEF02777.1|
           predicted protein [Populus trichocarpa]
          Length = 175

 Score =  202 bits (514), Expect = 1e-49
 Identities = 94/167 (56%), Positives = 126/167 (75%), Gaps = 6/167 (3%)
 Frame = +2

Query: 278 MEQEQQPPKWEGNVHAKLEGPKADQVWPLLEDYFNFHKWFPTLNICYGIEGANGEPGCIR 457
           MEQ+ QP KWEG V  +L    ADQ+WPLL D+FN HKWFP+L  CYGI G NGEPGCIR
Sbjct: 1   MEQDPQP-KWEGKVSERLPKATADQIWPLLNDFFNLHKWFPSLATCYGIHGTNGEPGCIR 59

Query: 458 YCAGSSIPS--NGSDGSN--VNWSTEKLVAIDPVKMVLTYEIIDGNVGFESYVSTIRLLP 625
           +C GSSIPS    +DG +  V+WS+E+L  +D V+  L+YEI+D N+GF+SYVST++++P
Sbjct: 60  HCEGSSIPSTDTNTDGHSQPVSWSSERLTVVDHVERSLSYEIVDSNIGFKSYVSTVKVVP 119

Query: 626 RDED--DGCVIEWSFAVNPVKGWKLENLIEKYHKGLERMARRIEESI 760
           + +D  DGCVIEWSF V+PV G  L+ L+ KY  GL++MA R+E+++
Sbjct: 120 QGDDGQDGCVIEWSFNVDPVAGLVLDELVRKYKVGLQQMAERLEDAV 166


>ref|XP_002516521.1| hypothetical protein RCOM_0800710 [Ricinus communis]
           gi|223544341|gb|EEF45862.1| hypothetical protein
           RCOM_0800710 [Ricinus communis]
          Length = 343

 Score =  198 bits (504), Expect = 1e-48
 Identities = 89/163 (54%), Positives = 120/163 (73%), Gaps = 4/163 (2%)
 Frame = +2

Query: 281 EQEQQPPKWEGNVHAKLEGPKADQVWPLLEDYFNFHKWFPTLNICYGIEGANGEPGCIRY 460
           + +QQ  KWEG V   L   KA+Q+WPL  D+FN HKW PTL  CYGI G NGE GC+RY
Sbjct: 5   QTQQQQQKWEGKVSTGLPKAKAEQIWPLFTDFFNIHKWLPTLRTCYGICGTNGERGCVRY 64

Query: 461 CAGSSIPSNGSDGSNVN----WSTEKLVAIDPVKMVLTYEIIDGNVGFESYVSTIRLLPR 628
           CAG SIP   +D S++N    WS E+LVA+D V+  LTYEI+D N+GF+SYVST++++P 
Sbjct: 65  CAGFSIPPEVTDKSHLNHNSSWSKERLVAVDHVERCLTYEIVDSNIGFKSYVSTVKIVPA 124

Query: 629 DEDDGCVIEWSFAVNPVKGWKLENLIEKYHKGLERMARRIEES 757
              +GCVIEWSF V+PVKG+ L++LI+KY + L+ + +R+E+S
Sbjct: 125 GVGNGCVIEWSFQVDPVKGYVLDDLIKKYERALQVIGKRMEDS 167



 Score =  159 bits (402), Expect = 9e-37
 Identities = 75/173 (43%), Positives = 110/173 (63%), Gaps = 5/173 (2%)
 Frame = +2

Query: 263 ITKEMMEQEQQPPKWEGNVHAKLEGPKADQVWPLLEDYFNFHKWFPTLNICYGIEGANGE 442
           ++K ++  E    KW+G    +L+G  ADQVWP + D+ N HKWFP L+ CY +EG  G+
Sbjct: 170 VSKPLLLTEISERKWDGKATVELKGLTADQVWPFVADFCNLHKWFPNLDTCYQVEGQLGQ 229

Query: 443 PGCIRYCAGSSIPSNGSDGS---NVNWSTEKLVAIDPVKMVLTYEIIDGNVGFESYVSTI 613
           PG +RYCA  S+P   SDGS     +W  EKLV I+P +  L+YE++D ++GFESY +T 
Sbjct: 230 PGLVRYCA--SVPQPSSDGSGETTFSWVKEKLVMINPDERCLSYEVVDSSMGFESYAATF 287

Query: 614 RLLP--RDEDDGCVIEWSFAVNPVKGWKLENLIEKYHKGLERMARRIEESIMS 766
           RLL    D   GC IEWSF  +PV+ W  ++ +   +  L+ MA++IE+++ S
Sbjct: 288 RLLQVNGDAQHGCKIEWSFVSDPVEAWSFQDFVTYANSCLQFMAKKIEDAVSS 340


>ref|NP_001235323.1| uncharacterized protein LOC100527896 [Glycine max]
           gi|255633494|gb|ACU17105.1| unknown [Glycine max]
          Length = 166

 Score =  197 bits (501), Expect = 3e-48
 Identities = 88/155 (56%), Positives = 118/155 (76%)
 Frame = +2

Query: 302 KWEGNVHAKLEGPKADQVWPLLEDYFNFHKWFPTLNICYGIEGANGEPGCIRYCAGSSIP 481
           +WEG V AKL     +Q WPL++D+FN HK FP+L  CYG+ G+NGEPGCIR+CAGSSIP
Sbjct: 8   RWEGKVSAKLRNTTKEQAWPLVKDFFNLHKRFPSLATCYGVHGSNGEPGCIRFCAGSSIP 67

Query: 482 SNGSDGSNVNWSTEKLVAIDPVKMVLTYEIIDGNVGFESYVSTIRLLPRDEDDGCVIEWS 661
           S+   GS V+WS E+LVA+  V + L YE +D N+GF SY ST+R+L  D+ +GC++EWS
Sbjct: 68  SSNGSGS-VSWSKERLVAVHDVDLSLKYETVDNNIGFRSYESTMRVLSDDDSNGCLLEWS 126

Query: 662 FAVNPVKGWKLENLIEKYHKGLERMARRIEESIMS 766
           FAV+PVKG  LE+L+ KYH GL+ MA ++E+ I+S
Sbjct: 127 FAVDPVKGLVLEDLVRKYHVGLQLMALKMEDEIVS 161


>ref|NP_180148.1| polyketide cyclase/dehydrase domain protein [Arabidopsis thaliana]
           gi|79323057|ref|NP_001031416.1| polyketide
           cyclase/dehydrase domain protein [Arabidopsis thaliana]
           gi|3643606|gb|AAC42253.1| hypothetical protein
           [Arabidopsis thaliana] gi|50253476|gb|AAT71940.1|
           At2g25770 [Arabidopsis thaliana]
           gi|56381959|gb|AAV85698.1| At2g25770 [Arabidopsis
           thaliana] gi|330252656|gb|AEC07750.1| polyketide
           cyclase/dehydrase domain protein [Arabidopsis thaliana]
           gi|330252657|gb|AEC07751.1| polyketide cyclase/dehydrase
           domain protein [Arabidopsis thaliana]
          Length = 167

 Score =  196 bits (499), Expect = 5e-48
 Identities = 90/162 (55%), Positives = 115/162 (70%)
 Frame = +2

Query: 278 MEQEQQPPKWEGNVHAKLEGPKADQVWPLLEDYFNFHKWFPTLNICYGIEGANGEPGCIR 457
           ME+   P KW   V   L   K D++WPL  D+FN HKW PTL  C+G+ G NGE GCIR
Sbjct: 1   MEKASSPEKWLAKVSVTLTKAKPDEIWPLFTDFFNLHKWLPTLATCHGVHGNNGEQGCIR 60

Query: 458 YCAGSSIPSNGSDGSNVNWSTEKLVAIDPVKMVLTYEIIDGNVGFESYVSTIRLLPRDED 637
           +C+G SI SNG D S   WS EKLVA++PV+ V+ YEI++ N GFESYVST+++LPR E 
Sbjct: 61  FCSGFSIGSNGVD-SAARWSKEKLVAVNPVERVMRYEIVESNTGFESYVSTVKILPRGE- 118

Query: 638 DGCVIEWSFAVNPVKGWKLENLIEKYHKGLERMARRIEESIM 763
           DGCVIEWSF V+PV+G  LENL++KY K LE + + +EE  +
Sbjct: 119 DGCVIEWSFTVDPVRGLSLENLVKKYEKALEIITKNMEEDAL 160


>ref|XP_002878882.1| hypothetical protein ARALYDRAFT_901237 [Arabidopsis lyrata subsp.
           lyrata] gi|297324721|gb|EFH55141.1| hypothetical protein
           ARALYDRAFT_901237 [Arabidopsis lyrata subsp. lyrata]
          Length = 170

 Score =  193 bits (491), Expect = 4e-47
 Identities = 88/166 (53%), Positives = 116/166 (69%), Gaps = 3/166 (1%)
 Frame = +2

Query: 278 MEQEQQPPKWEGNVHAKLEGPKADQVWPLLEDYFNFHKWFPTLNICYGIEGANGEPGCIR 457
           ME+   P KW   V   L   K DQ+W L  D+FN HKW PTL  C+G+ G NGEPGCIR
Sbjct: 1   MEKASSPEKWRAKVSTTLTKAKPDQIWLLFTDFFNLHKWLPTLVTCHGVHGNNGEPGCIR 60

Query: 458 YCAGSSIPSNGSDGSNVNWSTEKLVAIDPVKMVLTYEIIDGNVGFESYVSTIRLLPRDED 637
           +C+ S+I SNG + S   WS EKLVA+DPV+ V+ YEI++ N+GFESYVST+++ PR ED
Sbjct: 61  FCSSSAIRSNGVE-SAAGWSKEKLVAVDPVERVMRYEIVESNIGFESYVSTVKISPRGED 119

Query: 638 ---DGCVIEWSFAVNPVKGWKLENLIEKYHKGLERMARRIEESIMS 766
              DGCVIEWSF V+PV+G  L++L+ KY K LE + + +EE  ++
Sbjct: 120 GDVDGCVIEWSFTVDPVRGLSLDDLVMKYEKALEVITKNMEEEALT 165


Top