BLASTX nr result

ID: Coptis21_contig00019939 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00019939
         (1800 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ACH87183.1| unknown protein [Camellia sinensis]                    273   1e-70
ref|XP_002335272.1| predicted protein [Populus trichocarpa] gi|2...   270   7e-70
ref|XP_002532192.1| conserved hypothetical protein [Ricinus comm...   267   8e-69
ref|XP_002316037.1| predicted protein [Populus trichocarpa] gi|2...   265   2e-68
ref|NP_190585.2| uncharacterized protein [Arabidopsis thaliana] ...   265   3e-68

>gb|ACH87183.1| unknown protein [Camellia sinensis]
          Length = 417

 Score =  273 bits (697), Expect = 1e-70
 Identities = 163/407 (40%), Positives = 233/407 (57%), Gaps = 11/407 (2%)
 Frame = -3

Query: 1396 DELVSSIKQKLENVPRLPSECCIYRVHDSLRAVNERAYVPQLVSIGPYHHGKENLKAMEK 1217
            D +  +I + LE     P+E  I+R+H+ LR +N++AY P+++SIGPYH GK+NL+ ME+
Sbjct: 8    DFVSMNIYEMLEVFSHPPTEFFIFRLHEELRQLNDKAYDPEIISIGPYHRGKQNLQMMER 67

Query: 1216 HKWLYLKSFMHRSPVSLEDCVQAIRDLEQKARRCYAETISLSSNVFVEMLLVDGCFLIEL 1037
            HK  Y  S +    +S ED V AI  LE  A   YAE ISL S+  ++M+++DGCF+IEL
Sbjct: 68   HKLRYFHSLLQEKNLSPEDFVYAIGSLELHACDFYAEPISLDSDEMIKMMVLDGCFIIEL 127

Query: 1036 FLKYRSNDWREKEDPIFNTSWMLTSLNRDLMLLENQIPFSVLECLFILNQVDPYKDPFHY 857
              K+     R++ DPIF   W+   L RDLML ENQIPF VL  LF + +         Y
Sbjct: 128  LRKFDMEFLRDENDPIFKRDWIFNRLQRDLMLFENQIPFFVLCKLFDMIEAPGNHKRLIY 187

Query: 856  LTESALGFFSWMMPTTPQ----MXXXXXXXXXXXXXXXXXXXSNTALTAIED-SNRHAVN 692
            L   AL FFS ++P T +                        S   +  +ED SN+    
Sbjct: 188  L---ALRFFSDLLPGTGKREDGKESQGKISHLLGLIHSNWHPSFAGVEPVEDASNKKGKG 244

Query: 691  PVKRIPCAMTLRQFGIKFKKGE--DDNLCSMKFSNGVLEIPPLTIQDLTEPFFRNLIAFE 518
              + IP    L++ G+K +K E    +L  ++F NGV++IPPLTI+  TE FFRNLIA+E
Sbjct: 245  NWRFIPSTRELQESGVKIEKFEVTGGSLFDIEFKNGVMQIPPLTIEGRTESFFRNLIAYE 304

Query: 517  QCHKDCTNQFT---SYASLMDSLISNTDDVALLHHHGIIDSWLGNYEDVSLLFNKLCAEV 347
            Q   D  NQF+    Y   +D LI +  DV +L   GIID+WLG+ E VS LFNK+   V
Sbjct: 305  QYSPD--NQFSYVADYVKFLDFLIDSPKDVKILSRRGIIDNWLGDDEAVSNLFNKISDTV 362

Query: 346  N-LEHHYHFFNLSEQVNAYCKNRWHEWVAALKRDYFNNPWAILSFVA 209
            +    H+ + ++  +VN +C   W+ + A L R+YFNNPWA+++ +A
Sbjct: 363  SGTSMHFRYADIFNRVNIHCSQPWNLYRATLNRNYFNNPWAMIAIMA 409


>ref|XP_002335272.1| predicted protein [Populus trichocarpa] gi|222833186|gb|EEE71663.1|
            predicted protein [Populus trichocarpa]
          Length = 458

 Score =  270 bits (691), Expect = 7e-70
 Identities = 157/424 (37%), Positives = 229/424 (54%), Gaps = 11/424 (2%)
 Frame = -3

Query: 1444 QNMEEGQGH-SSIEIIDDELVSSIKQKLENVPRL------PSECCIYRVHDSLRAVNERA 1286
            +N E    H +SI   D + + S+++K + +P+L       S CCI+RV  SL  +N++A
Sbjct: 14   ENAETSNDHVTSITEADRQWLKSVEKKTKLLPKLLKKSAGKSSCCIFRVPQSLVEINKKA 73

Query: 1285 YVPQLVSIGPYHHGKENLKAMEKHKWLYLKSFMHRSP---VSLEDCVQAIRDLEQKARRC 1115
            Y P +VSIGPYHHGK +LK +E+HKW +L   + R+    + + D  +AI  +E+K R C
Sbjct: 74   YHPHIVSIGPYHHGKVHLKMIEEHKWRFLGGVLARTQQHGIGINDFFKAIAPIEEKIRDC 133

Query: 1114 YAETISLSSNVFVEMLLVDGCFLIELFLKYRSNDWREKEDPIFNTSWMLTSLNRDLMLLE 935
            Y+ETI  S   F+EM+++DGCF+IELF        R+ +DPIFN + M   + RDL+ LE
Sbjct: 134  YSETIECSRQEFIEMMVLDGCFIIELFCIIGGIVQRDIDDPIFNMTLMFFFIMRDLLKLE 193

Query: 934  NQIPFSVLECLFILNQVDPYKDPFHYLTESALGFFSWMMPTTPQMXXXXXXXXXXXXXXX 755
            NQIPF VLE LF  + +   K     LTE AL FF       P++               
Sbjct: 194  NQIPFFVLETLFETSILSSRKQIVSSLTELALKFFDHAAERPPEV---LRRYKDIRGEHL 250

Query: 754  XXXXSNTALTAIEDSNRHAVNPVKRIPCAMTLRQFGIKFKKGEDDNLCSMKFSNGVLEIP 575
                 +T + A        ++P + I  A  L Q GIKFK  E D+   ++FSNGVLEIP
Sbjct: 251  LDLFRSTIIPASSQEVHRKISPFQLIHSAKKLHQAGIKFKPRETDSFLDIEFSNGVLEIP 310

Query: 574  PLTIQDLTEPFFRNLIAFEQCHKDCTNQFTSYASLMDSLISNTDDVALLHHHGIIDSWLG 395
             LTI D       N +AFEQC+  C+N  TSY + M  LI+   D   L  + I+++  G
Sbjct: 311  LLTIDDFITSVILNCVAFEQCYNHCSNHITSYVTFMGCLINAPSDAGFLCDYKIVENSFG 370

Query: 394  NYEDVSLLFNKLCAEVNLEHHYHFFN-LSEQVNAYCKNRWHEWVAALKRDYFNNPWAILS 218
              E+V+  FN +  +V  +  + + + + E VN +  N W    A  K  YF+  W+ +S
Sbjct: 371  ADEEVASFFNNIGKDVTFDTQWSYLSKVFEDVNEHYSNNWQVRWAEFKHTYFDTTWSFIS 430

Query: 217  FVAA 206
             +AA
Sbjct: 431  ALAA 434


>ref|XP_002532192.1| conserved hypothetical protein [Ricinus communis]
            gi|223528124|gb|EEF30195.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 461

 Score =  267 bits (682), Expect = 8e-69
 Identities = 157/427 (36%), Positives = 231/427 (54%), Gaps = 15/427 (3%)
 Frame = -3

Query: 1441 NMEEGQGHSSIEIIDDEL---VSSIKQKLENVPRL------PSECCIYRVHDSLRAVNER 1289
            N E G+    +  I  E    ++S++ K+  +P+L       S CCI+RV  SL  +NE+
Sbjct: 12   NHENGKTSDHVIAISTEASEWLASLEDKISKMPKLLNKTAGKSSCCIFRVPKSLAQINEK 71

Query: 1288 AYVPQLVSIGPYHHGKENLKAMEKHKWLYLKSFMHRS---PVSLEDCVQAIRDLEQKARR 1118
            AY P +VSIGPYHHG +  + +E+HKW +L + + R+    +SL+D  +AI   E++ R 
Sbjct: 72   AYQPHIVSIGPYHHGNDQFQMIEEHKWRFLGAVLTRTKAKDISLDDFFKAIAPKEEEIRE 131

Query: 1117 CYAETISLSSNVFVEMLLVDGCFLIELFLKYRSNDWREKEDPIFNTSWMLTSLNRDLMLL 938
            CY+E I  SS+  +EM+++DGCF+IEL            +DPIFN +WM + L RDL+ L
Sbjct: 132  CYSENIGYSSHQLIEMMILDGCFIIELLCIVARLVQTHLDDPIFNMAWMFSFLMRDLLRL 191

Query: 937  ENQIPFSVLECLFILNQVDPYKDPFHYLTESALGFFSWMMPTTPQMXXXXXXXXXXXXXX 758
            ENQIPF VLE LF L  V    D    LTE AL FF + +    +               
Sbjct: 192  ENQIPFFVLESLFGLVSV-TLDDSSRSLTELALEFFDYAVERPAEF---LDRFKDKKGKH 247

Query: 757  XXXXXSNTALTAIEDS-NRHAVNP-VKRIPCAMTLRQFGIKFKKGEDDNLCSMKFSNGVL 584
                  +T +   +   N  A +P ++ I  A  LR  GIKFK    D+   + FS+GVL
Sbjct: 248  LLDLFRSTFIPPPQGKPNEDAYSPFLQLIQSAKKLRLSGIKFKPKRTDSFLDISFSHGVL 307

Query: 583  EIPPLTIQDLTEPFFRNLIAFEQCHKDCTNQFTSYASLMDSLISNTDDVALLHHHGIIDS 404
             IPPLT+ D T  F  N +AFEQC+K C+   TSY + M  LI+   D   L  H II++
Sbjct: 308  RIPPLTVDDFTSSFLLNCVAFEQCYKHCSKHITSYVTFMGCLINTPADAGYLSDHRIIEN 367

Query: 403  WLGNYEDVSLLFNKLCAEVNLE-HHYHFFNLSEQVNAYCKNRWHEWVAALKRDYFNNPWA 227
            + G  ++V+  FN +  ++  +    +   L + VN Y +N +H   A  K  YF++PW+
Sbjct: 368  YFGTDDEVAKFFNDVGKDITFDIQRSYLSKLFKDVNKYYRNNFHIKWAGFKHTYFDSPWS 427

Query: 226  ILSFVAA 206
             +S +AA
Sbjct: 428  FMSAMAA 434


>ref|XP_002316037.1| predicted protein [Populus trichocarpa] gi|222865077|gb|EEF02208.1|
            predicted protein [Populus trichocarpa]
          Length = 378

 Score =  265 bits (678), Expect = 2e-68
 Identities = 151/371 (40%), Positives = 204/371 (54%), Gaps = 5/371 (1%)
 Frame = -3

Query: 1330 IYRVHDSLRAVNERAYVPQLVSIGPYHHGKENLKAMEKHKWLYLKSFMHRSP---VSLED 1160
            I RV +++R  NE+AY+P  VSIGPYHHGK+ L+ ME+HKW Y+ + + R P    SL+D
Sbjct: 10   ICRVKENIRNANEKAYIPDKVSIGPYHHGKQGLETMEEHKWRYMDALLSRKPDLEASLDD 69

Query: 1159 CVQAIRDLEQKARRCYAETISLSSNVFVEMLLVDGCFLIELFLKYRSNDWREKEDPIFNT 980
            C+ A+R++E +AR CY E I+++ + F++M+LVDGCF+IELFLKY     R + DP+F T
Sbjct: 70   CLTALREVEHRARACYEEEINVTDDEFLQMMLVDGCFIIELFLKYSIKSLRRRNDPVFTT 129

Query: 979  SWMLTSLNRDLMLLENQIPFSVLECLFILNQVDPYKDPFHYLTESALGFFSWMMPTTPQM 800
              ML  L  +LMLLENQIP  +L+ LF    V   K   H L   A  FF +M+P  PQ+
Sbjct: 130  PGMLFDLRSNLMLLENQIPLFILQRLF--EVVPTPKQCTHSLATLAFHFFKYMIPGDPQI 187

Query: 799  XXXXXXXXXXXXXXXXXXXSNTALTAIEDSNRHAVNPVKRIPCAMTLRQFGIKFKKGEDD 620
                                      +  +     +  K   CA  L+  GI+ K+    
Sbjct: 188  HQQKFNQEGNHILDLICHCLLPRYPRVPGTK----SDQKHFRCATELQAAGIRIKRARTK 243

Query: 619  NLCSMKFSNGVLEIPPLTIQDLTEPFFRNLIAFEQCHKDCTNQFTSYASLMDSLISNTDD 440
            NL  +KF +GVLEIP + I   TE  F+NLIA E C  D     TSY  LM SLI + +D
Sbjct: 244  NLLDIKFVSGVLEIPNVLIHQYTESLFKNLIALEHCSGDSVQHITSYVFLMKSLIGSDED 303

Query: 439  VALLHHHGIIDSWLGNYEDVSLLFNKLCAEVNLEHHYHFFNLSEQVNAYCKNR--WHEWV 266
            V LL    I+ ++  N ++V+ LF K C EVNL   Y +  L EQV  +   R  WH   
Sbjct: 304  VKLLKKKDILTNYDVNEKEVAKLFEKSCEEVNLNESY-YDGLFEQVKGHKSTRKTWHLRS 362

Query: 265  AALKRDYFNNP 233
               KR Y  NP
Sbjct: 363  EEFKRSYRRNP 373


>ref|NP_190585.2| uncharacterized protein [Arabidopsis thaliana]
            gi|71905501|gb|AAZ52728.1| hypothetical protein At3g50160
            [Arabidopsis thaliana] gi|93007370|gb|ABE97188.1|
            hypothetical protein At3g50160 [Arabidopsis thaliana]
            gi|332645112|gb|AEE78633.1| uncharacterized protein
            [Arabidopsis thaliana]
          Length = 503

 Score =  265 bits (677), Expect = 3e-68
 Identities = 157/440 (35%), Positives = 244/440 (55%), Gaps = 5/440 (1%)
 Frame = -3

Query: 1510 YTSIQQRLVIREMVESNDSVWLQNMEEGQGHSSIEIIDDELVSSIKQKLENVPRLPSECC 1331
            Y  ++   VI E         + ++E+       EI    L   +K   +N        C
Sbjct: 46   YGELENIEVIEEKPRETQVESVVSIEDKNEQKLREIWVISLNDKMKTLGDNATTSWDNLC 105

Query: 1330 IYRVHDSLRAVNERAYVPQLVSIGPYHHGKENLKAMEKHKWLYLKSFMHRSPVSLEDCVQ 1151
            IYRV   L+  + ++Y+PQ+VSIGPYHHG ++L  ME+HKW  +   M R+   +E  + 
Sbjct: 106  IYRVPPYLQENDTKSYMPQIVSIGPYHHGHKHLMPMERHKWRAVNMVMARAKHDIEMYID 165

Query: 1150 AIRDLEQKARRCYAETISLSSNVFVEMLLVDGCFLIELFLKYRSNDWRE----KEDPIFN 983
            A+++LE+KAR CY   I+++ N F+EML++DG F+IE+F K  S  ++E      DP+F 
Sbjct: 166  AMKELEEKARACYQGPINMNRNEFIEMLVLDGVFIIEIF-KGTSEGFQEIGYAPNDPVFG 224

Query: 982  TSWMLTSLNRDLMLLENQIPFSVLECLFILNQVDPYKDPFHYLTESALGFFSWMMPTTPQ 803
               ++ S+ RD+++LENQ+P+SVL+ L  L + D        L +    FF  ++PT   
Sbjct: 225  MRGLMQSIRRDMVMLENQLPWSVLKGLLQLQRPDVLDKVNVQLFQP---FFQPLLPTREV 281

Query: 802  MXXXXXXXXXXXXXXXXXXXSNTALTAIEDSNRHAVNPVKRIPCAMTLRQFGIKFKKGED 623
            +                     ++ T+ ED +     P + I C   LR  G++F + E 
Sbjct: 282  LTEEGGLHCLDVLRRGLL---QSSGTSDEDMSMVNKQPQQLIHCVTELRNAGVEFMRKET 338

Query: 622  DNLCSMKFSNGVLEIPPLTIQDLTEPFFRNLIAFEQCHKDCTNQFTSYASLMDSLISNTD 443
             +   ++F NG L+IP L I D T+  F NLIAFEQCH   + + TSY   MD+LI++++
Sbjct: 339  GHFWDIEFKNGYLKIPKLLIHDGTKSLFLNLIAFEQCHIKSSKKITSYIIFMDNLINSSE 398

Query: 442  DVALLHHHGIIDSWLGNYEDVSLLFNKLCAEVNLEHHYHFFN-LSEQVNAYCKNRWHEWV 266
            DV+ LHH+GII++WLG+  +VS LFN L  EV  + +  + + L+ +VN Y + +W+   
Sbjct: 399  DVSYLHHYGIIENWLGSDSEVSDLFNGLGKEVIFDPNDGYLSALTGEVNIYYRRKWNYLK 458

Query: 265  AALKRDYFNNPWAILSFVAA 206
            A L+  YFNNPWA  SF+AA
Sbjct: 459  ATLRHKYFNNPWAYFSFIAA 478


Top