BLASTX nr result

ID: Coptis21_contig00012998 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00012998
         (1521 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002310568.1| predicted protein [Populus trichocarpa] gi|2...   366   1e-98
ref|XP_002866887.1| hypothetical protein ARALYDRAFT_490769 [Arab...   360   5e-97
ref|XP_002307092.1| predicted protein [Populus trichocarpa] gi|2...   360   7e-97
ref|NP_195694.1| uncharacterized protein [Arabidopsis thaliana] ...   358   2e-96
ref|XP_002533203.1| conserved hypothetical protein [Ricinus comm...   357   4e-96

>ref|XP_002310568.1| predicted protein [Populus trichocarpa] gi|222853471|gb|EEE91018.1|
            predicted protein [Populus trichocarpa]
          Length = 451

 Score =  366 bits (939), Expect = 1e-98
 Identities = 214/430 (49%), Positives = 263/430 (61%), Gaps = 40/430 (9%)
 Frame = +3

Query: 180  QSQPQGKQLMGRRRMXXXXXXXXXXXXQVQSKKKTNLVSKNQTKLIXXXXXXXXXXXXXX 359
            Q+Q + KQ + RRRM            QV  KK TN+ SKNQTKL               
Sbjct: 38   QTQLEAKQFLVRRRMLEVETNED----QVPKKKSTNMSSKNQTKLAKPSLSTKNQTKLIK 93

Query: 360  XXXXXXXXXXXXXXXXXXXLNATSRSSNST-------------KLNTITT-------TKK 479
                                 AT   SNST             KLN+ +        TKK
Sbjct: 94   TGSLSTKNQTKITKSTNST-KATPTPSNSTSQLKKLNSTSQLKKLNSTSKAAANSAPTKK 152

Query: 480  SIDLPK-----NKT---------------KVIITQTEKKPKSQNNQEAIKIXXXXXXXXX 599
            + DL K     NKT               KV  T+++K+ K+Q      K          
Sbjct: 153  TSDLLKLGPSTNKTTKPASTKQTPSLVDKKVGDTESQKQNKNQKQTNPKKTPQTKKQQ-- 210

Query: 600  XXXXXXXXNWMEIEDDDDFLVSEFRDLPSKFQQTLLPDLERISLTSRAYINQANKEISEG 779
                    +W++ +D+DD LV+EFRDLPSKF QT+LPDLER+S+TS+ Y+ QANK++++G
Sbjct: 211  --------SWLDQDDEDD-LVAEFRDLPSKFHQTILPDLERLSITSKKYLTQANKDLTKG 261

Query: 780  FRPLVGKQYAPTIASIVSCVFMFLPLLLVSLIFNRIKAYFSLQKILIFVQVYLSIYFSIL 959
            F+P+VG +YA TIAS VS  F+ +PLLLVSL+FNRIKAYFS+QKILIF+QVYLSIYF+IL
Sbjct: 262  FKPIVGSKYASTIASTVSFAFILIPLLLVSLVFNRIKAYFSIQKILIFIQVYLSIYFTIL 321

Query: 960  SFASIVTGLEPLKFFYASSQSSYICLQVFQTXXXXXXXXXXXXXXXXVFSTETGLGLKVL 1139
              +++VTGLEPLKFFYA+SQS+Y+CL VFQT                VFS E G+G K+L
Sbjct: 322  CLSALVTGLEPLKFFYATSQSNYVCLMVFQTLGYVLYLLLLLMYLILVFSAECGMGSKLL 381

Query: 1140 GLGQTFVGFSVGLHYYVTVFHRAVLHQPPRTNWKIHGIYATCFLVICLLANAERRKKAYL 1319
            GLGQT VGF++GLHYYV VFHR VLHQPP+TNWKIHGIYATCFLVICL ANAERRKKAYL
Sbjct: 382  GLGQTLVGFAIGLHYYVAVFHRVVLHQPPKTNWKIHGIYATCFLVICLFANAERRKKAYL 441

Query: 1320 EDGGEEGKKS 1349
            E+GGEEGKK+
Sbjct: 442  EEGGEEGKKN 451


>ref|XP_002866887.1| hypothetical protein ARALYDRAFT_490769 [Arabidopsis lyrata subsp.
            lyrata] gi|297312723|gb|EFH43146.1| hypothetical protein
            ARALYDRAFT_490769 [Arabidopsis lyrata subsp. lyrata]
          Length = 449

 Score =  360 bits (924), Expect = 5e-97
 Identities = 189/330 (57%), Positives = 233/330 (70%), Gaps = 19/330 (5%)
 Frame = +3

Query: 417  LNATSRSSNSTKL--------------NTITTTKKSIDL-----PKNKTKVIITQTEKKP 539
            LN+T  SSN+TK               N+ ++ KKS DL     PKNKT      +   P
Sbjct: 125  LNSTKSSSNTTKTSPELKKLNSGTKSTNSTSSIKKSADLSKSSSPKNKTTTKSKLSSPPP 184

Query: 540  KSQNNQEAIKIXXXXXXXXXXXXXXXXXNWMEIEDDDDFLVSEFRDLPSKFQQTLLPDLE 719
              + +  + K                   W++ E+D+DF VSEFRDLP+KFQ++L+PDLE
Sbjct: 185  SEKKSPPSSKPVTKSKQPEKEIKPF----WLDDEEDEDF-VSEFRDLPTKFQRSLIPDLE 239

Query: 720  RISLTSRAYINQANKEISEGFRPLVGKQYAPTIASIVSCVFMFLPLLLVSLIFNRIKAYF 899
            RIS TS+ YIN+ANKEI++ F+P  G +YAPTIAS+VS VF+ +PLLLVSLIFNR KAYF
Sbjct: 240  RISTTSKNYINKANKEITKNFKPYFGNKYAPTIASVVSFVFILVPLLLVSLIFNRFKAYF 299

Query: 900  SLQKILIFVQVYLSIYFSILSFASIVTGLEPLKFFYASSQSSYICLQVFQTXXXXXXXXX 1079
            SLQKILIF+Q+YLSIYFSIL  +S+VTG+EPLKF YA+S S+Y+CLQ+ QT         
Sbjct: 300  SLQKILIFIQIYLSIYFSILCLSSLVTGIEPLKFLYATSSSTYVCLQILQTLGYVFYLLL 359

Query: 1080 XXXXXXXVFSTETGLGLKVLGLGQTFVGFSVGLHYYVTVFHRAVLHQPPRTNWKIHGIYA 1259
                   VFST+ GLGLKVLGL QTFVGF+VGLHYYV VFHR VL QPP+TNWKIHG+YA
Sbjct: 360  LLMYLVLVFSTDCGLGLKVLGLAQTFVGFAVGLHYYVAVFHRVVLRQPPKTNWKIHGVYA 419

Query: 1260 TCFLVICLLANAERRKKAYLEDGGEEGKKS 1349
            TCFL+ICLL+NAERRKK YLE+GG+EGKK+
Sbjct: 420  TCFLLICLLSNAERRKKEYLEEGGDEGKKN 449


>ref|XP_002307092.1| predicted protein [Populus trichocarpa] gi|222856541|gb|EEE94088.1|
            predicted protein [Populus trichocarpa]
          Length = 413

 Score =  360 bits (923), Expect = 7e-97
 Identities = 190/322 (59%), Positives = 232/322 (72%), Gaps = 11/322 (3%)
 Frame = +3

Query: 417  LNATSRSSNSTKLNTITTTKKSIDLPK-----------NKTKVIITQTEKKPKSQNNQEA 563
            LN+TS++SN TK +   +TKK+ DL K             TK   +  +KK  +Q +Q+ 
Sbjct: 95   LNSTSKASNFTK-SIAGSTKKTPDLLKLGSSTDKTTKPTSTKQTQSLVDKKVGNQESQKQ 153

Query: 564  IKIXXXXXXXXXXXXXXXXXNWMEIEDDDDFLVSEFRDLPSKFQQTLLPDLERISLTSRA 743
             K                  +W+   D+DD LV+EFRDLPSKF QTL+PDLERIS+TS+ 
Sbjct: 154  NK-NQKQTNEKKTTQSKKQPSWIGQHDEDD-LVAEFRDLPSKFHQTLIPDLERISITSKK 211

Query: 744  YINQANKEISEGFRPLVGKQYAPTIASIVSCVFMFLPLLLVSLIFNRIKAYFSLQKILIF 923
            Y+ +AN +++ GF+P+VG +YAPTIASIVS  F+ +PLLLVSLIFN IKAYFS+QKILIF
Sbjct: 212  YLTKANNDLTRGFKPIVGNKYAPTIASIVSFAFILIPLLLVSLIFNHIKAYFSIQKILIF 271

Query: 924  VQVYLSIYFSILSFASIVTGLEPLKFFYASSQSSYICLQVFQTXXXXXXXXXXXXXXXXV 1103
            +QVYLSIYF+IL  +S+VTGLEPLKFFYA+SQS+Y+CL V QT                V
Sbjct: 272  IQVYLSIYFTILCLSSLVTGLEPLKFFYATSQSTYVCLMVLQTLGYALYLLLLLMYLILV 331

Query: 1104 FSTETGLGLKVLGLGQTFVGFSVGLHYYVTVFHRAVLHQPPRTNWKIHGIYATCFLVICL 1283
            FSTE GL  K LGLGQT VG++VGLHYYV VFHR VLHQPP+TNWK+HGIYATCFLVICL
Sbjct: 332  FSTECGLSSKFLGLGQTLVGYAVGLHYYVAVFHRVVLHQPPKTNWKVHGIYATCFLVICL 391

Query: 1284 LANAERRKKAYLEDGGEEGKKS 1349
             ANAERRKKAYLE+GGEEGKK+
Sbjct: 392  FANAERRKKAYLEEGGEEGKKN 413


>ref|NP_195694.1| uncharacterized protein [Arabidopsis thaliana]
            gi|4490735|emb|CAB38897.1| putative protein [Arabidopsis
            thaliana] gi|7271039|emb|CAB80647.1| putative protein
            [Arabidopsis thaliana] gi|18175809|gb|AAL59931.1| unknown
            protein [Arabidopsis thaliana] gi|22136848|gb|AAM91768.1|
            unknown protein [Arabidopsis thaliana]
            gi|332661725|gb|AEE87125.1| uncharacterized protein
            [Arabidopsis thaliana]
          Length = 451

 Score =  358 bits (920), Expect = 2e-96
 Identities = 186/328 (56%), Positives = 233/328 (71%), Gaps = 17/328 (5%)
 Frame = +3

Query: 417  LNATSRSSNSTKL--------------NTITTTKKSIDLPKN---KTKVIITQTEKKPKS 545
            LN+T  SSN+TK               N+ ++ KKS DL K+   K K  I     K  S
Sbjct: 125  LNSTKSSSNTTKTSSELKKLNSGTKSTNSTSSIKKSADLSKSSSSKNKTTIKPPSSKLSS 184

Query: 546  QNNQEAIKIXXXXXXXXXXXXXXXXXNWMEIEDDDDFLVSEFRDLPSKFQQTLLPDLERI 725
              +++  +                   W++ E+D+DF VSEFRDLP++FQ++L+PDLERI
Sbjct: 185  PPSEKKSQPSSKPVTKSKQSEKEIKPFWLDDEEDEDF-VSEFRDLPTRFQRSLIPDLERI 243

Query: 726  SLTSRAYINQANKEISEGFRPLVGKQYAPTIASIVSCVFMFLPLLLVSLIFNRIKAYFSL 905
            S TS+ YIN+ANK+I++ F+P  G +YAPTIAS+VS VF+ +PLLLVSLIFNR KAYFSL
Sbjct: 244  STTSKNYINKANKQITKNFKPYFGNKYAPTIASVVSFVFILVPLLLVSLIFNRFKAYFSL 303

Query: 906  QKILIFVQVYLSIYFSILSFASIVTGLEPLKFFYASSQSSYICLQVFQTXXXXXXXXXXX 1085
            QKILIF+Q+YLSIYFSIL  +S+VTG+EPLKF YA+S S+Y+CLQ+ QT           
Sbjct: 304  QKILIFIQIYLSIYFSILCLSSLVTGIEPLKFLYATSSSTYVCLQILQTLGYVFYLLLLL 363

Query: 1086 XXXXXVFSTETGLGLKVLGLGQTFVGFSVGLHYYVTVFHRAVLHQPPRTNWKIHGIYATC 1265
                 VFST+ GLGLKVLGL QTFVGF+VGLHYYV VFHR VL QPP+TNWKIHG+YATC
Sbjct: 364  MYLVLVFSTDCGLGLKVLGLAQTFVGFAVGLHYYVAVFHRVVLRQPPKTNWKIHGVYATC 423

Query: 1266 FLVICLLANAERRKKAYLEDGGEEGKKS 1349
            FL+ICLL+NAERRKK YLE+GG+EGKK+
Sbjct: 424  FLLICLLSNAERRKKEYLEEGGDEGKKN 451


>ref|XP_002533203.1| conserved hypothetical protein [Ricinus communis]
            gi|223526979|gb|EEF29174.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 458

 Score =  357 bits (917), Expect = 4e-96
 Identities = 208/426 (48%), Positives = 262/426 (61%), Gaps = 40/426 (9%)
 Frame = +3

Query: 192  QGKQLMGRRRMXXXXXXXXXXXXQVQSKKK-TNLVSKNQTKLIXXXXXXXXXXXXXXXXX 368
            + KQ  GRRRM            Q+  KKK  NL +KNQTKLI                 
Sbjct: 42   ESKQFFGRRRMLEVENDQEQD--QLSPKKKIANLSTKNQTKLIKSSLSTKNQTISKLTKP 99

Query: 369  XXXXXXXXXXXXXXXX-------------------LNATSRSSNSTKLNTITTTKKSIDL 491
                                               LN+TS+S NSTK  T + TKK+ DL
Sbjct: 100  TNSTKSSTTLSKNELKKLNSTSQLKKLNSISQLKKLNSTSKSYNSTK-PTSSFTKKTSDL 158

Query: 492  -----PKNKT---------------KVIITQTEKKPKSQNNQEAIKIXXXXXXXXXXXXX 611
                 PKNKT               K   ++++K+ K+Q   E                 
Sbjct: 159  LKLGSPKNKTTKPTSSKDTQILADKKNSDSESQKQTKNQKTNER-----KASTKKTTTQS 213

Query: 612  XXXXNWMEIEDDDDFLVSEFRDLPSKFQQTLLPDLERISLTSRAYINQANKEISEGFRPL 791
                +W++ E +DD LV+EFRDLPSKFQQTLLPDLERIS+TS+ Y+ +ANKE+++GF+P+
Sbjct: 214  QKQPSWLDQEMEDD-LVAEFRDLPSKFQQTLLPDLERISITSKKYLTKANKEMTKGFKPI 272

Query: 792  VGKQYAPTIASIVSCVFMFLPLLLVSLIFNRIKAYFSLQKILIFVQVYLSIYFSILSFAS 971
            VG +YA   A++VS  F+ +PLLLVSL+FNRIKAYFS+QKI+IF+QVYLSIYFSIL  +S
Sbjct: 273  VGNKYASATATVVSFAFITIPLLLVSLVFNRIKAYFSIQKIVIFIQVYLSIYFSILCLSS 332

Query: 972  IVTGLEPLKFFYASSQSSYICLQVFQTXXXXXXXXXXXXXXXXVFSTETGLGLKVLGLGQ 1151
            +VTGLEPL+FFYA+S+S+Y+CL V QT                VFSTE+GLG ++LGLGQ
Sbjct: 333  LVTGLEPLRFFYATSESTYVCLMVLQTLGYVLYLLLLLMYLILVFSTESGLGSRLLGLGQ 392

Query: 1152 TFVGFSVGLHYYVTVFHRAVLHQPPRTNWKIHGIYATCFLVICLLANAERRKKAYLEDGG 1331
             FVGF++GLH+YV+VFHR VLHQPP+TNWK+H IYATCFLVICLLA  +RRKKAYLEDGG
Sbjct: 393  IFVGFAIGLHFYVSVFHRVVLHQPPKTNWKVHAIYATCFLVICLLAKFDRRKKAYLEDGG 452

Query: 1332 EEGKKS 1349
            EEGKK+
Sbjct: 453  EEGKKN 458


Top