BLASTX nr result

ID: Coptis25_contig00015223

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00015223
         (1860 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002314378.1| predicted protein [Populus trichocarpa] gi|2...   204   8e-50
ref|XP_002865709.1| hypothetical protein ARALYDRAFT_917876 [Arab...   171   8e-40
ref|XP_002270026.1| PREDICTED: uncharacterized protein LOC100244...   164   6e-38
ref|XP_003550539.1| PREDICTED: uncharacterized protein LOC100798...   157   7e-36
ref|XP_002516943.1| conserved hypothetical protein [Ricinus comm...   151   7e-34

>ref|XP_002314378.1| predicted protein [Populus trichocarpa] gi|222863418|gb|EEF00549.1|
            predicted protein [Populus trichocarpa]
          Length = 389

 Score =  204 bits (518), Expect = 8e-50
 Identities = 154/415 (37%), Positives = 192/415 (46%), Gaps = 29/415 (6%)
 Frame = +1

Query: 199  DDIGEGMQCSNHPYKNNPGGICAFCLQEKLGKLLXXXXXXXXXXXXXXXXXXXXXXXXXX 378
            +D+G+GMQCS+HPY+NNPGGICAFCLQEKLGKL+                          
Sbjct: 2    EDLGDGMQCSDHPYRNNPGGICAFCLQEKLGKLVSSSFPLPIRGSSSSSSSPSFRSVIGV 61

Query: 379  XXXNNKVGSVKTYGSLGFTTT------GGATXXXXXXXXXXXXXXXXXXTD-----VHSN 525
               +N VG+  +       TT      GG+                          V S+
Sbjct: 62   GGSSN-VGAGTSLSLAARPTTTKCRNDGGSNSHYQEYYTRRARIPFLLAKKKKKIMVASS 120

Query: 526  SSN--LALKRSKSSSSGAVAIIHHPHLFDGEEYSPRKNGFWSFLHFXXXXXXXDKMKGE- 696
            +S+  +  KRSKS+++   +        DGE++SPR+ GFWSFL+           K E 
Sbjct: 121  TSDRDIVFKRSKSTTTPRRSHFLDAATDDGEDFSPRRRGFWSFLYLSSSKPGTSTKKIEK 180

Query: 697  ------SRPSVHPTTTNNHAI-AKDKCVGESSSSMRRGE-----YGDENEXXXXXXXXXX 840
                  S  ++  T+TN   +  K+KC+G S S  R+G+       D++           
Sbjct: 181  VSSLASSTRAITTTSTNGSTVRPKEKCLGSSLS--RKGDSIVVVEDDDDSPNSQATASAS 238

Query: 841  XFGRKVARSRSVGCGSRSFSGDLLERISTGFGDCTLRRVESQREAKQASKXXXXXXXXXX 1020
             F RKV+RSRSVGCGSRSFSGD  ERISTGFGDCTLRRVESQRE K              
Sbjct: 239  TFERKVSRSRSVGCGSRSFSGDFFERISTGFGDCTLRRVESQREGKPKVTSGASH----- 293

Query: 1021 XXXXXXEHHHIIKERVKCGGIFGGF---XXXXXXXXXXXXXXEEVNNDNRRGLGIGNHIT 1191
                       +KERV+CGGIFGGF                     + N +  G G    
Sbjct: 294  -----------MKERVRCGGIFGGFNITSSSSSSSSSSYWVSSSAEDMNGKSSGAGP--- 339

Query: 1192 SLPPHSRSKSWGWAFASPMRAFRSTMKPEDKXXXXXXXXXXXXXXLSAIPSLLTV 1356
                H RS+SWGWAFASPMRAF S  KP  K              LSAIPSLL V
Sbjct: 340  --LAHGRSRSWGWAFASPMRAFGS--KPSSK--DGKRNIKHTTPNLSAIPSLLAV 388


>ref|XP_002865709.1| hypothetical protein ARALYDRAFT_917876 [Arabidopsis lyrata subsp.
            lyrata] gi|297311544|gb|EFH41968.1| hypothetical protein
            ARALYDRAFT_917876 [Arabidopsis lyrata subsp. lyrata]
          Length = 384

 Score =  171 bits (432), Expect = 8e-40
 Identities = 138/415 (33%), Positives = 177/415 (42%), Gaps = 30/415 (7%)
 Frame = +1

Query: 202  DIGEGMQCSNHPYKNNPGGICAFCLQEKLGKLLXXXXXXXXXXXXXXXXXXXXXXXXXXX 381
            D+G+GMQC NHP+  NPGGICAFCLQEKLGKL+                           
Sbjct: 8    DMGDGMQCINHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSPSFR----- 62

Query: 382  XXNNKVGSVKTYGSLGFT-TTGGATXXXXXXXXXXXXXXXXXXTDVHSNSSNLALKRSKS 558
              ++ VGS  T  +   + +  GAT                      + ++N+  KRS+S
Sbjct: 63   --SDSVGSTTTASAASLSLSVSGATNNNKLPFLLAKKKKKMLTASSSATTANIVYKRSQS 120

Query: 559  SSSGAVAIIHHPHLFDGEEYSPRK-NGFWSFLHFXXXXXXXDKMKGESRP--SVHPTTTN 729
            + +           +   + SPRK NGFWSFLH         K  G S+   + H  T+ 
Sbjct: 121  TRTTKTT-------YGDSDLSPRKRNGFWSFLHLYS-----SKHHGSSKKVGNFHQPTSQ 168

Query: 730  ---NHAIAKDKCVGESSSS-----MRRGEYGDENEXXXXXXXXXXXFG----------RK 855
                  + +   VG SSSS     M +   G  N             G          RK
Sbjct: 169  IEIKTELTETTTVGSSSSSSASSSMSKRVVGGSNSNRNGIDVIVEEDGSPNIEVTPSERK 228

Query: 856  VARSRSVGCGSRSFSGDLLERISTGFGDCTLRRVESQREAKQASKXXXXXXXXXXXXXXX 1035
            V+RSRSVGCGSRSFSGD  ERI+ GFGDCTLRRVESQRE                     
Sbjct: 229  VSRSRSVGCGSRSFSGDFFERITNGFGDCTLRRVESQREGNN-----------NKGNKVS 277

Query: 1036 XEHHHIIKERVKCGGIFGGF-------XXXXXXXXXXXXXXEEVNNDNRRGLGIGNHITS 1194
                + ++E V+CGGIFGGF                     E  ++++  G G G     
Sbjct: 278  SNPSNGVREMVRCGGIFGGFMIMTSSSSSSSSSSWVSSSSAEHHHHNHNMGHGGG----- 332

Query: 1195 LPPHSRSKSWGWAFASPMRAFRSTMK-PEDKXXXXXXXXXXXXXXLSAIPSLLTV 1356
                 R++SWGWAFASPMRAF S+    +                L AIPSLL+V
Sbjct: 333  -----RNRSWGWAFASPMRAFSSSSSFGKRGRTISDSTSKNTTPNLGAIPSLLSV 382


>ref|XP_002270026.1| PREDICTED: uncharacterized protein LOC100244834 [Vitis vinifera]
          Length = 420

 Score =  164 bits (416), Expect = 6e-38
 Identities = 124/298 (41%), Positives = 143/298 (47%), Gaps = 19/298 (6%)
 Frame = +1

Query: 520  SNSSNLALKRSKSSSSGAVAIIHHPHLF----DGEEYSPRKNGFWSFLHFXXXXXXX--D 681
            S++  + LKRSKS+++         H      D  +YSP+K GFWSFL+          D
Sbjct: 134  SDAVGIVLKRSKSTTTP-----RRGHFLVESEDANDYSPQKRGFWSFLYLPKSTATRKMD 188

Query: 682  KMKGES----RPSVHPTTTNNHAIA----KDKCVGESSSSMRRGEYGDENEXXXXXXXXX 837
            K  G +       +    T  HA      KDK +G SSS  ++ E+ DE+E         
Sbjct: 189  KAVGSAVTPRESKLAAAITQTHASLSHKPKDKGLG-SSSLAKKEEFVDESESPNSHATAS 247

Query: 838  XX-FGRKVARSRSVGCGSRSFSGDLLERISTGFGDCTLRRVESQREAKQASKXXXXXXXX 1014
               FGRKV+RSRSVGCGSRSFSGD  ERISTGFGDCTLRRVESQRE K  S         
Sbjct: 248  SSSFGRKVSRSRSVGCGSRSFSGDFFERISTGFGDCTLRRVESQREGKPKSS------GA 301

Query: 1015 XXXXXXXXEHHHIIKERVKCGGIFGGF--XXXXXXXXXXXXXXEEVNNDNRRGLGIGNHI 1188
                      H  IKERVKCGGIFGGF                     DN  G       
Sbjct: 302  HRGGAPGGPDHQCIKERVKCGGIFGGFIMTSSSSSSSSSSYWMSSTVEDNVNGKSTAAAA 361

Query: 1189 TSLPPHSRSKSWGWAFASPMRAFR--STMKPEDKXXXXXXXXXXXXXXLSAIPSLLTV 1356
                 H RSKSWGWAFASPMRA    S+ K E K              L+AIPSLL V
Sbjct: 362  PGPLSHGRSKSWGWAFASPMRALSKPSSSKVEYK-DAGKRDITPNKPNLAAIPSLLAV 418



 Score = 74.7 bits (182), Expect = 8e-11
 Identities = 30/34 (88%), Positives = 34/34 (100%)
 Frame = +1

Query: 199 DDIGEGMQCSNHPYKNNPGGICAFCLQEKLGKLL 300
           DD+GEGMQCS+HPY+NNPGGICAFCLQEKLGKL+
Sbjct: 10  DDVGEGMQCSDHPYRNNPGGICAFCLQEKLGKLV 43


>ref|XP_003550539.1| PREDICTED: uncharacterized protein LOC100798085 [Glycine max]
          Length = 444

 Score =  157 bits (398), Expect = 7e-36
 Identities = 112/300 (37%), Positives = 139/300 (46%), Gaps = 23/300 (7%)
 Frame = +1

Query: 526  SSNLALKRSKSSSSGAVAIIHHPHLFDGEE---------YSPRK-NGFWSFLHFXXXXXX 675
            S N+ LKRSKS+++       +  L D ++         +SPRK NGFWSFL+       
Sbjct: 149  SDNILLKRSKSTATPR----RNRSLVDDDDNDDDLVIGPFSPRKRNGFWSFLYLSSKSSK 204

Query: 676  XDKMKGESRPSVHPT----------TTNNHAIAKDKCVGESSSSMRRGEYGDENEXXXXX 825
                K     +++ T           + + A  K+KC   SS         D N      
Sbjct: 205  KLNSKSFRDNNINNTPRISSINLAPASTSSAKLKEKCCSGSSLKTDIVVEQDNNNSNSPN 264

Query: 826  XXXXXXFGRKVARSRSVGCGSRSFSGDLLERISTGFGDCTLRRVESQREAKQASKXXXXX 1005
                  F RKV+RS+SVGCGSRSFSGD  ERISTGFGDCTLRRVESQRE K   K     
Sbjct: 265  TASASSFERKVSRSKSVGCGSRSFSGDFFERISTGFGDCTLRRVESQREGK--PKGTGGG 322

Query: 1006 XXXXXXXXXXXEHHHIIKERVKCGGIFGGFXXXXXXXXXXXXXXEEVNNDNRRGLGIGNH 1185
                        HHH IKERV+CGG+F GF                 ++ +      G  
Sbjct: 323  ASAAVSRAGEQHHHHCIKERVRCGGLFSGFMMTSSSSSSSSSSYWVSSSADDAAAVNGKS 382

Query: 1186 ITSLPPHSRSKSWGWAFASPMRAFR---STMKPEDKXXXXXXXXXXXXXXLSAIPSLLTV 1356
             T    H+R +SWGWAFASPMRAF    S+ +   +              LSAIPSLL V
Sbjct: 383  ATVALSHNRGRSWGWAFASPMRAFSGKPSSKESNRRDIIRDANDKNATPNLSAIPSLLAV 442



 Score = 69.3 bits (168), Expect = 3e-09
 Identities = 27/33 (81%), Positives = 33/33 (100%)
 Frame = +1

Query: 202 DIGEGMQCSNHPYKNNPGGICAFCLQEKLGKLL 300
           D+G+GMQCS+HPY+NNPGGICAFCLQ+KLGKL+
Sbjct: 45  DMGDGMQCSDHPYRNNPGGICAFCLQDKLGKLV 77


>ref|XP_002516943.1| conserved hypothetical protein [Ricinus communis]
            gi|223544031|gb|EEF45557.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 450

 Score =  151 bits (381), Expect = 7e-34
 Identities = 117/312 (37%), Positives = 143/312 (45%), Gaps = 31/312 (9%)
 Frame = +1

Query: 514  VHSNSSNLALKRSKSSSSGAVAIIHH---PHLFDGEE--YSPRKNG-FWSFLHFXXXXXX 675
            V S+  ++  KRSKS+++      HH       DG+E   SPR+ G FWSFL+       
Sbjct: 153  VASSDRDIVFKRSKSTATPGRRNHHHFLDASTDDGDEDFTSPRRRGGFWSFLYLSSSSSK 212

Query: 676  X----------DKMKG---ESRPSVHPTTTNNHAIA--KDKCVGESSSSMRR-GEYGDEN 807
                       DK+      + P+   TTT N ++   KDKC+G S S         D++
Sbjct: 213  SITTTATKKTSDKVSSLIVTTTPATTATTTANGSMMRPKDKCLGTSLSKKSDIVAVEDDD 272

Query: 808  EXXXXXXXXXXXFGRKVARSRSVGCGSRSFSGDLLERISTGFGDCTLRRVESQREAKQAS 987
                        F RKV+RSRSVGCGSRSFSGD  ERISTGFGDCTLRRVESQRE K   
Sbjct: 273  SPNSQATASASSFERKVSRSRSVGCGSRSFSGDFFERISTGFGDCTLRRVESQREGKPKG 332

Query: 988  KXXXXXXXXXXXXXXXXEHHHIIKERVKCGGIFGGF--------XXXXXXXXXXXXXXEE 1143
                                  +KERVKCGGIFGGF                        
Sbjct: 333  PGAASH----------------MKERVKCGGIFGGFMITSSSSSSSSSSYWVSSSAEEHH 376

Query: 1144 VNNDNRRGLGIGNHITSLPPHSRSKSWGWAFASPMRAF-RSTMKPEDKXXXXXXXXXXXX 1320
            +N  +  G+           H RS+SWGWAFASPMRAF + + K   +            
Sbjct: 377  MNGKSTHGVAAAAGAGGPLAHGRSRSWGWAFASPMRAFSKPSSKDGKRDIIREASNKNTT 436

Query: 1321 XXLSAIPSLLTV 1356
              LSAIPSLL V
Sbjct: 437  PNLSAIPSLLAV 448



 Score = 72.4 bits (176), Expect = 4e-10
 Identities = 29/38 (76%), Positives = 36/38 (94%)
 Frame = +1

Query: 187 GLVVDDIGEGMQCSNHPYKNNPGGICAFCLQEKLGKLL 300
           G+  +D+G+GMQCS+HPY+NNPGGICAFCLQEKLGKL+
Sbjct: 24  GIGEEDMGDGMQCSDHPYRNNPGGICAFCLQEKLGKLV 61

