BLASTX nr result

ID: Coptis24_contig00010522 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00010522
         (1471 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002314378.1| predicted protein [Populus trichocarpa] gi|2...   204   6e-50
ref|XP_002865709.1| hypothetical protein ARALYDRAFT_917876 [Arab...   171   6e-40
ref|XP_002270026.1| PREDICTED: uncharacterized protein LOC100244...   164   4e-38
ref|XP_003550539.1| PREDICTED: uncharacterized protein LOC100798...   157   5e-36
ref|XP_002516943.1| conserved hypothetical protein [Ricinus comm...   151   5e-34

>ref|XP_002314378.1| predicted protein [Populus trichocarpa] gi|222863418|gb|EEF00549.1|
            predicted protein [Populus trichocarpa]
          Length = 389

 Score =  204 bits (518), Expect = 6e-50
 Identities = 157/415 (37%), Positives = 197/415 (47%), Gaps = 29/415 (6%)
 Frame = -2

Query: 1287 DDIGEGMQCSNHPYKNNPGGICAFCLQEKLGKLLXXXXXXXXXXXXXXXXXXXXXXXXXX 1108
            +D+G+GMQCS+HPY+NNPGGICAFCLQEKLGKL+                          
Sbjct: 2    EDLGDGMQCSDHPYRNNPGGICAFCLQEKLGKLVSSSFPLPIRGSSSSSSSPSFRSVIGV 61

Query: 1107 XXSNNKVGSVKTYGSLGFTTT------GGATXXXXXXXXXXXXXXXXXSTD-----VHSN 961
              S+N VG+  +       TT      GG+                  +       V S+
Sbjct: 62   GGSSN-VGAGTSLSLAARPTTTKCRNDGGSNSHYQEYYTRRARIPFLLAKKKKKIMVASS 120

Query: 960  SSN--LALKRSKSSSSGAVAIIHHPHLFDGEEYSPRKNGFWSFLHFXXXXXXRDKMKGE- 790
            +S+  +  KRSKS+++   +        DGE++SPR+ GFWSFL+           K E 
Sbjct: 121  TSDRDIVFKRSKSTTTPRRSHFLDAATDDGEDFSPRRRGFWSFLYLSSSKPGTSTKKIEK 180

Query: 789  ------SRPSVHPTTTNNHAI-AKDKCVGESSSSMRRGE-----YGDENEXXXXXXXXXX 646
                  S  ++  T+TN   +  K+KC+G S S  R+G+       D++           
Sbjct: 181  VSSLASSTRAITTTSTNGSTVRPKEKCLGSSLS--RKGDSIVVVEDDDDSPNSQATASAS 238

Query: 645  SFGRKVARSRSVGCGSRSFSGDLLERISTGFGDCTLRRVESQREAKQASKXXXXXXXXXX 466
            +F RKV+RSRSVGCGSRSFSGD  ERISTGFGDCTLRRVESQRE K              
Sbjct: 239  TFERKVSRSRSVGCGSRSFSGDFFERISTGFGDCTLRRVESQREGKPKVTSGASH----- 293

Query: 465  XXXXXGEHHHIIKERVKCGGIFGGF---XXXXXXXXXXXXXSEEVNNDNRRGLGIGNHIT 295
                       +KERV+CGGIFGGF                S    + N +  G G    
Sbjct: 294  -----------MKERVRCGGIFGGFNITSSSSSSSSSSYWVSSSAEDMNGKSSGAGP--- 339

Query: 294  SLPPHSRSKSWGWAFASPMRAFRSTMKPEDKXXXXXXXXXXXXXNLSAIPSLLTV 130
                H RS+SWGWAFASPMRAF S  KP  K             NLSAIPSLL V
Sbjct: 340  --LAHGRSRSWGWAFASPMRAFGS--KPSSK--DGKRNIKHTTPNLSAIPSLLAV 388


>ref|XP_002865709.1| hypothetical protein ARALYDRAFT_917876 [Arabidopsis lyrata subsp.
            lyrata] gi|297311544|gb|EFH41968.1| hypothetical protein
            ARALYDRAFT_917876 [Arabidopsis lyrata subsp. lyrata]
          Length = 384

 Score =  171 bits (432), Expect = 6e-40
 Identities = 139/415 (33%), Positives = 180/415 (43%), Gaps = 30/415 (7%)
 Frame = -2

Query: 1284 DIGEGMQCSNHPYKNNPGGICAFCLQEKLGKLLXXXXXXXXXXXXXXXXXXXXXXXXXXX 1105
            D+G+GMQC NHP+  NPGGICAFCLQEKLGKL+                           
Sbjct: 8    DMGDGMQCINHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSPSFR----- 62

Query: 1104 XSNNKVGSVKTYGSLGFT-TTGGATXXXXXXXXXXXXXXXXXSTDVHSNSSNLALKRSKS 928
              ++ VGS  T  +   + +  GAT                 +    + ++N+  KRS+S
Sbjct: 63   --SDSVGSTTTASAASLSLSVSGATNNNKLPFLLAKKKKKMLTASSSATTANIVYKRSQS 120

Query: 927  SSSGAVAIIHHPHLFDGEEYSPRK-NGFWSFLHFXXXXXXRDKMKGESRP--SVHPTTTN 757
            + +           +   + SPRK NGFWSFLH         K  G S+   + H  T+ 
Sbjct: 121  TRTTKTT-------YGDSDLSPRKRNGFWSFLHLYS-----SKHHGSSKKVGNFHQPTSQ 168

Query: 756  ---NHAIAKDKCVGESSSS-----MRRGEYGDENEXXXXXXXXXXSFG----------RK 631
                  + +   VG SSSS     M +   G  N             G          RK
Sbjct: 169  IEIKTELTETTTVGSSSSSSASSSMSKRVVGGSNSNRNGIDVIVEEDGSPNIEVTPSERK 228

Query: 630  VARSRSVGCGSRSFSGDLLERISTGFGDCTLRRVESQREAKQASKXXXXXXXXXXXXXXX 451
            V+RSRSVGCGSRSFSGD  ERI+ GFGDCTLRRVESQRE                     
Sbjct: 229  VSRSRSVGCGSRSFSGDFFERITNGFGDCTLRRVESQREGNN-----------NKGNKVS 277

Query: 450  GEHHHIIKERVKCGGIFGGF-------XXXXXXXXXXXXXSEEVNNDNRRGLGIGNHITS 292
                + ++E V+CGGIFGGF                    +E  ++++  G G G     
Sbjct: 278  SNPSNGVREMVRCGGIFGGFMIMTSSSSSSSSSSWVSSSSAEHHHHNHNMGHGGG----- 332

Query: 291  LPPHSRSKSWGWAFASPMRAFRSTMK-PEDKXXXXXXXXXXXXXNLSAIPSLLTV 130
                 R++SWGWAFASPMRAF S+    +               NL AIPSLL+V
Sbjct: 333  -----RNRSWGWAFASPMRAFSSSSSFGKRGRTISDSTSKNTTPNLGAIPSLLSV 382


>ref|XP_002270026.1| PREDICTED: uncharacterized protein LOC100244834 [Vitis vinifera]
          Length = 420

 Score =  164 bits (416), Expect = 4e-38
 Identities = 128/298 (42%), Positives = 147/298 (49%), Gaps = 19/298 (6%)
 Frame = -2

Query: 966 SNSSNLALKRSKSSSSGAVAIIHHPHLF----DGEEYSPRKNGFWSFLHFXXXXXXR--D 805
           S++  + LKRSKS+++         H      D  +YSP+K GFWSFL+       R  D
Sbjct: 134 SDAVGIVLKRSKSTTTP-----RRGHFLVESEDANDYSPQKRGFWSFLYLPKSTATRKMD 188

Query: 804 KMKGES----RPSVHPTTTNNHAIA----KDKCVGESSSSMRRGEYGDENEXXXXXXXXX 649
           K  G +       +    T  HA      KDK +G SSS  ++ E+ DE+E         
Sbjct: 189 KAVGSAVTPRESKLAAAITQTHASLSHKPKDKGLG-SSSLAKKEEFVDESESPNSHATAS 247

Query: 648 XS-FGRKVARSRSVGCGSRSFSGDLLERISTGFGDCTLRRVESQREAKQASKXXXXXXXX 472
            S FGRKV+RSRSVGCGSRSFSGD  ERISTGFGDCTLRRVESQRE K  S         
Sbjct: 248 SSSFGRKVSRSRSVGCGSRSFSGDFFERISTGFGDCTLRRVESQREGKPKSS------GA 301

Query: 471 XXXXXXXGEHHHIIKERVKCGGIFGGF--XXXXXXXXXXXXXSEEVNNDNRRGLGIGNHI 298
                  G  H  IKERVKCGGIFGGF                     DN  G       
Sbjct: 302 HRGGAPGGPDHQCIKERVKCGGIFGGFIMTSSSSSSSSSSYWMSSTVEDNVNGKSTAAAA 361

Query: 297 TSLPPHSRSKSWGWAFASPMRAFR--STMKPEDKXXXXXXXXXXXXXNLSAIPSLLTV 130
                H RSKSWGWAFASPMRA    S+ K E K             NL+AIPSLL V
Sbjct: 362 PGPLSHGRSKSWGWAFASPMRALSKPSSSKVEYK-DAGKRDITPNKPNLAAIPSLLAV 418



 Score = 74.7 bits (182), Expect = 6e-11
 Identities = 30/34 (88%), Positives = 34/34 (100%)
 Frame = -2

Query: 1287 DDIGEGMQCSNHPYKNNPGGICAFCLQEKLGKLL 1186
            DD+GEGMQCS+HPY+NNPGGICAFCLQEKLGKL+
Sbjct: 10   DDVGEGMQCSDHPYRNNPGGICAFCLQEKLGKLV 43


>ref|XP_003550539.1| PREDICTED: uncharacterized protein LOC100798085 [Glycine max]
          Length = 444

 Score =  157 bits (398), Expect = 5e-36
 Identities = 114/300 (38%), Positives = 142/300 (47%), Gaps = 23/300 (7%)
 Frame = -2

Query: 960  SSNLALKRSKSSSSGAVAIIHHPHLFDGEE---------YSPRK-NGFWSFLHFXXXXXX 811
            S N+ LKRSKS+++       +  L D ++         +SPRK NGFWSFL+       
Sbjct: 149  SDNILLKRSKSTATPR----RNRSLVDDDDNDDDLVIGPFSPRKRNGFWSFLYLSSKSSK 204

Query: 810  RDKMKGESRPSVHPT----------TTNNHAIAKDKCVGESSSSMRRGEYGDENEXXXXX 661
            +   K     +++ T           + + A  K+KC   SS         D N      
Sbjct: 205  KLNSKSFRDNNINNTPRISSINLAPASTSSAKLKEKCCSGSSLKTDIVVEQDNNNSNSPN 264

Query: 660  XXXXXSFGRKVARSRSVGCGSRSFSGDLLERISTGFGDCTLRRVESQREAKQASKXXXXX 481
                 SF RKV+RS+SVGCGSRSFSGD  ERISTGFGDCTLRRVESQRE K   K     
Sbjct: 265  TASASSFERKVSRSKSVGCGSRSFSGDFFERISTGFGDCTLRRVESQREGK--PKGTGGG 322

Query: 480  XXXXXXXXXXGEHHHIIKERVKCGGIFGGFXXXXXXXXXXXXXSEEVNNDNRRGLGIGNH 301
                        HHH IKERV+CGG+F GF                 ++ +      G  
Sbjct: 323  ASAAVSRAGEQHHHHCIKERVRCGGLFSGFMMTSSSSSSSSSSYWVSSSADDAAAVNGKS 382

Query: 300  ITSLPPHSRSKSWGWAFASPMRAFR---STMKPEDKXXXXXXXXXXXXXNLSAIPSLLTV 130
             T    H+R +SWGWAFASPMRAF    S+ +   +             NLSAIPSLL V
Sbjct: 383  ATVALSHNRGRSWGWAFASPMRAFSGKPSSKESNRRDIIRDANDKNATPNLSAIPSLLAV 442



 Score = 69.3 bits (168), Expect = 2e-09
 Identities = 27/33 (81%), Positives = 33/33 (100%)
 Frame = -2

Query: 1284 DIGEGMQCSNHPYKNNPGGICAFCLQEKLGKLL 1186
            D+G+GMQCS+HPY+NNPGGICAFCLQ+KLGKL+
Sbjct: 45   DMGDGMQCSDHPYRNNPGGICAFCLQDKLGKLV 77


>ref|XP_002516943.1| conserved hypothetical protein [Ricinus communis]
            gi|223544031|gb|EEF45557.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 450

 Score =  151 bits (381), Expect = 5e-34
 Identities = 119/312 (38%), Positives = 145/312 (46%), Gaps = 31/312 (9%)
 Frame = -2

Query: 972  VHSNSSNLALKRSKSSSSGAVAIIHH---PHLFDGEE--YSPRKNG-FWSFLHFXXXXXX 811
            V S+  ++  KRSKS+++      HH       DG+E   SPR+ G FWSFL+       
Sbjct: 153  VASSDRDIVFKRSKSTATPGRRNHHHFLDASTDDGDEDFTSPRRRGGFWSFLYLSSSSSK 212

Query: 810  R----------DKMKG---ESRPSVHPTTTNNHAIA--KDKCVGESSSSMRR-GEYGDEN 679
                       DK+      + P+   TTT N ++   KDKC+G S S         D++
Sbjct: 213  SITTTATKKTSDKVSSLIVTTTPATTATTTANGSMMRPKDKCLGTSLSKKSDIVAVEDDD 272

Query: 678  EXXXXXXXXXXSFGRKVARSRSVGCGSRSFSGDLLERISTGFGDCTLRRVESQREAKQAS 499
                       SF RKV+RSRSVGCGSRSFSGD  ERISTGFGDCTLRRVESQRE K   
Sbjct: 273  SPNSQATASASSFERKVSRSRSVGCGSRSFSGDFFERISTGFGDCTLRRVESQREGKPKG 332

Query: 498  KXXXXXXXXXXXXXXXGEHHHIIKERVKCGGIFGGF--------XXXXXXXXXXXXXSEE 343
                                  +KERVKCGGIFGGF                        
Sbjct: 333  PGAASH----------------MKERVKCGGIFGGFMITSSSSSSSSSSYWVSSSAEEHH 376

Query: 342  VNNDNRRGLGIGNHITSLPPHSRSKSWGWAFASPMRAF-RSTMKPEDKXXXXXXXXXXXX 166
            +N  +  G+           H RS+SWGWAFASPMRAF + + K   +            
Sbjct: 377  MNGKSTHGVAAAAGAGGPLAHGRSRSWGWAFASPMRAFSKPSSKDGKRDIIREASNKNTT 436

Query: 165  XNLSAIPSLLTV 130
             NLSAIPSLL V
Sbjct: 437  PNLSAIPSLLAV 448



 Score = 72.4 bits (176), Expect = 3e-10
 Identities = 29/38 (76%), Positives = 36/38 (94%)
 Frame = -2

Query: 1299 GLVVDDIGEGMQCSNHPYKNNPGGICAFCLQEKLGKLL 1186
            G+  +D+G+GMQCS+HPY+NNPGGICAFCLQEKLGKL+
Sbjct: 24   GIGEEDMGDGMQCSDHPYRNNPGGICAFCLQEKLGKLV 61


Top