BLASTX nr result

ID: Coptis24_contig00008582 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00008582
         (2338 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002521120.1| conserved hypothetical protein [Ricinus comm...   316   2e-83
ref|XP_004139654.1| PREDICTED: uncharacterized protein LOC101208...   309   2e-81
ref|XP_004139655.1| PREDICTED: uncharacterized protein LOC101208...   308   6e-81
ref|XP_002883554.1| hypothetical protein ARALYDRAFT_479993 [Arab...   306   1e-80
ref|XP_002265815.1| PREDICTED: uncharacterized protein LOC100263...   303   2e-79

>ref|XP_002521120.1| conserved hypothetical protein [Ricinus communis]
            gi|223539689|gb|EEF41271.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 386

 Score =  316 bits (810), Expect = 2e-83
 Identities = 175/363 (48%), Positives = 218/363 (60%), Gaps = 14/363 (3%)
 Frame = -2

Query: 1596 MTDLKGSTSLGIDDSALHKEWDDALCPICMDHPHNAVLLLCSSFDNGCRPYICDSSYRHS 1417
            MT +K S     D   LH E D+  CPICMDHPHNAVLLLCSS + GCR YICD+S RHS
Sbjct: 1    MTGVKRSRYTDSDIRTLHNELDEVSCPICMDHPHNAVLLLCSSHEKGCRSYICDTSSRHS 60

Query: 1416 NCLDRFKKYFRSGPSQPDSVFIENPENPRLDTSESIFRASGSRNN---------LMEDXX 1264
            NCLDR+KK   S  S           N  LD+S  I   S S  +         +++   
Sbjct: 61   NCLDRYKKLRDSSGS-----------NTTLDSSLPINSFSSSNISDTSLTLGARVLDSYE 109

Query: 1263 XXXXXXXNTASIARTLEVIGDSNREESHRYIDMQDGSILGRDSFEPFRQRIGGEHI---- 1096
                   +  +  R  E + +++ +  +R ++ +   +L     E F  RI  E      
Sbjct: 110  NHNQSDSDNITSVRMPEQLLENSIQHPNRQVETRGEGVLEAGDSESFPDRIELEEADVVN 169

Query: 1095 -SESSLNLKCPLCRGSVKCWEIVEEARRYLDLKQRSCSLESCSFTGNYGELRRHARRVHP 919
             SE+ L+LKCPLCRG+V  WE+VEEAR+YL+LK+RSCS ESCSF GNY ELRRHARRVHP
Sbjct: 170  SSEAGLSLKCPLCRGAVLGWEVVEEARKYLNLKKRSCSRESCSFCGNYQELRRHARRVHP 229

Query: 918  TNRPALVDQSRQRNWRHLENQQEYGDIVSAIRSAMPGAIVFGDYVIETGDGFSANRDNNS 739
            T RP+ VD SR+R WR LE Q+EYGDIVSA+RSAMPGA+V GDYVIE GD FS  R+  +
Sbjct: 230  TTRPSDVDPSRERAWRCLERQREYGDIVSALRSAMPGAVVVGDYVIENGDRFSVEREGGA 289

Query: 738  GESSGPLLTSFILYQMMRPFSPVSEPXXXXXXXXXXXXXXXSLSDRRHLWGENLLGLQDD 559
            GE + P  T+F L+QM+      +EP               +L +RR LWGENLLGLQDD
Sbjct: 290  GEVNAPWWTTFFLFQMIGSIDGAAEPRARSRAWTRHRRSGGALPERRFLWGENLLGLQDD 349

Query: 558  AAD 550
              D
Sbjct: 350  DED 352


>ref|XP_004139654.1| PREDICTED: uncharacterized protein LOC101208460 isoform 1 [Cucumis
            sativus]
          Length = 389

 Score =  309 bits (792), Expect = 2e-81
 Identities = 173/367 (47%), Positives = 218/367 (59%), Gaps = 8/367 (2%)
 Frame = -2

Query: 1599 KMTDLKGSTSLGIDDSALHKEWDDALCPICMDHPHNAVLLLCSSFDNGCRPYICDSSYRH 1420
            KM  +K       D  ALHKE D+  CPICMDHPHNAVLLLCSS   GC+PYICD+S+RH
Sbjct: 3    KMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRH 62

Query: 1419 SNCLDRFKKYFRSGPSQPDSVFIENPENPRLDTSESIFRASGSRNNLMEDXXXXXXXXXN 1240
            SNC D+FKK  R    +   +    P NP   ++ S      S +    D          
Sbjct: 63   SNCFDQFKK-LREETRKSPRLSSPLPINPYSFSNPSTNNLGLSIDLNEVDDNQNINERNT 121

Query: 1239 TASIARTLEVIGDSNREESHRYIDMQDGSILGRDSFEPFRQRIGGEHI----SESSLNLK 1072
             AS       +GD+  E S+R +D  +   +         +R+  E +    S    NLK
Sbjct: 122  VASAGLPGLALGDNGTENSNRTVDTNEAGDMDTAGSGSITERVDQEGLDAGNSSEYSNLK 181

Query: 1071 CPLCRGSVKCWEIVEEARRYLDLKQRSCSLESCSFTGNYGELRRHARRVHPTNRPALVDQ 892
            CP+CRG+V   E++EEAR YL+LK+RSCS E+CSF+GNY ELRRHARRVHPT+RPA++D 
Sbjct: 182  CPMCRGAVLGLEVIEEAREYLNLKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDP 241

Query: 891  SRQRNWRHLENQQEYGDIVSAIRSAMPGAIVFGDYVIETGDGFSA-NRDNNSGESSGPLL 715
            SR+R WR LE Q+E GD+VSAIRSAMPGA+V GDYVIE GDG  A  RDN +G+ +GPLL
Sbjct: 242  SRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLL 301

Query: 714  TSFILYQMMRPFSPVSE--PXXXXXXXXXXXXXXXSLSDRRHLWGENLLGLQDDA-ADWN 544
            TSF L+ M        E  P                +S+RR LWGENLLGLQ+D   D+ 
Sbjct: 302  TSFFLFHMFGSVEGAREPRPRSRSWVRHRRSGGGTPVSERRFLWGENLLGLQEDTDEDFR 361

Query: 543  LSNQMGE 523
            +   MG+
Sbjct: 362  IYIGMGD 368


>ref|XP_004139655.1| PREDICTED: uncharacterized protein LOC101208460 isoform 2 [Cucumis
            sativus] gi|449443782|ref|XP_004139656.1| PREDICTED:
            uncharacterized protein LOC101208460 isoform 3 [Cucumis
            sativus] gi|449527327|ref|XP_004170663.1| PREDICTED:
            uncharacterized protein LOC101225264 isoform 1 [Cucumis
            sativus] gi|449527329|ref|XP_004170664.1| PREDICTED:
            uncharacterized protein LOC101225264 isoform 2 [Cucumis
            sativus]
          Length = 386

 Score =  308 bits (788), Expect = 6e-81
 Identities = 170/354 (48%), Positives = 214/354 (60%), Gaps = 8/354 (2%)
 Frame = -2

Query: 1560 DDSALHKEWDDALCPICMDHPHNAVLLLCSSFDNGCRPYICDSSYRHSNCLDRFKKYFRS 1381
            D  ALHKE D+  CPICMDHPHNAVLLLCSS   GC+PYICD+S+RHSNC D+FKK  R 
Sbjct: 13   DILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKK-LRE 71

Query: 1380 GPSQPDSVFIENPENPRLDTSESIFRASGSRNNLMEDXXXXXXXXXNTASIARTLEVIGD 1201
               +   +    P NP   ++ S      S +    D           AS       +GD
Sbjct: 72   ETRKSPRLSSPLPINPYSFSNPSTNNLGLSIDLNEVDDNQNINERNTVASAGLPGLALGD 131

Query: 1200 SNREESHRYIDMQDGSILGRDSFEPFRQRIGGEHI----SESSLNLKCPLCRGSVKCWEI 1033
            +  E S+R +D  +   +         +R+  E +    S    NLKCP+CRG+V   E+
Sbjct: 132  NGTENSNRTVDTNEAGDMDTAGSGSITERVDQEGLDAGNSSEYSNLKCPMCRGAVLGLEV 191

Query: 1032 VEEARRYLDLKQRSCSLESCSFTGNYGELRRHARRVHPTNRPALVDQSRQRNWRHLENQQ 853
            +EEAR YL+LK+RSCS E+CSF+GNY ELRRHARRVHPT+RPA++D SR+R WR LE Q+
Sbjct: 192  IEEAREYLNLKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQR 251

Query: 852  EYGDIVSAIRSAMPGAIVFGDYVIETGDGFSA-NRDNNSGESSGPLLTSFILYQMMRPFS 676
            E GD+VSAIRSAMPGA+V GDYVIE GDG  A  RDN +G+ +GPLLTSF L+ M     
Sbjct: 252  EVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLLTSFFLFHMFGSVE 311

Query: 675  PVSE--PXXXXXXXXXXXXXXXSLSDRRHLWGENLLGLQDDA-ADWNLSNQMGE 523
               E  P                +S+RR LWGENLLGLQ+D   D+ +   MG+
Sbjct: 312  GAREPRPRSRSWVRHRRSGGGTPVSERRFLWGENLLGLQEDTDEDFRIYIGMGD 365


>ref|XP_002883554.1| hypothetical protein ARALYDRAFT_479993 [Arabidopsis lyrata subsp.
            lyrata] gi|297329394|gb|EFH59813.1| hypothetical protein
            ARALYDRAFT_479993 [Arabidopsis lyrata subsp. lyrata]
          Length = 354

 Score =  306 bits (785), Expect = 1e-80
 Identities = 176/352 (50%), Positives = 205/352 (58%), Gaps = 6/352 (1%)
 Frame = -2

Query: 1596 MTDLKGSTSLGIDDSALHKEWDDALCPICMDHPHNAVLLLCSSFDNGCRPYICDSSYRHS 1417
            M  +K   S   D  ALHKE D+  CP+CMDHPHNAVLLLCSS D GCR YICD+SYRHS
Sbjct: 1    MAGVKRKLSTESDVHALHKELDEVSCPVCMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 60

Query: 1416 NCLDRFKKYFRSGPSQPDSVFIENPENPRLDTSESIFRASGSRNNLMEDXXXXXXXXXNT 1237
            NCLDRFKK     P+ P       PE            AS   NN               
Sbjct: 61   NCLDRFKKLHSESPNDP------TPEGNL---------ASRENNN--------------- 90

Query: 1236 ASIARTLEVIGDSNREESHRYIDMQDGSILGRDSFEPFRQRIGGEHISESSLNLKCPLCR 1057
                 +L   G ++R   HR      GS    +S    R+R+  E  SE   NLKCPLCR
Sbjct: 91   ----ESLNEHGTASRSSFHRE-STNRGSAWDSESLRR-RRRVDEEEQSEDITNLKCPLCR 144

Query: 1056 GSVKCWEIVEEARRYLDLKQRSCSLESCSFTGNYGELRRHARRVHPTNRPALVDQSRQRN 877
            G+V  W++VEE R YLDLK RSCS ESCSFTGNY +LRRHARR HPT RP+  D SR+R 
Sbjct: 145  GTVLGWKVVEEVRTYLDLKNRSCSRESCSFTGNYQDLRRHARRTHPTTRPSDTDPSRERA 204

Query: 876  WRHLENQQEYGDIVSAIRSAMPGAIVFGDYVIETGDGFSANRDNNSGESSGPLLTSFILY 697
            WRHLENQ+EYGDIVSAIRSAMPGA+V GDYVIE GD FS  R+  +G S   L T+ +L+
Sbjct: 205  WRHLENQREYGDIVSAIRSAMPGAVVVGDYVIENGDRFSGERETGNGGSD--LWTTLVLF 262

Query: 696  QMMRPF------SPVSEPXXXXXXXXXXXXXXXSLSDRRHLWGENLLGLQDD 559
            QM+         +  S                 S SDRR+LWGENLLGLQ++
Sbjct: 263  QMIGSLDNGGSSASGSGGGSRSHRSRAWRNHRRSSSDRRYLWGENLLGLQEE 314


>ref|XP_002265815.1| PREDICTED: uncharacterized protein LOC100263112 [Vitis vinifera]
          Length = 347

 Score =  303 bits (775), Expect = 2e-79
 Identities = 169/347 (48%), Positives = 212/347 (61%), Gaps = 1/347 (0%)
 Frame = -2

Query: 1596 MTDLKGSTSLGIDDSALHKEWDDALCPICMDHPHNAVLLLCSSFDNGCRPYICDSSYRHS 1417
            M   K S S   D  AL KEWDD  CPICMDHPHNAVLLLCSS + GCR YICD+SYRH+
Sbjct: 1    MAGKKQSMSTDADIHALPKEWDDVSCPICMDHPHNAVLLLCSSHEMGCRSYICDTSYRHA 60

Query: 1416 NCLDRFKKYFRSGPSQPDSVFIENPENPRLDTSESIFRASGSRNNLMEDXXXXXXXXXNT 1237
            NCLDRFK   R G + P++        P   T+   + ++ S  NL              
Sbjct: 61   NCLDRFK---RLGANLPNTSL-----QPSSSTTNQSYSSNASIVNL-----------GLR 101

Query: 1236 ASIARTLEVIGDSNREESHRYIDMQDGSILGRDSFEPFRQRIGGEHISESSLNLKCPLCR 1057
              I  T E  G+ N  E +  + ++           P R  +  E+ SE SL+L CPLCR
Sbjct: 102  LGIDST-EAHGNGNPNEGNGLLSVRI----------PRRSELNAENSSELSLSLTCPLCR 150

Query: 1056 GSVKCWEIVEEARRYLDLKQRSCSLESCSFTGNYGELRRHARRVHPTNRPALVDQSRQRN 877
            G+V  W++VEEAR  L+LK RSCS ESCSF+GNY ELRRHARRVHPT RPA +D SR+R+
Sbjct: 151  GAVLGWKVVEEARESLNLKPRSCSRESCSFSGNYRELRRHARRVHPTTRPADIDPSRERS 210

Query: 876  WRHLENQQEYGDIVSAIRSAMPGAIVFGDYVIETGDGFSANRDNNSGESSGPLLTSFILY 697
            WR LE+Q+E+GDI+SAIRSAMPGAIV GDY IE+ D  +  R++ + E +GP  T+F  +
Sbjct: 211  WRRLEHQREHGDIISAIRSAMPGAIVLGDYAIESEDMLAGGRESGNEEGNGPWWTTFFWF 270

Query: 696  QMMRPFSPVSEP-XXXXXXXXXXXXXXXSLSDRRHLWGENLLGLQDD 559
            QM+   +  +EP                +L+ RR LWGENLLGLQDD
Sbjct: 271  QMIGSINSAAEPRSRSRALTRRRQSARAALTRRRFLWGENLLGLQDD 317


Top