BLASTX nr result

ID: Coptis21_contig00009935

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00009935
         (2188 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002521120.1| conserved hypothetical protein [Ricinus comm...   317   1e-83
ref|XP_004139654.1| PREDICTED: uncharacterized protein LOC101208...   308   3e-81
ref|XP_004139655.1| PREDICTED: uncharacterized protein LOC101208...   307   9e-81
ref|XP_002883554.1| hypothetical protein ARALYDRAFT_479993 [Arab...   304   6e-80
ref|XP_002265815.1| PREDICTED: uncharacterized protein LOC100263...   303   2e-79

>ref|XP_002521120.1| conserved hypothetical protein [Ricinus communis]
            gi|223539689|gb|EEF41271.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 386

 Score =  317 bits (811), Expect = 1e-83
 Identities = 175/363 (48%), Positives = 219/363 (60%), Gaps = 14/363 (3%)
 Frame = -3

Query: 1568 MTDMKGSTSLGIDDSALHKEWDDALCPICMDHPHNAVLLLCSSFDNGCRPYICDSSYRHS 1389
            MT +K S     D   LH E D+  CPICMDHPHNAVLLLCSS + GCR YICD+S RHS
Sbjct: 1    MTGVKRSRYTDSDIRTLHNELDEVSCPICMDHPHNAVLLLCSSHEKGCRSYICDTSSRHS 60

Query: 1388 NCLDRFKKYFRSGPSQPDSVFIENPENPRLDTSESIFRASGSRNN---------LMEDXX 1236
            NCLDR+KK   S  S           N  LD+S  I   S S  +         +++   
Sbjct: 61   NCLDRYKKLRDSSGS-----------NTTLDSSLPINSFSSSNISDTSLTLGARVLDSYE 109

Query: 1235 XXXXXXXNTASIARTLEVIGDSNREESHRYIDMQDGSILGRDSFEPFRQRIGGE-----H 1071
                   +  +  R  E + +++ +  +R ++ +   +L     E F  RI  E     +
Sbjct: 110  NHNQSDSDNITSVRMPEQLLENSIQHPNRQVETRGEGVLEAGDSESFPDRIELEEADVVN 169

Query: 1070 MSESSLNLKCPLCRGSVKCWEIVEEARRYLDLKQRSCSLESCSFTGNYGELRRHARRVHP 891
             SE+ L+LKCPLCRG+V  WE+VEEAR+YL+LK+RSCS ESCSF GNY ELRRHARRVHP
Sbjct: 170  SSEAGLSLKCPLCRGAVLGWEVVEEARKYLNLKKRSCSRESCSFCGNYQELRRHARRVHP 229

Query: 890  TNRPALVDQSRQRNWRDLENQQEYGDIVSAIRSAMPGAIVFGDYVIETGDGFSANRDNNS 711
            T RP+ VD SR+R WR LE Q+EYGDIVSA+RSAMPGA+V GDYVIE GD FS  R+  +
Sbjct: 230  TTRPSDVDPSRERAWRCLERQREYGDIVSALRSAMPGAVVVGDYVIENGDRFSVEREGGA 289

Query: 710  GESSGPLLTSFILYQMMRPFSPVSEPXXXXXXXXXXXXXXXSLSDRRHLWGENLLGLQDD 531
            GE + P  T+F L+QM+      +EP               +L +RR LWGENLLGLQDD
Sbjct: 290  GEVNAPWWTTFFLFQMIGSIDGAAEPRARSRAWTRHRRSGGALPERRFLWGENLLGLQDD 349

Query: 530  AAD 522
              D
Sbjct: 350  DED 352


>ref|XP_004139654.1| PREDICTED: uncharacterized protein LOC101208460 isoform 1 [Cucumis
            sativus]
          Length = 389

 Score =  308 bits (790), Expect = 3e-81
 Identities = 173/367 (47%), Positives = 218/367 (59%), Gaps = 8/367 (2%)
 Frame = -3

Query: 1571 KMTDMKGSTSLGIDDSALHKEWDDALCPICMDHPHNAVLLLCSSFDNGCRPYICDSSYRH 1392
            KM  +K       D  ALHKE D+  CPICMDHPHNAVLLLCSS   GC+PYICD+S+RH
Sbjct: 3    KMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRH 62

Query: 1391 SNCLDRFKKYFRSGPSQPDSVFIENPENPRLDTSESIFRASGSRNNLMEDXXXXXXXXXN 1212
            SNC D+FKK  R    +   +    P NP   ++ S      S +    D          
Sbjct: 63   SNCFDQFKK-LREETRKSPRLSSPLPINPYSFSNPSTNNLGLSIDLNEVDDNQNINERNT 121

Query: 1211 TASIARTLEVIGDSNREESHRYIDMQDGSILGRDSFEPFRQRIGGEHM----SESSLNLK 1044
             AS       +GD+  E S+R +D  +   +         +R+  E +    S    NLK
Sbjct: 122  VASAGLPGLALGDNGTENSNRTVDTNEAGDMDTAGSGSITERVDQEGLDAGNSSEYSNLK 181

Query: 1043 CPLCRGSVKCWEIVEEARRYLDLKQRSCSLESCSFTGNYGELRRHARRVHPTNRPALVDQ 864
            CP+CRG+V   E++EEAR YL+LK+RSCS E+CSF+GNY ELRRHARRVHPT+RPA++D 
Sbjct: 182  CPMCRGAVLGLEVIEEAREYLNLKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDP 241

Query: 863  SRQRNWRDLENQQEYGDIVSAIRSAMPGAIVFGDYVIETGDGFSA-NRDNNSGESSGPLL 687
            SR+R WR LE Q+E GD+VSAIRSAMPGA+V GDYVIE GDG  A  RDN +G+ +GPLL
Sbjct: 242  SRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLL 301

Query: 686  TSFILYQMMRPFSPVSE--PXXXXXXXXXXXXXXXSLSDRRHLWGENLLGLQDDA-ADWN 516
            TSF L+ M        E  P                +S+RR LWGENLLGLQ+D   D+ 
Sbjct: 302  TSFFLFHMFGSVEGAREPRPRSRSWVRHRRSGGGTPVSERRFLWGENLLGLQEDTDEDFR 361

Query: 515  LSNQMGE 495
            +   MG+
Sbjct: 362  IYIGMGD 368


>ref|XP_004139655.1| PREDICTED: uncharacterized protein LOC101208460 isoform 2 [Cucumis
            sativus] gi|449443782|ref|XP_004139656.1| PREDICTED:
            uncharacterized protein LOC101208460 isoform 3 [Cucumis
            sativus] gi|449527327|ref|XP_004170663.1| PREDICTED:
            uncharacterized protein LOC101225264 isoform 1 [Cucumis
            sativus] gi|449527329|ref|XP_004170664.1| PREDICTED:
            uncharacterized protein LOC101225264 isoform 2 [Cucumis
            sativus]
          Length = 386

 Score =  307 bits (786), Expect = 9e-81
 Identities = 170/354 (48%), Positives = 214/354 (60%), Gaps = 8/354 (2%)
 Frame = -3

Query: 1532 DDSALHKEWDDALCPICMDHPHNAVLLLCSSFDNGCRPYICDSSYRHSNCLDRFKKYFRS 1353
            D  ALHKE D+  CPICMDHPHNAVLLLCSS   GC+PYICD+S+RHSNC D+FKK  R 
Sbjct: 13   DILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKK-LRE 71

Query: 1352 GPSQPDSVFIENPENPRLDTSESIFRASGSRNNLMEDXXXXXXXXXNTASIARTLEVIGD 1173
               +   +    P NP   ++ S      S +    D           AS       +GD
Sbjct: 72   ETRKSPRLSSPLPINPYSFSNPSTNNLGLSIDLNEVDDNQNINERNTVASAGLPGLALGD 131

Query: 1172 SNREESHRYIDMQDGSILGRDSFEPFRQRIGGEHM----SESSLNLKCPLCRGSVKCWEI 1005
            +  E S+R +D  +   +         +R+  E +    S    NLKCP+CRG+V   E+
Sbjct: 132  NGTENSNRTVDTNEAGDMDTAGSGSITERVDQEGLDAGNSSEYSNLKCPMCRGAVLGLEV 191

Query: 1004 VEEARRYLDLKQRSCSLESCSFTGNYGELRRHARRVHPTNRPALVDQSRQRNWRDLENQQ 825
            +EEAR YL+LK+RSCS E+CSF+GNY ELRRHARRVHPT+RPA++D SR+R WR LE Q+
Sbjct: 192  IEEAREYLNLKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQR 251

Query: 824  EYGDIVSAIRSAMPGAIVFGDYVIETGDGFSA-NRDNNSGESSGPLLTSFILYQMMRPFS 648
            E GD+VSAIRSAMPGA+V GDYVIE GDG  A  RDN +G+ +GPLLTSF L+ M     
Sbjct: 252  EVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLLTSFFLFHMFGSVE 311

Query: 647  PVSE--PXXXXXXXXXXXXXXXSLSDRRHLWGENLLGLQDDA-ADWNLSNQMGE 495
               E  P                +S+RR LWGENLLGLQ+D   D+ +   MG+
Sbjct: 312  GAREPRPRSRSWVRHRRSGGGTPVSERRFLWGENLLGLQEDTDEDFRIYIGMGD 365


>ref|XP_002883554.1| hypothetical protein ARALYDRAFT_479993 [Arabidopsis lyrata subsp.
            lyrata] gi|297329394|gb|EFH59813.1| hypothetical protein
            ARALYDRAFT_479993 [Arabidopsis lyrata subsp. lyrata]
          Length = 354

 Score =  304 bits (779), Expect = 6e-80
 Identities = 175/352 (49%), Positives = 204/352 (57%), Gaps = 6/352 (1%)
 Frame = -3

Query: 1568 MTDMKGSTSLGIDDSALHKEWDDALCPICMDHPHNAVLLLCSSFDNGCRPYICDSSYRHS 1389
            M  +K   S   D  ALHKE D+  CP+CMDHPHNAVLLLCSS D GCR YICD+SYRHS
Sbjct: 1    MAGVKRKLSTESDVHALHKELDEVSCPVCMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 60

Query: 1388 NCLDRFKKYFRSGPSQPDSVFIENPENPRLDTSESIFRASGSRNNLMEDXXXXXXXXXNT 1209
            NCLDRFKK     P+ P       PE            AS   NN               
Sbjct: 61   NCLDRFKKLHSESPNDP------TPEGNL---------ASRENNN--------------- 90

Query: 1208 ASIARTLEVIGDSNREESHRYIDMQDGSILGRDSFEPFRQRIGGEHMSESSLNLKCPLCR 1029
                 +L   G ++R   HR      GS    +S    R+R+  E  SE   NLKCPLCR
Sbjct: 91   ----ESLNEHGTASRSSFHRE-STNRGSAWDSESLRR-RRRVDEEEQSEDITNLKCPLCR 144

Query: 1028 GSVKCWEIVEEARRYLDLKQRSCSLESCSFTGNYGELRRHARRVHPTNRPALVDQSRQRN 849
            G+V  W++VEE R YLDLK RSCS ESCSFTGNY +LRRHARR HPT RP+  D SR+R 
Sbjct: 145  GTVLGWKVVEEVRTYLDLKNRSCSRESCSFTGNYQDLRRHARRTHPTTRPSDTDPSRERA 204

Query: 848  WRDLENQQEYGDIVSAIRSAMPGAIVFGDYVIETGDGFSANRDNNSGESSGPLLTSFILY 669
            WR LENQ+EYGDIVSAIRSAMPGA+V GDYVIE GD FS  R+  +G S   L T+ +L+
Sbjct: 205  WRHLENQREYGDIVSAIRSAMPGAVVVGDYVIENGDRFSGERETGNGGSD--LWTTLVLF 262

Query: 668  QMMRPF------SPVSEPXXXXXXXXXXXXXXXSLSDRRHLWGENLLGLQDD 531
            QM+         +  S                 S SDRR+LWGENLLGLQ++
Sbjct: 263  QMIGSLDNGGSSASGSGGGSRSHRSRAWRNHRRSSSDRRYLWGENLLGLQEE 314


>ref|XP_002265815.1| PREDICTED: uncharacterized protein LOC100263112 [Vitis vinifera]
          Length = 347

 Score =  303 bits (775), Expect = 2e-79
 Identities = 169/347 (48%), Positives = 212/347 (61%), Gaps = 1/347 (0%)
 Frame = -3

Query: 1568 MTDMKGSTSLGIDDSALHKEWDDALCPICMDHPHNAVLLLCSSFDNGCRPYICDSSYRHS 1389
            M   K S S   D  AL KEWDD  CPICMDHPHNAVLLLCSS + GCR YICD+SYRH+
Sbjct: 1    MAGKKQSMSTDADIHALPKEWDDVSCPICMDHPHNAVLLLCSSHEMGCRSYICDTSYRHA 60

Query: 1388 NCLDRFKKYFRSGPSQPDSVFIENPENPRLDTSESIFRASGSRNNLMEDXXXXXXXXXNT 1209
            NCLDRFK   R G + P++        P   T+   + ++ S  NL              
Sbjct: 61   NCLDRFK---RLGANLPNTSL-----QPSSSTTNQSYSSNASIVNL-----------GLR 101

Query: 1208 ASIARTLEVIGDSNREESHRYIDMQDGSILGRDSFEPFRQRIGGEHMSESSLNLKCPLCR 1029
              I  T E  G+ N  E +  + ++           P R  +  E+ SE SL+L CPLCR
Sbjct: 102  LGIDST-EAHGNGNPNEGNGLLSVRI----------PRRSELNAENSSELSLSLTCPLCR 150

Query: 1028 GSVKCWEIVEEARRYLDLKQRSCSLESCSFTGNYGELRRHARRVHPTNRPALVDQSRQRN 849
            G+V  W++VEEAR  L+LK RSCS ESCSF+GNY ELRRHARRVHPT RPA +D SR+R+
Sbjct: 151  GAVLGWKVVEEARESLNLKPRSCSRESCSFSGNYRELRRHARRVHPTTRPADIDPSRERS 210

Query: 848  WRDLENQQEYGDIVSAIRSAMPGAIVFGDYVIETGDGFSANRDNNSGESSGPLLTSFILY 669
            WR LE+Q+E+GDI+SAIRSAMPGAIV GDY IE+ D  +  R++ + E +GP  T+F  +
Sbjct: 211  WRRLEHQREHGDIISAIRSAMPGAIVLGDYAIESEDMLAGGRESGNEEGNGPWWTTFFWF 270

Query: 668  QMMRPFSPVSEP-XXXXXXXXXXXXXXXSLSDRRHLWGENLLGLQDD 531
            QM+   +  +EP                +L+ RR LWGENLLGLQDD
Sbjct: 271  QMIGSINSAAEPRSRSRALTRRRQSARAALTRRRFLWGENLLGLQDD 317

