BLASTX nr result

ID: Coptis23_contig00005430

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00005430
         (1499 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI40233.3| unnamed protein product [Vitis vinifera]              250   6e-64
ref|XP_002533661.1| hypothetical protein RCOM_0152200 [Ricinus c...   219   2e-54
ref|XP_004140985.1| PREDICTED: uncharacterized protein LOC101207...   209   1e-51
ref|XP_002311037.1| predicted protein [Populus trichocarpa] gi|2...   208   3e-51
ref|XP_002458794.1| hypothetical protein SORBIDRAFT_03g040400 [S...   126   2e-26

>emb|CBI40233.3| unnamed protein product [Vitis vinifera]
          Length = 682

 Score =  250 bits (639), Expect = 6e-64
 Identities = 159/385 (41%), Positives = 224/385 (58%), Gaps = 8/385 (2%)
 Frame = -1

Query: 1499 MEESTSMTIEFLRARLLSERSVSRTARHRADELAKRVVELEELLTHVSEQRKKAEKATAE 1320
            ME+ST+MTIEFLRARLLSERSVSRTAR RADELA+RV +LEE L  VS QR KAEKATA+
Sbjct: 1    MEDSTAMTIEFLRARLLSERSVSRTARQRADELAQRVWKLEEQLKIVSIQRNKAEKATAD 60

Query: 1319 VLAILESNGISDIXXXXXXXXXXXXXXXXXXDTRKEDANITTSGAGDXXXXXXXXXXXSW 1140
            VLAILE++ ISD+                   +  ++  +  S  G             W
Sbjct: 61   VLAILENHAISDVSWEFDS-------------SSDQEVALCDSHVGGGRRLS-------W 100

Query: 1139 KSCSDAPNSVDTK------RRSTSIVCTSGSSTRPHIGRSCRQMKQKGARSMV-AVNNET 981
            KS  D+ +S++ +      RR  S   +  SS + ++G+SCRQ++++  RS V  +    
Sbjct: 101  KSSKDSSHSIEKRYLDCSIRRRHSFASSGSSSPKHNLGKSCRQIRRRETRSAVDELKVGR 160

Query: 980  VLVESQENKVARRSSDVSNCSSAMYEVLKESCKSEKDKVYFENLVQVLPEAQNEECKHGF 801
            V+V+SQ N +   S  + N   +  E+L+E  ++++++   +  V    E+Q +      
Sbjct: 161  VMVDSQNNGIISSSEGLPNGFDSGQEILREGSENQEEEALMDGQVSDSLESQRDATGSNH 220

Query: 800  NLEGSERVAEMERALEHQAQLIGRFEAEENAQREWEDKFRENNCYTPESCEHGNQSDITE 621
            +L  + R  +MERALEHQAQLIG++EAEE AQREWE+KFRENN  TP+SCE GN SD+TE
Sbjct: 221  HLNRNGRDRDMERALEHQAQLIGQYEAEEKAQREWEEKFRENNSSTPDSCEPGNHSDVTE 280

Query: 620  ERDEIRAETSDAADTIPSFNQVGKSGLEEDCKNKDAMSKVFSDQILPHLHQDNRMLQEQQ 441
            ERDE++ +   AA  + S +Q G    +ED    +  S+        HLH D   LQEQ 
Sbjct: 281  ERDEVKPQAPSAAGILTSQDQ-GTKLDDEDVHFNEESSQTLPTISTTHLHGDMECLQEQN 339

Query: 440  YIISTVNQPQKSSPEFSFP-RQENL 369
                ++   +  +P+F FP  +ENL
Sbjct: 340  R--CSMLAYESLAPDFVFPMAKENL 362


>ref|XP_002533661.1| hypothetical protein RCOM_0152200 [Ricinus communis]
            gi|223526443|gb|EEF28720.1| hypothetical protein
            RCOM_0152200 [Ricinus communis]
          Length = 665

 Score =  219 bits (557), Expect = 2e-54
 Identities = 159/399 (39%), Positives = 212/399 (53%), Gaps = 15/399 (3%)
 Frame = -1

Query: 1499 MEESTSMTIEFLRARLLSERSVSRTARHRADELAKRVVELEELLTHVSEQRKKAEKATAE 1320
            ME+ST+MTIEFLRARLLSERSVSRTAR RADELA RV ELEE L  VS QR KAEKATA+
Sbjct: 17   MEDSTAMTIEFLRARLLSERSVSRTARQRADELATRVAELEEQLRIVSLQRMKAEKATAD 76

Query: 1319 VLAILESNGISDIXXXXXXXXXXXXXXXXXXDTR--KEDANITTS---------GAGDXX 1173
            +LAILE NGISDI                    R  KE+ +I +             D  
Sbjct: 77   ILAILEGNGISDISETFDSCSDRDTPCESKVGNRSSKEENSINSKVRNNDSEELSGSDFD 136

Query: 1172 XXXXXXXXXSWKSCSDAPNSV----DTKRRSTSIVCTSGSSTRPHIGRSCRQMKQKGARS 1005
                     SWK   ++P S+    D+  R  S   + GSS +   G+SCRQ+++K +R 
Sbjct: 137  FSSVPGRSLSWKGRKNSPRSLEKSKDSSMRRRSSFSSVGSSPKQRPGKSCRQIRRKESR- 195

Query: 1004 MVAVNNETVLVESQENKVARRSSDVSNCSSAMYEVLKESCKSEKDKVYFENLVQVLPEAQ 825
                    V  +  E++VA  S++  +CS        E  + E   +  ++    L   +
Sbjct: 196  -FEYKASPVKRDCPEDEVAATSANFPSCSDF------EPKRGEVKPLLEDSHSDCLGNER 248

Query: 824  NEECKHGFNLEGSERVAEMERALEHQAQLIGRFEAEENAQREWEDKFRENNCYTPESCEH 645
            N    +G +        +ME+ALEHQAQLIG++EA E  QREWE+KFRENN  TP+SC+H
Sbjct: 249  NAS-DNGLDYNVYRGDRDMEKALEHQAQLIGQYEAMEKVQREWEEKFRENNSSTPDSCDH 307

Query: 644  GNQSDITEERDEIRAETSDAADTIPSFNQVGKSGLEEDCKNKDAMSKVFSDQILPHLHQD 465
            GN+SDITEER EIR      A T    N +   GL       + +S       LP  H D
Sbjct: 308  GNRSDITEERYEIREPAKGPATT----NAIQTEGL---LSVVEGVSNTQPHGFLPSSHVD 360

Query: 464  NRMLQEQQYIISTVNQPQKSSPEFSFPRQENLEARTNEK 348
               L+E++  I+ V  P+ S+ + +FP     +A+ N+K
Sbjct: 361  AVCLEERKSSIAPV--PEFSTQDSAFPM---AKAKQNQK 394


>ref|XP_004140985.1| PREDICTED: uncharacterized protein LOC101207733 [Cucumis sativus]
          Length = 671

 Score =  209 bits (533), Expect = 1e-51
 Identities = 154/391 (39%), Positives = 217/391 (55%), Gaps = 19/391 (4%)
 Frame = -1

Query: 1499 MEESTSMTIEFLRARLLSERSVSRTARHRADELAKRVVELEELLTHVSEQRKKAEKATAE 1320
            +E++T+MTIEFLRARLLSERSVS++AR RADELAKRV ELEE L  VS QRK AEKATA+
Sbjct: 17   VEDTTAMTIEFLRARLLSERSVSKSARQRADELAKRVAELEEQLKIVSLQRKMAEKATAD 76

Query: 1319 VLAILESNGISDIXXXXXXXXXXXXXXXXXXDTRKEDANITTS---------GAGDXXXX 1167
            VLAILE NG SDI                     +ED +  T             +    
Sbjct: 77   VLAILEDNGASDISETLDSNSDHETEPKVEDGLAREDVSSGTVRRRNEHEEYSGSNIDTS 136

Query: 1166 XXXXXXXSWKSCSDAPNSVDTKR----RSTSIVCTSGSSTRPH-IGRSCRQMKQKGARSM 1002
                   SWK  +D+P++ +  +    RS S   + GSS+  H +GRSCRQ+K++  R +
Sbjct: 137  PVLGGSLSWKGRNDSPHTREKYKKHSIRSRSSFTSIGSSSPKHQLGRSCRQIKRRDTRPL 196

Query: 1001 VAVNN--ETVLVESQENKVARRSSDVSNCSSAMYEVLKESCK-SEKDKVYFENLVQVLPE 831
                      LV+S E   +    D  N S   + +L++  +  EK +     +   +  
Sbjct: 197  DGEQELKSDALVDSSEEIPSTSLEDSQNYSVNGHSILRDGYEVREKTRSSSSGVHNSVGN 256

Query: 830  AQNEECKHGFNLEGSERVAEMERALEHQAQLIGRFEAEENAQREWEDKFRENNCYTPESC 651
            +  +      +++G E+V +ME+AL+ QAQLI ++EA E AQREWE+KFRENN  TP+SC
Sbjct: 257  SDQDN-----DIDGYEKVDDMEKALKCQAQLIDQYEAMEKAQREWEEKFRENNNSTPDSC 311

Query: 650  EHGNQSDITEERDEIRAETSDAADTIPSFNQVGKSGLEEDCKNKDAMSKVFSDQILPHL- 474
            + GN SDITEERDE+RA+  + ++  P+     K  +  DC  +D +S+  ++ + P + 
Sbjct: 312  DPGNHSDITEERDEMRAQAPNLSNN-PA--NEAKPQVAFDCDTRD-LSQAQTNGLGPSMC 367

Query: 473  HQDNRMLQEQQ-YIISTVNQPQKSSPEFSFP 384
              D   LQ+Q    IST     KS  EF+FP
Sbjct: 368  AVDVEDLQDQNTNSIST----SKSLEEFTFP 394


>ref|XP_002311037.1| predicted protein [Populus trichocarpa] gi|222850857|gb|EEE88404.1|
            predicted protein [Populus trichocarpa]
          Length = 684

 Score =  208 bits (529), Expect = 3e-51
 Identities = 161/404 (39%), Positives = 220/404 (54%), Gaps = 20/404 (4%)
 Frame = -1

Query: 1499 MEESTSMTIEFLRARLLSERSVSRTARHRADELAKRVVELEELLTHVSEQRKKAEKATAE 1320
            ME+ST++TIEFLRARLL+ERSVSRTAR RADELA+RV ELEE L  VS QR KAEKAT +
Sbjct: 17   MEDSTAITIEFLRARLLAERSVSRTARQRADELAERVAELEEQLRIVSLQRMKAEKATVD 76

Query: 1319 VLAILESNGISDIXXXXXXXXXXXXXXXXXXD--TRKEDANITT----------SGAGDX 1176
            VLAILESNGISD                      T++E++++ +          SG+G  
Sbjct: 77   VLAILESNGISDDSEIFGSSSDQDTPCESKVGKKTKQEESSVISKVTKYKLEEHSGSGHD 136

Query: 1175 XXXXXXXXXXSWKSCSDAPNSVD-----TKRRSTSIVCTSGSSTRPHIGRSCRQMKQKGA 1011
                       WK    +P S++     + RR +S   TS SS + H G+SCRQ++ K +
Sbjct: 137  FSSSQGRNLS-WKGRKHSPRSLEKCKDPSLRRRSSFASTS-SSPKHHQGKSCRQVRNKES 194

Query: 1010 RSMV-AVNNETVLVESQENKVARRSSDVSNCSSAMYEVLKESCKSEKDKVYFENLVQVLP 834
            R  + A       V+S EN VA  S    NCS    EV +     EK        ++   
Sbjct: 195  RLTIGAFRTNPDKVDSPENGVATTSEVFPNCSEP--EVGRIENGEEKTLPPISVGLENGQ 252

Query: 833  EAQNEECKHGFNLEGSERVAEMERALEHQAQLIGRFEAEENAQREWEDKFRENNCYTPES 654
             A + E +   N+ GS+R  +ME+ALEHQAQLI R++A E  QREWE+KFRENN  TP+S
Sbjct: 253  RADSNELED--NVYGSDR--DMEKALEHQAQLIDRYKAMEKVQREWEEKFRENNGSTPDS 308

Query: 653  CEHGNQSDITEERDEIRAETSDAADTIPSFNQVGKSGLEEDCKNKDAMSKVFSDQILPHL 474
             + GN+SD+TEE  EI+A+      T+ + +   KS +E+        S +  + IL   
Sbjct: 309  YDAGNRSDVTEEGYEIKAQVQQHTGTVAAQSNRAKSEVEK-------ASNIQPNGILRPS 361

Query: 473  HQDNRMLQEQQYIISTVNQPQKSSP--EFSFPRQENLEARTNEK 348
            H +   LQE +    + + P   SP  +F+F R E  +   NE+
Sbjct: 362  HVNIGQLQEWK----SSSAPTSESPAQDFAF-RAEKQKQNENEE 400


>ref|XP_002458794.1| hypothetical protein SORBIDRAFT_03g040400 [Sorghum bicolor]
            gi|241930769|gb|EES03914.1| hypothetical protein
            SORBIDRAFT_03g040400 [Sorghum bicolor]
          Length = 745

 Score =  126 bits (316), Expect = 2e-26
 Identities = 103/313 (32%), Positives = 146/313 (46%), Gaps = 38/313 (12%)
 Frame = -1

Query: 1499 MEESTSMTIEFLRARLLSERSVSRTARHRADELAKRVVELEELLTHVSEQRKKAEKATAE 1320
            M +ST+MT++FLRARLLSERSVSR A+ RADELAKRV ELEE +  V+ QR++AE+A A+
Sbjct: 1    MADSTAMTVDFLRARLLSERSVSRAAKERADELAKRVAELEEQVRAVTAQRRQAERAAAD 60

Query: 1319 VLAILESNGI----------SDIXXXXXXXXXXXXXXXXXXDTRKEDANITTSG----AG 1182
            VLA+LES G           SD+                      E+     S     A 
Sbjct: 61   VLAVLESQGFGGHLSDVDDDSDVSGQDSGEVDDGKRRGDTAGATVEEGEREPSAAKGEAE 120

Query: 1181 DXXXXXXXXXXXSWKSCSDAPNSVDT-----KRRSTSIVCTSGSSTRPHIGRSCRQMKQK 1017
            D           SWK  S +P          +R    ++ +S SS +  +G+SCR+ K++
Sbjct: 121  DALSGTAQPGGLSWKGRSVSPRKARQLKQRHRRSFFYLLSSSDSSPKYRVGQSCRKNKRR 180

Query: 1016 GARSMVAVNNETVLVESQENKVARRSSDVSNCSSAMYEVLKE-----------SCKSEKD 870
                   V        +++      SS ++ C+S +    +            S K  +D
Sbjct: 181  -------VELRCGHTHTRQGPSFAVSSYLTTCASVLSNASRPMATEDDGGGGGSQKGRQD 233

Query: 869  KVYF--ENLVQVLPEAQNEECKHGFNLEGSERV------AEMERALEHQAQLIGRFEAEE 714
               F  +    +  E   +E   G    G + V       EMER LE QA+LIG++E EE
Sbjct: 234  GSDFTDDGQADMDGEVGGDERSSGGGGGGGQYVIRYEEDGEMERVLERQAELIGQYEEEE 293

Query: 713  NAQREWEDKFREN 675
             AQREWE ++ EN
Sbjct: 294  KAQREWEKQYNEN 306

