BLASTX nr result

ID: Coptis21_contig00023725

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00023725
         (1656 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI21104.3| unnamed protein product [Vitis vinifera]              234   7e-59
emb|CAN76638.1| hypothetical protein VITISV_027480 [Vitis vinifera]   234   7e-59
ref|XP_002519907.1| mixed-lineage leukemia protein, mll, putativ...   125   3e-26
ref|XP_003545448.1| PREDICTED: uncharacterized protein LOC100812...   122   2e-25
ref|XP_003549306.1| PREDICTED: uncharacterized protein LOC100816...    93   2e-16

>emb|CBI21104.3| unnamed protein product [Vitis vinifera]
          Length = 1111

 Score =  234 bits (596), Expect = 7e-59
 Identities = 161/439 (36%), Positives = 224/439 (51%), Gaps = 18/439 (4%)
 Frame = -2

Query: 1265 GGVTGKVSKSVPSIFDNGRLVHEDKSSTLNQSENLNRKIPVQNGCHASQWRDVPSKHI-- 1092
            GG+ GK S    + F    +V ++K+ + +Q+E    +   +  CHASQW+DVPSK I  
Sbjct: 2    GGMNGKPSMLFTTRFHKDHIVQKEKNISFHQNEKSKGQNHKKIDCHASQWKDVPSKVIVS 61

Query: 1091 -----------RVGSCTNVEKPTTVLDASNVNVQVVEIAVKRVDGT-HKAESLKEQQMSN 948
                        +G   N E    +    N   Q+ + A KR +G   +   LKEQ+MSN
Sbjct: 62   CDMKCVRPSVDGLGGRKNDEDQPAMYGRKNDEDQLADTAAKRFNGNLQEINCLKEQEMSN 121

Query: 947  AYSSCSAPAVTEITVEVNTAGSCTIDAGTDGFVNDHVVDEGSGIERCWSTDDELDSERST 768
              S CSAPAVT+ ++EVN   SCT+DAG  G  ND VVDE SGIE+CWS+DD LDSERS 
Sbjct: 122  ISSGCSAPAVTQASIEVNNMDSCTVDAGDTGCANDLVVDEASGIEKCWSSDDALDSERSA 181

Query: 767  ETIS-ACHTLHLSKGGRSSSCLPIRSSHGLVDNHRLQSPFGRKKVQKRLPTGNLGCESIL 591
            E +   C T  + +G  SS  L  +SS  L+D  + +  F  K+V+    TG    E   
Sbjct: 182  EFLGFTCKTSFIKEG--SSKALANQSSRSLIDELKFRDSFRWKRVRNESHTGLAIHEKNS 239

Query: 590  EVQPLGSILRTERRKRTMKWKRIDVSCPVSGLSSVQYDSP--IGDTDSHSCSSRGTQTLS 417
                +   L+T +RK+TMK K ++ S P SG SS  Y+     G  +  S S +   TL 
Sbjct: 240  HSPKIERGLKTRKRKKTMKMKMLNASFPASGFSSGHYEHTECAGSAEWRSFSYKDVDTLL 299

Query: 416  KPKHGVQRACGVPFNQPSGLKRRCSALSSAKTLSQNKDLCELANHHREWEDDSQTLLKDH 237
            + + G    CG     PS  KRR S LSSAK  S+ +D+ ++    RE ED  Q   K  
Sbjct: 300  QCELGTSHTCGACTIGPS-FKRRRSTLSSAKNFSRKRDVDKI-YADREGEDGYQAQSKGK 357

Query: 236  MNHHKVFKLSNGKKLKRRWTPDVSRE-CSSEGLNQGDTGEEAKYVSTVGAKDFSSNRADT 60
                 + ++S  K++    T +  R+ C  E  +     +  KY S    K+ S  + D 
Sbjct: 358  TEFLSIHEVSGAKRIGPDRTAEAFRQFCMQEPSHT----KAVKYNSVGCVKESSCLKLDV 413

Query: 59   FGKKAKPVVCGKSGVISNG 3
              ++ KPVVCGK GVISNG
Sbjct: 414  SNRREKPVVCGKYGVISNG 432


>emb|CAN76638.1| hypothetical protein VITISV_027480 [Vitis vinifera]
          Length = 578

 Score =  234 bits (596), Expect = 7e-59
 Identities = 161/439 (36%), Positives = 224/439 (51%), Gaps = 18/439 (4%)
 Frame = -2

Query: 1265 GGVTGKVSKSVPSIFDNGRLVHEDKSSTLNQSENLNRKIPVQNGCHASQWRDVPSKHI-- 1092
            GG+ GK S    + F    +V ++K+ + +Q+E    +   +  CHASQW+DVPSK I  
Sbjct: 2    GGMNGKPSMLFTTRFHKDHIVQKEKNISFHQNEKSKGQNHKKIDCHASQWKDVPSKVIVS 61

Query: 1091 -----------RVGSCTNVEKPTTVLDASNVNVQVVEIAVKRVDGT-HKAESLKEQQMSN 948
                        +G   N E    +    N   Q+ + A KR +G   +   LKEQ+MSN
Sbjct: 62   CDMKCVRPSVDGLGGRKNDEDQPAMYGRKNDEDQLADTAAKRFNGNLQEINCLKEQEMSN 121

Query: 947  AYSSCSAPAVTEITVEVNTAGSCTIDAGTDGFVNDHVVDEGSGIERCWSTDDELDSERST 768
              S CSAPAVT+ ++EVN   SCT+DAG  G  ND VVDE SGIE+CWS+DD LDSERS 
Sbjct: 122  ISSGCSAPAVTQASIEVNNMDSCTVDAGDTGCANDLVVDEASGIEKCWSSDDALDSERSA 181

Query: 767  ETIS-ACHTLHLSKGGRSSSCLPIRSSHGLVDNHRLQSPFGRKKVQKRLPTGNLGCESIL 591
            E +   C T  + +G  SS  L  +SS  L+D  + +  F  K+V+    TG    E   
Sbjct: 182  EFLGFTCKTSFIKEG--SSKALANQSSRSLIDELKFRDSFRWKRVRNESHTGLAIHEKNS 239

Query: 590  EVQPLGSILRTERRKRTMKWKRIDVSCPVSGLSSVQYDSP--IGDTDSHSCSSRGTQTLS 417
                +   L+T +RK+TMK K ++ S P SG SS  Y+     G  +  S S +   TL 
Sbjct: 240  HSPKIERGLKTRKRKKTMKMKMLNASFPASGFSSGHYEHTKCAGSAEWRSFSYKDVDTLL 299

Query: 416  KPKHGVQRACGVPFNQPSGLKRRCSALSSAKTLSQNKDLCELANHHREWEDDSQTLLKDH 237
            + + G    CG     PS  KRR S LSSAK  S+ +D+ ++    RE ED  Q   K  
Sbjct: 300  QCELGTSHTCGACTIGPS-FKRRRSTLSSAKNFSRKRDVDKI-YADREGEDGYQAQSKGK 357

Query: 236  MNHHKVFKLSNGKKLKRRWTPDVSRE-CSSEGLNQGDTGEEAKYVSTVGAKDFSSNRADT 60
                 + ++S  K++    T +  R+ C  E  +     +  KY S    K+ S  + D 
Sbjct: 358  TEFLSIHEVSGAKRIGPDRTAEAFRQFCMQEPSHT----KAVKYNSVGCVKESSCLKLDV 413

Query: 59   FGKKAKPVVCGKSGVISNG 3
              ++ KPVVCGK GVISNG
Sbjct: 414  SNRREKPVVCGKYGVISNG 432


>ref|XP_002519907.1| mixed-lineage leukemia protein, mll, putative [Ricinus communis]
            gi|223540953|gb|EEF42511.1| mixed-lineage leukemia
            protein, mll, putative [Ricinus communis]
          Length = 1125

 Score =  125 bits (315), Expect = 3e-26
 Identities = 118/397 (29%), Positives = 175/397 (44%), Gaps = 3/397 (0%)
 Frame = -2

Query: 1184 TLNQSENLNRKIPVQNGCHASQWRDVPSKHIRVG--SCTNVEKPTTVLDASNVNVQVVEI 1011
            + +Q   L  ++P     H SQW+DVP K  RV   +C      T++     +  Q+ + 
Sbjct: 26   SFDQCGMLKGELPKNATFHTSQWKDVPRKLKRVCEVACAKQSADTSLKREYKLG-QLGDN 84

Query: 1010 AVKRVDGT-HKAESLKEQQMSNAYSSCSAPAVTEITVEVNTAGSCTIDAGTDGFVNDHVV 834
            A    DG    A S KEQ MSN  S CS PAVT+ + E     S T+  G  G +N+ VV
Sbjct: 85   AANCFDGAVAAAASFKEQDMSNISSGCSTPAVTQASTEFTNVESSTV-VGNSGCINNLVV 143

Query: 833  DEGSGIERCWSTDDELDSERSTETISACHTLHLSKGGRSSSCLPIRSSHGLVDNHRLQSP 654
            DEGSGI++CWS+DD  +S+RS +   +    +L   G  ++ +  +SS  L+D  +L   
Sbjct: 144  DEGSGIDKCWSSDDAFESDRSADFHGSTCKKNLVYMGSHNTAVN-KSSRSLLDEVKLMDS 202

Query: 653  FGRKKVQKRLPTGNLGCESILEVQPLGSILRTERRKRTMKWKRIDVSCPVSGLSSVQYDS 474
               KK Q +   G          Q     L+T +RKR +  K  D     +         
Sbjct: 203  LTWKKGQNQKHNGITVHGKNNHSQEFDRGLKTGKRKREIIPKVSDAPLGTAAPMLHGKYP 262

Query: 473  PIGDTDSHSCSSRGTQTLSKPKHGVQRACGVPFNQPSGLKRRCSALSSAKTLSQNKDLCE 294
              G T    C S   Q +S  +   Q + G    + +     C   S +K+LS+N+DL  
Sbjct: 263  EYGGTADWPCLSENVQMVSAGQESSQTS-GAHCVKANPKDGNCMQ-SVSKSLSRNRDLHR 320

Query: 293  LANHHREWEDDSQTLLKDHMNHHKVFKLSNGKKLKRRWTPDVSRECSSEGLNQGDTGEEA 114
            L N   + E +    +    N  +V ++   KK +     D+S +   +   Q    +  
Sbjct: 321  LYN-AGDGEANPHNDINHDDNSCEVLEILGRKKFRSIHAADLSIQFQRQDCTQAVGEKAG 379

Query: 113  KYVSTVGAKDFSSNRADTFGKKAKPVVCGKSGVISNG 3
            KY S    K  +S+       KAKPV CGK G I NG
Sbjct: 380  KYDSLDRIK--ASSAQHLCHGKAKPVACGKYGEIVNG 414


>ref|XP_003545448.1| PREDICTED: uncharacterized protein LOC100812602 [Glycine max]
          Length = 1985

 Score =  122 bits (307), Expect = 2e-25
 Identities = 154/549 (28%), Positives = 248/549 (45%), Gaps = 17/549 (3%)
 Frame = -2

Query: 1598 NHYVDGNYEKLGGVTGTKNWCNFS--TPSRGISVEPDVAYPLSHEPLTDKQSLASLGRTE 1425
            NH      EKL  +TG  ++C+ S  +P    S E +     S++   ++ SL SLG  +
Sbjct: 772  NHESTVGLEKLASLTGMNSYCHLSGLSPRPLHSKEKESQCNHSYDLQNEETSL-SLGINK 830

Query: 1424 NSIMISGEHDVCGQREPSSFFPGKCSSAVHSISFEGNRVSRSVAPDDTSRTRFGGVTGKV 1245
            ++   S   + C ++  +  F GK + A      + N  S         + +    +G+ 
Sbjct: 831  DNTR-SSVFEKCSEQPSNICFGGKYTCAAQINCCKSNFFSGIEPLCYIIKQKLANASGET 889

Query: 1244 SKSVPSIFDNGRLVHEDKSSTLNQSENLNRKIPVQNGCHASQWRDVPSKHIRVGSC--TN 1071
            S  + S  D  R ++  K   + Q   L+ +  ++ G    QWRDVPSK +R   C  T+
Sbjct: 890  SLKMAS--DLSRDMNSFKGENIEQGGKLDGQDSIKIGFRTPQWRDVPSK-VRKAVCDATS 946

Query: 1070 VEKPTTVLD-ASNVNVQVVEIAVKRVDGT-HKAESLKEQQMSNAYSSCSAPAVTEITVEV 897
            + +  T +D     +VQ+  I++KR   T    +  KEQ+ SN  S CSAP VT+ ++EV
Sbjct: 947  LGQTATGMDWEGQDSVQLGNISMKRFKRTIDMGDMSKEQENSNVSSGCSAPVVTQASLEV 1006

Query: 896  NTAGSCTIDAGTDGFVNDHVVDEGSGIERCWSTDDELDSERSTETISACHTLHLSKGGRS 717
            N    C  DA   GFVN+ VVDEGSGI++ WS+D     E+S E +          G  S
Sbjct: 1007 NKIEPCMGDAVDTGFVNNLVVDEGSGIDKGWSSD---LVEKSDEFL----------GSSS 1053

Query: 716  SSCLP---IRSSHG------LVDNHRLQSPFGRKKVQKRLPTGNLGCESILEVQPLGSIL 564
             SCL    +R  +       L D   L S   +K   +     +  C+S  + Q +   L
Sbjct: 1054 GSCLKNDYLRVLNDQPCCNLLDDLKLLDSLIWKKGWNQNNFVLSSNCKS-NQSQKVKKGL 1112

Query: 563  RTERRKRTM-KWKRIDVSCPVSGLSSVQYDSPIGDTDSHSCSSRGTQTLSKPKHGVQRAC 387
            + ++RKR + +     +S     L   + +   G  +S S  S+  Q   +P   +Q++ 
Sbjct: 1113 KGKKRKRNLVRILDASLSSEFPSLLHKKNEEVTGICNSSSSCSKEMQ--MRPLSSLQKSS 1170

Query: 386  G-VPFNQPSGLKRRCSALSSAKTLSQNKDLCELANHHREWEDDSQTLLKDHMNHHKVFKL 210
                F QPS  K++ +A SS K LS    L    N H+ ++   ++          +  +
Sbjct: 1171 NKSSFVQPSN-KQKHTAFSS-KFLSCKNHL----NKHQSYKVGYESESSSDAEFRTLPGV 1224

Query: 209  SNGKKLKRRWTPDVSRECSSEGLNQGDTGEEAKYVSTVGAKDFSSNRADTFGKKAKPVVC 30
            S  KKLK+    D++ +C  +   Q    EE +       + FS  R +   +  +PVVC
Sbjct: 1225 SGSKKLKK----DLTSDCFEQFQMQEPAYEEPE---NDKLRPFSC-RKENAHRITRPVVC 1276

Query: 29   GKSGVISNG 3
            GK G IS+G
Sbjct: 1277 GKYGEISSG 1285


>ref|XP_003549306.1| PREDICTED: uncharacterized protein LOC100816713 [Glycine max]
          Length = 992

 Score = 92.8 bits (229), Expect = 2e-16
 Identities = 99/325 (30%), Positives = 150/325 (46%), Gaps = 3/325 (0%)
 Frame = -2

Query: 968 KEQQMSNAYSSCSAPAVTEITVEVNTAGSCTIDAGTDGFVNDHVVDEGSGIERCWSTD-D 792
           KEQ+ SN  S CSAP VT+ +VEVN   SCT DA   GFVN+ VVDEGSGI++ WS+D  
Sbjct: 15  KEQKSSNVSSGCSAPVVTQASVEVNKIDSCTDDAVDTGFVNNLVVDEGSGIDQGWSSDLV 74

Query: 791 ELDSERSTETISACHTLHLSKGGRSSSCLPIRSSHGLVDNHRLQSPFGRKKVQKRLPTGN 612
           E   E    T  +C      +      C  +     L D   L S   +K   +     +
Sbjct: 75  ERSDEFLGSTTGSCLKNDYLRVLYDQPCCNL-----LDDLKLLDSLIWKKGRNQNHFVLS 129

Query: 611 LGCESILEVQPLGSILRTERRKRTMKWKRIDVSCPVSGLSSVQYDSPIGDTDSHSCSSRG 432
             C++  + Q +  +L+ ++RKR +  + +D S   S L   + +   G  +S S  SR 
Sbjct: 130 SNCKT-NQSQKVKKVLKGKKRKRNVV-RIVDAS---SSLLHKKNEEGAGICNSSSSLSRE 184

Query: 431 TQ--TLSKPKHGVQRACGVPFNQPSGLKRRCSALSSAKTLSQNKDLCELANHHREWEDDS 258
            Q  +LS  K    ++    F QPS  K++ +A SS     +N+      N H+ ++   
Sbjct: 185 MQMHSLSSLKKSSNKS---SFVQPSN-KQKHTAYSSKFLSCKNR-----LNKHQSFKVGY 235

Query: 257 QTLLKDHMNHHKVFKLSNGKKLKRRWTPDVSRECSSEGLNQGDTGEEAKYVSTVGAKDFS 78
           ++        H +  +S  KKL++    D+S +C  +   Q    EE +       + FS
Sbjct: 236 ESESSSDAEFHTLPGVSGTKKLEK----DLSSDCFEQFQMQELAYEEPE---NDKLRPFS 288

Query: 77  SNRADTFGKKAKPVVCGKSGVISNG 3
             + +        VVCGK G ISNG
Sbjct: 289 CRKENAHRITRPVVVCGKYGEISNG 313

