BLASTX nr result

ID: Coptis23_contig00004776 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00004776
         (1821 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Viti...   546   e-153
emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]   542   e-151
ref|XP_002331599.1| predicted protein [Populus trichocarpa] gi|2...   537   e-150
ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glyc...   536   e-150
ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glyc...   531   e-148

>ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera]
            gi|296086874|emb|CBI33041.3| unnamed protein product
            [Vitis vinifera]
          Length = 486

 Score =  546 bits (1407), Expect = e-153
 Identities = 280/472 (59%), Positives = 331/472 (70%), Gaps = 12/472 (2%)
 Frame = +2

Query: 290  DLTTSEKKASET-CSKVSYWXXXXXXXXXXXXXXXXXXXXXCCEKKTSHGRTYGWTFSLK 466
            D + SE ++S+T  SKVS W                       +K   +GR  GWT +++
Sbjct: 20   DSSNSEPQSSDTKLSKVSLWSSVFASAFSVFETNSESSPSASEKKAIDNGRNNGWTTAVR 79

Query: 467  RIVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSEESSDDACNTNGLYEFSVD 646
            ++V G SMRR+ ER++G           DIWLLG+CY++S EESS+ A ++NGL EF  D
Sbjct: 80   KVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEESSNHASSSNGLAEFEQD 139

Query: 647  FSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALLFHNLGRSWRKPMEKPFC 826
            FSSRI MTYRKGF+AIGDSK TSDVNWGCMLRSSQMLVAQALL H +GRSWRK   KP  
Sbjct: 140  FSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGRSWRKTSHKPMD 199

Query: 827  SEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAMCRSWETLVNSTQQS--- 997
             +YIEIL  FGDS+ASAFSIHN+LQ GKAYGLAAGSW+GPYAMCRSWETL  S ++    
Sbjct: 200  QDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWETLARSKREETDL 259

Query: 998  DKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQAVWKPIXXXXXXXXXXDK 1177
            +  +L M IY+VSGDE+GERGGAPV+ IE+ ++ C EF +GQ  W PI          +K
Sbjct: 260  ECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQVDWTPILLLVPLVLGLEK 319

Query: 1178 VNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYLDPHEVQQVMDIKRNNSE 1357
            VNPRYIPSL ATFTFPQSLGILGGKPG STYIVGVQD+KAFYLDPHE Q V+DI+R N E
Sbjct: 320  VNPRYIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYLDPHEAQSVVDIRRENLE 379

Query: 1358 IDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARASKLTDASNGAPLLTITKS 1537
             DTSSYHC+++RH+ LD+IDPSLAIGFYCRDKDDFDDFC RASKL D SNGAPL T+   
Sbjct: 380  ADTSSYHCNIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADKSNGAPLFTVAHI 439

Query: 1538 HSSPKTV-CKDFLVHSVDDESHGDFEMDSMN-------ESEISTQEDDWQLL 1669
            HS PK + C D +     D+  G  E DS +       E      EDDWQLL
Sbjct: 440  HSLPKPISCSDGM-----DDCSGFREDDSFDVVSNKGAEGYEHEHEDDWQLL 486


>emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
          Length = 489

 Score =  542 bits (1396), Expect = e-151
 Identities = 281/475 (59%), Positives = 331/475 (69%), Gaps = 15/475 (3%)
 Frame = +2

Query: 290  DLTTSEKKASET-CSKVSYWXXXXXXXXXXXXXXXXXXXXXCCEKKTSHGRTYGWTFSLK 466
            D + SE ++S+T  SKVS W                       +K   +GR  GWT +++
Sbjct: 20   DSSNSEPQSSDTKLSKVSLWSSVFASAFSVFETNSESSPSASEKKAIDNGRNNGWTTAVR 79

Query: 467  RIVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSEESSDDACNTNGLYEFSVD 646
            ++V G SMRR+ ER++G           DIWLLG+CY++S EESS+ A ++NGL EF  D
Sbjct: 80   KVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEESSNHASSSNGLAEFEQD 139

Query: 647  FSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALLFHNLGRSWRKPMEKPFC 826
            FSSRI MTYRKGF+AIGDSK TSDVNWGCMLRSSQMLVAQALL H +GRSWRK   KP  
Sbjct: 140  FSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGRSWRKTSHKPMD 199

Query: 827  SEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAMCRSWETLVNSTQQS--- 997
             +YIEIL  FGDS+ASAFSIHN+LQ GKAYGLAAGSW+GPYAMCRSWETL  S ++    
Sbjct: 200  QDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWETLARSKREETDL 259

Query: 998  DKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQAVWKPIXXXXXXXXXXDK 1177
            +  +L M IY+VSGDE+GERGGAPV+ IE+ ++ C EF +GQ  W PI          +K
Sbjct: 260  ECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQVDWTPILLLVPLVLGLEK 319

Query: 1178 VNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYLDPHEVQQVMDIKRNNSE 1357
            VNPRYIPSL ATFTFPQSLGILGGKPG STYIVGVQD+KAFYLDPHE Q V+DI+R N E
Sbjct: 320  VNPRYIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYLDPHEAQSVVDIRRENLE 379

Query: 1358 IDTSSYHC---SVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARASKLTDASNGAPLLTI 1528
             DTSSYHC   S++RH+ LD+IDPSLAIGFYCRDKDDFDDFC RASKL D SNGAPL T+
Sbjct: 380  ADTSSYHCNCSSIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADESNGAPLFTV 439

Query: 1529 TKSHSSPKTV-CKDFLVHSVDDESHGDFEMDSMN-------ESEISTQEDDWQLL 1669
               HS PK + C D +     D+  G  E DS +       E      EDDWQLL
Sbjct: 440  AHIHSLPKPISCSDGM-----DDCSGFREDDSFDVVSNKGAEGYEHEHEDDWQLL 489


>ref|XP_002331599.1| predicted protein [Populus trichocarpa] gi|222873995|gb|EEF11126.1|
            predicted protein [Populus trichocarpa]
          Length = 482

 Score =  537 bits (1384), Expect = e-150
 Identities = 271/423 (64%), Positives = 316/423 (74%), Gaps = 5/423 (1%)
 Frame = +2

Query: 416  EKKTSHGRT-YGWTFSLKRIVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSE 592
            EKK  H R   GWT ++K+IV GGSMRR+ E ++G           DIWLLG CY++S +
Sbjct: 68   EKKAPHIRHGNGWTSAVKKIVAGGSMRRIQECVLGTSKTGISNTTGDIWLLGACYKISQD 127

Query: 593  ESSDDACNTNGLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQAL 772
             SS DA  TN L  F+ DFSSRI +TYRKGFDAI DSK TSDV+WGCMLRSSQMLVAQAL
Sbjct: 128  NSSGDAAATNALAAFNHDFSSRILITYRKGFDAIEDSKLTSDVSWGCMLRSSQMLVAQAL 187

Query: 773  LFHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYA 952
            LFH LGRSWRKP++KP   EY+EIL LFGDSE+SAFSIHNLL+ GKAYGLAAGSW+GPYA
Sbjct: 188  LFHRLGRSWRKPLDKPLDREYVEILHLFGDSESSAFSIHNLLRAGKAYGLAAGSWVGPYA 247

Query: 953  MCRSWETLVNSTQQS---DKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQ 1123
            +C SWE+LV S ++    +  +LSM +YVVSG E+GERGGAPVLCIE+ A+ CSEF +GQ
Sbjct: 248  VCHSWESLVRSRREETNLEYQSLSMAVYVVSGSEDGERGGAPVLCIEEAARHCSEFSKGQ 307

Query: 1124 AVWKPIXXXXXXXXXXDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFY 1303
              W PI          DK+NPRYIPSL ATFTFPQSLGILGGKPG STYIVGVQD+ AFY
Sbjct: 308  EDWTPILLLVPLVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVGVQDENAFY 367

Query: 1304 LDPHEVQQVMDIKRNNSEIDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARA 1483
            LDPHEVQ V+++ R++ E +TSSYHC+VVRH+PLD IDPSLAIGFYCRDKDDFDDFC  A
Sbjct: 368  LDPHEVQPVVNVSRDDVEANTSSYHCNVVRHMPLDLIDPSLAIGFYCRDKDDFDDFCTLA 427

Query: 1484 SKLTDASNGAPLLTITKSHSSPKTVCKDFLVH-SVDDESHGDFEMDSMNESEISTQEDDW 1660
            SKLTD SNGAPL T+  S        +  L H S +  S     + +MN+ E    EDDW
Sbjct: 428  SKLTDESNGAPLFTVAHS--------RKLLKHDSGEVRSDDSLGVMTMNDVEGCVHEDDW 479

Query: 1661 QLL 1669
            QLL
Sbjct: 480  QLL 482


>ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
          Length = 485

 Score =  536 bits (1382), Expect = e-150
 Identities = 261/419 (62%), Positives = 311/419 (74%), Gaps = 1/419 (0%)
 Frame = +2

Query: 416  EKKTSHGRTYGWTFSLKRIVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSEE 595
            EKK    R+ GW  +++++V GGSMRR  ER++G           DIWLLGVC+++S +E
Sbjct: 69   EKKAVPSRSSGWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCHKISQQE 128

Query: 596  SSDDACNTNGLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALL 775
            S+     +NGL  F  DFSS+I +TYRKGFDAIGD+K+TSDVNWGCMLRSSQMLVAQALL
Sbjct: 129  STGGVDTSNGLASFEQDFSSKILVTYRKGFDAIGDTKYTSDVNWGCMLRSSQMLVAQALL 188

Query: 776  FHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAM 955
            FH LGRSWRKP++KP   EYI++L LFGDSEASAFSIHNLLQ GK YGLA GSW+GPYAM
Sbjct: 189  FHKLGRSWRKPIDKPLDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSWVGPYAM 248

Query: 956  CRSWETLVNSTQQSDKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQAVWK 1135
            CR+WE L        +P L M IYVVSGDE+GERGGAPV+CIED +K CSEF  G AVW 
Sbjct: 249  CRTWEVLARKKNDLGEPPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCSEFSSGLAVWT 308

Query: 1136 PIXXXXXXXXXXDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYLDPH 1315
            P+          DKVNPRYIP L +TF FPQSLGI+GGKPG STYI+GVQ++KAFYLDPH
Sbjct: 309  PLLLLVPLVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGVQNEKAFYLDPH 368

Query: 1316 EVQQVMDIKRNNSE-IDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARASKL 1492
            +VQQV++I  +  E   TSSYHC+V+RH+PLD+IDPSLAIGFYCRDKDDFDDFC++ASKL
Sbjct: 369  DVQQVVNISGDTQEPTGTSSYHCNVMRHIPLDSIDPSLAIGFYCRDKDDFDDFCSQASKL 428

Query: 1493 TDASNGAPLLTITKSHSSPKTVCKDFLVHSVDDESHGDFEMDSMNESEISTQEDDWQLL 1669
             + SNGAPL T+ KS S  K V  D    +   +      MD  N++   T EDDWQLL
Sbjct: 429  AEESNGAPLFTVAKSRSFSKQVSNDVSGDNTGFQEDDFPGMDCGNDT--VTNEDDWQLL 485


>ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
          Length = 486

 Score =  531 bits (1368), Expect = e-148
 Identities = 259/420 (61%), Positives = 311/420 (74%), Gaps = 2/420 (0%)
 Frame = +2

Query: 416  EKKTSHGRTYGWTFSLKRIVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSEE 595
            EKK  H R+ GW  +++++V GGSMRR  ER++G           DIWLLGVC+++S +E
Sbjct: 69   EKKAVHSRSSGWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCHKISQQE 128

Query: 596  SSDDACNTNGLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALL 775
            SS    N+NGL  F  DFSS+I +TYRKGFDAIGD+K+TSDV+WGCMLRSSQMLVAQALL
Sbjct: 129  SSGGVDNSNGLASFEQDFSSKILVTYRKGFDAIGDTKYTSDVHWGCMLRSSQMLVAQALL 188

Query: 776  FHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAM 955
            FH LGRSWRKP++KP   EYI++L LFGDSEASAFSIHNLLQ GK YGLA GSW+GPYAM
Sbjct: 189  FHKLGRSWRKPIDKPPDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSWVGPYAM 248

Query: 956  CRSWETLVNSTQQSDKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQAVWK 1135
            CR+WE L        +  L M IYVVSGDE+GERGGAPV+CIED +K C EF  G A W 
Sbjct: 249  CRTWEVLARKKNDLGELPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCFEFSSGLAAWT 308

Query: 1136 PIXXXXXXXXXXDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYLDPH 1315
            P+          DKVNPRYIP L +TF FPQSLGI+GGKPG STYI+G Q++KAFYLDPH
Sbjct: 309  PLLLLVPLVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGAQNEKAFYLDPH 368

Query: 1316 EVQQVMDIKRNNSE-IDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARASKL 1492
            +VQQV++I  +  E   TSSYHC+++RH+PLD+IDPSLAIGFYCRDKDDFDDFC++ASKL
Sbjct: 369  DVQQVVNISGDTQEPTSTSSYHCNIMRHIPLDSIDPSLAIGFYCRDKDDFDDFCSQASKL 428

Query: 1493 TDASNGAPLLTITKSHSSPKTVCKDFLVHSVDDESHGDFE-MDSMNESEISTQEDDWQLL 1669
             + SNGAPL T+T+S S  K V  + +          DF  MD  N++   T EDDWQLL
Sbjct: 429  AEESNGAPLFTVTQSRSFSKQVTSNDVSGDNTGFQEEDFPGMDRGNDT--GTNEDDWQLL 486


Top