BLASTX nr result

ID: Coptis24_contig00017560 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00017560
         (1818 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glyc...   537   e-150
ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Viti...   536   e-150
ref|XP_002331599.1| predicted protein [Populus trichocarpa] gi|2...   535   e-149
ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glyc...   533   e-149
emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]   532   e-148

>ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
          Length = 485

 Score =  537 bits (1384), Expect = e-150
 Identities = 263/420 (62%), Positives = 315/420 (75%), Gaps = 2/420 (0%)
 Frame = -1

Query: 1392 EKKTSHGRTYGWTFSLKRLVGGGSMRRLHERLIGXXXXXXXXXXSDIWLLGVCYRVSSEE 1213
            EKK    R+ GW  +++++V GGSMRR  ER++G           DIWLLGVC+++S +E
Sbjct: 69   EKKAVPSRSSGWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCHKISQQE 128

Query: 1212 SSDEACNTNGLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALL 1033
            S+     +NGL  F  DFSS+I +TYRKGFDAIGD+K+TSDVNWGCMLRSSQMLVAQALL
Sbjct: 129  STGGVDTSNGLASFEQDFSSKILVTYRKGFDAIGDTKYTSDVNWGCMLRSSQMLVAQALL 188

Query: 1032 FHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAM 853
            FH LGRSWRKP++KP   EYI++L LFGDSEASAFSIHNLLQ GK YGLA GSW+GPYAM
Sbjct: 189  FHKLGRSWRKPIDKPLDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSWVGPYAM 248

Query: 852  CRSWETLVNSTQQSDKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQAVWK 673
            CR+WE L        +P L M IYVVSGDE+GERGGAPV+CIED +K CSEF  G AVW 
Sbjct: 249  CRTWEVLARKKNDLGEPPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCSEFSSGLAVWT 308

Query: 672  PIXXXXXXXXXLDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYLDPH 493
            P+         LDKVNPRYIP L +TF FPQSLGI+GGKPG STYI+GVQ++KAFYLDPH
Sbjct: 309  PLLLLVPLVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGVQNEKAFYLDPH 368

Query: 492  EVQQVMDIKRNNSE-VDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARASKL 316
            +VQQV++I  +  E   TSSYHC+V+RH+PLD+IDPSLAIGFYCRDKDDFDDFC++ASKL
Sbjct: 369  DVQQVVNISGDTQEPTGTSSYHCNVMRHIPLDSIDPSLAIGFYCRDKDDFDDFCSQASKL 428

Query: 315  TDASNGAPLLTITKSHSSSKTVCKDFLVHSVDDESHGDFEMDNMN-ESEISTQEDDWQLL 139
             + SNGAPL T+ KS S SK V  D    S D+    + +   M+  ++  T EDDWQLL
Sbjct: 429  AEESNGAPLFTVAKSRSFSKQVSNDV---SGDNTGFQEDDFPGMDCGNDTVTNEDDWQLL 485


>ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera]
            gi|296086874|emb|CBI33041.3| unnamed protein product
            [Vitis vinifera]
          Length = 486

 Score =  536 bits (1382), Expect = e-150
 Identities = 270/429 (62%), Positives = 318/429 (74%), Gaps = 11/429 (2%)
 Frame = -1

Query: 1392 EKKTSHGRTYGWTFSLKRLVGGGSMRRLHERLIGXXXXXXXXXXSDIWLLGVCYRVSSEE 1213
            +K   +GR  GWT +++++V G SMRR+ ER++G          SDIWLLG+CY++S EE
Sbjct: 63   KKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEE 122

Query: 1212 SSDEACNTNGLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALL 1033
            SS+ A ++NGL EF  DFSSRI MTYRKGF+AIGDSK TSDVNWGCMLRSSQMLVAQALL
Sbjct: 123  SSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALL 182

Query: 1032 FHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAM 853
             H +GRSWRK   KP   +YIEIL  FGDS+ASAFSIHN+LQ GKAYGLAAGSW+GPYAM
Sbjct: 183  LHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAM 242

Query: 852  CRSWETLVNSTQQS---DKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQA 682
            CRSWETL  S ++    +  +L M IY+VSGDE+GERGGAPV+ IE+ ++ C EF +GQ 
Sbjct: 243  CRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQV 302

Query: 681  VWKPIXXXXXXXXXLDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYL 502
             W PI         L+KVNPRYIPSL ATFTFPQSLGILGGKPG STYIVGVQD+KAFYL
Sbjct: 303  DWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYL 362

Query: 501  DPHEVQQVMDIKRNNSEVDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARAS 322
            DPHE Q V+DI+R N E DTSSYHC+++RH+ LD+IDPSLAIGFYCRDKDDFDDFC RAS
Sbjct: 363  DPHEAQSVVDIRRENLEADTSSYHCNIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRAS 422

Query: 321  KLTDASNGAPLLTITKSHSSSKTV-CKDFLVHSVDDESHGDFEMDNMN-------ESEIS 166
            KL D SNGAPL T+   HS  K + C D +     D+  G  E D+ +       E    
Sbjct: 423  KLADKSNGAPLFTVAHIHSLPKPISCSDGM-----DDCSGFREDDSFDVVSNKGAEGYEH 477

Query: 165  TQEDDWQLL 139
              EDDWQLL
Sbjct: 478  EHEDDWQLL 486


>ref|XP_002331599.1| predicted protein [Populus trichocarpa] gi|222873995|gb|EEF11126.1|
            predicted protein [Populus trichocarpa]
          Length = 482

 Score =  535 bits (1378), Expect = e-149
 Identities = 269/422 (63%), Positives = 314/422 (74%), Gaps = 4/422 (0%)
 Frame = -1

Query: 1392 EKKTSHGRT-YGWTFSLKRLVGGGSMRRLHERLIGXXXXXXXXXXSDIWLLGVCYRVSSE 1216
            EKK  H R   GWT ++K++V GGSMRR+ E ++G           DIWLLG CY++S +
Sbjct: 68   EKKAPHIRHGNGWTSAVKKIVAGGSMRRIQECVLGTSKTGISNTTGDIWLLGACYKISQD 127

Query: 1215 ESSDEACNTNGLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQAL 1036
             SS +A  TN L  F+ DFSSRI +TYRKGFDAI DSK TSDV+WGCMLRSSQMLVAQAL
Sbjct: 128  NSSGDAAATNALAAFNHDFSSRILITYRKGFDAIEDSKLTSDVSWGCMLRSSQMLVAQAL 187

Query: 1035 LFHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYA 856
            LFH LGRSWRKP++KP   EY+EIL LFGDSE+SAFSIHNLL+ GKAYGLAAGSW+GPYA
Sbjct: 188  LFHRLGRSWRKPLDKPLDREYVEILHLFGDSESSAFSIHNLLRAGKAYGLAAGSWVGPYA 247

Query: 855  MCRSWETLVNSTQQS---DKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQ 685
            +C SWE+LV S ++    +  +LSM +YVVSG E+GERGGAPVLCIE+ A+ CSEF +GQ
Sbjct: 248  VCHSWESLVRSRREETNLEYQSLSMAVYVVSGSEDGERGGAPVLCIEEAARHCSEFSKGQ 307

Query: 684  AVWKPIXXXXXXXXXLDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFY 505
              W PI         LDK+NPRYIPSL ATFTFPQSLGILGGKPG STYIVGVQD+ AFY
Sbjct: 308  EDWTPILLLVPLVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVGVQDENAFY 367

Query: 504  LDPHEVQQVMDIKRNNSEVDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARA 325
            LDPHEVQ V+++ R++ E +TSSYHC+VVRH+PLD IDPSLAIGFYCRDKDDFDDFC  A
Sbjct: 368  LDPHEVQPVVNVSRDDVEANTSSYHCNVVRHMPLDLIDPSLAIGFYCRDKDDFDDFCTLA 427

Query: 324  SKLTDASNGAPLLTITKSHSSSKTVCKDFLVHSVDDESHGDFEMDNMNESEISTQEDDWQ 145
            SKLTD SNGAPL T+  S    K         S +  S     +  MN+ E    EDDWQ
Sbjct: 428  SKLTDESNGAPLFTVAHSRKLLKH-------DSGEVRSDDSLGVMTMNDVEGCVHEDDWQ 480

Query: 144  LL 139
            LL
Sbjct: 481  LL 482


>ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
          Length = 486

 Score =  533 bits (1372), Expect = e-149
 Identities = 261/420 (62%), Positives = 313/420 (74%), Gaps = 2/420 (0%)
 Frame = -1

Query: 1392 EKKTSHGRTYGWTFSLKRLVGGGSMRRLHERLIGXXXXXXXXXXSDIWLLGVCYRVSSEE 1213
            EKK  H R+ GW  +++++V GGSMRR  ER++G           DIWLLGVC+++S +E
Sbjct: 69   EKKAVHSRSSGWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCHKISQQE 128

Query: 1212 SSDEACNTNGLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALL 1033
            SS    N+NGL  F  DFSS+I +TYRKGFDAIGD+K+TSDV+WGCMLRSSQMLVAQALL
Sbjct: 129  SSGGVDNSNGLASFEQDFSSKILVTYRKGFDAIGDTKYTSDVHWGCMLRSSQMLVAQALL 188

Query: 1032 FHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAM 853
            FH LGRSWRKP++KP   EYI++L LFGDSEASAFSIHNLLQ GK YGLA GSW+GPYAM
Sbjct: 189  FHKLGRSWRKPIDKPPDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSWVGPYAM 248

Query: 852  CRSWETLVNSTQQSDKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQAVWK 673
            CR+WE L        +  L M IYVVSGDE+GERGGAPV+CIED +K C EF  G A W 
Sbjct: 249  CRTWEVLARKKNDLGELPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCFEFSSGLAAWT 308

Query: 672  PIXXXXXXXXXLDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYLDPH 493
            P+         LDKVNPRYIP L +TF FPQSLGI+GGKPG STYI+G Q++KAFYLDPH
Sbjct: 309  PLLLLVPLVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGAQNEKAFYLDPH 368

Query: 492  EVQQVMDIKRNNSE-VDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARASKL 316
            +VQQV++I  +  E   TSSYHC+++RH+PLD+IDPSLAIGFYCRDKDDFDDFC++ASKL
Sbjct: 369  DVQQVVNISGDTQEPTSTSSYHCNIMRHIPLDSIDPSLAIGFYCRDKDDFDDFCSQASKL 428

Query: 315  TDASNGAPLLTITKSHSSSKTVCKDFLVHSVDDESHGDFE-MDNMNESEISTQEDDWQLL 139
             + SNGAPL T+T+S S SK V  + +          DF  MD  N++   T EDDWQLL
Sbjct: 429  AEESNGAPLFTVTQSRSFSKQVTSNDVSGDNTGFQEEDFPGMDRGNDT--GTNEDDWQLL 486


>emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
          Length = 489

 Score =  532 bits (1371), Expect = e-148
 Identities = 271/432 (62%), Positives = 318/432 (73%), Gaps = 14/432 (3%)
 Frame = -1

Query: 1392 EKKTSHGRTYGWTFSLKRLVGGGSMRRLHERLIGXXXXXXXXXXSDIWLLGVCYRVSSEE 1213
            +K   +GR  GWT +++++V G SMRR+ ER++G          SDIWLLG+CY++S EE
Sbjct: 63   KKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEE 122

Query: 1212 SSDEACNTNGLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALL 1033
            SS+ A ++NGL EF  DFSSRI MTYRKGF+AIGDSK TSDVNWGCMLRSSQMLVAQALL
Sbjct: 123  SSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALL 182

Query: 1032 FHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAM 853
             H +GRSWRK   KP   +YIEIL  FGDS+ASAFSIHN+LQ GKAYGLAAGSW+GPYAM
Sbjct: 183  LHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAM 242

Query: 852  CRSWETLVNSTQQS---DKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQA 682
            CRSWETL  S ++    +  +L M IY+VSGDE+GERGGAPV+ IE+ ++ C EF +GQ 
Sbjct: 243  CRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQV 302

Query: 681  VWKPIXXXXXXXXXLDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYL 502
             W PI         L+KVNPRYIPSL ATFTFPQSLGILGGKPG STYIVGVQD+KAFYL
Sbjct: 303  DWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYL 362

Query: 501  DPHEVQQVMDIKRNNSEVDTSSYHC---SVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCA 331
            DPHE Q V+DI+R N E DTSSYHC   S++RH+ LD+IDPSLAIGFYCRDKDDFDDFC 
Sbjct: 363  DPHEAQSVVDIRRENLEADTSSYHCNCSSIIRHICLDSIDPSLAIGFYCRDKDDFDDFCI 422

Query: 330  RASKLTDASNGAPLLTITKSHSSSKTV-CKDFLVHSVDDESHGDFEMDNMN-------ES 175
            RASKL D SNGAPL T+   HS  K + C D +     D+  G  E D+ +       E 
Sbjct: 423  RASKLADESNGAPLFTVAHIHSLPKPISCSDGM-----DDCSGFREDDSFDVVSNKGAEG 477

Query: 174  EISTQEDDWQLL 139
                 EDDWQLL
Sbjct: 478  YEHEHEDDWQLL 489

