BLASTX nr result

ID: Coptis25_contig00004396 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00004396
         (1849 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glyc...   537   e-150
ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Viti...   536   e-150
ref|XP_002331599.1| predicted protein [Populus trichocarpa] gi|2...   535   e-149
ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glyc...   533   e-149
emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]   532   e-148

>ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
          Length = 485

 Score =  537 bits (1384), Expect = e-150
 Identities = 262/420 (62%), Positives = 314/420 (74%), Gaps = 2/420 (0%)
 Frame = +3

Query: 429  EKKTSHGRTYGWTFSLKRLVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSEE 608
            EKK    R+ GW  +++++V GGSMRR  ER++G           DIWLLGVC+++S +E
Sbjct: 69   EKKAVPSRSSGWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCHKISQQE 128

Query: 609  SSDEACNTNGLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALL 788
            S+     +NGL  F  DFSS+I +TYRKGFDAIGD+K+TSDVNWGCMLRSSQMLVAQALL
Sbjct: 129  STGGVDTSNGLASFEQDFSSKILVTYRKGFDAIGDTKYTSDVNWGCMLRSSQMLVAQALL 188

Query: 789  FHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAM 968
            FH LGRSWRKP++KP   EYI++L LFGDSEASAFSIHNLLQ GK YGLA GSW+GPYAM
Sbjct: 189  FHKLGRSWRKPIDKPLDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSWVGPYAM 248

Query: 969  CRSWETLVNSTQQSDKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQAVWK 1148
            CR+WE L        +P L M IYVVSGDE+GERGGAPV+CIED +K CSEF  G AVW 
Sbjct: 249  CRTWEVLARKKNDLGEPPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCSEFSSGLAVWT 308

Query: 1149 PIXXXXXXXXXXDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYLDPH 1328
            P+          DKVNPRYIP L +TF FPQSLGI+GGKPG STYI+GVQ++KAFYLDPH
Sbjct: 309  PLLLLVPLVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGVQNEKAFYLDPH 368

Query: 1329 EVQQVMDIKRNNSE-VDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARASKL 1505
            +VQQV++I  +  E   TSSYHC+V+RH+PLD+IDPSLAIGFYCRDKDDFDDFC++ASKL
Sbjct: 369  DVQQVVNISGDTQEPTGTSSYHCNVMRHIPLDSIDPSLAIGFYCRDKDDFDDFCSQASKL 428

Query: 1506 TDASNGAPLLTITKSHSSSKTVCKDFLVHSVDDESHGDFEMDNMN-ESEISTQEDDWQLL 1682
             + SNGAPL T+ KS S SK V  D    S D+    + +   M+  ++  T EDDWQLL
Sbjct: 429  AEESNGAPLFTVAKSRSFSKQVSNDV---SGDNTGFQEDDFPGMDCGNDTVTNEDDWQLL 485


>ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera]
            gi|296086874|emb|CBI33041.3| unnamed protein product
            [Vitis vinifera]
          Length = 486

 Score =  536 bits (1382), Expect = e-150
 Identities = 268/429 (62%), Positives = 316/429 (73%), Gaps = 11/429 (2%)
 Frame = +3

Query: 429  EKKTSHGRTYGWTFSLKRLVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSEE 608
            +K   +GR  GWT +++++V G SMRR+ ER++G           DIWLLG+CY++S EE
Sbjct: 63   KKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEE 122

Query: 609  SSDEACNTNGLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALL 788
            SS+ A ++NGL EF  DFSSRI MTYRKGF+AIGDSK TSDVNWGCMLRSSQMLVAQALL
Sbjct: 123  SSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALL 182

Query: 789  FHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAM 968
             H +GRSWRK   KP   +YIEIL  FGDS+ASAFSIHN+LQ GKAYGLAAGSW+GPYAM
Sbjct: 183  LHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAM 242

Query: 969  CRSWETLVNSTQQS---DKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQA 1139
            CRSWETL  S ++    +  +L M IY+VSGDE+GERGGAPV+ IE+ ++ C EF +GQ 
Sbjct: 243  CRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQV 302

Query: 1140 VWKPIXXXXXXXXXXDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYL 1319
             W PI          +KVNPRYIPSL ATFTFPQSLGILGGKPG STYIVGVQD+KAFYL
Sbjct: 303  DWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYL 362

Query: 1320 DPHEVQQVMDIKRNNSEVDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARAS 1499
            DPHE Q V+DI+R N E DTSSYHC+++RH+ LD+IDPSLAIGFYCRDKDDFDDFC RAS
Sbjct: 363  DPHEAQSVVDIRRENLEADTSSYHCNIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRAS 422

Query: 1500 KLTDASNGAPLLTITKSHSSSKTV-CKDFLVHSVDDESHGDFEMDNMN-------ESEIS 1655
            KL D SNGAPL T+   HS  K + C D +     D+  G  E D+ +       E    
Sbjct: 423  KLADKSNGAPLFTVAHIHSLPKPISCSDGM-----DDCSGFREDDSFDVVSNKGAEGYEH 477

Query: 1656 TQEDDWQLL 1682
              EDDWQLL
Sbjct: 478  EHEDDWQLL 486


>ref|XP_002331599.1| predicted protein [Populus trichocarpa] gi|222873995|gb|EEF11126.1|
            predicted protein [Populus trichocarpa]
          Length = 482

 Score =  535 bits (1378), Expect = e-149
 Identities = 268/422 (63%), Positives = 313/422 (74%), Gaps = 4/422 (0%)
 Frame = +3

Query: 429  EKKTSHGRT-YGWTFSLKRLVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSE 605
            EKK  H R   GWT ++K++V GGSMRR+ E ++G           DIWLLG CY++S +
Sbjct: 68   EKKAPHIRHGNGWTSAVKKIVAGGSMRRIQECVLGTSKTGISNTTGDIWLLGACYKISQD 127

Query: 606  ESSDEACNTNGLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQAL 785
             SS +A  TN L  F+ DFSSRI +TYRKGFDAI DSK TSDV+WGCMLRSSQMLVAQAL
Sbjct: 128  NSSGDAAATNALAAFNHDFSSRILITYRKGFDAIEDSKLTSDVSWGCMLRSSQMLVAQAL 187

Query: 786  LFHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYA 965
            LFH LGRSWRKP++KP   EY+EIL LFGDSE+SAFSIHNLL+ GKAYGLAAGSW+GPYA
Sbjct: 188  LFHRLGRSWRKPLDKPLDREYVEILHLFGDSESSAFSIHNLLRAGKAYGLAAGSWVGPYA 247

Query: 966  MCRSWETLVNSTQQS---DKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQ 1136
            +C SWE+LV S ++    +  +LSM +YVVSG E+GERGGAPVLCIE+ A+ CSEF +GQ
Sbjct: 248  VCHSWESLVRSRREETNLEYQSLSMAVYVVSGSEDGERGGAPVLCIEEAARHCSEFSKGQ 307

Query: 1137 AVWKPIXXXXXXXXXXDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFY 1316
              W PI          DK+NPRYIPSL ATFTFPQSLGILGGKPG STYIVGVQD+ AFY
Sbjct: 308  EDWTPILLLVPLVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVGVQDENAFY 367

Query: 1317 LDPHEVQQVMDIKRNNSEVDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARA 1496
            LDPHEVQ V+++ R++ E +TSSYHC+VVRH+PLD IDPSLAIGFYCRDKDDFDDFC  A
Sbjct: 368  LDPHEVQPVVNVSRDDVEANTSSYHCNVVRHMPLDLIDPSLAIGFYCRDKDDFDDFCTLA 427

Query: 1497 SKLTDASNGAPLLTITKSHSSSKTVCKDFLVHSVDDESHGDFEMDNMNESEISTQEDDWQ 1676
            SKLTD SNGAPL T+  S    K         S +  S     +  MN+ E    EDDWQ
Sbjct: 428  SKLTDESNGAPLFTVAHSRKLLKH-------DSGEVRSDDSLGVMTMNDVEGCVHEDDWQ 480

Query: 1677 LL 1682
            LL
Sbjct: 481  LL 482


>ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
          Length = 486

 Score =  533 bits (1372), Expect = e-149
 Identities = 260/420 (61%), Positives = 312/420 (74%), Gaps = 2/420 (0%)
 Frame = +3

Query: 429  EKKTSHGRTYGWTFSLKRLVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSEE 608
            EKK  H R+ GW  +++++V GGSMRR  ER++G           DIWLLGVC+++S +E
Sbjct: 69   EKKAVHSRSSGWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCHKISQQE 128

Query: 609  SSDEACNTNGLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALL 788
            SS    N+NGL  F  DFSS+I +TYRKGFDAIGD+K+TSDV+WGCMLRSSQMLVAQALL
Sbjct: 129  SSGGVDNSNGLASFEQDFSSKILVTYRKGFDAIGDTKYTSDVHWGCMLRSSQMLVAQALL 188

Query: 789  FHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAM 968
            FH LGRSWRKP++KP   EYI++L LFGDSEASAFSIHNLLQ GK YGLA GSW+GPYAM
Sbjct: 189  FHKLGRSWRKPIDKPPDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSWVGPYAM 248

Query: 969  CRSWETLVNSTQQSDKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQAVWK 1148
            CR+WE L        +  L M IYVVSGDE+GERGGAPV+CIED +K C EF  G A W 
Sbjct: 249  CRTWEVLARKKNDLGELPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCFEFSSGLAAWT 308

Query: 1149 PIXXXXXXXXXXDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYLDPH 1328
            P+          DKVNPRYIP L +TF FPQSLGI+GGKPG STYI+G Q++KAFYLDPH
Sbjct: 309  PLLLLVPLVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGAQNEKAFYLDPH 368

Query: 1329 EVQQVMDIKRNNSE-VDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARASKL 1505
            +VQQV++I  +  E   TSSYHC+++RH+PLD+IDPSLAIGFYCRDKDDFDDFC++ASKL
Sbjct: 369  DVQQVVNISGDTQEPTSTSSYHCNIMRHIPLDSIDPSLAIGFYCRDKDDFDDFCSQASKL 428

Query: 1506 TDASNGAPLLTITKSHSSSKTVCKDFLVHSVDDESHGDFE-MDNMNESEISTQEDDWQLL 1682
             + SNGAPL T+T+S S SK V  + +          DF  MD  N++   T EDDWQLL
Sbjct: 429  AEESNGAPLFTVTQSRSFSKQVTSNDVSGDNTGFQEEDFPGMDRGNDT--GTNEDDWQLL 486


>emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
          Length = 489

 Score =  532 bits (1371), Expect = e-148
 Identities = 269/432 (62%), Positives = 316/432 (73%), Gaps = 14/432 (3%)
 Frame = +3

Query: 429  EKKTSHGRTYGWTFSLKRLVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSEE 608
            +K   +GR  GWT +++++V G SMRR+ ER++G           DIWLLG+CY++S EE
Sbjct: 63   KKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEE 122

Query: 609  SSDEACNTNGLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALL 788
            SS+ A ++NGL EF  DFSSRI MTYRKGF+AIGDSK TSDVNWGCMLRSSQMLVAQALL
Sbjct: 123  SSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALL 182

Query: 789  FHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAM 968
             H +GRSWRK   KP   +YIEIL  FGDS+ASAFSIHN+LQ GKAYGLAAGSW+GPYAM
Sbjct: 183  LHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAM 242

Query: 969  CRSWETLVNSTQQS---DKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQA 1139
            CRSWETL  S ++    +  +L M IY+VSGDE+GERGGAPV+ IE+ ++ C EF +GQ 
Sbjct: 243  CRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQV 302

Query: 1140 VWKPIXXXXXXXXXXDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYL 1319
             W PI          +KVNPRYIPSL ATFTFPQSLGILGGKPG STYIVGVQD+KAFYL
Sbjct: 303  DWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYL 362

Query: 1320 DPHEVQQVMDIKRNNSEVDTSSYHC---SVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCA 1490
            DPHE Q V+DI+R N E DTSSYHC   S++RH+ LD+IDPSLAIGFYCRDKDDFDDFC 
Sbjct: 363  DPHEAQSVVDIRRENLEADTSSYHCNCSSIIRHICLDSIDPSLAIGFYCRDKDDFDDFCI 422

Query: 1491 RASKLTDASNGAPLLTITKSHSSSKTV-CKDFLVHSVDDESHGDFEMDNMN-------ES 1646
            RASKL D SNGAPL T+   HS  K + C D +     D+  G  E D+ +       E 
Sbjct: 423  RASKLADESNGAPLFTVAHIHSLPKPISCSDGM-----DDCSGFREDDSFDVVSNKGAEG 477

Query: 1647 EISTQEDDWQLL 1682
                 EDDWQLL
Sbjct: 478  YEHEHEDDWQLL 489

