BLASTX nr result

ID: Coptis21_contig00004534 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00004534
         (1845 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Viti...   537   e-150
ref|XP_002331599.1| predicted protein [Populus trichocarpa] gi|2...   535   e-149
ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glyc...   533   e-149
emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]   533   e-149
ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus c...   530   e-148

>ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera]
            gi|296086874|emb|CBI33041.3| unnamed protein product
            [Vitis vinifera]
          Length = 486

 Score =  537 bits (1383), Expect = e-150
 Identities = 268/429 (62%), Positives = 316/429 (73%), Gaps = 11/429 (2%)
 Frame = +2

Query: 419  EKKTSHGRTYGWTFSLKRIVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSEE 598
            +K   +GR  GWT +++++V G SMRR+ ER++G           DIWLLG+CY++S EE
Sbjct: 63   KKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEE 122

Query: 599  SSDDACNANRLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALL 778
            SS+ A ++N L EF  DFSSRI MTYRKGF+AIGDSK TSDVNWGCMLRSSQMLVAQALL
Sbjct: 123  SSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALL 182

Query: 779  FHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAM 958
             H +GRSWRK   KP   +YIEIL  FGDS+ASAFSIHN+LQ GKAYGLAAGSW+GPYAM
Sbjct: 183  LHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAM 242

Query: 959  CRSWETLVNSTQQS---DKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQA 1129
            CRSWETL  S ++    +  +L M IY+VSGDE+GERGGAPV+ IE+ ++ C EF +GQ 
Sbjct: 243  CRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQV 302

Query: 1130 VWKPIXXXXXXXXXXDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYL 1309
             W PI          +KVNPRYIPSL ATFTFPQSLGILGGKPG STYIVGVQD+KAFYL
Sbjct: 303  DWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYL 362

Query: 1310 DPHEVQQVMDIKRNNSEVDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARAS 1489
            DPHE Q V+DI+R N E DTSSYHC+++RH+ LD+IDPSLAIGFYCRDKDDFDDFC RAS
Sbjct: 363  DPHEAQSVVDIRRENLEADTSSYHCNIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRAS 422

Query: 1490 KLTDASNGAPLLTITKSHSSPKTV-CKDFLVHSVDDESHGDFEMDNMN-------ESEIS 1645
            KL D SNGAPL T+   HS PK + C D +     D+  G  E D+ +       E    
Sbjct: 423  KLADKSNGAPLFTVAHIHSLPKPISCSDGM-----DDCSGFREDDSFDVVSNKGAEGYEH 477

Query: 1646 TQEDDWQLL 1672
              EDDWQLL
Sbjct: 478  EHEDDWQLL 486


>ref|XP_002331599.1| predicted protein [Populus trichocarpa] gi|222873995|gb|EEF11126.1|
            predicted protein [Populus trichocarpa]
          Length = 482

 Score =  535 bits (1378), Expect = e-149
 Identities = 270/423 (63%), Positives = 314/423 (74%), Gaps = 5/423 (1%)
 Frame = +2

Query: 419  EKKTSHGRT-YGWTFSLKRIVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSE 595
            EKK  H R   GWT ++K+IV GGSMRR+ E ++G           DIWLLG CY++S +
Sbjct: 68   EKKAPHIRHGNGWTSAVKKIVAGGSMRRIQECVLGTSKTGISNTTGDIWLLGACYKISQD 127

Query: 596  ESSDDACNANRLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQAL 775
             SS DA   N L  F+ DFSSRI +TYRKGFDAI DSK TSDV+WGCMLRSSQMLVAQAL
Sbjct: 128  NSSGDAAATNALAAFNHDFSSRILITYRKGFDAIEDSKLTSDVSWGCMLRSSQMLVAQAL 187

Query: 776  LFHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYA 955
            LFH LGRSWRKP++KP   EY+EIL LFGDSE+SAFSIHNLL+ GKAYGLAAGSW+GPYA
Sbjct: 188  LFHRLGRSWRKPLDKPLDREYVEILHLFGDSESSAFSIHNLLRAGKAYGLAAGSWVGPYA 247

Query: 956  MCRSWETLVNSTQQS---DKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQ 1126
            +C SWE+LV S ++    +  +LSM +YVVSG E+GERGGAPVLCIE+ A+ CSEF +GQ
Sbjct: 248  VCHSWESLVRSRREETNLEYQSLSMAVYVVSGSEDGERGGAPVLCIEEAARHCSEFSKGQ 307

Query: 1127 AVWKPIXXXXXXXXXXDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFY 1306
              W PI          DK+NPRYIPSL ATFTFPQSLGILGGKPG STYIVGVQD+ AFY
Sbjct: 308  EDWTPILLLVPLVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVGVQDENAFY 367

Query: 1307 LDPHEVQQVMDIKRNNSEVDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARA 1486
            LDPHEVQ V+++ R++ E +TSSYHC+VVRH+PLD IDPSLAIGFYCRDKDDFDDFC  A
Sbjct: 368  LDPHEVQPVVNVSRDDVEANTSSYHCNVVRHMPLDLIDPSLAIGFYCRDKDDFDDFCTLA 427

Query: 1487 SKLTDASNGAPLLTITKSHSSPKTVCKDFLVH-SVDDESHGDFEMDNMNESEISTQEDDW 1663
            SKLTD SNGAPL T+  S        +  L H S +  S     +  MN+ E    EDDW
Sbjct: 428  SKLTDESNGAPLFTVAHS--------RKLLKHDSGEVRSDDSLGVMTMNDVEGCVHEDDW 479

Query: 1664 QLL 1672
            QLL
Sbjct: 480  QLL 482


>ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
          Length = 485

 Score =  533 bits (1374), Expect = e-149
 Identities = 260/420 (61%), Positives = 312/420 (74%), Gaps = 2/420 (0%)
 Frame = +2

Query: 419  EKKTSHGRTYGWTFSLKRIVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSEE 598
            EKK    R+ GW  +++++V GGSMRR  ER++G           DIWLLGVC+++S +E
Sbjct: 69   EKKAVPSRSSGWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCHKISQQE 128

Query: 599  SSDDACNANRLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALL 778
            S+     +N L  F  DFSS+I +TYRKGFDAIGD+K+TSDVNWGCMLRSSQMLVAQALL
Sbjct: 129  STGGVDTSNGLASFEQDFSSKILVTYRKGFDAIGDTKYTSDVNWGCMLRSSQMLVAQALL 188

Query: 779  FHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAM 958
            FH LGRSWRKP++KP   EYI++L LFGDSEASAFSIHNLLQ GK YGLA GSW+GPYAM
Sbjct: 189  FHKLGRSWRKPIDKPLDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSWVGPYAM 248

Query: 959  CRSWETLVNSTQQSDKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQAVWK 1138
            CR+WE L        +P L M IYVVSGDE+GERGGAPV+CIED +K CSEF  G AVW 
Sbjct: 249  CRTWEVLARKKNDLGEPPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCSEFSSGLAVWT 308

Query: 1139 PIXXXXXXXXXXDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYLDPH 1318
            P+          DKVNPRYIP L +TF FPQSLGI+GGKPG STYI+GVQ++KAFYLDPH
Sbjct: 309  PLLLLVPLVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGVQNEKAFYLDPH 368

Query: 1319 EVQQVMDIKRNNSE-VDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARASKL 1495
            +VQQV++I  +  E   TSSYHC+V+RH+PLD+IDPSLAIGFYCRDKDDFDDFC++ASKL
Sbjct: 369  DVQQVVNISGDTQEPTGTSSYHCNVMRHIPLDSIDPSLAIGFYCRDKDDFDDFCSQASKL 428

Query: 1496 TDASNGAPLLTITKSHSSPKTVCKDFLVHSVDDESHGDFEMDNMN-ESEISTQEDDWQLL 1672
             + SNGAPL T+ KS S  K V  D    S D+    + +   M+  ++  T EDDWQLL
Sbjct: 429  AEESNGAPLFTVAKSRSFSKQVSNDV---SGDNTGFQEDDFPGMDCGNDTVTNEDDWQLL 485


>emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
          Length = 489

 Score =  533 bits (1372), Expect = e-149
 Identities = 269/432 (62%), Positives = 316/432 (73%), Gaps = 14/432 (3%)
 Frame = +2

Query: 419  EKKTSHGRTYGWTFSLKRIVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSEE 598
            +K   +GR  GWT +++++V G SMRR+ ER++G           DIWLLG+CY++S EE
Sbjct: 63   KKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEE 122

Query: 599  SSDDACNANRLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALL 778
            SS+ A ++N L EF  DFSSRI MTYRKGF+AIGDSK TSDVNWGCMLRSSQMLVAQALL
Sbjct: 123  SSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALL 182

Query: 779  FHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAM 958
             H +GRSWRK   KP   +YIEIL  FGDS+ASAFSIHN+LQ GKAYGLAAGSW+GPYAM
Sbjct: 183  LHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAM 242

Query: 959  CRSWETLVNSTQQS---DKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQA 1129
            CRSWETL  S ++    +  +L M IY+VSGDE+GERGGAPV+ IE+ ++ C EF +GQ 
Sbjct: 243  CRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQV 302

Query: 1130 VWKPIXXXXXXXXXXDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYL 1309
             W PI          +KVNPRYIPSL ATFTFPQSLGILGGKPG STYIVGVQD+KAFYL
Sbjct: 303  DWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYL 362

Query: 1310 DPHEVQQVMDIKRNNSEVDTSSYHC---SVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCA 1480
            DPHE Q V+DI+R N E DTSSYHC   S++RH+ LD+IDPSLAIGFYCRDKDDFDDFC 
Sbjct: 363  DPHEAQSVVDIRRENLEADTSSYHCNCSSIIRHICLDSIDPSLAIGFYCRDKDDFDDFCI 422

Query: 1481 RASKLTDASNGAPLLTITKSHSSPKTV-CKDFLVHSVDDESHGDFEMDNMN-------ES 1636
            RASKL D SNGAPL T+   HS PK + C D +     D+  G  E D+ +       E 
Sbjct: 423  RASKLADESNGAPLFTVAHIHSLPKPISCSDGM-----DDCSGFREDDSFDVVSNKGAEG 477

Query: 1637 EISTQEDDWQLL 1672
                 EDDWQLL
Sbjct: 478  YEHEHEDDWQLL 489


>ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus communis]
            gi|223531343|gb|EEF33181.1| Cysteine protease ATG4B,
            putative [Ricinus communis]
          Length = 489

 Score =  530 bits (1366), Expect = e-148
 Identities = 264/426 (61%), Positives = 312/426 (73%), Gaps = 8/426 (1%)
 Frame = +2

Query: 419  EKKTSHGRTYGWTFSLKRIVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSEE 598
            EKK SH R  GWT ++K+IV GGSMRR+HER++G           DIWLLGVCY++S +E
Sbjct: 65   EKKGSHSRHNGWTSAVKKIVSGGSMRRIHERVLGPSRTGISSTTSDIWLLGVCYKISEDE 124

Query: 599  SSDDACNANRLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALL 778
            S + A   N L EF+ D+SSRI MTYR+GFDAIGDSK+ SDV WGCMLRSSQMLVAQALL
Sbjct: 125  SGN-ADTGNALAEFTHDYSSRILMTYRRGFDAIGDSKYISDVGWGCMLRSSQMLVAQALL 183

Query: 779  FHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAM 958
            FH LGR+W KP +KP    Y+EIL LFGDSEA+ FSIHNL+Q GKAY LAAGSW+GPYAM
Sbjct: 184  FHKLGRAWTKPFQKPMDQAYVEILHLFGDSEAAPFSIHNLIQAGKAYSLAAGSWVGPYAM 243

Query: 959  CRSWETLVNSTQQSDK---PTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQA 1129
            CRSWE+L  S ++ +     +L M +YVVSGDE+GERGGAPV+ IED ++ C EF  GQA
Sbjct: 244  CRSWESLARSKREENSLEYQSLPMAVYVVSGDEDGERGGAPVVYIEDASRHCLEFSRGQA 303

Query: 1130 VWKPIXXXXXXXXXXDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYL 1309
             W PI          DKVNPRYIPSL ATFTF QSLGI+GGKPG STYIVGVQDD AFYL
Sbjct: 304  DWTPILLLVPLVLGLDKVNPRYIPSLQATFTFSQSLGIMGGKPGASTYIVGVQDDNAFYL 363

Query: 1310 DPHEVQQVMDIKRNNSEVDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARAS 1489
            DPHEVQ V++I R++ E DTSSYH  +VRH+PL +IDPSLAIGFYCRDKDDFD+FC  AS
Sbjct: 364  DPHEVQSVVNIGRDDIEADTSSYHSDIVRHIPLHSIDPSLAIGFYCRDKDDFDEFCLLAS 423

Query: 1490 KLTDASNGAPLLTITKSHSSPKTVCKDFLVHSVDDESHGDFEMD-----NMNESEISTQE 1654
            KL D S GAPL T+   H  PK V    ++++ DDE   D  ++     N +      QE
Sbjct: 424  KLADDSQGAPLFTVAHCHKLPKPVSHGDMLNNEDDEVQEDDSVNVMMPVNDDAEGGGAQE 483

Query: 1655 DDWQLL 1672
            D+WQLL
Sbjct: 484  DEWQLL 489

