BLASTX nr result

ID: Rauwolfia21_contig00001471 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00001471
         (1339 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006354186.1| PREDICTED: cysteine protease ATG4-like isofo...   424   e-116
ref|XP_004228630.1| PREDICTED: cysteine protease ATG4-like [Sola...   421   e-115
ref|XP_006354185.1| PREDICTED: cysteine protease ATG4-like isofo...   410   e-112
gb|EMJ19125.1| hypothetical protein PRUPE_ppa004885mg [Prunus pe...   394   e-107
ref|XP_002331599.1| predicted protein [Populus trichocarpa] gi|5...   383   e-103
ref|XP_006441973.1| hypothetical protein CICLE_v10019906mg [Citr...   382   e-103
ref|XP_006478507.1| PREDICTED: cysteine protease ATG4-like isofo...   382   e-103
gb|ESW11661.1| hypothetical protein PHAVU_008G048900g [Phaseolus...   380   e-103
ref|XP_003624282.1| Cysteine protease ATG4 [Medicago truncatula]...   380   e-103
ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Viti...   378   e-102
emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]   378   e-102
ref|XP_004492910.1| PREDICTED: cysteine protease ATG4-like isofo...   377   e-102
gb|EPS69655.1| hypothetical protein M569_05108, partial [Genlise...   371   e-100
ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus c...   369   1e-99
ref|XP_002309707.1| autophagy 4b family protein [Populus trichoc...   367   5e-99
gb|EOX94074.1| Peptidase family C54 protein isoform 3 [Theobroma...   366   1e-98
ref|XP_004138950.1| PREDICTED: cysteine protease ATG4-like [Cucu...   361   4e-97
gb|EOX94072.1| Peptidase family C54 protein isoform 1 [Theobroma...   356   1e-95
gb|EXB53615.1| hypothetical protein L484_005165 [Morus notabilis]     345   2e-92
ref|XP_004492911.1| PREDICTED: cysteine protease ATG4-like isofo...   342   3e-91

>ref|XP_006354186.1| PREDICTED: cysteine protease ATG4-like isoform X2 [Solanum tuberosum]
          Length = 496

 Score =  424 bits (1091), Expect = e-116
 Identities = 221/357 (61%), Positives = 266/357 (74%), Gaps = 22/357 (6%)
 Frame = +1

Query: 334  SKYSGEIPSENSSR--IANSVSSEAGP-------SSREFHKPSLWSGFLVSAFSVFENCS 486
            S++S      +S R  + +SVS+E GP       S +  +K S+WSG LVS FS+F+  S
Sbjct: 5    SQFSRSSSDSDSPRKNLGSSVSTEPGPGPSSSSSSCKVNNKSSVWSGLLVSPFSIFD--S 62

Query: 487  EAKST---NRVCNSKSY------GWTATVKRIMNSGSMRRIFGLNKTSFPS-SKSEIWLL 636
            E K        C SK Y      GWT+ VKR++NSGSMRRIFG++KT  P+ SKS+IWLL
Sbjct: 63   EPKGCLKKGEFCCSKKYNGIVGIGWTSAVKRMINSGSMRRIFGMDKTGIPNGSKSDIWLL 122

Query: 637  GVCYNVAQDDSS--DPTQSEGFASFVEDFTSRILITYRKGFAPIEDSKYTSDVNWGCMIR 810
            GVCY V QDD S  +PTQSEGF++FV+DF+SRIL+TYRKGFAPI D+KYTSDVNWGCM+R
Sbjct: 123  GVCYKVVQDDDSSIEPTQSEGFSAFVDDFSSRILVTYRKGFAPIGDTKYTSDVNWGCMLR 182

Query: 811  SSQMLIAQAFLFHLLGRSWRKTTNKPLEQNYSEILHLFGDSESSPCSIHNLLQAGKVYGL 990
            SSQML+AQA L H LGRSWRK+ +KPLE+ Y EILHLFGDSE S  SIHNLLQAGK YGL
Sbjct: 183  SSQMLVAQALLLHRLGRSWRKSMDKPLEEKYVEILHLFGDSEGSAYSIHNLLQAGKTYGL 242

Query: 991  SPGSWVGPYAVCRTWEALMRNKK-GTITGDLTSSISMYVVSGDEDGERGGAPMVCIEDIG 1167
            SPGSWVGPYA+CRTWE L R+K+  T   D++ ++++YVVSGDEDGERGGAP++CIEDI 
Sbjct: 243  SPGSWVGPYAMCRTWETLARSKREETGNADVSPAMAIYVVSGDEDGERGGAPVLCIEDIV 302

Query: 1168 RHCLEYSRGQAAWTXXXXXXXXXXXXDKLNPRYLPLLAATFCFPQSLGILGGRPGAS 1338
            +HC   S+G+  WT            DK+N RYLPLLAATF FPQSLGILGGRPGAS
Sbjct: 303  KHCSGLSKGEVDWTPVVFLVPLVLGLDKINSRYLPLLAATFSFPQSLGILGGRPGAS 359


>ref|XP_004228630.1| PREDICTED: cysteine protease ATG4-like [Solanum lycopersicum]
          Length = 493

 Score =  421 bits (1082), Expect = e-115
 Identities = 219/352 (62%), Positives = 262/352 (74%), Gaps = 17/352 (4%)
 Frame = +1

Query: 334  SKYSGEIPSENSSRIANSVSSEAGP-------SSREFHKPSLWSGFLVSAFSVFENCSEA 492
            S    + P +N   + +SVS+E GP       S +  +K S+WSG LVS FS+F+  SE 
Sbjct: 10   SSSDSDSPRKN---LGSSVSTEPGPGPSSSSSSCKVNNKSSVWSGLLVSPFSIFD--SEP 64

Query: 493  KST---NRVCNSKSY---GWTATVKRIMNSGSMRRIFGLNKTSFPS-SKSEIWLLGVCYN 651
            K        C SK Y   GWT+ VKR++NSGSMRRIFG++KT  P+ SKS+IWLLGVCY 
Sbjct: 65   KGCLKKREFCCSKKYNGIGWTSAVKRMINSGSMRRIFGMDKTGMPNGSKSDIWLLGVCYK 124

Query: 652  VAQDDSS--DPTQSEGFASFVEDFTSRILITYRKGFAPIEDSKYTSDVNWGCMIRSSQML 825
            V QDD S  +PTQSEGFA+FV+DF+SRIL+TYRKGFAPIED+KYTSDVNWGCM+RSSQML
Sbjct: 125  VVQDDDSSIEPTQSEGFAAFVDDFSSRILVTYRKGFAPIEDTKYTSDVNWGCMLRSSQML 184

Query: 826  IAQAFLFHLLGRSWRKTTNKPLEQNYSEILHLFGDSESSPCSIHNLLQAGKVYGLSPGSW 1005
            +AQA L H LGRSWRK+ +KPLEQ Y EILHLFGDS  S  SIHNLLQAGK YGLSPGSW
Sbjct: 185  VAQALLLHRLGRSWRKSMDKPLEQKYVEILHLFGDSVESAYSIHNLLQAGKTYGLSPGSW 244

Query: 1006 VGPYAVCRTWEALMRNKK-GTITGDLTSSISMYVVSGDEDGERGGAPMVCIEDIGRHCLE 1182
            VGPYA+CRTWE L R K+  T    ++ ++++YVVSGDEDGERGGAP++C+EDI +HC  
Sbjct: 245  VGPYAMCRTWETLARCKREETGNAVMSPAMAIYVVSGDEDGERGGAPVLCVEDIVKHCSG 304

Query: 1183 YSRGQAAWTXXXXXXXXXXXXDKLNPRYLPLLAATFCFPQSLGILGGRPGAS 1338
             ++G+  WT            DK+N RYLPLLAATF FPQSLGILGGRPGAS
Sbjct: 305  LAKGEVDWTPVLFLVPLVLGLDKINSRYLPLLAATFSFPQSLGILGGRPGAS 356


>ref|XP_006354185.1| PREDICTED: cysteine protease ATG4-like isoform X1 [Solanum tuberosum]
          Length = 524

 Score =  410 bits (1053), Expect = e-112
 Identities = 221/385 (57%), Positives = 266/385 (69%), Gaps = 50/385 (12%)
 Frame = +1

Query: 334  SKYSGEIPSENSSR--IANSVSSEAGP-------SSREFHKPSLWSGFLVSAFSVFENCS 486
            S++S      +S R  + +SVS+E GP       S +  +K S+WSG LVS FS+F+  S
Sbjct: 5    SQFSRSSSDSDSPRKNLGSSVSTEPGPGPSSSSSSCKVNNKSSVWSGLLVSPFSIFD--S 62

Query: 487  EAKST---NRVCNSKSY------GWTATVKRIMNSGSMRRIFGLNKTSFPS-SKSEIWLL 636
            E K        C SK Y      GWT+ VKR++NSGSMRRIFG++KT  P+ SKS+IWLL
Sbjct: 63   EPKGCLKKGEFCCSKKYNGIVGIGWTSAVKRMINSGSMRRIFGMDKTGIPNGSKSDIWLL 122

Query: 637  GVCYNVAQDDSS--DPTQSEGFASFVEDFTSRILITYRKGFAPIEDSKYTSDVNWGCMIR 810
            GVCY V QDD S  +PTQSEGF++FV+DF+SRIL+TYRKGFAPI D+KYTSDVNWGCM+R
Sbjct: 123  GVCYKVVQDDDSSIEPTQSEGFSAFVDDFSSRILVTYRKGFAPIGDTKYTSDVNWGCMLR 182

Query: 811  SSQMLIAQAFLFHLLGRSWRKTT----------------------------NKPLEQNYS 906
            SSQML+AQA L H LGRSWRK+                             N+PLE+ Y 
Sbjct: 183  SSQMLVAQALLLHRLGRSWRKSMDKVLESQNTAVLSVVKMLNFQFKTTIMHNQPLEEKYV 242

Query: 907  EILHLFGDSESSPCSIHNLLQAGKVYGLSPGSWVGPYAVCRTWEALMRNKKG-TITGDLT 1083
            EILHLFGDSE S  SIHNLLQAGK YGLSPGSWVGPYA+CRTWE L R+K+  T   D++
Sbjct: 243  EILHLFGDSEGSAYSIHNLLQAGKTYGLSPGSWVGPYAMCRTWETLARSKREETGNADVS 302

Query: 1084 SSISMYVVSGDEDGERGGAPMVCIEDIGRHCLEYSRGQAAWTXXXXXXXXXXXXDKLNPR 1263
             ++++YVVSGDEDGERGGAP++CIEDI +HC   S+G+  WT            DK+N R
Sbjct: 303  PAMAIYVVSGDEDGERGGAPVLCIEDIVKHCSGLSKGEVDWTPVVFLVPLVLGLDKINSR 362

Query: 1264 YLPLLAATFCFPQSLGILGGRPGAS 1338
            YLPLLAATF FPQSLGILGGRPGAS
Sbjct: 363  YLPLLAATFSFPQSLGILGGRPGAS 387


>gb|EMJ19125.1| hypothetical protein PRUPE_ppa004885mg [Prunus persica]
          Length = 487

 Score =  394 bits (1013), Expect = e-107
 Identities = 200/352 (56%), Positives = 255/352 (72%), Gaps = 7/352 (1%)
 Frame = +1

Query: 304  MKDLYERALSSKYSGEIPSENSSRIANSVSSEAGPSSREFHKPSLWSGFLVSAFSVFENC 483
            MK   ERA++SKYS +  +E++ R  +SV S++G    +  K SLWS F  SAFS+FE  
Sbjct: 1    MKGFCERAVASKYSSKSSTESTDRGPSSVCSDSGSRDSKHDKASLWSNFFASAFSIFETH 60

Query: 484  SEAKSTNRV-CNSKSYGWTATVKRIMNSGSMRRI----FGLNKTSFPSSKSEIWLLGVCY 648
            SE+  T +   +S++ GWT  V++++  GSMRRI     G ++T   SS S+IWLLGV Y
Sbjct: 61   SESSITEKKEIHSRNNGWTEAVRKVVTGGSMRRIHERVLGSSRTGI-SSASDIWLLGVLY 119

Query: 649  NVAQDDSS-DPTQSEGFASFVEDFTSRILITYRKGFAPIEDSKYTSDVNWGCMIRSSQML 825
             V+QD+SS D   + G  +F +DF+SRIL+TYRKGF  I DSKYTSDVNWGCM+RSSQML
Sbjct: 120  KVSQDESSGDAATNNGLRAFEQDFSSRILMTYRKGFDAIGDSKYTSDVNWGCMLRSSQML 179

Query: 826  IAQAFLFHLLGRSWRKTTNKPLEQNYSEILHLFGDSESSPCSIHNLLQAGKVYGLSPGSW 1005
            +AQA LFH LGRSWR+T +KPL++ Y EILH FGDSE S  SIHNLLQAGK Y L+ GSW
Sbjct: 180  VAQALLFHRLGRSWRRTLHKPLDEQYIEILHHFGDSEGSAFSIHNLLQAGKAYDLAAGSW 239

Query: 1006 VGPYAVCRTWEALMRNKK-GTITGDLTSSISMYVVSGDEDGERGGAPMVCIEDIGRHCLE 1182
            VGPYA+CR+WE L+R K+ GT   +    +++Y+VSGDEDGERGGAP+VCI+D  RHCLE
Sbjct: 240  VGPYAMCRSWETLVRCKREGTAFDNQPLPMAVYIVSGDEDGERGGAPVVCIQDASRHCLE 299

Query: 1183 YSRGQAAWTXXXXXXXXXXXXDKLNPRYLPLLAATFCFPQSLGILGGRPGAS 1338
            +SRG+  WT            +K+NPRY+P L ATF FPQSLGI+GG+PGAS
Sbjct: 300  FSRGRVDWTPILLLVPLVLGLEKVNPRYIPSLWATFTFPQSLGIMGGKPGAS 351


>ref|XP_002331599.1| predicted protein [Populus trichocarpa]
            gi|566210458|ref|XP_006372315.1| autophagy 4b family
            protein [Populus trichocarpa] gi|550318931|gb|ERP50112.1|
            autophagy 4b family protein [Populus trichocarpa]
          Length = 482

 Score =  383 bits (983), Expect = e-103
 Identities = 196/354 (55%), Positives = 245/354 (69%), Gaps = 9/354 (2%)
 Frame = +1

Query: 304  MKDLYERALSSKYSGEIPSENSSRIANSVSSEAGPSSREFHKPSLWSGFLVSAFSVFENC 483
            MK   ER   +       +E+ +R   S SSE G +  +F KPSLWS F  SAFSVF+  
Sbjct: 1    MKGFRERGFVASSKSSSTAESPNRSFTSDSSELGSADTKFSKPSLWSTFFASAFSVFDTH 60

Query: 484  SEAKSTNRVCNSK---SYGWTATVKRIMNSGSMRRI----FGLNKTSFPSSKSEIWLLGV 642
             ++ ST+           GWT+ VK+I+  GSMRRI     G +KT   ++  +IWLLG 
Sbjct: 61   CDSSSTSEKKAPHIRHGNGWTSAVKKIVAGGSMRRIQECVLGTSKTGISNTTGDIWLLGA 120

Query: 643  CYNVAQDDSS-DPTQSEGFASFVEDFTSRILITYRKGFAPIEDSKYTSDVNWGCMIRSSQ 819
            CY ++QD+SS D   +   A+F  DF+SRILITYRKGF  IEDSK TSDV+WGCM+RSSQ
Sbjct: 121  CYKISQDNSSGDAAATNALAAFNHDFSSRILITYRKGFDAIEDSKLTSDVSWGCMLRSSQ 180

Query: 820  MLIAQAFLFHLLGRSWRKTTNKPLEQNYSEILHLFGDSESSPCSIHNLLQAGKVYGLSPG 999
            ML+AQA LFH LGRSWRK  +KPL++ Y EILHLFGDSESS  SIHNLL+AGK YGL+ G
Sbjct: 181  MLVAQALLFHRLGRSWRKPLDKPLDREYVEILHLFGDSESSAFSIHNLLRAGKAYGLAAG 240

Query: 1000 SWVGPYAVCRTWEALMRNKKGTITGDLTS-SISMYVVSGDEDGERGGAPMVCIEDIGRHC 1176
            SWVGPYAVC +WE+L+R+++     +  S S+++YVVSG EDGERGGAP++CIE+  RHC
Sbjct: 241  SWVGPYAVCHSWESLVRSRREETNLEYQSLSMAVYVVSGSEDGERGGAPVLCIEEAARHC 300

Query: 1177 LEYSRGQAAWTXXXXXXXXXXXXDKLNPRYLPLLAATFCFPQSLGILGGRPGAS 1338
             E+S+GQ  WT            DK+NPRY+P L ATF FPQSLGILGG+PGAS
Sbjct: 301  SEFSKGQEDWTPILLLVPLVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGAS 354


>ref|XP_006441973.1| hypothetical protein CICLE_v10019906mg [Citrus clementina]
            gi|557544235|gb|ESR55213.1| hypothetical protein
            CICLE_v10019906mg [Citrus clementina]
          Length = 486

 Score =  382 bits (982), Expect = e-103
 Identities = 199/352 (56%), Positives = 246/352 (69%), Gaps = 7/352 (1%)
 Frame = +1

Query: 304  MKDLYERALSSKYSGEIPSENSSRIANSVSSEAGPSSREFHKPSLWSGFLVSAFSVFENC 483
            MK   E+A +SK   +   +  +R   SV SE G S  +  K SL S    SAFSVFE  
Sbjct: 1    MKGFREKAGASKCFSKSTPDTPNRSLASVGSEPGSSESKSSKGSLLSSLFNSAFSVFETY 60

Query: 484  SEAK-STNRVCNSKSYGWTATVKRIMNSGSMRRI----FGLNKTSFPSSKSEIWLLGVCY 648
            SE+  S  +  ++KS GWTA VKR++ +GSMRRI     G ++T   SS S+IWLLGVC+
Sbjct: 61   SESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCH 120

Query: 649  NVAQDDS-SDPTQSEGFASFVEDFTSRILITYRKGFAPIEDSKYTSDVNWGCMIRSSQML 825
             +AQD++  D   + G A F +DF+SRILI+YRKGF PI DSK TSDV WGCM+RSSQML
Sbjct: 121  KIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 180

Query: 826  IAQAFLFHLLGRSWRKTTNKPLEQNYSEILHLFGDSESSPCSIHNLLQAGKVYGLSPGSW 1005
            +AQA LFH LGR WRK   KP ++ Y EILHLFGDSE+SP SIHNLLQAGK YGL+ GSW
Sbjct: 181  VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW 240

Query: 1006 VGPYAVCRTWEALMRNKKG-TITGDLTSSISMYVVSGDEDGERGGAPMVCIEDIGRHCLE 1182
            VGPYA+CR+WEAL R ++  T  G  +  +++YVVSGDEDGERGGAP+VCI+D  RHC  
Sbjct: 241  VGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 300

Query: 1183 YSRGQAAWTXXXXXXXXXXXXDKLNPRYLPLLAATFCFPQSLGILGGRPGAS 1338
            +S+GQA WT            +K+NPRY+P L  TF FPQSLGI+GG+PGAS
Sbjct: 301  FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352


>ref|XP_006478507.1| PREDICTED: cysteine protease ATG4-like isoform X1 [Citrus sinensis]
          Length = 486

 Score =  382 bits (980), Expect = e-103
 Identities = 198/352 (56%), Positives = 246/352 (69%), Gaps = 7/352 (1%)
 Frame = +1

Query: 304  MKDLYERALSSKYSGEIPSENSSRIANSVSSEAGPSSREFHKPSLWSGFLVSAFSVFENC 483
            MK   E+A +SK   +   +  +R   SV SE G S  +  K SL S    SAFSVFE  
Sbjct: 1    MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60

Query: 484  SEAKSTNR-VCNSKSYGWTATVKRIMNSGSMRRI----FGLNKTSFPSSKSEIWLLGVCY 648
            SE+ +  +   ++KS GWTA VKR++ +GSMRRI     G ++T   SS S+IWLLGVC+
Sbjct: 61   SESSANEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCH 120

Query: 649  NVAQDDS-SDPTQSEGFASFVEDFTSRILITYRKGFAPIEDSKYTSDVNWGCMIRSSQML 825
             +AQD++  D   + G A F +DF+SRILI+YRKGF PI DSK TSDV WGCM+RSSQML
Sbjct: 121  KIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 180

Query: 826  IAQAFLFHLLGRSWRKTTNKPLEQNYSEILHLFGDSESSPCSIHNLLQAGKVYGLSPGSW 1005
            +AQA LFH LGR WRK   KP ++ Y EILHLFGDSE+SP SIHNLLQAGK YGL+ GSW
Sbjct: 181  VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW 240

Query: 1006 VGPYAVCRTWEALMRNKKG-TITGDLTSSISMYVVSGDEDGERGGAPMVCIEDIGRHCLE 1182
            VGPYA+CR+WEAL R ++  T  G  +  +++YVVSGDEDGERGGAP+VCI+D  RHC  
Sbjct: 241  VGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 300

Query: 1183 YSRGQAAWTXXXXXXXXXXXXDKLNPRYLPLLAATFCFPQSLGILGGRPGAS 1338
            +S+GQA WT            +K+NPRY+P L  TF FPQSLGI+GG+PGAS
Sbjct: 301  FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352


>gb|ESW11661.1| hypothetical protein PHAVU_008G048900g [Phaseolus vulgaris]
            gi|561012801|gb|ESW11662.1| hypothetical protein
            PHAVU_008G048900g [Phaseolus vulgaris]
          Length = 489

 Score =  380 bits (976), Expect = e-103
 Identities = 194/353 (54%), Positives = 248/353 (70%), Gaps = 7/353 (1%)
 Frame = +1

Query: 301  VMKDLYERALSSKYSGEIPSENSSRIANSVSSEAGPSSREFHKPSLWSGFLVSAFSVFEN 480
            V+K L ER ++SK + +  +E           +AG S  +F K SLWS    S FSV E+
Sbjct: 2    VLKSLCERIVASKCTSKSSTETVDCTQVPAYLKAGSSDSKFPKVSLWSSIFTSGFSVAES 61

Query: 481  CSEAK-STNRVCNSKSYGWTATVKRIMNSGSMRR----IFGLNKTSFPSSKSEIWLLGVC 645
             SE+  S  +  +S+S GW A V++++ SGSMRR    + G ++T   SS  +IWLLGVC
Sbjct: 62   FSESSASEKKAVHSRSSGWAAAVRKVVTSGSMRRFHERVLGSSRTDISSSDGDIWLLGVC 121

Query: 646  YNVAQDDSSDPTQ-SEGFASFVEDFTSRILITYRKGFAPIEDSKYTSDVNWGCMIRSSQM 822
            + ++Q +SS     S G ASF +DF+S+IL+TYRKGF  I DSKYTSDVNWGCM+RSSQM
Sbjct: 122  HKISQQESSGGVDTSNGLASFEQDFSSKILVTYRKGFDAIGDSKYTSDVNWGCMLRSSQM 181

Query: 823  LIAQAFLFHLLGRSWRKTTNKPLEQNYSEILHLFGDSESSPCSIHNLLQAGKVYGLSPGS 1002
            L+AQA +FH LGRSWRKT +KP+++ Y +IL LFGDSE+S  SIHNLLQAGK YGL+ GS
Sbjct: 182  LVAQALVFHKLGRSWRKTVDKPVDKEYLKILQLFGDSEASAFSIHNLLQAGKGYGLAVGS 241

Query: 1003 WVGPYAVCRTWEALMR-NKKGTITGDLTSSISMYVVSGDEDGERGGAPMVCIEDIGRHCL 1179
            WVGPYA+CRTWE L R  ++    G+    +++YVVSGDEDGERGGAP+ CIED  +HC 
Sbjct: 242  WVGPYAMCRTWEVLARIQREKNDLGEPPLPMAIYVVSGDEDGERGGAPVFCIEDAFKHCS 301

Query: 1180 EYSRGQAAWTXXXXXXXXXXXXDKLNPRYLPLLAATFCFPQSLGILGGRPGAS 1338
            E+SRG AAWT            DK+NPRY+PLL +TF FPQSLGI+GG+PGAS
Sbjct: 302  EFSRGLAAWTPLLLLVPLVLGLDKVNPRYIPLLHSTFKFPQSLGIMGGKPGAS 354


>ref|XP_003624282.1| Cysteine protease ATG4 [Medicago truncatula]
            gi|147742964|sp|A2Q1V6.1|ATG4_MEDTR RecName:
            Full=Cysteine protease ATG4; AltName:
            Full=Autophagy-related protein 4
            gi|124359485|gb|ABN05923.1| Peptidase C54 [Medicago
            truncatula] gi|355499297|gb|AES80500.1| Cysteine protease
            ATG4 [Medicago truncatula]
          Length = 487

 Score =  380 bits (976), Expect = e-103
 Identities = 195/353 (55%), Positives = 249/353 (70%), Gaps = 7/353 (1%)
 Frame = +1

Query: 301  VMKDLYERALSSKYSGEIPSENSSRIANSVSSEAGPSSREFHKPSLWSGFLVSAFSVFEN 480
            V+KDL +R +++K S +  +E         SS+AG S  +F K SLWS F  S FSV E 
Sbjct: 2    VLKDLCDRIVAAKCSSKSSTEIVDNTQVPASSKAGSSDSKFPKASLWSTFFTSGFSVDET 61

Query: 481  CSEAKSTNR-VCNSKSYGWTATVKRIMNSGSMRR----IFGLNKTSFPSSKSEIWLLGVC 645
             SE+ S+ +   +S++ GW A V+++++ GSMRR    + G  +T   SS  +IWLLGVC
Sbjct: 62   YSESSSSEKKTVHSRNSGWAAAVRKVVSGGSMRRFQERVLGSCRTDVSSSDGDIWLLGVC 121

Query: 646  YNVAQDDSSDPTQSEG-FASFVEDFTSRILITYRKGFAPIEDSKYTSDVNWGCMIRSSQM 822
            + ++Q +S+        FA+F +DF SRILITYRKGF  IEDSKYTSDVNWGCM+RSSQM
Sbjct: 122  HKISQHESTGDVDIRNVFAAFEQDFFSRILITYRKGFDAIEDSKYTSDVNWGCMLRSSQM 181

Query: 823  LIAQAFLFHLLGRSWRKTTNKPLEQNYSEILHLFGDSESSPCSIHNLLQAGKVYGLSPGS 1002
            L+AQA LFH LGRSWRKT +KP+++ Y +IL LFGDSE++  SIHNLLQAGK YGL+ GS
Sbjct: 182  LVAQALLFHKLGRSWRKTVDKPVDKEYIDILQLFGDSEAAAFSIHNLLQAGKGYGLAVGS 241

Query: 1003 WVGPYAVCRTWEALMRN-KKGTITGDLTSSISMYVVSGDEDGERGGAPMVCIEDIGRHCL 1179
            WVGPYA+CRTWE L RN ++    G+    +++YVVSGDEDGERGGAP+VCIED  + CL
Sbjct: 242  WVGPYAMCRTWEVLARNQREKNEQGEQLLPMAIYVVSGDEDGERGGAPVVCIEDACKRCL 301

Query: 1180 EYSRGQAAWTXXXXXXXXXXXXDKLNPRYLPLLAATFCFPQSLGILGGRPGAS 1338
            E+SRG   WT            DK+N RY+PLL +TF FPQSLGILGG+PGAS
Sbjct: 302  EFSRGLVPWTPLLLLVPLVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGAS 354


>ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera]
            gi|296086874|emb|CBI33041.3| unnamed protein product
            [Vitis vinifera]
          Length = 486

 Score =  378 bits (970), Expect = e-102
 Identities = 193/355 (54%), Positives = 250/355 (70%), Gaps = 10/355 (2%)
 Frame = +1

Query: 304  MKDLYERALSSKYSGEIPSENSSRIANSVSSEAGPSSREFHKPSLWSGFLVSAFSVFENC 483
            MK   E+A++SK+S +  S++S+       SE   S  +  K SLWS    SAFSVFE  
Sbjct: 1    MKGFCEKAVASKFSCKTKSDSSN-------SEPQSSDTKLSKVSLWSSVFASAFSVFETN 53

Query: 484  SE----AKSTNRVCNSKSYGWTATVKRIMNSGSMRRI----FGLNKTSFPSSKSEIWLLG 639
            SE    A     + N ++ GWT  V++++   SMRRI     G +KT   SS S+IWLLG
Sbjct: 54   SESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLG 113

Query: 640  VCYNVAQDDSSD-PTQSEGFASFVEDFTSRILITYRKGFAPIEDSKYTSDVNWGCMIRSS 816
            +CY ++Q++SS+  + S G A F +DF+SRIL+TYRKGF  I DSK TSDVNWGCM+RSS
Sbjct: 114  LCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSS 173

Query: 817  QMLIAQAFLFHLLGRSWRKTTNKPLEQNYSEILHLFGDSESSPCSIHNLLQAGKVYGLSP 996
            QML+AQA L H +GRSWRKT++KP++Q+Y EILH FGDS++S  SIHN+LQAGK YGL+ 
Sbjct: 174  QMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAA 233

Query: 997  GSWVGPYAVCRTWEALMRNKKGTITGDLTS-SISMYVVSGDEDGERGGAPMVCIEDIGRH 1173
            GSWVGPYA+CR+WE L R+K+     +  S  +++Y+VSGDEDGERGGAP+V IE+  RH
Sbjct: 234  GSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293

Query: 1174 CLEYSRGQAAWTXXXXXXXXXXXXDKLNPRYLPLLAATFCFPQSLGILGGRPGAS 1338
            CLE+S+GQ  WT            +K+NPRY+P LAATF FPQSLGILGG+PGAS
Sbjct: 294  CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGAS 348


>emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
          Length = 489

 Score =  378 bits (970), Expect = e-102
 Identities = 193/355 (54%), Positives = 250/355 (70%), Gaps = 10/355 (2%)
 Frame = +1

Query: 304  MKDLYERALSSKYSGEIPSENSSRIANSVSSEAGPSSREFHKPSLWSGFLVSAFSVFENC 483
            MK   E+A++SK+S +  S++S+       SE   S  +  K SLWS    SAFSVFE  
Sbjct: 1    MKGFCEKAVASKFSCKTKSDSSN-------SEPQSSDTKLSKVSLWSSVFASAFSVFETN 53

Query: 484  SE----AKSTNRVCNSKSYGWTATVKRIMNSGSMRRI----FGLNKTSFPSSKSEIWLLG 639
            SE    A     + N ++ GWT  V++++   SMRRI     G +KT   SS S+IWLLG
Sbjct: 54   SESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLG 113

Query: 640  VCYNVAQDDSSD-PTQSEGFASFVEDFTSRILITYRKGFAPIEDSKYTSDVNWGCMIRSS 816
            +CY ++Q++SS+  + S G A F +DF+SRIL+TYRKGF  I DSK TSDVNWGCM+RSS
Sbjct: 114  LCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSS 173

Query: 817  QMLIAQAFLFHLLGRSWRKTTNKPLEQNYSEILHLFGDSESSPCSIHNLLQAGKVYGLSP 996
            QML+AQA L H +GRSWRKT++KP++Q+Y EILH FGDS++S  SIHN+LQAGK YGL+ 
Sbjct: 174  QMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAA 233

Query: 997  GSWVGPYAVCRTWEALMRNKKGTITGDLTS-SISMYVVSGDEDGERGGAPMVCIEDIGRH 1173
            GSWVGPYA+CR+WE L R+K+     +  S  +++Y+VSGDEDGERGGAP+V IE+  RH
Sbjct: 234  GSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293

Query: 1174 CLEYSRGQAAWTXXXXXXXXXXXXDKLNPRYLPLLAATFCFPQSLGILGGRPGAS 1338
            CLE+S+GQ  WT            +K+NPRY+P LAATF FPQSLGILGG+PGAS
Sbjct: 294  CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGAS 348


>ref|XP_004492910.1| PREDICTED: cysteine protease ATG4-like isoform X1 [Cicer arietinum]
          Length = 490

 Score =  377 bits (969), Expect = e-102
 Identities = 192/354 (54%), Positives = 250/354 (70%), Gaps = 8/354 (2%)
 Frame = +1

Query: 301  VMKDLYERALSSKYSGEIPSENSSRIANSVSSEAGPSSREFHKPSLWSGFLVSAFSVFEN 480
            V+K   ER +++K S +  ++         SS+AG S  +F K SLWS F  S FSV E 
Sbjct: 2    VLKGFCERIVAAKCSAKSSTDTVDNTQVPASSKAGSSDSKFPKASLWSTFFTSGFSVDET 61

Query: 481  CSEAK-STNRVCNSKSYGWTATVKRIMNSG-SMRR----IFGLNKTSFPSSKSEIWLLGV 642
             SE+  S  +   S++ GW A V++++++G SMRR    + G ++T    S  ++WLLGV
Sbjct: 62   YSESSASEKKAVYSRNSGWAAAVRKVVSAGGSMRRFQERVLGSSRTDISCSDGDVWLLGV 121

Query: 643  CYNVAQDDSSDPTQSEG-FASFVEDFTSRILITYRKGFAPIEDSKYTSDVNWGCMIRSSQ 819
            C+ ++Q +S+    +   FA+F +DF S+IL+TYRKGF  IEDSKYTSDVNWGCM+RSSQ
Sbjct: 122  CHKISQQESTGDVDTRNVFAAFEQDFFSKILVTYRKGFDAIEDSKYTSDVNWGCMLRSSQ 181

Query: 820  MLIAQAFLFHLLGRSWRKTTNKPLEQNYSEILHLFGDSESSPCSIHNLLQAGKVYGLSPG 999
            ML+AQA LFH LGRSWRKTT+KP+++ Y +IL LFGDSE++  SIHNLLQAGK YGL+ G
Sbjct: 182  MLVAQALLFHKLGRSWRKTTDKPVDKEYIDILQLFGDSEAAAFSIHNLLQAGKGYGLAVG 241

Query: 1000 SWVGPYAVCRTWEALMRNKKGT-ITGDLTSSISMYVVSGDEDGERGGAPMVCIEDIGRHC 1176
            SWVGPYA+CRTWE L RN++GT   G+ T  +++YVVSGDEDGERGGAP+VCIED  + C
Sbjct: 242  SWVGPYAMCRTWEVLARNQRGTNEIGEQTLPMAIYVVSGDEDGERGGAPVVCIEDASKCC 301

Query: 1177 LEYSRGQAAWTXXXXXXXXXXXXDKLNPRYLPLLAATFCFPQSLGILGGRPGAS 1338
             E+SRG   WT            DK+N RY+PLL +TF FPQSLGILGG+PGAS
Sbjct: 302  SEFSRGLVPWTPLLLLVPLVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGAS 355


>gb|EPS69655.1| hypothetical protein M569_05108, partial [Genlisea aurea]
          Length = 403

 Score =  371 bits (952), Expect = e-100
 Identities = 184/272 (67%), Positives = 210/272 (77%), Gaps = 3/272 (1%)
 Frame = +1

Query: 532  WTATVKR-IMNSGSMRRIFGLNKTSFPSSKSEIWLLGVCYNVAQ--DDSSDPTQSEGFAS 702
            W A V+R IMN GSMRRI+G  +    +SK +IWLLGVCY V    DDSSDPTQSEGFA+
Sbjct: 14   WRAAVRRVIMNGGSMRRIWGFGRMWASASKGDIWLLGVCYQVFHEGDDSSDPTQSEGFAA 73

Query: 703  FVEDFTSRILITYRKGFAPIEDSKYTSDVNWGCMIRSSQMLIAQAFLFHLLGRSWRKTTN 882
            FVED +SRI ITYR+GF PIE+SKY SD NWGCM+RSSQML+AQAFLFH LGRSWRKT+N
Sbjct: 74   FVEDLSSRIWITYRRGFLPIENSKYCSDANWGCMLRSSQMLVAQAFLFHKLGRSWRKTSN 133

Query: 883  KPLEQNYSEILHLFGDSESSPCSIHNLLQAGKVYGLSPGSWVGPYAVCRTWEALMRNKKG 1062
            +P E  Y EIL LFGDSE SPCSIHNLLQ GK YGL+PGSWVGPYA+CR WE LMR    
Sbjct: 134  QPHE--YIEILQLFGDSEESPCSIHNLLQVGKAYGLAPGSWVGPYAMCRAWECLMRY--- 188

Query: 1063 TITGDLTSSISMYVVSGDEDGERGGAPMVCIEDIGRHCLEYSRGQAAWTXXXXXXXXXXX 1242
            T  G LTS +++YVVSGD DGERGGAP++C+ED+ R C E+ RGQ  W            
Sbjct: 189  TDCGFLTSMMTLYVVSGDGDGERGGAPVLCVEDVSRRCSEFGRGQDNWAPVLLLVPLVLG 248

Query: 1243 XDKLNPRYLPLLAATFCFPQSLGILGGRPGAS 1338
             +K+NPRYLPLL+ATF FPQSLGILGGRPG S
Sbjct: 249  LEKVNPRYLPLLSATFTFPQSLGILGGRPGVS 280


>ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus communis]
            gi|223531343|gb|EEF33181.1| Cysteine protease ATG4B,
            putative [Ricinus communis]
          Length = 489

 Score =  369 bits (948), Expect = 1e-99
 Identities = 191/352 (54%), Positives = 244/352 (69%), Gaps = 7/352 (1%)
 Frame = +1

Query: 304  MKDLYERALSSKYSGEIPSENSSRIANSVSSEAGPSSREFHKPSLWSGFLVSAFSVFENC 483
            MK   ER ++S+ S + P +  +R   S   E+G  S    K SLWS F  SAFSVFE  
Sbjct: 1    MKGFRER-VASRCSSKCPVDTPNRSLTSDCLESG--SNFSTKGSLWSSFFASAFSVFETY 57

Query: 484  SEAK--STNRVCNSKSYGWTATVKRIMNSGSMRRI----FGLNKTSFPSSKSEIWLLGVC 645
             E+   S  +  +S+  GWT+ VK+I++ GSMRRI     G ++T   S+ S+IWLLGVC
Sbjct: 58   RESPPASEKKGSHSRHNGWTSAVKKIVSGGSMRRIHERVLGPSRTGISSTTSDIWLLGVC 117

Query: 646  YNVAQDDSSDPTQSEGFASFVEDFTSRILITYRKGFAPIEDSKYTSDVNWGCMIRSSQML 825
            Y +++D+S +       A F  D++SRIL+TYR+GF  I DSKY SDV WGCM+RSSQML
Sbjct: 118  YKISEDESGNADTGNALAEFTHDYSSRILMTYRRGFDAIGDSKYISDVGWGCMLRSSQML 177

Query: 826  IAQAFLFHLLGRSWRKTTNKPLEQNYSEILHLFGDSESSPCSIHNLLQAGKVYGLSPGSW 1005
            +AQA LFH LGR+W K   KP++Q Y EILHLFGDSE++P SIHNL+QAGK Y L+ GSW
Sbjct: 178  VAQALLFHKLGRAWTKPFQKPMDQAYVEILHLFGDSEAAPFSIHNLIQAGKAYSLAAGSW 237

Query: 1006 VGPYAVCRTWEALMRNKKGTITGDLTS-SISMYVVSGDEDGERGGAPMVCIEDIGRHCLE 1182
            VGPYA+CR+WE+L R+K+   + +  S  +++YVVSGDEDGERGGAP+V IED  RHCLE
Sbjct: 238  VGPYAMCRSWESLARSKREENSLEYQSLPMAVYVVSGDEDGERGGAPVVYIEDASRHCLE 297

Query: 1183 YSRGQAAWTXXXXXXXXXXXXDKLNPRYLPLLAATFCFPQSLGILGGRPGAS 1338
            +SRGQA WT            DK+NPRY+P L ATF F QSLGI+GG+PGAS
Sbjct: 298  FSRGQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFSQSLGIMGGKPGAS 349


>ref|XP_002309707.1| autophagy 4b family protein [Populus trichocarpa]
            gi|222852610|gb|EEE90157.1| autophagy 4b family protein
            [Populus trichocarpa]
          Length = 481

 Score =  367 bits (943), Expect = 5e-99
 Identities = 190/327 (58%), Positives = 237/327 (72%), Gaps = 9/327 (2%)
 Frame = +1

Query: 385  SVSSEAGPSSREFHKPSLWSGFLVSAFSVFENCSEAKST--NRVCNSK-SYGWTATVKRI 555
            S SSE G +  +  KPSLWS F  SAFSVF+   ++ ST  N   + + S GWT++VK+I
Sbjct: 27   SDSSEPGSTDTKVSKPSLWSSFFASAFSVFDIYRDSSSTSHNEAPHIRHSNGWTSSVKKI 86

Query: 556  MNSGSMRRI----FGLNKTSFPSSKSEIWLLGVCYNVAQDDSS-DPTQSEGFASFVEDFT 720
            +  G+MRRI     G +KT   ++ S+IWLLG  Y ++QDDSS +   +   A+F  DF+
Sbjct: 87   VAGGTMRRIQERVLGTSKTGISNTTSDIWLLGARYKISQDDSSGNADATNALAAFHRDFS 146

Query: 721  SRILITYRKGFAPIEDSKYTSDVNWGCMIRSSQMLIAQAFLFHLLGRSWRKTTNKPLEQN 900
            SRILITYRKGF  IEDSK TSDVNWGCM+RSSQML+AQA LFH LGRSWRK  +KPL+++
Sbjct: 147  SRILITYRKGFDMIEDSKLTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKPVDKPLDRD 206

Query: 901  YSEILHLFGDSESSPCSIHNLLQAGKVYGLSPGSWVGPYAVCRTWEALMRNKKGTITGDL 1080
            Y EILHLFGDSE+S  SIHNLLQAGK YGL+ GSWVGPYA+CR+WE+L R+K+     + 
Sbjct: 207  YVEILHLFGDSEASAFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWESLARSKREETNLEY 266

Query: 1081 -TSSISMYVVSGDEDGERGGAPMVCIEDIGRHCLEYSRGQAAWTXXXXXXXXXXXXDKLN 1257
             T  +++YVVSG EDGERGGAP++ IED  RHC E+S+G+  WT            DK+N
Sbjct: 267  QTLPMAVYVVSGCEDGERGGAPVLSIEDAARHCSEFSKGREDWTPILLLVPLVLGLDKIN 326

Query: 1258 PRYLPLLAATFCFPQSLGILGGRPGAS 1338
            PRY+P L ATF FPQSLGILGG+PGAS
Sbjct: 327  PRYIPSLQATFTFPQSLGILGGKPGAS 353


>gb|EOX94074.1| Peptidase family C54 protein isoform 3 [Theobroma cacao]
          Length = 486

 Score =  366 bits (940), Expect = 1e-98
 Identities = 194/355 (54%), Positives = 245/355 (69%), Gaps = 10/355 (2%)
 Frame = +1

Query: 304  MKDLYERALSSKYSGEIPSENSSRIANSVSSEAGPSSREFHKPSLWSGFLVSAFSVFENC 483
            MK  +ER ++ K   +  S +SS  + S  SE GPS  +F K S+WS    SAFS+F+  
Sbjct: 1    MKGFHERNVALKCPSK-SSIDSSHSSPSSGSEPGPSDCKFSKSSVWSNLFASAFSIFDTY 59

Query: 484  SEAKSTNR-VCNSKSYGWTATVKRIMNSGSMRRI----FGLNKTSFPSSKSEIWLLGVCY 648
            SE+ +  +   ++++ GWTA VKR+++ GSMRRI     G +K    SS S+IWLLGVCY
Sbjct: 60   SESSACEKKALHARNNGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIWLLGVCY 119

Query: 649  NVAQDDSS-DPTQSEGFASFVEDFTSRILITYRKGFAPIEDSKYTSDVNWGCMIRSSQML 825
             ++Q  SS D   S G A+F  DF+SRIL+TYRKGF  I D+K TSD  WGCM+RSSQML
Sbjct: 120  KISQVSSSGDVDASNGLAAFKRDFSSRILMTYRKGFDAIGDTKITSDFGWGCMLRSSQML 179

Query: 826  IAQ-AFLFHLLGRSWRKTTNKPLEQNYSEILHLFGDSESSPCSIHNLLQAGKVYGLSPGS 1002
            +AQ A LFH LGRSWRK   KP EQ Y EILH FGDSE++  SIHNL++AGK+YGL+ GS
Sbjct: 180  VAQQALLFHQLGRSWRKPLQKPFEQAYIEILHQFGDSEATAFSIHNLVEAGKIYGLAAGS 239

Query: 1003 WVGPYAVCRTWEALMRNKKGTITGDL---TSSISMYVVSGDEDGERGGAPMVCIEDIGRH 1173
            WVGPYA+CR+WE+L R K+     DL   +  +++YVVSGDEDGERGGAP+VC+ED  RH
Sbjct: 240  WVGPYAMCRSWESLARFKRE--ENDLEHQSLPMAVYVVSGDEDGERGGAPVVCVEDASRH 297

Query: 1174 CLEYSRGQAAWTXXXXXXXXXXXXDKLNPRYLPLLAATFCFPQSLGILGGRPGAS 1338
            C E+SR +A WT            DK+N RY+P L ATF FPQ LGILGG+PGAS
Sbjct: 298  CFEFSRCRADWTPILLLVPLVLGLDKVNSRYIPSLQATFTFPQCLGILGGKPGAS 352


>ref|XP_004138950.1| PREDICTED: cysteine protease ATG4-like [Cucumis sativus]
            gi|449512710|ref|XP_004164121.1| PREDICTED: cysteine
            protease ATG4-like [Cucumis sativus]
          Length = 483

 Score =  361 bits (926), Expect = 4e-97
 Identities = 184/343 (53%), Positives = 234/343 (68%), Gaps = 6/343 (1%)
 Frame = +1

Query: 328  LSSKYSGEIPSENSSRIANSVSSEAGPSSREFHKPSLWSGFLVSAFSVFENCSEAKSTNR 507
            L S  S E  ++   R   SV  E G  +    K S WSGF  S FS+FE+  ++  T +
Sbjct: 7    LKSTCSPEPAADAIDRTHRSVYPELGSKNHISSKASSWSGFFSSNFSIFEHHKDSSVTEK 66

Query: 508  VCNSKSYGWTATVKRIMNSGSMRRI----FGLNKTSFPSSKSEIWLLGVCYNVAQDDS-S 672
                  +   ATV+++M SGSMRRI     G  ++   SS  +IWLLGVC+ ++QD    
Sbjct: 67   KVFHPRHNVWATVRKVMTSGSMRRIQERLLGSRRSGVYSSGGDIWLLGVCHKISQDHPPD 126

Query: 673  DPTQSEGFASFVEDFTSRILITYRKGFAPIEDSKYTSDVNWGCMIRSSQMLIAQAFLFHL 852
            D   S G A + +DF+SRIL+TYRKGF  I+DSKYTSDVNWGCM+RSSQML+AQA LFH 
Sbjct: 127  DAASSPGVAGYEQDFSSRILMTYRKGFHVIQDSKYTSDVNWGCMLRSSQMLVAQALLFHR 186

Query: 853  LGRSWRKTTNKPLEQNYSEILHLFGDSESSPCSIHNLLQAGKVYGLSPGSWVGPYAVCRT 1032
            LGRSWRK + KPL++ Y EILHLFGDSE+S  SIHNLLQAG+ Y L+ GSWVGPYA+CR+
Sbjct: 187  LGRSWRKPSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRAYDLAAGSWVGPYAMCRS 246

Query: 1033 WEALMRNKKGT-ITGDLTSSISMYVVSGDEDGERGGAPMVCIEDIGRHCLEYSRGQAAWT 1209
            WE L+R+K+ T I  D    +++Y+VSGDEDGERGGAP++ I+D  RHC E+S+GQ  W+
Sbjct: 247  WETLVRSKRETPILQDQQLPMAIYIVSGDEDGERGGAPVLYIDDASRHCFEFSKGQHDWS 306

Query: 1210 XXXXXXXXXXXXDKLNPRYLPLLAATFCFPQSLGILGGRPGAS 1338
                        +K+NPRY+P L  TF FPQSLGILGG+PGAS
Sbjct: 307  PILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGAS 349


>gb|EOX94072.1| Peptidase family C54 protein isoform 1 [Theobroma cacao]
          Length = 514

 Score =  356 bits (914), Expect = 1e-95
 Identities = 193/376 (51%), Positives = 244/376 (64%), Gaps = 31/376 (8%)
 Frame = +1

Query: 304  MKDLYERALSSKYSGEIPSENSSRIANSVSSEAGPSSREFHKPSLWSGFLVSAFSVFENC 483
            MK  +ER ++ K   +  S +SS  + S  SE GPS  +F K S+WS    SAFS+F+  
Sbjct: 1    MKGFHERNVALKCPSK-SSIDSSHSSPSSGSEPGPSDCKFSKSSVWSNLFASAFSIFDTY 59

Query: 484  SEAKSTNR-VCNSKSYGWTATVKRIMNSGSMRRI----FGLNKTSFPSSKSEIWLLGVCY 648
            SE+ +  +   ++++ GWTA VKR+++ GSMRRI     G +K    SS S+IWLLGVCY
Sbjct: 60   SESSACEKKALHARNNGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIWLLGVCY 119

Query: 649  NVAQDDSS-DPTQSEGFASFVEDFTSRILITYRKGFAPIEDSKYTSDVNWGCMIRSSQML 825
             ++Q  SS D   S G A+F  DF+SRIL+TYRKGF  I D+K TSD  WGCM+RSSQML
Sbjct: 120  KISQVSSSGDVDASNGLAAFKRDFSSRILMTYRKGFDAIGDTKITSDFGWGCMLRSSQML 179

Query: 826  IAQAFLFHLLGRSWRKTTNKPLEQNYSEILHLFGDSESSPCSIHNLLQAGKVYGLSPGSW 1005
            +AQA LFH LGRSWRK   KP EQ Y EILH FGDSE++  SIHNL++AGK+YGL+ GSW
Sbjct: 180  VAQALLFHQLGRSWRKPLQKPFEQAYIEILHQFGDSEATAFSIHNLVEAGKIYGLAAGSW 239

Query: 1006 VGPYAVCRTWEALMRNKKGTITGDL---TSSISMYVVSGDEDGERGGAPMVCIEDIGRHC 1176
            VGPYA+CR+WE+L R K+     DL   +  +++YVVSGDEDGERGGAP+VC+ED  RHC
Sbjct: 240  VGPYAMCRSWESLARFKRE--ENDLEHQSLPMAVYVVSGDEDGERGGAPVVCVEDASRHC 297

Query: 1177 LEYSRGQAAWTXXXXXXXXXXXXDKLNP----------------------RYLPLLAATF 1290
             E+SR +A WT            DK+N                        Y+P L ATF
Sbjct: 298  FEFSRCRADWTPILLLVPLVLGLDKVNSSFCKEDSTFETEGELHLDFAYLEYIPSLQATF 357

Query: 1291 CFPQSLGILGGRPGAS 1338
             FPQ LGILGG+PGAS
Sbjct: 358  TFPQCLGILGGKPGAS 373


>gb|EXB53615.1| hypothetical protein L484_005165 [Morus notabilis]
          Length = 444

 Score =  345 bits (885), Expect = 2e-92
 Identities = 176/309 (56%), Positives = 216/309 (69%), Gaps = 12/309 (3%)
 Frame = +1

Query: 448  FLVSAFSVFENCSEAKST---NRVCNSKSYGWTATVKRIMNSGSMRR----IFGLNKTSF 606
            FL SAFS FE  S ++      +   S+  GWTA V++ ++ GSMRR    I G  +T  
Sbjct: 4    FLSSAFSAFEARSSSEPPVVEKKAIRSRFNGWTAAVRKAVSVGSMRRFHERILGYARTGV 63

Query: 607  PSSKSEIWLLGVCYNVAQDDSSD--PTQSEGFASFVEDFTSRILITYRKGFAPIEDSKYT 780
             SS S+IWLLGVCY ++QD+ S   P  + G A F +DF+SRIL+TYRKGF  I DSKYT
Sbjct: 64   SSSTSDIWLLGVCYKISQDEPSVDLPAANSGLADFEQDFSSRILMTYRKGFGAIGDSKYT 123

Query: 781  SDVNWGCMIRSSQMLIAQAFLFHLLGRSWRKTTNKPLEQNYSEILHLFGDSESSPCSIHN 960
            SDVNWGCM+RSSQML+AQA LFH LGR WR+    PL+Q Y +IL+ F DSE S  SIHN
Sbjct: 124  SDVNWGCMLRSSQMLVAQALLFHRLGRCWRRPVQSPLDQEYIDILNHFDDSEESAFSIHN 183

Query: 961  LLQAGKVYGLSPGSWVGPYAVCRTWEALMRNKKGTITGDLTS---SISMYVVSGDEDGER 1131
            LLQAGK Y L+ GSW+GPYA+CRTWE L+R+K+     D  +    +++Y+VSGDEDGER
Sbjct: 184  LLQAGKAYDLTAGSWMGPYAMCRTWETLVRSKRE--ENDFENHPLPMAVYIVSGDEDGER 241

Query: 1132 GGAPMVCIEDIGRHCLEYSRGQAAWTXXXXXXXXXXXXDKLNPRYLPLLAATFCFPQSLG 1311
            GGAP+VC+ED  RHCLE+SRGQA WT            D +NPRY+P L  TF FPQSLG
Sbjct: 242  GGAPVVCVEDAFRHCLEFSRGQANWTPMLLLVPLVLGLDTVNPRYIPSLRETFTFPQSLG 301

Query: 1312 ILGGRPGAS 1338
            I+GGRPGAS
Sbjct: 302  IMGGRPGAS 310


>ref|XP_004492911.1| PREDICTED: cysteine protease ATG4-like isoform X2 [Cicer arietinum]
          Length = 457

 Score =  342 bits (876), Expect = 3e-91
 Identities = 173/312 (55%), Positives = 221/312 (70%), Gaps = 22/312 (7%)
 Frame = +1

Query: 469  VFENCSEAKSTNRVCN---------------SKSYGWTATVKRIMNSG-SMRR----IFG 588
            V   CS   ST+ V N               S++ GW A V++++++G SMRR    + G
Sbjct: 11   VAAKCSAKSSTDTVDNTQVPASSKAGSSDIYSRNSGWAAAVRKVVSAGGSMRRFQERVLG 70

Query: 589  LNKTSFPSSKSEIWLLGVCYNVAQDDSSDPTQSEG-FASFVEDFTSRILITYRKGFAPIE 765
             ++T    S  ++WLLGVC+ ++Q +S+    +   FA+F +DF S+IL+TYRKGF  IE
Sbjct: 71   SSRTDISCSDGDVWLLGVCHKISQQESTGDVDTRNVFAAFEQDFFSKILVTYRKGFDAIE 130

Query: 766  DSKYTSDVNWGCMIRSSQMLIAQAFLFHLLGRSWRKTTNKPLEQNYSEILHLFGDSESSP 945
            DSKYTSDVNWGCM+RSSQML+AQA LFH LGRSWRKTT+KP+++ Y +IL LFGDSE++ 
Sbjct: 131  DSKYTSDVNWGCMLRSSQMLVAQALLFHKLGRSWRKTTDKPVDKEYIDILQLFGDSEAAA 190

Query: 946  CSIHNLLQAGKVYGLSPGSWVGPYAVCRTWEALMRNKKGT-ITGDLTSSISMYVVSGDED 1122
             SIHNLLQAGK YGL+ GSWVGPYA+CRTWE L RN++GT   G+ T  +++YVVSGDED
Sbjct: 191  FSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEVLARNQRGTNEIGEQTLPMAIYVVSGDED 250

Query: 1123 GERGGAPMVCIEDIGRHCLEYSRGQAAWTXXXXXXXXXXXXDKLNPRYLPLLAATFCFPQ 1302
            GERGGAP+VCIED  + C E+SRG   WT            DK+N RY+PLL +TF FPQ
Sbjct: 251  GERGGAPVVCIEDASKCCSEFSRGLVPWTPLLLLVPLVLGLDKVNLRYIPLLQSTFKFPQ 310

Query: 1303 SLGILGGRPGAS 1338
            SLGILGG+PGAS
Sbjct: 311  SLGILGGKPGAS 322


Top