BLASTX nr result

ID: Cornus23_contig00008525 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00008525
         (2233 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_009354662.1| PREDICTED: cysteine protease ATG4-like [Pyru...   665   0.0  
ref|XP_008232834.1| PREDICTED: cysteine protease ATG4 [Prunus mume]   656   0.0  
ref|XP_007217926.1| hypothetical protein PRUPE_ppa004885mg [Prun...   653   0.0  
ref|XP_008346919.1| PREDICTED: cysteine protease ATG4-like isofo...   649   0.0  
ref|XP_008375115.1| PREDICTED: cysteine protease ATG4 [Malus dom...   635   e-179
ref|XP_010093156.1| hypothetical protein L484_005165 [Morus nota...   621   e-175
ref|XP_003635099.1| PREDICTED: cysteine protease ATG4 isoform X2...   619   e-174
gb|KHG08139.1| Cysteine protease ATG4 [Gossypium arboreum]            618   e-174
emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]   615   e-173
ref|XP_010646415.1| PREDICTED: cysteine protease ATG4 isoform X1...   614   e-172
ref|XP_012435801.1| PREDICTED: cysteine protease ATG4 isoform X2...   612   e-172
ref|XP_007049917.1| Peptidase family C54 protein isoform 3 [Theo...   610   e-171
gb|KDO40316.1| hypothetical protein CISIN_1g011418mg [Citrus sin...   610   e-171
ref|XP_006441973.1| hypothetical protein CICLE_v10019906mg [Citr...   610   e-171
ref|XP_006478507.1| PREDICTED: cysteine protease ATG4-like isofo...   609   e-171
ref|XP_012081889.1| PREDICTED: cysteine protease ATG4-like isofo...   607   e-170
ref|XP_012435798.1| PREDICTED: cysteine protease ATG4 isoform X1...   605   e-170
ref|XP_002309707.1| autophagy 4b family protein [Populus trichoc...   597   e-167
ref|XP_006372315.1| autophagy 4b family protein [Populus trichoc...   595   e-167
ref|XP_007049915.1| Peptidase family C54 protein isoform 1 [Theo...   593   e-166

>ref|XP_009354662.1| PREDICTED: cysteine protease ATG4-like [Pyrus x bretschneideri]
            gi|694327605|ref|XP_009354663.1| PREDICTED: cysteine
            protease ATG4-like [Pyrus x bretschneideri]
          Length = 487

 Score =  665 bits (1716), Expect = 0.0
 Identities = 325/476 (68%), Positives = 373/476 (78%), Gaps = 27/476 (5%)
 Frame = -1

Query: 1687 YSCMSSTDSPNSNPGATCSQPGSSDS---------------------------CQKKAFR 1589
            YS  SSTDS +    + CS  GS DS                            +KK   
Sbjct: 13   YSSKSSTDSTDRGSSSACSDSGSRDSKHNKASLWTNFFASAFSIFETHSESSITEKKESH 72

Query: 1588 TRSNGWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWLLGVCYKISQEDLSGDPA 1409
            +R+NGWTAAV++V+  G MRR+HERVLG ++TGISS  SDIWLLGVCYK+SQ+D SGD  
Sbjct: 73   SRNNGWTAAVRKVVTSGSMRRIHERVLGSSRTGISS-ASDIWLLGVCYKVSQDDSSGDAP 131

Query: 1408 YSNGLAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLRSSQMLVAQALLFHRIGR 1229
             +NGL  F +DFSS+ILMTYRKGF+AI DSKYTSD NWGCMLRSSQMLVAQALLFHR+GR
Sbjct: 132  INNGLGAFEQDFSSKILMTYRKGFEAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGR 191

Query: 1228 SWRKPSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDLAAGLWVGPYAMCRTWET 1049
            SWR+P HKPLD+ Y EIL  FGDS  STFSIHNLLQAGK YDLAAG WVGPYAMCRTWET
Sbjct: 192  SWRRPLHKPLDEAYIEILYHFGDSETSTFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWET 251

Query: 1048 LVCCKRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFSRGLVDWSPIX 869
            LV C+R   +L+ Q LPMAVY+VSGDEDGERGGAPVVCIEDASRHC EFSRG VDW+PI 
Sbjct: 252  LVRCRREVTDLDDQPLPMAVYIVSGDEDGERGGAPVVCIEDASRHCLEFSRGQVDWTPIL 311

Query: 868  XXXXXXXXLDKVNPRYIPLLRATFTFPQSLGILGGRPGASTYIVGVQDEKAIYLDPHEVQ 689
                    L+KVNPRYIP LRATFTFPQSLGI+GG+PGASTYI+GVQDEKA+YLDPHEVQ
Sbjct: 312  LLVPLVLGLEKVNPRYIPSLRATFTFPQSLGIMGGKPGASTYIIGVQDEKALYLDPHEVQ 371

Query: 688  PAVDIRRDNLEADTSSYHCDLVRHIPLDSLDPSLAIGFYCRDKDDFDNFCFLASKLADES 509
            P ++IRRD+LEADT SYHC+++RHIPLD +DPSLAIGFYCRD+DDF++FCF ASKLADES
Sbjct: 372  PVINIRRDDLEADTLSYHCNVIRHIPLDLIDPSLAIGFYCRDRDDFNDFCFRASKLADES 431

Query: 508  NGAPLFTVTHTRTSSKPAANRDVLGDSGGVQEDGSFDMVPTSDAEGCTQEDDWQLI 341
            NGAPLFTVT T +  +P  + D LGDSG V+ D SF ++P SDA+G  QEDDWQL+
Sbjct: 432  NGAPLFTVTQTHSFPRPVNHSDALGDSGAVENDDSFSVLPMSDADGSAQEDDWQLL 487


>ref|XP_008232834.1| PREDICTED: cysteine protease ATG4 [Prunus mume]
          Length = 487

 Score =  656 bits (1693), Expect = 0.0
 Identities = 324/476 (68%), Positives = 367/476 (77%), Gaps = 27/476 (5%)
 Frame = -1

Query: 1687 YSCMSSTDSPNSNPGATCSQPGSSDS---------------------------CQKKAFR 1589
            YS  SST+S +  P + CS  GS DS                            +KK   
Sbjct: 13   YSSKSSTESTDRGPSSVCSDSGSRDSKHDKASLWSNFFASAFSIFETHSESSNTEKKEIH 72

Query: 1588 TRSNGWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWLLGVCYKISQEDLSGDPA 1409
            +R+NGWT AV++V+  G MRR+HERVLG ++TGISS  SDIWLLGV YK+SQ++ SGD A
Sbjct: 73   SRNNGWTEAVRKVVTGGSMRRIHERVLGSSRTGISS-ASDIWLLGVRYKVSQDEFSGDAA 131

Query: 1408 YSNGLAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLRSSQMLVAQALLFHRIGR 1229
             +NGL  F +DFSSRILMTYRKGFDAI DSKYTSD NWGCMLRSSQMLVAQALLFHR+GR
Sbjct: 132  TNNGLRAFEQDFSSRILMTYRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGR 191

Query: 1228 SWRKPSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDLAAGLWVGPYAMCRTWET 1049
            SWR+P HKPLD+ Y EIL  FGDS  S FSIHNLLQ+GK YDLAAG WVGPYAMCR+WET
Sbjct: 192  SWRRPLHKPLDEQYIEILHHFGDSEGSAFSIHNLLQSGKAYDLAAGSWVGPYAMCRSWET 251

Query: 1048 LVCCKRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFSRGLVDWSPIX 869
            LV CKR     + Q LPMAVY+VSGDEDGERGGAPVVCI+DASRHC EFSRG VDW+PI 
Sbjct: 252  LVRCKREGTAFDNQPLPMAVYIVSGDEDGERGGAPVVCIQDASRHCLEFSRGRVDWTPIL 311

Query: 868  XXXXXXXXLDKVNPRYIPLLRATFTFPQSLGILGGRPGASTYIVGVQDEKAIYLDPHEVQ 689
                    L+KVNPRYIP L ATFTFPQSLGI+GG+PGASTYI+GVQDEKA+YLDPHEVQ
Sbjct: 312  LLVPLVLGLEKVNPRYIPSLWATFTFPQSLGIMGGKPGASTYIIGVQDEKALYLDPHEVQ 371

Query: 688  PAVDIRRDNLEADTSSYHCDLVRHIPLDSLDPSLAIGFYCRDKDDFDNFCFLASKLADES 509
            PA++IRRD+LEADT SYHC+++RHIPLDS+DPSLAIGFYCRD+DDFD+FCF ASKLAD S
Sbjct: 372  PAINIRRDDLEADTLSYHCNVIRHIPLDSIDPSLAIGFYCRDRDDFDDFCFRASKLADGS 431

Query: 508  NGAPLFTVTHTRTSSKPAANRDVLGDSGGVQEDGSFDMVPTSDAEGCTQEDDWQLI 341
            NGAPLFTVT T    KP  + DVL DSGGVQ D SF   P SDA+G   EDDWQL+
Sbjct: 432  NGAPLFTVTETHNFPKPVNHSDVLDDSGGVQNDDSFVAPPISDADGSAHEDDWQLL 487


>ref|XP_007217926.1| hypothetical protein PRUPE_ppa004885mg [Prunus persica]
            gi|462414388|gb|EMJ19125.1| hypothetical protein
            PRUPE_ppa004885mg [Prunus persica]
          Length = 487

 Score =  653 bits (1684), Expect = 0.0
 Identities = 323/476 (67%), Positives = 366/476 (76%), Gaps = 27/476 (5%)
 Frame = -1

Query: 1687 YSCMSSTDSPNSNPGATCSQPGSSDS---------------------------CQKKAFR 1589
            YS  SST+S +  P + CS  GS DS                            +KK   
Sbjct: 13   YSSKSSTESTDRGPSSVCSDSGSRDSKHDKASLWSNFFASAFSIFETHSESSITEKKEIH 72

Query: 1588 TRSNGWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWLLGVCYKISQEDLSGDPA 1409
            +R+NGWT AV++V+  G MRR+HERVLG ++TGISS  SDIWLLGV YK+SQ++ SGD A
Sbjct: 73   SRNNGWTEAVRKVVTGGSMRRIHERVLGSSRTGISS-ASDIWLLGVLYKVSQDESSGDAA 131

Query: 1408 YSNGLAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLRSSQMLVAQALLFHRIGR 1229
             +NGL  F +DFSSRILMTYRKGFDAI DSKYTSD NWGCMLRSSQMLVAQALLFHR+GR
Sbjct: 132  TNNGLRAFEQDFSSRILMTYRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGR 191

Query: 1228 SWRKPSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDLAAGLWVGPYAMCRTWET 1049
            SWR+  HKPLD+ Y EIL  FGDS  S FSIHNLLQAGK YDLAAG WVGPYAMCR+WET
Sbjct: 192  SWRRTLHKPLDEQYIEILHHFGDSEGSAFSIHNLLQAGKAYDLAAGSWVGPYAMCRSWET 251

Query: 1048 LVCCKRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFSRGLVDWSPIX 869
            LV CKR     + Q LPMAVY+VSGDEDGERGGAPVVCI+DASRHC EFSRG VDW+PI 
Sbjct: 252  LVRCKREGTAFDNQPLPMAVYIVSGDEDGERGGAPVVCIQDASRHCLEFSRGRVDWTPIL 311

Query: 868  XXXXXXXXLDKVNPRYIPLLRATFTFPQSLGILGGRPGASTYIVGVQDEKAIYLDPHEVQ 689
                    L+KVNPRYIP L ATFTFPQSLGI+GG+PGASTYI+GVQDEKA+YLDPHEVQ
Sbjct: 312  LLVPLVLGLEKVNPRYIPSLWATFTFPQSLGIMGGKPGASTYIIGVQDEKALYLDPHEVQ 371

Query: 688  PAVDIRRDNLEADTSSYHCDLVRHIPLDSLDPSLAIGFYCRDKDDFDNFCFLASKLADES 509
            PA++IRRD+LEADT SYHC+++RHIPLDS+DPSLAIGFYCRD+DDFD+FCF ASKLAD S
Sbjct: 372  PAINIRRDDLEADTLSYHCNVIRHIPLDSIDPSLAIGFYCRDRDDFDDFCFRASKLADGS 431

Query: 508  NGAPLFTVTHTRTSSKPAANRDVLGDSGGVQEDGSFDMVPTSDAEGCTQEDDWQLI 341
            NGAPLFTVT +    KP  + DVL DSGGVQ D SF   P SDA+G   EDDWQL+
Sbjct: 432  NGAPLFTVTQSHNFPKPVNHSDVLDDSGGVQNDDSFVAPPISDADGSAHEDDWQLL 487


>ref|XP_008346919.1| PREDICTED: cysteine protease ATG4-like isoform X2 [Malus domestica]
          Length = 487

 Score =  649 bits (1673), Expect = 0.0
 Identities = 319/486 (65%), Positives = 369/486 (75%), Gaps = 27/486 (5%)
 Frame = -1

Query: 1717 DVIPCCVNPTYSCMSSTDSPNSNPGATCSQPGSSDS------------------------ 1610
            D     V   YS  SSTDS +    + CS  GS DS                        
Sbjct: 3    DFCETAVASKYSSKSSTDSTDRGSSSACSDSGSRDSKHNKASLWSNFFESAFSIFETHSE 62

Query: 1609 ---CQKKAFRTRSNGWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWLLGVCYKI 1439
                 KK   +R+NGWTAAV++ ++ G MRR+ E VLG ++ GISS  SDIWLLGVCYK+
Sbjct: 63   SSITDKKESHSRNNGWTAAVRKAVSSGSMRRIQEHVLGSSRIGISS-ASDIWLLGVCYKV 121

Query: 1438 SQEDLSGDPAYSNGLAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLRSSQMLVA 1259
            SQ+D SGD   +NGL  F +DFSSRILMTYRKGF+AI +SKYTSD NWGCMLRSSQMLVA
Sbjct: 122  SQDDSSGDAPINNGLGAFEQDFSSRILMTYRKGFEAIGNSKYTSDVNWGCMLRSSQMLVA 181

Query: 1258 QALLFHRIGRSWRKPSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDLAAGLWVG 1079
            QALLFHR+GRSW +P HKPLD+ Y  IL  FGDS  STFSIHNLLQAG+ YDLAAG WVG
Sbjct: 182  QALLFHRLGRSWTRPLHKPLDEAYIGILYHFGDSETSTFSIHNLLQAGRAYDLAAGSWVG 241

Query: 1078 PYAMCRTWETLVCCKRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFS 899
            PYAMCRTWETLV C+R   +L+ Q LPMAVY+VSGDEDGERGGAPVVCIEDASRHC EFS
Sbjct: 242  PYAMCRTWETLVRCRREATDLDDQPLPMAVYIVSGDEDGERGGAPVVCIEDASRHCLEFS 301

Query: 898  RGLVDWSPIXXXXXXXXXLDKVNPRYIPLLRATFTFPQSLGILGGRPGASTYIVGVQDEK 719
            RG VDW+PI         L+KVNPRYIP LRATFTFPQSLGI+GG+PGASTYI+GVQDEK
Sbjct: 302  RGQVDWTPILLLVPLVLGLEKVNPRYIPSLRATFTFPQSLGIMGGKPGASTYIIGVQDEK 361

Query: 718  AIYLDPHEVQPAVDIRRDNLEADTSSYHCDLVRHIPLDSLDPSLAIGFYCRDKDDFDNFC 539
            A+YLDPHEVQP ++IRRD+LEADT SYHC+++RHIPLD +DPSLAIGFYCRD+DDF++FC
Sbjct: 362  ALYLDPHEVQPVINIRRDDLEADTLSYHCNVIRHIPLDLIDPSLAIGFYCRDRDDFNDFC 421

Query: 538  FLASKLADESNGAPLFTVTHTRTSSKPAANRDVLGDSGGVQEDGSFDMVPTSDAEGCTQE 359
            F ASKLADESNGAPLFTVT T +  +P  + D LGDSG V+ D SF ++P SDA+G  QE
Sbjct: 422  FRASKLADESNGAPLFTVTQTHSVPRPVNHSDALGDSGAVENDDSFSVLPMSDADGSAQE 481

Query: 358  DDWQLI 341
            D+WQL+
Sbjct: 482  DEWQLL 487


>ref|XP_008375115.1| PREDICTED: cysteine protease ATG4 [Malus domestica]
          Length = 492

 Score =  635 bits (1639), Expect = e-179
 Identities = 316/481 (65%), Positives = 363/481 (75%), Gaps = 32/481 (6%)
 Frame = -1

Query: 1687 YSCMSSTDSPNSNPGATCSQPGSSDSCQKKA-------------FRT------------- 1586
            YS  SSTDS +    + CS  GS DS   KA             F T             
Sbjct: 13   YSSKSSTDSTDRGSSSACSDSGSRDSKHNKASLWSNFFXSAFSIFETHSESSITXKKESH 72

Query: 1585 -RSNGWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWLLGVCYKISQEDLSGDPA 1409
             R+NGWT AV++ +  G MRR+ E VLG ++ GISS  SDIWLLGVCYK+SQ+D SGD  
Sbjct: 73   SRNNGWTXAVRKAVXSGSMRRIXEXVLGSSRXGISS-ASDIWLLGVCYKVSQDDSSGDAP 131

Query: 1408 YSNGLAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLRSSQMLVAQ-----ALLF 1244
             +NGL  F +DFSSRILMTYRKGF+AI +SKYTSD NWGCMLRSSQMLV Q     ALLF
Sbjct: 132  INNGLGAFEQDFSSRILMTYRKGFEAIGBSKYTSDVNWGCMLRSSQMLVXQXIFLQALLF 191

Query: 1243 HRIGRSWRKPSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDLAAGLWVGPYAMC 1064
            HR+GRSW +P HKPLD+ Y  IL  FGDS  STFSIHNLLQAG  YDLAAG WVGPYAMC
Sbjct: 192  HRLGRSWXRPLHKPLDEAYIXILYHFGDSETSTFSIHNLLQAGXAYDLAAGSWVGPYAMC 251

Query: 1063 RTWETLVCCKRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFSRGLVD 884
            RTWETLV C+R   +L+ Q LPMAVY+VSGDEDGERGGAPVVCIEDASRHC EFSRG VD
Sbjct: 252  RTWETLVRCRREATDLDDQPLPMAVYIVSGDEDGERGGAPVVCIEDASRHCLEFSRGQVD 311

Query: 883  WSPIXXXXXXXXXLDKVNPRYIPLLRATFTFPQSLGILGGRPGASTYIVGVQDEKAIYLD 704
            W+PI         L+KVNPRYIP LRATFTFPQSLGI+GG+PG STYI+GVQDEKA+YLD
Sbjct: 312  WTPILLLVPLVLGLEKVNPRYIPSLRATFTFPQSLGIMGGKPGVSTYIIGVQDEKALYLD 371

Query: 703  PHEVQPAVDIRRDNLEADTSSYHCDLVRHIPLDSLDPSLAIGFYCRDKDDFDNFCFLASK 524
            PHEVQP ++IRRD++EADT SYHC+++RHIPLD +DPSLAIGFYCRD+DDF++FCF ASK
Sbjct: 372  PHEVQPVINIRRDDMEADTLSYHCNVIRHIPLDLIDPSLAIGFYCRDRDDFNDFCFRASK 431

Query: 523  LADESNGAPLFTVTHTRTSSKPAANRDVLGDSGGVQEDGSFDMVPTSDAEGCTQEDDWQL 344
            LADESNGAPLFTVT T +  +P  + D LGDSG V+ D SF ++P SDA+G  QEDDWQL
Sbjct: 432  LADESNGAPLFTVTQTHSVPRPVNHSDALGDSGAVENDDSFSVLPMSDADGSAQEDDWQL 491

Query: 343  I 341
            +
Sbjct: 492  L 492


>ref|XP_010093156.1| hypothetical protein L484_005165 [Morus notabilis]
            gi|587863878|gb|EXB53615.1| hypothetical protein
            L484_005165 [Morus notabilis]
          Length = 444

 Score =  621 bits (1602), Expect = e-175
 Identities = 299/423 (70%), Positives = 346/423 (81%), Gaps = 1/423 (0%)
 Frame = -1

Query: 1606 QKKAFRTRSNGWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWLLGVCYKISQED 1427
            +KKA R+R NGWTAAV++ ++ G MRR HER+LG  +TG+SS TSDIWLLGVCYKISQ++
Sbjct: 24   EKKAIRSRFNGWTAAVRKAVSVGSMRRFHERILGYARTGVSSSTSDIWLLGVCYKISQDE 83

Query: 1426 LSGD-PAYSNGLAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLRSSQMLVAQAL 1250
             S D PA ++GLA F +DFSSRILMTYRKGF AI DSKYTSD NWGCMLRSSQMLVAQAL
Sbjct: 84   PSVDLPAANSGLADFEQDFSSRILMTYRKGFGAIGDSKYTSDVNWGCMLRSSQMLVAQAL 143

Query: 1249 LFHRIGRSWRKPSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDLAAGLWVGPYA 1070
            LFHR+GR WR+P   PLDQ+Y +IL  F DS  S FSIHNLLQAGK YDL AG W+GPYA
Sbjct: 144  LFHRLGRCWRRPVQSPLDQEYIDILNHFDDSEESAFSIHNLLQAGKAYDLTAGSWMGPYA 203

Query: 1069 MCRTWETLVCCKRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFSRGL 890
            MCRTWETLV  KR + + E   LPMAVY+VSGDEDGERGGAPVVC+EDA RHC EFSRG 
Sbjct: 204  MCRTWETLVRSKREENDFENHPLPMAVYIVSGDEDGERGGAPVVCVEDAFRHCLEFSRGQ 263

Query: 889  VDWSPIXXXXXXXXXLDKVNPRYIPLLRATFTFPQSLGILGGRPGASTYIVGVQDEKAIY 710
             +W+P+         LD VNPRYIP LR TFTFPQSLGI+GGRPGASTYIVGVQDEKA Y
Sbjct: 264  ANWTPMLLLVPLVLGLDTVNPRYIPSLRETFTFPQSLGIMGGRPGASTYIVGVQDEKAFY 323

Query: 709  LDPHEVQPAVDIRRDNLEADTSSYHCDLVRHIPLDSLDPSLAIGFYCRDKDDFDNFCFLA 530
            LDPHEVQP +DI R+++EADTSSYH +++RHI LDS+DPSLAIGFYCRDK+DFD+FCF A
Sbjct: 324  LDPHEVQPVIDISRNSVEADTSSYHSNVIRHIGLDSIDPSLAIGFYCRDKNDFDDFCFRA 383

Query: 529  SKLADESNGAPLFTVTHTRTSSKPAANRDVLGDSGGVQEDGSFDMVPTSDAEGCTQEDDW 350
            SKLADESNGAPLFTVT T+   KP  + DVLGDS G+ +  SFD +P+++ E C+ EDDW
Sbjct: 384  SKLADESNGAPLFTVTRTKNLPKPVGHADVLGDSSGISD--SFDALPSNNTEDCSHEDDW 441

Query: 349  QLI 341
            QL+
Sbjct: 442  QLL 444


>ref|XP_003635099.1| PREDICTED: cysteine protease ATG4 isoform X2 [Vitis vinifera]
            gi|296086874|emb|CBI33041.3| unnamed protein product
            [Vitis vinifera]
          Length = 486

 Score =  619 bits (1597), Expect = e-174
 Identities = 314/474 (66%), Positives = 357/474 (75%), Gaps = 25/474 (5%)
 Frame = -1

Query: 1687 YSCMSSTDSPNSNPGA-----------------------TCSQPGSSDSCQKKAFRTRSN 1577
            +SC + +DS NS P +                       T S+   S S +K     R+N
Sbjct: 13   FSCKTKSDSSNSEPQSSDTKLSKVSLWSSVFASAFSVFETNSESSPSASEKKAIDNGRNN 72

Query: 1576 GWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWLLGVCYKISQEDLSGDPAYSNG 1397
            GWT AV++V+    MRR+ ERVLG +KTGISS TSDIWLLG+CYKISQE+ S   + SNG
Sbjct: 73   GWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEESSNHASSSNG 132

Query: 1396 LAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLRSSQMLVAQALLFHRIGRSWRK 1217
            LA F +DFSSRILMTYRKGF+AI DSK TSD NWGCMLRSSQMLVAQALL HR+GRSWRK
Sbjct: 133  LAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGRSWRK 192

Query: 1216 PSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDLAAGLWVGPYAMCRTWETLVCC 1037
             SHKP+DQDY EIL  FGDS AS FSIHN+LQAGK Y LAAG WVGPYAMCR+WETL   
Sbjct: 193  TSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWETLARS 252

Query: 1036 KRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFSRGLVDWSPIXXXXX 857
            KR + +LE QSLPMA+Y+VSGDEDGERGGAPVV IE+ASRHC EFS+G VDW+PI     
Sbjct: 253  KREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQVDWTPILLLVP 312

Query: 856  XXXXLDKVNPRYIPLLRATFTFPQSLGILGGRPGASTYIVGVQDEKAIYLDPHEVQPAVD 677
                L+KVNPRYIP L ATFTFPQSLGILGG+PGASTYIVGVQDEKA YLDPHE Q  VD
Sbjct: 313  LVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYLDPHEAQSVVD 372

Query: 676  IRRDNLEADTSSYHCDLVRHIPLDSLDPSLAIGFYCRDKDDFDNFCFLASKLADESNGAP 497
            IRR+NLEADTSSYHC+++RHI LDS+DPSLAIGFYCRDKDDFD+FC  ASKLAD+SNGAP
Sbjct: 373  IRRENLEADTSSYHCNIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADKSNGAP 432

Query: 496  LFTVTHTRTSSKPAANRDVLGDSGGVQEDGSFDMVPTSDAEGC--TQEDDWQLI 341
            LFTV H  +  KP +  D + D  G +ED SFD+V    AEG     EDDWQL+
Sbjct: 433  LFTVAHIHSLPKPISCSDGMDDCSGFREDDSFDVVSNKGAEGYEHEHEDDWQLL 486


>gb|KHG08139.1| Cysteine protease ATG4 [Gossypium arboreum]
          Length = 530

 Score =  618 bits (1593), Expect = e-174
 Identities = 316/493 (64%), Positives = 366/493 (74%), Gaps = 30/493 (6%)
 Frame = -1

Query: 1729 LL*TDVIPCCVNPTYSCMSSTDSPNSNPGA-----------------------TCSQPGS 1619
            LL  D+I CC++      S T    S PG                        T S+  S
Sbjct: 44   LLRLDIIVCCMSS-----SPTPGTGSEPGPNDSKFSKTSLWSNFFASAFSVFDTYSESSS 98

Query: 1618 SDSCQKKAFRTRSNGWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWLLGVCYKI 1439
            S +C+KK+  +++NGWTAAVKRV++ G MRR+HERVLG +K GISS TSDIWLLG+CYKI
Sbjct: 99   SSACEKKSSFSKTNGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIWLLGLCYKI 158

Query: 1438 SQEDLSGDPAYSNGLAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLRSSQMLVA 1259
            SQE  SGD   ++ LA F +DFSSRILMTYRKGFDAI ++K TSD +WGCMLRSSQMLVA
Sbjct: 159  SQES-SGDVDATSALAAFKQDFSSRILMTYRKGFDAIGETKITSDASWGCMLRSSQMLVA 217

Query: 1258 QALLFHRIGRSWRKPSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDLAAGLWVG 1079
            QALLFHR+GRSWRKPS KP D  Y EIL  FGDS AS FSIHNL++AGK Y LAAG WVG
Sbjct: 218  QALLFHRLGRSWRKPSQKPFDLAYIEILHQFGDSEASAFSIHNLVEAGKNYGLAAGSWVG 277

Query: 1078 PYAMCRTWETLVCCKRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFS 899
            PYAMCR+WE+L   KR + +LE Q LPMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFS
Sbjct: 278  PYAMCRSWESLARSKREENDLECQLLPMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFS 337

Query: 898  RGLVDWSPIXXXXXXXXXLDKVNPRYIPLLRATFTFPQSLGILGGRPGASTYIVGVQDEK 719
            R   DW+PI         LDKVNPRYIP L+ATFTFPQ LGILGG+PGASTYIVGVQ+E 
Sbjct: 338  RHQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFPQCLGILGGKPGASTYIVGVQEEN 397

Query: 718  AIYLDPHEVQPAVDIRRDNLEADTSSYHCDLVRHIPLDSLDPSLAIGFYCRDK------- 560
              YLDPH+VQP V++  DNLEADTSSYHCD++R+IPLDSLDPSLAIGF+CRDK       
Sbjct: 398  VFYLDPHDVQPVVNLSSDNLEADTSSYHCDIIRYIPLDSLDPSLAIGFFCRDKGFPVNLV 457

Query: 559  DDFDNFCFLASKLADESNGAPLFTVTHTRTSSKPAANRDVLGDSGGVQEDGSFDMVPTSD 380
            DDFD+FCF ASKLADESNGAPLFTV  T +  KP  + D + D+GG + D S  ++PT D
Sbjct: 458  DDFDDFCFRASKLADESNGAPLFTVAQTHSVFKPINHGDTMADAGGDRMDDSIGVLPTGD 517

Query: 379  AEGCTQEDDWQLI 341
             +G + EDDWQL+
Sbjct: 518  VDGNSHEDDWQLL 530


>emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
          Length = 489

 Score =  615 bits (1587), Expect = e-173
 Identities = 315/477 (66%), Positives = 357/477 (74%), Gaps = 28/477 (5%)
 Frame = -1

Query: 1687 YSCMSSTDSPNSNPGA-----------------------TCSQPGSSDSCQKKAFRTRSN 1577
            +SC + +DS NS P +                       T S+   S S +K     R+N
Sbjct: 13   FSCKTKSDSSNSEPQSSDTKLSKVSLWSSVFASAFSVFETNSESSPSASEKKAIDNGRNN 72

Query: 1576 GWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWLLGVCYKISQEDLSGDPAYSNG 1397
            GWT AV++V+    MRR+ ERVLG +KTGISS TSDIWLLG+CYKISQE+ S   + SNG
Sbjct: 73   GWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEESSNHASSSNG 132

Query: 1396 LAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLRSSQMLVAQALLFHRIGRSWRK 1217
            LA F +DFSSRILMTYRKGF+AI DSK TSD NWGCMLRSSQMLVAQALL HR+GRSWRK
Sbjct: 133  LAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGRSWRK 192

Query: 1216 PSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDLAAGLWVGPYAMCRTWETLVCC 1037
             SHKP+DQDY EIL  FGDS AS FSIHN+LQAGK Y LAAG WVGPYAMCR+WETL   
Sbjct: 193  TSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWETLARS 252

Query: 1036 KRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFSRGLVDWSPIXXXXX 857
            KR + +LE QSLPMA+Y+VSGDEDGERGGAPVV IE+ASRHC EFS+G VDW+PI     
Sbjct: 253  KREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQVDWTPILLLVP 312

Query: 856  XXXXLDKVNPRYIPLLRATFTFPQSLGILGGRPGASTYIVGVQDEKAIYLDPHEVQPAVD 677
                L+KVNPRYIP L ATFTFPQSLGILGG+PGASTYIVGVQDEKA YLDPHE Q  VD
Sbjct: 313  LVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYLDPHEAQSVVD 372

Query: 676  IRRDNLEADTSSYHCD---LVRHIPLDSLDPSLAIGFYCRDKDDFDNFCFLASKLADESN 506
            IRR+NLEADTSSYHC+   ++RHI LDS+DPSLAIGFYCRDKDDFD+FC  ASKLADESN
Sbjct: 373  IRRENLEADTSSYHCNCSSIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADESN 432

Query: 505  GAPLFTVTHTRTSSKPAANRDVLGDSGGVQEDGSFDMVPTSDAEGC--TQEDDWQLI 341
            GAPLFTV H  +  KP +  D + D  G +ED SFD+V    AEG     EDDWQL+
Sbjct: 433  GAPLFTVAHIHSLPKPISCSDGMDDCSGFREDDSFDVVSNKGAEGYEHEHEDDWQLL 489


>ref|XP_010646415.1| PREDICTED: cysteine protease ATG4 isoform X1 [Vitis vinifera]
          Length = 489

 Score =  614 bits (1583), Expect = e-172
 Identities = 314/477 (65%), Positives = 357/477 (74%), Gaps = 28/477 (5%)
 Frame = -1

Query: 1687 YSCMSSTDSPNSNPGA-----------------------TCSQPGSSDSCQKKAFRTRSN 1577
            +SC + +DS NS P +                       T S+   S S +K     R+N
Sbjct: 13   FSCKTKSDSSNSEPQSSDTKLSKVSLWSSVFASAFSVFETNSESSPSASEKKAIDNGRNN 72

Query: 1576 GWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWLLGVCYKISQEDLSGDPAYSNG 1397
            GWT AV++V+    MRR+ ERVLG +KTGISS TSDIWLLG+CYKISQE+ S   + SNG
Sbjct: 73   GWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEESSNHASSSNG 132

Query: 1396 LAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLRSSQMLVAQALLFHRIGRSWRK 1217
            LA F +DFSSRILMTYRKGF+AI DSK TSD NWGCMLRSSQMLVAQALL HR+GRSWRK
Sbjct: 133  LAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGRSWRK 192

Query: 1216 PSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDLAAGLWVGPYAMCRTWETLVCC 1037
             SHKP+DQDY EIL  FGDS AS FSIHN+LQAGK Y LAAG WVGPYAMCR+WETL   
Sbjct: 193  TSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWETLARS 252

Query: 1036 KRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFSRGLVDWSPIXXXXX 857
            KR + +LE QSLPMA+Y+VSGDEDGERGGAPVV IE+ASRHC EFS+G VDW+PI     
Sbjct: 253  KREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQVDWTPILLLVP 312

Query: 856  XXXXLDKVNPRYIPLLRATFTFPQSLGILGGRPGASTYIVGVQDEKAIYLDPHEVQPAVD 677
                L+KVNPRYIP L ATFTFPQSLGILGG+PGASTYIVGVQDEKA YLDPHE Q  VD
Sbjct: 313  LVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYLDPHEAQSVVD 372

Query: 676  IRRDNLEADTSSYHCD---LVRHIPLDSLDPSLAIGFYCRDKDDFDNFCFLASKLADESN 506
            IRR+NLEADTSSYHC+   ++RHI LDS+DPSLAIGFYCRDKDDFD+FC  ASKLAD+SN
Sbjct: 373  IRRENLEADTSSYHCNCSSIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADKSN 432

Query: 505  GAPLFTVTHTRTSSKPAANRDVLGDSGGVQEDGSFDMVPTSDAEGC--TQEDDWQLI 341
            GAPLFTV H  +  KP +  D + D  G +ED SFD+V    AEG     EDDWQL+
Sbjct: 433  GAPLFTVAHIHSLPKPISCSDGMDDCSGFREDDSFDVVSNKGAEGYEHEHEDDWQLL 489


>ref|XP_012435801.1| PREDICTED: cysteine protease ATG4 isoform X2 [Gossypium raimondii]
            gi|763779844|gb|KJB46915.1| hypothetical protein
            B456_008G001300 [Gossypium raimondii]
            gi|763779845|gb|KJB46916.1| hypothetical protein
            B456_008G001300 [Gossypium raimondii]
            gi|763779850|gb|KJB46921.1| hypothetical protein
            B456_008G001300 [Gossypium raimondii]
          Length = 488

 Score =  612 bits (1579), Expect = e-172
 Identities = 298/433 (68%), Positives = 351/433 (81%)
 Frame = -1

Query: 1639 TCSQPGSSDSCQKKAFRTRSNGWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWL 1460
            T S+  SS +C++K+  +++NGWTAAVKRV++ G MRR+HERVLG +K GISS TSDIWL
Sbjct: 57   TYSESSSSSACERKSSFSKTNGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIWL 116

Query: 1459 LGVCYKISQEDLSGDPAYSNGLAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLR 1280
            LG+CYKISQE  SGD   ++ LA F +DFSSRILMTYRKGFDAI ++K TSD +WGCMLR
Sbjct: 117  LGLCYKISQES-SGDVDATSALAAFKQDFSSRILMTYRKGFDAIGETKITSDASWGCMLR 175

Query: 1279 SSQMLVAQALLFHRIGRSWRKPSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDL 1100
            SSQMLVAQALLFHR+GRSWRKPS KP D  Y EIL  FGDS AS FSIHNL++AGK Y L
Sbjct: 176  SSQMLVAQALLFHRLGRSWRKPSQKPFDLAYIEILHQFGDSEASAFSIHNLVEAGKNYGL 235

Query: 1099 AAGLWVGPYAMCRTWETLVCCKRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDAS 920
            AAG WVGPYAMCR+WE+L   KR + +LE Q LPMAVYVVSGDEDGERGGAPVVCIEDAS
Sbjct: 236  AAGSWVGPYAMCRSWESLARSKREEIDLECQLLPMAVYVVSGDEDGERGGAPVVCIEDAS 295

Query: 919  RHCFEFSRGLVDWSPIXXXXXXXXXLDKVNPRYIPLLRATFTFPQSLGILGGRPGASTYI 740
            RHCFEFSR   DW+PI         LDKVNPRYIP L+ATFTFPQ LGILGG+PGASTYI
Sbjct: 296  RHCFEFSRHQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFPQCLGILGGKPGASTYI 355

Query: 739  VGVQDEKAIYLDPHEVQPAVDIRRDNLEADTSSYHCDLVRHIPLDSLDPSLAIGFYCRDK 560
            VG+Q+E   YLDPH+VQP V++  +NLEADTSSYHC+++R+IPL+SLDPSLAIGF+CRDK
Sbjct: 356  VGIQEENVFYLDPHDVQPVVNLSTENLEADTSSYHCNIIRYIPLESLDPSLAIGFFCRDK 415

Query: 559  DDFDNFCFLASKLADESNGAPLFTVTHTRTSSKPAANRDVLGDSGGVQEDGSFDMVPTSD 380
            DDFD+FCF ASKLADESNGAPLFTV  T +  KP  + D + ++GG + D S  ++PT D
Sbjct: 416  DDFDDFCFRASKLADESNGAPLFTVAQTHSVFKPINHGDTMANAGGDRMDDSVRVLPTGD 475

Query: 379  AEGCTQEDDWQLI 341
             +G + EDDWQ +
Sbjct: 476  VDGNSHEDDWQFL 488


>ref|XP_007049917.1| Peptidase family C54 protein isoform 3 [Theobroma cacao]
            gi|508702178|gb|EOX94074.1| Peptidase family C54 protein
            isoform 3 [Theobroma cacao]
          Length = 486

 Score =  610 bits (1574), Expect = e-171
 Identities = 313/473 (66%), Positives = 358/473 (75%), Gaps = 28/473 (5%)
 Frame = -1

Query: 1675 SSTDSPNSNPGATCSQPGSSD---------------------------SCQKKAFRTRSN 1577
            SS DS +S+P +  S+PG SD                           +C+KKA   R+N
Sbjct: 17   SSIDSSHSSPSSG-SEPGPSDCKFSKSSVWSNLFASAFSIFDTYSESSACEKKALHARNN 75

Query: 1576 GWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWLLGVCYKISQEDLSGDPAYSNG 1397
            GWTAAVKRV++ G MRR+HERVLG +K GISS TSDIWLLGVCYKISQ   SGD   SNG
Sbjct: 76   GWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIWLLGVCYKISQVSSSGDVDASNG 135

Query: 1396 LAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLRSSQMLVAQ-ALLFHRIGRSWR 1220
            LA F  DFSSRILMTYRKGFDAI D+K TSD  WGCMLRSSQMLVAQ ALLFH++GRSWR
Sbjct: 136  LAAFKRDFSSRILMTYRKGFDAIGDTKITSDFGWGCMLRSSQMLVAQQALLFHQLGRSWR 195

Query: 1219 KPSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDLAAGLWVGPYAMCRTWETLVC 1040
            KP  KP +Q Y EIL  FGDS A+ FSIHNL++AGK Y LAAG WVGPYAMCR+WE+L  
Sbjct: 196  KPLQKPFEQAYIEILHQFGDSEATAFSIHNLVEAGKIYGLAAGSWVGPYAMCRSWESLAR 255

Query: 1039 CKRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFSRGLVDWSPIXXXX 860
             KR + +LE QSLPMAVYVVSGDEDGERGGAPVVC+EDASRHCFEFSR   DW+PI    
Sbjct: 256  FKREENDLEHQSLPMAVYVVSGDEDGERGGAPVVCVEDASRHCFEFSRCRADWTPILLLV 315

Query: 859  XXXXXLDKVNPRYIPLLRATFTFPQSLGILGGRPGASTYIVGVQDEKAIYLDPHEVQPAV 680
                 LDKVN RYIP L+ATFTFPQ LGILGG+PGASTYIVGVQ+E   YLDPH+VQ  V
Sbjct: 316  PLVLGLDKVNSRYIPSLQATFTFPQCLGILGGKPGASTYIVGVQEENVFYLDPHDVQLVV 375

Query: 679  DIRRDNLEADTSSYHCDLVRHIPLDSLDPSLAIGFYCRDKDDFDNFCFLASKLADESNGA 500
            ++ +DN EADTSSYHCD++RHIPLDS+DPSLAIGF+CRDKDDFD+FC  ASKLADESNGA
Sbjct: 376  NLSQDNQEADTSSYHCDIIRHIPLDSIDPSLAIGFFCRDKDDFDDFCLRASKLADESNGA 435

Query: 499  PLFTVTHTRTSSKPAANRDVLGDSGGVQEDGSFDMVPTSDAEGCTQEDDWQLI 341
            PLFTV  T +S KP ++ + L D+G V+ED S  +VP  D +G   EDDWQL+
Sbjct: 436  PLFTVAQTHSSFKPISHGNALDDTGEVREDDSLGVVP--DMDGSIHEDDWQLL 486


>gb|KDO40316.1| hypothetical protein CISIN_1g011418mg [Citrus sinensis]
            gi|641820321|gb|KDO40317.1| hypothetical protein
            CISIN_1g011418mg [Citrus sinensis]
          Length = 486

 Score =  610 bits (1573), Expect = e-171
 Identities = 296/426 (69%), Positives = 345/426 (80%)
 Frame = -1

Query: 1618 SDSCQKKAFRTRSNGWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWLLGVCYKI 1439
            S + +KKA   +SNGWTAAVKR++  G MRR+HERVLG ++TGISS TSDIWLLGVC+KI
Sbjct: 63   SSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI 122

Query: 1438 SQEDLSGDPAYSNGLAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLRSSQMLVA 1259
            +Q++  GD A +NGLA F +DFSSRIL++YRKGFD I DSK TSD  WGCMLRSSQMLVA
Sbjct: 123  AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182

Query: 1258 QALLFHRIGRSWRKPSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDLAAGLWVG 1079
            QALLFHR+GR WRKP  KP D++Y EIL  FGDS  S FSIHNLLQAGK Y LAAG WVG
Sbjct: 183  QALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVG 242

Query: 1078 PYAMCRTWETLVCCKRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFS 899
            PYAMCR+WE L  C+R +  L  QSLPMA+YVVSGDEDGERGGAPVVCI+DASRHC  FS
Sbjct: 243  PYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFS 302

Query: 898  RGLVDWSPIXXXXXXXXXLDKVNPRYIPLLRATFTFPQSLGILGGRPGASTYIVGVQDEK 719
            +G  DW+PI         L+KVNPRYIP LR TFTFPQSLGI+GG+PGASTYIVGVQ+E 
Sbjct: 303  KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEES 362

Query: 718  AIYLDPHEVQPAVDIRRDNLEADTSSYHCDLVRHIPLDSLDPSLAIGFYCRDKDDFDNFC 539
            AIYLDPH+VQP ++I +D+LEADTS+YH D++RHI LDS+DPSLAIGFYCRDKDDFD+FC
Sbjct: 363  AIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422

Query: 538  FLASKLADESNGAPLFTVTHTRTSSKPAANRDVLGDSGGVQEDGSFDMVPTSDAEGCTQE 359
              ASKLA+ESNGAPLFTV  T+T  KP  + DVLG++GGV ED S  ++  +DA G   E
Sbjct: 423  ARASKLAEESNGAPLFTV--TQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHE 480

Query: 358  DDWQLI 341
            DDWQL+
Sbjct: 481  DDWQLL 486


>ref|XP_006441973.1| hypothetical protein CICLE_v10019906mg [Citrus clementina]
            gi|557544235|gb|ESR55213.1| hypothetical protein
            CICLE_v10019906mg [Citrus clementina]
          Length = 486

 Score =  610 bits (1572), Expect = e-171
 Identities = 304/472 (64%), Positives = 358/472 (75%), Gaps = 27/472 (5%)
 Frame = -1

Query: 1675 SSTDSPNSNPGATCSQPGSSDS---------------------------CQKKAFRTRSN 1577
            S+ D+PN +  +  S+PGSS+S                            +KKA   +SN
Sbjct: 17   STPDTPNRSLASVGSEPGSSESKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSN 76

Query: 1576 GWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWLLGVCYKISQEDLSGDPAYSNG 1397
            GWTAAVKR++  G MRR+HERVLG ++TGISS TSDIWLLGVC+KI+Q++  GD A +NG
Sbjct: 77   GWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNG 136

Query: 1396 LAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLRSSQMLVAQALLFHRIGRSWRK 1217
            LA F +DFSSRIL++YRKGFD I DSK TSD  WGCMLRSSQMLVAQALLFHR+GR WRK
Sbjct: 137  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196

Query: 1216 PSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDLAAGLWVGPYAMCRTWETLVCC 1037
            P  KP D++Y EIL  FGDS  S FSIHNLLQAGK Y LAAG WVGPYAMCR+WE L  C
Sbjct: 197  PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256

Query: 1036 KRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFSRGLVDWSPIXXXXX 857
            +R +  L  QSLPMA+YVVSGDEDGERGGAPVVCI+DASRHC  FS+G  DW+PI     
Sbjct: 257  QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 316

Query: 856  XXXXLDKVNPRYIPLLRATFTFPQSLGILGGRPGASTYIVGVQDEKAIYLDPHEVQPAVD 677
                L+KVNPRYIP LR TFTFPQSLGI+GG+PGASTYIVGVQ+E AIYLDPH+VQ  ++
Sbjct: 317  LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQLVIN 376

Query: 676  IRRDNLEADTSSYHCDLVRHIPLDSLDPSLAIGFYCRDKDDFDNFCFLASKLADESNGAP 497
            I +D+LEADTS+YH D++RHI LDS+DPSLAIGFYCRDKDDFD+FC  ASKLA+ESNGAP
Sbjct: 377  IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 436

Query: 496  LFTVTHTRTSSKPAANRDVLGDSGGVQEDGSFDMVPTSDAEGCTQEDDWQLI 341
            LFTV  T+T  KP  + DVLG++GGV ED S  ++  +DA G   EDDWQL+
Sbjct: 437  LFTV--TQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486


>ref|XP_006478507.1| PREDICTED: cysteine protease ATG4-like isoform X1 [Citrus sinensis]
          Length = 486

 Score =  609 bits (1571), Expect = e-171
 Identities = 296/426 (69%), Positives = 345/426 (80%)
 Frame = -1

Query: 1618 SDSCQKKAFRTRSNGWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWLLGVCYKI 1439
            S + +KKA   +SNGWTAAVKR++  G MRR+HERVLG ++TGISS TSDIWLLGVC+KI
Sbjct: 63   SSANEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI 122

Query: 1438 SQEDLSGDPAYSNGLAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLRSSQMLVA 1259
            +Q++  GD A +NGLA F +DFSSRIL++YRKGFD I DSK TSD  WGCMLRSSQMLVA
Sbjct: 123  AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182

Query: 1258 QALLFHRIGRSWRKPSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDLAAGLWVG 1079
            QALLFHR+GR WRKP  KP D++Y EIL  FGDS  S FSIHNLLQAGK Y LAAG WVG
Sbjct: 183  QALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVG 242

Query: 1078 PYAMCRTWETLVCCKRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFS 899
            PYAMCR+WE L  C+R +  L  QSLPMA+YVVSGDEDGERGGAPVVCI+DASRHC  FS
Sbjct: 243  PYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFS 302

Query: 898  RGLVDWSPIXXXXXXXXXLDKVNPRYIPLLRATFTFPQSLGILGGRPGASTYIVGVQDEK 719
            +G  DW+PI         L+KVNPRYIP LR TFTFPQSLGI+GG+PGASTYIVGVQ+E 
Sbjct: 303  KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEES 362

Query: 718  AIYLDPHEVQPAVDIRRDNLEADTSSYHCDLVRHIPLDSLDPSLAIGFYCRDKDDFDNFC 539
            AIYLDPH+VQP ++I +D+LEADTS+YH D++RHI LDS+DPSLAIGFYCRDKDDFD+FC
Sbjct: 363  AIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422

Query: 538  FLASKLADESNGAPLFTVTHTRTSSKPAANRDVLGDSGGVQEDGSFDMVPTSDAEGCTQE 359
              ASKLA+ESNGAPLFTV  T+T  KP  + DVLG++GGV ED S  ++  +DA G   E
Sbjct: 423  ARASKLAEESNGAPLFTV--TQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHE 480

Query: 358  DDWQLI 341
            DDWQL+
Sbjct: 481  DDWQLL 486


>ref|XP_012081889.1| PREDICTED: cysteine protease ATG4-like isoform X1 [Jatropha curcas]
            gi|802675786|ref|XP_012081890.1| PREDICTED: cysteine
            protease ATG4-like isoform X1 [Jatropha curcas]
            gi|802675813|ref|XP_012081891.1| PREDICTED: cysteine
            protease ATG4-like isoform X1 [Jatropha curcas]
            gi|643718243|gb|KDP29532.1| hypothetical protein
            JCGZ_19245 [Jatropha curcas]
          Length = 492

 Score =  607 bits (1566), Expect = e-170
 Identities = 295/423 (69%), Positives = 337/423 (79%), Gaps = 1/423 (0%)
 Frame = -1

Query: 1606 QKKAFRTRSNGWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWLLGVCYKISQED 1427
            +KK   TR+NGWT+AVK+++  G MRR+HERVLG ++TGIS+ TS+IWLLGVCYKISQ+ 
Sbjct: 70   EKKGSHTRNNGWTSAVKKIVAGGSMRRIHERVLGPSRTGISNTTSEIWLLGVCYKISQDG 129

Query: 1426 LSGDPAYSN-GLAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLRSSQMLVAQAL 1250
             + D A SN GLA F  DFSSRILMTYRKGFDAI DSK+TSD  WGCMLRSSQMLVAQAL
Sbjct: 130  SNADAATSNNGLADFTHDFSSRILMTYRKGFDAIGDSKFTSDVGWGCMLRSSQMLVAQAL 189

Query: 1249 LFHRIGRSWRKPSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDLAAGLWVGPYA 1070
            LFH++GRSWRKP  KPLDQ Y EIL  FGDS AS FSIHNL+ AGK Y LAAG WVGPYA
Sbjct: 190  LFHQLGRSWRKPIQKPLDQKYVEILHLFGDSEASPFSIHNLIHAGKAYGLAAGSWVGPYA 249

Query: 1069 MCRTWETLVCCKRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFSRGL 890
            MCR+WE L  CKR +  LE ++LPMAVYVVSGDEDGERGGAPVVCIEDASRHC +FSRG 
Sbjct: 250  MCRSWELLARCKREENNLEHEALPMAVYVVSGDEDGERGGAPVVCIEDASRHCLDFSRGQ 309

Query: 889  VDWSPIXXXXXXXXXLDKVNPRYIPLLRATFTFPQSLGILGGRPGASTYIVGVQDEKAIY 710
             +W+PI         L+KVN RYIP L+AT TFPQSLGI+GG+PGASTYIVGVQD+ A Y
Sbjct: 310  ANWTPILLLVPLVLGLEKVNLRYIPSLQATLTFPQSLGIMGGKPGASTYIVGVQDDNAFY 369

Query: 709  LDPHEVQPAVDIRRDNLEADTSSYHCDLVRHIPLDSLDPSLAIGFYCRDKDDFDNFCFLA 530
            LDPH VQP V+I RD + ADTSSYH D +RHIPL+S+DPSLAIGFYCRDKDDFD FCFLA
Sbjct: 370  LDPHGVQPVVNISRDGIGADTSSYHSDFIRHIPLESIDPSLAIGFYCRDKDDFDEFCFLA 429

Query: 529  SKLADESNGAPLFTVTHTRTSSKPAANRDVLGDSGGVQEDGSFDMVPTSDAEGCTQEDDW 350
            SKLAD+S+GAPLFTV H+    K     D   D+ GVQED S  ++P +DAE    EDDW
Sbjct: 430  SKLADDSHGAPLFTVAHSHKLPKSVGRTDTSNDNSGVQEDDSLGVMPMNDAEAPVNEDDW 489

Query: 349  QLI 341
            QL+
Sbjct: 490  QLL 492


>ref|XP_012435798.1| PREDICTED: cysteine protease ATG4 isoform X1 [Gossypium raimondii]
            gi|823202390|ref|XP_012435799.1| PREDICTED: cysteine
            protease ATG4 isoform X1 [Gossypium raimondii]
            gi|823202393|ref|XP_012435800.1| PREDICTED: cysteine
            protease ATG4 isoform X1 [Gossypium raimondii]
          Length = 495

 Score =  605 bits (1561), Expect = e-170
 Identities = 298/440 (67%), Positives = 351/440 (79%), Gaps = 7/440 (1%)
 Frame = -1

Query: 1639 TCSQPGSSDSCQKKAFRTRSNGWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWL 1460
            T S+  SS +C++K+  +++NGWTAAVKRV++ G MRR+HERVLG +K GISS TSDIWL
Sbjct: 57   TYSESSSSSACERKSSFSKTNGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIWL 116

Query: 1459 LGVCYKISQEDLSGDPAYSNGLAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLR 1280
            LG+CYKISQE  SGD   ++ LA F +DFSSRILMTYRKGFDAI ++K TSD +WGCMLR
Sbjct: 117  LGLCYKISQES-SGDVDATSALAAFKQDFSSRILMTYRKGFDAIGETKITSDASWGCMLR 175

Query: 1279 SSQMLVAQALLFHRIGRSWRKPSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDL 1100
            SSQMLVAQALLFHR+GRSWRKPS KP D  Y EIL  FGDS AS FSIHNL++AGK Y L
Sbjct: 176  SSQMLVAQALLFHRLGRSWRKPSQKPFDLAYIEILHQFGDSEASAFSIHNLVEAGKNYGL 235

Query: 1099 AAGLWVGPYAMCRTWETLVCCKRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDAS 920
            AAG WVGPYAMCR+WE+L   KR + +LE Q LPMAVYVVSGDEDGERGGAPVVCIEDAS
Sbjct: 236  AAGSWVGPYAMCRSWESLARSKREEIDLECQLLPMAVYVVSGDEDGERGGAPVVCIEDAS 295

Query: 919  RHCFEFSRGLVDWSPIXXXXXXXXXLDKVNPRYIPLLRATFTFPQSLGILGGRPGASTYI 740
            RHCFEFSR   DW+PI         LDKVNPRYIP L+ATFTFPQ LGILGG+PGASTYI
Sbjct: 296  RHCFEFSRHQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFPQCLGILGGKPGASTYI 355

Query: 739  VGVQDEKAIYLDPHEVQPAVDIRRDNLEADTSSYHCDLVRHIPLDSLDPSLAIGFYCRDK 560
            VG+Q+E   YLDPH+VQP V++  +NLEADTSSYHC+++R+IPL+SLDPSLAIGF+CRDK
Sbjct: 356  VGIQEENVFYLDPHDVQPVVNLSTENLEADTSSYHCNIIRYIPLESLDPSLAIGFFCRDK 415

Query: 559  -------DDFDNFCFLASKLADESNGAPLFTVTHTRTSSKPAANRDVLGDSGGVQEDGSF 401
                   DDFD+FCF ASKLADESNGAPLFTV  T +  KP  + D + ++GG + D S 
Sbjct: 416  GFLVNLVDDFDDFCFRASKLADESNGAPLFTVAQTHSVFKPINHGDTMANAGGDRMDDSV 475

Query: 400  DMVPTSDAEGCTQEDDWQLI 341
             ++PT D +G + EDDWQ +
Sbjct: 476  RVLPTGDVDGNSHEDDWQFL 495


>ref|XP_002309707.1| autophagy 4b family protein [Populus trichocarpa]
            gi|222852610|gb|EEE90157.1| autophagy 4b family protein
            [Populus trichocarpa]
          Length = 481

 Score =  597 bits (1540), Expect = e-167
 Identities = 306/474 (64%), Positives = 354/474 (74%), Gaps = 29/474 (6%)
 Frame = -1

Query: 1675 SSTDSPNSNPGATCSQPGSSDSCQKK----------AFRT-------------------R 1583
            ++TD+P S+  +  S+PGS+D+   K          AF                      
Sbjct: 16   TTTDTPKSSFISDSSEPGSTDTKVSKPSLWSSFFASAFSVFDIYRDSSSTSHNEAPHIRH 75

Query: 1582 SNGWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWLLGVCYKISQEDLSGDPAYS 1403
            SNGWT++VK+++  G MRR+ ERVLG +KTGIS+ TSDIWLLG  YKISQ+D SG+   +
Sbjct: 76   SNGWTSSVKKIVAGGTMRRIQERVLGTSKTGISNTTSDIWLLGARYKISQDDSSGNADAT 135

Query: 1402 NGLAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLRSSQMLVAQALLFHRIGRSW 1223
            N LA F  DFSSRIL+TYRKGFD IEDSK TSD NWGCMLRSSQMLVAQALLFHR+GRSW
Sbjct: 136  NALAAFHRDFSSRILITYRKGFDMIEDSKLTSDVNWGCMLRSSQMLVAQALLFHRLGRSW 195

Query: 1222 RKPSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDLAAGLWVGPYAMCRTWETLV 1043
            RKP  KPLD+DY EIL  FGDS AS FSIHNLLQAGK Y LAAG WVGPYAMCR+WE+L 
Sbjct: 196  RKPVDKPLDRDYVEILHLFGDSEASAFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWESLA 255

Query: 1042 CCKRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFSRGLVDWSPIXXX 863
              KR +  LE Q+LPMAVYVVSG EDGERGGAPV+ IEDA+RHC EFS+G  DW+PI   
Sbjct: 256  RSKREETNLEYQTLPMAVYVVSGCEDGERGGAPVLSIEDAARHCSEFSKGREDWTPILLL 315

Query: 862  XXXXXXLDKVNPRYIPLLRATFTFPQSLGILGGRPGASTYIVGVQDEKAIYLDPHEVQPA 683
                  LDK+NPRYIP L+ATFTFPQSLGILGG+PGASTYIVGVQDE A YLDPHEVQP 
Sbjct: 316  VPLVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVGVQDENAFYLDPHEVQPV 375

Query: 682  VDIRRDNLEADTSSYHCDLVRHIPLDSLDPSLAIGFYCRDKDDFDNFCFLASKLADESNG 503
            V+  RD++EA+TSSYHCD+VRHIPLD +DPSLAIGFYCRDKDDFD+FC LASKLADESNG
Sbjct: 376  VNFSRDDVEANTSSYHCDVVRHIPLDLIDPSLAIGFYCRDKDDFDDFCSLASKLADESNG 435

Query: 502  APLFTVTHTRTSSKPAANRDVLGDSGGVQEDGSFDMVPTSDAEGCTQEDDWQLI 341
            APLFTV ++  SSK         DS  V++D    ++  +DAEGC  EDDWQL+
Sbjct: 436  APLFTVANSYKSSK--------HDSSEVRDDDPLGVMTMNDAEGCLNEDDWQLL 481


>ref|XP_006372315.1| autophagy 4b family protein [Populus trichocarpa]
            gi|550318931|gb|ERP50112.1| autophagy 4b family protein
            [Populus trichocarpa]
          Length = 482

 Score =  595 bits (1534), Expect = e-167
 Identities = 293/428 (68%), Positives = 341/428 (79%), Gaps = 1/428 (0%)
 Frame = -1

Query: 1621 SSDSCQKKAFRTR-SNGWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWLLGVCY 1445
            SS + +KKA   R  NGWT+AVK+++  G MRR+ E VLG +KTGIS+ T DIWLLG CY
Sbjct: 63   SSSTSEKKAPHIRHGNGWTSAVKKIVAGGSMRRIQECVLGTSKTGISNTTGDIWLLGACY 122

Query: 1444 KISQEDLSGDPAYSNGLAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLRSSQML 1265
            KISQ++ SGD A +N LA F  DFSSRIL+TYRKGFDAIEDSK TSD +WGCMLRSSQML
Sbjct: 123  KISQDNSSGDAAATNALAAFNHDFSSRILITYRKGFDAIEDSKLTSDVSWGCMLRSSQML 182

Query: 1264 VAQALLFHRIGRSWRKPSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDLAAGLW 1085
            VAQALLFHR+GRSWRKP  KPLD++Y EIL  FGDS +S FSIHNLL+AGK Y LAAG W
Sbjct: 183  VAQALLFHRLGRSWRKPLDKPLDREYVEILHLFGDSESSAFSIHNLLRAGKAYGLAAGSW 242

Query: 1084 VGPYAMCRTWETLVCCKRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDASRHCFE 905
            VGPYA+C +WE+LV  +R +  LE QSL MAVYVVSG EDGERGGAPV+CIE+A+RHC E
Sbjct: 243  VGPYAVCHSWESLVRSRREETNLEYQSLSMAVYVVSGSEDGERGGAPVLCIEEAARHCSE 302

Query: 904  FSRGLVDWSPIXXXXXXXXXLDKVNPRYIPLLRATFTFPQSLGILGGRPGASTYIVGVQD 725
            FS+G  DW+PI         LDK+NPRYIP L+ATFTFPQSLGILGG+PGASTYIVGVQD
Sbjct: 303  FSKGQEDWTPILLLVPLVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVGVQD 362

Query: 724  EKAIYLDPHEVQPAVDIRRDNLEADTSSYHCDLVRHIPLDSLDPSLAIGFYCRDKDDFDN 545
            E A YLDPHEVQP V++ RD++EA+TSSYHC++VRH+PLD +DPSLAIGFYCRDKDDFD+
Sbjct: 363  ENAFYLDPHEVQPVVNVSRDDVEANTSSYHCNVVRHMPLDLIDPSLAIGFYCRDKDDFDD 422

Query: 544  FCFLASKLADESNGAPLFTVTHTRTSSKPAANRDVLGDSGGVQEDGSFDMVPTSDAEGCT 365
            FC LASKL DESNGAPLFTV H+R   K         DSG V+ D S  ++  +D EGC 
Sbjct: 423  FCTLASKLTDESNGAPLFTVAHSRKLLK--------HDSGEVRSDDSLGVMTMNDVEGCV 474

Query: 364  QEDDWQLI 341
             EDDWQL+
Sbjct: 475  HEDDWQLL 482


>ref|XP_007049915.1| Peptidase family C54 protein isoform 1 [Theobroma cacao]
            gi|508702176|gb|EOX94072.1| Peptidase family C54 protein
            isoform 1 [Theobroma cacao]
          Length = 514

 Score =  593 bits (1530), Expect = e-166
 Identities = 312/501 (62%), Positives = 357/501 (71%), Gaps = 56/501 (11%)
 Frame = -1

Query: 1675 SSTDSPNSNPGATCSQPGSSD---------------------------SCQKKAFRTRSN 1577
            SS DS +S+P +  S+PG SD                           +C+KKA   R+N
Sbjct: 17   SSIDSSHSSPSSG-SEPGPSDCKFSKSSVWSNLFASAFSIFDTYSESSACEKKALHARNN 75

Query: 1576 GWTAAVKRVMNCGLMRRLHERVLGLNKTGISSLTSDIWLLGVCYKISQEDLSGDPAYSNG 1397
            GWTAAVKRV++ G MRR+HERVLG +K GISS TSDIWLLGVCYKISQ   SGD   SNG
Sbjct: 76   GWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIWLLGVCYKISQVSSSGDVDASNG 135

Query: 1396 LAPFVEDFSSRILMTYRKGFDAIEDSKYTSDQNWGCMLRSSQMLVAQALLFHRIGRSWRK 1217
            LA F  DFSSRILMTYRKGFDAI D+K TSD  WGCMLRSSQMLVAQALLFH++GRSWRK
Sbjct: 136  LAAFKRDFSSRILMTYRKGFDAIGDTKITSDFGWGCMLRSSQMLVAQALLFHQLGRSWRK 195

Query: 1216 PSHKPLDQDYTEILQFFGDSVASTFSIHNLLQAGKGYDLAAGLWVGPYAMCRTWETLVCC 1037
            P  KP +Q Y EIL  FGDS A+ FSIHNL++AGK Y LAAG WVGPYAMCR+WE+L   
Sbjct: 196  PLQKPFEQAYIEILHQFGDSEATAFSIHNLVEAGKIYGLAAGSWVGPYAMCRSWESLARF 255

Query: 1036 KRGDAELEGQSLPMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFSRGLVDWSPIXXXXX 857
            KR + +LE QSLPMAVYVVSGDEDGERGGAPVVC+EDASRHCFEFSR   DW+PI     
Sbjct: 256  KREENDLEHQSLPMAVYVVSGDEDGERGGAPVVCVEDASRHCFEFSRCRADWTPILLLVP 315

Query: 856  XXXXLDKVNP----------------------RYIPLLRATFTFPQSLGILGGRPGASTY 743
                LDKVN                        YIP L+ATFTFPQ LGILGG+PGASTY
Sbjct: 316  LVLGLDKVNSSFCKEDSTFETEGELHLDFAYLEYIPSLQATFTFPQCLGILGGKPGASTY 375

Query: 742  IVGVQDEKAIYLDPHEVQPAVDIRRDNLEADTSSYHCDLVRHIPLDSLDPSLAIGFYCRD 563
            IVGVQ+E   YLDPH+VQ  V++ +DN EADTSSYHCD++RHIPLDS+DPSLAIGF+CRD
Sbjct: 376  IVGVQEENVFYLDPHDVQLVVNLSQDNQEADTSSYHCDIIRHIPLDSIDPSLAIGFFCRD 435

Query: 562  K-------DDFDNFCFLASKLADESNGAPLFTVTHTRTSSKPAANRDVLGDSGGVQEDGS 404
            K       DDFD+FC  ASKLADESNGAPLFTV  T +S KP ++ + L D+G V+ED S
Sbjct: 436  KGLPVDLVDDFDDFCLRASKLADESNGAPLFTVAQTHSSFKPISHGNALDDTGEVREDDS 495

Query: 403  FDMVPTSDAEGCTQEDDWQLI 341
              +VP  D +G   EDDWQL+
Sbjct: 496  LGVVP--DMDGSIHEDDWQLL 514


Top