BLASTX nr result

ID: Atropa21_contig00001137 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00001137
         (1423 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]                    649   0.0  
emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana ...   646   0.0  
ref|XP_004233221.1| PREDICTED: cathepsin B-like [Solanum lycoper...   632   e-178
ref|XP_006362602.1| PREDICTED: cathepsin B-like [Solanum tuberosum]   631   e-178
ref|XP_004233222.1| PREDICTED: cathepsin B-like [Solanum lycoper...   578   e-162
ref|XP_006362603.1| PREDICTED: cathepsin B-like [Solanum tuberosum]   573   e-161
ref|XP_004233219.1| PREDICTED: cathepsin B-like [Solanum lycoper...   568   e-159
ref|NP_001275088.1| cathepsin B-like cysteine proteinase precurs...   563   e-158
ref|XP_006444663.1| hypothetical protein CICLE_v10020859mg [Citr...   521   e-145
ref|XP_002301457.2| putative cathepsin B-like protease family pr...   520   e-145
ref|XP_006491433.1| PREDICTED: cathepsin B-like isoform X1 [Citr...   519   e-144
gb|EOX95504.1| Cysteine proteinases superfamily protein [Theobro...   519   e-144
ref|XP_006375410.1| hypothetical protein POPTR_0014s10540g [Popu...   518   e-144
ref|XP_002320244.2| putative cathepsin B-like protease family pr...   517   e-144
gb|EXB94879.1| Cathepsin B [Morus notabilis]                          505   e-140
ref|XP_006375409.1| hypothetical protein POPTR_0014s10540g [Popu...   505   e-140
ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis...   504   e-140
ref|XP_002515139.1| cathepsin B, putative [Ricinus communis] gi|...   502   e-139
gb|EXB94880.1| Cathepsin B [Morus notabilis]                          501   e-139
gb|EMJ22685.1| hypothetical protein PRUPE_ppa007538mg [Prunus pe...   499   e-139

>gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
          Length = 356

 Score =  649 bits (1674), Expect = 0.0
 Identities = 307/356 (86%), Positives = 321/356 (90%)
 Frame = +2

Query: 80   MGMNHMXXXXXXXXXXXXIFILQVVAENPISQAKGETAILQDSIVKQVNENEKAGWKAAL 259
            M MNHM            + +LQVVAE PISQAK E+AILQDSIVKQVNENEKAGWKAAL
Sbjct: 1    MAMNHMSLVTFLLLIGASVLVLQVVAEQPISQAKAESAILQDSIVKQVNENEKAGWKAAL 60

Query: 260  NPRFSNLTVSQFKRLLGVKPTRKGDLKGIPILTHPKLSKLPQEFDARVAWPNCSTIGRIL 439
            NPRFSN TVSQFKRLLGVKPTRKGDLKGIPILTHPKL +LPQEFDARVAWPNCSTIGRIL
Sbjct: 61   NPRFSNFTVSQFKRLLGVKPTRKGDLKGIPILTHPKLLELPQEFDARVAWPNCSTIGRIL 120

Query: 440  DQGHCGSCWAFGAAESLSDRFCIHYGLNISLSANDILAXXXXXXXXXXXXXXPLQAWKYF 619
            DQGHCGSCWAFGA ESLSDRFCIHYGLNISLSAND+LA              PLQAWKYF
Sbjct: 121  DQGHCGSCWAFGAVESLSDRFCIHYGLNISLSANDLLACCGFLCGDGCDGGYPLQAWKYF 180

Query: 620  VRKGVVTEECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYVISSD 799
            VRKGVVT+ECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAY+ISSD
Sbjct: 181  VRKGVVTDECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYMISSD 240

Query: 800  PYSIMTEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYW 979
            P+SIMTE+YKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYW
Sbjct: 241  PHSIMTELYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYW 300

Query: 980  LLANQWNRGWGDDGYFKIRRGTNECGIEEEVVGGLPSARNLNVELNVSDAFLDASM 1147
            LLANQWNRGWGDDGYFKIRRGT+EC IE+EVV GLPSARNLN+EL+VSDAFLDA+M
Sbjct: 301  LLANQWNRGWGDDGYFKIRRGTDECEIEDEVVAGLPSARNLNMELDVSDAFLDAAM 356


>emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
          Length = 356

 Score =  646 bits (1666), Expect = 0.0
 Identities = 306/356 (85%), Positives = 319/356 (89%)
 Frame = +2

Query: 80   MGMNHMXXXXXXXXXXXXIFILQVVAENPISQAKGETAILQDSIVKQVNENEKAGWKAAL 259
            M +NHM            I +LQVVAE PISQAK E+AILQDSIVKQVNENEKAGWKAAL
Sbjct: 1    MALNHMSLTTLFLLIGASIIVLQVVAEQPISQAKAESAILQDSIVKQVNENEKAGWKAAL 60

Query: 260  NPRFSNLTVSQFKRLLGVKPTRKGDLKGIPILTHPKLSKLPQEFDARVAWPNCSTIGRIL 439
            NPRFSN TVSQFKRLLGVKPTRKGDLKGIPILTHPKL +LPQEFDARVAW NCSTIGRIL
Sbjct: 61   NPRFSNFTVSQFKRLLGVKPTRKGDLKGIPILTHPKLLELPQEFDARVAWSNCSTIGRIL 120

Query: 440  DQGHCGSCWAFGAAESLSDRFCIHYGLNISLSANDILAXXXXXXXXXXXXXXPLQAWKYF 619
            DQGHCGSCWAFGA ESLSDRFCIHYGLNISLSAND+ A              PLQAWKYF
Sbjct: 121  DQGHCGSCWAFGAVESLSDRFCIHYGLNISLSANDLYACCGFLCGDGCDGGYPLQAWKYF 180

Query: 620  VRKGVVTEECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYVISSD 799
            VRKGVVT+ECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWS+SKHFGVNAY+ISSD
Sbjct: 181  VRKGVVTDECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSRSKHFGVNAYMISSD 240

Query: 800  PYSIMTEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYW 979
            P+SIMTEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGD+MGGHAVKLIGWGTSEDGEDYW
Sbjct: 241  PHSIMTEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDIMGGHAVKLIGWGTSEDGEDYW 300

Query: 980  LLANQWNRGWGDDGYFKIRRGTNECGIEEEVVGGLPSARNLNVELNVSDAFLDASM 1147
            LLANQWNRGWGDDGYFKIRRGTNEC IE+EVV GLPSARNLNVEL+VSDAFLDA+M
Sbjct: 301  LLANQWNRGWGDDGYFKIRRGTNECEIEDEVVAGLPSARNLNVELDVSDAFLDAAM 356


>ref|XP_004233221.1| PREDICTED: cathepsin B-like [Solanum lycopersicum]
          Length = 352

 Score =  632 bits (1630), Expect = e-178
 Identities = 295/336 (87%), Positives = 312/336 (92%)
 Frame = +2

Query: 140  ILQVVAENPISQAKGETAILQDSIVKQVNENEKAGWKAALNPRFSNLTVSQFKRLLGVKP 319
            +LQVVAENPISQAK E+AILQDSIVKQVNENEKAGW+AALNP+FSN TVSQFKRLLGVKP
Sbjct: 17   VLQVVAENPISQAKAESAILQDSIVKQVNENEKAGWRAALNPQFSNFTVSQFKRLLGVKP 76

Query: 320  TRKGDLKGIPILTHPKLSKLPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAAESLSDR 499
            TRKGDLKGIPILTHPKL KLPQEFDARVAWP CSTIGRILDQGHCGSCWAFGAAESLSDR
Sbjct: 77   TRKGDLKGIPILTHPKLLKLPQEFDARVAWPQCSTIGRILDQGHCGSCWAFGAAESLSDR 136

Query: 500  FCIHYGLNISLSANDILAXXXXXXXXXXXXXXPLQAWKYFVRKGVVTEECDPYFDNEGCS 679
            FCIHYGLNISLSANDI+A              PL+AWKYFVRKGVVTEECDPYFDN+GCS
Sbjct: 137  FCIHYGLNISLSANDIIACCGYLCGDGCDGGYPLEAWKYFVRKGVVTEECDPYFDNKGCS 196

Query: 680  HPGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYVISSDPYSIMTEVYKNGPVEVSFTV 859
            HPGCEP YPTP+C RKCVK+NLLWSKSKHFG+NAY+I+SDPYSIMTEVYKNGPVEVSFTV
Sbjct: 197  HPGCEPGYPTPQCKRKCVKENLLWSKSKHFGINAYLINSDPYSIMTEVYKNGPVEVSFTV 256

Query: 860  YEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRR 1039
            YEDFAHYKSGVYKH+ G+ MGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRR
Sbjct: 257  YEDFAHYKSGVYKHINGEEMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRR 316

Query: 1040 GTNECGIEEEVVGGLPSARNLNVELNVSDAFLDASM 1147
            GTNECGIEEEVV G+PSA+NLNVEL+VSDA LDASM
Sbjct: 317  GTNECGIEEEVVAGMPSAKNLNVELDVSDALLDASM 352


>ref|XP_006362602.1| PREDICTED: cathepsin B-like [Solanum tuberosum]
          Length = 352

 Score =  631 bits (1627), Expect = e-178
 Identities = 295/336 (87%), Positives = 313/336 (93%)
 Frame = +2

Query: 140  ILQVVAENPISQAKGETAILQDSIVKQVNENEKAGWKAALNPRFSNLTVSQFKRLLGVKP 319
            +LQVVAENPISQAK E+AILQDSIVKQVNENEKAGW+AALNP+FSN TVSQFKRLLGVKP
Sbjct: 17   VLQVVAENPISQAKAESAILQDSIVKQVNENEKAGWRAALNPQFSNFTVSQFKRLLGVKP 76

Query: 320  TRKGDLKGIPILTHPKLSKLPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAAESLSDR 499
            TRKGDLKGIPILTHP+L KLPQEFDARVAWP CSTIGRILDQGHCGSCWAFGAAESLSDR
Sbjct: 77   TRKGDLKGIPILTHPELLKLPQEFDARVAWPQCSTIGRILDQGHCGSCWAFGAAESLSDR 136

Query: 500  FCIHYGLNISLSANDILAXXXXXXXXXXXXXXPLQAWKYFVRKGVVTEECDPYFDNEGCS 679
            FCIHYGLNISLSANDI+A              PL+AWKYFVRKGVVTEECDPYFDN+GCS
Sbjct: 137  FCIHYGLNISLSANDIVACCGYLCGDGCDGGYPLEAWKYFVRKGVVTEECDPYFDNKGCS 196

Query: 680  HPGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYVISSDPYSIMTEVYKNGPVEVSFTV 859
            HPGCEP YPTP+C RKCVK+NLLWSKSKHFGVNAY+I+SDPYSIMTEVYKNGPVEVSFTV
Sbjct: 197  HPGCEPGYPTPQCKRKCVKENLLWSKSKHFGVNAYLINSDPYSIMTEVYKNGPVEVSFTV 256

Query: 860  YEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRR 1039
            YEDFAHYKSGVYKH+ G+ MGGHAVKLIGWGTSEDGE+YWLLANQWNRGWGDDGYFKIRR
Sbjct: 257  YEDFAHYKSGVYKHINGEEMGGHAVKLIGWGTSEDGENYWLLANQWNRGWGDDGYFKIRR 316

Query: 1040 GTNECGIEEEVVGGLPSARNLNVELNVSDAFLDASM 1147
            GTNECGIEEEVV G+PSA+NLNVEL+VSDAFLDASM
Sbjct: 317  GTNECGIEEEVVAGMPSAKNLNVELDVSDAFLDASM 352


>ref|XP_004233222.1| PREDICTED: cathepsin B-like [Solanum lycopersicum]
          Length = 354

 Score =  578 bits (1491), Expect = e-162
 Identities = 271/356 (76%), Positives = 297/356 (83%)
 Frame = +2

Query: 80   MGMNHMXXXXXXXXXXXXIFILQVVAENPISQAKGETAILQDSIVKQVNENEKAGWKAAL 259
            MGMN M            IFILQVVAE PIS+AKGE+ IL++SI+K+VNEN KAGWKAA 
Sbjct: 1    MGMNKMFLPTPLLLCAFFIFILQVVAEKPISEAKGESVILRESIIKEVNENGKAGWKAAF 60

Query: 260  NPRFSNLTVSQFKRLLGVKPTRKGDLKGIPILTHPKLSKLPQEFDARVAWPNCSTIGRIL 439
            NPRFSN TVSQFKRLLGVKP R+GDLK IPILTHPKL  LP+EFDAR AW  CSTIGRIL
Sbjct: 61   NPRFSNFTVSQFKRLLGVKPPREGDLKSIPILTHPKLKNLPKEFDARTAWSECSTIGRIL 120

Query: 440  DQGHCGSCWAFGAAESLSDRFCIHYGLNISLSANDILAXXXXXXXXXXXXXXPLQAWKYF 619
            DQGHCGSCWAFGA ESLSDRFCIHYGLNISLS ND++A              P+ AW YF
Sbjct: 121  DQGHCGSCWAFGAVESLSDRFCIHYGLNISLSVNDVIACCGFHCGNGCDGGSPIAAWHYF 180

Query: 620  VRKGVVTEECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYVISSD 799
            +RKGVV+E+CDPYFDN GCSHPGCEP YPTP+C+RKCV +NLLWSKSKHFGVNAY+ISS+
Sbjct: 181  IRKGVVSEKCDPYFDNIGCSHPGCEPTYPTPQCNRKCVNENLLWSKSKHFGVNAYMISSN 240

Query: 800  PYSIMTEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYW 979
            PYSIMTEVYKNGPVEV+  VYEDFAHYKSGVYKHVTG+ +GGHAVKLIGWGTSE+GEDYW
Sbjct: 241  PYSIMTEVYKNGPVEVALNVYEDFAHYKSGVYKHVTGEYIGGHAVKLIGWGTSEEGEDYW 300

Query: 980  LLANQWNRGWGDDGYFKIRRGTNECGIEEEVVGGLPSARNLNVELNVSDAFLDASM 1147
            LL N WN+GWG+DGYFKIRRGTNEC IE  VV GLPSARNLNVEL+  D FLD SM
Sbjct: 301  LLVNSWNKGWGNDGYFKIRRGTNECDIESNVVAGLPSARNLNVELD--DDFLDTSM 354


>ref|XP_006362603.1| PREDICTED: cathepsin B-like [Solanum tuberosum]
          Length = 354

 Score =  573 bits (1476), Expect = e-161
 Identities = 268/356 (75%), Positives = 294/356 (82%)
 Frame = +2

Query: 80   MGMNHMXXXXXXXXXXXXIFILQVVAENPISQAKGETAILQDSIVKQVNENEKAGWKAAL 259
            MGM  +            IFILQV AE PIS+AKGE+ ILQ+SI+K+VNEN KAGWKAA 
Sbjct: 1    MGMTKISLATPLLLCAFFIFILQVFAEKPISEAKGESVILQESIIKEVNENVKAGWKAAF 60

Query: 260  NPRFSNLTVSQFKRLLGVKPTRKGDLKGIPILTHPKLSKLPQEFDARVAWPNCSTIGRIL 439
            NPRFSN TVSQFK LLGVKP R+GDLK IPILTHPKL  LP+EFDAR AWP CSTIGRIL
Sbjct: 61   NPRFSNFTVSQFKFLLGVKPPREGDLKSIPILTHPKLKNLPKEFDARTAWPQCSTIGRIL 120

Query: 440  DQGHCGSCWAFGAAESLSDRFCIHYGLNISLSANDILAXXXXXXXXXXXXXXPLQAWKYF 619
            DQGHCGSCWAFGA ESLSDRFCIHYGLNISLS ND++A              P++AW YF
Sbjct: 121  DQGHCGSCWAFGAVESLSDRFCIHYGLNISLSVNDVIACCGFYCGNGCDGGSPIRAWHYF 180

Query: 620  VRKGVVTEECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYVISSD 799
            + KGVV+E+CDPYFDN GCSHPGCEP YPTP+C+RKCVK+NLLWSKSKHFGVNAY+ISSD
Sbjct: 181  IHKGVVSEKCDPYFDNIGCSHPGCEPIYPTPQCNRKCVKENLLWSKSKHFGVNAYMISSD 240

Query: 800  PYSIMTEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYW 979
            PYSIMTEVYKNGPVEV+  VYEDFAHYKSGVYKHVTG+ +GGHAVKLIGWGTSE+GEDYW
Sbjct: 241  PYSIMTEVYKNGPVEVALNVYEDFAHYKSGVYKHVTGEYIGGHAVKLIGWGTSEEGEDYW 300

Query: 980  LLANQWNRGWGDDGYFKIRRGTNECGIEEEVVGGLPSARNLNVELNVSDAFLDASM 1147
            LL N WN+ WGDDGYFKIRRGTNEC IE   V GLPSARNLNVEL+  D FL+ SM
Sbjct: 301  LLVNSWNKSWGDDGYFKIRRGTNECDIESNTVAGLPSARNLNVELD--DDFLNVSM 354


>ref|XP_004233219.1| PREDICTED: cathepsin B-like [Solanum lycopersicum]
          Length = 354

 Score =  568 bits (1463), Expect = e-159
 Identities = 267/338 (78%), Positives = 287/338 (84%)
 Frame = +2

Query: 134  IFILQVVAENPISQAKGETAILQDSIVKQVNENEKAGWKAALNPRFSNLTVSQFKRLLGV 313
            I ILQV AE PI++AK E+AILQDSIVKQVNEN +AGWKAA NP+ SN TVSQFKRLLGV
Sbjct: 19   ILILQVAAEKPITEAKLESAILQDSIVKQVNENAEAGWKAAFNPQLSNFTVSQFKRLLGV 78

Query: 314  KPTRKGDLKGIPILTHPKLSKLPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAAESLS 493
            KP R+GDL+GIP+LTHPKL +LP+EFDAR AWP CSTIGRILDQGHCGSCWAFGA ESLS
Sbjct: 79   KPAREGDLEGIPVLTHPKLKELPKEFDARKAWPQCSTIGRILDQGHCGSCWAFGAVESLS 138

Query: 494  DRFCIHYGLNISLSANDILAXXXXXXXXXXXXXXPLQAWKYFVRKGVVTEECDPYFDNEG 673
            DRFCIHY L+ISLS ND+LA              P+ AW+YF R+GVVTEECDPYFD  G
Sbjct: 139  DRFCIHYNLSISLSVNDLLACCGFLCGSGCDGGYPIAAWRYFKRRGVVTEECDPYFDTTG 198

Query: 674  CSHPGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYVISSDPYSIMTEVYKNGPVEVSF 853
            CSHPGCEP YPTPKCHRKCVK N+LW KSKH+GVNAY +S DP SIM EVYKNGPVEVSF
Sbjct: 199  CSHPGCEPLYPTPKCHRKCVKGNVLWRKSKHYGVNAYRVSHDPQSIMAEVYKNGPVEVSF 258

Query: 854  TVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKI 1033
            TVYEDFAHYKSGVYKHVTG  MGGHAVKLIGWGTSE GEDYWL+AN WNRGWG+DGYFKI
Sbjct: 259  TVYEDFAHYKSGVYKHVTGGNMGGHAVKLIGWGTSEQGEDYWLIANSWNRGWGEDGYFKI 318

Query: 1034 RRGTNECGIEEEVVGGLPSARNLNVELNVSDAFLDASM 1147
            RRGTNECGIE  VV GLPSARNLNVEL   DA LDASM
Sbjct: 319  RRGTNECGIEHSVVAGLPSARNLNVEL--GDAVLDASM 354


>ref|NP_001275088.1| cathepsin B-like cysteine proteinase precursor [Solanum tuberosum]
            gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine
            proteinase [Solanum tuberosum]
          Length = 354

 Score =  563 bits (1450), Expect = e-158
 Identities = 264/338 (78%), Positives = 285/338 (84%)
 Frame = +2

Query: 134  IFILQVVAENPISQAKGETAILQDSIVKQVNENEKAGWKAALNPRFSNLTVSQFKRLLGV 313
            I ILQV AE PIS+AK E+AILQDSIVK+VNEN +AGWKAA NP+ SN TVSQFKRLLGV
Sbjct: 19   ILILQVAAEKPISEAKLESAILQDSIVKRVNENAEAGWKAAFNPQLSNFTVSQFKRLLGV 78

Query: 314  KPTRKGDLKGIPILTHPKLSKLPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAAESLS 493
            KP R+GDL+GIP+LTHP+L +LP+EFDAR AWP CSTIG+ILDQGHCGSCWAFGA ESLS
Sbjct: 79   KPAREGDLEGIPVLTHPRLKELPKEFDARKAWPQCSTIGKILDQGHCGSCWAFGAVESLS 138

Query: 494  DRFCIHYGLNISLSANDILAXXXXXXXXXXXXXXPLQAWKYFVRKGVVTEECDPYFDNEG 673
            DRFCIHY L+ISLS ND+LA              P+ AW+YF R GVVTEECDPYFD  G
Sbjct: 139  DRFCIHYNLSISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFKRSGVVTEECDPYFDTTG 198

Query: 674  CSHPGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYVISSDPYSIMTEVYKNGPVEVSF 853
            CSHPGCEP YPTPKCHRKCVK N+LW KSKH+GVNAY +S DP SIM EVYKNGPVEVSF
Sbjct: 199  CSHPGCEPLYPTPKCHRKCVKGNVLWRKSKHYGVNAYRVSHDPQSIMAEVYKNGPVEVSF 258

Query: 854  TVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKI 1033
            TVYEDFAHYKSGVYKHVTG  MGGHAVKLIGWGTSE GEDYWL+ N WNRGWG+DGYFKI
Sbjct: 259  TVYEDFAHYKSGVYKHVTGGNMGGHAVKLIGWGTSEQGEDYWLIVNSWNRGWGEDGYFKI 318

Query: 1034 RRGTNECGIEEEVVGGLPSARNLNVELNVSDAFLDASM 1147
            RRGTNECGIE  VV GLPSARNLNVEL   DA LDASM
Sbjct: 319  RRGTNECGIEHSVVAGLPSARNLNVEL--GDAVLDASM 354


>ref|XP_006444663.1| hypothetical protein CICLE_v10020859mg [Citrus clementina]
            gi|568876746|ref|XP_006491434.1| PREDICTED: cathepsin
            B-like isoform X2 [Citrus sinensis]
            gi|557546925|gb|ESR57903.1| hypothetical protein
            CICLE_v10020859mg [Citrus clementina]
          Length = 354

 Score =  521 bits (1341), Expect = e-145
 Identities = 238/333 (71%), Positives = 272/333 (81%)
 Frame = +2

Query: 146  QVVAENPISQAKGETAILQDSIVKQVNENEKAGWKAALNPRFSNLTVSQFKRLLGVKPTR 325
            Q  AE  +S+ K ++ ILQDSI+K+VNEN KAGWKAA NP+FSN TV QFK LLGVKPT 
Sbjct: 21   QTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTP 80

Query: 326  KGDLKGIPILTHPKLSKLPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAAESLSDRFC 505
            KG L G+P+ TH K  KLP+ FDAR AWP CSTI RILDQGHCGSCWAFGA E+LSDRFC
Sbjct: 81   KGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC 140

Query: 506  IHYGLNISLSANDILAXXXXXXXXXXXXXXPLQAWKYFVRKGVVTEECDPYFDNEGCSHP 685
            IH+G+N+SLS ND+LA              P+ AW+YFV  GVVTEECDPYFD+ GCSHP
Sbjct: 141  IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHP 200

Query: 686  GCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYVISSDPYSIMTEVYKNGPVEVSFTVYE 865
            GCEPAYPTPKC RKCVK+N LW  SKH+ ++AY I+SDP  IM E+YKNGPVEVSFTVYE
Sbjct: 201  GCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260

Query: 866  DFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRGT 1045
            DFAHYKSGVYKH+TGDVMGGHAVKLIGWGTS+DGEDYW+LANQWNR WG DGYFKI+RG+
Sbjct: 261  DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 320

Query: 1046 NECGIEEEVVGGLPSARNLNVELNVSDAFLDAS 1144
            NECGIEE+VV GLPS++NL  E+  +D F DAS
Sbjct: 321  NECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 353


>ref|XP_002301457.2| putative cathepsin B-like protease family protein [Populus
            trichocarpa] gi|550345314|gb|EEE80730.2| putative
            cathepsin B-like protease family protein [Populus
            trichocarpa]
          Length = 357

 Score =  520 bits (1340), Expect = e-145
 Identities = 242/336 (72%), Positives = 271/336 (80%)
 Frame = +2

Query: 137  FILQVVAENPISQAKGETAILQDSIVKQVNENEKAGWKAALNPRFSNLTVSQFKRLLGVK 316
            F  QV+A  P+S  K  + ILQDSI+K+VN N KAGWKA +N  FSN TV+QFK LLGVK
Sbjct: 21   FQSQVIAVEPVSDLKLNSRILQDSILKKVNGNPKAGWKATMNHHFSNYTVAQFKYLLGVK 80

Query: 317  PTRKGDLKGIPILTHPKLSKLPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAAESLSD 496
            PT K +L+GIP+++HPK  +LP+EFDAR AWP CSTIG+ILDQGHCGSCWAFGA ESLSD
Sbjct: 81   PTPKEELRGIPVISHPKSLRLPEEFDARTAWPQCSTIGKILDQGHCGSCWAFGAVESLSD 140

Query: 497  RFCIHYGLNISLSANDILAXXXXXXXXXXXXXXPLQAWKYFVRKGVVTEECDPYFDNEGC 676
            RFCIHYG+NISLS ND+LA              P+ AW+YFV  GVVTEECDPYFD+ GC
Sbjct: 141  RFCIHYGMNISLSVNDLLACCGFLCGSGCNGGYPISAWRYFVHHGVVTEECDPYFDDIGC 200

Query: 677  SHPGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYVISSDPYSIMTEVYKNGPVEVSFT 856
            SHPGCEP YPTPKC RKCV +N LW KSKH+GV  Y I SDP SIM E+YKNGPVEV+FT
Sbjct: 201  SHPGCEPGYPTPKCARKCVNKNQLWKKSKHYGVKPYRIDSDPDSIMAEIYKNGPVEVAFT 260

Query: 857  VYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIR 1036
            VYEDFAHYKSGVYKH+TG +MGGHAVKLIGWGTSEDGE YWLLANQWNRGWGDDG+FKIR
Sbjct: 261  VYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTSEDGEAYWLLANQWNRGWGDDGFFKIR 320

Query: 1037 RGTNECGIEEEVVGGLPSARNLNVELNVSDAFLDAS 1144
            RGTNECGIE +VV GLPS RNL  E+   DA  DAS
Sbjct: 321  RGTNECGIEGDVVAGLPSTRNLVREVVSIDAREDAS 356


>ref|XP_006491433.1| PREDICTED: cathepsin B-like isoform X1 [Citrus sinensis]
          Length = 362

 Score =  519 bits (1337), Expect = e-144
 Identities = 237/330 (71%), Positives = 271/330 (82%)
 Frame = +2

Query: 155  AENPISQAKGETAILQDSIVKQVNENEKAGWKAALNPRFSNLTVSQFKRLLGVKPTRKGD 334
            AE  +S+ K ++ ILQDSI+K+VNEN KAGWKAA NP+FSN TV QFK LLGVKPT KG 
Sbjct: 32   AEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGL 91

Query: 335  LKGIPILTHPKLSKLPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAAESLSDRFCIHY 514
            L G+P+ TH K  KLP+ FDAR AWP CSTI RILDQGHCGSCWAFGA E+LSDRFCIH+
Sbjct: 92   LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF 151

Query: 515  GLNISLSANDILAXXXXXXXXXXXXXXPLQAWKYFVRKGVVTEECDPYFDNEGCSHPGCE 694
            G+N+SLS ND+LA              P+ AW+YFV  GVVTEECDPYFD+ GCSHPGCE
Sbjct: 152  GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 211

Query: 695  PAYPTPKCHRKCVKQNLLWSKSKHFGVNAYVISSDPYSIMTEVYKNGPVEVSFTVYEDFA 874
            PAYPTPKC RKCVK+N LW  SKH+ ++AY I+SDP  IM E+YKNGPVEVSFTVYEDFA
Sbjct: 212  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 271

Query: 875  HYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRGTNEC 1054
            HYKSGVYKH+TGDVMGGHAVKLIGWGTS+DGEDYW+LANQWNR WG DGYFKI+RG+NEC
Sbjct: 272  HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 331

Query: 1055 GIEEEVVGGLPSARNLNVELNVSDAFLDAS 1144
            GIEE+VV GLPS++NL  E+  +D F DAS
Sbjct: 332  GIEEDVVAGLPSSKNLVKEITSADMFEDAS 361


>gb|EOX95504.1| Cysteine proteinases superfamily protein [Theobroma cacao]
          Length = 359

 Score =  519 bits (1337), Expect = e-144
 Identities = 241/332 (72%), Positives = 269/332 (81%)
 Frame = +2

Query: 146  QVVAENPISQAKGETAILQDSIVKQVNENEKAGWKAALNPRFSNLTVSQFKRLLGVKPTR 325
            +V+A   +S+ K  + ILQDSIVKQVNEN KAGWKAALNPR SN TV +FK LLGVKPT 
Sbjct: 24   KVIAVEQLSEVKLNSQILQDSIVKQVNENPKAGWKAALNPRLSNYTVGEFKHLLGVKPTP 83

Query: 326  KGDLKGIPILTHPKLSKLPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAAESLSDRFC 505
            K +L GIP++TH K  K+P +FDAR AWP CSTIGRILDQGHCGSCWAFGA ESLSDRFC
Sbjct: 84   KKELLGIPVITHGKSLKVPTKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFC 143

Query: 506  IHYGLNISLSANDILAXXXXXXXXXXXXXXPLQAWKYFVRKGVVTEECDPYFDNEGCSHP 685
            IH+ +NISLS ND+LA              P+ AW+YFVR+GVVTEECDPYFD+ GCSHP
Sbjct: 144  IHFSMNISLSVNDLLACCGFLCGSGCDGGYPISAWRYFVRRGVVTEECDPYFDDTGCSHP 203

Query: 686  GCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYVISSDPYSIMTEVYKNGPVEVSFTVYE 865
            GCEPAYPTP+C +KCVK N LW +SKH+ V AY I+SDP  IM EVY NGPVEVSFTVYE
Sbjct: 204  GCEPAYPTPRCVKKCVKGNQLWRESKHYSVGAYRINSDPADIMAEVYTNGPVEVSFTVYE 263

Query: 866  DFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRGT 1045
            DFAHYKSGVYKHVTG VMGGHAVKLIGWGTS+DGEDYWLLANQWNRGWGDDGYFKI RGT
Sbjct: 264  DFAHYKSGVYKHVTGGVMGGHAVKLIGWGTSDDGEDYWLLANQWNRGWGDDGYFKISRGT 323

Query: 1046 NECGIEEEVVGGLPSARNLNVELNVSDAFLDA 1141
            NECGIE++VV GLPS +NL  E+   D   DA
Sbjct: 324  NECGIEDDVVAGLPSTKNLVREVGDMDTLEDA 355


>ref|XP_006375410.1| hypothetical protein POPTR_0014s10540g [Populus trichocarpa]
            gi|550323924|gb|ERP53207.1| hypothetical protein
            POPTR_0014s10540g [Populus trichocarpa]
          Length = 352

 Score =  518 bits (1334), Expect = e-144
 Identities = 235/321 (73%), Positives = 266/321 (82%)
 Frame = +2

Query: 140  ILQVVAENPISQAKGETAILQDSIVKQVNENEKAGWKAALNPRFSNLTVSQFKRLLGVKP 319
            I Q  AE P+S+ K  + ILQDSIV++VNEN KAGW+A +NP+FSN +V +FK LLGVK 
Sbjct: 17   ISQATAEEPVSKLKLNSRILQDSIVQKVNENPKAGWEATMNPQFSNYSVGEFKYLLGVKQ 76

Query: 320  TRKGDLKGIPILTHPKLSKLPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAAESLSDR 499
            T + +L+G+P+L HPK  KLP EFDAR AWP+CSTIGRILDQGHCGSCWAFGA ESLSDR
Sbjct: 77   TPRKELRGVPLLRHPKSMKLPIEFDARTAWPHCSTIGRILDQGHCGSCWAFGAVESLSDR 136

Query: 500  FCIHYGLNISLSANDILAXXXXXXXXXXXXXXPLQAWKYFVRKGVVTEECDPYFDNEGCS 679
            FCIHYG+N+SLS ND+LA              P+ AW+YFV+ GVVTEECDPYFD+ GCS
Sbjct: 137  FCIHYGMNLSLSVNDLLACCGWMCGAGCDGGSPIDAWRYFVQSGVVTEECDPYFDDIGCS 196

Query: 680  HPGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYVISSDPYSIMTEVYKNGPVEVSFTV 859
            HPGCEP +PTPKC RKC  +N LW++SKHF VNAY I SDP+SIM EV  NGPVEV+FTV
Sbjct: 197  HPGCEPGFPTPKCERKCADKNKLWAESKHFSVNAYRIDSDPHSIMAEVSSNGPVEVAFTV 256

Query: 860  YEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRR 1039
            YEDFAHYKSGVYKH+TGD MGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKI+R
Sbjct: 257  YEDFAHYKSGVYKHITGDAMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIKR 316

Query: 1040 GTNECGIEEEVVGGLPSARNL 1102
            GTNECGIEE VV GLPS RNL
Sbjct: 317  GTNECGIEEAVVAGLPSTRNL 337


>ref|XP_002320244.2| putative cathepsin B-like protease family protein [Populus
            trichocarpa] gi|550323923|gb|EEE98559.2| putative
            cathepsin B-like protease family protein [Populus
            trichocarpa]
          Length = 339

 Score =  517 bits (1332), Expect = e-144
 Identities = 234/319 (73%), Positives = 265/319 (83%)
 Frame = +2

Query: 146  QVVAENPISQAKGETAILQDSIVKQVNENEKAGWKAALNPRFSNLTVSQFKRLLGVKPTR 325
            Q  AE P+S+ K  + ILQDSIV++VNEN KAGW+A +NP+FSN +V +FK LLGVK T 
Sbjct: 6    QATAEEPVSKLKLNSRILQDSIVQKVNENPKAGWEATMNPQFSNYSVGEFKYLLGVKQTP 65

Query: 326  KGDLKGIPILTHPKLSKLPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAAESLSDRFC 505
            + +L+G+P+L HPK  KLP EFDAR AWP+CSTIGRILDQGHCGSCWAFGA ESLSDRFC
Sbjct: 66   RKELRGVPLLRHPKSMKLPIEFDARTAWPHCSTIGRILDQGHCGSCWAFGAVESLSDRFC 125

Query: 506  IHYGLNISLSANDILAXXXXXXXXXXXXXXPLQAWKYFVRKGVVTEECDPYFDNEGCSHP 685
            IHYG+N+SLS ND+LA              P+ AW+YFV+ GVVTEECDPYFD+ GCSHP
Sbjct: 126  IHYGMNLSLSVNDLLACCGWMCGAGCDGGSPIDAWRYFVQSGVVTEECDPYFDDIGCSHP 185

Query: 686  GCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYVISSDPYSIMTEVYKNGPVEVSFTVYE 865
            GCEP +PTPKC RKC  +N LW++SKHF VNAY I SDP+SIM EV  NGPVEV+FTVYE
Sbjct: 186  GCEPGFPTPKCERKCADKNKLWAESKHFSVNAYRIDSDPHSIMAEVSSNGPVEVAFTVYE 245

Query: 866  DFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRGT 1045
            DFAHYKSGVYKH+TGD MGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKI+RGT
Sbjct: 246  DFAHYKSGVYKHITGDAMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIKRGT 305

Query: 1046 NECGIEEEVVGGLPSARNL 1102
            NECGIEE VV GLPS RNL
Sbjct: 306  NECGIEEAVVAGLPSTRNL 324


>gb|EXB94879.1| Cathepsin B [Morus notabilis]
          Length = 420

 Score =  505 bits (1301), Expect = e-140
 Identities = 229/319 (71%), Positives = 259/319 (81%)
 Frame = +2

Query: 146  QVVAENPISQAKGETAILQDSIVKQVNENEKAGWKAALNPRFSNLTVSQFKRLLGVKPTR 325
            +V+A  P+S  K  + ILQ+SIVK+VNEN +AGW+A +NPRFSN T  +F+RLLGVK T 
Sbjct: 26   RVIALQPLSNLKLNSPILQESIVKRVNENPEAGWRAEMNPRFSNFTAGEFRRLLGVKETP 85

Query: 326  KGDLKGIPILTHPKLSKLPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAAESLSDRFC 505
            K +L+  P++THPK  KLP +FDAR AWP CSTI RILDQGHCGSCWAFGA ESLSDRFC
Sbjct: 86   KHELESTPVITHPKSLKLPDKFDARTAWPQCSTIKRILDQGHCGSCWAFGAVESLSDRFC 145

Query: 506  IHYGLNISLSANDILAXXXXXXXXXXXXXXPLQAWKYFVRKGVVTEECDPYFDNEGCSHP 685
            IH+  NISLS ND+LA              PL AW+Y    GVVTEECDPYFDN GCSHP
Sbjct: 146  IHFNTNISLSVNDVLACCGFLCGAGCDGGTPLFAWRYLHHHGVVTEECDPYFDNTGCSHP 205

Query: 686  GCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYVISSDPYSIMTEVYKNGPVEVSFTVYE 865
            GCEPAYPTP+CHRKCV +N LW +SKH+ VNAY ISSDP+SIM EVYKNGPVEV FTVYE
Sbjct: 206  GCEPAYPTPRCHRKCVNKNNLWRQSKHYSVNAYKISSDPHSIMAEVYKNGPVEVDFTVYE 265

Query: 866  DFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRGT 1045
            DFAHYKSGVYKH+TG VMGGHAVKLIGWGTS+ GEDYWL+ANQWNR WGDDGYFKIRRGT
Sbjct: 266  DFAHYKSGVYKHITGSVMGGHAVKLIGWGTSDTGEDYWLVANQWNRSWGDDGYFKIRRGT 325

Query: 1046 NECGIEEEVVGGLPSARNL 1102
            NECGIE++ V G+PS RNL
Sbjct: 326  NECGIEKDAVAGMPSKRNL 344


>ref|XP_006375409.1| hypothetical protein POPTR_0014s10540g [Populus trichocarpa]
            gi|550323922|gb|ERP53206.1| hypothetical protein
            POPTR_0014s10540g [Populus trichocarpa]
          Length = 362

 Score =  505 bits (1301), Expect = e-140
 Identities = 233/331 (70%), Positives = 264/331 (79%), Gaps = 10/331 (3%)
 Frame = +2

Query: 140  ILQVVAENPISQAKGETAILQDSIVKQVNENEKAGWKAALNPRFSNLTVSQFKRLLGVKP 319
            I Q  AE P+S+ K  + ILQDSIV++VNEN KAGW+A +NP+FSN +V +FK LLGVK 
Sbjct: 17   ISQATAEEPVSKLKLNSRILQDSIVQKVNENPKAGWEATMNPQFSNYSVGEFKYLLGVKQ 76

Query: 320  TRKGDLKGIPILTHPKLSKLPQEFDARVAWPNCSTIGRILDQ----------GHCGSCWA 469
            T + +L+G+P+L HPK  KLP EFDAR AWP+CSTIGRIL            GHCGSCWA
Sbjct: 77   TPRKELRGVPLLRHPKSMKLPIEFDARTAWPHCSTIGRILGHFLPACIAWFSGHCGSCWA 136

Query: 470  FGAAESLSDRFCIHYGLNISLSANDILAXXXXXXXXXXXXXXPLQAWKYFVRKGVVTEEC 649
            FGA ESLSDRFCIHYG+N+SLS ND+LA              P+ AW+YFV+ GVVTEEC
Sbjct: 137  FGAVESLSDRFCIHYGMNLSLSVNDLLACCGWMCGAGCDGGSPIDAWRYFVQSGVVTEEC 196

Query: 650  DPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYVISSDPYSIMTEVYK 829
            DPYFD+ GCSHPGCEP +PTPKC RKC  +N LW++SKHF VNAY I SDP+SIM EV  
Sbjct: 197  DPYFDDIGCSHPGCEPGFPTPKCERKCADKNKLWAESKHFSVNAYRIDSDPHSIMAEVSS 256

Query: 830  NGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGW 1009
            NGPVEV+FTVYEDFAHYKSGVYKH+TGD MGGHAVKLIGWGTSEDGEDYWLLANQWNRGW
Sbjct: 257  NGPVEVAFTVYEDFAHYKSGVYKHITGDAMGGHAVKLIGWGTSEDGEDYWLLANQWNRGW 316

Query: 1010 GDDGYFKIRRGTNECGIEEEVVGGLPSARNL 1102
            GDDGYFKI+RGTNECGIEE VV GLPS RNL
Sbjct: 317  GDDGYFKIKRGTNECGIEEAVVAGLPSTRNL 347


>ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
            gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin
            B-like [Vitis vinifera]
          Length = 358

 Score =  504 bits (1297), Expect = e-140
 Identities = 227/321 (70%), Positives = 261/321 (81%)
 Frame = +2

Query: 146  QVVAENPISQAKGETAILQDSIVKQVNENEKAGWKAALNPRFSNLTVSQFKRLLGVKPTR 325
            +VVA   +SQ K  T ILQ+S+V+ +N N KAGWKAA+NPRFSN +V QF  LLGVKPT 
Sbjct: 24   EVVALKSVSQLKFNTKILQESMVELINANPKAGWKAAMNPRFSNYSVGQFMHLLGVKPTL 83

Query: 326  KGDLKGIPILTHPKLSKLPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAAESLSDRFC 505
            + DL+G+P++THPK  KLP+ FDAR AWP CSTIG+ILDQGHCGSCWAFGA ESLSDRFC
Sbjct: 84   QKDLEGVPVITHPKTLKLPKHFDARTAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFC 143

Query: 506  IHYGLNISLSANDILAXXXXXXXXXXXXXXPLQAWKYFVRKGVVTEECDPYFDNEGCSHP 685
            IH+G+NISLS ND+LA              PL AW+YF+  GVVTEECDPYFD  GCSHP
Sbjct: 144  IHFGMNISLSVNDLLACCGFLCGSGCDGGYPLYAWRYFIHHGVVTEECDPYFDATGCSHP 203

Query: 686  GCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYVISSDPYSIMTEVYKNGPVEVSFTVYE 865
            GCEP YPTPKC RKC  +N LW K+K +G +AY ISSDPY IM EVYKNGPVEV+FTVYE
Sbjct: 204  GCEPGYPTPKCVRKCTDENQLWRKAKRYGQSAYRISSDPYQIMAEVYKNGPVEVAFTVYE 263

Query: 866  DFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRGT 1045
            DFAHY+SGVY++ TGDVMGGHAVKLIGWGT++DGEDYW+LANQWNR WGDDGYF IRRG 
Sbjct: 264  DFAHYESGVYRYTTGDVMGGHAVKLIGWGTTDDGEDYWILANQWNRNWGDDGYFMIRRGV 323

Query: 1046 NECGIEEEVVGGLPSARNLNV 1108
            NECGIEE VV GLPS++NL +
Sbjct: 324  NECGIEEGVVAGLPSSKNLMI 344


>ref|XP_002515139.1| cathepsin B, putative [Ricinus communis] gi|223545619|gb|EEF47123.1|
            cathepsin B, putative [Ricinus communis]
          Length = 376

 Score =  502 bits (1292), Expect = e-139
 Identities = 235/354 (66%), Positives = 275/354 (77%), Gaps = 19/354 (5%)
 Frame = +2

Query: 137  FILQVVAENPISQAKGETAILQDSIVKQVNENEKAGWKAALNPRFSNLTVSQFKRLLGVK 316
            F  +V++    S+ K  + ILQ+SI+K+VNEN  AGW+AA+NP+ SN TV QFK LLG K
Sbjct: 21   FHSRVISTELDSKLKLNSRILQESIIKKVNENPDAGWEAAMNPQLSNFTVGQFKYLLGAK 80

Query: 317  PTRKGDLKGIPILTHPKLSKLPQEFDARVAWPNCSTIGRILDQ----------------- 445
            PT K +L G+P+++HPK  KLP+EFDAR AWP+CSTIG+IL Q                 
Sbjct: 81   PTPKKELMGVPMISHPKTLKLPKEFDARTAWPHCSTIGKILGQLLSFYNIFSIFFFLFLE 140

Query: 446  GHCGSCWAFGAAESLSDRFCIHYGLNISLSANDILAXXXXXXXXXXXXXXPLQAWKYFVR 625
            GHCGSCWAFGA ESLSDRFCIH+G+NISLS ND+LA              P+ AW+YFV 
Sbjct: 141  GHCGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLACCGFLCGDGCDGGYPMYAWRYFVH 200

Query: 626  KGVVTEECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYVISSDPY 805
             GVVTEECDPYFDN GCSHPGCEP +PTPKC RKC+ +N LW +SKH+ VNAY ISSDP+
Sbjct: 201  HGVVTEECDPYFDNIGCSHPGCEPGFPTPKCVRKCIDKNQLWRQSKHYSVNAYRISSDPH 260

Query: 806  SIMTEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLL 985
             +M EVYKNGPVEVSFTVYEDFAHYKSGVYKH+TG+VMGGHAVKLIGWGTS++GEDYWLL
Sbjct: 261  DVMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGEVMGGHAVKLIGWGTSDNGEDYWLL 320

Query: 986  ANQWNRGWGDDGYFKIRRGTNECGIEEEVVGGLPSARNLNV--ELNVSDAFLDA 1141
            ANQWNRGWGDDGYFKIRRGTNECGIE++ V GLPSARNL++  E+   DA  DA
Sbjct: 321  ANQWNRGWGDDGYFKIRRGTNECGIEDDAVAGLPSARNLDLVREVASMDALEDA 374


>gb|EXB94880.1| Cathepsin B [Morus notabilis]
          Length = 342

 Score =  501 bits (1291), Expect = e-139
 Identities = 229/315 (72%), Positives = 256/315 (81%)
 Frame = +2

Query: 200  QDSIVKQVNENEKAGWKAALNPRFSNLTVSQFKRLLGVKPTRKGDLKGIPILTHPKLSKL 379
            Q+SIVK VNEN +AGW+AA+NPRFSN TV++F+R+LGVKP  K DL+  P+ T+PK  KL
Sbjct: 27   QESIVKHVNENPEAGWEAAMNPRFSNSTVAEFRRMLGVKPRPKQDLRSAPVKTYPKSLKL 86

Query: 380  PQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAAESLSDRFCIHYGLNISLSANDILAXX 559
            P+EFDAR AWP CSTIGRILDQGHCGSCWAFGA ESLSDRFCIH+GLNISLS ND+LA  
Sbjct: 87   PKEFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFGLNISLSVNDLLACC 146

Query: 560  XXXXXXXXXXXXPLQAWKYFVRKGVVTEECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQ 739
                        PL+AW+Y  R GVVTEECDPYFDN GCSHPGCEPA+PTP+C RKCV  
Sbjct: 147  GFFCGEGCDGGYPLEAWEYLARSGVVTEECDPYFDNIGCSHPGCEPAFPTPRCVRKCVDG 206

Query: 740  NLLWSKSKHFGVNAYVISSDPYSIMTEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVM 919
            N LWS SK + VNAY I SDP+SIM E+YKNGPVEV FTVYEDFAHYKSGVYKH+TG +M
Sbjct: 207  NQLWSDSKRYSVNAYTIDSDPHSIMVEIYKNGPVEVDFTVYEDFAHYKSGVYKHITGGIM 266

Query: 920  GGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEEEVVGGLPSARN 1099
            GGHAVKLIGWGTS+ GEDYWLLANQWNR WGDDGYFKIRRGTNECGIE + V GLPS RN
Sbjct: 267  GGHAVKLIGWGTSDAGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIESDPVAGLPSTRN 326

Query: 1100 LNVELNVSDAFLDAS 1144
            +  E+    A  DAS
Sbjct: 327  IIKEVASDGAITDAS 341


>gb|EMJ22685.1| hypothetical protein PRUPE_ppa007538mg [Prunus persica]
          Length = 364

 Score =  499 bits (1286), Expect = e-139
 Identities = 226/331 (68%), Positives = 263/331 (79%)
 Frame = +2

Query: 137  FILQVVAENPISQAKGETAILQDSIVKQVNENEKAGWKAALNPRFSNLTVSQFKRLLGVK 316
            F  Q +A  P++++K  + ILQDSI+KQ+N+N  AGW+AA+NPRFSN TVSQF  LLGVK
Sbjct: 27   FYPQFIAAKPVTKSKLNSRILQDSIIKQINDNPMAGWEAAMNPRFSNYTVSQFMHLLGVK 86

Query: 317  PTRKGDLKGIPILTHPKLSKLPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAAESLSD 496
            PT + DL+  PILTHPK  KLP  FDAR AWP C+TIGRILDQGHCGSCWAF A E+LSD
Sbjct: 87   PTPRKDLQSFPILTHPKSLKLPTNFDARTAWPQCNTIGRILDQGHCGSCWAFAAVEALSD 146

Query: 497  RFCIHYGLNISLSANDILAXXXXXXXXXXXXXXPLQAWKYFVRKGVVTEECDPYFDNEGC 676
            RFCIH+G+NISLS ND+LA              P+ AW+YFV  GVVTEECDPYFD  GC
Sbjct: 147  RFCIHFGMNISLSVNDLLACCGFMCGDGCDGGYPIYAWRYFVHHGVVTEECDPYFDPTGC 206

Query: 677  SHPGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYVISSDPYSIMTEVYKNGPVEVSFT 856
            SHPGCEPAYPTPKC +KC  +N LW  SK + +NAY I+SD +SIM EVY NGPVEV+FT
Sbjct: 207  SHPGCEPAYPTPKCVKKCTDKNQLWKNSKRYSINAYRINSDSHSIMAEVYSNGPVEVAFT 266

Query: 857  VYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIR 1036
            VYEDFAHYKSGVY+H+ GDV+GGHAVKLIGWGT++ GEDYWLLANQWNR WGDDGYF I+
Sbjct: 267  VYEDFAHYKSGVYRHIKGDVLGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIK 326

Query: 1037 RGTNECGIEEEVVGGLPSARNLNVELNVSDA 1129
            RGTNECGIEE+VV GLPS +N   E+  +DA
Sbjct: 327  RGTNECGIEEDVVAGLPSLKNFIREVASADA 357


Top