BLASTX nr result
ID: Atropa21_contig00005184
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00005184
         (1250 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_004233219.1| PREDICTED: cathepsin B-like [Solanum lycoper...   634   e-179
ref|NP_001275088.1| cathepsin B-like cysteine proteinase precurs...   630   e-178
gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]                    564   e-158
ref|XP_006362603.1| PREDICTED: cathepsin B-like [Solanum tuberosum]   561   e-157
emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana ...   557   e-156
ref|XP_004233222.1| PREDICTED: cathepsin B-like [Solanum lycoper...   556   e-156
ref|XP_006362602.1| PREDICTED: cathepsin B-like [Solanum tuberosum]   551   e-154
ref|XP_004233221.1| PREDICTED: cathepsin B-like [Solanum lycoper...   550   e-154
gb|EOX95504.1| Cysteine proteinases superfamily protein [Theobro...   515   e-143
ref|XP_002301457.2| putative cathepsin B-like protease family pr...   503   e-140
ref|XP_006444663.1| hypothetical protein CICLE_v10020859mg [Citr...   500   e-139
gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [I...   500   e-139
ref|XP_006491433.1| PREDICTED: cathepsin B-like isoform X1 [Citr...   498   e-138
ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]         497   e-138
ref|NP_563648.1| putative cathepsin B-like cysteine protease [Ar...   497   e-138
gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [I...   496   e-138
ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arab...   496   e-138
gb|EXB94879.1| Cathepsin B [Morus notabilis]                          496   e-137
ref|XP_006305170.1| hypothetical protein CARUB_v10009537mg [Caps...
493 e-137 ref|XP_002515139.1| cathepsin B, putative [Ricinus communis] gi|... 492 e-136 >ref|XP_004233219.1| PREDICTED: cathepsin B-like [Solanum lycopersicum] Length = 354 Score = 634 bits (1636), Expect = e-179 Identities = 301/354 (85%), Positives = 312/354 (88%), Gaps = 31/354 (8%) Frame = +2 Query: 137 MAMTLKSLITPLLLGAFFILILQVAAEKPISEAKVESAILQDSIVKHVNENAKAGWKAAF 316 MA+TLKSLITPLL GAFFILILQVAAEKPI+EAK+ESAILQDSIVK VNENA+AGWKAAF Sbjct: 1 MALTLKSLITPLLFGAFFILILQVAAEKPITEAKLESAILQDSIVKQVNENAEAGWKAAF 60 Query: 317 NPQLSNFT-------------------------------LPKEFDARKAWPQCRTIGRIL 403 NPQLSNFT LPKEFDARKAWPQC TIGRIL Sbjct: 61 NPQLSNFTVSQFKRLLGVKPAREGDLEGIPVLTHPKLKELPKEFDARKAWPQCSTIGRIL 120 Query: 404 DQGHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCGFLCGNGCDGGYPIAAWRYF 583 DQGHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCGFLCG+GCDGGYPIAAWRYF Sbjct: 121 DQGHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCGFLCGSGCDGGYPIAAWRYF 180 Query: 584 KRRGVVTEECDPYFDNTGCSHPGCEPAYPTPKCHRKCVKGNLLWRKSKHYGVNAYRVSHD 763 KRRGVVTEECDPYFD TGCSHPGCEP YPTPKCHRKCVKGN+LWRKSKHYGVNAYRVSHD Sbjct: 181 KRRGVVTEECDPYFDTTGCSHPGCEPLYPTPKCHRKCVKGNVLWRKSKHYGVNAYRVSHD 240 Query: 764 PHSIMTEIYKNGPVEVAFTVYQDFAHYKSGVYKHVTGESMGGHAVKLIGWGTSEQGEDYW 943 P SIM E+YKNGPVEV+FTVY+DFAHYKSGVYKHVTG +MGGHAVKLIGWGTSEQGEDYW Sbjct: 241 PQSIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGGNMGGHAVKLIGWGTSEQGEDYW 300 Query: 944 LIVNSWNRGWGDDGYFKIRRGTNECGIEHSVVAGLPSARNLNVELDDAFLDASM 1105 LI NSWNRGWG+DGYFKIRRGTNECGIEHSVVAGLPSARNLNVEL DA LDASM Sbjct: 301 LIANSWNRGWGEDGYFKIRRGTNECGIEHSVVAGLPSARNLNVELGDAVLDASM 354 >ref|NP_001275088.1| cathepsin B-like cysteine proteinase precursor [Solanum tuberosum] gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum] Length = 354 Score = 630 bits (1626), Expect = e-178 Identities = 300/354 (84%), Positives = 311/354 (87%), Gaps = 31/354 (8%) Frame = +2 Query: 137 MAMTLKSLITPLLLGAFFILILQVAAEKPISEAKVESAILQDSIVKHVNENAKAGWKAAF 316 M +TLKSLITPLLLGAFFILILQVAAEKPISEAK+ESAILQDSIVK VNENA+AGWKAAF Sbjct: 1 
MYLTLKSLITPLLLGAFFILILQVAAEKPISEAKLESAILQDSIVKRVNENAEAGWKAAF 60 Query: 317 NPQLSNFT-------------------------------LPKEFDARKAWPQCRTIGRIL 403 NPQLSNFT LPKEFDARKAWPQC TIG+IL Sbjct: 61 NPQLSNFTVSQFKRLLGVKPAREGDLEGIPVLTHPRLKELPKEFDARKAWPQCSTIGKIL 120 Query: 404 DQGHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCGFLCGNGCDGGYPIAAWRYF 583 DQGHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACC FLCG+GCDGGYPIAAWRYF Sbjct: 121 DQGHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCSFLCGSGCDGGYPIAAWRYF 180 Query: 584 KRRGVVTEECDPYFDNTGCSHPGCEPAYPTPKCHRKCVKGNLLWRKSKHYGVNAYRVSHD 763 KR GVVTEECDPYFD TGCSHPGCEP YPTPKCHRKCVKGN+LWRKSKHYGVNAYRVSHD Sbjct: 181 KRSGVVTEECDPYFDTTGCSHPGCEPLYPTPKCHRKCVKGNVLWRKSKHYGVNAYRVSHD 240 Query: 764 PHSIMTEIYKNGPVEVAFTVYQDFAHYKSGVYKHVTGESMGGHAVKLIGWGTSEQGEDYW 943 P SIM E+YKNGPVEV+FTVY+DFAHYKSGVYKHVTG +MGGHAVKLIGWGTSEQGEDYW Sbjct: 241 PQSIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGGNMGGHAVKLIGWGTSEQGEDYW 300 Query: 944 LIVNSWNRGWGDDGYFKIRRGTNECGIEHSVVAGLPSARNLNVELDDAFLDASM 1105 LIVNSWNRGWG+DGYFKIRRGTNECGIEHSVVAGLPSARNLNVEL DA LDASM Sbjct: 301 LIVNSWNRGWGEDGYFKIRRGTNECGIEHSVVAGLPSARNLNVELGDAVLDASM 354 >gb|ABF47216.1| cathepsin B [Nicotiana benthamiana] Length = 356 Score = 564 bits (1454), Expect = e-158 Identities = 267/356 (75%), Positives = 290/356 (81%), Gaps = 33/356 (9%) Frame = +2 Query: 137 MAMTLKSLITPLLLGAFFILILQVAAEKPISEAKVESAILQDSIVKHVNENAKAGWKAAF 316 MAM SL+T LLL +L+LQV AE+PIS+AK ESAILQDSIVK VNEN KAGWKAA Sbjct: 1 MAMNHMSLVTFLLLIGASVLVLQVVAEQPISQAKAESAILQDSIVKQVNENEKAGWKAAL 60 Query: 317 NPQLSNFT-------------------------------LPKEFDARKAWPQCRTIGRIL 403 NP+ SNFT LP+EFDAR AWP C TIGRIL Sbjct: 61 NPRFSNFTVSQFKRLLGVKPTRKGDLKGIPILTHPKLLELPQEFDARVAWPNCSTIGRIL 120 Query: 404 DQGHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCGFLCGNGCDGGYPIAAWRYF 583 DQGHCGSCWAFGAVESLSDRFCIHY L+ISLS NDLLACCGFLCG+GCDGGYP+ AW+YF Sbjct: 121 DQGHCGSCWAFGAVESLSDRFCIHYGLNISLSANDLLACCGFLCGDGCDGGYPLQAWKYF 180 Query: 584 KRRGVVTEECDPYFDNTGCSHPGCEPAYPTPKCHRKCVKGNLLWRKSKHYGVNAYRVSHD 763 R+GVVT+ECDPYFDN GCSHPGCEPAYPTPKCHRKCVK NLLW 
KSKH+GVNAY +S D Sbjct: 181 VRKGVVTDECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYMISSD 240 Query: 764 PHSIMTEIYKNGPVEVAFTVYQDFAHYKSGVYKHVTGESMGGHAVKLIGWGTSEQGEDYW 943 PHSIMTE+YKNGPVEV+FTVY+DFAHYKSGVYKHVTG+ MGGHAVKLIGWGTSE GEDYW Sbjct: 241 PHSIMTELYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYW 300 Query: 944 LIVNSWNRGWGDDGYFKIRRGTNECGIEHSVVAGLPSARNLNVELD--DAFLDASM 1105 L+ N WNRGWGDDGYFKIRRGT+EC IE VVAGLPSARNLN+ELD DAFLDA+M Sbjct: 301 LLANQWNRGWGDDGYFKIRRGTDECEIEDEVVAGLPSARNLNMELDVSDAFLDAAM 356 >ref|XP_006362603.1| PREDICTED: cathepsin B-like [Solanum tuberosum] Length = 354 Score = 561 bits (1447), Expect = e-157 Identities = 265/354 (74%), Positives = 287/354 (81%), Gaps = 31/354 (8%) Frame = +2 Query: 137 MAMTLKSLITPLLLGAFFILILQVAAEKPISEAKVESAILQDSIVKHVNENAKAGWKAAF 316 M MT SL TPLLL AFFI ILQV AEKPISEAK ES ILQ+SI+K VNEN KAGWKAAF Sbjct: 1 MGMTKISLATPLLLCAFFIFILQVFAEKPISEAKGESVILQESIIKEVNENVKAGWKAAF 60 Query: 317 NPQLSNFT-------------------------------LPKEFDARKAWPQCRTIGRIL 403 NP+ SNFT LPKEFDAR AWPQC TIGRIL Sbjct: 61 NPRFSNFTVSQFKFLLGVKPPREGDLKSIPILTHPKLKNLPKEFDARTAWPQCSTIGRIL 120 Query: 404 DQGHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCGFLCGNGCDGGYPIAAWRYF 583 DQGHCGSCWAFGAVESLSDRFCIHY L+ISLSVND++ACCGF CGNGCDGG PI AW YF Sbjct: 121 DQGHCGSCWAFGAVESLSDRFCIHYGLNISLSVNDVIACCGFYCGNGCDGGSPIRAWHYF 180 Query: 584 KRRGVVTEECDPYFDNTGCSHPGCEPAYPTPKCHRKCVKGNLLWRKSKHYGVNAYRVSHD 763 +GVV+E+CDPYFDN GCSHPGCEP YPTP+C+RKCVK NLLW KSKH+GVNAY +S D Sbjct: 181 IHKGVVSEKCDPYFDNIGCSHPGCEPIYPTPQCNRKCVKENLLWSKSKHFGVNAYMISSD 240 Query: 764 PHSIMTEIYKNGPVEVAFTVYQDFAHYKSGVYKHVTGESMGGHAVKLIGWGTSEQGEDYW 943 P+SIMTE+YKNGPVEVA VY+DFAHYKSGVYKHVTGE +GGHAVKLIGWGTSE+GEDYW Sbjct: 241 PYSIMTEVYKNGPVEVALNVYEDFAHYKSGVYKHVTGEYIGGHAVKLIGWGTSEEGEDYW 300 Query: 944 LIVNSWNRGWGDDGYFKIRRGTNECGIEHSVVAGLPSARNLNVELDDAFLDASM 1105 L+VNSWN+ WGDDGYFKIRRGTNEC IE + VAGLPSARNLNVELDD FL+ SM Sbjct: 301 LLVNSWNKSWGDDGYFKIRRGTNECDIESNTVAGLPSARNLNVELDDDFLNVSM 354 >emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana 
rustica] Length = 356 Score = 557 bits (1436), Expect = e-156 Identities = 264/356 (74%), Positives = 286/356 (80%), Gaps = 33/356 (9%) Frame = +2 Query: 137 MAMTLKSLITPLLLGAFFILILQVAAEKPISEAKVESAILQDSIVKHVNENAKAGWKAAF 316 MA+ SL T LL I++LQV AE+PIS+AK ESAILQDSIVK VNEN KAGWKAA Sbjct: 1 MALNHMSLTTLFLLIGASIIVLQVVAEQPISQAKAESAILQDSIVKQVNENEKAGWKAAL 60 Query: 317 NPQLSNFT-------------------------------LPKEFDARKAWPQCRTIGRIL 403 NP+ SNFT LP+EFDAR AW C TIGRIL Sbjct: 61 NPRFSNFTVSQFKRLLGVKPTRKGDLKGIPILTHPKLLELPQEFDARVAWSNCSTIGRIL 120 Query: 404 DQGHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCGFLCGNGCDGGYPIAAWRYF 583 DQGHCGSCWAFGAVESLSDRFCIHY L+ISLS NDL ACCGFLCG+GCDGGYP+ AW+YF Sbjct: 121 DQGHCGSCWAFGAVESLSDRFCIHYGLNISLSANDLYACCGFLCGDGCDGGYPLQAWKYF 180 Query: 584 KRRGVVTEECDPYFDNTGCSHPGCEPAYPTPKCHRKCVKGNLLWRKSKHYGVNAYRVSHD 763 R+GVVT+ECDPYFDN GCSHPGCEPAYPTPKCHRKCVK NLLW +SKH+GVNAY +S D Sbjct: 181 VRKGVVTDECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSRSKHFGVNAYMISSD 240 Query: 764 PHSIMTEIYKNGPVEVAFTVYQDFAHYKSGVYKHVTGESMGGHAVKLIGWGTSEQGEDYW 943 PHSIMTE+YKNGPVEV+FTVY+DFAHYKSGVYKHVTG+ MGGHAVKLIGWGTSE GEDYW Sbjct: 241 PHSIMTEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDIMGGHAVKLIGWGTSEDGEDYW 300 Query: 944 LIVNSWNRGWGDDGYFKIRRGTNECGIEHSVVAGLPSARNLNVELD--DAFLDASM 1105 L+ N WNRGWGDDGYFKIRRGTNEC IE VVAGLPSARNLNVELD DAFLDA+M Sbjct: 301 LLANQWNRGWGDDGYFKIRRGTNECEIEDEVVAGLPSARNLNVELDVSDAFLDAAM 356 >ref|XP_004233222.1| PREDICTED: cathepsin B-like [Solanum lycopersicum] Length = 354 Score = 556 bits (1432), Expect = e-156 Identities = 262/354 (74%), Positives = 287/354 (81%), Gaps = 31/354 (8%) Frame = +2 Query: 137 MAMTLKSLITPLLLGAFFILILQVAAEKPISEAKVESAILQDSIVKHVNENAKAGWKAAF 316 M M L TPLLL AFFI ILQV AEKPISEAK ES IL++SI+K VNEN KAGWKAAF Sbjct: 1 MGMNKMFLPTPLLLCAFFIFILQVVAEKPISEAKGESVILRESIIKEVNENGKAGWKAAF 60 Query: 317 NPQLSNFT-------------------------------LPKEFDARKAWPQCRTIGRIL 403 NP+ SNFT LPKEFDAR AW +C TIGRIL Sbjct: 61 NPRFSNFTVSQFKRLLGVKPPREGDLKSIPILTHPKLKNLPKEFDARTAWSECSTIGRIL 120 Query: 404 
DQGHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCGFLCGNGCDGGYPIAAWRYF 583 DQGHCGSCWAFGAVESLSDRFCIHY L+ISLSVND++ACCGF CGNGCDGG PIAAW YF Sbjct: 121 DQGHCGSCWAFGAVESLSDRFCIHYGLNISLSVNDVIACCGFHCGNGCDGGSPIAAWHYF 180 Query: 584 KRRGVVTEECDPYFDNTGCSHPGCEPAYPTPKCHRKCVKGNLLWRKSKHYGVNAYRVSHD 763 R+GVV+E+CDPYFDN GCSHPGCEP YPTP+C+RKCV NLLW KSKH+GVNAY +S + Sbjct: 181 IRKGVVSEKCDPYFDNIGCSHPGCEPTYPTPQCNRKCVNENLLWSKSKHFGVNAYMISSN 240 Query: 764 PHSIMTEIYKNGPVEVAFTVYQDFAHYKSGVYKHVTGESMGGHAVKLIGWGTSEQGEDYW 943 P+SIMTE+YKNGPVEVA VY+DFAHYKSGVYKHVTGE +GGHAVKLIGWGTSE+GEDYW Sbjct: 241 PYSIMTEVYKNGPVEVALNVYEDFAHYKSGVYKHVTGEYIGGHAVKLIGWGTSEEGEDYW 300 Query: 944 LIVNSWNRGWGDDGYFKIRRGTNECGIEHSVVAGLPSARNLNVELDDAFLDASM 1105 L+VNSWN+GWG+DGYFKIRRGTNEC IE +VVAGLPSARNLNVELDD FLD SM Sbjct: 301 LLVNSWNKGWGNDGYFKIRRGTNECDIESNVVAGLPSARNLNVELDDDFLDTSM 354 >ref|XP_006362602.1| PREDICTED: cathepsin B-like [Solanum tuberosum] Length = 352 Score = 551 bits (1421), Expect = e-154 Identities = 257/352 (73%), Positives = 284/352 (80%), Gaps = 33/352 (9%) Frame = +2 Query: 149 LKSLITPLLLGAFFILILQVAAEKPISEAKVESAILQDSIVKHVNENAKAGWKAAFNPQL 328 +K + T LLL A L+LQV AE PIS+AK ESAILQDSIVK VNEN KAGW+AA NPQ Sbjct: 1 MKHITTFLLLVAVSTLVLQVVAENPISQAKAESAILQDSIVKQVNENEKAGWRAALNPQF 60 Query: 329 SNFT-------------------------------LPKEFDARKAWPQCRTIGRILDQGH 415 SNFT LP+EFDAR AWPQC TIGRILDQGH Sbjct: 61 SNFTVSQFKRLLGVKPTRKGDLKGIPILTHPELLKLPQEFDARVAWPQCSTIGRILDQGH 120 Query: 416 CGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCGFLCGNGCDGGYPIAAWRYFKRRG 595 CGSCWAFGA ESLSDRFCIHY L+ISLS ND++ACCG+LCG+GCDGGYP+ AW+YF R+G Sbjct: 121 CGSCWAFGAAESLSDRFCIHYGLNISLSANDIVACCGYLCGDGCDGGYPLEAWKYFVRKG 180 Query: 596 VVTEECDPYFDNTGCSHPGCEPAYPTPKCHRKCVKGNLLWRKSKHYGVNAYRVSHDPHSI 775 VVTEECDPYFDN GCSHPGCEP YPTP+C RKCVK NLLW KSKH+GVNAY ++ DP+SI Sbjct: 181 VVTEECDPYFDNKGCSHPGCEPGYPTPQCKRKCVKENLLWSKSKHFGVNAYLINSDPYSI 240 Query: 776 MTEIYKNGPVEVAFTVYQDFAHYKSGVYKHVTGESMGGHAVKLIGWGTSEQGEDYWLIVN 955 MTE+YKNGPVEV+FTVY+DFAHYKSGVYKH+ GE MGGHAVKLIGWGTSE GE+YWL+ N Sbjct: 241 
MTEVYKNGPVEVSFTVYEDFAHYKSGVYKHINGEEMGGHAVKLIGWGTSEDGENYWLLAN 300 Query: 956 SWNRGWGDDGYFKIRRGTNECGIEHSVVAGLPSARNLNVELD--DAFLDASM 1105 WNRGWGDDGYFKIRRGTNECGIE VVAG+PSA+NLNVELD DAFLDASM Sbjct: 301 QWNRGWGDDGYFKIRRGTNECGIEEEVVAGMPSAKNLNVELDVSDAFLDASM 352 >ref|XP_004233221.1| PREDICTED: cathepsin B-like [Solanum lycopersicum] Length = 352 Score = 550 bits (1417), Expect = e-154 Identities = 255/352 (72%), Positives = 283/352 (80%), Gaps = 33/352 (9%) Frame = +2 Query: 149 LKSLITPLLLGAFFILILQVAAEKPISEAKVESAILQDSIVKHVNENAKAGWKAAFNPQL 328 +K + T LLL + L+LQV AE PIS+AK ESAILQDSIVK VNEN KAGW+AA NPQ Sbjct: 1 MKHIATFLLLVSVSTLVLQVVAENPISQAKAESAILQDSIVKQVNENEKAGWRAALNPQF 60 Query: 329 SNFT-------------------------------LPKEFDARKAWPQCRTIGRILDQGH 415 SNFT LP+EFDAR AWPQC TIGRILDQGH Sbjct: 61 SNFTVSQFKRLLGVKPTRKGDLKGIPILTHPKLLKLPQEFDARVAWPQCSTIGRILDQGH 120 Query: 416 CGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCGFLCGNGCDGGYPIAAWRYFKRRG 595 CGSCWAFGA ESLSDRFCIHY L+ISLS ND++ACCG+LCG+GCDGGYP+ AW+YF R+G Sbjct: 121 CGSCWAFGAAESLSDRFCIHYGLNISLSANDIIACCGYLCGDGCDGGYPLEAWKYFVRKG 180 Query: 596 VVTEECDPYFDNTGCSHPGCEPAYPTPKCHRKCVKGNLLWRKSKHYGVNAYRVSHDPHSI 775 VVTEECDPYFDN GCSHPGCEP YPTP+C RKCVK NLLW KSKH+G+NAY ++ DP+SI Sbjct: 181 VVTEECDPYFDNKGCSHPGCEPGYPTPQCKRKCVKENLLWSKSKHFGINAYLINSDPYSI 240 Query: 776 MTEIYKNGPVEVAFTVYQDFAHYKSGVYKHVTGESMGGHAVKLIGWGTSEQGEDYWLIVN 955 MTE+YKNGPVEV+FTVY+DFAHYKSGVYKH+ GE MGGHAVKLIGWGTSE GEDYWL+ N Sbjct: 241 MTEVYKNGPVEVSFTVYEDFAHYKSGVYKHINGEEMGGHAVKLIGWGTSEDGEDYWLLAN 300 Query: 956 SWNRGWGDDGYFKIRRGTNECGIEHSVVAGLPSARNLNVELD--DAFLDASM 1105 WNRGWGDDGYFKIRRGTNECGIE VVAG+PSA+NLNVELD DA LDASM Sbjct: 301 QWNRGWGDDGYFKIRRGTNECGIEEEVVAGMPSAKNLNVELDVSDALLDASM 352 >gb|EOX95504.1| Cysteine proteinases superfamily protein [Theobroma cacao] Length = 359 Score = 515 bits (1327), Expect = e-143 Identities = 241/348 (69%), Positives = 272/348 (78%), Gaps = 36/348 (10%) Frame = +2 Query: 149 LKSLITPLLLGAFFILIL-----QVAAEKPISEAKVESAILQDSIVKHVNENAKAGWKAA 313 +K + PLL A F+L+L +V A + +SE K+ S 
ILQDSIVK VNEN KAGWKAA Sbjct: 1 MKDMANPLLFLASFLLLLSTVHPKVIAVEQLSEVKLNSQILQDSIVKQVNENPKAGWKAA 60 Query: 314 FNPQLSNFTL-------------------------------PKEFDARKAWPQCRTIGRI 400 NP+LSN+T+ P +FDAR AWPQC TIGRI Sbjct: 61 LNPRLSNYTVGEFKHLLGVKPTPKKELLGIPVITHGKSLKVPTKFDARTAWPQCSTIGRI 120 Query: 401 LDQGHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCGFLCGNGCDGGYPIAAWRY 580 LDQGHCGSCWAFGAVESLSDRFCIH++++ISLSVNDLLACCGFLCG+GCDGGYPI+AWRY Sbjct: 121 LDQGHCGSCWAFGAVESLSDRFCIHFSMNISLSVNDLLACCGFLCGSGCDGGYPISAWRY 180 Query: 581 FKRRGVVTEECDPYFDNTGCSHPGCEPAYPTPKCHRKCVKGNLLWRKSKHYGVNAYRVSH 760 F RRGVVTEECDPYFD+TGCSHPGCEPAYPTP+C +KCVKGN LWR+SKHY V AYR++ Sbjct: 181 FVRRGVVTEECDPYFDDTGCSHPGCEPAYPTPRCVKKCVKGNQLWRESKHYSVGAYRINS 240 Query: 761 DPHSIMTEIYKNGPVEVAFTVYQDFAHYKSGVYKHVTGESMGGHAVKLIGWGTSEQGEDY 940 DP IM E+Y NGPVEV+FTVY+DFAHYKSGVYKHVTG MGGHAVKLIGWGTS+ GEDY Sbjct: 241 DPADIMAEVYTNGPVEVSFTVYEDFAHYKSGVYKHVTGGVMGGHAVKLIGWGTSDDGEDY 300 Query: 941 WLIVNSWNRGWGDDGYFKIRRGTNECGIEHSVVAGLPSARNLNVELDD 1084 WL+ N WNRGWGDDGYFKI RGTNECGIE VVAGLPS +NL E+ D Sbjct: 301 WLLANQWNRGWGDDGYFKISRGTNECGIEDDVVAGLPSTKNLVREVGD 348 >ref|XP_002301457.2| putative cathepsin B-like protease family protein [Populus trichocarpa] gi|550345314|gb|EEE80730.2| putative cathepsin B-like protease family protein [Populus trichocarpa] Length = 357 Score = 503 bits (1295), Expect = e-140 Identities = 240/349 (68%), Positives = 262/349 (75%), Gaps = 33/349 (9%) Frame = +2 Query: 155 SLITPLLLGAFFILILQVAAEKPISEAKVESAILQDSIVKHVNENAKAGWKAAFNPQLSN 334 S + LL+GA F QV A +P+S+ K+ S ILQDSI+K VN N KAGWKA N SN Sbjct: 8 STLLLLLIGAIFTFQSQVIAVEPVSDLKLNSRILQDSILKKVNGNPKAGWKATMNHHFSN 67 Query: 335 FT-------------------------------LPKEFDARKAWPQCRTIGRILDQGHCG 421 +T LP+EFDAR AWPQC TIG+ILDQGHCG Sbjct: 68 YTVAQFKYLLGVKPTPKEELRGIPVISHPKSLRLPEEFDARTAWPQCSTIGKILDQGHCG 127 Query: 422 SCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCGFLCGNGCDGGYPIAAWRYFKRRGVV 601 SCWAFGAVESLSDRFCIHY ++ISLSVNDLLACCGFLCG+GC+GGYPI+AWRYF GVV Sbjct: 128 
SCWAFGAVESLSDRFCIHYGMNISLSVNDLLACCGFLCGSGCNGGYPISAWRYFVHHGVV 187 Query: 602 TEECDPYFDNTGCSHPGCEPAYPTPKCHRKCVKGNLLWRKSKHYGVNAYRVSHDPHSIMT 781 TEECDPYFD+ GCSHPGCEP YPTPKC RKCV N LW+KSKHYGV YR+ DP SIM Sbjct: 188 TEECDPYFDDIGCSHPGCEPGYPTPKCARKCVNKNQLWKKSKHYGVKPYRIDSDPDSIMA 247 Query: 782 EIYKNGPVEVAFTVYQDFAHYKSGVYKHVTGESMGGHAVKLIGWGTSEQGEDYWLIVNSW 961 EIYKNGPVEVAFTVY+DFAHYKSGVYKH+TG MGGHAVKLIGWGTSE GE YWL+ N W Sbjct: 248 EIYKNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTSEDGEAYWLLANQW 307 Query: 962 NRGWGDDGYFKIRRGTNECGIEHSVVAGLPSARNL--NVELDDAFLDAS 1102 NRGWGDDG+FKIRRGTNECGIE VVAGLPS RNL V DA DAS Sbjct: 308 NRGWGDDGFFKIRRGTNECGIEGDVVAGLPSTRNLVREVVSIDAREDAS 356 >ref|XP_006444663.1| hypothetical protein CICLE_v10020859mg [Citrus clementina] gi|568876746|ref|XP_006491434.1| PREDICTED: cathepsin B-like isoform X2 [Citrus sinensis] gi|557546925|gb|ESR57903.1| hypothetical protein CICLE_v10020859mg [Citrus clementina] Length = 354 Score = 500 bits (1288), Expect = e-139 Identities = 233/344 (67%), Positives = 266/344 (77%), Gaps = 33/344 (9%) Frame = +2 Query: 170 LLLGAFFILILQVAAEKPISEAKVESAILQDSIVKHVNENAKAGWKAAFNPQLSNFT--- 340 L+LG ++ Q AE +S+ K++S ILQDSI+K VNEN KAGWKAA NPQ SN+T Sbjct: 13 LILG---VISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 69 Query: 341 ----------------------------LPKEFDARKAWPQCRTIGRILDQGHCGSCWAF 436 LPK FDAR AWPQC TI RILDQGHCGSCWAF Sbjct: 70 FKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 129 Query: 437 GAVESLSDRFCIHYNLSISLSVNDLLACCGFLCGNGCDGGYPIAAWRYFKRRGVVTEECD 616 GAVE+LSDRFCIH+ +++SLSVNDLLACCGFLCG+GCDGGYPI+AWRYF GVVTEECD Sbjct: 130 GAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 189 Query: 617 PYFDNTGCSHPGCEPAYPTPKCHRKCVKGNLLWRKSKHYGVNAYRVSHDPHSIMTEIYKN 796 PYFD+TGCSHPGCEPAYPTPKC RKCVK N LWR SKHY ++AYR++ DP IM EIYKN Sbjct: 190 PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 249 Query: 797 GPVEVAFTVYQDFAHYKSGVYKHVTGESMGGHAVKLIGWGTSEQGEDYWLIVNSWNRGWG 976 GPVEV+FTVY+DFAHYKSGVYKH+TG+ MGGHAVKLIGWGTS+ GEDYW++ N WNR 
WG Sbjct: 250 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 309 Query: 977 DDGYFKIRRGTNECGIEHSVVAGLPSARNLNVELD--DAFLDAS 1102 DGYFKI+RG+NECGIE VVAGLPS++NL E+ D F DAS Sbjct: 310 ADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 353 >gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas] Length = 352 Score = 500 bits (1287), Expect = e-139 Identities = 236/353 (66%), Positives = 268/353 (75%), Gaps = 34/353 (9%) Frame = +2 Query: 149 LKSLITPLLLGAFFILILQVAAEKPISEAKVESAILQDSIVKHVNENAKAGWKAAFNPQL 328 ++++ T LL+GA +LILQV A KP++ +V+ ILQD IVK VNEN +AGWKA NP+ Sbjct: 1 METIKTLLLIGAISLLILQVVAVKPVTLTEVDPKILQDEIVKTVNENPEAGWKADMNPRF 60 Query: 329 SNFT-------------------------------LPKEFDARKAWPQCRTIGRILDQGH 415 S+FT LPK FDAR AWPQC +I ILDQGH Sbjct: 61 SDFTVSQFKRLLGVKKAPKSLLKRTPVVTHSKEIELPKTFDARTAWPQCLSIADILDQGH 120 Query: 416 CGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCGFLCGNGCDGGYPIAAWRYFKRRG 595 CGSCWAFGAVESL+DRFCIHY +++LSVNDLLACCGFLCG GCDGGYPIAAW+YFKR G Sbjct: 121 CGSCWAFGAVESLTDRFCIHYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQYFKRTG 180 Query: 596 VVTEECDPYFDNTGCSHPGCEPAYPTPKCHRKCVKGNLLWRKSKHYGVNAYRVSHDPHSI 775 VVT ECDPYFD TGCSHPGCEPAYPTP C +KCVK NLLW +SKH+ VNAYRV+ D HSI Sbjct: 181 VVTSECDPYFDQTGCSHPGCEPAYPTPACEKKCVKKNLLWSESKHFSVNAYRVNSDQHSI 240 Query: 776 MTEIYKNGPVEVAFTVYQDFAHYKSGVYKHVTGESMGGHAVKLIGWGTSEQGEDYWLIVN 955 MTE+Y NGP EV+FTVY+DFAHYKSGVYKHVTG MGGHAVKLIGWGTSE GEDYWL+ N Sbjct: 241 MTEVYTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLLAN 300 Query: 956 SWNRGWGDDGYFKIRRGTNECGIEHSVVAGLPSARNLNVE---LDDAFLDASM 1105 WNR WGDDGYFKI RGTNECGIE V AG+PS +NL++E DD L AS+ Sbjct: 301 QWNRSWGDDGYFKIIRGTNECGIE-DVTAGMPSTKNLDIESGVRDDDSLVASV 352 >ref|XP_006491433.1| PREDICTED: cathepsin B-like isoform X1 [Citrus sinensis] Length = 362 Score = 498 bits (1282), Expect = e-138 Identities = 229/330 (69%), Positives = 259/330 (78%), Gaps = 33/330 (10%) Frame = +2 Query: 212 AEKPISEAKVESAILQDSIVKHVNENAKAGWKAAFNPQLSNFT----------------- 340 AE +S+ K++S ILQDSI+K VNEN KAGWKAA NPQ 
SN+T Sbjct: 32 AEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGL 91 Query: 341 --------------LPKEFDARKAWPQCRTIGRILDQGHCGSCWAFGAVESLSDRFCIHY 478 LPK FDAR AWPQC TI RILDQGHCGSCWAFGAVE+LSDRFCIH+ Sbjct: 92 LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF 151 Query: 479 NLSISLSVNDLLACCGFLCGNGCDGGYPIAAWRYFKRRGVVTEECDPYFDNTGCSHPGCE 658 +++SLSVNDLLACCGFLCG+GCDGGYPI+AWRYF GVVTEECDPYFD+TGCSHPGCE Sbjct: 152 GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 211 Query: 659 PAYPTPKCHRKCVKGNLLWRKSKHYGVNAYRVSHDPHSIMTEIYKNGPVEVAFTVYQDFA 838 PAYPTPKC RKCVK N LWR SKHY ++AYR++ DP IM EIYKNGPVEV+FTVY+DFA Sbjct: 212 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 271 Query: 839 HYKSGVYKHVTGESMGGHAVKLIGWGTSEQGEDYWLIVNSWNRGWGDDGYFKIRRGTNEC 1018 HYKSGVYKH+TG+ MGGHAVKLIGWGTS+ GEDYW++ N WNR WG DGYFKI+RG+NEC Sbjct: 272 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 331 Query: 1019 GIEHSVVAGLPSARNLNVELD--DAFLDAS 1102 GIE VVAGLPS++NL E+ D F DAS Sbjct: 332 GIEEDVVAGLPSSKNLVKEITSADMFEDAS 361 >ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max] Length = 356 Score = 497 bits (1280), Expect = e-138 Identities = 232/354 (65%), Positives = 264/354 (74%), Gaps = 31/354 (8%) Frame = +2 Query: 137 MAMTLKSLITPLLLGAFFILILQVAAEKPISEAKVESAILQDSIVKHVNENAKAGWKAAF 316 MA TL L T L+ + L + A +P++ K+ S ILQ+SI K +NEN +AGW+AA Sbjct: 1 MASTLLPLATFFLVLSASYLQIAGAKAQPLTSLKLNSPILQESIAKEINENPEAGWEAAI 60 Query: 317 NPQLSNFT-------------------------------LPKEFDARKAWPQCRTIGRIL 403 NP SN+T LPK FDAR AW QC TIGRIL Sbjct: 61 NPHFSNYTVEQFKRLLGVKPTPKKELRSTPAISHPKSLKLPKNFDARTAWSQCSTIGRIL 120 Query: 404 DQGHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCGFLCGNGCDGGYPIAAWRYF 583 DQGHCGSCWAFGAVESLSDRFCIH++++ISLSVNDLLACCGFLCG+GCDGGYP+ AW+Y Sbjct: 121 DQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGGYPLYAWQYL 180 Query: 584 KRRGVVTEECDPYFDNTGCSHPGCEPAYPTPKCHRKCVKGNLLWRKSKHYGVNAYRVSHD 763 GVVTEECDPYFD GCSHPGCEPAY TPKC +KCV GN +W+KSKHY VNAYRVS D Sbjct: 181 
AHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVSGNQVWKKSKHYSVNAYRVSSD 240 Query: 764 PHSIMTEIYKNGPVEVAFTVYQDFAHYKSGVYKHVTGESMGGHAVKLIGWGTSEQGEDYW 943 PH IMTE+YKNGPVEVAFTVY+DFAHYKSGVYKH+TG +GGHAVKLIGWGT+E GEDYW Sbjct: 241 PHDIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGYELGGHAVKLIGWGTTEDGEDYW 300 Query: 944 LIVNSWNRGWGDDGYFKIRRGTNECGIEHSVVAGLPSARNLNVELDDAFLDASM 1105 L+ N WNR WGDDGYFKIRRGTNECGIE V AGLPS +NL E+ D DA++ Sbjct: 301 LLANQWNREWGDDGYFKIRRGTNECGIEEDVTAGLPSTKNLVREVTDMDADAAV 354 >ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana] gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana] gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana] gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana] gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana] Length = 362 Score = 497 bits (1279), Expect = e-138 Identities = 232/329 (70%), Positives = 258/329 (78%), Gaps = 31/329 (9%) Frame = +2 Query: 173 LLGAFFILILQVAAEKPISEAKVESAILQDSIVKHVNENAKAGWKAAFNPQLSNFT---- 340 LL + F L+ +AAE +S+ K+ S ILQ+ IVK VNEN AGWKA+FN + +N T Sbjct: 20 LLISSFNLLQGIAAEN-LSKQKLTSWILQNEIVKEVNENPNAGWKASFNDRFANATVAEF 78 Query: 341 ---------------------------LPKEFDARKAWPQCRTIGRILDQGHCGSCWAFG 439 LPKEFDAR AW QC +IGRILDQGHCGSCWAFG Sbjct: 79 KRLLGVKPTPKTEFLGVPIVSHDISLKLPKEFDARTAWSQCTSIGRILDQGHCGSCWAFG 138 Query: 440 AVESLSDRFCIHYNLSISLSVNDLLACCGFLCGNGCDGGYPIAAWRYFKRRGVVTEECDP 619 AVESLSDRFCI YN+++SLSVNDLLACCGFLCG GC+GGYPIAAWRYFK GVVTEECDP Sbjct: 139 AVESLSDRFCIKYNMNVSLSVNDLLACCGFLCGQGCNGGYPIAAWRYFKHHGVVTEECDP 198 Query: 620 YFDNTGCSHPGCEPAYPTPKCHRKCVKGNLLWRKSKHYGVNAYRVSHDPHSIMTEIYKNG 799 YFDNTGCSHPGCEPAYPTPKC RKCV GN LWR+SKHYGV+AY+V P IM E+YKNG Sbjct: 199 YFDNTGCSHPGCEPAYPTPKCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIMAEVYKNG 258 Query: 800 PVEVAFTVYQDFAHYKSGVYKHVTGESMGGHAVKLIGWGTSEQGEDYWLIVNSWNRGWGD 979 PVEVAFTVY+DFAHYKSGVYKH+TG ++GGHAVKLIGWGTS+ GEDYWL+ N WNR WGD Sbjct: 259 
PVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGD 318 Query: 980 DGYFKIRRGTNECGIEHSVVAGLPSARNL 1066 DGYFKIRRGTNECGIEH VVAGLPS RN+ Sbjct: 319 DGYFKIRRGTNECGIEHGVVAGLPSDRNV 347 >gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas] Length = 352 Score = 496 bits (1277), Expect = e-138 Identities = 235/353 (66%), Positives = 266/353 (75%), Gaps = 34/353 (9%) Frame = +2 Query: 149 LKSLITPLLLGAFFILILQVAAEKPISEAKVESAILQDSIVKHVNENAKAGWKAAFNPQL 328 ++++ T LL+GA +LILQV A KP++ +V+ ILQD IVK VNEN +AGWKA NP+ Sbjct: 1 METIKTLLLIGAISLLILQVVAVKPVTLTEVDPKILQDEIVKTVNENPEAGWKADMNPRF 60 Query: 329 SNFT-------------------------------LPKEFDARKAWPQCRTIGRILDQGH 415 S+FT LPK FDAR AWPQC +I ILDQGH Sbjct: 61 SDFTVSQFKRLLGVKKAPKSLLKRTPVVTHSKEIELPKTFDARTAWPQCLSIADILDQGH 120 Query: 416 CGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCGFLCGNGCDGGYPIAAWRYFKRRG 595 CGSCWAFGAVESL+DRFCIHY +++LSVNDLLACCGFLCG GCDGGYPIAAW+YFKR G Sbjct: 121 CGSCWAFGAVESLTDRFCIHYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQYFKRTG 180 Query: 596 VVTEECDPYFDNTGCSHPGCEPAYPTPKCHRKCVKGNLLWRKSKHYGVNAYRVSHDPHSI 775 VVT ECDPYFD TGCSHPGCEPAYPTP C +KCVK NLLW +SKH+ VNAYRV+ D HSI Sbjct: 181 VVTSECDPYFDQTGCSHPGCEPAYPTPACEKKCVKKNLLWSESKHFSVNAYRVNSDQHSI 240 Query: 776 MTEIYKNGPVEVAFTVYQDFAHYKSGVYKHVTGESMGGHAVKLIGWGTSEQGEDYWLIVN 955 MTE+Y NGP EV+FTVY+DFAHYKSGVYKHVTG MGGHAVKLIGWGTSE GEDYWL+ N Sbjct: 241 MTEVYTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLLAN 300 Query: 956 SWNRGWGDDGYFKIRRGTNECGIEHSVVAGLPSARNLNVE---LDDAFLDASM 1105 WNR WG DGYFKI RGTNECGIE V AG PS +NL++E DD L AS+ Sbjct: 301 QWNRSWGGDGYFKIIRGTNECGIE-DVTAGTPSTKNLDIESGVRDDDSLVASV 352 >ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp. lyrata] gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp. 
lyrata] Length = 360 Score = 496 bits (1277), Expect = e-138 Identities = 229/320 (71%), Positives = 252/320 (78%), Gaps = 31/320 (9%) Frame = +2 Query: 200 LQVAAEKPISEAKVESAILQDSIVKHVNENAKAGWKAAFNPQLSNFT------------- 340 LQ A + +S+ K+ S ILQ+ IVK VNEN AGWKAAFN + +N T Sbjct: 26 LQGIAAENLSKQKLTSWILQNEIVKEVNENPNAGWKAAFNDRFANATVAEFKRLLGVKPT 85 Query: 341 ------------------LPKEFDARKAWPQCRTIGRILDQGHCGSCWAFGAVESLSDRF 466 LPKEFDAR AW QC ++GRILDQGHCGSCWAFGAVESLSDRF Sbjct: 86 PKTEFLGVPIVSHDISLKLPKEFDARTAWSQCTSVGRILDQGHCGSCWAFGAVESLSDRF 145 Query: 467 CIHYNLSISLSVNDLLACCGFLCGNGCDGGYPIAAWRYFKRRGVVTEECDPYFDNTGCSH 646 CI YN++ISLSVNDLLACCGFLCG GC+GGYPIAAWRYFK GVVTEECDPYFDNTGCSH Sbjct: 146 CIKYNMNISLSVNDLLACCGFLCGQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGCSH 205 Query: 647 PGCEPAYPTPKCHRKCVKGNLLWRKSKHYGVNAYRVSHDPHSIMTEIYKNGPVEVAFTVY 826 PGCEPAYPTPKC RKCV GN LWR+SKHYGV+AY+V P IM E+YKNGPVEVAFTVY Sbjct: 206 PGCEPAYPTPKCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVY 265 Query: 827 QDFAHYKSGVYKHVTGESMGGHAVKLIGWGTSEQGEDYWLIVNSWNRGWGDDGYFKIRRG 1006 +DFAHYKSGVYKH+TG ++GGHAVKLIGWGTS+ GEDYWL+ N WNR WGDDGYFKIRRG Sbjct: 266 EDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRG 325 Query: 1007 TNECGIEHSVVAGLPSARNL 1066 TNECGIEH VVAGLPS RN+ Sbjct: 326 TNECGIEHGVVAGLPSDRNV 345 >gb|EXB94879.1| Cathepsin B [Morus notabilis] Length = 420 Score = 496 bits (1276), Expect = e-137 Identities = 231/344 (67%), Positives = 259/344 (75%), Gaps = 34/344 (9%) Frame = +2 Query: 137 MAMTLKSLIT---PLLLGAFFILILQVAAEKPISEAKVESAILQDSIVKHVNENAKAGWK 307 MA T K+++ L+L + +V A +P+S K+ S ILQ+SIVK VNEN +AGW+ Sbjct: 1 MATTQKNILAISLSLILVISICHLQRVIALQPLSNLKLNSPILQESIVKRVNENPEAGWR 60 Query: 308 AAFNPQLSNFT-------------------------------LPKEFDARKAWPQCRTIG 394 A NP+ SNFT LP +FDAR AWPQC TI Sbjct: 61 AEMNPRFSNFTAGEFRRLLGVKETPKHELESTPVITHPKSLKLPDKFDARTAWPQCSTIK 120 Query: 395 RILDQGHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCGFLCGNGCDGGYPIAAW 574 RILDQGHCGSCWAFGAVESLSDRFCIH+N +ISLSVND+LACCGFLCG GCDGG P+ AW Sbjct: 121 
RILDQGHCGSCWAFGAVESLSDRFCIHFNTNISLSVNDVLACCGFLCGAGCDGGTPLFAW 180 Query: 575 RYFKRRGVVTEECDPYFDNTGCSHPGCEPAYPTPKCHRKCVKGNLLWRKSKHYGVNAYRV 754 RY GVVTEECDPYFDNTGCSHPGCEPAYPTP+CHRKCV N LWR+SKHY VNAY++ Sbjct: 181 RYLHHHGVVTEECDPYFDNTGCSHPGCEPAYPTPRCHRKCVNKNNLWRQSKHYSVNAYKI 240 Query: 755 SHDPHSIMTEIYKNGPVEVAFTVYQDFAHYKSGVYKHVTGESMGGHAVKLIGWGTSEQGE 934 S DPHSIM E+YKNGPVEV FTVY+DFAHYKSGVYKH+TG MGGHAVKLIGWGTS+ GE Sbjct: 241 SSDPHSIMAEVYKNGPVEVDFTVYEDFAHYKSGVYKHITGSVMGGHAVKLIGWGTSDTGE 300 Query: 935 DYWLIVNSWNRGWGDDGYFKIRRGTNECGIEHSVVAGLPSARNL 1066 DYWL+ N WNR WGDDGYFKIRRGTNECGIE VAG+PS RNL Sbjct: 301 DYWLVANQWNRSWGDDGYFKIRRGTNECGIEKDAVAGMPSKRNL 344 >ref|XP_006305170.1| hypothetical protein CARUB_v10009537mg [Capsella rubella] gi|482573881|gb|EOA38068.1| hypothetical protein CARUB_v10009537mg [Capsella rubella] Length = 360 Score = 493 bits (1268), Expect = e-137 Identities = 231/331 (69%), Positives = 254/331 (76%), Gaps = 32/331 (9%) Frame = +2 Query: 170 LLLGAFFILI-LQVAAEKPISEAKVESAILQDSIVKHVNENAKAGWKAAFNPQLSNFT-- 340 LLLG LQ A + +S+ K+ S ILQ+ IV VN N KAGWKAA N + +N T Sbjct: 15 LLLGLIISTFSLQGIAAENLSKQKLSSRILQNEIVNEVNANPKAGWKAALNDRFANATVA 74 Query: 341 -----------------------------LPKEFDARKAWPQCRTIGRILDQGHCGSCWA 433 LPKEFDAR AW QC +IGRILDQGHCGSCWA Sbjct: 75 EFKRLLGVKPTPKTEFLGVPIVSHGISLKLPKEFDARTAWSQCTSIGRILDQGHCGSCWA 134 Query: 434 FGAVESLSDRFCIHYNLSISLSVNDLLACCGFLCGNGCDGGYPIAAWRYFKRRGVVTEEC 613 FGAVESLSDRFCI YN++ISLSVNDLLACCGFLCG GC+GGYPI+AWRYFK GVVTEEC Sbjct: 135 FGAVESLSDRFCIKYNMNISLSVNDLLACCGFLCGQGCNGGYPISAWRYFKHHGVVTEEC 194 Query: 614 DPYFDNTGCSHPGCEPAYPTPKCHRKCVKGNLLWRKSKHYGVNAYRVSHDPHSIMTEIYK 793 DPYFDNTGCSHPGCEPAYPTPKC RKCV GN LWR+SKHYGV+AY+V P IM E+YK Sbjct: 195 DPYFDNTGCSHPGCEPAYPTPKCVRKCVSGNQLWRESKHYGVSAYKVRSHPEDIMAEVYK 254 Query: 794 NGPVEVAFTVYQDFAHYKSGVYKHVTGESMGGHAVKLIGWGTSEQGEDYWLIVNSWNRGW 973 NGPVEVAFTVY+DFAHYKSGVYKH+TG ++GGHAVKLIGWGTS+ GEDYWL+ N WNR W Sbjct: 255 NGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSW 314 Query: 974 
GDDGYFKIRRGTNECGIEHSVVAGLPSARNL 1066 GDDGYFKIRRGTNECGIEH VVAGLPS RN+ Sbjct: 315 GDDGYFKIRRGTNECGIEHGVVAGLPSDRNV 345 >ref|XP_002515139.1| cathepsin B, putative [Ricinus communis] gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis] Length = 376 Score = 492 bits (1266), Expect = e-136 Identities = 235/368 (63%), Positives = 264/368 (71%), Gaps = 48/368 (13%) Frame = +2 Query: 140 AMTLKSLITPLLLGAFFILILQVAAEKPISEAKVESAILQDSIVKHVNENAKAGWKAAFN 319 A L S L L A +V + + S+ K+ S ILQ+SI+K VNEN AGW+AA N Sbjct: 3 ASILSSFALLLFLVALSSFHSRVISTELDSKLKLNSRILQESIIKKVNENPDAGWEAAMN 62 Query: 320 PQLSNFT-------------------------------LPKEFDARKAWPQCRTIGRILD 406 PQLSNFT LPKEFDAR AWP C TIG+IL Sbjct: 63 PQLSNFTVGQFKYLLGAKPTPKKELMGVPMISHPKTLKLPKEFDARTAWPHCSTIGKILG 122 Query: 407 Q-----------------GHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCGFLC 535 Q GHCGSCWAFGAVESLSDRFCIH+ ++ISLSVNDLLACCGFLC Sbjct: 123 QLLSFYNIFSIFFFLFLEGHCGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLACCGFLC 182 Query: 536 GNGCDGGYPIAAWRYFKRRGVVTEECDPYFDNTGCSHPGCEPAYPTPKCHRKCVKGNLLW 715 G+GCDGGYP+ AWRYF GVVTEECDPYFDN GCSHPGCEP +PTPKC RKC+ N LW Sbjct: 183 GDGCDGGYPMYAWRYFVHHGVVTEECDPYFDNIGCSHPGCEPGFPTPKCVRKCIDKNQLW 242 Query: 716 RKSKHYGVNAYRVSHDPHSIMTEIYKNGPVEVAFTVYQDFAHYKSGVYKHVTGESMGGHA 895 R+SKHY VNAYR+S DPH +M E+YKNGPVEV+FTVY+DFAHYKSGVYKH+TGE MGGHA Sbjct: 243 RQSKHYSVNAYRISSDPHDVMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGEVMGGHA 302 Query: 896 VKLIGWGTSEQGEDYWLIVNSWNRGWGDDGYFKIRRGTNECGIEHSVVAGLPSARNLNVE 1075 VKLIGWGTS+ GEDYWL+ N WNRGWGDDGYFKIRRGTNECGIE VAGLPSARNL++ Sbjct: 303 VKLIGWGTSDNGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEDDAVAGLPSARNLDLV 362 Query: 1076 LDDAFLDA 1099 + A +DA Sbjct: 363 REVASMDA 370