BLASTX nr result

ID: Achyranthes23_contig00002237 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00002237
         (1607 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003626103.1| Cathepsin B [Medicago truncatula] gi|8724098...   520   e-145
gb|ACJ84734.1| unknown [Medicago truncatula] gi|388505480|gb|AFK...   520   e-145
ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]         515   e-143
gb|AFK38097.1| unknown [Lotus japonicus]                              514   e-143
gb|ACJ85175.1| unknown [Medicago truncatula]                          514   e-143
ref|XP_006418358.1| hypothetical protein EUTSA_v10008009mg [Eutr...   511   e-142
gb|EOX95504.1| Cysteine proteinases superfamily protein [Theobro...   510   e-142
ref|NP_563648.1| putative cathepsin B-like cysteine protease [Ar...   509   e-141
ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arab...   509   e-141
ref|XP_006305170.1| hypothetical protein CARUB_v10009537mg [Caps...   508   e-141
ref|XP_006491433.1| PREDICTED: cathepsin B-like isoform X1 [Citr...   508   e-141
ref|XP_006444663.1| hypothetical protein CICLE_v10020859mg [Citr...   508   e-141
ref|XP_002301457.2| putative cathepsin B-like protease family pr...   506   e-141
ref|XP_003521632.1| PREDICTED: cathepsin B [Glycine max]              506   e-140
gb|AGV54421.1| cathepsin B-like protein [Phaseolus vulgaris]          503   e-140
ref|XP_006396358.1| hypothetical protein EUTSA_v10028753mg [Eutr...   503   e-139
ref|XP_002515139.1| cathepsin B, putative [Ricinus communis] gi|...   500   e-139
gb|EXB94879.1| Cathepsin B [Morus notabilis]                          498   e-138
ref|XP_006840260.1| hypothetical protein AMTR_s00045p00036790 [A...   497   e-138
gb|ESW35262.1| hypothetical protein PHAVU_001G220100g [Phaseolus...   496   e-137

>ref|XP_003626103.1| Cathepsin B [Medicago truncatula] gi|87240982|gb|ABD32840.1|
            Peptidase C1A, papain; Somatotropin hormone; Peptidase
            C1, propeptide [Medicago truncatula]
            gi|355501118|gb|AES82321.1| Cathepsin B [Medicago
            truncatula]
          Length = 357

 Score =  520 bits (1338), Expect = e-145
 Identities = 232/313 (74%), Positives = 262/313 (83%), Gaps = 1/313 (0%)
 Frame = +1

Query: 364  RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543
            ++ I   +N NP   W+A +N  +S++T GQFKR+LG ++ P+ E  + P+V H +SL L
Sbjct: 42   QESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVTHPKSLKL 101

Query: 544  PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723
            PKEFDARTAW QC TIG+ILDQGHCGSCWAF AVE+L DRFCI ++MNISLSVNDLLACC
Sbjct: 102  PKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLACC 161

Query: 724  GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903
            GFLCG GCDGGTPIYAWRYL HHGVV+EECDPYFD  GCSHPGCEP Y TPKCVRKCV G
Sbjct: 162  GFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKG 221

Query: 904  NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083
            NQ+W+ SKHYS  AYRVK DP  IMAE+YKNGPVEVAF VFEDFAHYKSGVYKH+TGS L
Sbjct: 222  NQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSAL 281

Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263
            GGHAVKLIGWGTS +GEDYWL+ANQWN +WGDDGYFKI RGTNECGIEDDVT GLPS+KN
Sbjct: 282  GGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLPSTKN 341

Query: 1264 LGAVLPDSD-DAG 1299
            +   + D D DAG
Sbjct: 342  IVREVTDMDVDAG 354


>gb|ACJ84734.1| unknown [Medicago truncatula] gi|388505480|gb|AFK40806.1| unknown
            [Medicago truncatula]
          Length = 359

 Score =  520 bits (1338), Expect = e-145
 Identities = 232/313 (74%), Positives = 262/313 (83%), Gaps = 1/313 (0%)
 Frame = +1

Query: 364  RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543
            ++ I   +N NP   W+A +N  +S++T GQFKR+LG ++ P+ E  + P+V H +SL L
Sbjct: 44   QESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVTHPKSLKL 103

Query: 544  PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723
            PKEFDARTAW QC TIG+ILDQGHCGSCWAF AVE+L DRFCI ++MNISLSVNDLLACC
Sbjct: 104  PKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLACC 163

Query: 724  GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903
            GFLCG GCDGGTPIYAWRYL HHGVV+EECDPYFD  GCSHPGCEP Y TPKCVRKCV G
Sbjct: 164  GFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKG 223

Query: 904  NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083
            NQ+W+ SKHYS  AYRVK DP  IMAE+YKNGPVEVAF VFEDFAHYKSGVYKH+TGS L
Sbjct: 224  NQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSAL 283

Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263
            GGHAVKLIGWGTS +GEDYWL+ANQWN +WGDDGYFKI RGTNECGIEDDVT GLPS+KN
Sbjct: 284  GGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLPSTKN 343

Query: 1264 LGAVLPDSD-DAG 1299
            +   + D D DAG
Sbjct: 344  IVREVTDMDVDAG 356


>ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
          Length = 356

 Score =  515 bits (1326), Expect = e-143
 Identities = 225/309 (72%), Positives = 260/309 (84%)
 Frame = +1

Query: 364  RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543
            ++ I   +N NP   W+A +N H+S+YT  QFKR+LG + TP+ E R+ P + H +SL L
Sbjct: 41   QESIAKEINENPEAGWEAAINPHFSNYTVEQFKRLLGVKPTPKKELRSTPAISHPKSLKL 100

Query: 544  PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723
            PK FDARTAW QC TIGRILDQGHCGSCWAF AVE+LSDRFCI +++NISLSVNDLLACC
Sbjct: 101  PKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACC 160

Query: 724  GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903
            GFLCG+GCDGG P+YAW+YL HHGVV+EECDPYFD  GCSHPGCEP Y TPKCV+KCV+G
Sbjct: 161  GFLCGSGCDGGYPLYAWQYLAHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVSG 220

Query: 904  NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083
            NQ+W++SKHYS NAYRV  DP+ IM E+YKNGPVEVAF V+EDFAHYKSGVYKH+TG  L
Sbjct: 221  NQVWKKSKHYSVNAYRVSSDPHDIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGYEL 280

Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263
            GGHAVKLIGWGT+ DGEDYWL+ANQWNR WGDDGYFKI RGTNECGIE+DVT GLPS+KN
Sbjct: 281  GGHAVKLIGWGTTEDGEDYWLLANQWNREWGDDGYFKIRRGTNECGIEEDVTAGLPSTKN 340

Query: 1264 LGAVLPDSD 1290
            L   + D D
Sbjct: 341  LVREVTDMD 349


>gb|AFK38097.1| unknown [Lotus japonicus]
          Length = 357

 Score =  514 bits (1324), Expect = e-143
 Identities = 226/309 (73%), Positives = 261/309 (84%)
 Frame = +1

Query: 364  RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543
            ++ I   +N NP   W+A ++  +S+YT  QFKR+LG + +P+ E R+ P+V H RSL L
Sbjct: 42   QESIAKEINENPGAGWEAAISPRFSNYTVAQFKRLLGVKPSPKKELRSTPVVSHPRSLKL 101

Query: 544  PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723
            PK FDARTAW QC TIGRILDQGHCGSCWAF AVE+LSDRFCI  ++N+SLSVNDLLACC
Sbjct: 102  PKSFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHLDVNVSLSVNDLLACC 161

Query: 724  GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903
            GFLCG+GCDGG P+YAWRYL HHGVV+EECDPYFD  GCSHPGCEP Y TPKCVRKCV G
Sbjct: 162  GFLCGSGCDGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKG 221

Query: 904  NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083
            NQ+W++SK++S NAY VK DPY IMAE+YKNGPVEVAF V+EDFAHYKSGVYKH+TGS L
Sbjct: 222  NQIWKKSKYFSVNAYSVKSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGSQL 281

Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263
            GGHAVKLIGWGT+ +GEDYWLIANQWNRSWGDDGYF I RGTNECGIE+DVT GLPS+KN
Sbjct: 282  GGHAVKLIGWGTTDEGEDYWLIANQWNRSWGDDGYFMIRRGTNECGIEEDVTAGLPSTKN 341

Query: 1264 LGAVLPDSD 1290
            +G  + D D
Sbjct: 342  MGRWVMDMD 350


>gb|ACJ85175.1| unknown [Medicago truncatula]
          Length = 359

 Score =  514 bits (1323), Expect = e-143
 Identities = 229/313 (73%), Positives = 259/313 (82%), Gaps = 1/313 (0%)
 Frame = +1

Query: 364  RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543
            ++ I   +N NP   W+A +N  +S++T GQFKR+LG ++ P+ E  + P+V H +SL L
Sbjct: 44   QESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVTHPKSLKL 103

Query: 544  PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723
            PKEFDAR AW QC TIG+ILDQGHCGSCWAF AVE+L DRFC  ++MNISLSVNDLLACC
Sbjct: 104  PKEFDARAAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCSHFDMNISLSVNDLLACC 163

Query: 724  GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903
            GFLCG GCDGGTPIYAWRYL HHGVV+EECDPYFD  GCSHPGCEP Y TPKCVRKCV G
Sbjct: 164  GFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKG 223

Query: 904  NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083
            NQ+W+ SKHYS  AYRVK DP  IM E+YKNGPVEVAF VFEDFAHYKSGVYKH+TGS L
Sbjct: 224  NQIWKRSKHYSVKAYRVKSDPQDIMTEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSAL 283

Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263
            GGHAVKLIGWGTS +GEDYWL+ANQWN +WGDDGYFKI RGTNECGIEDDVT GLPS+KN
Sbjct: 284  GGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLPSTKN 343

Query: 1264 LGAVLPDSD-DAG 1299
            +   + D D DAG
Sbjct: 344  IVREVTDMDVDAG 356


>ref|XP_006418358.1| hypothetical protein EUTSA_v10008009mg [Eutrema salsugineum]
            gi|557096129|gb|ESQ36711.1| hypothetical protein
            EUTSA_v10008009mg [Eutrema salsugineum]
          Length = 360

 Score =  511 bits (1316), Expect = e-142
 Identities = 226/310 (72%), Positives = 256/310 (82%)
 Frame = +1

Query: 364  RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543
            +D I+  VN NPN  WKA  N  +++ T  +FKR+LG + TP+ E   +P+V HDRSL L
Sbjct: 45   QDEIVKEVNENPNAGWKAAFNDRFANATVAEFKRLLGVKPTPKKEYLGVPIVSHDRSLKL 104

Query: 544  PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723
            PKEFDARTAW QC +I RILDQGHCGSCWAF AVE+LSDRFCIKYNMNISLSVNDLLACC
Sbjct: 105  PKEFDARTAWSQCTSIARILDQGHCGSCWAFGAVESLSDRFCIKYNMNISLSVNDLLACC 164

Query: 724  GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903
            GFLCG GC+GG PI AWRY  HHGVV+EECDPYFD  GCSHPGCEP YPTP+CVRKCV+G
Sbjct: 165  GFLCGQGCNGGYPISAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPRCVRKCVSG 224

Query: 904  NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083
            NQLWRESKHY  +AY+++ +P  IMAE+YKNGPVEV F V+EDFAHYKSGVYKH+TGS +
Sbjct: 225  NQLWRESKHYGVSAYKIRSNPQDIMAEVYKNGPVEVDFTVYEDFAHYKSGVYKHITGSNI 284

Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263
            GGHAVKLIGWGTS DGEDYWL+ANQWNRSWGDDGYFKI RGTNECGIE     GLPS +N
Sbjct: 285  GGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGAVAGLPSDRN 344

Query: 1264 LGAVLPDSDD 1293
            +   +  SDD
Sbjct: 345  VFKGITTSDD 354


>gb|EOX95504.1| Cysteine proteinases superfamily protein [Theobroma cacao]
          Length = 359

 Score =  510 bits (1313), Expect = e-142
 Identities = 227/309 (73%), Positives = 255/309 (82%)
 Frame = +1

Query: 364  RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543
            +D I+  VN NP   WKA +N   S+YT G+FK +LG + TP+ E   IP++ H +SL +
Sbjct: 42   QDSIVKQVNENPKAGWKAALNPRLSNYTVGEFKHLLGVKPTPKKELLGIPVITHGKSLKV 101

Query: 544  PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723
            P +FDARTAWPQC TIGRILDQGHCGSCWAF AVE+LSDRFCI ++MNISLSVNDLLACC
Sbjct: 102  PTKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFSMNISLSVNDLLACC 161

Query: 724  GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903
            GFLCG+GCDGG PI AWRY V  GVV+EECDPYFD  GCSHPGCEP YPTP+CV+KCV G
Sbjct: 162  GFLCGSGCDGGYPISAWRYFVRRGVVTEECDPYFDDTGCSHPGCEPAYPTPRCVKKCVKG 221

Query: 904  NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083
            NQLWRESKHYS  AYR+  DP  IMAE+Y NGPVEV+F V+EDFAHYKSGVYKHVTG  +
Sbjct: 222  NQLWRESKHYSVGAYRINSDPADIMAEVYTNGPVEVSFTVYEDFAHYKSGVYKHVTGGVM 281

Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263
            GGHAVKLIGWGTS DGEDYWL+ANQWNR WGDDGYFKISRGTNECGIEDDV  GLPS+KN
Sbjct: 282  GGHAVKLIGWGTSDDGEDYWLLANQWNRGWGDDGYFKISRGTNECGIEDDVVAGLPSTKN 341

Query: 1264 LGAVLPDSD 1290
            L   + D D
Sbjct: 342  LVREVGDMD 350


>ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
            gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10
            [Arabidopsis thaliana] gi|14532526|gb|AAK63991.1|
            At1g02300/T6A9_10 [Arabidopsis thaliana]
            gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis
            thaliana] gi|332189292|gb|AEE27413.1| putative cathepsin
            B-like cysteine protease [Arabidopsis thaliana]
          Length = 362

 Score =  509 bits (1312), Expect = e-141
 Identities = 226/307 (73%), Positives = 255/307 (83%)
 Frame = +1

Query: 373  IIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNLPKE 552
            I+  VN NPN  WKA  N  +++ T  +FKR+LG + TP++E   +P+V HD SL LPKE
Sbjct: 50   IVKEVNENPNAGWKASFNDRFANATVAEFKRLLGVKPTPKTEFLGVPIVSHDISLKLPKE 109

Query: 553  FDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACCGFL 732
            FDARTAW QC +IGRILDQGHCGSCWAF AVE+LSDRFCIKYNMN+SLSVNDLLACCGFL
Sbjct: 110  FDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFL 169

Query: 733  CGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAGNQL 912
            CG GC+GG PI AWRY  HHGVV+EECDPYFD  GCSHPGCEP YPTPKC RKCV+GNQL
Sbjct: 170  CGQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQL 229

Query: 913  WRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYLGGH 1092
            WRESKHY  +AY+V+  P  IMAE+YKNGPVEVAF V+EDFAHYKSGVYKH+TG+ +GGH
Sbjct: 230  WRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGH 289

Query: 1093 AVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKNLGA 1272
            AVKLIGWGTS DGEDYWL+ANQWNRSWGDDGYFKI RGTNECGIE  V  GLPS +N+  
Sbjct: 290  AVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVVK 349

Query: 1273 VLPDSDD 1293
             +  SDD
Sbjct: 350  GITTSDD 356


>ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
            lyrata] gi|297335237|gb|EFH65654.1| hypothetical protein
            ARALYDRAFT_887368 [Arabidopsis lyrata subsp. lyrata]
          Length = 360

 Score =  509 bits (1312), Expect = e-141
 Identities = 226/307 (73%), Positives = 255/307 (83%)
 Frame = +1

Query: 373  IIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNLPKE 552
            I+  VN NPN  WKA  N  +++ T  +FKR+LG + TP++E   +P+V HD SL LPKE
Sbjct: 48   IVKEVNENPNAGWKAAFNDRFANATVAEFKRLLGVKPTPKTEFLGVPIVSHDISLKLPKE 107

Query: 553  FDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACCGFL 732
            FDARTAW QC ++GRILDQGHCGSCWAF AVE+LSDRFCIKYNMNISLSVNDLLACCGFL
Sbjct: 108  FDARTAWSQCTSVGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNISLSVNDLLACCGFL 167

Query: 733  CGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAGNQL 912
            CG GC+GG PI AWRY  HHGVV+EECDPYFD  GCSHPGCEP YPTPKC RKCV+GNQL
Sbjct: 168  CGQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQL 227

Query: 913  WRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYLGGH 1092
            WRESKHY  +AY+V+  P  IMAE+YKNGPVEVAF V+EDFAHYKSGVYKH+TG+ +GGH
Sbjct: 228  WRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGH 287

Query: 1093 AVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKNLGA 1272
            AVKLIGWGTS DGEDYWL+ANQWNRSWGDDGYFKI RGTNECGIE  V  GLPS +N+  
Sbjct: 288  AVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVFK 347

Query: 1273 VLPDSDD 1293
             +  SDD
Sbjct: 348  GITTSDD 354


>ref|XP_006305170.1| hypothetical protein CARUB_v10009537mg [Capsella rubella]
            gi|482573881|gb|EOA38068.1| hypothetical protein
            CARUB_v10009537mg [Capsella rubella]
          Length = 360

 Score =  508 bits (1309), Expect = e-141
 Identities = 226/307 (73%), Positives = 257/307 (83%)
 Frame = +1

Query: 373  IIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNLPKE 552
            I++ VN+NP   WKA +N  +++ T  +FKR+LG + TP++E   +P+V H  SL LPKE
Sbjct: 48   IVNEVNANPKAGWKAALNDRFANATVAEFKRLLGVKPTPKTEFLGVPIVSHGISLKLPKE 107

Query: 553  FDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACCGFL 732
            FDARTAW QC +IGRILDQGHCGSCWAF AVE+LSDRFCIKYNMNISLSVNDLLACCGFL
Sbjct: 108  FDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNISLSVNDLLACCGFL 167

Query: 733  CGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAGNQL 912
            CG GC+GG PI AWRY  HHGVV+EECDPYFD  GCSHPGCEP YPTPKCVRKCV+GNQL
Sbjct: 168  CGQGCNGGYPISAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCVRKCVSGNQL 227

Query: 913  WRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYLGGH 1092
            WRESKHY  +AY+V+  P  IMAE+YKNGPVEVAF V+EDFAHYKSGVYKH+TG+ +GGH
Sbjct: 228  WRESKHYGVSAYKVRSHPEDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGH 287

Query: 1093 AVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKNLGA 1272
            AVKLIGWGTS DGEDYWL+ANQWNRSWGDDGYFKI RGTNECGIE  V  GLPS +N+  
Sbjct: 288  AVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVFK 347

Query: 1273 VLPDSDD 1293
             +  SDD
Sbjct: 348  GITASDD 354


>ref|XP_006491433.1| PREDICTED: cathepsin B-like isoform X1 [Citrus sinensis]
          Length = 362

 Score =  508 bits (1308), Expect = e-141
 Identities = 226/309 (73%), Positives = 254/309 (82%)
 Frame = +1

Query: 364  RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543
            +D II  VN NP   WKA  N  +S+YT GQFK +LG + TP+     +P+  HD+SL L
Sbjct: 47   QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL 106

Query: 544  PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723
            PK FDAR+AWPQC TI RILDQGHCGSCWAF AVEALSDRFCI + MN+SLSVNDLLACC
Sbjct: 107  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACC 166

Query: 724  GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903
            GFLCG+GCDGG PI AWRY VHHGVV+EECDPYFD+ GCSHPGCEP YPTPKCVRKCV  
Sbjct: 167  GFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK 226

Query: 904  NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083
            NQLWR SKHYS +AYR+  DP  IMAE+YKNGPVEV+F V+EDFAHYKSGVYKH+TG  +
Sbjct: 227  NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 286

Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263
            GGHAVKLIGWGTS DGEDYW++ANQWNRSWG DGYFKI RG+NECGIE+DV  GLPSSKN
Sbjct: 287  GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 346

Query: 1264 LGAVLPDSD 1290
            L   +  +D
Sbjct: 347  LVKEITSAD 355


>ref|XP_006444663.1| hypothetical protein CICLE_v10020859mg [Citrus clementina]
            gi|568876746|ref|XP_006491434.1| PREDICTED: cathepsin
            B-like isoform X2 [Citrus sinensis]
            gi|557546925|gb|ESR57903.1| hypothetical protein
            CICLE_v10020859mg [Citrus clementina]
          Length = 354

 Score =  508 bits (1308), Expect = e-141
 Identities = 226/309 (73%), Positives = 254/309 (82%)
 Frame = +1

Query: 364  RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543
            +D II  VN NP   WKA  N  +S+YT GQFK +LG + TP+     +P+  HD+SL L
Sbjct: 39   QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL 98

Query: 544  PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723
            PK FDAR+AWPQC TI RILDQGHCGSCWAF AVEALSDRFCI + MN+SLSVNDLLACC
Sbjct: 99   PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACC 158

Query: 724  GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903
            GFLCG+GCDGG PI AWRY VHHGVV+EECDPYFD+ GCSHPGCEP YPTPKCVRKCV  
Sbjct: 159  GFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK 218

Query: 904  NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083
            NQLWR SKHYS +AYR+  DP  IMAE+YKNGPVEV+F V+EDFAHYKSGVYKH+TG  +
Sbjct: 219  NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 278

Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263
            GGHAVKLIGWGTS DGEDYW++ANQWNRSWG DGYFKI RG+NECGIE+DV  GLPSSKN
Sbjct: 279  GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 338

Query: 1264 LGAVLPDSD 1290
            L   +  +D
Sbjct: 339  LVKEITSAD 347


>ref|XP_002301457.2| putative cathepsin B-like protease family protein [Populus
            trichocarpa] gi|550345314|gb|EEE80730.2| putative
            cathepsin B-like protease family protein [Populus
            trichocarpa]
          Length = 357

 Score =  506 bits (1304), Expect = e-141
 Identities = 223/301 (74%), Positives = 251/301 (83%)
 Frame = +1

Query: 364  RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543
            +D I+  VN NP   WKA +N H+S+YT  QFK +LG + TP+ E R IP++ H +SL L
Sbjct: 42   QDSILKKVNGNPKAGWKATMNHHFSNYTVAQFKYLLGVKPTPKEELRGIPVISHPKSLRL 101

Query: 544  PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723
            P+EFDARTAWPQC TIG+ILDQGHCGSCWAF AVE+LSDRFCI Y MNISLSVNDLLACC
Sbjct: 102  PEEFDARTAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCIHYGMNISLSVNDLLACC 161

Query: 724  GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903
            GFLCG+GC+GG PI AWRY VHHGVV+EECDPYFD  GCSHPGCEPGYPTPKC RKCV  
Sbjct: 162  GFLCGSGCNGGYPISAWRYFVHHGVVTEECDPYFDDIGCSHPGCEPGYPTPKCARKCVNK 221

Query: 904  NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083
            NQLW++SKHY    YR+  DP  IMAE+YKNGPVEVAF V+EDFAHYKSGVYKH+TG  +
Sbjct: 222  NQLWKKSKHYGVKPYRIDSDPDSIMAEIYKNGPVEVAFTVYEDFAHYKSGVYKHITGGMM 281

Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263
            GGHAVKLIGWGTS DGE YWL+ANQWNR WGDDG+FKI RGTNECGIE DV  GLPS++N
Sbjct: 282  GGHAVKLIGWGTSEDGEAYWLLANQWNRGWGDDGFFKIRRGTNECGIEGDVVAGLPSTRN 341

Query: 1264 L 1266
            L
Sbjct: 342  L 342


>ref|XP_003521632.1| PREDICTED: cathepsin B [Glycine max]
          Length = 357

 Score =  506 bits (1302), Expect = e-140
 Identities = 221/302 (73%), Positives = 256/302 (84%)
 Frame = +1

Query: 385  VNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNLPKEFDAR 564
            +N NP   W+A +N  +S+YT  QFKR+LG +  P+ E R+ P + H ++L LPK FDAR
Sbjct: 49   INENPEAGWEAAINPRFSNYTVEQFKRLLGVKPMPKKELRSTPAISHPKTLKLPKNFDAR 108

Query: 565  TAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACCGFLCGNG 744
            TAW QC TIGRILDQGHCGSCWAF AVE+LSDRFCI +++NISLSVNDLLACCGFLCG+G
Sbjct: 109  TAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSG 168

Query: 745  CDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAGNQLWRES 924
            CDGG P+YAWRYL HHGVV+EECDPYFD  GCSHPGCEP Y TPKCV+KCV+GNQ+W++S
Sbjct: 169  CDGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVSGNQVWKKS 228

Query: 925  KHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYLGGHAVKL 1104
            KHYS +AYRV  DP+ IMAE+YKNGPVEVAF V+EDFA+YKSGVYKH+TG  LGGHAVKL
Sbjct: 229  KHYSVSAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGGHAVKL 288

Query: 1105 IGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKNLGAVLPD 1284
            IGWGT+ DGEDYWL+ANQWNR WGDDGYFKI RGTNECGIE+DVT GLPS+KNL   + D
Sbjct: 289  IGWGTTDDGEDYWLLANQWNREWGDDGYFKIRRGTNECGIEEDVTAGLPSTKNLVREVTD 348

Query: 1285 SD 1290
             D
Sbjct: 349  MD 350


>gb|AGV54421.1| cathepsin B-like protein [Phaseolus vulgaris]
          Length = 356

 Score =  503 bits (1296), Expect = e-140
 Identities = 221/306 (72%), Positives = 256/306 (83%)
 Frame = +1

Query: 373  IIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNLPKE 552
            I   +N NP   W+A +N  +S+YT  QFKR+LG ++TP+ E R+ P++ H +SL LP  
Sbjct: 44   IAKQINENPEAGWEAAINPRFSNYTVEQFKRLLGVKQTPKIELRSTPVISHSKSLKLPVN 103

Query: 553  FDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACCGFL 732
            FDARTAW QC TIGRILDQGHCGSCWAF AVE+LSDRFCI +++NISLSVNDLLACCGFL
Sbjct: 104  FDARTAWSQCNTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFL 163

Query: 733  CGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAGNQL 912
            CG+GCDGG P+YAWRYL HHGVV+EECDPYFD  GCSHPGCEP Y TPKCV+KCV GNQL
Sbjct: 164  CGSGCDGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVNGNQL 223

Query: 913  WRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYLGGH 1092
            W++SKH+S NAY V  +P+ IMAE+Y NGPVEVAF V+EDFAHYKSGVYKHVTG  LGGH
Sbjct: 224  WKKSKHFSVNAYTVNSNPHDIMAEVYTNGPVEVAFTVYEDFAHYKSGVYKHVTGHALGGH 283

Query: 1093 AVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKNLGA 1272
            AVKLIGWGT+ DG+DYWL+ANQWNR WGDDGYFKI RGTNECGIE++VT GLPS+KNL  
Sbjct: 284  AVKLIGWGTTDDGQDYWLLANQWNREWGDDGYFKIRRGTNECGIEEEVTAGLPSTKNLVR 343

Query: 1273 VLPDSD 1290
             + D D
Sbjct: 344  EVTDMD 349


>ref|XP_006396358.1| hypothetical protein EUTSA_v10028753mg [Eutrema salsugineum]
            gi|312283137|dbj|BAJ34434.1| unnamed protein product
            [Thellungiella halophila] gi|557097375|gb|ESQ37811.1|
            hypothetical protein EUTSA_v10028753mg [Eutrema
            salsugineum]
          Length = 362

 Score =  503 bits (1294), Expect = e-139
 Identities = 222/313 (70%), Positives = 258/313 (82%)
 Frame = +1

Query: 373  IIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNLPKE 552
            I+  VN NP+  WKA +N  +S+ T  +FKR+LG + TP+     +P+V HDRSL LPKE
Sbjct: 50   IVKKVNQNPDAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFLGVPIVSHDRSLKLPKE 109

Query: 553  FDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACCGFL 732
            FDARTAWPQC +IG ILDQGHCGSCWAF AVE+LSDRFCI++ MNISLSVNDLLACCGF 
Sbjct: 110  FDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIEFGMNISLSVNDLLACCGFR 169

Query: 733  CGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAGNQL 912
            CG+GCDGG PI AW+Y  + GVV+EECDPYFD  GCSHPGCEP YPTPKC+RKCV+GNQL
Sbjct: 170  CGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDDTGCSHPGCEPAYPTPKCMRKCVSGNQL 229

Query: 913  WRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYLGGH 1092
            W +SKHYS + Y VK +P  IMAE+YKNGPVEV+F V+EDFAHYKSGVYKH+TGS +GGH
Sbjct: 230  WSQSKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGH 289

Query: 1093 AVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKNLGA 1272
            AVKLIGWGT+ +GEDYWL+ANQWNRSWGDDGYF I RGTNECGIED+   GLPSS+N+  
Sbjct: 290  AVKLIGWGTTDEGEDYWLLANQWNRSWGDDGYFMIRRGTNECGIEDEPVAGLPSSRNVFK 349

Query: 1273 VLPDSDDAGYASV 1311
            V+  SDD   ASV
Sbjct: 350  VITGSDDLSVASV 362


>ref|XP_002515139.1| cathepsin B, putative [Ricinus communis] gi|223545619|gb|EEF47123.1|
            cathepsin B, putative [Ricinus communis]
          Length = 376

 Score =  500 bits (1287), Expect = e-139
 Identities = 221/321 (68%), Positives = 257/321 (80%), Gaps = 17/321 (5%)
 Frame = +1

Query: 364  RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543
            ++ II  VN NP+  W+A +N   S++T GQFK +LGA+ TP+ E   +P++ H ++L L
Sbjct: 42   QESIIKKVNENPDAGWEAAMNPQLSNFTVGQFKYLLGAKPTPKKELMGVPMISHPKTLKL 101

Query: 544  PKEFDARTAWPQCRTIGRILDQ-----------------GHCGSCWAFAAVEALSDRFCI 672
            PKEFDARTAWP C TIG+IL Q                 GHCGSCWAF AVE+LSDRFCI
Sbjct: 102  PKEFDARTAWPHCSTIGKILGQLLSFYNIFSIFFFLFLEGHCGSCWAFGAVESLSDRFCI 161

Query: 673  KYNMNISLSVNDLLACCGFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPG 852
             + MNISLSVNDLLACCGFLCG+GCDGG P+YAWRY VHHGVV+EECDPYFD  GCSHPG
Sbjct: 162  HFGMNISLSVNDLLACCGFLCGDGCDGGYPMYAWRYFVHHGVVTEECDPYFDNIGCSHPG 221

Query: 853  CEPGYPTPKCVRKCVAGNQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFED 1032
            CEPG+PTPKCVRKC+  NQLWR+SKHYS NAYR+  DP+ +MAE+YKNGPVEV+F V+ED
Sbjct: 222  CEPGFPTPKCVRKCIDKNQLWRQSKHYSVNAYRISSDPHDVMAEVYKNGPVEVSFTVYED 281

Query: 1033 FAHYKSGVYKHVTGSYLGGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTN 1212
            FAHYKSGVYKH+TG  +GGHAVKLIGWGTS +GEDYWL+ANQWNR WGDDGYFKI RGTN
Sbjct: 282  FAHYKSGVYKHITGEVMGGHAVKLIGWGTSDNGEDYWLLANQWNRGWGDDGYFKIRRGTN 341

Query: 1213 ECGIEDDVTGGLPSSKNLGAV 1275
            ECGIEDD   GLPS++NL  V
Sbjct: 342  ECGIEDDAVAGLPSARNLDLV 362


>gb|EXB94879.1| Cathepsin B [Morus notabilis]
          Length = 420

 Score =  498 bits (1283), Expect = e-138
 Identities = 215/301 (71%), Positives = 251/301 (83%)
 Frame = +1

Query: 364  RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543
            ++ I+  VN NP   W+A +N  +S++T G+F+R+LG + TP+ E  + P++ H +SL L
Sbjct: 44   QESIVKRVNENPEAGWRAEMNPRFSNFTAGEFRRLLGVKETPKHELESTPVITHPKSLKL 103

Query: 544  PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723
            P +FDARTAWPQC TI RILDQGHCGSCWAF AVE+LSDRFCI +N NISLSVND+LACC
Sbjct: 104  PDKFDARTAWPQCSTIKRILDQGHCGSCWAFGAVESLSDRFCIHFNTNISLSVNDVLACC 163

Query: 724  GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903
            GFLCG GCDGGTP++AWRYL HHGVV+EECDPYFD  GCSHPGCEP YPTP+C RKCV  
Sbjct: 164  GFLCGAGCDGGTPLFAWRYLHHHGVVTEECDPYFDNTGCSHPGCEPAYPTPRCHRKCVNK 223

Query: 904  NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083
            N LWR+SKHYS NAY++  DP+ IMAE+YKNGPVEV F V+EDFAHYKSGVYKH+TGS +
Sbjct: 224  NNLWRQSKHYSVNAYKISSDPHSIMAEVYKNGPVEVDFTVYEDFAHYKSGVYKHITGSVM 283

Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263
            GGHAVKLIGWGTS  GEDYWL+ANQWNRSWGDDGYFKI RGTNECGIE D   G+PS +N
Sbjct: 284  GGHAVKLIGWGTSDTGEDYWLVANQWNRSWGDDGYFKIRRGTNECGIEKDAVAGMPSKRN 343

Query: 1264 L 1266
            L
Sbjct: 344  L 344


>ref|XP_006840260.1| hypothetical protein AMTR_s00045p00036790 [Amborella trichopoda]
            gi|548841978|gb|ERN01935.1| hypothetical protein
            AMTR_s00045p00036790 [Amborella trichopoda]
          Length = 351

 Score =  497 bits (1280), Expect = e-138
 Identities = 217/316 (68%), Positives = 261/316 (82%)
 Frame = +1

Query: 364  RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543
            +D II+ +N NPN  W+A +N  +S+YT GQFK ILG +  P++    +P+  +++++ L
Sbjct: 37   QDSIIEKINGNPNAGWQAALNPRFSNYTIGQFKYILGVKPVPQNSLVPVPIRRYEKTVKL 96

Query: 544  PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723
            PK+FDARTAW QC TI RILDQGHCGSCWAF AVE+LSDRFCI + MNISLSVNDLL+CC
Sbjct: 97   PKDFDARTAWTQCATISRILDQGHCGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLSCC 156

Query: 724  GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903
            GF+CG+GCDGG PIYAWRY V +GVV+EECDPYFD  GCSHPGCEPG+PTP+C RKC   
Sbjct: 157  GFMCGDGCDGGYPIYAWRYFVQNGVVTEECDPYFDDIGCSHPGCEPGFPTPQCERKCKVK 216

Query: 904  NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083
            NQLW+ESKH+S NAYR+  DP  IMAE+YKNGPVEVAF V+EDFAHYKSG+YKH+TG  +
Sbjct: 217  NQLWQESKHFSVNAYRIDSDPSSIMAEVYKNGPVEVAFTVYEDFAHYKSGIYKHITGGIM 276

Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263
            GGHAVKLIGWGTS +GEDYWL+ANQWNR WGDDGYFKI RGTNECGIE+DV  G+PS+KN
Sbjct: 277  GGHAVKLIGWGTSEEGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEEDVVAGMPSTKN 336

Query: 1264 LGAVLPDSDDAGYASV 1311
            L   + D  D+G+ +V
Sbjct: 337  LIKNMADG-DSGHVTV 351


>gb|ESW35262.1| hypothetical protein PHAVU_001G220100g [Phaseolus vulgaris]
          Length = 357

 Score =  496 bits (1276), Expect = e-137
 Identities = 220/307 (71%), Positives = 255/307 (83%), Gaps = 1/307 (0%)
 Frame = +1

Query: 373  IIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNLPKE 552
            I   +N NP   W+A +N  +S+YT  QFKR+LG ++TP+ E R+ P++ H +SL LP  
Sbjct: 44   IAKQINENPEAGWEAALNPRFSNYTVEQFKRLLGVKQTPKIELRSTPVISHPKSLKLPVN 103

Query: 553  FDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACCGFL 732
            FDAR AW QC TIGRILDQGHCGSCWAF AVE+LSDRFCI +++NISLSVNDLLACCGFL
Sbjct: 104  FDARKAWSQCNTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFL 163

Query: 733  CGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCV-AGNQ 909
            CG+GCDGG P+YAWRYL HHGVV+EECDPYFD  GCSHPGCEP Y TPKCV+KCV  GNQ
Sbjct: 164  CGSGCDGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVDGGNQ 223

Query: 910  LWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYLGG 1089
            LW++SKH+S NAY V  +P+ IMAE+Y NGPVEVAF V+EDFAHYKSGVYKHVTG  LGG
Sbjct: 224  LWKKSKHFSVNAYTVNSNPHDIMAEVYTNGPVEVAFTVYEDFAHYKSGVYKHVTGYALGG 283

Query: 1090 HAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKNLG 1269
            HAVKLIGWGT+ DG+DYWL+ANQWNR WGDDGYFKI RGTNECGIE++VT GLPS+KNL 
Sbjct: 284  HAVKLIGWGTTDDGQDYWLLANQWNREWGDDGYFKIRRGTNECGIEEEVTAGLPSTKNLV 343

Query: 1270 AVLPDSD 1290
              + D D
Sbjct: 344  REVTDMD 350


Top