BLASTX nr result

ID: Ziziphus21_contig00002279 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ziziphus21_contig00002279
         (1493 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007221486.1| hypothetical protein PRUPE_ppa007538mg [Prun...   580   e-162
ref|XP_008227748.1| PREDICTED: cathepsin B [Prunus mume]              575   e-161
ref|XP_008343231.1| PREDICTED: cathepsin B-like [Malus domestica]     574   e-161
ref|XP_009341533.1| PREDICTED: cathepsin B-like [Pyrus x bretsch...   572   e-160
gb|KCW88609.1| hypothetical protein EUGRSUZ_A00982 [Eucalyptus g...   569   e-159
ref|XP_006491433.1| PREDICTED: cathepsin B-like isoform X1 [Citr...   565   e-158
ref|XP_006444663.1| hypothetical protein CICLE_v10020859mg [Citr...   564   e-158
gb|KDO86712.1| hypothetical protein CISIN_1g018568mg [Citrus sin...   564   e-158
gb|KDO86711.1| hypothetical protein CISIN_1g018568mg [Citrus sin...   564   e-158
ref|XP_003521632.1| PREDICTED: cathepsin B [Glycine max] gi|7343...   563   e-157
ref|XP_012083054.1| PREDICTED: cathepsin B [Jatropha curcas] gi|...   562   e-157
ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max] gi...   560   e-156
ref|XP_011025297.1| PREDICTED: cathepsin B-like [Populus euphrat...   559   e-156
ref|XP_007051347.1| Cysteine proteinases superfamily protein [Th...   558   e-156
ref|XP_002301457.2| putative cathepsin B-like protease family pr...   558   e-156
ref|XP_010247244.1| PREDICTED: cathepsin B-like [Nelumbo nucifera]    558   e-156
ref|XP_011025296.1| PREDICTED: cathepsin B-like [Populus euphrat...   556   e-155
ref|XP_012489194.1| PREDICTED: cathepsin B-like isoform X2 [Goss...   553   e-154
ref|XP_010045446.1| PREDICTED: cathepsin B-like [Eucalyptus gran...   553   e-154
ref|XP_012489193.1| PREDICTED: cathepsin B-like isoform X1 [Goss...   551   e-154

>ref|XP_007221486.1| hypothetical protein PRUPE_ppa007538mg [Prunus persica]
            gi|462418236|gb|EMJ22685.1| hypothetical protein
            PRUPE_ppa007538mg [Prunus persica]
          Length = 364

 Score =  580 bits (1494), Expect = e-162
 Identities = 267/330 (80%), Positives = 291/330 (88%)
 Frame = -2

Query: 1216 VAERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFRHLLGVKPTPQK 1037
            +A +PV K KLNS ILQ+SIIKQIN NP AGWEAA+NPRFSNYT+ QF HLLGVKPTP+K
Sbjct: 32   IAAKPVTKSKLNSRILQDSIIKQINDNPMAGWEAAMNPRFSNYTVSQFMHLLGVKPTPRK 91

Query: 1036 DLKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIH 857
            DL++ P++ H KSLKLP  FDARTAWPQC+TIGRILDQGHCGSCWAF AVE+LSDRFCIH
Sbjct: 92   DLQSFPILTHPKSLKLPTNFDARTAWPQCNTIGRILDQGHCGSCWAFAAVEALSDRFCIH 151

Query: 856  FDVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWRYFVHHGVVTEECDPYFDDTGCSHPGC 677
            F +NISLSVNDLLA            GYPIYAWRYFVHHGVVTEECDPYFD TGCSHPGC
Sbjct: 152  FGMNISLSVNDLLACCGFMCGDGCDGGYPIYAWRYFVHHGVVTEECDPYFDPTGCSHPGC 211

Query: 676  EPAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIKSDPHSIMAEIYKNGPVEVSFTVYEDF 497
            EPAYPTPKCV+KC +KNQLW NSK YSI+AYRI SD HSIMAE+Y NGPVEV+FTVYEDF
Sbjct: 212  EPAYPTPKCVKKCTDKNQLWKNSKRYSINAYRINSDSHSIMAEVYSNGPVEVAFTVYEDF 271

Query: 496  AHYKSGVYKHITGDVMGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIRRGTNE 317
            AHYKSGVY+HI GDV+GGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMI+RGTNE
Sbjct: 272  AHYKSGVYRHIKGDVLGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIKRGTNE 331

Query: 316  CGIEEDVVAGLPSSRNLVREIASVEADAVV 227
            CGIEEDVVAGLPS +N +RE+AS  ADAVV
Sbjct: 332  CGIEEDVVAGLPSLKNFIREVAS--ADAVV 359


>ref|XP_008227748.1| PREDICTED: cathepsin B [Prunus mume]
          Length = 364

 Score =  575 bits (1483), Expect = e-161
 Identities = 261/326 (80%), Positives = 286/326 (87%)
 Frame = -2

Query: 1216 VAERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFRHLLGVKPTPQK 1037
            +A +PV K KLNS ILQ+SIIKQIN NP AGWEAA+NPRFSNYT+ QF HLLGVKPTP+K
Sbjct: 32   IAAKPVTKSKLNSRILQDSIIKQINDNPMAGWEAAMNPRFSNYTVSQFMHLLGVKPTPRK 91

Query: 1036 DLKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIH 857
            D ++ P++ H KSLKLP  FDARTAWPQC+TIGRILDQGHCGSCWAF AVE+LSDRFCIH
Sbjct: 92   DFQSFPILTHPKSLKLPTNFDARTAWPQCNTIGRILDQGHCGSCWAFAAVEALSDRFCIH 151

Query: 856  FDVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWRYFVHHGVVTEECDPYFDDTGCSHPGC 677
            F +NISLSVNDLLA            GYPIYAWRYFVHHGVVTEECDPYFD TGCSHPGC
Sbjct: 152  FGMNISLSVNDLLACCGFMCGDGCDGGYPIYAWRYFVHHGVVTEECDPYFDPTGCSHPGC 211

Query: 676  EPAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIKSDPHSIMAEIYKNGPVEVSFTVYEDF 497
            EPAYPTPKCV+KC +KNQLW N K YSI+AYRI SD HSIMAE+Y NGPVEV+FTVYEDF
Sbjct: 212  EPAYPTPKCVKKCADKNQLWKNLKRYSINAYRINSDSHSIMAEVYSNGPVEVAFTVYEDF 271

Query: 496  AHYKSGVYKHITGDVMGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIRRGTNE 317
            AHYKSGVY+HI GD +GGHAVKLIGWGTTDAGEDYWLLANQWNR WGDDGYFMI+RGTNE
Sbjct: 272  AHYKSGVYRHIKGDALGGHAVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFMIKRGTNE 331

Query: 316  CGIEEDVVAGLPSSRNLVREIASVEA 239
            CGIEEDVVAGLPSS+N +RE+ASV+A
Sbjct: 332  CGIEEDVVAGLPSSKNFIREVASVDA 357


>ref|XP_008343231.1| PREDICTED: cathepsin B-like [Malus domestica]
          Length = 363

 Score =  574 bits (1480), Expect = e-161
 Identities = 260/332 (78%), Positives = 292/332 (87%)
 Frame = -2

Query: 1228 QAKAVAERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFRHLLGVKP 1049
            Q + +A +P++K KL S ILQ+SIIKQIN NP AGWEAA+NPRFSNYT+ QF HLLGVKP
Sbjct: 27   QPQLIAAKPLSKTKLQSPILQDSIIKQINENPKAGWEAAMNPRFSNYTVSQFMHLLGVKP 86

Query: 1048 TPQKDLKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDR 869
            TPQKDL++ P+  + KSLKLPN FDARTAWPQC+TIGRILDQGHCGSCWAF AVE+LSDR
Sbjct: 87   TPQKDLQSFPIKTYPKSLKLPNNFDARTAWPQCNTIGRILDQGHCGSCWAFAAVEALSDR 146

Query: 868  FCIHFDVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWRYFVHHGVVTEECDPYFDDTGCS 689
            FC+H+ +NISLSVNDLLA            GYPIYAWRYF+HHGVVTEECDPYFD TGCS
Sbjct: 147  FCVHYGMNISLSVNDLLACCGFMCGAGCNGGYPIYAWRYFIHHGVVTEECDPYFDSTGCS 206

Query: 688  HPGCEPAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIKSDPHSIMAEIYKNGPVEVSFTV 509
            HPGCEPAYPTPKCV+KCV+ NQ+W NSK YSISAYRI SDPHSIMAE+Y+NGPVEV+FTV
Sbjct: 207  HPGCEPAYPTPKCVKKCVDGNQIWKNSKRYSISAYRINSDPHSIMAEVYRNGPVEVAFTV 266

Query: 508  YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIRR 329
            YEDFAHYKSGVYKH+ GDV+GGHAVKLIGWGTT+ GEDYWLLANQWNRSWGDDGYF IRR
Sbjct: 267  YEDFAHYKSGVYKHVKGDVLGGHAVKLIGWGTTNDGEDYWLLANQWNRSWGDDGYFKIRR 326

Query: 328  GTNECGIEEDVVAGLPSSRNLVREIASVEADA 233
            GTNECGIEEDVVAGLPSS+N + ++ASV+A A
Sbjct: 327  GTNECGIEEDVVAGLPSSKNFISQVASVDAVA 358


>ref|XP_009341533.1| PREDICTED: cathepsin B-like [Pyrus x bretschneideri]
          Length = 363

 Score =  572 bits (1474), Expect = e-160
 Identities = 259/332 (78%), Positives = 292/332 (87%)
 Frame = -2

Query: 1228 QAKAVAERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFRHLLGVKP 1049
            Q + +A +P++K KL S ILQ+SIIKQIN NP AGWEAA+NPRFSNYT+ QF HLLGVKP
Sbjct: 27   QPQLIAVKPLSKTKLQSPILQDSIIKQINENPKAGWEAAMNPRFSNYTVSQFMHLLGVKP 86

Query: 1048 TPQKDLKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDR 869
            TPQKDL++ P+  + KSLKLPN FDARTAWPQC+TIGRILDQGHCGSCWAF AVE+LSDR
Sbjct: 87   TPQKDLQSFPIKTYPKSLKLPNNFDARTAWPQCNTIGRILDQGHCGSCWAFAAVEALSDR 146

Query: 868  FCIHFDVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWRYFVHHGVVTEECDPYFDDTGCS 689
            FC+ + +NISLSVNDLLA            GYPIYAWRYF+HHGVVTEECDPYFD TGCS
Sbjct: 147  FCVRYGMNISLSVNDLLACCGFMCGAGCNGGYPIYAWRYFIHHGVVTEECDPYFDSTGCS 206

Query: 688  HPGCEPAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIKSDPHSIMAEIYKNGPVEVSFTV 509
            HPGC+PAYPTPKCV+KCV+ NQ+W NSKHYSISAYRI SDPHSIMAE+Y+NGPVEV+FTV
Sbjct: 207  HPGCDPAYPTPKCVKKCVDGNQIWKNSKHYSISAYRINSDPHSIMAEVYRNGPVEVAFTV 266

Query: 508  YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIRR 329
            YEDFAHYKSGVYKH+ GDV+GGHAVKLIGWGTT+ GEDYWLLANQWN SWGDDGYFMIRR
Sbjct: 267  YEDFAHYKSGVYKHVKGDVLGGHAVKLIGWGTTNDGEDYWLLANQWNISWGDDGYFMIRR 326

Query: 328  GTNECGIEEDVVAGLPSSRNLVREIASVEADA 233
            GTNECGIEEDVVAGLPSS+N + ++ASV+A A
Sbjct: 327  GTNECGIEEDVVAGLPSSKNFISQVASVDAVA 358


>gb|KCW88609.1| hypothetical protein EUGRSUZ_A00982 [Eucalyptus grandis]
          Length = 357

 Score =  569 bits (1466), Expect = e-159
 Identities = 257/328 (78%), Positives = 290/328 (88%)
 Frame = -2

Query: 1225 AKAVAERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFRHLLGVKPT 1046
            A+  AE+ ++++KLNSHILQ SIIK+IN NPNAGW+AA+NPRFSN+T+GQF+HLLGVKPT
Sbjct: 23   AQVNAEKSLSQLKLNSHILQNSIIKEINENPNAGWQAAMNPRFSNFTVGQFKHLLGVKPT 82

Query: 1045 PQKDLKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRF 866
            P  +L  +P+  H KSLKLP KFDARTAW QCSTIGRILDQGHCGSCWAFGAVESLSDRF
Sbjct: 83   PHGELTQVPIKTHPKSLKLPEKFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRF 142

Query: 865  CIHFDVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWRYFVHHGVVTEECDPYFDDTGCSH 686
            CIHF +NISLSVNDLLA            GYP++AWRYF+HHGVVTEEC PYFDD GCSH
Sbjct: 143  CIHFGMNISLSVNDLLACCGFMCGAGCNGGYPMFAWRYFMHHGVVTEECYPYFDDIGCSH 202

Query: 685  PGCEPAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIKSDPHSIMAEIYKNGPVEVSFTVY 506
            PGCEP YPTPKCVRKCVN NQ+W +SKHYS+SAYR+ SDP++IMAEIYKNGPVEVSFTVY
Sbjct: 203  PGCEPEYPTPKCVRKCVNGNQMWRSSKHYSVSAYRVDSDPYNIMAEIYKNGPVEVSFTVY 262

Query: 505  EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIRRG 326
            EDFAHYKSGVYKH+TGDV+GGHAVKLIGWGTTD GEDYWL+ANQWNRSWGDDGYF IRRG
Sbjct: 263  EDFAHYKSGVYKHVTGDVLGGHAVKLIGWGTTDDGEDYWLIANQWNRSWGDDGYFKIRRG 322

Query: 325  TNECGIEEDVVAGLPSSRNLVREIASVE 242
            TNECGIE DVV GLPS++NLVR++ SV+
Sbjct: 323  TNECGIEGDVVTGLPSTKNLVRKVVSVD 350


>ref|XP_006491433.1| PREDICTED: cathepsin B-like isoform X1 [Citrus sinensis]
          Length = 362

 Score =  565 bits (1457), Expect = e-158
 Identities = 262/327 (80%), Positives = 285/327 (87%)
 Frame = -2

Query: 1222 KAVAERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFRHLLGVKPTP 1043
            K  AE  V+K+KL+SHILQ+SIIK++N NP AGW+AA NP+FSNYT+GQF+HLLGVKPTP
Sbjct: 29   KTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTP 88

Query: 1042 QKDLKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFC 863
            +  L  +PV  H KSLKLP  FDAR+AWPQCSTI RILDQGHCGSCWAFGAVE+LSDRFC
Sbjct: 89   KGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC 148

Query: 862  IHFDVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWRYFVHHGVVTEECDPYFDDTGCSHP 683
            IHF +N+SLSVNDLLA            GYPI AWRYFVHHGVVTEECDPYFD TGCSHP
Sbjct: 149  IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHP 208

Query: 682  GCEPAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIKSDPHSIMAEIYKNGPVEVSFTVYE 503
            GCEPAYPTPKCVRKCV KNQLW NSKHYSISAYRI SDP  IMAEIYKNGPVEVSFTVYE
Sbjct: 209  GCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 268

Query: 502  DFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIRRGT 323
            DFAHYKSGVYKHITGDVMGGHAVKLIGWGT+D GEDYW+LANQWNRSWG DGYF I+RG+
Sbjct: 269  DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 328

Query: 322  NECGIEEDVVAGLPSSRNLVREIASVE 242
            NECGIEEDVVAGLPSS+NLV+EI S +
Sbjct: 329  NECGIEEDVVAGLPSSKNLVKEITSAD 355


>ref|XP_006444663.1| hypothetical protein CICLE_v10020859mg [Citrus clementina]
            gi|568876746|ref|XP_006491434.1| PREDICTED: cathepsin
            B-like isoform X2 [Citrus sinensis]
            gi|557546925|gb|ESR57903.1| hypothetical protein
            CICLE_v10020859mg [Citrus clementina]
            gi|641868024|gb|KDO86708.1| hypothetical protein
            CISIN_1g018568mg [Citrus sinensis]
          Length = 354

 Score =  564 bits (1454), Expect = e-158
 Identities = 261/328 (79%), Positives = 286/328 (87%)
 Frame = -2

Query: 1225 AKAVAERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFRHLLGVKPT 1046
            ++  AE  V+K+KL+SHILQ+SIIK++N NP AGW+AA NP+FSNYT+GQF+HLLGVKPT
Sbjct: 20   SQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPT 79

Query: 1045 PQKDLKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRF 866
            P+  L  +PV  H KSLKLP  FDAR+AWPQCSTI RILDQGHCGSCWAFGAVE+LSDRF
Sbjct: 80   PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 139

Query: 865  CIHFDVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWRYFVHHGVVTEECDPYFDDTGCSH 686
            CIHF +N+SLSVNDLLA            GYPI AWRYFVHHGVVTEECDPYFD TGCSH
Sbjct: 140  CIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH 199

Query: 685  PGCEPAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIKSDPHSIMAEIYKNGPVEVSFTVY 506
            PGCEPAYPTPKCVRKCV KNQLW NSKHYSISAYRI SDP  IMAEIYKNGPVEVSFTVY
Sbjct: 200  PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259

Query: 505  EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIRRG 326
            EDFAHYKSGVYKHITGDVMGGHAVKLIGWGT+D GEDYW+LANQWNRSWG DGYF I+RG
Sbjct: 260  EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 319

Query: 325  TNECGIEEDVVAGLPSSRNLVREIASVE 242
            +NECGIEEDVVAGLPSS+NLV+EI S +
Sbjct: 320  SNECGIEEDVVAGLPSSKNLVKEITSAD 347


>gb|KDO86712.1| hypothetical protein CISIN_1g018568mg [Citrus sinensis]
          Length = 349

 Score =  564 bits (1453), Expect = e-158
 Identities = 261/324 (80%), Positives = 284/324 (87%)
 Frame = -2

Query: 1213 AERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFRHLLGVKPTPQKD 1034
            AE  V+K+KL+SHILQ+SIIK++N NP AGW+AA NP+FSNYT+GQF+HLLGVKPTP+  
Sbjct: 19   AEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGL 78

Query: 1033 LKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHF 854
            L  +PV  H KSLKLP  FDAR+AWPQCSTI RILDQGHCGSCWAFGAVE+LSDRFCIHF
Sbjct: 79   LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF 138

Query: 853  DVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWRYFVHHGVVTEECDPYFDDTGCSHPGCE 674
             +N+SLSVNDLLA            GYPI AWRYFVHHGVVTEECDPYFD TGCSHPGCE
Sbjct: 139  GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 198

Query: 673  PAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIKSDPHSIMAEIYKNGPVEVSFTVYEDFA 494
            PAYPTPKCVRKCV KNQLW NSKHYSISAYRI SDP  IMAEIYKNGPVEVSFTVYEDFA
Sbjct: 199  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 258

Query: 493  HYKSGVYKHITGDVMGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIRRGTNEC 314
            HYKSGVYKHITGDVMGGHAVKLIGWGT+D GEDYW+LANQWNRSWG DGYF I+RG+NEC
Sbjct: 259  HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 318

Query: 313  GIEEDVVAGLPSSRNLVREIASVE 242
            GIEEDVVAGLPSS+NLV+EI S +
Sbjct: 319  GIEEDVVAGLPSSKNLVKEITSAD 342


>gb|KDO86711.1| hypothetical protein CISIN_1g018568mg [Citrus sinensis]
          Length = 351

 Score =  564 bits (1453), Expect = e-158
 Identities = 261/324 (80%), Positives = 284/324 (87%)
 Frame = -2

Query: 1213 AERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFRHLLGVKPTPQKD 1034
            AE  V+K+KL+SHILQ+SIIK++N NP AGW+AA NP+FSNYT+GQF+HLLGVKPTP+  
Sbjct: 21   AEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGL 80

Query: 1033 LKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHF 854
            L  +PV  H KSLKLP  FDAR+AWPQCSTI RILDQGHCGSCWAFGAVE+LSDRFCIHF
Sbjct: 81   LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF 140

Query: 853  DVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWRYFVHHGVVTEECDPYFDDTGCSHPGCE 674
             +N+SLSVNDLLA            GYPI AWRYFVHHGVVTEECDPYFD TGCSHPGCE
Sbjct: 141  GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 200

Query: 673  PAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIKSDPHSIMAEIYKNGPVEVSFTVYEDFA 494
            PAYPTPKCVRKCV KNQLW NSKHYSISAYRI SDP  IMAEIYKNGPVEVSFTVYEDFA
Sbjct: 201  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260

Query: 493  HYKSGVYKHITGDVMGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIRRGTNEC 314
            HYKSGVYKHITGDVMGGHAVKLIGWGT+D GEDYW+LANQWNRSWG DGYF I+RG+NEC
Sbjct: 261  HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 320

Query: 313  GIEEDVVAGLPSSRNLVREIASVE 242
            GIEEDVVAGLPSS+NLV+EI S +
Sbjct: 321  GIEEDVVAGLPSSKNLVKEITSAD 344


>ref|XP_003521632.1| PREDICTED: cathepsin B [Glycine max] gi|734356537|gb|KHN14189.1|
            Cathepsin B [Glycine soja] gi|947120120|gb|KRH68369.1|
            hypothetical protein GLYMA_03G226300 [Glycine max]
          Length = 357

 Score =  563 bits (1452), Expect = e-157
 Identities = 256/336 (76%), Positives = 284/336 (84%)
 Frame = -2

Query: 1228 QAKAVAERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFRHLLGVKP 1049
            Q      +P+  +KLNSHILQES  K+IN NP AGWEAA+NPRFSNYT+ QF+ LLGVKP
Sbjct: 22   QIAGAEAQPLTSLKLNSHILQESTAKEINENPEAGWEAAINPRFSNYTVEQFKRLLGVKP 81

Query: 1048 TPQKDLKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDR 869
             P+K+L++ P I+H K+LKLP  FDARTAW QCSTIGRILDQGHCGSCWAFGAVESLSDR
Sbjct: 82   MPKKELRSTPAISHPKTLKLPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDR 141

Query: 868  FCIHFDVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWRYFVHHGVVTEECDPYFDDTGCS 689
            FCIHFDVNISLSVNDLLA            GYP+YAWRY  HHGVVTEECDPYFD  GCS
Sbjct: 142  FCIHFDVNISLSVNDLLACCGFLCGSGCDGGYPLYAWRYLAHHGVVTEECDPYFDQIGCS 201

Query: 688  HPGCEPAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIKSDPHSIMAEIYKNGPVEVSFTV 509
            HPGCEPAY TPKCV+KCV+ NQ+W  SKHYS+SAYR+ SDPH IMAE+YKNGPVEV+FTV
Sbjct: 202  HPGCEPAYRTPKCVKKCVSGNQVWKKSKHYSVSAYRVNSDPHDIMAEVYKNGPVEVAFTV 261

Query: 508  YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIRR 329
            YEDFA+YKSGVYKHITG  +GGHAVKLIGWGTTD GEDYWLLANQWNR WGDDGYF IRR
Sbjct: 262  YEDFAYYKSGVYKHITGYELGGHAVKLIGWGTTDDGEDYWLLANQWNREWGDDGYFKIRR 321

Query: 328  GTNECGIEEDVVAGLPSSRNLVREIASVEADAVVSF 221
            GTNECGIEEDV AGLPS++NLVRE+  ++ADA VSF
Sbjct: 322  GTNECGIEEDVTAGLPSTKNLVREVTDMDADAAVSF 357


>ref|XP_012083054.1| PREDICTED: cathepsin B [Jatropha curcas] gi|643716748|gb|KDP28374.1|
            hypothetical protein JCGZ_14145 [Jatropha curcas]
          Length = 358

 Score =  562 bits (1448), Expect = e-157
 Identities = 252/329 (76%), Positives = 292/329 (88%), Gaps = 2/329 (0%)
 Frame = -2

Query: 1222 KAVAERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFRHLLGVKPTP 1043
            + +AE P +K+KL+S +LQ+SII++IN NPNAGWEAA+NPRFSNYT+G+F++LLGVKPTP
Sbjct: 23   QVIAEAPDSKLKLSSRVLQDSIIRKINENPNAGWEAAMNPRFSNYTVGEFKYLLGVKPTP 82

Query: 1042 QKDLKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFC 863
            +K+L+ +P+++H KSLKLP +FDAR+AWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFC
Sbjct: 83   KKELRGVPLVSHPKSLKLPKEFDARSAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFC 142

Query: 862  IHFDVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWRYFVHHGVVTEECDPYFDDTGCSHP 683
            I+F +NISLSVNDLLA            GYP+YAWRY VHHGVVTEECDPYFDD GCSHP
Sbjct: 143  INFGMNISLSVNDLLACCGFLCGNGCDGGYPLYAWRYLVHHGVVTEECDPYFDDIGCSHP 202

Query: 682  GCEPAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIKSDPHSIMAEIYKNGPVEVSFTVYE 503
            GCEP +PTP+CVRKCV+KNQ W  SKHYS++AYRI+SDP+ IMAE+YKNGPVEV+FTVYE
Sbjct: 203  GCEPGFPTPRCVRKCVDKNQFWRQSKHYSVNAYRIRSDPYDIMAELYKNGPVEVAFTVYE 262

Query: 502  DFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIRRGT 323
            DFAHYKSGVYKHITGD +GGHAVKLIGWGT+D GEDYWLLANQWNR WGDDGYF I+RG 
Sbjct: 263  DFAHYKSGVYKHITGDQLGGHAVKLIGWGTSDDGEDYWLLANQWNRGWGDDGYFKIKRGV 322

Query: 322  NECGIEEDVVAGLPSSR--NLVREIASVE 242
            NECGIEEDVVAGLPSSR  NLVRE+A  +
Sbjct: 323  NECGIEEDVVAGLPSSRNLNLVREVAGTD 351


>ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max] gi|947047007|gb|KRG96636.1|
            hypothetical protein GLYMA_19G223300 [Glycine max]
          Length = 356

 Score =  560 bits (1443), Expect = e-156
 Identities = 256/343 (74%), Positives = 285/343 (83%)
 Frame = -2

Query: 1249 VFSTLKNQAKAVAERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFR 1070
            V S    Q      +P+  +KLNS ILQESI K+IN NP AGWEAA+NP FSNYT+ QF+
Sbjct: 14   VLSASYLQIAGAKAQPLTSLKLNSPILQESIAKEINENPEAGWEAAINPHFSNYTVEQFK 73

Query: 1069 HLLGVKPTPQKDLKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQGHCGSCWAFGA 890
             LLGVKPTP+K+L++ P I+H KSLKLP  FDARTAW QCSTIGRILDQGHCGSCWAFGA
Sbjct: 74   RLLGVKPTPKKELRSTPAISHPKSLKLPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGA 133

Query: 889  VESLSDRFCIHFDVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWRYFVHHGVVTEECDPY 710
            VESLSDRFCIHFDVNISLSVNDLLA            GYP+YAW+Y  HHGVVTEECDPY
Sbjct: 134  VESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGGYPLYAWQYLAHHGVVTEECDPY 193

Query: 709  FDDTGCSHPGCEPAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIKSDPHSIMAEIYKNGP 530
            FD  GCSHPGCEPAY TPKCV+KCV+ NQ+W  SKHYS++AYR+ SDPH IM E+YKNGP
Sbjct: 194  FDQIGCSHPGCEPAYRTPKCVKKCVSGNQVWKKSKHYSVNAYRVSSDPHDIMTEVYKNGP 253

Query: 529  VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDD 350
            VEV+FTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGTT+ GEDYWLLANQWNR WGDD
Sbjct: 254  VEVAFTVYEDFAHYKSGVYKHITGYELGGHAVKLIGWGTTEDGEDYWLLANQWNREWGDD 313

Query: 349  GYFMIRRGTNECGIEEDVVAGLPSSRNLVREIASVEADAVVSF 221
            GYF IRRGTNECGIEEDV AGLPS++NLVRE+  ++ADA VSF
Sbjct: 314  GYFKIRRGTNECGIEEDVTAGLPSTKNLVREVTDMDADAAVSF 356


>ref|XP_011025297.1| PREDICTED: cathepsin B-like [Populus euphratica]
          Length = 357

 Score =  559 bits (1440), Expect = e-156
 Identities = 252/330 (76%), Positives = 287/330 (86%)
 Frame = -2

Query: 1228 QAKAVAERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFRHLLGVKP 1049
            Q++ +A  PV+ +KLNS ILQ+SI+K++NGNP AGW+A +N  FSNY++ QF++LLGVKP
Sbjct: 22   QSQVIAVEPVSDLKLNSRILQDSILKKVNGNPKAGWKATMNHHFSNYSVAQFKYLLGVKP 81

Query: 1048 TPQKDLKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDR 869
            TP+++L+ IPVI+H KSL+LP +FDARTAWPQCSTIG+ILDQGHCGSCWAFGAVESLSDR
Sbjct: 82   TPKEELRGIPVISHPKSLRLPEEFDARTAWPQCSTIGKILDQGHCGSCWAFGAVESLSDR 141

Query: 868  FCIHFDVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWRYFVHHGVVTEECDPYFDDTGCS 689
            FCIH+ +NISLSVNDLLA            GYPI AWRYFVHHGVVTEECDPYFDD GCS
Sbjct: 142  FCIHYGMNISLSVNDLLACCGFLCGSGCDGGYPISAWRYFVHHGVVTEECDPYFDDIGCS 201

Query: 688  HPGCEPAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIKSDPHSIMAEIYKNGPVEVSFTV 509
            HPGCEP YPTPKC RKCVNKNQLW  SKHY +  YRI SDP+SIMAEIYKNGPVEV+FTV
Sbjct: 202  HPGCEPGYPTPKCTRKCVNKNQLWKKSKHYGVKPYRIDSDPNSIMAEIYKNGPVEVAFTV 261

Query: 508  YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIRR 329
            YEDFAHYKSGVYKHITG +MGGHAVKLIGWGT++ GE YWLLANQWNR WGDDGYF IRR
Sbjct: 262  YEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTSEDGEAYWLLANQWNRGWGDDGYFKIRR 321

Query: 328  GTNECGIEEDVVAGLPSSRNLVREIASVEA 239
            GTNECGIE DVVAGLPS+RNLVRE+ S++A
Sbjct: 322  GTNECGIEGDVVAGLPSTRNLVREVVSIDA 351


>ref|XP_007051347.1| Cysteine proteinases superfamily protein [Theobroma cacao]
            gi|508703608|gb|EOX95504.1| Cysteine proteinases
            superfamily protein [Theobroma cacao]
          Length = 359

 Score =  558 bits (1439), Expect = e-156
 Identities = 253/327 (77%), Positives = 282/327 (86%)
 Frame = -2

Query: 1222 KAVAERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFRHLLGVKPTP 1043
            K +A   ++++KLNS ILQ+SI+KQ+N NP AGW+AA+NPR SNYT+G+F+HLLGVKPTP
Sbjct: 24   KVIAVEQLSEVKLNSQILQDSIVKQVNENPKAGWKAALNPRLSNYTVGEFKHLLGVKPTP 83

Query: 1042 QKDLKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFC 863
            +K+L  IPVI H KSLK+P KFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFC
Sbjct: 84   KKELLGIPVITHGKSLKVPTKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFC 143

Query: 862  IHFDVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWRYFVHHGVVTEECDPYFDDTGCSHP 683
            IHF +NISLSVNDLLA            GYPI AWRYFV  GVVTEECDPYFDDTGCSHP
Sbjct: 144  IHFSMNISLSVNDLLACCGFLCGSGCDGGYPISAWRYFVRRGVVTEECDPYFDDTGCSHP 203

Query: 682  GCEPAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIKSDPHSIMAEIYKNGPVEVSFTVYE 503
            GCEPAYPTP+CV+KCV  NQLW  SKHYS+ AYRI SDP  IMAE+Y NGPVEVSFTVYE
Sbjct: 204  GCEPAYPTPRCVKKCVKGNQLWRESKHYSVGAYRINSDPADIMAEVYTNGPVEVSFTVYE 263

Query: 502  DFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIRRGT 323
            DFAHYKSGVYKH+TG VMGGHAVKLIGWGT+D GEDYWLLANQWNR WGDDGYF I RGT
Sbjct: 264  DFAHYKSGVYKHVTGGVMGGHAVKLIGWGTSDDGEDYWLLANQWNRGWGDDGYFKISRGT 323

Query: 322  NECGIEEDVVAGLPSSRNLVREIASVE 242
            NECGIE+DVVAGLPS++NLVRE+  ++
Sbjct: 324  NECGIEDDVVAGLPSTKNLVREVGDMD 350


>ref|XP_002301457.2| putative cathepsin B-like protease family protein [Populus
            trichocarpa] gi|550345314|gb|EEE80730.2| putative
            cathepsin B-like protease family protein [Populus
            trichocarpa]
          Length = 357

 Score =  558 bits (1438), Expect = e-156
 Identities = 252/330 (76%), Positives = 286/330 (86%)
 Frame = -2

Query: 1228 QAKAVAERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFRHLLGVKP 1049
            Q++ +A  PV+ +KLNS ILQ+SI+K++NGNP AGW+A +N  FSNYT+ QF++LLGVKP
Sbjct: 22   QSQVIAVEPVSDLKLNSRILQDSILKKVNGNPKAGWKATMNHHFSNYTVAQFKYLLGVKP 81

Query: 1048 TPQKDLKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDR 869
            TP+++L+ IPVI+H KSL+LP +FDARTAWPQCSTIG+ILDQGHCGSCWAFGAVESLSDR
Sbjct: 82   TPKEELRGIPVISHPKSLRLPEEFDARTAWPQCSTIGKILDQGHCGSCWAFGAVESLSDR 141

Query: 868  FCIHFDVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWRYFVHHGVVTEECDPYFDDTGCS 689
            FCIH+ +NISLSVNDLLA            GYPI AWRYFVHHGVVTEECDPYFDD GCS
Sbjct: 142  FCIHYGMNISLSVNDLLACCGFLCGSGCNGGYPISAWRYFVHHGVVTEECDPYFDDIGCS 201

Query: 688  HPGCEPAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIKSDPHSIMAEIYKNGPVEVSFTV 509
            HPGCEP YPTPKC RKCVNKNQLW  SKHY +  YRI SDP SIMAEIYKNGPVEV+FTV
Sbjct: 202  HPGCEPGYPTPKCARKCVNKNQLWKKSKHYGVKPYRIDSDPDSIMAEIYKNGPVEVAFTV 261

Query: 508  YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIRR 329
            YEDFAHYKSGVYKHITG +MGGHAVKLIGWGT++ GE YWLLANQWNR WGDDG+F IRR
Sbjct: 262  YEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTSEDGEAYWLLANQWNRGWGDDGFFKIRR 321

Query: 328  GTNECGIEEDVVAGLPSSRNLVREIASVEA 239
            GTNECGIE DVVAGLPS+RNLVRE+ S++A
Sbjct: 322  GTNECGIEGDVVAGLPSTRNLVREVVSIDA 351


>ref|XP_010247244.1| PREDICTED: cathepsin B-like [Nelumbo nucifera]
          Length = 355

 Score =  558 bits (1437), Expect = e-156
 Identities = 259/334 (77%), Positives = 282/334 (84%)
 Frame = -2

Query: 1222 KAVAERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFRHLLGVKPTP 1043
            + VA +P+  IK  + ILQE I+  IN NP A WEAA+NPRF+NYTI QF+HLLGVKP P
Sbjct: 22   QVVAVKPILPIKPRTEILQEEIVWHINANPKARWEAAMNPRFTNYTIAQFKHLLGVKPAP 81

Query: 1042 QKDLKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFC 863
            Q DL+ IPVI H KSL LP +FDAR AWPQCSTIG+ILDQGHCGSCWAFGAVESLSDRFC
Sbjct: 82   QNDLEGIPVITHPKSLNLPKQFDARMAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFC 141

Query: 862  IHFDVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWRYFVHHGVVTEECDPYFDDTGCSHP 683
            IHF +NISLSVNDLLA            GYPI AWRYFVH GVVTEECDPYFD+ GCSHP
Sbjct: 142  IHFGMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVHDGVVTEECDPYFDEIGCSHP 201

Query: 682  GCEPAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIKSDPHSIMAEIYKNGPVEVSFTVYE 503
            GCEPAYPTPKC RKC + NQ+W  SKH+S+ AYRI SDP+SIMAE+YKNGPVEVSFTVYE
Sbjct: 202  GCEPAYPTPKCERKCKDANQVWQESKHFSVGAYRISSDPYSIMAEVYKNGPVEVSFTVYE 261

Query: 502  DFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIRRGT 323
            DFAHYKSGVYKHITGDVMGGHAVKLIGWGTTD GEDYWLLANQWNRSWGDDGYFMIRRGT
Sbjct: 262  DFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRSWGDDGYFMIRRGT 321

Query: 322  NECGIEEDVVAGLPSSRNLVREIASVEADAVVSF 221
            NECGIEEDVVAGLPSS+NL+R+   V+A   VSF
Sbjct: 322  NECGIEEDVVAGLPSSKNLIRKQIGVDASLHVSF 355


>ref|XP_011025296.1| PREDICTED: cathepsin B-like [Populus euphratica]
          Length = 356

 Score =  556 bits (1434), Expect = e-155
 Identities = 248/329 (75%), Positives = 289/329 (87%)
 Frame = -2

Query: 1225 AKAVAERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFRHLLGVKPT 1046
            ++ +A  PV+K+KLNS ILQ+SI+++IN NPNAGWEA +NP+FSNY++G+F++LLGVKPT
Sbjct: 22   SQVIAVEPVSKLKLNSRILQDSIVQKINENPNAGWEATMNPQFSNYSVGEFKYLLGVKPT 81

Query: 1045 PQKDLKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRF 866
            P K+L+ +P++ H KS+KLP +FDARTAWP CSTIGRILDQGHCGSCWAFGAVESLSDRF
Sbjct: 82   PGKELRGVPLVRHPKSMKLPKEFDARTAWPHCSTIGRILDQGHCGSCWAFGAVESLSDRF 141

Query: 865  CIHFDVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWRYFVHHGVVTEECDPYFDDTGCSH 686
            CIH+ +N+SLSVNDLLA            GYPI AWRYFV  GVVTEECDPYFDD GCSH
Sbjct: 142  CIHYGMNLSLSVNDLLACCGWMCGDGCDGGYPIDAWRYFVQSGVVTEECDPYFDDIGCSH 201

Query: 685  PGCEPAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIKSDPHSIMAEIYKNGPVEVSFTVY 506
            PGCEP +PTPKC RKC +KN+LW+ SKH+S++AYRI SDPHSIMAE+ +NGPVEVSFTVY
Sbjct: 202  PGCEPGFPTPKCERKCADKNKLWAESKHFSVNAYRIDSDPHSIMAEVSRNGPVEVSFTVY 261

Query: 505  EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIRRG 326
            EDFAHYKSGVYKHITGD MGGHAVKLIGWGT+D GEDYWLLANQWNR WGDDGYF I+RG
Sbjct: 262  EDFAHYKSGVYKHITGDAMGGHAVKLIGWGTSDDGEDYWLLANQWNRGWGDDGYFKIKRG 321

Query: 325  TNECGIEEDVVAGLPSSRNLVREIASVEA 239
            TNECGIEEDVVAGLPS+RNLVRE+A ++A
Sbjct: 322  TNECGIEEDVVAGLPSTRNLVREVAKIDA 350


>ref|XP_012489194.1| PREDICTED: cathepsin B-like isoform X2 [Gossypium raimondii]
            gi|763773151|gb|KJB40274.1| hypothetical protein
            B456_007G055400 [Gossypium raimondii]
          Length = 354

 Score =  553 bits (1425), Expect = e-154
 Identities = 250/323 (77%), Positives = 280/323 (86%)
 Frame = -2

Query: 1222 KAVAERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFRHLLGVKPTP 1043
            K +A   ++ +KLNS ILQ+SI+KQ+N NP AGWEAA+NPRFSNYTIG+F+HLLGVKPTP
Sbjct: 21   KVIAVEQLSDVKLNSRILQDSIVKQVNQNPKAGWEAALNPRFSNYTIGEFKHLLGVKPTP 80

Query: 1042 QKDLKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFC 863
            +K+L  +P++ H KSLKLP  FDARTAWPQC++IGRILDQGHCGSCWAFGAVESLSDRFC
Sbjct: 81   KKELLGVPILTHDKSLKLPTSFDARTAWPQCTSIGRILDQGHCGSCWAFGAVESLSDRFC 140

Query: 862  IHFDVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWRYFVHHGVVTEECDPYFDDTGCSHP 683
            IHFD+NISLSVNDLLA            GYPI AWRYFV  GVVTEECDPYFDD GCSHP
Sbjct: 141  IHFDMNISLSVNDLLACCGFLCGDGCDGGYPISAWRYFVRSGVVTEECDPYFDDIGCSHP 200

Query: 682  GCEPAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIKSDPHSIMAEIYKNGPVEVSFTVYE 503
            GCEPA+PTPKCVRKCV  N LW  SKHYS+ AYRIKS+P  IMAE+YKNGPVEVSFTVYE
Sbjct: 201  GCEPAFPTPKCVRKCVKGNLLWKQSKHYSVGAYRIKSNPADIMAEVYKNGPVEVSFTVYE 260

Query: 502  DFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIRRGT 323
            DFAHYKSGVYKH+TGDVMGGHAVKLIGWGT+D GEDYWLLANQWNR WGDDGYF I+RG 
Sbjct: 261  DFAHYKSGVYKHLTGDVMGGHAVKLIGWGTSDDGEDYWLLANQWNRGWGDDGYFKIKRGV 320

Query: 322  NECGIEEDVVAGLPSSRNLVREI 254
            +ECGIE DVVAGLPS++NLVR++
Sbjct: 321  DECGIESDVVAGLPSTKNLVRQV 343


>ref|XP_010045446.1| PREDICTED: cathepsin B-like [Eucalyptus grandis]
          Length = 380

 Score =  553 bits (1425), Expect = e-154
 Identities = 256/351 (72%), Positives = 289/351 (82%), Gaps = 23/351 (6%)
 Frame = -2

Query: 1225 AKAVAERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFRHLLGVKPT 1046
            A+  AE+ ++++KLNSHILQ SIIK+IN NPNAGW+AA+NPRFSN+T+GQF+HLLGVKPT
Sbjct: 23   AQVNAEKSLSQLKLNSHILQNSIIKEINENPNAGWQAAMNPRFSNFTVGQFKHLLGVKPT 82

Query: 1045 PQKDLKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQ------------------- 923
            P  +L  +P+  H KSLKLP KFDARTAW QCSTIGRIL Q                   
Sbjct: 83   PHGELTQVPIKTHPKSLKLPEKFDARTAWSQCSTIGRILGQFVCLVFHLIYSRYYADAFS 142

Query: 922  ----GHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWR 755
                GHCGSCWAFGAVESLSDRFCIHF +NISLSVNDLLA            GYP++AWR
Sbjct: 143  LLCDGHCGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLACCGFMCGAGCNGGYPMFAWR 202

Query: 754  YFVHHGVVTEECDPYFDDTGCSHPGCEPAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIK 575
            YF+HHGVVTEEC PYFDD GCSHPGCEP YPTPKCVRKCVN NQ+W +SKHYS+SAYR+ 
Sbjct: 203  YFMHHGVVTEECYPYFDDIGCSHPGCEPEYPTPKCVRKCVNGNQMWRSSKHYSVSAYRVD 262

Query: 574  SDPHSIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDAGED 395
            SDP++IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH+TGDV+GGHAVKLIGWGTTD GED
Sbjct: 263  SDPYNIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVLGGHAVKLIGWGTTDDGED 322

Query: 394  YWLLANQWNRSWGDDGYFMIRRGTNECGIEEDVVAGLPSSRNLVREIASVE 242
            YWL+ANQWNRSWGDDGYF IRRGTNECGIE DVV GLPS++NLVR++ SV+
Sbjct: 323  YWLIANQWNRSWGDDGYFKIRRGTNECGIEGDVVTGLPSTKNLVRKVVSVD 373


>ref|XP_012489193.1| PREDICTED: cathepsin B-like isoform X1 [Gossypium raimondii]
            gi|763773152|gb|KJB40275.1| hypothetical protein
            B456_007G055400 [Gossypium raimondii]
          Length = 355

 Score =  551 bits (1421), Expect = e-154
 Identities = 249/323 (77%), Positives = 280/323 (86%)
 Frame = -2

Query: 1222 KAVAERPVNKIKLNSHILQESIIKQINGNPNAGWEAAVNPRFSNYTIGQFRHLLGVKPTP 1043
            + +A   ++ +KLNS ILQ+SI+KQ+N NP AGWEAA+NPRFSNYTIG+F+HLLGVKPTP
Sbjct: 22   QVIAVEQLSDVKLNSRILQDSIVKQVNQNPKAGWEAALNPRFSNYTIGEFKHLLGVKPTP 81

Query: 1042 QKDLKNIPVIAHAKSLKLPNKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFC 863
            +K+L  +P++ H KSLKLP  FDARTAWPQC++IGRILDQGHCGSCWAFGAVESLSDRFC
Sbjct: 82   KKELLGVPILTHDKSLKLPTSFDARTAWPQCTSIGRILDQGHCGSCWAFGAVESLSDRFC 141

Query: 862  IHFDVNISLSVNDLLAXXXXXXXXXXXXGYPIYAWRYFVHHGVVTEECDPYFDDTGCSHP 683
            IHFD+NISLSVNDLLA            GYPI AWRYFV  GVVTEECDPYFDD GCSHP
Sbjct: 142  IHFDMNISLSVNDLLACCGFLCGDGCDGGYPISAWRYFVRSGVVTEECDPYFDDIGCSHP 201

Query: 682  GCEPAYPTPKCVRKCVNKNQLWSNSKHYSISAYRIKSDPHSIMAEIYKNGPVEVSFTVYE 503
            GCEPA+PTPKCVRKCV  N LW  SKHYS+ AYRIKS+P  IMAE+YKNGPVEVSFTVYE
Sbjct: 202  GCEPAFPTPKCVRKCVKGNLLWKQSKHYSVGAYRIKSNPADIMAEVYKNGPVEVSFTVYE 261

Query: 502  DFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDAGEDYWLLANQWNRSWGDDGYFMIRRGT 323
            DFAHYKSGVYKH+TGDVMGGHAVKLIGWGT+D GEDYWLLANQWNR WGDDGYF I+RG 
Sbjct: 262  DFAHYKSGVYKHLTGDVMGGHAVKLIGWGTSDDGEDYWLLANQWNRGWGDDGYFKIKRGV 321

Query: 322  NECGIEEDVVAGLPSSRNLVREI 254
            +ECGIE DVVAGLPS++NLVR++
Sbjct: 322  DECGIESDVVAGLPSTKNLVRQV 344


Top