BLASTX nr result

ID: Alisma22_contig00001149 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Alisma22_contig00001149
         (1302 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_015639550.1 PREDICTED: cathepsin B isoform X3 [Oryza sativa J...   197   1e-55
XP_011025296.1 PREDICTED: cathepsin B-like [Populus euphratica]       194   2e-54
XP_006664995.1 PREDICTED: cathepsin B-like [Oryza brachyantha] X...   193   5e-54
XP_006664962.1 PREDICTED: cathepsin B-like [Oryza brachyantha] X...   191   4e-53
ACU24206.1 unknown, partial [Glycine max]                             189   4e-53
KQL06903.1 hypothetical protein SETIT_001407mg [Setaria italica]      190   8e-53
XP_003521632.1 PREDICTED: cathepsin B [Glycine max] KHN14189.1 C...   189   9e-53
KQK93220.1 hypothetical protein SETIT_026554mg [Setaria italica]      189   9e-53
XP_015572445.1 PREDICTED: cathepsin B [Ricinus communis]              189   1e-52
XP_004978407.1 PREDICTED: cathepsin B-like [Setaria italica] XP_...   189   1e-52
XP_004969895.1 PREDICTED: cathepsin B-like [Setaria italica] XP_...   189   1e-52
AAR25797.1 cathepsin B-like cysteine proteinase, partial [Solanu...   184   2e-52
XP_010907756.1 PREDICTED: cathepsin B [Elaeis guineensis]             189   2e-52
XP_008776548.1 PREDICTED: cathepsin B-like isoform X3 [Phoenix d...   189   2e-52
OAY48801.1 hypothetical protein MANES_05G006300 [Manihot esculenta]   188   2e-52
XP_012083054.1 PREDICTED: cathepsin B [Jatropha curcas] KDP28374...   188   2e-52
KMZ74768.1 Cathepsin B [Zostera marina]                               188   3e-52
XP_009393126.1 PREDICTED: cathepsin B-like [Musa acuminata subsp...   188   3e-52
KQL06905.1 hypothetical protein SETIT_001407mg [Setaria italica]      190   6e-52
KXG36304.1 hypothetical protein SORBI_002G315800 [Sorghum bicolor]    186   6e-52

>XP_015639550.1 PREDICTED: cathepsin B isoform X3 [Oryza sativa Japonica Group]
            XP_015639551.1 PREDICTED: cathepsin B isoform X3 [Oryza
            sativa Japonica Group] AAX11351.1 cathepsin B-like
            cysteine protease [Oryza sativa Japonica Group]
            EAY97476.1 hypothetical protein OsI_19406 [Oryza sativa
            Indica Group] BAG89222.1 unnamed protein product [Oryza
            sativa Japonica Group] BAG94499.1 unnamed protein product
            [Oryza sativa Japonica Group] BAG87079.1 unnamed protein
            product [Oryza sativa Japonica Group] EEE63190.1
            hypothetical protein OsJ_17999 [Oryza sativa Japonica
            Group] AIV98516.1 cathepsin B-like cysteine protease
            [Oryza sativa Japonica Group]
          Length = 358

 Score =  197 bits (500), Expect = 1e-55
 Identities = 97/180 (53%), Positives = 113/180 (62%)
 Frame = +2

Query: 761  QAAKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEFCXXXXX 940
            +AAKPIP L      G  SRIIQ  I+  IN  P+ GWTA  NP FANYT  +F      
Sbjct: 21   RAAKPIPNLQLMTKEGGSSRIIQDDIIKAINKHPNAGWTAARNPYFANYTTAQF--KHIL 78

Query: 941  XXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCWAFGAVE 1120
                       D P V ++PR+L LPKEFD+R+AW  C TIG ILDQGHCGSCWAFGAVE
Sbjct: 79   GVKPTPHSVLNDVP-VKTYPRSLMLPKEFDARSAWSQCNTIGTILDQGHCGSCWAFGAVE 137

Query: 1121 ALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDECDPYFD 1300
             L DRFCIHFN  + LSVNDL++             +P+ AWRY  ++GVVTDECDPYFD
Sbjct: 138  CLQDRFCIHFNMNISLSVNDLVACCGFMCGDGCDGGYPIMAWRYFVRNGVVTDECDPYFD 197


>XP_011025296.1 PREDICTED: cathepsin B-like [Populus euphratica]
          Length = 356

 Score =  194 bits (492), Expect = 2e-54
 Identities = 99/194 (51%), Positives = 125/194 (64%), Gaps = 2/194 (1%)
 Frame = +2

Query: 725  FFLIAASVLPLLQ--AAKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVGWTAEANPRF 898
            FFL+AA      Q  A +P+ +L        +SRI+Q SIV +IN +P+ GW A  NP+F
Sbjct: 11   FFLVAALFTFYSQVIAVEPVSKLKL------NSRILQDSIVQKINENPNAGWEATMNPQF 64

Query: 899  ANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPHCPTIGAILD 1078
            +NY+V EF                   PLV  HP++++LPKEFD+RTAWPHC TIG ILD
Sbjct: 65   SNYSVGEF--KYLLGVKPTPGKELRGVPLV-RHPKSMKLPKEFDARTAWPHCSTIGRILD 121

Query: 1079 QGHCGSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXWPMQAWRYLS 1258
            QGHCGSCWAFGAVE+L+DRFCIH+   + LSVNDLL+             +P+ AWRY  
Sbjct: 122  QGHCGSCWAFGAVESLSDRFCIHYGMNLSLSVNDLLACCGWMCGDGCDGGYPIDAWRYFV 181

Query: 1259 KSGVVTDECDPYFD 1300
            +SGVVT+ECDPYFD
Sbjct: 182  QSGVVTEECDPYFD 195


>XP_006664995.1 PREDICTED: cathepsin B-like [Oryza brachyantha] XP_006664996.1
            PREDICTED: cathepsin B-like [Oryza brachyantha]
            XP_015698993.1 PREDICTED: cathepsin B-like [Oryza
            brachyantha]
          Length = 362

 Score =  193 bits (490), Expect = 5e-54
 Identities = 102/199 (51%), Positives = 117/199 (58%)
 Frame = +2

Query: 704  ILVINCCFFLIAASVLPLLQAAKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVGWTAE 883
            ILV  C      AS     +AAK IP        G  SRIIQ  I+  IN  P+ GWTA 
Sbjct: 14   ILVFTC------ASAPQATKAAKSIPDPQLTIEEGDSSRIIQDDIIKTINKHPNAGWTAA 67

Query: 884  ANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPHCPTI 1063
             NP FANYTV +F                 + P   ++ R+L LPKEFD+R+AW HC TI
Sbjct: 68   QNPYFANYTVAQF--KHILGVKATPHSLLSNVP-AKTYSRSLMLPKEFDARSAWSHCSTI 124

Query: 1064 GAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXWPMQA 1243
            G ILDQGHCGSCWAFGAVE L DRFCIHFN  + LSVNDLLS             +P+ A
Sbjct: 125  GTILDQGHCGSCWAFGAVECLQDRFCIHFNMNISLSVNDLLSCCGFMCGDGCDGGYPIMA 184

Query: 1244 WRYLSKSGVVTDECDPYFD 1300
            WRY  ++GVVTDECDPYFD
Sbjct: 185  WRYFVQNGVVTDECDPYFD 203


>XP_006664962.1 PREDICTED: cathepsin B-like [Oryza brachyantha] XP_006664963.1
            PREDICTED: cathepsin B-like [Oryza brachyantha]
            XP_006664964.1 PREDICTED: cathepsin B-like [Oryza
            brachyantha]
          Length = 362

 Score =  191 bits (484), Expect = 4e-53
 Identities = 101/199 (50%), Positives = 116/199 (58%)
 Frame = +2

Query: 704  ILVINCCFFLIAASVLPLLQAAKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVGWTAE 883
            ILV  C      AS     +AAK IP        G  SRIIQ  I+  IN  P+ GWTA 
Sbjct: 14   ILVFTC------ASAPQATKAAKSIPDPQLTIEEGDSSRIIQDDIIKTINKHPNAGWTAA 67

Query: 884  ANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPHCPTI 1063
             NP FANYTV +F                 + P   ++ R+L LPKEFD+R+AW HC TI
Sbjct: 68   QNPYFANYTVAQF--KHILGVKATPHSLLSNVP-AKTYSRSLMLPKEFDARSAWSHCSTI 124

Query: 1064 GAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXWPMQA 1243
            G ILDQGHCGSCWAFGAVE L DRFCIHFN    LSVNDLL+             +P+ A
Sbjct: 125  GTILDQGHCGSCWAFGAVECLQDRFCIHFNMNTSLSVNDLLACCGFMCGDGCDGGYPIMA 184

Query: 1244 WRYLSKSGVVTDECDPYFD 1300
            WRY  ++GVVTDECDPYFD
Sbjct: 185  WRYFVQNGVVTDECDPYFD 203


>ACU24206.1 unknown, partial [Glycine max]
          Length = 327

 Score =  189 bits (481), Expect = 4e-53
 Identities = 97/203 (47%), Positives = 125/203 (61%), Gaps = 1/203 (0%)
 Frame = +2

Query: 695  NSQILVINCCFFLIAASVLPLLQA-AKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVG 871
            ++ +L +   F L++AS L +  A A+P+  L        +S I+Q S    IN +P  G
Sbjct: 3    STHLLPLATFFLLLSASYLQIAGAEAQPLTSLKL------NSHILQESTAKEINENPEAG 56

Query: 872  WTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPH 1051
            W A  NPRF+NYTV++F                   P  +SHP+TL+LPK FD+RTAW  
Sbjct: 57   WEAAINPRFSNYTVEQF--KRLLGVKPMPKKELRSTP-AISHPKTLKLPKNFDARTAWSQ 113

Query: 1052 CPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXW 1231
            C TIG ILDQGHCGSCWAFGAVE+L+DRFCIHF+  + LSVNDLL+             +
Sbjct: 114  CSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGGY 173

Query: 1232 PMQAWRYLSKSGVVTDECDPYFD 1300
            P+ AWRYL+  GVVT+ECDPYFD
Sbjct: 174  PLYAWRYLAHHGVVTEECDPYFD 196


>KQL06903.1 hypothetical protein SETIT_001407mg [Setaria italica]
          Length = 366

 Score =  190 bits (482), Expect = 8e-53
 Identities = 100/226 (44%), Positives = 131/226 (57%), Gaps = 1/226 (0%)
 Frame = +2

Query: 626  SRERLVAAQRVRGSSSRDKMGKLNSQILVINCCFFLIAASVLPLLQAAKPIPRLWAAAGA 805
            S++    +++++      KMG    Q L++   FF ++A    +++A KPIP        
Sbjct: 72   SKQATRESRQLKRIDKNKKMGGTLQQQLLL---FFFLSAVAPQVVRAVKPIPNSNLGVEE 128

Query: 806  GSDS-RIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEP 982
            G +S  IIQ  I+  +N  P  GWTA  NP FANYT+ +F                 D P
Sbjct: 129  GDNSIGIIQKDIIQTVNKHPDAGWTAAHNPYFANYTIAQF--KHILGVKPTPRDALTDVP 186

Query: 983  LVVSHPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATV 1162
               ++ R+L+LPKEFD+R+ W HC TIG ILDQGHCGSCWAFGAVE L DRFCIH N  +
Sbjct: 187  -AKTYSRSLKLPKEFDARSKWSHCSTIGNILDQGHCGSCWAFGAVECLQDRFCIHMNVNI 245

Query: 1163 KLSVNDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDECDPYFD 1300
             LSVNDLL+             +P+ AWRY  ++GVVTDECDPYFD
Sbjct: 246  SLSVNDLLACCGFMCGDGCDGGYPIMAWRYFVQNGVVTDECDPYFD 291


>XP_003521632.1 PREDICTED: cathepsin B [Glycine max] KHN14189.1 Cathepsin B [Glycine
            soja] KRH68369.1 hypothetical protein GLYMA_03G226300
            [Glycine max]
          Length = 357

 Score =  189 bits (481), Expect = 9e-53
 Identities = 97/203 (47%), Positives = 125/203 (61%), Gaps = 1/203 (0%)
 Frame = +2

Query: 695  NSQILVINCCFFLIAASVLPLLQA-AKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVG 871
            ++ +L +   F L++AS L +  A A+P+  L        +S I+Q S    IN +P  G
Sbjct: 3    STHLLPLATFFLLLSASYLQIAGAEAQPLTSLKL------NSHILQESTAKEINENPEAG 56

Query: 872  WTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPH 1051
            W A  NPRF+NYTV++F                   P  +SHP+TL+LPK FD+RTAW  
Sbjct: 57   WEAAINPRFSNYTVEQF--KRLLGVKPMPKKELRSTP-AISHPKTLKLPKNFDARTAWSQ 113

Query: 1052 CPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXW 1231
            C TIG ILDQGHCGSCWAFGAVE+L+DRFCIHF+  + LSVNDLL+             +
Sbjct: 114  CSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGGY 173

Query: 1232 PMQAWRYLSKSGVVTDECDPYFD 1300
            P+ AWRYL+  GVVT+ECDPYFD
Sbjct: 174  PLYAWRYLAHHGVVTEECDPYFD 196


>KQK93220.1 hypothetical protein SETIT_026554mg [Setaria italica]
          Length = 348

 Score =  189 bits (480), Expect = 9e-53
 Identities = 98/207 (47%), Positives = 125/207 (60%), Gaps = 1/207 (0%)
 Frame = +2

Query: 683  MGKLNSQILVINCCFFLIAASVLPLLQAAKPIPRLWAAAGAGSDS-RIIQSSIVDRINSD 859
            MG    Q+L++   FF ++A    +++A KPIP        G +S  IIQ  I++ +N  
Sbjct: 1    MGGTLQQLLLL---FFFLSAVAPQVVRAVKPIPNSNLGVEEGDNSIGIIQKDIIETVNKH 57

Query: 860  PSVGWTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRT 1039
            P+ GWTA  NP FANYT+ +F                 D P   ++ R+L+LPKEFD+R+
Sbjct: 58   PNAGWTAAQNPYFANYTIAQF--KHILGVKPTPQDALTDVPSK-TYSRSLKLPKEFDARS 114

Query: 1040 AWPHCPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXX 1219
             W HC TIG ILDQGHCGSCWAFGAVE L DRFCIH N  + LSVNDLL+          
Sbjct: 115  KWSHCSTIGNILDQGHCGSCWAFGAVECLQDRFCIHMNMNISLSVNDLLACCGFMCGDGC 174

Query: 1220 XXXWPMQAWRYLSKSGVVTDECDPYFD 1300
               +P+ AWRY  ++GVVTDECDPYFD
Sbjct: 175  NGGYPIMAWRYFVQNGVVTDECDPYFD 201


>XP_015572445.1 PREDICTED: cathepsin B [Ricinus communis]
          Length = 359

 Score =  189 bits (480), Expect = 1e-52
 Identities = 87/163 (53%), Positives = 108/163 (66%)
 Frame = +2

Query: 812  DSRIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVV 991
            +SRI+Q SI+ ++N +P  GW A  NP+ +N+TV +F                     ++
Sbjct: 37   NSRILQESIIKKVNENPDAGWEAAMNPQLSNFTVGQFKYLLGAKPTPKKELMGVP---MI 93

Query: 992  SHPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLS 1171
            SHP+TL+LPKEFD+RTAWPHC TIG ILDQGHCGSCWAFGAVE+L+DRFCIHF   + LS
Sbjct: 94   SHPKTLKLPKEFDARTAWPHCSTIGKILDQGHCGSCWAFGAVESLSDRFCIHFGMNISLS 153

Query: 1172 VNDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDECDPYFD 1300
            VNDLL+             +PM AWRY    GVVT+ECDPYFD
Sbjct: 154  VNDLLACCGFLCGDGCDGGYPMYAWRYFVHHGVVTEECDPYFD 196


>XP_004978407.1 PREDICTED: cathepsin B-like [Setaria italica] XP_004978408.1
            PREDICTED: cathepsin B-like [Setaria italica] KQK93221.1
            hypothetical protein SETIT_026554mg [Setaria italica]
            KQK93222.1 hypothetical protein SETIT_026554mg [Setaria
            italica]
          Length = 360

 Score =  189 bits (480), Expect = 1e-52
 Identities = 98/207 (47%), Positives = 125/207 (60%), Gaps = 1/207 (0%)
 Frame = +2

Query: 683  MGKLNSQILVINCCFFLIAASVLPLLQAAKPIPRLWAAAGAGSDS-RIIQSSIVDRINSD 859
            MG    Q+L++   FF ++A    +++A KPIP        G +S  IIQ  I++ +N  
Sbjct: 1    MGGTLQQLLLL---FFFLSAVAPQVVRAVKPIPNSNLGVEEGDNSIGIIQKDIIETVNKH 57

Query: 860  PSVGWTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRT 1039
            P+ GWTA  NP FANYT+ +F                 D P   ++ R+L+LPKEFD+R+
Sbjct: 58   PNAGWTAAQNPYFANYTIAQF--KHILGVKPTPQDALTDVPSK-TYSRSLKLPKEFDARS 114

Query: 1040 AWPHCPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXX 1219
             W HC TIG ILDQGHCGSCWAFGAVE L DRFCIH N  + LSVNDLL+          
Sbjct: 115  KWSHCSTIGNILDQGHCGSCWAFGAVECLQDRFCIHMNMNISLSVNDLLACCGFMCGDGC 174

Query: 1220 XXXWPMQAWRYLSKSGVVTDECDPYFD 1300
               +P+ AWRY  ++GVVTDECDPYFD
Sbjct: 175  NGGYPIMAWRYFVQNGVVTDECDPYFD 201


>XP_004969895.1 PREDICTED: cathepsin B-like [Setaria italica] XP_004969896.1
            PREDICTED: cathepsin B-like [Setaria italica]
          Length = 360

 Score =  189 bits (480), Expect = 1e-52
 Identities = 98/206 (47%), Positives = 122/206 (59%), Gaps = 1/206 (0%)
 Frame = +2

Query: 686  GKLNSQILVINCCFFLIAASVLPLLQAAKPIPRLWAAAGAGSDS-RIIQSSIVDRINSDP 862
            G L  Q+L+    FF ++A    +++A KPIP        G +S  IIQ  I+  +N  P
Sbjct: 3    GTLQQQLLL----FFFLSAVAPQVVRAVKPIPNSNLGVEEGDNSIGIIQKDIIQTVNKHP 58

Query: 863  SVGWTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTA 1042
              GWTA  NP FANYT+ +F                 D P   ++ R+L+LPKEFD+R+ 
Sbjct: 59   DAGWTAAHNPYFANYTIAQF--KHILGVKPTPRDALTDVP-AKTYSRSLKLPKEFDARSK 115

Query: 1043 WPHCPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXX 1222
            W HC TIG ILDQGHCGSCWAFGAVE L DRFCIH N  + LSVNDLL+           
Sbjct: 116  WSHCSTIGNILDQGHCGSCWAFGAVECLQDRFCIHMNVNISLSVNDLLACCGFMCGDGCD 175

Query: 1223 XXWPMQAWRYLSKSGVVTDECDPYFD 1300
              +P+ AWRY  ++GVVTDECDPYFD
Sbjct: 176  GGYPIMAWRYFVQNGVVTDECDPYFD 201


>AAR25797.1 cathepsin B-like cysteine proteinase, partial [Solanum tuberosum]
          Length = 218

 Score =  184 bits (467), Expect = 2e-52
 Identities = 93/190 (48%), Positives = 119/190 (62%)
 Frame = +2

Query: 731  LIAASVLPLLQAAKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVGWTAEANPRFANYT 910
            L+ A  + +LQ A   P     + A  +S I+Q SIV R+N +   GW A  NP+ +N+T
Sbjct: 11   LLGAFFILILQVAAEKP----ISEAKLESAILQDSIVKRVNENAEAGWKAAFNPQLSNFT 66

Query: 911  VDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPHCPTIGAILDQGHC 1090
            V +F                E  P V++HPR  +LPKEFD+R AWP C TIG ILDQGHC
Sbjct: 67   VSQF--KRLLGVKPAREGDLEGIP-VLTHPRLKELPKEFDARKAWPQCSTIGKILDQGHC 123

Query: 1091 GSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGV 1270
            GSCWAFGAVE+L+DRFCIH+N ++ LSVNDLL+             +P+ AWRY  +SGV
Sbjct: 124  GSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFKRSGV 183

Query: 1271 VTDECDPYFD 1300
            VT+ECDPYFD
Sbjct: 184  VTEECDPYFD 193


>XP_010907756.1 PREDICTED: cathepsin B [Elaeis guineensis]
          Length = 380

 Score =  189 bits (480), Expect = 2e-52
 Identities = 97/192 (50%), Positives = 124/192 (64%), Gaps = 2/192 (1%)
 Frame = +2

Query: 731  LIAASVLPLLQ--AAKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVGWTAEANPRFAN 904
            LI A+ L   Q  A KP+P+L        +S I+Q+SI+++IN +P  GW A  N RF+N
Sbjct: 37   LILATALHPQQVIAGKPMPKL------KMESMILQNSIIEKINGNPIAGWKASTNSRFSN 90

Query: 905  YTVDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPHCPTIGAILDQG 1084
            YTV +F                ED P V +H ++++LPK+FD+RTAWP C TIG ILDQG
Sbjct: 91   YTVGQF--KHILGVKPAPRNAWEDIP-VKTHQKSVKLPKQFDARTAWPQCSTIGRILDQG 147

Query: 1085 HCGSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXWPMQAWRYLSKS 1264
            HCGSCWAFGAVE+L+DRFC+HF   V LSVNDLL+             +P+ AWRY  +S
Sbjct: 148  HCGSCWAFGAVESLSDRFCVHFGMNVSLSVNDLLACCGFMCGDGCDGGYPIYAWRYFVQS 207

Query: 1265 GVVTDECDPYFD 1300
            GVVT+ECDPYFD
Sbjct: 208  GVVTEECDPYFD 219


>XP_008776548.1 PREDICTED: cathepsin B-like isoform X3 [Phoenix dactylifera]
          Length = 368

 Score =  189 bits (479), Expect = 2e-52
 Identities = 92/179 (51%), Positives = 119/179 (66%)
 Frame = +2

Query: 764  AAKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEFCXXXXXX 943
            AAK +P+L       + S I+Q+SI+++IN++P+ GW A  N RF NYT+D+F       
Sbjct: 38   AAKRMPKL------RTGSMILQNSIIEKINANPNAGWKASMNSRFVNYTIDQF--KHLLG 89

Query: 944  XXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCWAFGAVEA 1123
                     ED P V++H ++L LPK+FD+RTAWP C TIG IL QGHCGSCWAFGAVE+
Sbjct: 90   VKPMPCNTLEDIP-VMTHQKSLNLPKQFDARTAWPQCSTIGRILGQGHCGSCWAFGAVES 148

Query: 1124 LTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDECDPYFD 1300
            L+DRFCIHF   + LSVNDLL+             +P+ AWRY  +SGVVT+ECDPYFD
Sbjct: 149  LSDRFCIHFGMNISLSVNDLLACCGFMCGDGCDGGYPIYAWRYFIQSGVVTEECDPYFD 207


>OAY48801.1 hypothetical protein MANES_05G006300 [Manihot esculenta]
          Length = 358

 Score =  188 bits (478), Expect = 2e-52
 Identities = 89/163 (54%), Positives = 108/163 (66%)
 Frame = +2

Query: 812  DSRIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVV 991
            +SRI+Q SI+ +IN +P+ GW A  NP+F+NYTV EF                     V+
Sbjct: 36   NSRILQESIIKKINENPNAGWEAAMNPQFSNYTVGEFKYLLGAKPTPKKELRGFP---VI 92

Query: 992  SHPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLS 1171
            SHPR+L+LPKEFD+R AWP C TIG ILDQGHCGSCWAFGAVE+L+DRFCIHF   + LS
Sbjct: 93   SHPRSLKLPKEFDARKAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFGMNISLS 152

Query: 1172 VNDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDECDPYFD 1300
            VNDLL+             +P+ AWRY    GVVT+ECDPYFD
Sbjct: 153  VNDLLACCGFLCGAGCNGGYPIYAWRYFVHHGVVTEECDPYFD 195


>XP_012083054.1 PREDICTED: cathepsin B [Jatropha curcas] KDP28374.1 hypothetical
            protein JCGZ_14145 [Jatropha curcas]
          Length = 358

 Score =  188 bits (478), Expect = 2e-52
 Identities = 90/162 (55%), Positives = 110/162 (67%)
 Frame = +2

Query: 815  SRIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVS 994
            SR++Q SI+ +IN +P+ GW A  NPRF+NYTV EF                   PLV S
Sbjct: 37   SRVLQDSIIRKINENPNAGWEAAMNPRFSNYTVGEF--KYLLGVKPTPKKELRGVPLV-S 93

Query: 995  HPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLSV 1174
            HP++L+LPKEFD+R+AWP C TIG ILDQGHCGSCWAFGAVE+L+DRFCI+F   + LSV
Sbjct: 94   HPKSLKLPKEFDARSAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCINFGMNISLSV 153

Query: 1175 NDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDECDPYFD 1300
            NDLL+             +P+ AWRYL   GVVT+ECDPYFD
Sbjct: 154  NDLLACCGFLCGNGCDGGYPLYAWRYLVHHGVVTEECDPYFD 195


>KMZ74768.1 Cathepsin B [Zostera marina]
          Length = 355

 Score =  188 bits (477), Expect = 3e-52
 Identities = 91/162 (56%), Positives = 106/162 (65%)
 Frame = +2

Query: 815  SRIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVS 994
            S I+Q SIV ++N +P  GW A ANPR AN+T+ +F                     VVS
Sbjct: 34   SLILQDSIVQQVNGNPGSGWKAAANPRLANFTIGQFKHLLGVKPMPKNELVGIP---VVS 90

Query: 995  HPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLSV 1174
            +P+ LQLPKEFD+RTAWPHCPTI  ILDQGHCGSCWAF AVE+L+DRFCIH N +V LSV
Sbjct: 91   YPKNLQLPKEFDARTAWPHCPTISNILDQGHCGSCWAFAAVESLSDRFCIHLNISVALSV 150

Query: 1175 NDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDECDPYFD 1300
            NDLLS             +P QAW+Y  K GVVT ECDPYFD
Sbjct: 151  NDLLSCCGFMCGYGCDGGYPYQAWQYFVKHGVVTSECDPYFD 192


>XP_009393126.1 PREDICTED: cathepsin B-like [Musa acuminata subsp. malaccensis]
          Length = 358

 Score =  188 bits (477), Expect = 3e-52
 Identities = 91/179 (50%), Positives = 118/179 (65%)
 Frame = +2

Query: 764  AAKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEFCXXXXXX 943
            A KP+PRL       +DS I+Q+SI+ +IN++P+ GW A  N RF NYT+ +F       
Sbjct: 28   AVKPMPRL------RTDSMILQNSIIQKINANPNAGWKASMNSRFENYTIGQF--KHILG 79

Query: 944  XXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCWAFGAVEA 1123
                      D P   ++ ++L+LPK+FD+RTAWP C TIG ILDQGHCGSCWAFGAVE+
Sbjct: 80   VKPMPHNEVMDIP-TKTYTKSLKLPKQFDARTAWPQCSTIGRILDQGHCGSCWAFGAVES 138

Query: 1124 LTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDECDPYFD 1300
            L+DRFC+HF   + LSVNDLLS             +P++AWRY  ++GVVTDECDPYFD
Sbjct: 139  LSDRFCVHFGMNISLSVNDLLSCCGFMCGDGCDGGYPIRAWRYFVENGVVTDECDPYFD 197


>KQL06905.1 hypothetical protein SETIT_001407mg [Setaria italica]
          Length = 450

 Score =  190 bits (482), Expect = 6e-52
 Identities = 100/226 (44%), Positives = 131/226 (57%), Gaps = 1/226 (0%)
 Frame = +2

Query: 626  SRERLVAAQRVRGSSSRDKMGKLNSQILVINCCFFLIAASVLPLLQAAKPIPRLWAAAGA 805
            S++    +++++      KMG    Q L++   FF ++A    +++A KPIP        
Sbjct: 72   SKQATRESRQLKRIDKNKKMGGTLQQQLLL---FFFLSAVAPQVVRAVKPIPNSNLGVEE 128

Query: 806  GSDS-RIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEP 982
            G +S  IIQ  I+  +N  P  GWTA  NP FANYT+ +F                 D P
Sbjct: 129  GDNSIGIIQKDIIQTVNKHPDAGWTAAHNPYFANYTIAQF--KHILGVKPTPRDALTDVP 186

Query: 983  LVVSHPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATV 1162
               ++ R+L+LPKEFD+R+ W HC TIG ILDQGHCGSCWAFGAVE L DRFCIH N  +
Sbjct: 187  -AKTYSRSLKLPKEFDARSKWSHCSTIGNILDQGHCGSCWAFGAVECLQDRFCIHMNVNI 245

Query: 1163 KLSVNDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDECDPYFD 1300
             LSVNDLL+             +P+ AWRY  ++GVVTDECDPYFD
Sbjct: 246  SLSVNDLLACCGFMCGDGCDGGYPIMAWRYFVQNGVVTDECDPYFD 291


>KXG36304.1 hypothetical protein SORBI_002G315800 [Sorghum bicolor]
          Length = 316

 Score =  186 bits (472), Expect = 6e-52
 Identities = 93/186 (50%), Positives = 118/186 (63%), Gaps = 1/186 (0%)
 Frame = +2

Query: 746  VLPLLQAAKPIPRLWAAAGAGSDS-RIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEF 922
            +L LL  +   P++    G G +S RIIQ  I++ +N+ PS GWTA  NP F+NYT+ +F
Sbjct: 15   LLALLLVSAAAPQV-VGVGVGDNSLRIIQEDIIETVNNHPSAGWTASRNPYFSNYTIAQF 73

Query: 923  CXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCW 1102
                             D P V ++PR+L+LPKEFD+R+ W  C TIG ILDQGHCGSCW
Sbjct: 74   --KHILGVKPAPKNVLSDVP-VKTYPRSLELPKEFDARSVWSRCSTIGNILDQGHCGSCW 130

Query: 1103 AFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDE 1282
            AFGAVE L DRFCIHFN ++ LSVNDLL+             +P+ AW Y  ++GVVTDE
Sbjct: 131  AFGAVECLQDRFCIHFNTSILLSVNDLLACCGFMCGDGCDGGYPIMAWHYFVQNGVVTDE 190

Query: 1283 CDPYFD 1300
            CDPYFD
Sbjct: 191  CDPYFD 196


Top