BLASTX nr result
ID: Alisma22_contig00001149
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Alisma22_contig00001149
         (1302 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value

XP_015639550.1 PREDICTED: cathepsin B isoform X3 [Oryza sativa J...   197   1e-55
XP_011025296.1 PREDICTED: cathepsin B-like [Populus euphratica]       194   2e-54
XP_006664995.1 PREDICTED: cathepsin B-like [Oryza brachyantha] X...   193   5e-54
XP_006664962.1 PREDICTED: cathepsin B-like [Oryza brachyantha] X...   191   4e-53
ACU24206.1 unknown, partial [Glycine max]                             189   4e-53
KQL06903.1 hypothetical protein SETIT_001407mg [Setaria italica]      190   8e-53
XP_003521632.1 PREDICTED: cathepsin B [Glycine max] KHN14189.1 C...   189   9e-53
KQK93220.1 hypothetical protein SETIT_026554mg [Setaria italica]      189   9e-53
XP_015572445.1 PREDICTED: cathepsin B [Ricinus communis]              189   1e-52
XP_004978407.1 PREDICTED: cathepsin B-like [Setaria italica] XP_...   189   1e-52
XP_004969895.1 PREDICTED: cathepsin B-like [Setaria italica] XP_...   189   1e-52
AAR25797.1 cathepsin B-like cysteine proteinase, partial [Solanu...   184   2e-52
XP_010907756.1 PREDICTED: cathepsin B [Elaeis guineensis]             189   2e-52
XP_008776548.1 PREDICTED: cathepsin B-like isoform X3 [Phoenix d...   189   2e-52
OAY48801.1 hypothetical protein MANES_05G006300 [Manihot esculenta]   188   2e-52
XP_012083054.1 PREDICTED: cathepsin B [Jatropha curcas] KDP28374...   188   2e-52
KMZ74768.1 Cathepsin B [Zostera marina]                               188   3e-52
XP_009393126.1 PREDICTED: cathepsin B-like [Musa acuminata subsp...   188   3e-52
KQL06905.1 hypothetical protein SETIT_001407mg [Setaria italica]      190   6e-52
KXG36304.1 hypothetical protein SORBI_002G315800 [Sorghum bicolor]    186   6e-52

>XP_015639550.1 PREDICTED: cathepsin B isoform X3 [Oryza sativa Japonica Group]
 XP_015639551.1 PREDICTED: cathepsin B isoform X3 [Oryza sativa Japonica Group]
 AAX11351.1 cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
 EAY97476.1 hypothetical protein OsI_19406 [Oryza sativa Indica Group]
 BAG89222.1 unnamed protein product [Oryza sativa Japonica Group]
 BAG94499.1 unnamed protein product [Oryza sativa Japonica Group]
 BAG87079.1 unnamed protein product [Oryza sativa Japonica Group]
 EEE63190.1 hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
 AIV98516.1 cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
          Length = 358

 Score = 197 bits (500), Expect = 1e-55
 Identities = 97/180 (53%), Positives = 113/180 (62%)
 Frame = +2

Query: 761  QAAKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEFCXXXXX 940
            +AAKPIP L G SRIIQ I+ IN P+ GWTA NP FANYT +F
Sbjct: 21   RAAKPIPNLQLMTKEGGSSRIIQDDIIKAINKHPNAGWTAARNPYFANYTTAQF--KHIL 78

Query: 941  XXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCWAFGAVE 1120
            D P V ++PR+L LPKEFD+R+AW C TIG ILDQGHCGSCWAFGAVE
Sbjct: 79   GVKPTPHSVLNDVP-VKTYPRSLMLPKEFDARSAWSQCNTIGTILDQGHCGSCWAFGAVE 137

Query: 1121 ALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDECDPYFD 1300
            L DRFCIHFN + LSVNDL++ +P+ AWRY ++GVVTDECDPYFD
Sbjct: 138  CLQDRFCIHFNMNISLSVNDLVACCGFMCGDGCDGGYPIMAWRYFVRNGVVTDECDPYFD 197

>XP_011025296.1 PREDICTED: cathepsin B-like [Populus euphratica]
          Length = 356

 Score = 194 bits (492), Expect = 2e-54
 Identities = 99/194 (51%), Positives = 125/194 (64%), Gaps = 2/194 (1%)
 Frame = +2

Query: 725  FFLIAASVLPLLQ--AAKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVGWTAEANPRF 898
            FFL+AA Q A +P+ +L +SRI+Q SIV +IN +P+ GW A NP+F
Sbjct: 11   FFLVAALFTFYSQVIAVEPVSKLKL------NSRILQDSIVQKINENPNAGWEATMNPQF 64

Query: 899  ANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPHCPTIGAILD 1078
            +NY+V EF PLV HP++++LPKEFD+RTAWPHC TIG ILD
Sbjct: 65   SNYSVGEF--KYLLGVKPTPGKELRGVPLV-RHPKSMKLPKEFDARTAWPHCSTIGRILD 121

Query: 1079 QGHCGSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXWPMQAWRYLS 1258
            QGHCGSCWAFGAVE+L+DRFCIH+ + LSVNDLL+ +P+ AWRY
Sbjct: 122  QGHCGSCWAFGAVESLSDRFCIHYGMNLSLSVNDLLACCGWMCGDGCDGGYPIDAWRYFV 181

Query: 1259 KSGVVTDECDPYFD 1300
            +SGVVT+ECDPYFD
Sbjct: 182  QSGVVTEECDPYFD 195

>XP_006664995.1 PREDICTED: cathepsin B-like [Oryza brachyantha]
 XP_006664996.1 PREDICTED: cathepsin B-like [Oryza brachyantha]
 XP_015698993.1 PREDICTED: cathepsin B-like [Oryza brachyantha]
          Length = 362

 Score = 193 bits (490), Expect = 5e-54
 Identities = 102/199 (51%), Positives = 117/199 (58%)
 Frame = +2

Query: 704  ILVINCCFFLIAASVLPLLQAAKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVGWTAE 883
            ILV C AS +AAK IP G SRIIQ I+ IN P+ GWTA
Sbjct: 14   ILVFTC------ASAPQATKAAKSIPDPQLTIEEGDSSRIIQDDIIKTINKHPNAGWTAA 67

Query: 884  ANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPHCPTI 1063
            NP FANYTV +F + P ++ R+L LPKEFD+R+AW HC TI
Sbjct: 68   QNPYFANYTVAQF--KHILGVKATPHSLLSNVP-AKTYSRSLMLPKEFDARSAWSHCSTI 124

Query: 1064 GAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXWPMQA 1243
            G ILDQGHCGSCWAFGAVE L DRFCIHFN + LSVNDLLS +P+ A
Sbjct: 125  GTILDQGHCGSCWAFGAVECLQDRFCIHFNMNISLSVNDLLSCCGFMCGDGCDGGYPIMA 184

Query: 1244 WRYLSKSGVVTDECDPYFD 1300
            WRY ++GVVTDECDPYFD
Sbjct: 185  WRYFVQNGVVTDECDPYFD 203

>XP_006664962.1 PREDICTED: cathepsin B-like [Oryza brachyantha]
 XP_006664963.1 PREDICTED: cathepsin B-like [Oryza brachyantha]
 XP_006664964.1 PREDICTED: cathepsin B-like [Oryza brachyantha]
          Length = 362

 Score = 191 bits (484), Expect = 4e-53
 Identities = 101/199 (50%), Positives = 116/199 (58%)
 Frame = +2

Query: 704  ILVINCCFFLIAASVLPLLQAAKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVGWTAE 883
            ILV C AS +AAK IP G SRIIQ I+ IN P+ GWTA
Sbjct: 14   ILVFTC------ASAPQATKAAKSIPDPQLTIEEGDSSRIIQDDIIKTINKHPNAGWTAA 67

Query: 884  ANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPHCPTI 1063
            NP FANYTV +F + P ++ R+L LPKEFD+R+AW HC TI
Sbjct: 68   QNPYFANYTVAQF--KHILGVKATPHSLLSNVP-AKTYSRSLMLPKEFDARSAWSHCSTI 124

Query: 1064 GAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXWPMQA 1243
            G ILDQGHCGSCWAFGAVE L DRFCIHFN LSVNDLL+ +P+ A
Sbjct: 125  GTILDQGHCGSCWAFGAVECLQDRFCIHFNMNTSLSVNDLLACCGFMCGDGCDGGYPIMA 184

Query: 1244 WRYLSKSGVVTDECDPYFD 1300
            WRY ++GVVTDECDPYFD
Sbjct: 185  WRYFVQNGVVTDECDPYFD 203

>ACU24206.1 unknown, partial [Glycine max]
          Length = 327

 Score = 189 bits (481), Expect = 4e-53
 Identities = 97/203 (47%), Positives = 125/203 (61%), Gaps = 1/203 (0%)
 Frame = +2

Query: 695  NSQILVINCCFFLIAASVLPLLQA-AKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVG 871
            ++ +L + F L++AS L + A A+P+ L +S I+Q S IN +P G
Sbjct: 3    STHLLPLATFFLLLSASYLQIAGAEAQPLTSLKL------NSHILQESTAKEINENPEAG 56

Query: 872  WTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPH 1051
            W A NPRF+NYTV++F P +SHP+TL+LPK FD+RTAW
Sbjct: 57   WEAAINPRFSNYTVEQF--KRLLGVKPMPKKELRSTP-AISHPKTLKLPKNFDARTAWSQ 113

Query: 1052 CPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXW 1231
            C TIG ILDQGHCGSCWAFGAVE+L+DRFCIHF+ + LSVNDLL+ +
Sbjct: 114  CSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGGY 173

Query: 1232 PMQAWRYLSKSGVVTDECDPYFD 1300
            P+ AWRYL+ GVVT+ECDPYFD
Sbjct: 174  PLYAWRYLAHHGVVTEECDPYFD 196

>KQL06903.1 hypothetical protein SETIT_001407mg [Setaria italica]
          Length = 366

 Score = 190 bits (482), Expect = 8e-53
 Identities = 100/226 (44%), Positives = 131/226 (57%), Gaps = 1/226 (0%)
 Frame = +2

Query: 626  SRERLVAAQRVRGSSSRDKMGKLNSQILVINCCFFLIAASVLPLLQAAKPIPRLWAAAGA 805
            S++ +++++ KMG Q L++ FF ++A +++A KPIP
Sbjct: 72   SKQATRESRQLKRIDKNKKMGGTLQQQLLL---FFFLSAVAPQVVRAVKPIPNSNLGVEE 128

Query: 806  GSDS-RIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEP 982
            G +S IIQ I+ +N P GWTA NP FANYT+ +F D P
Sbjct: 129  GDNSIGIIQKDIIQTVNKHPDAGWTAAHNPYFANYTIAQF--KHILGVKPTPRDALTDVP 186

Query: 983  LVVSHPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATV 1162
            ++ R+L+LPKEFD+R+ W HC TIG ILDQGHCGSCWAFGAVE L DRFCIH N +
Sbjct: 187  -AKTYSRSLKLPKEFDARSKWSHCSTIGNILDQGHCGSCWAFGAVECLQDRFCIHMNVNI 245

Query: 1163 KLSVNDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDECDPYFD 1300
            LSVNDLL+ +P+ AWRY ++GVVTDECDPYFD
Sbjct: 246  SLSVNDLLACCGFMCGDGCDGGYPIMAWRYFVQNGVVTDECDPYFD 291

>XP_003521632.1 PREDICTED: cathepsin B [Glycine max]
 KHN14189.1 Cathepsin B [Glycine soja]
 KRH68369.1 hypothetical protein GLYMA_03G226300 [Glycine max]
          Length = 357

 Score = 189 bits (481), Expect = 9e-53
 Identities = 97/203 (47%), Positives = 125/203 (61%), Gaps = 1/203 (0%)
 Frame = +2

Query: 695  NSQILVINCCFFLIAASVLPLLQA-AKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVG 871
            ++ +L + F L++AS L + A A+P+ L +S I+Q S IN +P G
Sbjct: 3    STHLLPLATFFLLLSASYLQIAGAEAQPLTSLKL------NSHILQESTAKEINENPEAG 56

Query: 872  WTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPH 1051
            W A NPRF+NYTV++F P +SHP+TL+LPK FD+RTAW
Sbjct: 57   WEAAINPRFSNYTVEQF--KRLLGVKPMPKKELRSTP-AISHPKTLKLPKNFDARTAWSQ 113

Query: 1052 CPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXW 1231
            C TIG ILDQGHCGSCWAFGAVE+L+DRFCIHF+ + LSVNDLL+ +
Sbjct: 114  CSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGGY 173

Query: 1232 PMQAWRYLSKSGVVTDECDPYFD 1300
            P+ AWRYL+ GVVT+ECDPYFD
Sbjct: 174  PLYAWRYLAHHGVVTEECDPYFD 196

>KQK93220.1 hypothetical protein SETIT_026554mg [Setaria italica]
          Length = 348

 Score = 189 bits (480), Expect = 9e-53
 Identities = 98/207 (47%), Positives = 125/207 (60%), Gaps = 1/207 (0%)
 Frame = +2

Query: 683  MGKLNSQILVINCCFFLIAASVLPLLQAAKPIPRLWAAAGAGSDS-RIIQSSIVDRINSD 859
            MG Q+L++ FF ++A +++A KPIP G +S IIQ I++ +N
Sbjct: 1    MGGTLQQLLLL---FFFLSAVAPQVVRAVKPIPNSNLGVEEGDNSIGIIQKDIIETVNKH 57

Query: 860  PSVGWTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRT 1039
            P+ GWTA NP FANYT+ +F D P ++ R+L+LPKEFD+R+
Sbjct: 58   PNAGWTAAQNPYFANYTIAQF--KHILGVKPTPQDALTDVPSK-TYSRSLKLPKEFDARS 114

Query: 1040 AWPHCPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXX 1219
            W HC TIG ILDQGHCGSCWAFGAVE L DRFCIH N + LSVNDLL+
Sbjct: 115  KWSHCSTIGNILDQGHCGSCWAFGAVECLQDRFCIHMNMNISLSVNDLLACCGFMCGDGC 174

Query: 1220 XXXWPMQAWRYLSKSGVVTDECDPYFD 1300
            +P+ AWRY ++GVVTDECDPYFD
Sbjct: 175  NGGYPIMAWRYFVQNGVVTDECDPYFD 201

>XP_015572445.1 PREDICTED: cathepsin B [Ricinus communis]
          Length = 359

 Score = 189 bits (480), Expect = 1e-52
 Identities = 87/163 (53%), Positives = 108/163 (66%)
 Frame = +2

Query: 812  DSRIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVV 991
            +SRI+Q SI+ ++N +P GW A NP+ +N+TV +F ++
Sbjct: 37   NSRILQESIIKKVNENPDAGWEAAMNPQLSNFTVGQFKYLLGAKPTPKKELMGVP---MI 93

Query: 992  SHPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLS 1171
            SHP+TL+LPKEFD+RTAWPHC TIG ILDQGHCGSCWAFGAVE+L+DRFCIHF + LS
Sbjct: 94   SHPKTLKLPKEFDARTAWPHCSTIGKILDQGHCGSCWAFGAVESLSDRFCIHFGMNISLS 153

Query: 1172 VNDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDECDPYFD 1300
            VNDLL+ +PM AWRY GVVT+ECDPYFD
Sbjct: 154  VNDLLACCGFLCGDGCDGGYPMYAWRYFVHHGVVTEECDPYFD 196

>XP_004978407.1 PREDICTED: cathepsin B-like [Setaria italica]
 XP_004978408.1 PREDICTED: cathepsin B-like [Setaria italica]
 KQK93221.1 hypothetical protein SETIT_026554mg [Setaria italica]
 KQK93222.1 hypothetical protein SETIT_026554mg [Setaria italica]
          Length = 360

 Score = 189 bits (480), Expect = 1e-52
 Identities = 98/207 (47%), Positives = 125/207 (60%), Gaps = 1/207 (0%)
 Frame = +2

Query: 683  MGKLNSQILVINCCFFLIAASVLPLLQAAKPIPRLWAAAGAGSDS-RIIQSSIVDRINSD 859
            MG Q+L++ FF ++A +++A KPIP G +S IIQ I++ +N
Sbjct: 1    MGGTLQQLLLL---FFFLSAVAPQVVRAVKPIPNSNLGVEEGDNSIGIIQKDIIETVNKH 57

Query: 860  PSVGWTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRT 1039
            P+ GWTA NP FANYT+ +F D P ++ R+L+LPKEFD+R+
Sbjct: 58   PNAGWTAAQNPYFANYTIAQF--KHILGVKPTPQDALTDVPSK-TYSRSLKLPKEFDARS 114

Query: 1040 AWPHCPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXX 1219
            W HC TIG ILDQGHCGSCWAFGAVE L DRFCIH N + LSVNDLL+
Sbjct: 115  KWSHCSTIGNILDQGHCGSCWAFGAVECLQDRFCIHMNMNISLSVNDLLACCGFMCGDGC 174

Query: 1220 XXXWPMQAWRYLSKSGVVTDECDPYFD 1300
            +P+ AWRY ++GVVTDECDPYFD
Sbjct: 175  NGGYPIMAWRYFVQNGVVTDECDPYFD 201

>XP_004969895.1 PREDICTED: cathepsin B-like [Setaria italica]
 XP_004969896.1 PREDICTED: cathepsin B-like [Setaria italica]
          Length = 360

 Score = 189 bits (480), Expect = 1e-52
 Identities = 98/206 (47%), Positives = 122/206 (59%), Gaps = 1/206 (0%)
 Frame = +2

Query: 686  GKLNSQILVINCCFFLIAASVLPLLQAAKPIPRLWAAAGAGSDS-RIIQSSIVDRINSDP 862
            G L Q+L+ FF ++A +++A KPIP G +S IIQ I+ +N P
Sbjct: 3    GTLQQQLLL----FFFLSAVAPQVVRAVKPIPNSNLGVEEGDNSIGIIQKDIIQTVNKHP 58

Query: 863  SVGWTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTA 1042
            GWTA NP FANYT+ +F D P ++ R+L+LPKEFD+R+
Sbjct: 59   DAGWTAAHNPYFANYTIAQF--KHILGVKPTPRDALTDVP-AKTYSRSLKLPKEFDARSK 115

Query: 1043 WPHCPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXX 1222
            W HC TIG ILDQGHCGSCWAFGAVE L DRFCIH N + LSVNDLL+
Sbjct: 116  WSHCSTIGNILDQGHCGSCWAFGAVECLQDRFCIHMNVNISLSVNDLLACCGFMCGDGCD 175

Query: 1223 XXWPMQAWRYLSKSGVVTDECDPYFD 1300
            +P+ AWRY ++GVVTDECDPYFD
Sbjct: 176  GGYPIMAWRYFVQNGVVTDECDPYFD 201

>AAR25797.1 cathepsin B-like cysteine proteinase, partial [Solanum tuberosum]
          Length = 218

 Score = 184 bits (467), Expect = 2e-52
 Identities = 93/190 (48%), Positives = 119/190 (62%)
 Frame = +2

Query: 731  LIAASVLPLLQAAKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVGWTAEANPRFANYT 910
            L+ A + +LQ A P + A +S I+Q SIV R+N + GW A NP+ +N+T
Sbjct: 11   LLGAFFILILQVAAEKP----ISEAKLESAILQDSIVKRVNENAEAGWKAAFNPQLSNFT 66

Query: 911  VDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPHCPTIGAILDQGHC 1090
            V +F E P V++HPR +LPKEFD+R AWP C TIG ILDQGHC
Sbjct: 67   VSQF--KRLLGVKPAREGDLEGIP-VLTHPRLKELPKEFDARKAWPQCSTIGKILDQGHC 123

Query: 1091 GSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGV 1270
            GSCWAFGAVE+L+DRFCIH+N ++ LSVNDLL+ +P+ AWRY +SGV
Sbjct: 124  GSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFKRSGV 183

Query: 1271 VTDECDPYFD 1300
            VT+ECDPYFD
Sbjct: 184  VTEECDPYFD 193

>XP_010907756.1 PREDICTED: cathepsin B [Elaeis guineensis]
          Length = 380

 Score = 189 bits (480), Expect = 2e-52
 Identities = 97/192 (50%), Positives = 124/192 (64%), Gaps = 2/192 (1%)
 Frame = +2

Query: 731  LIAASVLPLLQ--AAKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVGWTAEANPRFAN 904
            LI A+ L Q A KP+P+L +S I+Q+SI+++IN +P GW A N RF+N
Sbjct: 37   LILATALHPQQVIAGKPMPKL------KMESMILQNSIIEKINGNPIAGWKASTNSRFSN 90

Query: 905  YTVDEFCXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPHCPTIGAILDQG 1084
            YTV +F ED P V +H ++++LPK+FD+RTAWP C TIG ILDQG
Sbjct: 91   YTVGQF--KHILGVKPAPRNAWEDIP-VKTHQKSVKLPKQFDARTAWPQCSTIGRILDQG 147

Query: 1085 HCGSCWAFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXWPMQAWRYLSKS 1264
            HCGSCWAFGAVE+L+DRFC+HF V LSVNDLL+ +P+ AWRY +S
Sbjct: 148  HCGSCWAFGAVESLSDRFCVHFGMNVSLSVNDLLACCGFMCGDGCDGGYPIYAWRYFVQS 207

Query: 1265 GVVTDECDPYFD 1300
            GVVT+ECDPYFD
Sbjct: 208  GVVTEECDPYFD 219

>XP_008776548.1 PREDICTED: cathepsin B-like isoform X3 [Phoenix dactylifera]
          Length = 368

 Score = 189 bits (479), Expect = 2e-52
 Identities = 92/179 (51%), Positives = 119/179 (66%)
 Frame = +2

Query: 764  AAKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEFCXXXXXX 943
            AAK +P+L + S I+Q+SI+++IN++P+ GW A N RF NYT+D+F
Sbjct: 38   AAKRMPKL------RTGSMILQNSIIEKINANPNAGWKASMNSRFVNYTIDQF--KHLLG 89

Query: 944  XXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCWAFGAVEA 1123
            ED P V++H ++L LPK+FD+RTAWP C TIG IL QGHCGSCWAFGAVE+
Sbjct: 90   VKPMPCNTLEDIP-VMTHQKSLNLPKQFDARTAWPQCSTIGRILGQGHCGSCWAFGAVES 148

Query: 1124 LTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDECDPYFD 1300
            L+DRFCIHF + LSVNDLL+ +P+ AWRY +SGVVT+ECDPYFD
Sbjct: 149  LSDRFCIHFGMNISLSVNDLLACCGFMCGDGCDGGYPIYAWRYFIQSGVVTEECDPYFD 207

>OAY48801.1 hypothetical protein MANES_05G006300 [Manihot esculenta]
          Length = 358

 Score = 188 bits (478), Expect = 2e-52
 Identities = 89/163 (54%), Positives = 108/163 (66%)
 Frame = +2

Query: 812  DSRIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVV 991
            +SRI+Q SI+ +IN +P+ GW A NP+F+NYTV EF V+
Sbjct: 36   NSRILQESIIKKINENPNAGWEAAMNPQFSNYTVGEFKYLLGAKPTPKKELRGFP---VI 92

Query: 992  SHPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLS 1171
            SHPR+L+LPKEFD+R AWP C TIG ILDQGHCGSCWAFGAVE+L+DRFCIHF + LS
Sbjct: 93   SHPRSLKLPKEFDARKAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFGMNISLS 152

Query: 1172 VNDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDECDPYFD 1300
            VNDLL+ +P+ AWRY GVVT+ECDPYFD
Sbjct: 153  VNDLLACCGFLCGAGCNGGYPIYAWRYFVHHGVVTEECDPYFD 195

>XP_012083054.1 PREDICTED: cathepsin B [Jatropha curcas]
 KDP28374.1 hypothetical protein JCGZ_14145 [Jatropha curcas]
          Length = 358

 Score = 188 bits (478), Expect = 2e-52
 Identities = 90/162 (55%), Positives = 110/162 (67%)
 Frame = +2

Query: 815  SRIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVS 994
            SR++Q SI+ +IN +P+ GW A NPRF+NYTV EF PLV S
Sbjct: 37   SRVLQDSIIRKINENPNAGWEAAMNPRFSNYTVGEF--KYLLGVKPTPKKELRGVPLV-S 93

Query: 995  HPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLSV 1174
            HP++L+LPKEFD+R+AWP C TIG ILDQGHCGSCWAFGAVE+L+DRFCI+F + LSV
Sbjct: 94   HPKSLKLPKEFDARSAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCINFGMNISLSV 153

Query: 1175 NDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDECDPYFD 1300
            NDLL+ +P+ AWRYL GVVT+ECDPYFD
Sbjct: 154  NDLLACCGFLCGNGCDGGYPLYAWRYLVHHGVVTEECDPYFD 195

>KMZ74768.1 Cathepsin B [Zostera marina]
          Length = 355

 Score = 188 bits (477), Expect = 3e-52
 Identities = 91/162 (56%), Positives = 106/162 (65%)
 Frame = +2

Query: 815  SRIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEPLVVS 994
            S I+Q SIV ++N +P GW A ANPR AN+T+ +F VVS
Sbjct: 34   SLILQDSIVQQVNGNPGSGWKAAANPRLANFTIGQFKHLLGVKPMPKNELVGIP---VVS 90

Query: 995  HPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATVKLSV 1174
            +P+ LQLPKEFD+RTAWPHCPTI ILDQGHCGSCWAF AVE+L+DRFCIH N +V LSV
Sbjct: 91   YPKNLQLPKEFDARTAWPHCPTISNILDQGHCGSCWAFAAVESLSDRFCIHLNISVALSV 150

Query: 1175 NDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDECDPYFD 1300
            NDLLS +P QAW+Y K GVVT ECDPYFD
Sbjct: 151  NDLLSCCGFMCGYGCDGGYPYQAWQYFVKHGVVTSECDPYFD 192

>XP_009393126.1 PREDICTED: cathepsin B-like [Musa acuminata subsp. malaccensis]
          Length = 358

 Score = 188 bits (477), Expect = 3e-52
 Identities = 91/179 (50%), Positives = 118/179 (65%)
 Frame = +2

Query: 764  AAKPIPRLWAAAGAGSDSRIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEFCXXXXXX 943
            A KP+PRL +DS I+Q+SI+ +IN++P+ GW A N RF NYT+ +F
Sbjct: 28   AVKPMPRL------RTDSMILQNSIIQKINANPNAGWKASMNSRFENYTIGQF--KHILG 79

Query: 944  XXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCWAFGAVEA 1123
            D P ++ ++L+LPK+FD+RTAWP C TIG ILDQGHCGSCWAFGAVE+
Sbjct: 80   VKPMPHNEVMDIP-TKTYTKSLKLPKQFDARTAWPQCSTIGRILDQGHCGSCWAFGAVES 138

Query: 1124 LTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDECDPYFD 1300
            L+DRFC+HF + LSVNDLLS +P++AWRY ++GVVTDECDPYFD
Sbjct: 139  LSDRFCVHFGMNISLSVNDLLSCCGFMCGDGCDGGYPIRAWRYFVENGVVTDECDPYFD 197

>KQL06905.1 hypothetical protein SETIT_001407mg [Setaria italica]
          Length = 450

 Score = 190 bits (482), Expect = 6e-52
 Identities = 100/226 (44%), Positives = 131/226 (57%), Gaps = 1/226 (0%)
 Frame = +2

Query: 626  SRERLVAAQRVRGSSSRDKMGKLNSQILVINCCFFLIAASVLPLLQAAKPIPRLWAAAGA 805
            S++ +++++ KMG Q L++ FF ++A +++A KPIP
Sbjct: 72   SKQATRESRQLKRIDKNKKMGGTLQQQLLL---FFFLSAVAPQVVRAVKPIPNSNLGVEE 128

Query: 806  GSDS-RIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEFCXXXXXXXXXXXXXXXEDEP 982
            G +S IIQ I+ +N P GWTA NP FANYT+ +F D P
Sbjct: 129  GDNSIGIIQKDIIQTVNKHPDAGWTAAHNPYFANYTIAQF--KHILGVKPTPRDALTDVP 186

Query: 983  LVVSHPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCWAFGAVEALTDRFCIHFNATV 1162
            ++ R+L+LPKEFD+R+ W HC TIG ILDQGHCGSCWAFGAVE L DRFCIH N +
Sbjct: 187  -AKTYSRSLKLPKEFDARSKWSHCSTIGNILDQGHCGSCWAFGAVECLQDRFCIHMNVNI 245

Query: 1163 KLSVNDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDECDPYFD 1300
            LSVNDLL+ +P+ AWRY ++GVVTDECDPYFD
Sbjct: 246  SLSVNDLLACCGFMCGDGCDGGYPIMAWRYFVQNGVVTDECDPYFD 291

>KXG36304.1 hypothetical protein SORBI_002G315800 [Sorghum bicolor]
          Length = 316

 Score = 186 bits (472), Expect = 6e-52
 Identities = 93/186 (50%), Positives = 118/186 (63%), Gaps = 1/186 (0%)
 Frame = +2

Query: 746  VLPLLQAAKPIPRLWAAAGAGSDS-RIIQSSIVDRINSDPSVGWTAEANPRFANYTVDEF 922
            +L LL + P++ G G +S RIIQ I++ +N+ PS GWTA NP F+NYT+ +F
Sbjct: 15   LLALLLVSAAAPQV-VGVGVGDNSLRIIQEDIIETVNNHPSAGWTASRNPYFSNYTIAQF 73

Query: 923  CXXXXXXXXXXXXXXXEDEPLVVSHPRTLQLPKEFDSRTAWPHCPTIGAILDQGHCGSCW 1102
            D P V ++PR+L+LPKEFD+R+ W C TIG ILDQGHCGSCW
Sbjct: 74   --KHILGVKPAPKNVLSDVP-VKTYPRSLELPKEFDARSVWSRCSTIGNILDQGHCGSCW 130

Query: 1103 AFGAVEALTDRFCIHFNATVKLSVNDLLSXXXXXXXXXXXXXWPMQAWRYLSKSGVVTDE 1282
            AFGAVE L DRFCIHFN ++ LSVNDLL+ +P+ AW Y ++GVVTDE
Sbjct: 131  AFGAVECLQDRFCIHFNTSILLSVNDLLACCGFMCGDGCDGGYPIMAWHYFVQNGVVTDE 190

Query: 1283 CDPYFD 1300
            CDPYFD
Sbjct: 191  CDPYFD 196