BLASTX nr result
ID: Achyranthes23_contig00002237
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Achyranthes23_contig00002237 (1607 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003626103.1| Cathepsin B [Medicago truncatula] gi|8724098... 520 e-145 gb|ACJ84734.1| unknown [Medicago truncatula] gi|388505480|gb|AFK... 520 e-145 ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max] 515 e-143 gb|AFK38097.1| unknown [Lotus japonicus] 514 e-143 gb|ACJ85175.1| unknown [Medicago truncatula] 514 e-143 ref|XP_006418358.1| hypothetical protein EUTSA_v10008009mg [Eutr... 511 e-142 gb|EOX95504.1| Cysteine proteinases superfamily protein [Theobro... 510 e-142 ref|NP_563648.1| putative cathepsin B-like cysteine protease [Ar... 509 e-141 ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arab... 509 e-141 ref|XP_006305170.1| hypothetical protein CARUB_v10009537mg [Caps... 508 e-141 ref|XP_006491433.1| PREDICTED: cathepsin B-like isoform X1 [Citr... 508 e-141 ref|XP_006444663.1| hypothetical protein CICLE_v10020859mg [Citr... 508 e-141 ref|XP_002301457.2| putative cathepsin B-like protease family pr... 506 e-141 ref|XP_003521632.1| PREDICTED: cathepsin B [Glycine max] 506 e-140 gb|AGV54421.1| cathepsin B-like protein [Phaseolus vulgaris] 503 e-140 ref|XP_006396358.1| hypothetical protein EUTSA_v10028753mg [Eutr... 503 e-139 ref|XP_002515139.1| cathepsin B, putative [Ricinus communis] gi|... 500 e-139 gb|EXB94879.1| Cathepsin B [Morus notabilis] 498 e-138 ref|XP_006840260.1| hypothetical protein AMTR_s00045p00036790 [A... 497 e-138 gb|ESW35262.1| hypothetical protein PHAVU_001G220100g [Phaseolus... 496 e-137 >ref|XP_003626103.1| Cathepsin B [Medicago truncatula] gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1, propeptide [Medicago truncatula] gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula] Length = 357 Score = 520 bits (1338), Expect = e-145 Identities = 232/313 (74%), Positives = 262/313 (83%), Gaps = 1/313 (0%) Frame = +1 Query: 364 RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543 ++ I +N NP W+A +N +S++T GQFKR+LG ++ P+ E + P+V H +SL L Sbjct: 42 QESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVTHPKSLKL 101 Query: 544 PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723 PKEFDARTAW QC TIG+ILDQGHCGSCWAF AVE+L DRFCI ++MNISLSVNDLLACC Sbjct: 102 PKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLACC 161 Query: 724 GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903 GFLCG GCDGGTPIYAWRYL HHGVV+EECDPYFD GCSHPGCEP Y TPKCVRKCV G Sbjct: 162 GFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKG 221 Query: 904 NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083 NQ+W+ SKHYS AYRVK DP IMAE+YKNGPVEVAF VFEDFAHYKSGVYKH+TGS L Sbjct: 222 NQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSAL 281 Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263 GGHAVKLIGWGTS +GEDYWL+ANQWN +WGDDGYFKI RGTNECGIEDDVT GLPS+KN Sbjct: 282 GGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLPSTKN 341 Query: 1264 LGAVLPDSD-DAG 1299 + + D D DAG Sbjct: 342 IVREVTDMDVDAG 354 >gb|ACJ84734.1| unknown [Medicago truncatula] gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula] Length = 359 Score = 520 bits (1338), Expect = e-145 Identities = 232/313 (74%), Positives = 262/313 (83%), Gaps = 1/313 (0%) Frame = +1 Query: 364 RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543 ++ I +N NP W+A +N +S++T GQFKR+LG ++ P+ E + P+V H +SL L Sbjct: 44 QESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVTHPKSLKL 103 Query: 544 PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723 PKEFDARTAW QC TIG+ILDQGHCGSCWAF AVE+L DRFCI ++MNISLSVNDLLACC Sbjct: 104 PKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLACC 163 Query: 724 GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903 GFLCG GCDGGTPIYAWRYL HHGVV+EECDPYFD GCSHPGCEP Y TPKCVRKCV G Sbjct: 164 GFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKG 223 Query: 904 NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083 NQ+W+ SKHYS AYRVK DP IMAE+YKNGPVEVAF VFEDFAHYKSGVYKH+TGS L Sbjct: 224 NQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSAL 283 Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263 GGHAVKLIGWGTS +GEDYWL+ANQWN +WGDDGYFKI RGTNECGIEDDVT GLPS+KN Sbjct: 284 GGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLPSTKN 343 Query: 1264 LGAVLPDSD-DAG 1299 + + D D DAG Sbjct: 344 IVREVTDMDVDAG 356 >ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max] Length = 356 Score = 515 bits (1326), Expect = e-143 Identities = 225/309 (72%), Positives = 260/309 (84%) Frame = +1 Query: 364 RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543 ++ I +N NP W+A +N H+S+YT QFKR+LG + TP+ E R+ P + H +SL L Sbjct: 41 QESIAKEINENPEAGWEAAINPHFSNYTVEQFKRLLGVKPTPKKELRSTPAISHPKSLKL 100 Query: 544 PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723 PK FDARTAW QC TIGRILDQGHCGSCWAF AVE+LSDRFCI +++NISLSVNDLLACC Sbjct: 101 PKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACC 160 Query: 724 GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903 GFLCG+GCDGG P+YAW+YL HHGVV+EECDPYFD GCSHPGCEP Y TPKCV+KCV+G Sbjct: 161 GFLCGSGCDGGYPLYAWQYLAHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVSG 220 Query: 904 NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083 NQ+W++SKHYS NAYRV DP+ IM E+YKNGPVEVAF V+EDFAHYKSGVYKH+TG L Sbjct: 221 NQVWKKSKHYSVNAYRVSSDPHDIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGYEL 280 Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263 GGHAVKLIGWGT+ DGEDYWL+ANQWNR WGDDGYFKI RGTNECGIE+DVT GLPS+KN Sbjct: 281 GGHAVKLIGWGTTEDGEDYWLLANQWNREWGDDGYFKIRRGTNECGIEEDVTAGLPSTKN 340 Query: 1264 LGAVLPDSD 1290 L + D D Sbjct: 341 LVREVTDMD 349 >gb|AFK38097.1| unknown [Lotus japonicus] Length = 357 Score = 514 bits (1324), Expect = e-143 Identities = 226/309 (73%), Positives = 261/309 (84%) Frame = +1 Query: 364 RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543 ++ I +N NP W+A ++ +S+YT QFKR+LG + +P+ E R+ P+V H RSL L Sbjct: 42 QESIAKEINENPGAGWEAAISPRFSNYTVAQFKRLLGVKPSPKKELRSTPVVSHPRSLKL 101 Query: 544 PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723 PK FDARTAW QC TIGRILDQGHCGSCWAF AVE+LSDRFCI ++N+SLSVNDLLACC Sbjct: 102 PKSFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHLDVNVSLSVNDLLACC 161 Query: 724 GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903 GFLCG+GCDGG P+YAWRYL HHGVV+EECDPYFD GCSHPGCEP Y TPKCVRKCV G Sbjct: 162 GFLCGSGCDGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKG 221 Query: 904 NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083 NQ+W++SK++S NAY VK DPY IMAE+YKNGPVEVAF V+EDFAHYKSGVYKH+TGS L Sbjct: 222 NQIWKKSKYFSVNAYSVKSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGSQL 281 Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263 GGHAVKLIGWGT+ +GEDYWLIANQWNRSWGDDGYF I RGTNECGIE+DVT GLPS+KN Sbjct: 282 GGHAVKLIGWGTTDEGEDYWLIANQWNRSWGDDGYFMIRRGTNECGIEEDVTAGLPSTKN 341 Query: 1264 LGAVLPDSD 1290 +G + D D Sbjct: 342 MGRWVMDMD 350 >gb|ACJ85175.1| unknown [Medicago truncatula] Length = 359 Score = 514 bits (1323), Expect = e-143 Identities = 229/313 (73%), Positives = 259/313 (82%), Gaps = 1/313 (0%) Frame = +1 Query: 364 RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543 ++ I +N NP W+A +N +S++T GQFKR+LG ++ P+ E + P+V H +SL L Sbjct: 44 QESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVTHPKSLKL 103 Query: 544 PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723 PKEFDAR AW QC TIG+ILDQGHCGSCWAF AVE+L DRFC ++MNISLSVNDLLACC Sbjct: 104 PKEFDARAAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCSHFDMNISLSVNDLLACC 163 Query: 724 GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903 GFLCG GCDGGTPIYAWRYL HHGVV+EECDPYFD GCSHPGCEP Y TPKCVRKCV G Sbjct: 164 GFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKG 223 Query: 904 NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083 NQ+W+ SKHYS AYRVK DP IM E+YKNGPVEVAF VFEDFAHYKSGVYKH+TGS L Sbjct: 224 NQIWKRSKHYSVKAYRVKSDPQDIMTEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSAL 283 Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263 GGHAVKLIGWGTS +GEDYWL+ANQWN +WGDDGYFKI RGTNECGIEDDVT GLPS+KN Sbjct: 284 GGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLPSTKN 343 Query: 1264 LGAVLPDSD-DAG 1299 + + D D DAG Sbjct: 344 IVREVTDMDVDAG 356 >ref|XP_006418358.1| hypothetical protein EUTSA_v10008009mg [Eutrema salsugineum] gi|557096129|gb|ESQ36711.1| hypothetical protein EUTSA_v10008009mg [Eutrema salsugineum] Length = 360 Score = 511 bits (1316), Expect = e-142 Identities = 226/310 (72%), Positives = 256/310 (82%) Frame = +1 Query: 364 RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543 +D I+ VN NPN WKA N +++ T +FKR+LG + TP+ E +P+V HDRSL L Sbjct: 45 QDEIVKEVNENPNAGWKAAFNDRFANATVAEFKRLLGVKPTPKKEYLGVPIVSHDRSLKL 104 Query: 544 PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723 PKEFDARTAW QC +I RILDQGHCGSCWAF AVE+LSDRFCIKYNMNISLSVNDLLACC Sbjct: 105 PKEFDARTAWSQCTSIARILDQGHCGSCWAFGAVESLSDRFCIKYNMNISLSVNDLLACC 164 Query: 724 GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903 GFLCG GC+GG PI AWRY HHGVV+EECDPYFD GCSHPGCEP YPTP+CVRKCV+G Sbjct: 165 GFLCGQGCNGGYPISAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPRCVRKCVSG 224 Query: 904 NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083 NQLWRESKHY +AY+++ +P IMAE+YKNGPVEV F V+EDFAHYKSGVYKH+TGS + Sbjct: 225 NQLWRESKHYGVSAYKIRSNPQDIMAEVYKNGPVEVDFTVYEDFAHYKSGVYKHITGSNI 284 Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263 GGHAVKLIGWGTS DGEDYWL+ANQWNRSWGDDGYFKI RGTNECGIE GLPS +N Sbjct: 285 GGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGAVAGLPSDRN 344 Query: 1264 LGAVLPDSDD 1293 + + SDD Sbjct: 345 VFKGITTSDD 354 >gb|EOX95504.1| Cysteine proteinases superfamily protein [Theobroma cacao] Length = 359 Score = 510 bits (1313), Expect = e-142 Identities = 227/309 (73%), Positives = 255/309 (82%) Frame = +1 Query: 364 RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543 +D I+ VN NP WKA +N S+YT G+FK +LG + TP+ E IP++ H +SL + Sbjct: 42 QDSIVKQVNENPKAGWKAALNPRLSNYTVGEFKHLLGVKPTPKKELLGIPVITHGKSLKV 101 Query: 544 PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723 P +FDARTAWPQC TIGRILDQGHCGSCWAF AVE+LSDRFCI ++MNISLSVNDLLACC Sbjct: 102 PTKFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFSMNISLSVNDLLACC 161 Query: 724 GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903 GFLCG+GCDGG PI AWRY V GVV+EECDPYFD GCSHPGCEP YPTP+CV+KCV G Sbjct: 162 GFLCGSGCDGGYPISAWRYFVRRGVVTEECDPYFDDTGCSHPGCEPAYPTPRCVKKCVKG 221 Query: 904 NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083 NQLWRESKHYS AYR+ DP IMAE+Y NGPVEV+F V+EDFAHYKSGVYKHVTG + Sbjct: 222 NQLWRESKHYSVGAYRINSDPADIMAEVYTNGPVEVSFTVYEDFAHYKSGVYKHVTGGVM 281 Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263 GGHAVKLIGWGTS DGEDYWL+ANQWNR WGDDGYFKISRGTNECGIEDDV GLPS+KN Sbjct: 282 GGHAVKLIGWGTSDDGEDYWLLANQWNRGWGDDGYFKISRGTNECGIEDDVVAGLPSTKN 341 Query: 1264 LGAVLPDSD 1290 L + D D Sbjct: 342 LVREVGDMD 350 >ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana] gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana] gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana] gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana] gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana] Length = 362 Score = 509 bits (1312), Expect = e-141 Identities = 226/307 (73%), Positives = 255/307 (83%) Frame = +1 Query: 373 IIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNLPKE 552 I+ VN NPN WKA N +++ T +FKR+LG + TP++E +P+V HD SL LPKE Sbjct: 50 IVKEVNENPNAGWKASFNDRFANATVAEFKRLLGVKPTPKTEFLGVPIVSHDISLKLPKE 109 Query: 553 FDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACCGFL 732 FDARTAW QC +IGRILDQGHCGSCWAF AVE+LSDRFCIKYNMN+SLSVNDLLACCGFL Sbjct: 110 FDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFL 169 Query: 733 CGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAGNQL 912 CG GC+GG PI AWRY HHGVV+EECDPYFD GCSHPGCEP YPTPKC RKCV+GNQL Sbjct: 170 CGQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQL 229 Query: 913 WRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYLGGH 1092 WRESKHY +AY+V+ P IMAE+YKNGPVEVAF V+EDFAHYKSGVYKH+TG+ +GGH Sbjct: 230 WRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGH 289 Query: 1093 AVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKNLGA 1272 AVKLIGWGTS DGEDYWL+ANQWNRSWGDDGYFKI RGTNECGIE V GLPS +N+ Sbjct: 290 AVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVVK 349 Query: 1273 VLPDSDD 1293 + SDD Sbjct: 350 GITTSDD 356 >ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp. lyrata] gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp. lyrata] Length = 360 Score = 509 bits (1312), Expect = e-141 Identities = 226/307 (73%), Positives = 255/307 (83%) Frame = +1 Query: 373 IIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNLPKE 552 I+ VN NPN WKA N +++ T +FKR+LG + TP++E +P+V HD SL LPKE Sbjct: 48 IVKEVNENPNAGWKAAFNDRFANATVAEFKRLLGVKPTPKTEFLGVPIVSHDISLKLPKE 107 Query: 553 FDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACCGFL 732 FDARTAW QC ++GRILDQGHCGSCWAF AVE+LSDRFCIKYNMNISLSVNDLLACCGFL Sbjct: 108 FDARTAWSQCTSVGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNISLSVNDLLACCGFL 167 Query: 733 CGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAGNQL 912 CG GC+GG PI AWRY HHGVV+EECDPYFD GCSHPGCEP YPTPKC RKCV+GNQL Sbjct: 168 CGQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQL 227 Query: 913 WRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYLGGH 1092 WRESKHY +AY+V+ P IMAE+YKNGPVEVAF V+EDFAHYKSGVYKH+TG+ +GGH Sbjct: 228 WRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGH 287 Query: 1093 AVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKNLGA 1272 AVKLIGWGTS DGEDYWL+ANQWNRSWGDDGYFKI RGTNECGIE V GLPS +N+ Sbjct: 288 AVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVFK 347 Query: 1273 VLPDSDD 1293 + SDD Sbjct: 348 GITTSDD 354 >ref|XP_006305170.1| hypothetical protein CARUB_v10009537mg [Capsella rubella] gi|482573881|gb|EOA38068.1| hypothetical protein CARUB_v10009537mg [Capsella rubella] Length = 360 Score = 508 bits (1309), Expect = e-141 Identities = 226/307 (73%), Positives = 257/307 (83%) Frame = +1 Query: 373 IIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNLPKE 552 I++ VN+NP WKA +N +++ T +FKR+LG + TP++E +P+V H SL LPKE Sbjct: 48 IVNEVNANPKAGWKAALNDRFANATVAEFKRLLGVKPTPKTEFLGVPIVSHGISLKLPKE 107 Query: 553 FDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACCGFL 732 FDARTAW QC +IGRILDQGHCGSCWAF AVE+LSDRFCIKYNMNISLSVNDLLACCGFL Sbjct: 108 FDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNISLSVNDLLACCGFL 167 Query: 733 CGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAGNQL 912 CG GC+GG PI AWRY HHGVV+EECDPYFD GCSHPGCEP YPTPKCVRKCV+GNQL Sbjct: 168 CGQGCNGGYPISAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCVRKCVSGNQL 227 Query: 913 WRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYLGGH 1092 WRESKHY +AY+V+ P IMAE+YKNGPVEVAF V+EDFAHYKSGVYKH+TG+ +GGH Sbjct: 228 WRESKHYGVSAYKVRSHPEDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGH 287 Query: 1093 AVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKNLGA 1272 AVKLIGWGTS DGEDYWL+ANQWNRSWGDDGYFKI RGTNECGIE V GLPS +N+ Sbjct: 288 AVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVFK 347 Query: 1273 VLPDSDD 1293 + SDD Sbjct: 348 GITASDD 354 >ref|XP_006491433.1| PREDICTED: cathepsin B-like isoform X1 [Citrus sinensis] Length = 362 Score = 508 bits (1308), Expect = e-141 Identities = 226/309 (73%), Positives = 254/309 (82%) Frame = +1 Query: 364 RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543 +D II VN NP WKA N +S+YT GQFK +LG + TP+ +P+ HD+SL L Sbjct: 47 QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL 106 Query: 544 PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723 PK FDAR+AWPQC TI RILDQGHCGSCWAF AVEALSDRFCI + MN+SLSVNDLLACC Sbjct: 107 PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACC 166 Query: 724 GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903 GFLCG+GCDGG PI AWRY VHHGVV+EECDPYFD+ GCSHPGCEP YPTPKCVRKCV Sbjct: 167 GFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK 226 Query: 904 NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083 NQLWR SKHYS +AYR+ DP IMAE+YKNGPVEV+F V+EDFAHYKSGVYKH+TG + Sbjct: 227 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 286 Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263 GGHAVKLIGWGTS DGEDYW++ANQWNRSWG DGYFKI RG+NECGIE+DV GLPSSKN Sbjct: 287 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 346 Query: 1264 LGAVLPDSD 1290 L + +D Sbjct: 347 LVKEITSAD 355 >ref|XP_006444663.1| hypothetical protein CICLE_v10020859mg [Citrus clementina] gi|568876746|ref|XP_006491434.1| PREDICTED: cathepsin B-like isoform X2 [Citrus sinensis] gi|557546925|gb|ESR57903.1| hypothetical protein CICLE_v10020859mg [Citrus clementina] Length = 354 Score = 508 bits (1308), Expect = e-141 Identities = 226/309 (73%), Positives = 254/309 (82%) Frame = +1 Query: 364 RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543 +D II VN NP WKA N +S+YT GQFK +LG + TP+ +P+ HD+SL L Sbjct: 39 QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL 98 Query: 544 PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723 PK FDAR+AWPQC TI RILDQGHCGSCWAF AVEALSDRFCI + MN+SLSVNDLLACC Sbjct: 99 PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACC 158 Query: 724 GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903 GFLCG+GCDGG PI AWRY VHHGVV+EECDPYFD+ GCSHPGCEP YPTPKCVRKCV Sbjct: 159 GFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK 218 Query: 904 NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083 NQLWR SKHYS +AYR+ DP IMAE+YKNGPVEV+F V+EDFAHYKSGVYKH+TG + Sbjct: 219 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 278 Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263 GGHAVKLIGWGTS DGEDYW++ANQWNRSWG DGYFKI RG+NECGIE+DV GLPSSKN Sbjct: 279 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 338 Query: 1264 LGAVLPDSD 1290 L + +D Sbjct: 339 LVKEITSAD 347 >ref|XP_002301457.2| putative cathepsin B-like protease family protein [Populus trichocarpa] gi|550345314|gb|EEE80730.2| putative cathepsin B-like protease family protein [Populus trichocarpa] Length = 357 Score = 506 bits (1304), Expect = e-141 Identities = 223/301 (74%), Positives = 251/301 (83%) Frame = +1 Query: 364 RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543 +D I+ VN NP WKA +N H+S+YT QFK +LG + TP+ E R IP++ H +SL L Sbjct: 42 QDSILKKVNGNPKAGWKATMNHHFSNYTVAQFKYLLGVKPTPKEELRGIPVISHPKSLRL 101 Query: 544 PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723 P+EFDARTAWPQC TIG+ILDQGHCGSCWAF AVE+LSDRFCI Y MNISLSVNDLLACC Sbjct: 102 PEEFDARTAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCIHYGMNISLSVNDLLACC 161 Query: 724 GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903 GFLCG+GC+GG PI AWRY VHHGVV+EECDPYFD GCSHPGCEPGYPTPKC RKCV Sbjct: 162 GFLCGSGCNGGYPISAWRYFVHHGVVTEECDPYFDDIGCSHPGCEPGYPTPKCARKCVNK 221 Query: 904 NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083 NQLW++SKHY YR+ DP IMAE+YKNGPVEVAF V+EDFAHYKSGVYKH+TG + Sbjct: 222 NQLWKKSKHYGVKPYRIDSDPDSIMAEIYKNGPVEVAFTVYEDFAHYKSGVYKHITGGMM 281 Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263 GGHAVKLIGWGTS DGE YWL+ANQWNR WGDDG+FKI RGTNECGIE DV GLPS++N Sbjct: 282 GGHAVKLIGWGTSEDGEAYWLLANQWNRGWGDDGFFKIRRGTNECGIEGDVVAGLPSTRN 341 Query: 1264 L 1266 L Sbjct: 342 L 342 >ref|XP_003521632.1| PREDICTED: cathepsin B [Glycine max] Length = 357 Score = 506 bits (1302), Expect = e-140 Identities = 221/302 (73%), Positives = 256/302 (84%) Frame = +1 Query: 385 VNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNLPKEFDAR 564 +N NP W+A +N +S+YT QFKR+LG + P+ E R+ P + H ++L LPK FDAR Sbjct: 49 INENPEAGWEAAINPRFSNYTVEQFKRLLGVKPMPKKELRSTPAISHPKTLKLPKNFDAR 108 Query: 565 TAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACCGFLCGNG 744 TAW QC TIGRILDQGHCGSCWAF AVE+LSDRFCI +++NISLSVNDLLACCGFLCG+G Sbjct: 109 TAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSG 168 Query: 745 CDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAGNQLWRES 924 CDGG P+YAWRYL HHGVV+EECDPYFD GCSHPGCEP Y TPKCV+KCV+GNQ+W++S Sbjct: 169 CDGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVSGNQVWKKS 228 Query: 925 KHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYLGGHAVKL 1104 KHYS +AYRV DP+ IMAE+YKNGPVEVAF V+EDFA+YKSGVYKH+TG LGGHAVKL Sbjct: 229 KHYSVSAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGGHAVKL 288 Query: 1105 IGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKNLGAVLPD 1284 IGWGT+ DGEDYWL+ANQWNR WGDDGYFKI RGTNECGIE+DVT GLPS+KNL + D Sbjct: 289 IGWGTTDDGEDYWLLANQWNREWGDDGYFKIRRGTNECGIEEDVTAGLPSTKNLVREVTD 348 Query: 1285 SD 1290 D Sbjct: 349 MD 350 >gb|AGV54421.1| cathepsin B-like protein [Phaseolus vulgaris] Length = 356 Score = 503 bits (1296), Expect = e-140 Identities = 221/306 (72%), Positives = 256/306 (83%) Frame = +1 Query: 373 IIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNLPKE 552 I +N NP W+A +N +S+YT QFKR+LG ++TP+ E R+ P++ H +SL LP Sbjct: 44 IAKQINENPEAGWEAAINPRFSNYTVEQFKRLLGVKQTPKIELRSTPVISHSKSLKLPVN 103 Query: 553 FDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACCGFL 732 FDARTAW QC TIGRILDQGHCGSCWAF AVE+LSDRFCI +++NISLSVNDLLACCGFL Sbjct: 104 FDARTAWSQCNTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFL 163 Query: 733 CGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAGNQL 912 CG+GCDGG P+YAWRYL HHGVV+EECDPYFD GCSHPGCEP Y TPKCV+KCV GNQL Sbjct: 164 CGSGCDGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVNGNQL 223 Query: 913 WRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYLGGH 1092 W++SKH+S NAY V +P+ IMAE+Y NGPVEVAF V+EDFAHYKSGVYKHVTG LGGH Sbjct: 224 WKKSKHFSVNAYTVNSNPHDIMAEVYTNGPVEVAFTVYEDFAHYKSGVYKHVTGHALGGH 283 Query: 1093 AVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKNLGA 1272 AVKLIGWGT+ DG+DYWL+ANQWNR WGDDGYFKI RGTNECGIE++VT GLPS+KNL Sbjct: 284 AVKLIGWGTTDDGQDYWLLANQWNREWGDDGYFKIRRGTNECGIEEEVTAGLPSTKNLVR 343 Query: 1273 VLPDSD 1290 + D D Sbjct: 344 EVTDMD 349 >ref|XP_006396358.1| hypothetical protein EUTSA_v10028753mg [Eutrema salsugineum] gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila] gi|557097375|gb|ESQ37811.1| hypothetical protein EUTSA_v10028753mg [Eutrema salsugineum] Length = 362 Score = 503 bits (1294), Expect = e-139 Identities = 222/313 (70%), Positives = 258/313 (82%) Frame = +1 Query: 373 IIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNLPKE 552 I+ VN NP+ WKA +N +S+ T +FKR+LG + TP+ +P+V HDRSL LPKE Sbjct: 50 IVKKVNQNPDAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFLGVPIVSHDRSLKLPKE 109 Query: 553 FDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACCGFL 732 FDARTAWPQC +IG ILDQGHCGSCWAF AVE+LSDRFCI++ MNISLSVNDLLACCGF Sbjct: 110 FDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIEFGMNISLSVNDLLACCGFR 169 Query: 733 CGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAGNQL 912 CG+GCDGG PI AW+Y + GVV+EECDPYFD GCSHPGCEP YPTPKC+RKCV+GNQL Sbjct: 170 CGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDDTGCSHPGCEPAYPTPKCMRKCVSGNQL 229 Query: 913 WRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYLGGH 1092 W +SKHYS + Y VK +P IMAE+YKNGPVEV+F V+EDFAHYKSGVYKH+TGS +GGH Sbjct: 230 WSQSKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGH 289 Query: 1093 AVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKNLGA 1272 AVKLIGWGT+ +GEDYWL+ANQWNRSWGDDGYF I RGTNECGIED+ GLPSS+N+ Sbjct: 290 AVKLIGWGTTDEGEDYWLLANQWNRSWGDDGYFMIRRGTNECGIEDEPVAGLPSSRNVFK 349 Query: 1273 VLPDSDDAGYASV 1311 V+ SDD ASV Sbjct: 350 VITGSDDLSVASV 362 >ref|XP_002515139.1| cathepsin B, putative [Ricinus communis] gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis] Length = 376 Score = 500 bits (1287), Expect = e-139 Identities = 221/321 (68%), Positives = 257/321 (80%), Gaps = 17/321 (5%) Frame = +1 Query: 364 RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543 ++ II VN NP+ W+A +N S++T GQFK +LGA+ TP+ E +P++ H ++L L Sbjct: 42 QESIIKKVNENPDAGWEAAMNPQLSNFTVGQFKYLLGAKPTPKKELMGVPMISHPKTLKL 101 Query: 544 PKEFDARTAWPQCRTIGRILDQ-----------------GHCGSCWAFAAVEALSDRFCI 672 PKEFDARTAWP C TIG+IL Q GHCGSCWAF AVE+LSDRFCI Sbjct: 102 PKEFDARTAWPHCSTIGKILGQLLSFYNIFSIFFFLFLEGHCGSCWAFGAVESLSDRFCI 161 Query: 673 KYNMNISLSVNDLLACCGFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPG 852 + MNISLSVNDLLACCGFLCG+GCDGG P+YAWRY VHHGVV+EECDPYFD GCSHPG Sbjct: 162 HFGMNISLSVNDLLACCGFLCGDGCDGGYPMYAWRYFVHHGVVTEECDPYFDNIGCSHPG 221 Query: 853 CEPGYPTPKCVRKCVAGNQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFED 1032 CEPG+PTPKCVRKC+ NQLWR+SKHYS NAYR+ DP+ +MAE+YKNGPVEV+F V+ED Sbjct: 222 CEPGFPTPKCVRKCIDKNQLWRQSKHYSVNAYRISSDPHDVMAEVYKNGPVEVSFTVYED 281 Query: 1033 FAHYKSGVYKHVTGSYLGGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTN 1212 FAHYKSGVYKH+TG +GGHAVKLIGWGTS +GEDYWL+ANQWNR WGDDGYFKI RGTN Sbjct: 282 FAHYKSGVYKHITGEVMGGHAVKLIGWGTSDNGEDYWLLANQWNRGWGDDGYFKIRRGTN 341 Query: 1213 ECGIEDDVTGGLPSSKNLGAV 1275 ECGIEDD GLPS++NL V Sbjct: 342 ECGIEDDAVAGLPSARNLDLV 362 >gb|EXB94879.1| Cathepsin B [Morus notabilis] Length = 420 Score = 498 bits (1283), Expect = e-138 Identities = 215/301 (71%), Positives = 251/301 (83%) Frame = +1 Query: 364 RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543 ++ I+ VN NP W+A +N +S++T G+F+R+LG + TP+ E + P++ H +SL L Sbjct: 44 QESIVKRVNENPEAGWRAEMNPRFSNFTAGEFRRLLGVKETPKHELESTPVITHPKSLKL 103 Query: 544 PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723 P +FDARTAWPQC TI RILDQGHCGSCWAF AVE+LSDRFCI +N NISLSVND+LACC Sbjct: 104 PDKFDARTAWPQCSTIKRILDQGHCGSCWAFGAVESLSDRFCIHFNTNISLSVNDVLACC 163 Query: 724 GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903 GFLCG GCDGGTP++AWRYL HHGVV+EECDPYFD GCSHPGCEP YPTP+C RKCV Sbjct: 164 GFLCGAGCDGGTPLFAWRYLHHHGVVTEECDPYFDNTGCSHPGCEPAYPTPRCHRKCVNK 223 Query: 904 NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083 N LWR+SKHYS NAY++ DP+ IMAE+YKNGPVEV F V+EDFAHYKSGVYKH+TGS + Sbjct: 224 NNLWRQSKHYSVNAYKISSDPHSIMAEVYKNGPVEVDFTVYEDFAHYKSGVYKHITGSVM 283 Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263 GGHAVKLIGWGTS GEDYWL+ANQWNRSWGDDGYFKI RGTNECGIE D G+PS +N Sbjct: 284 GGHAVKLIGWGTSDTGEDYWLVANQWNRSWGDDGYFKIRRGTNECGIEKDAVAGMPSKRN 343 Query: 1264 L 1266 L Sbjct: 344 L 344 >ref|XP_006840260.1| hypothetical protein AMTR_s00045p00036790 [Amborella trichopoda] gi|548841978|gb|ERN01935.1| hypothetical protein AMTR_s00045p00036790 [Amborella trichopoda] Length = 351 Score = 497 bits (1280), Expect = e-138 Identities = 217/316 (68%), Positives = 261/316 (82%) Frame = +1 Query: 364 RDLIIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNL 543 +D II+ +N NPN W+A +N +S+YT GQFK ILG + P++ +P+ +++++ L Sbjct: 37 QDSIIEKINGNPNAGWQAALNPRFSNYTIGQFKYILGVKPVPQNSLVPVPIRRYEKTVKL 96 Query: 544 PKEFDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACC 723 PK+FDARTAW QC TI RILDQGHCGSCWAF AVE+LSDRFCI + MNISLSVNDLL+CC Sbjct: 97 PKDFDARTAWTQCATISRILDQGHCGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLSCC 156 Query: 724 GFLCGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCVAG 903 GF+CG+GCDGG PIYAWRY V +GVV+EECDPYFD GCSHPGCEPG+PTP+C RKC Sbjct: 157 GFMCGDGCDGGYPIYAWRYFVQNGVVTEECDPYFDDIGCSHPGCEPGFPTPQCERKCKVK 216 Query: 904 NQLWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYL 1083 NQLW+ESKH+S NAYR+ DP IMAE+YKNGPVEVAF V+EDFAHYKSG+YKH+TG + Sbjct: 217 NQLWQESKHFSVNAYRIDSDPSSIMAEVYKNGPVEVAFTVYEDFAHYKSGIYKHITGGIM 276 Query: 1084 GGHAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKN 1263 GGHAVKLIGWGTS +GEDYWL+ANQWNR WGDDGYFKI RGTNECGIE+DV G+PS+KN Sbjct: 277 GGHAVKLIGWGTSEEGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEEDVVAGMPSTKN 336 Query: 1264 LGAVLPDSDDAGYASV 1311 L + D D+G+ +V Sbjct: 337 LIKNMADG-DSGHVTV 351 >gb|ESW35262.1| hypothetical protein PHAVU_001G220100g [Phaseolus vulgaris] Length = 357 Score = 496 bits (1276), Expect = e-137 Identities = 220/307 (71%), Positives = 255/307 (83%), Gaps = 1/307 (0%) Frame = +1 Query: 373 IIDSVNSNPNVRWKAGVNQHYSDYTDGQFKRILGARRTPESEKRTIPLVHHDRSLNLPKE 552 I +N NP W+A +N +S+YT QFKR+LG ++TP+ E R+ P++ H +SL LP Sbjct: 44 IAKQINENPEAGWEAALNPRFSNYTVEQFKRLLGVKQTPKIELRSTPVISHPKSLKLPVN 103 Query: 553 FDARTAWPQCRTIGRILDQGHCGSCWAFAAVEALSDRFCIKYNMNISLSVNDLLACCGFL 732 FDAR AW QC TIGRILDQGHCGSCWAF AVE+LSDRFCI +++NISLSVNDLLACCGFL Sbjct: 104 FDARKAWSQCNTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFL 163 Query: 733 CGNGCDGGTPIYAWRYLVHHGVVSEECDPYFDTEGCSHPGCEPGYPTPKCVRKCV-AGNQ 909 CG+GCDGG P+YAWRYL HHGVV+EECDPYFD GCSHPGCEP Y TPKCV+KCV GNQ Sbjct: 164 CGSGCDGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVDGGNQ 223 Query: 910 LWRESKHYSKNAYRVKRDPYQIMAELYKNGPVEVAFDVFEDFAHYKSGVYKHVTGSYLGG 1089 LW++SKH+S NAY V +P+ IMAE+Y NGPVEVAF V+EDFAHYKSGVYKHVTG LGG Sbjct: 224 LWKKSKHFSVNAYTVNSNPHDIMAEVYTNGPVEVAFTVYEDFAHYKSGVYKHVTGYALGG 283 Query: 1090 HAVKLIGWGTSADGEDYWLIANQWNRSWGDDGYFKISRGTNECGIEDDVTGGLPSSKNLG 1269 HAVKLIGWGT+ DG+DYWL+ANQWNR WGDDGYFKI RGTNECGIE++VT GLPS+KNL Sbjct: 284 HAVKLIGWGTTDDGQDYWLLANQWNREWGDDGYFKIRRGTNECGIEEEVTAGLPSTKNLV 343 Query: 1270 AVLPDSD 1290 + D D Sbjct: 344 REVTDMD 350