BLASTX nr result
ID: Akebia27_contig00017535
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia27_contig00017535 (1490 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] 586 e-164 ref|XP_002307688.2| cysteine protease family protein [Populus tr... 582 e-163 ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087... 575 e-161 ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C... 573 e-161 ref|XP_007136041.1| hypothetical protein PHAVU_009G013000g [Phas... 570 e-160 ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr... 570 e-160 ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C... 568 e-159 ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S... 563 e-158 gb|AGV54418.1| oryzain alpha chain-like protein [Phaseolus vulga... 562 e-157 gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] 560 e-157 ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine... 559 e-156 ref|XP_002510459.1| cysteine protease, putative [Ricinus communi... 556 e-156 ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S... 551 e-154 gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A... 551 e-154 ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr... 550 e-154 ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha... 550 e-154 ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab... 549 e-153 ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a... 548 e-153 ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prun... 547 e-153 ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [F... 544 e-152 >dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] Length = 441 Score = 586 bits (1510), Expect = e-164 Identities = 279/426 (65%), Positives = 327/426 (76%), Gaps = 9/426 (2%) Frame = +3 Query: 60 FSSFTSD---LFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNA 230 FSS +S+ LF +WC+Q+GK Y+S+EEKL+R VF+DN F+T HN+ NS+YTL+LNA Sbjct: 19 FSSSSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNA 78 Query: 231 FADLTHHEFKSSRFGLSLAAS---NLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQ 401 FADLTHHEFK+SR GLS AAS N+DRSN Q+ F DVP+S+DWRK GAVT VKDQ Sbjct: 79 FADLTHHEFKASRLGLSSAASASLNVDRSNRQIPDFVA--DVPASVDWRKNGAVTQVKDQ 136 Query: 402 ASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDN 581 +CGACW+FSATGAIEGIN+IV+GSLVSLSEQELVDCDK+YN+GC GG+MDYAFQFV+DN Sbjct: 137 GNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDN 196 Query: 582 KGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGS 758 GIDTE+DYPYQ + SCNK KLKRHVVTIDGY D+P + E+E+++AVA+QPVSVGICGS Sbjct: 197 HGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGS 256 Query: 759 ERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNT 938 ERAFQ YSKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG WGMDGYMHM RN+ Sbjct: 257 ERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNS 316 Query: 939 GDQLGICGINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWK 1112 G G+CGIN LA +C L T+CG GETCCC + GIC WK Sbjct: 317 GSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWK 376 Query: 1113 CCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGGWNSL 1292 CCE++SAVCCKD R CCP DYPVCDT +C K N+T ++ K S GK W+SL Sbjct: 377 CCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKF-AKNSSSGKFRSWSSL 435 Query: 1293 FEAWNL 1310 E W L Sbjct: 436 LEGWIL 441 >ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa] gi|550339725|gb|EEE94684.2| cysteine protease family protein [Populus trichocarpa] Length = 436 Score = 582 bits (1501), Expect = e-163 Identities = 272/417 (65%), Positives = 326/417 (78%), Gaps = 3/417 (0%) Frame = +3 Query: 63 SSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 242 SS S LF +WC+++GK+Y+S+EE+ +R VFEDN F+T HN+ NS+Y+LALNAFADL Sbjct: 22 SSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADL 81 Query: 243 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 422 THHEFK+SR GLS A NL N ++TG V D+P+SIDWR KG VT VKDQ SCGACW Sbjct: 82 THHEFKTSRLGLSAAPLNLAHRNLEITG--VVGDIPASIDWRNKGVVTNVKDQGSCGACW 139 Query: 423 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 602 +FSATGAIEGIN+IV+GSLVSLSEQEL++CDK+YN GCGGGLMDYAFQFV++N GIDTE+ Sbjct: 140 SFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEE 199 Query: 603 DYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQSY 779 DYPY+A+ +CNK+++KR VVTID Y D+P + E++++QAVA+QPVSVGICGSERAFQ Y Sbjct: 200 DYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMY 259 Query: 780 SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 959 SKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG WGM GYMHM RN+G+ G+C Sbjct: 260 SKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVC 319 Query: 960 GINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1133 GIN LA KC+LLTYC AGETCCC R+ GIC WKCC ++SA Sbjct: 320 GINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKCCGLDSA 379 Query: 1134 VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGGWNSLFEAW 1304 VCCKD CCPHDYPVCDT +CFK N+T ++ +E K + GK G WNSL EAW Sbjct: 380 VCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGK--TSGKFGSWNSLPEAW 434 >ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao] Length = 438 Score = 575 bits (1483), Expect = e-161 Identities = 273/418 (65%), Positives = 317/418 (75%), Gaps = 3/418 (0%) Frame = +3 Query: 66 SFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLT 245 S S LF +WC+Q+GK YSSEEEK YR VFE+N AF+T HN + NS+Y+LALNAFADLT Sbjct: 24 SHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSSYSLALNAFADLT 83 Query: 246 HHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWA 425 HHEFK+SR GLS AA R N QL G + RD+P+S+DWR KGAVT VKDQ SCGACW+ Sbjct: 84 HHEFKASRLGLSAAAIEGSRPNLQLPG--LVRDIPASMDWRTKGAVTKVKDQGSCGACWS 141 Query: 426 FSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKD 605 FSATGAIEGIN+IV+G+LVSLSEQELVDCD++YNSGC GGLMDYA+QFV+DN GID E+D Sbjct: 142 FSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHGIDNEED 201 Query: 606 YPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYS 782 YPY + +CNK K KR VVTIDGY +P+ E+ ++QAVA QPVSVGICGSERAFQ YS Sbjct: 202 YPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSERAFQLYS 261 Query: 783 KGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICG 962 KGIF G CS+SLDHAVLIVGYGSENGVDYWI+KNSWG WGM+GY+HM RN+GD G+CG Sbjct: 262 KGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGDSKGLCG 321 Query: 963 INTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAV 1136 IN LA KC L TYC AGETCCCT R+ GICF WKCCE++SAV Sbjct: 322 INMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCCELDSAV 381 Query: 1137 CCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGGWNSLFEAWNL 1310 CCKD+R CCP+DYPVCDTK C K N+T ++ E K+ S K W E W L Sbjct: 382 CCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFE-KRHSTRKFSSWRPFVENWVL 438 >ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 431 Score = 573 bits (1476), Expect = e-161 Identities = 270/412 (65%), Positives = 321/412 (77%), Gaps = 3/412 (0%) Frame = +3 Query: 63 SSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 242 +S S+LF WC ++GK+YSS EEKLYR VF DN F+T+HNN++NS+YTL+LN++ADL Sbjct: 22 TSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADL 81 Query: 243 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 422 THHEFK SR G S A N Q S+ RDVP S+DWRKKGAVT VKDQ SCGACW Sbjct: 82 THHEFKVSRLGFSPALRNFRPVLPQEP--SLPRDVPDSLDWRKKGAVTAVKDQGSCGACW 139 Query: 423 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 602 +FSATGA+EGINQI++GSL+SLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDTE Sbjct: 140 SFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDTEN 199 Query: 603 DYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEE-EIMQAVASQPVSVGICGSERAFQSY 779 DYPYQA+ SC K+KL+R+VVTIDGY D+PS +E +++QAVA+QPVSVGICGSERAFQ Y Sbjct: 200 DYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLY 259 Query: 780 SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 959 SKGIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGKSWGMDGYMHM RN+G+ G+C Sbjct: 260 SKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEGVC 319 Query: 960 GINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1133 GIN LA KCS+LT C AGETCCC ++ LG+C WKCC ++SA Sbjct: 320 GINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKCCGLSSA 379 Query: 1134 VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGGWNS 1289 VCCKD R CCP DYP+CDT LC K T+N T + LE + S G G W+S Sbjct: 380 VCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENRSSS-GSSGTWSS 430 >ref|XP_007136041.1| hypothetical protein PHAVU_009G013000g [Phaseolus vulgaris] gi|561009128|gb|ESW08035.1| hypothetical protein PHAVU_009G013000g [Phaseolus vulgaris] Length = 428 Score = 570 bits (1469), Expect = e-160 Identities = 273/412 (66%), Positives = 310/412 (75%), Gaps = 3/412 (0%) Frame = +3 Query: 63 SSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHN---NMENSTYTLALNAF 233 +S TSDLF WC+++ K YSSEEEK YRF VFEDN AF++ HN N NSTYTL+LNAF Sbjct: 20 ASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNRNANDNNSTYTLSLNAF 79 Query: 234 ADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCG 413 ADLTHHEFK+SR G S + R +Q + PS IDWR+ GAVTPVKDQASCG Sbjct: 80 ADLTHHEFKTSRLGFSPSLLRFKRVQNQQPRHLLHN--PSQIDWRQSGAVTPVKDQASCG 137 Query: 414 ACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGID 593 ACWAFSATGAIEGIN+IV+GSL SLSEQELVDCD +YNSGC GGLMDYA+QFV+DNKGID Sbjct: 138 ACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYNSGCEGGLMDYAYQFVIDNKGID 197 Query: 594 TEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGSERAFQ 773 TE DYPYQA+ CNK+KLKRH+VTID Y DLP EEE+++AVASQPVSVGICGSERAFQ Sbjct: 198 TEDDYPYQARQRPCNKDKLKRHIVTIDDYVDLPPNEEELLKAVASQPVSVGICGSERAFQ 257 Query: 774 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 953 YS+GIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RNTGD G Sbjct: 258 LYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMEGYIHMIRNTGDPKG 317 Query: 954 ICGINTLAXXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1133 ICGINTLA +C+L T+C GETCCC + LGICF WKCC + SA Sbjct: 318 ICGINTLASYPIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWKCCGLTSA 377 Query: 1134 VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGGWNS 1289 VCCKD R CCP DYP+CDT+ C K T +T + K K GW S Sbjct: 378 VCCKDKRHCCPRDYPICDTEKSQCLKITNGTTTI--TSGNKDISNKPRGWKS 427 >ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] gi|557537201|gb|ESR48319.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] Length = 441 Score = 570 bits (1469), Expect = e-160 Identities = 269/420 (64%), Positives = 321/420 (76%), Gaps = 5/420 (1%) Frame = +3 Query: 60 FSSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFAD 239 + S ++LF +WC+Q+GK YSSE+EK R +FEDN AF+T HNNM NS++TL+LNAFAD Sbjct: 21 YCSDINELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80 Query: 240 LTHHEFKSSRFGLSLAASNLDRS-NSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGA 416 LTH EFK+S G S A+ + DR N+ + RDVP+SIDWRKKGAVT VKDQASCGA Sbjct: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGTLRDVPASIDWRKKGAVTEVKDQASCGA 140 Query: 417 CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 596 CWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDT Sbjct: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200 Query: 597 EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 773 EKDYPY+ + CNK KL RH+VTIDGY D+P + E++++QAV +QPVSVGICGSERAFQ Sbjct: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260 Query: 774 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 953 YS GIF G CSTSLDHAVLIVGY SENGVDYWI+KNSWG+SWGM+GYMHM RNTG+ LG Sbjct: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320 Query: 954 ICGINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 1127 ICGIN LA +CSLLTYC AGETCCC +LGIC WKCC + Sbjct: 321 ICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFS 380 Query: 1128 SAVCCKDHRFCCPHDYPVCDTKNKLCF-KGTVNSTMVKGLERKKGSFGKLGGWNSLFEAW 1304 SAVCC DHR+CCP +YP+CD+ C + T N T + +E +GS K G W+S + W Sbjct: 381 SAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAIE-MRGSSWKFGSWSSFIDVW 439 >ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis] Length = 441 Score = 568 bits (1464), Expect = e-159 Identities = 268/420 (63%), Positives = 321/420 (76%), Gaps = 5/420 (1%) Frame = +3 Query: 60 FSSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFAD 239 + S ++LF +WC+Q+GK YSSE+EK R +FEDN AF+T HNNM NS++TL+LNAFAD Sbjct: 21 YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80 Query: 240 LTHHEFKSSRFGLSLAASNLDRS-NSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGA 416 LTH EFK+S G S A+ + DR N+ + RDVP+SIDWRKKGAVT VKDQASCGA Sbjct: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140 Query: 417 CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 596 CWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDT Sbjct: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200 Query: 597 EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 773 EKDYPY+ + CNK KL RH+VTIDGY D+P + E++++QAV +QPVSVGICGSERAFQ Sbjct: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260 Query: 774 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 953 YS GIF G CSTSLDHAVLI+GY SENGVDYWI+KNSWG+SWGM+GYMHM RNTG+ LG Sbjct: 261 LYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320 Query: 954 ICGINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 1127 ICGIN LA +CSLLTYC GETCCC +LGIC WKCC + Sbjct: 321 ICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCCCGSSILGICLSWKCCGFS 380 Query: 1128 SAVCCKDHRFCCPHDYPVCDTKNKLCF-KGTVNSTMVKGLERKKGSFGKLGGWNSLFEAW 1304 SAVCC DHR+CCP +YP+CD+ C + T N T + +E +GS K G W+S +AW Sbjct: 381 SAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIE-MRGSSWKFGSWSSFIDAW 439 >ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum] Length = 439 Score = 563 bits (1452), Expect = e-158 Identities = 265/416 (63%), Positives = 322/416 (77%), Gaps = 8/416 (1%) Frame = +3 Query: 57 CFSSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFA 236 C S SDLF +WC+QNGK YSSE+E++YRF VFE+N A+IT HN+ ENS+YTL LNA++ Sbjct: 20 CTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYITEHNSKENSSYTLGLNAYS 79 Query: 237 DLTHHEFKSSRFGLSLAASNLDR-----SNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQ 401 DLTHHEF++S GLS +A++ R S S TG D PSS+DWR+KGAVT VK+Q Sbjct: 80 DLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSETGVLSDVDAPSSLDWREKGAVTDVKNQ 139 Query: 402 ASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDN 581 SCGACW+FSATGA+EGIN+I +GSLVSLSEQEL+DCD++YN GCGGGLMDYAF+FV+ N Sbjct: 140 GSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRSYNEGCGGGLMDYAFEFVIKN 199 Query: 582 KGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGS 758 GIDTEKDYP++ + +CNKNKL+RHVVTIDGYTD+P +E+ +++AVA+QPVSVGICGS Sbjct: 200 GGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICGS 259 Query: 759 ERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNT 938 RAFQSYSKGIF G CST+LDHAVLIVGYGSENGVDYWI+KNSWG SWG++GY+HM RN+ Sbjct: 260 ARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYIHMQRNS 319 Query: 939 GDQLGICGINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWK 1112 G+Q GICGIN LA KCS+ T CG GETCCC + LGIC WK Sbjct: 320 GNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSCGQGETCCCGSKFLGICLSWK 379 Query: 1113 CCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGG 1280 CC ++SAVCCKD R CCP DYP+CDT LC K N+T+V+ +K+ GK GG Sbjct: 380 CCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATIVQ-QPQKEAFTGKFGG 434 >gb|AGV54418.1| oryzain alpha chain-like protein [Phaseolus vulgaris] Length = 467 Score = 562 bits (1449), Expect = e-157 Identities = 270/412 (65%), Positives = 309/412 (75%), Gaps = 3/412 (0%) Frame = +3 Query: 63 SSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHN---NMENSTYTLALNAF 233 +S TSDLF WC+++ K YSSEEEK YRF VFEDN AF++ HN N NSTYTL+LNAF Sbjct: 59 ASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNRNANDNNSTYTLSLNAF 118 Query: 234 ADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCG 413 ADLTHHEFK+SR G S + R +Q + PS IDWR+ GAVTPVKDQASCG Sbjct: 119 ADLTHHEFKTSRLGFSPSLLRFKRVQNQQPRHLLHN--PSQIDWRQSGAVTPVKDQASCG 176 Query: 414 ACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGID 593 ACWAFSATGAIEGIN+IV+GSL SLSEQELVDCD +YNSGC GGLMD+A+QFV+DNKGID Sbjct: 177 ACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYNSGCEGGLMDFAYQFVIDNKGID 236 Query: 594 TEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGSERAFQ 773 TE DYPYQA+ SC+K+KLKR VTI+ Y D+P EEEI++AVASQPVSVGICGSERAFQ Sbjct: 237 TEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEEILKAVASQPVSVGICGSERAFQ 296 Query: 774 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 953 YS+GIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WG+DGY+HM RNTGD G Sbjct: 297 LYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGIDGYIHMIRNTGDPKG 356 Query: 954 ICGINTLAXXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1133 ICGINTLA +C+L T+C GETCCC + LGICF WKCC + SA Sbjct: 357 ICGINTLASYPIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWKCCGLTSA 416 Query: 1134 VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGGWNS 1289 VCCKD R CCP DYP+CDT+ C K T +T + K K GW S Sbjct: 417 VCCKDKRHCCPRDYPICDTEKSQCLKITNGTTTI--TSGNKDISNKPRGWKS 466 >gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] Length = 517 Score = 560 bits (1442), Expect = e-157 Identities = 264/384 (68%), Positives = 306/384 (79%), Gaps = 4/384 (1%) Frame = +3 Query: 72 TSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHH 251 +S LF +WCE++G++YSSEEE+LYR +VFEDNLAF+T HNNM NS+YTL+LNAFADLTHH Sbjct: 26 SSQLFEAWCEKHGQSYSSEEERLYRLTVFEDNLAFVTQHNNMGNSSYTLSLNAFADLTHH 85 Query: 252 EFKSSRFGLSLAA-SNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWAF 428 EFKSSR G S A S+L + S+L RDVP+S+DWRKKGAVT VKDQ SCGACWAF Sbjct: 86 EFKSSRLGFSSALLSSLPKLGSKLLDL---RDVPASLDWRKKGAVTNVKDQGSCGACWAF 142 Query: 429 SATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDY 608 SATGAIEGIN+IV+GSLVSLSEQEL+DCD +YN+GC GGLMDYA+QFV+DN GIDTE+DY Sbjct: 143 SATGAIEGINKIVTGSLVSLSEQELIDCDTSYNAGCDGGLMDYAYQFVIDNHGIDTEEDY 202 Query: 609 PYQAKGTSCNKNKLKRHVVTIDGYTDL-PSKEEEIMQAVASQPVSVGICGSERAFQSYSK 785 PYQA+ SC K KLKR VVTIDGYTD+ P+ +++QAV +QPVSVGICGSERAFQ YSK Sbjct: 203 PYQARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQLLQAVVTQPVSVGICGSERAFQLYSK 262 Query: 786 GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGI 965 GIF G CSTSLDHAVLIVGY SENGVDYWI+KNSWGK WGMDGY+HM RNTG+ G+CGI Sbjct: 263 GIFTGPCSTSLDHAVLIVGYDSENGVDYWIVKNSWGKQWGMDGYIHMQRNTGNSQGVCGI 322 Query: 966 NTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVC 1139 N LA +CS CG GETCCC+ R LG+CF WKCC +NSAVC Sbjct: 323 NMLASYPTKTSPNPPPSPSPGPTRCSFFAQCGEGETCCCSWRFLGLCFSWKCCGLNSAVC 382 Query: 1140 CKDHRFCCPHDYPVCDTKNKLCFK 1211 CKD CCP DYP+CDT+ +C K Sbjct: 383 CKDKIHCCPQDYPLCDTQRNVCLK 406 >ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max] Length = 439 Score = 559 bits (1440), Expect = e-156 Identities = 270/419 (64%), Positives = 311/419 (74%), Gaps = 10/419 (2%) Frame = +3 Query: 63 SSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHN-----NMENSTYTLALN 227 +S TS+LF WC+++ K YSSEEEKLYR VFEDN AF+ HN N NS+YTL+LN Sbjct: 26 ASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLN 85 Query: 228 AFADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRD---VPSSIDWRKKGAVTPVKD 398 AFADLTHHEFK++R GL L R +Q + RD +PS IDWR+ GAVTPVKD Sbjct: 86 AFADLTHHEFKTTRLGLPLTLLRFKRPQNQQS-----RDLLHIPSQIDWRQSGAVTPVKD 140 Query: 399 QASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVD 578 QASCGACWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD +YNSGCGGGLMD+A+QFV+D Sbjct: 141 QASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVID 200 Query: 579 NKGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGS 758 NKGIDTE DYPYQA+ SC+K+KLKR VTI+ Y D+P EEEI++AVASQPVSVGICGS Sbjct: 201 NKGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEEILKAVASQPVSVGICGS 260 Query: 759 ERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNT 938 ER FQ YSKGIF G CST LDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RN+ Sbjct: 261 EREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNS 320 Query: 939 GDQLGICGINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWK 1112 G+ GICGINTLA +C+L T+C GETCCC + LGICF WK Sbjct: 321 GNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFLGICFSWK 380 Query: 1113 CCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGGWNS 1289 CC + SAVCCKD R CCP DYP+CDT+ C K T N T E + S K GW S Sbjct: 381 CCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQDFSH-KSRGWKS 438 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 556 bits (1434), Expect = e-156 Identities = 262/393 (66%), Positives = 313/393 (79%), Gaps = 5/393 (1%) Frame = +3 Query: 63 SSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 242 SS S LF SW +++GK Y+S+E+KLYRF +FE+N F+ HN+ NS+YTL+LNAFADL Sbjct: 25 SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADL 84 Query: 243 THHEFKSSRFGLSLAASN--LDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGA 416 THHEFK+SR GLS +++ L R N L F DVP SIDWRKKGAV+ VKDQ +CGA Sbjct: 85 THHEFKASRLGLSAFSTSGKLSRRNFPLHDF--VGDVPISIDWRKKGAVSQVKDQGNCGA 142 Query: 417 CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 596 CW+FSATGAIEGIN+IV+GSLVSLSEQELVDCD++YN+GC GGLMDYA+QFV++N GIDT Sbjct: 143 CWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGIDT 202 Query: 597 EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 773 E+DYPYQA+ +CNK KLKRHVVTIDGYTD+P + E+E+++AVA+QPVSVGICGSERAFQ Sbjct: 203 EEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQ 262 Query: 774 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 953 YSKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG WG++GYM+M RN+G+ G Sbjct: 263 LYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQG 322 Query: 954 ICGINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 1127 +CGIN LA KC L T CG GETCCCTRR+ G+CF WKCCE++ Sbjct: 323 LCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKCCELD 382 Query: 1128 SAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNS 1226 SAVCCKD CCPHDYPVCDTK +C K ++ S Sbjct: 383 SAVCCKDGLHCCPHDYPVCDTKRNMCLKVSIFS 415 >ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum] Length = 439 Score = 551 bits (1421), Expect = e-154 Identities = 261/417 (62%), Positives = 315/417 (75%), Gaps = 8/417 (1%) Frame = +3 Query: 54 ICFSSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAF 233 +C S SDLF +WC+QNGK YSSE+E++YRF VFE+N A+IT HN+ NS+YTL LNA+ Sbjct: 19 LCTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYITEHNSKGNSSYTLGLNAY 78 Query: 234 ADLTHHEFKSSRFGLSLAASNLDR-----SNSQLTGFSVFRDVPSSIDWRKKGAVTPVKD 398 +DLTHHEF++S GLS +A++ R S S G D PSS+DWR KGAVT VK+ Sbjct: 79 SDLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSAAGVLSDVDAPSSLDWRDKGAVTNVKN 138 Query: 399 QASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVD 578 Q SCGACW+FSATGAIEGIN+I +GSLVSLSEQEL+DCD++YN GCGGGLMDYAF+FV+ Sbjct: 139 QGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRSYNQGCGGGLMDYAFEFVIK 198 Query: 579 NKGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICG 755 N GIDTEKDYP++ K +CNKNKL+R VVTIDGYTD+P +E+ +++AVA+QPVSVGICG Sbjct: 199 NGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICG 258 Query: 756 SERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARN 935 S RAFQSYSKGIF G C T LDHAVLIVGYGSENG DYWI+KNSWG SWG++GY+HM RN Sbjct: 259 SARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWIIKNSWGTSWGINGYIHMQRN 318 Query: 936 TGDQLGICGINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHW 1109 +G+Q GICG+N LA KCS T CG GETCCC + LGIC W Sbjct: 319 SGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSCGQGETCCCGLKFLGICLSW 378 Query: 1110 KCCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGG 1280 KCC ++SAVCCKD R CCP DYP+CDT LC K N+T+V+ +K+ GK GG Sbjct: 379 KCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATIVQ-QPQKEPFTGKFGG 434 >gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] Length = 437 Score = 551 bits (1420), Expect = e-154 Identities = 261/409 (63%), Positives = 312/409 (76%), Gaps = 6/409 (1%) Frame = +3 Query: 75 SDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHHE 254 S+LF+ WC+++GK Y SEEE+ R +F+DN F+T HN + N+TY+L+LNAFADLTHHE Sbjct: 29 SELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88 Query: 255 FKSSRFGLSLAA-SNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 431 FK+SR GLS++A S + S Q G SV VP S+DWRKKGAVT VKDQ SCGACW+FS Sbjct: 89 FKASRLGLSVSAPSVIMASKGQSLGGSV--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146 Query: 432 ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 611 ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206 Query: 612 YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSKG 788 YQ + +C K+KLK+ VVTID Y + S +E+ +M+AVA+QPVSVGICGSERAFQ YS+G Sbjct: 207 YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSRG 266 Query: 789 IFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGIN 968 IF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNT + G+CGIN Sbjct: 267 IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGIN 326 Query: 969 TLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVCC 1142 LA KC+L TYC +GETCCC R L G+CF WKCCE+ SAVCC Sbjct: 327 MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCC 386 Query: 1143 KDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGS--FGKLGGW 1283 KD R CCPHDYPVCDT LC K T N T +K +K S G+ W Sbjct: 387 KDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSSKQLGRFEEW 435 >ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] gi|557095297|gb|ESQ35879.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] Length = 444 Score = 550 bits (1418), Expect = e-154 Identities = 265/415 (63%), Positives = 311/415 (74%), Gaps = 5/415 (1%) Frame = +3 Query: 75 SDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHHE 254 ++LF+ WC ++GK Y SEEE+ +R +F DN F+T HN++ NSTY+L+LNAFADLTHHE Sbjct: 34 AELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHNHISNSTYSLSLNAFADLTHHE 93 Query: 255 FKSSRFGLSLAASNLDRSNSQLTGFS--VFRDVPSSIDWRKKGAVTPVKDQASCGACWAF 428 FK+SR GLS + +L + Q G S V VP S+DWRKKGAVT VKDQ SCGACW+F Sbjct: 94 FKASRLGLSAPSPSL-MAKEQSLGVSERVRVKVPDSVDWRKKGAVTNVKDQGSCGACWSF 152 Query: 429 SATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDY 608 SATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDY Sbjct: 153 SATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDY 212 Query: 609 PYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSK 785 PYQ + +C K+KLK+ VVTID Y + S E+ +M+AVASQPVSVGICGSERAFQ YS Sbjct: 213 PYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEAVASQPVSVGICGSERAFQLYSS 272 Query: 786 GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGI 965 GIF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNTG+ G+CGI Sbjct: 273 GIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGVCGI 332 Query: 966 NTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVC 1139 N LA KC+L TYC +GETCCC R L G+CF WKCCE+ SAVC Sbjct: 333 NMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARTLFGLCFSWKCCELESAVC 392 Query: 1140 CKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGGWNSLFEAW 1304 CKD R CCP DYPVCDT LC K T N T +K KK S KLG FE W Sbjct: 393 CKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPF-WKKNSSNKLG----RFEEW 442 >ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana] gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana] gi|332190386|gb|AEE28507.1| papain-like cysteine peptidase [Arabidopsis thaliana] Length = 437 Score = 550 bits (1418), Expect = e-154 Identities = 261/409 (63%), Positives = 311/409 (76%), Gaps = 6/409 (1%) Frame = +3 Query: 75 SDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHHE 254 S+LF+ WC+++GK Y SEEE+ R +F+DN F+T HN + N+TY+L+LNAFADLTHHE Sbjct: 29 SELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88 Query: 255 FKSSRFGLSLAA-SNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 431 FK+SR GLS++A S + S Q G SV VP S+DWRKKGAVT VKDQ SCGACW+FS Sbjct: 89 FKASRLGLSVSAPSVIMASKGQSLGGSV--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146 Query: 432 ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 611 ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206 Query: 612 YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSKG 788 YQ + +C K+KLK+ VVTID Y + S +E+ +M+AVA+QPVSVGICGSERAFQ YS G Sbjct: 207 YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSSG 266 Query: 789 IFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGIN 968 IF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNT + G+CGIN Sbjct: 267 IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGIN 326 Query: 969 TLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVCC 1142 LA KC+L TYC +GETCCC R L G+CF WKCCE+ SAVCC Sbjct: 327 MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCC 386 Query: 1143 KDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGS--FGKLGGW 1283 KD R CCPHDYPVCDT LC K T N T +K +K S G+ W Sbjct: 387 KDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSSKQLGRFEEW 435 >ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] Length = 439 Score = 549 bits (1415), Expect = e-153 Identities = 262/411 (63%), Positives = 312/411 (75%), Gaps = 8/411 (1%) Frame = +3 Query: 75 SDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHHE 254 S+LF+ WC+++GK Y SEEE+ R +F+DN F+T HN + N+TY+L+LNAFADLTHHE Sbjct: 29 SELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88 Query: 255 FKSSRFGLSLAASNLDR-SNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 431 FK+SR GLS++AS+L S Q G + VP S+DWRKKGAVT VKDQ SCGACW+FS Sbjct: 89 FKASRLGLSVSASSLIMASKGQSLGGNA--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146 Query: 432 ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 611 ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206 Query: 612 YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSK- 785 YQ + +C K+KLK+ VVTID Y + S +E+ + +AVA+QPVSVGICGSERAFQ YS+ Sbjct: 207 YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSRV 266 Query: 786 -GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICG 962 GIF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNTG+ GICG Sbjct: 267 SGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGICG 326 Query: 963 INTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAV 1136 IN LA KC+L TYC AGETCCC R L G+CF WKCCE+ SAV Sbjct: 327 INMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGETCCCARNLFGLCFSWKCCEIESAV 386 Query: 1137 CCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGS--FGKLGGW 1283 CC D R CCPHDYPVCDT LC K T N T +K +K S G+ GW Sbjct: 387 CCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKDSSNKLGRFEGW 437 >ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum] Length = 436 Score = 548 bits (1412), Expect = e-153 Identities = 265/412 (64%), Positives = 305/412 (74%), Gaps = 6/412 (1%) Frame = +3 Query: 72 TSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHH 251 TS LF WC+Q+GK Y SE+EK YRF+VFEDN AF+ HN + NS+YTL+LNAFADLTHH Sbjct: 26 TSKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIGNSSYTLSLNAFADLTHH 85 Query: 252 EFKSSRFGL---SLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 422 EFK++R GL SL +R Q F VPS IDWRK GAV+ VKDQ SCGACW Sbjct: 86 EFKATRLGLPPSSLLRFKFNRFQDQQRSDD-FLQVPSEIDWRKNGAVSIVKDQGSCGACW 144 Query: 423 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 602 +FSATGAIEGIN+IV+GSLVSLSEQELVDCD TYNSGC GGLMDYA+QF++DN GIDTE+ Sbjct: 145 SFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDGGLMDYAYQFIIDNNGIDTEE 204 Query: 603 DYPYQAKGTSCNKNKLKRHVVTIDGYTDL-PSKEEEIMQAVASQPVSVGICGSERAFQSY 779 DYPYQA+ C K+KLKR VVTIDGYTD+ P+ E+++++AVA QPVSVGICGS RAFQ Y Sbjct: 205 DYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSARAFQLY 264 Query: 780 SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 959 SKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RNT G+C Sbjct: 265 SKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMLRNTDSSAGLC 324 Query: 960 GINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1133 GIN LA KC+L TYC GETCCC ++ LGICF WKCC V SA Sbjct: 325 GINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAKKFLGICFSWKCCGVTSA 384 Query: 1134 VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGGWNS 1289 VCCKD R CCP DYPVCD N C K N T++ + K+ F + W S Sbjct: 385 VCCKDKRHCCPLDYPVCDASNGQCLKRIANGTILMTSD-KEDPFHQTRDWRS 435 >ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prunus persica] gi|462420299|gb|EMJ24562.1| hypothetical protein PRUPE_ppa005615mg [Prunus persica] Length = 451 Score = 547 bits (1410), Expect = e-153 Identities = 268/429 (62%), Positives = 318/429 (74%), Gaps = 13/429 (3%) Frame = +3 Query: 63 SSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 242 S TS+LF WC+Q GK+YSS +EKLYR SVFEDNLAF+T HN+M NS+YTL+LN F+DL Sbjct: 26 SQTTSELFEVWCKQYGKSYSSAQEKLYRLSVFEDNLAFVTQHNDMGNSSYTLSLNDFSDL 85 Query: 243 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 422 THHEFKSSR G S + +L + + SV RD+PSS+DWRKKGAVT VKDQ SCGACW Sbjct: 86 THHEFKSSRLGFSPSFLSLKLKSDRKP--SVVRDLPSSLDWRKKGAVTNVKDQGSCGACW 143 Query: 423 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTY-NSGCGGGLMDYAFQFVVDNKGIDTE 599 AFS TGAIEGIN+IV+GSL+SLSEQELVDCD+ Y N+GC GGLMD AF+FV+DN GIDTE Sbjct: 144 AFSTTGAIEGINKIVTGSLISLSEQELVDCDRVYPNNGCNGGLMDDAFRFVIDNNGIDTE 203 Query: 600 KDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQS 776 +DYPY+ +C K KLKR+ VTID YTD+PS +EE ++QAVASQPVSVGI GS+ FQ Sbjct: 204 EDYPYKGWDDTCIKKKLKRNAVTIDDYTDVPSNDEEQLLQAVASQPVSVGISGSDMGFQL 263 Query: 777 YSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGI 956 YSKGIFNG CSTSLDHAVLIVGYGSENGVDYWI+KNSWG WGM+GYMHM R+ + GI Sbjct: 264 YSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGMNGYMHMLRDHSNPKGI 323 Query: 957 CGINTLA-XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1133 CGINTLA +C + T+C AGETCCC +R++GICF W+CCE++SA Sbjct: 324 CGINTLASYPIKTGENPPLPPPGPTRCDIFTHCAAGETCCCAKRVVGICFSWRCCELDSA 383 Query: 1134 VCCKDHRFCCPHDYPVCDTKNKLCFKG---------TVNSTMVKGLERKKGSFGKLG-GW 1283 VCCKD R CCP DYP+CDT+ LC + + K LE +GS K G GW Sbjct: 384 VCCKDQRHCCPRDYPICDTERTLCLQSNEQLSTQSHATGNLTSKALE-SRGSLRKSGRGW 442 Query: 1284 NSLFEAWNL 1310 S+ W L Sbjct: 443 GSMIRDWIL 451 >ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [Fragaria vesca subsp. vesca] Length = 441 Score = 544 bits (1401), Expect = e-152 Identities = 264/419 (63%), Positives = 313/419 (74%), Gaps = 12/419 (2%) Frame = +3 Query: 63 SSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 242 SS +S+LF +WC+Q GK+YSS+EEKLYR S+FE NLAFIT HN++ NS+YTL+LN+F+DL Sbjct: 25 SSSSSELFEAWCKQYGKSYSSQEEKLYRLSLFEQNLAFITQHNDLGNSSYTLSLNSFSDL 84 Query: 243 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 422 THHEFK+SR G S L R + SV R VPSSIDWRK GAVT VKDQ SCGACW Sbjct: 85 THHEFKASRLGFSPTFLRLYRKSDPKP--SVVRHVPSSIDWRKNGAVTNVKDQGSCGACW 142 Query: 423 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTY-NSGCGGGLMDYAFQFVVDNKGIDTE 599 +FSATGAIEGIN+IV+GSLVSLSEQEL+DCD+ Y NSGC GGLMD AFQF++DN GIDTE Sbjct: 143 SFSATGAIEGINKIVTGSLVSLSEQELIDCDRVYPNSGCNGGLMDDAFQFIIDNNGIDTE 202 Query: 600 KDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSK-EEEIMQAVASQPVSVGICGSERAFQS 776 +DYPYQ +CNK KLKRHVVTIDGYTD+P+ EE++++AVA+QPVSVGI GS R FQ Sbjct: 203 EDYPYQGADGTCNKQKLKRHVVTIDGYTDVPANNEEQLLKAVATQPVSVGIAGSGREFQF 262 Query: 777 YSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGI 956 YSKGIF G CST+LDHAVLIVGYGSENGVDYWI+KNSWGK+WGM+GY+H+ R+ + G+ Sbjct: 263 YSKGIFAGPCSTTLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGYIHILRDHSNSKGL 322 Query: 957 CGINTLA-----XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCE 1121 CGIN LA KC L + CG GETCCC R++LGIC W+CCE Sbjct: 323 CGINMLASYPTKTGENPPFPPPSPPPGPTKCDLFSKCGVGETCCCARKILGICLSWRCCE 382 Query: 1122 VNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTM----VKGLERKKG-SFGKLGGW 1283 SAVCCKD CCPHDYP+CDT+ C + N TM ++G RK S KL W Sbjct: 383 FTSAVCCKDRLHCCPHDYPICDTERNYCLQANGNLTMRANEIRGSLRKSSRSKAKLSYW 441