BLASTX nr result
ID: Sinomenium21_contig00000835
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00000835
         (1926 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                      Score     E
Sequences producing significant alignments:                           (bits)  Value

ref|XP_006473275.1| PREDICTED: cysteine proteinase RD21a-like [C...    693   0.0
ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [V...    688   0.0
emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]    687   0.0
ref|XP_006374886.1| cysteine proteinase [Populus trichocarpa] gi...    684   0.0
gb|AEZ65083.1| cysteine protease [Carica papaya]                       681   0.0
gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]             681   0.0
gb|AFO83614.1| papain-like cysteine protease [Fagopyrum esculentum]    677   0.0
ref|XP_007017153.1| Granulin repeat cysteine protease family pro...    677   0.0
ref|XP_002510170.1| cysteine protease, putative [Ricinus communi...    675   0.0
gb|ABG33750.1| cysteine protease [Hevea brasiliensis]                  665   0.0
gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]                 664   0.0
gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]               662   0.0
ref|XP_007160879.1| hypothetical protein PHAVU_001G024500g [Phas...    660   0.0
gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]                 660   0.0
emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]     660   0.0
ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [G...    656   0.0
gb|EXB56503.1| Cysteine proteinase RD21a [Morus notabilis]             655   0.0
ref|XP_004291075.1| PREDICTED: cysteine proteinase RD21a-like [F...    655   0.0
ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula] gi...
655 0.0 dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus] 654 0.0 >ref|XP_006473275.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis] Length = 468 Score = 693 bits (1789), Expect = 0.0 Identities = 324/436 (74%), Positives = 357/436 (81%), Gaps = 7/436 (1%) Frame = +2 Query: 125 DMSIIN------QAEESWRTDEEVKNLYESWLVKHGKAYNALGEKEKRLEIFKDNLRFID 286 DMSI++ SWRTD+EV ++E+WLVKHGKAYNALGEKEKR EIFK+NLRFID Sbjct: 24 DMSIVSYDKTHESKSSSWRTDDEVMAMFEAWLVKHGKAYNALGEKEKRFEIFKENLRFID 83 Query: 287 EHNAGNHTFTVGLNRFADLTNEEYRKMYLGVRTDAHRRFSRPK-SNRYALRAGDDLPDSI 463 EHN+ N T+ VGLNRFADLTNEEYR M+LG R A R + K SNRY R G+ LPDS+ Sbjct: 84 EHNSENRTYKVGLNRFADLTNEEYRSMHLGARFGAKRTSLKSKRSNRYLPRVGEALPDSV 143 Query: 464 DWRKKGAVVAIKDQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEGCN 643 DWRKKGAV +KDQG CGSCWAFSTIAAVEGIN+IVTG L+SLSEQELVDCDTSYNEGCN Sbjct: 144 DWRKKGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGSLISLSEQELVDCDTSYNEGCN 203 Query: 644 GGLMDYAFEFIIKNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKALQK 823 GGLMDYAFEFII NGGIDTE+DYPYK+ DG CD YRKNA+VVTID YEDVP+NDEKALQK Sbjct: 204 GGLMDYAFEFIIDNGGIDTEEDYPYKAIDGSCDTYRKNAKVVTIDDYEDVPLNDEKALQK 263 Query: 824 AVANQPVSVAIEGGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSWGA 1003 AVANQPVSVAIEGGG AFQLY SGIFTGRCGT+LDHGV AVGYGTENG DYWIVKNSWG+ Sbjct: 264 AVANQPVSVAIEGGGMAFQLYESGIFTGRCGTSLDHGVTAVGYGTENGADYWIVKNSWGS 323 Query: 1004 SWGEDGYIRMERNLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDHYA 1183 SWGE GYIRMERN+A TGKCGIAM+ASYPI CD++Y+ Sbjct: 324 SWGEAGYIRMERNVAGTLTGKCGIAMEASYPIKKGQNPPNPGPSPPSPTKPPAVCDNYYS 383 Query: 1184 CPESTTCCCVYQYGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNNPL 1363 CPES TCCCV++YG+SC WGCCPLE ATCCDDHYSCCPH+YPICNV GTCLMS +NPL Sbjct: 384 CPESNTCCCVFEYGNSCFAWGCCPLEAATCCDDHYSCCPHDYPICNVRAGTCLMSKDNPL 443 Query: 1364 GVKALLRTPAKPRWGH 1411 GV+AL RTPAKP W H Sbjct: 444 GVRALRRTPAKPYWAH 459 >ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera] Length = 467 Score = 688 bits (1776), Expect = 0.0 Identities = 320/434 (73%), Positives 
= 355/434 (81%), Gaps = 5/434 (1%) Frame = +2 Query: 125 DMSIINQAE-----ESWRTDEEVKNLYESWLVKHGKAYNALGEKEKRLEIFKDNLRFIDE 289 DMSII E SWRTDE+V +YE+WL KHGK+YNALGEKE+R +IFKDNLRFIDE Sbjct: 25 DMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDE 84 Query: 290 HNAGNHTFTVGLNRFADLTNEEYRKMYLGVRTDAHRRFSRPKSNRYALRAGDDLPDSIDW 469 HNA N T+ VGLNRFADLTNEEYR MYLG RT A RR S S+RYA R GD LP+S+DW Sbjct: 85 HNAENRTYKVGLNRFADLTNEEYRSMYLGTRTAAKRRSSNKISDRYAFRVGDSLPESVDW 144 Query: 470 RKKGAVVAIKDQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEGCNGG 649 RKKGAVV +KDQG CGSCWAFSTIAAVEGIN+IVTG L+SLSEQELVDCDTSYNEGCNGG Sbjct: 145 RKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGG 204 Query: 650 LMDYAFEFIIKNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKALQKAV 829 LMDYAFEFII NGGID+E+DYPYK+ DG CDQYRKNA+VVTID YEDVP NDEK+L+KAV Sbjct: 205 LMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAV 264 Query: 830 ANQPVSVAIEGGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSWGASW 1009 ANQPVSVAIE GGR FQLY SGIFTGRCGTALDHGV AVGYGTENG DYWIVKNSWGASW Sbjct: 265 ANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDHGVTAVGYGTENGVDYWIVKNSWGASW 324 Query: 1010 GEDGYIRMERNLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDHYACP 1189 GE+GYIRMER+LA++ TGKCGIAM+ASYPI CD++YACP Sbjct: 325 GEEGYIRMERDLATSATGKCGIAMEASYPIKKGQNPPNPGPSPPSPIKPPTVCDNYYACP 384 Query: 1190 ESTTCCCVYQYGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNNPLGV 1369 ES+TCCC+++Y C WGCCPLE ATCC+DH SCCP YP+CNV GTC+MS +NPLGV Sbjct: 385 ESSTCCCIFEYAKYCFQWGCCPLEAATCCEDHDSCCPQEYPVCNVRAGTCMMSKDNPLGV 444 Query: 1370 KALLRTPAKPRWGH 1411 KAL RT AKP W + Sbjct: 445 KALKRTAAKPHWAY 458 >emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera] Length = 469 Score = 687 bits (1773), Expect = 0.0 Identities = 320/434 (73%), Positives = 354/434 (81%), Gaps = 5/434 (1%) Frame = +2 Query: 125 DMSIINQAE-----ESWRTDEEVKNLYESWLVKHGKAYNALGEKEKRLEIFKDNLRFIDE 289 DMSII E SWRTDE+V +YE+WL KHGK+YNALGEKE+R +IFKDNLRFIDE Sbjct: 27 DMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDE 86 Query: 290 
HNAGNHTFTVGLNRFADLTNEEYRKMYLGVRTDAHRRFSRPKSNRYALRAGDDLPDSIDW 469 HNA N T+ VGLNRFADLTNEEYR MYLG RT A RR S S+RYA R GD LP+S+DW Sbjct: 87 HNAENRTYKVGLNRFADLTNEEYRSMYLGTRTAAKRRSSNKISDRYAFRVGDSLPESVDW 146 Query: 470 RKKGAVVAIKDQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEGCNGG 649 RKKGAVV +KDQG CGSCWAFSTIAAVEGIN+IVTG L+SLSEQELVDCDTSYNEGCNGG Sbjct: 147 RKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGG 206 Query: 650 LMDYAFEFIIKNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKALQKAV 829 LMDYAFEFII NGGID+E+DYPYK+ DG CDQYRKNA VVTID YEDVP NDEK+L+KAV Sbjct: 207 LMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAXVVTIDGYEDVPENDEKSLEKAV 266 Query: 830 ANQPVSVAIEGGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSWGASW 1009 ANQPVSVAIE GGR FQLY SGIFTGRCGTALDHGV AVGYGTENG DYWIVKNSWGASW Sbjct: 267 ANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDHGVTAVGYGTENGVDYWIVKNSWGASW 326 Query: 1010 GEDGYIRMERNLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDHYACP 1189 GE+GYIRMER+LA++ TGKCGIAM+ASYPI CD++YACP Sbjct: 327 GEEGYIRMERDLATSATGKCGIAMEASYPIKKGQNPPNPGPSPPSPIKPPTVCDNYYACP 386 Query: 1190 ESTTCCCVYQYGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNNPLGV 1369 ES+TCCC+++Y C WGCCPLE ATCC+DH SCCP YP+CNV GTC+MS +NPLGV Sbjct: 387 ESSTCCCIFEYAKYCFQWGCCPLEAATCCEDHDSCCPQEYPVCNVRAGTCMMSKDNPLGV 446 Query: 1370 KALLRTPAKPRWGH 1411 KAL RT AKP W + Sbjct: 447 KALKRTAAKPHWAY 460 >ref|XP_006374886.1| cysteine proteinase [Populus trichocarpa] gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa] gi|550323195|gb|ERP52683.1| cysteine proteinase [Populus trichocarpa] Length = 465 Score = 684 bits (1766), Expect = 0.0 Identities = 317/434 (73%), Positives = 354/434 (81%), Gaps = 5/434 (1%) Frame = +2 Query: 125 DMSIINQ-----AEESWRTDEEVKNLYESWLVKHGKAYNALGEKEKRLEIFKDNLRFIDE 289 DMSII+ + SWRTD+EV +YE WLVKHGK YNALGEKEKR EIFKDNL FID+ Sbjct: 25 DMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQ 84 Query: 290 HNAGNHTFTVGLNRFADLTNEEYRKMYLGVRTDAHRRFSRPKSNRYALRAGDDLPDSIDW 469 HN+ N T+TVGLNRFADLTNEE+R MYLG RT +R + S+RYA R GD LPDS+DW Sbjct: 85 
HNSENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKT-SDRYAPRVGDSLPDSVDW 143 Query: 470 RKKGAVVAIKDQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEGCNGG 649 RK+GAV +KDQGGCGSCWAFSTIAAVEGIN+IVTGDL++LSEQELVDCDTSYNEGCNGG Sbjct: 144 RKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGG 203 Query: 650 LMDYAFEFIIKNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKALQKAV 829 LMDYAFEFII NGGIDTEDDYPY RDG CD YRKNA+VV+IDSYEDVP NDE AL+KAV Sbjct: 204 LMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYEDVPENDETALKKAV 263 Query: 830 ANQPVSVAIEGGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSWGASW 1009 ANQPVSVAIEGGGR FQLY SG+FTG CGT+LDHGVAAVGYGTE GKDYWIV+NSWG SW Sbjct: 264 ANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYGTEKGKDYWIVRNSWGKSW 323 Query: 1010 GEDGYIRMERNLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDHYACP 1189 GE GYIRMERN+AS TGKCGIA++ SYPI CD++++CP Sbjct: 324 GESGYIRMERNIAS-PTGKCGIAIEPSYPIKKGQNPPNPGPSPPSPVKPPSVCDNYFSCP 382 Query: 1190 ESTTCCCVYQYGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNNPLGV 1369 +S+TCCC+++YG C WGCCPLEGATCCDDHYSCCPH YP+CNV EGTCL+S NP GV Sbjct: 383 DSSTCCCIFEYGKYCFAWGCCPLEGATCCDDHYSCCPHEYPVCNVNEGTCLISKGNPFGV 442 Query: 1370 KALLRTPAKPRWGH 1411 KAL RTPAKP W H Sbjct: 443 KALRRTPAKPHWAH 456 >gb|AEZ65083.1| cysteine protease [Carica papaya] Length = 467 Score = 681 bits (1756), Expect = 0.0 Identities = 318/435 (73%), Positives = 358/435 (82%), Gaps = 8/435 (1%) Frame = +2 Query: 125 DMSIINQAEE-----SWRTDEEVKNLYESWLVKHGKAYNALGEKEKRLEIFKDNLRFIDE 289 DMSI++ + SWRTD+EV +YE+WLVKHGKAYNALGEKEKR IFKDNLRFIDE Sbjct: 23 DMSIVSYDQRHADKSSWRTDDEVMAMYEAWLVKHGKAYNALGEKEKRFGIFKDNLRFIDE 82 Query: 290 HNAGNHTFTVGLNRFADLTNEEYRKMYLGVR---TDAHRRFSRPKSNRYALRAGDDLPDS 460 HN+ N T+ +GLNRFADLTNEEYR MYLGV+ T R+ SR KS+R+A R GD LPD Sbjct: 83 HNSQNLTYRLGLNRFADLTNEEYRSMYLGVKPGATRVTRKVSR-KSDRFAARVGDALPDF 141 Query: 461 IDWRKKGAVVAIKDQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEGC 640 IDWRK+GAVV +KDQG CGSCWAFSTIAAVEGINQIVTGDL+SLSEQELVDCDTSYNEGC Sbjct: 142 IDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGC 201 Query: 
641 NGGLMDYAFEFIIKNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKALQ 820 NGGLMDYAFEFII NGGID+E+DYPY++ D CDQYRKNA VV+ID YEDVP NDE AL+ Sbjct: 202 NGGLMDYAFEFIINNGGIDSEEDYPYRAADQKCDQYRKNANVVSIDGYEDVPENDEAALK 261 Query: 821 KAVANQPVSVAIEGGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSWG 1000 KAVA QPVSVAIE GGRAFQLY SG+FTG+CGT+LDHGVAAVGYGTENG+DYWIV NSWG Sbjct: 262 KAVAKQPVSVAIEAGGRAFQLYQSGVFTGKCGTSLDHGVAAVGYGTENGQDYWIVGNSWG 321 Query: 1001 ASWGEDGYIRMERNLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDHY 1180 +WGEDGYIRMERNLA + +GKCGIA+ SYPI CD++Y Sbjct: 322 KNWGEDGYIRMERNLAGSSSGKCGIAIGPSYPIKNGPNPPNPGPSPPSPVQPPTVCDNYY 381 Query: 1181 ACPESTTCCCVYQYGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNNP 1360 +CPE TTCCC+Y+YG C WGCCPLEGATCC+DHYSCCPH+YPICNV++GTCLMS NNP Sbjct: 382 SCPERTTCCCIYEYGKYCFAWGCCPLEGATCCEDHYSCCPHDYPICNVKDGTCLMSKNNP 441 Query: 1361 LGVKALLRTPAKPRW 1405 LGVKA+ RTPAKP W Sbjct: 442 LGVKAIRRTPAKPYW 456 >gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa] Length = 461 Score = 681 bits (1756), Expect = 0.0 Identities = 319/428 (74%), Positives = 352/428 (82%), Gaps = 1/428 (0%) Frame = +2 Query: 125 DMSIINQAEESWRTDEEVKNLYESWLVKHGKAYNALGEKEKRLEIFKDNLRFIDEHNAGN 304 DMSII + S RTD+EV +YESWLVKHGK+YNA+GEKEKR +IFKDNLRFIDEHNA + Sbjct: 26 DMSIIGELSSS-RTDDEVMAMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAES 84 Query: 305 HTFTVGLNRFADLTNEEYRKMYLGVRTDAHRRFSRPK-SNRYALRAGDDLPDSIDWRKKG 481 T+ VGLNRFADLTN+EYR MYLG RT + RR S K S+RY AG+ LPDS+DWR+KG Sbjct: 85 RTYKVGLNRFADLTNDEYRSMYLGARTGSRRRLSTQKRSDRYVPVAGESLPDSVDWREKG 144 Query: 482 AVVAIKDQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEGCNGGLMDY 661 AVV +KDQG CGSCWAFSTIAAVEGINQIVTGDL+SLSEQELVDCDTSYNEGCNGGLMDY Sbjct: 145 AVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDY 204 Query: 662 AFEFIIKNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKALQKAVANQP 841 AFEFIIKNGGIDTE+DYPY +RDG CDQYRKNA+VVTID YEDVPVN+E+ALQKAVANQP Sbjct: 205 AFEFIIKNGGIDTEEDYPYNARDGRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQP 264 Query: 842 
VSVAIEGGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSWGASWGEDG 1021 VSVAIE G AFQ Y SG+FTG CGTALDHGV AVGYGTEN DYWIVKNSWG+SWGE G Sbjct: 265 VSVAIEASGMAFQFYESGVFTGNCGTALDHGVTAVGYGTENSVDYWIVKNSWGSSWGESG 324 Query: 1022 YIRMERNLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDHYACPESTT 1201 YIRMERN + TGKCGIA++ SYPI CDD+Y CPES+T Sbjct: 325 YIRMERN--TGATGKCGIAVEPSYPIKTSQNPPNPGPSPPSPIKPPTVCDDYYTCPESST 382 Query: 1202 CCCVYQYGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNNPLGVKALL 1381 CCCVY+YG C WGCCPLEGATCCDDHYSCCPH+YPICNV GTCLMS +NPLGVKA+ Sbjct: 383 CCCVYEYGKYCFAWGCCPLEGATCCDDHYSCCPHDYPICNVYAGTCLMSKDNPLGVKAMK 442 Query: 1382 RTPAKPRW 1405 R AKP+W Sbjct: 443 RIQAKPQW 450 >gb|AFO83614.1| papain-like cysteine protease [Fagopyrum esculentum] Length = 468 Score = 677 bits (1747), Expect = 0.0 Identities = 304/422 (72%), Positives = 354/422 (83%) Frame = +2 Query: 140 NQAEESWRTDEEVKNLYESWLVKHGKAYNALGEKEKRLEIFKDNLRFIDEHNAGNHTFTV 319 + + + R+D+EV +++ESWLV HGK YNALGEKEKR ++FKDNLRFIDEHN+ ++ + Sbjct: 41 SSSSPNMRSDDEVMSMFESWLVHHGKNYNALGEKEKRFQVFKDNLRFIDEHNSEERSYKL 100 Query: 320 GLNRFADLTNEEYRKMYLGVRTDAHRRFSRPKSNRYALRAGDDLPDSIDWRKKGAVVAIK 499 GLN+FADL+NEEYR YLG +TDA RR S+ +S+RYA RAGD LP+S+DWRK+GAVV +K Sbjct: 101 GLNKFADLSNEEYRNKYLGAKTDARRRLSKKRSSRYAFRAGDSLPESVDWRKEGAVVDVK 160 Query: 500 DQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEGCNGGLMDYAFEFII 679 DQG CGSCWAFSTIAAVEG+N+IVTGDL+SLSEQELVDCDTSYNEGCNGGLMDYAFEFII Sbjct: 161 DQGSCGSCWAFSTIAAVEGVNKIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFII 220 Query: 680 KNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKALQKAVANQPVSVAIE 859 KNGGID+EDDYPYK+ DG CD YRKNA+VVTIDSYEDVPVNDEK+LQKAVA+QP+SVAIE Sbjct: 221 KNGGIDSEDDYPYKAVDGRCDVYRKNAKVVTIDSYEDVPVNDEKSLQKAVASQPISVAIE 280 Query: 860 GGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSWGASWGEDGYIRMER 1039 GGR+FQ Y SGIF+G CGT+LDHGVAAVGYG+E+GKDYWIV+NSWG SWGEDGYIRMER Sbjct: 281 AGGRSFQFYESGIFSGTCGTSLDHGVAAVGYGSEDGKDYWIVRNSWGLSWGEDGYIRMER 340 Query: 1040 NLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDHYACPESTTCCCVYQ 1219 NLA 
GKCGIAM+ASYPI CD++Y+CP+S TCCC+Y+ Sbjct: 341 NLAGTANGKCGIAMEASYPIKKGQNPPNPGPSPPSPIKPPAVCDNYYSCPDSNTCCCLYE 400 Query: 1220 YGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNNPLGVKALLRTPAKP 1399 YG C WGCCPLEGATCCDD+YSCCP +YP+CNV +GTCLMS NNPL VKAL RTPA P Sbjct: 401 YGKYCFSWGCCPLEGATCCDDNYSCCPSDYPVCNVNQGTCLMSKNNPLSVKALKRTPAVP 460 Query: 1400 RW 1405 W Sbjct: 461 NW 462 >ref|XP_007017153.1| Granulin repeat cysteine protease family protein [Theobroma cacao] gi|508722481|gb|EOY14378.1| Granulin repeat cysteine protease family protein [Theobroma cacao] Length = 466 Score = 677 bits (1746), Expect = 0.0 Identities = 320/443 (72%), Positives = 355/443 (80%), Gaps = 8/443 (1%) Frame = +2 Query: 125 DMSIIN-------QAEESWRTDEEVKNLYESWLVKHGKAYNALGEKEKRLEIFKDNLRFI 283 DMSII+ +++ WRTD+EV +YE WLVKHGKAYN LGEKE+R EIFKDNLRFI Sbjct: 23 DMSIISYDEGHPDKSKSIWRTDDEVMAMYEEWLVKHGKAYNGLGEKERRFEIFKDNLRFI 82 Query: 284 DEHNAGN-HTFTVGLNRFADLTNEEYRKMYLGVRTDAHRRFSRPKSNRYALRAGDDLPDS 460 DEHNA + H+F VGLNRFADLTNEEYR MYLG + R+ S+ +S+RYA G++LPDS Sbjct: 83 DEHNADDSHSFKVGLNRFADLTNEEYRAMYLGTKKP-ERKVSK-RSDRYAPSLGEELPDS 140 Query: 461 IDWRKKGAVVAIKDQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEGC 640 IDWR+KGAV A+KDQGGCGSCWAFS IAAVEGIN+IVTGDL+ LSEQELVDCDT+YNEGC Sbjct: 141 IDWREKGAVAAVKDQGGCGSCWAFSAIAAVEGINKIVTGDLIVLSEQELVDCDTTYNEGC 200 Query: 641 NGGLMDYAFEFIIKNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKALQ 820 NGGLMDYAFEFII NGGIDTE+DYPY RDG CD YRKNARVV+ID+YEDVPVNDE AL+ Sbjct: 201 NGGLMDYAFEFIINNGGIDTEEDYPYTGRDGTCDPYRKNARVVSIDAYEDVPVNDETALK 260 Query: 821 KAVANQPVSVAIEGGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSWG 1000 KAVANQPVSVAIE GGRAFQLY SGIF G+CGT LDHGV AVGYGTE GKDYWIVKNSWG Sbjct: 261 KAVANQPVSVAIEAGGRAFQLYQSGIFDGKCGTQLDHGVTAVGYGTEKGKDYWIVKNSWG 320 Query: 1001 ASWGEDGYIRMERNLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDHY 1180 +SWGE+GYIRM RN A++ TGKCGIA++ASYPI CD +Y Sbjct: 321 SSWGEEGYIRMARNEANSVTGKCGIAIEASYPIKKGQNPPNPGPSPPSPIKPPTVCDSYY 380 Query: 1181 ACPESTTCCCVYQYGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNNP 
1360 CPES TCCCVY+Y C WGCCPLE ATCCDDHYSCCPH YPICN+ EGTCLMS NP Sbjct: 381 TCPESNTCCCVYEYYGYCFAWGCCPLEAATCCDDHYSCCPHEYPICNINEGTCLMSKGNP 440 Query: 1361 LGVKALLRTPAKPRWGHKSAEAK 1429 LGVKAL RTPAKP W H S K Sbjct: 441 LGVKALRRTPAKPFWAHGSVGKK 463 >ref|XP_002510170.1| cysteine protease, putative [Ricinus communis] gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis] Length = 469 Score = 675 bits (1742), Expect = 0.0 Identities = 311/437 (71%), Positives = 354/437 (81%), Gaps = 8/437 (1%) Frame = +2 Query: 125 DMSIINQ-----AEESWRTDEEVKNLYESWLVKHGKAY---NALGEKEKRLEIFKDNLRF 280 DMSI++ + SWRTD+EV +YE WLVK+GKA+ NALGEKE+R ++FKDNLRF Sbjct: 25 DMSIVSYDQTHLTKSSWRTDDEVMAIYEEWLVKNGKAHSNNNALGEKERRFQVFKDNLRF 84 Query: 281 IDEHNAGNHTFTVGLNRFADLTNEEYRKMYLGVRTDAHRRFSRPKSNRYALRAGDDLPDS 460 IDEHN+ N ++ VGLNRFADLTNEEYR MYLG R+ A R SNRY R GD LPDS Sbjct: 85 IDEHNSENRSYKVGLNRFADLTNEEYRSMYLGARSGAKRNRLSRSSNRYLPRVGDSLPDS 144 Query: 461 IDWRKKGAVVAIKDQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEGC 640 +DWRK+GAV +KDQG CGSCWAFSTIAAVEGIN+IVTGDL+SLSEQELVDCD SYNEGC Sbjct: 145 VDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDRSYNEGC 204 Query: 641 NGGLMDYAFEFIIKNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKALQ 820 NGGLMDYAF+FII NGGID+E+DYPY +RDG CD YRKNA+VVTID+YEDVPVNDEKALQ Sbjct: 205 NGGLMDYAFQFIINNGGIDSEEDYPYLARDGTCDTYRKNAKVVTIDNYEDVPVNDEKALQ 264 Query: 821 KAVANQPVSVAIEGGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSWG 1000 KAVANQPVSVAIE GGR FQ Y SGIFTGRCGTALDHGVAAVGYGTENGKDYWIV+NSWG Sbjct: 265 KAVANQPVSVAIEAGGREFQFYQSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVRNSWG 324 Query: 1001 ASWGEDGYIRMERNLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDHY 1180 SWGE GYIRMERN+A+A TGKCGIA++ SYPI CD ++ Sbjct: 325 KSWGESGYIRMERNIATA-TGKCGIAIEPSYPIKKGQNPPNPGPSPPSPIKPPSVCDSYF 383 Query: 1181 ACPESTTCCCVYQYGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNNP 1360 +CPESTTCCC+++Y C WGCCPLEGATCCDDHYSCCPH+YP+CN+ EGTCL+ +NP Sbjct: 384 SCPESTTCCCIFEYAKYCFEWGCCPLEGATCCDDHYSCCPHDYPVCNINEGTCLIGKDNP 443 Query: 1361 LGVKALLRTPAKPRWGH 
1411 GVKA+ RTPAKP W + Sbjct: 444 FGVKAMRRTPAKPHWAY 460 >gb|ABG33750.1| cysteine protease [Hevea brasiliensis] Length = 457 Score = 665 bits (1716), Expect = 0.0 Identities = 310/444 (69%), Positives = 353/444 (79%), Gaps = 5/444 (1%) Frame = +2 Query: 125 DMSIINQ-----AEESWRTDEEVKNLYESWLVKHGKAYNALGEKEKRLEIFKDNLRFIDE 289 D+SII+ + SWRTD+EV +YE WLVKHGKAYN+LGEKE+R E+FKDNLRFIDE Sbjct: 16 DLSIISYDQSHGTKSSWRTDDEVMAIYEDWLVKHGKAYNSLGEKERRFEVFKDNLRFIDE 75 Query: 290 HNAGNHTFTVGLNRFADLTNEEYRKMYLGVRTDAHRRFSRPKSNRYALRAGDDLPDSIDW 469 HN+ N T+ VGLNRFADLTNEEYR MYLG + R R S+RY R GD LPDS+DW Sbjct: 76 HNSENRTYRVGLNRFADLTNEEYRSMYLGALSGIRRNKLRKISDRYTPRVGDSLPDSVDW 135 Query: 470 RKKGAVVAIKDQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEGCNGG 649 RK+GAVV +KDQG CGSCWAFS +AAVEGIN+IVTGDL+SLSEQELVDCD SYNEGCNGG Sbjct: 136 RKEGAVVGVKDQGSCGSCWAFSAVAAVEGINKIVTGDLISLSEQELVDCDNSYNEGCNGG 195 Query: 650 LMDYAFEFIIKNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKALQKAV 829 LMDY FEFII NGGID+E+DYPY +RDG CD YRKNARVV+IDSYEDVPVN+E ALQKAV Sbjct: 196 LMDYGFEFIINNGGIDSEEDYPYLARDGRCDTYRKNARVVSIDSYEDVPVNNEAALQKAV 255 Query: 830 ANQPVSVAIEGGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSWGASW 1009 ANQPVSVAIE GGR FQLY SG+F+GRCGTALDHGV AVGYGTENG+DYWIV+NSWG SW Sbjct: 256 ANQPVSVAIEAGGRDFQLYSSGVFSGRCGTALDHGVVAVGYGTENGQDYWIVRNSWGKSW 315 Query: 1010 GEDGYIRMERNLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDHYACP 1189 GE GY+RM RN+ TG CGIAM+ASYPI CD++++CP Sbjct: 316 GESGYLRMARNIRK-PTGICGIAMEASYPIKKGQNPPNPGPSPPSPVKPPSVCDNYFSCP 374 Query: 1190 ESTTCCCVYQYGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNNPLGV 1369 ES TCCC+++Y + C WGCCPLEGATCCDDHYSCCPH+YPICNV +GTCLMS +NPLGV Sbjct: 375 ESNTCCCIFEYANFCFEWGCCPLEGATCCDDHYSCCPHDYPICNVNQGTCLMSKDNPLGV 434 Query: 1370 KALLRTPAKPRWGHKSAEAKLISS 1441 KA+ RT AKP W AE K S+ Sbjct: 435 KAIRRTRAKPHWA-LGAEGKKSST 457 >gb|ABR19827.1| cysteine proteinase [Elaeis guineensis] Length = 470 Score = 664 bits (1712), Expect = 0.0 Identities = 309/448 (68%), Positives = 353/448 (78%), Gaps = 9/448 (2%) Frame = +2 Query: 
125 DMSIINQAEESW-----RTDEEVKNLYESWLVKHGKAYNALGEKEKRLEIFKDNLRFIDE 289 DMSII+ E R++EE++ LYE WL KHG+AYNALGEKE+R EIFKDN+ FID Sbjct: 24 DMSIISYDEAHGVRGLERSEEEMRILYEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDA 83 Query: 290 HNA----GNHTFTVGLNRFADLTNEEYRKMYLGVRTDAHRRFSRPKSNRYALRAGDDLPD 457 HNA G+ +F +GLNRFAD+TNEEYR +YLG R HRR +R S+RY AG+DLP+ Sbjct: 84 HNAAADAGHRSFRLGLNRFADMTNEEYRAVYLGTRPAGHRRRARVGSDRYRYNAGEDLPE 143 Query: 458 SIDWRKKGAVVAIKDQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEG 637 S+DWR KGAV A+KDQG CGSCWAFST+AAVEGIN+IVTGDL+SLSEQELVDCD YN+G Sbjct: 144 SVDWRAKGAVAAVKDQGSCGSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCDNGYNQG 203 Query: 638 CNGGLMDYAFEFIIKNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKAL 817 CNGGLMDY FEFII NGGIDTE+DYPY +RDG CDQYRKNA+VV+ID YEDVPVNDEKAL Sbjct: 204 CNGGLMDYGFEFIINNGGIDTEEDYPYTARDGKCDQYRKNAKVVSIDGYEDVPVNDEKAL 263 Query: 818 QKAVANQPVSVAIEGGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSW 997 QKAVANQPVSVAIE GGR FQLY SGIFTGRCGT LDHGV AVGYGTENGKDYWIV+NSW Sbjct: 264 QKAVANQPVSVAIEAGGREFQLYHSGIFTGRCGTDLDHGVVAVGYGTENGKDYWIVRNSW 323 Query: 998 GASWGEDGYIRMERNLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDH 1177 G WGE GYIRMERN+ + TGKCGIA++ SYP CD++ Sbjct: 324 GGDWGESGYIRMERNV-NTSTGKCGIAIEPSYPTKKGQNPPKPAPSPPSPVSPPTVCDNY 382 Query: 1178 YACPESTTCCCVYQYGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNN 1357 Y+CP STTCCCVY+YG C WGCCPLEGATCC+DHYSCCPH+YP+CNV+ GTC +S +N Sbjct: 383 YSCPSSTTCCCVYEYGRYCFAWGCCPLEGATCCEDHYSCCPHDYPVCNVKAGTCQLSKDN 442 Query: 1358 PLGVKALLRTPAKPRWGHKSAEAKLISS 1441 PLGVKAL RTPAKP W A K I++ Sbjct: 443 PLGVKALARTPAKPHWAFLGAGGKKINA 470 >gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta] Length = 467 Score = 662 bits (1707), Expect = 0.0 Identities = 305/440 (69%), Positives = 351/440 (79%), Gaps = 5/440 (1%) Frame = +2 Query: 125 DMSIINQ-----AEESWRTDEEVKNLYESWLVKHGKAYNALGEKEKRLEIFKDNLRFIDE 289 DMSII+ + SWRTD+EV +YE WLVK GK YNALGE+EKR ++FKDNLRFIDE Sbjct: 26 DMSIISYDQTHATKSSWRTDDEVMAIYEEWLVKQGKVYNALGEREKRFQVFKDNLRFIDE 85 Query: 290 
HNAGNHTFTVGLNRFADLTNEEYRKMYLGVRTDAHRRFSRPKSNRYALRAGDDLPDSIDW 469 HN+ N T+ +GLN FADLTNEEYR YLG R R R S+RYA R G+ LPDS+DW Sbjct: 86 HNSENRTYKLGLNGFADLTNEEYRSTYLGARGGMKRNRLRKTSDRYAPRVGESLPDSVDW 145 Query: 470 RKKGAVVAIKDQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEGCNGG 649 RK+GAV +KDQG CGSCWAFSTIAAVEGIN+IVTGDL+SLSEQELVDCDTSYNEGCNGG Sbjct: 146 RKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGG 205 Query: 650 LMDYAFEFIIKNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKALQKAV 829 LMDYAFEFII NGGIDTE+DYPY +RDG CD YRKNA+VVTID YEDVPVN E ALQKAV Sbjct: 206 LMDYAFEFIINNGGIDTEEDYPYLARDGRCDTYRKNAKVVTIDDYEDVPVNSETALQKAV 265 Query: 830 ANQPVSVAIEGGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSWGASW 1009 ANQPVSVAIE GGR FQ Y SGIF+GRCGT LDHGVAAVGYGTENGKDYWIV+NSWG SW Sbjct: 266 ANQPVSVAIEAGGRDFQFYASGIFSGRCGTQLDHGVAAVGYGTENGKDYWIVRNSWGKSW 325 Query: 1010 GEDGYIRMERNLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDHYACP 1189 GE+GY+RM R++ ++ TG CGIAM+ASYPI CD++Y+CP Sbjct: 326 GENGYLRMARSI-NSPTGICGIAMEASYPIKKGQNPPNPAPLPPSPVTPPTVCDNYYSCP 384 Query: 1190 ESTTCCCVYQYGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNNPLGV 1369 ++ TCCC+++YG+ C WGCCPLEGATCC+DHYSCCPH+YPICN+ +GTCLMS +NPL V Sbjct: 385 DNNTCCCLFEYGNFCFEWGCCPLEGATCCEDHYSCCPHDYPICNINQGTCLMSKDNPLAV 444 Query: 1370 KALLRTPAKPRWGHKSAEAK 1429 KA++R PAKP W +A K Sbjct: 445 KAMIRIPAKPHWALGAAAKK 464 >ref|XP_007160879.1| hypothetical protein PHAVU_001G024500g [Phaseolus vulgaris] gi|561034343|gb|ESW32873.1| hypothetical protein PHAVU_001G024500g [Phaseolus vulgaris] Length = 466 Score = 660 bits (1704), Expect = 0.0 Identities = 301/430 (70%), Positives = 347/430 (80%), Gaps = 5/430 (1%) Frame = +2 Query: 125 DMSIIN-----QAEESWRTDEEVKNLYESWLVKHGKAYNALGEKEKRLEIFKDNLRFIDE 289 DMSII+ Q + +WRTDEEV +LYE WLVKHGK YNALGEK+KR +IFKDNLRFID+ Sbjct: 25 DMSIISYDNAHQDKATWRTDEEVNSLYEEWLVKHGKLYNALGEKDKRFQIFKDNLRFIDQ 84 Query: 290 HNAGNHTFTVGLNRFADLTNEEYRKMYLGVRTDAHRRFSRPKSNRYALRAGDDLPDSIDW 469 NA N T+ +GLNRFADLTNEEYR YLG + D +RR R SNRYA R G+ LPDS+DW Sbjct: 85 
QNAENRTYKLGLNRFADLTNEEYRARYLGTKIDPNRRLGRTPSNRYAPRVGETLPDSVDW 144 Query: 470 RKKGAVVAIKDQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEGCNGG 649 RK+GAVV +KDQ CGSCWAFS I AVEGIN+IVTGDL+SLSEQELVDCDT YN GCNGG Sbjct: 145 RKEGAVVPVKDQASCGSCWAFSAIGAVEGINKIVTGDLISLSEQELVDCDTGYNMGCNGG 204 Query: 650 LMDYAFEFIIKNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKALQKAV 829 LMDYAFEFIIKNGGID+E+DYPYK DG CD+YRKNA+VV+ID YEDV DE AL+KAV Sbjct: 205 LMDYAFEFIIKNGGIDSEEDYPYKGVDGRCDEYRKNAKVVSIDGYEDVNTYDELALKKAV 264 Query: 830 ANQPVSVAIEGGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSWGASW 1009 ANQPVSVA+EGGGR FQLY SG+FTGRCGTALDHGV AVGYGT+NG D+WIV+NSWGA W Sbjct: 265 ANQPVSVAVEGGGREFQLYSSGVFTGRCGTALDHGVVAVGYGTDNGHDFWIVRNSWGADW 324 Query: 1010 GEDGYIRMERNLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDHYACP 1189 GE+GYIR+ERNL ++++GKCGIA++ SYPI CD++Y+C Sbjct: 325 GEEGYIRLERNLGNSRSGKCGIAIEPSYPIKTGQNPPNPGPSPPSPVKPPNVCDNYYSCS 384 Query: 1190 ESTTCCCVYQYGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNNPLGV 1369 +S TCCC++++G +C WGCCPLEGATCCDDHYSCCPH+YPICN GTCL S NNP GV Sbjct: 385 DSATCCCIFEFGKTCFEWGCCPLEGATCCDDHYSCCPHDYPICNTYAGTCLRSKNNPFGV 444 Query: 1370 KALLRTPAKP 1399 KAL RTPAKP Sbjct: 445 KALRRTPAKP 454 >gb|ABR19828.1| cysteine proteinase [Elaeis guineensis] Length = 469 Score = 660 bits (1704), Expect = 0.0 Identities = 313/444 (70%), Positives = 353/444 (79%), Gaps = 6/444 (1%) Frame = +2 Query: 128 MSIINQAEESWRTDEEVKNLYESWLVKHGKAYNALGEKEKRLEIFKDNLRFIDEHNA--- 298 MSI++ E R+D+EV LY++W +H ++YNAL E E+RLEIF+DNLRFID+HNA Sbjct: 30 MSILSYGE---RSDDEVHRLYQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAAN 86 Query: 299 -GNHTFTVGLNRFADLTNEEYRKMYLGVRT--DAHRRFSRPKSNRYALRAGDDLPDSIDW 469 G ++F +GL RFADLTNEEYR YLGVRT RR S SNRY R+ DDLPDSIDW Sbjct: 87 AGKYSFRLGLTRFADLTNEEYRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDDLPDSIDW 146 Query: 470 RKKGAVVAIKDQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEGCNGG 649 R KGAVV +KDQG CGSCWAFSTIAAVEGIN IVTGDL+SLSEQELVDCDT YN+GCNGG Sbjct: 147 RDKGAVVDVKDQGSCGSCWAFSTIAAVEGINHIVTGDLISLSEQELVDCDTYYNQGCNGG 206 Query: 650 
LMDYAFEFIIKNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKALQKAV 829 LMDYAFEFII NGGIDT++DYPY RDG CDQYRKNA VVTIDSYEDVP+NDEK+LQKAV Sbjct: 207 LMDYAFEFIISNGGIDTDEDYPYTGRDGSCDQYRKNAHVVTIDSYEDVPINDEKSLQKAV 266 Query: 830 ANQPVSVAIEGGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSWGASW 1009 ANQPVSVAIE GGRAFQLY SGIFTG CGT LDHGV A+GYG+ENGK YWIVKNSWG+ W Sbjct: 267 ANQPVSVAIEAGGRAFQLYESGIFTGYCGTELDHGVTAIGYGSENGKYYWIVKNSWGSDW 326 Query: 1010 GEDGYIRMERNLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDHYACP 1189 GE GYIRMERN+ SA TGKCGIAM+ASYPI CD +Y+CP Sbjct: 327 GESGYIRMERNINSA-TGKCGIAMEASYPIKNGQNPPNPGPSPPSPSKPPTVCDSYYSCP 385 Query: 1190 ESTTCCCVYQYGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNNPLGV 1369 ES TCCCVY++GS C WGCCPLEGATCC+DHYSCCPH+YPICNV+EGTCL+S NNPLGV Sbjct: 386 ESMTCCCVYEFGSYCFAWGCCPLEGATCCEDHYSCCPHDYPICNVQEGTCLVSKNNPLGV 445 Query: 1370 KALLRTPAKPRWGHKSAEAKLISS 1441 KA R PAKP W + A+ + S+ Sbjct: 446 KATKRIPAKPYWAYFGAQGERSSA 469 >emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris] Length = 455 Score = 660 bits (1704), Expect = 0.0 Identities = 301/430 (70%), Positives = 347/430 (80%), Gaps = 5/430 (1%) Frame = +2 Query: 125 DMSIIN-----QAEESWRTDEEVKNLYESWLVKHGKAYNALGEKEKRLEIFKDNLRFIDE 289 DMSII+ Q + +WRTDEEV +LYE WLVKHGK YNALGEK+KR +IFKDNLRFID+ Sbjct: 14 DMSIISYDNAHQDKATWRTDEEVNSLYEEWLVKHGKLYNALGEKDKRFQIFKDNLRFIDQ 73 Query: 290 HNAGNHTFTVGLNRFADLTNEEYRKMYLGVRTDAHRRFSRPKSNRYALRAGDDLPDSIDW 469 NA N T+ +GLNRFADLTNEEYR YLG + D +RR R SNRYA R G+ LPDS+DW Sbjct: 74 QNAENRTYKLGLNRFADLTNEEYRARYLGTKIDPNRRLGRTPSNRYAPRVGETLPDSVDW 133 Query: 470 RKKGAVVAIKDQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEGCNGG 649 RK+GAVV +KDQ CGSCWAFS I AVEGIN+IVTGDL+SLSEQELVDCDT YN GCNGG Sbjct: 134 RKEGAVVPVKDQASCGSCWAFSAIGAVEGINKIVTGDLISLSEQELVDCDTGYNMGCNGG 193 Query: 650 LMDYAFEFIIKNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKALQKAV 829 LMDYAFEFIIKNGGID+E+DYPYK DG CD+YRKNA+VV+ID YEDV DE AL+KAV Sbjct: 194 LMDYAFEFIIKNGGIDSEEDYPYKGVDGRCDEYRKNAKVVSIDGYEDVNTYDELALKKAV 253 Query: 830 
ANQPVSVAIEGGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSWGASW 1009 ANQPVSVA+EGGGR FQLY SG+FTGRCGTALDHGV AVGYGT+NG D+WIV+NSWGA W Sbjct: 254 ANQPVSVAVEGGGREFQLYSSGVFTGRCGTALDHGVVAVGYGTDNGHDFWIVRNSWGADW 313 Query: 1010 GEDGYIRMERNLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDHYACP 1189 GE+GYIR+ERNL ++++GKCGIA++ SYPI CD++Y+C Sbjct: 314 GEEGYIRLERNLGNSRSGKCGIAIEPSYPIKTGQNPPNPGPSPPSPVKPPNVCDNYYSCS 373 Query: 1190 ESTTCCCVYQYGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNNPLGV 1369 +S TCCC++++G +C WGCCPLEGATCCDDHYSCCPH+YPICN GTCL S NNP GV Sbjct: 374 DSATCCCIFEFGKTCFEWGCCPLEGATCCDDHYSCCPHDYPICNTYAGTCLRSKNNPFGV 433 Query: 1370 KALLRTPAKP 1399 KAL RTPAKP Sbjct: 434 KALRRTPAKP 443 >ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max] Length = 496 Score = 656 bits (1693), Expect = 0.0 Identities = 300/431 (69%), Positives = 349/431 (80%), Gaps = 4/431 (0%) Frame = +2 Query: 125 DMSII---NQAEESWRTDEEVKNLYESWLVKHGKAYNALGEKEKRLEIFKDNLRFIDEHN 295 DMSII N + R+DEE+ ++YE WLVKHGK YNALGEKEKR +IFKDNLRFID+HN Sbjct: 55 DMSIISYDNAHAATSRSDEELMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHN 114 Query: 296 AG-NHTFTVGLNRFADLTNEEYRKMYLGVRTDAHRRFSRPKSNRYALRAGDDLPDSIDWR 472 + + T+ +GLNRFADLTNEEYR YLG + D +RR + SNRYA R GD LP+S+DWR Sbjct: 115 SQEDRTYKLGLNRFADLTNEEYRAKYLGTKIDPNRRLGKTPSNRYAPRVGDKLPESVDWR 174 Query: 473 KKGAVVAIKDQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEGCNGGL 652 K+GAV +KDQGGCGSCWAFS I AVEGIN+IVTG+L+SLSEQELVDCDT YNEGCNGGL Sbjct: 175 KEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDTGYNEGCNGGL 234 Query: 653 MDYAFEFIIKNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKALQKAVA 832 MDYAFEFII NGGID+E+DYPY+ DG CD YRKNA+VV+ID YEDVP DE AL+KAVA Sbjct: 235 MDYAFEFIINNGGIDSEEDYPYRGVDGRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVA 294 Query: 833 NQPVSVAIEGGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSWGASWG 1012 NQPVSVAIEGGGR FQLYVSG+FTGRCGTALDHGV AVGYGT NG DYWIV+NSWG SWG Sbjct: 295 NQPVSVAIEGGGREFQLYVSGVFTGRCGTALDHGVVAVGYGTANGHDYWIVRNSWGPSWG 354 Query: 1013 
EDGYIRMERNLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDHYACPE 1192 EDGYIR+ERNLA++++GKCGIA++ SYP+ CD++Y+C + Sbjct: 355 EDGYIRLERNLANSRSGKCGIAIEPSYPLKNGPNPPNPGPSPPSPVKPPNVCDNYYSCAD 414 Query: 1193 STTCCCVYQYGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNNPLGVK 1372 S TCCC++++G++C WGCCPLEGATCCDDHYSCCP++YPICN GTCL S NNP GVK Sbjct: 415 SATCCCIFEFGNACFEWGCCPLEGATCCDDHYSCCPNDYPICNTYAGTCLKSKNNPFGVK 474 Query: 1373 ALLRTPAKPRW 1405 AL RTPAKP W Sbjct: 475 ALRRTPAKPHW 485 >gb|EXB56503.1| Cysteine proteinase RD21a [Morus notabilis] Length = 463 Score = 655 bits (1691), Expect = 0.0 Identities = 305/433 (70%), Positives = 348/433 (80%), Gaps = 3/433 (0%) Frame = +2 Query: 125 DMSIINQA---EESWRTDEEVKNLYESWLVKHGKAYNALGEKEKRLEIFKDNLRFIDEHN 295 DMSI++ + S RTD+EV +YE+WLVKHGK YNALGEKEKR +IFKDNLRFIDEHN Sbjct: 27 DMSIVSYNLVDQSSSRTDDEVMAIYEAWLVKHGKVYNALGEKEKRFQIFKDNLRFIDEHN 86 Query: 296 AGNHTFTVGLNRFADLTNEEYRKMYLGVRTDAHRRFSRPKSNRYALRAGDDLPDSIDWRK 475 A + + +GLNRFADLTNEEYR YLG + ++ S+ RYA R GDDLP +DWRK Sbjct: 87 AQDRPYKLGLNRFADLTNEEYRSTYLGTKIRGQKKVSQ----RYAPRVGDDLPSFVDWRK 142 Query: 476 KGAVVAIKDQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEGCNGGLM 655 +GAVV +KDQG CGSCWAFSTIAAVEGI++IVTGDL+SLSEQELVDCDTSYNEGCNGGLM Sbjct: 143 EGAVVGVKDQGSCGSCWAFSTIAAVEGISKIVTGDLVSLSEQELVDCDTSYNEGCNGGLM 202 Query: 656 DYAFEFIIKNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKALQKAVAN 835 DYAFEFIIKNGGID+E+DYPYK D CDQYRKNA+VV+ID YEDVP NDEKAL KAVAN Sbjct: 203 DYAFEFIIKNGGIDSEEDYPYKGYDSRCDQYRKNAKVVSIDDYEDVPANDEKALLKAVAN 262 Query: 836 QPVSVAIEGGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSWGASWGE 1015 QPV+VAIEGGGRAFQLYVSGIFT CGT+LDHGVA VGYGTENG DYWIV+NSWG++WGE Sbjct: 263 QPVAVAIEGGGRAFQLYVSGIFTETCGTSLDHGVAVVGYGTENGLDYWIVRNSWGSNWGE 322 Query: 1016 DGYIRMERNLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDHYACPES 1195 DGYIRMERNLAS TGKCGIA++ SYPI CD++Y+C +S Sbjct: 323 DGYIRMERNLASTPTGKCGIAIEPSYPIKKGQNPPNPGPSPPSPVKPPTVCDNYYSCADS 382 Query: 1196 TTCCCVYQYGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNNPLGVKA 1375 TCCCV+++G C WGCCPLE 
ATCC+DH SCCPH+YPICN+ GTCLMS +NPLGVKA Sbjct: 383 DTCCCVFEWGRYCFSWGCCPLEAATCCEDHNSCCPHDYPICNINAGTCLMSKDNPLGVKA 442 Query: 1376 LLRTPAKPRWGHK 1414 L RT AKP W + Sbjct: 443 LKRTAAKPHWAFR 455 >ref|XP_004291075.1| PREDICTED: cysteine proteinase RD21a-like [Fragaria vesca subsp. vesca] Length = 469 Score = 655 bits (1690), Expect = 0.0 Identities = 300/433 (69%), Positives = 345/433 (79%), Gaps = 6/433 (1%) Frame = +2 Query: 125 DMSIIN------QAEESWRTDEEVKNLYESWLVKHGKAYNALGEKEKRLEIFKDNLRFID 286 DMSII + +S R+D+EV +LYE WL +HGKAYN LGEKEKR +IFKDNL+FID Sbjct: 26 DMSIIAYDNNHASSSDSGRSDDEVMSLYERWLAEHGKAYNGLGEKEKRFQIFKDNLKFID 85 Query: 287 EHNAGNHTFTVGLNRFADLTNEEYRKMYLGVRTDAHRRFSRPKSNRYALRAGDDLPDSID 466 EHNA N ++ +GLNRFADL+N+EYR +LG + A R S+ KS+RYA R GD LPDS+D Sbjct: 86 EHNALNLSYKLGLNRFADLSNDEYRSTFLGTKPRAMNRLSKTKSDRYAPRVGDQLPDSVD 145 Query: 467 WRKKGAVVAIKDQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEGCNG 646 WRK+GAV A+KDQG CGSCWAFSTI AVEGIN+IVTGDL+SLSEQELVDCD +YNEGCNG Sbjct: 146 WRKEGAVTAVKDQGQCGSCWAFSTICAVEGINKIVTGDLISLSEQELVDCDKTYNEGCNG 205 Query: 647 GLMDYAFEFIIKNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKALQKA 826 GLMDYAFEFII NGGID+EDDYPYK D CD YRKNARVV+IDSYEDVP DEKAL+KA Sbjct: 206 GLMDYAFEFIINNGGIDSEDDYPYKGYDSTCDTYRKNARVVSIDSYEDVPTYDEKALKKA 265 Query: 827 VANQPVSVAIEGGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSWGAS 1006 VANQP++VAIEGGGR FQLY SG+FTGRCGTALDHGVA VGYGTE+G DYWIV+NSWG S Sbjct: 266 VANQPIAVAIEGGGREFQLYSSGVFTGRCGTALDHGVAVVGYGTEHGSDYWIVRNSWGDS 325 Query: 1007 WGEDGYIRMERNLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDHYAC 1186 WGE GYIRMERNL ++ TGKCGIAM+ SYP+ CD++++C Sbjct: 326 WGESGYIRMERNLGNSATGKCGIAMEPSYPVKIGQNPPNPGPSPPSPIKPPQVCDNYFSC 385 Query: 1187 PESTTCCCVYQYGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNNPLG 1366 PES TCCC+YQY + C WGCCPLEGATCCDDHYSCCP +YP+CNV GTC +S NP+ Sbjct: 386 PESNTCCCIYQYQNYCFAWGCCPLEGATCCDDHYSCCPSDYPVCNVNAGTCQLSKGNPMS 445 Query: 1367 VKALLRTPAKPRW 1405 VKAL RTPAK W Sbjct: 446 VKALKRTPAKAHW 458 >ref|XP_003589133.1| Cysteine proteinase [Medicago 
truncatula] gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula] gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula] Length = 474 Score = 655 bits (1689), Expect = 0.0 Identities = 302/436 (69%), Positives = 342/436 (78%), Gaps = 9/436 (2%) Frame = +2 Query: 125 DMSIIN------QAEESWRTDEEVKNLYESWLVKHGKAYNALGEKEKRLEIFKDNLRFID 286 DMSII+ S RT++EV +YE WLVKHGK+YN LGEK+KR EIFKDNL+FID Sbjct: 28 DMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNGLGEKDKRFEIFKDNLKFID 87 Query: 287 EHNAGNHTFTVGLNRFADLTNEEYRKMYLGVRTDAHRRFSR---PKSNRYALRAGDDLPD 457 EHN N T+ +GL RFADLTNEEYR +LG + D +RR + KSNRYA R GD LP+ Sbjct: 88 EHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKKLGGSKSNRYAPRVGDKLPE 147 Query: 458 SIDWRKKGAVVAIKDQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEG 637 S+DWRK+GAVV +KDQ CGSCWAFS IAAVEGIN+IVTGDL+SLSEQELVDCDTSYNEG Sbjct: 148 SVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDTSYNEG 207 Query: 638 CNGGLMDYAFEFIIKNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKAL 817 CNGGLMDYAFEFII NGGID+EDDYPYK+ DG CDQ RKNA+VVTID YEDVP DE AL Sbjct: 208 CNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAKVVTIDDYEDVPAYDELAL 267 Query: 818 QKAVANQPVSVAIEGGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSW 997 QKAVANQP++VA+EGGGR FQLY G+FTGRCGTALDHGVAAVGYGTENGKDYWIV+NSW Sbjct: 268 QKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHGVAAVGYGTENGKDYWIVRNSW 327 Query: 998 GASWGEDGYIRMERNLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDH 1177 G SWGE GYIR+ERNLAS++ GKCGIA++ SYPI CD + Sbjct: 328 GGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIKNGQNPPNPGPSPPSPIKPPSVCDSY 387 Query: 1178 YACPESTTCCCVYQYGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNN 1357 Y+C E +TCCC+Y+YG SC WGCCPLE ATCCDDHYSCCPH YP+C+ G CL NN Sbjct: 388 YSCAEGSTCCCIYEYGRSCFEWGCCPLESATCCDDHYSCCPHEYPVCDTRAGLCLKGKNN 447 Query: 1358 PLGVKALLRTPAKPRW 1405 PLGVK+ RTPAKP W Sbjct: 448 PLGVKSFKRTPAKPHW 463 >dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus] Length = 462 Score = 654 bits (1686), Expect = 0.0 Identities = 306/431 (70%), Positives = 345/431 (80%), Gaps 
= 6/431 (1%) Frame = +2 Query: 125 DMSIINQ-----AEESWRTDEEVKNLYESWLVKHGKAYNALGEKEKRLEIFKDNLRFIDE 289 DMSIIN ++ SWRTD+EV +YESWLVKHGK+YNALGEKEKR +IFKDNLRFIDE Sbjct: 24 DMSIINYDATHASKSSWRTDDEVMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLRFIDE 83 Query: 290 HNAG-NHTFTVGLNRFADLTNEEYRKMYLGVRTDAHRRFSRPKSNRYALRAGDDLPDSID 466 HNA N ++ VGLNRFADLTNEEYR YLG ++ + S+ KS+RYA R GD LP+S+D Sbjct: 84 HNAEENLSYKVGLNRFADLTNEEYRSTYLGAKSKP--KLSKVKSDRYAPRVGDSLPESVD 141 Query: 467 WRKKGAVVAIKDQGGCGSCWAFSTIAAVEGINQIVTGDLLSLSEQELVDCDTSYNEGCNG 646 WR KGAV IKDQG CGSCWAFST+ AVEGINQIVTG+L++LSEQELVDCD SYNEGC+G Sbjct: 142 WRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDCDKSYNEGCDG 201 Query: 647 GLMDYAFEFIIKNGGIDTEDDYPYKSRDGVCDQYRKNARVVTIDSYEDVPVNDEKALQKA 826 GLMDY FEFII NGGIDT+ DYPY RD CDQYRKNA+VVTIDSYEDVPVN+E+AL+KA Sbjct: 202 GLMDYGFEFIINNGGIDTDKDYPYLGRDARCDQYRKNAKVVTIDSYEDVPVNNEEALKKA 261 Query: 827 VANQPVSVAIEGGGRAFQLYVSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVKNSWGAS 1006 VA+QPVSV IEGGGRAFQ Y SGIFTG+CGTALDHGV VGYGTE GKDYWIV+NSWG+S Sbjct: 262 VASQPVSVGIEGGGRAFQFYDSGIFTGKCGTALDHGVNVVGYGTEKGKDYWIVRNSWGSS 321 Query: 1007 WGEDGYIRMERNLASAKTGKCGIAMQASYPIXXXXXXXXXXXXXXXXXXXXXXCDDHYAC 1186 WGE GYIRMERNLA GKCGIAM+ SYP+ CDD+Y C Sbjct: 322 WGEAGYIRMERNLAGTSVGKCGIAMEPSYPLKNGQNPPNPGPSPPTPVRPPTVCDDYYTC 381 Query: 1187 PESTTCCCVYQYGSSCLGWGCCPLEGATCCDDHYSCCPHNYPICNVEEGTCLMSNNNPLG 1366 PES+TCCCVY+Y C WGCCPL+GATCCDDHYSCCPH+YP+CNV+ GTC MS NNPLG Sbjct: 382 PESSTCCCVYEYYGYCFSWGCCPLDGATCCDDHYSCCPHDYPVCNVQAGTCSMSKNNPLG 441 Query: 1367 VKALLRTPAKP 1399 VKA+ R A P Sbjct: 442 VKAIQRILATP 452