BLASTX nr result
ID: Mentha24_contig00022000
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha24_contig00022000 (1359 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU36745.1| hypothetical protein MIMGU_mgv1a006749mg [Mimulus... 654 0.0 dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] 577 e-162 ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S... 576 e-162 ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha... 575 e-161 gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A... 574 e-161 ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab... 568 e-159 ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr... 568 e-159 ref|XP_002307688.2| cysteine protease family protein [Populus tr... 565 e-158 ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S... 565 e-158 ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr... 563 e-158 ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C... 562 e-157 ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087... 558 e-156 ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Caps... 558 e-156 ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a... 558 e-156 gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlise... 557 e-156 ref|XP_002510459.1| cysteine protease, putative [Ricinus communi... 556 e-156 gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase... 553 e-155 ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C... 552 e-154 ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine... 550 e-154 gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] 548 e-153 >gb|EYU36745.1| hypothetical protein MIMGU_mgv1a006749mg [Mimulus guttatus] Length = 433 Score = 654 bits (1688), Expect = 0.0 Identities = 303/426 (71%), Positives = 347/426 (81%), Gaps = 1/426 (0%) Frame = -1 Query: 1356 LSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKA 1177 L+L +L SQ P SS IS+LFD WCEE+GKTYASE+EKQHRL VF NY+ V +HN A Sbjct: 8 LNLIMLFSQLPISKSSLISDLFDSWCEEYGKTYASEQEKQHRLNVFHENYKYVNQHNADA 67 Query: 1176 NSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREF-AIDGPDLVEESDLPASV 1000 NSS+TLS+NAFADLTN EF+ YLGL PS D +IRLNSR AIDG +L++ES++P+S+ Sbjct: 68 NSSYTLSVNAFADLTNHEFRANYLGLSPSKSDSVIRLNSRSASAIDGDNLIKESEIPSSL 127 Query: 999 DWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCG 820 DWR KGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCD +YN GC Sbjct: 128 DWRNKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCN 187 Query: 819 GGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQ 640 GGLMDYA++FIIKNKGIDTEEDY Y+GR C K K+ +HVVTIDSY D+P + EKKLLQ Sbjct: 188 GGLMDYAYDFIIKNKGIDTEEDYSYKGRSATCDKNKMNKHVVTIDSYVDIPEKDEKKLLQ 247 Query: 639 AVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGK 460 AVATQP+SVGICGSD FQLYSGGIF+GPCST+LDHAVLIVGYDS+DG DYWI+KNSWGK Sbjct: 248 AVATQPISVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGK 307 Query: 459 YWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETC 280 WG+ GY+HM+RNS EGVCGINTLAS+P+K TKC++FTYC + ETC Sbjct: 308 SWGIKGYMHMVRNSGSEEGVCGINTLASYPVKSSTNPPPSPTPGPTKCNIFTYCSSGETC 367 Query: 279 CCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEK 100 CC LG+C SW CCEAESAVCCDDH HCCP DYP CDT +NLCLK+ GN+T+SKP K Sbjct: 368 CCARYFLGVCLSWNCCEAESAVCCDDHRHCCPHDYPVCDTKKNLCLKKSGNTTVSKPLGK 427 Query: 99 KGFFTS 82 K F S Sbjct: 428 KSFSAS 433 >dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] Length = 441 Score = 577 bits (1486), Expect = e-162 Identities = 274/421 (65%), Positives = 325/421 (77%), Gaps = 1/421 (0%) Frame = -1 Query: 1359 LLSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTK 1180 LLS L S +SS I+ LF+ WC++HGKTYAS+EEK RL+VF+ NY+ V EHN++ Sbjct: 12 LLSYLFLFSS----SSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQ 67 Query: 1179 ANSSHTLSLNAFADLTNQEFKDKYLGLLPSAD-DLLIRLNSREFAIDGPDLVEESDLPAS 1003 NSS+TLSLNAFADLT+ EFK LGL +A L + ++R+ PD V +D+PAS Sbjct: 68 GNSSYTLSLNAFADLTHHEFKASRLGLSSAASASLNVDRSNRQI----PDFV--ADVPAS 121 Query: 1002 VDWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGC 823 VDWRK GAVT VKDQG+CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD +YN+GC Sbjct: 122 VDWRKNGAVTQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGC 181 Query: 822 GGGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLL 643 GG+MDYAF+F+I N GIDTEEDYPY+GRD C+KEKLKRHVVTID Y DVP EK+LL Sbjct: 182 EGGIMDYAFQFVIDNHGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELL 241 Query: 642 QAVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWG 463 +AVA QPVSVGICGS+ FQLYS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWG Sbjct: 242 KAVANQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWG 301 Query: 462 KYWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDET 283 YWGM+GY+HM RNS + G+CGIN LAS+P K T+CDLFT+CG ET Sbjct: 302 SYWGMDGYMHMQRNSGSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGET 361 Query: 282 CCCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFE 103 CCC + GIC SW+CCE +SAVCC D HCCPRDYP CDT RN+CLK GN+T + F Sbjct: 362 CCCVHHIFGICLSWKCCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFA 421 Query: 102 K 100 K Sbjct: 422 K 422 >ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum] Length = 439 Score = 576 bits (1485), Expect = e-162 Identities = 268/422 (63%), Positives = 326/422 (77%) Frame = -1 Query: 1356 LSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKA 1177 L L LLI Q P C SSIS+LF+ WC+++GK Y+SE+E+ +R +VFE NY + EHN+K Sbjct: 8 LVLVLLIFQQPFCTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYITEHNSKE 67 Query: 1176 NSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVD 997 NSS+TL LNA++DLT+ EF++ +LGL SA+D IRL R ++ + D P+S+D Sbjct: 68 NSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSETGVLSDVDAPSSLD 126 Query: 996 WRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCGG 817 WR+KGAVT VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD +YN GCGG Sbjct: 127 WREKGAVTDVKNQGSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRSYNEGCGG 186 Query: 816 GLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 637 GLMDYAFEF+IKN GIDTE+DYP+R R+G C+K KL+RHVVTID Y D+P E KLL+A Sbjct: 187 GLMDYAFEFVIKNGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQNDEDKLLKA 246 Query: 636 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 457 VATQPVSVGICGS FQ YS GIF+GPCSTALDHAVLIVGY S++GVDYWI+KNSWG Sbjct: 247 VATQPVSVGICGSARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTS 306 Query: 456 WGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCC 277 WG+NGYIHM RNS + EG+CGIN LAS+P K +KC +FT CG ETCC Sbjct: 307 WGINGYIHMQRNSGNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSCGQGETCC 366 Query: 276 CYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 97 C LGIC SW+CC +SAVCC D HCCP+DYP CDT+RNLCLKR+ N+T+ + +K+ Sbjct: 367 CGSKFLGICLSWKCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATIVQQPQKE 426 Query: 96 GF 91 F Sbjct: 427 AF 428 >ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana] gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana] gi|332190386|gb|AEE28507.1| papain-like cysteine peptidase [Arabidopsis thaliana] Length = 437 Score = 575 bits (1481), Expect = e-161 Identities = 271/420 (64%), Positives = 322/420 (76%) Frame = -1 Query: 1356 LSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKA 1177 L+ F L+ + +S ISELFD WC++HGKTY SEEE+Q R+++F+ N++ V +HN Sbjct: 11 LTFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLIT 70 Query: 1176 NSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVD 997 N++++LSLNAFADLT+ EFK LGL SA +++ A G L +P SVD Sbjct: 71 NATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVD 123 Query: 996 WRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCGG 817 WRKKGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YN+GC G Sbjct: 124 WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNG 183 Query: 816 GLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 637 GLMDYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V EK L++A Sbjct: 184 GLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEA 243 Query: 636 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 457 VA QPVSVGICGS+ FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK Sbjct: 244 VAAQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKS 303 Query: 456 WGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCC 277 WGM+G++HM RN+E+++GVCGIN LAS+PIK TKC+LFTYC + ETCC Sbjct: 304 WGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCC 363 Query: 276 CYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 97 C L G+CFSW+CCE ESAVCC D HCCP DYP CDT R+LCLK+ GN T KPF KK Sbjct: 364 CARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKK 423 >gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] Length = 437 Score = 574 bits (1479), Expect = e-161 Identities = 271/420 (64%), Positives = 322/420 (76%) Frame = -1 Query: 1356 LSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKA 1177 L+ F L+ + +S ISELFD WC++HGKTY SEEE+Q R+++F+ N++ V +HN Sbjct: 11 LTFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLIT 70 Query: 1176 NSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVD 997 N++++LSLNAFADLT+ EFK LGL SA +++ A G L +P SVD Sbjct: 71 NATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVD 123 Query: 996 WRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCGG 817 WRKKGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YN+GC G Sbjct: 124 WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNG 183 Query: 816 GLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 637 GLMDYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V EK L++A Sbjct: 184 GLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEA 243 Query: 636 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 457 VA QPVSVGICGS+ FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK Sbjct: 244 VAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKS 303 Query: 456 WGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCC 277 WGM+G++HM RN+E+++GVCGIN LAS+PIK TKC+LFTYC + ETCC Sbjct: 304 WGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCC 363 Query: 276 CYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 97 C L G+CFSW+CCE ESAVCC D HCCP DYP CDT R+LCLK+ GN T KPF KK Sbjct: 364 CARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKK 423 >ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] Length = 439 Score = 568 bits (1465), Expect = e-159 Identities = 273/423 (64%), Positives = 321/423 (75%), Gaps = 2/423 (0%) Frame = -1 Query: 1359 LLSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTK 1180 L FLL+ P+ +S ISELFD WC+ HGKTY SEEE+Q R+++F+ N++ V +HN Sbjct: 11 LTFFFLLLVSSPS-SSDDISELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLI 69 Query: 1179 ANSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASV 1000 N++++LSLNAFADLT+ EFK LGL SA L++ A G L + +P SV Sbjct: 70 TNATYSLSLNAFADLTHHEFKASRLGLSVSASSLIM-------ASKGQSLGGNAKVPDSV 122 Query: 999 DWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCG 820 DWRKKGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YN+GC Sbjct: 123 DWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCN 182 Query: 819 GGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQ 640 GGLMDYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V EK L + Sbjct: 183 GGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALRE 242 Query: 639 AVATQPVSVGICGSDYKFQLYS--GGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSW 466 AVA QPVSVGICGS+ FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSW Sbjct: 243 AVAAQPVSVGICGSERAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSW 302 Query: 465 GKYWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDE 286 GK WGM+G++HM RN+ ++EG+CGIN LAS+PIK TKC+LFTYC E Sbjct: 303 GKSWGMDGFMHMQRNTGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGE 362 Query: 285 TCCCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPF 106 TCCC +L G+CFSW+CCE ESAVCC D HCCP DYP CDT R+LCLK+ GN T KPF Sbjct: 363 TCCCARNLFGLCFSWKCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPF 422 Query: 105 EKK 97 KK Sbjct: 423 WKK 425 >ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] gi|557095297|gb|ESQ35879.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] Length = 444 Score = 568 bits (1463), Expect = e-159 Identities = 269/421 (63%), Positives = 324/421 (76%) Frame = -1 Query: 1359 LLSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTK 1180 LL + L + +S I+ELFD WC HGKTY SEEE+QHR+++F N++ V +HN Sbjct: 15 LLLVSSLSFSISSSSSDDIAELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHNHI 74 Query: 1179 ANSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASV 1000 +NS+++LSLNAFADLT+ EFK LGL + L+ ++E ++ + V +P SV Sbjct: 75 SNSTYSLSLNAFADLTHHEFKASRLGLSAPSPSLM----AKEQSLGVSERVRVK-VPDSV 129 Query: 999 DWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCG 820 DWRKKGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YN+GC Sbjct: 130 DWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCN 189 Query: 819 GGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQ 640 GGLMDYAFEF+IKN GIDTE+DYPY+ +DG C K+KLK+ VVTIDSYA V EK L++ Sbjct: 190 GGLMDYAFEFVIKNHGIDTEKDYPYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALME 249 Query: 639 AVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGK 460 AVA+QPVSVGICGS+ FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK Sbjct: 250 AVASQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGK 309 Query: 459 YWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETC 280 WGM+G++HM RN+ ++EGVCGIN LAS+PIK TKC+LFTYC + ETC Sbjct: 310 SWGMDGFMHMQRNTGNSEGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETC 369 Query: 279 CCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEK 100 CC +L G+CFSW+CCE ESAVCC D HCCPRDYP CDT ++LCLK+ GN T KPF K Sbjct: 370 CCARTLFGLCFSWKCCELESAVCCKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPFWK 429 Query: 99 K 97 K Sbjct: 430 K 430 >ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa] gi|550339725|gb|EEE94684.2| cysteine protease family protein [Populus trichocarpa] Length = 436 Score = 565 bits (1457), Expect = e-158 Identities = 266/422 (63%), Positives = 318/422 (75%), Gaps = 1/422 (0%) Frame = -1 Query: 1359 LLSLFLLISQF-PTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNT 1183 + +L LLIS P+ +SS IS+LF+ WC+EHGK+Y S+EE+ HRL+VFE NY+ V +HN+ Sbjct: 6 IFALTLLISVLSPSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNS 65 Query: 1182 KANSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPAS 1003 K NSS++L+LNAFADLT+ EFK LGL + + L R I G D+PAS Sbjct: 66 KGNSSYSLALNAFADLTHHEFKTSRLGLSAAP----LNLAHRNLEITGV----VGDIPAS 117 Query: 1002 VDWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGC 823 +DWR KG VT VKDQGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELI+CD +YN GC Sbjct: 118 IDWRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGC 177 Query: 822 GGGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLL 643 GGGLMDYAF+F+I N GIDTEEDYPYR RDG C+K+++KR VVTID Y DVP EK+LL Sbjct: 178 GGGLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLL 237 Query: 642 QAVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWG 463 QAVA QPVSVGICGS+ FQ+YS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWG Sbjct: 238 QAVAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWG 297 Query: 462 KYWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDET 283 WGM GY+HM RNS +++GVCGIN LAS+P+K TKC+L TYC ET Sbjct: 298 TGWGMRGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGET 357 Query: 282 CCCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFE 103 CCC GIC SW+CC +SAVCC D HCCP DYP CDT +N+C KR GN+T + E Sbjct: 358 CCCARKFFGICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIE 417 Query: 102 KK 97 K Sbjct: 418 GK 419 >ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum] Length = 439 Score = 565 bits (1455), Expect = e-158 Identities = 262/422 (62%), Positives = 319/422 (75%) Frame = -1 Query: 1356 LSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKA 1177 L L LLI Q P C SSIS+LF+ WC+++GK Y+SE+E+ +R +VFE NY + EHN+K Sbjct: 8 LVLVLLIFQQPLCTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYITEHNSKG 67 Query: 1176 NSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVD 997 NSS+TL LNA++DLT+ EF++ +LGL SA+D IRL R ++ + D P+S+D Sbjct: 68 NSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSAAGVLSDVDAPSSLD 126 Query: 996 WRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCGG 817 WR KGAVT VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD +YN GCGG Sbjct: 127 WRDKGAVTNVKNQGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRSYNQGCGG 186 Query: 816 GLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 637 GLMDYAFEF+IKN GIDTE+DYP+R ++G C+K KL+R VVTID Y D+P E KLL+A Sbjct: 187 GLMDYAFEFVIKNGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQNDEDKLLKA 246 Query: 636 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 457 VATQPVSVGICGS FQ YS GIF+GPC T LDHAVLIVGY S++G DYWI+KNSWG Sbjct: 247 VATQPVSVGICGSARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWIIKNSWGTS 306 Query: 456 WGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCC 277 WG+NGYIHM RNS + EG+CG+N LAS+P K +KC FT CG ETCC Sbjct: 307 WGINGYIHMQRNSGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSCGQGETCC 366 Query: 276 CYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 97 C LGIC SW+CC +SAVCC D HCCP DYP CDT+RNLCLKR+ N+T+ + +K+ Sbjct: 367 CGLKFLGICLSWKCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATIVQQPQKE 426 Query: 96 GF 91 F Sbjct: 427 PF 428 >ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] gi|557537201|gb|ESR48319.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] Length = 441 Score = 563 bits (1451), Expect = e-158 Identities = 265/423 (62%), Positives = 323/423 (76%), Gaps = 1/423 (0%) Frame = -1 Query: 1359 LLSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTK 1180 LLS+ LL+S P S I+ELF+ WC++HGK Y+SE+EKQ RL++FE NY V +HN Sbjct: 8 LLSI-LLLSSLPLNYCSDINELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQHNNM 66 Query: 1179 ANSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASV 1000 NSS TLSLNAFADLT+QEFK +LG ++ D R N+ ++ P + D+PAS+ Sbjct: 67 GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA---SVQSPGTLR--DVPASI 121 Query: 999 DWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCG 820 DWRKKGAVT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD +YNSGCG Sbjct: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCG 181 Query: 819 GGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQ 640 GGLMDYA++F+IKN GIDTE+DYPYRG+ G+C+K+KL RH+VTID Y DVP EK+LLQ Sbjct: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241 Query: 639 AVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGK 460 AV QPVSVGICGS+ FQLYS GIF+GPCST+LDHAVLIVGYDS++GVDYWI+KNSWG+ Sbjct: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGR 301 Query: 459 YWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETC 280 WGMNGY+HM RN+ ++ G+CGIN LAS+P K T+C L TYC ETC Sbjct: 302 SWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETC 361 Query: 279 CCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRI-GNSTLSKPFE 103 CC S+LGIC SW+CC SAVCC DH +CCP +YP CD+ R+ CL R GN T ++ E Sbjct: 362 CCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAIE 421 Query: 102 KKG 94 +G Sbjct: 422 MRG 424 >ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis] Length = 441 Score = 562 bits (1449), Expect = e-157 Identities = 264/423 (62%), Positives = 324/423 (76%), Gaps = 1/423 (0%) Frame = -1 Query: 1359 LLSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTK 1180 LLS+ LL+S P S I+ELF+ WC++HGK Y+SE+EKQ RL++FE NY V +HN Sbjct: 8 LLSI-LLLSSLPPNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM 66 Query: 1179 ANSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASV 1000 NSS TLSLNAFADLT+QEFK +LG ++ D R N+ ++ P + D+PAS+ Sbjct: 67 GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA---SVQSPGNLR--DVPASI 121 Query: 999 DWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCG 820 DWRKKGAVT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD +YNSGCG Sbjct: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCG 181 Query: 819 GGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQ 640 GGLMDYA++F+IKN GIDTE+DYPYRG+ G+C+K+KL RH+VTID Y DVP EK+LLQ Sbjct: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241 Query: 639 AVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGK 460 AV QPVSVGICGS+ FQLYS GIF+GPCST+LDHAVLI+GYDS++GVDYWI+KNSWG+ Sbjct: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGR 301 Query: 459 YWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETC 280 WGMNGY+HM RN+ ++ G+CGIN LAS+P K T+C L TYC ETC Sbjct: 302 SWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETC 361 Query: 279 CCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRI-GNSTLSKPFE 103 CC S+LGIC SW+CC SAVCC DH +CCP +YP CD+ R+ CL R+ GN T ++ E Sbjct: 362 CCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIE 421 Query: 102 KKG 94 +G Sbjct: 422 MRG 424 >ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao] Length = 438 Score = 558 bits (1439), Expect = e-156 Identities = 266/417 (63%), Positives = 313/417 (75%) Frame = -1 Query: 1347 FLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKANSS 1168 FLL + S IS LF+ WC++HGK Y+SEEEK +RL+VFE NY V +HN NSS Sbjct: 12 FLLFFDPSFASPSHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSS 71 Query: 1167 HTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRK 988 ++L+LNAFADLT+ EFK LGL +A + + P LV D+PAS+DWR Sbjct: 72 YSLALNAFADLTHHEFKASRLGLSAAA------IEGSRPNLQLPGLVR--DIPASMDWRT 123 Query: 987 KGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCGGGLM 808 KGAVT VKDQGSCGACWSFSATGA+EGIN+I TG+LVSLSEQEL+DCD +YNSGC GGLM Sbjct: 124 KGAVTKVKDQGSCGACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLM 183 Query: 807 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 628 DYA++F+I N GID EEDYPY GR+ C+KEK KR VVTID YA VP E LLQAVA Sbjct: 184 DYAYQFVIDNHGIDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAK 243 Query: 627 QPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGM 448 QPVSVGICGS+ FQLYS GIF+GPCS++LDHAVLIVGY S++GVDYWIVKNSWG WGM Sbjct: 244 QPVSVGICGSERAFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGM 303 Query: 447 NGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCCYW 268 NGYIHMLRNS D++G+CGIN LAS+P K TKCDLFTYC ETCCC Sbjct: 304 NGYIHMLRNSGDSKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTH 363 Query: 267 SLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 97 + GICFSW+CCE +SAVCC D+ HCCP DYP CDT ++ CLKR+GN+T + FEK+ Sbjct: 364 RIFGICFSWKCCELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFEKR 420 >ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Capsella rubella] gi|482576142|gb|EOA40329.1| hypothetical protein CARUB_v10009056mg [Capsella rubella] Length = 467 Score = 558 bits (1438), Expect = e-156 Identities = 270/444 (60%), Positives = 317/444 (71%), Gaps = 28/444 (6%) Frame = -1 Query: 1344 LLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKANSSH 1165 LL+S + +S ISELFD WC+ HGKTYASE EKQHR ++F N++ V +HN N+++ Sbjct: 17 LLVSSSSSSSSDDISELFDDWCQRHGKTYASEAEKQHRFQIFRDNHDFVTQHNLITNATY 76 Query: 1164 TLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKK 985 +LSLNAFADL + EFK LGL SA +++ A G L +P S+DWRKK Sbjct: 77 SLSLNAFADLNHSEFKTSRLGLSVSAPSVIM-------ASKGKSLGGSVKVPDSLDWRKK 129 Query: 984 GAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCGGGLMD 805 GAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YN GC GGLMD Sbjct: 130 GAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNDGCNGGLMD 189 Query: 804 YAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQ 625 YAFEF+IKNKGIDTE+DYPY+ RDG C K+KLK+ VV+IDSYA V P EK LL+AVA Q Sbjct: 190 YAFEFVIKNKGIDTEKDYPYQERDGTCKKDKLKQRVVSIDSYAGVKPSDEKALLEAVAAQ 249 Query: 624 PVSVGICGSDYKFQLYSG----------------------------GIFSGPCSTALDHA 529 PVSVGICGS+ FQLYS GIFSGPCST+LDHA Sbjct: 250 PVSVGICGSERAFQLYSSVSFKIRDTSILSSECSTFPCLKLYLMMQGIFSGPCSTSLDHA 309 Query: 528 VLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXX 349 VLIVGY SQ+GVDYWIVKNSWGK WGM+G++HM RN+ +++G+CGIN LAS+PIK Sbjct: 310 VLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSQGICGINMLASYPIKTHPNP 369 Query: 348 XXXXXXXXTKCDLFTYCGTDETCCCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPT 169 TKC+LFTYC ETCCC +L G+C SW+CCE ESAVCC D HCCP DYP Sbjct: 370 PPPSPPGPTKCNLFTYCSAAETCCCARNLFGLCLSWKCCEIESAVCCKDGRHCCPHDYPV 429 Query: 168 CDTARNLCLKRIGNSTLSKPFEKK 97 CDT R+LCLK+ GN T KPF KK Sbjct: 430 CDTTRSLCLKKTGNFTAIKPFWKK 453 >ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum] Length = 436 Score = 558 bits (1437), Expect = e-156 Identities = 267/423 (63%), Positives = 308/423 (72%) Frame = -1 Query: 1356 LSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKA 1177 L L L +S + S+LF WC++HGKTY SE+EK++R VFE NY V +HN Sbjct: 9 LQLLLSLSLLSFVTAIDTSKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIG 68 Query: 1176 NSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVD 997 NSS+TLSLNAFADLT+ EFK LGL PS+ L+R F D + +P+ +D Sbjct: 69 NSSYTLSLNAFADLTHHEFKATRLGLPPSS---LLRFKFNRFQ-DQQRSDDFLQVPSEID 124 Query: 996 WRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCGG 817 WRK GAV+ VKDQGSCGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCDTTYNSGC G Sbjct: 125 WRKNGAVSIVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDG 184 Query: 816 GLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 637 GLMDYA++FII N GIDTEEDYPY+ R C K+KLKR VVTID Y DVPP EKKLL+A Sbjct: 185 GLMDYAYQFIIDNNGIDTEEDYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKA 244 Query: 636 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 457 VA QPVSVGICGS FQLYS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWGKY Sbjct: 245 VAVQPVSVGICGSARAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKY 304 Query: 456 WGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCC 277 WGMNGYIHMLRN++ + G+CGIN LAS+P K KC+LFTYC ETCC Sbjct: 305 WGMNGYIHMLRNTDSSAGLCGINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCC 364 Query: 276 CYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 97 C LGICFSW+CC SAVCC D HCCP DYP CD + CLKRI N T+ +K+ Sbjct: 365 CAKKFLGICFSWKCCGVTSAVCCKDKRHCCPLDYPVCDASNGQCLKRIANGTILMTSDKE 424 Query: 96 GFF 88 F Sbjct: 425 DPF 427 >gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlisea aurea] Length = 424 Score = 557 bits (1436), Expect = e-156 Identities = 264/418 (63%), Positives = 318/418 (76%), Gaps = 1/418 (0%) Frame = -1 Query: 1359 LLSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTK 1180 L+ LFLL Q SSSIS+LFD WC+EHGKTY SEEE++HRL VF NY+ + HN + Sbjct: 10 LIQLFLL--QVHPIVSSSISDLFDSWCQEHGKTYVSEEEREHRLGVFSENYDFIASHNAR 67 Query: 1179 ANSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASV 1000 AN S+TLSLNAFADLT EF +YLG PS DLLIR N + + S +P+S+ Sbjct: 68 ANYSYTLSLNAFADLTRSEFGGRYLGFSPSGHDLLIRKNRGSGSYRSRNY---SAVPSSI 124 Query: 999 DWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCG 820 DWRKKGAVT +KDQGSCGACWSFSATGA+EGINQI TGSLVSLSEQELIDCD +YN GC Sbjct: 125 DWRKKGAVTGIKDQGSCGACWSFSATGAIEGINQIVTGSLVSLSEQELIDCDHSYNQGCN 184 Query: 819 GGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQ 640 GGLMDYA+EFI+KNKGIDTEEDY Y+GRD CS+ KL + VVTIDSY D+P + E+ LL+ Sbjct: 185 GGLMDYAYEFILKNKGIDTEEDYSYKGRDASCSQNKLNKRVVTIDSYVDIPEKNEQMLLE 244 Query: 639 AVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGK 460 AVA+QPVSVGI G D FQ YS GIF+GPCST+LDHAVLIVGYDS++G DYWIVKNSWGK Sbjct: 245 AVASQPVSVGISGGDAPFQFYSQGIFTGPCSTSLDHAVLIVGYDSKNGKDYWIVKNSWGK 304 Query: 459 YWGMNGYIHMLRNSEDAEGVCGINTLASFPIK-XXXXXXXXXXXXXTKCDLFTYCGTDET 283 WGM+GY+++ RN+ + G+C IN +AS+P+K TKC LF+YC ET Sbjct: 305 SWGMDGYMYVQRNTGNQNGICEINMMASYPVKTNPNPSPSPSPPGPTKCSLFSYCSQGET 364 Query: 282 CCCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKP 109 CCC LG+C ++CC AESAVCC+D+ HCCP+DYP CDTA+++C K GNST++ P Sbjct: 365 CCCARRFLGLCMRYKCCGAESAVCCEDNVHCCPQDYPICDTAQSVCRKMSGNSTMAIP 422 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 556 bits (1434), Expect = e-156 Identities = 265/407 (65%), Positives = 319/407 (78%) Frame = -1 Query: 1359 LLSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTK 1180 LL L IS F + SS IS+LF+ W +EHGKTY S+E+K +R ++FE NYE V +HN++ Sbjct: 12 LLFFNLSISSFSS--SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQ 69 Query: 1179 ANSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASV 1000 NSS+TLSLNAFADLT+ EFK LGL SA +L+ R F + D V D+P S+ Sbjct: 70 GNSSYTLSLNAFADLTHHEFKASRLGL--SAFSTSGKLSRRNFPLH--DFV--GDVPISI 123 Query: 999 DWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCG 820 DWRKKGAV+ VKDQG+CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD +YN+GC Sbjct: 124 DWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCE 183 Query: 819 GGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQ 640 GGLMDYA++F+I+N GIDTEEDYPY+ R+ C+KEKLKRHVVTID Y DVP EK+LL+ Sbjct: 184 GGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLK 243 Query: 639 AVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGK 460 AVA QPVSVGICGS+ FQLYS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWG Sbjct: 244 AVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGT 303 Query: 459 YWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETC 280 +WG+NGY++MLRNS +++G+CGIN LASFP+K TKCDLFT CG ETC Sbjct: 304 HWGINGYMYMLRNSGNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETC 363 Query: 279 CCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLK 139 CC + G+CFSW+CCE +SAVCC D HCCP DYP CDT RN+CLK Sbjct: 364 CCTRRIFGLCFSWKCCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410 >gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135) [Arabidopsis thaliana] Length = 416 Score = 553 bits (1425), Expect = e-155 Identities = 263/413 (63%), Positives = 313/413 (75%), Gaps = 7/413 (1%) Frame = -1 Query: 1356 LSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKA 1177 L+ F L+ + +S ISELFD WC++HGKTY SEEE+Q R+++F+ N++ V +HN Sbjct: 9 LTFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLIT 68 Query: 1176 NSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVD 997 N++++LSLNAFADLT+ EFK LGL SA +++ A G L +P SVD Sbjct: 69 NATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVD 121 Query: 996 WRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCGG 817 WRKKGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YN+GC G Sbjct: 122 WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNG 181 Query: 816 GLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 637 GLMDYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V EK L++A Sbjct: 182 GLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEA 241 Query: 636 VATQPVSVGICGSDYKFQLYSG-------GIFSGPCSTALDHAVLIVGYDSQDGVDYWIV 478 VA QPVSVGICGS+ FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIV Sbjct: 242 VAAQPVSVGICGSERAFQLYSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIV 301 Query: 477 KNSWGKYWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYC 298 KNSWGK WGM+G++HM RN+E+++GVCGIN LAS+PIK TKC+LFTYC Sbjct: 302 KNSWGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYC 361 Query: 297 GTDETCCCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLK 139 + ETCCC L G+CFSW+CCE ESAVCC D HCCP DYP CDT R+LCLK Sbjct: 362 SSGETCCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 414 >ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 431 Score = 552 bits (1423), Expect = e-154 Identities = 268/420 (63%), Positives = 312/420 (74%) Frame = -1 Query: 1356 LSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKA 1177 L+LFLL+ + P +S++SELF+ WC EHGK+Y+S EEK +RL VF NYE V HN Sbjct: 9 LTLFLLLFR-PLSATSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLD 67 Query: 1176 NSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVD 997 NSS+TLSLN++ADLT+ EFK LG P+ N R P L D+P S+D Sbjct: 68 NSSYTLSLNSYADLTHHEFKVSRLGFSPALR------NFRPVLPQEPSLPR--DVPDSLD 119 Query: 996 WRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCGG 817 WRKKGAVTAVKDQGSCGACWSFSATGA+EGINQI TGSL+SLSEQELIDCD +YNSGCGG Sbjct: 120 WRKKGAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGG 179 Query: 816 GLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 637 GLMDYA++F+I N GIDTE DYPY+ RDG C K+KL+R+VVTID YAD+P E KLLQA Sbjct: 180 GLMDYAYQFVISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQA 239 Query: 636 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 457 VA QPVSVGICGS+ FQLYS GIFSGPCST+LDHAVLIVGY S++GVDYWIVKNSWGK Sbjct: 240 VAAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKS 299 Query: 456 WGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCC 277 WGM+GY+HM RNS ++EGVCGIN LAS+P K TKC + T C ETCC Sbjct: 300 WGMDGYMHMQRNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCC 359 Query: 276 CYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 97 C LG+C SW+CC SAVCC D HCCP DYP CDT RNLCLK+ N T ++ E + Sbjct: 360 CAKKFLGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENR 419 >ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max] Length = 439 Score = 550 bits (1417), Expect = e-154 Identities = 266/427 (62%), Positives = 319/427 (74%), Gaps = 5/427 (1%) Frame = -1 Query: 1356 LSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKA 1177 LSL LL + F ++S SELF+ WC+EH KTY+SEEEK +RL+VFE NY V +HN A Sbjct: 13 LSLILLFTLF-FLSASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNA 71 Query: 1176 N-----SSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDL 1012 N SS+TLSLNAFADLT+ EFK LGL L + R DL+ + Sbjct: 72 NNNNNNSSYTLSLNAFADLTHHEFKTTRLGL-----PLTLLRFKRPQNQQSRDLLH---I 123 Query: 1011 PASVDWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYN 832 P+ +DWR+ GAVT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCDT+YN Sbjct: 124 PSQIDWRQSGAVTPVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYN 183 Query: 831 SGCGGGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEK 652 SGCGGGLMD+A++F+I NKGIDTE+DYPY+ R CSK+KLKR VTI+ Y DVPP E+ Sbjct: 184 SGCGGGLMDFAYQFVIDNKGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPP-SEE 242 Query: 651 KLLQAVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKN 472 ++L+AVA+QPVSVGICGS+ +FQLYS GIF+GPCST LDHAVLIVGY S++GVDYWIVKN Sbjct: 243 EILKAVASQPVSVGICGSEREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKN 302 Query: 471 SWGKYWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGT 292 SWGKYWGMNGYIHM+RNS +++G+CGINTLAS+P+K +C+LFT+C Sbjct: 303 SWGKYWGMNGYIHMIRNSGNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSE 362 Query: 291 DETCCCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSK 112 ETCCC S LGICFSW+CC SAVCC D HCCP+DYP CDT R CLKR N T + Sbjct: 363 GETCCCAKSFLGICFSWKCCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTI 422 Query: 111 PFEKKGF 91 E + F Sbjct: 423 TSENQDF 429 >gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] Length = 517 Score = 548 bits (1413), Expect = e-153 Identities = 261/409 (63%), Positives = 309/409 (75%), Gaps = 2/409 (0%) Frame = -1 Query: 1359 LLSLFLLISQFPTCNSSSI--SELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHN 1186 L L LL+S + +S S+ S+LF+ WCE+HG++Y+SEEE+ +RL VFE N V +HN Sbjct: 6 LFLLSLLLSSHLSLSSPSLNSSQLFEAWCEKHGQSYSSEEERLYRLTVFEDNLAFVTQHN 65 Query: 1185 TKANSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPA 1006 NSS+TLSLNAFADLT+ EFK LG + L +L S+ L++ D+PA Sbjct: 66 NMGNSSYTLSLNAFADLTHHEFKSSRLGFSSALLSSLPKLGSK--------LLDLRDVPA 117 Query: 1005 SVDWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSG 826 S+DWRKKGAVT VKDQGSCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCDT+YN+G Sbjct: 118 SLDWRKKGAVTNVKDQGSCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNAG 177 Query: 825 CGGGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKL 646 C GGLMDYA++F+I N GIDTEEDYPY+ RD C KEKLKR VVTID Y DV P +L Sbjct: 178 CDGGLMDYAYQFVIDNHGIDTEEDYPYQARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQL 237 Query: 645 LQAVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSW 466 LQAV TQPVSVGICGS+ FQLYS GIF+GPCST+LDHAVLIVGYDS++GVDYWIVKNSW Sbjct: 238 LQAVVTQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYDSENGVDYWIVKNSW 297 Query: 465 GKYWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDE 286 GK WGM+GYIHM RN+ +++GVCGIN LAS+P K T+C F CG E Sbjct: 298 GKQWGMDGYIHMQRNTGNSQGVCGINMLASYPTKTSPNPPPSPSPGPTRCSFFAQCGEGE 357 Query: 285 TCCCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLK 139 TCCC W LG+CFSW+CC SAVCC D HCCP+DYP CDT RN+CLK Sbjct: 358 TCCCSWRFLGLCFSWKCCGLNSAVCCKDKIHCCPQDYPLCDTQRNVCLK 406