BLASTX nr result
ID: Mentha29_contig00008506
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha29_contig00008506 (1333 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU36745.1| hypothetical protein MIMGU_mgv1a006749mg [Mimulus... 580 e-163 dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] 513 e-143 ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr... 512 e-142 ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S... 511 e-142 ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab... 511 e-142 ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha... 511 e-142 gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A... 510 e-142 ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087... 505 e-140 ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a... 499 e-138 ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S... 499 e-138 ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr... 498 e-138 gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] 498 e-138 ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C... 497 e-138 ref|XP_002307688.2| cysteine protease family protein [Populus tr... 496 e-138 ref|XP_002510459.1| cysteine protease, putative [Ricinus communi... 496 e-137 ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine... 493 e-136 ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Caps... 491 e-136 ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C... 491 e-136 gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase... 489 e-135 gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlise... 486 e-135 >gb|EYU36745.1| hypothetical protein MIMGU_mgv1a006749mg [Mimulus guttatus] Length = 433 Score = 580 bits (1494), Expect = e-163 Identities = 265/366 (72%), Positives = 304/366 (83%), Gaps = 1/366 (0%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRK-SAIEGSDLVEESDLPASV 179 NSS+TLS+NAFADLTN EF+A YLGL PS D +IRLNSR SAI+G +L++ES++P+S+ Sbjct: 68 NSSYTLSVNAFADLTNHEFRANYLGLSPSKSDSVIRLNSRSASAIDGDNLIKESEIPSSL 127 Query: 180 DWRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCG 359 DWR KGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCD +YN GC Sbjct: 128 DWRNKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCN 187 Query: 360 GGLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQ 539 GGLMDYA++FIIKN+GIDTEEDY Y+GR C K K+ +HVVTIDSY D+P + EKKLLQ Sbjct: 188 GGLMDYAYDFIIKNKGIDTEEDYSYKGRSATCDKNKMNKHVVTIDSYVDIPEKDEKKLLQ 247 Query: 540 AVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGK 719 AVATQP+SVGICGSD FQLYSGGIF+GPCST+LDHAVLIVGYDS+DG DYWI+KNSWGK Sbjct: 248 AVATQPISVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGK 307 Query: 720 YWGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETC 899 WG+ GY+HM+RNSG EGVCGINTLAS+P+K KC++FTYC + ETC Sbjct: 308 SWGIKGYMHMVRNSGSEEGVCGINTLASYPVKSSTNPPPSPTPGPTKCNIFTYCSSGETC 367 Query: 900 CCHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEK 1079 CC LG+C SW CCEAESAVCCDDH HCCP DYP CDT +NLCLK+ GN+T+SKP K Sbjct: 368 CCARYFLGVCLSWNCCEAESAVCCDDHRHCCPHDYPVCDTKKNLCLKKSGNTTVSKPLGK 427 Query: 1080 KGFFTS 1097 K F S Sbjct: 428 KSFSAS 433 >dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] Length = 441 Score = 513 bits (1322), Expect = e-143 Identities = 242/359 (67%), Positives = 279/359 (77%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182 NSS+TLSLNAFADLT+ EFKA LGL +A LN +S + D V +D+PASVD Sbjct: 69 NSSYTLSLNAFADLTHHEFKASRLGLSSAAS---ASLNVDRSNRQIPDFV--ADVPASVD 123 Query: 183 WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362 WR GAVT VKDQG+CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD +YN GC G Sbjct: 124 WRKNGAVTQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEG 183 Query: 363 GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542 G+MDYAF+F+I N GIDTEEDYPY+GRD C+KEKLKRHVVTID Y DVP EK+LL+A Sbjct: 184 GIMDYAFQFVIDNHGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKA 243 Query: 543 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722 VA QPVSVGICGS+ FQLYS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWG Y Sbjct: 244 VANQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSY 303 Query: 723 WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902 WGM+GY+HM RNSG + G+CGIN LAS+P K +CDLFT+CG ETCC Sbjct: 304 WGMDGYMHMQRNSGSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCC 363 Query: 903 CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEK 1079 C + GIC SW+CCE +SAVCC D HCCPRDYP CDT RN+CLK GN+T + F K Sbjct: 364 CVHHIFGICLSWKCCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAK 422 >ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] gi|557095297|gb|ESQ35879.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] Length = 444 Score = 512 bits (1319), Expect = e-142 Identities = 242/360 (67%), Positives = 285/360 (79%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182 NS+++LSLNAFADLT+ EFKA LGL + L+ +++ ++ S+ V +P SVD Sbjct: 76 NSTYSLSLNAFADLTHHEFKASRLGLSAPSPSLM----AKEQSLGVSERVRVK-VPDSVD 130 Query: 183 WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362 WR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YNAGC G Sbjct: 131 WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNG 190 Query: 363 GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542 GLMDYAFEF+IKN GIDTE+DYPY+ +DG C K+KLK+ VVTIDSYA V EK L++A Sbjct: 191 GLMDYAFEFVIKNHGIDTEKDYPYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEA 250 Query: 543 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722 VA+QPVSVGICGS+ FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK Sbjct: 251 VASQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKS 310 Query: 723 WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902 WGM+G++HM RN+G++EGVCGIN LAS+PIK KC+LFTYC + ETCC Sbjct: 311 WGMDGFMHMQRNTGNSEGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCC 370 Query: 903 CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082 C +L G+CFSW+CCE ESAVCC D HCCPRDYP CDT ++LCLK+ GN T KPF KK Sbjct: 371 CARTLFGLCFSWKCCELESAVCCKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPFWKK 430 >ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum] Length = 439 Score = 511 bits (1317), Expect = e-142 Identities = 238/362 (65%), Positives = 282/362 (77%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182 NSS+TL LNA++DLT+ EF+ +LGL SA+D IRL R S + ++ + D P+S+D Sbjct: 68 NSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSETGVLSDVDAPSSLD 126 Query: 183 WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362 WR KGAVT VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD +YN GCGG Sbjct: 127 WREKGAVTDVKNQGSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRSYNEGCGG 186 Query: 363 GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542 GLMDYAFEF+IKN GIDTE+DYP+R R+G C+K KL+RHVVTID Y D+P E KLL+A Sbjct: 187 GLMDYAFEFVIKNGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQNDEDKLLKA 246 Query: 543 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722 VATQPVSVGICGS FQ YS GIF+GPCSTALDHAVLIVGY S++GVDYWI+KNSWG Sbjct: 247 VATQPVSVGICGSARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTS 306 Query: 723 WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902 WG+NGYIHM RNSG+ EG+CGIN LAS+P K KC +FT CG ETCC Sbjct: 307 WGINGYIHMQRNSGNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSCGQGETCC 366 Query: 903 CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082 C LGIC SW+CC +SAVCC D HCCP+DYP CDT+RNLCLKR+ N+T+ + +K+ Sbjct: 367 CGSKFLGICLSWKCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATIVQQPQKE 426 Query: 1083 GF 1088 F Sbjct: 427 AF 428 >ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] Length = 439 Score = 511 bits (1316), Expect = e-142 Identities = 244/362 (67%), Positives = 280/362 (77%), Gaps = 2/362 (0%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182 N++++LSLNAFADLT+ EFKA LGL SA L++ A +G L + +P SVD Sbjct: 71 NATYSLSLNAFADLTHHEFKASRLGLSVSASSLIM-------ASKGQSLGGNAKVPDSVD 123 Query: 183 WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362 WR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YNAGC G Sbjct: 124 WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNG 183 Query: 363 GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542 GLMDYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V EK L +A Sbjct: 184 GLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREA 243 Query: 543 VATQPVSVGICGSDYKFQLYS--GGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWG 716 VA QPVSVGICGS+ FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWG Sbjct: 244 VAAQPVSVGICGSERAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWG 303 Query: 717 KYWGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDET 896 K WGM+G++HM RN+G++EG+CGIN LAS+PIK KC+LFTYC ET Sbjct: 304 KSWGMDGFMHMQRNTGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGET 363 Query: 897 CCCHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFE 1076 CCC +L G+CFSW+CCE ESAVCC D HCCP DYP CDT R+LCLK+ GN T KPF Sbjct: 364 CCCARNLFGLCFSWKCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFW 423 Query: 1077 KK 1082 KK Sbjct: 424 KK 425 >ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana] gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana] gi|332190386|gb|AEE28507.1| papain-like cysteine peptidase [Arabidopsis thaliana] Length = 437 Score = 511 bits (1315), Expect = e-142 Identities = 242/360 (67%), Positives = 279/360 (77%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182 N++++LSLNAFADLT+ EFKA LGL SA +++ A +G L +P SVD Sbjct: 71 NATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVD 123 Query: 183 WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362 WR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YNAGC G Sbjct: 124 WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNG 183 Query: 363 GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542 GLMDYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V EK L++A Sbjct: 184 GLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEA 243 Query: 543 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722 VA QPVSVGICGS+ FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK Sbjct: 244 VAAQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKS 303 Query: 723 WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902 WGM+G++HM RN+ +++GVCGIN LAS+PIK KC+LFTYC + ETCC Sbjct: 304 WGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCC 363 Query: 903 CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082 C L G+CFSW+CCE ESAVCC D HCCP DYP CDT R+LCLK+ GN T KPF KK Sbjct: 364 CARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKK 423 >gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] Length = 437 Score = 510 bits (1313), Expect = e-142 Identities = 242/360 (67%), Positives = 279/360 (77%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182 N++++LSLNAFADLT+ EFKA LGL SA +++ A +G L +P SVD Sbjct: 71 NATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVD 123 Query: 183 WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362 WR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YNAGC G Sbjct: 124 WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNG 183 Query: 363 GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542 GLMDYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V EK L++A Sbjct: 184 GLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEA 243 Query: 543 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722 VA QPVSVGICGS+ FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK Sbjct: 244 VAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKS 303 Query: 723 WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902 WGM+G++HM RN+ +++GVCGIN LAS+PIK KC+LFTYC + ETCC Sbjct: 304 WGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCC 363 Query: 903 CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082 C L G+CFSW+CCE ESAVCC D HCCP DYP CDT R+LCLK+ GN T KPF KK Sbjct: 364 CARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKK 423 >ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao] Length = 438 Score = 505 bits (1300), Expect = e-140 Identities = 237/360 (65%), Positives = 279/360 (77%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182 NSS++L+LNAFADLT+ EFKA LGL +A + + ++ LV D+PAS+D Sbjct: 69 NSSYSLALNAFADLTHHEFKASRLGLSAAA------IEGSRPNLQLPGLVR--DIPASMD 120 Query: 183 WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362 WRTKGAVT VKDQGSCGACWSFSATGA+EGIN+I TG+LVSLSEQEL+DCD +YN+GC G Sbjct: 121 WRTKGAVTKVKDQGSCGACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEG 180 Query: 363 GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542 GLMDYA++F+I N GID EEDYPY GR+ C+KEK KR VVTID YA VP E LLQA Sbjct: 181 GLMDYAYQFVIDNHGIDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQA 240 Query: 543 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722 VA QPVSVGICGS+ FQLYS GIF+GPCS++LDHAVLIVGY S++GVDYWIVKNSWG Sbjct: 241 VAKQPVSVGICGSERAFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTR 300 Query: 723 WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902 WGMNGYIHMLRNSGD++G+CGIN LAS+P K KCDLFTYC ETCC Sbjct: 301 WGMNGYIHMLRNSGDSKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCC 360 Query: 903 CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082 C + GICFSW+CCE +SAVCC D+ HCCP DYP CDT ++ CLKR+GN+T + FEK+ Sbjct: 361 CTHRIFGICFSWKCCELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFEKR 420 >ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum] Length = 436 Score = 499 bits (1285), Expect = e-138 Identities = 238/363 (65%), Positives = 272/363 (74%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182 NSS+TLSLNAFADLT+ EFKA LGL PS+ L + N + D ++ +P+ +D Sbjct: 69 NSSYTLSLNAFADLTHHEFKATRLGLPPSSL-LRFKFNRFQDQQRSDDFLQ---VPSEID 124 Query: 183 WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362 WR GAV+ VKDQGSCGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCDTTYN+GC G Sbjct: 125 WRKNGAVSIVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDG 184 Query: 363 GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542 GLMDYA++FII N GIDTEEDYPY+ R C K+KLKR VVTID Y DVPP EKKLL+A Sbjct: 185 GLMDYAYQFIIDNNGIDTEEDYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKA 244 Query: 543 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722 VA QPVSVGICGS FQLYS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWGKY Sbjct: 245 VAVQPVSVGICGSARAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKY 304 Query: 723 WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902 WGMNGYIHMLRN+ + G+CGIN LAS+P K KC+LFTYC ETCC Sbjct: 305 WGMNGYIHMLRNTDSSAGLCGINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCC 364 Query: 903 CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082 C LGICFSW+CC SAVCC D HCCP DYP CD + CLKRI N T+ +K+ Sbjct: 365 CAKKFLGICFSWKCCGVTSAVCCKDKRHCCPLDYPVCDASNGQCLKRIANGTILMTSDKE 424 Query: 1083 GFF 1091 F Sbjct: 425 DPF 427 >ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum] Length = 439 Score = 499 bits (1285), Expect = e-138 Identities = 232/362 (64%), Positives = 276/362 (76%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182 NSS+TL LNA++DLT+ EF+ +LGL SA+D IRL R S + ++ + D P+S+D Sbjct: 68 NSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSAAGVLSDVDAPSSLD 126 Query: 183 WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362 WR KGAVT VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD +YN GCGG Sbjct: 127 WRDKGAVTNVKNQGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRSYNQGCGG 186 Query: 363 GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542 GLMDYAFEF+IKN GIDTE+DYP+R ++G C+K KL+R VVTID Y D+P E KLL+A Sbjct: 187 GLMDYAFEFVIKNGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQNDEDKLLKA 246 Query: 543 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722 VATQPVSVGICGS FQ YS GIF+GPC T LDHAVLIVGY S++G DYWI+KNSWG Sbjct: 247 VATQPVSVGICGSARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWIIKNSWGTS 306 Query: 723 WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902 WG+NGYIHM RNSG+ EG+CG+N LAS+P K KC FT CG ETCC Sbjct: 307 WGINGYIHMQRNSGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSCGQGETCC 366 Query: 903 CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082 C LGIC SW+CC +SAVCC D HCCP DYP CDT+RNLCLKR+ N+T+ + +K+ Sbjct: 367 CGLKFLGICLSWKCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATIVQQPQKE 426 Query: 1083 GF 1088 F Sbjct: 427 PF 428 >ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] gi|557537201|gb|ESR48319.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] Length = 441 Score = 498 bits (1282), Expect = e-138 Identities = 231/362 (63%), Positives = 280/362 (77%), Gaps = 1/362 (0%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182 NSS TLSLNAFADLT+QEFKA +LG ++ D R R ++++ + D+PAS+D Sbjct: 68 NSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR---RNASVQSPGTLR--DVPASID 122 Query: 183 WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362 WR KGAVT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD +YN+GCGG Sbjct: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGG 182 Query: 363 GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542 GLMDYA++F+IKN GIDTE+DYPYRG+ G+C+K+KL RH+VTID Y DVP EK+LLQA Sbjct: 183 GLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQA 242 Query: 543 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722 V QPVSVGICGS+ FQLYS GIF+GPCST+LDHAVLIVGYDS++GVDYWI+KNSWG+ Sbjct: 243 VVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRS 302 Query: 723 WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902 WGMNGY+HM RN+G++ G+CGIN LAS+P K +C L TYC ETCC Sbjct: 303 WGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCC 362 Query: 903 CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRI-GNSTLSKPFEK 1079 C S+LGIC SW+CC SAVCC DH +CCP +YP CD+ R+ CL R GN T ++ E Sbjct: 363 CGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAIEM 422 Query: 1080 KG 1085 +G Sbjct: 423 RG 424 >gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] Length = 517 Score = 498 bits (1281), Expect = e-138 Identities = 233/346 (67%), Positives = 267/346 (77%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182 NSS+TLSLNAFADLT+ EFK+ LG + L +L GS L++ D+PAS+D Sbjct: 69 NSSYTLSLNAFADLTHHEFKSSRLGFSSALLSSLPKL--------GSKLLDLRDVPASLD 120 Query: 183 WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362 WR KGAVT VKDQGSCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCDT+YNAGC G Sbjct: 121 WRKKGAVTNVKDQGSCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNAGCDG 180 Query: 363 GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542 GLMDYA++F+I N GIDTEEDYPY+ RD C KEKLKR VVTID Y DV P +LLQA Sbjct: 181 GLMDYAYQFVIDNHGIDTEEDYPYQARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQLLQA 240 Query: 543 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722 V TQPVSVGICGS+ FQLYS GIF+GPCST+LDHAVLIVGYDS++GVDYWIVKNSWGK Sbjct: 241 VVTQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYDSENGVDYWIVKNSWGKQ 300 Query: 723 WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902 WGM+GYIHM RN+G+++GVCGIN LAS+P K +C F CG ETCC Sbjct: 301 WGMDGYIHMQRNTGNSQGVCGINMLASYPTKTSPNPPPSPSPGPTRCSFFAQCGEGETCC 360 Query: 903 CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLK 1040 C W LG+CFSW+CC SAVCC D HCCP+DYP CDT RN+CLK Sbjct: 361 CSWRFLGLCFSWKCCGLNSAVCCKDKIHCCPQDYPLCDTQRNVCLK 406 >ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis] Length = 441 Score = 497 bits (1280), Expect = e-138 Identities = 232/362 (64%), Positives = 280/362 (77%), Gaps = 1/362 (0%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182 NSS TLSLNAFADLT+QEFKA +LG ++ D R N+ S +L D+PAS+D Sbjct: 68 NSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA--SVQSPGNL---RDVPASID 122 Query: 183 WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362 WR KGAVT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD +YN+GCGG Sbjct: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGG 182 Query: 363 GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542 GLMDYA++F+IKN GIDTE+DYPYRG+ G+C+K+KL RH+VTID Y DVP EK+LLQA Sbjct: 183 GLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQA 242 Query: 543 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722 V QPVSVGICGS+ FQLYS GIF+GPCST+LDHAVLI+GYDS++GVDYWI+KNSWG+ Sbjct: 243 VVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRS 302 Query: 723 WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902 WGMNGY+HM RN+G++ G+CGIN LAS+P K +C L TYC ETCC Sbjct: 303 WGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCC 362 Query: 903 CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRI-GNSTLSKPFEK 1079 C S+LGIC SW+CC SAVCC DH +CCP +YP CD+ R+ CL R+ GN T ++ E Sbjct: 363 CGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIEM 422 Query: 1080 KG 1085 +G Sbjct: 423 RG 424 >ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa] gi|550339725|gb|EEE94684.2| cysteine protease family protein [Populus trichocarpa] Length = 436 Score = 496 bits (1277), Expect = e-138 Identities = 232/360 (64%), Positives = 272/360 (75%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182 NSS++L+LNAFADLT+ EFK LGL + +L R N + + G D+PAS+D Sbjct: 68 NSSYSLALNAFADLTHHEFKTSRLGLSAAPLNLAHR-NLEITGVVG-------DIPASID 119 Query: 183 WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362 WR KG VT VKDQGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELI+CD +YN GCGG Sbjct: 120 WRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGG 179 Query: 363 GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542 GLMDYAF+F+I N GIDTEEDYPYR RDG C+K+++KR VVTID Y DVP EK+LLQA Sbjct: 180 GLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQA 239 Query: 543 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722 VA QPVSVGICGS+ FQ+YS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWG Sbjct: 240 VAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTG 299 Query: 723 WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902 WGM GY+HM RNSG+++GVCGIN LAS+P+K KC+L TYC ETCC Sbjct: 300 WGMRGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCC 359 Query: 903 CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082 C GIC SW+CC +SAVCC D HCCP DYP CDT +N+C KR GN+T + E K Sbjct: 360 CARKFFGICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGK 419 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 496 bits (1276), Expect = e-137 Identities = 233/346 (67%), Positives = 274/346 (79%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182 NSS+TLSLNAFADLT+ EFKA LGL SA +L+ R + D V D+P S+D Sbjct: 71 NSSYTLSLNAFADLTHHEFKASRLGL--SAFSTSGKLSRRNFPLH--DFV--GDVPISID 124 Query: 183 WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362 WR KGAV+ VKDQG+CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD +YN GC G Sbjct: 125 WRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEG 184 Query: 363 GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542 GLMDYA++F+I+N GIDTEEDYPY+ R+ C+KEKLKRHVVTID Y DVP EK+LL+A Sbjct: 185 GLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKA 244 Query: 543 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722 VA QPVSVGICGS+ FQLYS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWG + Sbjct: 245 VAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTH 304 Query: 723 WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902 WG+NGY++MLRNSG+++G+CGIN LASFP+K KCDLFT CG ETCC Sbjct: 305 WGINGYMYMLRNSGNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCC 364 Query: 903 CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLK 1040 C + G+CFSW+CCE +SAVCC D HCCP DYP CDT RN+CLK Sbjct: 365 CTRRIFGLCFSWKCCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410 >ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max] Length = 439 Score = 493 bits (1268), Expect = e-136 Identities = 231/362 (63%), Positives = 277/362 (76%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182 NSS+TLSLNAFADLT+ EFK LGL L + R + DL+ +P+ +D Sbjct: 77 NSSYTLSLNAFADLTHHEFKTTRLGL-----PLTLLRFKRPQNQQSRDLLH---IPSQID 128 Query: 183 WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362 WR GAVT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCDT+YN+GCGG Sbjct: 129 WRQSGAVTPVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGG 188 Query: 363 GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542 GLMD+A++F+I N+GIDTE+DYPY+ R CSK+KLKR VTI+ Y DVPP E+++L+A Sbjct: 189 GLMDFAYQFVIDNKGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPP-SEEEILKA 247 Query: 543 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722 VA+QPVSVGICGS+ +FQLYS GIF+GPCST LDHAVLIVGY S++GVDYWIVKNSWGKY Sbjct: 248 VASQPVSVGICGSEREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKY 307 Query: 723 WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902 WGMNGYIHM+RNSG+++G+CGINTLAS+P+K +C+LFT+C ETCC Sbjct: 308 WGMNGYIHMIRNSGNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCC 367 Query: 903 CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082 C S LGICFSW+CC SAVCC D HCCP+DYP CDT R CLKR N T + E + Sbjct: 368 CAKSFLGICFSWKCCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQ 427 Query: 1083 GF 1088 F Sbjct: 428 DF 429 >ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Capsella rubella] gi|482576142|gb|EOA40329.1| hypothetical protein CARUB_v10009056mg [Capsella rubella] Length = 467 Score = 491 bits (1265), Expect = e-136 Identities = 238/388 (61%), Positives = 278/388 (71%), Gaps = 28/388 (7%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182 N++++LSLNAFADL + EFK LGL SA +++ A +G L +P S+D Sbjct: 73 NATYSLSLNAFADLNHSEFKTSRLGLSVSAPSVIM-------ASKGKSLGGSVKVPDSLD 125 Query: 183 WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362 WR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YN GC G Sbjct: 126 WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNDGCNG 185 Query: 363 GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542 GLMDYAFEF+IKN+GIDTE+DYPY+ RDG C K+KLK+ VV+IDSYA V P EK LL+A Sbjct: 186 GLMDYAFEFVIKNKGIDTEKDYPYQERDGTCKKDKLKQRVVSIDSYAGVKPSDEKALLEA 245 Query: 543 VATQPVSVGICGSDYKFQLYSG----------------------------GIFSGPCSTA 638 VA QPVSVGICGS+ FQLYS GIFSGPCST+ Sbjct: 246 VAAQPVSVGICGSERAFQLYSSVSFKIRDTSILSSECSTFPCLKLYLMMQGIFSGPCSTS 305 Query: 639 LDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIHMLRNSGDAEGVCGINTLASFPIKX 818 LDHAVLIVGY SQ+GVDYWIVKNSWGK WGM+G++HM RN+G+++G+CGIN LAS+PIK Sbjct: 306 LDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSQGICGINMLASYPIKT 365 Query: 819 XXXXXXXXXXXXXKCDLFTYCGTDETCCCHWSLLGICFSWRCCEAESAVCCDDHEHCCPR 998 KC+LFTYC ETCCC +L G+C SW+CCE ESAVCC D HCCP Sbjct: 366 HPNPPPPSPPGPTKCNLFTYCSAAETCCCARNLFGLCLSWKCCEIESAVCCKDGRHCCPH 425 Query: 999 DYPTCDTARNLCLKRIGNSTLSKPFEKK 1082 DYP CDT R+LCLK+ GN T KPF KK Sbjct: 426 DYPVCDTTRSLCLKKTGNFTAIKPFWKK 453 >ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 431 Score = 491 bits (1263), Expect = e-136 Identities = 232/360 (64%), Positives = 268/360 (74%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182 NSS+TLSLN++ADLT+ EFK LG P+ + L S D+P S+D Sbjct: 68 NSSYTLSLNSYADLTHHEFKVSRLGFSPALRNFRPVLPQEPSL--------PRDVPDSLD 119 Query: 183 WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362 WR KGAVTAVKDQGSCGACWSFSATGA+EGINQI TGSL+SLSEQELIDCD +YN+GCGG Sbjct: 120 WRKKGAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGG 179 Query: 363 GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542 GLMDYA++F+I N GIDTE DYPY+ RDG C K+KL+R+VVTID YAD+P E KLLQA Sbjct: 180 GLMDYAYQFVISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQA 239 Query: 543 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722 VA QPVSVGICGS+ FQLYS GIFSGPCST+LDHAVLIVGY S++GVDYWIVKNSWGK Sbjct: 240 VAAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKS 299 Query: 723 WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902 WGM+GY+HM RNSG++EGVCGIN LAS+P K KC + T C ETCC Sbjct: 300 WGMDGYMHMQRNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCC 359 Query: 903 CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082 C LG+C SW+CC SAVCC D HCCP DYP CDT RNLCLK+ N T ++ E + Sbjct: 360 CAKKFLGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENR 419 >gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135) [Arabidopsis thaliana] Length = 416 Score = 489 bits (1259), Expect = e-135 Identities = 234/353 (66%), Positives = 270/353 (76%), Gaps = 7/353 (1%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182 N++++LSLNAFADLT+ EFKA LGL SA +++ A +G L +P SVD Sbjct: 69 NATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVD 121 Query: 183 WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362 WR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YNAGC G Sbjct: 122 WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNG 181 Query: 363 GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542 GLMDYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V EK L++A Sbjct: 182 GLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEA 241 Query: 543 VATQPVSVGICGSDYKFQLYSG-------GIFSGPCSTALDHAVLIVGYDSQDGVDYWIV 701 VA QPVSVGICGS+ FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIV Sbjct: 242 VAAQPVSVGICGSERAFQLYSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIV 301 Query: 702 KNSWGKYWGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYC 881 KNSWGK WGM+G++HM RN+ +++GVCGIN LAS+PIK KC+LFTYC Sbjct: 302 KNSWGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYC 361 Query: 882 GTDETCCCHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLK 1040 + ETCCC L G+CFSW+CCE ESAVCC D HCCP DYP CDT R+LCLK Sbjct: 362 SSGETCCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 414 >gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlisea aurea] Length = 424 Score = 486 bits (1251), Expect = e-135 Identities = 226/357 (63%), Positives = 273/357 (76%), Gaps = 1/357 (0%) Frame = +3 Query: 3 NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182 N S+TLSLNAFADLT EF +YLG PS DLLIR N + + S +P+S+D Sbjct: 69 NYSYTLSLNAFADLTRSEFGGRYLGFSPSGHDLLIRKNRGSGSYRSRNY---SAVPSSID 125 Query: 183 WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362 WR KGAVT +KDQGSCGACWSFSATGA+EGINQI TGSLVSLSEQELIDCD +YN GC G Sbjct: 126 WRKKGAVTGIKDQGSCGACWSFSATGAIEGINQIVTGSLVSLSEQELIDCDHSYNQGCNG 185 Query: 363 GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542 GLMDYA+EFI+KN+GIDTEEDY Y+GRD CS+ KL + VVTIDSY D+P + E+ LL+A Sbjct: 186 GLMDYAYEFILKNKGIDTEEDYSYKGRDASCSQNKLNKRVVTIDSYVDIPEKNEQMLLEA 245 Query: 543 VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722 VA+QPVSVGI G D FQ YS GIF+GPCST+LDHAVLIVGYDS++G DYWIVKNSWGK Sbjct: 246 VASQPVSVGISGGDAPFQFYSQGIFTGPCSTSLDHAVLIVGYDSKNGKDYWIVKNSWGKS 305 Query: 723 WGMNGYIHMLRNSGDAEGVCGINTLASFPIK-XXXXXXXXXXXXXXKCDLFTYCGTDETC 899 WGM+GY+++ RN+G+ G+C IN +AS+P+K KC LF+YC ETC Sbjct: 306 WGMDGYMYVQRNTGNQNGICEINMMASYPVKTNPNPSPSPSPPGPTKCSLFSYCSQGETC 365 Query: 900 CCHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKP 1070 CC LG+C ++CC AESAVCC+D+ HCCP+DYP CDTA+++C K GNST++ P Sbjct: 366 CCARRFLGLCMRYKCCGAESAVCCEDNVHCCPQDYPICDTAQSVCRKMSGNSTMAIP 422