BLASTX nr result
ID: Sinomenium21_contig00001187
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00001187 (1647 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr...   595   e-167
ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C...   594   e-167
ref|XP_002307688.2| cysteine protease family protein [Populus tr...   594   e-167
dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          587   e-165
ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S...   577   e-162
ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087...   577   e-162
ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S...   574   e-161
ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr...   573   e-161
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   572   e-160
gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]                  572   e-160
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   572   e-160
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   571   e-160
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   569   e-159
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   563   e-158
ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis ...   561   e-157
ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [F...   560   e-157
gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase...   548   e-153
ref|XP_006838704.1| hypothetical protein AMTR_s00002p00249780 [A...   547   e-153
ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prun...
546 e-153 gb|EYU36745.1| hypothetical protein MIMGU_mgv1a006749mg [Mimulus... 545 e-152 >ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] gi|557537201|gb|ESR48319.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] Length = 441 Score = 595 bits (1533), Expect = e-167 Identities = 279/440 (63%), Positives = 333/440 (75%), Gaps = 1/440 (0%) Frame = -3 Query: 1561 IDSSRVILLLFLVSVNFHLYLCSPTSDLFDSWCKQHGKSYSSEEERLYRLTVFEDNLAFV 1382 ++S LL L+ + L CS ++LF++WCKQHGK YSSE+E+ RL +FEDN AFV Sbjct: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFV 60 Query: 1381 NQHNSLTNSTYTLGVNAFADLTHHEFRVXXXXXXXXXXXXXXXXXSESTAGAGFVGDVPK 1202 QHN++ NS++TL +NAFADLTH EF+ + S G + DVP Sbjct: 61 TQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA-SVQSPGTLRDVPA 119 Query: 1201 SIDWRSKGAVTPVKDQGRCGACWSFSATGAIEGINQIVTGSLVSVSEQELIDCDRSYNSG 1022 SIDWR KGAVT VKDQ CGACW+FSATGAIEGIN+IVTGSLVS+SEQELIDCDRSYNSG Sbjct: 120 SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSG 179 Query: 1021 CGGGLMDYAFQFVVKNHGIDTEKDYPYQETDRTCNRNKLKRLVVTIDGFIDVPSYNEEEI 842 CGGGLMDYA+QFV+KNHGIDTEKDYPY+ CN+ KL R +VTIDG+ DVP NE+++ Sbjct: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239 Query: 841 LKAVASQPVSVGICGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSW 662 L+AV +QPVSVGICGS+R FQLYS GIF+GPCSTSLDHAVLIVGY SENGVDYWI+KNSW Sbjct: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSW 299 Query: 661 GKNWGMNGYMHMQRNSGNKEGVCGINMLASYPIKTXXXXXXXXXXXPTQCSLLSYCGSGE 482 G++WGMNGYMHMQRN+GN G+CGINMLASYP KT PT+CSLL+YC +GE Sbjct: 300 GRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGE 359 Query: 481 TCCCGWRLLGICLSWKCCEAESAVCCNDHVSCCPSHYPVCDTERKQCL-QAAGNSTMVKG 305 TCCCG +LGICLSWKCC SAVCC+DH CCPS+YP+CD+ R QCL + GN T + Sbjct: 360 TCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEA 419 Query: 304 LEKKGSIWKFGGLNPLFEAW 245 +E +GS WKFG + + W Sbjct: 420 IEMRGSSWKFGSWSSFIDVW 439 >ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus 
sinensis] Length = 441 Score = 594 bits (1532), Expect = e-167 Identities = 279/434 (64%), Positives = 332/434 (76%), Gaps = 1/434 (0%) Frame = -3 Query: 1543 ILLLFLVSVNFHLYLCSPTSDLFDSWCKQHGKSYSSEEERLYRLTVFEDNLAFVNQHNSL 1364 ILLL + N+ CS ++LF++WCKQHGK+YSSE+E+ RL +FEDN AFV QHN++ Sbjct: 11 ILLLSSLPPNY----CSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM 66 Query: 1363 TNSTYTLGVNAFADLTHHEFRVXXXXXXXXXXXXXXXXXSESTAGAGFVGDVPKSIDWRS 1184 NS++TL +NAFADLTH EF+ + S G + DVP SIDWR Sbjct: 67 GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA-SVQSPGNLRDVPASIDWRK 125 Query: 1183 KGAVTPVKDQGRCGACWSFSATGAIEGINQIVTGSLVSVSEQELIDCDRSYNSGCGGGLM 1004 KGAVT VKDQ CGACW+FSATGAIEGIN+IVTGSLVS+SEQELIDCDRSYNSGCGGGLM Sbjct: 126 KGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLM 185 Query: 1003 DYAFQFVVKNHGIDTEKDYPYQETDRTCNRNKLKRLVVTIDGFIDVPSYNEEEILKAVAS 824 DYA+QFV+KNHGIDTEKDYPY+ CN+ KL R +VTIDG+ DVP NE+++L+AV + Sbjct: 186 DYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA 245 Query: 823 QPVSVGICGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKNWGM 644 QPVSVGICGS+R FQLYS GIF+GPCSTSLDHAVLI+GY SENGVDYWI+KNSWG++WGM Sbjct: 246 QPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRSWGM 305 Query: 643 NGYMHMQRNSGNKEGVCGINMLASYPIKTXXXXXXXXXXXPTQCSLLSYCGSGETCCCGW 464 NGYMHMQRN+GN G+CGINMLASYP KT PT+CSLL+YC GETCCCG Sbjct: 306 NGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCCCGS 365 Query: 463 RLLGICLSWKCCEAESAVCCNDHVSCCPSHYPVCDTERKQCL-QAAGNSTMVKGLEKKGS 287 +LGICLSWKCC SAVCC+DH CCPS+YP+CD+ R QCL + GN T + +E +GS Sbjct: 366 SILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIEMRGS 425 Query: 286 IWKFGGLNPLFEAW 245 WKFG + +AW Sbjct: 426 SWKFGSWSSFIDAW 439 >ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa] gi|550339725|gb|EEE94684.2| cysteine protease family protein [Populus trichocarpa] Length = 436 Score = 594 bits (1531), Expect = e-167 Identities = 277/434 (63%), Positives = 332/434 (76%) Frame = -3 Query: 1546 
VILLLFLVSVNFHLYLCSPTSDLFDSWCKQHGKSYSSEEERLYRLTVFEDNLAFVNQHNS 1367 + L L+SV S S LF++WCK+HGKSY+S+EER +RL VFEDN FV +HNS Sbjct: 6 IFALTLLISVLSPSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNS 65 Query: 1366 LTNSTYTLGVNAFADLTHHEFRVXXXXXXXXXXXXXXXXXSESTAGAGFVGDVPKSIDWR 1187 NS+Y+L +NAFADLTHHEF+ + G VGD+P SIDWR Sbjct: 66 KGNSSYSLALNAFADLTHHEFKTSRLGLSAAPLNLAH----RNLEITGVVGDIPASIDWR 121 Query: 1186 SKGAVTPVKDQGRCGACWSFSATGAIEGINQIVTGSLVSVSEQELIDCDRSYNSGCGGGL 1007 +KG VT VKDQG CGACWSFSATGAIEGIN+IVTGSLVS+SEQELI+CD+SYN GCGGGL Sbjct: 122 NKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGL 181 Query: 1006 MDYAFQFVVKNHGIDTEKDYPYQETDRTCNRNKLKRLVVTIDGFIDVPSYNEEEILKAVA 827 MDYAFQFV+ NHGIDTE+DYPY+ D TCN++++KR VVTID ++DVP NE+++L+AVA Sbjct: 182 MDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVA 241 Query: 826 SQPVSVGICGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKNWG 647 +QPVSVGICGS+R FQ+YSKGIF+GPCSTSLDHAVLIVGYGSENGVDYWI+KNSWG WG Sbjct: 242 AQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWG 301 Query: 646 MNGYMHMQRNSGNKEGVCGINMLASYPIKTXXXXXXXXXXXPTQCSLLSYCGSGETCCCG 467 M GYMHMQRNSGN +GVCGINMLASYP+KT PT+C+LL+YC +GETCCC Sbjct: 302 MRGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCA 361 Query: 466 WRLLGICLSWKCCEAESAVCCNDHVSCCPSHYPVCDTERKQCLQAAGNSTMVKGLEKKGS 287 + GIC+SWKCC +SAVCC D + CCP YPVCDT++ C + AGN+T ++ +E K S Sbjct: 362 RKFFGICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGKTS 421 Query: 286 IWKFGGLNPLFEAW 245 KFG N L EAW Sbjct: 422 -GKFGSWNSLPEAW 434 >dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] Length = 441 Score = 587 bits (1512), Expect = e-165 Identities = 279/443 (62%), Positives = 333/443 (75%), Gaps = 4/443 (0%) Frame = -3 Query: 1561 IDSSRVILLLFLVSVNFHLYLCSPTSD----LFDSWCKQHGKSYSSEEERLYRLTVFEDN 1394 ++S+ + + FL+S +L+L S +S LF++WC+QHGK+Y+S+EE+L+RL VF+DN Sbjct: 1 MNSNCALFVAFLLS---YLFLFSSSSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDN 57 Query: 1393 LAFVNQHNSLTNSTYTLGVNAFADLTHHEFRVXXXXXXXXXXXXXXXXXSESTAGAGFVG 1214 FV +HNS NS+YTL +NAFADLTHHEF+ S 
FV Sbjct: 58 YDFVTEHNSQGNSSYTLSLNAFADLTHHEFKASRLGLSSAASASLNVDRSNRQI-PDFVA 116 Query: 1213 DVPKSIDWRSKGAVTPVKDQGRCGACWSFSATGAIEGINQIVTGSLVSVSEQELIDCDRS 1034 DVP S+DWR GAVT VKDQG CGACWSFSATGAIEGIN+IVTGSLVS+SEQEL+DCD+S Sbjct: 117 DVPASVDWRKNGAVTQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKS 176 Query: 1033 YNSGCGGGLMDYAFQFVVKNHGIDTEKDYPYQETDRTCNRNKLKRLVVTIDGFIDVPSYN 854 YN+GC GG+MDYAFQFV+ NHGIDTE+DYPYQ DR+CN+ KLKR VVTIDG++DVP N Sbjct: 177 YNNGCEGGIMDYAFQFVIDNHGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNN 236 Query: 853 EEEILKAVASQPVSVGICGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIL 674 E+E+LKAVA+QPVSVGICGS+R FQLYSKGIF+GPCSTSLDHAVLIVGYGSENGVDYWI+ Sbjct: 237 EKELLKAVANQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIV 296 Query: 673 KNSWGKNWGMNGYMHMQRNSGNKEGVCGINMLASYPIKTXXXXXXXXXXXPTQCSLLSYC 494 KNSWG WGM+GYMHMQRNSG+ G+CGINMLASYP KT PT+C L ++C Sbjct: 297 KNSWGSYWGMDGYMHMQRNSGSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHC 356 Query: 493 GSGETCCCGWRLLGICLSWKCCEAESAVCCNDHVSCCPSHYPVCDTERKQCLQAAGNSTM 314 G GETCCC + GICLSWKCCE +SAVCC D CCP YPVCDT R CL+ GN+T Sbjct: 357 GEGETCCCVHHIFGICLSWKCCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATR 416 Query: 313 VKGLEKKGSIWKFGGLNPLFEAW 245 ++ K S KF + L E W Sbjct: 417 IEKFAKNSSSGKFRSWSSLLEGW 439 >ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum] Length = 439 Score = 577 bits (1486), Expect = e-162 Identities = 269/428 (62%), Positives = 328/428 (76%), Gaps = 3/428 (0%) Frame = -3 Query: 1540 LLLFLVSVNFHLYLCSPTSDLFDSWCKQHGKSYSSEEERLYRLTVFEDNLAFVNQHNSLT 1361 L+L L+ CS SDLF++WC+Q+GK YSSE+ER+YR VFE+N A++ +HNS Sbjct: 8 LVLVLLIFQQPFCTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYITEHNSKE 67 Query: 1360 NSTYTLGVNAFADLTHHEFRVXXXXXXXXXXXXXXXXXSES-TAGAGFVGDV--PKSIDW 1190 NS+YTLG+NA++DLTHHEFR S ++ G + DV P S+DW Sbjct: 68 NSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSETGVLSDVDAPSSLDW 127 Query: 1189 RSKGAVTPVKDQGRCGACWSFSATGAIEGINQIVTGSLVSVSEQELIDCDRSYNSGCGGG 1010 R KGAVT VK+QG CGACWSFSATGA+EGIN+I TGSLVS+SEQELIDCDRSYN GCGGG Sbjct: 128 
REKGAVTDVKNQGSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRSYNEGCGGG 187 Query: 1009 LMDYAFQFVVKNHGIDTEKDYPYQETDRTCNRNKLKRLVVTIDGFIDVPSYNEEEILKAV 830 LMDYAF+FV+KN GIDTEKDYP++E + TCN+NKL+R VVTIDG+ D+P +E+++LKAV Sbjct: 188 LMDYAFEFVIKNGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQNDEDKLLKAV 247 Query: 829 ASQPVSVGICGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKNW 650 A+QPVSVGICGS R FQ YSKGIF+GPCST+LDHAVLIVGYGSENGVDYWI+KNSWG +W Sbjct: 248 ATQPVSVGICGSARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTSW 307 Query: 649 GMNGYMHMQRNSGNKEGVCGINMLASYPIKTXXXXXXXXXXXPTQCSLLSYCGSGETCCC 470 G+NGY+HMQRNSGN+EG+CGIN LASYP KT P++CS+ + CG GETCCC Sbjct: 308 GINGYIHMQRNSGNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSCGQGETCCC 367 Query: 469 GWRLLGICLSWKCCEAESAVCCNDHVSCCPSHYPVCDTERKQCLQAAGNSTMVKGLEKKG 290 G + LGICLSWKCC +SAVCC D CCP YP+CDT R CL+ N+T+V+ +K+ Sbjct: 368 GSKFLGICLSWKCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATIVQQPQKEA 427 Query: 289 SIWKFGGL 266 KFGGL Sbjct: 428 FTGKFGGL 435 >ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao] Length = 438 Score = 577 bits (1486), Expect = e-162 Identities = 271/432 (62%), Positives = 322/432 (74%) Frame = -3 Query: 1540 LLLFLVSVNFHLYLCSPTSDLFDSWCKQHGKSYSSEEERLYRLTVFEDNLAFVNQHNSLT 1361 LL FL+ + S S LF++WC QHGK YSSEEE+ YRL VFE+N AFV QHN + Sbjct: 9 LLSFLLFFDPSFASPSHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVG 68 Query: 1360 NSTYTLGVNAFADLTHHEFRVXXXXXXXXXXXXXXXXXSESTAGAGFVGDVPKSIDWRSK 1181 NS+Y+L +NAFADLTHHEF+ + G V D+P S+DWR+K Sbjct: 69 NSSYSLALNAFADLTHHEFKASRLGLSAAAIEGSRP----NLQLPGLVRDIPASMDWRTK 124 Query: 1180 GAVTPVKDQGRCGACWSFSATGAIEGINQIVTGSLVSVSEQELIDCDRSYNSGCGGGLMD 1001 GAVT VKDQG CGACWSFSATGAIEGIN+IVTG+LVS+SEQEL+DCDRSYNSGC GGLMD Sbjct: 125 GAVTKVKDQGSCGACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMD 184 Query: 1000 YAFQFVVKNHGIDTEKDYPYQETDRTCNRNKLKRLVVTIDGFIDVPSYNEEEILKAVASQ 821 YA+QFV+ NHGID E+DYPY ++TCN+ K KR VVTIDG+ VP+ NE+ +L+AVA Q Sbjct: 185 
YAYQFVIDNHGIDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQ 244 Query: 820 PVSVGICGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKNWGMN 641 PVSVGICGS+R FQLYSKGIF+GPCS+SLDHAVLIVGYGSENGVDYWI+KNSWG WGMN Sbjct: 245 PVSVGICGSERAFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMN 304 Query: 640 GYMHMQRNSGNKEGVCGINMLASYPIKTXXXXXXXXXXXPTQCSLLSYCGSGETCCCGWR 461 GY+HM RNSG+ +G+CGINMLASYP KT PT+C L +YC +GETCCC R Sbjct: 305 GYIHMLRNSGDSKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHR 364 Query: 460 LLGICLSWKCCEAESAVCCNDHVSCCPSHYPVCDTERKQCLQAAGNSTMVKGLEKKGSIW 281 + GIC SWKCCE +SAVCC D+ CCP YPVCDT++ QCL+ GN+T ++ EK+ S Sbjct: 365 IFGICFSWKCCELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFEKRHSTR 424 Query: 280 KFGGLNPLFEAW 245 KF P E W Sbjct: 425 KFSSWRPFVENW 436 >ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum] Length = 439 Score = 574 bits (1480), Expect = e-161 Identities = 269/428 (62%), Positives = 326/428 (76%), Gaps = 3/428 (0%) Frame = -3 Query: 1540 LLLFLVSVNFHLYLCSPTSDLFDSWCKQHGKSYSSEEERLYRLTVFEDNLAFVNQHNSLT 1361 L+L L+ L CS SDLF++WC+Q+GK YSSE+ER+YR VFE+N A++ +HNS Sbjct: 8 LVLVLLIFQQPLCTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYITEHNSKG 67 Query: 1360 NSTYTLGVNAFADLTHHEFRVXXXXXXXXXXXXXXXXXSES-TAGAGFVGDV--PKSIDW 1190 NS+YTLG+NA++DLTHHEFR S ++ AG + DV P S+DW Sbjct: 68 NSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSAAGVLSDVDAPSSLDW 127 Query: 1189 RSKGAVTPVKDQGRCGACWSFSATGAIEGINQIVTGSLVSVSEQELIDCDRSYNSGCGGG 1010 R KGAVT VK+QG CGACWSFSATGAIEGIN+I TGSLVS+SEQELIDCDRSYN GCGGG Sbjct: 128 RDKGAVTNVKNQGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRSYNQGCGGG 187 Query: 1009 LMDYAFQFVVKNHGIDTEKDYPYQETDRTCNRNKLKRLVVTIDGFIDVPSYNEEEILKAV 830 LMDYAF+FV+KN GIDTEKDYP++E + TCN+NKL+R VVTIDG+ D+P +E+++LKAV Sbjct: 188 LMDYAFEFVIKNGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQNDEDKLLKAV 247 Query: 829 ASQPVSVGICGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKNW 650 A+QPVSVGICGS R FQ YSKGIF+GPC T LDHAVLIVGYGSENG DYWI+KNSWG +W Sbjct: 248 
ATQPVSVGICGSARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWIIKNSWGTSW 307 Query: 649 GMNGYMHMQRNSGNKEGVCGINMLASYPIKTXXXXXXXXXXXPTQCSLLSYCGSGETCCC 470 G+NGY+HMQRNSGN+EG+CG+N LASYP KT P++CS + CG GETCCC Sbjct: 308 GINGYIHMQRNSGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSCGQGETCCC 367 Query: 469 GWRLLGICLSWKCCEAESAVCCNDHVSCCPSHYPVCDTERKQCLQAAGNSTMVKGLEKKG 290 G + LGICLSWKCC +SAVCC D CCP YP+CDT R CL+ N+T+V+ +K+ Sbjct: 368 GLKFLGICLSWKCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATIVQQPQKEP 427 Query: 289 SIWKFGGL 266 KFGGL Sbjct: 428 FTGKFGGL 435 >ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] gi|557095297|gb|ESQ35879.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] Length = 444 Score = 573 bits (1477), Expect = e-161 Identities = 278/445 (62%), Positives = 325/445 (73%), Gaps = 2/445 (0%) Frame = -3 Query: 1567 SSIDSSRVILLLFLVSVNFHLYLCSPT--SDLFDSWCKQHGKSYSSEEERLYRLTVFEDN 1394 SS S LL + S++F + S ++LFD WC +HGK+Y SEEER +R+ +F DN Sbjct: 5 SSFVSLTFFFLLLVSSLSFSISSSSSDDIAELFDDWCHRHGKTYGSEEERQHRIQIFRDN 64 Query: 1393 LAFVNQHNSLTNSTYTLGVNAFADLTHHEFRVXXXXXXXXXXXXXXXXXSESTAGAGFVG 1214 FV QHN ++NSTY+L +NAFADLTHHEF+ S + V Sbjct: 65 HDFVTQHNHISNSTYSLSLNAFADLTHHEFKASRLGLSAPSPSLMAKEQSLGVSERVRV- 123 Query: 1213 DVPKSIDWRSKGAVTPVKDQGRCGACWSFSATGAIEGINQIVTGSLVSVSEQELIDCDRS 1034 VP S+DWR KGAVT VKDQG CGACWSFSATGA+EGINQIVTG L+S+SEQELIDCD+S Sbjct: 124 KVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKS 183 Query: 1033 YNSGCGGGLMDYAFQFVVKNHGIDTEKDYPYQETDRTCNRNKLKRLVVTIDGFIDVPSYN 854 YN+GC GGLMDYAF+FV+KNHGIDTEKDYPYQE D TC ++KLK+ VVTID + V S N Sbjct: 184 YNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQEQDGTCKKDKLKKRVVTIDSYAGVASNN 243 Query: 853 EEEILKAVASQPVSVGICGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIL 674 E+ +++AVASQPVSVGICGS+R FQLYS GIFSGPCSTSLDHAVLIVGYGS+NGVDYWI+ Sbjct: 244 EKALMEAVASQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIV 303 Query: 673 KNSWGKNWGMNGYMHMQRNSGNKEGVCGINMLASYPIKTXXXXXXXXXXXPTQCSLLSYC 494 KNSWGK+WGM+G+MHMQRN+GN EGVCGINMLASYPIKT PT+C+L +YC Sbjct: 
304 KNSWGKSWGMDGFMHMQRNTGNSEGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYC 363 Query: 493 GSGETCCCGWRLLGICLSWKCCEAESAVCCNDHVSCCPSHYPVCDTERKQCLQAAGNSTM 314 SGETCCC L G+C SWKCCE ESAVCC D CCP YPVCDT + CL+ GN T Sbjct: 364 SSGETCCCARTLFGLCFSWKCCELESAVCCKDGRHCCPRDYPVCDTTKSLCLKKTGNFTE 423 Query: 313 VKGLEKKGSIWKFGGLNPLFEAWNM 239 +K KK S K G FE W M Sbjct: 424 IKPFWKKNSSNKLG----RFEEWVM 444 >gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] Length = 437 Score = 572 bits (1475), Expect = e-160 Identities = 273/444 (61%), Positives = 324/444 (72%) Frame = -3 Query: 1570 MSSIDSSRVILLLFLVSVNFHLYLCSPTSDLFDSWCKQHGKSYSSEEERLYRLTVFEDNL 1391 MS SS + L F + + S+LFD WC++HGK+Y SEEER R+ +F+DN Sbjct: 1 MSMSSSSFISLTFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNH 60 Query: 1390 AFVNQHNSLTNSTYTLGVNAFADLTHHEFRVXXXXXXXXXXXXXXXXXSESTAGAGFVGD 1211 FV QHN +TN+TY+L +NAFADLTHHEF+ +S G+ Sbjct: 61 DFVTQHNLITNATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIMASKGQSLGGSV---K 117 Query: 1210 VPKSIDWRSKGAVTPVKDQGRCGACWSFSATGAIEGINQIVTGSLVSVSEQELIDCDRSY 1031 VP S+DWR KGAVT VKDQG CGACWSFSATGA+EGINQIVTG L+S+SEQELIDCD+SY Sbjct: 118 VPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSY 177 Query: 1030 NSGCGGGLMDYAFQFVVKNHGIDTEKDYPYQETDRTCNRNKLKRLVVTIDGFIDVPSYNE 851 N+GC GGLMDYAF+FV+KNHGIDTEKDYPYQE D TC ++KLK+ VVTID + V S +E Sbjct: 178 NAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDE 237 Query: 850 EEILKAVASQPVSVGICGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWILK 671 + +++AVA+QPVSVGICGS+R FQLYS+GIFSGPCSTSLDHAVLIVGYGS+NGVDYWI+K Sbjct: 238 KALMEAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVK 297 Query: 670 NSWGKNWGMNGYMHMQRNSGNKEGVCGINMLASYPIKTXXXXXXXXXXXPTQCSLLSYCG 491 NSWGK+WGM+G+MHMQRN+ N +GVCGINMLASYPIKT PT+C+L +YC Sbjct: 298 NSWGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCS 357 Query: 490 SGETCCCGWRLLGICLSWKCCEAESAVCCNDHVSCCPSHYPVCDTERKQCLQAAGNSTMV 311 SGETCCC L G+C SWKCCE ESAVCC D CCP YPVCDT R CL+ GN T + Sbjct: 358 
SGETCCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAI 417 Query: 310 KGLEKKGSIWKFGGLNPLFEAWNM 239 K KK S + G FE W M Sbjct: 418 KPFWKKNSSKQLG----RFEEWVM 437 >gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] Length = 517 Score = 572 bits (1474), Expect = e-160 Identities = 271/408 (66%), Positives = 317/408 (77%), Gaps = 5/408 (1%) Frame = -3 Query: 1543 ILLLFLVSV--NFHLYLCSPT---SDLFDSWCKQHGKSYSSEEERLYRLTVFEDNLAFVN 1379 IL LFL+S+ + HL L SP+ S LF++WC++HG+SYSSEEERLYRLTVFEDNLAFV Sbjct: 3 ILCLFLLSLLLSSHLSLSSPSLNSSQLFEAWCEKHGQSYSSEEERLYRLTVFEDNLAFVT 62 Query: 1378 QHNSLTNSTYTLGVNAFADLTHHEFRVXXXXXXXXXXXXXXXXXSESTAGAGFVGDVPKS 1199 QHN++ NS+YTL +NAFADLTHHEF+ S+ + DVP S Sbjct: 63 QHNNMGNSSYTLSLNAFADLTHHEFKSSRLGFSSALLSSLPKLGSKLLD----LRDVPAS 118 Query: 1198 IDWRSKGAVTPVKDQGRCGACWSFSATGAIEGINQIVTGSLVSVSEQELIDCDRSYNSGC 1019 +DWR KGAVT VKDQG CGACW+FSATGAIEGIN+IVTGSLVS+SEQELIDCD SYN+GC Sbjct: 119 LDWRKKGAVTNVKDQGSCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNAGC 178 Query: 1018 GGGLMDYAFQFVVKNHGIDTEKDYPYQETDRTCNRNKLKRLVVTIDGFIDVPSYNEEEIL 839 GGLMDYA+QFV+ NHGIDTE+DYPYQ D++C + KLKR VVTIDG+ DV N ++L Sbjct: 179 DGGLMDYAYQFVIDNHGIDTEEDYPYQARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQLL 238 Query: 838 KAVASQPVSVGICGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWG 659 +AV +QPVSVGICGS+R FQLYSKGIF+GPCSTSLDHAVLIVGY SENGVDYWI+KNSWG Sbjct: 239 QAVVTQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYDSENGVDYWIVKNSWG 298 Query: 658 KNWGMNGYMHMQRNSGNKEGVCGINMLASYPIKTXXXXXXXXXXXPTQCSLLSYCGSGET 479 K WGM+GY+HMQRN+GN +GVCGINMLASYP KT PT+CS + CG GET Sbjct: 299 KQWGMDGYIHMQRNTGNSQGVCGINMLASYPTKTSPNPPPSPSPGPTRCSFFAQCGEGET 358 Query: 478 CCCGWRLLGICLSWKCCEAESAVCCNDHVSCCPSHYPVCDTERKQCLQ 335 CCC WR LG+C SWKCC SAVCC D + CCP YP+CDT+R CL+ Sbjct: 359 CCCSWRFLGLCFSWKCCGLNSAVCCKDKIHCCPQDYPLCDTQRNVCLK 406 >ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana] gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana] 
gi|332190386|gb|AEE28507.1| papain-like cysteine peptidase [Arabidopsis thaliana] Length = 437 Score = 572 bits (1473), Expect = e-160 Identities = 273/444 (61%), Positives = 323/444 (72%) Frame = -3 Query: 1570 MSSIDSSRVILLLFLVSVNFHLYLCSPTSDLFDSWCKQHGKSYSSEEERLYRLTVFEDNL 1391 MS SS + L F + + S+LFD WC++HGK+Y SEEER R+ +F+DN Sbjct: 1 MSMSSSSFISLTFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNH 60 Query: 1390 AFVNQHNSLTNSTYTLGVNAFADLTHHEFRVXXXXXXXXXXXXXXXXXSESTAGAGFVGD 1211 FV QHN +TN+TY+L +NAFADLTHHEF+ +S G+ Sbjct: 61 DFVTQHNLITNATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIMASKGQSLGGSV---K 117 Query: 1210 VPKSIDWRSKGAVTPVKDQGRCGACWSFSATGAIEGINQIVTGSLVSVSEQELIDCDRSY 1031 VP S+DWR KGAVT VKDQG CGACWSFSATGA+EGINQIVTG L+S+SEQELIDCD+SY Sbjct: 118 VPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSY 177 Query: 1030 NSGCGGGLMDYAFQFVVKNHGIDTEKDYPYQETDRTCNRNKLKRLVVTIDGFIDVPSYNE 851 N+GC GGLMDYAF+FV+KNHGIDTEKDYPYQE D TC ++KLK+ VVTID + V S +E Sbjct: 178 NAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDE 237 Query: 850 EEILKAVASQPVSVGICGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWILK 671 + +++AVA+QPVSVGICGS+R FQLYS GIFSGPCSTSLDHAVLIVGYGS+NGVDYWI+K Sbjct: 238 KALMEAVAAQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVK 297 Query: 670 NSWGKNWGMNGYMHMQRNSGNKEGVCGINMLASYPIKTXXXXXXXXXXXPTQCSLLSYCG 491 NSWGK+WGM+G+MHMQRN+ N +GVCGINMLASYPIKT PT+C+L +YC Sbjct: 298 NSWGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCS 357 Query: 490 SGETCCCGWRLLGICLSWKCCEAESAVCCNDHVSCCPSHYPVCDTERKQCLQAAGNSTMV 311 SGETCCC L G+C SWKCCE ESAVCC D CCP YPVCDT R CL+ GN T + Sbjct: 358 SGETCCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAI 417 Query: 310 KGLEKKGSIWKFGGLNPLFEAWNM 239 K KK S + G FE W M Sbjct: 418 KPFWKKNSSKQLG----RFEEWVM 437 >ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. 
lyrata] Length = 439 Score = 571 bits (1471), Expect = e-160 Identities = 277/449 (61%), Positives = 328/449 (73%), Gaps = 7/449 (1%) Frame = -3 Query: 1564 SIDSSRVILLLFLVSVNFHLYLCSPTS-----DLFDSWCKQHGKSYSSEEERLYRLTVFE 1400 S+ SS + L F F L + SP+S +LFD WC++HGK+Y SEEER R+ +F+ Sbjct: 2 SMSSSSFVSLTFF----FLLLVSSPSSSDDISELFDDWCQRHGKTYGSEEERQQRIQIFK 57 Query: 1399 DNLAFVNQHNSLTNSTYTLGVNAFADLTHHEFRVXXXXXXXXXXXXXXXXXSESTAGAGF 1220 DN FV QHN +TN+TY+L +NAFADLTHHEF+ +S G Sbjct: 58 DNHDFVTQHNLITNATYSLSLNAFADLTHHEFKASRLGLSVSASSLIMASKGQSLGGNA- 116 Query: 1219 VGDVPKSIDWRSKGAVTPVKDQGRCGACWSFSATGAIEGINQIVTGSLVSVSEQELIDCD 1040 VP S+DWR KGAVT VKDQG CGACWSFSATGA+EGINQIVTG L+S+SEQELIDCD Sbjct: 117 --KVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCD 174 Query: 1039 RSYNSGCGGGLMDYAFQFVVKNHGIDTEKDYPYQETDRTCNRNKLKRLVVTIDGFIDVPS 860 +SYN+GC GGLMDYAF+FV+KNHGIDTEKDYPYQE D TC ++KLK+ VVTID + V S Sbjct: 175 KSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKS 234 Query: 859 YNEEEILKAVASQPVSVGICGSDRGFQLYSK--GIFSGPCSTSLDHAVLIVGYGSENGVD 686 +E+ + +AVA+QPVSVGICGS+R FQLYS+ GIFSGPCSTSLDHAVLIVGYGS+NGVD Sbjct: 235 NDEKALREAVAAQPVSVGICGSERAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVD 294 Query: 685 YWILKNSWGKNWGMNGYMHMQRNSGNKEGVCGINMLASYPIKTXXXXXXXXXXXPTQCSL 506 YWI+KNSWGK+WGM+G+MHMQRN+GN EG+CGINMLASYPIKT PT+C+L Sbjct: 295 YWIVKNSWGKSWGMDGFMHMQRNTGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNL 354 Query: 505 LSYCGSGETCCCGWRLLGICLSWKCCEAESAVCCNDHVSCCPSHYPVCDTERKQCLQAAG 326 +YC +GETCCC L G+C SWKCCE ESAVCC+D CCP YPVCDT R CL+ G Sbjct: 355 FTYCSAGETCCCARNLFGLCFSWKCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTG 414 Query: 325 NSTMVKGLEKKGSIWKFGGLNPLFEAWNM 239 N T +K KK S K G FE W M Sbjct: 415 NFTAIKPFWKKDSSNKLG----RFEGWVM 439 >ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 431 Score = 569 bits (1467), Expect = e-159 Identities = 271/418 (64%), Positives = 316/418 (75%) Frame = -3 Query: 1540 
LLLFLVSVNFHLYLCSPTSDLFDSWCKQHGKSYSSEEERLYRLTVFEDNLAFVNQHNSLT 1361 L LFL+ L S S+LF+ WC +HGKSYSS EE+LYRL VF DN FV HN+L Sbjct: 9 LTLFLLLFR-PLSATSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLD 67 Query: 1360 NSTYTLGVNAFADLTHHEFRVXXXXXXXXXXXXXXXXXSESTAGAGFVGDVPKSIDWRSK 1181 NS+YTL +N++ADLTHHEF+V E + DVP S+DWR K Sbjct: 68 NSSYTLSLNSYADLTHHEFKVSRLGFSPALRNFRPVLPQEPSLPR----DVPDSLDWRKK 123 Query: 1180 GAVTPVKDQGRCGACWSFSATGAIEGINQIVTGSLVSVSEQELIDCDRSYNSGCGGGLMD 1001 GAVT VKDQG CGACWSFSATGA+EGINQI+TGSL+S+SEQELIDCDRSYNSGCGGGLMD Sbjct: 124 GAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMD 183 Query: 1000 YAFQFVVKNHGIDTEKDYPYQETDRTCNRNKLKRLVVTIDGFIDVPSYNEEEILKAVASQ 821 YA+QFV+ NHGIDTE DYPYQ D +C ++KL+R VVTIDG+ D+PS +E ++L+AVA+Q Sbjct: 184 YAYQFVISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQ 243 Query: 820 PVSVGICGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKNWGMN 641 PVSVGICGS+R FQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWI+KNSWGK+WGM+ Sbjct: 244 PVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMD 303 Query: 640 GYMHMQRNSGNKEGVCGINMLASYPIKTXXXXXXXXXXXPTQCSLLSYCGSGETCCCGWR 461 GYMHMQRNSGN EGVCGIN LASYP KT PT+CS+L+ C +GETCCC + Sbjct: 304 GYMHMQRNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKK 363 Query: 460 LLGICLSWKCCEAESAVCCNDHVSCCPSHYPVCDTERKQCLQAAGNSTMVKGLEKKGS 287 LG+CLSWKCC SAVCC D CCP YP+CDT+R CL+ N T + LE + S Sbjct: 364 FLGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENRSS 421 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 563 bits (1452), Expect = e-158 Identities = 265/417 (63%), Positives = 320/417 (76%), Gaps = 12/417 (2%) Frame = -3 Query: 1543 ILLLFLVSVNFHLYLCSPTSD---LFDSWCKQHGKSYSSEEERLYRLTVFEDNLAFVNQH 1373 + L+ L+ N + S +SD LF+SW K+HGK+Y+S+E++LYR +FE+N FV +H Sbjct: 7 LFLITLLFFNLSISSFSSSSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKH 66 Query: 1372 NSLTNSTYTLGVNAFADLTHHEFRVXXXXXXXXXXXXXXXXXSESTAGA---------GF 1220 NS NS+YTL 
+NAFADLTHHEF+ ST+G F Sbjct: 67 NSQGNSSYTLSLNAFADLTHHEFKASRLGLSAF-----------STSGKLSRRNFPLHDF 115 Query: 1219 VGDVPKSIDWRSKGAVTPVKDQGRCGACWSFSATGAIEGINQIVTGSLVSVSEQELIDCD 1040 VGDVP SIDWR KGAV+ VKDQG CGACWSFSATGAIEGIN+IVTGSLVS+SEQEL+DCD Sbjct: 116 VGDVPISIDWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCD 175 Query: 1039 RSYNSGCGGGLMDYAFQFVVKNHGIDTEKDYPYQETDRTCNRNKLKRLVVTIDGFIDVPS 860 RSYN+GC GGLMDYA+QFV++N+GIDTE+DYPYQ ++TCN+ KLKR VVTIDG+ DVP Sbjct: 176 RSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQ 235 Query: 859 YNEEEILKAVASQPVSVGICGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYW 680 NE+E+LKAVA+QPVSVGICGS+R FQLYSKGIF+GPCSTSLDHAVLIVGYGSENGVDYW Sbjct: 236 NNEKELLKAVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYW 295 Query: 679 ILKNSWGKNWGMNGYMHMQRNSGNKEGVCGINMLASYPIKTXXXXXXXXXXXPTQCSLLS 500 I+KNSWG +WG+NGYM+M RNSGN +G+CGINMLAS+P+KT PT+C L + Sbjct: 296 IVKNSWGTHWGINGYMYMLRNSGNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFT 355 Query: 499 YCGSGETCCCGWRLLGICLSWKCCEAESAVCCNDHVSCCPSHYPVCDTERKQCLQAA 329 CG GETCCC R+ G+C SWKCCE +SAVCC D + CCP YPVCDT+R CL+ + Sbjct: 356 RCGEGETCCCTRRIFGLCFSWKCCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLKVS 412 >ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera] gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera] Length = 436 Score = 561 bits (1446), Expect = e-157 Identities = 254/435 (58%), Positives = 330/435 (75%), Gaps = 3/435 (0%) Frame = -3 Query: 1534 LFLVSVNFHLYLCSPTSDLFDSWCKQHGKSYSSEEERLYRLTVFEDNLAFVNQHNSLTNS 1355 + +++V+ + S T+DLF++WC+Q+GK+YSSEEE+ RL VFE+N AFV QHNS+ N+ Sbjct: 10 ILILAVHSSVSEASSTADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANA 69 Query: 1354 TYTLGVNAFADLTHHEFRVXXXXXXXXXXXXXXXXXSESTAGAGFVGD---VPKSIDWRS 1184 +YTL +NAFADLTHHEF+ +S G VP ++DWR Sbjct: 70 SYTLALNAFADLTHHEFKASRLGFSPGRA--------QSIRSVGTPVQELHVPPAVDWRK 121 Query: 1183 KGAVTPVKDQGRCGACWSFSATGAIEGINQIVTGSLVSVSEQELIDCDRSYNSGCGGGLM 1004 GAVT VKDQG CG CWSFS TGAIEGIN+IVTGSLVS+SEQEL+DCDRSYNSGC GGLM Sbjct: 122 
SGAVTGVKDQGNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLM 181 Query: 1003 DYAFQFVVKNHGIDTEKDYPYQETDRTCNRNKLKRLVVTIDGFIDVPSYNEEEILKAVAS 824 DYA+QFV+KN GID+E DYPY D+ CN+ KLK+ +VTIDG+ D+P +E+++L+ VA Sbjct: 182 DYAYQFVIKNQGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAK 241 Query: 823 QPVSVGICGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKNWGM 644 QPVSVGICGS++ FQLYSKG+++GPCS++LDHAVLIVGYG+E+GVD+WI+KNSWG++WGM Sbjct: 242 QPVSVGICGSEKTFQLYSKGVYTGPCSSTLDHAVLIVGYGTEDGVDFWIVKNSWGEHWGM 301 Query: 643 NGYMHMQRNSGNKEGVCGINMLASYPIKTXXXXXXXXXXXPTQCSLLSYCGSGETCCCGW 464 GY+HM RN+G EG+CGINMLASYP KT PT+C S C GETCCC W Sbjct: 302 RGYIHMLRNNGTAEGICGINMLASYPAKTSPNPPPPPTPGPTKCDFFSSCSEGETCCCSW 361 Query: 463 RLLGICLSWKCCEAESAVCCNDHVSCCPSHYPVCDTERKQCLQAAGNSTMVKGLEKKGSI 284 R +G+CLSW CC A+SAVCC+++ CCP+ +P+CDT+R +CL+ AGN T V+ L+++GS Sbjct: 362 RFIGVCLSWNCCTAKSAVCCDNNNYCCPASHPICDTKRNRCLKPAGNGTGVEVLKRRGSS 421 Query: 283 WKFGGLNPLFEAWNM 239 KFGG + + +AWN+ Sbjct: 422 VKFGGWSSINDAWNL 436 >ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [Fragaria vesca subsp. 
vesca] Length = 441 Score = 560 bits (1444), Expect = e-157 Identities = 276/435 (63%), Positives = 324/435 (74%), Gaps = 4/435 (0%) Frame = -3 Query: 1570 MSSIDSSRVILLLFLVSVNFHLYLCSPTSDLFDSWCKQHGKSYSSEEERLYRLTVFEDNL 1391 M+ + S + LLL L S +S+LF++WCKQ+GKSYSS+EE+LYRL++FE NL Sbjct: 1 MNPLLLSLLTLLLLLSHPCLSSSSSSSSSELFEAWCKQYGKSYSSQEEKLYRLSLFEQNL 60 Query: 1390 AFVNQHNSLTNSTYTLGVNAFADLTHHEFRVXXXXXXXXXXXXXXXXXSESTAGAGFVGD 1211 AF+ QHN L NS+YTL +N+F+DLTHHEF+ +S V Sbjct: 61 AFITQHNDLGNSSYTLSLNSFSDLTHHEFKASRLGFSPTFLRLYR----KSDPKPSVVRH 116 Query: 1210 VPKSIDWRSKGAVTPVKDQGRCGACWSFSATGAIEGINQIVTGSLVSVSEQELIDCDRSY 1031 VP SIDWR GAVT VKDQG CGACWSFSATGAIEGIN+IVTGSLVS+SEQELIDCDR Y Sbjct: 117 VPSSIDWRKNGAVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIDCDRVY 176 Query: 1030 -NSGCGGGLMDYAFQFVVKNHGIDTEKDYPYQETDRTCNRNKLKRLVVTIDGFIDVPSYN 854 NSGC GGLMD AFQF++ N+GIDTE+DYPYQ D TCN+ KLKR VVTIDG+ DVP+ N Sbjct: 177 PNSGCNGGLMDDAFQFIIDNNGIDTEEDYPYQGADGTCNKQKLKRHVVTIDGYTDVPANN 236 Query: 853 EEEILKAVASQPVSVGICGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIL 674 EE++LKAVA+QPVSVGI GS R FQ YSKGIF+GPCST+LDHAVLIVGYGSENGVDYWI+ Sbjct: 237 EEQLLKAVATQPVSVGIAGSGREFQFYSKGIFAGPCSTTLDHAVLIVGYGSENGVDYWIV 296 Query: 673 KNSWGKNWGMNGYMHMQRNSGNKEGVCGINMLASYPIKT---XXXXXXXXXXXPTQCSLL 503 KNSWGKNWGMNGY+H+ R+ N +G+CGINMLASYP KT PT+C L Sbjct: 297 KNSWGKNWGMNGYIHILRDHSNSKGLCGINMLASYPTKTGENPPFPPPSPPPGPTKCDLF 356 Query: 502 SYCGSGETCCCGWRLLGICLSWKCCEAESAVCCNDHVSCCPSHYPVCDTERKQCLQAAGN 323 S CG GETCCC ++LGICLSW+CCE SAVCC D + CCP YP+CDTER CLQA GN Sbjct: 357 SKCGVGETCCCARKILGICLSWRCCEFTSAVCCKDRLHCCPHDYPICDTERNYCLQANGN 416 Query: 322 STMVKGLEKKGSIWK 278 TM + E +GS+ K Sbjct: 417 LTM-RANEIRGSLRK 430 >gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135) [Arabidopsis thaliana] Length = 416 Score = 548 bits (1412), Expect = e-153 Identities = 255/391 (65%), Positives = 301/391 (76%), Gaps = 7/391 (1%) Frame = -3 Query: 1486 SDLFDSWCKQHGKSYSSEEERLYRLTVFEDNLAFVNQHNSLTNSTYTLGVNAFADLTHHE 1307 S+LFD WC++HGK+Y SEEER R+ 
+F+DN FV QHN +TN+TY+L +NAFADLTHHE Sbjct: 27 SELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 86 Query: 1306 FRVXXXXXXXXXXXXXXXXXSESTAGAGFVGDVPKSIDWRSKGAVTPVKDQGRCGACWSF 1127 F+ +S G+ VP S+DWR KGAVT VKDQG CGACWSF Sbjct: 87 FKASRLGLSVSAPSVIMASKGQSLGGSV---KVPDSVDWRKKGAVTNVKDQGSCGACWSF 143 Query: 1126 SATGAIEGINQIVTGSLVSVSEQELIDCDRSYNSGCGGGLMDYAFQFVVKNHGIDTEKDY 947 SATGA+EGINQIVTG L+S+SEQELIDCD+SYN+GC GGLMDYAF+FV+KNHGIDTEKDY Sbjct: 144 SATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDY 203 Query: 946 PYQETDRTCNRNKLKRLVVTIDGFIDVPSYNEEEILKAVASQPVSVGICGSDRGFQLYS- 770 PYQE D TC ++KLK+ VVTID + V S +E+ +++AVA+QPVSVGICGS+R FQLYS Sbjct: 204 PYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSS 263 Query: 769 ------KGIFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKNWGMNGYMHMQRNSGN 608 +GIFSGPCSTSLDHAVLIVGYGS+NGVDYWI+KNSWGK+WGM+G+MHMQRN+ N Sbjct: 264 KFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTEN 323 Query: 607 KEGVCGINMLASYPIKTXXXXXXXXXXXPTQCSLLSYCGSGETCCCGWRLLGICLSWKCC 428 +GVCGINMLASYPIKT PT+C+L +YC SGETCCC L G+C SWKCC Sbjct: 324 SDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCC 383 Query: 427 EAESAVCCNDHVSCCPSHYPVCDTERKQCLQ 335 E ESAVCC D CCP YPVCDT R CL+ Sbjct: 384 EIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 414 >ref|XP_006838704.1| hypothetical protein AMTR_s00002p00249780 [Amborella trichopoda] gi|548841210|gb|ERN01273.1| hypothetical protein AMTR_s00002p00249780 [Amborella trichopoda] Length = 475 Score = 547 bits (1409), Expect = e-153 Identities = 252/408 (61%), Positives = 305/408 (74%) Frame = -3 Query: 1483 DLFDSWCKQHGKSYSSEEERLYRLTVFEDNLAFVNQHNSLTNSTYTLGVNAFADLTHHEF 1304 D+F+SWC++HG++Y + EE+ R VF DNL F+ +HN NS YT+G+NAFADLTHHEF Sbjct: 71 DIFESWCRRHGRTYGTVEEKEQRFRVFSDNLVFIREHNQRANSNYTVGLNAFADLTHHEF 130 Query: 1303 RVXXXXXXXXXXXXXXXXXSESTAGAGFVGDVPKSIDWRSKGAVTPVKDQGRCGACWSFS 1124 ++ DVP S+DWR KGAVT VKDQG CGACW+FS Sbjct: 131 KIKRLGLCPSILRFSSSNFRSDQKKI----DVPSSLDWRDKGAVTNVKDQGSCGACWAFS 186 Query: 1123 
ATGAIEGINQIVTGSLVSVSEQELIDCDRSYNSGCGGGLMDYAFQFVVKNHGIDTEKDYP 944 ATGAIEGIN+IVTGSL+S+SEQE+IDCD +YNSGCGGGLMDYAF++V KNHGIDTEKDYP Sbjct: 187 ATGAIEGINKIVTGSLISLSEQEIIDCDTTYNSGCGGGLMDYAFKWVTKNHGIDTEKDYP 246 Query: 943 YQETDRTCNRNKLKRLVVTIDGFIDVPSYNEEEILKAVASQPVSVGICGSDRGFQLYSKG 764 Y+E +C ++K +R VVTIDG D+PS +E+ IL+AVA QPVSVGICGS+R FQLYS G Sbjct: 247 YREVQGSCIKDKAERHVVTIDGHTDIPSNSEDLILQAVAKQPVSVGICGSERSFQLYSSG 306 Query: 763 IFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKNWGMNGYMHMQRNSGNKEGVCGIN 584 IFSGPCSTSLDHAVLIVGYGS+NGVDYWI+KNSWG +WGM+GYMHM RNSG+ +GVCGIN Sbjct: 307 IFSGPCSTSLDHAVLIVGYGSKNGVDYWIVKNSWGTSWGMDGYMHMLRNSGDSQGVCGIN 366 Query: 583 MLASYPIKTXXXXXXXXXXXPTQCSLLSYCGSGETCCCGWRLLGICLSWKCCEAESAVCC 404 M+ SYP K+ P +CSLL+YC SG TCCC WR LGICLSW CC+ ++AVCC Sbjct: 367 MMPSYPTKSGANPPPSPPPGPVKCSLLTYCPSGNTCCCTWRFLGICLSWSCCDLDNAVCC 426 Query: 403 NDHVSCCPSHYPVCDTERKQCLQAAGNSTMVKGLEKKGSIWKFGGLNP 260 D CCP YPVC+T CL+ +GN T + GL+++ S FGG P Sbjct: 427 KDGQYCCPQDYPVCNTATGYCLKGSGNWTEMDGLKRRQS---FGGFRP 471 >ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prunus persica] gi|462420299|gb|EMJ24562.1| hypothetical protein PRUPE_ppa005615mg [Prunus persica] Length = 451 Score = 546 bits (1408), Expect = e-153 Identities = 276/458 (60%), Positives = 328/458 (71%), Gaps = 16/458 (3%) Frame = -3 Query: 1570 MSSIDSSRVILLLFLVSVNFHLYLCSP----TSDLFDSWCKQHGKSYSSEEERLYRLTVF 1403 M+ S +L LFLVS H L S TS+LF+ WCKQ+GKSYSS +E+LYRL+VF Sbjct: 1 MNPSSLSVFLLTLFLVS---HTCLSSSSSQTTSELFEVWCKQYGKSYSSAQEKLYRLSVF 57 Query: 1402 EDNLAFVNQHNSLTNSTYTLGVNAFADLTHHEFRVXXXXXXXXXXXXXXXXXSESTAGAG 1223 EDNLAFV QHN + NS+YTL +N F+DLTHHEF+ +S Sbjct: 58 EDNLAFVTQHNDMGNSSYTLSLNDFSDLTHHEFK----SSRLGFSPSFLSLKLKSDRKPS 113 Query: 1222 FVGDVPKSIDWRSKGAVTPVKDQGRCGACWSFSATGAIEGINQIVTGSLVSVSEQELIDC 1043 V D+P S+DWR KGAVT VKDQG CGACW+FS TGAIEGIN+IVTGSL+S+SEQEL+DC Sbjct: 114 VVRDLPSSLDWRKKGAVTNVKDQGSCGACWAFSTTGAIEGINKIVTGSLISLSEQELVDC 173 Query: 1042 DRSY-NSGCGGGLMDYAFQFVVKNHGIDTEKDYPYQETDRTCNRNKLKRLVVTIDGFIDV 866 DR Y N+GC GGLMD AF+FV+ 
N+GIDTE+DYPY+ D TC + KLKR VTID + DV Sbjct: 174 DRVYPNNGCNGGLMDDAFRFVIDNNGIDTEEDYPYKGWDDTCIKKKLKRNAVTIDDYTDV 233 Query: 865 PSYNEEEILKAVASQPVSVGICGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVD 686 PS +EE++L+AVASQPVSVGI GSD GFQLYSKGIF+GPCSTSLDHAVLIVGYGSENGVD Sbjct: 234 PSNDEEQLLQAVASQPVSVGISGSDMGFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVD 293 Query: 685 YWILKNSWGKNWGMNGYMHMQRNSGNKEGVCGINMLASYPIKTXXXXXXXXXXXPTQCSL 506 YWI+KNSWG +WGMNGYMHM R+ N +G+CGIN LASYPIKT PT+C + Sbjct: 294 YWIVKNSWGTHWGMNGYMHMLRDHSNPKGICGINTLASYPIKT-GENPPLPPPGPTRCDI 352 Query: 505 LSYCGSGETCCCGWRLLGICLSWKCCEAESAVCCNDHVSCCPSHYPVCDTERKQCLQ--- 335 ++C +GETCCC R++GIC SW+CCE +SAVCC D CCP YP+CDTER CLQ Sbjct: 353 FTHCAAGETCCCAKRVVGICFSWRCCELDSAVCCKDQRHCCPRDYPICDTERTLCLQSNE 412 Query: 334 -------AAGNSTMVKGLEKKGSIWKFG-GLNPLFEAW 245 A GN T K LE +GS+ K G G + W Sbjct: 413 QLSTQSHATGNLTS-KALESRGSLRKSGRGWGSMIRDW 449 >gb|EYU36745.1| hypothetical protein MIMGU_mgv1a006749mg [Mimulus guttatus] Length = 433 Score = 545 bits (1405), Expect = e-152 Identities = 249/403 (61%), Positives = 312/403 (77%), Gaps = 5/403 (1%) Frame = -3 Query: 1486 SDLFDSWCKQHGKSYSSEEERLYRLTVFEDNLAFVNQHNSLTNSTYTLGVNAFADLTHHE 1307 SDLFDSWC+++GK+Y+SE+E+ +RL VF +N +VNQHN+ NS+YTL VNAFADLT+HE Sbjct: 26 SDLFDSWCEEYGKTYASEQEKQHRLNVFHENYKYVNQHNADANSSYTLSVNAFADLTNHE 85 Query: 1306 FRVXXXXXXXXXXXXXXXXXSESTA---GAGFV--GDVPKSIDWRSKGAVTPVKDQGRCG 1142 FR S S + G + ++P S+DWR+KGAVT VKDQG CG Sbjct: 86 FRANYLGLSPSKSDSVIRLNSRSASAIDGDNLIKESEIPSSLDWRNKGAVTAVKDQGSCG 145 Query: 1141 ACWSFSATGAIEGINQIVTGSLVSVSEQELIDCDRSYNSGCGGGLMDYAFQFVVKNHGID 962 ACWSFSATGA+EGINQI TGSLVS+SEQELIDCD+SYN GC GGLMDYA+ F++KN GID Sbjct: 146 ACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCNGGLMDYAYDFIIKNKGID 205 Query: 961 TEKDYPYQETDRTCNRNKLKRLVVTIDGFIDVPSYNEEEILKAVASQPVSVGICGSDRGF 782 TE+DY Y+ TC++NK+ + VVTID ++D+P +E+++L+AVA+QP+SVGICGSD F Sbjct: 206 TEEDYSYKGRSATCDKNKMNKHVVTIDSYVDIPEKDEKKLLQAVATQPISVGICGSDSSF 265 Query: 781 QLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKNWGMNGYMHMQRNSGNKE 602 QLYS 
GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWGK+WG+ GYMHM RNSG++E Sbjct: 266 QLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGKSWGIKGYMHMVRNSGSEE 325 Query: 601 GVCGINMLASYPIKTXXXXXXXXXXXPTQCSLLSYCGSGETCCCGWRLLGICLSWKCCEA 422 GVCGIN LASYP+K+ PT+C++ +YC SGETCCC LG+CLSW CCEA Sbjct: 326 GVCGINTLASYPVKSSTNPPPSPTPGPTKCNIFTYCSSGETCCCARYFLGVCLSWNCCEA 385 Query: 421 ESAVCCNDHVSCCPSHYPVCDTERKQCLQAAGNSTMVKGLEKK 293 ESAVCC+DH CCP YPVCDT++ CL+ +GN+T+ K L KK Sbjct: 386 ESAVCCDDHRHCCPHDYPVCDTKKNLCLKKSGNTTVSKPLGKK 428