BLASTX nr result
ID: Catharanthus23_contig00002882
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus23_contig00002882 (2291 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S... 615 e-173 ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S... 605 e-170 gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A... 593 e-166 ref|XP_002510459.1| cysteine protease, putative [Ricinus communi... 591 e-166 ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha... 590 e-166 dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] 590 e-166 ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab... 588 e-165 ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr... 588 e-165 ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C... 588 e-165 ref|XP_002307688.2| cysteine protease family protein [Populus tr... 588 e-165 gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao] 584 e-164 ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr... 583 e-164 ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C... 583 e-163 gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] 578 e-162 ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Caps... 570 e-159 gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase... 569 e-159 ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis ... 560 e-156 ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a... 550 e-153 gb|ESW08035.1| hypothetical protein PHAVU_009G013000g [Phaseolus... 547 e-153 ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [F... 547 e-153 >ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum] Length = 439 Score = 615 bits (1586), Expect = e-173 Identities = 288/437 (65%), Positives = 345/437 (78%), Gaps = 1/437 (0%) Frame = +2 Query: 194 MNWVWPFWT-VFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDY 370 MNW+ P V L+F C+ SS++ DLFE W +++GK+YSSEQE+ YRFKVFE+NY Y Sbjct: 1 MNWLLPSLVLVLLIFQQPFCTCSSIS-DLFETWCQQNGKKYSSEQERVYRFKVFEENYAY 59 Query: 371 ITQHNNMGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGT 550 IT+HN+ NS+YTL LNA++DLTHHEF+ +LGLS+SAND IRL G S E+G + Sbjct: 60 ITEHNSKENSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDFIRLKGRG-SGSSETGVLSD 118 Query: 551 NDIPSSLDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDR 730 D PSSLDWR+KGAVT+VK+QGSCGACW+FSATGAMEGINKI TGSLVSLSEQEL+DCDR Sbjct: 119 VDAPSSLDWREKGAVTDVKNQGSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDR 178 Query: 731 SYNSGCEGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRG 910 SYN GC GGLMD A+EFV+KN GIDTEKDYPF+ R+GTCN+NKL RHVVTIDGY D+ + Sbjct: 179 SYNEGCGGGLMDYAFEFVIKNGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQN 238 Query: 911 NEKELLQAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWI 1090 +E +LL+AVA QPVSVGICGS RAFQ YS+GIFTGPCST LDHAVLIVGY S++GVDYWI Sbjct: 239 DEDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWI 298 Query: 1091 VKNSWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSS 1270 +KNSWG+SWGI+GYIH+ R++GN EG+CGIN LA KCS+F+S Sbjct: 299 IKNSWGTSWGINGYIHMQRNSGNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTS 358 Query: 1271 CPGGETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNST 1450 C GETCCC + LGIC SW+CC LDSAVCCKD RHCCP DYPICDT RNLCLK+ N+T Sbjct: 359 CGQGETCCCGSKFLGICLSWKCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNAT 418 Query: 1451 LVKKLENRGVFGELGNL 1501 +V++ + G+ G L Sbjct: 419 IVQQPQKEAFTGKFGGL 435 >ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum] Length = 439 Score = 605 bits (1560), Expect = e-170 Identities = 282/437 (64%), Positives = 341/437 (78%), Gaps = 1/437 (0%) Frame = +2 Query: 194 MNWVWPFWT-VFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDY 370 M W+ P V L+F +C+ SS++ DLFE W +++GK+YSSEQE+ YRFKVFE+NY Y Sbjct: 1 MKWLLPSLVLVLLIFQQPLCTCSSIS-DLFETWCQQNGKKYSSEQERMYRFKVFEENYAY 59 Query: 371 ITQHNNMGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGT 550 IT+HN+ GNS+YTL LNA++DLTHHEF+ +LGLS+SAND IRL G S +G + Sbjct: 60 ITEHNSKGNSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDFIRLKGRG-SGSSAAGVLSD 118 Query: 551 NDIPSSLDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDR 730 D PSSLDWR KGAVTNVK+QGSCGACW+FSATGA+EGINKI TGSLVSLSEQEL+DCDR Sbjct: 119 VDAPSSLDWRDKGAVTNVKNQGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDR 178 Query: 731 SYNSGCEGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRG 910 SYN GC GGLMD A+EFV+KN GIDTEKDYPF+ ++GTCN+NKL R VVTIDGY D+ + Sbjct: 179 SYNQGCGGGLMDYAFEFVIKNGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQN 238 Query: 911 NEKELLQAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWI 1090 +E +LL+AVA QPVSVGICGS RAFQ YS+GIFTGPC T+LDHAVLIVGY S++G DYWI Sbjct: 239 DEDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWI 298 Query: 1091 VKNSWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSS 1270 +KNSWG+SWGI+GYIH+ R++GN EG+CG+N LA KCS F+S Sbjct: 299 IKNSWGTSWGINGYIHMQRNSGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTS 358 Query: 1271 CPGGETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNST 1450 C GETCCC + LGIC SW+CC LDSAVCCKD RHCCP DYPICDT RNLCLK+ N+T Sbjct: 359 CGQGETCCCGLKFLGICLSWKCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNAT 418 Query: 1451 LVKKLENRGVFGELGNL 1501 +V++ + G+ G L Sbjct: 419 IVQQPQKEPFTGKFGGL 435 >gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] Length = 437 Score = 593 bits (1528), Expect = e-166 Identities = 275/435 (63%), Positives = 336/435 (77%) Frame = +2 Query: 218 TVFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNMGN 397 T F L VS SSS ++LF++W ++HGK Y SE+E+Q R ++F+DN+D++TQHN + N Sbjct: 12 TFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITN 71 Query: 398 STYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSLDW 577 +TY+LSLNAFADLTHHEFKA LGLS SA +I ++G G+ +P S+DW Sbjct: 72 ATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIMASKG-------QSLGGSVKVPDSVDW 124 Query: 578 RKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGG 757 RKKGAVTNVKDQGSCGACW+FSATGAMEGIN+IVTG L+SLSEQEL+DCD+SYN+GC GG Sbjct: 125 RKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGG 184 Query: 758 LMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQAV 937 LMD A+EFV+KN GIDTEKDYP+Q RDGTC ++KL + VVTID Y V +EK L++AV Sbjct: 185 LMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAV 244 Query: 938 AAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSWGSSW 1117 AAQPVSVGICGSERAFQLYSRGIF+GPCST+LDHAVLIVGY SQ+GVDYWIVKNSWG SW Sbjct: 245 AAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSW 304 Query: 1118 GIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPGGETCCC 1297 G+DG++H+ R+T NS+GVCGIN LA KC+LF+ C GETCCC Sbjct: 305 GMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCC 364 Query: 1298 SWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTLVKKLENRG 1477 + +L G+CFSW+CCE++SAVCCKD RHCCPHDYP+CDT R+LCLKKTGN T +K + Sbjct: 365 ARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKN 424 Query: 1478 VFGELGNLNAFFQNW 1522 +LG F+ W Sbjct: 425 SSKQLGR----FEEW 435 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 591 bits (1523), Expect = e-166 Identities = 279/418 (66%), Positives = 338/418 (80%), Gaps = 5/418 (1%) Frame = +2 Query: 194 MNWVWPFWTVFLLFH---VSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNY 364 MN++ + + LLF +S SSSS + LFE+W+KEHGK Y+S+++K YRFK+FE+NY Sbjct: 1 MNFLSALFLITLLFFNLSISSFSSSSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENY 60 Query: 365 DYITQHNNMGNSTYTLSLNAFADLTHHEFKAKYLGLSA--SANDLIRLNRGGLSPFGESG 538 +++ +HN+ GNS+YTLSLNAFADLTHHEFKA LGLSA ++ L R N F Sbjct: 61 EFVKKHNSQGNSSYTLSLNAFADLTHHEFKASRLGLSAFSTSGKLSRRN------FPLHD 114 Query: 539 AVGTNDIPSSLDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELV 718 VG D+P S+DWRKKGAV+ VKDQG+CGACW+FSATGA+EGINKIVTGSLVSLSEQELV Sbjct: 115 FVG--DVPISIDWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELV 172 Query: 719 DCDRSYNSGCEGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRD 898 DCDRSYN+GCEGGLMD AY+FV++N GIDTE+DYP+QAR+ TCN+ KL RHVVTIDGY D Sbjct: 173 DCDRSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTD 232 Query: 899 VTRGNEKELLQAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGV 1078 V + NEKELL+AVAAQPVSVGICGSERAFQLYS+GIFTGPCST+LDHAVLIVGY S++GV Sbjct: 233 VPQNNEKELLKAVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGV 292 Query: 1079 DYWIVKNSWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCS 1258 DYWIVKNSWG+ WGI+GY+++ R++GNS+G+CGIN LA KC Sbjct: 293 DYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCD 352 Query: 1259 LFSSCPGGETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLK 1432 LF+ C GETCCC+ ++ G+CFSW+CCELDSAVCCKD HCCPHDYP+CDTKRN+CLK Sbjct: 353 LFTRCGEGETCCCTRRIFGLCFSWKCCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410 >ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana] gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana] gi|332190386|gb|AEE28507.1| papain-like cysteine peptidase [Arabidopsis thaliana] Length = 437 Score = 590 bits (1522), Expect = e-166 Identities = 274/435 (62%), Positives = 335/435 (77%) Frame = +2 Query: 218 TVFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNMGN 397 T F L VS SSS ++LF++W ++HGK Y SE+E+Q R ++F+DN+D++TQHN + N Sbjct: 12 TFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITN 71 Query: 398 STYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSLDW 577 +TY+LSLNAFADLTHHEFKA LGLS SA +I ++G G+ +P S+DW Sbjct: 72 ATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIMASKG-------QSLGGSVKVPDSVDW 124 Query: 578 RKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGG 757 RKKGAVTNVKDQGSCGACW+FSATGAMEGIN+IVTG L+SLSEQEL+DCD+SYN+GC GG Sbjct: 125 RKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGG 184 Query: 758 LMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQAV 937 LMD A+EFV+KN GIDTEKDYP+Q RDGTC ++KL + VVTID Y V +EK L++AV Sbjct: 185 LMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAV 244 Query: 938 AAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSWGSSW 1117 AAQPVSVGICGSERAFQLYS GIF+GPCST+LDHAVLIVGY SQ+GVDYWIVKNSWG SW Sbjct: 245 AAQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSW 304 Query: 1118 GIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPGGETCCC 1297 G+DG++H+ R+T NS+GVCGIN LA KC+LF+ C GETCCC Sbjct: 305 GMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCC 364 Query: 1298 SWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTLVKKLENRG 1477 + +L G+CFSW+CCE++SAVCCKD RHCCPHDYP+CDT R+LCLKKTGN T +K + Sbjct: 365 ARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKN 424 Query: 1478 VFGELGNLNAFFQNW 1522 +LG F+ W Sbjct: 425 SSKQLGR----FEEW 435 >dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] Length = 441 Score = 590 bits (1522), Expect = e-166 Identities = 276/448 (61%), Positives = 342/448 (76%), Gaps = 3/448 (0%) Frame = +2 Query: 194 MNWVWPFWTVFLLFHVSICSSSSLT-ADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDY 370 MN + FLL ++ + SSSS A LFE W ++HGK Y+S++EK +R KVF+DNYD+ Sbjct: 1 MNSNCALFVAFLLSYLFLFSSSSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDF 60 Query: 371 ITQHNNMGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGG--LSPFGESGAV 544 +T+HN+ GNS+YTLSLNAFADLTHHEFKA LGLS++A+ + ++R + F Sbjct: 61 VTEHNSQGNSSYTLSLNAFADLTHHEFKASRLGLSSAASASLNVDRSNRQIPDF------ 114 Query: 545 GTNDIPSSLDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDC 724 D+P+S+DWRK GAVT VKDQG+CGACW+FSATGA+EGINKIVTGSLVSLSEQELVDC Sbjct: 115 -VADVPASVDWRKNGAVTQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDC 173 Query: 725 DRSYNSGCEGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVT 904 D+SYN+GCEGG+MD A++FV+ N GIDTE+DYP+Q RD +CN+ KL RHVVTIDGY DV Sbjct: 174 DKSYNNGCEGGIMDYAFQFVIDNHGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVP 233 Query: 905 RGNEKELLQAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDY 1084 + NEKELL+AVA QPVSVGICGSERAFQLYS+GIFTGPCST+LDHAVLIVGY S++GVDY Sbjct: 234 QNNEKELLKAVANQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDY 293 Query: 1085 WIVKNSWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLF 1264 WIVKNSWGS WG+DGY+H+ R++G+S G+CGIN LA +C LF Sbjct: 294 WIVKNSWGSYWGMDGYMHMQRNSGSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLF 353 Query: 1265 SSCPGGETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGN 1444 + C GETCCC + GIC SW+CCELDSAVCCKD RHCCP DYP+CDT RN+CLK GN Sbjct: 354 THCGEGETCCCVHHIFGICLSWKCCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGN 413 Query: 1445 STLVKKLENRGVFGELGNLNAFFQNWNL 1528 +T ++K G+ + ++ + W L Sbjct: 414 ATRIEKFAKNSSSGKFRSWSSLLEGWIL 441 >ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] Length = 439 Score = 588 bits (1517), Expect = e-165 Identities = 276/437 (63%), Positives = 333/437 (76%), Gaps = 2/437 (0%) Frame = +2 Query: 218 TVFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNMGN 397 T F L VS SSS ++LF++W + HGK Y SE+E+Q R ++F+DN+D++TQHN + N Sbjct: 12 TFFFLLLVSSPSSSDDISELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITN 71 Query: 398 STYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSLDW 577 +TY+LSLNAFADLTHHEFKA LGLS SA+ LI ++G G +P S+DW Sbjct: 72 ATYSLSLNAFADLTHHEFKASRLGLSVSASSLIMASKG-------QSLGGNAKVPDSVDW 124 Query: 578 RKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGG 757 RKKGAVTNVKDQGSCGACW+FSATGAMEGIN+IVTG L+SLSEQEL+DCD+SYN+GC GG Sbjct: 125 RKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGG 184 Query: 758 LMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQAV 937 LMD A+EFV+KN GIDTEKDYP+Q RDGTC ++KL + VVTID Y V +EK L +AV Sbjct: 185 LMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAV 244 Query: 938 AAQPVSVGICGSERAFQLYSR--GIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSWGS 1111 AAQPVSVGICGSERAFQLYSR GIF+GPCST+LDHAVLIVGY SQ+GVDYWIVKNSWG Sbjct: 245 AAQPVSVGICGSERAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGK 304 Query: 1112 SWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPGGETC 1291 SWG+DG++H+ R+TGNSEG+CGIN LA KC+LF+ C GETC Sbjct: 305 SWGMDGFMHMQRNTGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGETC 364 Query: 1292 CCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTLVKKLEN 1471 CC+ L G+CFSW+CCE++SAVCC D RHCCPHDYP+CDT R+LCLKKTGN T +K Sbjct: 365 CCARNLFGLCFSWKCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWK 424 Query: 1472 RGVFGELGNLNAFFQNW 1522 + +LG F+ W Sbjct: 425 KDSSNKLGR----FEGW 437 >ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] gi|557537201|gb|ESR48319.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] Length = 441 Score = 588 bits (1516), Expect = e-165 Identities = 278/439 (63%), Positives = 330/439 (75%), Gaps = 6/439 (1%) Frame = +2 Query: 224 FLLFHVSICSSSSLTA-----DLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNN 388 F L + + SS L +LFE W K+HGK YSSEQEKQ R K+FEDNY ++TQHNN Sbjct: 6 FFLLSILLLSSLPLNYCSDINELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQHNN 65 Query: 389 MGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSS 568 MGNS++TLSLNAFADLTH EFKA +LG SA++ D R + G D+P+S Sbjct: 66 MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGT-----LRDVPAS 120 Query: 569 LDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGC 748 +DWRKKGAVT VKDQ SCGACWAFSATGA+EGINKIVTGSLVSLSEQEL+DCDRSYNSGC Sbjct: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGC 180 Query: 749 EGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELL 928 GGLMD AY+FV+KN GIDTEKDYP++ + G CN+ KLNRH+VTIDGY+DV NEK+LL Sbjct: 181 GGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLL 240 Query: 929 QAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSWG 1108 QAV AQPVSVGICGSERAFQLYS GIFTGPCST+LDHAVLIVGYDS++GVDYWI+KNSWG Sbjct: 241 QAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWG 300 Query: 1109 SSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPGGET 1288 SWG++GY+H+ R+TGNS G+CGIN LA +CSL + C GET Sbjct: 301 RSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGET 360 Query: 1289 CCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKK-TGNSTLVKKL 1465 CCC +LGIC SW+CC SAVCC DHR+CCP +YPICD+ R+ CL + TGN T + + Sbjct: 361 CCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAI 420 Query: 1466 ENRGVFGELGNLNAFFQNW 1522 E RG + G+ ++F W Sbjct: 421 EMRGSSWKFGSWSSFIDVW 439 >ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis] Length = 441 Score = 588 bits (1515), Expect = e-165 Identities = 272/418 (65%), Positives = 323/418 (77%), Gaps = 1/418 (0%) Frame = +2 Query: 272 DLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNMGNSTYTLSLNAFADLTHHEF 451 +LFE W K+HGK YSSEQEKQ R K+FEDNY ++TQHNNMGNS++TLSLNAFADLTH EF Sbjct: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86 Query: 452 KAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSLDWRKKGAVTNVKDQGSCGAC 631 KA +LG SA++ D R + G D+P+S+DWRKKGAVT VKDQ SCGAC Sbjct: 87 KASFLGFSAASIDHDRRRNASVQSPGN-----LRDVPASIDWRKKGAVTEVKDQASCGAC 141 Query: 632 WAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDPAYEFVVKNKGIDTE 811 WAFSATGA+EGINKIVTGSLVSLSEQEL+DCDRSYNSGC GGLMD AY+FV+KN GIDTE Sbjct: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201 Query: 812 KDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQAVAAQPVSVGICGSERAFQL 991 KDYP++ + G CN+ KLNRH+VTIDGY+DV NEK+LLQAV AQPVSVGICGSERAFQL Sbjct: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261 Query: 992 YSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSWGSSWGIDGYIHIARSTGNSEGV 1171 YS GIFTGPCST+LDHAVLI+GYDS++GVDYWI+KNSWG SWG++GY+H+ R+TGNS G+ Sbjct: 262 YSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321 Query: 1172 CGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPGGETCCCSWQLLGICFSWQCCELDS 1351 CGIN LA +CSL + C GETCCC +LGIC SW+CC S Sbjct: 322 CGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCCCGSSILGICLSWKCCGFSS 381 Query: 1352 AVCCKDHRHCCPHDYPICDTKRNLCLKK-TGNSTLVKKLENRGVFGELGNLNAFFQNW 1522 AVCC DHR+CCP +YPICD+ R+ CL + TGN T + +E RG + G+ ++F W Sbjct: 382 AVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIEMRGSSWKFGSWSSFIDAW 439 >ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa] gi|550339725|gb|EEE94684.2| cysteine protease family protein [Populus trichocarpa] Length = 436 Score = 588 bits (1515), Expect = e-165 Identities = 272/443 (61%), Positives = 342/443 (77%) Frame = +2 Query: 194 MNWVWPFWTVFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYI 373 MN+++ F L+ +S +SSS + LFE W KEHGK Y+S++E+ +R KVFEDNYD++ Sbjct: 1 MNFLYIFALTLLISVLSPSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFV 60 Query: 374 TQHNNMGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTN 553 T+HN+ GNS+Y+L+LNAFADLTHHEFK LGLSA+ +L N +G VG Sbjct: 61 TKHNSKGNSSYSLALNAFADLTHHEFKTSRLGLSAAPLNLAHRN------LEITGVVG-- 112 Query: 554 DIPSSLDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRS 733 DIP+S+DWR KG VTNVKDQGSCGACW+FSATGA+EGINKIVTGSLVSLSEQEL++CD+S Sbjct: 113 DIPASIDWRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKS 172 Query: 734 YNSGCEGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGN 913 YN GC GGLMD A++FV+ N GIDTE+DYP++ARDGTCN++++ R VVTID Y DV N Sbjct: 173 YNDGCGGGLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENN 232 Query: 914 EKELLQAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIV 1093 EK+LLQAVAAQPVSVGICGSERAFQ+YS+GIFTGPCST+LDHAVLIVGY S++GVDYWIV Sbjct: 233 EKQLLQAVAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIV 292 Query: 1094 KNSWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSC 1273 KNSWG+ WG+ GY+H+ R++GNS+GVCGIN LA KC+L + C Sbjct: 293 KNSWGTGWGMRGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYC 352 Query: 1274 PGGETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTL 1453 GETCCC+ + GIC SW+CC LDSAVCCKD HCCPHDYP+CDT +N+C K+ GN+T Sbjct: 353 AAGETCCCARKFFGICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATR 412 Query: 1454 VKKLENRGVFGELGNLNAFFQNW 1522 ++ +E + G+ G+ N+ + W Sbjct: 413 MEAIEGK-TSGKFGSWNSLPEAW 434 >gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao] Length = 438 Score = 584 bits (1505), Expect = e-164 Identities = 278/439 (63%), Positives = 333/439 (75%) Frame = +2 Query: 212 FWTVFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNM 391 F FLLF +S S + LFE W +HGKRYSSE+EK YR KVFE+NY ++TQHN + Sbjct: 8 FLLSFLLFFDPSFASPSHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGV 67 Query: 392 GNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSL 571 GNS+Y+L+LNAFADLTHHEFKA LGLSA+A + R N G V DIP+S+ Sbjct: 68 GNSSYSLALNAFADLTHHEFKASRLGLSAAAIEGSRPN------LQLPGLV--RDIPASM 119 Query: 572 DWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGCE 751 DWR KGAVT VKDQGSCGACW+FSATGA+EGINKIVTG+LVSLSEQELVDCDRSYNSGCE Sbjct: 120 DWRTKGAVTKVKDQGSCGACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCE 179 Query: 752 GGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQ 931 GGLMD AY+FV+ N GID E+DYP+ R+ TCN+ K R VVTIDGY V NE LLQ Sbjct: 180 GGLMDYAYQFVIDNHGIDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQ 239 Query: 932 AVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSWGS 1111 AVA QPVSVGICGSERAFQLYS+GIFTGPCS++LDHAVLIVGY S++GVDYWIVKNSWG+ Sbjct: 240 AVAKQPVSVGICGSERAFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGT 299 Query: 1112 SWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPGGETC 1291 WG++GYIH+ R++G+S+G+CGIN LA KC LF+ C GETC Sbjct: 300 RWGMNGYIHMLRNSGDSKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETC 359 Query: 1292 CCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTLVKKLEN 1471 CC+ ++ GICFSW+CCELDSAVCCKD+RHCCP+DYP+CDTK++ CLK+ GN+T ++ E Sbjct: 360 CCTHRIFGICFSWKCCELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFEK 419 Query: 1472 RGVFGELGNLNAFFQNWNL 1528 R + + F +NW L Sbjct: 420 RHSTRKFSSWRPFVENWVL 438 >ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] gi|557095297|gb|ESQ35879.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] Length = 444 Score = 583 bits (1504), Expect = e-164 Identities = 273/432 (63%), Positives = 331/432 (76%) Frame = +2 Query: 227 LLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNMGNSTY 406 L F +S SS + A+LF++W HGK Y SE+E+Q+R ++F DN+D++TQHN++ NSTY Sbjct: 21 LSFSISSSSSDDI-AELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHNHISNSTY 79 Query: 407 TLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSLDWRKK 586 +LSLNAFADLTHHEFKA LGLSA + L+ + G S V +P S+DWRKK Sbjct: 80 SLSLNAFADLTHHEFKASRLGLSAPSPSLMAKEQS----LGVSERVRVK-VPDSVDWRKK 134 Query: 587 GAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMD 766 GAVTNVKDQGSCGACW+FSATGAMEGIN+IVTG L+SLSEQEL+DCD+SYN+GC GGLMD Sbjct: 135 GAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMD 194 Query: 767 PAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQAVAAQ 946 A+EFV+KN GIDTEKDYP+Q +DGTC ++KL + VVTID Y V NEK L++AVA+Q Sbjct: 195 YAFEFVIKNHGIDTEKDYPYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEAVASQ 254 Query: 947 PVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSWGSSWGID 1126 PVSVGICGSERAFQLYS GIF+GPCST+LDHAVLIVGY SQ+GVDYWIVKNSWG SWG+D Sbjct: 255 PVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMD 314 Query: 1127 GYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPGGETCCCSWQ 1306 G++H+ R+TGNSEGVCGIN LA KC+LF+ C GETCCC+ Sbjct: 315 GFMHMQRNTGNSEGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCART 374 Query: 1307 LLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTLVKKLENRGVFG 1486 L G+CFSW+CCEL+SAVCCKD RHCCP DYP+CDT ++LCLKKTGN T +K + Sbjct: 375 LFGLCFSWKCCELESAVCCKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPFWKKNSSN 434 Query: 1487 ELGNLNAFFQNW 1522 +LG F+ W Sbjct: 435 KLGR----FEEW 442 >ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 431 Score = 583 bits (1502), Expect = e-163 Identities = 275/438 (62%), Positives = 331/438 (75%) Frame = +2 Query: 197 NWVWPFWTVFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYIT 376 N+ + F T+FLL + ++S+++ +LFE W EHGK YSS +EK YR VF DNY+++T Sbjct: 3 NYAFHFLTLFLLLFRPLSATSNVS-ELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVT 61 Query: 377 QHNNMGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTND 556 HNN+ NS+YTLSLN++ADLTHHEFK LG S + + P D Sbjct: 62 HHNNLDNSSYTLSLNSYADLTHHEFKVSRLGFSPALRNF--------RPVLPQEPSLPRD 113 Query: 557 IPSSLDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSY 736 +P SLDWRKKGAVT VKDQGSCGACW+FSATGAMEGIN+I+TGSL+SLSEQEL+DCDRSY Sbjct: 114 VPDSLDWRKKGAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSY 173 Query: 737 NSGCEGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNE 916 NSGC GGLMD AY+FV+ N GIDTE DYP+QARDG+C ++KL R+VVTIDGY D+ +E Sbjct: 174 NSGCGGGLMDYAYQFVISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDE 233 Query: 917 KELLQAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVK 1096 +LLQAVAAQPVSVGICGSERAFQLYS+GIF+GPCST+LDHAVLIVGY S++GVDYWIVK Sbjct: 234 GKLLQAVAAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVK 293 Query: 1097 NSWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCP 1276 NSWG SWG+DGY+H+ R++GNSEGVCGIN LA KCS+ +SC Sbjct: 294 NSWGKSWGMDGYMHMQRNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCA 353 Query: 1277 GGETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTLV 1456 GETCCC+ + LG+C SW+CC L SAVCCKD RHCCP DYPICDT RNLCLK+T N T Sbjct: 354 AGETCCCAKKFLGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRT 413 Query: 1457 KKLENRGVFGELGNLNAF 1510 + LENR G G ++F Sbjct: 414 EILENRSSSGSSGTWSSF 431 >gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] Length = 517 Score = 578 bits (1490), Expect = e-162 Identities = 270/406 (66%), Positives = 320/406 (78%) Frame = +2 Query: 218 TVFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNMGN 397 ++ L H+S+ S S ++ LFE W ++HG+ YSSE+E+ YR VFEDN ++TQHNNMGN Sbjct: 10 SLLLSSHLSLSSPSLNSSQLFEAWCEKHGQSYSSEEERLYRLTVFEDNLAFVTQHNNMGN 69 Query: 398 STYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSLDW 577 S+YTLSLNAFADLTHHEFK+ LG S++ L L + G S + D+P+SLDW Sbjct: 70 SSYTLSLNAFADLTHHEFKSSRLGFSSAL--LSSLPKLG------SKLLDLRDVPASLDW 121 Query: 578 RKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGG 757 RKKGAVTNVKDQGSCGACWAFSATGA+EGINKIVTGSLVSLSEQEL+DCD SYN+GC+GG Sbjct: 122 RKKGAVTNVKDQGSCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNAGCDGG 181 Query: 758 LMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQAV 937 LMD AY+FV+ N GIDTE+DYP+QARD +C + KL R VVTIDGY DV N +LLQAV Sbjct: 182 LMDYAYQFVIDNHGIDTEEDYPYQARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQLLQAV 241 Query: 938 AAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSWGSSW 1117 QPVSVGICGSERAFQLYS+GIFTGPCST+LDHAVLIVGYDS++GVDYWIVKNSWG W Sbjct: 242 VTQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYDSENGVDYWIVKNSWGKQW 301 Query: 1118 GIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPGGETCCC 1297 G+DGYIH+ R+TGNS+GVCGIN LA +CS F+ C GETCCC Sbjct: 302 GMDGYIHMQRNTGNSQGVCGINMLASYPTKTSPNPPPSPSPGPTRCSFFAQCGEGETCCC 361 Query: 1298 SWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKK 1435 SW+ LG+CFSW+CC L+SAVCCKD HCCP DYP+CDT+RN+CLK+ Sbjct: 362 SWRFLGLCFSWKCCGLNSAVCCKDKIHCCPQDYPLCDTQRNVCLKE 407 >ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Capsella rubella] gi|482576142|gb|EOA40329.1| hypothetical protein CARUB_v10009056mg [Capsella rubella] Length = 467 Score = 570 bits (1468), Expect = e-159 Identities = 274/465 (58%), Positives = 332/465 (71%), Gaps = 30/465 (6%) Frame = +2 Query: 218 TVFLLFHVSICSSSSLT--ADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNM 391 T F L VS SSSS ++LF++W + HGK Y+SE EKQ+RF++F DN+D++TQHN + Sbjct: 12 TFFFLLLVSSSSSSSSDDISELFDDWCQRHGKTYASEAEKQHRFQIFRDNHDFVTQHNLI 71 Query: 392 GNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSL 571 N+TY+LSLNAFADL H EFK LGLS SA +I ++G G+ +P SL Sbjct: 72 TNATYSLSLNAFADLNHSEFKTSRLGLSVSAPSVIMASKG-------KSLGGSVKVPDSL 124 Query: 572 DWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGCE 751 DWRKKGAVTNVKDQGSCGACW+FSATGAMEGIN+IVTG L+SLSEQEL+DCD+SYN GC Sbjct: 125 DWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNDGCN 184 Query: 752 GGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQ 931 GGLMD A+EFV+KNKGIDTEKDYP+Q RDGTC ++KL + VV+ID Y V +EK LL+ Sbjct: 185 GGLMDYAFEFVIKNKGIDTEKDYPYQERDGTCKKDKLKQRVVSIDSYAGVKPSDEKALLE 244 Query: 932 AVAAQPVSVGICGSERAFQLYS----------------------------RGIFTGPCST 1027 AVAAQPVSVGICGSERAFQLYS +GIF+GPCST Sbjct: 245 AVAAQPVSVGICGSERAFQLYSSVSFKIRDTSILSSECSTFPCLKLYLMMQGIFSGPCST 304 Query: 1028 NLDHAVLIVGYDSQDGVDYWIVKNSWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXX 1207 +LDHAVLIVGY SQ+GVDYWIVKNSWG SWG+DG++H+ R+TGNS+G+CGIN LA Sbjct: 305 SLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSQGICGINMLASYPIK 364 Query: 1208 XXXXXXXXXXXXXVKCSLFSSCPGGETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCP 1387 KC+LF+ C ETCCC+ L G+C SW+CCE++SAVCCKD RHCCP Sbjct: 365 THPNPPPPSPPGPTKCNLFTYCSAAETCCCARNLFGLCLSWKCCEIESAVCCKDGRHCCP 424 Query: 1388 HDYPICDTKRNLCLKKTGNSTLVKKLENRGVFGELGNLNAFFQNW 1522 HDYP+CDT R+LCLKKTGN T +K + +LG F+ W Sbjct: 425 HDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSSNKLGR----FEEW 465 >gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135) [Arabidopsis thaliana] Length = 416 Score = 569 bits (1466), Expect = e-159 Identities = 264/412 (64%), Positives = 322/412 (78%), Gaps = 7/412 (1%) Frame = +2 Query: 218 TVFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNMGN 397 T F L VS SSS ++LF++W ++HGK Y SE+E+Q R ++F+DN+D++TQHN + N Sbjct: 10 TFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITN 69 Query: 398 STYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSLDW 577 +TY+LSLNAFADLTHHEFKA LGLS SA +I ++G G+ +P S+DW Sbjct: 70 ATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIMASKG-------QSLGGSVKVPDSVDW 122 Query: 578 RKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGG 757 RKKGAVTNVKDQGSCGACW+FSATGAMEGIN+IVTG L+SLSEQEL+DCD+SYN+GC GG Sbjct: 123 RKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGG 182 Query: 758 LMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQAV 937 LMD A+EFV+KN GIDTEKDYP+Q RDGTC ++KL + VVTID Y V +EK L++AV Sbjct: 183 LMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAV 242 Query: 938 AAQPVSVGICGSERAFQLYS-------RGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVK 1096 AAQPVSVGICGSERAFQLYS +GIF+GPCST+LDHAVLIVGY SQ+GVDYWIVK Sbjct: 243 AAQPVSVGICGSERAFQLYSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVK 302 Query: 1097 NSWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCP 1276 NSWG SWG+DG++H+ R+T NS+GVCGIN LA KC+LF+ C Sbjct: 303 NSWGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCS 362 Query: 1277 GGETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLK 1432 GETCCC+ +L G+CFSW+CCE++SAVCCKD RHCCPHDYP+CDT R+LCLK Sbjct: 363 SGETCCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 414 >ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera] gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera] Length = 436 Score = 560 bits (1443), Expect = e-156 Identities = 261/448 (58%), Positives = 321/448 (71%), Gaps = 4/448 (0%) Frame = +2 Query: 197 NWVWPFWTVFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYIT 376 +W+W + L H S+ +SS TADLFE W +++GK YSSE+EK R KVFE+N+ ++T Sbjct: 3 SWLWAVSILILAVHSSVSEASS-TADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVT 61 Query: 377 QHNNMGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTN- 553 QHN+M N++YTL+LNAFADLTHHEFKA LG S IR +VGT Sbjct: 62 QHNSMANASYTLALNAFADLTHHEFKASRLGFSPGRAQSIR-------------SVGTPV 108 Query: 554 ---DIPSSLDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDC 724 +P ++DWRK GAVT VKDQG+CG CW+FS TGA+EGINKIVTGSLVSLSEQELVDC Sbjct: 109 QELHVPPAVDWRKSGAVTGVKDQGNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDC 168 Query: 725 DRSYNSGCEGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVT 904 DRSYNSGCEGGLMD AY+FV+KN+GID+E DYP+ D CN+ KL +H+VTIDGY D+ Sbjct: 169 DRSYNSGCEGGLMDYAYQFVIKNQGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIP 228 Query: 905 RGNEKELLQAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDY 1084 +EK+LLQ VA QPVSVGICGSE+ FQLYS+G++TGPCS+ LDHAVLIVGY ++DGVD+ Sbjct: 229 PNDEKQLLQVVAKQPVSVGICGSEKTFQLYSKGVYTGPCSSTLDHAVLIVGYGTEDGVDF 288 Query: 1085 WIVKNSWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLF 1264 WIVKNSWG WG+ GYIH+ R+ G +EG+CGIN LA KC F Sbjct: 289 WIVKNSWGEHWGMRGYIHMLRNNGTAEGICGINMLASYPAKTSPNPPPPPTPGPTKCDFF 348 Query: 1265 SSCPGGETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGN 1444 SSC GETCCCSW+ +G+C SW CC SAVCC ++ +CCP +PICDTKRN CLK GN Sbjct: 349 SSCSEGETCCCSWRFIGVCLSWNCCTAKSAVCCDNNNYCCPASHPICDTKRNRCLKPAGN 408 Query: 1445 STLVKKLENRGVFGELGNLNAFFQNWNL 1528 T V+ L+ RG + G ++ WNL Sbjct: 409 GTGVEVLKRRGSSVKFGGWSSINDAWNL 436 >ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum] Length = 436 Score = 550 bits (1416), Expect = e-153 Identities = 261/426 (61%), Positives = 316/426 (74%), Gaps = 2/426 (0%) Frame = +2 Query: 212 FWTVFLLFHVSICSSSSL--TADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHN 385 F + LL +S+ S + T+ LF+ W K+HGK Y SEQEK+YRF VFEDNY ++ QHN Sbjct: 6 FMFLQLLLSLSLLSFVTAIDTSKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHN 65 Query: 386 NMGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPS 565 +GNS+YTLSLNAFADLTHHEFKA LGL S+ + NR + + +PS Sbjct: 66 QIGNSSYTLSLNAFADLTHHEFKATRLGLPPSSLLRFKFNRFQ----DQQRSDDFLQVPS 121 Query: 566 SLDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSG 745 +DWRK GAV+ VKDQGSCGACW+FSATGA+EGINKIVTGSLVSLSEQELVDCD +YNSG Sbjct: 122 EIDWRKNGAVSIVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSG 181 Query: 746 CEGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKEL 925 C+GGLMD AY+F++ N GIDTE+DYP+QAR C ++KL R VVTIDGY DV +EK+L Sbjct: 182 CDGGLMDYAYQFIIDNNGIDTEEDYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKL 241 Query: 926 LQAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSW 1105 L+AVA QPVSVGICGS RAFQLYS+GIFTGPCST+LDHAVLIVGY S++GVDYWIVKNSW Sbjct: 242 LKAVAVQPVSVGICGSARAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSW 301 Query: 1106 GSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPGGE 1285 G WG++GYIH+ R+T +S G+CGIN LA +KC+LF+ C GGE Sbjct: 302 GKYWGMNGYIHMLRNTDSSAGLCGINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGE 361 Query: 1286 TCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTLVKKL 1465 TCCC+ + LGICFSW+CC + SAVCCKD RHCCP DYP+CD CLK+ N T++ Sbjct: 362 TCCCAKKFLGICFSWKCCGVTSAVCCKDKRHCCPLDYPVCDASNGQCLKRIANGTILMTS 421 Query: 1466 ENRGVF 1483 + F Sbjct: 422 DKEDPF 427 >gb|ESW08035.1| hypothetical protein PHAVU_009G013000g [Phaseolus vulgaris] Length = 428 Score = 547 bits (1410), Expect = e-153 Identities = 269/419 (64%), Positives = 319/419 (76%), Gaps = 4/419 (0%) Frame = +2 Query: 212 FWTVFLLFHVSI-CSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHN- 385 F+++ +LF + + +S+S T+DLFE W KEH K YSSE+EK+YRF VFEDNY +++QHN Sbjct: 4 FFSLIILFTLYLPFASASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNR 63 Query: 386 --NMGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDI 559 N NSTYTLSLNAFADLTHHEFK LG S S L+R R Sbjct: 64 NANDNNSTYTLSLNAFADLTHHEFKTSRLGFSPS---LLRFKR-----VQNQQPRHLLHN 115 Query: 560 PSSLDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYN 739 PS +DWR+ GAVT VKDQ SCGACWAFSATGA+EGINKIVTGSL SLSEQELVDCD SYN Sbjct: 116 PSQIDWRQSGAVTPVKDQASCGACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYN 175 Query: 740 SGCEGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEK 919 SGCEGGLMD AY+FV+ NKGIDTE DYP+QAR CN++KL RH+VTID Y D+ NE+ Sbjct: 176 SGCEGGLMDYAYQFVIDNKGIDTEDDYPYQARQRPCNKDKLKRHIVTIDDYVDLP-PNEE 234 Query: 920 ELLQAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKN 1099 ELL+AVA+QPVSVGICGSERAFQLYS+GIF+GPCST+LDHAVLIVGY S++GVDYWIVKN Sbjct: 235 ELLKAVASQPVSVGICGSERAFQLYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKN 294 Query: 1100 SWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPG 1279 SWG WG++GYIH+ R+TG+ +G+CGINTLA V+C+LF+ C Sbjct: 295 SWGKYWGMEGYIHMIRNTGDPKGICGINTLA--SYPIKTKPNPPPPPAPVRCNLFTHCSE 352 Query: 1280 GETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTLV 1456 GETCCC+ LGICFSW+CC L SAVCCKD RHCCP DYPICDT+++ CLK T +T + Sbjct: 353 GETCCCAKSFLGICFSWKCCGLTSAVCCKDKRHCCPRDYPICDTEKSQCLKITNGTTTI 411 >ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [Fragaria vesca subsp. vesca] Length = 441 Score = 547 bits (1409), Expect = e-153 Identities = 267/439 (60%), Positives = 318/439 (72%), Gaps = 5/439 (1%) Frame = +2 Query: 221 VFLLFHVSICSSSSLTA-DLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNMGN 397 + LL H + SSSS ++ +LFE W K++GK YSS++EK YR +FE N +ITQHN++GN Sbjct: 12 LLLLSHPCLSSSSSSSSSELFEAWCKQYGKSYSSQEEKLYRLSLFEQNLAFITQHNDLGN 71 Query: 398 STYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSLDW 577 S+YTLSLN+F+DLTHHEFKA LG S + +RL R + +PSS+DW Sbjct: 72 SSYTLSLNSFSDLTHHEFKASRLGFSPT---FLRLYRKS-----DPKPSVVRHVPSSIDW 123 Query: 578 RKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSY-NSGCEG 754 RK GAVTNVKDQGSCGACW+FSATGA+EGINKIVTGSLVSLSEQEL+DCDR Y NSGC G Sbjct: 124 RKNGAVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIDCDRVYPNSGCNG 183 Query: 755 GLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQA 934 GLMD A++F++ N GIDTE+DYP+Q DGTCN+ KL RHVVTIDGY DV NE++LL+A Sbjct: 184 GLMDDAFQFIIDNNGIDTEEDYPYQGADGTCNKQKLKRHVVTIDGYTDVPANNEEQLLKA 243 Query: 935 VAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSWGSS 1114 VA QPVSVGI GS R FQ YS+GIF GPCST LDHAVLIVGY S++GVDYWIVKNSWG + Sbjct: 244 VATQPVSVGIAGSGREFQFYSKGIFAGPCSTTLDHAVLIVGYGSENGVDYWIVKNSWGKN 303 Query: 1115 WGIDGYIHIARSTGNSEGVCGINTLA---XXXXXXXXXXXXXXXXXXVKCSLFSSCPGGE 1285 WG++GYIHI R NS+G+CGIN LA KC LFS C GE Sbjct: 304 WGMNGYIHILRDHSNSKGLCGINMLASYPTKTGENPPFPPPSPPPGPTKCDLFSKCGVGE 363 Query: 1286 TCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTLVKKL 1465 TCCC+ ++LGIC SW+CCE SAVCCKD HCCPHDYPICDT+RN CL+ GN T+ + Sbjct: 364 TCCCARKILGICLSWRCCEFTSAVCCKDRLHCCPHDYPICDTERNYCLQANGNLTM-RAN 422 Query: 1466 ENRGVFGELGNLNAFFQNW 1522 E RG + A W Sbjct: 423 EIRGSLRKSSRSKAKLSYW 441