BLASTX nr result
ID: Catharanthus22_contig00008746
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00008746
         (2347 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S...   615   e-173
ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S...   605   e-170
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   593   e-166
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   591   e-166
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   590   e-166
dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          590   e-166
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   588   e-165
ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr...   588   e-165
ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C...   588   e-165
ref|XP_002307688.2| cysteine protease family protein [Populus tr...   588   e-165
gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao]                   584   e-164
ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr...   583   e-164
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   583   e-163
gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]                  578   e-162
ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Caps...   570   e-159
gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase...   569   e-159
ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis ...   560   e-156
ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a...   550   e-153
gb|ESW08035.1| hypothetical protein PHAVU_009G013000g [Phaseolus...
547 e-153 ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [F... 547 e-153 >ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum] Length = 439 Score = 615 bits (1586), Expect = e-173 Identities = 288/437 (65%), Positives = 345/437 (78%), Gaps = 1/437 (0%) Frame = +2 Query: 269 MNWVWPFWT-VFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDY 445 MNW+ P V L+F C+ SS++ DLFE W +++GK+YSSEQE+ YRFKVFE+NY Y Sbjct: 1 MNWLLPSLVLVLLIFQQPFCTCSSIS-DLFETWCQQNGKKYSSEQERVYRFKVFEENYAY 59 Query: 446 ITQHNNMGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGT 625 IT+HN+ NS+YTL LNA++DLTHHEF+ +LGLS+SAND IRL G S E+G + Sbjct: 60 ITEHNSKENSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDFIRLKGRG-SGSSETGVLSD 118 Query: 626 NDIPSSLDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDR 805 D PSSLDWR+KGAVT+VK+QGSCGACW+FSATGAMEGINKI TGSLVSLSEQEL+DCDR Sbjct: 119 VDAPSSLDWREKGAVTDVKNQGSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDR 178 Query: 806 SYNSGCEGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRG 985 SYN GC GGLMD A+EFV+KN GIDTEKDYPF+ R+GTCN+NKL RHVVTIDGY D+ + Sbjct: 179 SYNEGCGGGLMDYAFEFVIKNGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQN 238 Query: 986 NEKELLQAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWI 1165 +E +LL+AVA QPVSVGICGS RAFQ YS+GIFTGPCST LDHAVLIVGY S++GVDYWI Sbjct: 239 DEDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWI 298 Query: 1166 VKNSWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSS 1345 +KNSWG+SWGI+GYIH+ R++GN EG+CGIN LA KCS+F+S Sbjct: 299 IKNSWGTSWGINGYIHMQRNSGNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTS 358 Query: 1346 CPGGETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNST 1525 C GETCCC + LGIC SW+CC LDSAVCCKD RHCCP DYPICDT RNLCLK+ N+T Sbjct: 359 CGQGETCCCGSKFLGICLSWKCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNAT 418 Query: 1526 LVKKLENRGVFGELGNL 1576 +V++ + G+ G L Sbjct: 419 IVQQPQKEAFTGKFGGL 435 >ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum] Length = 439 Score = 605 bits (1560), Expect = e-170 Identities = 
282/437 (64%), Positives = 341/437 (78%), Gaps = 1/437 (0%) Frame = +2 Query: 269 MNWVWPFWT-VFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDY 445 M W+ P V L+F +C+ SS++ DLFE W +++GK+YSSEQE+ YRFKVFE+NY Y Sbjct: 1 MKWLLPSLVLVLLIFQQPLCTCSSIS-DLFETWCQQNGKKYSSEQERMYRFKVFEENYAY 59 Query: 446 ITQHNNMGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGT 625 IT+HN+ GNS+YTL LNA++DLTHHEF+ +LGLS+SAND IRL G S +G + Sbjct: 60 ITEHNSKGNSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDFIRLKGRG-SGSSAAGVLSD 118 Query: 626 NDIPSSLDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDR 805 D PSSLDWR KGAVTNVK+QGSCGACW+FSATGA+EGINKI TGSLVSLSEQEL+DCDR Sbjct: 119 VDAPSSLDWRDKGAVTNVKNQGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDR 178 Query: 806 SYNSGCEGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRG 985 SYN GC GGLMD A+EFV+KN GIDTEKDYPF+ ++GTCN+NKL R VVTIDGY D+ + Sbjct: 179 SYNQGCGGGLMDYAFEFVIKNGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQN 238 Query: 986 NEKELLQAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWI 1165 +E +LL+AVA QPVSVGICGS RAFQ YS+GIFTGPC T+LDHAVLIVGY S++G DYWI Sbjct: 239 DEDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWI 298 Query: 1166 VKNSWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSS 1345 +KNSWG+SWGI+GYIH+ R++GN EG+CG+N LA KCS F+S Sbjct: 299 IKNSWGTSWGINGYIHMQRNSGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTS 358 Query: 1346 CPGGETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNST 1525 C GETCCC + LGIC SW+CC LDSAVCCKD RHCCP DYPICDT RNLCLK+ N+T Sbjct: 359 CGQGETCCCGLKFLGICLSWKCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNAT 418 Query: 1526 LVKKLENRGVFGELGNL 1576 +V++ + G+ G L Sbjct: 419 IVQQPQKEPFTGKFGGL 435 >gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] Length = 437 Score = 593 bits (1528), Expect = e-166 Identities = 275/435 (63%), Positives = 336/435 (77%) Frame = +2 Query: 293 TVFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNMGN 472 T F L VS SSS ++LF++W ++HGK Y SE+E+Q R ++F+DN+D++TQHN + N Sbjct: 12 
TFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITN 71 Query: 473 STYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSLDW 652 +TY+LSLNAFADLTHHEFKA LGLS SA +I ++G G+ +P S+DW Sbjct: 72 ATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIMASKG-------QSLGGSVKVPDSVDW 124 Query: 653 RKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGG 832 RKKGAVTNVKDQGSCGACW+FSATGAMEGIN+IVTG L+SLSEQEL+DCD+SYN+GC GG Sbjct: 125 RKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGG 184 Query: 833 LMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQAV 1012 LMD A+EFV+KN GIDTEKDYP+Q RDGTC ++KL + VVTID Y V +EK L++AV Sbjct: 185 LMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAV 244 Query: 1013 AAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSWGSSW 1192 AAQPVSVGICGSERAFQLYSRGIF+GPCST+LDHAVLIVGY SQ+GVDYWIVKNSWG SW Sbjct: 245 AAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSW 304 Query: 1193 GIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPGGETCCC 1372 G+DG++H+ R+T NS+GVCGIN LA KC+LF+ C GETCCC Sbjct: 305 GMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCC 364 Query: 1373 SWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTLVKKLENRG 1552 + +L G+CFSW+CCE++SAVCCKD RHCCPHDYP+CDT R+LCLKKTGN T +K + Sbjct: 365 ARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKN 424 Query: 1553 VFGELGNLNAFFQNW 1597 +LG F+ W Sbjct: 425 SSKQLGR----FEEW 435 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 591 bits (1523), Expect = e-166 Identities = 279/418 (66%), Positives = 338/418 (80%), Gaps = 5/418 (1%) Frame = +2 Query: 269 MNWVWPFWTVFLLFH---VSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNY 439 MN++ + + LLF +S SSSS + LFE+W+KEHGK Y+S+++K YRFK+FE+NY Sbjct: 1 MNFLSALFLITLLFFNLSISSFSSSSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENY 60 Query: 440 DYITQHNNMGNSTYTLSLNAFADLTHHEFKAKYLGLSA--SANDLIRLNRGGLSPFGESG 613 +++ +HN+ GNS+YTLSLNAFADLTHHEFKA LGLSA ++ L R N F Sbjct: 61 
EFVKKHNSQGNSSYTLSLNAFADLTHHEFKASRLGLSAFSTSGKLSRRN------FPLHD 114 Query: 614 AVGTNDIPSSLDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELV 793 VG D+P S+DWRKKGAV+ VKDQG+CGACW+FSATGA+EGINKIVTGSLVSLSEQELV Sbjct: 115 FVG--DVPISIDWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELV 172 Query: 794 DCDRSYNSGCEGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRD 973 DCDRSYN+GCEGGLMD AY+FV++N GIDTE+DYP+QAR+ TCN+ KL RHVVTIDGY D Sbjct: 173 DCDRSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTD 232 Query: 974 VTRGNEKELLQAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGV 1153 V + NEKELL+AVAAQPVSVGICGSERAFQLYS+GIFTGPCST+LDHAVLIVGY S++GV Sbjct: 233 VPQNNEKELLKAVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGV 292 Query: 1154 DYWIVKNSWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCS 1333 DYWIVKNSWG+ WGI+GY+++ R++GNS+G+CGIN LA KC Sbjct: 293 DYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCD 352 Query: 1334 LFSSCPGGETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLK 1507 LF+ C GETCCC+ ++ G+CFSW+CCELDSAVCCKD HCCPHDYP+CDTKRN+CLK Sbjct: 353 LFTRCGEGETCCCTRRIFGLCFSWKCCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410 >ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana] gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana] gi|332190386|gb|AEE28507.1| papain-like cysteine peptidase [Arabidopsis thaliana] Length = 437 Score = 590 bits (1522), Expect = e-166 Identities = 274/435 (62%), Positives = 335/435 (77%) Frame = +2 Query: 293 TVFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNMGN 472 T F L VS SSS ++LF++W ++HGK Y SE+E+Q R ++F+DN+D++TQHN + N Sbjct: 12 TFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITN 71 Query: 473 STYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSLDW 652 +TY+LSLNAFADLTHHEFKA LGLS SA +I ++G G+ +P S+DW Sbjct: 72 ATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIMASKG-------QSLGGSVKVPDSVDW 124 Query: 653 
RKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGG 832 RKKGAVTNVKDQGSCGACW+FSATGAMEGIN+IVTG L+SLSEQEL+DCD+SYN+GC GG Sbjct: 125 RKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGG 184 Query: 833 LMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQAV 1012 LMD A+EFV+KN GIDTEKDYP+Q RDGTC ++KL + VVTID Y V +EK L++AV Sbjct: 185 LMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAV 244 Query: 1013 AAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSWGSSW 1192 AAQPVSVGICGSERAFQLYS GIF+GPCST+LDHAVLIVGY SQ+GVDYWIVKNSWG SW Sbjct: 245 AAQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSW 304 Query: 1193 GIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPGGETCCC 1372 G+DG++H+ R+T NS+GVCGIN LA KC+LF+ C GETCCC Sbjct: 305 GMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCC 364 Query: 1373 SWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTLVKKLENRG 1552 + +L G+CFSW+CCE++SAVCCKD RHCCPHDYP+CDT R+LCLKKTGN T +K + Sbjct: 365 ARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKN 424 Query: 1553 VFGELGNLNAFFQNW 1597 +LG F+ W Sbjct: 425 SSKQLGR----FEEW 435 >dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] Length = 441 Score = 590 bits (1522), Expect = e-166 Identities = 276/448 (61%), Positives = 342/448 (76%), Gaps = 3/448 (0%) Frame = +2 Query: 269 MNWVWPFWTVFLLFHVSICSSSSLT-ADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDY 445 MN + FLL ++ + SSSS A LFE W ++HGK Y+S++EK +R KVF+DNYD+ Sbjct: 1 MNSNCALFVAFLLSYLFLFSSSSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDF 60 Query: 446 ITQHNNMGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGG--LSPFGESGAV 619 +T+HN+ GNS+YTLSLNAFADLTHHEFKA LGLS++A+ + ++R + F Sbjct: 61 VTEHNSQGNSSYTLSLNAFADLTHHEFKASRLGLSSAASASLNVDRSNRQIPDF------ 114 Query: 620 GTNDIPSSLDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDC 799 D+P+S+DWRK GAVT VKDQG+CGACW+FSATGA+EGINKIVTGSLVSLSEQELVDC Sbjct: 115 -VADVPASVDWRKNGAVTQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDC 173 Query: 800 DRSYNSGCEGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVT 979 D+SYN+GCEGG+MD 
A++FV+ N GIDTE+DYP+Q RD +CN+ KL RHVVTIDGY DV Sbjct: 174 DKSYNNGCEGGIMDYAFQFVIDNHGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVP 233 Query: 980 RGNEKELLQAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDY 1159 + NEKELL+AVA QPVSVGICGSERAFQLYS+GIFTGPCST+LDHAVLIVGY S++GVDY Sbjct: 234 QNNEKELLKAVANQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDY 293 Query: 1160 WIVKNSWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLF 1339 WIVKNSWGS WG+DGY+H+ R++G+S G+CGIN LA +C LF Sbjct: 294 WIVKNSWGSYWGMDGYMHMQRNSGSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLF 353 Query: 1340 SSCPGGETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGN 1519 + C GETCCC + GIC SW+CCELDSAVCCKD RHCCP DYP+CDT RN+CLK GN Sbjct: 354 THCGEGETCCCVHHIFGICLSWKCCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGN 413 Query: 1520 STLVKKLENRGVFGELGNLNAFFQNWNL 1603 +T ++K G+ + ++ + W L Sbjct: 414 ATRIEKFAKNSSSGKFRSWSSLLEGWIL 441 >ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. 
lyrata] Length = 439 Score = 588 bits (1517), Expect = e-165 Identities = 276/437 (63%), Positives = 333/437 (76%), Gaps = 2/437 (0%) Frame = +2 Query: 293 TVFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNMGN 472 T F L VS SSS ++LF++W + HGK Y SE+E+Q R ++F+DN+D++TQHN + N Sbjct: 12 TFFFLLLVSSPSSSDDISELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITN 71 Query: 473 STYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSLDW 652 +TY+LSLNAFADLTHHEFKA LGLS SA+ LI ++G G +P S+DW Sbjct: 72 ATYSLSLNAFADLTHHEFKASRLGLSVSASSLIMASKG-------QSLGGNAKVPDSVDW 124 Query: 653 RKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGG 832 RKKGAVTNVKDQGSCGACW+FSATGAMEGIN+IVTG L+SLSEQEL+DCD+SYN+GC GG Sbjct: 125 RKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGG 184 Query: 833 LMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQAV 1012 LMD A+EFV+KN GIDTEKDYP+Q RDGTC ++KL + VVTID Y V +EK L +AV Sbjct: 185 LMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAV 244 Query: 1013 AAQPVSVGICGSERAFQLYSR--GIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSWGS 1186 AAQPVSVGICGSERAFQLYSR GIF+GPCST+LDHAVLIVGY SQ+GVDYWIVKNSWG Sbjct: 245 AAQPVSVGICGSERAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGK 304 Query: 1187 SWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPGGETC 1366 SWG+DG++H+ R+TGNSEG+CGIN LA KC+LF+ C GETC Sbjct: 305 SWGMDGFMHMQRNTGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGETC 364 Query: 1367 CCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTLVKKLEN 1546 CC+ L G+CFSW+CCE++SAVCC D RHCCPHDYP+CDT R+LCLKKTGN T +K Sbjct: 365 CCARNLFGLCFSWKCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWK 424 Query: 1547 RGVFGELGNLNAFFQNW 1597 + +LG F+ W Sbjct: 425 KDSSNKLGR----FEGW 437 >ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] gi|557537201|gb|ESR48319.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] Length = 441 Score = 588 bits (1516), Expect = e-165 Identities = 278/439 (63%), Positives = 330/439 (75%), Gaps = 6/439 (1%) Frame = +2 Query: 299 
FLLFHVSICSSSSLTA-----DLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNN 463 F L + + SS L +LFE W K+HGK YSSEQEKQ R K+FEDNY ++TQHNN Sbjct: 6 FFLLSILLLSSLPLNYCSDINELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQHNN 65 Query: 464 MGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSS 643 MGNS++TLSLNAFADLTH EFKA +LG SA++ D R + G D+P+S Sbjct: 66 MGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGT-----LRDVPAS 120 Query: 644 LDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGC 823 +DWRKKGAVT VKDQ SCGACWAFSATGA+EGINKIVTGSLVSLSEQEL+DCDRSYNSGC Sbjct: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGC 180 Query: 824 EGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELL 1003 GGLMD AY+FV+KN GIDTEKDYP++ + G CN+ KLNRH+VTIDGY+DV NEK+LL Sbjct: 181 GGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLL 240 Query: 1004 QAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSWG 1183 QAV AQPVSVGICGSERAFQLYS GIFTGPCST+LDHAVLIVGYDS++GVDYWI+KNSWG Sbjct: 241 QAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWG 300 Query: 1184 SSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPGGET 1363 SWG++GY+H+ R+TGNS G+CGIN LA +CSL + C GET Sbjct: 301 RSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGET 360 Query: 1364 CCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKK-TGNSTLVKKL 1540 CCC +LGIC SW+CC SAVCC DHR+CCP +YPICD+ R+ CL + TGN T + + Sbjct: 361 CCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAI 420 Query: 1541 ENRGVFGELGNLNAFFQNW 1597 E RG + G+ ++F W Sbjct: 421 EMRGSSWKFGSWSSFIDVW 439 >ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis] Length = 441 Score = 588 bits (1515), Expect = e-165 Identities = 272/418 (65%), Positives = 323/418 (77%), Gaps = 1/418 (0%) Frame = +2 Query: 347 DLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNMGNSTYTLSLNAFADLTHHEF 526 +LFE W K+HGK YSSEQEKQ R K+FEDNY ++TQHNNMGNS++TLSLNAFADLTH EF Sbjct: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86 Query: 527 
KAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSLDWRKKGAVTNVKDQGSCGAC 706 KA +LG SA++ D R + G D+P+S+DWRKKGAVT VKDQ SCGAC Sbjct: 87 KASFLGFSAASIDHDRRRNASVQSPGN-----LRDVPASIDWRKKGAVTEVKDQASCGAC 141 Query: 707 WAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDPAYEFVVKNKGIDTE 886 WAFSATGA+EGINKIVTGSLVSLSEQEL+DCDRSYNSGC GGLMD AY+FV+KN GIDTE Sbjct: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201 Query: 887 KDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQAVAAQPVSVGICGSERAFQL 1066 KDYP++ + G CN+ KLNRH+VTIDGY+DV NEK+LLQAV AQPVSVGICGSERAFQL Sbjct: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261 Query: 1067 YSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSWGSSWGIDGYIHIARSTGNSEGV 1246 YS GIFTGPCST+LDHAVLI+GYDS++GVDYWI+KNSWG SWG++GY+H+ R+TGNS G+ Sbjct: 262 YSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321 Query: 1247 CGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPGGETCCCSWQLLGICFSWQCCELDS 1426 CGIN LA +CSL + C GETCCC +LGIC SW+CC S Sbjct: 322 CGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCCCGSSILGICLSWKCCGFSS 381 Query: 1427 AVCCKDHRHCCPHDYPICDTKRNLCLKK-TGNSTLVKKLENRGVFGELGNLNAFFQNW 1597 AVCC DHR+CCP +YPICD+ R+ CL + TGN T + +E RG + G+ ++F W Sbjct: 382 AVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIEMRGSSWKFGSWSSFIDAW 439 >ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa] gi|550339725|gb|EEE94684.2| cysteine protease family protein [Populus trichocarpa] Length = 436 Score = 588 bits (1515), Expect = e-165 Identities = 272/443 (61%), Positives = 342/443 (77%) Frame = +2 Query: 269 MNWVWPFWTVFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYI 448 MN+++ F L+ +S +SSS + LFE W KEHGK Y+S++E+ +R KVFEDNYD++ Sbjct: 1 MNFLYIFALTLLISVLSPSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFV 60 Query: 449 TQHNNMGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTN 628 T+HN+ GNS+Y+L+LNAFADLTHHEFK LGLSA+ +L N +G VG Sbjct: 61 TKHNSKGNSSYSLALNAFADLTHHEFKTSRLGLSAAPLNLAHRN------LEITGVVG-- 112 Query: 629 DIPSSLDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRS 808 DIP+S+DWR KG 
VTNVKDQGSCGACW+FSATGA+EGINKIVTGSLVSLSEQEL++CD+S Sbjct: 113 DIPASIDWRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKS 172 Query: 809 YNSGCEGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGN 988 YN GC GGLMD A++FV+ N GIDTE+DYP++ARDGTCN++++ R VVTID Y DV N Sbjct: 173 YNDGCGGGLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENN 232 Query: 989 EKELLQAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIV 1168 EK+LLQAVAAQPVSVGICGSERAFQ+YS+GIFTGPCST+LDHAVLIVGY S++GVDYWIV Sbjct: 233 EKQLLQAVAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIV 292 Query: 1169 KNSWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSC 1348 KNSWG+ WG+ GY+H+ R++GNS+GVCGIN LA KC+L + C Sbjct: 293 KNSWGTGWGMRGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYC 352 Query: 1349 PGGETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTL 1528 GETCCC+ + GIC SW+CC LDSAVCCKD HCCPHDYP+CDT +N+C K+ GN+T Sbjct: 353 AAGETCCCARKFFGICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATR 412 Query: 1529 VKKLENRGVFGELGNLNAFFQNW 1597 ++ +E + G+ G+ N+ + W Sbjct: 413 MEAIEGK-TSGKFGSWNSLPEAW 434 >gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao] Length = 438 Score = 584 bits (1505), Expect = e-164 Identities = 278/439 (63%), Positives = 333/439 (75%) Frame = +2 Query: 287 FWTVFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNM 466 F FLLF +S S + LFE W +HGKRYSSE+EK YR KVFE+NY ++TQHN + Sbjct: 8 FLLSFLLFFDPSFASPSHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGV 67 Query: 467 GNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSL 646 GNS+Y+L+LNAFADLTHHEFKA LGLSA+A + R N G V DIP+S+ Sbjct: 68 GNSSYSLALNAFADLTHHEFKASRLGLSAAAIEGSRPN------LQLPGLV--RDIPASM 119 Query: 647 DWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGCE 826 DWR KGAVT VKDQGSCGACW+FSATGA+EGINKIVTG+LVSLSEQELVDCDRSYNSGCE Sbjct: 120 DWRTKGAVTKVKDQGSCGACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCE 179 Query: 827 GGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQ 1006 GGLMD AY+FV+ N GID E+DYP+ R+ TCN+ K R VVTIDGY V NE LLQ Sbjct: 180 
GGLMDYAYQFVIDNHGIDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQ 239 Query: 1007 AVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSWGS 1186 AVA QPVSVGICGSERAFQLYS+GIFTGPCS++LDHAVLIVGY S++GVDYWIVKNSWG+ Sbjct: 240 AVAKQPVSVGICGSERAFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGT 299 Query: 1187 SWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPGGETC 1366 WG++GYIH+ R++G+S+G+CGIN LA KC LF+ C GETC Sbjct: 300 RWGMNGYIHMLRNSGDSKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETC 359 Query: 1367 CCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTLVKKLEN 1546 CC+ ++ GICFSW+CCELDSAVCCKD+RHCCP+DYP+CDTK++ CLK+ GN+T ++ E Sbjct: 360 CCTHRIFGICFSWKCCELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFEK 419 Query: 1547 RGVFGELGNLNAFFQNWNL 1603 R + + F +NW L Sbjct: 420 RHSTRKFSSWRPFVENWVL 438 >ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] gi|557095297|gb|ESQ35879.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] Length = 444 Score = 583 bits (1504), Expect = e-164 Identities = 273/432 (63%), Positives = 331/432 (76%) Frame = +2 Query: 302 LLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNMGNSTY 481 L F +S SS + A+LF++W HGK Y SE+E+Q+R ++F DN+D++TQHN++ NSTY Sbjct: 21 LSFSISSSSSDDI-AELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHNHISNSTY 79 Query: 482 TLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSLDWRKK 661 +LSLNAFADLTHHEFKA LGLSA + L+ + G S V +P S+DWRKK Sbjct: 80 SLSLNAFADLTHHEFKASRLGLSAPSPSLMAKEQS----LGVSERVRVK-VPDSVDWRKK 134 Query: 662 GAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMD 841 GAVTNVKDQGSCGACW+FSATGAMEGIN+IVTG L+SLSEQEL+DCD+SYN+GC GGLMD Sbjct: 135 GAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMD 194 Query: 842 PAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQAVAAQ 1021 A+EFV+KN GIDTEKDYP+Q +DGTC ++KL + VVTID Y V NEK L++AVA+Q Sbjct: 195 YAFEFVIKNHGIDTEKDYPYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEAVASQ 254 Query: 1022 PVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSWGSSWGID 1201 PVSVGICGSERAFQLYS 
GIF+GPCST+LDHAVLIVGY SQ+GVDYWIVKNSWG SWG+D Sbjct: 255 PVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMD 314 Query: 1202 GYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPGGETCCCSWQ 1381 G++H+ R+TGNSEGVCGIN LA KC+LF+ C GETCCC+ Sbjct: 315 GFMHMQRNTGNSEGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCART 374 Query: 1382 LLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTLVKKLENRGVFG 1561 L G+CFSW+CCEL+SAVCCKD RHCCP DYP+CDT ++LCLKKTGN T +K + Sbjct: 375 LFGLCFSWKCCELESAVCCKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPFWKKNSSN 434 Query: 1562 ELGNLNAFFQNW 1597 +LG F+ W Sbjct: 435 KLGR----FEEW 442 >ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 431 Score = 583 bits (1502), Expect = e-163 Identities = 275/438 (62%), Positives = 331/438 (75%) Frame = +2 Query: 272 NWVWPFWTVFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYIT 451 N+ + F T+FLL + ++S+++ +LFE W EHGK YSS +EK YR VF DNY+++T Sbjct: 3 NYAFHFLTLFLLLFRPLSATSNVS-ELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVT 61 Query: 452 QHNNMGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTND 631 HNN+ NS+YTLSLN++ADLTHHEFK LG S + + P D Sbjct: 62 HHNNLDNSSYTLSLNSYADLTHHEFKVSRLGFSPALRNF--------RPVLPQEPSLPRD 113 Query: 632 IPSSLDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSY 811 +P SLDWRKKGAVT VKDQGSCGACW+FSATGAMEGIN+I+TGSL+SLSEQEL+DCDRSY Sbjct: 114 VPDSLDWRKKGAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSY 173 Query: 812 NSGCEGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNE 991 NSGC GGLMD AY+FV+ N GIDTE DYP+QARDG+C ++KL R+VVTIDGY D+ +E Sbjct: 174 NSGCGGGLMDYAYQFVISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDE 233 Query: 992 KELLQAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVK 1171 +LLQAVAAQPVSVGICGSERAFQLYS+GIF+GPCST+LDHAVLIVGY S++GVDYWIVK Sbjct: 234 GKLLQAVAAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVK 293 Query: 1172 NSWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCP 1351 NSWG 
SWG+DGY+H+ R++GNSEGVCGIN LA KCS+ +SC Sbjct: 294 NSWGKSWGMDGYMHMQRNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCA 353 Query: 1352 GGETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTLV 1531 GETCCC+ + LG+C SW+CC L SAVCCKD RHCCP DYPICDT RNLCLK+T N T Sbjct: 354 AGETCCCAKKFLGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRT 413 Query: 1532 KKLENRGVFGELGNLNAF 1585 + LENR G G ++F Sbjct: 414 EILENRSSSGSSGTWSSF 431 >gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] Length = 517 Score = 578 bits (1490), Expect = e-162 Identities = 270/406 (66%), Positives = 320/406 (78%) Frame = +2 Query: 293 TVFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNMGN 472 ++ L H+S+ S S ++ LFE W ++HG+ YSSE+E+ YR VFEDN ++TQHNNMGN Sbjct: 10 SLLLSSHLSLSSPSLNSSQLFEAWCEKHGQSYSSEEERLYRLTVFEDNLAFVTQHNNMGN 69 Query: 473 STYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSLDW 652 S+YTLSLNAFADLTHHEFK+ LG S++ L L + G S + D+P+SLDW Sbjct: 70 SSYTLSLNAFADLTHHEFKSSRLGFSSAL--LSSLPKLG------SKLLDLRDVPASLDW 121 Query: 653 RKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGG 832 RKKGAVTNVKDQGSCGACWAFSATGA+EGINKIVTGSLVSLSEQEL+DCD SYN+GC+GG Sbjct: 122 RKKGAVTNVKDQGSCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNAGCDGG 181 Query: 833 LMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQAV 1012 LMD AY+FV+ N GIDTE+DYP+QARD +C + KL R VVTIDGY DV N +LLQAV Sbjct: 182 LMDYAYQFVIDNHGIDTEEDYPYQARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQLLQAV 241 Query: 1013 AAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSWGSSW 1192 QPVSVGICGSERAFQLYS+GIFTGPCST+LDHAVLIVGYDS++GVDYWIVKNSWG W Sbjct: 242 VTQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYDSENGVDYWIVKNSWGKQW 301 Query: 1193 GIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPGGETCCC 1372 G+DGYIH+ R+TGNS+GVCGIN LA +CS F+ C GETCCC Sbjct: 302 GMDGYIHMQRNTGNSQGVCGINMLASYPTKTSPNPPPSPSPGPTRCSFFAQCGEGETCCC 361 Query: 1373 SWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKK 1510 SW+ LG+CFSW+CC L+SAVCCKD HCCP DYP+CDT+RN+CLK+ Sbjct: 362 SWRFLGLCFSWKCCGLNSAVCCKDKIHCCPQDYPLCDTQRNVCLKE 407 
>ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Capsella rubella] gi|482576142|gb|EOA40329.1| hypothetical protein CARUB_v10009056mg [Capsella rubella] Length = 467 Score = 570 bits (1468), Expect = e-159 Identities = 274/465 (58%), Positives = 332/465 (71%), Gaps = 30/465 (6%) Frame = +2 Query: 293 TVFLLFHVSICSSSSLT--ADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNM 466 T F L VS SSSS ++LF++W + HGK Y+SE EKQ+RF++F DN+D++TQHN + Sbjct: 12 TFFFLLLVSSSSSSSSDDISELFDDWCQRHGKTYASEAEKQHRFQIFRDNHDFVTQHNLI 71 Query: 467 GNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSL 646 N+TY+LSLNAFADL H EFK LGLS SA +I ++G G+ +P SL Sbjct: 72 TNATYSLSLNAFADLNHSEFKTSRLGLSVSAPSVIMASKG-------KSLGGSVKVPDSL 124 Query: 647 DWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGCE 826 DWRKKGAVTNVKDQGSCGACW+FSATGAMEGIN+IVTG L+SLSEQEL+DCD+SYN GC Sbjct: 125 DWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNDGCN 184 Query: 827 GGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQ 1006 GGLMD A+EFV+KNKGIDTEKDYP+Q RDGTC ++KL + VV+ID Y V +EK LL+ Sbjct: 185 GGLMDYAFEFVIKNKGIDTEKDYPYQERDGTCKKDKLKQRVVSIDSYAGVKPSDEKALLE 244 Query: 1007 AVAAQPVSVGICGSERAFQLYS----------------------------RGIFTGPCST 1102 AVAAQPVSVGICGSERAFQLYS +GIF+GPCST Sbjct: 245 AVAAQPVSVGICGSERAFQLYSSVSFKIRDTSILSSECSTFPCLKLYLMMQGIFSGPCST 304 Query: 1103 NLDHAVLIVGYDSQDGVDYWIVKNSWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXX 1282 +LDHAVLIVGY SQ+GVDYWIVKNSWG SWG+DG++H+ R+TGNS+G+CGIN LA Sbjct: 305 SLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSQGICGINMLASYPIK 364 Query: 1283 XXXXXXXXXXXXXVKCSLFSSCPGGETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCP 1462 KC+LF+ C ETCCC+ L G+C SW+CCE++SAVCCKD RHCCP Sbjct: 365 THPNPPPPSPPGPTKCNLFTYCSAAETCCCARNLFGLCLSWKCCEIESAVCCKDGRHCCP 424 Query: 1463 HDYPICDTKRNLCLKKTGNSTLVKKLENRGVFGELGNLNAFFQNW 1597 HDYP+CDT R+LCLKKTGN T +K + +LG F+ W Sbjct: 425 HDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSSNKLGR----FEEW 465 >gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135) [Arabidopsis thaliana] Length = 416 
Score = 569 bits (1466), Expect = e-159 Identities = 264/412 (64%), Positives = 322/412 (78%), Gaps = 7/412 (1%) Frame = +2 Query: 293 TVFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNMGN 472 T F L VS SSS ++LF++W ++HGK Y SE+E+Q R ++F+DN+D++TQHN + N Sbjct: 10 TFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITN 69 Query: 473 STYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSLDW 652 +TY+LSLNAFADLTHHEFKA LGLS SA +I ++G G+ +P S+DW Sbjct: 70 ATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIMASKG-------QSLGGSVKVPDSVDW 122 Query: 653 RKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGG 832 RKKGAVTNVKDQGSCGACW+FSATGAMEGIN+IVTG L+SLSEQEL+DCD+SYN+GC GG Sbjct: 123 RKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGG 182 Query: 833 LMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQAV 1012 LMD A+EFV+KN GIDTEKDYP+Q RDGTC ++KL + VVTID Y V +EK L++AV Sbjct: 183 LMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAV 242 Query: 1013 AAQPVSVGICGSERAFQLYS-------RGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVK 1171 AAQPVSVGICGSERAFQLYS +GIF+GPCST+LDHAVLIVGY SQ+GVDYWIVK Sbjct: 243 AAQPVSVGICGSERAFQLYSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVK 302 Query: 1172 NSWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCP 1351 NSWG SWG+DG++H+ R+T NS+GVCGIN LA KC+LF+ C Sbjct: 303 NSWGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCS 362 Query: 1352 GGETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLK 1507 GETCCC+ +L G+CFSW+CCE++SAVCCKD RHCCPHDYP+CDT R+LCLK Sbjct: 363 SGETCCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 414 >ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera] gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera] Length = 436 Score = 560 bits (1443), Expect = e-156 Identities = 261/448 (58%), Positives = 321/448 (71%), Gaps = 4/448 (0%) Frame = +2 Query: 272 NWVWPFWTVFLLFHVSICSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYIT 451 +W+W + L H S+ +SS TADLFE W +++GK YSSE+EK R KVFE+N+ ++T Sbjct: 3 
SWLWAVSILILAVHSSVSEASS-TADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVT 61 Query: 452 QHNNMGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTN- 628 QHN+M N++YTL+LNAFADLTHHEFKA LG S IR +VGT Sbjct: 62 QHNSMANASYTLALNAFADLTHHEFKASRLGFSPGRAQSIR-------------SVGTPV 108 Query: 629 ---DIPSSLDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDC 799 +P ++DWRK GAVT VKDQG+CG CW+FS TGA+EGINKIVTGSLVSLSEQELVDC Sbjct: 109 QELHVPPAVDWRKSGAVTGVKDQGNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDC 168 Query: 800 DRSYNSGCEGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVT 979 DRSYNSGCEGGLMD AY+FV+KN+GID+E DYP+ D CN+ KL +H+VTIDGY D+ Sbjct: 169 DRSYNSGCEGGLMDYAYQFVIKNQGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIP 228 Query: 980 RGNEKELLQAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDY 1159 +EK+LLQ VA QPVSVGICGSE+ FQLYS+G++TGPCS+ LDHAVLIVGY ++DGVD+ Sbjct: 229 PNDEKQLLQVVAKQPVSVGICGSEKTFQLYSKGVYTGPCSSTLDHAVLIVGYGTEDGVDF 288 Query: 1160 WIVKNSWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLF 1339 WIVKNSWG WG+ GYIH+ R+ G +EG+CGIN LA KC F Sbjct: 289 WIVKNSWGEHWGMRGYIHMLRNNGTAEGICGINMLASYPAKTSPNPPPPPTPGPTKCDFF 348 Query: 1340 SSCPGGETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGN 1519 SSC GETCCCSW+ +G+C SW CC SAVCC ++ +CCP +PICDTKRN CLK GN Sbjct: 349 SSCSEGETCCCSWRFIGVCLSWNCCTAKSAVCCDNNNYCCPASHPICDTKRNRCLKPAGN 408 Query: 1520 STLVKKLENRGVFGELGNLNAFFQNWNL 1603 T V+ L+ RG + G ++ WNL Sbjct: 409 GTGVEVLKRRGSSVKFGGWSSINDAWNL 436 >ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum] Length = 436 Score = 550 bits (1416), Expect = e-153 Identities = 261/426 (61%), Positives = 316/426 (74%), Gaps = 2/426 (0%) Frame = +2 Query: 287 FWTVFLLFHVSICSSSSL--TADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHN 460 F + LL +S+ S + T+ LF+ W K+HGK Y SEQEK+YRF VFEDNY ++ QHN Sbjct: 6 FMFLQLLLSLSLLSFVTAIDTSKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHN 65 Query: 461 NMGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPS 640 +GNS+YTLSLNAFADLTHHEFKA LGL S+ + NR + + +PS Sbjct: 66 
QIGNSSYTLSLNAFADLTHHEFKATRLGLPPSSLLRFKFNRFQ----DQQRSDDFLQVPS 121 Query: 641 SLDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYNSG 820 +DWRK GAV+ VKDQGSCGACW+FSATGA+EGINKIVTGSLVSLSEQELVDCD +YNSG Sbjct: 122 EIDWRKNGAVSIVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSG 181 Query: 821 CEGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKEL 1000 C+GGLMD AY+F++ N GIDTE+DYP+QAR C ++KL R VVTIDGY DV +EK+L Sbjct: 182 CDGGLMDYAYQFIIDNNGIDTEEDYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKL 241 Query: 1001 LQAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSW 1180 L+AVA QPVSVGICGS RAFQLYS+GIFTGPCST+LDHAVLIVGY S++GVDYWIVKNSW Sbjct: 242 LKAVAVQPVSVGICGSARAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSW 301 Query: 1181 GSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPGGE 1360 G WG++GYIH+ R+T +S G+CGIN LA +KC+LF+ C GGE Sbjct: 302 GKYWGMNGYIHMLRNTDSSAGLCGINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGE 361 Query: 1361 TCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTLVKKL 1540 TCCC+ + LGICFSW+CC + SAVCCKD RHCCP DYP+CD CLK+ N T++ Sbjct: 362 TCCCAKKFLGICFSWKCCGVTSAVCCKDKRHCCPLDYPVCDASNGQCLKRIANGTILMTS 421 Query: 1541 ENRGVF 1558 + F Sbjct: 422 DKEDPF 427 >gb|ESW08035.1| hypothetical protein PHAVU_009G013000g [Phaseolus vulgaris] Length = 428 Score = 547 bits (1410), Expect = e-153 Identities = 269/419 (64%), Positives = 319/419 (76%), Gaps = 4/419 (0%) Frame = +2 Query: 287 FWTVFLLFHVSI-CSSSSLTADLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHN- 460 F+++ +LF + + +S+S T+DLFE W KEH K YSSE+EK+YRF VFEDNY +++QHN Sbjct: 4 FFSLIILFTLYLPFASASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNR 63 Query: 461 --NMGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDI 634 N NSTYTLSLNAFADLTHHEFK LG S S L+R R Sbjct: 64 NANDNNSTYTLSLNAFADLTHHEFKTSRLGFSPS---LLRFKR-----VQNQQPRHLLHN 115 Query: 635 PSSLDWRKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSYN 814 PS +DWR+ GAVT VKDQ SCGACWAFSATGA+EGINKIVTGSL SLSEQELVDCD SYN Sbjct: 116 PSQIDWRQSGAVTPVKDQASCGACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYN 175 Query: 815 
SGCEGGLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEK 994 SGCEGGLMD AY+FV+ NKGIDTE DYP+QAR CN++KL RH+VTID Y D+ NE+ Sbjct: 176 SGCEGGLMDYAYQFVIDNKGIDTEDDYPYQARQRPCNKDKLKRHIVTIDDYVDLP-PNEE 234 Query: 995 ELLQAVAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKN 1174 ELL+AVA+QPVSVGICGSERAFQLYS+GIF+GPCST+LDHAVLIVGY S++GVDYWIVKN Sbjct: 235 ELLKAVASQPVSVGICGSERAFQLYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKN 294 Query: 1175 SWGSSWGIDGYIHIARSTGNSEGVCGINTLAXXXXXXXXXXXXXXXXXXVKCSLFSSCPG 1354 SWG WG++GYIH+ R+TG+ +G+CGINTLA V+C+LF+ C Sbjct: 295 SWGKYWGMEGYIHMIRNTGDPKGICGINTLA--SYPIKTKPNPPPPPAPVRCNLFTHCSE 352 Query: 1355 GETCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTLV 1531 GETCCC+ LGICFSW+CC L SAVCCKD RHCCP DYPICDT+++ CLK T +T + Sbjct: 353 GETCCCAKSFLGICFSWKCCGLTSAVCCKDKRHCCPRDYPICDTEKSQCLKITNGTTTI 411 >ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [Fragaria vesca subsp. vesca] Length = 441 Score = 547 bits (1409), Expect = e-153 Identities = 267/439 (60%), Positives = 318/439 (72%), Gaps = 5/439 (1%) Frame = +2 Query: 296 VFLLFHVSICSSSSLTA-DLFENWSKEHGKRYSSEQEKQYRFKVFEDNYDYITQHNNMGN 472 + LL H + SSSS ++ +LFE W K++GK YSS++EK YR +FE N +ITQHN++GN Sbjct: 12 LLLLSHPCLSSSSSSSSSELFEAWCKQYGKSYSSQEEKLYRLSLFEQNLAFITQHNDLGN 71 Query: 473 STYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGGLSPFGESGAVGTNDIPSSLDW 652 S+YTLSLN+F+DLTHHEFKA LG S + +RL R + +PSS+DW Sbjct: 72 SSYTLSLNSFSDLTHHEFKASRLGFSPT---FLRLYRKS-----DPKPSVVRHVPSSIDW 123 Query: 653 RKKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELVDCDRSY-NSGCEG 829 RK GAVTNVKDQGSCGACW+FSATGA+EGINKIVTGSLVSLSEQEL+DCDR Y NSGC G Sbjct: 124 RKNGAVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIDCDRVYPNSGCNG 183 Query: 830 GLMDPAYEFVVKNKGIDTEKDYPFQARDGTCNRNKLNRHVVTIDGYRDVTRGNEKELLQA 1009 GLMD A++F++ N GIDTE+DYP+Q DGTCN+ KL RHVVTIDGY DV NE++LL+A Sbjct: 184 GLMDDAFQFIIDNNGIDTEEDYPYQGADGTCNKQKLKRHVVTIDGYTDVPANNEEQLLKA 243 Query: 1010 VAAQPVSVGICGSERAFQLYSRGIFTGPCSTNLDHAVLIVGYDSQDGVDYWIVKNSWGSS 1189 VA QPVSVGI GS R FQ YS+GIF GPCST LDHAVLIVGY 
S++GVDYWIVKNSWG + Sbjct: 244 VATQPVSVGIAGSGREFQFYSKGIFAGPCSTTLDHAVLIVGYGSENGVDYWIVKNSWGKN 303 Query: 1190 WGIDGYIHIARSTGNSEGVCGINTLA---XXXXXXXXXXXXXXXXXXVKCSLFSSCPGGE 1360 WG++GYIHI R NS+G+CGIN LA KC LFS C GE Sbjct: 304 WGMNGYIHILRDHSNSKGLCGINMLASYPTKTGENPPFPPPSPPPGPTKCDLFSKCGVGE 363 Query: 1361 TCCCSWQLLGICFSWQCCELDSAVCCKDHRHCCPHDYPICDTKRNLCLKKTGNSTLVKKL 1540 TCCC+ ++LGIC SW+CCE SAVCCKD HCCPHDYPICDT+RN CL+ GN T+ + Sbjct: 364 TCCCARKILGICLSWRCCEFTSAVCCKDRLHCCPHDYPICDTERNYCLQANGNLTM-RAN 422 Query: 1541 ENRGVFGELGNLNAFFQNW 1597 E RG + A W Sbjct: 423 EIRGSLRKSSRSKAKLSYW 441