BLASTX nr result
ID: Mentha27_contig00002398
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00002398 (827 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

gb|EYU34027.1| hypothetical protein MIMGU_mgv1a024943mg, partial...   378   e-102
gb|ABG37021.1| aspartic protease [Nicotiana tabacum]                  357   4e-96
ref|XP_004243484.1| PREDICTED: aspartic proteinase A1-like isofo...   351   2e-94
ref|XP_006364164.1| PREDICTED: aspartic proteinase-like [Solanum...   348   1e-93
ref|XP_002298827.2| aspartic protease family protein [Populus tr...   345   2e-92
ref|XP_007020768.1| Aspartic proteinase A1 [Theobroma cacao] gi|...   345   2e-92
emb|CAA70340.1| aspartic proteinase [Centaurea calcitrapa]            344   2e-92
gb|AAC49730.1| aspartic proteinase [Arabidopsis thaliana]             344   3e-92
ref|NP_172655.1| aspartic proteinase A1 [Arabidopsis thaliana] g...   344   3e-92
ref|XP_002892661.1| aspartyl protease family protein [Arabidopsi...   344   3e-92
dbj|BAH20208.1| AT1G11910 [Arabidopsis thaliana]                      344   3e-92
emb|CAC86003.1| aspartic proteinase [Theobroma cacao]                 343   3e-92
emb|CAA57510.1| cyprosin [Cynara cardunculus]                         343   4e-92
gb|AFB73927.2| preprocirsin [Cirsium vulgare]                         340   3e-91
gb|AGE15494.1| preprosilpepsin 1 [Silybum marianum]                   340   5e-91
gb|AAB03108.1| aspartic protease [Brassica napus]                     340   5e-91
ref|XP_007049085.1| Aspartic protease isoform 2 [Theobroma cacao...   339   6e-91
gb|EXB66327.1| Aspartic proteinase [Morus notabilis]                  339   8e-91
ref|XP_006303952.1| hypothetical protein CARUB_v10008798mg, part...   338   1e-90
ref|XP_006475035.1| PREDICTED: aspartic proteinase-like [Citrus ... 
338 2e-90 >gb|EYU34027.1| hypothetical protein MIMGU_mgv1a024943mg, partial [Mimulus guttatus] Length = 382 Score = 378 bits (970), Expect = e-102 Identities = 181/215 (84%), Positives = 202/215 (93%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSLL GPTT+VTLINHAIGA+GVVSQECKSVVS+YG+TIL++L SET PQKICSQ+ Sbjct: 171 DSGTSLLAGPTTIVTLINHAIGATGVVSQECKSVVSVYGKTILEMLTSETQPQKICSQIG 230 Query: 647 LCSSDGTRDVSMIIESVVDKGSGVGDEMCSVCEMAVVWIQNQLKKNETQEKILDFINQLC 468 LC+SDGTRDVSMIIESVV+KGS DEMC+ CEMAVVW+QNQ+K+NET+EKILD+INQLC Sbjct: 231 LCASDGTRDVSMIIESVVEKGS---DEMCTACEMAVVWMQNQVKRNETEEKILDYINQLC 287 Query: 467 DRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGFTALDV 288 +RLPSPMGESAVDC LSSMPNISFTIGGKSF LTP+QYVLK+GEG+ AQCISGFTALDV Sbjct: 288 ERLPSPMGESAVDCGVLSSMPNISFTIGGKSFELTPQQYVLKIGEGEAAQCISGFTALDV 347 Query: 287 APPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEAA 183 APPRGPLWILGDVFMG YH+VFDYGN+KVGFAEAA Sbjct: 348 APPRGPLWILGDVFMGPYHTVFDYGNLKVGFAEAA 382 >gb|ABG37021.1| aspartic protease [Nicotiana tabacum] Length = 508 Score = 357 bits (915), Expect = 4e-96 Identities = 167/219 (76%), Positives = 198/219 (90%), Gaps = 4/219 (1%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSLL GPTT++T INH IGASGVVSQECKS+V+ YG+TILDLL S+ +PQKICSQ+ Sbjct: 290 DSGTSLLAGPTTIITQINHVIGASGVVSQECKSLVTEYGKTILDLLESKAAPQKICSQIG 349 Query: 647 LCSSDGTRDVSMIIESVVDK----GSGVGDEMCSVCEMAVVWIQNQLKKNETQEKILDFI 480 LCSSDG+RDVSMIIESVVDK +G+GDEMC VCEMAV+W+QNQ+++NET + I D++ Sbjct: 350 LCSSDGSRDVSMIIESVVDKHNGASNGLGDEMCRVCEMAVIWMQNQMRRNETADSIYDYV 409 Query: 479 NQLCDRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGFT 300 NQLCDRLPSPMGESAVDC++L+SMPN+SFT+G ++F LTP+QYVL+VGEG VAQCISGFT Sbjct: 410 NQLCDRLPSPMGESAVDCSSLASMPNVSFTVGNQTFGLTPQQYVLQVGEGPVAQCISGFT 469 Query: 299 ALDVAPPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEAA 183 ALDV PPRGPLWILGDVFMG+YH+VFDYGN +VGFAEAA Sbjct: 470 ALDVPPPRGPLWILGDVFMGRYHTVFDYGNSRVGFAEAA 508 >ref|XP_004243484.1| PREDICTED: aspartic 
proteinase A1-like isoform 1 [Solanum lycopersicum] gi|460395834|ref|XP_004243485.1| PREDICTED: aspartic proteinase A1-like isoform 2 [Solanum lycopersicum] Length = 508 Score = 351 bits (900), Expect = 2e-94 Identities = 168/219 (76%), Positives = 195/219 (89%), Gaps = 4/219 (1%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSLL GPTT++T INHAIGASGVVSQECKSVVS YG+TILDLL S+ +PQ+ICSQ+ Sbjct: 290 DSGTSLLAGPTTIITQINHAIGASGVVSQECKSVVSEYGKTILDLLESKAAPQQICSQIG 349 Query: 647 LCSSDGTRDVSMIIESVVDK----GSGVGDEMCSVCEMAVVWIQNQLKKNETQEKILDFI 480 LCS DG +DVSMIIESVVDK +GV DEMC VCEMAVVW+QNQL++NET ++I D++ Sbjct: 350 LCSRDGGKDVSMIIESVVDKHNEASNGVHDEMCRVCEMAVVWMQNQLRRNETADRIFDYM 409 Query: 479 NQLCDRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGFT 300 N+LCDR+PSPMGESAVDCN+L+SMPN+SFT+G K+F LTP+QYVLKVGE VAQCISGFT Sbjct: 410 NKLCDRIPSPMGESAVDCNSLASMPNVSFTVGDKTFELTPQQYVLKVGEAPVAQCISGFT 469 Query: 299 ALDVAPPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEAA 183 ALDV PPRGPLWILGDVFMG+YH+VFDY M+VGFAEAA Sbjct: 470 ALDVPPPRGPLWILGDVFMGRYHTVFDYEKMRVGFAEAA 508 >ref|XP_006364164.1| PREDICTED: aspartic proteinase-like [Solanum tuberosum] Length = 508 Score = 348 bits (894), Expect = 1e-93 Identities = 168/219 (76%), Positives = 194/219 (88%), Gaps = 4/219 (1%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSLL GPTT++T INHAIGASGVVSQECKSVVS YG+TILDLL S+ +PQ+ICSQ+N Sbjct: 290 DSGTSLLAGPTTIITQINHAIGASGVVSQECKSVVSEYGKTILDLLESKAAPQQICSQIN 349 Query: 647 LCSSDGTRDVSMIIESVVDK----GSGVGDEMCSVCEMAVVWIQNQLKKNETQEKILDFI 480 LCS DG RDVSMIIESVVDK +GV DEMC VCEMAVVW+QNQL++NET ++I D++ Sbjct: 350 LCSRDGGRDVSMIIESVVDKHNEASNGVHDEMCRVCEMAVVWMQNQLRRNETADRIFDYM 409 Query: 479 NQLCDRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGFT 300 N+LCDRLPSPMGESAV+CN+L+SMPN+SFT+G K+F LTP+QYVLKVGE V QCISGFT Sbjct: 410 NKLCDRLPSPMGESAVNCNSLASMPNVSFTVGNKTFELTPQQYVLKVGEAPVTQCISGFT 469 Query: 299 ALDVAPPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEAA 183 ALDV 
PPRGPLWILGDVFMG+YH+VFD M+VGFAEAA Sbjct: 470 ALDVPPPRGPLWILGDVFMGRYHTVFDSEKMRVGFAEAA 508 >ref|XP_002298827.2| aspartic protease family protein [Populus trichocarpa] gi|550349038|gb|EEE83632.2| aspartic protease family protein [Populus trichocarpa] Length = 515 Score = 345 bits (884), Expect = 2e-92 Identities = 162/219 (73%), Positives = 193/219 (88%), Gaps = 5/219 (2%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSLL GPTT++T +NHAIGA+GVVSQECK+VV+ YG TI+++LL++ PQKIC+Q+ Sbjct: 296 DSGTSLLAGPTTIITEVNHAIGATGVVSQECKAVVAQYGDTIMEMLLAKDQPQKICAQIG 355 Query: 647 LCSSDGTRDVSMIIESVVDK-----GSGVGDEMCSVCEMAVVWIQNQLKKNETQEKILDF 483 LC+ DGTR VSM IESVV++ G D MCS CEMAVVW+QNQLK+N+TQE+ILD+ Sbjct: 356 LCTFDGTRGVSMGIESVVNEHAQKASDGFHDAMCSTCEMAVVWMQNQLKQNQTQERILDY 415 Query: 482 INQLCDRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGF 303 +N+LC+RLPSPMGESAVDC+ LSSMPN+SFTIGG+ F L+PEQYVLKVGEGDVAQCISGF Sbjct: 416 VNELCERLPSPMGESAVDCDGLSSMPNVSFTIGGRVFELSPEQYVLKVGEGDVAQCISGF 475 Query: 302 TALDVAPPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEA 186 TALDV PPRGPLWILGDVFMG +H+VFDYGNM+VGFAEA Sbjct: 476 TALDVPPPRGPLWILGDVFMGSFHTVFDYGNMRVGFAEA 514 >ref|XP_007020768.1| Aspartic proteinase A1 [Theobroma cacao] gi|508720396|gb|EOY12293.1| Aspartic proteinase A1 [Theobroma cacao] Length = 514 Score = 345 bits (884), Expect = 2e-92 Identities = 162/220 (73%), Positives = 194/220 (88%), Gaps = 5/220 (2%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSL+TGPT ++ +NHAIGASGVVSQECK+VVS YG+TI+D+LLS+ P KICSQ+ Sbjct: 295 DSGTSLITGPTAIIAQVNHAIGASGVVSQECKTVVSQYGETIIDMLLSKDQPLKICSQIG 354 Query: 647 LCSSDGTRDVSMIIESVVDKGSG-----VGDEMCSVCEMAVVWIQNQLKKNETQEKILDF 483 LC+ DGTR VS IESVV + +G + D MCS CEM V+W+QNQLK+N+TQE+IL++ Sbjct: 355 LCTFDGTRGVSTGIESVVHENAGKATGDLHDAMCSTCEMTVIWMQNQLKQNQTQERILEY 414 Query: 482 INQLCDRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGF 303 IN+LCDRLPSPMGESAVDC++LS+MPN+SFTIGGK F L+PEQYVLKVGEGDVAQC+SGF Sbjct: 415 
INELCDRLPSPMGESAVDCSSLSTMPNVSFTIGGKIFELSPEQYVLKVGEGDVAQCLSGF 474 Query: 302 TALDVAPPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEAA 183 TALDV PPRGPLWILGDVFMGQ+H+VFDYGN++VGFAEAA Sbjct: 475 TALDVPPPRGPLWILGDVFMGQFHTVFDYGNLQVGFAEAA 514 >emb|CAA70340.1| aspartic proteinase [Centaurea calcitrapa] Length = 509 Score = 344 bits (883), Expect = 2e-92 Identities = 159/220 (72%), Positives = 192/220 (87%), Gaps = 5/220 (2%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSLL GPT ++T INHAIGA GV+SQ+CK++V YG+TI+++LLSE P KICSQ+ Sbjct: 290 DSGTSLLAGPTAIITQINHAIGAKGVMSQQCKTLVDQYGKTIIEMLLSEAQPDKICSQMK 349 Query: 647 LCSSDGTRDVSMIIESVVDKGSG-----VGDEMCSVCEMAVVWIQNQLKKNETQEKILDF 483 LC+ DG RDVS IIESVVDK +G V DEMC+ CEMAVVW+QNQ+K+N+T++ I+++ Sbjct: 350 LCTFDGARDVSSIIESVVDKNNGKSSGGVHDEMCTFCEMAVVWMQNQIKRNQTEDNIINY 409 Query: 482 INQLCDRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGF 303 +N+LCDRLPSPMGESAVDCN LSSMPNI+FTIGGK F L PEQY+LK+GEG+ AQCISGF Sbjct: 410 VNELCDRLPSPMGESAVDCNDLSSMPNIAFTIGGKVFELCPEQYILKIGEGEAAQCISGF 469 Query: 302 TALDVAPPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEAA 183 TA+DVAPPRGPLWILGDVFMGQYH+VFDYG ++VGFAEAA Sbjct: 470 TAMDVAPPRGPLWILGDVFMGQYHTVFDYGKLRVGFAEAA 509 >gb|AAC49730.1| aspartic proteinase [Arabidopsis thaliana] Length = 486 Score = 344 bits (882), Expect = 3e-92 Identities = 166/220 (75%), Positives = 192/220 (87%), Gaps = 5/220 (2%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSLL GPTT++T+INHAIGA+GVVSQ+CK+VV YGQTILDLLLSET P+KICSQ+ Sbjct: 267 DSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTVVDQYGQTILDLLLSETQPKKICSQIG 326 Query: 647 LCSSDGTRDVSMIIESVVDK-----GSGVGDEMCSVCEMAVVWIQNQLKKNETQEKILDF 483 LC+ DGTR VSM IESVVDK +GVGD CS CEMAVVWIQ+QL++N TQE+IL++ Sbjct: 327 LCTFDGTRGVSMGIESVVDKENAKLSNGVGDAACSACEMAVVWIQSQLRQNMTQERILNY 386 Query: 482 INQLCDRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGF 303 +N+LC+RLPSPMGESAVDC LS+MP +S TIGGK F L PE+YVLKVGEG VAQCISGF Sbjct: 387 
VNELCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLAPEEYVLKVGEGPVAQCISGF 446 Query: 302 TALDVAPPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEAA 183 ALDVAPPRGPLWILGDVFMG+YH+VFD+GN +VGFAEAA Sbjct: 447 IALDVAPPRGPLWILGDVFMGKYHTVFDFGNEQVGFAEAA 486 >ref|NP_172655.1| aspartic proteinase A1 [Arabidopsis thaliana] gi|75318541|sp|O65390.1|APA1_ARATH RecName: Full=Aspartic proteinase A1; Flags: Precursor gi|3157937|gb|AAC17620.1| Identical to aspartic proteinase cDNA gb|U51036 from A. thaliana. ESTs gb|N96313, gb|T21893, gb|R30158, gb|T21482, gb|T43650, gb|R64749, gb|R65157, gb|T88269, gb|T44552, gb|T22542, gb|T76533, gb|T44350, gb|Z34591, gb|AA728734, gb|T46003, gb|R65157, gb|N38290, gb|AA395468, gb|T20815 and gb|Z34173 come from this gene [Arabidopsis thaliana] gi|15912219|gb|AAL08243.1| At1g11910/F12F1_24 [Arabidopsis thaliana] gi|15912251|gb|AAL08259.1| At1g11910/F12F1_24 [Arabidopsis thaliana] gi|17381036|gb|AAL36330.1| putative aspartic proteinase [Arabidopsis thaliana] gi|21617929|gb|AAM66979.1| putative aspartic proteinase [Arabidopsis thaliana] gi|25055040|gb|AAN71979.1| putative aspartic proteinase [Arabidopsis thaliana] gi|332190692|gb|AEE28813.1| aspartic proteinase A1 [Arabidopsis thaliana] Length = 506 Score = 344 bits (882), Expect = 3e-92 Identities = 166/220 (75%), Positives = 192/220 (87%), Gaps = 5/220 (2%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSLL GPTT++T+INHAIGA+GVVSQ+CK+VV YGQTILDLLLSET P+KICSQ+ Sbjct: 287 DSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTVVDQYGQTILDLLLSETQPKKICSQIG 346 Query: 647 LCSSDGTRDVSMIIESVVDK-----GSGVGDEMCSVCEMAVVWIQNQLKKNETQEKILDF 483 LC+ DGTR VSM IESVVDK +GVGD CS CEMAVVWIQ+QL++N TQE+IL++ Sbjct: 347 LCTFDGTRGVSMGIESVVDKENAKLSNGVGDAACSACEMAVVWIQSQLRQNMTQERILNY 406 Query: 482 INQLCDRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGF 303 +N+LC+RLPSPMGESAVDC LS+MP +S TIGGK F L PE+YVLKVGEG VAQCISGF Sbjct: 407 VNELCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLAPEEYVLKVGEGPVAQCISGF 466 Query: 302 
TALDVAPPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEAA 183 ALDVAPPRGPLWILGDVFMG+YH+VFD+GN +VGFAEAA Sbjct: 467 IALDVAPPRGPLWILGDVFMGKYHTVFDFGNEQVGFAEAA 506 >ref|XP_002892661.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata] gi|297338503|gb|EFH68920.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata] Length = 506 Score = 344 bits (882), Expect = 3e-92 Identities = 166/220 (75%), Positives = 192/220 (87%), Gaps = 5/220 (2%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSLL GPTT++T+INHAIGA+GVVSQ+CK+VV YGQTILDLLLSET P+KICSQ+ Sbjct: 287 DSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTVVDQYGQTILDLLLSETQPKKICSQIG 346 Query: 647 LCSSDGTRDVSMIIESVVDK-----GSGVGDEMCSVCEMAVVWIQNQLKKNETQEKILDF 483 LC+ DGTR VSM IESVVDK +GVGD CS CEMAVVWIQ+QL++N TQE+IL++ Sbjct: 347 LCTFDGTRGVSMGIESVVDKENSKLSNGVGDAACSACEMAVVWIQSQLRQNMTQERILNY 406 Query: 482 INQLCDRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGF 303 +N+LC+RLPSPMGESAVDC LS+MP +S TIGGK F L PE+YVLKVGEG VAQCISGF Sbjct: 407 VNELCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLAPEEYVLKVGEGPVAQCISGF 466 Query: 302 TALDVAPPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEAA 183 ALDVAPPRGPLWILGDVFMG+YH+VFD+GN +VGFAEAA Sbjct: 467 IALDVAPPRGPLWILGDVFMGKYHTVFDFGNEQVGFAEAA 506 >dbj|BAH20208.1| AT1G11910 [Arabidopsis thaliana] Length = 389 Score = 344 bits (882), Expect = 3e-92 Identities = 166/220 (75%), Positives = 192/220 (87%), Gaps = 5/220 (2%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSLL GPTT++T+INHAIGA+GVVSQ+CK+VV YGQTILDLLLSET P+KICSQ+ Sbjct: 170 DSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTVVDQYGQTILDLLLSETQPKKICSQIG 229 Query: 647 LCSSDGTRDVSMIIESVVDK-----GSGVGDEMCSVCEMAVVWIQNQLKKNETQEKILDF 483 LC+ DGTR VSM IESVVDK +GVGD CS CEMAVVWIQ+QL++N TQE+IL++ Sbjct: 230 LCTFDGTRGVSMGIESVVDKENAKLSNGVGDAACSACEMAVVWIQSQLRQNMTQERILNY 289 Query: 482 INQLCDRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGF 303 +N+LC+RLPSPMGESAVDC LS+MP +S TIGGK F L PE+YVLKVGEG VAQCISGF Sbjct: 290 
VNELCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLAPEEYVLKVGEGPVAQCISGF 349 Query: 302 TALDVAPPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEAA 183 ALDVAPPRGPLWILGDVFMG+YH+VFD+GN +VGFAEAA Sbjct: 350 IALDVAPPRGPLWILGDVFMGKYHTVFDFGNEQVGFAEAA 389 >emb|CAC86003.1| aspartic proteinase [Theobroma cacao] Length = 514 Score = 343 bits (881), Expect = 3e-92 Identities = 162/220 (73%), Positives = 193/220 (87%), Gaps = 5/220 (2%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSL+TGPT ++ +NHAIGASGVVSQECK+VVS YG+TI+D+LLS+ P KICSQ+ Sbjct: 295 DSGTSLITGPTAIIAQVNHAIGASGVVSQECKTVVSQYGETIIDMLLSKDQPLKICSQIG 354 Query: 647 LCSSDGTRDVSMIIESVVDKGSG-----VGDEMCSVCEMAVVWIQNQLKKNETQEKILDF 483 LC+ DGTR VS IESVV + G + D MCS CEM V+W+QNQLK+N+TQE+IL++ Sbjct: 355 LCTFDGTRGVSTGIESVVHENVGKATGDLHDAMCSTCEMTVIWMQNQLKQNQTQERILEY 414 Query: 482 INQLCDRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGF 303 IN+LCDRLPSPMGESAVDC++LS+MPN+SFTIGGK F L+PEQYVLKVGEGDVAQC+SGF Sbjct: 415 INELCDRLPSPMGESAVDCSSLSTMPNVSFTIGGKIFELSPEQYVLKVGEGDVAQCLSGF 474 Query: 302 TALDVAPPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEAA 183 TALDV PPRGPLWILGDVFMGQ+H+VFDYGN++VGFAEAA Sbjct: 475 TALDVPPPRGPLWILGDVFMGQFHTVFDYGNLQVGFAEAA 514 >emb|CAA57510.1| cyprosin [Cynara cardunculus] Length = 509 Score = 343 bits (880), Expect = 4e-92 Identities = 158/220 (71%), Positives = 193/220 (87%), Gaps = 5/220 (2%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSLL GPT ++T INHAIGA GV+SQ+CK++VS YG+T++++LLSE P KICSQ+ Sbjct: 290 DSGTSLLAGPTAIITEINHAIGAKGVMSQQCKTLVSQYGKTMIEMLLSEAQPDKICSQMK 349 Query: 647 LCSSDGTRDVSMIIESVVDKG-----SGVGDEMCSVCEMAVVWIQNQLKKNETQEKILDF 483 LC+ DG RD S IIESVVD+ SGV DEMC+ CEMAVVW+QNQ+K+NET++ I+++ Sbjct: 350 LCTFDGARDASSIIESVVDENNGKSSSGVHDEMCTFCEMAVVWMQNQIKRNETEDNIINY 409 Query: 482 INQLCDRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGF 303 +N+LCDRLPSPMGESAVDCN+LSSMPNI+FTIGGK F L PEQY+LK+GEG+ AQCISGF Sbjct: 410 
VNELCDRLPSPMGESAVDCNSLSSMPNIAFTIGGKVFELCPEQYILKIGEGEAAQCISGF 469 Query: 302 TALDVAPPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEAA 183 TA+DVAPPRGPLWILGDVFMG+YH+VFDYG ++VGFAEAA Sbjct: 470 TAMDVAPPRGPLWILGDVFMGRYHTVFDYGKLRVGFAEAA 509 >gb|AFB73927.2| preprocirsin [Cirsium vulgare] Length = 509 Score = 340 bits (873), Expect = 3e-91 Identities = 157/220 (71%), Positives = 191/220 (86%), Gaps = 5/220 (2%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSLL GPT ++T INHA GA GV+SQ+CK++VS YG++I+++LLSE P KICSQ+ Sbjct: 290 DSGTSLLAGPTAIITEINHASGAKGVMSQQCKTLVSQYGKSIIEMLLSEAQPDKICSQMK 349 Query: 647 LCSSDGTRDVSMIIESVVDKGSG-----VGDEMCSVCEMAVVWIQNQLKKNETQEKILDF 483 LC+ DG RDVS IIESVVDK +G DEMC+ CEMAVVW+QNQ+K+NET++ I+++ Sbjct: 350 LCTFDGARDVSSIIESVVDKNNGKSSGGANDEMCTFCEMAVVWMQNQIKRNETEDNIINY 409 Query: 482 INQLCDRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGF 303 +N+LCDRLPSPMGESAVDCN+LSSMPNI+FTIGGK F L PEQY+LK+GEG+ AQCISGF Sbjct: 410 VNELCDRLPSPMGESAVDCNSLSSMPNIAFTIGGKVFELCPEQYILKIGEGEAAQCISGF 469 Query: 302 TALDVAPPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEAA 183 TA+DVAPPRGPLWILGDVFMG+YH+VFDYG +VGFAEAA Sbjct: 470 TAMDVAPPRGPLWILGDVFMGRYHTVFDYGKSRVGFAEAA 509 >gb|AGE15494.1| preprosilpepsin 1 [Silybum marianum] Length = 506 Score = 340 bits (871), Expect = 5e-91 Identities = 155/217 (71%), Positives = 193/217 (88%), Gaps = 2/217 (0%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSLL GPT ++T INHAIGA GV+SQ+CK++V YG++I+++LLSE P KICSQ+ Sbjct: 290 DSGTSLLAGPTAIITQINHAIGAKGVMSQQCKTLVDQYGKSIIEMLLSEAQPDKICSQMK 349 Query: 647 LCSSDGTRDVSMIIESVVDKGSGV--GDEMCSVCEMAVVWIQNQLKKNETQEKILDFINQ 474 LC+ +G RDVS IIESVVDK +G G+EMC+ CEMAVVW+QNQ+K+N+TQ+ I++++++ Sbjct: 350 LCTFNGARDVSSIIESVVDKNNGKSSGNEMCTFCEMAVVWMQNQIKRNQTQDNIINYVSE 409 Query: 473 LCDRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGFTAL 294 LCDRLPSPMGESAVDCN+LSSMPNISFTIGGK F L PEQY+LK+G+G+ AQCISGFTA+ Sbjct: 410 
LCDRLPSPMGESAVDCNSLSSMPNISFTIGGKVFELCPEQYILKIGDGEAAQCISGFTAM 469 Query: 293 DVAPPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEAA 183 DVAPPRGPLWILGDVFMG+YH+VFDYG ++VGFAEAA Sbjct: 470 DVAPPRGPLWILGDVFMGRYHTVFDYGKLRVGFAEAA 506 >gb|AAB03108.1| aspartic protease [Brassica napus] Length = 506 Score = 340 bits (871), Expect = 5e-91 Identities = 167/220 (75%), Positives = 187/220 (85%), Gaps = 5/220 (2%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSLL GPTTV+T+INHAIGA+GVVSQ+CK VV YGQTILDLLLSET P+KICSQ+ Sbjct: 287 DSGTSLLAGPTTVITMINHAIGAAGVVSQQCKIVVDQYGQTILDLLLSETQPKKICSQIG 346 Query: 647 LCSSDGTRDVSMIIESVVDK-----GSGVGDEMCSVCEMAVVWIQNQLKKNETQEKILDF 483 LC+ DG R VSM IESVVDK SGVGD CS CEMAVVWIQ+QL++N TQE+ILD+ Sbjct: 347 LCTFDGKRGVSMGIESVVDKENAKSSSGVGDAACSACEMAVVWIQSQLRQNMTQERILDY 406 Query: 482 INQLCDRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGF 303 IN LC+RLPSPMGESAVDC LS+MP +S TIGGK F L PE+YVLKVGEG AQCISGF Sbjct: 407 INDLCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLAPEEYVLKVGEGPAAQCISGF 466 Query: 302 TALDVAPPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEAA 183 ALDVAPPRGPLWILGDVFMG+YH+VFD+G +VGFAEAA Sbjct: 467 IALDVAPPRGPLWILGDVFMGKYHTVFDFGKEQVGFAEAA 506 >ref|XP_007049085.1| Aspartic protease isoform 2 [Theobroma cacao] gi|508701346|gb|EOX93242.1| Aspartic protease isoform 2 [Theobroma cacao] Length = 514 Score = 339 bits (870), Expect = 6e-91 Identities = 162/220 (73%), Positives = 189/220 (85%), Gaps = 5/220 (2%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSLL GPTT++T INHAIGASGVVSQECK++VS YG+ IL+LL+SET PQKICSQ+ Sbjct: 295 DSGTSLLAGPTTIITQINHAIGASGVVSQECKAIVSQYGKMILELLVSETQPQKICSQIG 354 Query: 647 LCSSDGTRDVSMIIESVVDK-----GSGVGDEMCSVCEMAVVWIQNQLKKNETQEKILDF 483 C+ DGTR VS IESV D+ GV D MC+ CEMAVVW+QN+L++NET+E+ILD+ Sbjct: 355 FCTFDGTRGVSTRIESVADEIVGKSSDGVHDAMCTACEMAVVWMQNKLRRNETEEQILDY 414 Query: 482 INQLCDRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGF 303 +N+LC+RLPSP GES VDC++LSSMP +SFTIGGK F L PE+YVLKVGEG 
VAQCISGF Sbjct: 415 VNELCERLPSPNGESVVDCSSLSSMPGVSFTIGGKVFDLAPEEYVLKVGEGAVAQCISGF 474 Query: 302 TALDVAPPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEAA 183 TALDV PPRGPLWILGDVFMG+YH+VFDYGNM VGFAEAA Sbjct: 475 TALDVPPPRGPLWILGDVFMGRYHTVFDYGNMTVGFAEAA 514 >gb|EXB66327.1| Aspartic proteinase [Morus notabilis] Length = 514 Score = 339 bits (869), Expect = 8e-91 Identities = 157/220 (71%), Positives = 194/220 (88%), Gaps = 5/220 (2%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSLL GPTT++T +NHAIGA+GVVS+ECK++V YGQTI++ LL++ P+KIC+Q+ Sbjct: 295 DSGTSLLAGPTTIITELNHAIGATGVVSEECKAIVEQYGQTIIESLLAKDQPKKICAQIG 354 Query: 647 LCSSDGTRDVSMIIESVVDKGSG-----VGDEMCSVCEMAVVWIQNQLKKNETQEKILDF 483 LCS DGTR VSM I+SVVD+ G + D MCS CEMAVVW+QNQ+K+N+TQ++IL++ Sbjct: 355 LCSFDGTRGVSMGIKSVVDENVGKASGDLRDGMCSACEMAVVWMQNQIKQNQTQDQILNY 414 Query: 482 INQLCDRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGF 303 +NQLC+RLPSPMGESAVDC +LSS+P++SFTIGGK F L PE+Y+LKVGEGDVAQCISGF Sbjct: 415 VNQLCERLPSPMGESAVDCGSLSSLPDVSFTIGGKKFELKPEEYILKVGEGDVAQCISGF 474 Query: 302 TALDVAPPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEAA 183 TALDV PPRGPLWILGDVFMG+YH+VFDYGNM++GFAEAA Sbjct: 475 TALDVPPPRGPLWILGDVFMGRYHTVFDYGNMRIGFAEAA 514 >ref|XP_006303952.1| hypothetical protein CARUB_v10008798mg, partial [Capsella rubella] gi|482572663|gb|EOA36850.1| hypothetical protein CARUB_v10008798mg, partial [Capsella rubella] Length = 538 Score = 338 bits (867), Expect = 1e-90 Identities = 163/219 (74%), Positives = 190/219 (86%), Gaps = 5/219 (2%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSLL GPT ++T+INHAIGA+GVVSQ+CK+VV YG+TILDLLLSET P+KICSQ+ Sbjct: 319 DSGTSLLAGPTPIITMINHAIGAAGVVSQQCKTVVDQYGETILDLLLSETQPKKICSQIG 378 Query: 647 LCSSDGTRDVSMIIESVVDK-----GSGVGDEMCSVCEMAVVWIQNQLKKNETQEKILDF 483 LC+ DGTR VSM IESVVDK +GVGD CS CEMAVVWIQ+QL++N TQE+IL++ Sbjct: 379 LCTFDGTRGVSMGIESVVDKENAKLSNGVGDAACSACEMAVVWIQSQLRQNMTQERILNY 438 Query: 482 
INQLCDRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGF 303 IN+LC+RLPSPMGESAVDC LS+MP +S TIGGK F L+PE+YVLKVGEG AQCISGF Sbjct: 439 INELCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLSPEEYVLKVGEGPAAQCISGF 498 Query: 302 TALDVAPPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEA 186 ALDVAPPRGPLWILGDVFMG+YH+VFD+GN +VGFAEA Sbjct: 499 IALDVAPPRGPLWILGDVFMGKYHTVFDFGNEQVGFAEA 537 >ref|XP_006475035.1| PREDICTED: aspartic proteinase-like [Citrus sinensis] Length = 514 Score = 338 bits (866), Expect = 2e-90 Identities = 159/220 (72%), Positives = 192/220 (87%), Gaps = 5/220 (2%) Frame = -1 Query: 827 DSGTSLLTGPTTVVTLINHAIGASGVVSQECKSVVSMYGQTILDLLLSETSPQKICSQVN 648 DSGTSLL GPTT++T +NHAIGA+G+VSQECK+VVS YG+ I+++LL++ PQKICSQ+ Sbjct: 295 DSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKDEPQKICSQIG 354 Query: 647 LCSSDGTRDVSMIIESVVDKGS-----GVGDEMCSVCEMAVVWIQNQLKKNETQEKILDF 483 LC+ DG+R VSM IESVV + + G D MCS CEMAVVW+QNQLK+N+TQE+IL++ Sbjct: 355 LCTFDGSRGVSMGIESVVPENNHRASGGFHDAMCSTCEMAVVWMQNQLKQNQTQERILNY 414 Query: 482 INQLCDRLPSPMGESAVDCNALSSMPNISFTIGGKSFALTPEQYVLKVGEGDVAQCISGF 303 +N+LCDRLPSPMGESAVDC+ LSS+P +SFTIGGK F LTP+QY+LKVGEGD AQCISGF Sbjct: 415 VNELCDRLPSPMGESAVDCSRLSSLPIVSFTIGGKIFDLTPDQYILKVGEGDAAQCISGF 474 Query: 302 TALDVAPPRGPLWILGDVFMGQYHSVFDYGNMKVGFAEAA 183 +ALDVAPPRGPLWILGDVFMG YH+VFDY NM+VGFAEAA Sbjct: 475 SALDVAPPRGPLWILGDVFMGPYHTVFDYSNMRVGFAEAA 514