BLASTX nr result

ID: Akebia22_contig00014039 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00014039
         (1226 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2...   430   e-118
ref|XP_006372550.1| aspartyl protease family protein [Populus tr...   420   e-115
ref|XP_002305886.2| aspartyl protease family protein [Populus tr...   417   e-114
gb|EXB95902.1| Aspartic proteinase nepenthesin-2 [Morus notabili...   416   e-113
gb|EYU45777.1| hypothetical protein MIMGU_mgv1a027161mg [Mimulus...   415   e-113
ref|XP_003538390.1| PREDICTED: aspartic proteinase PCS1-like [Gl...   408   e-111
ref|XP_006482080.1| PREDICTED: aspartic proteinase PCS1-like [Ci...   407   e-111
ref|XP_006430555.1| hypothetical protein CICLE_v10011736mg [Citr...   407   e-111
ref|XP_007032118.1| Eukaryotic aspartyl protease family protein ...   406   e-110
ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor,...   406   e-110
ref|XP_006352439.1| PREDICTED: aspartic proteinase PCS1-like [So...   405   e-110
ref|XP_006405864.1| hypothetical protein EUTSA_v10028350mg [Eutr...   405   e-110
ref|XP_006391353.1| hypothetical protein EUTSA_v10018599mg [Eutr...   405   e-110
ref|XP_004289585.1| PREDICTED: aspartic proteinase PCS1-like [Fr...   404   e-110
ref|XP_002868527.1| aspartyl protease family protein [Arabidopsi...   404   e-110
ref|XP_003544977.1| PREDICTED: aspartic proteinase PCS1-like [Gl...   403   e-110
ref|XP_006300488.1| hypothetical protein CARUB_v10020336mg [Caps...   402   e-109
ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago ...   401   e-109
ref|NP_568551.1| aspartyl protease family protein [Arabidopsis t...   400   e-109
ref|XP_004250199.1| PREDICTED: aspartic proteinase PCS1-like [So...   400   e-109

>ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 434

 Score =  430 bits (1106), Expect = e-118
 Identities = 225/375 (60%), Positives = 266/375 (70%)
 Frame = +3

Query: 99   KPLNFSLSFPLVTRPLTHQSLSSPNLQTSSLSTPIKPKPGSKSNNRNPFLYKSSFKYTMG 278
            KP N SLSF L + PL+  S +S  L +SSL++  K  P +K+ + N   Y+SSFKY+M 
Sbjct: 23   KPSNHSLSFSLTSIPLSSHSKNS--LFSSSLASQFKQNPNTKTTSYN---YRSSFKYSMA 77

Query: 279  LIVSLPIGTPPQTQQMVLDTGSQLSWIKCAPPAKLPTXXXXXXXXXXXXXXXXCNHPICK 458
            LIVSLPIGTPPQTQQMVLDTGSQLSWI+C  P K P                 CNH +CK
Sbjct: 78   LIVSLPIGTPPQTQQMVLDTGSQLSWIQCKVPPKTPPTAFDPLLSSSFSVLP-CNHSLCK 136

Query: 459  PRIPDFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLILGCTRDTSA 638
            PR+PD+T+PTSCDQNRLCHYSYFYADGT AEGNL REK TFS+SQTTPPLILGC  D+S 
Sbjct: 137  PRVPDYTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSSSQTTPPLILGCATDSSD 196

Query: 639  DEGILGMNLGRLSFTSQTKISKFSYCVPIRGSQVKSSAATGVFYLGENPNSGSFKYVDLV 818
             +GILGMNLGRLSF+S  KISKFSYCVP R SQ   S+ TG FYLG NP+S  FKYV+L+
Sbjct: 197  TQGILGMNLGRLSFSSLAKISKFSYCVPPRRSQ-SGSSPTGSFYLGPNPSSAGFKYVNLM 255

Query: 819  TFGQSQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGSGQTMIDSGSEYTFMVE 998
            T+ QSQRMPNLDP+AYT+ M GIRI GKKLNIS S FR D  G+GQT+IDSG+ +TF+V+
Sbjct: 256  TYRQSQRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPSGAGQTLIDSGTWFTFLVD 315

Query: 999  EAYVKXXXXXXXXXXGPTLKKGFVYEGVLDLCYVGDVMVIGRLVGDLXXXXXXXXXXXXX 1178
            EAY K          GP LKKG+VY G LD+C+ GD MVIGR++G++             
Sbjct: 316  EAYSK-VKEEIVKLAGPKLKKGYVYGGSLDMCFDGDAMVIGRMIGNMAFEFENGVEIVVE 374

Query: 1179 XXXXXTNVGGGVGCL 1223
                  +VGGGV CL
Sbjct: 375  REKMLADVGGGVQCL 389


>ref|XP_006372550.1| aspartyl protease family protein [Populus trichocarpa]
            gi|550319179|gb|ERP50347.1| aspartyl protease family
            protein [Populus trichocarpa]
          Length = 441

 Score =  420 bits (1080), Expect = e-115
 Identities = 219/376 (58%), Positives = 263/376 (69%), Gaps = 4/376 (1%)
 Frame = +3

Query: 108  NFSLSFPLVTRPLTHQSLSSPNLQTSSLSTPIKPKP-GSKSNNRNPFLYKSSFKYTMGLI 284
            + S SFPL + P + Q+  SPN   S +S   K     S S + +P+ Y+S FKY+M L+
Sbjct: 26   SLSFSFPLTSLPRSPQA--SPNFYPSFISQTKKASTLKSSSFSSSPYNYRSGFKYSMILL 83

Query: 285  VSLPIGTPPQTQQMVLDTGSQLSWIKCAP--PAKLPTXXXXXXXXXXXXXXXXCNHPICK 458
            VSLPIGTPPQTQQM+LDTGSQLSWI+C    P K P                 CNHP+CK
Sbjct: 84   VSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSSVFDPSLSSSFSVLPCNHPLCK 143

Query: 459  PRIPDFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLILGCTRDTSA 638
            PRIPDFT+PTSCDQNRLCHYSYFYADGTLAEGNL REKITFS SQ+TPPLILGC  ++S 
Sbjct: 144  PRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQSTPPLILGCAEESSD 203

Query: 639  DEGILGMNLGRLSFTSQTKISKFSYCVPIRGSQVKSS-AATGVFYLGENPNSGSFKYVDL 815
             +GILGMNLGRLSF SQ K++KFSYCVP R  QV+     TG FYLGENPNSG F+Y++L
Sbjct: 204  AKGILGMNLGRLSFASQAKLTKFSYCVPTR--QVRPGFTPTGSFYLGENPNSGGFRYINL 261

Query: 816  VTFGQSQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGSGQTMIDSGSEYTFMV 995
            +TF QSQRMPNLDP+AYTV M+GIRIG +KLNI  S FRPD  G+GQTMIDSGSE+T++V
Sbjct: 262  LTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGAGQTMIDSGSEFTYLV 321

Query: 996  EEAYVKXXXXXXXXXXGPTLKKGFVYEGVLDLCYVGDVMVIGRLVGDLXXXXXXXXXXXX 1175
            +EAY K          G  LKKG+VY GV D+C+ G+ + IGRL+G++            
Sbjct: 322  DEAYNK-VREEVVRLVGARLKKGYVYGGVSDMCFNGNAIEIGRLIGNMVFEFDKGVEIVV 380

Query: 1176 XXXXXXTNVGGGVGCL 1223
                   +VGGGV C+
Sbjct: 381  EKERVLADVGGGVHCV 396


>ref|XP_002305886.2| aspartyl protease family protein [Populus trichocarpa]
            gi|550340595|gb|EEE86397.2| aspartyl protease family
            protein [Populus trichocarpa]
          Length = 446

 Score =  417 bits (1071), Expect = e-114
 Identities = 223/384 (58%), Positives = 266/384 (69%), Gaps = 4/384 (1%)
 Frame = +3

Query: 84   TQET-LKPLNFSLSFPLVTRPLTHQSLSSPNLQTSSLSTPIKPKPGSKSNNRNPFLYKSS 260
            TQET LK  + S SFPL + P + Q+  SP+  +S +S   K  P  KS   +P+ Y+S 
Sbjct: 25   TQETELKNDSLSFSFPLTSLPRSPQT--SPSFYSSFISQS-KKTPALKSA-ASPYNYRSR 80

Query: 261  FKYTMGLIVSLPIGTPPQTQQMVLDTGSQLSWIKCAP--PAKLPTXXXXXXXXXXXXXXX 434
            FKY+M L+VSLPIGTPPQ+QQM+LDTGSQLSWI+C    P K P                
Sbjct: 81   FKYSMILLVSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRKPPPSTVFDPSLSSSFSVL 140

Query: 435  XCNHPICKPRIPDFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLIL 614
             CNHP+CKPRIPDFT+PTSCD NRLCHYSYFYADGTLAEGNL REKITFS SQ+TPPLIL
Sbjct: 141  PCNHPLCKPRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITFSTSQSTPPLIL 200

Query: 615  GCTRDTSADEGILGMNLGRLSFTSQTKISKFSYCVPIRGSQVKSS-AATGVFYLGENPNS 791
            GC  D S D+GILGMNLGRLSF SQ KI+KFSYCVP R  QV+     TG FYLGENPNS
Sbjct: 201  GCAEDASDDKGILGMNLGRLSFASQAKITKFSYCVPTR--QVRPGFTPTGSFYLGENPNS 258

Query: 792  GSFKYVDLVTFGQSQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGSGQTMIDS 971
              F+Y+ L+TF QSQRMPNLDP+A+TV ++GIRIG KKLNI  S FR D  G+GQ+MIDS
Sbjct: 259  AGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQSMIDS 318

Query: 972  GSEYTFMVEEAYVKXXXXXXXXXXGPTLKKGFVYEGVLDLCYVGDVMVIGRLVGDLXXXX 1151
            GSE+T++V+ AY K          GP LKKG+VY GV D+C+ G+ M IGRL+G++    
Sbjct: 319  GSEFTYLVDVAYNK-VREEVVRLAGPRLKKGYVYSGVSDMCFDGNAMEIGRLIGNMVFEF 377

Query: 1152 XXXXXXXXXXXXXXTNVGGGVGCL 1223
                           +VGGGV C+
Sbjct: 378  DKGVEIVIEKGRVLADVGGGVHCV 401


>gb|EXB95902.1| Aspartic proteinase nepenthesin-2 [Morus notabilis]
            gi|587989511|gb|EXC73811.1| Aspartic proteinase
            nepenthesin-2 [Morus notabilis]
          Length = 451

 Score =  416 bits (1068), Expect = e-113
 Identities = 220/379 (58%), Positives = 255/379 (67%), Gaps = 5/379 (1%)
 Frame = +3

Query: 105  LNFSLSFPLVTRPLTHQSLSSPNLQTSSLSTPIKPKPGSKSNNRNPFLYKSSFKYTMGLI 284
            L  SLSFPL    + H S S      +  +  I  +  +       + YK  FKY+M LI
Sbjct: 32   LPLSLSFPLTA--VRHNSSSGSESSGTFRAALIANRKQNPRLRTASYNYKLLFKYSMALI 89

Query: 285  VSLPIGTPPQTQQMVLDTGSQLSWIKC---APPAKLPTXXXXXXXXXXXXXXXXCNHPIC 455
            VSLPIGTPPQTQQMVLDTGSQLSWI+C   AP    P                 C+HP+C
Sbjct: 90   VSLPIGTPPQTQQMVLDTGSQLSWIQCDKKAPKVAPPPTASFDPSLSSTFSVLPCSHPVC 149

Query: 456  KPRIPDFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLILGCTRDTS 635
            KPRIPDFT+PTSCDQNRLCHYSYFYADGT AEGNL REK TFS S TTPP ILGC +D S
Sbjct: 150  KPRIPDFTLPTSCDQNRLCHYSYFYADGTFAEGNLVREKFTFSRSVTTPPFILGCAKDPS 209

Query: 636  ADEGILGMNLGRLSFTSQTKISKFSYCVPIRGSQVKS-SAATGVFYLGENPNSGSFKYVD 812
              +GILGMNLGRLSF SQ KI+KFSYCVP RG Q KS S  TG FYLG NPNS  FKYV+
Sbjct: 210  DSQGILGMNLGRLSFASQAKINKFSYCVPTRGRQTKSGSLPTGSFYLGNNPNSRWFKYVN 269

Query: 813  LVTFGQSQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGSGQTMIDSGSEYTFM 992
            L+TF QSQRMPNLDP+A+T+ M+GIRIG ++LNI  +VFRPD+ GSGQTMIDSGSE+TF+
Sbjct: 270  LLTFRQSQRMPNLDPLAFTLPMQGIRIGARRLNIPATVFRPDSSGSGQTMIDSGSEFTFL 329

Query: 993  VEEAYVKXXXXXXXXXXGPTLKKGFVYEGVLDLCYVG-DVMVIGRLVGDLXXXXXXXXXX 1169
            V+EAY K          GP +KKG+VY GV D+C+ G D + IGRLVGD+          
Sbjct: 330  VDEAYNK-VREEIVRLVGPRIKKGYVYGGVADMCFQGTDAVAIGRLVGDMAFEFEKGVEI 388

Query: 1170 XXXXXXXXTNVGGGVGCLS 1226
                     +VGGGV CL+
Sbjct: 389  VAPKERILADVGGGVHCLA 407


>gb|EYU45777.1| hypothetical protein MIMGU_mgv1a027161mg [Mimulus guttatus]
          Length = 454

 Score =  415 bits (1067), Expect = e-113
 Identities = 220/393 (55%), Positives = 258/393 (65%), Gaps = 22/393 (5%)
 Frame = +3

Query: 114  SLSFPLVTRPLTHQS------LSSPNLQTSSLSTPIKPKPGSKSNNRNPFLYKSSFKYTM 275
            SLSFPL + PL+  S      LSSPN   ++ +    P  G        + Y+SSFKY+M
Sbjct: 27   SLSFPLTSAPLSETSAFKAKLLSSPNKNAAAAAAASSPPLG--------YDYRSSFKYSM 78

Query: 276  GLIVSLPIGTPPQTQQMVLDTGSQLSWIKCAPPA----KLPTXXXXXXXXXXXXXXXXCN 443
             LIVSLPIGTPPQTQQMVLDTGSQLSWI+C        K P                 CN
Sbjct: 79   ALIVSLPIGTPPQTQQMVLDTGSQLSWIQCHRKTPVVRKPPPTTSFDPSLSSSFSVLPCN 138

Query: 444  HPICKPRIPDFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLILGCT 623
            HP+CKPRIPDFT+PT+CDQNRLCHYSYFYADGTLAEGNL REK TFSNSQ+TPPLILGC 
Sbjct: 139  HPLCKPRIPDFTLPTTCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSQSTPPLILGCA 198

Query: 624  RDTSADEGILGMNLGRLSFTSQTKISKFSYCVPIR-----------GSQVKSSAATGVFY 770
             D++  EGILGMNLGRLSF SQ K+ +FSYCVP+R                +   TG FY
Sbjct: 199  ADSNDAEGILGMNLGRLSFISQAKVPRFSYCVPLRRQGRVAQNINNNKNNNNINPTGAFY 258

Query: 771  LGENPNSGSFKYVDLVTFGQSQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGS 950
            +G NPNSG+F Y+ ++TF +SQR PN DP AYTVG+ GIRIGGKKLNIS +VFRPDAGGS
Sbjct: 259  IGHNPNSGTFHYISILTFPKSQRAPNFDPHAYTVGLLGIRIGGKKLNISAAVFRPDAGGS 318

Query: 951  GQTMIDSGSEYTFMVEEAYVKXXXXXXXXXXGPTLKKGFVYEGVLDLCY-VGDVMVIGRL 1127
            GQTMIDSG++YTF+V+ AY K          GP LKKGFVY G LD+C+  GD   IGRL
Sbjct: 319  GQTMIDSGTQYTFLVDAAYAK-VREEVARLAGPKLKKGFVYGGALDMCFDGGDEAEIGRL 377

Query: 1128 VGDLXXXXXXXXXXXXXXXXXXTNVGGGVGCLS 1226
            +GD+                   +VGGG+ C +
Sbjct: 378  IGDVVFEFERGVEILTNKERVLDDVGGGIRCFA 410


>ref|XP_003538390.1| PREDICTED: aspartic proteinase PCS1-like [Glycine max]
          Length = 457

 Score =  408 bits (1048), Expect = e-111
 Identities = 208/373 (55%), Positives = 252/373 (67%), Gaps = 2/373 (0%)
 Frame = +3

Query: 111  FSLSFPLVTRPLTHQSLSSPNLQTSSLSTPIKPKPGSKSNNRNPFLYKSSFKYTMGLIVS 290
            FSLSFPL +  L+  +     L+ S ++         KS   +P+ YK SFKY+M LIV 
Sbjct: 41   FSLSFPLTSLSLSTNTALKMMLRNSLIANTNNNNTQLKSPPSSPYNYKLSFKYSMALIVD 100

Query: 291  LPIGTPPQTQQMVLDTGSQLSWIKC--APPAKLPTXXXXXXXXXXXXXXXXCNHPICKPR 464
            LPIGTPPQ Q MVLDTGSQLSWI+C    PAK P                 C HP+CKPR
Sbjct: 101  LPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVCKPR 160

Query: 465  IPDFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLILGCTRDTSADE 644
            IPDFT+PTSCDQNRLCHYSYFYADGT AEGNL REK TFS S  TPPLILGC  +++   
Sbjct: 161  IPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSLFTPPLILGCATESTDPR 220

Query: 645  GILGMNLGRLSFTSQTKISKFSYCVPIRGSQVKSSAATGVFYLGENPNSGSFKYVDLVTF 824
            GILGMN GRLSF SQ+KI+KFSYCVP R ++      TG FYLG NPNS +F+Y++++TF
Sbjct: 221  GILGMNRGRLSFASQSKITKFSYCVPTRVTR-PGYTPTGSFYLGHNPNSNTFRYIEMLTF 279

Query: 825  GQSQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGSGQTMIDSGSEYTFMVEEA 1004
             +SQRMPNLDP+AYTV ++GIRIGG+KLNIS +VFR DAGGSGQTM+DSGSE+T++V EA
Sbjct: 280  ARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSGQTMLDSGSEFTYLVNEA 339

Query: 1005 YVKXXXXXXXXXXGPTLKKGFVYEGVLDLCYVGDVMVIGRLVGDLXXXXXXXXXXXXXXX 1184
            Y K          GP +KKG+VY GV D+C+ G+ + IGRL+GD+               
Sbjct: 340  YDK-VRAEVVRAVGPRMKKGYVYGGVADMCFDGNAIEIGRLIGDMVFEFEKGVQIVVPKE 398

Query: 1185 XXXTNVGGGVGCL 1223
                 V GGV C+
Sbjct: 399  RVLATVEGGVHCI 411


>ref|XP_006482080.1| PREDICTED: aspartic proteinase PCS1-like [Citrus sinensis]
          Length = 441

 Score =  407 bits (1045), Expect = e-111
 Identities = 207/371 (55%), Positives = 252/371 (67%)
 Frame = +3

Query: 111  FSLSFPLVTRPLTHQSLSSPNLQTSSLSTPIKPKPGSKSNNRNPFLYKSSFKYTMGLIVS 290
            FS+SF L++R  +H  LS P+  +S +S   + K   K        Y+S FKY+M L+VS
Sbjct: 32   FSVSFALISRRFSHDDLS-PSYYSSFVS---QTKQNRKVARAPSLRYRSKFKYSMALVVS 87

Query: 291  LPIGTPPQTQQMVLDTGSQLSWIKCAPPAKLPTXXXXXXXXXXXXXXXXCNHPICKPRIP 470
            LPIGTPPQTQ+MVLDTGSQLSWIKC   A  P                 C HP+CKPRI 
Sbjct: 88   LPIGTPPQTQEMVLDTGSQLSWIKCHKKAPAPPTTSFDPSRSSSFSVLPCTHPLCKPRIV 147

Query: 471  DFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLILGCTRDTSADEGI 650
            DFT+PT CDQNRLCHYSYFYADGT AEGNL +EK TFS +Q+T PLILGC +DTS D+GI
Sbjct: 148  DFTLPTDCDQNRLCHYSYFYADGTFAEGNLVKEKFTFSAAQSTLPLILGCAKDTSEDKGI 207

Query: 651  LGMNLGRLSFTSQTKISKFSYCVPIRGSQVKSSAATGVFYLGENPNSGSFKYVDLVTFGQ 830
            LGMNLGRLSF SQ KISKFSYCVP R S+V     TG FYLGENPNS  F+YV  +TF Q
Sbjct: 208  LGMNLGRLSFASQAKISKFSYCVPTRVSRV-GYTPTGSFYLGENPNSAGFRYVSFLTFPQ 266

Query: 831  SQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGSGQTMIDSGSEYTFMVEEAYV 1010
            SQR PNLDP+AY+V M+G+RI GK+L+I  + F PDA GSGQT++DSGSE+T++V+ AY 
Sbjct: 267  SQRSPNLDPLAYSVPMQGVRIQGKRLDIPATAFHPDASGSGQTIVDSGSEFTYLVDVAYN 326

Query: 1011 KXXXXXXXXXXGPTLKKGFVYEGVLDLCYVGDVMVIGRLVGDLXXXXXXXXXXXXXXXXX 1190
            K          GP +KKG+VY GV D+C+ G+ M +GRL+GD+                 
Sbjct: 327  K-IKEEIVRLAGPRMKKGYVYGGVADMCFDGNAMEVGRLIGDMVFEFERGVEILIEKERV 385

Query: 1191 XTNVGGGVGCL 1223
              +VGGGV C+
Sbjct: 386  LADVGGGVHCV 396


>ref|XP_006430555.1| hypothetical protein CICLE_v10011736mg [Citrus clementina]
            gi|557532612|gb|ESR43795.1| hypothetical protein
            CICLE_v10011736mg [Citrus clementina]
          Length = 441

 Score =  407 bits (1045), Expect = e-111
 Identities = 207/371 (55%), Positives = 252/371 (67%)
 Frame = +3

Query: 111  FSLSFPLVTRPLTHQSLSSPNLQTSSLSTPIKPKPGSKSNNRNPFLYKSSFKYTMGLIVS 290
            FS+SF L++R  +H  LS P+  +S +S   + K   K        Y+S FKY+M L+VS
Sbjct: 32   FSVSFALISRRFSHDDLS-PSYYSSFVS---QTKQNRKVARAPSLRYRSKFKYSMALVVS 87

Query: 291  LPIGTPPQTQQMVLDTGSQLSWIKCAPPAKLPTXXXXXXXXXXXXXXXXCNHPICKPRIP 470
            LPIGTPPQTQ+MVLDTGSQLSWIKC   A  P                 C HP+CKPRI 
Sbjct: 88   LPIGTPPQTQEMVLDTGSQLSWIKCHKKAPAPPTTSFDPSRSSSFSVLPCTHPLCKPRIV 147

Query: 471  DFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLILGCTRDTSADEGI 650
            DFT+PT CDQNRLCHYSYFYADGT AEGNL +EK TFS +Q+T PLILGC +DTS D+GI
Sbjct: 148  DFTLPTDCDQNRLCHYSYFYADGTFAEGNLVKEKFTFSAAQSTLPLILGCAKDTSEDKGI 207

Query: 651  LGMNLGRLSFTSQTKISKFSYCVPIRGSQVKSSAATGVFYLGENPNSGSFKYVDLVTFGQ 830
            LGMNLGRLSF SQ KISKFSYCVP R S+V     TG FYLGENPNS  F+YV  +TF Q
Sbjct: 208  LGMNLGRLSFASQAKISKFSYCVPTRVSRV-GYTPTGSFYLGENPNSAGFRYVSFLTFPQ 266

Query: 831  SQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGSGQTMIDSGSEYTFMVEEAYV 1010
            SQR PNLDP+AY+V M+G+RI GK+L+I  + F PDA GSGQT++DSGSE+T++V+ AY 
Sbjct: 267  SQRSPNLDPLAYSVPMQGVRIQGKRLDIPATAFHPDASGSGQTIVDSGSEFTYLVDVAYN 326

Query: 1011 KXXXXXXXXXXGPTLKKGFVYEGVLDLCYVGDVMVIGRLVGDLXXXXXXXXXXXXXXXXX 1190
            K          GP +KKG+VY GV D+C+ G+ M +GRL+GD+                 
Sbjct: 327  K-IKEEIVRLAGPRMKKGYVYGGVADMCFDGNAMEVGRLIGDMVFEFERGVEILIEKERV 385

Query: 1191 XTNVGGGVGCL 1223
              +VGGGV C+
Sbjct: 386  LADVGGGVHCV 396


>ref|XP_007032118.1| Eukaryotic aspartyl protease family protein [Theobroma cacao]
            gi|508711147|gb|EOY03044.1| Eukaryotic aspartyl protease
            family protein [Theobroma cacao]
          Length = 446

 Score =  406 bits (1043), Expect = e-110
 Identities = 218/379 (57%), Positives = 257/379 (67%), Gaps = 7/379 (1%)
 Frame = +3

Query: 108  NFSLSFPLVTRPLTHQSLSSPNLQTSSLSTPIKPKPGSKSNNR-NPFLYKSSFKYTMGLI 284
            N S+SF     PLT    S  N+QT   S  +  KP S    R + + YK++FKY+M LI
Sbjct: 32   NNSISFSF---PLTSLRFSRDNVQTLYRSL-VSTKPNSTVQPRPSSYNYKTTFKYSMALI 87

Query: 285  VSLPIGTPPQTQQMVLDTGSQLSWIKC------APPAKLPTXXXXXXXXXXXXXXXXCNH 446
            V+LPIGTPPQTQQMVLDTGSQLSWI+C       PP   P                 C H
Sbjct: 88   VALPIGTPPQTQQMVLDTGSQLSWIQCHKKVARKPP---PPPTSFDPSLSSSFSVLPCTH 144

Query: 447  PICKPRIPDFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLILGCTR 626
            P+CKPRIPDFT+PTSCDQNRLCHYSYFYADGTLAEGNL REK TFS SQ+TPPLILGC  
Sbjct: 145  PLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSRSQSTPPLILGCAT 204

Query: 627  DTSADEGILGMNLGRLSFTSQTKISKFSYCVPIRGSQVKSSAATGVFYLGENPNSGSFKY 806
            DTS D+GILGMNLGRLSF SQ KISKFSYCVP R +Q    + TG FYLGENP+S  F+Y
Sbjct: 205  DTSEDKGILGMNLGRLSFASQAKISKFSYCVPTRRTQ-PGFSPTGSFYLGENPSSRGFQY 263

Query: 807  VDLVTFGQSQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGSGQTMIDSGSEYT 986
            V+L+ F +S   PN+DP+AYT+ M+GIRIG KKL I  SVFRPDAGGSGQTMIDSGSE+T
Sbjct: 264  VNLMIFPESGTRPNMDPLAYTLPMQGIRIGAKKLPIPTSVFRPDAGGSGQTMIDSGSEFT 323

Query: 987  FMVEEAYVKXXXXXXXXXXGPTLKKGFVYEGVLDLCYVGDVMVIGRLVGDLXXXXXXXXX 1166
            ++V++AY K          GP +KKG+VY GV D+C+ G+ + IGRL+GD+         
Sbjct: 324  YLVDDAYNK-VREEVVRLVGPRIKKGYVYGGVADMCFDGNPIEIGRLIGDMVLEFEKGVE 382

Query: 1167 XXXXXXXXXTNVGGGVGCL 1223
                      +V GGV CL
Sbjct: 383  ITVEKERVLADVEGGVHCL 401


>ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223527742|gb|EEF29846.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 440

 Score =  406 bits (1043), Expect = e-110
 Identities = 209/379 (55%), Positives = 261/379 (68%), Gaps = 3/379 (0%)
 Frame = +3

Query: 96   LKPLNFSLSFPLVTRPLTHQSLSSPNLQTSSLSTPIKPKPGSKSNNRNPFLYKSSFKYTM 275
            L   +FS SFPL + P +  S  S   ++S ++   +P           + Y+SSFKY+M
Sbjct: 29   LNTSSFSFSFPLRSLPASSPSKPSSPFRSSFVAQTKQPS----------YNYRSSFKYSM 78

Query: 276  GLIVSLPIGTPPQTQQMVLDTGSQLSWIKC---APPAKLPTXXXXXXXXXXXXXXXXCNH 446
             LIVSLPIGTPPQTQQMVLDTGSQLSWI+C   + P K P                 CNH
Sbjct: 79   ALIVSLPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNH 138

Query: 447  PICKPRIPDFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLILGCTR 626
            P+CKPRIPDFT+PT+CDQNRLCHYSYFYADGT AEG+L REKITFS+SQ+TPPLILGC  
Sbjct: 139  PLCKPRIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQSTPPLILGCAE 198

Query: 627  DTSADEGILGMNLGRLSFTSQTKISKFSYCVPIRGSQVKSSAATGVFYLGENPNSGSFKY 806
             ++ ++GILGMNLGR SF SQ KISKFSYCVP R ++   S +TG FYLG NPNSG F+Y
Sbjct: 199  ASTDEKGILGMNLGRRSFASQAKISKFSYCVPTRQARAGLS-STGSFYLGNNPNSGRFQY 257

Query: 807  VDLVTFGQSQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGSGQTMIDSGSEYT 986
            ++L+TF  SQR PNLDP+AYT+ M+GIR+G  +LNIS ++FRPD  G+GQT+IDSGSE+T
Sbjct: 258  INLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRPDPSGAGQTIIDSGSEFT 317

Query: 987  FMVEEAYVKXXXXXXXXXXGPTLKKGFVYEGVLDLCYVGDVMVIGRLVGDLXXXXXXXXX 1166
            ++V+EAY K          GP LKKG+VY GV D+C+ G+ M IGRL+G++         
Sbjct: 318  YLVDEAYNK-VREEVVRLVGPKLKKGYVYGGVSDMCFDGNPMEIGRLIGNMVFEFEKGVE 376

Query: 1167 XXXXXXXXXTNVGGGVGCL 1223
                      +VGGGV C+
Sbjct: 377  IVIDKWRVLADVGGGVHCI 395


>ref|XP_006352439.1| PREDICTED: aspartic proteinase PCS1-like [Solanum tuberosum]
          Length = 431

 Score =  405 bits (1042), Expect = e-110
 Identities = 216/374 (57%), Positives = 255/374 (68%), Gaps = 4/374 (1%)
 Frame = +3

Query: 114  SLSFPLVTRPLTHQSLSSPNLQTSSLSTPIKPKPGSKSNNRNPFLYKSSFKYTMGLIVSL 293
            SLSFPL T PL+  S    N   SS +          +NN     YKS+FKY+M LIV+L
Sbjct: 30   SLSFPLTTTPLSQNSTLKSNYLFSSKAM---------TNNIPSLNYKSNFKYSMALIVTL 80

Query: 294  PIGTPPQTQQMVLDTGSQLSWIKCAP--PAKLPTXXXXXXXXXXXXXXXXCNHPICKPRI 467
            PIGTPPQ QQMVLDTGSQLSWI+C    P K P                 CNHP+CKPRI
Sbjct: 81   PIGTPPQDQQMVLDTGSQLSWIQCNKKLPKKTPPATFDPSLSSSFSVLP-CNHPLCKPRI 139

Query: 468  PDFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLILGCTRDTSADEG 647
            PDFT+PTSCDQNRLCHYSYFYADGTLAEGNL REKITF NSQTTPPLILGC  ++   EG
Sbjct: 140  PDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITFPNSQTTPPLILGCATESRDAEG 199

Query: 648  ILGMNLGRLSFTSQTKISKFSYCVPIR-GSQVKSSAATGVFYLGENPNSGSFKYVDLVTF 824
            ILGMNLGR SF SQ K+ KFSYCVP + G+++  S   G FYLG+NPNS  F+Y++L+TF
Sbjct: 200  ILGMNLGRYSFVSQAKVQKFSYCVPRKQGNKIMPS---GTFYLGQNPNSHMFQYINLLTF 256

Query: 825  GQSQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGSGQTMIDSGSEYTFMVEEA 1004
             QSQRMPN+DP+AYT+GM GI+IGGKKLNIS  VFRPDAGGSGQTMIDSG++YTF+VEEA
Sbjct: 257  PQSQRMPNMDPLAYTLGMVGIKIGGKKLNISEKVFRPDAGGSGQTMIDSGTQYTFLVEEA 316

Query: 1005 YVKXXXXXXXXXXGPTLKKGFVYEGVLDLCYVG-DVMVIGRLVGDLXXXXXXXXXXXXXX 1181
            Y K          GP LKKG+V+   LD+C+   + +  G+ +GD+              
Sbjct: 317  YSK-VRDEVVRLVGPKLKKGYVFGESLDMCFDAINSVQAGQAIGDMTLEFENGVEILINK 375

Query: 1182 XXXXTNVGGGVGCL 1223
                 +VGGGV C+
Sbjct: 376  ENVLDDVGGGVHCV 389


>ref|XP_006405864.1| hypothetical protein EUTSA_v10028350mg [Eutrema salsugineum]
            gi|557107002|gb|ESQ47317.1| hypothetical protein
            EUTSA_v10028350mg [Eutrema salsugineum]
          Length = 437

 Score =  405 bits (1041), Expect = e-110
 Identities = 215/377 (57%), Positives = 262/377 (69%), Gaps = 5/377 (1%)
 Frame = +3

Query: 108  NFSLSFPLVTRPLTH--QSLSSPNLQTSSLSTPIKPKPGSKSNNRNPFLYKSSFKYTMGL 281
            + SL FPL +  LT    S SS +  +SS  T +  +   ++ +  P+ ++S+FKY+M L
Sbjct: 21   SLSLHFPLKSLRLTPTTNSSSSSSSSSSSFQTSLASR---RTPSSLPYSFRSNFKYSMAL 77

Query: 282  IVSLPIGTPPQTQQMVLDTGSQLSWIKCAPPAKL--PTXXXXXXXXXXXXXXXXCNHPIC 455
            I+SLPIGTP QTQ++VLDTGSQLSWI+C P  K   PT                C+HP+C
Sbjct: 78   ILSLPIGTPAQTQELVLDTGSQLSWIQCHPKKKKKKPTTSFDPSLSSSFSDLP-CSHPLC 136

Query: 456  KPRIPDFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLILGCTRDTS 635
            KPRIPDFT+PT+CD NRLCHYSYFYADGT AEGNL +EK TFSN+Q TPPLILGC  +++
Sbjct: 137  KPRIPDFTLPTTCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNTQITPPLILGCAAEST 196

Query: 636  ADEGILGMNLGRLSFTSQTKISKFSYCVPIRGSQVKSSAATGVFYLGENPNSGSFKYVDL 815
             D+GILGMNLGRLSF SQ KISKFSYC+P R +Q    ++TG FYLGENP+S  FKYV L
Sbjct: 197  DDKGILGMNLGRLSFVSQAKISKFSYCIPTRSNQ-PGLSSTGSFYLGENPSSRGFKYVSL 255

Query: 816  VTFGQSQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGSGQTMIDSGSEYTFMV 995
            +TF QSQRMPNLDP+AYTV ++GIRIG K+LNIS SVFRPDAGGSGQTM+DSGSE+T +V
Sbjct: 256  LTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNISASVFRPDAGGSGQTMVDSGSEFTHLV 315

Query: 996  EEAYVKXXXXXXXXXXGPTLKKGFVYEGVLDLCYVGDVMV-IGRLVGDLXXXXXXXXXXX 1172
            + AY K          GP LKKG+VY    D+C+ G+  V IGRL+GDL           
Sbjct: 316  DVAYDK-VKEEIVRLVGPRLKKGYVYGATADMCFDGNNPVEIGRLIGDLVFEFGRGVEIL 374

Query: 1173 XXXXXXXTNVGGGVGCL 1223
                    NVGGGV CL
Sbjct: 375  VEKQRLLVNVGGGVHCL 391


>ref|XP_006391353.1| hypothetical protein EUTSA_v10018599mg [Eutrema salsugineum]
            gi|557087787|gb|ESQ28639.1| hypothetical protein
            EUTSA_v10018599mg [Eutrema salsugineum]
          Length = 427

 Score =  405 bits (1041), Expect = e-110
 Identities = 211/373 (56%), Positives = 253/373 (67%)
 Frame = +3

Query: 105  LNFSLSFPLVTRPLTHQSLSSPNLQTSSLSTPIKPKPGSKSNNRNPFLYKSSFKYTMGLI 284
            L+ SLSF    + L H S ++ +  TS +S    P P S       + ++S FKY+M LI
Sbjct: 21   LSSSLSFHFPLKSL-HISPTTTHFTTSLISRR-NPSPSS-------YNFRSKFKYSMALI 71

Query: 285  VSLPIGTPPQTQQMVLDTGSQLSWIKCAPPAKLPTXXXXXXXXXXXXXXXXCNHPICKPR 464
            +SLPIGTPPQ QQMVLDTGSQLSWI+C      P                 C+HP+CKPR
Sbjct: 72   ISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKIPPPKTSFDPSLSSSFSDLPCSHPLCKPR 131

Query: 465  IPDFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLILGCTRDTSADE 644
            IPDFT+PTSCD NRLCHYSYFYADGT AEGNL +EKITFSN+++TPPLILGC  ++S D 
Sbjct: 132  IPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTESTPPLILGCATESSEDR 191

Query: 645  GILGMNLGRLSFTSQTKISKFSYCVPIRGSQVKSSAATGVFYLGENPNSGSFKYVDLVTF 824
            GILGMN GRLSF SQ KISKFSYC+P + ++      TG FYLGENPNS  F+YV L+TF
Sbjct: 192  GILGMNRGRLSFVSQAKISKFSYCIPPKSNR-PGLIPTGSFYLGENPNSHGFRYVSLLTF 250

Query: 825  GQSQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGSGQTMIDSGSEYTFMVEEA 1004
             +SQRMPNLDP+AYTV M GIRIG K+LNIS SVFRPDAGGSGQTM+DSGSE+T +V+ A
Sbjct: 251  PESQRMPNLDPLAYTVPMVGIRIGQKRLNISGSVFRPDAGGSGQTMVDSGSEFTHLVDAA 310

Query: 1005 YVKXXXXXXXXXXGPTLKKGFVYEGVLDLCYVGDVMVIGRLVGDLXXXXXXXXXXXXXXX 1184
            Y K          GP LKKG+VY G  D+C+ G+ MVI RL+GD+               
Sbjct: 311  YEK-VREEIVRLVGPKLKKGYVYGGTADMCFAGNPMVIQRLIGDIVFELRRGVEILVPKE 369

Query: 1185 XXXTNVGGGVGCL 1223
                NVGGG+ C+
Sbjct: 370  RVLANVGGGIHCI 382


>ref|XP_004289585.1| PREDICTED: aspartic proteinase PCS1-like [Fragaria vesca subsp.
            vesca]
          Length = 431

 Score =  404 bits (1038), Expect = e-110
 Identities = 215/375 (57%), Positives = 251/375 (66%), Gaps = 2/375 (0%)
 Frame = +3

Query: 108  NFSLSFPLVTRPLTHQSLSSPNLQTSSLSTPIKPKPGSKSNNRNPFLYKSSFKYTMGLIV 287
            N SLSFPL+  P      S P   T+SL   +  +P +K+ N     YK  FKY+M  +V
Sbjct: 30   NLSLSFPLLFTPH-----SRPPSTTNSLHDSLISRPNNKAYN-----YKLPFKYSMAPVV 79

Query: 288  SLPIGTPPQTQQMVLDTGSQLSWIKCAP--PAKLPTXXXXXXXXXXXXXXXXCNHPICKP 461
            SLPIGTPPQTQQMVLDTGSQLSWI+C    P   P                 CNHPICKP
Sbjct: 80   SLPIGTPPQTQQMVLDTGSQLSWIQCHKKLPRPSPPAPMFDPSLSSTFSVLPCNHPICKP 139

Query: 462  RIPDFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLILGCTRDTSAD 641
            R+PDFT+PTSCDQNRLCHYSYFYADGTLAEGNL REK TFS + +TPPL LGC +DTS  
Sbjct: 140  RVPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSRATSTPPLTLGCAKDTSDT 199

Query: 642  EGILGMNLGRLSFTSQTKISKFSYCVPIRGSQVKSSAATGVFYLGENPNSGSFKYVDLVT 821
            +GILGMNLGRLSF SQ KI+KFSYC+P R     S+  TG FYLG NPNS +F+YVDL+T
Sbjct: 200  KGILGMNLGRLSFPSQAKITKFSYCIPAR---TGSAFPTGAFYLGNNPNSAAFRYVDLLT 256

Query: 822  FGQSQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGSGQTMIDSGSEYTFMVEE 1001
            FGQSQRMPNLDP+AYTV M G+RIGGK+L+I  SVFRPDA GSGQTM+DSGSE T+ V+E
Sbjct: 257  FGQSQRMPNLDPLAYTVVMVGVRIGGKRLSILPSVFRPDASGSGQTMVDSGSELTYFVDE 316

Query: 1002 AYVKXXXXXXXXXXGPTLKKGFVYEGVLDLCYVGDVMVIGRLVGDLXXXXXXXXXXXXXX 1181
            AY K          GP LKKG+VY  V D+C+  +   IG L+GD+              
Sbjct: 317  AY-KKVKEEIVRLVGPKLKKGYVYGNVADMCFDSN---IGPLIGDMVLEFDKGAEIVIGK 372

Query: 1182 XXXXTNVGGGVGCLS 1226
                 NV G V C++
Sbjct: 373  EQMLHNVEGKVWCVA 387


>ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297314363|gb|EFH44786.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 444

 Score =  404 bits (1037), Expect = e-110
 Identities = 217/378 (57%), Positives = 261/378 (69%), Gaps = 6/378 (1%)
 Frame = +3

Query: 108  NFSLSFPLVTRPLTHQSLSSPNLQTSSLSTPIKPKPGSKSNNRNPFLYKSSFKYTMGLIV 287
            + SL FPL +  LT  + SS + +TS LS    P P S     +P+ ++S+FKY+M LI+
Sbjct: 31   SLSLHFPLTSLRLTPTTNSS-SFKTSLLSRR-NPSPSS-----SPYTFRSNFKYSMALIL 83

Query: 288  SLPIGTPPQTQQMVLDTGSQLSWIKCAP-----PAKLPTXXXXXXXXXXXXXXXXCNHPI 452
            SLPIGTP Q+Q++VLDTGSQLSWI+C P     P   PT                C+HP+
Sbjct: 84   SLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLP-CSHPL 142

Query: 453  CKPRIPDFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLILGCTRDT 632
            CKPRIPDFT+PTSCD NRLCHYSYFYADGT AEGNL +EK TFSNSQTTPPLILGC +++
Sbjct: 143  CKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPLILGCAKES 202

Query: 633  SADEGILGMNLGRLSFTSQTKISKFSYCVPIRGSQVKSSAATGVFYLGENPNSGSFKYVD 812
            +  +GILGMNLGRLSF SQ KISKFSYC+P R ++    A+TG FYLGENPNS  FKYV 
Sbjct: 203  TDVKGILGMNLGRLSFISQAKISKFSYCIPTRSNR-PGLASTGSFYLGENPNSRGFKYVS 261

Query: 813  LVTFGQSQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGSGQTMIDSGSEYTFM 992
            L+TF QSQRMPNLDP+AYTV + GIRIG K+LNI  SVFRPDAGGSGQTM+DSGSE+T +
Sbjct: 262  LLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGGSGQTMVDSGSEFTHL 321

Query: 993  VEEAYVKXXXXXXXXXXGPTLKKGFVYEGVLDLCYVGD-VMVIGRLVGDLXXXXXXXXXX 1169
            V+ AY K          G  LKKG+VY    D+C+ G+  MVIGRL+GDL          
Sbjct: 322  VDVAYDK-VKEEIVRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRLIGDLVFEFGRGVEI 380

Query: 1170 XXXXXXXXTNVGGGVGCL 1223
                     NVGGG+ C+
Sbjct: 381  LVEKQRLLVNVGGGIHCV 398


>ref|XP_003544977.1| PREDICTED: aspartic proteinase PCS1-like [Glycine max]
          Length = 445

 Score =  403 bits (1036), Expect = e-110
 Identities = 211/372 (56%), Positives = 255/372 (68%)
 Frame = +3

Query: 108  NFSLSFPLVTRPLTHQSLSSPNLQTSSLSTPIKPKPGSKSNNRNPFLYKSSFKYTMGLIV 287
            + SLSFPL + PL   S + P      L+T  K +  S S++ N    KSSFKY+M L+V
Sbjct: 43   SLSLSFPLTSLPL---STAKP------LNTNPKLRTLSSSSSYN---IKSSFKYSMALVV 90

Query: 288  SLPIGTPPQTQQMVLDTGSQLSWIKCAPPAKLPTXXXXXXXXXXXXXXXXCNHPICKPRI 467
            +LPIGTPPQ QQMVLDTGSQLSWI+C    K P                 C HP+CKPR+
Sbjct: 91   TLPIGTPPQPQQMVLDTGSQLSWIQCHN--KTPPTASFDPSLSSSFYVLPCTHPLCKPRV 148

Query: 468  PDFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLILGCTRDTSADEG 647
            PDFT+PT+CDQNRLCHYSYFYADGT AEGNL REK+ FS SQTTPPLILGC+ ++    G
Sbjct: 149  PDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPSQTTPPLILGCSSESRDARG 208

Query: 648  ILGMNLGRLSFTSQTKISKFSYCVPIRGSQVKSSAATGVFYLGENPNSGSFKYVDLVTFG 827
            ILGMNLGRLSF  Q K++KFSYCVP R     ++  TG FYLG NPNS  F+YV ++TF 
Sbjct: 209  ILGMNLGRLSFPFQAKVTKFSYCVPTRQPANNNNFPTGSFYLGNNPNSARFRYVSMLTFP 268

Query: 828  QSQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGSGQTMIDSGSEYTFMVEEAY 1007
            QSQRMPNLDP+AYTV M+GIRIGG+KLNI  SVFRP+AGGSGQTM+DSGSE+TF+V+ AY
Sbjct: 269  QSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVFRPNAGGSGQTMVDSGSEFTFLVDVAY 328

Query: 1008 VKXXXXXXXXXXGPTLKKGFVYEGVLDLCYVGDVMVIGRLVGDLXXXXXXXXXXXXXXXX 1187
             +          GP +KKG+VY GV D+C+ G+ M IGRL+GD+                
Sbjct: 329  DR-VREEIIRVLGPRVKKGYVYGGVADMCFDGNAMEIGRLLGDVAFEFEKGVEIVVPKER 387

Query: 1188 XXTNVGGGVGCL 1223
               +VGGGV C+
Sbjct: 388  VLADVGGGVHCV 399


>ref|XP_006300488.1| hypothetical protein CARUB_v10020336mg [Capsella rubella]
            gi|482569198|gb|EOA33386.1| hypothetical protein
            CARUB_v10020336mg [Capsella rubella]
          Length = 427

 Score =  402 bits (1034), Expect = e-109
 Identities = 207/372 (55%), Positives = 250/372 (67%)
 Frame = +3

Query: 108  NFSLSFPLVTRPLTHQSLSSPNLQTSSLSTPIKPKPGSKSNNRNPFLYKSSFKYTMGLIV 287
            + SL FPL + PL   +       T+SL +   P P S     +P+ ++S FKY+M L +
Sbjct: 23   SLSLHFPLTSLPLPPSTTF-----TTSLLSRKNPSPSS-----HPYNFRSRFKYSMALTI 72

Query: 288  SLPIGTPPQTQQMVLDTGSQLSWIKCAPPAKLPTXXXXXXXXXXXXXXXXCNHPICKPRI 467
            SLPIGTPPQ QQMVLDTGSQLSWI+C    KLP                 C+HP+CKPRI
Sbjct: 73   SLPIGTPPQAQQMVLDTGSQLSWIQCHRKKKLPPKTSFDPSLSSSFSTLPCSHPLCKPRI 132

Query: 468  PDFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLILGCTRDTSADEG 647
            PDFT+PTSCD NRLCHYSYFYADGT AEGNL +E+ITFSN++ TPPLILGC  ++S D G
Sbjct: 133  PDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKERITFSNTEITPPLILGCATESSEDRG 192

Query: 648  ILGMNLGRLSFTSQTKISKFSYCVPIRGSQVKSSAATGVFYLGENPNSGSFKYVDLVTFG 827
            ILGMN GRLSF SQ K+S+FSYC+P + ++      TG FYLG+NPNS  FKYV L+TF 
Sbjct: 193  ILGMNRGRLSFISQAKVSRFSYCIPPKSNR-PGFTPTGSFYLGDNPNSHGFKYVSLLTFP 251

Query: 828  QSQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGSGQTMIDSGSEYTFMVEEAY 1007
            +SQRMPNLDP+AYTV M GI+ G KKLNIS SVFRPDAGGSGQTM+DSGSE+T +V+ AY
Sbjct: 252  ESQRMPNLDPLAYTVPMIGIKFGLKKLNISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAY 311

Query: 1008 VKXXXXXXXXXXGPTLKKGFVYEGVLDLCYVGDVMVIGRLVGDLXXXXXXXXXXXXXXXX 1187
                        G  LKKG+VY G  D+C+ G+V +I RL+GDL                
Sbjct: 312  -DAVRGEVVRRVGRRLKKGYVYGGTADMCFDGNVAMIQRLIGDLVFEFIRGVEILVPKER 370

Query: 1188 XXTNVGGGVGCL 1223
               NVGGGV C+
Sbjct: 371  VLVNVGGGVHCV 382


>ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
            gi|355517589|gb|AES99212.1| Aspartic proteinase
            nepenthesin-1 [Medicago truncatula]
          Length = 442

 Score =  401 bits (1030), Expect = e-109
 Identities = 211/377 (55%), Positives = 255/377 (67%), Gaps = 5/377 (1%)
 Frame = +3

Query: 108  NFSLSFPLVTRPLTHQSLSSPNLQTSSLSTPIKPKPGSKSNNRNPFLYKSSFKYTMGLIV 287
            +FSLSFPL +  ++  S +  N Q ++LS+       S S++ N    KSSFKY+M L+V
Sbjct: 35   SFSLSFPLTSLQISTNSKTKTNQQFTTLSS-------SSSSSIN---VKSSFKYSMALVV 84

Query: 288  SLPIGTPPQTQQMVLDTGSQLSWIKC----APPAKLP-TXXXXXXXXXXXXXXXXCNHPI 452
            +LPIGTPPQ QQMVLDTGSQLSWI+C     P  K P T                CNHP+
Sbjct: 85   TLPIGTPPQLQQMVLDTGSQLSWIQCHNKKTPQKKQPPTTSSFDPSLSSSFFVLPCNHPL 144

Query: 453  CKPRIPDFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLILGCTRDT 632
            CKPR+PDF++PT CD N LCHYSYFYADGT AEGNL REKI FS SQTTPP+ILGC   +
Sbjct: 145  CKPRVPDFSLPTDCDANSLCHYSYFYADGTYAEGNLVREKIAFSPSQTTPPIILGCATQS 204

Query: 633  SADEGILGMNLGRLSFTSQTKISKFSYCVPIRGSQVKSSAATGVFYLGENPNSGSFKYVD 812
                GILGMNLGRL F SQ KI+KFSYCVP + +Q     A+G FYLG NP S SF+YV+
Sbjct: 205  DDARGILGMNLGRLGFPSQAKITKFSYCVPTKQAQ----PASGSFYLGNNPASSSFRYVN 260

Query: 813  LVTFGQSQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGSGQTMIDSGSEYTFM 992
            L+TFGQSQRMPNLDP+AYT+ ++GI IGGKKLNI  SVF+P+AGGSGQTMIDSGSE+T++
Sbjct: 261  LLTFGQSQRMPNLDPLAYTLPLQGISIGGKKLNIPPSVFKPNAGGSGQTMIDSGSEFTYL 320

Query: 993  VEEAYVKXXXXXXXXXXGPTLKKGFVYEGVLDLCYVGDVMVIGRLVGDLXXXXXXXXXXX 1172
            V+EAY            GP +KKG++Y GV D+C+ GD + IGRLVGD+           
Sbjct: 321  VDEAY-NVIREELVKKVGPKIKKGYMYGGVADICFDGDAIEIGRLVGDMVFEFEKGVQIV 379

Query: 1173 XXXXXXXTNVGGGVGCL 1223
                     V GGV CL
Sbjct: 380  IPKERVLATVDGGVHCL 396


>ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
            gi|10177438|dbj|BAB10671.1| unnamed protein product
            [Arabidopsis thaliana] gi|15809850|gb|AAL06853.1|
            AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
            gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis
            thaliana] gi|23197748|gb|AAN15401.1| unknown protein
            [Arabidopsis thaliana] gi|332006821|gb|AED94204.1|
            aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score =  400 bits (1029), Expect = e-109
 Identities = 214/378 (56%), Positives = 261/378 (69%), Gaps = 6/378 (1%)
 Frame = +3

Query: 108  NFSLSFPLVTRPLTHQSLSSPNLQTSSLSTPIKPKPGSKSNNRNPFLYKSSFKYTMGLIV 287
            + SL FPL +  LT  + SS + +TS LS    P P S     +P+ ++S+ KY+M LI+
Sbjct: 30   SLSLHFPLTSLRLTPTTNSS-SFKTSLLSRR-NPSPPS-----SPYTFRSNIKYSMALIL 82

Query: 288  SLPIGTPPQTQQMVLDTGSQLSWIKCAP-----PAKLPTXXXXXXXXXXXXXXXXCNHPI 452
            SLPIGTP Q+Q++VLDTGSQLSWI+C P     P   PT                C+HP+
Sbjct: 83   SLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLP-CSHPL 141

Query: 453  CKPRIPDFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLILGCTRDT 632
            CKPRIPDFT+PTSCD NRLCHYSYFYADGT AEGNL +EK TFSNSQTTPPLILGC +++
Sbjct: 142  CKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPLILGCAKES 201

Query: 633  SADEGILGMNLGRLSFTSQTKISKFSYCVPIRGSQVKSSAATGVFYLGENPNSGSFKYVD 812
            + ++GILGMNLGRLSF SQ KISKFSYC+P R ++    A+TG FYLG+NPNS  FKYV 
Sbjct: 202  TDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNR-PGLASTGSFYLGDNPNSRGFKYVS 260

Query: 813  LVTFGQSQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGSGQTMIDSGSEYTFM 992
            L+TF QSQRMPNLDP+AYTV ++GIRIG K+LNI  SVFRPDAGGSGQTM+DSGSE+T +
Sbjct: 261  LLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTMVDSGSEFTHL 320

Query: 993  VEEAYVKXXXXXXXXXXGPTLKKGFVYEGVLDLCYVGD-VMVIGRLVGDLXXXXXXXXXX 1169
            V+ AY K          G  LKKG+VY    D+C+ G+  M IGRL+GDL          
Sbjct: 321  VDVAYDK-VKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLIGDLVFEFGRGVEI 379

Query: 1170 XXXXXXXXTNVGGGVGCL 1223
                     NVGGG+ C+
Sbjct: 380  LVEKQSLLVNVGGGIHCV 397


>ref|XP_004250199.1| PREDICTED: aspartic proteinase PCS1-like [Solanum lycopersicum]
          Length = 432

 Score =  400 bits (1028), Expect = e-109
 Identities = 215/375 (57%), Positives = 255/375 (68%), Gaps = 5/375 (1%)
 Frame = +3

Query: 114  SLSFPLVTRPLTHQSLSSPNLQTSSLSTPIKPKPGSKSNNRNPFL-YKSSFKYTMGLIVS 290
            SLSFPL T PL+  S    N   SS +          +NN  P L YKS+FKY+M LIV+
Sbjct: 30   SLSFPLTTTPLSQNSTLKSNYLFSSKAM---------TNNIIPSLNYKSNFKYSMALIVT 80

Query: 291  LPIGTPPQTQQMVLDTGSQLSWIKCAP--PAKLPTXXXXXXXXXXXXXXXXCNHPICKPR 464
            LPIGTPPQ QQMVLDTGSQLSWI+C    P K P                 CNHP+CKPR
Sbjct: 81   LPIGTPPQDQQMVLDTGSQLSWIQCNKKLPKKTPPTTFDPSLSSSFSVLP-CNHPLCKPR 139

Query: 465  IPDFTIPTSCDQNRLCHYSYFYADGTLAEGNLAREKITFSNSQTTPPLILGCTRDTSADE 644
            IPDFT+PTSCDQNRLCHYSYFYADGTLAEGNL REKITF NSQTTPPLILGC  ++   E
Sbjct: 140  IPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITFPNSQTTPPLILGCATESRDAE 199

Query: 645  GILGMNLGRLSFTSQTKISKFSYCVPIR-GSQVKSSAATGVFYLGENPNSGSFKYVDLVT 821
            GILGMNLGR SF SQ K+ KFSYCVP + G+++  S   G FYLG+NPNS  F+Y++L+T
Sbjct: 200  GILGMNLGRYSFVSQAKVQKFSYCVPHKQGNKIMPS---GTFYLGQNPNSHRFQYINLLT 256

Query: 822  FGQSQRMPNLDPMAYTVGMEGIRIGGKKLNISRSVFRPDAGGSGQTMIDSGSEYTFMVEE 1001
            F QSQ MPN+DP+AYT+GM GI++GGK+LNIS  VFRPDAGGSGQTMIDSG++YTF+VEE
Sbjct: 257  FPQSQSMPNMDPLAYTLGMVGIKMGGKRLNISEKVFRPDAGGSGQTMIDSGTQYTFLVEE 316

Query: 1002 AYVKXXXXXXXXXXGPTLKKGFVYEGVLDLCYVG-DVMVIGRLVGDLXXXXXXXXXXXXX 1178
            AY K          GP LKKG+VY   LD+C+   + +   + +GD+             
Sbjct: 317  AYSK-VRDEVVRLVGPKLKKGYVYGESLDMCFDAINSVQASQAIGDMTLEFENGVEIVIN 375

Query: 1179 XXXXXTNVGGGVGCL 1223
                  +VGGGV C+
Sbjct: 376  KENVLDDVGGGVHCV 390


Top