BLASTX nr result

ID: Akebia24_contig00007443 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00007443
         (1555 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor,...   426   e-116
ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35...   417   e-114
ref|XP_006437358.1| hypothetical protein CICLE_v10033646mg [Citr...   416   e-113
ref|XP_006484733.1| PREDICTED: aspartic proteinase CDR1-like [Ci...   414   e-113
ref|XP_006361597.1| PREDICTED: aspartic proteinase CDR1-like [So...   412   e-112
ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35...   412   e-112
ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35...   410   e-111
ref|XP_004289322.1| PREDICTED: probable aspartic protease At2g35...   409   e-111
ref|XP_002304395.1| hypothetical protein POPTR_0003s10440g [Popu...   406   e-110
ref|XP_007210699.1| hypothetical protein PRUPE_ppa025167mg [Prun...   405   e-110
ref|XP_002320947.1| aspartyl protease family protein [Populus tr...   404   e-110
ref|XP_007029843.1| Eukaryotic aspartyl protease family protein,...   401   e-109
ref|XP_004244685.1| PREDICTED: aspartic proteinase CDR1-like [So...   399   e-108
ref|XP_006285435.1| hypothetical protein CARUB_v10006851mg [Caps...   399   e-108
ref|XP_007022588.1| Eukaryotic aspartyl protease family protein,...   398   e-108
ref|XP_006393532.1| hypothetical protein EUTSA_v10012077mg, part...   397   e-108
ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor,...   397   e-108
ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35...   394   e-107
ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp....   393   e-106
gb|ABK28718.1| unknown [Arabidopsis thaliana]                         393   e-106

>ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223536720|gb|EEF38361.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 439

 Score =  426 bits (1096), Expect = e-116
 Identities = 218/434 (50%), Positives = 282/434 (64%), Gaps = 6/434 (1%)
 Frame = +1

Query: 103  IVLAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSI 282
            + L  +V +IFS    L  + A   GF++ELI+ +SP SPFYNP +T + RI  A R S+
Sbjct: 5    VSLLAIVTLIFS--GTLVPIDAAKDGFTVELINRDSPKSPFYNPRETPTQRIVSAVRRSM 62

Query: 283  ARTNRFRXXXXTNL--DTIRSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKP 456
            +R + F     +++  DT +S++I N G YLMK S+GTP  ++LAIADTGSDLIWTQCKP
Sbjct: 63   SRVHHFSPTKNSDIFTDTAQSEMISNQGEYLMKFSLGTPAFDILAIADTGSDLIWTQCKP 122

Query: 457  CQECYEQDANLFDPIQSSTYREISCQSDYCQSLPR-AHC-GNTSQDCQYLYSYGDESHTN 630
            C +CYEQDA LFDP  SSTYR+ISC +  C  L   A C G  ++ C Y YSYGD S T+
Sbjct: 123  CDQCYEQDAPLFDPKSSSTYRDISCSTKQCDLLKEGASCSGEGNKTCHYSYSYGDRSFTS 182

Query: 631  GILATETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKI 810
            G +A +T T  ST+GRP+ +P  + GCGHNN G+F                        I
Sbjct: 183  GNVAADTITLGSTSGRPVLLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQLGSTI 242

Query: 811  EGKFSYCLVPMSEKHT--SKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVGNI 984
            +GKFSYCLVP+S   T  SKLNFGS  ++SG GV STPL+ KDPDT+Y+LTLE +SVG+ 
Sbjct: 243  DGKFSYCLVPLSSNATNSSKLNFGSNGIVSGGGVQSTPLISKDPDTFYFLTLEAVSVGSE 302

Query: 985  RIENNKISLDQEEGNIVIDSGTTLTILDEALYTNLESTVKEAINLDTSPDPEGTFGLCYS 1164
            RI+    S    EGNI+IDSGTTLT+  E  ++ L S V++A+      DP G   LCYS
Sbjct: 303  RIKFPGSSFGTSEGNIIIDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGILSLCYS 362

Query: 1165 TKSDIKVPEFTFHFTGADLQLNEMNTFIKVSDDLVCLSMVPAESLSIFGNLAQINFQVEY 1344
              +D+K P  T HF GAD++LN +NTF++VSD ++C +  P  S +IFGNLAQ+NF V Y
Sbjct: 363  IDADLKFPSITAHFDGADVKLNPLNTFVQVSDTVLCFAFNPINSGAIFGNLAQMNFLVGY 422

Query: 1345 DLVGKKVSFTPADC 1386
            DL GK VSF P DC
Sbjct: 423  DLEGKTVSFKPTDC 436


>ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis vinifera]
          Length = 444

 Score =  417 bits (1071), Expect = e-114
 Identities = 212/434 (48%), Positives = 279/434 (64%), Gaps = 8/434 (1%)
 Frame = +1

Query: 109  LAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIAR 288
            LA +++I FS   +    +A+I GF+ + I  +SP SPFYNPS+T   R++KA R SI R
Sbjct: 13   LAIIILIHFSEHSH---AEAKIDGFTTDFISRDSPHSPFYNPSETKYQRLQKAFRRSILR 69

Query: 289  TNRFRXXXXTNLDTIRSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQEC 468
             N FR    +  D I+SDVI   G+YLM +S+GTPP+ +L IADTGSDLIW QC PC  C
Sbjct: 70   GNHFRAMRASPND-IQSDVISGGGAYLMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNC 128

Query: 469  YEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCGNTSQDCQYLYSYGDESHTNGILATE 648
            YEQ   LFDP +S TY+ + C +++CQ L +    +    C Y YSYGD S+T G L+++
Sbjct: 129  YEQVEPLFDPKESETYKTLDCDNEFCQDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSD 188

Query: 649  TFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFSY 828
            T T  ST G P + P + FGCGH+N GTF                       ++ G+FSY
Sbjct: 189  TLTIGSTEGDPASFPGIAFGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVGGQFSY 248

Query: 829  CLVPMSEKHT--SKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVGNIRI---- 990
            CLVP+S   T  SK+NFG   V+SG G VSTPL+   PDT+YYLTLEG+SVG+  +    
Sbjct: 249  CLVPLSSDSTVSSKINFGKSGVVSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKG 308

Query: 991  --ENNKISLDQEEGNIVIDSGTTLTILDEALYTNLESTVKEAINLDTSPDPEGTFGLCYS 1164
              EN       EEGNI+IDSGTTLT+L +  YT++ES +  AI   T+ DP G F LCYS
Sbjct: 309  FSENKSSPAAVEEGNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSLCYS 368

Query: 1165 TKSDIKVPEFTFHFTGADLQLNEMNTFIKVSDDLVCLSMVPAESLSIFGNLAQINFQVEY 1344
            + +++++P  T HFTGAD+QL  +NTF++V +DLVC SM+P+ +L+IFGNLAQINF V Y
Sbjct: 369  SVNNLEIPTITAHFTGADVQLPPLNTFVQVQEDLVCFSMIPSSNLAIFGNLAQINFLVGY 428

Query: 1345 DLVGKKVSFTPADC 1386
            DL   KVSF   DC
Sbjct: 429  DLKNNKVSFKQTDC 442


>ref|XP_006437358.1| hypothetical protein CICLE_v10033646mg [Citrus clementina]
            gi|557539554|gb|ESR50598.1| hypothetical protein
            CICLE_v10033646mg [Citrus clementina]
          Length = 426

 Score =  416 bits (1069), Expect = e-113
 Identities = 216/437 (49%), Positives = 279/437 (63%), Gaps = 1/437 (0%)
 Frame = +1

Query: 85   MATTTPIVLAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRK 264
            MAT   + ++F+++ + SL     + +A+ GGFS++LI  ++P SPFY+P +TY  R+ K
Sbjct: 1    MATVNALAISFLILCLSSL----SITEAK-GGFSLDLIRRDAPKSPFYSPDETYHQRVTK 55

Query: 265  AARHSIARTNRFRXXXXTNLDTIRSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWT 444
            A + S+ R + F     T  +T ++D+I   G Y+M +SIGTPP+E+LAIADTGSDLIWT
Sbjct: 56   ALKRSVNRVSHFDPAIITP-NTAQADIISALGEYVMNISIGTPPVEILAIADTGSDLIWT 114

Query: 445  QCKPCQECYEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCGNTSQDCQYLYSYGDESH 624
            QCKPC ECY+Q A  FDP QSSTY+++SC S  C +  R  C +T + C+Y  +YGD S 
Sbjct: 115  QCKPCTECYKQAAPFFDPEQSSTYKDLSCDSRQCTAYERTSC-STEEICEYSATYGDRSF 173

Query: 625  TNGILATETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXX 804
            +NG LA ET T  ST GRP+A+  L+FGCGHN+ GTF                       
Sbjct: 174  SNGNLAVETVTLGSTNGRPVALRNLIFGCGHNDDGTFNENATGIVGLGGGSVSLVTQMGS 233

Query: 805  KIEGKFSYCLVP-MSEKHTSKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVGN 981
             I GKFSYCLVP +S + +SK+NFGS  V+SG GVV+TPLV KDPDT+Y+LTLE ISVG 
Sbjct: 234  SIGGKFSYCLVPFLSSESSSKINFGSNGVVSGTGVVTTPLVAKDPDTFYFLTLESISVGK 293

Query: 982  IRIENNKISLDQEEGNIVIDSGTTLTILDEALYTNLESTVKEAINLDTSPDPEGTFGLCY 1161
             +I  +    D  EGNI+IDSGTTLT L   + + L S V + I  D   DPEG   LCY
Sbjct: 294  KKIHFD----DASEGNIIIDSGTTLTFLPPDIVSKLTSAVSDLIKADPISDPEGVLDLCY 349

Query: 1162 STKSDIKVPEFTFHFTGADLQLNEMNTFIKVSDDLVCLSMVPAESLSIFGNLAQINFQVE 1341
               SD K P+ T HF+GAD+ L+  NTFI+ SD  VC +    E  SI+GNLAQ NF V 
Sbjct: 350  PYSSDFKAPQITVHFSGADVVLSPENTFIRTSDTTVCFTFKGMEGQSIYGNLAQANFLVG 409

Query: 1342 YDLVGKKVSFTPADCIK 1392
            YD   K VSF P DC K
Sbjct: 410  YDTKAKTVSFKPTDCSK 426


>ref|XP_006484733.1| PREDICTED: aspartic proteinase CDR1-like [Citrus sinensis]
          Length = 426

 Score =  414 bits (1064), Expect = e-113
 Identities = 215/437 (49%), Positives = 278/437 (63%), Gaps = 1/437 (0%)
 Frame = +1

Query: 85   MATTTPIVLAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRK 264
            MAT     ++F+++ + SL     + +A+ GGFS++LI  ++P SPFY+P +TY  R+ K
Sbjct: 1    MATVNASAISFLILCLSSL----SITEAK-GGFSLDLIRRDAPKSPFYSPDETYHQRVTK 55

Query: 265  AARHSIARTNRFRXXXXTNLDTIRSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWT 444
            A + S+ R + F     T  +T ++D+I   G Y+M +SIGTPP+E+LAIADTGSDLIWT
Sbjct: 56   ALKRSVNRVSHFDPAIITP-NTAQADIISALGEYVMNISIGTPPVEILAIADTGSDLIWT 114

Query: 445  QCKPCQECYEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCGNTSQDCQYLYSYGDESH 624
            QCKPC ECY+Q A  FDP QSSTY+++SC S  C +  R  C +T + C+Y  +YGD S 
Sbjct: 115  QCKPCTECYKQAAPFFDPEQSSTYKDLSCDSRQCTAYERTSC-STEETCEYSATYGDRSF 173

Query: 625  TNGILATETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXX 804
            +NG LA ET T  ST GRP+A+  ++FGCGHN+ GTF                       
Sbjct: 174  SNGNLAVETVTLGSTNGRPVALRNIIFGCGHNDDGTFNENATGIVGLGGGSVSLVTQMGS 233

Query: 805  KIEGKFSYCLVP-MSEKHTSKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVGN 981
             I GKFSYCLVP +S + +SK+NFGS  V+SG GVV+TPLV KDPDT+Y+LTLE ISVG 
Sbjct: 234  SIGGKFSYCLVPFLSSESSSKINFGSNGVVSGTGVVTTPLVAKDPDTFYFLTLESISVGK 293

Query: 982  IRIENNKISLDQEEGNIVIDSGTTLTILDEALYTNLESTVKEAINLDTSPDPEGTFGLCY 1161
             +I  +    D  EGNI+IDSGTTLT L   + + L S V + I  D   DPEG   LCY
Sbjct: 294  KKIHFD----DASEGNIIIDSGTTLTFLPPDIVSKLTSAVSDLIKADPISDPEGVLDLCY 349

Query: 1162 STKSDIKVPEFTFHFTGADLQLNEMNTFIKVSDDLVCLSMVPAESLSIFGNLAQINFQVE 1341
               SD K P+ T HF+GAD+ L+  NTFI+ SD  VC +    E  SI+GNLAQ NF V 
Sbjct: 350  PYSSDFKAPQITVHFSGADVVLSPENTFIRTSDTSVCFTFKGMEGQSIYGNLAQANFLVG 409

Query: 1342 YDLVGKKVSFTPADCIK 1392
            YD   K VSF P DC K
Sbjct: 410  YDTKAKTVSFKPTDCSK 426


>ref|XP_006361597.1| PREDICTED: aspartic proteinase CDR1-like [Solanum tuberosum]
          Length = 428

 Score =  412 bits (1060), Expect = e-112
 Identities = 208/424 (49%), Positives = 274/424 (64%), Gaps = 18/424 (4%)
 Frame = +1

Query: 178  GFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIARTNRFRXXXXTNLDTIRSDVIPND 357
            GF+++LIH +SPLSPFYNPS+T S+R+R A   S +R + F+       +TI+SD+ P  
Sbjct: 8    GFTLDLIHRDSPLSPFYNPSNTQSNRLRNAFHRSFSRASFFKKSSLATTNTIQSDISPIP 67

Query: 358  GSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQECYEQDANLFDPIQSSTYREISCQS 537
            G YLMKLSIGTPP+E++AIADTGSDL WTQC PC+ C++Q + LFD  +SSTY+ + C  
Sbjct: 68   GEYLMKLSIGTPPVEIVAIADTGSDLTWTQCMPCENCFQQSSPLFDSKKSSTYKTVGCNV 127

Query: 538  DYCQSLPRAHC--GNTSQDCQYLYSYGDESHTNGILATETFTFDSTTGRPIAIPTLVFGC 711
            + C SL  + C  GN    C+Y  SYGD+SHT G LA + FTF ST+G  + IP + FGC
Sbjct: 128  EVCTSLEGSSCVKGNV---CEYQMSYGDQSHTIGDLAFDKFTFPSTSGENVVIPNVAFGC 184

Query: 712  GHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFSYCLVP------MSEKHTSKLNF 873
            GH+N GTF                       +I GKFSYCL+P      ++   TS +NF
Sbjct: 185  GHDNGGTFNNYTSGIIGLGGGKVSMINQLDKEINGKFSYCLIPIPFDSSINSNITSHINF 244

Query: 874  GSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVGN--IRIENNKISL-------DQEEG 1026
            G  A++SG  VVSTPL+ K+P TYYYL LEG+SVGN  ++ +++K S        D + G
Sbjct: 245  GISAIVSGPNVVSTPLIKKEPSTYYYLNLEGVSVGNKTLKFKSSKTSPSDNASGGDGQAG 304

Query: 1027 NIVIDSGTTLTILDEALYTNLESTVKEAINLDTSPDPEGTFGLCY-STKSDIKVPEFTFH 1203
            NI+IDSGTTLT+L    Y+NLEST+  +I  +   DP G F LCY S    I  P    H
Sbjct: 305  NIIIDSGTTLTLLPNDFYSNLESTLVNSIRANRKDDPSGNFHLCYESENGTIDAPTIVTH 364

Query: 1204 FTGADLQLNEMNTFIKVSDDLVCLSMVPAESLSIFGNLAQINFQVEYDLVGKKVSFTPAD 1383
            FT ADL+L+  +TF ++   LVCL++VPA+ ++IFGNLAQ NF +EYDLV  K+SF P D
Sbjct: 365  FTNADLELSPSSTFAEIEQGLVCLTIVPADEIAIFGNLAQGNFLIEYDLVANKISFQPTD 424

Query: 1384 CIKY 1395
            C KY
Sbjct: 425  CTKY 428


>ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis vinifera]
          Length = 447

 Score =  412 bits (1059), Expect = e-112
 Identities = 211/436 (48%), Positives = 274/436 (62%), Gaps = 8/436 (1%)
 Frame = +1

Query: 109  LAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIAR 288
            LA +  I FS L +     +  GGFS +LI  +SPLSPFYNPS+T  DR++KA   SI+R
Sbjct: 13   LAVIFFIHFSGLSHTEA--SNKGGFSTDLISRDSPLSPFYNPSETQFDRLQKAFHRSISR 70

Query: 289  TNRFRXXXXTNLDTIRSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQEC 468
             N FR    +  ++I+S VI N+G YLM +S+GTPP+ +  IADTGSDL+W QCKPC  C
Sbjct: 71   ANHFRANGVST-NSIQSPVISNNGEYLMNISLGTPPVSMHGIADTGSDLLWRQCKPCDSC 129

Query: 469  YEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCGNTSQDCQYLYSYGDESHTNGILATE 648
            YEQ   +FDP +S TY+ +SC+   C +L      +    C Y YSYGD SHT+G LA +
Sbjct: 130  YEQIEPIFDPAKSKTYQILSCEGKSCSNLGGQGGCSDDNTCIYSYSYGDGSHTSGDLAVD 189

Query: 649  TFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFSY 828
            T T  STTGRP+++P +VFGCGHNN GTF                        I G+FSY
Sbjct: 190  TLTIGSTTGRPVSVPKVVFGCGHNNGGTFELHGSGLVGLGGGPLSMISQLRPLIGGRFSY 249

Query: 829  CLVPMSE--KHTSKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVGNIRIENNK 1002
            CLVP+      +SK++FGS+ ++SG G VSTPL  + PDT+YYLTLE +SVG+ ++    
Sbjct: 250  CLVPLGNDPSVSSKMHFGSRGIVSGAGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYKG 309

Query: 1003 IS------LDQEEGNIVIDSGTTLTILDEALYTNLESTVKEAINLDTSPDPEGTFGLCYS 1164
             S       D +EGNI+IDSGTTLT+L +  Y  LES V  AI      DP   F LCYS
Sbjct: 310  FSKVGSPLADADEGNIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVRDPNNVFSLCYS 369

Query: 1165 TKSDIKVPEFTFHFTGADLQLNEMNTFIKVSDDLVCLSMVPAESLSIFGNLAQINFQVEY 1344
              S +++P  T HF GADL+L  +NTF++V +DL C +M+P   L+IFGNLAQ+NF V Y
Sbjct: 370  NLSGLRIPTITAHFVGADLELKPLNTFVQVQEDLFCFAMIPVSDLAIFGNLAQMNFLVGY 429

Query: 1345 DLVGKKVSFTPADCIK 1392
            DL  + VSF P DC K
Sbjct: 430  DLKSRTVSFKPTDCTK 445


>ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 445

 Score =  410 bits (1053), Expect = e-111
 Identities = 208/437 (47%), Positives = 276/437 (63%), Gaps = 8/437 (1%)
 Frame = +1

Query: 106  VLAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIA 285
            +LA + +I F+        +A++ GF+ + I  +SP SPFYNPS+T   R++KA R SI 
Sbjct: 12   LLAIIFLIYFAKHSQ---AEAKVDGFTTDFISRDSPRSPFYNPSETKYQRLQKAFRRSIL 68

Query: 286  RTNRFRXXXXTNLDTIRSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQE 465
            R N FR    +  D I+S+VI   GSYLM +S+GTPP+ +L IADTGSDLIW QC PC +
Sbjct: 69   RGNHFRAIRASPND-IQSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDD 127

Query: 466  CYEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCGNTSQDCQYLYSYGDESHTNGILAT 645
            CY+Q   LFDP +S TY+ + C +D+CQ L +         C   YSYGD+S+T   L++
Sbjct: 128  CYKQVEPLFDPKKSKTYKTLGCNNDFCQDLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSS 187

Query: 646  ETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFS 825
            ETFT  ST G P + P L FGCGH+N GTF                       K+ G+FS
Sbjct: 188  ETFTIGSTEGDPASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFS 247

Query: 826  YCLVPMSEKHT--SKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVGNIRI--- 990
            YCLVP+S   T  SK+NFG  AV+SG G VSTPL+   PDT+YYLTLEG+S+G+ ++   
Sbjct: 248  YCLVPLSSDSTASSKINFGKSAVVSGSGTVSTPLIKGTPDTFYYLTLEGMSLGSEKVAFK 307

Query: 991  ---ENNKISLDQEEGNIVIDSGTTLTILDEALYTNLESTVKEAINLDTSPDPEGTFGLCY 1161
               +N       EE NI+IDSGTTLT+L    YT++ES + + I   T+ DP GTF LCY
Sbjct: 308  GFSKNKSSPAAAEESNIIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCY 367

Query: 1162 STKSDIKVPEFTFHFTGADLQLNEMNTFIKVSDDLVCLSMVPAESLSIFGNLAQINFQVE 1341
            S    +++P  T HF GAD+QL  +NTF++  +DLVC SM+P+ +L+IFGNL+Q+NF V 
Sbjct: 368  SGVKKLEIPTITAHFIGADVQLPPLNTFVQAQEDLVCFSMIPSSNLAIFGNLSQMNFLVG 427

Query: 1342 YDLVGKKVSFTPADCIK 1392
            YDL   KVSF P DC K
Sbjct: 428  YDLKNNKVSFKPTDCTK 444


>ref|XP_004289322.1| PREDICTED: probable aspartic protease At2g35615-like [Fragaria vesca
            subsp. vesca]
          Length = 430

 Score =  409 bits (1050), Expect = e-111
 Identities = 211/436 (48%), Positives = 285/436 (65%), Gaps = 9/436 (2%)
 Frame = +1

Query: 106  VLAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIA 285
            ++A  +++ FS        +A  GGF+++LI  +S LSP+Y+ S T+ DR+  A R SI+
Sbjct: 6    IVACFILLSFS-------AEASYGGFTVDLIQRDSLLSPWYDSSTTHFDRLHNAFRRSIS 58

Query: 286  RTNRFRXXXXTNLDTIRSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQE 465
            R  RF      + +TI+S ++P+ G YLM +SIGTPP+EVL IADTGSDLIWTQCKPC++
Sbjct: 59   RAQRF---IKPSTNTIQSKIVPSGGEYLMNISIGTPPVEVLGIADTGSDLIWTQCKPCKQ 115

Query: 466  CYEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCGNTSQD-CQYLYSYGDESHTNGILA 642
            C+ Q+  LFDP +SSTYR + CQS+ C +L  A CG    D C Y Y YGD S T G LA
Sbjct: 116  CFNQNPPLFDPKRSSTYRTVPCQSNSCSNLEEASCGADRGDTCVYSYRYGDRSFTRGSLA 175

Query: 643  TETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKF 822
             ETFT  S +G+P+++  ++FGCGH N GTF                          GKF
Sbjct: 176  QETFTIGSASGQPVSLLKIIFGCGHENGGTFDESGSGLIGLGGGPLSFISQLNG---GKF 232

Query: 823  SYCLVPMSEKHT--SKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVG----NI 984
            SYCLVP S K +  SK++FG+ A++SG+G VSTPLV K PDT+YYLTLE ISVG    + 
Sbjct: 233  SYCLVPTSAKSSIASKISFGTAAIVSGKGAVSTPLVSKQPDTFYYLTLEAISVGEKRQSY 292

Query: 985  RIENNKISLDQEEGNIVIDSGTTLTILDEALYTNLESTVKEAINLDTSPDPEGTFGLCYS 1164
            +   +  ++   EGNI+IDSGTTLT+L    Y  + S ++ AIN++   DP+G   LC+ 
Sbjct: 293  KTSQSTKAVAASEGNIIIDSGTTLTLLPPGFYDEVISALEVAINVERVSDPKGVLSLCFR 352

Query: 1165 TKS--DIKVPEFTFHFTGADLQLNEMNTFIKVSDDLVCLSMVPAESLSIFGNLAQINFQV 1338
            +KS  DI VP  T HF+GAD++LN +NTF +V DD+VC +M+ +E ++IFGNLAQ+NF V
Sbjct: 353  SKSDHDIDVPVITMHFSGADVKLNALNTFARVEDDMVCFTMIQSEDVAIFGNLAQMNFLV 412

Query: 1339 EYDLVGKKVSFTPADC 1386
             YDL  + VSF PADC
Sbjct: 413  GYDLEERTVSFKPADC 428


>ref|XP_002304395.1| hypothetical protein POPTR_0003s10440g [Populus trichocarpa]
            gi|222841827|gb|EEE79374.1| hypothetical protein
            POPTR_0003s10440g [Populus trichocarpa]
          Length = 443

 Score =  406 bits (1043), Expect = e-110
 Identities = 208/444 (46%), Positives = 281/444 (63%), Gaps = 7/444 (1%)
 Frame = +1

Query: 85   MATTTPIVLAFVVVII-FSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIR 261
            MATT+   +  V+  I  S  P L    +   GFS+ LIH +SPLSP YNP+ T  DR+R
Sbjct: 1    MATTSFSFVTIVICFISLSPFPLLGAAASPDPGFSLNLIHRDSPLSPLYNPNHTDFDRLR 60

Query: 262  KAARHSIARTNRFRXXXXTNLDTIRSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIW 441
             A   SI+R N F+     ++++ ++D++PN G Y MK+SIGTP +EV+ IADTGSDL W
Sbjct: 61   NAFSRSISRVNVFKTKA-VDINSFQNDLVPNGGEYFMKMSIGTPLVEVIVIADTGSDLTW 119

Query: 442  TQCKPCQECYEQDANLFDPIQSSTYREISCQSDYCQSLPRAH--CGNTSQDCQYLYSYGD 615
             QC PC  CY Q + LFDP +SS+YR + C S +C +L  +   C   +  C+Y YSYGD
Sbjct: 120  VQCLPCDPCYRQKSPLFDPSRSSSYRHMLCGSRFCNALDVSEQACTMDTNICEYHYSYGD 179

Query: 616  ESHTNGILATETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXX 795
            +S+TNG LATE FT  ST+ RP+ +  +VFGCG  N GTF                    
Sbjct: 180  KSYTNGNLATEKFTIGSTSSRPVHLSPIVFGCGTGNGGTFDELGSGIVGLGGGALSLVSQ 239

Query: 796  XXXKIEGKFSYCLVPMSEKH--TSKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGI 969
                I+GKFSYCLVP+SE+   TSK+ FG+ +VISG  VVSTPLV K PDTYYY+TLE I
Sbjct: 240  LSSIIKGKFSYCLVPLSEQSNVTSKIKFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAI 299

Query: 970  SVGNIRIE--NNKISLDQEEGNIVIDSGTTLTILDEALYTNLESTVKEAINLDTSPDPEG 1143
            SVGN R+   N  ++ + E+GN++IDSGTTLT LD   +T LE  ++E +  +   DP G
Sbjct: 300  SVGNKRLPYTNGLLNGNVEKGNVIIDSGTTLTFLDSEFFTELERVLEETVKAERVSDPRG 359

Query: 1144 TFGLCYSTKSDIKVPEFTFHFTGADLQLNEMNTFIKVSDDLVCLSMVPAESLSIFGNLAQ 1323
             F +C+ +  DI +P    HF  AD++L  +NTF+K  +DL+C +M+ +  + IFGNLAQ
Sbjct: 360  LFSVCFRSAGDIDLPVIAVHFNDADVKLQPLNTFVKADEDLLCFTMISSNQIGIFGNLAQ 419

Query: 1324 INFQVEYDLVGKKVSFTPADCIKY 1395
            ++F V YDL  + VSF P DC K+
Sbjct: 420  MDFLVGYDLEKRTVSFKPTDCTKH 443


>ref|XP_007210699.1| hypothetical protein PRUPE_ppa025167mg [Prunus persica]
            gi|462406434|gb|EMJ11898.1| hypothetical protein
            PRUPE_ppa025167mg [Prunus persica]
          Length = 457

 Score =  405 bits (1041), Expect = e-110
 Identities = 216/459 (47%), Positives = 283/459 (61%), Gaps = 20/459 (4%)
 Frame = +1

Query: 79   ATMATTTPIVLAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRI 258
            A   T+T +     ++  F LL      +A   GF+ +LIH +SPLSP YN S ++ DR+
Sbjct: 4    AAAPTSTKLYFPLALLACFILL-----AQASSHGFTADLIHRDSPLSPLYNSSMSHLDRL 58

Query: 259  RKAARHSIARTNRFRXXXXTNLDT------IRSDVIPNDGSYLMKLSIGTPPLEVLAIAD 420
              A R S+ R + F     T+L +      I+S +IP+ G YLM +SIGTPP+EVL IAD
Sbjct: 59   HNAFRRSVTRVHHFIKPTMTSLSSSLAAPNIQSIIIPSAGEYLMNVSIGTPPVEVLGIAD 118

Query: 421  TGSDLIWTQCKPCQECYEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCG---NTSQD- 588
            TGSDLIWTQCKPC++C+ Q+  LFDP +SSTY  I CQS  C  L  A CG   N   D 
Sbjct: 119  TGSDLIWTQCKPCKQCFNQNPPLFDPKKSSTYHSIPCQSSSCTYLEEAACGTLINGDHDT 178

Query: 589  CQYLYSYGDESHTNGILATETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXX 768
            C+Y Y YGD S T G LA ET TF ST+GRP ++P +VFGCGH N GTF           
Sbjct: 179  CEYSYRYGDRSFTRGTLALETLTFGSTSGRPTSLPKVVFGCGHENGGTFDESGSGLIGLG 238

Query: 769  XXXXXXXXXXXXKIE-GKFSYCLVPMSEKHTSKLNFGSQAVISGQGVVSTPLVPKDPDTY 945
                            GKFSYCL+P +    SK++FGS  ++SG G VSTPLV K+PDT+
Sbjct: 239  GGPLSLISQLTKLTNGGKFSYCLLPTANTAASKISFGSAGIVSGSGAVSTPLVAKNPDTF 298

Query: 946  YYLTLEGISVGNIRI-------ENNKISLDQEEGNIVIDSGTTLTILDEALYTNLESTVK 1104
            YYLTLE ISVG  R+       +  K ++   EGNI+IDSGTTLT+L    + +L S ++
Sbjct: 299  YYLTLEAISVGEKRLAYKTKSPDCEKAAVAANEGNIIIDSGTTLTLLPPGFHDDLVSALE 358

Query: 1105 EAINLDTSPDPEGTFGLCYSTKS-DIKVPEFTFHFT-GADLQLNEMNTFIKVSDDLVCLS 1278
             AIN +   DP G   LC+ +KS DI VP  T HF+ GAD++L  +NTF ++ DD++C +
Sbjct: 359  TAINAERVSDPRGILSLCFKSKSDDIGVPVITVHFSGGADVKLQALNTFARMDDDMICFT 418

Query: 1279 MVPAESLSIFGNLAQINFQVEYDLVGKKVSFTPADCIKY 1395
            M+P+  ++IFGNLAQ+NF V YDL  + VSF P DC K+
Sbjct: 419  MIPSSDVAIFGNLAQMNFLVGYDLEERSVSFKPTDCTKH 457


>ref|XP_002320947.1| aspartyl protease family protein [Populus trichocarpa]
            gi|222861720|gb|EEE99262.1| aspartyl protease family
            protein [Populus trichocarpa]
          Length = 440

 Score =  404 bits (1037), Expect = e-110
 Identities = 207/434 (47%), Positives = 274/434 (63%), Gaps = 6/434 (1%)
 Frame = +1

Query: 109  LAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIAR 288
            L+F + I    +     + A+  GF+++LIH +SPLSPFYN  +T   RI  A R SI+R
Sbjct: 8    LSFALAIALLCVSGFGCIYARKVGFTVDLIHRDSPLSPFYNSEETDLQRINNALRRSISR 67

Query: 289  TNRFRXXXXTNLD--TIRSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQ 462
             + F      ++      SDV  N G YLM LS+GTPP +++ IADTGSDLIWTQCKPC+
Sbjct: 68   VHHFDPIAAASVSPKAAESDVTSNRGEYLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCE 127

Query: 463  ECYEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCGNTSQDCQYLYSYGDESHTNGILA 642
             CY+Q   LFDP  S TYR+ SC +  C  L ++ C      CQY YSYGD S+T G +A
Sbjct: 128  RCYKQVDPLFDPKSSKTYRDFSCDARQCSLLDQSTCSGNI--CQYQYSYGDRSYTMGNVA 185

Query: 643  TETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKF 822
            ++T T DSTTG P++ P  V GCGH N GTF                        + GKF
Sbjct: 186  SDTITLDSTTGSPVSFPKTVIGCGHENDGTFSDKGSGIVGLGAGPLSLISQMGSSVGGKF 245

Query: 823  SYCLVPMSEK--HTSKLNFGSQAVISGQGVVSTPLVPKDP-DTYYYLTLEGISVGNIRIE 993
            SYCLVP+S +  ++SKLNFGS AV+SG GV STPL+  +   ++Y+LTLE +SVGN RI+
Sbjct: 246  SYCLVPLSSRAGNSSKLNFGSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIK 305

Query: 994  NNKISLDQEEGNIVIDSGTTLTILDEALYTNLESTVKEAINLDTSPDPEGTFGLCYSTKS 1173
                SL   EGNI+IDSGTTLTI+ +  ++NL + V   +    + DP G   +CYS  S
Sbjct: 306  FGDSSLGTGEGNIIIDSGTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSATS 365

Query: 1174 DIKVPEFTFHFTGADLQLNEMNTFIKVSDDLVCLSMVPAES-LSIFGNLAQINFQVEYDL 1350
            D+KVP  T HFTGAD++L  +NTF++VSDD+VCL+     S +SI+GN+AQ+NF VEY++
Sbjct: 366  DLKVPAITAHFTGADVKLKPINTFVQVSDDVVCLAFASTTSGISIYGNVAQMNFLVEYNI 425

Query: 1351 VGKKVSFTPADCIK 1392
             GK +SF P DC K
Sbjct: 426  QGKSLSFKPTDCTK 439


>ref|XP_007029843.1| Eukaryotic aspartyl protease family protein, putative [Theobroma
            cacao] gi|508718448|gb|EOY10345.1| Eukaryotic aspartyl
            protease family protein, putative [Theobroma cacao]
          Length = 435

 Score =  401 bits (1030), Expect = e-109
 Identities = 202/441 (45%), Positives = 276/441 (62%), Gaps = 1/441 (0%)
 Frame = +1

Query: 73   MCATMATTTPIVLAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSD 252
            M AT  TT+   + F ++++        +++AQ GGFS+ELIH +SP SP YNP +T S+
Sbjct: 1    MAATANTTSMFFIGFAILVLSCFC----LIEAQKGGFSVELIHRDSPKSPLYNPLETASN 56

Query: 253  RIRKAARHSIARTNRFRXXXXTNLDTIRSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSD 432
            R+  A R S  R  RF+    +    + +D+I + G YLM +SIGTP  +++AIADTGSD
Sbjct: 57   RVANALRRSFNRAQRFKPSSIST-KAVDADLIADSGEYLMNVSIGTPAFDIVAIADTGSD 115

Query: 433  LIWTQCKPCQECYEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCGNTSQDCQYLYSYG 612
            LIWTQCKPC +C+ QDA LFDP +SST+R  SC +  C++L  + C +++  C+Y  +YG
Sbjct: 116  LIWTQCKPCSQCFRQDAPLFDPSKSSTFRTFSCSASQCENLEGSSC-SSNNTCRYSVTYG 174

Query: 613  DESHTNGILATETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXX 792
            D S +NG +A +T T  STTGRP+A    + GCGHNN GTF                   
Sbjct: 175  DNSFSNGDVAADTLTLPSTTGRPVAFRNTIIGCGHNNDGTFDENTSGIIGLGGGDVSLIS 234

Query: 793  XXXXKIEGKFSYCLVPMSEK-HTSKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGI 969
                 I GKFSYCL+P+S+   ++K+NFG+ A++SG GVVSTPL  K P T+Y+LTLE +
Sbjct: 235  QLGTSIAGKFSYCLLPLSDAGESNKMNFGTDAIVSGAGVVSTPLTKKFPSTFYFLTLEAV 294

Query: 970  SVGNIRIENNKISLDQEEGNIVIDSGTTLTILDEALYTNLESTVKEAINLDTSPDPEGTF 1149
            SVG+ RI+    SL  ++GNI+IDSGTTLT+L E  Y+ LES V   I       P+G  
Sbjct: 295  SVGSKRIKFTGSSLGTDDGNIIIDSGTTLTLLPEDFYSELESAVASQIKARRVDGPQG-L 353

Query: 1150 GLCYSTKSDIKVPEFTFHFTGADLQLNEMNTFIKVSDDLVCLSMVPAESLSIFGNLAQIN 1329
             LCY   +D  VP  T HFT AD++L  +NTF+ VSD + C +    +  +I+GNLAQ+N
Sbjct: 354  SLCYDATTDFAVPNITIHFTNADVKLAPLNTFVLVSDTVSCFTFSSLQGFAIYGNLAQMN 413

Query: 1330 FQVEYDLVGKKVSFTPADCIK 1392
            F V YD   + VSF P DC K
Sbjct: 414  FLVGYDTEKQTVSFKPTDCSK 434


>ref|XP_004244685.1| PREDICTED: aspartic proteinase CDR1-like [Solanum lycopersicum]
          Length = 448

 Score =  399 bits (1026), Expect = e-108
 Identities = 204/417 (48%), Positives = 274/417 (65%), Gaps = 11/417 (2%)
 Frame = +1

Query: 178  GFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIARTNRFRXXXXTNLDTIRSDVIPND 357
            GF++ LIH +SPLSP YN S T S+R+  A   S +R + F+       +TIRSD+ P  
Sbjct: 35   GFTLHLIHRDSPLSPLYNSSITQSNRLINAFHRSFSRASFFKKSSFVTPNTIRSDISPIP 94

Query: 358  GSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQECYEQDANLFDPIQSSTYREISCQS 537
            G Y+MKLSIGTPP+E++AIADTGSDL WTQC+PC  C+EQ + LFD  +SS+Y+   C +
Sbjct: 95   GEYIMKLSIGTPPVEIVAIADTGSDLTWTQCEPCLNCFEQSSPLFDSKKSSSYKTAGCDT 154

Query: 538  DYCQSLPRAHC--GNTSQDCQYLYSYGDESHTNGILATETFTFDST-TGRPIAIPTLVFG 708
              C S+  + C  GN    C+Y  SYGD+S+T G LA + FTF ST +   +AIP + FG
Sbjct: 155  KECTSIGSSSCVKGNV---CEYQMSYGDQSYTIGDLAFDIFTFPSTNSSENVAIPNVAFG 211

Query: 709  CGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFSYCLVPMS-----EKHTSKLNF 873
            CGH+N GTF                       +I GKFSYCL+ ++        TS +NF
Sbjct: 212  CGHHNGGTFNNHTSGIIGLGGGNVSIINQLDKEINGKFSYCLISIALGSPISNVTSHINF 271

Query: 874  GSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVGN--IRIENNKISLDQEEGNIVIDSG 1047
            GS A +SG  VVSTPL+ K+P T+YYL LEG+SVGN  ++ +++K+S   EEGNI+IDSG
Sbjct: 272  GSSASVSGPDVVSTPLIKKEPSTFYYLNLEGVSVGNRTLKFKSSKVSSGGEEGNIIIDSG 331

Query: 1048 TTLTILDEALYTNLESTVKEAINLDTSPDPEGTFGLCYSTKS-DIKVPEFTFHFTGADLQ 1224
            TTLT+L    Y++LEST+ ++I+     DP GTF LCY +K+  I  P  T HFT ADL+
Sbjct: 332  TTLTLLPNEFYSSLESTLVDSISATRKEDPSGTFRLCYESKNGTIDAPTITTHFTNADLE 391

Query: 1225 LNEMNTFIKVSDDLVCLSMVPAESLSIFGNLAQINFQVEYDLVGKKVSFTPADCIKY 1395
            L+  +TF ++ + LVCL++VPA+ ++IFGNLAQ NF + YDLV  K+SF PADC KY
Sbjct: 392  LSPSSTFAQIEEGLVCLTIVPADEIAIFGNLAQGNFLIGYDLVANKISFKPADCTKY 448


>ref|XP_006285435.1| hypothetical protein CARUB_v10006851mg [Capsella rubella]
            gi|482554140|gb|EOA18333.1| hypothetical protein
            CARUB_v10006851mg [Capsella rubella]
          Length = 436

 Score =  399 bits (1025), Expect = e-108
 Identities = 196/408 (48%), Positives = 259/408 (63%), Gaps = 3/408 (0%)
 Frame = +1

Query: 178  GFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIARTNRFRXXXXTNLDTIRSDVIPND 357
            GF+ +LIH +SP SPF+NP++T S R+R +   S+ R   F     T+ ++ + ++  N 
Sbjct: 30   GFTADLIHRDSPKSPFFNPTETPSQRLRNSINRSVNRA--FHFTEDTSANSPQVEITSNG 87

Query: 358  GSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQECYEQDANLFDPIQSSTYREISCQS 537
            G YLM +S+GTPP  ++AIADTGSDL+WTQCKPC +CY QD  LFDP  SSTY+++SC S
Sbjct: 88   GEYLMNVSLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQDDPLFDPKASSTYKDVSCSS 147

Query: 538  DYCQSLP-RAHCGNTSQDCQYLYSYGDESHTNGILATETFTFDSTTGRPIAIPTLVFGCG 714
              C +L   A C      C Y  SYGD S+T G +A +T T  ST  RP+ I  ++ GCG
Sbjct: 148  SQCNALEDHASCSVDDTTCSYSMSYGDHSYTRGNIAADTLTLGSTNNRPVQIKNVLIGCG 207

Query: 715  HNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFSYCLVPMSEK--HTSKLNFGSQAV 888
            HNN+GTF                        I+GKFSYCLVP++ +   TSKLNFG+ A 
Sbjct: 208  HNNSGTFNEKGSGIIGLGGGAASLITQLGDSIDGKFSYCLVPLTSETDRTSKLNFGTNAE 267

Query: 889  ISGQGVVSTPLVPKDPDTYYYLTLEGISVGNIRIENNKISLDQEEGNIVIDSGTTLTILD 1068
            +SG GVVSTPL+ K P+T+YYLTLE ISVG+ +I          EGNI+IDSGTTLT+L 
Sbjct: 268  VSGTGVVSTPLISKSPETFYYLTLESISVGSKKIPFPVSESGTTEGNIIIDSGTTLTLLP 327

Query: 1069 EALYTNLESTVKEAINLDTSPDPEGTFGLCYSTKSDIKVPEFTFHFTGADLQLNEMNTFI 1248
               Y+ LE  V  AI  +   DP+    LCYS   D+KVP  T HF GAD++L+  N+F+
Sbjct: 328  AEFYSELEDAVASAITAERKEDPKKVLSLCYSATEDLKVPIITMHFDGADVKLDSSNSFV 387

Query: 1249 KVSDDLVCLSMVPAESLSIFGNLAQINFQVEYDLVGKKVSFTPADCIK 1392
            ++S +LVC +   + SL+I+GNL+Q+NF V YD V KKVSF P DC K
Sbjct: 388  QISQELVCFAFSGSPSLAIYGNLSQMNFLVGYDTVSKKVSFKPTDCAK 435


>ref|XP_007022588.1| Eukaryotic aspartyl protease family protein, putative [Theobroma
            cacao] gi|508722216|gb|EOY14113.1| Eukaryotic aspartyl
            protease family protein, putative [Theobroma cacao]
          Length = 429

 Score =  398 bits (1022), Expect = e-108
 Identities = 207/409 (50%), Positives = 269/409 (65%), Gaps = 4/409 (0%)
 Frame = +1

Query: 178  GFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIARTNRFRXXXXTNLDTIRSDVIPND 357
            GFS+ELIH +SP+SPF+N S T S+ +RK A HS+ R    +          +S VIPN 
Sbjct: 31   GFSVELIHRDSPVSPFFNDSITSSELLRKNALHSMDRIKNIQFYIDQK--ATQSVVIPNG 88

Query: 358  GSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPC--QECYEQDANLFDPIQSSTYREISC 531
            G+YLMKLS GTPP+E +AIADTGSDL W QC PC   +CY Q ++ FDP  SSTYR++SC
Sbjct: 89   GTYLMKLSFGTPPVEYVAIADTGSDLTWIQCAPCPQSQCYSQGSSPFDPAASSTYRKLSC 148

Query: 532  QSDYCQSLPRAHCGNTSQDCQYLYSYGDESHTNGILATETFTFDSTTGRPIAIPTLVFGC 711
             S+ CQ+LPR  C NT++ C+Y YSYGD+S+T GIL+++T +FDS++    + PT +FGC
Sbjct: 149  VSEACQALPRKSCLNTNE-CEYFYSYGDKSYTIGILSSDTLSFDSSSSPKTSFPTSIFGC 207

Query: 712  GHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFSYCLVPMSEKHTSKLNFGSQAVI 891
            GHNN G F                       +I+ +FSYCLVP S   + KL FG +A+I
Sbjct: 208  GHNNQGNFRRPGAGLVGLGGGPLSLISQIGTQIDHRFSYCLVPRSATSSGKLVFGQEAII 267

Query: 892  SGQGVVSTPLVPKDPDTYYYLTLEGISVGNIRIENNKISLDQEEGNIVIDSGTTLTILDE 1071
            S  G VSTPL+ K P T+YYL LEGIS+G     +        +GNI+IDSGTTLTIL+ 
Sbjct: 268  SRPGAVSTPLITKTPATFYYLNLEGISIG-----DKTAQAASSQGNIIIDSGTTLTILES 322

Query: 1072 ALYTNLESTVKEAINLDTSPDPEGTFGLCYSTKSDIKVPEFTFHFTGADLQLNEMNTFIK 1251
              Y ++E+ VK AI  +   DP GTF LCY  +++ K+P+  FHFTGADL+L  +NTF  
Sbjct: 323  NFYNSVETMVKGAIGAEPEQDPSGTFTLCY--RAETKIPDMVFHFTGADLRLQPVNTF-G 379

Query: 1252 VSDDLVCLSMVPA--ESLSIFGNLAQINFQVEYDLVGKKVSFTPADCIK 1392
            V+D L+C+ +VP+   S SIFGN AQINFQVEYDL  + VSF P DC K
Sbjct: 380  VNDGLLCMLIVPSNTNSNSIFGNYAQINFQVEYDLQKRTVSFAPTDCTK 428


>ref|XP_006393532.1| hypothetical protein EUTSA_v10012077mg, partial [Eutrema salsugineum]
            gi|557090110|gb|ESQ30818.1| hypothetical protein
            EUTSA_v10012077mg, partial [Eutrema salsugineum]
          Length = 452

 Score =  397 bits (1020), Expect = e-108
 Identities = 196/410 (47%), Positives = 264/410 (64%), Gaps = 5/410 (1%)
 Frame = +1

Query: 178  GFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIARTNRFRXXXXTNLDTIRSDVIPND 357
            GF+ +LIH +SP SPFY P++T S R+R A R S+ R   F      ++D+ ++++  N 
Sbjct: 43   GFTTDLIHRDSPKSPFYKPTETSSQRLRNAIRRSVNRVVHFSSKD-ASVDSPQTEITSNR 101

Query: 358  GSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQECYEQDANLFDPIQSSTYREISCQS 537
            G YLM +S+GTPP  ++AIADTGSDLIWTQCKPC +CY Q+  LFDP  SSTY+  SC S
Sbjct: 102  GEYLMNISLGTPPFPIMAIADTGSDLIWTQCKPCDDCYTQNDPLFDPKASSTYKYFSCSS 161

Query: 538  DYCQSLP-RAHCGNTSQDCQYLYSYGDESHTNGILATETFTFDSTTGRPIAIPTLVFGCG 714
              C +L  +A C      C Y  SYGD S+TNG +A +T T  ST  RP+ +  ++ GCG
Sbjct: 162  SQCSALGNQASCSTEDNTCPYSISYGDHSYTNGNVAADTLTLGSTNKRPVQLKNVIIGCG 221

Query: 715  HNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFSYCLVPMSEKH--TSKLNFGSQAV 888
            HNN GTF                        I+GKFSYCL+P+S ++  TSK+NFG+ AV
Sbjct: 222  HNNNGTFNKEGSGIVGLGGGPVSLISQLGESIDGKFSYCLIPLSSENDKTSKINFGTSAV 281

Query: 889  ISGQGVVSTPLVPKDPDTYYYLTLEGISVG--NIRIENNKISLDQEEGNIVIDSGTTLTI 1062
            +SG G VSTPL+ K  +T+YYLTLE ISVG  NI+   +     + EGNI+IDSGTTLT+
Sbjct: 282  VSGTGAVSTPLITKSRETFYYLTLESISVGSKNIKFPVSDPGSGEGEGNIIIDSGTTLTM 341

Query: 1063 LDEALYTNLESTVKEAINLDTSPDPEGTFGLCYSTKSDIKVPEFTFHFTGADLQLNEMNT 1242
            L    Y+ LE  V  +I+ +   DPE    LCYS  +++KVP  T HF GAD++L+  N+
Sbjct: 342  LPTTFYSELEDAVASSIDAERQNDPESPLSLCYSATANLKVPVITMHFDGADVKLDSSNS 401

Query: 1243 FIKVSDDLVCLSMVPAESLSIFGNLAQINFQVEYDLVGKKVSFTPADCIK 1392
            F+++S++LVC +   +E L+I+GNL+Q+NF V YD V K VSF PADC K
Sbjct: 402  FVQLSEELVCFAFRGSEDLAIYGNLSQMNFLVGYDTVSKTVSFKPADCAK 451


>ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
            communis] gi|223543249|gb|EEF44781.1| Aspartic proteinase
            nepenthesin-2 precursor, putative [Ricinus communis]
          Length = 449

 Score =  397 bits (1019), Expect = e-108
 Identities = 206/449 (45%), Positives = 280/449 (62%), Gaps = 13/449 (2%)
 Frame = +1

Query: 85   MATTTPIVLAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRK 264
            MA  + I ++  +  I S++    +V+A+  GFS  LIH +S +SP YNP DTY DR+R 
Sbjct: 1    MAAVSSIYVSLFIAFI-SMVSAFSLVEARNAGFSANLIHRDSSVSPLYNPRDTYFDRLRN 59

Query: 265  AARHSIARTNRFRXXXXTNLDTIRSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWT 444
            +   SI+R NRF+    +    ++SD++P  G YLM++SIG P +E+LAIADTGSDLIW 
Sbjct: 60   SFHRSISRANRFKPNSISARALVQSDIVPGGGEYLMRISIGNPQVEILAIADTGSDLIWV 119

Query: 445  QCKPCQECYEQDANLFDPIQSSTYREISCQSDYCQSLP----RAHCGNTSQDCQYLYSYG 612
            QC+PC+ CY+Q++ +FDP +SS+YR + C +++C  L             + C Y YSYG
Sbjct: 120  QCQPCEMCYKQNSPIFDPRRSSSYRNVLCGNEFCNKLDGEARSCDARGFVKTCGYTYSYG 179

Query: 613  DESHTNGILATETFTFDSTTGRPIA----IPTLVFGCGHNNAGTFXXXXXXXXXXXXXXX 780
            D+S ++G LA E F   ST     A       + FGCG  N GTF               
Sbjct: 180  DQSFSDGHLAIERFGIGSTNSNTSAAIAYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSM 239

Query: 781  XXXXXXXXKIEGKFSYCLVPMSEK--HTSKLNFGSQAVISGQ--GVVSTPLVPKDPDTYY 948
                    K+ GKFSYCLVP SE+  +TSK+NFG+   ISG    VVSTPL+PK P+TYY
Sbjct: 240  SLVSQLGPKLSGKFSYCLVPTSEQSNYTSKINFGNDINISGSNYNVVSTPLLPKKPETYY 299

Query: 949  YLTLEGISVGNIRIE-NNKISLDQEEGNIVIDSGTTLTILDEALYTNLESTVKEAINLDT 1125
            YLTLE ISV N R+   N  + + E+GNI+IDSGTTLT LD   + NL+S V+EA+  + 
Sbjct: 300  YLTLEAISVENKRLPYTNLWNGEVEKGNIIIDSGTTLTFLDSEFFNNLDSAVEEAVKGER 359

Query: 1126 SPDPEGTFGLCYSTKSDIKVPEFTFHFTGADLQLNEMNTFIKVSDDLVCLSMVPAESLSI 1305
              DP G F +C+  +  I++P  T HFTGAD++L  +NTF KV +DL+C +M+P+  ++I
Sbjct: 360  VSDPHGLFNICFKDEKAIELPIITAHFTGADVELQPVNTFAKVEEDLLCFTMIPSNDIAI 419

Query: 1306 FGNLAQINFQVEYDLVGKKVSFTPADCIK 1392
            FGNLAQ+NF V YDL  K VSF P DC K
Sbjct: 420  FGNLAQMNFLVGYDLEKKAVSFLPTDCTK 448


>ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score =  394 bits (1012), Expect = e-107
 Identities = 198/435 (45%), Positives = 272/435 (62%), Gaps = 4/435 (0%)
 Frame = +1

Query: 103  IVLAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSI 282
            + + F VV++  L   L V  A+ GGFS++LIH +SP SPF++PS T ++R+  A R S+
Sbjct: 6    VKIFFNVVVVGFLFQLLEVALARGGGFSVDLIHRDSPHSPFFDPSKTQAERLTDAFRRSV 65

Query: 283  ARTNRFRXXXXTNLDTIRSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQ 462
            +R  RFR    T+ D I+S ++P+ G YLM L IGTPP+ V+AI DTGSDL WTQC+PC 
Sbjct: 66   SRVGRFRPTAMTS-DGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCT 124

Query: 463  ECYEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCGNTSQDCQYLYSYGDESHTNGILA 642
             CY+Q   LFDP  SSTYR+ SC + +C +L +    +  + C + YSY D S T G LA
Sbjct: 125  HCYKQVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLA 184

Query: 643  TETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKF 822
            +ET T DST G+P++ P   FGCGH++ G F                        I G F
Sbjct: 185  SETLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLF 244

Query: 823  SYCLVPMS--EKHTSKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVGNIRIEN 996
            SYCL+P+S     +S++NFG+   +SG G VSTPLV K PDT+YYLTLEGISVG  R+  
Sbjct: 245  SYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPY 304

Query: 997  NKIS--LDQEEGNIVIDSGTTLTILDEALYTNLESTVKEAINLDTSPDPEGTFGLCYSTK 1170
               S   + EEGNI++DSGTT T L +  Y+ LE +V  +I      DP G F LCY+T 
Sbjct: 305  KGYSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTT 364

Query: 1171 SDIKVPEFTFHFTGADLQLNEMNTFIKVSDDLVCLSMVPAESLSIFGNLAQINFQVEYDL 1350
            ++I  P  T HF  A+++L  +NTF+++ +DLVC ++ P   + + GNLAQ+NF V +DL
Sbjct: 365  AEINAPIITAHFKDANVELQPLNTFMRMQEDLVCFTVAPTSDIGVLGNLAQVNFLVGFDL 424

Query: 1351 VGKKVSFTPADCIKY 1395
              K+VSF  ADC ++
Sbjct: 425  RKKRVSFKAADCTQH 439


>ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297316239|gb|EFH46662.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score =  393 bits (1010), Expect = e-106
 Identities = 193/410 (47%), Positives = 256/410 (62%), Gaps = 5/410 (1%)
 Frame = +1

Query: 178  GFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIARTNRFRXXXXTNL--DTIRSDVIP 351
            GF+ +LIH +SP SPFYNP++T S R+R A   S++R   F      +   +  + D+  
Sbjct: 30   GFTADLIHRDSPKSPFYNPTETSSQRLRNAIHRSVSRVFHFTDISQKDASDNAPQIDLTS 89

Query: 352  NDGSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQECYEQDANLFDPIQSSTYREISC 531
            N G YLM +S+GTPP  ++AIADTGSDL+WTQCKPC +CY Q   LFDP  SSTY+++SC
Sbjct: 90   NSGEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLFDPKASSTYKDVSC 149

Query: 532  QSDYCQSLP-RAHCGNTSQDCQYLYSYGDESHTNGILATETFTFDSTTGRPIAIPTLVFG 708
             S  C +L  +A C      C Y  SYGD S+T G +A +T T  ST  RP+ +  ++ G
Sbjct: 150  SSSQCTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNIIIG 209

Query: 709  CGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFSYCLVPMSEKH--TSKLNFGSQ 882
            CGHNNAGTF                        I+GKFSYCLVP++ ++  TSK+NFG+ 
Sbjct: 210  CGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINFGTN 269

Query: 883  AVISGQGVVSTPLVPKDPDTYYYLTLEGISVGNIRIENNKISLDQEEGNIVIDSGTTLTI 1062
            AV+SG GVVSTPL+ K  +T+YYLTL+ ISVG+  ++         EGNI+IDSGTTLT+
Sbjct: 270  AVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSDSGSGEGNIIIDSGTTLTL 329

Query: 1063 LDEALYTNLESTVKEAINLDTSPDPEGTFGLCYSTKSDIKVPEFTFHFTGADLQLNEMNT 1242
            L    Y+ LE  V  +I+ +   DP+    LCYS   D+KVP  T HF GAD+ L   N 
Sbjct: 330  LPTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSATGDLKVPAITMHFDGADVNLKPSNC 389

Query: 1243 FIKVSDDLVCLSMVPAESLSIFGNLAQINFQVEYDLVGKKVSFTPADCIK 1392
            F+++S+DLVC +   + S SI+GN+AQ+NF V YD V K VSF P DC K
Sbjct: 390  FVQISEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAK 439


>gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score =  393 bits (1010), Expect = e-106
 Identities = 196/409 (47%), Positives = 256/409 (62%), Gaps = 4/409 (0%)
 Frame = +1

Query: 178  GFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIARTNRFRXXXXTNLDTIRSDVIPND 357
            GF+ +LIH +SP SPFYNP +T S R+R A   S+ R   F     T    I  D+  N 
Sbjct: 30   GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEKDNTPQPQI--DLTSNS 87

Query: 358  GSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQECYEQDANLFDPIQSSTYREISCQS 537
            G YLM +SIGTPP  ++AIADTGSDL+WTQC PC +CY Q   LFDP  SSTY+++SC S
Sbjct: 88   GEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSS 147

Query: 538  DYCQSLP-RAHCGNTSQDCQYLYSYGDESHTNGILATETFTFDSTTGRPIAIPTLVFGCG 714
              C +L  +A C      C Y  SYGD S+T G +A +T T  S+  RP+ +  ++ GCG
Sbjct: 148  SQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCG 207

Query: 715  HNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFSYCLVPMSEK--HTSKLNFGSQAV 888
            HNNAGTF                        I+GKFSYCLVP++ K   TSK+NFG+ A+
Sbjct: 208  HNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAI 267

Query: 889  ISGQGVVSTPLVPK-DPDTYYYLTLEGISVGNIRIENNKISLDQEEGNIVIDSGTTLTIL 1065
            +SG GVVSTPL+ K   +T+YYLTL+ ISVG+ +I+ +    +  EGNI+IDSGTTLT+L
Sbjct: 268  VSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLL 327

Query: 1066 DEALYTNLESTVKEAINLDTSPDPEGTFGLCYSTKSDIKVPEFTFHFTGADLQLNEMNTF 1245
                Y+ LE  V  +I+ +   DP+    LCYS   D+KVP  T HF GAD++L+  N F
Sbjct: 328  PTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPVITMHFDGADVKLDSSNAF 387

Query: 1246 IKVSDDLVCLSMVPAESLSIFGNLAQINFQVEYDLVGKKVSFTPADCIK 1392
            ++VS+DLVC +   + S SI+GN+AQ+NF V YD V K VSF P DC K
Sbjct: 388  VQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAK 436

