BLASTX nr result

ID: Akebia23_contig00000494 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00000494
         (1615 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor,...   396   e-107
ref|XP_006437358.1| hypothetical protein CICLE_v10033646mg [Citr...   382   e-103
ref|XP_006484733.1| PREDICTED: aspartic proteinase CDR1-like [Ci...   380   e-103
ref|XP_004289322.1| PREDICTED: probable aspartic protease At2g35...   377   e-101
ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35...   377   e-101
ref|XP_006361597.1| PREDICTED: aspartic proteinase CDR1-like [So...   375   e-101
ref|XP_007210699.1| hypothetical protein PRUPE_ppa025167mg [Prun...   374   e-101
ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35...   374   e-101
ref|XP_002304395.1| hypothetical protein POPTR_0003s10440g [Popu...   372   e-100
ref|XP_002320947.1| aspartyl protease family protein [Populus tr...   372   e-100
ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35...   371   e-100
ref|XP_006393532.1| hypothetical protein EUTSA_v10012077mg, part...   367   1e-98
ref|XP_006285435.1| hypothetical protein CARUB_v10006851mg [Caps...   367   1e-98
ref|XP_007029843.1| Eukaryotic aspartyl protease family protein,...   364   7e-98
ref|XP_006385613.1| aspartyl protease family protein [Populus tr...   363   9e-98
ref|XP_006403054.1| hypothetical protein EUTSA_v10003479mg [Eutr...   362   2e-97
ref|XP_004244685.1| PREDICTED: aspartic proteinase CDR1-like [So...   362   2e-97
ref|XP_007022588.1| Eukaryotic aspartyl protease family protein,...   362   3e-97
ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp....   361   6e-97
ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35...   360   1e-96

>ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223536720|gb|EEF38361.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 439

 Score =  396 bits (1017), Expect = e-107
 Identities = 208/437 (47%), Positives = 270/437 (61%), Gaps = 23/437 (5%)
 Frame = +1

Query: 73   ATPIVLAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRKAAR 252
            A  + L  +V +IFS    L  + A   GF++ELI+ +SP SPFYNP +T + RI  A R
Sbjct: 2    AASVSLLAIVTLIFS--GTLVPIDAAKDGFTVELINRDSPKSPFYNPRETPTQRIVSAVR 59

Query: 253  HSIARTKLFRSSSSTNL--DTIQSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWTQ 426
             S++R   F  + ++++  DT QS++I N G YLMK S+GTP  ++LAIADTGSDLIWTQ
Sbjct: 60   RSMSRVHHFSPTKNSDIFTDTAQSEMISNQGEYLMKFSLGTPAFDILAIADTGSDLIWTQ 119

Query: 427  CKPCQECYEQDANLFDPIQSSTYREISCQSDYCQSLPR-AHC-GNTSQDCQYLYSYGDES 600
            CKPC +CYEQDA LFDP  SSTYR+ISC +  C  L   A C G  ++ C Y YSYGD S
Sbjct: 120  CKPCDQCYEQDAPLFDPKSSSTYRDISCSTKQCDLLKEGASCSGEGNKTCHYSYSYGDRS 179

Query: 601  HTNGILATETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXXXX 780
             T+G +A +T T  ST+GRP+ +P  + GCGHNN G+F                      
Sbjct: 180  FTSGNVAADTITLGSTSGRPVLLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQLG 239

Query: 781  XKIEGKFSYCLVPMSEKHT--SKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISV 954
              I+GKFSYCLVP+S   T  SKLNFGS  ++SG GV STPL+ KDPDT+Y+LTLE +SV
Sbjct: 240  STIDGKFSYCLVPLSSNATNSSKLNFGSNGIVSGGGVQSTPLISKDPDTFYFLTLEAVSV 299

Query: 955  GNIRIENNKISLNQEEGNIV-----------------XXXXVKEAINLDTYPDPEGTFGL 1083
            G+ RI+    S    EGNI+                     V++A+      DP G   L
Sbjct: 300  GSERIKFPGSSFGTSEGNIIIDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGILSL 359

Query: 1084 CYSTKSDIKVPKFTFHFTGADLQLNEMNTFIKISDDLVCLSMVPAQSLSIFGNLAQINFQ 1263
            CYS  +D+K P  T HF GAD++LN +NTF+++SD ++C +  P  S +IFGNLAQ+NF 
Sbjct: 360  CYSIDADLKFPSITAHFDGADVKLNPLNTFVQVSDTVLCFAFNPINSGAIFGNLAQMNFL 419

Query: 1264 VEYDLVGKKVSFTPADC 1314
            V YDL GK VSF P DC
Sbjct: 420  VGYDLEGKTVSFKPTDC 436


>ref|XP_006437358.1| hypothetical protein CICLE_v10033646mg [Citrus clementina]
            gi|557539554|gb|ESR50598.1| hypothetical protein
            CICLE_v10033646mg [Citrus clementina]
          Length = 426

 Score =  382 bits (982), Expect = e-103
 Identities = 205/437 (46%), Positives = 266/437 (60%), Gaps = 18/437 (4%)
 Frame = +1

Query: 64   MATATPIVLAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRK 243
            MAT   + ++F+++ + SL     + +A+ GGFS++LI  ++P SPFY+P +TY  R+ K
Sbjct: 1    MATVNALAISFLILCLSSL----SITEAK-GGFSLDLIRRDAPKSPFYSPDETYHQRVTK 55

Query: 244  AARHSIARTKLFRSSSSTNLDTIQSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWT 423
            A + S+ R   F  +  T  +T Q+D+I   G Y+M +SIGTPP+E+LAIADTGSDLIWT
Sbjct: 56   ALKRSVNRVSHFDPAIITP-NTAQADIISALGEYVMNISIGTPPVEILAIADTGSDLIWT 114

Query: 424  QCKPCQECYEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCGNTSQDCQYLYSYGDESH 603
            QCKPC ECY+Q A  FDP QSSTY+++SC S  C +  R  C +T + C+Y  +YGD S 
Sbjct: 115  QCKPCTECYKQAAPFFDPEQSSTYKDLSCDSRQCTAYERTSC-STEEICEYSATYGDRSF 173

Query: 604  TNGILATETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXX 783
            +NG LA ET T  ST GRP+A+  L+FGCGHN+ GTF                       
Sbjct: 174  SNGNLAVETVTLGSTNGRPVALRNLIFGCGHNDDGTFNENATGIVGLGGGSVSLVTQMGS 233

Query: 784  KIEGKFSYCLVP-MSEKHTSKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVGN 960
             I GKFSYCLVP +S + +SK+NFGS  V+SG GVV+TPLV KDPDT+Y+LTLE ISVG 
Sbjct: 234  SIGGKFSYCLVPFLSSESSSKINFGSNGVVSGTGVVTTPLVAKDPDTFYFLTLESISVGK 293

Query: 961  IRIENNKISLNQEEGNIV-----------------XXXXVKEAINLDTYPDPEGTFGLCY 1089
             +I  +  S    EGNI+                     V + I  D   DPEG   LCY
Sbjct: 294  KKIHFDDAS----EGNIIIDSGTTLTFLPPDIVSKLTSAVSDLIKADPISDPEGVLDLCY 349

Query: 1090 STKSDIKVPKFTFHFTGADLQLNEMNTFIKISDDLVCLSMVPAQSLSIFGNLAQINFQVE 1269
               SD K P+ T HF+GAD+ L+  NTFI+ SD  VC +    +  SI+GNLAQ NF V 
Sbjct: 350  PYSSDFKAPQITVHFSGADVVLSPENTFIRTSDTTVCFTFKGMEGQSIYGNLAQANFLVG 409

Query: 1270 YDLVGKKVSFTPADCIK 1320
            YD   K VSF P DC K
Sbjct: 410  YDTKAKTVSFKPTDCSK 426


>ref|XP_006484733.1| PREDICTED: aspartic proteinase CDR1-like [Citrus sinensis]
          Length = 426

 Score =  380 bits (977), Expect = e-103
 Identities = 204/437 (46%), Positives = 265/437 (60%), Gaps = 18/437 (4%)
 Frame = +1

Query: 64   MATATPIVLAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRK 243
            MAT     ++F+++ + SL     + +A+ GGFS++LI  ++P SPFY+P +TY  R+ K
Sbjct: 1    MATVNASAISFLILCLSSL----SITEAK-GGFSLDLIRRDAPKSPFYSPDETYHQRVTK 55

Query: 244  AARHSIARTKLFRSSSSTNLDTIQSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWT 423
            A + S+ R   F  +  T  +T Q+D+I   G Y+M +SIGTPP+E+LAIADTGSDLIWT
Sbjct: 56   ALKRSVNRVSHFDPAIITP-NTAQADIISALGEYVMNISIGTPPVEILAIADTGSDLIWT 114

Query: 424  QCKPCQECYEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCGNTSQDCQYLYSYGDESH 603
            QCKPC ECY+Q A  FDP QSSTY+++SC S  C +  R  C +T + C+Y  +YGD S 
Sbjct: 115  QCKPCTECYKQAAPFFDPEQSSTYKDLSCDSRQCTAYERTSC-STEETCEYSATYGDRSF 173

Query: 604  TNGILATETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXX 783
            +NG LA ET T  ST GRP+A+  ++FGCGHN+ GTF                       
Sbjct: 174  SNGNLAVETVTLGSTNGRPVALRNIIFGCGHNDDGTFNENATGIVGLGGGSVSLVTQMGS 233

Query: 784  KIEGKFSYCLVP-MSEKHTSKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVGN 960
             I GKFSYCLVP +S + +SK+NFGS  V+SG GVV+TPLV KDPDT+Y+LTLE ISVG 
Sbjct: 234  SIGGKFSYCLVPFLSSESSSKINFGSNGVVSGTGVVTTPLVAKDPDTFYFLTLESISVGK 293

Query: 961  IRIENNKISLNQEEGNIV-----------------XXXXVKEAINLDTYPDPEGTFGLCY 1089
             +I  +  S    EGNI+                     V + I  D   DPEG   LCY
Sbjct: 294  KKIHFDDAS----EGNIIIDSGTTLTFLPPDIVSKLTSAVSDLIKADPISDPEGVLDLCY 349

Query: 1090 STKSDIKVPKFTFHFTGADLQLNEMNTFIKISDDLVCLSMVPAQSLSIFGNLAQINFQVE 1269
               SD K P+ T HF+GAD+ L+  NTFI+ SD  VC +    +  SI+GNLAQ NF V 
Sbjct: 350  PYSSDFKAPQITVHFSGADVVLSPENTFIRTSDTSVCFTFKGMEGQSIYGNLAQANFLVG 409

Query: 1270 YDLVGKKVSFTPADCIK 1320
            YD   K VSF P DC K
Sbjct: 410  YDTKAKTVSFKPTDCSK 426


>ref|XP_004289322.1| PREDICTED: probable aspartic protease At2g35615-like [Fragaria vesca
            subsp. vesca]
          Length = 430

 Score =  377 bits (967), Expect = e-101
 Identities = 200/440 (45%), Positives = 274/440 (62%), Gaps = 26/440 (5%)
 Frame = +1

Query: 73   ATPIVLAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRKAAR 252
            A   ++A  +++ FS        +A  GGF+++LI  +S LSP+Y+ S T+ DR+  A R
Sbjct: 2    ALAAIVACFILLSFS-------AEASYGGFTVDLIQRDSLLSPWYDSSTTHFDRLHNAFR 54

Query: 253  HSIARTKLFRSSSSTNLDTIQSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWTQCK 432
             SI+R + F   S+   +TIQS ++P+ G YLM +SIGTPP+EVL IADTGSDLIWTQCK
Sbjct: 55   RSISRAQRFIKPST---NTIQSKIVPSGGEYLMNISIGTPPVEVLGIADTGSDLIWTQCK 111

Query: 433  PCQECYEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCGNTSQD-CQYLYSYGDESHTN 609
            PC++C+ Q+  LFDP +SSTYR + CQS+ C +L  A CG    D C Y Y YGD S T 
Sbjct: 112  PCKQCFNQNPPLFDPKRSSTYRTVPCQSNSCSNLEEASCGADRGDTCVYSYRYGDRSFTR 171

Query: 610  GILATETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKI 789
            G LA ETFT  S +G+P+++  ++FGCGH N GTF                         
Sbjct: 172  GSLAQETFTIGSASGQPVSLLKIIFGCGHENGGTFDESGSGLIGLGGGPLSFISQLNG-- 229

Query: 790  EGKFSYCLVPMSEKHT--SKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVG-- 957
             GKFSYCLVP S K +  SK++FG+ A++SG+G VSTPLV K PDT+YYLTLE ISVG  
Sbjct: 230  -GKFSYCLVPTSAKSSIASKISFGTAAIVSGKGAVSTPLVSKQPDTFYYLTLEAISVGEK 288

Query: 958  --NIRIENNKISLNQEEGNIV-----------------XXXXVKEAINLDTYPDPEGTFG 1080
              + +   +  ++   EGNI+                     ++ AIN++   DP+G   
Sbjct: 289  RQSYKTSQSTKAVAASEGNIIIDSGTTLTLLPPGFYDEVISALEVAINVERVSDPKGVLS 348

Query: 1081 LCYSTKS--DIKVPKFTFHFTGADLQLNEMNTFIKISDDLVCLSMVPAQSLSIFGNLAQI 1254
            LC+ +KS  DI VP  T HF+GAD++LN +NTF ++ DD+VC +M+ ++ ++IFGNLAQ+
Sbjct: 349  LCFRSKSDHDIDVPVITMHFSGADVKLNALNTFARVEDDMVCFTMIQSEDVAIFGNLAQM 408

Query: 1255 NFQVEYDLVGKKVSFTPADC 1314
            NF V YDL  + VSF PADC
Sbjct: 409  NFLVGYDLEERTVSFKPADC 428


>ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis vinifera]
          Length = 444

 Score =  377 bits (967), Expect = e-101
 Identities = 198/434 (45%), Positives = 262/434 (60%), Gaps = 25/434 (5%)
 Frame = +1

Query: 88   LAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIAR 267
            LA +++I FS   +    +A+I GF+ + I  +SP SPFYNPS+T   R++KA R SI R
Sbjct: 13   LAIIILIHFSEHSH---AEAKIDGFTTDFISRDSPHSPFYNPSETKYQRLQKAFRRSILR 69

Query: 268  TKLFRSSSSTNLDTIQSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQEC 447
               FR+  ++  D IQSDVI   G+YLM +S+GTPP+ +L IADTGSDLIW QC PC  C
Sbjct: 70   GNHFRAMRASPND-IQSDVISGGGAYLMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNC 128

Query: 448  YEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCGNTSQDCQYLYSYGDESHTNGILATE 627
            YEQ   LFDP +S TY+ + C +++CQ L +    +    C Y YSYGD S+T G L+++
Sbjct: 129  YEQVEPLFDPKESETYKTLDCDNEFCQDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSD 188

Query: 628  TFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFSY 807
            T T  ST G P + P + FGCGH+N GTF                       ++ G+FSY
Sbjct: 189  TLTIGSTEGDPASFPGIAFGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVGGQFSY 248

Query: 808  CLVPMSEKHT--SKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVGNIRI---- 969
            CLVP+S   T  SK+NFG   V+SG G VSTPL+   PDT+YYLTLEG+SVG+  +    
Sbjct: 249  CLVPLSSDSTVSSKINFGKSGVVSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKG 308

Query: 970  --ENNKISLNQEEGNIV-----------------XXXXVKEAINLDTYPDPEGTFGLCYS 1092
              EN       EEGNI+                     +  AI   T  DP G F LCYS
Sbjct: 309  FSENKSSPAAVEEGNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSLCYS 368

Query: 1093 TKSDIKVPKFTFHFTGADLQLNEMNTFIKISDDLVCLSMVPAQSLSIFGNLAQINFQVEY 1272
            + +++++P  T HFTGAD+QL  +NTF+++ +DLVC SM+P+ +L+IFGNLAQINF V Y
Sbjct: 369  SVNNLEIPTITAHFTGADVQLPPLNTFVQVQEDLVCFSMIPSSNLAIFGNLAQINFLVGY 428

Query: 1273 DLVGKKVSFTPADC 1314
            DL   KVSF   DC
Sbjct: 429  DLKNNKVSFKQTDC 442


>ref|XP_006361597.1| PREDICTED: aspartic proteinase CDR1-like [Solanum tuberosum]
          Length = 428

 Score =  375 bits (964), Expect = e-101
 Identities = 196/424 (46%), Positives = 257/424 (60%), Gaps = 35/424 (8%)
 Frame = +1

Query: 157  GFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIARTKLFRSSSSTNLDTIQSDVIPND 336
            GF+++LIH +SPLSPFYNPS+T S+R+R A   S +R   F+ SS    +TIQSD+ P  
Sbjct: 8    GFTLDLIHRDSPLSPFYNPSNTQSNRLRNAFHRSFSRASFFKKSSLATTNTIQSDISPIP 67

Query: 337  GSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQECYEQDANLFDPIQSSTYREISCQS 516
            G YLMKLSIGTPP+E++AIADTGSDL WTQC PC+ C++Q + LFD  +SSTY+ + C  
Sbjct: 68   GEYLMKLSIGTPPVEIVAIADTGSDLTWTQCMPCENCFQQSSPLFDSKKSSTYKTVGCNV 127

Query: 517  DYCQSLPRAHC--GNTSQDCQYLYSYGDESHTNGILATETFTFDSTTGRPIAIPTLVFGC 690
            + C SL  + C  GN    C+Y  SYGD+SHT G LA + FTF ST+G  + IP + FGC
Sbjct: 128  EVCTSLEGSSCVKGNV---CEYQMSYGDQSHTIGDLAFDKFTFPSTSGENVVIPNVAFGC 184

Query: 691  GHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFSYCLVP------MSEKHTSKLNF 852
            GH+N GTF                       +I GKFSYCL+P      ++   TS +NF
Sbjct: 185  GHDNGGTFNNYTSGIIGLGGGKVSMINQLDKEINGKFSYCLIPIPFDSSINSNITSHINF 244

Query: 853  GSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVGN--IRIENNKISLNQ-------EEG 1005
            G  A++SG  VVSTPL+ K+P TYYYL LEG+SVGN  ++ +++K S +        + G
Sbjct: 245  GISAIVSGPNVVSTPLIKKEPSTYYYLNLEGVSVGNKTLKFKSSKTSPSDNASGGDGQAG 304

Query: 1006 NIV-----------------XXXXVKEAINLDTYPDPEGTFGLCY-STKSDIKVPKFTFH 1131
            NI+                     +  +I  +   DP G F LCY S    I  P    H
Sbjct: 305  NIIIDSGTTLTLLPNDFYSNLESTLVNSIRANRKDDPSGNFHLCYESENGTIDAPTIVTH 364

Query: 1132 FTGADLQLNEMNTFIKISDDLVCLSMVPAQSLSIFGNLAQINFQVEYDLVGKKVSFTPAD 1311
            FT ADL+L+  +TF +I   LVCL++VPA  ++IFGNLAQ NF +EYDLV  K+SF P D
Sbjct: 365  FTNADLELSPSSTFAEIEQGLVCLTIVPADEIAIFGNLAQGNFLIEYDLVANKISFQPTD 424

Query: 1312 CIKY 1323
            C KY
Sbjct: 425  CTKY 428


>ref|XP_007210699.1| hypothetical protein PRUPE_ppa025167mg [Prunus persica]
            gi|462406434|gb|EMJ11898.1| hypothetical protein
            PRUPE_ppa025167mg [Prunus persica]
          Length = 457

 Score =  374 bits (961), Expect = e-101
 Identities = 206/459 (44%), Positives = 269/459 (58%), Gaps = 37/459 (8%)
 Frame = +1

Query: 58   ATMATATPIVLAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRI 237
            A   T+T +     ++  F LL      +A   GF+ +LIH +SPLSP YN S ++ DR+
Sbjct: 4    AAAPTSTKLYFPLALLACFILL-----AQASSHGFTADLIHRDSPLSPLYNSSMSHLDRL 58

Query: 238  RKAARHSIARTKLFRSSSSTNLDT------IQSDVIPNDGSYLMKLSIGTPPLEVLAIAD 399
              A R S+ R   F   + T+L +      IQS +IP+ G YLM +SIGTPP+EVL IAD
Sbjct: 59   HNAFRRSVTRVHHFIKPTMTSLSSSLAAPNIQSIIIPSAGEYLMNVSIGTPPVEVLGIAD 118

Query: 400  TGSDLIWTQCKPCQECYEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCG---NTSQD- 567
            TGSDLIWTQCKPC++C+ Q+  LFDP +SSTY  I CQS  C  L  A CG   N   D 
Sbjct: 119  TGSDLIWTQCKPCKQCFNQNPPLFDPKKSSTYHSIPCQSSSCTYLEEAACGTLINGDHDT 178

Query: 568  CQYLYSYGDESHTNGILATETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXX 747
            C+Y Y YGD S T G LA ET TF ST+GRP ++P +VFGCGH N GTF           
Sbjct: 179  CEYSYRYGDRSFTRGTLALETLTFGSTSGRPTSLPKVVFGCGHENGGTFDESGSGLIGLG 238

Query: 748  XXXXXXXXXXXXKIE-GKFSYCLVPMSEKHTSKLNFGSQAVISGQGVVSTPLVPKDPDTY 924
                            GKFSYCL+P +    SK++FGS  ++SG G VSTPLV K+PDT+
Sbjct: 239  GGPLSLISQLTKLTNGGKFSYCLLPTANTAASKISFGSAGIVSGSGAVSTPLVAKNPDTF 298

Query: 925  YYLTLEGISVGNIRI-------ENNKISLNQEEGNIV-----------------XXXXVK 1032
            YYLTLE ISVG  R+       +  K ++   EGNI+                     ++
Sbjct: 299  YYLTLEAISVGEKRLAYKTKSPDCEKAAVAANEGNIIIDSGTTLTLLPPGFHDDLVSALE 358

Query: 1033 EAINLDTYPDPEGTFGLCYSTKS-DIKVPKFTFHFT-GADLQLNEMNTFIKISDDLVCLS 1206
             AIN +   DP G   LC+ +KS DI VP  T HF+ GAD++L  +NTF ++ DD++C +
Sbjct: 359  TAINAERVSDPRGILSLCFKSKSDDIGVPVITVHFSGGADVKLQALNTFARMDDDMICFT 418

Query: 1207 MVPAQSLSIFGNLAQINFQVEYDLVGKKVSFTPADCIKY 1323
            M+P+  ++IFGNLAQ+NF V YDL  + VSF P DC K+
Sbjct: 419  MIPSSDVAIFGNLAQMNFLVGYDLEERSVSFKPTDCTKH 457


>ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis vinifera]
          Length = 447

 Score =  374 bits (961), Expect = e-101
 Identities = 199/437 (45%), Positives = 261/437 (59%), Gaps = 26/437 (5%)
 Frame = +1

Query: 88   LAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIAR 267
            LA +  I FS L +     +  GGFS +LI  +SPLSPFYNPS+T  DR++KA   SI+R
Sbjct: 13   LAVIFFIHFSGLSHTEA--SNKGGFSTDLISRDSPLSPFYNPSETQFDRLQKAFHRSISR 70

Query: 268  TKLFRSSS-STNLDTIQSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQE 444
               FR++  STN  +IQS VI N+G YLM +S+GTPP+ +  IADTGSDL+W QCKPC  
Sbjct: 71   ANHFRANGVSTN--SIQSPVISNNGEYLMNISLGTPPVSMHGIADTGSDLLWRQCKPCDS 128

Query: 445  CYEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCGNTSQDCQYLYSYGDESHTNGILAT 624
            CYEQ   +FDP +S TY+ +SC+   C +L      +    C Y YSYGD SHT+G LA 
Sbjct: 129  CYEQIEPIFDPAKSKTYQILSCEGKSCSNLGGQGGCSDDNTCIYSYSYGDGSHTSGDLAV 188

Query: 625  ETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFS 804
            +T T  STTGRP+++P +VFGCGHNN GTF                        I G+FS
Sbjct: 189  DTLTIGSTTGRPVSVPKVVFGCGHNNGGTFELHGSGLVGLGGGPLSMISQLRPLIGGRFS 248

Query: 805  YCLVPMSE--KHTSKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVGNIRIENN 978
            YCLVP+      +SK++FGS+ ++SG G VSTPL  + PDT+YYLTLE +SVG+ ++   
Sbjct: 249  YCLVPLGNDPSVSSKMHFGSRGIVSGAGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYK 308

Query: 979  KIS------LNQEEGNIV-----------------XXXXVKEAINLDTYPDPEGTFGLCY 1089
              S       + +EGNI+                     V  AI      DP   F LCY
Sbjct: 309  GFSKVGSPLADADEGNIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVRDPNNVFSLCY 368

Query: 1090 STKSDIKVPKFTFHFTGADLQLNEMNTFIKISDDLVCLSMVPAQSLSIFGNLAQINFQVE 1269
            S  S +++P  T HF GADL+L  +NTF+++ +DL C +M+P   L+IFGNLAQ+NF V 
Sbjct: 369  SNLSGLRIPTITAHFVGADLELKPLNTFVQVQEDLFCFAMIPVSDLAIFGNLAQMNFLVG 428

Query: 1270 YDLVGKKVSFTPADCIK 1320
            YDL  + VSF P DC K
Sbjct: 429  YDLKSRTVSFKPTDCTK 445


>ref|XP_002304395.1| hypothetical protein POPTR_0003s10440g [Populus trichocarpa]
            gi|222841827|gb|EEE79374.1| hypothetical protein
            POPTR_0003s10440g [Populus trichocarpa]
          Length = 443

 Score =  372 bits (956), Expect = e-100
 Identities = 195/447 (43%), Positives = 269/447 (60%), Gaps = 23/447 (5%)
 Frame = +1

Query: 52   MCATMATATPIVLAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSD 231
            M  T  +   IV+ F+ +  F   P L    +   GFS+ LIH +SPLSP YNP+ T  D
Sbjct: 1    MATTSFSFVTIVICFISLSPF---PLLGAAASPDPGFSLNLIHRDSPLSPLYNPNHTDFD 57

Query: 232  RIRKAARHSIARTKLFRSSSSTNLDTIQSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSD 411
            R+R A   SI+R  +F++ +  ++++ Q+D++PN G Y MK+SIGTP +EV+ IADTGSD
Sbjct: 58   RLRNAFSRSISRVNVFKTKA-VDINSFQNDLVPNGGEYFMKMSIGTPLVEVIVIADTGSD 116

Query: 412  LIWTQCKPCQECYEQDANLFDPIQSSTYREISCQSDYCQSLPRAH--CGNTSQDCQYLYS 585
            L W QC PC  CY Q + LFDP +SS+YR + C S +C +L  +   C   +  C+Y YS
Sbjct: 117  LTWVQCLPCDPCYRQKSPLFDPSRSSSYRHMLCGSRFCNALDVSEQACTMDTNICEYHYS 176

Query: 586  YGDESHTNGILATETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXX 765
            YGD+S+TNG LATE FT  ST+ RP+ +  +VFGCG  N GTF                 
Sbjct: 177  YGDKSYTNGNLATEKFTIGSTSSRPVHLSPIVFGCGTGNGGTFDELGSGIVGLGGGALSL 236

Query: 766  XXXXXXKIEGKFSYCLVPMSEKH--TSKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTL 939
                   I+GKFSYCLVP+SE+   TSK+ FG+ +VISG  VVSTPLV K PDTYYY+TL
Sbjct: 237  VSQLSSIIKGKFSYCLVPLSEQSNVTSKIKFGTDSVISGPQVVSTPLVSKQPDTYYYVTL 296

Query: 940  EGISVGNIRI--ENNKISLNQEEGNIV-----------------XXXXVKEAINLDTYPD 1062
            E ISVGN R+   N  ++ N E+GN++                     ++E +  +   D
Sbjct: 297  EAISVGNKRLPYTNGLLNGNVEKGNVIIDSGTTLTFLDSEFFTELERVLEETVKAERVSD 356

Query: 1063 PEGTFGLCYSTKSDIKVPKFTFHFTGADLQLNEMNTFIKISDDLVCLSMVPAQSLSIFGN 1242
            P G F +C+ +  DI +P    HF  AD++L  +NTF+K  +DL+C +M+ +  + IFGN
Sbjct: 357  PRGLFSVCFRSAGDIDLPVIAVHFNDADVKLQPLNTFVKADEDLLCFTMISSNQIGIFGN 416

Query: 1243 LAQINFQVEYDLVGKKVSFTPADCIKY 1323
            LAQ++F V YDL  + VSF P DC K+
Sbjct: 417  LAQMDFLVGYDLEKRTVSFKPTDCTKH 443


>ref|XP_002320947.1| aspartyl protease family protein [Populus trichocarpa]
            gi|222861720|gb|EEE99262.1| aspartyl protease family
            protein [Populus trichocarpa]
          Length = 440

 Score =  372 bits (954), Expect = e-100
 Identities = 199/434 (45%), Positives = 263/434 (60%), Gaps = 23/434 (5%)
 Frame = +1

Query: 88   LAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIAR 267
            L+F + I    +     + A+  GF+++LIH +SPLSPFYN  +T   RI  A R SI+R
Sbjct: 8    LSFALAIALLCVSGFGCIYARKVGFTVDLIHRDSPLSPFYNSEETDLQRINNALRRSISR 67

Query: 268  TKLFR--SSSSTNLDTIQSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQ 441
               F   +++S +    +SDV  N G YLM LS+GTPP +++ IADTGSDLIWTQCKPC+
Sbjct: 68   VHHFDPIAAASVSPKAAESDVTSNRGEYLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCE 127

Query: 442  ECYEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCGNTSQDCQYLYSYGDESHTNGILA 621
             CY+Q   LFDP  S TYR+ SC +  C  L ++ C  +   CQY YSYGD S+T G +A
Sbjct: 128  RCYKQVDPLFDPKSSKTYRDFSCDARQCSLLDQSTC--SGNICQYQYSYGDRSYTMGNVA 185

Query: 622  TETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKF 801
            ++T T DSTTG P++ P  V GCGH N GTF                        + GKF
Sbjct: 186  SDTITLDSTTGSPVSFPKTVIGCGHENDGTFSDKGSGIVGLGAGPLSLISQMGSSVGGKF 245

Query: 802  SYCLVPMSEK--HTSKLNFGSQAVISGQGVVSTPLVPKDP-DTYYYLTLEGISVGNIRIE 972
            SYCLVP+S +  ++SKLNFGS AV+SG GV STPL+  +   ++Y+LTLE +SVGN RI+
Sbjct: 246  SYCLVPLSSRAGNSSKLNFGSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIK 305

Query: 973  NNKISLNQEEGNIVXXXXVKEAI-------NLDT----------YPDPEGTFGLCYSTKS 1101
                SL   EGNI+        I       NL T            DP G   +CYS  S
Sbjct: 306  FGDSSLGTGEGNIIIDSGTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSATS 365

Query: 1102 DIKVPKFTFHFTGADLQLNEMNTFIKISDDLVCLSMVPAQS-LSIFGNLAQINFQVEYDL 1278
            D+KVP  T HFTGAD++L  +NTF+++SDD+VCL+     S +SI+GN+AQ+NF VEY++
Sbjct: 366  DLKVPAITAHFTGADVKLKPINTFVQVSDDVVCLAFASTTSGISIYGNVAQMNFLVEYNI 425

Query: 1279 VGKKVSFTPADCIK 1320
             GK +SF P DC K
Sbjct: 426  QGKSLSFKPTDCTK 439


>ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 445

 Score =  371 bits (952), Expect = e-100
 Identities = 196/437 (44%), Positives = 261/437 (59%), Gaps = 25/437 (5%)
 Frame = +1

Query: 85   VLAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIA 264
            +LA + +I F+        +A++ GF+ + I  +SP SPFYNPS+T   R++KA R SI 
Sbjct: 12   LLAIIFLIYFAKHSQ---AEAKVDGFTTDFISRDSPRSPFYNPSETKYQRLQKAFRRSIL 68

Query: 265  RTKLFRSSSSTNLDTIQSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQE 444
            R   FR+  ++  D IQS+VI   GSYLM +S+GTPP+ +L IADTGSDLIW QC PC +
Sbjct: 69   RGNHFRAIRASPND-IQSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDD 127

Query: 445  CYEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCGNTSQDCQYLYSYGDESHTNGILAT 624
            CY+Q   LFDP +S TY+ + C +D+CQ L +         C   YSYGD+S+T   L++
Sbjct: 128  CYKQVEPLFDPKKSKTYKTLGCNNDFCQDLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSS 187

Query: 625  ETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFS 804
            ETFT  ST G P + P L FGCGH+N GTF                       K+ G+FS
Sbjct: 188  ETFTIGSTEGDPASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFS 247

Query: 805  YCLVPMSEKHT--SKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVGNIRIENN 978
            YCLVP+S   T  SK+NFG  AV+SG G VSTPL+   PDT+YYLTLEG+S+G+ ++   
Sbjct: 248  YCLVPLSSDSTASSKINFGKSAVVSGSGTVSTPLIKGTPDTFYYLTLEGMSLGSEKVAFK 307

Query: 979  KISLNQ------EEGNIV-----------------XXXXVKEAINLDTYPDPEGTFGLCY 1089
              S N+      EE NI+                     + + I   T  DP GTF LCY
Sbjct: 308  GFSKNKSSPAAAEESNIIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCY 367

Query: 1090 STKSDIKVPKFTFHFTGADLQLNEMNTFIKISDDLVCLSMVPAQSLSIFGNLAQINFQVE 1269
            S    +++P  T HF GAD+QL  +NTF++  +DLVC SM+P+ +L+IFGNL+Q+NF V 
Sbjct: 368  SGVKKLEIPTITAHFIGADVQLPPLNTFVQAQEDLVCFSMIPSSNLAIFGNLSQMNFLVG 427

Query: 1270 YDLVGKKVSFTPADCIK 1320
            YDL   KVSF P DC K
Sbjct: 428  YDLKNNKVSFKPTDCTK 444


>ref|XP_006393532.1| hypothetical protein EUTSA_v10012077mg, partial [Eutrema salsugineum]
            gi|557090110|gb|ESQ30818.1| hypothetical protein
            EUTSA_v10012077mg, partial [Eutrema salsugineum]
          Length = 452

 Score =  367 bits (941), Expect = 1e-98
 Identities = 186/410 (45%), Positives = 252/410 (61%), Gaps = 22/410 (5%)
 Frame = +1

Query: 157  GFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIARTKLFRSSSSTNLDTIQSDVIPND 336
            GF+ +LIH +SP SPFY P++T S R+R A R S+ R   F SS   ++D+ Q+++  N 
Sbjct: 43   GFTTDLIHRDSPKSPFYKPTETSSQRLRNAIRRSVNRVVHF-SSKDASVDSPQTEITSNR 101

Query: 337  GSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQECYEQDANLFDPIQSSTYREISCQS 516
            G YLM +S+GTPP  ++AIADTGSDLIWTQCKPC +CY Q+  LFDP  SSTY+  SC S
Sbjct: 102  GEYLMNISLGTPPFPIMAIADTGSDLIWTQCKPCDDCYTQNDPLFDPKASSTYKYFSCSS 161

Query: 517  DYCQSL-PRAHCGNTSQDCQYLYSYGDESHTNGILATETFTFDSTTGRPIAIPTLVFGCG 693
              C +L  +A C      C Y  SYGD S+TNG +A +T T  ST  RP+ +  ++ GCG
Sbjct: 162  SQCSALGNQASCSTEDNTCPYSISYGDHSYTNGNVAADTLTLGSTNKRPVQLKNVIIGCG 221

Query: 694  HNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFSYCLVPMSEKH--TSKLNFGSQAV 867
            HNN GTF                        I+GKFSYCL+P+S ++  TSK+NFG+ AV
Sbjct: 222  HNNNGTFNKEGSGIVGLGGGPVSLISQLGESIDGKFSYCLIPLSSENDKTSKINFGTSAV 281

Query: 868  ISGQGVVSTPLVPKDPDTYYYLTLEGISVG--NIRIENNKISLNQEEGNIV--------- 1014
            +SG G VSTPL+ K  +T+YYLTLE ISVG  NI+   +     + EGNI+         
Sbjct: 282  VSGTGAVSTPLITKSRETFYYLTLESISVGSKNIKFPVSDPGSGEGEGNIIIDSGTTLTM 341

Query: 1015 --------XXXXVKEAINLDTYPDPEGTFGLCYSTKSDIKVPKFTFHFTGADLQLNEMNT 1170
                        V  +I+ +   DPE    LCYS  +++KVP  T HF GAD++L+  N+
Sbjct: 342  LPTTFYSELEDAVASSIDAERQNDPESPLSLCYSATANLKVPVITMHFDGADVKLDSSNS 401

Query: 1171 FIKISDDLVCLSMVPAQSLSIFGNLAQINFQVEYDLVGKKVSFTPADCIK 1320
            F+++S++LVC +   ++ L+I+GNL+Q+NF V YD V K VSF PADC K
Sbjct: 402  FVQLSEELVCFAFRGSEDLAIYGNLSQMNFLVGYDTVSKTVSFKPADCAK 451


>ref|XP_006285435.1| hypothetical protein CARUB_v10006851mg [Capsella rubella]
            gi|482554140|gb|EOA18333.1| hypothetical protein
            CARUB_v10006851mg [Capsella rubella]
          Length = 436

 Score =  367 bits (941), Expect = 1e-98
 Identities = 186/408 (45%), Positives = 246/408 (60%), Gaps = 20/408 (4%)
 Frame = +1

Query: 157  GFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIARTKLFRSSSSTNLDTIQSDVIPND 336
            GF+ +LIH +SP SPF+NP++T S R+R +   S+ R   F  +  T+ ++ Q ++  N 
Sbjct: 30   GFTADLIHRDSPKSPFFNPTETPSQRLRNSINRSVNRA--FHFTEDTSANSPQVEITSNG 87

Query: 337  GSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQECYEQDANLFDPIQSSTYREISCQS 516
            G YLM +S+GTPP  ++AIADTGSDL+WTQCKPC +CY QD  LFDP  SSTY+++SC S
Sbjct: 88   GEYLMNVSLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQDDPLFDPKASSTYKDVSCSS 147

Query: 517  DYCQSL-PRAHCGNTSQDCQYLYSYGDESHTNGILATETFTFDSTTGRPIAIPTLVFGCG 693
              C +L   A C      C Y  SYGD S+T G +A +T T  ST  RP+ I  ++ GCG
Sbjct: 148  SQCNALEDHASCSVDDTTCSYSMSYGDHSYTRGNIAADTLTLGSTNNRPVQIKNVLIGCG 207

Query: 694  HNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFSYCLVPMSEK--HTSKLNFGSQAV 867
            HNN+GTF                        I+GKFSYCLVP++ +   TSKLNFG+ A 
Sbjct: 208  HNNSGTFNEKGSGIIGLGGGAASLITQLGDSIDGKFSYCLVPLTSETDRTSKLNFGTNAE 267

Query: 868  ISGQGVVSTPLVPKDPDTYYYLTLEGISVGNIRIENNKISLNQEEGNIV----------- 1014
            +SG GVVSTPL+ K P+T+YYLTLE ISVG+ +I          EGNI+           
Sbjct: 268  VSGTGVVSTPLISKSPETFYYLTLESISVGSKKIPFPVSESGTTEGNIIIDSGTTLTLLP 327

Query: 1015 ------XXXXVKEAINLDTYPDPEGTFGLCYSTKSDIKVPKFTFHFTGADLQLNEMNTFI 1176
                      V  AI  +   DP+    LCYS   D+KVP  T HF GAD++L+  N+F+
Sbjct: 328  AEFYSELEDAVASAITAERKEDPKKVLSLCYSATEDLKVPIITMHFDGADVKLDSSNSFV 387

Query: 1177 KISDDLVCLSMVPAQSLSIFGNLAQINFQVEYDLVGKKVSFTPADCIK 1320
            +IS +LVC +   + SL+I+GNL+Q+NF V YD V KKVSF P DC K
Sbjct: 388  QISQELVCFAFSGSPSLAIYGNLSQMNFLVGYDTVSKKVSFKPTDCAK 435


>ref|XP_007029843.1| Eukaryotic aspartyl protease family protein, putative [Theobroma
            cacao] gi|508718448|gb|EOY10345.1| Eukaryotic aspartyl
            protease family protein, putative [Theobroma cacao]
          Length = 435

 Score =  364 bits (934), Expect = 7e-98
 Identities = 188/441 (42%), Positives = 261/441 (59%), Gaps = 18/441 (4%)
 Frame = +1

Query: 52   MCATMATATPIVLAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSD 231
            M AT  T +   + F ++++        +++AQ GGFS+ELIH +SP SP YNP +T S+
Sbjct: 1    MAATANTTSMFFIGFAILVLSCFC----LIEAQKGGFSVELIHRDSPKSPLYNPLETASN 56

Query: 232  RIRKAARHSIARTKLFRSSSSTNLDTIQSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSD 411
            R+  A R S  R + F+ SS +    + +D+I + G YLM +SIGTP  +++AIADTGSD
Sbjct: 57   RVANALRRSFNRAQRFKPSSIST-KAVDADLIADSGEYLMNVSIGTPAFDIVAIADTGSD 115

Query: 412  LIWTQCKPCQECYEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCGNTSQDCQYLYSYG 591
            LIWTQCKPC +C+ QDA LFDP +SST+R  SC +  C++L  + C +++  C+Y  +YG
Sbjct: 116  LIWTQCKPCSQCFRQDAPLFDPSKSSTFRTFSCSASQCENLEGSSC-SSNNTCRYSVTYG 174

Query: 592  DESHTNGILATETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXX 771
            D S +NG +A +T T  STTGRP+A    + GCGHNN GTF                   
Sbjct: 175  DNSFSNGDVAADTLTLPSTTGRPVAFRNTIIGCGHNNDGTFDENTSGIIGLGGGDVSLIS 234

Query: 772  XXXXKIEGKFSYCLVPMSEK-HTSKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGI 948
                 I GKFSYCL+P+S+   ++K+NFG+ A++SG GVVSTPL  K P T+Y+LTLE +
Sbjct: 235  QLGTSIAGKFSYCLLPLSDAGESNKMNFGTDAIVSGAGVVSTPLTKKFPSTFYFLTLEAV 294

Query: 949  SVGNIRIENNKISLNQEEGNIV-----------------XXXXVKEAINLDTYPDPEGTF 1077
            SVG+ RI+    SL  ++GNI+                     V   I       P+G  
Sbjct: 295  SVGSKRIKFTGSSLGTDDGNIIIDSGTTLTLLPEDFYSELESAVASQIKARRVDGPQG-L 353

Query: 1078 GLCYSTKSDIKVPKFTFHFTGADLQLNEMNTFIKISDDLVCLSMVPAQSLSIFGNLAQIN 1257
             LCY   +D  VP  T HFT AD++L  +NTF+ +SD + C +    Q  +I+GNLAQ+N
Sbjct: 354  SLCYDATTDFAVPNITIHFTNADVKLAPLNTFVLVSDTVSCFTFSSLQGFAIYGNLAQMN 413

Query: 1258 FQVEYDLVGKKVSFTPADCIK 1320
            F V YD   + VSF P DC K
Sbjct: 414  FLVGYDTEKQTVSFKPTDCSK 434


>ref|XP_006385613.1| aspartyl protease family protein [Populus trichocarpa]
            gi|550342742|gb|ERP63410.1| aspartyl protease family
            protein [Populus trichocarpa]
          Length = 440

 Score =  363 bits (933), Expect = 9e-98
 Identities = 195/432 (45%), Positives = 251/432 (58%), Gaps = 22/432 (5%)
 Frame = +1

Query: 85   VLAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIA 264
            VL+F   I   +  +   + A   GF+ EL+H +SP SP YN   T+  R  KA R S++
Sbjct: 7    VLSFASAIALCVA-SFGCIYAHNAGFTTELVHRDSPKSPLYNSQQTHLQRWNKAMRRSVS 65

Query: 265  RTKLF-RSSSSTNLDTIQSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQ 441
            R   F R++++ +   ++S++I N G YLM LS+GTPP E+LAIADTGSDLIWTQC PC 
Sbjct: 66   RVHHFQRTAATVSPKEVESEIIANGGEYLMSLSLGTPPFEILAIADTGSDLIWTQCTPCD 125

Query: 442  ECYEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCGNTSQDCQYLYSYGDESHTNGILA 621
            +CY+Q A LFDP  S TYR++SC +  CQ+L  +   ++ Q CQY Y YGD S TNG LA
Sbjct: 126  KCYKQIAPLFDPKSSKTYRDLSCDTRQCQNLGESSSCSSEQLCQYSYYYGDRSFTNGNLA 185

Query: 622  TETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKF 801
             +T T  ST G P+  P  V GCG  N GTF                        I GKF
Sbjct: 186  VDTVTLPSTNGVPVYFPKTVIGCGRRNNGTFDKKDSGIIGLGGGPMSLISQMGSSIGGKF 245

Query: 802  SYCLVPMSEK---HTSKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVGNIRIE 972
            SYCLVP S +   ++SKL+FG  AV+SG GV STPL+ K+PDT+YYLTLE +SVG+ +IE
Sbjct: 246  SYCLVPFSSESAGNSSKLHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIE 305

Query: 973  NNKISLNQEEGNIVXXXXV------------------KEAINLDTYPDPEGTFGLCYSTK 1098
                S    EGNI+                          IN +   D  G    CY   
Sbjct: 306  FGGSSFGGSEGNIIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPT 365

Query: 1099 SDIKVPKFTFHFTGADLQLNEMNTFIKISDDLVCLSMVPAQSLSIFGNLAQINFQVEYDL 1278
             D+KVP  T HF GAD+ L  +NTFI ISDD++CL+    QS +IFGN+AQ+NF + YD+
Sbjct: 366  PDLKVPVITAHFNGADVVLQTLNTFILISDDVLCLAFNSTQSGAIFGNVAQMNFLIGYDI 425

Query: 1279 VGKKVSFTPADC 1314
             GK VSF P DC
Sbjct: 426  QGKSVSFKPTDC 437


>ref|XP_006403054.1| hypothetical protein EUTSA_v10003479mg [Eutrema salsugineum]
            gi|557104161|gb|ESQ44507.1| hypothetical protein
            EUTSA_v10003479mg [Eutrema salsugineum]
          Length = 439

 Score =  362 bits (930), Expect = 2e-97
 Identities = 182/410 (44%), Positives = 250/410 (60%), Gaps = 22/410 (5%)
 Frame = +1

Query: 157  GFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIARTKLFRSSSSTNLDTIQSDVIPND 336
            GF+ +LIH +SP SPFY P++T S R+R A R S+     F SS   ++D+ Q+++  N 
Sbjct: 30   GFTTDLIHRDSPKSPFYKPTETSSQRLRNAIRRSVNHVVHF-SSKDASVDSPQTEITSNR 88

Query: 337  GSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQECYEQDANLFDPIQSSTYREISCQS 516
            G YLM +S+GTPP  ++AIADTGSDL+WTQCKPC +CY Q+  LFDP  SSTY++ SC S
Sbjct: 89   GEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQNDPLFDPKASSTYKDFSCSS 148

Query: 517  DYCQSL-PRAHCGNTSQDCQYLYSYGDESHTNGILATETFTFDSTTGRPIAIPTLVFGCG 693
              C +L  +A C      C Y  SYGD S+TNG +A +T T  ST  RP+ +  ++ GCG
Sbjct: 149  SQCSALGNQASCSTEDNTCSYSMSYGDHSYTNGNVAADTLTLGSTNNRPVQLKNVIIGCG 208

Query: 694  HNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFSYCLVPMSEKH--TSKLNFGSQAV 867
            HNN GTF                        I+GKFSYCL+P+S ++  TS +NFG+ AV
Sbjct: 209  HNNNGTFNKEGSGIVGLGGGPVSLISQLGESIDGKFSYCLIPLSSENGKTSNINFGTSAV 268

Query: 868  ISGQGVVSTPLVPKDPDTYYYLTLEGISVG--NIRIENNKISLNQEEGNIV--------- 1014
            +SG G VSTPL+ K  +T+YYLTL  ISVG  NI+   +     + EGNI+         
Sbjct: 269  VSGTGAVSTPLITKSRETFYYLTLASISVGSKNIKFPVSDPGSGEGEGNIIIDSGTTLTM 328

Query: 1015 --------XXXXVKEAINLDTYPDPEGTFGLCYSTKSDIKVPKFTFHFTGADLQLNEMNT 1170
                        V  +I+ +   DPE    LCYS  +++KVP  T HF GAD++L+  N+
Sbjct: 329  LPTTFYSELEDAVASSIDAERQNDPESPLSLCYSATANLKVPVITMHFDGADVKLDSSNS 388

Query: 1171 FIKISDDLVCLSMVPAQSLSIFGNLAQINFQVEYDLVGKKVSFTPADCIK 1320
            F+++S++LVC +   ++ L+I+GNL+Q+NF V YD V K VSF PADC K
Sbjct: 389  FVQLSEELVCFAFRGSEDLAIYGNLSQMNFLVGYDTVSKTVSFKPADCAK 438


>ref|XP_004244685.1| PREDICTED: aspartic proteinase CDR1-like [Solanum lycopersicum]
          Length = 448

 Score =  362 bits (930), Expect = 2e-97
 Identities = 192/417 (46%), Positives = 257/417 (61%), Gaps = 28/417 (6%)
 Frame = +1

Query: 157  GFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIARTKLFRSSSSTNLDTIQSDVIPND 336
            GF++ LIH +SPLSP YN S T S+R+  A   S +R   F+ SS    +TI+SD+ P  
Sbjct: 35   GFTLHLIHRDSPLSPLYNSSITQSNRLINAFHRSFSRASFFKKSSFVTPNTIRSDISPIP 94

Query: 337  GSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQECYEQDANLFDPIQSSTYREISCQS 516
            G Y+MKLSIGTPP+E++AIADTGSDL WTQC+PC  C+EQ + LFD  +SS+Y+   C +
Sbjct: 95   GEYIMKLSIGTPPVEIVAIADTGSDLTWTQCEPCLNCFEQSSPLFDSKKSSSYKTAGCDT 154

Query: 517  DYCQSLPRAHC--GNTSQDCQYLYSYGDESHTNGILATETFTFDST-TGRPIAIPTLVFG 687
              C S+  + C  GN    C+Y  SYGD+S+T G LA + FTF ST +   +AIP + FG
Sbjct: 155  KECTSIGSSSCVKGNV---CEYQMSYGDQSYTIGDLAFDIFTFPSTNSSENVAIPNVAFG 211

Query: 688  CGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFSYCLVPMS-----EKHTSKLNF 852
            CGH+N GTF                       +I GKFSYCL+ ++        TS +NF
Sbjct: 212  CGHHNGGTFNNHTSGIIGLGGGNVSIINQLDKEINGKFSYCLISIALGSPISNVTSHINF 271

Query: 853  GSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVGN--IRIENNKISLNQEEGNIV---- 1014
            GS A +SG  VVSTPL+ K+P T+YYL LEG+SVGN  ++ +++K+S   EEGNI+    
Sbjct: 272  GSSASVSGPDVVSTPLIKKEPSTFYYLNLEGVSVGNRTLKFKSSKVSSGGEEGNIIIDSG 331

Query: 1015 -------------XXXXVKEAINLDTYPDPEGTFGLCYSTKS-DIKVPKFTFHFTGADLQ 1152
                             + ++I+     DP GTF LCY +K+  I  P  T HFT ADL+
Sbjct: 332  TTLTLLPNEFYSSLESTLVDSISATRKEDPSGTFRLCYESKNGTIDAPTITTHFTNADLE 391

Query: 1153 LNEMNTFIKISDDLVCLSMVPAQSLSIFGNLAQINFQVEYDLVGKKVSFTPADCIKY 1323
            L+  +TF +I + LVCL++VPA  ++IFGNLAQ NF + YDLV  K+SF PADC KY
Sbjct: 392  LSPSSTFAQIEEGLVCLTIVPADEIAIFGNLAQGNFLIGYDLVANKISFKPADCTKY 448


>ref|XP_007022588.1| Eukaryotic aspartyl protease family protein, putative [Theobroma
            cacao] gi|508722216|gb|EOY14113.1| Eukaryotic aspartyl
            protease family protein, putative [Theobroma cacao]
          Length = 429

 Score =  362 bits (929), Expect = 3e-97
 Identities = 199/405 (49%), Positives = 256/405 (63%), Gaps = 17/405 (4%)
 Frame = +1

Query: 157  GFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIARTKLFRSSSSTNLDTIQSDVIPND 336
            GFS+ELIH +SP+SPF+N S T S+ +RK A HS+ R K  +          QS VIPN 
Sbjct: 31   GFSVELIHRDSPVSPFFNDSITSSELLRKNALHSMDRIKNIQFYIDQK--ATQSVVIPNG 88

Query: 337  GSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPC--QECYEQDANLFDPIQSSTYREISC 510
            G+YLMKLS GTPP+E +AIADTGSDL W QC PC   +CY Q ++ FDP  SSTYR++SC
Sbjct: 89   GTYLMKLSFGTPPVEYVAIADTGSDLTWIQCAPCPQSQCYSQGSSPFDPAASSTYRKLSC 148

Query: 511  QSDYCQSLPRAHCGNTSQDCQYLYSYGDESHTNGILATETFTFDSTTGRPIAIPTLVFGC 690
             S+ CQ+LPR  C NT++ C+Y YSYGD+S+T GIL+++T +FDS++    + PT +FGC
Sbjct: 149  VSEACQALPRKSCLNTNE-CEYFYSYGDKSYTIGILSSDTLSFDSSSSPKTSFPTSIFGC 207

Query: 691  GHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFSYCLVPMSEKHTSKLNFGSQAVI 870
            GHNN G F                       +I+ +FSYCLVP S   + KL FG +A+I
Sbjct: 208  GHNNQGNFRRPGAGLVGLGGGPLSLISQIGTQIDHRFSYCLVPRSATSSGKLVFGQEAII 267

Query: 871  SGQGVVSTPLVPKDPDTYYYLTLEGISV-----------GNIRIENNKISLNQEEGNIV- 1014
            S  G VSTPL+ K P T+YYL LEGIS+           GNI I++   +L   E N   
Sbjct: 268  SRPGAVSTPLITKTPATFYYLNLEGISIGDKTAQAASSQGNIIIDSG-TTLTILESNFYN 326

Query: 1015 -XXXXVKEAINLDTYPDPEGTFGLCYSTKSDIKVPKFTFHFTGADLQLNEMNTFIKISDD 1191
                 VK AI  +   DP GTF LCY  +++ K+P   FHFTGADL+L  +NTF  ++D 
Sbjct: 327  SVETMVKGAIGAEPEQDPSGTFTLCY--RAETKIPDMVFHFTGADLRLQPVNTF-GVNDG 383

Query: 1192 LVCLSMVPA--QSLSIFGNLAQINFQVEYDLVGKKVSFTPADCIK 1320
            L+C+ +VP+   S SIFGN AQINFQVEYDL  + VSF P DC K
Sbjct: 384  LLCMLIVPSNTNSNSIFGNYAQINFQVEYDLQKRTVSFAPTDCTK 428


>ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297316239|gb|EFH46662.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score =  361 bits (926), Expect = 6e-97
 Identities = 184/410 (44%), Positives = 243/410 (59%), Gaps = 22/410 (5%)
 Frame = +1

Query: 157  GFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSIARTKLFRSSSSTNL--DTIQSDVIP 330
            GF+ +LIH +SP SPFYNP++T S R+R A   S++R   F   S  +   +  Q D+  
Sbjct: 30   GFTADLIHRDSPKSPFYNPTETSSQRLRNAIHRSVSRVFHFTDISQKDASDNAPQIDLTS 89

Query: 331  NDGSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQECYEQDANLFDPIQSSTYREISC 510
            N G YLM +S+GTPP  ++AIADTGSDL+WTQCKPC +CY Q   LFDP  SSTY+++SC
Sbjct: 90   NSGEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLFDPKASSTYKDVSC 149

Query: 511  QSDYCQSLP-RAHCGNTSQDCQYLYSYGDESHTNGILATETFTFDSTTGRPIAIPTLVFG 687
             S  C +L  +A C      C Y  SYGD S+T G +A +T T  ST  RP+ +  ++ G
Sbjct: 150  SSSQCTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNIIIG 209

Query: 688  CGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKFSYCLVPMSEKH--TSKLNFGSQ 861
            CGHNNAGTF                        I+GKFSYCLVP++ ++  TSK+NFG+ 
Sbjct: 210  CGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINFGTN 269

Query: 862  AVISGQGVVSTPLVPKDPDTYYYLTLEGISVGNIRIENNKISLNQEEGNIV--------- 1014
            AV+SG GVVSTPL+ K  +T+YYLTL+ ISVG+  ++         EGNI+         
Sbjct: 270  AVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSDSGSGEGNIIIDSGTTLTL 329

Query: 1015 --------XXXXVKEAINLDTYPDPEGTFGLCYSTKSDIKVPKFTFHFTGADLQLNEMNT 1170
                        V  +I+ +   DP+    LCYS   D+KVP  T HF GAD+ L   N 
Sbjct: 330  LPTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSATGDLKVPAITMHFDGADVNLKPSNC 389

Query: 1171 FIKISDDLVCLSMVPAQSLSIFGNLAQINFQVEYDLVGKKVSFTPADCIK 1320
            F++IS+DLVC +   + S SI+GN+AQ+NF V YD V K VSF P DC K
Sbjct: 390  FVQISEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAK 439


>ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score =  360 bits (923), Expect = 1e-96
 Identities = 188/435 (43%), Positives = 258/435 (59%), Gaps = 21/435 (4%)
 Frame = +1

Query: 82   IVLAFVVVIIFSLLPNLPVVKAQIGGFSIELIHHESPLSPFYNPSDTYSDRIRKAARHSI 261
            + + F VV++  L   L V  A+ GGFS++LIH +SP SPF++PS T ++R+  A R S+
Sbjct: 6    VKIFFNVVVVGFLFQLLEVALARGGGFSVDLIHRDSPHSPFFDPSKTQAERLTDAFRRSV 65

Query: 262  ARTKLFRSSSSTNLDTIQSDVIPNDGSYLMKLSIGTPPLEVLAIADTGSDLIWTQCKPCQ 441
            +R   FR ++ T+ D IQS ++P+ G YLM L IGTPP+ V+AI DTGSDL WTQC+PC 
Sbjct: 66   SRVGRFRPTAMTS-DGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCT 124

Query: 442  ECYEQDANLFDPIQSSTYREISCQSDYCQSLPRAHCGNTSQDCQYLYSYGDESHTNGILA 621
             CY+Q   LFDP  SSTYR+ SC + +C +L +    +  + C + YSY D S T G LA
Sbjct: 125  HCYKQVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLA 184

Query: 622  TETFTFDSTTGRPIAIPTLVFGCGHNNAGTFXXXXXXXXXXXXXXXXXXXXXXXKIEGKF 801
            +ET T DST G+P++ P   FGCGH++ G F                        I G F
Sbjct: 185  SETLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLF 244

Query: 802  SYCLVPMS--EKHTSKLNFGSQAVISGQGVVSTPLVPKDPDTYYYLTLEGISVGNIRIEN 975
            SYCL+P+S     +S++NFG+   +SG G VSTPLV K PDT+YYLTLEGISVG  R+  
Sbjct: 245  SYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPY 304

Query: 976  NKIS--LNQEEGNIV-----------------XXXXVKEAINLDTYPDPEGTFGLCYSTK 1098
               S     EEGNI+                     V  +I      DP G F LCY+T 
Sbjct: 305  KGYSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTT 364

Query: 1099 SDIKVPKFTFHFTGADLQLNEMNTFIKISDDLVCLSMVPAQSLSIFGNLAQINFQVEYDL 1278
            ++I  P  T HF  A+++L  +NTF+++ +DLVC ++ P   + + GNLAQ+NF V +DL
Sbjct: 365  AEINAPIITAHFKDANVELQPLNTFMRMQEDLVCFTVAPTSDIGVLGNLAQVNFLVGFDL 424

Query: 1279 VGKKVSFTPADCIKY 1323
              K+VSF  ADC ++
Sbjct: 425  RKKRVSFKAADCTQH 439

