BLASTX nr result

ID: Glycyrrhiza35_contig00006803 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza35_contig00006803
         (1626 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

KYP73163.1 Aspartic proteinase nepenthesin-1 [Cajanus cajan]          122   1e-27
KFK23778.1 hypothetical protein AALP_AAs63080U000100 [Arabis alp...   117   6e-25
KFK23777.1 hypothetical protein AALP_AAs63080U000100 [Arabis alp...   115   8e-25
NP_198319.1 Eukaryotic aspartyl protease family protein [Arabido...   116   2e-24
ABK28718.1 unknown, partial [Arabidopsis thaliana]                    116   2e-24
OAO95654.1 CDR1 [Arabidopsis thaliana]                                116   2e-24
KFK24181.1 hypothetical protein AALP_AAs68745U000200 [Arabis alp...   115   5e-24
XP_010435383.1 PREDICTED: aspartic proteinase CDR1-like [Camelin...   114   9e-24
KFK24180.1 hypothetical protein AALP_AAs68745U000200 [Arabis alp...   111   2e-23
XP_010496761.1 PREDICTED: aspartic proteinase CDR1-like [Camelin...   112   3e-23
XP_010449037.2 PREDICTED: aspartic proteinase CDR1-like [Camelin...   112   3e-23
GAU46736.1 hypothetical protein TSUD_402700 [Trifolium subterran...   110   4e-23
GAV71898.1 Asp domain-containing protein [Cephalotus follicularis]    110   7e-23
XP_002517617.1 PREDICTED: probable aspartic protease At2g35615 [...   111   9e-23
XP_010440678.1 PREDICTED: aspartic proteinase CDR1-like [Camelin...   110   1e-22
XP_002870403.1 predicted protein [Arabidopsis lyrata subsp. lyra...   110   1e-22
XP_006395734.1 hypothetical protein EUTSA_v10004149mg [Eutrema s...   110   3e-22
XP_018482491.1 PREDICTED: probable aspartic protease At2g35615 [...   109   4e-22
XP_010507561.1 PREDICTED: probable aspartic protease At2g35615 [...   109   4e-22
NP_850251.1 Eukaryotic aspartyl protease family protein [Arabido...   108   9e-22

>KYP73163.1 Aspartic proteinase nepenthesin-1 [Cajanus cajan]
          Length = 282

 Score =  122 bits (305), Expect = 1e-27
 Identities = 112/322 (34%), Positives = 166/322 (51%), Gaps = 9/322 (2%)
 Frame = -3

Query: 1063 YKDRAGYEGYLGKGTFSDSHDQKLENMEYGVSTGT--KEKNSKGVVGLGRGELSLFQQLN 890
            Y D +   G L KG    S+ + +  + +G+S  +  ++K S GVVGLGRGE+SL  QL 
Sbjct: 3    YDDNSRTGGKLTKGELW-SNSKVIAPVWFGLSNISEGRKKESAGVVGLGRGEISLISQLG 61

Query: 889  NSARARVEFKFSYCLPQYEKKVDSNKNAQYATGKLVFGSRVNTNPETSTPLLDENPEEKA 710
                  +  KFSYCL           N+  AT KL+ G  V  +P   TPL++E      
Sbjct: 62   ------IPSKFSYCL----------SNSISATTKLLLGKDVKIDPTYYTPLVEE------ 99

Query: 709  KDPGKKAEDYCKTRYYCVNLTSIKVDGRQGILGKDTDTTEVMIIDSGSTFTSLRGELFKE 530
                   ED C  RY C++L SI ++ ++       D   +M IDSGSTFT L    F E
Sbjct: 100  -------ED-CYGRY-CIHLESILLNNQR----VKQDGIILMRIDSGSTFTHLNKTFFPE 146

Query: 529  FLKRVEQQIGDKE-EKPISDDYMHCFLKGSADKLEKVSLGFEG-----TTVELKRENIFD 368
            F++ VE+  G+K       + Y HC+  G  ++L +V   F+G      ++ LKR N F 
Sbjct: 147  FIEGVEKITGEKGIPFKGGNKYKHCYEDG--ERLREVKFKFKGDHHDQCSLNLKRVNFF- 203

Query: 367  HIVKKGEGEEKKDYLCLTVKKLDEGKMNGYVVPNVHILGSRAQMDFKVAFDVP-KREVSF 191
              VKK + +  K Y CLTVK+ D           ++ LG++AQ+DF+VA   P + +VSF
Sbjct: 204  --VKKKDND--KTYFCLTVKQHD----------TLNFLGNKAQIDFEVAISEPSEAKVSF 249

Query: 190  VKVETCNQEKKNSNDGVDQRIE 125
            V+ +TC+ E +N N+  ++  E
Sbjct: 250  VE-KTCSSE-QNVNEEEEEEEE 269


>KFK23778.1 hypothetical protein AALP_AAs63080U000100 [Arabis alpina]
          Length = 439

 Score =  117 bits (294), Expect = 6e-25
 Identities = 111/398 (27%), Positives = 178/398 (44%), Gaps = 10/398 (2%)
 Frame = -3

Query: 1354 VTALRHTYLMSLQVRTEDNKFVKAYATPDTGSDLIWLEPTCKTTTANACIKEPETPFKCG 1175
            +T+    YLM++ + T     +   A  DTGSDL+W +  CK        ++P    K  
Sbjct: 87   ITSNHGEYLMNVSLGTPPFPIM---AIADTGSDLLWTQ--CKPCEDCYTQEDPLFDPKAS 141

Query: 1174 DGDEDEYCK-KMWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYLGKGTFS----D 1010
               +D  C  +  S LG EA C  STD+    C Y + Y DR+  +G L   T +    +
Sbjct: 142  STYKDVSCSSRQCSDLGREAAC--STDNT---CAYSMAYGDRSSTKGNLAADTLTLGSTN 196

Query: 1009 SHDQKLENMEYGV---STGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFSYCLPQ 839
            S   +L+N+  G    + GT  K   G+VGLG G +SL  QL +S    ++ KFSYCL  
Sbjct: 197  SRPVQLKNVIIGCGHNNAGTFSKKGSGIVGLGGGPVSLVSQLGDS----IDGKFSYCLVP 252

Query: 838  YEKKVDSNKNAQYATGKLVFGSRVNTNPETSTPLLDENPEEKAKDPGKKAEDYCKTRYYC 659
               + D      + T  +V G         STPL+ ++PE                 +Y 
Sbjct: 253  LTSEKDRTSKINFGTNAVVSGKEA-----VSTPLVKKSPET----------------FYY 291

Query: 658  VNLTSIKVDGRQGIL-GKDTDTTEV-MIIDSGSTFTSLRGELFKEFLKRVEQQIGDKEEK 485
            + L SI V  ++  L G  + T+E  +IIDSG+T T L  E + +  + V  QI  + ++
Sbjct: 292  LTLESISVGSKKIPLPGSVSGTSEGNIIIDSGTTLTMLPTEFYSQLEEAVASQIEAERQE 351

Query: 484  PISDDYMHCFLKGSADKLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKKDYLCLTVKK 305
                    C+   +  K   +++ F+G  V+L   N F  +         ++ +C   + 
Sbjct: 352  DPQGVLSLCYSATADLKAPVITMHFDGADVKLDFSNSFVQL--------SEELVCFAFRG 403

Query: 304  LDEGKMNGYVVPNVHILGSRAQMDFKVAFDVPKREVSF 191
             D+          + I G+ +QM+F V +D   ++VSF
Sbjct: 404  SDD----------LSIYGNLSQMNFLVGYDTVSQKVSF 431


>KFK23777.1 hypothetical protein AALP_AAs63080U000100 [Arabis alpina]
          Length = 332

 Score =  115 bits (288), Expect = 8e-25
 Identities = 106/373 (28%), Positives = 167/373 (44%), Gaps = 10/373 (2%)
 Frame = -3

Query: 1279 ATPDTGSDLIWLEPTCKTTTANACIKEPETPFKCGDGDEDEYCK-KMWSYLGMEAKCIES 1103
            A  DTGSDL+W +  CK        ++P    K     +D  C  +  S LG EA C  S
Sbjct: 2    AIADTGSDLLWTQ--CKPCEDCYTQEDPLFDPKASSTYKDVSCSSRQCSDLGREAAC--S 57

Query: 1102 TDHKDEYCGYKIVYKDRAGYEGYLGKGTFS----DSHDQKLENMEYGV---STGTKEKNS 944
            TD+    C Y + Y DR+  +G L   T +    +S   +L+N+  G    + GT  K  
Sbjct: 58   TDNT---CAYSMAYGDRSSTKGNLAADTLTLGSTNSRPVQLKNVIIGCGHNNAGTFSKKG 114

Query: 943  KGVVGLGRGELSLFQQLNNSARARVEFKFSYCLPQYEKKVDSNKNAQYATGKLVFGSRVN 764
             G+VGLG G +SL  QL +S    ++ KFSYCL     + D      + T  +V G    
Sbjct: 115  SGIVGLGGGPVSLVSQLGDS----IDGKFSYCLVPLTSEKDRTSKINFGTNAVVSGKEA- 169

Query: 763  TNPETSTPLLDENPEEKAKDPGKKAEDYCKTRYYCVNLTSIKVDGRQGIL-GKDTDTTEV 587
                 STPL+ ++PE                 +Y + L SI V  ++  L G  + T+E 
Sbjct: 170  ----VSTPLVKKSPET----------------FYYLTLESISVGSKKIPLPGSVSGTSEG 209

Query: 586  -MIIDSGSTFTSLRGELFKEFLKRVEQQIGDKEEKPISDDYMHCFLKGSADKLEKVSLGF 410
             +IIDSG+T T L  E + +  + V  QI  + ++        C+   +  K   +++ F
Sbjct: 210  NIIIDSGTTLTMLPTEFYSQLEEAVASQIEAERQEDPQGVLSLCYSATADLKAPVITMHF 269

Query: 409  EGTTVELKRENIFDHIVKKGEGEEKKDYLCLTVKKLDEGKMNGYVVPNVHILGSRAQMDF 230
            +G  V+L   N F  +         ++ +C   +  D+          + I G+ +QM+F
Sbjct: 270  DGADVKLDFSNSFVQL--------SEELVCFAFRGSDD----------LSIYGNLSQMNF 311

Query: 229  KVAFDVPKREVSF 191
             V +D   ++VSF
Sbjct: 312  LVGYDTVSQKVSF 324


>NP_198319.1 Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
            Q6XBF8.1 RecName: Full=Aspartic proteinase CDR1; AltName:
            Full=Protein CONSTITUTIVE DISEASE RESISTANCE 1; Flags:
            Precursor AAP72988.1 CDR1 [Arabidopsis thaliana]
            ABE66189.1 aspartyl protease family protein [Arabidopsis
            thaliana] ABG48485.1 At5g33340 [Arabidopsis thaliana]
            AED93896.1 Eukaryotic aspartyl protease family protein
            [Arabidopsis thaliana]
          Length = 437

 Score =  116 bits (291), Expect = 2e-24
 Identities = 110/403 (27%), Positives = 177/403 (43%), Gaps = 12/403 (2%)
 Frame = -3

Query: 1363 EFNVTALRHTYLMSLQVRTEDNKFVKAYATPDTGSDLIWLEPTCKTTTANACIKEPETPF 1184
            + ++T+    YLM++ + T     +   A  DTGSDL+W +        + C  + +  F
Sbjct: 80   QIDLTSNSGEYLMNVSIGTPPFPIM---AIADTGSDLLWTQ----CAPCDDCYTQVDPLF 132

Query: 1183 --KCGDGDEDEYCKK-MWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYLGKGTF- 1016
              K     +D  C     + L  +A C  ST+  D  C Y + Y D +  +G +   T  
Sbjct: 133  DPKTSSTYKDVSCSSSQCTALENQASC--STN--DNTCSYSLSYGDNSYTKGNIAVDTLT 188

Query: 1015 ---SDSHDQKLENMEYGV---STGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFS 854
               SD+   +L+N+  G    + GT  K   G+VGLG G +SL +QL +S    ++ KFS
Sbjct: 189  LGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDS----IDGKFS 244

Query: 853  YCLPQYEKKVDSNKNAQYATGKLVFGSRVNTNPETSTPLLDENPEEKAKDPGKKAEDYCK 674
            YCL     K D      + T  +V GS V      STPL+ +  +E              
Sbjct: 245  YCLVPLTSKKDQTSKINFGTNAIVSGSGV-----VSTPLIAKASQET------------- 286

Query: 673  TRYYCVNLTSIKVDGRQ-GILGKDTDTTEV-MIIDSGSTFTSLRGELFKEFLKRVEQQIG 500
              +Y + L SI V  +Q    G D++++E  +IIDSG+T T L  E + E    V   I 
Sbjct: 287  --FYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSID 344

Query: 499  DKEEKPISDDYMHCFLKGSADKLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKKDYLC 320
             ++++        C+      K+  +++ F+G  V+L   N F  +         +D +C
Sbjct: 345  AEKKQDPQSGLSLCYSATGDLKVPVITMHFDGADVKLDSSNAFVQV--------SEDLVC 396

Query: 319  LTVKKLDEGKMNGYVVPNVHILGSRAQMDFKVAFDVPKREVSF 191
               +            P+  I G+ AQM+F V +D   + VSF
Sbjct: 397  FAFRG----------SPSFSIYGNVAQMNFLVGYDTVSKTVSF 429


>ABK28718.1 unknown, partial [Arabidopsis thaliana]
          Length = 438

 Score =  116 bits (291), Expect = 2e-24
 Identities = 110/403 (27%), Positives = 177/403 (43%), Gaps = 12/403 (2%)
 Frame = -3

Query: 1363 EFNVTALRHTYLMSLQVRTEDNKFVKAYATPDTGSDLIWLEPTCKTTTANACIKEPETPF 1184
            + ++T+    YLM++ + T     +   A  DTGSDL+W +        + C  + +  F
Sbjct: 80   QIDLTSNSGEYLMNVSIGTPPFPIM---AIADTGSDLLWTQ----CAPCDDCYTQVDPLF 132

Query: 1183 --KCGDGDEDEYCKK-MWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYLGKGTF- 1016
              K     +D  C     + L  +A C  ST+  D  C Y + Y D +  +G +   T  
Sbjct: 133  DPKTSSTYKDVSCSSSQCTALENQASC--STN--DNTCSYSLSYGDNSYTKGNIAVDTLT 188

Query: 1015 ---SDSHDQKLENMEYGV---STGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFS 854
               SD+   +L+N+  G    + GT  K   G+VGLG G +SL +QL +S    ++ KFS
Sbjct: 189  LGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDS----IDGKFS 244

Query: 853  YCLPQYEKKVDSNKNAQYATGKLVFGSRVNTNPETSTPLLDENPEEKAKDPGKKAEDYCK 674
            YCL     K D      + T  +V GS V      STPL+ +  +E              
Sbjct: 245  YCLVPLTSKKDQTSKINFGTNAIVSGSGV-----VSTPLIAKASQET------------- 286

Query: 673  TRYYCVNLTSIKVDGRQ-GILGKDTDTTEV-MIIDSGSTFTSLRGELFKEFLKRVEQQIG 500
              +Y + L SI V  +Q    G D++++E  +IIDSG+T T L  E + E    V   I 
Sbjct: 287  --FYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSID 344

Query: 499  DKEEKPISDDYMHCFLKGSADKLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKKDYLC 320
             ++++        C+      K+  +++ F+G  V+L   N F  +         +D +C
Sbjct: 345  AEKKQDPQSGLSLCYSATGDLKVPVITMHFDGADVKLDSSNAFVQV--------SEDLVC 396

Query: 319  LTVKKLDEGKMNGYVVPNVHILGSRAQMDFKVAFDVPKREVSF 191
               +            P+  I G+ AQM+F V +D   + VSF
Sbjct: 397  FAFRG----------SPSFSIYGNVAQMNFLVGYDTVSKTVSF 429


>OAO95654.1 CDR1 [Arabidopsis thaliana]
          Length = 437

 Score =  116 bits (290), Expect = 2e-24
 Identities = 110/403 (27%), Positives = 177/403 (43%), Gaps = 12/403 (2%)
 Frame = -3

Query: 1363 EFNVTALRHTYLMSLQVRTEDNKFVKAYATPDTGSDLIWLEPTCKTTTANACIKEPETPF 1184
            + ++T+    YLM++ + T     +   A  DTGSDL+W +        + C  + +  F
Sbjct: 80   QIDLTSNSGEYLMNVSIGTPPFPIM---AIADTGSDLLWTQ----CAPCDDCYTQVDPLF 132

Query: 1183 --KCGDGDEDEYCKK-MWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYLGKGTF- 1016
              K     +D  C     + L  +A C  ST+  D  C Y + Y D +  +G +   T  
Sbjct: 133  DPKTSSTYKDVSCSSSQCTALENQASC--STN--DNTCSYSLSYGDNSYTKGNIAVDTLT 188

Query: 1015 ---SDSHDQKLENMEYGV---STGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFS 854
               SD+   +L+N+  G    + GT  K   G+VGLG G +SL +QL +S    ++ KFS
Sbjct: 189  LGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDS----IDGKFS 244

Query: 853  YCLPQYEKKVDSNKNAQYATGKLVFGSRVNTNPETSTPLLDENPEEKAKDPGKKAEDYCK 674
            YCL     K D      + T  +V GS V      STPL+ +  +E              
Sbjct: 245  YCLVPLTSKKDQTSKINFGTNAIVSGSGV-----VSTPLIAKASQET------------- 286

Query: 673  TRYYCVNLTSIKVDGRQ-GILGKDTDTTEV-MIIDSGSTFTSLRGELFKEFLKRVEQQIG 500
              +Y + L SI V  +Q    G D++++E  +IIDSG+T T L  E + E    V   I 
Sbjct: 287  --FYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSID 344

Query: 499  DKEEKPISDDYMHCFLKGSADKLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKKDYLC 320
             ++++        C+      K+  +++ F+G  V+L   N F  +         +D +C
Sbjct: 345  AEKKQDPQSGLSLCYSATGDLKVPIITMHFDGADVKLDSSNAFVQV--------SEDLVC 396

Query: 319  LTVKKLDEGKMNGYVVPNVHILGSRAQMDFKVAFDVPKREVSF 191
               +            P+  I G+ AQM+F V +D   + VSF
Sbjct: 397  FAFRG----------SPSFSIYGNVAQMNFLVGYDTVSKTVSF 429


>KFK24181.1 hypothetical protein AALP_AAs68745U000200 [Arabis alpina]
          Length = 439

 Score =  115 bits (287), Expect = 5e-24
 Identities = 106/398 (26%), Positives = 176/398 (44%), Gaps = 10/398 (2%)
 Frame = -3

Query: 1354 VTALRHTYLMSLQVRTEDNKFVKAYATPDTGSDLIWLEPTCKTTTANACIKEPETPFKCG 1175
            +T++   YLM++ + T     +   A  DTGSDL+W +  CK        ++P    K  
Sbjct: 87   ITSIGGDYLMNVSLGTPPFPIM---AIADTGSDLLWTQ--CKPCEDCYTQEDPLFDPKAS 141

Query: 1174 DGDEDEYCKK-MWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYLGKGTFS----D 1010
               +D  C     S  G  A C       D+ C Y I Y DR+  +G +   T +    +
Sbjct: 142  STYKDVSCSSSQCSEFGTRASC-----STDKTCSYSIAYGDRSHSKGNVAADTLTLGSTN 196

Query: 1009 SHDQKLENMEYGVS---TGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFSYCLPQ 839
            S   +L+N+  G      G   K + G+VGLG G +SL  QL +S    ++ KFSYCL  
Sbjct: 197  SRPVQLKNVIIGCGHNDAGNFNKKTSGIVGLGGGAVSLVSQLGDS----IDGKFSYCLVP 252

Query: 838  YEKKVDSNKNAQYATGKLVFGSRVNTNPETSTPLLDENPEEKAKDPGKKAEDYCKTRYYC 659
            +  + D      + T  +V+G  V      STPL+ ++PE                 +Y 
Sbjct: 253  FTSEKDLASKINFGTNAIVWGKGV-----VSTPLIKKSPET----------------FYY 291

Query: 658  VNLTSIKVDGRQ-GILGKDTDTTEV-MIIDSGSTFTSLRGELFKEFLKRVEQQIGDKEEK 485
            + L SI V  ++    G ++ T+E  +IIDSG+T T L  E + +  + V  QI  + ++
Sbjct: 292  LTLESISVGSKKIQFPGINSGTSEGNIIIDSGTTLTMLPTEFYSQLEEAVASQIEAERQE 351

Query: 484  PISDDYMHCFLKGSADKLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKKDYLCLTVKK 305
                    C+   +  K   +++ F+G  V+L   N F  +         ++ +C   + 
Sbjct: 352  DPQGVLSLCYSATADLKAPVITMHFDGADVKLDFSNSFVQL--------SEELVCFAFRG 403

Query: 304  LDEGKMNGYVVPNVHILGSRAQMDFKVAFDVPKREVSF 191
             D+          + I G+ +QM+F V +D   ++VSF
Sbjct: 404  SDD----------LSIYGNLSQMNFLVGYDTVSQKVSF 431


>XP_010435383.1 PREDICTED: aspartic proteinase CDR1-like [Camelina sativa]
          Length = 436

 Score =  114 bits (285), Expect = 9e-24
 Identities = 109/402 (27%), Positives = 174/402 (43%), Gaps = 10/402 (2%)
 Frame = -3

Query: 1354 VTALRHTYLMSLQVRTEDNKFVKAYATPDTGSDLIWLEPTCKTTTANACIKEPETPFKCG 1175
            +T+    YLM++ + T     +   A  DTGSDL+W +  CK         +P       
Sbjct: 83   ITSNGGEYLMNVSLGTPPFPIM---AIADTGSDLLWTQ--CKPCDDCYTQDDPLFDPTAS 137

Query: 1174 DGDEDEYCKK-MWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYLGKGTFS----D 1010
               +D  C     S L  +A C  STD     C Y + Y D +  +G +   T +    D
Sbjct: 138  STYKDVACSSSQCSALENQASC--STDGST--CSYSMSYGDHSYTKGNVAADTLTLGSTD 193

Query: 1009 SHDQKLENMEYGV---STGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFSYCLPQ 839
            +   +L+N+  G    + GT  +   G++GLG G +SL  QL +S    +  KFSYCL  
Sbjct: 194  NRPVQLKNVIIGCGHNNAGTFNEKGSGIIGLGGGAVSLVTQLGDS----INGKFSYCLVP 249

Query: 838  YEKKVDSNKNAQYATGKLVFGSRVNTNPETSTPLLDENPEEKAKDPGKKAEDYCKTRYYC 659
            +  + D      + T   V G+ V      STPL+ ++PE                 +Y 
Sbjct: 250  FSSETDKTSKINFGTNADVSGTGV-----VSTPLITKSPET----------------FYY 288

Query: 658  VNLTSIKVDGRQ-GILGKDTDTTEV-MIIDSGSTFTSLRGELFKEFLKRVEQQIGDKEEK 485
            + L +I V  ++    G D+   E  +IIDSG+T T L  E +KE    V   I  + +K
Sbjct: 289  LTLEAISVGSKKLPFQGSDSGRGEGNIIIDSGTTMTLLPTEFYKELEDAVASSIDAERQK 348

Query: 484  PISDDYMHCFLKGSADKLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKKDYLCLTVKK 305
                    C+   S  K+  +++ F+G  V+L   NIF  I         +D +C   + 
Sbjct: 349  DPQGGLSLCYSATSDLKVPVITMHFDGADVKLDSSNIFVQI--------SQDLVCFAFR- 399

Query: 304  LDEGKMNGYVVPNVHILGSRAQMDFKVAFDVPKREVSFVKVE 179
                       P++ I G+ +QM+F V +D   ++VSF+  +
Sbjct: 400  ---------ASPSLSIYGNLSQMNFLVGYDTVSKKVSFMPTD 432


>KFK24180.1 hypothetical protein AALP_AAs68745U000200 [Arabis alpina]
          Length = 344

 Score =  111 bits (278), Expect = 2e-23
 Identities = 101/373 (27%), Positives = 164/373 (43%), Gaps = 10/373 (2%)
 Frame = -3

Query: 1279 ATPDTGSDLIWLEPTCKTTTANACIKEPETPFKCGDGDEDEYCKK-MWSYLGMEAKCIES 1103
            A  DTGSDL+W +  CK        ++P    K     +D  C     S  G  A C   
Sbjct: 14   AIADTGSDLLWTQ--CKPCEDCYTQEDPLFDPKASSTYKDVSCSSSQCSEFGTRASC--- 68

Query: 1102 TDHKDEYCGYKIVYKDRAGYEGYLGKGTFS----DSHDQKLENMEYGVS---TGTKEKNS 944
                D+ C Y I Y DR+  +G +   T +    +S   +L+N+  G      G   K +
Sbjct: 69   --STDKTCSYSIAYGDRSHSKGNVAADTLTLGSTNSRPVQLKNVIIGCGHNDAGNFNKKT 126

Query: 943  KGVVGLGRGELSLFQQLNNSARARVEFKFSYCLPQYEKKVDSNKNAQYATGKLVFGSRVN 764
             G+VGLG G +SL  QL +S    ++ KFSYCL  +  + D      + T  +V+G  V 
Sbjct: 127  SGIVGLGGGAVSLVSQLGDS----IDGKFSYCLVPFTSEKDLASKINFGTNAIVWGKGV- 181

Query: 763  TNPETSTPLLDENPEEKAKDPGKKAEDYCKTRYYCVNLTSIKVDGRQ-GILGKDTDTTEV 587
                 STPL+ ++PE                 +Y + L SI V  ++    G ++ T+E 
Sbjct: 182  ----VSTPLIKKSPET----------------FYYLTLESISVGSKKIQFPGINSGTSEG 221

Query: 586  -MIIDSGSTFTSLRGELFKEFLKRVEQQIGDKEEKPISDDYMHCFLKGSADKLEKVSLGF 410
             +IIDSG+T T L  E + +  + V  QI  + ++        C+   +  K   +++ F
Sbjct: 222  NIIIDSGTTLTMLPTEFYSQLEEAVASQIEAERQEDPQGVLSLCYSATADLKAPVITMHF 281

Query: 409  EGTTVELKRENIFDHIVKKGEGEEKKDYLCLTVKKLDEGKMNGYVVPNVHILGSRAQMDF 230
            +G  V+L   N F  +         ++ +C   +  D+          + I G+ +QM+F
Sbjct: 282  DGADVKLDFSNSFVQL--------SEELVCFAFRGSDD----------LSIYGNLSQMNF 323

Query: 229  KVAFDVPKREVSF 191
             V +D   ++VSF
Sbjct: 324  LVGYDTVSQKVSF 336


>XP_010496761.1 PREDICTED: aspartic proteinase CDR1-like [Camelina sativa]
          Length = 434

 Score =  112 bits (281), Expect = 3e-23
 Identities = 106/400 (26%), Positives = 172/400 (43%), Gaps = 8/400 (2%)
 Frame = -3

Query: 1354 VTALRHTYLMSLQVRTEDNKFVKAYATPDTGSDLIWLEPTCKTTTANACIKEPETPFKCG 1175
            +T+    YLM++ + T     +   A  DTGSDL+W +  CK         +P       
Sbjct: 83   ITSNGGEYLMNVSLGTPPFPIM---AIADTGSDLLWTQ--CKPCDDCYTQDDPLFDPTAS 137

Query: 1174 DGDEDEYCKK-MWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYLGKGTFS----D 1010
               +D  C     S L  +A C  STD     C Y + Y D +  +G L   T +    +
Sbjct: 138  STYKDVACSSSQCSALENQASC--STDGST--CSYSMSYGDHSYTKGNLAADTLTLGSTN 193

Query: 1009 SHDQKLENMEYGV---STGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFSYCLPQ 839
            +   +L+N+  G    + GT  +   G++GLG G +SL  QL +S    +  KFSYCL  
Sbjct: 194  NRPVQLKNVIIGCGHNNAGTFNEKGSGIIGLGGGAVSLVTQLGDS----INGKFSYCLVP 249

Query: 838  YEKKVDSNKNAQYATGKLVFGSRVNTNPETSTPLLDENPEEKAKDPGKKAEDYCKTRYYC 659
            +  + D      + T   V G+ V      STPL+ ++PE                 +Y 
Sbjct: 250  FSSETDKTSKINFGTNADVSGTGV-----VSTPLIAKSPET----------------FYY 288

Query: 658  VNLTSIKVDGRQGILGKDTDTTEVMIIDSGSTFTSLRGELFKEFLKRVEQQIGDKEEKPI 479
            + L +I V  ++            +IIDSG+T T L  E +KE    V   I  +++K  
Sbjct: 289  LTLEAISVGSKKLPFQGSGSGEGNIIIDSGTTMTLLPTEFYKELEDAVASSIDAEKQKNP 348

Query: 478  SDDYMHCFLKGSADKLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKKDYLCLTVKKLD 299
             +    C+   S  K+  +++ F+G  V+L   NIF  I         +D +C   +   
Sbjct: 349  QEGLSLCYSATSDLKVPVITMHFDGADVKLDSSNIFVQI--------SQDLVCFAFR--- 397

Query: 298  EGKMNGYVVPNVHILGSRAQMDFKVAFDVPKREVSFVKVE 179
                     P++ I G+ +QM+F V +D   ++VSF+  +
Sbjct: 398  -------ASPSLSIYGNLSQMNFLVGYDTVSKKVSFMPTD 430


>XP_010449037.2 PREDICTED: aspartic proteinase CDR1-like [Camelina sativa]
          Length = 442

 Score =  112 bits (281), Expect = 3e-23
 Identities = 109/400 (27%), Positives = 170/400 (42%), Gaps = 9/400 (2%)
 Frame = -3

Query: 1363 EFNVTALRHTYLMSLQVRTEDNKFVKAYATPDTGSDLIWLEPTCKTTTANACIKEPETPF 1184
            +  +T+    YLM++ + T     +   A  DTGSDL+W +  CK         +P    
Sbjct: 80   QIEITSNGGEYLMNVSLGTPPFPIM---AIADTGSDLLWTQ--CKPCDDCYTQDDPLFDP 134

Query: 1183 KCGDGDEDEYCKK-MWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYLGKGTFS-- 1013
                  +D  C     S L  +A C  STD     C Y + Y D +  +G L   T +  
Sbjct: 135  NASSTYKDVPCSSSQCSALENQASC--STDGST--CSYSMSYGDHSYTKGNLAADTLTLG 190

Query: 1012 --DSHDQKLENMEYGV---STGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFSYC 848
              ++   +L+N+  G    + GT  +   G++GLG G +SL  QL +S    +  KFSYC
Sbjct: 191  STNNRPVQLKNVIIGCGHNNAGTFNEKGSGIIGLGGGAVSLVTQLGDS----INGKFSYC 246

Query: 847  LPQYEKKVDSNKNAQYATGKLVFGSRVNTNPETSTPLLDENPEEKAKDPGKKAEDYCKTR 668
            L  +  + D      + T   V G+ V      STPL+ ++PE            Y    
Sbjct: 247  LVPFSSETDKTSKINFGTNADVSGTGV-----VSTPLIAKSPETFY---------YLTLE 292

Query: 667  YYCVNLTSIKVDGRQGILGKDTDTTEV-MIIDSGSTFTSLRGELFKEFLKRVEQQIGDKE 491
               V    I   G     G D+   E  +IIDSG+T T L  E +KE    V   I  ++
Sbjct: 293  AISVGSKKIPFQGSVSDSGSDSGRGEGNIIIDSGTTMTLLPTEFYKELEDAVASSIDAEK 352

Query: 490  EKPISDDYMHCFLKGSADKLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKKDYLCLTV 311
            +K   +    C+   S  K+  +++ F+G  V+L   NIF  I         +D +C   
Sbjct: 353  QKNPQEGLSLCYSATSDLKVPVITMHFDGADVKLDSSNIFVQI--------SQDLVCFAF 404

Query: 310  KKLDEGKMNGYVVPNVHILGSRAQMDFKVAFDVPKREVSF 191
            +            P++ I G+ +QM+F V +D   ++VSF
Sbjct: 405  R----------ASPSLSIYGNLSQMNFLVGYDTVSKKVSF 434


>GAU46736.1 hypothetical protein TSUD_402700 [Trifolium subterraneum]
          Length = 338

 Score =  110 bits (276), Expect = 4e-23
 Identities = 113/386 (29%), Positives = 167/386 (43%), Gaps = 15/386 (3%)
 Frame = -3

Query: 1291 VKAYATPDTGSDLIWLEPTCKTTTANACIKEPETPFKCGDGDEDEYCKKMWSYLGMEAK- 1115
            ++ +A  DTGSDLIW++    ++    C  + +TP+       D      +  L  ++K 
Sbjct: 10   IEKFAVADTGSDLIWVQ----SSPCENCFPQ-DTPYY------DPNKSSTFISLSCDSKP 58

Query: 1114 --CIESTDH---KDEYCGYKIVYKDRAGYEGYLGKGTFS-DSHDQKLENMEYG-----VS 968
               +    H   K   C Y  VY D +   G LG  + +  S+D       +G     V 
Sbjct: 59   CSLLPPRQHRCEKSNKCEYLYVYGDNSYTIGELGTDSINFGSNDVTFPKSIFGCGHNNVV 118

Query: 967  TGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFSYCL-PQYEKKVDSNKNAQYATG 791
            T  +     G+VGLG G LSL  QL +S    +  KFSYCL P Y            +T 
Sbjct: 119  TFNRTSKVTGLVGLGAGPLSLVSQLGDS----IGHKFSYCLIPSYSN----------STS 164

Query: 790  KLVFGSR--VNTNPETSTPLLDENPEEKAKDPGKKAEDYCKTRYYCVNLTSIKVDGRQGI 617
            KL FG    +N N   STPL+      K+  P           YY +NL  I V G + +
Sbjct: 165  KLKFGDEAIINGNGVVSTPLII-----KSSHP----------TYYYLNLEGITV-GEKTV 208

Query: 616  LGKDTDTTEVMIIDSGSTFTSLRGELFKEFLKRVEQQIGDKEEKPISDDYMHCFLKGSAD 437
                 D    +IIDSG+TFT L    + + +  V++ IG +E K     +  CF+ G   
Sbjct: 209  QTSQIDGN--IIIDSGTTFTYLEPNFYNDLIASVKEVIGVEEVKDPPTPFSFCFIYGDQT 266

Query: 436  KLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKKDYLCLTVKKLDEGKMNGYVVPNVHI 257
            K       F G  V LK EN+F           + +  CL ++       NG     + I
Sbjct: 267  KAPDFVFHFTGADVILKLENVF--------AVFENNLSCLWIE-----PSNG-----LSI 308

Query: 256  LGSRAQMDFKVAFDVPKREVSFVKVE 179
             G++AQ+DF V +DV  ++VSF + +
Sbjct: 309  FGNKAQVDFLVEYDVEGKKVSFAETD 334


>GAV71898.1 Asp domain-containing protein [Cephalotus follicularis]
          Length = 357

 Score =  110 bits (275), Expect = 7e-23
 Identities = 114/410 (27%), Positives = 175/410 (42%), Gaps = 21/410 (5%)
 Frame = -3

Query: 1333 YLMSLQVRTEDNKFVKAYATPDTGSDLIWLE--PTCKTTTANACIKEPETP-----FKCG 1175
            Y M++ + T     VK  A  DTGSDLIW++  P  +    N  + +P+         CG
Sbjct: 8    YFMAMSIGTPP---VKVLAIADTGSDLIWIQCKPCKRCYRQNPALFDPKKSSSYENLPCG 64

Query: 1174 DGDEDEYCKKMWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYLG--KGTFSDSHD 1001
                 + CK + S          S       C Y   Y D++   G L   K TF  +  
Sbjct: 65   ----SDSCKAIQS-------PGRSCRQDQNACKYSYSYADQSFSRGNLALEKFTFGSTRG 113

Query: 1000 Q--KLENMEYGV---STGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFSYCLPQY 836
                L  M +G    + G  +K   G+VGLG G LSL  QL     A + +KFSYCL  Y
Sbjct: 114  PPVSLPMMVFGCGHDNGGDFDKFGSGIVGLGGGPLSLVPQLG----ASINWKFSYCLIPY 169

Query: 835  EKKVDSNKNAQYATGKLVFGSR--VNTNPETSTPLLDENPEEKAKDPGKKAEDYCKTRYY 662
                  N  +   T K+ FG+   V+    ++TPL+D+ P                + YY
Sbjct: 170  -----VNVESSDVTNKITFGANAMVSDANVSTTPLVDKQP----------------STYY 208

Query: 661  CVNLTSIKVDGRQGI---LGKDTDTTEVMIIDSGSTFTSLRGELFKEFLKRVEQQIGDKE 491
             + L +I V  ++     LG DT+   + IIDSG+T T +    +++ +  +E+ IG K 
Sbjct: 209  YLTLEAISVGNKRLAHKGLGSDTEEGNI-IIDSGTTLTFIGSHFYEKLVSALEKVIGAKL 267

Query: 490  EKPISDDYMHCFLKGSADKLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKKDYLCLTV 311
                      CF       L  ++  F G  VEL+  N F  +        + D+LCLT 
Sbjct: 268  VSDPRGFLSACFKYHRKIDLPIITFHFTGADVELQSYNTFARL--------EDDFLCLT- 318

Query: 310  KKLDEGKMNGYVVP--NVHILGSRAQMDFKVAFDVPKREVSFVKVETCNQ 167
                       ++P  ++ + G+ AQ +F + +D+ KR VSF + +   Q
Sbjct: 319  -----------MIPSKDLGVFGNLAQANFLIGYDLEKRTVSFKQTDCTKQ 357


>XP_002517617.1 PREDICTED: probable aspartic protease At2g35615 [Ricinus communis]
            EEF44781.1 Aspartic proteinase nepenthesin-2 precursor,
            putative [Ricinus communis]
          Length = 449

 Score =  111 bits (278), Expect = 9e-23
 Identities = 113/415 (27%), Positives = 185/415 (44%), Gaps = 26/415 (6%)
 Frame = -3

Query: 1333 YLMSLQVRTEDNKFVKAYATPDTGSDLIWLE--PTCKTTTANACIKEPETPFK-----CG 1175
            YLM + +    N  V+  A  DTGSDLIW++  P       N+ I +P          CG
Sbjct: 93   YLMRISI---GNPQVEILAIADTGSDLIWVQCQPCEMCYKQNSPIFDPRRSSSYRNVLCG 149

Query: 1174 DGDEDEYCKKMWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYL-----GKGTFSD 1010
            +    E+C K    L  EA+  ++     + CGY   Y D++  +G+L     G G+ + 
Sbjct: 150  N----EFCNK----LDGEARSCDARGFV-KTCGYTYSYGDQSFSDGHLAIERFGIGSTNS 200

Query: 1009 SHDQKL---ENMEYGVST---GTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFSYC 848
            +    +   + + +G  T   GT ++   G++GLG G +SL  QL      ++  KFSYC
Sbjct: 201  NTSAAIAYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLG----PKLSGKFSYC 256

Query: 847  LPQYEKKVDSNKNAQYATGKLVFGSRVNTNPET----STPLLDENPEEKAKDPGKKAEDY 680
            L      V +++ + Y T K+ FG+ +N +       STPLL + PE             
Sbjct: 257  L------VPTSEQSNY-TSKINFGNDINISGSNYNVVSTPLLPKKPET------------ 297

Query: 679  CKTRYYCVNLTSIKVDGRQ----GILGKDTDTTEVMIIDSGSTFTSLRGELFKEFLKRVE 512
                YY + L +I V+ ++     +   + +   + IIDSG+T T L  E F      VE
Sbjct: 298  ----YYYLTLEAISVENKRLPYTNLWNGEVEKGNI-IIDSGTTLTFLDSEFFNNLDSAVE 352

Query: 511  QQIGDKEEKPISDDYMHCFLKGSADKLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKK 332
            + +  +        +  CF    A +L  ++  F G  VEL+  N F  +        ++
Sbjct: 353  EAVKGERVSDPHGLFNICFKDEKAIELPIITAHFTGADVELQPVNTFAKV--------EE 404

Query: 331  DYLCLTVKKLDEGKMNGYVVPNVHILGSRAQMDFKVAFDVPKREVSFVKVETCNQ 167
            D LC T+   ++          + I G+ AQM+F V +D+ K+ VSF+  +   Q
Sbjct: 405  DLLCFTMIPSND----------IAIFGNLAQMNFLVGYDLEKKAVSFLPTDCTKQ 449


>XP_010440678.1 PREDICTED: aspartic proteinase CDR1-like [Camelina sativa]
          Length = 436

 Score =  110 bits (276), Expect = 1e-22
 Identities = 109/402 (27%), Positives = 176/402 (43%), Gaps = 10/402 (2%)
 Frame = -3

Query: 1354 VTALRHTYLMSLQVRTEDNKFVKAYATPDTGSDLIWLEPTCKTTTANACIKEPETPFKCG 1175
            +T+    YLM++ + T     +   A  DTGSDL+W +  CK         +P       
Sbjct: 83   ITSNGGEYLMNVSLGTPPFPIM---AIADTGSDLLWTQ--CKPCDDCYTQDDPLFDPNAS 137

Query: 1174 DGDEDEYCKK-MWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYLGKGTFS----D 1010
               +D  C     S L  +A C  STD     C Y + Y D +  EG L   T +    +
Sbjct: 138  STYKDVPCSSSQCSALENQASC--STDGST--CSYSMSYGDHSYTEGNLAADTLTLGSTN 193

Query: 1009 SHDQKLENMEYGV---STGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFSYCLPQ 839
            +   +L+N+  G    ++GT  +   G++GLG G +SL  QL +S    ++ KFSYCL  
Sbjct: 194  NRPVQLKNVIIGCGHNNSGTFNEKGSGIIGLGGGAVSLVTQLGDS----IDGKFSYCLVP 249

Query: 838  YEKKVDSNKNAQYATGKLVFGSRVNTNPETSTPLLDENPEEKAKDPGKKAEDYCKTRYYC 659
               + D      + T   V G+ V      STPL+ ++ E                 +Y 
Sbjct: 250  LSSETDKTSKINFGTNADVSGTGV-----VSTPLIAKSSET----------------FYY 288

Query: 658  VNLTSIKVDGRQ-GILGKDTDTTEV-MIIDSGSTFTSLRGELFKEFLKRVEQQIGDKEEK 485
            + L +I V  ++    G D+ + E  +IIDSG+T T L  E +KE    V   I  +++K
Sbjct: 289  LTLEAISVGSKKIPFQGSDSGSGEGNIIIDSGTTMTLLPTEFYKELEDVVASSIDAEKQK 348

Query: 484  PISDDYMHCFLKGSADKLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKKDYLCLTVKK 305
                    C+   S  K+  +++ F+G  V+L   NIF  I         +D +C   + 
Sbjct: 349  DPQGVLSLCYSATSDLKVPVITMHFDGADVKLDSSNIFVQI--------SQDLVCFAFR- 399

Query: 304  LDEGKMNGYVVPNVHILGSRAQMDFKVAFDVPKREVSFVKVE 179
                       P++ I G+ +QM+F V +D   ++VSF+  +
Sbjct: 400  ---------ASPSLSIYGNLSQMNFLVGYDTVSKKVSFMPTD 432


>XP_002870403.1 predicted protein [Arabidopsis lyrata subsp. lyrata] EFH46662.1
            predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score =  110 bits (276), Expect = 1e-22
 Identities = 110/401 (27%), Positives = 173/401 (43%), Gaps = 10/401 (2%)
 Frame = -3

Query: 1363 EFNVTALRHTYLMSLQVRTEDNKFVKAYATPDTGSDLIWLEPTCKTTTANACIKEPETPF 1184
            + ++T+    YLM++ + T     +   A  DTGSDL+W +  CK         +P    
Sbjct: 84   QIDLTSNSGEYLMNISLGTPPFPIM---AIADTGSDLLWTQ--CKPCDDCYTQVDPLFDP 138

Query: 1183 KCGDGDEDEYCKK-MWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYLGKGTFS-- 1013
            K     +D  C     + L  +A C  ST+  D  C Y   Y DR+  +G +   T +  
Sbjct: 139  KASSTYKDVSCSSSQCTALENQASC--STE--DNTCSYSTSYGDRSYTKGNIAVDTLTLG 194

Query: 1012 --DSHDQKLENMEYGV---STGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFSYC 848
              D+   +L+N+  G    + GT  K   G+VGLG G +SL  QL +S    ++ KFSYC
Sbjct: 195  STDTRPVQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDS----IDGKFSYC 250

Query: 847  LPQYEKKVDSNKNAQYATGKLVFGSRVNTNPETSTPLLDENPEEKAKDPGKKAEDYCKTR 668
            L     + D      + T  +V G+ V      STPL+ ++ E                 
Sbjct: 251  LVPLTSENDRTSKINFGTNAVVSGTGV-----VSTPLIAKSQET---------------- 289

Query: 667  YYCVNLTSIKVDGRQ-GILGKDTDTTEV-MIIDSGSTFTSLRGELFKEFLKRVEQQIGDK 494
            +Y + L SI V  ++    G D+ + E  +IIDSG+T T L  E + E    V   I  +
Sbjct: 290  FYYLTLKSISVGSKEVQYPGSDSGSGEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAE 349

Query: 493  EEKPISDDYMHCFLKGSADKLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKKDYLCLT 314
            +++        C+      K+  +++ F+G  V LK  N F  I         +D +C  
Sbjct: 350  KKQDPQTGLSLCYSATGDLKVPAITMHFDGADVNLKPSNCFVQI--------SEDLVCFA 401

Query: 313  VKKLDEGKMNGYVVPNVHILGSRAQMDFKVAFDVPKREVSF 191
             +            P+  I G+ AQM+F V +D   + VSF
Sbjct: 402  FRG----------SPSFSIYGNVAQMNFLVGYDTVSKTVSF 432


>XP_006395734.1 hypothetical protein EUTSA_v10004149mg [Eutrema salsugineum]
            ESQ33020.1 hypothetical protein EUTSA_v10004149mg
            [Eutrema salsugineum]
          Length = 467

 Score =  110 bits (275), Expect = 3e-22
 Identities = 123/411 (29%), Positives = 169/411 (41%), Gaps = 18/411 (4%)
 Frame = -3

Query: 1357 NVTALRHT----YLMSLQVRTEDNKFVKAYATPDTGSDLIWLEPTCKTTTANACIKEPET 1190
            N+ A  H+    +LM L +    N  VK  A  DTGSDLIW +  CK  T   C  +P  
Sbjct: 101  NIKAPTHSGSGEFLMELSI---GNPPVKYSAIVDTGSDLIWTQ--CKPCTE--CFDQPTP 153

Query: 1189 PFKCGDGDEDEYCKKMWSYLGMEAKCIESTDHKDEY-CGYKIVYKDRAGYEGYLGKGTFS 1013
             F   D  +     K+    G+      ST ++DE  C Y   Y D +   G L   TF+
Sbjct: 154  IF---DPKQSSSYSKVGCSSGLCNALSRSTCNQDEAACEYLYTYGDYSSTRGILATETFT 210

Query: 1012 DSHDQKLENMEYGV---STGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFSYCLP 842
               D  +  + +G    + G       G+VGLGRG LSL  QL        E KFSYCL 
Sbjct: 211  FEEDNSVSGIGFGCGKENEGDGFSQGSGLVGLGRGPLSLISQLK-------ETKFSYCLT 263

Query: 841  QYE-KKVDSNKNAQYATGKLVFGSRVNTNPE-TSTPLLDENPEEKAKDPGKKAEDYCKTR 668
              E  +  S+         +V  +  N + E T T  L  NP + +              
Sbjct: 264  SIEDNEASSSLFIGSLASNIVNKTSANLDGEVTKTISLLRNPNQPS-------------- 309

Query: 667  YYCVNLTSIKVDGRQGILGKDT-----DTTEVMIIDSGSTFTSLRGELFKEFLKRVEQQI 503
            +Y ++L  I V G++  + K T     D T  MIIDSG+T T L    F+E  K    ++
Sbjct: 310  FYYLDLQGITVGGKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEEAAFQELKKEFTSRM 369

Query: 502  GDKEEKPISDDYMHCFLKGSADK---LEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKK 332
                +   S     CF   SA K   + K+   F+G  +EL  EN           +   
Sbjct: 370  SLPVDDSGSTGLDLCFTLPSAAKKIAVPKLIFHFKGADLELPGENYM-------VADSST 422

Query: 331  DYLCLTVKKLDEGKMNGYVVPNVHILGSRAQMDFKVAFDVPKREVSFVKVE 179
              LCL +     G  NG     + I G+  Q +F V  D+ K  VSFV  E
Sbjct: 423  GVLCLAM-----GSSNG-----MSIFGNVQQQNFNVLHDLEKDTVSFVPTE 463


>XP_018482491.1 PREDICTED: probable aspartic protease At2g35615 [Raphanus sativus]
          Length = 447

 Score =  109 bits (273), Expect = 4e-22
 Identities = 110/400 (27%), Positives = 160/400 (40%), Gaps = 12/400 (3%)
 Frame = -3

Query: 1333 YLMSLQVRTEDNKFVKAYATPDTGSDLIWLEPTCKTTTANACIKEPETPFKCGDGDEDEY 1154
            + MS+ + T     +   A  DTGSDL W++  CK      C KE    F        + 
Sbjct: 84   FFMSITIGTPP---MNVLAIADTGSDLTWVQ--CKP--CQQCYKENGAIFDKTHSSTYKS 136

Query: 1153 CKKMWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYLGKGTFS----DSHDQKLEN 986
                 S+    +      D     C Y+  Y D++  +G +   T S             
Sbjct: 137  VPCESSHCNALSTNERGCDESKNVCKYRYSYGDQSFTKGDVATETISIGSASGSPVSFPG 196

Query: 985  MEYGV---STGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFSYCLPQYEKKVDSN 815
              +G    + GT ++   G++GLG G+LSL  QL +S    +  KFSYCL      V+  
Sbjct: 197  TVFGCGYDNGGTFDETGSGIIGLGGGQLSLISQLGSS----ISNKFSYCLSHKSSTVNGT 252

Query: 814  KNAQYATGKLVFGSRVNTNPETSTPLLDENPEEKAKDPGKKAEDYCKTRYYCVNLTSIKV 635
                  T  +  G    ++   STPL+D+ P         K   Y       V  T I  
Sbjct: 253  SVINLGTNSIPSGLNEPSSHVISTPLVDKEP---------KTYYYLTLEAISVGKTKIPY 303

Query: 634  DGRQGILGKDTDTTEV---MIIDSGSTFTSLRGELFKEFLKRVEQQI-GDKEEKPISDDY 467
             G       + D   V   +IIDSG+T T L    +  F   VE+ + G K         
Sbjct: 304  TGSSSYYYPNNDDVSVKGNIIIDSGTTLTLLESGFYDGFGAAVEEAVTGAKRVSDPQGLL 363

Query: 466  MHCFLKGSAD-KLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKKDYLCLTVKKLDEGK 290
             HCF  GSA+  L ++++ F G  V L   N F   VK  E     D +C+++    E  
Sbjct: 364  SHCFKSGSAEIGLPEITMHFTGADVRLSPLNAF---VKMSE-----DMVCMSMIPTTE-- 413

Query: 289  MNGYVVPNVHILGSRAQMDFKVAFDVPKREVSFVKVETCN 170
                    V I G+ AQMDF V +D+  R VSF +++  N
Sbjct: 414  --------VAIYGNFAQMDFLVGYDLETRTVSFQRMDCTN 445


>XP_010507561.1 PREDICTED: probable aspartic protease At2g35615 [Camelina sativa]
          Length = 449

 Score =  109 bits (273), Expect = 4e-22
 Identities = 118/404 (29%), Positives = 175/404 (43%), Gaps = 19/404 (4%)
 Frame = -3

Query: 1333 YLMSLQVRTEDNKFVKAYATPDTGSDLIWLEPTCKTTTANACIKEPETPF--KCGDGDED 1160
            + MS+ + T     +K     DTGSDL W++  CK      C KE    F  K     ++
Sbjct: 86   FFMSITIGTPP---IKVLGIADTGSDLTWIQ--CKP--CQQCYKENGQIFDKKKSSTYKN 138

Query: 1159 EYC--KKMWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYLGKGTFS----DSHDQ 998
            E C  +   +    E  C EST+     C Y+  Y D++  +G +   T S         
Sbjct: 139  EPCDSRNCQALSTSERGCDESTN----VCKYRYSYGDQSFSKGDVATETISIDSASGSPV 194

Query: 997  KLENMEYGV---STGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFSYCLPQYEKK 827
                  +G    + GT ++   G++GLG G+LSL  QL++S    +  KFSYCL      
Sbjct: 195  SFPGTVFGCGYNNGGTFDETGSGIIGLGGGQLSLISQLSSS----ISKKFSYCLSHKSST 250

Query: 826  VDSNKNAQYATGKLVFGSRVNTNPETSTPLLDENP------EEKAKDPGKKAEDYCKTRY 665
             +        TG +   S    +   STPL+D+ P        +A   GKK   Y    Y
Sbjct: 251  TNGTSVINLGTGSIP-SSLSKESGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGGTY 309

Query: 664  YCVNLTSIKVDGRQGILGKDTDTTEVMIIDSGSTFTSLRGELFKEFLKRVEQQI-GDKEE 488
               +      DG     G  ++T+  +IIDSG+T T L    F +F   VE+ + G K  
Sbjct: 310  IPND------DG-----GIFSETSGNIIIDSGTTLTLLESGFFDKFGAAVEESVTGAKRL 358

Query: 487  KPISDDYMHCFLKGSAD-KLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKKDYLCLTV 311
                    HCF  GSA+  L ++++ F G  V+L   N F   VK  E     D +CL++
Sbjct: 359  SDPQGGLSHCFKSGSAEIGLPEITVHFTGADVKLSPINAF---VKVSE-----DMVCLSM 410

Query: 310  KKLDEGKMNGYVVPNVHILGSRAQMDFKVAFDVPKREVSFVKVE 179
                          +V I G+ AQMDF V +D+  R VSF +++
Sbjct: 411  ----------IPTTDVAIYGNFAQMDFLVGYDLETRTVSFQRMD 444


>NP_850251.1 Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
            Q3EBM5.1 RecName: Full=Probable aspartic protease
            At2g35615; Flags: Precursor AEC09130.1 Eukaryotic
            aspartyl protease family protein [Arabidopsis thaliana]
            OAP07729.1 hypothetical protein AXX17_AT2G32220
            [Arabidopsis thaliana]
          Length = 447

 Score =  108 bits (270), Expect = 9e-22
 Identities = 118/400 (29%), Positives = 168/400 (42%), Gaps = 19/400 (4%)
 Frame = -3

Query: 1333 YLMSLQVRTEDNKFVKAYATPDTGSDLIWLEPTCKTTTANACIKEPETPF--KCGDGDED 1160
            + MS+ + T     +K +A  DTGSDL W++  CK      C KE    F  K     + 
Sbjct: 85   FFMSITIGTPP---IKVFAIADTGSDLTWVQ--CKP--CQQCYKENGPIFDKKKSSTYKS 137

Query: 1159 EYC--KKMWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYLGKGTFS----DSHDQ 998
            E C  +   +    E  C ES +     C Y+  Y D++  +G +   T S         
Sbjct: 138  EPCDSRNCQALSSTERGCDESNN----ICKYRYSYGDQSFSKGDVATETVSIDSASGSPV 193

Query: 997  KLENMEYGV---STGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFSYCLPQYEKK 827
                  +G    + GT ++   G++GLG G LSL  QL +S    +  KFSYCL      
Sbjct: 194  SFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSS----ISKKFSYCLSHKSAT 249

Query: 826  VDSNKNAQYATGKLVFGSRVNTNPETSTPLLDENP------EEKAKDPGKKAEDYCKTRY 665
             +        T  +   S    +   STPL+D+ P        +A   GKK   Y  + Y
Sbjct: 250  TNGTSVINLGTNSIP-SSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSY 308

Query: 664  YCVNLTSIKVDGRQGILGKDTDTTEVMIIDSGSTFTSLRGELFKEFLKRVEQQI-GDKEE 488
                          GIL   ++T+  +IIDSG+T T L    F +F   VE+ + G K  
Sbjct: 309  N---------PNDDGIL---SETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRV 356

Query: 487  KPISDDYMHCFLKGSAD-KLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKKDYLCLTV 311
                    HCF  GSA+  L ++++ F G  V L   N F   VK  E     D +CL++
Sbjct: 357  SDPQGLLSHCFKSGSAEIGLPEITVHFTGADVRLSPINAF---VKLSE-----DMVCLSM 408

Query: 310  KKLDEGKMNGYVVPNVHILGSRAQMDFKVAFDVPKREVSF 191
                E          V I G+ AQMDF V +D+  R VSF
Sbjct: 409  VPTTE----------VAIYGNFAQMDFLVGYDLETRTVSF 438


Top