BLASTX nr result

ID: Ephedra29_contig00001794 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra29_contig00001794
         (1883 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_019238066.1 PREDICTED: aspartyl protease family protein 2 [Ni...   415   e-136
XP_010262754.1 PREDICTED: aspartyl protease family protein 2 [Ne...   412   e-135
XP_006395632.1 hypothetical protein EUTSA_v10004188mg [Eutrema s...   412   e-135
XP_009767089.1 PREDICTED: aspartic proteinase nepenthesin-1 [Nic...   411   e-135
XP_006291121.1 hypothetical protein CARUB_v10017234mg [Capsella ...   410   e-134
XP_016506989.1 PREDICTED: aspartyl protease family protein 2-lik...   410   e-134
XP_015076367.1 PREDICTED: aspartic proteinase CDR1 [Solanum penn...   410   e-134
XP_004238970.1 PREDICTED: aspartyl protease family protein 2 [So...   409   e-134
OAP05619.1 hypothetical protein AXX17_AT3G27700 [Arabidopsis tha...   409   e-134
NP_189198.1 Eukaryotic aspartyl protease family protein [Arabido...   409   e-134
CDP00568.1 unnamed protein product [Coffea canephora]                 408   e-133
XP_006362527.1 PREDICTED: aspartic proteinase nepenthesin-1 [Sol...   407   e-133
XP_002875271.1 hypothetical protein ARALYDRAFT_484331 [Arabidops...   406   e-133
XP_010514187.1 PREDICTED: aspartyl protease family protein 2-lik...   406   e-133
JAU72320.1 Aspartic proteinase nepenthesin-2 [Noccaea caerulesce...   405   e-132
JAU21452.1 Aspartic proteinase nepenthesin-2, partial [Noccaea c...   406   e-132
XP_012082020.1 PREDICTED: aspartic proteinase nepenthesin-1 [Jat...   405   e-132
XP_013616585.1 PREDICTED: aspartic proteinase nepenthesin-2-like...   404   e-132
XP_016487000.1 PREDICTED: aspartyl protease family protein 2-lik...   404   e-132
JAU52374.1 Aspartic proteinase nepenthesin-2, partial [Noccaea c...   405   e-132

>XP_019238066.1 PREDICTED: aspartyl protease family protein 2 [Nicotiana attenuata]
            OIT21988.1 aspartyl protease family protein 2 [Nicotiana
            attenuata]
          Length = 453

 Score =  415 bits (1066), Expect = e-136
 Identities = 200/389 (51%), Positives = 263/389 (67%), Gaps = 1/389 (0%)
 Frame = -3

Query: 1512 SSLAFRGHGNILAPVISGAAAGSGQYIVDFTVGTPPQHFMLVADTGSDLLWVKCSGCRDR 1333
            SSL  R   +   P+ SGA++GSGQY VD  +GTPPQ  +LVADTGSDL+WV CS CR+ 
Sbjct: 62   SSLNHRSIRSAKLPLTSGASSGSGQYFVDLKLGTPPQRLLLVADTGSDLVWVTCSACRNC 121

Query: 1332 CLRKPGTAFFARRSRTFSPLHCYAPACELVPGPPDATCNATREHSSCMYEYVYADLSDST 1153
              R+ G+AF AR S T+ P HCY   C LVP P    CN TR+HS C Y Y Y+D S++ 
Sbjct: 122  SSRRRGSAFLARHSSTYFPFHCYDKKCRLVPNPRGVACNHTRQHSPCRYVYSYSDESETR 181

Query: 1152 GVFARDTATLNASDGSLVRVRGVAFGCGMHSTXXXXXXXXXXSAHGVMGLGKGAISFASQ 973
            G F+ +T TLNAS GS V+ +  AFGC   ++           A GVMGLG+G+IS ASQ
Sbjct: 182  GFFSTETTTLNASSGSAVKFKKFAFGCSFEASGPSITGPSFNGAQGVMGLGRGSISLASQ 241

Query: 972  IGRKVGDKFSYCLVDYTASPPRSSYLFIGHHGAIHRS-LHYTPLIHNKFAETFYYLGVEK 796
            +GR+ G+KFSYCL+DYT SP  +SYL IG    ++ S + YTP+I+N F  TFYY+G+E 
Sbjct: 242  LGRRFGNKFSYCLMDYTLSPTPTSYLLIGRSAEVNDSKMSYTPMINNPFTSTFYYIGIES 301

Query: 795  LWIGDRVLRLPAKIWEIDLHGNGGTIIDSGTTLTFLAGPAYVAVLKAYEKSVKYPKVSHA 616
            ++I D  L++   +W ID  GNGGT++DSGTTLTFLA PAY  +L+ +++ V  PKV   
Sbjct: 302  VYIEDVKLQISPSVWAIDELGNGGTVMDSGTTLTFLAEPAYRRILREFKRRVTLPKVDDP 361

Query: 615  FEDFDLCYNVSGIRRVHFPRFRISLKGNVRFEPPTSNYFINAAEDVRCLALRGVSSRSGF 436
               FDLC NVSG+ R  FP+    L G+    PP+ NYFI+ AEDV+CLAL+ +++ SGF
Sbjct: 362  TLGFDLCVNVSGVSRPSFPKMSFKLSGDSVLSPPSGNYFIDTAEDVKCLALQPLAAPSGF 421

Query: 435  SIIGNLMQQNFYIVYDRERSRLGFSQTDC 349
            S+IGNLMQQ F   +DR+RSR+GF++  C
Sbjct: 422  SVIGNLMQQGFVFEFDRDRSRIGFTRHGC 450


>XP_010262754.1 PREDICTED: aspartyl protease family protein 2 [Nelumbo nucifera]
          Length = 460

 Score =  412 bits (1060), Expect = e-135
 Identities = 201/389 (51%), Positives = 264/389 (67%), Gaps = 3/389 (0%)
 Frame = -3

Query: 1503 AFRGHGNILAPVISGAAAGSGQYIVDFTVGTPPQHFMLVADTGSDLLWVKCSGCRDRCLR 1324
            A +   ++ +PV+SGA+ G GQY VDF +G+PPQ  +LVADTGSDL+WVKCS CR+    
Sbjct: 70   ALQSRKSLKSPVVSGASTGFGQYFVDFRIGSPPQKLLLVADTGSDLVWVKCSACRNCSKH 129

Query: 1323 KPGTAFFARRSRTFSPLHCYAPACELVPGP-PDATCNATREHSSCMYEYVYADLSDSTGV 1147
             PG AF AR S TF+P+HCY PAC+LVP P     CN T  HS+C YEY+YAD S ++G 
Sbjct: 130  APGLAFLARHSTTFAPIHCYDPACQLVPHPVKHQPCNHTLLHSTCRYEYLYADESRTSGF 189

Query: 1146 FARDTATLNASDGSLVRVRGVAFGCGMHSTXXXXXXXXXXSAHGVMGLGKGAISFASQIG 967
            F+R+T TLN S G + R++ +AFGCG H +           AHGVMGLG+G  SF+SQ+G
Sbjct: 190  FSRETVTLNTSFGRVARLKKLAFGCGFHISGPSVSGASFNGAHGVMGLGRGPTSFSSQVG 249

Query: 966  RKVGDKFSYCLVDYTASPPRSSYLFIGHHGAIHRS--LHYTPLIHNKFAETFYYLGVEKL 793
            ++ G KFSYCL+DYT SPP +SYL IG    I R   + +TPL  +  + +FYY+G++ +
Sbjct: 250  KRFGYKFSYCLMDYTISPPPTSYLLIGETQPITRKQMMSFTPLHTSALSPSFYYIGIKSV 309

Query: 792  WIGDRVLRLPAKIWEIDLHGNGGTIIDSGTTLTFLAGPAYVAVLKAYEKSVKYPKVSHAF 613
            +I    L +   IW +D  GNGGT+IDSGTTLTF+A PAY  VL A++K ++ P+ +   
Sbjct: 310  FIDGVGLPIDPSIWALDNQGNGGTVIDSGTTLTFIAEPAYRQVLTAFKKRIRLPRTTDPS 369

Query: 612  EDFDLCYNVSGIRRVHFPRFRISLKGNVRFEPPTSNYFINAAEDVRCLALRGVSSRSGFS 433
               D C NVSG+     P+    L+G+  F PP  NYFI+AAE V+CLA+R V++ SGFS
Sbjct: 370  SSLDFCVNVSGVANPSLPKLSFRLEGDSVFSPPARNYFIDAAEGVKCLAMRPVTTPSGFS 429

Query: 432  IIGNLMQQNFYIVYDRERSRLGFSQTDCA 346
            IIGNLMQQ F   +DRERSRLGFS+  CA
Sbjct: 430  IIGNLMQQGFLFEFDRERSRLGFSRHGCA 458


>XP_006395632.1 hypothetical protein EUTSA_v10004188mg [Eutrema salsugineum]
            ESQ32918.1 hypothetical protein EUTSA_v10004188mg
            [Eutrema salsugineum]
          Length = 455

 Score =  412 bits (1058), Expect = e-135
 Identities = 219/455 (48%), Positives = 280/455 (61%), Gaps = 8/455 (1%)
 Frame = -3

Query: 1686 LKLKLIHRHSPDSPFRKPYANRREHLGDLIRDDERRLHGFQKLLDRNLPVPHDDLPTSSS 1507
            LKL L+ +    SPF  P  +        +  D RRLH F  L  R  PVP         
Sbjct: 30   LKLPLLRK----SPFPSPTQS--------LALDTRRLH-FLSL--RRKPVPF-------- 66

Query: 1506 LAFRGHGNILAPVISGAAAGSGQYIVDFTVGTPPQHFMLVADTGSDLLWVKCSGCRDRCL 1327
                    + +PV+SGA++GSGQY VD  +G PPQ  +L+ADTGSDL+WVKCS CR+  L
Sbjct: 67   --------VKSPVVSGASSGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSL 118

Query: 1326 RKPGTAFFARRSRTFSPLHCYAPACELVPGPPDAT-CNATREHSSCMYEYVYADLSDSTG 1150
              PGT FF R S TFSP HCY P C LVP P  A  CN TR HS+C YEY YAD S ++G
Sbjct: 119  HSPGTVFFPRHSSTFSPAHCYDPICRLVPEPGRAPKCNHTRIHSTCPYEYAYADGSLTSG 178

Query: 1149 VFARDTATLNASDGSLVRVRGVAFGCGMHSTXXXXXXXXXXSAHGVMGLGKGAISFASQI 970
            +FAR+T TL  S G    ++ VAFGCG   +           AHGVMGLG+G ISFASQ+
Sbjct: 179  LFARETTTLKTSSGREAYLKSVAFGCGFRISGQSVSGTSFNGAHGVMGLGRGPISFASQL 238

Query: 969  GRKVGDKFSYCLVDYTASPPRSSYLFIGHHGAIHRS-----LHYTPLIHNKFAETFYYLG 805
            GR+ G+KFSYCL+DYT SPP +SYL IG  G   RS     L +TPL+ N  + TFYY+ 
Sbjct: 239  GRRFGNKFSYCLMDYTLSPPPTSYLIIGDGGGGVRSDAVSKLSFTPLLTNPLSPTFYYVR 298

Query: 804  VEKLWIGDRVLRLPAKIWEIDLHGNGGTIIDSGTTLTFLAGPAYVAVLKAYEKSVKYPKV 625
            ++ +++    LR+   +WEID  GNGGT++DSGTTL FLA PAY +V+ A  + ++ P  
Sbjct: 299  LKSIFVNGAKLRIDPSVWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRIRLPIA 358

Query: 624  SHAFEDFDLCYNVSGIRRVH--FPRFRISLKGNVRFEPPTSNYFINAAEDVRCLALRGVS 451
            +     FDLC N+SG+ +     PR +  L G   F PP  NYFI   E ++CLA++ V+
Sbjct: 359  AEVTPGFDLCVNISGVSKPEKIMPRLKFELAGGALFVPPPRNYFIETEEQIQCLAIQSVN 418

Query: 450  SRSGFSIIGNLMQQNFYIVYDRERSRLGFSQTDCA 346
             + GFS+IGNLMQQ F   +DR+RSRLGFS+  CA
Sbjct: 419  PKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 453


>XP_009767089.1 PREDICTED: aspartic proteinase nepenthesin-1 [Nicotiana sylvestris]
          Length = 448

 Score =  411 bits (1057), Expect = e-135
 Identities = 198/389 (50%), Positives = 263/389 (67%), Gaps = 1/389 (0%)
 Frame = -3

Query: 1512 SSLAFRGHGNILAPVISGAAAGSGQYIVDFTVGTPPQHFMLVADTGSDLLWVKCSGCRDR 1333
            SSL  R   +   P+ SGA++GSGQY VD  +GTPPQ  +LVADTGSDL+WV CS CR+ 
Sbjct: 57   SSLNHRSIRSAKLPLTSGASSGSGQYFVDLKLGTPPQRLLLVADTGSDLVWVTCSACRNC 116

Query: 1332 CLRKPGTAFFARRSRTFSPLHCYAPACELVPGPPDATCNATREHSSCMYEYVYADLSDST 1153
              R+ G+AF AR S T+ P HCY   C LVP P    CN TR+HS C Y Y Y+D S++ 
Sbjct: 117  SSRRRGSAFLARHSSTYFPFHCYDKKCRLVPNPRGVACNLTRQHSPCRYVYSYSDESETR 176

Query: 1152 GVFARDTATLNASDGSLVRVRGVAFGCGMHSTXXXXXXXXXXSAHGVMGLGKGAISFASQ 973
            G F+ +T TLNAS GS V+ +  AFGC   +T           A GVMGLG+G+IS ASQ
Sbjct: 177  GFFSTETTTLNASSGSAVKFKKFAFGCSFEATGPSITGPSFNGAQGVMGLGRGSISLASQ 236

Query: 972  IGRKVGDKFSYCLVDYTASPPRSSYLFIGHHGAIHRS-LHYTPLIHNKFAETFYYLGVEK 796
            +GR+ G+KFSYCL+DYT SP  +SYL IG    ++ S + YTP+I+N F  TFYY+G+E 
Sbjct: 237  LGRRFGNKFSYCLMDYTLSPTPTSYLLIGRSAQVNDSKMSYTPMINNPFTSTFYYIGIES 296

Query: 795  LWIGDRVLRLPAKIWEIDLHGNGGTIIDSGTTLTFLAGPAYVAVLKAYEKSVKYPKVSHA 616
            ++I    L++   +W ID  GNGGT++DSGTTLTFLA PAY  +++ +++ V+ P+V+  
Sbjct: 297  VYIEHVKLQINPSVWAIDELGNGGTVMDSGTTLTFLAEPAYRRIVREFKRLVRLPEVNDP 356

Query: 615  FEDFDLCYNVSGIRRVHFPRFRISLKGNVRFEPPTSNYFINAAEDVRCLALRGVSSRSGF 436
               FDLC NVSG+ R  FP+    L G+    PP  NYFI+ AEDV+CLAL+ +++ SGF
Sbjct: 357  TLGFDLCVNVSGVSRPSFPKMSFKLSGDSVLSPPPGNYFIDTAEDVKCLALQPLAAPSGF 416

Query: 435  SIIGNLMQQNFYIVYDRERSRLGFSQTDC 349
            S+IGNLMQQ F   +DR+RSR+GF++  C
Sbjct: 417  SVIGNLMQQGFVFEFDRDRSRIGFTRHGC 445


>XP_006291121.1 hypothetical protein CARUB_v10017234mg [Capsella rubella] EOA24019.1
            hypothetical protein CARUB_v10017234mg [Capsella rubella]
          Length = 452

 Score =  410 bits (1055), Expect = e-134
 Identities = 215/463 (46%), Positives = 279/463 (60%), Gaps = 8/463 (1%)
 Frame = -3

Query: 1710 AITTKHAALKLKLIHRHSPDSPFRKPYANRREHLGDLIRDDERRLHGFQKLLDRNLPVPH 1531
            A++  H  LKL L+ +    SPF  P           +  D RRLH    L  R  P+P 
Sbjct: 19   AVSNDHKYLKLPLLRK----SPFPSPT--------QALALDTRRLHF---LALRRKPIPF 63

Query: 1530 DDLPTSSSLAFRGHGNILAPVISGAAAGSGQYIVDFTVGTPPQHFMLVADTGSDLLWVKC 1351
                            + +PV+SGAA+GSGQY VD  +G PPQ  +L+ADTGSDL+WVKC
Sbjct: 64   ----------------VKSPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKC 107

Query: 1350 SGCRDRCLRKPGTAFFARRSRTFSPLHCYAPACELVPGPPDAT-CNATREHSSCMYEYVY 1174
            S CR+     P T FF R S TFSP HCY P C LVP P  A  CN TR HS+C YEY Y
Sbjct: 108  SACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPQPSRAPKCNHTRIHSTCHYEYGY 167

Query: 1173 ADLSDSTGVFARDTATLNASDGSLVRVRGVAFGCGMHSTXXXXXXXXXXSAHGVMGLGKG 994
            AD S ++G+F R+T +L  S G   +++ VAFGCG   +           AHGVMGLG+G
Sbjct: 168  ADGSLTSGLFGRETTSLKTSSGKEAKLKNVAFGCGFRISGQSVSGASFNGAHGVMGLGRG 227

Query: 993  AISFASQIGRKVGDKFSYCLVDYTASPPRSSYLFIGHHGAIHR-----SLHYTPLIHNKF 829
             ISFASQ+GR+ G+KFSYCL+DYT SPP +SYL IG  G   R      L +TPL+ N F
Sbjct: 228  PISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGDGGGGERINAVSKLLFTPLLTNPF 287

Query: 828  AETFYYLGVEKLWIGDRVLRLPAKIWEIDLHGNGGTIIDSGTTLTFLAGPAYVAVLKAYE 649
            + TFYY  ++ + +    LR+   +WEID  GNGGT++DSGT+L+FLA PAY  VL A+ 
Sbjct: 288  SPTFYYAKLKSISVNGAKLRIDPSVWEIDDSGNGGTVVDSGTSLSFLADPAYRLVLAAFR 347

Query: 648  KSVKYPKVSHAFEDFDLCYNVSGIRRVH--FPRFRISLKGNVRFEPPTSNYFINAAEDVR 475
            + +K P        FDLC+N+SG+ +    +PR +    G   F PP  NYF +  E ++
Sbjct: 348  RRIKLPNADELPPGFDLCFNISGVSKPEKFYPRLKFEFSGGAVFVPPPRNYFTDTEEQIQ 407

Query: 474  CLALRGVSSRSGFSIIGNLMQQNFYIVYDRERSRLGFSQTDCA 346
            CLA++ V+ + GFS+IGNLMQQ F   +DR+RSRLGFS+  CA
Sbjct: 408  CLAIQSVNPKDGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 450


>XP_016506989.1 PREDICTED: aspartyl protease family protein 2-like [Nicotiana
            tabacum]
          Length = 448

 Score =  410 bits (1053), Expect = e-134
 Identities = 197/389 (50%), Positives = 262/389 (67%), Gaps = 1/389 (0%)
 Frame = -3

Query: 1512 SSLAFRGHGNILAPVISGAAAGSGQYIVDFTVGTPPQHFMLVADTGSDLLWVKCSGCRDR 1333
            SSL  R   +   P+ SGA++GSGQY VD  +GTPPQ  +LVADTGSDL+WV CS CR+ 
Sbjct: 57   SSLNHRSIRSAKLPLTSGASSGSGQYFVDLKLGTPPQRLLLVADTGSDLVWVTCSACRNC 116

Query: 1332 CLRKPGTAFFARRSRTFSPLHCYAPACELVPGPPDATCNATREHSSCMYEYVYADLSDST 1153
              R+ G+AF AR S T+ P HCY   C LVP P    CN TR+HS C Y Y Y+D S++ 
Sbjct: 117  SSRRRGSAFLARHSSTYFPFHCYDKKCRLVPNPRGVACNLTRQHSPCRYVYSYSDESETR 176

Query: 1152 GVFARDTATLNASDGSLVRVRGVAFGCGMHSTXXXXXXXXXXSAHGVMGLGKGAISFASQ 973
            G F+ +T TLNAS GS V+ +  AFGC   +T           A GVMGLG+G+IS ASQ
Sbjct: 177  GFFSTETTTLNASSGSAVKFKKFAFGCSFEATGPSITGPSFNGAQGVMGLGRGSISLASQ 236

Query: 972  IGRKVGDKFSYCLVDYTASPPRSSYLFIGHHGAIHRS-LHYTPLIHNKFAETFYYLGVEK 796
            +GR+ G+KFSYC +DYT SP  +SYL IG    ++ S + YTP+I+N F  TFYY+G+E 
Sbjct: 237  LGRRFGNKFSYCFMDYTLSPTPTSYLLIGRSAQVNDSKMSYTPMINNPFTSTFYYIGIES 296

Query: 795  LWIGDRVLRLPAKIWEIDLHGNGGTIIDSGTTLTFLAGPAYVAVLKAYEKSVKYPKVSHA 616
            ++I    L++   +W ID  GNGGT++DSGTTLTFLA PAY  +++ +++ V+ P+V+  
Sbjct: 297  VYIEHVKLQINPSVWAIDELGNGGTVMDSGTTLTFLAEPAYRRIVREFKRLVRLPEVNDP 356

Query: 615  FEDFDLCYNVSGIRRVHFPRFRISLKGNVRFEPPTSNYFINAAEDVRCLALRGVSSRSGF 436
               FDLC NVSG+ R  FP+    L G+    PP  NYFI+ AEDV+CLAL+ +++ SGF
Sbjct: 357  TLGFDLCVNVSGVSRPSFPKMSFKLSGDSVLSPPPGNYFIDTAEDVKCLALQPLAAPSGF 416

Query: 435  SIIGNLMQQNFYIVYDRERSRLGFSQTDC 349
            S+IGNLMQQ F   +DR+RSR+GF++  C
Sbjct: 417  SVIGNLMQQGFVFEFDRDRSRIGFTRHGC 445


>XP_015076367.1 PREDICTED: aspartic proteinase CDR1 [Solanum pennellii]
          Length = 456

 Score =  410 bits (1053), Expect = e-134
 Identities = 208/419 (49%), Positives = 268/419 (63%), Gaps = 20/419 (4%)
 Frame = -3

Query: 1545 LPVPHDD-LPTSSSLAFR-------------GHGNILA----PVISGAAAGSGQYIVDFT 1420
            LP+ H D LPT+ S +               GH +I      P+ SGA  GSGQY VD  
Sbjct: 35   LPLLHKDTLPTTPSQSLSSDIHRLNTLYSSLGHRSITRSAKLPLTSGATTGSGQYFVDLR 94

Query: 1419 VGTPPQHFMLVADTGSDLLWVKCSGCRDRCLRKPGTAFFARRSRTFSPLHCYAPACELVP 1240
            +GTPPQ  +LVADTGSDL+WV CS CR+   R   +AF AR S T+ P HCY   C LVP
Sbjct: 95   LGTPPQRLLLVADTGSDLVWVSCSACRNCSSRPRNSAFLARHSSTYLPYHCYDKKCRLVP 154

Query: 1239 GPPDATCNATREHSSCMYEYVYADLSDSTGVFARDTATLNASDGSLVRVRGVAFGCGMHS 1060
             P    CN TR HS C YEY Y+D S++ G F+ +T TLNAS G  V+ R +AFGC   +
Sbjct: 155  NPTGVACNHTRLHSPCRYEYSYSDGSETKGFFSTETTTLNASSGRPVKFRNLAFGCSFEA 214

Query: 1059 TXXXXXXXXXXSAHGVMGLGKGAISFASQIGRKVGDKFSYCLVDYTASPPRSSYLFIGHH 880
            +           A GVMGLG+G+IS ASQ+GR+ G+KFSYCL+DYT SP  +SYL IG  
Sbjct: 215  SGPSIAGPSFNGAQGVMGLGRGSISLASQLGRRFGNKFSYCLMDYTLSPTPTSYLLIGRS 274

Query: 879  GAIH--RSLHYTPLIHNKFAETFYYLGVEKLWIGDRVLRLPAKIWEIDLHGNGGTIIDSG 706
             A++  + + YTP+I N F  TFYY+G+E ++I D  L +   +WEID  GNGGT++DSG
Sbjct: 275  TAVNDPKKMKYTPMISNPFTSTFYYIGIESVYIEDVKLPIRPSVWEIDELGNGGTVMDSG 334

Query: 705  TTLTFLAGPAYVAVLKAYEKSVKYPKVSHAFEDFDLCYNVSGIRRVHFPRFRISLKGNVR 526
            TTLTFLA PAY  +++A+++ V  P+       FDLC NVSG  R  FP+    L GN  
Sbjct: 335  TTLTFLAEPAYRRIVQAFKRLVTLPEADEPTVGFDLCVNVSGESRPSFPKMSFKLSGNSI 394

Query: 525  FEPPTSNYFINAAEDVRCLALRGVSSRSGFSIIGNLMQQNFYIVYDRERSRLGFSQTDC 349
              PP+ NYFI+ AEDV+CLAL+ +++ SGFS+IGNLMQQ F   +DR+RSR+GFS+  C
Sbjct: 395  LSPPSGNYFIDTAEDVKCLALQPLTAPSGFSVIGNLMQQGFMFEFDRDRSRIGFSRHGC 453


>XP_004238970.1 PREDICTED: aspartyl protease family protein 2 [Solanum lycopersicum]
          Length = 453

 Score =  409 bits (1052), Expect = e-134
 Identities = 213/454 (46%), Positives = 279/454 (61%), Gaps = 3/454 (0%)
 Frame = -3

Query: 1701 TKHAALKLKLIHRHS-PDSPFRKPYANRREHLGDLIRDDERRLHGFQKLLDRNLPVPHDD 1525
            TK   LKL L+H+ + P +P +             +  D  RL+     L       H  
Sbjct: 25   TKFEYLKLPLLHKDTFPTTPSQS------------LSSDIHRLNTLYSSLG------HRS 66

Query: 1524 LPTSSSLAFRGHGNILAPVISGAAAGSGQYIVDFTVGTPPQHFMLVADTGSDLLWVKCSG 1345
            +  S+ L          P+ SGA  GSGQY VD  +GTPPQ  +LVADTGSDL+WV CS 
Sbjct: 67   ITRSAKL----------PLTSGATTGSGQYFVDLRLGTPPQRLLLVADTGSDLVWVSCSA 116

Query: 1344 CRDRCLRKPGTAFFARRSRTFSPLHCYAPACELVPGPPDATCNATREHSSCMYEYVYADL 1165
            CR+   R   +AF AR S T+ P HCY   C LVP P    CN TR HS C YEY Y+D 
Sbjct: 117  CRNCSSRPRNSAFLARHSSTYLPYHCYDKKCRLVPNPTGVACNHTRLHSPCRYEYSYSDG 176

Query: 1164 SDSTGVFARDTATLNASDGSLVRVRGVAFGCGMHSTXXXXXXXXXXSAHGVMGLGKGAIS 985
            S++ G F+ +T TLNAS G  V+ R +AFGC   ++           A GVMGLG+G+IS
Sbjct: 177  SETKGFFSTETTTLNASSGRPVKFRNLAFGCSFEASGPSIAGPSFNGAQGVMGLGRGSIS 236

Query: 984  FASQIGRKVGDKFSYCLVDYTASPPRSSYLFIGHHGAIH--RSLHYTPLIHNKFAETFYY 811
             ASQ+GR+ G+KFSYCL+DYT SP  +SYL IG   A++  + ++YTP+I N F  TFYY
Sbjct: 237  LASQLGRRFGNKFSYCLMDYTLSPTPTSYLLIGRSTAVNDPKKMNYTPMISNPFTSTFYY 296

Query: 810  LGVEKLWIGDRVLRLPAKIWEIDLHGNGGTIIDSGTTLTFLAGPAYVAVLKAYEKSVKYP 631
            +G+E ++I D  L +   +WEID  GNGGT++DSGTTLTFLA PAY  +++A+++ V  P
Sbjct: 297  IGIESVYIEDVKLPIRPSVWEIDELGNGGTVMDSGTTLTFLAEPAYRRIVQAFKRLVTLP 356

Query: 630  KVSHAFEDFDLCYNVSGIRRVHFPRFRISLKGNVRFEPPTSNYFINAAEDVRCLALRGVS 451
            +       FDLC NVSG  R  FP+    L GN    PP+ NYFI+ AEDV+CLAL+ ++
Sbjct: 357  EADEPTVGFDLCVNVSGESRPSFPKMSFKLSGNSILSPPSGNYFIDTAEDVKCLALQPLT 416

Query: 450  SRSGFSIIGNLMQQNFYIVYDRERSRLGFSQTDC 349
            + SGFS+IGNLMQQ F   +DR+RSR+GFS+  C
Sbjct: 417  APSGFSVIGNLMQQGFMFEFDRDRSRIGFSRHGC 450


>OAP05619.1 hypothetical protein AXX17_AT3G27700 [Arabidopsis thaliana]
          Length = 452

 Score =  409 bits (1051), Expect = e-134
 Identities = 219/462 (47%), Positives = 279/462 (60%), Gaps = 3/462 (0%)
 Frame = -3

Query: 1722 STAEAITTKHAALKLKLIHRHSPDSPFRKPYANRREHLGDLIRDDERRLHGFQKLLDRNL 1543
            S   A++  +  LKL L+ +    SPF  P           +  D RRLH F  L  R  
Sbjct: 20   SNIAAVSNHNKYLKLPLLRK----SPFPSPT--------QALALDTRRLH-FLSL--RRK 64

Query: 1542 PVPHDDLPTSSSLAFRGHGNILAPVISGAAAGSGQYIVDFTVGTPPQHFMLVADTGSDLL 1363
            P+P                 + +PV+SGAA+GSGQY VD  +G PPQ  +L+ADTGSDL+
Sbjct: 65   PIPF----------------VKSPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLV 108

Query: 1362 WVKCSGCRDRCLRKPGTAFFARRSRTFSPLHCYAPACELVPGPPDAT-CNATREHSSCMY 1186
            WVKCS CR+     P T FF R S TFSP HCY P C LVP P  A  CN TR HS+C Y
Sbjct: 109  WVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHY 168

Query: 1185 EYVYADLSDSTGVFARDTATLNASDGSLVRVRGVAFGCGMHSTXXXXXXXXXXSAHGVMG 1006
            EY YAD S ++G+FAR+T +L  S G   R++ VAFGCG   +           A+GVMG
Sbjct: 169  EYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMG 228

Query: 1005 LGKGAISFASQIGRKVGDKFSYCLVDYTASPPRSSYLFIGHHGAIHRSLHYTPLIHNKFA 826
            LG+G ISFASQ+GR+ G+KFSYCL+DYT SPP +SYL IG+ G     L +TPL+ N  +
Sbjct: 229  LGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGNGGDGVSKLFFTPLLTNPLS 288

Query: 825  ETFYYLGVEKLWIGDRVLRLPAKIWEIDLHGNGGTIIDSGTTLTFLAGPAYVAVLKAYEK 646
             TFYY+ ++ +++    LR+   IWEID  GNGGT++DSGTTL FLA PAY +V+ A  +
Sbjct: 289  PTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRR 348

Query: 645  SVKYPKVSHAFEDFDLCYNVSGIRRVH--FPRFRISLKGNVRFEPPTSNYFINAAEDVRC 472
             VK P        FDLC NVSG+ +     PR +    G   F PP  NYFI   E ++C
Sbjct: 349  RVKLPIADALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQC 408

Query: 471  LALRGVSSRSGFSIIGNLMQQNFYIVYDRERSRLGFSQTDCA 346
            LA++ V  + GFS+IGNLMQQ F   +DR+RSRLGFS+  CA
Sbjct: 409  LAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 450


>NP_189198.1 Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
            BAB03090.1 chloroplast nucleoid DNA binding protein-like;
            nucellin-like protein [Arabidopsis thaliana] ACD89063.1
            At3g25700 [Arabidopsis thaliana] AEE77054.1 Eukaryotic
            aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score =  409 bits (1051), Expect = e-134
 Identities = 219/462 (47%), Positives = 279/462 (60%), Gaps = 3/462 (0%)
 Frame = -3

Query: 1722 STAEAITTKHAALKLKLIHRHSPDSPFRKPYANRREHLGDLIRDDERRLHGFQKLLDRNL 1543
            S   A++  +  LKL L+ +    SPF  P           +  D RRLH F  L  R  
Sbjct: 20   SNIAAVSNHNKYLKLPLLRK----SPFPSPT--------QALALDTRRLH-FLSL--RRK 64

Query: 1542 PVPHDDLPTSSSLAFRGHGNILAPVISGAAAGSGQYIVDFTVGTPPQHFMLVADTGSDLL 1363
            P+P                 + +PV+SGAA+GSGQY VD  +G PPQ  +L+ADTGSDL+
Sbjct: 65   PIPF----------------VKSPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLV 108

Query: 1362 WVKCSGCRDRCLRKPGTAFFARRSRTFSPLHCYAPACELVPGPPDAT-CNATREHSSCMY 1186
            WVKCS CR+     P T FF R S TFSP HCY P C LVP P  A  CN TR HS+C Y
Sbjct: 109  WVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHY 168

Query: 1185 EYVYADLSDSTGVFARDTATLNASDGSLVRVRGVAFGCGMHSTXXXXXXXXXXSAHGVMG 1006
            EY YAD S ++G+FAR+T +L  S G   R++ VAFGCG   +           A+GVMG
Sbjct: 169  EYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMG 228

Query: 1005 LGKGAISFASQIGRKVGDKFSYCLVDYTASPPRSSYLFIGHHGAIHRSLHYTPLIHNKFA 826
            LG+G ISFASQ+GR+ G+KFSYCL+DYT SPP +SYL IG+ G     L +TPL+ N  +
Sbjct: 229  LGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGNGGDGISKLFFTPLLTNPLS 288

Query: 825  ETFYYLGVEKLWIGDRVLRLPAKIWEIDLHGNGGTIIDSGTTLTFLAGPAYVAVLKAYEK 646
             TFYY+ ++ +++    LR+   IWEID  GNGGT++DSGTTL FLA PAY +V+ A  +
Sbjct: 289  PTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRR 348

Query: 645  SVKYPKVSHAFEDFDLCYNVSGIRRVH--FPRFRISLKGNVRFEPPTSNYFINAAEDVRC 472
             VK P        FDLC NVSG+ +     PR +    G   F PP  NYFI   E ++C
Sbjct: 349  RVKLPIADALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQC 408

Query: 471  LALRGVSSRSGFSIIGNLMQQNFYIVYDRERSRLGFSQTDCA 346
            LA++ V  + GFS+IGNLMQQ F   +DR+RSRLGFS+  CA
Sbjct: 409  LAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 450


>CDP00568.1 unnamed protein product [Coffea canephora]
          Length = 476

 Score =  408 bits (1049), Expect = e-133
 Identities = 200/382 (52%), Positives = 252/382 (65%), Gaps = 6/382 (1%)
 Frame = -3

Query: 1473 PVISGAAAGSGQYIVDFTVGTPPQHFMLVADTGSDLLWVKCSGCRDRCLRKPGTAFFARR 1294
            P+ SGA+ G+GQY V  ++GTPPQ F+LVADTGSDL+WV CS CR+   R P +AF AR 
Sbjct: 93   PLTSGASFGAGQYFVSLSLGTPPQPFLLVADTGSDLIWVTCSACRNCSSRPPNSAFLARH 152

Query: 1293 SRTFSPLHCYAPACELVPGPPDATCNATREHSSCMYEYVYADLSDSTGVFARDTATLNAS 1114
            S TFSP HCY   C+LVP P    CN TR HS+C YEY Y+D S S+G+F+R+T T N S
Sbjct: 153  STTFSPSHCYDSVCQLVPHPHRVPCNHTRRHSTCRYEYSYSDGSLSSGIFSRETTTFNTS 212

Query: 1113 DGSLVRVRGVAFGCGMHSTXXXXXXXXXXSAHGVMGLGKGAISFASQIGRKVGDKFSYCL 934
             G +V+ R +AFGCG  ++           A GV+GLG G ISF SQ+GRK G+KFSYCL
Sbjct: 213  SGKVVKFRDLAFGCGFRASGPSVTGPSFNGAQGVLGLGLGPISFPSQLGRKFGNKFSYCL 272

Query: 933  VDYTASPPRSSYLFIGHHGAIH------RSLHYTPLIHNKFAETFYYLGVEKLWIGDRVL 772
            +DYT SP  +SYL IG  G           + YTPLI+N  + TFYY+G+E  ++G   L
Sbjct: 273  MDYTLSPTPTSYLLIGGGGGPEDGVVGGAKMSYTPLINNSLSPTFYYIGIEAAYVGGIEL 332

Query: 771  RLPAKIWEIDLHGNGGTIIDSGTTLTFLAGPAYVAVLKAYEKSVKYPKVSHAFEDFDLCY 592
            R+   +W ID  GNGGT++DSGTTLTFL  PAY  VL+ + + VK PK      +FD C 
Sbjct: 333  RISPSVWAIDDLGNGGTVMDSGTTLTFLVKPAYDKVLQEFMRRVKLPKSDRRNPNFDFCV 392

Query: 591  NVSGIRRVHFPRFRISLKGNVRFEPPTSNYFINAAEDVRCLALRGVSSRSGFSIIGNLMQ 412
            NVSG+ R   PR R  L G   F PP  NYFI+ AE+V+CLAL+ V   SGFS+IGN+MQ
Sbjct: 393  NVSGVSRPSLPRLRFKLAGGSMFSPPPQNYFIDTAENVKCLALQPVVQPSGFSLIGNVMQ 452

Query: 411  QNFYIVYDRERSRLGFSQTDCA 346
            Q F   +DR+R RLGF++  CA
Sbjct: 453  QGFMFEFDRDRWRLGFTRRGCA 474


>XP_006362527.1 PREDICTED: aspartic proteinase nepenthesin-1 [Solanum tuberosum]
          Length = 454

 Score =  407 bits (1046), Expect = e-133
 Identities = 196/377 (51%), Positives = 254/377 (67%), Gaps = 2/377 (0%)
 Frame = -3

Query: 1473 PVISGAAAGSGQYIVDFTVGTPPQHFMLVADTGSDLLWVKCSGCRDRCLRKPGTAFFARR 1294
            PV SGA  GSGQY VD  +GTPPQ  +LVADTGSDL+WV CS CR+   R P +AF AR 
Sbjct: 75   PVTSGATTGSGQYFVDLRLGTPPQRLLLVADTGSDLVWVSCSACRNCSSRPPNSAFLARH 134

Query: 1293 SRTFSPLHCYAPACELVPGPPDATCNATREHSSCMYEYVYADLSDSTGVFARDTATLNAS 1114
            S T+ P HCY   C LVP P    CN TR HS C YEY Y+D S++ G F+ +T TLNAS
Sbjct: 135  SSTYFPYHCYDKKCRLVPNPTGVACNHTRLHSPCRYEYSYSDGSETKGFFSTETTTLNAS 194

Query: 1113 DGSLVRVRGVAFGCGMHSTXXXXXXXXXXSAHGVMGLGKGAISFASQIGRKVGDKFSYCL 934
             G  V+ R +AFGC   +T           A GVMGLG+G+IS +SQ+GR+ G+KFSYCL
Sbjct: 195  SGRPVKFRNLAFGCSFEATGPSIAGPSFNGAQGVMGLGRGSISLSSQLGRRFGNKFSYCL 254

Query: 933  VDYTASPPRSSYLFIGHHGAIH--RSLHYTPLIHNKFAETFYYLGVEKLWIGDRVLRLPA 760
            +DYT SP  +SYL IG   A++  + ++YTP+I N F+ TFYY+G+E + I D  L +  
Sbjct: 255  MDYTLSPTPTSYLLIGRSTAVNDPKKMNYTPMISNPFSSTFYYIGIESVHIEDVKLPIRP 314

Query: 759  KIWEIDLHGNGGTIIDSGTTLTFLAGPAYVAVLKAYEKSVKYPKVSHAFEDFDLCYNVSG 580
             +W ID  GNGGT++DSGTTLTFLA PAY  +++A+++ V  P+       FDLC NVSG
Sbjct: 315  SVWAIDELGNGGTVMDSGTTLTFLAEPAYRRIVQAFKRLVTLPEADEPTVGFDLCVNVSG 374

Query: 579  IRRVHFPRFRISLKGNVRFEPPTSNYFINAAEDVRCLALRGVSSRSGFSIIGNLMQQNFY 400
              R  FP+    L GN    PP+ NYFI+ AE+V+CLAL+ +++ SGFS+IGNLMQQ F 
Sbjct: 375  ESRPSFPKMSFKLSGNSILSPPSGNYFIDTAENVKCLALQPLTTPSGFSVIGNLMQQGFM 434

Query: 399  IVYDRERSRLGFSQTDC 349
              +DR++SR+GFS+  C
Sbjct: 435  FEFDRDQSRIGFSRHGC 451


>XP_002875271.1 hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
            lyrata] EFH51530.1 hypothetical protein ARALYDRAFT_484331
            [Arabidopsis lyrata subsp. lyrata]
          Length = 451

 Score =  406 bits (1044), Expect = e-133
 Identities = 216/458 (47%), Positives = 276/458 (60%), Gaps = 3/458 (0%)
 Frame = -3

Query: 1710 AITTKHAALKLKLIHRHSPDSPFRKPYANRREHLGDLIRDDERRLHGFQKLLDRNLPVPH 1531
            A++     LKL L+ +    SPF  P           +  D RRLH F  L  R  PVP 
Sbjct: 23   AVSNDRKYLKLPLLRK----SPFPSPT--------QALALDTRRLH-FLSL--RRKPVPF 67

Query: 1530 DDLPTSSSLAFRGHGNILAPVISGAAAGSGQYIVDFTVGTPPQHFMLVADTGSDLLWVKC 1351
                            + +PV+SGA++GSGQY VD  +G PPQ  +L+ADTGSDL+WVKC
Sbjct: 68   ----------------VKSPVVSGASSGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKC 111

Query: 1350 SGCRDRCLRKPGTAFFARRSRTFSPLHCYAPACELVPGPPDAT-CNATREHSSCMYEYVY 1174
            S CR+     P T FF R S TFSP HCY P C LVP P  A  CN TR HS+C YEY Y
Sbjct: 112  SACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGY 171

Query: 1173 ADLSDSTGVFARDTATLNASDGSLVRVRGVAFGCGMHSTXXXXXXXXXXSAHGVMGLGKG 994
            AD S ++G+FAR+T +L  S G   +++ VAFGCG   +           A+GVMGLG+G
Sbjct: 172  ADGSLTSGLFARETTSLKTSSGKEAKLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRG 231

Query: 993  AISFASQIGRKVGDKFSYCLVDYTASPPRSSYLFIGHHGAIHRSLHYTPLIHNKFAETFY 814
             ISFASQ+GR+ G+KFSYCL+DYT SPP +SYL IG  G     L +TPL+ N  + TFY
Sbjct: 232  PISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTFY 291

Query: 813  YLGVEKLWIGDRVLRLPAKIWEIDLHGNGGTIIDSGTTLTFLAGPAYVAVLKAYEKSVKY 634
            Y+ ++ +++    LR+   IWEID  GNGGT++DSGTTL FLA PAY  V+ A ++ +K 
Sbjct: 292  YVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAAVKQRIKL 351

Query: 633  PKVSHAFEDFDLCYNVSGIRRVH--FPRFRISLKGNVRFEPPTSNYFINAAEDVRCLALR 460
            P        FDLC NVSG+ +     PR +    G   F PP  NYFI   E ++CLA++
Sbjct: 352  PNADELTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQ 411

Query: 459  GVSSRSGFSIIGNLMQQNFYIVYDRERSRLGFSQTDCA 346
             V  + GFS+IGNLMQQ F   +DR+RSRLGFS+  CA
Sbjct: 412  SVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 449


>XP_010514187.1 PREDICTED: aspartyl protease family protein 2-like [Camelina sativa]
          Length = 454

 Score =  406 bits (1043), Expect = e-133
 Identities = 221/471 (46%), Positives = 280/471 (59%), Gaps = 11/471 (2%)
 Frame = -3

Query: 1725 TSTAEAITTKHAALKLKLIHRHSPDSPFRKPYANRREHLGDLIRDDERRLHGFQKLLDRN 1546
            T+   A++     LKL L+ +    SPF  P           +  D RRLH F  L  R 
Sbjct: 14   TANLAAVSNDGKYLKLPLLRK----SPFPSPT--------QALALDTRRLH-FLSL--RR 58

Query: 1545 LPVPHDDLPTSSSLAFRGHGNILAPVISGAAAGSGQYIVDFTVGTPPQHFMLVADTGSDL 1366
             P+P                 I +PV+SGA++GSGQY VD  +G PPQ  +L+ADTGSDL
Sbjct: 59   KPIPF----------------IKSPVVSGASSGSGQYFVDLRIGQPPQSLLLIADTGSDL 102

Query: 1365 LWVKCSGCRDRCLRKPGTAFFARRSRTFSPLHCYAPACELVPGPPDAT-CNATREHSSCM 1189
            +WVKCS CR+     P T FF R S TFSP HCY P C LVP P  A  CN TR HS+C 
Sbjct: 103  VWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPQPGRAPKCNHTRIHSTCH 162

Query: 1188 YEYVYADLSDSTGVFARDTATLNASDGSLVRVRGVAFGCGMHSTXXXXXXXXXXSAHGVM 1009
            YEY YAD S ++G+F R+T +L  S G   +++ VAFGCG   +           AHGVM
Sbjct: 163  YEYGYADGSLTSGLFGRETTSLKTSSGKEAKLKNVAFGCGFRISGQSVSGTSFNGAHGVM 222

Query: 1008 GLGKGAISFASQIGRKVGDKFSYCLVDYTASPPRSSYLFIGHHG--------AIHRSLHY 853
            GLG+G ISFASQ+GR+ G+KFSYCL+DYT SPP +SYL IG  G        A+ + L +
Sbjct: 223  GLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGDGGGRGGEQINAVSKLL-F 281

Query: 852  TPLIHNKFAETFYYLGVEKLWIGDRVLRLPAKIWEIDLHGNGGTIIDSGTTLTFLAGPAY 673
            TPL+ N F+ TFYY+ +  + +    LR+   IWEID  GNGGT++DSGTTL FLA PAY
Sbjct: 282  TPLLTNTFSPTFYYVKLRSVSVNGAKLRIDPSIWEIDSSGNGGTVVDSGTTLAFLADPAY 341

Query: 672  VAVLKAYEKSVKYPKVSHAFEDFDLCYNVSGIRRVH--FPRFRISLKGNVRFEPPTSNYF 499
              VL A  + +K P        FDLC NVSG+ +     PR +    G   F PP  NYF
Sbjct: 342  RLVLAAIRRRIKLPNADELTPGFDLCLNVSGVSKPEKLLPRLKFEFSGGAVFVPPPRNYF 401

Query: 498  INAAEDVRCLALRGVSSRSGFSIIGNLMQQNFYIVYDRERSRLGFSQTDCA 346
            I   E+V+CLA++ V+ + GFS+IGNLMQQ F   +DR+RSRLGFS+  CA
Sbjct: 402  IETEEEVQCLAIQSVNPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 452


>JAU72320.1 Aspartic proteinase nepenthesin-2 [Noccaea caerulescens] JAU92564.1
            Aspartic proteinase nepenthesin-2 [Noccaea caerulescens]
          Length = 454

 Score =  405 bits (1041), Expect = e-132
 Identities = 217/463 (46%), Positives = 279/463 (60%), Gaps = 8/463 (1%)
 Frame = -3

Query: 1710 AITTKHAALKLKLIHRHSPDSPFRKPYANRREHLGDLIRDDERRLHGFQKLLDRNLPVPH 1531
            A  T    LKL L+ +    SPF  P           +  D RRLH F  L  + +P   
Sbjct: 22   AAVTDDEYLKLPLLRK----SPFPSPT--------QALALDTRRLH-FLSLRRKTVPF-- 66

Query: 1530 DDLPTSSSLAFRGHGNILAPVISGAAAGSGQYIVDFTVGTPPQHFMLVADTGSDLLWVKC 1351
                            + +PV+SGA++GSGQY VD  +G PPQ  +L+ADTGSDL+WVKC
Sbjct: 67   ----------------VKSPVVSGASSGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKC 110

Query: 1350 SGCRDRCLRKPGTAFFARRSRTFSPLHCYAPACELVPGPPDAT-CNATREHSSCMYEYVY 1174
            S CR+  L  P T FF R S TFSP HCY P C LVP P  A  CN TR HS+C YEY Y
Sbjct: 111  SACRNCSLHSPATVFFPRHSSTFSPTHCYDPICRLVPKPSRAPKCNHTRIHSTCPYEYAY 170

Query: 1173 ADLSDSTGVFARDTATLNASDGSLVRVRGVAFGCGMHSTXXXXXXXXXXSAHGVMGLGKG 994
            AD S ++G+FAR+T TL  S G    ++ VAFGCG   +           AHGVMGLG+G
Sbjct: 171  ADGSLTSGLFARETTTLKTSSGREANLKSVAFGCGFRISGQSVSGTSFNGAHGVMGLGRG 230

Query: 993  AISFASQIGRKVGDKFSYCLVDYTASPPRSSYLFIGHHGAIHRS-----LHYTPLIHNKF 829
             ISFASQ+GR+ G+KFSYCL+DYT SPP +SYL IG  G  +RS     L++TPL+ N  
Sbjct: 231  PISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGDGGG-NRSNAVSKLYFTPLLANPL 289

Query: 828  AETFYYLGVEKLWIGDRVLRLPAKIWEIDLHGNGGTIIDSGTTLTFLAGPAYVAVLKAYE 649
            + TFYY+ ++ +++    LR+   IWEID  GNGGT++DSGTTL FLA PAY  ++ A  
Sbjct: 290  SPTFYYVRLKSVFVNGAKLRIDRSIWEIDGLGNGGTVVDSGTTLAFLADPAYRLLIAAVR 349

Query: 648  KSVKYPKVSHAFEDFDLCYNVSGIRRVH--FPRFRISLKGNVRFEPPTSNYFINAAEDVR 475
            + ++ P  +     F+LC NVSGI +     PR +    G   F PP  NYFI   E ++
Sbjct: 350  RRIRLPMAAELTPGFELCVNVSGISKPEKIMPRLKFEFSGGAVFVPPPRNYFIETEEQIQ 409

Query: 474  CLALRGVSSRSGFSIIGNLMQQNFYIVYDRERSRLGFSQTDCA 346
            CLA++ V+ + GFS+IGNLMQQ F   +DR+RSRLGFS+  CA
Sbjct: 410  CLAIQSVNPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 452


>JAU21452.1 Aspartic proteinase nepenthesin-2, partial [Noccaea caerulescens]
          Length = 485

 Score =  406 bits (1043), Expect = e-132
 Identities = 217/463 (46%), Positives = 280/463 (60%), Gaps = 8/463 (1%)
 Frame = -3

Query: 1710 AITTKHAALKLKLIHRHSPDSPFRKPYANRREHLGDLIRDDERRLHGFQKLLDRNLPVPH 1531
            A  T    LKL L+ +    SPF  P           +  D RRLH F  L  +++P   
Sbjct: 53   AAVTDDEYLKLPLLRK----SPFPSPT--------QALALDTRRLH-FLSLRRKSVPF-- 97

Query: 1530 DDLPTSSSLAFRGHGNILAPVISGAAAGSGQYIVDFTVGTPPQHFMLVADTGSDLLWVKC 1351
                            + +PV+SGA++GSGQY VD  +G PPQ  +L+ADTGSDL+WVKC
Sbjct: 98   ----------------VKSPVVSGASSGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKC 141

Query: 1350 SGCRDRCLRKPGTAFFARRSRTFSPLHCYAPACELVPGPPDAT-CNATREHSSCMYEYVY 1174
            S CR+  L  P T FF R S TFSP HCY P C LVP P  A  CN TR HS+C YEY Y
Sbjct: 142  SACRNCSLHSPATVFFPRHSSTFSPTHCYDPICRLVPKPSRAPKCNHTRIHSTCPYEYAY 201

Query: 1173 ADLSDSTGVFARDTATLNASDGSLVRVRGVAFGCGMHSTXXXXXXXXXXSAHGVMGLGKG 994
            AD S ++G+FAR+T TL  S G    ++ VAFGCG   +           AHGVMGLG+G
Sbjct: 202  ADGSLTSGLFARETTTLKTSSGGEANLKNVAFGCGFRISGQSVSGTSFNGAHGVMGLGRG 261

Query: 993  AISFASQIGRKVGDKFSYCLVDYTASPPRSSYLFIGHHGAIHRS-----LHYTPLIHNKF 829
             ISFASQ+GR+ G+KFSYCL+DYT SPP +SYL IG  G  +RS     L++TPL+ N  
Sbjct: 262  PISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGDGGG-NRSNAVSKLYFTPLLANPL 320

Query: 828  AETFYYLGVEKLWIGDRVLRLPAKIWEIDLHGNGGTIIDSGTTLTFLAGPAYVAVLKAYE 649
            + TFYY+ ++ +++    LR+   IWEID  GNGGT++DSGTTL FLA PAY  ++ A  
Sbjct: 321  SPTFYYVRLKSVFVNGAKLRIDRSIWEIDGLGNGGTVVDSGTTLAFLADPAYRLLIAAVR 380

Query: 648  KSVKYPKVSHAFEDFDLCYNVSGIRRVH--FPRFRISLKGNVRFEPPTSNYFINAAEDVR 475
            + ++ P  +     F+LC NVSGI +     PR +    G   F PP  NYFI   E ++
Sbjct: 381  RRIRLPMAAELTPGFELCVNVSGISKPEKIMPRLKFEFSGGAVFVPPPRNYFIETEEQIQ 440

Query: 474  CLALRGVSSRSGFSIIGNLMQQNFYIVYDRERSRLGFSQTDCA 346
            CLA++ V+ + GFS+IGNLMQQ F   +DR+RSRLGFS+  CA
Sbjct: 441  CLAIQSVNPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 483


>XP_012082020.1 PREDICTED: aspartic proteinase nepenthesin-1 [Jatropha curcas]
            KDP29358.1 hypothetical protein JCGZ_18279 [Jatropha
            curcas]
          Length = 455

 Score =  405 bits (1040), Expect = e-132
 Identities = 217/463 (46%), Positives = 285/463 (61%), Gaps = 4/463 (0%)
 Frame = -3

Query: 1722 STAEAITTKHAALKLKLIHRHSPDSPFRKPYANRREHLGDLIRDDERRLHGFQKLLDRNL 1543
            +T  +  TK   LKL L+HR    +PF+ P           +  D RRL           
Sbjct: 26   TTVNSTATKEY-LKLPLLHR----TPFKSP--------AQALPFDIRRL----------- 61

Query: 1542 PVPHDDLPTSSSLAFRGHGNILAPVISGAAAGSGQYIVDFTVGTPPQHFMLVADTGSDLL 1363
                       SL  R   ++ +PVISGA+ GSGQY V   +G+P Q  +LVADTGSDL+
Sbjct: 62   -----------SLLHRQRTSLKSPVISGASTGSGQYFVSLRLGSPAQTLLLVADTGSDLV 110

Query: 1362 WVKCSGCRDRCLRKPGTAFFARRSRTFSPLHCYAPACELVPGPPDATCNATREHSSCMYE 1183
            WVKCS C++     PG+AF AR S TFS +HC+   C LVP P    CN TR HS C YE
Sbjct: 111  WVKCSACKNCSNYSPGSAFLARHSSTFSLIHCFNSQCRLVPHPRPNPCNRTRLHSPCRYE 170

Query: 1182 YVYADLSDSTGVFARDTATLNASDGSLVRVRGVAFGCGMHSTXXXXXXXXXXSAHGVMGL 1003
            Y YAD S ++G F+++T TLN S G   +++ +AFGCG   +           AHGV+GL
Sbjct: 171  YSYADGSSTSGFFSKETTTLNTSAGREKKLKNLAFGCGFRISGPSLTGASFAGAHGVIGL 230

Query: 1002 GKGAISFASQIGRKVGDKFSYCLVDYTASPPRSSYLFIGHH--GAIHRS--LHYTPLIHN 835
            G+  ISF+SQ+GR+ G+KFSYCL+DYT SPP +SYL IG H   A+ R   L++TPL+ N
Sbjct: 231  GRAPISFSSQLGRRFGNKFSYCLMDYTLSPPPTSYLMIGGHQNSAVSRKRILNFTPLLVN 290

Query: 834  KFAETFYYLGVEKLWIGDRVLRLPAKIWEIDLHGNGGTIIDSGTTLTFLAGPAYVAVLKA 655
              + TFYY+G++ + +    L +   +W ID  GNGGTIIDSGTTLTFL  PAY  +L A
Sbjct: 291  SLSPTFYYIGIKSVSVDGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFLVEPAYREILSA 350

Query: 654  YEKSVKYPKVSHAFEDFDLCYNVSGIRRVHFPRFRISLKGNVRFEPPTSNYFINAAEDVR 475
             ++ VK P        FDLC NVSG+RR  FPR  + L GN  F PP  NYFI+ +E V+
Sbjct: 351  IKRRVKLPGPGELTPGFDLCVNVSGVRRPVFPRMSLELAGNSVFSPPPRNYFIDTSEGVK 410

Query: 474  CLALRGVSSRSGFSIIGNLMQQNFYIVYDRERSRLGFSQTDCA 346
            CLA++ V+S SGFS+IGNLMQQ + + +DR+RSRLGF+++ CA
Sbjct: 411  CLAIQPVNSGSGFSVIGNLMQQGYLLEFDRDRSRLGFARSGCA 453


>XP_013616585.1 PREDICTED: aspartic proteinase nepenthesin-2-like [Brassica oleracea
            var. oleracea]
          Length = 451

 Score =  404 bits (1039), Expect = e-132
 Identities = 209/419 (49%), Positives = 263/419 (62%), Gaps = 4/419 (0%)
 Frame = -3

Query: 1590 DERRLHGFQKLLDRNLPVPHDDLPTSSSLAFRGHGNILAPVISGAAAGSGQYIVDFTVGT 1411
            D RRLH F  L  R  P+P                 + +PV+SGA++GSGQY VD  +G 
Sbjct: 50   DTRRLH-FLSL--RRKPIPF----------------VKSPVVSGASSGSGQYFVDLRIGQ 90

Query: 1410 PPQHFMLVADTGSDLLWVKCSGCRDRCLRKPGTAFFARRSRTFSPLHCYAPACELVPGPP 1231
            PPQ  +L+ADTGSDL+WVKCS CR+  L  P T FF R S TFSP HCY P C LVPGP 
Sbjct: 91   PPQSLLLIADTGSDLVWVKCSACRNCSLHSPATVFFPRHSTTFSPTHCYDPLCRLVPGPV 150

Query: 1230 DAT-CNATREHSSCMYEYVYADLSDSTGVFARDTATLNASDGSLVRVRGVAFGCGMHSTX 1054
             A  CN TR HS+C YEY YAD S ++G+FA +T TL  S G    ++ VAFGCG   + 
Sbjct: 151  RALKCNHTRIHSTCHYEYAYADGSLTSGLFATETTTLKTSSGREAYLKSVAFGCGFRISG 210

Query: 1053 XXXXXXXXXSAHGVMGLGKGAISFASQIGRKVGDKFSYCLVDYTASPPRSSYLFIGHHGA 874
                      AHGVMGLG+G ISFASQ+GR+ G+KFSYCL+DYT SPP +SYL IG  G 
Sbjct: 211  QSVSGASFNGAHGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGDGGG 270

Query: 873  IHRS-LHYTPLIHNKFAETFYYLGVEKLWIGDRVLRLPAKIWEIDLHGNGGTIIDSGTTL 697
              RS L +TPL+ N F+ TFYY+ ++ + +    LR+   +WEID  GNGGT++DSGTTL
Sbjct: 271  GARSKLLFTPLLTNPFSPTFYYIRLKSVSVNGAKLRIHPSVWEIDGSGNGGTVVDSGTTL 330

Query: 696  TFLAGPAYVAVLKAYEKSVKYPKVSHAFEDFDLCYNVSGIRRVH--FPRFRISLKGNVRF 523
             FLA P+Y  V+    + ++ P  +     FDLC NVSG+ +     PR +    G   F
Sbjct: 331  AFLADPSYRLVIATVRRRIRLPIAAEMTPGFDLCVNVSGVSKPEKFMPRLKFEFAGGAVF 390

Query: 522  EPPTSNYFINAAEDVRCLALRGVSSRSGFSIIGNLMQQNFYIVYDRERSRLGFSQTDCA 346
             PP  NYFI   E V+CLA++ V+ + GFS+IGNLMQQ F   +DR+RSRLGFS   CA
Sbjct: 391  VPPPRNYFIETEEHVQCLAIQSVNPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSPRGCA 449


>XP_016487000.1 PREDICTED: aspartyl protease family protein 2-like [Nicotiana
            tabacum]
          Length = 453

 Score =  404 bits (1039), Expect = e-132
 Identities = 193/389 (49%), Positives = 261/389 (67%), Gaps = 1/389 (0%)
 Frame = -3

Query: 1512 SSLAFRGHGNILAPVISGAAAGSGQYIVDFTVGTPPQHFMLVADTGSDLLWVKCSGCRDR 1333
            SS+  R   +   P+ SGA++GSGQY VD  +GTPPQ  +LVADTGSDL+WV CS CR+ 
Sbjct: 62   SSVNHRSIRSAKLPLTSGASSGSGQYFVDLKLGTPPQRLLLVADTGSDLVWVTCSACRNC 121

Query: 1332 CLRKPGTAFFARRSRTFSPLHCYAPACELVPGPPDATCNATREHSSCMYEYVYADLSDST 1153
              R+ G+AF AR S T+ P HCY   C LVP P    CN TR+HS C Y Y Y+D S++ 
Sbjct: 122  SSRRRGSAFLARHSSTYFPFHCYDKKCRLVPNPRGVACNHTRQHSPCRYVYSYSDESETR 181

Query: 1152 GVFARDTATLNASDGSLVRVRGVAFGCGMHSTXXXXXXXXXXSAHGVMGLGKGAISFASQ 973
            G F+ +T TLNAS GS V+ +   FGC   ++           A GVMGLG+G+IS ASQ
Sbjct: 182  GFFSTETTTLNASSGSAVKFKKFVFGCSFEASGPSITGPSFNGAQGVMGLGRGSISLASQ 241

Query: 972  IGRKVGDKFSYCLVDYTASPPRSSYLFIGHHGAIHRS-LHYTPLIHNKFAETFYYLGVEK 796
            +GR+ G+KFSYCL+DYT SP  +SYL IG    ++ S + YTP+I+N F  TFYY+G+E 
Sbjct: 242  LGRRFGNKFSYCLMDYTLSPTPTSYLLIGRSAEVNDSKMSYTPMINNPFTSTFYYIGIES 301

Query: 795  LWIGDRVLRLPAKIWEIDLHGNGGTIIDSGTTLTFLAGPAYVAVLKAYEKSVKYPKVSHA 616
            ++I D  L++   +W ID  GNGGT++DSGTTLTFLA PAY  ++K +++ V+ P+V   
Sbjct: 302  VYIEDIKLQISPSVWAIDELGNGGTVMDSGTTLTFLAEPAYRRIVKEFKRLVRLPEVDDP 361

Query: 615  FEDFDLCYNVSGIRRVHFPRFRISLKGNVRFEPPTSNYFINAAEDVRCLALRGVSSRSGF 436
              +FD C NVS + +  FP+    L+G+    P   NYFI+ AEDV+CLAL+ +++ SGF
Sbjct: 362  TLEFDFCVNVSSVSKPSFPKMSFKLRGDSVLSPTPGNYFIDTAEDVKCLALQPLAAPSGF 421

Query: 435  SIIGNLMQQNFYIVYDRERSRLGFSQTDC 349
            S+IGNLMQQ F   +DR+RSR+GF++  C
Sbjct: 422  SVIGNLMQQGFVFEFDRDRSRIGFTRHGC 450


>JAU52374.1 Aspartic proteinase nepenthesin-2, partial [Noccaea caerulescens]
          Length = 483

 Score =  405 bits (1041), Expect = e-132
 Identities = 217/463 (46%), Positives = 279/463 (60%), Gaps = 8/463 (1%)
 Frame = -3

Query: 1710 AITTKHAALKLKLIHRHSPDSPFRKPYANRREHLGDLIRDDERRLHGFQKLLDRNLPVPH 1531
            A  T    LKL L+ +    SPF  P           +  D RRLH F  L  + +P   
Sbjct: 51   AAVTDDEYLKLPLLRK----SPFPSPT--------QALALDTRRLH-FLSLRRKTVPF-- 95

Query: 1530 DDLPTSSSLAFRGHGNILAPVISGAAAGSGQYIVDFTVGTPPQHFMLVADTGSDLLWVKC 1351
                            + +PV+SGA++GSGQY VD  +G PPQ  +L+ADTGSDL+WVKC
Sbjct: 96   ----------------VKSPVVSGASSGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKC 139

Query: 1350 SGCRDRCLRKPGTAFFARRSRTFSPLHCYAPACELVPGPPDAT-CNATREHSSCMYEYVY 1174
            S CR+  L  P T FF R S TFSP HCY P C LVP P  A  CN TR HS+C YEY Y
Sbjct: 140  SACRNCSLHSPATVFFPRHSSTFSPTHCYDPICRLVPKPSRAPKCNHTRIHSTCPYEYAY 199

Query: 1173 ADLSDSTGVFARDTATLNASDGSLVRVRGVAFGCGMHSTXXXXXXXXXXSAHGVMGLGKG 994
            AD S ++G+FAR+T TL  S G    ++ VAFGCG   +           AHGVMGLG+G
Sbjct: 200  ADGSLTSGLFARETTTLKTSSGREANLKSVAFGCGFRISGQSVSGTSFNGAHGVMGLGRG 259

Query: 993  AISFASQIGRKVGDKFSYCLVDYTASPPRSSYLFIGHHGAIHRS-----LHYTPLIHNKF 829
             ISFASQ+GR+ G+KFSYCL+DYT SPP +SYL IG  G  +RS     L++TPL+ N  
Sbjct: 260  PISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGDGGG-NRSNAVSKLYFTPLLANPL 318

Query: 828  AETFYYLGVEKLWIGDRVLRLPAKIWEIDLHGNGGTIIDSGTTLTFLAGPAYVAVLKAYE 649
            + TFYY+ ++ +++    LR+   IWEID  GNGGT++DSGTTL FLA PAY  ++ A  
Sbjct: 319  SPTFYYVRLKSVFVNGAKLRIDRSIWEIDGLGNGGTVVDSGTTLAFLADPAYRLLIAAVR 378

Query: 648  KSVKYPKVSHAFEDFDLCYNVSGIRRVH--FPRFRISLKGNVRFEPPTSNYFINAAEDVR 475
            + ++ P  +     F+LC NVSGI +     PR +    G   F PP  NYFI   E ++
Sbjct: 379  RRIRLPMAAELTPGFELCVNVSGISKPEKIMPRLKFEFSGGAVFVPPPRNYFIETEEQIQ 438

Query: 474  CLALRGVSSRSGFSIIGNLMQQNFYIVYDRERSRLGFSQTDCA 346
            CLA++ V+ + GFS+IGNLMQQ F   +DR+RSRLGFS+  CA
Sbjct: 439  CLAIQSVNPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 481


Top