BLASTX nr result

ID: Zingiber25_contig00023980 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber25_contig00023980
         (1345 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgar...   440   e-121
ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group] g...   436   e-120
ref|XP_006418409.1| hypothetical protein EUTSA_v10009389mg [Eutr...   436   e-120
ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor,...   436   e-119
ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [S...   434   e-119
gb|EMJ19165.1| hypothetical protein PRUPE_ppa005289mg [Prunus pe...   432   e-118
ref|XP_002892074.1| aspartyl protease family protein [Arabidopsi...   431   e-118
ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1...   430   e-118
tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea m...   429   e-117
ref|NP_171637.1| aspartyl protease family protein [Arabidopsis t...   427   e-117
gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putativ...   427   e-117
ref|XP_002302634.2| aspartyl protease family protein [Populus tr...   426   e-117
ref|XP_006307379.1| hypothetical protein CARUB_v10009005mg [Caps...   426   e-117
gb|EXB62168.1| Aspartic proteinase nepenthesin-1 [Morus notabilis]    425   e-116
gb|EOX95694.1| Eukaryotic aspartyl protease family protein isofo...   425   e-116
gb|EOX95693.1| Eukaryotic aspartyl protease family protein isofo...   425   e-116
ref|XP_006491285.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   424   e-116
ref|XP_004306664.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   424   e-116
ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1...   424   e-116
ref|XP_004969076.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   422   e-115

>dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
            gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum
            vulgare subsp. vulgare]
          Length = 492

 Score =  440 bits (1131), Expect = e-121
 Identities = 231/421 (54%), Positives = 280/421 (66%), Gaps = 4/421 (0%)
 Frame = +3

Query: 87   APPXXXXXXXXXXXXVVAGNGTSKPLQRQTLLVTPLRSPATVVPEE---DEAPSIATGVD 257
            APP              A N ++KP+Q Q LL TPL       P E   D+  S+  G  
Sbjct: 4    APPPLLPLSALLLLLAAASNASAKPVQTQALLATPLSPDRVSAPSELARDDDDSVFAGNL 63

Query: 258  SESEGTLSPSSFHFELSHRDSLLAATASPEQIFSLRLDRDAERVESLRQMLAEVAAPLPR 437
            + +E   + S+  F L HRD   +  A+  ++ + RL+RDA+R   L    +  A P   
Sbjct: 64   ASAEDAPA-STVRFRLVHRDDF-SVNATAAELLAYRLERDAKRAARL----SAAAGPANG 117

Query: 438  NVTGRRGFSSKVVSGLAQGSGEYFARIGIGTPPRYVYMVLDTGSDIVWLQCAPCRRCYSQ 617
               G  G  + VVSGLAQGSGEYF +IG+GTP     MVLDTGSD+VWLQCAPCRRCY Q
Sbjct: 118  TRRGGGGVVAPVVSGLAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYEQ 177

Query: 618  SDPIFDPRRSHTYAAVPCGTPLCRRLDVAGCDTRRRSCQYQVSYGDGSITMGEFSTETLT 797
            S  +FDPRRS +Y AV C  PLCRRLD  GCD RR +C YQV+YGDGS+T G+F+TETLT
Sbjct: 178  SGQVFDPRRSRSYNAVGCAAPLCRRLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLT 237

Query: 798  FRRSVRVPRVALGCGHDNEXXXXXXXXXXXXXXXXXSFPSQAGRRFGRMFSYCLVDRT-S 974
            F    RV RVALGCGHDNE                 SFP+Q  RR+GR FSYCLVDRT S
Sbjct: 238  FAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDRTSS 297

Query: 975  AGAPNRSSTVVFGNSAVPRASSRVAYTPMLRNPKVDSFYYIELTGVSVGGTRVPGVLASD 1154
            A   +RSSTV FG+ AV  ++   ++TPM++NP++++FYY++L G+SVGG RVPGV  SD
Sbjct: 298  ANTASRSSTVTFGSGAV-GSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSD 356

Query: 1155 LRLDPSTGRGGVIIDSGTSVTRLARSAYQALRDAFRAGTTGLKLAPGGFSLFDTCYDLSG 1334
            LRLDPS+GRGGVI+DSGTSVTRLAR AY ALRDAFR    GL+L+PGGFSLFDTCYDLSG
Sbjct: 357  LRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLFDTCYDLSG 416

Query: 1335 R 1337
            R
Sbjct: 417  R 417


>ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
            gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein
            cnd41-like [Oryza sativa Japonica Group]
            gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa
            Japonica Group] gi|125526702|gb|EAY74816.1| hypothetical
            protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score =  436 bits (1122), Expect = e-120
 Identities = 229/400 (57%), Positives = 274/400 (68%), Gaps = 6/400 (1%)
 Frame = +3

Query: 153  SKPLQRQTLLVTPLRS-PATVVPEEDEAPSIATGVDSESEGTLSPSSFHFELSHRDSLLA 329
            ++ ++ QTL+ TPL   P T    ED+   +  G  +  EG  + S+    + HRD   A
Sbjct: 29   AEAVRYQTLVATPLSPHPYTATAVEDDG--LFQGSLAADEGGAAASTVGLRVVHRDDF-A 85

Query: 330  ATASPEQIFSLRLDRDAERVESLRQMLAEVAAPLPRNVTGR---RGFSSKVVSGLAQGSG 500
              A+  ++ + RL RD  R   +       AA     V G     GF + VVSGLAQGSG
Sbjct: 86   VNATAAELLAHRLRRDKRRASRISAAAGGAAAANGTRVGGGGGGSGFVAPVVSGLAQGSG 145

Query: 501  EYFARIGIGTPPRYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRRSHTYAAVPCGTP 680
            EYF +IG+GTP     MVLDTGSD+VWLQCAPCRRCY QS  +FDPR SH+Y AV C  P
Sbjct: 146  EYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAAP 205

Query: 681  LCRRLDVAGCDTRRRSCQYQVSYGDGSITMGEFSTETLTFRRSVRVPRVALGCGHDNEXX 860
            LCRRLD  GCD RR++C YQV+YGDGS+T G+F+TETLTF    RVPRVALGCGHDNE  
Sbjct: 206  LCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFASGARVPRVALGCGHDNEGL 265

Query: 861  XXXXXXXXXXXXXXXSFPSQAGRRFGRMFSYCLVDRT--SAGAPNRSSTVVFGNSAVPRA 1034
                           SFPSQ  RRFGR FSYCLVDRT  SA A +RSSTV FG+ AV   
Sbjct: 266  FVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTVTFGSGAV-GP 324

Query: 1035 SSRVAYTPMLRNPKVDSFYYIELTGVSVGGTRVPGVLASDLRLDPSTGRGGVIIDSGTSV 1214
            S+  ++TPM++NP++++FYY++L G+SVGG RVPGV  SDLRLDPSTGRGGVI+DSGTSV
Sbjct: 325  SAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPSTGRGGVIVDSGTSV 384

Query: 1215 TRLARSAYQALRDAFRAGTTGLKLAPGGFSLFDTCYDLSG 1334
            TRLAR AY ALRDAFRA   GL+L+PGGFSLFDTCYDLSG
Sbjct: 385  TRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLSG 424


>ref|XP_006418409.1| hypothetical protein EUTSA_v10009389mg [Eutrema salsugineum]
            gi|557096180|gb|ESQ36762.1| hypothetical protein
            EUTSA_v10009389mg [Eutrema salsugineum]
          Length = 486

 Score =  436 bits (1122), Expect = e-120
 Identities = 234/390 (60%), Positives = 280/390 (71%), Gaps = 5/390 (1%)
 Frame = +3

Query: 189  PLRSPATVVPE-EDEAPSIATGVDSESEGTLSPSSFHFELSHRDSLLAATASPEQIFSLR 365
            P  SP +  PE E ++ S+  G +SE  G+ S SS    L H D+L +   +P+++FS R
Sbjct: 39   PSASPTSFQPESEPDSESLLGGSESEY-GSDSESSITLNLDHIDAL-STNRTPQELFSFR 96

Query: 366  LDRDAERVESLRQMLAEV----AAPLPRNVTGRRGFSSKVVSGLAQGSGEYFARIGIGTP 533
            L RD+ RVES+  + A +    A   PR V    GFSS VVSGL+QGSGEYF R+G+GTP
Sbjct: 97   LQRDSRRVESIATLAARIPRRNATHAPRTV----GFSSSVVSGLSQGSGEYFTRLGVGTP 152

Query: 534  PRYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRRSHTYAAVPCGTPLCRRLDVAGCD 713
             RYVYMVLDTGSDIVWLQCAPCR+CYSQSDPIFDPR+S TY+ +PC +PLCRRLD AGC+
Sbjct: 153  ARYVYMVLDTGSDIVWLQCAPCRKCYSQSDPIFDPRKSRTYSTIPCSSPLCRRLDSAGCN 212

Query: 714  TRRRSCQYQVSYGDGSITMGEFSTETLTFRRSVRVPRVALGCGHDNEXXXXXXXXXXXXX 893
            TRRR+C YQVSYGDGS T+G+FSTETLTFRR+ RV  VALGCGHDNE             
Sbjct: 213  TRRRTCLYQVSYGDGSFTVGDFSTETLTFRRN-RVKGVALGCGHDNEGLFVGAAGLLGLG 271

Query: 894  XXXXSFPSQAGRRFGRMFSYCLVDRTSAGAPNRSSTVVFGNSAVPRASSRVAYTPMLRNP 1073
                SFP Q G RF + FSYCLVDR+++  P   S+VVFGN+AV R +    +TP+L NP
Sbjct: 272  KGRLSFPGQTGHRFNQKFSYCLVDRSASSKP---SSVVFGNAAVSRTA---RFTPLLSNP 325

Query: 1074 KVDSFYYIELTGVSVGGTRVPGVLASDLRLDPSTGRGGVIIDSGTSVTRLARSAYQALRD 1253
            K+D+FYY+EL G+SVGGTRVPGV AS  +LD   G GGVIIDSGTSVTRL R AY A+RD
Sbjct: 326  KLDTFYYVELLGISVGGTRVPGVTASLFKLD-QIGNGGVIIDSGTSVTRLIRPAYIAMRD 384

Query: 1254 AFRAGTTGLKLAPGGFSLFDTCYDLSGRTE 1343
            AFR G   LK AP  FSLFDTC+DLS + E
Sbjct: 385  AFRVGAKTLKRAP-DFSLFDTCFDLSNQNE 413


>ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223537425|gb|EEF39053.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 469

 Score =  436 bits (1121), Expect = e-119
 Identities = 232/400 (58%), Positives = 288/400 (72%), Gaps = 3/400 (0%)
 Frame = +3

Query: 153  SKPLQRQTLLVTPLRSPATVVPEEDEAPSIATGVDSESEGTLSPSSFHFELSHRDSLLAA 332
            S  L  QTL+  PLRS  T+   + E+P+         +   S ++F  +L H D+L + 
Sbjct: 23   STSLNYQTLVANPLRSQPTLSWTDSESPT---------DTAESSATFSVQLHHVDAL-SF 72

Query: 333  TASPEQIFSLRLDRDAERVESLRQMLAEVAAPLPRNVTGRR---GFSSKVVSGLAQGSGE 503
             ++PE +F+ RL RDA RVE++   LAE A       TG+R   GFSS V+SGLAQGSGE
Sbjct: 73   NSTPETLFTTRLQRDAARVEAI-SYLAETAG------TGKRVGTGFSSSVISGLAQGSGE 125

Query: 504  YFARIGIGTPPRYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRRSHTYAAVPCGTPL 683
            YF RIG+GTPPRYVYMVLDTGSDIVW+QCAPC+RCY+QSDP+FDPR+S ++A++ C +PL
Sbjct: 126  YFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASIACRSPL 185

Query: 684  CRRLDVAGCDTRRRSCQYQVSYGDGSITMGEFSTETLTFRRSVRVPRVALGCGHDNEXXX 863
            C RLD  GC+T++++C YQVSYGDGS T G+FSTETLTFRR+ RV RVALGCGHDNE   
Sbjct: 186  CHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRT-RVARVALGCGHDNEGLF 244

Query: 864  XXXXXXXXXXXXXXSFPSQAGRRFGRMFSYCLVDRTSAGAPNRSSTVVFGNSAVPRASSR 1043
                          SFPSQ GRRF   FSYCLVDR+++  P   S++VFG+SAV R +  
Sbjct: 245  VGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASSKP---SSMVFGDSAVSRTA-- 299

Query: 1044 VAYTPMLRNPKVDSFYYIELTGVSVGGTRVPGVLASDLRLDPSTGRGGVIIDSGTSVTRL 1223
              +TP++ NPK+D+FYY+EL G+SVGGTRVPG+ AS  +LD  TG GGVIIDSGTSVTRL
Sbjct: 300  -RFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLD-QTGNGGVIIDSGTSVTRL 357

Query: 1224 ARSAYQALRDAFRAGTTGLKLAPGGFSLFDTCYDLSGRTE 1343
             R AY A RDAFRAG + LK AP  FSLFDTC+DLSG+TE
Sbjct: 358  TRPAYIAFRDAFRAGASNLKRAP-QFSLFDTCFDLSGKTE 396


>ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
            gi|241927872|gb|EES01017.1| hypothetical protein
            SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score =  434 bits (1116), Expect = e-119
 Identities = 224/404 (55%), Positives = 274/404 (67%), Gaps = 8/404 (1%)
 Frame = +3

Query: 150  TSKPLQRQTLLVTPLRSPA-TVVPEEDEAPSIATG-VDSESEGTLSPSSFHFELSHRDSL 323
            ++K ++  + + TPL   A T  P  D    +  G +    EG  + S+ HF + HRD+ 
Sbjct: 20   SAKAVEYHSFVATPLSPHAYTAAPSADADEDLFGGSLAVADEGAAAASAVHFRVVHRDAF 79

Query: 324  LAATASPEQIFSLRLDRDAERVESLRQMLAEVAAPLPRNVTGRRG-FSSKVVSGLAQGSG 500
             AA A+  ++   RL RD  R   + +  A   A        R G  ++ VVSGLAQGSG
Sbjct: 80   -AANATAAELLRHRLQRDKRRAARISKAAAGGGAGAANGTRSRGGAVAAPVVSGLAQGSG 138

Query: 501  EYFARIGIGTPPRYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRRSHTYAAVPCGTP 680
            EYF +IG+GTP     MVLDTGSD+VWLQCAPCRRCY QS P+FDPRRS +Y AV C  P
Sbjct: 139  EYFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCYDQSGPVFDPRRSSSYGAVDCAAP 198

Query: 681  LCRRLDVAGCDTRRRSCQYQVSYGDGSITMGEFSTETLTFRRSVRVPRVALGCGHDNEXX 860
            LCRRLD  GCD RRR+C YQV+YGDGS+T G+F+TETLTF    RV RVALGCGHDNE  
Sbjct: 199  LCRRLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFAGGARVARVALGCGHDNEGL 258

Query: 861  XXXXXXXXXXXXXXXSFPSQAGRRFGRMFSYCLVDRTS-----AGAPNRSSTVVFGNSAV 1025
                           SFP+Q  RR+G+ FSYCLVDRTS     A + +RSSTV FG    
Sbjct: 259  FVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCLVDRTSSSSSGAASRSRSSTVTFG---- 314

Query: 1026 PRASSRVAYTPMLRNPKVDSFYYIELTGVSVGGTRVPGVLASDLRLDPSTGRGGVIIDSG 1205
            P ++S  ++TPM+RNP++++FYY++L G+SVGG RVPGV  SDLRLDPSTGRGGVI+DSG
Sbjct: 315  PPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSG 374

Query: 1206 TSVTRLARSAYQALRDAFRAGTTGLKLAPGGFSLFDTCYDLSGR 1337
            TSVTRLAR +Y ALRDAFRA   GL+L+PGGFSLFDTCYDL GR
Sbjct: 375  TSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLGGR 418


>gb|EMJ19165.1| hypothetical protein PRUPE_ppa005289mg [Prunus persica]
          Length = 468

 Score =  432 bits (1112), Expect = e-118
 Identities = 226/397 (56%), Positives = 286/397 (72%), Gaps = 3/397 (0%)
 Frame = +3

Query: 162  LQRQTLLVTPLRSPATVVPEEDEAPSIATGVDSESEGTLSPSSFHFELSHRDSLLAATAS 341
            L+ QTL++ PL +P T+               S  E    P++   +L H D+L +   +
Sbjct: 25   LEHQTLVLNPLPNPPTL---------------SWPESVTDPNTLSVQLHHLDAL-SLNKT 68

Query: 342  PEQIFSLRLDRDAERVESLRQMLAEVAAPLPRNVTGR---RGFSSKVVSGLAQGSGEYFA 512
            P Q+F+LRL RDA RV++L  + A  A+P      GR   RGFSS VVSGLAQGSGEYF 
Sbjct: 69   PSQLFNLRLQRDAVRVKTLSSIAAAAASPNRTARGGRVPIRGFSSSVVSGLAQGSGEYFT 128

Query: 513  RIGIGTPPRYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRRSHTYAAVPCGTPLCRR 692
            R+G+GTPP+YVYMVLDTGSD+VWLQCAPC+RCYSQ+DP+FDPR+S T++ +PCG+PLCR+
Sbjct: 129  RLGVGTPPKYVYMVLDTGSDVVWLQCAPCKRCYSQTDPVFDPRKSGTFSTIPCGSPLCRK 188

Query: 693  LDVAGCDTRRRSCQYQVSYGDGSITMGEFSTETLTFRRSVRVPRVALGCGHDNEXXXXXX 872
            LD +GC   R++C YQVSYGDGS T+G+FSTETLTF R  +V RVALGCGHDNE      
Sbjct: 189  LDSSGCKA-RKTCLYQVSYGDGSFTVGDFSTETLTF-RGTKVGRVALGCGHDNEGLFVGA 246

Query: 873  XXXXXXXXXXXSFPSQAGRRFGRMFSYCLVDRTSAGAPNRSSTVVFGNSAVPRASSRVAY 1052
                       SFP+Q G RF + FSYCLVDR+++  P   S+VVFG+SAV R +    +
Sbjct: 247  AGLLGLGRGKLSFPTQTGVRFNKKFSYCLVDRSASSKP---SSVVFGDSAVSRTA---RF 300

Query: 1053 TPMLRNPKVDSFYYIELTGVSVGGTRVPGVLASDLRLDPSTGRGGVIIDSGTSVTRLARS 1232
            TP++ NPK+D+FYY+EL G+SVGGTRV G+ AS  +LDP+ G GGVI+DSGTSVTRL R 
Sbjct: 301  TPLIANPKLDTFYYVELIGISVGGTRVRGITASLFKLDPA-GNGGVILDSGTSVTRLTRV 359

Query: 1233 AYQALRDAFRAGTTGLKLAPGGFSLFDTCYDLSGRTE 1343
            AY +LRDAFRAGT+GLK AP  FSLFDTC+DLSG++E
Sbjct: 360  AYNSLRDAFRAGTSGLKRAP-EFSLFDTCFDLSGKSE 395


>ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297337916|gb|EFH68333.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score =  431 bits (1108), Expect = e-118
 Identities = 227/385 (58%), Positives = 273/385 (70%)
 Frame = +3

Query: 189  PLRSPATVVPEEDEAPSIATGVDSESEGTLSPSSFHFELSHRDSLLAATASPEQIFSLRL 368
            P  SP +  PE +       G + ES G+ S SS    L H D+L ++  +P+++FS RL
Sbjct: 39   PSASPISFQPESEPDSESLLGSEFES-GSDSESSITLNLDHIDAL-SSNKTPQELFSSRL 96

Query: 369  DRDAERVESLRQMLAEVAAPLPRNVTGRRGFSSKVVSGLAQGSGEYFARIGIGTPPRYVY 548
             RD+ RV+S+  + A++      +     GFSS VVSGL+QGSGEYF R+G+GTP RYVY
Sbjct: 97   QRDSRRVKSIATLAAQIPGRNVTHAPRTGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVY 156

Query: 549  MVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRRSHTYAAVPCGTPLCRRLDVAGCDTRRRS 728
            MVLDTGSDIVWLQCAPCRRCYSQSDPIFDPR+S TYA +PC +P CRRLD AGC+TRR++
Sbjct: 157  MVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKT 216

Query: 729  CQYQVSYGDGSITMGEFSTETLTFRRSVRVPRVALGCGHDNEXXXXXXXXXXXXXXXXXS 908
            C YQVSYGDGS T+G+FSTETLTFRR+ RV  VALGCGHDNE                 S
Sbjct: 217  CLYQVSYGDGSFTVGDFSTETLTFRRN-RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLS 275

Query: 909  FPSQAGRRFGRMFSYCLVDRTSAGAPNRSSTVVFGNSAVPRASSRVAYTPMLRNPKVDSF 1088
            FP Q G RF + FSYCLVDR+++  P   S+VVFGN+AV R +    +TP+L NPK+D+F
Sbjct: 276  FPGQTGHRFNQKFSYCLVDRSASSKP---SSVVFGNAAVSRIA---RFTPLLSNPKLDTF 329

Query: 1089 YYIELTGVSVGGTRVPGVLASDLRLDPSTGRGGVIIDSGTSVTRLARSAYQALRDAFRAG 1268
            YY+EL G+SVGGTRVPGV AS  +LD   G GGVIIDSGTSVTRL R AY A+RDAFR G
Sbjct: 330  YYVELLGISVGGTRVPGVAASLFKLD-QIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVG 388

Query: 1269 TTGLKLAPGGFSLFDTCYDLSGRTE 1343
               LK AP  FSLFDTC+DLS   E
Sbjct: 389  AKALKRAP-DFSLFDTCFDLSNMNE 412


>ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
            distachyon]
          Length = 494

 Score =  430 bits (1106), Expect = e-118
 Identities = 225/407 (55%), Positives = 279/407 (68%), Gaps = 7/407 (1%)
 Frame = +3

Query: 138  AGNGTSKPLQRQTLLVTPLR----SPATVVPEEDEAPSIATGVDSESEGTLSPSSFHFEL 305
            A + T+KP+Q Q+LLVTPL     S ++ +   D+    A  + +  + T  PS+  F +
Sbjct: 23   ASSATAKPVQTQSLLVTPLSPTPFSASSELARGDDKDVFAGNLAAAEDAT--PSTVQFSV 80

Query: 306  SHRDSLLAATASPEQIFSLRLDRDAERVESLRQMLAEVAAPLPRNVTGRRGFS--SKVVS 479
             HRD  +   A+  ++   RL RD +R   +       AA    N T R G    + VVS
Sbjct: 81   VHRDDFVV-NATAAELLGHRLQRDGKRAARIS------AAAGAANGTRRTGSGVVAPVVS 133

Query: 480  GLAQGSGEYFARIGIGTPPRYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRRSHTYA 659
            GLAQGSGEYF +IG+GTP     MVLDTGSD+VWLQCAPCRRCY QS  +FDPRRS +Y 
Sbjct: 134  GLAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQSGQVFDPRRSRSYG 193

Query: 660  AVPCGTPLCRRLDVAGCDTRRRSCQYQVSYGDGSITMGEFSTETLTFRRSVRVPRVALGC 839
            AV C  PLCRRLD  GCD RR++C YQV+YGDGS+T G+F+TETLTF    RV R+ALGC
Sbjct: 194  AVGCSAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAGGARVARIALGC 253

Query: 840  GHDNEXXXXXXXXXXXXXXXXXSFPSQAGRRFGRMFSYCLVDRTSAGAP-NRSSTVVFGN 1016
            GHDNE                 SFP+Q  RR+GR FSYCLVDRTS+  P + SSTV FG+
Sbjct: 254  GHDNEGLFVAAAGLLGLGRGSLSFPAQISRRYGRSFSYCLVDRTSSANPASHSSTVTFGS 313

Query: 1017 SAVPRASSRVAYTPMLRNPKVDSFYYIELTGVSVGGTRVPGVLASDLRLDPSTGRGGVII 1196
             AV  ++   ++TPM++NP++++FYY++L G+SVGG RV GV  SDLRLDPS+GRGGVI+
Sbjct: 314  GAV-GSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSSGRGGVIV 372

Query: 1197 DSGTSVTRLARSAYQALRDAFRAGTTGLKLAPGGFSLFDTCYDLSGR 1337
            DSGTSVTRLAR AY ALRDAFRA   GL+L+PGGFSLFDTCYDLSGR
Sbjct: 373  DSGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGR 419


>tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score =  429 bits (1104), Expect = e-117
 Identities = 225/410 (54%), Positives = 275/410 (67%), Gaps = 8/410 (1%)
 Frame = +3

Query: 132  VVAGNGTSKPLQRQTLLVTPLRSPATVVPEEDEAPSIATG--VDSESEGTLSPSSFHFEL 305
            +VA +   K ++  + + TPL       P  D    +  G    +E     S S+ HF +
Sbjct: 10   LVAASNVVKAVEYHSFVATPLSPHLYTAPSLDADEDVFGGSLAVAEEAAAASDSAVHFRV 69

Query: 306  SHRDSLLAATASPEQIFSLRLDRDAERVESLRQMLAEVAAPLPRNVTGRRGFSSKVVSGL 485
             HRD+  A  A+  ++   RL RD  R   +    +E A     N  GR+G ++ VVSGL
Sbjct: 70   VHRDTF-AVNATAGELLKHRLQRDKRRAARI----SEAAGAGGGN--GRKGVAAPVVSGL 122

Query: 486  AQGSGEYFARIGIGTPPRYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRRSHTYAAV 665
            AQGSGEYF +IG+GTP     MVLDTGSD+VW+QCAPCRRCY QS P+FDPRRS +Y AV
Sbjct: 123  AQGSGEYFTKIGVGTPATQALMVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAV 182

Query: 666  PCGTPLCRRLDVAGCDTRRRSCQYQVSYGDGSITMGEFSTETLTFRRSVRVPRVALGCGH 845
             CG  LCRRLD  GCD RR +C YQV+YGDGS+T G+F TETLTF    RV RVALGCGH
Sbjct: 183  GCGAALCRRLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGH 242

Query: 846  DNEXXXXXXXXXXXXXXXXXSFPSQAGRRFGRMFSYCLVDRTSAGA-----PNRSSTVVF 1010
            DNE                 SFP+Q  RR+GR FSYCLVDRTS+GA      +RSSTV F
Sbjct: 243  DNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSF 302

Query: 1011 GNSAVPRASSRVAYTPMLRNPKVDSFYYIELTGVSVGGTRVPGVLASDLRLDPSTGRGGV 1190
            G  +V   +S  ++TPM+RNP++++FYY++L G+SVGG RVPGV  SDLRLDPSTGRGGV
Sbjct: 303  GAGSV--GASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGV 360

Query: 1191 IIDSGTSVTRLARSAYQALRDAFRAGTT-GLKLAPGGFSLFDTCYDLSGR 1337
            I+DSGTSVTRLAR++Y ALRDAFRA    GL+L+PGGFSLFDTCYDL GR
Sbjct: 361  IVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGR 410


>ref|NP_171637.1| aspartyl protease family protein [Arabidopsis thaliana]
            gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein
            [Arabidopsis thaliana] gi|22135930|gb|AAM91547.1|
            chloroplast nucleoid DNA binding protein, putative
            [Arabidopsis thaliana] gi|30387595|gb|AAP31963.1|
            At1g01300 [Arabidopsis thaliana]
            gi|332189147|gb|AEE27268.1| aspartyl protease family
            protein [Arabidopsis thaliana]
          Length = 485

 Score =  427 bits (1098), Expect = e-117
 Identities = 228/388 (58%), Positives = 275/388 (70%), Gaps = 3/388 (0%)
 Frame = +3

Query: 189  PLRSPATVVPEEDEAPSIATGVDSESEGTLSPSSFHFELSHRDSLLAATASPEQIFSLRL 368
            P  SP +  P+ D    + +  +S S+   S SS    L H D+L ++  +P+++FS RL
Sbjct: 39   PCASPVSFQPDSDSESLLESEFESGSDSE-SSSSITLNLDHIDAL-SSNKTPDELFSSRL 96

Query: 369  DRDAERVESLRQMLAEVAAPLPRNVTGRR---GFSSKVVSGLAQGSGEYFARIGIGTPPR 539
             RD+ RV+S+  + A++     RNVT      GFSS VVSGL+QGSGEYF R+G+GTP R
Sbjct: 97   QRDSRRVKSIATLAAQIPG---RNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPAR 153

Query: 540  YVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRRSHTYAAVPCGTPLCRRLDVAGCDTR 719
            YVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPR+S TYA +PC +P CRRLD AGC+TR
Sbjct: 154  YVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTR 213

Query: 720  RRSCQYQVSYGDGSITMGEFSTETLTFRRSVRVPRVALGCGHDNEXXXXXXXXXXXXXXX 899
            R++C YQVSYGDGS T+G+FSTETLTFRR+ RV  VALGCGHDNE               
Sbjct: 214  RKTCLYQVSYGDGSFTVGDFSTETLTFRRN-RVKGVALGCGHDNEGLFVGAAGLLGLGKG 272

Query: 900  XXSFPSQAGRRFGRMFSYCLVDRTSAGAPNRSSTVVFGNSAVPRASSRVAYTPMLRNPKV 1079
              SFP Q G RF + FSYCLVDR+++  P   S+VVFGN+AV R +    +TP+L NPK+
Sbjct: 273  KLSFPGQTGHRFNQKFSYCLVDRSASSKP---SSVVFGNAAVSRIA---RFTPLLSNPKL 326

Query: 1080 DSFYYIELTGVSVGGTRVPGVLASDLRLDPSTGRGGVIIDSGTSVTRLARSAYQALRDAF 1259
            D+FYY+ L G+SVGGTRVPGV AS  +LD   G GGVIIDSGTSVTRL R AY A+RDAF
Sbjct: 327  DTFYYVGLLGISVGGTRVPGVTASLFKLD-QIGNGGVIIDSGTSVTRLIRPAYIAMRDAF 385

Query: 1260 RAGTTGLKLAPGGFSLFDTCYDLSGRTE 1343
            R G   LK AP  FSLFDTC+DLS   E
Sbjct: 386  RVGAKTLKRAP-DFSLFDTCFDLSNMNE 412


>gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
            thaliana]
          Length = 485

 Score =  427 bits (1098), Expect = e-117
 Identities = 228/388 (58%), Positives = 274/388 (70%), Gaps = 3/388 (0%)
 Frame = +3

Query: 189  PLRSPATVVPEEDEAPSIATGVDSESEGTLSPSSFHFELSHRDSLLAATASPEQIFSLRL 368
            P  SP +  P+ D    + +  +S S+   S SS    L H D+L ++  +P+++FS RL
Sbjct: 39   PCASPVSFQPDSDSESLLESEFESGSDSE-SSSSITLNLDHIDAL-SSNKTPQELFSSRL 96

Query: 369  DRDAERVESLRQMLAEVAAPLPRNVTGRR---GFSSKVVSGLAQGSGEYFARIGIGTPPR 539
             RD+ RV S+  + A++     RNVT      GFSS VVSGL+QGSGEYF R+G+GTP R
Sbjct: 97   QRDSRRVRSIATLAAQIPG---RNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPAR 153

Query: 540  YVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRRSHTYAAVPCGTPLCRRLDVAGCDTR 719
            YVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPR+S TYA +PC +P CRRLD AGC+TR
Sbjct: 154  YVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTR 213

Query: 720  RRSCQYQVSYGDGSITMGEFSTETLTFRRSVRVPRVALGCGHDNEXXXXXXXXXXXXXXX 899
            R++C YQVSYGDGS T+G+FSTETLTFRR+ RV  VALGCGHDNE               
Sbjct: 214  RKTCLYQVSYGDGSFTVGDFSTETLTFRRN-RVKGVALGCGHDNEGLFVGAAGLLGLGKG 272

Query: 900  XXSFPSQAGRRFGRMFSYCLVDRTSAGAPNRSSTVVFGNSAVPRASSRVAYTPMLRNPKV 1079
              SFP Q G RF + FSYCLVDR+++  P   S+VVFGN+AV R +    +TP+L NPK+
Sbjct: 273  KLSFPGQTGHRFNQKFSYCLVDRSASSKP---SSVVFGNAAVSRIA---RFTPLLSNPKL 326

Query: 1080 DSFYYIELTGVSVGGTRVPGVLASDLRLDPSTGRGGVIIDSGTSVTRLARSAYQALRDAF 1259
            D+FYY+ L G+SVGGTRVPGV AS  +LD   G GGVIIDSGTSVTRL R AY A+RDAF
Sbjct: 327  DTFYYVGLLGISVGGTRVPGVTASLFKLD-QIGNGGVIIDSGTSVTRLIRPAYIAMRDAF 385

Query: 1260 RAGTTGLKLAPGGFSLFDTCYDLSGRTE 1343
            R G   LK AP  FSLFDTC+DLS   E
Sbjct: 386  RVGAKTLKRAP-NFSLFDTCFDLSNMNE 412


>ref|XP_002302634.2| aspartyl protease family protein [Populus trichocarpa]
            gi|550345206|gb|EEE81907.2| aspartyl protease family
            protein [Populus trichocarpa]
          Length = 490

 Score =  426 bits (1096), Expect = e-117
 Identities = 228/398 (57%), Positives = 279/398 (70%), Gaps = 5/398 (1%)
 Frame = +3

Query: 165  QRQTLLVTPLRSPATVV-----PEEDEAPSIATGVDSESEGTLSPSSFHFELSHRDSLLA 329
            Q QTL V PL +  T+      PE +  P   T  DS S    + +S   +L H D+L +
Sbjct: 33   QFQTLTVNPLPNKPTLSWADTGPESE--PETQTLTDSTSTEASTTTSLSVQLHHLDAL-S 89

Query: 330  ATASPEQIFSLRLDRDAERVESLRQMLAEVAAPLPRNVTGRRGFSSKVVSGLAQGSGEYF 509
            +  +P+ +F+ RL RDA RV+SL  + A V +       G  GFSS V SGLAQGSGEYF
Sbjct: 90   SDETPQDLFNSRLARDASRVKSLTSLAAAVGSTNRTRARGP-GFSSSVTSGLAQGSGEYF 148

Query: 510  ARIGIGTPPRYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRRSHTYAAVPCGTPLCR 689
             R+G+GTP RYV+MVLDTGSD+VW+QCAPC++CYSQ+DP+F+P +S ++A +PCG+PLCR
Sbjct: 149  TRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANIPCGSPLCR 208

Query: 690  RLDVAGCDTRRRSCQYQVSYGDGSITMGEFSTETLTFRRSVRVPRVALGCGHDNEXXXXX 869
            RLD  GC T++  C YQVSYGDGS T GEFSTETLTF R  RV RVALGCGHDNE     
Sbjct: 209  RLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTF-RGTRVGRVALGCGHDNEGLFIG 267

Query: 870  XXXXXXXXXXXXSFPSQAGRRFGRMFSYCLVDRTSAGAPNRSSTVVFGNSAVPRASSRVA 1049
                        SFPSQ GRRF R FSYCLVDR+++  P   S +VFG+SA+ R +    
Sbjct: 268  AAGLLGLGRGRLSFPSQIGRRFSRKFSYCLVDRSASSKP---SYMVFGDSAISRTA---R 321

Query: 1050 YTPMLRNPKVDSFYYIELTGVSVGGTRVPGVLASDLRLDPSTGRGGVIIDSGTSVTRLAR 1229
            +TP++ NPK+D+FYY+EL GVSVGGTRVPG+ AS  +LD STG GGVIIDSGTSVTRL R
Sbjct: 322  FTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLD-STGNGGVIIDSGTSVTRLTR 380

Query: 1230 SAYQALRDAFRAGTTGLKLAPGGFSLFDTCYDLSGRTE 1343
             AY ALRDAFR G + LK AP  FSLFDTC+DLSG+TE
Sbjct: 381  PAYVALRDAFRVGASNLKRAP-EFSLFDTCFDLSGKTE 417


>ref|XP_006307379.1| hypothetical protein CARUB_v10009005mg [Capsella rubella]
            gi|482576090|gb|EOA40277.1| hypothetical protein
            CARUB_v10009005mg [Capsella rubella]
          Length = 481

 Score =  426 bits (1096), Expect = e-117
 Identities = 230/388 (59%), Positives = 277/388 (71%), Gaps = 3/388 (0%)
 Frame = +3

Query: 189  PLRSPATVVPEEDEAPSIATGVDSESEGTLSPSSFHFELSHRDSLLAATASPEQIFSLRL 368
            P  SP +  P+ D    + + ++SESE   S +S    L H D+L ++  +P+++FS RL
Sbjct: 39   PSASPVSFQPDSDSL--LGSELESESE---SEASISLNLDHIDAL-SSNKTPDELFSSRL 92

Query: 369  DRDAERVESLRQMLAEVAAPLPRNVTGRR---GFSSKVVSGLAQGSGEYFARIGIGTPPR 539
             RD+ RV+S+  + A V     RNVT      GFSS VVSGL+QGSGEYF R+G+GTP R
Sbjct: 93   LRDSRRVKSIVTLAARVPR---RNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPAR 149

Query: 540  YVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRRSHTYAAVPCGTPLCRRLDVAGCDTR 719
            YVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPR+S TY+ +PC +P CRRLD AGC+TR
Sbjct: 150  YVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSRTYSTIPCSSPQCRRLDSAGCNTR 209

Query: 720  RRSCQYQVSYGDGSITMGEFSTETLTFRRSVRVPRVALGCGHDNEXXXXXXXXXXXXXXX 899
            R++C YQVSYGDGS T+G+FSTETLTFRR+ RV  VALGCGHDNE               
Sbjct: 210  RKTCLYQVSYGDGSFTVGDFSTETLTFRRN-RVKGVALGCGHDNEGLFVGAAGLLGLGKG 268

Query: 900  XXSFPSQAGRRFGRMFSYCLVDRTSAGAPNRSSTVVFGNSAVPRASSRVAYTPMLRNPKV 1079
              SFP Q G RF + FSYCLVDR+++  P   S+VVFGN+AV R +    +TP+L NPK+
Sbjct: 269  KLSFPGQTGHRFNQKFSYCLVDRSASSKP---SSVVFGNAAVSRTA---RFTPLLSNPKL 322

Query: 1080 DSFYYIELTGVSVGGTRVPGVLASDLRLDPSTGRGGVIIDSGTSVTRLARSAYQALRDAF 1259
            D+FYY+EL G+SVGGTRVPGV AS  +LD   G GGVIIDSGTSVTRL R AY A+RDAF
Sbjct: 323  DTFYYVELLGISVGGTRVPGVTASLFKLD-QIGNGGVIIDSGTSVTRLIRPAYIAMRDAF 381

Query: 1260 RAGTTGLKLAPGGFSLFDTCYDLSGRTE 1343
            R G   LK AP  FSLFDTC+DLS   E
Sbjct: 382  RVGARTLKRAP-DFSLFDTCFDLSNMNE 408


>gb|EXB62168.1| Aspartic proteinase nepenthesin-1 [Morus notabilis]
          Length = 491

 Score =  425 bits (1093), Expect = e-116
 Identities = 233/409 (56%), Positives = 289/409 (70%), Gaps = 9/409 (2%)
 Frame = +3

Query: 144  NGTSKPLQRQTLLVTPLRSPATVVPEEDEAPSIATGVDSESEG-----TLSPSSFHFELS 308
            + ++ PL+ +TLL+T L  P   +   D    + TG D ESE      T +  S   +L 
Sbjct: 25   SASTPPLEYETLLLTSLPIPQQTLSWPDSESEL-TGSDLESETAAAEETETSLSISAQLH 83

Query: 309  HRDSLLAATASPEQIFSLRLDRDAERVESLRQMLAEVAAPLPRNVTGRRG----FSSKVV 476
            H D+L +A  SPEQ+F LRL RDA RV++L ++ A  A+   RNV+  RG    FSS V+
Sbjct: 84   HIDAL-SADKSPEQLFDLRLQRDALRVKNLVEVTAAAAS---RNVSRTRGAAPGFSSSVI 139

Query: 477  SGLAQGSGEYFARIGIGTPPRYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRRSHTY 656
            SGLAQGSGEYF R+G+GTPPRYVYMVLDTGSD+VWLQCAPCR+CY+Q+DP+FDP +S ++
Sbjct: 140  SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWLQCAPCRKCYTQADPVFDPSKSRSF 199

Query: 657  AAVPCGTPLCRRLDVAGCDTRRRSCQYQVSYGDGSITMGEFSTETLTFRRSVRVPRVALG 836
            A + CG+PLCR+LD  GC+ +R+ C YQVSYGDGS T GEFSTETLTFRR+ R+ RVALG
Sbjct: 200  ARISCGSPLCRKLDSPGCN-QRKMCLYQVSYGDGSFTTGEFSTETLTFRRT-RIGRVALG 257

Query: 837  CGHDNEXXXXXXXXXXXXXXXXXSFPSQAGRRFGRMFSYCLVDRTSAGAPNRSSTVVFGN 1016
            CGHDNE                 SFP Q G RF R FSYCL DR+++  P   S++VFG+
Sbjct: 258  CGHDNEGLFVGAAGLLGLGRGRLSFPFQTGLRFNRKFSYCLADRSASSKP---SSMVFGD 314

Query: 1017 SAVPRASSRVAYTPMLRNPKVDSFYYIELTGVSVGGTRVPGVLASDLRLDPSTGRGGVII 1196
            SAV R +    +TP+L NPK+D+FYY+EL  +SVGG+RV G+ AS  +LD   G GGVII
Sbjct: 315  SAVSRTA---RFTPLLTNPKLDTFYYLELLAISVGGSRVRGISASLFKLD-QAGNGGVII 370

Query: 1197 DSGTSVTRLARSAYQALRDAFRAGTTGLKLAPGGFSLFDTCYDLSGRTE 1343
            DSGTSVTRL R AY ALRDAFRAG+  LK AP  FSLFDTCYDLSG+TE
Sbjct: 371  DSGTSVTRLTRPAYVALRDAFRAGSVNLKRAP-EFSLFDTCYDLSGKTE 418


>gb|EOX95694.1| Eukaryotic aspartyl protease family protein isoform 2 [Theobroma
            cacao]
          Length = 488

 Score =  425 bits (1093), Expect = e-116
 Identities = 222/399 (55%), Positives = 285/399 (71%), Gaps = 3/399 (0%)
 Frame = +3

Query: 153  SKPLQRQTLLVTPLRSPATVVPEEDE--APSIATGVDSESEGTLSPSSFHFELSHRDSLL 326
            S P Q QTL+   L SP+T+  ++ E  + S+    D ++  + +      EL H D+  
Sbjct: 27   STPFQLQTLVPRTLPSPSTLSGQDSELESDSLVETSDLDTVNSNTTLEVQLELHHVDAF- 85

Query: 327  AATASPEQIFSLRLDRDAERVESLRQMLAEVAAPLPRNVTGRR-GFSSKVVSGLAQGSGE 503
            ++   PE++F LRL RD  R E++  ++A+  A  P    GRR GFSS ++SGLAQGSGE
Sbjct: 86   SSEEIPERLFDLRLQRDELRAETINSLVAKAVARNPPRAPGRRSGFSSSIISGLAQGSGE 145

Query: 504  YFARIGIGTPPRYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRRSHTYAAVPCGTPL 683
            YF R+G+GTPPRY+YMVLDTGSD+VW+QC+PC++CYSQSDPIFDP +S +++ +PCG+PL
Sbjct: 146  YFTRLGVGTPPRYLYMVLDTGSDVVWVQCSPCKKCYSQSDPIFDPTKSRSFSGIPCGSPL 205

Query: 684  CRRLDVAGCDTRRRSCQYQVSYGDGSITMGEFSTETLTFRRSVRVPRVALGCGHDNEXXX 863
            CR LD +GC+ +RR C YQVSYGDGS+T G+FSTETLTFRR+ RV RVA+GCGHDNE   
Sbjct: 206  CRSLDSSGCN-QRRMCLYQVSYGDGSVTFGDFSTETLTFRRT-RVGRVAIGCGHDNEGLF 263

Query: 864  XXXXXXXXXXXXXXSFPSQAGRRFGRMFSYCLVDRTSAGAPNRSSTVVFGNSAVPRASSR 1043
                          SFPSQ GRRF + FSYCLVDR+   A +R S++VFG++AVPRA+  
Sbjct: 264  VGAAGLLGLGRGRLSFPSQTGRRFNQKFSYCLVDRS---ASSRPSSLVFGDAAVPRAA-- 318

Query: 1044 VAYTPMLRNPKVDSFYYIELTGVSVGGTRVPGVLASDLRLDPSTGRGGVIIDSGTSVTRL 1223
               TP+L NPK+D+FYYIEL G+SVGG RVP +  S  ++D   G GGVIIDSGTSVTRL
Sbjct: 319  -MLTPLLTNPKLDTFYYIELLGISVGGIRVPRITPSLFKMD-QAGNGGVIIDSGTSVTRL 376

Query: 1224 ARSAYQALRDAFRAGTTGLKLAPGGFSLFDTCYDLSGRT 1340
             R AY A+RDAFR G + LK AP  FSLFDTC+DLSG+T
Sbjct: 377  TRPAYIAMRDAFRIGASNLKGAP-DFSLFDTCFDLSGKT 414


>gb|EOX95693.1| Eukaryotic aspartyl protease family protein isoform 1 [Theobroma
            cacao]
          Length = 557

 Score =  425 bits (1093), Expect = e-116
 Identities = 222/399 (55%), Positives = 285/399 (71%), Gaps = 3/399 (0%)
 Frame = +3

Query: 153  SKPLQRQTLLVTPLRSPATVVPEEDE--APSIATGVDSESEGTLSPSSFHFELSHRDSLL 326
            S P Q QTL+   L SP+T+  ++ E  + S+    D ++  + +      EL H D+  
Sbjct: 27   STPFQLQTLVPRTLPSPSTLSGQDSELESDSLVETSDLDTVNSNTTLEVQLELHHVDAF- 85

Query: 327  AATASPEQIFSLRLDRDAERVESLRQMLAEVAAPLPRNVTGRR-GFSSKVVSGLAQGSGE 503
            ++   PE++F LRL RD  R E++  ++A+  A  P    GRR GFSS ++SGLAQGSGE
Sbjct: 86   SSEEIPERLFDLRLQRDELRAETINSLVAKAVARNPPRAPGRRSGFSSSIISGLAQGSGE 145

Query: 504  YFARIGIGTPPRYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRRSHTYAAVPCGTPL 683
            YF R+G+GTPPRY+YMVLDTGSD+VW+QC+PC++CYSQSDPIFDP +S +++ +PCG+PL
Sbjct: 146  YFTRLGVGTPPRYLYMVLDTGSDVVWVQCSPCKKCYSQSDPIFDPTKSRSFSGIPCGSPL 205

Query: 684  CRRLDVAGCDTRRRSCQYQVSYGDGSITMGEFSTETLTFRRSVRVPRVALGCGHDNEXXX 863
            CR LD +GC+ +RR C YQVSYGDGS+T G+FSTETLTFRR+ RV RVA+GCGHDNE   
Sbjct: 206  CRSLDSSGCN-QRRMCLYQVSYGDGSVTFGDFSTETLTFRRT-RVGRVAIGCGHDNEGLF 263

Query: 864  XXXXXXXXXXXXXXSFPSQAGRRFGRMFSYCLVDRTSAGAPNRSSTVVFGNSAVPRASSR 1043
                          SFPSQ GRRF + FSYCLVDR+   A +R S++VFG++AVPRA+  
Sbjct: 264  VGAAGLLGLGRGRLSFPSQTGRRFNQKFSYCLVDRS---ASSRPSSLVFGDAAVPRAA-- 318

Query: 1044 VAYTPMLRNPKVDSFYYIELTGVSVGGTRVPGVLASDLRLDPSTGRGGVIIDSGTSVTRL 1223
               TP+L NPK+D+FYYIEL G+SVGG RVP +  S  ++D   G GGVIIDSGTSVTRL
Sbjct: 319  -MLTPLLTNPKLDTFYYIELLGISVGGIRVPRITPSLFKMD-QAGNGGVIIDSGTSVTRL 376

Query: 1224 ARSAYQALRDAFRAGTTGLKLAPGGFSLFDTCYDLSGRT 1340
             R AY A+RDAFR G + LK AP  FSLFDTC+DLSG+T
Sbjct: 377  TRPAYIAMRDAFRIGASNLKGAP-DFSLFDTCFDLSGKT 414


>ref|XP_006491285.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Citrus
            sinensis]
          Length = 480

 Score =  424 bits (1090), Expect = e-116
 Identities = 230/403 (57%), Positives = 285/403 (70%), Gaps = 9/403 (2%)
 Frame = +3

Query: 162  LQRQTLLVTPLRSPATVVPEEDEAPSIATGVDSESEGTL------SPSSFHFELSHRDSL 323
            LQ QT ++  L +P+T+     E+ S+     SESE +L      + SS    L H DSL
Sbjct: 23   LQYQTFVLNSLPTPSTL--SWPESVSV-----SESESSLPLPAPDAESSLSLRLHHVDSL 75

Query: 324  LAATASPEQIFSLRLDRDAERVESLRQMLAEVAAPLPRNVTGRR---GFSSKVVSGLAQG 494
             +   +PE +F+LR+ RD  RV+SL           PRN +  R   GFSS V+SGLAQG
Sbjct: 76   -SFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQG 134

Query: 495  SGEYFARIGIGTPPRYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRRSHTYAAVPCG 674
            SGEYF R+G+GTPPRYVYMVLDTGSD+VW+QCAPC++CYSQ+DP+FDP +S ++A VPC 
Sbjct: 135  SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194

Query: 675  TPLCRRLDVAGCDTRRRSCQYQVSYGDGSITMGEFSTETLTFRRSVRVPRVALGCGHDNE 854
            +PLCR+LD +GC+ RR +C YQVSYGDGSIT+G+FSTETLTF R  RV RVALGCGHDNE
Sbjct: 195  SPLCRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGCGHDNE 252

Query: 855  XXXXXXXXXXXXXXXXXSFPSQAGRRFGRMFSYCLVDRTSAGAPNRSSTVVFGNSAVPRA 1034
                             SFP+Q GRRF R FSYCLVDR+++  P   S++VFG+SAV R 
Sbjct: 253  GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKP---SSMVFGDSAVSRT 309

Query: 1035 SSRVAYTPMLRNPKVDSFYYIELTGVSVGGTRVPGVLASDLRLDPSTGRGGVIIDSGTSV 1214
            +    +TP+L NPK+D+FYY+EL G+SVGG  V G+ AS  +LDP+ G GGVIIDSGTSV
Sbjct: 310  A---RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGVIIDSGTSV 365

Query: 1215 TRLARSAYQALRDAFRAGTTGLKLAPGGFSLFDTCYDLSGRTE 1343
            TRL R AY ALRDAFRAG + LK AP  FSLFDTC+DLSG+TE
Sbjct: 366  TRLTRPAYIALRDAFRAGASSLKRAP-DFSLFDTCFDLSGKTE 407


>ref|XP_004306664.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Fragaria
            vesca subsp. vesca]
          Length = 467

 Score =  424 bits (1090), Expect = e-116
 Identities = 224/394 (56%), Positives = 280/394 (71%)
 Frame = +3

Query: 162  LQRQTLLVTPLRSPATVVPEEDEAPSIATGVDSESEGTLSPSSFHFELSHRDSLLAATAS 341
            L  QTLL++PL S  ++   E  +       ++ SE    PSS    L H D+L ++  +
Sbjct: 23   LDHQTLLLSPLPSAPSLSQPESFS-------ETTSEPDSDPSSLSLPLHHLDAL-SSDQT 74

Query: 342  PEQIFSLRLDRDAERVESLRQMLAEVAAPLPRNVTGRRGFSSKVVSGLAQGSGEYFARIG 521
            P Q+F LRL RD+ R  SL  +      P P +     GFSS +VSGL+QGSGEYF RIG
Sbjct: 75   PSQLFHLRLRRDSLRFNSLTSLAYNRTRPGPSS-----GFSSSIVSGLSQGSGEYFTRIG 129

Query: 522  IGTPPRYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRRSHTYAAVPCGTPLCRRLDV 701
            +G+PP+Y+YMVLDTGSD+VWLQCAPC+RCYSQ+D +FDPR+S +Y+++PC +PLCRRLD 
Sbjct: 130  VGSPPKYLYMVLDTGSDVVWLQCAPCKRCYSQTDLVFDPRKSSSYSSLPCSSPLCRRLDS 189

Query: 702  AGCDTRRRSCQYQVSYGDGSITMGEFSTETLTFRRSVRVPRVALGCGHDNEXXXXXXXXX 881
             GC ++ ++C YQVSYGDGS T G+FSTETLTFRRS +VP+VALGCGHDNE         
Sbjct: 190  PGCSSKSKTCLYQVSYGDGSFTFGDFSTETLTFRRS-KVPKVALGCGHDNEGLFVGAAGL 248

Query: 882  XXXXXXXXSFPSQAGRRFGRMFSYCLVDRTSAGAPNRSSTVVFGNSAVPRASSRVAYTPM 1061
                    SFP+Q G RF   FSYCLVDR+++  P   S+VVFG+SAV R +    +TP+
Sbjct: 249  LGLGRGKLSFPTQTGSRFNSKFSYCLVDRSASSKP---SSVVFGDSAVSRTA---RFTPL 302

Query: 1062 LRNPKVDSFYYIELTGVSVGGTRVPGVLASDLRLDPSTGRGGVIIDSGTSVTRLARSAYQ 1241
            + NPK+D+FYYIEL G+SVGGTRV G+ AS  +LDPS G GGVIIDSGTSVTRL RSAY 
Sbjct: 303  VPNPKLDTFYYIELLGISVGGTRVRGITASLFKLDPS-GNGGVIIDSGTSVTRLTRSAYI 361

Query: 1242 ALRDAFRAGTTGLKLAPGGFSLFDTCYDLSGRTE 1343
            +LRDAFRAG   LK AP  FSLFDTC+DLSG+TE
Sbjct: 362  SLRDAFRAGARSLKRAP-EFSLFDTCFDLSGKTE 394


>ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
            gi|147788999|emb|CAN64659.1| hypothetical protein
            VITISV_009613 [Vitis vinifera]
          Length = 489

 Score =  424 bits (1089), Expect = e-116
 Identities = 229/405 (56%), Positives = 287/405 (70%), Gaps = 6/405 (1%)
 Frame = +3

Query: 147  GTSKPLQRQTLLVTPLRSPATVVPE---EDEAPSIATGVDSESEGTLSPSSFHFELSHRD 317
            G  KPL+ Q+L+V PL    T   +    +    I+T   SE++ T++       L HRD
Sbjct: 28   GADKPLEYQSLVVRPLGENPTTKSQLSWTETETQISTLPVSETDPTMT-----MHLEHRD 82

Query: 318  SLLAATASPEQIFSLRLDRDAERVESLRQMLAEVAAPLP-RNVTGRRG--FSSKVVSGLA 488
             +LA  A+PE +F+LRL RDA RVE+L +M A        RN T  +G  FSS V SGLA
Sbjct: 83   -VLAFNATPEALFNLRLQRDAFRVEALSKMAAAAGGRRAGRNGTHAQGGGFSSSVTSGLA 141

Query: 489  QGSGEYFARIGIGTPPRYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRRSHTYAAVP 668
            QGSGEYF R+G+GTPP+YVYMVLDTGSD+VW+QCAPCR+CYSQ+DP+FDP++S +++++ 
Sbjct: 142  QGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSIS 201

Query: 669  CGTPLCRRLDVAGCDTRRRSCQYQVSYGDGSITMGEFSTETLTFRRSVRVPRVALGCGHD 848
            C +PLC RLD  GC++ R+SC YQV+YGDGS T GEFSTETLTF R  RVP+VALGCGHD
Sbjct: 202  CRSPLCLRLDSPGCNS-RQSCLYQVAYGDGSFTFGEFSTETLTF-RGTRVPKVALGCGHD 259

Query: 849  NEXXXXXXXXXXXXXXXXXSFPSQAGRRFGRMFSYCLVDRTSAGAPNRSSTVVFGNSAVP 1028
            NE                 SFP+Q G RFGR FSYCLVDR+++  P   S+VVFG SAV 
Sbjct: 260  NEGLFVGAAGLLGLGRGRLSFPTQTGLRFGRKFSYCLVDRSASSKP---SSVVFGQSAVS 316

Query: 1029 RASSRVAYTPMLRNPKVDSFYYIELTGVSVGGTRVPGVLASDLRLDPSTGRGGVIIDSGT 1208
            R +    +TP++ NPK+D+FYY+ELTG+SVGG RV G+ AS  +LD + G GGVIIDSGT
Sbjct: 317  RTA---VFTPLITNPKLDTFYYLELTGISVGGARVAGITASLFKLD-TAGNGGVIIDSGT 372

Query: 1209 SVTRLARSAYQALRDAFRAGTTGLKLAPGGFSLFDTCYDLSGRTE 1343
            SVTRL R AY +LRDAFRAG   LK AP  +SLFDTC+DLSG+TE
Sbjct: 373  SVTRLTRRAYVSLRDAFRAGAADLKRAP-DYSLFDTCFDLSGKTE 416


>ref|XP_004969076.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Setaria
            italica]
          Length = 493

 Score =  422 bits (1085), Expect = e-115
 Identities = 225/410 (54%), Positives = 275/410 (67%), Gaps = 15/410 (3%)
 Frame = +3

Query: 153  SKPLQRQTLLVTPLRSPATVVPEEDEAPSIATGVDSESE--GTLSPS----SFHFELSHR 314
            +K ++  + + TPL       P    AP++ TG D E    G+L+ +    +  F + HR
Sbjct: 21   AKTVEYHSFVATPLS------PHPYTAPAV-TGADDEDVFGGSLAAAEDAAAVRFRVVHR 73

Query: 315  DSLLAATASPEQIFSLRLDRDAERVESLRQMLAEVAAPLPRNVTGRRG-FSSKVVSGLAQ 491
            D+  A  A+  ++   RL RD  R   + +   E A     N T R G  ++ VVSGLA+
Sbjct: 74   DAF-AVNATAAELLKHRLRRDKRRAARISK---EAAGGAAANGTSRGGGVAAPVVSGLAE 129

Query: 492  GSGEYFARIGIGTPPRYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRRSHTYAAVPC 671
            GSGEYF +IG+GTP     MVLDTGSD+VWLQCAPCRRCY QS P+FDPRRS +Y AV C
Sbjct: 130  GSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQSGPVFDPRRSSSYGAVDC 189

Query: 672  GTPLCRRLDVAGCDTRRRSCQYQVSYGDGSITMGEFSTETLTFRRSVRVPRVALGCGHDN 851
              PLCRRLD  GCD RRR+C YQV+YGDGS+T G+F+TETLTF    RV RVALGCGHDN
Sbjct: 190  AAPLCRRLDSGGCDLRRRACMYQVAYGDGSVTAGDFATETLTFAGGARVARVALGCGHDN 249

Query: 852  EXXXXXXXXXXXXXXXXXSFPSQAGRRFGRMFSYCLVDRTSAG--------APNRSSTVV 1007
            E                 SFP+Q  RR+GR FSYCLVDRTS+         A +RSSTV 
Sbjct: 250  EGLFVAAAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDRTSSASSASSGNQAGSRSSTVT 309

Query: 1008 FGNSAVPRASSRVAYTPMLRNPKVDSFYYIELTGVSVGGTRVPGVLASDLRLDPSTGRGG 1187
            FG  AV  ++S  ++TPM+RNP++++FYY++L G+SVGG RVPGV  SDLRLDPSTGRGG
Sbjct: 310  FGPGAVGPSAS-ASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGG 368

Query: 1188 VIIDSGTSVTRLARSAYQALRDAFRAGTTGLKLAPGGFSLFDTCYDLSGR 1337
            VI+DSGTSVTRLAR AY ALRDAFR    GL+L+P GFSLFDTCYDL GR
Sbjct: 369  VIVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPSGFSLFDTCYDLGGR 418

