BLASTX nr result

ID: Mentha26_contig00027844 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00027844
         (659 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU22027.1| hypothetical protein MIMGU_mgv1a005281mg [Mimulus...   271   1e-70
gb|EYU29201.1| hypothetical protein MIMGU_mgv1a005225mg [Mimulus...   264   2e-68
dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein ...   240   3e-61
dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (...   237   2e-60
ref|XP_004245197.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   231   1e-58
ref|XP_007011664.1| Eukaryotic aspartyl protease family protein ...   230   3e-58
ref|XP_006364268.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   229   4e-58
ref|XP_004291984.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   229   8e-58
emb|CBI21177.3| unnamed protein product [Vitis vinifera]              229   8e-58
ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2...   229   8e-58
ref|XP_007011665.1| Eukaryotic aspartyl protease family protein,...   227   2e-57
ref|XP_007011663.1| Eukaryotic aspartyl protease family protein,...   227   2e-57
ref|XP_007011662.1| Eukaryotic aspartyl protease family protein,...   227   2e-57
ref|XP_002324349.1| nucleoid DNA-binding family protein [Populus...   219   8e-55
ref|XP_006483511.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   217   2e-54
ref|XP_006450237.1| hypothetical protein CICLE_v10008143mg [Citr...   217   2e-54
ref|XP_006287637.1| hypothetical protein CARUB_v10000848mg [Caps...   216   4e-54
ref|NP_196638.2| aspartyl protease family protein [Arabidopsis t...   216   7e-54
ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arab...   215   9e-54
ref|XP_007225640.1| hypothetical protein PRUPE_ppa004762mg [Prun...   215   1e-53

>gb|EYU22027.1| hypothetical protein MIMGU_mgv1a005281mg [Mimulus guttatus]
          Length = 490

 Score =  271 bits (693), Expect = 1e-70
 Identities = 140/223 (62%), Positives = 162/223 (72%), Gaps = 13/223 (5%)
 Frame = -3

Query: 630 SNFHTIQISSLFPASICSPSK-DAIKKRPSTLEVFHRHGPCSKLT---------SATRPP 481
           + FHT+QISSL PAS+C+PS      K+ STLEV H+HGPCS LT         +A  PP
Sbjct: 35  TQFHTLQISSLQPASLCTPSTASGSSKKQSTLEVIHKHGPCSILTQDKSSTTTTAAASPP 94

Query: 480 LKDILSHDQSRVESIHARLNPASNT-EKVKDKKVNLPVQPGSSLGSGNYLVSVGLGTPKK 304
           L +IL+HDQSRVESI ++L P S    K+ +KK N+P Q G SLGSGNYL+++GLGTPKK
Sbjct: 95  LSEILTHDQSRVESIQSKLKPNSKKPNKLNEKKTNIPAQSGKSLGSGNYLIAIGLGTPKK 154

Query: 303 TLSLIFDTGSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNN 124
           TL+LIFDTGSDL WTQCQPCARSCY Q+DPIFNP  S SYSNI              GNN
Sbjct: 155 TLNLIFDTGSDLMWTQCQPCARSCYTQKDPIFNPSLSGSYSNISCSSAQCSLLTSATGNN 214

Query: 123 AGC-SVGTCVYGIQYGDQSFSVGFFSKDTLTIA-NDVFPNFQF 1
            GC +  TCVYGIQYGD+SFSVGFF+KDTLTI  NDVFPNF F
Sbjct: 215 PGCTAASTCVYGIQYGDKSFSVGFFAKDTLTITPNDVFPNFLF 257


>gb|EYU29201.1| hypothetical protein MIMGU_mgv1a005225mg [Mimulus guttatus]
          Length = 492

 Score =  264 bits (675), Expect = 2e-68
 Identities = 140/229 (61%), Positives = 169/229 (73%), Gaps = 11/229 (4%)
 Frame = -3

Query: 654 ASRTKIQESNFHTIQISSLFPASICSPSKD--AIKKRPSTLEVFHRHGPCSK----LTSA 493
           AS     E ++HT++ISSL PAS+C+PS +     KR STLEV H+HGPCS+     ++A
Sbjct: 35  ASAAAAIEIHYHTLEISSLLPASVCTPSTNFKGSNKRQSTLEVLHQHGPCSRGPNNPSAA 94

Query: 492 TRPP--LKDILSHDQSRVESIHARLNPASNTE-KVKDKKVNLPVQPGSSLGSGNYLVSVG 322
           T PP  L +ILSHDQ RV+ I+AR+   S T+ ++K KKVNLPVQ G SLGSGNY+V++G
Sbjct: 95  TSPPPLLSEILSHDQIRVDKINARIKQTSYTKNQIKGKKVNLPVQSGRSLGSGNYIVTLG 154

Query: 321 LGTPKKTLSLIFDTGSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXXX 142
           LGTP+KTLSLIFDTGSDLTWTQCQPC +SCY+QQDPIFNP  S SYSN+           
Sbjct: 155 LGTPQKTLSLIFDTGSDLTWTQCQPCVKSCYQQQDPIFNPSDSTSYSNVSCNSPQCSQLS 214

Query: 141 XXXGNNAGC-SVGTCVYGIQYGDQSFSVGFFSKDTLTIA-NDVFPNFQF 1
              GN+ GC +  TCVYGIQYGDQSFSVGFFSKD LTIA N+VF +F F
Sbjct: 215 AATGNSPGCTNAATCVYGIQYGDQSFSVGFFSKDKLTIAPNEVFQDFLF 263


>dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score =  240 bits (612), Expect = 3e-61
 Identities = 126/227 (55%), Positives = 155/227 (68%), Gaps = 16/227 (7%)
 Frame = -3

Query: 633 ESNFHTIQISSLFPASICSPSKDAIKKRPSTLEVFHRHGPCSKLTS--ATRPPLKDILSH 460
           ES+FHT+Q+SSL P+S C+P+    K+R ++LEV +R GPC+ L    A  P L +IL+H
Sbjct: 42  ESHFHTLQLSSLLPSSSCNPATKG-KRRGASLEVVNRQGPCTLLNQKGAKAPTLTEILAH 100

Query: 459 DQSRVESIHARLNP------------ASNTEK-VKDKKVNLPVQPGSSLGSGNYLVSVGL 319
           DQ+RV+SI AR+              +SN +K VKD K NLP Q G  LG+GNY+V+VGL
Sbjct: 101 DQARVDSIQARITDQSYDLFKKKDKKSSNKKKSVKDSKANLPAQSGLPLGTGNYIVNVGL 160

Query: 318 GTPKKTLSLIFDTGSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXX 139
           GTPKK LSLIFDTGSDLTWTQCQPC +SCY QQ PIF+P TS +YSNI            
Sbjct: 161 GTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSTSKTYSNISCTSAACSSLKS 220

Query: 138 XXGNNAGCSVGTCVYGIQYGDQSFSVGFFSKDTLTIA-NDVFPNFQF 1
             GN+ GCS   CVYGIQYGD SF++GFF+KD LT+  NDVF  F F
Sbjct: 221 ATGNSPGCSSSNCVYGIQYGDSSFTIGFFAKDKLTLTQNDVFDGFMF 267


>dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score =  237 bits (605), Expect = 2e-60
 Identities = 125/227 (55%), Positives = 155/227 (68%), Gaps = 16/227 (7%)
 Frame = -3

Query: 633 ESNFHTIQISSLFPASICSPSKDAIKKRPSTLEVFHRHGPCSKLTS--ATRPPLKDILSH 460
           ES+FHT+Q++SL P+S C+ +    K+R ++LEV +R GPC++L    A  P L +IL+H
Sbjct: 42  ESHFHTLQLTSLLPSSSCNTATKG-KRRGASLEVVNRQGPCTQLNQKGAKAPTLTEILAH 100

Query: 459 DQSRVESIHARLNP------------ASNTEK-VKDKKVNLPVQPGSSLGSGNYLVSVGL 319
           DQ+RV+SI AR+              +SN +K VKD K NLP Q G  LG+GNY+V+VGL
Sbjct: 101 DQARVDSIQARVTDQSYDLFKKKDKKSSNKKKSVKDSKANLPAQSGLPLGTGNYIVNVGL 160

Query: 318 GTPKKTLSLIFDTGSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXX 139
           GTPKK LSLIFDTGSDLTWTQCQPC +SCY QQ PIF+P  S +YSNI            
Sbjct: 161 GTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKTYSNISCTSTACSGLKS 220

Query: 138 XXGNNAGCSVGTCVYGIQYGDQSFSVGFFSKDTLTIA-NDVFPNFQF 1
             GN+ GCS   CVYGIQYGD SF+VGFF+KDTLT+  NDVF  F F
Sbjct: 221 ATGNSPGCSSSNCVYGIQYGDSSFTVGFFAKDTLTLTQNDVFDGFMF 267


>ref|XP_004245197.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Solanum
           lycopersicum]
          Length = 501

 Score =  231 bits (590), Expect = 1e-58
 Identities = 120/228 (52%), Positives = 153/228 (67%), Gaps = 14/228 (6%)
 Frame = -3

Query: 642 KIQESNFHTIQISSLFPASICSPSKDAIKKRPSTLEVFHRHGPCSKLTSATR--PPLKDI 469
           K  ESNFHTIQ++S+ P+S C PS    K+  ++LEV ++HGPCS+L       P L ++
Sbjct: 47  KTIESNFHTIQLTSILPSSSCKPSSKG-KRGGASLEVINKHGPCSQLNKKGEKGPTLTEM 105

Query: 468 LSHDQSRVESIHARL-----NPASNTEKV------KDKKVNLPVQPGSSLGSGNYLVSVG 322
           L+HDQ+RV+SI  R+     N    TEK       KD K  LP QPG +L +GNY+V+VG
Sbjct: 106 LAHDQARVDSIQTRIAAQNFNLFRKTEKTSKKYRAKDSKTTLPAQPGIALSTGNYIVTVG 165

Query: 321 LGTPKKTLSLIFDTGSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXXX 142
           +GTPKK L+LIFDTGSDLTWTQC+PC ++C+ QQ PIFNP +S++YSNI           
Sbjct: 166 IGTPKKDLTLIFDTGSDLTWTQCEPCFKTCFPQQQPIFNPSSSSTYSNISCSSTACSGLK 225

Query: 141 XXXGNNAGCSVGTCVYGIQYGDQSFSVGFFSKDTLTI-ANDVFPNFQF 1
              GN+  CS  TCVYGIQYGD SFS+GFF+KD LT+ A DVF  F F
Sbjct: 226 SATGNSPVCSSSTCVYGIQYGDSSFSIGFFAKDRLTLSATDVFDGFMF 273


>ref|XP_007011664.1| Eukaryotic aspartyl protease family protein isoform 3, partial
           [Theobroma cacao] gi|508782027|gb|EOY29283.1| Eukaryotic
           aspartyl protease family protein isoform 3, partial
           [Theobroma cacao]
          Length = 377

 Score =  230 bits (586), Expect = 3e-58
 Identities = 116/214 (54%), Positives = 149/214 (69%), Gaps = 4/214 (1%)
 Frame = -3

Query: 630 SNFHTIQISSLFPASICSPSKDAIKKRPSTLEVFHRHGPCSKL--TSATRPPLKDILSHD 457
           SN HT+ +SSL P+S+CSPS  A+ K+ S+L+V H+HGPCS+L    A  P   ++L  D
Sbjct: 13  SNSHTVHVSSLLPSSVCSPSAKALDKK-SSLQVVHKHGPCSQLHQDKANIPTHAEVLLQD 71

Query: 456 QSRVESIHARLNPASNTEKVKDKKV-NLPVQPGSSLGSGNYLVSVGLGTPKKTLSLIFDT 280
           ++RV+SIH+RL     +  V +     LP + GS +GSGNY+V+VGLGTPKK LSL+FDT
Sbjct: 72  EARVKSIHSRLGRKPGSSDVDETDAAQLPAKDGSVVGSGNYIVTVGLGTPKKGLSLVFDT 131

Query: 279 GSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNNAGCSVGTC 100
           GSD+TWTQCQPCA+SCYKQ+DPIF P  S++YSNI              GN+ GC+   C
Sbjct: 132 GSDITWTQCQPCAKSCYKQRDPIFAPSQSSTYSNISCTSTACSSLTSATGNSPGCASSAC 191

Query: 99  VYGIQYGDQSFSVGFFSKDTLTIA-NDVFPNFQF 1
           VYGIQYGD SFSVGFF+K+ LT+   D F NF F
Sbjct: 192 VYGIQYGDSSFSVGFFAKEKLTLTPTDEFDNFLF 225


>ref|XP_006364268.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Solanum
           tuberosum]
          Length = 485

 Score =  229 bits (585), Expect = 4e-58
 Identities = 119/228 (52%), Positives = 152/228 (66%), Gaps = 14/228 (6%)
 Frame = -3

Query: 642 KIQESNFHTIQISSLFPASICSPSKDAIKKRPSTLEVFHRHGPCSKLTSATRP--PLKDI 469
           K  ESNFHTIQ++S+ P+S C PS    K+  ++LEV ++HGPCS+L         L +I
Sbjct: 31  KTIESNFHTIQLTSILPSSSCKPSSKG-KRGGTSLEVINKHGPCSQLNKKGEKGQTLTEI 89

Query: 468 LSHDQSRVESIHARL-----NPASNTEKV------KDKKVNLPVQPGSSLGSGNYLVSVG 322
           L+HDQ+RV+SI  R+     N    TEK       KD K  LP QPG++L +GNY+V++G
Sbjct: 90  LAHDQARVDSIQTRIAAQNFNLFRKTEKTSKKYRAKDSKTTLPAQPGTALSTGNYIVTIG 149

Query: 321 LGTPKKTLSLIFDTGSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXXX 142
           +GTPKK L+LIFDTGSDLTWTQC+PC ++C+ QQ PIFNP +S++YSNI           
Sbjct: 150 IGTPKKDLTLIFDTGSDLTWTQCEPCFKTCFPQQQPIFNPSSSSTYSNISCSSTACSGLK 209

Query: 141 XXXGNNAGCSVGTCVYGIQYGDQSFSVGFFSKDTLTI-ANDVFPNFQF 1
              GN   CS  TCVYGIQYGD SFS+GFF+KD LT+ A DVF  F F
Sbjct: 210 SATGNTPLCSSSTCVYGIQYGDSSFSIGFFAKDKLTLSATDVFDGFMF 257


>ref|XP_004291984.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Fragaria
           vesca subsp. vesca]
          Length = 492

 Score =  229 bits (583), Expect = 8e-58
 Identities = 115/226 (50%), Positives = 148/226 (65%), Gaps = 9/226 (3%)
 Frame = -3

Query: 651 SRTKIQESNFHTIQISSLFPASICSPSKDAIKKRPSTLEVFHRHGPCSKL------TSAT 490
           + T    +  H +Q++SL PAS CSPS     ++ ++LEV HRHGPCSK       T   
Sbjct: 36  TETPADTTKTHLLQLNSLLPASTCSPSTRGHDRKKASLEVVHRHGPCSKRNQHKTQTPTP 95

Query: 489 RPPLKDILSHDQSRVESIHARLNPASNTEKVKDKKVNLPVQPGSSLGSGNYLVSVGLGTP 310
            P   +IL  DQ+RV SIHAR++P    + ++    ++P + GS +GSGNY+V+VGLG+P
Sbjct: 96  TPTHTEILQQDQARVNSIHARVSPKKGDDDLQQSDTSIPAKSGSVVGSGNYIVTVGLGSP 155

Query: 309 KKTLSLIFDTGSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXG 130
            K LSLIFDTGSDLTWTQCQPC +SCYKQ++PIF+P  S SY+NI              G
Sbjct: 156 AKQLSLIFDTGSDLTWTQCQPCVKSCYKQKEPIFDPSLSKSYANISCNSPVCSQLISATG 215

Query: 129 NNAGCSVG--TCVYGIQYGDQSFSVGFFSKDTLTI-ANDVFPNFQF 1
           N  GCS G  TC+YGIQYGDQSFSVG+F K+ LT+ + DVF  F F
Sbjct: 216 NTPGCSSGTSTCIYGIQYGDQSFSVGYFGKERLTLTSTDVFDGFLF 261


>emb|CBI21177.3| unnamed protein product [Vitis vinifera]
          Length = 376

 Score =  229 bits (583), Expect = 8e-58
 Identities = 119/215 (55%), Positives = 146/215 (67%), Gaps = 5/215 (2%)
 Frame = -3

Query: 630 SNFHTIQISSLFPASICSPSKDAIKKRPSTLEVFHRHGPCSKLTS--ATRPPLKDILSHD 457
           S  H + I+SL P+S+CSPS     KR S LEV H+HGPCSKL+      P    +L  D
Sbjct: 39  STLHNVHITSLMPSSVCSPSPKGDDKRAS-LEVIHKHGPCSKLSQDKGRSPSRTQMLDQD 97

Query: 456 QSRVESIHARL--NPASNTEKVKDKKVNLPVQPGSSLGSGNYLVSVGLGTPKKTLSLIFD 283
           +SRV SI +RL  NPA    K+K  KV LP + GS++G+GNY+V+VGLGTPK+ L+ IFD
Sbjct: 98  ESRVNSIRSRLAKNPADGG-KLKGSKVTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFD 156

Query: 282 TGSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNNAGCSVGT 103
           TGSDLTWTQC+PCAR CY QQ+PIFNP  S SY+NI              GN+  CS  T
Sbjct: 157 TGSDLTWTQCEPCARYCYHQQEPIFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSAST 216

Query: 102 CVYGIQYGDQSFSVGFFSKDTLTI-ANDVFPNFQF 1
           CVYGIQYGDQS+SVGFF++D L + + DVF NF F
Sbjct: 217 CVYGIQYGDQSYSVGFFAQDKLALTSTDVFNNFLF 251


>ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score =  229 bits (583), Expect = 8e-58
 Identities = 119/215 (55%), Positives = 146/215 (67%), Gaps = 5/215 (2%)
 Frame = -3

Query: 630 SNFHTIQISSLFPASICSPSKDAIKKRPSTLEVFHRHGPCSKLTS--ATRPPLKDILSHD 457
           S  H + I+SL P+S+CSPS     KR S LEV H+HGPCSKL+      P    +L  D
Sbjct: 39  STLHNVHITSLMPSSVCSPSPKGDDKRAS-LEVIHKHGPCSKLSQDKGRSPSRTQMLDQD 97

Query: 456 QSRVESIHARL--NPASNTEKVKDKKVNLPVQPGSSLGSGNYLVSVGLGTPKKTLSLIFD 283
           +SRV SI +RL  NPA    K+K  KV LP + GS++G+GNY+V+VGLGTPK+ L+ IFD
Sbjct: 98  ESRVNSIRSRLAKNPADGG-KLKGSKVTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFD 156

Query: 282 TGSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNNAGCSVGT 103
           TGSDLTWTQC+PCAR CY QQ+PIFNP  S SY+NI              GN+  CS  T
Sbjct: 157 TGSDLTWTQCEPCARYCYHQQEPIFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSAST 216

Query: 102 CVYGIQYGDQSFSVGFFSKDTLTI-ANDVFPNFQF 1
           CVYGIQYGDQS+SVGFF++D L + + DVF NF F
Sbjct: 217 CVYGIQYGDQSYSVGFFAQDKLALTSTDVFNNFLF 251


>ref|XP_007011665.1| Eukaryotic aspartyl protease family protein, putative isoform 4,
           partial [Theobroma cacao] gi|508782028|gb|EOY29284.1|
           Eukaryotic aspartyl protease family protein, putative
           isoform 4, partial [Theobroma cacao]
          Length = 477

 Score =  227 bits (579), Expect = 2e-57
 Identities = 115/216 (53%), Positives = 149/216 (68%), Gaps = 4/216 (1%)
 Frame = -3

Query: 636 QESNFHTIQISSLFPASICSPSKDAIKKRPSTLEVFHRHGPCSKL--TSATRPPLKDILS 463
           Q  + HT+ +SSL P+S+CSPS  A+ K+ S+L+V H+HGPCS+L    A  P   ++L 
Sbjct: 33  QLQHSHTVHVSSLLPSSVCSPSAKALDKK-SSLQVVHKHGPCSQLHQDKANIPTHAEVLL 91

Query: 462 HDQSRVESIHARLNPASNTEKVKDKKV-NLPVQPGSSLGSGNYLVSVGLGTPKKTLSLIF 286
            D++RV+SIH+RL     +  V +     LP + GS +GSGNY+V+VGLGTPKK LSL+F
Sbjct: 92  QDEARVKSIHSRLGRKPGSSDVDETDAAQLPAKDGSVVGSGNYIVTVGLGTPKKGLSLVF 151

Query: 285 DTGSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNNAGCSVG 106
           DTGSD+TWTQCQPCA+SCYKQ+DPIF P  S++YSNI              GN+ GC+  
Sbjct: 152 DTGSDITWTQCQPCAKSCYKQRDPIFAPSQSSTYSNISCTSTACSSLTSATGNSPGCASS 211

Query: 105 TCVYGIQYGDQSFSVGFFSKDTLTIA-NDVFPNFQF 1
            CVYGIQYGD SFSVGFF+K+ LT+   D F NF F
Sbjct: 212 ACVYGIQYGDSSFSVGFFAKEKLTLTPTDEFDNFLF 247


>ref|XP_007011663.1| Eukaryotic aspartyl protease family protein, putative isoform 2,
           partial [Theobroma cacao] gi|508782026|gb|EOY29282.1|
           Eukaryotic aspartyl protease family protein, putative
           isoform 2, partial [Theobroma cacao]
          Length = 395

 Score =  227 bits (579), Expect = 2e-57
 Identities = 115/216 (53%), Positives = 149/216 (68%), Gaps = 4/216 (1%)
 Frame = -3

Query: 636 QESNFHTIQISSLFPASICSPSKDAIKKRPSTLEVFHRHGPCSKL--TSATRPPLKDILS 463
           Q  + HT+ +SSL P+S+CSPS  A+ K+ S+L+V H+HGPCS+L    A  P   ++L 
Sbjct: 29  QLQHSHTVHVSSLLPSSVCSPSAKALDKK-SSLQVVHKHGPCSQLHQDKANIPTHAEVLL 87

Query: 462 HDQSRVESIHARLNPASNTEKVKDKKV-NLPVQPGSSLGSGNYLVSVGLGTPKKTLSLIF 286
            D++RV+SIH+RL     +  V +     LP + GS +GSGNY+V+VGLGTPKK LSL+F
Sbjct: 88  QDEARVKSIHSRLGRKPGSSDVDETDAAQLPAKDGSVVGSGNYIVTVGLGTPKKGLSLVF 147

Query: 285 DTGSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNNAGCSVG 106
           DTGSD+TWTQCQPCA+SCYKQ+DPIF P  S++YSNI              GN+ GC+  
Sbjct: 148 DTGSDITWTQCQPCAKSCYKQRDPIFAPSQSSTYSNISCTSTACSSLTSATGNSPGCASS 207

Query: 105 TCVYGIQYGDQSFSVGFFSKDTLTIA-NDVFPNFQF 1
            CVYGIQYGD SFSVGFF+K+ LT+   D F NF F
Sbjct: 208 ACVYGIQYGDSSFSVGFFAKEKLTLTPTDEFDNFLF 243


>ref|XP_007011662.1| Eukaryotic aspartyl protease family protein, putative isoform 1
           [Theobroma cacao] gi|508782025|gb|EOY29281.1| Eukaryotic
           aspartyl protease family protein, putative isoform 1
           [Theobroma cacao]
          Length = 474

 Score =  227 bits (579), Expect = 2e-57
 Identities = 115/216 (53%), Positives = 149/216 (68%), Gaps = 4/216 (1%)
 Frame = -3

Query: 636 QESNFHTIQISSLFPASICSPSKDAIKKRPSTLEVFHRHGPCSKL--TSATRPPLKDILS 463
           Q  + HT+ +SSL P+S+CSPS  A+ K+ S+L+V H+HGPCS+L    A  P   ++L 
Sbjct: 30  QLQHSHTVHVSSLLPSSVCSPSAKALDKK-SSLQVVHKHGPCSQLHQDKANIPTHAEVLL 88

Query: 462 HDQSRVESIHARLNPASNTEKVKDKKV-NLPVQPGSSLGSGNYLVSVGLGTPKKTLSLIF 286
            D++RV+SIH+RL     +  V +     LP + GS +GSGNY+V+VGLGTPKK LSL+F
Sbjct: 89  QDEARVKSIHSRLGRKPGSSDVDETDAAQLPAKDGSVVGSGNYIVTVGLGTPKKGLSLVF 148

Query: 285 DTGSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNNAGCSVG 106
           DTGSD+TWTQCQPCA+SCYKQ+DPIF P  S++YSNI              GN+ GC+  
Sbjct: 149 DTGSDITWTQCQPCAKSCYKQRDPIFAPSQSSTYSNISCTSTACSSLTSATGNSPGCASS 208

Query: 105 TCVYGIQYGDQSFSVGFFSKDTLTIA-NDVFPNFQF 1
            CVYGIQYGD SFSVGFF+K+ LT+   D F NF F
Sbjct: 209 ACVYGIQYGDSSFSVGFFAKEKLTLTPTDEFDNFLF 244


>ref|XP_002324349.1| nucleoid DNA-binding family protein [Populus trichocarpa]
           gi|222865783|gb|EEF02914.1| nucleoid DNA-binding family
           protein [Populus trichocarpa]
          Length = 490

 Score =  219 bits (557), Expect = 8e-55
 Identities = 117/225 (52%), Positives = 151/225 (67%), Gaps = 11/225 (4%)
 Frame = -3

Query: 642 KIQESNF-HTIQISSLFPASICSPSKDAIKKRPS--TLEVFHRHGPCSKLT---SATRPP 481
           K+ ES+  H+I++SSL P++ C PS   +    +  +L+V H+HGPCSKL+   ++  P 
Sbjct: 39  KVAESHHSHSIEVSSLLPSASCKPSTKVLSNNDNKASLKVVHKHGPCSKLSQDEASAAPT 98

Query: 480 LKDILSHDQSRVESIHARLNPASNTEKVKDKKVN----LPVQPGSSLGSGNYLVSVGLGT 313
             +IL  DQSRV+SIH+RL+  S T   KD KV     +P + GS++GSGNY+V+VGLGT
Sbjct: 99  HTEILLQDQSRVKSIHSRLSN-SKTSGGKDVKVTDSTTIPAKDGSTVGSGNYIVTVGLGT 157

Query: 312 PKKTLSLIFDTGSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXX 133
           PKK LSLIFDTGSD+TWTQCQPCARSCYKQ++ IF+P  S SY+NI              
Sbjct: 158 PKKDLSLIFDTGSDITWTQCQPCARSCYKQKEQIFDPSQSTSYTNISCSSSICNSLTSAT 217

Query: 132 GNNAGCSVGTCVYGIQYGDQSFSVGFFSKDTLTI-ANDVFPNFQF 1
           GN  GC+   CVYGIQYGD SFSVGFF  + LT+ + D F N  F
Sbjct: 218 GNTPGCASSACVYGIQYGDSSFSVGFFGTEKLTLTSTDAFNNIYF 262


>ref|XP_006483511.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Citrus
           sinensis]
          Length = 481

 Score =  217 bits (553), Expect = 2e-54
 Identities = 114/229 (49%), Positives = 152/229 (66%), Gaps = 11/229 (4%)
 Frame = -3

Query: 654 ASRTKIQESNFHTIQISSLFPASICSPSKDAIKKRPSTLEVFHRHGPCSKLTS------A 493
           A+ ++ +  + HTIQ+SSL P+S+C+PS     K+ S+L+V H+HGPC K  S      +
Sbjct: 26  AAESQHELQHMHTIQLSSLLPSSVCNPSTKGNAKK-SSLKVVHKHGPCFKPYSNGEKAAS 84

Query: 492 TRPPLK--DILSHDQSRVESIHARL--NPASNTEKVKDKKVNLPVQPGSSLGSGNYLVSV 325
             P +   +IL  DQSRV+SIH+RL  N  S  E  +     LP + GS +G+GNY+V+V
Sbjct: 85  PSPSVSHAEILRQDQSRVKSIHSRLSKNSGSLDEIRQSDDATLPAKDGSVVGAGNYIVTV 144

Query: 324 GLGTPKKTLSLIFDTGSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXX 145
           G+GTPKK LSLIFDTGSDLTWTQC+PC + CY+Q++P F+P  S SYSN+          
Sbjct: 145 GIGTPKKDLSLIFDTGSDLTWTQCEPCVKYCYEQKEPKFDPTVSQSYSNVSCSSTICTSL 204

Query: 144 XXXXGNNAGCSVGTCVYGIQYGDQSFSVGFFSKDTLTIA-NDVFPNFQF 1
               GN+  C+  TC+YGIQYGD SFS+GFF K+TLT+   DVFPNF F
Sbjct: 205 QSATGNSPACASSTCLYGIQYGDSSFSIGFFGKETLTLTPRDVFPNFLF 253


>ref|XP_006450237.1| hypothetical protein CICLE_v10008143mg [Citrus clementina]
           gi|557553463|gb|ESR63477.1| hypothetical protein
           CICLE_v10008143mg [Citrus clementina]
          Length = 481

 Score =  217 bits (553), Expect = 2e-54
 Identities = 114/229 (49%), Positives = 152/229 (66%), Gaps = 11/229 (4%)
 Frame = -3

Query: 654 ASRTKIQESNFHTIQISSLFPASICSPSKDAIKKRPSTLEVFHRHGPCSKLTS------A 493
           A+ ++ +  + HTIQ+SSL P+S+C+PS     K+ S+L+V H+HGPC K  S      +
Sbjct: 26  AAESQHELQHMHTIQLSSLLPSSVCNPSTKGNAKK-SSLKVVHKHGPCFKPYSNGEKAAS 84

Query: 492 TRPPLK--DILSHDQSRVESIHARL--NPASNTEKVKDKKVNLPVQPGSSLGSGNYLVSV 325
             P +   +IL  DQSRV+SIH+RL  N  S  E  +     LP + GS +G+GNY+V+V
Sbjct: 85  PSPSVSHAEILRQDQSRVKSIHSRLSKNSGSLDEIRQSDDATLPAKDGSVVGAGNYIVTV 144

Query: 324 GLGTPKKTLSLIFDTGSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXX 145
           G+GTPKK LSLIFDTGSDLTWTQC+PC + CY+Q++P F+P  S SYSN+          
Sbjct: 145 GIGTPKKDLSLIFDTGSDLTWTQCEPCVKYCYEQKEPKFDPTVSQSYSNVSCSSTICTSL 204

Query: 144 XXXXGNNAGCSVGTCVYGIQYGDQSFSVGFFSKDTLTIA-NDVFPNFQF 1
               GN+  C+  TC+YGIQYGD SFS+GFF K+TLT+   DVFPNF F
Sbjct: 205 QSATGNSPACASSTCLYGIQYGDSSFSIGFFGKETLTLTPTDVFPNFLF 253


>ref|XP_006287637.1| hypothetical protein CARUB_v10000848mg [Capsella rubella]
           gi|482556343|gb|EOA20535.1| hypothetical protein
           CARUB_v10000848mg [Capsella rubella]
          Length = 481

 Score =  216 bits (551), Expect = 4e-54
 Identities = 116/227 (51%), Positives = 149/227 (65%), Gaps = 9/227 (3%)
 Frame = -3

Query: 654 ASRTKIQESNFHTI-QISSLFPASICSPSKDAIKKRP----STLEVFHRHGPCSKLTS-- 496
           A  ++ ++ ++HTI Q+SSLFP+S  S S   +  R     S+L V HRHG CS L +  
Sbjct: 26  AQESQKKDIDYHTILQVSSLFPSSSSSSSPCVLSPRATKTKSSLHVTHRHGTCSPLNNGK 85

Query: 495 ATRPPLKDILSHDQSRVESIHARLNPASNTEKV-KDKKVNLPVQPGSSLGSGNYLVSVGL 319
           ATRP   +IL  DQ+RV SIH++L+    T  V + +  +LP + GS+LGSGNY+V+VGL
Sbjct: 86  ATRPDHVEILKLDQARVNSIHSKLSKKLTTNHVGQSQSTDLPAKDGSTLGSGNYIVTVGL 145

Query: 318 GTPKKTLSLIFDTGSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXX 139
           GTPK  LSLIFDTGSDLTWTQC+PC R+CY Q++PIFNP  S+SY N+            
Sbjct: 146 GTPKHDLSLIFDTGSDLTWTQCEPCVRTCYSQKEPIFNPSKSSSYYNVSCSSPACTSLSS 205

Query: 138 XXGNNAGCSVGTCVYGIQYGDQSFSVGFFSKDTLTIAN-DVFPNFQF 1
             GN   CS  TC+YGIQYGDQSFSVGF +K+  T+ N DVF    F
Sbjct: 206 ATGNAGSCSASTCIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYF 252


>ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
           gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40
           [Arabidopsis thaliana] gi|24111269|gb|AAN46758.1|
           At5g10770/T30N20_40 [Arabidopsis thaliana]
           gi|332004211|gb|AED91594.1| aspartyl protease family
           protein [Arabidopsis thaliana]
          Length = 474

 Score =  216 bits (549), Expect = 7e-54
 Identities = 115/220 (52%), Positives = 144/220 (65%), Gaps = 8/220 (3%)
 Frame = -3

Query: 636 QESNFHTIQISSLFPAS----ICSPSKDAIKKRPSTLEVFHRHGPCSKLTS--ATRPPLK 475
           +E++ HTIQ+SSL P+S    + SP     K   S+L V HRHG CS+L +  AT P   
Sbjct: 29  RETDSHTIQVSSLLPSSSSSCVLSPRASTTK---SSLHVTHRHGTCSRLNNGKATSPDHV 85

Query: 474 DILSHDQSRVESIHARLNPASNTEKVKDKK-VNLPVQPGSSLGSGNYLVSVGLGTPKKTL 298
           +IL  DQ+RV SIH++L+    T+ V + K  +LP + GS+LGSGNY+V+VGLGTPK  L
Sbjct: 86  EILRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDL 145

Query: 297 SLIFDTGSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNNAG 118
           SLIFDTGSDLTWTQCQPC R+CY Q++PIFNP  S SY N+              GN   
Sbjct: 146 SLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGS 205

Query: 117 CSVGTCVYGIQYGDQSFSVGFFSKDTLTIAN-DVFPNFQF 1
           CS   C+YGIQYGDQSFSVGF +K+  T+ N DVF    F
Sbjct: 206 CSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYF 245


>ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata] gi|297319313|gb|EFH49735.1| hypothetical protein
           ARALYDRAFT_325615 [Arabidopsis lyrata subsp. lyrata]
          Length = 475

 Score =  215 bits (548), Expect = 9e-54
 Identities = 118/226 (52%), Positives = 146/226 (64%), Gaps = 8/226 (3%)
 Frame = -3

Query: 654 ASRTKIQESNFHTIQISSLFPAS----ICSPSKDAIKKRPSTLEVFHRHGPCSKLTS--A 493
           A   +I +S  HTIQ+SSLFPAS    + SP     K   S+L V HRHG CS+L +  A
Sbjct: 26  AQEREIDDS--HTIQVSSLFPASSSSCVLSPRASTTK---SSLHVTHRHGTCSRLNNGKA 80

Query: 492 TRPPLKDILSHDQSRVESIHARLNPASNTEKV-KDKKVNLPVQPGSSLGSGNYLVSVGLG 316
           T P   +IL  DQ+RV SIH++L+    T  V + +  +LP + GS+LGSGNY+V+VGLG
Sbjct: 81  TSPDHVEILRLDQARVNSIHSKLSKKLTTNHVSQSQSTDLPAKDGSTLGSGNYIVTVGLG 140

Query: 315 TPKKTLSLIFDTGSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXX 136
           TPK  LSLIFDTGSDLTWTQCQPC R+CY Q++PIFNP  S SY N+             
Sbjct: 141 TPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSA 200

Query: 135 XGNNAGCSVGTCVYGIQYGDQSFSVGFFSKDTLTI-ANDVFPNFQF 1
            GN   CS   C+YGIQYGDQSFSVGF +KD  T+ ++DVF    F
Sbjct: 201 TGNAGSCSASNCIYGIQYGDQSFSVGFLAKDKFTLTSSDVFDGVYF 246


>ref|XP_007225640.1| hypothetical protein PRUPE_ppa004762mg [Prunus persica]
           gi|462422576|gb|EMJ26839.1| hypothetical protein
           PRUPE_ppa004762mg [Prunus persica]
          Length = 492

 Score =  215 bits (547), Expect = 1e-53
 Identities = 112/226 (49%), Positives = 148/226 (65%), Gaps = 14/226 (6%)
 Frame = -3

Query: 636 QESNFHTIQISSLFPASICSPS---KDAIKKRPST--LEVFHRHGPCSKLTS--ATRPPL 478
           +  + HT++++SL PA+ CS S   K  + K  S+  L+V H+HGPCS+L    +  P  
Sbjct: 37  EREHAHTVEVNSLLPATTCSSSSSTKGHMSKHASSSVLKVVHKHGPCSRLKKHKSKTPTH 96

Query: 477 KDILSHDQSRVESIHARLNPASNTEKVKDKK----VNLPVQPGSSLGSGNYLVSVGLGTP 310
             IL  DQ+RV SIH+R+N     + V D +      +P Q GS +G+GNY+V+VGLG+P
Sbjct: 97  AQILQQDQARVNSIHSRVNSKKQLKSVDDLRESAATTIPAQSGSVVGAGNYIVNVGLGSP 156

Query: 309 KKTLSLIFDTGSDLTWTQCQPCARSCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXG 130
           KK LSLIFDTGSDLTWTQC+PC +SCYKQ++PIF+P  SASY+N+              G
Sbjct: 157 KKQLSLIFDTGSDLTWTQCRPCVKSCYKQKEPIFDPSLSASYANVSCTSATCTQLGSATG 216

Query: 129 NNAGC--SVGTCVYGIQYGDQSFSVGFFSKDTLTIAN-DVFPNFQF 1
           N  GC  S  TC+YGIQYGDQSFSVG+F K+ L++ N DVF  F F
Sbjct: 217 NTPGCTASTSTCIYGIQYGDQSFSVGYFGKEKLSLTNTDVFDGFLF 262

