BLASTX nr result

ID: Mentha22_contig00012464 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00012464
         (1082 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU22027.1| hypothetical protein MIMGU_mgv1a005281mg [Mimulus...   431   e-118
gb|EYU29201.1| hypothetical protein MIMGU_mgv1a005225mg [Mimulus...   423   e-116
ref|XP_004245197.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   396   e-108
dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (...   394   e-107
ref|XP_006364268.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   393   e-107
dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein ...   391   e-106
ref|XP_006483511.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   376   e-101
ref|XP_006450237.1| hypothetical protein CICLE_v10008143mg [Citr...   375   e-101
ref|XP_007011665.1| Eukaryotic aspartyl protease family protein,...   368   3e-99
ref|XP_007011662.1| Eukaryotic aspartyl protease family protein,...   368   3e-99
ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2...   368   3e-99
ref|XP_002324349.1| nucleoid DNA-binding family protein [Populus...   364   3e-98
ref|XP_004291984.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   363   9e-98
ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor,...   361   3e-97
ref|XP_006287637.1| hypothetical protein CARUB_v10000848mg [Caps...   357   5e-96
ref|XP_007225640.1| hypothetical protein PRUPE_ppa004762mg [Prun...   355   2e-95
ref|XP_003551807.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   353   9e-95
emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein ...   350   4e-94
ref|NP_196638.2| aspartyl protease family protein [Arabidopsis t...   350   4e-94
ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arab...   349   1e-93

>gb|EYU22027.1| hypothetical protein MIMGU_mgv1a005281mg [Mimulus guttatus]
          Length = 490

 Score =  431 bits (1107), Expect = e-118
 Identities = 227/368 (61%), Positives = 258/368 (70%), Gaps = 8/368 (2%)
 Frame = -1

Query: 1082 SIHARLN-RASNTEKVEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLT 906
            SI ++L   +    K+ +KK N+P Q G SLGSGNYL+++GLGTPKKTL LIFDTGSDL 
Sbjct: 108  SIQSKLKPNSKKPNKLNEKKTNIPAQSGKSLGSGNYLIAIGLGTPKKTLNLIFDTGSDLM 167

Query: 905  WTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARC-SVGTCVYGI 729
            WTQCQPCA SCY Q+DPIFNP  S SYSNI              GN+  C +  TCVYGI
Sbjct: 168  WTQCQPCARSCYTQKDPIFNPSLSGSYSNISCSSAQCSLLTSATGNNPGCTAASTCVYGI 227

Query: 728  QYGDQSFSVGFFSKDTLTIA-NDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQT 552
            QYGD+SFSVGFF+KDTLTI  NDVFPNF FGCGQNNQGLFG TAGL+GLGRD LS++SQT
Sbjct: 228  QYGDKSFSVGFFAKDTLTITPNDVFPNFLFGCGQNNQGLFGNTAGLLGLGRDSLSLVSQT 287

Query: 551  AAKYGKYFSYCLP-----XXXXXXXXXXXXXXGATKNVQFTPLDSSQGNSFYFISIVSLA 387
            + KYGKYFSYCLP                     +  V+FTP  +SQG+SFYFI IVS++
Sbjct: 288  SQKYGKYFSYCLPSTSSSTGHLTLGKNNGGAALTSSTVKFTPFATSQGSSFYFIDIVSIS 347

Query: 386  VGGRQLAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTC 207
            VGG QL IGQSVFK +GAIIDSGTVI+R           AF+Q M +Y +APAYSILDTC
Sbjct: 348  VGGAQLPIGQSVFKAAGAIIDSGTVISRLPPAAYSAMSSAFRQQMKQYTSAPAYSILDTC 407

Query: 206  FDFSNYXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQ 27
            FDF N                ++VDL  SGILVAVSSTQACLAFAGNGDA DVGIFGNTQ
Sbjct: 408  FDFGNLTSVSIPTISFVFSGNLRVDLHPSGILVAVSSTQACLAFAGNGDAGDVGIFGNTQ 467

Query: 26   QLTYEVVY 3
            Q T EVVY
Sbjct: 468  QKTLEVVY 475


>gb|EYU29201.1| hypothetical protein MIMGU_mgv1a005225mg [Mimulus guttatus]
          Length = 492

 Score =  423 bits (1088), Expect = e-116
 Identities = 225/363 (61%), Positives = 261/363 (71%), Gaps = 4/363 (1%)
 Frame = -1

Query: 1079 IHARLNRASNTE-KVEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLTW 903
            I+AR+ + S T+ +++ KKVNLPVQ G SLGSGNY+V++GLGTP+KTL+LIFDTGSDLTW
Sbjct: 115  INARIKQTSYTKNQIKGKKVNLPVQSGRSLGSGNYIVTLGLGTPQKTLSLIFDTGSDLTW 174

Query: 902  TQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARC-SVGTCVYGIQ 726
            TQCQPC +SCY+QQDPIFNP  S SYSN+              GNS  C +  TCVYGIQ
Sbjct: 175  TQCQPCVKSCYQQQDPIFNPSDSTSYSNVSCNSPQCSQLSAATGNSPGCTNAATCVYGIQ 234

Query: 725  YGDQSFSVGFFSKDTLTIA-NDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTA 549
            YGDQSFSVGFFSKD LTIA N+VF +F FGCGQNNQGLFG TAGL+GLGRD LS+ISQTA
Sbjct: 235  YGDQSFSVGFFSKDKLTIAPNEVFQDFLFGCGQNNQGLFGNTAGLLGLGRDKLSIISQTA 294

Query: 548  AKYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTP-LDSSQGNSFYFISIVSLAVGGRQ 372
             KYGKYFSYCLP              G ++NV+FTP + + QG+SFYFI+IVS++VGGRQ
Sbjct: 295  QKYGKYFSYCLPSTSSSTGHLTLGKTGNSRNVKFTPFVTNQQGSSFYFINIVSISVGGRQ 354

Query: 371  LAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSN 192
            LAI  SVFK  G IIDSGTVI+R           AFK+ M KY  APAYSILDTC+D S 
Sbjct: 355  LAISGSVFKAGGTIIDSGTVISRIPPTAYSALSGAFKKMMAKYKRAPAYSILDTCYDLSG 414

Query: 191  YXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYE 12
            Y               V+VDL  SGI+VAV  T+ CLAFAGN D  DVGIFGN+QQ T E
Sbjct: 415  YTSVTVPTVSFTFGGNVRVDLDPSGIIVAVGGTRVCLAFAGNSDDGDVGIFGNSQQKTLE 474

Query: 11   VVY 3
            VVY
Sbjct: 475  VVY 477


>ref|XP_004245197.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Solanum
            lycopersicum]
          Length = 501

 Score =  396 bits (1018), Expect = e-108
 Identities = 195/361 (54%), Positives = 243/361 (67%), Gaps = 1/361 (0%)
 Frame = -1

Query: 1082 SIHARLNRASNTEKVEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLTW 903
            ++  +  + S   + +D K  LP Q G +L +GNY+V+VG+GTPKK LTLIFDTGSDLTW
Sbjct: 126  NLFRKTEKTSKKYRAKDSKTTLPAQPGIALSTGNYIVTVGIGTPKKDLTLIFDTGSDLTW 185

Query: 902  TQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQY 723
            TQC+PC ++C+ QQ PIFNP +S++YSNI              GNS  CS  TCVYGIQY
Sbjct: 186  TQCEPCFKTCFPQQQPIFNPSSSSTYSNISCSSTACSGLKSATGNSPVCSSSTCVYGIQY 245

Query: 722  GDQSFSVGFFSKDTLTI-ANDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTAA 546
            GD SFS+GFF+KD LT+ A DVF  F FGCGQ+N+GLFG+TAGLIGLGRDPLS++SQT+A
Sbjct: 246  GDSSFSIGFFAKDRLTLSATDVFDGFMFGCGQDNKGLFGKTAGLIGLGRDPLSIVSQTSA 305

Query: 545  KYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDSSQGNSFYFISIVSLAVGGRQLA 366
            K+GKYFSYCLP              GA  N+QFTP  SSQG SFYFI ++ ++VGG+ LA
Sbjct: 306  KFGKYFSYCLPTRRGSNGHLSFGKNGAKSNLQFTPFASSQGTSFYFIDVLGISVGGKSLA 365

Query: 365  IGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSNYX 186
            I   VFK +G IIDSGTVITR            F++ M+KYP AP  S+LDTC+D SNY 
Sbjct: 366  ISPMVFKNAGTIIDSGTVITRLPSTAYSNLRATFREFMSKYPRAPDLSLLDTCYDLSNYT 425

Query: 185  XXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYEVV 6
                           K+D+  +GI +   ++Q CLAFAGNGD   +GIFGNTQQ T E+V
Sbjct: 426  TISIPKISFNFNGNTKMDIVPNGIFIVNGASQVCLAFAGNGDDDSIGIFGNTQQQTMEIV 485

Query: 5    Y 3
            Y
Sbjct: 486  Y 486


>dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
            sylvestris]
          Length = 502

 Score =  394 bits (1011), Expect = e-107
 Identities = 201/362 (55%), Positives = 239/362 (66%), Gaps = 9/362 (2%)
 Frame = -1

Query: 1061 RASNTEK-VEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLTWTQCQPC 885
            ++SN +K V+D K NLP Q G  LG+GNY+V+VGLGTPKK L+LIFDTGSDLTWTQCQPC
Sbjct: 126  KSSNKKKSVKDSKANLPAQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPC 185

Query: 884  AESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQYGDQSFS 705
             +SCY QQ PIF+P  S +YSNI              GNS  CS   CVYGIQYGD SF+
Sbjct: 186  VKSCYAQQQPIFDPSASKTYSNISCTSTACSGLKSATGNSPGCSSSNCVYGIQYGDSSFT 245

Query: 704  VGFFSKDTLTIA-NDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTAAKYGKYF 528
            VGFF+KDTLT+  NDVF  F FGCGQNN+GLFG+TAGLIGLGRDPLS++ QTA K+GKYF
Sbjct: 246  VGFFAKDTLTLTQNDVFDGFMFGCGQNNRGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYF 305

Query: 527  SYCLPXXXXXXXXXXXXXXGATK-------NVQFTPLDSSQGNSFYFISIVSLAVGGRQL 369
            SYCLP                 K        + FTP  SSQG +FYFI ++ ++VGG+ L
Sbjct: 306  SYCLPTSRGSNGHLTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKAL 365

Query: 368  AIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSNY 189
            +I   +F+ +G IIDSGTVITR            FKQ M+KYPTAPA S+LDTC+D SNY
Sbjct: 366  SISPMLFQNAGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLDTCYDLSNY 425

Query: 188  XXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYEV 9
                             VDL  +GIL+   ++Q CLAFAGNGD   +GIFGN QQ T EV
Sbjct: 426  TSISIPKISFNFNGNANVDLEPNGILITNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEV 485

Query: 8    VY 3
            VY
Sbjct: 486  VY 487


>ref|XP_006364268.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Solanum
            tuberosum]
          Length = 485

 Score =  393 bits (1009), Expect = e-107
 Identities = 193/361 (53%), Positives = 242/361 (67%), Gaps = 1/361 (0%)
 Frame = -1

Query: 1082 SIHARLNRASNTEKVEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLTW 903
            ++  +  + S   + +D K  LP Q G++L +GNY+V++G+GTPKK LTLIFDTGSDLTW
Sbjct: 110  NLFRKTEKTSKKYRAKDSKTTLPAQPGTALSTGNYIVTIGIGTPKKDLTLIFDTGSDLTW 169

Query: 902  TQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQY 723
            TQC+PC ++C+ QQ PIFNP +S++YSNI              GN+  CS  TCVYGIQY
Sbjct: 170  TQCEPCFKTCFPQQQPIFNPSSSSTYSNISCSSTACSGLKSATGNTPLCSSSTCVYGIQY 229

Query: 722  GDQSFSVGFFSKDTLTI-ANDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTAA 546
            GD SFS+GFF+KD LT+ A DVF  F FGCGQ+N+GLFG+TAGLIGLGRDPLS++SQT+A
Sbjct: 230  GDSSFSIGFFAKDKLTLSATDVFDGFMFGCGQDNKGLFGKTAGLIGLGRDPLSIVSQTSA 289

Query: 545  KYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDSSQGNSFYFISIVSLAVGGRQLA 366
            K+GKYFSYCLP              GA  N+QFTP  SSQG SFYFI ++ ++VGG+ LA
Sbjct: 290  KFGKYFSYCLPTRRGSNGHLTFGKNGAKSNLQFTPFASSQGTSFYFIDVLGISVGGKALA 349

Query: 365  IGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSNYX 186
            I   VFK +G IIDSGTVITR            F++ M+KYP AP  S+LDTC+D SNY 
Sbjct: 350  ISPMVFKNAGTIIDSGTVITRLPSTAYANMRATFREFMSKYPRAPDLSLLDTCYDLSNYT 409

Query: 185  XXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYEVV 6
                           K+DL  +GI     ++Q CLAFA NGD   +GIFGNTQQ T E+V
Sbjct: 410  TVSIPKISFNFNGNTKMDLVPNGIFFVNGASQVCLAFASNGDDDSIGIFGNTQQQTMEIV 469

Query: 5    Y 3
            Y
Sbjct: 470  Y 470


>dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score =  391 bits (1005), Expect = e-106
 Identities = 199/362 (54%), Positives = 240/362 (66%), Gaps = 9/362 (2%)
 Frame = -1

Query: 1061 RASNTEK-VEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLTWTQCQPC 885
            ++SN +K V+D K NLP Q G  LG+GNY+V+VGLGTPKK L+LIFDTGSDLTWTQCQPC
Sbjct: 126  KSSNKKKSVKDSKANLPAQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPC 185

Query: 884  AESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQYGDQSFS 705
             +SCY QQ PIF+P TS +YSNI              GNS  CS   CVYGIQYGD SF+
Sbjct: 186  VKSCYAQQQPIFDPSTSKTYSNISCTSAACSSLKSATGNSPGCSSSNCVYGIQYGDSSFT 245

Query: 704  VGFFSKDTLTIA-NDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTAAKYGKYF 528
            +GFF+KD LT+  NDVF  F FGCGQNN+GLFG+TAGLIGLGRDPLS++ QTA K+GKYF
Sbjct: 246  IGFFAKDKLTLTQNDVFDGFMFGCGQNNKGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYF 305

Query: 527  SYCLPXXXXXXXXXXXXXXGATK-------NVQFTPLDSSQGNSFYFISIVSLAVGGRQL 369
            SYCLP                 K        + FTP  SSQG ++YFI ++ ++VGG+ L
Sbjct: 306  SYCLPTSRGSNGHLTFGNGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKAL 365

Query: 368  AIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSNY 189
            +I   +F+ +G IIDSGTVITR           AFKQ M+KYPTAPA S+LDTC+D SNY
Sbjct: 366  SISPMLFQNAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTCYDLSNY 425

Query: 188  XXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYEV 9
                             V+L  +GIL+   ++Q CLAFAGNGD   +GIFGN QQ T EV
Sbjct: 426  TSISIPKISFNFNGNANVELDPNGILITNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEV 485

Query: 8    VY 3
            VY
Sbjct: 486  VY 487


>ref|XP_006483511.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Citrus
            sinensis]
          Length = 481

 Score =  376 bits (965), Expect = e-101
 Identities = 197/364 (54%), Positives = 249/364 (68%), Gaps = 4/364 (1%)
 Frame = -1

Query: 1082 SIHARLNRASNT--EKVEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDL 909
            SIH+RL++ S +  E  +     LP + GS +G+GNY+V+VG+GTPKK L+LIFDTGSDL
Sbjct: 104  SIHSRLSKNSGSLDEIRQSDDATLPAKDGSVVGAGNYIVTVGIGTPKKDLSLIFDTGSDL 163

Query: 908  TWTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGI 729
            TWTQC+PC + CY+Q++P F+P  S SYSN+              GNS  C+  TC+YGI
Sbjct: 164  TWTQCEPCVKYCYEQKEPKFDPTVSQSYSNVSCSSTICTSLQSATGNSPACASSTCLYGI 223

Query: 728  QYGDQSFSVGFFSKDTLTIA-NDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQT 552
            QYGD SFS+GFF K+TLT+   DVFPNF FGCGQNN+GLFG  AGL+GLGRDP+S++SQT
Sbjct: 224  QYGDSSFSIGFFGKETLTLTPRDVFPNFLFGCGQNNRGLFGGAAGLMGLGRDPISLVSQT 283

Query: 551  AAKYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGR 375
            A KY K FSYCLP              GA+K+VQFTPL S S G+SFY + ++ ++VGG+
Sbjct: 284  ATKYKKLFSYCLP-SSASSTGHLTFGPGASKSVQFTPLSSISGGSSFYGLEMIGISVGGQ 342

Query: 374  QLAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFS 195
            +L+I  SVF T+G IIDSGTVITR           AF+Q M+KYPTAPA S+LDTC+DFS
Sbjct: 343  KLSIAASVFTTAGTIIDSGTVITRLPPDAYTPLRTAFRQFMSKYPTAPALSLLDTCYDFS 402

Query: 194  NYXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTY 15
             Y               V+V +  +GI+ A + +Q CLAFAGN D +DV IFGNTQQ T 
Sbjct: 403  KYSTVTLPQISLFFSGGVEVSVDKTGIMYASNISQVCLAFAGNSDPTDVSIFGNTQQHTL 462

Query: 14   EVVY 3
            EVVY
Sbjct: 463  EVVY 466


>ref|XP_006450237.1| hypothetical protein CICLE_v10008143mg [Citrus clementina]
            gi|557553463|gb|ESR63477.1| hypothetical protein
            CICLE_v10008143mg [Citrus clementina]
          Length = 481

 Score =  375 bits (964), Expect = e-101
 Identities = 197/364 (54%), Positives = 248/364 (68%), Gaps = 4/364 (1%)
 Frame = -1

Query: 1082 SIHARLNRASNT--EKVEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDL 909
            SIH+RL++ S +  E  +     LP + GS +G+GNY+V+VG+GTPKK L+LIFDTGSDL
Sbjct: 104  SIHSRLSKNSGSLDEIRQSDDATLPAKDGSVVGAGNYIVTVGIGTPKKDLSLIFDTGSDL 163

Query: 908  TWTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGI 729
            TWTQC+PC + CY+Q++P F+P  S SYSN+              GNS  C+  TC+YGI
Sbjct: 164  TWTQCEPCVKYCYEQKEPKFDPTVSQSYSNVSCSSTICTSLQSATGNSPACASSTCLYGI 223

Query: 728  QYGDQSFSVGFFSKDTLTIA-NDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQT 552
            QYGD SFS+GFF K+TLT+   DVFPNF FGCGQNN GLFG  AGL+GLGRDP+S++SQT
Sbjct: 224  QYGDSSFSIGFFGKETLTLTPTDVFPNFLFGCGQNNHGLFGGAAGLMGLGRDPISLVSQT 283

Query: 551  AAKYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGR 375
            A KY K FSYCLP              GA+K+VQFTPL S S G+SFY + ++ ++VGG+
Sbjct: 284  ATKYKKLFSYCLP-SSASSTGHLTFGPGASKSVQFTPLSSISGGSSFYGLEMIGISVGGQ 342

Query: 374  QLAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFS 195
            +L+I  SVF T+G IIDSGTVITR           AF+Q M+KYPTAPA S+LDTC+DFS
Sbjct: 343  KLSIAASVFTTAGTIIDSGTVITRLPPDAYTPLRTAFRQFMSKYPTAPALSLLDTCYDFS 402

Query: 194  NYXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTY 15
             Y               V+V +  +GI+ A + +Q CLAFAGN D +DV IFGNTQQ T 
Sbjct: 403  KYSTVTLPQISLFFSGGVEVSVDKTGIMYASNISQVCLAFAGNSDPTDVSIFGNTQQHTL 462

Query: 14   EVVY 3
            EVVY
Sbjct: 463  EVVY 466


>ref|XP_007011665.1| Eukaryotic aspartyl protease family protein, putative isoform 4,
            partial [Theobroma cacao] gi|508782028|gb|EOY29284.1|
            Eukaryotic aspartyl protease family protein, putative
            isoform 4, partial [Theobroma cacao]
          Length = 477

 Score =  368 bits (944), Expect = 3e-99
 Identities = 197/364 (54%), Positives = 243/364 (66%), Gaps = 4/364 (1%)
 Frame = -1

Query: 1082 SIHARLNRASNTEKVEDKKV-NLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLT 906
            SIH+RL R   +  V++     LP + GS +GSGNY+V+VGLGTPKK L+L+FDTGSD+T
Sbjct: 99   SIHSRLGRKPGSSDVDETDAAQLPAKDGSVVGSGNYIVTVGLGTPKKGLSLVFDTGSDIT 158

Query: 905  WTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQ 726
            WTQCQPCA+SCYKQ+DPIF P  S++YSNI              GNS  C+   CVYGIQ
Sbjct: 159  WTQCQPCAKSCYKQRDPIFAPSQSSTYSNISCTSTACSSLTSATGNSPGCASSACVYGIQ 218

Query: 725  YGDQSFSVGFFSKDTLTIA-NDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTA 549
            YGD SFSVGFF+K+ LT+   D F NF FGCGQNNQGLFG +AGL+GLGRD LS+ SQTA
Sbjct: 219  YGDSSFSVGFFAKEKLTLTPTDEFDNFLFGCGQNNQGLFGGSAGLLGLGRDQLSLPSQTA 278

Query: 548  AKYGKYFSYCLP-XXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGR 375
            +KY K+FSYCLP               G +K+V+FT L + SQG SFY I I  ++VGG+
Sbjct: 279  SKYKKFFSYCLPSSASSDGFLAFGYGGGVSKSVKFTTLSTVSQGESFYGIDITGISVGGQ 338

Query: 374  QLAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFS 195
            +L+I  S+F T+G IIDSGTVITR           +F+Q MT+YP A A +ILDTC+DFS
Sbjct: 339  KLSISASLFTTAGTIIDSGTVITRLPPTAYAALRSSFRQKMTQYPRAQALAILDTCYDFS 398

Query: 194  NYXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTY 15
             Y               V+V +   GIL A S +Q CLAFAGN D +D+GI GNTQQ T 
Sbjct: 399  KYSSVSIPKISFFFSGGVEVPIDAKGILYANSISQVCLAFAGNSDDTDIGIVGNTQQKTL 458

Query: 14   EVVY 3
            +VVY
Sbjct: 459  QVVY 462


>ref|XP_007011662.1| Eukaryotic aspartyl protease family protein, putative isoform 1
            [Theobroma cacao] gi|508782025|gb|EOY29281.1| Eukaryotic
            aspartyl protease family protein, putative isoform 1
            [Theobroma cacao]
          Length = 474

 Score =  368 bits (944), Expect = 3e-99
 Identities = 197/364 (54%), Positives = 243/364 (66%), Gaps = 4/364 (1%)
 Frame = -1

Query: 1082 SIHARLNRASNTEKVEDKKV-NLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLT 906
            SIH+RL R   +  V++     LP + GS +GSGNY+V+VGLGTPKK L+L+FDTGSD+T
Sbjct: 96   SIHSRLGRKPGSSDVDETDAAQLPAKDGSVVGSGNYIVTVGLGTPKKGLSLVFDTGSDIT 155

Query: 905  WTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQ 726
            WTQCQPCA+SCYKQ+DPIF P  S++YSNI              GNS  C+   CVYGIQ
Sbjct: 156  WTQCQPCAKSCYKQRDPIFAPSQSSTYSNISCTSTACSSLTSATGNSPGCASSACVYGIQ 215

Query: 725  YGDQSFSVGFFSKDTLTIA-NDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTA 549
            YGD SFSVGFF+K+ LT+   D F NF FGCGQNNQGLFG +AGL+GLGRD LS+ SQTA
Sbjct: 216  YGDSSFSVGFFAKEKLTLTPTDEFDNFLFGCGQNNQGLFGGSAGLLGLGRDQLSLPSQTA 275

Query: 548  AKYGKYFSYCLP-XXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGR 375
            +KY K+FSYCLP               G +K+V+FT L + SQG SFY I I  ++VGG+
Sbjct: 276  SKYKKFFSYCLPSSASSDGFLAFGYGGGVSKSVKFTTLSTVSQGESFYGIDITGISVGGQ 335

Query: 374  QLAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFS 195
            +L+I  S+F T+G IIDSGTVITR           +F+Q MT+YP A A +ILDTC+DFS
Sbjct: 336  KLSISASLFTTAGTIIDSGTVITRLPPTAYAALRSSFRQKMTQYPRAQALAILDTCYDFS 395

Query: 194  NYXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTY 15
             Y               V+V +   GIL A S +Q CLAFAGN D +D+GI GNTQQ T 
Sbjct: 396  KYSSVSIPKISFFFSGGVEVPIDAKGILYANSISQVCLAFAGNSDDTDIGIVGNTQQKTL 455

Query: 14   EVVY 3
            +VVY
Sbjct: 456  QVVY 459


>ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score =  368 bits (944), Expect = 3e-99
 Identities = 194/364 (53%), Positives = 245/364 (67%), Gaps = 4/364 (1%)
 Frame = -1

Query: 1082 SIHARLNR-ASNTEKVEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLT 906
            SI +RL +  ++  K++  KV LP + GS++G+GNY+V+VGLGTPK+ LT IFDTGSDLT
Sbjct: 103  SIRSRLAKNPADGGKLKGSKVTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLT 162

Query: 905  WTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQ 726
            WTQC+PCA  CY QQ+PIFNP  S SY+NI              GNS  CS  TCVYGIQ
Sbjct: 163  WTQCEPCARYCYHQQEPIFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQ 222

Query: 725  YGDQSFSVGFFSKDTLTI-ANDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTA 549
            YGDQS+SVGFF++D L + + DVF NF FGCGQNN+GLF   AGLIGLGR+ LS++SQTA
Sbjct: 223  YGDQSYSVGFFAQDKLALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLVSQTA 282

Query: 548  AKYGKYFSYCLPXXXXXXXXXXXXXXGAT-KNVQFTP-LDSSQGNSFYFISIVSLAVGGR 375
             KYGK FSYCLP              G T K V+FTP L +SQG SFYF+++++++VGGR
Sbjct: 283  QKYGKLFSYCLPSTSSSTGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGR 342

Query: 374  QLAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFS 195
            +L+   SVF T+G IIDSGTVI+R           +F+Q M+KYP A   SILDTC+DFS
Sbjct: 343  KLSTSASVFSTAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYDFS 402

Query: 194  NYXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTY 15
             Y                ++DL  SGI   ++ +Q CLAFAGN DA+D+ I GN QQ T+
Sbjct: 403  QYDTVDVPKINLYFSDGAEMDLDPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTF 462

Query: 14   EVVY 3
            +VVY
Sbjct: 463  DVVY 466


>ref|XP_002324349.1| nucleoid DNA-binding family protein [Populus trichocarpa]
            gi|222865783|gb|EEF02914.1| nucleoid DNA-binding family
            protein [Populus trichocarpa]
          Length = 490

 Score =  364 bits (935), Expect = 3e-98
 Identities = 197/366 (53%), Positives = 240/366 (65%), Gaps = 6/366 (1%)
 Frame = -1

Query: 1082 SIHARLNRASNTEKVEDKKVN----LPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGS 915
            SIH+RL+  S T   +D KV     +P + GS++GSGNY+V+VGLGTPKK L+LIFDTGS
Sbjct: 112  SIHSRLSN-SKTSGGKDVKVTDSTTIPAKDGSTVGSGNYIVTVGLGTPKKDLSLIFDTGS 170

Query: 914  DLTWTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVY 735
            D+TWTQCQPCA SCYKQ++ IF+P  S SY+NI              GN+  C+   CVY
Sbjct: 171  DITWTQCQPCARSCYKQKEQIFDPSQSTSYTNISCSSSICNSLTSATGNTPGCASSACVY 230

Query: 734  GIQYGDQSFSVGFFSKDTLTI-ANDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVIS 558
            GIQYGD SFSVGFF  + LT+ + D F N  FGCGQNNQGLFG +AGL+GLGRD LSV+S
Sbjct: 231  GIQYGDSSFSVGFFGTEKLTLTSTDAFNNIYFGCGQNNQGLFGGSAGLLGLGRDKLSVVS 290

Query: 557  QTAAKYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVG 381
            QTA KY K FSYCLP               A+KN +FTPL + S G SFY +    ++VG
Sbjct: 291  QTAQKYNKIFSYCLP-SSSSSTGFLTFGGSASKNAKFTPLSTISAGPSFYGLDFTGISVG 349

Query: 380  GRQLAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFD 201
            G++LAI  SVF T+GAIIDSGTVITR           +F+  M+KYP   A SILDTC+D
Sbjct: 350  GKKLAISASVFSTAGAIIDSGTVITRLPPAAYSALRASFRNLMSKYPMTKALSILDTCYD 409

Query: 200  FSNYXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQL 21
            FS+Y               ++VD+  +GIL A S +Q CLAFAGN DA+DV IFGN QQ 
Sbjct: 410  FSSYTTISVPKIGFSFSSGIEVDIDATGILYASSLSQVCLAFAGNSDATDVFIFGNVQQK 469

Query: 20   TYEVVY 3
            T EV Y
Sbjct: 470  TLEVFY 475


>ref|XP_004291984.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Fragaria
            vesca subsp. vesca]
          Length = 492

 Score =  363 bits (931), Expect = 9e-98
 Identities = 192/366 (52%), Positives = 241/366 (65%), Gaps = 6/366 (1%)
 Frame = -1

Query: 1082 SIHARLNRASNTEKVEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLTW 903
            SIHAR++     + ++    ++P + GS +GSGNY+V+VGLG+P K L+LIFDTGSDLTW
Sbjct: 112  SIHARVSPKKGDDDLQQSDTSIPAKSGSVVGSGNYIVTVGLGSPAKQLSLIFDTGSDLTW 171

Query: 902  TQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVG--TCVYGI 729
            TQCQPC +SCYKQ++PIF+P  S SY+NI              GN+  CS G  TC+YGI
Sbjct: 172  TQCQPCVKSCYKQKEPIFDPSLSKSYANISCNSPVCSQLISATGNTPGCSSGTSTCIYGI 231

Query: 728  QYGDQSFSVGFFSKDTLTI-ANDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQT 552
            QYGDQSFSVG+F K+ LT+ + DVF  F FGCGQNNQGLFG +AGL+GLGR+ +S++ Q+
Sbjct: 232  QYGDQSFSVGYFGKERLTLTSTDVFDGFLFGCGQNNQGLFGGSAGLLGLGRNKISLVEQS 291

Query: 551  AAKYGKYFSYCLP--XXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVG 381
            A KYG+YFSYCLP                G++  V+FTPL + SQG SFY +S+V ++VG
Sbjct: 292  APKYGRYFSYCLPSTSSSTGYLSFGRGGGGSSSAVKFTPLSTVSQGGSFYGLSVVGISVG 351

Query: 380  GRQLAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFD 201
            GRQL+I  SVF +SG IIDSGTVITR           AF+QGM  YP A A SILDTC+D
Sbjct: 352  GRQLSIPASVFSSSGTIIDSGTVITRLPATAYSALRDAFRQGMKSYPQAEALSILDTCYD 411

Query: 200  FSNYXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQL 21
             S                 V +DL  +GIL   S +Q CLAFAGN D SD+ IFGN QQ 
Sbjct: 412  LSGSKTVSYPKIAFAFGGGVTLDLDATGILYVASVSQVCLAFAGNSDDSDIAIFGNVQQK 471

Query: 20   TYEVVY 3
              +VVY
Sbjct: 472  RLQVVY 477


>ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
            communis] gi|223545332|gb|EEF46837.1| Aspartic proteinase
            nepenthesin-2 precursor, putative [Ricinus communis]
          Length = 494

 Score =  361 bits (926), Expect = 3e-97
 Identities = 192/363 (52%), Positives = 234/363 (64%), Gaps = 3/363 (0%)
 Frame = -1

Query: 1082 SIHARLNRASNTEKVE-DKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLT 906
            SIH++L++ S    V+      LP + GS +GSGNY V+VGLGTPKK  +LIFDTGSDLT
Sbjct: 118  SIHSKLSKDSGLSDVKATAATTLPAKDGSIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLT 177

Query: 905  WTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQ 726
            WTQC+PC +SCY Q++ IFNP  S SY+NI              GN   C+  TCVYGIQ
Sbjct: 178  WTQCEPCVKSCYNQKEAIFNPSQSTSYANISCGSTLCDSLASATGNIFNCASSTCVYGIQ 237

Query: 725  YGDQSFSVGFFSKDTLTI-ANDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTA 549
            YGD SFS+GFF K+ L++ A DVF +F FGCGQNN+GLFG  AGL+GLGRD LS++SQTA
Sbjct: 238  YGDSSFSIGFFGKEKLSLTATDVFNDFYFGCGQNNKGLFGGAAGLLGLGRDKLSLVSQTA 297

Query: 548  AKYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGRQ 372
             +Y K FSYCLP                +K+  FTPL + S G+SFY + +  ++VGGR+
Sbjct: 298  QRYNKIFSYCLP-SSSSSTGFLTFGGSTSKSASFTPLATISGGSSFYGLDLTGISVGGRK 356

Query: 371  LAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSN 192
            LAI  SVF T+G IIDSGTVITR            F++ M++YP APA SILDTCFDFSN
Sbjct: 357  LAISPSVFSTAGTIIDSGTVITRLPPAAYSALSSTFRKLMSQYPAAPALSILDTCFDFSN 416

Query: 191  YXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYE 12
            +               V VD+  +GI      TQ CLAFAGN DASDV IFGN QQ T E
Sbjct: 417  HDTISVPKIGLFFSGGVVVDIDKTGIFYVNDLTQVCLAFAGNSDASDVAIFGNVQQKTLE 476

Query: 11   VVY 3
            VVY
Sbjct: 477  VVY 479


>ref|XP_006287637.1| hypothetical protein CARUB_v10000848mg [Capsella rubella]
            gi|482556343|gb|EOA20535.1| hypothetical protein
            CARUB_v10000848mg [Capsella rubella]
          Length = 481

 Score =  357 bits (916), Expect = 5e-96
 Identities = 187/363 (51%), Positives = 231/363 (63%), Gaps = 3/363 (0%)
 Frame = -1

Query: 1082 SIHARLNRASNTEKV-EDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLT 906
            SIH++L++   T  V + +  +LP + GS+LGSGNY+V+VGLGTPK  L+LIFDTGSDLT
Sbjct: 104  SIHSKLSKKLTTNHVGQSQSTDLPAKDGSTLGSGNYIVTVGLGTPKHDLSLIFDTGSDLT 163

Query: 905  WTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQ 726
            WTQC+PC  +CY Q++PIFNP  S+SY N+              GN+  CS  TC+YGIQ
Sbjct: 164  WTQCEPCVRTCYSQKEPIFNPSKSSSYYNVSCSSPACTSLSSATGNAGSCSASTCIYGIQ 223

Query: 725  YGDQSFSVGFFSKDTLTIAN-DVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTA 549
            YGDQSFSVGF +K+  T+ N DVF    FGCG+NNQGLF   AGL+GLGRD LS  SQTA
Sbjct: 224  YGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTA 283

Query: 548  AKYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGRQ 372
              Y K FSYCLP              G +++V+FTP+ + S GNSFY ++IV + VGG++
Sbjct: 284  TAYNKIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTISDGNSFYGLNIVGITVGGQK 343

Query: 371  LAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSN 192
            LAI  +VF T GA+IDSGTVITR           +FK  M+KYPTA   SILDTCFD S 
Sbjct: 344  LAIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAQMSKYPTASGVSILDTCFDLSG 403

Query: 191  YXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYE 12
            +                 V+L   GI  A   +Q CLAFAGN D S+  IFGN QQ T E
Sbjct: 404  FKTVTIPKVAFSFSGGAVVELGSKGIFYAFKISQVCLAFAGNSDDSNAAIFGNVQQQTLE 463

Query: 11   VVY 3
            VVY
Sbjct: 464  VVY 466


>ref|XP_007225640.1| hypothetical protein PRUPE_ppa004762mg [Prunus persica]
            gi|462422576|gb|EMJ26839.1| hypothetical protein
            PRUPE_ppa004762mg [Prunus persica]
          Length = 492

 Score =  355 bits (911), Expect = 2e-95
 Identities = 187/369 (50%), Positives = 239/369 (64%), Gaps = 9/369 (2%)
 Frame = -1

Query: 1082 SIHARLNRASNTEKVEDKK----VNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGS 915
            SIH+R+N     + V+D +      +P Q GS +G+GNY+V+VGLG+PKK L+LIFDTGS
Sbjct: 109  SIHSRVNSKKQLKSVDDLRESAATTIPAQSGSVVGAGNYIVNVGLGSPKKQLSLIFDTGS 168

Query: 914  DLTWTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARC--SVGTC 741
            DLTWTQC+PC +SCYKQ++PIF+P  SASY+N+              GN+  C  S  TC
Sbjct: 169  DLTWTQCRPCVKSCYKQKEPIFDPSLSASYANVSCTSATCTQLGSATGNTPGCTASTSTC 228

Query: 740  VYGIQYGDQSFSVGFFSKDTLTIAN-DVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSV 564
            +YGIQYGDQSFSVG+F K+ L++ N DVF  F FGCGQNNQGLFG  AGL+GLGR+ +S+
Sbjct: 229  IYGIQYGDQSFSVGYFGKEKLSLTNTDVFDGFLFGCGQNNQGLFGGAAGLLGLGRNQISL 288

Query: 563  ISQTAAKYGKYFSYCLPXXXXXXXXXXXXXXGATKN-VQFTPLDS-SQGNSFYFISIVSL 390
            + Q+A KY ++FSYCLP              G + N V+FT L + SQG+SFY +++V +
Sbjct: 289  VEQSAKKYNRFFSYCLPSTSSSTGYLSFGKGGGSSNAVKFTALSTVSQGDSFYGLNVVGI 348

Query: 389  AVGGRQLAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDT 210
             VGG +L I  SVF +SG IIDSGTVITR           AF+Q M  YP     SILDT
Sbjct: 349  NVGGTKLPISASVFSSSGTIIDSGTVITRLPPTAYSSLKAAFRQRMKSYPLTQELSILDT 408

Query: 209  CFDFSNYXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNT 30
            C+DFS++               +  DL  +GIL   S+ Q CLAFAGNGD SD+GIFGN 
Sbjct: 409  CYDFSSFKTVSYPKISFVFDGGLTQDLDATGILYVASADQVCLAFAGNGDDSDIGIFGNV 468

Query: 29   QQLTYEVVY 3
            QQ   +VVY
Sbjct: 469  QQKRLQVVY 477


>ref|XP_003551807.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Glycine
            max]
          Length = 490

 Score =  353 bits (905), Expect = 9e-95
 Identities = 191/359 (53%), Positives = 228/359 (63%), Gaps = 4/359 (1%)
 Frame = -1

Query: 1067 LNRASNTEKVEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLTWTQCQP 888
            L + S+ E+++     LP + GS +GSGNY V VGLGTPK+ L+LIFDTGSDLTWTQC+P
Sbjct: 119  LGQDSSVEELDS--ATLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEP 176

Query: 887  CAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGT--CVYGIQYGDQ 714
            CA SCYKQQD IF+P  S SYSNI              GN   CS  T  C+YGIQYGD 
Sbjct: 177  CARSCYKQQDVIFDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDS 236

Query: 713  SFSVGFFSKDTLTI-ANDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTAAKYG 537
            SFSVG+FS++ LT+ A DV  NF FGCGQNNQGLFG +AGLIGLGR P+S + QTAAKY 
Sbjct: 237  SFSVGYFSRERLTVTATDVVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYR 296

Query: 536  KYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGRQLAIG 360
            K FSYCLP                 + +++TP  + S+G+SFY + I ++AVGG +L + 
Sbjct: 297  KIFSYCLPSTSSSTGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVS 356

Query: 359  QSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSNYXXX 180
             S F T GAIIDSGTVITR           AF+QGM+KYP+A   SILDTC+D S Y   
Sbjct: 357  SSTFSTGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYKVF 416

Query: 179  XXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYEVVY 3
                        V V L   GIL   S+ Q CLAFA NGD SDV I+GN QQ T EVVY
Sbjct: 417  SIPTIEFSFAGGVTVKLPPQGILFVASTKQVCLAFAANGDDSDVTIYGNVQQRTIEVVY 475


>emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
            thaliana]
          Length = 446

 Score =  350 bits (899), Expect = 4e-94
 Identities = 184/363 (50%), Positives = 227/363 (62%), Gaps = 3/363 (0%)
 Frame = -1

Query: 1082 SIHARLNRASNTEKV-EDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLT 906
            SIH++L++   T+ V E K  +LP + GS+LGSGNY+V+VGLGTPK  L+LIFDTGSDLT
Sbjct: 69   SIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLT 128

Query: 905  WTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQ 726
            WTQCQPC  +CY Q++PIFNP  S SY N+              GN+  CS   C+YGIQ
Sbjct: 129  WTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQ 188

Query: 725  YGDQSFSVGFFSKDTLTIAN-DVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTA 549
            YGDQSFSVGF +K+  T+ N DVF    FGCG+NNQGLF   AGL+GLGRD LS  SQTA
Sbjct: 189  YGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTA 248

Query: 548  AKYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGRQ 372
              Y K FSYCLP              G +++V+FTP+ + + G SFY ++IV++ VGG++
Sbjct: 249  TAYNKIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQK 308

Query: 371  LAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSN 192
            L I  +VF T GA+IDSGTVITR           +FK  M+KYPT    SILDTCFD S 
Sbjct: 309  LPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSG 368

Query: 191  YXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYE 12
            +                 V+L   GI      +Q CLAFAGN D S+  IFGN QQ T E
Sbjct: 369  FKTVTIPKVAFSFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLE 428

Query: 11   VVY 3
            VVY
Sbjct: 429  VVY 431


>ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
            gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40
            [Arabidopsis thaliana] gi|24111269|gb|AAN46758.1|
            At5g10770/T30N20_40 [Arabidopsis thaliana]
            gi|332004211|gb|AED91594.1| aspartyl protease family
            protein [Arabidopsis thaliana]
          Length = 474

 Score =  350 bits (899), Expect = 4e-94
 Identities = 184/363 (50%), Positives = 227/363 (62%), Gaps = 3/363 (0%)
 Frame = -1

Query: 1082 SIHARLNRASNTEKV-EDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLT 906
            SIH++L++   T+ V E K  +LP + GS+LGSGNY+V+VGLGTPK  L+LIFDTGSDLT
Sbjct: 97   SIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLT 156

Query: 905  WTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQ 726
            WTQCQPC  +CY Q++PIFNP  S SY N+              GN+  CS   C+YGIQ
Sbjct: 157  WTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQ 216

Query: 725  YGDQSFSVGFFSKDTLTIAN-DVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTA 549
            YGDQSFSVGF +K+  T+ N DVF    FGCG+NNQGLF   AGL+GLGRD LS  SQTA
Sbjct: 217  YGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTA 276

Query: 548  AKYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGRQ 372
              Y K FSYCLP              G +++V+FTP+ + + G SFY ++IV++ VGG++
Sbjct: 277  TAYNKIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQK 336

Query: 371  LAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSN 192
            L I  +VF T GA+IDSGTVITR           +FK  M+KYPT    SILDTCFD S 
Sbjct: 337  LPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSG 396

Query: 191  YXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYE 12
            +                 V+L   GI      +Q CLAFAGN D S+  IFGN QQ T E
Sbjct: 397  FKTVTIPKVAFSFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLE 456

Query: 11   VVY 3
            VVY
Sbjct: 457  VVY 459


>ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
            lyrata] gi|297319313|gb|EFH49735.1| hypothetical protein
            ARALYDRAFT_325615 [Arabidopsis lyrata subsp. lyrata]
          Length = 475

 Score =  349 bits (896), Expect = 1e-93
 Identities = 183/363 (50%), Positives = 228/363 (62%), Gaps = 3/363 (0%)
 Frame = -1

Query: 1082 SIHARLNRASNTEKV-EDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLT 906
            SIH++L++   T  V + +  +LP + GS+LGSGNY+V+VGLGTPK  L+LIFDTGSDLT
Sbjct: 98   SIHSKLSKKLTTNHVSQSQSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLT 157

Query: 905  WTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQ 726
            WTQCQPC  +CY Q++PIFNP  S SY N+              GN+  CS   C+YGIQ
Sbjct: 158  WTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQ 217

Query: 725  YGDQSFSVGFFSKDTLTI-ANDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTA 549
            YGDQSFSVGF +KD  T+ ++DVF    FGCG+NNQGLF   AGL+GLGRD LS  SQTA
Sbjct: 218  YGDQSFSVGFLAKDKFTLTSSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTA 277

Query: 548  AKYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGRQ 372
              Y K FSYCLP              G +++V+FTP+ + + G SFY ++IV++ VGG++
Sbjct: 278  TAYNKIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQK 337

Query: 371  LAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSN 192
            L I  +VF T GA+IDSGTVITR           +FK  M+KYPT    SILDTCFD S 
Sbjct: 338  LPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSG 397

Query: 191  YXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYE 12
            +                 V+L   GI  A   +Q CLAFAGN D S+  IFGN QQ T E
Sbjct: 398  FKTVTIPKVAFSFSGGAVVELGSKGIFYAFKISQVCLAFAGNSDDSNAAIFGNVQQQTLE 457

Query: 11   VVY 3
            VVY
Sbjct: 458  VVY 460


Top