BLASTX nr result
ID: Mentha22_contig00012464
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00012464
         (1082 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EYU22027.1| hypothetical protein MIMGU_mgv1a005281mg [Mimulus...   431   e-118
gb|EYU29201.1| hypothetical protein MIMGU_mgv1a005225mg [Mimulus...   423   e-116
ref|XP_004245197.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   396   e-108
dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (...   394   e-107
ref|XP_006364268.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   393   e-107
dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein ...   391   e-106
ref|XP_006483511.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   376   e-101
ref|XP_006450237.1| hypothetical protein CICLE_v10008143mg [Citr...   375   e-101
ref|XP_007011665.1| Eukaryotic aspartyl protease family protein,...   368   3e-99
ref|XP_007011662.1| Eukaryotic aspartyl protease family protein,...   368   3e-99
ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2...   368   3e-99
ref|XP_002324349.1| nucleoid DNA-binding family protein [Populus...   364   3e-98
ref|XP_004291984.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   363   9e-98
ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor,...   361   3e-97
ref|XP_006287637.1| hypothetical protein CARUB_v10000848mg [Caps...   357   5e-96
ref|XP_007225640.1| hypothetical protein PRUPE_ppa004762mg [Prun...   355   2e-95
ref|XP_003551807.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   353   9e-95
emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein ...
350 4e-94 ref|NP_196638.2| aspartyl protease family protein [Arabidopsis t... 350 4e-94 ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arab... 349 1e-93 >gb|EYU22027.1| hypothetical protein MIMGU_mgv1a005281mg [Mimulus guttatus] Length = 490 Score = 431 bits (1107), Expect = e-118 Identities = 227/368 (61%), Positives = 258/368 (70%), Gaps = 8/368 (2%) Frame = -1 Query: 1082 SIHARLN-RASNTEKVEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLT 906 SI ++L + K+ +KK N+P Q G SLGSGNYL+++GLGTPKKTL LIFDTGSDL Sbjct: 108 SIQSKLKPNSKKPNKLNEKKTNIPAQSGKSLGSGNYLIAIGLGTPKKTLNLIFDTGSDLM 167 Query: 905 WTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARC-SVGTCVYGI 729 WTQCQPCA SCY Q+DPIFNP S SYSNI GN+ C + TCVYGI Sbjct: 168 WTQCQPCARSCYTQKDPIFNPSLSGSYSNISCSSAQCSLLTSATGNNPGCTAASTCVYGI 227 Query: 728 QYGDQSFSVGFFSKDTLTIA-NDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQT 552 QYGD+SFSVGFF+KDTLTI NDVFPNF FGCGQNNQGLFG TAGL+GLGRD LS++SQT Sbjct: 228 QYGDKSFSVGFFAKDTLTITPNDVFPNFLFGCGQNNQGLFGNTAGLLGLGRDSLSLVSQT 287 Query: 551 AAKYGKYFSYCLP-----XXXXXXXXXXXXXXGATKNVQFTPLDSSQGNSFYFISIVSLA 387 + KYGKYFSYCLP + V+FTP +SQG+SFYFI IVS++ Sbjct: 288 SQKYGKYFSYCLPSTSSSTGHLTLGKNNGGAALTSSTVKFTPFATSQGSSFYFIDIVSIS 347 Query: 386 VGGRQLAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTC 207 VGG QL IGQSVFK +GAIIDSGTVI+R AF+Q M +Y +APAYSILDTC Sbjct: 348 VGGAQLPIGQSVFKAAGAIIDSGTVISRLPPAAYSAMSSAFRQQMKQYTSAPAYSILDTC 407 Query: 206 FDFSNYXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQ 27 FDF N ++VDL SGILVAVSSTQACLAFAGNGDA DVGIFGNTQ Sbjct: 408 FDFGNLTSVSIPTISFVFSGNLRVDLHPSGILVAVSSTQACLAFAGNGDAGDVGIFGNTQ 467 Query: 26 QLTYEVVY 3 Q T EVVY Sbjct: 468 QKTLEVVY 475 >gb|EYU29201.1| hypothetical protein MIMGU_mgv1a005225mg [Mimulus guttatus] Length = 492 Score = 423 bits (1088), Expect = e-116 Identities = 225/363 (61%), Positives = 261/363 (71%), Gaps = 4/363 (1%) Frame = -1 Query: 1079 IHARLNRASNTE-KVEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLTW 903 I+AR+ + S T+ +++ KKVNLPVQ G SLGSGNY+V++GLGTP+KTL+LIFDTGSDLTW Sbjct: 
115 INARIKQTSYTKNQIKGKKVNLPVQSGRSLGSGNYIVTLGLGTPQKTLSLIFDTGSDLTW 174 Query: 902 TQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARC-SVGTCVYGIQ 726 TQCQPC +SCY+QQDPIFNP S SYSN+ GNS C + TCVYGIQ Sbjct: 175 TQCQPCVKSCYQQQDPIFNPSDSTSYSNVSCNSPQCSQLSAATGNSPGCTNAATCVYGIQ 234 Query: 725 YGDQSFSVGFFSKDTLTIA-NDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTA 549 YGDQSFSVGFFSKD LTIA N+VF +F FGCGQNNQGLFG TAGL+GLGRD LS+ISQTA Sbjct: 235 YGDQSFSVGFFSKDKLTIAPNEVFQDFLFGCGQNNQGLFGNTAGLLGLGRDKLSIISQTA 294 Query: 548 AKYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTP-LDSSQGNSFYFISIVSLAVGGRQ 372 KYGKYFSYCLP G ++NV+FTP + + QG+SFYFI+IVS++VGGRQ Sbjct: 295 QKYGKYFSYCLPSTSSSTGHLTLGKTGNSRNVKFTPFVTNQQGSSFYFINIVSISVGGRQ 354 Query: 371 LAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSN 192 LAI SVFK G IIDSGTVI+R AFK+ M KY APAYSILDTC+D S Sbjct: 355 LAISGSVFKAGGTIIDSGTVISRIPPTAYSALSGAFKKMMAKYKRAPAYSILDTCYDLSG 414 Query: 191 YXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYE 12 Y V+VDL SGI+VAV T+ CLAFAGN D DVGIFGN+QQ T E Sbjct: 415 YTSVTVPTVSFTFGGNVRVDLDPSGIIVAVGGTRVCLAFAGNSDDGDVGIFGNSQQKTLE 474 Query: 11 VVY 3 VVY Sbjct: 475 VVY 477 >ref|XP_004245197.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Solanum lycopersicum] Length = 501 Score = 396 bits (1018), Expect = e-108 Identities = 195/361 (54%), Positives = 243/361 (67%), Gaps = 1/361 (0%) Frame = -1 Query: 1082 SIHARLNRASNTEKVEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLTW 903 ++ + + S + +D K LP Q G +L +GNY+V+VG+GTPKK LTLIFDTGSDLTW Sbjct: 126 NLFRKTEKTSKKYRAKDSKTTLPAQPGIALSTGNYIVTVGIGTPKKDLTLIFDTGSDLTW 185 Query: 902 TQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQY 723 TQC+PC ++C+ QQ PIFNP +S++YSNI GNS CS TCVYGIQY Sbjct: 186 TQCEPCFKTCFPQQQPIFNPSSSSTYSNISCSSTACSGLKSATGNSPVCSSSTCVYGIQY 245 Query: 722 GDQSFSVGFFSKDTLTI-ANDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTAA 546 GD SFS+GFF+KD LT+ A DVF F FGCGQ+N+GLFG+TAGLIGLGRDPLS++SQT+A Sbjct: 246 GDSSFSIGFFAKDRLTLSATDVFDGFMFGCGQDNKGLFGKTAGLIGLGRDPLSIVSQTSA 305 Query: 545 
KYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDSSQGNSFYFISIVSLAVGGRQLA 366 K+GKYFSYCLP GA N+QFTP SSQG SFYFI ++ ++VGG+ LA Sbjct: 306 KFGKYFSYCLPTRRGSNGHLSFGKNGAKSNLQFTPFASSQGTSFYFIDVLGISVGGKSLA 365 Query: 365 IGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSNYX 186 I VFK +G IIDSGTVITR F++ M+KYP AP S+LDTC+D SNY Sbjct: 366 ISPMVFKNAGTIIDSGTVITRLPSTAYSNLRATFREFMSKYPRAPDLSLLDTCYDLSNYT 425 Query: 185 XXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYEVV 6 K+D+ +GI + ++Q CLAFAGNGD +GIFGNTQQ T E+V Sbjct: 426 TISIPKISFNFNGNTKMDIVPNGIFIVNGASQVCLAFAGNGDDDSIGIFGNTQQQTMEIV 485 Query: 5 Y 3 Y Sbjct: 486 Y 486 >dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana sylvestris] Length = 502 Score = 394 bits (1011), Expect = e-107 Identities = 201/362 (55%), Positives = 239/362 (66%), Gaps = 9/362 (2%) Frame = -1 Query: 1061 RASNTEK-VEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLTWTQCQPC 885 ++SN +K V+D K NLP Q G LG+GNY+V+VGLGTPKK L+LIFDTGSDLTWTQCQPC Sbjct: 126 KSSNKKKSVKDSKANLPAQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPC 185 Query: 884 AESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQYGDQSFS 705 +SCY QQ PIF+P S +YSNI GNS CS CVYGIQYGD SF+ Sbjct: 186 VKSCYAQQQPIFDPSASKTYSNISCTSTACSGLKSATGNSPGCSSSNCVYGIQYGDSSFT 245 Query: 704 VGFFSKDTLTIA-NDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTAAKYGKYF 528 VGFF+KDTLT+ NDVF F FGCGQNN+GLFG+TAGLIGLGRDPLS++ QTA K+GKYF Sbjct: 246 VGFFAKDTLTLTQNDVFDGFMFGCGQNNRGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYF 305 Query: 527 SYCLPXXXXXXXXXXXXXXGATK-------NVQFTPLDSSQGNSFYFISIVSLAVGGRQL 369 SYCLP K + FTP SSQG +FYFI ++ ++VGG+ L Sbjct: 306 SYCLPTSRGSNGHLTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKAL 365 Query: 368 AIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSNY 189 +I +F+ +G IIDSGTVITR FKQ M+KYPTAPA S+LDTC+D SNY Sbjct: 366 SISPMLFQNAGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLDTCYDLSNY 425 Query: 188 XXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYEV 9 VDL +GIL+ ++Q CLAFAGNGD +GIFGN QQ T EV Sbjct: 426 
TSISIPKISFNFNGNANVDLEPNGILITNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEV 485 Query: 8 VY 3 VY Sbjct: 486 VY 487 >ref|XP_006364268.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Solanum tuberosum] Length = 485 Score = 393 bits (1009), Expect = e-107 Identities = 193/361 (53%), Positives = 242/361 (67%), Gaps = 1/361 (0%) Frame = -1 Query: 1082 SIHARLNRASNTEKVEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLTW 903 ++ + + S + +D K LP Q G++L +GNY+V++G+GTPKK LTLIFDTGSDLTW Sbjct: 110 NLFRKTEKTSKKYRAKDSKTTLPAQPGTALSTGNYIVTIGIGTPKKDLTLIFDTGSDLTW 169 Query: 902 TQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQY 723 TQC+PC ++C+ QQ PIFNP +S++YSNI GN+ CS TCVYGIQY Sbjct: 170 TQCEPCFKTCFPQQQPIFNPSSSSTYSNISCSSTACSGLKSATGNTPLCSSSTCVYGIQY 229 Query: 722 GDQSFSVGFFSKDTLTI-ANDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTAA 546 GD SFS+GFF+KD LT+ A DVF F FGCGQ+N+GLFG+TAGLIGLGRDPLS++SQT+A Sbjct: 230 GDSSFSIGFFAKDKLTLSATDVFDGFMFGCGQDNKGLFGKTAGLIGLGRDPLSIVSQTSA 289 Query: 545 KYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDSSQGNSFYFISIVSLAVGGRQLA 366 K+GKYFSYCLP GA N+QFTP SSQG SFYFI ++ ++VGG+ LA Sbjct: 290 KFGKYFSYCLPTRRGSNGHLTFGKNGAKSNLQFTPFASSQGTSFYFIDVLGISVGGKALA 349 Query: 365 IGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSNYX 186 I VFK +G IIDSGTVITR F++ M+KYP AP S+LDTC+D SNY Sbjct: 350 ISPMVFKNAGTIIDSGTVITRLPSTAYANMRATFREFMSKYPRAPDLSLLDTCYDLSNYT 409 Query: 185 XXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYEVV 6 K+DL +GI ++Q CLAFA NGD +GIFGNTQQ T E+V Sbjct: 410 TVSIPKISFNFNGNTKMDLVPNGIFFVNGASQVCLAFASNGDDDSIGIFGNTQQQTMEIV 469 Query: 5 Y 3 Y Sbjct: 470 Y 470 >dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum] Length = 502 Score = 391 bits (1005), Expect = e-106 Identities = 199/362 (54%), Positives = 240/362 (66%), Gaps = 9/362 (2%) Frame = -1 Query: 1061 RASNTEK-VEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLTWTQCQPC 885 ++SN +K V+D K NLP Q G LG+GNY+V+VGLGTPKK L+LIFDTGSDLTWTQCQPC Sbjct: 126 
KSSNKKKSVKDSKANLPAQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPC 185 Query: 884 AESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQYGDQSFS 705 +SCY QQ PIF+P TS +YSNI GNS CS CVYGIQYGD SF+ Sbjct: 186 VKSCYAQQQPIFDPSTSKTYSNISCTSAACSSLKSATGNSPGCSSSNCVYGIQYGDSSFT 245 Query: 704 VGFFSKDTLTIA-NDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTAAKYGKYF 528 +GFF+KD LT+ NDVF F FGCGQNN+GLFG+TAGLIGLGRDPLS++ QTA K+GKYF Sbjct: 246 IGFFAKDKLTLTQNDVFDGFMFGCGQNNKGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYF 305 Query: 527 SYCLPXXXXXXXXXXXXXXGATK-------NVQFTPLDSSQGNSFYFISIVSLAVGGRQL 369 SYCLP K + FTP SSQG ++YFI ++ ++VGG+ L Sbjct: 306 SYCLPTSRGSNGHLTFGNGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKAL 365 Query: 368 AIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSNY 189 +I +F+ +G IIDSGTVITR AFKQ M+KYPTAPA S+LDTC+D SNY Sbjct: 366 SISPMLFQNAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTCYDLSNY 425 Query: 188 XXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYEV 9 V+L +GIL+ ++Q CLAFAGNGD +GIFGN QQ T EV Sbjct: 426 TSISIPKISFNFNGNANVELDPNGILITNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEV 485 Query: 8 VY 3 VY Sbjct: 486 VY 487 >ref|XP_006483511.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Citrus sinensis] Length = 481 Score = 376 bits (965), Expect = e-101 Identities = 197/364 (54%), Positives = 249/364 (68%), Gaps = 4/364 (1%) Frame = -1 Query: 1082 SIHARLNRASNT--EKVEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDL 909 SIH+RL++ S + E + LP + GS +G+GNY+V+VG+GTPKK L+LIFDTGSDL Sbjct: 104 SIHSRLSKNSGSLDEIRQSDDATLPAKDGSVVGAGNYIVTVGIGTPKKDLSLIFDTGSDL 163 Query: 908 TWTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGI 729 TWTQC+PC + CY+Q++P F+P S SYSN+ GNS C+ TC+YGI Sbjct: 164 TWTQCEPCVKYCYEQKEPKFDPTVSQSYSNVSCSSTICTSLQSATGNSPACASSTCLYGI 223 Query: 728 QYGDQSFSVGFFSKDTLTIA-NDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQT 552 QYGD SFS+GFF K+TLT+ DVFPNF FGCGQNN+GLFG AGL+GLGRDP+S++SQT Sbjct: 224 QYGDSSFSIGFFGKETLTLTPRDVFPNFLFGCGQNNRGLFGGAAGLMGLGRDPISLVSQT 283 Query: 551 
AAKYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGR 375 A KY K FSYCLP GA+K+VQFTPL S S G+SFY + ++ ++VGG+ Sbjct: 284 ATKYKKLFSYCLP-SSASSTGHLTFGPGASKSVQFTPLSSISGGSSFYGLEMIGISVGGQ 342 Query: 374 QLAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFS 195 +L+I SVF T+G IIDSGTVITR AF+Q M+KYPTAPA S+LDTC+DFS Sbjct: 343 KLSIAASVFTTAGTIIDSGTVITRLPPDAYTPLRTAFRQFMSKYPTAPALSLLDTCYDFS 402 Query: 194 NYXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTY 15 Y V+V + +GI+ A + +Q CLAFAGN D +DV IFGNTQQ T Sbjct: 403 KYSTVTLPQISLFFSGGVEVSVDKTGIMYASNISQVCLAFAGNSDPTDVSIFGNTQQHTL 462 Query: 14 EVVY 3 EVVY Sbjct: 463 EVVY 466 >ref|XP_006450237.1| hypothetical protein CICLE_v10008143mg [Citrus clementina] gi|557553463|gb|ESR63477.1| hypothetical protein CICLE_v10008143mg [Citrus clementina] Length = 481 Score = 375 bits (964), Expect = e-101 Identities = 197/364 (54%), Positives = 248/364 (68%), Gaps = 4/364 (1%) Frame = -1 Query: 1082 SIHARLNRASNT--EKVEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDL 909 SIH+RL++ S + E + LP + GS +G+GNY+V+VG+GTPKK L+LIFDTGSDL Sbjct: 104 SIHSRLSKNSGSLDEIRQSDDATLPAKDGSVVGAGNYIVTVGIGTPKKDLSLIFDTGSDL 163 Query: 908 TWTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGI 729 TWTQC+PC + CY+Q++P F+P S SYSN+ GNS C+ TC+YGI Sbjct: 164 TWTQCEPCVKYCYEQKEPKFDPTVSQSYSNVSCSSTICTSLQSATGNSPACASSTCLYGI 223 Query: 728 QYGDQSFSVGFFSKDTLTIA-NDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQT 552 QYGD SFS+GFF K+TLT+ DVFPNF FGCGQNN GLFG AGL+GLGRDP+S++SQT Sbjct: 224 QYGDSSFSIGFFGKETLTLTPTDVFPNFLFGCGQNNHGLFGGAAGLMGLGRDPISLVSQT 283 Query: 551 AAKYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGR 375 A KY K FSYCLP GA+K+VQFTPL S S G+SFY + ++ ++VGG+ Sbjct: 284 ATKYKKLFSYCLP-SSASSTGHLTFGPGASKSVQFTPLSSISGGSSFYGLEMIGISVGGQ 342 Query: 374 QLAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFS 195 +L+I SVF T+G IIDSGTVITR AF+Q M+KYPTAPA S+LDTC+DFS Sbjct: 343 KLSIAASVFTTAGTIIDSGTVITRLPPDAYTPLRTAFRQFMSKYPTAPALSLLDTCYDFS 402 Query: 194 
NYXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTY 15 Y V+V + +GI+ A + +Q CLAFAGN D +DV IFGNTQQ T Sbjct: 403 KYSTVTLPQISLFFSGGVEVSVDKTGIMYASNISQVCLAFAGNSDPTDVSIFGNTQQHTL 462 Query: 14 EVVY 3 EVVY Sbjct: 463 EVVY 466 >ref|XP_007011665.1| Eukaryotic aspartyl protease family protein, putative isoform 4, partial [Theobroma cacao] gi|508782028|gb|EOY29284.1| Eukaryotic aspartyl protease family protein, putative isoform 4, partial [Theobroma cacao] Length = 477 Score = 368 bits (944), Expect = 3e-99 Identities = 197/364 (54%), Positives = 243/364 (66%), Gaps = 4/364 (1%) Frame = -1 Query: 1082 SIHARLNRASNTEKVEDKKV-NLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLT 906 SIH+RL R + V++ LP + GS +GSGNY+V+VGLGTPKK L+L+FDTGSD+T Sbjct: 99 SIHSRLGRKPGSSDVDETDAAQLPAKDGSVVGSGNYIVTVGLGTPKKGLSLVFDTGSDIT 158 Query: 905 WTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQ 726 WTQCQPCA+SCYKQ+DPIF P S++YSNI GNS C+ CVYGIQ Sbjct: 159 WTQCQPCAKSCYKQRDPIFAPSQSSTYSNISCTSTACSSLTSATGNSPGCASSACVYGIQ 218 Query: 725 YGDQSFSVGFFSKDTLTIA-NDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTA 549 YGD SFSVGFF+K+ LT+ D F NF FGCGQNNQGLFG +AGL+GLGRD LS+ SQTA Sbjct: 219 YGDSSFSVGFFAKEKLTLTPTDEFDNFLFGCGQNNQGLFGGSAGLLGLGRDQLSLPSQTA 278 Query: 548 AKYGKYFSYCLP-XXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGR 375 +KY K+FSYCLP G +K+V+FT L + SQG SFY I I ++VGG+ Sbjct: 279 SKYKKFFSYCLPSSASSDGFLAFGYGGGVSKSVKFTTLSTVSQGESFYGIDITGISVGGQ 338 Query: 374 QLAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFS 195 +L+I S+F T+G IIDSGTVITR +F+Q MT+YP A A +ILDTC+DFS Sbjct: 339 KLSISASLFTTAGTIIDSGTVITRLPPTAYAALRSSFRQKMTQYPRAQALAILDTCYDFS 398 Query: 194 NYXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTY 15 Y V+V + GIL A S +Q CLAFAGN D +D+GI GNTQQ T Sbjct: 399 KYSSVSIPKISFFFSGGVEVPIDAKGILYANSISQVCLAFAGNSDDTDIGIVGNTQQKTL 458 Query: 14 EVVY 3 +VVY Sbjct: 459 QVVY 462 >ref|XP_007011662.1| Eukaryotic aspartyl protease family protein, putative isoform 1 [Theobroma cacao] gi|508782025|gb|EOY29281.1| Eukaryotic aspartyl 
protease family protein, putative isoform 1 [Theobroma cacao] Length = 474 Score = 368 bits (944), Expect = 3e-99 Identities = 197/364 (54%), Positives = 243/364 (66%), Gaps = 4/364 (1%) Frame = -1 Query: 1082 SIHARLNRASNTEKVEDKKV-NLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLT 906 SIH+RL R + V++ LP + GS +GSGNY+V+VGLGTPKK L+L+FDTGSD+T Sbjct: 96 SIHSRLGRKPGSSDVDETDAAQLPAKDGSVVGSGNYIVTVGLGTPKKGLSLVFDTGSDIT 155 Query: 905 WTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQ 726 WTQCQPCA+SCYKQ+DPIF P S++YSNI GNS C+ CVYGIQ Sbjct: 156 WTQCQPCAKSCYKQRDPIFAPSQSSTYSNISCTSTACSSLTSATGNSPGCASSACVYGIQ 215 Query: 725 YGDQSFSVGFFSKDTLTIA-NDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTA 549 YGD SFSVGFF+K+ LT+ D F NF FGCGQNNQGLFG +AGL+GLGRD LS+ SQTA Sbjct: 216 YGDSSFSVGFFAKEKLTLTPTDEFDNFLFGCGQNNQGLFGGSAGLLGLGRDQLSLPSQTA 275 Query: 548 AKYGKYFSYCLP-XXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGR 375 +KY K+FSYCLP G +K+V+FT L + SQG SFY I I ++VGG+ Sbjct: 276 SKYKKFFSYCLPSSASSDGFLAFGYGGGVSKSVKFTTLSTVSQGESFYGIDITGISVGGQ 335 Query: 374 QLAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFS 195 +L+I S+F T+G IIDSGTVITR +F+Q MT+YP A A +ILDTC+DFS Sbjct: 336 KLSISASLFTTAGTIIDSGTVITRLPPTAYAALRSSFRQKMTQYPRAQALAILDTCYDFS 395 Query: 194 NYXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTY 15 Y V+V + GIL A S +Q CLAFAGN D +D+GI GNTQQ T Sbjct: 396 KYSSVSIPKISFFFSGGVEVPIDAKGILYANSISQVCLAFAGNSDDTDIGIVGNTQQKTL 455 Query: 14 EVVY 3 +VVY Sbjct: 456 QVVY 459 >ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera] Length = 481 Score = 368 bits (944), Expect = 3e-99 Identities = 194/364 (53%), Positives = 245/364 (67%), Gaps = 4/364 (1%) Frame = -1 Query: 1082 SIHARLNR-ASNTEKVEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLT 906 SI +RL + ++ K++ KV LP + GS++G+GNY+V+VGLGTPK+ LT IFDTGSDLT Sbjct: 103 SIRSRLAKNPADGGKLKGSKVTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLT 162 Query: 905 WTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQ 726 WTQC+PCA CY QQ+PIFNP S SY+NI GNS CS TCVYGIQ 
Sbjct: 163 WTQCEPCARYCYHQQEPIFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQ 222 Query: 725 YGDQSFSVGFFSKDTLTI-ANDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTA 549 YGDQS+SVGFF++D L + + DVF NF FGCGQNN+GLF AGLIGLGR+ LS++SQTA Sbjct: 223 YGDQSYSVGFFAQDKLALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLVSQTA 282 Query: 548 AKYGKYFSYCLPXXXXXXXXXXXXXXGAT-KNVQFTP-LDSSQGNSFYFISIVSLAVGGR 375 KYGK FSYCLP G T K V+FTP L +SQG SFYF+++++++VGGR Sbjct: 283 QKYGKLFSYCLPSTSSSTGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGR 342 Query: 374 QLAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFS 195 +L+ SVF T+G IIDSGTVI+R +F+Q M+KYP A SILDTC+DFS Sbjct: 343 KLSTSASVFSTAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYDFS 402 Query: 194 NYXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTY 15 Y ++DL SGI ++ +Q CLAFAGN DA+D+ I GN QQ T+ Sbjct: 403 QYDTVDVPKINLYFSDGAEMDLDPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTF 462 Query: 14 EVVY 3 +VVY Sbjct: 463 DVVY 466 >ref|XP_002324349.1| nucleoid DNA-binding family protein [Populus trichocarpa] gi|222865783|gb|EEF02914.1| nucleoid DNA-binding family protein [Populus trichocarpa] Length = 490 Score = 364 bits (935), Expect = 3e-98 Identities = 197/366 (53%), Positives = 240/366 (65%), Gaps = 6/366 (1%) Frame = -1 Query: 1082 SIHARLNRASNTEKVEDKKVN----LPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGS 915 SIH+RL+ S T +D KV +P + GS++GSGNY+V+VGLGTPKK L+LIFDTGS Sbjct: 112 SIHSRLSN-SKTSGGKDVKVTDSTTIPAKDGSTVGSGNYIVTVGLGTPKKDLSLIFDTGS 170 Query: 914 DLTWTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVY 735 D+TWTQCQPCA SCYKQ++ IF+P S SY+NI GN+ C+ CVY Sbjct: 171 DITWTQCQPCARSCYKQKEQIFDPSQSTSYTNISCSSSICNSLTSATGNTPGCASSACVY 230 Query: 734 GIQYGDQSFSVGFFSKDTLTI-ANDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVIS 558 GIQYGD SFSVGFF + LT+ + D F N FGCGQNNQGLFG +AGL+GLGRD LSV+S Sbjct: 231 GIQYGDSSFSVGFFGTEKLTLTSTDAFNNIYFGCGQNNQGLFGGSAGLLGLGRDKLSVVS 290 Query: 557 QTAAKYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVG 381 QTA KY K FSYCLP A+KN +FTPL + S G SFY + ++VG Sbjct: 291 
QTAQKYNKIFSYCLP-SSSSSTGFLTFGGSASKNAKFTPLSTISAGPSFYGLDFTGISVG 349 Query: 380 GRQLAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFD 201 G++LAI SVF T+GAIIDSGTVITR +F+ M+KYP A SILDTC+D Sbjct: 350 GKKLAISASVFSTAGAIIDSGTVITRLPPAAYSALRASFRNLMSKYPMTKALSILDTCYD 409 Query: 200 FSNYXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQL 21 FS+Y ++VD+ +GIL A S +Q CLAFAGN DA+DV IFGN QQ Sbjct: 410 FSSYTTISVPKIGFSFSSGIEVDIDATGILYASSLSQVCLAFAGNSDATDVFIFGNVQQK 469 Query: 20 TYEVVY 3 T EV Y Sbjct: 470 TLEVFY 475 >ref|XP_004291984.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Fragaria vesca subsp. vesca] Length = 492 Score = 363 bits (931), Expect = 9e-98 Identities = 192/366 (52%), Positives = 241/366 (65%), Gaps = 6/366 (1%) Frame = -1 Query: 1082 SIHARLNRASNTEKVEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLTW 903 SIHAR++ + ++ ++P + GS +GSGNY+V+VGLG+P K L+LIFDTGSDLTW Sbjct: 112 SIHARVSPKKGDDDLQQSDTSIPAKSGSVVGSGNYIVTVGLGSPAKQLSLIFDTGSDLTW 171 Query: 902 TQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVG--TCVYGI 729 TQCQPC +SCYKQ++PIF+P S SY+NI GN+ CS G TC+YGI Sbjct: 172 TQCQPCVKSCYKQKEPIFDPSLSKSYANISCNSPVCSQLISATGNTPGCSSGTSTCIYGI 231 Query: 728 QYGDQSFSVGFFSKDTLTI-ANDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQT 552 QYGDQSFSVG+F K+ LT+ + DVF F FGCGQNNQGLFG +AGL+GLGR+ +S++ Q+ Sbjct: 232 QYGDQSFSVGYFGKERLTLTSTDVFDGFLFGCGQNNQGLFGGSAGLLGLGRNKISLVEQS 291 Query: 551 AAKYGKYFSYCLP--XXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVG 381 A KYG+YFSYCLP G++ V+FTPL + SQG SFY +S+V ++VG Sbjct: 292 APKYGRYFSYCLPSTSSSTGYLSFGRGGGGSSSAVKFTPLSTVSQGGSFYGLSVVGISVG 351 Query: 380 GRQLAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFD 201 GRQL+I SVF +SG IIDSGTVITR AF+QGM YP A A SILDTC+D Sbjct: 352 GRQLSIPASVFSSSGTIIDSGTVITRLPATAYSALRDAFRQGMKSYPQAEALSILDTCYD 411 Query: 200 FSNYXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQL 21 S V +DL +GIL S +Q CLAFAGN D SD+ IFGN QQ Sbjct: 412 LSGSKTVSYPKIAFAFGGGVTLDLDATGILYVASVSQVCLAFAGNSDDSDIAIFGNVQQK 471 Query: 20 TYEVVY 3 +VVY Sbjct: 472 RLQVVY 
477 >ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus communis] gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus communis] Length = 494 Score = 361 bits (926), Expect = 3e-97 Identities = 192/363 (52%), Positives = 234/363 (64%), Gaps = 3/363 (0%) Frame = -1 Query: 1082 SIHARLNRASNTEKVE-DKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLT 906 SIH++L++ S V+ LP + GS +GSGNY V+VGLGTPKK +LIFDTGSDLT Sbjct: 118 SIHSKLSKDSGLSDVKATAATTLPAKDGSIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLT 177 Query: 905 WTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQ 726 WTQC+PC +SCY Q++ IFNP S SY+NI GN C+ TCVYGIQ Sbjct: 178 WTQCEPCVKSCYNQKEAIFNPSQSTSYANISCGSTLCDSLASATGNIFNCASSTCVYGIQ 237 Query: 725 YGDQSFSVGFFSKDTLTI-ANDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTA 549 YGD SFS+GFF K+ L++ A DVF +F FGCGQNN+GLFG AGL+GLGRD LS++SQTA Sbjct: 238 YGDSSFSIGFFGKEKLSLTATDVFNDFYFGCGQNNKGLFGGAAGLLGLGRDKLSLVSQTA 297 Query: 548 AKYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGRQ 372 +Y K FSYCLP +K+ FTPL + S G+SFY + + ++VGGR+ Sbjct: 298 QRYNKIFSYCLP-SSSSSTGFLTFGGSTSKSASFTPLATISGGSSFYGLDLTGISVGGRK 356 Query: 371 LAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSN 192 LAI SVF T+G IIDSGTVITR F++ M++YP APA SILDTCFDFSN Sbjct: 357 LAISPSVFSTAGTIIDSGTVITRLPPAAYSALSSTFRKLMSQYPAAPALSILDTCFDFSN 416 Query: 191 YXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYE 12 + V VD+ +GI TQ CLAFAGN DASDV IFGN QQ T E Sbjct: 417 HDTISVPKIGLFFSGGVVVDIDKTGIFYVNDLTQVCLAFAGNSDASDVAIFGNVQQKTLE 476 Query: 11 VVY 3 VVY Sbjct: 477 VVY 479 >ref|XP_006287637.1| hypothetical protein CARUB_v10000848mg [Capsella rubella] gi|482556343|gb|EOA20535.1| hypothetical protein CARUB_v10000848mg [Capsella rubella] Length = 481 Score = 357 bits (916), Expect = 5e-96 Identities = 187/363 (51%), Positives = 231/363 (63%), Gaps = 3/363 (0%) Frame = -1 Query: 1082 SIHARLNRASNTEKV-EDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLT 906 SIH++L++ T V + + +LP + GS+LGSGNY+V+VGLGTPK 
L+LIFDTGSDLT Sbjct: 104 SIHSKLSKKLTTNHVGQSQSTDLPAKDGSTLGSGNYIVTVGLGTPKHDLSLIFDTGSDLT 163 Query: 905 WTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQ 726 WTQC+PC +CY Q++PIFNP S+SY N+ GN+ CS TC+YGIQ Sbjct: 164 WTQCEPCVRTCYSQKEPIFNPSKSSSYYNVSCSSPACTSLSSATGNAGSCSASTCIYGIQ 223 Query: 725 YGDQSFSVGFFSKDTLTIAN-DVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTA 549 YGDQSFSVGF +K+ T+ N DVF FGCG+NNQGLF AGL+GLGRD LS SQTA Sbjct: 224 YGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTA 283 Query: 548 AKYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGRQ 372 Y K FSYCLP G +++V+FTP+ + S GNSFY ++IV + VGG++ Sbjct: 284 TAYNKIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTISDGNSFYGLNIVGITVGGQK 343 Query: 371 LAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSN 192 LAI +VF T GA+IDSGTVITR +FK M+KYPTA SILDTCFD S Sbjct: 344 LAIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAQMSKYPTASGVSILDTCFDLSG 403 Query: 191 YXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYE 12 + V+L GI A +Q CLAFAGN D S+ IFGN QQ T E Sbjct: 404 FKTVTIPKVAFSFSGGAVVELGSKGIFYAFKISQVCLAFAGNSDDSNAAIFGNVQQQTLE 463 Query: 11 VVY 3 VVY Sbjct: 464 VVY 466 >ref|XP_007225640.1| hypothetical protein PRUPE_ppa004762mg [Prunus persica] gi|462422576|gb|EMJ26839.1| hypothetical protein PRUPE_ppa004762mg [Prunus persica] Length = 492 Score = 355 bits (911), Expect = 2e-95 Identities = 187/369 (50%), Positives = 239/369 (64%), Gaps = 9/369 (2%) Frame = -1 Query: 1082 SIHARLNRASNTEKVEDKK----VNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGS 915 SIH+R+N + V+D + +P Q GS +G+GNY+V+VGLG+PKK L+LIFDTGS Sbjct: 109 SIHSRVNSKKQLKSVDDLRESAATTIPAQSGSVVGAGNYIVNVGLGSPKKQLSLIFDTGS 168 Query: 914 DLTWTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARC--SVGTC 741 DLTWTQC+PC +SCYKQ++PIF+P SASY+N+ GN+ C S TC Sbjct: 169 DLTWTQCRPCVKSCYKQKEPIFDPSLSASYANVSCTSATCTQLGSATGNTPGCTASTSTC 228 Query: 740 VYGIQYGDQSFSVGFFSKDTLTIAN-DVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSV 564 +YGIQYGDQSFSVG+F K+ L++ N DVF F FGCGQNNQGLFG AGL+GLGR+ +S+ Sbjct: 229 
IYGIQYGDQSFSVGYFGKEKLSLTNTDVFDGFLFGCGQNNQGLFGGAAGLLGLGRNQISL 288 Query: 563 ISQTAAKYGKYFSYCLPXXXXXXXXXXXXXXGATKN-VQFTPLDS-SQGNSFYFISIVSL 390 + Q+A KY ++FSYCLP G + N V+FT L + SQG+SFY +++V + Sbjct: 289 VEQSAKKYNRFFSYCLPSTSSSTGYLSFGKGGGSSNAVKFTALSTVSQGDSFYGLNVVGI 348 Query: 389 AVGGRQLAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDT 210 VGG +L I SVF +SG IIDSGTVITR AF+Q M YP SILDT Sbjct: 349 NVGGTKLPISASVFSSSGTIIDSGTVITRLPPTAYSSLKAAFRQRMKSYPLTQELSILDT 408 Query: 209 CFDFSNYXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNT 30 C+DFS++ + DL +GIL S+ Q CLAFAGNGD SD+GIFGN Sbjct: 409 CYDFSSFKTVSYPKISFVFDGGLTQDLDATGILYVASADQVCLAFAGNGDDSDIGIFGNV 468 Query: 29 QQLTYEVVY 3 QQ +VVY Sbjct: 469 QQKRLQVVY 477 >ref|XP_003551807.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Glycine max] Length = 490 Score = 353 bits (905), Expect = 9e-95 Identities = 191/359 (53%), Positives = 228/359 (63%), Gaps = 4/359 (1%) Frame = -1 Query: 1067 LNRASNTEKVEDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLTWTQCQP 888 L + S+ E+++ LP + GS +GSGNY V VGLGTPK+ L+LIFDTGSDLTWTQC+P Sbjct: 119 LGQDSSVEELDS--ATLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEP 176 Query: 887 CAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGT--CVYGIQYGDQ 714 CA SCYKQQD IF+P S SYSNI GN CS T C+YGIQYGD Sbjct: 177 CARSCYKQQDVIFDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDS 236 Query: 713 SFSVGFFSKDTLTI-ANDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTAAKYG 537 SFSVG+FS++ LT+ A DV NF FGCGQNNQGLFG +AGLIGLGR P+S + QTAAKY Sbjct: 237 SFSVGYFSRERLTVTATDVVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYR 296 Query: 536 KYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGRQLAIG 360 K FSYCLP + +++TP + S+G+SFY + I ++AVGG +L + Sbjct: 297 KIFSYCLPSTSSSTGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVS 356 Query: 359 QSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSNYXXX 180 S F T GAIIDSGTVITR AF+QGM+KYP+A SILDTC+D S Y Sbjct: 357 SSTFSTGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYKVF 416 Query: 179 
XXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYEVVY 3 V V L GIL S+ Q CLAFA NGD SDV I+GN QQ T EVVY Sbjct: 417 SIPTIEFSFAGGVTVKLPPQGILFVASTKQVCLAFAANGDDSDVTIYGNVQQRTIEVVY 475 >emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis thaliana] Length = 446 Score = 350 bits (899), Expect = 4e-94 Identities = 184/363 (50%), Positives = 227/363 (62%), Gaps = 3/363 (0%) Frame = -1 Query: 1082 SIHARLNRASNTEKV-EDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLT 906 SIH++L++ T+ V E K +LP + GS+LGSGNY+V+VGLGTPK L+LIFDTGSDLT Sbjct: 69 SIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLT 128 Query: 905 WTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQ 726 WTQCQPC +CY Q++PIFNP S SY N+ GN+ CS C+YGIQ Sbjct: 129 WTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQ 188 Query: 725 YGDQSFSVGFFSKDTLTIAN-DVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTA 549 YGDQSFSVGF +K+ T+ N DVF FGCG+NNQGLF AGL+GLGRD LS SQTA Sbjct: 189 YGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTA 248 Query: 548 AKYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGRQ 372 Y K FSYCLP G +++V+FTP+ + + G SFY ++IV++ VGG++ Sbjct: 249 TAYNKIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQK 308 Query: 371 LAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSN 192 L I +VF T GA+IDSGTVITR +FK M+KYPT SILDTCFD S Sbjct: 309 LPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSG 368 Query: 191 YXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYE 12 + V+L GI +Q CLAFAGN D S+ IFGN QQ T E Sbjct: 369 FKTVTIPKVAFSFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLE 428 Query: 11 VVY 3 VVY Sbjct: 429 VVY 431 >ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana] gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana] gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana] gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana] Length = 474 Score = 350 bits (899), Expect = 4e-94 Identities = 
184/363 (50%), Positives = 227/363 (62%), Gaps = 3/363 (0%) Frame = -1 Query: 1082 SIHARLNRASNTEKV-EDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLT 906 SIH++L++ T+ V E K +LP + GS+LGSGNY+V+VGLGTPK L+LIFDTGSDLT Sbjct: 97 SIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLT 156 Query: 905 WTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQ 726 WTQCQPC +CY Q++PIFNP S SY N+ GN+ CS C+YGIQ Sbjct: 157 WTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQ 216 Query: 725 YGDQSFSVGFFSKDTLTIAN-DVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTA 549 YGDQSFSVGF +K+ T+ N DVF FGCG+NNQGLF AGL+GLGRD LS SQTA Sbjct: 217 YGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTA 276 Query: 548 AKYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGRQ 372 Y K FSYCLP G +++V+FTP+ + + G SFY ++IV++ VGG++ Sbjct: 277 TAYNKIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQK 336 Query: 371 LAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSN 192 L I +VF T GA+IDSGTVITR +FK M+KYPT SILDTCFD S Sbjct: 337 LPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSG 396 Query: 191 YXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYE 12 + V+L GI +Q CLAFAGN D S+ IFGN QQ T E Sbjct: 397 FKTVTIPKVAFSFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLE 456 Query: 11 VVY 3 VVY Sbjct: 457 VVY 459 >ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp. lyrata] gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp. 
lyrata] Length = 475 Score = 349 bits (896), Expect = 1e-93 Identities = 183/363 (50%), Positives = 228/363 (62%), Gaps = 3/363 (0%) Frame = -1 Query: 1082 SIHARLNRASNTEKV-EDKKVNLPVQRGSSLGSGNYLVSVGLGTPKKTLTLIFDTGSDLT 906 SIH++L++ T V + + +LP + GS+LGSGNY+V+VGLGTPK L+LIFDTGSDLT Sbjct: 98 SIHSKLSKKLTTNHVSQSQSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLT 157 Query: 905 WTQCQPCAESCYKQQDPIFNPKTSASYSNIXXXXXXXXXXXXXXGNSARCSVGTCVYGIQ 726 WTQCQPC +CY Q++PIFNP S SY N+ GN+ CS C+YGIQ Sbjct: 158 WTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQ 217 Query: 725 YGDQSFSVGFFSKDTLTI-ANDVFPNFQFGCGQNNQGLFGRTAGLIGLGRDPLSVISQTA 549 YGDQSFSVGF +KD T+ ++DVF FGCG+NNQGLF AGL+GLGRD LS SQTA Sbjct: 218 YGDQSFSVGFLAKDKFTLTSSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTA 277 Query: 548 AKYGKYFSYCLPXXXXXXXXXXXXXXGATKNVQFTPLDS-SQGNSFYFISIVSLAVGGRQ 372 Y K FSYCLP G +++V+FTP+ + + G SFY ++IV++ VGG++ Sbjct: 278 TAYNKIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQK 337 Query: 371 LAIGQSVFKTSGAIIDSGTVITRXXXXXXXXXXXAFKQGMTKYPTAPAYSILDTCFDFSN 192 L I +VF T GA+IDSGTVITR +FK M+KYPT SILDTCFD S Sbjct: 338 LPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSG 397 Query: 191 YXXXXXXXXXXXXXXXVKVDLSLSGILVAVSSTQACLAFAGNGDASDVGIFGNTQQLTYE 12 + V+L GI A +Q CLAFAGN D S+ IFGN QQ T E Sbjct: 398 FKTVTIPKVAFSFSGGAVVELGSKGIFYAFKISQVCLAFAGNSDDSNAAIFGNVQQQTLE 457 Query: 11 VVY 3 VVY Sbjct: 458 VVY 460