BLASTX nr result
ID: Mentha28_contig00004441
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00004441
         (1214 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits)  Value

gb|EYU27603.1| hypothetical protein MIMGU_mgv1a004950mg [Mimulus...   447   e-123
emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]   435   e-119
ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2...   434   e-119
emb|CBI24128.3| unnamed protein product [Vitis vinifera]              434   e-119
ref|XP_007049083.1| Eukaryotic aspartyl protease family protein,...   378   e-102
ref|XP_006422317.1| hypothetical protein CICLE_v10004908mg [Citr...   375   e-101
gb|EPS68033.1| hypothetical protein M569_06741 [Genlisea aurea]       370   e-100
ref|XP_006297668.1| hypothetical protein CARUB_v10013693mg [Caps...   368   2e-99
gb|EXB51212.1| Aspartic proteinase nepenthesin-1 [Morus notabilis]    367   7e-99
ref|XP_007022806.1| Eukaryotic aspartyl protease family protein,...   362   2e-97
ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arab...   356   1e-95
ref|XP_006407304.1| hypothetical protein EUTSA_v10020732mg [Eutr...   350   9e-94
ref|NP_187876.2| aspartyl protease family protein [Arabidopsis t...   345   2e-92
gb|AAL49921.1| unknown protein [Arabidopsis thaliana]                 345   2e-92
ref|XP_004293837.1| PREDICTED: aspartic proteinase nepenthesin-1...   345   3e-92
ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative...   340   7e-91
ref|XP_006429804.1| hypothetical protein CICLE_v10013820mg [Citr...   334   5e-89
ref|XP_007211847.1| hypothetical protein PRUPE_ppa004710mg [Prun...   328   4e-87
ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group] g...   319   1e-84
ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2...   315   2e-83

>gb|EYU27603.1| hypothetical protein MIMGU_mgv1a004950mg [Mimulus guttatus] Length = 503 Score = 447 bits (1150), Expect = e-123 Identities = 236/374 (63%), Positives = 267/374 (71%), Gaps = 5/374 (1%) Frame = -2 Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXG-TXXXXXXXX 1037 I S AD+G G Y V+ RVGSPAQK+ LIADTGSDLTW N G Sbjct: 130 ISSGADFGTGQYFVQFRVGSPAQKVVLIADTGSDLTWMNCKYRCRGGGGGGCRRNSNKRR 189 Query: 1036 XXXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANET 857 RTVPCSST C DLANLF CAYDYRYSDGSA GLF NET Sbjct: 190 LFWADRSSSFRTVPCSSTTCTNDLANLFSLTRCPSPISPCAYDYRYSDGSAAQGLFGNET 249 Query: 856 VTFGLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYC 677 VT L+NGRK R+H+VL+GCS SS G +F +ADGV+GLGYSNYS AV+A++ F G FSYC Sbjct: 250 VTLSLTNGRKTRLHNVLIGCSISSSGPTFQSADGVIGLGYSNYSLAVKASNLFRGIFSYC 309 Query: 676 LVDHLSPNNLSSYLIFGSQPQHT-RMRYTELVLGVVNPFYAVAIKGISIGGAMLDIPPDT 500 LVDHLSP N+SSYL FGS Q T M YT L+L V+NPFYAV++ GISIGG+MLDIP + Sbjct: 310 LVDHLSPKNISSYLTFGSAKQQTDTMHYTALILDVINPFYAVSMNGISIGGSMLDIPAEV 369 Query: 499 WDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPL---ERVDLGIGPLEYCFNSSAGFN 329 WD+ G GG I+DSGTSLT L PAY+ V+AAL L E++ L +GPLEYCFNS+ GF Sbjct: 370 WDVKGSGGVILDSGTSLTSLVGPAYRPVMAALTASLSGFEKLGLDVGPLEYCFNST-GFV 428 Query: 328 ETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLWEF 149 E+ VPRLV HF DGARFEPPVKSYVIDAAPGVKCLGF AWPGVSVVGNIMQQN+ WEF Sbjct: 429 ESVVPRLVFHFGDGARFEPPVKSYVIDAAPGVKCLGFVGGAWPGVSVVGNIMQQNYFWEF 488 Query: 148 DIVKSRLGFAPSSC 107 D+V RLGF SSC Sbjct: 489 DLVNKRLGFGSSSC 502
>emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera] Length = 449 Score = 435 bits (1118), Expect = e-119 Identities = 223/378 (58%), Positives = 265/378 (70%), Gaps = 9/378 (2%) Frame = -2 Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXX 1034 +H AADYG G Y V +VG+P+QK L+ADTGSDLTW + Sbjct: 72 MHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHK 131
Query: 1033 XXXXXXXXXR--TVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANE 860 T+PC + +CK++L +LF C YDYRYSDGS +G FANE Sbjct: 132 RVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANE 191 Query: 859 TVTFGLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSY 680 TVT L GRK ++H+VL+GCSES +GQSF AADGVMGLGYS YSFA++AA+KFGGKFSY Sbjct: 192 TVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSY 251 Query: 679 CLVDHLSPNNLSSYLIFGSQPQH----TRMRYTELVLGVVNPFYAVAIKGISIGGAMLDI 512 CLVDHLS N+S+YL FGS M YTELVLG+VN FYAV + GISIGGAML I Sbjct: 252 CLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKI 311 Query: 511 PPDTWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPL---ERVDLGIGPLEYCFNSS 341 P + WD+ G GG I+DSG+SLT LT PAY+ V+AAL++ L +V++ IGPLEYCFNS+ Sbjct: 312 PSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNST 371 Query: 340 AGFNETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNH 161 GF E+ VPRLV HFADGA FEPPVKSYVI AA GV+CLGF + AWPG SVVGNIMQQNH Sbjct: 372 -GFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNH 430 Query: 160 LWEFDIVKSRLGFAPSSC 107 LWEFD+ +LGFAPSSC Sbjct: 431 LWEFDLGLKKLGFAPSSC 448 >ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera] Length = 449 Score = 434 bits (1116), Expect = e-119 Identities = 223/378 (58%), Positives = 265/378 (70%), Gaps = 9/378 (2%) Frame = -2 Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXX 1034 +H AADYG G Y V +VG+P+QK L+ADTGSDLTW + Sbjct: 72 MHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHK 131 Query: 1033 XXXXXXXXXR--TVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANE 860 T+PC + +CK++L +LF C YDYRYSDGS +G FANE Sbjct: 132 RVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANE 191 Query: 859 TVTFGLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSY 680 TVT L GRK ++H+VL+GCSES +GQSF AADGVMGLGYS YSFA++AA+KFGGKFSY Sbjct: 192 TVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSY 251 Query: 679 
CLVDHLSPNNLSSYLIFGSQPQH----TRMRYTELVLGVVNPFYAVAIKGISIGGAMLDI 512 CLVDHLS N+S+YL FGS M YTELVLG+VN FYAV + GISIGGAML I Sbjct: 252 CLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKI 311 Query: 511 PPDTWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPL---ERVDLGIGPLEYCFNSS 341 P + WD+ G GG I+DSG+SLT LT PAY+ V+AAL++ L +V++ IGPLEYCFNS+ Sbjct: 312 PSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNST 371 Query: 340 AGFNETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNH 161 GF E+ VPRLV HFADGA FEPPVKSYVI AA GV+CLGF + AWPG SVVGNIMQQNH Sbjct: 372 -GFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNH 430 Query: 160 LWEFDIVKSRLGFAPSSC 107 LWEFD+ +LGFAPSSC Sbjct: 431 LWEFDLGLKKLGFAPSSC 448 >emb|CBI24128.3| unnamed protein product [Vitis vinifera] Length = 378 Score = 434 bits (1116), Expect = e-119 Identities = 223/378 (58%), Positives = 265/378 (70%), Gaps = 9/378 (2%) Frame = -2 Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXX 1034 +H AADYG G Y V +VG+P+QK L+ADTGSDLTW + Sbjct: 1 MHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHK 60 Query: 1033 XXXXXXXXXR--TVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANE 860 T+PC + +CK++L +LF C YDYRYSDGS +G FANE Sbjct: 61 RVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANE 120 Query: 859 TVTFGLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSY 680 TVT L GRK ++H+VL+GCSES +GQSF AADGVMGLGYS YSFA++AA+KFGGKFSY Sbjct: 121 TVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSY 180 Query: 679 CLVDHLSPNNLSSYLIFGSQPQH----TRMRYTELVLGVVNPFYAVAIKGISIGGAMLDI 512 CLVDHLS N+S+YL FGS M YTELVLG+VN FYAV + GISIGGAML I Sbjct: 181 CLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKI 240 Query: 511 PPDTWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPL---ERVDLGIGPLEYCFNSS 341 P + WD+ G GG I+DSG+SLT LT PAY+ V+AAL++ L +V++ IGPLEYCFNS+ Sbjct: 241 PSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNST 300 Query: 340 AGFNETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNH 161 GF E+ VPRLV HFADGA 
FEPPVKSYVI AA GV+CLGF + AWPG SVVGNIMQQNH Sbjct: 301 -GFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNH 359 Query: 160 LWEFDIVKSRLGFAPSSC 107 LWEFD+ +LGFAPSSC Sbjct: 360 LWEFDLGLKKLGFAPSSC 377 >ref|XP_007049083.1| Eukaryotic aspartyl protease family protein, putative [Theobroma cacao] gi|508701344|gb|EOX93240.1| Eukaryotic aspartyl protease family protein, putative [Theobroma cacao] Length = 478 Score = 378 bits (970), Expect = e-102 Identities = 195/374 (52%), Positives = 250/374 (66%), Gaps = 5/374 (1%) Frame = -2 Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXX 1034 + SAAD G G Y V RVGSP +K +IADTGS LTW Sbjct: 107 MRSAADIGTGQYFVSFRVGSPPKKFIMIADTGSSLTWMRCSYKCKNFSMDRTKLHERIFY 166 Query: 1033 XXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETV 854 +PCSS +CKV+L+ F CAYDYRY+DG+ VG+F N+TV Sbjct: 167 ANQSRTFKP--IPCSSDVCKVELSQSFSLALCPTPMAPCAYDYRYADGTRVVGIFGNDTV 224 Query: 853 TFGLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYCL 674 LS G+K +V DV+VGCSE+ RG +F DGVMGLG+ +SFAV+AA +FG KFSYCL Sbjct: 225 KVRLSGGQKIKVTDVMVGCSEAIRG-NFHDIDGVMGLGFDQHSFAVKAAKEFGDKFSYCL 283 Query: 673 VDHLSPNNLSSYLIFGSQPQHT--RMRYTELVLGVVNPFYAVAIKGISIGGAMLDIPPDT 500 VDHLSP+NL ++L+FG M++T+L+LG+VNP+YAV + GIS+ G MLDIP Sbjct: 284 VDHLSPSNLVNFLVFGGVTSSPLPNMQFTQLILGIVNPYYAVNVSGISVNGKMLDIPSYI 343 Query: 499 WDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPL---ERVDLGIGPLEYCFNSSAGFN 329 WD+ G GG I+DSG+SLT L P + V+AA Q PL ++++L +GP +YCF S+AGF Sbjct: 344 WDVKGDGGVIMDSGSSLTYLVKPLFDKVIAAFQAPLSKFKKLELNLGP-DYCF-SAAGFE 401 Query: 328 ETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLWEF 149 E+ +P+L HFADGA+ PPVKSYVIDA VKCLGF++ +WPG SV+GNI+QQNHLWEF Sbjct: 402 ESLMPKLAFHFADGAKLVPPVKSYVIDAEEAVKCLGFSSTSWPGPSVIGNILQQNHLWEF 461 Query: 148 DIVKSRLGFAPSSC 107 D++ SRLGFA SSC Sbjct: 462 DLLNSRLGFAASSC 475 >ref|XP_006422317.1| hypothetical protein CICLE_v10004908mg [Citrus clementina] gi|568881779|ref|XP_006493729.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Citrus sinensis] 
gi|557524190|gb|ESR35557.1| hypothetical protein CICLE_v10004908mg [Citrus clementina] Length = 470 Score = 375 bits (963), Expect = e-101 Identities = 189/375 (50%), Positives = 244/375 (65%), Gaps = 6/375 (1%) Frame = -2 Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTW-SNXXXXXXXXXXGTXXXXXXXX 1037 + + DYG G+Y V+++VG+P+QKL LI DTGS+ +W S Sbjct: 95 LQAGRDYGTGMYFVEIKVGTPSQKLRLIVDTGSEFSWISCRYHCGPSCTKKGTIAGSRRR 154 Query: 1036 XXXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANET 857 +T+PCSS +CK + A LF CAYDYRY+DGSA G+F E Sbjct: 155 VFKADLSSSFKTIPCSSDMCKSEFARLFSLTFCPTPTSPCAYDYRYADGSAAKGIFGKER 214 Query: 856 VTFGLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFG---GKF 686 VT GL NG K R+ +V++GCS++ +GQ F ADGV+GL Y YSFA + + GKF Sbjct: 215 VTIGLENGGKTRIEEVVMGCSDTIQGQIFAEADGVLGLSYDKYSFAQKVTNGSTFARGKF 274 Query: 685 SYCLVDHLSPNNLSSYLIFGSQPQHTRMRYTELVLGVVNPFYAVAIKGISIGGAMLDIPP 506 +YCLVDHLS N+S+YLIFG + + RMR +LG++ P Y V++KGISIGG ML+IP Sbjct: 275 AYCLVDHLSHKNVSNYLIFGEESKRMRMRMRYTLLGLIGPDYGVSVKGISIGGVMLNIPS 334 Query: 505 DTWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPLERVD--LGIGPLEYCFNSSAGF 332 WD + GGG DSGT+LT L PAYK VVAAL++ L R P EYCFNS+ GF Sbjct: 335 QVWDFNRGGGTAFDSGTTLTFLAEPAYKPVVAALEMSLSRYQRLKRDAPFEYCFNST-GF 393 Query: 331 NETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLWE 152 +E++VP+LV HFADGARFEP KSY+I A G++CLGF +A WPG S +GNIMQQN+ WE Sbjct: 394 DESSVPKLVFHFADGARFEPHTKSYIIRVAHGIRCLGFVSATWPGASAIGNIMQQNYFWE 453 Query: 151 FDIVKSRLGFAPSSC 107 FD++K RLGFAPS+C Sbjct: 454 FDLLKDRLGFAPSTC 468 >gb|EPS68033.1| hypothetical protein M569_06741 [Genlisea aurea] Length = 449 Score = 370 bits (951), Expect = e-100 Identities = 203/380 (53%), Positives = 248/380 (65%), Gaps = 11/380 (2%) Frame = -2 Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXX 1034 +++ AD G YLV RVGSPAQ + LIADTGSDLTW+ + Sbjct: 75 MYAGADLGIAQYLVAFRVGSPAQSVALIADTGSDLTWTKCSYGCGGGCRRS-----SGRL 129 Query: 1033 XXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETV 854 +TV CSST C VDLA F 
CAYDYRY+DGS+ G+FA ETV Sbjct: 130 FDADRSTSFKTVECSSTTCTVDLAGAFSLSRCSPPSDPCAYDYRYADGSSAEGIFAGETV 189 Query: 853 TFGLSNGR-KRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYC 677 L+ GR K R+ +VL+GC+++ G SF +DGV+GLGYSN+SFA AA +FG KFSYC Sbjct: 190 ELKLAKGRGKARLQNVLIGCTKNFSGSSFQTSDGVLGLGYSNFSFAHAAAARFGDKFSYC 249 Query: 676 LVDHLSPNNLSSYLIFGSQPQHTR------MRYTELVLGVVNPFYAVAIKGISIGGAMLD 515 L+DHL+ N SSY+ F S + +RYT+LVLGV+ YAV ++GISIGG+ L Sbjct: 250 LLDHLAAKNKSSYITFSSGRSISASISAGPIRYTDLVLGVIGSNYAVNVRGISIGGSWLR 309 Query: 514 IPPDTW-DLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPLERV---DLGIGPLEYCFN 347 IP DTW +L G GG I+DSG+SLT L PAY V+AAL L R + IGP+E CFN Sbjct: 310 IPSDTWNNLSGSGGVIIDSGSSLTALAPPAYAPVIAALNRSLARFGDPHVKIGPMECCFN 369 Query: 346 SSAGFNETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQ 167 S+ GF+E+ VP+L +HFA G RFEPPVKSYVIDAAPGV CLGF AA PGVSV+GNI+QQ Sbjct: 370 ST-GFHESVVPKLAIHFAGGTRFEPPVKSYVIDAAPGVVCLGFVQAASPGVSVIGNILQQ 428 Query: 166 NHLWEFDIVKSRLGFAPSSC 107 NH WEFD+ RLGFA S C Sbjct: 429 NHWWEFDLGNRRLGFAASDC 448 >ref|XP_006297668.1| hypothetical protein CARUB_v10013693mg [Capsella rubella] gi|482566377|gb|EOA30566.1| hypothetical protein CARUB_v10013693mg [Capsella rubella] Length = 448 Score = 368 bits (945), Expect = 2e-99 Identities = 195/373 (52%), Positives = 240/373 (64%), Gaps = 6/373 (1%) Frame = -2 Query: 1207 SAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXXXX 1028 S DYG Y ++RVG+PA+K ++ DTGS+LTW N Sbjct: 80 SGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCKYRGRG-----KGRVENRRVFR 134 Query: 1027 XXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETVTF 848 RTV C + CKVDL NLF C+YDYRY+DGSA G+FA ETVT Sbjct: 135 AEESKSFRTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQGIFAKETVTV 194 Query: 847 GLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYCLVD 668 GL+NGRK R+H +L+GCS S GQSF ADGV+GL +S++SF A FG KFSYCLVD Sbjct: 195 GLTNGRKARLHGLLIGCSSSFSGQSFRGADGVLGLAFSDFSFTSTATSLFGAKFSYCLVD 254 Query: 667 HLSPNNLSSYLIFGSQPQHTRM---RYTELVLGVVNPFYAVAIKGISIGGAMLDIPPDTW 497 HLSP N+S+YLIFGS T+ R T 
L L ++ PFYA+++ GIS+G MLDIP W Sbjct: 255 HLSPKNVSNYLIFGSSSSATKNAPGRTTPLDLTLIPPFYAISVIGISLGEDMLDIPAQVW 314 Query: 496 DLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQL---PLERVDLGIGPLEYCFNSSAGFNE 326 D GGG ++DSGTSLT+L+ AYK VV L LERV P+EYCF+S++GFNE Sbjct: 315 DATTGGGTVLDSGTSLTLLSEAAYKPVVTGLARYLDELERVKPEGVPIEYCFSSTSGFNE 374 Query: 325 TAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLWEFD 146 + +P+L H GARFEP KSY+ID APGVKCLGF +A P +VVGNIMQQN+LWEFD Sbjct: 375 SKLPQLTFHMKGGARFEPHRKSYLIDTAPGVKCLGFMSAGTPATNVVGNIMQQNYLWEFD 434 Query: 145 IVKSRLGFAPSSC 107 ++ S L FAPSSC Sbjct: 435 LMASTLSFAPSSC 447 >gb|EXB51212.1| Aspartic proteinase nepenthesin-1 [Morus notabilis] Length = 464 Score = 367 bits (941), Expect = 7e-99 Identities = 191/382 (50%), Positives = 250/382 (65%), Gaps = 13/382 (3%) Frame = -2 Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXX 1034 +++ ADYG G Y V + VG+P Q+ L+ADTGSDLTW + Sbjct: 85 MNAGADYGVGEYFVHVTVGTPGQRFMLVADTGSDLTWMHCRCGRRCGTHK--GRLNNRRV 142 Query: 1033 XXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETV 854 +T+PC S +CKV+LANLF CAYDYRY +GS+ +G FANET+ Sbjct: 143 FHADRSSSFKTIPCLSEMCKVELANLFSLSKCPTPLTPCAYDYRYLEGSSAIGFFANETI 202 Query: 853 TFGLSNGRKRRVHDVLVGCSESSRG---QSFVAADGVMGLGYSNYSFAVRAADKFGGKFS 683 + L+NG+KR++ DVLVGC+ES +G F ADGV+GLG+ N++F +AA FGGKFS Sbjct: 203 SVRLANGKKRKLRDVLVGCTESVQGAEESGFKGADGVLGLGFGNHTFTRKAAQYFGGKFS 262 Query: 682 YCLVDHLSPNNLSSYLIFGSQPQH-----TRMRYTELVLGV-VNPFYAVAIKGISIGGAM 521 YCLVDHLSP NLS+Y+IFG + +++T+LVLG PFY V + GISIGG + Sbjct: 263 YCLVDHLSPKNLSNYIIFGHDKADKASCSSSLQHTDLVLGGDYGPFYGVNLSGISIGGVL 322 Query: 520 LDIPPDTWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPLERVDL----GIGPLEYC 353 L IP W+ GGGAI++SGTSLT LT P Y V + L R G GP E+C Sbjct: 323 LRIPSVAWNASLGGGAILESGTSLTFLTDPVYGPVTSELNKFTSRFGTLLPPGGGPFEFC 382 Query: 352 FNSSAGFNETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIM 173 FNS+ G++E+ +P L +HF++GA FEPPVKSY++D AP KCLGF +A+WPG S++GNIM Sbjct: 383 FNST-GYDESKMPPLRIHFSNGAIFEPPVKSYILDIAPEKKCLGFVSASWPGTSIIGNIM 441 Query: 
172 QQNHLWEFDIVKSRLGFAPSSC 107 QQNHLWEFD+ +RLGFAPS+C Sbjct: 442 QQNHLWEFDLENTRLGFAPSTC 463 >ref|XP_007022806.1| Eukaryotic aspartyl protease family protein, putative [Theobroma cacao] gi|508722434|gb|EOY14331.1| Eukaryotic aspartyl protease family protein, putative [Theobroma cacao] Length = 473 Score = 362 bits (928), Expect = 2e-97 Identities = 196/388 (50%), Positives = 251/388 (64%), Gaps = 19/388 (4%) Frame = -2 Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGT--XXXXXXX 1040 + + D+G G Y+ +VG+P+QK LI DTGSDLTW N T Sbjct: 84 LSAGRDFGIGQYVTTFKVGTPSQKFRLIVDTGSDLTWINCRYRCARGDNCTTQERGIKRG 143 Query: 1039 XXXXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDY----------RYSDG 890 R +PC S +CKV+L NLF CAYDY RY DG Sbjct: 144 RVFRAHLSSSFRPIPCFSQMCKVELRNLFSLTICPTPLTPCAYDYRFNSLKLVLNRYIDG 203 Query: 889 SATVGLFANETVTFGLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRA 710 S +G+FA E+VT GL+N R R+HDVL+GCS+SS+G++ DGV+GL S YSF +A Sbjct: 204 SDAMGVFAKESVTVGLTNSRMARLHDVLIGCSDSSQGRTVKNVDGVLGLANSKYSFVTKA 263 Query: 709 ADKFGGKFSYCLVDHLSPNNLSSYLIFGSQPQHTRM----RYTELVLGVVNPFYAVAIKG 542 A+++GGKFSYCLVDHLS N S+YLIFG+ + RYT L L +V+ YAV ++G Sbjct: 264 AERWGGKFSYCLVDHLSHINASNYLIFGANNNQLTVLGNTRYTRLELNLVSFSYAVNVQG 323 Query: 541 ISIGGAMLDIPPDTWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPLER---VDLGI 371 ISIGG MLDIP WD GGG I+DSGTSL+ LT PAY+ V+AA+++ + + V L Sbjct: 324 ISIGGKMLDIPLQVWDTRKGGGTILDSGTSLSFLTDPAYQPVMAAIKMSVSKYPQVKLHG 383 Query: 370 GPLEYCFNSSAGFNETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVS 191 P+EYCFNS+ GF+ET VP+L++HFADGARFEP +SYVI AA GV+CLGF A +P VS Sbjct: 384 VPMEYCFNST-GFDETLVPKLIIHFADGARFEPHWRSYVISAADGVRCLGFLPARFPSVS 442 Query: 190 VVGNIMQQNHLWEFDIVKSRLGFAPSSC 107 V+GNIMQQN+LWEFD+ ++L FAPSSC Sbjct: 443 VIGNIMQQNYLWEFDLEGNKLRFAPSSC 470 >ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp. lyrata] gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp. 
lyrata] Length = 449 Score = 356 bits (913), Expect = 1e-95 Identities = 191/375 (50%), Positives = 235/375 (62%), Gaps = 8/375 (2%) Frame = -2 Query: 1207 SAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXXXX 1028 S DYG Y ++RVG+PA+K ++ DTGS+LTW N Sbjct: 79 SGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCRYRGRG-----KGKVKNRRVFR 133 Query: 1027 XXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETVTF 848 +TV C + CKVDL NLF C+YDYRY+DGSA G+FA ET+T Sbjct: 134 AEESKSFKTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQGVFAKETITV 193 Query: 847 GLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYCLVD 668 GL+NGRK R+ +LVGCS S GQSF ADGV+GL +S++SF A FG K SYCLVD Sbjct: 194 GLTNGRKARLRGLLVGCSSSFSGQSFQGADGVLGLAFSDFSFTSTATSLFGAKLSYCLVD 253 Query: 667 HLSPNNLSSYLIFGSQPQHTRM-----RYTELVLGVVNPFYAVAIKGISIGGAMLDIPPD 503 HLS N+S+YLIFG T R T L L ++ PFYA+ I GISIG MLDIP Sbjct: 254 HLSNKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQ 313 Query: 502 TWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQ---LPLERVDLGIGPLEYCFNSSAGF 332 WD GGG I+DSGTSLT+L AYK VV L + L+RV P+EYCF+S++GF Sbjct: 314 VWDATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSGF 373 Query: 331 NETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLWE 152 NE+ +P+L H GARFEP KSY++DAAPGVKCLGF +A P +VVGNIMQQN+LWE Sbjct: 374 NESKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFMSAGTPATNVVGNIMQQNYLWE 433 Query: 151 FDIVKSRLGFAPSSC 107 FD++ S L FAPS+C Sbjct: 434 FDLMASTLSFAPSTC 448 >ref|XP_006407304.1| hypothetical protein EUTSA_v10020732mg [Eutrema salsugineum] gi|557108450|gb|ESQ48757.1| hypothetical protein EUTSA_v10020732mg [Eutrema salsugineum] Length = 444 Score = 350 bits (897), Expect = 9e-94 Identities = 185/375 (49%), Positives = 235/375 (62%), Gaps = 7/375 (1%) Frame = -2 Query: 1207 SAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXXXX 1028 S DYGA Y ++RVG+PA++ ++ DTGS+LTW N Sbjct: 78 SGFDYGAAQYFAEVRVGTPAKRFRVVVDTGSELTWVNCRFHGKGKENRRVFRAEESSSFR 137 Query: 1027 XXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETVTF 848 V C + CKVDL 
NLF C+YDYRY+DGSA G+FA ET T Sbjct: 138 K--------VGCLTQTCKVDLMNLFSLSNCPTPSTPCSYDYRYADGSAAQGVFAKETFTV 189 Query: 847 GLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYCLVD 668 GL+NGRK ++ +L+GCS S G SF ADGV+GL S+YSF +A + FGGKFSYCLVD Sbjct: 190 GLTNGRKAKLRGLLIGCSSSFSGDSFRGADGVLGLALSDYSFTSKATNIFGGKFSYCLVD 249 Query: 667 HLSPNNLSSYLIFGSQPQHTR----MRYTELVLGVVNPFYAVAIKGISIGGAMLDIPPDT 500 HLS N+S+YL FGS T+ +R T L L ++ PFYA+ I GISIG MLDIP Sbjct: 250 HLSNKNVSNYLTFGSSSSTTKTAASIRTTPLDLKLIPPFYAINIIGISIGDDMLDIPTQV 309 Query: 499 WDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQ---LPLERVDLGIGPLEYCFNSSAGFN 329 WD GGG I+DSGTSLT L AYK VV+ L+ + +RV P+EYCF++++GFN Sbjct: 310 WDATAGGGTILDSGTSLTFLADAAYKAVVSGLERYLVGFKRVKPEGVPIEYCFDTTSGFN 369 Query: 328 ETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLWEF 149 E+ +P+L HF GARFEP +SYV+D GV+CLGF + P +VVGNIMQQN+LWEF Sbjct: 370 ESKLPQLTFHFKGGARFEPHRRSYVVDTLEGVRCLGFVSTGSPATNVVGNIMQQNYLWEF 429 Query: 148 DIVKSRLGFAPSSCV 104 D+V S L FAPS+C+ Sbjct: 430 DLVASTLSFAPSTCL 444 >ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana] gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis thaliana] gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana] Length = 461 Score = 345 bits (886), Expect = 2e-92 Identities = 184/372 (49%), Positives = 231/372 (62%), Gaps = 5/372 (1%) Frame = -2 Query: 1207 SAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXXXX 1028 S DYG Y ++RVG+PA+K ++ DTGS+LTW N Sbjct: 97 SGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFK 156 Query: 1027 XXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETVTF 848 TV C + CKVDL NLF C+YDYRY+DGSA G+FA ET+T Sbjct: 157 --------TVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITV 208 Query: 847 GLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYCLVD 668 GL+NGR R+ L+GCS S GQSF ADGV+GL +S++SF A +G KFSYCLVD Sbjct: 209 GLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVD 268 Query: 667 
HLSPNNLSSYLIFGS--QPQHTRMRYTELVLGVVNPFYAVAIKGISIGGAMLDIPPDTWD 494 HLS N+S+YLIFGS + R T L L + PFYA+ + GIS+G MLDIP WD Sbjct: 269 HLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWD 328 Query: 493 LDGGGGAIVDSGTSLTVLTLPAYKLVVAALQ---LPLERVDLGIGPLEYCFNSSAGFNET 323 GGG I+DSGTSLT+L AYK VV L + L+RV P+EYCF+ ++GFN + Sbjct: 329 ATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVS 388 Query: 322 AVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLWEFDI 143 +P+L H GARFEP KSY++DAAPGVKCLGF +A P +V+GNIMQQN+LWEFD+ Sbjct: 389 KLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDL 448 Query: 142 VKSRLGFAPSSC 107 + S L FAPS+C Sbjct: 449 MASTLSFAPSAC 460 >gb|AAL49921.1| unknown protein [Arabidopsis thaliana] Length = 439 Score = 345 bits (886), Expect = 2e-92 Identities = 184/372 (49%), Positives = 231/372 (62%), Gaps = 5/372 (1%) Frame = -2 Query: 1207 SAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXXXX 1028 S DYG Y ++RVG+PA+K ++ DTGS+LTW N Sbjct: 75 SGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFK 134 Query: 1027 XXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETVTF 848 TV C + CKVDL NLF C+YDYRY+DGSA G+FA ET+T Sbjct: 135 --------TVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITV 186 Query: 847 GLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYCLVD 668 GL+NGR R+ L+GCS S GQSF ADGV+GL +S++SF A +G KFSYCLVD Sbjct: 187 GLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVD 246 Query: 667 HLSPNNLSSYLIFGS--QPQHTRMRYTELVLGVVNPFYAVAIKGISIGGAMLDIPPDTWD 494 HLS N+S+YLIFGS + R T L L + PFYA+ + GIS+G MLDIP WD Sbjct: 247 HLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWD 306 Query: 493 LDGGGGAIVDSGTSLTVLTLPAYKLVVAALQ---LPLERVDLGIGPLEYCFNSSAGFNET 323 GGG I+DSGTSLT+L AYK VV L + L+RV P+EYCF+ ++GFN + Sbjct: 307 ATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVS 366 Query: 322 AVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLWEFDI 143 +P+L H GARFEP KSY++DAAPGVKCLGF +A P +V+GNIMQQN+LWEFD+ Sbjct: 367 
KLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDL 426 Query: 142 VKSRLGFAPSSC 107 + S L FAPS+C Sbjct: 427 MASTLSFAPSAC 438 >ref|XP_004293837.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Fragaria vesca subsp. vesca] Length = 482 Score = 345 bits (884), Expect = 3e-92 Identities = 188/382 (49%), Positives = 243/382 (63%), Gaps = 13/382 (3%) Frame = -2 Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXX 1034 + SA D+GAG Y V+++VG+P+Q+ LIADTGSDLTW G Sbjct: 103 LSSAWDFGAGQYFVQIKVGTPSQRFLLIADTGSDLTWMKCKYRCVADKCGLKRATMKKNK 162 Query: 1033 XXXXXXXXXRT---VPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFAN 863 T +PCSS +CK +L F C YDYRY++ S +G FAN Sbjct: 163 KKVFRPAQSSTFKIIPCSSEMCKFELE--FSRQECPTPLSPCKYDYRYAESSGALGFFAN 220 Query: 862 ETVTFGLSNGRKRRVHDVLVGCSES---SRGQSFVAADGVMGLGYSNYSFAVRAADKFGG 692 ETV L+NGR+ R++DVL+GC+ES +G S A DG++GLG+ +SF +AA G Sbjct: 221 ETVRVPLTNGRRARLNDVLIGCTESIEGPKGASIRAGDGILGLGFGKHSFVAKAASNLGD 280 Query: 691 KFSYCLVDHLSPNNLSSYLIFG----SQPQHTRMRYTELVLG--VVNPFYAVAIKGISIG 530 KFSYCLVDH+S N+SSYL FG + Q++RMRYT+L LG + PFYAV + GIS G Sbjct: 281 KFSYCLVDHMSNKNVSSYLTFGRNAETAQQNSRMRYTKLALGGPKIGPFYAVNLVGISAG 340 Query: 529 GAMLDIPPDTWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPLERVD-LGIGPLEYC 353 ML IP + W+ + GGG IVDSGTSLT LT PAY V+ L + L + + E+C Sbjct: 341 SKMLKIPNEVWNENLGGGTIVDSGTSLTFLTSPAYIHVMDELTMALSKYKKIPSDAFEFC 400 Query: 352 FNSSAGFNETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIM 173 FNS+ G++++ VPR +HFADGA+FEPPVKSYVID A KCLGF +A +PG V+GNIM Sbjct: 401 FNST-GYDQSLVPRFAIHFADGAKFEPPVKSYVIDVAIQTKCLGFQSAPFPGTIVIGNIM 459 Query: 172 QQNHLWEFDIVKSRLGFAPSSC 107 QQN+LWEFD+ RLG+APSSC Sbjct: 460 QQNYLWEFDLRGGRLGYAPSSC 481 >ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis] gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis] Length = 489 Score = 340 bits (872), Expect = 7e-91 Identities = 184/387 (47%), Positives = 243/387 (62%), Gaps = 9/387 (2%) Frame = -2 Query: 1213 
IHSAADYGAGLYLVKLRVGSPA-QKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXX 1037 IHS AD G Y V +R+G+P QK L+ DTGSDLTW N Sbjct: 108 IHSGADSGQSQYFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFR 167 Query: 1036 XXXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANET 857 T+PCSS CK++L + F C +DYRY +G +G+FANET Sbjct: 168 ANDSSSFR---TIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANET 224 Query: 856 VTFGLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYC 677 VT GL++ +K R+ DVL+GC+ES ++ DGVMGLGY +S A+R A+ FG KFSYC Sbjct: 225 VTVGLNDHKKIRLFDVLIGCTESFN-ETNGFPDGVMGLGYRKHSLALRLAEIFGNKFSYC 283 Query: 676 LVDHLSPNNLSSYLIFGSQPQHT--RMRYTELVLGVVNPFYAVAIKGISIGGAMLDIPPD 503 LVDHLS +N ++L FG P+ +M++TEL+LG +N FY V + GIS+GG+ML I D Sbjct: 284 LVDHLSSSNHKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSD 343 Query: 502 TWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPLERVDLGIGPLE------YCFNSS 341 W++ G GG IVDSGTSLT+L AY VV AL+ P+ + P+E +CF Sbjct: 344 IWNVTGVGGMIVDSGTSLTMLAGEAYDKVVDALK-PIFDKHKKVVPIELPELNNFCFEDK 402 Query: 340 AGFNETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNH 161 GF+ AVPRL++HFADGA F+PPVKSY+ID A G+KCLG A +PG S++GN+MQQNH Sbjct: 403 -GFDRAAVPRLLIHFADGAIFKPPVKSYIIDVAEGIKCLGIIKADFPGSSILGNVMQQNH 461 Query: 160 LWEFDIVKSRLGFAPSSCV*FWTNSSH 80 LWE+D+ + +LGF PSSC+ +NS H Sbjct: 462 LWEYDLGRGKLGFGPSSCIMSNSNSKH 488 >ref|XP_006429804.1| hypothetical protein CICLE_v10013820mg [Citrus clementina] gi|557531861|gb|ESR43044.1| hypothetical protein CICLE_v10013820mg [Citrus clementina] Length = 475 Score = 334 bits (856), Expect = 5e-89 Identities = 184/359 (51%), Positives = 230/359 (64%), Gaps = 6/359 (1%) Frame = -2 Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXX 1034 + S AD G G Y V RVGSP QK LIADTGSDLTW + Sbjct: 117 LRSGADRGLGQYFVSFRVGSPPQKFVLIADTGSDLTWMHCNHKGENCPKD--GLTPPNRM 174 Query: 1033 XXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETV 854 +T+PCSS CKVDL + F CAYDY Y DGS G FANETV Sbjct: 175 FQADASSTFKTIPCSSRTCKVDLQDTFSLSMCPTPVTPCAYDYSYFDGSKVRGFFANETV 234 Query: 853 
TFGLSNGRKR-RVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYC 677 T G + RK+ R+ +V VGC++ + G +F ADGV+GLG+ SFA AA F KFSYC Sbjct: 235 TAGSIDRRKKVRLKEVTVGCTDWANG-NFHNADGVLGLGFGKNSFAATAAKLFDNKFSYC 293 Query: 676 LVDHLSPNNLSSYLIFGS-QPQHTR-MRYTELVLGVVNPFYAVAIKGISIGGAMLDIPPD 503 LVDHLSP+N +++L FG+ QH + M++T+L+LG +NPFYAV + GISI G ML++PP+ Sbjct: 294 LVDHLSPSNFANFLNFGNTSKQHIQNMQHTQLILGELNPFYAVNVSGISIAGKMLNVPPE 353 Query: 502 TWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPLERVD-LG--IGPLEYCFNSSAGF 332 W + G GG I+DSGT+LT L PAY VAAL+ PLE+ LG +GPL +C+N F Sbjct: 354 MWHIHGAGGVILDSGTTLTFLGEPAYAAAVAALRAPLEKYKKLGHVLGPLRFCYNDPR-F 412 Query: 331 NETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLW 155 + VP+ V+HFADGA+F PP KSYVIDA GVKC+GFA+A WP +V+GNIMQQNHLW Sbjct: 413 DMADVPQFVLHFADGAKFVPPKKSYVIDADVGVKCIGFASAGWPANTVIGNIMQQNHLW 471 >ref|XP_007211847.1| hypothetical protein PRUPE_ppa004710mg [Prunus persica] gi|462407712|gb|EMJ13046.1| hypothetical protein PRUPE_ppa004710mg [Prunus persica] Length = 495 Score = 328 bits (840), Expect = 4e-87 Identities = 175/373 (46%), Positives = 231/373 (61%), Gaps = 9/373 (2%) Frame = -2 Query: 1198 DYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXXXXXXX 1019 DYG G YLVKL++G+PAQK +I TGSDLTW Sbjct: 124 DYGIGQYLVKLKLGTPAQKFTVIPSTGSDLTWVRCGSHCGKSCGIRKGRIDHSRVFNTDR 183 Query: 1018 XXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETVTFGLS 839 ++V CSS +C+ DLAN C YDY Y +GS+ +G F + V LS Sbjct: 184 SSTFKSVTCSSKMCEFDLANFNSLNKCPRPLSPCRYDYSYVEGSSALGTFGTDIVRASLS 243 Query: 838 NGRKRRVHDVLVGCSESSRGQSFV-AADGVMGLGYSNYSFAVRAADKFGGKFSYCLVDHL 662 NGR+ R+ DVL+GC+ES G+ +DG++GLG+ YSF +AA K+GGK SYCL+DH+ Sbjct: 244 NGRRNRMKDVLIGCTESIIGKGTAKGSDGILGLGFGKYSFTTKAALKYGGKVSYCLLDHM 303 Query: 661 SPNNLSSYLIFGSQPQHT---RMRYTELVLGVVNP--FYAVAIKGISIGGAMLDIPPDTW 497 SP N++SYL FG + +MRYT+LV G N FY V ++GIS+GG ML+IP W Sbjct: 304 SPKNVTSYLTFGDNKKAVLQGKMRYTQLVFGNPNKGSFYGVNLQGISVGGKMLNIPLHIW 363 Query: 496 DLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPL---ERVDLGIGPLEYCFNSSAGFNE 326 + GGGA+VDSG SLT LT PAYK 
V+ AL +PL R+ ++CF+ G+ + Sbjct: 364 NPKLGGGALVDSGMSLTFLTKPAYKPVMTALTMPLTKFRRLRSEEDDFDFCFD-PRGYRD 422 Query: 325 TAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLWEFD 146 VP+LV HFA GA+F PPVKSYVID +PG+KC+G A G ++GNI+QQNHLWEF+ Sbjct: 423 RLVPKLVFHFAGGAKFAPPVKSYVIDVSPGMKCIGILPLA-EGACIIGNIIQQNHLWEFN 481 Query: 145 IVKSRLGFAPSSC 107 +V+ LGFAPS+C Sbjct: 482 LVRKTLGFAPSTC 494 >ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group] gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group] gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group] Length = 494 Score = 319 bits (818), Expect = 1e-84 Identities = 181/396 (45%), Positives = 225/396 (56%), Gaps = 27/396 (6%) Frame = -2 Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTW--------SNXXXXXXXXXXGTX 1058 + S A G G Y V+ RVG+PAQ LIADTGSDLTW + Sbjct: 99 LSSGAYTGTGQYFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAAAPS 158 Query: 1057 XXXXXXXXXXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATV 878 +PCSS CK + F C+YDYRY+D SA Sbjct: 159 PAVAPPRVFRPGDSKTWSPIPCSSETCKSTIP--FSLANCSSSTAACSYDYRYNDNSAAR 216 Query: 877 GLFANETVTFGLSNGR--------KRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSF 722 G+ ++ T LS GR K ++ V++GC+ + GQ F A+DGV+ LGYSN SF Sbjct: 217 GVVGTDSATVALSGGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFEASDGVLSLGYSNISF 276 Query: 721 AVRAADKFGGKFSYCLVDHLSPNNLSSYLIFGSQPQHTRMRY------TELVLGV-VNPF 563 A RAA +FGG+FSYCLVDHL+P N +SYL FG+ P T L+L V PF Sbjct: 277 ASRAASRFGGRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARVRPF 336 Query: 562 YAVAIKGISIGGAMLDIPPDTWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPLERV 383 YAVA+ +S+ G LDIP + WD+ GG I+DSGTSLTVL PAYK VVAAL L + Sbjct: 337 YAVAVDSVSVDGVALDIPAEVWDVGSNGGTIIDSGTSLTVLATPAYKAVVAALSEQLAGL 396 Query: 382 D-LGIGPLEYCFNSSA---GFNETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFA 215 + + P +YC+N +A G + AVP+L V FA AR EPP KSYVIDAAPGVKC+G Sbjct: 397 PRVAMDPFDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVIDAAPGVKCIGVQ 456 Query: 214 AAAWPGVSVVGNIMQQNHLWEFDIVKSRLGFAPSSC 107 AWPGVSV+GNI+QQ HLWEFD+ L F +SC 
Sbjct: 457 EGAWPGVSVIGNILQQEHLWEFDLNNRWLRFRQTSC 492 >ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium distachyon] Length = 479 Score = 315 bits (808), Expect = 2e-83 Identities = 182/393 (46%), Positives = 223/393 (56%), Gaps = 26/393 (6%) Frame = -2 Query: 1207 SAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXG----TXXXXXXX 1040 SAA G G Y V+ RVG+PAQ L+ADTGSDLTW + Sbjct: 86 SAAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPR 145 Query: 1039 XXXXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANE 860 +PC+S C L F CAYDYRY DGSA G E Sbjct: 146 RAFRPEKSKTWAPIPCASDTCSKSLP--FSLSTCPTPGSPCAYDYRYKDGSAARGTVGTE 203 Query: 859 TVTFGLSNG--------RKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAAD 704 + T LS+ +K ++ +++GC+ S G SF A+DGV+ LGYSN SFA AA Sbjct: 204 SATIALSSSSSSSKNKVKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSFASHAAS 263 Query: 703 KFGGKFSYCLVDHLSPNNLSSYLIFGSQPQHTR---------MRYTELVLGV-VNPFYAV 554 +FGG+FSYCLVDHLSP N +SYL FG + R T LVL + PFY V Sbjct: 264 RFGGRFSYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDV 323 Query: 553 AIKGISIGGAMLDIPPDTWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPLERVD-L 377 +IK IS+ G +L IP D W++DGGGG IVDSGTSLTVL PAY+ VVAAL L R + Sbjct: 324 SIKAISVDGELLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARFPRV 383 Query: 376 GIGPLEYCFNSSAGFNETA---VPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAA 206 + P EYC+N ++ + +P+L VHFA AR EPP KSYVIDAAPGVKC+G Sbjct: 384 AMDPFEYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPGVKCIGVQEGP 443 Query: 205 WPGVSVVGNIMQQNHLWEFDIVKSRLGFAPSSC 107 WPG+SV+GNI+QQ HLWEFD+ RL F S C Sbjct: 444 WPGISVIGNILQQEHLWEFDLKNRRLRFKRSRC 476