BLASTX nr result
ID: Chrysanthemum22_contig00048435
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00048435 (1564 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_023746036.1| aspartic proteinase nepenthesin-1 [Lactuca s...   632   0.0
ref|XP_021970655.1| aspartic proteinase nepenthesin-1 [Helianthu...   631   0.0
gb|KVI08227.1| hypothetical protein Ccrd_013402 [Cynara carduncu...   619   0.0
ref|XP_002522914.1| PREDICTED: aspartic proteinase nepenthesin-1...   490   e-167
gb|OMP01139.1| Peptidase A1 [Corchorus olitorius]                     489   e-167
ref|XP_021273783.1| aspartic proteinase nepenthesin-1 [Herrania ...   489   e-167
ref|XP_011046330.1| PREDICTED: aspartic proteinase nepenthesin-1...   484   e-165
ref|XP_007048585.2| PREDICTED: aspartic proteinase nepenthesin-1...   483   e-164
ref|XP_022740638.1| aspartic proteinase nepenthesin-1-like [Duri...   481   e-164
ref|XP_006370905.1| aspartyl protease family protein [Populus tr...   481   e-164
gb|EOX92742.1| Eukaryotic aspartyl protease family protein [Theo...   481   e-164
ref|XP_010025709.1| PREDICTED: aspartic proteinase nepenthesin-1...   481   e-163
ref|XP_022773823.1| aspartic proteinase nepenthesin-1-like [Duri...   478   e-162
ref|XP_021671882.1| aspartic proteinase nepenthesin-1, partial [...   479   e-162
ref|XP_004239638.1| PREDICTED: aspartic proteinase nepenthesin-1...   476   e-162
ref|XP_006345762.1| PREDICTED: aspartic proteinase nepenthesin-1...   475   e-161
gb|PPD91976.1| hypothetical protein GOBAR_DD11093 [Gossypium bar...   474   e-161
ref|XP_016698862.1| PREDICTED: aspartic proteinase nepenthesin-1...   473   e-160
gb|PIN24256.1| Aspartyl protease [Handroanthus impetiginosus]         473   e-160
ref|XP_019245654.1| PREDICTED: aspartic proteinase nepenthesin-1...   473   e-160

>ref|XP_023746036.1| aspartic proteinase nepenthesin-1 [Lactuca sativa]
 gb|PLY96403.1| hypothetical protein LSAT_2X37681 [Lactuca sativa]
 Length = 457

 Score = 632 bits (1630), Expect = 0.0
 Identities = 321/450 (71%), Positives = 349/450 (77%)
 Frame = +1

Query: 139 SLLFIVVYMLLICSPLTLSTSRRVLRDHFTNHPENGFRITLKHVDSNKNLTKFERLQRGV 318
           SLLF + + L+ TLSTSRRVL +H T+ E FR+ L HVDS KN+TKFERLQ GV
Sbjct: 10 SLLFTIFCLQLLFLSPTLSTSRRVLHNHVTDQNEAAFRVALTHVDSGKNMTKFERLQLGV 69

Query: 319 MRGNLRLERIINNMMAXXXXXXXXXXXXXXXHAGNGEFLMNLAIGTPPEPYSAIMDTGSD 498
           RGNLRLER+INNMMA HAGNGEFLMNLAIGTPPE YSAIMDTGSD
Sbjct: 70 KRGNLRLERLINNMMASLSVESSSQVTSPV-HAGNGEFLMNLAIGTPPETYSAIMDTGSD 128

Query: 499 LIWTQCKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKALPTSDCGSDGCEYLYSYGDY 678
           LIWTQCKPCTKCFDAPTPVFDP K LCKALPTSDCGSDGCEYLYSYGDY
Sbjct: 129 LIWTQCKPCTKCFDAPTPVFDPKKSSSFSKVSCSNSLCKALPTSDCGSDGCEYLYSYGDY 188

Query: 679 SSTQGVLATETFTFDKVSVPKVGFGCGEDNEXXXXXXXXXXXXXXXXXXXXVSQLKQSKF 858
           SSTQG+LATETF FDKVSVP VGFGCGEDNE VSQLK+S F
Sbjct: 189 SSTQGILATETFMFDKVSVPSVGFGCGEDNEGNGFNQGAGLVGLGRGPLSLVSQLKKSTF 248

Query: 859 SYCLTSMNDDTSSSENPTSTLIMGSLATEISNDTTFTTPLLKNPSQPSFYYLSLTGISVG 1038
           SYCLTS+NDDTSS NP+STL+MGSL ++I+ND+ FTTPL+KNPSQPSFYYLSL GISVG
Sbjct: 249 SYCLTSINDDTSSG-NPSSTLVMGSLESQIANDSVFTTPLIKNPSQPSFYYLSLQGISVG 307

Query: 1039 NVDLPIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNMLKKEFVSQTKLNVDNSGASGL 1218
           NVDLPI+K+TFAIN DGTGG+IIDSGTTITYLEESAFNM+KKEFVSQT LNVDNSG++GL
Sbjct: 308 NVDLPIKKSTFAINSDGTGGVIIDSGTTITYLEESAFNMVKKEFVSQTNLNVDNSGSTGL 367

Query: 1219 DLCFELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMIGDAKNGVACLAMXXXXXXXX 1398
           DLCFELP DDGSGE +IEIPKLV HFDGASLDLPGENYMIGD GVACLAM
Sbjct: 368 DLCFELPSDDGSGEMSIEIPKLVFHFDGASLDLPGENYMIGDTNAGVACLAMGGSSGISI 427

Query: 1399 XXXXXXXXMMVVHDLDKETLSFVPTNCDKL 1488
           MMVVHDL+KETLSF+PT CD+L
Sbjct: 428 FGNIQQQNMMVVHDLEKETLSFIPTKCDQL 457 >ref|XP_021970655.1| aspartic proteinase nepenthesin-1 [Helianthus annuus] gb|OTG23273.1| putative eukaryotic aspartyl protease family protein [Helianthus annuus] Length = 453 Score = 631 bits (1628), Expect = 0.0 Identities = 320/456 (70%), Positives = 355/456 (77%) Frame = +1 Query: 121 MASSLHSLLFIVVYMLLICSPLTLSTSRRVLRDHFTNHPENGFRITLKHVDSNKNLTKFE 300 MASSL SL I+ +++ P + STSRRVL H T E GFR+ LKHVDS KNLTKFE Sbjct: 1 MASSLISLSTIL-FIIFSLQP-SFSTSRRVLDVHATRQHETGFRVALKHVDSGKNLTKFE 58 Query: 301 RLQRGVMRGNLRLERIINNMMAXXXXXXXXXXXXXXXHAGNGEFLMNLAIGTPPEPYSAI 480 RLQRGV RGN RL+R++ NMMA HAGNGEFLMNLAIGTPPEPY+AI Sbjct: 59 RLQRGVKRGNHRLQRLMENMMASLSVDSSTTQVTSPVHAGNGEFLMNLAIGTPPEPYAAI 118 Query: 481 MDTGSDLIWTQCKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKALPTSDCGSDGCEYL 660 MDTGSDLIWTQCKPCTKCFDAPTPVFDP K LCKALPTSDCGSDGCEYL Sbjct: 119 MDTGSDLIWTQCKPCTKCFDAPTPVFDPKKSSSFSKIPCSSSLCKALPTSDCGSDGCEYL 178 Query: 661 YSYGDYSSTQGVLATETFTFDKVSVPKVGFGCGEDNEXXXXXXXXXXXXXXXXXXXXVSQ 840 YSYGDYSST+G+LATETFTFD VSVP+VGFGCG+DNE VSQ Sbjct: 179 YSYGDYSSTEGILATETFTFDTVSVPEVGFGCGQDNEGSGFNQGGGLVGLGRGPLSLVSQ 238 Query: 841 LKQSKFSYCLTSMNDDTSSSENPTSTLIMGSLATEISNDTTFTTPLLKNPSQPSFYYLSL 1020 LKQSKFSYCLTS++D+T SS+NPTSTL+MGSLATE SN + +TTPL+KNPS PSFYYLSL Sbjct: 239 LKQSKFSYCLTSISDET-SSQNPTSTLVMGSLATETSNSSVYTTPLIKNPSHPSFYYLSL 297 Query: 1021 TGISVGNVDLPIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNMLKKEFVSQTKLNVDN 1200 GISVGNVDLPIEK+TFA++ DGTGGMIIDSGTTITYL+ESAF ++KKEFVSQ KLNVDN Sbjct: 298 VGISVGNVDLPIEKSTFAVDSDGTGGMIIDSGTTITYLQESAFELVKKEFVSQVKLNVDN 357 Query: 1201 SGASGLDLCFELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMIGDAKNGVACLAMXX 1380 S ++GLDLCFELP+DDGSGETTIEIPKLV HFDGASLDLPGENYMIGD K+G+ CLAM Sbjct: 358 SDSTGLDLCFELPEDDGSGETTIEIPKLVFHFDGASLDLPGENYMIGDVKSGLVCLAMGS 417 Query: 1381 XXXXXXXXXXXXXXMMVVHDLDKETLSFVPTNCDKL 1488 MMVVHDL+KETLSFVPT CD+L Sbjct: 418 SSGISIFGNIQQQNMMVVHDLEKETLSFVPTKCDQL 453 >gb|KVI08227.1| hypothetical protein Ccrd_013402 [Cynara cardunculus var. 
scolymus] Length = 456 Score = 619 bits (1596), Expect = 0.0 Identities = 314/457 (68%), Positives = 350/457 (76%), Gaps = 1/457 (0%) Frame = +1 Query: 121 MASSLHSLLFIVVYMLLICSPLTLSTSRRVLRDHFTNHPEN-GFRITLKHVDSNKNLTKF 297 MASSL SLLF + + L+ TLSTSR +L DH T+HP+ F + LKHVDS KNLTKF Sbjct: 1 MASSLFSLLFFIFSLQLLSLSPTLSTSRHLLHDHPTHHPQPPSFTVPLKHVDSGKNLTKF 60 Query: 298 ERLQRGVMRGNLRLERIINNMMAXXXXXXXXXXXXXXXHAGNGEFLMNLAIGTPPEPYSA 477 +RLQRGVMRGNLRLER+INNMMA HAGNGEFLMNLAIGTPPE YSA Sbjct: 61 QRLQRGVMRGNLRLERLINNMMASLSVDSSSKVTSPV-HAGNGEFLMNLAIGTPPETYSA 119 Query: 478 IMDTGSDLIWTQCKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKALPTSDCGSDGCEY 657 IMDTGSDLIWTQCKPCTKCFDAPTPVFDP K LCKALPTS+C +DGCEY Sbjct: 120 IMDTGSDLIWTQCKPCTKCFDAPTPVFDPEKSSSFSKVSCTNSLCKALPTSECAADGCEY 179 Query: 658 LYSYGDYSSTQGVLATETFTFDKVSVPKVGFGCGEDNEXXXXXXXXXXXXXXXXXXXXVS 837 LYSYGDYSS+QG+LA ETFTFDKV+VP++GFGCGEDNE VS Sbjct: 180 LYSYGDYSSSQGILAKETFTFDKVTVPELGFGCGEDNEGSGFDQGGGLVGLGRGPLSLVS 239 Query: 838 QLKQSKFSYCLTSMNDDTSSSENPTSTLIMGSLATEISNDTTFTTPLLKNPSQPSFYYLS 1017 QLK+SKFSYCLTS++DD SS NPTSTL+MGSL ++ISND+ TTPLLKNPSQPSFYYLS Sbjct: 240 QLKKSKFSYCLTSIDDDDPSSGNPTSTLVMGSLVSQISNDSVLTTPLLKNPSQPSFYYLS 299 Query: 1018 LTGISVGNVDLPIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNMLKKEFVSQTKLNVD 1197 L GISVG VDLPI+K+TFAIN DG+GG+IIDSGTTITYLEESAF ++KKEFVSQTKL VD Sbjct: 300 LQGISVGKVDLPIKKSTFAINADGSGGLIIDSGTTITYLEESAFELVKKEFVSQTKLKVD 359 Query: 1198 NSGASGLDLCFELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMIGDAKNGVACLAMX 1377 NS ++GLDLCFELP++DGSGE IEIPKLV HF+ A LDLPGENYMIGD GV CLAM Sbjct: 360 NSDSTGLDLCFELPENDGSGEMKIEIPKLVFHFEDAKLDLPGENYMIGDLNAGVVCLAMG 419 Query: 1378 XXXXXXXXXXXXXXXMMVVHDLDKETLSFVPTNCDKL 1488 MMVVHDLDKETLSF+PT CD+L Sbjct: 420 SSSGISIFGNIQQQNMMVVHDLDKETLSFIPTKCDQL 456 >ref|XP_002522914.1| PREDICTED: aspartic proteinase nepenthesin-1 [Ricinus communis] gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis] Length = 442 Score = 490 bits (1261), Expect = e-167 Identities = 264/458 (57%), Positives = 311/458 (67%), Gaps = 
4/458 (0%) Frame = +1 Query: 127 SSLHSLLFIVVYMLLICSPLT---LSTSRRVLRDHFTNHPENGFRITLKHVDSNKNLTKF 297 +S SL ++V+ LLI S STSRR L + +NGFRITLKHVDS+KNLTKF Sbjct: 2 ASTFSLSWVVLLSLLILSLSVYPAFSTSRRALS--YPAQLKNGFRITLKHVDSDKNLTKF 59 Query: 298 ERLQRGVMRGNLRLERIINNMMAXXXXXXXXXXXXXXXHAGNGEFLMNLAIGTPPEPYSA 477 +R+Q G+ R N RLER+ ++A +GNGEFLMNLAIGTPPE YSA Sbjct: 60 QRIQHGIKRANHRLERLNAMVLAASSNAEINSPVL----SGNGEFLMNLAIGTPPETYSA 115 Query: 478 IMDTGSDLIWTQCKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKALPTSDCGSDGCEY 657 IMDTGSDLIWTQCKPCT+CFD P+P+FDP K LCKALP S C SD CEY Sbjct: 116 IMDTGSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQSSC-SDSCEY 174 Query: 658 LYSYGDYSSTQGVLATETFTFDKVSVPKVGFGCGEDNEXXXXXXXXXXXXXXXXXXXXVS 837 LY+YGDYSSTQG +ATETFTF KVS+P VGFGCGEDNE VS Sbjct: 175 LYTYGDYSSTQGTMATETFTFGKVSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSLVS 234 Query: 838 QLKQSKFSYCLTSMNDDTSSSENPTSTLIMGSLATEISNDTTF-TTPLLKNPSQPSFYYL 1014 QLK++KFSYCLTS++D TSTL+MGSLA+ TTPL++NP QPSFYYL Sbjct: 235 QLKEAKFSYCLTSIDD------TKTSTLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYL 288 Query: 1015 SLTGISVGNVDLPIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNMLKKEFVSQTKLNV 1194 SL GISVG LPI+++TF + DDGTGG+IIDSGTTITYLEESAF+++KKEF SQ L V Sbjct: 289 SLEGISVGGTRLPIKESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPV 348 Query: 1195 DNSGASGLDLCFELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMIGDAKNGVACLAM 1374 DNSGA+GL+LC+ LP D + +E+PKLVLHF GA L+LPGENYMI D+ GV CLAM Sbjct: 349 DNSGATGLELCYNLPSD----TSELEVPKLVLHFTGADLELPGENYMIADSSMGVICLAM 404 Query: 1375 XXXXXXXXXXXXXXXXMMVVHDLDKETLSFVPTNCDKL 1488 M V HDL+KETLSF+PTNC +L Sbjct: 405 GSSGGMSIFGNVQQQNMFVSHDLEKETLSFLPTNCGQL 442 >gb|OMP01139.1| Peptidase A1 [Corchorus olitorius] Length = 446 Score = 489 bits (1260), Expect = e-167 Identities = 259/461 (56%), Positives = 321/461 (69%), Gaps = 5/461 (1%) Frame = +1 Query: 121 MASSLHSLLFIVVYMLLICSPLT--LSTSRRVLRDHFTNHPENGFRITLKHVDSNKNLTK 294 MASSL+SLL +++ L + ++ +STSR+ L + ++GFR+TLKHVDS KNLTK Sbjct: 1 MASSLYSLLCVIILTLAVALHVSPAVSTSRKALVGK-SKKIQDGFRVTLKHVDSGKNLTK 59 Query: 295 
FERLQRGVMRGNLRLERIINNMMAXXXXXXXXXXXXXXXHAGNGEFLMNLAIGTPPEPYS 474 +ER+QRG+ RGN RL+R+ ++A AGNGEFLM+LAIGTPPE YS Sbjct: 60 WERIQRGLKRGNHRLQRLNALVLAATDSAGVFESPVT---AGNGEFLMDLAIGTPPESYS 116 Query: 475 AIMDTGSDLIWTQCKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKALPTSDCGSDGCE 654 AI+DTGSDLIWTQCKPC++CF+ PTP+FDP K LC ALP S C +DGCE Sbjct: 117 AIVDTGSDLIWTQCKPCSQCFNQPTPIFDPKKSSTFSKISCSSDLCSALPQSTC-NDGCE 175 Query: 655 YLYSYGDYSSTQGVLATETFTFD-KVSVPKVGFGCGEDNEXXXXXXXXXXXXXXXXXXXX 831 YLY+YGDYSSTQG+LATETFTFD K+SVPK+GFGCG+DNE Sbjct: 176 YLYTYGDYSSTQGLLATETFTFDDKISVPKIGFGCGDDNEGDGFSQGAGLVGLGRGPLSL 235 Query: 832 VSQLKQSKFSYCLTSMNDDTSSSENPTSTLIMGSLAT--EISNDTTFTTPLLKNPSQPSF 1005 VSQLK+ KF+YCLTS++D S L+MGS+A+ +D TTPL++NP QPSF Sbjct: 236 VSQLKEPKFAYCLTSIDDTQKGS------LLMGSMASVNNTFSDEIKTTPLIQNPLQPSF 289 Query: 1006 YYLSLTGISVGNVDLPIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNMLKKEFVSQTK 1185 YYLSL GISVG +LPI+K+TFA+ DDGTGG+IIDSGTTITYLE+SAF+++KKEF+SQ K Sbjct: 290 YYLSLQGISVGATNLPIKKSTFALQDDGTGGVIIDSGTTITYLEQSAFDLVKKEFISQMK 349 Query: 1186 LNVDNSGASGLDLCFELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMIGDAKNGVAC 1365 L+VD SG++GLDLCF LP D + +P L HFDGA LDLPGENYMIGD+ +G+ C Sbjct: 350 LSVDTSGSTGLDLCFTLPSD----AADVSVPTLTFHFDGADLDLPGENYMIGDSSSGLLC 405 Query: 1366 LAMXXXXXXXXXXXXXXXXMMVVHDLDKETLSFVPTNCDKL 1488 LAM M+V+HDL KETLSFV T CDKL Sbjct: 406 LAMGGSSGMSIFGNVQQQNMLVIHDLQKETLSFVHTQCDKL 446 >ref|XP_021273783.1| aspartic proteinase nepenthesin-1 [Herrania umbratica] Length = 441 Score = 489 bits (1258), Expect = e-167 Identities = 264/461 (57%), Positives = 320/461 (69%), Gaps = 7/461 (1%) Frame = +1 Query: 127 SSLHSLLFI----VVYMLLICSPLTLSTSRRVLRDHFTNHP--ENGFRITLKHVDSNKNL 288 +SL+SLL + VV + L SP +STSRR L HP +NGFR+TL+HVDS KNL Sbjct: 2 ASLYSLLCVAFLTVVIVALYVSP-AVSTSRRALE-----HPRLQNGFRVTLRHVDSGKNL 55 Query: 289 TKFERLQRGVMRGNLRLERIINNMMAXXXXXXXXXXXXXXXHAGNGEFLMNLAIGTPPEP 468 TK+ER+QRG+ RGN RL+R+ ++A AGNGEFLM+L+IGTPPE Sbjct: 56 TKWERIQRGLKRGNHRLQRLNGMVLAATDASELQAPIT----AGNGEFLMDLSIGTPPES 111 Query: 469 
YSAIMDTGSDLIWTQCKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKALPTSDCGSDG 648 YSAI+DTGSDLIWTQCKPC++CFD PTP+FDP K LC ALP S C SDG Sbjct: 112 YSAILDTGSDLIWTQCKPCSQCFDQPTPIFDPKKSSSFSKLSCSSDLCSALPQSAC-SDG 170 Query: 649 CEYLYSYGDYSSTQGVLATETFTFDKVSVPKVGFGCGEDNEXXXXXXXXXXXXXXXXXXX 828 CEYLYSYGDYSSTQGV+A ETFTF KVSVPK+GFGCG DN+ Sbjct: 171 CEYLYSYGDYSSTQGVMAVETFTFGKVSVPKIGFGCGGDNQGDGFTQGAGLVGLGRGPVS 230 Query: 829 XVSQLKQSKFSYCLTSMNDDTSSSENPTSTLIMGSLATEISN-DTTFTTPLLKNPSQPSF 1005 VSQLKQ KFSYCLTS+ DDT S TL+MGSLA+ S TTPL+ NP+QPSF Sbjct: 231 LVSQLKQGKFSYCLTSI-DDTKKS-----TLLMGSLASVNSTLGAIKTTPLIHNPTQPSF 284 Query: 1006 YYLSLTGISVGNVDLPIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNMLKKEFVSQTK 1185 YYLSL GI+VG+ LPI+K+TFA+ DDGTGG+IIDSGTTITYLEESAF+++KKEF+S K Sbjct: 285 YYLSLQGITVGDTRLPIKKSTFALEDDGTGGLIIDSGTTITYLEESAFDVVKKEFISHMK 344 Query: 1186 LNVDNSGASGLDLCFELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMIGDAKNGVAC 1365 L+ D SG++GL+LCF LP SG T +++P+L+ HF+GA LDLPGENYMI D+ +G+ C Sbjct: 345 LSEDTSGSTGLELCFTLP----SGSTDVDVPRLIFHFEGADLDLPGENYMIADSSSGLLC 400 Query: 1366 LAMXXXXXXXXXXXXXXXXMMVVHDLDKETLSFVPTNCDKL 1488 LAM ++V+HDL+KETLSF T CDKL Sbjct: 401 LAMGSSSGMSIFGNVQQQNILVLHDLEKETLSFQHTQCDKL 441 >ref|XP_011046330.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Populus euphratica] Length = 439 Score = 484 bits (1246), Expect = e-165 Identities = 262/457 (57%), Positives = 311/457 (68%), Gaps = 3/457 (0%) Frame = +1 Query: 127 SSLHSLLFIVVYMLLICSPLTLSTSRRVLRDHFTNHP--ENGFRITLKHVDSNKNLTKFE 300 SSL ++ + ++ L+ STSRRVL HP +NGFR+ LKHVDS KNLTKFE Sbjct: 5 SSLSFVVALAIFALVFSH--AFSTSRRVLE-----HPKAQNGFRVKLKHVDSGKNLTKFE 57 Query: 301 RLQRGVMRGNLRLERIINNMMAXXXXXXXXXXXXXXXHAGNGEFLMNLAIGTPPEPYSAI 480 R+Q GV RG RL+R A GNGEFLMNLAIGTPP YSAI Sbjct: 58 RIQHGVKRGRHRLQRF----KAMALVASSNSEIDAPVLPGNGEFLMNLAIGTPPATYSAI 113 Query: 481 MDTGSDLIWTQCKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKALPTSDCGSDGCEYL 660 MDTGSDLIWTQCKPCT+CFD PTP+FDP K LC+ALP S C SDGCEYL Sbjct: 114 MDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSKLSCSSKLCEALPQSTC-SDGCEYL 172 Query: 661 
YSYGDYSSTQGVLATETFTFDKVSVPKVGFGCGEDNEXXXXXXXXXXXXXXXXXXXXVSQ 840 Y YGDYSSTQG+LA+ET TF KVSVPKV FGCGEDNE VSQ Sbjct: 173 YGYGDYSSTQGILASETLTFGKVSVPKVAFGCGEDNEGSGFSQGSGLVGLGRGPLSLVSQ 232 Query: 841 LKQSKFSYCLTSMNDDTSSSENPTSTLIMGSLATEISNDTTF-TTPLLKNPSQPSFYYLS 1017 LK+ KFSYCLTS+ DDT +S TL+MGSLA+ ++D+ +TPL++N +QPSFYYLS Sbjct: 233 LKEPKFSYCLTSV-DDTKAS-----TLLMGSLASVKASDSEIKSTPLIQNSAQPSFYYLS 286 Query: 1018 LTGISVGNVDLPIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNMLKKEFVSQTKLNVD 1197 L GISVG+ LPI+K+TF++ +DG+GG+IIDSGTTITYLE+SAF+++ KEF SQ L VD Sbjct: 287 LEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYLEQSAFDLVSKEFTSQMNLPVD 346 Query: 1198 NSGASGLDLCFELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMIGDAKNGVACLAMX 1377 NSGA+GL++CF LP SG T IE+PKLV HFDGA L+LP ENYMI DA GVACLAM Sbjct: 347 NSGATGLEVCFTLP----SGSTDIEVPKLVFHFDGADLELPAENYMIADASMGVACLAMG 402 Query: 1378 XXXXXXXXXXXXXXXMMVVHDLDKETLSFVPTNCDKL 1488 M+V+HDL+KETLSF+P CD+L Sbjct: 403 SSSGMSIFGNIQQQNMLVLHDLEKETLSFLPAQCDEL 439 >ref|XP_007048585.2| PREDICTED: aspartic proteinase nepenthesin-1 [Theobroma cacao] Length = 441 Score = 483 bits (1242), Expect = e-164 Identities = 258/458 (56%), Positives = 315/458 (68%), Gaps = 4/458 (0%) Frame = +1 Query: 127 SSLHSLL---FIVVYMLLICSPLTLSTSRRVLRDHFTNHPENGFRITLKHVDSNKNLTKF 297 +SL+SLL F+ + ++ + +STSR L +NGFR+TL+HVDS KNLTK+ Sbjct: 2 ASLYSLLCVAFLTLEIVALSVSPAVSTSRGALEHR---RLQNGFRVTLRHVDSGKNLTKW 58 Query: 298 ERLQRGVMRGNLRLERIINNMMAXXXXXXXXXXXXXXXHAGNGEFLMNLAIGTPPEPYSA 477 ER+QRGV RGN RL+R+ ++A AGNGEFLM+LAIGTPPE YSA Sbjct: 59 ERIQRGVKRGNHRLQRLNAMVLAATDASELQAPIT----AGNGEFLMDLAIGTPPESYSA 114 Query: 478 IMDTGSDLIWTQCKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKALPTSDCGSDGCEY 657 I+DTGSDLIWTQCKPC++CFD PTP+FDP K LC ALP S C SDGCEY Sbjct: 115 ILDTGSDLIWTQCKPCSQCFDQPTPIFDPKKSSSFSKLSCSSHLCSALPQSAC-SDGCEY 173 Query: 658 LYSYGDYSSTQGVLATETFTFDKVSVPKVGFGCGEDNEXXXXXXXXXXXXXXXXXXXXVS 837 LY+YGDYSSTQGV+A ETFTF K+SVP +GFGCG DN+ VS Sbjct: 174 LYTYGDYSSTQGVMAVETFTFGKLSVPNIGFGCGGDNQGDGFTQGAGLVGLGRGPVSLVS 233 Query: 838 
QLKQSKFSYCLTSMNDDTSSSENPTSTLIMGSLAT-EISNDTTFTTPLLKNPSQPSFYYL 1014 QLKQ KFSYCLTS+ DDT S TL+MGS+A+ + TTPL+ NP+QPSFYYL Sbjct: 234 QLKQGKFSYCLTSI-DDTKKS-----TLLMGSIASVNRTLGAIKTTPLIHNPTQPSFYYL 287 Query: 1015 SLTGISVGNVDLPIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNMLKKEFVSQTKLNV 1194 SL GI+VG+ LPI+K+TFA+ DDGTGG+IIDSGTTITYLEE AF+M+KKEF+SQ KL+V Sbjct: 288 SLKGITVGDTRLPIKKSTFALEDDGTGGVIIDSGTTITYLEERAFDMVKKEFISQMKLSV 347 Query: 1195 DNSGASGLDLCFELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMIGDAKNGVACLAM 1374 D SG++GL+LCF LP SG T +E+PKL+ HF+GA LDLPGENYMI D+ +G+ CLAM Sbjct: 348 DTSGSTGLELCFTLP----SGSTDVEVPKLIFHFEGADLDLPGENYMIADSSSGLLCLAM 403 Query: 1375 XXXXXXXXXXXXXXXXMMVVHDLDKETLSFVPTNCDKL 1488 M+V+HDL+K TLSF T CDKL Sbjct: 404 GSSSGMSIFGNVQQQNMLVLHDLEKATLSFQHTQCDKL 441 >ref|XP_022740638.1| aspartic proteinase nepenthesin-1-like [Durio zibethinus] Length = 443 Score = 481 bits (1239), Expect = e-164 Identities = 256/458 (55%), Positives = 313/458 (68%), Gaps = 4/458 (0%) Frame = +1 Query: 127 SSLHSLL---FIVVYMLLICSPLTLSTSRRVLRDHFTNHPENGFRITLKHVDSNKNLTKF 297 +SL+SL F+ + ++ + +STSRR L+ N +NGF++TLKHVDS KNLTK+ Sbjct: 2 ASLYSLCCVSFLALAIVALYVSPAVSTSRRALKQR--NKLQNGFKVTLKHVDSGKNLTKW 59 Query: 298 ERLQRGVMRGNLRLERIINNMMAXXXXXXXXXXXXXXXHAGNGEFLMNLAIGTPPEPYSA 477 ER+QRG+ RG+ RL+R+ ++A AGNGEFLM L+IGTPPE YSA Sbjct: 60 ERIQRGIQRGHHRLQRLNAIVLAASTNSAEVQAPVV---AGNGEFLMELSIGTPPESYSA 116 Query: 478 IMDTGSDLIWTQCKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKALPTSDCGSDGCEY 657 I+DTGSDLIWTQCKPC++CF TP+FDP K LC ALP S C DGCEY Sbjct: 117 IVDTGSDLIWTQCKPCSQCFTQSTPIFDPKKSSTFSKLSCSSDLCAALPQSTC-DDGCEY 175 Query: 658 LYSYGDYSSTQGVLATETFTFDKVSVPKVGFGCGEDNEXXXXXXXXXXXXXXXXXXXXVS 837 LYSYGDYSSTQGV+ TETFTF KVSVP +GFGCGEDNE VS Sbjct: 176 LYSYGDYSSTQGVMGTETFTFGKVSVPNIGFGCGEDNEGDGFSQGAGLVGLGRGPLSLVS 235 Query: 838 QLKQSKFSYCLTSMNDDTSSSENPTSTLIMGSLATEISNDTTF-TTPLLKNPSQPSFYYL 1014 QLK+ KFSYCLTS++ E TSTL+MGS+A+ S TTPL++NPSQPSFYYL Sbjct: 236 QLKEPKFSYCLTSID------ETQTSTLLMGSIASVNSTIGAIKTTPLIRNPSQPSFYYL 289 Query: 1015 
SLTGISVGNVDLPIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNMLKKEFVSQTKLNV 1194 SL GISVG LP++K+TFA+ DDGTGG+IIDSGTTITYLE+SAF+ +KKEF+SQ KL+V Sbjct: 290 SLEGISVGGTRLPVKKSTFALEDDGTGGLIIDSGTTITYLEQSAFDEVKKEFISQMKLSV 349 Query: 1195 DNSGASGLDLCFELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMIGDAKNGVACLAM 1374 DN+G++GL+LCF LP SG T + +PKLV HFDGA LDLPGENYMI D+ +GV CLAM Sbjct: 350 DNTGSTGLELCFSLP----SGSTQVNVPKLVFHFDGADLDLPGENYMIADSTSGVICLAM 405 Query: 1375 XXXXXXXXXXXXXXXXMMVVHDLDKETLSFVPTNCDKL 1488 M V+++L KETLSF+ T C KL Sbjct: 406 GSSSGMSIFGNVQQQNMFVLYNLKKETLSFLQTQCHKL 443 >ref|XP_006370905.1| aspartyl protease family protein [Populus trichocarpa] gb|PNS89732.1| hypothetical protein POPTR_019G002100v3 [Populus trichocarpa] Length = 439 Score = 481 bits (1237), Expect = e-164 Identities = 261/457 (57%), Positives = 310/457 (67%), Gaps = 3/457 (0%) Frame = +1 Query: 127 SSLHSLLFIVVYMLLICSPLTLSTSRRVLRDHFTNHP--ENGFRITLKHVDSNKNLTKFE 300 SSL ++ + ++ + STSRRVL HP +NGFR LKHVDS KNLTKFE Sbjct: 5 SSLSLVVALAIFAFVFSH--AFSTSRRVLE-----HPKVQNGFRAKLKHVDSGKNLTKFE 57 Query: 301 RLQRGVMRGNLRLERIINNMMAXXXXXXXXXXXXXXXHAGNGEFLMNLAIGTPPEPYSAI 480 R+Q GV RG RL+R A GNGEFLM LAIGTPPE YSAI Sbjct: 58 RIQHGVKRGRHRLQRF----KAMALVASSNSEIDAPVLPGNGEFLMKLAIGTPPETYSAI 113 Query: 481 MDTGSDLIWTQCKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKALPTSDCGSDGCEYL 660 MDTGSDLIWTQCKPCT+CFD PTP+FDP K LC+ALP S C SDGCEYL Sbjct: 114 MDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSKLSCSSKLCEALPQSTC-SDGCEYL 172 Query: 661 YSYGDYSSTQGVLATETFTFDKVSVPKVGFGCGEDNEXXXXXXXXXXXXXXXXXXXXVSQ 840 Y YGDYSSTQG+LA+ET TF KVSVP+V FGCGEDNE VSQ Sbjct: 173 YGYGDYSSTQGMLASETLTFGKVSVPEVAFGCGEDNEGSGFSQGSGLVGLGRGPLSLVSQ 232 Query: 841 LKQSKFSYCLTSMNDDTSSSENPTSTLIMGSLATEISNDTTF-TTPLLKNPSQPSFYYLS 1017 LK+ KFSYCLTS+ DDT +S TL+MGSLA+ ++D+ TTPL++N +QPSFYYLS Sbjct: 233 LKEPKFSYCLTSV-DDTKAS-----TLLMGSLASVKASDSEIKTTPLIQNSAQPSFYYLS 286 Query: 1018 LTGISVGNVDLPIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNMLKKEFVSQTKLNVD 1197 L GISVG+ LPI+K+TF++ +DG+GG+IIDSGTTITYLE+SAF+++ KEF SQ L VD Sbjct: 287 
LEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFTSQINLPVD 346 Query: 1198 NSGASGLDLCFELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMIGDAKNGVACLAMX 1377 NSG++GL++CF LP SG T IE+PKLV HFDGA L+LP ENYMI DA GVACLAM Sbjct: 347 NSGSTGLEVCFTLP----SGSTDIEVPKLVFHFDGADLELPAENYMIADASMGVACLAMG 402 Query: 1378 XXXXXXXXXXXXXXXMMVVHDLDKETLSFVPTNCDKL 1488 M+V+HDL+KETLSF+PT CD+L Sbjct: 403 SSSGMSIFGNIQQQNMLVLHDLEKETLSFLPTQCDEL 439 >gb|EOX92742.1| Eukaryotic aspartyl protease family protein [Theobroma cacao] Length = 441 Score = 481 bits (1237), Expect = e-164 Identities = 260/459 (56%), Positives = 313/459 (68%), Gaps = 5/459 (1%) Frame = +1 Query: 127 SSLHSLLFIVVYML----LICSPLTLSTSRRVLRDHFTNHPENGFRITLKHVDSNKNLTK 294 +SL+SLL + L L SP +STSR L +NGFR+TL+HVDS KNLTK Sbjct: 2 ASLYSLLCVAFLTLEIVALYVSP-AVSTSRGALEHR---RLQNGFRVTLRHVDSGKNLTK 57 Query: 295 FERLQRGVMRGNLRLERIINNMMAXXXXXXXXXXXXXXXHAGNGEFLMNLAIGTPPEPYS 474 +ER+QRGV RGN RL+R+ ++A AGNGEFLM+LAIGTPPE YS Sbjct: 58 WERIQRGVKRGNHRLQRLNAMVLAATDASELQAPIT----AGNGEFLMDLAIGTPPESYS 113 Query: 475 AIMDTGSDLIWTQCKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKALPTSDCGSDGCE 654 AI+DTGSDLIWTQCKPC++CFD PTP+FDP K LC ALP S C SDGCE Sbjct: 114 AILDTGSDLIWTQCKPCSQCFDQPTPIFDPKKSSSFSKLSCSSHLCSALPQSAC-SDGCE 172 Query: 655 YLYSYGDYSSTQGVLATETFTFDKVSVPKVGFGCGEDNEXXXXXXXXXXXXXXXXXXXXV 834 YLY+YGDYSSTQGV+A ETFTF KVSVP +GFGCG DN+ V Sbjct: 173 YLYTYGDYSSTQGVMAVETFTFGKVSVPNIGFGCGGDNQGDGFTQGAGLVGLGRGPVSLV 232 Query: 835 SQLKQSKFSYCLTSMNDDTSSSENPTSTLIMGSLAT-EISNDTTFTTPLLKNPSQPSFYY 1011 SQLKQ KFSYCLTS+ DDT S TL+MGS+A+ + TTPL+ NP+QPSFYY Sbjct: 233 SQLKQGKFSYCLTSI-DDTKKS-----TLLMGSIASVNRTLGAIKTTPLIHNPTQPSFYY 286 Query: 1012 LSLTGISVGNVDLPIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNMLKKEFVSQTKLN 1191 LSL GI+VG+ LPI+K+TFA+ DDGTGG+IIDSGTTITYLEE AF+++KKEF+SQ KL+ Sbjct: 287 LSLKGITVGDTRLPIKKSTFALEDDGTGGVIIDSGTTITYLEERAFDLVKKEFISQMKLS 346 Query: 1192 VDNSGASGLDLCFELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMIGDAKNGVACLA 1371 VD SG++GL+LCF LP SG T +E+PK + HF+GA LDLPGENYMI D+ +G+ CLA Sbjct: 347 
VDTSGSTGLELCFTLP----SGSTDVEVPKFIFHFEGADLDLPGENYMIADSSSGLLCLA 402 Query: 1372 MXXXXXXXXXXXXXXXXMMVVHDLDKETLSFVPTNCDKL 1488 M M+V+HDL+K TLSF T CDKL Sbjct: 403 MGSSSGMSIFGNVQQQNMLVLHDLEKATLSFQHTQCDKL 441 >ref|XP_010025709.1| PREDICTED: aspartic proteinase nepenthesin-1 [Eucalyptus grandis] gb|KCW62430.1| hypothetical protein EUGRSUZ_H05072 [Eucalyptus grandis] Length = 445 Score = 481 bits (1237), Expect = e-163 Identities = 261/460 (56%), Positives = 313/460 (68%), Gaps = 6/460 (1%) Frame = +1 Query: 127 SSLHSLLFIVVYMLLICSPL---TLSTSRRVLRDHFTNHPENGFRITLKHVDSNKNLTKF 297 +SL+SLL V +L I + L + STSRR L + GFR+TLKHVD KN TKF Sbjct: 2 ASLNSLLSSVFLVLAIFASLASPSFSTSRRALGSKEAK--QIGFRVTLKHVDHGKNFTKF 59 Query: 298 ERLQRGVMRGNLRLERIINNMMAXXXXXXXXXXXXXXXHAGNGEFLMNLAIGTPPEPYSA 477 ERLQR + RG RL+R+ ++A HAGNGEFLM L+IGTP + +SA Sbjct: 60 ERLQRAMKRGKSRLQRLNAMVLAAGDSTELASPI----HAGNGEFLMQLSIGTPADSFSA 115 Query: 478 IMDTGSDLIWTQCKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKALPTSDCGSDGCEY 657 I+DTGSDLIWTQCKPCT+CFD TP+FDP K LC+ALPTS CG+DGCEY Sbjct: 116 IVDTGSDLIWTQCKPCTQCFDQSTPIFDPKKSSTFSKLGCSSQLCEALPTSSCGTDGCEY 175 Query: 658 LYSYGDYSSTQGVLATETFTF-DKVSVPKVGFGCGEDNEXXXXXXXXXXXXXXXXXXXXV 834 LY+YGDYSSTQG+LA +TFTF D VSVPKVGFGCGEDNE V Sbjct: 176 LYTYGDYSSTQGILAYDTFTFADSVSVPKVGFGCGEDNEGSGFDQGAGLVGLGRGPLSLV 235 Query: 835 SQLKQSKFSYCLTSMNDDTSSSENPTSTLIMGSLAT--EISNDTTFTTPLLKNPSQPSFY 1008 SQL KFSYCLTS+ DDT+ TS L++GS AT +S TTPL+KNP QPSFY Sbjct: 236 SQLGVPKFSYCLTSI-DDTA-----TSKLLLGSEATSGNLSTKAMKTTPLIKNPLQPSFY 289 Query: 1009 YLSLTGISVGNVDLPIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNMLKKEFVSQTKL 1188 YLSL GISVG+ LPI+K+TFA+ DG+GG+IIDSGTTITY+EESAF+++KKEF SQTKL Sbjct: 290 YLSLEGISVGDTLLPIKKSTFALQSDGSGGVIIDSGTTITYIEESAFDLVKKEFKSQTKL 349 Query: 1189 NVDNSGASGLDLCFELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMIGDAKNGVACL 1368 VD+SG++GLDLCF+LP D + +E+PKL+ HF+GA LDLPGENYMI D+ G+ CL Sbjct: 350 TVDDSGSAGLDLCFKLPSD----SSQVEVPKLIFHFEGADLDLPGENYMIADSTVGLVCL 405 Query: 1369 AMXXXXXXXXXXXXXXXXMMVVHDLDKETLSFVPTNCDKL 1488 AM MV+HDL KETLSF+PT CDKL Sbjct: 
406 AMGSSSGMSIFGNVQQQDTMVIHDLAKETLSFLPTKCDKL 445 >ref|XP_022773823.1| aspartic proteinase nepenthesin-1-like [Durio zibethinus] Length = 442 Score = 478 bits (1230), Expect = e-162 Identities = 254/458 (55%), Positives = 312/458 (68%), Gaps = 4/458 (0%) Frame = +1 Query: 127 SSLHSLL---FIVVYMLLICSPLTLSTSRRVLRDHFTNHPENGFRITLKHVDSNKNLTKF 297 +SL+SL F+ + ++ + +STSRR L N +NGFR+TLKHVDS KNLTK+ Sbjct: 2 ASLYSLCCVSFLALAIVALYVSPAVSTSRRALGHR--NKLQNGFRVTLKHVDSGKNLTKW 59 Query: 298 ERLQRGVMRGNLRLERIINNMMAXXXXXXXXXXXXXXXHAGNGEFLMNLAIGTPPEPYSA 477 E++QRG+ RG+ RL+R+ ++A GNGEFLM L+IGTPP YSA Sbjct: 60 EQIQRGIKRGHHRLQRLNAMVLAATDSAEVQAPVF----VGNGEFLMELSIGTPPNSYSA 115 Query: 478 IMDTGSDLIWTQCKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKALPTSDCGSDGCEY 657 I+DTGSDLIWTQCKPC++CF TP+FDP K LC ALP S C DGC+Y Sbjct: 116 ILDTGSDLIWTQCKPCSQCFSQTTPIFDPKKSSTFSKLSCSSQLCAALPQSIC-DDGCQY 174 Query: 658 LYSYGDYSSTQGVLATETFTFDKVSVPKVGFGCGEDNEXXXXXXXXXXXXXXXXXXXXVS 837 LY+YGDYSSTQGV+ TETFTF KVSVPK+GFGCGEDNE VS Sbjct: 175 LYAYGDYSSTQGVMGTETFTFGKVSVPKIGFGCGEDNEGDGFSQGAGLVGLGRGPLSLVS 234 Query: 838 QLKQSKFSYCLTSMNDDTSSSENPTSTLIMGSLATEISNDTTF-TTPLLKNPSQPSFYYL 1014 QLK+ KFSYCLTS++ E STL+MGS+A+ S TTPL+ NPSQPSFYYL Sbjct: 235 QLKEPKFSYCLTSID------ETQKSTLLMGSIASVNSTSGAIKTTPLILNPSQPSFYYL 288 Query: 1015 SLTGISVGNVDLPIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNMLKKEFVSQTKLNV 1194 SL GISVG+ LPI+K+TFA+ DDGTGG+IIDSGTTITYLEESAF+++KKEF SQ KL+V Sbjct: 289 SLQGISVGSTRLPIKKSTFALQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMKLSV 348 Query: 1195 DNSGASGLDLCFELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMIGDAKNGVACLAM 1374 DNSG++GL+LCF LP SG T +++PKLV HFDGA LDLPGENYMI D+ +GV CLAM Sbjct: 349 DNSGSTGLELCFSLP----SGSTQVDVPKLVFHFDGADLDLPGENYMIADSSSGVICLAM 404 Query: 1375 XXXXXXXXXXXXXXXXMMVVHDLDKETLSFVPTNCDKL 1488 ++V++D +KETLSF+ T C KL Sbjct: 405 GSSTGMSIFGNIQQQNLLVLYDFEKETLSFLRTQCHKL 442 >ref|XP_021671882.1| aspartic proteinase nepenthesin-1, partial [Hevea brasiliensis] Length = 499 Score = 479 bits (1234), Expect = e-162 Identities = 263/470 (55%), Positives = 315/470 
(67%), Gaps = 7/470 (1%) Frame = +1 Query: 100 YTLSL*NMASSLHSLLFIVVYML---LICSPLTLSTSRRVLRDHFTNHP---ENGFRITL 261 Y S+ MAS I+V ++ L SP STSRRVL +HP +NGFR+ L Sbjct: 51 YVFSVTLMASMFPLSWAIIVALVISSLFASP-AFSTSRRVL-----DHPPEVKNGFRVML 104 Query: 262 KHVDSNKNLTKFERLQRGVMRGNLRLERIINNMMAXXXXXXXXXXXXXXXHAGNGEFLMN 441 KHVDS+KNLTKFER+Q G+ R + RL+R+ N M GNGEFLM Sbjct: 105 KHVDSDKNLTKFERIQHGIKRASHRLQRL--NAMVLTASSNSEIDAPVLP--GNGEFLMK 160 Query: 442 LAIGTPPEPYSAIMDTGSDLIWTQCKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKAL 621 LAIGTP E YSAIMDTGSDLIWTQCKPCT+C++ P+P+FDP K LCKAL Sbjct: 161 LAIGTPAETYSAIMDTGSDLIWTQCKPCTQCYNQPSPIFDPKKSSSFSKLSCSSQLCKAL 220 Query: 622 PTSDCGSDGCEYLYSYGDYSSTQGVLATETFTFDKVSVPKVGFGCGEDNEXXXXXXXXXX 801 P S C SDGCEYLY+YGDYSSTQG+LATETFTF VSVP VGFGCGEDNE Sbjct: 221 PQSSC-SDGCEYLYAYGDYSSTQGILATETFTFGNVSVPNVGFGCGEDNEGDGFTQGSGL 279 Query: 802 XXXXXXXXXXVSQLKQSKFSYCLTSMNDDTSSSENPTSTLIMGSLATEISNDTTF-TTPL 978 VSQLK++KFSYCLTS++D TSTL+MGSLA+ N +TPL Sbjct: 280 VGLGRGPLSLVSQLKEAKFSYCLTSIDD------TKTSTLLMGSLASVNGNSNAIKSTPL 333 Query: 979 LKNPSQPSFYYLSLTGISVGNVDLPIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNML 1158 ++NP QPSFYYLSL GI+VG PI+K+TF + +DG+GG+IIDSGTTITYLEESAF+++ Sbjct: 334 IQNPLQPSFYYLSLEGITVGGNRTPIKKSTFQLQEDGSGGVIIDSGTTITYLEESAFDLV 393 Query: 1159 KKEFVSQTKLNVDNSGASGLDLCFELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMI 1338 KKEF SQ L+VDNSGA+GL+LCF LP D + +E+PK VLHFDGA L+LPGENYMI Sbjct: 394 KKEFTSQMGLSVDNSGATGLELCFTLPSD----SSEVEVPKFVLHFDGADLELPGENYMI 449 Query: 1339 GDAKNGVACLAMXXXXXXXXXXXXXXXXMMVVHDLDKETLSFVPTNCDKL 1488 D+ GV CLAM ++V+HDLDKETLSF+PT CD+L Sbjct: 450 ADSSMGVICLAMGSSSGMSILGNVQQQNILVLHDLDKETLSFLPTKCDQL 499 >ref|XP_004239638.1| PREDICTED: aspartic proteinase nepenthesin-1 [Solanum lycopersicum] Length = 441 Score = 476 bits (1225), Expect = e-162 Identities = 247/457 (54%), Positives = 309/457 (67%), Gaps = 1/457 (0%) Frame = +1 Query: 121 MASSLHSLLFIVVYMLLICSPLTLS-TSRRVLRDHFTNHPENGFRITLKHVDSNKNLTKF 297 MASS + + +++ I L +S TSR V+ +H GFR++LKHVDS N TKF Sbjct: 1 
MASSNFAYYILFLFLSSILFALQVSSTSRHVVNNH------KGFRLSLKHVDSGGNFTKF 54 Query: 298 ERLQRGVMRGNLRLERIINNMMAXXXXXXXXXXXXXXXHAGNGEFLMNLAIGTPPEPYSA 477 ERLQR + RG RL+R+ +++A HAGNGEFLM ++IG+P E Y+A Sbjct: 55 ERLQRAMARGKSRLQRL--SLVATLSSRDETNDVKSTIHAGNGEFLMQISIGSPSESYNA 112 Query: 478 IMDTGSDLIWTQCKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKALPTSDCGSDGCEY 657 IMDTGSDLIWTQCKPC +CFD TP+FDP+K LC+ALP S CG CEY Sbjct: 113 IMDTGSDLIWTQCKPCKECFDQSTPIFDPSKSSTFEKISCSNKLCEALPISSCGGSNCEY 172 Query: 658 LYSYGDYSSTQGVLATETFTFDKVSVPKVGFGCGEDNEXXXXXXXXXXXXXXXXXXXXVS 837 +Y+YGDYSS++G LA+ETFTF KVS+P V FGCG DNE VS Sbjct: 173 MYTYGDYSSSEGFLASETFTFGKVSIPNVAFGCGNDNEGSGFSQGAGLVGLGRGPLSLVS 232 Query: 838 QLKQSKFSYCLTSMNDDTSSSENPTSTLIMGSLATEISNDTTFTTPLLKNPSQPSFYYLS 1017 QL S+FSYCLTS+N+D S+ +STL+MGS+A + N+ TTPL+KNP+QPSFYYLS Sbjct: 233 QLHMSRFSYCLTSINEDADST---SSTLLMGSMARDDYNN-IITTPLVKNPTQPSFYYLS 288 Query: 1018 LTGISVGNVDLPIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNMLKKEFVSQTKLNVD 1197 L GISVG+ L I+K+TF++N DG+GGMIIDSGTTITYLEESAF++LKKEF SQ L VD Sbjct: 289 LKGISVGDTQLAIKKSTFSLNKDGSGGMIIDSGTTITYLEESAFSLLKKEFSSQVNLAVD 348 Query: 1198 NSGASGLDLCFELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMIGDAKNGVACLAMX 1377 +S ++GLDLCF+LP S I++PKL+ HF+GA +DLP ENYMI D++ G+ACLAM Sbjct: 349 DSSSTGLDLCFKLP----SNTNNIQVPKLIFHFEGADMDLPAENYMIADSRMGIACLAMG 404 Query: 1378 XXXXXXXXXXXXXXXMMVVHDLDKETLSFVPTNCDKL 1488 MMV+HDLDKETLSFVP CDKL Sbjct: 405 SSSGMSIFGNVQQQNMMVIHDLDKETLSFVPKQCDKL 441 >ref|XP_006345762.1| PREDICTED: aspartic proteinase nepenthesin-1 [Solanum tuberosum] Length = 444 Score = 475 bits (1223), Expect = e-161 Identities = 249/458 (54%), Positives = 307/458 (67%), Gaps = 2/458 (0%) Frame = +1 Query: 121 MASSLHSLLFIVVYMLLICSPLTLS-TSRRVLRDHFTNHPENGFRITLKHVDSNKNLTKF 297 MASS + + +++ I L +S TSR V+ +H GF++ LKHVDS N TKF Sbjct: 1 MASSKFAYYILFLFLSSILFALQVSSTSRHVVNNH------KGFKLNLKHVDSGGNFTKF 54 Query: 298 ERLQRGVMRGNLRLERI-INNMMAXXXXXXXXXXXXXXXHAGNGEFLMNLAIGTPPEPYS 474 ERLQR + RG RL+R+ + A HAGNGEFLM ++IG+P E Y+ Sbjct: 55 
ERLQRAMARGKSRLQRLSLVANFATLSSKDETNDVKSTIHAGNGEFLMQISIGSPSESYN 114 Query: 475 AIMDTGSDLIWTQCKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKALPTSDCGSDGCE 654 AIMDTGSDLIWTQCKPC +CFD TP+FDP+K LC+ALPTS CG + CE Sbjct: 115 AIMDTGSDLIWTQCKPCKECFDQSTPIFDPSKSSTFEKISCSNKLCEALPTSSCGDNNCE 174 Query: 655 YLYSYGDYSSTQGVLATETFTFDKVSVPKVGFGCGEDNEXXXXXXXXXXXXXXXXXXXXV 834 Y+Y+YGDYSS++G LA+ETFTF KVS+P V FGCG DNE V Sbjct: 175 YMYTYGDYSSSEGFLASETFTFGKVSIPNVAFGCGNDNEGSGFSQGAGLVGLGRGSLSLV 234 Query: 835 SQLKQSKFSYCLTSMNDDTSSSENPTSTLIMGSLATEISNDTTFTTPLLKNPSQPSFYYL 1014 SQL S+FSYCLTS+N+D + +STL+MGS+A + N+ TTPL+KNP+QPSFYYL Sbjct: 235 SQLHMSRFSYCLTSINEDAYTK---SSTLLMGSMAHDDYNN-IITTPLVKNPTQPSFYYL 290 Query: 1015 SLTGISVGNVDLPIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNMLKKEFVSQTKLNV 1194 SL GISVG+ L I+K+TF++N DGTGGMIIDSGTTITYLEESAF++LKKEF SQ L V Sbjct: 291 SLKGISVGDTQLAIKKSTFSLNKDGTGGMIIDSGTTITYLEESAFSLLKKEFSSQVNLPV 350 Query: 1195 DNSGASGLDLCFELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMIGDAKNGVACLAM 1374 D+S ++GLDLCF LP S IE+PKL+ HF+GA +DLP ENYMI D++ G+ACLAM Sbjct: 351 DDSSSTGLDLCFILP----SNTNNIEVPKLIFHFEGADMDLPAENYMIADSRMGIACLAM 406 Query: 1375 XXXXXXXXXXXXXXXXMMVVHDLDKETLSFVPTNCDKL 1488 MMV+HDLDKETLSFVPT CDKL Sbjct: 407 GSSSGMSIFGNVQQQNMMVIHDLDKETLSFVPTQCDKL 444 >gb|PPD91976.1| hypothetical protein GOBAR_DD11093 [Gossypium barbadense] Length = 444 Score = 474 bits (1219), Expect = e-161 Identities = 249/447 (55%), Positives = 306/447 (68%), Gaps = 1/447 (0%) Frame = +1 Query: 151 IVVYMLLICSPLTLSTSRRVLRDHFTNHPENGFRITLKHVDSNKNLTKFERLQRGVMRGN 330 ++V + L SP+ +STSRRVL D+ ENGFRITLKHVDS+KNLTK+ER+QRG+ RGN Sbjct: 14 LLVIVALYVSPV-VSTSRRVLGDY--GKLENGFRITLKHVDSSKNLTKWERIQRGIKRGN 70 Query: 331 LRLERIINNMMAXXXXXXXXXXXXXXXHAGNGEFLMNLAIGTPPEPYSAIMDTGSDLIWT 510 RL+R+ ++A AGNGEFLM+L+IGTPP YSAI+DTGSDLIWT Sbjct: 71 HRLQRLNAMVLAASGDSAEVQAPIV---AGNGEFLMDLSIGTPPNSYSAILDTGSDLIWT 127 Query: 511 QCKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKALPTSDCGSDGCEYLYSYGDYSSTQ 690 QCKPCT+CFD TP+FDP K LC+ALP S C CEYLY+YGDYSSTQ Sbjct: 128 
QCKPCTQCFDQSTPIFDPQKSSTFTKLSCSSDLCEALPQSTCSDGSCEYLYTYGDYSSTQ 187 Query: 691 GVLATETFTFDKVSVPKVGFGCGEDNEXXXXXXXXXXXXXXXXXXXXVSQLKQSKFSYCL 870 GV+ATETF FD VSVP +GFGCGEDNE VSQLK+ KFSYCL Sbjct: 188 GVMATETFKFDSVSVPNIGFGCGEDNEGDGFSQGAGLVGLGRGPLSLVSQLKEPKFSYCL 247 Query: 871 TSMNDDTSSSENPTSTLIMGSLAT-EISNDTTFTTPLLKNPSQPSFYYLSLTGISVGNVD 1047 T+M+ E S L+MGS+A+ S TTPL++NPSQPSFYYLSL GI+VG+ Sbjct: 248 TAMD------ETQKSLLLMGSIASANESLGEMRTTPLIRNPSQPSFYYLSLQGITVGSTR 301 Query: 1048 LPIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNMLKKEFVSQTKLNVDNSGASGLDLC 1227 LPI+++TFA+ D+G+GG+IIDSGTTITYLE++AF++LKK F+ + KL VD ++GLDLC Sbjct: 302 LPIKESTFALEDNGSGGVIIDSGTTITYLEQAAFSVLKKAFILEMKLPVDTLSSTGLDLC 361 Query: 1228 FELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMIGDAKNGVACLAMXXXXXXXXXXX 1407 F LP SG T +E+PKLV HFDGA LDLP ENYMI D+ +GV CLAM Sbjct: 362 FTLP----SGSTQVEVPKLVFHFDGADLDLPAENYMIADSSSGVICLAMGGSSGMSIFGN 417 Query: 1408 XXXXXMMVVHDLDKETLSFVPTNCDKL 1488 M+VVHDL+KET+SF+ T C + Sbjct: 418 VQQQNMLVVHDLEKETVSFIETQCQNI 444 >ref|XP_016698862.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Gossypium hirsutum] Length = 444 Score = 473 bits (1217), Expect = e-160 Identities = 249/446 (55%), Positives = 305/446 (68%), Gaps = 1/446 (0%) Frame = +1 Query: 154 VVYMLLICSPLTLSTSRRVLRDHFTNHPENGFRITLKHVDSNKNLTKFERLQRGVMRGNL 333 +V + L SP+ +STSRRVL D+ ENGFRITLKHVDS+KNLTK+ER+QRG+ RGN Sbjct: 15 LVIVALYVSPV-VSTSRRVLGDY--GKLENGFRITLKHVDSSKNLTKWERIQRGIKRGNH 71 Query: 334 RLERIINNMMAXXXXXXXXXXXXXXXHAGNGEFLMNLAIGTPPEPYSAIMDTGSDLIWTQ 513 RL+R+ ++A AGNGEFLM+L+IGTPP YSAI+DTGSDLIWTQ Sbjct: 72 RLQRLNAMVLAASGDSAEVQAPIV---AGNGEFLMDLSIGTPPNSYSAILDTGSDLIWTQ 128 Query: 514 CKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKALPTSDCGSDGCEYLYSYGDYSSTQG 693 CKPCT+CFD TP+FDP K LC+ALP S C CEYLY+YGDYSSTQG Sbjct: 129 CKPCTQCFDQSTPIFDPQKSSTFTKLSCSSDLCEALPQSTCSDGSCEYLYTYGDYSSTQG 188 Query: 694 VLATETFTFDKVSVPKVGFGCGEDNEXXXXXXXXXXXXXXXXXXXXVSQLKQSKFSYCLT 873 V+ATETF FD VSVP +GFGCGEDNE VSQLK+ KFSYCLT Sbjct: 189 VMATETFKFDSVSVPNIGFGCGEDNEGDGFSQGAGLVGLGRGPLSLVSQLKEPKFSYCLT 
248 Query: 874 SMNDDTSSSENPTSTLIMGSLAT-EISNDTTFTTPLLKNPSQPSFYYLSLTGISVGNVDL 1050 +M+ E S L+MGS+A+ S TTPL++NPSQPSFYYLSL GI+VG+ L Sbjct: 249 AMD------ETQKSLLLMGSIASANESLGEMRTTPLIRNPSQPSFYYLSLQGITVGSTRL 302 Query: 1051 PIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNMLKKEFVSQTKLNVDNSGASGLDLCF 1230 PI+++TFA+ D+G+GG+IIDSGTTITYLE++AF++LKK F+ + KL VD ++GLDLCF Sbjct: 303 PIKESTFALEDNGSGGVIIDSGTTITYLEQAAFSVLKKAFILEMKLPVDTLSSTGLDLCF 362 Query: 1231 ELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMIGDAKNGVACLAMXXXXXXXXXXXX 1410 LP SG T +E+PKLV HFDGA LDLP ENYMI D+ +GV CLAM Sbjct: 363 TLP----SGSTQVEVPKLVFHFDGADLDLPAENYMIADSSSGVICLAMGGSSGMSIFGNV 418 Query: 1411 XXXXMMVVHDLDKETLSFVPTNCDKL 1488 M+VVHDL+KET+SF+ T C + Sbjct: 419 QQQNMLVVHDLEKETVSFIETQCQNI 444 >gb|PIN24256.1| Aspartyl protease [Handroanthus impetiginosus] Length = 437 Score = 473 bits (1216), Expect = e-160 Identities = 253/456 (55%), Positives = 307/456 (67%), Gaps = 2/456 (0%) Frame = +1 Query: 127 SSLHSLLFIVVYMLLICSPLTLSTSRRVLRDHFTNHPENGFRITLKHVDSNKNLTKFERL 306 S SL ++ + L SP ++STSR +L H N P NGF++TLKHVDS +N TKFERL Sbjct: 2 SPYSSLTLLLALIFLFISP-SISTSRNLLDHH--NVP-NGFKVTLKHVDSGRNFTKFERL 57 Query: 307 QRGVMRGNLRLERIINNMMAXXXXXXXXXXXXXXXHAGNGEFLMNLAIGTPPEPYSAIMD 486 QR + R R+ER+ +A HAGNGEFLM L+IGTPPE YSAI+D Sbjct: 58 QRAMKRSGKRMERLYAMALAASDASMEAPI-----HAGNGEFLMELSIGTPPESYSAILD 112 Query: 487 TGSDLIWTQCKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKALPTSDCGSDGCEYLYS 666 TGSDLIWTQCKPC +CF+ PTP+FDP K LC ALP S C ++ CEYLYS Sbjct: 113 TGSDLIWTQCKPCKECFNQPTPIFDPKKSSSFSKMSCSSNLCGALPMSSCSNENCEYLYS 172 Query: 667 YGDYSSTQGVLATETFTFDKVSVPKVGFGCGEDNEXXXXXXXXXXXXXXXXXXXXVSQLK 846 YGDYSSTQGV+ATETFTF VSVPKVGFGCG +NE VSQL Sbjct: 173 YGDYSSTQGVMATETFTFGDVSVPKVGFGCGLENEGGGFNQGGGLVGLGRGPLSLVSQLD 232 Query: 847 QSKFSYCLTSMNDDTSSSENPTSTLIMGSLAT--EISNDTTFTTPLLKNPSQPSFYYLSL 1020 + +FSYCLTS++ + TSTL+MGS A+ + D TTPL+KNPS PSFYYLSL Sbjct: 233 EPEFSYCLTSID------SSKTSTLLMGSSASGNKTQGDEIKTTPLIKNPSSPSFYYLSL 286 Query: 1021 TGISVGNVDLPIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNMLKKEFVSQTKLNVDN 1200 
GI+VG+ LPIEK+TFA+N DG+GGMIIDSGTTITY+EESAF+++KKEF+ Q KL VD+ Sbjct: 287 EGITVGDTLLPIEKSTFALNKDGSGGMIIDSGTTITYIEESAFDLVKKEFIKQVKLPVDD 346 Query: 1201 SGASGLDLCFELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMIGDAKNGVACLAMXX 1380 S +GLDLCF LP SG +E+PKLV HF+GA LDLPG+NY+I D+ +GVACLAM Sbjct: 347 SNQTGLDLCFTLP----SGAQNVEVPKLVFHFNGADLDLPGDNYIIADS-SGVACLAMGS 401 Query: 1381 XXXXXXXXXXXXXXMMVVHDLDKETLSFVPTNCDKL 1488 ++VVH+L KET+SFVP CDKL Sbjct: 402 SNGMSIFGNFQQQNLLVVHNLAKETISFVPKQCDKL 437 >ref|XP_019245654.1| PREDICTED: aspartic proteinase nepenthesin-1 [Nicotiana attenuata] gb|OIT03336.1| aspartyl protease family protein 2 [Nicotiana attenuata] Length = 448 Score = 473 bits (1216), Expect = e-160 Identities = 250/461 (54%), Positives = 309/461 (67%), Gaps = 5/461 (1%) Frame = +1 Query: 121 MASS--LHSLLFIVVYMLLICSPLTLSTSRRVLRDHFTNHPENGFRITLKHVDSNKNLTK 294 MASS + +LF+ + +L + L STSR L ++ + GF+++LKHVDS N TK Sbjct: 1 MASSNFANFILFLSLSVLFVNIVLVNSTSRHALINN-----QKGFKVSLKHVDSGGNFTK 55 Query: 295 FERLQRGVMRGNLRLERI---INNMMAXXXXXXXXXXXXXXXHAGNGEFLMNLAIGTPPE 465 FERLQR + RG RL+R+ NN++A HAGNGEFLM ++IG+P E Sbjct: 56 FERLQRAMARGKSRLQRLNLMANNLVATTTKDDSDIVKSTI-HAGNGEFLMQISIGSPSE 114 Query: 466 PYSAIMDTGSDLIWTQCKPCTKCFDAPTPVFDPTKXXXXXXXXXXXXLCKALPTSDCGSD 645 Y+AIMDTGSDLIWTQCKPC +CFD TP+FDP+K LC+ALP S CG Sbjct: 115 TYNAIMDTGSDLIWTQCKPCKECFDQSTPIFDPSKSSTFSKIPCSNKLCEALPMSSCGDS 174 Query: 646 GCEYLYSYGDYSSTQGVLATETFTFDKVSVPKVGFGCGEDNEXXXXXXXXXXXXXXXXXX 825 CEY+Y+YGDYSS++G LA+ETFTF K S+PKV FGCG DN+ Sbjct: 175 NCEYMYTYGDYSSSEGFLASETFTFGKNSIPKVAFGCGNDNQGSGFSQGAGLVGLGRGPL 234 Query: 826 XXVSQLKQSKFSYCLTSMNDDTSSSENPTSTLIMGSLATEISNDTTFTTPLLKNPSQPSF 1005 VSQL+ KFSYCLTS+NDD +S +STL+MG+++ + TTPL+KNPSQPSF Sbjct: 235 SLVSQLQMPKFSYCLTSINDDANS--KISSTLLMGTISND-DYSNIITTPLVKNPSQPSF 291 Query: 1006 YYLSLTGISVGNVDLPIEKTTFAINDDGTGGMIIDSGTTITYLEESAFNMLKKEFVSQTK 1185 YYLSL GISVG+ LPI+K+TF++N DGTGG+IIDSGTTITYLEESAF +LKKEF SQ Sbjct: 292 YYLSLEGISVGDTRLPIKKSTFSLNQDGTGGVIIDSGTTITYLEESAFKLLKKEFSSQVN 351 Query: 1186 
LNVDNSGASGLDLCFELPQDDGSGETTIEIPKLVLHFDGASLDLPGENYMIGDAKNGVAC 1365 L VD+S ++GLDLCF LP S IE+PKLV HF+GA LDLP +NYMI D+ GVAC Sbjct: 352 LPVDDSSSTGLDLCFTLP----SNTNNIEVPKLVFHFEGADLDLPADNYMIADSSMGVAC 407 Query: 1366 LAMXXXXXXXXXXXXXXXXMMVVHDLDKETLSFVPTNCDKL 1488 LAM M+V+HDL+KETLSFVPT CDKL Sbjct: 408 LAMGGSTGMSIFGNVQQQNMLVIHDLNKETLSFVPTQCDKL 448