BLASTX nr result
ID: Akebia25_contig00017374
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00017374
         (1615 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2...   527   e-147
ref|XP_002309394.1| aspartyl protease family protein [Populus tr...   505   e-140
ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2...   486   e-134
ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor,...   484   e-134
emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]   483   e-134
ref|XP_006361102.1| PREDICTED: aspartic proteinase nepenthesin-1...   479   e-132
ref|XP_007015710.1| Eukaryotic aspartyl protease family protein,...   479   e-132
ref|XP_004241344.1| PREDICTED: aspartic proteinase nepenthesin-2...   479   e-132
ref|XP_002534234.1| pepsin A, putative [Ricinus communis] gi|223...   471   e-130
gb|EYU18131.1| hypothetical protein MIMGU_mgv1a025649mg [Mimulus...   471   e-130
ref|XP_007027933.1| Eukaryotic aspartyl protease family protein,...   471   e-130
ref|XP_007162958.1| hypothetical protein PHAVU_001G194500g [Phas...   470   e-130
ref|XP_007202027.1| hypothetical protein PRUPE_ppa005104mg [Prun...   461   e-127
ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2...   461   e-127
ref|XP_002312826.2| hypothetical protein POPTR_0009s16390g [Popu...   460   e-127
ref|XP_004303503.1| PREDICTED: aspartic proteinase nepenthesin-2...   451   e-124
ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2...   451   e-124
ref|XP_006424129.1| hypothetical protein CICLE_v10028374mg [Citr...
449 e-123 ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic pro... 449 e-123 ref|XP_006481530.1| PREDICTED: aspartic proteinase nepenthesin-1... 447 e-123 >ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera] Length = 467 Score = 527 bits (1357), Expect = e-147 Identities = 261/439 (59%), Positives = 314/439 (71%), Gaps = 1/439 (0%) Frame = +2 Query: 80 STTITLSHNHFDINPSPNPWQKLTHMASASLTRAKHLKNPKNTNLSTIPLYPRSYGGYSI 259 ++ ITL + +P P+P++ L H+ SASL RA+HLKNPK T ST PL+ SYG YSI Sbjct: 33 NSPITLPLSASKPSPPPDPYRNLRHLVSASLIRARHLKNPKTTPTSTTPLFTHSYGAYSI 92 Query: 260 SLSFGTPPQTIPFIMDTGSDLVWFPCTHRYLCKNCSFSNPNTKIPTFIPXXXXXXXXXXX 439 LSFGTPPQT+P IMDTGSDLVWFPCTHRY+C+NCSFS N FIP Sbjct: 93 PLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSSKVLGC 152 Query: 440 XXXXXTWVHDPDVQSRCTDCEPNSTNCSQICPPYXXXXXXXXXXXXXXXETLDLPKKKVP 619 W+H VQSRC DCEP S NC+QICPPY ETLDLP K VP Sbjct: 153 VNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITGGIMLSETLDLPGKGVP 212 Query: 620 NFVVGCSLFSSRSPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDSSESTSLVLDGES 799 NF+VGCS+ S+ P+GI+GFGRGP SLPSQL L KFSYCLL R+DD++ES+SLVLDGES Sbjct: 213 NFIVGCSVLSTSQPAGISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTESSSLVLDGES 272 Query: 800 DPGDTKTNGVSYTPLLKNQDNSNP-VFSVYYYVGLRKISIGGKHVKIPYSYLSLGSDGNG 976 D G+ KT G+SYTP ++N + FSVYYY+GLR I++GGKHVKIPY YL G+DG+G Sbjct: 273 DSGE-KTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIPYKYLIPGADGDG 331 Query: 977 GTIVDSGTTFTFMEGRAFELVAREVENQTAHYRRATDVETRTGLRPCFYVSNEKSVSLPK 1156 GTI+DSGTTFT+M+G FELVA E E Q +RAT+VE TGLRPCF +S + S P+ Sbjct: 332 GTIIDSGTTFTYMKGEIFELVAAEFEKQ-VQSKRATEVEGITGLRPCFNISGLNTPSFPE 390 Query: 1157 LVFHFKGGAEMELPLANYFSFIDDDSGVVCMTFVTDGVGGSDIRSGPSVILGNYQMQNYY 1336 L F+GGAEMELPLANY +F+ D VVC+T VTDG G + GP++ILGN+Q QN+Y Sbjct: 391 LTLKFRGGAEMELPLANYVAFLGGDD-VVCLTIVTDGAAGKEFSGGPAIILGNFQQQNFY 449 Query: 1337 VEYDLKNERLGFRQQQTCK 1393 VEYDL+NERLGFR QQ+CK Sbjct: 450 VEYDLRNERLGFR-QQSCK 467 >ref|XP_002309394.1| aspartyl protease family protein [Populus trichocarpa] 
gi|222855370|gb|EEE92917.1| aspartyl protease family protein [Populus trichocarpa] Length = 469 Score = 505 bits (1301), Expect = e-140 Identities = 251/422 (59%), Positives = 296/422 (70%), Gaps = 3/422 (0%) Frame = +2 Query: 125 SPNPWQKLTHMASASLTRAKHLKNPKNT-NLSTIPLYPRSYGGYSISLSFGTPPQTIPFI 301 S NPW L H+AS SL+RA H+K+PK +L PL+PRSYGGYSISL+FGTPPQT F+ Sbjct: 49 SKNPWGALNHLASLSLSRAHHIKSPKTKFSLLKTPLFPRSYGGYSISLNFGTPPQTTKFV 108 Query: 302 MDTGSDLVWFPCTHRYLCKNCSFSNPN-TKIPTFIPXXXXXXXXXXXXXXXXTWVHDPDV 478 MDTGS LVWFPCT RYLC C F N T IPTFIP +W+ P V Sbjct: 109 MDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSNLIGCKNHKCSWLFGPKV 168 Query: 479 QSRCTDCEPNSTNCSQICPPYXXXXXXXXXXXXXXXETLDLPKKK-VPNFVVGCSLFSSR 655 QS+C +C+P + NC+Q CPPY ETLD P KK +P F+VGCSLFS R Sbjct: 169 QSKCQECDPTTQNCTQSCPPYVIQYGLGSTAGLLLSETLDFPHKKTIPGFLVGCSLFSIR 228 Query: 656 SPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDSSESTSLVLDGESDPGDTKTNGVSY 835 P GIAGFGR P SLPSQL L KFSYCL+ H FDD+ S+ LVLD S DTKT G+SY Sbjct: 229 QPEGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPASSDLVLDTGSGSDDTKTPGLSY 288 Query: 836 TPLLKNQDNSNPVFSVYYYVGLRKISIGGKHVKIPYSYLSLGSDGNGGTIVDSGTTFTFM 1015 TP Q N F YYYV LR I IG HVK+PY +L GSDGNGGTIVDSGTTFTFM Sbjct: 289 TPF---QKNPTAAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGSDGNGGTIVDSGTTFTFM 345 Query: 1016 EGRAFELVAREVENQTAHYRRATDVETRTGLRPCFYVSNEKSVSLPKLVFHFKGGAEMEL 1195 E +ELVA+E E Q AHY AT+V+ +TGLRPCF +S EKSVS+P+ +FHFKGGA+M L Sbjct: 346 EKPVYELVAKEFEKQVAHYTVATEVQNQTGLRPCFNISGEKSVSVPEFIFHFKGGAKMAL 405 Query: 1196 PLANYFSFIDDDSGVVCMTFVTDGVGGSDIRSGPSVILGNYQMQNYYVEYDLKNERLGFR 1375 PLANYFSF+ DSGV+C+T V+D + GS I GP++ILGNYQ +N++VE+DLKNER GF+ Sbjct: 406 PLANYFSFV--DSGVICLTIVSDNMSGSGIGGGPAIILGNYQQRNFHVEFDLKNERFGFK 463 Query: 1376 QQ 1381 QQ Sbjct: 464 QQ 465 >ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera] Length = 469 Score = 486 bits (1250), Expect = e-134 Identities = 243/429 (56%), Positives = 297/429 (69%), Gaps = 1/429 (0%) Frame = +2 Query: 110 FDINPSPNPWQKLTHMASASLTRAKHLKNPKNTNLSTIPLYPRSYGGYSISLSFGTPPQT 289 
F NPS +PWQ L+H+ SASLTRA HLK+ KNT+ PL+ SYGGYS+SLSFGTP QT Sbjct: 43 FTKNPSSDPWQLLSHLTSASLTRAHHLKHRKNTSSVNTPLFAHSYGGYSVSLSFGTPSQT 102 Query: 290 IPFIMDTGSDLVWFPCTHRYLCKNCSFSNPN-TKIPTFIPXXXXXXXXXXXXXXXXTWVH 466 + F+MDTGS LVWFPCT RY+C CSF N + KIPTFIP +V Sbjct: 103 LSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNPKCGFVM 162 Query: 467 DPDVQSRCTDCEPNSTNCSQICPPYXXXXXXXXXXXXXXXETLDLPKKKVPNFVVGCSLF 646 D +V++RC C+ NS NC++ CP Y E+L ++ P+FVVGCS+ Sbjct: 163 DSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEPDFVVGCSIL 222 Query: 647 SSRSPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDSSESTSLVLDGESDPGDTKTNG 826 SSR PSGIAGFGRGP+SLP Q+ L KFSYCLL HRFDDS +S+ + L D D KT G Sbjct: 223 SSRQPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGG 282 Query: 827 VSYTPLLKNQDNSNPVFSVYYYVGLRKISIGGKHVKIPYSYLSLGSDGNGGTIVDSGTTF 1006 +SYTP KN +SN F YYYV LR I +G K VK+PYS++ GSDGNGGTIVDSG+TF Sbjct: 283 LSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGNGGTIVDSGSTF 342 Query: 1007 TFMEGRAFELVAREVENQTAHYRRATDVETRTGLRPCFYVSNEKSVSLPKLVFHFKGGAE 1186 TFME FE VA E + Q A+Y RA DVE +GL+PCF +S SV+LP LVF FKGGA+ Sbjct: 343 TFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPCFNLSGVGSVALPSLVFQFKGGAK 402 Query: 1187 MELPLANYFSFIDDDSGVVCMTFVTDGVGGSDIRSGPSVILGNYQMQNYYVEYDLKNERL 1366 MELP+ANYFS + D S V+C+T V++ GS + SGPS+ILGNYQ QN+Y EYDL+NER Sbjct: 403 MELPVANYFSLVGDLS-VLCLTIVSNEAVGSTLSSGPSIILGNYQSQNFYTEYDLENERF 461 Query: 1367 GFRQQQTCK 1393 GFR+Q+ CK Sbjct: 462 GFRRQR-CK 469 >ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus communis] gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus communis] Length = 447 Score = 484 bits (1247), Expect = e-134 Identities = 248/437 (56%), Positives = 305/437 (69%), Gaps = 1/437 (0%) Frame = +2 Query: 86 TITLSHNHFDINPSPNPWQKLTHMASASLTRAKHLKNPKNTNLSTIPLYPRSYGGYSISL 265 +I LSH++ + NPS + QKL ++ S SL RA HLKNP+ T P++ SYGGYSISL Sbjct: 27 SIPLSHSYTNQNPSQDHLQKLNYLVSTSLARAHHLKNPQTT-----PVFSHSYGGYSISL 81 Query: 266 
SFGTPPQTIPFIMDTGSDLVWFPCTHRYLCKNCSFSNPNTKIPTFIPXXXXXXXXXXXXX 445 SFGTPPQT+ F+MDTGS VWFPCT RYLC NCSF++ +I F+P Sbjct: 82 SFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNNCSFTS---RISPFLPKHSSSSKIIGCKN 138 Query: 446 XXXTWVHDPDVQSRCTDCEPNSTNCSQICPPYXXXXXXXXXXXXXXXETLDLPKKKVPNF 625 +W+H D+ RCTDC+ NS NCSQICPPY ETL L VPNF Sbjct: 139 PKCSWIHQTDL--RCTDCDNNSRNCSQICPPYLILYGSGTTGGVALSETLHLHGLIVPNF 196 Query: 626 VVGCSLFSSRSPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDSSESTSLVLDGESDP 805 +VGCS+FSSR P+GIAGFGRGP+SLPSQL L KFSYCLL H+FDD+ ES+SLVLD +SD Sbjct: 197 LVGCSVFSSRQPAGIAGFGRGPSSLPSQLGLTKFSYCLLSHKFDDTQESSSLVLDSQSD- 255 Query: 806 GDTKTNGVSYTPLLKNQD-NSNPVFSVYYYVGLRKISIGGKHVKIPYSYLSLGSDGNGGT 982 D KT + YTPL+KN P FSVYYYV LR+ISIGG+ VKIPY YLS DGNGGT Sbjct: 256 SDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIPYKYLSPDKDGNGGT 315 Query: 983 IVDSGTTFTFMEGRAFELVAREVENQTAHYRRATDVETRTGLRPCFYVSNEKSVSLPKLV 1162 I+DSGTTFT+M AFE+++ E +Q +Y RA VE +GL+PCF VS K + LP+L Sbjct: 316 IIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEALSGLKPCFNVSGAKELELPQLR 375 Query: 1163 FHFKGGAEMELPLANYFSFIDDDSGVVCMTFVTDGVGGSDIRSGPSVILGNYQMQNYYVE 1342 HFKGGA++ELPL NYF+F+ V C T VTD G++ SGP +ILGN+QMQN+YVE Sbjct: 376 LHFKGGADVELPLENYFAFLGSRE-VACFTVVTD---GAEKASGPGMILGNFQMQNFYVE 431 Query: 1343 YDLKNERLGFRQQQTCK 1393 YDL+NERLGF+ +++CK Sbjct: 432 YDLQNERLGFK-KESCK 447 >emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera] Length = 609 Score = 483 bits (1244), Expect = e-134 Identities = 241/426 (56%), Positives = 294/426 (69%), Gaps = 1/426 (0%) Frame = +2 Query: 110 FDINPSPNPWQKLTHMASASLTRAKHLKNPKNTNLSTIPLYPRSYGGYSISLSFGTPPQT 289 F NPS +PWQ L+H+ SASLTRA HLK+ KNT+ PL+ SYGGYS+SLSFGTP QT Sbjct: 43 FTKNPSSDPWQLLSHLTSASLTRAHHLKHRKNTSSVNTPLFAHSYGGYSVSLSFGTPSQT 102 Query: 290 IPFIMDTGSDLVWFPCTHRYLCKNCSFSNPN-TKIPTFIPXXXXXXXXXXXXXXXXTWVH 466 + F+MDTGS LVWFPCT RY+C CSF N + KIPTFIP +V Sbjct: 103 LSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNPKCGFVM 162 Query: 467 DPDVQSRCTDCEPNSTNCSQICPPYXXXXXXXXXXXXXXXETLDLPKKKVPNFVVGCSLF 646 D +V++RC C+ NS NC++ CP Y 
E+L ++ P+FVVGCS+ Sbjct: 163 DSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEPDFVVGCSIL 222 Query: 647 SSRSPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDSSESTSLVLDGESDPGDTKTNG 826 SSR PSGIAGFGRGP+SLP Q+ L KFSYCLL HRFDDS +S+ + L D D KT G Sbjct: 223 SSRQPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGG 282 Query: 827 VSYTPLLKNQDNSNPVFSVYYYVGLRKISIGGKHVKIPYSYLSLGSDGNGGTIVDSGTTF 1006 +SYTP KN +SN F YYYV LR I +G K VK PYS++ GSDGNGGTIVDSG+TF Sbjct: 283 LSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKXPYSFMVAGSDGNGGTIVDSGSTF 342 Query: 1007 TFMEGRAFELVAREVENQTAHYRRATDVETRTGLRPCFYVSNEKSVSLPKLVFHFKGGAE 1186 TFME FE VA E + Q A+Y RA DVE +GL+PCF +S SV+LP LVF FKGGA+ Sbjct: 343 TFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPCFNLSGVGSVALPSLVFQFKGGAK 402 Query: 1187 MELPLANYFSFIDDDSGVVCMTFVTDGVGGSDIRSGPSVILGNYQMQNYYVEYDLKNERL 1366 MELP+ANYFS + D S V+C+T V++ GS + SGPS+ILGNYQ QN+Y EYDL+NER Sbjct: 403 MELPVANYFSLVGDLS-VLCLTIVSNEAVGSTLSSGPSIILGNYQSQNFYTEYDLENERF 461 Query: 1367 GFRQQQ 1384 GFR+Q+ Sbjct: 462 GFRRQR 467 >ref|XP_006361102.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Solanum tuberosum] Length = 460 Score = 479 bits (1234), Expect = e-132 Identities = 245/441 (55%), Positives = 310/441 (70%), Gaps = 3/441 (0%) Frame = +2 Query: 80 STTITLSHNHFDI-NPSPNPWQKLTHMASASLTRAKHLKNPKNTNLSTIPLYPRSYGGYS 256 STT T+ + F+ NPS + ++KLTH+AS SL RA ++K +++ +ST PLYP+SYGGYS Sbjct: 25 STTTTIPLSLFNTKNPSQDFYEKLTHLASISLARANYIKKSQDSPVSTTPLYPQSYGGYS 84 Query: 257 ISLSFGTPPQTIPFIMDTGSDLVWFPCTHRYLCKNCSFSNPNTK-IPTFIPXXXXXXXXX 433 I+LSFGTPPQ IPFIMDTGS+ VWFPCT RYLC NC+ S+ ++ IPTFIP Sbjct: 85 IALSFGTPPQKIPFIMDTGSNFVWFPCTTRYLCSNCTVSSATSQSIPTFIPKSSSSARVL 144 Query: 434 XXXXXXXTWVHDPDVQSRCTDCEPNSTNCSQICPPYXXXXXXXXXXXXXXXETLDLPKKK 613 W+H + +SRC DCE + TNC Q+CPPY +TLDL KK Sbjct: 145 GCLNPKCGWIHSNNPKSRCQDCE-SPTNCKQVCPPYIILYGSGSTGGLALVDTLDLSNKK 203 Query: 614 VPNFVVGCSLFSSRSPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDSSESTSLVLDG 793 VPNF+VGCSLFSS+ P+GIAG GRG ASLPSQL + KFSYCL+ H+FDD+ +S++LVLD Sbjct: 204 
VPNFLVGCSLFSSKQPAGIAGLGRGLASLPSQLGVKKFSYCLVSHKFDDTGKSSNLVLDF 263 Query: 794 ESDPGDTKTNGVSYTPLLKNQDNSNP-VFSVYYYVGLRKISIGGKHVKIPYSYLSLGSDG 970 + KT+ +SYTPL KN S SVYYYV LRKI++GGK VKIPY YL+ S+G Sbjct: 264 NAS--GEKTSDLSYTPLQKNPVVSEKNALSVYYYVSLRKITVGGKKVKIPYKYLTTDSNG 321 Query: 971 NGGTIVDSGTTFTFMEGRAFELVAREVENQTAHYRRATDVETRTGLRPCFYVSNEKSVSL 1150 NGG+IVDSGTTFTFM FE V Q R+ +E TGLRPCF +S +++VSL Sbjct: 322 NGGSIVDSGTTFTFMNRGVFEPVLDAFVKQVKGIPRSESIEIITGLRPCFNISRQETVSL 381 Query: 1151 PKLVFHFKGGAEMELPLANYFSFIDDDSGVVCMTFVTDGVGGSDIRSGPSVILGNYQMQN 1330 P+L FH+KGGAEM LP+ANYFSF ++ V+C+T VTD G ++ +GPS+ILGN+QMQN Sbjct: 382 PELKFHYKGGAEMTLPIANYFSFA-GETDVICLTMVTDSAFGPELSTGPSIILGNFQMQN 440 Query: 1331 YYVEYDLKNERLGFRQQQTCK 1393 Y VE+DLKNE+ GF+QQ CK Sbjct: 441 YLVEFDLKNEKFGFKQQM-CK 460 >ref|XP_007015710.1| Eukaryotic aspartyl protease family protein, putative [Theobroma cacao] gi|508786073|gb|EOY33329.1| Eukaryotic aspartyl protease family protein, putative [Theobroma cacao] Length = 466 Score = 479 bits (1234), Expect = e-132 Identities = 251/451 (55%), Positives = 303/451 (67%), Gaps = 15/451 (3%) Frame = +2 Query: 74 IKSTTITLSHNHFDINPSPNPWQKLTHMASASLTRAKHLKNPKNT--------NLSTIPL 229 I S+ + L NPSP+P+Q L +AS+SL RA HLKNP+ T +T PL Sbjct: 19 IISSALHLPLAQLQKNPSPDPYQTLNRLASSSLKRAHHLKNPQPTATKGGASPTTTTTPL 78 Query: 230 YPRSYGGYSISLSFGTPPQTIPFIMDTGSDLVWFPCTHRYLCKNCSFSNPNTKIPTFIPX 409 + SYGGY+ISLSFGTPPQT+PF+MDTGSD VWFPCTH YLCKNCSFS+ N IP+FIP Sbjct: 79 FSHSYGGYTISLSFGTPPQTLPFVMDTGSDFVWFPCTHHYLCKNCSFSSSN--IPSFIPK 136 Query: 410 XXXXXXXXXXXXXXXTWVHDPDVQSRCTDCEPNST--NCSQICPPYXXXXXXXXXXXXXX 583 +W+H + ++C +C NST NCSQICPPY Sbjct: 137 QSSSSKILGCQNPKCSWIHHTNA-TQCDECGNNSTPQNCSQICPPYFIFYGLGTTAGFAL 195 Query: 584 XETLDLPKKKVPNFVVGCSLFSSRSPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDS 763 ETL+L + P+F+VGCSL SS P+G+AGFGRG SLP+QL+L+KFSYCL+ HRFDDS Sbjct: 196 SETLNLGDRIEPDFLVGCSLLSSHQPAGVAGFGRGLPSLPTQLKLDKFSYCLISHRFDDS 255 Query: 764 SESTSLVLDGESDPGDTKTNGVSYTPLLKNQD-NSNPVFSVYYYVGLRKISIGGKHVKIP 940 + S+ L+LD SD D K G++YTP LKN 
F VYYY+GLRKIS+GG+HVK+P Sbjct: 256 TSSSPLILDSNSD-FDKKKIGLTYTPFLKNPIVQGKEAFKVYYYLGLRKISVGGRHVKVP 314 Query: 941 YSYLSLGSDGNGGTIVDSGTTFTFMEGRAFELVAREVENQTAHYRRATDVETRTGLRPCF 1120 Y YLS G+DGNGG+IVDSGTTFTFM FE VA E Q Y RA DVE TGLRPCF Sbjct: 315 YKYLSPGNDGNGGSIVDSGTTFTFMAREVFEPVAEEFVKQVKKYSRARDVEDLTGLRPCF 374 Query: 1121 YVSNEKSVSLPKLVFHFKGGAEMELPLANYFSFIDDDSGVVCMTFVT-DGVGGSD---IR 1288 +V + V LP+L HFKGGAE+ LP NYF + D G C+T VT GVGG + + Sbjct: 375 HVKGREKVELPELRLHFKGGAEIALPPNNYFVLV--DGGAACLTVVTGGGVGGGEGEVGQ 432 Query: 1289 SGPSVILGNYQMQNYYVEYDLKNERLGFRQQ 1381 SGP+VILGN+QMQNYYVEYDL+NERLG R Q Sbjct: 433 SGPAVILGNFQMQNYYVEYDLRNERLGLRPQ 463 >ref|XP_004241344.1| PREDICTED: aspartic proteinase nepenthesin-2-like isoform 1 [Solanum lycopersicum] Length = 461 Score = 479 bits (1232), Expect = e-132 Identities = 246/441 (55%), Positives = 309/441 (70%), Gaps = 3/441 (0%) Frame = +2 Query: 80 STTITLSHNHFDI-NPSPNPWQKLTHMASASLTRAKHLKNPKNTNLSTIPLYPRSYGGYS 256 STT T+ + F+ +PS + ++KLTH+AS SL RA ++K +++ +ST PLYP+SYGGYS Sbjct: 26 STTSTIPLSLFNTKHPSQDLYEKLTHLASISLARANYIKKSQDSPVSTTPLYPQSYGGYS 85 Query: 257 ISLSFGTPPQTIPFIMDTGSDLVWFPCTHRYLCKNCSFSNPNTK-IPTFIPXXXXXXXXX 433 I+LSFGTPPQ IPFIMDTGS VWFPCT RYLC NCS S+ ++ IPTFIP Sbjct: 86 ITLSFGTPPQKIPFIMDTGSSFVWFPCTTRYLCTNCSVSSATSQSIPTFIPKSSSSARVV 145 Query: 434 XXXXXXXTWVHDPDVQSRCTDCEPNSTNCSQICPPYXXXXXXXXXXXXXXXETLDLPKKK 613 W+H + +SRC DCE + TNC Q+CPPY +TLDL KK Sbjct: 146 GCLNPKCGWIHSNNPKSRCQDCE-SPTNCKQVCPPYIILYGSGSTGGLALVDTLDLSNKK 204 Query: 614 VPNFVVGCSLFSSRSPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDSSESTSLVLDG 793 VPNF+VGCSLFSS+ P+GIAG GRG ASLP+QL + KFSYCL+ H+FDD+ +S++LVLD Sbjct: 205 VPNFLVGCSLFSSKQPAGIAGLGRGLASLPNQLGVKKFSYCLVSHKFDDTGKSSNLVLDF 264 Query: 794 ESDPGDTKTNGVSYTPLLKNQDNSNP-VFSVYYYVGLRKISIGGKHVKIPYSYLSLGSDG 970 + KT G+SYTPLLKN S SVYYYV LRKI++GGK VKIPY YL+ S+G Sbjct: 265 NAS--GEKTAGLSYTPLLKNPVVSEKNALSVYYYVSLRKITVGGKKVKIPYKYLTPDSNG 322 Query: 971 NGGTIVDSGTTFTFMEGRAFELVAREVENQTAHYRRATDVETRTGLRPCFYVSNEKSVSL 1150 NGG+IVDSGTTFTFM FE V Q R+ +E 
TGL+PCF +S +++VSL Sbjct: 323 NGGSIVDSGTTFTFMNRGVFEPVLDAFVKQVKGIPRSESIEIITGLKPCFNISRQETVSL 382 Query: 1151 PKLVFHFKGGAEMELPLANYFSFIDDDSGVVCMTFVTDGVGGSDIRSGPSVILGNYQMQN 1330 P+L FHFKGGAEM LP+ANYFSF + V+C+T VTD G ++ +GPS+ILGN+QMQN Sbjct: 383 PELKFHFKGGAEMTLPIANYFSFA-GEIDVICLTMVTDSAFGPELSTGPSIILGNFQMQN 441 Query: 1331 YYVEYDLKNERLGFRQQQTCK 1393 Y VE+DLKNE+ GF+QQ CK Sbjct: 442 YLVEFDLKNEKFGFKQQM-CK 461 >ref|XP_002534234.1| pepsin A, putative [Ricinus communis] gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis] Length = 468 Score = 471 bits (1213), Expect = e-130 Identities = 235/429 (54%), Positives = 293/429 (68%), Gaps = 9/429 (2%) Frame = +2 Query: 122 PSPNPWQKLTHMASASLTRAKHLKNPKNTNLSTI--PLYPRSYGGYSISLSFGTPPQTIP 295 PS +PW+ L H+A+ S++RA HLK+PK TN S I PL+ RSYGGYS+SLS GTP QT+ Sbjct: 40 PSSDPWEYLNHLATTSISRAHHLKSPK-TNFSLIKTPLFSRSYGGYSMSLSLGTPSQTVK 98 Query: 296 FIMDTGSDLVWFPCTHRYLCKNCSFSNPN-TKIPTFIPXXXXXXXXXXXXXXXXTWVHDP 472 IMDTGS LVWFPCT RY+C +C+F N + TKIP F+P WV Sbjct: 99 LIMDTGSSLVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSKLIGCKNPKCAWVFGS 158 Query: 473 DVQSRCTDCEPNSTNCSQICPPYXXXXXXXXXXXXXXXETLDLPKKKVPNFVVGCSLFSS 652 VQS+C +C P + NC+Q CPPY ET++ P K + +F+ GCSL S+ Sbjct: 159 SVQSKCHNCNPQAQNCTQACPPYIIQYGLGSTAGLLLSETINFPNKTISDFLAGCSLLST 218 Query: 653 RSPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDSSESTSLVLDGESDPGDTKTNGVS 832 R P GIAGFGR SLP QL L KFSYCL+ RFDDS S+ L+LD D+KT G+S Sbjct: 219 RQPEGIAGFGRSQESLPLQLGLKKFSYCLVSRRFDDSPVSSDLILDMGPSTSDSKTTGLS 278 Query: 833 YTPLLKN-QDNSNPVFSVYYYVGLRKISIGGKHVKIPYSYLSLGSDGNGGTIVDSGTTFT 1009 YTP KN SNP F YYYV LRKI +G HVK+PYS+L GSDGNGGTIVDSG+TFT Sbjct: 279 YTPFQKNLASQSNPAFQEYYYVMLRKIIVGKTHVKVPYSFLVPGSDGNGGTIVDSGSTFT 338 Query: 1010 FMEGRAFELVAREVENQTAHYRRATDVETRTGLRPCFYVSNEKSVSLPKLVFHFKGGAEM 1189 F+EG FEL+A+E E Q A+Y AT+V+ TGLRPCF +S EKSV +P L F FKGGA+M Sbjct: 339 FVEGHVFELLAKEFEKQMANYTVATNVQKLTGLRPCFDISGEKSVVIPDLTFQFKGGAKM 398 Query: 1190 ELPLANYFSFIDDDSGVVCMTFVTD-----GVGGSDIRSGPSVILGNYQMQNYYVEYDLK 1354 +LPL+NYF+F+ D GVVC+T V+D G G 
SGP++ILGN+Q QN+Y+EYDL+ Sbjct: 399 QLPLSNYFAFV--DMGVVCLTIVSDNAAALGGDGGVRSSGPAIILGNFQQQNFYIEYDLE 456 Query: 1355 NERLGFRQQ 1381 N+R GF++Q Sbjct: 457 NDRFGFKEQ 465 >gb|EYU18131.1| hypothetical protein MIMGU_mgv1a025649mg [Mimulus guttatus] Length = 462 Score = 471 bits (1211), Expect = e-130 Identities = 249/449 (55%), Positives = 299/449 (66%), Gaps = 11/449 (2%) Frame = +2 Query: 80 STTITLSHNHFDINPSP---NPWQKLTHMASASLTRAKHLKNPKNTNLSTI-----PLYP 235 STT++LS +P P NPWQ+L H+++AS TRA LK+P NT+ S PL+P Sbjct: 24 STTLSLSPT--TASPPPPLANPWQRLNHLSAASSTRAHLLKHP-NTSTSAAAATKAPLFP 80 Query: 236 RSYGGYSISLSFGTPPQTIPFIMDTGSDLVWFPCTHRYLCKNCSFSNPN-TKIPTFIPXX 412 R YGGYSISLSFGTPPQT+PF+MDTGS LVWFPCT RY C +C+F N N + I F+P Sbjct: 81 RGYGGYSISLSFGTPPQTLPFVMDTGSSLVWFPCTQRYACNSCNFVNVNPSNISIFLPKS 140 Query: 413 XXXXXXXXXXXXXXTWVHDPDVQSRCTDCEPNSTNCSQICPPYXXXXXXXXXXXXXXXET 592 W+ PDVQ C +C+ NST C + CPPY ET Sbjct: 141 SSSSMIIGCKNPKCRWIF-PDVQ--CKNCDQNSTTCKEFCPPYIIQYGSGSTTGLLLSET 197 Query: 593 LDLPKKKVPNFVVGCSLFSSRSPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDSSES 772 L P+K V NF VGCS+FSSR P+GIAGFGRGP SLP+Q+ L +FSYCL+ HRFDD S Sbjct: 198 LFFPEKSVENFFVGCSIFSSRQPAGIAGFGRGPESLPAQMGLKRFSYCLVSHRFDDEPVS 257 Query: 773 TSLVLDGESDPGDTKTNGVSYTPLLKNQDNSNPVFSVYYYVGLRKISIGGKHVKIPYSYL 952 + LV G GV YTP KN ++NP F YYYV LRKI++GG HVK PY +L Sbjct: 258 SDLVFVGGGGAAGAAA-GVEYTPFRKNPKSANPAFQDYYYVTLRKITVGGVHVKAPYEFL 316 Query: 953 SLGSDGNGGTIVDSGTTFTFMEGRAFELVAREVENQTA--HYRRATDVETRTGLRPCFYV 1126 + G+GGTIVDSGTTFTFME R FE VA E E Q +Y RA +VE R+GLRPCF V Sbjct: 317 VADAAGDGGTIVDSGTTFTFMESRVFEPVAEEFEKQVGRRNYSRAREVEDRSGLRPCFNV 376 Query: 1127 SNEKSVSLPKLVFHFKGGAEMELPLANYFSFIDDDSGVVCMTFVTDGVGGSDIRSGPSVI 1306 S E SVSLP+L FHFKGGAEM LPLA+YFSF+DD V+CMT VT+ I GP++I Sbjct: 377 SGEGSVSLPELSFHFKGGAEMVLPLADYFSFLDD--SVICMTVVTNNSTREGIGPGPAII 434 Query: 1307 LGNYQMQNYYVEYDLKNERLGFRQQQTCK 1393 LGNYQ QN+Y+EYDL+NERLGF+ +Q CK Sbjct: 435 LGNYQQQNFYMEYDLENERLGFK-RQLCK 462 >ref|XP_007027933.1| Eukaryotic aspartyl protease family protein, putative isoform 1 
[Theobroma cacao] gi|590632770|ref|XP_007027934.1| Eukaryotic aspartyl protease family protein, putative isoform 1 [Theobroma cacao] gi|590632774|ref|XP_007027935.1| Eukaryotic aspartyl protease family protein, putative isoform 1 [Theobroma cacao] gi|508716538|gb|EOY08435.1| Eukaryotic aspartyl protease family protein, putative isoform 1 [Theobroma cacao] gi|508716539|gb|EOY08436.1| Eukaryotic aspartyl protease family protein, putative isoform 1 [Theobroma cacao] gi|508716540|gb|EOY08437.1| Eukaryotic aspartyl protease family protein, putative isoform 1 [Theobroma cacao] Length = 472 Score = 471 bits (1211), Expect = e-130 Identities = 232/444 (52%), Positives = 296/444 (66%), Gaps = 10/444 (2%) Frame = +2 Query: 80 STTITLSHNHFDINPSPNPWQKLTHMASASLTRAKHLKNPKN---------TNLSTIPLY 232 STTI +S + F PS + +Q L ++A++S++RA HLK P + ++L PL+ Sbjct: 28 STTIKISLSPFPHPPSFDAYQILNNLATSSVSRAHHLKQPTHKIKAKANTTSSLLKTPLF 87 Query: 233 PRSYGGYSISLSFGTPPQTIPFIMDTGSDLVWFPCTHRYLCKNCSFSNPNTK-IPTFIPX 409 P SYGGY+ISL GTPPQT+ FIMDTGS L WFPCT RY+C C+F N + K IPTF P Sbjct: 88 PHSYGGYTISLGIGTPPQTLTFIMDTGSSLSWFPCTSRYICSQCAFPNVDPKKIPTFSPK 147 Query: 410 XXXXXXXXXXXXXXXTWVHDPDVQSRCTDCEPNSTNCSQICPPYXXXXXXXXXXXXXXXE 589 W+ PDV+SRC DCEP S NC+Q CPPY E Sbjct: 148 LSSSKALVGCKNPKCRWLFGPDVESRCQDCEPASKNCTQNCPPYIIQYGLGSTGGLLLVE 207 Query: 590 TLDLPKKKVPNFVVGCSLFSSRSPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDSSE 769 L +K +F+VGCS+FS+R P+GI GFGR P SLPSQL + KFSYCL+ RFDD+ Sbjct: 208 NLVFSQKTFQDFLVGCSIFSNRQPAGIVGFGRRPESLPSQLGVKKFSYCLVSRRFDDTGV 267 Query: 770 STSLVLDGESDPGDTKTNGVSYTPLLKNQDNSNPVFSVYYYVGLRKISIGGKHVKIPYSY 949 S++++L+ S GD KT G+SYTP KNQ S+P+F +YYV +RKI +G KHVK+PY Y Sbjct: 268 SSNMLLETGSGSGDAKTKGLSYTPFYKNQFASHPIFQEFYYVTIRKILVGDKHVKVPYKY 327 Query: 950 LSLGSDGNGGTIVDSGTTFTFMEGRAFELVAREVENQTAHYRRATDVETRTGLRPCFYVS 1129 L G DGNGGTIVDSG+TFTFME FELV++E E Q +Y RA +VE ++GL PC +S Sbjct: 328 LVPGPDGNGGTIVDSGSTFTFMERAVFELVSKEFEKQMGNYSRAHEVENKSGLAPCVNIS 387 Query: 1130 NEKSVSLPKLVFHFKGGAEMELPLANYFSFIDDDSGVVCMTFVTDGVGGSDIRSGPSVIL 
1309 KS+S P+L+F FKGGA+M LPLANYFSF+ D VVC+ VTD + G + GP++IL Sbjct: 388 GHKSISFPELIFQFKGGAKMALPLANYFSFL--DVNVVCLMVVTDNIIGQGVSGGPAIIL 445 Query: 1310 GNYQMQNYYVEYDLKNERLGFRQQ 1381 GN+Q QNYY+EYDL NE GF +Q Sbjct: 446 GNFQQQNYYIEYDLANESFGFAKQ 469 >ref|XP_007162958.1| hypothetical protein PHAVU_001G194500g [Phaseolus vulgaris] gi|561036422|gb|ESW34952.1| hypothetical protein PHAVU_001G194500g [Phaseolus vulgaris] Length = 466 Score = 470 bits (1210), Expect = e-130 Identities = 241/439 (54%), Positives = 298/439 (67%), Gaps = 3/439 (0%) Frame = +2 Query: 86 TITLSHNHFDINP-SPNPWQKLTHMASASLTRAKHLKNPKNT-NLSTIPLYPRSYGGYSI 259 TITL + P S +P+ L ASASLTRA HLK+ N + +T +YP+SYGGYSI Sbjct: 28 TITLPLSPLLTKPQSSDPFHSLKLAASASLTRAHHLKHRLNAPSAATTQVYPKSYGGYSI 87 Query: 260 SLSFGTPPQTIPFIMDTGSDLVWFPCTHRYLCKNCSFSNPN-TKIPTFIPXXXXXXXXXX 436 L+FGTPPQT PF++DTGS LVWFPCT RYLC +C F N + TKIPTFIP Sbjct: 88 DLNFGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCLFPNIDPTKIPTFIPKNSSTSRLLG 147 Query: 437 XXXXXXTWVHDPDVQSRCTDCEPNSTNCSQICPPYXXXXXXXXXXXXXXXETLDLPKKKV 616 ++ D+QSRC C+P+S NCS CPPY + L+ P+K V Sbjct: 148 CKNPKCGYLFGSDLQSRCPQCKPDSQNCSLTCPPYIIQYGLGSTAGFLLLDNLNFPEKIV 207 Query: 617 PNFVVGCSLFSSRSPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDSSESTSLVLDGE 796 P F+VGCS+ S R PSGIAGFGRG SLP+Q+ L +FSYCLL H FDDS+E++ LVL Sbjct: 208 PQFLVGCSILSIRQPSGIAGFGRGQESLPAQMALKRFSYCLLSHNFDDSTENSDLVLQ-I 266 Query: 797 SDPGDTKTNGVSYTPLLKNQDNSNPVFSVYYYVGLRKISIGGKHVKIPYSYLSLGSDGNG 976 S GDTKTNG+SYTP N +NP F YYY+ LRK+ +GGK+VKIP S+L GSDGNG Sbjct: 267 SSTGDTKTNGLSYTPFHPNPSANNPAFLEYYYLSLRKVIVGGKNVKIPLSFLEPGSDGNG 326 Query: 977 GTIVDSGTTFTFMEGRAFELVAREVENQTAHYRRATDVETRTGLRPCFYVSNEKSVSLPK 1156 GTIVDSG+TFTFME A++LV +E Q +Y RA DVE ++GL PCF +S K+V+ PK Sbjct: 327 GTIVDSGSTFTFMERPAYDLVVKEFVKQLGNYSRAEDVEAQSGLGPCFNISGAKTVNFPK 386 Query: 1157 LVFHFKGGAEMELPLANYFSFIDDDSGVVCMTFVTDGVGGSDIRSGPSVILGNYQMQNYY 1336 FKGGA+M LP+ NYFS I DDS VVC+T V+DG G SGP++ILGNYQ QN++ Sbjct: 387 FTLQFKGGAKMTLPVENYFSLI-DDSEVVCLTIVSDGGAGPATTSGPAIILGNYQQQNFH 445 Query: 1337 VEYDLKNERLGFRQQQTCK 
1393 +EYDL+NER GF Q+CK Sbjct: 446 IEYDLENERFGF-GPQSCK 463 >ref|XP_007202027.1| hypothetical protein PRUPE_ppa005104mg [Prunus persica] gi|462397558|gb|EMJ03226.1| hypothetical protein PRUPE_ppa005104mg [Prunus persica] Length = 477 Score = 461 bits (1187), Expect = e-127 Identities = 239/458 (52%), Positives = 299/458 (65%), Gaps = 20/458 (4%) Frame = +2 Query: 80 STTITLSHNHFDINPSPNPWQKLTHMASASLTRAKHLKNPK--NTNLSTIPLYPRSYGGY 253 S+ ITL + F +PS +P Q L+ ASAS++RA H+KN + N++L+ +PL+P SYG Y Sbjct: 22 SSKITLPLSPFPNHPSSDPLQALSFHASASISRAHHIKNSRKPNSSLTQVPLFPHSYGDY 81 Query: 254 SISLSFGTPPQTIPFIMDTGSDLVWFPCTHRYLCKNCSFSNPN-TKIPTFIPXXXXXXXX 430 S+SL+FGTPPQT FIMDTGS LVWFPCT RY+C C F N N KIPTF P Sbjct: 82 SVSLNFGTPPQTSSFIMDTGSSLVWFPCTKRYICSRCQFPNINPAKIPTFKPKLSSSSKI 141 Query: 431 XXXXXXXXTWVHDPDVQSRCTDCE-PNSTNCSQICPPYXXXXXXXXXXXXXXXETLDLPK 607 W+ P+V+S+C +C P+ NCSQ CP Y ETLD PK Sbjct: 142 VGCQNPKCGWIFGPEVKSKCPNCNNPSHQNCSQACPTYIIQYGSGTTAGILLSETLDFPK 201 Query: 608 KKVPNFVVGCSLFSSRSPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDSSESTSLVL 787 K VP+F+VGCS S R P+GIAGFGRGP SLP+Q+ L KFSYCL+ HRFDD+ +S+ LVL Sbjct: 202 KIVPDFLVGCSFVSIRQPAGIAGFGRGPQSLPAQMGLTKFSYCLVSHRFDDTPQSSDLVL 261 Query: 788 DG----------------ESDPGDTKTNGVSYTPLLKNQDNSNPVFSVYYYVGLRKISIG 919 ES TK +S TP KN N F YYY+ LRK+ +G Sbjct: 262 YSSSSGSSSSSEEEPTIAESQRNKTKLQSLSSTPFQKNPGPPNSAFREYYYIMLRKVIVG 321 Query: 920 GKHVKIPYSYLSLGSDGNGGTIVDSGTTFTFMEGRAFELVAREVENQTAHYRRATDVETR 1099 K+VKIPY +L G+D +GGTIVDSG+TFTFME FE VA+E E Q A+Y RA D+E + Sbjct: 322 NKNVKIPYKFLVPGADSSGGTIVDSGSTFTFMEKPVFEPVAKEFEAQMANYTRAKDLENK 381 Query: 1100 TGLRPCFYVSNEKSVSLPKLVFHFKGGAEMELPLANYFSFIDDDSGVVCMTFVTDGVGGS 1279 TGLRPCF +S EK V P+LVF FKGGA+MELP NYFS + SGVVC+T VTDGV G Sbjct: 382 TGLRPCFDISKEKKVDFPELVFQFKGGAKMELPSKNYFSMV-SSSGVVCLTIVTDGVVGP 440 Query: 1280 DIRSGPSVILGNYQMQNYYVEYDLKNERLGFRQQQTCK 1393 GP++ILGNYQ Q+++VEYDL++ + GFR +Q+CK Sbjct: 441 GGNGGPAIILGNYQQQDFHVEYDLQHGKFGFR-KQSCK 477 >ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max] 
Length = 474 Score = 461 bits (1187), Expect = e-127 Identities = 235/436 (53%), Positives = 290/436 (66%), Gaps = 7/436 (1%) Frame = +2 Query: 86 TITLSHNHFDINP---SPNPWQKLTHMASASLTRAKHLKNPKNTN--LSTIPLYPRSYGG 250 TITL + I P +P+ L ASASLTRA HLK+ N + ++T P YP+SYGG Sbjct: 32 TITLPLSPLLIKPHSSDSDPFHSLKFAASASLTRAHHLKHRNNNSPSVATTPAYPKSYGG 91 Query: 251 YSISLSFGTPPQTIPFIMDTGSDLVWFPCTHRYLCKNCSFSN-PNTKIPTFIPXXXXXXX 427 YSI L+ GTPPQT PF++DTGS LVWFPCT RYLC +C+F N TKIPTFIP Sbjct: 92 YSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSSTAK 151 Query: 428 XXXXXXXXXTWVHDPDVQSRCTDCEPNSTNCSQICPPYXXXXXXXXXXXXXXXETLDLPK 607 ++ DVQ RC C+P S NCS CP Y + L+ P Sbjct: 152 LLGCRNPKCGYIFGSDVQFRCPQCKPESQNCSLTCPAYIIQYGLGSTAGFLLLDNLNFPG 211 Query: 608 KKVPNFVVGCSLFSSRSPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDSSESTSLVL 787 K VP F+VGCS+ S R PSGIAGFGRG SLPSQ+ L +FSYCL+ HRFDD+ +S+ LVL Sbjct: 212 KTVPQFLVGCSILSIRQPSGIAGFGRGQESLPSQMNLKRFSYCLVSHRFDDTPQSSDLVL 271 Query: 788 DGESDPGDTKTNGVSYTPLLKNQDNSNPVFSVYYYVGLRKISIGGKHVKIPYSYLSLGSD 967 S GDTKTNG+SYTP N +NP F YYY+ LRK+ +GGK VKIPY++L GSD Sbjct: 272 Q-ISSTGDTKTNGLSYTPFRSNPSTNNPAFKEYYYLTLRKVIVGGKDVKIPYTFLEPGSD 330 Query: 968 GNGGTIVDSGTTFTFMEGRAFELVARE-VENQTAHYRRATDVETRTGLRPCFYVSNEKSV 1144 GNGGTIVDSG+TFTFME + LVA+E V+ +Y RA D ET++GL PCF +S K+V Sbjct: 331 GNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAEDAETQSGLSPCFNISGVKTV 390 Query: 1145 SLPKLVFHFKGGAEMELPLANYFSFIDDDSGVVCMTFVTDGVGGSDIRSGPSVILGNYQM 1324 + P+L F FKGGA+M PL NYFS + D+ VVC+T V+DG G +GP++ILGNYQ Sbjct: 391 TFPELTFKFKGGAKMTQPLQNYFSLV-GDAEVVCLTVVSDGGAGPPKTTGPAIILGNYQQ 449 Query: 1325 QNYYVEYDLKNERLGF 1372 QN+Y+EYDL+NER GF Sbjct: 450 QNFYIEYDLENERFGF 465 >ref|XP_002312826.2| hypothetical protein POPTR_0009s16390g [Populus trichocarpa] gi|550331863|gb|EEE86781.2| hypothetical protein POPTR_0009s16390g [Populus trichocarpa] Length = 462 Score = 460 bits (1183), Expect = e-127 Identities = 238/444 (53%), Positives = 298/444 (67%), Gaps = 6/444 (1%) Frame = +2 Query: 80 
STTITLSHNHFDINPSPNPWQKLTHMASASLTRAKHLKNPKNT--NLSTIPLYPRSYGGY 253 S TI L H + P + +QKL H+ + SL RA+HLKNP+ T +T PL+ SYGGY Sbjct: 24 SITIPLQHPQTNQIPFQDQYQKLNHLVTTSLARARHLKNPQTTPATTTTAPLFSHSYGGY 83 Query: 254 SISLSFGTPPQTIPFIMDTGSDLVWFPCTHRYLCKNCSFSN--PNTKIPTFIPXXXXXXX 427 S+SLSFGTPPQT+ FIMDTGSD+VWFPCT YLCK+CSFS+ P+++I FIP Sbjct: 84 SVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSSSK 143 Query: 428 XXXXXXXXXTWVHDPDVQSRCTDCEPNSTNCSQICPPYXXXXXXXXXXXXXXXETLDLPK 607 +W+H ++ DC S +Q CPPY ETL L Sbjct: 144 LLGCKNPKCSWIHHSNINCD-QDCSIKSC-LNQTCPPYMIFYGSGTTGGVALSETLHLHS 201 Query: 608 KKVPNFVVGCSLFSSRSPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDSSE-STSLV 784 PNF+VGCS+FSS P+GIAGFGRG +SLPSQL L KFSYCLL HRFDD ++ S+SLV Sbjct: 202 LSKPNFLVGCSVFSSHQPAGIAGFGRGLSSLPSQLGLGKFSYCLLSHRFDDDTKKSSSLV 261 Query: 785 LDGESDPGDTKTNGVSYTPLLKNQDNSNPV-FSVYYYVGLRKISIGGKHVKIPYSYLSLG 961 LD E D KTN + YTP +KN N FSVYYY+GLR+I++GG HVK+PY YLS G Sbjct: 262 LDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGHHVKVPYKYLSPG 321 Query: 962 SDGNGGTIVDSGTTFTFMEGRAFELVAREVENQTAHYRRATDVETRTGLRPCFYVSNEKS 1141 DGNGG I+DSGTTFTFM AFE ++ E Q YRR ++E GLRPCF VS+ K+ Sbjct: 322 EDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIGLRPCFNVSDAKT 381 Query: 1142 VSLPKLVFHFKGGAEMELPLANYFSFIDDDSGVVCMTFVTDGVGGSDIRSGPSVILGNYQ 1321 VS P+L +FKGGA++ LP+ NYF+F+ + V C+T VTDGV G + GP +ILGN+Q Sbjct: 382 VSFPELRLYFKGGADVALPVENYFAFVGGE--VACLTVVTDGVAGPERVGGPGMILGNFQ 439 Query: 1322 MQNYYVEYDLKNERLGFRQQQTCK 1393 MQN+YVEYDL+NERLGF+Q++ CK Sbjct: 440 MQNFYVEYDLRNERLGFKQEK-CK 462 >ref|XP_004303503.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Fragaria vesca subsp. 
vesca] Length = 458 Score = 451 bits (1160), Expect = e-124 Identities = 237/445 (53%), Positives = 293/445 (65%), Gaps = 7/445 (1%) Frame = +2 Query: 80 STTITLSHNHFDINPSPN-PWQKLTHMASASLTRAKHLKNPK-NTNLSTIPLYPRSYGGY 253 S+ +TL + +PS + P Q L ++SASL+RA HLK PK N++ + +PLYPRSYGGY Sbjct: 23 SSKLTLPLSPLAKHPSSSDPIQTLNLLSSASLSRAHHLKRPKHNSSATKVPLYPRSYGGY 82 Query: 254 SISLSFGTPPQTIPFIMDTGSDLVWFPCTHRYLCKNCSFSN--PNTKIPTFIPXXXXXXX 427 SISLSFGTPPQ F+MDTGS LVWFPCT RYLC CSF N P+T IP FIP Sbjct: 83 SISLSFGTPPQISTFVMDTGSSLVWFPCTSRYLCSRCSFPNIDPST-IPAFIPKLSSSAR 141 Query: 428 XXXXXXXXXTWVHDPDVQSRCTDCEPNSTNCSQICPPYXXXXXXXXXXXXXXXETLDLPK 607 W+ P+V ++C PNS SQ CP Y E+LD P Sbjct: 142 LLGCKNPKCAWIFGPEVNTKC----PNS---SQACPSYVIQYGSGTTAGVLLSESLDFPD 194 Query: 608 KKVPNFVVGCSLFSSRSPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDSSESTSLVL 787 K VP+F+VGCS S R P+G+AGFGRGP SLP Q+ L+KFSYCL+ HRFDD+ S+ LVL Sbjct: 195 KTVPDFLVGCSFLSIRQPAGMAGFGRGPQSLPVQMGLSKFSYCLVSHRFDDTPVSSDLVL 254 Query: 788 -DGESDPGDT--KTNGVSYTPLLKNQDNSNPVFSVYYYVGLRKISIGGKHVKIPYSYLSL 958 G + GD + +SYTP KN +N + YYY+ LRK+ +G KHVKIPY YL Sbjct: 255 YSGSTSDGDEIDDNHDISYTPFQKNPGAANTAYREYYYLALRKVIVGKKHVKIPYKYLVP 314 Query: 959 GSDGNGGTIVDSGTTFTFMEGRAFELVAREVENQTAHYRRATDVETRTGLRPCFYVSNEK 1138 G D NGGTIVDSG+TFTFME FE VA Q Y RA D+E RTGL+PCF +S E+ Sbjct: 315 GEDDNGGTIVDSGSTFTFMERPVFEAVAEAFATQMEKYTRAGDIENRTGLKPCFDISKEE 374 Query: 1139 SVSLPKLVFHFKGGAEMELPLANYFSFIDDDSGVVCMTFVTDGVGGSDIRSGPSVILGNY 1318 V P+LVF FKGGA+M +PL NYF+ + D GVVC+T VTDGV G + +GP+VILGN+ Sbjct: 375 KVDFPELVFQFKGGAKMAMPLNNYFALVTSD-GVVCLTIVTDGVAGPGVAAGPAVILGNF 433 Query: 1319 QMQNYYVEYDLKNERLGFRQQQTCK 1393 Q QN+YVEYDL+ ER GF+ +Q+CK Sbjct: 434 QQQNFYVEYDLERERFGFK-KQSCK 457 >ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus] Length = 457 Score = 451 bits (1160), Expect = e-124 Identities = 233/442 (52%), Positives = 283/442 (64%), Gaps = 5/442 (1%) Frame = +2 Query: 80 STTITLSHNHFDINPSPNPWQKLTHMASASLTRAKHLKNPKNTNLSTIPLYPRSYGGYSI 259 S ITL N F SP+P Q LT +AS+S TRA 
+K PK+ ++ PL P SYG YS Sbjct: 24 SNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQIKTPKSNSVFKSPLSPHSYGAYST 83 Query: 260 SLSFGTPPQTIPFIMDTGSDLVWFPCTHRYLCKNCSFSNPN-TKIPTFIPXXXXXXXXXX 436 LSFGTP QT+ I DTGS LVWFPCT RYLC CSF + T IP F+P Sbjct: 84 PLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVG 143 Query: 437 XXXXXXTWVHDPDVQSRCTDCEPNSTNCSQICPPYXXXXXXXXXXXXXXXETLDLPKKKV 616 +W+ PDV+S+C C P + NC+Q CP Y ETLD P KK+ Sbjct: 144 CQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKKI 203 Query: 617 PNFVVGCSLFSSRSPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDSSESTSLVLDGE 796 PNFVVGCS S PSGIAGFGRG SLPSQ+ L KF+YCL +FDDS S L+LD Sbjct: 204 PNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGQLILDST 263 Query: 797 SDPGDTKTNGVSYTPLLKNQDNSNPVFSVYYYVGLRKISIGGKHVKIPYSYLSLGSDGNG 976 K++G++YTP +N SN + YYY+ +RKI +G + VK+PY +L G DGNG Sbjct: 264 G----VKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNG 319 Query: 977 GTIVDSGTTFTFMEGRAFELVAREVENQTAHYRRATDVETRTGLRPCFYVSNEKSVSLPK 1156 G+I+DSG+TFTFM+ E+VARE E Q A++ RATDVET TGLRPCF +S EKSV P+ Sbjct: 320 GSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDISKEKSVKFPE 379 Query: 1157 LVFHFKGGAEMELPLANYFSFIDDDSGVVCMTFVT----DGVGGSDIRSGPSVILGNYQM 1324 L+F FKGGA+ LPL NYF+ + SGV C+T VT DG GG GPSVILG +Q Sbjct: 380 LIFQFKGGAKWALPLNNYFALV-SSSGVACLTVVTHQMEDGGGGG---GGPSVILGAFQQ 435 Query: 1325 QNYYVEYDLKNERLGFRQQQTC 1390 QN+YVEYDL N+RLGFR QQTC Sbjct: 436 QNFYVEYDLVNQRLGFR-QQTC 456 >ref|XP_006424129.1| hypothetical protein CICLE_v10028374mg [Citrus clementina] gi|557526063|gb|ESR37369.1| hypothetical protein CICLE_v10028374mg [Citrus clementina] Length = 467 Score = 449 bits (1155), Expect = e-123 Identities = 236/451 (52%), Positives = 295/451 (65%), Gaps = 14/451 (3%) Frame = +2 Query: 83 TTITLSHNHFDINPSPNPWQKLTHMASASLTRAKHLKNPKN--------TNLSTIPLYPR 238 T++T S + F NPS + +Q L + S+SLTRA H+KNP+ T +T + Sbjct: 27 TSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTTTNISSH 86 Query: 239 SYGGYSISLSFGTPPQTIPFIMDTGSDLVWFPCTHRYLCKNCSFSNPNTKIPTFIPXXXX 418 SYGGYSISLSFGTPPQ 
IPFI+DTGS LVWFPCT+ Y CK CS S KIP+FIP Sbjct: 87 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS----KIPSFIPKLSS 142 Query: 419 XXXXXXXXXXXXTWVHDPDVQSRCTDCEPNST--NCSQICPPYXXXXXXXXXXXXXXXET 592 +W+H +Q R + EP +T NC+QICP Y ET Sbjct: 143 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 202 Query: 593 LDLPKKKVPNFVVGCSLFSSRSPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDSSES 772 L+LP + +PNF+VGCS+ SSR P+GIAGFGRG SLPSQL L+KFSYCLL H+FDD++ + Sbjct: 203 LNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 262 Query: 773 TSLVLDGESDPGDTKTNGVSYTPLLKNQD-NSNPVFSVYYYVGLRKISIGGKHVKIPYSY 949 +SL+LD S D KT G++YTP + N FSVYYYVGLR+I++GG+ V++ Y Y Sbjct: 263 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWYKY 322 Query: 950 LSLGSDGNGGTIVDSGTTFTFMEGRAFELVAREVENQ---TAHYRRATDVETRTGLRPCF 1120 L+L DGNGGTIVDSGTTFTFM FE +A E +Q +Y RA E TGLRPCF Sbjct: 323 LTLDRDGNGGTIVDSGTTFTFMVPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 382 Query: 1121 YVSNEKSVSLPKLVFHFKGGAEMELPLANYFSFIDDDSGVVCMTFVTDGVGGSDIRSGPS 1300 V EK S P+L HFKGGAE+ LP+ NYF+ + + S VC+T VTD + GPS Sbjct: 383 DVPGEKVASFPELKLHFKGGAEVTLPVENYFAVVGEGS-AVCLTVVTD----REASGGPS 437 Query: 1301 VILGNYQMQNYYVEYDLKNERLGFRQQQTCK 1393 +ILGN+QMQNYYVEYDL+N+RLGF+ QQ CK Sbjct: 438 IILGNFQMQNYYVEYDLRNQRLGFK-QQLCK 467 >ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase nepenthesin-2-like [Cucumis sativus] Length = 457 Score = 449 bits (1154), Expect = e-123 Identities = 232/442 (52%), Positives = 282/442 (63%), Gaps = 5/442 (1%) Frame = +2 Query: 80 STTITLSHNHFDINPSPNPWQKLTHMASASLTRAKHLKNPKNTNLSTIPLYPRSYGGYSI 259 S ITL N F SP+P Q LT +AS+S TRA +K PK+ ++ PL P SYG YS Sbjct: 24 SNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQIKTPKSNSVFKSPLSPHSYGAYST 83 Query: 260 SLSFGTPPQTIPFIMDTGSDLVWFPCTHRYLCKNCSFSNPN-TKIPTFIPXXXXXXXXXX 436 LSFGTP QT+ I DTGS LVWFPCT RYLC CSF + T IP F+P Sbjct: 84 PLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVG 143 Query: 437 XXXXXXTWVHDPDVQSRCTDCEPNSTNCSQICPPYXXXXXXXXXXXXXXXETLDLPKKKV 616 +W+ PDV+S+C C P + NC+Q CP Y ETLD P 
K + Sbjct: 144 CQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKXI 203 Query: 617 PNFVVGCSLFSSRSPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDSSESTSLVLDGE 796 PNFVVGCS S PSGIAGFGRG SLPSQ+ L KF+YCL +FDDS S L+LD Sbjct: 204 PNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGQLILDST 263 Query: 797 SDPGDTKTNGVSYTPLLKNQDNSNPVFSVYYYVGLRKISIGGKHVKIPYSYLSLGSDGNG 976 K++G++YTP +N SN + YYY+ +RKI +G + VK+PY +L G DGNG Sbjct: 264 G----VKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNG 319 Query: 977 GTIVDSGTTFTFMEGRAFELVAREVENQTAHYRRATDVETRTGLRPCFYVSNEKSVSLPK 1156 G+I+DSG+TFTFM+ E+VARE E Q A++ RATDVET TGLRPCF +S EKSV P+ Sbjct: 320 GSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDISKEKSVKFPE 379 Query: 1157 LVFHFKGGAEMELPLANYFSFIDDDSGVVCMTFVT----DGVGGSDIRSGPSVILGNYQM 1324 L+F FKGGA+ LPL NYF+ + SGV C+T VT DG GG GPSVILG +Q Sbjct: 380 LIFQFKGGAKWALPLNNYFALV-SSSGVACLTVVTHQMEDGGGGG---GGPSVILGAFQQ 435 Query: 1325 QNYYVEYDLKNERLGFRQQQTC 1390 QN+YVEYDL N+RLGFR QQTC Sbjct: 436 QNFYVEYDLVNQRLGFR-QQTC 456 >ref|XP_006481530.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Citrus sinensis] Length = 465 Score = 447 bits (1150), Expect = e-123 Identities = 234/449 (52%), Positives = 296/449 (65%), Gaps = 12/449 (2%) Frame = +2 Query: 83 TTITLSHNHFDINPSPNPWQKLTHMASASLTRAKHLKNPKN------TNLSTIPLYPRSY 244 T++T S + F NPS + +Q L + S+SLTRA H+KNP+ T +T + SY Sbjct: 27 TSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSY 86 Query: 245 GGYSISLSFGTPPQTIPFIMDTGSDLVWFPCTHRYLCKNCSFSNPNTKIPTFIPXXXXXX 424 GGYSISLSFGTPPQ IPFI+DTGS LVWFPCT+ Y CK CS S KIP+FIP Sbjct: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS----KIPSFIPKLSSSS 142 Query: 425 XXXXXXXXXXTWVHDPDVQSRCTDCEPNST--NCSQICPPYXXXXXXXXXXXXXXXETLD 598 +W+H +Q R + EP +T NC+QICP Y ETL+ Sbjct: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN 202 Query: 599 LPKKKVPNFVVGCSLFSSRSPSGIAGFGRGPASLPSQLRLNKFSYCLLPHRFDDSSESTS 778 LP + +PNF+VGCS+ SSR P+GIAGFGRG SLPSQL L+KFSYCLL H+FDD++ ++S Sbjct: 203 
LPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262 Query: 779 LVLDGESDPGDTKTNGVSYTPLLKNQD-NSNPVFSVYYYVGLRKISIGGKHVKIPYSYLS 955 L+LD S D KT G++YTP + N FSVYYYVGLR+I++GG+ V++ + YL+ Sbjct: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322 Query: 956 LGSDGNGGTIVDSGTTFTFMEGRAFELVAREVENQ---TAHYRRATDVETRTGLRPCFYV 1126 L DGNGGTIVDSGTTFTFM FE +A E +Q +Y RA E TGLRPCF V Sbjct: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382 Query: 1127 SNEKSVSLPKLVFHFKGGAEMELPLANYFSFIDDDSGVVCMTFVTDGVGGSDIRSGPSVI 1306 EK+ S P+L HFKGGAE+ LP+ NYF+ + + S VC+T VTD + GP++I Sbjct: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS-AVCLTVVTD----REASGGPAII 437 Query: 1307 LGNYQMQNYYVEYDLKNERLGFRQQQTCK 1393 LGN+QMQNYYVEYDL+N+RLGF+ QQ CK Sbjct: 438 LGNFQMQNYYVEYDLRNQRLGFK-QQLCK 465