BLASTX nr result
ID: Achyranthes23_contig00005037
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Achyranthes23_contig00005037 (1676 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_004303503.1| PREDICTED: aspartic proteinase nepenthesin-2... 380 e-103 gb|ESW34952.1| hypothetical protein PHAVU_001G194500g [Phaseolus... 377 e-102 ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2... 377 e-102 ref|XP_002534234.1| pepsin A, putative [Ricinus communis] gi|223... 375 e-101 ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2... 375 e-101 gb|EMJ03226.1| hypothetical protein PRUPE_ppa005104mg [Prunus pe... 374 e-101 ref|XP_004494242.1| PREDICTED: aspartic proteinase nepenthesin-1... 371 e-100 gb|EOY08435.1| Eukaryotic aspartyl protease family protein, puta... 371 e-100 ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2... 370 e-100 emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera] 369 3e-99 ref|XP_002877867.1| aspartyl protease family protein [Arabidopsi... 364 7e-98 ref|XP_004241344.1| PREDICTED: aspartic proteinase nepenthesin-2... 363 1e-97 ref|XP_002309394.1| aspartyl protease family protein [Populus tr... 361 6e-97 ref|NP_566966.1| aspartyl protease family protein [Arabidopsis t... 360 1e-96 gb|AAS48510.2| aspartic protease [Fagopyrum esculentum] gi|82780... 357 7e-96 ref|XP_006361102.1| PREDICTED: aspartic proteinase nepenthesin-1... 357 9e-96 ref|XP_006403798.1| hypothetical protein EUTSA_v10010339mg [Eutr... 356 1e-95 gb|ESW17797.1| hypothetical protein PHAVU_007G269300g [Phaseolus... 355 4e-95 gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana] 355 4e-95 ref|XP_006291053.1| hypothetical protein CARUB_v10017168mg [Caps... 352 3e-94 >ref|XP_004303503.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Fragaria vesca subsp. vesca] Length = 458 Score = 380 bits (977), Expect = e-103 Identities = 207/442 (46%), Positives = 272/442 (61%), Gaps = 4/442 (0%) Frame = -3 Query: 1461 ITLTLSPIPNQHPSQNSWEFNSNIVRT-TIARAYHLKNPKKNTFSTTF-LQARSYGGYSV 1288 +TL LSP+ +HPS + N++ + +++RA+HLK PK N+ +T L RSYGGYS+ Sbjct: 26 LTLPLSPLA-KHPSSSDPIQTLNLLSSASLSRAHHLKRPKHNSSATKVPLYPRSYGGYSI 84 Query: 1287 TLNFGTPPQKIPLVFDTGSSISWVPCTSNYECSNCTFSDIDPHKITTFKPELSSTTRVIG 1108 +L+FGTPPQ V DTGSS+ W PCTS Y CS C+F +IDP I F P+LSS+ R++G Sbjct: 85 SLSFGTPPQISTFVMDTGSSLVWFPCTSRYLCSRCSFPNIDPSTIPAFIPKLSSSARLLG 144 Query: 1107 CRNPKCKWIFSGSDVESLXXXXXXXXXSTDCPGYYLQYGLGSTTGYAISDTLDLPGTTVQ 928 C+NPKC WIF G +V + S CP Y +QYG G+T G +S++LD P TV Sbjct: 145 CKNPKCAWIF-GPEVNT-----KCPNSSQACPSYVIQYGSGTTAGVLLSESLDFPDKTVP 198 Query: 927 DFLLGCSISSNRQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPHKFDDSPRNSKLILEGSG 748 DFL+GCS S RQP G+ GFGR SLP Q+ L KFSYCL+ H+FDD+P +S L+L SG Sbjct: 199 DFLVGCSFLSIRQPAGMAGFGRGPQSLPVQMGLSKFSYCLVSHRFDDTPVSSDLVLY-SG 257 Query: 747 KKTHLTHTGRGFEPQYTPF--XXXXXXXXXXXXXXXXXXXXXXXXXXXKLPDKHLVPNSN 574 + + YTPF K+P K+LVP + Sbjct: 258 STSDGDEIDDNHDISYTPFQKNPGAANTAYREYYYLALRKVIVGKKHVKIPYKYLVPGED 317 Query: 573 GDGGTVVDSGMTYTFMDRPLFDPFVQELATQITHYKRADDIESGSGFGLCYDVKNVRNVT 394 +GGT+VDSG T+TFM+RP+F+ + ATQ+ Y RA DIE+ +G C+D+ V Sbjct: 318 DNGGTIVDSGSTFTFMERPVFEAVAEAFATQMEKYTRAGDIENRTGLKPCFDISKEEKVD 377 Query: 393 FPKLVFHFKGGVKMLMPLANYFKYLKDFDALCLSIVTNTSPVAPRRGPVGPAMILGNYQQ 214 FP+LVF FKGG KM MPL NYF + +CL+IVT+ VA GPA+ILGN+QQ Sbjct: 378 FPELVFQFKGGAKMAMPLNNYFALVTSDGVVCLTIVTD--GVAGPGVAAGPAVILGNFQQ 435 Query: 213 QNFNVEYDLENQRLGIRKQKCK 148 QNF VEYDLE +R G +KQ CK Sbjct: 436 QNFYVEYDLERERFGFKKQSCK 457 >gb|ESW34952.1| hypothetical protein PHAVU_001G194500g [Phaseolus vulgaris] Length = 466 Score = 377 bits (969), Expect = e-102 Identities = 202/446 (45%), Positives = 268/446 (60%), Gaps = 8/446 (1%) Frame = -3 Query: 1461 ITLTLSPIPNQHPSQNSWEFNSNIVRTTIARAYHLKNP-KKNTFSTTFLQARSYGGYSVT 1285 ITL LSP+ + S + + ++ RA+HLK+ + +TT + +SYGGYS+ Sbjct: 29 ITLPLSPLLTKPQSSDPFHSLKLAASASLTRAHHLKHRLNAPSAATTQVYPKSYGGYSID 88 Query: 1284 LNFGTPPQKIPLVFDTGSSISWVPCTSNYECSNCTFSDIDPHKITTFKPELSSTTRVIGC 1105 LNFGTPPQ P V DTGSS+ W PCTS Y CS+C F +IDP KI TF P+ SST+R++GC Sbjct: 89 LNFGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCLFPNIDPTKIPTFIPKNSSTSRLLGC 148 Query: 1104 RNPKCKWIFSGSDVESLXXXXXXXXXSTD--CPGYYLQYGLGSTTGYAISDTLDLPGTTV 931 +NPKC ++F GSD++S + CP Y +QYGLGST G+ + D L+ P V Sbjct: 149 KNPKCGYLF-GSDLQSRCPQCKPDSQNCSLTCPPYIIQYGLGSTAGFLLLDNLNFPEKIV 207 Query: 930 QDFLLGCSISSNRQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPHKFDDSPRNSKLILEGS 751 FL+GCSI S RQP GI GFGR SLP Q+ L++FSYCLL H FDDS NS L+L+ S Sbjct: 208 PQFLVGCSILSIRQPSGIAGFGRGQESLPAQMALKRFSYCLLSHNFDDSTENSDLVLQIS 267 Query: 750 GKKTHLTHTGRGFEPQYTPF--XXXXXXXXXXXXXXXXXXXXXXXXXXXKLPDKHLVPNS 577 T+ YTPF K+P L P S Sbjct: 268 STGDTKTN-----GLSYTPFHPNPSANNPAFLEYYYLSLRKVIVGGKNVKIPLSFLEPGS 322 Query: 576 NGDGGTVVDSGMTYTFMDRPLFDPFVQELATQITHYKRADDIESGSGFGLCYDVKNVRNV 397 +G+GGT+VDSG T+TFM+RP +D V+E Q+ +Y RA+D+E+ SG G C+++ + V Sbjct: 323 DGNGGTIVDSGSTFTFMERPAYDLVVKEFVKQLGNYSRAEDVEAQSGLGPCFNISGAKTV 382 Query: 396 TFPKLVFHFKGGVKMLMPLANYFKYLKDFDALCLSIVTNTSPVAPRRGPV---GPAMILG 226 FPK FKGG KM +P+ NYF + D + +CL+IV++ GP GPA+ILG Sbjct: 383 NFPKFTLQFKGGAKMTLPVENYFSLIDDSEVVCLTIVSDGG-----AGPATTSGPAIILG 437 Query: 225 NYQQQNFNVEYDLENQRLGIRKQKCK 148 NYQQQNF++EYDLEN+R G Q CK Sbjct: 438 NYQQQNFHIEYDLENERFGFGPQSCK 463 >ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max] Length = 470 Score = 377 bits (969), Expect = e-102 Identities = 202/447 (45%), Positives = 274/447 (61%), Gaps = 9/447 (2%) Frame = -3 Query: 1461 ITLTLSPIPNQHPSQNSWEFNSN--IVRTTIARAYHLKNPKKNTFS--TTFLQARSYGGY 1294 ITL LSP+ + S +S F+S +++ RA+HLK+ N+ S TT +SYGGY Sbjct: 29 ITLPLSPLLTKPHSSDSDPFHSVKLAASSSLTRAHHLKHRNNNSPSVATTPAYPKSYGGY 88 Query: 1293 SVTLNFGTPPQKIPLVFDTGSSISWVPCTSNYECSNCTFSDIDPHKITTFKPELSSTTRV 1114 S+ LN GTPPQ P V DTGSS+ W PCTS+Y CS+C F +IDP KI TF P+ SST ++ Sbjct: 89 SIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSSTAKL 148 Query: 1113 IGCRNPKCKWIFSGSDVESLXXXXXXXXXST---DCPGYYLQYGLGSTTGYAISDTLDLP 943 +GCRNPKC ++F G DVES CP Y +QYGLG+T G+ + D L+ P Sbjct: 149 LGCRNPKCGYLF-GPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGATAGFLLLDNLNFP 207 Query: 942 GTTVQDFLLGCSISSNRQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPHKFDDSPRNSKLI 763 G TV FL+GCSI S RQP GI GFGR SLP+Q+ L++FSYCL+ H+FDD+P++S L+ Sbjct: 208 GKTVPQFLVGCSILSIRQPSGIAGFGRGQESLPSQMNLKRFSYCLVSHRFDDTPQSSDLV 267 Query: 762 LEGSGKKTHLTHTGRGFEPQYTPF-XXXXXXXXXXXXXXXXXXXXXXXXXXXKLPDKHLV 586 L+ S T+ YTPF K+P K L Sbjct: 268 LQISSTGDTKTN-----GLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVDVKIPYKFLE 322 Query: 585 PNSNGDGGTVVDSGMTYTFMDRPLFDPFVQELATQI-THYKRADDIESGSGFGLCYDVKN 409 P S+G+GGT+VDSG T+TFM+RP+++ QE Q+ Y R +++E+ SG C+++ Sbjct: 323 PGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQSGLSPCFNISG 382 Query: 408 VRNVTFPKLVFHFKGGVKMLMPLANYFKYLKDFDALCLSIVTNTSPVAPRRGPVGPAMIL 229 V+ ++FP+ F FKGG KM PL NYF ++ D + LC ++V++ P+ GPA+IL Sbjct: 383 VKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVLCFTVVSDGGAGQPK--TAGPAIIL 440 Query: 228 GNYQQQNFNVEYDLENQRLGIRKQKCK 148 GNYQQQNF VEYDLEN+R G + CK Sbjct: 441 GNYQQQNFYVEYDLENERFGFGPRNCK 467 >ref|XP_002534234.1| pepsin A, putative [Ricinus communis] gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis] Length = 468 Score = 375 bits (964), Expect = e-101 Identities = 195/445 (43%), Positives = 270/445 (60%), Gaps = 8/445 (1%) Frame = -3 Query: 1461 ITLTLSPIPNQHPSQNSWEFNSNIVRTTIARAYHLKNPKKN-TFSTTFLQARSYGGYSVT 1285 IT+ LSP + PS + WE+ +++ T+I+RA+HLK+PK N + T L +RSYGGYS++ Sbjct: 28 ITIPLSPTITKRPSSDPWEYLNHLATTSISRAHHLKSPKTNFSLIKTPLFSRSYGGYSMS 87 Query: 1284 LNFGTPPQKIPLVFDTGSSISWVPCTSNYECSNCTFSDIDPHKITTFKPELSSTTRVIGC 1105 L+ GTP Q + L+ DTGSS+ W PCTS Y C++C F + D KI F P LSS++++IGC Sbjct: 88 LSLGTPSQTVKLIMDTGSSLVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSKLIGC 147 Query: 1104 RNPKCKWIFSGSDVESLXXXXXXXXXSTD-CPGYYLQYGLGSTTGYAISDTLDLPGTTVQ 928 +NPKC W+F S T CP Y +QYGLGST G +S+T++ P T+ Sbjct: 148 KNPKCAWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLGSTAGLLLSETINFPNKTIS 207 Query: 927 DFLLGCSISSNRQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPHKFDDSPRNSKLILEGSG 748 DFL GCS+ S RQPEGI GFGR SLP QL L+KFSYCL+ +FDDSP +S LIL+ Sbjct: 208 DFLAGCSLLSTRQPEGIAGFGRSQESLPLQLGLKKFSYCLVSRRFDDSPVSSDLILDMGP 267 Query: 747 KKTHLTHTGRGFEPQYTPF---XXXXXXXXXXXXXXXXXXXXXXXXXXXKLPDKHLVPNS 577 + TG YTPF K+P LVP S Sbjct: 268 STSDSKTTGL----SYTPFQKNLASQSNPAFQEYYYVMLRKIIVGKTHVKVPYSFLVPGS 323 Query: 576 NGDGGTVVDSGMTYTFMDRPLFDPFVQELATQITHYKRADDIESGSGFGLCYDVKNVRNV 397 +G+GGT+VDSG T+TF++ +F+ +E Q+ +Y A +++ +G C+D+ ++V Sbjct: 324 DGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQKLTGLRPCFDISGEKSV 383 Query: 396 TFPKLVFHFKGGVKMLMPLANYFKYLKDFDALCLSIVTNTSPVAPRRGPV---GPAMILG 226 P L F FKGG KM +PL+NYF ++ D +CL+IV++ + G V GPA+ILG Sbjct: 384 VIPDLTFQFKGGAKMQLPLSNYFAFV-DMGVVCLTIVSDNAAALGGDGGVRSSGPAIILG 442 Query: 225 NYQQQNFNVEYDLENQRLGIRKQKC 151 N+QQQNF +EYDLEN R G ++Q C Sbjct: 443 NFQQQNFYIEYDLENDRFGFKEQSC 467 >ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max] Length = 474 Score = 375 bits (962), Expect = e-101 Identities = 204/447 (45%), Positives = 273/447 (61%), Gaps = 9/447 (2%) Frame = -3 Query: 1461 ITLTLSPIPNQHPSQNSWEFNSN--IVRTTIARAYHLKNPKKNTFS--TTFLQARSYGGY 1294 ITL LSP+ + S +S F+S ++ RA+HLK+ N+ S TT +SYGGY Sbjct: 33 ITLPLSPLLIKPHSSDSDPFHSLKFAASASLTRAHHLKHRNNNSPSVATTPAYPKSYGGY 92 Query: 1293 SVTLNFGTPPQKIPLVFDTGSSISWVPCTSNYECSNCTFSDIDPHKITTFKPELSSTTRV 1114 S+ LN GTPPQ P V DTGSS+ W PCTS Y CS+C F +ID KI TF P+ SST ++ Sbjct: 93 SIDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSSTAKL 152 Query: 1113 IGCRNPKCKWIFSGSDVESLXXXXXXXXXSTD--CPGYYLQYGLGSTTGYAISDTLDLPG 940 +GCRNPKC +IF GSDV+ + CP Y +QYGLGST G+ + D L+ PG Sbjct: 153 LGCRNPKCGYIF-GSDVQFRCPQCKPESQNCSLTCPAYIIQYGLGSTAGFLLLDNLNFPG 211 Query: 939 TTVQDFLLGCSISSNRQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPHKFDDSPRNSKLIL 760 TV FL+GCSI S RQP GI GFGR SLP+Q+ L++FSYCL+ H+FDD+P++S L+L Sbjct: 212 KTVPQFLVGCSILSIRQPSGIAGFGRGQESLPSQMNLKRFSYCLVSHRFDDTPQSSDLVL 271 Query: 759 EGSGKKTHLTHTGRGFEPQYTPF--XXXXXXXXXXXXXXXXXXXXXXXXXXXKLPDKHLV 586 + S T+ YTPF K+P L Sbjct: 272 QISSTGDTKTN-----GLSYTPFRSNPSTNNPAFKEYYYLTLRKVIVGGKDVKIPYTFLE 326 Query: 585 PNSNGDGGTVVDSGMTYTFMDRPLFDPFVQELATQI-THYKRADDIESGSGFGLCYDVKN 409 P S+G+GGT+VDSG T+TFM+RP+++ QE Q+ +Y RA+D E+ SG C+++ Sbjct: 327 PGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAEDAETQSGLSPCFNISG 386 Query: 408 VRNVTFPKLVFHFKGGVKMLMPLANYFKYLKDFDALCLSIVTNTSPVAPRRGPVGPAMIL 229 V+ VTFP+L F FKGG KM PL NYF + D + +CL++V++ P+ GPA+IL Sbjct: 387 VKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEVVCLTVVSDGGAGPPK--TTGPAIIL 444 Query: 228 GNYQQQNFNVEYDLENQRLGIRKQKCK 148 GNYQQQNF +EYDLEN+R G + C+ Sbjct: 445 GNYQQQNFYIEYDLENERFGFGPRSCR 471 >gb|EMJ03226.1| hypothetical protein PRUPE_ppa005104mg [Prunus persica] Length = 477 Score = 374 bits (960), Expect = e-101 Identities = 206/457 (45%), Positives = 270/457 (59%), Gaps = 19/457 (4%) Frame = -3 Query: 1461 ITLTLSPIPNQHPSQNSWEFNSNIVRTTIARAYHLKNPKKNTFSTTF--LQARSYGGYSV 1288 ITL LSP PN HPS + + S +I+RA+H+KN +K S T L SYG YSV Sbjct: 25 ITLPLSPFPN-HPSSDPLQALSFHASASISRAHHIKNSRKPNSSLTQVPLFPHSYGDYSV 83 Query: 1287 TLNFGTPPQKIPLVFDTGSSISWVPCTSNYECSNCTFSDIDPHKITTFKPELSSTTRVIG 1108 +LNFGTPPQ + DTGSS+ W PCT Y CS C F +I+P KI TFKP+LSS+++++G Sbjct: 84 SLNFGTPPQTSSFIMDTGSSLVWFPCTKRYICSRCQFPNINPAKIPTFKPKLSSSSKIVG 143 Query: 1107 CRNPKCKWIFSGSDVESLXXXXXXXXXST---DCPGYYLQYGLGSTTGYAISDTLDLPGT 937 C+NPKC WIF G +V+S CP Y +QYG G+T G +S+TLD P Sbjct: 144 CQNPKCGWIF-GPEVKSKCPNCNNPSHQNCSQACPTYIIQYGSGTTAGILLSETLDFPKK 202 Query: 936 TVQDFLLGCSISSNRQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPHKFDDSPRNSKLIL- 760 V DFL+GCS S RQP GI GFGR SLP Q+ L KFSYCL+ H+FDD+P++S L+L Sbjct: 203 IVPDFLVGCSFVSIRQPAGIAGFGRGPQSLPAQMGLTKFSYCLVSHRFDDTPQSSDLVLY 262 Query: 759 -EGSGKKTHLTHTGRGFEPQ----------YTPF--XXXXXXXXXXXXXXXXXXXXXXXX 619 SG + E Q TPF Sbjct: 263 SSSSGSSSSSEEEPTIAESQRNKTKLQSLSSTPFQKNPGPPNSAFREYYYIMLRKVIVGN 322 Query: 618 XXXKLPDKHLVPNSNGDGGTVVDSGMTYTFMDRPLFDPFVQELATQITHYKRADDIESGS 439 K+P K LVP ++ GGT+VDSG T+TFM++P+F+P +E Q+ +Y RA D+E+ + Sbjct: 323 KNVKIPYKFLVPGADSSGGTIVDSGSTFTFMEKPVFEPVAKEFEAQMANYTRAKDLENKT 382 Query: 438 GFGLCYDVKNVRNVTFPKLVFHFKGGVKMLMPLANYFKYLKDFDALCLSIVTNTSPVAPR 259 G C+D+ + V FP+LVF FKGG KM +P NYF + +CL+IVT+ V Sbjct: 383 GLRPCFDISKEKKVDFPELVFQFKGGAKMELPSKNYFSMVSSSGVVCLTIVTD--GVVGP 440 Query: 258 RGPVGPAMILGNYQQQNFNVEYDLENQRLGIRKQKCK 148 G GPA+ILGNYQQQ+F+VEYDL++ + G RKQ CK Sbjct: 441 GGNGGPAIILGNYQQQDFHVEYDLQHGKFGFRKQSCK 477 >ref|XP_004494242.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cicer arietinum] Length = 474 Score = 371 bits (953), Expect = e-100 Identities = 204/457 (44%), Positives = 275/457 (60%), Gaps = 16/457 (3%) Frame = -3 Query: 1461 ITLTLSPIPNQHPSQNSWEFNSNIVRTTIARAYHLKN---PKKNTFST-----TFLQARS 1306 ITL+LSPI + PS + + +++ RA+HLK KKN+ S+ T + A+S Sbjct: 28 ITLSLSPIFTKSPSSDLFHSLKKATSSSLKRAHHLKTRKLSKKNSPSSSSTINTQVFAKS 87 Query: 1305 YGGYSVTLNFGTPPQKIPLVFDTGSSISWVPCTSNYECSNCTFSDIDPHKITTFKPELSS 1126 YGGYS+ LNFGTPPQ + V DTGSS+ W PCTS+Y CSNC F++I+P I TF P SS Sbjct: 88 YGGYSINLNFGTPPQTLSFVLDTGSSLVWFPCTSHYLCSNCNFANINPTNIPTFIPSKSS 147 Query: 1125 TTRVIGCRNPKCKWIFSGSDVESLXXXXXXXXXSTD---CPGYYLQYGLGSTTGYAISDT 955 +TR+IGC N KC ++F GS++ES + + CP Y L+YGLGST G +S+ Sbjct: 148 STRIIGCTNKKCGYVF-GSNIESRCQGCNPQFQNCNNITCPTYILEYGLGSTAGLLLSEN 206 Query: 954 LDLPGTTVQDFLLGCSISSNRQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPHKFDDSPRN 775 LD PG V DFL+GCSI S QP GI GFGR SLP Q+ L KFSYCLL H FDD+P N Sbjct: 207 LDFPGYIVPDFLVGCSIFSTEQPSGIAGFGRGAESLPAQMGLTKFSYCLLSHNFDDTPVN 266 Query: 774 SKLILE----GSGKKTHLTHTGRGFEPQYTPFXXXXXXXXXXXXXXXXXXXXXXXXXXXK 607 S L+L+ G GK + L +T P + K Sbjct: 267 SNLVLQTTSTGDGKTSGLNYTTFVQNPSMS-------NPAFLEYYYVNLRSFLIGGTRVK 319 Query: 606 LPDKHLVPNSNGDGGTVVDSGMTYTFMDRPLFDPFVQELATQITHYKRADDIESGSGFGL 427 +P P +G+GGT+VDSG T+TFM+RP+FD ++ Q+ ++ RA DIE+ SGF L Sbjct: 320 IPFYLSSPGMDGNGGTIVDSGTTFTFMERPIFDLVARQFELQLANFPRATDIEAASGFNL 379 Query: 426 CYDVKNVRNVTFPKLVFHFKGGVKMLMPLANYFKYLKD-FDALCLSIVTNTSPVAPRRGP 250 C+D ++ FP+LVF FKGG +M++P+ +YF + D + CL+I+T+ + V Sbjct: 380 CFDFTGNNSIPFPELVFQFKGGAEMVLPVDDYFSLVGDGGNVACLTIMTDGNSVPATN-- 437 Query: 249 VGPAMILGNYQQQNFNVEYDLENQRLGIRKQKCKDQS 139 GPAMILGNYQQQNF +E+DLEN+R G C+ + Sbjct: 438 TGPAMILGNYQQQNFIIEFDLENERFGFGAHICQSNA 474 >gb|EOY08435.1| Eukaryotic aspartyl protease family protein, putative isoform 1 [Theobroma cacao] gi|508716539|gb|EOY08436.1| Eukaryotic aspartyl protease family protein, putative isoform 1 [Theobroma cacao] gi|508716540|gb|EOY08437.1| Eukaryotic aspartyl protease family protein, putative isoform 1 [Theobroma cacao] Length = 472 Score = 371 bits (952), Expect = e-100 Identities = 200/453 (44%), Positives = 283/453 (62%), Gaps = 14/453 (3%) Frame = -3 Query: 1467 THITLTLSPIPNQHPSQNSWEFNSNIVRTTIARAYHLKNP------KKNTFST---TFLQ 1315 T I ++LSP P+ PS ++++ +N+ ++++RA+HLK P K NT S+ T L Sbjct: 29 TTIKISLSPFPHP-PSFDAYQILNNLATSSVSRAHHLKQPTHKIKAKANTTSSLLKTPLF 87 Query: 1314 ARSYGGYSVTLNFGTPPQKIPLVFDTGSSISWVPCTSNYECSNCTFSDIDPHKITTFKPE 1135 SYGGY+++L GTPPQ + + DTGSS+SW PCTS Y CS C F ++DP KI TF P+ Sbjct: 88 PHSYGGYTISLGIGTPPQTLTFIMDTGSSLSWFPCTSRYICSQCAFPNVDPKKIPTFSPK 147 Query: 1134 LSSTTRVIGCRNPKCKWIFSGSDVES--LXXXXXXXXXSTDCPGYYLQYGLGSTTGYAIS 961 LSS+ ++GC+NPKC+W+F G DVES + +CP Y +QYGLGST G + Sbjct: 148 LSSSKALVGCKNPKCRWLF-GPDVESRCQDCEPASKNCTQNCPPYIIQYGLGSTGGLLLV 206 Query: 960 DTLDLPGTTVQDFLLGCSISSNRQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPHKFDDSP 781 + L T QDFL+GCSI SNRQP GI+GFGR SLP+QL ++KFSYCL+ +FDD+ Sbjct: 207 ENLVFSQKTFQDFLVGCSIFSNRQPAGIVGFGRRPESLPSQLGVKKFSYCLVSRRFDDTG 266 Query: 780 RNSKLILE-GSGKKTHLTHTGRGFEPQYTPF--XXXXXXXXXXXXXXXXXXXXXXXXXXX 610 +S ++LE GSG T +G YTPF Sbjct: 267 VSSNMLLETGSGSGDAKT---KGL--SYTPFYKNQFASHPIFQEFYYVTIRKILVGDKHV 321 Query: 609 KLPDKHLVPNSNGDGGTVVDSGMTYTFMDRPLFDPFVQELATQITHYKRADDIESGSGFG 430 K+P K+LVP +G+GGT+VDSG T+TFM+R +F+ +E Q+ +Y RA ++E+ SG Sbjct: 322 KVPYKYLVPGPDGNGGTIVDSGSTFTFMERAVFELVSKEFEKQMGNYSRAHEVENKSGLA 381 Query: 429 LCYDVKNVRNVTFPKLVFHFKGGVKMLMPLANYFKYLKDFDALCLSIVTNTSPVAPRRGP 250 C ++ ++++FP+L+F FKGG KM +PLANYF +L D + +CL +VT+ + + Sbjct: 382 PCVNISGHKSISFPELIFQFKGGAKMALPLANYFSFL-DVNVVCLMVVTDN--IIGQGVS 438 Query: 249 VGPAMILGNYQQQNFNVEYDLENQRLGIRKQKC 151 GPA+ILGN+QQQN+ +EYDL N+ G KQ C Sbjct: 439 GGPAIILGNFQQQNYYIEYDLANESFGFAKQSC 471 >ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera] Length = 469 Score = 370 bits (951), Expect = e-100 Identities = 195/441 (44%), Positives = 268/441 (60%), Gaps = 3/441 (0%) Frame = -3 Query: 1461 ITLTLSPIPNQHPSQNSWEFNSNIVRTTIARAYHLKNPKKNTFSTTFLQARSYGGYSVTL 1282 ITL L+P+ ++PS + W+ S++ ++ RA+HLK+ K + T L A SYGGYSV+L Sbjct: 35 ITLPLTPLFTKNPSSDPWQLLSHLTSASLTRAHHLKHRKNTSSVNTPLFAHSYGGYSVSL 94 Query: 1281 NFGTPPQKIPLVFDTGSSISWVPCTSNYECSNCTFSDIDPHKITTFKPELSSTTRVIGCR 1102 +FGTP Q + V DTGSS+ W PCTS Y C+ C+F +IDP KI TF P+LSS+ +++GC Sbjct: 95 SFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCL 154 Query: 1101 NPKCKWIFSGSDVESLXXXXXXXXXSTD-CPGYYLQYGLGSTTGYAISDTLDLPGTTVQD 925 NPKC ++ T CP Y +QYGLG+T G + ++L T D Sbjct: 155 NPKCGFVMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEPD 214 Query: 924 FLLGCSISSNRQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPHKFDDSPRNSKLILEGSGK 745 F++GCSI S+RQP GI GFGR +SLP Q+ L+KFSYCLL H+FDDSP++SK+ L G Sbjct: 215 FVVGCSILSSRQPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLY-VGP 273 Query: 744 KTHLTHTGRGFEPQYTPF--XXXXXXXXXXXXXXXXXXXXXXXXXXXKLPDKHLVPNSNG 571 + TG YTPF K+P +V S+G Sbjct: 274 DSKDDKTG---GLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDG 330 Query: 570 DGGTVVDSGMTYTFMDRPLFDPFVQELATQITHYKRADDIESGSGFGLCYDVKNVRNVTF 391 +GGT+VDSG T+TFM++P+F+ E Q+ +Y RA D+E+ SG C+++ V +V Sbjct: 331 NGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPCFNLSGVGSVAL 390 Query: 390 PKLVFHFKGGVKMLMPLANYFKYLKDFDALCLSIVTNTSPVAPRRGPVGPAMILGNYQQQ 211 P LVF FKGG KM +P+ANYF + D LCL+IV+N + + GP++ILGNYQ Q Sbjct: 391 PSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTLSS--GPSIILGNYQSQ 448 Query: 210 NFNVEYDLENQRLGIRKQKCK 148 NF EYDLEN+R G R+Q+CK Sbjct: 449 NFYTEYDLENERFGFRRQRCK 469 >emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera] Length = 609 Score = 369 bits (946), Expect = 3e-99 Identities = 193/440 (43%), Positives = 267/440 (60%), Gaps = 3/440 (0%) Frame = -3 Query: 1461 ITLTLSPIPNQHPSQNSWEFNSNIVRTTIARAYHLKNPKKNTFSTTFLQARSYGGYSVTL 1282 ITL L+P+ ++PS + W+ S++ ++ RA+HLK+ K + T L A SYGGYSV+L Sbjct: 35 ITLPLTPLFTKNPSSDPWQLLSHLTSASLTRAHHLKHRKNTSSVNTPLFAHSYGGYSVSL 94 Query: 1281 NFGTPPQKIPLVFDTGSSISWVPCTSNYECSNCTFSDIDPHKITTFKPELSSTTRVIGCR 1102 +FGTP Q + V DTGSS+ W PCTS Y C+ C+F +IDP KI TF P+LSS+ +++GC Sbjct: 95 SFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCL 154 Query: 1101 NPKCKWIFSGSDVESLXXXXXXXXXSTD-CPGYYLQYGLGSTTGYAISDTLDLPGTTVQD 925 NPKC ++ T CP Y +QYGLG+T G + ++L T D Sbjct: 155 NPKCGFVMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEPD 214 Query: 924 FLLGCSISSNRQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPHKFDDSPRNSKLILEGSGK 745 F++GCSI S+RQP GI GFGR +SLP Q+ L+KFSYCLL H+FDDSP++SK+ L G Sbjct: 215 FVVGCSILSSRQPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLY-VGP 273 Query: 744 KTHLTHTGRGFEPQYTPFXXXXXXXXXXXXXXXXXXXXXXXXXXXKL--PDKHLVPNSNG 571 + TG YTPF ++ P +V S+G Sbjct: 274 DSKDDKTG---GLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKXPYSFMVAGSDG 330 Query: 570 DGGTVVDSGMTYTFMDRPLFDPFVQELATQITHYKRADDIESGSGFGLCYDVKNVRNVTF 391 +GGT+VDSG T+TFM++P+F+ E Q+ +Y RA D+E+ SG C+++ V +V Sbjct: 331 NGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPCFNLSGVGSVAL 390 Query: 390 PKLVFHFKGGVKMLMPLANYFKYLKDFDALCLSIVTNTSPVAPRRGPVGPAMILGNYQQQ 211 P LVF FKGG KM +P+ANYF + D LCL+IV+N + + GP++ILGNYQ Q Sbjct: 391 PSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTLSS--GPSIILGNYQSQ 448 Query: 210 NFNVEYDLENQRLGIRKQKC 151 NF EYDLEN+R G R+Q+C Sbjct: 449 NFYTEYDLENERFGFRRQRC 468 >ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata] gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata] Length = 469 Score = 364 bits (934), Expect = 7e-98 Identities = 200/456 (43%), Positives = 280/456 (61%), Gaps = 19/456 (4%) Frame = -3 Query: 1461 ITLTLSPIPNQHPS-QNSWEFNSNIVRTTIARAYHLKN-----PKKNTFSTT-------- 1324 + L LSP + S ++ + + ++IARA+ LK+ P + S+T Sbjct: 19 VKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEEALSSTATASATVV 78 Query: 1323 --FLQARSYGGYSVTLNFGTPPQKIPLVFDTGSSISWVPCTSNYECSNCTFSDIDPHKIT 1150 L +SYGGYSV+L+FGTP Q IP VFDTGSS+ W PCTS Y CS+C FS +DP +I Sbjct: 79 KSHLSPKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSDCNFSGLDPTQIP 138 Query: 1149 TFKPELSSTTRVIGCRNPKCKWIFSGSDVESLXXXXXXXXXSTDCPGYYLQYGLGSTTGY 970 F P+ SS++RVIGC+NPKC+++F G++V+ + CP Y LQYGLGST G Sbjct: 139 RFIPKNSSSSRVIGCQNPKCQFLF-GANVQCRGCDPNTRNCTVPCPPYILQYGLGSTAGI 197 Query: 969 AISDTLDLPGTTVQDFLLGCSISSNRQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPHKFD 790 IS+ LD P TV DF++GCS+ S R P GI GFGR SLP+Q+KL+ FS+CL+ +FD Sbjct: 198 LISEKLDFPDLTVPDFVVGCSVISTRTPAGIAGFGRGPESLPSQMKLKSFSHCLVSRRFD 257 Query: 789 DSPRNSKLILE-GSGKKTHLTHTGRGFEPQYTPF--XXXXXXXXXXXXXXXXXXXXXXXX 619 D+ + L L+ GSG K+ G YTPF Sbjct: 258 DTNVTTDLGLDTGSGHKSGSKTPGL----SYTPFRKNPNVSNTAFLEYYYLNLRRIYVGS 313 Query: 618 XXXKLPDKHLVPNSNGDGGTVVDSGMTYTFMDRPLFDPFVQELATQITHYKRADDIESGS 439 K+P K L P +NG+GG++VDSG T+TFM+RP+F+ +E ATQ+++Y R D+E S Sbjct: 314 KHVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQMSNYTREKDLEKVS 373 Query: 438 GFGLCYDVKNVRNVTFPKLVFHFKGGVKMLMPLANYFKYLKDFDALCLSIVTNTSPVAPR 259 G C+++ +VT P+L+F FKGG KM +PL+NYF ++ + D +CL++V++ + V P Sbjct: 374 GIAPCFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFVGNADTVCLTVVSDNT-VNP- 431 Query: 258 RGPVGPAMILGNYQQQNFNVEYDLENQRLGIRKQKC 151 G GPA+ILG++QQQN+ VEYDLEN R G K+KC Sbjct: 432 GGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467 >ref|XP_004241344.1| PREDICTED: aspartic proteinase nepenthesin-2-like isoform 1 [Solanum lycopersicum] Length = 461 Score = 363 bits (932), Expect = 1e-97 Identities = 187/442 (42%), Positives = 267/442 (60%), Gaps = 2/442 (0%) Frame = -3 Query: 1467 THITLTLSPIPNQHPSQNSWEFNSNIVRTTIARAYHLKNPKKNTFSTTFLQARSYGGYSV 1288 T T+ LS +HPSQ+ +E +++ ++ARA ++K + + STT L +SYGGYS+ Sbjct: 27 TTSTIPLSLFNTKHPSQDLYEKLTHLASISLARANYIKKSQDSPVSTTPLYPQSYGGYSI 86 Query: 1287 TLNFGTPPQKIPLVFDTGSSISWVPCTSNYECSNCTFSDIDPHKITTFKPELSSTTRVIG 1108 TL+FGTPPQKIP + DTGSS W PCT+ Y C+NC+ S I TF P+ SS+ RV+G Sbjct: 87 TLSFGTPPQKIPFIMDTGSSFVWFPCTTRYLCTNCSVSSATSQSIPTFIPKSSSSARVVG 146 Query: 1107 CRNPKCKWIFSGSDVESLXXXXXXXXXSTDCPGYYLQYGLGSTTGYAISDTLDLPGTTVQ 928 C NPKC WI S + CP Y + YG GST G A+ DTLDL V Sbjct: 147 CLNPKCGWIHSNNPKSRCQDCESPTNCKQVCPPYIILYGSGSTGGLALVDTLDLSNKKVP 206 Query: 927 DFLLGCSISSNRQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPHKFDDSPRNSKLILE--G 754 +FL+GCS+ S++QP GI G GR L SLP QL ++KFSYCL+ HKFDD+ ++S L+L+ Sbjct: 207 NFLVGCSLFSSKQPAGIAGLGRGLASLPNQLGVKKFSYCLVSHKFDDTGKSSNLVLDFNA 266 Query: 753 SGKKTHLTHTGRGFEPQYTPFXXXXXXXXXXXXXXXXXXXXXXXXXXXKLPDKHLVPNSN 574 SG+KT G + P K+P K+L P+SN Sbjct: 267 SGEKT----AGLSYTP-LLKNPVVSEKNALSVYYYVSLRKITVGGKKVKIPYKYLTPDSN 321 Query: 573 GDGGTVVDSGMTYTFMDRPLFDPFVQELATQITHYKRADDIESGSGFGLCYDVKNVRNVT 394 G+GG++VDSG T+TFM+R +F+P + Q+ R++ IE +G C+++ V+ Sbjct: 322 GNGGSIVDSGTTFTFMNRGVFEPVLDAFVKQVKGIPRSESIEIITGLKPCFNISRQETVS 381 Query: 393 FPKLVFHFKGGVKMLMPLANYFKYLKDFDALCLSIVTNTSPVAPRRGPVGPAMILGNYQQ 214 P+L FHFKGG +M +P+ANYF + + D +CL++VT+ S P GP++ILGN+Q Sbjct: 382 LPELKFHFKGGAEMTLPIANYFSFAGEIDVICLTMVTD-SAFGPELS-TGPSIILGNFQM 439 Query: 213 QNFNVEYDLENQRLGIRKQKCK 148 QN+ VE+DL+N++ G ++Q CK Sbjct: 440 QNYLVEFDLKNEKFGFKQQMCK 461 >ref|XP_002309394.1| aspartyl protease family protein [Populus trichocarpa] gi|222855370|gb|EEE92917.1| aspartyl protease family protein [Populus trichocarpa] Length = 469 Score = 361 bits (926), Expect = 6e-97 Identities = 192/429 (44%), Positives = 270/429 (62%), Gaps = 5/429 (1%) Frame = -3 Query: 1422 SQNSWEFNSNIVRTTIARAYHLKNPK-KNTFSTTFLQARSYGGYSVTLNFGTPPQKIPLV 1246 S+N W +++ +++RA+H+K+PK K + T L RSYGGYS++LNFGTPPQ V Sbjct: 49 SKNPWGALNHLASLSLSRAHHIKSPKTKFSLLKTPLFPRSYGGYSISLNFGTPPQTTKFV 108 Query: 1245 FDTGSSISWVPCTSNYECSNCTFSDIDPHKITTFKPELSSTTRVIGCRNPKCKWIFSGSD 1066 DTGSS+ W PCTS Y CS C F +I+ I TF P+ SS++ +IGC+N KC W+F G Sbjct: 109 MDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSNLIGCKNHKCSWLF-GPK 167 Query: 1065 VES--LXXXXXXXXXSTDCPGYYLQYGLGSTTGYAISDTLDLP-GTTVQDFLLGCSISSN 895 V+S + CP Y +QYGLGST G +S+TLD P T+ FL+GCS+ S Sbjct: 168 VQSKCQECDPTTQNCTQSCPPYVIQYGLGSTAGLLLSETLDFPHKKTIPGFLVGCSLFSI 227 Query: 894 RQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPHKFDDSPRNSKLILE-GSGKKTHLTHTGR 718 RQPEGI GFGR SLP+QL L+KFSYCL+ H FDD+P +S L+L+ GSG T Sbjct: 228 RQPEGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPASSDLVLDTGSGSDDTKTP--- 284 Query: 717 GFEPQYTPFXXXXXXXXXXXXXXXXXXXXXXXXXXXKLPDKHLVPNSNGDGGTVVDSGMT 538 YTPF K+P K LVP S+G+GGT+VDSG T Sbjct: 285 --GLSYTPF-QKNPTAAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGSDGNGGTIVDSGTT 341 Query: 537 YTFMDRPLFDPFVQELATQITHYKRADDIESGSGFGLCYDVKNVRNVTFPKLVFHFKGGV 358 +TFM++P+++ +E Q+ HY A ++++ +G C+++ ++V+ P+ +FHFKGG Sbjct: 342 FTFMEKPVYELVAKEFEKQVAHYTVATEVQNQTGLRPCFNISGEKSVSVPEFIFHFKGGA 401 Query: 357 KMLMPLANYFKYLKDFDALCLSIVTNTSPVAPRRGPVGPAMILGNYQQQNFNVEYDLENQ 178 KM +PLANYF ++ D +CL+IV++ + G GPA+ILGNYQQ+NF+VE+DL+N+ Sbjct: 402 KMALPLANYFSFV-DSGVICLTIVSDNMSGSGIGG--GPAIILGNYQQRNFHVEFDLKNE 458 Query: 177 RLGIRKQKC 151 R G ++Q C Sbjct: 459 RFGFKQQNC 467 >ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana] gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana] gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana] gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana] gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana] gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana] gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana] Length = 469 Score = 360 bits (923), Expect = 1e-96 Identities = 195/456 (42%), Positives = 282/456 (61%), Gaps = 19/456 (4%) Frame = -3 Query: 1461 ITLTLSPIPNQHPS-QNSWEFNSNIVRTTIARAYHLKN-----PKKNTFSTTF------- 1321 + L LSP + S ++ + + ++IARA+ LK+ P ++ S+T Sbjct: 19 VKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDALSSTTTASATVV 78 Query: 1320 ---LQARSYGGYSVTLNFGTPPQKIPLVFDTGSSISWVPCTSNYECSNCTFSDIDPHKIT 1150 L A+SYGGYSV+L+FGTP Q IP VFDTGSS+ W+PCTS Y CS C FS +DP I Sbjct: 79 KSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIP 138 Query: 1149 TFKPELSSTTRVIGCRNPKCKWIFSGSDVESLXXXXXXXXXSTDCPGYYLQYGLGSTTGY 970 F P+ SS++++IGC++PKC++++ G +V+ + CP Y LQYGLGST G Sbjct: 139 RFIPKNSSSSKIIGCQSPKCQFLY-GPNVQCRGCDPNTRNCTVGCPPYILQYGLGSTAGV 197 Query: 969 AISDTLDLPGTTVQDFLLGCSISSNRQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPHKFD 790 I++ LD P TV DF++GCSI S RQP GI GFGR SLP+Q+ L++FS+CL+ +FD Sbjct: 198 LITEKLDFPDLTVPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNLKRFSHCLVSRRFD 257 Query: 789 DSPRNSKLILE-GSGKKTHLTHTGRGFEPQYTPF--XXXXXXXXXXXXXXXXXXXXXXXX 619 D+ + L L+ GSG + G YTPF Sbjct: 258 DTNVTTDLDLDTGSGHNSGSKTPGL----TYTPFRKNPNVSNKAFLEYYYLNLRRIYVGR 313 Query: 618 XXXKLPDKHLVPNSNGDGGTVVDSGMTYTFMDRPLFDPFVQELATQITHYKRADDIESGS 439 K+P K+L P +NGDGG++VDSG T+TFM+RP+F+ +E A+Q+++Y R D+E + Sbjct: 314 KHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKET 373 Query: 438 GFGLCYDVKNVRNVTFPKLVFHFKGGVKMLMPLANYFKYLKDFDALCLSIVTNTSPVAPR 259 G G C+++ +VT P+L+F FKGG K+ +PL+NYF ++ + D +CL++V++ + V P Sbjct: 374 GLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKT-VNP- 431 Query: 258 RGPVGPAMILGNYQQQNFNVEYDLENQRLGIRKQKC 151 G GPA+ILG++QQQN+ VEYDLEN R G K+KC Sbjct: 432 SGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467 >gb|AAS48510.2| aspartic protease [Fagopyrum esculentum] gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum] Length = 447 Score = 357 bits (917), Expect = 7e-96 Identities = 192/436 (44%), Positives = 250/436 (57%), Gaps = 5/436 (1%) Frame = -3 Query: 1443 PIPNQHPSQNSWEFNSNIVRTTIARAYHLKNPKKNTFSTTF-LQARSYGGYSVTLNFGTP 1267 P+ + + WE + ++++RA HLK P T T RSYGGYSV + GTP Sbjct: 24 PLSISPSALDKWESINLAALSSLSRARHLKRPPTLTGKVTLPAYPRSYGGYSVIFSLGTP 83 Query: 1266 PQKIPLVFDTGSSISWVPCT---SNYECSNCTFSDIDPHKITTFKPELSSTTRVIGCRNP 1096 PQK+ LV DTGSS+ W PCT + Y C NCTFS +DP KI + SST + + CR+P Sbjct: 84 PQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSLPCRSP 143 Query: 1095 KCKWIFSGSDVESLXXXXXXXXXSTDCPGYYLQYGLGSTTGYAISDTLDLPGTT-VQDFL 919 KC W+F GSD+ CP Y L+YGLGSTTG +SD L L + DFL Sbjct: 144 KCNWVF-GSDLNCSTTKR--------CPYYGLEYGLGSTTGQLVSDVLGLSKLNRIPDFL 194 Query: 918 LGCSISSNRQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPHKFDDSPRNSKLILEGSGKKT 739 GCS+ SNRQPEGI GFGR L S+P QL L KFSYCL+ H+FDD+P++ L+L + Sbjct: 195 FGCSLVSNRQPEGIAGFGRGLASIPAQLGLTKFSYCLVSHRFDDTPQSGDLVLHRGRRHA 254 Query: 738 HLTHTGRGFEPQYTPFXXXXXXXXXXXXXXXXXXXXXXXXXXXKLPDKHLVPNSNGDGGT 559 G Y PF +P ++LVP+ GDGG Sbjct: 255 DAAANGVA----YAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKEGDGGM 310 Query: 558 VVDSGMTYTFMDRPLFDPFVQELATQITHYKRADDIESGSGFGLCYDVKNVRNVTFPKLV 379 +VDSG T+TFM+R +FDP +EL +T YKRA +IE SG G CY++ V PKL Sbjct: 311 IVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYNITGQSEVDVPKLT 370 Query: 378 FHFKGGVKMLMPLANYFKYLKDFDALCLSIVTNTSPVAPRRGPVGPAMILGNYQQQNFNV 199 F FKGG M +PL +YF + D +C++++T+ P P GPA+ILGNYQQQNF + Sbjct: 371 FSFKGGANMDLPLTDYFSLVTD-GVVCMTVLTD--PDEP-GSTTGPAIILGNYQQQNFYI 426 Query: 198 EYDLENQRLGIRKQKC 151 EYDL+ QR G + Q+C Sbjct: 427 EYDLKKQRFGFKPQQC 442 >ref|XP_006361102.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Solanum tuberosum] Length = 460 Score = 357 bits (916), Expect = 9e-96 Identities = 185/445 (41%), Positives = 267/445 (60%), Gaps = 5/445 (1%) Frame = -3 Query: 1467 THITLTLSPIPNQHPSQNSWEFNSNIVRTTIARAYHLKNPKKNTFSTTFLQARSYGGYSV 1288 T T+ LS ++PSQ+ +E +++ ++ARA ++K + + STT L +SYGGYS+ Sbjct: 26 TTTTIPLSLFNTKNPSQDFYEKLTHLASISLARANYIKKSQDSPVSTTPLYPQSYGGYSI 85 Query: 1287 TLNFGTPPQKIPLVFDTGSSISWVPCTSNYECSNCTFSDIDPHKITTFKPELSSTTRVIG 1108 L+FGTPPQKIP + DTGS+ W PCT+ Y CSNCT S I TF P+ SS+ RV+G Sbjct: 86 ALSFGTPPQKIPFIMDTGSNFVWFPCTTRYLCSNCTVSSATSQSIPTFIPKSSSSARVLG 145 Query: 1107 CRNPKCKWIFSGSDVESLXXXXXXXXXSTDCPGYYLQYGLGSTTGYAISDTLDLPGTTVQ 928 C NPKC WI S + CP Y + YG GST G A+ DTLDL V Sbjct: 146 CLNPKCGWIHSNNPKSRCQDCESPTNCKQVCPPYIILYGSGSTGGLALVDTLDLSNKKVP 205 Query: 927 DFLLGCSISSNRQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPHKFDDSPRNSKLILE--G 754 +FL+GCS+ S++QP GI G GR L SLP+QL ++KFSYCL+ HKFDD+ ++S L+L+ Sbjct: 206 NFLVGCSLFSSKQPAGIAGLGRGLASLPSQLGVKKFSYCLVSHKFDDTGKSSNLVLDFNA 265 Query: 753 SGKKTHLTHTGRGFEPQYTPF---XXXXXXXXXXXXXXXXXXXXXXXXXXXKLPDKHLVP 583 SG+KT + YTP K+P K+L Sbjct: 266 SGEKTS--------DLSYTPLQKNPVVSEKNALSVYYYVSLRKITVGGKKVKIPYKYLTT 317 Query: 582 NSNGDGGTVVDSGMTYTFMDRPLFDPFVQELATQITHYKRADDIESGSGFGLCYDVKNVR 403 +SNG+GG++VDSG T+TFM+R +F+P + Q+ R++ IE +G C+++ Sbjct: 318 DSNGNGGSIVDSGTTFTFMNRGVFEPVLDAFVKQVKGIPRSESIEIITGLRPCFNISRQE 377 Query: 402 NVTFPKLVFHFKGGVKMLMPLANYFKYLKDFDALCLSIVTNTSPVAPRRGPVGPAMILGN 223 V+ P+L FH+KGG +M +P+ANYF + + D +CL++VT+ S P GP++ILGN Sbjct: 378 TVSLPELKFHYKGGAEMTLPIANYFSFAGETDVICLTMVTD-SAFGPELS-TGPSIILGN 435 Query: 222 YQQQNFNVEYDLENQRLGIRKQKCK 148 +Q QN+ VE+DL+N++ G ++Q CK Sbjct: 436 FQMQNYLVEFDLKNEKFGFKQQMCK 460 >ref|XP_006403798.1| hypothetical protein EUTSA_v10010339mg [Eutrema salsugineum] gi|557104917|gb|ESQ45251.1| hypothetical protein EUTSA_v10010339mg [Eutrema salsugineum] Length = 471 Score = 356 bits (914), Expect = 1e-95 Identities = 197/459 (42%), Positives = 271/459 (59%), Gaps = 22/459 (4%) Frame = -3 Query: 1461 ITLTLSPIPNQHPSQNSWEFNS--NIVRTTIARAYHLKNPKK-----------NTFSTTF 1321 + L LSP + Q + + S + ++IARA LK P +T S + Sbjct: 22 VKLPLSPFSHHTDQQPNDPYLSLRRLADSSIARAQELKQPTSIKPDEDALSASSTASASA 81 Query: 1320 ------LQARSYGGYSVTLNFGTPPQKIPLVFDTGSSISWVPCTSNYECSNCTFSDIDPH 1159 L RSYGGYSV+L+FGTP Q IP VFDTGSS+ W PCTS Y CS C FS +DP+ Sbjct: 82 AVVKSPLSPRSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSGCNFSGLDPN 141 Query: 1158 KITTFKPELSSTTRVIGCRNPKCKWIFSGSDVESLXXXXXXXXXSTDCPGYYLQYGLGST 979 +I F P+ SS++R++GC+NPKC +F G +++ + CP Y +QYG GST Sbjct: 142 RIPRFLPKNSSSSRIVGCQNPKCSLLF-GPNLKCRGCDPNTRNCTLGCPPYVIQYGSGST 200 Query: 978 TGYAISDTLDLPGTTVQDFLLGCSISSNRQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPH 799 G ISD L P TV DFL+GCSI S RQP GI GFGR SLP+Q+ L++FS+CL+ Sbjct: 201 AGILISDKLVFPDLTVPDFLVGCSILSTRQPAGIAGFGRGPESLPSQMNLKRFSHCLVSR 260 Query: 798 KFDDSPRNSKLILE-GSGKKTHLTHTGRGFEPQYTPF--XXXXXXXXXXXXXXXXXXXXX 628 +FDD+ + L L+ GSG KT L G YTPF Sbjct: 261 RFDDTNVTTDLDLDTGSGHKTGLKTPGL----SYTPFRNNPNVSNAAFLEYYYLNLRRIF 316 Query: 627 XXXXXXKLPDKHLVPNSNGDGGTVVDSGMTYTFMDRPLFDPFVQELATQITHYKRADDIE 448 K+P K+L P ++G+GGT+VDSG T TFM++P+FD +E ATQ+++Y R D+E Sbjct: 317 VGSKRVKIPYKYLAPGTDGNGGTIVDSGTTLTFMEQPIFDLVAEEFATQMSNYSREKDLE 376 Query: 447 SGSGFGLCYDVKNVRNVTFPKLVFHFKGGVKMLMPLANYFKYLKDFDALCLSIVTNTSPV 268 +G G C+++ ++T P L F FKGG KM +P +NYF ++K D +CL++V+ + Sbjct: 377 KTTGIGPCFNISGKGSLTVPDLTFEFKGGAKMKLPTSNYFAFVKSNDNVCLTVVSADA-- 434 Query: 267 APRRGPVGPAMILGNYQQQNFNVEYDLENQRLGIRKQKC 151 G GPA+ILG++QQQN++VEYDLEN R G ++KC Sbjct: 435 ----GGSGPAIILGSFQQQNYHVEYDLENDRFGFAQKKC 469 >gb|ESW17797.1| hypothetical protein PHAVU_007G269300g [Phaseolus vulgaris] Length = 458 Score = 355 bits (910), Expect = 4e-95 Identities = 198/442 (44%), Positives = 259/442 (58%), Gaps = 4/442 (0%) Frame = -3 Query: 1461 ITLTLSPIPNQHPSQNSWEFNSNIVRTTIARAYHLKNPKKNTFSTTFLQARSYGGYSVTL 1282 ITL LS + HPS + + V T++ RA+HLKN + N T + +SYGGYS+ L Sbjct: 29 ITLPLSHLFTTHPSSHPFHTLKLAVSTSLTRAHHLKNHQPNPPKTQ-IHPKSYGGYSIDL 87 Query: 1281 NFGTPPQKIPLVFDTGSSISWVPCTSNYECSNCTFSDIDPHKITTFKPELSSTTRVIGCR 1102 NFGTPPQ + DTGS++ W+PC+S+Y CSNC P +F P+ SS+++ +GC Sbjct: 88 NFGTPPQTFSFILDTGSTLVWLPCSSHYLCSNCNNFHNSPK---SFIPKNSSSSKFVGCT 144 Query: 1101 NPKCKWIFSGSDVESLXXXXXXXXXSTD--CPGYYLQYGLGSTTGYAISDTLDLPGTTVQ 928 NPKCKW+F G+ VES + CP Y +QYGLGST G+ +S+ L+ PG + Sbjct: 145 NPKCKWVF-GTSVESRCCKQNSATANCSQTCPAYTVQYGLGSTAGFLLSENLNFPGKLLP 203 Query: 927 DFLLGCSISSNRQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPHKFDDSPRNSKLILEGSG 748 DFL+GCSI S QP GI GFGR SLP+Q+ L FSYCLL H+FDDSP S L+L S Sbjct: 204 DFLVGCSIVSVYQPAGIAGFGRGPESLPSQMNLTGFSYCLLSHQFDDSPETSDLVLHTSS 263 Query: 747 KKTHLTHTGRGFEPQYTPF--XXXXXXXXXXXXXXXXXXXXXXXXXXXKLPDKHLVPNSN 574 T+ YTPF ++P + L P+ N Sbjct: 264 SDNKRTN-----GVSYTPFRKNPSSKNPAFGAYYYLTLRRIVVGEKRVRVPKRLLEPDVN 318 Query: 573 GDGGTVVDSGMTYTFMDRPLFDPFVQELATQITHYKRADDIESGSGFGLCYDVKNVRNVT 394 G+GG++VDSG T+TFM+RP+FD +E A Q+ +Y RA +IE SG C+ V T Sbjct: 319 GNGGSIVDSGSTFTFMERPIFDLVAEEFARQV-NYTRAREIEKKSGLSPCFVVSG--TAT 375 Query: 393 FPKLVFHFKGGVKMLMPLANYFKYLKDFDALCLSIVTNTSPVAPRRGPVGPAMILGNYQQ 214 FP+L F F+GG KM +PL NYF + D CL+IV++ VA GPA+ILGNYQQ Sbjct: 376 FPELRFEFRGGAKMSLPLTNYFSLVGKSDVACLTIVSD--DVAGPGVAAGPAVILGNYQQ 433 Query: 213 QNFNVEYDLENQRLGIRKQKCK 148 QNF VEYDL N+R G R Q CK Sbjct: 434 QNFYVEYDLGNERFGFRSQSCK 455 >gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana] Length = 469 Score = 355 bits (910), Expect = 4e-95 Identities = 194/456 (42%), Positives = 281/456 (61%), Gaps = 19/456 (4%) Frame = -3 Query: 1461 ITLTLSPIPNQHPS-QNSWEFNSNIVRTTIARAYHLKN-----PKKNTFSTTF------- 1321 + L LSP + S ++ + + ++IARA+ LK+ P ++ S+T Sbjct: 19 VKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDALSSTTTASATVV 78 Query: 1320 ---LQARSYGGYSVTLNFGTPPQKIPLVFDTGSSISWVPCTSNYECSNCTFSDIDPHKIT 1150 L A+SYGGYSV+L+FGTP Q IP VFDTGSS+ +PCTS Y CS C FS +DP I Sbjct: 79 KSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVCLPCTSRYLCSGCDFSGLDPTLIP 138 Query: 1149 TFKPELSSTTRVIGCRNPKCKWIFSGSDVESLXXXXXXXXXSTDCPGYYLQYGLGSTTGY 970 F P+ SS++++IGC++PKC++++ G +V+ + CP Y LQYGLGST G Sbjct: 139 RFIPKNSSSSKIIGCQSPKCQFLY-GPNVQCRGCDPNTRNCTVGCPPYILQYGLGSTAGV 197 Query: 969 AISDTLDLPGTTVQDFLLGCSISSNRQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPHKFD 790 I++ LD P TV DF++GCSI S RQP GI GFGR SLP+Q+ L++FS+CL+ +FD Sbjct: 198 LITEKLDFPDLTVPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNLKRFSHCLVSRRFD 257 Query: 789 DSPRNSKLILE-GSGKKTHLTHTGRGFEPQYTPF--XXXXXXXXXXXXXXXXXXXXXXXX 619 D+ + L L+ GSG + G YTPF Sbjct: 258 DTNVTTDLDLDTGSGHNSGSKTPGL----TYTPFRKNPNVSNKAFLEYYYLNLRRIYVGR 313 Query: 618 XXXKLPDKHLVPNSNGDGGTVVDSGMTYTFMDRPLFDPFVQELATQITHYKRADDIESGS 439 K+P K+L P +NGDGG++VDSG T+TFM+RP+F+ +E A+Q+++Y R D+E + Sbjct: 314 KHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKET 373 Query: 438 GFGLCYDVKNVRNVTFPKLVFHFKGGVKMLMPLANYFKYLKDFDALCLSIVTNTSPVAPR 259 G G C+++ +VT P+L+F FKGG K+ +PL+NYF ++ + D +CL++V++ + V P Sbjct: 374 GLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKT-VNP- 431 Query: 258 RGPVGPAMILGNYQQQNFNVEYDLENQRLGIRKQKC 151 G GPA+ILG++QQQN+ VEYDLEN R G K+KC Sbjct: 432 SGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467 >ref|XP_006291053.1| hypothetical protein CARUB_v10017168mg [Capsella rubella] gi|482559760|gb|EOA23951.1| hypothetical protein CARUB_v10017168mg [Capsella rubella] Length = 471 Score = 352 bits (903), Expect = 3e-94 Identities = 190/443 (42%), Positives = 273/443 (61%), Gaps = 18/443 (4%) Frame = -3 Query: 1425 PSQNSWEFNSNIVRTTIARAYHLKN-----------PKKNTFSTTF----LQARSYGGYS 1291 PS++ + + ++IARA+ +K+ +T S T L A+SYGGYS Sbjct: 34 PSKDPYLSLRRLADSSIARAHKIKHGASVKPDDDALSSASTASATVVKSPLSAKSYGGYS 93 Query: 1290 VTLNFGTPPQKIPLVFDTGSSISWVPCTSNYECSNCTFSDIDPHKITTFKPELSSTTRVI 1111 V+L+FGTP Q IP VFDTGSS+ W PCTS Y CS C+FS +DP +I F P+ SS++RVI Sbjct: 94 VSLSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSGCSFSGLDPTQIPRFIPKNSSSSRVI 153 Query: 1110 GCRNPKCKWIFSGSDVESLXXXXXXXXXSTDCPGYYLQYGLGSTTGYAISDTLDLPGTTV 931 GC+NPKC+++F G++V+ + CP Y LQYGLGST G +++TLD P V Sbjct: 154 GCQNPKCQFLF-GANVQCRGCDPNTRNCTVACPPYILQYGLGSTAGILLTETLDFPDLKV 212 Query: 930 QDFLLGCSISSNRQPEGIIGFGRDLTSLPTQLKLRKFSYCLLPHKFDDSPRNSKLILE-G 754 DF++GCSI S RQP GI GFGR S+P+Q+KL++FS+CL+ +FD++ + L L+ G Sbjct: 213 PDFVVGCSIISTRQPAGIAGFGRGPESIPSQMKLKRFSHCLVSRRFDNTNVTTDLDLDTG 272 Query: 753 SGKKTHLTHTGRGFEPQYTPF--XXXXXXXXXXXXXXXXXXXXXXXXXXXKLPDKHLVPN 580 SG + G YTPF K+P K L P Sbjct: 273 SGHNSGSKTPGL----SYTPFRKNPNVSNAAFLEYYYLNLRRIYVGSKHVKVPYKFLAPG 328 Query: 579 SNGDGGTVVDSGMTYTFMDRPLFDPFVQELATQITHYKRADDIESGSGFGLCYDVKNVRN 400 NG+GG++VDSG T+TFM+RP+F +E A Q+++Y R D+E +G G C+++ + Sbjct: 329 KNGNGGSIVDSGSTFTFMERPVFTLVAEEFAAQMSNYTREKDLEKLTGLGPCFNIAGKGD 388 Query: 399 VTFPKLVFHFKGGVKMLMPLANYFKYLKDFDALCLSIVTNTSPVAPRRGPVGPAMILGNY 220 V+ P+L+F FKGG KM +P++NYF ++ + +CL++V++ + V P G GPA+ILG++ Sbjct: 389 VSVPELIFEFKGGAKMELPISNYFSFVGSSETVCLTVVSDNT-VNP-SGGTGPAIILGSF 446 Query: 219 QQQNFNVEYDLENQRLGIRKQKC 151 QQQN+ VEYDLEN R G K+KC Sbjct: 447 QQQNYLVEYDLENDRFGFAKKKC 469