BLASTX nr result
ID: Rehmannia25_contig00005219
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00005219
         (1860 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_006354730.1| PREDICTED: aspartic proteinase Asp1-like [So...   664   0.0
ref|XP_004241624.1| PREDICTED: aspartic proteinase Asp1-like [So...   662   0.0
gb|EXB56943.1| Aspartic proteinase Asp1 [Morus notabilis]             631   e-178
ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vi...   617   e-174
ref|XP_006440945.1| hypothetical protein CICLE_v10019473mg [Citr...   615   e-173
ref|XP_006485774.1| PREDICTED: aspartic proteinase Asp1-like [Ci...   615   e-173
gb|EOY21001.1| Eukaryotic aspartyl protease family protein, puta...   611   e-172
ref|XP_002511959.1| protein with unknown function [Ricinus commu...   607   e-171
emb|CBI15437.3| unnamed protein product [Vitis vinifera]              584   e-164
ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cu...   584   e-164
ref|XP_002328687.1| predicted protein [Populus trichocarpa] gi|5...   560   e-157
ref|XP_002891474.1| aspartyl protease family protein [Arabidopsi...   556   e-155
ref|XP_006583400.1| PREDICTED: aspartic proteinase Asp1-like [Gl...   548   e-153
gb|ESW24775.1| hypothetical protein PHAVU_004G159200g [Phaseolus...   548   e-153
ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana] gi|777...   548   e-153
ref|XP_006826817.1| hypothetical protein AMTR_s00010p00056950 [A...   538   e-150
ref|XP_004512995.1| PREDICTED: lysine-specific histone demethyla...   536   e-149
ref|XP_006304522.1| hypothetical protein CARUB_v10011364mg [Caps...   535   e-149
ref|XP_006393315.1| hypothetical protein EUTSA_v10011346mg [Eutr...
528 e-147 gb|EOY21002.1| Eukaryotic aspartyl protease family protein, puta... 524 e-146 >ref|XP_006354730.1| PREDICTED: aspartic proteinase Asp1-like [Solanum tuberosum] Length = 558 Score = 664 bits (1712), Expect = 0.0 Identities = 339/569 (59%), Positives = 412/569 (72%), Gaps = 6/569 (1%) Frame = +3 Query: 81 MEETNERPP-QLTVIITLPPPDNPSLGKTITAFTLSD---HQTPTPQSPPQVDESPPVQN 248 MEET PP Q VIITLPPPDNPS GKTITAFTLSD HQ + PPQ + Sbjct: 1 MEETKNSPPIQGVVIITLPPPDNPSYGKTITAFTLSDSPTHQQQQEEEPPQQSQPHNQDL 60 Query: 249 FAXXXXXXXXXXXXXXXXTVLPVLGISVIALYLWVSVSRETLFQLRDELGNDDEHQKNNS 428 V +LGIS+IAL W S+++ETLF+LRD EH +S Sbjct: 61 NTGVLRASLERSFFFRPKIVFGLLGISLIALSFWSSLTQETLFELRDV-----EHDHKSS 115 Query: 429 QHTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAMSVGEVNSKLVSTAS 608 +F+LPLY K+ N D E KLGR V F D E +K +S A+ Sbjct: 116 NSSFILPLYPKRGGAWNSRR-DVEFKLGRFVDFKP--------DKFMDQEKIAKSLSAAT 166 Query: 609 RIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQCDAPCTSCAKGAH 788 ++D++ PVRGNI+ +GLYYTY+ GNPPRPYFLD+DTGSDL WIQCDAPCTSCAKGAH Sbjct: 167 KLDSSVNFPVRGNIHSEGLYYTYMLVGNPPRPYFLDIDTGSDLMWIQCDAPCTSCAKGAH 226 Query: 789 PFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLARDELYLN 968 P YKP N+IPPK+ YCVE+Q + +K CD+CHQCDYEIEYAD SSSVGVLA+DEL L Sbjct: 227 PLYKPRNVNMIPPKNPYCVEVQENLKSKYCDNCHQCDYEIEYADRSSSVGVLAKDELQLV 286 Query: 969 IANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIINNVVGHC 1148 +ANG+ K VVFGCAYDQQG LLNT+ TDGILGLSRA IS PSQLAS G+INNV+GHC Sbjct: 287 LANGTGTKPSVVFGCAYDQQGTLLNTLASTDGILGLSRAPISLPSQLASHGLINNVIGHC 346 Query: 1149 LATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGLGNV--GN 1322 L T+ + GGYLFLG+DFVP RM+WVPML + N YQA+++K++YGG+++ LG+ G Sbjct: 347 LRTD-TNGGYLFLGNDFVPQWRMSWVPMLNNPFPNLYQAQLMKMNYGGKELRLGSTSYGQ 405 Query: 1323 GRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKSPSRSVKDVR 1502 G +VFDSGS+Y+YFT+QAY L+++L++ISSE L++D SDT+LPICW+ K P RS+++VR Sbjct: 406 GTVVFDSGSTYTYFTDQAYKALISMLEEISSEDLIKDASDTTLPICWRAKFPVRSIEEVR 465 Query: 1503 QLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILGDISL 
1682 Q FKPLNLQFGSKW I+STKL IP EG+L + KGNVCLGIL+G NVHDGS ILGDISL Sbjct: 466 QFFKPLNLQFGSKWRIVSTKLWIPAEGFLTISEKGNVCLGILDGSNVHDGSAIILGDISL 525 Query: 1683 RGLLFVYDNVNEKIGWVRSDCARPRRFES 1769 RG LFVYDNVN+KIGW+RS+C RP + S Sbjct: 526 RGQLFVYDNVNQKIGWIRSNCERPEKVPS 554 >ref|XP_004241624.1| PREDICTED: aspartic proteinase Asp1-like [Solanum lycopersicum] Length = 562 Score = 662 bits (1709), Expect = 0.0 Identities = 340/577 (58%), Positives = 421/577 (72%), Gaps = 10/577 (1%) Frame = +3 Query: 81 MEETNERPP-QLTVIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESPPVQNF-- 251 MEET PP Q VIITLPPPDNPS GKTITAFTLSD T Q + +E PP Q+ Sbjct: 1 MEETKNSPPIQGVVIITLPPPDNPSYGKTITAFTLSDSPTHQQQQEQEQEEEPPQQSQPH 60 Query: 252 -----AXXXXXXXXXXXXXXXXTVLPVLGISVIALYLWVSVSRETLFQLRDELGNDDEHQ 416 A V +LGIS+IAL W S+++ETLF+LRD + +H+ Sbjct: 61 NQDVNAGVLHVSLERSFFFRPTIVFGLLGISLIALSFWSSLTQETLFELRDV---EQDHK 117 Query: 417 KNNSQHTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAMSVGEVNSKLV 596 +NS +F+LPLY K+ N D E KLGR V F D+ M ++ +K + Sbjct: 118 SSNS--SFILPLYPKRGGAWNSRT-DVEFKLGRFVDFKP-------DNFMDQEKI-AKSL 166 Query: 597 STASRIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQCDAPCTSCA 776 S A+++D+++ PVRGNI+ +GLYYTY+ GNPP+PYFLD+DTGSDL WIQCDAPCTSCA Sbjct: 167 SAATKLDSSANFPVRGNIHSEGLYYTYMLVGNPPKPYFLDIDTGSDLMWIQCDAPCTSCA 226 Query: 777 KGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLARDE 956 KGAHP YKP N+IPPK+ YCVE+Q + +K CD+CHQCDYEIEYAD SSSVGVLA+DE Sbjct: 227 KGAHPLYKPRNVNMIPPKNPYCVEVQENLRSKYCDNCHQCDYEIEYADRSSSVGVLAKDE 286 Query: 957 LYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIINNV 1136 L L +ANG+ K VVFGCAYDQQG LLNT+ TDGILGLSRA IS PSQLAS G+INNV Sbjct: 287 LQLVLANGTGTKPNVVFGCAYDQQGTLLNTLASTDGILGLSRAPISLPSQLASHGLINNV 346 Query: 1137 VGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGLGNV 1316 +GHCL T+ + GGYLFLG+DFVP RM+WVPML + N YQA+++K++YGG+ + LG+ Sbjct: 347 IGHCLRTD-TNGGYLFLGNDFVPQWRMSWVPMLNNPFPNLYQAQLMKMNYGGKDLQLGSR 405 Query: 1317 GNGR--LVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKSPSRSV 1490 G G+ 
+VFDSGS+Y+YFT+QAY L+++L++ISSE L++D SDT+LPICW+ K P RS+ Sbjct: 406 GYGQDSVVFDSGSTYTYFTDQAYKALISMLEEISSEDLIKDASDTTLPICWRAKFPVRSI 465 Query: 1491 KDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILG 1670 ++VRQ FKPLNLQFGSKW ++STKL IP EGYL + K NVCLGIL+G NVHDGS ILG Sbjct: 466 EEVRQFFKPLNLQFGSKWRVVSTKLWIPAEGYLTISEKSNVCLGILDGSNVHDGSAIILG 525 Query: 1671 DISLRGLLFVYDNVNEKIGWVRSDCARPRRFESHSLF 1781 DISLRG LFVYDNVN+KIGW+RS+C RP S F Sbjct: 526 DISLRGQLFVYDNVNQKIGWIRSNCERPENVPSLPFF 562 >gb|EXB56943.1| Aspartic proteinase Asp1 [Morus notabilis] Length = 569 Score = 631 bits (1627), Expect = e-178 Identities = 326/577 (56%), Positives = 403/577 (69%), Gaps = 14/577 (2%) Frame = +3 Query: 93 NERPPQL--TVIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESPPVQ---NFAX 257 ++ PPQ+ VIITLPPPDNPSLGKTITAFTLS+ Q + P+Q N Sbjct: 3 SDHPPQIKGVVIITLPPPDNPSLGKTITAFTLSNSSPTQTHQESQNQNNLPIQSPQNPQL 62 Query: 258 XXXXXXXXXXXXXXXTVLPVLGISVIALYLWVSVSRETLFQLRDELGNDDEHQKNNSQHT 437 + +LGIS+ L L+ V + + R NDDE + Sbjct: 63 QFPFPRLRLFHGVPRRLFALLGISIFTLVLFSHVFPTVVEEFRRS--NDDE-----GPES 115 Query: 438 FLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAMSVGEVNSKLVSTASRID 617 F+ PLY K V G D E+KLGR V FD D + +VN KLVS+ +++D Sbjct: 116 FIFPLYSKLG--VPGKK-DVELKLGRFVDFDKENAGVSFGDRVKTQKVN-KLVSSTAKVD 171 Query: 618 ATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQCDAPCTSCAKGAHPFY 797 +++++PVRGN+YPDGLYYT + GNPPRPY LDMDTGSDLTWIQCDAPCTSCAKGA+P Y Sbjct: 172 SSAILPVRGNVYPDGLYYTQILVGNPPRPYHLDMDTGSDLTWIQCDAPCTSCAKGANPLY 231 Query: 798 KPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLARDELYLNIAN 977 KP K NI+P KDS+C EI+R+Q C +C QCDYEI+YAD SSS+GVLA+D L+L + N Sbjct: 232 KPTKGNIVPSKDSFCTEIRRNQKPGHCKTCQQCDYEIQYADRSSSLGVLAKDGLHLVMEN 291 Query: 978 GSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIINNVVGHCLAT 1157 GSLA VVFGCAYDQQGLLLNT+ KTDGILGLSRAK+S PSQLAS+GII NVVGHCL T Sbjct: 292 GSLANVNVVFGCAYDQQGLLLNTLAKTDGILGLSRAKVSLPSQLASKGIIKNVVGHCLTT 351 Query: 1158 EPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGLGNVGN--GRL 1331 GGGY+FLGDDFVPH M+W+PML+S + + YQ+E+V 
++YG + LG + +L Sbjct: 352 NAGGGGYMFLGDDFVPHWGMSWIPMLRSPSMDFYQSEIVSINYGSSALNLGAWSSKARQL 411 Query: 1332 VFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKSP-------SRSV 1490 VFDSGSSY+YF ++AY+ LL L+++S+ LVRD SD SLPICW+ ++P RSV Sbjct: 412 VFDSGSSYTYFNKRAYSALLASLEEVSTTGLVRDRSDPSLPICWRAETPLNCIHMECRSV 471 Query: 1491 KDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILG 1670 DV++ FK + LQFGSKWWI+ST+L+IPPEGYL +SKGNVCLGIL+G VHDG T ILG Sbjct: 472 ADVKRFFKTITLQFGSKWWIISTRLRIPPEGYLTISSKGNVCLGILDGSKVHDGYTTILG 531 Query: 1671 DISLRGLLFVYDNVNEKIGWVRSDCARPRRFESHSLF 1781 DISLRG L VYDN N+KIGW SDC +PRRF+S F Sbjct: 532 DISLRGHLVVYDNENQKIGWTNSDCVKPRRFDSLPFF 568 >ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera] Length = 686 Score = 617 bits (1590), Expect = e-174 Identities = 323/595 (54%), Positives = 409/595 (68%), Gaps = 36/595 (6%) Frame = +3 Query: 105 PQL--TVIITLPPPDNPSLGKTITAFTLSD------------------------------ 188 PQL VIITLPPPDNPSLGKTITAFTLSD Sbjct: 124 PQLKGVVIITLPPPDNPSLGKTITAFTLSDPPLDRPHHTHQQLQRQQHQEEEEEEEEEEE 183 Query: 189 --HQTPTPQSPPQVDESPPVQNFAXXXXXXXXXXXXXXXXTVLPVLGISVIALYLWVSVS 362 HQ P+P SPP P F+ ++ LG+S+ LW S Sbjct: 184 EPHQLPSP-SPPN-----PALQFSVRKLSLGNPRI------LMGFLGVSLFVFLLWNFAS 231 Query: 363 RETLFQLRDELGNDDEHQKNNSQHTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPI 542 L +LR + NDD + F+LPLY K +R LGD E+KLG+ V F Sbjct: 232 SSPLVELRRK--NDDREPTS-----FILPLYPKLGSR---SLGDLELKLGKFVDF----- 276 Query: 543 SKELDDAMSVGEVNSKLVSTASRIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMD 722 ++D M G +N KL ++ S D++++ PVRG++YP+GLY+T++ G+PPR YFLDMD Sbjct: 277 --HVND-MKPGGIN-KLATSVSAFDSSTIFPVRGDVYPNGLYFTHIFVGSPPRRYFLDMD 332 Query: 723 TGSDLTWIQCDAPCTSCAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDY 902 TGSDLTWIQCDAPCTSCAKG +P YKP K N++P KDS CVE+QR+ T C++C QCDY Sbjct: 333 TGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNLVPLKDSLCVEVQRNLKTGYCETCEQCDY 392 Query: 903 EIEYADHSSSVGVLARDELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSR 1082 EIEYADHSSS+GVLA D+L+L +ANGSL K ++FGCAYDQQGLLLN++ KTDGILGLS+ Sbjct: 393 
EIEYADHSSSMGVLASDDLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGILGLSK 452 Query: 1083 AKISFPSQLASQGIINNVVGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQ 1262 AK+S PSQLASQ IINNV+GHCL ++ +GGGY+FLGDDFVP+ M WVPML SH+ N Y Sbjct: 453 AKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFLGDDFVPYWGMAWVPMLNSHSPN-YH 511 Query: 1263 AEMVKVSYGGRQIGLGNVG--NGRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDM 1436 ++++K+S+G RQ+ LG R+VFD+GSSY+YF ++AY L+ L D+S E L++D Sbjct: 512 SQIMKISHGSRQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDG 571 Query: 1437 SDTSLPICWQVKSPSRSVKDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVC 1616 SD +LP+CW+ K P RSV DV+Q F+PL LQF SKWWI+STK +IPPEGYL+ ++KGNVC Sbjct: 572 SDPTLPVCWRAKFPIRSVIDVKQFFQPLTLQFRSKWWIVSTKFRIPPEGYLIISNKGNVC 631 Query: 1617 LGILNGRNVHDGSTFILGDISLRGLLFVYDNVNEKIGWVRSDCARPRRFESHSLF 1781 LGIL+G NVHDGST ILGDISLRG L VYDNVN+KIGW +S C +P++ +S F Sbjct: 632 LGILDGSNVHDGSTIILGDISLRGKLVVYDNVNQKIGWAQSTCVKPQKIKSLPFF 686 >ref|XP_006440945.1| hypothetical protein CICLE_v10019473mg [Citrus clementina] gi|557543207|gb|ESR54185.1| hypothetical protein CICLE_v10019473mg [Citrus clementina] Length = 577 Score = 615 bits (1587), Expect = e-173 Identities = 324/575 (56%), Positives = 399/575 (69%), Gaps = 13/575 (2%) Frame = +3 Query: 84 EETNERPPQLT--VIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESPPVQNFAX 257 +E+ PPQLT VIITLPPP+NPSLGKTITA+TL+D+ + Q+ + + P+ Sbjct: 4 DESPSPPPQLTGVVIITLPPPNNPSLGKTITAYTLTDNSPQSQQTRHRQQQEHPLPPQLH 63 Query: 258 XXXXXXXXXXXXXXXTVLP-----VLGISVIALYLWVSVSRETLFQLRDELGNDDEHQKN 422 LP L IS+ AL L+ SV TL Q R + NDDE++++ Sbjct: 64 PPQNSQFNFSLPMLFPGLPRKLFLFLAISIFALILYGSVFSYTL-QDRYKSNNDDENKES 122 Query: 423 NSQHTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAM---SVGEVNSKL 593 F+ PLY K R D E KLGR V D + ++D + ++N KL Sbjct: 123 -----FVFPLYHKFGIREVSQR-DAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKL 176 Query: 594 VST-ASRIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQCDAPCTS 770 VS+ A +D++S+ P+RGNIYPDGLY+TY+ GNPPRPY+LDMDTGSDLTWIQCDAPC+S Sbjct: 177 VSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS 236 Query: 771 
CAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLAR 950 CAKGA+P YKP NI+P KDS C+EIQR+ C++C QCDYEIEYADHSSS+GVLAR Sbjct: 237 CAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLAR 296 Query: 951 DELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIIN 1130 DEL+L I NGSL K VVFGCAYDQQGLLLNT+ KTDGILGLSRAK+S PSQLASQGII Sbjct: 297 DELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK 356 Query: 1131 NVVGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGLG 1310 NVVGHCL T GGGY+FLG D VP M WVPML S Y E++K++YG + LG Sbjct: 357 NVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLG 416 Query: 1311 --NVGNGRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKSPSR 1484 N G +FD+GSSY+YFT+QAY+ L+ L ++SS+ LV D SD +LP+CW+ K P R Sbjct: 417 ARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIR 476 Query: 1485 SVKDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFI 1664 S+ DV+Q FK L L FGSKW I+STK +I PEGYLV + KGN+CLGIL+G VH+GST I Sbjct: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFRISPEGYLVISKKGNICLGILDGSEVHNGSTII 536 Query: 1665 LGDISLRGLLFVYDNVNEKIGWVRSDCARPRRFES 1769 LGDISLRG L VYDNVN++IGW +S C P RF+S Sbjct: 537 LGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKS 571 >ref|XP_006485774.1| PREDICTED: aspartic proteinase Asp1-like [Citrus sinensis] Length = 577 Score = 615 bits (1586), Expect = e-173 Identities = 325/575 (56%), Positives = 396/575 (68%), Gaps = 13/575 (2%) Frame = +3 Query: 84 EETNERPPQLT--VIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESPPVQNFAX 257 +E+ PPQLT VIITLPPP+NPSLGKTITA+TL+D+ + Q+ Q + P+ Sbjct: 4 DESPSPPPQLTGVVIITLPPPNNPSLGKTITAYTLTDNSPQSQQTHHQQQQEHPLPAQLH 63 Query: 258 XXXXXXXXXXXXXXXTVLP-----VLGISVIALYLWVSVSRETLFQLRDELGNDDEHQKN 422 VLP L IS+ AL L+ SV TL Q R + NDDE++++ Sbjct: 64 PPQDSQFNFSLPMLFPVLPRKLFLFLAISIFALILYGSVFSYTL-QHRYKSNNDDENKES 122 Query: 423 NSQHTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAM---SVGEVNSKL 593 F+ PLY K R D E KLGR V D + ++D + ++N KL Sbjct: 123 -----FVFPLYHKFGIREVLQR-DAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKL 176 Query: 594 
V-STASRIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQCDAPCTS 770 V S A +D++S P+RGN+YPDGLY+TY+ GNPPRPY+LDMDTGSDLTWIQCDAPC+S Sbjct: 177 VPSNAVAVDSSSTFPLRGNVYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS 236 Query: 771 CAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLAR 950 CAKGA+P YKP NI+P KDS C+EIQR+ C++C QCDYEIEYADHSSS+GVLAR Sbjct: 237 CAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLAR 296 Query: 951 DELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIIN 1130 DEL+L I NGSL K VVFGCAYDQQGLLLNT+ KTDGILGLSRAK+S PSQLASQGII Sbjct: 297 DELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK 356 Query: 1131 NVVGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGLG 1310 NVVGHCL T GGGY+FLG D VP M WVPML S Y E++K++YG + LG Sbjct: 357 NVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLG 416 Query: 1311 --NVGNGRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKSPSR 1484 N G +FD+GSSY+YFT+QAY+ L+ L ++SS LV D SD +LP+CW+ K P R Sbjct: 417 ARNSRVGWALFDTGSSYTYFTKQAYSELIASLKEVSSNGLVLDASDPTLPVCWRAKFPIR 476 Query: 1485 SVKDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFI 1664 S+ DV+Q FK L L FGSKW I+STK I PEGYLV + KGN+CLGIL+G VH+GST I Sbjct: 477 SIVDVKQYFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536 Query: 1665 LGDISLRGLLFVYDNVNEKIGWVRSDCARPRRFES 1769 LGDISLRG L VYDNVN++IGW +S C P RF+S Sbjct: 537 LGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKS 571 >gb|EOY21001.1| Eukaryotic aspartyl protease family protein, putative isoform 1 [Theobroma cacao] Length = 576 Score = 611 bits (1576), Expect = e-172 Identities = 316/582 (54%), Positives = 399/582 (68%), Gaps = 21/582 (3%) Frame = +3 Query: 87 ETNERPPQLT--VIITLPPPDNPSLGKTITAFTLSD------HQTPTPQSPPQVDESPPV 242 +++ERP Q+T VIITLPP DNPSLGKTITAFTL++ HQT Q + P Sbjct: 2 DSDERPQQVTGVVIITLPPSDNPSLGKTITAFTLTNDVFPQSHQTQQRQQQEEEQTLPTT 61 Query: 243 QNFAXXXXXXXXXXXXXXXX--------TVLPVLGISVIALYLWVSVSRETLFQLRDELG 398 Q +L LGIS+ AL L+ S T +LR+ Sbjct: 62 QILTPAPPSAQNPQRGFSFLGLFSDNPRKLLGFLGISLFALLLYSSAFSNTFVELRNSNN 121 Query: 399 
NDDEHQKNNSQHTFLLPLYLKKPNRVNGDLG-DFEIKLGRRVSFDSRPISKELDD-AMSV 572 +DDE ++ F+ PLY K LG D E+KLGR V D + ++ A Sbjct: 122 DDDEKPQS-----FIFPLYHK--------LGADLELKLGRFVDVDKENLVASVEGGATGT 168 Query: 573 GEVNSKLVSTASRIDAT-SVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQ 749 ++N + S A+ ID++ +++PVRGN+YPDGLY+TY+ GNP R YFLD+DTGSDLTWIQ Sbjct: 169 QKINKLVASNAAVIDSSGTILPVRGNVYPDGLYFTYMLVGNPQRRYFLDIDTGSDLTWIQ 228 Query: 750 CDAPCTSCAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSS 929 CDAPC+SCAKGA+P YKP + NI+ KD C E+Q++Q ++C++C QCDYEIEYAD SS Sbjct: 229 CDAPCSSCAKGANPLYKPTRVNIVASKDLMCTEVQKNQKPQNCETCQQCDYEIEYADRSS 288 Query: 930 SVGVLARDELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQL 1109 S+GVLARDEL+L ANGS VVFGCAYDQQG+LLNT+ KTDGILGLSRAK+S PSQL Sbjct: 289 SLGVLARDELHLVTANGSTTNLDVVFGCAYDQQGILLNTLSKTDGILGLSRAKVSLPSQL 348 Query: 1110 ASQGIINNVVGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYG 1289 AS+GIINNVVGHCLAT+ GY+FLGDDFVP+ M+WVPML S ++ Y ++VK++YG Sbjct: 349 ASKGIINNVVGHCLATDVGASGYMFLGDDFVPNWGMSWVPMLGSPSTEFYHTQIVKINYG 408 Query: 1290 GRQIGLGNVGN--GRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICW 1463 + LG + GR+VFDSGSSY+YF +QAY L+ L ++S ++D++DT+LP+CW Sbjct: 409 SSSLSLGRQHSSIGRVVFDSGSSYTYFMKQAYAELVASLSEVSEVGFIQDVADTTLPMCW 468 Query: 1464 QVKSPSRSVKDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNV 1643 Q P R +KDV+Q FK L LQFGSKWWI+S + IPPEGYL+ + KGNVCLGIL+G V Sbjct: 469 QAPFPIRFIKDVKQFFKTLTLQFGSKWWIISKRFHIPPEGYLIISKKGNVCLGILDGSKV 528 Query: 1644 HDGSTFILGDISLRGLLFVYDNVNEKIGWVRSDCARPRRFES 1769 HDGST ILGDISLRG L VYDN KIGW +SDCA PRRF+S Sbjct: 529 HDGSTIILGDISLRGQLVVYDNEKLKIGWTQSDCAHPRRFKS 570 >ref|XP_002511959.1| protein with unknown function [Ricinus communis] gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis] Length = 583 Score = 607 bits (1566), Expect = e-171 Identities = 320/588 (54%), Positives = 406/588 (69%), Gaps = 21/588 (3%) Frame = +3 Query: 81 MEETNERPPQLTVIITLPPPDNPSLGKTITAFTLSD--HQTPTPQSPPQVDESP------ 236 ME ++ VII+LPPP+NPSLGKTITAFTL+D H PQS ++ P 
Sbjct: 1 MESDDQSSHVKVVIISLPPPNNPSLGKTITAFTLTDDDHDATYPQSHQNHEQEPSIIQTH 60 Query: 237 -----PVQNFAXXXXXXXXXXXXXXXXTVLP-----VLGISVIALYLWVSVSRETLFQLR 386 PVQ+ + P +L IS+ A+ ++ S+ TL +L+ Sbjct: 61 RESQLPVQSPSLPPQNPQIQFSFSGLYFSTPRKLLFLLCISLFAVIVYRSLFSNTLLELK 120 Query: 387 DELGNDDEHQKNNSQHTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAM 566 + +DD +K S F+ PLY K R + E K R V +S S DD + Sbjct: 121 --VSDDDNDEKTKS---FIFPLYHKFGIREISQ-SNLEHKSIRSVYKESLVASVNDDDVI 174 Query: 567 SVGEVNSKLVST-ASRIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTW 743 V N KL S+ A+ +D++SV PVRGN+YPDGLY+TY+ GNPPRPY+LD+DT SDLTW Sbjct: 175 -VPNRNYKLASSNAAAVDSSSVFPVRGNVYPDGLYFTYILVGNPPRPYYLDIDTASDLTW 233 Query: 744 IQCDAPCTSCAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADH 923 IQCDAPCTSCAKGA+ YKP + NI+ PKDS CVE+ R+Q C++C QCDYEIEYADH Sbjct: 234 IQCDAPCTSCAKGANALYKPRRDNIVTPKDSLCVELHRNQKAGYCETCQQCDYEIEYADH 293 Query: 924 SSSVGVLARDELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPS 1103 SSS+GVLARDEL+L +ANGS K FGCAYDQQGLLLNT+ KTDGILGLS+AK+S PS Sbjct: 294 SSSMGVLARDELHLTMANGSSTNLKFNFGCAYDQQGLLLNTLVKTDGILGLSKAKVSLPS 353 Query: 1104 QLASQGIINNVVGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVS 1283 QLA++GIINNVVGHCLA + GGGY+FLGDDFVP M+WVPML S + +SYQ +++K++ Sbjct: 354 QLANRGIINNVVGHCLANDVVGGGYMFLGDDFVPRWGMSWVPMLDSPSIDSYQTQIMKLN 413 Query: 1284 YGGRQIGLGNVGN--GRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPI 1457 YG + LG R+VFDSGSSY+YFT++AY+ L+ L +S E+L++D SD +LP Sbjct: 414 YGSGPLSLGGQERRVRRIVFDSGSSYTYFTKEAYSELVASLKQVSGEALIQDTSDPTLPF 473 Query: 1458 CWQVKSPSRSVKDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGR 1637 CW+ K P RSV DV+Q FK L LQFGSKWWI+STK +IPPEGYL+ ++KGNVCLGIL+G Sbjct: 474 CWRAKFPIRSVIDVKQYFKTLTLQFGSKWWIISTKFRIPPEGYLIISNKGNVCLGILDGS 533 Query: 1638 NVHDGSTFILGDISLRGLLFVYDNVNEKIGWVRSDCARPRRFESHSLF 1781 +VHDGS+ ILGDISLRG L +YDNVN KIGW +SDC +P+ F + F Sbjct: 534 DVHDGSSIILGDISLRGQLIIYDNVNNKIGWTQSDCIKPKTFSTLPFF 581 >emb|CBI15437.3| unnamed protein product [Vitis vinifera] Length = 473 Score = 584 bits (1506), Expect = e-164 
Identities = 287/490 (58%), Positives = 369/490 (75%), Gaps = 2/490 (0%) Frame = +3 Query: 318 LGISVIALYLWVSVSRETLFQLRDELGNDDEHQKNNSQHTFLLPLYLKKPNRVNGDLGDF 497 LG+S+ LW S L +LR + NDD + F+LPLY K +R LGD Sbjct: 4 LGVSLFVFLLWNFASSSPLVELRRK--NDDREPTS-----FILPLYPKLGSR---SLGDL 53 Query: 498 EIKLGRRVSFDSRPISKELDDAMSVGEVNSKLVSTASRIDATSVIPVRGNIYPDGLYYTY 677 E+KLG+ V F ++D M G +N KL ++ S D++++ PVRG++YP+GLY+T+ Sbjct: 54 ELKLGKFVDF-------HVND-MKPGGIN-KLATSVSAFDSSTIFPVRGDVYPNGLYFTH 104 Query: 678 LHFGNPPRPYFLDMDTGSDLTWIQCDAPCTSCAKGAHPFYKPVKANIIPPKDSYCVEIQR 857 + G+PPR YFLDMDTGSDLTWIQCDAPCTSCAKG +P YKP K N++P KDS CVE+QR Sbjct: 105 IFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNLVPLKDSLCVEVQR 164 Query: 858 SQITKDCDSCHQCDYEIEYADHSSSVGVLARDELYLNIANGSLAKSKVVFGCAYDQQGLL 1037 + T C++C QCDYEIEYADHSSS+GVLA D+L+L +ANGSL K ++FGCAYDQQGLL Sbjct: 165 NLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANGSLTKLGIMFGCAYDQQGLL 224 Query: 1038 LNTMGKTDGILGLSRAKISFPSQLASQGIINNVVGHCLATEPSGGGYLFLGDDFVPHSRM 1217 LN++ KTDGILGLS+AK+S PSQLASQ IINNV+GHCL ++ +GGGY+FLGDDFVP+ M Sbjct: 225 LNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFLGDDFVPYWGM 284 Query: 1218 TWVPMLQSHNSNSYQAEMVKVSYGGRQIGLGNVG--NGRLVFDSGSSYSYFTEQAYNNLL 1391 WVPML SH+ N Y ++++K+S+G RQ+ LG R+VFD+GSSY+YF ++AY L+ Sbjct: 285 AWVPMLNSHSPN-YHSQIMKISHGSRQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALV 343 Query: 1392 TVLDDISSESLVRDMSDTSLPICWQVKSPSRSVKDVRQLFKPLNLQFGSKWWIMSTKLQI 1571 L D+S E L++D SD +LP+CW+ K P RSV DV+Q F+PL LQF SKWWI+STK +I Sbjct: 344 ASLKDVSDEGLIQDGSDPTLPVCWRAKFPIRSVIDVKQFFQPLTLQFRSKWWIVSTKFRI 403 Query: 1572 PPEGYLVTNSKGNVCLGILNGRNVHDGSTFILGDISLRGLLFVYDNVNEKIGWVRSDCAR 1751 PPEGYL+ ++KGNVCLGIL+G NVHDGST ILGDISLRG L VYDNVN+KIGW +S C + Sbjct: 404 PPEGYLIISNKGNVCLGILDGSNVHDGSTIILGDISLRGKLVVYDNVNQKIGWAQSTCVK 463 Query: 1752 PRRFESHSLF 1781 P++ +S F Sbjct: 464 PQKIKSLPFF 473 >ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus] gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus] Length = 570 
Score = 584 bits (1505), Expect = e-164 Identities = 297/572 (51%), Positives = 384/572 (67%), Gaps = 17/572 (2%) Frame = +3 Query: 117 VIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESPPVQNFAXXXXXXXXXXXXXX 296 V+ITLPPPDNPSLGK++TAFTL+D P VD+ N Sbjct: 10 VVITLPPPDNPSLGKSVTAFTLTDDFPEPPGESVAVDQEVQQPNNDHLTLPPNLPIQAPL 69 Query: 297 XXTVLP---------------VLGISVIALYLWVSVSRETLFQLRDELGNDDEHQKNNSQ 431 +P VLGI++ A+YL+ S ET+ +LR NDD+ + Sbjct: 70 SQRSIPLSRELFAGTPRKLVFVLGIALAAVYLYASNFPETIRELRRSERNDDDRPSS--- 126 Query: 432 HTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAMSVGEVNSKLVSTASR 611 FL PLY + GD DF++KLGR V + + +D + V + SKL+S + + Sbjct: 127 --FLFPLYFQSEL---GDSSDFQLKLGRTVRVNKDDLGVRFNDVLGVPKP-SKLISASLK 180 Query: 612 IDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQCDAPCTSCAKGAHP 791 D+++V PVRG+IYPDGLYYTY+ G PPRPYFLD+DTGSDLTW+QCDAPC+SC KG P Sbjct: 181 SDSSAVFPVRGDIYPDGLYYTYIMVGEPPRPYFLDIDTGSDLTWVQCDAPCSSCGKGRSP 240 Query: 792 FYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLARDELYLNI 971 YKP + N++ KDS C+E+QR+ C +C QC+YE++YAD SSS+GVL +DE L Sbjct: 241 LYKPRRENVVSFKDSLCMEVQRNYDGDQCAACQQCNYEVQYADQSSSLGVLVKDEFTLRF 300 Query: 972 ANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIINNVVGHCL 1151 +NGSL K +FGCAYDQQGLLLNT+ KTDGILGLSRAK+S PSQLAS+GIINNVVGHCL Sbjct: 301 SNGSLTKLNAIFGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQLASRGIINNVVGHCL 360 Query: 1152 ATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGLGNVGNGR- 1328 +P+GGGYLFLGDDFVP M WV ML S + + YQ ++V++ YG + L G+ R Sbjct: 361 TGDPAGGGYLFLGDDFVPQWGMAWVAMLDSPSIDFYQTKVVRIDYGSIPLSLDTWGSSRE 420 Query: 1329 -LVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKSPSRSVKDVRQ 1505 +VFDSGSSY+YFT++AY L+ L+++S+ L+ + D+S ICW+ + RSVKDV+ Sbjct: 421 QVVFDSGSSYTYFTKEAYYQLVANLEEVSAFGLI--LQDSSDTICWKTEQSIRSVKDVKH 478 Query: 1506 LFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILGDISLR 1685 FKPL LQFGS++W++STKL I PE YL+ N +GNVCLGIL+G VHDGST ILGD +LR Sbjct: 479 FFKPLTLQFGSRFWLVSTKLVILPENYLLINKEGNVCLGILDGSQVHDGSTIILGDNALR 538 Query: 1686 GLLFVYDNVNEKIGWVRSDCARPRRFESHSLF 1781 G L 
VYDNVN++IGW SDC PR+ + LF Sbjct: 539 GKLVVYDNVNQRIGWTSSDCHNPRKIKHLPLF 570 >ref|XP_002328687.1| predicted protein [Populus trichocarpa] gi|566206181|ref|XP_006374352.1| aspartyl protease family protein [Populus trichocarpa] gi|550322111|gb|ERP52149.1| aspartyl protease family protein [Populus trichocarpa] Length = 603 Score = 560 bits (1444), Expect = e-157 Identities = 308/620 (49%), Positives = 392/620 (63%), Gaps = 53/620 (8%) Frame = +3 Query: 81 MEETNERPPQL--TVIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDES------- 233 ME +++ PQL VII+LPPPDNPSLGKTITAFTL+++ P PQ + Sbjct: 1 MESDDDQSPQLKGVVIISLPPPDNPSLGKTITAFTLTNNDYPQSHQTPQTHQEDQLPISS 60 Query: 234 ---PPVQNFAXXXXXXXXXXXXXXXXTVLPVLGISVIALYLWVSVSRETLFQLRDELGND 404 PP QN +L + IS+ AL ++ S+ T +L+ ND Sbjct: 61 PPPPPSQN--SQLQFPSSRLFLGTPRKLLSFVFISLFALAIYSSLFTNTFQELKSN-NND 117 Query: 405 DEHQKNNSQHTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAMSVGEVN 584 D+ QK S ++ PLY K LG EI L + R + KE + SV +N Sbjct: 118 DDDQKPKS---YVFPLYHK--------LGIREIPLNDLENHLRRFVYKE-NLVASVDHLN 165 Query: 585 -----SKLVST--ASRIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTW 743 SKL S+ A+ +D++++ PVRGN+YPDG PP+PY+LD DTGSDLTW Sbjct: 166 GPHKISKLASSNAAAAMDSSAIFPVRGNLYPDG----------PPQPYYLDFDTGSDLTW 215 Query: 744 IQCDAPCTSCAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADH 923 IQCDAPCTSCAKGA+ +YKP + NI+PPKD C+E+QR+Q C++C QCDYEIEYADH Sbjct: 216 IQCDAPCTSCAKGANAWYKPRRGNIVPPKDLLCMEVQRNQKAGYCETCDQCDYEIEYADH 275 Query: 924 SSSVGVLARDELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPS 1103 SSS+GVLA D+L L +ANGSL K +FGCAYDQQGLLL T+ KTDGILGLSRAK+S PS Sbjct: 276 SSSMGVLATDKLLLMVANGSLTKLNFIFGCAYDQQGLLLKTLVKTDGILGLSRAKVSLPS 335 Query: 1104 QLASQGIINNVVGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVS 1283 QLASQGIINNV+GHCL T+ GGGY+FLGDDFVP M WVPML S + Y E+VK++ Sbjct: 336 QLASQGIINNVIGHCLTTDLGGGGYMFLGDDFVPRWGMAWVPMLDSPSMEFYHTEVVKLN 395 Query: 1284 YGGRQIGLGNVGN--GRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPI 1457 YG + LG + + ++FDSGSSY+YF ++AY+ L+ L+++S LV+ SDT+LP+ Sbjct: 396 
YGSSPLSLGGMESRVKHILFDSGSSYTYFPKEAYSELVASLNEVSGAGLVQSTSDTTLPL 455 Query: 1458 CWQVKSPSRSV--------------------------------KDVRQLFKPLNLQFGSK 1541 CW+ P R DV++ FK L QFG+K Sbjct: 456 CWRANFPIRKFIYRTELTRPIRRRRRRRRRRRRRRRRRRQHIKGDVKKFFKTLTFQFGTK 515 Query: 1542 WWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILGDISLRGLLFVYDNVNEK 1721 W ++STK +IPPEGYL+ + KGNVCLGIL G VHDGST ILGDISLRG L VYDNVN+K Sbjct: 516 WLVISTKFRIPPEGYLMMSDKGNVCLGILEGSKVHDGSTIILGDISLRGQLVVYDNVNKK 575 Query: 1722 IGWVRSDCARPRRFESHSLF 1781 IGW SDCA+P+R +S F Sbjct: 576 IGWTPSDCAKPKRSDSLQFF 595 >ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata] gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata] Length = 578 Score = 556 bits (1432), Expect = e-155 Identities = 291/570 (51%), Positives = 392/570 (68%), Gaps = 15/570 (2%) Frame = +3 Query: 117 VIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESPPVQ----NFAXXXXXXXXXX 284 VIITLPP D+PS GKTI+AFTL+DH P Q PP+ + +P Q + Sbjct: 15 VIITLPPSDDPSQGKTISAFTLNDHDYPL-QIPPEDNPNPSFQPDPLHQNQQSRLLFSDL 73 Query: 285 XXXXXXTVLPVLGISVIALYLWVSVSRET--LFQLRDELGNDDEHQKNNSQHTFLLPLYL 458 VL +LG S++A+ + SV + +F++ DE DD+ + + +F+ P+Y Sbjct: 74 SMGSPRLVLGLLGFSLLAVAFYASVFPNSVQMFRVSDERNRDDDSSRETT--SFVFPVYH 131 Query: 459 KKPNRVNGDLGDFEIK-LGRRVSFDSRPISKELD-DAMSVGEVNSKLVSTASRIDA-TSV 629 K R +F + L + ++ + +D + ++ +VN L ++A ID+ T++ Sbjct: 132 KLRAR------EFHERILAEDLGLENGKFVESMDLELVNPVKVNDVLSTSAGSIDSSTTI 185 Query: 630 IPVRGNIYPDGLYYTYLHFGNPP--RPYFLDMDTGSDLTWIQCDAPCTSCAKGAHPFYKP 803 PV GN+YPDGLYYT + G P + Y LD+DTGSDLTWIQCDAPCTSCAKGA+ YKP Sbjct: 186 FPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKGANQLYKP 245 Query: 804 VKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLARDELYLNIANGS 983 K N++ + +CVE+QR+Q+T+ C+SCHQCDYEIEYADHS S+GVL +D+ +L + NGS Sbjct: 246 RKDNLVRSSEPFCVEVQRNQLTEHCESCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGS 305 Query: 984 LAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIINNVVGHCLATEP 1163 LA+S +VFGC YDQQGLLLNT+ KTDGILGLSRAKIS 
PSQLAS+GII+NVVGHCLA++ Sbjct: 306 LAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDL 365 Query: 1164 SGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGL-GNVGN-GRLVF 1337 +G GY+F+G D VP MTWVPML + YQ ++ K+SYG + L G G G+++F Sbjct: 366 NGEGYIFMGSDLVPSHGMTWVPMLHHPHLEVYQMQVTKMSYGNAMLSLDGENGRVGKVLF 425 Query: 1338 DSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVK--SPSRSVKDVRQLF 1511 D+GSSY+YF QAY+ L+T L ++S L RD SD +LPICW+ K SP S+ DV++ F Sbjct: 426 DTGSSYTYFPNQAYSQLVTSLQEVSDLELTRDDSDEALPICWRAKTNSPISSLSDVKKFF 485 Query: 1512 KPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILGDISLRGL 1691 +P+ LQ GSKW I+S KL I PE YL+ ++KGNVCLGIL+G NVHDGST I+GDIS+RG Sbjct: 486 RPITLQIGSKWLIISKKLLIQPEDYLIISNKGNVCLGILDGSNVHDGSTIIIGDISMRGR 545 Query: 1692 LFVYDNVNEKIGWVRSDCARPRRFESHSLF 1781 L VYDNV ++IGW++SDC RP F+ + F Sbjct: 546 LIVYDNVKQRIGWMKSDCVRPSEFDHNVPF 575 >ref|XP_006583400.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max] Length = 574 Score = 548 bits (1413), Expect = e-153 Identities = 287/578 (49%), Positives = 377/578 (65%), Gaps = 20/578 (3%) Frame = +3 Query: 81 MEETNERPPQLTVIITLPPPDNPSLGKTITAFTLSDHQTPTPQ--------------SPP 218 ME+ + VII+LPPPDNPSLGKTITAF S++ +P PQ Sbjct: 1 MEDDQSTQIKGVVIISLPPPDNPSLGKTITAFAFSNNPSPPPQLFIQPHQHQSQQTHPNA 60 Query: 219 QVDESPPVQNFAXXXXXXXXXXXXXXXXTV--LPVLGISVIALYLWVSVSRETLFQLRDE 392 Q + PP+Q++ V G + AL+L+ SVS T LR Sbjct: 61 QHNTDPPLQSYPSNPQLSFSFRRLFHSTPVKLFSFFGTLLFALFLYGSVSSTTTVDLRGR 120 Query: 393 LGNDDEHQKNNSQHTFLLPLYLKKPNRVNGDLG--DFEIKLGRRVSFDSRPISKELDDAM 566 + D+ + + FL PL+ K G LG D +++LG+ V + +++ D Sbjct: 121 KNDGDDDKATS----FLFPLFPKF-----GVLGQKDLKLQLGKLVQKEKFLTQRDVGDGS 171 Query: 567 SVGEVNSKLVSTASRIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWI 746 V V D++SV PV GN+YPDGLY+T L GNPP+ YFLD+DTGSDLTW+ Sbjct: 172 GVVAV-----------DSSSVFPVSGNVYPDGLYFTILRVGNPPKSYFLDVDTGSDLTWM 220 Query: 747 QCDAPCTSCAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCD-SCHQCDYEIEYADH 923 QCDAPC SC KGAH YKP ++N++ DS C+++Q++Q D S QCDYEI+YADH Sbjct: 221 
QCDAPCRSCGKGAHVQYKPTRSNVVSSVDSLCLDVQKNQKNGHHDESLLQCDYEIQYADH 280 Query: 924 SSSVGVLARDELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPS 1103 SSS+GVL RDEL+L NGS K VVFGC YDQ+GL+LNT+ KTDGI+GLSRAK+S P Sbjct: 281 SSSLGVLVRDELHLVTTNGSKTKLNVVFGCGYDQEGLILNTLAKTDGIMGLSRAKVSLPY 340 Query: 1104 QLASQGIINNVVGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVS 1283 QLAS+G+I NVVGHCL+ + +GGGY+FLGDDFVP+ M WVPM + ++ YQ E++ ++ Sbjct: 341 QLASKGLIKNVVGHCLSNDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGIN 400 Query: 1284 YGGRQIGL-GNVGNGRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPIC 1460 YG RQ+ G G++ FDSGSSY+YF ++AY +L+ L+++S LV+D SDT+LPIC Sbjct: 401 YGNRQLKFDGQSKVGKVFFDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLPIC 460 Query: 1461 WQVKSPSRSVKDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRN 1640 WQ RS+KDV+ FK L L+FGSKWWI+ST QIPPEGYL+ ++KG+VCLGIL+G Sbjct: 461 WQANFQIRSIKDVKDYFKTLTLRFGSKWWILSTLFQIPPEGYLIISNKGHVCLGILDGSK 520 Query: 1641 VHDGSTFILGDISLRGLLFVYDNVNEKIGWVRSDCARP 1754 V+DGS+ ILGDISLRG VYDNV +KIGW R+DC P Sbjct: 521 VNDGSSIILGDISLRGYSVVYDNVKQKIGWKRADCGMP 558 >gb|ESW24775.1| hypothetical protein PHAVU_004G159200g [Phaseolus vulgaris] Length = 572 Score = 548 bits (1412), Expect = e-153 Identities = 286/569 (50%), Positives = 376/569 (66%), Gaps = 17/569 (2%) Frame = +3 Query: 105 PQL--TVIITLPPPDNPSLGKTITAFTLSDHQTPTP--------QSPPQVDE----SPPV 242 PQ+ VII+LPPPDNPSLGKTITAFT SD +P P Q ++E PP+ Sbjct: 7 PQIKGVVIISLPPPDNPSLGKTITAFTFSDPSSPQPSLLLQQSHQHQTNINEYNNTDPPL 66 Query: 243 QNFAXXXXXXXXXXXXXXXXTV--LPVLGISVIALYLWVSVSRETLFQLRDELGNDDEHQ 416 ++ V G+ + AL+L+ SVS T +L + D+ Sbjct: 67 HSYPSNAQLGFSRRRLFHRTPVRFFSFFGVFLFALFLYGSVSSTTTLELSGPKNDGDDDG 126 Query: 417 KNNSQHTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAMSVGEVNSKLV 596 K S +L PLY K G LG +KL + + KE V S++V Sbjct: 127 KPGS---YLFPLYPKF-----GVLGQKNMKLQL-----GKLVHKEKLLTQRKYRVGSEVV 173 Query: 597 STASRIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQCDAPCTSCA 776 + +D++SV PV GN++PDGLY+T L GNPPR YFLD+DTGSDLTW+QCDAPC SC Sbjct: 174 
A----VDSSSVFPVSGNVFPDGLYFTILRVGNPPRSYFLDVDTGSDLTWMQCDAPCISCG 229 Query: 777 KGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLARDE 956 KGAH YKP ++N++P DS C+++Q++Q +S QCDY+IEYAD SSS+GVL RDE Sbjct: 230 KGAHAQYKPTRSNVVPSMDSLCLDVQKNQKDGHHESLQQCDYQIEYADQSSSLGVLIRDE 289 Query: 957 LYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIINNV 1136 L+L NGS K VFGC YDQ+GLLLNT+ KTDGILGLSRAK+S P QLAS+G+I NV Sbjct: 290 LHLVTTNGSKTKLNFVFGCGYDQEGLLLNTLAKTDGILGLSRAKVSLPYQLASKGLIKNV 349 Query: 1137 VGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGL-GN 1313 VGHCL+ + GGGY+FLGDDF+P+ MTWVPM + ++ YQ E++ ++YG RQ+ G Sbjct: 350 VGHCLSNDEVGGGYMFLGDDFLPYWGMTWVPMAYTLTTDLYQTEILGINYGNRQLSFDGQ 409 Query: 1314 VGNGRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKSPSRSVK 1493 G++VFDSGSSY+YF ++AY +L+ L+++S L++D SDT+LPICW+ P +SVK Sbjct: 410 SKVGKVVFDSGSSYTYFPKEAYLDLVASLNEVSGLRLIQDDSDTTLPICWEANFPIKSVK 469 Query: 1494 DVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILGD 1673 DV+ FK + L+FGSKWWI+ST QI PEGYL+ ++KG+VCLGIL+G NV+DGS+ ILGD Sbjct: 470 DVKDYFKTITLRFGSKWWILSTMFQIAPEGYLIISNKGHVCLGILDGSNVNDGSSIILGD 529 Query: 1674 ISLRGLLFVYDNVNEKIGWVRSDCARPRR 1760 IS RG L VYDN +KIGW R++C R Sbjct: 530 ISFRGYLVVYDNSKQKIGWKRAECGMSSR 558 >ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana] gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana] gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana] gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana] gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana] Length = 583 Score = 548 bits (1411), Expect = e-153 Identities = 289/584 (49%), Positives = 396/584 (67%), Gaps = 16/584 (2%) Frame = +3 Query: 78 LMEETNERPPQLTVIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESPPVQ---- 245 L ++ ++ VIITLPP D+PS GKTI+AFTL+DH P + PP+ + +P Q Sbjct: 5 LHDQQQQQRVHSVVIITLPPSDDPSQGKTISAFTLTDHDYPL-EIPPEDNPNPSFQPDPL 63 Query: 246 NFAXXXXXXXXXXXXXXXXTVLPVLGISVIALYLWVSVSRETLFQLR---DELGNDDEHQ 416 + VL +LGIS++A+ + SV ++ 
R DE DD+ Sbjct: 64 HRNQQSRLLFSDLSMNSPRLVLGLLGISLLAVAFYASVFPNSVQMFRVSPDERNRDDDDN 123 Query: 417 KNNSQHTFLLPLYLKKPNRVNGDLGDFEIK-LGRRVSFDSRPISKELD-DAMSVGEVNSK 590 + +F+ P+Y K R +F + L + ++ + +D + ++ +VN Sbjct: 124 LRETA-SFVFPVYHKLRAR------EFHERILEEDLGLENENFVESMDLELVNPVKVNDV 176 Query: 591 LVSTASRIDA-TSVIPVRGNIYPDGLYYTYLHFGNPP--RPYFLDMDTGSDLTWIQCDAP 761 L ++A ID+ T++ PV GN+YPDGLYYT + G P + Y LD+DTGS+LTWIQCDAP Sbjct: 177 LSTSAGSIDSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAP 236 Query: 762 CTSCAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGV 941 CTSCAKGA+ YKP K N++ +++CVE+QR+Q+T+ C++CHQCDYEIEYADHS S+GV Sbjct: 237 CTSCAKGANQLYKPRKDNLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGV 296 Query: 942 LARDELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQG 1121 L +D+ +L + NGSLA+S +VFGC YDQQGLLLNT+ KTDGILGLSRAKIS PSQLAS+G Sbjct: 297 LTKDKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRG 356 Query: 1122 IINNVVGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQI 1301 II+NVVGHCLA++ +G GY+F+G D VP MTWVPML ++YQ ++ K+SYG + Sbjct: 357 IISNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGML 416 Query: 1302 GL-GNVGN-GRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKS 1475 L G G G+++FD+GSSY+YF QAY+ L+T L ++S L RD SD +LPICW+ K+ Sbjct: 417 SLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPICWRAKT 476 Query: 1476 --PSRSVKDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHD 1649 P S+ DV++ F+P+ LQ GSKW I+S KL I PE YL+ ++KGNVCLGIL+G +VHD Sbjct: 477 NFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSVHD 536 Query: 1650 GSTFILGDISLRGLLFVYDNVNEKIGWVRSDCARPRRFESHSLF 1781 GST ILGDIS+RG L VYDNV +IGW++SDC RPR + + F Sbjct: 537 GSTIILGDISMRGHLIVYDNVKRRIGWMKSDCVRPREIDHNVPF 580 >ref|XP_006826817.1| hypothetical protein AMTR_s00010p00056950 [Amborella trichopoda] gi|548831246|gb|ERM94054.1| hypothetical protein AMTR_s00010p00056950 [Amborella trichopoda] Length = 545 Score = 538 bits (1385), Expect = e-150 Identities = 279/557 (50%), Positives = 367/557 (65%), Gaps = 
2/557 (0%) Frame = +3 Query: 117 VIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESPPVQNFAXXXXXXXXXXXXXX 296 VII+LPPPD+PS GKTITAFT+ + ++ Q ++ Q + Sbjct: 10 VIISLPPPDDPSKGKTITAFTMVSDPSHQNENQSQNQQTQQPQIASNSIAGSSRGRIGSI 69 Query: 297 XXTVLPVLGISVIALYLWVSVSRETLFQLRDELGNDDEHQKNNSQHTFLLPLYLKKPNRV 476 VL +LG V L+ W VS + E+ + E KNN +FL LY K Sbjct: 70 VVRVLAMLGAVVAVLFFWQWVSGFS------EMDYETERSKNNP--SFLYNLYPKWSEEA 121 Query: 477 NGDLGDFEIKLGRRVSFDSRPISKELDDAMSVGEVNSKLVSTASRIDATSVIPVRGNIYP 656 D ++LG V D + +G + K + S I+++++ PV+GN+YP Sbjct: 122 IEK--DAALRLGTFVKRDE----------VRIGLRDVKTLEAISSINSSTIFPVKGNVYP 169 Query: 657 DGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQCDAPCTSCAKGAHPFYKPVKANIIPPKDS 836 DGLYY + GNP RPY+LDMDTGSDLTWIQC+APCT+CAKG HP Y P K N++P KD Sbjct: 170 DGLYYISILVGNPRRPYYLDMDTGSDLTWIQCNAPCTNCAKGPHPLYNPSKQNLVPSKDP 229 Query: 837 YCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLARDELYLNIANGSLAKSKVVFGCA 1016 +C+E+Q + K + HQCDY+IEYAD SSS+GVL RD+L L I NG++ K+ +VFGCA Sbjct: 230 FCLEVQVNDKGKFAGASHQCDYDIEYADQSSSMGVLVRDDLQLMITNGTVIKTGLVFGCA 289 Query: 1017 YDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIINNVVGHCLATEPSGGGYLFLGDD 1196 YDQ+G L ++ KTDGILGLS AK+S PSQLAS+G++ NVVGHC+ + +GGGY+FLGDD Sbjct: 290 YDQRGKLGHSPAKTDGILGLSSAKVSLPSQLASRGLMKNVVGHCIRNDANGGGYMFLGDD 349 Query: 1197 FVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGLGNVGN--GRLVFDSGSSYSYFTE 1370 F+P RMTWVPML S ++N+Y AE+ K+S G R I G + GR+VFDSGSSYSY T+ Sbjct: 350 FIPQWRMTWVPMLSSPSTNAYHAEVSKISLGSRPIDGGGLITKIGRVVFDSGSSYSYLTK 409 Query: 1371 QAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKSPSRSVKDVRQLFKPLNLQFGSKWWI 1550 QAY +L+ L D++ + LV D SD +LP+CW+ KSP RS+KDV Q FKPL L FGS+ Sbjct: 410 QAYTSLIKSLKDVAEKGLVLDDSDKTLPVCWKAKSPLRSIKDVNQFFKPLVLNFGSRLLF 469 Query: 1551 MSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILGDISLRGLLFVYDNVNEKIGW 1730 S +IPPEGYL+ ++KGN CLGIL G ++HDG+T ILGDISLR L VYDNV +IGW Sbjct: 470 GSKNFEIPPEGYLIISAKGNACLGILEGSHIHDGATNILGDISLRAKLVVYDNVKRRIGW 529 Query: 1731 VRSDCARPRRFESHSLF 1781 V+SDC +P + +S F Sbjct: 530 VQSDC-QPLKLKSFPFF 545 >ref|XP_004512995.1| PREDICTED: lysine-specific histone 
demethylase 1 homolog 1-like [Cicer arietinum] Length = 1387 Score = 536 bits (1380), Expect = e-149 Identities = 290/575 (50%), Positives = 381/575 (66%), Gaps = 16/575 (2%) Frame = +3 Query: 84 EETNERPPQL--TVIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESP--PVQNF 251 +E+ + PQL VII++PP +NPSLGK ITAFT S++ +PQ PQ + P P+Q++ Sbjct: 5 KESQSQTPQLKSVVIISIPPSNNPSLGKKITAFTFSNNPF-SPQQQPQNNVPPMSPIQSY 63 Query: 252 AXXXXXXXXXXXXXXXXTVLPVL---GISVIALYLWVSVSRETLFQLR-DEL------GN 401 T + GI + AL+L+ S+ L EL G Sbjct: 64 PSNHQLQFSSTRRFFHTTQIKFFTFFGIFLFALFLYGSLFSTITTTLELSELKNHHHDGG 123 Query: 402 DDEHQKNNSQHTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAMSVGEV 581 DDE + +S FL PL+ K DL ++K G V+ S D+ + Sbjct: 124 DDESDEPSS---FLFPLFKKYGVVGQRDLKLIDVKKGNFVTQKS-------GDSDGIA-F 172 Query: 582 NSKLVSTASRIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQCDAP 761 +S++V+ S +++V P+ GN+YPDGLYYT++ GNPP+ YF+D+DTGSDLTWIQCDAP Sbjct: 173 SSRVVAVDS--SSSTVFPISGNVYPDGLYYTHVRVGNPPKRYFVDVDTGSDLTWIQCDAP 230 Query: 762 CTSCAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGV 941 C SCAKGA+ YKP++ NI+P DS C+E+Q++Q +S QCDYEI+YADHSSS+GV Sbjct: 231 CRSCAKGANVPYKPIRTNIVPSLDSLCLEVQKNQKNGYHESFQQCDYEIQYADHSSSMGV 290 Query: 942 LARDELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQG 1121 L RDEL+L NGS K VFGC YDQ+GLLLNT+ KTDGI+GLSRAK+ P QL+S+G Sbjct: 291 LIRDELHLMTTNGSKTKLNFVFGCGYDQEGLLLNTLTKTDGIMGLSRAKVGLPYQLSSKG 350 Query: 1122 IINNVVGHCLATEPS-GGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQ 1298 II NVVGHCL+ GGGY+FLGDDFVP+ MTW PM Q ++ YQ E++ ++YG R Sbjct: 351 IIKNVVGHCLSNNDGVGGGYMFLGDDFVPYWGMTWAPMTQI--TDLYQTEVLGINYGNRL 408 Query: 1299 IGL-GNVGNGRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKS 1475 + G+ G +VFDSGSSY+YF ++AY +L+ L+++S LV D SDT+LPICWQ Sbjct: 409 LSFDGHSKVGNVVFDSGSSYTYFPKEAYRDLVASLEEVSGLGLVEDDSDTTLPICWQANF 468 Query: 1476 PSRSVKDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGS 1655 P RSVKDV+ FK L L+FG+KWWI+ST IPPEGYL+ ++KGNVCL IL+G NV+DGS Sbjct: 469 PIRSVKDVKDYFKTLTLRFGNKWWILSTLFHIPPEGYLIISNKGNVCLAILDGSNVNDGS 528 
Query: 1656 TFILGDISLRGLLFVYDNVNEKIGWVRSDCARPRR 1760 + ILGDISLRG L VYDNVN+ IGW R+ C P R Sbjct: 529 SIILGDISLRGYLVVYDNVNKNIGWERTKCGMPNR 563 >ref|XP_006304522.1| hypothetical protein CARUB_v10011364mg [Capsella rubella] gi|482573233|gb|EOA37420.1| hypothetical protein CARUB_v10011364mg [Capsella rubella] Length = 580 Score = 535 bits (1378), Expect = e-149 Identities = 287/570 (50%), Positives = 380/570 (66%), Gaps = 15/570 (2%) Frame = +3 Query: 117 VIITLPPPDNPSLGKTITAFTLSDHQTPT---PQSPPQVDESPPVQNFAXXXXXXXXXXX 287 V+ITLPP D+PS GKTI+AFTL+DH P P+ P P QN Sbjct: 17 VVITLPPSDDPSQGKTISAFTLTDHDYPLEIPPEDPSFHQPDPLHQN--PQFRLWFSDLS 74 Query: 288 XXXXXTVLPVLGISVIALYLWVSVSRET--LFQLRDELGNDDEHQKNNSQHTFLLPLYLK 461 VL +LGIS+IA+ L+ SV + +F++ DE DD++ + + +F+ P+Y K Sbjct: 75 MSSPRLVLSLLGISLIAIALYGSVFSNSVQMFRVSDERNRDDDNSRRETT-SFVFPVYHK 133 Query: 462 KPNRVNGDLGDF-EIKLGRRVSFDSRPISKELD-DAMSVGEVNSKLVSTASRIDA--TSV 629 R +F E L + ++ + + +D + ++ +VNS L +TA +D+ T++ Sbjct: 134 LRAR------EFHERVLAEDLGVENGILVESMDLELVNPVKVNSVLSTTAGSVDSSSTTI 187 Query: 630 IPVRGNIYPDGLYYTYLHFGNPP--RPYFLDMDTGSDLTWIQCDAPCTSCAKGAHPFYKP 803 PV GN+YPDGLYYT + G P Y LD+DTGSDLTWIQCDAPCTSCAKGA+ YKP Sbjct: 188 FPVGGNVYPDGLYYTRILVGKPEDGHYYHLDIDTGSDLTWIQCDAPCTSCAKGANQLYKP 247 Query: 804 VKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLARDELYLNIANGS 983 N++ + CVE QR+Q+T +S QCDYEIEYADHS S+GVL +D+ +L + NGS Sbjct: 248 KNHNLVGSSEPLCVEFQRNQMTGHFESSQQCDYEIEYADHSYSMGVLTKDKFHLKLHNGS 307 Query: 984 LAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIINNVVGHCLATEP 1163 LA+S +VFGC YDQQG+LLNT+ KTDGILGLSRAKIS PSQL S+GII+NVVGHCLA++ Sbjct: 308 LAESDIVFGCGYDQQGVLLNTLLKTDGILGLSRAKISLPSQLGSRGIISNVVGHCLASDL 367 Query: 1164 SGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGL-GNVGN-GRLVF 1337 G GY+F+G D VP MTWVPML YQ ++ K+SYG + L G G G+ +F Sbjct: 368 DGEGYIFMGSDLVPSHGMTWVPMLHHSRLEVYQMQVTKMSYGNAMLTLDGENGRVGKALF 427 Query: 1338 DSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKS--PSRSVKDVRQLF 1511 D+GSSY+YF QAY L+T L ++S L RD SD +LPICW+ K+ P S+ DV++ F Sbjct: 428 
DTGSSYTYFPNQAYTQLVTSLQEVSGSDLTRDDSDETLPICWRAKTNFPISSLSDVKKFF 487 Query: 1512 KPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILGDISLRGL 1691 +P+ LQ SKW I+S KL I PE YL+ ++KGNVCLGIL+G +VHDGST I+GDIS+RG Sbjct: 488 RPITLQIWSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSVHDGSTIIIGDISMRGH 547 Query: 1692 LFVYDNVNEKIGWVRSDCARPRRFESHSLF 1781 L VYDNV +IGW++SDC RPR F+ + F Sbjct: 548 LIVYDNVKRRIGWMKSDCVRPREFDHNVPF 577 >ref|XP_006393315.1| hypothetical protein EUTSA_v10011346mg [Eutrema salsugineum] gi|557089893|gb|ESQ30601.1| hypothetical protein EUTSA_v10011346mg [Eutrema salsugineum] Length = 580 Score = 528 bits (1359), Expect = e-147 Identities = 282/562 (50%), Positives = 377/562 (67%), Gaps = 16/562 (2%) Frame = +3 Query: 117 VIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESPPVQNFAXXXXXXXXXXXXXX 296 VIITLPP DNPS GKTI+AFTL+DH P P P+ + +P Q Sbjct: 18 VIITLPPSDNPSKGKTISAFTLTDHDYP-PDIRPEDERNPSFQPDPLHQNPQSGLWFSDL 76 Query: 297 XXT----VLPVLGISVIALYLWVSVSRET--LFQLRDELGNDDEHQKNNSQHTFLLPLYL 458 + VL +LGIS++A+ + SV + LF++ DE D+++++ + +F+ P+Y Sbjct: 77 SMSSPRLVLGLLGISLLAIAFYGSVFPNSVQLFRVSDERDRDEDNRRETA--SFVFPVYH 134 Query: 459 KK-----PNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAMSVGEVNSKLVSTASRIDAT 623 K P R + D +K + +S I +EL + + V +V S S S +T Sbjct: 135 KLRAREIPERNLAEALDV-VKEENGIFVES--IEQELVNPVKVNDVFS--ASVGSLDSST 189 Query: 624 SVIPVRGNIYPDGLYYTYLHFGNPPRP---YFLDMDTGSDLTWIQCDAPCTSCAKGAHPF 794 ++ PV G +YPDGLY+T + GNP + + LD+DTGSDLTWIQCDAPCTSCAKGA+ Sbjct: 190 TIFPVGGYVYPDGLYFTRVFVGNPEKDGHYFHLDIDTGSDLTWIQCDAPCTSCAKGANQL 249 Query: 795 YKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLARDELYLNIA 974 YKP K ++ + CVE+Q++Q+T+ C+SC QCDYEIEYAD SSS+GVL +DE +L + Sbjct: 250 YKPRKDKLVGSAEHLCVEVQKNQMTELCESCQQCDYEIEYADLSSSLGVLTKDEFHLKLH 309 Query: 975 NGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIINNVVGHCLA 1154 NGSLA S +VFGC YDQQGLLLNT+ K DGILGLSRAKIS PSQLASQGII+NVVGHCL Sbjct: 310 NGSLAASDIVFGCGYDQQGLLLNTLLKKDGILGLSRAKISLPSQLASQGIISNVVGHCLP 369 Query: 1155 TEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGLG--NVGNGR 1328 
++ +G GY+F+G D VP MTWVPM + +Q ++ KVSYG + L N G+ Sbjct: 370 SDLNGEGYIFMGSDLVPLHGMTWVPMFHHSHLEVHQMQVTKVSYGNGMLSLSGENGRIGK 429 Query: 1329 LVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKSPSRSVKDVRQL 1508 ++FD+GSSY+YF ++AY+ L+T L ++ L RD SD +LPICWQ S+ DV++ Sbjct: 430 VLFDTGSSYTYFPKKAYSQLVTSLQEV---KLTRDESDKALPICWQANFLISSLSDVKRF 486 Query: 1509 FKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILGDISLRG 1688 +KP+ +Q GSKWWI+S KL I PE YL+ ++KGNVCLGIL+G +VHDGST ILGDIS+RG Sbjct: 487 YKPITIQIGSKWWIISRKLVIQPEDYLIISNKGNVCLGILDGSSVHDGSTIILGDISMRG 546 Query: 1689 LLFVYDNVNEKIGWVRSDCARP 1754 L VYDNV +IGW++SDC RP Sbjct: 547 RLIVYDNVKRRIGWMKSDCVRP 568 >gb|EOY21002.1| Eukaryotic aspartyl protease family protein, putative isoform 2 [Theobroma cacao] Length = 520 Score = 524 bits (1349), Expect = e-146 Identities = 273/523 (52%), Positives = 352/523 (67%), Gaps = 21/523 (4%) Frame = +3 Query: 87 ETNERPPQLT--VIITLPPPDNPSLGKTITAFTLSD------HQTPTPQSPPQVDESPPV 242 +++ERP Q+T VIITLPP DNPSLGKTITAFTL++ HQT Q + P Sbjct: 2 DSDERPQQVTGVVIITLPPSDNPSLGKTITAFTLTNDVFPQSHQTQQRQQQEEEQTLPTT 61 Query: 243 QNFAXXXXXXXXXXXXXXXX--------TVLPVLGISVIALYLWVSVSRETLFQLRDELG 398 Q +L LGIS+ AL L+ S T +LR+ Sbjct: 62 QILTPAPPSAQNPQRGFSFLGLFSDNPRKLLGFLGISLFALLLYSSAFSNTFVELRNSNN 121 Query: 399 NDDEHQKNNSQHTFLLPLYLKKPNRVNGDLG-DFEIKLGRRVSFDSRPISKELDD-AMSV 572 +DDE ++ F+ PLY K LG D E+KLGR V D + ++ A Sbjct: 122 DDDEKPQS-----FIFPLYHK--------LGADLELKLGRFVDVDKENLVASVEGGATGT 168 Query: 573 GEVNSKLVSTASRIDAT-SVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQ 749 ++N + S A+ ID++ +++PVRGN+YPDGLY+TY+ GNP R YFLD+DTGSDLTWIQ Sbjct: 169 QKINKLVASNAAVIDSSGTILPVRGNVYPDGLYFTYMLVGNPQRRYFLDIDTGSDLTWIQ 228 Query: 750 CDAPCTSCAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSS 929 CDAPC+SCAKGA+P YKP + NI+ KD C E+Q++Q ++C++C QCDYEIEYAD SS Sbjct: 229 CDAPCSSCAKGANPLYKPTRVNIVASKDLMCTEVQKNQKPQNCETCQQCDYEIEYADRSS 288 Query: 930 SVGVLARDELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQL 1109 S+GVLARDEL+L ANGS VVFGCAYDQQG+LLNT+ KTDGILGLSRAK+S PSQL 
Sbjct: 289 SLGVLARDELHLVTANGSTTNLDVVFGCAYDQQGILLNTLSKTDGILGLSRAKVSLPSQL 348 Query: 1110 ASQGIINNVVGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYG 1289 AS+GIINNVVGHCLAT+ GY+FLGDDFVP+ M+WVPML S ++ Y ++VK++YG Sbjct: 349 ASKGIINNVVGHCLATDVGASGYMFLGDDFVPNWGMSWVPMLGSPSTEFYHTQIVKINYG 408 Query: 1290 GRQIGLGNVGN--GRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICW 1463 + LG + GR+VFDSGSSY+YF +QAY L+ L ++S ++D++DT+LP+CW Sbjct: 409 SSSLSLGRQHSSIGRVVFDSGSSYTYFMKQAYAELVASLSEVSEVGFIQDVADTTLPMCW 468 Query: 1464 QVKSPSRSVKDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLV 1592 Q P R +KDV+Q FK L LQFGSKWWI+S + IPPEGYL+ Sbjct: 469 QAPFPIRFIKDVKQFFKTLTLQFGSKWWIISKRFHIPPEGYLI 511