BLASTX nr result

ID: Rehmannia25_contig00005219 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00005219
         (1860 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006354730.1| PREDICTED: aspartic proteinase Asp1-like [So...   664   0.0  
ref|XP_004241624.1| PREDICTED: aspartic proteinase Asp1-like [So...   662   0.0  
gb|EXB56943.1| Aspartic proteinase Asp1 [Morus notabilis]             631   e-178
ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vi...   617   e-174
ref|XP_006440945.1| hypothetical protein CICLE_v10019473mg [Citr...   615   e-173
ref|XP_006485774.1| PREDICTED: aspartic proteinase Asp1-like [Ci...   615   e-173
gb|EOY21001.1| Eukaryotic aspartyl protease family protein, puta...   611   e-172
ref|XP_002511959.1| protein with unknown function [Ricinus commu...   607   e-171
emb|CBI15437.3| unnamed protein product [Vitis vinifera]              584   e-164
ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cu...   584   e-164
ref|XP_002328687.1| predicted protein [Populus trichocarpa] gi|5...   560   e-157
ref|XP_002891474.1| aspartyl protease family protein [Arabidopsi...   556   e-155
ref|XP_006583400.1| PREDICTED: aspartic proteinase Asp1-like [Gl...   548   e-153
gb|ESW24775.1| hypothetical protein PHAVU_004G159200g [Phaseolus...   548   e-153
ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana] gi|777...   548   e-153
ref|XP_006826817.1| hypothetical protein AMTR_s00010p00056950 [A...   538   e-150
ref|XP_004512995.1| PREDICTED: lysine-specific histone demethyla...   536   e-149
ref|XP_006304522.1| hypothetical protein CARUB_v10011364mg [Caps...   535   e-149
ref|XP_006393315.1| hypothetical protein EUTSA_v10011346mg [Eutr...   528   e-147
gb|EOY21002.1| Eukaryotic aspartyl protease family protein, puta...   524   e-146

>ref|XP_006354730.1| PREDICTED: aspartic proteinase Asp1-like [Solanum tuberosum]
          Length = 558

 Score =  664 bits (1712), Expect = 0.0
 Identities = 339/569 (59%), Positives = 412/569 (72%), Gaps = 6/569 (1%)
 Frame = +3

Query: 81   MEETNERPP-QLTVIITLPPPDNPSLGKTITAFTLSD---HQTPTPQSPPQVDESPPVQN 248
            MEET   PP Q  VIITLPPPDNPS GKTITAFTLSD   HQ    + PPQ  +      
Sbjct: 1    MEETKNSPPIQGVVIITLPPPDNPSYGKTITAFTLSDSPTHQQQQEEEPPQQSQPHNQDL 60

Query: 249  FAXXXXXXXXXXXXXXXXTVLPVLGISVIALYLWVSVSRETLFQLRDELGNDDEHQKNNS 428
                               V  +LGIS+IAL  W S+++ETLF+LRD      EH   +S
Sbjct: 61   NTGVLRASLERSFFFRPKIVFGLLGISLIALSFWSSLTQETLFELRDV-----EHDHKSS 115

Query: 429  QHTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAMSVGEVNSKLVSTAS 608
              +F+LPLY K+    N    D E KLGR V F          D     E  +K +S A+
Sbjct: 116  NSSFILPLYPKRGGAWNSRR-DVEFKLGRFVDFKP--------DKFMDQEKIAKSLSAAT 166

Query: 609  RIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQCDAPCTSCAKGAH 788
            ++D++   PVRGNI+ +GLYYTY+  GNPPRPYFLD+DTGSDL WIQCDAPCTSCAKGAH
Sbjct: 167  KLDSSVNFPVRGNIHSEGLYYTYMLVGNPPRPYFLDIDTGSDLMWIQCDAPCTSCAKGAH 226

Query: 789  PFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLARDELYLN 968
            P YKP   N+IPPK+ YCVE+Q +  +K CD+CHQCDYEIEYAD SSSVGVLA+DEL L 
Sbjct: 227  PLYKPRNVNMIPPKNPYCVEVQENLKSKYCDNCHQCDYEIEYADRSSSVGVLAKDELQLV 286

Query: 969  IANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIINNVVGHC 1148
            +ANG+  K  VVFGCAYDQQG LLNT+  TDGILGLSRA IS PSQLAS G+INNV+GHC
Sbjct: 287  LANGTGTKPSVVFGCAYDQQGTLLNTLASTDGILGLSRAPISLPSQLASHGLINNVIGHC 346

Query: 1149 LATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGLGNV--GN 1322
            L T+ + GGYLFLG+DFVP  RM+WVPML +   N YQA+++K++YGG+++ LG+   G 
Sbjct: 347  LRTD-TNGGYLFLGNDFVPQWRMSWVPMLNNPFPNLYQAQLMKMNYGGKELRLGSTSYGQ 405

Query: 1323 GRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKSPSRSVKDVR 1502
            G +VFDSGS+Y+YFT+QAY  L+++L++ISSE L++D SDT+LPICW+ K P RS+++VR
Sbjct: 406  GTVVFDSGSTYTYFTDQAYKALISMLEEISSEDLIKDASDTTLPICWRAKFPVRSIEEVR 465

Query: 1503 QLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILGDISL 1682
            Q FKPLNLQFGSKW I+STKL IP EG+L  + KGNVCLGIL+G NVHDGS  ILGDISL
Sbjct: 466  QFFKPLNLQFGSKWRIVSTKLWIPAEGFLTISEKGNVCLGILDGSNVHDGSAIILGDISL 525

Query: 1683 RGLLFVYDNVNEKIGWVRSDCARPRRFES 1769
            RG LFVYDNVN+KIGW+RS+C RP +  S
Sbjct: 526  RGQLFVYDNVNQKIGWIRSNCERPEKVPS 554


>ref|XP_004241624.1| PREDICTED: aspartic proteinase Asp1-like [Solanum lycopersicum]
          Length = 562

 Score =  662 bits (1709), Expect = 0.0
 Identities = 340/577 (58%), Positives = 421/577 (72%), Gaps = 10/577 (1%)
 Frame = +3

Query: 81   MEETNERPP-QLTVIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESPPVQNF-- 251
            MEET   PP Q  VIITLPPPDNPS GKTITAFTLSD  T   Q   + +E PP Q+   
Sbjct: 1    MEETKNSPPIQGVVIITLPPPDNPSYGKTITAFTLSDSPTHQQQQEQEQEEEPPQQSQPH 60

Query: 252  -----AXXXXXXXXXXXXXXXXTVLPVLGISVIALYLWVSVSRETLFQLRDELGNDDEHQ 416
                 A                 V  +LGIS+IAL  W S+++ETLF+LRD    + +H+
Sbjct: 61   NQDVNAGVLHVSLERSFFFRPTIVFGLLGISLIALSFWSSLTQETLFELRDV---EQDHK 117

Query: 417  KNNSQHTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAMSVGEVNSKLV 596
             +NS  +F+LPLY K+    N    D E KLGR V F         D+ M   ++ +K +
Sbjct: 118  SSNS--SFILPLYPKRGGAWNSRT-DVEFKLGRFVDFKP-------DNFMDQEKI-AKSL 166

Query: 597  STASRIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQCDAPCTSCA 776
            S A+++D+++  PVRGNI+ +GLYYTY+  GNPP+PYFLD+DTGSDL WIQCDAPCTSCA
Sbjct: 167  SAATKLDSSANFPVRGNIHSEGLYYTYMLVGNPPKPYFLDIDTGSDLMWIQCDAPCTSCA 226

Query: 777  KGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLARDE 956
            KGAHP YKP   N+IPPK+ YCVE+Q +  +K CD+CHQCDYEIEYAD SSSVGVLA+DE
Sbjct: 227  KGAHPLYKPRNVNMIPPKNPYCVEVQENLRSKYCDNCHQCDYEIEYADRSSSVGVLAKDE 286

Query: 957  LYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIINNV 1136
            L L +ANG+  K  VVFGCAYDQQG LLNT+  TDGILGLSRA IS PSQLAS G+INNV
Sbjct: 287  LQLVLANGTGTKPNVVFGCAYDQQGTLLNTLASTDGILGLSRAPISLPSQLASHGLINNV 346

Query: 1137 VGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGLGNV 1316
            +GHCL T+ + GGYLFLG+DFVP  RM+WVPML +   N YQA+++K++YGG+ + LG+ 
Sbjct: 347  IGHCLRTD-TNGGYLFLGNDFVPQWRMSWVPMLNNPFPNLYQAQLMKMNYGGKDLQLGSR 405

Query: 1317 GNGR--LVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKSPSRSV 1490
            G G+  +VFDSGS+Y+YFT+QAY  L+++L++ISSE L++D SDT+LPICW+ K P RS+
Sbjct: 406  GYGQDSVVFDSGSTYTYFTDQAYKALISMLEEISSEDLIKDASDTTLPICWRAKFPVRSI 465

Query: 1491 KDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILG 1670
            ++VRQ FKPLNLQFGSKW ++STKL IP EGYL  + K NVCLGIL+G NVHDGS  ILG
Sbjct: 466  EEVRQFFKPLNLQFGSKWRVVSTKLWIPAEGYLTISEKSNVCLGILDGSNVHDGSAIILG 525

Query: 1671 DISLRGLLFVYDNVNEKIGWVRSDCARPRRFESHSLF 1781
            DISLRG LFVYDNVN+KIGW+RS+C RP    S   F
Sbjct: 526  DISLRGQLFVYDNVNQKIGWIRSNCERPENVPSLPFF 562


>gb|EXB56943.1| Aspartic proteinase Asp1 [Morus notabilis]
          Length = 569

 Score =  631 bits (1627), Expect = e-178
 Identities = 326/577 (56%), Positives = 403/577 (69%), Gaps = 14/577 (2%)
 Frame = +3

Query: 93   NERPPQL--TVIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESPPVQ---NFAX 257
            ++ PPQ+   VIITLPPPDNPSLGKTITAFTLS+          Q   + P+Q   N   
Sbjct: 3    SDHPPQIKGVVIITLPPPDNPSLGKTITAFTLSNSSPTQTHQESQNQNNLPIQSPQNPQL 62

Query: 258  XXXXXXXXXXXXXXXTVLPVLGISVIALYLWVSVSRETLFQLRDELGNDDEHQKNNSQHT 437
                            +  +LGIS+  L L+  V    + + R    NDDE        +
Sbjct: 63   QFPFPRLRLFHGVPRRLFALLGISIFTLVLFSHVFPTVVEEFRRS--NDDE-----GPES 115

Query: 438  FLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAMSVGEVNSKLVSTASRID 617
            F+ PLY K    V G   D E+KLGR V FD         D +   +VN KLVS+ +++D
Sbjct: 116  FIFPLYSKLG--VPGKK-DVELKLGRFVDFDKENAGVSFGDRVKTQKVN-KLVSSTAKVD 171

Query: 618  ATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQCDAPCTSCAKGAHPFY 797
            +++++PVRGN+YPDGLYYT +  GNPPRPY LDMDTGSDLTWIQCDAPCTSCAKGA+P Y
Sbjct: 172  SSAILPVRGNVYPDGLYYTQILVGNPPRPYHLDMDTGSDLTWIQCDAPCTSCAKGANPLY 231

Query: 798  KPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLARDELYLNIAN 977
            KP K NI+P KDS+C EI+R+Q    C +C QCDYEI+YAD SSS+GVLA+D L+L + N
Sbjct: 232  KPTKGNIVPSKDSFCTEIRRNQKPGHCKTCQQCDYEIQYADRSSSLGVLAKDGLHLVMEN 291

Query: 978  GSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIINNVVGHCLAT 1157
            GSLA   VVFGCAYDQQGLLLNT+ KTDGILGLSRAK+S PSQLAS+GII NVVGHCL T
Sbjct: 292  GSLANVNVVFGCAYDQQGLLLNTLAKTDGILGLSRAKVSLPSQLASKGIIKNVVGHCLTT 351

Query: 1158 EPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGLGNVGN--GRL 1331
               GGGY+FLGDDFVPH  M+W+PML+S + + YQ+E+V ++YG   + LG   +   +L
Sbjct: 352  NAGGGGYMFLGDDFVPHWGMSWIPMLRSPSMDFYQSEIVSINYGSSALNLGAWSSKARQL 411

Query: 1332 VFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKSP-------SRSV 1490
            VFDSGSSY+YF ++AY+ LL  L+++S+  LVRD SD SLPICW+ ++P        RSV
Sbjct: 412  VFDSGSSYTYFNKRAYSALLASLEEVSTTGLVRDRSDPSLPICWRAETPLNCIHMECRSV 471

Query: 1491 KDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILG 1670
             DV++ FK + LQFGSKWWI+ST+L+IPPEGYL  +SKGNVCLGIL+G  VHDG T ILG
Sbjct: 472  ADVKRFFKTITLQFGSKWWIISTRLRIPPEGYLTISSKGNVCLGILDGSKVHDGYTTILG 531

Query: 1671 DISLRGLLFVYDNVNEKIGWVRSDCARPRRFESHSLF 1781
            DISLRG L VYDN N+KIGW  SDC +PRRF+S   F
Sbjct: 532  DISLRGHLVVYDNENQKIGWTNSDCVKPRRFDSLPFF 568


>ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
          Length = 686

 Score =  617 bits (1590), Expect = e-174
 Identities = 323/595 (54%), Positives = 409/595 (68%), Gaps = 36/595 (6%)
 Frame = +3

Query: 105  PQL--TVIITLPPPDNPSLGKTITAFTLSD------------------------------ 188
            PQL   VIITLPPPDNPSLGKTITAFTLSD                              
Sbjct: 124  PQLKGVVIITLPPPDNPSLGKTITAFTLSDPPLDRPHHTHQQLQRQQHQEEEEEEEEEEE 183

Query: 189  --HQTPTPQSPPQVDESPPVQNFAXXXXXXXXXXXXXXXXTVLPVLGISVIALYLWVSVS 362
              HQ P+P SPP      P   F+                 ++  LG+S+    LW   S
Sbjct: 184  EPHQLPSP-SPPN-----PALQFSVRKLSLGNPRI------LMGFLGVSLFVFLLWNFAS 231

Query: 363  RETLFQLRDELGNDDEHQKNNSQHTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPI 542
               L +LR +  NDD    +     F+LPLY K  +R    LGD E+KLG+ V F     
Sbjct: 232  SSPLVELRRK--NDDREPTS-----FILPLYPKLGSR---SLGDLELKLGKFVDF----- 276

Query: 543  SKELDDAMSVGEVNSKLVSTASRIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMD 722
               ++D M  G +N KL ++ S  D++++ PVRG++YP+GLY+T++  G+PPR YFLDMD
Sbjct: 277  --HVND-MKPGGIN-KLATSVSAFDSSTIFPVRGDVYPNGLYFTHIFVGSPPRRYFLDMD 332

Query: 723  TGSDLTWIQCDAPCTSCAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDY 902
            TGSDLTWIQCDAPCTSCAKG +P YKP K N++P KDS CVE+QR+  T  C++C QCDY
Sbjct: 333  TGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNLVPLKDSLCVEVQRNLKTGYCETCEQCDY 392

Query: 903  EIEYADHSSSVGVLARDELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSR 1082
            EIEYADHSSS+GVLA D+L+L +ANGSL K  ++FGCAYDQQGLLLN++ KTDGILGLS+
Sbjct: 393  EIEYADHSSSMGVLASDDLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGILGLSK 452

Query: 1083 AKISFPSQLASQGIINNVVGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQ 1262
            AK+S PSQLASQ IINNV+GHCL ++ +GGGY+FLGDDFVP+  M WVPML SH+ N Y 
Sbjct: 453  AKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFLGDDFVPYWGMAWVPMLNSHSPN-YH 511

Query: 1263 AEMVKVSYGGRQIGLGNVG--NGRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDM 1436
            ++++K+S+G RQ+ LG       R+VFD+GSSY+YF ++AY  L+  L D+S E L++D 
Sbjct: 512  SQIMKISHGSRQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDG 571

Query: 1437 SDTSLPICWQVKSPSRSVKDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVC 1616
            SD +LP+CW+ K P RSV DV+Q F+PL LQF SKWWI+STK +IPPEGYL+ ++KGNVC
Sbjct: 572  SDPTLPVCWRAKFPIRSVIDVKQFFQPLTLQFRSKWWIVSTKFRIPPEGYLIISNKGNVC 631

Query: 1617 LGILNGRNVHDGSTFILGDISLRGLLFVYDNVNEKIGWVRSDCARPRRFESHSLF 1781
            LGIL+G NVHDGST ILGDISLRG L VYDNVN+KIGW +S C +P++ +S   F
Sbjct: 632  LGILDGSNVHDGSTIILGDISLRGKLVVYDNVNQKIGWAQSTCVKPQKIKSLPFF 686


>ref|XP_006440945.1| hypothetical protein CICLE_v10019473mg [Citrus clementina]
            gi|557543207|gb|ESR54185.1| hypothetical protein
            CICLE_v10019473mg [Citrus clementina]
          Length = 577

 Score =  615 bits (1587), Expect = e-173
 Identities = 324/575 (56%), Positives = 399/575 (69%), Gaps = 13/575 (2%)
 Frame = +3

Query: 84   EETNERPPQLT--VIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESPPVQNFAX 257
            +E+   PPQLT  VIITLPPP+NPSLGKTITA+TL+D+   + Q+  +  +  P+     
Sbjct: 4    DESPSPPPQLTGVVIITLPPPNNPSLGKTITAYTLTDNSPQSQQTRHRQQQEHPLPPQLH 63

Query: 258  XXXXXXXXXXXXXXXTVLP-----VLGISVIALYLWVSVSRETLFQLRDELGNDDEHQKN 422
                             LP      L IS+ AL L+ SV   TL Q R +  NDDE++++
Sbjct: 64   PPQNSQFNFSLPMLFPGLPRKLFLFLAISIFALILYGSVFSYTL-QDRYKSNNDDENKES 122

Query: 423  NSQHTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAM---SVGEVNSKL 593
                 F+ PLY K   R      D E KLGR V  D   +   ++D +      ++N KL
Sbjct: 123  -----FVFPLYHKFGIREVSQR-DAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKL 176

Query: 594  VST-ASRIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQCDAPCTS 770
            VS+ A  +D++S+ P+RGNIYPDGLY+TY+  GNPPRPY+LDMDTGSDLTWIQCDAPC+S
Sbjct: 177  VSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS 236

Query: 771  CAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLAR 950
            CAKGA+P YKP   NI+P KDS C+EIQR+     C++C QCDYEIEYADHSSS+GVLAR
Sbjct: 237  CAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLAR 296

Query: 951  DELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIIN 1130
            DEL+L I NGSL K  VVFGCAYDQQGLLLNT+ KTDGILGLSRAK+S PSQLASQGII 
Sbjct: 297  DELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK 356

Query: 1131 NVVGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGLG 1310
            NVVGHCL T   GGGY+FLG D VP   M WVPML S     Y  E++K++YG   + LG
Sbjct: 357  NVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLG 416

Query: 1311 --NVGNGRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKSPSR 1484
              N   G  +FD+GSSY+YFT+QAY+ L+  L ++SS+ LV D SD +LP+CW+ K P R
Sbjct: 417  ARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIR 476

Query: 1485 SVKDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFI 1664
            S+ DV+Q FK L L FGSKW I+STK +I PEGYLV + KGN+CLGIL+G  VH+GST I
Sbjct: 477  SIVDVKQFFKTLTLHFGSKWQIVSTKFRISPEGYLVISKKGNICLGILDGSEVHNGSTII 536

Query: 1665 LGDISLRGLLFVYDNVNEKIGWVRSDCARPRRFES 1769
            LGDISLRG L VYDNVN++IGW +S C  P RF+S
Sbjct: 537  LGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKS 571


>ref|XP_006485774.1| PREDICTED: aspartic proteinase Asp1-like [Citrus sinensis]
          Length = 577

 Score =  615 bits (1586), Expect = e-173
 Identities = 325/575 (56%), Positives = 396/575 (68%), Gaps = 13/575 (2%)
 Frame = +3

Query: 84   EETNERPPQLT--VIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESPPVQNFAX 257
            +E+   PPQLT  VIITLPPP+NPSLGKTITA+TL+D+   + Q+  Q  +  P+     
Sbjct: 4    DESPSPPPQLTGVVIITLPPPNNPSLGKTITAYTLTDNSPQSQQTHHQQQQEHPLPAQLH 63

Query: 258  XXXXXXXXXXXXXXXTVLP-----VLGISVIALYLWVSVSRETLFQLRDELGNDDEHQKN 422
                            VLP      L IS+ AL L+ SV   TL Q R +  NDDE++++
Sbjct: 64   PPQDSQFNFSLPMLFPVLPRKLFLFLAISIFALILYGSVFSYTL-QHRYKSNNDDENKES 122

Query: 423  NSQHTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAM---SVGEVNSKL 593
                 F+ PLY K   R      D E KLGR V  D   +   ++D +      ++N KL
Sbjct: 123  -----FVFPLYHKFGIREVLQR-DAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKL 176

Query: 594  V-STASRIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQCDAPCTS 770
            V S A  +D++S  P+RGN+YPDGLY+TY+  GNPPRPY+LDMDTGSDLTWIQCDAPC+S
Sbjct: 177  VPSNAVAVDSSSTFPLRGNVYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS 236

Query: 771  CAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLAR 950
            CAKGA+P YKP   NI+P KDS C+EIQR+     C++C QCDYEIEYADHSSS+GVLAR
Sbjct: 237  CAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLAR 296

Query: 951  DELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIIN 1130
            DEL+L I NGSL K  VVFGCAYDQQGLLLNT+ KTDGILGLSRAK+S PSQLASQGII 
Sbjct: 297  DELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK 356

Query: 1131 NVVGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGLG 1310
            NVVGHCL T   GGGY+FLG D VP   M WVPML S     Y  E++K++YG   + LG
Sbjct: 357  NVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLG 416

Query: 1311 --NVGNGRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKSPSR 1484
              N   G  +FD+GSSY+YFT+QAY+ L+  L ++SS  LV D SD +LP+CW+ K P R
Sbjct: 417  ARNSRVGWALFDTGSSYTYFTKQAYSELIASLKEVSSNGLVLDASDPTLPVCWRAKFPIR 476

Query: 1485 SVKDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFI 1664
            S+ DV+Q FK L L FGSKW I+STK  I PEGYLV + KGN+CLGIL+G  VH+GST I
Sbjct: 477  SIVDVKQYFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536

Query: 1665 LGDISLRGLLFVYDNVNEKIGWVRSDCARPRRFES 1769
            LGDISLRG L VYDNVN++IGW +S C  P RF+S
Sbjct: 537  LGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKS 571


>gb|EOY21001.1| Eukaryotic aspartyl protease family protein, putative isoform 1
            [Theobroma cacao]
          Length = 576

 Score =  611 bits (1576), Expect = e-172
 Identities = 316/582 (54%), Positives = 399/582 (68%), Gaps = 21/582 (3%)
 Frame = +3

Query: 87   ETNERPPQLT--VIITLPPPDNPSLGKTITAFTLSD------HQTPTPQSPPQVDESPPV 242
            +++ERP Q+T  VIITLPP DNPSLGKTITAFTL++      HQT   Q   +    P  
Sbjct: 2    DSDERPQQVTGVVIITLPPSDNPSLGKTITAFTLTNDVFPQSHQTQQRQQQEEEQTLPTT 61

Query: 243  QNFAXXXXXXXXXXXXXXXX--------TVLPVLGISVIALYLWVSVSRETLFQLRDELG 398
            Q                            +L  LGIS+ AL L+ S    T  +LR+   
Sbjct: 62   QILTPAPPSAQNPQRGFSFLGLFSDNPRKLLGFLGISLFALLLYSSAFSNTFVELRNSNN 121

Query: 399  NDDEHQKNNSQHTFLLPLYLKKPNRVNGDLG-DFEIKLGRRVSFDSRPISKELDD-AMSV 572
            +DDE  ++     F+ PLY K        LG D E+KLGR V  D   +   ++  A   
Sbjct: 122  DDDEKPQS-----FIFPLYHK--------LGADLELKLGRFVDVDKENLVASVEGGATGT 168

Query: 573  GEVNSKLVSTASRIDAT-SVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQ 749
             ++N  + S A+ ID++ +++PVRGN+YPDGLY+TY+  GNP R YFLD+DTGSDLTWIQ
Sbjct: 169  QKINKLVASNAAVIDSSGTILPVRGNVYPDGLYFTYMLVGNPQRRYFLDIDTGSDLTWIQ 228

Query: 750  CDAPCTSCAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSS 929
            CDAPC+SCAKGA+P YKP + NI+  KD  C E+Q++Q  ++C++C QCDYEIEYAD SS
Sbjct: 229  CDAPCSSCAKGANPLYKPTRVNIVASKDLMCTEVQKNQKPQNCETCQQCDYEIEYADRSS 288

Query: 930  SVGVLARDELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQL 1109
            S+GVLARDEL+L  ANGS     VVFGCAYDQQG+LLNT+ KTDGILGLSRAK+S PSQL
Sbjct: 289  SLGVLARDELHLVTANGSTTNLDVVFGCAYDQQGILLNTLSKTDGILGLSRAKVSLPSQL 348

Query: 1110 ASQGIINNVVGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYG 1289
            AS+GIINNVVGHCLAT+    GY+FLGDDFVP+  M+WVPML S ++  Y  ++VK++YG
Sbjct: 349  ASKGIINNVVGHCLATDVGASGYMFLGDDFVPNWGMSWVPMLGSPSTEFYHTQIVKINYG 408

Query: 1290 GRQIGLGNVGN--GRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICW 1463
               + LG   +  GR+VFDSGSSY+YF +QAY  L+  L ++S    ++D++DT+LP+CW
Sbjct: 409  SSSLSLGRQHSSIGRVVFDSGSSYTYFMKQAYAELVASLSEVSEVGFIQDVADTTLPMCW 468

Query: 1464 QVKSPSRSVKDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNV 1643
            Q   P R +KDV+Q FK L LQFGSKWWI+S +  IPPEGYL+ + KGNVCLGIL+G  V
Sbjct: 469  QAPFPIRFIKDVKQFFKTLTLQFGSKWWIISKRFHIPPEGYLIISKKGNVCLGILDGSKV 528

Query: 1644 HDGSTFILGDISLRGLLFVYDNVNEKIGWVRSDCARPRRFES 1769
            HDGST ILGDISLRG L VYDN   KIGW +SDCA PRRF+S
Sbjct: 529  HDGSTIILGDISLRGQLVVYDNEKLKIGWTQSDCAHPRRFKS 570


>ref|XP_002511959.1| protein with unknown function [Ricinus communis]
            gi|223549139|gb|EEF50628.1| protein with unknown function
            [Ricinus communis]
          Length = 583

 Score =  607 bits (1566), Expect = e-171
 Identities = 320/588 (54%), Positives = 406/588 (69%), Gaps = 21/588 (3%)
 Frame = +3

Query: 81   MEETNERPPQLTVIITLPPPDNPSLGKTITAFTLSD--HQTPTPQSPPQVDESP------ 236
            ME  ++      VII+LPPP+NPSLGKTITAFTL+D  H    PQS    ++ P      
Sbjct: 1    MESDDQSSHVKVVIISLPPPNNPSLGKTITAFTLTDDDHDATYPQSHQNHEQEPSIIQTH 60

Query: 237  -----PVQNFAXXXXXXXXXXXXXXXXTVLP-----VLGISVIALYLWVSVSRETLFQLR 386
                 PVQ+ +                   P     +L IS+ A+ ++ S+   TL +L+
Sbjct: 61   RESQLPVQSPSLPPQNPQIQFSFSGLYFSTPRKLLFLLCISLFAVIVYRSLFSNTLLELK 120

Query: 387  DELGNDDEHQKNNSQHTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAM 566
              + +DD  +K  S   F+ PLY K   R      + E K  R V  +S   S   DD +
Sbjct: 121  --VSDDDNDEKTKS---FIFPLYHKFGIREISQ-SNLEHKSIRSVYKESLVASVNDDDVI 174

Query: 567  SVGEVNSKLVST-ASRIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTW 743
             V   N KL S+ A+ +D++SV PVRGN+YPDGLY+TY+  GNPPRPY+LD+DT SDLTW
Sbjct: 175  -VPNRNYKLASSNAAAVDSSSVFPVRGNVYPDGLYFTYILVGNPPRPYYLDIDTASDLTW 233

Query: 744  IQCDAPCTSCAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADH 923
            IQCDAPCTSCAKGA+  YKP + NI+ PKDS CVE+ R+Q    C++C QCDYEIEYADH
Sbjct: 234  IQCDAPCTSCAKGANALYKPRRDNIVTPKDSLCVELHRNQKAGYCETCQQCDYEIEYADH 293

Query: 924  SSSVGVLARDELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPS 1103
            SSS+GVLARDEL+L +ANGS    K  FGCAYDQQGLLLNT+ KTDGILGLS+AK+S PS
Sbjct: 294  SSSMGVLARDELHLTMANGSSTNLKFNFGCAYDQQGLLLNTLVKTDGILGLSKAKVSLPS 353

Query: 1104 QLASQGIINNVVGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVS 1283
            QLA++GIINNVVGHCLA +  GGGY+FLGDDFVP   M+WVPML S + +SYQ +++K++
Sbjct: 354  QLANRGIINNVVGHCLANDVVGGGYMFLGDDFVPRWGMSWVPMLDSPSIDSYQTQIMKLN 413

Query: 1284 YGGRQIGLGNVGN--GRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPI 1457
            YG   + LG       R+VFDSGSSY+YFT++AY+ L+  L  +S E+L++D SD +LP 
Sbjct: 414  YGSGPLSLGGQERRVRRIVFDSGSSYTYFTKEAYSELVASLKQVSGEALIQDTSDPTLPF 473

Query: 1458 CWQVKSPSRSVKDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGR 1637
            CW+ K P RSV DV+Q FK L LQFGSKWWI+STK +IPPEGYL+ ++KGNVCLGIL+G 
Sbjct: 474  CWRAKFPIRSVIDVKQYFKTLTLQFGSKWWIISTKFRIPPEGYLIISNKGNVCLGILDGS 533

Query: 1638 NVHDGSTFILGDISLRGLLFVYDNVNEKIGWVRSDCARPRRFESHSLF 1781
            +VHDGS+ ILGDISLRG L +YDNVN KIGW +SDC +P+ F +   F
Sbjct: 534  DVHDGSSIILGDISLRGQLIIYDNVNNKIGWTQSDCIKPKTFSTLPFF 581


>emb|CBI15437.3| unnamed protein product [Vitis vinifera]
          Length = 473

 Score =  584 bits (1506), Expect = e-164
 Identities = 287/490 (58%), Positives = 369/490 (75%), Gaps = 2/490 (0%)
 Frame = +3

Query: 318  LGISVIALYLWVSVSRETLFQLRDELGNDDEHQKNNSQHTFLLPLYLKKPNRVNGDLGDF 497
            LG+S+    LW   S   L +LR +  NDD    +     F+LPLY K  +R    LGD 
Sbjct: 4    LGVSLFVFLLWNFASSSPLVELRRK--NDDREPTS-----FILPLYPKLGSR---SLGDL 53

Query: 498  EIKLGRRVSFDSRPISKELDDAMSVGEVNSKLVSTASRIDATSVIPVRGNIYPDGLYYTY 677
            E+KLG+ V F        ++D M  G +N KL ++ S  D++++ PVRG++YP+GLY+T+
Sbjct: 54   ELKLGKFVDF-------HVND-MKPGGIN-KLATSVSAFDSSTIFPVRGDVYPNGLYFTH 104

Query: 678  LHFGNPPRPYFLDMDTGSDLTWIQCDAPCTSCAKGAHPFYKPVKANIIPPKDSYCVEIQR 857
            +  G+PPR YFLDMDTGSDLTWIQCDAPCTSCAKG +P YKP K N++P KDS CVE+QR
Sbjct: 105  IFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNLVPLKDSLCVEVQR 164

Query: 858  SQITKDCDSCHQCDYEIEYADHSSSVGVLARDELYLNIANGSLAKSKVVFGCAYDQQGLL 1037
            +  T  C++C QCDYEIEYADHSSS+GVLA D+L+L +ANGSL K  ++FGCAYDQQGLL
Sbjct: 165  NLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANGSLTKLGIMFGCAYDQQGLL 224

Query: 1038 LNTMGKTDGILGLSRAKISFPSQLASQGIINNVVGHCLATEPSGGGYLFLGDDFVPHSRM 1217
            LN++ KTDGILGLS+AK+S PSQLASQ IINNV+GHCL ++ +GGGY+FLGDDFVP+  M
Sbjct: 225  LNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFLGDDFVPYWGM 284

Query: 1218 TWVPMLQSHNSNSYQAEMVKVSYGGRQIGLGNVG--NGRLVFDSGSSYSYFTEQAYNNLL 1391
             WVPML SH+ N Y ++++K+S+G RQ+ LG       R+VFD+GSSY+YF ++AY  L+
Sbjct: 285  AWVPMLNSHSPN-YHSQIMKISHGSRQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALV 343

Query: 1392 TVLDDISSESLVRDMSDTSLPICWQVKSPSRSVKDVRQLFKPLNLQFGSKWWIMSTKLQI 1571
              L D+S E L++D SD +LP+CW+ K P RSV DV+Q F+PL LQF SKWWI+STK +I
Sbjct: 344  ASLKDVSDEGLIQDGSDPTLPVCWRAKFPIRSVIDVKQFFQPLTLQFRSKWWIVSTKFRI 403

Query: 1572 PPEGYLVTNSKGNVCLGILNGRNVHDGSTFILGDISLRGLLFVYDNVNEKIGWVRSDCAR 1751
            PPEGYL+ ++KGNVCLGIL+G NVHDGST ILGDISLRG L VYDNVN+KIGW +S C +
Sbjct: 404  PPEGYLIISNKGNVCLGILDGSNVHDGSTIILGDISLRGKLVVYDNVNQKIGWAQSTCVK 463

Query: 1752 PRRFESHSLF 1781
            P++ +S   F
Sbjct: 464  PQKIKSLPFF 473


>ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
            gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic
            proteinase Asp1-like [Cucumis sativus]
          Length = 570

 Score =  584 bits (1505), Expect = e-164
 Identities = 297/572 (51%), Positives = 384/572 (67%), Gaps = 17/572 (2%)
 Frame = +3

Query: 117  VIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESPPVQNFAXXXXXXXXXXXXXX 296
            V+ITLPPPDNPSLGK++TAFTL+D     P     VD+     N                
Sbjct: 10   VVITLPPPDNPSLGKSVTAFTLTDDFPEPPGESVAVDQEVQQPNNDHLTLPPNLPIQAPL 69

Query: 297  XXTVLP---------------VLGISVIALYLWVSVSRETLFQLRDELGNDDEHQKNNSQ 431
                +P               VLGI++ A+YL+ S   ET+ +LR    NDD+   +   
Sbjct: 70   SQRSIPLSRELFAGTPRKLVFVLGIALAAVYLYASNFPETIRELRRSERNDDDRPSS--- 126

Query: 432  HTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAMSVGEVNSKLVSTASR 611
              FL PLY +      GD  DF++KLGR V  +   +    +D + V +  SKL+S + +
Sbjct: 127  --FLFPLYFQSEL---GDSSDFQLKLGRTVRVNKDDLGVRFNDVLGVPKP-SKLISASLK 180

Query: 612  IDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQCDAPCTSCAKGAHP 791
             D+++V PVRG+IYPDGLYYTY+  G PPRPYFLD+DTGSDLTW+QCDAPC+SC KG  P
Sbjct: 181  SDSSAVFPVRGDIYPDGLYYTYIMVGEPPRPYFLDIDTGSDLTWVQCDAPCSSCGKGRSP 240

Query: 792  FYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLARDELYLNI 971
             YKP + N++  KDS C+E+QR+     C +C QC+YE++YAD SSS+GVL +DE  L  
Sbjct: 241  LYKPRRENVVSFKDSLCMEVQRNYDGDQCAACQQCNYEVQYADQSSSLGVLVKDEFTLRF 300

Query: 972  ANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIINNVVGHCL 1151
            +NGSL K   +FGCAYDQQGLLLNT+ KTDGILGLSRAK+S PSQLAS+GIINNVVGHCL
Sbjct: 301  SNGSLTKLNAIFGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQLASRGIINNVVGHCL 360

Query: 1152 ATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGLGNVGNGR- 1328
              +P+GGGYLFLGDDFVP   M WV ML S + + YQ ++V++ YG   + L   G+ R 
Sbjct: 361  TGDPAGGGYLFLGDDFVPQWGMAWVAMLDSPSIDFYQTKVVRIDYGSIPLSLDTWGSSRE 420

Query: 1329 -LVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKSPSRSVKDVRQ 1505
             +VFDSGSSY+YFT++AY  L+  L+++S+  L+  + D+S  ICW+ +   RSVKDV+ 
Sbjct: 421  QVVFDSGSSYTYFTKEAYYQLVANLEEVSAFGLI--LQDSSDTICWKTEQSIRSVKDVKH 478

Query: 1506 LFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILGDISLR 1685
             FKPL LQFGS++W++STKL I PE YL+ N +GNVCLGIL+G  VHDGST ILGD +LR
Sbjct: 479  FFKPLTLQFGSRFWLVSTKLVILPENYLLINKEGNVCLGILDGSQVHDGSTIILGDNALR 538

Query: 1686 GLLFVYDNVNEKIGWVRSDCARPRRFESHSLF 1781
            G L VYDNVN++IGW  SDC  PR+ +   LF
Sbjct: 539  GKLVVYDNVNQRIGWTSSDCHNPRKIKHLPLF 570


>ref|XP_002328687.1| predicted protein [Populus trichocarpa]
            gi|566206181|ref|XP_006374352.1| aspartyl protease family
            protein [Populus trichocarpa] gi|550322111|gb|ERP52149.1|
            aspartyl protease family protein [Populus trichocarpa]
          Length = 603

 Score =  560 bits (1444), Expect = e-157
 Identities = 308/620 (49%), Positives = 392/620 (63%), Gaps = 53/620 (8%)
 Frame = +3

Query: 81   MEETNERPPQL--TVIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDES------- 233
            ME  +++ PQL   VII+LPPPDNPSLGKTITAFTL+++  P     PQ  +        
Sbjct: 1    MESDDDQSPQLKGVVIISLPPPDNPSLGKTITAFTLTNNDYPQSHQTPQTHQEDQLPISS 60

Query: 234  ---PPVQNFAXXXXXXXXXXXXXXXXTVLPVLGISVIALYLWVSVSRETLFQLRDELGND 404
               PP QN                   +L  + IS+ AL ++ S+   T  +L+    ND
Sbjct: 61   PPPPPSQN--SQLQFPSSRLFLGTPRKLLSFVFISLFALAIYSSLFTNTFQELKSN-NND 117

Query: 405  DEHQKNNSQHTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAMSVGEVN 584
            D+ QK  S   ++ PLY K        LG  EI L    +   R + KE +   SV  +N
Sbjct: 118  DDDQKPKS---YVFPLYHK--------LGIREIPLNDLENHLRRFVYKE-NLVASVDHLN 165

Query: 585  -----SKLVST--ASRIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTW 743
                 SKL S+  A+ +D++++ PVRGN+YPDG          PP+PY+LD DTGSDLTW
Sbjct: 166  GPHKISKLASSNAAAAMDSSAIFPVRGNLYPDG----------PPQPYYLDFDTGSDLTW 215

Query: 744  IQCDAPCTSCAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADH 923
            IQCDAPCTSCAKGA+ +YKP + NI+PPKD  C+E+QR+Q    C++C QCDYEIEYADH
Sbjct: 216  IQCDAPCTSCAKGANAWYKPRRGNIVPPKDLLCMEVQRNQKAGYCETCDQCDYEIEYADH 275

Query: 924  SSSVGVLARDELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPS 1103
            SSS+GVLA D+L L +ANGSL K   +FGCAYDQQGLLL T+ KTDGILGLSRAK+S PS
Sbjct: 276  SSSMGVLATDKLLLMVANGSLTKLNFIFGCAYDQQGLLLKTLVKTDGILGLSRAKVSLPS 335

Query: 1104 QLASQGIINNVVGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVS 1283
            QLASQGIINNV+GHCL T+  GGGY+FLGDDFVP   M WVPML S +   Y  E+VK++
Sbjct: 336  QLASQGIINNVIGHCLTTDLGGGGYMFLGDDFVPRWGMAWVPMLDSPSMEFYHTEVVKLN 395

Query: 1284 YGGRQIGLGNVGN--GRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPI 1457
            YG   + LG + +    ++FDSGSSY+YF ++AY+ L+  L+++S   LV+  SDT+LP+
Sbjct: 396  YGSSPLSLGGMESRVKHILFDSGSSYTYFPKEAYSELVASLNEVSGAGLVQSTSDTTLPL 455

Query: 1458 CWQVKSPSRSV--------------------------------KDVRQLFKPLNLQFGSK 1541
            CW+   P R                                   DV++ FK L  QFG+K
Sbjct: 456  CWRANFPIRKFIYRTELTRPIRRRRRRRRRRRRRRRRRRQHIKGDVKKFFKTLTFQFGTK 515

Query: 1542 WWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILGDISLRGLLFVYDNVNEK 1721
            W ++STK +IPPEGYL+ + KGNVCLGIL G  VHDGST ILGDISLRG L VYDNVN+K
Sbjct: 516  WLVISTKFRIPPEGYLMMSDKGNVCLGILEGSKVHDGSTIILGDISLRGQLVVYDNVNKK 575

Query: 1722 IGWVRSDCARPRRFESHSLF 1781
            IGW  SDCA+P+R +S   F
Sbjct: 576  IGWTPSDCAKPKRSDSLQFF 595


>ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297337316|gb|EFH67733.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 578

 Score =  556 bits (1432), Expect = e-155
 Identities = 291/570 (51%), Positives = 392/570 (68%), Gaps = 15/570 (2%)
 Frame = +3

Query: 117  VIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESPPVQ----NFAXXXXXXXXXX 284
            VIITLPP D+PS GKTI+AFTL+DH  P  Q PP+ + +P  Q    +            
Sbjct: 15   VIITLPPSDDPSQGKTISAFTLNDHDYPL-QIPPEDNPNPSFQPDPLHQNQQSRLLFSDL 73

Query: 285  XXXXXXTVLPVLGISVIALYLWVSVSRET--LFQLRDELGNDDEHQKNNSQHTFLLPLYL 458
                   VL +LG S++A+  + SV   +  +F++ DE   DD+  +  +  +F+ P+Y 
Sbjct: 74   SMGSPRLVLGLLGFSLLAVAFYASVFPNSVQMFRVSDERNRDDDSSRETT--SFVFPVYH 131

Query: 459  KKPNRVNGDLGDFEIK-LGRRVSFDSRPISKELD-DAMSVGEVNSKLVSTASRIDA-TSV 629
            K   R      +F  + L   +  ++    + +D + ++  +VN  L ++A  ID+ T++
Sbjct: 132  KLRAR------EFHERILAEDLGLENGKFVESMDLELVNPVKVNDVLSTSAGSIDSSTTI 185

Query: 630  IPVRGNIYPDGLYYTYLHFGNPP--RPYFLDMDTGSDLTWIQCDAPCTSCAKGAHPFYKP 803
             PV GN+YPDGLYYT +  G P   + Y LD+DTGSDLTWIQCDAPCTSCAKGA+  YKP
Sbjct: 186  FPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKGANQLYKP 245

Query: 804  VKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLARDELYLNIANGS 983
             K N++   + +CVE+QR+Q+T+ C+SCHQCDYEIEYADHS S+GVL +D+ +L + NGS
Sbjct: 246  RKDNLVRSSEPFCVEVQRNQLTEHCESCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGS 305

Query: 984  LAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIINNVVGHCLATEP 1163
            LA+S +VFGC YDQQGLLLNT+ KTDGILGLSRAKIS PSQLAS+GII+NVVGHCLA++ 
Sbjct: 306  LAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDL 365

Query: 1164 SGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGL-GNVGN-GRLVF 1337
            +G GY+F+G D VP   MTWVPML   +   YQ ++ K+SYG   + L G  G  G+++F
Sbjct: 366  NGEGYIFMGSDLVPSHGMTWVPMLHHPHLEVYQMQVTKMSYGNAMLSLDGENGRVGKVLF 425

Query: 1338 DSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVK--SPSRSVKDVRQLF 1511
            D+GSSY+YF  QAY+ L+T L ++S   L RD SD +LPICW+ K  SP  S+ DV++ F
Sbjct: 426  DTGSSYTYFPNQAYSQLVTSLQEVSDLELTRDDSDEALPICWRAKTNSPISSLSDVKKFF 485

Query: 1512 KPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILGDISLRGL 1691
            +P+ LQ GSKW I+S KL I PE YL+ ++KGNVCLGIL+G NVHDGST I+GDIS+RG 
Sbjct: 486  RPITLQIGSKWLIISKKLLIQPEDYLIISNKGNVCLGILDGSNVHDGSTIIIGDISMRGR 545

Query: 1692 LFVYDNVNEKIGWVRSDCARPRRFESHSLF 1781
            L VYDNV ++IGW++SDC RP  F+ +  F
Sbjct: 546  LIVYDNVKQRIGWMKSDCVRPSEFDHNVPF 575


>ref|XP_006583400.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 574

 Score =  548 bits (1413), Expect = e-153
 Identities = 287/578 (49%), Positives = 377/578 (65%), Gaps = 20/578 (3%)
 Frame = +3

Query: 81   MEETNERPPQLTVIITLPPPDNPSLGKTITAFTLSDHQTPTPQ--------------SPP 218
            ME+      +  VII+LPPPDNPSLGKTITAF  S++ +P PQ                 
Sbjct: 1    MEDDQSTQIKGVVIISLPPPDNPSLGKTITAFAFSNNPSPPPQLFIQPHQHQSQQTHPNA 60

Query: 219  QVDESPPVQNFAXXXXXXXXXXXXXXXXTV--LPVLGISVIALYLWVSVSRETLFQLRDE 392
            Q +  PP+Q++                  V      G  + AL+L+ SVS  T   LR  
Sbjct: 61   QHNTDPPLQSYPSNPQLSFSFRRLFHSTPVKLFSFFGTLLFALFLYGSVSSTTTVDLRGR 120

Query: 393  LGNDDEHQKNNSQHTFLLPLYLKKPNRVNGDLG--DFEIKLGRRVSFDSRPISKELDDAM 566
              + D+ +  +    FL PL+ K      G LG  D +++LG+ V  +     +++ D  
Sbjct: 121  KNDGDDDKATS----FLFPLFPKF-----GVLGQKDLKLQLGKLVQKEKFLTQRDVGDGS 171

Query: 567  SVGEVNSKLVSTASRIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWI 746
             V  V           D++SV PV GN+YPDGLY+T L  GNPP+ YFLD+DTGSDLTW+
Sbjct: 172  GVVAV-----------DSSSVFPVSGNVYPDGLYFTILRVGNPPKSYFLDVDTGSDLTWM 220

Query: 747  QCDAPCTSCAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCD-SCHQCDYEIEYADH 923
            QCDAPC SC KGAH  YKP ++N++   DS C+++Q++Q     D S  QCDYEI+YADH
Sbjct: 221  QCDAPCRSCGKGAHVQYKPTRSNVVSSVDSLCLDVQKNQKNGHHDESLLQCDYEIQYADH 280

Query: 924  SSSVGVLARDELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPS 1103
            SSS+GVL RDEL+L   NGS  K  VVFGC YDQ+GL+LNT+ KTDGI+GLSRAK+S P 
Sbjct: 281  SSSLGVLVRDELHLVTTNGSKTKLNVVFGCGYDQEGLILNTLAKTDGIMGLSRAKVSLPY 340

Query: 1104 QLASQGIINNVVGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVS 1283
            QLAS+G+I NVVGHCL+ + +GGGY+FLGDDFVP+  M WVPM  +  ++ YQ E++ ++
Sbjct: 341  QLASKGLIKNVVGHCLSNDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGIN 400

Query: 1284 YGGRQIGL-GNVGNGRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPIC 1460
            YG RQ+   G    G++ FDSGSSY+YF ++AY +L+  L+++S   LV+D SDT+LPIC
Sbjct: 401  YGNRQLKFDGQSKVGKVFFDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLPIC 460

Query: 1461 WQVKSPSRSVKDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRN 1640
            WQ     RS+KDV+  FK L L+FGSKWWI+ST  QIPPEGYL+ ++KG+VCLGIL+G  
Sbjct: 461  WQANFQIRSIKDVKDYFKTLTLRFGSKWWILSTLFQIPPEGYLIISNKGHVCLGILDGSK 520

Query: 1641 VHDGSTFILGDISLRGLLFVYDNVNEKIGWVRSDCARP 1754
            V+DGS+ ILGDISLRG   VYDNV +KIGW R+DC  P
Sbjct: 521  VNDGSSIILGDISLRGYSVVYDNVKQKIGWKRADCGMP 558


>gb|ESW24775.1| hypothetical protein PHAVU_004G159200g [Phaseolus vulgaris]
          Length = 572

 Score =  548 bits (1412), Expect = e-153
 Identities = 286/569 (50%), Positives = 376/569 (66%), Gaps = 17/569 (2%)
 Frame = +3

Query: 105  PQL--TVIITLPPPDNPSLGKTITAFTLSDHQTPTP--------QSPPQVDE----SPPV 242
            PQ+   VII+LPPPDNPSLGKTITAFT SD  +P P        Q    ++E     PP+
Sbjct: 7    PQIKGVVIISLPPPDNPSLGKTITAFTFSDPSSPQPSLLLQQSHQHQTNINEYNNTDPPL 66

Query: 243  QNFAXXXXXXXXXXXXXXXXTV--LPVLGISVIALYLWVSVSRETLFQLRDELGNDDEHQ 416
             ++                  V      G+ + AL+L+ SVS  T  +L     + D+  
Sbjct: 67   HSYPSNAQLGFSRRRLFHRTPVRFFSFFGVFLFALFLYGSVSSTTTLELSGPKNDGDDDG 126

Query: 417  KNNSQHTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAMSVGEVNSKLV 596
            K  S   +L PLY K      G LG   +KL        + + KE         V S++V
Sbjct: 127  KPGS---YLFPLYPKF-----GVLGQKNMKLQL-----GKLVHKEKLLTQRKYRVGSEVV 173

Query: 597  STASRIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQCDAPCTSCA 776
            +    +D++SV PV GN++PDGLY+T L  GNPPR YFLD+DTGSDLTW+QCDAPC SC 
Sbjct: 174  A----VDSSSVFPVSGNVFPDGLYFTILRVGNPPRSYFLDVDTGSDLTWMQCDAPCISCG 229

Query: 777  KGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLARDE 956
            KGAH  YKP ++N++P  DS C+++Q++Q     +S  QCDY+IEYAD SSS+GVL RDE
Sbjct: 230  KGAHAQYKPTRSNVVPSMDSLCLDVQKNQKDGHHESLQQCDYQIEYADQSSSLGVLIRDE 289

Query: 957  LYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIINNV 1136
            L+L   NGS  K   VFGC YDQ+GLLLNT+ KTDGILGLSRAK+S P QLAS+G+I NV
Sbjct: 290  LHLVTTNGSKTKLNFVFGCGYDQEGLLLNTLAKTDGILGLSRAKVSLPYQLASKGLIKNV 349

Query: 1137 VGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGL-GN 1313
            VGHCL+ +  GGGY+FLGDDF+P+  MTWVPM  +  ++ YQ E++ ++YG RQ+   G 
Sbjct: 350  VGHCLSNDEVGGGYMFLGDDFLPYWGMTWVPMAYTLTTDLYQTEILGINYGNRQLSFDGQ 409

Query: 1314 VGNGRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKSPSRSVK 1493
               G++VFDSGSSY+YF ++AY +L+  L+++S   L++D SDT+LPICW+   P +SVK
Sbjct: 410  SKVGKVVFDSGSSYTYFPKEAYLDLVASLNEVSGLRLIQDDSDTTLPICWEANFPIKSVK 469

Query: 1494 DVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILGD 1673
            DV+  FK + L+FGSKWWI+ST  QI PEGYL+ ++KG+VCLGIL+G NV+DGS+ ILGD
Sbjct: 470  DVKDYFKTITLRFGSKWWILSTMFQIAPEGYLIISNKGHVCLGILDGSNVNDGSSIILGD 529

Query: 1674 ISLRGLLFVYDNVNEKIGWVRSDCARPRR 1760
            IS RG L VYDN  +KIGW R++C    R
Sbjct: 530  ISFRGYLVVYDNSKQKIGWKRAECGMSSR 558


>ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
            gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15
            [Arabidopsis thaliana]
            gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein
            [Arabidopsis thaliana] gi|14532748|gb|AAK64075.1| unknown
            protein [Arabidopsis thaliana]
            gi|332194267|gb|AEE32388.1| aspartyl protease
            [Arabidopsis thaliana]
          Length = 583

 Score =  548 bits (1411), Expect = e-153
 Identities = 289/584 (49%), Positives = 396/584 (67%), Gaps = 16/584 (2%)
 Frame = +3

Query: 78   LMEETNERPPQLTVIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESPPVQ---- 245
            L ++  ++     VIITLPP D+PS GKTI+AFTL+DH  P  + PP+ + +P  Q    
Sbjct: 5    LHDQQQQQRVHSVVIITLPPSDDPSQGKTISAFTLTDHDYPL-EIPPEDNPNPSFQPDPL 63

Query: 246  NFAXXXXXXXXXXXXXXXXTVLPVLGISVIALYLWVSVSRETLFQLR---DELGNDDEHQ 416
            +                   VL +LGIS++A+  + SV   ++   R   DE   DD+  
Sbjct: 64   HRNQQSRLLFSDLSMNSPRLVLGLLGISLLAVAFYASVFPNSVQMFRVSPDERNRDDDDN 123

Query: 417  KNNSQHTFLLPLYLKKPNRVNGDLGDFEIK-LGRRVSFDSRPISKELD-DAMSVGEVNSK 590
               +  +F+ P+Y K   R      +F  + L   +  ++    + +D + ++  +VN  
Sbjct: 124  LRETA-SFVFPVYHKLRAR------EFHERILEEDLGLENENFVESMDLELVNPVKVNDV 176

Query: 591  LVSTASRIDA-TSVIPVRGNIYPDGLYYTYLHFGNPP--RPYFLDMDTGSDLTWIQCDAP 761
            L ++A  ID+ T++ PV GN+YPDGLYYT +  G P   + Y LD+DTGS+LTWIQCDAP
Sbjct: 177  LSTSAGSIDSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAP 236

Query: 762  CTSCAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGV 941
            CTSCAKGA+  YKP K N++   +++CVE+QR+Q+T+ C++CHQCDYEIEYADHS S+GV
Sbjct: 237  CTSCAKGANQLYKPRKDNLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGV 296

Query: 942  LARDELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQG 1121
            L +D+ +L + NGSLA+S +VFGC YDQQGLLLNT+ KTDGILGLSRAKIS PSQLAS+G
Sbjct: 297  LTKDKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRG 356

Query: 1122 IINNVVGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQI 1301
            II+NVVGHCLA++ +G GY+F+G D VP   MTWVPML     ++YQ ++ K+SYG   +
Sbjct: 357  IISNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGML 416

Query: 1302 GL-GNVGN-GRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKS 1475
             L G  G  G+++FD+GSSY+YF  QAY+ L+T L ++S   L RD SD +LPICW+ K+
Sbjct: 417  SLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPICWRAKT 476

Query: 1476 --PSRSVKDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHD 1649
              P  S+ DV++ F+P+ LQ GSKW I+S KL I PE YL+ ++KGNVCLGIL+G +VHD
Sbjct: 477  NFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSVHD 536

Query: 1650 GSTFILGDISLRGLLFVYDNVNEKIGWVRSDCARPRRFESHSLF 1781
            GST ILGDIS+RG L VYDNV  +IGW++SDC RPR  + +  F
Sbjct: 537  GSTIILGDISMRGHLIVYDNVKRRIGWMKSDCVRPREIDHNVPF 580


>ref|XP_006826817.1| hypothetical protein AMTR_s00010p00056950 [Amborella trichopoda]
            gi|548831246|gb|ERM94054.1| hypothetical protein
            AMTR_s00010p00056950 [Amborella trichopoda]
          Length = 545

 Score =  538 bits (1385), Expect = e-150
 Identities = 279/557 (50%), Positives = 367/557 (65%), Gaps = 2/557 (0%)
 Frame = +3

Query: 117  VIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESPPVQNFAXXXXXXXXXXXXXX 296
            VII+LPPPD+PS GKTITAFT+    +   ++  Q  ++   Q  +              
Sbjct: 10   VIISLPPPDDPSKGKTITAFTMVSDPSHQNENQSQNQQTQQPQIASNSIAGSSRGRIGSI 69

Query: 297  XXTVLPVLGISVIALYLWVSVSRETLFQLRDELGNDDEHQKNNSQHTFLLPLYLKKPNRV 476
               VL +LG  V  L+ W  VS  +      E+  + E  KNN   +FL  LY K     
Sbjct: 70   VVRVLAMLGAVVAVLFFWQWVSGFS------EMDYETERSKNNP--SFLYNLYPKWSEEA 121

Query: 477  NGDLGDFEIKLGRRVSFDSRPISKELDDAMSVGEVNSKLVSTASRIDATSVIPVRGNIYP 656
                 D  ++LG  V  D           + +G  + K +   S I+++++ PV+GN+YP
Sbjct: 122  IEK--DAALRLGTFVKRDE----------VRIGLRDVKTLEAISSINSSTIFPVKGNVYP 169

Query: 657  DGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQCDAPCTSCAKGAHPFYKPVKANIIPPKDS 836
            DGLYY  +  GNP RPY+LDMDTGSDLTWIQC+APCT+CAKG HP Y P K N++P KD 
Sbjct: 170  DGLYYISILVGNPRRPYYLDMDTGSDLTWIQCNAPCTNCAKGPHPLYNPSKQNLVPSKDP 229

Query: 837  YCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLARDELYLNIANGSLAKSKVVFGCA 1016
            +C+E+Q +   K   + HQCDY+IEYAD SSS+GVL RD+L L I NG++ K+ +VFGCA
Sbjct: 230  FCLEVQVNDKGKFAGASHQCDYDIEYADQSSSMGVLVRDDLQLMITNGTVIKTGLVFGCA 289

Query: 1017 YDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIINNVVGHCLATEPSGGGYLFLGDD 1196
            YDQ+G L ++  KTDGILGLS AK+S PSQLAS+G++ NVVGHC+  + +GGGY+FLGDD
Sbjct: 290  YDQRGKLGHSPAKTDGILGLSSAKVSLPSQLASRGLMKNVVGHCIRNDANGGGYMFLGDD 349

Query: 1197 FVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGLGNVGN--GRLVFDSGSSYSYFTE 1370
            F+P  RMTWVPML S ++N+Y AE+ K+S G R I  G +    GR+VFDSGSSYSY T+
Sbjct: 350  FIPQWRMTWVPMLSSPSTNAYHAEVSKISLGSRPIDGGGLITKIGRVVFDSGSSYSYLTK 409

Query: 1371 QAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKSPSRSVKDVRQLFKPLNLQFGSKWWI 1550
            QAY +L+  L D++ + LV D SD +LP+CW+ KSP RS+KDV Q FKPL L FGS+   
Sbjct: 410  QAYTSLIKSLKDVAEKGLVLDDSDKTLPVCWKAKSPLRSIKDVNQFFKPLVLNFGSRLLF 469

Query: 1551 MSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILGDISLRGLLFVYDNVNEKIGW 1730
             S   +IPPEGYL+ ++KGN CLGIL G ++HDG+T ILGDISLR  L VYDNV  +IGW
Sbjct: 470  GSKNFEIPPEGYLIISAKGNACLGILEGSHIHDGATNILGDISLRAKLVVYDNVKRRIGW 529

Query: 1731 VRSDCARPRRFESHSLF 1781
            V+SDC +P + +S   F
Sbjct: 530  VQSDC-QPLKLKSFPFF 545


>ref|XP_004512995.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
            [Cicer arietinum]
          Length = 1387

 Score =  536 bits (1380), Expect = e-149
 Identities = 290/575 (50%), Positives = 381/575 (66%), Gaps = 16/575 (2%)
 Frame = +3

Query: 84   EETNERPPQL--TVIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESP--PVQNF 251
            +E+  + PQL   VII++PP +NPSLGK ITAFT S++   +PQ  PQ +  P  P+Q++
Sbjct: 5    KESQSQTPQLKSVVIISIPPSNNPSLGKKITAFTFSNNPF-SPQQQPQNNVPPMSPIQSY 63

Query: 252  AXXXXXXXXXXXXXXXXTVLPVL---GISVIALYLWVSVSRETLFQLR-DEL------GN 401
                             T +      GI + AL+L+ S+       L   EL      G 
Sbjct: 64   PSNHQLQFSSTRRFFHTTQIKFFTFFGIFLFALFLYGSLFSTITTTLELSELKNHHHDGG 123

Query: 402  DDEHQKNNSQHTFLLPLYLKKPNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAMSVGEV 581
            DDE  + +S   FL PL+ K       DL   ++K G  V+  S        D+  +   
Sbjct: 124  DDESDEPSS---FLFPLFKKYGVVGQRDLKLIDVKKGNFVTQKS-------GDSDGIA-F 172

Query: 582  NSKLVSTASRIDATSVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQCDAP 761
            +S++V+  S   +++V P+ GN+YPDGLYYT++  GNPP+ YF+D+DTGSDLTWIQCDAP
Sbjct: 173  SSRVVAVDS--SSSTVFPISGNVYPDGLYYTHVRVGNPPKRYFVDVDTGSDLTWIQCDAP 230

Query: 762  CTSCAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGV 941
            C SCAKGA+  YKP++ NI+P  DS C+E+Q++Q     +S  QCDYEI+YADHSSS+GV
Sbjct: 231  CRSCAKGANVPYKPIRTNIVPSLDSLCLEVQKNQKNGYHESFQQCDYEIQYADHSSSMGV 290

Query: 942  LARDELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQG 1121
            L RDEL+L   NGS  K   VFGC YDQ+GLLLNT+ KTDGI+GLSRAK+  P QL+S+G
Sbjct: 291  LIRDELHLMTTNGSKTKLNFVFGCGYDQEGLLLNTLTKTDGIMGLSRAKVGLPYQLSSKG 350

Query: 1122 IINNVVGHCLATEPS-GGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQ 1298
            II NVVGHCL+     GGGY+FLGDDFVP+  MTW PM Q   ++ YQ E++ ++YG R 
Sbjct: 351  IIKNVVGHCLSNNDGVGGGYMFLGDDFVPYWGMTWAPMTQI--TDLYQTEVLGINYGNRL 408

Query: 1299 IGL-GNVGNGRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKS 1475
            +   G+   G +VFDSGSSY+YF ++AY +L+  L+++S   LV D SDT+LPICWQ   
Sbjct: 409  LSFDGHSKVGNVVFDSGSSYTYFPKEAYRDLVASLEEVSGLGLVEDDSDTTLPICWQANF 468

Query: 1476 PSRSVKDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGS 1655
            P RSVKDV+  FK L L+FG+KWWI+ST   IPPEGYL+ ++KGNVCL IL+G NV+DGS
Sbjct: 469  PIRSVKDVKDYFKTLTLRFGNKWWILSTLFHIPPEGYLIISNKGNVCLAILDGSNVNDGS 528

Query: 1656 TFILGDISLRGLLFVYDNVNEKIGWVRSDCARPRR 1760
            + ILGDISLRG L VYDNVN+ IGW R+ C  P R
Sbjct: 529  SIILGDISLRGYLVVYDNVNKNIGWERTKCGMPNR 563


>ref|XP_006304522.1| hypothetical protein CARUB_v10011364mg [Capsella rubella]
            gi|482573233|gb|EOA37420.1| hypothetical protein
            CARUB_v10011364mg [Capsella rubella]
          Length = 580

 Score =  535 bits (1378), Expect = e-149
 Identities = 287/570 (50%), Positives = 380/570 (66%), Gaps = 15/570 (2%)
 Frame = +3

Query: 117  VIITLPPPDNPSLGKTITAFTLSDHQTPT---PQSPPQVDESPPVQNFAXXXXXXXXXXX 287
            V+ITLPP D+PS GKTI+AFTL+DH  P    P+ P      P  QN             
Sbjct: 17   VVITLPPSDDPSQGKTISAFTLTDHDYPLEIPPEDPSFHQPDPLHQN--PQFRLWFSDLS 74

Query: 288  XXXXXTVLPVLGISVIALYLWVSVSRET--LFQLRDELGNDDEHQKNNSQHTFLLPLYLK 461
                  VL +LGIS+IA+ L+ SV   +  +F++ DE   DD++ +  +  +F+ P+Y K
Sbjct: 75   MSSPRLVLSLLGISLIAIALYGSVFSNSVQMFRVSDERNRDDDNSRRETT-SFVFPVYHK 133

Query: 462  KPNRVNGDLGDF-EIKLGRRVSFDSRPISKELD-DAMSVGEVNSKLVSTASRIDA--TSV 629
               R      +F E  L   +  ++  + + +D + ++  +VNS L +TA  +D+  T++
Sbjct: 134  LRAR------EFHERVLAEDLGVENGILVESMDLELVNPVKVNSVLSTTAGSVDSSSTTI 187

Query: 630  IPVRGNIYPDGLYYTYLHFGNPP--RPYFLDMDTGSDLTWIQCDAPCTSCAKGAHPFYKP 803
             PV GN+YPDGLYYT +  G P     Y LD+DTGSDLTWIQCDAPCTSCAKGA+  YKP
Sbjct: 188  FPVGGNVYPDGLYYTRILVGKPEDGHYYHLDIDTGSDLTWIQCDAPCTSCAKGANQLYKP 247

Query: 804  VKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLARDELYLNIANGS 983
               N++   +  CVE QR+Q+T   +S  QCDYEIEYADHS S+GVL +D+ +L + NGS
Sbjct: 248  KNHNLVGSSEPLCVEFQRNQMTGHFESSQQCDYEIEYADHSYSMGVLTKDKFHLKLHNGS 307

Query: 984  LAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIINNVVGHCLATEP 1163
            LA+S +VFGC YDQQG+LLNT+ KTDGILGLSRAKIS PSQL S+GII+NVVGHCLA++ 
Sbjct: 308  LAESDIVFGCGYDQQGVLLNTLLKTDGILGLSRAKISLPSQLGSRGIISNVVGHCLASDL 367

Query: 1164 SGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGL-GNVGN-GRLVF 1337
             G GY+F+G D VP   MTWVPML       YQ ++ K+SYG   + L G  G  G+ +F
Sbjct: 368  DGEGYIFMGSDLVPSHGMTWVPMLHHSRLEVYQMQVTKMSYGNAMLTLDGENGRVGKALF 427

Query: 1338 DSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKS--PSRSVKDVRQLF 1511
            D+GSSY+YF  QAY  L+T L ++S   L RD SD +LPICW+ K+  P  S+ DV++ F
Sbjct: 428  DTGSSYTYFPNQAYTQLVTSLQEVSGSDLTRDDSDETLPICWRAKTNFPISSLSDVKKFF 487

Query: 1512 KPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILGDISLRGL 1691
            +P+ LQ  SKW I+S KL I PE YL+ ++KGNVCLGIL+G +VHDGST I+GDIS+RG 
Sbjct: 488  RPITLQIWSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSVHDGSTIIIGDISMRGH 547

Query: 1692 LFVYDNVNEKIGWVRSDCARPRRFESHSLF 1781
            L VYDNV  +IGW++SDC RPR F+ +  F
Sbjct: 548  LIVYDNVKRRIGWMKSDCVRPREFDHNVPF 577


>ref|XP_006393315.1| hypothetical protein EUTSA_v10011346mg [Eutrema salsugineum]
            gi|557089893|gb|ESQ30601.1| hypothetical protein
            EUTSA_v10011346mg [Eutrema salsugineum]
          Length = 580

 Score =  528 bits (1359), Expect = e-147
 Identities = 282/562 (50%), Positives = 377/562 (67%), Gaps = 16/562 (2%)
 Frame = +3

Query: 117  VIITLPPPDNPSLGKTITAFTLSDHQTPTPQSPPQVDESPPVQNFAXXXXXXXXXXXXXX 296
            VIITLPP DNPS GKTI+AFTL+DH  P P   P+ + +P  Q                 
Sbjct: 18   VIITLPPSDNPSKGKTISAFTLTDHDYP-PDIRPEDERNPSFQPDPLHQNPQSGLWFSDL 76

Query: 297  XXT----VLPVLGISVIALYLWVSVSRET--LFQLRDELGNDDEHQKNNSQHTFLLPLYL 458
              +    VL +LGIS++A+  + SV   +  LF++ DE   D+++++  +  +F+ P+Y 
Sbjct: 77   SMSSPRLVLGLLGISLLAIAFYGSVFPNSVQLFRVSDERDRDEDNRRETA--SFVFPVYH 134

Query: 459  KK-----PNRVNGDLGDFEIKLGRRVSFDSRPISKELDDAMSVGEVNSKLVSTASRIDAT 623
            K      P R   +  D  +K    +  +S  I +EL + + V +V S   S  S   +T
Sbjct: 135  KLRAREIPERNLAEALDV-VKEENGIFVES--IEQELVNPVKVNDVFS--ASVGSLDSST 189

Query: 624  SVIPVRGNIYPDGLYYTYLHFGNPPRP---YFLDMDTGSDLTWIQCDAPCTSCAKGAHPF 794
            ++ PV G +YPDGLY+T +  GNP +    + LD+DTGSDLTWIQCDAPCTSCAKGA+  
Sbjct: 190  TIFPVGGYVYPDGLYFTRVFVGNPEKDGHYFHLDIDTGSDLTWIQCDAPCTSCAKGANQL 249

Query: 795  YKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSSSVGVLARDELYLNIA 974
            YKP K  ++   +  CVE+Q++Q+T+ C+SC QCDYEIEYAD SSS+GVL +DE +L + 
Sbjct: 250  YKPRKDKLVGSAEHLCVEVQKNQMTELCESCQQCDYEIEYADLSSSLGVLTKDEFHLKLH 309

Query: 975  NGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQLASQGIINNVVGHCLA 1154
            NGSLA S +VFGC YDQQGLLLNT+ K DGILGLSRAKIS PSQLASQGII+NVVGHCL 
Sbjct: 310  NGSLAASDIVFGCGYDQQGLLLNTLLKKDGILGLSRAKISLPSQLASQGIISNVVGHCLP 369

Query: 1155 TEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYGGRQIGLG--NVGNGR 1328
            ++ +G GY+F+G D VP   MTWVPM    +   +Q ++ KVSYG   + L   N   G+
Sbjct: 370  SDLNGEGYIFMGSDLVPLHGMTWVPMFHHSHLEVHQMQVTKVSYGNGMLSLSGENGRIGK 429

Query: 1329 LVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICWQVKSPSRSVKDVRQL 1508
            ++FD+GSSY+YF ++AY+ L+T L ++    L RD SD +LPICWQ      S+ DV++ 
Sbjct: 430  VLFDTGSSYTYFPKKAYSQLVTSLQEV---KLTRDESDKALPICWQANFLISSLSDVKRF 486

Query: 1509 FKPLNLQFGSKWWIMSTKLQIPPEGYLVTNSKGNVCLGILNGRNVHDGSTFILGDISLRG 1688
            +KP+ +Q GSKWWI+S KL I PE YL+ ++KGNVCLGIL+G +VHDGST ILGDIS+RG
Sbjct: 487  YKPITIQIGSKWWIISRKLVIQPEDYLIISNKGNVCLGILDGSSVHDGSTIILGDISMRG 546

Query: 1689 LLFVYDNVNEKIGWVRSDCARP 1754
             L VYDNV  +IGW++SDC RP
Sbjct: 547  RLIVYDNVKRRIGWMKSDCVRP 568


>gb|EOY21002.1| Eukaryotic aspartyl protease family protein, putative isoform 2
            [Theobroma cacao]
          Length = 520

 Score =  524 bits (1349), Expect = e-146
 Identities = 273/523 (52%), Positives = 352/523 (67%), Gaps = 21/523 (4%)
 Frame = +3

Query: 87   ETNERPPQLT--VIITLPPPDNPSLGKTITAFTLSD------HQTPTPQSPPQVDESPPV 242
            +++ERP Q+T  VIITLPP DNPSLGKTITAFTL++      HQT   Q   +    P  
Sbjct: 2    DSDERPQQVTGVVIITLPPSDNPSLGKTITAFTLTNDVFPQSHQTQQRQQQEEEQTLPTT 61

Query: 243  QNFAXXXXXXXXXXXXXXXX--------TVLPVLGISVIALYLWVSVSRETLFQLRDELG 398
            Q                            +L  LGIS+ AL L+ S    T  +LR+   
Sbjct: 62   QILTPAPPSAQNPQRGFSFLGLFSDNPRKLLGFLGISLFALLLYSSAFSNTFVELRNSNN 121

Query: 399  NDDEHQKNNSQHTFLLPLYLKKPNRVNGDLG-DFEIKLGRRVSFDSRPISKELDD-AMSV 572
            +DDE  ++     F+ PLY K        LG D E+KLGR V  D   +   ++  A   
Sbjct: 122  DDDEKPQS-----FIFPLYHK--------LGADLELKLGRFVDVDKENLVASVEGGATGT 168

Query: 573  GEVNSKLVSTASRIDAT-SVIPVRGNIYPDGLYYTYLHFGNPPRPYFLDMDTGSDLTWIQ 749
             ++N  + S A+ ID++ +++PVRGN+YPDGLY+TY+  GNP R YFLD+DTGSDLTWIQ
Sbjct: 169  QKINKLVASNAAVIDSSGTILPVRGNVYPDGLYFTYMLVGNPQRRYFLDIDTGSDLTWIQ 228

Query: 750  CDAPCTSCAKGAHPFYKPVKANIIPPKDSYCVEIQRSQITKDCDSCHQCDYEIEYADHSS 929
            CDAPC+SCAKGA+P YKP + NI+  KD  C E+Q++Q  ++C++C QCDYEIEYAD SS
Sbjct: 229  CDAPCSSCAKGANPLYKPTRVNIVASKDLMCTEVQKNQKPQNCETCQQCDYEIEYADRSS 288

Query: 930  SVGVLARDELYLNIANGSLAKSKVVFGCAYDQQGLLLNTMGKTDGILGLSRAKISFPSQL 1109
            S+GVLARDEL+L  ANGS     VVFGCAYDQQG+LLNT+ KTDGILGLSRAK+S PSQL
Sbjct: 289  SLGVLARDELHLVTANGSTTNLDVVFGCAYDQQGILLNTLSKTDGILGLSRAKVSLPSQL 348

Query: 1110 ASQGIINNVVGHCLATEPSGGGYLFLGDDFVPHSRMTWVPMLQSHNSNSYQAEMVKVSYG 1289
            AS+GIINNVVGHCLAT+    GY+FLGDDFVP+  M+WVPML S ++  Y  ++VK++YG
Sbjct: 349  ASKGIINNVVGHCLATDVGASGYMFLGDDFVPNWGMSWVPMLGSPSTEFYHTQIVKINYG 408

Query: 1290 GRQIGLGNVGN--GRLVFDSGSSYSYFTEQAYNNLLTVLDDISSESLVRDMSDTSLPICW 1463
               + LG   +  GR+VFDSGSSY+YF +QAY  L+  L ++S    ++D++DT+LP+CW
Sbjct: 409  SSSLSLGRQHSSIGRVVFDSGSSYTYFMKQAYAELVASLSEVSEVGFIQDVADTTLPMCW 468

Query: 1464 QVKSPSRSVKDVRQLFKPLNLQFGSKWWIMSTKLQIPPEGYLV 1592
            Q   P R +KDV+Q FK L LQFGSKWWI+S +  IPPEGYL+
Sbjct: 469  QAPFPIRFIKDVKQFFKTLTLQFGSKWWIISKRFHIPPEGYLI 511


Top