BLASTX nr result

ID: Rauwolfia21_contig00017713 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00017713
         (2636 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004241624.1| PREDICTED: aspartic proteinase Asp1-like [So...   679   0.0  
ref|XP_006354730.1| PREDICTED: aspartic proteinase Asp1-like [So...   678   0.0  
ref|XP_006440945.1| hypothetical protein CICLE_v10019473mg [Citr...   641   0.0  
gb|EXB56943.1| Aspartic proteinase Asp1 [Morus notabilis]             636   e-179
ref|XP_006485774.1| PREDICTED: aspartic proteinase Asp1-like [Ci...   631   e-178
ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vi...   630   e-178
gb|EOY21001.1| Eukaryotic aspartyl protease family protein, puta...   617   e-174
ref|XP_002511959.1| protein with unknown function [Ricinus commu...   612   e-172
ref|XP_002328687.1| predicted protein [Populus trichocarpa] gi|5...   605   e-170
ref|XP_006583400.1| PREDICTED: aspartic proteinase Asp1-like [Gl...   578   e-162
emb|CBI15437.3| unnamed protein product [Vitis vinifera]              576   e-161
ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cu...   574   e-161
gb|ESW24775.1| hypothetical protein PHAVU_004G159200g [Phaseolus...   563   e-157
ref|XP_006826817.1| hypothetical protein AMTR_s00010p00056950 [A...   548   e-153
ref|XP_006588295.1| PREDICTED: lysine-specific histone demethyla...   545   e-152
ref|XP_004512995.1| PREDICTED: lysine-specific histone demethyla...   545   e-152
ref|XP_002891474.1| aspartyl protease family protein [Arabidopsi...   535   e-149
gb|EOY21002.1| Eukaryotic aspartyl protease family protein, puta...   532   e-148
gb|ESW24776.1| hypothetical protein PHAVU_004G159200g [Phaseolus...   526   e-146
ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana] gi|777...   525   e-146

>ref|XP_004241624.1| PREDICTED: aspartic proteinase Asp1-like [Solanum lycopersicum]
          Length = 562

 Score =  679 bits (1751), Expect = 0.0
 Identities = 349/570 (61%), Positives = 417/570 (73%), Gaps = 4/570 (0%)
 Frame = +3

Query: 183  MEENEESPQLKGIVIIQLPPPENPSLGKTITAITFSENTQSPIRQAPEPHRQVQPVGPSQ 362
            MEE + SP ++G+VII LPPP+NPS GKTITA T S+   SP  Q  +   Q +      
Sbjct: 1    MEETKNSPPIQGVVIITLPPPDNPSYGKTITAFTLSD---SPTHQQQQEQEQEEEPPQQS 57

Query: 363  HPQTQE-NPGVQGRFSSKGFLFGSPIVFSCLLSLSLIALFSWVSSTQETLYELRDADNDR 539
             P  Q+ N GV      + F F   IVF  LL +SLIAL  W S TQETL+ELRD + D 
Sbjct: 58   QPHNQDVNAGVLHVSLERSFFFRPTIVFG-LLGISLIALSFWSSLTQETLFELRDVEQDH 116

Query: 540  KSN--SIIFPLYPKLGIGGNVPQDAEIKLGRFVDSNFKIVMDNLNDGTTGGKFAKSMISE 713
            KS+  S I PLYPK G   N   D E KLGRFVD  FK   DN  D     K AKS+ + 
Sbjct: 117  KSSNSSFILPLYPKRGGAWNSRTDVEFKLGRFVD--FK--PDNFMDQE---KIAKSLSAA 169

Query: 714  SKIDSTAILTARGNIHPEGLFYTFMLVGNPPKPYFLDIDTGSDLTWIQCDAPCTSCAKGA 893
            +K+DS+A    RGNIH EGL+YT+MLVGNPPKPYFLDIDTGSDL WIQCDAPCTSCAKGA
Sbjct: 170  TKLDSSANFPVRGNIHSEGLYYTYMLVGNPPKPYFLDIDTGSDLMWIQCDAPCTSCAKGA 229

Query: 894  HPFYKPKRGNLIHSKDSYCIEVQKSPGTK-CETCDQCDYEIEYADSSSSIGVLIKDKFQV 1070
            HP YKP+  N+I  K+ YC+EVQ++  +K C+ C QCDYEIEYAD SSS+GVL KD+ Q+
Sbjct: 230  HPLYKPRNVNMIPPKNPYCVEVQENLRSKYCDNCHQCDYEIEYADRSSSVGVLAKDELQL 289

Query: 1071 MISNGSLVRPNLVFGCAYDQQGLLLKSLAKTDGILGLGRGKISLPFQLASHGNIKNVVAH 1250
            +++NG+  +PN+VFGCAYDQQG LL +LA TDGILGL R  ISLP QLASHG I NV+ H
Sbjct: 290  VLANGTGTKPNVVFGCAYDQQGTLLNTLASTDGILGLSRAPISLPSQLASHGLINNVIGH 349

Query: 1251 CLASDAAGGGYVFLGDDFVPHQQMVWVPMLNSPFMNSYHAEVTKISYGGKQFSLGSIGKG 1430
            CL +D   GGY+FLG+DFVP  +M WVPMLN+PF N Y A++ K++YGGK   LGS G G
Sbjct: 350  CLRTDT-NGGYLFLGNDFVPQWRMSWVPMLNNPFPNLYQAQLMKMNYGGKDLQLGSRGYG 408

Query: 1431 IGHVVFDSGSSYTYFTKQAYNDLVANLEDISAKGFVLDASDDTLPICWRAKFPVRSIAEL 1610
               VVFDSGS+YTYFT QAY  L++ LE+IS++  + DASD TLPICWRAKFPVRSI E+
Sbjct: 409  QDSVVFDSGSTYTYFTDQAYKALISMLEEISSEDLIKDASDTTLPICWRAKFPVRSIEEV 468

Query: 1611 RKFFKPLDFQFGSKWWIASQKLRIPPEGYLITSNKGNICLGILEGSHVNDGSSVILGDIS 1790
            R+FFKPL+ QFGSKW + S KL IP EGYL  S K N+CLGIL+GS+V+DGS++ILGDIS
Sbjct: 469  RQFFKPLNLQFGSKWRVVSTKLWIPAEGYLTISEKSNVCLGILDGSNVHDGSAIILGDIS 528

Query: 1791 LRGQLFVYDNENQKIGWIRSDCRRPQTFDS 1880
            LRGQLFVYDN NQKIGWIRS+C RP+   S
Sbjct: 529  LRGQLFVYDNVNQKIGWIRSNCERPENVPS 558


>ref|XP_006354730.1| PREDICTED: aspartic proteinase Asp1-like [Solanum tuberosum]
          Length = 558

 Score =  678 bits (1750), Expect = 0.0
 Identities = 346/569 (60%), Positives = 419/569 (73%), Gaps = 3/569 (0%)
 Frame = +3

Query: 183  MEENEESPQLKGIVIIQLPPPENPSLGKTITAITFSENTQSPIRQAPEPHRQVQPVGPSQ 362
            MEE + SP ++G+VII LPPP+NPS GKTITA T S++     +Q  EP +Q QP     
Sbjct: 1    MEETKNSPPIQGVVIITLPPPDNPSYGKTITAFTLSDSPTHQQQQEEEPPQQSQP----- 55

Query: 363  HPQTQENPGVQGRFSSKGFLFGSPIVFSCLLSLSLIALFSWVSSTQETLYELRDADNDRK 542
            H Q   N GV      + F F   IVF  LL +SLIAL  W S TQETL+ELRD ++D K
Sbjct: 56   HNQDL-NTGVLRASLERSFFFRPKIVFG-LLGISLIALSFWSSLTQETLFELRDVEHDHK 113

Query: 543  SN--SIIFPLYPKLGIGGNVPQDAEIKLGRFVDSNFKIVMDNLNDGTTGGKFAKSMISES 716
            S+  S I PLYPK G   N  +D E KLGRFVD      MD         K AKS+ + +
Sbjct: 114  SSNSSFILPLYPKRGGAWNSRRDVEFKLGRFVDFKPDKFMDQ-------EKIAKSLSAAT 166

Query: 717  KIDSTAILTARGNIHPEGLFYTFMLVGNPPKPYFLDIDTGSDLTWIQCDAPCTSCAKGAH 896
            K+DS+     RGNIH EGL+YT+MLVGNPP+PYFLDIDTGSDL WIQCDAPCTSCAKGAH
Sbjct: 167  KLDSSVNFPVRGNIHSEGLYYTYMLVGNPPRPYFLDIDTGSDLMWIQCDAPCTSCAKGAH 226

Query: 897  PFYKPKRGNLIHSKDSYCIEVQKSPGTK-CETCDQCDYEIEYADSSSSIGVLIKDKFQVM 1073
            P YKP+  N+I  K+ YC+EVQ++  +K C+ C QCDYEIEYAD SSS+GVL KD+ Q++
Sbjct: 227  PLYKPRNVNMIPPKNPYCVEVQENLKSKYCDNCHQCDYEIEYADRSSSVGVLAKDELQLV 286

Query: 1074 ISNGSLVRPNLVFGCAYDQQGLLLKSLAKTDGILGLGRGKISLPFQLASHGNIKNVVAHC 1253
            ++NG+  +P++VFGCAYDQQG LL +LA TDGILGL R  ISLP QLASHG I NV+ HC
Sbjct: 287  LANGTGTKPSVVFGCAYDQQGTLLNTLASTDGILGLSRAPISLPSQLASHGLINNVIGHC 346

Query: 1254 LASDAAGGGYVFLGDDFVPHQQMVWVPMLNSPFMNSYHAEVTKISYGGKQFSLGSIGKGI 1433
            L +D   GGY+FLG+DFVP  +M WVPMLN+PF N Y A++ K++YGGK+  LGS   G 
Sbjct: 347  LRTDT-NGGYLFLGNDFVPQWRMSWVPMLNNPFPNLYQAQLMKMNYGGKELRLGSTSYGQ 405

Query: 1434 GHVVFDSGSSYTYFTKQAYNDLVANLEDISAKGFVLDASDDTLPICWRAKFPVRSIAELR 1613
            G VVFDSGS+YTYFT QAY  L++ LE+IS++  + DASD TLPICWRAKFPVRSI E+R
Sbjct: 406  GTVVFDSGSTYTYFTDQAYKALISMLEEISSEDLIKDASDTTLPICWRAKFPVRSIEEVR 465

Query: 1614 KFFKPLDFQFGSKWWIASQKLRIPPEGYLITSNKGNICLGILEGSHVNDGSSVILGDISL 1793
            +FFKPL+ QFGSKW I S KL IP EG+L  S KGN+CLGIL+GS+V+DGS++ILGDISL
Sbjct: 466  QFFKPLNLQFGSKWRIVSTKLWIPAEGFLTISEKGNVCLGILDGSNVHDGSAIILGDISL 525

Query: 1794 RGQLFVYDNENQKIGWIRSDCRRPQTFDS 1880
            RGQLFVYDN NQKIGWIRS+C RP+   S
Sbjct: 526  RGQLFVYDNVNQKIGWIRSNCERPEKVPS 554


>ref|XP_006440945.1| hypothetical protein CICLE_v10019473mg [Citrus clementina]
            gi|557543207|gb|ESR54185.1| hypothetical protein
            CICLE_v10019473mg [Citrus clementina]
          Length = 577

 Score =  641 bits (1653), Expect = 0.0
 Identities = 320/573 (55%), Positives = 409/573 (71%), Gaps = 8/573 (1%)
 Frame = +3

Query: 186  EENEESPQLKGIVIIQLPPPENPSLGKTITAITFSENTQSPIRQAPEPHRQVQPVGPSQH 365
            E     PQL G+VII LPPP NPSLGKTITA T ++N+    +Q     +Q  P+ P  H
Sbjct: 5    ESPSPPPQLTGVVIITLPPPNNPSLGKTITAYTLTDNSPQS-QQTRHRQQQEHPLPPQLH 63

Query: 366  PQTQENPGVQGRFSSKGFLFGSPIVFSCLLSLSLIALFSWVSSTQETLYELRDADNDRKS 545
            P        Q  FS      G P      L++S+ AL  + S    TL +   ++ND ++
Sbjct: 64   PPQNS----QFNFSLPMLFPGLPRKLFLFLAISIFALILYGSVFSYTLQDRYKSNNDDEN 119

Query: 546  N-SIIFPLYPKLGIGGNVPQDAEIKLGRFVDSNFKIVMDNLNDGTTG---GKFAKSMISE 713
              S +FPLY K GI     +DAE KLGRFVD + + V+ ++NDG       K  K ++S 
Sbjct: 120  KESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVSS 179

Query: 714  SKI--DSTAILTARGNIHPEGLFYTFMLVGNPPKPYFLDIDTGSDLTWIQCDAPCTSCAK 887
            + +  DS++I   RGNI+P+GL++T+M+VGNPP+PY+LD+DTGSDLTWIQCDAPC+SCAK
Sbjct: 180  NAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAK 239

Query: 888  GAHPFYKPKRGNLIHSKDSYCIEVQKS--PGTKCETCDQCDYEIEYADSSSSIGVLIKDK 1061
            GA+P YKP+ GN++  KDS C+E+Q++  PG  CETC QCDYEIEYAD SSS+GVL +D+
Sbjct: 240  GANPLYKPRMGNILPYKDSLCMEIQRNHKPGY-CETCQQCDYEIEYADHSSSMGVLARDE 298

Query: 1062 FQVMISNGSLVRPNLVFGCAYDQQGLLLKSLAKTDGILGLGRGKISLPFQLASHGNIKNV 1241
              + I NGSL +PN+VFGCAYDQQGLLL +L KTDGILGL R K+SLP QLAS G IKNV
Sbjct: 299  LHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNV 358

Query: 1242 VAHCLASDAAGGGYVFLGDDFVPHQQMVWVPMLNSPFMNSYHAEVTKISYGGKQFSLGSI 1421
            V HCL ++A GGGY+FLG D VP   M WVPML+SPFM  YH E+ KI+YG    +LG+ 
Sbjct: 359  VGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR 418

Query: 1422 GKGIGHVVFDSGSSYTYFTKQAYNDLVANLEDISAKGFVLDASDDTLPICWRAKFPVRSI 1601
               +G  +FD+GSSYTYFTKQAY++L+A+L+++S+ G VLDASD TLP+CWRAKFP+RSI
Sbjct: 419  NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSI 478

Query: 1602 AELRKFFKPLDFQFGSKWWIASQKLRIPPEGYLITSNKGNICLGILEGSHVNDGSSVILG 1781
             ++++FFK L   FGSKW I S K RI PEGYL+ S KGNICLGIL+GS V++GS++ILG
Sbjct: 479  VDVKQFFKTLTLHFGSKWQIVSTKFRISPEGYLVISKKGNICLGILDGSEVHNGSTIILG 538

Query: 1782 DISLRGQLFVYDNENQKIGWIRSDCRRPQTFDS 1880
            DISLRGQL VYDN N++IGW +S C  P  F S
Sbjct: 539  DISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKS 571


>gb|EXB56943.1| Aspartic proteinase Asp1 [Morus notabilis]
          Length = 569

 Score =  636 bits (1641), Expect = e-179
 Identities = 322/575 (56%), Positives = 409/575 (71%), Gaps = 11/575 (1%)
 Frame = +3

Query: 189  ENEESPQLKGIVIIQLPPPENPSLGKTITAITFSENTQSPIRQAPEPHRQVQPVGPSQHP 368
            E++  PQ+KG+VII LPPP+NPSLGKTITA T S ++ +   Q  +    +    P Q P
Sbjct: 2    ESDHPPQIKGVVIITLPPPDNPSLGKTITAFTLSNSSPTQTHQESQNQNNL----PIQSP 57

Query: 369  QTQENPGVQGRFSSKGFLFGSPIVFSCLLSLSL--IALFSWVSSTQETLYELRDADNDRK 542
            Q   NP +Q  F       G P     LL +S+  + LFS V  T   + E R +++D  
Sbjct: 58   Q---NPQLQFPFPRLRLFHGVPRRLFALLGISIFTLVLFSHVFPT--VVEEFRRSNDDEG 112

Query: 543  SNSIIFPLYPKLGIGGNVPQDAEIKLGRFVDSNFKIVMDNLNDGTTGGKFAKSMISESKI 722
              S IFPLY KLG+ G   +D E+KLGRFVD + +    +  D     K  K + S +K+
Sbjct: 113  PESFIFPLYSKLGVPGK--KDVELKLGRFVDFDKENAGVSFGDRVKTQKVNKLVSSTAKV 170

Query: 723  DSTAILTARGNIHPEGLFYTFMLVGNPPKPYFLDIDTGSDLTWIQCDAPCTSCAKGAHPF 902
            DS+AIL  RGN++P+GL+YT +LVGNPP+PY LD+DTGSDLTWIQCDAPCTSCAKGA+P 
Sbjct: 171  DSSAILPVRGNVYPDGLYYTQILVGNPPRPYHLDMDTGSDLTWIQCDAPCTSCAKGANPL 230

Query: 903  YKPKRGNLIHSKDSYCIEVQKS--PGTKCETCDQCDYEIEYADSSSSIGVLIKDKFQVMI 1076
            YKP +GN++ SKDS+C E++++  PG  C+TC QCDYEI+YAD SSS+GVL KD   +++
Sbjct: 231  YKPTKGNIVPSKDSFCTEIRRNQKPG-HCKTCQQCDYEIQYADRSSSLGVLAKDGLHLVM 289

Query: 1077 SNGSLVRPNLVFGCAYDQQGLLLKSLAKTDGILGLGRGKISLPFQLASHGNIKNVVAHCL 1256
             NGSL   N+VFGCAYDQQGLLL +LAKTDGILGL R K+SLP QLAS G IKNVV HCL
Sbjct: 290  ENGSLANVNVVFGCAYDQQGLLLNTLAKTDGILGLSRAKVSLPSQLASKGIIKNVVGHCL 349

Query: 1257 ASDAAGGGYVFLGDDFVPHQQMVWVPMLNSPFMNSYHAEVTKISYGGKQFSLGSIGKGIG 1436
             ++A GGGY+FLGDDFVPH  M W+PML SP M+ Y +E+  I+YG    +LG+      
Sbjct: 350  TTNAGGGGYMFLGDDFVPHWGMSWIPMLRSPSMDFYQSEIVSINYGSSALNLGAWSSKAR 409

Query: 1437 HVVFDSGSSYTYFTKQAYNDLVANLEDISAKGFVLDASDDTLPICWRAKFPV-------R 1595
             +VFDSGSSYTYF K+AY+ L+A+LE++S  G V D SD +LPICWRA+ P+       R
Sbjct: 410  QLVFDSGSSYTYFNKRAYSALLASLEEVSTTGLVRDRSDPSLPICWRAETPLNCIHMECR 469

Query: 1596 SIAELRKFFKPLDFQFGSKWWIASQKLRIPPEGYLITSNKGNICLGILEGSHVNDGSSVI 1775
            S+A++++FFK +  QFGSKWWI S +LRIPPEGYL  S+KGN+CLGIL+GS V+DG + I
Sbjct: 470  SVADVKRFFKTITLQFGSKWWIISTRLRIPPEGYLTISSKGNVCLGILDGSKVHDGYTTI 529

Query: 1776 LGDISLRGQLFVYDNENQKIGWIRSDCRRPQTFDS 1880
            LGDISLRG L VYDNENQKIGW  SDC +P+ FDS
Sbjct: 530  LGDISLRGHLVVYDNENQKIGWTNSDCVKPRRFDS 564


>ref|XP_006485774.1| PREDICTED: aspartic proteinase Asp1-like [Citrus sinensis]
          Length = 577

 Score =  631 bits (1627), Expect = e-178
 Identities = 316/581 (54%), Positives = 412/581 (70%), Gaps = 16/581 (2%)
 Frame = +3

Query: 186  EENEESPQLKGIVIIQLPPPENPSLGKTITAITFSENTQSPIRQAPEPHRQVQPVGPSQH 365
            E     PQL G+VII LPPP NPSLGKTITA T ++N+     Q+ + H Q Q     +H
Sbjct: 5    ESPSPPPQLTGVVIITLPPPNNPSLGKTITAYTLTDNSP----QSQQTHHQQQ----QEH 56

Query: 366  P-QTQENPGVQGRFSSKGFLFGSPIVFSCL-------LSLSLIALFSWVSSTQETL-YEL 518
            P   Q +P    +F+     F  P++F  L       L++S+ AL  + S    TL +  
Sbjct: 57   PLPAQLHPPQDSQFN-----FSLPMLFPVLPRKLFLFLAISIFALILYGSVFSYTLQHRY 111

Query: 519  RDADNDRKSNSIIFPLYPKLGIGGNVPQDAEIKLGRFVDSNFKIVMDNLNDGTTG---GK 689
            +  ++D    S +FPLY K GI   + +DAE KLGRFVD + + V+ ++NDG       K
Sbjct: 112  KSNNDDENKESFVFPLYHKFGIREVLQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSK 171

Query: 690  FAKSMISESKI--DSTAILTARGNIHPEGLFYTFMLVGNPPKPYFLDIDTGSDLTWIQCD 863
              K ++  + +  DS++    RGN++P+GL++T+M+VGNPP+PY+LD+DTGSDLTWIQCD
Sbjct: 172  INKKLVPSNAVAVDSSSTFPLRGNVYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCD 231

Query: 864  APCTSCAKGAHPFYKPKRGNLIHSKDSYCIEVQKS--PGTKCETCDQCDYEIEYADSSSS 1037
            APC+SCAKGA+P YKP+ GN++  KDS C+E+Q++  PG  CETC QCDYEIEYAD SSS
Sbjct: 232  APCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGY-CETCQQCDYEIEYADHSSS 290

Query: 1038 IGVLIKDKFQVMISNGSLVRPNLVFGCAYDQQGLLLKSLAKTDGILGLGRGKISLPFQLA 1217
            +GVL +D+  + I NGSL +PN+VFGCAYDQQGLLL +L KTDGILGL R K+SLP QLA
Sbjct: 291  MGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLA 350

Query: 1218 SHGNIKNVVAHCLASDAAGGGYVFLGDDFVPHQQMVWVPMLNSPFMNSYHAEVTKISYGG 1397
            S G IKNVV HCL ++A GGGY+FLG D VP   M WVPML+SPFM  YH E+ KI+YG 
Sbjct: 351  SQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGS 410

Query: 1398 KQFSLGSIGKGIGHVVFDSGSSYTYFTKQAYNDLVANLEDISAKGFVLDASDDTLPICWR 1577
               +LG+    +G  +FD+GSSYTYFTKQAY++L+A+L+++S+ G VLDASD TLP+CWR
Sbjct: 411  SPLNLGARNSRVGWALFDTGSSYTYFTKQAYSELIASLKEVSSNGLVLDASDPTLPVCWR 470

Query: 1578 AKFPVRSIAELRKFFKPLDFQFGSKWWIASQKLRIPPEGYLITSNKGNICLGILEGSHVN 1757
            AKFP+RSI +++++FK L   FGSKW I S K  I PEGYL+ S KGNICLGIL+GS V+
Sbjct: 471  AKFPIRSIVDVKQYFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVH 530

Query: 1758 DGSSVILGDISLRGQLFVYDNENQKIGWIRSDCRRPQTFDS 1880
            +GS++ILGDISLRGQL VYDN N++IGW +S C  P  F S
Sbjct: 531  NGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKS 571


>ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
          Length = 686

 Score =  630 bits (1625), Expect = e-178
 Identities = 321/577 (55%), Positives = 404/577 (70%), Gaps = 16/577 (2%)
 Frame = +3

Query: 198  ESPQLKGIVIIQLPPPENPSLGKTITAITFSENTQSPIRQAPEPHRQVQPVGPSQHPQTQ 377
            +SPQLKG+VII LPPP+NPSLGKTITA T S+    P+ +    H+Q+Q     +  + +
Sbjct: 122  QSPQLKGVVIITLPPPDNPSLGKTITAFTLSD---PPLDRPHHTHQQLQRQQHQEEEEEE 178

Query: 378  E---------------NPGVQGRFSSKGFLFGSPIVFSCLLSLSLIALFSWVSSTQETLY 512
            E               NP +Q  FS +    G+P +    L +SL     W  ++   L 
Sbjct: 179  EEEEEEPHQLPSPSPPNPALQ--FSVRKLSLGNPRILMGFLGVSLFVFLLWNFASSSPLV 236

Query: 513  ELRDADNDRKSNSIIFPLYPKLGIGGNVPQDAEIKLGRFVDSNFKIVMDNLNDGTTGGKF 692
            ELR  ++DR+  S I PLYPKLG       D E+KLG+FVD +       +ND   GG  
Sbjct: 237  ELRRKNDDREPTSFILPLYPKLG--SRSLGDLELKLGKFVDFH-------VNDMKPGG-I 286

Query: 693  AKSMISESKIDSTAILTARGNIHPEGLFYTFMLVGNPPKPYFLDIDTGSDLTWIQCDAPC 872
             K   S S  DS+ I   RG+++P GL++T + VG+PP+ YFLD+DTGSDLTWIQCDAPC
Sbjct: 287  NKLATSVSAFDSSTIFPVRGDVYPNGLYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPC 346

Query: 873  TSCAKGAHPFYKPKRGNLIHSKDSYCIEVQKSPGTK-CETCDQCDYEIEYADSSSSIGVL 1049
            TSCAKG +P YKPK+GNL+  KDS C+EVQ++  T  CETC+QCDYEIEYAD SSS+GVL
Sbjct: 347  TSCAKGPNPLYKPKKGNLVPLKDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVL 406

Query: 1050 IKDKFQVMISNGSLVRPNLVFGCAYDQQGLLLKSLAKTDGILGLGRGKISLPFQLASHGN 1229
              D   +M++NGSL +  ++FGCAYDQQGLLL SLAKTDGILGL + K+SLP QLAS   
Sbjct: 407  ASDDLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRI 466

Query: 1230 IKNVVAHCLASDAAGGGYVFLGDDFVPHQQMVWVPMLNSPFMNSYHAEVTKISYGGKQFS 1409
            I NV+ HCL SDA GGGY+FLGDDFVP+  M WVPMLNS   N YH+++ KIS+G +Q S
Sbjct: 467  INNVLGHCLTSDATGGGYMFLGDDFVPYWGMAWVPMLNSHSPN-YHSQIMKISHGSRQLS 525

Query: 1410 LGSIGKGIGHVVFDSGSSYTYFTKQAYNDLVANLEDISAKGFVLDASDDTLPICWRAKFP 1589
            LG        VVFD+GSSYTYF K+AY  LVA+L+D+S +G + D SD TLP+CWRAKFP
Sbjct: 526  LGRQDGRTERVVFDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTLPVCWRAKFP 585

Query: 1590 VRSIAELRKFFKPLDFQFGSKWWIASQKLRIPPEGYLITSNKGNICLGILEGSHVNDGSS 1769
            +RS+ ++++FF+PL  QF SKWWI S K RIPPEGYLI SNKGN+CLGIL+GS+V+DGS+
Sbjct: 586  IRSVIDVKQFFQPLTLQFRSKWWIVSTKFRIPPEGYLIISNKGNVCLGILDGSNVHDGST 645

Query: 1770 VILGDISLRGQLFVYDNENQKIGWIRSDCRRPQTFDS 1880
            +ILGDISLRG+L VYDN NQKIGW +S C +PQ   S
Sbjct: 646  IILGDISLRGKLVVYDNVNQKIGWAQSTCVKPQKIKS 682


>gb|EOY21001.1| Eukaryotic aspartyl protease family protein, putative isoform 1
            [Theobroma cacao]
          Length = 576

 Score =  617 bits (1591), Expect = e-174
 Identities = 318/580 (54%), Positives = 405/580 (69%), Gaps = 14/580 (2%)
 Frame = +3

Query: 183  MEENEESPQLKGIVIIQLPPPENPSLGKTITAIT-----FSENTQSPIRQAPEPHRQV-- 341
            M+ +E   Q+ G+VII LPP +NPSLGKTITA T     F ++ Q+  RQ  E  + +  
Sbjct: 1    MDSDERPQQVTGVVIITLPPSDNPSLGKTITAFTLTNDVFPQSHQTQQRQQQEEEQTLPT 60

Query: 342  -QPVGPSQHPQTQENPGVQGRFSSKGFLFGSPIVFSCLLSLSLIALFSWVSSTQETLYEL 518
             Q + P+  P + +NP  Q  FS  G    +P      L +SL AL  + S+   T  EL
Sbjct: 61   TQILTPA--PPSAQNP--QRGFSFLGLFSDNPRKLLGFLGISLFALLLYSSAFSNTFVEL 116

Query: 519  RDADND--RKSNSIIFPLYPKLGIGGNVPQDAEIKLGRFVDSNFKIVMDNLNDGTTGGKF 692
            R+++ND   K  S IFPLY KLG       D E+KLGRFVD + + ++ ++  G TG + 
Sbjct: 117  RNSNNDDDEKPQSFIFPLYHKLGA------DLELKLGRFVDVDKENLVASVEGGATGTQK 170

Query: 693  AKSMISESK--IDSTA-ILTARGNIHPEGLFYTFMLVGNPPKPYFLDIDTGSDLTWIQCD 863
               +++ +   IDS+  IL  RGN++P+GL++T+MLVGNP + YFLDIDTGSDLTWIQCD
Sbjct: 171  INKLVASNAAVIDSSGTILPVRGNVYPDGLYFTYMLVGNPQRRYFLDIDTGSDLTWIQCD 230

Query: 864  APCTSCAKGAHPFYKPKRGNLIHSKDSYCIEVQKSPGTK-CETCDQCDYEIEYADSSSSI 1040
            APC+SCAKGA+P YKP R N++ SKD  C EVQK+   + CETC QCDYEIEYAD SSS+
Sbjct: 231  APCSSCAKGANPLYKPTRVNIVASKDLMCTEVQKNQKPQNCETCQQCDYEIEYADRSSSL 290

Query: 1041 GVLIKDKFQVMISNGSLVRPNLVFGCAYDQQGLLLKSLAKTDGILGLGRGKISLPFQLAS 1220
            GVL +D+  ++ +NGS    ++VFGCAYDQQG+LL +L+KTDGILGL R K+SLP QLAS
Sbjct: 291  GVLARDELHLVTANGSTTNLDVVFGCAYDQQGILLNTLSKTDGILGLSRAKVSLPSQLAS 350

Query: 1221 HGNIKNVVAHCLASDAAGGGYVFLGDDFVPHQQMVWVPMLNSPFMNSYHAEVTKISYGGK 1400
             G I NVV HCLA+D    GY+FLGDDFVP+  M WVPML SP    YH ++ KI+YG  
Sbjct: 351  KGIINNVVGHCLATDVGASGYMFLGDDFVPNWGMSWVPMLGSPSTEFYHTQIVKINYGSS 410

Query: 1401 QFSLGSIGKGIGHVVFDSGSSYTYFTKQAYNDLVANLEDISAKGFVLDASDDTLPICWRA 1580
              SLG     IG VVFDSGSSYTYF KQAY +LVA+L ++S  GF+ D +D TLP+CW+A
Sbjct: 411  SLSLGRQHSSIGRVVFDSGSSYTYFMKQAYAELVASLSEVSEVGFIQDVADTTLPMCWQA 470

Query: 1581 KFPVRSIAELRKFFKPLDFQFGSKWWIASQKLRIPPEGYLITSNKGNICLGILEGSHVND 1760
             FP+R I ++++FFK L  QFGSKWWI S++  IPPEGYLI S KGN+CLGIL+GS V+D
Sbjct: 471  PFPIRFIKDVKQFFKTLTLQFGSKWWIISKRFHIPPEGYLIISKKGNVCLGILDGSKVHD 530

Query: 1761 GSSVILGDISLRGQLFVYDNENQKIGWIRSDCRRPQTFDS 1880
            GS++ILGDISLRGQL VYDNE  KIGW +SDC  P+ F S
Sbjct: 531  GSTIILGDISLRGQLVVYDNEKLKIGWTQSDCAHPRRFKS 570


>ref|XP_002511959.1| protein with unknown function [Ricinus communis]
            gi|223549139|gb|EEF50628.1| protein with unknown function
            [Ricinus communis]
          Length = 583

 Score =  612 bits (1579), Expect = e-172
 Identities = 315/583 (54%), Positives = 403/583 (69%), Gaps = 17/583 (2%)
 Frame = +3

Query: 183  MEENEESPQLKGIVIIQLPPPENPSLGKTITAITFSENT------QSPIRQAPEP----- 329
            ME +++S  +K +VII LPPP NPSLGKTITA T +++       QS      EP     
Sbjct: 1    MESDDQSSHVK-VVIISLPPPNNPSLGKTITAFTLTDDDHDATYPQSHQNHEQEPSIIQT 59

Query: 330  HRQVQ-PV-GPSQHPQTQENPGVQGRFSSKGFLFGSPIVFSCLLSLSLIALFSWVSSTQE 503
            HR+ Q PV  PS  PQ   NP +Q  FS  G  F +P     LL +SL A+  + S    
Sbjct: 60   HRESQLPVQSPSLPPQ---NPQIQ--FSFSGLYFSTPRKLLFLLCISLFAVIVYRSLFSN 114

Query: 504  TLYELR--DADNDRKSNSIIFPLYPKLGIGGNVPQDAEIKLGRFVDSNFKIVMDNLNDGT 677
            TL EL+  D DND K+ S IFPLY K GI      + E K  R V     +   N +D  
Sbjct: 115  TLLELKVSDDDNDEKTKSFIFPLYHKFGIREISQSNLEHKSIRSVYKESLVASVNDDDVI 174

Query: 678  TGGKFAKSMISESK-IDSTAILTARGNIHPEGLFYTFMLVGNPPKPYFLDIDTGSDLTWI 854
               +  K   S +  +DS+++   RGN++P+GL++T++LVGNPP+PY+LDIDT SDLTWI
Sbjct: 175  VPNRNYKLASSNAAAVDSSSVFPVRGNVYPDGLYFTYILVGNPPRPYYLDIDTASDLTWI 234

Query: 855  QCDAPCTSCAKGAHPFYKPKRGNLIHSKDSYCIEVQKSPGTK-CETCDQCDYEIEYADSS 1031
            QCDAPCTSCAKGA+  YKP+R N++  KDS C+E+ ++     CETC QCDYEIEYAD S
Sbjct: 235  QCDAPCTSCAKGANALYKPRRDNIVTPKDSLCVELHRNQKAGYCETCQQCDYEIEYADHS 294

Query: 1032 SSIGVLIKDKFQVMISNGSLVRPNLVFGCAYDQQGLLLKSLAKTDGILGLGRGKISLPFQ 1211
            SS+GVL +D+  + ++NGS       FGCAYDQQGLLL +L KTDGILGL + K+SLP Q
Sbjct: 295  SSMGVLARDELHLTMANGSSTNLKFNFGCAYDQQGLLLNTLVKTDGILGLSKAKVSLPSQ 354

Query: 1212 LASHGNIKNVVAHCLASDAAGGGYVFLGDDFVPHQQMVWVPMLNSPFMNSYHAEVTKISY 1391
            LA+ G I NVV HCLA+D  GGGY+FLGDDFVP   M WVPML+SP ++SY  ++ K++Y
Sbjct: 355  LANRGIINNVVGHCLANDVVGGGYMFLGDDFVPRWGMSWVPMLDSPSIDSYQTQIMKLNY 414

Query: 1392 GGKQFSLGSIGKGIGHVVFDSGSSYTYFTKQAYNDLVANLEDISAKGFVLDASDDTLPIC 1571
            G    SLG   + +  +VFDSGSSYTYFTK+AY++LVA+L+ +S +  + D SD TLP C
Sbjct: 415  GSGPLSLGGQERRVRRIVFDSGSSYTYFTKEAYSELVASLKQVSGEALIQDTSDPTLPFC 474

Query: 1572 WRAKFPVRSIAELRKFFKPLDFQFGSKWWIASQKLRIPPEGYLITSNKGNICLGILEGSH 1751
            WRAKFP+RS+ +++++FK L  QFGSKWWI S K RIPPEGYLI SNKGN+CLGIL+GS 
Sbjct: 475  WRAKFPIRSVIDVKQYFKTLTLQFGSKWWIISTKFRIPPEGYLIISNKGNVCLGILDGSD 534

Query: 1752 VNDGSSVILGDISLRGQLFVYDNENQKIGWIRSDCRRPQTFDS 1880
            V+DGSS+ILGDISLRGQL +YDN N KIGW +SDC +P+TF +
Sbjct: 535  VHDGSSIILGDISLRGQLIIYDNVNNKIGWTQSDCIKPKTFST 577


>ref|XP_002328687.1| predicted protein [Populus trichocarpa]
            gi|566206181|ref|XP_006374352.1| aspartyl protease family
            protein [Populus trichocarpa] gi|550322111|gb|ERP52149.1|
            aspartyl protease family protein [Populus trichocarpa]
          Length = 603

 Score =  605 bits (1560), Expect = e-170
 Identities = 314/601 (52%), Positives = 394/601 (65%), Gaps = 37/601 (6%)
 Frame = +3

Query: 189  ENEESPQLKGIVIIQLPPPENPSLGKTITAITFSENTQSPIRQAPEPHRQVQPVGPSQHP 368
            ++++SPQLKG+VII LPPP+NPSLGKTITA T + N      Q P+ H++ Q    S  P
Sbjct: 4    DDDQSPQLKGVVIISLPPPDNPSLGKTITAFTLTNNDYPQSHQTPQTHQEDQLPISSPPP 63

Query: 369  QTQENPGVQGRFSSKGFLFGSPIVFSCLLSLSLIALFSWVSSTQETLYELR---DADNDR 539
               +N  +Q  F S     G+P      + +SL AL  + S    T  EL+   + D+D+
Sbjct: 64   PPSQNSQLQ--FPSSRLFLGTPRKLLSFVFISLFALAIYSSLFTNTFQELKSNNNDDDDQ 121

Query: 540  KSNSIIFPLYPKLGIGGNVPQDAEIKLGRFV-DSNFKIVMDNLNDGTTGGKFAKSMISES 716
            K  S +FPLY KLGI      D E  L RFV   N    +D+LN      K A S  + +
Sbjct: 122  KPKSYVFPLYHKLGIREIPLNDLENHLRRFVYKENLVASVDHLNGPHKISKLASSNAAAA 181

Query: 717  KIDSTAILTARGNIHPEGLFYTFMLVGNPPKPYFLDIDTGSDLTWIQCDAPCTSCAKGAH 896
             +DS+AI   RGN++P+G          PP+PY+LD DTGSDLTWIQCDAPCTSCAKGA+
Sbjct: 182  -MDSSAIFPVRGNLYPDG----------PPQPYYLDFDTGSDLTWIQCDAPCTSCAKGAN 230

Query: 897  PFYKPKRGNLIHSKDSYCIEVQKSPGTK-CETCDQCDYEIEYADSSSSIGVLIKDKFQVM 1073
             +YKP+RGN++  KD  C+EVQ++     CETCDQCDYEIEYAD SSS+GVL  DK  +M
Sbjct: 231  AWYKPRRGNIVPPKDLLCMEVQRNQKAGYCETCDQCDYEIEYADHSSSMGVLATDKLLLM 290

Query: 1074 ISNGSLVRPNLVFGCAYDQQGLLLKSLAKTDGILGLGRGKISLPFQLASHGNIKNVVAHC 1253
            ++NGSL + N +FGCAYDQQGLLLK+L KTDGILGL R K+SLP QLAS G I NV+ HC
Sbjct: 291  VANGSLTKLNFIFGCAYDQQGLLLKTLVKTDGILGLSRAKVSLPSQLASQGIINNVIGHC 350

Query: 1254 LASDAAGGGYVFLGDDFVPHQQMVWVPMLNSPFMNSYHAEVTKISYGGKQFSLGSIGKGI 1433
            L +D  GGGY+FLGDDFVP   M WVPML+SP M  YH EV K++YG    SLG +   +
Sbjct: 351  LTTDLGGGGYMFLGDDFVPRWGMAWVPMLDSPSMEFYHTEVVKLNYGSSPLSLGGMESRV 410

Query: 1434 GHVVFDSGSSYTYFTKQAYNDLVANLEDISAKGFVLDASDDTLPICWRAKFPVRSI---- 1601
             H++FDSGSSYTYF K+AY++LVA+L ++S  G V   SD TLP+CWRA FP+R      
Sbjct: 411  KHILFDSGSSYTYFPKEAYSELVASLNEVSGAGLVQSTSDTTLPLCWRANFPIRKFIYRT 470

Query: 1602 ----------------------------AELRKFFKPLDFQFGSKWWIASQKLRIPPEGY 1697
                                         +++KFFK L FQFG+KW + S K RIPPEGY
Sbjct: 471  ELTRPIRRRRRRRRRRRRRRRRRRQHIKGDVKKFFKTLTFQFGTKWLVISTKFRIPPEGY 530

Query: 1698 LITSNKGNICLGILEGSHVNDGSSVILGDISLRGQLFVYDNENQKIGWIRSDCRRPQTFD 1877
            L+ S+KGN+CLGILEGS V+DGS++ILGDISLRGQL VYDN N+KIGW  SDC +P+  D
Sbjct: 531  LMMSDKGNVCLGILEGSKVHDGSTIILGDISLRGQLVVYDNVNKKIGWTPSDCAKPKRSD 590

Query: 1878 S 1880
            S
Sbjct: 591  S 591


>ref|XP_006583400.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 574

 Score =  578 bits (1490), Expect = e-162
 Identities = 297/572 (51%), Positives = 388/572 (67%), Gaps = 11/572 (1%)
 Frame = +3

Query: 189  ENEESPQLKGIVIIQLPPPENPSLGKTITAITFSENTQSPIRQAPEPHR-QVQPVGPS-Q 362
            E+++S Q+KG+VII LPPP+NPSLGKTITA  FS N   P +   +PH+ Q Q   P+ Q
Sbjct: 2    EDDQSTQIKGVVIISLPPPDNPSLGKTITAFAFSNNPSPPPQLFIQPHQHQSQQTHPNAQ 61

Query: 363  H---PQTQENPG-VQGRFSSKGFLFGSPIVFSCLLSLSLIALFSWVSSTQETLYELRDAD 530
            H   P  Q  P   Q  FS +     +P+         L ALF + S +  T  +LR   
Sbjct: 62   HNTDPPLQSYPSNPQLSFSFRRLFHSTPVKLFSFFGTLLFALFLYGSVSSTTTVDLRGRK 121

Query: 531  NDR---KSNSIIFPLYPKLGIGGNVPQDAEIKLGRFVDSNFKIVMDNLNDGTTGGKFAKS 701
            ND    K+ S +FPL+PK G+ G   +D +++LG+ V     +   ++ DG+  G  A  
Sbjct: 122  NDGDDDKATSFLFPLFPKFGVLGQ--KDLKLQLGKLVQKEKFLTQRDVGDGS--GVVA-- 175

Query: 702  MISESKIDSTAILTARGNIHPEGLFYTFMLVGNPPKPYFLDIDTGSDLTWIQCDAPCTSC 881
                  +DS+++    GN++P+GL++T + VGNPPK YFLD+DTGSDLTW+QCDAPC SC
Sbjct: 176  ------VDSSSVFPVSGNVYPDGLYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCRSC 229

Query: 882  AKGAHPFYKPKRGNLIHSKDSYCIEVQKSP--GTKCETCDQCDYEIEYADSSSSIGVLIK 1055
             KGAH  YKP R N++ S DS C++VQK+   G   E+  QCDYEI+YAD SSS+GVL++
Sbjct: 230  GKGAHVQYKPTRSNVVSSVDSLCLDVQKNQKNGHHDESLLQCDYEIQYADHSSSLGVLVR 289

Query: 1056 DKFQVMISNGSLVRPNLVFGCAYDQQGLLLKSLAKTDGILGLGRGKISLPFQLASHGNIK 1235
            D+  ++ +NGS  + N+VFGC YDQ+GL+L +LAKTDGI+GL R K+SLP+QLAS G IK
Sbjct: 290  DELHLVTTNGSKTKLNVVFGCGYDQEGLILNTLAKTDGIMGLSRAKVSLPYQLASKGLIK 349

Query: 1236 NVVAHCLASDAAGGGYVFLGDDFVPHQQMVWVPMLNSPFMNSYHAEVTKISYGGKQFSLG 1415
            NVV HCL++D AGGGY+FLGDDFVP+  M WVPM  +   + Y  E+  I+YG +Q    
Sbjct: 350  NVVGHCLSNDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLKFD 409

Query: 1416 SIGKGIGHVVFDSGSSYTYFTKQAYNDLVANLEDISAKGFVLDASDDTLPICWRAKFPVR 1595
               K +G V FDSGSSYTYF K+AY DLVA+L ++S  G V D SD TLPICW+A F +R
Sbjct: 410  GQSK-VGKVFFDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLPICWQANFQIR 468

Query: 1596 SIAELRKFFKPLDFQFGSKWWIASQKLRIPPEGYLITSNKGNICLGILEGSHVNDGSSVI 1775
            SI +++ +FK L  +FGSKWWI S   +IPPEGYLI SNKG++CLGIL+GS VNDGSS+I
Sbjct: 469  SIKDVKDYFKTLTLRFGSKWWILSTLFQIPPEGYLIISNKGHVCLGILDGSKVNDGSSII 528

Query: 1776 LGDISLRGQLFVYDNENQKIGWIRSDCRRPQT 1871
            LGDISLRG   VYDN  QKIGW R+DC  P +
Sbjct: 529  LGDISLRGYSVVYDNVKQKIGWKRADCGMPSS 560


>emb|CBI15437.3| unnamed protein product [Vitis vinifera]
          Length = 473

 Score =  576 bits (1484), Expect = e-161
 Identities = 284/477 (59%), Positives = 352/477 (73%), Gaps = 1/477 (0%)
 Frame = +3

Query: 453  LSLSLIALFSWVSSTQETLYELRDADNDRKSNSIIFPLYPKLGIGGNVPQDAEIKLGRFV 632
            L +SL     W  ++   L ELR  ++DR+  S I PLYPKLG       D E+KLG+FV
Sbjct: 4    LGVSLFVFLLWNFASSSPLVELRRKNDDREPTSFILPLYPKLG--SRSLGDLELKLGKFV 61

Query: 633  DSNFKIVMDNLNDGTTGGKFAKSMISESKIDSTAILTARGNIHPEGLFYTFMLVGNPPKP 812
            D +       +ND   GG   K   S S  DS+ I   RG+++P GL++T + VG+PP+ 
Sbjct: 62   DFH-------VNDMKPGG-INKLATSVSAFDSSTIFPVRGDVYPNGLYFTHIFVGSPPRR 113

Query: 813  YFLDIDTGSDLTWIQCDAPCTSCAKGAHPFYKPKRGNLIHSKDSYCIEVQKSPGTK-CET 989
            YFLD+DTGSDLTWIQCDAPCTSCAKG +P YKPK+GNL+  KDS C+EVQ++  T  CET
Sbjct: 114  YFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNLVPLKDSLCVEVQRNLKTGYCET 173

Query: 990  CDQCDYEIEYADSSSSIGVLIKDKFQVMISNGSLVRPNLVFGCAYDQQGLLLKSLAKTDG 1169
            C+QCDYEIEYAD SSS+GVL  D   +M++NGSL +  ++FGCAYDQQGLLL SLAKTDG
Sbjct: 174  CEQCDYEIEYADHSSSMGVLASDDLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLAKTDG 233

Query: 1170 ILGLGRGKISLPFQLASHGNIKNVVAHCLASDAAGGGYVFLGDDFVPHQQMVWVPMLNSP 1349
            ILGL + K+SLP QLAS   I NV+ HCL SDA GGGY+FLGDDFVP+  M WVPMLNS 
Sbjct: 234  ILGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFLGDDFVPYWGMAWVPMLNSH 293

Query: 1350 FMNSYHAEVTKISYGGKQFSLGSIGKGIGHVVFDSGSSYTYFTKQAYNDLVANLEDISAK 1529
              N YH+++ KIS+G +Q SLG        VVFD+GSSYTYF K+AY  LVA+L+D+S +
Sbjct: 294  SPN-YHSQIMKISHGSRQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALVASLKDVSDE 352

Query: 1530 GFVLDASDDTLPICWRAKFPVRSIAELRKFFKPLDFQFGSKWWIASQKLRIPPEGYLITS 1709
            G + D SD TLP+CWRAKFP+RS+ ++++FF+PL  QF SKWWI S K RIPPEGYLI S
Sbjct: 353  GLIQDGSDPTLPVCWRAKFPIRSVIDVKQFFQPLTLQFRSKWWIVSTKFRIPPEGYLIIS 412

Query: 1710 NKGNICLGILEGSHVNDGSSVILGDISLRGQLFVYDNENQKIGWIRSDCRRPQTFDS 1880
            NKGN+CLGIL+GS+V+DGS++ILGDISLRG+L VYDN NQKIGW +S C +PQ   S
Sbjct: 413  NKGNVCLGILDGSNVHDGSTIILGDISLRGKLVVYDNVNQKIGWAQSTCVKPQKIKS 469


>ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
            gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic
            proteinase Asp1-like [Cucumis sativus]
          Length = 570

 Score =  574 bits (1479), Expect = e-161
 Identities = 287/570 (50%), Positives = 393/570 (68%), Gaps = 13/570 (2%)
 Frame = +3

Query: 198  ESPQLKGIVIIQLPPPENPSLGKTITAITFSENTQSP----------IRQAPEPHRQVQP 347
            +S ++KG+V+I LPPP+NPSLGK++TA T +++   P          ++Q    H  + P
Sbjct: 2    DSDKIKGVVVITLPPPDNPSLGKSVTAFTLTDDFPEPPGESVAVDQEVQQPNNDHLTLPP 61

Query: 348  VGPSQHPQTQENPGVQGRFSSKGFLFGSPIVFSCLLSLSLIALFSWVSSTQETLYELR-- 521
              P Q P +Q +  +     S+    G+P     +L ++L A++ + S+  ET+ ELR  
Sbjct: 62   NLPIQAPLSQRSIPL-----SRELFAGTPRKLVFVLGIALAAVYLYASNFPETIRELRRS 116

Query: 522  DADNDRKSNSIIFPLYPKLGIGGNVPQDAEIKLGRFVDSNFKIVMDNLNDGTTGGKFAKS 701
            + ++D + +S +FPLY +  +G +   D ++KLGR V  N   +    ND     K +K 
Sbjct: 117  ERNDDDRPSSFLFPLYFQSELGDS--SDFQLKLGRTVRVNKDDLGVRFNDVLGVPKPSKL 174

Query: 702  MISESKIDSTAILTARGNIHPEGLFYTFMLVGNPPKPYFLDIDTGSDLTWIQCDAPCTSC 881
            + +  K DS+A+   RG+I+P+GL+YT+++VG PP+PYFLDIDTGSDLTW+QCDAPC+SC
Sbjct: 175  ISASLKSDSSAVFPVRGDIYPDGLYYTYIMVGEPPRPYFLDIDTGSDLTWVQCDAPCSSC 234

Query: 882  AKGAHPFYKPKRGNLIHSKDSYCIEVQKS-PGTKCETCDQCDYEIEYADSSSSIGVLIKD 1058
             KG  P YKP+R N++  KDS C+EVQ++  G +C  C QC+YE++YAD SSS+GVL+KD
Sbjct: 235  GKGRSPLYKPRRENVVSFKDSLCMEVQRNYDGDQCAACQQCNYEVQYADQSSSLGVLVKD 294

Query: 1059 KFQVMISNGSLVRPNLVFGCAYDQQGLLLKSLAKTDGILGLGRGKISLPFQLASHGNIKN 1238
            +F +  SNGSL + N +FGCAYDQQGLLL +L+KTDGILGL R K+SLP QLAS G I N
Sbjct: 295  EFTLRFSNGSLTKLNAIFGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQLASRGIINN 354

Query: 1239 VVAHCLASDAAGGGYVFLGDDFVPHQQMVWVPMLNSPFMNSYHAEVTKISYGGKQFSLGS 1418
            VV HCL  D AGGGY+FLGDDFVP   M WV ML+SP ++ Y  +V +I YG    SL +
Sbjct: 355  VVGHCLTGDPAGGGYLFLGDDFVPQWGMAWVAMLDSPSIDFYQTKVVRIDYGSIPLSLDT 414

Query: 1419 IGKGIGHVVFDSGSSYTYFTKQAYNDLVANLEDISAKGFVLDASDDTLPICWRAKFPVRS 1598
             G     VVFDSGSSYTYFTK+AY  LVANLE++SA G +L  S DT  ICW+ +  +RS
Sbjct: 415  WGSSREQVVFDSGSSYTYFTKEAYYQLVANLEEVSAFGLILQDSSDT--ICWKTEQSIRS 472

Query: 1599 IAELRKFFKPLDFQFGSKWWIASQKLRIPPEGYLITSNKGNICLGILEGSHVNDGSSVIL 1778
            + +++ FFKPL  QFGS++W+ S KL I PE YL+ + +GN+CLGIL+GS V+DGS++IL
Sbjct: 473  VKDVKHFFKPLTLQFGSRFWLVSTKLVILPENYLLINKEGNVCLGILDGSQVHDGSTIIL 532

Query: 1779 GDISLRGQLFVYDNENQKIGWIRSDCRRPQ 1868
            GD +LRG+L VYDN NQ+IGW  SDC  P+
Sbjct: 533  GDNALRGKLVVYDNVNQRIGWTSSDCHNPR 562


>gb|ESW24775.1| hypothetical protein PHAVU_004G159200g [Phaseolus vulgaris]
          Length = 572

 Score =  563 bits (1450), Expect = e-157
 Identities = 280/565 (49%), Positives = 375/565 (66%), Gaps = 9/565 (1%)
 Frame = +3

Query: 189  ENEESPQLKGIVIIQLPPPENPSLGKTITAITFSENTQSP----IRQAPEPHRQVQPVGP 356
            ++++ PQ+KG+VII LPPP+NPSLGKTITA TFS+ +       ++Q+ +    +     
Sbjct: 2    DDDQFPQIKGVVIISLPPPDNPSLGKTITAFTFSDPSSPQPSLLLQQSHQHQTNINEYNN 61

Query: 357  SQHPQTQENPGVQGRFSSKGFLFGSPIVFSCLLSLSLIALFSWVSSTQETLYEL----RD 524
            +  P        Q  FS +     +P+ F     + L ALF + S +  T  EL     D
Sbjct: 62   TDPPLHSYPSNAQLGFSRRRLFHRTPVRFFSFFGVFLFALFLYGSVSSTTTLELSGPKND 121

Query: 525  ADNDRKSNSIIFPLYPKLGIGGNVPQDAEIKLGRFVDSNFKIVMDNLNDGTTGGKFAKSM 704
             D+D K  S +FPLYPK G+ G   ++ +++LG+ V     +       G+         
Sbjct: 122  GDDDGKPGSYLFPLYPKFGVLGQ--KNMKLQLGKLVHKEKLLTQRKYRVGS--------- 170

Query: 705  ISESKIDSTAILTARGNIHPEGLFYTFMLVGNPPKPYFLDIDTGSDLTWIQCDAPCTSCA 884
                 +DS+++    GN+ P+GL++T + VGNPP+ YFLD+DTGSDLTW+QCDAPC SC 
Sbjct: 171  -EVVAVDSSSVFPVSGNVFPDGLYFTILRVGNPPRSYFLDVDTGSDLTWMQCDAPCISCG 229

Query: 885  KGAHPFYKPKRGNLIHSKDSYCIEVQKSPGT-KCETCDQCDYEIEYADSSSSIGVLIKDK 1061
            KGAH  YKP R N++ S DS C++VQK+      E+  QCDY+IEYAD SSS+GVLI+D+
Sbjct: 230  KGAHAQYKPTRSNVVPSMDSLCLDVQKNQKDGHHESLQQCDYQIEYADQSSSLGVLIRDE 289

Query: 1062 FQVMISNGSLVRPNLVFGCAYDQQGLLLKSLAKTDGILGLGRGKISLPFQLASHGNIKNV 1241
              ++ +NGS  + N VFGC YDQ+GLLL +LAKTDGILGL R K+SLP+QLAS G IKNV
Sbjct: 290  LHLVTTNGSKTKLNFVFGCGYDQEGLLLNTLAKTDGILGLSRAKVSLPYQLASKGLIKNV 349

Query: 1242 VAHCLASDAAGGGYVFLGDDFVPHQQMVWVPMLNSPFMNSYHAEVTKISYGGKQFSLGSI 1421
            V HCL++D  GGGY+FLGDDF+P+  M WVPM  +   + Y  E+  I+YG +Q S    
Sbjct: 350  VGHCLSNDEVGGGYMFLGDDFLPYWGMTWVPMAYTLTTDLYQTEILGINYGNRQLSFDGQ 409

Query: 1422 GKGIGHVVFDSGSSYTYFTKQAYNDLVANLEDISAKGFVLDASDDTLPICWRAKFPVRSI 1601
             K +G VVFDSGSSYTYF K+AY DLVA+L ++S    + D SD TLPICW A FP++S+
Sbjct: 410  SK-VGKVVFDSGSSYTYFPKEAYLDLVASLNEVSGLRLIQDDSDTTLPICWEANFPIKSV 468

Query: 1602 AELRKFFKPLDFQFGSKWWIASQKLRIPPEGYLITSNKGNICLGILEGSHVNDGSSVILG 1781
             +++ +FK +  +FGSKWWI S   +I PEGYLI SNKG++CLGIL+GS+VNDGSS+ILG
Sbjct: 469  KDVKDYFKTITLRFGSKWWILSTMFQIAPEGYLIISNKGHVCLGILDGSNVNDGSSIILG 528

Query: 1782 DISLRGQLFVYDNENQKIGWIRSDC 1856
            DIS RG L VYDN  QKIGW R++C
Sbjct: 529  DISFRGYLVVYDNSKQKIGWKRAEC 553


>ref|XP_006826817.1| hypothetical protein AMTR_s00010p00056950 [Amborella trichopoda]
            gi|548831246|gb|ERM94054.1| hypothetical protein
            AMTR_s00010p00056950 [Amborella trichopoda]
          Length = 545

 Score =  548 bits (1413), Expect = e-153
 Identities = 282/557 (50%), Positives = 374/557 (67%), Gaps = 3/557 (0%)
 Frame = +3

Query: 198  ESPQLKGIVIIQLPPPENPSLGKTITAITFSENTQSPIRQAPEPHRQVQPVGPSQHPQTQ 377
            E P+++G VII LPPP++PS GKTITA T       P  Q     +  Q    +Q PQ  
Sbjct: 2    EQPEIQGFVIISLPPPDDPSKGKTITAFTM---VSDPSHQNENQSQNQQ----TQQPQIA 54

Query: 378  ENPGVQGRFSSKGFLFGSPIVFSCLLSLSLIALF--SWVSSTQETLYELRDADNDRKSNS 551
             N  + G  SS+G +    +    +L   +  LF   WVS   E  YE    +  + + S
Sbjct: 55   SN-SIAG--SSRGRIGSIVVRVLAMLGAVVAVLFFWQWVSGFSEMDYE---TERSKNNPS 108

Query: 552  IIFPLYPKLGIGGNVPQDAEIKLGRFVDSNFKIVMDNLNDGTTGGKFAKSMISESKIDST 731
             ++ LYPK      + +DA ++LG FV  +         +   G +  K++ + S I+S+
Sbjct: 109  FLYNLYPKWSEEA-IEKDAALRLGTFVKRD---------EVRIGLRDVKTLEAISSINSS 158

Query: 732  AILTARGNIHPEGLFYTFMLVGNPPKPYFLDIDTGSDLTWIQCDAPCTSCAKGAHPFYKP 911
             I   +GN++P+GL+Y  +LVGNP +PY+LD+DTGSDLTWIQC+APCT+CAKG HP Y P
Sbjct: 159  TIFPVKGNVYPDGLYYISILVGNPRRPYYLDMDTGSDLTWIQCNAPCTNCAKGPHPLYNP 218

Query: 912  KRGNLIHSKDSYCIEVQ-KSPGTKCETCDQCDYEIEYADSSSSIGVLIKDKFQVMISNGS 1088
             + NL+ SKD +C+EVQ    G       QCDY+IEYAD SSS+GVL++D  Q+MI+NG+
Sbjct: 219  SKQNLVPSKDPFCLEVQVNDKGKFAGASHQCDYDIEYADQSSSMGVLVRDDLQLMITNGT 278

Query: 1089 LVRPNLVFGCAYDQQGLLLKSLAKTDGILGLGRGKISLPFQLASHGNIKNVVAHCLASDA 1268
            +++  LVFGCAYDQ+G L  S AKTDGILGL   K+SLP QLAS G +KNVV HC+ +DA
Sbjct: 279  VIKTGLVFGCAYDQRGKLGHSPAKTDGILGLSSAKVSLPSQLASRGLMKNVVGHCIRNDA 338

Query: 1269 AGGGYVFLGDDFVPHQQMVWVPMLNSPFMNSYHAEVTKISYGGKQFSLGSIGKGIGHVVF 1448
             GGGY+FLGDDF+P  +M WVPML+SP  N+YHAEV+KIS G +    G +   IG VVF
Sbjct: 339  NGGGYMFLGDDFIPQWRMTWVPMLSSPSTNAYHAEVSKISLGSRPIDGGGLITKIGRVVF 398

Query: 1449 DSGSSYTYFTKQAYNDLVANLEDISAKGFVLDASDDTLPICWRAKFPVRSIAELRKFFKP 1628
            DSGSSY+Y TKQAY  L+ +L+D++ KG VLD SD TLP+CW+AK P+RSI ++ +FFKP
Sbjct: 399  DSGSSYSYLTKQAYTSLIKSLKDVAEKGLVLDDSDKTLPVCWKAKSPLRSIKDVNQFFKP 458

Query: 1629 LDFQFGSKWWIASQKLRIPPEGYLITSNKGNICLGILEGSHVNDGSSVILGDISLRGQLF 1808
            L   FGS+    S+   IPPEGYLI S KGN CLGILEGSH++DG++ ILGDISLR +L 
Sbjct: 459  LVLNFGSRLLFGSKNFEIPPEGYLIISAKGNACLGILEGSHIHDGATNILGDISLRAKLV 518

Query: 1809 VYDNENQKIGWIRSDCR 1859
            VYDN  ++IGW++SDC+
Sbjct: 519  VYDNVKRRIGWVQSDCQ 535


>ref|XP_006588295.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
            [Glycine max]
          Length = 1430

 Score =  545 bits (1405), Expect = e-152
 Identities = 282/547 (51%), Positives = 374/547 (68%), Gaps = 12/547 (2%)
 Frame = +3

Query: 189  ENEESPQLKGIVIIQLPPPENPSLGKTITAITFSENTQSPIRQAPEPHR-QVQPVGPS-Q 362
            E++ESPQ+KG+VII LPPP+NPSLGKTITA TFS  +  P  Q   PH+ Q QP  P+ Q
Sbjct: 2    EDDESPQIKGVVIISLPPPDNPSLGKTITAFTFSNPSPQPSIQ---PHQHQSQPTHPNAQ 58

Query: 363  H---PQTQENPG-VQGRFSSKGFLFGSPIVFSCLLSLSLIALFSWVSSTQETLYELR--- 521
            H   P  Q  P   Q  FS +     +P+       + L ALF + S +  T  ELR   
Sbjct: 59   HNTDPPLQSYPSNPQLSFSFRRLFHSTPVKLFSFFGILLFALFLYGSVSSTTTVELRGRN 118

Query: 522  -DADNDRKSNSIIFPLYPKLGIGGNVPQDAEIKLGRFVDSNFKIVMDNLNDGTTGGKFAK 698
             D D+D K+ S +FPL+PK G+ G   +D +++LG+   S  +  + + +DG   G  A 
Sbjct: 119  NDDDDDDKATSFLFPLFPKFGVLGQ--KDLKLQLGKL--SQKEKFLTHRDDGDGSGVVA- 173

Query: 699  SMISESKIDSTAILTARGNIHPEGLFYTFMLVGNPPKPYFLDIDTGSDLTWIQCDAPCTS 878
                   +DS+++    GN++P+GL++T + VGNPPK YFLD+DTGSDLTW+QCDAPC S
Sbjct: 174  -------VDSSSVFPVSGNVYPDGLYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCIS 226

Query: 879  CAKGAHPFYKPKRGNLIHSKDSYCIEVQKSP--GTKCETCDQCDYEIEYADSSSSIGVLI 1052
            C KGAH  YKP R N++ S D+ C++VQK+   G   E+  QCDYEI+YAD SSS+GVL+
Sbjct: 227  CGKGAHVLYKPTRSNVVSSVDALCLDVQKNQKNGHHDESLLQCDYEIQYADHSSSLGVLV 286

Query: 1053 KDKFQVMISNGSLVRPNLVFGCAYDQQGLLLKSLAKTDGILGLGRGKISLPFQLASHGNI 1232
            +D+  ++ +NGS  + N+VFGC YDQ GLLL +L KTDGI+GL R K+SLP+QLAS G I
Sbjct: 287  RDELHLVTTNGSKTKLNVVFGCGYDQAGLLLNTLGKTDGIMGLSRAKVSLPYQLASKGLI 346

Query: 1233 KNVVAHCLASDAAGGGYVFLGDDFVPHQQMVWVPMLNSPFMNSYHAEVTKISYGGKQFSL 1412
            KNVV HCL++D AGGGY+FLGDDFVP+  M WVPM  +   + Y  E+  I+YG +Q   
Sbjct: 347  KNVVGHCLSNDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLRF 406

Query: 1413 GSIGKGIGHVVFDSGSSYTYFTKQAYNDLVANLEDISAKGFVLDASDDTLPICWRAKFPV 1592
                K +G +VFDSGSSYTYF K+AY DLVA+L ++S  G V D SD TLPICW+A FP+
Sbjct: 407  DGQSK-VGKMVFDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLPICWQANFPI 465

Query: 1593 RSIAELRKFFKPLDFQFGSKWWIASQKLRIPPEGYLITSNKGNICLGILEGSHVNDGSSV 1772
            +S+ +++ +FK L  +FGSKWWI S   +I PEGYLI SNKG++CLGIL+GS+VNDGSS+
Sbjct: 466  KSVKDVKDYFKTLTLRFGSKWWILSTLFQISPEGYLIISNKGHVCLGILDGSNVNDGSSI 525

Query: 1773 ILGDISL 1793
            ILG  +L
Sbjct: 526  ILGGSTL 532


>ref|XP_004512995.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
            [Cicer arietinum]
          Length = 1387

 Score =  545 bits (1403), Expect = e-152
 Identities = 293/574 (51%), Positives = 374/574 (65%), Gaps = 14/574 (2%)
 Frame = +3

Query: 186  EENEESPQLKGIVIIQLPPPENPSLGKTITAITFSENTQSPIRQAPEPHRQVQPVGPSQH 365
            E   ++PQLK +VII +PP  NPSLGK ITA TFS N  SP +Q   P   V P+ P Q 
Sbjct: 6    ESQSQTPQLKSVVIISIPPSNNPSLGKKITAFTFSNNPFSPQQQ---PQNNVPPMSPIQS 62

Query: 366  PQTQENPGVQGRFSS-KGFLFGSPIVFSCLLSLSLIALFSWVS--STQETLYELRDADND 536
              +      Q +FSS + F   + I F     + L ALF + S  ST  T  EL +  N 
Sbjct: 63   YPSNH----QLQFSSTRRFFHTTQIKFFTFFGIFLFALFLYGSLFSTITTTLELSELKNH 118

Query: 537  R---------KSNSIIFPLYPKLGIGGNVPQDAEIKLGRFVDSNFKIVMDNLNDGTTGGK 689
                      + +S +FPL+ K G+ G      ++KL      NF       +DG     
Sbjct: 119  HHDGGDDESDEPSSFLFPLFKKYGVVGQ----RDLKLIDVKKGNFVTQKSGDSDGIA--- 171

Query: 690  FAKSMISESKIDSTAILTARGNIHPEGLFYTFMLVGNPPKPYFLDIDTGSDLTWIQCDAP 869
            F+  +++     ST +    GN++P+GL+YT + VGNPPK YF+D+DTGSDLTWIQCDAP
Sbjct: 172  FSSRVVAVDSSSST-VFPISGNVYPDGLYYTHVRVGNPPKRYFVDVDTGSDLTWIQCDAP 230

Query: 870  CTSCAKGAHPFYKPKRGNLIHSKDSYCIEVQKSPGTKC-ETCDQCDYEIEYADSSSSIGV 1046
            C SCAKGA+  YKP R N++ S DS C+EVQK+      E+  QCDYEI+YAD SSS+GV
Sbjct: 231  CRSCAKGANVPYKPIRTNIVPSLDSLCLEVQKNQKNGYHESFQQCDYEIQYADHSSSMGV 290

Query: 1047 LIKDKFQVMISNGSLVRPNLVFGCAYDQQGLLLKSLAKTDGILGLGRGKISLPFQLASHG 1226
            LI+D+  +M +NGS  + N VFGC YDQ+GLLL +L KTDGI+GL R K+ LP+QL+S G
Sbjct: 291  LIRDELHLMTTNGSKTKLNFVFGCGYDQEGLLLNTLTKTDGIMGLSRAKVGLPYQLSSKG 350

Query: 1227 NIKNVVAHCLAS-DAAGGGYVFLGDDFVPHQQMVWVPMLNSPFMNSYHAEVTKISYGGKQ 1403
             IKNVV HCL++ D  GGGY+FLGDDFVP+  M W PM  +   + Y  EV  I+YG + 
Sbjct: 351  IIKNVVGHCLSNNDGVGGGYMFLGDDFVPYWGMTWAPM--TQITDLYQTEVLGINYGNRL 408

Query: 1404 FSLGSIGKGIGHVVFDSGSSYTYFTKQAYNDLVANLEDISAKGFVLDASDDTLPICWRAK 1583
             S     K +G+VVFDSGSSYTYF K+AY DLVA+LE++S  G V D SD TLPICW+A 
Sbjct: 409  LSFDGHSK-VGNVVFDSGSSYTYFPKEAYRDLVASLEEVSGLGLVEDDSDTTLPICWQAN 467

Query: 1584 FPVRSIAELRKFFKPLDFQFGSKWWIASQKLRIPPEGYLITSNKGNICLGILEGSHVNDG 1763
            FP+RS+ +++ +FK L  +FG+KWWI S    IPPEGYLI SNKGN+CL IL+GS+VNDG
Sbjct: 468  FPIRSVKDVKDYFKTLTLRFGNKWWILSTLFHIPPEGYLIISNKGNVCLAILDGSNVNDG 527

Query: 1764 SSVILGDISLRGQLFVYDNENQKIGWIRSDCRRP 1865
            SS+ILGDISLRG L VYDN N+ IGW R+ C  P
Sbjct: 528  SSIILGDISLRGYLVVYDNVNKNIGWERTKCGMP 561


>ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297337316|gb|EFH67733.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 578

 Score =  535 bits (1377), Expect = e-149
 Identities = 284/579 (49%), Positives = 375/579 (64%), Gaps = 19/579 (3%)
 Frame = +3

Query: 198  ESPQLKGIVIIQLPPPENPSLGKTITAITFSENT---QSPIRQAPEPHRQVQPVGPSQHP 368
            E  +L  +VII LPP ++PS GKTI+A T +++    Q P    P P  Q  P+  +Q  
Sbjct: 7    EQQRLHSVVIITLPPSDDPSQGKTISAFTLNDHDYPLQIPPEDNPNPSFQPDPLHQNQ-- 64

Query: 369  QTQENPGVQGRFSSKGFLFGSPIVFSCLLSLSLIALFSWVSSTQETLYELR-------DA 527
                    Q R        GSP +   LL  SL+A+  + S    ++   R       D 
Sbjct: 65   --------QSRLLFSDLSMGSPRLVLGLLGFSLLAVAFYASVFPNSVQMFRVSDERNRDD 116

Query: 528  DNDRKSNSIIFPLYPKLGIGGN----VPQDAEIKLGRFVDSNFKIVMDNLNDGTTGGKFA 695
            D+ R++ S +FP+Y KL         + +D  ++ G+FV+S    +++ +         A
Sbjct: 117  DSSRETTSFVFPVYHKLRAREFHERILAEDLGLENGKFVESMDLELVNPVKVNDVLSTSA 176

Query: 696  KSMISESKIDSTAILTARGNIHPEGLFYTFMLVGNPP--KPYFLDIDTGSDLTWIQCDAP 869
             S+ S     ST I    GN++P+GL+YT +LVG P   + Y LDIDTGSDLTWIQCDAP
Sbjct: 177  GSIDS-----STTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAP 231

Query: 870  CTSCAKGAHPFYKPKRGNLIHSKDSYCIEVQKSPGTK-CETCDQCDYEIEYADSSSSIGV 1046
            CTSCAKGA+  YKP++ NL+ S + +C+EVQ++  T+ CE+C QCDYEIEYAD S S+GV
Sbjct: 232  CTSCAKGANQLYKPRKDNLVRSSEPFCVEVQRNQLTEHCESCHQCDYEIEYADHSYSMGV 291

Query: 1047 LIKDKFQVMISNGSLVRPNLVFGCAYDQQGLLLKSLAKTDGILGLGRGKISLPFQLASHG 1226
            L KDKF + + NGSL   ++VFGC YDQQGLLL +L KTDGILGL R KISLP QLAS G
Sbjct: 292  LTKDKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRG 351

Query: 1227 NIKNVVAHCLASDAAGGGYVFLGDDFVPHQQMVWVPMLNSPFMNSYHAEVTKISYGGKQF 1406
             I NVV HCLASD  G GY+F+G D VP   M WVPML+ P +  Y  +VTK+SYG    
Sbjct: 352  IISNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHHPHLEVYQMQVTKMSYGNAML 411

Query: 1407 SLGSIGKGIGHVVFDSGSSYTYFTKQAYNDLVANLEDISAKGFVLDASDDTLPICWRAK- 1583
            SL      +G V+FD+GSSYTYF  QAY+ LV +L+++S      D SD+ LPICWRAK 
Sbjct: 412  SLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSDLELTRDDSDEALPICWRAKT 471

Query: 1584 -FPVRSIAELRKFFKPLDFQFGSKWWIASQKLRIPPEGYLITSNKGNICLGILEGSHVND 1760
              P+ S+++++KFF+P+  Q GSKW I S+KL I PE YLI SNKGN+CLGIL+GS+V+D
Sbjct: 472  NSPISSLSDVKKFFRPITLQIGSKWLIISKKLLIQPEDYLIISNKGNVCLGILDGSNVHD 531

Query: 1761 GSSVILGDISLRGQLFVYDNENQKIGWIRSDCRRPQTFD 1877
            GS++I+GDIS+RG+L VYDN  Q+IGW++SDC RP  FD
Sbjct: 532  GSTIIIGDISMRGRLIVYDNVKQRIGWMKSDCVRPSEFD 570


>gb|EOY21002.1| Eukaryotic aspartyl protease family protein, putative isoform 2
            [Theobroma cacao]
          Length = 520

 Score =  532 bits (1371), Expect = e-148
 Identities = 278/523 (53%), Positives = 358/523 (68%), Gaps = 14/523 (2%)
 Frame = +3

Query: 183  MEENEESPQLKGIVIIQLPPPENPSLGKTITAIT-----FSENTQSPIRQAPEPHRQV-- 341
            M+ +E   Q+ G+VII LPP +NPSLGKTITA T     F ++ Q+  RQ  E  + +  
Sbjct: 1    MDSDERPQQVTGVVIITLPPSDNPSLGKTITAFTLTNDVFPQSHQTQQRQQQEEEQTLPT 60

Query: 342  -QPVGPSQHPQTQENPGVQGRFSSKGFLFGSPIVFSCLLSLSLIALFSWVSSTQETLYEL 518
             Q + P+  P + +NP  Q  FS  G    +P      L +SL AL  + S+   T  EL
Sbjct: 61   TQILTPA--PPSAQNP--QRGFSFLGLFSDNPRKLLGFLGISLFALLLYSSAFSNTFVEL 116

Query: 519  RDADND--RKSNSIIFPLYPKLGIGGNVPQDAEIKLGRFVDSNFKIVMDNLNDGTTGGKF 692
            R+++ND   K  S IFPLY KLG       D E+KLGRFVD + + ++ ++  G TG + 
Sbjct: 117  RNSNNDDDEKPQSFIFPLYHKLGA------DLELKLGRFVDVDKENLVASVEGGATGTQK 170

Query: 693  AKSMISESK--IDSTA-ILTARGNIHPEGLFYTFMLVGNPPKPYFLDIDTGSDLTWIQCD 863
               +++ +   IDS+  IL  RGN++P+GL++T+MLVGNP + YFLDIDTGSDLTWIQCD
Sbjct: 171  INKLVASNAAVIDSSGTILPVRGNVYPDGLYFTYMLVGNPQRRYFLDIDTGSDLTWIQCD 230

Query: 864  APCTSCAKGAHPFYKPKRGNLIHSKDSYCIEVQKSPGTK-CETCDQCDYEIEYADSSSSI 1040
            APC+SCAKGA+P YKP R N++ SKD  C EVQK+   + CETC QCDYEIEYAD SSS+
Sbjct: 231  APCSSCAKGANPLYKPTRVNIVASKDLMCTEVQKNQKPQNCETCQQCDYEIEYADRSSSL 290

Query: 1041 GVLIKDKFQVMISNGSLVRPNLVFGCAYDQQGLLLKSLAKTDGILGLGRGKISLPFQLAS 1220
            GVL +D+  ++ +NGS    ++VFGCAYDQQG+LL +L+KTDGILGL R K+SLP QLAS
Sbjct: 291  GVLARDELHLVTANGSTTNLDVVFGCAYDQQGILLNTLSKTDGILGLSRAKVSLPSQLAS 350

Query: 1221 HGNIKNVVAHCLASDAAGGGYVFLGDDFVPHQQMVWVPMLNSPFMNSYHAEVTKISYGGK 1400
             G I NVV HCLA+D    GY+FLGDDFVP+  M WVPML SP    YH ++ KI+YG  
Sbjct: 351  KGIINNVVGHCLATDVGASGYMFLGDDFVPNWGMSWVPMLGSPSTEFYHTQIVKINYGSS 410

Query: 1401 QFSLGSIGKGIGHVVFDSGSSYTYFTKQAYNDLVANLEDISAKGFVLDASDDTLPICWRA 1580
              SLG     IG VVFDSGSSYTYF KQAY +LVA+L ++S  GF+ D +D TLP+CW+A
Sbjct: 411  SLSLGRQHSSIGRVVFDSGSSYTYFMKQAYAELVASLSEVSEVGFIQDVADTTLPMCWQA 470

Query: 1581 KFPVRSIAELRKFFKPLDFQFGSKWWIASQKLRIPPEGYLITS 1709
             FP+R I ++++FFK L  QFGSKWWI S++  IPPEGYLI S
Sbjct: 471  PFPIRFIKDVKQFFKTLTLQFGSKWWIISKRFHIPPEGYLIIS 513


>gb|ESW24776.1| hypothetical protein PHAVU_004G159200g [Phaseolus vulgaris]
          Length = 579

 Score =  526 bits (1355), Expect = e-146
 Identities = 263/540 (48%), Positives = 356/540 (65%), Gaps = 9/540 (1%)
 Frame = +3

Query: 189  ENEESPQLKGIVIIQLPPPENPSLGKTITAITFSENTQSP----IRQAPEPHRQVQPVGP 356
            ++++ PQ+KG+VII LPPP+NPSLGKTITA TFS+ +       ++Q+ +    +     
Sbjct: 2    DDDQFPQIKGVVIISLPPPDNPSLGKTITAFTFSDPSSPQPSLLLQQSHQHQTNINEYNN 61

Query: 357  SQHPQTQENPGVQGRFSSKGFLFGSPIVFSCLLSLSLIALFSWVSSTQETLYEL----RD 524
            +  P        Q  FS +     +P+ F     + L ALF + S +  T  EL     D
Sbjct: 62   TDPPLHSYPSNAQLGFSRRRLFHRTPVRFFSFFGVFLFALFLYGSVSSTTTLELSGPKND 121

Query: 525  ADNDRKSNSIIFPLYPKLGIGGNVPQDAEIKLGRFVDSNFKIVMDNLNDGTTGGKFAKSM 704
             D+D K  S +FPLYPK G+ G   ++ +++LG+ V     +       G+         
Sbjct: 122  GDDDGKPGSYLFPLYPKFGVLGQ--KNMKLQLGKLVHKEKLLTQRKYRVGS--------- 170

Query: 705  ISESKIDSTAILTARGNIHPEGLFYTFMLVGNPPKPYFLDIDTGSDLTWIQCDAPCTSCA 884
                 +DS+++    GN+ P+GL++T + VGNPP+ YFLD+DTGSDLTW+QCDAPC SC 
Sbjct: 171  -EVVAVDSSSVFPVSGNVFPDGLYFTILRVGNPPRSYFLDVDTGSDLTWMQCDAPCISCG 229

Query: 885  KGAHPFYKPKRGNLIHSKDSYCIEVQKSPGT-KCETCDQCDYEIEYADSSSSIGVLIKDK 1061
            KGAH  YKP R N++ S DS C++VQK+      E+  QCDY+IEYAD SSS+GVLI+D+
Sbjct: 230  KGAHAQYKPTRSNVVPSMDSLCLDVQKNQKDGHHESLQQCDYQIEYADQSSSLGVLIRDE 289

Query: 1062 FQVMISNGSLVRPNLVFGCAYDQQGLLLKSLAKTDGILGLGRGKISLPFQLASHGNIKNV 1241
              ++ +NGS  + N VFGC YDQ+GLLL +LAKTDGILGL R K+SLP+QLAS G IKNV
Sbjct: 290  LHLVTTNGSKTKLNFVFGCGYDQEGLLLNTLAKTDGILGLSRAKVSLPYQLASKGLIKNV 349

Query: 1242 VAHCLASDAAGGGYVFLGDDFVPHQQMVWVPMLNSPFMNSYHAEVTKISYGGKQFSLGSI 1421
            V HCL++D  GGGY+FLGDDF+P+  M WVPM  +   + Y  E+  I+YG +Q S    
Sbjct: 350  VGHCLSNDEVGGGYMFLGDDFLPYWGMTWVPMAYTLTTDLYQTEILGINYGNRQLSFDGQ 409

Query: 1422 GKGIGHVVFDSGSSYTYFTKQAYNDLVANLEDISAKGFVLDASDDTLPICWRAKFPVRSI 1601
             K +G VVFDSGSSYTYF K+AY DLVA+L ++S    + D SD TLPICW A FP++S+
Sbjct: 410  SK-VGKVVFDSGSSYTYFPKEAYLDLVASLNEVSGLRLIQDDSDTTLPICWEANFPIKSV 468

Query: 1602 AELRKFFKPLDFQFGSKWWIASQKLRIPPEGYLITSNKGNICLGILEGSHVNDGSSVILG 1781
             +++ +FK +  +FGSKWWI S   +I PEGYLI SNKG++CLGIL+GS+VNDGSS+ILG
Sbjct: 469  KDVKDYFKTITLRFGSKWWILSTMFQIAPEGYLIISNKGHVCLGILDGSNVNDGSSIILG 528


>ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
            gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15
            [Arabidopsis thaliana]
            gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein
            [Arabidopsis thaliana] gi|14532748|gb|AAK64075.1| unknown
            protein [Arabidopsis thaliana]
            gi|332194267|gb|AEE32388.1| aspartyl protease
            [Arabidopsis thaliana]
          Length = 583

 Score =  525 bits (1351), Expect = e-146
 Identities = 281/582 (48%), Positives = 371/582 (63%), Gaps = 17/582 (2%)
 Frame = +3

Query: 183  MEENEESPQLKGIVIIQLPPPENPSLGKTITAITFSENT---QSPIRQAPEPHRQVQPVG 353
            + + ++  ++  +VII LPP ++PS GKTI+A T +++    + P    P P  Q  P+ 
Sbjct: 5    LHDQQQQQRVHSVVIITLPPSDDPSQGKTISAFTLTDHDYPLEIPPEDNPNPSFQPDPLH 64

Query: 354  PSQHPQTQENPGVQGRFSSKGFLFGSPIVFSCLLSLSLIALFSWVSSTQETLYELR---- 521
             +Q          Q R         SP +   LL +SL+A+  + S    ++   R    
Sbjct: 65   RNQ----------QSRLLFSDLSMNSPRLVLGLLGISLLAVAFYASVFPNSVQMFRVSPD 114

Query: 522  -----DADNDRKSNSIIFPLYPKLGIGGNVPQDAEIKLGRFVDSNFKIVMDNLNDGTTGG 686
                 D DN R++ S +FP+Y KL       +  E  LG   + NF   MD         
Sbjct: 115  ERNRDDDDNLRETASFVFPVYHKLRAREFHERILEEDLG-LENENFVESMDLELVNPVKV 173

Query: 687  KFAKSMISESKIDSTAILTARGNIHPEGLFYTFMLVGNPP--KPYFLDIDTGSDLTWIQC 860
                S  + S   ST I    GN++P+GL+YT +LVG P   + Y LDIDTGS+LTWIQC
Sbjct: 174  NDVLSTSAGSIDSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQC 233

Query: 861  DAPCTSCAKGAHPFYKPKRGNLIHSKDSYCIEVQKSPGTK-CETCDQCDYEIEYADSSSS 1037
            DAPCTSCAKGA+  YKP++ NL+ S +++C+EVQ++  T+ CE C QCDYEIEYAD S S
Sbjct: 234  DAPCTSCAKGANQLYKPRKDNLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYS 293

Query: 1038 IGVLIKDKFQVMISNGSLVRPNLVFGCAYDQQGLLLKSLAKTDGILGLGRGKISLPFQLA 1217
            +GVL KDKF + + NGSL   ++VFGC YDQQGLLL +L KTDGILGL R KISLP QLA
Sbjct: 294  MGVLTKDKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLA 353

Query: 1218 SHGNIKNVVAHCLASDAAGGGYVFLGDDFVPHQQMVWVPMLNSPFMNSYHAEVTKISYGG 1397
            S G I NVV HCLASD  G GY+F+G D VP   M WVPML+   +++Y  +VTK+SYG 
Sbjct: 354  SRGIISNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQ 413

Query: 1398 KQFSLGSIGKGIGHVVFDSGSSYTYFTKQAYNDLVANLEDISAKGFVLDASDDTLPICWR 1577
               SL      +G V+FD+GSSYTYF  QAY+ LV +L+++S      D SD+TLPICWR
Sbjct: 414  GMLSLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPICWR 473

Query: 1578 AK--FPVRSIAELRKFFKPLDFQFGSKWWIASQKLRIPPEGYLITSNKGNICLGILEGSH 1751
            AK  FP  S+++++KFF+P+  Q GSKW I S+KL I PE YLI SNKGN+CLGIL+GS 
Sbjct: 474  AKTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSS 533

Query: 1752 VNDGSSVILGDISLRGQLFVYDNENQKIGWIRSDCRRPQTFD 1877
            V+DGS++ILGDIS+RG L VYDN  ++IGW++SDC RP+  D
Sbjct: 534  VHDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDCVRPREID 575


Top