BLASTX nr result

ID: Paeonia25_contig00006750 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia25_contig00006750
         (2156 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vi...   753   0.0  
ref|XP_006440945.1| hypothetical protein CICLE_v10019473mg [Citr...   727   0.0  
gb|EXB56943.1| Aspartic proteinase Asp1 [Morus notabilis]             724   0.0  
ref|XP_006485774.1| PREDICTED: aspartic proteinase Asp1-like [Ci...   718   0.0  
ref|XP_007036500.1| Eukaryotic aspartyl protease family protein,...   701   0.0  
emb|CBI15437.3| unnamed protein product [Vitis vinifera]              688   0.0  
ref|XP_002511959.1| protein with unknown function [Ricinus commu...   688   0.0  
ref|XP_006374352.1| aspartyl protease family protein [Populus tr...   671   0.0  
ref|XP_004241624.1| PREDICTED: aspartic proteinase Asp1-like [So...   633   e-179
ref|XP_006354730.1| PREDICTED: aspartic proteinase Asp1-like [So...   633   e-178
ref|XP_006583400.1| PREDICTED: aspartic proteinase Asp1-like [Gl...   614   e-173
ref|XP_007152781.1| hypothetical protein PHAVU_004G159200g [Phas...   610   e-171
ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cu...   606   e-170
ref|XP_007036501.1| Eukaryotic aspartyl protease family protein,...   600   e-169
ref|XP_004512995.1| PREDICTED: lysine-specific histone demethyla...   581   e-163
ref|XP_007152782.1| hypothetical protein PHAVU_004G159200g [Phas...   575   e-161
ref|XP_006588295.1| PREDICTED: lysine-specific histone demethyla...   575   e-161
ref|XP_002891474.1| aspartyl protease family protein [Arabidopsi...   558   e-156
ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana] gi|777...   555   e-155
ref|XP_006393315.1| hypothetical protein EUTSA_v10011346mg [Eutr...   545   e-152

>ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
          Length = 686

 Score =  753 bits (1945), Expect = 0.0
 Identities = 389/584 (66%), Positives = 452/584 (77%), Gaps = 19/584 (3%)
 Frame = +3

Query: 189  REMEADQSPQLKGIVIISLPPPDNPSLGKTITAFTISDSP---PPPTRTXXXXXXXEV-- 353
            R+ME  QSPQLKG+VII+LPPPDNPSLGKTITAFT+SD P   P  T         +   
Sbjct: 116  RDMEFGQSPQLKGVVIITLPPPDNPSLGKTITAFTLSDPPLDRPHHTHQQLQRQQHQEEE 175

Query: 354  -----------RLPIQSAPHSQPQVSSRRFFHNSPRIIVGGFLGISLFACILWRSSVYSE 500
                       +LP  S P+   Q S R+    +PRI++G FLG+SLF  +LW  +  S 
Sbjct: 176  EEEEEEEEEPHQLPSPSPPNPALQFSVRKLSLGNPRILMG-FLGVSLFVFLLWNFAS-SS 233

Query: 501  TLPELRNSDDDGKPNSFIFPLYSKLGSRQMWQREVELKLGRFVDIDSKTVVASINDGMKP 680
             L ELR  +DD +P SFI PLY KLGSR +   ++ELKLG+FVD         +ND MKP
Sbjct: 234  PLVELRRKNDDREPTSFILPLYPKLGSRSLG--DLELKLGKFVDFH-------VND-MKP 283

Query: 681  HNKINKLISSQAAIDSSTILPVRGNIYPDGLYFTHMLLGSPPRPYFLDMDTGSDLTWIQC 860
               INKL +S +A DSSTI PVRG++YP+GLYFTH+ +GSPPR YFLDMDTGSDLTWIQC
Sbjct: 284  GG-INKLATSVSAFDSSTIFPVRGDVYPNGLYFTHIFVGSPPRRYFLDMDTGSDLTWIQC 342

Query: 861  DAPCTSCAKGPHPLYKPTKGKIVPSKDSLCVEVQRNQKDGYCETCLQCDYEIEYADHSSS 1040
            DAPCTSCAKGP+PLYKP KG +VP KDSLCVEVQRN K GYCETC QCDYEIEYADHSSS
Sbjct: 343  DAPCTSCAKGPNPLYKPKKGNLVPLKDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSS 402

Query: 1041 LGVLASDDLHLMIANGSMAKFNTVFGCAYDQQGSLLNSLAKTDGILGLSRAKVSLPSQLA 1220
            +GVLASDDLHLM+ANGS+ K   +FGCAYDQQG LLNSLAKTDGILGLS+AKVSLPSQLA
Sbjct: 403  MGVLASDDLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLA 462

Query: 1221 SQKIINNVVGHCLTSDVXXXXXXXXXXXFVPYRSMAWVPMLNSPSMNFYHTELVKMSYGN 1400
            SQ+IINNV+GHCLTSD            FVPY  MAWVPMLNS S N YH++++K+S+G+
Sbjct: 463  SQRIINNVLGHCLTSDATGGGYMFLGDDFVPYWGMAWVPMLNSHSPN-YHSQIMKISHGS 521

Query: 1401 RQLNLGGRD--SSHIVFDSGSSYTYLTKQAYFDFVVSLKEISG-GLIQDASDPTMPICWR 1571
            RQL+LG +D  +  +VFD+GSSYTY  K+AY+  V SLK++S  GLIQD SDPT+P+CWR
Sbjct: 522  RQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTLPVCWR 581

Query: 1572 AKSPIRSVADVKQFFKTLTLQFGSKWWIVSTKLRIPPEGYLVVNNKGNVCLGILDGSDVH 1751
            AK PIRSV DVKQFF+ LTLQF SKWWIVSTK RIPPEGYL+++NKGNVCLGILDGS+VH
Sbjct: 582  AKFPIRSVIDVKQFFQPLTLQFRSKWWIVSTKFRIPPEGYLIISNKGNVCLGILDGSNVH 641

Query: 1752 DGSTIILGDISLHGQMVVYDNVNQKIGWMQSDCVKPQRFKSLPF 1883
            DGSTIILGDISL G++VVYDNVNQKIGW QS CVKPQ+ KSLPF
Sbjct: 642  DGSTIILGDISLRGKLVVYDNVNQKIGWAQSTCVKPQKIKSLPF 685


>ref|XP_006440945.1| hypothetical protein CICLE_v10019473mg [Citrus clementina]
            gi|557543207|gb|ESR54185.1| hypothetical protein
            CICLE_v10019473mg [Citrus clementina]
          Length = 577

 Score =  727 bits (1876), Expect = 0.0
 Identities = 374/578 (64%), Positives = 452/578 (78%), Gaps = 14/578 (2%)
 Frame = +3

Query: 195  MEADQSP----QLKGIVIISLPPPDNPSLGKTITAFTISDSPPPPTRTXXXXXXXEVRLP 362
            M++D+SP    QL G+VII+LPPP+NPSLGKTITA+T++D+ P   +T       E  LP
Sbjct: 1    MDSDESPSPPPQLTGVVIITLPPPNNPSLGKTITAYTLTDNSPQSQQTRHRQQQ-EHPLP 59

Query: 363  IQSAP--HSQPQVSSRRFFHNSPRIIVGGFLGISLFACILWRSSVYSETLPE-LRNSDDD 533
             Q  P  +SQ   S    F   PR +   FL IS+FA IL+  SV+S TL +  ++++DD
Sbjct: 60   PQLHPPQNSQFNFSLPMLFPGLPRKLFL-FLAISIFALILY-GSVFSYTLQDRYKSNNDD 117

Query: 534  GKPNSFIFPLYSKLGSRQMWQREVELKLGRFVDIDSKTVVASINDGM-KPH-NKINK-LI 704
                SF+FPLY K G R++ QR+ E KLGRFVD+D ++VVAS+NDG+ +PH +KINK L+
Sbjct: 118  ENKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLV 177

Query: 705  SSQA-AIDSSTILPVRGNIYPDGLYFTHMLLGSPPRPYFLDMDTGSDLTWIQCDAPCTSC 881
            SS A A+DSS+I P+RGNIYPDGLYFT+M++G+PPRPY+LDMDTGSDLTWIQCDAPC+SC
Sbjct: 178  SSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC 237

Query: 882  AKGPHPLYKPTKGKIVPSKDSLCVEVQRNQKDGYCETCLQCDYEIEYADHSSSLGVLASD 1061
            AKG +PLYKP  G I+P KDSLC+E+QRN K GYCETC QCDYEIEYADHSSS+GVLA D
Sbjct: 238  AKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARD 297

Query: 1062 DLHLMIANGSMAKFNTVFGCAYDQQGSLLNSLAKTDGILGLSRAKVSLPSQLASQKIINN 1241
            +LHL I NGS+ K N VFGCAYDQQG LLN+L KTDGILGLSRAKVSLPSQLASQ II N
Sbjct: 298  ELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKN 357

Query: 1242 VVGHCLTSDVXXXXXXXXXXXFVPYRSMAWVPMLNSPSMNFYHTELVKMSYGNRQLNLGG 1421
            VVGHCLT++             VP   MAWVPML+SP M  YHTE++K++YG+  LNLG 
Sbjct: 358  VVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGA 417

Query: 1422 RDS--SHIVFDSGSSYTYLTKQAYFDFVVSLKEISG-GLIQDASDPTMPICWRAKSPIRS 1592
            R+S     +FD+GSSYTY TKQAY + + SLKE+S  GL+ DASDPT+P+CWRAK PIRS
Sbjct: 418  RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRS 477

Query: 1593 VADVKQFFKTLTLQFGSKWWIVSTKLRIPPEGYLVVNNKGNVCLGILDGSDVHDGSTIIL 1772
            + DVKQFFKTLTL FGSKW IVSTK RI PEGYLV++ KGN+CLGILDGS+VH+GSTIIL
Sbjct: 478  IVDVKQFFKTLTLHFGSKWQIVSTKFRISPEGYLVISKKGNICLGILDGSEVHNGSTIIL 537

Query: 1773 GDISLHGQMVVYDNVNQKIGWMQSDCVKPQRFKSLPFL 1886
            GDISL GQ+VVYDNVN++IGW +S C+ P RFKSLPFL
Sbjct: 538  GDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPFL 575


>gb|EXB56943.1| Aspartic proteinase Asp1 [Morus notabilis]
          Length = 569

 Score =  724 bits (1870), Expect = 0.0
 Identities = 367/575 (63%), Positives = 438/575 (76%), Gaps = 12/575 (2%)
 Frame = +3

Query: 195  MEADQSPQLKGIVIISLPPPDNPSLGKTITAFTISDSPPPPTRTXXXXXXXEVRLPIQSA 374
            ME+D  PQ+KG+VII+LPPPDNPSLGKTITAFT+S+S P  T         +  LPIQS 
Sbjct: 1    MESDHPPQIKGVVIITLPPPDNPSLGKTITAFTLSNSSPTQTHQESQN---QNNLPIQSP 57

Query: 375  PHSQPQVS--SRRFFHNSPRIIVGGFLGISLFACILWRSSVYSETLPELRNSDDDGKPNS 548
             + Q Q      R FH  PR +    LGIS+F  +L+ S V+   + E R S+DD  P S
Sbjct: 58   QNPQLQFPFPRLRLFHGVPRRLFA-LLGISIFTLVLF-SHVFPTVVEEFRRSNDDEGPES 115

Query: 549  FIFPLYSKLGSRQMWQREVELKLGRFVDIDSKTVVASINDGMKPHNKINKLISSQAAIDS 728
            FIFPLYSKLG     +++VELKLGRFVD D +    S  D +K   K+NKL+SS A +DS
Sbjct: 116  FIFPLYSKLGVPG--KKDVELKLGRFVDFDKENAGVSFGDRVKTQ-KVNKLVSSTAKVDS 172

Query: 729  STILPVRGNIYPDGLYFTHMLLGSPPRPYFLDMDTGSDLTWIQCDAPCTSCAKGPHPLYK 908
            S ILPVRGN+YPDGLY+T +L+G+PPRPY LDMDTGSDLTWIQCDAPCTSCAKG +PLYK
Sbjct: 173  SAILPVRGNVYPDGLYYTQILVGNPPRPYHLDMDTGSDLTWIQCDAPCTSCAKGANPLYK 232

Query: 909  PTKGKIVPSKDSLCVEVQRNQKDGYCETCLQCDYEIEYADHSSSLGVLASDDLHLMIANG 1088
            PTKG IVPSKDS C E++RNQK G+C+TC QCDYEI+YAD SSSLGVLA D LHL++ NG
Sbjct: 233  PTKGNIVPSKDSFCTEIRRNQKPGHCKTCQQCDYEIQYADRSSSLGVLAKDGLHLVMENG 292

Query: 1089 SMAKFNTVFGCAYDQQGSLLNSLAKTDGILGLSRAKVSLPSQLASQKIINNVVGHCLTSD 1268
            S+A  N VFGCAYDQQG LLN+LAKTDGILGLSRAKVSLPSQLAS+ II NVVGHCLT++
Sbjct: 293  SLANVNVVFGCAYDQQGLLLNTLAKTDGILGLSRAKVSLPSQLASKGIIKNVVGHCLTTN 352

Query: 1269 VXXXXXXXXXXXFVPYRSMAWVPMLNSPSMNFYHTELVKMSYGNRQLNLGGRDSS--HIV 1442
                        FVP+  M+W+PML SPSM+FY +E+V ++YG+  LNLG   S    +V
Sbjct: 353  AGGGGYMFLGDDFVPHWGMSWIPMLRSPSMDFYQSEIVSINYGSSALNLGAWSSKARQLV 412

Query: 1443 FDSGSSYTYLTKQAYFDFVVSLKEIS-GGLIQDASDPTMPICWRAKSPI-------RSVA 1598
            FDSGSSYTY  K+AY   + SL+E+S  GL++D SDP++PICWRA++P+       RSVA
Sbjct: 413  FDSGSSYTYFNKRAYSALLASLEEVSTTGLVRDRSDPSLPICWRAETPLNCIHMECRSVA 472

Query: 1599 DVKQFFKTLTLQFGSKWWIVSTKLRIPPEGYLVVNNKGNVCLGILDGSDVHDGSTIILGD 1778
            DVK+FFKT+TLQFGSKWWI+ST+LRIPPEGYL +++KGNVCLGILDGS VHDG T ILGD
Sbjct: 473  DVKRFFKTITLQFGSKWWIISTRLRIPPEGYLTISSKGNVCLGILDGSKVHDGYTTILGD 532

Query: 1779 ISLHGQMVVYDNVNQKIGWMQSDCVKPQRFKSLPF 1883
            ISL G +VVYDN NQKIGW  SDCVKP+RF SLPF
Sbjct: 533  ISLRGHLVVYDNENQKIGWTNSDCVKPRRFDSLPF 567


>ref|XP_006485774.1| PREDICTED: aspartic proteinase Asp1-like [Citrus sinensis]
          Length = 577

 Score =  718 bits (1854), Expect = 0.0
 Identities = 367/578 (63%), Positives = 446/578 (77%), Gaps = 14/578 (2%)
 Frame = +3

Query: 195  MEADQSP----QLKGIVIISLPPPDNPSLGKTITAFTISDSPPPPTRTXXXXXXXEVRLP 362
            M++D+SP    QL G+VII+LPPP+NPSLGKTITA+T++D+ P   +T       E  LP
Sbjct: 1    MDSDESPSPPPQLTGVVIITLPPPNNPSLGKTITAYTLTDNSPQSQQTHHQQQQ-EHPLP 59

Query: 363  IQSAP--HSQPQVSSRRFFHNSPRIIVGGFLGISLFACILWRSSVYSETLPE-LRNSDDD 533
             Q  P   SQ   S    F   PR +   FL IS+FA IL+  SV+S TL    ++++DD
Sbjct: 60   AQLHPPQDSQFNFSLPMLFPVLPRKLFL-FLAISIFALILY-GSVFSYTLQHRYKSNNDD 117

Query: 534  GKPNSFIFPLYSKLGSRQMWQREVELKLGRFVDIDSKTVVASINDGM-KPH-NKINKLI- 704
                SF+FPLY K G R++ QR+ E KLGRFVD+D ++VVAS+NDG+ +PH +KINK + 
Sbjct: 118  ENKESFVFPLYHKFGIREVLQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLV 177

Query: 705  -SSQAAIDSSTILPVRGNIYPDGLYFTHMLLGSPPRPYFLDMDTGSDLTWIQCDAPCTSC 881
             S+  A+DSS+  P+RGN+YPDGLYFT+M++G+PPRPY+LDMDTGSDLTWIQCDAPC+SC
Sbjct: 178  PSNAVAVDSSSTFPLRGNVYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC 237

Query: 882  AKGPHPLYKPTKGKIVPSKDSLCVEVQRNQKDGYCETCLQCDYEIEYADHSSSLGVLASD 1061
            AKG +PLYKP  G I+P KDSLC+E+QRN K GYCETC QCDYEIEYADHSSS+GVLA D
Sbjct: 238  AKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARD 297

Query: 1062 DLHLMIANGSMAKFNTVFGCAYDQQGSLLNSLAKTDGILGLSRAKVSLPSQLASQKIINN 1241
            +LHL I NGS+ K N VFGCAYDQQG LLN+L KTDGILGLSRAKVSLPSQLASQ II N
Sbjct: 298  ELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKN 357

Query: 1242 VVGHCLTSDVXXXXXXXXXXXFVPYRSMAWVPMLNSPSMNFYHTELVKMSYGNRQLNLGG 1421
            VVGHCLT++             VP   MAWVPML+SP M  YHTE++K++YG+  LNLG 
Sbjct: 358  VVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGA 417

Query: 1422 RDS--SHIVFDSGSSYTYLTKQAYFDFVVSLKEI-SGGLIQDASDPTMPICWRAKSPIRS 1592
            R+S     +FD+GSSYTY TKQAY + + SLKE+ S GL+ DASDPT+P+CWRAK PIRS
Sbjct: 418  RNSRVGWALFDTGSSYTYFTKQAYSELIASLKEVSSNGLVLDASDPTLPVCWRAKFPIRS 477

Query: 1593 VADVKQFFKTLTLQFGSKWWIVSTKLRIPPEGYLVVNNKGNVCLGILDGSDVHDGSTIIL 1772
            + DVKQ+FKTLTL FGSKW IVSTK  I PEGYLV++ KGN+CLGILDGS+VH+GSTIIL
Sbjct: 478  IVDVKQYFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIIL 537

Query: 1773 GDISLHGQMVVYDNVNQKIGWMQSDCVKPQRFKSLPFL 1886
            GDISL GQ+VVYDNVN++IGW +S C+ P RFKSLPFL
Sbjct: 538  GDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPFL 575


>ref|XP_007036500.1| Eukaryotic aspartyl protease family protein, putative isoform 1
            [Theobroma cacao] gi|508773745|gb|EOY21001.1| Eukaryotic
            aspartyl protease family protein, putative isoform 1
            [Theobroma cacao]
          Length = 576

 Score =  701 bits (1809), Expect = 0.0
 Identities = 363/582 (62%), Positives = 433/582 (74%), Gaps = 18/582 (3%)
 Frame = +3

Query: 195  MEADQSPQ-LKGIVIISLPPPDNPSLGKTITAFTISDSPPPPTRTXXXXXXXEVRLPIQ- 368
            M++D+ PQ + G+VII+LPP DNPSLGKTITAFT+++   P +         E    +  
Sbjct: 1    MDSDERPQQVTGVVIITLPPSDNPSLGKTITAFTLTNDVFPQSHQTQQRQQQEEEQTLPT 60

Query: 369  ---------SAPHSQPQVSSRRFFHNSPRIIVGGFLGISLFACILWRSSVYSETLPELRN 521
                     SA + Q   S    F ++PR ++G FLGISLFA +L+ SS +S T  ELRN
Sbjct: 61   TQILTPAPPSAQNPQRGFSFLGLFSDNPRKLLG-FLGISLFALLLY-SSAFSNTFVELRN 118

Query: 522  S--DDDGKPNSFIFPLYSKLGSRQMWQREVELKLGRFVDIDSKTVVASINDGMKPHNKIN 695
            S  DDD KP SFIFPLY KLG+      ++ELKLGRFVD+D + +VAS+  G     KIN
Sbjct: 119  SNNDDDEKPQSFIFPLYHKLGA------DLELKLGRFVDVDKENLVASVEGGATGTQKIN 172

Query: 696  KLISSQAAI--DSSTILPVRGNIYPDGLYFTHMLLGSPPRPYFLDMDTGSDLTWIQCDAP 869
            KL++S AA+   S TILPVRGN+YPDGLYFT+ML+G+P R YFLD+DTGSDLTWIQCDAP
Sbjct: 173  KLVASNAAVIDSSGTILPVRGNVYPDGLYFTYMLVGNPQRRYFLDIDTGSDLTWIQCDAP 232

Query: 870  CTSCAKGPHPLYKPTKGKIVPSKDSLCVEVQRNQKDGYCETCLQCDYEIEYADHSSSLGV 1049
            C+SCAKG +PLYKPT+  IV SKD +C EVQ+NQK   CETC QCDYEIEYAD SSSLGV
Sbjct: 233  CSSCAKGANPLYKPTRVNIVASKDLMCTEVQKNQKPQNCETCQQCDYEIEYADRSSSLGV 292

Query: 1050 LASDDLHLMIANGSMAKFNTVFGCAYDQQGSLLNSLAKTDGILGLSRAKVSLPSQLASQK 1229
            LA D+LHL+ ANGS    + VFGCAYDQQG LLN+L+KTDGILGLSRAKVSLPSQLAS+ 
Sbjct: 293  LARDELHLVTANGSTTNLDVVFGCAYDQQGILLNTLSKTDGILGLSRAKVSLPSQLASKG 352

Query: 1230 IINNVVGHCLTSDVXXXXXXXXXXXFVPYRSMAWVPMLNSPSMNFYHTELVKMSYGNRQL 1409
            IINNVVGHCL +DV           FVP   M+WVPML SPS  FYHT++VK++YG+  L
Sbjct: 353  IINNVVGHCLATDVGASGYMFLGDDFVPNWGMSWVPMLGSPSTEFYHTQIVKINYGSSSL 412

Query: 1410 NLGGRDSS--HIVFDSGSSYTYLTKQAYFDFVVSLKEISG-GLIQDASDPTMPICWRAKS 1580
            +LG + SS   +VFDSGSSYTY  KQAY + V SL E+S  G IQD +D T+P+CW+A  
Sbjct: 413  SLGRQHSSIGRVVFDSGSSYTYFMKQAYAELVASLSEVSEVGFIQDVADTTLPMCWQAPF 472

Query: 1581 PIRSVADVKQFFKTLTLQFGSKWWIVSTKLRIPPEGYLVVNNKGNVCLGILDGSDVHDGS 1760
            PIR + DVKQFFKTLTLQFGSKWWI+S +  IPPEGYL+++ KGNVCLGILDGS VHDGS
Sbjct: 473  PIRFIKDVKQFFKTLTLQFGSKWWIISKRFHIPPEGYLIISKKGNVCLGILDGSKVHDGS 532

Query: 1761 TIILGDISLHGQMVVYDNVNQKIGWMQSDCVKPQRFKSLPFL 1886
            TIILGDISL GQ+VVYDN   KIGW QSDC  P+RFKSLPF+
Sbjct: 533  TIILGDISLRGQLVVYDNEKLKIGWTQSDCAHPRRFKSLPFV 574


>emb|CBI15437.3| unnamed protein product [Vitis vinifera]
          Length = 473

 Score =  688 bits (1776), Expect = 0.0
 Identities = 344/484 (71%), Positives = 396/484 (81%), Gaps = 3/484 (0%)
 Frame = +3

Query: 441  GFLGISLFACILWRSSVYSETLPELRNSDDDGKPNSFIFPLYSKLGSRQMWQREVELKLG 620
            GFLG+SLF  +LW  +  S  L ELR  +DD +P SFI PLY KLGSR +   ++ELKLG
Sbjct: 2    GFLGVSLFVFLLWNFAS-SSPLVELRRKNDDREPTSFILPLYPKLGSRSLG--DLELKLG 58

Query: 621  RFVDIDSKTVVASINDGMKPHNKINKLISSQAAIDSSTILPVRGNIYPDGLYFTHMLLGS 800
            +FVD         +ND MKP   INKL +S +A DSSTI PVRG++YP+GLYFTH+ +GS
Sbjct: 59   KFVDFH-------VND-MKPGG-INKLATSVSAFDSSTIFPVRGDVYPNGLYFTHIFVGS 109

Query: 801  PPRPYFLDMDTGSDLTWIQCDAPCTSCAKGPHPLYKPTKGKIVPSKDSLCVEVQRNQKDG 980
            PPR YFLDMDTGSDLTWIQCDAPCTSCAKGP+PLYKP KG +VP KDSLCVEVQRN K G
Sbjct: 110  PPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNLVPLKDSLCVEVQRNLKTG 169

Query: 981  YCETCLQCDYEIEYADHSSSLGVLASDDLHLMIANGSMAKFNTVFGCAYDQQGSLLNSLA 1160
            YCETC QCDYEIEYADHSSS+GVLASDDLHLM+ANGS+ K   +FGCAYDQQG LLNSLA
Sbjct: 170  YCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLA 229

Query: 1161 KTDGILGLSRAKVSLPSQLASQKIINNVVGHCLTSDVXXXXXXXXXXXFVPYRSMAWVPM 1340
            KTDGILGLS+AKVSLPSQLASQ+IINNV+GHCLTSD            FVPY  MAWVPM
Sbjct: 230  KTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFLGDDFVPYWGMAWVPM 289

Query: 1341 LNSPSMNFYHTELVKMSYGNRQLNLGGRD--SSHIVFDSGSSYTYLTKQAYFDFVVSLKE 1514
            LNS S N YH++++K+S+G+RQL+LG +D  +  +VFD+GSSYTY  K+AY+  V SLK+
Sbjct: 290  LNSHSPN-YHSQIMKISHGSRQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALVASLKD 348

Query: 1515 ISG-GLIQDASDPTMPICWRAKSPIRSVADVKQFFKTLTLQFGSKWWIVSTKLRIPPEGY 1691
            +S  GLIQD SDPT+P+CWRAK PIRSV DVKQFF+ LTLQF SKWWIVSTK RIPPEGY
Sbjct: 349  VSDEGLIQDGSDPTLPVCWRAKFPIRSVIDVKQFFQPLTLQFRSKWWIVSTKFRIPPEGY 408

Query: 1692 LVVNNKGNVCLGILDGSDVHDGSTIILGDISLHGQMVVYDNVNQKIGWMQSDCVKPQRFK 1871
            L+++NKGNVCLGILDGS+VHDGSTIILGDISL G++VVYDNVNQKIGW QS CVKPQ+ K
Sbjct: 409  LIISNKGNVCLGILDGSNVHDGSTIILGDISLRGKLVVYDNVNQKIGWAQSTCVKPQKIK 468

Query: 1872 SLPF 1883
            SLPF
Sbjct: 469  SLPF 472


>ref|XP_002511959.1| protein with unknown function [Ricinus communis]
            gi|223549139|gb|EEF50628.1| protein with unknown function
            [Ricinus communis]
          Length = 583

 Score =  688 bits (1775), Expect = 0.0
 Identities = 356/584 (60%), Positives = 434/584 (74%), Gaps = 21/584 (3%)
 Frame = +3

Query: 195  MEADQSPQLKGIVIISLPPPDNPSLGKTITAFTISDSPPPPT----------RTXXXXXX 344
            ME+D       +VIISLPPP+NPSLGKTITAFT++D     T                  
Sbjct: 1    MESDDQSSHVKVVIISLPPPNNPSLGKTITAFTLTDDDHDATYPQSHQNHEQEPSIIQTH 60

Query: 345  XEVRLPIQSA--PHSQPQV--SSRRFFHNSPRIIVGGFLGISLFACILWRSSVYSETLPE 512
             E +LP+QS   P   PQ+  S    + ++PR ++   L ISLFA I++RS ++S TL E
Sbjct: 61   RESQLPVQSPSLPPQNPQIQFSFSGLYFSTPRKLLF-LLCISLFAVIVYRS-LFSNTLLE 118

Query: 513  LRNSDDDG--KPNSFIFPLYSKLGSRQMWQREVELKLGRFVDIDSKTVVASINDG--MKP 680
            L+ SDDD   K  SFIFPLY K G R++ Q  +E K  R V  +S  +VAS+ND   + P
Sbjct: 119  LKVSDDDNDEKTKSFIFPLYHKFGIREISQSNLEHKSIRSVYKES--LVASVNDDDVIVP 176

Query: 681  HNKINKLISSQAAIDSSTILPVRGNIYPDGLYFTHMLLGSPPRPYFLDMDTGSDLTWIQC 860
            +       S+ AA+DSS++ PVRGN+YPDGLYFT++L+G+PPRPY+LD+DT SDLTWIQC
Sbjct: 177  NRNYKLASSNAAAVDSSSVFPVRGNVYPDGLYFTYILVGNPPRPYYLDIDTASDLTWIQC 236

Query: 861  DAPCTSCAKGPHPLYKPTKGKIVPSKDSLCVEVQRNQKDGYCETCLQCDYEIEYADHSSS 1040
            DAPCTSCAKG + LYKP +  IV  KDSLCVE+ RNQK GYCETC QCDYEIEYADHSSS
Sbjct: 237  DAPCTSCAKGANALYKPRRDNIVTPKDSLCVELHRNQKAGYCETCQQCDYEIEYADHSSS 296

Query: 1041 LGVLASDDLHLMIANGSMAKFNTVFGCAYDQQGSLLNSLAKTDGILGLSRAKVSLPSQLA 1220
            +GVLA D+LHL +ANGS       FGCAYDQQG LLN+L KTDGILGLS+AKVSLPSQLA
Sbjct: 297  MGVLARDELHLTMANGSSTNLKFNFGCAYDQQGLLLNTLVKTDGILGLSKAKVSLPSQLA 356

Query: 1221 SQKIINNVVGHCLTSDVXXXXXXXXXXXFVPYRSMAWVPMLNSPSMNFYHTELVKMSYGN 1400
            ++ IINNVVGHCL +DV           FVP   M+WVPML+SPS++ Y T+++K++YG+
Sbjct: 357  NRGIINNVVGHCLANDVVGGGYMFLGDDFVPRWGMSWVPMLDSPSIDSYQTQIMKLNYGS 416

Query: 1401 RQLNLGGRDS--SHIVFDSGSSYTYLTKQAYFDFVVSLKEISG-GLIQDASDPTMPICWR 1571
              L+LGG++     IVFDSGSSYTY TK+AY + V SLK++SG  LIQD SDPT+P CWR
Sbjct: 417  GPLSLGGQERRVRRIVFDSGSSYTYFTKEAYSELVASLKQVSGEALIQDTSDPTLPFCWR 476

Query: 1572 AKSPIRSVADVKQFFKTLTLQFGSKWWIVSTKLRIPPEGYLVVNNKGNVCLGILDGSDVH 1751
            AK PIRSV DVKQ+FKTLTLQFGSKWWI+STK RIPPEGYL+++NKGNVCLGILDGSDVH
Sbjct: 477  AKFPIRSVIDVKQYFKTLTLQFGSKWWIISTKFRIPPEGYLIISNKGNVCLGILDGSDVH 536

Query: 1752 DGSTIILGDISLHGQMVVYDNVNQKIGWMQSDCVKPQRFKSLPF 1883
            DGS+IILGDISL GQ+++YDNVN KIGW QSDC+KP+ F +LPF
Sbjct: 537  DGSSIILGDISLRGQLIIYDNVNNKIGWTQSDCIKPKTFSTLPF 580


>ref|XP_006374352.1| aspartyl protease family protein [Populus trichocarpa]
            gi|550322111|gb|ERP52149.1| aspartyl protease family
            protein [Populus trichocarpa]
          Length = 603

 Score =  671 bits (1732), Expect = 0.0
 Identities = 360/609 (59%), Positives = 432/609 (70%), Gaps = 45/609 (7%)
 Frame = +3

Query: 192  EMEADQSPQLKGIVIISLPPPDNPSLGKTITAFTISDSPPPPTRTXXXXXXXEVRLPIQS 371
            E + DQSPQLKG+VIISLPPPDNPSLGKTITAFT++++  P +         E +LPI S
Sbjct: 2    ESDDDQSPQLKGVVIISLPPPDNPSLGKTITAFTLTNNDYPQSHQTPQTHQ-EDQLPISS 60

Query: 372  AP-----HSQPQVSSRRFFHNSPRIIVGGFLGISLFACILWRSSVYSETLPELR---NSD 527
             P     +SQ Q  S R F  +PR ++  F+ ISLFA  ++ SS+++ T  EL+   N D
Sbjct: 61   PPPPPSQNSQLQFPSSRLFLGTPRKLLS-FVFISLFALAIY-SSLFTNTFQELKSNNNDD 118

Query: 528  DDGKPNSFIFPLYSKLGSRQMWQREVELKLGRFVDIDSKTVVASINDGMKPHNKINKLIS 707
            DD KP S++FPLY KLG R++   ++E  L RFV    + +VAS++    PH KI+KL S
Sbjct: 119  DDQKPKSYVFPLYHKLGIREIPLNDLENHLRRFVY--KENLVASVDHLNGPH-KISKLAS 175

Query: 708  SQAA--IDSSTILPVRGNIYPDGLYFTHMLLGSPPRPYFLDMDTGSDLTWIQCDAPCTSC 881
            S AA  +DSS I PVRGN+YPDG          PP+PY+LD DTGSDLTWIQCDAPCTSC
Sbjct: 176  SNAAAAMDSSAIFPVRGNLYPDG----------PPQPYYLDFDTGSDLTWIQCDAPCTSC 225

Query: 882  AKGPHPLYKPTKGKIVPSKDSLCVEVQRNQKDGYCETCLQCDYEIEYADHSSSLGVLASD 1061
            AKG +  YKP +G IVP KD LC+EVQRNQK GYCETC QCDYEIEYADHSSS+GVLA+D
Sbjct: 226  AKGANAWYKPRRGNIVPPKDLLCMEVQRNQKAGYCETCDQCDYEIEYADHSSSMGVLATD 285

Query: 1062 DLHLMIANGSMAKFNTVFGCAYDQQGSLLNSLAKTDGILGLSRAKVSLPSQLASQKIINN 1241
             L LM+ANGS+ K N +FGCAYDQQG LL +L KTDGILGLSRAKVSLPSQLASQ IINN
Sbjct: 286  KLLLMVANGSLTKLNFIFGCAYDQQGLLLKTLVKTDGILGLSRAKVSLPSQLASQGIINN 345

Query: 1242 VVGHCLTSDVXXXXXXXXXXXFVPYRSMAWVPMLNSPSMNFYHTELVKMSYGNRQLNLGG 1421
            V+GHCLT+D+           FVP   MAWVPML+SPSM FYHTE+VK++YG+  L+LGG
Sbjct: 346  VIGHCLTTDLGGGGYMFLGDDFVPRWGMAWVPMLDSPSMEFYHTEVVKLNYGSSPLSLGG 405

Query: 1422 RDS--SHIVFDSGSSYTYLTKQAYFDFVVSLKEISG-GLIQDASDPTMPICWRAKSPIRS 1592
             +S   HI+FDSGSSYTY  K+AY + V SL E+SG GL+Q  SD T+P+CWRA  PIR 
Sbjct: 406  MESRVKHILFDSGSSYTYFPKEAYSELVASLNEVSGAGLVQSTSDTTLPLCWRANFPIRK 465

Query: 1593 V--------------------------------ADVKQFFKTLTLQFGSKWWIVSTKLRI 1676
                                              DVK+FFKTLT QFG+KW ++STK RI
Sbjct: 466  FIYRTELTRPIRRRRRRRRRRRRRRRRRRQHIKGDVKKFFKTLTFQFGTKWLVISTKFRI 525

Query: 1677 PPEGYLVVNNKGNVCLGILDGSDVHDGSTIILGDISLHGQMVVYDNVNQKIGWMQSDCVK 1856
            PPEGYL++++KGNVCLGIL+GS VHDGSTIILGDISL GQ+VVYDNVN+KIGW  SDC K
Sbjct: 526  PPEGYLMMSDKGNVCLGILEGSKVHDGSTIILGDISLRGQLVVYDNVNKKIGWTPSDCAK 585

Query: 1857 PQRFKSLPF 1883
            P+R  SL F
Sbjct: 586  PKRSDSLQF 594


>ref|XP_004241624.1| PREDICTED: aspartic proteinase Asp1-like [Solanum lycopersicum]
          Length = 562

 Score =  633 bits (1633), Expect = e-179
 Identities = 332/573 (57%), Positives = 398/573 (69%), Gaps = 11/573 (1%)
 Frame = +3

Query: 198  EADQSPQLKGIVIISLPPPDNPSLGKTITAFTISDSPPPPTRTXXXXXXXEVRLPIQSAP 377
            E   SP ++G+VII+LPPPDNPS GKTITAFT+SDSP   T         E   P QS P
Sbjct: 3    ETKNSPPIQGVVIITLPPPDNPSYGKTITAFTLSDSP---THQQQQEQEQEEEPPQQSQP 59

Query: 378  HSQP------QVSSRRFFHNSPRIIVGGFLGISLFACILWRSSVYSETLPELRNSDDDGK 539
            H+Q        VS  R F   P I+ G  LGISL A   W SS+  ETL ELR+ + D K
Sbjct: 60   HNQDVNAGVLHVSLERSFFFRPTIVFG-LLGISLIALSFW-SSLTQETLFELRDVEQDHK 117

Query: 540  PN--SFIFPLYSKLGSRQMWQREVELKLGRFVDIDSKTVVASINDGMKPHNKINKLISSQ 713
             +  SFI PLY K G     + +VE KLGRFVD           D      KI K +S+ 
Sbjct: 118  SSNSSFILPLYPKRGGAWNSRTDVEFKLGRFVDFKP--------DNFMDQEKIAKSLSAA 169

Query: 714  AAIDSSTILPVRGNIYPDGLYFTHMLLGSPPRPYFLDMDTGSDLTWIQCDAPCTSCAKGP 893
              +DSS   PVRGNI+ +GLY+T+ML+G+PP+PYFLD+DTGSDL WIQCDAPCTSCAKG 
Sbjct: 170  TKLDSSANFPVRGNIHSEGLYYTYMLVGNPPKPYFLDIDTGSDLMWIQCDAPCTSCAKGA 229

Query: 894  HPLYKPTKGKIVPSKDSLCVEVQRNQKDGYCETCLQCDYEIEYADHSSSLGVLASDDLHL 1073
            HPLYKP    ++P K+  CVEVQ N +  YC+ C QCDYEIEYAD SSS+GVLA D+L L
Sbjct: 230  HPLYKPRNVNMIPPKNPYCVEVQENLRSKYCDNCHQCDYEIEYADRSSSVGVLAKDELQL 289

Query: 1074 MIANGSMAKFNTVFGCAYDQQGSLLNSLAKTDGILGLSRAKVSLPSQLASQKIINNVVGH 1253
            ++ANG+  K N VFGCAYDQQG+LLN+LA TDGILGLSRA +SLPSQLAS  +INNV+GH
Sbjct: 290  VLANGTGTKPNVVFGCAYDQQGTLLNTLASTDGILGLSRAPISLPSQLASHGLINNVIGH 349

Query: 1254 CLTSDVXXXXXXXXXXXFVPYRSMAWVPMLNSPSMNFYHTELVKMSYGNRQLNLGGR--D 1427
            CL +D            FVP   M+WVPMLN+P  N Y  +L+KM+YG + L LG R   
Sbjct: 350  CLRTDT-NGGYLFLGNDFVPQWRMSWVPMLNNPFPNLYQAQLMKMNYGGKDLQLGSRGYG 408

Query: 1428 SSHIVFDSGSSYTYLTKQAYFDFVVSLKEISG-GLIQDASDPTMPICWRAKSPIRSVADV 1604
               +VFDSGS+YTY T QAY   +  L+EIS   LI+DASD T+PICWRAK P+RS+ +V
Sbjct: 409  QDSVVFDSGSTYTYFTDQAYKALISMLEEISSEDLIKDASDTTLPICWRAKFPVRSIEEV 468

Query: 1605 KQFFKTLTLQFGSKWWIVSTKLRIPPEGYLVVNNKGNVCLGILDGSDVHDGSTIILGDIS 1784
            +QFFK L LQFGSKW +VSTKL IP EGYL ++ K NVCLGILDGS+VHDGS IILGDIS
Sbjct: 469  RQFFKPLNLQFGSKWRVVSTKLWIPAEGYLTISEKSNVCLGILDGSNVHDGSAIILGDIS 528

Query: 1785 LHGQMVVYDNVNQKIGWMQSDCVKPQRFKSLPF 1883
            L GQ+ VYDNVNQKIGW++S+C +P+   SLPF
Sbjct: 529  LRGQLFVYDNVNQKIGWIRSNCERPENVPSLPF 561


>ref|XP_006354730.1| PREDICTED: aspartic proteinase Asp1-like [Solanum tuberosum]
          Length = 558

 Score =  633 bits (1632), Expect = e-178
 Identities = 332/573 (57%), Positives = 402/573 (70%), Gaps = 11/573 (1%)
 Frame = +3

Query: 198  EADQSPQLKGIVIISLPPPDNPSLGKTITAFTISDSPPPPTRTXXXXXXXEVRLPIQSAP 377
            E   SP ++G+VII+LPPPDNPS GKTITAFT+SDSP    +        E   P QS P
Sbjct: 3    ETKNSPPIQGVVIITLPPPDNPSYGKTITAFTLSDSPTHQQQQ-------EEEPPQQSQP 55

Query: 378  HSQP------QVSSRRFFHNSPRIIVGGFLGISLFACILWRSSVYSETLPELRNSDDDGK 539
            H+Q       + S  R F   P+I+ G  LGISL A   W SS+  ETL ELR+ + D K
Sbjct: 56   HNQDLNTGVLRASLERSFFFRPKIVFG-LLGISLIALSFW-SSLTQETLFELRDVEHDHK 113

Query: 540  PN--SFIFPLYSKLGSRQMWQREVELKLGRFVDIDSKTVVASINDGMKPHNKINKLISSQ 713
             +  SFI PLY K G     +R+VE KLGRFVD           D      KI K +S+ 
Sbjct: 114  SSNSSFILPLYPKRGGAWNSRRDVEFKLGRFVDFKP--------DKFMDQEKIAKSLSAA 165

Query: 714  AAIDSSTILPVRGNIYPDGLYFTHMLLGSPPRPYFLDMDTGSDLTWIQCDAPCTSCAKGP 893
              +DSS   PVRGNI+ +GLY+T+ML+G+PPRPYFLD+DTGSDL WIQCDAPCTSCAKG 
Sbjct: 166  TKLDSSVNFPVRGNIHSEGLYYTYMLVGNPPRPYFLDIDTGSDLMWIQCDAPCTSCAKGA 225

Query: 894  HPLYKPTKGKIVPSKDSLCVEVQRNQKDGYCETCLQCDYEIEYADHSSSLGVLASDDLHL 1073
            HPLYKP    ++P K+  CVEVQ N K  YC+ C QCDYEIEYAD SSS+GVLA D+L L
Sbjct: 226  HPLYKPRNVNMIPPKNPYCVEVQENLKSKYCDNCHQCDYEIEYADRSSSVGVLAKDELQL 285

Query: 1074 MIANGSMAKFNTVFGCAYDQQGSLLNSLAKTDGILGLSRAKVSLPSQLASQKIINNVVGH 1253
            ++ANG+  K + VFGCAYDQQG+LLN+LA TDGILGLSRA +SLPSQLAS  +INNV+GH
Sbjct: 286  VLANGTGTKPSVVFGCAYDQQGTLLNTLASTDGILGLSRAPISLPSQLASHGLINNVIGH 345

Query: 1254 CLTSDVXXXXXXXXXXXFVPYRSMAWVPMLNSPSMNFYHTELVKMSYGNRQLNLGGRD-- 1427
            CL +D            FVP   M+WVPMLN+P  N Y  +L+KM+YG ++L LG     
Sbjct: 346  CLRTDT-NGGYLFLGNDFVPQWRMSWVPMLNNPFPNLYQAQLMKMNYGGKELRLGSTSYG 404

Query: 1428 SSHIVFDSGSSYTYLTKQAYFDFVVSLKEISG-GLIQDASDPTMPICWRAKSPIRSVADV 1604
               +VFDSGS+YTY T QAY   +  L+EIS   LI+DASD T+PICWRAK P+RS+ +V
Sbjct: 405  QGTVVFDSGSTYTYFTDQAYKALISMLEEISSEDLIKDASDTTLPICWRAKFPVRSIEEV 464

Query: 1605 KQFFKTLTLQFGSKWWIVSTKLRIPPEGYLVVNNKGNVCLGILDGSDVHDGSTIILGDIS 1784
            +QFFK L LQFGSKW IVSTKL IP EG+L ++ KGNVCLGILDGS+VHDGS IILGDIS
Sbjct: 465  RQFFKPLNLQFGSKWRIVSTKLWIPAEGFLTISEKGNVCLGILDGSNVHDGSAIILGDIS 524

Query: 1785 LHGQMVVYDNVNQKIGWMQSDCVKPQRFKSLPF 1883
            L GQ+ VYDNVNQKIGW++S+C +P++  SLPF
Sbjct: 525  LRGQLFVYDNVNQKIGWIRSNCERPEKVPSLPF 557


>ref|XP_006583400.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 574

 Score =  614 bits (1584), Expect = e-173
 Identities = 319/574 (55%), Positives = 401/574 (69%), Gaps = 19/574 (3%)
 Frame = +3

Query: 195  MEADQSPQLKGIVIISLPPPDNPSLGKTITAFTISDSPPPPTRTXXXXXXXEVRL----- 359
            ME DQS Q+KG+VIISLPPPDNPSLGKTITAF  S++P PP +        + +      
Sbjct: 1    MEDDQSTQIKGVVIISLPPPDNPSLGKTITAFAFSNNPSPPPQLFIQPHQHQSQQTHPNA 60

Query: 360  ------PIQSAPHSQPQVSS--RRFFHNSPRIIVGGFLGISLFACILWRSSVYSETLPEL 515
                  P+QS P S PQ+S   RR FH++P + +  F G  LFA  L+  SV S T  +L
Sbjct: 61   QHNTDPPLQSYP-SNPQLSFSFRRLFHSTP-VKLFSFFGTLLFALFLY-GSVSSTTTVDL 117

Query: 516  R---NSDDDGKPNSFIFPLYSKLGSRQMWQREVELKLGRFVDIDSKTVVASINDGMKPHN 686
            R   N  DD K  SF+FPL+ K G   + Q++++L+LG+ V  +       + DG     
Sbjct: 118  RGRKNDGDDDKATSFLFPLFPKFGV--LGQKDLKLQLGKLVQKEKFLTQRDVGDG----- 170

Query: 687  KINKLISSQAAIDSSTILPVRGNIYPDGLYFTHMLLGSPPRPYFLDMDTGSDLTWIQCDA 866
                  S   A+DSS++ PV GN+YPDGLYFT + +G+PP+ YFLD+DTGSDLTW+QCDA
Sbjct: 171  ------SGVVAVDSSSVFPVSGNVYPDGLYFTILRVGNPPKSYFLDVDTGSDLTWMQCDA 224

Query: 867  PCTSCAKGPHPLYKPTKGKIVPSKDSLCVEVQRNQKDGYC-ETCLQCDYEIEYADHSSSL 1043
            PC SC KG H  YKPT+  +V S DSLC++VQ+NQK+G+  E+ LQCDYEI+YADHSSSL
Sbjct: 225  PCRSCGKGAHVQYKPTRSNVVSSVDSLCLDVQKNQKNGHHDESLLQCDYEIQYADHSSSL 284

Query: 1044 GVLASDDLHLMIANGSMAKFNTVFGCAYDQQGSLLNSLAKTDGILGLSRAKVSLPSQLAS 1223
            GVL  D+LHL+  NGS  K N VFGC YDQ+G +LN+LAKTDGI+GLSRAKVSLP QLAS
Sbjct: 285  GVLVRDELHLVTTNGSKTKLNVVFGCGYDQEGLILNTLAKTDGIMGLSRAKVSLPYQLAS 344

Query: 1224 QKIINNVVGHCLTSDVXXXXXXXXXXXFVPYRSMAWVPMLNSPSMNFYHTELVKMSYGNR 1403
            + +I NVVGHCL++D            FVPY  M WVPM  + + + Y TE++ ++YGNR
Sbjct: 345  KGLIKNVVGHCLSNDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNR 404

Query: 1404 QLNLGGRDS-SHIVFDSGSSYTYLTKQAYFDFVVSLKEISG-GLIQDASDPTMPICWRAK 1577
            QL   G+     + FDSGSSYTY  K+AY D V SL E+SG GL+QD SD T+PICW+A 
Sbjct: 405  QLKFDGQSKVGKVFFDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLPICWQAN 464

Query: 1578 SPIRSVADVKQFFKTLTLQFGSKWWIVSTKLRIPPEGYLVVNNKGNVCLGILDGSDVHDG 1757
              IRS+ DVK +FKTLTL+FGSKWWI+ST  +IPPEGYL+++NKG+VCLGILDGS V+DG
Sbjct: 465  FQIRSIKDVKDYFKTLTLRFGSKWWILSTLFQIPPEGYLIISNKGHVCLGILDGSKVNDG 524

Query: 1758 STIILGDISLHGQMVVYDNVNQKIGWMQSDCVKP 1859
            S+IILGDISL G  VVYDNV QKIGW ++DC  P
Sbjct: 525  SSIILGDISLRGYSVVYDNVKQKIGWKRADCGMP 558


>ref|XP_007152781.1| hypothetical protein PHAVU_004G159200g [Phaseolus vulgaris]
            gi|561026090|gb|ESW24775.1| hypothetical protein
            PHAVU_004G159200g [Phaseolus vulgaris]
          Length = 572

 Score =  610 bits (1572), Expect = e-171
 Identities = 315/568 (55%), Positives = 395/568 (69%), Gaps = 16/568 (2%)
 Frame = +3

Query: 195  MEADQSPQLKGIVIISLPPPDNPSLGKTITAFTISD-SPPPPTRTXXXXXXXEVRL---- 359
            M+ DQ PQ+KG+VIISLPPPDNPSLGKTITAFT SD S P P+         +  +    
Sbjct: 1    MDDDQFPQIKGVVIISLPPPDNPSLGKTITAFTFSDPSSPQPSLLLQQSHQHQTNINEYN 60

Query: 360  ----PIQSAP-HSQPQVSSRRFFHNSPRIIVGGFLGISLFACILWRSSVYSETLPEL--- 515
                P+ S P ++Q   S RR FH +P +    F G+ LFA  L+  SV S T  EL   
Sbjct: 61   NTDPPLHSYPSNAQLGFSRRRLFHRTP-VRFFSFFGVFLFALFLY-GSVSSTTTLELSGP 118

Query: 516  -RNSDDDGKPNSFIFPLYSKLGSRQMWQREVELKLGRFVDIDSKTVVASINDGMKPHNKI 692
              + DDDGKP S++FPLY K G   + Q+ ++L+LG+ V  +          G       
Sbjct: 119  KNDGDDDGKPGSYLFPLYPKFGV--LGQKNMKLQLGKLVHKEKLLTQRKYRVG------- 169

Query: 693  NKLISSQAAIDSSTILPVRGNIYPDGLYFTHMLLGSPPRPYFLDMDTGSDLTWIQCDAPC 872
                S   A+DSS++ PV GN++PDGLYFT + +G+PPR YFLD+DTGSDLTW+QCDAPC
Sbjct: 170  ----SEVVAVDSSSVFPVSGNVFPDGLYFTILRVGNPPRSYFLDVDTGSDLTWMQCDAPC 225

Query: 873  TSCAKGPHPLYKPTKGKIVPSKDSLCVEVQRNQKDGYCETCLQCDYEIEYADHSSSLGVL 1052
             SC KG H  YKPT+  +VPS DSLC++VQ+NQKDG+ E+  QCDY+IEYAD SSSLGVL
Sbjct: 226  ISCGKGAHAQYKPTRSNVVPSMDSLCLDVQKNQKDGHHESLQQCDYQIEYADQSSSLGVL 285

Query: 1053 ASDDLHLMIANGSMAKFNTVFGCAYDQQGSLLNSLAKTDGILGLSRAKVSLPSQLASQKI 1232
              D+LHL+  NGS  K N VFGC YDQ+G LLN+LAKTDGILGLSRAKVSLP QLAS+ +
Sbjct: 286  IRDELHLVTTNGSKTKLNFVFGCGYDQEGLLLNTLAKTDGILGLSRAKVSLPYQLASKGL 345

Query: 1233 INNVVGHCLTSDVXXXXXXXXXXXFVPYRSMAWVPMLNSPSMNFYHTELVKMSYGNRQLN 1412
            I NVVGHCL++D            F+PY  M WVPM  + + + Y TE++ ++YGNRQL+
Sbjct: 346  IKNVVGHCLSNDEVGGGYMFLGDDFLPYWGMTWVPMAYTLTTDLYQTEILGINYGNRQLS 405

Query: 1413 LGGRDS-SHIVFDSGSSYTYLTKQAYFDFVVSLKEISG-GLIQDASDPTMPICWRAKSPI 1586
              G+     +VFDSGSSYTY  K+AY D V SL E+SG  LIQD SD T+PICW A  PI
Sbjct: 406  FDGQSKVGKVVFDSGSSYTYFPKEAYLDLVASLNEVSGLRLIQDDSDTTLPICWEANFPI 465

Query: 1587 RSVADVKQFFKTLTLQFGSKWWIVSTKLRIPPEGYLVVNNKGNVCLGILDGSDVHDGSTI 1766
            +SV DVK +FKT+TL+FGSKWWI+ST  +I PEGYL+++NKG+VCLGILDGS+V+DGS+I
Sbjct: 466  KSVKDVKDYFKTITLRFGSKWWILSTMFQIAPEGYLIISNKGHVCLGILDGSNVNDGSSI 525

Query: 1767 ILGDISLHGQMVVYDNVNQKIGWMQSDC 1850
            ILGDIS  G +VVYDN  QKIGW +++C
Sbjct: 526  ILGDISFRGYLVVYDNSKQKIGWKRAEC 553


>ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
            gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic
            proteinase Asp1-like [Cucumis sativus]
          Length = 570

 Score =  606 bits (1563), Expect = e-170
 Identities = 322/577 (55%), Positives = 408/577 (70%), Gaps = 20/577 (3%)
 Frame = +3

Query: 210  SPQLKGIVIISLPPPDNPSLGKTITAFTISDS-PPPPTRTXXXXXXXEV----------R 356
            S ++KG+V+I+LPPPDNPSLGK++TAFT++D  P PP  +       +            
Sbjct: 3    SDKIKGVVVITLPPPDNPSLGKSVTAFTLTDDFPEPPGESVAVDQEVQQPNNDHLTLPPN 62

Query: 357  LPIQSAPHSQPQVS-SRRFFHNSPRIIVGGFLGISLFACILWRSSVYSETLPELRNSD-- 527
            LPIQ AP SQ  +  SR  F  +PR +V   LGI+L A  L+ S+ + ET+ ELR S+  
Sbjct: 63   LPIQ-APLSQRSIPLSRELFAGTPRKLVF-VLGIALAAVYLYASN-FPETIRELRRSERN 119

Query: 528  DDGKPNSFIFPLY--SKLGSRQMWQREVELKLGRFVDIDSKTVVASINDGMKPHNKINKL 701
            DD +P+SF+FPLY  S+LG    +Q    LKLGR V ++   +    ND +    K +KL
Sbjct: 120  DDDRPSSFLFPLYFQSELGDSSDFQ----LKLGRTVRVNKDDLGVRFNDVLGVP-KPSKL 174

Query: 702  ISSQAAIDSSTILPVRGNIYPDGLYFTHMLLGSPPRPYFLDMDTGSDLTWIQCDAPCTSC 881
            IS+    DSS + PVRG+IYPDGLY+T++++G PPRPYFLD+DTGSDLTW+QCDAPC+SC
Sbjct: 175  ISASLKSDSSAVFPVRGDIYPDGLYYTYIMVGEPPRPYFLDIDTGSDLTWVQCDAPCSSC 234

Query: 882  AKGPHPLYKPTKGKIVPSKDSLCVEVQRNQKDGYCETCLQCDYEIEYADHSSSLGVLASD 1061
             KG  PLYKP +  +V  KDSLC+EVQRN     C  C QC+YE++YAD SSSLGVL  D
Sbjct: 235  GKGRSPLYKPRRENVVSFKDSLCMEVQRNYDGDQCAACQQCNYEVQYADQSSSLGVLVKD 294

Query: 1062 DLHLMIANGSMAKFNTVFGCAYDQQGSLLNSLAKTDGILGLSRAKVSLPSQLASQKIINN 1241
            +  L  +NGS+ K N +FGCAYDQQG LLN+L+KTDGILGLSRAKVSLPSQLAS+ IINN
Sbjct: 295  EFTLRFSNGSLTKLNAIFGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQLASRGIINN 354

Query: 1242 VVGHCLTSDVXXXXXXXXXXXFVPYRSMAWVPMLNSPSMNFYHTELVKMSYGNRQLNLG- 1418
            VVGHCLT D            FVP   MAWV ML+SPS++FY T++V++ YG+  L+L  
Sbjct: 355  VVGHCLTGDPAGGGYLFLGDDFVPQWGMAWVAMLDSPSIDFYQTKVVRIDYGSIPLSLDT 414

Query: 1419 -GRDSSHIVFDSGSSYTYLTKQAYFDFVVSLKEIS--GGLIQDASDPTMPICWRAKSPIR 1589
             G     +VFDSGSSYTY TK+AY+  V +L+E+S  G ++QD+SD    ICW+ +  IR
Sbjct: 415  WGSSREQVVFDSGSSYTYFTKEAYYQLVANLEEVSAFGLILQDSSD---TICWKTEQSIR 471

Query: 1590 SVADVKQFFKTLTLQFGSKWWIVSTKLRIPPEGYLVVNNKGNVCLGILDGSDVHDGSTII 1769
            SV DVK FFK LTLQFGS++W+VSTKL I PE YL++N +GNVCLGILDGS VHDGSTII
Sbjct: 472  SVKDVKHFFKPLTLQFGSRFWLVSTKLVILPENYLLINKEGNVCLGILDGSQVHDGSTII 531

Query: 1770 LGDISLHGQMVVYDNVNQKIGWMQSDCVKPQRFKSLP 1880
            LGD +L G++VVYDNVNQ+IGW  SDC  P++ K LP
Sbjct: 532  LGDNALRGKLVVYDNVNQRIGWTSSDCHNPRKIKHLP 568


>ref|XP_007036501.1| Eukaryotic aspartyl protease family protein, putative isoform 2
            [Theobroma cacao] gi|508773746|gb|EOY21002.1| Eukaryotic
            aspartyl protease family protein, putative isoform 2
            [Theobroma cacao]
          Length = 520

 Score =  600 bits (1547), Expect = e-169
 Identities = 314/521 (60%), Positives = 381/521 (73%), Gaps = 18/521 (3%)
 Frame = +3

Query: 195  MEADQSPQ-LKGIVIISLPPPDNPSLGKTITAFTISDSPPPPTRTXXXXXXXEVRLPIQ- 368
            M++D+ PQ + G+VII+LPP DNPSLGKTITAFT+++   P +         E    +  
Sbjct: 1    MDSDERPQQVTGVVIITLPPSDNPSLGKTITAFTLTNDVFPQSHQTQQRQQQEEEQTLPT 60

Query: 369  ---------SAPHSQPQVSSRRFFHNSPRIIVGGFLGISLFACILWRSSVYSETLPELRN 521
                     SA + Q   S    F ++PR ++G FLGISLFA +L+ SS +S T  ELRN
Sbjct: 61   TQILTPAPPSAQNPQRGFSFLGLFSDNPRKLLG-FLGISLFALLLY-SSAFSNTFVELRN 118

Query: 522  S--DDDGKPNSFIFPLYSKLGSRQMWQREVELKLGRFVDIDSKTVVASINDGMKPHNKIN 695
            S  DDD KP SFIFPLY KLG+      ++ELKLGRFVD+D + +VAS+  G     KIN
Sbjct: 119  SNNDDDEKPQSFIFPLYHKLGA------DLELKLGRFVDVDKENLVASVEGGATGTQKIN 172

Query: 696  KLISSQAAI--DSSTILPVRGNIYPDGLYFTHMLLGSPPRPYFLDMDTGSDLTWIQCDAP 869
            KL++S AA+   S TILPVRGN+YPDGLYFT+ML+G+P R YFLD+DTGSDLTWIQCDAP
Sbjct: 173  KLVASNAAVIDSSGTILPVRGNVYPDGLYFTYMLVGNPQRRYFLDIDTGSDLTWIQCDAP 232

Query: 870  CTSCAKGPHPLYKPTKGKIVPSKDSLCVEVQRNQKDGYCETCLQCDYEIEYADHSSSLGV 1049
            C+SCAKG +PLYKPT+  IV SKD +C EVQ+NQK   CETC QCDYEIEYAD SSSLGV
Sbjct: 233  CSSCAKGANPLYKPTRVNIVASKDLMCTEVQKNQKPQNCETCQQCDYEIEYADRSSSLGV 292

Query: 1050 LASDDLHLMIANGSMAKFNTVFGCAYDQQGSLLNSLAKTDGILGLSRAKVSLPSQLASQK 1229
            LA D+LHL+ ANGS    + VFGCAYDQQG LLN+L+KTDGILGLSRAKVSLPSQLAS+ 
Sbjct: 293  LARDELHLVTANGSTTNLDVVFGCAYDQQGILLNTLSKTDGILGLSRAKVSLPSQLASKG 352

Query: 1230 IINNVVGHCLTSDVXXXXXXXXXXXFVPYRSMAWVPMLNSPSMNFYHTELVKMSYGNRQL 1409
            IINNVVGHCL +DV           FVP   M+WVPML SPS  FYHT++VK++YG+  L
Sbjct: 353  IINNVVGHCLATDVGASGYMFLGDDFVPNWGMSWVPMLGSPSTEFYHTQIVKINYGSSSL 412

Query: 1410 NLGGRDSS--HIVFDSGSSYTYLTKQAYFDFVVSLKEISG-GLIQDASDPTMPICWRAKS 1580
            +LG + SS   +VFDSGSSYTY  KQAY + V SL E+S  G IQD +D T+P+CW+A  
Sbjct: 413  SLGRQHSSIGRVVFDSGSSYTYFMKQAYAELVASLSEVSEVGFIQDVADTTLPMCWQAPF 472

Query: 1581 PIRSVADVKQFFKTLTLQFGSKWWIVSTKLRIPPEGYLVVN 1703
            PIR + DVKQFFKTLTLQFGSKWWI+S +  IPPEGYL+++
Sbjct: 473  PIRFIKDVKQFFKTLTLQFGSKWWIISKRFHIPPEGYLIIS 513


>ref|XP_004512995.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
            [Cicer arietinum]
          Length = 1387

 Score =  581 bits (1498), Expect = e-163
 Identities = 313/571 (54%), Positives = 397/571 (69%), Gaps = 18/571 (3%)
 Frame = +3

Query: 207  QSPQLKGIVIISLPPPDNPSLGKTITAFTISDSPPPPTRTXXXXXXXEVRLPIQSAP--H 380
            Q+PQLK +VIIS+PP +NPSLGK ITAFT S++P  P +            PIQS P  H
Sbjct: 10   QTPQLKSVVIISIPPSNNPSLGKKITAFTFSNNPFSPQQQPQNNVPP--MSPIQSYPSNH 67

Query: 381  SQPQVSSRRFFHNSPRIIVGGFLGISLFACILWRSSVYSET----LPELRNSDDDG---- 536
                 S+RRFFH + +I    F GI LFA  L+ S   + T    L EL+N   DG    
Sbjct: 68   QLQFSSTRRFFHTT-QIKFFTFFGIFLFALFLYGSLFSTITTTLELSELKNHHHDGGDDE 126

Query: 537  --KPNSFIFPLYSKLGSRQMWQREVEL---KLGRFVDIDSKTVVASINDGMKPHNKINKL 701
              +P+SF+FPL+ K G   + QR+++L   K G FV     T  +  +DG+   +++  +
Sbjct: 127  SDEPSSFLFPLFKKYGV--VGQRDLKLIDVKKGNFV-----TQKSGDSDGIAFSSRVVAV 179

Query: 702  ISSQAAIDSSTILPVRGNIYPDGLYFTHMLLGSPPRPYFLDMDTGSDLTWIQCDAPCTSC 881
             SS     SST+ P+ GN+YPDGLY+TH+ +G+PP+ YF+D+DTGSDLTWIQCDAPC SC
Sbjct: 180  DSS-----SSTVFPISGNVYPDGLYYTHVRVGNPPKRYFVDVDTGSDLTWIQCDAPCRSC 234

Query: 882  AKGPHPLYKPTKGKIVPSKDSLCVEVQRNQKDGYCETCLQCDYEIEYADHSSSLGVLASD 1061
            AKG +  YKP +  IVPS DSLC+EVQ+NQK+GY E+  QCDYEI+YADHSSS+GVL  D
Sbjct: 235  AKGANVPYKPIRTNIVPSLDSLCLEVQKNQKNGYHESFQQCDYEIQYADHSSSMGVLIRD 294

Query: 1062 DLHLMIANGSMAKFNTVFGCAYDQQGSLLNSLAKTDGILGLSRAKVSLPSQLASQKIINN 1241
            +LHLM  NGS  K N VFGC YDQ+G LLN+L KTDGI+GLSRAKV LP QL+S+ II N
Sbjct: 295  ELHLMTTNGSKTKLNFVFGCGYDQEGLLLNTLTKTDGIMGLSRAKVGLPYQLSSKGIIKN 354

Query: 1242 VVGHCLT-SDVXXXXXXXXXXXFVPYRSMAWVPMLNSPSMNFYHTELVKMSYGNRQLNLG 1418
            VVGHCL+ +D            FVPY  M W PM  +   + Y TE++ ++YGNR L+  
Sbjct: 355  VVGHCLSNNDGVGGGYMFLGDDFVPYWGMTWAPM--TQITDLYQTEVLGINYGNRLLSFD 412

Query: 1419 GRDS-SHIVFDSGSSYTYLTKQAYFDFVVSLKEISG-GLIQDASDPTMPICWRAKSPIRS 1592
            G     ++VFDSGSSYTY  K+AY D V SL+E+SG GL++D SD T+PICW+A  PIRS
Sbjct: 413  GHSKVGNVVFDSGSSYTYFPKEAYRDLVASLEEVSGLGLVEDDSDTTLPICWQANFPIRS 472

Query: 1593 VADVKQFFKTLTLQFGSKWWIVSTKLRIPPEGYLVVNNKGNVCLGILDGSDVHDGSTIIL 1772
            V DVK +FKTLTL+FG+KWWI+ST   IPPEGYL+++NKGNVCL ILDGS+V+DGS+IIL
Sbjct: 473  VKDVKDYFKTLTLRFGNKWWILSTLFHIPPEGYLIISNKGNVCLAILDGSNVNDGSSIIL 532

Query: 1773 GDISLHGQMVVYDNVNQKIGWMQSDCVKPQR 1865
            GDISL G +VVYDNVN+ IGW ++ C  P R
Sbjct: 533  GDISLRGYLVVYDNVNKNIGWERTKCGMPNR 563


>ref|XP_007152782.1| hypothetical protein PHAVU_004G159200g [Phaseolus vulgaris]
            gi|561026091|gb|ESW24776.1| hypothetical protein
            PHAVU_004G159200g [Phaseolus vulgaris]
          Length = 579

 Score =  575 bits (1483), Expect = e-161
 Identities = 300/543 (55%), Positives = 376/543 (69%), Gaps = 16/543 (2%)
 Frame = +3

Query: 195  MEADQSPQLKGIVIISLPPPDNPSLGKTITAFTISD-SPPPPTRTXXXXXXXEVRL---- 359
            M+ DQ PQ+KG+VIISLPPPDNPSLGKTITAFT SD S P P+         +  +    
Sbjct: 1    MDDDQFPQIKGVVIISLPPPDNPSLGKTITAFTFSDPSSPQPSLLLQQSHQHQTNINEYN 60

Query: 360  ----PIQSAP-HSQPQVSSRRFFHNSPRIIVGGFLGISLFACILWRSSVYSETLPEL--- 515
                P+ S P ++Q   S RR FH +P +    F G+ LFA  L+  SV S T  EL   
Sbjct: 61   NTDPPLHSYPSNAQLGFSRRRLFHRTP-VRFFSFFGVFLFALFLY-GSVSSTTTLELSGP 118

Query: 516  -RNSDDDGKPNSFIFPLYSKLGSRQMWQREVELKLGRFVDIDSKTVVASINDGMKPHNKI 692
              + DDDGKP S++FPLY K G   + Q+ ++L+LG+ V  +          G       
Sbjct: 119  KNDGDDDGKPGSYLFPLYPKFGV--LGQKNMKLQLGKLVHKEKLLTQRKYRVG------- 169

Query: 693  NKLISSQAAIDSSTILPVRGNIYPDGLYFTHMLLGSPPRPYFLDMDTGSDLTWIQCDAPC 872
                S   A+DSS++ PV GN++PDGLYFT + +G+PPR YFLD+DTGSDLTW+QCDAPC
Sbjct: 170  ----SEVVAVDSSSVFPVSGNVFPDGLYFTILRVGNPPRSYFLDVDTGSDLTWMQCDAPC 225

Query: 873  TSCAKGPHPLYKPTKGKIVPSKDSLCVEVQRNQKDGYCETCLQCDYEIEYADHSSSLGVL 1052
             SC KG H  YKPT+  +VPS DSLC++VQ+NQKDG+ E+  QCDY+IEYAD SSSLGVL
Sbjct: 226  ISCGKGAHAQYKPTRSNVVPSMDSLCLDVQKNQKDGHHESLQQCDYQIEYADQSSSLGVL 285

Query: 1053 ASDDLHLMIANGSMAKFNTVFGCAYDQQGSLLNSLAKTDGILGLSRAKVSLPSQLASQKI 1232
              D+LHL+  NGS  K N VFGC YDQ+G LLN+LAKTDGILGLSRAKVSLP QLAS+ +
Sbjct: 286  IRDELHLVTTNGSKTKLNFVFGCGYDQEGLLLNTLAKTDGILGLSRAKVSLPYQLASKGL 345

Query: 1233 INNVVGHCLTSDVXXXXXXXXXXXFVPYRSMAWVPMLNSPSMNFYHTELVKMSYGNRQLN 1412
            I NVVGHCL++D            F+PY  M WVPM  + + + Y TE++ ++YGNRQL+
Sbjct: 346  IKNVVGHCLSNDEVGGGYMFLGDDFLPYWGMTWVPMAYTLTTDLYQTEILGINYGNRQLS 405

Query: 1413 LGGRDS-SHIVFDSGSSYTYLTKQAYFDFVVSLKEISG-GLIQDASDPTMPICWRAKSPI 1586
              G+     +VFDSGSSYTY  K+AY D V SL E+SG  LIQD SD T+PICW A  PI
Sbjct: 406  FDGQSKVGKVVFDSGSSYTYFPKEAYLDLVASLNEVSGLRLIQDDSDTTLPICWEANFPI 465

Query: 1587 RSVADVKQFFKTLTLQFGSKWWIVSTKLRIPPEGYLVVNNKGNVCLGILDGSDVHDGSTI 1766
            +SV DVK +FKT+TL+FGSKWWI+ST  +I PEGYL+++NKG+VCLGILDGS+V+DGS+I
Sbjct: 466  KSVKDVKDYFKTITLRFGSKWWILSTMFQIAPEGYLIISNKGHVCLGILDGSNVNDGSSI 525

Query: 1767 ILG 1775
            ILG
Sbjct: 526  ILG 528


>ref|XP_006588295.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
            [Glycine max]
          Length = 1430

 Score =  575 bits (1481), Expect = e-161
 Identities = 303/548 (55%), Positives = 382/548 (69%), Gaps = 17/548 (3%)
 Frame = +3

Query: 195  MEADQSPQLKGIVIISLPPPDNPSLGKTITAFTISDSPPPPT--------RTXXXXXXXE 350
            ME D+SPQ+KG+VIISLPPPDNPSLGKTITAFT S+  P P+        +         
Sbjct: 1    MEDDESPQIKGVVIISLPPPDNPSLGKTITAFTFSNPSPQPSIQPHQHQSQPTHPNAQHN 60

Query: 351  VRLPIQSAPHSQPQVSS--RRFFHNSPRIIVGGFLGISLFACILWRSSVYSETLPELR-- 518
               P+QS P S PQ+S   RR FH++P + +  F GI LFA  L+  SV S T  ELR  
Sbjct: 61   TDPPLQSYP-SNPQLSFSFRRLFHSTP-VKLFSFFGILLFALFLY-GSVSSTTTVELRGR 117

Query: 519  --NSDDDGKPNSFIFPLYSKLGSRQMWQREVELKLGRFVDIDSKTVVASINDGMKPHNKI 692
              + DDD K  SF+FPL+ K G   + Q++++L+LG+    +         DG       
Sbjct: 118  NNDDDDDDKATSFLFPLFPKFGV--LGQKDLKLQLGKLSQKEKFLTHRDDGDG------- 168

Query: 693  NKLISSQAAIDSSTILPVRGNIYPDGLYFTHMLLGSPPRPYFLDMDTGSDLTWIQCDAPC 872
                S   A+DSS++ PV GN+YPDGLYFT + +G+PP+ YFLD+DTGSDLTW+QCDAPC
Sbjct: 169  ----SGVVAVDSSSVFPVSGNVYPDGLYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPC 224

Query: 873  TSCAKGPHPLYKPTKGKIVPSKDSLCVEVQRNQKDGYC-ETCLQCDYEIEYADHSSSLGV 1049
             SC KG H LYKPT+  +V S D+LC++VQ+NQK+G+  E+ LQCDYEI+YADHSSSLGV
Sbjct: 225  ISCGKGAHVLYKPTRSNVVSSVDALCLDVQKNQKNGHHDESLLQCDYEIQYADHSSSLGV 284

Query: 1050 LASDDLHLMIANGSMAKFNTVFGCAYDQQGSLLNSLAKTDGILGLSRAKVSLPSQLASQK 1229
            L  D+LHL+  NGS  K N VFGC YDQ G LLN+L KTDGI+GLSRAKVSLP QLAS+ 
Sbjct: 285  LVRDELHLVTTNGSKTKLNVVFGCGYDQAGLLLNTLGKTDGIMGLSRAKVSLPYQLASKG 344

Query: 1230 IINNVVGHCLTSDVXXXXXXXXXXXFVPYRSMAWVPMLNSPSMNFYHTELVKMSYGNRQL 1409
            +I NVVGHCL++D            FVPY  M WVPM  + + + Y TE++ ++YGNRQL
Sbjct: 345  LIKNVVGHCLSNDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQL 404

Query: 1410 NLGGRDS-SHIVFDSGSSYTYLTKQAYFDFVVSLKEISG-GLIQDASDPTMPICWRAKSP 1583
               G+     +VFDSGSSYTY  K+AY D V SL E+SG GL+QD SD T+PICW+A  P
Sbjct: 405  RFDGQSKVGKMVFDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLPICWQANFP 464

Query: 1584 IRSVADVKQFFKTLTLQFGSKWWIVSTKLRIPPEGYLVVNNKGNVCLGILDGSDVHDGST 1763
            I+SV DVK +FKTLTL+FGSKWWI+ST  +I PEGYL+++NKG+VCLGILDGS+V+DGS+
Sbjct: 465  IKSVKDVKDYFKTLTLRFGSKWWILSTLFQISPEGYLIISNKGHVCLGILDGSNVNDGSS 524

Query: 1764 IILGDISL 1787
            IILG  +L
Sbjct: 525  IILGGSTL 532


>ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297337316|gb|EFH67733.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 578

 Score =  558 bits (1439), Expect = e-156
 Identities = 305/595 (51%), Positives = 398/595 (66%), Gaps = 31/595 (5%)
 Frame = +3

Query: 192  EMEADQSPQLKGIVIISLPPPDNPSLGKTITAFTISDSPPPPTRTXXXXXXXEVRLPIQS 371
            E +  +  +L  +VII+LPP D+PS GKTI+AFT++D   P            +++P + 
Sbjct: 2    EPDLHEQQRLHSVVIITLPPSDDPSQGKTISAFTLNDHDYP------------LQIPPED 49

Query: 372  APHSQPQVS-------SRRFFHN----SPRIIVGGFLGISLFACILWRS----SVYSETL 506
             P+   Q         SR  F +    SPR+++G  LG SL A   + S    SV    +
Sbjct: 50   NPNPSFQPDPLHQNQQSRLLFSDLSMGSPRLVLG-LLGFSLLAVAFYASVFPNSVQMFRV 108

Query: 507  PELRNSDDDG--KPNSFIFPLYSKLGSRQMWQR----EVELKLGRFVDIDSKTVVASIND 668
             + RN DDD   +  SF+FP+Y KL +R+  +R    ++ L+ G+FV+     +V  +  
Sbjct: 109  SDERNRDDDSSRETTSFVFPVYHKLRAREFHERILAEDLGLENGKFVESMDLELVNPV-- 166

Query: 669  GMKPHNKINKLISSQA-AIDSST-ILPVRGNIYPDGLYFTHMLLGSPP--RPYFLDMDTG 836
                  K+N ++S+ A +IDSST I PV GN+YPDGLY+T +L+G P   + Y LD+DTG
Sbjct: 167  ------KVNDVLSTSAGSIDSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTG 220

Query: 837  SDLTWIQCDAPCTSCAKGPHPLYKPTKGKIVPSKDSLCVEVQRNQKDGYCETCLQCDYEI 1016
            SDLTWIQCDAPCTSCAKG + LYKP K  +V S +  CVEVQRNQ   +CE+C QCDYEI
Sbjct: 221  SDLTWIQCDAPCTSCAKGANQLYKPRKDNLVRSSEPFCVEVQRNQLTEHCESCHQCDYEI 280

Query: 1017 EYADHSSSLGVLASDDLHLMIANGSMAKFNTVFGCAYDQQGSLLNSLAKTDGILGLSRAK 1196
            EYADHS S+GVL  D  HL + NGS+A+ + VFGC YDQQG LLN+L KTDGILGLSRAK
Sbjct: 281  EYADHSYSMGVLTKDKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAK 340

Query: 1197 VSLPSQLASQKIINNVVGHCLTSDVXXXXXXXXXXXFVPYRSMAWVPMLNSPSMNFYHTE 1376
            +SLPSQLAS+ II+NVVGHCL SD+            VP   M WVPML+ P +  Y  +
Sbjct: 341  ISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHHPHLEVYQMQ 400

Query: 1377 LVKMSYGNRQLNLGGRDS--SHIVFDSGSSYTYLTKQAYFDFVVSLKEISG-GLIQDASD 1547
            + KMSYGN  L+L G +     ++FD+GSSYTY   QAY   V SL+E+S   L +D SD
Sbjct: 401  VTKMSYGNAMLSLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSDLELTRDDSD 460

Query: 1548 PTMPICWRAK--SPIRSVADVKQFFKTLTLQFGSKWWIVSTKLRIPPEGYLVVNNKGNVC 1721
              +PICWRAK  SPI S++DVK+FF+ +TLQ GSKW I+S KL I PE YL+++NKGNVC
Sbjct: 461  EALPICWRAKTNSPISSLSDVKKFFRPITLQIGSKWLIISKKLLIQPEDYLIISNKGNVC 520

Query: 1722 LGILDGSDVHDGSTIILGDISLHGQMVVYDNVNQKIGWMQSDCVKPQRF-KSLPF 1883
            LGILDGS+VHDGSTII+GDIS+ G+++VYDNV Q+IGWM+SDCV+P  F  ++PF
Sbjct: 521  LGILDGSNVHDGSTIIIGDISMRGRLIVYDNVKQRIGWMKSDCVRPSEFDHNVPF 575


>ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
            gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15
            [Arabidopsis thaliana]
            gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein
            [Arabidopsis thaliana] gi|14532748|gb|AAK64075.1| unknown
            protein [Arabidopsis thaliana]
            gi|332194267|gb|AEE32388.1| aspartyl protease
            [Arabidopsis thaliana]
          Length = 583

 Score =  555 bits (1431), Expect = e-155
 Identities = 302/592 (51%), Positives = 400/592 (67%), Gaps = 30/592 (5%)
 Frame = +3

Query: 198  EADQSPQLKGIVIISLPPPDNPSLGKTITAFTISDSPPPPTRTXXXXXXXEVRLPIQSAP 377
            +  Q  ++  +VII+LPP D+PS GKTI+AFT++D   P            + +P +  P
Sbjct: 7    DQQQQQRVHSVVIITLPPSDDPSQGKTISAFTLTDHDYP------------LEIPPEDNP 54

Query: 378  HSQPQVS-------SRRFFH----NSPRIIVGGFLGISLFACILWRSSVYSETL------ 506
            +   Q         SR  F     NSPR+++G  LGISL A + + +SV+  ++      
Sbjct: 55   NPSFQPDPLHRNQQSRLLFSDLSMNSPRLVLG-LLGISLLA-VAFYASVFPNSVQMFRVS 112

Query: 507  PELRNSDDDG---KPNSFIFPLYSKLGSRQMWQREVELKLGRFVDIDSKTVVASINDGMK 677
            P+ RN DDD    +  SF+FP+Y KL +R+  +R +E  LG    ++++  V S++  + 
Sbjct: 113  PDERNRDDDDNLRETASFVFPVYHKLRAREFHERILEEDLG----LENENFVESMDLELV 168

Query: 678  PHNKINKLISSQA-AIDSST-ILPVRGNIYPDGLYFTHMLLGSPP--RPYFLDMDTGSDL 845
               K+N ++S+ A +IDSST I PV GN+YPDGLY+T +L+G P   + Y LD+DTGS+L
Sbjct: 169  NPVKVNDVLSTSAGSIDSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSEL 228

Query: 846  TWIQCDAPCTSCAKGPHPLYKPTKGKIVPSKDSLCVEVQRNQKDGYCETCLQCDYEIEYA 1025
            TWIQCDAPCTSCAKG + LYKP K  +V S ++ CVEVQRNQ   +CE C QCDYEIEYA
Sbjct: 229  TWIQCDAPCTSCAKGANQLYKPRKDNLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYA 288

Query: 1026 DHSSSLGVLASDDLHLMIANGSMAKFNTVFGCAYDQQGSLLNSLAKTDGILGLSRAKVSL 1205
            DHS S+GVL  D  HL + NGS+A+ + VFGC YDQQG LLN+L KTDGILGLSRAK+SL
Sbjct: 289  DHSYSMGVLTKDKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISL 348

Query: 1206 PSQLASQKIINNVVGHCLTSDVXXXXXXXXXXXFVPYRSMAWVPMLNSPSMNFYHTELVK 1385
            PSQLAS+ II+NVVGHCL SD+            VP   M WVPML+   ++ Y  ++ K
Sbjct: 349  PSQLASRGIISNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTK 408

Query: 1386 MSYGNRQLNLGGRDS--SHIVFDSGSSYTYLTKQAYFDFVVSLKEISG-GLIQDASDPTM 1556
            MSYG   L+L G +     ++FD+GSSYTY   QAY   V SL+E+SG  L +D SD T+
Sbjct: 409  MSYGQGMLSLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETL 468

Query: 1557 PICWRAKS--PIRSVADVKQFFKTLTLQFGSKWWIVSTKLRIPPEGYLVVNNKGNVCLGI 1730
            PICWRAK+  P  S++DVK+FF+ +TLQ GSKW I+S KL I PE YL+++NKGNVCLGI
Sbjct: 469  PICWRAKTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNVCLGI 528

Query: 1731 LDGSDVHDGSTIILGDISLHGQMVVYDNVNQKIGWMQSDCVKPQRF-KSLPF 1883
            LDGS VHDGSTIILGDIS+ G ++VYDNV ++IGWM+SDCV+P+    ++PF
Sbjct: 529  LDGSSVHDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDCVRPREIDHNVPF 580


>ref|XP_006393315.1| hypothetical protein EUTSA_v10011346mg [Eutrema salsugineum]
            gi|557089893|gb|ESQ30601.1| hypothetical protein
            EUTSA_v10011346mg [Eutrema salsugineum]
          Length = 580

 Score =  545 bits (1403), Expect = e-152
 Identities = 295/574 (51%), Positives = 384/574 (66%), Gaps = 15/574 (2%)
 Frame = +3

Query: 207  QSPQLKGIVIISLPPPDNPSLGKTITAFTISDSP-PPPTRTXXXXXXXEVRLPIQSAPHS 383
            Q  ++ G+VII+LPP DNPS GKTI+AFT++D   PP  R            P+   P S
Sbjct: 10   QQQRVHGVVIITLPPSDNPSKGKTISAFTLTDHDYPPDIRPEDERNPSFQPDPLHQNPQS 69

Query: 384  QPQVSSRRFFHNSPRIIVGGFLGISLFACILWRS----SVYSETLPELRNSDDDGKPN-- 545
                S      +SPR+++G  LGISL A   + S    SV    + + R+ D+D +    
Sbjct: 70   GLWFSDLSM--SSPRLVLG-LLGISLLAIAFYGSVFPNSVQLFRVSDERDRDEDNRRETA 126

Query: 546  SFIFPLYSKLGSRQMWQREVELKLGRFVDIDSKTVVASINDGMKPHNKINKLIS-SQAAI 722
            SF+FP+Y KL +R++ +R +   L   V  ++   V SI   +    K+N + S S  ++
Sbjct: 127  SFVFPVYHKLRAREIPERNLAEALD-VVKEENGIFVESIEQELVNPVKVNDVFSASVGSL 185

Query: 723  DSST-ILPVRGNIYPDGLYFTHMLLGSPPRP---YFLDMDTGSDLTWIQCDAPCTSCAKG 890
            DSST I PV G +YPDGLYFT + +G+P +    + LD+DTGSDLTWIQCDAPCTSCAKG
Sbjct: 186  DSSTTIFPVGGYVYPDGLYFTRVFVGNPEKDGHYFHLDIDTGSDLTWIQCDAPCTSCAKG 245

Query: 891  PHPLYKPTKGKIVPSKDSLCVEVQRNQKDGYCETCLQCDYEIEYADHSSSLGVLASDDLH 1070
             + LYKP K K+V S + LCVEVQ+NQ    CE+C QCDYEIEYAD SSSLGVL  D+ H
Sbjct: 246  ANQLYKPRKDKLVGSAEHLCVEVQKNQMTELCESCQQCDYEIEYADLSSSLGVLTKDEFH 305

Query: 1071 LMIANGSMAKFNTVFGCAYDQQGSLLNSLAKTDGILGLSRAKVSLPSQLASQKIINNVVG 1250
            L + NGS+A  + VFGC YDQQG LLN+L K DGILGLSRAK+SLPSQLASQ II+NVVG
Sbjct: 306  LKLHNGSLAASDIVFGCGYDQQGLLLNTLLKKDGILGLSRAKISLPSQLASQGIISNVVG 365

Query: 1251 HCLTSDVXXXXXXXXXXXFVPYRSMAWVPMLNSPSMNFYHTELVKMSYGNRQLNLGGRDS 1430
            HCL SD+            VP   M WVPM +   +  +  ++ K+SYGN  L+L G + 
Sbjct: 366  HCLPSDLNGEGYIFMGSDLVPLHGMTWVPMFHHSHLEVHQMQVTKVSYGNGMLSLSGENG 425

Query: 1431 --SHIVFDSGSSYTYLTKQAYFDFVVSLKEISGGLIQDASDPTMPICWRAKSPIRSVADV 1604
                ++FD+GSSYTY  K+AY   V SL+E+   L +D SD  +PICW+A   I S++DV
Sbjct: 426  RIGKVLFDTGSSYTYFPKKAYSQLVTSLQEVK--LTRDESDKALPICWQANFLISSLSDV 483

Query: 1605 KQFFKTLTLQFGSKWWIVSTKLRIPPEGYLVVNNKGNVCLGILDGSDVHDGSTIILGDIS 1784
            K+F+K +T+Q GSKWWI+S KL I PE YL+++NKGNVCLGILDGS VHDGSTIILGDIS
Sbjct: 484  KRFYKPITIQIGSKWWIISRKLVIQPEDYLIISNKGNVCLGILDGSSVHDGSTIILGDIS 543

Query: 1785 LHGQMVVYDNVNQKIGWMQSDCVKP-QRFKSLPF 1883
            + G+++VYDNV ++IGWM+SDCV+P +  + LPF
Sbjct: 544  MRGRLIVYDNVKRRIGWMKSDCVRPHESDQKLPF 577


Top