BLASTX nr result

ID: Rheum21_contig00006192 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00006192
         (2112 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vi...   627   e-177
gb|EXB56943.1| Aspartic proteinase Asp1 [Morus notabilis]             605   e-170
emb|CBI15437.3| unnamed protein product [Vitis vinifera]              604   e-170
gb|EOY21001.1| Eukaryotic aspartyl protease family protein, puta...   599   e-168
ref|XP_006440945.1| hypothetical protein CICLE_v10019473mg [Citr...   598   e-168
ref|XP_002511959.1| protein with unknown function [Ricinus commu...   585   e-164
ref|XP_006485774.1| PREDICTED: aspartic proteinase Asp1-like [Ci...   580   e-162
ref|XP_002328687.1| predicted protein [Populus trichocarpa] gi|5...   570   e-160
ref|XP_006354730.1| PREDICTED: aspartic proteinase Asp1-like [So...   567   e-159
gb|ESW24775.1| hypothetical protein PHAVU_004G159200g [Phaseolus...   559   e-156
ref|XP_004241624.1| PREDICTED: aspartic proteinase Asp1-like [So...   558   e-156
ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cu...   541   e-151
ref|XP_006583400.1| PREDICTED: aspartic proteinase Asp1-like [Gl...   532   e-148
gb|ESW24776.1| hypothetical protein PHAVU_004G159200g [Phaseolus...   521   e-145
ref|XP_004512995.1| PREDICTED: lysine-specific histone demethyla...   520   e-145
ref|XP_002891474.1| aspartyl protease family protein [Arabidopsi...   516   e-143
ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [S...   516   e-143
ref|XP_006826817.1| hypothetical protein AMTR_s00010p00056950 [A...   515   e-143
gb|ACN34727.1| unknown [Zea mays] gi|413923868|gb|AFW63800.1| hy...   508   e-141
ref|XP_006393315.1| hypothetical protein EUTSA_v10011346mg [Eutr...   506   e-140

>ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
          Length = 686

 Score =  627 bits (1617), Expect = e-177
 Identities = 323/584 (55%), Positives = 404/584 (69%), Gaps = 22/584 (3%)
 Frame = -1

Query: 1953 TMEPEEAQSP-VKGVVVISLPPRDDPSMGKTVAFYTISGDS----HHQINEIRQPQ---- 1801
            T + E  QSP +KGVV+I+LPP D+PS+GKT+  +T+S       HH   ++++ Q    
Sbjct: 115  TRDMEFGQSPQLKGVVIITLPPPDNPSLGKTITAFTLSDPPLDRPHHTHQQLQRQQHQEE 174

Query: 1800 -------------LSIHRPPQNPLRIRARITGAGKRXXXXXXXXXXXXXXXXXXXSPQSP 1660
                         L    PP   L+   R    G                     +  SP
Sbjct: 175  EEEEEEEEEEPHQLPSPSPPNPALQFSVRKLSLGNPRILMGFLGVSLFVFLLWNFASSSP 234

Query: 1659 FKLXXXXXXXXXXXXXXXTFVLPLFPKLGFPGDKSTGDVELKLGRFVLRENGGVGMGRYK 1480
                              +F+LPL+PKLG    +S GD+ELKLG+FV      +  G   
Sbjct: 235  L----VELRRKNDDREPTSFILPLYPKLG---SRSLGDLELKLGKFVDFHVNDMKPGG-- 285

Query: 1479 VNRLFSGSKASEIDSSSMPFPVKGNVYPYGLYFTQLYIGNPPRSYFLDLDTGSDLTWMQC 1300
            +N+L   +  S  DSS++ FPV+G+VYP GLYFT +++G+PPR YFLD+DTGSDLTW+QC
Sbjct: 286  INKL--ATSVSAFDSSTI-FPVRGDVYPNGLYFTHIFVGSPPRRYFLDMDTGSDLTWIQC 342

Query: 1299 DAPCTSCAKGPNSLYKPTRGKIVPPKDSLCLEVQKNLKSVYCDTCKQCDYEIEYADHSSS 1120
            DAPCTSCAKGPN LYKP +G +VP KDSLC+EVQ+NLK+ YC+TC+QCDYEIEYADHSSS
Sbjct: 343  DAPCTSCAKGPNPLYKPKKGNLVPLKDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSS 402

Query: 1119 MGVLVRDELNLMSSNGSLSKPNVVFGCAYDQQGTLLNSLAKTDGILGLSRSKVSLPAQLA 940
            MGVL  D+L+LM +NGSL+K  ++FGCAYDQQG LLNSLAKTDGILGLS++KVSLP+QLA
Sbjct: 403  MGVLASDDLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLA 462

Query: 939  SRGIVKNVFGHCLTSDIAGGGYAFIGDELVPYWNMAWVPMVNHLSVSFYETEVARISYGG 760
            S+ I+ NV GHCLTSD  GGGY F+GD+ VPYW MAWVPM+N  S + Y +++ +IS+G 
Sbjct: 463  SQRIINNVLGHCLTSDATGGGYMFLGDDFVPYWGMAWVPMLNSHSPN-YHSQIMKISHGS 521

Query: 759  KKISLAQSGDAKGRVVFDSGSSYTYLPKQAYADLVHSVNYLLDGKLIQDASDATLPLCWR 580
            +++SL +      RVVFD+GSSYTY PK+AY  LV S+  + D  LIQD SD TLP+CWR
Sbjct: 522  RQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTLPVCWR 581

Query: 579  ADTPIRSVDDVKHFFRPLTFHFGSKWYFVSTKFQIPPEGYLVTNSKGNVCLGILDGSNVH 400
            A  PIRSV DVK FF+PLT  F SKW+ VSTKF+IPPEGYL+ ++KGNVCLGILDGSNVH
Sbjct: 582  AKFPIRSVIDVKQFFQPLTLQFRSKWWIVSTKFRIPPEGYLIISNKGNVCLGILDGSNVH 641

Query: 399  DGSTFILGDVSFRGKLVVYDNENQKVGWTDSNCKSPRRFKSLPF 268
            DGST ILGD+S RGKLVVYDN NQK+GW  S C  P++ KSLPF
Sbjct: 642  DGSTIILGDISLRGKLVVYDNVNQKIGWAQSTCVKPQKIKSLPF 685


>gb|EXB56943.1| Aspartic proteinase Asp1 [Morus notabilis]
          Length = 569

 Score =  605 bits (1560), Expect = e-170
 Identities = 309/572 (54%), Positives = 397/572 (69%), Gaps = 20/572 (3%)
 Frame = -1

Query: 1923 VKGVVVISLPPRDDPSMGKTVAFYTISGDSHHQINEIRQPQLSIH-RPPQNPL------R 1765
            +KGVV+I+LPP D+PS+GKT+  +T+S  S  Q ++  Q Q ++  + PQNP       R
Sbjct: 9    IKGVVIITLPPPDNPSLGKTITAFTLSNSSPTQTHQESQNQNNLPIQSPQNPQLQFPFPR 68

Query: 1764 IRARITGAGKRXXXXXXXXXXXXXXXXXXXSPQSPFKLXXXXXXXXXXXXXXXTFVLPLF 1585
            +R    G  +R                        F                 +F+ PL+
Sbjct: 69   LRL-FHGVPRRLFALLGISIFTLVLFSHV------FPTVVEEFRRSNDDEGPESFIFPLY 121

Query: 1584 PKLGFPGDKSTGDVELKLGRFVL--RENGGVGMG----RYKVNRLFSGSKASEIDSSSMP 1423
             KLG PG K   DVELKLGRFV   +EN GV  G      KVN+L S +  +++DSS++ 
Sbjct: 122  SKLGVPGKK---DVELKLGRFVDFDKENAGVSFGDRVKTQKVNKLVSST--AKVDSSAI- 175

Query: 1422 FPVKGNVYPYGLYFTQLYIGNPPRSYFLDLDTGSDLTWMQCDAPCTSCAKGPNSLYKPTR 1243
             PV+GNVYP GLY+TQ+ +GNPPR Y LD+DTGSDLTW+QCDAPCTSCAKG N LYKPT+
Sbjct: 176  LPVRGNVYPDGLYYTQILVGNPPRPYHLDMDTGSDLTWIQCDAPCTSCAKGANPLYKPTK 235

Query: 1242 GKIVPPKDSLCLEVQKNLKSVYCDTCKQCDYEIEYADHSSSMGVLVRDELNLMSSNGSLS 1063
            G IVP KDS C E+++N K  +C TC+QCDYEI+YAD SSS+GVL +D L+L+  NGSL+
Sbjct: 236  GNIVPSKDSFCTEIRRNQKPGHCKTCQQCDYEIQYADRSSSLGVLAKDGLHLVMENGSLA 295

Query: 1062 KPNVVFGCAYDQQGTLLNSLAKTDGILGLSRSKVSLPAQLASRGIVKNVFGHCLTSDIAG 883
              NVVFGCAYDQQG LLN+LAKTDGILGLSR+KVSLP+QLAS+GI+KNV GHCLT++  G
Sbjct: 296  NVNVVFGCAYDQQGLLLNTLAKTDGILGLSRAKVSLPSQLASKGIIKNVVGHCLTTNAGG 355

Query: 882  GGYAFIGDELVPYWNMAWVPMVNHLSVSFYETEVARISYGGKKISLAQSGDAKGRVVFDS 703
            GGY F+GD+ VP+W M+W+PM+   S+ FY++E+  I+YG   ++L        ++VFDS
Sbjct: 356  GGYMFLGDDFVPHWGMSWIPMLRSPSMDFYQSEIVSINYGSSALNLGAWSSKARQLVFDS 415

Query: 702  GSSYTYLPKQAYADLVHSVNYLLDGKLIQDASDATLPLCWRADTPI-------RSVDDVK 544
            GSSYTY  K+AY+ L+ S+  +    L++D SD +LP+CWRA+TP+       RSV DVK
Sbjct: 416  GSSYTYFNKRAYSALLASLEEVSTTGLVRDRSDPSLPICWRAETPLNCIHMECRSVADVK 475

Query: 543  HFFRPLTFHFGSKWYFVSTKFQIPPEGYLVTNSKGNVCLGILDGSNVHDGSTFILGDVSF 364
             FF+ +T  FGSKW+ +ST+ +IPPEGYL  +SKGNVCLGILDGS VHDG T ILGD+S 
Sbjct: 476  RFFKTITLQFGSKWWIISTRLRIPPEGYLTISSKGNVCLGILDGSKVHDGYTTILGDISL 535

Query: 363  RGKLVVYDNENQKVGWTDSNCKSPRRFKSLPF 268
            RG LVVYDNENQK+GWT+S+C  PRRF SLPF
Sbjct: 536  RGHLVVYDNENQKIGWTNSDCVKPRRFDSLPF 567


>emb|CBI15437.3| unnamed protein product [Vitis vinifera]
          Length = 473

 Score =  604 bits (1558), Expect = e-170
 Identities = 291/445 (65%), Positives = 356/445 (80%)
 Frame = -1

Query: 1602 FVLPLFPKLGFPGDKSTGDVELKLGRFVLRENGGVGMGRYKVNRLFSGSKASEIDSSSMP 1423
            F+LPL+PKLG    +S GD+ELKLG+FV      +  G   +N+L   +  S  DSS++ 
Sbjct: 37   FILPLYPKLG---SRSLGDLELKLGKFVDFHVNDMKPGG--INKL--ATSVSAFDSSTI- 88

Query: 1422 FPVKGNVYPYGLYFTQLYIGNPPRSYFLDLDTGSDLTWMQCDAPCTSCAKGPNSLYKPTR 1243
            FPV+G+VYP GLYFT +++G+PPR YFLD+DTGSDLTW+QCDAPCTSCAKGPN LYKP +
Sbjct: 89   FPVRGDVYPNGLYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKK 148

Query: 1242 GKIVPPKDSLCLEVQKNLKSVYCDTCKQCDYEIEYADHSSSMGVLVRDELNLMSSNGSLS 1063
            G +VP KDSLC+EVQ+NLK+ YC+TC+QCDYEIEYADHSSSMGVL  D+L+LM +NGSL+
Sbjct: 149  GNLVPLKDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANGSLT 208

Query: 1062 KPNVVFGCAYDQQGTLLNSLAKTDGILGLSRSKVSLPAQLASRGIVKNVFGHCLTSDIAG 883
            K  ++FGCAYDQQG LLNSLAKTDGILGLS++KVSLP+QLAS+ I+ NV GHCLTSD  G
Sbjct: 209  KLGIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTSDATG 268

Query: 882  GGYAFIGDELVPYWNMAWVPMVNHLSVSFYETEVARISYGGKKISLAQSGDAKGRVVFDS 703
            GGY F+GD+ VPYW MAWVPM+N  S + Y +++ +IS+G +++SL +      RVVFD+
Sbjct: 269  GGYMFLGDDFVPYWGMAWVPMLNSHSPN-YHSQIMKISHGSRQLSLGRQDGRTERVVFDT 327

Query: 702  GSSYTYLPKQAYADLVHSVNYLLDGKLIQDASDATLPLCWRADTPIRSVDDVKHFFRPLT 523
            GSSYTY PK+AY  LV S+  + D  LIQD SD TLP+CWRA  PIRSV DVK FF+PLT
Sbjct: 328  GSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTLPVCWRAKFPIRSVIDVKQFFQPLT 387

Query: 522  FHFGSKWYFVSTKFQIPPEGYLVTNSKGNVCLGILDGSNVHDGSTFILGDVSFRGKLVVY 343
              F SKW+ VSTKF+IPPEGYL+ ++KGNVCLGILDGSNVHDGST ILGD+S RGKLVVY
Sbjct: 388  LQFRSKWWIVSTKFRIPPEGYLIISNKGNVCLGILDGSNVHDGSTIILGDISLRGKLVVY 447

Query: 342  DNENQKVGWTDSNCKSPRRFKSLPF 268
            DN NQK+GW  S C  P++ KSLPF
Sbjct: 448  DNVNQKIGWAQSTCVKPQKIKSLPF 472


>gb|EOY21001.1| Eukaryotic aspartyl protease family protein, putative isoform 1
            [Theobroma cacao]
          Length = 576

 Score =  599 bits (1544), Expect = e-168
 Identities = 310/587 (52%), Positives = 389/587 (66%), Gaps = 23/587 (3%)
 Frame = -1

Query: 1950 MEPEEAQSPVKGVVVISLPPRDDPSMGKTVAFYTISGDSHHQINEIRQ------------ 1807
            M+ +E    V GVV+I+LPP D+PS+GKT+  +T++ D   Q ++ +Q            
Sbjct: 1    MDSDERPQQVTGVVIITLPPSDNPSLGKTITAFTLTNDVFPQSHQTQQRQQQEEEQTLPT 60

Query: 1806 PQLSIHRPP--QNPLRIRARITGAGKRXXXXXXXXXXXXXXXXXXXSPQSPFK--LXXXX 1639
             Q+    PP  QNP R     +  G                        S F        
Sbjct: 61   TQILTPAPPSAQNPQR---GFSFLGLFSDNPRKLLGFLGISLFALLLYSSAFSNTFVELR 117

Query: 1638 XXXXXXXXXXXTFVLPLFPKLGFPGDKSTGDVELKLGRFVL--REN-----GGVGMGRYK 1480
                       +F+ PL+ KLG        D+ELKLGRFV   +EN      G   G  K
Sbjct: 118  NSNNDDDEKPQSFIFPLYHKLG-------ADLELKLGRFVDVDKENLVASVEGGATGTQK 170

Query: 1479 VNRLFSGSKASEIDSSSMPFPVKGNVYPYGLYFTQLYIGNPPRSYFLDLDTGSDLTWMQC 1300
            +N+L + S A+ IDSS    PV+GNVYP GLYFT + +GNP R YFLD+DTGSDLTW+QC
Sbjct: 171  INKLVA-SNAAVIDSSGTILPVRGNVYPDGLYFTYMLVGNPQRRYFLDIDTGSDLTWIQC 229

Query: 1299 DAPCTSCAKGPNSLYKPTRGKIVPPKDSLCLEVQKNLKSVYCDTCKQCDYEIEYADHSSS 1120
            DAPC+SCAKG N LYKPTR  IV  KD +C EVQKN K   C+TC+QCDYEIEYAD SSS
Sbjct: 230  DAPCSSCAKGANPLYKPTRVNIVASKDLMCTEVQKNQKPQNCETCQQCDYEIEYADRSSS 289

Query: 1119 MGVLVRDELNLMSSNGSLSKPNVVFGCAYDQQGTLLNSLAKTDGILGLSRSKVSLPAQLA 940
            +GVL RDEL+L+++NGS +  +VVFGCAYDQQG LLN+L+KTDGILGLSR+KVSLP+QLA
Sbjct: 290  LGVLARDELHLVTANGSTTNLDVVFGCAYDQQGILLNTLSKTDGILGLSRAKVSLPSQLA 349

Query: 939  SRGIVKNVFGHCLTSDIAGGGYAFIGDELVPYWNMAWVPMVNHLSVSFYETEVARISYGG 760
            S+GI+ NV GHCL +D+   GY F+GD+ VP W M+WVPM+   S  FY T++ +I+YG 
Sbjct: 350  SKGIINNVVGHCLATDVGASGYMFLGDDFVPNWGMSWVPMLGSPSTEFYHTQIVKINYGS 409

Query: 759  KKISLAQSGDAKGRVVFDSGSSYTYLPKQAYADLVHSVNYLLDGKLIQDASDATLPLCWR 580
              +SL +   + GRVVFDSGSSYTY  KQAYA+LV S++ + +   IQD +D TLP+CW+
Sbjct: 410  SSLSLGRQHSSIGRVVFDSGSSYTYFMKQAYAELVASLSEVSEVGFIQDVADTTLPMCWQ 469

Query: 579  ADTPIRSVDDVKHFFRPLTFHFGSKWYFVSTKFQIPPEGYLVTNSKGNVCLGILDGSNVH 400
            A  PIR + DVK FF+ LT  FGSKW+ +S +F IPPEGYL+ + KGNVCLGILDGS VH
Sbjct: 470  APFPIRFIKDVKQFFKTLTLQFGSKWWIISKRFHIPPEGYLIISKKGNVCLGILDGSKVH 529

Query: 399  DGSTFILGDVSFRGKLVVYDNENQKVGWTDSNCKSPRRFKSLPFIAG 259
            DGST ILGD+S RG+LVVYDNE  K+GWT S+C  PRRFKSLPF+ G
Sbjct: 530  DGSTIILGDISLRGQLVVYDNEKLKIGWTQSDCAHPRRFKSLPFVEG 576


>ref|XP_006440945.1| hypothetical protein CICLE_v10019473mg [Citrus clementina]
            gi|557543207|gb|ESR54185.1| hypothetical protein
            CICLE_v10019473mg [Citrus clementina]
          Length = 577

 Score =  598 bits (1543), Expect = e-168
 Identities = 305/583 (52%), Positives = 390/583 (66%), Gaps = 19/583 (3%)
 Frame = -1

Query: 1950 MEPEEAQSP---VKGVVVISLPPRDDPSMGKTVAFYTISGDS-------HHQINEIRQPQ 1801
            M+ +E+ SP   + GVV+I+LPP ++PS+GKT+  YT++ +S       H Q  E   P 
Sbjct: 1    MDSDESPSPPPQLTGVVIITLPPPNNPSLGKTITAYTLTDNSPQSQQTRHRQQQEHPLPP 60

Query: 1800 LSIHRPPQNPLRIRARITGAGKRXXXXXXXXXXXXXXXXXXXSPQSPFKLXXXXXXXXXX 1621
              +H P  +       +   G                           +           
Sbjct: 61   -QLHPPQNSQFNFSLPMLFPGLPRKLFLFLAISIFALILYGSVFSYTLQ---DRYKSNND 116

Query: 1620 XXXXXTFVLPLFPKLGFPGDKSTGDVELKLGRFVLRE---------NGGVGMGRYKVNRL 1468
                 +FV PL+ K G   + S  D E KLGRFV  +         +G +   + K+N+ 
Sbjct: 117  DENKESFVFPLYHKFGIR-EVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKK 175

Query: 1467 FSGSKASEIDSSSMPFPVKGNVYPYGLYFTQLYIGNPPRSYFLDLDTGSDLTWMQCDAPC 1288
               S A  +DSSS+ FP++GN+YP GLYFT + +GNPPR Y+LD+DTGSDLTW+QCDAPC
Sbjct: 176  LVSSNAVAVDSSSI-FPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPC 234

Query: 1287 TSCAKGPNSLYKPTRGKIVPPKDSLCLEVQKNLKSVYCDTCKQCDYEIEYADHSSSMGVL 1108
            +SCAKG N LYKP  G I+P KDSLC+E+Q+N K  YC+TC+QCDYEIEYADHSSSMGVL
Sbjct: 235  SSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVL 294

Query: 1107 VRDELNLMSSNGSLSKPNVVFGCAYDQQGTLLNSLAKTDGILGLSRSKVSLPAQLASRGI 928
             RDEL+L   NGSL+KPNVVFGCAYDQQG LLN+L KTDGILGLSR+KVSLP+QLAS+GI
Sbjct: 295  ARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGI 354

Query: 927  VKNVFGHCLTSDIAGGGYAFIGDELVPYWNMAWVPMVNHLSVSFYETEVARISYGGKKIS 748
            +KNV GHCLT++  GGGY F+G +LVP W MAWVPM++   +  Y TE+ +I+YG   ++
Sbjct: 355  IKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN 414

Query: 747  LAQSGDAKGRVVFDSGSSYTYLPKQAYADLVHSVNYLLDGKLIQDASDATLPLCWRADTP 568
            L       G  +FD+GSSYTY  KQAY++L+ S+  +    L+ DASD TLP+CWRA  P
Sbjct: 415  LGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFP 474

Query: 567  IRSVDDVKHFFRPLTFHFGSKWYFVSTKFQIPPEGYLVTNSKGNVCLGILDGSNVHDGST 388
            IRS+ DVK FF+ LT HFGSKW  VSTKF+I PEGYLV + KGN+CLGILDGS VH+GST
Sbjct: 475  IRSIVDVKQFFKTLTLHFGSKWQIVSTKFRISPEGYLVISKKGNICLGILDGSEVHNGST 534

Query: 387  FILGDVSFRGKLVVYDNENQKVGWTDSNCKSPRRFKSLPFIAG 259
             ILGD+S RG+LVVYDN N+++GW  S+C +P RFKSLPF+ G
Sbjct: 535  IILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPFLEG 577


>ref|XP_002511959.1| protein with unknown function [Ricinus communis]
            gi|223549139|gb|EEF50628.1| protein with unknown function
            [Ricinus communis]
          Length = 583

 Score =  585 bits (1507), Expect = e-164
 Identities = 300/589 (50%), Positives = 391/589 (66%), Gaps = 25/589 (4%)
 Frame = -1

Query: 1950 MEPEEAQSPVKGVVVISLPPRDDPSMGKTVAFYTISGDSH-------HQINEIRQPQLSI 1792
            ME ++  S VK VV+ISLPP ++PS+GKT+  +T++ D H       HQ +E     +  
Sbjct: 1    MESDDQSSHVK-VVIISLPPPNNPSLGKTITAFTLTDDDHDATYPQSHQNHEQEPSIIQT 59

Query: 1791 HR-----------PPQNPLRIRARITGAGKRXXXXXXXXXXXXXXXXXXXSPQSPFK--L 1651
            HR           PPQNP   + + + +G                       +S F   L
Sbjct: 60   HRESQLPVQSPSLPPQNP---QIQFSFSGLYFSTPRKLLFLLCISLFAVIVYRSLFSNTL 116

Query: 1650 XXXXXXXXXXXXXXXTFVLPLFPKLGFPGDKSTGDVELKLGRFVLRENGGVGMGRYKV-- 1477
                           +F+ PL+ K G   + S  ++E K  R V +E+    +    V  
Sbjct: 117  LELKVSDDDNDEKTKSFIFPLYHKFGIR-EISQSNLEHKSIRSVYKESLVASVNDDDVIV 175

Query: 1476 ---NRLFSGSKASEIDSSSMPFPVKGNVYPYGLYFTQLYIGNPPRSYFLDLDTGSDLTWM 1306
               N   + S A+ +DSSS+ FPV+GNVYP GLYFT + +GNPPR Y+LD+DT SDLTW+
Sbjct: 176  PNRNYKLASSNAAAVDSSSV-FPVRGNVYPDGLYFTYILVGNPPRPYYLDIDTASDLTWI 234

Query: 1305 QCDAPCTSCAKGPNSLYKPTRGKIVPPKDSLCLEVQKNLKSVYCDTCKQCDYEIEYADHS 1126
            QCDAPCTSCAKG N+LYKP R  IV PKDSLC+E+ +N K+ YC+TC+QCDYEIEYADHS
Sbjct: 235  QCDAPCTSCAKGANALYKPRRDNIVTPKDSLCVELHRNQKAGYCETCQQCDYEIEYADHS 294

Query: 1125 SSMGVLVRDELNLMSSNGSLSKPNVVFGCAYDQQGTLLNSLAKTDGILGLSRSKVSLPAQ 946
            SSMGVL RDEL+L  +NGS +     FGCAYDQQG LLN+L KTDGILGLS++KVSLP+Q
Sbjct: 295  SSMGVLARDELHLTMANGSSTNLKFNFGCAYDQQGLLLNTLVKTDGILGLSKAKVSLPSQ 354

Query: 945  LASRGIVKNVFGHCLTSDIAGGGYAFIGDELVPYWNMAWVPMVNHLSVSFYETEVARISY 766
            LA+RGI+ NV GHCL +D+ GGGY F+GD+ VP W M+WVPM++  S+  Y+T++ +++Y
Sbjct: 355  LANRGIINNVVGHCLANDVVGGGYMFLGDDFVPRWGMSWVPMLDSPSIDSYQTQIMKLNY 414

Query: 765  GGKKISLAQSGDAKGRVVFDSGSSYTYLPKQAYADLVHSVNYLLDGKLIQDASDATLPLC 586
            G   +SL        R+VFDSGSSYTY  K+AY++LV S+  +    LIQD SD TLP C
Sbjct: 415  GSGPLSLGGQERRVRRIVFDSGSSYTYFTKEAYSELVASLKQVSGEALIQDTSDPTLPFC 474

Query: 585  WRADTPIRSVDDVKHFFRPLTFHFGSKWYFVSTKFQIPPEGYLVTNSKGNVCLGILDGSN 406
            WRA  PIRSV DVK +F+ LT  FGSKW+ +STKF+IPPEGYL+ ++KGNVCLGILDGS+
Sbjct: 475  WRAKFPIRSVIDVKQYFKTLTLQFGSKWWIISTKFRIPPEGYLIISNKGNVCLGILDGSD 534

Query: 405  VHDGSTFILGDVSFRGKLVVYDNENQKVGWTDSNCKSPRRFKSLPFIAG 259
            VHDGS+ ILGD+S RG+L++YDN N K+GWT S+C  P+ F +LPF  G
Sbjct: 535  VHDGSSIILGDISLRGQLIIYDNVNNKIGWTQSDCIKPKTFSTLPFFQG 583


>ref|XP_006485774.1| PREDICTED: aspartic proteinase Asp1-like [Citrus sinensis]
          Length = 577

 Score =  580 bits (1494), Expect = e-162
 Identities = 278/457 (60%), Positives = 343/457 (75%), Gaps = 9/457 (1%)
 Frame = -1

Query: 1602 FVLPLFPKLGFPGDKSTGDVELKLGRFVLRE---------NGGVGMGRYKVNRLFSGSKA 1450
            FV PL+ K G   +    D E KLGRFV  +         +G +   + K+N+    S A
Sbjct: 123  FVFPLYHKFGIR-EVLQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVPSNA 181

Query: 1449 SEIDSSSMPFPVKGNVYPYGLYFTQLYIGNPPRSYFLDLDTGSDLTWMQCDAPCTSCAKG 1270
              +DSSS  FP++GNVYP GLYFT + +GNPPR Y+LD+DTGSDLTW+QCDAPC+SCAKG
Sbjct: 182  VAVDSSST-FPLRGNVYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKG 240

Query: 1269 PNSLYKPTRGKIVPPKDSLCLEVQKNLKSVYCDTCKQCDYEIEYADHSSSMGVLVRDELN 1090
             N LYKP  G I+P KDSLC+E+Q+N K  YC+TC+QCDYEIEYADHSSSMGVL RDEL+
Sbjct: 241  ANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELH 300

Query: 1089 LMSSNGSLSKPNVVFGCAYDQQGTLLNSLAKTDGILGLSRSKVSLPAQLASRGIVKNVFG 910
            L   NGSL+KPNVVFGCAYDQQG LLN+L KTDGILGLSR+KVSLP+QLAS+GI+KNV G
Sbjct: 301  LTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVG 360

Query: 909  HCLTSDIAGGGYAFIGDELVPYWNMAWVPMVNHLSVSFYETEVARISYGGKKISLAQSGD 730
            HCLT++  GGGY F+G +LVP W MAWVPM++   +  Y TE+ +I+YG   ++L     
Sbjct: 361  HCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNS 420

Query: 729  AKGRVVFDSGSSYTYLPKQAYADLVHSVNYLLDGKLIQDASDATLPLCWRADTPIRSVDD 550
              G  +FD+GSSYTY  KQAY++L+ S+  +    L+ DASD TLP+CWRA  PIRS+ D
Sbjct: 421  RVGWALFDTGSSYTYFTKQAYSELIASLKEVSSNGLVLDASDPTLPVCWRAKFPIRSIVD 480

Query: 549  VKHFFRPLTFHFGSKWYFVSTKFQIPPEGYLVTNSKGNVCLGILDGSNVHDGSTFILGDV 370
            VK +F+ LT HFGSKW  VSTKF I PEGYLV + KGN+CLGILDGS VH+GST ILGD+
Sbjct: 481  VKQYFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDI 540

Query: 369  SFRGKLVVYDNENQKVGWTDSNCKSPRRFKSLPFIAG 259
            S RG+LVVYDN N+++GW  S+C +P RFKSLPF+ G
Sbjct: 541  SLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPFLEG 577


>ref|XP_002328687.1| predicted protein [Populus trichocarpa]
            gi|566206181|ref|XP_006374352.1| aspartyl protease family
            protein [Populus trichocarpa] gi|550322111|gb|ERP52149.1|
            aspartyl protease family protein [Populus trichocarpa]
          Length = 603

 Score =  570 bits (1470), Expect = e-160
 Identities = 303/611 (49%), Positives = 390/611 (63%), Gaps = 46/611 (7%)
 Frame = -1

Query: 1950 MEPEEAQSP-VKGVVVISLPPRDDPSMGKTVAFYTISGDSHHQINEIRQP----QLSIHR 1786
            ME ++ QSP +KGVV+ISLPP D+PS+GKT+  +T++ + + Q ++  Q     QL I  
Sbjct: 1    MESDDDQSPQLKGVVIISLPPPDNPSLGKTITAFTLTNNDYPQSHQTPQTHQEDQLPISS 60

Query: 1785 PPQNP-----LRIRARITGAGKRXXXXXXXXXXXXXXXXXXXSPQSPFKLXXXXXXXXXX 1621
            PP  P     L+  +     G                        + F+           
Sbjct: 61   PPPPPSQNSQLQFPSSRLFLGTPRKLLSFVFISLFALAIYSSLFTNTFQ-ELKSNNNDDD 119

Query: 1620 XXXXXTFVLPLFPKLGFPGDKSTGDVELKLGRFVLRENGGVGM----GRYKVNRLFSGSK 1453
                 ++V PL+ KLG   +    D+E  L RFV +EN    +    G +K+++L S + 
Sbjct: 120  DQKPKSYVFPLYHKLGIR-EIPLNDLENHLRRFVYKENLVASVDHLNGPHKISKLASSNA 178

Query: 1452 ASEIDSSSMPFPVKGNVYPYGLYFTQLYIGNPPRSYFLDLDTGSDLTWMQCDAPCTSCAK 1273
            A+ +DSS++ FPV+GN+YP G          PP+ Y+LD DTGSDLTW+QCDAPCTSCAK
Sbjct: 179  AAAMDSSAI-FPVRGNLYPDG----------PPQPYYLDFDTGSDLTWIQCDAPCTSCAK 227

Query: 1272 GPNSLYKPTRGKIVPPKDSLCLEVQKNLKSVYCDTCKQCDYEIEYADHSSSMGVLVRDEL 1093
            G N+ YKP RG IVPPKD LC+EVQ+N K+ YC+TC QCDYEIEYADHSSSMGVL  D+L
Sbjct: 228  GANAWYKPRRGNIVPPKDLLCMEVQRNQKAGYCETCDQCDYEIEYADHSSSMGVLATDKL 287

Query: 1092 NLMSSNGSLSKPNVVFGCAYDQQGTLLNSLAKTDGILGLSRSKVSLPAQLASRGIVKNVF 913
             LM +NGSL+K N +FGCAYDQQG LL +L KTDGILGLSR+KVSLP+QLAS+GI+ NV 
Sbjct: 288  LLMVANGSLTKLNFIFGCAYDQQGLLLKTLVKTDGILGLSRAKVSLPSQLASQGIINNVI 347

Query: 912  GHCLTSDIAGGGYAFIGDELVPYWNMAWVPMVNHLSVSFYETEVARISYGGKKISLAQSG 733
            GHCLT+D+ GGGY F+GD+ VP W MAWVPM++  S+ FY TEV +++YG   +SL    
Sbjct: 348  GHCLTTDLGGGGYMFLGDDFVPRWGMAWVPMLDSPSMEFYHTEVVKLNYGSSPLSLGGME 407

Query: 732  DAKGRVVFDSGSSYTYLPKQAYADLVHSVNYLLDGKLIQDASDATLPLCWRADTPIRSV- 556
                 ++FDSGSSYTY PK+AY++LV S+N +    L+Q  SD TLPLCWRA+ PIR   
Sbjct: 408  SRVKHILFDSGSSYTYFPKEAYSELVASLNEVSGAGLVQSTSDTTLPLCWRANFPIRKFI 467

Query: 555  -------------------------------DDVKHFFRPLTFHFGSKWYFVSTKFQIPP 469
                                            DVK FF+ LTF FG+KW  +STKF+IPP
Sbjct: 468  YRTELTRPIRRRRRRRRRRRRRRRRRRQHIKGDVKKFFKTLTFQFGTKWLVISTKFRIPP 527

Query: 468  EGYLVTNSKGNVCLGILDGSNVHDGSTFILGDVSFRGKLVVYDNENQKVGWTDSNCKSPR 289
            EGYL+ + KGNVCLGIL+GS VHDGST ILGD+S RG+LVVYDN N+K+GWT S+C  P+
Sbjct: 528  EGYLMMSDKGNVCLGILEGSKVHDGSTIILGDISLRGQLVVYDNVNKKIGWTPSDCAKPK 587

Query: 288  RFKSLPFIAGM 256
            R  SL F  G+
Sbjct: 588  RSDSLQFFDGL 598


>ref|XP_006354730.1| PREDICTED: aspartic proteinase Asp1-like [Solanum tuberosum]
          Length = 558

 Score =  567 bits (1461), Expect = e-159
 Identities = 287/563 (50%), Positives = 378/563 (67%), Gaps = 2/563 (0%)
 Frame = -1

Query: 1950 MEPEEAQSPVKGVVVISLPPRDDPSMGKTVAFYTISGDSHHQINEIRQP--QLSIHRPPQ 1777
            ME  +   P++GVV+I+LPP D+PS GKT+  +T+S    HQ  +  +P  Q   H    
Sbjct: 1    MEETKNSPPIQGVVIITLPPPDNPSYGKTITAFTLSDSPTHQQQQEEEPPQQSQPHNQDL 60

Query: 1776 NPLRIRARITGAGKRXXXXXXXXXXXXXXXXXXXSPQSPFKLXXXXXXXXXXXXXXXTFV 1597
            N   +RA +  +                      S  +   L               +F+
Sbjct: 61   NTGVLRASLERSFFFRPKIVFGLLGISLIALSFWSSLTQETLFELRDVEHDHKSSNSSFI 120

Query: 1596 LPLFPKLGFPGDKSTGDVELKLGRFVLRENGGVGMGRYKVNRLFSGSKASEIDSSSMPFP 1417
            LPL+PK G   + S  DVE KLGRFV  +     M + K+ +  S S A+++DSS + FP
Sbjct: 121  LPLYPKRGGAWN-SRRDVEFKLGRFVDFKPDKF-MDQEKIAK--SLSAATKLDSS-VNFP 175

Query: 1416 VKGNVYPYGLYFTQLYIGNPPRSYFLDLDTGSDLTWMQCDAPCTSCAKGPNSLYKPTRGK 1237
            V+GN++  GLY+T + +GNPPR YFLD+DTGSDL W+QCDAPCTSCAKG + LYKP    
Sbjct: 176  VRGNIHSEGLYYTYMLVGNPPRPYFLDIDTGSDLMWIQCDAPCTSCAKGAHPLYKPRNVN 235

Query: 1236 IVPPKDSLCLEVQKNLKSVYCDTCKQCDYEIEYADHSSSMGVLVRDELNLMSSNGSLSKP 1057
            ++PPK+  C+EVQ+NLKS YCD C QCDYEIEYAD SSS+GVL +DEL L+ +NG+ +KP
Sbjct: 236  MIPPKNPYCVEVQENLKSKYCDNCHQCDYEIEYADRSSSVGVLAKDELQLVLANGTGTKP 295

Query: 1056 NVVFGCAYDQQGTLLNSLAKTDGILGLSRSKVSLPAQLASRGIVKNVFGHCLTSDIAGGG 877
            +VVFGCAYDQQGTLLN+LA TDGILGLSR+ +SLP+QLAS G++ NV GHCL +D   GG
Sbjct: 296  SVVFGCAYDQQGTLLNTLASTDGILGLSRAPISLPSQLASHGLINNVIGHCLRTD-TNGG 354

Query: 876  YAFIGDELVPYWNMAWVPMVNHLSVSFYETEVARISYGGKKISLAQSGDAKGRVVFDSGS 697
            Y F+G++ VP W M+WVPM+N+   + Y+ ++ +++YGGK++ L  +   +G VVFDSGS
Sbjct: 355  YLFLGNDFVPQWRMSWVPMLNNPFPNLYQAQLMKMNYGGKELRLGSTSYGQGTVVFDSGS 414

Query: 696  SYTYLPKQAYADLVHSVNYLLDGKLIQDASDATLPLCWRADTPIRSVDDVKHFFRPLTFH 517
            +YTY   QAY  L+  +  +    LI+DASD TLP+CWRA  P+RS+++V+ FF+PL   
Sbjct: 415  TYTYFTDQAYKALISMLEEISSEDLIKDASDTTLPICWRAKFPVRSIEEVRQFFKPLNLQ 474

Query: 516  FGSKWYFVSTKFQIPPEGYLVTNSKGNVCLGILDGSNVHDGSTFILGDVSFRGKLVVYDN 337
            FGSKW  VSTK  IP EG+L  + KGNVCLGILDGSNVHDGS  ILGD+S RG+L VYDN
Sbjct: 475  FGSKWRIVSTKLWIPAEGFLTISEKGNVCLGILDGSNVHDGSAIILGDISLRGQLFVYDN 534

Query: 336  ENQKVGWTDSNCKSPRRFKSLPF 268
             NQK+GW  SNC+ P +  SLPF
Sbjct: 535  VNQKIGWIRSNCERPEKVPSLPF 557


>gb|ESW24775.1| hypothetical protein PHAVU_004G159200g [Phaseolus vulgaris]
          Length = 572

 Score =  559 bits (1441), Expect = e-156
 Identities = 294/565 (52%), Positives = 379/565 (67%), Gaps = 17/565 (3%)
 Frame = -1

Query: 1923 VKGVVVISLPPRDDPSMGKTVAFYTISGDSHHQ--------------INEIRQPQLSIHR 1786
            +KGVV+ISLPP D+PS+GKT+  +T S  S  Q              INE       +H 
Sbjct: 9    IKGVVIISLPPPDNPSLGKTITAFTFSDPSSPQPSLLLQQSHQHQTNINEYNNTDPPLHS 68

Query: 1785 PPQNPLRIRARITGAGKRXXXXXXXXXXXXXXXXXXXSPQSPFKLXXXXXXXXXXXXXXX 1606
             P N     +R     +                    S  S   L               
Sbjct: 69   YPSNAQLGFSRRRLFHRTPVRFFSFFGVFLFALFLYGSVSSTTTLELSGPKNDGDDDGKP 128

Query: 1605 T-FVLPLFPKLGFPGDKSTGDVELKLGRFVLRENGGVGMGRYKVNRLFSGSKASEIDSSS 1429
              ++ PL+PK G  G K+   ++L+LG+ V +E   +   +Y+V     GS+   +DSSS
Sbjct: 129  GSYLFPLYPKFGVLGQKN---MKLQLGKLVHKEKL-LTQRKYRV-----GSEVVAVDSSS 179

Query: 1428 MPFPVKGNVYPYGLYFTQLYIGNPPRSYFLDLDTGSDLTWMQCDAPCTSCAKGPNSLYKP 1249
            + FPV GNV+P GLYFT L +GNPPRSYFLD+DTGSDLTWMQCDAPC SC KG ++ YKP
Sbjct: 180  V-FPVSGNVFPDGLYFTILRVGNPPRSYFLDVDTGSDLTWMQCDAPCISCGKGAHAQYKP 238

Query: 1248 TRGKIVPPKDSLCLEVQKNLKSVYCDTCKQCDYEIEYADHSSSMGVLVRDELNLMSSNGS 1069
            TR  +VP  DSLCL+VQKN K  + ++ +QCDY+IEYAD SSS+GVL+RDEL+L+++NGS
Sbjct: 239  TRSNVVPSMDSLCLDVQKNQKDGHHESLQQCDYQIEYADQSSSLGVLIRDELHLVTTNGS 298

Query: 1068 LSKPNVVFGCAYDQQGTLLNSLAKTDGILGLSRSKVSLPAQLASRGIVKNVFGHCLTSDI 889
             +K N VFGC YDQ+G LLN+LAKTDGILGLSR+KVSLP QLAS+G++KNV GHCL++D 
Sbjct: 299  KTKLNFVFGCGYDQEGLLLNTLAKTDGILGLSRAKVSLPYQLASKGLIKNVVGHCLSNDE 358

Query: 888  AGGGYAFIGDELVPYWNMAWVPMVNHLSVSFYETEVARISYGGKKISLAQSGDAK-GRVV 712
             GGGY F+GD+ +PYW M WVPM   L+   Y+TE+  I+YG +++S    G +K G+VV
Sbjct: 359  VGGGYMFLGDDFLPYWGMTWVPMAYTLTTDLYQTEILGINYGNRQLSF--DGQSKVGKVV 416

Query: 711  FDSGSSYTYLPKQAYADLVHSVNYLLDGKLIQDASDATLPLCWRADTPIRSVDDVKHFFR 532
            FDSGSSYTY PK+AY DLV S+N +   +LIQD SD TLP+CW A+ PI+SV DVK +F+
Sbjct: 417  FDSGSSYTYFPKEAYLDLVASLNEVSGLRLIQDDSDTTLPICWEANFPIKSVKDVKDYFK 476

Query: 531  PLTFHFGSKWYFVSTKFQIPPEGYLVTNSKGNVCLGILDGSNVHDGSTFILGDVSFRGKL 352
             +T  FGSKW+ +ST FQI PEGYL+ ++KG+VCLGILDGSNV+DGS+ ILGD+SFRG L
Sbjct: 477  TITLRFGSKWWILSTMFQIAPEGYLIISNKGHVCLGILDGSNVNDGSSIILGDISFRGYL 536

Query: 351  VVYDNENQKVGWTDSNC-KSPRRFK 280
            VVYDN  QK+GW  + C  S RR +
Sbjct: 537  VVYDNSKQKIGWKRAECGMSSRRLR 561


>ref|XP_004241624.1| PREDICTED: aspartic proteinase Asp1-like [Solanum lycopersicum]
          Length = 562

 Score =  558 bits (1437), Expect = e-156
 Identities = 285/567 (50%), Positives = 374/567 (65%), Gaps = 6/567 (1%)
 Frame = -1

Query: 1950 MEPEEAQSPVKGVVVISLPPRDDPSMGKTVAFYTISGDSHHQINEIRQ-----PQLSI-H 1789
            ME  +   P++GVV+I+LPP D+PS GKT+  +T+S    HQ  + ++     PQ S  H
Sbjct: 1    MEETKNSPPIQGVVIITLPPPDNPSYGKTITAFTLSDSPTHQQQQEQEQEEEPPQQSQPH 60

Query: 1788 RPPQNPLRIRARITGAGKRXXXXXXXXXXXXXXXXXXXSPQSPFKLXXXXXXXXXXXXXX 1609
                N   +   +  +                      S  +   L              
Sbjct: 61   NQDVNAGVLHVSLERSFFFRPTIVFGLLGISLIALSFWSSLTQETLFELRDVEQDHKSSN 120

Query: 1608 XTFVLPLFPKLGFPGDKSTGDVELKLGRFVLRENGGVGMGRYKVNRLFSGSKASEIDSSS 1429
             +F+LPL+PK G   +  T DVE KLGRFV  +     M + K+ +  S S A+++DSS+
Sbjct: 121  SSFILPLYPKRGGAWNSRT-DVEFKLGRFVDFKPDNF-MDQEKIAK--SLSAATKLDSSA 176

Query: 1428 MPFPVKGNVYPYGLYFTQLYIGNPPRSYFLDLDTGSDLTWMQCDAPCTSCAKGPNSLYKP 1249
              FPV+GN++  GLY+T + +GNPP+ YFLD+DTGSDL W+QCDAPCTSCAKG + LYKP
Sbjct: 177  N-FPVRGNIHSEGLYYTYMLVGNPPKPYFLDIDTGSDLMWIQCDAPCTSCAKGAHPLYKP 235

Query: 1248 TRGKIVPPKDSLCLEVQKNLKSVYCDTCKQCDYEIEYADHSSSMGVLVRDELNLMSSNGS 1069
                ++PPK+  C+EVQ+NL+S YCD C QCDYEIEYAD SSS+GVL +DEL L+ +NG+
Sbjct: 236  RNVNMIPPKNPYCVEVQENLRSKYCDNCHQCDYEIEYADRSSSVGVLAKDELQLVLANGT 295

Query: 1068 LSKPNVVFGCAYDQQGTLLNSLAKTDGILGLSRSKVSLPAQLASRGIVKNVFGHCLTSDI 889
             +KPNVVFGCAYDQQGTLLN+LA TDGILGLSR+ +SLP+QLAS G++ NV GHCL +D 
Sbjct: 296  GTKPNVVFGCAYDQQGTLLNTLASTDGILGLSRAPISLPSQLASHGLINNVIGHCLRTD- 354

Query: 888  AGGGYAFIGDELVPYWNMAWVPMVNHLSVSFYETEVARISYGGKKISLAQSGDAKGRVVF 709
              GGY F+G++ VP W M+WVPM+N+   + Y+ ++ +++YGGK + L   G  +  VVF
Sbjct: 355  TNGGYLFLGNDFVPQWRMSWVPMLNNPFPNLYQAQLMKMNYGGKDLQLGSRGYGQDSVVF 414

Query: 708  DSGSSYTYLPKQAYADLVHSVNYLLDGKLIQDASDATLPLCWRADTPIRSVDDVKHFFRP 529
            DSGS+YTY   QAY  L+  +  +    LI+DASD TLP+CWRA  P+RS+++V+ FF+P
Sbjct: 415  DSGSTYTYFTDQAYKALISMLEEISSEDLIKDASDTTLPICWRAKFPVRSIEEVRQFFKP 474

Query: 528  LTFHFGSKWYFVSTKFQIPPEGYLVTNSKGNVCLGILDGSNVHDGSTFILGDVSFRGKLV 349
            L   FGSKW  VSTK  IP EGYL  + K NVCLGILDGSNVHDGS  ILGD+S RG+L 
Sbjct: 475  LNLQFGSKWRVVSTKLWIPAEGYLTISEKSNVCLGILDGSNVHDGSAIILGDISLRGQLF 534

Query: 348  VYDNENQKVGWTDSNCKSPRRFKSLPF 268
            VYDN NQK+GW  SNC+ P    SLPF
Sbjct: 535  VYDNVNQKIGWIRSNCERPENVPSLPF 561


>ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
            gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic
            proteinase Asp1-like [Cucumis sativus]
          Length = 570

 Score =  541 bits (1394), Expect = e-151
 Identities = 291/575 (50%), Positives = 376/575 (65%), Gaps = 24/575 (4%)
 Frame = -1

Query: 1923 VKGVVVISLPPRDDPSMGKTVAFYTIS-------GDSHHQINEIRQPQ-----LSIHRPP 1780
            +KGVVVI+LPP D+PS+GK+V  +T++       G+S     E++QP      L  + P 
Sbjct: 6    IKGVVVITLPPPDNPSLGKSVTAFTLTDDFPEPPGESVAVDQEVQQPNNDHLTLPPNLPI 65

Query: 1779 QNPLRIRA-----RITGAGKRXXXXXXXXXXXXXXXXXXXSPQSPFKLXXXXXXXXXXXX 1615
            Q PL  R+      +     R                    P++  +L            
Sbjct: 66   QAPLSQRSIPLSRELFAGTPRKLVFVLGIALAAVYLYASNFPETIRELRRSERNDDDRPS 125

Query: 1614 XXXTFVLPLFPKLGFPGDKSTGDVELKLGRFVLRENGGVG------MGRYKVNRLFSGSK 1453
                F+ PL+ +    GD S  D +LKLGR V      +G      +G  K ++L S S 
Sbjct: 126  S---FLFPLYFQSEL-GDSS--DFQLKLGRTVRVNKDDLGVRFNDVLGVPKPSKLISASL 179

Query: 1452 ASEIDSSSMPFPVKGNVYPYGLYFTQLYIGNPPRSYFLDLDTGSDLTWMQCDAPCTSCAK 1273
             S+   SS  FPV+G++YP GLY+T + +G PPR YFLD+DTGSDLTW+QCDAPC+SC K
Sbjct: 180  KSD---SSAVFPVRGDIYPDGLYYTYIMVGEPPRPYFLDIDTGSDLTWVQCDAPCSSCGK 236

Query: 1272 GPNSLYKPTRGKIVPPKDSLCLEVQKNLKSVYCDTCKQCDYEIEYADHSSSMGVLVRDEL 1093
            G + LYKP R  +V  KDSLC+EVQ+N     C  C+QC+YE++YAD SSS+GVLV+DE 
Sbjct: 237  GRSPLYKPRRENVVSFKDSLCMEVQRNYDGDQCAACQQCNYEVQYADQSSSLGVLVKDEF 296

Query: 1092 NLMSSNGSLSKPNVVFGCAYDQQGTLLNSLAKTDGILGLSRSKVSLPAQLASRGIVKNVF 913
             L  SNGSL+K N +FGCAYDQQG LLN+L+KTDGILGLSR+KVSLP+QLASRGI+ NV 
Sbjct: 297  TLRFSNGSLTKLNAIFGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQLASRGIINNVV 356

Query: 912  GHCLTSDIAGGGYAFIGDELVPYWNMAWVPMVNHLSVSFYETEVARISYGGKKISLAQSG 733
            GHCLT D AGGGY F+GD+ VP W MAWV M++  S+ FY+T+V RI YG   +SL   G
Sbjct: 357  GHCLTGDPAGGGYLFLGDDFVPQWGMAWVAMLDSPSIDFYQTKVVRIDYGSIPLSLDTWG 416

Query: 732  DAKGRVVFDSGSSYTYLPKQAYADLVHSVNYL-LDGKLIQDASDATLPLCWRADTPIRSV 556
             ++ +VVFDSGSSYTY  K+AY  LV ++  +   G ++QD+SD    +CW+ +  IRSV
Sbjct: 417  SSREQVVFDSGSSYTYFTKEAYYQLVANLEEVSAFGLILQDSSDT---ICWKTEQSIRSV 473

Query: 555  DDVKHFFRPLTFHFGSKWYFVSTKFQIPPEGYLVTNSKGNVCLGILDGSNVHDGSTFILG 376
             DVKHFF+PLT  FGS+++ VSTK  I PE YL+ N +GNVCLGILDGS VHDGST ILG
Sbjct: 474  KDVKHFFKPLTLQFGSRFWLVSTKLVILPENYLLINKEGNVCLGILDGSQVHDGSTIILG 533

Query: 375  DVSFRGKLVVYDNENQKVGWTDSNCKSPRRFKSLP 271
            D + RGKLVVYDN NQ++GWT S+C +PR+ K LP
Sbjct: 534  DNALRGKLVVYDNVNQRIGWTSSDCHNPRKIKHLP 568


>ref|XP_006583400.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 574

 Score =  532 bits (1370), Expect = e-148
 Identities = 263/440 (59%), Positives = 333/440 (75%), Gaps = 3/440 (0%)
 Frame = -1

Query: 1602 FVLPLFPKLGFPGDKSTGDVELKLGRFVLRENGGVGMGRYKVNR-LFSGSKASEIDSSSM 1426
            F+ PLFPK G  G K   D++L+LG+ V +E       ++   R +  GS    +DSSS+
Sbjct: 132  FLFPLFPKFGVLGQK---DLKLQLGKLVQKE-------KFLTQRDVGDGSGVVAVDSSSV 181

Query: 1425 PFPVKGNVYPYGLYFTQLYIGNPPRSYFLDLDTGSDLTWMQCDAPCTSCAKGPNSLYKPT 1246
             FPV GNVYP GLYFT L +GNPP+SYFLD+DTGSDLTWMQCDAPC SC KG +  YKPT
Sbjct: 182  -FPVSGNVYPDGLYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCRSCGKGAHVQYKPT 240

Query: 1245 RGKIVPPKDSLCLEVQKNLKSVYCD-TCKQCDYEIEYADHSSSMGVLVRDELNLMSSNGS 1069
            R  +V   DSLCL+VQKN K+ + D +  QCDYEI+YADHSSS+GVLVRDEL+L+++NGS
Sbjct: 241  RSNVVSSVDSLCLDVQKNQKNGHHDESLLQCDYEIQYADHSSSLGVLVRDELHLVTTNGS 300

Query: 1068 LSKPNVVFGCAYDQQGTLLNSLAKTDGILGLSRSKVSLPAQLASRGIVKNVFGHCLTSDI 889
             +K NVVFGC YDQ+G +LN+LAKTDGI+GLSR+KVSLP QLAS+G++KNV GHCL++D 
Sbjct: 301  KTKLNVVFGCGYDQEGLILNTLAKTDGIMGLSRAKVSLPYQLASKGLIKNVVGHCLSNDG 360

Query: 888  AGGGYAFIGDELVPYWNMAWVPMVNHLSVSFYETEVARISYGGKKISLAQSGDAK-GRVV 712
            AGGGY F+GD+ VPYW M WVPM   L+   Y+TE+  I+YG +++     G +K G+V 
Sbjct: 361  AGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLKF--DGQSKVGKVF 418

Query: 711  FDSGSSYTYLPKQAYADLVHSVNYLLDGKLIQDASDATLPLCWRADTPIRSVDDVKHFFR 532
            FDSGSSYTY PK+AY DLV S+N +    L+QD SD TLP+CW+A+  IRS+ DVK +F+
Sbjct: 419  FDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLPICWQANFQIRSIKDVKDYFK 478

Query: 531  PLTFHFGSKWYFVSTKFQIPPEGYLVTNSKGNVCLGILDGSNVHDGSTFILGDVSFRGKL 352
             LT  FGSKW+ +ST FQIPPEGYL+ ++KG+VCLGILDGS V+DGS+ ILGD+S RG  
Sbjct: 479  TLTLRFGSKWWILSTLFQIPPEGYLIISNKGHVCLGILDGSKVNDGSSIILGDISLRGYS 538

Query: 351  VVYDNENQKVGWTDSNCKSP 292
            VVYDN  QK+GW  ++C  P
Sbjct: 539  VVYDNVKQKIGWKRADCGMP 558


>gb|ESW24776.1| hypothetical protein PHAVU_004G159200g [Phaseolus vulgaris]
          Length = 579

 Score =  521 bits (1343), Expect = e-145
 Identities = 275/532 (51%), Positives = 356/532 (66%), Gaps = 16/532 (3%)
 Frame = -1

Query: 1923 VKGVVVISLPPRDDPSMGKTVAFYTISGDSHHQ--------------INEIRQPQLSIHR 1786
            +KGVV+ISLPP D+PS+GKT+  +T S  S  Q              INE       +H 
Sbjct: 9    IKGVVIISLPPPDNPSLGKTITAFTFSDPSSPQPSLLLQQSHQHQTNINEYNNTDPPLHS 68

Query: 1785 PPQNPLRIRARITGAGKRXXXXXXXXXXXXXXXXXXXSPQSPFKLXXXXXXXXXXXXXXX 1606
             P N     +R     +                    S  S   L               
Sbjct: 69   YPSNAQLGFSRRRLFHRTPVRFFSFFGVFLFALFLYGSVSSTTTLELSGPKNDGDDDGKP 128

Query: 1605 T-FVLPLFPKLGFPGDKSTGDVELKLGRFVLRENGGVGMGRYKVNRLFSGSKASEIDSSS 1429
              ++ PL+PK G  G K+   ++L+LG+ V +E   +   +Y+V     GS+   +DSSS
Sbjct: 129  GSYLFPLYPKFGVLGQKN---MKLQLGKLVHKEKL-LTQRKYRV-----GSEVVAVDSSS 179

Query: 1428 MPFPVKGNVYPYGLYFTQLYIGNPPRSYFLDLDTGSDLTWMQCDAPCTSCAKGPNSLYKP 1249
            + FPV GNV+P GLYFT L +GNPPRSYFLD+DTGSDLTWMQCDAPC SC KG ++ YKP
Sbjct: 180  V-FPVSGNVFPDGLYFTILRVGNPPRSYFLDVDTGSDLTWMQCDAPCISCGKGAHAQYKP 238

Query: 1248 TRGKIVPPKDSLCLEVQKNLKSVYCDTCKQCDYEIEYADHSSSMGVLVRDELNLMSSNGS 1069
            TR  +VP  DSLCL+VQKN K  + ++ +QCDY+IEYAD SSS+GVL+RDEL+L+++NGS
Sbjct: 239  TRSNVVPSMDSLCLDVQKNQKDGHHESLQQCDYQIEYADQSSSLGVLIRDELHLVTTNGS 298

Query: 1068 LSKPNVVFGCAYDQQGTLLNSLAKTDGILGLSRSKVSLPAQLASRGIVKNVFGHCLTSDI 889
             +K N VFGC YDQ+G LLN+LAKTDGILGLSR+KVSLP QLAS+G++KNV GHCL++D 
Sbjct: 299  KTKLNFVFGCGYDQEGLLLNTLAKTDGILGLSRAKVSLPYQLASKGLIKNVVGHCLSNDE 358

Query: 888  AGGGYAFIGDELVPYWNMAWVPMVNHLSVSFYETEVARISYGGKKISLAQSGDAK-GRVV 712
             GGGY F+GD+ +PYW M WVPM   L+   Y+TE+  I+YG +++S    G +K G+VV
Sbjct: 359  VGGGYMFLGDDFLPYWGMTWVPMAYTLTTDLYQTEILGINYGNRQLSF--DGQSKVGKVV 416

Query: 711  FDSGSSYTYLPKQAYADLVHSVNYLLDGKLIQDASDATLPLCWRADTPIRSVDDVKHFFR 532
            FDSGSSYTY PK+AY DLV S+N +   +LIQD SD TLP+CW A+ PI+SV DVK +F+
Sbjct: 417  FDSGSSYTYFPKEAYLDLVASLNEVSGLRLIQDDSDTTLPICWEANFPIKSVKDVKDYFK 476

Query: 531  PLTFHFGSKWYFVSTKFQIPPEGYLVTNSKGNVCLGILDGSNVHDGSTFILG 376
             +T  FGSKW+ +ST FQI PEGYL+ ++KG+VCLGILDGSNV+DGS+ ILG
Sbjct: 477  TITLRFGSKWWILSTMFQIAPEGYLIISNKGHVCLGILDGSNVNDGSSIILG 528


>ref|XP_004512995.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
            [Cicer arietinum]
          Length = 1387

 Score =  520 bits (1340), Expect = e-145
 Identities = 255/441 (57%), Positives = 320/441 (72%), Gaps = 2/441 (0%)
 Frame = -1

Query: 1602 FVLPLFPKLGFPGDKSTGDVELKLGRFVLRENGGVGMGRYKVNRLFSGSKASEIDSSSMP 1423
            F+ PLF K G  G +    +++K G FV +++G            FS    +   SSS  
Sbjct: 133  FLFPLFKKYGVVGQRDLKLIDVKKGNFVTQKSGDSD------GIAFSSRVVAVDSSSSTV 186

Query: 1422 FPVKGNVYPYGLYFTQLYIGNPPRSYFLDLDTGSDLTWMQCDAPCTSCAKGPNSLYKPTR 1243
            FP+ GNVYP GLY+T + +GNPP+ YF+D+DTGSDLTW+QCDAPC SCAKG N  YKP R
Sbjct: 187  FPISGNVYPDGLYYTHVRVGNPPKRYFVDVDTGSDLTWIQCDAPCRSCAKGANVPYKPIR 246

Query: 1242 GKIVPPKDSLCLEVQKNLKSVYCDTCKQCDYEIEYADHSSSMGVLVRDELNLMSSNGSLS 1063
              IVP  DSLCLEVQKN K+ Y ++ +QCDYEI+YADHSSSMGVL+RDEL+LM++NGS +
Sbjct: 247  TNIVPSLDSLCLEVQKNQKNGYHESFQQCDYEIQYADHSSSMGVLIRDELHLMTTNGSKT 306

Query: 1062 KPNVVFGCAYDQQGTLLNSLAKTDGILGLSRSKVSLPAQLASRGIVKNVFGHCLT-SDIA 886
            K N VFGC YDQ+G LLN+L KTDGI+GLSR+KV LP QL+S+GI+KNV GHCL+ +D  
Sbjct: 307  KLNFVFGCGYDQEGLLLNTLTKTDGIMGLSRAKVGLPYQLSSKGIIKNVVGHCLSNNDGV 366

Query: 885  GGGYAFIGDELVPYWNMAWVPMVNHLSVSFYETEVARISYGGKKISLAQSGDAK-GRVVF 709
            GGGY F+GD+ VPYW M W PM        Y+TEV  I+YG + +S    G +K G VVF
Sbjct: 367  GGGYMFLGDDFVPYWGMTWAPMTQ--ITDLYQTEVLGINYGNRLLSF--DGHSKVGNVVF 422

Query: 708  DSGSSYTYLPKQAYADLVHSVNYLLDGKLIQDASDATLPLCWRADTPIRSVDDVKHFFRP 529
            DSGSSYTY PK+AY DLV S+  +    L++D SD TLP+CW+A+ PIRSV DVK +F+ 
Sbjct: 423  DSGSSYTYFPKEAYRDLVASLEEVSGLGLVEDDSDTTLPICWQANFPIRSVKDVKDYFKT 482

Query: 528  LTFHFGSKWYFVSTKFQIPPEGYLVTNSKGNVCLGILDGSNVHDGSTFILGDVSFRGKLV 349
            LT  FG+KW+ +ST F IPPEGYL+ ++KGNVCL ILDGSNV+DGS+ ILGD+S RG LV
Sbjct: 483  LTLRFGNKWWILSTLFHIPPEGYLIISNKGNVCLAILDGSNVNDGSSIILGDISLRGYLV 542

Query: 348  VYDNENQKVGWTDSNCKSPRR 286
            VYDN N+ +GW  + C  P R
Sbjct: 543  VYDNVNKNIGWERTKCGMPNR 563


>ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297337316|gb|EFH67733.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 578

 Score =  516 bits (1329), Expect = e-143
 Identities = 276/580 (47%), Positives = 366/580 (63%), Gaps = 16/580 (2%)
 Frame = -1

Query: 1950 MEPE-EAQSPVKGVVVISLPPRDDPSMGKTVAFYTISGDSHH-QINEIRQPQLSIHRPP- 1780
            MEP+   Q  +  VV+I+LPP DDPS GKT++ +T++   +  QI     P  S    P 
Sbjct: 1    MEPDLHEQQRLHSVVIITLPPSDDPSQGKTISAFTLNDHDYPLQIPPEDNPNPSFQPDPL 60

Query: 1779 ---QNPLRIRARITGAGKRXXXXXXXXXXXXXXXXXXXSPQSP--FKLXXXXXXXXXXXX 1615
               Q    + + ++    R                    P S   F++            
Sbjct: 61   HQNQQSRLLFSDLSMGSPRLVLGLLGFSLLAVAFYASVFPNSVQMFRVSDERNRDDDSSR 120

Query: 1614 XXXTFVLPLFPKLG---FPGDKSTGDVELKLGRFVLRENGGVGMGRYKVNRLFSGSKASE 1444
               +FV P++ KL    F       D+ L+ G+FV   +  + +   KVN + S S A  
Sbjct: 121  ETTSFVFPVYHKLRAREFHERILAEDLGLENGKFVESMDLEL-VNPVKVNDVLSTS-AGS 178

Query: 1443 IDSSSMPFPVKGNVYPYGLYFTQLYIGNPP--RSYFLDLDTGSDLTWMQCDAPCTSCAKG 1270
            IDSS+  FPV GNVYP GLY+T++ +G P   + Y LD+DTGSDLTW+QCDAPCTSCAKG
Sbjct: 179  IDSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKG 238

Query: 1269 PNSLYKPTRGKIVPPKDSLCLEVQKNLKSVYCDTCKQCDYEIEYADHSSSMGVLVRDELN 1090
             N LYKP +  +V   +  C+EVQ+N  + +C++C QCDYEIEYADHS SMGVL +D+ +
Sbjct: 239  ANQLYKPRKDNLVRSSEPFCVEVQRNQLTEHCESCHQCDYEIEYADHSYSMGVLTKDKFH 298

Query: 1089 LMSSNGSLSKPNVVFGCAYDQQGTLLNSLAKTDGILGLSRSKVSLPAQLASRGIVKNVFG 910
            L   NGSL++ ++VFGC YDQQG LLN+L KTDGILGLSR+K+SLP+QLASRGI+ NV G
Sbjct: 299  LKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVG 358

Query: 909  HCLTSDIAGGGYAFIGDELVPYWNMAWVPMVNHLSVSFYETEVARISYGGKKISLAQSGD 730
            HCL SD+ G GY F+G +LVP   M WVPM++H  +  Y+ +V ++SYG   +SL     
Sbjct: 359  HCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHHPHLEVYQMQVTKMSYGNAMLSLDGENG 418

Query: 729  AKGRVVFDSGSSYTYLPKQAYADLVHSVNYLLDGKLIQDASDATLPLCWRADT--PIRSV 556
              G+V+FD+GSSYTY P QAY+ LV S+  + D +L +D SD  LP+CWRA T  PI S+
Sbjct: 419  RVGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSDLELTRDDSDEALPICWRAKTNSPISSL 478

Query: 555  DDVKHFFRPLTFHFGSKWYFVSTKFQIPPEGYLVTNSKGNVCLGILDGSNVHDGSTFILG 376
             DVK FFRP+T   GSKW  +S K  I PE YL+ ++KGNVCLGILDGSNVHDGST I+G
Sbjct: 479  SDVKKFFRPITLQIGSKWLIISKKLLIQPEDYLIISNKGNVCLGILDGSNVHDGSTIIIG 538

Query: 375  DVSFRGKLVVYDNENQKVGWTDSNCKSPRRF-KSLPFIAG 259
            D+S RG+L+VYDN  Q++GW  S+C  P  F  ++PF  G
Sbjct: 539  DISMRGRLIVYDNVKQRIGWMKSDCVRPSEFDHNVPFFQG 578


>ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
            gi|241932440|gb|EES05585.1| hypothetical protein
            SORBIDRAFT_04g028990 [Sorghum bicolor]
          Length = 557

 Score =  516 bits (1329), Expect = e-143
 Identities = 265/569 (46%), Positives = 358/569 (62%), Gaps = 10/569 (1%)
 Frame = -1

Query: 1944 PEEAQSP-VKGVVVISLPPRDDPSMGKTVAFYTISGDSHHQINEIRQPQLSIHRPPQNPL 1768
            P  AQ P + GVV+I+LPP D PS GKT+  +T + D+       R P+  +  P    +
Sbjct: 8    PAGAQQPQLHGVVIITLPPSDQPSKGKTITAFTYTDDAPPPP---RPPEPVMGYPAATQV 64

Query: 1767 RIRARITGAGKRXXXXXXXXXXXXXXXXXXXSPQSPFK-LXXXXXXXXXXXXXXXTFVLP 1591
            R R R   + +R                         + L               +F+LP
Sbjct: 65   RRRPRRVLSTRRVAAAALVLGALAVAAYYCFYSDVAVQFLGMEQEEAQKDRNETRSFLLP 124

Query: 1590 LFPKLGFPGDKSTGDVELKLGRFVLRENGGVGMGRYKV--------NRLFSGSKASEIDS 1435
            L PK              + GR  LRE G V +   ++        N++     A+   +
Sbjct: 125  LHPKA-------------RQGR-ALREFGDVKLAARRIDDGWRKARNKMEVAKAAAAGTN 170

Query: 1434 SSMPFPVKGNVYPYGLYFTQLYIGNPPRSYFLDLDTGSDLTWMQCDAPCTSCAKGPNSLY 1255
            S+   P+KGNV+P G Y+T +++GNPPR YFLD+DTGSDLTW+QCDAPCT+CAKGP+ LY
Sbjct: 171  STALLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLY 230

Query: 1254 KPTRGKIVPPKDSLCLEVQKNLKSVYCDTCKQCDYEIEYADHSSSMGVLVRDELNLMSSN 1075
            KPT+ KIVPP+D LC E+Q N    YC+TCKQCDYEIEYAD SSSMGVL RD+++L+++N
Sbjct: 231  KPTKEKIVPPRDLLCQELQGNQN--YCETCKQCDYEIEYADQSSSMGVLARDDMHLIATN 288

Query: 1074 GSLSKPNVVFGCAYDQQGTLLNSLAKTDGILGLSRSKVSLPAQLASRGIVKNVFGHCLTS 895
            G   K + VFGCAYDQQG LL+S AKTDGILGLS + +SLP+QLAS GI+ N+FGHC+T 
Sbjct: 289  GGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCITR 348

Query: 894  DIAGGGYAFIGDELVPYWNMAWVPMVNHLSVSFYETEVARISYGGKKISLAQSGDAKGRV 715
            +  GGGY F+GD+ VP W + W   +     + Y TE   + YG +++ + +      +V
Sbjct: 349  EQGGGGYMFLGDDYVPRWGITWT-SIRSGPDNLYHTEAHHVKYGDQQLRMREQAGNTVQV 407

Query: 714  VFDSGSSYTYLPKQAYADLVHSVNYLLDGKLIQDASDATLPLCWRADTPIRSVDDVKHFF 535
            +FDSGSSYTYLP + Y +LV ++ Y   G  +QD+SD TLPLCW+AD P+R ++DVK FF
Sbjct: 408  IFDSGSSYTYLPDEIYENLVAAIKYASPG-FVQDSSDRTLPLCWKADFPVRYLEDVKQFF 466

Query: 534  RPLTFHFGSKWYFVSTKFQIPPEGYLVTNSKGNVCLGILDGSNVHDGSTFILGDVSFRGK 355
            +PL  HFG KW F+S  F I PE YL+ + KGNVCLG+L+G+ ++ GST I+GDVS RGK
Sbjct: 467  KPLNLHFGKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGK 526

Query: 354  LVVYDNENQKVGWTDSNCKSPRRFKSLPF 268
            LVVYDN+ +++GWT+S+C  P+  K  PF
Sbjct: 527  LVVYDNQRRQIGWTNSDCTKPQSQKGFPF 555


>ref|XP_006826817.1| hypothetical protein AMTR_s00010p00056950 [Amborella trichopoda]
            gi|548831246|gb|ERM94054.1| hypothetical protein
            AMTR_s00010p00056950 [Amborella trichopoda]
          Length = 545

 Score =  515 bits (1327), Expect = e-143
 Identities = 267/556 (48%), Positives = 351/556 (63%), Gaps = 1/556 (0%)
 Frame = -1

Query: 1932 QSPVKGVVVISLPPRDDPSMGKTVAFYTISGDSHHQINEIRQPQLSIHRPPQNPLRIRAR 1753
            Q  ++G V+ISLPP DDPS GKT+  +T+  D  HQ     Q Q +     Q P      
Sbjct: 3    QPEIQGFVIISLPPPDDPSKGKTITAFTMVSDPSHQNENQSQNQQT-----QQPQIASNS 57

Query: 1752 ITGAGK-RXXXXXXXXXXXXXXXXXXXSPQSPFKLXXXXXXXXXXXXXXXTFVLPLFPKL 1576
            I G+ + R                                          +F+  L+PK 
Sbjct: 58   IAGSSRGRIGSIVVRVLAMLGAVVAVLFFWQWVSGFSEMDYETERSKNNPSFLYNLYPK- 116

Query: 1575 GFPGDKSTGDVELKLGRFVLRENGGVGMGRYKVNRLFSGSKASEIDSSSMPFPVKGNVYP 1396
             +  +    D  L+LG FV R+   +G+   K     S   +S I      FPVKGNVYP
Sbjct: 117  -WSEEAIEKDAALRLGTFVKRDEVRIGLRDVKTLEAISSINSSTI------FPVKGNVYP 169

Query: 1395 YGLYFTQLYIGNPPRSYFLDLDTGSDLTWMQCDAPCTSCAKGPNSLYKPTRGKIVPPKDS 1216
             GLY+  + +GNP R Y+LD+DTGSDLTW+QC+APCT+CAKGP+ LY P++  +VP KD 
Sbjct: 170  DGLYYISILVGNPRRPYYLDMDTGSDLTWIQCNAPCTNCAKGPHPLYNPSKQNLVPSKDP 229

Query: 1215 LCLEVQKNLKSVYCDTCKQCDYEIEYADHSSSMGVLVRDELNLMSSNGSLSKPNVVFGCA 1036
             CLEVQ N K  +     QCDY+IEYAD SSSMGVLVRD+L LM +NG++ K  +VFGCA
Sbjct: 230  FCLEVQVNDKGKFAGASHQCDYDIEYADQSSSMGVLVRDDLQLMITNGTVIKTGLVFGCA 289

Query: 1035 YDQQGTLLNSLAKTDGILGLSRSKVSLPAQLASRGIVKNVFGHCLTSDIAGGGYAFIGDE 856
            YDQ+G L +S AKTDGILGLS +KVSLP+QLASRG++KNV GHC+ +D  GGGY F+GD+
Sbjct: 290  YDQRGKLGHSPAKTDGILGLSSAKVSLPSQLASRGLMKNVVGHCIRNDANGGGYMFLGDD 349

Query: 855  LVPYWNMAWVPMVNHLSVSFYETEVARISYGGKKISLAQSGDAKGRVVFDSGSSYTYLPK 676
             +P W M WVPM++  S + Y  EV++IS G + I         GRVVFDSGSSY+YL K
Sbjct: 350  FIPQWRMTWVPMLSSPSTNAYHAEVSKISLGSRPIDGGGLITKIGRVVFDSGSSYSYLTK 409

Query: 675  QAYADLVHSVNYLLDGKLIQDASDATLPLCWRADTPIRSVDDVKHFFRPLTFHFGSKWYF 496
            QAY  L+ S+  + +  L+ D SD TLP+CW+A +P+RS+ DV  FF+PL  +FGS+  F
Sbjct: 410  QAYTSLIKSLKDVAEKGLVLDDSDKTLPVCWKAKSPLRSIKDVNQFFKPLVLNFGSRLLF 469

Query: 495  VSTKFQIPPEGYLVTNSKGNVCLGILDGSNVHDGSTFILGDVSFRGKLVVYDNENQKVGW 316
             S  F+IPPEGYL+ ++KGN CLGIL+GS++HDG+T ILGD+S R KLVVYDN  +++GW
Sbjct: 470  GSKNFEIPPEGYLIISAKGNACLGILEGSHIHDGATNILGDISLRAKLVVYDNVKRRIGW 529

Query: 315  TDSNCKSPRRFKSLPF 268
              S+C+ P + KS PF
Sbjct: 530  VQSDCQ-PLKLKSFPF 544


>gb|ACN34727.1| unknown [Zea mays] gi|413923868|gb|AFW63800.1| hypothetical protein
            ZEAMMB73_012138 [Zea mays]
          Length = 557

 Score =  508 bits (1308), Expect = e-141
 Identities = 261/558 (46%), Positives = 347/558 (62%), Gaps = 1/558 (0%)
 Frame = -1

Query: 1938 EAQSPVKGVVVISLPPRDDPSMGKTVAFYTISGDSHHQINEIRQPQLSIHRPPQNPLRIR 1759
            E Q  + GVV+I+LPP D PS GKTV  +  + D     +    P   +  P     R R
Sbjct: 13   EQQPQLHGVVIITLPPADQPSKGKTVTAFAYTNDPPPPRSP---PDPVMGYPAATEARRR 69

Query: 1758 ARITGAGKRXXXXXXXXXXXXXXXXXXXSPQSPFKLXXXXXXXXXXXXXXXTFVLPLFPK 1579
             R   + +R                         +                 F+LPL+PK
Sbjct: 70   PRRALSTRRVATAALVLGALAVAAYYCFYSDVAVQFLGMEQEEEQRNETRS-FLLPLYPK 128

Query: 1578 LGFPGD-KSTGDVELKLGRFVLRENGGVGMGRYKVNRLFSGSKASEIDSSSMPFPVKGNV 1402
                   +  GDV+L   R    ++GG    R   NR+     A+   +S+   P+KGNV
Sbjct: 129  ARQGRALREFGDVKLAARRV---DDGG----RKARNRMEVAKAATARTNSTALLPIKGNV 181

Query: 1401 YPYGLYFTQLYIGNPPRSYFLDLDTGSDLTWMQCDAPCTSCAKGPNSLYKPTRGKIVPPK 1222
            +P G Y+T ++IGNPPR YFLD+DTGSDLTW+QCDAPCT+CAKGP+ LYKP + KIVPP+
Sbjct: 182  FPDGQYYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPPR 241

Query: 1221 DSLCLEVQKNLKSVYCDTCKQCDYEIEYADHSSSMGVLVRDELNLMSSNGSLSKPNVVFG 1042
            D LC E+Q N    YC+TCKQCDYEIEYAD SSSMGVL RD+++++++NG   K + VFG
Sbjct: 242  DLLCQELQGNQN--YCETCKQCDYEIEYADQSSSMGVLARDDMHMIATNGGREKLDFVFG 299

Query: 1041 CAYDQQGTLLNSLAKTDGILGLSRSKVSLPAQLASRGIVKNVFGHCLTSDIAGGGYAFIG 862
            CAYDQQG LL+S AKTDGILGLS + +S P+QLAS GI+ NVFGHC+T +  GGGY F+G
Sbjct: 300  CAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGGGYMFLG 359

Query: 861  DELVPYWNMAWVPMVNHLSVSFYETEVARISYGGKKISLAQSGDAKGRVVFDSGSSYTYL 682
            D+ VP W + W   +     + Y T+   + YG +++   +   +  +V+FDSGSSYTYL
Sbjct: 360  DDYVPRWGVTWT-SIRSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFDSGSSYTYL 418

Query: 681  PKQAYADLVHSVNYLLDGKLIQDASDATLPLCWRADTPIRSVDDVKHFFRPLTFHFGSKW 502
            P + Y +LV ++ Y   G  +QD SD TLPLCW+AD P+R ++DVK FF PL  HFG KW
Sbjct: 419  PNEIYENLVAAIKYASPG-FVQDTSDRTLPLCWKADFPVRYLEDVKQFFEPLNLHFGKKW 477

Query: 501  YFVSTKFQIPPEGYLVTNSKGNVCLGILDGSNVHDGSTFILGDVSFRGKLVVYDNENQKV 322
             F+S  F I PE YL+ + KGNVCLG+L+G+ ++ GST I+GDVS RGKLVVYDN+ +++
Sbjct: 478  LFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNQRKQI 537

Query: 321  GWTDSNCKSPRRFKSLPF 268
            GW DS+C  P+  K  PF
Sbjct: 538  GWADSDCTKPQSQKGFPF 555


>ref|XP_006393315.1| hypothetical protein EUTSA_v10011346mg [Eutrema salsugineum]
            gi|557089893|gb|ESQ30601.1| hypothetical protein
            EUTSA_v10011346mg [Eutrema salsugineum]
          Length = 580

 Score =  506 bits (1304), Expect = e-140
 Identities = 270/588 (45%), Positives = 372/588 (63%), Gaps = 24/588 (4%)
 Frame = -1

Query: 1950 MEPE----EAQSPVKGVVVISLPPRDDPSMGKTVAFYTISGDSHH---QINEIRQPQLS- 1795
            MEP+    + Q  V GVV+I+LPP D+PS GKT++ +T++   +    +  + R P    
Sbjct: 1    MEPDLHDHQQQQRVHGVVIITLPPSDNPSKGKTISAFTLTDHDYPPDIRPEDERNPSFQP 60

Query: 1794 --IHRPPQNPLRIRARITGAGKRXXXXXXXXXXXXXXXXXXXSPQSP--FKLXXXXXXXX 1627
              +H+ PQ+ L   + ++ +  R                    P S   F++        
Sbjct: 61   DPLHQNPQSGLWF-SDLSMSSPRLVLGLLGISLLAIAFYGSVFPNSVQLFRVSDERDRDE 119

Query: 1626 XXXXXXXTFVLPLFPKLGFPGDKSTGDVELKLGRFVLRENGGVGM--------GRYKVNR 1471
                   +FV P++ KL     +   +  L     V++E  G+ +           KVN 
Sbjct: 120  DNRRETASFVFPVYHKLRA---REIPERNLAEALDVVKEENGIFVESIEQELVNPVKVND 176

Query: 1470 LFSGSKASEIDSSSMPFPVKGNVYPYGLYFTQLYIGNPPRS---YFLDLDTGSDLTWMQC 1300
            +FS S  S +DSS+  FPV G VYP GLYFT++++GNP +    + LD+DTGSDLTW+QC
Sbjct: 177  VFSASVGS-LDSSTTIFPVGGYVYPDGLYFTRVFVGNPEKDGHYFHLDIDTGSDLTWIQC 235

Query: 1299 DAPCTSCAKGPNSLYKPTRGKIVPPKDSLCLEVQKNLKSVYCDTCKQCDYEIEYADHSSS 1120
            DAPCTSCAKG N LYKP + K+V   + LC+EVQKN  +  C++C+QCDYEIEYAD SSS
Sbjct: 236  DAPCTSCAKGANQLYKPRKDKLVGSAEHLCVEVQKNQMTELCESCQQCDYEIEYADLSSS 295

Query: 1119 MGVLVRDELNLMSSNGSLSKPNVVFGCAYDQQGTLLNSLAKTDGILGLSRSKVSLPAQLA 940
            +GVL +DE +L   NGSL+  ++VFGC YDQQG LLN+L K DGILGLSR+K+SLP+QLA
Sbjct: 296  LGVLTKDEFHLKLHNGSLAASDIVFGCGYDQQGLLLNTLLKKDGILGLSRAKISLPSQLA 355

Query: 939  SRGIVKNVFGHCLTSDIAGGGYAFIGDELVPYWNMAWVPMVNHLSVSFYETEVARISYGG 760
            S+GI+ NV GHCL SD+ G GY F+G +LVP   M WVPM +H  +  ++ +V ++SYG 
Sbjct: 356  SQGIISNVVGHCLPSDLNGEGYIFMGSDLVPLHGMTWVPMFHHSHLEVHQMQVTKVSYGN 415

Query: 759  KKISLAQSGDAKGRVVFDSGSSYTYLPKQAYADLVHSVNYLLDGKLIQDASDATLPLCWR 580
              +SL+      G+V+FD+GSSYTY PK+AY+ LV S   L + KL +D SD  LP+CW+
Sbjct: 416  GMLSLSGENGRIGKVLFDTGSSYTYFPKKAYSQLVTS---LQEVKLTRDESDKALPICWQ 472

Query: 579  ADTPIRSVDDVKHFFRPLTFHFGSKWYFVSTKFQIPPEGYLVTNSKGNVCLGILDGSNVH 400
            A+  I S+ DVK F++P+T   GSKW+ +S K  I PE YL+ ++KGNVCLGILDGS+VH
Sbjct: 473  ANFLISSLSDVKRFYKPITIQIGSKWWIISRKLVIQPEDYLIISNKGNVCLGILDGSSVH 532

Query: 399  DGSTFILGDVSFRGKLVVYDNENQKVGWTDSNCKSPRRF-KSLPFIAG 259
            DGST ILGD+S RG+L+VYDN  +++GW  S+C  P    + LPF  G
Sbjct: 533  DGSTIILGDISMRGRLIVYDNVKRRIGWMKSDCVRPHESDQKLPFFQG 580

