BLASTX nr result

ID: Mentha22_contig00030217 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00030217
         (1217 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU28892.1| hypothetical protein MIMGU_mgv1a006975mg [Mimulus...   258   4e-66
ref|XP_007045750.1| 18S pre-ribosomal assembly protein gar2-rela...   191   7e-46
ref|XP_007045749.1| 18S pre-ribosomal assembly protein gar2-rela...   191   7e-46
gb|EXB44897.1| hypothetical protein L484_026481 [Morus notabilis]     180   1e-42
gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thali...   176   2e-41
ref|NP_178475.2| 18S pre-ribosomal assembly protein gar2-related...   176   2e-41
ref|XP_006484258.1| PREDICTED: dentin sialophosphoprotein-like i...   174   5e-41
ref|XP_006484256.1| PREDICTED: dentin sialophosphoprotein-like i...   174   5e-41
ref|XP_002514588.1| conserved hypothetical protein [Ricinus comm...   173   1e-40
emb|CBI27399.3| unnamed protein product [Vitis vinifera]              172   2e-40
gb|AAM77645.1|AF517846_1 hypothetical protein [Arabidopsis thali...   172   3e-40
ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arab...   169   2e-39
ref|XP_006395670.1| hypothetical protein EUTSA_v10004181mg [Eutr...   167   8e-39
ref|XP_006291111.1| hypothetical protein CARUB_v10017222mg [Caps...   165   3e-38
ref|XP_006348792.1| PREDICTED: dentin sialophosphoprotein-like [...   162   4e-37
ref|XP_007222119.1| hypothetical protein PRUPE_ppa004630mg [Prun...   160   1e-36
ref|XP_002311412.1| 18S pre-ribosomal assembly protein gar2 [Pop...   159   3e-36
ref|XP_006363556.1| PREDICTED: dentin sialophosphoprotein-like i...   155   3e-35
ref|XP_004239019.1| PREDICTED: uncharacterized protein LOC101258...   155   3e-35
ref|XP_004239018.1| PREDICTED: uncharacterized protein LOC101258...   155   3e-35

>gb|EYU28892.1| hypothetical protein MIMGU_mgv1a006975mg [Mimulus guttatus]
          Length = 424

 Score =  258 bits (659), Expect = 4e-66
 Identities = 176/431 (40%), Positives = 232/431 (53%), Gaps = 35/431 (8%)
 Frame = +1

Query: 16   DRESFDTVEDLADCNASDSRDSGSP--------------LFTDKNVLECDVPEFEVCYRE 153
            D+E  +   +   CN ++S+DS  P              LFTDKNVLEC +PEFEV  +E
Sbjct: 47   DKERQNLGHENMPCNGNESQDSSPPCNSASGLSQTTDANLFTDKNVLECGMPEFEVFCKE 106

Query: 154  NDCHLLKDICIDEGSPEKDVNAIES-------GLSPLPPKEDPLLDADSFTAEPCGSKEE 312
             D  ++KDIC+DEG P+      ES       GL   P   +   +     A  CG+KEE
Sbjct: 107  IDYQIVKDICVDEGRPDNKDKITESCKDDKSDGLFHQPTNSNHS-EITITEANQCGTKEE 165

Query: 313  NDVIKLISQEEKLDSSLKNLFDKDSIKH-CEPENTVETSEACFDETLSPEDSLADRKLPI 489
            ND       +   D+S    FD+D+ K  C+P  +V+TSE   ++    EDSL   K P+
Sbjct: 166  ND------GKSPSDTS----FDEDTAKKDCDPAKSVQTSEITDNQE---EDSLVGIKPPV 212

Query: 490  EDSGDQHGL--------DEGNKVMQLSDQVLNEGAVSESPAVLSTEAEETG--KDVQ-SS 636
            ++   ++ L        DEG  V Q  DQ+LNE   S S A  S+ AE  G  +DV+ SS
Sbjct: 213  QELVTRNSLRSFLYPLGDEGGVVTQPPDQILNEKPASRSSAATSSSAEAEGVEEDVEASS 272

Query: 637  CLPYNSKVENEIITFNFSAPEGVAASTGTAEDIEEQSTENLPTATSNEED--DSHKQSLV 810
             + YNS+VE+  ITFNF +           E+++ Q + +  + TSN  D   S K    
Sbjct: 273  SVLYNSEVESGTITFNFDST--------VTENMKPQDSVDSSSVTSNNIDCVGSSKDRED 324

Query: 811  GNEENSTTSEGSNCANAPEASSEHKQANXXXXXXXXXXXXEPNAPPVVNQLQQDMGERSF 990
             NE+NS  +EGS+                                 +  Q++ + GE SF
Sbjct: 325  ENEKNSEQNEGSSAI-------------------------------ISRQMKYEEGETSF 353

Query: 991  SAASMIDYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHK 1170
            +AAS++ YSGPIA+SGSLS RSDGS  SG+SFAFP+LQSEWNSSPVRMAKADRR FRKHK
Sbjct: 354  AAASLVTYSGPIAYSGSLSLRSDGSAASGRSFAFPILQSEWNSSPVRMAKADRRHFRKHK 413

Query: 1171 GWRSGLLCCRF 1203
            GWRSGLLCCRF
Sbjct: 414  GWRSGLLCCRF 424


>ref|XP_007045750.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 2
            [Theobroma cacao] gi|590698568|ref|XP_007045751.1| 18S
            pre-ribosomal assembly protein gar2-related, putative
            isoform 2 [Theobroma cacao]
            gi|590698571|ref|XP_007045752.1| 18S pre-ribosomal
            assembly protein gar2-related, putative isoform 2
            [Theobroma cacao] gi|508709685|gb|EOY01582.1| 18S
            pre-ribosomal assembly protein gar2-related, putative
            isoform 2 [Theobroma cacao] gi|508709686|gb|EOY01583.1|
            18S pre-ribosomal assembly protein gar2-related, putative
            isoform 2 [Theobroma cacao] gi|508709687|gb|EOY01584.1|
            18S pre-ribosomal assembly protein gar2-related, putative
            isoform 2 [Theobroma cacao]
          Length = 470

 Score =  191 bits (484), Expect = 7e-46
 Identities = 151/457 (33%), Positives = 223/457 (48%), Gaps = 67/457 (14%)
 Frame = +1

Query: 34   TVEDLADCNASDSRD---SGSP------------LFTDKNVLECDVPEFEVCYRENDCHL 168
            +V D A+ N  + RD   S SP             + DK+V+EC++PE  VCY+E+  H+
Sbjct: 34   SVNDFANGNEKEVRDFVTSNSPSLKNMDSFQNSVFYLDKSVMECELPELVVCYKESTYHV 93

Query: 169  LKDICIDEGSPEKDVNAIESGLSPLPPKEDPLLDADSFTAEPCGSKEENDVIKLISQEEK 348
            +KDICIDEG P +D    E+G+       D  +D +   +E    KE++  +      EK
Sbjct: 94   VKDICIDEGVPTQDKFLFETGM-------DEKIDCNFLPSE----KEQDSQL----MTEK 138

Query: 349  LDSSL----------KNLFDKDSIKHCEPENTVETSEACFDETLSPEDSLADRKLPIE-D 495
            L++ +          +N   KD    C     V+T     D +LS E + +++ +P + D
Sbjct: 139  LETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKGIPNQCD 198

Query: 496  SGDQH--GLDEGNKVMQLSDQVLNE----GAVSESPAVLSTEAEETGKDVQSSCLP---Y 648
            S D     + +G+ +  ++D V  E    G +     +    +E    D +S  +    +
Sbjct: 199  SKDLMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSDGIEQQSF 258

Query: 649  NSKVENEIITFNFSAPEGVAASTGTAEDIEEQSTENLPTATS-NEEDDSHK-------QS 804
             S  + E++         + ++   ++D  E++  ++P   S  EE DS K        +
Sbjct: 259  QSSSKKEVMVM-----PPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEAILISPA 313

Query: 805  LVGNEENSTTSEGSN-----------------CANAPEASSE--HKQANXXXXXXXXXXX 927
             V   E ST+S   N                  ++AP +S +  H   +           
Sbjct: 314  QVSTSEESTSSSLVNEVSYDNKLETGSITFNLDSSAPTSSKDECHHNLDSEPLGTGSTPK 373

Query: 928  XEPNAPPVV-NQLQQDMGERSFSAASM----IDYSGPIAFSGSLSHRSDGSTTSGKSFAF 1092
             E  A   + N LQQ +GE SFSAA +    I YSGP+A+SGSLS RSD STTS +SFAF
Sbjct: 374  LEVAADQSISNNLQQGIGESSFSAAGLVTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAF 433

Query: 1093 PVLQSEWNSSPVRMAKADRRDFRKHKGWRSGLLCCRF 1203
            P+LQSEWN SPVRMAKADRR +RKHKGWR GLLCCRF
Sbjct: 434  PILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 470


>ref|XP_007045749.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 1
            [Theobroma cacao] gi|508709684|gb|EOY01581.1| 18S
            pre-ribosomal assembly protein gar2-related, putative
            isoform 1 [Theobroma cacao]
          Length = 527

 Score =  191 bits (484), Expect = 7e-46
 Identities = 151/457 (33%), Positives = 223/457 (48%), Gaps = 67/457 (14%)
 Frame = +1

Query: 34   TVEDLADCNASDSRD---SGSP------------LFTDKNVLECDVPEFEVCYRENDCHL 168
            +V D A+ N  + RD   S SP             + DK+V+EC++PE  VCY+E+  H+
Sbjct: 91   SVNDFANGNEKEVRDFVTSNSPSLKNMDSFQNSVFYLDKSVMECELPELVVCYKESTYHV 150

Query: 169  LKDICIDEGSPEKDVNAIESGLSPLPPKEDPLLDADSFTAEPCGSKEENDVIKLISQEEK 348
            +KDICIDEG P +D    E+G+       D  +D +   +E    KE++  +      EK
Sbjct: 151  VKDICIDEGVPTQDKFLFETGM-------DEKIDCNFLPSE----KEQDSQL----MTEK 195

Query: 349  LDSSL----------KNLFDKDSIKHCEPENTVETSEACFDETLSPEDSLADRKLPIE-D 495
            L++ +          +N   KD    C     V+T     D +LS E + +++ +P + D
Sbjct: 196  LETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKGIPNQCD 255

Query: 496  SGDQH--GLDEGNKVMQLSDQVLNE----GAVSESPAVLSTEAEETGKDVQSSCLP---Y 648
            S D     + +G+ +  ++D V  E    G +     +    +E    D +S  +    +
Sbjct: 256  SKDLMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSDGIEQQSF 315

Query: 649  NSKVENEIITFNFSAPEGVAASTGTAEDIEEQSTENLPTATS-NEEDDSHK-------QS 804
             S  + E++         + ++   ++D  E++  ++P   S  EE DS K        +
Sbjct: 316  QSSSKKEVMVM-----PPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEAILISPA 370

Query: 805  LVGNEENSTTSEGSN-----------------CANAPEASSE--HKQANXXXXXXXXXXX 927
             V   E ST+S   N                  ++AP +S +  H   +           
Sbjct: 371  QVSTSEESTSSSLVNEVSYDNKLETGSITFNLDSSAPTSSKDECHHNLDSEPLGTGSTPK 430

Query: 928  XEPNAPPVV-NQLQQDMGERSFSAASM----IDYSGPIAFSGSLSHRSDGSTTSGKSFAF 1092
             E  A   + N LQQ +GE SFSAA +    I YSGP+A+SGSLS RSD STTS +SFAF
Sbjct: 431  LEVAADQSISNNLQQGIGESSFSAAGLVTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAF 490

Query: 1093 PVLQSEWNSSPVRMAKADRRDFRKHKGWRSGLLCCRF 1203
            P+LQSEWN SPVRMAKADRR +RKHKGWR GLLCCRF
Sbjct: 491  PILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 527


>gb|EXB44897.1| hypothetical protein L484_026481 [Morus notabilis]
          Length = 642

 Score =  180 bits (457), Expect = 1e-42
 Identities = 160/499 (32%), Positives = 222/499 (44%), Gaps = 120/499 (24%)
 Frame = +1

Query: 67   DSRDSGSPLFTDKNVLECDVPEFEVCYRENDCHLLKDICIDEGSPEKDVNAIESGLSPLP 246
            +S + GS  +TDK+V EC++PEF+VCYRE+  + +KDICIDEG P  D    ESG     
Sbjct: 155  ESLEKGSDDYTDKSVTECEMPEFQVCYRESSYNSVKDICIDEGVPALDNILFESGA---- 210

Query: 247  PKEDPLLDADSFTAEPCGSKEENDVIKL------ISQEEKLDSSLKNLFDKDSIKHCEPE 408
                   D  S        +++N  +         +    L+S  K   +K+ I   EP+
Sbjct: 211  -------DMKSLCTFVFPDQDQNSQLNKGRVDIGAASPNGLNSLTKTESEKEFINVLEPK 263

Query: 409  NTVETSEACFDET-----------LSPEDSLADRKLPIEDSGDQHGLDEGNKV--MQLSD 549
            + ++  E   D T           + PE+++  ++L  ++S       +G+    +Q+S 
Sbjct: 264  DFMQQGEGNCDATDKIENDISKDKVFPENAILMKELGADNSHPWSPSWDGDAAAQVQISR 323

Query: 550  QVLNEGAVSESPAV------LSTEAEET---------GKDVQS-SCLP----YNSKVENE 669
               +E   + SP        LS  +EE           K+ +S S LP    YNSKVE  
Sbjct: 324  DKASETTNTISPGFDLAAEKLSNSSEEALAIPVPVSEAKESKSGSSLPNDLAYNSKVEKR 383

Query: 670  IITFNFSAPEGVAAS------TGTAEDIEEQSTENLPTATSNEEDDS----HKQS-LVGN 816
             ITF+F +   V  +       G +E +E ++   +   T+N +  S    H  S L G 
Sbjct: 384  RITFDFRSLATVPVAKEECPQNGISERLETENISTVDDVTTNMQFVSSQVQHDSSPLTGT 443

Query: 817  EEN---------------STTSEGS-------------------------NCANAPEASS 876
             E+               S   +GS                          C   P  SS
Sbjct: 444  REDCFQNAVHECGQTQNMSVVEDGSANAQIVPSNAQHEVAREEVPQNGVCTCVETPNTSS 503

Query: 877  ---------------EHKQANXXXXXXXXXXXXE-PNAPPVVNQL----------QQDMG 978
                           +H  A             E P+ P VV+ +          Q  +G
Sbjct: 504  VNDDTSGLQKVSSSLQHVTAREEGLPSTDTLCCETPDTPMVVDGISGSQVVSGHFQYGVG 563

Query: 979  ERSFSAAS----MIDYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKAD 1146
            E SFSAA      I+YSGPI +SGS+S RSD STTS +SFAFPVLQSEWNSSPVRMAKAD
Sbjct: 564  ESSFSAAGPLSGRINYSGPIPYSGSISLRSDSSTTSTRSFAFPVLQSEWNSSPVRMAKAD 623

Query: 1147 RRDFRKHKGWRSGLLCCRF 1203
            RR FRKH+GWR G+LCCRF
Sbjct: 624  RRHFRKHRGWRQGILCCRF 642


>gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thaliana]
          Length = 439

 Score =  176 bits (445), Expect = 2e-41
 Identities = 143/427 (33%), Positives = 197/427 (46%), Gaps = 31/427 (7%)
 Frame = +1

Query: 16   DRESFDTVEDLA-DCNAS-DSRDSGSPLF-TDKNVLECDVPEFEVCYRENDCHLLKDICI 186
            + E+   V D + DC+A+ DS +   P+F  DKNV  CD+PE  VCY+EN  H++KDIC+
Sbjct: 61   ENEAGKKVRDTSHDCDANVDSPEKKDPVFYMDKNVTACDLPEIVVCYKENTYHIVKDICV 120

Query: 187  DEGSPEKDVNAIESGLSPLPPKEDPLLDADSFTAEPCGSKEENDVIKLISQEE-----KL 351
            DEG P ++        S      + L+ AD     P  +K   D I  +   E     K 
Sbjct: 121  DEGVPVQEKFLFGEKDSVKSSSTEDLMKADKTNVNPSETKSAEDSISKVDDSEFCNDHKT 180

Query: 352  DSSLKNLFDKD------SIKHCEPENTVETSEACFDETLSPEDSLADRKL-PIEDSGDQH 510
            D  ++    +D      +  +   E+ + T E       SP   L+  ++ P E+S D+ 
Sbjct: 181  DRDVEESSGEDFADAEGTSSNYNQEHLIVTEEV----KASPTHGLSPSEIEPDENSKDEV 236

Query: 511  GLDEGNKVMQLSDQVLNEGAVSESPAVLSTEAEETGKDVQSSCLPYNSKVENEIITFNFS 690
             + + N     S + L  G       +LS E E+   +                   N S
Sbjct: 237  AISQDND----SKECLTLG------DILSREDEQKSLNQD-----------------NIS 269

Query: 691  APEGVAASTGTAEDIEEQSTENLPTATSNEEDDSHKQ--SLVGNEENSTTSEGSNCANAP 864
            +      S    +D E++S E     T  E+ +  KQ    + +   +T+ E +   N P
Sbjct: 270  SDSHEEQSPSQLQDKEKRSLETTAIETELEKTEEPKQGEEKLSSVSTTTSQEPNKTCNEP 329

Query: 865  E--ASSEHKQANXXXXXXXXXXXXEPNAPPVVNQLQQD------MGERSFSAASM----- 1005
            E   +  H Q N                  V N  + D       GE SFSAA       
Sbjct: 330  EKPETENHHQQNCL----------------VENSYEDDKFSSSRFGETSFSAADSVSISG 373

Query: 1006 -IDYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHKGWRS 1182
             I YSGPIA+SGSLS RSD STTSG+SFAFP+LQSEWNSSPVRMAKAD+R  R+  GWR 
Sbjct: 374  HITYSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADKR--RQKGGWRH 431

Query: 1183 GLLCCRF 1203
             LLCCRF
Sbjct: 432  TLLCCRF 438


>ref|NP_178475.2| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis
            thaliana] gi|42570677|ref|NP_973412.1| 18S pre-ribosomal
            assembly protein gar2-related protein [Arabidopsis
            thaliana] gi|79316683|ref|NP_001030966.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|186499149|ref|NP_001118260.1|
            18S pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250656|gb|AEC05750.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250657|gb|AEC05751.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250658|gb|AEC05752.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250659|gb|AEC05753.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana]
          Length = 439

 Score =  176 bits (445), Expect = 2e-41
 Identities = 143/427 (33%), Positives = 197/427 (46%), Gaps = 31/427 (7%)
 Frame = +1

Query: 16   DRESFDTVEDLA-DCNAS-DSRDSGSPLF-TDKNVLECDVPEFEVCYRENDCHLLKDICI 186
            + E+   V D + DC+A+ DS +   P+F  DKNV  CD+PE  VCY+EN  H++KDIC+
Sbjct: 61   ENEAGKKVRDTSHDCDANVDSPEKKDPVFYMDKNVTACDLPEIVVCYKENTYHIVKDICV 120

Query: 187  DEGSPEKDVNAIESGLSPLPPKEDPLLDADSFTAEPCGSKEENDVIKLISQEE-----KL 351
            DEG P ++        S      + L+ AD     P  +K   D I  +   E     K 
Sbjct: 121  DEGVPVQEKFLFGEKDSVKSSSTEDLMKADKTNVNPSETKSAEDSISKVDDSEFCNDHKT 180

Query: 352  DSSLKNLFDKD------SIKHCEPENTVETSEACFDETLSPEDSLADRKL-PIEDSGDQH 510
            D  ++    +D      +  +   E+ + T E       SP   L+  ++ P E+S D+ 
Sbjct: 181  DRDVEESSGEDFADAEGTSSNYNQEHLIVTEEV----KASPTHGLSPSEIEPDENSKDEV 236

Query: 511  GLDEGNKVMQLSDQVLNEGAVSESPAVLSTEAEETGKDVQSSCLPYNSKVENEIITFNFS 690
             + + N     S + L  G       +LS E E+   +                   N S
Sbjct: 237  AISQDND----SKECLTLG------DILSREDEQKSLNQD-----------------NIS 269

Query: 691  APEGVAASTGTAEDIEEQSTENLPTATSNEEDDSHKQ--SLVGNEENSTTSEGSNCANAP 864
            +      S    +D E++S E     T  E+ +  KQ    + +   +T+ E +   N P
Sbjct: 270  SDSHEEQSPSQLQDKEKRSLETTAIETELEKTEEPKQGEEKLSSVSTTTSQEPNKTCNEP 329

Query: 865  E--ASSEHKQANXXXXXXXXXXXXEPNAPPVVNQLQQD------MGERSFSAASM----- 1005
            E   +  H Q N                  V N  + D       GE SFSAA       
Sbjct: 330  EKPETENHHQQNCL----------------VENSYEDDKFSSSRFGETSFSAADSVSISG 373

Query: 1006 -IDYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHKGWRS 1182
             I YSGPIA+SGSLS RSD STTSG+SFAFP+LQSEWNSSPVRMAKAD+R  R+  GWR 
Sbjct: 374  HITYSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADKR--RQKGGWRH 431

Query: 1183 GLLCCRF 1203
             LLCCRF
Sbjct: 432  TLLCCRF 438


>ref|XP_006484258.1| PREDICTED: dentin sialophosphoprotein-like isoform X3 [Citrus
            sinensis]
          Length = 483

 Score =  174 bits (442), Expect = 5e-41
 Identities = 150/459 (32%), Positives = 212/459 (46%), Gaps = 65/459 (14%)
 Frame = +1

Query: 22   ESFDTVEDLADCNASDSRDSGSP---------------LFTDKNVLECDVPEFEVCYREN 156
            E   ++ DLA  N  + +D  SP                + DK+V EC++PE  VCY+EN
Sbjct: 46   ERSTSLNDLAKDNEKNVQDLESPNSHSCGEMESFREPVFYMDKSVTECELPELIVCYKEN 105

Query: 157  DCHLLKDICIDEGSPEKDVNAIESGL-----SPLPPKEDPLLDADSFTAEPCGSKEENDV 321
              H+ KDICIDEG    D    ES +     S LPPKED     +S   E    + +N V
Sbjct: 106  TYHV-KDICIDEGVHSHDRILFESDVGKSVRSFLPPKED----RNSELLE----ESKNSV 156

Query: 322  IKLISQEEKLDSSLKNLFDKDSIKHC----EPENTVETSEACFDETLSPEDSLADRKLPI 489
            I +    + L SS +N  D+  +  C    E ++  +  + C  + L P   + D     
Sbjct: 157  IPI---PDVLKSSAENYSDEHIVNRCGSSQESDSDEDIDDICDSKDLRPAGDVKD----- 208

Query: 490  EDSGDQHGLDEGNKVMQLSD-----QVLNEGAVSESPAVLSTEAEE------TGKDVQSS 636
             D+ +++  D   K+  L D      V  + ++S+S      +AE+      + K   ++
Sbjct: 209  -DATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDAEKESFQGSSAKAALAN 267

Query: 637  CLPYNSKVENEIITFNFSAPEGVAASTGTAEDIEEQSTENLPTATSNEEDDSHKQSLVGN 816
                N     EI+T          +  G  E I    T  L +A+    D S + SL   
Sbjct: 268  PEEANGGTAEEILTGADFVSASEESQNGCGEGISGNPT--LVSASEKAHDKSEEASLASP 325

Query: 817  EENSTTSEGSNC----------------------ANAPEASSEHK--QANXXXXXXXXXX 924
            +  S  SE +                        A+AP AS + +  Q            
Sbjct: 326  DGVSALSESTKISTAEKSSYNSMVETGSITFDFDASAPGASGKEEPLQIGDSQRIETPGM 385

Query: 925  XXEPNAP--PVVNQLQQDMGERSFSAA----SMIDYSGPIAFSGSLSHRSDGSTTSGKSF 1086
                +AP   V +Q    +GE SFSAA    S+I YSGP+A+SGS+S RSD STTS +SF
Sbjct: 386  SRLEDAPRQSVSSQFHSGLGESSFSAAGSLPSLISYSGPVAYSGSISLRSDSSTTSTRSF 445

Query: 1087 AFPVLQSEWNSSPVRMAKADRRDFRKHKGWRSGLLCCRF 1203
            AFP+LQ+EW+ SPVRMAKADRR +RKHK W+ GLLCCRF
Sbjct: 446  AFPILQTEWDRSPVRMAKADRRHYRKHK-WKQGLLCCRF 483


>ref|XP_006484256.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Citrus
            sinensis] gi|568861537|ref|XP_006484257.1| PREDICTED:
            dentin sialophosphoprotein-like isoform X2 [Citrus
            sinensis]
          Length = 496

 Score =  174 bits (442), Expect = 5e-41
 Identities = 150/459 (32%), Positives = 212/459 (46%), Gaps = 65/459 (14%)
 Frame = +1

Query: 22   ESFDTVEDLADCNASDSRDSGSP---------------LFTDKNVLECDVPEFEVCYREN 156
            E   ++ DLA  N  + +D  SP                + DK+V EC++PE  VCY+EN
Sbjct: 59   ERSTSLNDLAKDNEKNVQDLESPNSHSCGEMESFREPVFYMDKSVTECELPELIVCYKEN 118

Query: 157  DCHLLKDICIDEGSPEKDVNAIESGL-----SPLPPKEDPLLDADSFTAEPCGSKEENDV 321
              H+ KDICIDEG    D    ES +     S LPPKED     +S   E    + +N V
Sbjct: 119  TYHV-KDICIDEGVHSHDRILFESDVGKSVRSFLPPKED----RNSELLE----ESKNSV 169

Query: 322  IKLISQEEKLDSSLKNLFDKDSIKHC----EPENTVETSEACFDETLSPEDSLADRKLPI 489
            I +    + L SS +N  D+  +  C    E ++  +  + C  + L P   + D     
Sbjct: 170  IPI---PDVLKSSAENYSDEHIVNRCGSSQESDSDEDIDDICDSKDLRPAGDVKD----- 221

Query: 490  EDSGDQHGLDEGNKVMQLSD-----QVLNEGAVSESPAVLSTEAEE------TGKDVQSS 636
             D+ +++  D   K+  L D      V  + ++S+S      +AE+      + K   ++
Sbjct: 222  -DATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDAEKESFQGSSAKAALAN 280

Query: 637  CLPYNSKVENEIITFNFSAPEGVAASTGTAEDIEEQSTENLPTATSNEEDDSHKQSLVGN 816
                N     EI+T          +  G  E I    T  L +A+    D S + SL   
Sbjct: 281  PEEANGGTAEEILTGADFVSASEESQNGCGEGISGNPT--LVSASEKAHDKSEEASLASP 338

Query: 817  EENSTTSEGSNC----------------------ANAPEASSEHK--QANXXXXXXXXXX 924
            +  S  SE +                        A+AP AS + +  Q            
Sbjct: 339  DGVSALSESTKISTAEKSSYNSMVETGSITFDFDASAPGASGKEEPLQIGDSQRIETPGM 398

Query: 925  XXEPNAP--PVVNQLQQDMGERSFSAA----SMIDYSGPIAFSGSLSHRSDGSTTSGKSF 1086
                +AP   V +Q    +GE SFSAA    S+I YSGP+A+SGS+S RSD STTS +SF
Sbjct: 399  SRLEDAPRQSVSSQFHSGLGESSFSAAGSLPSLISYSGPVAYSGSISLRSDSSTTSTRSF 458

Query: 1087 AFPVLQSEWNSSPVRMAKADRRDFRKHKGWRSGLLCCRF 1203
            AFP+LQ+EW+ SPVRMAKADRR +RKHK W+ GLLCCRF
Sbjct: 459  AFPILQTEWDRSPVRMAKADRRHYRKHK-WKQGLLCCRF 496


>ref|XP_002514588.1| conserved hypothetical protein [Ricinus communis]
            gi|223546192|gb|EEF47694.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 488

 Score =  173 bits (439), Expect = 1e-40
 Identities = 135/399 (33%), Positives = 186/399 (46%), Gaps = 20/399 (5%)
 Frame = +1

Query: 67   DSRDSGSPLFTDKNVLECDVPEFEVCYRENDCHLLKDICIDEGSPEK-----DVNAIESG 231
            +S D  S  + DKNV+E ++PE  +CY+EN  H++KDIC+DEG P +     D +  +  
Sbjct: 104  ESFDKDSVFYIDKNVMEPELPELVLCYKENTYHVVKDICVDEGVPSQENFLFDTSVDQEK 163

Query: 232  LSP-LPPKEDPLLDADSFTAEPCGSKEENDVIKLISQEEKLDSSLK-NLFDKDSIKHCEP 405
            L P L P++D   +           KE  D+        K D+S K +  +  +I   E 
Sbjct: 164  LCPYLIPEKDIKSEIQ---------KERVDLDMSTQYLSKNDNSFKCDSKESMAIAEIED 214

Query: 406  ENTVETSEACFDETLSPEDSLADRKLPIEDSGDQHGLDEGNKVMQLSDQVLNEGAV--SE 579
            +   E +     ET S  + L   ++  E S  +  L+  ++  QLS Q  +E  V  + 
Sbjct: 215  DAMEEIANYTSKETFSLGELLLMPEVVAELSHSKSLLNSTDEAEQLSIQRPSENIVLATA 274

Query: 580  SPAVLSTEAEETGKDVQSSCLPYNSKVENEIITFNFSAPEGVAASTGTAED-------IE 738
            S    S  A E    V  +  P   +  +E         +    ++    D         
Sbjct: 275  SACEESKYATEQFLLVTPAVDPLVEESGHEEAKLGTLTSDSSPKASDHGHDEVILASLAP 334

Query: 739  EQSTENLPTATSNEEDDSHKQSLVGNEENSTTSEGSNCANAPEASSEHKQANXXXXXXXX 918
              +TE         +  SH    V +  +S  +       +    SEH ++         
Sbjct: 335  SYATEEPENGAKAAKSPSHTLDSVSDLNSSAPTASGGEEGSQVGGSEHLESRNSSRHEDT 394

Query: 919  XXXXEPNAPPVVNQLQQDMGERSFSAAS----MIDYSGPIAFSGSLSHRSDGSTTSGKSF 1086
                     P   QLQ   GE SFSAA     +I YSGPIA+SGSLS RSD STTS +SF
Sbjct: 395  SI-----TEPFSGQLQYSHGESSFSAAGPLSGLISYSGPIAYSGSLSLRSDSSTTSTRSF 449

Query: 1087 AFPVLQSEWNSSPVRMAKADRRDFRKHKGWRSGLLCCRF 1203
            AFP+LQSEWNSSPVRMAKADRR FRKH+ WR GLLCCRF
Sbjct: 450  AFPILQSEWNSSPVRMAKADRRHFRKHRSWRQGLLCCRF 488


>emb|CBI27399.3| unnamed protein product [Vitis vinifera]
          Length = 435

 Score =  172 bits (437), Expect = 2e-40
 Identities = 147/397 (37%), Positives = 197/397 (49%), Gaps = 18/397 (4%)
 Frame = +1

Query: 67   DSRDSGSPLFTDKNVLECDVPEFEVCYRENDCHLLKDICIDEG--SPEKDV--NAIES-- 228
            +S +    + TDK+V + ++P   VC  E+  H +KDICIDEG  SPEK +  N  E   
Sbjct: 107  ESFEKDGDMCTDKSVTKHELP---VCCEESTYHAVKDICIDEGMLSPEKILVENGKEEHE 163

Query: 229  GLSP-LPPKEDPLLDADSFTAE-----PCGSKE--ENDVIKLISQEEKLDSSLKNLFDKD 384
            G  P LPP  D  +D    TA+     P G K   END  K + QEE+      N   +D
Sbjct: 164  GFCPFLPPDTDKNVDPTKETADKELPLPDGQKASAENDCGKDLMQEEE------NYDARD 217

Query: 385  SIKHCEPENTVETSEACFDETLSPEDSLADRKLPIEDSGDQHGLDEGNKVMQLSDQVLNE 564
             I         +TSE    E + PED     +L   +S  +     G ++     Q  N 
Sbjct: 218  KI-------ISDTSE----EKIVPEDIFLIPELSKANSMPESSEFNGMEIEHQCIQNPNG 266

Query: 565  GAVSESPAVLSTEAEETGKDVQSSCLPYNSKVENEIITFNFSAPEGVAASTGTAEDIEEQ 744
             AV E+PA++S EAEE+ K+   + L YNSK+E+  ITF+F +      ST + +   E 
Sbjct: 267  EAVLENPALVS-EAEESDKNSFPNELSYNSKLESGTITFDFGS------STTSMDSGREV 319

Query: 745  STENLPTATSNEEDDSHKQSLVGNEENSTTSEGSNCANAPEASSEHKQANXXXXXXXXXX 924
            S +N        E     Q+L   E+ S +                              
Sbjct: 320  SPQN-----DGCEPPLESQNLSKLEDGSESL----------------------------- 345

Query: 925  XXEPNAPPVVNQLQQDMGERSFSAA----SMIDYSGPIAFSGSLSHRSDGSTTSGKSFAF 1092
                   P   Q+Q+ +GE SFSAA    ++I YSG I  SG++S RSD STTS +SFAF
Sbjct: 346  -------PFSGQIQRGLGESSFSAAGPSSALISYSGQITHSGNISLRSDSSTTSTRSFAF 398

Query: 1093 PVLQSEWNSSPVRMAKADRRDFRKHKGWRSGLLCCRF 1203
            PVLQ+EWNSSPVRMAKA+RR  RKH+ WR G+LCCRF
Sbjct: 399  PVLQTEWNSSPVRMAKAERRHLRKHRSWRRGILCCRF 435


>gb|AAM77645.1|AF517846_1 hypothetical protein [Arabidopsis thaliana]
            gi|41059759|gb|AAR99354.1| hypothetical protein At2g03810
            [Arabidopsis thaliana]
          Length = 439

 Score =  172 bits (435), Expect = 3e-40
 Identities = 141/427 (33%), Positives = 195/427 (45%), Gaps = 31/427 (7%)
 Frame = +1

Query: 16   DRESFDTVEDLA-DCNAS-DSRDSGSPLF-TDKNVLECDVPEFEVCYRENDCHLLKDICI 186
            + E+   V D + DC+A+ DS +   P+F  DKNV  CD+PE   CY+EN  H++KDIC+
Sbjct: 61   ENEAGKKVRDTSHDCDANVDSPEKKDPVFYMDKNVTACDLPEIVACYKENTYHIVKDICV 120

Query: 187  DEGSPEKDVNAIESGLSPLPPKEDPLLDADSFTAEPCGSKEENDVIKLISQEE-----KL 351
            DE  P ++        S      + L+ AD     P  +K   D I  +   E     K 
Sbjct: 121  DESVPVQEKFLFGEKDSVKSSSTEDLMKADKTNVNPSETKSAEDSISKVDDSEFCNDHKT 180

Query: 352  DSSLKNLFDKD------SIKHCEPENTVETSEACFDETLSPEDSLADRKL-PIEDSGDQH 510
            D  ++    +D      +  +   E+ + T E       SP   L+  ++ P E+S D+ 
Sbjct: 181  DRDVEESSGEDFADAEGTSSNYNQEHLIVTEEV----XASPTHGLSPSEIEPDENSKDEV 236

Query: 511  GLDEGNKVMQLSDQVLNEGAVSESPAVLSTEAEETGKDVQSSCLPYNSKVENEIITFNFS 690
             + + N     S + L  G       +LS E E+   +                   N S
Sbjct: 237  AISQDND----SKECLTLG------DILSREDEQKSLNQD-----------------NIS 269

Query: 691  APEGVAASTGTAEDIEEQSTENLPTATSNEEDDSHKQ--SLVGNEENSTTSEGSNCANAP 864
            +      S    +D E++S E     T  E+ +  KQ    + +   +T+ E +   N P
Sbjct: 270  SDSHEEQSPSQLQDKEKRSLETTAIETELEKTEEPKQGEEKLSSVSTTTSQEPNKTCNEP 329

Query: 865  E--ASSEHKQANXXXXXXXXXXXXEPNAPPVVNQLQQD------MGERSFSAASM----- 1005
            E   +  H Q N                  V N  + D       GE SFSAA       
Sbjct: 330  EKPETENHHQQNCL----------------VENSYEDDKFSSSRFGETSFSAADSVSISG 373

Query: 1006 -IDYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHKGWRS 1182
             I YSGPIA+SGSLS RSD STTSG+SFAFP+LQSEWNSSPVRMAKAD+R  R+  GWR 
Sbjct: 374  HITYSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADKR--RQKGGWRH 431

Query: 1183 GLLCCRF 1203
             LLCCRF
Sbjct: 432  TLLCCRF 438


>ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arabidopsis lyrata subsp.
            lyrata] gi|297321067|gb|EFH51488.1| hypothetical protein
            ARALYDRAFT_484289 [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  169 bits (429), Expect = 2e-39
 Identities = 142/427 (33%), Positives = 192/427 (44%), Gaps = 33/427 (7%)
 Frame = +1

Query: 16   DRESFDTVEDLA-DCNAS-DSRDSGSPLF-TDKNVLECDVPEFEVCYRENDCHLLKDICI 186
            D E+   V D++ DC+A+ DS D   P+F  DKNV  CD+PE  VCY+EN  H++KDIC+
Sbjct: 61   DNEAGKKVRDISHDCDANVDSPDKKDPVFYMDKNVTACDLPEIVVCYKENTYHVVKDICV 120

Query: 187  DEGSPEKDVNAIESGLSPLPPKEDPLLDADSFTAEPCGSKEENDVIKLISQEE-----KL 351
            DEG P ++        S      + L  AD     P  SK   D    +   E     K 
Sbjct: 121  DEGVPVQEKFLFGEKDSVKSSSTEDLTKADKTNVNPSESKSAEDSNTKVDDSEFCNNCKT 180

Query: 352  D-----SSLKNLFDKDSIKHCEPENTVETSEACFDETLSPEDSLADRKL-PIEDSGDQHG 513
            D     SS ++  D +       E+ + T EA      SP   L   ++ P E+S D+  
Sbjct: 181  DRDVEESSREDFADAEGSSAYNQEHLIVTEEA----KASPSHGLNPSEIEPDENSNDEVA 236

Query: 514  LD---EGNKVMQLSDQVLNEGAVSESPAVLSTEAEETGKDVQSSCLPYNSKVENEIITFN 684
            +    +  + + L D             +LS E E+                   +   N
Sbjct: 237  ISSETDSKESLTLGD-------------ILSREDEQ-----------------KSLNHGN 266

Query: 685  FSAPEGVAASTGTAEDIEEQSTENLPTATSNEEDDSHK--QSLVGNEENSTTSEGSNCAN 858
             S+      S    +D E++S E     T  E+ +  K  +  + +   +T  E +   N
Sbjct: 267  ISSDSHEEQSPSQLQDKEKRSLETAAIETELEKTEEPKPVEEKLPSASTTTLQEPNKTCN 326

Query: 859  APEA--SSEHKQANXXXXXXXXXXXXEPNAPPVVNQLQQD------MGERSFSAASMID- 1011
             PE   +  H Q N                  V N  + D       GE SFSAA  +  
Sbjct: 327  DPEKPETENHHQQNSL----------------VENSYEDDKLSSSRFGETSFSAAESVSI 370

Query: 1012 -----YSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHKGW 1176
                 YSGPIA+SGSLS RSD STTSG+SFAFP+LQSEWNSSPVRMAKAD+R  R+  GW
Sbjct: 371  SGHITYSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADKR--RQKGGW 428

Query: 1177 RSGLLCC 1197
            R  LLCC
Sbjct: 429  RHTLLCC 435


>ref|XP_006395670.1| hypothetical protein EUTSA_v10004181mg [Eutrema salsugineum]
            gi|567142661|ref|XP_006395671.1| hypothetical protein
            EUTSA_v10004181mg [Eutrema salsugineum]
            gi|557092309|gb|ESQ32956.1| hypothetical protein
            EUTSA_v10004181mg [Eutrema salsugineum]
            gi|557092310|gb|ESQ32957.1| hypothetical protein
            EUTSA_v10004181mg [Eutrema salsugineum]
          Length = 458

 Score =  167 bits (423), Expect = 8e-39
 Identities = 131/404 (32%), Positives = 182/404 (45%), Gaps = 25/404 (6%)
 Frame = +1

Query: 67   DSRDSGSPLF-TDKNVLECDVPEFEVCYRENDCHLLKDICIDEGSP--------EKDVNA 219
            DS +   P+F  DKNV  CD+PE  VCY+EN  H++KDIC+DEG P        EKD  +
Sbjct: 96   DSLEKLDPVFYMDKNVTACDLPEIVVCYKENTYHVVKDICVDEGVPVQEKFLFGEKD--S 153

Query: 220  IESGLSPLPPKEDPLLDADSFTAEPCGSKEENDVIKLISQEEKL----------DSSLKN 369
            ++   +    + + L++AD  ++    SK   D    +   E            +SS + 
Sbjct: 154  VKCSSNSNKCESEDLMEADKASSNLLESKSLEDRNSKLDDSELCNGTKTNRDVEESSREE 213

Query: 370  LFDKDSIKHCEPENTVETSEACFDETLSPEDSLADRKLPIEDSGDQHGLDEGNKVMQLSD 549
              D +   +C  E+   T EA    T     S    ++  +++  +H            +
Sbjct: 214  FADAEGSSNCNQEHLTVTREAKDSPTHGVNHSEISHEIESDENSKKH------------E 261

Query: 550  QVLNEGAVSESPAVLSTEAEETGKDVQSSCLPYNSKVENEIITFNFSAPEGVAASTGTAE 729
               +E  VSE    L         D+ S       + E + +  N S+      S    +
Sbjct: 262  VATSENVVSECCLTLG--------DILS------REDEQKHLNNNNSSNRREEHSPPLLQ 307

Query: 730  DIEEQSTENLPTATSNEEDDSHKQSLVGNEENSTTSEGSNCANAPEASSEHKQANXXXXX 909
            ++E++S E  P  T   +    K S V     +T+ E +   N PE      Q       
Sbjct: 308  EMEKRSLETTPLETEEPKQAEEKLSSV---STTTSQEPNKTCNDPERPETENQQQPKLRV 364

Query: 910  XXXXXXXEPNAPPVVNQLQQDMGERSFSA------ASMIDYSGPIAFSGSLSHRSDGSTT 1071
                   +              GE SFSA      +  I YSGPIAFSGSLS RSD STT
Sbjct: 365  EDSYEDDK--------LFSSGFGETSFSASEPVSISGHITYSGPIAFSGSLSVRSDASTT 416

Query: 1072 SGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHKGWRSGLLCCRF 1203
            SG+SFAFP+LQSEWNSSPVRMAKAD+   R+ KGWR  LLCCRF
Sbjct: 417  SGRSFAFPILQSEWNSSPVRMAKADK---RRQKGWRHILLCCRF 457


>ref|XP_006291111.1| hypothetical protein CARUB_v10017222mg [Capsella rubella]
            gi|482559818|gb|EOA24009.1| hypothetical protein
            CARUB_v10017222mg [Capsella rubella]
          Length = 455

 Score =  165 bits (418), Expect = 3e-38
 Identities = 129/401 (32%), Positives = 191/401 (47%), Gaps = 22/401 (5%)
 Frame = +1

Query: 67   DSRDSGSPLF-TDKNVLECDVPEFEVCYRENDCHLLKDICIDEGSPEKDVNAIESGLSPL 243
            DS +  +P+F  DKNV  CD+PE  VCY+EN  H++KDIC+DEG P ++          L
Sbjct: 96   DSPEKVNPVFYMDKNVTACDLPEIVVCYKENSYHVVKDICVDEGVPVQE--------KFL 147

Query: 244  PPKEDPLLDADSFTAEPCGSKEENDVIKLISQEEKLDSSLKNLFDKDSIKHCEPENTVET 423
              ++D +    +  +  CGS    D++K+   + K  S  K+L D +S          ++
Sbjct: 148  FGEKDSVKSTTN--SNHCGSV---DLMKVDKTDVK-PSETKSLEDSNS-------KVDDS 194

Query: 424  SEACFDETLSPEDSLADRKLPIEDSGDQHGLDEGNKVMQLSDQVLNEGAVS---ESPAVL 594
            SE C D+T+  +D     +    D+      D+ + ++      L    +S   ES  + 
Sbjct: 195  SEVCNDKTV--QDVEESSREAFADAEGSSNYDQEHLIVTSPTLALKPSEISLEVESEEIS 252

Query: 595  STEAEETGKDVQSSCLPYNSKVENE-----IITFNFSAPEGVAASTGTAEDIEEQSTENL 759
              E   + +D  S  L     +  E     +   N + PE ++      ++     T  L
Sbjct: 253  KDEVVISSEDFLSESLTLGDILSREDKQKSLKNDNGNRPEELSPPQHQEKEKRSLETTGL 312

Query: 760  PTATSNEEDDSHKQSLVGNEENSTTSE-GSNCANAPEASSEHKQANXXXXXXXXXXXXEP 936
             T     E+    +  + +   +T  E   +C +  +  +E+ Q N              
Sbjct: 313  DTKLEKVEEPKTAEENLSSASTTTVQEPNKSCNDLEKPETENHQQNR------------- 359

Query: 937  NAPPVVNQLQQD------MGERSFSAASMID------YSGPIAFSGSLSHRSDGSTTSGK 1080
                +VN  + D       GE SFSAA  +       YSGPIA+SGSLS RSD STTSG+
Sbjct: 360  ----LVNSYEDDKLSSSRFGETSFSAAESVSISGHITYSGPIAYSGSLSVRSDASTTSGR 415

Query: 1081 SFAFPVLQSEWNSSPVRMAKADRRDFRKHKGWRSGLLCCRF 1203
            SFAFP+LQSEWNSSPVRMAKAD+R  R+  GWR  LLCC+F
Sbjct: 416  SFAFPILQSEWNSSPVRMAKADKR--RQKGGWRHTLLCCKF 454


>ref|XP_006348792.1| PREDICTED: dentin sialophosphoprotein-like [Solanum tuberosum]
          Length = 586

 Score =  162 bits (409), Expect = 4e-37
 Identities = 130/427 (30%), Positives = 194/427 (45%), Gaps = 31/427 (7%)
 Frame = +1

Query: 16   DRESFDTVEDLADCNASDSRDSGSPLFTDKNVLECDVPEFEVCYRENDCHLLKDICIDEG 195
            D+E+     D    + S+  DS    ++DK V + ++PE  VCYREN+ +++KDIC+DEG
Sbjct: 195  DKENETIDSDSPFTSHSELFDSNKHFYSDKGVTDHELPELTVCYRENNFNMVKDICMDEG 254

Query: 196  SPEKDVNAIESGLSPLPPKEDPLLDADSFTAEPCGSKEENDVIKLISQEEK--------- 348
             P  D   IES     P     +   +   +    S +    I  +SQ+           
Sbjct: 255  VPAVDKVLIESWKDGQPSTSVSVDADEEQQSNTRKSVDMGSTIASVSQDSSFKDAKNIAV 314

Query: 349  ----------------LDSSLKNLFDKDSIKHCEPENTVETSEACFDETLSPEDSLADRK 480
                             + SL+N  +KD+ K    E+ +    +      S + S  +  
Sbjct: 315  THDTEIEATGAPVPNGFNPSLENNANKDADKDSYLEDLLMIFGSKCTTNASEKPSSLNTV 374

Query: 481  LPIEDSGDQHGLDEGNKVMQLSDQVLNEGAVSESPAVLSTEAEETGKDVQSSCLPYNSKV 660
            + +E+S  +    +G++     DQV +E  +    AV ++       +++         V
Sbjct: 375  VRVEESNIK--TSDGDQSTLQPDQVPSEQTLKSQTAVSASGQTNNKGNIKEG-------V 425

Query: 661  ENEIITFNFSAPEGVAASTGTAEDIEEQSTENLPTATSNEEDDSHKQSLVGNEENSTTSE 840
               I   N + PE    + G   ++ E S  ++P A S      HK    GN +N++ S 
Sbjct: 426  GTSIFDVNLTKPESTKTTEGGVGNLPEDS--HMPKAVS-----VHKN---GNSDNNSASS 475

Query: 841  GSNCAN-APEASSEHKQANXXXXXXXXXXXXEPNAPPVVNQLQQDMGERSFSAA-----S 1002
                AN A  A  +H ++                      Q     GE SFSAA      
Sbjct: 476  QVPFANTADNAHQQHLESQNMAN----------------GQSHFADGEASFSAARGPISG 519

Query: 1003 MIDYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHKGWRS 1182
             I YSGPI++SGS+S RS+ STTS +SFAFPVLQ+EWNSSPVRMAKA+RR   K KGW+ 
Sbjct: 520  SITYSGPISYSGSVSLRSESSTTSTRSFAFPVLQNEWNSSPVRMAKAERRRLSKQKGWKQ 579

Query: 1183 GLLCCRF 1203
            G+LCCRF
Sbjct: 580  GILCCRF 586


>ref|XP_007222119.1| hypothetical protein PRUPE_ppa004630mg [Prunus persica]
            gi|462419055|gb|EMJ23318.1| hypothetical protein
            PRUPE_ppa004630mg [Prunus persica]
          Length = 499

 Score =  160 bits (405), Expect = 1e-36
 Identities = 135/466 (28%), Positives = 210/466 (45%), Gaps = 87/466 (18%)
 Frame = +1

Query: 67   DSRDSGSPLFTDKNVLECDVPEFEVCYRENDCHLLKDICIDEGSPEKDVNAIESGLSP-- 240
            ++ +  S  + DK+V+EC++PE  VCY+E+ C+ +KDICIDEG P +D N  E+G+    
Sbjct: 47   EALEKESDYYMDKSVMECELPELIVCYKESSCNTIKDICIDEGVPSQDKNRFETGVDEKE 106

Query: 241  ----LPPKEDPLLDADSFTAEPCGSKEENDVIKLISQEEKLDSSLKNLFDKDSIKHCEPE 408
                L P ED               +E+ D++  ++  +   SS  +  +K  +  C+ +
Sbjct: 107  CCTFLSPDEDQNKQL---------LEEQMDIV--VTLPDGFKSSAHDDLEKGFVIPCDSK 155

Query: 409  NTVETSEACF------DETLSPEDSLADRKLPIED--SGDQHGLDEGNK--------VMQ 540
               +  +A +      +  +S E       LP+++  +G+ H     N+         +Q
Sbjct: 156  GLTQIGDAIYYTQEKTEIEVSKEIFFPANVLPMQELGAGNAHSSKSSNEESTEAVQDTVQ 215

Query: 541  LSDQVLNEGAVSESPAVLSTEAEETGKDVQSSC------------LPYNSKVENEIITFN 684
             S + ++E A + S AV+S   E +  + ++              L  NSKVEN   T  
Sbjct: 216  SSGEKVSEIAQTGSTAVVSVTEESSHSEKKALVSAAEESNFHVDELSNNSKVENGSTTSG 275

Query: 685  FSAPEGVAASTGTA---EDIEEQ-STENLPTATSNEEDDSHKQS---------------L 807
             S      ++T  A    D+ +   T+ +P     +++D +                  +
Sbjct: 276  LSDTSVHVSTTRDACPDNDVHKHFETQTMPAGDDGDDNDDNMPDAEIVPSQVQPCSAPVV 335

Query: 808  VGNEE------------NSTTSEGSNCANAPEASSE--HKQANXXXXXXXXXXXXEPNAP 945
             G EE            +ST+       ++   SS+  H  A                 P
Sbjct: 336  TGREECPENGVCQPLDTSSTSKVDDEIPHSVIVSSQVQHYSAPVTISREERPENGVWQCP 395

Query: 946  PVVN----------------QLQQDMGERSFSAA----SMIDYSGPIAFSGSLSHRSDGS 1065
               N                 +Q+  GE SFSAA    S+++ SGP  +SG++S RS+ S
Sbjct: 396  ETSNAFMVGDVNSDTQYASFHVQRGFGESSFSAAGHFSSLMNTSGP--YSGNVSLRSESS 453

Query: 1066 TTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHKGWRSGLLCCRF 1203
            TTS +SFAFPVLQSEWNSSPVRMAKADRR  RKH+GW   LLCCRF
Sbjct: 454  TTSTRSFAFPVLQSEWNSSPVRMAKADRRHLRKHRGWGHSLLCCRF 499


>ref|XP_002311412.1| 18S pre-ribosomal assembly protein gar2 [Populus trichocarpa]
            gi|222851232|gb|EEE88779.1| 18S pre-ribosomal assembly
            protein gar2 [Populus trichocarpa]
          Length = 486

 Score =  159 bits (401), Expect = 3e-36
 Identities = 136/431 (31%), Positives = 194/431 (45%), Gaps = 58/431 (13%)
 Frame = +1

Query: 85   SPLFTDKNVLECDVPEFEVCYRENDCHLLKDICIDEGSPEKD-----VNAIESGLSPLPP 249
            S  + DK+V+  +VPE  VCY+EN  H+ KDIC+DEG P +D      +A +  +    P
Sbjct: 71   SVFYMDKSVMVREVPELIVCYKENTYHV-KDICVDEGVPLQDKFLFDTDAHKKNMCEFLP 129

Query: 250  KEDPL--------LDADSFTAEPCGSKEENDVIKL-ISQEEKLDSSLKNLFDKDSIKHCE 402
             E  +         D D    E   S  E   + L +   + L SS +     D    C+
Sbjct: 130  SERDMNNEMVKEKSDLDMLIPEMLKSSSEKQNVDLHLPVPDVLISSEEKGSKHDLSLDCD 189

Query: 403  PENTVETSEACFDETLSPEDSLADRKLPIED--------------SGDQHGLDEGNKVMQ 540
            P++ + T E     T    D+ +   L + D              +   H +D   KV Q
Sbjct: 190  PKHLMPTEEVMDYGTKKVTDNASKEILSLRDLLSMSELGAKCTPANASYHNMD---KVEQ 246

Query: 541  LSDQVLNEGAVSESPAVLSTEAEETGKDVQSSCLPYNSKVENEIITFNFSAPEGVAASTG 720
             S     E A+ E+ +  S E+E  G++  S     ++ +E+  +      P       G
Sbjct: 247  QSLLCPRENAILETDSA-SEESEHCGEETIS-----DNGLESATLAIPTQDPAYQEGDHG 300

Query: 721  TAEDIEEQSTENLPTATSNEEDDSHKQSLVGNEENSTTSEGS------------------ 846
              E +        PT TS  E+   K++ + +    + SEGS                  
Sbjct: 301  HTEAVLVS-----PTLTSAAEESDSKETKLASHALDSFSEGSTSRIEDELPYNSKTETRS 355

Query: 847  ----NCANAPEASSEHKQANXXXXXXXXXXXX---EPNAPPVVN-QLQQDMGERSFSAAS 1002
                N ++AP AS+     N               +PNA  +   QLQ   GE SFS++ 
Sbjct: 356  ISFDNDSSAPAASARESPQNGESQRLGTRIVSRFEDPNAERLSGGQLQYADGESSFSSSG 415

Query: 1003 ----MIDYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHK 1170
                +  +SGPIA+SGS+S RSD STTS +SFAFP+LQSEWNSSP RMAKADRR F+K +
Sbjct: 416  PLFGLTSHSGPIAYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPARMAKADRRHFQKPR 475

Query: 1171 GWRSGLLCCRF 1203
             W  GLLCCRF
Sbjct: 476  KWMQGLLCCRF 486


>ref|XP_006363556.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Solanum
            tuberosum] gi|565395867|ref|XP_006363557.1| PREDICTED:
            dentin sialophosphoprotein-like isoform X2 [Solanum
            tuberosum] gi|565395869|ref|XP_006363558.1| PREDICTED:
            dentin sialophosphoprotein-like isoform X3 [Solanum
            tuberosum] gi|565395871|ref|XP_006363559.1| PREDICTED:
            dentin sialophosphoprotein-like isoform X4 [Solanum
            tuberosum]
          Length = 532

 Score =  155 bits (393), Expect = 3e-35
 Identities = 137/429 (31%), Positives = 204/429 (47%), Gaps = 33/429 (7%)
 Frame = +1

Query: 16   DRESFDTVEDLADCNASDSRDSGSPLFTDKNVLECDVPEFEVCYRENDCHLLKDICIDEG 195
            D+E    V      + S+   + + L+TDK VLE  +PE  +CY EN+ +++KDIC+DEG
Sbjct: 127  DKEKETVVSSAQFTSLSELFGTNTHLYTDKGVLEFKLPELTICYNENNYNIMKDICMDEG 186

Query: 196  SPEKDVNAIESGLSPLPPKEDPLLDADSFTAEPCGSKEENDVIKLISQEEKLDSSLKNLF 375
             P  D    ES     P     L   +    +P  ++E  D  +L+S  E  DSS++N  
Sbjct: 187  VPLMDKIVTESRKYHQPDSSISLAVDEH---QPRNTREGVDS-ELVSSGESKDSSVENAV 242

Query: 376  DKDSIKHCEPENTVETSEACFDETLSP--EDSLA-----DRKLPI--------------- 489
                  H   E+  E +++     ++P  ED+++     D  L +               
Sbjct: 243  KISVDHHTTKED--EDTKSLGPNGINPFLEDNMSKYADKDSSLDVMKIFGSKDTTTAKAT 300

Query: 490  ---EDSGDQHGLDEGNKVMQLS----DQVLNEGAVSESPAVLSTEAEETGKDVQSSCLPY 648
               E+  D   L E N   + S    +Q+    A   S   +S  A+ T  +   S    
Sbjct: 301  NISENESDIQNLKESNSDAEQSALQANQIPTFVAAFNSQNTVSA-ADGTNNNGPGSNFSN 359

Query: 649  NSKVENEIITFNFSAPEGVAASTGTAEDIEEQSTENLPTATSNEEDDSHKQSLVGNEENS 828
            NSK E+  IT +F+  E +A S+  A+     S ++LP       + SHK   V ++++ 
Sbjct: 360  NSKSESGAITCDFNLTE-LALSSSVAK-----SDKHLP-------EQSHKLEAVSSQKDG 406

Query: 829  TTSEGSNCANAPEASS-EHKQANXXXXXXXXXXXXEPNAPPVVNQLQQDM--GERSFSAA 999
            ++   S       A+S +   ++            E N+  +   +      GE SF  A
Sbjct: 407  SSDSFSAATQVHFANSVDSCNSSIHADPPNVANLEEKNSGSIPLGVHGHFANGEASFGPA 466

Query: 1000 S-MIDYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHKGW 1176
            S +I YSG I  SG++S RSD STTS +SFAFPVLQSEWNSSPVRMAKA+RR +   KGW
Sbjct: 467  SGLISYSGHITHSGNISLRSDSSTTSARSFAFPVLQSEWNSSPVRMAKAERRHY---KGW 523

Query: 1177 RSGLLCCRF 1203
            R  LLCC+F
Sbjct: 524  RQSLLCCKF 532


>ref|XP_004239019.1| PREDICTED: uncharacterized protein LOC101258367 isoform 2 [Solanum
            lycopersicum]
          Length = 554

 Score =  155 bits (393), Expect = 3e-35
 Identities = 142/459 (30%), Positives = 201/459 (43%), Gaps = 62/459 (13%)
 Frame = +1

Query: 13   PDRESFDTVEDLADCNASDSRDSGSPL-------------FTDKNVLECDVPEFEVCYRE 153
            P+ ES D ++D      +++ DS SP              ++DK V + ++ E  VCYRE
Sbjct: 147  PEYESLDFLDD----KGNETIDSDSPFTSHSELFENNKHFYSDKGVTDHELSELTVCYRE 202

Query: 154  NDCHLLKDICIDEGSPEKDVNAIESGLSPLPPKEDPL---LDADSFTAEPCGSKEENDV- 321
            N+ +++KDIC+DEG P  D    ES       K+D L   +  D+       +K+  D+ 
Sbjct: 203  NNFNIVKDICMDEGVPAVDKVLTESW------KDDQLSTSVSVDADEEHQSNTKKSVDMG 256

Query: 322  --IKLISQEEKLDS-------------------------SLKNLFDKDSIKHCEPENTVE 420
              I  +SQ+   +                          SL+N  +KD+ K    E+ + 
Sbjct: 257  SSIATVSQDSSCEDAKNIAVTHGAEIEPTGAPIPNDFNPSLENKANKDADKDSYLEDLLM 316

Query: 421  T-SEACFDE----TLSPEDSLADRKLPIEDSGDQHGLDEGNKVMQLSDQVLNEGAVSESP 585
                 C         S + S  +  + +E+S  +    +G++     DQV  +  +    
Sbjct: 317  IFGSKCTTNGKTTNASEKPSSPNTVVRVEESNIK--TSDGDQSTLQPDQVPFDQTLKSQT 374

Query: 586  AVLSTEAEETGKDVQSSCLPYNSK--VENEIITFNFSAPEGVAASTGTAEDIEEQSTENL 759
            A+ + +     K         NSK      I  FN + PE    + G          ENL
Sbjct: 375  AISAADESNNNKG--------NSKEGAGTNIFDFNLTKPESTTTTEG--------GVENL 418

Query: 760  PTATSNEEDDSHKQSLV-----GNEENSTTSEGSNCAN-APEASSEHKQANXXXXXXXXX 921
            P       +DSHK   V     GN +N + S     AN A  A  +H ++          
Sbjct: 419  P-------EDSHKPKAVSVHKNGNSDNISASSQVPFANTADNAHQQHLESQNMAN----- 466

Query: 922  XXXEPNAPPVVNQLQQDMGERSFSAA-----SMIDYSGPIAFSGSLSHRSDGSTTSGKSF 1086
                        Q     GE SFSAA       I YSGPI++SGSLS RS+ STTS +SF
Sbjct: 467  -----------GQGHFADGEASFSAARGPISGSITYSGPISYSGSLSLRSESSTTSTRSF 515

Query: 1087 AFPVLQSEWNSSPVRMAKADRRDFRKHKGWRSGLLCCRF 1203
            AFPVLQ+EWNSSPVRMAKA+RR   K KGW+ GLLCCRF
Sbjct: 516  AFPVLQNEWNSSPVRMAKAERRRLSKQKGWKQGLLCCRF 554


>ref|XP_004239018.1| PREDICTED: uncharacterized protein LOC101258367 isoform 1 [Solanum
            lycopersicum]
          Length = 586

 Score =  155 bits (393), Expect = 3e-35
 Identities = 142/459 (30%), Positives = 201/459 (43%), Gaps = 62/459 (13%)
 Frame = +1

Query: 13   PDRESFDTVEDLADCNASDSRDSGSPL-------------FTDKNVLECDVPEFEVCYRE 153
            P+ ES D ++D      +++ DS SP              ++DK V + ++ E  VCYRE
Sbjct: 179  PEYESLDFLDD----KGNETIDSDSPFTSHSELFENNKHFYSDKGVTDHELSELTVCYRE 234

Query: 154  NDCHLLKDICIDEGSPEKDVNAIESGLSPLPPKEDPL---LDADSFTAEPCGSKEENDV- 321
            N+ +++KDIC+DEG P  D    ES       K+D L   +  D+       +K+  D+ 
Sbjct: 235  NNFNIVKDICMDEGVPAVDKVLTESW------KDDQLSTSVSVDADEEHQSNTKKSVDMG 288

Query: 322  --IKLISQEEKLDS-------------------------SLKNLFDKDSIKHCEPENTVE 420
              I  +SQ+   +                          SL+N  +KD+ K    E+ + 
Sbjct: 289  SSIATVSQDSSCEDAKNIAVTHGAEIEPTGAPIPNDFNPSLENKANKDADKDSYLEDLLM 348

Query: 421  T-SEACFDE----TLSPEDSLADRKLPIEDSGDQHGLDEGNKVMQLSDQVLNEGAVSESP 585
                 C         S + S  +  + +E+S  +    +G++     DQV  +  +    
Sbjct: 349  IFGSKCTTNGKTTNASEKPSSPNTVVRVEESNIK--TSDGDQSTLQPDQVPFDQTLKSQT 406

Query: 586  AVLSTEAEETGKDVQSSCLPYNSK--VENEIITFNFSAPEGVAASTGTAEDIEEQSTENL 759
            A+ + +     K         NSK      I  FN + PE    + G          ENL
Sbjct: 407  AISAADESNNNKG--------NSKEGAGTNIFDFNLTKPESTTTTEG--------GVENL 450

Query: 760  PTATSNEEDDSHKQSLV-----GNEENSTTSEGSNCAN-APEASSEHKQANXXXXXXXXX 921
            P       +DSHK   V     GN +N + S     AN A  A  +H ++          
Sbjct: 451  P-------EDSHKPKAVSVHKNGNSDNISASSQVPFANTADNAHQQHLESQNMAN----- 498

Query: 922  XXXEPNAPPVVNQLQQDMGERSFSAA-----SMIDYSGPIAFSGSLSHRSDGSTTSGKSF 1086
                        Q     GE SFSAA       I YSGPI++SGSLS RS+ STTS +SF
Sbjct: 499  -----------GQGHFADGEASFSAARGPISGSITYSGPISYSGSLSLRSESSTTSTRSF 547

Query: 1087 AFPVLQSEWNSSPVRMAKADRRDFRKHKGWRSGLLCCRF 1203
            AFPVLQ+EWNSSPVRMAKA+RR   K KGW+ GLLCCRF
Sbjct: 548  AFPVLQNEWNSSPVRMAKAERRRLSKQKGWKQGLLCCRF 586

