BLASTX nr result

ID: Rheum21_contig00014854 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00014854
         (2158 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI27399.3| unnamed protein product [Vitis vinifera]              173   2e-40
ref|XP_006484258.1| PREDICTED: dentin sialophosphoprotein-like i...   170   3e-39
ref|XP_006484256.1| PREDICTED: dentin sialophosphoprotein-like i...   170   3e-39
ref|XP_002514588.1| conserved hypothetical protein [Ricinus comm...   164   1e-37
gb|EOY01581.1| 18S pre-ribosomal assembly protein gar2-related, ...   161   9e-37
ref|XP_002311412.1| 18S pre-ribosomal assembly protein gar2 [Pop...   156   3e-35
ref|XP_002266889.2| PREDICTED: uncharacterized protein LOC100247...   147   1e-32
gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thali...   147   2e-32
ref|NP_178475.2| 18S pre-ribosomal assembly protein gar2-related...   147   2e-32
ref|XP_004239019.1| PREDICTED: uncharacterized protein LOC101258...   146   3e-32
ref|XP_004239018.1| PREDICTED: uncharacterized protein LOC101258...   146   3e-32
gb|EOY01582.1| 18S pre-ribosomal assembly protein gar2-related, ...   145   7e-32
gb|AAM77645.1|AF517846_1 hypothetical protein [Arabidopsis thali...   142   8e-31
ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arab...   140   3e-30
ref|XP_006348792.1| PREDICTED: dentin sialophosphoprotein-like [...   137   1e-29
ref|XP_006291111.1| hypothetical protein CARUB_v10017222mg [Caps...   137   1e-29
ref|XP_006395670.1| hypothetical protein EUTSA_v10004181mg [Eutr...   135   7e-29
ref|XP_004144157.1| PREDICTED: uncharacterized protein LOC101217...   133   3e-28
ref|XP_004237062.1| PREDICTED: uncharacterized protein LOC101254...   133   4e-28
ref|XP_006363556.1| PREDICTED: dentin sialophosphoprotein-like i...   131   1e-27

>emb|CBI27399.3| unnamed protein product [Vitis vinifera]
          Length = 435

 Score =  173 bits (439), Expect = 2e-40
 Identities = 155/495 (31%), Positives = 237/495 (47%), Gaps = 37/495 (7%)
 Frame = -3

Query: 1574 MKLENDPGNLHPSPPRKSYN-PFDSDDDALDSPGLKSENMYVNVKENLNGFDLKAFDKSS 1398
            MKL+++P   H +   KS + PF+ +D +LD+   KS N  V   + +   DLK  ++ +
Sbjct: 1    MKLDSEPVFCHSTLVHKSDSKPFEYNDYSLDTAVPKSGNEIVKENQKVISCDLKGHERDA 60

Query: 1397 DTVIFHMDNGDFLW---EEDVLVQPATDDHAFDANSNVNKAYS----------------- 1278
            D     +D  D  W   E D  +    DD A    + V  + +                 
Sbjct: 61   DP----LDGEDRFWNTSERDCSIN--VDDIANACGNEVRNSVATCVVSSEKLESFEKDGD 114

Query: 1277 ---DKNVTECELPEFVICYKESN---IKDIGVDVGVPADDKLWVQSDKTDVD-----MKV 1131
               DK+VT+ ELP   +C +ES    +KDI +D G+ + +K+ V++ K + +     +  
Sbjct: 115  MCTDKSVTKHELP---VCCEESTYHAVKDICIDEGMLSPEKILVENGKEEHEGFCPFLPP 171

Query: 1130 EENNNQLGVVEAPKSTLDDEW-----NKAAALYSSDHPVNPDELNEEALTGQYLLHEFGD 966
            + + N    V+  K T D E       KA+A       +  +E N +A           D
Sbjct: 172  DTDKN----VDPTKETADKELPLPDGQKASAENDCGKDLMQEEENYDAR----------D 217

Query: 965  EDLLKTTELILDVEDRVCPVAELVADVADHHPSTEKEINHEDDAQEKKGHETSQGEDESA 786
            + +  T+E  +  ED +  + EL             + N   ++ E  G E      E  
Sbjct: 218  KIISDTSEEKIVPED-IFLIPEL------------SKANSMPESSEFNGMEI-----EHQ 259

Query: 785  TTQSKSQLLTEEIPNKEESLANSSGDAAMQESGDAHDPNKLAYNSKVESRSITFDFGSST 606
              Q+         PN E  L N +  +  +ES     PN+L+YNSK+ES +ITFDFGSST
Sbjct: 260  CIQN---------PNGEAVLENPALVSEAEESDKNSFPNELSYNSKLESGTITFDFGSST 310

Query: 605  NSEPEKQEDETEKAEASKPPSAQTMTRQEENQEGMISSSSILLARLEDRVHGETSFSSGM 426
             S    +E   +      P  +Q +++ E+  E +  S  I       R  GE+SFS+  
Sbjct: 311  TSMDSGREVSPQNDGCEPPLESQNLSKLEDGSESLPFSGQI------QRGLGESSFSAAG 364

Query: 425  PVPLPSLIAFSEPLTYSGSVSLRSESSAGTASTRSFAFPVLQTEWNSSPVRIGKTGQRHM 246
            P    +LI++S  +T+SG++SLRS+SS  T STRSFAFPVLQTEWNSSPVR+ K  +RH+
Sbjct: 365  PSS--ALISYSGQITHSGNISLRSDSS--TTSTRSFAFPVLQTEWNSSPVRMAKAERRHL 420

Query: 245  KKHRGWVQAFSCCKF 201
            +KHR W +   CC+F
Sbjct: 421  RKHRSWRRGILCCRF 435


>ref|XP_006484258.1| PREDICTED: dentin sialophosphoprotein-like isoform X3 [Citrus
            sinensis]
          Length = 483

 Score =  170 bits (430), Expect = 3e-39
 Identities = 143/475 (30%), Positives = 232/475 (48%), Gaps = 58/475 (12%)
 Frame = -3

Query: 1451 NVKENLNGFDLKAFDKSSDTVIFHMDNGDFLWEEDVLVQPATDDHAFDANSNVNKA--YS 1278
            +V  +++G  ++  ++S+       DN     E++V    + + H+     +  +   Y 
Sbjct: 32   HVTNDVDGCTVRKLERSTSLNDLAKDN-----EKNVQDLESPNSHSCGEMESFREPVFYM 86

Query: 1277 DKNVTECELPEFVICYKES--NIKDIGVDVGVPADDKLWVQSD-----KTDVDMKVEENN 1119
            DK+VTECELPE ++CYKE+  ++KDI +D GV + D++  +SD     ++ +  K + N+
Sbjct: 87   DKSVTECELPELIVCYKENTYHVKDICIDEGVHSHDRILFESDVGKSVRSFLPPKEDRNS 146

Query: 1118 NQLGVVEAPKSTLDDEWNKAAALYSSDHPVNPDELNEEALTGQYL--------LHEFGDE 963
              L   +     + D    +A  YS +H VN    ++E+ + + +        L   GD 
Sbjct: 147  ELLEESKNSVIPIPDVLKSSAENYSDEHIVNRCGSSQESDSDEDIDDICDSKDLRPAGDV 206

Query: 962  DLLKTTELILDVEDRVCPVAELVA--DVADHHPSTEKEINHEDDAQEK------------ 825
                T E   DV  ++  + +L++  +V   +  ++  I +E DA+++            
Sbjct: 207  KDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDAEKESFQGSSAKAALA 266

Query: 824  KGHETSQGEDESATTQSKSQLLTEEIPN-----------------------KEESLANSS 714
               E + G  E   T +     +EE  N                       +E SLA+  
Sbjct: 267  NPEEANGGTAEEILTGADFVSASEESQNGCGEGISGNPTLVSASEKAHDKSEEASLASPD 326

Query: 713  GDAAMQESGDAHDPNKLAYNSKVESRSITFDFGSSTNSEPEKQED-ETEKAEASKPPSAQ 537
            G +A+ ES       K +YNS VE+ SITFDF +S      K+E  +   ++  + P   
Sbjct: 327  GVSALSESTKISTAEKSSYNSMVETGSITFDFDASAPGASGKEEPLQIGDSQRIETPG-- 384

Query: 536  TMTRQEENQEGMISSSSILLARLEDRVH---GETSFSSGMPVPLPSLIAFSEPLTYSGSV 366
             M+R E+     +SS          + H   GE+SFS+     LPSLI++S P+ YSGS+
Sbjct: 385  -MSRLEDAPRQSVSS----------QFHSGLGESSFSAA--GSLPSLISYSGPVAYSGSI 431

Query: 365  SLRSESSAGTASTRSFAFPVLQTEWNSSPVRIGKTGQRHMKKHRGWVQAFSCCKF 201
            SLRS+SS  T STRSFAFP+LQTEW+ SPVR+ K  +RH +KH+ W Q   CC+F
Sbjct: 432  SLRSDSS--TTSTRSFAFPILQTEWDRSPVRMAKADRRHYRKHK-WKQGLLCCRF 483


>ref|XP_006484256.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Citrus
            sinensis] gi|568861537|ref|XP_006484257.1| PREDICTED:
            dentin sialophosphoprotein-like isoform X2 [Citrus
            sinensis]
          Length = 496

 Score =  170 bits (430), Expect = 3e-39
 Identities = 143/475 (30%), Positives = 232/475 (48%), Gaps = 58/475 (12%)
 Frame = -3

Query: 1451 NVKENLNGFDLKAFDKSSDTVIFHMDNGDFLWEEDVLVQPATDDHAFDANSNVNKA--YS 1278
            +V  +++G  ++  ++S+       DN     E++V    + + H+     +  +   Y 
Sbjct: 45   HVTNDVDGCTVRKLERSTSLNDLAKDN-----EKNVQDLESPNSHSCGEMESFREPVFYM 99

Query: 1277 DKNVTECELPEFVICYKES--NIKDIGVDVGVPADDKLWVQSD-----KTDVDMKVEENN 1119
            DK+VTECELPE ++CYKE+  ++KDI +D GV + D++  +SD     ++ +  K + N+
Sbjct: 100  DKSVTECELPELIVCYKENTYHVKDICIDEGVHSHDRILFESDVGKSVRSFLPPKEDRNS 159

Query: 1118 NQLGVVEAPKSTLDDEWNKAAALYSSDHPVNPDELNEEALTGQYL--------LHEFGDE 963
              L   +     + D    +A  YS +H VN    ++E+ + + +        L   GD 
Sbjct: 160  ELLEESKNSVIPIPDVLKSSAENYSDEHIVNRCGSSQESDSDEDIDDICDSKDLRPAGDV 219

Query: 962  DLLKTTELILDVEDRVCPVAELVA--DVADHHPSTEKEINHEDDAQEK------------ 825
                T E   DV  ++  + +L++  +V   +  ++  I +E DA+++            
Sbjct: 220  KDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDAEKESFQGSSAKAALA 279

Query: 824  KGHETSQGEDESATTQSKSQLLTEEIPN-----------------------KEESLANSS 714
               E + G  E   T +     +EE  N                       +E SLA+  
Sbjct: 280  NPEEANGGTAEEILTGADFVSASEESQNGCGEGISGNPTLVSASEKAHDKSEEASLASPD 339

Query: 713  GDAAMQESGDAHDPNKLAYNSKVESRSITFDFGSSTNSEPEKQED-ETEKAEASKPPSAQ 537
            G +A+ ES       K +YNS VE+ SITFDF +S      K+E  +   ++  + P   
Sbjct: 340  GVSALSESTKISTAEKSSYNSMVETGSITFDFDASAPGASGKEEPLQIGDSQRIETPG-- 397

Query: 536  TMTRQEENQEGMISSSSILLARLEDRVH---GETSFSSGMPVPLPSLIAFSEPLTYSGSV 366
             M+R E+     +SS          + H   GE+SFS+     LPSLI++S P+ YSGS+
Sbjct: 398  -MSRLEDAPRQSVSS----------QFHSGLGESSFSAA--GSLPSLISYSGPVAYSGSI 444

Query: 365  SLRSESSAGTASTRSFAFPVLQTEWNSSPVRIGKTGQRHMKKHRGWVQAFSCCKF 201
            SLRS+SS  T STRSFAFP+LQTEW+ SPVR+ K  +RH +KH+ W Q   CC+F
Sbjct: 445  SLRSDSS--TTSTRSFAFPILQTEWDRSPVRMAKADRRHYRKHK-WKQGLLCCRF 496


>ref|XP_002514588.1| conserved hypothetical protein [Ricinus communis]
            gi|223546192|gb|EEF47694.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 488

 Score =  164 bits (416), Expect = 1e-37
 Identities = 158/520 (30%), Positives = 243/520 (46%), Gaps = 62/520 (11%)
 Frame = -3

Query: 1574 MKLENDPGNLHPSPPRKSYNP-FDSDDDALDSPGLKSENMYVNVKENLNGFDLKAFDKSS 1398
            MKL+++    H +   KS +  F  +  ALDS GL+S N+ VN  EN   +DLKA + ++
Sbjct: 1    MKLDSEQVLCHGTGDHKSISKSFGYNKIALDSSGLRSGNVIVNEDENGPFYDLKAREGNT 60

Query: 1397 DTVIFHMDNGDFLW------------------EEDVLVQPATDDHAFDANSNVNKAYSDK 1272
            D  + ++ NG+  W                  EE+V    +    +FD +S     Y DK
Sbjct: 61   DQ-LHYLVNGEDGWNASKLDSCTGVNVSIHDKEEEVRNFTSLKIESFDKDSVF---YIDK 116

Query: 1271 NVTECELPEFVICYKESN---IKDIGVDVGVPADDKLW----VQSDK------TDVDMKV 1131
            NV E ELPE V+CYKE+    +KDI VD GVP+ +       V  +K       + D+K 
Sbjct: 117  NVMEPELPELVLCYKENTYHVVKDICVDEGVPSQENFLFDTSVDQEKLCPYLIPEKDIKS 176

Query: 1130 EENNNQLGVVEAPKSTLDDEWNKAAALYSSDHPVNPDELNEEALTGQYLLHEFGDEDLLK 951
            E    +   V+   ST     N  +    S   +   E+ ++A+     +  +  ++   
Sbjct: 177  EIQKER---VDLDMSTQYLSKNDNSFKCDSKESMAIAEIEDDAMEE---IANYTSKETFS 230

Query: 950  TTELILDVEDRVCPVAELVADVADHHPSTEKEINHEDDAQEKKGHETSQG---------- 801
              EL+L        + E+VA+++     ++  +N  D+A++      S+           
Sbjct: 231  LGELLL--------MPEVVAELS----HSKSLLNSTDEAEQLSIQRPSENIVLATASACE 278

Query: 800  EDESATTQ-----SKSQLLTEEIPNKEESLANSSGDAAMQESGDAHDPNKLA-----YNS 651
            E + AT Q          L EE  ++E  L   + D++ + S   HD   LA     Y +
Sbjct: 279  ESKYATEQFLLVTPAVDPLVEESGHEEAKLGTLTSDSSPKASDHGHDEVILASLAPSYAT 338

Query: 650  K--------VESRSITFDFGSSTNSEPEKQE--DETEKAEASKPPSAQTMTRQEENQEGM 501
            +         +S S T D  S  NS        +E  +   S+   ++  +R E+     
Sbjct: 339  EEPENGAKAAKSPSHTLDSVSDLNSSAPTASGGEEGSQVGGSEHLESRNSSRHEDTSITE 398

Query: 500  ISSSSILLARLEDRVHGETSFSSGMPVPLPSLIAFSEPLTYSGSVSLRSESSAGTASTRS 321
              S  +  +      HGE+SFS+    PL  LI++S P+ YSGS+SLRS+SS  T STRS
Sbjct: 399  PFSGQLQYS------HGESSFSAA--GPLSGLISYSGPIAYSGSLSLRSDSS--TTSTRS 448

Query: 320  FAFPVLQTEWNSSPVRIGKTGQRHMKKHRGWVQAFSCCKF 201
            FAFP+LQ+EWNSSPVR+ K  +RH +KHR W Q   CC+F
Sbjct: 449  FAFPILQSEWNSSPVRMAKADRRHFRKHRSWRQGLLCCRF 488


>gb|EOY01581.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 1
            [Theobroma cacao]
          Length = 527

 Score =  161 bits (408), Expect = 9e-37
 Identities = 151/547 (27%), Positives = 250/547 (45%), Gaps = 89/547 (16%)
 Frame = -3

Query: 1574 MKLENDPGNLHPSPPRKSYN----------PFDSDDDALDSPGLKSENMYVNVKENLNGF 1425
            MKL+N+    H     KS +          PF++ D  LDS GL +E +   VKEN NG 
Sbjct: 1    MKLDNEQVLCHSITGHKSDSKPYSFLADTKPFENKDKPLDSTGLNAEGV---VKENQNGV 57

Query: 1424 --DLKAFDKSSDTVIFHMDNGDFLWE---------------------EDVLVQPATDDHA 1314
              D+K  D  SD  ++ +DN    W                       D +   +     
Sbjct: 58   MHDIKGNDGDSDPSLY-LDNTRGGWPALKLDCSISVNDFANGNEKEVRDFVTSNSPSLKN 116

Query: 1313 FDANSNVNKAYSDKNVTECELPEFVICYKESN---IKDIGVDVGVPADDKLWVQS----- 1158
             D+  N +  Y DK+V ECELPE V+CYKES    +KDI +D GVP  DK   ++     
Sbjct: 117  MDSFQN-SVFYLDKSVMECELPELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEK 175

Query: 1157 ------------------DKTDVDMKVEENNNQLGVVEAPKSTLDDEWNKAAALYSSDHP 1032
                              +K + DM +++ +   G  ++ K  +D+E      + +    
Sbjct: 176  IDCNFLPSEKEQDSQLMTEKLETDMCMQDVSMSPGENQSGKD-IDNECGSNKKVDTDTCM 234

Query: 1031 VNPDELNEEALTGQYLLHEFGDEDLLKTTEL--------ILDVEDRVCPVAELVA--DVA 882
             +     E+  + + + ++   +DL+ T  +          DV   +  + EL++  +++
Sbjct: 235  QDVSLSLEKNESNKGIPNQCDSKDLMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELS 294

Query: 881  DHHPSTEKEINHEDDAQEKKGHETSQGE-----------DESATTQSKSQLLTEEIPNKE 735
              +          D  +++    +S+ E           +ES  +  ++ +    + +  
Sbjct: 295  KVNSEAMSSDCKSDGIEQQSFQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSAT 354

Query: 734  ESLANSSGDAAM---------QESGDAHDPNKLAYNSKVESRSITFDFGSSTNSEPEKQE 582
            E L +  G+A +         +ES  +   N+++Y++K+E+ SITF+  SS    P   +
Sbjct: 355  EELDSGKGEAILISPAQVSTSEESTSSSLVNEVSYDNKLETGSITFNLDSSA---PTSSK 411

Query: 581  DETEKAEASKPPSAQTMTRQEENQEGMISSSSILLARLEDRVHGETSFSSGMPVPLPSLI 402
            DE      S+P    +  + E   +  IS++      L+  + GE+SFS+   V    LI
Sbjct: 412  DECHHNLDSEPLGTGSTPKLEVAADQSISNN------LQQGI-GESSFSAAGLVT--GLI 462

Query: 401  AFSEPLTYSGSVSLRSESSAGTASTRSFAFPVLQTEWNSSPVRIGKTGQRHMKKHRGWVQ 222
            ++S P+ YSGS+SLRS+SS  T STRSFAFP+LQ+EWN SPVR+ K  +RH +KH+GW  
Sbjct: 463  SYSGPVAYSGSLSLRSDSS--TTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGWRH 520

Query: 221  AFSCCKF 201
               CC+F
Sbjct: 521  GLLCCRF 527


>ref|XP_002311412.1| 18S pre-ribosomal assembly protein gar2 [Populus trichocarpa]
            gi|222851232|gb|EEE88779.1| 18S pre-ribosomal assembly
            protein gar2 [Populus trichocarpa]
          Length = 486

 Score =  156 bits (395), Expect = 3e-35
 Identities = 153/507 (30%), Positives = 222/507 (43%), Gaps = 69/507 (13%)
 Frame = -3

Query: 1514 PFDSDDDALDSPGLKSENMYVNVKENLNGFDLKAFDKSSDTVIFHMDNGDFLWEEDVLVQ 1335
            P + +D+ALDS GLKS N  V   EN    DL   +  +D     + N          V 
Sbjct: 9    PVEYNDNALDSIGLKSGNGSVKEIENGKFSDLNGMEGDAD----RLPN----------VA 54

Query: 1334 PATDDHAFDANSNVNKA--YSDKNVTECELPEFVICYKES--NIKDIGVDVGVPADDKLW 1167
            P    H+        ++  Y DK+V   E+PE ++CYKE+  ++KDI VD GVP  DK  
Sbjct: 55   PVPSPHSSLKMEPFEESVFYMDKSVMVREVPELIVCYKENTYHVKDICVDEGVPLQDKFL 114

Query: 1166 VQSDKTDVDM-----KVEENNNQLGVVEAPKSTLDDEWNKAAALYSSD--HPVNPDEL-- 1014
              +D    +M        + NN++   ++    L  E  K+++   +   H   PD L  
Sbjct: 115  FDTDAHKKNMCEFLPSERDMNNEMVKEKSDLDMLIPEMLKSSSEKQNVDLHLPVPDVLIS 174

Query: 1013 NEEALTGQYLLHEFGDEDLLKTTEL---------------ILDVEDRVCPVAELVADVAD 879
            +EE  +   L  +   + L+ T E+               IL + D +  ++EL A    
Sbjct: 175  SEEKGSKHDLSLDCDPKHLMPTEEVMDYGTKKVTDNASKEILSLRD-LLSMSELGAKCTP 233

Query: 878  HHPS---------------TEKEINHEDDAQEKKGH------------------------ 816
             + S                E  I   D A E+  H                        
Sbjct: 234  ANASYHNMDKVEQQSLLCPRENAILETDSASEESEHCGEETISDNGLESATLAIPTQDPA 293

Query: 815  --ETSQGEDESATTQSKSQLLTEEIPNKEESLANSSGDAAMQESGDAHDPNKLAYNSKVE 642
              E   G  E+           EE  +KE  LA+ + D+   E   +   ++L YNSK E
Sbjct: 294  YQEGDHGHTEAVLVSPTLTSAAEESDSKETKLASHALDS-FSEGSTSRIEDELPYNSKTE 352

Query: 641  SRSITFDFGSSTNSEPEKQEDETEKAEASKPPSAQTMTRQEENQEGMISSSSILLARLED 462
            +RSI+FD  SS    P     E+ +   S+    + ++R E+     +S   +  A    
Sbjct: 353  TRSISFDNDSSA---PAASARESPQNGESQRLGTRIVSRFEDPNAERLSGGQLQYA---- 405

Query: 461  RVHGETSFSSGMPVPLPSLIAFSEPLTYSGSVSLRSESSAGTASTRSFAFPVLQTEWNSS 282
               GE+SFSS  P  L  L + S P+ YSGSVSLRS+SS  T STRSFAFP+LQ+EWNSS
Sbjct: 406  --DGESSFSSSGP--LFGLTSHSGPIAYSGSVSLRSDSS--TTSTRSFAFPILQSEWNSS 459

Query: 281  PVRIGKTGQRHMKKHRGWVQAFSCCKF 201
            P R+ K  +RH +K R W+Q   CC+F
Sbjct: 460  PARMAKADRRHFQKPRKWMQGLLCCRF 486


>ref|XP_002266889.2| PREDICTED: uncharacterized protein LOC100247891 [Vitis vinifera]
          Length = 229

 Score =  147 bits (372), Expect = 1e-32
 Identities = 83/182 (45%), Positives = 114/182 (62%)
 Frame = -3

Query: 746 PNKEESLANSSGDAAMQESGDAHDPNKLAYNSKVESRSITFDFGSSTNSEPEKQEDETEK 567
           PN E  L N +  +  +ES     PN+L+YNSK+ES +ITFDFGSST S    +E   + 
Sbjct: 58  PNGEAVLENPALVSEAEESDKNSFPNELSYNSKLESGTITFDFGSSTTSMDSGREVSPQN 117

Query: 566 AEASKPPSAQTMTRQEENQEGMISSSSILLARLEDRVHGETSFSSGMPVPLPSLIAFSEP 387
                P  +Q +++ E+  E +  S  I       R  GE+SFS+  P    +LI++S  
Sbjct: 118 DGCEPPLESQNLSKLEDGSESLPFSGQI------QRGLGESSFSAAGPSS--ALISYSGQ 169

Query: 386 LTYSGSVSLRSESSAGTASTRSFAFPVLQTEWNSSPVRIGKTGQRHMKKHRGWVQAFSCC 207
           +T+SG++SLRS+SS  T STRSFAFPVLQTEWNSSPVR+ K  +RH++KHR W +   CC
Sbjct: 170 ITHSGNISLRSDSS--TTSTRSFAFPVLQTEWNSSPVRMAKAERRHLRKHRSWRRGILCC 227

Query: 206 KF 201
           +F
Sbjct: 228 RF 229


>gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thaliana]
          Length = 439

 Score =  147 bits (370), Expect = 2e-32
 Identities = 129/418 (30%), Positives = 185/418 (44%), Gaps = 32/418 (7%)
 Frame = -3

Query: 1358 WEEDVLVQPATDDHAFDANSNVNKA-----YSDKNVTECELPEFVICYKESN---IKDIG 1203
            WE +   +     H  DAN +  +      Y DKNVT C+LPE V+CYKE+    +KDI 
Sbjct: 60   WENEAGKKVRDTSHDCDANVDSPEKKDPVFYMDKNVTACDLPEIVVCYKENTYHIVKDIC 119

Query: 1202 VDVGVPADDKLW------VQSDKTDVDMKVEENN---NQLGVVEAPKSTLDDEWNKAAAL 1050
            VD GVP  +K        V+S  T+  MK ++ N   ++    E   S +DD      + 
Sbjct: 120  VDEGVPVQEKFLFGEKDSVKSSSTEDLMKADKTNVNPSETKSAEDSISKVDD------SE 173

Query: 1049 YSSDHPVNPDELNEEALTGQYLLHEFGDEDLLKTTELILDVEDRVCPVAELVADVADHHP 870
            + +DH  + D    E  +G+      G         LI+  E +  P           H 
Sbjct: 174  FCNDHKTDRDV---EESSGEDFADAEGTSSNYNQEHLIVTEEVKASPT----------HG 220

Query: 869  STEKEINHEDDAQEKKGHETSQGEDESATTQSKSQLLTEEIPNKEESLANSSGDAAMQES 690
             +  EI  E D   K     SQ  D      SK  L   +I ++E+   + + D    +S
Sbjct: 221  LSPSEI--EPDENSKDEVAISQDND------SKECLTLGDILSREDEQKSLNQDNISSDS 272

Query: 689  GDAHDPNKLAYNSKVESRSITFDFGSSTNSEPEKQEDETEKAEASKPPSAQTMTRQEEN- 513
             +   P++L      E RS+      +T  E E ++ E  K    K  S  T T QE N 
Sbjct: 273  HEEQSPSQL---QDKEKRSL-----ETTAIETELEKTEEPKQGEEKLSSVSTTTSQEPNK 324

Query: 512  --------------QEGMISSSSILLARLEDRVHGETSFSSGMPVPLPSLIAFSEPLTYS 375
                          Q+  +  +S    +      GETSFS+   V +   I +S P+ YS
Sbjct: 325  TCNEPEKPETENHHQQNCLVENSYEDDKFSSSRFGETSFSAADSVSISGHITYSGPIAYS 384

Query: 374  GSVSLRSESSAGTASTRSFAFPVLQTEWNSSPVRIGKTGQRHMKKHRGWVQAFSCCKF 201
            GS+S+RS++S  T S RSFAFP+LQ+EWNSSPVR+ K  +R  K   GW     CC+F
Sbjct: 385  GSLSVRSDAS--TTSGRSFAFPILQSEWNSSPVRMAKADKRRQK--GGWRHTLLCCRF 438


>ref|NP_178475.2| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis
            thaliana] gi|42570677|ref|NP_973412.1| 18S pre-ribosomal
            assembly protein gar2-related protein [Arabidopsis
            thaliana] gi|79316683|ref|NP_001030966.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|186499149|ref|NP_001118260.1|
            18S pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250656|gb|AEC05750.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250657|gb|AEC05751.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250658|gb|AEC05752.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250659|gb|AEC05753.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana]
          Length = 439

 Score =  147 bits (370), Expect = 2e-32
 Identities = 129/418 (30%), Positives = 185/418 (44%), Gaps = 32/418 (7%)
 Frame = -3

Query: 1358 WEEDVLVQPATDDHAFDANSNVNKA-----YSDKNVTECELPEFVICYKESN---IKDIG 1203
            WE +   +     H  DAN +  +      Y DKNVT C+LPE V+CYKE+    +KDI 
Sbjct: 60   WENEAGKKVRDTSHDCDANVDSPEKKDPVFYMDKNVTACDLPEIVVCYKENTYHIVKDIC 119

Query: 1202 VDVGVPADDKLW------VQSDKTDVDMKVEENN---NQLGVVEAPKSTLDDEWNKAAAL 1050
            VD GVP  +K        V+S  T+  MK ++ N   ++    E   S +DD      + 
Sbjct: 120  VDEGVPVQEKFLFGEKDSVKSSSTEDLMKADKTNVNPSETKSAEDSISKVDD------SE 173

Query: 1049 YSSDHPVNPDELNEEALTGQYLLHEFGDEDLLKTTELILDVEDRVCPVAELVADVADHHP 870
            + +DH  + D    E  +G+      G         LI+  E +  P           H 
Sbjct: 174  FCNDHKTDRDV---EESSGEDFADAEGTSSNYNQEHLIVTEEVKASPT----------HG 220

Query: 869  STEKEINHEDDAQEKKGHETSQGEDESATTQSKSQLLTEEIPNKEESLANSSGDAAMQES 690
             +  EI  E D   K     SQ  D      SK  L   +I ++E+   + + D    +S
Sbjct: 221  LSPSEI--EPDENSKDEVAISQDND------SKECLTLGDILSREDEQKSLNQDNISSDS 272

Query: 689  GDAHDPNKLAYNSKVESRSITFDFGSSTNSEPEKQEDETEKAEASKPPSAQTMTRQEEN- 513
             +   P++L      E RS+      +T  E E ++ E  K    K  S  T T QE N 
Sbjct: 273  HEEQSPSQL---QDKEKRSL-----ETTAIETELEKTEEPKQGEEKLSSVSTTTSQEPNK 324

Query: 512  --------------QEGMISSSSILLARLEDRVHGETSFSSGMPVPLPSLIAFSEPLTYS 375
                          Q+  +  +S    +      GETSFS+   V +   I +S P+ YS
Sbjct: 325  TCNEPEKPETENHHQQNCLVENSYEDDKFSSSRFGETSFSAADSVSISGHITYSGPIAYS 384

Query: 374  GSVSLRSESSAGTASTRSFAFPVLQTEWNSSPVRIGKTGQRHMKKHRGWVQAFSCCKF 201
            GS+S+RS++S  T S RSFAFP+LQ+EWNSSPVR+ K  +R  K   GW     CC+F
Sbjct: 385  GSLSVRSDAS--TTSGRSFAFPILQSEWNSSPVRMAKADKRRQK--GGWRHTLLCCRF 438


>ref|XP_004239019.1| PREDICTED: uncharacterized protein LOC101258367 isoform 2 [Solanum
            lycopersicum]
          Length = 554

 Score =  146 bits (369), Expect = 3e-32
 Identities = 137/459 (29%), Positives = 208/459 (45%), Gaps = 41/459 (8%)
 Frame = -3

Query: 1454 VNVKENLNGFDLKAFDKSSDTVIFHMDNGDFLWEEDVLVQPATDDHAFDANS----NVNK 1287
            ++ K   N F+    D++    I   ++ DFL  +D   +    D  F ++S    N   
Sbjct: 124  IHTKRGGNPFECDTKDRNQPWNIPEYESLDFL--DDKGNETIDSDSPFTSHSELFENNKH 181

Query: 1286 AYSDKNVTECELPEFVICYKESN---IKDIGVDVGVPADDKLWVQSDKTD-------VDM 1137
             YSDK VT+ EL E  +CY+E+N   +KDI +D GVPA DK+  +S K D       VD 
Sbjct: 182  FYSDKGVTDHELSELTVCYRENNFNIVKDICMDEGVPAVDKVLTESWKDDQLSTSVSVDA 241

Query: 1136 KVEENNNQLGVVE--------APKSTLDDEWNKAAALYSSDHPVNPDELNEEALTGQYLL 981
              E  +N    V+        +  S+ +D  N A    +   P      N+   + +   
Sbjct: 242  DEEHQSNTKKSVDMGSSIATVSQDSSCEDAKNIAVTHGAEIEPTGAPIPNDFNPSLENKA 301

Query: 980  HEFGDEDLLKTTELILDVEDRVCPVAELVADVADHHPSTEKEINHEDDAQEKKGHETSQG 801
            ++  D+D     E +L +    C       + A   PS+   +   +++  K    TS G
Sbjct: 302  NKDADKD--SYLEDLLMIFGSKCTTNGKTTN-ASEKPSSPNTVVRVEESNIK----TSDG 354

Query: 800  EDESATTQSKSQLLTEEIPNKEESLANSSGDAAMQESGDAHDPNKLAYNSKVESRSITFD 621
            +        +S L  +++P  +++L + +  +A  ES      N    NSK  + +  FD
Sbjct: 355  D--------QSTLQPDQVPF-DQTLKSQTAISAADES------NNNKGNSKEGAGTNIFD 399

Query: 620  FGSSTNSEPEKQEDETEKA-EASKPPSAQTMTRQEENQEGMISSSSILLARLEDRVH--- 453
            F  +        E   E   E S  P A ++  +  N + + +SS +  A   D  H   
Sbjct: 400  FNLTKPESTTTTEGGVENLPEDSHKPKAVSV-HKNGNSDNISASSQVPFANTADNAHQQH 458

Query: 452  ---------------GETSFSSGMPVPLPSLIAFSEPLTYSGSVSLRSESSAGTASTRSF 318
                           GE SFS+    P+   I +S P++YSGS+SLRSESS  T STRSF
Sbjct: 459  LESQNMANGQGHFADGEASFSAARG-PISGSITYSGPISYSGSLSLRSESS--TTSTRSF 515

Query: 317  AFPVLQTEWNSSPVRIGKTGQRHMKKHRGWVQAFSCCKF 201
            AFPVLQ EWNSSPVR+ K  +R + K +GW Q   CC+F
Sbjct: 516  AFPVLQNEWNSSPVRMAKAERRRLSKQKGWKQGLLCCRF 554


>ref|XP_004239018.1| PREDICTED: uncharacterized protein LOC101258367 isoform 1 [Solanum
            lycopersicum]
          Length = 586

 Score =  146 bits (369), Expect = 3e-32
 Identities = 137/459 (29%), Positives = 208/459 (45%), Gaps = 41/459 (8%)
 Frame = -3

Query: 1454 VNVKENLNGFDLKAFDKSSDTVIFHMDNGDFLWEEDVLVQPATDDHAFDANS----NVNK 1287
            ++ K   N F+    D++    I   ++ DFL  +D   +    D  F ++S    N   
Sbjct: 156  IHTKRGGNPFECDTKDRNQPWNIPEYESLDFL--DDKGNETIDSDSPFTSHSELFENNKH 213

Query: 1286 AYSDKNVTECELPEFVICYKESN---IKDIGVDVGVPADDKLWVQSDKTD-------VDM 1137
             YSDK VT+ EL E  +CY+E+N   +KDI +D GVPA DK+  +S K D       VD 
Sbjct: 214  FYSDKGVTDHELSELTVCYRENNFNIVKDICMDEGVPAVDKVLTESWKDDQLSTSVSVDA 273

Query: 1136 KVEENNNQLGVVE--------APKSTLDDEWNKAAALYSSDHPVNPDELNEEALTGQYLL 981
              E  +N    V+        +  S+ +D  N A    +   P      N+   + +   
Sbjct: 274  DEEHQSNTKKSVDMGSSIATVSQDSSCEDAKNIAVTHGAEIEPTGAPIPNDFNPSLENKA 333

Query: 980  HEFGDEDLLKTTELILDVEDRVCPVAELVADVADHHPSTEKEINHEDDAQEKKGHETSQG 801
            ++  D+D     E +L +    C       + A   PS+   +   +++  K    TS G
Sbjct: 334  NKDADKD--SYLEDLLMIFGSKCTTNGKTTN-ASEKPSSPNTVVRVEESNIK----TSDG 386

Query: 800  EDESATTQSKSQLLTEEIPNKEESLANSSGDAAMQESGDAHDPNKLAYNSKVESRSITFD 621
            +        +S L  +++P  +++L + +  +A  ES      N    NSK  + +  FD
Sbjct: 387  D--------QSTLQPDQVPF-DQTLKSQTAISAADES------NNNKGNSKEGAGTNIFD 431

Query: 620  FGSSTNSEPEKQEDETEKA-EASKPPSAQTMTRQEENQEGMISSSSILLARLEDRVH--- 453
            F  +        E   E   E S  P A ++  +  N + + +SS +  A   D  H   
Sbjct: 432  FNLTKPESTTTTEGGVENLPEDSHKPKAVSV-HKNGNSDNISASSQVPFANTADNAHQQH 490

Query: 452  ---------------GETSFSSGMPVPLPSLIAFSEPLTYSGSVSLRSESSAGTASTRSF 318
                           GE SFS+    P+   I +S P++YSGS+SLRSESS  T STRSF
Sbjct: 491  LESQNMANGQGHFADGEASFSAARG-PISGSITYSGPISYSGSLSLRSESS--TTSTRSF 547

Query: 317  AFPVLQTEWNSSPVRIGKTGQRHMKKHRGWVQAFSCCKF 201
            AFPVLQ EWNSSPVR+ K  +R + K +GW Q   CC+F
Sbjct: 548  AFPVLQNEWNSSPVRMAKAERRRLSKQKGWKQGLLCCRF 586


>gb|EOY01582.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 2
            [Theobroma cacao] gi|508709686|gb|EOY01583.1| 18S
            pre-ribosomal assembly protein gar2-related, putative
            isoform 2 [Theobroma cacao] gi|508709687|gb|EOY01584.1|
            18S pre-ribosomal assembly protein gar2-related, putative
            isoform 2 [Theobroma cacao]
          Length = 470

 Score =  145 bits (366), Expect = 7e-32
 Identities = 118/417 (28%), Positives = 202/417 (48%), Gaps = 56/417 (13%)
 Frame = -3

Query: 1283 YSDKNVTECELPEFVICYKESN---IKDIGVDVGVPADDKLWVQS--------------- 1158
            Y DK+V ECELPE V+CYKES    +KDI +D GVP  DK   ++               
Sbjct: 69   YLDKSVMECELPELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLPSEK 128

Query: 1157 --------DKTDVDMKVEENNNQLGVVEAPKSTLDDEWNKAAALYSSDHPVNPDELNEEA 1002
                    +K + DM +++ +   G  ++ K  +D+E      + +     +     E+ 
Sbjct: 129  EQDSQLMTEKLETDMCMQDVSMSPGENQSGKD-IDNECGSNKKVDTDTCMQDVSLSLEKN 187

Query: 1001 LTGQYLLHEFGDEDLLKTTEL--------ILDVEDRVCPVAELVA--DVADHHPSTEKEI 852
             + + + ++   +DL+ T  +          DV   +  + EL++  +++  +       
Sbjct: 188  ESNKGIPNQCDSKDLMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSD 247

Query: 851  NHEDDAQEKKGHETSQGE-----------DESATTQSKSQLLTEEIPNKEESLANSSGDA 705
               D  +++    +S+ E           +ES  +  ++ +    + +  E L +  G+A
Sbjct: 248  CKSDGIEQQSFQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEA 307

Query: 704  AM---------QESGDAHDPNKLAYNSKVESRSITFDFGSSTNSEPEKQEDETEKAEASK 552
             +         +ES  +   N+++Y++K+E+ SITF+  SS    P   +DE      S+
Sbjct: 308  ILISPAQVSTSEESTSSSLVNEVSYDNKLETGSITFNLDSSA---PTSSKDECHHNLDSE 364

Query: 551  PPSAQTMTRQEENQEGMISSSSILLARLEDRVHGETSFSSGMPVPLPSLIAFSEPLTYSG 372
            P    +  + E   +  IS++      L+  + GE+SFS+   V    LI++S P+ YSG
Sbjct: 365  PLGTGSTPKLEVAADQSISNN------LQQGI-GESSFSAAGLVT--GLISYSGPVAYSG 415

Query: 371  SVSLRSESSAGTASTRSFAFPVLQTEWNSSPVRIGKTGQRHMKKHRGWVQAFSCCKF 201
            S+SLRS+SS  T STRSFAFP+LQ+EWN SPVR+ K  +RH +KH+GW     CC+F
Sbjct: 416  SLSLRSDSS--TTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 470


>gb|AAM77645.1|AF517846_1 hypothetical protein [Arabidopsis thaliana]
            gi|41059759|gb|AAR99354.1| hypothetical protein At2g03810
            [Arabidopsis thaliana]
          Length = 439

 Score =  142 bits (357), Expect = 8e-31
 Identities = 128/418 (30%), Positives = 182/418 (43%), Gaps = 32/418 (7%)
 Frame = -3

Query: 1358 WEEDVLVQPATDDHAFDANSNVNKA-----YSDKNVTECELPEFVICYKESN---IKDIG 1203
            WE +   +     H  DAN +  +      Y DKNVT C+LPE V CYKE+    +KDI 
Sbjct: 60   WENEAGKKVRDTSHDCDANVDSPEKKDPVFYMDKNVTACDLPEIVACYKENTYHIVKDIC 119

Query: 1202 VDVGVPADDKLW------VQSDKTDVDMKVEENN---NQLGVVEAPKSTLDDEWNKAAAL 1050
            VD  VP  +K        V+S  T+  MK ++ N   ++    E   S +DD      + 
Sbjct: 120  VDESVPVQEKFLFGEKDSVKSSSTEDLMKADKTNVNPSETKSAEDSISKVDD------SE 173

Query: 1049 YSSDHPVNPDELNEEALTGQYLLHEFGDEDLLKTTELILDVEDRVCPVAELVADVADHHP 870
            + +DH  + D    E  +G+      G         LI+  E    P           H 
Sbjct: 174  FCNDHKTDRDV---EESSGEDFADAEGTSSNYNQEHLIVTEEVXASPT----------HG 220

Query: 869  STEKEINHEDDAQEKKGHETSQGEDESATTQSKSQLLTEEIPNKEESLANSSGDAAMQES 690
             +  EI  E D   K     SQ  D      SK  L   +I ++E+   + + D    +S
Sbjct: 221  LSPSEI--EPDENSKDEVAISQDND------SKECLTLGDILSREDEQKSLNQDNISSDS 272

Query: 689  GDAHDPNKLAYNSKVESRSITFDFGSSTNSEPEKQEDETEKAEASKPPSAQTMTRQEEN- 513
             +   P++L      E RS+      +T  E E ++ E  K    K  S  T T QE N 
Sbjct: 273  HEEQSPSQL---QDKEKRSL-----ETTAIETELEKTEEPKQGEEKLSSVSTTTSQEPNK 324

Query: 512  --------------QEGMISSSSILLARLEDRVHGETSFSSGMPVPLPSLIAFSEPLTYS 375
                          Q+  +  +S    +      GETSFS+   V +   I +S P+ YS
Sbjct: 325  TCNEPEKPETENHHQQNCLVENSYEDDKFSSSRFGETSFSAADSVSISGHITYSGPIAYS 384

Query: 374  GSVSLRSESSAGTASTRSFAFPVLQTEWNSSPVRIGKTGQRHMKKHRGWVQAFSCCKF 201
            GS+S+RS++S  T S RSFAFP+LQ+EWNSSPVR+ K  +R  K   GW     CC+F
Sbjct: 385  GSLSVRSDAS--TTSGRSFAFPILQSEWNSSPVRMAKADKRRQK--GGWRHTLLCCRF 438


>ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arabidopsis lyrata subsp.
            lyrata] gi|297321067|gb|EFH51488.1| hypothetical protein
            ARALYDRAFT_484289 [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  140 bits (352), Expect = 3e-30
 Identities = 121/397 (30%), Positives = 184/397 (46%), Gaps = 25/397 (6%)
 Frame = -3

Query: 1322 DHAFDANSNVNKA-------YSDKNVTECELPEFVICYKESN---IKDIGVDVGVPADDK 1173
            D + D ++NV+         Y DKNVT C+LPE V+CYKE+    +KDI VD GVP  +K
Sbjct: 70   DISHDCDANVDSPDKKDPVFYMDKNVTACDLPEIVVCYKENTYHVVKDICVDEGVPVQEK 129

Query: 1172 LWVQSDKTDVDMKVEENNNQLGVVEAPKSTLDDEWNKAAALYSSDHPVNPDELNEEALTG 993
             ++  +K  V     E+     + +A K+ ++   +K+A    S+  V+  E      T 
Sbjct: 130  -FLFGEKDSVKSSSTED-----LTKADKTNVNPSESKSAE--DSNTKVDDSEFCNNCKTD 181

Query: 992  QYLLHEFGDEDLLKTTELILDVEDRVCPVAELVADVADHHPSTEKEINHEDDAQEKKGHE 813
            + +  E   ED           ++ +    E  A  +  H     EI  ++++ ++    
Sbjct: 182  RDV-EESSREDFADAEGSSAYNQEHLIVTEE--AKASPSHGLNPSEIEPDENSNDEVAIS 238

Query: 812  TSQGEDESATTQSKSQLLTEEIPNKEESLANSSGDAAMQESGDAHDPNKLAYNSKVESRS 633
            +     ES T      +L+ E   K  +  N S D+  ++S     P++L      E RS
Sbjct: 239  SETDSKESLTL---GDILSREDEQKSLNHGNISSDSHEEQS-----PSQL---QDKEKRS 287

Query: 632  ITFDFGSSTNSEPEKQEDETEKAEASKPPSAQTMTRQEEN---------------QEGMI 498
            +      +   E E ++ E  K    K PSA T T QE N               Q+  +
Sbjct: 288  L-----ETAAIETELEKTEEPKPVEEKLPSASTTTLQEPNKTCNDPEKPETENHHQQNSL 342

Query: 497  SSSSILLARLEDRVHGETSFSSGMPVPLPSLIAFSEPLTYSGSVSLRSESSAGTASTRSF 318
              +S    +L     GETSFS+   V +   I +S P+ YSGS+S+RS++S  T S RSF
Sbjct: 343  VENSYEDDKLSSSRFGETSFSAAESVSISGHITYSGPIAYSGSLSVRSDAS--TTSGRSF 400

Query: 317  AFPVLQTEWNSSPVRIGKTGQRHMKKHRGWVQAFSCC 207
            AFP+LQ+EWNSSPVR+ K  +R  K   GW     CC
Sbjct: 401  AFPILQSEWNSSPVRMAKADKRRQK--GGWRHTLLCC 435


>ref|XP_006348792.1| PREDICTED: dentin sialophosphoprotein-like [Solanum tuberosum]
          Length = 586

 Score =  137 bits (346), Expect = 1e-29
 Identities = 126/404 (31%), Positives = 183/404 (45%), Gaps = 33/404 (8%)
 Frame = -3

Query: 1313 FDANSNVNKAYSDKNVTECELPEFVICYKESN---IKDIGVDVGVPADDKLWVQSDK--- 1152
            FD+N +    YSDK VT+ ELPE  +CY+E+N   +KDI +D GVPA DK+ ++S K   
Sbjct: 214  FDSNKHF---YSDKGVTDHELPELTVCYRENNFNMVKDICMDEGVPAVDKVLIESWKDGQ 270

Query: 1151 ----TDVDMKVEENNNQLGVVEAPK---STLDDEWNKAAALYSSDHPVNPDELNEEALTG 993
                  VD   E+ +N    V+      S   D   K A   +  H    +        G
Sbjct: 271  PSTSVSVDADEEQQSNTRKSVDMGSTIASVSQDSSFKDAKNIAVTHDTEIEATGAPVPNG 330

Query: 992  -QYLLHEFGDEDLLKTTELILDVEDRVCPVAELVADVADHHPSTEKEINHEDDAQEKKGH 816
                L    ++D  K + L    ED +          A   PS+   +   +++  K   
Sbjct: 331  FNPSLENNANKDADKDSYL----EDLLMIFGSKCTTNASEKPSSLNTVVRVEESNIK--- 383

Query: 815  ETSQGEDESATTQSKSQLLTEEIPNKEESLANSSGDAAMQESGDAHDPNKLAYNSKVESR 636
             TS G+        +S L  +++P+ E++L + +   A+  SG  ++      N K    
Sbjct: 384  -TSDGD--------QSTLQPDQVPS-EQTLKSQT---AVSASGQTNNKG----NIKEGVG 426

Query: 635  SITFDFGSSTNSEPEKQEDETEKA-EASKPPSAQTMTRQEENQEGMISSSSILLARLEDR 459
            +  FD   +     +  E       E S  P A ++  +  N +   +SS +  A   D 
Sbjct: 427  TSIFDVNLTKPESTKTTEGGVGNLPEDSHMPKAVSV-HKNGNSDNNSASSQVPFANTADN 485

Query: 458  VH------------------GETSFSSGMPVPLPSLIAFSEPLTYSGSVSLRSESSAGTA 333
             H                  GE SFS+    P+   I +S P++YSGSVSLRSESS  T 
Sbjct: 486  AHQQHLESQNMANGQSHFADGEASFSAARG-PISGSITYSGPISYSGSVSLRSESS--TT 542

Query: 332  STRSFAFPVLQTEWNSSPVRIGKTGQRHMKKHRGWVQAFSCCKF 201
            STRSFAFPVLQ EWNSSPVR+ K  +R + K +GW Q   CC+F
Sbjct: 543  STRSFAFPVLQNEWNSSPVRMAKAERRRLSKQKGWKQGILCCRF 586


>ref|XP_006291111.1| hypothetical protein CARUB_v10017222mg [Capsella rubella]
            gi|482559818|gb|EOA24009.1| hypothetical protein
            CARUB_v10017222mg [Capsella rubella]
          Length = 455

 Score =  137 bits (346), Expect = 1e-29
 Identities = 122/398 (30%), Positives = 185/398 (46%), Gaps = 28/398 (7%)
 Frame = -3

Query: 1310 DANSNVNKA-YSDKNVTECELPEFVICYKESN---IKDIGVDVGVPADDKLWVQSDKTDV 1143
            D+   VN   Y DKNVT C+LPE V+CYKE++   +KDI VD GVP  +K ++  +K  V
Sbjct: 96   DSPEKVNPVFYMDKNVTACDLPEIVVCYKENSYHVVKDICVDEGVPVQEK-FLFGEKDSV 154

Query: 1142 DMKVEENNNQLGVVEAPK-----------STLDDEWNKA--AALYSSDHPVNP-DELNEE 1005
              K   N+N  G V+  K            +L+D  +K   ++   +D  V   +E + E
Sbjct: 155  --KSTTNSNHCGSVDLMKVDKTDVKPSETKSLEDSNSKVDDSSEVCNDKTVQDVEESSRE 212

Query: 1004 ALTGQYLLHEFGDEDL--------LKTTELILDVEDRVCPVAELVADVADH--HPSTEKE 855
            A         +  E L        LK +E+ L+VE       E+V    D      T  +
Sbjct: 213  AFADAEGSSNYDQEHLIVTSPTLALKPSEISLEVESEEISKDEVVISSEDFLSESLTLGD 272

Query: 854  INHEDDAQEKKGHETSQGEDESATTQSKSQLLTEEIPNKEESLANSSGDAAMQESGDAHD 675
            I   +D Q+   ++     +E +  Q + +        ++ SL  +  D  +++     +
Sbjct: 273  ILSREDKQKSLKNDNGNRPEELSPPQHQEK--------EKRSLETTGLDTKLEK---VEE 321

Query: 674  PNKLAYNSKVESRSITFDFGSSTNSEPEKQEDETEKAEASKPPSAQTMTRQEENQEGMIS 495
            P     N    S        ++T  EP K  ++ EK E       + +   E+++   +S
Sbjct: 322  PKTAEENLSSAS--------TTTVQEPNKSCNDLEKPETENHQQNRLVNSYEDDK---LS 370

Query: 494  SSSILLARLEDRVHGETSFSSGMPVPLPSLIAFSEPLTYSGSVSLRSESSAGTASTRSFA 315
            SS            GETSFS+   V +   I +S P+ YSGS+S+RS++S  T S RSFA
Sbjct: 371  SSRF----------GETSFSAAESVSISGHITYSGPIAYSGSLSVRSDAS--TTSGRSFA 418

Query: 314  FPVLQTEWNSSPVRIGKTGQRHMKKHRGWVQAFSCCKF 201
            FP+LQ+EWNSSPVR+ K  +R  K   GW     CCKF
Sbjct: 419  FPILQSEWNSSPVRMAKADKRRQK--GGWRHTLLCCKF 454


>ref|XP_006395670.1| hypothetical protein EUTSA_v10004181mg [Eutrema salsugineum]
            gi|567142661|ref|XP_006395671.1| hypothetical protein
            EUTSA_v10004181mg [Eutrema salsugineum]
            gi|557092309|gb|ESQ32956.1| hypothetical protein
            EUTSA_v10004181mg [Eutrema salsugineum]
            gi|557092310|gb|ESQ32957.1| hypothetical protein
            EUTSA_v10004181mg [Eutrema salsugineum]
          Length = 458

 Score =  135 bits (340), Expect = 7e-29
 Identities = 116/386 (30%), Positives = 166/386 (43%), Gaps = 25/386 (6%)
 Frame = -3

Query: 1283 YSDKNVTECELPEFVICYKESN---IKDIGVDVGVPADDKLWVQSDKTDVDMKVEENNNQ 1113
            Y DKNVT C+LPE V+CYKE+    +KDI VD GVP  +K ++  +K  V  K   N+N+
Sbjct: 106  YMDKNVTACDLPEIVVCYKENTYHVVKDICVDEGVPVQEK-FLFGEKDSV--KCSSNSNK 162

Query: 1112 LGVVEAPKSTLDDEWNKAAALYSSDHPVNPDELNEEALTGQYLLH---EFGDEDLLKTTE 942
                                   S+  +  D+ +   L  + L     +  D +L   T+
Sbjct: 163  C---------------------ESEDLMEADKASSNLLESKSLEDRNSKLDDSELCNGTK 201

Query: 941  LILDVEDRVCPVAELVADVADHHPSTEKEINHEDDAQEKKGHETSQGED----ESATTQS 774
               DVE+      E  AD        ++ +    +A++   H  +  E     ES     
Sbjct: 202  TNRDVEESS---REEFADAEGSSNCNQEHLTVTREAKDSPTHGVNHSEISHEIESDENSK 258

Query: 773  KSQLLTEEIPNKEESLANSSGDAAMQESGDAHDPNKLAYNSKVESRSITFDFGSSTNSEP 594
            K ++ T E  N       + GD   +E    H  N  + N + E            + E 
Sbjct: 259  KHEVATSE--NVVSECCLTLGDILSREDEQKHLNNNNSSNRREEHSPPLLQEMEKRSLET 316

Query: 593  EKQEDETEKAEASKPPSAQTMTRQEEN------QEGMISSSSILLARLEDRVH------- 453
               E E  K    K  S  T T QE N      +     +      R+ED          
Sbjct: 317  TPLETEEPKQAEEKLSSVSTTTSQEPNKTCNDPERPETENQQQPKLRVEDSYEDDKLFSS 376

Query: 452  --GETSFSSGMPVPLPSLIAFSEPLTYSGSVSLRSESSAGTASTRSFAFPVLQTEWNSSP 279
              GETSFS+  PV +   I +S P+ +SGS+S+RS++S  T S RSFAFP+LQ+EWNSSP
Sbjct: 377  GFGETSFSASEPVSISGHITYSGPIAFSGSLSVRSDAS--TTSGRSFAFPILQSEWNSSP 434

Query: 278  VRIGKTGQRHMKKHRGWVQAFSCCKF 201
            VR+ K  +R  K   GW     CC+F
Sbjct: 435  VRMAKADKRRQK---GWRHILLCCRF 457


>ref|XP_004144157.1| PREDICTED: uncharacterized protein LOC101217989 [Cucumis sativus]
            gi|449523672|ref|XP_004168847.1| PREDICTED:
            uncharacterized protein LOC101224727 [Cucumis sativus]
          Length = 431

 Score =  133 bits (335), Expect = 3e-28
 Identities = 116/383 (30%), Positives = 188/383 (49%), Gaps = 11/383 (2%)
 Frame = -3

Query: 1316 AFDANSNVNKAYS--DKNVTECELPEFVICYKESNI---KDIGVDVGVPADDKLWVQSDK 1152
            AF A+SN+  ++S  DK+V EC++ + ++C +E N+   KDI +D GV + +  + +S  
Sbjct: 91   AFGASSNMKPSFSYVDKSVMECQMSKTIVCDQEVNVNDVKDICIDDGVASLENFFFKSTA 150

Query: 1151 TDVDMKV---EENNNQLGVVEAPKSTLDDEWNKAAALYSSDHPVNPDELNEEALTGQYLL 981
                 K+   EE+ N+  + E   S+      + +   + D  V+     E+     +  
Sbjct: 151  EKSISKISPLEEDRNEGSIKEKETSS------EVSKFIADDRKVSL----EDHFAMDWTT 200

Query: 980  HEFGDEDLLKTTELILDVEDRVCPVAELVADVADHHPSTEKEINHEDDAQEKKGHETSQG 801
            H    +DL +  E  L++ +    + +LV            + ++  ++ +K G + S G
Sbjct: 201  HNDA-KDLTQIEEEKLNLSEPELLMQKLV------------KRSYSSESLDKIGLQIS-G 246

Query: 800  ED---ESATTQSKSQLLTEEIPNKEESLANSSGDAAMQESGDAHDPNKLAYNSKVESRSI 630
            E    E  ++ SKS     + P         + D+A +   D    +   YN + E+ SI
Sbjct: 247  EKTNLEDPSSASKSVDSCNDTP---------ALDSAAEPPKDNIPAHPSGYNDEFENGSI 297

Query: 629  TFDFGSSTNSEPEKQEDETEKAEASKPPSAQTMTRQEENQEGMISSSSILLARLEDRVHG 450
               F S +      +E +     +      Q +T  E       +S S LL+       G
Sbjct: 298  ALTFNSISPVANGGEERQECCGRSDSVIGTQVLTNLEYR-----TSDSRLLSSQNMHDIG 352

Query: 449  ETSFSSGMPVPLPSLIAFSEPLTYSGSVSLRSESSAGTASTRSFAFPVLQTEWNSSPVRI 270
            E+SFS+    PL SL+ +S P+ YSGS+SLRSESS  T STRSFAFP+LQ+EWNSSPV++
Sbjct: 353  ESSFSA--VDPLASLVTYSGPVAYSGSISLRSESS--TTSTRSFAFPILQSEWNSSPVKM 408

Query: 269  GKTGQRHMKKHRGWVQAFSCCKF 201
             K  +RH +K+RGW +   CCKF
Sbjct: 409  VKAERRHYRKYRGWREGLLCCKF 431


>ref|XP_004237062.1| PREDICTED: uncharacterized protein LOC101254294 [Solanum
            lycopersicum]
          Length = 532

 Score =  133 bits (334), Expect = 4e-28
 Identities = 140/502 (27%), Positives = 227/502 (45%), Gaps = 46/502 (9%)
 Frame = -3

Query: 1568 LENDPGNL--HPSPPRKSYNPFDSDDDALDSPGL--KSENMYVNVKENLNGFDLKAFDKS 1401
            L++DP     H +  +++ NPF  D    D P    K E+  +     +N FD    DK 
Sbjct: 78   LKDDPDEAPSHLTSCKRNGNPFACDTADRDHPWSIPKFEDPII-----VNFFD----DKE 128

Query: 1400 SDTVIFHMDNGDFLWEEDVLVQPATDDHAFDANSNVNKAYSDKNVTECELPEFVICYKES 1221
             +TV+                Q  +    F A++++   Y+DK V E ELPE  ICYKE+
Sbjct: 129  KETVVSS-------------TQFTSLSELFGADTHL---YTDKGVLEFELPESTICYKEN 172

Query: 1220 N---IKDIGVDVGVPADDKLWVQSDKTD-----VDMKVEENNNQLGVVEAPKSTLDDEWN 1065
            +   +KDI +D GVP  DK+  +S K D     + +  +E+  ++         +    +
Sbjct: 173  DYNIMKDICMDEGVPLMDKIVTESRKYDQPDSSISLAADEHQPRITREGVDSELVSSGES 232

Query: 1064 KAAALYSS------DHPVNPDELNEEALTGQYLLHEFGDEDLLKTTELILDVEDRVCPVA 903
            KA+++ S+       H    DE N+  +     ++ F ++++ K  E             
Sbjct: 233  KASSVESAVKISVDHHTTKEDEGNKSLVPNG--INPFLEDNMSKDAE------------K 278

Query: 902  ELVADVADHHPSTEKEINHEDDAQEKKGHETSQGEDESATTQSKSQLLTEEIPNKEESLA 723
            +   DV     S +  +    +  EK+    +  E  S   QS  Q    ++P   E+  
Sbjct: 279  DPYLDVMKIFGSKDTTMAKPTNISEKESDSQNFKESNSDADQSAQQ--ANQMPTSVEAF- 335

Query: 722  NSSGDAAMQESGDAHDP-NKLAYNSKVESRSITFDFG-------SSTNSEPEKQEDETEK 567
            NS    +  +  + + P +  + NSK +S +IT DF        SS     +   +++ K
Sbjct: 336  NSQYTVSPADGTNNYGPGSNFSNNSKSKSGAITCDFNLTELALSSSVTKSDKHLPEQSHK 395

Query: 566  AEA---SKPPSAQTMTRQEE----NQEGMISSSSIL-----LARLEDR--------VHGE 447
             EA    K  S+ + +   +    N     +SS+I      +A LE++        VHG 
Sbjct: 396  LEAVSGQKDGSSDSFSAATQVHFANSVDSSNSSTIHADPPNVANLEEKNSSSIPLGVHGH 455

Query: 446  TSFSSGMPVPLPSLIAFSEPLTYSGSVSLRSESSAGTASTRSFAFPVLQTEWNSSPVRIG 267
             +       P   LI++S  + +SG++SLRS+SS  T S RSFAFPVLQ+EWNSSPVR+ 
Sbjct: 456  FANGEASFGPASGLISYSGHIAHSGNISLRSDSS--TTSARSFAFPVLQSEWNSSPVRMA 513

Query: 266  KTGQRHMKKHRGWVQAFSCCKF 201
            K  +RH   ++GW Q+  CCKF
Sbjct: 514  KAERRH---YKGWRQSLLCCKF 532


>ref|XP_006363556.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Solanum
            tuberosum] gi|565395867|ref|XP_006363557.1| PREDICTED:
            dentin sialophosphoprotein-like isoform X2 [Solanum
            tuberosum] gi|565395869|ref|XP_006363558.1| PREDICTED:
            dentin sialophosphoprotein-like isoform X3 [Solanum
            tuberosum] gi|565395871|ref|XP_006363559.1| PREDICTED:
            dentin sialophosphoprotein-like isoform X4 [Solanum
            tuberosum]
          Length = 532

 Score =  131 bits (330), Expect = 1e-27
 Identities = 135/484 (27%), Positives = 220/484 (45%), Gaps = 41/484 (8%)
 Frame = -3

Query: 1529 RKSYNPFDSDDDALDSPGL--KSENMYVNVKENLNGFDLKAFDKSSDTVIFHMDNGDFLW 1356
            +++ NPF  D    D P    K E+  +     +N FD    DK  +TV+          
Sbjct: 94   KRNGNPFACDTADRDHPWSIPKFEDPMI-----VNFFD----DKEKETVVSS-------- 136

Query: 1355 EEDVLVQPATDDHAFDANSNVNKAYSDKNVTECELPEFVICYKESN---IKDIGVDVGVP 1185
                  Q  +    F  N+++   Y+DK V E +LPE  ICY E+N   +KDI +D GVP
Sbjct: 137  -----AQFTSLSELFGTNTHL---YTDKGVLEFKLPELTICYNENNYNIMKDICMDEGVP 188

Query: 1184 ADDKLWVQSDK-----TDVDMKVEEN---NNQLGVVEAPKSTLD--DEWNKAAALYSSDH 1035
              DK+  +S K     + + + V+E+   N + GV     S+ +  D   + A   S DH
Sbjct: 189  LMDKIVTESRKYHQPDSSISLAVDEHQPRNTREGVDSELVSSGESKDSSVENAVKISVDH 248

Query: 1034 PVNPDELNEEALTGQYLLHEFGDEDLLKTTELILDVEDRVCPVAELVADVADHHPSTEKE 855
                ++ + ++L G   ++ F ++++ K  +     +D    V ++              
Sbjct: 249  HTTKEDEDTKSL-GPNGINPFLEDNMSKYAD-----KDSSLDVMKIFGSKDTTTAKATNI 302

Query: 854  INHEDDAQEKKGHETSQGEDESATTQSKSQLLTEEIPNKEESLANSSGDAAMQESGDAHD 675
              +E D Q  K         ES +   +S L   +IP    +  + +  +A   + +   
Sbjct: 303  SENESDIQNLK---------ESNSDAEQSALQANQIPTFVAAFNSQNTVSAADGTNNNGP 353

Query: 674  PNKLAYNSKVESRSITFDFG------SSTNSEPEK----QEDETEKAEASKPPSAQTMT- 528
             +  + NSK ES +IT DF       SS+ ++ +K    Q  + E   + K  S+ + + 
Sbjct: 354  GSNFSNNSKSESGAITCDFNLTELALSSSVAKSDKHLPEQSHKLEAVSSQKDGSSDSFSA 413

Query: 527  -------RQEENQEGMISSSSILLARLEDR--------VHGETSFSSGMPVPLPSLIAFS 393
                      ++    I +    +A LE++        VHG  +       P   LI++S
Sbjct: 414  ATQVHFANSVDSCNSSIHADPPNVANLEEKNSGSIPLGVHGHFANGEASFGPASGLISYS 473

Query: 392  EPLTYSGSVSLRSESSAGTASTRSFAFPVLQTEWNSSPVRIGKTGQRHMKKHRGWVQAFS 213
              +T+SG++SLRS+SS  T S RSFAFPVLQ+EWNSSPVR+ K  +RH   ++GW Q+  
Sbjct: 474  GHITHSGNISLRSDSS--TTSARSFAFPVLQSEWNSSPVRMAKAERRH---YKGWRQSLL 528

Query: 212  CCKF 201
            CCKF
Sbjct: 529  CCKF 532


Top