BLASTX nr result

ID: Ephedra26_contig00018655 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra26_contig00018655
         (1304 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABK26175.1| unknown [Picea sitchensis]                             379   e-102
gb|ABK26131.1| unknown [Picea sitchensis]                             378   e-102
ref|XP_001763512.1| predicted protein [Physcomitrella patens] gi...   337   8e-90
ref|XP_001781916.1| predicted protein [Physcomitrella patens] gi...   249   2e-63
ref|WP_006102408.1| glycosyhydrolase [Coleofasciculus chthonopla...   217   7e-54
ref|WP_017747445.1| glycosyhydrolase [Scytonema hofmanni]             214   8e-53
ref|WP_006194355.1| glycosyhydrolase [Nodularia spumigena] gi|11...   210   9e-52
ref|WP_019497328.1| glycosyhydrolase [Calothrix sp. PCC 7103]         207   1e-50
ref|YP_007167016.1| hypothetical protein PCC7418_0574 [Halothece...   205   3e-50
ref|WP_009150368.1| glycosyhydrolase [Thiorhodovibrio sp. 970] g...   181   8e-43
ref|WP_023410516.1| hypothetical protein [uncultured Thiohalocap...   172   4e-40
ref|XP_001753037.1| predicted protein [Physcomitrella patens] gi...   141   7e-31
gb|EWM26647.1| Glycosyl hydrolase family 43, five-bladed beta-pr...   140   9e-31
emb|CBN79395.1| putative lipoprotein [Ectocarpus siliculosus]         136   2e-29
ref|XP_004297285.1| PREDICTED: uncharacterized protein LOC101300...   128   5e-27
ref|XP_006827322.1| hypothetical protein AMTR_s00010p00267390 [A...   125   3e-26
ref|XP_002336049.1| predicted protein [Populus trichocarpa]           124   1e-25
ref|XP_006379671.1| hypothetical protein POPTR_0008s08930g [Popu...   123   1e-25
ref|XP_002266397.1| PREDICTED: uncharacterized protein LOC100264...   123   2e-25
gb|EXC20011.1| hypothetical protein L484_015688 [Morus notabilis]     121   6e-25

>gb|ABK26175.1| unknown [Picea sitchensis]
          Length = 351

 Score =  379 bits (973), Expect = e-102
 Identities = 195/356 (54%), Positives = 244/356 (68%), Gaps = 7/356 (1%)
 Frame = +1

Query: 133  WESGRVLSPSPAGSGWWDAKCVAGPVVLQEPS-SSAYRMYYYGRDSDNWNMGLKAINPNL 309
            W SGRVL P+P  S WWD K  +G VV++EP  ++ YRMYYYGR  + WNMG++  N  L
Sbjct: 3    WNSGRVLGPAPPDSNWWDRKLFSGAVVVKEPKGATGYRMYYYGRSEEVWNMGVQPFNNTL 62

Query: 310  PTGRIGLALSTDGISFHRHRGPLQGGAVLDPSNDKHAFDCVQVAVSDALHDEPTLTWRIX 489
            PTGRIG+A S+DG++F RHRGPL GGAV+DPS++  AFDCV VA+SDA++      W + 
Sbjct: 63   PTGRIGVAFSSDGLAFQRHRGPLPGGAVMDPSDNPAAFDCVHVAISDAVYTGE--KWLL- 119

Query: 490  XXXXXXXXXXXXXXXXPGVALSSDGVSITERKGPLLAAGGPGEWDERGVSWPRIFTG-LH 666
                            PG+A S DG+ I  R+GP+L  G PG WD+ GVSWPRIF G   
Sbjct: 120  -YYFGGGMAAEGRRLLPGLASSVDGIHIDGREGPVLDVGEPGAWDQNGVSWPRIFPGDDS 178

Query: 667  GQTLMTYHCLEAG-----GFFSAGIAVSQDGGLTWEKTGKVLTRGDEGSWDDKGVSVRHV 831
            G+  MTYH  E+G     GFFSAG+AVS D G  W+K GK+L+ GD GSWD+ GVSVRHV
Sbjct: 179  GRLFMTYHTRESGGSAGIGFFSAGMAVSDD-GKNWQKVGKILSCGDAGSWDEGGVSVRHV 237

Query: 832  VRVGEREYVMFYEGSKMTERGWEFAIGMAVSEDGVTWRKDEEAGDEPGGPVLRPRKGMDV 1011
            +RVG R ++MFYEGS      +EFAIG+A+S+DG+TW+KDE  G EPGGPVL  RKG +V
Sbjct: 238  IRVG-RRFLMFYEGSNFK---FEFAIGLAISDDGLTWKKDEYVGKEPGGPVLTARKGENV 293

Query: 1012 WDNRVVGTPYVVVMESGEMRMYYLGRGEDKECEGVLTTGIGLAVSHVSNFRSWKRH 1179
            WDN +VGTPYV+ M  G   MYYLG G+ +E E +   GIGLAVS   NFR W+R+
Sbjct: 294  WDNLIVGTPYVLQMPDGSFGMYYLGLGK-REGEEMSQQGIGLAVSDGPNFRLWRRY 348


>gb|ABK26131.1| unknown [Picea sitchensis]
          Length = 397

 Score =  378 bits (971), Expect = e-102
 Identities = 195/356 (54%), Positives = 243/356 (68%), Gaps = 7/356 (1%)
 Frame = +1

Query: 133  WESGRVLSPSPAGSGWWDAKCVAGPVVLQEPS-SSAYRMYYYGRDSDNWNMGLKAINPNL 309
            W SGRVL P+P  S WWD K  +G VV++EP  ++ YRMYYYGR  + WNMG++  N  L
Sbjct: 49   WNSGRVLGPAPLDSNWWDRKLFSGAVVVKEPKGATGYRMYYYGRSEEVWNMGVQPFNNTL 108

Query: 310  PTGRIGLALSTDGISFHRHRGPLQGGAVLDPSNDKHAFDCVQVAVSDALHDEPTLTWRIX 489
            PTGRIG+A S+DG++F RHRGPL GGAV+DPS++  AFDCV VA+SDA++      W + 
Sbjct: 109  PTGRIGVAFSSDGLAFQRHRGPLPGGAVMDPSDNPAAFDCVHVAISDAVYTGE--KWLL- 165

Query: 490  XXXXXXXXXXXXXXXXPGVALSSDGVSITERKGPLLAAGGPGEWDERGVSWPRIFTG-LH 666
                            PG+A S DG+ I  R+GP+L  G PG WD+ GVSWPRIF G   
Sbjct: 166  -YYFGGGMAAEGRRLLPGLASSVDGIHIDGREGPVLDVGEPGAWDQNGVSWPRIFPGDDS 224

Query: 667  GQTLMTYHCLEAG-----GFFSAGIAVSQDGGLTWEKTGKVLTRGDEGSWDDKGVSVRHV 831
            G+  MTYH  E+G     GFFSAG+AVS D G  W K GK+L+ GD GSWD+ GVSVRHV
Sbjct: 225  GRLFMTYHTRESGGSAGIGFFSAGMAVSDD-GKNWRKVGKILSCGDAGSWDEGGVSVRHV 283

Query: 832  VRVGEREYVMFYEGSKMTERGWEFAIGMAVSEDGVTWRKDEEAGDEPGGPVLRPRKGMDV 1011
            +RVG R ++MFYEGS      +EFAIG+A+S+DG+TW+KDE  G EPGGPVL  RKG +V
Sbjct: 284  IRVG-RRFLMFYEGSNFK---FEFAIGLAISDDGLTWKKDEYVGKEPGGPVLTARKGENV 339

Query: 1012 WDNRVVGTPYVVVMESGEMRMYYLGRGEDKECEGVLTTGIGLAVSHVSNFRSWKRH 1179
            WDN +VGTPYV+ M  G   MYYLG G+ +E E +   GIGLAVS   NFR W+R+
Sbjct: 340  WDNLIVGTPYVLQMPDGSFGMYYLGLGK-REGEEMSQQGIGLAVSDGPNFRLWRRY 394


>ref|XP_001763512.1| predicted protein [Physcomitrella patens] gi|162685305|gb|EDQ71701.1|
            predicted protein [Physcomitrella patens]
          Length = 449

 Score =  337 bits (863), Expect = 8e-90
 Identities = 177/373 (47%), Positives = 226/373 (60%), Gaps = 24/373 (6%)
 Frame = +1

Query: 139  SGRVLSPSPAGSGWWDAKCVAGPVVLQEPSSSAYRMYYYGRDSDNWNMGLKAINPNLPTG 318
            +GRVL+P+P  S WWD K ++G VV+ E   S YRMYYYGR  D W  G++  N +LPTG
Sbjct: 83   TGRVLAPAPGDSDWWDKKLLSGAVVVPEIGKSGYRMYYYGRGGDEWAKGVQPFNASLPTG 142

Query: 319  RIGLALSTDGISFHRHRGPLQGGAVLDPSNDKHAFDCVQVAVSDALHDEPTLTWRI---- 486
            R G+A+S DG+ F R+ G L GGA++DPS D  AFD V + VSD L+D+    WR+    
Sbjct: 143  RNGMAVSEDGLQFERYMGHLSGGAIMDPSPDYAAFDAVHIGVSDVLYDQAEDVWRMFYFG 202

Query: 487  ------XXXXXXXXXXXXXXXXXPGVALSSDGVSITERKGPLLAAGGPGEWDERGVSWPR 648
                                   PGVA S DG+S  +R+GP+L  G  G WDE GVSWPR
Sbjct: 203  GGYEESTLLGLNPDKLFRGVKLRPGVAASKDGLSFDDREGPILELGEKGAWDENGVSWPR 262

Query: 649  IFTGLH---------GQTLMTYHCLEAG-----GFFSAGIAVSQDGGLTWEKTGKVLTRG 786
            +                 LMTYH  ++G     GFFSAG+A S D G  W K  K+L+ G
Sbjct: 263  VLPPEENGDEKDKGKSNWLMTYHTRQSGGPNNFGFFSAGVATSAD-GKRWHKHSKILSAG 321

Query: 787  DEGSWDDKGVSVRHVVRVGEREYVMFYEGSKMTERGWEFAIGMAVSEDGVTWRKDEEAGD 966
            D G+WD+ GVSVRHV+RV ++ YVMFYEGS      ++FAIG+A S+DG+ W KD + G 
Sbjct: 322  DPGAWDEGGVSVRHVLRVNDK-YVMFYEGSNYK---FQFAIGLATSDDGLVWEKDFQVGP 377

Query: 967  EPGGPVLRPRKGMDVWDNRVVGTPYVVVMESGEMRMYYLGRGEDKECEGVLTTGIGLAVS 1146
            EPGGP+L+ R G +VWDN +VGTPYVV +  G  R+YYLG G+    E     GIGLAVS
Sbjct: 378  EPGGPILKARVGENVWDNVIVGTPYVVALSDGSFRLYYLGVGKMVGDE-ASKQGIGLAVS 436

Query: 1147 HVSNFRSWKRHVE 1185
               N+RSW+R  E
Sbjct: 437  DGPNYRSWRRFNE 449


>ref|XP_001781916.1| predicted protein [Physcomitrella patens] gi|162666632|gb|EDQ53281.1|
            predicted protein [Physcomitrella patens]
          Length = 450

 Score =  249 bits (635), Expect = 2e-63
 Identities = 150/368 (40%), Positives = 193/368 (52%), Gaps = 22/368 (5%)
 Frame = +1

Query: 139  SGRVLSPSPAGSGWWDAKCVAGPVVLQEPSSSAYRMYYYGRDSDNWNMGLKAINPNLPTG 318
            +GR+  P   G  WWD KCV  PVV++EPSSS +RMYYYGRD DNW+ G++     L TG
Sbjct: 95   AGRLFGPGEEGEDWWDHKCVFHPVVVREPSSSTWRMYYYGRDGDNWSSGVRPAL--LSTG 152

Query: 319  RIGLALSTDGISFHRHRGPLQGGAVLDPSNDKHAFDCVQVAVSDAL-HDEPTLTWRIXXX 495
            R+GLALS DG+ + RHRGPL  GA+LDP      FD V V  SD L HD     +     
Sbjct: 153  RVGLALSEDGVHWSRHRGPLPQGAILDPEEGGDTFDSVHVGCSDVLFHDGQWWMFYFGGS 212

Query: 496  XXXXXXXXXXXXXX-----PGVALSSDGVSITERK---GPLLAAGGPGEWDERGVSWPRI 651
                               PG+  S+DGV          PLL  G   EWDE  ++WPR+
Sbjct: 213  AEKMEIGPSNKALQGLRMLPGLVKSADGVVFDRALFTGNPLLNVGLQNEWDELFIAWPRV 272

Query: 652  FTGLHGQT-----------LMTYHCLEAGG--FFSAGIAVSQDGGLTWEKTGKVLTRGDE 792
                               LMTY  +E     F S G+A+S DG   W K GK LTRG  
Sbjct: 273  LPPSSRHRCQDPSDEDKTWLMTYSSIEKQTLPFSSIGVALSADGH-KWFKAGKALTRGAP 331

Query: 793  GSWDDKGVSVRHVVRVGEREYVMFYEGSKMTERGWEFAIGMAVSEDGVTWRKDEEAGDEP 972
            GSWD+ GV  RHV+ + + EY MFYEG  +  +G    IG+A+S DG+ W +D       
Sbjct: 332  GSWDEGGVGRRHVLLI-DNEYFMFYEG--VNNKGIH-GIGLAISPDGIHWERDPLCQ--- 384

Query: 973  GGPVLRPRKGMDVWDNRVVGTPYVVVMESGEMRMYYLGRGEDKECEGVLTTGIGLAVSHV 1152
             GP+   R G +VWDN  V  P+V+ M+ G  RMYY+G  + K       + +G+A+S  
Sbjct: 385  -GPIFSARVGEEVWDNGTVAAPHVLQMDDGSFRMYYVGTNDTK-----TESAMGMAISKG 438

Query: 1153 SNFRSWKR 1176
             NFR+W R
Sbjct: 439  KNFRTWTR 446


>ref|WP_006102408.1| glycosyhydrolase [Coleofasciculus chthonoplastes]
            gi|196179107|gb|EDX74103.1| Glycosyl hydrolases family 32
            [Coleofasciculus chthonoplastes PCC 7420]
          Length = 353

 Score =  217 bits (553), Expect = 7e-54
 Identities = 136/348 (39%), Positives = 189/348 (54%), Gaps = 9/348 (2%)
 Frame = +1

Query: 160  SPAGSGWWDAKCVAGPVVLQEPSSSAYRMYYYGRDSDNWNMGLKAINPNLPTGRIGLALS 339
            +P   GWWD++ V+ P V++ P  + ++M+YYGRD+ +++  +     NLPTGR GLA+S
Sbjct: 17   TPGSEGWWDSERVSCPRVMRCPDGT-WKMWYYGRDA-SFDRQI-----NLPTGRCGLAVS 69

Query: 340  TDGISFHRHRGPLQGGAVLDPSNDKHAFDCVQVAVSDA-LHDEPTLTWRIXXXXXXXXXX 516
            +DGI + R +GPL  GAV +P  D   FD   + VSD    D     W            
Sbjct: 70   SDGIHWERVKGPLTMGAVFEPHPDPQRFDSAHLGVSDVNFWDGLYWMWYFGGDHQVLDMG 129

Query: 517  XXXXXXX---PGVALSSDGVSITER----KGPLLAAGGPGEWDERGVSWPRIFTGLHGQT 675
                      PG A+S DG++        +G  L  G PGE+D     WP++    H + 
Sbjct: 130  KFKAKGLQMLPGCAISRDGINWVRLEGFYRGAFLECGQPGEFDALFCGWPQVLRE-HDRW 188

Query: 676  LMTYHCLEAGGFFSAGIAVSQDGGLTWEKTGKVLTRGDEGSWDDKGVSVRHVVRVGEREY 855
             M YH L +   F  G+A+S DG   WEK G++L  G+ GS+D++G+  RHV+ +   +Y
Sbjct: 189  KMYYHTLSSNREFLVGLAMSTDG-FRWEKVGQILGPGEPGSFDERGIGTRHVLNING-DY 246

Query: 856  VMFYEGSKMTERGWEFAIGMAVSEDGVTWRKDEEAGDEPGGPVL-RPRKGMDVWDNRVVG 1032
            VMFYEG  +   G+  +IG+A+SEDG+TW K +  G + GG V     KG   WD R VG
Sbjct: 247  VMFYEG--VNTSGYH-SIGLAISEDGITWEKQK--GYKAGGAVFSHAPKGSGRWDARAVG 301

Query: 1033 TPYVVVMESGEMRMYYLGRGEDKECEGVLTTGIGLAVSHVSNFRSWKR 1176
            TP+VV ME G  RMYY+G  E    E      IGLAVS   NF  W+R
Sbjct: 302  TPWVVSMEDGSFRMYYIGANEGGHDELSSQHQIGLAVSQGKNFDQWQR 349


>ref|WP_017747445.1| glycosyhydrolase [Scytonema hofmanni]
          Length = 350

 Score =  214 bits (544), Expect = 8e-53
 Identities = 136/356 (38%), Positives = 192/356 (53%), Gaps = 9/356 (2%)
 Frame = +1

Query: 142  GRVLSPSPAGSGWWDAKCVAGPVVLQEPSSSAYRMYYYGRDSDNWNMGLKAINPNLPTGR 321
            G +L+P P   GWWD++ V+ P V++ P  + ++M+YYGRD+ +++  +     NLPTGR
Sbjct: 11   GLILTPGP--EGWWDSERVSSPQVMRCPDGT-WKMWYYGRDA-SFDRDI-----NLPTGR 61

Query: 322  IGLALSTDGISFHRHRGPLQGGAVLDPSNDKHAFDCVQVAVSDALH-DEPTLTWRIXXXX 498
             GLA+STDG+ + R +GPL  GAVL+PS D   FD   V VS     D     W      
Sbjct: 62   CGLAISTDGVCWERVKGPLTMGAVLEPSPDPSRFDSAHVGVSHVKFIDGLYWMWYFGGDH 121

Query: 499  XXXXXXXXXXXXX---PGVALSSDGVSITERKGP----LLAAGGPGEWDERGVSWPRIFT 657
                            PG A+S DG+     +GP     L  G  GE+D    +W ++  
Sbjct: 122  TVIDIGQFKAKGIQMRPGCAISRDGIHWVRLEGPYQGAFLEIGKTGEFDALFCAWSQVLR 181

Query: 658  GLHGQTLMTYHCLEAGGFFSAGIAVSQDGGLTWEKTGKVLTRGDEGSWDDKGVSVRHVVR 837
               G   M YH       F  G+AVS DG   W+K G++L  G+ GS+D++G+  RHV++
Sbjct: 182  DDDGTWKMYYHTFNPTVGFLVGLAVSTDG-FRWKKVGQILGPGEPGSFDERGIGTRHVLK 240

Query: 838  VGEREYVMFYEGSKMTERGWEFAIGMAVSEDGVTWRKDEEAGDEPGGPVL-RPRKGMDVW 1014
            +   ++VMFYEG  +   G+  +IG+A+S+DG+ W+K E  G++ GG V     KG   W
Sbjct: 241  INS-QFVMFYEG--VNNSGYH-SIGVAISDDGIHWQKGE--GEQSGGSVFSHAPKGSGCW 294

Query: 1015 DNRVVGTPYVVVMESGEMRMYYLGRGEDKECEGVLTTGIGLAVSHVSNFRSWKRHV 1182
            D R VGT  VV M+ G  RMYY+G  E    E      IGLAVS  SNF  W R++
Sbjct: 295  DARAVGTLCVVPMDDGSFRMYYIGANEGGHDELSSQHQIGLAVSDGSNFHQWYRYL 350



 Score = 84.7 bits (208), Expect = 8e-14
 Identities = 86/286 (30%), Positives = 129/286 (45%), Gaps = 19/286 (6%)
 Frame = +1

Query: 127 VAWES-------GRVLSPSPAGSGWWDAKCVAGPVVLQEPSSSAYRMYYYGRDSDNWNMG 285
           V WE        G VL PSP  S +  A      V   +     Y M+Y+G D    ++G
Sbjct: 71  VCWERVKGPLTMGAVLEPSPDPSRFDSAHVGVSHVKFID---GLYWMWYFGGDHTVIDIG 127

Query: 286 -LKAINPNLPTGRIGLALSTDGISFHRHRGPLQGGAVLDPSNDKHAFDCVQVAVSDALHD 462
             KA    +   R G A+S DGI + R  GP Q GA L+       FD +  A S  L D
Sbjct: 128 QFKAKGIQM---RPGCAISRDGIHWVRLEGPYQ-GAFLEIGKTGE-FDALFCAWSQVLRD 182

Query: 463 EPTLTWRIXXXXXXXXXXXXXXXXXPGVALSSDGVSITERKGPLLAAGGPGEWDERGVSW 642
           +   TW++                  G+A+S+DG    ++ G +L  G PG +DERG+  
Sbjct: 183 DDG-TWKMYYHTFNPTVGFLV-----GLAVSTDGFR-WKKVGQILGPGEPGSFDERGIGT 235

Query: 643 PRIFTGLHGQTLMTYHCLEAGGFFSAGIAVSQDGGLTWEK------TGKVLTRGDEGS-- 798
             +   ++ Q +M Y  +   G+ S G+A+S D G+ W+K       G V +   +GS  
Sbjct: 236 RHVLK-INSQFVMFYEGVNNSGYHSIGVAISDD-GIHWQKGEGEQSGGSVFSHAPKGSGC 293

Query: 799 WDDKGVSVRHVVRVGEREYVMFYEGSK---MTERGWEFAIGMAVSE 927
           WD + V    VV + +  + M+Y G+      E   +  IG+AVS+
Sbjct: 294 WDARAVGTLCVVPMDDGSFRMYYIGANEGGHDELSSQHQIGLAVSD 339


>ref|WP_006194355.1| glycosyhydrolase [Nodularia spumigena] gi|119466253|gb|EAW47139.1|
            putative lipoprotein [Nodularia spumigena CCY9414]
            gi|585119514|gb|AHJ26456.1| putative extracellular
            nuclease [Nodularia spumigena CCY9414]
          Length = 348

 Score =  210 bits (535), Expect = 9e-52
 Identities = 132/356 (37%), Positives = 190/356 (53%), Gaps = 9/356 (2%)
 Frame = +1

Query: 136  ESGRVLSPSPAGSGWWDAKCVAGPVVLQEPSSSAYRMYYYGRDSDNWNMGLKAINPNLPT 315
            + G VL+P     GWWD++ V+ P V++ P  + ++M+YYGRD+            NLPT
Sbjct: 10   QPGLVLAPGT--EGWWDSERVSSPQVIRCPDGT-WKMWYYGRDAAFDRQ------INLPT 60

Query: 316  GRIGLALSTDGISFHRHRGPLQGGAVLDPSNDKHAFDCVQVAVSDA-LHDEPTLTWRIXX 492
            GR GLA+S DG+++ R +G L  G+VL+P  D H FD   + VS+   +D     W    
Sbjct: 61   GRCGLAISPDGVNWQRVKGQLTMGSVLEPHPDTHRFDSAHLGVSNVNFYDGLYWMWYFGG 120

Query: 493  XXXXXXXXXXXXXXX---PGVALSSDGVSITERKGP----LLAAGGPGEWDERGVSWPRI 651
                              PG A+S +G++    +G     +L  G  GE+D    +WP++
Sbjct: 121  NHTVVDIGKFAAKGLQMRPGCAISRNGINWVRLEGAYQGAILDVGKNGEFDALFCAWPQV 180

Query: 652  FTGLHGQTLMTYHCLEAGGFFSAGIAVSQDGGLTWEKTGKVLTRGDEGSWDDKGVSVRHV 831
                 G   + YH L     F  G+AVS DG L WEK G++L  G+ GS+D++G+  RHV
Sbjct: 181  LHDNDGTWKLYYHTLNPDKGFLVGLAVSTDG-LRWEKVGQILGAGESGSFDERGIGTRHV 239

Query: 832  VRVGEREYVMFYEGSKMTERGWEFAIGMAVSEDGVTWRKDEEAGDEPGGPVL-RPRKGMD 1008
            +++   +Y+MFYEG  +   G+  +IG+A+S+DG+ W+KD +       PV     KG  
Sbjct: 240  LKING-QYLMFYEG--VNNSGYH-SIGLAISDDGIHWQKDADV------PVFSHAEKGSG 289

Query: 1009 VWDNRVVGTPYVVVMESGEMRMYYLGRGEDKECEGVLTTGIGLAVSHVSNFRSWKR 1176
             WD R VGTP VV M+ G  RMYY+G  E    E      IGLAVS  +NFR W R
Sbjct: 290  YWDARAVGTPCVVPMDDGSFRMYYIGANEGGHDELSSQHQIGLAVSAGTNFRQWHR 345


>ref|WP_019497328.1| glycosyhydrolase [Calothrix sp. PCC 7103]
          Length = 352

 Score =  207 bits (526), Expect = 1e-50
 Identities = 137/357 (38%), Positives = 186/357 (52%), Gaps = 10/357 (2%)
 Frame = +1

Query: 136  ESGRVLSPSPAGSGWWDAKCVAGPVVLQEPSSSAYRMYYYGRDSDNWNMGLKAINPNLPT 315
            + G + +P P   GWWD++ V+ P V++  +   ++M+YYGRD+ +++  +     NLPT
Sbjct: 9    QPGLIFTPGP--QGWWDSERVSCPRVIR-CADGTWKMWYYGRDA-SFDRQI-----NLPT 59

Query: 316  GRIGLALSTDGISFHRHRGPLQGGAVLDPSNDKHAFDCVQVAVSDALHDEPTLTWR---- 483
            GR GLA+S DGIS+ R +G +  GAVL+PS D   FD   + VS   + +  L W     
Sbjct: 60   GRCGLAISADGISWERVKGFMTMGAVLEPSPDPSRFDSAHIGVSSVEYSD-GLYWMWYFG 118

Query: 484  -IXXXXXXXXXXXXXXXXXPGVALSSDGVSITE----RKGPLLAAGGPGEWDERGVSWPR 648
                               PG A+S DG++        +G  L  G   E+D     WP+
Sbjct: 119  GDQTVQEVGKFKAKGIQMRPGCAISRDGINWVRLEGAYRGAFLDIGKTPEFDCLFCGWPQ 178

Query: 649  IFTGLHGQTLMTYHCLEAGGFFSAGIAVSQDGGLTWEKTGKVLTRGDEGSWDDKGVSVRH 828
            +     G   M YH       F  G+AVS D G  WEK G++L  G+ GS+D++G+  RH
Sbjct: 179  VLRDDDGTWKMYYHTFNPELGFLVGLAVSTD-GFNWEKVGQILGPGELGSFDERGIGTRH 237

Query: 829  VVRVGEREYVMFYEGSKMTERGWEFAIGMAVSEDGVTWRKDEEAGDEPGGPVLR-PRKGM 1005
            V++V  R YVMFYEG+  +       IG+A S DG+ W K E  G++PGGPV R    G 
Sbjct: 238  VLKVSSR-YVMFYEGANNSS---YHCIGVATSNDGIKWSKYE--GEQPGGPVFRHAPSGS 291

Query: 1006 DVWDNRVVGTPYVVVMESGEMRMYYLGRGEDKECEGVLTTGIGLAVSHVSNFRSWKR 1176
              WD R VGTP VV M  G  RMYY+G  E    E      IGLAVS  SNF  W R
Sbjct: 292  LRWDARAVGTPCVVPMNDGSFRMYYIGANEGGYDELSTQHQIGLAVSDGSNFYKWYR 348


>ref|YP_007167016.1| hypothetical protein PCC7418_0574 [Halothece sp. PCC 7418]
            gi|505037578|ref|WP_015224680.1| hypothetical protein
            [Halothece sp. PCC 7418] gi|428689508|gb|AFZ42802.1|
            hypothetical protein PCC7418_0574 [Halothece sp. PCC
            7418]
          Length = 349

 Score =  205 bits (522), Expect = 3e-50
 Identities = 139/356 (39%), Positives = 177/356 (49%), Gaps = 9/356 (2%)
 Frame = +1

Query: 136  ESGRVLSPSPAGSGWWDAKCVAGPVVLQEPSSSAYRMYYYGRDSDNWNMGLKAINPNLPT 315
            E G +L P     GWWD++ V+ P VL+  S   +RM+YYGRD +           NLPT
Sbjct: 7    ELGLILKPG--APGWWDSERVSSPCVLR-CSDGKWRMWYYGRDPEFDR------EINLPT 57

Query: 316  GRIGLALSTDGISFHRHRGPLQGGAVLDPSNDKHAFDCVQVAVSD-ALHDEPTLTWRI-- 486
            GR GLA+S DGI + R  GPL  G+V +P  D + FD   V VSD  + D     W    
Sbjct: 58   GRCGLAISDDGIHWERVSGPLTMGSVFEPHPDPNRFDSAHVGVSDVTVRDGLYWLWYFGG 117

Query: 487  -XXXXXXXXXXXXXXXXXPGVALSSDGVSITERKGP----LLAAGGPGEWDERGVSWPRI 651
                              PG A+S DG++    +GP    LL  G  G +DE    +P++
Sbjct: 118  DHDFTPVGNMKAKGIRMQPGCAVSGDGLNWIRLEGPDRGALLPTGESGAFDELFCGFPQV 177

Query: 652  FTGLHGQTLMTYHCLEAGGFFSAGIAVSQDGGLTWEKTGKVLTRGDEGSWDDKGVSVRHV 831
                 G   M YH L     F  G+AVS+D GL WEK G+VL  G  G +D++GV  RHV
Sbjct: 178  LRENDGSWKMYYHTLNPEKGFLVGLAVSED-GLQWEKVGEVLGSGSPGRFDERGVGTRHV 236

Query: 832  VRVGEREYVMFYEGSKMTERGWEFAIGMAVSEDGVTWRKDEEAGDEPGGPVL-RPRKGMD 1008
            + +G  EYVMFYEG           IG+A S DG  W K +  GDE GG V      G  
Sbjct: 237  LPIG-NEYVMFYEG---VNNSSYHCIGIATSSDGYHWEKQD--GDEIGGSVFAHAPSGSG 290

Query: 1009 VWDNRVVGTPYVVVMESGEMRMYYLGRGEDKECEGVLTTGIGLAVSHVSNFRSWKR 1176
             WD R VGTP VV +  G   +YY+G  E    E      IGLAVS  +++  WKR
Sbjct: 291  RWDARAVGTPCVVPLADGSWYLYYIGANEGGSDELTSQHQIGLAVSDGTDYGKWKR 346



 Score = 59.3 bits (142), Expect = 3e-06
 Identities = 65/216 (30%), Positives = 85/216 (39%), Gaps = 25/216 (11%)
 Frame = +1

Query: 574  TERKGPLLAAGGPGEWDERGVSWPRIFTGLHGQTLMTYH----------CLEAGGFFSAG 723
            TE  G +L  G PG WD   VS P +     G+  M Y+           L  G     G
Sbjct: 5    TEELGLILKPGAPGWWDSERVSSPCVLRCSDGKWRMWYYGRDPEFDREINLPTG---RCG 61

Query: 724  IAVSQDGGLTWEKTGKVLTRG-------DEGSWDDKGVSVRHVVRVGEREYVMFYEG--- 873
            +A+S D G+ WE+    LT G       D   +D   V V  V  V +  Y ++Y G   
Sbjct: 62   LAISDD-GIHWERVSGPLTMGSVFEPHPDPNRFDSAHVGVSDVT-VRDGLYWLWYFGGDH 119

Query: 874  -----SKMTERGWEFAIGMAVSEDGVTWRKDEEAGDEPGGPVLRPRKGMDVWDNRVVGTP 1038
                   M  +G     G AVS DG+ W + E     P    L P      +D    G P
Sbjct: 120  DFTPVGNMKAKGIRMQPGCAVSGDGLNWIRLE----GPDRGALLPTGESGAFDELFCGFP 175

Query: 1039 YVVVMESGEMRMYYLGRGEDKECEGVLTTGIGLAVS 1146
             V+    G  +MYY     +K   G L   +GLAVS
Sbjct: 176  QVLRENDGSWKMYYHTLNPEK---GFL---VGLAVS 205


>ref|WP_009150368.1| glycosyhydrolase [Thiorhodovibrio sp. 970]
            gi|380877873|gb|EIC19965.1| hypothetical protein
            Thi970DRAFT_03576 [Thiorhodovibrio sp. 970]
          Length = 352

 Score =  181 bits (458), Expect = 8e-43
 Identities = 126/359 (35%), Positives = 177/359 (49%), Gaps = 14/359 (3%)
 Frame = +1

Query: 142  GRVLSPSPAGSGWWDAKCVAGPVVLQEPSSSAYRMYYYGRDSDNWNMGLKAINP--NLPT 315
            G  L P PAG+  WD   V+GPVVL+E +   +RM+YYGRD+          +P  NLPT
Sbjct: 10   GLTLLPGPAGA--WDDARVSGPVVLRE-ADGRWRMWYYGRDT--------GFDPEINLPT 58

Query: 316  GRIGLALSTDGISFHRHRGPLQGGAVLDPSNDKHAFDCVQVAVSDALHDEPTL-TWRIXX 492
            GR GLA+S DG+ + R  GPL  GAV +P  D   FD   V VSD  ++      W +  
Sbjct: 59   GRCGLAVSDDGLHWARVLGPLTNGAVFEPHPDPKRFDSSHVGVSDLTYENGLYWMWYLGG 118

Query: 493  XXXXXXXXXXXXXXX---PGVALSSDGVSITERKGP----LLAAGGPGEWDERGVSWPRI 651
                              PG A+S DG+     +GP    LL  G PGE D     WP++
Sbjct: 119  DQQRTRIGQFEVKGLQLRPGCAVSRDGLHWLRLEGPYCGALLDLGAPGEPDMAVCGWPQV 178

Query: 652  FTGLHGQTLMTYHCLEAGGFFSAGIAVSQDGGLTWEKTGKVLTRGDEGSWDDKGVSVRHV 831
                 G   M YH L+             + GLTW K G++L  G+ G +D  G+  RH+
Sbjct: 179  RRFPDGIWRMYYHSLDPARMVFVVCLAESNDGLTWTKRGEILGPGEAGGFDALGIGTRHI 238

Query: 832  VRVGEREYVMFYEGSKMTERGWEFAIGMAVSEDGVTWRKDEEAGDEPGGPVL-RPRKGMD 1008
                +  ++MFYEG  + E G+  +IG+A+S++G+ W++  + G E  G +     KG  
Sbjct: 239  F-AHQGRWLMFYEG--VGEGGYR-SIGLAISDNGLDWQR--QPGPESNGSIFTHAPKGSG 292

Query: 1009 VWDNRVVGTPYVVVMESGEMRMYYLGRGEDK---ECEGVLTTGIGLAVSHVSNFRSWKR 1176
             WD   VGTP VV +  G  R+YY+G  E       E  +   IG+A+S   +F  W+R
Sbjct: 293  RWDAFAVGTPRVVALPDGTFRLYYVGANETPAGFADELAMVHQIGVAMSDGPDFTRWER 351


>ref|WP_023410516.1| hypothetical protein [uncultured Thiohalocapsa sp. PB-PSB1]
            gi|557039680|gb|ESQ16309.1| hypothetical protein
            N838_06220 [uncultured Thiohalocapsa sp. PB-PSB1]
          Length = 354

 Score =  172 bits (435), Expect = 4e-40
 Identities = 130/358 (36%), Positives = 176/358 (49%), Gaps = 13/358 (3%)
 Frame = +1

Query: 142  GRVLSPSPAGSGWWDAKCVAGPVVLQEPSSSAYRMYYYGRDSDNWNMGLKAINPNLPTGR 321
            G +  P P G+  WD + V+GP V   P  + +RM+YYGRD  +++  +     NLPTGR
Sbjct: 9    GLMFPPGPPGA--WDDERVSGPRVRSLPDGT-WRMWYYGRDR-SFDREI-----NLPTGR 59

Query: 322  IGLALSTDGISFHRHRGPLQGGAVLDPSNDKHAFDCVQVAVSD-ALHDEPTLTWRIXXXX 498
            +GLA S DG+ + R  GPL GGAV +P  D   FD   V V D  L D     W      
Sbjct: 60   VGLARSNDGLHWQRVLGPLTGGAVFEPHADPARFDSAHVGVGDVTLTDGTYRLWYFGGDH 119

Query: 499  XXXXXXXXXXXXXP---GVALSSDGVSITER----KGPLLAAGGPGEWDERGVSWPRIFT 657
                             G ALS+DGV    +    +G LL  G PG +D    +WP++  
Sbjct: 120  TRQRFGRFEAKGLQLRIGCALSTDGVHWQRQDGAHRGALLDLGAPGTFDSATCAWPQVLA 179

Query: 658  GLHGQTLMTYHCLEAGG-FFSAGIAVSQDGGLTWEKTGKVLTRGDEGSWDDKGVSVRHVV 834
               G+  M YH L+A    F  G+A S D  L W + G+V   G+ G++D  GV  RHV+
Sbjct: 180  LDDGRWRMYYHSLDAKRMLFVVGVAESAD-QLEWTRRGEVFGPGEAGAFDALGVGTRHVL 238

Query: 835  RVGEREYVMFYEGSKMTERGWEFAIGMAVSEDGVTWRKDEEAGDEPGGPVL-RPRKGMDV 1011
            R  +  ++MFYEG +    G+  +IG+A SEDG+ W +    G E  G V     +G   
Sbjct: 239  R-HQGRWLMFYEGVRAD--GYR-SIGLAESEDGLRWTR--IPGHESDGSVFAHAPRGSGR 292

Query: 1012 WDNRVVGTPYVVVMESGEMRMYYLGRGEDK---ECEGVLTTGIGLAVSHVSNFRSWKR 1176
            WD   VGTP  V M  G  R+YY+G  E       E  +   IG+AVS  +N   W R
Sbjct: 293  WDAFAVGTPCAVPMSDGSWRLYYVGANETPGGYADELAMRHQIGVAVSDGANLMRWIR 350


>ref|XP_001753037.1| predicted protein [Physcomitrella patens] gi|162695736|gb|EDQ82078.1|
            predicted protein [Physcomitrella patens]
          Length = 235

 Score =  141 bits (355), Expect = 7e-31
 Identities = 88/229 (38%), Positives = 118/229 (51%), Gaps = 16/229 (6%)
 Frame = +1

Query: 538  PGVALSSDGVSITERK---GPLLAAGGPGEWDERGVSWPRIFTGLHGQT----------- 675
            PG+  S+DGV          PLL  G   EWDE  ++WPR+                   
Sbjct: 17   PGLVKSADGVVFDRALFTGNPLLNVGLQNEWDELFIAWPRVLPPSSRHRCQDPSDEDKTW 76

Query: 676  LMTYHCLEAGG--FFSAGIAVSQDGGLTWEKTGKVLTRGDEGSWDDKGVSVRHVVRVGER 849
            LMTY  +E     F S G+A+  DG   W K GK LTRG  GSWD+ GV  RHV+ + + 
Sbjct: 77   LMTYSSIEKQTLPFSSIGVALPADGH-KWFKAGKALTRGAPGSWDEGGVGRRHVLLI-DN 134

Query: 850  EYVMFYEGSKMTERGWEFAIGMAVSEDGVTWRKDEEAGDEPGGPVLRPRKGMDVWDNRVV 1029
            EY MFYEG  +  +G    IG+A+S DG+ W +D        GP+   R G +VWDN  V
Sbjct: 135  EYFMFYEG--VNNKGIH-GIGLAISPDGIHWERDPLCQ----GPIFSARVGEEVWDNGTV 187

Query: 1030 GTPYVVVMESGEMRMYYLGRGEDKECEGVLTTGIGLAVSHVSNFRSWKR 1176
              P+V+ M+ G  RMYY+G  + K       + +G+A+S   NFR+W R
Sbjct: 188  AAPHVLQMDDGSFRMYYVGTNDTK-----TESAMGMAISKGKNFRTWTR 231


>gb|EWM26647.1| Glycosyl hydrolase family 43, five-bladed beta-propellor domain
            protein [Nannochloropsis gaditana]
          Length = 410

 Score =  140 bits (354), Expect = 9e-31
 Identities = 124/367 (33%), Positives = 168/367 (45%), Gaps = 22/367 (5%)
 Frame = +1

Query: 142  GRVLSPSPAGSGWWDAKCVAGPVVLQEPSSSAYRMYYYGRDSDNWNMGLKAINPNLPTGR 321
            G++L P   G+  WD   V+GPVV Q    S + M+YYGR S  +N     IN  LP+G 
Sbjct: 57   GKILGPGVGGA--WDDYKVSGPVVRQSRDGS-WNMWYYGR-SKEFNKTANLIN--LPSGH 110

Query: 322  IGLALSTDGIS-FHRHRGPLQGGAVLDPSNDK---HAFDCVQVAVSDALHDEPTLTWRIX 489
            IGL+ S DG++ F R +GPL  GAV  PS+++    AFD + V + D + +E    W + 
Sbjct: 111  IGLSQSKDGLTDFARVKGPLLDGAVFGPSSNEAGGKAFDSLHVGIGDIVWEEARQQWVMY 170

Query: 490  XXXXXXXXXXXXXXXXPGV------ALSSDGVSITERKGPLLAAGGPGEWDERGVSWPRI 651
                             G+      A S DGV  +   G LL  G  G++D   V WP++
Sbjct: 171  YFGGDGEYGPTPYGQARGIYMKIGKAESPDGVQWSRAPGVLLDKGAAGDFDALFVGWPQV 230

Query: 652  FTGLHGQTL------MTYHCLEAGGF-FSAGIAVSQD-GGLTWEKTGKV-LTRGDEGSWD 804
                    +      M YH   A  F F  G+A S D  G  W K G + L RG  GS+ 
Sbjct: 231  VDFGKADVIPGMKQGMFYHTFNAVTFGFEIGLATSSDTTGNKWIKRGPIHLPRGAPGSYC 290

Query: 805  DKGVSVRHVV--RVGEREYVMFYEGSKMTERGWEFAIGMAVSEDGVTWRKDEEAGDEPGG 978
            D G + R V+      ++ +MF E +          I +  S DG+ W         P G
Sbjct: 291  DLGHATRCVIPDPANRKKLLMFAECANTQNSN---CIAVYGSNDGLLWE-----NMHPAG 342

Query: 979  PV-LRPRKGMDVWDNRVVGTPYVVVMESGEMRMYYLGRGEDKECEGVLTTGIGLAVSHVS 1155
            PV +   KG   WD + +G P V V + G   MYY+G  E    EG     IGLAVS   
Sbjct: 343  PVFVGAGKGSGRWDAQALGCPSVTVRD-GCFYMYYVGFTESSS-EGAALGCIGLAVSDGM 400

Query: 1156 NFRSWKR 1176
            +F  WKR
Sbjct: 401  DFTKWKR 407


>emb|CBN79395.1| putative lipoprotein [Ectocarpus siliculosus]
          Length = 464

 Score =  136 bits (343), Expect = 2e-29
 Identities = 116/381 (30%), Positives = 167/381 (43%), Gaps = 41/381 (10%)
 Frame = +1

Query: 157  PSPAGSGWWDAKCVAGPVVLQ--EPSSSAYRMYYYGRDSDNWNMGLKAINPNLPTGRIGL 330
            P+    GW+D+  V  P V +      + + M+Y+G+D++ WN   K +  N+ TGRIG 
Sbjct: 93   PTDKEEGWFDSASVGSPRVHRYYHDEGNRWVMWYHGQDTE-WNKEGKGVM-NVGTGRIGR 150

Query: 331  ALSTDGISFHRHRGPLQGGAVLDPSNDKH-AFDCVQVAVSD---------ALHDEPTLTW 480
            A STDG+++ R  G +   +VLD + ++   FD   V + D         A        +
Sbjct: 151  AESTDGLTWRRTAGQMAMSSVLDKNTEQWWGFDTAHVGLGDVNLGASSRVATESSVYFMY 210

Query: 481  RIXXXXXXXXXXXXXXXXXP-----------------GVALSSDGVSITERKGP-----L 594
                               P                 GVALS DG++    +G       
Sbjct: 211  YFGGDYEETDVQAEFGLSNPVVCGDSNAPPKGVRMRIGVALSQDGLNWCRVEGEHPTGAC 270

Query: 595  LAAGGPGEWDERGVSWPRIFTGLHGQTLMTYHCLEAGGF-FSAGIAVSQDGGLTWEKTGK 771
            +  GG GEWD   V WP +   +  +  M YH L+     F  G+A SQDG L WEK G 
Sbjct: 271  VDVGGSGEWDRLFVGWPVVINHMEKEFRMYYHALDPDTKKFRVGMATSQDG-LAWEKKGP 329

Query: 772  VLTRGDEGSWDDKGVSVRHVVRVGEREYVMFYEGSKMTERGWEFAIGMAVSEDGVTWRKD 951
            V   G EGS+D++G   R +V + +  Y M YEG    ++    A+G+A S+DG+ W   
Sbjct: 330  VFDGGPEGSFDERGAGRRRIV-MHKGVYHMVYEG---VDKDGVHALGLATSKDGIKW--- 382

Query: 952  EEAGDEPGGPVLRPRKGMDVWDNRVVGTPYVVVMESGEMRMYYLGRGEDK---ECEGVLT 1122
            E   D+P     R   G   WD   V  P +V  + G   +YY G  E K   E EG   
Sbjct: 383  ERHSDQP--IFERSPPGSGAWDAGGVSNPEIVETDGGMWYLYYSGYPEKKEGGEGEGAGV 440

Query: 1123 TG---IGLAVSHVSNFRSWKR 1176
             G   IG+AV+   +   W R
Sbjct: 441  FGNSAIGVAVAVGDDLTKWTR 461


>ref|XP_004297285.1| PREDICTED: uncharacterized protein LOC101300242 [Fragaria vesca
            subsp. vesca]
          Length = 454

 Score =  128 bits (322), Expect = 5e-27
 Identities = 119/379 (31%), Positives = 173/379 (45%), Gaps = 31/379 (8%)
 Frame = +1

Query: 133  WESGRVLSPSPAGSGWWDAKCVAGPVVLQEPSSSAYR--MYYYGRDSDNWNMGLKAINPN 306
            W +G V       S  WD+  +  PVV +    +  R  M+Y+GR SD          PN
Sbjct: 101  WSTGLVFDLGNKTS--WDSSEIGSPVVKRFIGDNEERWYMWYHGRSSD----------PN 148

Query: 307  LPTGRIGLALSTDGISFHRHRGPLQG-----GAVLDPSNDKHAFDCVQVAVSD-ALHDEP 468
                 IGLA+S++GI + R    +       G V++ SN+  AFD   +  S+  +   P
Sbjct: 149  --NSSIGLAVSSNGIHWARGAEHVMSCGEDEGLVMNCSNNWWAFDTKSIKPSEMVIMSSP 206

Query: 469  TLT---WRIXXXXXXXXXXXXXXXXX----------PGVALSSDG-----VSITERKGPL 594
              +   W                             PG+A S DG     +      G L
Sbjct: 207  MYSAVYWLYYTGYSSEKVDCTNYPVGQYPVDALKSLPGLACSQDGRNWARIEGDYHSGAL 266

Query: 595  LAAGGPGEWDERGVSWPRIFTGLHGQTLMTYHCLEAG-GFFSAGIAVSQDGGLTWEKTGK 771
            L  G   EWD   ++ P++         M YH  +A  G F+ G+A S+DG + W K GK
Sbjct: 267  LDVGSKNEWDSLSIAKPQVVVHGSDDMRMYYHSFDAEKGHFAIGMARSRDG-IRWVKLGK 325

Query: 772  VLTRGDEGSWDDKGVSVRHVVR-VGEREYVMFYEGSKMTERGWEFAIGMAVSEDGVT-WR 945
            ++  G +GS+D+ GV    VVR   E  Y+M YEG +   +    +IG+AVS DG+  W 
Sbjct: 326  IMGGGPKGSFDELGVKNACVVRNRKEGNYLMAYEGVRADGK---ISIGLAVSPDGLNEWT 382

Query: 946  KDEEAGDEPGGPVLRPRKGMDVWDNRVVGTPYVVVMESGE--MRMYYLGRGEDKECEGVL 1119
            + +       G VL P  G D WDN+ VG+P +V ME      R+YY+G GED       
Sbjct: 383  RVQ------NGAVLSP-SGDDAWDNKGVGSPSLVQMEGNADLWRLYYVGVGEDGR----- 430

Query: 1120 TTGIGLAVSHVSNFRSWKR 1176
             TGIG+A+S  S+ R+++R
Sbjct: 431  -TGIGMAISEGSDVRTFRR 448


>ref|XP_006827322.1| hypothetical protein AMTR_s00010p00267390 [Amborella trichopoda]
            gi|548831751|gb|ERM94559.1| hypothetical protein
            AMTR_s00010p00267390 [Amborella trichopoda]
          Length = 462

 Score =  125 bits (315), Expect = 3e-26
 Identities = 115/374 (30%), Positives = 163/374 (43%), Gaps = 43/374 (11%)
 Frame = +1

Query: 181  WDAKCVAGPVVLQEPSSSAYR--MYYYGRDSDNWNMGLKAINPNLPTGRIGLALSTDGIS 354
            WD+K +  PV+ +  S    R  M+Y+GR     +              IG A+S++G+ 
Sbjct: 113  WDSKEIGSPVIRRYLSDDEERWCMWYHGRGDQTCDS-------------IGFAISSNGVH 159

Query: 355  FHRHRGPLQG----GAVLDPSNDKHAFDCVQVAVSDAL---------------------- 456
            + R  GP +     G V+  S D  AFD   +  SD L                      
Sbjct: 160  WERGSGPARTTEDVGLVMHCSQDWWAFDTENIRPSDILIMSSNRDRASIGVYWLYYTGFN 219

Query: 457  -HDEPTLTWRIXXXXXXXXXXXXXXXXXPGVALSSDG-----VSITERKGPLLAAGGPGE 618
              +   +T                    PG+A+S DG     +      G L+  GG GE
Sbjct: 220  SEEIDMMTKPPIGNPDRGSGSHQIFRSLPGLAMSQDGKHWARIEGEHHSGALIDVGGEGE 279

Query: 619  WDERGVSWPRIFTGLHGQTLMTYHCLE-AGGFFSAGIAVSQDGGLTWEKTGKVLTRGDEG 795
            WD   ++ P++         M YH  +   G F  G+A S+D G+ W K GK++  G  G
Sbjct: 280  WDSTFIASPQVVYHCRNDLRMYYHSFDMQAGCFCIGLARSRD-GIKWVKLGKIMGGGLPG 338

Query: 796  SWDDKGVSVRHVV-RVGERE---YVMFYEGSKMTERGWEFAIGMAVSEDGV-TWRKDEEA 960
            S+D+ GV    VV ++G+     YVM YEG K        +IG+A S DG+  WR+  E 
Sbjct: 339  SFDEGGVMGPQVVSKIGKESGEGYVMVYEGLK---ADGSRSIGLAESPDGLKDWRRCSE- 394

Query: 961  GDEPGGPVLRPRKGMDVWDNRVVGTPYVVVMESG-EMRMYYLGRGEDKECEGVLTTGIGL 1137
                G  VL P    D WDN  VG+P VV M+ G E R+YY G G+      V  +GIG+
Sbjct: 395  ----GVVVLGPSAEKDRWDNGGVGSPSVVRMDGGNEWRLYYRGFGK------VGRSGIGM 444

Query: 1138 AVSH--VSNFRSWK 1173
            AVS   +  F+ WK
Sbjct: 445  AVSDDGLKTFKRWK 458


>ref|XP_002336049.1| predicted protein [Populus trichocarpa]
          Length = 343

 Score =  124 bits (310), Expect = 1e-25
 Identities = 111/366 (30%), Positives = 161/366 (43%), Gaps = 17/366 (4%)
 Frame = +1

Query: 124  MVAWESGRVLSPSPAGSGWWDAKCVAGPVVLQEPSSSAYRMYYYGRDSDNWNMGLKAINP 303
            M +   G V    P  S  WD K +  PVV +  S    R Y +   + + N G      
Sbjct: 1    MESLSRGLVFDLGPLNS--WDGKEIGSPVVKRFLSDEEERWYMWYHGNSSQNSG------ 52

Query: 304  NLPTGRIGLALSTDGISFHRHRGPLQG----GAVLDPSNDKHAFDCVQVAVSDALHDEPT 471
                  IGLA+S++GI + R  GP+      G+V+    D  AFD + +   + +    +
Sbjct: 53   --SADSIGLAVSSNGIHWERGVGPVSSSGDVGSVMKCGQDWWAFDTMSIRPGEVVVMSSS 110

Query: 472  LTWRIXXXXXXXXXXXXXXXXXPGVALSSDG-----VSITERKGPLLAAGGPGEWDERGV 636
               R                  PG+A+S DG     +      G L   G   EWD   +
Sbjct: 111  KV-RASSAVYWLYYSGRIFKSLPGLAMSQDGRHWARIEGEHHSGALFDVGSEREWDSLFI 169

Query: 637  SWPRIFTGLHGQTLMTYHCLEA-GGFFSAGIAVSQDGGLTWEKTGKVLTRGDEGSWDDKG 813
            + PR+    +    M YH  +   G F  GIA S+DG + W K GK++  G   S+D+ G
Sbjct: 170  AGPRVVFHGNSDLRMYYHSFDVESGQFGIGIARSRDG-INWMKLGKIIGGGKISSFDEFG 228

Query: 814  VSVRHVVR-VGEREYVMFYEGSKMTERGWEFAIGMAVSEDGVT-WRKDEEAGDEPGGPVL 987
                 VVR   +  Y+M YEG      G + +IG+AVS DG+  WR+ ++         +
Sbjct: 229  ALNACVVRNKKDGRYLMAYEG---VAAGGKRSIGLAVSPDGLRDWRRFQDEA-------V 278

Query: 988  RPRKGMDVWDNRVVGTPYVVVM--ESGEMRMYYLGRGEDKECEGVLTTGIGLAVSH---V 1152
                  D WDN+ VG+P +V M  E  E R+YY G G +        TGIG+A+S    V
Sbjct: 279  LESSVKDGWDNKGVGSPCLVQMDGEVDEWRLYYRGVGNEGR------TGIGMAISQGNDV 332

Query: 1153 SNFRSW 1170
            S+FR W
Sbjct: 333  SSFRRW 338


>ref|XP_006379671.1| hypothetical protein POPTR_0008s08930g [Populus trichocarpa]
            gi|550332693|gb|ERP57468.1| hypothetical protein
            POPTR_0008s08930g [Populus trichocarpa]
          Length = 453

 Score =  123 bits (309), Expect = 1e-25
 Identities = 110/360 (30%), Positives = 159/360 (44%), Gaps = 17/360 (4%)
 Frame = +1

Query: 142  GRVLSPSPAGSGWWDAKCVAGPVVLQEPSSSAYRMYYYGRDSDNWNMGLKAINPNLPTGR 321
            G V    P  S  WD K +  PVV +  S    R Y +   + + N G            
Sbjct: 117  GLVFDLGPLNS--WDGKEIGSPVVKRFLSDEEERWYMWYHGNSSQNSG--------SADS 166

Query: 322  IGLALSTDGISFHRHRGPLQG----GAVLDPSNDKHAFDCVQVAVSDALHDEPTLTWRIX 489
            IGLA+S++GI + R  GP+      G+V+    D  AFD + +   + +    +   R  
Sbjct: 167  IGLAVSSNGIHWERGVGPVSSSGDVGSVMKCGQDWWAFDTMSIRPGEVVVMSSSKV-RAS 225

Query: 490  XXXXXXXXXXXXXXXXPGVALSSDG-----VSITERKGPLLAAGGPGEWDERGVSWPRIF 654
                            PG+A+S DG     +      G L   G   EWD   ++ PR+ 
Sbjct: 226  SAVYWLYYSGRIFKSLPGLAMSQDGRHWARIEGEHHSGALFDVGSEREWDSLFIAGPRVV 285

Query: 655  TGLHGQTLMTYHCLEA-GGFFSAGIAVSQDGGLTWEKTGKVLTRGDEGSWDDKGVSVRHV 831
               +    M YH  +   G F  GIA S+DG + W K GK++  G   S+D+ G     V
Sbjct: 286  FHGNSDLRMYYHSFDVESGQFGIGIARSRDG-INWMKLGKIIGGGKISSFDEFGALNACV 344

Query: 832  VR-VGEREYVMFYEGSKMTERGWEFAIGMAVSEDGVT-WRKDEEAGDEPGGPVLRPRKGM 1005
            VR   +  Y+M YEG      G + +IG+AVS DG+  WR+ ++         +      
Sbjct: 345  VRNKKDGRYLMAYEG---VAAGGKRSIGLAVSPDGLRDWRRFQDEA-------VLESSVK 394

Query: 1006 DVWDNRVVGTPYVVVM--ESGEMRMYYLGRGEDKECEGVLTTGIGLAVSH---VSNFRSW 1170
            D WDN+ VG+P +V M  E  E R+YY G G +        TGIG+A+S    VS+FR W
Sbjct: 395  DGWDNKGVGSPCLVQMDGEVDEWRLYYRGVGNEGR------TGIGMAISQGNDVSSFRRW 448


>ref|XP_002266397.1| PREDICTED: uncharacterized protein LOC100264211 [Vitis vinifera]
          Length = 491

 Score =  123 bits (308), Expect = 2e-25
 Identities = 117/395 (29%), Positives = 170/395 (43%), Gaps = 52/395 (13%)
 Frame = +1

Query: 142  GRVLSPSPAGSGWWDAKCVAGPVVLQEPSSSAYRMYYYGRDSDNWNMGLKAINPNLPTGR 321
            G V    P+ S  WD+  +  PVV +  S    R Y        W  G  A N N  +  
Sbjct: 120  GLVFDLGPSNS--WDSAQIGSPVVKRFLSDDEERWYM-------WYHG--ASNENSASDS 168

Query: 322  IGLALSTDGISFHRHRGPLQGGA----VLDPSNDKHAFDCVQVAVSDAL-------HDEP 468
            IGLA+S++G+ + R  GP++ G     V++   D  AFD + +  SD +           
Sbjct: 169  IGLAVSSNGVHWERGGGPVRSGGDVGLVMNCGKDWWAFDTMSIRPSDVVIMSSNRVRGSS 228

Query: 469  TLTW--------------------------RIXXXXXXXXXXXXXXXXXPGVALSSDG-- 564
             + W                          R                  PG+A+S DG  
Sbjct: 229  AVYWLYYTGYSSEKVVFLDDSLELYLENPERAGAENGENGGIGKIFKSLPGLAISQDGRH 288

Query: 565  ---VSITERKGPLLAAGGPGEWDERGVSWPRIFTGLHGQTLMTYHCLEA-GGFFSAGIAV 732
               +      G L   G   EWD   ++ P++    +G   M YH  +   G F+ GIA 
Sbjct: 289  WARIEGEHHTGALFDVGLENEWDSMYIASPQVVFHGNGDLRMYYHSFDVENGQFAIGIAR 348

Query: 733  SQDGGLTWEKTGKVLTRGDEGSWDDKGVSVRHVVR-VGEREYVMFYEGSKMTERGWEFAI 909
            S+DG + W K GK++  G  GS+D+ GV    VV+   + +YVM YEG    +     +I
Sbjct: 349  SKDG-IRWVKLGKIMGGGISGSFDESGVVKACVVKNRRDGKYVMAYEG---VDGNGRRSI 404

Query: 910  GMAVSEDGV-TWRKDEEAGDEPGGPVLRPRKGMDVWDNRVVGTPYVVVM----ESGEMRM 1074
            G+AVS DG+  WR+ ++        VL P +  D WDN+ VG+P +V M    + GE R+
Sbjct: 405  GLAVSPDGLKEWRRSQDEA------VLMPAED-DGWDNKGVGSPCLVQMDGDGDGGEWRL 457

Query: 1075 YYLGRGEDKECEGVLTTGIGLAVSHVSN---FRSW 1170
            YY G G+         TGIG+AV   S+   FR W
Sbjct: 458  YYRGIGQGGR------TGIGMAVCEGSDRRRFRKW 486


>gb|EXC20011.1| hypothetical protein L484_015688 [Morus notabilis]
          Length = 481

 Score =  121 bits (304), Expect = 6e-25
 Identities = 120/407 (29%), Positives = 182/407 (44%), Gaps = 65/407 (15%)
 Frame = +1

Query: 151  LSPSPAGSGW-----------WDAKCVAGPVVLQEPSSSAYR--MYYYGRDSDNWNMGLK 291
            LSPS A S             WD+  +  PVV +  S    R  M+Y+GR S + N    
Sbjct: 90   LSPSSASSSGGLVFDLGIENSWDSAEIGSPVVKRFLSDEEERWYMWYHGRSSRSKN---D 146

Query: 292  AINPNLPTGRIGLALSTDGISFHRHRGPLQG----GAVLDPSNDKHAFDCV--------- 432
            + NP L +  +GLA+S++G+ + R  GP+Q     G V+    D  AFD +         
Sbjct: 147  SENPCLDS--VGLAVSSNGVHWERGVGPVQASRDVGFVMSCGKDWWAFDTLSIRPSKVVI 204

Query: 433  ----QVAVSDALH----------------DEPTLTWRIXXXXXXXXXXXXXXXXX----- 537
                +V VS A++                 + +  + +                      
Sbjct: 205  MSSSKVRVSSAVYWMYYTGFSSEEIDIDISDESFKFSLENPERFFGDFEGGSTSSGKIHK 264

Query: 538  --PGVALSSDG-----VSITERKGPLLAAGGPGEWDERGVSWPRIFTGLHGQTLMTYHCL 696
              PG+A+S DG     +      G L   G   EWD   ++ P++    +G   M YH  
Sbjct: 265  SLPGLAISQDGRYWARIEGEHHSGALFDVGAEKEWDSLFIASPQVVFHGNGDLRMYYHSF 324

Query: 697  EAG-GFFSAGIAVSQDGGLTWEKTGKVL--TRGDEGSWDDKGVSVRHVVRVG-EREYVMF 864
            + G G F  G+A S+DG + W K GK++   +   G++D+ G    +VVR   + +Y+M 
Sbjct: 325  DVGNGEFCIGMARSRDG-IRWVKLGKIIGGEKNTSGAFDEFGALNANVVRNRKDGKYLMA 383

Query: 865  YEGSKMTERGWEFAIGMAVSEDGV-TWRKDEEAGDEPGGPVLRPRKGMDVWDNRVVGTPY 1041
            YEG        E +IG+A+S+DG+  W K  +      GPVL+  +  + WDNR VG+P 
Sbjct: 384  YEGVSCNG---ERSIGLAMSQDGLKNWTKFRD------GPVLKASEAQNGWDNRGVGSPC 434

Query: 1042 VVVM--ESGEMRMYYLGRGEDKECEGVLTTGIGLAVSHVSNFRSWKR 1176
            +V M  E  E R+YY G G +        TGIG+A SH S+F  + R
Sbjct: 435  LVQMDGEEDEWRLYYRGVGNEGR------TGIGMAASHGSDFGRFTR 475

