BLASTX nr result

ID: Ephedra28_contig00001912 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra28_contig00001912
         (1905 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002982343.1| hypothetical protein SELMODRAFT_445126 [Sela...   256   2e-65
ref|XP_002966502.1| hypothetical protein SELMODRAFT_439533 [Sela...   224   8e-56
ref|XP_003078927.1| unnamed protein product [Ostreococcus tauri]...   208   6e-51
ref|XP_002503729.1| predicted protein [Micromonas sp. RCC299] gi...   202   3e-49
ref|XP_003059314.1| predicted protein [Micromonas pusilla CCMP15...   190   2e-45
emb|CCO17636.1| predicted protein [Bathycoccus prasinos]              186   3e-44
ref|XP_001753862.1| predicted protein [Physcomitrella patens] gi...   169   4e-39
ref|XP_005850298.1| hypothetical protein CHLNCDRAFT_142045 [Chlo...   128   1e-26
ref|XP_002502719.1| predicted protein [Micromonas sp. RCC299] gi...    97   3e-17
ref|XP_001421028.1| predicted protein [Ostreococcus lucimarinus ...    92   6e-16
ref|XP_003082486.1| unnamed protein product [Ostreococcus tauri]...    88   1e-14
ref|XP_003061086.1| predicted protein [Micromonas pusilla CCMP15...    81   2e-12
ref|XP_002955892.1| hypothetical protein VOLCADRAFT_96785 [Volvo...    73   4e-10
ref|WP_002701319.1| pdz domain (also known as dhr or glgf) prote...    69   7e-09
ref|YP_005012075.1| PDZ/DHR/GLGF domain-containing protein [Nias...    68   1e-08
ref|WP_002698385.1| hypothetical protein [Microscilla marina] gi...    66   6e-08
ref|WP_018630490.1| hypothetical protein [Niabella aurantiaca]         65   1e-07
ref|XP_005649893.1| hypothetical protein COCSUDRAFT_46683, parti...    60   3e-06
emb|CDF78235.1| PDZ/DHR/GLGF domain-containing protein [Formosa ...    60   3e-06
ref|WP_018478012.1| hypothetical protein [Pontibacter roseus]          60   4e-06

>ref|XP_002982343.1| hypothetical protein SELMODRAFT_445126 [Selaginella moellendorffii]
            gi|300149935|gb|EFJ16588.1| hypothetical protein
            SELMODRAFT_445126 [Selaginella moellendorffii]
          Length = 532

 Score =  256 bits (654), Expect = 2e-65
 Identities = 157/526 (29%), Positives = 262/526 (49%), Gaps = 9/526 (1%)
 Frame = -3

Query: 1840 LKECLHNLSLSTGVHYTK---PTQHFTLKGGQHMLGLDGDWVFSWCMDGRFYEKFDSQDI 1670
            ++EC+  +  + G+   +     +  T+ G Q  + + GD+ F+WC DGRF+E+F  +  
Sbjct: 52   VEECIERMRDALGIENVEGECADKFITMTGKQWFMDMKGDYKFTWCADGRFHERFKGEHF 111

Query: 1669 TFECGYDGNNGHWHVDHTGRVTAVELDDDEVTQLCVYLRTGFWLTEKGQNLLNIQFEGEG 1490
            T E  +DG N  W+ D  GRVT ++LD  E+  L  ++R G+WLT +GQ  L+I+   + 
Sbjct: 112  TVEWAHDGRNT-WYADFAGRVTKLQLDSLELCLLTAWIRNGYWLTAEGQEKLSIKHRTDA 170

Query: 1489 NFDAIKKKTGEIKFAICLKNK--KIVAHLHVDACQWLPLSMELKVLGQTERWEYKDWRTI 1316
                     G++  A+  + +   +VA + VD+  +LP  +  KV G+ E W Y DW+T+
Sbjct: 171  ---------GDLPVAVNPRMQILDVVASVVVDSVSFLPSRLVAKVCGEIESWGYSDWQTL 221

Query: 1315 DDCLKHRYPYLCVRYPGAGGKDTFTVAHSVLHNGVEDTAGNTIDRGLPYRYPDTLLVPRH 1136
            +   K + P        +G  D+  V+ S + +  + +          +  P +  +PR 
Sbjct: 222  EPECKCQVPLSVAHVTSSGSIDSIFVSCSKVGSNEDSSV---------FVLPSSRFLPRD 272

Query: 1135 NQFYPRVSIDAACPSAVKMLRAKGGQLLVKPHVDGSDIGYFILDTGAGGLAISPSLADKL 956
               YP    DA     VK+   + G  LV+P VDG DIGYF+LDTGA GL I  +   +L
Sbjct: 273  G-VYPHAEFDAGVSPNVKLFGCESGHCLVRPLVDGQDIGYFLLDTGASGLMIDANKTREL 331

Query: 955  KMGAFGEVFVSGFHGHKKSHYRRGNSFQLGPLKIESPLFVEVDTASLNIGPHPIVGVCGF 776
             M  FGEV + G     +S + RG  FQ+G LKI +P+++E                   
Sbjct: 332  GMEGFGEVHLLGVETRVRSRFVRGKQFQIGGLKIVNPIYIE------------------- 372

Query: 775  DVFQQGIVKLSCREESISIFDSKEFERDSEYSLEWEKLAFIQNVPNIAAKINGQNALLLF 596
                     +S  +  + +FD   +   S  +L+W+++  + N+P + AKING   LLL 
Sbjct: 373  -------TSMSVSKGLLCVFDPASYVPSS--NLQWQEMILLDNLPYVPAKINGHQVLLLL 423

Query: 595  DTGASGVDVIFHSESDIYTNTQLE---RSTNIRGINSQGLRQQVSDMVINRVEIANHVFK 425
            DTGA G DVI H+ +       ++   +  +++  +S G         I  +E    VF+
Sbjct: 424  DTGAGGADVILHARAAREYFPGIDERLQCGSVKESSSGGGGLVAGTGEIQTLEFGGFVFE 483

Query: 424  NVHAICLSSET-KLPLSEYTSGILCMDMIENFVIIIDYPNRRIALV 290
             + A+ L  +   L +S+YT+G+LC  +++  +I++DY N R  L+
Sbjct: 484  KLEALFLRYKVGNLDVSQYTAGLLCGRLLKTCLIVLDYQNSRFGLL 529


>ref|XP_002966502.1| hypothetical protein SELMODRAFT_439533 [Selaginella moellendorffii]
            gi|300165922|gb|EFJ32529.1| hypothetical protein
            SELMODRAFT_439533 [Selaginella moellendorffii]
          Length = 429

 Score =  224 bits (572), Expect = 8e-56
 Identities = 126/396 (31%), Positives = 206/396 (52%), Gaps = 5/396 (1%)
 Frame = -3

Query: 1840 LKECLHNLSLSTGVHYTK---PTQHFTLKGGQHMLGLDGDWVFSWCMDGRFYEKFDSQDI 1670
            ++EC+  +  + G+   +     +  T+ G Q  + + GD+ F+WC DGRF+E+F  +  
Sbjct: 52   VEECIERMRDALGIENVEGECADKFITMTGKQWFMDMKGDYKFTWCADGRFHERFKGEHF 111

Query: 1669 TFECGYDGNNGHWHVDHTGRVTAVELDDDEVTQLCVYLRTGFWLTEKGQNLLNIQFEGEG 1490
            T E  +DG N  W+ D  GRVT ++LD  E+  L  ++R G+WLT +GQ  L+I+   + 
Sbjct: 112  TVEWAHDGRNT-WYADFAGRVTKLQLDSLELCLLTAWIRNGYWLTAEGQEKLSIKHRTDA 170

Query: 1489 NFDAIKKKTGEIKFAICLKNK--KIVAHLHVDACQWLPLSMELKVLGQTERWEYKDWRTI 1316
                     G++  A+  + +   +VA + VD+  +LP  +  KV G+ E W Y DW+T+
Sbjct: 171  ---------GDLPVAVNPRMQILDVVASVVVDSVSFLPSRLVAKVCGEIESWGYSDWQTL 221

Query: 1315 DDCLKHRYPYLCVRYPGAGGKDTFTVAHSVLHNGVEDTAGNTIDRGLPYRYPDTLLVPRH 1136
            +   K + P        +G  D+  V+ S + +  + +          +  P +  +PR 
Sbjct: 222  EPECKCQVPLSVAHVTSSGSIDSIFVSCSKVGSNEDSSV---------FALPSSRFLPRD 272

Query: 1135 NQFYPRVSIDAACPSAVKMLRAKGGQLLVKPHVDGSDIGYFILDTGAGGLAISPSLADKL 956
               YP    DA     VK+   + G  LV+P VDG DIGYF+LDTGA GL I  +   +L
Sbjct: 273  G-VYPHAEFDAGVSPNVKLFGCESGHCLVRPLVDGQDIGYFLLDTGASGLMIDANKTREL 331

Query: 955  KMGAFGEVFVSGFHGHKKSHYRRGNSFQLGPLKIESPLFVEVDTASLNIGPHPIVGVCGF 776
             M  FGEV + G     +S + RG  FQ+G LKI +P+++E     +      IVGVCGF
Sbjct: 332  GMEGFGEVHLLGVETRVRSRFVRGKQFQIGGLKIVNPIYIETSARGVGETAEAIVGVCGF 391

Query: 775  DVFQQGIVKLSCREESISIFDSKEFERDSEYSLEWE 668
            D+F   ++++S  +  + +FD   +   S  +L+W+
Sbjct: 392  DLFFHCVIEMSVSKGLLCVFDPASYVPSS--NLQWQ 425


>ref|XP_003078927.1| unnamed protein product [Ostreococcus tauri]
            gi|116057380|emb|CAL51807.1| unnamed protein product
            [Ostreococcus tauri]
          Length = 561

 Score =  208 bits (530), Expect = 6e-51
 Identities = 157/531 (29%), Positives = 250/531 (47%), Gaps = 32/531 (6%)
 Frame = -3

Query: 1783 TQHFTLKG-GQHMLGLDGDWVFSWCM---DGRFYEKFDSQDITFECGYD-GNNGH-WHVD 1622
            ++ +TL G GQH LGL       WC+   D  F E+  + ++ +  G   G  G  W VD
Sbjct: 58   SRSYTLWGRGQH-LGLPA----LWCLRTDDEAFVEETMNAELCYVSGRSRGREGSCWEVD 112

Query: 1621 HTGRVTAVELDDDEVTQLCVYLRTGFWLTEKGQN-LLNIQFEGEGNFDAIKKKTGEIKFA 1445
             +G    +ELDD E   L  + R+GFW T +  N +L ++   +G    + K    ++  
Sbjct: 113  FSGYAQRLELDDAEAATLGAWTRSGFWATRQCANEMLEMELLDDGREGGVCKIEMRLR-- 170

Query: 1444 ICLKNKKIVAHLHVDACQWLPLSMELKVLGQTERWEYKDWRTIDDCLKHRYPYLCVRYPG 1265
               +  ++   L +DA  W P  + + V G  + W ++DW T D      YP        
Sbjct: 171  ---RGGRVRGELELDASTWKPRRLRVAVCGDEDEWTFEDW-TRDATSGLPYPRTTRLRGA 226

Query: 1264 AGGKDTFTVAHSVLHNGVEDTAGNTIDRGLPYRYPDTLLVPRHNQFYPRVSIDAACPSAV 1085
             GG   F V  + +    E        RG+ +  P        N+  P V+ D++    +
Sbjct: 227  NGGIQEFIVTGAAVGRNKEG-------RGM-FEKPT-------NEKPPTVNFDSSAAPDI 271

Query: 1084 KMLRAKGGQLLVKPHVDGSDIGYFILDTGAGGLAISPSLADKLKMGAFGEVFVSGFHGHK 905
             +LRA    +LV+P +DG  +G FILDTGA GL I+P  A  L + AFGEV VSG  G  
Sbjct: 272  PVLRAGSSHVLVEPMIDGESVGSFILDTGASGLVITPKAAQTLGLRAFGEVHVSGVSGRV 331

Query: 904  KSHYRRGNSFQLGPLKIESPLFVEVDTASL-NIGPHPIVGVCGFDVFQQGIVKLSCREES 728
               +RRG   +LGPL ++ P+F+E+    +    P P+ G+ GFD F+  I+ +S     
Sbjct: 332  PCQFRRGKDLKLGPLTLKKPVFMEMRLDGIVTKPPKPVAGIIGFDAFKSAILIVSEEGRR 391

Query: 727  ISIFDSKEFERDSEYSLEWEKLAFIQNVPNIAAKINGQN-------ALLLFDTGASGVDV 569
            + I+D+ +     E S  W++L  + NVP++AA  +G N        + + D+GA G DV
Sbjct: 392  VQIYDADDKSAIDE-SWAWQELRLVSNVPHVAATFSGVNDSQKLEPNIFMIDSGAGGADV 450

Query: 568  IFHSES--DIYTNTQL--------ERSTNIRGINSQGLRQQVSDMVINRVEIANH----- 434
            IFH+++  D+     L         R   + G +  G         ++ +++A+      
Sbjct: 451  IFHAKAVKDLGLTELLGPEGGRISSRVRGVSGADGAGSSTLTYRTTMDWLQLASADANGT 510

Query: 433  --VFKNVHAICLSSETKLPLSEYTSGILCMDMIENFVIIIDYPNRRIALVD 287
               F  +  +  S E    LSE++ G++C  ++    I+ D P RRIA VD
Sbjct: 511  TVRFDEIDTLLASGE-GFSLSEHSCGMICATLLRKKRIVYDVPRRRIAFVD 560


>ref|XP_002503729.1| predicted protein [Micromonas sp. RCC299] gi|226518996|gb|ACO64987.1|
            predicted protein [Micromonas sp. RCC299]
          Length = 720

 Score =  202 bits (515), Expect = 3e-49
 Identities = 134/444 (30%), Positives = 213/444 (47%), Gaps = 38/444 (8%)
 Frame = -3

Query: 1705 GRFYEKFDSQDITFECGY-DGNNGHWHVDHTGRVTAVELDDDEVTQLCVYLRTGFWLTEK 1529
            G F E+    D+++   +  G +  W  DHTG    V+LDD E + L  + RTG+W    
Sbjct: 125  GSFAEEAAGPDLSYVSTHCAGADEVWETDHTGYTQVVQLDDHEASLLAAWARTGWWAHPD 184

Query: 1528 GQNLLNIQFEGEGNFDAIKKKTG----EIKFAICLK-NKKIVAHLHVDACQWLPLSMELK 1364
             ++ + ++  G    DA   +      E +  I L+    IVA ++VD  +WLP  M ++
Sbjct: 185  AKHDVELELGGGAAGDAADDEARPAPHECEVTIRLRAGGLIVASMYVDTREWLPTRMRMR 244

Query: 1363 VLGQTERWEYKDWRTIDDC-----LKHRYPYLCVRYPGAGGKDTFTVAHSVLHNGVEDTA 1199
            V G  E W Y+DW+ +            YP L      AGG  TF    +  ++GV    
Sbjct: 245  VCGDDESWFYEDWKNVGGSNASAGAAKPYPALTTLKGAAGGTQTFRTDGARANSGVGSGY 304

Query: 1198 GNTIDRGLPYRYPDTLLVPRHNQFYPR----------------VSIDAACPSAVKMLRAK 1067
                     YR P TL       F  +                V  DA     V +  AK
Sbjct: 305  ---------YRRPGTLPGDAFGSFPAKTQLAATDEDGFDNAGSVRFDAKKRPEVNIEEAK 355

Query: 1066 GGQLLVKPHVDGSDIGYFILDTGAGGLAISPSLADKLKMGAFGEVFVSGFHGHKKSHYRR 887
               +LV+P +DG D+G FILDTGA GL IS + A++L +  FGEV+VSG  G    ++RR
Sbjct: 356  SSHVLVRPLIDGVDVGPFILDTGASGLVISGAAAERLNLRKFGEVWVSGVAGKVPCNFRR 415

Query: 886  GNSFQLGPLKIESPLFVEVDTASLNIG-PHPIVGVCGFDVFQQGIVKLSCREESISIFDS 710
            G++ +LGP+ I++P+F+E+    +  G   P+ G+ GFDVF+  I+++      + ++D 
Sbjct: 416  GDTLELGPITIDAPVFMEMSVGGIVSGSSEPVAGIVGFDVFKSAILEVGPGGSPVRLYDP 475

Query: 709  KEFERDSEYSLEWEKLAFIQNVPNIAAKI----NGQNALLLFDTGASGVDVIFHSESDIY 542
            ++F      + +W+ L  + NVP++AA       G+  + + D+GA G D IFH+ +   
Sbjct: 476  QKF--IPPVTWDWKPLLMVSNVPHVAASFAGAPRGKPQIFMIDSGAGGADCIFHARAVRE 533

Query: 541  TNTQ------LERSTNIRGINSQG 488
             N +         S+ +RG+   G
Sbjct: 534  MNLEKLLPGTKRASSRVRGVGGSG 557


>ref|XP_003059314.1| predicted protein [Micromonas pusilla CCMP1545]
            gi|226459150|gb|EEH56446.1| predicted protein [Micromonas
            pusilla CCMP1545]
          Length = 775

 Score =  190 bits (483), Expect = 2e-45
 Identities = 132/444 (29%), Positives = 208/444 (46%), Gaps = 38/444 (8%)
 Frame = -3

Query: 1717 WCM--DGR-FYEKFDSQDITFECGY-DGNNGHWHVDHTGRVTAVELDDDEVTQLCVYLRT 1550
            WCM  DG  F E+     +++  G+ +G    W  D +G    ++LDD E   L  + RT
Sbjct: 165  WCMRHDGASFAEEAVGPQLSYASGHAEGEREVWETDFSGYTQTLQLDDHEAALLAGWTRT 224

Query: 1549 GFWLTEKGQNLLNIQF--------EGEGNFDAIKKKTGEIKFAICLKNKKIVA-HLHVDA 1397
            G+W+T      L ++         E E +        G +  ++ L++  ++A  + +D 
Sbjct: 225  GWWVTADASRALELELVEEEEEEEEEEEDVSDEDSAAGVVTVSLRLRDGGVIAARVAIDT 284

Query: 1396 CQWLPLSMELKVLGQTERWEYKDWRTIDDCLKHR------------YPYLCVRYPGAGGK 1253
              WLP+ M+L V G  E W ++ W    D                 YP        AGG 
Sbjct: 285  TTWLPVGMKLNVCGDEEAWTFEGWHDALDGPSSSSSGSSERRGGMLYPRTTTLRGAAGGT 344

Query: 1252 DTFTVAHSVLHNGVEDTAGNTIDRGLPYRYPDTLLVPRHNQFYPRVSIDAACPSAVKMLR 1073
             TFT   +  + GV+           P  +   +   R +     V  +A  P AV +  
Sbjct: 345  QTFTQTGARANAGVK-----------PSWFQKPIGENRGS-----VRFNANLPPAVPIES 388

Query: 1072 AKGGQLLVKPHVDGSDIGYFILDTGAGGLAISPSLADKLKMGAFGEVFVSGFHGHKKSHY 893
            A+   +LV+P +DG D+G FILDTGA GL IS   A +LK+ AFGEVFVSG  G     +
Sbjct: 389  ARSSHVLVRPTIDGLDVGPFILDTGASGLVISA--ASRLKLLAFGEVFVSGVAGKVPCRF 446

Query: 892  RRGNSFQLGPLKIESPLFVEVDTASLNIG-PHPIVGVCGFDVFQQGIVKLSCREESISIF 716
            RR    +LGP+ ++ P+F+E+D   +  G    + G+ GFDVF+  ++++S R E + + 
Sbjct: 447  RRAKKLRLGPITVDRPVFMEMDVGGIVSGSAETVAGIVGFDVFKSAVLEVSPRGEEVYLH 506

Query: 715  DSKEFERDSEYSLEWEKLAFIQNVPNIAAKINGQNA----LLLFDTGASGVDVIFHSESD 548
            D   F   + +   W  L  + NVP++AA  +G       + + D+GA G DVIFH+ + 
Sbjct: 507  DPSTFVAPAAW--RWNPLLMVSNVPHVAATFSGAPGGAPQIFMIDSGAGGADVIFHARAV 564

Query: 547  --------IYTNTQLERSTNIRGI 500
                    +    Q  R T++RG+
Sbjct: 565  KRLRLDRLLPPPGQERRGTSVRGV 588


>emb|CCO17636.1| predicted protein [Bathycoccus prasinos]
          Length = 653

 Score =  186 bits (472), Expect = 3e-44
 Identities = 134/526 (25%), Positives = 252/526 (47%), Gaps = 35/526 (6%)
 Frame = -3

Query: 1759 GQHMLGLDGDWVFSWCMDGRFYEKFDSQDITFECGYDGN-NGHWHVDHTGRVTAVELDDD 1583
            G+H LG+D  W  ++  D  F ++  +  + +  G+D   N  W  D +G    ++LDD 
Sbjct: 140  GEH-LGMDVRWKLAYA-DENFIDECVNPHLAYVSGFDAERNEVWETDFSGYTQTLDLDDR 197

Query: 1582 EVTQLCVYLRTGFWLTE---KGQNLLNIQFEGEGNFDAIKKKTGEIKFAICLKNKK-IVA 1415
            E   L  ++RTG+W++    + + LL + +  + N         E   A+ LK+   IVA
Sbjct: 198  EAALLATWIRTGYWVSRDCLEMKGLLEVTYVKDTN-------ENERVVAVRLKDDGLIVA 250

Query: 1414 HLHVDACQWLPLSMELKVLGQTERWEYKDWRTIDDCLKHRYPYLCVRYPGAGGKDTFTVA 1235
            ++ +    +LP  +++K  G  E W+Y  W+      +  +   C     +G    F  A
Sbjct: 251  NVFLCKETYLPKKVQIKCCGSVETWKYSRWKAYQHG-QFMFAETCEIIGSSGSTQRFD-A 308

Query: 1234 HSVLHNGVEDTAGNTIDRGLPYRYPDTLLVPRHNQFYPRVSIDAACPSA---------VK 1082
                      T  +      P +  D+ L+        RV   +   S+         V+
Sbjct: 309  QGYRAKSSSKTFSS------PEKRFDSELISIEKDDDNRVGESSGSSSSSSSSSSKYNVE 362

Query: 1081 MLRAKGGQLLVKPHVDGSDIGYFILDTGAGGLAISPSLADKLKMGAFGEVFVSGFHGHKK 902
            +++     +LV+P+++G D+G FILDTGA GL +   +AD L +  FGEV VSG     K
Sbjct: 363  VVKCSSDHVLVRPYINGRDVGPFILDTGASGLVLDQRVADDLDLATFGEVHVSGVSTKVK 422

Query: 901  SHYRRGNSFQLGPLKIESPLFVEVDTASLNIG-PHPIVGVCGFDVFQQGIVKLSC-REES 728
              +RR    ++G LKIE P+F+++D + +  G    + G+ GFD F+  IV +S   +++
Sbjct: 423  CAFRRAKELKIGKLKIEKPVFMQMDASGVVSGCSERVAGIIGFDAFKSSIVDVSSGNDKT 482

Query: 727  ISIFDSKEFERDSEYSLEWEKLAFIQNVPNIAAKINGQN------ALLLFDTGASGVDVI 566
            + I+    F+ +      W+ ++ + NVP++ A+ +G+        + + D+GA G DVI
Sbjct: 483  VHIYPRGYFDAN---DWPWQNVSIVSNVPHLKARFSGKGNHQTKLRMFMVDSGAGGADVI 539

Query: 565  FHS--------ESDIYTNTQLERSTNIRGINSQ----GLRQQVSDMVINRVEIANHVFK- 425
            FH         E+ + +  ++ R++ +RG++      G  ++     ++ +E  N   + 
Sbjct: 540  FHGRAVESLDLENALLSKNEVRRTSTVRGVSGSGGGGGGAEKCVKATLDWIEFENKGMRV 599

Query: 424  NVHAICLSSETKLPLSEYTSGILCMDMIENFVIIIDYPNRRIALVD 287
                  L++ +   LSE+  G++C +++ +  ++ D PNRR+ L +
Sbjct: 600  QELKTLLANGSGFDLSEFGVGMVCANVLNSRRVVYDMPNRRMCLFE 645


>ref|XP_001753862.1| predicted protein [Physcomitrella patens] gi|162694838|gb|EDQ81184.1|
            predicted protein [Physcomitrella patens]
          Length = 716

 Score =  169 bits (428), Expect = 4e-39
 Identities = 98/323 (30%), Positives = 162/323 (50%), Gaps = 28/323 (8%)
 Frame = -3

Query: 1858 EEEALALKECLHNLSLSTGVHYTKPTQHFTLKGGQHMLGLDGDWVFSWCMD-GRFYEKFD 1682
            EE+  A ++C+  +  + GV  +     F + G Q +L ++G W  +W  D GRF+E+FD
Sbjct: 92   EEDEEAERDCIERIRAAIGVDESFYGTAFWISGTQTLLAMEGKWQLNWRNDDGRFWEQFD 151

Query: 1681 SQDITFECGYDGNNGH-WHVDHTGRVTAVELDDDEVTQLCVYLRTGFWLTEKGQNLLNIQ 1505
              ++T ECG+DG   + W V   G  + +ELDD EV  L  ++RTG+W+TE  +  ++++
Sbjct: 152  GVEMTLECGFDGEGQNSWAVGAAGIRSTLELDDREVCLLGAWIRTGYWVTEAARTQIHVR 211

Query: 1504 FEGEGNFDAIKKKTGEIKFAICLKNKKIVAHLHVDACQWLPLSMELKVLGQTERWEYKDW 1325
                          GEI  ++ L++ K+VA++ VDA    PL   +K  G   +W+Y DW
Sbjct: 212  LVQGDVAGVAMASWGEITLSVQLRDCKVVAYVVVDATTMYPLRASMKAFGGWNKWQYSDW 271

Query: 1324 RTIDDCLKHRYPYLCVRYPGAGGKDTFTVAHSVLHNGVEDTAGNTIDRGLPY--RYPDTL 1151
            + +     + +P+ C     +G ++ ++V        +++     + R  P+   +   L
Sbjct: 272  KPVVAGQGYLFPFSCTYTQASGNEEVYSVVEICSVESIDE----VVARESPFSSHFLSNL 327

Query: 1150 LV---PRHNQFYPRVSIDAACPSAVKMLRAKGGQLLVKPHVDGSDIG------------- 1019
            L+   PR    +P V  D++CP  V+MLR   G  LV+P V+G  +G             
Sbjct: 328  LIVPRPRGQNNHPHVDFDSSCPPDVQMLRTDSGHYLVQPLVNGKTVGLFSVVYLGLYNIN 387

Query: 1018 --------YFILDTGAGGLAISP 974
                    YFI+DTGA GL ISP
Sbjct: 388  SLLVSIYSYFIVDTGASGLIISP 410



 Score =  164 bits (414), Expect = 2e-37
 Identities = 84/230 (36%), Positives = 135/230 (58%), Gaps = 5/230 (2%)
 Frame = -3

Query: 967  ADKLKMGAFGEVFVSGFHGHKKSHYRRGNSFQLGPLKIESPLFVEVDTASLNIGPHPIVG 788
            A +L +  FGE+ ++G +G  KS Y R N+FQLGPL+I +P+F+E+  + +  G   +VG
Sbjct: 483  ARELNLTTFGEIHITGVNGVVKSQYCRANTFQLGPLRITNPMFLEIPVSGIVRGDPLVVG 542

Query: 787  VCGFDVFQQGIVKLSCREESISIFDSKEFERDSEYSLEWEKLAFIQNVPNIAAKINGQNA 608
            +CGFD+F Q IV+++  E  +S+FD   +      SL+W  L F++N+P+++A  NG +A
Sbjct: 543  MCGFDMFAQCIVEMAHDEGRLSLFDPALYSISCSKSLQWHTLRFLENIPHVSATFNGHDA 602

Query: 607  LLLFDTGASGVDVIFH----SESDIYTNTQLERSTNIRGINSQGLRQQVSDMVINRVEIA 440
            L L DTGA GVDVIFH     E  +    Q+E    + GIN+ G   +V    +N++ +A
Sbjct: 603  LFLIDTGAGGVDVIFHKRAVEEFGLLKLVQVETYAELMGINASGKGVEVILGTLNQLAVA 662

Query: 439  NHVFKNVHAICLSSET-KLPLSEYTSGILCMDMIENFVIIIDYPNRRIAL 293
               FK    +        L  SEY +G+LC D++    ++ +Y  R++A+
Sbjct: 663  GKPFKQTTTMLARENVGNLDTSEYVAGLLCGDLLTKCRVVFNYAQRQLAI 712


>ref|XP_005850298.1| hypothetical protein CHLNCDRAFT_142045 [Chlorella variabilis]
            gi|307109959|gb|EFN58196.1| hypothetical protein
            CHLNCDRAFT_142045 [Chlorella variabilis]
          Length = 698

 Score =  128 bits (321), Expect = 1e-26
 Identities = 98/329 (29%), Positives = 144/329 (43%), Gaps = 25/329 (7%)
 Frame = -3

Query: 1657 GYDGNNGH-WHVDHTGRVTAVELDDDEVTQLCVYLRTGFWLTEKGQNLLNIQFEG----E 1493
            GY G  G  W  D  G    ++LDD+E   L  ++RTG W+  +    L+I        +
Sbjct: 62   GYFGTAGVCWASDDAGCPRVMQLDDEEALLLSTWVRTGTWVLPRVAAHLHIAAVAAPAQK 121

Query: 1492 GNFDAIKKKTGEIKFA-ICLKNK-KIVAHLHVDACQWLPLSMELKVLGQTERWEYKDWRT 1319
            G   A K + G   +  + L+   ++V  L +    WLP SM L++ G  E WE  DWR 
Sbjct: 122  GTAKAAKAQGGSTAWVELRLRGGGQVVVWLELCTTSWLPRSMRLRLAGDEEVWELSDWRE 181

Query: 1318 IDDCLKHRYPYLCVRYPGAGGKDTFTVAHSVLHNGV---------------EDTAGNTID 1184
                L              G  +T+      L +                   + G+   
Sbjct: 182  WQPGLWLAGTATQSSSEEGGVMNTYHACSVTLQDAPLSLLPAMESGGASSSGSSGGSRSS 241

Query: 1183 RGLPYRYPDTLLVPRHNQFYPRVSIDAACPSAVKMLRAKGGQLLVKPHVDGSDIGYFILD 1004
             G  +  P   L+P  + F P V      P  V       G  LV+P +DG  +GYFI D
Sbjct: 242  SGSRFTPPQAPLMPDDSSFLPGV------PEEVPAWYTISGHSLVQPLLDGRPVGYFIFD 295

Query: 1003 TGAGGLAISPSLADKLKMGAFGEVFVSGFHGHKKSHYRRGNSFQLGPLKIESPLFVEVDT 824
            TGA G  + P++A+ L + AFGE+ V+   G   S +RR  SFQLGPL+IE PL +E+  
Sbjct: 296  TGASGFVLDPAVAEALGLEAFGELQVTSMVGKVASRFRRAASFQLGPLRIERPLLMELPC 355

Query: 823  ASLNIG---PHPIVGVCGFDVFQQGIVKL 746
              L  G      + G+ G DV ++ +V +
Sbjct: 356  TGLVAGLPEGGRVAGIVGHDVLRRALVHM 384


>ref|XP_002502719.1| predicted protein [Micromonas sp. RCC299] gi|226517985|gb|ACO63977.1|
            predicted protein [Micromonas sp. RCC299]
          Length = 772

 Score = 96.7 bits (239), Expect = 3e-17
 Identities = 109/445 (24%), Positives = 175/445 (39%), Gaps = 78/445 (17%)
 Frame = -3

Query: 1390 WLPLSMELKVLGQTERWEYKDWRTIDDCLKHRYPYLCVRYPGAGGKDTFTVAHSVLHNGV 1211
            W+P   EL      E W + +W   D  L    P +  +   AG    +    +      
Sbjct: 331  WVPHRCELATPEGVEAWVFGEWSRSDVGLL--VPGVSHQTHPAGNSCVYRAKRTASILDG 388

Query: 1210 EDTAGNTIDRGLPYR-YP---DTLLVPRHNQFYPRVSIDAACPSAVKMLRAKGGQLLVKP 1043
            E ++     +  PY  YP   D   VP ++     V+  AA   AV   R +GG +LV P
Sbjct: 389  EKSSSKIFSQ--PYEGYPTDDDRWPVPTYSLNDAGVTEAAA---AVPAARGEGGHVLVAP 443

Query: 1042 HVDG-SDIGYFILDTGAGGLAISPSLADKLKMGAFGEVFVSGFHGHKKS-HYRRGNSFQL 869
             ++G +  G+F+LDT   G AI PS+AD+L M AFGE+ V G      S   RRG +   
Sbjct: 444  RLNGQASPGWFVLDTSCAGFAIEPSVADRLGMPAFGELSVVGVSAAALSGAMRRGTTISA 503

Query: 868  GPLKIESPLFVEVD-TASLNIGPHPI----------VGVCGFDVFQQGIVKLSCREE--- 731
            G + + +P+++E    A+L   P P+          VGV G D  Q  +V+L        
Sbjct: 504  GCVDVPAPVYMEQSLAAALRTPPAPVGSDPSAGGGLVGVLGTDFLQHCVVELRAPRRVPG 563

Query: 730  -------SISIFDSKEFERDSEYSLEWEKLAFIQNVPNIAAKI----------------- 623
                    +  FD  ++E        W+++ +IQ VP++  K+                 
Sbjct: 564  SPTAPSFDVFCFDPAKYEPTDRVRAAWQRVEWIQGVPHVRVKLTVADDGLTPEPSLEQKL 623

Query: 622  ---------------------NGQNALLLFDTGASGVDVIFHSESDIYTNTQLERSTNIR 506
                                   +  L     GA G   I  + +       +ER+  ++
Sbjct: 624  PRGGPQAGTGAGPENPATDGSGWEGRLFRLSLGAGGAGAIVSARA-AKEWKMVERTVGLQ 682

Query: 505  G---INSQGLRQQVSDMV--------INRVEIANHVFKNVHAICLSS--ETKLPLSEYTS 365
                ++  G  +    MV        + RVE A   F+ V A+  +S     L LS +  
Sbjct: 683  PGGVMSGPGEERSRLAMVEPEVVTGRLRRVEFAGGRFETVRALTHTSGDPPDLALSPHAD 742

Query: 364  GILCMDMIENFVIIIDYPNRRIALV 290
            G LC D+     +++D+   RIA++
Sbjct: 743  GALCADLFRGCTLVLDFGRNRIAVL 767


>ref|XP_001421028.1| predicted protein [Ostreococcus lucimarinus CCE9901]
            gi|144581264|gb|ABO99321.1| predicted protein
            [Ostreococcus lucimarinus CCE9901]
          Length = 621

 Score = 92.4 bits (228), Expect = 6e-16
 Identities = 132/600 (22%), Positives = 209/600 (34%), Gaps = 107/600 (17%)
 Frame = -3

Query: 1771 TLKGGQHMLGLDGDWVFSWCMDGR---FYEKF------DSQDITFECGYDGNNGHWHVDH 1619
            T +G     GLDG   F    D R   F E+       D+  +T E G D +   W  D 
Sbjct: 35   TTRGTCTAHGLDG--AFELTHDARGTTFVERVTLARGDDADALTTESGRDDDGTTWDADW 92

Query: 1618 TGRVTAVELDDDEVTQLCVYLRTGFWLTE------------------------------- 1532
             G   A  LD    + +  ++RT  W  E                               
Sbjct: 93   YGDRRATTLDGAHASAMMTWVRTNQWNDELAAAGRLRRKKLAEGVEATAALRRYGVEAPT 152

Query: 1531 ----------KGQNLLNIQFEGEGNFDA--------IKKKTGEIKFAICLKNKKIVAHLH 1406
                      + +      F G G F A        + +    +     L   ++   + 
Sbjct: 153  PAKVEKRGRGRAKRRTGRGFGGFGGFGASSDARAPRVPRCATSVYAVSMLDGGRVGGRVF 212

Query: 1405 VDACQWLPLSMELKVLGQTERWEYKDWR--TIDDCLKHRYPYLCVRYPGAGGKDTFTVAH 1232
            VD         E       E W ++ W    + +  +   P L  R    G   TF    
Sbjct: 213  VDEATGFAWRAEFYHQRGVETWMFEGWERVAVSETDEVAVPRLAHRTHAEGQVTTFRA-- 270

Query: 1231 SVLHNGVEDTAGNTIDRGLPYRYPDTLLVPRHNQFYPRVSIDAACPSAVKMLRAKGGQLL 1052
                   E T+  + DR      P + + P  ++        +   + +   R +GG +L
Sbjct: 271  -------ESTSSASEDRETYAALPSSSVAPWASE-----RAWSGDDATIATCRGEGGHIL 318

Query: 1051 VKPHVDGSDIG------YFILDTGAGGLAISPSLADKLKMGAFGEVFVSGFHGHKKSHYR 890
            VK  ++ SD G      +F+LDT + GLA++P +AD + M +FG + + G     +   R
Sbjct: 319  VKATLESSDDGGASLTDWFVLDTASTGLAVAPHVADAVSMPSFGSMAIVGVAAPLEGALR 378

Query: 889  RGNSFQLGPLKIESPLFVE--VDTASLNIGPHPIVGVCGFDVF-QQGIVKLSC------- 740
            RG    +G L + SP+F+E  +D A        + GV G  V     IV++         
Sbjct: 379  RGKKLSVGALSLSSPIFMEQNLDGALRVPNGERLAGVIGLSVLGAHAIVRIHAPLRVPGS 438

Query: 739  ---REESISIFDSKEFERDSEYSLEWEKLAFIQNVPNIAAKINGQN-------------- 611
                +  + +F  + +E  +E    WE + FI  VP +       N              
Sbjct: 439  RDPPKLDVRVFRPEAYEPSAEIERAWEPVVFIDGVPYVECSYTIANDGFQGVTEMTTERR 498

Query: 610  ALLLFDTGASGVDVIFHS----ESDIYTNTQLERSTNIRG--INSQGLRQQVSDMV---- 461
             L     G  GV V+       E+D+   T+  +   I      S G  Q+V D +    
Sbjct: 499  GLFKLALGTGGVGVVLGDRVAVEADVANRTKALQPGGIMSGPGESAGRLQRVGDEIVTGR 558

Query: 460  INRVEIANHVFKNVHAICL--SSETKLPLSEYTSGILCMDMIE--NFVIIIDYPNRRIAL 293
            I  V   +  FKNV A+     +     LS +  G  C D+     FV+ +   N RIA+
Sbjct: 559  IETVRFKSFEFKNVRAVVHLDGAPPDADLSPHADGAACADLFRGCEFVLDLRPSNPRIAV 618


>ref|XP_003082486.1| unnamed protein product [Ostreococcus tauri]
            gi|116060955|emb|CAL56343.1| unnamed protein product
            [Ostreococcus tauri]
          Length = 562

 Score = 88.2 bits (217), Expect = 1e-14
 Identities = 94/441 (21%), Positives = 163/441 (36%), Gaps = 42/441 (9%)
 Frame = -3

Query: 1486 FDAIKKKTGEIKFAICLKNKKIVAHLHVDACQWLPLSMELKVLGQTERWEYKDWRTI--D 1313
            FD   K    +     L   KI A + VD     P   E       E W +++W  +  D
Sbjct: 138  FDHAPKCATSVYAMTVLDGGKIGARVFVDESTKKPWRAEFYHQRGVETWTFEEWDDVKLD 197

Query: 1312 DCLKHRYPYLCVRYPGAGGKDTFTVAHSVLHNGVEDTAGNTIDRGLPYRYPDTLLVPRHN 1133
            D      P +  R    G    +          + +T   T D    +  P +      +
Sbjct: 198  DGTVASTPSVARRKNSEGQITVY----------LSETTTATTDGDASFAMPASTSTSERS 247

Query: 1132 QFYPRVSIDAACPSAVKMLRAKGGQLLVKPHVDGSDIGYFILDTGAGGLAISPSLADKLK 953
                     ++  S +  +R  GG +LVK  ++  D G+F+LDT + GL I+P  AD   
Sbjct: 248  W--------SSKSSTLSAVRGDGGHVLVKATLESGDEGWFVLDTASTGLVIAPWAADSCS 299

Query: 952  MGAFGEVFVSGFHGHKKSHYRRGNSFQLGPLKIESPLFVE--VDTASLNIGPHPIVGVCG 779
            M +FG + V+      +   RRG    +G  ++ SP+F+E  +D A        + GV G
Sbjct: 300  MPSFGSMAVASIAAPLEGALRRGREISIGSFRMMSPIFMEQNLDGALRVPDGQRLTGVLG 359

Query: 778  FDVFQQGIVKL----------SCREESISIFDSKEFERDSEYSLEWEKLAFIQNVPN--- 638
                   IV L             + ++   D + +    E    W+ + FI  VP+   
Sbjct: 360  VPALAHSIVCLHAPMRVPGSRDAPKLTVEFHDPESYVPSGEIERAWQDVTFIDGVPHVQL 419

Query: 637  -----------IAAKINGQNALLLFDTGASGVDVIFHS----ESDIYTNTQLERSTNIRG 503
                       + A  + +  L     G  G   +  +    E+++   T+  +   +  
Sbjct: 420  AYTVANDGFKGVTAMTDERVGLFKLSLGTGGTGAVLSARVAKEAELAERTKALQPGGVMS 479

Query: 502  --INSQGLRQQVSDMVI----NRVEIANHVFKNVHAICL--SSETKLPLSEYTSGILCMD 347
                S G  Q+V D ++      +      FKNV A+           +S +  G +C+D
Sbjct: 480  GPGESAGRLQRVGDEIVTGRMETIRFKGFEFKNVRAVVHLDGDPPDADISPHADGAVCVD 539

Query: 346  MIENFVIIIDYPNR--RIALV 290
            +     +++D  N   R+A+V
Sbjct: 540  LFRGCELVLDLRNAQPRVAVV 560


>ref|XP_003061086.1| predicted protein [Micromonas pusilla CCMP1545]
            gi|226457437|gb|EEH54736.1| predicted protein [Micromonas
            pusilla CCMP1545]
          Length = 800

 Score = 80.9 bits (198), Expect = 2e-12
 Identities = 103/491 (20%), Positives = 180/491 (36%), Gaps = 95/491 (19%)
 Frame = -3

Query: 1471 KKTGEIKFAICLKNKKIVAHLHVDACQ---WLPLSMELKVLGQTERWEYKDWRTIDDCLK 1301
            K+      A+ ++  KI A L ++      W+P  +E+      E W + +W        
Sbjct: 307  KRANGALVALAVRGGKIGARLFLEEAADLGWVPRRLEVATPEGPEVWHFAEWTREVRANG 366

Query: 1300 HRYPYLCVRYPGAGGK------DTFTVAHSVLHNGVEDTAGNTIDRGLPYRYPDTLLVPR 1139
               P +  +   AG        D  T   +        +   +      +  P   L P 
Sbjct: 367  LLLPRVAHQTLPAGNTCAYELTDAATAKTTTTSTSTSTSTSTSTSTSSVFAPPYLGLPPS 426

Query: 1138 HNQF-YPRVSIDAACPSAVKML-RAKGGQLLVKPHVDG---------------------- 1031
             +   +P  S   +  +A  +  R +GG +LV P +                        
Sbjct: 427  SDDCSWPPTSYATSRDAAEAICARGEGGHVLVVPELSAPAGVGGGGGAGAGGRGSSGTSC 486

Query: 1030 -SDIGYFILDTGAGGLAISPSLADKLKMGAFGEVFVSGFHGHK-KSHYRRGNSFQLGPLK 857
             +  G+F+LDT   G AI PS+AD L M +FGE+ V G      +   RRG +  LG + 
Sbjct: 487  SARPGWFVLDTSCAGYAIEPSVADALAMPSFGELAVVGVSAAALRGSMRRGGALTLGAVT 546

Query: 856  IESPLFVE------VDTASL------NIGPHPIVGVCGFDVFQQGIVKLSCREESISIFD 713
              +PLF+E      + T SL      +I    +VGV G D  Q+ +V++  +       D
Sbjct: 547  ARAPLFMEQALAAALRTPSLGAIAGGDIDNASLVGVLGTDFMQRAVVEIRAKRRVPGSPD 606

Query: 712  SK----------EFERDSEYSLEWEKLAFIQNVPNIAAKINGQN---------------- 611
                         +      +  W+++ +I  VP++  K+   N                
Sbjct: 607  PATTKAFFHAPGAYAPSDRVAASWQRVEWIGGVPHLRVKVTVANDRITSLEPPPEGAASN 666

Query: 610  ---------ALLLFDTGASGVDVIFHSESDIYTNTQLERSTNIR--------GINSQGLR 482
                      L     GA G  V+  + +       +ER+  I         G +   L 
Sbjct: 667  KAEKDGWTGRLFRLSLGAGGAGVVV-AAAAAREWRMVERTVGIHPGGVMSGPGEDRARLA 725

Query: 481  QQVSDMVINR---VEIANHVFKNVHAI--CLSSETKLPLSEYTSGILCMDMIENFVIIID 317
            +   ++V  R   +E+    F+N+ A+         L LS +  G+LC D+    V+++D
Sbjct: 726  RVEPEVVTGRFDAIELKGATFRNIRALTHLNGDPPDLALSPHADGVLCADLFRGCVVVLD 785

Query: 316  YPNRRIALVDN 284
            +   R+A+V +
Sbjct: 786  FSRDRVAVVQD 796


>ref|XP_002955892.1| hypothetical protein VOLCADRAFT_96785 [Volvox carteri f. nagariensis]
            gi|300258860|gb|EFJ43093.1| hypothetical protein
            VOLCADRAFT_96785 [Volvox carteri f. nagariensis]
          Length = 652

 Score = 73.2 bits (178), Expect = 4e-10
 Identities = 99/473 (20%), Positives = 162/473 (34%), Gaps = 81/473 (17%)
 Frame = -3

Query: 1897 IRAQQKTKATNVQEEEALALKECLHNLSLSTGVHYTKPTQHFTLKGGQHMLGLDGDWVFS 1718
            ++A+  T        +++AL+  L  L  + G+    P     + G    LG+   W   
Sbjct: 41   LKARASTLEDRTGSRQSVALEHVLEELRKAQGIDKLGPDVEAVVSGRGKHLGIAVSWSLR 100

Query: 1717 WCMDGRFYEKFDSQDITFECGYDG--NNGHWH--------------VDHTGRVTAVELDD 1586
            W   G F E+     +TF+ GYDG  + G W               VD  G V  +E DD
Sbjct: 101  WTASGSFTEEIRGPQLTFKWGYDGRKDGGCWEDLIVLDPDLAFAVVVDSAGLVRVMECDD 160

Query: 1585 DEVTQLCVYLRT---------GFWLTEKGQNLLNIQF-EGEGNFDAIKKKTGEIKFAICL 1436
             E   +  Y+RT         G   T +G+    +    G     A  +    +   + +
Sbjct: 161  HEAVLMTTYIRTATHISSGGSGSTTTARGRKTAAVALTHGAPGSPAADE---VVAIGLRV 217

Query: 1435 KNKKIVAHLHVDACQWLPLSMELKVLGQTERWEYKDWRTIDDCLKHRYPYLCVRYPGAGG 1256
            + +K+   + V    W  L    ++    E WE  +W+   + +   YP   +     GG
Sbjct: 218  RGRKLRVRVFVRRRDWRLLGFNHRMCADVEMWELGEWKEWAEGVS--YPSSALHKASNGG 275

Query: 1255 KDTFTVAHSVLHNGVEDTAGNTIDRGLPYRYPDTLLVPRHNQFYPRVSIDAACPSAVKML 1076
            +      H+ L N          +R  P   P  LL+       P   +  + P   ++ 
Sbjct: 276  Q------HAYLTN------SGAAERVAPLPPPPPLLL-----LQPPEPLRTSPPPLPRVA 318

Query: 1075 RAKGGQLLVKPHVD-------GSDIGYFIL-----DT-----GAGG-------------- 989
             A+ G L   P           +D G  +L     DT     G GG              
Sbjct: 319  SAEAGSLPSPPPPQQLLQLPPPTDFGLPLLPPIPGDTSYSPPGGGGGQGVPLWRAASGHY 378

Query: 988  -----------------------LAISPSLADKLKMGAFGEVFVSGFHGHKKSHYRRGNS 878
                                     ++P  A +L   +FGE   +   G   + + R  S
Sbjct: 379  LVRPTINGQEGGGYFVVDTGASGFVLTPQAAARLGGQSFGETHAASISGKVAARFVRLGS 438

Query: 877  FQLGPLKIESPLFVEVDTASLNIG-PHPIVGVCGFDVFQQGIVKLSCREESIS 722
            + LG L I  P+ + +    L  G P  +VG+ G D+F +  V  +     IS
Sbjct: 439  WSLGGLSIRQPVLMVMALEGLVRGAPGEVVGIVGHDLFSRDGVATATATAHIS 491


>ref|WP_002701319.1| pdz domain (also known as dhr or glgf) protein [Microscilla marina]
            gi|123986490|gb|EAY26296.1| pdz domain (also known as dhr
            or glgf) protein [Microscilla marina ATCC 23134]
          Length = 383

 Score = 68.9 bits (167), Expect = 7e-09
 Identities = 73/290 (25%), Positives = 127/290 (43%), Gaps = 19/290 (6%)
 Frame = -3

Query: 1093 SAVKM-LRAKGGQLLVKPHVDGSDIGYFILDTGAGGLAISPSLADKLKMGAFGEVFVSGF 917
            +AV++ ++  G  + +K  V  S    FI DTGAGG  I+   A  LK+   G   V   
Sbjct: 14   AAVRLPMKMVGEHIFIKLKVGDSQHLNFIFDTGAGGTVINLRTAKNLKLMPSGFTPVKSA 73

Query: 916  HGHKKSHYRRGNSFQLGPLKIESPLFVEVDTASLNIGPHPIV-GVCGFDVFQQGIVKLSC 740
            H    + Y   N  +L  L +     +    + L      ++ G+ GF + ++ IV+++ 
Sbjct: 74   HSQDFTPYFANNELKLAHLSVPDVRLLGSSLSHLEARSGAVIDGIVGFAILKKYIVRINH 133

Query: 739  REESISIFDSKEFE---RDSEYSLEWEKLAFIQNVPNIAAKI---NGQNAL--LLFDTGA 584
                + +++  +++       Y + W        VP+I A I   NG+      L DTGA
Sbjct: 134  DAHQLELYNKAQYQFPKNAKAYDVSW-----WFPVPSIKASITLDNGEKVTGSFLIDTGA 188

Query: 583  SGVDVIFHSESDIYTNTQLERSTNIRGIN------SQGL-RQQVSDMV--INRVEIANHV 431
                    S S I T   + R   ++  N      SQGL   +V+D    +  +++ +  
Sbjct: 189  --------STSLILTTPFVNRHQLLKKFNKKLKHSSQGLTSSKVTDFKARLKGMKVGDFN 240

Query: 430  FKNVHAICLSSETKLPLSEYTSGILCMDMIENFVIIIDYPNRRIALVDNR 281
            F NV      ++  L  S+   GIL   +++ F +I+DY N+++ L  NR
Sbjct: 241  FSNVPVNLSRAKRGLLASDKIDGILGNAILKRFNLILDYSNQKLILEPNR 290


>ref|YP_005012075.1| PDZ/DHR/GLGF domain-containing protein [Niastella koreensis GR20-10]
            gi|503988588|ref|WP_014222582.1| signal protein PDZ
            [Niastella koreensis] gi|361063680|gb|AEW02672.1|
            PDZ/DHR/GLGF domain protein [Niastella koreensis GR20-10]
          Length = 399

 Score = 68.2 bits (165), Expect = 1e-08
 Identities = 66/271 (24%), Positives = 120/271 (44%), Gaps = 10/271 (3%)
 Frame = -3

Query: 1066 GGQLLVKPHVDG-SDIGYFILDTGAGGLAISPSLADKLKMGA-FGEVFVSGFHGHKKSHY 893
            GG +L+K  +    D   F+LDTG+GG+++  + AD LK+     +  + G  G ++  +
Sbjct: 42   GGIILIKARLSNFPDTLNFVLDTGSGGISLDSTTADYLKITTQLSDRTIRGIAGVRRVFF 101

Query: 892  RRGNSFQLGPLKIESPLF--VEVDTASLNIGPHPIVGVCGFDVFQQGIVKLSCREESISI 719
              G +  L  L +E+  F   + D  +   G   I G+ G     + I+K+     +IS+
Sbjct: 102  SYGQTLHLPNLAVENLDFHINDYDVLTSAYG-EKIDGIIGLSFLSRYILKIDYDSLAISV 160

Query: 718  FDSKEFERDSEYSLEWEKLAFIQNVPNIAAKINGQNAL---LLFDTGASG---VDVIFHS 557
            +    F+      L      FIQ +P + A I     L     FDTGA     +   F S
Sbjct: 161  YTKGTFKYPRGGFL---LKPFIQTIPILNANIKDNRPLASRFYFDTGAGMCLLLSADFVS 217

Query: 556  ESDIYTNTQLERSTNIRGINSQGLRQQVSDMVINRVEIANHVFKNVHAICLSSETKLPLS 377
            +S+     +   +T   G+   G +  +   VI ++++    FKNV       E  +   
Sbjct: 218  DSNFVKPKRKWYATQAEGL---GGKAPMKQGVIKQLQLGPFRFKNVPTYIFDDEFNVTQY 274

Query: 376  EYTSGILCMDMIENFVIIIDYPNRRIALVDN 284
               +G++  D++  F +I++Y  + I ++ N
Sbjct: 275  PALAGLVGNDLLRRFNLILNYERKDIYMIPN 305


>ref|WP_002698385.1| hypothetical protein [Microscilla marina] gi|123988568|gb|EAY28209.1|
            hypothetical protein M23134_03470 [Microscilla marina
            ATCC 23134]
          Length = 406

 Score = 65.9 bits (159), Expect = 6e-08
 Identities = 63/275 (22%), Positives = 126/275 (45%), Gaps = 15/275 (5%)
 Frame = -3

Query: 1057 LLVKPHVDGSDIGYFILDTGAGGLAIS-PSLADKLKMGAFGEVFVSGFHGHK--KSHYRR 887
            +++   ++GS    FILDTG G   I+ PS+A  L +  F ++ V+G       K+H   
Sbjct: 37   IIIPVQINGSKPFNFILDTGVGSTLITDPSVALALDLPMFRKLKVAGVSSQNRLKAHVSN 96

Query: 886  GNSFQLGPLKIESPLFVEV-DTASLNIGPH---PIVGVCGFDVFQQGIVKLSCREESISI 719
              S ++    +    +V V +   LN+  +   PI G+ G+D+F + IVK++     I++
Sbjct: 97   IESIKIFKHIVALKQYVIVLEEDVLNLSGYAGIPIHGIIGYDLFSKFIVKINYDYHKITL 156

Query: 718  FDSKEFERDSEYSLEWEKLAFIQNVPNIAAKI------NGQNALLLFDTGASGVDVIFHS 557
            ++ + F    +   E   +      P + AK+            L+FDTGA    +++ +
Sbjct: 157  YNPERFNYRKKKKHEVLPIRIEAKKPILEAKVWCEQHKQAAPVRLVFDTGAGHALLLYQN 216

Query: 556  ESDIYTNTQLERSTNIRGINSQGLRQQVSDMVINRVEIANHVFKNVHAICLSSETKLPLS 377
             S   +        ++    S  L  ++    + ++++  +    V      S + + + 
Sbjct: 217  SSPGISLPAKTIKAHLGATLSGNLIGKLGK--LKQIQLGKYTLPQVVTSFPDSSSYVSMK 274

Query: 376  EYT--SGILCMDMIENFVIIIDYPNRRIALVDNRN 278
             +T  +G + + +I+ F  IIDYPN+R+ +  NR+
Sbjct: 275  AFTPRNGNIGLGIIKRFHTIIDYPNKRLIVRPNRH 309


>ref|WP_018630490.1| hypothetical protein [Niabella aurantiaca]
          Length = 399

 Score = 64.7 bits (156), Expect = 1e-07
 Identities = 67/277 (24%), Positives = 122/277 (44%), Gaps = 8/277 (2%)
 Frame = -3

Query: 1087 VKMLRAKGGQLLVKPHVDG-SDIGYFILDTGAGGLAISPSLAD--KLKMGAFGEVFVSGF 917
            +  ++  GG ++ K  +D   D   FILDTG+GG+++  +     +LK GA  E  + G 
Sbjct: 36   IPFVQFTGGVIVFKALLDDFKDSLNFILDTGSGGISLDSTTVKNLQLKPGA-PERIIRGI 94

Query: 916  HGHKKSHYRRGNSFQLGPLKIESPLFVEVDTASLN-IGPHPIVGVCGFDVFQQGIVKLSC 740
             G +K  + +  S  L   +I+S  F  VD   L  +    I G+ G+ V ++ I+K++ 
Sbjct: 95   GGVRKVSFLKNRSLHLAGYEIDSLNFHVVDYDVLTALYGQKIDGIVGYSVLKRYILKINY 154

Query: 739  REESISIFDSKEFERDSEYSLEWEKLAFIQNVPNIAAKINGQNALLLFDTGASGVDVIFH 560
             ++ I  F     +      +    +  +    +  A    +    LFD GA G+ V+F 
Sbjct: 155  EKQEIGFFSRGSIKYPRGGYMMRPYIRMLPYSKSTIADNRKREFNYLFDLGA-GLTVLF- 212

Query: 559  SESDIYTNTQL----ERSTNIRGINSQGLRQQVSDMVINRVEIANHVFKNVHAICLSSET 392
              SD + N       +R T ++     G R  +   V+  + I  + F+NV       + 
Sbjct: 213  --SDDFMNDSAFLKSKRKTYLKQGEGLGGRVDMRLTVMKSLRIGPYKFRNVPINIFDDDY 270

Query: 391  KLPLSEYTSGILCMDMIENFVIIIDYPNRRIALVDNR 281
             +       G++  ++   F +II+Y  ++I L  NR
Sbjct: 271  NVTSYPSLGGLIGNEIFRRFNVIINYGKQQIHLTPNR 307


>ref|XP_005649893.1| hypothetical protein COCSUDRAFT_46683, partial [Coccomyxa
            subellipsoidea C-169] gi|384251872|gb|EIE25349.1|
            hypothetical protein COCSUDRAFT_46683, partial [Coccomyxa
            subellipsoidea C-169]
          Length = 326

 Score = 60.5 bits (145), Expect = 3e-06
 Identities = 53/242 (21%), Positives = 99/242 (40%), Gaps = 38/242 (15%)
 Frame = -3

Query: 1843 ALKECLHNLSLSTGVHYTKPTQHFTLKGGQHMLGLDGDWVFSWC--MDGRFYEKFDSQDI 1670
            +L   L  +  + G+ + +  +      G HM G+   W    C  +DG F E+   + +
Sbjct: 71   SLGNILQGVQKAIGITFPEDKEACFSGNGIHM-GMGVSWQLR-CRGVDGAFIEEIRGRHL 128

Query: 1669 TFECGYDGNNGH------WHVDHTGRVTAVELDDDEVTQLCVYLRTGFWLTEKGQNLLNI 1508
            +F+ G+ G +        W ++       ++LDD+E   +  ++RTGFW+TE+  + L I
Sbjct: 129  SFKYGHPGVSTGTRTAQTWGMEWDDLPKELQLDDNEALLIQAWVRTGFWVTEQALDHLAI 188

Query: 1507 QFEGEGNFDAIKKKTGE---------------------------IKFAICLKNKKIVAHL 1409
                E   + ++                                +  ++ L+  K+ A L
Sbjct: 189  SVNSEAEREELRAPEASESRLSSVHVPEDGPSTSDLGAQDTEESVVLSLRLRRGKVKALL 248

Query: 1408 HVDACQWLPLSMELKVLGQTERWEYK---DWRTIDDCLKHRYPYLCVRYPGAGGKDTFTV 1238
             V    W PL +   + GQ +   Y+   DW+T    ++H  P L V+  G GG ++F  
Sbjct: 249  WVCTRTWQPLRLACPLCGQIDMSTYEHWTDWQTSGPSIRH--PKLVVQRGGTGGNNSFAT 306

Query: 1237 AH 1232
             +
Sbjct: 307  VN 308


>emb|CDF78235.1| PDZ/DHR/GLGF domain-containing protein [Formosa agariphila KMM 3901]
          Length = 428

 Score = 60.1 bits (144), Expect = 3e-06
 Identities = 65/261 (24%), Positives = 117/261 (44%), Gaps = 16/261 (6%)
 Frame = -3

Query: 1015 FILDTGAGGLAISPSL--ADKLKMGAFGEVFVSGFHGHKKSHYR---RGNSFQLGPLKIE 851
            FILDTG     +   L   D L++       + G  G  KS      + N F++G     
Sbjct: 47   FILDTGVAKPIVFSFLNERDTLQINNTEAYLLRGL-GEGKSFEALKSKNNIFKIGKAVNL 105

Query: 850  SPLFVEVDTASLNIGPH---PIVGVCGFDVFQQGIVKLSCREESISIFDSKEFERDSEYS 680
            +     +  ASLN  P    PI G+ GFD+F+  +V+++   + I +   K ++     S
Sbjct: 106  NQELYAIHDASLNFTPQLGIPIHGIIGFDLFKDFVVEINYASKYIKMHKPKNYKYKKCKS 165

Query: 679  LEWEKLAFIQNVPNIAAKI--NGQN--ALLLFDTGASGVDVIFHSES-DIYTNTQLERST 515
             EW  L F  N P + A+I  +G+     LL DTG S    +F +E+  +Y   +     
Sbjct: 166  CEWVNLEFYNNKPFLNAEITLHGKQIPVKLLIDTGGSDSLWVFENEALGLYEQDKYFVDY 225

Query: 514  NIRGINSQGLRQQVSDMVINRVEIANHVFKNVHAICLSSETKLPLSE---YTSGILCMDM 344
               G+N     ++     I++++I +   K V+ +     + + L+      +G +  ++
Sbjct: 226  LGSGLNGSVYGKRAK---IDKLKINSFTLKEVN-VSYPDSSSIKLARRLVVRNGSVSGNV 281

Query: 343  IENFVIIIDYPNRRIALVDNR 281
            ++ F +I+DY   R+ L  N+
Sbjct: 282  LKRFNLIVDYRKARLILKKNK 302


>ref|WP_018478012.1| hypothetical protein [Pontibacter roseus]
          Length = 415

 Score = 59.7 bits (143), Expect = 4e-06
 Identities = 71/281 (25%), Positives = 138/281 (49%), Gaps = 22/281 (7%)
 Frame = -3

Query: 1057 LLVKPHVDGSDIGYFILDTGAGGLAISP-SLADKLKMGAFGEVFVSGFH-GHK-KSHYRR 887
            +++   ++GS   +FILD+G     I+    +D L +   G++ + G   GH+ ++ Y R
Sbjct: 51   IVIPVQINGSQPLHFILDSGVKNTLITRLHYSDSLSLNQAGKITLQGLGTGHEIEALYSR 110

Query: 886  GNSFQLGPLKIESP----LFVEVDTASLNIGPHPIVGVCGFDVFQQGIVKLSCREESISI 719
            GN+ QL  ++ ++     L  ++   S  +G   + G+ G+D+F+  IVK++   + +++
Sbjct: 111  GNNLQLPGIRGDNHQVYVLMEDIFNLSTRMGMS-VHGIIGYDIFRNFIVKINYDTKHLTL 169

Query: 718  F----DSKEFERDSEYSLEWEKL-AFIQNVPNIAAKINGQN--ALLLFDTGASGVDVIFH 560
            +    + K  +R   Y L  E   A+++       + NG +    L+ DTGAS      H
Sbjct: 170  YRPDLNLKVPKRGEVYPLMIENTKAYVEAG---VRQYNGDSLKVKLVIDTGAS------H 220

Query: 559  SESDIYTNTQ---LERSTNIRGINSQGLRQQVSDMV--INRVEIANHVFKNVHAICLSSE 395
            S S +Y  T    L     +     +GL   ++  +  IN  +I  +V +++ A     E
Sbjct: 221  SIS-LYLPTDERLLLPPKVMEAYLGRGLSGDINGKIGRINSFKIGKYVLQDLPASYPDEE 279

Query: 394  T---KLPLSEYTSGILCMDMIENFVIIIDYPNRRIALVDNR 281
            +    L L+   +G +  D+++ F +I DYP++R+ LV N+
Sbjct: 280  SIRAALNLAN-RNGNIGSDILKRFTVIFDYPHQRMMLVPNK 319

