BLASTX nr result

ID: Ephedra26_contig00020415 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra26_contig00020415
         (919 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ADE75937.1| unknown [Picea sitchensis]                             390   e-106
ref|XP_002295272.1| predicted protein [Thalassiosira pseudonana ...   175   2e-41
ref|XP_002180280.1| predicted protein [Phaeodactylum tricornutum...   152   2e-34
ref|XP_002182733.1| predicted protein [Phaeodactylum tricornutum...   139   2e-30
ref|XP_002669182.1| predicted protein [Naegleria gruberi] gi|284...   137   6e-30
ref|XP_005789382.1| hypothetical protein EMIHUDRAFT_98339 [Emili...   137   7e-30
ref|XP_002182676.1| predicted protein [Phaeodactylum tricornutum...   129   1e-27
ref|XP_002676812.1| predicted protein [Naegleria gruberi] gi|284...   126   1e-26
gb|EJK61576.1| hypothetical protein THAOC_17910 [Thalassiosira o...   116   1e-23
ref|XP_005838126.1| hypothetical protein GUITHDRAFT_66230 [Guill...   100   8e-19
ref|XP_002185821.1| predicted protein [Phaeodactylum tricornutum...   100   1e-18
gb|EJK47010.1| hypothetical protein THAOC_34300, partial [Thalas...    87   7e-15
ref|XP_641212.1| transmembrane protein 144 A [Dictyostelium disc...    87   1e-14
ref|XP_003294072.1| hypothetical protein DICPUDRAFT_90514 [Dicty...    86   1e-14
ref|XP_004350600.1| transmembrane protein [Dictyostelium fascicu...    85   3e-14
ref|XP_001775344.1| predicted protein [Physcomitrella patens] gi...    85   3e-14
ref|XP_003293389.1| hypothetical protein DICPUDRAFT_50939 [Dicty...    84   6e-14
gb|ABK23204.1| unknown [Picea sitchensis]                              84   6e-14
ref|XP_005758882.1| hypothetical protein EMIHUDRAFT_121297 [Emil...    82   4e-13
gb|EJK73333.1| hypothetical protein THAOC_05052 [Thalassiosira o...    80   8e-13

>gb|ADE75937.1| unknown [Picea sitchensis]
          Length = 359

 Score =  390 bits (1001), Expect = e-106
 Identities = 194/297 (65%), Positives = 232/297 (78%), Gaps = 6/297 (2%)
 Frame = +1

Query: 43  FVRQLYKSTACFLTSWLILVYTPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 222
           FV Q YKST CFLTSWL+L+YTP+KFTWWG+LG LIWV NG++AI +VRWAGIGV+Q LW
Sbjct: 36  FVFQSYKSTTCFLTSWLVLLYTPFKFTWWGILGALIWVTNGVLAIVAVRWAGIGVSQSLW 95

Query: 223 SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIR-LFNNFPTH 399
           SGLSLF +YIWGAY+ KEP+KNHGLS+LAL VMALGM+GVGF+VS K   + L + +   
Sbjct: 96  SGLSLFTAYIWGAYVLKEPLKNHGLSILALLVMALGMIGVGFAVSEKTVFQSLLDIWLKL 155

Query: 400 TESSTTIKTNDE----ELHDSVTPLMSNSPTSSCEREVEFTDQD-KNERDLLKGVLGAVF 564
              ST IK   +    ++ DS   L+    T +C  E E+ DQ  + E  L+KGVL AV 
Sbjct: 156 NPCSTKIKDCPQLSCIDVQDSSEALIPCETTKTCGVEEEYADQKYERENKLVKGVLCAVL 215

Query: 565 VGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQPSLYIP 744
           VG  NGSFMVPLKYAHKD+VG EYL+SFGIGAMTMT+++ GIY + L+F   P PSLYIP
Sbjct: 216 VGTLNGSFMVPLKYAHKDVVGAEYLVSFGIGAMTMTIILLGIYMTALAFHGRPLPSLYIP 275

Query: 745 GATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKEVRTK 915
           GA  PAFLAG LWS+GNFFSIYATLYLG+ALGWPLVQCQL+VSAMWAVF+YKEV ++
Sbjct: 276 GAAGPAFLAGFLWSMGNFFSIYATLYLGVALGWPLVQCQLIVSAMWAVFFYKEVTSR 332


>ref|XP_002295272.1| predicted protein [Thalassiosira pseudonana CCMP1335]
           gi|220968995|gb|EED87338.1| predicted protein
           [Thalassiosira pseudonana CCMP1335]
          Length = 373

 Score =  175 bits (444), Expect = 2e-41
 Identities = 103/313 (32%), Positives = 169/313 (53%), Gaps = 26/313 (8%)
 Frame = +1

Query: 46  VRQLYKSTACFLTSWLILVY-TPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 222
           V Q YKS  CFL+SWL+L+    + FTWWG++ GL WV  G   IF++R AG+ V+Q + 
Sbjct: 64  VMQSYKSLMCFLSSWLVLLCGQEFTFTWWGIVSGLFWVPAGAFNIFAIRNAGLAVSQGIV 123

Query: 223 SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTHT 402
           S   + +S+IWG  IF+E +K+  ++  A+C++  G+ G+ F  +++ +       P HT
Sbjct: 124 SSSIVMVSFIWGDLIFREAVKSELIAYFAVCLIMAGLYGMSFFSTSEEQ-------PEHT 176

Query: 403 ESSTTIKTNDEEL----HDSVTPLMSNSPTSSC--------------EREVEFTDQDKNE 528
             S      +E+L    H+S     S++  SS                R +    +  + 
Sbjct: 177 SVSDNDNNGEEKLDLMRHESSDSFDSSNDNSSMGPLEISERRKPSIRGRPILICGKTYSR 236

Query: 529 RDLLKGVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTML--VFGIYSSM 702
           R++  G+  A+  G+  GS +VP+ YA  D+ G+ Y++SF +GA+T+T+L  V      +
Sbjct: 237 RNI--GLCSALICGVWGGSCLVPMHYAQGDVKGLAYVISFSVGALTVTVLLWVARFAYHL 294

Query: 703 LSFRKIPQ-----PSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLL 867
           +  + + +     PS ++    LP   AG LWS+GN  SI A  +LG  +G+   Q  LL
Sbjct: 295 VKLKSVWEAYEVLPSFHLRVMLLPGATAGSLWSIGNVGSIVAVKHLGQGVGYSASQAALL 354

Query: 868 VSAMWAVFYYKEV 906
           VS MW +FY+K++
Sbjct: 355 VSGMWGIFYFKQM 367


>ref|XP_002180280.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
           gi|217408537|gb|EEC48471.1| predicted protein
           [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 346

 Score =  152 bits (384), Expect = 2e-34
 Identities = 102/301 (33%), Positives = 154/301 (51%), Gaps = 13/301 (4%)
 Frame = +1

Query: 46  VRQLYKSTACFLTSWLILVY-TPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 222
           V Q YK     LTSWL+L++  P+ FT WG + GL  V  G    F+V+ AG+ V+Q +W
Sbjct: 38  VLQTYKIGMTLLTSWLVLLFGVPFTFTPWGFVSGLFMVPGGTAGYFAVQNAGMAVSQGIW 97

Query: 223 SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTHT 402
           S L + +++ WG  IF EP+ +   + LA+ ++ +G+ GV    + +             
Sbjct: 98  SSLKVLVAFCWGILIFHEPVHSKLGTTLAIALLMVGLAGVSIFAAPR------------- 144

Query: 403 ESSTTIKTNDEELHDSVTPLMSNSPTSSCEREVEFTDQDKNERDLLK----GVLGAVFVG 570
              T+  +  EE      PL+   P    + + E  D +K+    LK    G+LGAV  G
Sbjct: 145 ---TSTSSPQEE------PLL---PDVEEQNQPEIVD-NKDYLGFLKRRHVGLLGAVIDG 191

Query: 571 ISNGSFMVPLKYA-HKDIVGVEYLLSFGIGAMTMTMLVF-------GIYSSMLSFRKIPQ 726
              GS +VP+ YA  K   G+ Y++SF IG  ++  +V+        +    L       
Sbjct: 192 AYGGSVLVPMHYAGPKTTNGLSYVMSFAIGCSSVVTMVWVLRLLFNSVQGQSLRVGYDRL 251

Query: 727 PSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKEV 906
           PSL++      A LAGL+WSLGN  SI     LG  +G+ +VQ QLLV+ +W VF+YKE+
Sbjct: 252 PSLHVTTIGPYAALAGLIWSLGNVSSILTVALLGEGVGYSIVQSQLLVAGLWGVFWYKEI 311

Query: 907 R 909
           R
Sbjct: 312 R 312


>ref|XP_002182733.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
            gi|217406079|gb|EEC46020.1| predicted protein
            [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 404

 Score =  139 bits (349), Expect = 2e-30
 Identities = 89/329 (27%), Positives = 164/329 (49%), Gaps = 41/329 (12%)
 Frame = +1

Query: 46   VRQLYKSTACFLTSWLILVY-TPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 222
            V Q YK+   F+ SWL++       +T WG++ G +WV+ G   + ++R AG+ +A   W
Sbjct: 44   VFQSYKTITMFMLSWLVIFMGIAPSWTSWGLVSGGLWVVGGTGGVLAIRMAGLAIAVGTW 103

Query: 223  SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTHT 402
            + + + I+++ G  +F+EP+ +   +L A  ++ALG++G+    + +   +L       T
Sbjct: 104  ASVMIVINFLVGIVLFQEPVSDMFATLGAFLLLALGLVGMSLYSTPQPVDQL-----PST 158

Query: 403  ESSTTIKTNDEELHDSVTPLM----------------------SNSPTSSCER-EVEFT- 510
            E +  I  N  E+ +    L+                      S S  SS +  E  FT 
Sbjct: 159  EMTENIGPNQNEVEEIDRALIVKRTSSYTGKIDHRDIQRRNEESGSYGSSADADEPLFTI 218

Query: 511  -DQDKNERDLLKGVLGAVFVGISNGSFMVPLKYAH-KDIVGVEYLLSFGIGAMTMTMLVF 684
             D  K +R    G+ GA+F G+  GS ++PL YA  +   G  Y++S+  GA+ M  L++
Sbjct: 219  PDGTKRKRSGPTGICGAIFNGVMTGSSLIPLHYAKTQGYGGANYMISYASGAIVMNCLIW 278

Query: 685  GIYSSMLSFRKIPQ--------------PSLYIPGATLPAFLAGLLWSLGNFFSIYATLY 822
            G++ +   ++ + Q              P+ +     LP F +G+L ++  F SI +  Y
Sbjct: 279  GVFFAYTCYQTVQQDLNVPVLLHTFQVMPAWHFRKLWLPGFTSGVLLTIAMFGSILSVTY 338

Query: 823  LGLALGWPLVQCQLLVSAMWAVFYYKEVR 909
            LG  +G  +VQ ++L+S +W +F+++E+R
Sbjct: 339  LGQGIGNSIVQAKILISGLWGIFWFREIR 367


>ref|XP_002669182.1| predicted protein [Naegleria gruberi] gi|284082726|gb|EFC36438.1|
            predicted protein [Naegleria gruberi]
          Length = 425

 Score =  137 bits (345), Expect = 6e-30
 Identities = 83/313 (26%), Positives = 157/313 (50%), Gaps = 25/313 (7%)
 Frame = +1

Query: 46   VRQLYKSTACFLTSWLILVYTPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWS 225
            V Q Y S+   +TS+++L++  + F++WG+LG  +WV   ++++ ++   G+GVAQ +WS
Sbjct: 88   VFQFYFSSMVLITSFIVLIWNEWYFSFWGILGAAVWVPASLLSLIAIHLLGLGVAQGVWS 147

Query: 226  GLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVG----------FSVSNKRRIR 375
            G+++  S+ WG  +F   I N  L+ LAL +M +G++G+              S+     
Sbjct: 148  GVNIITSFTWGVALFHSEIGNPYLTALALILMVVGIVGIATCSKWNLPELLPASSTETKS 207

Query: 376  LFNNFPTHTES-----------STTIKTNDEELHDSV--TPLMSNSPTSSCEREVEFTDQ 516
            L N   TH +            +  ++ N++ +  ++  T      PT    R+ +    
Sbjct: 208  LVNETVTHYDGNEENPEAPNTFNPEVQNNEQAVEQTIETTQEEEEYPTQPLSRKEKIVSI 267

Query: 517  DKNERDLLKGVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFGIYS 696
             K+ ++ + G+  +V VG+  GS  VP ++  K   GV Y++ FG G+  +T  +  IY 
Sbjct: 268  LKSSKNYILGLACSVGVGVLGGSQFVPSRFEEKP--GVVYVVGFGFGSAGITSAILVIYY 325

Query: 697  SMLSFR-KIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATL-YLGLALGWPLVQCQLLV 870
                 R ++  P  + P   +   +   LW +GN  + Y ++  LG  +G PL Q  L+V
Sbjct: 326  IYYIIRYRVVLP--FHPKVAVFPCITACLWQVGNVMATYVSMSSLGFTIGLPLTQASLVV 383

Query: 871  SAMWAVFYYKEVR 909
            + +  + ++KE+R
Sbjct: 384  AGICGLLFFKELR 396


>ref|XP_005789382.1| hypothetical protein EMIHUDRAFT_98339 [Emiliania huxleyi CCMP1516]
           gi|485642886|gb|EOD36953.1| hypothetical protein
           EMIHUDRAFT_98339 [Emiliania huxleyi CCMP1516]
          Length = 358

 Score =  137 bits (344), Expect = 7e-30
 Identities = 84/293 (28%), Positives = 137/293 (46%), Gaps = 7/293 (2%)
 Frame = +1

Query: 52  QLYKSTACFLTSWLILVYTPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWSGL 231
           QLY S    L+S L+L  TP+ F++WG  G  +W+ + +     +   G GVA   W   
Sbjct: 72  QLYFSAGVALSSILVLALTPFSFSFWGFAGASLWISSMMCGKIGIDGIGYGVAVATWGST 131

Query: 232 SLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTHTESS 411
           ++ +S++WG  +F E   +   ++ ALC +A G+ GV  + S           P   E++
Sbjct: 132 TMIVSFLWGTLVFAERPSSVTGAVAALCTLAAGVAGVATAQSGSLG-------PPEAEAA 184

Query: 412 TTIKTNDEELHDSVTPLMSNSPTSSCEREVEFTDQDKNERDLLKGVLGAVFVGISNGSFM 591
                N  E         + +                       G LGA+  G+ NGS M
Sbjct: 185 AEAFLNPAEGRVGGAAARAGA-----------------------GWLGALGCGLLNGSLM 221

Query: 592 VPLKYAHKD-------IVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQPSLYIPGA 750
           VP  Y  ++        VG+ Y+ +F  G   +  + F +Y+ +  FR  PQP L     
Sbjct: 222 VPFHYFSEERSGQDGASVGMGYIATFATGVAAVQPIFFLLYARV-PFR--PQPPLLCSEL 278

Query: 751 TLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKEVR 909
            LP  + G+ W++GNF S +ATL+LG A+G+PL Q  ++V+ +W   ++ E+R
Sbjct: 279 ALPGLITGVFWAIGNFESTFATLHLGQAVGYPLTQTCIVVAGLWGALFFGEIR 331


>ref|XP_002182676.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
            gi|217406022|gb|EEC45963.1| predicted protein
            [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 413

 Score =  129 bits (325), Expect = 1e-27
 Identities = 98/340 (28%), Positives = 161/340 (47%), Gaps = 52/340 (15%)
 Frame = +1

Query: 46   VRQLYKSTACFLTSWL-ILVYTPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 222
            V Q YK++ CFLT WL IL+    ++T +G++ GL WV    + IF +R AG+ VA   W
Sbjct: 44   VMQSYKTSVCFLTCWLVILLGEEPRWTPYGIVSGLFWVPGAAMGIFGIRNAGLAVAVGTW 103

Query: 223  SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRI-------RLF 381
            S +++  S+ +G  +F+E +K+   + LA   + +G++G+    ++++++       R  
Sbjct: 104  SSITVLTSFFFGIIVFQERVKSFYQTCLAFGCLIIGLIGMSRFSAHQQQVDTLAVSYRSV 163

Query: 382  NNFPTHT--------ESSTTIKTNDEELHDSVT-PLMSNSPTSSCEREVEFTDQD----- 519
                +H          + +TI  N      S+T PL+  S     E E   TD +     
Sbjct: 164  KTAASHPLGLGQKLKRAGSTIAEN------SITVPLVGASGVIPMEIEPFATDGEDIVMG 217

Query: 520  ---------KNERDLL-----------KGVLGAVFVGISNGSFMVPLKYA--HKDIVGVE 633
                       +R +L            G+LGAV  G   G  ++PL +A   +D+ G  
Sbjct: 218  TYDDAKSVLSKDRLVLFGGRVSLTRRQMGILGAVINGAWGGMNLIPLHFALQEEDMTGAG 277

Query: 634  YLLSFGIGAMTMTMLVF----GIYSSMLSFRKIPQ----PSLYIPGATLPAFLAGLLWSL 789
            YL+S+  G++ +   ++    G Y    +          P  +     +P  +AGLL+S 
Sbjct: 278  YLISYATGSLIVNTCIWLAFLGYYLHQTNGHWNEAVDCLPKWHFEHLLIPGLMAGLLYSF 337

Query: 790  GNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKEVR 909
            GNF SI A  YLG   G+   Q QL VS +W VF++KEV+
Sbjct: 338  GNFCSILAVTYLGQGTGFSFCQMQLFVSGLWGVFFFKEVQ 377


>ref|XP_002676812.1| predicted protein [Naegleria gruberi] gi|284090416|gb|EFC44068.1|
           predicted protein [Naegleria gruberi]
          Length = 383

 Score =  126 bits (316), Expect = 1e-26
 Identities = 82/323 (25%), Positives = 149/323 (46%), Gaps = 35/323 (10%)
 Frame = +1

Query: 46  VRQLYKSTACFLTSWLILVYTPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWS 225
           V Q Y S    L S ++L +  +K++WW V G  IWV + + +I +V + G  VAQ  W+
Sbjct: 30  VFQFYFSLVVGLMSLIVLAWNEFKWSWWAVAGSGIWVPSSLFSIVAVEYLGAAVAQSTWA 89

Query: 226 GLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTHTE 405
           G  +  ++IWG  +F+  I N  L++  L +M +G+ G   + S            + T 
Sbjct: 90  GCVIITNFIWGVTLFQSKIGNIYLTVFGLVIMIIGIFGTA-TCSKWNNPEPVAEKQSETS 148

Query: 406 SSTTIKTNDEELHDSVTPLMS---------------------NSPTSSCE---------- 492
            + +++ + +E +   TPL                       N PT   E          
Sbjct: 149 INASVEESGQENNTETTPLYQQENSTNQQENISSDVPIYPSVNDPTLYSELSEIESTIGV 208

Query: 493 ---REVEFTDQDKNERDLLKGVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAM 663
              +  +F    KN +    G++ +V  GI+ GS  VP +    +  G+ Y+++FGIG+ 
Sbjct: 209 YETKSQKFIKILKNSKRYFIGLVASVLCGITGGSMFVPSRL--DEDTGLVYMVAFGIGSF 266

Query: 664 TMTMLVFGIYSSMLSFRKIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATLY-LGLALG 840
            +T  +  +Y      R   +   ++  +  PA L   LW  GNFF+ Y ++  LGL +G
Sbjct: 267 VITTAILIVYYVYYLIRFKKRVPFHLKLSIFPA-LTAFLWQTGNFFAYYVSVSPLGLTIG 325

Query: 841 WPLVQCQLLVSAMWAVFYYKEVR 909
            PL +  ++++ +  + +++E+R
Sbjct: 326 MPLTETAMVITGICGLVFFRELR 348


>gb|EJK61576.1| hypothetical protein THAOC_17910 [Thalassiosira oceanica]
          Length = 360

 Score =  116 bits (291), Expect = 1e-23
 Identities = 85/327 (25%), Positives = 146/327 (44%), Gaps = 41/327 (12%)
 Frame = +1

Query: 46  VRQLYKSTACFLTSWLILVYTPYK---------------FTWWGVLGGLIWVINGIVAIF 180
           V Q YK+ A F+TS L++ +                   FT W  +  + WV  G   +F
Sbjct: 43  VFQTYKAVAVFVTSLLLVAFCNLMHGTHPDSFDYWSFADFTHWAFVSAIFWVPGGTAGVF 102

Query: 181 SVRWAGIGVAQCLWSGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSN 360
           +VR AG+ ++  LWS + + +SY+WG  IF E  ++   ++ A+ +M +G++G+    S 
Sbjct: 103 AVRRAGLAISTGLWSCVIILLSYLWGVLIFHEKQESAVGAVGAVLLMCVGLIGIAHFSS- 161

Query: 361 KRRIRLFNNFPTHTESSTTIKTNDEELHDSVTPLMSNSPTSSCEREV-EFTDQDKNERDL 537
              I +         +  +++       D  TPL   +  +  + ++ + T Q       
Sbjct: 162 ---IEVRPGLDQARAAPRSVEECRPACSDETTPLNGINRANDAQFDLAKLTSQ------- 211

Query: 538 LKGVLGAVFVGISNGSFMVPLKYAHKDIV-GVEYLLSFGIGAMTMTMLVFGIYSSMLSFR 714
           L G+  AV  G+   S M+PL YA  +   G+ Y +SFGI A+ +  + + I    L+  
Sbjct: 212 LPGLFAAVLNGLFAASIMLPLHYAPPNTTKGIGYSMSFGIAAVVVVFIFWTIRLLALTAA 271

Query: 715 KIPQ------------------------PSLYIPGATLPAFLAGLLWSLGNFFSIYATLY 822
           +                           PS +      P F AGLL+S GN F I +  +
Sbjct: 272 EFAAKQNEAKRITPNIIRESLREGYSQLPSFHFSEMWRPGFTAGLLYSGGNLFGIVSIQH 331

Query: 823 LGLALGWPLVQCQLLVSAMWAVFYYKE 903
           LG  +G+ L Q  +++S  W +F+Y+E
Sbjct: 332 LGNFMGYSLNQSSMIISGCWGLFWYRE 358


>ref|XP_005838126.1| hypothetical protein GUITHDRAFT_66230 [Guillardia theta CCMP2712]
           gi|428182285|gb|EKX51146.1| hypothetical protein
           GUITHDRAFT_66230 [Guillardia theta CCMP2712]
          Length = 341

 Score =  100 bits (249), Expect = 8e-19
 Identities = 73/298 (24%), Positives = 132/298 (44%), Gaps = 21/298 (7%)
 Frame = +1

Query: 85  SWLILVYTPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWSGLSLFISYIWGAY 264
           S  +L  +P +++ WG  G L+       A  +V   G      +W G+ + ++++WG  
Sbjct: 50  SLALLKGSPVRWSSWGAAGALLLTATQCCAWPAVGALGAAAGPGIWCGVGMSVAFMWGTI 109

Query: 265 IFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTHTESSTTIKTNDEELH 444
           +F+E +++  L ++AL ++  G++G+    S+  + RL                      
Sbjct: 110 VFQEAVRSLALCIVALILLFFGIVGISLVQSSMLQ-RLLGE------------------- 149

Query: 445 DSVTPLMSNSPTSSCEREVEFTDQDKNERDLLKGVLGAVFVGISNGSFMVPLKY------ 606
              T LMS   ++   R             +  GVL A+  G+ +GS M P K       
Sbjct: 150 SGATGLMSEEESNKTGRA-----------RIAVGVLLALMTGLFDGSLMAPFKAYLASHP 198

Query: 607 -------------AHKDIVGVEYLLSFGIGAMTMT--MLVFGIYSSMLSFRKIPQPSLYI 741
                        +  D+V  EYL SF +    +    LV  ++    +    P  S + 
Sbjct: 199 SLVSSSSSSSSSSSSSDVVVFEYLGSFALALPVVAGGSLVLIMFYQHRALNSGPDRSSFR 258

Query: 742 PGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKEVRTK 915
             A  P F AG+LW++GN  S++ATL LG ++G+P+ Q  +++SA+W +  +KE+  +
Sbjct: 259 QAA-YPGFCAGVLWAVGNVLSVHATLELGQSIGFPMTQSCVVISALWGIVVFKEMTAR 315


>ref|XP_002185821.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
           gi|209582670|gb|ACI65291.1| predicted protein
           [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 451

 Score = 99.8 bits (247), Expect = 1e-18
 Identities = 49/128 (38%), Positives = 76/128 (59%), Gaps = 7/128 (5%)
 Frame = +1

Query: 544 GVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMT----MTMLVFG---IYSSM 702
           G++ A+F G+  GS M P+K+   D  G  +LLSF IGA      M ++ +G   ++   
Sbjct: 294 GMVAAMFCGVWGGSIMAPMKFCQSDTKGTHFLLSFSIGASIVNTGMWLVRYGYNVLHYQS 353

Query: 703 LSFRKIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMW 882
            S      PS ++    L   L+G+LWS+GNFFS+ +  YLG  +G+PLVQ  ++VS +W
Sbjct: 354 CSKAYASLPSFHLHTMWLAGGLSGMLWSIGNFFSLISVFYLGQGVGYPLVQTSIIVSGLW 413

Query: 883 AVFYYKEV 906
            +FY+KE+
Sbjct: 414 GIFYFKEI 421



 Score = 87.0 bits (214), Expect = 9e-15
 Identities = 40/102 (39%), Positives = 64/102 (62%), Gaps = 1/102 (0%)
 Frame = +1

Query: 46  VRQLYKSTACFLTSWLILVY-TPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 222
           V Q YK+  CF TSWL+L+   P+ FT WG++ GL WV  G   IF+V+ AG+ +   + 
Sbjct: 49  VFQTYKTFMCFATSWLVLLAGEPFTFTPWGIVSGLFWVPGGTATIFAVKNAGLAIGIGIG 108

Query: 223 SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGF 348
           S   + +S+IWG ++F+E + +   + LA+  M LG+LG+ +
Sbjct: 109 SSFIVLVSFIWGIFVFEEAVHSKTGACLAIFSMMLGLLGMSY 150


>gb|EJK47010.1| hypothetical protein THAOC_34300, partial [Thalassiosira oceanica]
          Length = 641

 Score = 87.4 bits (215), Expect = 7e-15
 Identities = 41/102 (40%), Positives = 65/102 (63%), Gaps = 1/102 (0%)
 Frame = +1

Query: 46  VRQLYKSTACFLTSWLI-LVYTPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 222
           V Q YKS  CFLTSWL+ L+     FT WG++ GL WV  G   IF++R AG+ ++Q + 
Sbjct: 41  VMQSYKSLMCFLTSWLVVLLGVEVTFTPWGIVSGLFWVPGGAFNIFAIRNAGLAISQGIV 100

Query: 223 SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGF 348
           +   + +S+IWG  IFKEP+ +  ++  A+ ++ LG+ G+ +
Sbjct: 101 ASSIVMVSFIWGNIIFKEPVHSEVIAYSAVWLIMLGLYGMSY 142


>ref|XP_641212.1| transmembrane protein 144 A [Dictyostelium discoideum AX4]
           gi|74855822|sp|Q54V96.1|T144B_DICDI RecName:
           Full=Transmembrane protein 144 homolog B; AltName:
           Full=Transmembrane protein 144 homolog 2
           gi|60469237|gb|EAL67232.1| transmembrane protein 144 A
           [Dictyostelium discoideum AX4]
          Length = 358

 Score = 86.7 bits (213), Expect = 1e-14
 Identities = 71/282 (25%), Positives = 125/282 (44%), Gaps = 16/282 (5%)
 Frame = +1

Query: 112 YKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWSGLSLFISYIWGAY----IFKEP 279
           Y F  WG+LGG +W I     I  V+  GIG+   LW   S+   Y  G +    I K+ 
Sbjct: 59  YIFDPWGLLGGTLWSIGNFCVIPIVKTIGIGLGLLLWCCSSIITGYFTGKFGWFGIDKQK 118

Query: 280 IKNHGLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTHTESSTTIKTNDEELHDSVTP 459
           + +  L+ +    +   ++   F           +      +S       D   ++S+  
Sbjct: 119 VSHPALNWIGFACIVAAVIFFFFIEPTIEEKDEHSYSSIVDDSEIGNNGIDNNGYNSINN 178

Query: 460 LMSNSPTSSCEREVEFTDQDKN---ER-----DLLKGVLGAVFVGISNGSFMVPL---KY 606
             +N+  ++  R   F  Q K    ER     + + G++ +VF GI  G  MVP+   K 
Sbjct: 179 --NNNNGNNKRRSGAFNKQPKKSIFERMPPPYNTILGIVLSVFSGIMYGVNMVPMQLWKQ 236

Query: 607 AHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQPSLYIPGATLPAFLAGLLWS 786
           ++ D   + ++     G       VF +YS ++      +P    P    P+F +GLLW 
Sbjct: 237 SNVDASPLSFVFCHFSGIFLANTAVFIVYSIIV------RPPQIFPQTIFPSFFSGLLWG 290

Query: 787 LGNFFSIYATLYLGLALGWPLVQ-CQLLVSAMWAVFYYKEVR 909
           + N   + AT  LG  +G+P+     ++VS++W+VFY++E++
Sbjct: 291 IANVGLMVATQNLGYTIGFPMGSGGPMIVSSLWSVFYFREIQ 332


>ref|XP_003294072.1| hypothetical protein DICPUDRAFT_90514 [Dictyostelium purpureum]
           gi|325075525|gb|EGC29401.1| hypothetical protein
           DICPUDRAFT_90514 [Dictyostelium purpureum]
          Length = 337

 Score = 86.3 bits (212), Expect = 1e-14
 Identities = 67/269 (24%), Positives = 132/269 (49%), Gaps = 12/269 (4%)
 Frame = +1

Query: 136 LGGLIWVINGIVAIFSVRWAGIGVAQCLWSGLSLFISYIWGA---YIFKEPIKNHG-LSL 303
           +GG +W    ++ I +++  G+G+A  L+S + +   +I G    +  KE    H  ++ 
Sbjct: 53  MGGSLWCCANLLVIPTIKLLGLGLAVLLYSSIGIVAGFIVGKAGLFGLKEAAAAHDWMNY 112

Query: 304 LALCVMALGMLGVGF---SVSNKRRIRLFNNFPTHTESSTTIKTNDEELHDSVTPLMSNS 474
           L L  + L ++   F   ++  +++     N+    +  + I  +DEE++   +PL+ N+
Sbjct: 113 LGLAGIILSVIFFFFIKPNLEEEKKADTKGNYHGSYDDFSNI--SDEEIN---SPLIVNT 167

Query: 475 PTSSCEREVEFTDQDKNERDLLKGVLGAVFVGISNGSFMVPL---KYAHKDIVGVEYLLS 645
            T     EV   D+  N    + GV+ A+ +GI  G  M+P+   K  + D   ++Y  S
Sbjct: 168 QTQIKNYEVSIYDRIPNRLKTVSGVVFALVIGILLGVNMIPMQLWKQRNPDANPLDYTFS 227

Query: 646 FGIGAMTMTMLVFGIYSSMLSFRKIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATLYL 825
              G       VF +Y+       I +P    P   LP+F +G++  + N   + +T  L
Sbjct: 228 QFSGIFLANTFVFILYTI------IKRPPQIYPQTILPSFCSGVVLGVANIGLMISTENL 281

Query: 826 GLALGWPLVQCQ--LLVSAMWAVFYYKEV 906
           G  +G+P + C   ++VS++W++FY++E+
Sbjct: 282 GYTVGYP-ISCSGPMIVSSLWSIFYFREI 309


>ref|XP_004350600.1| transmembrane protein [Dictyostelium fasciculatum]
           gi|328865506|gb|EGG13892.1| transmembrane protein
           [Dictyostelium fasciculatum]
          Length = 435

 Score = 85.1 bits (209), Expect = 3e-14
 Identities = 70/286 (24%), Positives = 126/286 (44%), Gaps = 20/286 (6%)
 Frame = +1

Query: 112 YKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWSGLSLFISYIWGAY----IFKEP 279
           Y F  WG+LGG +W +  +  I  V+  G+G+   LWS  SL   ++ G +    + K+ 
Sbjct: 145 YIFDPWGLLGGSLWSVGNLCVIPIVKTIGLGLGLLLWSCTSLVTGFLIGKFGAFGLDKQS 204

Query: 280 IKNHGLSLLALCVMALGMLGVGF-----------SVSNKRRIRLFNNFPTHTESSTTIKT 426
           + +  L+ L    + + +L   F           + S KR  + +   P   E   +I +
Sbjct: 205 VAHPVLNWLGFSAIVVAILFFFFIKPTLNKEEPTTPSKKRLSQRYEYSPIVDEQQISINS 264

Query: 427 NDEELHDSVTPLMSNSPTSSCEREVEFTDQDKNERDLLKGVLGAVFVGISNGSFMVPL-- 600
            +            NS     +   E   +  N+   + GV+ +VF G+  G  MVP+  
Sbjct: 265 TE------------NSAPVEGQMIFEKIPEPYNK---IFGVMLSVFSGVLYGVNMVPMQL 309

Query: 601 --KYAHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQPSLYIPGATLPAFLAG 774
             +    D+  + ++     G       VF +YS       I +P    P   LP+F++G
Sbjct: 310 WKQQQPSDVNPLSFIFCHFSGIFLFNTAVFFVYSI------IKRPPQVFPQTMLPSFISG 363

Query: 775 LLWSLGNFFSIYATLYLGLALGWPLVQC-QLLVSAMWAVFYYKEVR 909
           +LW + N   + AT  LG  +G+P+     ++VS++W+V  +KE++
Sbjct: 364 VLWGVANCGLMVATQILGYTIGFPIGSSGPMVVSSLWSVLLFKEIQ 409


>ref|XP_001775344.1| predicted protein [Physcomitrella patens]
           gi|162673289|gb|EDQ59814.1| predicted protein
           [Physcomitrella patens]
          Length = 344

 Score = 85.1 bits (209), Expect = 3e-14
 Identities = 70/279 (25%), Positives = 114/279 (40%), Gaps = 4/279 (1%)
 Frame = +1

Query: 79  LTSWLILVYTPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWSGLSLFISYIWG 258
           L+S L+L    + F   G+L G+ +V++ I    +VR  G+ VA  +W+G +  +   W 
Sbjct: 78  LSSLLLLFKYKFVFALEGLLSGVFFVLSFINIFRAVRLLGVSVAYGIWAGTAAIVGVAWS 137

Query: 259 AYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTHTESSTTIKTNDEE 438
             +  EP                               + F     HT+  T I++    
Sbjct: 138 GQMSWEP-------------------------------QDFYEDDDHTQ--TLIQSQPSF 164

Query: 439 L----HDSVTPLMSNSPTSSCEREVEFTDQDKNERDLLKGVLGAVFVGISNGSFMVPLKY 606
                H  +  +   S   S ER       +   R    GV  AV  GI  G  M+P   
Sbjct: 165 AGWVQHRKLWDVAGQS--KSGERPKNVLTGEPASRSFPAGVFSAVLAGILGGLVMIPANQ 222

Query: 607 AHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQPSLYIPGATLPAFLAGLLWS 786
           A     G  +L SFGI     T +V    +S+        P L    A  P  L+G +++
Sbjct: 223 APDMAQGNAFLPSFGIAVAIFTPIV----TSLPYLSGCELPDLSAREAAGPGILSGFIYN 278

Query: 787 LGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKE 903
           +GN  +I A  Y+G ++ +PL QC ++V+ +W + Y++E
Sbjct: 279 IGNMLNIVAIFYVGSSVAYPLFQCGIIVAGIWGMLYFEE 317


>ref|XP_003293389.1| hypothetical protein DICPUDRAFT_50939 [Dictyostelium purpureum]
           gi|325076279|gb|EGC30078.1| hypothetical protein
           DICPUDRAFT_50939 [Dictyostelium purpureum]
          Length = 348

 Score = 84.3 bits (207), Expect = 6e-14
 Identities = 70/275 (25%), Positives = 122/275 (44%), Gaps = 9/275 (3%)
 Frame = +1

Query: 112 YKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWSGLSLFISYIWGAYIFKEPIKNH 291
           Y F  WG+LGG +W +     I  V+  G+G+   LW   S+   +  G +         
Sbjct: 64  YLFDPWGLLGGTLWSLGNFCVIPIVKTIGLGLGLLLWCCCSIVAGFFTGKFGL------F 117

Query: 292 GLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTHTESSTTIKTNDEELHDSVTPLMSN 471
           GL    +   A+  +G G  V+    I  F       E+  T   +   + D   P+++ 
Sbjct: 118 GLEKQVVSHPAMNWIGFGCIVA---AIVFFFFIKPTLENEDTESNSYSSIVDDY-PIINE 173

Query: 472 SPTSSCEREVEFTDQDKNER--DLLKGVLG---AVFVGISNGSFMVPL---KYAHKDIVG 627
           +      +  + T++   ER    +K +LG   A+F GI  G  MVP+   K +      
Sbjct: 174 AGYRGSIQSSKLTEKSFFERIPQPMKTMLGIGLAIFSGIMYGVNMVPMQLWKQSDPSANP 233

Query: 628 VEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQPSLYIPGATLPAFLAGLLWSLGNFFSI 807
           + ++     G      +VF +Y+       I +P    P   LP+F +G+LW + N   +
Sbjct: 234 LSFIFCHFSGIFIFNTIVFFVYAI------IKRPPQVFPQTMLPSFFSGVLWGIANCGLM 287

Query: 808 YATLYLGLALGWPL-VQCQLLVSAMWAVFYYKEVR 909
            AT  LG  +G+P+     ++VS++W+V Y+KE++
Sbjct: 288 VATQNLGYTVGFPMGASGPMVVSSIWSVVYFKEIQ 322


>gb|ABK23204.1| unknown [Picea sitchensis]
          Length = 196

 Score = 84.3 bits (207), Expect = 6e-14
 Identities = 41/125 (32%), Positives = 71/125 (56%)
 Frame = +1

Query: 535 LLKGVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFR 714
           L +GV+ A+  GI  G  M+P+  +   I GV YL SF IG      +V  I    LS +
Sbjct: 51  LSQGVIAALLTGILGGLIMMPMTQSPPAIQGVSYLPSFAIGVAIFAPVVTAI--PYLSTQ 108

Query: 715 KIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFY 894
           + P+  LY+    LP  ++G++W++GN  S+ A   +G  + +P++ C + ++ +W +F 
Sbjct: 109 ECPRMELYV--GALPGIISGIVWNIGNILSMLAIGIIGYTIAYPILYCGIFIAGLWGMFL 166

Query: 895 YKEVR 909
           +KE+R
Sbjct: 167 FKEIR 171


>ref|XP_005758882.1| hypothetical protein EMIHUDRAFT_121297 [Emiliania huxleyi CCMP1516]
           gi|485606731|gb|EOD06453.1| hypothetical protein
           EMIHUDRAFT_121297 [Emiliania huxleyi CCMP1516]
          Length = 334

 Score = 81.6 bits (200), Expect = 4e-13
 Identities = 72/290 (24%), Positives = 129/290 (44%), Gaps = 28/290 (9%)
 Frame = +1

Query: 124 WWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWSGLSLFISYIWGAYIFKEPIKNHGLSL 303
           W G    + W    +  +++V   GI V Q   +GL + ++ +WGA +F E ++  G  +
Sbjct: 30  WRGAASAICWAPATVAYVYAVNAIGITVTQTCVAGLLVAVNVLWGACMFSEQLE--GTCI 87

Query: 304 LALCVMALGML-GVGFSVSNKRRIRLFNNFPTHTE-------SSTTIKTNDEEL------ 441
           + L +   G+L G+     ++R++       +H +       +S+T  T+ E+L      
Sbjct: 88  MGLALTTCGILAGINAKKFSERQMNRIAGQHSHLDLSQPVQAASSTAPTDQEQLVEKGAL 147

Query: 442 --HDSVTPLMSNSPTSSCEREVEFTDQDKNERDLLKGVLGAVFVGISNGSFMVP---LKY 606
             H+ V      +     +++           +   G+    F GI  GS MVP   +  
Sbjct: 148 RSHELVVSNGGGAHAGGGQQQPAAAHPVPAPSEWRVGMAAVCFNGIWGGSIMVPTHGMSR 207

Query: 607 AHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQPSLYIPGATL---PAFLAGL 777
           A  D     Y L+FG  A+ +  +++      +++R +P   LY+    +       +G 
Sbjct: 208 APADY--ASYSLAFGSTALLVIAVLW-----CINWRSVP---LYVAQCRVVWRRGLASGC 257

Query: 778 LWSLGNFFSIYAT------LYLGLALGWPLVQCQLLVSAMWAVFYYKEVR 909
           LW++GN  S  A         LGLALG+  VQC L+VS+ W V  ++EV+
Sbjct: 258 LWAVGNVCSAVAVNGWGDMAGLGLALGYSAVQCNLVVSSTWGVCLFREVQ 307


>gb|EJK73333.1| hypothetical protein THAOC_05052 [Thalassiosira oceanica]
          Length = 313

 Score = 80.5 bits (197), Expect = 8e-13
 Identities = 45/140 (32%), Positives = 72/140 (51%), Gaps = 2/140 (1%)
 Frame = +1

Query: 46  VRQLYKSTACFLTS--WLILVYTPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCL 219
           V Q YK+T CFL S   ++L+    +FT WG+L G+ WV  G   I+ +R AG+ VA   
Sbjct: 43  VMQSYKTTLCFLMSSPMVMLLGERPRFTHWGILSGVFWVPGGAAGIYGIRKAGLAVAVGT 102

Query: 220 WSGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTH 399
           WS L +  S+ WG ++F E +K+   +  A   + LG++G+    S  +  +   +  + 
Sbjct: 103 WSSLVVLTSFFWGIHVFGERVKSPNGAAGACLTLILGLIGMANFSSKGKPKKKEKDICSK 162

Query: 400 TESSTTIKTNDEELHDSVTP 459
            E+     T D E   + TP
Sbjct: 163 AETLIRDSTRDLESQQTSTP 182


Top