BLASTX nr result

ID: Ephedra29_contig00006959 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra29_contig00006959
         (1023 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ADE75937.1 unknown [Picea sitchensis]                                 309   e-100
OAE20319.1 hypothetical protein AXG93_2338s1080 [Marchantia poly...   207   4e-58
CEM31581.1 unnamed protein product [Vitrella brassicaformis CCMP...   122   2e-28
XP_002295272.1 predicted protein [Thalassiosira pseudonana CCMP1...   120   1e-27
XP_013754871.1 hypothetical protein AMSG_09158 [Thecamonas trahe...   120   1e-27
XP_002180280.1 predicted protein [Phaeodactylum tricornutum CCAP...   116   4e-26
XP_005789382.1 hypothetical protein EMIHUDRAFT_98339 [Emiliania ...   107   1e-22
XP_002185821.1 predicted protein [Phaeodactylum tricornutum CCAP...   106   6e-22
XP_002182733.1 predicted protein [Phaeodactylum tricornutum CCAP...   104   1e-21
OAE34031.1 hypothetical protein AXG93_4142s1160 [Marchantia poly...    94   3e-18
XP_002676812.1 predicted protein [Naegleria gruberi] EFC44068.1 ...    94   9e-18
KOO29593.1 hypothetical protein Ctob_007331 [Chrysochromulina sp...    93   1e-17
ABK23204.1 unknown [Picea sitchensis]                                  89   3e-17
XP_005838126.1 hypothetical protein GUITHDRAFT_66230 [Guillardia...    91   5e-17
XP_002669182.1 predicted protein [Naegleria gruberi] EFC36438.1 ...    91   1e-16
OEU17733.1 hypothetical protein FRACYDRAFT_184267 [Fragilariopsi...    90   1e-16
XP_002182676.1 predicted protein [Phaeodactylum tricornutum CCAP...    87   2e-15
XP_001775344.1 predicted protein [Physcomitrella patens] EDQ5981...    83   3e-14
KOO34993.1 hypothetical protein Ctob_007190 [Chrysochromulina sp...    85   3e-14
EJK61576.1 hypothetical protein THAOC_17910 [Thalassiosira ocean...    83   4e-14

>ADE75937.1 unknown [Picea sitchensis]
          Length = 359

 Score =  309 bits (792), Expect = e-100
 Identities = 161/270 (59%), Positives = 202/270 (74%), Gaps = 6/270 (2%)
 Frame = +1

Query: 130 IASLLFFGSYVFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKSRIR-LF 306
           ++  L+ G  +F +YIWGAY+ KEP+KNHGLS+LAL VMALGM+GVGF+VS K+  + L 
Sbjct: 90  VSQSLWSGLSLFTAYIWGAYVLKEPLKNHGLSILALLVMALGMIGVGFAVSEKTVFQSLL 149

Query: 307 NNFPTHTESSTTIKTNDE----ELHDSVTPLMSNSPTSSYEREVEFTDQD-KNERDLLKG 471
           + +      ST IK   +    ++ DS   L+    T +   E E+ DQ  + E  L+KG
Sbjct: 150 DIWLKLNPCSTKIKDCPQLSCIDVQDSSEALIPCETTKTCGVEEEYADQKYERENKLVKG 209

Query: 472 VLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQ 651
           VL AV VG  NGSFMVPLKYAHKD+VG EYL+SFGIGAMTMT+++ GIY + L+F   P 
Sbjct: 210 VLCAVLVGTLNGSFMVPLKYAHKDVVGAEYLVSFGIGAMTMTIILLGIYMTALAFHGRPL 269

Query: 652 PSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKEV 831
           PSLYIPGA  PAFLAG LWS+GNFFSIYATLYLG+ALGWPLVQCQL+VSAMWAVF+YKEV
Sbjct: 270 PSLYIPGAAGPAFLAGFLWSMGNFFSIYATLYLGVALGWPLVQCQLIVSAMWAVFFYKEV 329

Query: 832 RTKVGAFSLITSSVIVVMGVVMLAQFGSVG 921
            ++ GA  LI SS++VV+G +ML+QFGS+G
Sbjct: 330 TSRTGAALLIGSSIVVVLGAIMLSQFGSIG 359


>OAE20319.1 hypothetical protein AXG93_2338s1080 [Marchantia polymorpha subsp.
            polymorpha]
          Length = 624

 Score =  207 bits (527), Expect = 4e-58
 Identities = 113/247 (45%), Positives = 151/247 (61%), Gaps = 14/247 (5%)
 Frame = +1

Query: 130  IASLLFFGSYVFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSN-------- 285
            +A  L+ G  +F+SYIWGAY+F EPIK+  LS++AL VMA+GM+G+G + S         
Sbjct: 305  VAQSLWSGIGIFVSYIWGAYVFNEPIKHPLLSMMALGVMAVGMMGIGMAASGSLSPGPSR 364

Query: 286  ------KSRIRLFNNFPTHTESSTTIKTNDEELHDSVTPLMSNSPTSSYEREVEFTDQDK 447
                  K + R   +F    E    I+ + E       P+      S Y R+ +F     
Sbjct: 365  VHALSPKGKTRKMFSFKREEEIDL-IQMDGEAKAVLPRPVRDEKGGSKYRRDNKF----- 418

Query: 448  NERDLLKGVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSM 627
                 L+G+L A+FVG+SNGSF++P KYA K+I GVEYL+SFG+GA+T++  +  IYS +
Sbjct: 419  -----LQGILCAIFVGVSNGSFLIPFKYATKEIHGVEYLVSFGVGAVTISSAILTIYSML 473

Query: 628  LSFRKIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMW 807
              +R  P PS +      P  L G+LWS GN  SI AT YLG+ALGWPLVQCQLLVSAMW
Sbjct: 474  QVYRGKPPPSFHFKAVAGPGLLTGVLWSAGNVCSIVATEYLGMALGWPLVQCQLLVSAMW 533

Query: 808  AVFYYKE 828
            AVFYY+E
Sbjct: 534  AVFYYEE 540


>CEM31581.1 unnamed protein product [Vitrella brassicaformis CCMP3155]
          Length = 359

 Score =  122 bits (307), Expect = 2e-28
 Identities = 75/264 (28%), Positives = 134/264 (50%), Gaps = 5/264 (1%)
 Frame = +1

Query: 127 AIASLLFFGSYVFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKSRIRLF 306
           +I   ++ G+ + +SY+WG   F + I N  L+L+ L ++ +G  G+ F    +   RL 
Sbjct: 101 SIGQGVWCGTAIIVSYLWGTLAFGDVITNVPLNLIGLVLLLVGSTGIAFC---QELPRLM 157

Query: 307 NNFPTHTESSTTIKTNDEELHDSVTPLMSNSPTSSYEREVEFTDQDK-----NERDLLKG 471
               T T S TT  +   E            P +   R+ +  D+ +     +    L G
Sbjct: 158 AQ--TFTTSGTTAGSGANEQQMRQEDSNQREPLT---RDTDDADRGRPSSPHHSSSKLVG 212

Query: 472 VLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQ 651
           +  +V  G+  GS +VP  Y  K   G+ +L SF +G +   +LV G+Y ++++ R++  
Sbjct: 213 LGVSVLTGVFGGSILVPKHYVPKSADGIAFLPSFAVGVVAFMVLVMGVYVTVITKRRV-- 270

Query: 652 PSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKEV 831
              ++  A LP  ++G LW+LGN  SI A   LG ++ +PL+QC LL+S +  +  ++EV
Sbjct: 271 -QWHVQAALLPGLISGCLWNLGNICSIAAIPRLGYSVAYPLLQCALLISGLLGIVVFREV 329

Query: 832 RTKVGAFSLITSSVIVVMGVVMLA 903
           R          SS+++++G V+L+
Sbjct: 330 RQPAAVSCFWGSSIVLILGAVLLS 353


>XP_002295272.1 predicted protein [Thalassiosira pseudonana CCMP1335] EED87338.1
           predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 373

 Score =  120 bits (302), Expect = 1e-27
 Identities = 75/260 (28%), Positives = 135/260 (51%), Gaps = 25/260 (9%)
 Frame = +1

Query: 127 AIASLLFFGSYVFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKSRIRLF 306
           A++  +   S V +S+IWG  IF+E +K+  ++  A+C++  G+ G+ F  +++ +    
Sbjct: 117 AVSQGIVSSSIVMVSFIWGDLIFREAVKSELIAYFAVCLIMAGLYGMSFFSTSEEQ---- 172

Query: 307 NNFPTHTESSTTIKTNDEEL----HDSVTPLMSNSPTSSYE--------------REVEF 432
              P HT  S      +E+L    H+S     S++  SS                R +  
Sbjct: 173 ---PEHTSVSDNDNNGEEKLDLMRHESSDSFDSSNDNSSMGPLEISERRKPSIRGRPILI 229

Query: 433 TDQDKNERDLLKGVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFG 612
             +  + R++  G+  A+  G+  GS +VP+ YA  D+ G+ Y++SF +GA+T+T+L++ 
Sbjct: 230 CGKTYSRRNI--GLCSALICGVWGGSCLVPMHYAQGDVKGLAYVISFSVGALTVTVLLWV 287

Query: 613 IYSS--MLSFRKIPQ-----PSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWP 771
              +  ++  + + +     PS ++    LP   AG LWS+GN  SI A  +LG  +G+ 
Sbjct: 288 ARFAYHLVKLKSVWEAYEVLPSFHLRVMLLPGATAGSLWSIGNVGSIVAVKHLGQGVGYS 347

Query: 772 LVQCQLLVSAMWAVFYYKEV 831
             Q  LLVS MW +FY+K++
Sbjct: 348 ASQAALLVSGMWGIFYFKQM 367


>XP_013754871.1 hypothetical protein AMSG_09158 [Thecamonas trahens ATCC 50062]
           KNC52982.1 hypothetical protein AMSG_09158 [Thecamonas
           trahens ATCC 50062]
          Length = 358

 Score =  120 bits (301), Expect = 1e-27
 Identities = 79/269 (29%), Positives = 137/269 (50%), Gaps = 7/269 (2%)
 Frame = +1

Query: 127 AIASLLFFGSYVFISYIWGAYIFKEPIKNHGLSLLALCVMALGM-LGVGFSVSNKSRIRL 303
           ++A  ++ G+ + IS++WGA  + + + +  L+++AL  +  G+ L      S  + +  
Sbjct: 94  SVAQTVWSGTTIMISFLWGALAYHDKVSSLPLAMVALATLLCGLVLLASLHSSLMAGVHA 153

Query: 304 FNNFPTHTESSTTIKTNDEELHDSVTPLMSNSPTSSYEREVEFTDQDKNERD------LL 465
           +  F    ES    +  D      +    + +   S  R ++         D      L+
Sbjct: 154 WLGFAVDDESLRPHRPGDVRAAAPLVAAEAPAVNVSGGRGLDDAYASAGGGDGKPLAKLV 213

Query: 466 KGVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFRKI 645
           +GV+ A+ +G+ NGS MVPL+   K   G+ Y++SF IG +++T L+   Y +++   + 
Sbjct: 214 RGVVFALVLGVLNGSLMVPLRETPKRASGINYIISFSIGVVSITPLLALGYMAIM---RA 270

Query: 646 PQPSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYK 825
           P   L+   A  P  L GLLW +GN+ SIYATLYLGL +G+PL Q  LLV+ +W   YY 
Sbjct: 271 PL-QLHWRVALGPGLLTGLLWQIGNYCSIYATLYLGLTIGYPLTQLALLVAGLWGWLYYG 329

Query: 826 EVRTKVGAFSLITSSVIVVMGVVMLAQFG 912
           E+           ++++V+ G  +LA  G
Sbjct: 330 ELPQARHVAHFWLAAIVVLGGAALLAIAG 358


>XP_002180280.1 predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
           EEC48471.1 predicted protein [Phaeodactylum tricornutum
           CCAP 1055/1]
          Length = 346

 Score =  116 bits (290), Expect = 4e-26
 Identities = 90/289 (31%), Positives = 142/289 (49%), Gaps = 15/289 (5%)
 Frame = +1

Query: 85  IQPAIMSFLPGVFAAIASLLFFGSYVFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLG 264
           +Q A M+   G+++++         V +++ WG  IF EP+ +   + LA+ ++ +G+ G
Sbjct: 85  VQNAGMAVSQGIWSSLK--------VLVAFCWGILIFHEPVHSKLGTTLAIALLMVGLAG 136

Query: 265 VGFSVSNKSRIRLFNNFPTHTESSTTIKTNDEELHDSVTPLMSNSPTSSYEREVEFTDQD 444
           V    + +                T+  +  EE      PL+   P    + + E  D +
Sbjct: 137 VSIFAAPR----------------TSTSSPQEE------PLL---PDVEEQNQPEIVD-N 170

Query: 445 KNERDLLK----GVLGAVFVGISNGSFMVPLKYAH-KDIVGVEYLLSFGIGAMTMTMLVF 609
           K+    LK    G+LGAV  G   GS +VP+ YA  K   G+ Y++SF IG  ++  +V+
Sbjct: 171 KDYLGFLKRRHVGLLGAVIDGAYGGSVLVPMHYAGPKTTNGLSYVMSFAIGCSSVVTMVW 230

Query: 610 GIYSSMLSFRKIPQPSLYIPGATLP----------AFLAGLLWSLGNFFSIYATLYLGLA 759
            +    L F  +   SL +    LP          A LAGL+WSLGN  SI     LG  
Sbjct: 231 VL---RLLFNSVQGQSLRVGYDRLPSLHVTTIGPYAALAGLIWSLGNVSSILTVALLGEG 287

Query: 760 LGWPLVQCQLLVSAMWAVFYYKEVRTKVGAFSLITSSVIVVMGVVMLAQ 906
           +G+ +VQ QLLV+ +W VF+YKE+R      S  T +VI V G+VML++
Sbjct: 288 VGYSIVQSQLLVAGLWGVFWYKEIRGMRAIASWFTFAVITVAGIVMLSR 336


>XP_005789382.1 hypothetical protein EMIHUDRAFT_98339 [Emiliania huxleyi CCMP1516]
           EOD36953.1 hypothetical protein EMIHUDRAFT_98339
           [Emiliania huxleyi CCMP1516]
          Length = 358

 Score =  107 bits (266), Expect = 1e-22
 Identities = 71/275 (25%), Positives = 124/275 (45%), Gaps = 7/275 (2%)
 Frame = +1

Query: 109 LPGVFAAIASLLFFGSYVFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNK 288
           + G+   +A   +  + + +S++WG  +F E   +   ++ ALC +A G+ GV  + S  
Sbjct: 116 IDGIGYGVAVATWGSTTMIVSFLWGTLVFAERPSSVTGAVAALCTLAAGVAGVATAQSGS 175

Query: 289 SRIRLFNNFPTHTESSTTIKTNDEELHDSVTPLMSNSPTSSYEREVEFTDQDKNERDLLK 468
                    P   E++     N  E         + +                       
Sbjct: 176 LG-------PPEAEAAAEAFLNPAEGRVGGAAARAGA----------------------- 205

Query: 469 GVLGAVFVGISNGSFMVPLKYAHKD-------IVGVEYLLSFGIGAMTMTMLVFGIYSSM 627
           G LGA+  G+ NGS MVP  Y  ++        VG+ Y+ +F  G   +  + F +Y+ +
Sbjct: 206 GWLGALGCGLLNGSLMVPFHYFSEERSGQDGASVGMGYIATFATGVAAVQPIFFLLYARV 265

Query: 628 LSFRKIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMW 807
             FR  PQP L      LP  + G+ W++GNF S +ATL+LG A+G+PL Q  ++V+ +W
Sbjct: 266 -PFR--PQPPLLCSELALPGLITGVFWAIGNFESTFATLHLGQAVGYPLTQTCIVVAGLW 322

Query: 808 AVFYYKEVRTKVGAFSLITSSVIVVMGVVMLAQFG 912
              ++ E+R          S ++++ G V+L  +G
Sbjct: 323 GALFFGEIRGAPSLLLFSVSVLVIIGGAVLLGMYG 357


>XP_002185821.1 predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
           ACI65291.1 predicted protein [Phaeodactylum tricornutum
           CCAP 1055/1]
          Length = 451

 Score =  106 bits (264), Expect = 6e-22
 Identities = 53/151 (35%), Positives = 87/151 (57%), Gaps = 7/151 (4%)
 Frame = +1

Query: 469 GVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMT----MTMLVFG---IYSSM 627
           G++ A+F G+  GS M P+K+   D  G  +LLSF IGA      M ++ +G   ++   
Sbjct: 294 GMVAAMFCGVWGGSIMAPMKFCQSDTKGTHFLLSFSIGASIVNTGMWLVRYGYNVLHYQS 353

Query: 628 LSFRKIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMW 807
            S      PS ++    L   L+G+LWS+GNFFS+ +  YLG  +G+PLVQ  ++VS +W
Sbjct: 354 CSKAYASLPSFHLHTMWLAGGLSGMLWSIGNFFSLISVFYLGQGVGYPLVQTSIIVSGLW 413

Query: 808 AVFYYKEVRTKVGAFSLITSSVIVVMGVVML 900
            +FY+KE+         + SS++ + G+++L
Sbjct: 414 GIFYFKEITGFERISKWLASSLLTIFGILLL 444


>XP_002182733.1 predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
           EEC46020.1 predicted protein [Phaeodactylum tricornutum
           CCAP 1055/1]
          Length = 404

 Score =  104 bits (260), Expect = 1e-21
 Identities = 77/300 (25%), Positives = 144/300 (48%), Gaps = 41/300 (13%)
 Frame = +1

Query: 127 AIASLLFFGSYVFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKSRIRLF 306
           AIA   +    + I+++ G  +F+EP+ +   +L A  ++ALG++G+    S  S  +  
Sbjct: 97  AIAVGTWASVMIVINFLVGIVLFQEPVSDMFATLGAFLLLALGLVGM----SLYSTPQPV 152

Query: 307 NNFPTHTESSTTIKTNDEELHDSVTPLMSNSPTSSYEREVEFTD-QDKNE---------- 453
           +  P+ TE +  I  N  E+ +    L+    TSSY  +++  D Q +NE          
Sbjct: 153 DQLPS-TEMTENIGPNQNEVEEIDRALIVKR-TSSYTGKIDHRDIQRRNEESGSYGSSAD 210

Query: 454 ---------------RDLLKGVLGAVFVGISNGSFMVPLKYAH-KDIVGVEYLLSFGIGA 585
                          R    G+ GA+F G+  GS ++PL YA  +   G  Y++S+  GA
Sbjct: 211 ADEPLFTIPDGTKRKRSGPTGICGAIFNGVMTGSSLIPLHYAKTQGYGGANYMISYASGA 270

Query: 586 MTMTMLVFGIYSSMLSFRKIPQ--------------PSLYIPGATLPAFLAGLLWSLGNF 723
           + M  L++G++ +   ++ + Q              P+ +     LP F +G+L ++  F
Sbjct: 271 IVMNCLIWGVFFAYTCYQTVQQDLNVPVLLHTFQVMPAWHFRKLWLPGFTSGVLLTIAMF 330

Query: 724 FSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKEVRTKVGAFSLITSSVIVVMGVVMLA 903
            SI +  YLG  +G  +VQ ++L+S +W +F+++E+R          S+ + V G++ L+
Sbjct: 331 GSILSVTYLGQGIGNSIVQAKILISGLWGIFWFREIRGMYIVTKWFLSASLTVAGILWLS 390


>OAE34031.1 hypothetical protein AXG93_4142s1160 [Marchantia polymorpha subsp.
           polymorpha]
          Length = 345

 Score = 94.4 bits (233), Expect = 3e-18
 Identities = 65/260 (25%), Positives = 129/260 (49%), Gaps = 1/260 (0%)
 Frame = +1

Query: 127 AIASLLFFGSYVFISYIWGAYIFKE-PIKNHGLSLLALCVMALGMLGVGFSVSNKSRIRL 303
           ++ + ++ G+ V +S++ G        +K+   +L+A+ ++ +G++GV ++  +  R+  
Sbjct: 117 SVGTGIWCGTAVLVSFVAGLVFDPNGALKSKLWALVAIIIIMIGIVGVAYA-GHVGRVAA 175

Query: 304 FNNFPTHTESSTTIKTNDEELHDSVTPLMSNSPTSSYEREVEFTDQDKNERDLLKGVLGA 483
                 H E+  T+  +               P +  +R   F+           GV  A
Sbjct: 176 -----GHEENLDTLLQH---------------PVTGEQRSGTFST----------GVFTA 205

Query: 484 VFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQPSLY 663
           +F G++ G  M+PL  A  +  G+ YL SF IG      +V  I       R++P  +L 
Sbjct: 206 IFAGLAGGLIMIPLTRASAEAQGIPYLPSFAIGVAIFAPIVTAIPYVTRVDREMP--NLA 263

Query: 664 IPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKEVRTKV 843
              A LP  ++G++W++GN FSI A  Y+  ++ +P++QC +LV+ +W +  ++E++   
Sbjct: 264 PAAAALPGIVSGIVWNIGNVFSILAIGYISYSIAYPIMQCGILVAGLWGMLLFEEIQGSS 323

Query: 844 GAFSLITSSVIVVMGVVMLA 903
                I+ ++I+V GV +LA
Sbjct: 324 STAYWISGAIIIV-GVAILA 342


>XP_002676812.1 predicted protein [Naegleria gruberi] EFC44068.1 predicted protein
           [Naegleria gruberi]
          Length = 383

 Score = 93.6 bits (231), Expect = 9e-18
 Identities = 72/299 (24%), Positives = 135/299 (45%), Gaps = 36/299 (12%)
 Frame = +1

Query: 124 AAIASLLFFGSYVFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKSRIRL 303
           AA+A   + G  +  ++IWG  +F+  I N  L++  L +M +G+ G   + S  +    
Sbjct: 81  AAVAQSTWAGCVIITNFIWGVTLFQSKIGNIYLTVFGLVIMIIGIFGTA-TCSKWNNPEP 139

Query: 304 FNNFPTHTESSTTIKTNDEELHDSVTPLMS---------------------NSPTSSYE- 417
                + T  + +++ + +E +   TPL                       N PT   E 
Sbjct: 140 VAEKQSETSINASVEESGQENNTETTPLYQQENSTNQQENISSDVPIYPSVNDPTLYSEL 199

Query: 418 ------------REVEFTDQDKNERDLLKGVLGAVFVGISNGSFMVPLKYAHKDIVGVEY 561
                       +  +F    KN +    G++ +V  GI+ GS  VP +    +  G+ Y
Sbjct: 200 SEIESTIGVYETKSQKFIKILKNSKRYFIGLVASVLCGITGGSMFVPSRL--DEDTGLVY 257

Query: 562 LLSFGIGAMTMTMLVFGIYSSMLSFRKIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYAT 741
           +++FGIG+  +T  +  +Y      R   +   ++  +  PA L   LW  GNFF+ Y +
Sbjct: 258 MVAFGIGSFVITTAILIVYYVYYLIRFKKRVPFHLKLSIFPA-LTAFLWQTGNFFAYYVS 316

Query: 742 LY-LGLALGWPLVQCQLLVSAMWAVFYYKEVR-TKVGAFSLITSSVIVVMGVVMLAQFG 912
           +  LGL +G PL +  ++++ +  + +++E+R  K      I+  V++V G ++LA FG
Sbjct: 317 VSPLGLTIGMPLTETAMVITGICGLVFFRELRGWKAILQFFISVLVLLVPGCILLALFG 375


>KOO29593.1 hypothetical protein Ctob_007331 [Chrysochromulina sp. CCMP291]
          Length = 405

 Score = 93.2 bits (230), Expect = 1e-17
 Identities = 80/286 (27%), Positives = 131/286 (45%), Gaps = 23/286 (8%)
 Frame = +1

Query: 124 AAIASLLFFGSYVFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKSRIRL 303
           AA+   ++ G  +  S++WG ++F EP K   L+++AL VM +G +G G + S     R 
Sbjct: 124 AAVGPGIWCGVGMLTSFLWGVFVFHEPFKTPALAVMAL-VMLVGGVG-GVASSQVLNHRD 181

Query: 304 FNNFPTHTESSTTIKTNDEELHDSVTPLM---SNSPTSSYEREVEFTDQDKNERDLLKGV 474
            +N P        +    EEL  S +  +   S SP    ER  E          L+ GV
Sbjct: 182 ASNQPPPPSQPPPLSPAAEELDGSGSSAVVSASASPALPLERPSEPLSA------LVLGV 235

Query: 475 LGAVFVGISNGSFMVP-------LKYAHKDIVGVEYLLSFGIGA-------MTMTMLVFG 612
             A+  G+ +GS M P       L  A  D V + YL  F +         + + +LV  
Sbjct: 236 ACALGTGLLDGSLMAPFSSYKTSLGLAPGDQVALRYLGGFALALPVVALLPLLLALLVEH 295

Query: 613 IYSSMLSFRKI-----PQPSLYIP-GATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPL 774
           +     S R++       P+ ++     L     G LW+ GN  S++A++ LG A+G+PL
Sbjct: 296 LRQGARSTRRLLPLDGRSPARFLNLSCALSGMSCGALWAAGNVLSVHASMRLGQAVGFPL 355

Query: 775 VQCQLLVSAMWAVFYYKEVRTKVGAFSLITSSVIVVMGVVMLAQFG 912
            Q  +++SA+W + ++ E+          +SSV+V+ G V L   G
Sbjct: 356 TQVCVVISALWGILFFGELPQPRARALFASSSVVVLAGAVALKASG 401


>ABK23204.1 unknown [Picea sitchensis]
          Length = 196

 Score = 88.6 bits (218), Expect = 3e-17
 Identities = 45/148 (30%), Positives = 82/148 (55%)
 Frame = +1

Query: 460 LLKGVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFR 639
           L +GV+ A+  GI  G  M+P+  +   I GV YL SF IG      +V  I    LS +
Sbjct: 51  LSQGVIAALLTGILGGLIMMPMTQSPPAIQGVSYLPSFAIGVAIFAPVVTAI--PYLSTQ 108

Query: 640 KIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFY 819
           + P+  LY+    LP  ++G++W++GN  S+ A   +G  + +P++ C + ++ +W +F 
Sbjct: 109 ECPRMELYV--GALPGIISGIVWNIGNILSMLAIGIIGYTIAYPILYCGIFIAGLWGMFL 166

Query: 820 YKEVRTKVGAFSLITSSVIVVMGVVMLA 903
           +KE+R    A     S  +++ G+++L+
Sbjct: 167 FKEIRGNAAAL-YWGSGFLILTGIILLS 193


>XP_005838126.1 hypothetical protein GUITHDRAFT_66230 [Guillardia theta CCMP2712]
           EKX51146.1 hypothetical protein GUITHDRAFT_66230
           [Guillardia theta CCMP2712]
          Length = 341

 Score = 90.9 bits (224), Expect = 5e-17
 Identities = 69/280 (24%), Positives = 126/280 (45%), Gaps = 21/280 (7%)
 Frame = +1

Query: 124 AAIASLLFFGSYVFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKSRIRL 303
           AA    ++ G  + ++++WG  +F+E +++  L ++AL ++  G++G+    S+  + RL
Sbjct: 88  AAAGPGIWCGVGMSVAFMWGTIVFQEAVRSLALCIVALILLFFGIVGISLVQSSMLQ-RL 146

Query: 304 FNNFPTHTESSTTIKTNDEELHDSVTPLMSNSPTSSYEREVEFTDQDKNERDLLKGVLGA 483
                   ES  T   ++EE + +                            +  GVL A
Sbjct: 147 LG------ESGATGLMSEEESNKT------------------------GRARIAVGVLLA 176

Query: 484 VFVGISNGSFMVPLKY-------------------AHKDIVGVEYLLSFGIGAMTMT--M 600
           +  G+ +GS M P K                    +  D+V  EYL SF +    +    
Sbjct: 177 LMTGLFDGSLMAPFKAYLASHPSLVSSSSSSSSSSSSSDVVVFEYLGSFALALPVVAGGS 236

Query: 601 LVFGIYSSMLSFRKIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQ 780
           LV  ++    +    P  S +   A  P F AG+LW++GN  S++ATL LG ++G+P+ Q
Sbjct: 237 LVLIMFYQHRALNSGPDRSSFRQAA-YPGFCAGVLWAVGNVLSVHATLELGQSIGFPMTQ 295

Query: 781 CQLLVSAMWAVFYYKEVRTKVGAFSLITSSVIVVMGVVML 900
             +++SA+W +  +KE+  +      + SS +V  G  +L
Sbjct: 296 SCVVISALWGIVVFKEMTARTPLLLFLLSSTLVAAGASLL 335


>XP_002669182.1 predicted protein [Naegleria gruberi] EFC36438.1 predicted protein
           [Naegleria gruberi]
          Length = 425

 Score = 90.9 bits (224), Expect = 1e-16
 Identities = 69/287 (24%), Positives = 130/287 (45%), Gaps = 26/287 (9%)
 Frame = +1

Query: 130 IASLLFFGSYVFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVG----------FSV 279
           +A  ++ G  +  S+ WG  +F   I N  L+ LAL +M +G++G+              
Sbjct: 141 VAQGVWSGVNIITSFTWGVALFHSEIGNPYLTALALILMVVGIVGIATCSKWNLPELLPA 200

Query: 280 SNKSRIRLFNNFPTHTES-----------STTIKTNDEELHDSV--TPLMSNSPTSSYER 420
           S+     L N   TH +            +  ++ N++ +  ++  T      PT    R
Sbjct: 201 SSTETKSLVNETVTHYDGNEENPEAPNTFNPEVQNNEQAVEQTIETTQEEEEYPTQPLSR 260

Query: 421 EVEFTDQDKNERDLLKGVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTM 600
           + +     K+ ++ + G+  +V VG+  GS  VP ++  K   GV Y++ FG G+  +T 
Sbjct: 261 KEKIVSILKSSKNYILGLACSVGVGVLGGSQFVPSRFEEKP--GVVYVVGFGFGSAGITS 318

Query: 601 LVFGIYSSMLSFR-KIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATL-YLGLALGWPL 774
            +  IY      R ++  P  + P   +   +   LW +GN  + Y ++  LG  +G PL
Sbjct: 319 AILVIYYIYYIIRYRVVLP--FHPKVAVFPCITACLWQVGNVMATYVSMSSLGFTIGLPL 376

Query: 775 VQCQLLVSAMWAVFYYKEVRTKVGAFSLITSS-VIVVMGVVMLAQFG 912
            Q  L+V+ +  + ++KE+R          S+ V ++ G ++L+ FG
Sbjct: 377 TQASLVVAGICGLLFFKELRGWKAILQFFVSALVFLIPGCILLSLFG 423


>OEU17733.1 hypothetical protein FRACYDRAFT_184267 [Fragilariopsis cylindrus
           CCMP1102]
          Length = 377

 Score = 90.1 bits (222), Expect = 1e-16
 Identities = 71/269 (26%), Positives = 122/269 (45%), Gaps = 21/269 (7%)
 Frame = +1

Query: 160 VFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKSRIRLFNNFPT---HTE 330
           V +S+ WG + F E + +   +  A+  M  G+ G+ +  S           PT   H  
Sbjct: 117 VLVSFSWGIFFFDEHVHSRVQACSAVACMLCGLGGMAYYSS-----------PTVAHHRH 165

Query: 331 SSTTIKTNDE-------ELHDSVTPLMSNSPTSSYEREVEFTDQDKNERDLLKGVLGAVF 489
           +S+T +           + +  + P   +      E +  F      ER L  G+L A+F
Sbjct: 166 ASSTAEDGGGGGGGGGGDYYQPIRPDDDDFLPPDDEDD-SFGYIRIQERTL--GILAALF 222

Query: 490 V-GISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQPSLYI 666
           + G+  GS MVP+++A     G+ +L SF IGA  +T+ ++ I    +   K        
Sbjct: 223 ITGLWGGSMMVPMEFAPNIDKGLPFLTSFAIGATIVTICLWTIRYIYIVIFKTNNYCFGE 282

Query: 667 PGATLPAF----------LAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVF 816
               LP+F            GLLWS+GNF SI +  +LG  +G+   Q  +L+S +W +F
Sbjct: 283 AYNALPSFHFKQMWPYGMTCGLLWSIGNFCSILSVEFLGEGVGYSSTQASMLISGLWGIF 342

Query: 817 YYKEVRTKVGAFSLITSSVIVVMGVVMLA 903
           Y+ E+      F    S+ + V+G+++L+
Sbjct: 343 YFNEIEGSNAIFKWYLSASVTVLGILLLS 371


>XP_002182676.1 predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
           EEC45963.1 predicted protein [Phaeodactylum tricornutum
           CCAP 1055/1]
          Length = 413

 Score = 87.4 bits (215), Expect = 2e-15
 Identities = 73/300 (24%), Positives = 128/300 (42%), Gaps = 52/300 (17%)
 Frame = +1

Query: 160 VFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKSRI-------------- 297
           V  S+ +G  +F+E +K+   + LA   + +G++G+    +++ ++              
Sbjct: 108 VLTSFFFGIIVFQERVKSFYQTCLAFGCLIIGLIGMSRFSAHQQQVDTLAVSYRSVKTAA 167

Query: 298 --------RLFNNFPTHTESSTTIK----------------TNDEEL----HDSVTPLMS 393
                   +L     T  E+S T+                 T+ E++    +D    ++S
Sbjct: 168 SHPLGLGQKLKRAGSTIAENSITVPLVGASGVIPMEIEPFATDGEDIVMGTYDDAKSVLS 227

Query: 394 NSPTSSYEREVEFTDQDKNERDLLKGVLGAVFVGISNGSFMVPLKYA--HKDIVGVEYLL 567
                 +   V  T +         G+LGAV  G   G  ++PL +A   +D+ G  YL+
Sbjct: 228 KDRLVLFGGRVSLTRRQM-------GILGAVINGAWGGMNLIPLHFALQEEDMTGAGYLI 280

Query: 568 SFGIGAMTMTMLVF----GIYSSMLSFRKIPQ----PSLYIPGATLPAFLAGLLWSLGNF 723
           S+  G++ +   ++    G Y    +          P  +     +P  +AGLL+S GNF
Sbjct: 281 SYATGSLIVNTCIWLAFLGYYLHQTNGHWNEAVDCLPKWHFEHLLIPGLMAGLLYSFGNF 340

Query: 724 FSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKEVRTKVGAFSLITSSVIVVMGVVMLA 903
            SI A  YLG   G+   Q QL VS +W VF++KEV+          S+ + V+G+V LA
Sbjct: 341 CSILAVTYLGQGTGFSFCQMQLFVSGLWGVFFFKEVQGTDTITKWFISASVAVLGIVWLA 400


>XP_001775344.1 predicted protein [Physcomitrella patens] EDQ59814.1 predicted
           protein [Physcomitrella patens]
          Length = 344

 Score = 83.2 bits (204), Expect = 3e-14
 Identities = 76/313 (24%), Positives = 137/313 (43%), Gaps = 16/313 (5%)
 Frame = +1

Query: 13  IFPVTSTCSFSIITKACMSRNFAKIQPAIMSFLPGVFAAIASLLFFGSYVFISYIWGAY- 189
           I    +  SF++++K    R  A++ P I +F   +   ++SLL    Y F+  + G   
Sbjct: 40  ILSAVANGSFAVLSKTRSIRR-ARVSPLIFNFWACLGVLLSSLLLLFKYKFVFALEGLLS 98

Query: 190 ----------IFKEPIKNHGLSLL-ALCVMALGMLGVGFSVSNKSRIRLFNNFPTHTESS 336
                     IF+  ++  G+S+   +      ++GV +S       + F     HT+  
Sbjct: 99  GVFFVLSFINIFRA-VRLLGVSVAYGIWAGTAAIVGVAWSGQMSWEPQDFYEDDDHTQ-- 155

Query: 337 TTIKTNDEEL----HDSVTPLMSNSPTSSYEREVEFTDQDKNERDLLKGVLGAVFVGISN 504
           T I++         H  +  +   S   S ER       +   R    GV  AV  GI  
Sbjct: 156 TLIQSQPSFAGWVQHRKLWDVAGQS--KSGERPKNVLTGEPASRSFPAGVFSAVLAGILG 213

Query: 505 GSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQPSLYIPGATLP 684
           G  M+P   A     G  +L SFGI     T +V    +S+        P L    A  P
Sbjct: 214 GLVMIPANQAPDMAQGNAFLPSFGIAVAIFTPIV----TSLPYLSGCELPDLSAREAAGP 269

Query: 685 AFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKEVRTKVGAFSLIT 864
             L+G ++++GN  +I A  Y+G ++ +PL QC ++V+ +W + Y++E        +L  
Sbjct: 270 GILSGFIYNIGNMLNIVAIFYVGSSVAYPLFQCGIIVAGIWGMLYFEESHGN-ALITLWA 328

Query: 865 SSVIVVMGVVMLA 903
           + V++++G+++L+
Sbjct: 329 ADVVLLVGIILLS 341


>KOO34993.1 hypothetical protein Ctob_007190 [Chrysochromulina sp. CCMP291]
          Length = 1567

 Score = 84.7 bits (208), Expect = 3e-14
 Identities = 66/269 (24%), Positives = 125/269 (46%), Gaps = 15/269 (5%)
 Frame = +1

Query: 151  GSYVFISYIWGAY---IFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKSRIRLFNNFPT 321
            G+ + +S++WG         P+++  LSL+A+ ++ +G+LG+  SV     +        
Sbjct: 948  GTAIVVSFLWGTLGPEPICAPVQSVALSLVAVGLLLIGVLGIVKSVPIGDALGRLCEARA 1007

Query: 322  HTES--STTIKTNDEELHDSVTPLMSNSPT-----SSYEREVEFTDQDKNERDLLKGVLG 480
             T +  S  ++ ++  +      + S+ P       + +          + R +  G+  
Sbjct: 1008 RTSAVDSAAVRLHESVVASDDPVVASDDPAVGGGAPAVDGNAAAAASAASTRAV--GLAA 1065

Query: 481  AVFVGISNGSFMVPLKYAHKDIVG---VEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQ 651
            A+ VG+  GS +VP  +A +   G   +  L SFGIGA++M +L  G + + L  R    
Sbjct: 1066 ALMVGLFGGSVLVPASFAGEAFSGQKAIALLPSFGIGALSMALLTTGSWYARLLSRG-EA 1124

Query: 652  PSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALG--WPLVQCQLLVSAMWAVFYYK 825
            P L++  A     L+G  W+ GN  SI A  Y  +  G  +PL+Q  L+   +  +F + 
Sbjct: 1125 PPLHLRAALWAGCLSGATWNAGNLCSIVAINYYSVPFGVAYPLLQASLVFGGLLGIFVFG 1184

Query: 826  EVRTKVGAFSLITSSVIVVMGVVMLAQFG 912
            E++ +        S+ +V+ G V+L  +G
Sbjct: 1185 ELKKRKAIAVFFASAALVLGGAVLLGMYG 1213


>EJK61576.1 hypothetical protein THAOC_17910 [Thalassiosira oceanica]
          Length = 360

 Score = 82.8 bits (203), Expect = 4e-14
 Identities = 69/270 (25%), Positives = 122/270 (45%), Gaps = 32/270 (11%)
 Frame = +1

Query: 115 GVFA------AIASLLFFGSYVFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFS 276
           GVFA      AI++ L+    + +SY+WG  IF E  ++   ++ A+ +M +G++G    
Sbjct: 100 GVFAVRRAGLAISTGLWSCVIILLSYLWGVLIFHEKQESAVGAVGAVLLMCVGLIG---- 155

Query: 277 VSNKSRIRLFNNFPTHTESSTTIKTNDEELHDSVTPLMSNSPTSSYEREV-EFTDQDKNE 453
           +++ S I +         +  +++       D  TPL   +  +  + ++ + T Q    
Sbjct: 156 IAHFSSIEVRPGLDQARAAPRSVEECRPACSDETTPLNGINRANDAQFDLAKLTSQ---- 211

Query: 454 RDLLKGVLGAVFVGISNGSFMVPLKYAHKDIV-GVEYLLSFGIGAMTMTMLVFGIYSSML 630
              L G+  AV  G+   S M+PL YA  +   G+ Y +SFGI A+ +  + + I    L
Sbjct: 212 ---LPGLFAAVLNGLFAASIMLPLHYAPPNTTKGIGYSMSFGIAAVVVVFIFWTIRLLAL 268

Query: 631 SFRKIPQ------------------------PSLYIPGATLPAFLAGLLWSLGNFFSIYA 738
           +  +                           PS +      P F AGLL+S GN F I +
Sbjct: 269 TAAEFAAKQNEAKRITPNIIRESLREGYSQLPSFHFSEMWRPGFTAGLLYSGGNLFGIVS 328

Query: 739 TLYLGLALGWPLVQCQLLVSAMWAVFYYKE 828
             +LG  +G+ L Q  +++S  W +F+Y+E
Sbjct: 329 IQHLGNFMGYSLNQSSMIISGCWGLFWYRE 358


Top