BLASTX nr result
ID: Ephedra25_contig00019491
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00019491
         (1048 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|ADE75937.1| unknown [Picea sitchensis]                             424   e-116
ref|XP_002295272.1| predicted protein [Thalassiosira pseudonana ...   175   3e-41
ref|XP_002180280.1| predicted protein [Phaeodactylum tricornutum...   166   1e-38
ref|XP_002182733.1| predicted protein [Phaeodactylum tricornutum...   148   3e-33
ref|XP_005789382.1| hypothetical protein EMIHUDRAFT_98339 [Emili...   147   7e-33
ref|XP_002669182.1| predicted protein [Naegleria gruberi] gi|284...   145   3e-32
ref|XP_002182676.1| predicted protein [Phaeodactylum tricornutum...   136   2e-29
ref|XP_002676812.1| predicted protein [Naegleria gruberi] gi|284...   135   2e-29
gb|EJK61576.1| hypothetical protein THAOC_17910 [Thalassiosira o...   119   2e-24
ref|XP_005838126.1| hypothetical protein GUITHDRAFT_66230 [Guill...   110   7e-22
ref|XP_002185821.1| predicted protein [Phaeodactylum tricornutum...   107   8e-21
ref|XP_001775344.1| predicted protein [Physcomitrella patens] gi...    96   3e-17
ref|XP_003294072.1| hypothetical protein DICPUDRAFT_90514 [Dicty...    94   9e-17
ref|XP_641212.1| transmembrane protein 144 A [Dictyostelium disc...    93   2e-16
ref|XP_005758882.1| hypothetical protein EMIHUDRAFT_121297 [Emil...    91   1e-15
ref|XP_004350600.1| transmembrane protein [Dictyostelium fascicu...    91   1e-15
ref|XP_003293389.1| hypothetical protein DICPUDRAFT_50939 [Dicty...    89   2e-15
gb|EJK47010.1| hypothetical protein THAOC_34300, partial [Thalas...    89   3e-15
gb|ABK23204.1| unknown [Picea sitchensis]                              87   8e-15
gb|EJK73333.1| hypothetical protein THAOC_05052 [Thalassiosira o...    80   2e-12

>gb|ADE75937.1| unknown [Picea sitchensis]
          Length = 359

 Score = 424 bits (1090), Expect = e-116
 Identities = 210/324 (64%), Positives = 254/324 (78%), Gaps = 6/324 (1%)
 Frame = -1

Query: 1006 FVFQLYKSTACFLTSWLILVYIPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 827
            FVFQ YKST CFLTSWL+L+Y P+KFTWWG+LG LIWV NG++AI +VRWAGIGV+Q LW
Sbjct: 36   FVFQSYKSTTCFLTSWLVLLYTPFKFTWWGILGALIWVTNGVLAIVAVRWAGIGVSQSLW 95

Query: 826  SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIQ-LFNNFPTH 650
            SGLSLF +YIWGAY+ KEP+KNHGLS+LAL VMALGM+GVGF+VS K Q L + +
Sbjct: 96   SGLSLFTAYIWGAYVLKEPLKNHGLSILALLVMALGMIGVGFAVSEKTVFQSLLDIWLKL 155

Query: 649  TESSTTIKTNDE----ELHDSVTPLMSNSPTSSCEREVEFTDQD-KNERDLLKGVLGAVF 485
            ST IK + ++ DS L+ T +C E E+ DQ + E L+KGVL AV
Sbjct: 156  NPCSTKIKDCPQLSCIDVQDSSEALIPCETTKTCGVEEEYADQKYERENKLVKGVLCAVL 215

Query: 484  VGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFGMYSSMLSFRKIPQPSLYIP 305
            VG NGSFMVPLKYAHKD+VG EYL+SFGIGAMTMT+++ G+Y + L+F P PSLYIP
Sbjct: 216  VGTLNGSFMVPLKYAHKDVVGAEYLVSFGIGAMTMTIILLGIYMTALAFHGRPLPSLYIP 275

Query: 304  GATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKEVRTKVGA 125
            GA PAFLAG LWS+GNFFSIYATLYLG+ALGWPLVQCQL+VSAMWAVF+YKEV ++ GA
Sbjct: 276  GAAGPAFLAGFLWSMGNFFSIYATLYLGVALGWPLVQCQLIVSAMWAVFFYKEVTSRTGA 335

Query: 124  FSLITSSVIVVMGVVMLAQFGSVG 53
            LI SS++VV+G +ML+QFGS+G
Sbjct: 336  ALLIGSSIVVVLGAIMLSQFGSIG 359

>ref|XP_002295272.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220968995|gb|EED87338.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 373

 Score = 175 bits (443), Expect = 3e-41
 Identities = 103/313 (32%), Positives = 169/313 (53%), Gaps = 26/313 (8%)
 Frame = -1

Query: 1003 VFQLYKSTACFLTSWLILVY-IPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 827
            V Q YKS CFL+SWL+L+ + FTWWG++ GL WV G IF++R AG+ V+Q +
Sbjct: 64   VMQSYKSLMCFLSSWLVLLCGQEFTFTWWGIVSGLFWVPAGAFNIFAIRNAGLAVSQGIV 123

Query: 826  SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIQLFNNFPTHT 647
            S + +S+IWG IF+E +K+ ++ A+C++ G+ G+ F +++ + P HT
Sbjct: 124  SSSIVMVSFIWGDLIFREAVKSELIAYFAVCLIMAGLYGMSFFSTSEEQ-------PEHT 176

Query: 646  ESSTTIKTNDEEL----HDSVTPLMSNSPTSSC--------------EREVEFTDQDKNE 521
            S +E+L H+S S++ SS R + + +
Sbjct: 177  SVSDNDNNGEEKLDLMRHESSDSFDSSNDNSSMGPLEISERRKPSIRGRPILICGKTYSR 236

Query: 520  RDLLKGVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTML--VFGMYSSM 347
            R++ G+ A+ G+ GS +VP+ YA D+ G+ Y++SF +GA+T+T+L V +
Sbjct: 237  RNI--GLCSALICGVWGGSCLVPMHYAQGDVKGLAYVISFSVGALTVTVLLWVARFAYHL 294

Query: 346  LSFRKIPQ-----PSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLL 182
            + + + + + PS ++ LP AG LWS+GN SI A +LG +G+ Q LL
Sbjct: 295  VKLKSVWEAYEVLPSFHLRVMLLPGATAGSLWSIGNVGSIVAVKHLGQGVGYSASQAALL 354

Query: 181  VSAMWAVFYYKEV 143
            VS MW +FY+K++
Sbjct: 355  VSGMWGIFYFKQM 367

>ref|XP_002180280.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217408537|gb|EEC48471.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 346

 Score = 166 bits (421), Expect = 1e-38
 Identities = 113/328 (34%), Positives = 170/328 (51%), Gaps = 16/328 (4%)
 Frame = -1

Query: 1003 VFQLYKSTACFLTSWLILVY-IPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 827
            V Q YK LTSWL+L++ +P+ FT WG + GL V G F+V+ AG+ V+Q +W
Sbjct: 38   VLQTYKIGMTLLTSWLVLLFGVPFTFTPWGFVSGLFMVPGGTAGYFAVQNAGMAVSQGIW 97

Query: 826  SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIQLFNNFPTHT 647
            S L + +++ WG IF EP+ + + LA+ ++ +G+ GV + +
Sbjct: 98   SSLKVLVAFCWGILIFHEPVHSKLGTTLAIALLMVGLAGVSIFAAPR------------- 144

Query: 646  ESSTTIKTNDEELHDSVTPLMSNSPTSSCEREVEFTDQDKNERDLLK----GVLGAVFVG 479
            T+ + EE PL+ P + + E D +K+ LK G+LGAV G
Sbjct: 145  ---TSTSSPQEE------PLL---PDVEEQNQPEIVD-NKDYLGFLKRRHVGLLGAVIDG 191

Query: 478  ISNGSFMVPLKYAH-KDIVGVEYLLSFGIGAMTMTMLVFGMYSSMLSFRKIPQPSLYIPG 302
            GS +VP+ YA K G+ Y++SF IG ++ +V+ + L F + SL +
Sbjct: 192  AYGGSVLVPMHYAGPKTTNGLSYVMSFAIGCSSVVTMVWVL---RLLFNSVQGQSLRVGY 248

Query: 301  ATLP----------AFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYY 152
            LP A LAGL+WSLGN SI LG +G+ +VQ QLLV+ +W VF+Y
Sbjct: 249  DRLPSLHVTTIGPYAALAGLIWSLGNVSSILTVALLGEGVGYSIVQSQLLVAGLWGVFWY 308

Query: 151  KEVRTKVGAFSLITSSVIVVMGVVMLAQ 68
            KE+R S T +VI V G+VML++
Sbjct: 309  KEIRGMRAIASWFTFAVITVAGIVMLSR 336

>ref|XP_002182733.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217406079|gb|EEC46020.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 404

 Score = 148 bits (374), Expect = 3e-33
 Identities = 96/352 (27%), Positives = 175/352 (49%), Gaps = 41/352 (11%)
 Frame = -1

Query: 1003 VFQLYKSTACFLTSWLILVY-IPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 827
            VFQ YK+ F+ SWL++ I +T WG++ G +WV+ G + ++R AG+ +A W
Sbjct: 44   VFQSYKTITMFMLSWLVIFMGIAPSWTSWGLVSGGLWVVGGTGGVLAIRMAGLAIAVGTW 103

Query: 826  SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIQLFNNFPTHT 647
            + + + I+++ G +F+EP+ + +L A ++ALG++G+ + + QL T
Sbjct: 104  ASVMIVINFLVGIVLFQEPVSDMFATLGAFLLLALGLVGMSLYSTPQPVDQL-----PST 158

Query: 646  ESSTTIKTNDEELHDSVTPLM----------------------SNSPTSSCER-EVEFT- 539
            E + I N E+ + L+ S S SS + E FT
Sbjct: 159  EMTENIGPNQNEVEEIDRALIVKRTSSYTGKIDHRDIQRRNEESGSYGSSADADEPLFTI 218

Query: 538  -DQDKNERDLLKGVLGAVFVGISNGSFMVPLKYAH-KDIVGVEYLLSFGIGAMTMTMLVF 365
            D K +R G+ GA+F G+ GS ++PL YA + G Y++S+ GA+ M L++
Sbjct: 219  PDGTKRKRSGPTGICGAIFNGVMTGSSLIPLHYAKTQGYGGANYMISYASGAIVMNCLIW 278

Query: 364  GMYSSMLSFRKIPQ--------------PSLYIPGATLPAFLAGLLWSLGNFFSIYATLY 227
            G++ + ++ + Q P+ + LP F +G+L ++ F SI + Y
Sbjct: 279  GVFFAYTCYQTVQQDLNVPVLLHTFQVMPAWHFRKLWLPGFTSGVLLTIAMFGSILSVTY 338

Query: 226  LGLALGWPLVQCQLLVSAMWAVFYYKEVRTKVGAFSLITSSVIVVMGVVMLA 71
            LG +G +VQ ++L+S +W +F+++E+R S+ + V G++ L+
Sbjct: 339  LGQGIGNSIVQAKILISGLWGIFWFREIRGMYIVTKWFLSASLTVAGILWLS 390

>ref|XP_005789382.1| hypothetical protein EMIHUDRAFT_98339 [Emiliania huxleyi CCMP1516]
 gi|485642886|gb|EOD36953.1| hypothetical protein EMIHUDRAFT_98339 [Emiliania huxleyi CCMP1516]
          Length = 358

 Score = 147 bits (371), Expect = 7e-33
 Identities = 89/321 (27%), Positives = 149/321 (46%), Gaps = 7/321 (2%)
 Frame = -1

Query: 1003 VFQLYKSTACFLTSWLILVYIPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWS 824
            +FQLY S L+S L+L P+ F++WG G +W+ + + + G GVA W
Sbjct: 70   IFQLYFSAGVALSSILVLALTPFSFSFWGFAGASLWISSMMCGKIGIDGIGYGVAVATWG 129

Query: 823  GLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIQLFNNFPTHTE 644
            ++ +S++WG +F E + ++ ALC +A G+ GV + S P E
Sbjct: 130  STTMIVSFLWGTLVFAERPSSVTGAVAALCTLAAGVAGVATAQSGSLG-------PPEAE 182

Query: 643  SSTTIKTNDEELHDSVTPLMSNSPTSSCEREVEFTDQDKNERDLLKGVLGAVFVGISNGS 464
            ++ N E + + G LGA+ G+ NGS
Sbjct: 183  AAAEAFLNPAEGRVGGAAARAGA-----------------------GWLGALGCGLLNGS 219

Query: 463  FMVPLKYAHKD-------IVGVEYLLSFGIGAMTMTMLVFGMYSSMLSFRKIPQPSLYIP 305
            MVP Y ++ VG+ Y+ +F G + + F +Y+ + FR PQP L
Sbjct: 220  LMVPFHYFSEERSGQDGASVGMGYIATFATGVAAVQPIFFLLYARV-PFR--PQPPLLCS 276

Query: 304  GATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKEVRTKVGA 125
            LP + G+ W++GNF S +ATL+LG A+G+PL Q ++V+ +W ++ E+R
Sbjct: 277  ELALPGLITGVFWAIGNFESTFATLHLGQAVGYPLTQTCIVVAGLWGALFFGEIRGAPSL 336

Query: 124  FSLITSSVIVVMGVVMLAQFG 62
            S ++++ G V+L +G
Sbjct: 337  LLFSVSVLVIIGGAVLLGMYG 357

>ref|XP_002669182.1| predicted protein [Naegleria gruberi]
 gi|284082726|gb|EFC36438.1| predicted protein [Naegleria gruberi]
          Length = 425

 Score = 145 bits (365), Expect = 3e-32
 Identities = 89/340 (26%), Positives = 170/340 (50%), Gaps = 26/340 (7%)
 Frame = -1

Query: 1003 VFQLYKSTACFLTSWLILVYIPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWS 824
            VFQ Y S+ +TS+++L++ + F++WG+LG +WV ++++ ++ G+GVAQ +WS
Sbjct: 88   VFQFYFSSMVLITSFIVLIWNEWYFSFWGILGAAVWVPASLLSLIAIHLLGLGVAQGVWS 147

Query: 823  GLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVG----------FSVSNKRRIQ 674
            G+++ S+ WG +F I N L+ LAL +M +G++G+ S+
Sbjct: 148  GVNIITSFTWGVALFHSEIGNPYLTALALILMVVGIVGIATCSKWNLPELLPASSTETKS 207

Query: 673  LFNNFPTHTES-----------STTIKTNDEELHDSV--TPLMSNSPTSSCEREVEFTDQ 533
            L N TH + + ++ N++ + ++ T PT R+ +
Sbjct: 208  LVNETVTHYDGNEENPEAPNTFNPEVQNNEQAVEQTIETTQEEEEYPTQPLSRKEKIVSI 267

Query: 532  DKNERDLLKGVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFGMYS 353
            K+ ++ + G+ +V VG+ GS VP ++ K GV Y++ FG G+ +T + +Y
Sbjct: 268  LKSSKNYILGLACSVGVGVLGGSQFVPSRFEEKP--GVVYVVGFGFGSAGITSAILVIYY 325

Query: 352  SMLSFR-KIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATL-YLGLALGWPLVQCQLLV 179
            R ++ P + P + + LW +GN + Y ++ LG +G PL Q L+V
Sbjct: 326  IYYIIRYRVVLP--FHPKVAVFPCITACLWQVGNVMATYVSMSSLGFTIGLPLTQASLVV 383

Query: 178  SAMWAVFYYKEVRTKVGAFSLITSS-VIVVMGVVMLAQFG 62
            + + + ++KE+R S+ V ++ G ++L+ FG
Sbjct: 384  AGICGLLFFKELRGWKAILQFFVSALVFLIPGCILLSLFG 423

>ref|XP_002182676.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217406022|gb|EEC45963.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 413

 Score = 136 bits (342), Expect = 2e-29
 Identities = 101/357 (28%), Positives = 170/357 (47%), Gaps = 46/357 (12%)
 Frame = -1

Query: 1003 VFQLYKSTACFLTSWL-ILVYIPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 827
            V Q YK++ CFLT WL IL+ ++T +G++ GL WV + IF +R AG+ VA W
Sbjct: 44   VMQSYKTSVCFLTCWLVILLGEEPRWTPYGIVSGLFWVPGAAMGIFGIRNAGLAVAVGTW 103

Query: 826  SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIQLF--NNFPT 653
            S +++ S+ +G +F+E +K+ + LA + +G++G+ ++++++ +
Sbjct: 104  SSITVLTSFFFGIIVFQERVKSFYQTCLAFGCLIIGLIGMSRFSAHQQQVDTLAVSYRSV 163

Query: 652  HTESSTTI-------KTNDEELHDSVT-PLMSNSPTSSCEREVEFTDQD----------- 530
            T +S + + +S+T PL+ S E E TD +
Sbjct: 164  KTAASHPLGLGQKLKRAGSTIAENSITVPLVGASGVIPMEIEPFATDGEDIVMGTYDDAK 223

Query: 529  ---KNERDLL-----------KGVLGAVFVGISNGSFMVPLKYA--HKDIVGVEYLLSFG 398
            +R +L G+LGAV G G ++PL +A +D+ G YL+S+
Sbjct: 224  SVLSKDRLVLFGGRVSLTRRQMGILGAVINGAWGGMNLIPLHFALQEEDMTGAGYLISYA 283

Query: 397  IGAMTMTMLVF----GMYSSMLSFRKIPQ----PSLYIPGATLPAFLAGLLWSLGNFFSI 242
            G++ + ++ G Y + P + +P +AGLL+S GNF SI
Sbjct: 284  TGSLIVNTCIWLAFLGYYLHQTNGHWNEAVDCLPKWHFEHLLIPGLMAGLLYSFGNFCSI 343

Query: 241  YATLYLGLALGWPLVQCQLLVSAMWAVFYYKEVRTKVGAFSLITSSVIVVMGVVMLA 71
            A YLG G+ Q QL VS +W VF++KEV+ S+ + V+G+V LA
Sbjct: 344  LAVTYLGQGTGFSFCQMQLFVSGLWGVFFFKEVQGTDTITKWFISASVAVLGIVWLA 400

>ref|XP_002676812.1| predicted protein [Naegleria gruberi]
 gi|284090416|gb|EFC44068.1| predicted protein [Naegleria gruberi]
          Length = 383

 Score = 135 bits (341), Expect = 2e-29
 Identities = 92/350 (26%), Positives = 165/350 (47%), Gaps = 36/350 (10%)
 Frame = -1

Query: 1003 VFQLYKSTACFLTSWLILVYIPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWS 824
            VFQ Y S L S ++L + +K++WW V G IWV + + +I +V + G VAQ W+
Sbjct: 30   VFQFYFSLVVGLMSLIVLAWNEFKWSWWAVAGSGIWVPSSLFSIVAVEYLGAAVAQSTWA 89

Query: 823  GLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIQLFNNFPTHTE 644
            G + ++IWG +F+ I N L++ L +M +G+ G + S + + T
Sbjct: 90   GCVIITNFIWGVTLFQSKIGNIYLTVFGLVIMIIGIFGTA-TCSKWNNPEPVAEKQSETS 148

Query: 643  SSTTIKTNDEELHDSVTPLMS---------------------NSPTSSCE---------- 557
            + +++ + +E + TPL N PT E
Sbjct: 149  INASVEESGQENNTETTPLYQQENSTNQQENISSDVPIYPSVNDPTLYSELSEIESTIGV 208

Query: 556  ---REVEFTDQDKNERDLLKGVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAM 386
            + +F KN + G++ +V GI+ GS VP + + G+ Y+++FGIG+
Sbjct: 209  YETKSQKFIKILKNSKRYFIGLVASVLCGITGGSMFVPSRL--DEDTGLVYMVAFGIGSF 266

Query: 385  TMTMLVFGMYSSMLSFRKIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATLY-LGLALG 209
            +T + +Y R + ++ + PA L LW GNFF+ Y ++ LGL +G
Sbjct: 267  VITTAILIVYYVYYLIRFKKRVPFHLKLSIFPA-LTAFLWQTGNFFAYYVSVSPLGLTIG 325

Query: 208  WPLVQCQLLVSAMWAVFYYKEVR-TKVGAFSLITSSVIVVMGVVMLAQFG 62
            PL + ++++ + + +++E+R K I+ V++V G ++LA FG
Sbjct: 326  MPLTETAMVITGICGLVFFRELRGWKAILQFFISVLVLLVPGCILLALFG 375

>gb|EJK61576.1| hypothetical protein THAOC_17910 [Thalassiosira oceanica]
          Length = 360

 Score = 119 bits (299), Expect = 2e-24
 Identities = 85/327 (25%), Positives = 148/327 (45%), Gaps = 41/327 (12%)
 Frame = -1

Query: 1003 VFQLYKSTACFLTSWLILVYIPYK---------------FTWWGVLGGLIWVINGIVAIF 869
            VFQ YK+ A F+TS L++ + FT W + + WV G +F
Sbjct: 43   VFQTYKAVAVFVTSLLLVAFCNLMHGTHPDSFDYWSFADFTHWAFVSAIFWVPGGTAGVF 102

Query: 868  SVRWAGIGVAQCLWSGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSN 689
            +VR AG+ ++ LWS + + +SY+WG IF E ++ ++ A+ +M +G++G+ S
Sbjct: 103  AVRRAGLAISTGLWSCVIILLSYLWGVLIFHEKQESAVGAVGAVLLMCVGLIGIAHFSS- 161

Query: 688  KRRIQLFNNFPTHTESSTTIKTNDEELHDSVTPLMSNSPTSSCEREV-EFTDQDKNERDL 512
            I++ + +++ D TPL + + + ++ + T Q
Sbjct: 162  ---IEVRPGLDQARAAPRSVEECRPACSDETTPLNGINRANDAQFDLAKLTSQ------- 211

Query: 511  LKGVLGAVFVGISNGSFMVPLKYAHKDIV-GVEYLLSFGIGAMTMTMLVFGMYSSMLSFR 335
            L G+ AV G+ S M+PL YA + G+ Y +SFGI A+ + + + + L+
Sbjct: 212  LPGLFAAVLNGLFAASIMLPLHYAPPNTTKGIGYSMSFGIAAVVVVFIFWTIRLLALTAA 271

Query: 334  KIPQ------------------------PSLYIPGATLPAFLAGLLWSLGNFFSIYATLY 227
            + PS + P F AGLL+S GN F I + +
Sbjct: 272  EFAAKQNEAKRITPNIIRESLREGYSQLPSFHFSEMWRPGFTAGLLYSGGNLFGIVSIQH 331

Query: 226  LGLALGWPLVQCQLLVSAMWAVFYYKE 146
            LG +G+ L Q +++S W +F+Y+E
Sbjct: 332  LGNFMGYSLNQSSMIISGCWGLFWYRE 358

>ref|XP_005838126.1| hypothetical protein GUITHDRAFT_66230 [Guillardia theta CCMP2712]
 gi|428182285|gb|EKX51146.1| hypothetical protein GUITHDRAFT_66230 [Guillardia theta CCMP2712]
          Length = 341

 Score = 110 bits (276), Expect = 7e-22
 Identities = 83/334 (24%), Positives = 147/334 (44%), Gaps = 23/334 (6%)
 Frame = -1

Query: 1006 FVFQLYKSTACFLT--SWLILVYIPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQC 833
            F FQL+ + +T S +L P +++ WG G L+ A +V G
Sbjct: 34   FGFQLWVTAGNAMTTMSLALLKGSPVRWSSWGAAGALLLTATQCCAWPAVGALGAAAGPG 93

Query: 832  LWSGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIQLFNNFPT 653
            +W G+ + ++++WG +F+E +++ L ++AL ++ G++G+ S+ + L +
Sbjct: 94   IWCGVGMSVAFMWGTIVFQEAVRSLALCIVALILLFFGIVGISLVQSSMLQRLLGES--- 150

Query: 652  HTESSTTIKTNDEELHDSVTPLMSNSPTSSCEREVEFTDQDKNERDLLKGVLGAVFVGIS 473
            T LMS ++ R + GVL A+ G+
Sbjct: 151  -----------------GATGLMSEEESNKTGRA-----------RIAVGVLLALMTGLF 182

Query: 472  NGSFMVPLKY-------------------AHKDIVGVEYLLSFGIGAMTMT--MLVFGMY 356
            +GS M P K + D+V EYL SF + + LV M+
Sbjct: 183  DGSLMAPFKAYLASHPSLVSSSSSSSSSSSSSDVVVFEYLGSFALALPVVAGGSLVLIMF 242

Query: 355  SSMLSFRKIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVS 176
            + P S + A P F AG+LW++GN S++ATL LG ++G+P+ Q +++S
Sbjct: 243  YQHRALNSGPDRSSFRQAA-YPGFCAGVLWAVGNVLSVHATLELGQSIGFPMTQSCVVIS 301

Query: 175  AMWAVFYYKEVRTKVGAFSLITSSVIVVMGVVML 74
            A+W + +KE+ + + SS +V G +L
Sbjct: 302  ALWGIVVFKEMTARTPLLLFLLSSTLVAAGASLL 335

>ref|XP_002185821.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|209582670|gb|ACI65291.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 451

 Score = 107 bits (267), Expect = 8e-21
 Identities = 54/154 (35%), Positives = 87/154 (56%), Gaps = 10/154 (6%)
 Frame = -1

Query: 505  GVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFGMYSSMLSFRKIP 326
            G++ A+F G+ GS M P+K+ D G +LLSF IGA + GM+ + +
Sbjct: 294  GMVAAMFCGVWGGSIMAPMKFCQSDTKGTHFLLSFSIGASIVNT---GMWLVRYGYNVLH 350

Query: 325  QPSLYIPGATLPAF----------LAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVS 176
            S A+LP+F L+G+LWS+GNFFS+ + YLG +G+PLVQ ++VS
Sbjct: 351  YQSCSKAYASLPSFHLHTMWLAGGLSGMLWSIGNFFSLISVFYLGQGVGYPLVQTSIIVS 410

Query: 175  AMWAVFYYKEVRTKVGAFSLITSSVIVVMGVVML 74
            +W +FY+KE+ + SS++ + G+++L
Sbjct: 411  GLWGIFYFKEITGFERISKWLASSLLTIFGILLL 444

 Score = 89.7 bits (221), Expect = 2e-15
 Identities = 41/102 (40%), Positives = 65/102 (63%), Gaps = 1/102 (0%)
 Frame = -1

Query: 1003 VFQLYKSTACFLTSWLILVY-IPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 827
            VFQ YK+ CF TSWL+L+ P+ FT WG++ GL WV G IF+V+ AG+ + +
Sbjct: 49   VFQTYKTFMCFATSWLVLLAGEPFTFTPWGIVSGLFWVPGGTATIFAVKNAGLAIGIGIG 108

Query: 826  SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGF 701
            S + +S+IWG ++F+E + + + LA+ M LG+LG+ +
Sbjct: 109  SSFIVLVSFIWGIFVFEEAVHSKTGACLAIFSMMLGLLGMSY 150

>ref|XP_001775344.1| predicted protein [Physcomitrella patens]
 gi|162673289|gb|EDQ59814.1| predicted protein [Physcomitrella patens]
          Length = 344

 Score = 95.5 bits (236), Expect = 3e-17
 Identities = 76/315 (24%), Positives = 131/315 (41%), Gaps = 4/315 (1%)
 Frame = -1

Query: 1003 VFQLYKSTACFLTSWLILVYIPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWS 824
            +F + L+S L+L + F G+L G+ +V++ I +VR G+ VA +W+
Sbjct: 67   IFNFWACLGVLLSSLLLLFKYKFVFALEGLLSGVFFVLSFINIFRAVRLLGVSVAYGIWA 126

Query: 823  GLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIQLFNNFPTHTE 644
            G + + W + EP Q F HT+
Sbjct: 127  GTAAIVGVAWSGQMSWEP-------------------------------QDFYEDDDHTQ 155

Query: 643  SSTTIKTNDEEL----HDSVTPLMSNSPTSSCEREVEFTDQDKNERDLLKGVLGAVFVGI 476
            T I++ H + + S S ER + R GV AV GI
Sbjct: 156  --TLIQSQPSFAGWVQHRKLWDVAGQS--KSGERPKNVLTGEPASRSFPAGVFSAVLAGI 211

Query: 475  SNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFGMYSSMLSFRKIPQPSLYIPGAT 296
            G M+P A G +L SFGI T +V +S+ P L A
Sbjct: 212  LGGLVMIPANQAPDMAQGNAFLPSFGIAVAIFTPIV----TSLPYLSGCELPDLSAREAA 267

Query: 295  LPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKEVRTKVGAFSL 116
            P L+G ++++GN +I A Y+G ++ +PL QC ++V+ +W + Y++E +L
Sbjct: 268  GPGILSGFIYNIGNMLNIVAIFYVGSSVAYPLFQCGIIVAGIWGMLYFEESHGN-ALITL 326

Query: 115  ITSSVIVVMGVVMLA 71
            + V++++G+++L+
Sbjct: 327  WAADVVLLVGIILLS 341

>ref|XP_003294072.1| hypothetical protein DICPUDRAFT_90514 [Dictyostelium purpureum]
 gi|325075525|gb|EGC29401.1| hypothetical protein DICPUDRAFT_90514 [Dictyostelium purpureum]
          Length = 337

 Score = 94.0 bits (232), Expect = 9e-17
 Identities = 73/293 (24%), Positives = 143/293 (48%), Gaps = 12/293 (4%)
 Frame = -1

Query: 913  LGGLIWVINGIVAIFSVRWAGIGVAQCLWSGLSLFISYIWGA---YIFKEPIKNHG-LSL 746
            +GG +W ++ I +++ G+G+A L+S + + +I G + KE H ++
Sbjct: 53   MGGSLWCCANLLVIPTIKLLGLGLAVLLYSSIGIVAGFIVGKAGLFGLKEAAAAHDWMNY 112

Query: 745  LALCVMALGMLGVGF---SVSNKRRIQLFNNFPTHTESSTTIKTNDEELHDSVTPLMSNS 575
            L L + L ++ F ++ +++ N+ + + I +DEE++ +PL+ N+
Sbjct: 113  LGLAGIILSVIFFFFIKPNLEEEKKADTKGNYHGSYDDFSNI--SDEEIN---SPLIVNT 167

Query: 574  PTSSCEREVEFTDQDKNERDLLKGVLGAVFVGISNGSFMVPL---KYAHKDIVGVEYLLS 404
            T EV D+ N + GV+ A+ +GI G M+P+ K + D ++Y S
Sbjct: 168  QTQIKNYEVSIYDRIPNRLKTVSGVVFALVIGILLGVNMIPMQLWKQRNPDANPLDYTFS 227

Query: 403  FGIGAMTMTMLVFGMYSSMLSFRKIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATLYL 224
            G VF +Y+ I +P P LP+F +G++ + N + +T L
Sbjct: 228  QFSGIFLANTFVFILYTI------IKRPPQIYPQTILPSFCSGVVLGVANIGLMISTENL 281

Query: 223  GLALGWPLVQCQ--LLVSAMWAVFYYKEVRTKVGAFSLITSSVIVVMGVVMLA 71
            G +G+P + C ++VS++W++FY++E+ L S I++ G+V++A
Sbjct: 282  GYTVGYP-ISCSGPMIVSSLWSIFYFREITGLRNFIILFVSFSILIGGIVLMA 333

>ref|XP_641212.1| transmembrane protein 144 A [Dictyostelium discoideum AX4]
 gi|74855822|sp|Q54V96.1|T144B_DICDI RecName: Full=Transmembrane protein 144 homolog B; AltName: Full=Transmembrane protein 144 homolog 2
 gi|60469237|gb|EAL67232.1| transmembrane protein 144 A [Dictyostelium discoideum AX4]
          Length = 358

 Score = 92.8 bits (229), Expect = 2e-16
 Identities = 77/305 (25%), Positives = 134/305 (43%), Gaps = 16/305 (5%)
 Frame = -1

Query: 937  YKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWSGLSLFISYIWGAY----IFKEP 770
            Y F WG+LGG +W I I V+ GIG+ LW S+ Y G + I K+
Sbjct: 59   YIFDPWGLLGGTLWSIGNFCVIPIVKTIGIGLGLLLWCCSSIITGYFTGKFGWFGIDKQK 118

Query: 769  IKNHGLSLLALCVMALGMLGVGFSVSNKRRIQLFNNFPTHTESSTTIKTNDEELHDSVTP 590
            + + L+ + + ++ F + +S D ++S+
Sbjct: 119  VSHPALNWIGFACIVAAVIFFFFIEPTIEEKDEHSYSSIVDDSEIGNNGIDNNGYNSIN- 177

Query: 589  LMSNSPTSSCEREVEFTDQDKN---ER-----DLLKGVLGAVFVGISNGSFMVPL---KY 443
            +N+ ++ R F Q K ER + + G++ +VF GI G MVP+ K
Sbjct: 178  -NNNNNGNNKRRSGAFNKQPKKSIFERMPPPYNTILGIVLSVFSGIMYGVNMVPMQLWKQ 236

Query: 442  AHKDIVGVEYLLSFGIGAMTMTMLVFGMYSSMLSFRKIPQPSLYIPGATLPAFLAGLLWS 263
            ++ D + ++ G VF +YS I +P P P+F +GLLW
Sbjct: 237  SNVDASPLSFVFCHFSGIFLANTAVFIVYSI------IVRPPQIFPQTIFPSFFSGLLWG 290

Query: 262  LGNFFSIYATLYLGLALGWPLVQ-CQLLVSAMWAVFYYKEVRTKVGAFSLITSSVIVVMG 86
            + N + AT LG +G+P+ ++VS++W+VFY++E++ L+ S + + G
Sbjct: 291  IANVGLMVATQNLGYTIGFPMGSGGPMIVSSLWSVFYFREIQGVKNLLILLISFIFLGAG 350

Query: 85   VVMLA 71
            + +LA
Sbjct: 351  ITILA 355

>ref|XP_005758882.1| hypothetical protein EMIHUDRAFT_121297 [Emiliania huxleyi CCMP1516]
 gi|485606731|gb|EOD06453.1| hypothetical protein EMIHUDRAFT_121297 [Emiliania huxleyi CCMP1516]
          Length = 334

 Score = 90.5 bits (223), Expect = 1e-15
 Identities = 77/313 (24%), Positives = 141/313 (45%), Gaps = 28/313 (8%)
 Frame = -1

Query: 925  WWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWSGLSLFISYIWGAYIFKEPIKNHGLSL 746
            W G + W + +++V GI V Q +GL + ++ +WGA +F E ++ G +
Sbjct: 30   WRGAASAICWAPATVAYVYAVNAIGITVTQTCVAGLLVAVNVLWGACMFSEQLE--GTCI 87

Query: 745  LALCVMALGML-GVGFSVSNKRRIQLFNNFPTHTE-------SSTTIKTNDEEL------ 608
            + L + G+L G+ ++R++ +H + +S+T T+ E+L
Sbjct: 88   MGLALTTCGILAGINAKKFSERQMNRIAGQHSHLDLSQPVQAASSTAPTDQEQLVEKGAL 147

Query: 607  --HDSVTPLMSNSPTSSCEREVEFTDQDKNERDLLKGVLGAVFVGISNGSFMVP---LKY 443
            H+ V + +++ + G+ F GI GS MVP +
Sbjct: 148  RSHELVVSNGGGAHAGGGQQQPAAAHPVPAPSEWRVGMAAVCFNGIWGGSIMVPTHGMSR 207

Query: 442  AHKDIVGVEYLLSFGIGAMTMTMLVFGMYSSMLSFRKIPQPSLYIPGATLP---AFLAGL 272
            A D Y L+FG A+ + +++ +++R +P LY+ + +G
Sbjct: 208  APADYAS--YSLAFGSTALLVIAVLW-----CINWRSVP---LYVAQCRVVWRRGLASGC 257

Query: 271  LWSLGNFFSIYAT------LYLGLALGWPLVQCQLLVSAMWAVFYYKEVRTKVGAFSLIT 110
            LW++GN S A LGLALG+ VQC L+VS+ W V ++EV+ +
Sbjct: 258  LWAVGNVCSAVAVNGWGDMAGLGLALGYSAVQCNLVVSSTWGVCLFREVQGAMSIGIWAV 317

Query: 109  SSVIVVMGVVMLA 71
            +V+V+ G++M+A
Sbjct: 318  GAVLVLAGIIMIA 330

>ref|XP_004350600.1| transmembrane protein [Dictyostelium fasciculatum]
 gi|328865506|gb|EGG13892.1| transmembrane protein [Dictyostelium fasciculatum]
          Length = 435

 Score = 90.5 bits (223), Expect = 1e-15
 Identities = 75/309 (24%), Positives = 135/309 (43%), Gaps = 20/309 (6%)
 Frame = -1

Query: 937  YKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWSGLSLFISYIWGAY----IFKEP 770
            Y F WG+LGG +W + + I V+ G+G+ LWS SL ++ G + + K+
Sbjct: 145  YIFDPWGLLGGSLWSVGNLCVIPIVKTIGLGLGLLLWSCTSLVTGFLIGKFGAFGLDKQS 204

Query: 769  IKNHGLSLLALCVMALGMLGVGF-----------SVSNKRRIQLFNNFPTHTESSTTIKT 623
            + + L+ L + + +L F + S KR Q + P E +I +
Sbjct: 205  VAHPVLNWLGFSAIVVAILFFFFIKPTLNKEEPTTPSKKRLSQRYEYSPIVDEQQISINS 264

Query: 622  NDEELHDSVTPLMSNSPTSSCEREVEFTDQDKNERDLLKGVLGAVFVGISNGSFMVPL-- 449
            + NS + E + N+ + GV+ +VF G+ G MVP+
Sbjct: 265  TE------------NSAPVEGQMIFEKIPEPYNK---IFGVMLSVFSGVLYGVNMVPMQL 309

Query: 448  --KYAHKDIVGVEYLLSFGIGAMTMTMLVFGMYSSMLSFRKIPQPSLYIPGATLPAFLAG 275
            + D+ + ++ G VF +YS I +P P LP+F++G
Sbjct: 310  WKQQQPSDVNPLSFIFCHFSGIFLFNTAVFFVYSI------IKRPPQVFPQTMLPSFISG 363

Query: 274  LLWSLGNFFSIYATLYLGLALGWPLVQC-QLLVSAMWAVFYYKEVRTKVGAFSLITSSVI 98
            +LW + N + AT LG +G+P+ ++VS++W+V +KE++ L+ S +
Sbjct: 364  VLWGVANCGLMVATQILGYTIGFPIGSSGPMVVSSLWSVLLFKEIQGTKNLLILLISFIF 423

Query: 97   VVMGVVMLA 71
            + G+ L+
Sbjct: 424  LGAGITCLS 432

>ref|XP_003293389.1| hypothetical protein DICPUDRAFT_50939 [Dictyostelium purpureum]
 gi|325076279|gb|EGC30078.1| hypothetical protein DICPUDRAFT_50939 [Dictyostelium purpureum]
          Length = 348

 Score = 89.4 bits (220), Expect = 2e-15
 Identities = 76/301 (25%), Positives = 132/301 (43%), Gaps = 9/301 (2%)
 Frame = -1

Query: 937  YKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWSGLSLFISYIWGAYIFKEPIKNH 758
            Y F WG+LGG +W + I V+ G+G+ LW S+ + G +
Sbjct: 64   YLFDPWGLLGGTLWSLGNFCVIPIVKTIGLGLGLLLWCCCSIVAGFFTGKFGL------F 117

Query: 757  GLSLLALCVMALGMLGVGFSVSNKRRIQLFNNFPTHTESSTTIKTNDEELHDSVTPLMSN 578
            GL + A+ +G G V+ I F E+ T + + D P+++
Sbjct: 118  GLEKQVVSHPAMNWIGFGCIVA---AIVFFFFIKPTLENEDTESNSYSSIVDDY-PIINE 173

Query: 577  SPTSSCEREVEFTDQDKNER--DLLKGVLG---AVFVGISNGSFMVPL---KYAHKDIVG 422
            + + + T++ ER +K +LG A+F GI G MVP+ K +
Sbjct: 174  AGYRGSIQSSKLTEKSFFERIPQPMKTMLGIGLAIFSGIMYGVNMVPMQLWKQSDPSANP 233

Query: 421  VEYLLSFGIGAMTMTMLVFGMYSSMLSFRKIPQPSLYIPGATLPAFLAGLLWSLGNFFSI 242
            + ++ G +VF +Y+ I +P P LP+F +G+LW + N +
Sbjct: 234  LSFIFCHFSGIFIFNTIVFFVYAI------IKRPPQVFPQTMLPSFFSGVLWGIANCGLM 287

Query: 241  YATLYLGLALGWPL-VQCQLLVSAMWAVFYYKEVRTKVGAFSLITSSVIVVMGVVMLAQF 65
            AT LG +G+P+ ++VS++W+V Y+KE++ L+ S + + G+ LA
Sbjct: 288  VATQNLGYTVGFPMGASGPMVVSSIWSVVYFKEIQGVKNLLILLISFLFLGAGITTLALS 347

Query: 64   G 62
            G
Sbjct: 348  G 348

>gb|EJK47010.1| hypothetical protein THAOC_34300, partial [Thalassiosira oceanica]
          Length = 641

 Score = 89.0 bits (219), Expect = 3e-15
 Identities = 41/102 (40%), Positives = 66/102 (64%), Gaps = 1/102 (0%)
 Frame = -1

Query: 1003 VFQLYKSTACFLTSWLI-LVYIPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 827
            V Q YKS CFLTSWL+ L+ + FT WG++ GL WV G IF++R AG+ ++Q +
Sbjct: 41   VMQSYKSLMCFLTSWLVVLLGVEVTFTPWGIVSGLFWVPGGAFNIFAIRNAGLAISQGIV 100

Query: 826  SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGF 701
            + + +S+IWG IFKEP+ + ++ A+ ++ LG+ G+ +
Sbjct: 101  ASSIVMVSFIWGNIIFKEPVHSEVIAYSAVWLIMLGLYGMSY 142

>gb|ABK23204.1| unknown [Picea sitchensis]
          Length = 196

 Score = 87.4 bits (215), Expect = 8e-15
 Identities = 44/148 (29%), Positives = 82/148 (55%)
 Frame = -1

Query: 514  LLKGVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFGMYSSMLSFR 335
            L +GV+ A+ GI G M+P+ + I GV YL SF IG +V + LS +
Sbjct: 51   LSQGVIAALLTGILGGLIMMPMTQSPPAIQGVSYLPSFAIGVAIFAPVVTAI--PYLSTQ 108

Query: 334  KIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFY 155
            + P+ LY+ LP ++G++W++GN S+ A +G + +P++ C + ++ +W +F
Sbjct: 109  ECPRMELYV--GALPGIISGIVWNIGNILSMLAIGIIGYTIAYPILYCGIFIAGLWGMFL 166

Query: 154  YKEVRTKVGAFSLITSSVIVVMGVVMLA 71
            +KE+R A S +++ G+++L+
Sbjct: 167  FKEIRGNAAAL-YWGSGFLILTGIILLS 193

>gb|EJK73333.1| hypothetical protein THAOC_05052 [Thalassiosira oceanica]
          Length = 313

 Score = 79.7 bits (195), Expect = 2e-12
 Identities = 45/140 (32%), Positives = 72/140 (51%), Gaps = 2/140 (1%)
 Frame = -1

Query: 1003 VFQLYKSTACFLTS--WLILVYIPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCL 830
            V Q YK+T CFL S ++L+ +FT WG+L G+ WV G I+ +R AG+ VA
Sbjct: 43   VMQSYKTTLCFLMSSPMVMLLGERPRFTHWGILSGVFWVPGGAAGIYGIRKAGLAVAVGT 102

Query: 829  WSGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIQLFNNFPTH 650
            WS L + S+ WG ++F E +K+ + A + LG++G+ S + + + +
Sbjct: 103  WSSLVVLTSFFWGIHVFGERVKSPNGAAGACLTLILGLIGMANFSSKGKPKKKEKDICSK 162

Query: 649  TESSTTIKTNDEELHDSVTP 590
            E+ T D E + TP
Sbjct: 163  AETLIRDSTRDLESQQTSTP 182