BLASTX nr result
ID: Ephedra26_contig00020415
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra26_contig00020415
         (919 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|ADE75937.1| unknown [Picea sitchensis]                            390   e-106
ref|XP_002295272.1| predicted protein [Thalassiosira pseudonana ...  175   2e-41
ref|XP_002180280.1| predicted protein [Phaeodactylum tricornutum...  152   2e-34
ref|XP_002182733.1| predicted protein [Phaeodactylum tricornutum...  139   2e-30
ref|XP_002669182.1| predicted protein [Naegleria gruberi] gi|284...  137   6e-30
ref|XP_005789382.1| hypothetical protein EMIHUDRAFT_98339 [Emili...  137   7e-30
ref|XP_002182676.1| predicted protein [Phaeodactylum tricornutum...  129   1e-27
ref|XP_002676812.1| predicted protein [Naegleria gruberi] gi|284...  126   1e-26
gb|EJK61576.1| hypothetical protein THAOC_17910 [Thalassiosira o...  116   1e-23
ref|XP_005838126.1| hypothetical protein GUITHDRAFT_66230 [Guill...  100   8e-19
ref|XP_002185821.1| predicted protein [Phaeodactylum tricornutum...  100   1e-18
gb|EJK47010.1| hypothetical protein THAOC_34300, partial [Thalas...   87   7e-15
ref|XP_641212.1| transmembrane protein 144 A [Dictyostelium disc...   87   1e-14
ref|XP_003294072.1| hypothetical protein DICPUDRAFT_90514 [Dicty...   86   1e-14
ref|XP_004350600.1| transmembrane protein [Dictyostelium fascicu...   85   3e-14
ref|XP_001775344.1| predicted protein [Physcomitrella patens] gi...   85   3e-14
ref|XP_003293389.1| hypothetical protein DICPUDRAFT_50939 [Dicty...   84   6e-14
gb|ABK23204.1| unknown [Picea sitchensis]                             84   6e-14
ref|XP_005758882.1| hypothetical protein EMIHUDRAFT_121297 [Emil...
82 4e-13 gb|EJK73333.1| hypothetical protein THAOC_05052 [Thalassiosira o... 80 8e-13 >gb|ADE75937.1| unknown [Picea sitchensis] Length = 359 Score = 390 bits (1001), Expect = e-106 Identities = 194/297 (65%), Positives = 232/297 (78%), Gaps = 6/297 (2%) Frame = +1 Query: 43 FVRQLYKSTACFLTSWLILVYTPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 222 FV Q YKST CFLTSWL+L+YTP+KFTWWG+LG LIWV NG++AI +VRWAGIGV+Q LW Sbjct: 36 FVFQSYKSTTCFLTSWLVLLYTPFKFTWWGILGALIWVTNGVLAIVAVRWAGIGVSQSLW 95 Query: 223 SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIR-LFNNFPTH 399 SGLSLF +YIWGAY+ KEP+KNHGLS+LAL VMALGM+GVGF+VS K + L + + Sbjct: 96 SGLSLFTAYIWGAYVLKEPLKNHGLSILALLVMALGMIGVGFAVSEKTVFQSLLDIWLKL 155 Query: 400 TESSTTIKTNDE----ELHDSVTPLMSNSPTSSCEREVEFTDQD-KNERDLLKGVLGAVF 564 ST IK + ++ DS L+ T +C E E+ DQ + E L+KGVL AV Sbjct: 156 NPCSTKIKDCPQLSCIDVQDSSEALIPCETTKTCGVEEEYADQKYERENKLVKGVLCAVL 215 Query: 565 VGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQPSLYIP 744 VG NGSFMVPLKYAHKD+VG EYL+SFGIGAMTMT+++ GIY + L+F P PSLYIP Sbjct: 216 VGTLNGSFMVPLKYAHKDVVGAEYLVSFGIGAMTMTIILLGIYMTALAFHGRPLPSLYIP 275 Query: 745 GATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKEVRTK 915 GA PAFLAG LWS+GNFFSIYATLYLG+ALGWPLVQCQL+VSAMWAVF+YKEV ++ Sbjct: 276 GAAGPAFLAGFLWSMGNFFSIYATLYLGVALGWPLVQCQLIVSAMWAVFFYKEVTSR 332 >ref|XP_002295272.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|220968995|gb|EED87338.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 373 Score = 175 bits (444), Expect = 2e-41 Identities = 103/313 (32%), Positives = 169/313 (53%), Gaps = 26/313 (8%) Frame = +1 Query: 46 VRQLYKSTACFLTSWLILVY-TPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 222 V Q YKS CFL+SWL+L+ + FTWWG++ GL WV G IF++R AG+ V+Q + Sbjct: 64 VMQSYKSLMCFLSSWLVLLCGQEFTFTWWGIVSGLFWVPAGAFNIFAIRNAGLAVSQGIV 123 Query: 223 SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTHT 402 S + +S+IWG IF+E +K+ ++ A+C++ G+ G+ F +++ + P HT Sbjct: 124 SSSIVMVSFIWGDLIFREAVKSELIAYFAVCLIMAGLYGMSFFSTSEEQ-------PEHT 176 Query: 403 
ESSTTIKTNDEEL----HDSVTPLMSNSPTSSC--------------EREVEFTDQDKNE 528 S +E+L H+S S++ SS R + + + Sbjct: 177 SVSDNDNNGEEKLDLMRHESSDSFDSSNDNSSMGPLEISERRKPSIRGRPILICGKTYSR 236 Query: 529 RDLLKGVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTML--VFGIYSSM 702 R++ G+ A+ G+ GS +VP+ YA D+ G+ Y++SF +GA+T+T+L V + Sbjct: 237 RNI--GLCSALICGVWGGSCLVPMHYAQGDVKGLAYVISFSVGALTVTVLLWVARFAYHL 294 Query: 703 LSFRKIPQ-----PSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLL 867 + + + + PS ++ LP AG LWS+GN SI A +LG +G+ Q LL Sbjct: 295 VKLKSVWEAYEVLPSFHLRVMLLPGATAGSLWSIGNVGSIVAVKHLGQGVGYSASQAALL 354 Query: 868 VSAMWAVFYYKEV 906 VS MW +FY+K++ Sbjct: 355 VSGMWGIFYFKQM 367 >ref|XP_002180280.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217408537|gb|EEC48471.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 346 Score = 152 bits (384), Expect = 2e-34 Identities = 102/301 (33%), Positives = 154/301 (51%), Gaps = 13/301 (4%) Frame = +1 Query: 46 VRQLYKSTACFLTSWLILVY-TPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 222 V Q YK LTSWL+L++ P+ FT WG + GL V G F+V+ AG+ V+Q +W Sbjct: 38 VLQTYKIGMTLLTSWLVLLFGVPFTFTPWGFVSGLFMVPGGTAGYFAVQNAGMAVSQGIW 97 Query: 223 SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTHT 402 S L + +++ WG IF EP+ + + LA+ ++ +G+ GV + + Sbjct: 98 SSLKVLVAFCWGILIFHEPVHSKLGTTLAIALLMVGLAGVSIFAAPR------------- 144 Query: 403 ESSTTIKTNDEELHDSVTPLMSNSPTSSCEREVEFTDQDKNERDLLK----GVLGAVFVG 570 T+ + EE PL+ P + + E D +K+ LK G+LGAV G Sbjct: 145 ---TSTSSPQEE------PLL---PDVEEQNQPEIVD-NKDYLGFLKRRHVGLLGAVIDG 191 Query: 571 ISNGSFMVPLKYA-HKDIVGVEYLLSFGIGAMTMTMLVF-------GIYSSMLSFRKIPQ 726 GS +VP+ YA K G+ Y++SF IG ++ +V+ + L Sbjct: 192 AYGGSVLVPMHYAGPKTTNGLSYVMSFAIGCSSVVTMVWVLRLLFNSVQGQSLRVGYDRL 251 Query: 727 PSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKEV 906 PSL++ A LAGL+WSLGN SI LG +G+ +VQ QLLV+ +W VF+YKE+ Sbjct: 252 PSLHVTTIGPYAALAGLIWSLGNVSSILTVALLGEGVGYSIVQSQLLVAGLWGVFWYKEI 311 Query: 907 R 909 R Sbjct: 312 R 312 >ref|XP_002182733.1| predicted protein 
[Phaeodactylum tricornutum CCAP 1055/1] gi|217406079|gb|EEC46020.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 404 Score = 139 bits (349), Expect = 2e-30 Identities = 89/329 (27%), Positives = 164/329 (49%), Gaps = 41/329 (12%) Frame = +1 Query: 46 VRQLYKSTACFLTSWLILVY-TPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 222 V Q YK+ F+ SWL++ +T WG++ G +WV+ G + ++R AG+ +A W Sbjct: 44 VFQSYKTITMFMLSWLVIFMGIAPSWTSWGLVSGGLWVVGGTGGVLAIRMAGLAIAVGTW 103 Query: 223 SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTHT 402 + + + I+++ G +F+EP+ + +L A ++ALG++G+ + + +L T Sbjct: 104 ASVMIVINFLVGIVLFQEPVSDMFATLGAFLLLALGLVGMSLYSTPQPVDQL-----PST 158 Query: 403 ESSTTIKTNDEELHDSVTPLM----------------------SNSPTSSCER-EVEFT- 510 E + I N E+ + L+ S S SS + E FT Sbjct: 159 EMTENIGPNQNEVEEIDRALIVKRTSSYTGKIDHRDIQRRNEESGSYGSSADADEPLFTI 218 Query: 511 -DQDKNERDLLKGVLGAVFVGISNGSFMVPLKYAH-KDIVGVEYLLSFGIGAMTMTMLVF 684 D K +R G+ GA+F G+ GS ++PL YA + G Y++S+ GA+ M L++ Sbjct: 219 PDGTKRKRSGPTGICGAIFNGVMTGSSLIPLHYAKTQGYGGANYMISYASGAIVMNCLIW 278 Query: 685 GIYSSMLSFRKIPQ--------------PSLYIPGATLPAFLAGLLWSLGNFFSIYATLY 822 G++ + ++ + Q P+ + LP F +G+L ++ F SI + Y Sbjct: 279 GVFFAYTCYQTVQQDLNVPVLLHTFQVMPAWHFRKLWLPGFTSGVLLTIAMFGSILSVTY 338 Query: 823 LGLALGWPLVQCQLLVSAMWAVFYYKEVR 909 LG +G +VQ ++L+S +W +F+++E+R Sbjct: 339 LGQGIGNSIVQAKILISGLWGIFWFREIR 367 >ref|XP_002669182.1| predicted protein [Naegleria gruberi] gi|284082726|gb|EFC36438.1| predicted protein [Naegleria gruberi] Length = 425 Score = 137 bits (345), Expect = 6e-30 Identities = 83/313 (26%), Positives = 157/313 (50%), Gaps = 25/313 (7%) Frame = +1 Query: 46 VRQLYKSTACFLTSWLILVYTPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWS 225 V Q Y S+ +TS+++L++ + F++WG+LG +WV ++++ ++ G+GVAQ +WS Sbjct: 88 VFQFYFSSMVLITSFIVLIWNEWYFSFWGILGAAVWVPASLLSLIAIHLLGLGVAQGVWS 147 Query: 226 GLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVG----------FSVSNKRRIR 375 G+++ S+ WG +F I N L+ LAL +M +G++G+ S+ Sbjct: 148 
GVNIITSFTWGVALFHSEIGNPYLTALALILMVVGIVGIATCSKWNLPELLPASSTETKS 207 Query: 376 LFNNFPTHTES-----------STTIKTNDEELHDSV--TPLMSNSPTSSCEREVEFTDQ 516 L N TH + + ++ N++ + ++ T PT R+ + Sbjct: 208 LVNETVTHYDGNEENPEAPNTFNPEVQNNEQAVEQTIETTQEEEEYPTQPLSRKEKIVSI 267 Query: 517 DKNERDLLKGVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFGIYS 696 K+ ++ + G+ +V VG+ GS VP ++ K GV Y++ FG G+ +T + IY Sbjct: 268 LKSSKNYILGLACSVGVGVLGGSQFVPSRFEEKP--GVVYVVGFGFGSAGITSAILVIYY 325 Query: 697 SMLSFR-KIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATL-YLGLALGWPLVQCQLLV 870 R ++ P + P + + LW +GN + Y ++ LG +G PL Q L+V Sbjct: 326 IYYIIRYRVVLP--FHPKVAVFPCITACLWQVGNVMATYVSMSSLGFTIGLPLTQASLVV 383 Query: 871 SAMWAVFYYKEVR 909 + + + ++KE+R Sbjct: 384 AGICGLLFFKELR 396 >ref|XP_005789382.1| hypothetical protein EMIHUDRAFT_98339 [Emiliania huxleyi CCMP1516] gi|485642886|gb|EOD36953.1| hypothetical protein EMIHUDRAFT_98339 [Emiliania huxleyi CCMP1516] Length = 358 Score = 137 bits (344), Expect = 7e-30 Identities = 84/293 (28%), Positives = 137/293 (46%), Gaps = 7/293 (2%) Frame = +1 Query: 52 QLYKSTACFLTSWLILVYTPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWSGL 231 QLY S L+S L+L TP+ F++WG G +W+ + + + G GVA W Sbjct: 72 QLYFSAGVALSSILVLALTPFSFSFWGFAGASLWISSMMCGKIGIDGIGYGVAVATWGST 131 Query: 232 SLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTHTESS 411 ++ +S++WG +F E + ++ ALC +A G+ GV + S P E++ Sbjct: 132 TMIVSFLWGTLVFAERPSSVTGAVAALCTLAAGVAGVATAQSGSLG-------PPEAEAA 184 Query: 412 TTIKTNDEELHDSVTPLMSNSPTSSCEREVEFTDQDKNERDLLKGVLGAVFVGISNGSFM 591 N E + + G LGA+ G+ NGS M Sbjct: 185 AEAFLNPAEGRVGGAAARAGA-----------------------GWLGALGCGLLNGSLM 221 Query: 592 VPLKYAHKD-------IVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQPSLYIPGA 750 VP Y ++ VG+ Y+ +F G + + F +Y+ + FR PQP L Sbjct: 222 VPFHYFSEERSGQDGASVGMGYIATFATGVAAVQPIFFLLYARV-PFR--PQPPLLCSEL 278 Query: 751 TLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKEVR 909 LP + G+ W++GNF S +ATL+LG A+G+PL Q ++V+ +W ++ E+R Sbjct: 279 ALPGLITGVFWAIGNFESTFATLHLGQAVGYPLTQTCIVVAGLWGALFFGEIR 331 
>ref|XP_002182676.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217406022|gb|EEC45963.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 413 Score = 129 bits (325), Expect = 1e-27 Identities = 98/340 (28%), Positives = 161/340 (47%), Gaps = 52/340 (15%) Frame = +1 Query: 46 VRQLYKSTACFLTSWL-ILVYTPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 222 V Q YK++ CFLT WL IL+ ++T +G++ GL WV + IF +R AG+ VA W Sbjct: 44 VMQSYKTSVCFLTCWLVILLGEEPRWTPYGIVSGLFWVPGAAMGIFGIRNAGLAVAVGTW 103 Query: 223 SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRI-------RLF 381 S +++ S+ +G +F+E +K+ + LA + +G++G+ ++++++ R Sbjct: 104 SSITVLTSFFFGIIVFQERVKSFYQTCLAFGCLIIGLIGMSRFSAHQQQVDTLAVSYRSV 163 Query: 382 NNFPTHT--------ESSTTIKTNDEELHDSVT-PLMSNSPTSSCEREVEFTDQD----- 519 +H + +TI N S+T PL+ S E E TD + Sbjct: 164 KTAASHPLGLGQKLKRAGSTIAEN------SITVPLVGASGVIPMEIEPFATDGEDIVMG 217 Query: 520 ---------KNERDLL-----------KGVLGAVFVGISNGSFMVPLKYA--HKDIVGVE 633 +R +L G+LGAV G G ++PL +A +D+ G Sbjct: 218 TYDDAKSVLSKDRLVLFGGRVSLTRRQMGILGAVINGAWGGMNLIPLHFALQEEDMTGAG 277 Query: 634 YLLSFGIGAMTMTMLVF----GIYSSMLSFRKIPQ----PSLYIPGATLPAFLAGLLWSL 789 YL+S+ G++ + ++ G Y + P + +P +AGLL+S Sbjct: 278 YLISYATGSLIVNTCIWLAFLGYYLHQTNGHWNEAVDCLPKWHFEHLLIPGLMAGLLYSF 337 Query: 790 GNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKEVR 909 GNF SI A YLG G+ Q QL VS +W VF++KEV+ Sbjct: 338 GNFCSILAVTYLGQGTGFSFCQMQLFVSGLWGVFFFKEVQ 377 >ref|XP_002676812.1| predicted protein [Naegleria gruberi] gi|284090416|gb|EFC44068.1| predicted protein [Naegleria gruberi] Length = 383 Score = 126 bits (316), Expect = 1e-26 Identities = 82/323 (25%), Positives = 149/323 (46%), Gaps = 35/323 (10%) Frame = +1 Query: 46 VRQLYKSTACFLTSWLILVYTPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWS 225 V Q Y S L S ++L + +K++WW V G IWV + + +I +V + G VAQ W+ Sbjct: 30 VFQFYFSLVVGLMSLIVLAWNEFKWSWWAVAGSGIWVPSSLFSIVAVEYLGAAVAQSTWA 89 Query: 226 GLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTHTE 405 G + ++IWG +F+ I N L++ L +M +G+ G + S + T Sbjct: 90 
GCVIITNFIWGVTLFQSKIGNIYLTVFGLVIMIIGIFGTA-TCSKWNNPEPVAEKQSETS 148 Query: 406 SSTTIKTNDEELHDSVTPLMS---------------------NSPTSSCE---------- 492 + +++ + +E + TPL N PT E Sbjct: 149 INASVEESGQENNTETTPLYQQENSTNQQENISSDVPIYPSVNDPTLYSELSEIESTIGV 208 Query: 493 ---REVEFTDQDKNERDLLKGVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAM 663 + +F KN + G++ +V GI+ GS VP + + G+ Y+++FGIG+ Sbjct: 209 YETKSQKFIKILKNSKRYFIGLVASVLCGITGGSMFVPSRL--DEDTGLVYMVAFGIGSF 266 Query: 664 TMTMLVFGIYSSMLSFRKIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATLY-LGLALG 840 +T + +Y R + ++ + PA L LW GNFF+ Y ++ LGL +G Sbjct: 267 VITTAILIVYYVYYLIRFKKRVPFHLKLSIFPA-LTAFLWQTGNFFAYYVSVSPLGLTIG 325 Query: 841 WPLVQCQLLVSAMWAVFYYKEVR 909 PL + ++++ + + +++E+R Sbjct: 326 MPLTETAMVITGICGLVFFRELR 348 >gb|EJK61576.1| hypothetical protein THAOC_17910 [Thalassiosira oceanica] Length = 360 Score = 116 bits (291), Expect = 1e-23 Identities = 85/327 (25%), Positives = 146/327 (44%), Gaps = 41/327 (12%) Frame = +1 Query: 46 VRQLYKSTACFLTSWLILVYTPYK---------------FTWWGVLGGLIWVINGIVAIF 180 V Q YK+ A F+TS L++ + FT W + + WV G +F Sbjct: 43 VFQTYKAVAVFVTSLLLVAFCNLMHGTHPDSFDYWSFADFTHWAFVSAIFWVPGGTAGVF 102 Query: 181 SVRWAGIGVAQCLWSGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSN 360 +VR AG+ ++ LWS + + +SY+WG IF E ++ ++ A+ +M +G++G+ S Sbjct: 103 AVRRAGLAISTGLWSCVIILLSYLWGVLIFHEKQESAVGAVGAVLLMCVGLIGIAHFSS- 161 Query: 361 KRRIRLFNNFPTHTESSTTIKTNDEELHDSVTPLMSNSPTSSCEREV-EFTDQDKNERDL 537 I + + +++ D TPL + + + ++ + T Q Sbjct: 162 ---IEVRPGLDQARAAPRSVEECRPACSDETTPLNGINRANDAQFDLAKLTSQ------- 211 Query: 538 LKGVLGAVFVGISNGSFMVPLKYAHKDIV-GVEYLLSFGIGAMTMTMLVFGIYSSMLSFR 714 L G+ AV G+ S M+PL YA + G+ Y +SFGI A+ + + + I L+ Sbjct: 212 LPGLFAAVLNGLFAASIMLPLHYAPPNTTKGIGYSMSFGIAAVVVVFIFWTIRLLALTAA 271 Query: 715 KIPQ------------------------PSLYIPGATLPAFLAGLLWSLGNFFSIYATLY 822 + PS + P F AGLL+S GN F I + + Sbjct: 272 EFAAKQNEAKRITPNIIRESLREGYSQLPSFHFSEMWRPGFTAGLLYSGGNLFGIVSIQH 331 Query: 823 LGLALGWPLVQCQLLVSAMWAVFYYKE 903 LG +G+ L Q +++S W +F+Y+E Sbjct: 332 
LGNFMGYSLNQSSMIISGCWGLFWYRE 358 >ref|XP_005838126.1| hypothetical protein GUITHDRAFT_66230 [Guillardia theta CCMP2712] gi|428182285|gb|EKX51146.1| hypothetical protein GUITHDRAFT_66230 [Guillardia theta CCMP2712] Length = 341 Score = 100 bits (249), Expect = 8e-19 Identities = 73/298 (24%), Positives = 132/298 (44%), Gaps = 21/298 (7%) Frame = +1 Query: 85 SWLILVYTPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWSGLSLFISYIWGAY 264 S +L +P +++ WG G L+ A +V G +W G+ + ++++WG Sbjct: 50 SLALLKGSPVRWSSWGAAGALLLTATQCCAWPAVGALGAAAGPGIWCGVGMSVAFMWGTI 109 Query: 265 IFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTHTESSTTIKTNDEELH 444 +F+E +++ L ++AL ++ G++G+ S+ + RL Sbjct: 110 VFQEAVRSLALCIVALILLFFGIVGISLVQSSMLQ-RLLGE------------------- 149 Query: 445 DSVTPLMSNSPTSSCEREVEFTDQDKNERDLLKGVLGAVFVGISNGSFMVPLKY------ 606 T LMS ++ R + GVL A+ G+ +GS M P K Sbjct: 150 SGATGLMSEEESNKTGRA-----------RIAVGVLLALMTGLFDGSLMAPFKAYLASHP 198 Query: 607 -------------AHKDIVGVEYLLSFGIGAMTMT--MLVFGIYSSMLSFRKIPQPSLYI 741 + D+V EYL SF + + LV ++ + P S + Sbjct: 199 SLVSSSSSSSSSSSSSDVVVFEYLGSFALALPVVAGGSLVLIMFYQHRALNSGPDRSSFR 258 Query: 742 PGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKEVRTK 915 A P F AG+LW++GN S++ATL LG ++G+P+ Q +++SA+W + +KE+ + Sbjct: 259 QAA-YPGFCAGVLWAVGNVLSVHATLELGQSIGFPMTQSCVVISALWGIVVFKEMTAR 315 >ref|XP_002185821.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|209582670|gb|ACI65291.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 451 Score = 99.8 bits (247), Expect = 1e-18 Identities = 49/128 (38%), Positives = 76/128 (59%), Gaps = 7/128 (5%) Frame = +1 Query: 544 GVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMT----MTMLVFG---IYSSM 702 G++ A+F G+ GS M P+K+ D G +LLSF IGA M ++ +G ++ Sbjct: 294 GMVAAMFCGVWGGSIMAPMKFCQSDTKGTHFLLSFSIGASIVNTGMWLVRYGYNVLHYQS 353 Query: 703 LSFRKIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMW 882 S PS ++ L L+G+LWS+GNFFS+ + YLG +G+PLVQ ++VS +W Sbjct: 354 CSKAYASLPSFHLHTMWLAGGLSGMLWSIGNFFSLISVFYLGQGVGYPLVQTSIIVSGLW 413 Query: 
883 AVFYYKEV 906 +FY+KE+ Sbjct: 414 GIFYFKEI 421 Score = 87.0 bits (214), Expect = 9e-15 Identities = 40/102 (39%), Positives = 64/102 (62%), Gaps = 1/102 (0%) Frame = +1 Query: 46 VRQLYKSTACFLTSWLILVY-TPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 222 V Q YK+ CF TSWL+L+ P+ FT WG++ GL WV G IF+V+ AG+ + + Sbjct: 49 VFQTYKTFMCFATSWLVLLAGEPFTFTPWGIVSGLFWVPGGTATIFAVKNAGLAIGIGIG 108 Query: 223 SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGF 348 S + +S+IWG ++F+E + + + LA+ M LG+LG+ + Sbjct: 109 SSFIVLVSFIWGIFVFEEAVHSKTGACLAIFSMMLGLLGMSY 150 >gb|EJK47010.1| hypothetical protein THAOC_34300, partial [Thalassiosira oceanica] Length = 641 Score = 87.4 bits (215), Expect = 7e-15 Identities = 41/102 (40%), Positives = 65/102 (63%), Gaps = 1/102 (0%) Frame = +1 Query: 46 VRQLYKSTACFLTSWLI-LVYTPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLW 222 V Q YKS CFLTSWL+ L+ FT WG++ GL WV G IF++R AG+ ++Q + Sbjct: 41 VMQSYKSLMCFLTSWLVVLLGVEVTFTPWGIVSGLFWVPGGAFNIFAIRNAGLAISQGIV 100 Query: 223 SGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGF 348 + + +S+IWG IFKEP+ + ++ A+ ++ LG+ G+ + Sbjct: 101 ASSIVMVSFIWGNIIFKEPVHSEVIAYSAVWLIMLGLYGMSY 142 >ref|XP_641212.1| transmembrane protein 144 A [Dictyostelium discoideum AX4] gi|74855822|sp|Q54V96.1|T144B_DICDI RecName: Full=Transmembrane protein 144 homolog B; AltName: Full=Transmembrane protein 144 homolog 2 gi|60469237|gb|EAL67232.1| transmembrane protein 144 A [Dictyostelium discoideum AX4] Length = 358 Score = 86.7 bits (213), Expect = 1e-14 Identities = 71/282 (25%), Positives = 125/282 (44%), Gaps = 16/282 (5%) Frame = +1 Query: 112 YKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWSGLSLFISYIWGAY----IFKEP 279 Y F WG+LGG +W I I V+ GIG+ LW S+ Y G + I K+ Sbjct: 59 YIFDPWGLLGGTLWSIGNFCVIPIVKTIGIGLGLLLWCCSSIITGYFTGKFGWFGIDKQK 118 Query: 280 IKNHGLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTHTESSTTIKTNDEELHDSVTP 459 + + L+ + + ++ F + +S D ++S+ Sbjct: 119 VSHPALNWIGFACIVAAVIFFFFIEPTIEEKDEHSYSSIVDDSEIGNNGIDNNGYNSINN 178 Query: 460 LMSNSPTSSCEREVEFTDQDKN---ER-----DLLKGVLGAVFVGISNGSFMVPL---KY 606 +N+ ++ R F 
Q K ER + + G++ +VF GI G MVP+ K Sbjct: 179 --NNNNGNNKRRSGAFNKQPKKSIFERMPPPYNTILGIVLSVFSGIMYGVNMVPMQLWKQ 236 Query: 607 AHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQPSLYIPGATLPAFLAGLLWS 786 ++ D + ++ G VF +YS ++ +P P P+F +GLLW Sbjct: 237 SNVDASPLSFVFCHFSGIFLANTAVFIVYSIIV------RPPQIFPQTIFPSFFSGLLWG 290 Query: 787 LGNFFSIYATLYLGLALGWPLVQ-CQLLVSAMWAVFYYKEVR 909 + N + AT LG +G+P+ ++VS++W+VFY++E++ Sbjct: 291 IANVGLMVATQNLGYTIGFPMGSGGPMIVSSLWSVFYFREIQ 332 >ref|XP_003294072.1| hypothetical protein DICPUDRAFT_90514 [Dictyostelium purpureum] gi|325075525|gb|EGC29401.1| hypothetical protein DICPUDRAFT_90514 [Dictyostelium purpureum] Length = 337 Score = 86.3 bits (212), Expect = 1e-14 Identities = 67/269 (24%), Positives = 132/269 (49%), Gaps = 12/269 (4%) Frame = +1 Query: 136 LGGLIWVINGIVAIFSVRWAGIGVAQCLWSGLSLFISYIWGA---YIFKEPIKNHG-LSL 303 +GG +W ++ I +++ G+G+A L+S + + +I G + KE H ++ Sbjct: 53 MGGSLWCCANLLVIPTIKLLGLGLAVLLYSSIGIVAGFIVGKAGLFGLKEAAAAHDWMNY 112 Query: 304 LALCVMALGMLGVGF---SVSNKRRIRLFNNFPTHTESSTTIKTNDEELHDSVTPLMSNS 474 L L + L ++ F ++ +++ N+ + + I +DEE++ +PL+ N+ Sbjct: 113 LGLAGIILSVIFFFFIKPNLEEEKKADTKGNYHGSYDDFSNI--SDEEIN---SPLIVNT 167 Query: 475 PTSSCEREVEFTDQDKNERDLLKGVLGAVFVGISNGSFMVPL---KYAHKDIVGVEYLLS 645 T EV D+ N + GV+ A+ +GI G M+P+ K + D ++Y S Sbjct: 168 QTQIKNYEVSIYDRIPNRLKTVSGVVFALVIGILLGVNMIPMQLWKQRNPDANPLDYTFS 227 Query: 646 FGIGAMTMTMLVFGIYSSMLSFRKIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATLYL 825 G VF +Y+ I +P P LP+F +G++ + N + +T L Sbjct: 228 QFSGIFLANTFVFILYTI------IKRPPQIYPQTILPSFCSGVVLGVANIGLMISTENL 281 Query: 826 GLALGWPLVQCQ--LLVSAMWAVFYYKEV 906 G +G+P + C ++VS++W++FY++E+ Sbjct: 282 GYTVGYP-ISCSGPMIVSSLWSIFYFREI 309 >ref|XP_004350600.1| transmembrane protein [Dictyostelium fasciculatum] gi|328865506|gb|EGG13892.1| transmembrane protein [Dictyostelium fasciculatum] Length = 435 Score = 85.1 bits (209), Expect = 3e-14 Identities = 70/286 (24%), Positives = 126/286 (44%), Gaps = 20/286 (6%) Frame = +1 Query: 112 
YKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWSGLSLFISYIWGAY----IFKEP 279 Y F WG+LGG +W + + I V+ G+G+ LWS SL ++ G + + K+ Sbjct: 145 YIFDPWGLLGGSLWSVGNLCVIPIVKTIGLGLGLLLWSCTSLVTGFLIGKFGAFGLDKQS 204 Query: 280 IKNHGLSLLALCVMALGMLGVGF-----------SVSNKRRIRLFNNFPTHTESSTTIKT 426 + + L+ L + + +L F + S KR + + P E +I + Sbjct: 205 VAHPVLNWLGFSAIVVAILFFFFIKPTLNKEEPTTPSKKRLSQRYEYSPIVDEQQISINS 264 Query: 427 NDEELHDSVTPLMSNSPTSSCEREVEFTDQDKNERDLLKGVLGAVFVGISNGSFMVPL-- 600 + NS + E + N+ + GV+ +VF G+ G MVP+ Sbjct: 265 TE------------NSAPVEGQMIFEKIPEPYNK---IFGVMLSVFSGVLYGVNMVPMQL 309 Query: 601 --KYAHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQPSLYIPGATLPAFLAG 774 + D+ + ++ G VF +YS I +P P LP+F++G Sbjct: 310 WKQQQPSDVNPLSFIFCHFSGIFLFNTAVFFVYSI------IKRPPQVFPQTMLPSFISG 363 Query: 775 LLWSLGNFFSIYATLYLGLALGWPLVQC-QLLVSAMWAVFYYKEVR 909 +LW + N + AT LG +G+P+ ++VS++W+V +KE++ Sbjct: 364 VLWGVANCGLMVATQILGYTIGFPIGSSGPMVVSSLWSVLLFKEIQ 409 >ref|XP_001775344.1| predicted protein [Physcomitrella patens] gi|162673289|gb|EDQ59814.1| predicted protein [Physcomitrella patens] Length = 344 Score = 85.1 bits (209), Expect = 3e-14 Identities = 70/279 (25%), Positives = 114/279 (40%), Gaps = 4/279 (1%) Frame = +1 Query: 79 LTSWLILVYTPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWSGLSLFISYIWG 258 L+S L+L + F G+L G+ +V++ I +VR G+ VA +W+G + + W Sbjct: 78 LSSLLLLFKYKFVFALEGLLSGVFFVLSFINIFRAVRLLGVSVAYGIWAGTAAIVGVAWS 137 Query: 259 AYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTHTESSTTIKTNDEE 438 + EP + F HT+ T I++ Sbjct: 138 GQMSWEP-------------------------------QDFYEDDDHTQ--TLIQSQPSF 164 Query: 439 L----HDSVTPLMSNSPTSSCEREVEFTDQDKNERDLLKGVLGAVFVGISNGSFMVPLKY 606 H + + S S ER + R GV AV GI G M+P Sbjct: 165 AGWVQHRKLWDVAGQS--KSGERPKNVLTGEPASRSFPAGVFSAVLAGILGGLVMIPANQ 222 Query: 607 AHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQPSLYIPGATLPAFLAGLLWS 786 A G +L SFGI T +V +S+ P L A P L+G +++ Sbjct: 223 APDMAQGNAFLPSFGIAVAIFTPIV----TSLPYLSGCELPDLSAREAAGPGILSGFIYN 278 Query: 787 LGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFYYKE 903 +GN +I A Y+G ++ +PL QC 
++V+ +W + Y++E Sbjct: 279 IGNMLNIVAIFYVGSSVAYPLFQCGIIVAGIWGMLYFEE 317 >ref|XP_003293389.1| hypothetical protein DICPUDRAFT_50939 [Dictyostelium purpureum] gi|325076279|gb|EGC30078.1| hypothetical protein DICPUDRAFT_50939 [Dictyostelium purpureum] Length = 348 Score = 84.3 bits (207), Expect = 6e-14 Identities = 70/275 (25%), Positives = 122/275 (44%), Gaps = 9/275 (3%) Frame = +1 Query: 112 YKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWSGLSLFISYIWGAYIFKEPIKNH 291 Y F WG+LGG +W + I V+ G+G+ LW S+ + G + Sbjct: 64 YLFDPWGLLGGTLWSLGNFCVIPIVKTIGLGLGLLLWCCCSIVAGFFTGKFGL------F 117 Query: 292 GLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTHTESSTTIKTNDEELHDSVTPLMSN 471 GL + A+ +G G V+ I F E+ T + + D P+++ Sbjct: 118 GLEKQVVSHPAMNWIGFGCIVA---AIVFFFFIKPTLENEDTESNSYSSIVDDY-PIINE 173 Query: 472 SPTSSCEREVEFTDQDKNER--DLLKGVLG---AVFVGISNGSFMVPL---KYAHKDIVG 627 + + + T++ ER +K +LG A+F GI G MVP+ K + Sbjct: 174 AGYRGSIQSSKLTEKSFFERIPQPMKTMLGIGLAIFSGIMYGVNMVPMQLWKQSDPSANP 233 Query: 628 VEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQPSLYIPGATLPAFLAGLLWSLGNFFSI 807 + ++ G +VF +Y+ I +P P LP+F +G+LW + N + Sbjct: 234 LSFIFCHFSGIFIFNTIVFFVYAI------IKRPPQVFPQTMLPSFFSGVLWGIANCGLM 287 Query: 808 YATLYLGLALGWPL-VQCQLLVSAMWAVFYYKEVR 909 AT LG +G+P+ ++VS++W+V Y+KE++ Sbjct: 288 VATQNLGYTVGFPMGASGPMVVSSIWSVVYFKEIQ 322 >gb|ABK23204.1| unknown [Picea sitchensis] Length = 196 Score = 84.3 bits (207), Expect = 6e-14 Identities = 41/125 (32%), Positives = 71/125 (56%) Frame = +1 Query: 535 LLKGVLGAVFVGISNGSFMVPLKYAHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFR 714 L +GV+ A+ GI G M+P+ + I GV YL SF IG +V I LS + Sbjct: 51 LSQGVIAALLTGILGGLIMMPMTQSPPAIQGVSYLPSFAIGVAIFAPVVTAI--PYLSTQ 108 Query: 715 KIPQPSLYIPGATLPAFLAGLLWSLGNFFSIYATLYLGLALGWPLVQCQLLVSAMWAVFY 894 + P+ LY+ LP ++G++W++GN S+ A +G + +P++ C + ++ +W +F Sbjct: 109 ECPRMELYV--GALPGIISGIVWNIGNILSMLAIGIIGYTIAYPILYCGIFIAGLWGMFL 166 Query: 895 YKEVR 909 +KE+R Sbjct: 167 FKEIR 171 >ref|XP_005758882.1| hypothetical protein EMIHUDRAFT_121297 [Emiliania huxleyi CCMP1516] gi|485606731|gb|EOD06453.1| 
hypothetical protein EMIHUDRAFT_121297 [Emiliania huxleyi CCMP1516] Length = 334 Score = 81.6 bits (200), Expect = 4e-13 Identities = 72/290 (24%), Positives = 129/290 (44%), Gaps = 28/290 (9%) Frame = +1 Query: 124 WWGVLGGLIWVINGIVAIFSVRWAGIGVAQCLWSGLSLFISYIWGAYIFKEPIKNHGLSL 303 W G + W + +++V GI V Q +GL + ++ +WGA +F E ++ G + Sbjct: 30 WRGAASAICWAPATVAYVYAVNAIGITVTQTCVAGLLVAVNVLWGACMFSEQLE--GTCI 87 Query: 304 LALCVMALGML-GVGFSVSNKRRIRLFNNFPTHTE-------SSTTIKTNDEEL------ 441 + L + G+L G+ ++R++ +H + +S+T T+ E+L Sbjct: 88 MGLALTTCGILAGINAKKFSERQMNRIAGQHSHLDLSQPVQAASSTAPTDQEQLVEKGAL 147 Query: 442 --HDSVTPLMSNSPTSSCEREVEFTDQDKNERDLLKGVLGAVFVGISNGSFMVP---LKY 606 H+ V + +++ + G+ F GI GS MVP + Sbjct: 148 RSHELVVSNGGGAHAGGGQQQPAAAHPVPAPSEWRVGMAAVCFNGIWGGSIMVPTHGMSR 207 Query: 607 AHKDIVGVEYLLSFGIGAMTMTMLVFGIYSSMLSFRKIPQPSLYIPGATL---PAFLAGL 777 A D Y L+FG A+ + +++ +++R +P LY+ + +G Sbjct: 208 APADY--ASYSLAFGSTALLVIAVLW-----CINWRSVP---LYVAQCRVVWRRGLASGC 257 Query: 778 LWSLGNFFSIYAT------LYLGLALGWPLVQCQLLVSAMWAVFYYKEVR 909 LW++GN S A LGLALG+ VQC L+VS+ W V ++EV+ Sbjct: 258 LWAVGNVCSAVAVNGWGDMAGLGLALGYSAVQCNLVVSSTWGVCLFREVQ 307 >gb|EJK73333.1| hypothetical protein THAOC_05052 [Thalassiosira oceanica] Length = 313 Score = 80.5 bits (197), Expect = 8e-13 Identities = 45/140 (32%), Positives = 72/140 (51%), Gaps = 2/140 (1%) Frame = +1 Query: 46 VRQLYKSTACFLTS--WLILVYTPYKFTWWGVLGGLIWVINGIVAIFSVRWAGIGVAQCL 219 V Q YK+T CFL S ++L+ +FT WG+L G+ WV G I+ +R AG+ VA Sbjct: 43 VMQSYKTTLCFLMSSPMVMLLGERPRFTHWGILSGVFWVPGGAAGIYGIRKAGLAVAVGT 102 Query: 220 WSGLSLFISYIWGAYIFKEPIKNHGLSLLALCVMALGMLGVGFSVSNKRRIRLFNNFPTH 399 WS L + S+ WG ++F E +K+ + A + LG++G+ S + + + + Sbjct: 103 WSSLVVLTSFFWGIHVFGERVKSPNGAAGACLTLILGLIGMANFSSKGKPKKKEKDICSK 162 Query: 400 TESSTTIKTNDEELHDSVTP 459 E+ T D E + TP Sbjct: 163 AETLIRDSTRDLESQQTSTP 182