BLASTX nr result
ID: Ephedra26_contig00007941
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra26_contig00007941
         (2341 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_005847678.1| hypothetical protein CHLNCDRAFT_134071 [Chlo...   216   4e-53
ref|XP_003055315.1| predicted protein [Micromonas pusilla CCMP15...   196   5e-47
ref|XP_002507025.1| predicted protein [Micromonas sp. RCC299] gi...   175   9e-41
emb|CCO17906.1| predicted protein [Bathycoccus prasinos]              172   5e-40
ref|XP_005789384.1| hypothetical protein EMIHUDRAFT_449164, part...   163   4e-37
ref|XP_001745584.1| hypothetical protein [Monosiga brevicollis M...   155   1e-34
ref|XP_005838507.1| hypothetical protein GUITHDRAFT_102791 [Guil...   148   9e-33
gb|EGB12079.1| hypothetical protein AURANDRAFT_61406 [Aureococcu...   144   2e-31
ref|XP_004998524.1| hypothetical protein PTSG_01049 [Salpingoeca...   142   7e-31
ref|XP_005717155.1| unnamed protein product [Chondrus crispus] g...   142   9e-31
ref|XP_001422117.1| predicted protein [Ostreococcus lucimarinus ...   130   3e-27
ref|XP_003084132.1| unnamed protein product, partial [Ostreococc...   120   3e-24
gb|ETO26699.1| hypothetical protein RFI_10435 [Reticulomyxa filosa]    90   4e-15
gb|EJK51783.1| hypothetical protein THAOC_29017, partial [Thalas...    61   3e-06
ref|YP_002960771.1| hypothetical protein MCJ_002570 [Mycoplasma ...    61   3e-06

>ref|XP_005847678.1| hypothetical protein CHLNCDRAFT_134071 [Chlorella variabilis]
 gi|307107333|gb|EFN55576.1| hypothetical protein CHLNCDRAFT_134071 [Chlorella variabilis]
          Length = 451

 Score = 216 bits (550), Expect = 4e-53
 Identities = 144/450 (32%), Positives = 224/450 (49%), Gaps = 23/450 (5%)
 Frame = +2

Query: 641  LRLVLMTLEYRKSTFSGNGVYAQSIARSLANVGNHVLVVSARPVNRTSSSATDTKRPIQE 820
            ++ +L+++E+ TFSGNGVYA S AR+L+ +G+ VLV++ P +S++A
Sbjct: 1    MKFLLLSIEFNAGTFSGNGVYACSQARALSQLGHEVLVIAGAPPGHSSAAAAPEDGGNGG 60

Query: 821  AGL-RLTEMEVDVDGAQWGRLDWKCPWQSFADGI-TQNVVKTIMDFQPQWILVVDWSSLP 994
            AG+ R+ E+ V WGRLD C W+ +A V + F L VDW S+
Sbjct: 61   AGMQRVVHFELPV----WGRLDAGCGWREYAACAGVPTVAAQVAAFGAAAALGVDWHSVG 116

Query: 995  AFKHLSE-LAGQQKWTMGFLNFRIYYISEYNGEQGLLEKDFYKRMESEAVSMASAIAALS 1171
            A+ L+ L +LN+R+Y+ + E ++E E A+ ++ LS
Sbjct: 117  AYDALAAALPAAALPPFVYLNYRVYHRTASAEELAVIEG-----REQRALRLSRCSMVLS 171

Query: 1172 SRDAHILANELGVGLSPGVVPKPLFPPLREDIRSMAISRLRVLS---------VWSKDRK 1324
            DA L + L P LR D+ + L S W+ R+
Sbjct: 172  RSDADYLRRHFPAATAAAAPLHVLLPALRSDMERLPAPGLGAESEEAGGGGGLAWATGRR 231

Query: 1325 YISCCVRLSPEKNADLFASIIESISS--FLIRRGIVPFLCGGAHGSGGDYAESIKQRVKV 1498
            +++CCVRLSPEK F ++E +++ L R G+VP +CG G Y +++R++
Sbjct: 232  HLACCVRLSPEKEPHRFVEVVEVMAARGSLERLGVVPAMCGA--GWNSPYGADLRRRLQQ 289

Query: 1499 AVPDAVVYEGFMGPEQLAEVYSKTLLNVHPCIYDAYG---------MTIVEAAAFEAPSI 1651
            VP VV++ FMGP LA++Y+ TLLN+HP YDAYG MTIVEAA+ APS+
Sbjct: 290  NVPQCVVHDSFMGPGDLAQLYAATLLNLHPPTYDAYGESASFPLLCMTIVEAASQGAPSL 349

Query: 1652 XXXXXXXXXXXXEFLESDKGQIFALDLNSPMHALSDKLEKILEDTERLSQTGKAASGRSL 1831
            + L S G++F D+ P+ L+D +E +L D RL+ G+ A ++
Sbjct: 350  --VQEGGHVGATDLLSSQAGEVFLCDMEQPVGQLADLVEALLADRARLAAAGRLAQAKAR 407

Query: 1832 SWNEDANAQQLINILDSSSGSFMKNPKPSS 1921
            SW E NAQ L+ + + P P++
Sbjct: 408  SWTEHDNAQALVQHVQEALAGGEAAPAPAA 437

>ref|XP_003055315.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463289|gb|EEH60567.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 668

 Score = 196 bits (497), Expect = 5e-47
 Identities = 152/467 (32%), Positives = 226/467 (48%), Gaps = 42/467 (8%)
 Frame = +2

Query: 602  PSDPMGTIKRNEPLRLVLMTLEYRKSTFSGNGVYAQSIARSLANVGNHVLVVSARPVN-- 775
            P+ P PLR++++TLE+ STFSGNGVYA+S AR+LA G+ VLVVS RP +
Sbjct: 5    PAAPASPPPPPRPLRVLVITLEFAASTFSGNGVYARSQARALARAGHRVLVVSGRPDDLD 64

Query: 776  --RTSSSATDTKRPIQEAGLRLTEMEVDVDGAQWGRLDWKCPWQSFAD--GITQNVVKTI 943
            S+A+D +A R V V A WGRLD C +FA ++ + + I
Sbjct: 65   DAAAESAASDPTADDDDAS-RPIVRTVPVPAATWGRLDASCGHAAFASAAAASREITRAI 123

Query: 944  MDFQPQWILVVDWSSLPAFKHL-----SELAGQQKWTMGFLNFRIYYISEYNGEQGLLEK 1108
            P+ +L VDWSS PA+ L S ++ Q + NFR++ S + +G
Sbjct: 124  ASHAPEVVLGVDWSSYPAWAALRDALPSSISQQPVPPFVYSNFRVF--SRTDDARGT--- 178

Query: 1109 DFYKRMESEAVSMASAIAALSSRDAHILANEL---GVGLSPGVVPKPLFPPLREDIRSMA 1279
            ++ +E+ AV+ A+A+ AL DA +A L G ++P VV PPLRED+R+MA
Sbjct: 179  --HRALEAAAVAGAAAVVALCEDDADFVAARLAPRGAAVAPFVV----LPPLREDVRAMA 232

Query: 1280 ISRLRVLSVWSKD------RKYISCCVRLSPEKNADLFASIIESI--------------- 1396
            + R+Y++C VRLSPEK + F ++ +
Sbjct: 233  TEMASEEEEETHGDEALPRRRYVTCVVRLSPEKEPERFVDLVRELRARGTFDDDGRNGDD 292

Query: 1397 SSFLIRRGIVPFLCGGAHGSGGDYAESIKQR-VKVAVPDAVVYEGFMGPEQLAEVYSKTL 1573
            + + P LC G+ YA+ I+ R V+ + V F+ L +V+S+T
Sbjct: 293  ACVTTASSLTPVLCASTSGA---YADDIRARFVEASGGKGKVVADFVDARGLRDVFSRTT 349

Query: 1574 LNVHPCIYDAYGMTIVEAAAFEAPSI-----XXXXXXXXXXXXEFLESDKGQIFALD-LN 1735
            LNVHPC DAYGMT+VEAAAF APSI F+ +D A +
Sbjct: 350  LNVHPCARDAYGMTVVEAAAFGAPSIVQGGGGVGCTGLVGFEDAFVRADWTAAAAAEGGG 409

Query: 1736 SPMHALSDKLEKILEDTERLSQTGKAASGRSLSWNEDANAQQLINIL 1876
            + ++D + L D + L A +L+W+E ANA + ++L
Sbjct: 410  GGVAGVADVARRALADADALRAIAARARTAALAWDEAANASAVADVL 456

>ref|XP_002507025.1| predicted protein [Micromonas sp. RCC299]
 gi|226522300|gb|ACO68283.1| predicted protein [Micromonas sp. RCC299]
          Length = 694

 Score = 175 bits (443), Expect = 9e-41
 Identities = 152/494 (30%), Positives = 218/494 (44%), Gaps = 82/494 (16%)
 Frame = +2

Query: 641  LRLVLMTLEYRKSTFSGNGVYAQSIARSLANVGNHVLVVSARPVNRTSSSATDTKRPIQE 820
            LR++ +TLE+R TFSGNGV AQS A LA G+ VLVVS P N + D++R I +
Sbjct: 8    LRVLFLTLEFRHGTFSGNGVLAQSQAHGLAKAGHAVLVVSGCPDNLDVTD--DSQREISQ 65

Query: 821  AGLRLTEMEVDVDGAQWGRLDWKCPWQSFADGI--TQNVVKTIMDFQPQWILVVDWSSLP 994
            + + V ++WG+L +CPW+ AD + + + I F P +L +DW P
Sbjct: 66   TAGAVEVRTLRVPASKWGKLTARCPWRELADAAHADEPLHERIRSFAPDVVLGIDWHVTP 125

Query: 995  AFKHLSELAGQ-----QKWTMGFLNFRIYYISEYN----GEQGLLEKD---FYKRMESEA 1138
            ++ L + ++ +N I S Y + +D + E +A
Sbjct: 126  TWRALRRRVWRDSDAARETDPNGMNPDIASSSPYPPFVYSNYRVFARDGDPTHAAKERDA 185

Query: 1139 VSMASAIAALSSRDAHILANEL---GVGLSPGVVPKPLFPPLREDIRSMAI--------- 1282
            V A+A+ AL DA L EL G ++PGVV PLRED+ ++A
Sbjct: 186  VEEAAAVVALCQVDADYLCEELSPEGAAVAPGVV----IAPLREDVHALATHAQTPMTPA 241

Query: 1283 --------------SRLRVLSVWSKD--------RKYISCCVRLSPEKNADLFASIIESI 1396
            L ++ W D R ++CCVRLSPEK + F S+
Sbjct: 242  TTAENGVELDETGAQHLERVAPWRLDAVPGQNTPRFLLTCCVRLSPEKEPERFVSLCAE- 300

Query: 1397 SSFLIRRG-------------IVPFLCGGAHGSGGDYAESIKQRVKVAVPD-AVVYEGFM 1534
            L+RRG IVP LC + GDYAE+I+ + A +V F+
Sbjct: 301  ---LVRRGVVLGGTNEPGTTEIVPVLCAS---TAGDYAETIRAKFLDATRGKGLVCRDFL 354

Query: 1535 GPEQLAEVYSKTLLNVHPCIYDAYGMTIVEAAAFEAPSIXXXXXXXXXXXXEFLESDKGQ 1714
            +A VY +T+LNVHPC YDAYGMT+VEAAAF APS L S+ G
Sbjct: 355  DARGMASVYRRTVLNVHPCAYDAYGMTVVEAAAFGAPSCVQRGDSVGCCA---LLSETGN 411

Query: 1715 IF-----ALDLNSPMHALSDKLEKIL---------------EDTERLSQTGKAASGRSLS 1834
            F D + ++D +E L +D + L G A R+L
Sbjct: 412  EFVPVDWTKDKGDKVDLIADAVEAYLRAGWDGSKRGGDEHHDDDDSLRAIGTRARRRALG 471

Query: 1835 WNEDANAQQLINIL 1876
            W+ A + + IL
Sbjct: 472  WDLFACGRDIAKIL 485

>emb|CCO17906.1| predicted protein [Bathycoccus prasinos]
          Length = 586

 Score = 172 bits (437), Expect = 5e-40
 Identities = 133/449 (29%), Positives = 222/449 (49%), Gaps = 37/449 (8%)
 Frame = +2

Query: 623  IKRNEPLRLVLMTLEYRKSTFSGNGVYAQSIARSLAN-----------VGNHVLVVSARP 769
            +K+ + +++ ++ EY TFSGNGV A S R N + +LVV ARP
Sbjct: 6    VKKTKKTKVLFISYEYTFGTFSGNGVLAASTVRGFLNDDSDFDDDDDAMTTEILVVCARP 65

Query: 770  VNRTSSSATDTKRPIQEAGLRLTEMEVDVDGAQWGRLDWKCPWQSFADGITQNVVKTIMD 949
            + + ++ G+ + V V ++WGRLD C ++ FA +++ ++ I+
Sbjct: 66   TELEEELRMEKEDEEEDKGV----LSVPVPKSKWGRLDKTCAYEEFATNVSEKQLEAIVK 121

Query: 950  FQPQWILVVDWSSLPAFKHLSELAGQQKWT--------MGFLNFRIYYISEYNGEQGLLE 1105
            F+ +L VDWSSL ++ + + +++ + +LN+R++ L
Sbjct: 122  FRADVVLGVDWSSLEVYERIRKRYQEEEEDGFSPPPPPLTYLNYRVF----------TLH 171

Query: 1106 KDFYKRMESEAVSMASAIAALSSRDAHILANELGVGLSPG---VVPKPLFPPLREDI--- 1267
            + ++ +E ++ ++ + L DA L + VG G VV L PPLRED+
Sbjct: 172  DESHRALEQRMITASTHVICLCINDAKFLRDAF-VGGDDGKEMVVAVVLHPPLREDVLRD 230

Query: 1268 RSMAISRLRVLSVWS-----KDRKYISCCVRLSPEKNADLFASIIESISSF-LIRRGIVP 1429
            A++R + V S K R ++C VRLS EK + F ++E ++ +VP
Sbjct: 231  AKNALAREQPSPVTSTTGAKKKRNALTCVVRLSKEKEPERFVELVEELAKRDAFGDTLVP 290

Query: 1430 FLCGGAHGSGGDYAESIKQRVKVA---VPDA--VVYEGFMGPEQLAEVYSKTLLNVHPCI 1594
            L A ++AE +K+R + A +P+A + E F+ QL E+YS+T LNVHPC
Sbjct: 291  ILVAPATS---EFAEGLKKRFREATKNIPNASELCIESFLNATQLGEIYSRTKLNVHPCR 347

Query: 1595 YDAYGMTIVEAAAFEAPSIXXXXXXXXXXXXEFLESDKGQIFALDL-NSPMHALSDKLEK 1771
            DAYGMT++EAAAF APS+ E L ++G +F+ D N + AL+D +
Sbjct: 348  KDAYGMTVIEAAAFGAPSL--VCSGGKVGSSELLMENQG-VFSFDYENKSVEALADFITN 404

Query: 1772 ILEDTERLSQTGKAASGRSLSWNEDANAQ 1858
            I D RL + K R+L+++E A+
Sbjct: 405  I--DENRLEEVAKIGRERALAYDETEYAR 431

>ref|XP_005789384.1| hypothetical protein EMIHUDRAFT_449164, partial [Emiliania huxleyi CCMP1516]
 gi|485642888|gb|EOD36955.1| hypothetical protein EMIHUDRAFT_449164, partial [Emiliania huxleyi CCMP1516]
          Length = 423

 Score = 163 bits (412), Expect = 4e-37
 Identities = 129/435 (29%), Positives = 203/435 (46%), Gaps = 32/435 (7%)
 Frame = +2

Query: 641  LRLVLMTLEYRKSTFSGNGVYAQSIARSLANVGNHVLVVSARPVNRTSSSATDT---KRP 811
            ++++L+T E+ + FSGNG+ ++S+A+SL ++G + V+ RP + A D +
Sbjct: 1    MKVLLITYEFTHAPFSGNGMLSRSLAKSLLSLGASLRVICCRPAPTLAGLAGDNHLAEPE 60

Query: 812  IQEAGLRLTEMEVDVDGAQWGRLDWKCPWQSFADGITQNVVKTIMDFQPQWILVVDWSSL 991
            + + L +++ + A W RLD ++ F G + + F P+ ++ VDW+
Sbjct: 61   VARPDIELWTIQLQ-EAAGWKRLDDASAYKDFWAG-AGGMGAAVARFAPEAVVAVDWTGA 118

Query: 992  PAFKHLSELAGQQKWTMGFLNFRIYYISEYNGEQGLLEKDFYKRMESEAVSMASAIAALS 1171
            A++ L E +G + +LNFR+Y G ++ RME+ A+ A + ALS
Sbjct: 119  GAWRALRE-SGVADSKLCYLNFRVYASGMAESRGG-----WFDRMEAAALEQAQLVLALS 172

Query: 1172 SRDAHILANELGVGLSPGVVPKP----LFPPLREDIRSMAISR----------------- 1288
            D LA L SP P L PPLR D+ ++A++
Sbjct: 173  PADQKSLA-ALQRRSSPPTATLPPVRVLMPPLRADVAALALASGSGAAAAAASAALPPAL 231

Query: 1289 LRVLSVWSKDRKYISCCVRLSPEKNADLFASIIESISSFLIRRGIVPFLCGGAHGSGGDY 1468
            L++ R++++C VRLS EK F + E Y
Sbjct: 232  AAALALPGARRRFVTCGVRLSSEKEPMRFVAFAE-------------------------Y 266

Query: 1469 AESIKQRVKVAVP-------DAVVYEGFMGPEQLAEVYSKTLLNVHPCIYDAYGMTIVEA 1627
            AE K R++ A DAVV F+GP LA V++ T LN HPC YDAYGM++VEA
Sbjct: 267  AEEAKTRLRAACEAAAALSGDAVVLSDFIGPPALAAVFAATALNFHPCRYDAYGMSVVEA 326

Query: 1628 AAFEAPSIXXXXXXXXXXXXEFLESDKGQIFALDLNSPM-HALSDKLEKILEDTERLSQT 1804
            AAF APS+ + L +D+G F + A+S ++ +L D RL+
Sbjct: 327  AAFGAPSV--VNGGAKVGAAQLLPADEGASFEARFDGAADEAVSAEVLALLGDQARLASV 384

Query: 1805 GKAASGRSLSWNEDA 1849
            G AA R+L W+E A
Sbjct: 385  GSAARERALGWDESA 399

>ref|XP_001745584.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775933|gb|EDQ89555.1| predicted protein [Monosiga brevicollis MX1]
          Length = 316

 Score = 155 bits (391), Expect = 1e-34
 Identities = 106/315 (33%), Positives = 165/315 (52%), Gaps = 7/315 (2%)
 Frame = +2

Query: 965  ILVVDWSSLPAFKHLSELAGQQKWT-MGFLNFRIYYISEYNGEQGLLEKDFYKRMESEAV 1141
            + VDW+ L A + L+ + G ++ + FL +R++ S N Y+ E+ A
Sbjct: 10   VCAVDWTGLAAVQTLASMYGAKRSPPLVFLCYRVFSAS-VNITTNPDAAATYRMYEASAW 68

Query: 1142 SMASAIAALSSRDAHILANELGVGLSPGVVPKP------LFPPLREDIRSMAISRLRVLS 1303
            S A A+ LS D L L P+P L+PPLR D+ MA S+ ++L
Sbjct: 69   SNAQALVVLSQADQRNLEQ-----LQSSDDPRPPSACFLLYPPLRADMVQMA-SQAQLLP 122

Query: 1304 VWSKDRKYISCCVRLSPEKNADLFASIIESISSFLIRRGIVPFLCGGAHGSGGDYAESIK 1483
            ++ +Y++CCVRLSPEK+ D F I+ + + L IVP LCG G+ Y ++
Sbjct: 123  P-ARGPRYLTCCVRLSPEKSPDTFLDILSHLRTSLCSLNIVPLLCGT--GTDPAYVSALH 179

Query: 1484 QRVKVAVPDAVVYEGFMGPEQLAEVYSKTLLNVHPCIYDAYGMTIVEAAAFEAPSIXXXX 1663
            R + P+ ++ GFMGP ++AEV+ +T+LNVHP DA+GMTI EAAAF P+
Sbjct: 180  HRAQAVCPETMIRSGFMGPSEMAEVWQQTILNVHPSQRDAFGMTIAEAAAFGVPT--AGS 237

Query: 1664 XXXXXXXXEFLESDKGQIFALDLNSPMHALSDKLEKILEDTERLSQTGKAASGRSLSWNE 1843
            +FL +G + L++P A + + +L D RLSQ +AA R+L+W
Sbjct: 238  DGGDVGALDFLA--QGAFLPIKLSAPAKAAA-AIRDLLHDPTRLSQLQRAAQARALAWTA 294

Query: 1844 DANAQQLINILDSSS 1888
            A+ +L+ IL ++
Sbjct: 295  TAHGAELLTILQQAA 309

>ref|XP_005838507.1| hypothetical protein GUITHDRAFT_102791 [Guillardia theta CCMP2712]
 gi|428182667|gb|EKX51527.1| hypothetical protein GUITHDRAFT_102791 [Guillardia theta CCMP2712]
          Length = 419

 Score = 148 bits (374), Expect = 9e-33
 Identities = 126/428 (29%), Positives = 200/428 (46%), Gaps = 17/428 (3%)
 Frame = +2

Query: 653  LMTLEYRKSTFSGNGVYAQSIARSLANVGNHVLVV--SARPVNRTSSSATDTKRPIQEAG 826
            L+++E FSGNG+ ++ +ARS+ + + L V ++ + + A R + EA
Sbjct: 5    LISMEVLDPIFSGNGIASRFVARSMLSRNSTSLFVLCGSKTAGQGAGLAHRCDREVFEAI 64

Query: 827  LRLTEMEVDVDGAQ-----WGRLDWKCPWQSFADGITQNVVK-TIMDFQPQWILVVDWSS 988
            + + V G WG+LD ++ FA G +V+ TI + L VDWS
Sbjct: 65   SPKRKDQSRVKGISIPLPIWGKLDKNSSYEEFAAGACDKLVEETIKIDKVDTFLCVDWSG 124

Query: 989  LPAFKHLSE---LAGQQKWTMGFLNFRIYYI--SEYNGEQGLLEKDFYKRMESEAVSMAS 1153
            F + E + K M + F +++ S N E G FYK E+ ++ A+
Sbjct: 125  CYVFHRMKEQGLIVDDAK--MIYFPFCVFHALYSASNEEVG-----FYKAEETRSIESAT 177

Query: 1154 AIAALSSRDAHILANELGVGLSPGVVPKPLFPPLREDIRSMAISRLRVLSVWSKDRKYIS 1333
            + AL D LA + + L PPLRED R++ + V+ + RK+I+
Sbjct: 178  KVIALCESDLEKLAK---LNSKHRGKIEVLPPPLREDFRAVVLRNSEVV----RQRKFIT 230

Query: 1334 CCVRLSPEKNADLFASIIESISSFLIRRGIVPFLCGGAHGSGGDYAESIKQRVKVAVPDA 1513
            CC+R +P KN +F + I L RGI+ +CG YA ++ ++ +
Sbjct: 231  CCLRFTPSKNVKVFCEAVSIIHEELSARGIMVVMCGAVIDES--YANECRRVMEESNTRH 288

Query: 1514 VVYEGFMGPEQLAEVYSKTLLNVHPCIYDAYGMTIVEAAAFEAPSIXXXXXXXXXXXXEF 1693
            + + F+ + L EV S++L+N+H Y+AYGMTIVEAAA +PSI
Sbjct: 289  NIVKDFLDAQGLQEVLSQSLVNIHTAPYEAYGMTIVEAAACGSPSIIHEQWRDIGATILL 348

Query: 1694 -LESDKGQIF---ALDLNSPMHALSDKLEKILEDTERLSQTGKAASGRSLSWNEDANAQQ 1861
            SD G + LD N A+ D + D E L + + A +LSW+ED A++
Sbjct: 349  PPTSDGGSCYLANMLDANDLARAILD----AIRDQESLGEASRRARELALSWDEDKYAER 404

Query: 1862 LINILDSS 1885
            LI +LD S
Sbjct: 405  LIEVLDDS 412

>gb|EGB12079.1| hypothetical protein AURANDRAFT_61406 [Aureococcus anophagefferens]
          Length = 483

 Score = 144 bits (362), Expect = 2e-31
 Identities = 136/466 (29%), Positives = 202/466 (43%), Gaps = 54/466 (11%)
 Frame = +2

Query: 641  LRLVLMTLEYRKSTFSGNGVYAQSIARSLANV-GNHVLVVSARPVNRTSSSATDTKRPIQ 817
            + ++++T EY TFSGNGVYAQS+ R L V G V VV P A + + P +
Sbjct: 3    INILVLTFEYDAFTFSGNGVYAQSLVRGLEMVEGVSVDVVCGGPFG-----AENCQPPSK 57

Query: 818  EAGLRLTEMEVDVDGAQWGRLDWKCPWQSFADGITQNVVKTIMDFQ-PQWILVVDWSSLP 994
            A V V+ +WG LD C ++ FA + + + P +L VDW L
Sbjct: 58   AA--------VVVNLPRWGHLDASCCFREFAAQVNARLDEYYESRAWPDLVLGVDWHVLQ 109

Query: 995  AFKHLSELAGQQKWTMGFLNFRIYYISEYNGEQGLLEKD--FYKRMESEAVSMASAIAAL 1168
            ++HLS ++ M +LNFR++ S G G + F+K E + + AL
Sbjct: 110  VYEHLSR---KKPLPMVYLNFRVFSRS---GPPGASQHGAAFFKAQEFAMMDRSWLTLAL 163

Query: 1169 SSRDAHILANELGVGLSPGVVPKPLFPPLREDIRSMAISRLRVLSVWSKDRKYISCCVRL 1348
            S DA L L P + L PP+R D+ + A LR V +R I+CCVRL
Sbjct: 164  SETDAACLRE-----LHPNAEVRVLLPPIRADVMARAQRPLRKADV--PERHLITCCVRL 216

Query: 1349 SPEKNADLFASIIESISS-------FLIRRGIVPFLCGGAHGSGGDYAESIKQRVKVAVP 1507
            S EK + F ++E+++ F + G+ P L G D + +R A P
Sbjct: 217  SDEKEPERFVRLVENLAERKAFDGVFGYKVGLRPVLVTGRC----DVDTPLVRRFLDAHP 272

Query: 1508 DAVVYEGFMGPEQLAEVYSKTLLNVHPCIYDAYGMTIVEAAAFEAPSI------------ 1651
            + VY F+ E L E+Y K+LL+VHP YD +GM + EAAAF AP++
Sbjct: 273  ETDVY-AFLDAEGLEELYRKSLLHVHPPAYDPFGMAVAEAAAFGAPTVFLKGADVGAKTI 331

Query: 1652 ---------XXXXXXXXXXXXEFLE-------------------SDKGQIFALDLNSPMH 1747
            F E +D Q+ D +H
Sbjct: 332  LQRGDTPELDPRTGKELFDQDAFFEFDLDDDHAGYVAPEDRPTTADLEQLDEFDRRRWIH 391

Query: 1748 ALSD---KLEKILEDTERLSQTGKAASGRSLSWNEDANAQQLINIL 1876
            D +L +++ D RL G+ A R+L W E A+ +L++ L
Sbjct: 392  KHEDAVTELLRLVRDESRLRDVGRLAKKRALPWTEKAHGARLMDFL 437

>ref|XP_004998524.1| hypothetical protein PTSG_01049 [Salpingoeca rosetta]
 gi|326430779|gb|EGD76349.1| hypothetical protein PTSG_01049 [Salpingoeca rosetta]
          Length = 813

 Score = 142 bits (358), Expect = 7e-31
 Identities = 122/423 (28%), Positives = 179/423 (42%), Gaps = 57/423 (13%)
 Frame = +2

Query: 635  EPLRLVLMTLEYRKSTFSGNGVYAQSIARSLANVGNHVLVVSARPVNRTSSSATDTKRPI 814
            E LR++++T E+ FSGNGV ++S+ L ++G+ V V+ ARP + T P
Sbjct: 307  ERLRVLVVTAEFVDPIFSGNGVLSRSLCEGLHSIGHSVFVLCARP----KDADPPTTFPC 362

Query: 815  QEAGLRLTEMEVDVDGAQWGRLDWKCPWQSFADGITQNVVKTIMDFQPQWILVVDWSSLP 994
            A +T + W RLD W+ A G+T +V++ I F P L VDW++L
Sbjct: 363  HGATHVVT-----IPVTTWRRLDRYGHWEEMAHGVTPHVIERIKAFSPTTCLFVDWTTLA 417

Query: 995  AFKHLSELAGQQKWTMGFLNFRIYYISEYNGEQGLLEKDFYKRMESEAVSMASAIAALSS 1174
            + L WT FL FR + S E ++ FY + E ++ +A LS
Sbjct: 418  TVRQLPF----HNWT--FLCFRTFAASTELHEHPD-DEAFYIKHERASLELAPKTICLSC 470

Query: 1175 RDAHIL-----------ANELGVGLSPGVVPKPLFPPLREDIRSMAISRLRVLSVWSKDR 1321
            D L E V PG L+PPLR + + V S R
Sbjct: 471  ADKLALQRLRHTRTSSATQEPSVSQHPGRHIHVLYPPLRHEFAHLPEPATTVTSSIVDSR 530

Query: 1322 ----------------------------------------------KYISCCVRLSPEKN 1363
            + + CCVRLSPEK
Sbjct: 531  CTDQMATTTPASTSTSTSTSAPATSTIKTTTTNNGESRQPQSAPHVRRLLCCVRLSPEKG 590

Query: 1364 ADLFASIIESISSFLIRRGIVPFLCGGAHGSGGDYAESIKQRVKVAVPDAVVYEGFMGPE 1543
            A FA ++ + L R IVP LCG + DYA +++Q ++ P+ V F+
Sbjct: 591  AMRFAEMVAHLREDLRRHNIVPTLCGAV--ADADYATAVRQALRSTSPNCQVINRFLDAR 648

Query: 1544 QLAEVYSKTLLNVHPCIYDAYGMTIVEAAAFEAPSIXXXXXXXXXXXXEFLESDKGQIFA 1723
            L +++ +T++NVHP ++DA+GMTIVEAAA A S L SD G + A
Sbjct: 649  ALQQLFRETVINVHPSLHDAFGMTIVEAAACGAVS---------------LVSDDGTVGA 693

Query: 1724 LDL 1732
            +DL
Sbjct: 694  VDL 696

>ref|XP_005717155.1| unnamed protein product [Chondrus crispus]
 gi|507113726|emb|CDF37336.1| unnamed protein product [Chondrus crispus]
          Length = 449

 Score = 142 bits (357), Expect = 9e-31
 Identities = 119/427 (27%), Positives = 202/427 (47%), Gaps = 19/427 (4%)
 Frame = +2

Query: 644  RLVLMTLEY-RKSTFSGNGVYAQSIARSLANVGNHVLVVSARPVNRTSSSATDTKRPIQE 820
            RLV ++LEY + FSG+G+ A++ R A V+V+ RP + T++P +
Sbjct: 40   RLVYLSLEYCQPDLFSGSGIAARAQVRGFAAKNVSVVVICGRPAH--------TQQP-EP 90

Query: 821  AGLRLTEMEVDVDGAQWGRLDWKCPWQSFADGITQNVVKTIMDFQPQWILVVDWSSLPAF 1000
            A + + V +D +WG D Q +A+G+ + + + + IL VDW++ A
Sbjct: 91   ARRNVKVISVPLD--KWGTTDRCASHQQYAEGVARILGDGLGQYDA--ILAVDWTAANAV 146

Query: 1001 KHLSELAGQQKWTMGFLNFRIYYISEYNGEQGLLEKD--FYKRMESEAVSMA----SAIA 1162
            + G K M +L+FR+Y G+ D FY+ E+ AV +A +
Sbjct: 147  RLFLCHGGTLKVPMIYLSFRVYC-----SMTGISRDDARFYREEEAAAVDLALTSGGGVV 201

Query: 1163 ALSSRDAHILANELGVGLSPG--------VVPKPLFPPLREDIRSMAISRLRVLSVWSKD 1318
            AL D L+ P V+P P LR++ ++A + + +
Sbjct: 202  ALCDMDFETLSTLRTARSHPADKLPHQFAVIP----PMLRDEFSNIAKDDEDQILDFPNN 257

Query: 1319 RKYISCCVRLSPEKNADLFASIIESISS----FLIRRGIVPFLCGGAHGSGGDYAESIKQ 1486
            R Y+ VRLS +K F +++++ R G+VP +CG S YA+ I +
Sbjct: 258  RMYLVSLVRLSRDKGPHRFVKLLQNLQQKDPDIWERTGVVPLICGSH--SQPHYADRILR 315

Query: 1487 RVKVAVPDAVVYEGFMGPEQLAEVYSKTLLNVHPCIYDAYGMTIVEAAAFEAPSIXXXXX 1666
            ++ AVP +VV + F+ PE+LA V ++LN+HP Y+AYGMTI+EAAA P++
Sbjct: 316  ELRSAVPHSVVIDKFLTPEELAVVLKNSVLNIHPAEYEAYGMTIIEAAAMGCPTV---LN 372

Query: 1667 XXXXXXXEFLESDKGQIFALDLNSPMHALSDKLEKILEDTERLSQTGKAASGRSLSWNED 1846
            + L+ A+D+ + A ++ + ++LED R Q +A + SW E
Sbjct: 373  RTGIGATQLLDPKNKACAAVDVTDEV-AFANTVRRLLEDAPRRQQLAHSAFLHATSWTEA 431

Query: 1847 ANAQQLI 1867
            + + L+
Sbjct: 432  EHVRALL 438

>ref|XP_001422117.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582357|gb|ABP00434.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 271

 Score = 130 bits (327), Expect = 3e-27
 Identities = 90/283 (31%), Positives = 140/283 (49%), Gaps = 7/283 (2%)
 Frame = +2

Query: 1055 FRIYYISEYNGEQGLLEKDFYKRMESEAVSMASAIAALSSRDAHILANELGVGLSPGVVP 1234
            FRI+ S+++G L E + V + + AL + DA + LG + P VV
Sbjct: 13   FRIFSRSDFDGHHAL---------ECDGVRRSDGVLALCASDADFVTERLGAAVRPLVV- 62

Query: 1235 KPLFPPLREDIRSMAISRLRVLSVWSKDRKYISCCVRLSPEKNADLFASIIESISSFLIR 1414
            PPLRE + +A +R ++ RKY++C VR+S EK F + E ++ R
Sbjct: 63   ---HPPLRESVLKIARARRGE----TRRRKYLTCVVRVSEEKEPHRFVELCEELA----R 111

Query: 1415 RGIV------PFLCGGAH-GSGGDYAESIKQRVKVAVPDAVVYEGFMGPEQLAEVYSKTL 1573
            RG+ P C A YA+ +K R + P V E F+ PE+L E++S+T+
Sbjct: 112  RGVFDSKALSPVFCANASLCQTSAYAQDLKARFQKCAPSGRVIEHFLSPEELGELFSETI 171

Query: 1574 LNVHPCIYDAYGMTIVEAAAFEAPSIXXXXXXXXXXXXEFLESDKGQIFALDLNSPMHAL 1753
            LN+HP +DA+GMTIVE+AAF APS+ S+ Q +D+ +P
Sbjct: 172  LNIHPPTHDAFGMTIVESAAFGAPSVVHHAGAVG-------ASELLQTIGVDVEAPTREF 224

Query: 1754 SDKLEKILEDTERLSQTGKAASGRSLSWNEDANAQQLINILDS 1882
            +DK+E+IL D R + A ++L W+E + + + L S
Sbjct: 225  ADKIEEILLDA-RTRVVAENAHSKALEWDEASFGHAVFDFLFS 266

>ref|XP_003084132.1| unnamed protein product, partial [Ostreococcus tauri]
 gi|116056015|emb|CAL58548.1| unnamed protein product, partial [Ostreococcus tauri]
          Length = 355

 Score = 120 bits (300), Expect = 3e-24
 Identities = 102/356 (28%), Positives = 161/356 (45%), Gaps = 11/356 (3%)
 Frame = +2

Query: 842  MEVDVDGAQWGRLDWKCPWQSFADGITQ-NVVKTIMDFQPQWILVVDWSSLPAFKHLSEL 1018
            + + V A+W RLD + +FA G VV+ + F VD++S+ A E
Sbjct: 37   IRIPVPMARWHRLDRRSAHDAFARGCADARVVERVRAFACDVAFAVDFTSVRAV----EA 92

Query: 1019 AGQQKWTMGFLNFRIYYISEYNGEQGLLEKDFYKRMESEAVSMASAIAALSSRDAHILAN 1198
            G ++ E D + E AV+ ALS DA +
Sbjct: 93   IGLRRD----------------------EGDAHAVYERAAVAGTWMTIALSRSDAMFVME 130

Query: 1199 ELGVGLSPGVVPKPLFPPLREDIRSMAISRLRVLSVWSKDRKYISCCVRLSPEKNADLFA 1378
            LG G + PPLR D A+ + R+Y+SC VR S EK F
Sbjct: 131  RLG---GRGRT-RWTHPPLRRDAAREAMRD----DGTRRARRYVSCVVRPSEEKKPHRFV 182

Query: 1379 SIIESISSFLIRRGI----------VPFLCGGAHGSGGDYAESIKQRVKVAVPDAVVYEG 1528
            ++ E ++ RRG+ P +C A DYA +++R + P+A V +
Sbjct: 183  AMCEELA----RRGVFDAKDGGAPLAPLMCINAEVKS-DYARDLRRRFEACSPNARVVDE 237

Query: 1529 FMGPEQLAEVYSKTLLNVHPCIYDAYGMTIVEAAAFEAPSIXXXXXXXXXXXXEFLESDK 1708
            F+G L +V+ +TLLNVHP YD++GMTI+EAAAF AP++ +SD
Sbjct: 238  FLGTRALGDVFEETLLNVHPPSYDSFGMTIIEAAAFGAPTVMHNGGDVGARDLLSADSDV 297

Query: 1709 GQIFALDLNSPMHALSDKLEKILEDTERLSQTGKAASGRSLSWNEDANAQQLINIL 1876
            F +++ + +D +E I+ D ER+ + + A R+L W+EDA + +++I+
Sbjct: 298  -SCFDMNMEASASEQADVIETIINDPERIHKVAQNARLRALKWDEDAFGEAILDII 352

>gb|ETO26699.1| hypothetical protein RFI_10435 [Reticulomyxa filosa]
          Length = 526

 Score = 90.1 bits (222), Expect = 4e-15
 Identities = 97/374 (25%), Positives = 171/374 (45%), Gaps = 33/374 (8%)
 Frame = +2

Query: 629  RNEPLRLVLMTLEYRKST-FSGNGVYAQSIARSLANVGNHVLVVSARP-VNRTSSSATDT 802
            R R + + E+ +ST FSGNGVY ++I SL V+VV A P + + T+
Sbjct: 19   RESQRRYLFVAYEFIQSTVFSGNGVYGRTIVSSLLQKDCDVVVVCAHPQIVKNDEKNTNK 78

Query: 803  KRPIQEAGLRLTEMEV-DVDGAQWGRLDWKCPWQSFADGITQNVVKTIMDFQPQW---IL 970
            + ++E +M V D D + + W + A +NV T + +L
Sbjct: 79   MQFLKE------QMTVQDADKKKTMNDNRHNCWHTGACKQMENVGSTERLSKTSCFSHLL 132

Query: 971  VVDWSSLPAFKHLSELAGQQKWTMGFLNFRIYY-------ISEYNGEQGLLEKD------ 1111
            VD+S+ A + L + + NFR+++ S+ N E ++ +
Sbjct: 133  FVDYSACLASQVLVTILNLFTIKKTYFNFRVFHKNSNLTKSSKKNNEPVTVDVNDDDDVS 192

Query: 1112 FYKRMESEAVSMASAIAALSSRDAHILANELGVGLSP---GVVPKP---LFPPLREDIRS 1273
            FY +E + V + I AL D L LS G K L+PPL+ + +
Sbjct: 193  FYVSVEKQCVDWSDQIIALCQMDLQALQQLSDSKLSSDSGGPHEKMWHVLYPPLQPAVHA 252

Query: 1274 MAISRLRVLSV--WSKDRKYISCCVRLSPEKNADLFASIIESIS-SFLIRRGIVPFLCGG 1444
            +A+ +L+ + +RKYI CCVR+SPEKN +F ++ + + + F+ G
Sbjct: 253  LALKTKPILNEQKFEGNRKYILCCVRISPEKNMMVFIDMLPYLQMERFADKNVSLFIVGS 312

Query: 1445 AHGSGGDYAESIKQRVK-----VAVPDAVVYEGFMGPEQLAEVYSKTLLNVHPCIYDAYG 1609
            + Y +++ +++ +P V F+ ++ + + +++NVH + + YG
Sbjct: 313  CNDVR--YKQTLVSKLEHYSKQYHIP--YVMHEFLQVDEFSVILRSSIVNVHTSLNEPYG 368

Query: 1610 MTIVEAAAFEAPSI 1651
            MTI+E+AAF+ PSI
Sbjct: 369  MTIMESAAFQTPSI 382

>gb|EJK51783.1| hypothetical protein THAOC_29017, partial [Thalassiosira oceanica]
          Length = 309

 Score = 60.8 bits (146), Expect = 3e-06
 Identities = 55/186 (29%), Positives = 83/186 (44%), Gaps = 18/186 (9%)
 Frame = +2

Query: 866  QWGRLDWKCPWQSFAD-GITQNVVKTIMDFQPQWILVVDWSSLPAFKHLSELAGQQK--- 1033
            +W RLD + PW+ F D + +V ++ ++P + VDW + A++ + E AG +
Sbjct: 88   RWKRLDREGPWEEFRDLASSPRLVGDVVRYRPTHAVAVDWHGMLAWRAIVEGAGPGRLGG 147

Query: 1034 WTMGFLNFRIYYISEYNGEQGLLEKDFYKRMESEAVSMASAIAALSSRDAHILANELGVG 1213
            + NFR+Y S + G DFY E A A I LS +D LA +G
Sbjct: 148  CRAVYYNFRVYSASAFGDGDG----DFYAVKERLACRHADQIVCLSEKDRDSLAGLMGKD 203

Query: 1214 ---------LSPGVVPKPLFPPLREDIRSMAISRLRVLSVWSKDRKYISCC-----VRLS 1351
            S G+ L PPLR DI +A R + S++ +S C RL+
Sbjct: 204  GGHDQEAREKSAGIC--VLHPPLRGDICELARQGARSDAETSEEGGSVSDCHLPEAARLA 261

Query: 1352 PEKNAD 1369
            E+ AD
Sbjct: 262  IERLAD 267

>ref|YP_002960771.1| hypothetical protein MCJ_002570 [Mycoplasma conjunctivae HRC/581]
 gi|502286350|ref|WP_012751464.1| hypothetical protein [Mycoplasma conjunctivae]
 gi|239984955|emb|CAT04948.1| CONSERVED HYPOTHETICAL Hypothetical transmembrane protein [Mycoplasma conjunctivae]
          Length = 380

 Score = 60.8 bits (146), Expect = 3e-06
 Identities = 50/189 (26%), Positives = 87/189 (46%), Gaps = 4/189 (2%)
 Frame = +2

Query: 1313 KDRKYISCCVRLSPEKNADLFASIIESISSFLIRRGIVP---FLCGGAHGSGGDYAESIK 1483
            + +K IS R S KN + ++ S + + P F+ G GG Y + IK
Sbjct: 205  RGKKIISMVARASKVKNTNF------ALRSLAKLKNVYPNFVFIFAG----GGSYLKVIK 254

Query: 1484 QRV-KVAVPDAVVYEGFMGPEQLAEVYSKTLLNVHPCIYDAYGMTIVEAAAFEAPSIXXX 1660
            ++ ++ + V + G + E+L E++S T ++ P +D G+ + EAAAF PS+
Sbjct: 255  RQASRLGLNSHVFFPGSLKKEELFELFSVTNIHFFPSFFDTDGLVVNEAAAFNIPSV--- 311

Query: 1661 XXXXXXXXXEFLESDKGQIFALDLNSPMHALSDKLEKILEDTERLSQTGKAASGRSLSWN 1840
            F+ ++ A + A SDK+ +I++D+ L+ GK A SW+
Sbjct: 312  VIENTGASERFVNNES----AFIIKDSTQAASDKILEIIDDSHLLNAVGKKAYSCYQSWD 367

Query: 1841 EDANAQQLI 1867
            E A L+
Sbjct: 368  EIAQQYILL 376