BLASTX nr result
ID: Ophiopogon26_contig00038785
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon26_contig00038785
         (1403 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                        (bits) Value

gb|POG81663.1| hypothetical protein GLOIN_2v1504207 [Rhizophagus...   780   0.0
gb|EXX72146.1| hypothetical protein RirG_072170 [Rhizophagus irr...   766   0.0
gb|PKY45927.1| hypothetical protein RhiirA4_401815 [Rhizophagus ...   680   0.0
dbj|GBC40160.1| tyrosyl-tRNA synthetase [Rhizophagus irregularis...   679   0.0
ref|XP_004333807.1| Tyrosyl-tRNA synthetase [Acanthamoeba castel...   246   2e-74
ref|WP_096830388.1| hypothetical protein [Tychonema bourrellyi] ...   241   2e-72
ref|WP_038073215.1| hypothetical protein [Tolypothrix bouteillei...   229   1e-67
ref|WP_069966748.1| hypothetical protein [Desertifilum sp. IPPAS...   227   4e-67
gb|KHD05220.1| hypothetical protein OT06_49285 [Candidatus Thiom...   216   5e-63
gb|KUK49216.1| Uncharacterized protein XD74_0174 [Actinobacteria...   213   1e-61
ref|WP_106301821.1| hypothetical protein [Chamaesiphon polymorph...   208   2e-59
gb|APR82132.1| Hypothetical protein A7982_07481 [Minicystis rosea]    207   3e-59
gb|PRP77223.1| hypothetical protein PROFUN_15117 [Planoprotostel...   200   1e-56
ref|WP_050430843.1| hypothetical protein [Chondromyces crocatus]...   196   3e-55
dbj|GBD32688.1| hypothetical protein HRbin33_01662 [bacterium HR33]   196   4e-55
ref|WP_082838724.1| hypothetical protein [Gemmata sp. SH-PL17] >...   193   5e-54
ref|XP_004345553.2| tyrosyl-tRNA synthetase [Capsaspora owczarza...   194   5e-54
ref|WP_006971973.1| hypothetical protein [Plesiocystis pacifica]...   191   2e-53
ref|XP_002677363.1| predicted protein [Naegleria gruberi] >gi|28...   189   4e-52
ref|WP_084207322.1| hypothetical protein [Sulfuritalea hydrogeni...   185   7e-51

>gb|POG81663.1| hypothetical protein GLOIN_2v1504207 [Rhizophagus irregularis DAOM 181602=DAOM 197198]
          Length = 446

 Score =  780 bits (2015), Expect = 0.0
 Identities = 375/396 (94%), Positives = 383/396 (96%), Gaps = 3/396 (0%)
 Frame = +3

Query: 3    HLFHFIMELHQILKVIKNHLTHHESFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFH 182
            HLFHFIMEL QI+KVIKNHLTHHESFIANEL+KQNPPKWYCKNKKLIYEMATPEGADSFH
Sbjct: 51   HLFHFIMELRQIIKVIKNHLTHHESFIANELVKQNPPKWYCKNKKLIYEMATPEGADSFH 110

Query: 183  GKLEYTRWEEFPLPKSFKGESDNHQGGKEMEIEVKNDVFDYSSPEDDNTVNWYINFADKH 362
            GKLEYTRWEEFPLPKSFKGESDN QGGKEM+IEVKNDVFDYSSPED+NTVNWYINFADKH
Sbjct: 111  GKLEYTRWEEFPLPKSFKGESDNPQGGKEMKIEVKNDVFDYSSPEDNNTVNWYINFADKH 170

Query: 363  VFGYYGGNLFAQDESQCLEHPALCSLRDYLVKGDQESRFTTARPILTK---GSSFISTPI 533
            VFGYYGGNLFAQDESQCLEHP LCSLRDYLVKGDQES FTTARPILTK    SSFISTPI
Sbjct: 171  VFGYYGGNLFAQDESQCLEHPVLCSLRDYLVKGDQESCFTTARPILTKDSISSSFISTPI 230

Query: 534  LVRGVERKIQVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSN 713
            LVRGVERK+QVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSN
Sbjct: 231  LVRGVERKVQVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSN 290

Query: 714  IIAIEAPKHGHGNYTFGTIKKILATAYSGFLAAKFESFVEKGILERNHEENQEHTKININ 893
            IIAIEAPKHGHG YTF TIK+ILAT YSGFLAAKFES VEKGILERNHEENQEHTKININ
Sbjct: 291  IIAIEAPKHGHGKYTFNTIKRILATGYSGFLAAKFESLVEKGILERNHEENQEHTKININ 350

Query: 894  EEESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIIYHAMGNQDECNSGL 1073
            EEESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKI+YHAMGNQDE NSGL
Sbjct: 351  EEESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIVYHAMGNQDEFNSGL 410

Query: 1074 QVYNELIKDVEGDVKIDDFIKNVELKDFQWGLSNGT 1181
            Q+YNELIKDVEGDVKIDDFIKNVELKDFQWGLSNGT
Sbjct: 411  QIYNELIKDVEGDVKIDDFIKNVELKDFQWGLSNGT 446


>gb|EXX72146.1| hypothetical protein RirG_072170 [Rhizophagus irregularis DAOM 197198w]
gb|PKC11192.1| hypothetical protein RhiirA5_413392 [Rhizophagus irregularis] gb|PKC74051.1| hypothetical protein RhiirA1_450433 [Rhizophagus irregularis] gb|PKY17737.1| hypothetical protein RhiirB3_430415 [Rhizophagus irregularis] Length = 390 Score = 766 bits (1979), Expect = 0.0 Identities = 369/390 (94%), Positives = 377/390 (96%), Gaps = 3/390 (0%) Frame = +3 Query: 21 MELHQILKVIKNHLTHHESFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYT 200 MEL QI+KVIKNHLTHHESFIANEL+KQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYT Sbjct: 1 MELRQIIKVIKNHLTHHESFIANELVKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYT 60 Query: 201 RWEEFPLPKSFKGESDNHQGGKEMEIEVKNDVFDYSSPEDDNTVNWYINFADKHVFGYYG 380 RWEEFPLPKSFKGESDN QGGKEM+IEVKNDVFDYSSPED+NTVNWYINFADKHVFGYYG Sbjct: 61 RWEEFPLPKSFKGESDNPQGGKEMKIEVKNDVFDYSSPEDNNTVNWYINFADKHVFGYYG 120 Query: 381 GNLFAQDESQCLEHPALCSLRDYLVKGDQESRFTTARPILTK---GSSFISTPILVRGVE 551 GNLFAQDESQCLEHP LCSLRDYLVKGDQES FTTARPILTK SSFISTPILVRGVE Sbjct: 121 GNLFAQDESQCLEHPVLCSLRDYLVKGDQESCFTTARPILTKDSISSSFISTPILVRGVE 180 Query: 552 RKIQVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEA 731 RK+QVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEA Sbjct: 181 RKVQVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEA 240 Query: 732 PKHGHGNYTFGTIKKILATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVP 911 PKHGHG YTF TIK+ILAT YSGFLAAKFES VEKGILERNHEENQEHTKININEEESVP Sbjct: 241 PKHGHGKYTFNTIKRILATGYSGFLAAKFESLVEKGILERNHEENQEHTKININEEESVP 300 Query: 912 KPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIIYHAMGNQDECNSGLQVYNEL 1091 KPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKI+YHAMGNQDE NSGLQ+YNEL Sbjct: 301 KPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIVYHAMGNQDEFNSGLQIYNEL 360 Query: 1092 IKDVEGDVKIDDFIKNVELKDFQWGLSNGT 1181 IKDVEGDVKIDDFIKNVELKDFQWGLSNGT Sbjct: 361 IKDVEGDVKIDDFIKNVELKDFQWGLSNGT 390 >gb|PKY45927.1| hypothetical protein RhiirA4_401815 [Rhizophagus irregularis] Length = 347 Score = 680 bits (1755), Expect = 0.0 Identities = 330/347 (95%), Positives = 335/347 (96%), Gaps = 3/347 (0%) Frame = +3 Query: 150 
MATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNHQGGKEMEIEVKNDVFDYSSPEDDNT 329 MATPEGADSFHGKLEYTRWEEFPLPKSFKGESDN QGGKEM+IEVKNDVFDYSSPEDDNT Sbjct: 1 MATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNPQGGKEMKIEVKNDVFDYSSPEDDNT 60 Query: 330 VNWYINFADKHVFGYYGGNLFAQDESQCLEHPALCSLRDYLVKGDQESRFTTARPILTK- 506 VNWYINFADKHVFGYYGGNLFAQDESQCLEHP LCSLRDYLVKGDQES FTTARPILTK Sbjct: 61 VNWYINFADKHVFGYYGGNLFAQDESQCLEHPVLCSLRDYLVKGDQESCFTTARPILTKD 120 Query: 507 --GSSFISTPILVRGVERKIQVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRI 680 SSFISTPILVRGVERK+QVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRI Sbjct: 121 SISSSFISTPILVRGVERKVQVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRI 180 Query: 681 DPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKILATAYSGFLAAKFESFVEKGILERNHE 860 DPKTGNPYYSNIIAIEAPK+GHG YTF TIKKILAT YSGFLAAKFES VEKGILERNHE Sbjct: 181 DPKTGNPYYSNIIAIEAPKYGHGKYTFNTIKKILATGYSGFLAAKFESLVEKGILERNHE 240 Query: 861 ENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIIYHA 1040 ENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKI+YHA Sbjct: 241 ENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIVYHA 300 Query: 1041 MGNQDECNSGLQVYNELIKDVEGDVKIDDFIKNVELKDFQWGLSNGT 1181 MGNQDE NSGLQ+YNELIKDVEGDVKIDDFIKNVELKDFQWGLSNGT Sbjct: 301 MGNQDEFNSGLQIYNELIKDVEGDVKIDDFIKNVELKDFQWGLSNGT 347 >dbj|GBC40160.1| tyrosyl-tRNA synthetase [Rhizophagus irregularis DAOM 181602] gb|PKK79190.1| hypothetical protein RhiirC2_727940 [Rhizophagus irregularis] Length = 347 Score = 679 bits (1753), Expect = 0.0 Identities = 329/347 (94%), Positives = 335/347 (96%), Gaps = 3/347 (0%) Frame = +3 Query: 150 MATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNHQGGKEMEIEVKNDVFDYSSPEDDNT 329 MATPEGADSFHGKLEYTRWEEFPLPKSFKGESDN QGGKEM+IEVKNDVFDYSSPED+NT Sbjct: 1 MATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNPQGGKEMKIEVKNDVFDYSSPEDNNT 60 Query: 330 VNWYINFADKHVFGYYGGNLFAQDESQCLEHPALCSLRDYLVKGDQESRFTTARPILTK- 506 VNWYINFADKHVFGYYGGNLFAQDESQCLEHP LCSLRDYLVKGDQES FTTARPILTK Sbjct: 61 VNWYINFADKHVFGYYGGNLFAQDESQCLEHPVLCSLRDYLVKGDQESCFTTARPILTKD 120 Query: 507 
--GSSFISTPILVRGVERKIQVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRI 680 SSFISTPILVRGVERK+QVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRI Sbjct: 121 SISSSFISTPILVRGVERKVQVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRI 180 Query: 681 DPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKILATAYSGFLAAKFESFVEKGILERNHE 860 DPKTGNPYYSNIIAIEAPKHGHG YTF TIK+ILAT YSGFLAAKFES VEKGILERNHE Sbjct: 181 DPKTGNPYYSNIIAIEAPKHGHGKYTFNTIKRILATGYSGFLAAKFESLVEKGILERNHE 240 Query: 861 ENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIIYHA 1040 ENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKI+YHA Sbjct: 241 ENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIVYHA 300 Query: 1041 MGNQDECNSGLQVYNELIKDVEGDVKIDDFIKNVELKDFQWGLSNGT 1181 MGNQDE NSGLQ+YNELIKDVEGDVKIDDFIKNVELKDFQWGLSNGT Sbjct: 301 MGNQDEFNSGLQIYNELIKDVEGDVKIDDFIKNVELKDFQWGLSNGT 347 >ref|XP_004333807.1| Tyrosyl-tRNA synthetase [Acanthamoeba castellanii str. Neff] gb|ELR11794.1| Tyrosyl-tRNA synthetase [Acanthamoeba castellanii str. Neff] Length = 332 Score = 246 bits (628), Expect = 2e-74 Identities = 154/386 (39%), Positives = 206/386 (53%), Gaps = 10/386 (2%) Frame = +3 Query: 51 KNHLTHHESFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKS 230 ++++T SF ANEL +PP+W NKKL+Y +A P GADS G L Y+R++E PLP Sbjct: 6 EDYVTCQASFDANELATDHPPRWLHPNKKLVYALACPRGADSHRGTLHYSRYKEIPLPSL 65 Query: 231 FKGESDNHQGGKEMEIEVKNDVFDY----SSPEDDNTVNWYINFADKHVFGYYGGNLFAQ 398 + + Q + ++E++ D F Y E V+WY NFA H+F Y G L+AQ Sbjct: 66 YVPDDAMKQ---KTQLEMREDAFTYEPTAQEEEGRPVVSWYKNFAHSHLFIAYAGGLYAQ 122 Query: 399 DESQCLEHPALCSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDV 578 DE Q EHP L SLR+ L ++ RF RP+ T+G+ TPIL+RGVER+I VKTD Sbjct: 123 DEMQVAEHPILASLREALT-SSRDKRF---RPLTTEGNE--PTPILIRGVERRIFVKTDR 176 Query: 579 NKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYT 758 N RP+GLYG FA AS AI+ A+ + P SNI+A+EAP G G Y Sbjct: 177 NPGAGRPYGLYGGAFAAASEAAIRNASVVLKP--------PTISNILALEAPPGGRGAYR 228 Query: 759 FGTIKKILATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTG 938 + EKG ES P P V+IHTG Sbjct: 229 
Y-----------------------EKGF-------------------ESAP-PFVVIHTG 245 Query: 939 NWGCGAYGGNITIMSCLQIAAAHLAGIDKIIYHAMGNQ--DECNSGLQVYNE-LIKDVEG 1109 NWG GAYGGN +M+ LQ+ AA LAG+DK++YH Q D G+ + +E L+ D EG Sbjct: 246 NWGTGAYGGNKVLMALLQVLAARLAGVDKLVYHTFERQSSDAYREGVALLDERLVADSEG 305 Query: 1110 DVK---IDDFIKNVELKDFQWGLSNG 1178 K +D+ I + F+WGLS+G Sbjct: 306 QQKQIAVDELIGKLTALQFRWGLSDG 331 >ref|WP_096830388.1| hypothetical protein [Tychonema bourrellyi] gb|PHX54006.1| hypothetical protein CP500_018375 [Tychonema bourrellyi FEM_GT703] Length = 340 Score = 241 bits (615), Expect = 2e-72 Identities = 143/373 (38%), Positives = 211/373 (56%), Gaps = 4/373 (1%) Frame = +3 Query: 72 ESFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKSFKGESDN 251 ++F +LI +PPK+Y NK+L+Y++ P G + HG+L ++RW LP+ S Sbjct: 16 QTFDTQDLINDHPPKFYNGNKQLVYDICCPPGCNH-HGQLAFSRWYAMVLPEYLS--SLE 72 Query: 252 HQGGKEMEIEVKNDVFDYSSPEDDNTVNWYINFADKHVFGYYGGNLFAQDESQCLEHPAL 431 HQ +I + F+Y ED + WY+NFA +F YG +LFAQDE Q EHPAL Sbjct: 73 HQ----TDISERKGYFEYEPSEDSTQMEWYLNFAHYELFFAYGSSLFAQDEMQVAEHPAL 128 Query: 432 CSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRPFGLY 611 SLR+ L+ D + + T S TPIL+RGVER+ + TD N + RPFGLY Sbjct: 129 SSLREALL--DSKIKSLTV-------ESQQPTPILIRGVERRCAISTDPNSEQGRPFGLY 179 Query: 612 GNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKILATA 791 GN F A+ +AI+ AT K ++P P +NIIA+EAP G G Y+ I+ IL TA Sbjct: 180 GNNFGRATSDAIEQAT----KPLNP----PTITNIIAMEAPAGGRGYYSMAQIEYILQTA 231 Query: 792 YSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPK-PRVIIHTGNWGCGAYGGN 968 ++GF AA+ ES +E +P+ P ++IHTG WGCGAYGGN Sbjct: 232 FTGFSAARIESQLE------------------------LPQAPSLMIHTGFWGCGAYGGN 267 Query: 969 ITIMSCLQIAAAHLAGIDKIIYH---AMGNQDECNSGLQVYNELIKDVEGDVKIDDFIKN 1139 +M+ LQ+ +A L+ ++ +++H AMG+QD + + +L+ D + VK+ D ++ Sbjct: 268 RVLMALLQLLSARLSQVNCLVFHTSDAMGSQDLATAQQILDRDLVPD-DSPVKVSDLVEK 326 Query: 1140 VELKDFQWGLSNG 1178 + +FQWG S+G Sbjct: 327 IHAMEFQWGFSDG 339 >ref|WP_038073215.1| hypothetical protein [Tolypothrix bouteillei] gb|KIE11963.1| 
hypothetical protein DA73_0210005 [Tolypothrix bouteillei VB521301] Length = 338 Score = 229 bits (583), Expect = 1e-67 Identities = 142/385 (36%), Positives = 205/385 (53%), Gaps = 5/385 (1%) Frame = +3 Query: 39 LKVIKNHLTHHESFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFP 218 + ++L F L+ +PPK KNKK++Y++A P G G++ ++RW + Sbjct: 1 MNAASDNLICRHKFTTQSLVDTHPPKLKNKNKKIVYQIACPPGC-IHSGEIVFSRWRKIT 59 Query: 219 LPKSFKGESDNHQGGKEMEIEVKNDVFDYS-SPEDDNTVNWYINFADKHVFGYYGGNLFA 395 LP + SD E E + F+Y S E ++ V WY+NFA +F Y G+L A Sbjct: 60 LPVNLSSSSDR------TEFEERYGYFEYEPSHEKNDEVEWYLNFAHCDLFCAYSGSLLA 113 Query: 396 QDESQCLEHPALCSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTD 575 QDE Q EHPAL SLR+ L+ + P + TPIL+RGVER+ + TD Sbjct: 114 QDEMQVAEHPALGSLREALLDAGID-------PFTVEAGE--PTPILIRGVERRCAIATD 164 Query: 576 VNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNY 755 + RP+GLYGN FA A+ EAI+LAT K ++P P +NIIA+EAP +G+G Y Sbjct: 165 ASVENARPYGLYGNNFARATAEAIELAT----KPLNP----PTVTNIIAMEAPSNGYGVY 216 Query: 756 TFGTIKKILATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHT 935 T I+ IL TA++GF AK ES ES + VIIHT Sbjct: 217 TQQEIRYILDTAFTGFATAKVESCF-----------------------ESAQEQFVIIHT 253 Query: 936 GNWGCGAYGGNITIMSCLQIAAAHLAGIDKIIYH----AMGNQDECNSGLQVYNELIKDV 1103 G WGCGAYGGN +M+ LQ+ AA LA ++++++H A QD + + NE + V Sbjct: 254 GFWGCGAYGGNRILMALLQLLAARLAQVNRLVFHTGVDAKSAQDFA-TAQHILNENLAPV 312 Query: 1104 EGDVKIDDFIKNVELKDFQWGLSNG 1178 +V++ + ++ FQWG+S+G Sbjct: 313 GSNVEVSTLLVKIQALGFQWGISDG 337 >ref|WP_069966748.1| hypothetical protein [Desertifilum sp. IPPAS B-1220] gb|OEJ75663.1| hypothetical protein BH720_08465 [Desertifilum sp. 
IPPAS B-1220] Length = 332 Score = 227 bits (579), Expect = 4e-67 Identities = 142/385 (36%), Positives = 206/385 (53%), Gaps = 3/385 (0%) Frame = +3 Query: 33 QILKVIKNHLTHHESFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEE 212 ++ +++ N + H +F EL++ PPK Y NK++IY++A P G+ G L ++RW Sbjct: 3 KVRELLDNLICRH-AFNTQELVETYPPKLYHPNKRVIYDIACPPGS-VHRGTLCFSRWRG 60 Query: 213 FPLPKSFKGESDNHQGGKEMEIEVKNDVFDYSSPEDDNTVNWYINFADKHVFGYYGGNLF 392 LP+ E +E + F+Y D + WY+NFAD +F YG LF Sbjct: 61 MKLPEGLPASG-------ETVLEEYSGYFEYEPSPDSQEMEWYLNFADLDLFYAYGSPLF 113 Query: 393 AQDESQCLEHPALCSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKT 572 AQDE Q EHPAL SLR+ L+ E TA KG TP+LVRGVER+ ++ T Sbjct: 114 AQDEMQVAEHPALASLREALLAN--EINLFTAE----KGGP---TPVLVRGVERRCEIAT 164 Query: 573 DVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGN 752 + + +++RPFGLYGN FA A+ EAI LAT K ++P P +NI+AI AP G+ Sbjct: 165 NPDASQQRPFGLYGNHFAGATTEAIALAT----KPLNP----PTITNILAISAPSCRSGS 216 Query: 753 YTFGTIKKILATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIH 932 YT I+ IL TA++GF AA+ +S P V IH Sbjct: 217 YTQKQIEHILTTAFTGFTAARLDS----------------------------EAPLVAIH 248 Query: 933 TGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIIYHA---MGNQDECNSGLQVYNELIKDV 1103 TG WGCGA+GGN +M+ LQ+ AAHLA ++++I+H G++ + YN + + Sbjct: 249 TGFWGCGAFGGNRVLMALLQLLAAHLAQVNRLIFHTSDRSGSEALATAQDLFYNAIAPN- 307 Query: 1104 EGDVKIDDFIKNVELKDFQWGLSNG 1178 + + D I + DFQWG+S+G Sbjct: 308 -SSLSVSDLITQIHAMDFQWGVSDG 331 >gb|KHD05220.1| hypothetical protein OT06_49285 [Candidatus Thiomargarita nelsonii] Length = 325 Score = 216 bits (551), Expect = 5e-63 Identities = 142/372 (38%), Positives = 199/372 (53%), Gaps = 5/372 (1%) Frame = +3 Query: 78 FIANELIKQNPPKWYCKNKKLIYEMATPEGADSFH-GKLEYTRWEEFPLPKSFKGESDNH 254 F +L+ + PP Y NKK++Y++A P G S H G+L +RW LP +K N Sbjct: 14 FETQKLVDEFPPNLYDSNKKIVYQIACPPG--SVHSGQLVLSRWNLMRLP--YKVSFKNT 69 Query: 255 QGGKEMEIEVKNDVFDYSSPEDDNTVNWYINFADKHVFGYYGGNLFAQDESQCLEHPALC 434 Q + E + D F Y D + V WY+NFA +F YGG LFAQDE Q EHPAL Sbjct: 70 
Q----VVFEGREDYFGYE--RDTSVVEWYLNFAHYDLFCAYGGGLFAQDEMQVAEHPALG 123 Query: 435 SLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRPFGLYG 614 SLR+ L+ E P+ + TPIL++GVER+ V T+ N +++R FGLYG Sbjct: 124 SLREALLASGIE-------PLTVENGK--PTPILIKGVERRCAVSTEPNPSQQRHFGLYG 174 Query: 615 NQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKILATAY 794 N FA AS EAIKLATT ++P P +N+IA+EAP G YT I+ IL TA+ Sbjct: 175 NNFAQASEEAIKLATT----PLNP----PTLTNLIAMEAPACGRDFYTQDEIEYILRTAF 226 Query: 795 SGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNIT 974 +GF AAK E+ ++ +IHTG WGCGAYGGN Sbjct: 227 TGFSAAKIETKADE----------------------------TVIHTGFWGCGAYGGNRV 258 Query: 975 IMSCLQIAAAHLAGIDKIIYHAMGNQDECNSGLQVYNELIKDVEGDV----KIDDFIKNV 1142 +MS LQ+ AA ++ +D++++H +SGL+ + + +E D+ ID I + Sbjct: 259 LMSLLQLIAAVMSQVDRLVFHT------GSSGLEDFQRACRILEEDLASLPNIDSVINKL 312 Query: 1143 ELKDFQWGLSNG 1178 F+WG+S+G Sbjct: 313 TEMKFEWGISDG 324 >gb|KUK49216.1| Uncharacterized protein XD74_0174 [Actinobacteria bacterium 66_15] Length = 332 Score = 213 bits (543), Expect = 1e-61 Identities = 137/371 (36%), Positives = 197/371 (53%), Gaps = 2/371 (0%) Frame = +3 Query: 75 SFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNH 254 +F L+ ++PP + NK++++E+A EG++ G++ YT+W F LP E + Sbjct: 11 TFDVATLMAEHPPLIHHPNKRVVFEIACGEGSEC-SGEIGYTQWPAFSLP-----ERVDP 64 Query: 255 QGGKEMEIEVKNDVFDYSSPED-DNTVNWYINFADKHVFGYYGGNLFAQDESQCLEHPAL 431 G + +E + + DY ED V W++NFAD+ +F YG LFAQDE QC EHP L Sbjct: 65 TAGLDA-LESRCGIMDYEPVEDFPGAVEWHVNFADQMLFFAYGSGLFAQDEMQCAEHPVL 123 Query: 432 CSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRPFGLY 611 +L + L + + TA TP+LV GVER+++VKT++N + RP GLY Sbjct: 124 GALVEALRADGRRAVTETADG---------PTPVLVTGVERRVKVKTNMNAKKGRPRGLY 174 Query: 612 GNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKILATA 791 GN+FA ASPEA++ AT KRIDP P +NIIA+ AP + +G Y I++IL TA Sbjct: 175 GNEFAVASPEAVQRAT----KRIDP----PTITNIIAMAAPTYRNGRYERSMIERILVTA 226 Query: 792 YSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNI 971 
Y+GF AA ES P V IHTG WGCGA+GGN Sbjct: 227 YTGFAAAVAES------------------------RRMAPGAPVAIHTGYWGCGAFGGNR 262 Query: 972 TIMSCLQIAAAHLAGIDKIIYHAMGNQD-ECNSGLQVYNELIKDVEGDVKIDDFIKNVEL 1148 +MS LQ+ AA +A + + +H +D ++ E + E + DD I ++ Sbjct: 263 VLMSLLQLLAAGMAEVTCLAFHTANAEDAPLVEATRIITEDLSSGE-SLSADDLIDRIDA 321 Query: 1149 KDFQWGLSNGT 1181 F+WG S GT Sbjct: 322 MAFEWGRSEGT 332 >ref|WP_106301821.1| hypothetical protein [Chamaesiphon polymorphus] gb|PSB57937.1| hypothetical protein C7B77_06640 [Chamaesiphon polymorphus CCALA 037] Length = 355 Score = 208 bits (529), Expect = 2e-59 Identities = 141/395 (35%), Positives = 200/395 (50%), Gaps = 7/395 (1%) Frame = +3 Query: 15 FIMELHQILKVIKN---HLTHHESFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFH- 182 F E+ +L + N +L +F A +L+ +P K NKK+++ +A P +S H Sbjct: 11 FFWEIAYLLDRMNNSSDNLICRHTFNAQQLVDSHPAKIRNANKKIVHRIACPP--NSIHQ 68 Query: 183 GKLEYTRWEEFPLPKSFKGESDNHQGGKEMEIEVKNDVFDYS-SPEDDNTVNWYINFADK 359 G++ ++RW L + +D EI+ + F Y S D V WY+NFA Sbjct: 69 GEIVFSRWRSIELAEISPSLTDR------TEIQEQKSYFKYPRSQHRDRLVEWYLNFAHS 122 Query: 360 HVFGYYGGNLFAQDESQCLEHPALCSLRDYLVKGDQESRFTTARPILTKGSSFISTPILV 539 +F YG +FAQDE Q EHP L SLR+ L+ + FT R TPIL+ Sbjct: 123 DLFCAYGERVFAQDEMQVAEHPVLASLREALLDAKIDP-FTVERGE--------PTPILI 173 Query: 540 RGVERKIQVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNII 719 RG+ER+ ++ T+++ + RP GLYGN FA A AI+LATT IDP P +NII Sbjct: 174 RGIERRCEIATNIDSEQGRPLGLYGNNFAKAPAAAIELATT----PIDP----PTITNII 225 Query: 720 AIEAPKHGHGNYTFGTIKKILATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEE 899 A+EAP G+ Y + I+ IL TA +GF AAK ES +E Sbjct: 226 AMEAPSGGYNFYEYDIIEFILTTAVTGFTAAKIESQLE---------------------- 263 Query: 900 ESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIIYHAMGNQDECNSGLQV 1079 + P V IHTG WGCGAYGGN +M+ LQ+ AA LA IDK+++H D L Sbjct: 264 --IASPIVSIHTGFWGCGAYGGNRILMALLQLLAARLAQIDKLVFHT--TDDAGAKALAT 319 Query: 1080 YNELI--KDVEGDVKIDDFIKNVELKDFQWGLSNG 1178 +I + V + I + + K F+WG+ +G Sbjct: 320 ARSIIDRELVIAEASIPQILDKIYAKAFKWGIGDG 354 >gb|APR82132.1| Hypothetical protein 
A7982_07481 [Minicystis rosea] Length = 337 Score = 207 bits (527), Expect = 3e-59 Identities = 138/375 (36%), Positives = 198/375 (52%), Gaps = 7/375 (1%) Frame = +3 Query: 75 SFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNH 254 +F A L+ +PP++ K+K+L+Y +++P G L ++R PL +H Sbjct: 13 TFDAAALVAAHPPRFTHKHKQLVYALSSPPSRPP-QGALVFSRHHAMPL--------GDH 63 Query: 255 QGGKEMEIEVKNDVFDY-----SSPEDDNTVNWYINFADKHVFGYYGGNLFAQDESQCLE 419 +E++ DVF Y SSP TV WY+NFAD +F YGG+L AQDE Q LE Sbjct: 64 LPAAAPTVEMREDVFGYEPLPKSSPP---TVEWYLNFADPQLFVAYGGSLLAQDELQVLE 120 Query: 420 HPALCSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRP 599 HPAL SL ++L + + RF P+ G + +TP+LVRGVER+ TD + E RP Sbjct: 121 HPALGSLCEHL-RASPDPRFA---PLTHDGDA--ATPVLVRGVERRCAFATDPDLLEGRP 174 Query: 600 FGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKI 779 GLYGN+FA AS +AI+ A T+ D P SNI+A+ AP G G Y+ I+ I Sbjct: 175 LGLYGNRFARASEDAIRRAVTVLDP--------PTLSNILAMAAPPGGTGAYSIDEIRSI 226 Query: 780 LATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGAY 959 L TA +GF AA+ ES + + V+IHTG+WG GA+ Sbjct: 227 LTTATTGFSAARIES------------------------DLAAKGAAVVIHTGHWGTGAF 262 Query: 960 GGNITIMSCLQIAAAHLAGIDKIIYHAMGN--QDECNSGLQVYNELIKDVEGDVKIDDFI 1133 GGN +M+ LQ+ AA LA ID+++YH + D G + +L+ D + + D I Sbjct: 263 GGNKVLMTILQLLAARLARIDRLVYHTFDSTGSDAFQEGAKRLAKLLPD-GASMPVADMI 321 Query: 1134 KNVELKDFQWGLSNG 1178 + + F WG S+G Sbjct: 322 QKLFRIGFVWGESDG 336 >gb|PRP77223.1| hypothetical protein PROFUN_15117 [Planoprotostelium fungivorum] Length = 344 Score = 200 bits (509), Expect = 1e-56 Identities = 130/377 (34%), Positives = 194/377 (51%), Gaps = 11/377 (2%) Frame = +3 Query: 84 ANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNHQGG 263 + +L K PK NK+L+ +M EG + G L + RW P S + + +G Sbjct: 19 SKKLWKNFRPKMQSSNKRLLLDMIDKEG-EKPQGDLIFERWSIIQ-PSSLSLDPERLKG- 75 Query: 264 KEMEIEVKNDVFDYSSPEDDNTVNWYINFADKHVFGYYGGNLFAQDESQCLEHPALCSLR 443 + +E D++ Y + +T ++Y+NFAD +FG+YGG LFAQDE Q EHP L SLR Sbjct: 76 
--LIVEETLDIYRY---QQTDTEDYYVNFADASLFGFYGGPLFAQDEHQVAEHPILGSLR 130 Query: 444 DYL-VKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRPFGLYGNQ 620 +L ++ E++ A P G + +TP L+ +R + ++T + + R +YGN Sbjct: 131 RWLDLEAASETKNKEAIPWTKIGDN--ATPCLIFNAQRSLVIETQADPTKGRQ-SIYGNS 187 Query: 621 FAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKILATAYSG 800 F++ASP I+ ATT+ K G ++N IAIEAPKHGHG Y I+ I TA+SG Sbjct: 188 FSYASPATIRAATTVITKETAELNGLRSHNNFIAIEAPKHGHGTYDRSEIEYIFFTAFSG 247 Query: 801 FLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNITIM 980 F AA+ S + +IHTGNWGCGA+GGN +IM Sbjct: 248 FEAARLHS-----------------------------GDKTVIHTGNWGCGAFGGNGSIM 278 Query: 981 SCLQIAAAHLAGIDKIIYHAMGNQDECNSGLQVYNELIKDVEGDVKIDD----------F 1130 + LQIAAA ++G+ KI+YH Q + L + EG K+ D Sbjct: 279 AMLQIAAAAMSGVKKIVYHTFD---------QKHTRLFR--EGQKKLQDLWNSRRDLHAL 327 Query: 1131 IKNVELKDFQWGLSNGT 1181 + ++ +++QWG+ NGT Sbjct: 328 LAAIQEEEYQWGVGNGT 344 >ref|WP_050430843.1| hypothetical protein [Chondromyces crocatus] gb|AKT38636.1| uncharacterized protein CMC5_027830 [Chondromyces crocatus] Length = 337 Score = 196 bits (499), Expect = 3e-55 Identities = 134/374 (35%), Positives = 190/374 (50%), Gaps = 6/374 (1%) Frame = +3 Query: 75 SFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNH 254 SF A+EL+ +PP+ NK++I+++A P G G L +RW P+P + Sbjct: 13 SFDAHELVTSHPPRLANPNKQVIHDIACPPGT-KHGGTLVVSRWWALPVPAQLPSHTP-- 69 Query: 255 QGGKEMEIEVKNDVFDYSSPEDDNT---VNWYINFADKHVFGYYGGNLFAQDESQCLEHP 425 E + F Y T + WY+NFAD ++F YGG LFAQDE Q EHP Sbjct: 70 ------EFVLDRSFFTYEPETASGTAPQMAWYVNFADVNLFFGYGGPLFAQDELQTAEHP 123 Query: 426 ALCSLRDYL-VKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRPF 602 AL SLR+ L V D F AR K + TP+L+RGVER+ + D P Sbjct: 124 ALGSLREALKVSADP---FVKARTRENK----VPTPVLIRGVERRCAI--DTLHPAALPD 174 Query: 603 GLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKIL 782 GLYGN+FA A+ + I+ AT + +DP P SN+IA+EA G YT I ++ Sbjct: 175 GLYGNRFARATADVIRKAT----RPLDP----PTVSNLIAMEAIPGASGRYTAEQIADVV 226 Query: 783 
ATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGAYG 962 TA++GF AA+ ES + G E P+V IHTG+WG GA+G Sbjct: 227 QTAFTGFTAARLESQLATGHAE----------------------PQVTIHTGHWGTGAFG 264 Query: 963 GNITIMSCLQIAAAHLAGIDKIIYHAMGNQDE--CNSGLQVYNELIKDVEGDVKIDDFIK 1136 GN +M+CLQ+ AA LAG+ ++++H + Q E C L + E + + + + Sbjct: 265 GNKVLMACLQMLAARLAGLSRLVFHTVDAQGEAACREALGLLEERL--LPAHASTHELLG 322 Query: 1137 NVELKDFQWGLSNG 1178 +E F WGLS+G Sbjct: 323 ALESMGFGWGLSDG 336 >dbj|GBD32688.1| hypothetical protein HRbin33_01662 [bacterium HR33] Length = 339 Score = 196 bits (499), Expect = 4e-55 Identities = 127/377 (33%), Positives = 192/377 (50%), Gaps = 4/377 (1%) Frame = +3 Query: 60 LTHHESFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKSFKG 239 L SF + L+ ++PP W+ NK L++E+A P G+ + G + Y+RW +P Sbjct: 11 LIERASFETSRLMAEHPPVWHHPNKALVFEIACPSGS-VYRGTVRYSRWRGL-VPGCLWD 68 Query: 240 ESDNHQGGKEMEIEVKNDVFDYSSPED-DNTVNWYINFADKHVFGYYGGNLFAQDESQCL 416 + + K +DYS D +V W++NFAD H+F YG LFAQDE Q Sbjct: 69 AA-----AALRRVRSKAGFYDYSEQSDLPGSVEWHVNFADPHLFVAYGSGLFAQDEMQVA 123 Query: 417 EHPALCSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKR 596 EHPAL +LR+ L+ + P+ + TP+LV GVER+ ++ T+ + R Sbjct: 124 EHPALGALREALLA-------RGSLPLTVEAGG--PTPVLVMGVERRCRIATEPDPLAGR 174 Query: 597 PFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKK 776 P GLYGN+FA A P+ ++ AT R+DP P SNIIA+ + G G Y ++ Sbjct: 175 PHGLYGNRFAAAPPDVVRRATV----RLDP----PTISNIIAMASLPGGDGRYAPEEVRY 226 Query: 777 ILATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGA 956 +L+TAYSGF AA ES + G P V++HTG WGCGA Sbjct: 227 LLSTAYSGFRAAVLESHRDGG-----------------------PAVPVVVHTGFWGCGA 263 Query: 957 YGGNITIMSCLQIAAAHLAGIDKIIYHAMGNQDECNSGLQVYNELIKDV---EGDVKIDD 1127 +GGN +M+ +QI AA AG++++++H E + L+ L+ + E + D Sbjct: 264 FGGNRVLMALIQILAAGAAGVERLVFHTGDPAGEIS--LEQARALLAEKLGGEQPLSTDA 321 Query: 1128 FIKNVELKDFQWGLSNG 1178 + + F WG+S+G Sbjct: 322 LVARLVGLGFTWGVSDG 338 >ref|WP_082838724.1| hypothetical protein [Gemmata sp. 
SH-PL17] gb|AMV23751.1| Poly (ADP-ribose) glycohydrolase (PARG) [Gemmata sp. SH-PL17] Length = 333 Score = 193 bits (491), Expect = 5e-54 Identities = 130/371 (35%), Positives = 187/371 (50%), Gaps = 4/371 (1%) Frame = +3 Query: 78 FIANELIKQNPPKWYCKNKKLIYEMATPEGADSFH-GKLEYTRWEEFPLPKSFKGESDNH 254 F A L+ + PP++ NKK++Y ++ P D+ H G++ ++RW P + Sbjct: 13 FDAVALVAEFPPRFSHPNKKVVYGISCPP--DAVHSGRVTFSRWAAVAPPSEVPQNATT- 69 Query: 255 QGGKEMEIEVKNDVFDYSS-PEDDNTVNWYINFADKHVFGYYGGNLFAQDESQCLEHPAL 431 IE + D F Y P V WY+NF+ +F YGG LFAQDE Q EHPAL Sbjct: 70 -------IEPREDYFGYEPVPAGLGRVEWYLNFSHYDLFCAYGGGLFAQDEMQVTEHPAL 122 Query: 432 CSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRPFGLY 611 SLR+ L++ E P+ K S TP+LV GVER+ +V + + RPFGLY Sbjct: 123 GSLREALLQSGVE-------PLTVKDRS--PTPVLVTGVERRCRVAINPDAALGRPFGLY 173 Query: 612 GNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKILATA 791 GN FA A P+ I AT + + P P +N++A+EAP G G YT I+ +L TA Sbjct: 174 GNNFARAKPDVIARAT----EALVP----PTITNVLAMEAPTGGSGRYTRSAIEYVLRTA 225 Query: 792 YSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNI 971 ++GFLAA+ ES G L + P V++HTG WGCGAYGG+ Sbjct: 226 HTGFLAARIES----GRLSAS--------------------PEVVVHTGYWGCGAYGGHR 261 Query: 972 TIMSCLQIAAAHLAGIDKIIYHA--MGNQDECNSGLQVYNELIKDVEGDVKIDDFIKNVE 1145 T+M+ LQI AA A +D++++H + L V + E + I ++ Sbjct: 262 TLMALLQILAARTAQLDRLVFHTGDAAGSATLRNALVVSERDLGLRETPTPLAAVIDKLD 321 Query: 1146 LKDFQWGLSNG 1178 F WG+ +G Sbjct: 322 AMAFHWGVGDG 332 >ref|XP_004345553.2| tyrosyl-tRNA synthetase [Capsaspora owczarzaki ATCC 30864] gb|KJE95514.1| tyrosyl-tRNA synthetase [Capsaspora owczarzaki ATCC 30864] Length = 365 Score = 194 bits (493), Expect = 5e-54 Identities = 132/385 (34%), Positives = 186/385 (48%), Gaps = 19/385 (4%) Frame = +3 Query: 84 ANELIKQNPPKWYCKNKKLIYEMATPEGADS------------FHGKLEYTRW-----EE 212 A +L+++ PP+ +NKKL++E+++ G + F G + TRW +E Sbjct: 20 ARDLVRRCPPRLQARNKKLVFELSSESGECTLVPKASIANPAPFEGDVRVTRWRAPHHDE 79 Query: 213 
FPLPKSFKGESDNHQGGKEMEIEVKNDVFDYSSPEDDN--TVNWYINFADKHVFGYYGGN 386 P F G EM +D + D V WY+NFAD +VFG+YGG Sbjct: 80 MPATLGFDA------GTVEMSFPASIFAYDIPTTSQDGKPVVPWYVNFADSNVFGFYGGG 133 Query: 387 LFAQDESQCLEHPALCSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQV 566 L+AQDE Q EHP L S+R L D LT + TPILV V+R++ V Sbjct: 134 LYAQDEMQVTEHPILGSVRQMLENLDLSKN--PKMKALTMETQ--PTPILVENVQRRVVV 189 Query: 567 KTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGH 746 D + P GLYGN FA AS E I AT + + P SNIIAI A +G Sbjct: 190 --DTFPSAAAPGGLYGNAFASASFETIVQATHVLNP--------PTMSNIIAIAAQGYGF 239 Query: 747 GNYTFGTIKKILATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVI 926 G Y I TAY+GF AA S++ G + +V+ Sbjct: 240 GEYALPVINFSFLTAYTGFAAAVASSWLRLG------------------KPADRKSFKVV 281 Query: 927 IHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIIYHAMGNQDECNSGLQVYNELIKDVE 1106 I+TGNWGCGA+GGN T+M+ +Q AAA AG+D++IY + N +++NEL+ + Sbjct: 282 INTGNWGCGAFGGNPTMMALIQFAAAQAAGVDELIYSTVMPSPAVNRAREIWNELVPTLR 341 Query: 1107 GDVKIDDFIKNVELKDFQWGLSNGT 1181 D + ++ E +WG+SNGT Sbjct: 342 -DKPVGAWLGAFEKLRLRWGVSNGT 365 >ref|WP_006971973.1| hypothetical protein [Plesiocystis pacifica] gb|EDM78917.1| tyrosyl-tRNA synthetase [Plesiocystis pacifica SIR-1] Length = 322 Score = 191 bits (486), Expect = 2e-53 Identities = 124/368 (33%), Positives = 183/368 (49%), Gaps = 3/368 (0%) Frame = +3 Query: 84 ANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNHQGG 263 A EL++ +PP W NKK+I ++ P A+ G++ TRW LP++ S Sbjct: 15 AAELVRSHPPVWRDANKKVIAALSCPADAEH-RGQIRVTRWRAGALPETLPESSP----- 68 Query: 264 KEMEIEVKNDVFDYSSPEDDNTVNWYINFADKHVFGYYGGNLFAQDESQCLEHPALCSLR 443 +E D+F Y+ P D +WY+NFAD+ +F YG L AQDE Q EHPAL S+ Sbjct: 69 ---ALEAHADLFGYA-PAPDGETHWYLNFADRRLFIAYGSGLLAQDELQVAEHPALGSVA 124 Query: 444 DYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRPFGLYGNQF 623 + + + T TPILV GVER+ + T + + R +GLYG++F Sbjct: 125 EAMAALPDQVPLTAEDE---------PTPILVAGVERRCVLDTAPDLDAGRVYGLYGHRF 175 Query: 624 AHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKILATAYSGF 803 A+P+ ++ A T+ D P 
SNI+AIEAP G YT I+ IL TA +G+ Sbjct: 176 QRATPDEVRGAVTVLDP--------PTVSNILAIEAPTAYRGAYTAKQIRFILRTAVAGY 227 Query: 804 LAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNITIMS 983 AA ES ++IHTG WGCGAYGGN +M+ Sbjct: 228 RAAALES-----------------------------AGALVIHTGFWGCGAYGGNRELMA 258 Query: 984 CLQIAAAHLAGIDKIIYHAMGNQDECNSGLQVY---NELIKDVEGDVKIDDFIKNVELKD 1154 LQI AA +AG+ ++++HA ++ GL +Y + ++ G +D I + Sbjct: 259 LLQILAARIAGVSRLVFHAFDSE-----GLALYRAGEAVAAELAGLTSLDQAIAVIVELG 313 Query: 1155 FQWGLSNG 1178 +QWG+S+G Sbjct: 314 YQWGVSDG 321 >ref|XP_002677363.1| predicted protein [Naegleria gruberi] gb|EFC44619.1| predicted protein [Naegleria gruberi] Length = 358 Score = 189 bits (480), Expect = 4e-52 Identities = 127/377 (33%), Positives = 198/377 (52%), Gaps = 14/377 (3%) Frame = +3 Query: 90 ELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNHQGGKE 269 E+I++ PP ++ KNK +Y + + + + G L Y+R++ PK++K E Q Sbjct: 25 EIIEKFPPSFFSKNKVFLYSLHSDQL--DYEGDLIYSRFKARTRPKAYKKEVAEDQ---V 79 Query: 270 MEIEVKNDVFDYSSP---EDDNTVNWYINFADKHVFGYYGGNLFAQDESQCLEHPALCSL 440 E+ V +D F Y ++N +WY+NFAD+ +F + G LFAQDE Q EHP L SL Sbjct: 80 CEVIVSSDGFKYDEKLKVGEENAKHWYLNFADERLFIAWKGQLFAQDEIQVCEHPILGSL 139 Query: 441 RDYLVKGD-QESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRPFGLYGN 617 +YL K ++R++ PI + S TP+L++ V+R+I V + + +YGN Sbjct: 140 CEYLRKESLNDARYS---PITQQTSP---TPVLIQNVDRRIAVNV-------KDYNIYGN 186 Query: 618 QFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKILATAYS 797 FA AS + I+ ATT+ D + + K SNI+AI AP+ GHG Y G + I T YS Sbjct: 187 NFAKASTDIIEQATTVLDMKTNRK------SNILAISAPRGGHGEYKLGEVNFIFDTLYS 240 Query: 798 GFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNITI 977 GF A + T++ + E+ P V+IHTGN+GCGA+G N + Sbjct: 241 GFKACCMD------------------TEMYAQDPEN--PPTVVIHTGNFGCGAFGNNREL 280 Query: 978 MSCLQIAAAHLAGIDKIIYHAMGNQ--DECNSGLQVYNELI-------KDVEGD-VKIDD 1127 ++ LQI AA +AGI + YHA + ++V +E D+EG+ V ++ Sbjct: 281 IAILQILAARMAGIKYLYYHAFSEEGVKSVKKAIKVIDEEFDLVCKDPSDIEGNLVSLNK 340 Query: 1128 FIKNVELKDFQWGLSNG 
1178 + K ++WG S+G Sbjct: 341 LFDMILQKGYKWGFSDG 357 >ref|WP_084207322.1| hypothetical protein [Sulfuritalea hydrogenivorans] dbj|BAO29669.1| hypothetical protein SUTH_01877 [Sulfuritalea hydrogenivorans sk43H] Length = 327 Score = 185 bits (469), Expect = 7e-51 Identities = 126/374 (33%), Positives = 180/374 (48%), Gaps = 2/374 (0%) Frame = +3 Query: 63 THHESFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKSFKGE 242 T + + + ++ K PK++ NK+ ++++A G + +G L Y+RW PLP Sbjct: 6 TARKQWDSYQISKNMMPKFHHSNKEFLFKLAFSNGFSN-NGTLGYSRWSSRPLPTLLT-- 62 Query: 243 SDNHQGGKEMEIEVKNDVFDYSSPEDDNTVNWYINFADKHVFGYYGGNLFAQDESQCLEH 422 +G E E+ + FDY W++NFA+ +F + +L AQDE Q EH Sbjct: 63 ----EG--ETEVLQRPGFFDYEVSSSPQAAEWHMNFANNEIFSAWATSLLAQDELQVAEH 116 Query: 423 PALCSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRPF 602 PAL +R +K L TPIL+ GVER++ + T N+ P Sbjct: 117 PALIGMRIEAMKEGIS---------LWSVEDCAPTPILITGVERRLSIDTSPNEGAGIPH 167 Query: 603 GLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKIL 782 G+YGN F +AS I ATT+ I P T SNI+AIEAP +G G YT +I IL Sbjct: 168 GIYGNYFRNASESQIARATTV----ITPPTN----SNILAIEAPAYGSGRYTSNSIAFIL 219 Query: 783 ATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGAYG 962 +TAYSGF A ES + S+ V IH+G WGCGAYG Sbjct: 220 STAYSGFSAVLDES------------------------QLSLNASTVRIHSGFWGCGAYG 255 Query: 963 GNITIMSCLQIAAAHLAGIDKIIYHAMGNQDEC--NSGLQVYNELIKDVEGDVKIDDFIK 1136 GN +M LQ+ AA LAGID+I +H ++Y + + G++ +D I Sbjct: 256 GNRVLMLLLQMVAARLAGIDQITFHTGDGSGSLPFRESYEIYQRIQR---GNLPVDQVID 312 Query: 1137 NVELKDFQWGLSNG 1178 V F+WG S+G Sbjct: 313 LVADYHFEWGESDG 326