BLASTX nr result

ID: Ophiopogon26_contig00038785 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon26_contig00038785
         (1403 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|POG81663.1| hypothetical protein GLOIN_2v1504207 [Rhizophagus...   780   0.0  
gb|EXX72146.1| hypothetical protein RirG_072170 [Rhizophagus irr...   766   0.0  
gb|PKY45927.1| hypothetical protein RhiirA4_401815 [Rhizophagus ...   680   0.0  
dbj|GBC40160.1| tyrosyl-tRNA synthetase [Rhizophagus irregularis...   679   0.0  
ref|XP_004333807.1| Tyrosyl-tRNA synthetase [Acanthamoeba castel...   246   2e-74
ref|WP_096830388.1| hypothetical protein [Tychonema bourrellyi] ...   241   2e-72
ref|WP_038073215.1| hypothetical protein [Tolypothrix bouteillei...   229   1e-67
ref|WP_069966748.1| hypothetical protein [Desertifilum sp. IPPAS...   227   4e-67
gb|KHD05220.1| hypothetical protein OT06_49285 [Candidatus Thiom...   216   5e-63
gb|KUK49216.1| Uncharacterized protein XD74_0174 [Actinobacteria...   213   1e-61
ref|WP_106301821.1| hypothetical protein [Chamaesiphon polymorph...   208   2e-59
gb|APR82132.1| Hypothetical protein A7982_07481 [Minicystis rosea]    207   3e-59
gb|PRP77223.1| hypothetical protein PROFUN_15117 [Planoprotostel...   200   1e-56
ref|WP_050430843.1| hypothetical protein [Chondromyces crocatus]...   196   3e-55
dbj|GBD32688.1| hypothetical protein HRbin33_01662 [bacterium HR33]   196   4e-55
ref|WP_082838724.1| hypothetical protein [Gemmata sp. SH-PL17] >...   193   5e-54
ref|XP_004345553.2| tyrosyl-tRNA synthetase [Capsaspora owczarza...   194   5e-54
ref|WP_006971973.1| hypothetical protein [Plesiocystis pacifica]...   191   2e-53
ref|XP_002677363.1| predicted protein [Naegleria gruberi] >gi|28...   189   4e-52
ref|WP_084207322.1| hypothetical protein [Sulfuritalea hydrogeni...   185   7e-51

>gb|POG81663.1| hypothetical protein GLOIN_2v1504207 [Rhizophagus irregularis DAOM
            181602=DAOM 197198]
          Length = 446

 Score =  780 bits (2015), Expect = 0.0
 Identities = 375/396 (94%), Positives = 383/396 (96%), Gaps = 3/396 (0%)
 Frame = +3

Query: 3    HLFHFIMELHQILKVIKNHLTHHESFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFH 182
            HLFHFIMEL QI+KVIKNHLTHHESFIANEL+KQNPPKWYCKNKKLIYEMATPEGADSFH
Sbjct: 51   HLFHFIMELRQIIKVIKNHLTHHESFIANELVKQNPPKWYCKNKKLIYEMATPEGADSFH 110

Query: 183  GKLEYTRWEEFPLPKSFKGESDNHQGGKEMEIEVKNDVFDYSSPEDDNTVNWYINFADKH 362
            GKLEYTRWEEFPLPKSFKGESDN QGGKEM+IEVKNDVFDYSSPED+NTVNWYINFADKH
Sbjct: 111  GKLEYTRWEEFPLPKSFKGESDNPQGGKEMKIEVKNDVFDYSSPEDNNTVNWYINFADKH 170

Query: 363  VFGYYGGNLFAQDESQCLEHPALCSLRDYLVKGDQESRFTTARPILTK---GSSFISTPI 533
            VFGYYGGNLFAQDESQCLEHP LCSLRDYLVKGDQES FTTARPILTK    SSFISTPI
Sbjct: 171  VFGYYGGNLFAQDESQCLEHPVLCSLRDYLVKGDQESCFTTARPILTKDSISSSFISTPI 230

Query: 534  LVRGVERKIQVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSN 713
            LVRGVERK+QVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSN
Sbjct: 231  LVRGVERKVQVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSN 290

Query: 714  IIAIEAPKHGHGNYTFGTIKKILATAYSGFLAAKFESFVEKGILERNHEENQEHTKININ 893
            IIAIEAPKHGHG YTF TIK+ILAT YSGFLAAKFES VEKGILERNHEENQEHTKININ
Sbjct: 291  IIAIEAPKHGHGKYTFNTIKRILATGYSGFLAAKFESLVEKGILERNHEENQEHTKININ 350

Query: 894  EEESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIIYHAMGNQDECNSGL 1073
            EEESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKI+YHAMGNQDE NSGL
Sbjct: 351  EEESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIVYHAMGNQDEFNSGL 410

Query: 1074 QVYNELIKDVEGDVKIDDFIKNVELKDFQWGLSNGT 1181
            Q+YNELIKDVEGDVKIDDFIKNVELKDFQWGLSNGT
Sbjct: 411  QIYNELIKDVEGDVKIDDFIKNVELKDFQWGLSNGT 446


>gb|EXX72146.1| hypothetical protein RirG_072170 [Rhizophagus irregularis DAOM
            197198w]
 gb|PKC11192.1| hypothetical protein RhiirA5_413392 [Rhizophagus irregularis]
 gb|PKC74051.1| hypothetical protein RhiirA1_450433 [Rhizophagus irregularis]
 gb|PKY17737.1| hypothetical protein RhiirB3_430415 [Rhizophagus irregularis]
          Length = 390

 Score =  766 bits (1979), Expect = 0.0
 Identities = 369/390 (94%), Positives = 377/390 (96%), Gaps = 3/390 (0%)
 Frame = +3

Query: 21   MELHQILKVIKNHLTHHESFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYT 200
            MEL QI+KVIKNHLTHHESFIANEL+KQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYT
Sbjct: 1    MELRQIIKVIKNHLTHHESFIANELVKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYT 60

Query: 201  RWEEFPLPKSFKGESDNHQGGKEMEIEVKNDVFDYSSPEDDNTVNWYINFADKHVFGYYG 380
            RWEEFPLPKSFKGESDN QGGKEM+IEVKNDVFDYSSPED+NTVNWYINFADKHVFGYYG
Sbjct: 61   RWEEFPLPKSFKGESDNPQGGKEMKIEVKNDVFDYSSPEDNNTVNWYINFADKHVFGYYG 120

Query: 381  GNLFAQDESQCLEHPALCSLRDYLVKGDQESRFTTARPILTK---GSSFISTPILVRGVE 551
            GNLFAQDESQCLEHP LCSLRDYLVKGDQES FTTARPILTK    SSFISTPILVRGVE
Sbjct: 121  GNLFAQDESQCLEHPVLCSLRDYLVKGDQESCFTTARPILTKDSISSSFISTPILVRGVE 180

Query: 552  RKIQVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEA 731
            RK+QVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEA
Sbjct: 181  RKVQVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEA 240

Query: 732  PKHGHGNYTFGTIKKILATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVP 911
            PKHGHG YTF TIK+ILAT YSGFLAAKFES VEKGILERNHEENQEHTKININEEESVP
Sbjct: 241  PKHGHGKYTFNTIKRILATGYSGFLAAKFESLVEKGILERNHEENQEHTKININEEESVP 300

Query: 912  KPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIIYHAMGNQDECNSGLQVYNEL 1091
            KPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKI+YHAMGNQDE NSGLQ+YNEL
Sbjct: 301  KPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIVYHAMGNQDEFNSGLQIYNEL 360

Query: 1092 IKDVEGDVKIDDFIKNVELKDFQWGLSNGT 1181
            IKDVEGDVKIDDFIKNVELKDFQWGLSNGT
Sbjct: 361  IKDVEGDVKIDDFIKNVELKDFQWGLSNGT 390


>gb|PKY45927.1| hypothetical protein RhiirA4_401815 [Rhizophagus irregularis]
          Length = 347

 Score =  680 bits (1755), Expect = 0.0
 Identities = 330/347 (95%), Positives = 335/347 (96%), Gaps = 3/347 (0%)
 Frame = +3

Query: 150  MATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNHQGGKEMEIEVKNDVFDYSSPEDDNT 329
            MATPEGADSFHGKLEYTRWEEFPLPKSFKGESDN QGGKEM+IEVKNDVFDYSSPEDDNT
Sbjct: 1    MATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNPQGGKEMKIEVKNDVFDYSSPEDDNT 60

Query: 330  VNWYINFADKHVFGYYGGNLFAQDESQCLEHPALCSLRDYLVKGDQESRFTTARPILTK- 506
            VNWYINFADKHVFGYYGGNLFAQDESQCLEHP LCSLRDYLVKGDQES FTTARPILTK 
Sbjct: 61   VNWYINFADKHVFGYYGGNLFAQDESQCLEHPVLCSLRDYLVKGDQESCFTTARPILTKD 120

Query: 507  --GSSFISTPILVRGVERKIQVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRI 680
               SSFISTPILVRGVERK+QVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRI
Sbjct: 121  SISSSFISTPILVRGVERKVQVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRI 180

Query: 681  DPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKILATAYSGFLAAKFESFVEKGILERNHE 860
            DPKTGNPYYSNIIAIEAPK+GHG YTF TIKKILAT YSGFLAAKFES VEKGILERNHE
Sbjct: 181  DPKTGNPYYSNIIAIEAPKYGHGKYTFNTIKKILATGYSGFLAAKFESLVEKGILERNHE 240

Query: 861  ENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIIYHA 1040
            ENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKI+YHA
Sbjct: 241  ENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIVYHA 300

Query: 1041 MGNQDECNSGLQVYNELIKDVEGDVKIDDFIKNVELKDFQWGLSNGT 1181
            MGNQDE NSGLQ+YNELIKDVEGDVKIDDFIKNVELKDFQWGLSNGT
Sbjct: 301  MGNQDEFNSGLQIYNELIKDVEGDVKIDDFIKNVELKDFQWGLSNGT 347


>dbj|GBC40160.1| tyrosyl-tRNA synthetase [Rhizophagus irregularis DAOM 181602]
 gb|PKK79190.1| hypothetical protein RhiirC2_727940 [Rhizophagus irregularis]
          Length = 347

 Score =  679 bits (1753), Expect = 0.0
 Identities = 329/347 (94%), Positives = 335/347 (96%), Gaps = 3/347 (0%)
 Frame = +3

Query: 150  MATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNHQGGKEMEIEVKNDVFDYSSPEDDNT 329
            MATPEGADSFHGKLEYTRWEEFPLPKSFKGESDN QGGKEM+IEVKNDVFDYSSPED+NT
Sbjct: 1    MATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNPQGGKEMKIEVKNDVFDYSSPEDNNT 60

Query: 330  VNWYINFADKHVFGYYGGNLFAQDESQCLEHPALCSLRDYLVKGDQESRFTTARPILTK- 506
            VNWYINFADKHVFGYYGGNLFAQDESQCLEHP LCSLRDYLVKGDQES FTTARPILTK 
Sbjct: 61   VNWYINFADKHVFGYYGGNLFAQDESQCLEHPVLCSLRDYLVKGDQESCFTTARPILTKD 120

Query: 507  --GSSFISTPILVRGVERKIQVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRI 680
               SSFISTPILVRGVERK+QVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRI
Sbjct: 121  SISSSFISTPILVRGVERKVQVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRI 180

Query: 681  DPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKILATAYSGFLAAKFESFVEKGILERNHE 860
            DPKTGNPYYSNIIAIEAPKHGHG YTF TIK+ILAT YSGFLAAKFES VEKGILERNHE
Sbjct: 181  DPKTGNPYYSNIIAIEAPKHGHGKYTFNTIKRILATGYSGFLAAKFESLVEKGILERNHE 240

Query: 861  ENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIIYHA 1040
            ENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKI+YHA
Sbjct: 241  ENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIVYHA 300

Query: 1041 MGNQDECNSGLQVYNELIKDVEGDVKIDDFIKNVELKDFQWGLSNGT 1181
            MGNQDE NSGLQ+YNELIKDVEGDVKIDDFIKNVELKDFQWGLSNGT
Sbjct: 301  MGNQDEFNSGLQIYNELIKDVEGDVKIDDFIKNVELKDFQWGLSNGT 347


>ref|XP_004333807.1| Tyrosyl-tRNA synthetase [Acanthamoeba castellanii str. Neff]
 gb|ELR11794.1| Tyrosyl-tRNA synthetase [Acanthamoeba castellanii str. Neff]
          Length = 332

 Score =  246 bits (628), Expect = 2e-74
 Identities = 154/386 (39%), Positives = 206/386 (53%), Gaps = 10/386 (2%)
 Frame = +3

Query: 51   KNHLTHHESFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKS 230
            ++++T   SF ANEL   +PP+W   NKKL+Y +A P GADS  G L Y+R++E PLP  
Sbjct: 6    EDYVTCQASFDANELATDHPPRWLHPNKKLVYALACPRGADSHRGTLHYSRYKEIPLPSL 65

Query: 231  FKGESDNHQGGKEMEIEVKNDVFDY----SSPEDDNTVNWYINFADKHVFGYYGGNLFAQ 398
            +  +    Q   + ++E++ D F Y       E    V+WY NFA  H+F  Y G L+AQ
Sbjct: 66   YVPDDAMKQ---KTQLEMREDAFTYEPTAQEEEGRPVVSWYKNFAHSHLFIAYAGGLYAQ 122

Query: 399  DESQCLEHPALCSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDV 578
            DE Q  EHP L SLR+ L    ++ RF   RP+ T+G+    TPIL+RGVER+I VKTD 
Sbjct: 123  DEMQVAEHPILASLREALT-SSRDKRF---RPLTTEGNE--PTPILIRGVERRIFVKTDR 176

Query: 579  NKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYT 758
            N    RP+GLYG  FA AS  AI+ A+ +           P  SNI+A+EAP  G G Y 
Sbjct: 177  NPGAGRPYGLYGGAFAAASEAAIRNASVVLKP--------PTISNILALEAPPGGRGAYR 228

Query: 759  FGTIKKILATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTG 938
            +                       EKG                    ES P P V+IHTG
Sbjct: 229  Y-----------------------EKGF-------------------ESAP-PFVVIHTG 245

Query: 939  NWGCGAYGGNITIMSCLQIAAAHLAGIDKIIYHAMGNQ--DECNSGLQVYNE-LIKDVEG 1109
            NWG GAYGGN  +M+ LQ+ AA LAG+DK++YH    Q  D    G+ + +E L+ D EG
Sbjct: 246  NWGTGAYGGNKVLMALLQVLAARLAGVDKLVYHTFERQSSDAYREGVALLDERLVADSEG 305

Query: 1110 DVK---IDDFIKNVELKDFQWGLSNG 1178
              K   +D+ I  +    F+WGLS+G
Sbjct: 306  QQKQIAVDELIGKLTALQFRWGLSDG 331


>ref|WP_096830388.1| hypothetical protein [Tychonema bourrellyi]
 gb|PHX54006.1| hypothetical protein CP500_018375 [Tychonema bourrellyi FEM_GT703]
          Length = 340

 Score =  241 bits (615), Expect = 2e-72
 Identities = 143/373 (38%), Positives = 211/373 (56%), Gaps = 4/373 (1%)
 Frame = +3

Query: 72   ESFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKSFKGESDN 251
            ++F   +LI  +PPK+Y  NK+L+Y++  P G +  HG+L ++RW    LP+     S  
Sbjct: 16   QTFDTQDLINDHPPKFYNGNKQLVYDICCPPGCNH-HGQLAFSRWYAMVLPEYLS--SLE 72

Query: 252  HQGGKEMEIEVKNDVFDYSSPEDDNTVNWYINFADKHVFGYYGGNLFAQDESQCLEHPAL 431
            HQ     +I  +   F+Y   ED   + WY+NFA   +F  YG +LFAQDE Q  EHPAL
Sbjct: 73   HQ----TDISERKGYFEYEPSEDSTQMEWYLNFAHYELFFAYGSSLFAQDEMQVAEHPAL 128

Query: 432  CSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRPFGLY 611
             SLR+ L+  D + +  T         S   TPIL+RGVER+  + TD N  + RPFGLY
Sbjct: 129  SSLREALL--DSKIKSLTV-------ESQQPTPILIRGVERRCAISTDPNSEQGRPFGLY 179

Query: 612  GNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKILATA 791
            GN F  A+ +AI+ AT    K ++P    P  +NIIA+EAP  G G Y+   I+ IL TA
Sbjct: 180  GNNFGRATSDAIEQAT----KPLNP----PTITNIIAMEAPAGGRGYYSMAQIEYILQTA 231

Query: 792  YSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPK-PRVIIHTGNWGCGAYGGN 968
            ++GF AA+ ES +E                        +P+ P ++IHTG WGCGAYGGN
Sbjct: 232  FTGFSAARIESQLE------------------------LPQAPSLMIHTGFWGCGAYGGN 267

Query: 969  ITIMSCLQIAAAHLAGIDKIIYH---AMGNQDECNSGLQVYNELIKDVEGDVKIDDFIKN 1139
              +M+ LQ+ +A L+ ++ +++H   AMG+QD   +   +  +L+ D +  VK+ D ++ 
Sbjct: 268  RVLMALLQLLSARLSQVNCLVFHTSDAMGSQDLATAQQILDRDLVPD-DSPVKVSDLVEK 326

Query: 1140 VELKDFQWGLSNG 1178
            +   +FQWG S+G
Sbjct: 327  IHAMEFQWGFSDG 339


>ref|WP_038073215.1| hypothetical protein [Tolypothrix bouteillei]
 gb|KIE11963.1| hypothetical protein DA73_0210005 [Tolypothrix bouteillei VB521301]
          Length = 338

 Score =  229 bits (583), Expect = 1e-67
 Identities = 142/385 (36%), Positives = 205/385 (53%), Gaps = 5/385 (1%)
 Frame = +3

Query: 39   LKVIKNHLTHHESFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFP 218
            +    ++L     F    L+  +PPK   KNKK++Y++A P G     G++ ++RW +  
Sbjct: 1    MNAASDNLICRHKFTTQSLVDTHPPKLKNKNKKIVYQIACPPGC-IHSGEIVFSRWRKIT 59

Query: 219  LPKSFKGESDNHQGGKEMEIEVKNDVFDYS-SPEDDNTVNWYINFADKHVFGYYGGNLFA 395
            LP +    SD        E E +   F+Y  S E ++ V WY+NFA   +F  Y G+L A
Sbjct: 60   LPVNLSSSSDR------TEFEERYGYFEYEPSHEKNDEVEWYLNFAHCDLFCAYSGSLLA 113

Query: 396  QDESQCLEHPALCSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTD 575
            QDE Q  EHPAL SLR+ L+    +       P   +      TPIL+RGVER+  + TD
Sbjct: 114  QDEMQVAEHPALGSLREALLDAGID-------PFTVEAGE--PTPILIRGVERRCAIATD 164

Query: 576  VNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNY 755
             +    RP+GLYGN FA A+ EAI+LAT    K ++P    P  +NIIA+EAP +G+G Y
Sbjct: 165  ASVENARPYGLYGNNFARATAEAIELAT----KPLNP----PTVTNIIAMEAPSNGYGVY 216

Query: 756  TFGTIKKILATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHT 935
            T   I+ IL TA++GF  AK ES                         ES  +  VIIHT
Sbjct: 217  TQQEIRYILDTAFTGFATAKVESCF-----------------------ESAQEQFVIIHT 253

Query: 936  GNWGCGAYGGNITIMSCLQIAAAHLAGIDKIIYH----AMGNQDECNSGLQVYNELIKDV 1103
            G WGCGAYGGN  +M+ LQ+ AA LA ++++++H    A   QD   +   + NE +  V
Sbjct: 254  GFWGCGAYGGNRILMALLQLLAARLAQVNRLVFHTGVDAKSAQDFA-TAQHILNENLAPV 312

Query: 1104 EGDVKIDDFIKNVELKDFQWGLSNG 1178
              +V++   +  ++   FQWG+S+G
Sbjct: 313  GSNVEVSTLLVKIQALGFQWGISDG 337


>ref|WP_069966748.1| hypothetical protein [Desertifilum sp. IPPAS B-1220]
 gb|OEJ75663.1| hypothetical protein BH720_08465 [Desertifilum sp. IPPAS B-1220]
          Length = 332

 Score =  227 bits (579), Expect = 4e-67
 Identities = 142/385 (36%), Positives = 206/385 (53%), Gaps = 3/385 (0%)
 Frame = +3

Query: 33   QILKVIKNHLTHHESFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEE 212
            ++ +++ N +  H +F   EL++  PPK Y  NK++IY++A P G+    G L ++RW  
Sbjct: 3    KVRELLDNLICRH-AFNTQELVETYPPKLYHPNKRVIYDIACPPGS-VHRGTLCFSRWRG 60

Query: 213  FPLPKSFKGESDNHQGGKEMEIEVKNDVFDYSSPEDDNTVNWYINFADKHVFGYYGGNLF 392
              LP+             E  +E  +  F+Y    D   + WY+NFAD  +F  YG  LF
Sbjct: 61   MKLPEGLPASG-------ETVLEEYSGYFEYEPSPDSQEMEWYLNFADLDLFYAYGSPLF 113

Query: 393  AQDESQCLEHPALCSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKT 572
            AQDE Q  EHPAL SLR+ L+    E    TA     KG     TP+LVRGVER+ ++ T
Sbjct: 114  AQDEMQVAEHPALASLREALLAN--EINLFTAE----KGGP---TPVLVRGVERRCEIAT 164

Query: 573  DVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGN 752
            + + +++RPFGLYGN FA A+ EAI LAT    K ++P    P  +NI+AI AP    G+
Sbjct: 165  NPDASQQRPFGLYGNHFAGATTEAIALAT----KPLNP----PTITNILAISAPSCRSGS 216

Query: 753  YTFGTIKKILATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIH 932
            YT   I+ IL TA++GF AA+ +S                              P V IH
Sbjct: 217  YTQKQIEHILTTAFTGFTAARLDS----------------------------EAPLVAIH 248

Query: 933  TGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIIYHA---MGNQDECNSGLQVYNELIKDV 1103
            TG WGCGA+GGN  +M+ LQ+ AAHLA ++++I+H     G++    +    YN +  + 
Sbjct: 249  TGFWGCGAFGGNRVLMALLQLLAAHLAQVNRLIFHTSDRSGSEALATAQDLFYNAIAPN- 307

Query: 1104 EGDVKIDDFIKNVELKDFQWGLSNG 1178
               + + D I  +   DFQWG+S+G
Sbjct: 308  -SSLSVSDLITQIHAMDFQWGVSDG 331


>gb|KHD05220.1| hypothetical protein OT06_49285 [Candidatus Thiomargarita nelsonii]
          Length = 325

 Score =  216 bits (551), Expect = 5e-63
 Identities = 142/372 (38%), Positives = 199/372 (53%), Gaps = 5/372 (1%)
 Frame = +3

Query: 78   FIANELIKQNPPKWYCKNKKLIYEMATPEGADSFH-GKLEYTRWEEFPLPKSFKGESDNH 254
            F   +L+ + PP  Y  NKK++Y++A P G  S H G+L  +RW    LP  +K    N 
Sbjct: 14   FETQKLVDEFPPNLYDSNKKIVYQIACPPG--SVHSGQLVLSRWNLMRLP--YKVSFKNT 69

Query: 255  QGGKEMEIEVKNDVFDYSSPEDDNTVNWYINFADKHVFGYYGGNLFAQDESQCLEHPALC 434
            Q    +  E + D F Y    D + V WY+NFA   +F  YGG LFAQDE Q  EHPAL 
Sbjct: 70   Q----VVFEGREDYFGYE--RDTSVVEWYLNFAHYDLFCAYGGGLFAQDEMQVAEHPALG 123

Query: 435  SLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRPFGLYG 614
            SLR+ L+    E       P+  +      TPIL++GVER+  V T+ N +++R FGLYG
Sbjct: 124  SLREALLASGIE-------PLTVENGK--PTPILIKGVERRCAVSTEPNPSQQRHFGLYG 174

Query: 615  NQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKILATAY 794
            N FA AS EAIKLATT     ++P    P  +N+IA+EAP  G   YT   I+ IL TA+
Sbjct: 175  NNFAQASEEAIKLATT----PLNP----PTLTNLIAMEAPACGRDFYTQDEIEYILRTAF 226

Query: 795  SGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNIT 974
            +GF AAK E+  ++                             +IHTG WGCGAYGGN  
Sbjct: 227  TGFSAAKIETKADE----------------------------TVIHTGFWGCGAYGGNRV 258

Query: 975  IMSCLQIAAAHLAGIDKIIYHAMGNQDECNSGLQVYNELIKDVEGDV----KIDDFIKNV 1142
            +MS LQ+ AA ++ +D++++H        +SGL+ +    + +E D+     ID  I  +
Sbjct: 259  LMSLLQLIAAVMSQVDRLVFHT------GSSGLEDFQRACRILEEDLASLPNIDSVINKL 312

Query: 1143 ELKDFQWGLSNG 1178
                F+WG+S+G
Sbjct: 313  TEMKFEWGISDG 324


>gb|KUK49216.1| Uncharacterized protein XD74_0174 [Actinobacteria bacterium 66_15]
          Length = 332

 Score =  213 bits (543), Expect = 1e-61
 Identities = 137/371 (36%), Positives = 197/371 (53%), Gaps = 2/371 (0%)
 Frame = +3

Query: 75   SFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNH 254
            +F    L+ ++PP  +  NK++++E+A  EG++   G++ YT+W  F LP     E  + 
Sbjct: 11   TFDVATLMAEHPPLIHHPNKRVVFEIACGEGSEC-SGEIGYTQWPAFSLP-----ERVDP 64

Query: 255  QGGKEMEIEVKNDVFDYSSPED-DNTVNWYINFADKHVFGYYGGNLFAQDESQCLEHPAL 431
              G +  +E +  + DY   ED    V W++NFAD+ +F  YG  LFAQDE QC EHP L
Sbjct: 65   TAGLDA-LESRCGIMDYEPVEDFPGAVEWHVNFADQMLFFAYGSGLFAQDEMQCAEHPVL 123

Query: 432  CSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRPFGLY 611
             +L + L    + +   TA            TP+LV GVER+++VKT++N  + RP GLY
Sbjct: 124  GALVEALRADGRRAVTETADG---------PTPVLVTGVERRVKVKTNMNAKKGRPRGLY 174

Query: 612  GNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKILATA 791
            GN+FA ASPEA++ AT    KRIDP    P  +NIIA+ AP + +G Y    I++IL TA
Sbjct: 175  GNEFAVASPEAVQRAT----KRIDP----PTITNIIAMAAPTYRNGRYERSMIERILVTA 226

Query: 792  YSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNI 971
            Y+GF AA  ES                            P   V IHTG WGCGA+GGN 
Sbjct: 227  YTGFAAAVAES------------------------RRMAPGAPVAIHTGYWGCGAFGGNR 262

Query: 972  TIMSCLQIAAAHLAGIDKIIYHAMGNQD-ECNSGLQVYNELIKDVEGDVKIDDFIKNVEL 1148
             +MS LQ+ AA +A +  + +H    +D       ++  E +   E  +  DD I  ++ 
Sbjct: 263  VLMSLLQLLAAGMAEVTCLAFHTANAEDAPLVEATRIITEDLSSGE-SLSADDLIDRIDA 321

Query: 1149 KDFQWGLSNGT 1181
              F+WG S GT
Sbjct: 322  MAFEWGRSEGT 332


>ref|WP_106301821.1| hypothetical protein [Chamaesiphon polymorphus]
 gb|PSB57937.1| hypothetical protein C7B77_06640 [Chamaesiphon polymorphus CCALA 037]
          Length = 355

 Score =  208 bits (529), Expect = 2e-59
 Identities = 141/395 (35%), Positives = 200/395 (50%), Gaps = 7/395 (1%)
 Frame = +3

Query: 15   FIMELHQILKVIKN---HLTHHESFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFH- 182
            F  E+  +L  + N   +L    +F A +L+  +P K    NKK+++ +A P   +S H 
Sbjct: 11   FFWEIAYLLDRMNNSSDNLICRHTFNAQQLVDSHPAKIRNANKKIVHRIACPP--NSIHQ 68

Query: 183  GKLEYTRWEEFPLPKSFKGESDNHQGGKEMEIEVKNDVFDYS-SPEDDNTVNWYINFADK 359
            G++ ++RW    L +     +D        EI+ +   F Y  S   D  V WY+NFA  
Sbjct: 69   GEIVFSRWRSIELAEISPSLTDR------TEIQEQKSYFKYPRSQHRDRLVEWYLNFAHS 122

Query: 360  HVFGYYGGNLFAQDESQCLEHPALCSLRDYLVKGDQESRFTTARPILTKGSSFISTPILV 539
             +F  YG  +FAQDE Q  EHP L SLR+ L+    +  FT  R           TPIL+
Sbjct: 123  DLFCAYGERVFAQDEMQVAEHPVLASLREALLDAKIDP-FTVERGE--------PTPILI 173

Query: 540  RGVERKIQVKTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNII 719
            RG+ER+ ++ T+++  + RP GLYGN FA A   AI+LATT     IDP    P  +NII
Sbjct: 174  RGIERRCEIATNIDSEQGRPLGLYGNNFAKAPAAAIELATT----PIDP----PTITNII 225

Query: 720  AIEAPKHGHGNYTFGTIKKILATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEE 899
            A+EAP  G+  Y +  I+ IL TA +GF AAK ES +E                      
Sbjct: 226  AMEAPSGGYNFYEYDIIEFILTTAVTGFTAAKIESQLE---------------------- 263

Query: 900  ESVPKPRVIIHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIIYHAMGNQDECNSGLQV 1079
              +  P V IHTG WGCGAYGGN  +M+ LQ+ AA LA IDK+++H     D     L  
Sbjct: 264  --IASPIVSIHTGFWGCGAYGGNRILMALLQLLAARLAQIDKLVFHT--TDDAGAKALAT 319

Query: 1080 YNELI--KDVEGDVKIDDFIKNVELKDFQWGLSNG 1178
               +I  + V  +  I   +  +  K F+WG+ +G
Sbjct: 320  ARSIIDRELVIAEASIPQILDKIYAKAFKWGIGDG 354


>gb|APR82132.1| Hypothetical protein A7982_07481 [Minicystis rosea]
          Length = 337

 Score =  207 bits (527), Expect = 3e-59
 Identities = 138/375 (36%), Positives = 198/375 (52%), Gaps = 7/375 (1%)
 Frame = +3

Query: 75   SFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNH 254
            +F A  L+  +PP++  K+K+L+Y +++P       G L ++R    PL         +H
Sbjct: 13   TFDAAALVAAHPPRFTHKHKQLVYALSSPPSRPP-QGALVFSRHHAMPL--------GDH 63

Query: 255  QGGKEMEIEVKNDVFDY-----SSPEDDNTVNWYINFADKHVFGYYGGNLFAQDESQCLE 419
                   +E++ DVF Y     SSP    TV WY+NFAD  +F  YGG+L AQDE Q LE
Sbjct: 64   LPAAAPTVEMREDVFGYEPLPKSSPP---TVEWYLNFADPQLFVAYGGSLLAQDELQVLE 120

Query: 420  HPALCSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRP 599
            HPAL SL ++L +   + RF    P+   G +  +TP+LVRGVER+    TD +  E RP
Sbjct: 121  HPALGSLCEHL-RASPDPRFA---PLTHDGDA--ATPVLVRGVERRCAFATDPDLLEGRP 174

Query: 600  FGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKI 779
             GLYGN+FA AS +AI+ A T+ D         P  SNI+A+ AP  G G Y+   I+ I
Sbjct: 175  LGLYGNRFARASEDAIRRAVTVLDP--------PTLSNILAMAAPPGGTGAYSIDEIRSI 226

Query: 780  LATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGAY 959
            L TA +GF AA+ ES                        + +     V+IHTG+WG GA+
Sbjct: 227  LTTATTGFSAARIES------------------------DLAAKGAAVVIHTGHWGTGAF 262

Query: 960  GGNITIMSCLQIAAAHLAGIDKIIYHAMGN--QDECNSGLQVYNELIKDVEGDVKIDDFI 1133
            GGN  +M+ LQ+ AA LA ID+++YH   +   D    G +   +L+ D    + + D I
Sbjct: 263  GGNKVLMTILQLLAARLARIDRLVYHTFDSTGSDAFQEGAKRLAKLLPD-GASMPVADMI 321

Query: 1134 KNVELKDFQWGLSNG 1178
            + +    F WG S+G
Sbjct: 322  QKLFRIGFVWGESDG 336


>gb|PRP77223.1| hypothetical protein PROFUN_15117 [Planoprotostelium fungivorum]
          Length = 344

 Score =  200 bits (509), Expect = 1e-56
 Identities = 130/377 (34%), Positives = 194/377 (51%), Gaps = 11/377 (2%)
 Frame = +3

Query: 84   ANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNHQGG 263
            + +L K   PK    NK+L+ +M   EG +   G L + RW     P S   + +  +G 
Sbjct: 19   SKKLWKNFRPKMQSSNKRLLLDMIDKEG-EKPQGDLIFERWSIIQ-PSSLSLDPERLKG- 75

Query: 264  KEMEIEVKNDVFDYSSPEDDNTVNWYINFADKHVFGYYGGNLFAQDESQCLEHPALCSLR 443
              + +E   D++ Y   +  +T ++Y+NFAD  +FG+YGG LFAQDE Q  EHP L SLR
Sbjct: 76   --LIVEETLDIYRY---QQTDTEDYYVNFADASLFGFYGGPLFAQDEHQVAEHPILGSLR 130

Query: 444  DYL-VKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRPFGLYGNQ 620
             +L ++   E++   A P    G +  +TP L+   +R + ++T  +  + R   +YGN 
Sbjct: 131  RWLDLEAASETKNKEAIPWTKIGDN--ATPCLIFNAQRSLVIETQADPTKGRQ-SIYGNS 187

Query: 621  FAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKILATAYSG 800
            F++ASP  I+ ATT+  K      G   ++N IAIEAPKHGHG Y    I+ I  TA+SG
Sbjct: 188  FSYASPATIRAATTVITKETAELNGLRSHNNFIAIEAPKHGHGTYDRSEIEYIFFTAFSG 247

Query: 801  FLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNITIM 980
            F AA+  S                               + +IHTGNWGCGA+GGN +IM
Sbjct: 248  FEAARLHS-----------------------------GDKTVIHTGNWGCGAFGGNGSIM 278

Query: 981  SCLQIAAAHLAGIDKIIYHAMGNQDECNSGLQVYNELIKDVEGDVKIDD----------F 1130
            + LQIAAA ++G+ KI+YH            Q +  L +  EG  K+ D           
Sbjct: 279  AMLQIAAAAMSGVKKIVYHTFD---------QKHTRLFR--EGQKKLQDLWNSRRDLHAL 327

Query: 1131 IKNVELKDFQWGLSNGT 1181
            +  ++ +++QWG+ NGT
Sbjct: 328  LAAIQEEEYQWGVGNGT 344


>ref|WP_050430843.1| hypothetical protein [Chondromyces crocatus]
 gb|AKT38636.1| uncharacterized protein CMC5_027830 [Chondromyces crocatus]
          Length = 337

 Score =  196 bits (499), Expect = 3e-55
 Identities = 134/374 (35%), Positives = 190/374 (50%), Gaps = 6/374 (1%)
 Frame = +3

Query: 75   SFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNH 254
            SF A+EL+  +PP+    NK++I+++A P G     G L  +RW   P+P      +   
Sbjct: 13   SFDAHELVTSHPPRLANPNKQVIHDIACPPGT-KHGGTLVVSRWWALPVPAQLPSHTP-- 69

Query: 255  QGGKEMEIEVKNDVFDYSSPEDDNT---VNWYINFADKHVFGYYGGNLFAQDESQCLEHP 425
                  E  +    F Y       T   + WY+NFAD ++F  YGG LFAQDE Q  EHP
Sbjct: 70   ------EFVLDRSFFTYEPETASGTAPQMAWYVNFADVNLFFGYGGPLFAQDELQTAEHP 123

Query: 426  ALCSLRDYL-VKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRPF 602
            AL SLR+ L V  D    F  AR    K    + TP+L+RGVER+  +  D       P 
Sbjct: 124  ALGSLREALKVSADP---FVKARTRENK----VPTPVLIRGVERRCAI--DTLHPAALPD 174

Query: 603  GLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKIL 782
            GLYGN+FA A+ + I+ AT    + +DP    P  SN+IA+EA     G YT   I  ++
Sbjct: 175  GLYGNRFARATADVIRKAT----RPLDP----PTVSNLIAMEAIPGASGRYTAEQIADVV 226

Query: 783  ATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGAYG 962
             TA++GF AA+ ES +  G  E                      P+V IHTG+WG GA+G
Sbjct: 227  QTAFTGFTAARLESQLATGHAE----------------------PQVTIHTGHWGTGAFG 264

Query: 963  GNITIMSCLQIAAAHLAGIDKIIYHAMGNQDE--CNSGLQVYNELIKDVEGDVKIDDFIK 1136
            GN  +M+CLQ+ AA LAG+ ++++H +  Q E  C   L +  E +  +       + + 
Sbjct: 265  GNKVLMACLQMLAARLAGLSRLVFHTVDAQGEAACREALGLLEERL--LPAHASTHELLG 322

Query: 1137 NVELKDFQWGLSNG 1178
             +E   F WGLS+G
Sbjct: 323  ALESMGFGWGLSDG 336


>dbj|GBD32688.1| hypothetical protein HRbin33_01662 [bacterium HR33]
          Length = 339

 Score =  196 bits (499), Expect = 4e-55
 Identities = 127/377 (33%), Positives = 192/377 (50%), Gaps = 4/377 (1%)
 Frame = +3

Query: 60   LTHHESFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKSFKG 239
            L    SF  + L+ ++PP W+  NK L++E+A P G+  + G + Y+RW    +P     
Sbjct: 11   LIERASFETSRLMAEHPPVWHHPNKALVFEIACPSGS-VYRGTVRYSRWRGL-VPGCLWD 68

Query: 240  ESDNHQGGKEMEIEVKNDVFDYSSPED-DNTVNWYINFADKHVFGYYGGNLFAQDESQCL 416
             +          +  K   +DYS   D   +V W++NFAD H+F  YG  LFAQDE Q  
Sbjct: 69   AA-----AALRRVRSKAGFYDYSEQSDLPGSVEWHVNFADPHLFVAYGSGLFAQDEMQVA 123

Query: 417  EHPALCSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKR 596
            EHPAL +LR+ L+          + P+  +      TP+LV GVER+ ++ T+ +    R
Sbjct: 124  EHPALGALREALLA-------RGSLPLTVEAGG--PTPVLVMGVERRCRIATEPDPLAGR 174

Query: 597  PFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKK 776
            P GLYGN+FA A P+ ++ AT     R+DP    P  SNIIA+ +   G G Y    ++ 
Sbjct: 175  PHGLYGNRFAAAPPDVVRRATV----RLDP----PTISNIIAMASLPGGDGRYAPEEVRY 226

Query: 777  ILATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGA 956
            +L+TAYSGF AA  ES  + G                       P   V++HTG WGCGA
Sbjct: 227  LLSTAYSGFRAAVLESHRDGG-----------------------PAVPVVVHTGFWGCGA 263

Query: 957  YGGNITIMSCLQIAAAHLAGIDKIIYHAMGNQDECNSGLQVYNELIKDV---EGDVKIDD 1127
            +GGN  +M+ +QI AA  AG++++++H      E +  L+    L+ +    E  +  D 
Sbjct: 264  FGGNRVLMALIQILAAGAAGVERLVFHTGDPAGEIS--LEQARALLAEKLGGEQPLSTDA 321

Query: 1128 FIKNVELKDFQWGLSNG 1178
             +  +    F WG+S+G
Sbjct: 322  LVARLVGLGFTWGVSDG 338


>ref|WP_082838724.1| hypothetical protein [Gemmata sp. SH-PL17]
 gb|AMV23751.1| Poly (ADP-ribose) glycohydrolase (PARG) [Gemmata sp. SH-PL17]
          Length = 333

 Score =  193 bits (491), Expect = 5e-54
 Identities = 130/371 (35%), Positives = 187/371 (50%), Gaps = 4/371 (1%)
 Frame = +3

Query: 78   FIANELIKQNPPKWYCKNKKLIYEMATPEGADSFH-GKLEYTRWEEFPLPKSFKGESDNH 254
            F A  L+ + PP++   NKK++Y ++ P   D+ H G++ ++RW     P      +   
Sbjct: 13   FDAVALVAEFPPRFSHPNKKVVYGISCPP--DAVHSGRVTFSRWAAVAPPSEVPQNATT- 69

Query: 255  QGGKEMEIEVKNDVFDYSS-PEDDNTVNWYINFADKHVFGYYGGNLFAQDESQCLEHPAL 431
                   IE + D F Y   P     V WY+NF+   +F  YGG LFAQDE Q  EHPAL
Sbjct: 70   -------IEPREDYFGYEPVPAGLGRVEWYLNFSHYDLFCAYGGGLFAQDEMQVTEHPAL 122

Query: 432  CSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRPFGLY 611
             SLR+ L++   E       P+  K  S   TP+LV GVER+ +V  + +    RPFGLY
Sbjct: 123  GSLREALLQSGVE-------PLTVKDRS--PTPVLVTGVERRCRVAINPDAALGRPFGLY 173

Query: 612  GNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKILATA 791
            GN FA A P+ I  AT    + + P    P  +N++A+EAP  G G YT   I+ +L TA
Sbjct: 174  GNNFARAKPDVIARAT----EALVP----PTITNVLAMEAPTGGSGRYTRSAIEYVLRTA 225

Query: 792  YSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNI 971
            ++GFLAA+ ES    G L  +                    P V++HTG WGCGAYGG+ 
Sbjct: 226  HTGFLAARIES----GRLSAS--------------------PEVVVHTGYWGCGAYGGHR 261

Query: 972  TIMSCLQIAAAHLAGIDKIIYHA--MGNQDECNSGLQVYNELIKDVEGDVKIDDFIKNVE 1145
            T+M+ LQI AA  A +D++++H           + L V    +   E    +   I  ++
Sbjct: 262  TLMALLQILAARTAQLDRLVFHTGDAAGSATLRNALVVSERDLGLRETPTPLAAVIDKLD 321

Query: 1146 LKDFQWGLSNG 1178
               F WG+ +G
Sbjct: 322  AMAFHWGVGDG 332


>ref|XP_004345553.2| tyrosyl-tRNA synthetase [Capsaspora owczarzaki ATCC 30864]
 gb|KJE95514.1| tyrosyl-tRNA synthetase [Capsaspora owczarzaki ATCC 30864]
          Length = 365

 Score =  194 bits (493), Expect = 5e-54
 Identities = 132/385 (34%), Positives = 186/385 (48%), Gaps = 19/385 (4%)
 Frame = +3

Query: 84   ANELIKQNPPKWYCKNKKLIYEMATPEGADS------------FHGKLEYTRW-----EE 212
            A +L+++ PP+   +NKKL++E+++  G  +            F G +  TRW     +E
Sbjct: 20   ARDLVRRCPPRLQARNKKLVFELSSESGECTLVPKASIANPAPFEGDVRVTRWRAPHHDE 79

Query: 213  FPLPKSFKGESDNHQGGKEMEIEVKNDVFDYSSPEDDN--TVNWYINFADKHVFGYYGGN 386
             P    F        G  EM        +D  +   D    V WY+NFAD +VFG+YGG 
Sbjct: 80   MPATLGFDA------GTVEMSFPASIFAYDIPTTSQDGKPVVPWYVNFADSNVFGFYGGG 133

Query: 387  LFAQDESQCLEHPALCSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQV 566
            L+AQDE Q  EHP L S+R  L   D           LT  +    TPILV  V+R++ V
Sbjct: 134  LYAQDEMQVTEHPILGSVRQMLENLDLSKN--PKMKALTMETQ--PTPILVENVQRRVVV 189

Query: 567  KTDVNKNEKRPFGLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGH 746
              D   +   P GLYGN FA AS E I  AT + +         P  SNIIAI A  +G 
Sbjct: 190  --DTFPSAAAPGGLYGNAFASASFETIVQATHVLNP--------PTMSNIIAIAAQGYGF 239

Query: 747  GNYTFGTIKKILATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVI 926
            G Y    I     TAY+GF AA   S++  G                  +       +V+
Sbjct: 240  GEYALPVINFSFLTAYTGFAAAVASSWLRLG------------------KPADRKSFKVV 281

Query: 927  IHTGNWGCGAYGGNITIMSCLQIAAAHLAGIDKIIYHAMGNQDECNSGLQVYNELIKDVE 1106
            I+TGNWGCGA+GGN T+M+ +Q AAA  AG+D++IY  +      N   +++NEL+  + 
Sbjct: 282  INTGNWGCGAFGGNPTMMALIQFAAAQAAGVDELIYSTVMPSPAVNRAREIWNELVPTLR 341

Query: 1107 GDVKIDDFIKNVELKDFQWGLSNGT 1181
             D  +  ++   E    +WG+SNGT
Sbjct: 342  -DKPVGAWLGAFEKLRLRWGVSNGT 365


>ref|WP_006971973.1| hypothetical protein [Plesiocystis pacifica]
 gb|EDM78917.1| tyrosyl-tRNA synthetase [Plesiocystis pacifica SIR-1]
          Length = 322

 Score =  191 bits (486), Expect = 2e-53
 Identities = 124/368 (33%), Positives = 183/368 (49%), Gaps = 3/368 (0%)
 Frame = +3

Query: 84   ANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNHQGG 263
            A EL++ +PP W   NKK+I  ++ P  A+   G++  TRW    LP++    S      
Sbjct: 15   AAELVRSHPPVWRDANKKVIAALSCPADAEH-RGQIRVTRWRAGALPETLPESSP----- 68

Query: 264  KEMEIEVKNDVFDYSSPEDDNTVNWYINFADKHVFGYYGGNLFAQDESQCLEHPALCSLR 443
                +E   D+F Y+ P  D   +WY+NFAD+ +F  YG  L AQDE Q  EHPAL S+ 
Sbjct: 69   ---ALEAHADLFGYA-PAPDGETHWYLNFADRRLFIAYGSGLLAQDELQVAEHPALGSVA 124

Query: 444  DYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRPFGLYGNQF 623
            + +     +   T              TPILV GVER+  + T  + +  R +GLYG++F
Sbjct: 125  EAMAALPDQVPLTAEDE---------PTPILVAGVERRCVLDTAPDLDAGRVYGLYGHRF 175

Query: 624  AHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKILATAYSGF 803
              A+P+ ++ A T+ D         P  SNI+AIEAP    G YT   I+ IL TA +G+
Sbjct: 176  QRATPDEVRGAVTVLDP--------PTVSNILAIEAPTAYRGAYTAKQIRFILRTAVAGY 227

Query: 804  LAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNITIMS 983
             AA  ES                                ++IHTG WGCGAYGGN  +M+
Sbjct: 228  RAAALES-----------------------------AGALVIHTGFWGCGAYGGNRELMA 258

Query: 984  CLQIAAAHLAGIDKIIYHAMGNQDECNSGLQVY---NELIKDVEGDVKIDDFIKNVELKD 1154
             LQI AA +AG+ ++++HA  ++     GL +Y     +  ++ G   +D  I  +    
Sbjct: 259  LLQILAARIAGVSRLVFHAFDSE-----GLALYRAGEAVAAELAGLTSLDQAIAVIVELG 313

Query: 1155 FQWGLSNG 1178
            +QWG+S+G
Sbjct: 314  YQWGVSDG 321


>ref|XP_002677363.1| predicted protein [Naegleria gruberi]
 gb|EFC44619.1| predicted protein [Naegleria gruberi]
          Length = 358

 Score =  189 bits (480), Expect = 4e-52
 Identities = 127/377 (33%), Positives = 198/377 (52%), Gaps = 14/377 (3%)
 Frame = +3

Query: 90   ELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKSFKGESDNHQGGKE 269
            E+I++ PP ++ KNK  +Y + + +    + G L Y+R++    PK++K E    Q    
Sbjct: 25   EIIEKFPPSFFSKNKVFLYSLHSDQL--DYEGDLIYSRFKARTRPKAYKKEVAEDQ---V 79

Query: 270  MEIEVKNDVFDYSSP---EDDNTVNWYINFADKHVFGYYGGNLFAQDESQCLEHPALCSL 440
             E+ V +D F Y       ++N  +WY+NFAD+ +F  + G LFAQDE Q  EHP L SL
Sbjct: 80   CEVIVSSDGFKYDEKLKVGEENAKHWYLNFADERLFIAWKGQLFAQDEIQVCEHPILGSL 139

Query: 441  RDYLVKGD-QESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRPFGLYGN 617
             +YL K    ++R++   PI  + S    TP+L++ V+R+I V         + + +YGN
Sbjct: 140  CEYLRKESLNDARYS---PITQQTSP---TPVLIQNVDRRIAVNV-------KDYNIYGN 186

Query: 618  QFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKILATAYS 797
             FA AS + I+ ATT+ D + + K      SNI+AI AP+ GHG Y  G +  I  T YS
Sbjct: 187  NFAKASTDIIEQATTVLDMKTNRK------SNILAISAPRGGHGEYKLGEVNFIFDTLYS 240

Query: 798  GFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGAYGGNITI 977
            GF A   +                  T++   + E+   P V+IHTGN+GCGA+G N  +
Sbjct: 241  GFKACCMD------------------TEMYAQDPEN--PPTVVIHTGNFGCGAFGNNREL 280

Query: 978  MSCLQIAAAHLAGIDKIIYHAMGNQ--DECNSGLQVYNELI-------KDVEGD-VKIDD 1127
            ++ LQI AA +AGI  + YHA   +        ++V +E          D+EG+ V ++ 
Sbjct: 281  IAILQILAARMAGIKYLYYHAFSEEGVKSVKKAIKVIDEEFDLVCKDPSDIEGNLVSLNK 340

Query: 1128 FIKNVELKDFQWGLSNG 1178
                +  K ++WG S+G
Sbjct: 341  LFDMILQKGYKWGFSDG 357


>ref|WP_084207322.1| hypothetical protein [Sulfuritalea hydrogenivorans]
 dbj|BAO29669.1| hypothetical protein SUTH_01877 [Sulfuritalea hydrogenivorans sk43H]
          Length = 327

 Score =  185 bits (469), Expect = 7e-51
 Identities = 126/374 (33%), Positives = 180/374 (48%), Gaps = 2/374 (0%)
 Frame = +3

Query: 63   THHESFIANELIKQNPPKWYCKNKKLIYEMATPEGADSFHGKLEYTRWEEFPLPKSFKGE 242
            T  + + + ++ K   PK++  NK+ ++++A   G  + +G L Y+RW   PLP      
Sbjct: 6    TARKQWDSYQISKNMMPKFHHSNKEFLFKLAFSNGFSN-NGTLGYSRWSSRPLPTLLT-- 62

Query: 243  SDNHQGGKEMEIEVKNDVFDYSSPEDDNTVNWYINFADKHVFGYYGGNLFAQDESQCLEH 422
                +G  E E+  +   FDY          W++NFA+  +F  +  +L AQDE Q  EH
Sbjct: 63   ----EG--ETEVLQRPGFFDYEVSSSPQAAEWHMNFANNEIFSAWATSLLAQDELQVAEH 116

Query: 423  PALCSLRDYLVKGDQESRFTTARPILTKGSSFISTPILVRGVERKIQVKTDVNKNEKRPF 602
            PAL  +R   +K             L        TPIL+ GVER++ + T  N+    P 
Sbjct: 117  PALIGMRIEAMKEGIS---------LWSVEDCAPTPILITGVERRLSIDTSPNEGAGIPH 167

Query: 603  GLYGNQFAHASPEAIKLATTIYDKRIDPKTGNPYYSNIIAIEAPKHGHGNYTFGTIKKIL 782
            G+YGN F +AS   I  ATT+    I P T     SNI+AIEAP +G G YT  +I  IL
Sbjct: 168  GIYGNYFRNASESQIARATTV----ITPPTN----SNILAIEAPAYGSGRYTSNSIAFIL 219

Query: 783  ATAYSGFLAAKFESFVEKGILERNHEENQEHTKININEEESVPKPRVIIHTGNWGCGAYG 962
            +TAYSGF A   ES                        + S+    V IH+G WGCGAYG
Sbjct: 220  STAYSGFSAVLDES------------------------QLSLNASTVRIHSGFWGCGAYG 255

Query: 963  GNITIMSCLQIAAAHLAGIDKIIYHAMGNQDEC--NSGLQVYNELIKDVEGDVKIDDFIK 1136
            GN  +M  LQ+ AA LAGID+I +H              ++Y  + +   G++ +D  I 
Sbjct: 256  GNRVLMLLLQMVAARLAGIDQITFHTGDGSGSLPFRESYEIYQRIQR---GNLPVDQVID 312

Query: 1137 NVELKDFQWGLSNG 1178
             V    F+WG S+G
Sbjct: 313  LVADYHFEWGESDG 326

