BLASTX nr result

ID: Akebia27_contig00028506 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00028506
         (812 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_005651480.1| hypothetical protein COCSUDRAFT_59434 [Cocco...   260   5e-67
ref|XP_005651481.1| hypothetical protein COCSUDRAFT_59435 [Cocco...   253   8e-65
ref|XP_004984408.1| PREDICTED: desiccation-related protein PCC13...    97   5e-18
ref|XP_004984409.1| PREDICTED: desiccation-related protein PCC13...    93   1e-16
ref|WP_023461881.1| hypothetical protein [Asticcacaulis sp. YBE2...    92   2e-16
ref|WP_008712540.1| putative exported protein [Rhodococcus sp. A...    92   3e-16
ref|WP_003527020.1| hypothetical protein [Sinorhizobium meliloti...    91   4e-16
ref|XP_005651445.1| hypothetical protein COCSUDRAFT_59405 [Cocco...    91   7e-16
ref|XP_002465286.1| hypothetical protein SORBIDRAFT_01g035570 [S...    90   9e-16
ref|NP_001142402.1| hypothetical protein precursor [Zea mays] gi...    90   1e-15
ref|YP_007686488.1| conserved secreted protein [Clavibacter mich...    89   1e-15
ref|YP_006814614.1| hypothetical protein BN406_04525 [Sinorhizob...    89   1e-15
ref|YP_001711098.1| hypothetical protein CMS_2441 [Clavibacter m...    89   2e-15
ref|WP_023721731.1| hypothetical protein [Mesorhizobium sp. LSHC...    89   2e-15
ref|NP_436035.2| hypothetical protein SMa1445 [Sinorhizobium mel...    89   2e-15
ref|WP_022887404.1| hypothetical protein [Glaciibacter superstes]      88   3e-15
ref|YP_007192908.1| hypothetical protein C770_GR4pC0588 [Sinorhi...    88   3e-15
ref|WP_018096307.1| hypothetical protein [Sinorhizobium meliloti]      87   6e-15
gb|EAY89979.1| hypothetical protein OsI_11540 [Oryza sativa Indi...    87   7e-15
ref|YP_007951538.1| hypothetical protein L083_3548 [Actinoplanes...    87   9e-15

>ref|XP_005651480.1| hypothetical protein COCSUDRAFT_59434 [Coccomyxa subellipsoidea
           C-169] gi|384253461|gb|EIE26936.1| hypothetical protein
           COCSUDRAFT_59434 [Coccomyxa subellipsoidea C-169]
          Length = 437

 Score =  260 bits (664), Expect = 5e-67
 Identities = 132/253 (52%), Positives = 167/253 (66%), Gaps = 2/253 (0%)
 Frame = +2

Query: 59  GLRKANLSPRTKALLEEVALSEQGHALYTRQAGSKIPCPYVDYDAGFNAVFARAYGLKDG 238
           G RKANLS      ++EVAL+EQGHAL+TRQAGS +PCP +D+  GFN  F  AY L   
Sbjct: 104 GARKANLSDEVLPFMQEVALNEQGHALFTRQAGSDLPCPAIDFTGGFNKYFGAAYNLTGN 163

Query: 239 ETISKLFGKDWDPYVNDATFALSMVFLEELGATGNKGLALLHTNPVXXXXXXXXXXXXXX 418
           ETI   FG  +DP+ ND  + LS++ LEELGATGNKGL  L TNPV              
Sbjct: 164 ETIESKFGAPFDPFANDENYLLSVLSLEELGATGNKGLTGLLTNPVLANAVAGLATSATG 223

Query: 419 XXXIERSILFDLKDEIVPPTNETVTQVFARLSAYRDSMDGPQIDDQGLLNKDPRFISVPG 598
              ++R +L+  ++  V P NETV QVFAR+SA RDS+DGP +DDQGL+N D R I+VP 
Sbjct: 224 QATVQRMLLWQRRNNTVYPFNETVQQVFARISALRDSLDGPPVDDQGLVNTDSRTIAVPQ 283

Query: 599 SFVNNIPTDIRGISFSRTPLMNLNILTLGAKDGKGGFFPEGIAGRINTPEGYDKLADGIE 778
            +VN IPTD+RG++FSRTP   +NI+TLG+ DGKG FFP G+ G IN P GY++ A G++
Sbjct: 284 YYVNMIPTDVRGLTFSRTPQQIINIVTLGSLDGKGVFFPNGLGGAINKPTGYNETASGLD 343

Query: 779 D--TKGKPEAIQL 811
                G   A QL
Sbjct: 344 SFPASGAKVATQL 356


>ref|XP_005651481.1| hypothetical protein COCSUDRAFT_59435 [Coccomyxa subellipsoidea
           C-169] gi|384253462|gb|EIE26937.1| hypothetical protein
           COCSUDRAFT_59435 [Coccomyxa subellipsoidea C-169]
          Length = 387

 Score =  253 bits (645), Expect = 8e-65
 Identities = 127/234 (54%), Positives = 161/234 (68%)
 Frame = +2

Query: 59  GLRKANLSPRTKALLEEVALSEQGHALYTRQAGSKIPCPYVDYDAGFNAVFARAYGLKDG 238
           G RKANLS      ++EVAL+EQGHAL+TRQAGS +PCP +D+  GFN  F  AY L  G
Sbjct: 58  GARKANLSDAVLLYMQEVALNEQGHALFTRQAGSDLPCPPIDFTGGFNKYFGAAYNLTGG 117

Query: 239 ETISKLFGKDWDPYVNDATFALSMVFLEELGATGNKGLALLHTNPVXXXXXXXXXXXXXX 418
            TI   FG  +DP+ ND  F LS++ LEELGATGNKGL  L  NPV              
Sbjct: 118 RTIESEFGTPFDPFANDENFLLSVLSLEELGATGNKGLVGLLGNPVIANGVAGLATSATA 177

Query: 419 XXXIERSILFDLKDEIVPPTNETVTQVFARLSAYRDSMDGPQIDDQGLLNKDPRFISVPG 598
              ++R +L+  ++ IV P NETV QVFAR+SA RDS+DGPQIDDQGL N DPR+I+VP 
Sbjct: 178 QATVQRVLLWQRRNNIVRPFNETVQQVFARISALRDSLDGPQIDDQGLQNTDPRYIAVPA 237

Query: 599 SFVNNIPTDIRGISFSRTPLMNLNILTLGAKDGKGGFFPEGIAGRINTPEGYDK 760
           +++N IPTDIRG++FSR+P   +NI+TLG+  GKG FFPEG+ G I TP  +++
Sbjct: 238 NYINIIPTDIRGLTFSRSPEQVINIVTLGSPVGKGVFFPEGLLGAIVTPFSFNE 291


>ref|XP_004984408.1| PREDICTED: desiccation-related protein PCC13-62-like [Setaria
           italica]
          Length = 336

 Score = 97.4 bits (241), Expect = 5e-18
 Identities = 77/237 (32%), Positives = 106/237 (44%), Gaps = 8/237 (3%)
 Frame = +2

Query: 50  PSKGLRKANLSPRTKALLEEVALSEQGHALYTRQAGSKIPCPYVDYDA-GFNAVFARAYG 226
           PS G RKANL   T  ++ E  L E GH    ++    IP P +D  A  F  V   A+G
Sbjct: 108 PSVGARKANLDEVTWPIIAEFGLQEVGHVRSIQRTVGGIPRPLIDLSAHNFARVMDEAFG 167

Query: 227 LKDGETISKLFGKDWDPYVNDATFALSMVFLEELGATGNKGLALLHTNPVXXXXXXXXXX 406
            K            +DPY+N   F L+   +  LG  G  G     TNP+          
Sbjct: 168 YK--------LDPPFDPYINSLNFLLASYVIPYLGLNGYVG-----TNPIIDGYETKKLL 214

Query: 407 XXXXXXXIE-----RSILFDLKDEIVPPTNETVTQVFARLSAYRDSMDGPQIDDQGLLNK 571
                         R++LF  +DE VPP N TV +   R+SA R+ +    + D+GL   
Sbjct: 215 AGLLGVESGQDAAFRTLLFGRRDEAVPPYNVTVAEFTDRVSALRNRLGRCGVKDEGL--T 272

Query: 572 DPRFISVPGSFVNNI-PTDIRGISFSRTPLMNLNILTL-GAKDGKGGFFPEGIAGRI 736
            PR +   G+   N+   D   +S+SRTP   L IL L G +   GGF+P+G  G+I
Sbjct: 273 VPRELGAEGAICTNVLSADRDSLSYSRTPAELLRILYLTGDEHVPGGFYPDGANGKI 329


>ref|XP_004984409.1| PREDICTED: desiccation-related protein PCC13-62-like [Setaria
           italica]
          Length = 347

 Score = 92.8 bits (229), Expect = 1e-16
 Identities = 75/243 (30%), Positives = 105/243 (43%), Gaps = 8/243 (3%)
 Frame = +2

Query: 32  LGPNSKPSKGLRKANLSPRTKALLEEVALSEQGHALYTRQAGSKIPCPYVDYDA-GFNAV 208
           L     P  G RKANL   T+ ++ E AL E GH    +Q     P P ++  A  F  V
Sbjct: 103 LARGGPPPVGARKANLDEETRRVVSEFALQEVGHLRVIQQTVGGFPRPLLNLSADNFARV 162

Query: 209 FARAYGLKDGETISKLFGKDWDPYVNDATFALSMVFLEELGATGNKGLALLHTNPVXXXX 388
              A+G +            +DPY N   F L+   +  LG  G  G     TNP+    
Sbjct: 163 MDNAFGYR--------LNPPFDPYTNSLNFLLACYVIPYLGINGYVG-----TNPIIDGY 209

Query: 389 XXXXXXXXXXXXXIE-----RSILFDLKDEIVPPTNETVTQVFARLSAYRDSMDGPQIDD 553
                               R++LF  + E VPP N TV +   R+SA R+ +    + D
Sbjct: 210 KTKELVAGLLGVEAGQDAAFRTLLFRRRGEAVPPYNVTVAEFTDRVSALRNRLGRCGVKD 269

Query: 554 QGLLNKDPRFISVPGSFVNNI-PTDIRGISFSRTPLMNLNILTL-GAKDGKGGFFPEGIA 727
           +GL    PR +   G+   N+   D   +S+SRTP   L IL L G +   GGF+P+G  
Sbjct: 270 EGL--TVPRELGAEGAICTNVLSADRDSLSYSRTPAELLRILYLTGDEHVPGGFYPDGAN 327

Query: 728 GRI 736
           G+I
Sbjct: 328 GKI 330


>ref|WP_023461881.1| hypothetical protein [Asticcacaulis sp. YBE204]
           gi|557340038|gb|ESQ79764.1| hypothetical protein
           AEYBE204_07930 [Asticcacaulis sp. YBE204]
          Length = 329

 Score = 92.0 bits (227), Expect = 2e-16
 Identities = 77/235 (32%), Positives = 104/235 (44%), Gaps = 7/235 (2%)
 Frame = +2

Query: 59  GLRKANLSPRT-KALLEEVALSEQGHALYTRQA--GSKIPCPYVDYDAGFNAVF---ARA 220
           G RK N + +  +    E+A  E  H  + R A   S +  P +D  AG +  F   ARA
Sbjct: 110 GGRKVNFTDKVVEQYAREIAQDEVAHVRFLRTALGSSAVAQPAIDVGAGVDGAFSAAARA 169

Query: 221 YGLKDGETISKLFGKDWDPYVNDATFALSMVFLEELGATGNKGLALLHTNPVXXXXXXXX 400
            GL          G  +DPY  D  F L     E++G T  KG A L TN          
Sbjct: 170 SGLVGA-------GVAFDPYATDENFLLGAYIFEDVGVTAYKGAAPLITNKTYLEAAAGI 222

Query: 401 XXXXXXXXXIERSILFDLKDEIVPPTNETVTQVFARLSAYRDSMDGPQIDDQGLLNKDPR 580
                    I R++L+  +  I  P+  T T    R+S  RDS+DG    DQG+L  D  
Sbjct: 223 LAAEAYHAAIVRTVLY--RKGIAAPSLATGT---VRISDARDSLDGSSDLDQGILGAD-- 275

Query: 581 FISVPGSFVNNIPTDIRGISFSRTPLMNLNILTLG-AKDGKGGFFPEGIAGRINT 742
                 +  N +PTD  G+++SR+    LNI+ L      KGGFFP G+ G I T
Sbjct: 276 -----STISNIVPTDASGLTYSRSTGQVLNIVYLNKLAVAKGGFFPNGVNGNIIT 325


>ref|WP_008712540.1| putative exported protein [Rhodococcus sp. AW25M09]
           gi|443416263|emb|CCQ14596.1| putative exported protein
           [Rhodococcus sp. AW25M09]
          Length = 319

 Score = 91.7 bits (226), Expect = 3e-16
 Identities = 82/256 (32%), Positives = 109/256 (42%), Gaps = 6/256 (2%)
 Frame = +2

Query: 2   VQGQDFKKDLLGPNSKPSK---GLRKANLSPRTKALLEEVALSEQGHALYTRQA--GSKI 166
           V G+     L+G    P     G +    S   KA  EE+A  E  H  + R A   + +
Sbjct: 91  VTGKGLPDTLVGGTGTPGPVTGGRQVTFESKLIKAYAEEIAFDELNHVAFLRGALGNAAV 150

Query: 167 PCPYVDYDAGFNAVFARAYGLKDGETISKLFGKDWDPYVNDATFALSMVFLEELGATGNK 346
             P +D DA F A    A  +  GET        +D Y N+  F L     E++G T  K
Sbjct: 151 ARPAIDLDASFTAAAMAAGLIGAGET--------FDVYANEKNFLLGAFIFEDVGVTAYK 202

Query: 347 GLALLHTNPVXXXXXXXXXXXXXXXXXIERSILFDLKDEIVPPTNETVTQVFARLSAYRD 526
           G A L +N                   I R+ LF L  E   P N         +S  RD
Sbjct: 203 GAAPLVSNKTYLEAAAGILAAEAYHAGIIRTSLFSLGLEA--PANA--------ISDARD 252

Query: 527 SMDGPQIDDQGLLNKDPRFISVPGSFVNNIPTDIRGISFSRTPLMNLNILTLG-AKDGKG 703
           S+DGP   DQG        I++ G+  N +P D  GI++SR+P   LNI+ L  A    G
Sbjct: 253 SLDGPDDLDQG--------ITLDGA-ANLVPLDANGIAYSRSPGQVLNIVYLNPAPVRSG 303

Query: 704 GFFPEGIAGRINTPEG 751
           GFFP G+ G +NT  G
Sbjct: 304 GFFPAGVNGELNTSGG 319


>ref|WP_003527020.1| hypothetical protein [Sinorhizobium meliloti]
           gi|359506210|gb|EHK78726.1| hypothetical protein
           SM0020_07262 [Sinorhizobium meliloti CCNWSX0020]
          Length = 290

 Score = 91.3 bits (225), Expect = 4e-16
 Identities = 73/241 (30%), Positives = 112/241 (46%), Gaps = 5/241 (2%)
 Frame = +2

Query: 35  GPNSKPSKGLRKANL-SPRTKALLEEVALSEQGHALYTRQ--AGSKIPCPYVDYDAGFNA 205
           G ++ P  G ++ +  +P     ++EVA +E  H  + R+  A   +P P +D+DAGF A
Sbjct: 77  GSDAGPVTGGKQVSFDTPAIGEFMQEVAENELAHVRFYRKTLADQAVPRPAIDFDAGFAA 136

Query: 206 VFARAYGLKDGETISKLFGKDWDPYVNDATFALSMVFLEELGATGNKGLALLHTNPVXXX 385
           V A+A GL          G+D+DP+ N+  F L  +  E++G T   G A +  N     
Sbjct: 137 V-AKAAGL----------GEDFDPFGNETNFVLGGILFEDVGVTAYAGAATVLKNKDFLA 185

Query: 386 XXXXXXXXXXXXXXIERSILFDLKDEIVPPTNETVTQVFARLSAYRDSMDGPQIDDQGLL 565
                         + RS L+           E   +    +S  RD +DGP+  DQGL 
Sbjct: 186 AAAGILAVEAYHMGMARSTLY--------RKGEEAWKAAQAVSDARDKIDGPEDKDQGL- 236

Query: 566 NKDPRFISVPGSFVNNIPTDIRGISFSRTPLMNLNILTLGAKDG--KGGFFPEGIAGRIN 739
                   V G   N +P+    I+F+RTP   L I+ L  K+G  KGGF+P G+ G+I 
Sbjct: 237 -------QVDGK-ANIVPSTPDAIAFTRTPQEVLRIVYLSDKEGASKGGFYPNGMNGKIK 288

Query: 740 T 742
           +
Sbjct: 289 S 289


>ref|XP_005651445.1| hypothetical protein COCSUDRAFT_59405 [Coccomyxa subellipsoidea
           C-169] gi|384253426|gb|EIE26901.1| hypothetical protein
           COCSUDRAFT_59405 [Coccomyxa subellipsoidea C-169]
          Length = 289

 Score = 90.5 bits (223), Expect = 7e-16
 Identities = 72/234 (30%), Positives = 98/234 (41%), Gaps = 8/234 (3%)
 Frame = +2

Query: 59  GLRKANLSPRTKALLEEVALSEQGHALYTRQAG--SKIPCPYVDYDAGFNAVFARAYGLK 232
           G +KA LSP  + +  E A  E  H  + R+A   + +PCP +D    FNAV   A G +
Sbjct: 65  GGQKARLSPAVQTIAAEFARDEVAHLAFLRKAAGAAAVPCPQIDIGGSFNAVIKAALGSR 124

Query: 233 DGETISKLFGKDWDPYVNDATFALSMVFLEELGATGNKGLALLHTNPVXXXXXXXXXXXX 412
            G+ +       + PY ND  F LS    E++GAT   G   + T PV            
Sbjct: 125 AGDNV-------FSPYTNDVNFLLSAFLFEDVGATAFAGAIPVLTGPVATGAAAGILGVE 177

Query: 413 XXXXXIERSILFDLKDEIVPPTNETVTQVFARLSAYRDSMDGPQIDDQGLLNKDPRFISV 592
                + R  LF+  D IV P    +      LS  R  + G +  D+G+        S+
Sbjct: 178 AYHGGLLRQWLFNNGDLIVQPYGIQIVSFVQALSDLRAKVGGGK--DEGITIPSAT-ASI 234

Query: 593 PGSFV------NNIPTDIRGISFSRTPLMNLNILTLGAKDGKGGFFPEGIAGRI 736
            G  V      N +P DI    F+RTP   L I   G     G FFP G+ G I
Sbjct: 235 YGPNVLNFFQANIVPADIDAKIFARTPQEVLAIAYGGDATKPGAFFPSGLNGSI 288


>ref|XP_002465286.1| hypothetical protein SORBIDRAFT_01g035570 [Sorghum bicolor]
           gi|241919140|gb|EER92284.1| hypothetical protein
           SORBIDRAFT_01g035570 [Sorghum bicolor]
          Length = 363

 Score = 90.1 bits (222), Expect = 9e-16
 Identities = 79/252 (31%), Positives = 110/252 (43%), Gaps = 9/252 (3%)
 Frame = +2

Query: 8   GQDFKKDLLGPNSKPSKGLRKANLSPRTKALLEEVALSEQGHALYTRQAGSKIPCPYVDY 187
           G D     L     P  G RKANL   T  ++ E AL E GH     +  + IP P +D 
Sbjct: 107 GLDHVAPKLALGGPPPVGARKANLDEVTWRIVAEFALQEVGHIRAIERTSAGIPRPLIDL 166

Query: 188 DA-GFNAVFARAYGLKDGETISKLFGKDWDPYVNDATFALSMVFLEELGATGNKGLALLH 364
            A  F  +  +A+G +            +DPYVN   F L+   +  LG  G  G     
Sbjct: 167 SARNFARLMDKAFGYR--------LDPPFDPYVNSLNFMLASYVIPYLGINGYVG----- 213

Query: 365 TNPV-----XXXXXXXXXXXXXXXXXIERSILFDLKDEIVPP-TNETVTQVFARLSAYRD 526
           TNP+                      + R+ LF+   E VPP  N TV +   R+SA R+
Sbjct: 214 TNPIIDGYETKKLLAGLLGVEAAQDAVIRARLFEHLGEAVPPYRNITVAEFTDRVSALRN 273

Query: 527 SMDGPQIDDQGLLNKDPRFISVPGSFVNNI-PTDIRGISFSRTPLMNLNILTL-GAKDGK 700
            +    + D+GL    PR +   G+   N+   D   +S++RTP   L+IL L G +   
Sbjct: 274 ELGRCGVKDEGL--TVPRALGAEGAICTNVLSADRDSLSYARTPAELLSILYLTGDEHVP 331

Query: 701 GGFFPEGIAGRI 736
           GGF+PEG  GRI
Sbjct: 332 GGFYPEGGNGRI 343


>ref|NP_001142402.1| hypothetical protein precursor [Zea mays]
           gi|194708654|gb|ACF88411.1| unknown [Zea mays]
           gi|238007370|gb|ACR34720.1| unknown [Zea mays]
           gi|414866768|tpg|DAA45325.1| TPA: hypothetical protein
           ZEAMMB73_576945 [Zea mays]
          Length = 353

 Score = 89.7 bits (221), Expect = 1e-15
 Identities = 79/252 (31%), Positives = 109/252 (43%), Gaps = 9/252 (3%)
 Frame = +2

Query: 8   GQDFKKDLLGPNSKPSKGLRKANLSPRTKALLEEVALSEQGHALYTRQAGSKIPCPYVDY 187
           G D     L     P  G RKANL   T+ ++ E  L E GH    ++    IP P +D 
Sbjct: 106 GLDHLAPRLALGGPPPVGARKANLDEVTRRIVAEFGLQEVGHIRAIQRTVGGIPRPLIDL 165

Query: 188 DA-GFNAVFARAYGLKDGETISKLFGKDWDPYVNDATFALSMVFLEELGATGNKGLALLH 364
            A  F  V   A+G +            +DPYVN   F L+   +  LG  G  G     
Sbjct: 166 SAHNFARVMDEAFGTR--------LDPPFDPYVNSLNFLLASYVIPYLGINGYVG----- 212

Query: 365 TNPV-----XXXXXXXXXXXXXXXXXIERSILFDLKDEIVPP-TNETVTQVFARLSAYRD 526
           TNP+                      + R+ LF+   E VPP  N TV +   R+SA R+
Sbjct: 213 TNPIVDGYQTKKLLAGLLGVEAAQDAVFRARLFERLGEAVPPYGNITVAEFTDRVSALRN 272

Query: 527 SMDGPQIDDQGLLNKDPRFISVPGSFVNNI-PTDIRGISFSRTPLMNLNILTL-GAKDGK 700
            +    + D+GL    PR +   G+   N+   D   +S++RTP   L+IL L G +   
Sbjct: 273 RLGRCGVKDEGL--TVPRRLGAEGAICTNVLSADRDSLSYARTPAELLSILYLTGDERVP 330

Query: 701 GGFFPEGIAGRI 736
           GGF+PEG  GRI
Sbjct: 331 GGFYPEGANGRI 342


>ref|YP_007686488.1| conserved secreted protein [Clavibacter michiganensis subsp.
           nebraskensis NCPPB 2581]
           gi|505303787|ref|WP_015490889.1| conserved secreted
           protein [Clavibacter michiganensis]
           gi|472822617|emb|CCE76165.1| conserved secreted protein
           [Clavibacter michiganensis subsp. nebraskensis NCPPB
           2581]
          Length = 313

 Score = 89.4 bits (220), Expect = 1e-15
 Identities = 72/215 (33%), Positives = 99/215 (46%), Gaps = 3/215 (1%)
 Frame = +2

Query: 107 EVALSEQGHALYTRQA--GSKIPCPYVDYDAGFNAVFARAYGLKDGETISKLFGKDWDPY 280
           E+A  E+ H  + R A   +K+  P +D DA F+A    A  +K G+         +D +
Sbjct: 122 EIAQDEKAHVKFLRSALGSAKVARPAIDLDAAFSAAATAAGLIKPGQK--------FDAF 173

Query: 281 VNDATFALSMVFLEELGATGNKGLALLHTNPVXXXXXXXXXXXXXXXXXIERSILFDLKD 460
            +D  F L+    E++G T  KG A L TN                   I R+ LF    
Sbjct: 174 ASDENFLLASFVFEDVGVTAYKGAAPLITNKTYLEAAAGILAVEAYHAGIIRTSLF--AK 231

Query: 461 EIVPPTNETVTQVFARLSAYRDSMDGPQIDDQGLLNKDPRFISVPGSFVNNIPTDIRGIS 640
            +  PTN         +S  RDS+DG    DQG        I+V G   N +PTD  GI+
Sbjct: 232 GLAAPTNA--------ISNARDSLDGSSDLDQG--------ITVSGG-ANLVPTDANGIA 274

Query: 641 FSRTPLMNLNILTLGAKD-GKGGFFPEGIAGRINT 742
           FSRT    LNI+ L +K   +GGF+P G+ G INT
Sbjct: 275 FSRTTGQVLNIVYLNSKAVTRGGFYPNGVNGGINT 309


>ref|YP_006814614.1| hypothetical protein BN406_04525 [Sinorhizobium meliloti Rm41]
           gi|504803046|ref|WP_014990148.1| hypothetical protein
           [Sinorhizobium meliloti] gi|407322205|emb|CCM70807.1|
           hypothetical protein BN406_04525 [Sinorhizobium meliloti
           Rm41]
          Length = 290

 Score = 89.4 bits (220), Expect = 1e-15
 Identities = 72/241 (29%), Positives = 112/241 (46%), Gaps = 5/241 (2%)
 Frame = +2

Query: 35  GPNSKPSKGLRKANL-SPRTKALLEEVALSEQGHALYTRQ--AGSKIPCPYVDYDAGFNA 205
           G ++ P  G ++ +  +P     ++EVA +E  H  + R+  A   +P P +D+DAGF A
Sbjct: 77  GSDAGPVTGGKQVSFDTPAIGEFMQEVAENELAHVRFYRKTLADQAVPRPAIDFDAGFAA 136

Query: 206 VFARAYGLKDGETISKLFGKDWDPYVNDATFALSMVFLEELGATGNKGLALLHTNPVXXX 385
           V A++ GL          G+D+DP+ N+  F L  +  E++G T   G A +  N     
Sbjct: 137 V-AKSAGL----------GEDFDPFGNETNFVLGGMLFEDVGVTAYAGAATVLKNKDFLA 185

Query: 386 XXXXXXXXXXXXXXIERSILFDLKDEIVPPTNETVTQVFARLSAYRDSMDGPQIDDQGLL 565
                         + RS L+           E   +    +S  RD +DGP+  DQGL 
Sbjct: 186 AAAGILAVEAYHMGMARSTLY--------RKGEEAWKAAQAVSDARDKIDGPEDKDQGL- 236

Query: 566 NKDPRFISVPGSFVNNIPTDIRGISFSRTPLMNLNILTLGAKDG--KGGFFPEGIAGRIN 739
                   V G   N +P+    I+F+RTP   L I+ L  K+G  KGGF+P G+ G+I 
Sbjct: 237 -------QVDGK-ANIVPSTPDAIAFTRTPQEVLRIVYLSDKEGASKGGFYPNGMNGKIK 288

Query: 740 T 742
           +
Sbjct: 289 S 289


>ref|YP_001711098.1| hypothetical protein CMS_2441 [Clavibacter michiganensis subsp.
           sepedonicus] gi|501256692|ref|WP_012299710.1|
           hypothetical protein [Clavibacter michiganensis]
           gi|169157334|emb|CAQ02521.1| putative exported protein
           [Clavibacter michiganensis subsp. sepedonicus]
          Length = 313

 Score = 89.0 bits (219), Expect = 2e-15
 Identities = 73/215 (33%), Positives = 98/215 (45%), Gaps = 3/215 (1%)
 Frame = +2

Query: 107 EVALSEQGHALYTRQA--GSKIPCPYVDYDAGFNAVFARAYGLKDGETISKLFGKDWDPY 280
           E+A  E+ H  + R A   +K+  P +D DA F+A    A  +K GE         +D +
Sbjct: 122 EIAQDEKAHVKFLRSALGSAKVARPAIDLDAAFSAAAQAAGLIKAGEK--------FDAF 173

Query: 281 VNDATFALSMVFLEELGATGNKGLALLHTNPVXXXXXXXXXXXXXXXXXIERSILFDLKD 460
            +D  F L+    E++G T  KG A L TN                   I R+ LF    
Sbjct: 174 ASDENFLLASFVFEDVGVTAYKGAAPLITNKTYLEAAAGILAVEAYHAGIIRTSLF--AK 231

Query: 461 EIVPPTNETVTQVFARLSAYRDSMDGPQIDDQGLLNKDPRFISVPGSFVNNIPTDIRGIS 640
            +  PTN         +S  RDS+DG    DQG        I++ G   N +PTD  GI+
Sbjct: 232 GLAAPTNA--------ISNARDSLDGSTDLDQG--------ITISGG-ANLVPTDANGIA 274

Query: 641 FSRTPLMNLNILTLGAKD-GKGGFFPEGIAGRINT 742
           FSRT    LNI+ L  K   KGGF+P G+ G INT
Sbjct: 275 FSRTTGQVLNIVYLNNKAVTKGGFYPNGVNGGINT 309


>ref|WP_023721731.1| hypothetical protein [Mesorhizobium sp. LSHC420B00]
           gi|563040721|gb|ESX65762.1| hypothetical protein
           X759_28625 [Mesorhizobium sp. LSHC420B00]
          Length = 275

 Score = 88.6 bits (218), Expect = 2e-15
 Identities = 77/246 (31%), Positives = 111/246 (45%), Gaps = 7/246 (2%)
 Frame = +2

Query: 26  DLLGPNSKPSKGLRKANLSPRTKAL---LEEVALSEQGHALYTRQAGSK--IPCPYVDYD 190
           D     SKP   +    +S  T A+   ++EVA +E  H  + R+  +K  +  P +D+D
Sbjct: 57  DAADAGSKPGDVVGGKKVSFETPAIGEFMQEVAENELAHVRFYRKTLAKNAVDRPAIDFD 116

Query: 191 AGFNAVFARAYGLKDGETISKLFGKDWDPYVNDATFALSMVFLEELGATGNKGLALLHTN 370
           AGF AV A A GL          G D+DP+ N+  F L  +  E++G T   G A L  N
Sbjct: 117 AGFKAV-AEAAGL----------GPDFDPFGNETNFVLGGMLFEDVGVTAYAGAATLLKN 165

Query: 371 PVXXXXXXXXXXXXXXXXXIERSILFDLKDEIVPPTNETVTQVFARLSAYRDSMDGPQID 550
                              + RS L+   ++     N         +S  RD +DGP+  
Sbjct: 166 KDFLAAAAGILAVEAYHMGMARSTLYRKGEKAWKAANA--------VSDARDKIDGPEDK 217

Query: 551 DQGLLNKDPRFISVPGSFVNNIPTDIRGISFSRTPLMNLNILTLGAKDG--KGGFFPEGI 724
           DQG        I V G   N +P+    I+F+RTP   L I+ L  KDG  KGGF+PEG+
Sbjct: 218 DQG--------IQVNGK-ANFVPSTPDAIAFTRTPKEVLRIVYLTDKDGVSKGGFYPEGM 268

Query: 725 AGRINT 742
            G + +
Sbjct: 269 NGTLKS 274


>ref|NP_436035.2| hypothetical protein SMa1445 [Sinorhizobium meliloti 1021]
           gi|334319061|ref|YP_004551620.1| hypothetical protein
           Sinme_5982 [Sinorhizobium meliloti AK83]
           gi|384532581|ref|YP_005718185.1| hypothetical protein
           [Sinorhizobium meliloti BL225C]
           gi|384540660|ref|YP_005724743.1| hypothetical protein
           SM11_pC0861 [Sinorhizobium meliloti SM11]
           gi|470184775|ref|YP_007572748.1| Hypothetical protein
           SM2011_a1445 [Sinorhizobium meliloti 2011]
           gi|499270366|ref|WP_010967759.1| hypothetical protein
           [Sinorhizobium meliloti] gi|193073131|gb|AAK65447.2|
           hypothetical protein SMa1445 [Sinorhizobium meliloti
           1021] gi|333814757|gb|AEG07425.1| hypothetical protein
           SinmeB_6302 [Sinorhizobium meliloti BL225C]
           gi|334099488|gb|AEG57497.1| hypothetical protein
           Sinme_5982 [Sinorhizobium meliloti AK83]
           gi|336036003|gb|AEH81934.1| hypothetical protein
           SM11_pC0861 [Sinorhizobium meliloti SM11]
           gi|459643434|gb|AGG70480.1| Hypothetical protein
           SM2011_a1445 [Sinorhizobium meliloti 2011]
           gi|589249892|emb|CDH82040.1| hypothetical protein
           SMRU11_pSmeRU11d_0906 [Sinorhizobium meliloti RU11/001]
          Length = 290

 Score = 88.6 bits (218), Expect = 2e-15
 Identities = 71/241 (29%), Positives = 112/241 (46%), Gaps = 5/241 (2%)
 Frame = +2

Query: 35  GPNSKPSKGLRKANL-SPRTKALLEEVALSEQGHALYTRQ--AGSKIPCPYVDYDAGFNA 205
           G ++ P  G ++ +  +P     ++EVA +E  H  + R+  A   +P P +D+DAGF A
Sbjct: 77  GSDAGPVTGGKQVSFDTPAIGEFMQEVAENELAHVRFYRKTLADQAVPRPAIDFDAGFAA 136

Query: 206 VFARAYGLKDGETISKLFGKDWDPYVNDATFALSMVFLEELGATGNKGLALLHTNPVXXX 385
           V A++ GL          G+D+DP+ N+  F L  +  E++G T   G A +  N     
Sbjct: 137 V-AKSAGL----------GEDFDPFGNETNFVLGGMLFEDVGVTAYAGAATVLKNKDFLA 185

Query: 386 XXXXXXXXXXXXXXIERSILFDLKDEIVPPTNETVTQVFARLSAYRDSMDGPQIDDQGLL 565
                         + RS L+           E   +    +S  RD +DGP+  DQGL 
Sbjct: 186 AAAGILAVEAYHMGMARSTLY--------RKGEEAWKAAQAVSDARDKIDGPEDKDQGL- 236

Query: 566 NKDPRFISVPGSFVNNIPTDIRGISFSRTPLMNLNILTLGAKDG--KGGFFPEGIAGRIN 739
                   V G   N +P+    I+F+RTP   L I+ +  K+G  KGGF+P G+ G+I 
Sbjct: 237 -------QVDGK-ANIVPSTPDAIAFTRTPQEVLRIVYISDKEGASKGGFYPNGMNGKIK 288

Query: 740 T 742
           +
Sbjct: 289 S 289


>ref|WP_022887404.1| hypothetical protein [Glaciibacter superstes]
          Length = 306

 Score = 88.2 bits (217), Expect = 3e-15
 Identities = 73/251 (29%), Positives = 112/251 (44%), Gaps = 6/251 (2%)
 Frame = +2

Query: 2   VQGQDFKKDLLGPNSKPSK--GLRKANL-SPRTKALLEEVALSEQGHALYTRQA--GSKI 166
           V GQ    +++G    P +  G R+ +  SP  K +  E+A+ E+ H  + R A   + +
Sbjct: 69  VTGQGLPDEMVGGTGTPGQVSGGRQVDFRSPLIKNIAREIAMDERAHVAFLRGALGDAAV 128

Query: 167 PCPYVDYDAGFNAVFARAYGLKDGETISKLFGKDWDPYVNDATFALSMVFLEELGATGNK 346
             P +  D  F A    A  +K GE         +D Y ND  F  +    E++G T  K
Sbjct: 129 ARPRISLDHSFTAAATAAGLIKPGEI--------FDAYANDRNFLFAAFLFEDVGVTAFK 180

Query: 347 GLALLHTNPVXXXXXXXXXXXXXXXXXIERSILFDLKDEIVPPTNETVTQVFARLSAYRD 526
           G A   +N                   I R+ LF   + +V   +++V  V  ++SA R+
Sbjct: 181 GAAPFISNKTYLDAAAGLLATEAYHAGIVRATLF--SEGLV---DDSVFDVVHKISAVRN 235

Query: 527 SMDGPQIDDQGLLNKDPRFISVPGSFVNNIPTDIRGISFSRTPLMNLNILTLGAKDG-KG 703
           ++ GP  DDQ L  KD           N +PTD  G +F R+    LNI+ L  + G  G
Sbjct: 236 AVSGPTDDDQDLGTKD---------VANLVPTDENGRAFGRSAAEILNIVYLDPQGGNSG 286

Query: 704 GFFPEGIAGRI 736
           GF+P+G+ G I
Sbjct: 287 GFYPDGLNGDI 297


>ref|YP_007192908.1| hypothetical protein C770_GR4pC0588 [Sinorhizobium meliloti GR4]
           gi|505055092|ref|WP_015242194.1| hypothetical protein
           [Sinorhizobium meliloti] gi|429554360|gb|AGA09309.1|
           hypothetical protein C770_GR4pC0588 [Sinorhizobium
           meliloti GR4]
          Length = 290

 Score = 88.2 bits (217), Expect = 3e-15
 Identities = 71/241 (29%), Positives = 111/241 (46%), Gaps = 5/241 (2%)
 Frame = +2

Query: 35  GPNSKPSKGLRKANL-SPRTKALLEEVALSEQGHALYTRQ--AGSKIPCPYVDYDAGFNA 205
           G ++ P  G ++ +  +P     ++EVA  E  H  + R+  A   +P P +D+DAGF A
Sbjct: 77  GSDAGPVTGGKQVSFDTPAIGEFMQEVAEDELAHVRFYRKTLADQAVPRPAIDFDAGFAA 136

Query: 206 VFARAYGLKDGETISKLFGKDWDPYVNDATFALSMVFLEELGATGNKGLALLHTNPVXXX 385
           V A++ GL          G+D+DP+ N+  F L  +  E++G T   G A +  N     
Sbjct: 137 V-AKSAGL----------GEDFDPFGNETNFVLGGMLFEDVGVTAYAGAATVLKNKDFLA 185

Query: 386 XXXXXXXXXXXXXXIERSILFDLKDEIVPPTNETVTQVFARLSAYRDSMDGPQIDDQGLL 565
                         + RS L+           E   +    +S  RD +DGP+  DQGL 
Sbjct: 186 AAAGILAVEAYHMGMARSTLY--------RKGEEAWKAAQAVSDARDKIDGPEDKDQGL- 236

Query: 566 NKDPRFISVPGSFVNNIPTDIRGISFSRTPLMNLNILTLGAKDG--KGGFFPEGIAGRIN 739
                   V G   N +P+    I+F+RTP   L I+ +  K+G  KGGF+P G+ G+I 
Sbjct: 237 -------QVDGK-ANIVPSTPDAIAFTRTPQEVLRIVYISDKEGASKGGFYPNGMNGKIK 288

Query: 740 T 742
           +
Sbjct: 289 S 289


>ref|WP_018096307.1| hypothetical protein [Sinorhizobium meliloti]
          Length = 290

 Score = 87.4 bits (215), Expect = 6e-15
 Identities = 72/241 (29%), Positives = 111/241 (46%), Gaps = 5/241 (2%)
 Frame = +2

Query: 35  GPNSKPSKGLRKANLS-PRTKALLEEVALSEQGHALYTRQ--AGSKIPCPYVDYDAGFNA 205
           G ++ P  G ++ + + P   A + EVA +E  H  + R+  A   +P P +D+DAGF A
Sbjct: 77  GSDAGPVTGGKQVSFATPAIGAFMREVAENELAHVRFYRKTLADQAVPRPAIDFDAGFAA 136

Query: 206 VFARAYGLKDGETISKLFGKDWDPYVNDATFALSMVFLEELGATGNKGLALLHTNPVXXX 385
           V A+A GL          G+D+DP+ N+  F L  +  E++G T   G A +  N     
Sbjct: 137 V-AKAAGL----------GEDFDPFGNETNFVLGGMLFEDVGVTAYAGAATVLKNKDFLA 185

Query: 386 XXXXXXXXXXXXXXIERSILFDLKDEIVPPTNETVTQVFARLSAYRDSMDGPQIDDQGLL 565
                         + RS L+           E   +    +S  RD +DG +  DQG  
Sbjct: 186 AAAGILAVEAYHMGMARSTLY--------RKGEEAWKAAQAVSDARDKIDGAEDKDQG-- 235

Query: 566 NKDPRFISVPGSFVNNIPTDIRGISFSRTPLMNLNILTLGAKDG--KGGFFPEGIAGRIN 739
                 I + G   N +P+    I+F+RTP   L I+ L  K+G  KGGF+P G+ G+I 
Sbjct: 236 ------IQMDGK-ANIVPSTPDAIAFTRTPQEVLRIVYLSDKEGVSKGGFYPNGMNGKIK 288

Query: 740 T 742
           +
Sbjct: 289 S 289


>gb|EAY89979.1| hypothetical protein OsI_11540 [Oryza sativa Indica Group]
          Length = 346

 Score = 87.0 bits (214), Expect = 7e-15
 Identities = 74/243 (30%), Positives = 102/243 (41%), Gaps = 6/243 (2%)
 Frame = +2

Query: 26  DLLGPN----SKPSKGLRKANLSPRTKALLEEVALSEQGHALYTRQAGSKIPCPYVDYDA 193
           D L PN      P  G RKA L   T  +  E A  E GH    ++    IP P +D  A
Sbjct: 98  DHLAPNLTLGGPPPVGARKAGLDELTWRVCAEFAYQEIGHLRAIQRTVGGIPRPLIDLSA 157

Query: 194 GFNAVFARAYGLKDGETISKLFGKDWDPYVNDATFALSMVFLEELGATGNKGLALLHTNP 373
                FAR       E +       +DPY N   F L++  +  LG  G  G   L    
Sbjct: 158 HN---FARVMD----EAVGYHLDPPFDPYANSLNFLLAVYVIPYLGINGYTGTNPLIDGY 210

Query: 374 VXXXXXXXXXXXXXXXXXIERSILFDLKDEIVPPTNETVTQVFARLSAYRDSMDGPQIDD 553
                             + R +LF+ + E V P   TV ++  R+SA R+ +    + D
Sbjct: 211 ATKRLVAGLLAVESGQDAVVRGLLFEHRRETVSPYGATVAELTDRVSALRNKLGQCGVKD 270

Query: 554 QGLLNKDPRFISVPGSFVNNI-PTDIRGISFSRTPLMNLNILTL-GAKDGKGGFFPEGIA 727
           +GL+   P  +   G    NI   ++  +S+SRTP   L IL L G +   GGF+PEG  
Sbjct: 271 EGLI--VPEQLGAEGKICTNILSANVDSLSYSRTPAELLRILYLTGDEHVPGGFYPEGAN 328

Query: 728 GRI 736
           GRI
Sbjct: 329 GRI 331


>ref|YP_007951538.1| hypothetical protein L083_3548 [Actinoplanes sp. N902-109]
           gi|505434514|ref|WP_015621616.1| hypothetical protein
           [Actinoplanes sp. N902-109] gi|492005738|gb|AGL17058.1|
           hypothetical protein L083_3548 [Actinoplanes sp.
           N902-109]
          Length = 340

 Score = 86.7 bits (213), Expect = 9e-15
 Identities = 69/222 (31%), Positives = 103/222 (46%), Gaps = 3/222 (1%)
 Frame = +2

Query: 80  SPRTKALLEEVALSEQGHALYTRQA--GSKIPCPYVDYDAGFNAVFARAYGLKDGETISK 253
           +P  K   +E+A  E+ H  + R A   + +  P ++    F A  ARA GL  G T   
Sbjct: 142 TPAVKQYAQEIANDERAHVNFLRGALGSAAVARPAINIRDSFTAA-ARAAGLI-GST--- 196

Query: 254 LFGKDWDPYVNDATFALSMVFLEELGATGNKGLALLHTNPVXXXXXXXXXXXXXXXXXIE 433
              + +DPY N+  F L+    E++G T  KG A L  N                     
Sbjct: 197 ---ETFDPYANENNFLLAAFLFEDVGVTAYKGAAPLIHNKTYLEAAAGILAVEAYHAATI 253

Query: 434 RSILFDLKDEIVPPTNETVTQVFARLSAYRDSMDGPQIDDQGLLNKDPRFISVPGSFVNN 613
           R+ LF+          + +T    +L+A R+S+DGP  D+QGL+  D           N 
Sbjct: 254 RTSLFE----------KGLTDEVQKLTAARNSLDGPANDEQGLILNDR---------ANI 294

Query: 614 IPTDIRGISFSRTPLMNLNILTLG-AKDGKGGFFPEGIAGRI 736
           +PTD  G++FSRTP   LNI+ L   K  +GGF+P+G+ G +
Sbjct: 295 VPTDKSGVAFSRTPGRVLNIVYLNPGKVSRGGFYPKGVNGDV 336

