BLASTX nr result
ID: Akebia25_contig00039399
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia25_contig00039399 (1039 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|YP_003133881.1| hypothetical protein Svir_20370 [Saccharomon... 233 1e-58 gb|EJT72281.1| hypothetical protein GGTG_09147 [Gaeumannomyces g... 228 2e-57 ref|YP_004406608.1| cellulose-binding family ii [Verrucosispora ... 226 1e-56 gb|EWC58086.1| hypothetical protein UO65_6642 [Actinokineospora ... 226 2e-56 ref|WP_019811585.1| hypothetical protein [Saccharomonospora halo... 220 9e-55 ref|WP_007460956.1| cellulose-binding protein II [Micromonospora... 220 9e-55 ref|YP_003836925.1| cellulose-binding family II protein [Micromo... 218 3e-54 ref|YP_291040.1| hypothetical protein Tfu_2984 [Thermobifida fus... 218 4e-54 ref|YP_004084218.1| cellulose-binding family II [Micromonospora ... 217 6e-54 gb|EWM64219.1| cellulose-binding protein [Micromonospora sp. M42] 216 9e-54 gb|EUC48056.1| hypothetical protein COCMIDRAFT_3017 [Bipolaris o... 213 8e-53 gb|EUC34978.1| hypothetical protein COCCADRAFT_35432 [Bipolaris ... 210 9e-52 gb|EUN27976.1| hypothetical protein COCVIDRAFT_96993 [Bipolaris ... 209 1e-51 gb|EMD65577.1| hypothetical protein COCSADRAFT_139683 [Bipolaris... 209 1e-51 ref|YP_007951265.1| cellulose-binding family II [Actinoplanes sp... 209 2e-51 gb|EMD89809.1| hypothetical protein COCHEDRAFT_1177789 [Bipolari... 207 6e-51 ref|XP_001937624.1| conserved hypothetical protein [Pyrenophora ... 207 7e-51 ref|XP_003658813.1| hypothetical protein MYCTH_2295076 [Myceliop... 206 1e-50 ref|WP_019608417.1| hypothetical protein [Nocardiopsis sp. CNS639] 205 2e-50 ref|YP_003680621.1| hypothetical protein Ndas_2700 [Nocardiopsis... 205 3e-50 >ref|YP_003133881.1| hypothetical protein Svir_20370 [Saccharomonospora viridis DSM 43017] gi|506266592|ref|WP_015786367.1| hypothetical protein [Saccharomonospora viridis] gi|256585921|gb|ACU97054.1| hypothetical protein Svir_20370 [Saccharomonospora viridis DSM 43017] Length = 284 Score = 233 bits (594), Expect = 1e-58 Identities = 114/242 (47%), Positives = 151/242 (62%), Gaps = 3/242 (1%) Frame = -1 Query: 757 PYKSQLSDVPGLAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLI 578 PY + VP L HT Y P PTD+ +P + WGNG C DG WF N+L E ASHGFL+ Sbjct: 43 PYPASYETVPDLPNHTAYRPNSLPTDETLPIVAWGNGACRADGTWFENILTEFASHGFLV 102 Query: 577 IANGASNGGKSSQTTGKDLPDAIDW-ITSNAGK-GKF-ANVDKTKIAAAGQSCGGIQAYS 407 IANG NG S QT+ L +A+DW I NA KF +D IA GQSCGG++ Sbjct: 103 IANGKPNG--SGQTSPDMLLEAVDWAIAENANPTSKFHGRIDTANIAVMGQSCGGLETME 160 Query: 406 ASVDPRVVATGIFNSGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPANIP 227 + DPR+ T ++NSG++NP + L+ LHAPI YF+GGP+DIAY N D++ LPA +P Sbjct: 161 VADDPRITTTVMWNSGIINPLDKRQLRRLHAPIAYFVGGPSDIAYRNAMDDWERLPAGLP 220 Query: 226 AIKANLDAGHFATYSNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGWDIV 47 A NLD GH TY+ PN G+FG+ + +W+L+G+ATA+A+F+ P L WD+ Sbjct: 221 AFMGNLDVGHSGTYAEPNGGEFGRVGSNWLKWRLKGDATARAEFVGP-DCGLCSTEWDVQ 279 Query: 46 SK 41 K Sbjct: 280 QK 281 >gb|EJT72281.1| hypothetical protein GGTG_09147 [Gaeumannomyces graminis var. tritici R3-111a-1] Length = 250 Score = 228 bits (582), Expect = 2e-57 Identities = 119/240 (49%), Positives = 153/240 (63%), Gaps = 2/240 (0%) Frame = -1 Query: 754 YKSQLSDVPGLAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLII 575 Y + + L KHT+Y P P K+P +VWGNG C+ DG F L E+ASHG++II Sbjct: 11 YPAAYTTDSSLPKHTIYAPKTVPQGVKLPIMVWGNGACAADGLAFRGFLTEVASHGYVII 70 Query: 574 ANGASNGGKSSQTTGKDLPDAIDWITSNAGKG--KFANVDKTKIAAAGQSCGGIQAYSAS 401 A+GA NG S TT K + DAIDWI++ AG +A VDKTK+AAAG SCGG++AY Sbjct: 71 ASGAPNGQGS--TTSKQMRDAIDWISARAGTAGTPYAAVDKTKVAAAGMSCGGVEAYDQK 128 Query: 400 VDPRVVATGIFNSGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPANIPAI 221 D RV GIFNSGLL NT + T+ P+ YF+GG +DIAY+N ERD+K LPA P+ Sbjct: 129 DDARVATLGIFNSGLLQ--NTGAVATIRKPVFYFMGGTSDIAYQNAERDYKGLPAGTPSW 186 Query: 220 KANLDAGHFATYSNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGWDIVSK 41 K NL GH TY+ N GKFGKAAV + W L+G+AT+ A F +S+ DG+ + K Sbjct: 187 KGNLPVGHGGTYNQANGGKFGKAAVHWLNWVLKGDATSGAYF---KTSAAAADGFTVEKK 243 >ref|YP_004406608.1| cellulose-binding family ii [Verrucosispora maris AB-18-032] gi|503499999|ref|WP_013734660.1| cellulose-binding protein II [Verrucosispora maris] gi|328811836|gb|AEB46008.1| cellulose-binding family ii [Verrucosispora maris AB-18-032] Length = 405 Score = 226 bits (576), Expect = 1e-56 Identities = 116/242 (47%), Positives = 149/242 (61%), Gaps = 3/242 (1%) Frame = -1 Query: 757 PYKSQLSDVPGLAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLI 578 PY + P LA HT+Y P + P ++ +P + WGNG CSG+G N L EIASHGFL Sbjct: 39 PYPADYETSPSLANHTIYRPQNLPAER-LPILAWGNGACSGNGLSQHNFLREIASHGFLA 97 Query: 577 IANGASNGGKSSQTTGKDLPDAIDWIT---SNAGKGKFANVDKTKIAAAGQSCGGIQAYS 407 +ANGA NGG S T + L IDW S G + +D TKIA AG SCGG++AY+ Sbjct: 98 VANGAPNGGGS--TNAQMLTQTIDWAVAENSRQGSKYYNKLDTTKIAVAGFSCGGVEAYA 155 Query: 406 ASVDPRVVATGIFNSGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPANIP 227 S DPRV TGIFNSGLLN + L+ L PI YF+GGP+DIAY N D+ LPA +P Sbjct: 156 VSNDPRVTTTGIFNSGLLNDADDYQLRRLTKPIAYFIGGPSDIAYPNAMDDWGKLPAGLP 215 Query: 226 AIKANLDAGHFATYSNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGWDIV 47 A NL+ GH ATY PN G+F + A L+F+W+L+G+ A A F+ P + L + W + Sbjct: 216 AFMGNLNVGHGATYDQPNGGEFARVATLYFKWRLKGDTAAGANFVGP-NCGLCRTQWTVQ 274 Query: 46 SK 41 K Sbjct: 275 QK 276 >gb|EWC58086.1| hypothetical protein UO65_6642 [Actinokineospora sp. EG49] Length = 275 Score = 226 bits (575), Expect = 2e-56 Identities = 107/231 (46%), Positives = 144/231 (62%), Gaps = 3/231 (1%) Frame = -1 Query: 724 LAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLIIANGASNGGKS 545 L HT+Y P PT K+P + WGNG C DG WF N+L E ASHGFL+IANG G S Sbjct: 46 LTNHTIYRPNTLPTGVKLPIVAWGNGACRADGTWFENILTEWASHGFLVIANGRPGG--S 103 Query: 544 SQTTGKDLPDAIDWITSNAGKGK---FANVDKTKIAAAGQSCGGIQAYSASVDPRVVATG 374 T L ++IDW + + + +D TK+A GQSCGGI+ Y + DPR+ T Sbjct: 104 GSTDSDMLTESIDWAVAENSRRTSKYYGRIDTTKVAVMGQSCGGIETYEVADDPRITTTV 163 Query: 373 IFNSGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPANIPAIKANLDAGHF 194 ++NSGLL+ LL+ LHAPI YF+GGP+DIA+ N D++ LPA +PA NLD GHF Sbjct: 164 LWNSGLLDDAQNGLLQRLHAPIAYFIGGPSDIAHPNAMDDWRRLPAGLPAFMGNLDVGHF 223 Query: 193 ATYSNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGWDIVSK 41 T+S PN G+FG+ + +WQL+G+ T++AQF+ T L WD++ K Sbjct: 224 GTFSEPNGGEFGRVGGHWLKWQLKGDLTSRAQFV-GTDCGLCASDWDVLRK 273 >ref|WP_019811585.1| hypothetical protein [Saccharomonospora halophila] Length = 276 Score = 220 bits (560), Expect = 9e-55 Identities = 107/242 (44%), Positives = 143/242 (59%), Gaps = 3/242 (1%) Frame = -1 Query: 757 PYKSQLSDVPGLAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLI 578 PY + L HT+Y P PTD+K+P + WGNG C DG WF N+L E ASHG+L+ Sbjct: 35 PYPASYETTFTLPDHTVYRPDTLPTDEKMPIVAWGNGACRADGTWFENILTEFASHGYLV 94 Query: 577 IANGASNGGKSSQTTGKDLPDAIDWI---TSNAGKGKFANVDKTKIAAAGQSCGGIQAYS 407 IANG G S T L DA+DW S + +D IA GQSCGG++ Sbjct: 95 IANGEPGG--SGSTDADMLIDAVDWAIERNSRPWSDYYGKLDTDNIAVMGQSCGGLETME 152 Query: 406 ASVDPRVVATGIFNSGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPANIP 227 + DPR+ T ++NSG+++ + LL LHAPI YF+GGP DIAY+N D+ LPA +P Sbjct: 153 VADDPRITTTVMWNSGIMSSWDKRLLGDLHAPIAYFVGGPEDIAYDNAMDDWGRLPAGLP 212 Query: 226 AIKANLDAGHFATYSNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGWDIV 47 A NLD GH+ T+ PN G+FG+ V + WQL+G A A+AQF+ P + L + WD+ Sbjct: 213 AFMGNLDVGHYGTFEEPNGGEFGRVGVEWLDWQLKGEADARAQFVGP-NCGLCQTEWDVQ 271 Query: 46 SK 41 K Sbjct: 272 PK 273 >ref|WP_007460956.1| cellulose-binding protein II [Micromonospora lupini] gi|385885087|emb|CCH19100.1| Extracellular cellulose-binding hydrolase [Micromonospora lupini str. Lupac 08] Length = 416 Score = 220 bits (560), Expect = 9e-55 Identities = 115/242 (47%), Positives = 148/242 (61%), Gaps = 3/242 (1%) Frame = -1 Query: 757 PYKSQLSDVPGLAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLI 578 PY + LA HT++ P P+++ +P +VWGNGGCS +G N L EIASHGFL Sbjct: 39 PYPADYETSASLANHTIFRPQTLPSER-LPILVWGNGGCSANGLSQGNFLREIASHGFLA 97 Query: 577 IANGASNGGKSSQTTGKDLPDAIDWIT---SNAGKGKFANVDKTKIAAAGQSCGGIQAYS 407 IANGA NG S TT + L +IDW S G F +D TK+A AG SCGG++AY+ Sbjct: 98 IANGAPNG--SGSTTSQMLTQSIDWAVAENSRQGSKYFNKLDTTKVAVAGFSCGGLEAYA 155 Query: 406 ASVDPRVVATGIFNSGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPANIP 227 S DPRV TGIF+SGLLN + L+ L PI YF+GGP+DIAY N D+ LPA +P Sbjct: 156 VSGDPRVTTTGIFSSGLLNDADDYQLRRLTKPIAYFVGGPSDIAYPNAMDDWGKLPAGLP 215 Query: 226 AIKANLDAGHFATYSNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGWDIV 47 A NL+ GH TY PN G+FG+ AVL+ +W+L+G+ TA A F+ L W + Sbjct: 216 AFMGNLNVGHGGTYDQPNGGEFGRVAVLYLKWRLKGDTTAGANFV-GADCGLCHSQWTVQ 274 Query: 46 SK 41 K Sbjct: 275 QK 276 >ref|YP_003836925.1| cellulose-binding family II protein [Micromonospora aurantiaca ATCC 27029] gi|503051997|ref|WP_013286973.1| cellulose-binding protein II [Micromonospora aurantiaca] gi|302571147|gb|ADL47349.1| cellulose-binding family II [Micromonospora aurantiaca ATCC 27029] Length = 404 Score = 218 bits (555), Expect = 3e-54 Identities = 113/242 (46%), Positives = 147/242 (60%), Gaps = 3/242 (1%) Frame = -1 Query: 757 PYKSQLSDVPGLAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLI 578 PY + L HT++ P P+++ +P +VWGNGGCS +G N L EIASHGFL Sbjct: 39 PYPADYETSSSLPNHTIFRPQTLPSER-LPVLVWGNGGCSANGLSQGNFLREIASHGFLA 97 Query: 577 IANGASNGGKSSQTTGKDLPDAIDWIT---SNAGKGKFANVDKTKIAAAGQSCGGIQAYS 407 IANGA NG S T + L +IDW S G + +D TKIA AG SCGG++AY+ Sbjct: 98 IANGAPNG--SGSTNAQMLTQSIDWAVAENSRPGSRYYNRIDTTKIAVAGFSCGGLEAYA 155 Query: 406 ASVDPRVVATGIFNSGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPANIP 227 S DPRV TGIF+SGLLN + L+ L PI YF+GGP+DIAY N D+ LPA +P Sbjct: 156 VSNDPRVTTTGIFSSGLLNDADDYQLRRLTKPIAYFVGGPSDIAYPNAMDDWGKLPAGLP 215 Query: 226 AIKANLDAGHFATYSNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGWDIV 47 A NL+ GH TY PN G+FG+ AVL+ +W+L+G+ A A F+ P L + W + Sbjct: 216 AFMGNLNVGHGGTYDQPNGGEFGRVAVLYLKWRLKGDVGAGANFVGP-DCGLCRSQWSVQ 274 Query: 46 SK 41 K Sbjct: 275 QK 276 >ref|YP_291040.1| hypothetical protein Tfu_2984 [Thermobifida fusca YX] gi|499612667|ref|WP_011293401.1| hypothetical protein [Thermobifida fusca] gi|71917115|gb|AAZ57017.1| hypothetical protein Tfu_2984 [Thermobifida fusca YX] gi|507500183|gb|EOR69929.1| hypothetical protein TM51_15306 [Thermobifida fusca TM51] Length = 285 Score = 218 bits (554), Expect = 4e-54 Identities = 109/245 (44%), Positives = 150/245 (61%), Gaps = 6/245 (2%) Frame = -1 Query: 757 PYKSQLSDVPGLAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLI 578 PY ++ L HT++ P P+++K+P +VWGNG C +G F N+L E ASHGFL+ Sbjct: 39 PYPAEYVTTLRLRNHTIFRPATLPSNEKLPIVVWGNGACLANGTMFENILLEFASHGFLV 98 Query: 577 IANGASNGGKSSQTTGKDLPDAIDW-ITSNAG-----KGKFANVDKTKIAAAGQSCGGIQ 416 IANG NG +T L +AIDW I N+ +GK +D TKIAA GQSCGG++ Sbjct: 99 IANGRPNG--FGRTDAAMLTEAIDWAIEENSRLLSPYRGK---LDTTKIAAMGQSCGGLE 153 Query: 415 AYSASVDPRVVATGIFNSGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPA 236 Y + DPR+ T ++NSGLL+ + LL LHAPI YF GGP+DIAY N D+ LPA Sbjct: 154 VYEIADDPRITTTVLWNSGLLSDRDNHLLTHLHAPIAYFTGGPSDIAYANAVDDWGRLPA 213 Query: 235 NIPAIKANLDAGHFATYSNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGW 56 +PA +LD GH+ T+ PN G++G+ V + +WQL+G+ A+ QF+ P T W Sbjct: 214 GLPAFMGHLDVGHYGTFGQPNGGEYGRVGVQWLKWQLKGDQNARQQFVGPHCGLCTHPEW 273 Query: 55 DIVSK 41 D+ K Sbjct: 274 DVAQK 278 >ref|YP_004084218.1| cellulose-binding family II [Micromonospora sp. L5] gi|503241686|ref|WP_013476347.1| cellulose-binding protein II [Micromonospora sp. L5] gi|315411950|gb|ADU10067.1| cellulose-binding family II [Micromonospora sp. L5] Length = 455 Score = 217 bits (553), Expect = 6e-54 Identities = 113/242 (46%), Positives = 147/242 (60%), Gaps = 3/242 (1%) Frame = -1 Query: 757 PYKSQLSDVPGLAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLI 578 PY + L HT++ P P+++ +P +VWGNGGCS +G N L EIASHGFL Sbjct: 39 PYPADYETSSSLPNHTIFRPQTLPSER-LPVLVWGNGGCSANGLSQGNFLREIASHGFLA 97 Query: 577 IANGASNGGKSSQTTGKDLPDAIDWIT---SNAGKGKFANVDKTKIAAAGQSCGGIQAYS 407 IANGA NG S T + L +IDW S G + +D TKIA AG SCGG++AY+ Sbjct: 98 IANGAPNG--SGSTNAQMLTQSIDWAVAENSRPGSRYYNRIDITKIAVAGFSCGGLEAYA 155 Query: 406 ASVDPRVVATGIFNSGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPANIP 227 S DPRV TGIF+SGLLN + L+ L PI YF+GGP+DIAY N D+ LPA +P Sbjct: 156 VSNDPRVTTTGIFSSGLLNDADDYQLRRLTKPIAYFVGGPSDIAYPNAMDDWGKLPAGLP 215 Query: 226 AIKANLDAGHFATYSNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGWDIV 47 A NL+ GH TY PN G+FG+ AVL+ +W+L+G+ A A F+ P L + W + Sbjct: 216 AFMGNLNVGHGGTYDQPNGGEFGRVAVLYLKWRLKGDVGAGANFVGP-DCGLCRSQWSVQ 274 Query: 46 SK 41 K Sbjct: 275 QK 276 >gb|EWM64219.1| cellulose-binding protein [Micromonospora sp. M42] Length = 410 Score = 216 bits (551), Expect = 9e-54 Identities = 113/242 (46%), Positives = 148/242 (61%), Gaps = 3/242 (1%) Frame = -1 Query: 757 PYKSQLSDVPGLAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLI 578 PY + LA HT++ P P+++ +P +VWGNG CS +G N L EIASHGFL Sbjct: 39 PYPADYETSSTLANHTIFRPQTLPSER-LPILVWGNGACSANGLSQGNFLREIASHGFLA 97 Query: 577 IANGASNGGKSSQTTGKDLPDAIDWIT---SNAGKGKFANVDKTKIAAAGQSCGGIQAYS 407 IANGA NG S T + L +IDW S +G F +D TKIA AG SCGG++AY+ Sbjct: 98 IANGAPNG--SGSTNAQMLTQSIDWAVAENSRSGSKYFNRLDTTKIAVAGFSCGGVEAYA 155 Query: 406 ASVDPRVVATGIFNSGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPANIP 227 S DPRV TGIF+SGLLN + L+ L PI YF+GGP+DIAY N D+ LP+ +P Sbjct: 156 VSNDPRVTTTGIFSSGLLNDADDYQLRRLTKPIAYFIGGPSDIAYPNAMDDWGKLPSGLP 215 Query: 226 AIKANLDAGHFATYSNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGWDIV 47 A NL+ GH TY PN G+FG+ AVL+ +W+L+G+ A A F+ P L + W + Sbjct: 216 AFMGNLNVGHGGTYDQPNGGEFGRVAVLYLKWRLKGDVGAGANFVGP-DCGLCRTQWTVQ 274 Query: 46 SK 41 K Sbjct: 275 QK 276 >gb|EUC48056.1| hypothetical protein COCMIDRAFT_3017 [Bipolaris oryzae ATCC 44560] Length = 270 Score = 213 bits (543), Expect = 8e-53 Identities = 112/229 (48%), Positives = 145/229 (63%) Frame = -1 Query: 727 GLAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLIIANGASNGGK 548 GL HT+Y P PT+ K P +VWGNG CS DG S LL +A++GFL I+ G NGG Sbjct: 40 GLPGHTIYLPAGNPTNGKYPVLVWGNGACSTDGRSNSALLRNMAANGFLAISEGGLNGGG 99 Query: 547 SSQTTGKDLPDAIDWITSNAGKGKFANVDKTKIAAAGQSCGGIQAYSASVDPRVVATGIF 368 SS T + + AIDWIT AG G++ANVD ++I AAG SCGG+QA DPRV GI Sbjct: 100 SS--TAQTMKAAIDWITKTAGTGRYANVDASRIMAAGFSCGGVQAADNLNDPRVDTVGII 157 Query: 367 NSGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPANIPAIKANLDAGHFAT 188 +SGLL+ NT K+ P+ + LGG DIAY+NGERD++NLPA P+ K N+ GH T Sbjct: 158 SSGLLS--NTDAAKSWKKPVLFVLGGTGDIAYQNGERDYRNLPAGTPSWKGNIPVGHGGT 215 Query: 187 YSNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGWDIVSK 41 + N GKFG+A + + +W L+G+ TAK F S+ DGW + SK Sbjct: 216 LGDANGGKFGRAILNWAKWTLKGDQTAKQWF----STGYQADGWQVQSK 260 >gb|EUC34978.1| hypothetical protein COCCADRAFT_35432 [Bipolaris zeicola 26-R-13] Length = 270 Score = 210 bits (534), Expect = 9e-52 Identities = 110/229 (48%), Positives = 143/229 (62%) Frame = -1 Query: 727 GLAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLIIANGASNGGK 548 GL HT+Y P PT+ K P +VWGNG CS DG S LL +A++GFL I+ G NGG Sbjct: 40 GLPGHTIYLPAGNPTNGKYPVLVWGNGACSTDGRSNSALLRNMAANGFLAISEGGLNGGG 99 Query: 547 SSQTTGKDLPDAIDWITSNAGKGKFANVDKTKIAAAGQSCGGIQAYSASVDPRVVATGIF 368 SS T + + AIDWIT AG G++ANVD ++I AAG SCGG+QA DPRV GI Sbjct: 100 SS--TAQTMKAAIDWITKTAGTGRYANVDASRIMAAGFSCGGVQAADNINDPRVDTVGII 157 Query: 367 NSGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPANIPAIKANLDAGHFAT 188 +SGLL+ NT K+ P+ + LGG DIAY+NGERD++NLPA P+ K N+ GH T Sbjct: 158 SSGLLS--NTDAAKSWKKPVLFVLGGTGDIAYQNGERDYRNLPAGTPSWKGNIPVGHGGT 215 Query: 187 YSNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGWDIVSK 41 + N GKFG+A + + +W L+G+ TA F + DGW + SK Sbjct: 216 LGDANGGKFGRAILNWAKWSLKGDQTAAQWF----KNGYQADGWQVQSK 260 >gb|EUN27976.1| hypothetical protein COCVIDRAFT_96993 [Bipolaris victoriae FI3] Length = 270 Score = 209 bits (533), Expect = 1e-51 Identities = 110/229 (48%), Positives = 143/229 (62%) Frame = -1 Query: 727 GLAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLIIANGASNGGK 548 GL HT+Y P PT+ K P +VWGNG CS DG S LL +A++GFL I+ G NGG Sbjct: 40 GLPGHTIYLPAGNPTNGKYPVLVWGNGACSTDGRSNSALLRNMAANGFLAISEGGLNGGG 99 Query: 547 SSQTTGKDLPDAIDWITSNAGKGKFANVDKTKIAAAGQSCGGIQAYSASVDPRVVATGIF 368 SS T + + AIDWIT AG G++ANVD ++I AAG SCGG+QA DPRV GI Sbjct: 100 SS--TAQTMKAAIDWITKTAGTGRYANVDASRIMAAGFSCGGVQAADNINDPRVDTVGII 157 Query: 367 NSGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPANIPAIKANLDAGHFAT 188 +SGLL+ NT K+ P+ + LGG DIAY+NGERD++NLPA P+ K N+ GH T Sbjct: 158 SSGLLS--NTDAAKSWKKPVLFVLGGTGDIAYQNGERDYRNLPAGTPSWKGNIPVGHGGT 215 Query: 187 YSNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGWDIVSK 41 + N GKFG+A + + +W L+G+ TA F + DGW + SK Sbjct: 216 LGDANGGKFGRAILNWAKWTLKGDQTAAQWF----KNGYQADGWQVQSK 260 >gb|EMD65577.1| hypothetical protein COCSADRAFT_139683 [Bipolaris sorokiniana ND90Pr] Length = 269 Score = 209 bits (533), Expect = 1e-51 Identities = 110/229 (48%), Positives = 143/229 (62%) Frame = -1 Query: 727 GLAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLIIANGASNGGK 548 GL HT+Y P PT+ K P +VWGNG CS DG S LL +A++GFL I+ G NGG Sbjct: 39 GLPGHTIYLPAGNPTNGKYPVLVWGNGACSTDGRSNSALLRNMAANGFLAISEGGLNGGG 98 Query: 547 SSQTTGKDLPDAIDWITSNAGKGKFANVDKTKIAAAGQSCGGIQAYSASVDPRVVATGIF 368 SS T + + AIDWIT AG G++ANVD ++I AAG SCGG+QA DPRV GI Sbjct: 99 SS--TAQTMKAAIDWITKTAGTGRYANVDASRIMAAGFSCGGVQAADNINDPRVDTVGII 156 Query: 367 NSGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPANIPAIKANLDAGHFAT 188 +SGLL+ NT K+ P+ + LGG DIAY+NGERD++NLPA P+ K N+ GH T Sbjct: 157 SSGLLS--NTDAAKSWKKPVLFVLGGTGDIAYQNGERDYRNLPAGTPSWKGNIPVGHGGT 214 Query: 187 YSNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGWDIVSK 41 + N GKFG+A + + +W L+G+ TA F + DGW + SK Sbjct: 215 LGDANGGKFGRAILNWAKWTLKGDQTAAQWF----RNGYQSDGWQVQSK 259 >ref|YP_007951265.1| cellulose-binding family II [Actinoplanes sp. N902-109] gi|505434241|ref|WP_015621343.1| cellulose-binding family II [Actinoplanes sp. N902-109] gi|492005465|gb|AGL16785.1| cellulose-binding family II [Actinoplanes sp. N902-109] Length = 384 Score = 209 bits (532), Expect = 2e-51 Identities = 113/242 (46%), Positives = 146/242 (60%), Gaps = 3/242 (1%) Frame = -1 Query: 757 PYKSQLSDVPGLAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLI 578 PY + L HT+Y P P+ ++P +VWGNGGC+ +G N L EIASHGFL+ Sbjct: 33 PYPADYETSATLTNHTIYRPQTLPS-ARMPLMVWGNGGCAANGTGQINFLREIASHGFLV 91 Query: 577 IANGASNGGKSSQTTGKDLPDAIDW-ITSNAGKGK--FANVDKTKIAAAGQSCGGIQAYS 407 IANGA NG S TT + L +IDW + NA G + VD +K+A AG SCGG++AY+ Sbjct: 92 IANGAPNG--SGSTTSQMLTQSIDWAVAENARTGSKYYGKVDTSKVAVAGWSCGGLEAYA 149 Query: 406 ASVDPRVVATGIFNSGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPANIP 227 S DPRV T IF+SGLLN + L L PIGYF+GG +DIAY N D+ LPA +P Sbjct: 150 VSNDPRVTTTMIFSSGLLNDADDYQLARLTKPIGYFIGGTSDIAYPNAMDDWGKLPAGLP 209 Query: 226 AIKANLDAGHFATYSNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGWDIV 47 A NL+ GH TY N G+FG+ AVL+ +WQL G+ TA A F+ + L W + Sbjct: 210 AFMGNLNVGHGGTYDQVNGGEFGRVAVLWLKWQLLGDTTAGAAFV-GSDCGLCHSQWQVQ 268 Query: 46 SK 41 K Sbjct: 269 QK 270 >gb|EMD89809.1| hypothetical protein COCHEDRAFT_1177789 [Bipolaris maydis C5] gi|477592909|gb|ENI09979.1| hypothetical protein COCC4DRAFT_36337 [Bipolaris maydis ATCC 48331] Length = 270 Score = 207 bits (527), Expect = 6e-51 Identities = 109/229 (47%), Positives = 142/229 (62%) Frame = -1 Query: 727 GLAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLIIANGASNGGK 548 GL HT+Y P PT+ K P +VWGNG CS DG S LL +A++GFL I+ G NGG Sbjct: 40 GLPGHTIYLPAGNPTNGKYPVLVWGNGACSTDGRSNSALLRNMAANGFLAISEGGLNGGG 99 Query: 547 SSQTTGKDLPDAIDWITSNAGKGKFANVDKTKIAAAGQSCGGIQAYSASVDPRVVATGIF 368 SS + + AIDWIT AG G++ANVD ++I AAG SCGG++A DPRV GI Sbjct: 100 SSNA--QTMKAAIDWITKTAGTGRYANVDASRIMAAGFSCGGVEAADNLNDPRVDTVGII 157 Query: 367 NSGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPANIPAIKANLDAGHFAT 188 +SGLL+ NT K+ P+ + LGG DIAY NGERD++NLPA P+ K N+ GH T Sbjct: 158 SSGLLS--NTDAAKSWRKPVLFVLGGTGDIAYPNGERDYRNLPAGTPSWKGNIPVGHGGT 215 Query: 187 YSNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGWDIVSK 41 + N GKFG+A + + +W L+G+ TA F S+ DGW + SK Sbjct: 216 LGDANGGKFGRAILNWAKWTLKGDQTAAQWF----SNGYQADGWQVQSK 260 >ref|XP_001937624.1| conserved hypothetical protein [Pyrenophora tritici-repentis Pt-1C-BFP] gi|187984723|gb|EDU50211.1| conserved hypothetical protein [Pyrenophora tritici-repentis Pt-1C-BFP] Length = 274 Score = 207 bits (526), Expect = 7e-51 Identities = 109/225 (48%), Positives = 137/225 (60%) Frame = -1 Query: 724 LAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLIIANGASNGGKS 545 LA HT+Y+P KIP +VWGNG CS DG S LL +IASHGFL I+ G+ NGG S Sbjct: 47 LAGHTIYYPTKSTGSTKIPVLVWGNGACSTDGKSNSALLQQIASHGFLAISEGSPNGGGS 106 Query: 544 SQTTGKDLPDAIDWITSNAGKGKFANVDKTKIAAAGQSCGGIQAYSASVDPRVVATGIFN 365 S + AIDW+T AG G +ANVD +KI AAG SCGG+QA DPRV G+ + Sbjct: 107 SSAA--TMKAAIDWVTKVAGTGAYANVDASKIMAAGFSCGGVQAMDNINDPRVDTIGVVS 164 Query: 364 SGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPANIPAIKANLDAGHFATY 185 SGLL+ NT K+ P+ + +GG DIAY NGERDFKNLPA P+ K N+ GH T Sbjct: 165 SGLLS--NTNAAKSWKKPVLFVMGGSGDIAYNNGERDFKNLPAGTPSWKGNIPVGHGGTL 222 Query: 184 SNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGWDI 50 + N GKFGKA + + W ++G+ A F +S DGW + Sbjct: 223 GDANGGKFGKAILNWMLWTMKGDQAAAQYF----TSGYQADGWQV 263 >ref|XP_003658813.1| hypothetical protein MYCTH_2295076 [Myceliophthora thermophila ATCC 42464] gi|347006080|gb|AEO53568.1| hypothetical protein MYCTH_2295076 [Myceliophthora thermophila ATCC 42464] Length = 275 Score = 206 bits (524), Expect = 1e-50 Identities = 106/226 (46%), Positives = 144/226 (63%) Frame = -1 Query: 727 GLAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLIIANGASNGGK 548 GL H+ Y P + P+D K+P ++WGNGGCS D + L E+ASHG L+IA+G G Sbjct: 43 GLEGHSFYAPQNIPSDAKLPVMLWGNGGCSADATGQAPFLTELASHGVLVIASGTPGNGG 102 Query: 547 SSQTTGKDLPDAIDWITSNAGKGKFANVDKTKIAAAGQSCGGIQAYSASVDPRVVATGIF 368 S TT + +ID+ITSNAG+G++AN+D ++I AAG SCGGI+AY+ D RV + GI+ Sbjct: 103 S--TTADMMTQSIDFITSNAGQGEWANIDASRITAAGWSCGGIEAYAQIWDDRVQSIGIW 160 Query: 367 NSGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPANIPAIKANLDAGHFAT 188 +SGLL DN P+ +FLGGP DI Y NGERD+ +PA +P K NLD GH T Sbjct: 161 SSGLL--DNHMAANDFTKPVFFFLGGPCDIPYGNGERDYAAMPAGMPKWKGNLDVGHGGT 218 Query: 187 YSNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGWDI 50 Y+ PN GKFG + +W ++G+A+A A +L T DGW + Sbjct: 219 YTEPNRGKFGVIGGYWVEWIMRGDASA-ADYL--TGDGAKNDGWSV 261 >ref|WP_019608417.1| hypothetical protein [Nocardiopsis sp. CNS639] Length = 281 Score = 205 bits (522), Expect = 2e-50 Identities = 98/242 (40%), Positives = 142/242 (58%), Gaps = 3/242 (1%) Frame = -1 Query: 757 PYKSQLSDVPGLAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLI 578 PY ++ P L HT+Y P+ P +++P + WGNG C DG WF N+L E ASHGFL+ Sbjct: 40 PYDAEYVTTPRLRDHTVYRPVDLPEGERLPIVAWGNGACRADGTWFENILTEFASHGFLV 99 Query: 577 IANGASNGGKSSQTTGKDLPDAIDWITSNAGK---GKFANVDKTKIAAAGQSCGGIQAYS 407 IA+G G S T + L +A+DW + ++D +A GQSCGG++ Y Sbjct: 100 IASGRPGGFGS--TDPEMLTEAVDWAIEENDRLFSEYRGHIDTDSVAIMGQSCGGLETYE 157 Query: 406 ASVDPRVVATGIFNSGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPANIP 227 + DPRV T ++NSGLL + LL+ LHAP+ YF GGP DIAYEN D+ LP +P Sbjct: 158 VADDPRVDTTVLWNSGLLTDRDNDLLEDLHAPVAYFTGGPGDIAYENALDDYARLPQGLP 217 Query: 226 AIKANLDAGHFATYSNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGWDIV 47 A +LD GH+ T++ P+ G++G+ V + +W+L+G+ A+ +F+ L WD Sbjct: 218 AFIGHLDVGHYGTFAEPDGGEYGRVGVAWLEWRLKGDQRARQEFV-GDDCGLCSTEWDTD 276 Query: 46 SK 41 SK Sbjct: 277 SK 278 >ref|YP_003680621.1| hypothetical protein Ndas_2700 [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|502918746|ref|WP_013153722.1| hypothetical protein [Nocardiopsis dassonvillei] gi|296846095|gb|ADH68115.1| conserved hypothetical protein [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 281 Score = 205 bits (521), Expect = 3e-50 Identities = 97/242 (40%), Positives = 143/242 (59%), Gaps = 3/242 (1%) Frame = -1 Query: 757 PYKSQLSDVPGLAKHTLYFPIHPPTDKKIPAIVWGNGGCSGDGAWFSNLLNEIASHGFLI 578 PY ++ P L HT+Y P+ P +++P + WGNG C DG WF N+L E ASHGFL+ Sbjct: 40 PYDAEYVTTPRLRDHTVYRPVDLPEGERLPIVAWGNGACRADGTWFENILTEFASHGFLV 99 Query: 577 IANGASNGGKSSQTTGKDLPDAIDWITSNAGK---GKFANVDKTKIAAAGQSCGGIQAYS 407 IA+G G S++ + L +A+DW + ++D +A GQSCGG++ Y Sbjct: 100 IASGRPGGFGSTEP--EMLTEAVDWAIEENDRLFSEYRGHIDTDSVAIMGQSCGGLETYE 157 Query: 406 ASVDPRVVATGIFNSGLLNPDNTPLLKTLHAPIGYFLGGPTDIAYENGERDFKNLPANIP 227 + DPRV T ++NSGLL + LL+ LHAP+ YF GGP DIAYEN D+ LP +P Sbjct: 158 VADDPRVDTTVLWNSGLLTDRDNDLLEDLHAPVAYFTGGPGDIAYENALDDYGRLPRGLP 217 Query: 226 AIKANLDAGHFATYSNPNAGKFGKAAVLFFQWQLQGNATAKAQFLQPTSSSLTKDGWDIV 47 A +LD GH+ T++ P+ G++G+ V + +W+L+G+ A+ +F+ L WD Sbjct: 218 AFIGHLDVGHYGTFAEPDGGEYGRVGVAWLEWRLKGDQRARREFV-GDDCGLCSTEWDTD 276 Query: 46 SK 41 SK Sbjct: 277 SK 278