BLASTX nr result
ID: Cheilocostus21_contig00000811
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cheilocostus21_contig00000811 (632 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|ABI99685.1| hypothetical protein APECO1_1785 [Escherichia col... 157 8e-52 gb|EDU61869.1| hypothetical protein PROSTU_00109 [Providencia st... 154 1e-51 gb|ABJ03470.1| conserved hypothetical protein [Escherichia coli ... 157 5e-50 dbj|GAL60547.1| hypothetical protein EV102420_46_00003 [Pseudesc... 159 6e-47 emb|CBA31922.1| hypothetical protein Csp_D29540 [Curvibacter put... 159 2e-46 emb|CCI74358.1| unnamed protein product [Klebsiella pneumoniae s... 157 5e-46 emb|CRH31471.1| Klebsiella pneumoniae subsp. rhinoscleromatis st... 155 1e-45 gb|EEU95451.1| hypothetical protein FAEPRAA2165_02967, partial [... 158 3e-44 gb|EFQ08391.1| hypothetical protein HMPREF9436_00081, partial [F... 156 4e-44 gb|EDP22261.1| hypothetical protein FAEPRAM212_00886 [Faecalibac... 156 9e-44 gb|EDP21864.1| hypothetical protein FAEPRAM212_01694 [Faecalibac... 156 9e-44 gb|EDP19791.1| hypothetical protein FAEPRAM212_02572 [Faecalibac... 156 1e-43 gb|EDS10617.1| hypothetical protein ANACOL_02699 [Anaerotruncus ... 152 2e-43 gb|EDS13040.1| hypothetical protein ANACOL_00227 [Anaerotruncus ... 152 3e-43 gb|KMS64810.1| hypothetical protein BVRB_042430, partial [Beta v... 145 2e-41 emb|CDW61111.1| hypothetical protein TTRE_0000953901 [Trichuris ... 145 1e-39 emb|SAC97892.1| Uncharacterised protein [Enterobacter cloacae] >... 136 2e-38 gb|ABE08377.1| hypothetical protein UTI89_C2919 [Escherichia col... 136 2e-38 gb|ABE09158.1| hypothetical protein UTI89_C3718 [Escherichia col... 136 3e-38 gb|ACO04175.1| conserved hypothetical protein [Persephonella mar... 138 1e-37 >gb|ABI99685.1| hypothetical protein APECO1_1785 [Escherichia coli APEC O1] Length = 187 Score = 157 bits (396), Expect(2) = 8e-52 Identities = 79/86 (91%), Positives = 80/86 (93%) Frame = -1 Query: 473 QSLPPILHRQAQSSV*SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQRGHRYAIRAGRN 294 QSLPPILH +AQ SV SYSKGSRGLSVLPRVHCIFTA SISLSLGWRQ GH YAIRAGRN Sbjct: 99 QSLPPILHIKAQCSVSSYSKGSRGLSVLPRVHCIFTASSISLSLGWRQPGHHYAIRAGRN 158 Query: 293 LPDKEFRYLRTVIVTAAVYRGFDQEL 216 LPDKEFRYLRTVIVTAAVYRGFDQEL Sbjct: 159 LPDKEFRYLRTVIVTAAVYRGFDQEL 184 Score = 75.5 bits (184), Expect(2) = 8e-52 Identities = 35/39 (89%), Positives = 35/39 (89%) Frame = -3 Query: 579 PPDTVRNPDYGSRLEHQTLKGGISRSAPCRLASTLSKPP 463 PPDTVRNPDYGS LEHQTLKGGISRSAPCRLASTL P Sbjct: 64 PPDTVRNPDYGSTLEHQTLKGGISRSAPCRLASTLQSLP 102 >gb|EDU61869.1| hypothetical protein PROSTU_00109 [Providencia stuartii ATCC 25827] Length = 189 Score = 154 bits (388), Expect(2) = 1e-51 Identities = 78/86 (90%), Positives = 79/86 (91%) Frame = -1 Query: 473 QSLPPILHRQAQSSV*SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQRGHRYAIRAGRN 294 QSLPPILH +AQ SV SYSKGSRGLSVLPRVHCIFTA SISLSLGWRQ GH YAIRAGRN Sbjct: 96 QSLPPILHIKAQCSVSSYSKGSRGLSVLPRVHCIFTASSISLSLGWRQPGHHYAIRAGRN 155 Query: 293 LPDKEFRYLRTVIVTAAVYRGFDQEL 216 LPDKEFRYLRTVIVTAAVY GFDQEL Sbjct: 156 LPDKEFRYLRTVIVTAAVYWGFDQEL 181 Score = 78.2 bits (191), Expect(2) = 1e-51 Identities = 36/43 (83%), Positives = 37/43 (86%) Frame = -3 Query: 591 QSNYPPDTVRNPDYGSRLEHQTLKGGISRSAPCRLASTLSKPP 463 QSNYPPDTVR PDYG+ LEHQTLKGGISR APCRLASTL P Sbjct: 57 QSNYPPDTVRTPDYGATLEHQTLKGGISRLAPCRLASTLQSLP 99 >gb|ABJ03470.1| conserved hypothetical protein [Escherichia coli APEC O1] Length = 217 Score = 157 bits (396), Expect(2) = 5e-50 Identities = 79/86 (91%), Positives = 80/86 (93%) Frame = -1 Query: 473 QSLPPILHRQAQSSV*SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQRGHRYAIRAGRN 294 QSLPPILH +AQ SV SYSKGSRGLSVLPRVHCIFTA SISLSLGWRQ GH YAIRAGRN Sbjct: 124 QSLPPILHIKAQCSVSSYSKGSRGLSVLPRVHCIFTASSISLSLGWRQPGHHYAIRAGRN 183 Query: 293 LPDKEFRYLRTVIVTAAVYRGFDQEL 216 LPDKEFRYLRTVIVTAAVYRGFDQEL Sbjct: 184 LPDKEFRYLRTVIVTAAVYRGFDQEL 209 Score = 69.3 bits (168), Expect(2) = 5e-50 Identities = 35/44 (79%), Positives = 35/44 (79%), Gaps = 1/44 (2%) Frame = -3 Query: 591 QSNYPPDTVRNPDY-GSRLEHQTLKGGISRSAPCRLASTLSKPP 463 QSNYPPDTVR P G LEHQTLKGGISRSAPCRLASTL P Sbjct: 84 QSNYPPDTVRKPGITGPTLEHQTLKGGISRSAPCRLASTLQSLP 127 >dbj|GAL60547.1| hypothetical protein EV102420_46_00003 [Pseudescherichia vulneris NBRC 102420] Length = 104 Score = 159 bits (402), Expect = 6e-47 Identities = 80/86 (93%), Positives = 81/86 (94%) Frame = -1 Query: 473 QSLPPILHRQAQSSV*SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQRGHRYAIRAGRN 294 QSLPPILH +AQ SV SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQ GH YAIRAGRN Sbjct: 11 QSLPPILHIKAQCSVSSYSKGSRGLSVLPRVHCIFTAISISLSLGWRQPGHHYAIRAGRN 70 Query: 293 LPDKEFRYLRTVIVTAAVYRGFDQEL 216 LPDKEFRYLRTVIVTAAVYRGFDQEL Sbjct: 71 LPDKEFRYLRTVIVTAAVYRGFDQEL 96 >emb|CBA31922.1| hypothetical protein Csp_D29540 [Curvibacter putative symbiont of Hydra magnipapillata] Length = 156 Score = 159 bits (403), Expect = 2e-46 Identities = 84/112 (75%), Positives = 90/112 (80%) Frame = -1 Query: 458 ILHRQAQSSV*SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQRGHRYAIRAGRNLPDKE 279 +LHR QS + SYSKGS GLSV PR CI T IS SLSLG RQ GHRYAIRAGRNLPDKE Sbjct: 1 MLHRSVQSPIQSYSKGSWGLSVFPRGDCIITNISTSLSLGRRQCGHRYAIRAGRNLPDKE 60 Query: 278 FRYLRTVIVTAAVYRGFDQELAPLLLTFRHRAGVRPYTSSCDFAESCVFDKQ 123 FRYLRTVIVTAAVY F+QELAP L F+HRAGV PYTS+ FAE CVF+KQ Sbjct: 61 FRYLRTVIVTAAVYWDFNQELAPHHLIFQHRAGVTPYTSTFVFAECCVFNKQ 112 >emb|CCI74358.1| unnamed protein product [Klebsiella pneumoniae subsp. rhinoscleromatis SB3432] emb|CCI75589.1| unnamed protein product [Klebsiella pneumoniae subsp. rhinoscleromatis SB3432] Length = 104 Score = 157 bits (396), Expect = 5e-46 Identities = 79/86 (91%), Positives = 80/86 (93%) Frame = -1 Query: 473 QSLPPILHRQAQSSV*SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQRGHRYAIRAGRN 294 QSLPPILH +AQ SV SYSKGSRGLSVLPRVHCIFTA SISLSLGWRQ GH YAIRAGRN Sbjct: 11 QSLPPILHIKAQCSVSSYSKGSRGLSVLPRVHCIFTASSISLSLGWRQPGHHYAIRAGRN 70 Query: 293 LPDKEFRYLRTVIVTAAVYRGFDQEL 216 LPDKEFRYLRTVIVTAAVYRGFDQEL Sbjct: 71 LPDKEFRYLRTVIVTAAVYRGFDQEL 96 >emb|CRH31471.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome {ECO:0000313|EMBL:CCI75589.1} [Pantoea ananatis] emb|CRH31335.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome {ECO:0000313|EMBL:CCI75589.1} [Pantoea ananatis] emb|CRH31052.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome {ECO:0000313|EMBL:CCI75589.1} [Pantoea ananatis] emb|CRH30484.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome {ECO:0000313|EMBL:CCI75589.1} [Pantoea ananatis] emb|CRH28276.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome {ECO:0000313|EMBL:CCI75589.1} [Pantoea ananatis] emb|CRH27688.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome {ECO:0000313|EMBL:CCI75589.1} [Pantoea ananatis] emb|CRH27640.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome {ECO:0000313|EMBL:CCI75589.1} [Pantoea ananatis] emb|CRH40527.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome {ECO:0000313|EMBL:CCI75589.1} [Pantoea ananatis] emb|CRH40389.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome {ECO:0000313|EMBL:CCI75589.1} [Pantoea ananatis] emb|CRH40109.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome {ECO:0000313|EMBL:CCI75589.1} [Pantoea ananatis] emb|CRH39519.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome {ECO:0000313|EMBL:CCI75589.1} [Pantoea ananatis] emb|CRH36768.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome {ECO:0000313|EMBL:CCI75589.1} [Pantoea ananatis] emb|CRH35537.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome {ECO:0000313|EMBL:CCI75589.1} [Pantoea ananatis] emb|CRH35405.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome {ECO:0000313|EMBL:CCI75589.1} [Pantoea ananatis] emb|CRH37029.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome [Pantoea ananatis] emb|CRH36773.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome [Pantoea ananatis] emb|CRH36178.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome [Pantoea ananatis] emb|CRH34996.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome [Pantoea ananatis] emb|CRH32691.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome [Pantoea ananatis] emb|CRH32063.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome [Pantoea ananatis] emb|CRH32016.1| Klebsiella pneumoniae subsp. rhinoscleromatis strain SB3432, complete genome [Pantoea ananatis] Length = 99 Score = 155 bits (393), Expect = 1e-45 Identities = 79/86 (91%), Positives = 80/86 (93%) Frame = -1 Query: 473 QSLPPILHRQAQSSV*SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQRGHRYAIRAGRN 294 QSLPPILH +AQ SV S SKGSRGLSVLPRVHCIFTAISISLSLGWRQ GH YAIRAGRN Sbjct: 6 QSLPPILHIKAQCSVSSCSKGSRGLSVLPRVHCIFTAISISLSLGWRQPGHHYAIRAGRN 65 Query: 293 LPDKEFRYLRTVIVTAAVYRGFDQEL 216 LPDKEFRYLRTVIVTAAVYRGFDQEL Sbjct: 66 LPDKEFRYLRTVIVTAAVYRGFDQEL 91 >gb|EEU95451.1| hypothetical protein FAEPRAA2165_02967, partial [Faecalibacterium prausnitzii A2-165] Length = 288 Score = 158 bits (400), Expect = 3e-44 Identities = 100/201 (49%), Positives = 118/201 (58%) Frame = -1 Query: 620 LRYSLGGDRPSQTTHQTLSATRITGLG*NIKH*RVVFQGRLHADWRPHFQSLPPILHRQA 441 LRYSLGGDRPSQT H T+S I ++ + +P F +P IL Q Sbjct: 34 LRYSLGGDRPSQTAHLTMSPDSIQSRRLEFQYRKDGIPTASPPKPKPWFPRVPSILCMQH 93 Query: 440 QSSV*SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQRGHRYAIRAGRNLPDKEFRYLRT 261 ++ + YSK GLSVL RV IFT +IS RQ + YA AG+NLPDKEFRYLRT Sbjct: 94 RNPILGYSKAPWGLSVLSRVTGIFTGTTISPGGLLRQCPNHYAFHAGQNLPDKEFRYLRT 153 Query: 260 VIVTAAVYRGFDQELAPLLLTFRHRAGVRPYTSSCDFAESCVFDKQXXXXXXXXXXXLAR 81 VIVTAAV+ GFD LA LLLTF+HRAGV YTSS D A++CVF KQ A Sbjct: 154 VIVTAAVHWGFDSMLAHLLLTFQHRAGVSSYTSSFDLAQTCVFGKQLLGPILCGSISGAP 213 Query: 80 AMGLLIPKLRR*IAEFLQHSS 18 L+PKLR AEFL + S Sbjct: 214 ----LLPKLRGQFAEFLNNPS 230 >gb|EFQ08391.1| hypothetical protein HMPREF9436_00081, partial [Faecalibacterium cf. prausnitzii KLE1255] Length = 242 Score = 156 bits (395), Expect = 4e-44 Identities = 107/214 (50%), Positives = 123/214 (57%), Gaps = 9/214 (4%) Frame = -1 Query: 632 TFVLLRYSLGGDRPSQTTHQTLSATRITGLG*NIKH*RVVFQGRLHADWRPH-------- 477 TF LRYSLGGDRPSQT H T+S I + R+ FQ R D P Sbjct: 9 TFERLRYSLGGDRPSQTAHLTMSPALI-------QRRRLEFQYR--KDGIPTATPQAPKH 59 Query: 476 -FQSLPPILHRQAQSSV*SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQRGHRYAIRAG 300 S+P IL Q ++ + YSK GLSVL RV IFT +IS RQ + YA AG Sbjct: 60 LLPSVPSILCMQHRNPILGYSKAPWGLSVLSRVTGIFTGTTISPGGLLRQCPNHYAFHAG 119 Query: 299 RNLPDKEFRYLRTVIVTAAVYRGFDQELAPLLLTFRHRAGVRPYTSSCDFAESCVFDKQX 120 +NLPDKEFRYLRTVIVTAAV+ GFD LA LLLTF+HRAGV YTSS D A++CVF KQ Sbjct: 120 QNLPDKEFRYLRTVIVTAAVHWGFDSMLAHLLLTFQHRAGVSSYTSSFDLAQTCVFGKQL 179 Query: 119 XXXXXXXXXXLARAMGLLIPKLRR*IAEFLQHSS 18 A L+PKLR AEFL + S Sbjct: 180 LGPILCGCI----AAAPLLPKLRGQFAEFLNNPS 209 >gb|EDP22261.1| hypothetical protein FAEPRAM212_00886 [Faecalibacterium prausnitzii M21/2] gb|EDP22759.1| hypothetical protein FAEPRAM212_00540 [Faecalibacterium prausnitzii M21/2] Length = 267 Score = 156 bits (395), Expect = 9e-44 Identities = 106/212 (50%), Positives = 122/212 (57%), Gaps = 7/212 (3%) Frame = -1 Query: 632 TFVLLRYSLGGDRPSQTTHQTLSATRITGLG*NIKH*RVVFQGRLHA------DWRPH-F 474 TF LRYSLGGDRPSQT H T+S I + R+ FQ R H F Sbjct: 9 TFERLRYSLGGDRPSQTAHLTMSPALI-------QRRRLEFQYRKDGIPTATPQMPKHLF 61 Query: 473 QSLPPILHRQAQSSV*SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQRGHRYAIRAGRN 294 +P IL Q ++ + YSK GLSVL RV IFT +IS RQ + YA AG+N Sbjct: 62 PCVPSILCMQHRNPILGYSKAPWGLSVLSRVTGIFTGTTISPGGLSRQCPNHYAFHAGQN 121 Query: 293 LPDKEFRYLRTVIVTAAVYRGFDQELAPLLLTFRHRAGVRPYTSSCDFAESCVFDKQXXX 114 LPDKEFRYLRTVIVTAAV+ GFD LA LLLTF+HRAGV YTSS D A++CVF KQ Sbjct: 122 LPDKEFRYLRTVIVTAAVHWGFDSMLAHLLLTFQHRAGVSSYTSSFDLAQTCVFGKQLLG 181 Query: 113 XXXXXXXXLARAMGLLIPKLRR*IAEFLQHSS 18 A L+PKLR AEFL + S Sbjct: 182 PILCGSISGAP----LLPKLRGQFAEFLNNPS 209 >gb|EDP21864.1| hypothetical protein FAEPRAM212_01694 [Faecalibacterium prausnitzii M21/2] Length = 267 Score = 156 bits (395), Expect = 9e-44 Identities = 106/212 (50%), Positives = 122/212 (57%), Gaps = 7/212 (3%) Frame = -1 Query: 632 TFVLLRYSLGGDRPSQTTHQTLSATRITGLG*NIKH*RVVFQGRLHA------DWRPH-F 474 TF LRYSLGGDRPSQT H T+S I + R+ FQ R H F Sbjct: 9 TFERLRYSLGGDRPSQTAHLTMSPALI-------QRRRLEFQYRKDGIPTATPQMPKHLF 61 Query: 473 QSLPPILHRQAQSSV*SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQRGHRYAIRAGRN 294 +P IL Q ++ + YSK GLSVL RV IFT +IS RQ + YA AG+N Sbjct: 62 PCVPSILCMQHRNPILGYSKAPWGLSVLSRVTGIFTGTTISPGGLSRQCPNHYAFHAGQN 121 Query: 293 LPDKEFRYLRTVIVTAAVYRGFDQELAPLLLTFRHRAGVRPYTSSCDFAESCVFDKQXXX 114 LPDKEFRYLRTVIVTAAV+ GFD LA LLLTF+HRAGV YTSS D A++CVF KQ Sbjct: 122 LPDKEFRYLRTVIVTAAVHWGFDSMLAHLLLTFQHRAGVSSYTSSFDLAQTCVFGKQLLG 181 Query: 113 XXXXXXXXLARAMGLLIPKLRR*IAEFLQHSS 18 A L+PKLR AEFL + S Sbjct: 182 PILCGSISGAP----LLPKLRGQFAEFLNNPS 209 >gb|EDP19791.1| hypothetical protein FAEPRAM212_02572 [Faecalibacterium prausnitzii M21/2] gb|EDP22128.1| hypothetical protein FAEPRAM212_01164 [Faecalibacterium prausnitzii M21/2] gb|EDP22975.1| hypothetical protein FAEPRAM212_00173 [Faecalibacterium prausnitzii M21/2] Length = 267 Score = 156 bits (394), Expect = 1e-43 Identities = 106/212 (50%), Positives = 122/212 (57%), Gaps = 7/212 (3%) Frame = -1 Query: 632 TFVLLRYSLGGDRPSQTTHQTLSATRITGLG*NIKH*RVVFQGRLHA------DWRPHF- 474 TF LRYSLGGDRPSQT H T+S I + R+ FQ R H Sbjct: 9 TFERLRYSLGGDRPSQTAHLTMSPALI-------QRRRLEFQYRKDGIPTATPQMPKHLL 61 Query: 473 QSLPPILHRQAQSSV*SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQRGHRYAIRAGRN 294 S+P IL Q ++ + YSK GLSVL RV IFT +IS RQ + YA AG+N Sbjct: 62 PSVPSILCMQHRNPILGYSKAPWGLSVLSRVTGIFTGTTISPGGLSRQCPNHYAFHAGQN 121 Query: 293 LPDKEFRYLRTVIVTAAVYRGFDQELAPLLLTFRHRAGVRPYTSSCDFAESCVFDKQXXX 114 LPDKEFRYLRTVIVTAAV+ GFD LA LLLTF+HRAGV YTSS D A++CVF KQ Sbjct: 122 LPDKEFRYLRTVIVTAAVHWGFDSMLAHLLLTFQHRAGVSSYTSSFDLAQTCVFGKQLLG 181 Query: 113 XXXXXXXXLARAMGLLIPKLRR*IAEFLQHSS 18 A L+PKLR AEFL + S Sbjct: 182 PILCGSISGAP----LLPKLRGQFAEFLNNPS 209 >gb|EDS10617.1| hypothetical protein ANACOL_02699 [Anaerotruncus colihominis DSM 17241] Length = 170 Score = 152 bits (384), Expect = 2e-43 Identities = 92/162 (56%), Positives = 103/162 (63%) Frame = -1 Query: 632 TFVLLRYSLGGDRPSQTTHQTLSATRITGLG*NIKH*RVVFQGRLHADWRPHFQSLPPIL 453 TF LRY GGDRPSQT TLS RI G +H + A LPPIL Sbjct: 9 TFERLRYLFGGDRPSQTARLTLSPDRIHGRRLEFQHLKSGIPTAAPATLACCLLCLPPIL 68 Query: 452 HRQAQSSV*SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQRGHRYAIRAGRNLPDKEFR 273 + + ++ + S SK GLSVL RV IFT +IS RQ RY IRAG+NLPDKEFR Sbjct: 69 YMKYRNPILSCSKAPWGLSVLSRVTGIFTGTTISPGGLLRQCPDRYTIRAGQNLPDKEFR 128 Query: 272 YLRTVIVTAAVYRGFDQELAPLLLTFRHRAGVRPYTSSCDFA 147 YLRTVIVTAAVYRGF+ EL+ LLLTFRHRAGV PYTSS D A Sbjct: 129 YLRTVIVTAAVYRGFNSELSLLLLTFRHRAGVTPYTSSFDLA 170 >gb|EDS13040.1| hypothetical protein ANACOL_00227 [Anaerotruncus colihominis DSM 17241] Length = 170 Score = 152 bits (383), Expect = 3e-43 Identities = 92/162 (56%), Positives = 103/162 (63%) Frame = -1 Query: 632 TFVLLRYSLGGDRPSQTTHQTLSATRITGLG*NIKH*RVVFQGRLHADWRPHFQSLPPIL 453 TF LRY GGDRPSQT TLS RI G +H + A LPPIL Sbjct: 9 TFERLRYLFGGDRPSQTARLTLSPDRIHGRRLEFQHLKSGIPTPAPATLACCLLCLPPIL 68 Query: 452 HRQAQSSV*SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQRGHRYAIRAGRNLPDKEFR 273 + + ++ + S SK GLSVL RV IFT +IS RQ RY IRAG+NLPDKEFR Sbjct: 69 YMKYRNPILSCSKAPWGLSVLSRVTGIFTGTTISPGGLLRQCPDRYTIRAGQNLPDKEFR 128 Query: 272 YLRTVIVTAAVYRGFDQELAPLLLTFRHRAGVRPYTSSCDFA 147 YLRTVIVTAAVYRGF+ EL+ LLLTFRHRAGV PYTSS D A Sbjct: 129 YLRTVIVTAAVYRGFNSELSLLLLTFRHRAGVTPYTSSFDLA 170 >gb|KMS64810.1| hypothetical protein BVRB_042430, partial [Beta vulgaris subsp. vulgaris] Length = 120 Score = 145 bits (367), Expect = 2e-41 Identities = 74/84 (88%), Positives = 75/84 (89%) Frame = -1 Query: 374 IFTAISISLSLGWRQRGHRYAIRAGRNLPDKEFRYLRTVIVTAAVYRGFDQELAPLLLTF 195 IFT SISLSL WRQRG RYAIRAGRNLPDKEFRYLRTVIVTAAVYRGF+ LAPLLLTF Sbjct: 4 IFTGNSISLSLCWRQRGSRYAIRAGRNLPDKEFRYLRTVIVTAAVYRGFNSVLAPLLLTF 63 Query: 194 RHRAGVRPYTSSCDFAESCVFDKQ 123 RHRAGVRPYTSS DFAE CVF KQ Sbjct: 64 RHRAGVRPYTSSYDFAEPCVFVKQ 87 >emb|CDW61111.1| hypothetical protein TTRE_0000953901 [Trichuris trichiura] Length = 253 Score = 145 bits (366), Expect = 1e-39 Identities = 98/205 (47%), Positives = 115/205 (56%) Frame = -1 Query: 632 TFVLLRYSLGGDRPSQTTHQTLSATRITGLG*NIKH*RVVFQGRLHADWRPHFQSLPPIL 453 +F LRY LGGDRPSQT T+S +I G + + P F SLPPIL Sbjct: 9 SFERLRYFLGGDRPSQTARLTVSPDQIHGRRLETQQSKGGIPRVTPQKLTPLFLSLPPIL 68 Query: 452 HRQAQSSV*SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQRGHRYAIRAGRNLPDKEFR 273 + + S+ RV IFT +IS RQ +RYAIRAG+NLPDKEFR Sbjct: 69 YMNYRISL--------------RVTGIFTGTTISPGRLLRQCPNRYAIRAGQNLPDKEFR 114 Query: 272 YLRTVIVTAAVYRGFDQELAPLLLTFRHRAGVRPYTSSCDFAESCVFDKQXXXXXXXXXX 93 YLRTVIVTAAV+ GFD LA LLLTF+HRAGV YTSS D A++CVF KQ Sbjct: 115 YLRTVIVTAAVHWGFDSMLAHLLLTFQHRAGVSSYTSSFDLAQTCVFGKQLLGPILCGSI 174 Query: 92 XLARAMGLLIPKLRR*IAEFLQHSS 18 A L+PKLR AEFL + S Sbjct: 175 SGAP----LLPKLRGQFAEFLNNPS 195 >emb|SAC97892.1| Uncharacterised protein [Enterobacter cloacae] emb|SAD50077.1| Uncharacterised protein [Enterobacter cloacae] Length = 80 Score = 136 bits (343), Expect = 2e-38 Identities = 67/70 (95%), Positives = 67/70 (95%) Frame = -1 Query: 425 SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQRGHRYAIRAGRNLPDKEFRYLRTVIVTA 246 SYSKGSRGLSVLPRVHCIFTA SISLSLGWRQ GH YAIRAGRNLPDKEFRYLRTVIVTA Sbjct: 3 SYSKGSRGLSVLPRVHCIFTASSISLSLGWRQPGHHYAIRAGRNLPDKEFRYLRTVIVTA 62 Query: 245 AVYRGFDQEL 216 AVYRGFDQEL Sbjct: 63 AVYRGFDQEL 72 >gb|ABE08377.1| hypothetical protein UTI89_C2919 [Escherichia coli UTI89] gb|ABE09239.1| hypothetical protein UTI89_C3808 [Escherichia coli UTI89] gb|ABE09740.1| hypothetical protein TC0129 [Escherichia coli UTI89] gb|ABE09977.1| hypothetical protein UTI89_C4568 [Escherichia coli UTI89] gb|EFJ75983.1| hypothetical protein HMPREF9552_00335 [Escherichia coli MS 198-1] Length = 80 Score = 136 bits (343), Expect = 2e-38 Identities = 67/70 (95%), Positives = 67/70 (95%) Frame = -1 Query: 425 SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQRGHRYAIRAGRNLPDKEFRYLRTVIVTA 246 SYSKGSRGLSVLPRVHCIFTA SISLSLGWRQ GH YAIRAGRNLPDKEFRYLRTVIVTA Sbjct: 3 SYSKGSRGLSVLPRVHCIFTASSISLSLGWRQPGHHYAIRAGRNLPDKEFRYLRTVIVTA 62 Query: 245 AVYRGFDQEL 216 AVYRGFDQEL Sbjct: 63 AVYRGFDQEL 72 >gb|ABE09158.1| hypothetical protein UTI89_C3718 [Escherichia coli UTI89] Length = 80 Score = 136 bits (342), Expect = 3e-38 Identities = 67/70 (95%), Positives = 67/70 (95%) Frame = -1 Query: 425 SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQRGHRYAIRAGRNLPDKEFRYLRTVIVTA 246 SYSKGSRGLSVLPRVHCIFTA SISLSLGWRQ GH YAIRAGRNLPDKEFRYLRTVIVTA Sbjct: 3 SYSKGSRGLSVLPRVHCIFTANSISLSLGWRQPGHHYAIRAGRNLPDKEFRYLRTVIVTA 62 Query: 245 AVYRGFDQEL 216 AVYRGFDQEL Sbjct: 63 AVYRGFDQEL 72 >gb|ACO04175.1| conserved hypothetical protein [Persephonella marina EX-H1] gb|ACO04521.1| conserved hypothetical protein [Persephonella marina EX-H1] Length = 189 Score = 138 bits (348), Expect = 1e-37 Identities = 75/116 (64%), Positives = 83/116 (71%) Frame = -1 Query: 470 SLPPILHRQAQSSV*SYSKGSRGLSVLPRVHCIFTAISISLSLGWRQRGHRYAIRAGRNL 291 SLPPIL + S+ YSK SRGLSVLPRV I T +IS L RQRG R+ I AGRNL Sbjct: 30 SLPPILRMTRKQSMPGYSKASRGLSVLPRVVGILTDTTISPGLSSRQRGDRWTIHAGRNL 89 Query: 290 PDKEFRYLRTVIVTAAVYRGFDQELAPLLLTFRHRAGVRPYTSSCDFAESCVFDKQ 123 PDKEFRY RTVIVTAAVY GF LAPL LT+ H AG P+TSS +FA+ CVF KQ Sbjct: 90 PDKEFRYHRTVIVTAAVYPGFGSRLAPLPLTYGHWAGFTPHTSSFEFAQCCVFGKQ 145