BLASTX nr result
ID: Mentha22_contig00049855
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00049855 (816 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score E
Sequences producing significant alignments:                        (bits) Value

ref|XP_004261352.1| hypothetical protein EIN_243570 [Entamoeba i... 75 3e-11
ref|XP_004255482.1| hypothetical protein EIN_530810 [Entamoeba i... 70 7e-10
ref|XP_004258468.1| cysteine protease, putative [Entamoeba invad... 70 9e-10
ref|XP_004259050.1| hypothetical protein EIN_119130 [Entamoeba i... 69 2e-09
gb|EAS00754.2| zinc finger lsd1 subclass family protein [Tetrahy... 68 4e-09
ref|XP_001020999.1| zinc finger domain, LSD1 subclass family pro... 68 4e-09
ref|XP_004185383.1| cysteine protease, putative [Entamoeba invad... 65 3e-08
ref|XP_004255486.1| cysteine protease, putative [Entamoeba invad... 64 7e-08
ref|XP_004185892.1| hypothetical protein EIN_161400 [Entamoeba i... 64 7e-08
ref|XP_004257097.1| hypothetical protein EIN_189460, partial [En... 62 3e-07
ref|XP_004261273.1| hypothetical protein EIN_208910 [Entamoeba i... 61 6e-07
ref|XP_004183248.1| hypothetical protein EIN_440760 [Entamoeba i... 60 7e-07
gb|ESU44484.1| Variant-specific surface protein [Giardia intesti... 59 2e-06
ref|XP_004256302.1| hypothetical protein EIN_218430 [Entamoeba i... 59 3e-06
gb|EWS73791.1| hypothetical protein TTHERM_000344108 [Tetrahymen... 58 4e-06
ref|XP_001018455.1| Neurohypophysial hormone, N-terminal Domain ... 58 4e-06
ref|XP_004260469.1| hypothetical protein EIN_432100, partial [En... 58 5e-06
gb|EAR90505.2| zinc finger lsd1 subclass family protein [Tetrahy... 57 6e-06
ref|XP_004257140.1| hypothetical protein EIN_014980 [Entamoeba i...
57 8e-06 ref|XP_004035100.1| zinc finger lsd1 subclass family protein, pu... 57 8e-06 >ref|XP_004261352.1| hypothetical protein EIN_243570 [Entamoeba invadens IP1] gi|440302258|gb|ELP94581.1| hypothetical protein EIN_243570 [Entamoeba invadens IP1] Length = 450 Score = 75.1 bits (183), Expect = 3e-11 Identities = 75/262 (28%), Positives = 96/262 (36%), Gaps = 17/262 (6%) Frame = -1 Query: 789 ITGTLQTCTAGTPNSPCQTCQSGTNNGLCATCVSAGGPYYLNAGATDCVTACGAAEYLNS 610 + T QTCTA C+TC G + +C C +YLN GA CV C LN Sbjct: 69 VANTPQTCTAEN----CETCAEGKTD-VCDKCAET---FYLNQGA--CVATCPEGTTLNE 118 Query: 609 AGTQCTAT---------CGTDEYGATTPSRACKKXXXXXXXXXXXXXXXXXXXXXXSKYL 457 +C A C T G T CK+ YL Sbjct: 119 ETKECVANTPQTCTAENCETCAEGKTDVCVKCKETY----------------------YL 156 Query: 456 -AADKXXXXXXXXXXXXXXXXXTNKVCFSCTSITDCTKCASNTQ--CATCASSKFVSNDM 286 DK +CT + +C CA CA CA++ ++S Sbjct: 157 DLTDKCGLDCPEGSKKDETNMKCVADTQTCT-VVNCETCADEKTDICAKCAATYYLSEG- 214 Query: 285 TSCLTACL-GTDFSDXXXXXXXXXXKI----HADCVACGSTTACTKCGSSKYVSTDSKSC 121 +C+T C GT +D +I + D G T C KC S+ Y+ D KSC Sbjct: 215 -TCVTQCPEGTVKNDEKMECVADTPQICNVDNCDTCVEGKTDICNKCKSNYYLQVDQKSC 273 Query: 120 VTDCGSGETKDAGTMMCKKSTT 55 TDC SG KD M C K TT Sbjct: 274 ATDCASG-YKDTTNMKCVKCTT 294 >ref|XP_004255482.1| hypothetical protein EIN_530810 [Entamoeba invadens IP1] gi|440295848|gb|ELP88711.1| hypothetical protein EIN_530810 [Entamoeba invadens IP1] gi|511091173|dbj|BAN41370.1| hypothetical protein [Entamoeba invadens] Length = 504 Score = 70.5 bits (171), Expect = 7e-10 Identities = 75/264 (28%), Positives = 98/264 (37%), Gaps = 21/264 (7%) Frame = -1 Query: 783 GTLQTCTAGTPNSPCQT-----CQSGTNNGLCATCVSAGGPYYLNAGATDCVTACGAAEY 619 G +T T C+T C + T G TC YLN GA CV+AC + Sbjct: 112 GKCETGMKVTETKKCETVTIENCDASTKIGETETCNKCSENNYLNQGA--CVSACPESTT 169 Query: 618 LNSAGTQCTA----TCGTD--EYGATTPSRACKKXXXXXXXXXXXXXXXXXXXXXXSKYL 457 LN T+C A TC + E A + C K YL Sbjct: 170 LNDDKTECVANTPQTCTVENCETCAENNNEVCVKCKETY-------------------YL 210 Query: 456 
AADKXXXXXXXXXXXXXXXXXTNKVCFSCT---SITDCTKCASNTQ--CATCASSKFVSN 292 TN C + T ++ +C CA CA CA++ ++S Sbjct: 211 ---DLTGKCGLDCPEGSKKDETNMKCVADTQTCTVENCETCAEGKTDVCAKCAATYYLSE 267 Query: 291 DMTSCLTACL-GTDFSDXXXXXXXXXXKI----HADCVACGSTTACTKCGSSKYVSTDSK 127 +C+T C GT +D +I + D G T C KC S+ Y+ D K Sbjct: 268 G--TCVTQCPEGTVKNDEKMECVADTPQICNVDNCDTCVEGKTDICNKCKSNYYLQVDQK 325 Query: 126 SCVTDCGSGETKDAGTMMCKKSTT 55 SC TDC SG KD M C K TT Sbjct: 326 SCATDCASG-YKDTTNMKCVKCTT 348 >ref|XP_004258468.1| cysteine protease, putative [Entamoeba invadens IP1] gi|440299089|gb|ELP91697.1| cysteine protease, putative [Entamoeba invadens IP1] Length = 1041 Score = 70.1 bits (170), Expect = 9e-10 Identities = 60/250 (24%), Positives = 85/250 (34%), Gaps = 13/250 (5%) Frame = -1 Query: 765 TAGTPNSPCQTCQSGTNNGLCATCVSAGG--------PYYLNAGATDCVTA-----CGAA 625 T T N C C++GTN CV+ G P +G + C + C Sbjct: 430 TCNTTNLLCTECKAGTNKDRRGVCVAPGQYDFQPVNPPIKCISGCSSCTDSTTCNVCNTG 489 Query: 624 EYLNSAGTQCTATCGTDEYGATTPSRACKKXXXXXXXXXXXXXXXXXXXXXXSKYLAADK 445 + L C C + +Y ++ C +KYLA DK Sbjct: 490 KVLQKDKKACLDKCPSGQY--PNSNKLCTSCGVSSCETCVTSPTNKCDTCPTNKYLAVDK 547 Query: 444 XXXXXXXXXXXXXXXXXTNKVCFSCTSITDCTKCASNTQCATCASSKFVSNDMTSCLTAC 265 NK C CT+ +C C+++ C TC N + C Sbjct: 548 TSCLASCPNGQYPNA---NKQCTVCTT-ANCATCSASNVCTTCKP-----NYTKTATNQC 598 Query: 264 LGTDFSDXXXXXXXXXXKIHADCVACGSTTACTKCGSSKYVSTDSKSCVTDCGSGETKDA 85 T D A+C C + T CT C + +S + K+C C G T + Sbjct: 599 TATPCGDGKFGTQPNCGNCLANCKTCTNATICTACTGTYKLSENKKTCYATCPKG-TYTS 657 Query: 84 GTMMCKKSTT 55 GT CKK TT Sbjct: 658 GT-NCKKCTT 666 >ref|XP_004259050.1| hypothetical protein EIN_119130 [Entamoeba invadens IP1] gi|440299731|gb|ELP92279.1| hypothetical protein EIN_119130 [Entamoeba invadens IP1] Length = 504 Score = 68.9 bits (167), Expect = 2e-09 Identities = 74/264 (28%), Positives = 96/264 (36%), Gaps = 21/264 (7%) Frame = -1 Query: 783 GTLQTCTAGTPNSPCQT-----CQSGTNNGLCATCVSAGGPYYLNAGATDCVTACGAAEY 619 G +T T C+T C + T G TC YLN GA CV+AC Sbjct: 112 
GKCETGMKVTETKKCETVNIENCDASTKTGETETCNKCSENNYLNQGA--CVSACPEGTI 169 Query: 618 LNSAGTQCTA----TCGTD--EYGATTPSRACKKXXXXXXXXXXXXXXXXXXXXXXSKYL 457 LN T+C A TC + E A + C K YL Sbjct: 170 LNEEKTECVAIPPQTCTVENCETCAENNNEVCVKCKETY-------------------YL 210 Query: 456 AADKXXXXXXXXXXXXXXXXXTNKVCFSCT---SITDCTKCASNTQ--CATCASSKFVSN 292 TN C + T ++ +C CA CA CA++ ++S Sbjct: 211 ---DLTGKCGLDCPEGSKKDETNMKCVADTQTCTVENCETCAEGKTDVCAKCAATYYLSE 267 Query: 291 DMTSCLTACL-GTDFSDXXXXXXXXXXKI----HADCVACGSTTACTKCGSSKYVSTDSK 127 +C+T C GT +D +I + D T C KC S+ Y+ D K Sbjct: 268 G--TCVTQCPEGTVKNDEKMECVADTPQICNVDNCDTCVEDRTDICNKCKSNYYLQVDQK 325 Query: 126 SCVTDCGSGETKDAGTMMCKKSTT 55 SC TDC SG KD M C K TT Sbjct: 326 SCATDCASG-YKDTTNMKCVKCTT 348 >gb|EAS00754.2| zinc finger lsd1 subclass family protein [Tetrahymena thermophila SB210] Length = 2540 Score = 68.2 bits (165), Expect = 4e-09 Identities = 61/232 (26%), Positives = 79/232 (34%), Gaps = 5/232 (2%) Frame = -1 Query: 750 NSPCQTCQSGTNNGLCATCVSAGGPYYLNAGATDCVTACGAAEY--LNSAG-TQCTATCG 580 +S CQTC SG N C +C+ P Y T CVT C + Y N+A C ATC Sbjct: 824 DSSCQTC-SGPNANQCLSCIL---PNYFQPDTTQCVTTCKTSYYPVQNTATCAPCNATCY 879 Query: 579 TDEYGATTPSRACKKXXXXXXXXXXXXXXXXXXXXXXSKYLAADKXXXXXXXXXXXXXXX 400 +C YL + Sbjct: 880 QCSASTANDCTSCTGNL----------------------YL---QNNTCSSTCQNGTYPD 914 Query: 399 XXTNKVCFSCTSITDCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXX 220 TNK C C S +NT C TC+ ++ D SCLT C ++ D Sbjct: 915 KTTNK-CTQCDSTCLTCSAGTNTDCLTCSPPNYLQTDKNSCLTTCKSNEYQDNSSNKCVA 973 Query: 219 XXKIHADCVACGSTTACT-KCGSSKYVSTDS-KSCVTDCGSGETKDAGTMMC 70 + A C ST T + G Y S D+ K+CV C G D +C Sbjct: 974 CNVLCATCSGPASTQCLTCQAGQILYTSPDNKKTCVNSCPDGYYSDTKNNVC 1025 >ref|XP_001020999.1| zinc finger domain, LSD1 subclass family protein [Tetrahymena thermophila] Length = 2495 Score = 68.2 bits (165), Expect = 4e-09 Identities = 61/232 (26%), Positives = 79/232 (34%), Gaps = 5/232 (2%) Frame = -1 Query: 750 NSPCQTCQSGTNNGLCATCVSAGGPYYLNAGATDCVTACGAAEY--LNSAG-TQCTATCG 580 +S CQTC SG N C +C+ P Y T CVT C + 
Y N+A C ATC Sbjct: 824 DSSCQTC-SGPNANQCLSCIL---PNYFQPDTTQCVTTCKTSYYPVQNTATCAPCNATCY 879 Query: 579 TDEYGATTPSRACKKXXXXXXXXXXXXXXXXXXXXXXSKYLAADKXXXXXXXXXXXXXXX 400 +C YL + Sbjct: 880 QCSASTANDCTSCTGNL----------------------YL---QNNTCSSTCQNGTYPD 914 Query: 399 XXTNKVCFSCTSITDCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXX 220 TNK C C S +NT C TC+ ++ D SCLT C ++ D Sbjct: 915 KTTNK-CTQCDSTCLTCSAGTNTDCLTCSPPNYLQTDKNSCLTTCKSNEYQDNSSNKCVA 973 Query: 219 XXKIHADCVACGSTTACT-KCGSSKYVSTDS-KSCVTDCGSGETKDAGTMMC 70 + A C ST T + G Y S D+ K+CV C G D +C Sbjct: 974 CNVLCATCSGPASTQCLTCQAGQILYTSPDNKKTCVNSCPDGYYSDTKNNVC 1025 >ref|XP_004185383.1| cysteine protease, putative [Entamoeba invadens IP1] gi|440292860|gb|ELP86037.1| cysteine protease, putative [Entamoeba invadens IP1] Length = 1005 Score = 65.1 bits (157), Expect = 3e-08 Identities = 59/253 (23%), Positives = 87/253 (34%), Gaps = 13/253 (5%) Frame = -1 Query: 774 QTCTAGTPNSPCQTCQSGTNNGLCATCVSAGG--------PYYLNAGATDCV-----TAC 634 +TC A + C C++GT+ CV+ G P +G + C C Sbjct: 393 KTCNA--TDLLCTECKAGTDKDKRGVCVAPGQYDFQPVNPPITCISGCSSCTDTTTCNVC 450 Query: 633 GAAEYLNSAGTQCTATCGTDEYGATTPSRACKKXXXXXXXXXXXXXXXXXXXXXXSKYLA 454 + L C C + +Y + ++ C +KYLA Sbjct: 451 NTGKVLQKDKKACLDKCPSGQYADS--NKLCNFCGVSTCETCAISPANKCDTCPTNKYLA 508 Query: 453 ADKXXXXXXXXXXXXXXXXXTNKVCFSCTSITDCTKCASNTQCATCASSKFVSNDMTSCL 274 DK NK C +CT+ +C C ++ C C + N + Sbjct: 509 VDKTSCLASCPNGQYPNA---NKQCTACTT-ANCATCDASNVCTACKT-----NFTKTST 559 Query: 273 TACLGTDFSDXXXXXXXXXXKIHADCVACGSTTACTKCGSSKYVSTDSKSCVTDCGSGET 94 C T D A+C C + T CT C + +S D K+C C G T Sbjct: 560 NQCTATPCGDGKFGTLPNCGNCLANCKTCTNATICTVCTGTNKLSEDKKTCYATCPKG-T 618 Query: 93 KDAGTMMCKKSTT 55 +GT CKK TT Sbjct: 619 YTSGT-NCKKCTT 630 >ref|XP_004255486.1| cysteine protease, putative [Entamoeba invadens IP1] gi|440295852|gb|ELP88715.1| cysteine protease, putative [Entamoeba invadens IP1] Length = 762 Score = 63.9 bits (154), Expect = 7e-08 Identities = 36/113 (31%), Positives = 52/113 (46%), Gaps = 6/113 (5%) Frame = -1 
Query: 369 TSITDCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXXXXKIHADCVA 190 T ++ C KC + T C TCA K++ ND +CLTAC + + A+C Sbjct: 432 TCLSGCVKCTNATICETCAVGKYLKNDRKACLTACPNGQYPNNNKVCVACST---ANCAT 488 Query: 189 CGSTTACTKCGSSKYVSTDSKSC-VTDCGSGETKDAGTMM-----CKKSTTGS 49 CG+ CT C S Y T S +C + CG G+ + CK T+G+ Sbjct: 489 CGTNNVCTAC-KSGYQKTASNTCQLIPCGDGKFGTSPNCQNCLANCKTCTSGA 540 >ref|XP_004185892.1| hypothetical protein EIN_161400 [Entamoeba invadens IP1] gi|440293429|gb|ELP86546.1| hypothetical protein EIN_161400 [Entamoeba invadens IP1] Length = 642 Score = 63.9 bits (154), Expect = 7e-08 Identities = 62/262 (23%), Positives = 89/262 (33%), Gaps = 23/262 (8%) Frame = -1 Query: 771 TCTAGTPNSPCQTCQS-----------GT--------NNGLCATCVSAGGPYYLNAGATD 649 TCT T + C +C++ GT NNG+C+ C T Sbjct: 252 TCTKETAATKCDSCKNSLKLSSDKTACGTTCPNGEIDNNGICSKCSVKNCITCTTDPTTK 311 Query: 648 CVTACGAAEYLNSAGTQCTATCGTDEYGATTPSRACKKXXXXXXXXXXXXXXXXXXXXXX 469 C +C L T+C C EY TT C K Sbjct: 312 C-DSCNTGYNLYYNKTKCGTKCPDGEYSGTT--NICNKCTVSNCKTCDTNNTKCDTCIDN 368 Query: 468 SKYLAADKXXXXXXXXXXXXXXXXXTNKVCFSCTSITDCTKCASN--TQCATCASSKFVS 295 +K L+ADK N +C +C + +C C S+ +C +C + +S Sbjct: 369 NK-LSADK---TKCSTSCAAGEYENGNNMCTTC-GVANCGSCTSSEPNKCISCTGTNKLS 423 Query: 294 NDMTSCLTAC-LGTDFSDXXXXXXXXXXKIHADCVAC-GSTTACTKCGSSKYVSTDSKSC 121 D T C + C G F + C C +T C KC ++ V D +C Sbjct: 424 VDKTKCSSTCPSGQTFINNNTCISCSVSL----CSVCDADSTKCEKCSATNVVQIDQLAC 479 Query: 120 VTDCGSGETKDAGTMMCKKSTT 55 + C +GE C K TT Sbjct: 480 IEKCPNGEYAKGNNKQCTKCTT 501 >ref|XP_004257097.1| hypothetical protein EIN_189460, partial [Entamoeba invadens IP1] gi|440297678|gb|ELP90326.1| hypothetical protein EIN_189460, partial [Entamoeba invadens IP1] Length = 415 Score = 61.6 bits (148), Expect = 3e-07 Identities = 36/105 (34%), Positives = 46/105 (43%), Gaps = 2/105 (1%) Frame = -1 Query: 357 DCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXXXXKIHADCVACGST 178 +C CA + +C C K++ D+ C AC F D K DC C Sbjct: 69 NCKVCAVD-KCYQCKDDKYLKEDLFECFDACDDGYFKDTINENNHTCGKCQPDCKTCTDG 127 
Query: 177 TACTKCGSSKYVSTDSKSCV--TDCGSGETKDAGTMMCKKSTTGS 49 T CT C + + D+ CV T+C SG KD T KKS TGS Sbjct: 128 TKCTSCPENALLLEDTGKCVTATECPSGYYKDTVTATRKKSKTGS 172 >ref|XP_004261273.1| hypothetical protein EIN_208910 [Entamoeba invadens IP1] gi|440302157|gb|ELP94502.1| hypothetical protein EIN_208910 [Entamoeba invadens IP1] Length = 650 Score = 60.8 bits (146), Expect = 6e-07 Identities = 34/111 (30%), Positives = 44/111 (39%), Gaps = 2/111 (1%) Frame = -1 Query: 390 NKVCFSCTSITDCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXXXXK 211 N C C SITDC +C S T+C C + D T C C D Sbjct: 367 NTNCMKC-SITDCGECTSKTECTKCTKGNLI-KDKTKCEANC-----PDKFYPVNNVCTD 419 Query: 210 IHADCVACGSTTACTKCGSSKYVSTDSKSCVT--DCGSGETKDAGTMMCKK 64 C C CT+C + ++ D+K C T +C +G TKD C K Sbjct: 420 CDTTCKKCTEAGKCTECPDNTFLIEDTKKCTTNSECPAGYTKDTANKKCFK 470 >ref|XP_004183248.1| hypothetical protein EIN_440760 [Entamoeba invadens IP1] gi|440290500|gb|ELP83902.1| hypothetical protein EIN_440760 [Entamoeba invadens IP1] Length = 783 Score = 60.5 bits (145), Expect = 7e-07 Identities = 33/110 (30%), Positives = 46/110 (41%) Frame = -1 Query: 381 CFSCTSITDCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXXXXKIHA 202 C SC+SI C+KC+S+ C C + + D SCL C +S Sbjct: 492 CISCSSIQGCSKCSSSDICTECTTDN-LQPDNKSCLPTCPTGYYSSNKVCMKCSD----- 545 Query: 201 DCVACGSTTACTKCGSSKYVSTDSKSCVTDCGSGETKDAGTMMCKKSTTG 52 +C +C C KC + ++ D+K CV C G K MCK G Sbjct: 546 NCDSCKDGKQCDKCKTDYLLTEDTKVCVVTCSDGYFKSTQEKMCKTCQEG 595 >gb|ESU44484.1| Variant-specific surface protein [Giardia intestinalis] Length = 742 Score = 59.3 bits (142), Expect = 2e-06 Identities = 64/257 (24%), Positives = 83/257 (32%), Gaps = 17/257 (6%) Frame = -1 Query: 741 CQTCQSGTNNGL---------CATCVSAGGPYYLNAGATDCVTACGAAEYLNSAGTQCTA 589 CQ+C SG N L CA C Y +A T T C +YL S GT + Sbjct: 269 CQSC-SGANTDLSPAGAGVAGCAACT------YDSAKVT--CTKCETGKYLKSDGTCADS 319 Query: 588 TCGTDEYGATTPSRACKKXXXXXXXXXXXXXXXXXXXXXXSKYLAADKXXXXXXXXXXXX 409 E+ P+ K K L Sbjct: 320 
CTANTEFVKNDPTNGNKCVSCGDQTDGIADCKTCSKTGDTLKCLTCGDSKKPNAD----- 374 Query: 408 XXXXXTNKVCFSCTSITDCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXX 229 C +CT ITDC C CA C S+K +S +CL C + D Sbjct: 375 ------GTACVACT-ITDCASCDKENVCAACTSNKKLSPLKDACLDGCPAGTYDD----- 422 Query: 228 XXXXXKIHADCVACGS---TTACTKC--GSSKYVSTDSK---SCVTDCGSGETKDAGTMM 73 H C C + T+CT C G +TDS C+ +C ++ M Sbjct: 423 NNVCTPCHTSCAECNNNAEATSCTACYPGHVLNRTTDSSPAGMCIPECTGRYVENCEAGM 482 Query: 72 CKKSTTGSFGALKYAMG 22 C GS KYA+G Sbjct: 483 CTAVLGGSKYCSKYAVG 499 >ref|XP_004256302.1| hypothetical protein EIN_218430 [Entamoeba invadens IP1] gi|440296765|gb|ELP89531.1| hypothetical protein EIN_218430 [Entamoeba invadens IP1] Length = 731 Score = 58.5 bits (140), Expect = 3e-06 Identities = 35/105 (33%), Positives = 48/105 (45%), Gaps = 3/105 (2%) Frame = -1 Query: 354 CTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXXXXKIHADCVACGSTT 175 C CA + +C C +K++ D+ C C +SD K A+C C Sbjct: 369 CKVCAVD-KCYQCQENKYLKEDLFECFDKCDDGYYSDDSTATSFKCKKCLAECQNCTDGI 427 Query: 174 ACTKCGSSKYVSTDSKSCV--TDCGSGETKD-AGTMMCKKSTTGS 49 CT C +K + D+ CV T+C SG KD T +CKK TGS Sbjct: 428 KCTSCPENKLLLEDTGKCVTATECPSGYYKDTTATAICKKCKTGS 472 >gb|EWS73791.1| hypothetical protein TTHERM_000344108 [Tetrahymena thermophila SB210] Length = 1302 Score = 58.2 bits (139), Expect = 4e-06 Identities = 31/112 (27%), Positives = 56/112 (50%) Frame = -1 Query: 390 NKVCFSCTSITDCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXXXXK 211 ++ C C + +C KC+S+ C C + ++ N+ SCL+ C F D + Sbjct: 860 SRACQKC--MNNCDKCSSSNSCDQCVQNFYLLNN-NSCLSECPQKYFKDSQKNICVLCFE 916 Query: 210 IHADCVACGSTTACTKCGSSKYVSTDSKSCVTDCGSGETKDAGTMMCKKSTT 55 +CV C +T++C KCG+ ++ D + CV +C G +D +C+ +T Sbjct: 917 ---NCVKCSNTSSCLKCGNGLHL-LDGQQCVQNCPDGYFEDYSLGICQICST 964 >ref|XP_001018455.1| Neurohypophysial hormone, N-terminal Domain containing protein [Tetrahymena thermophila] Length = 1410 Score = 58.2 bits (139), Expect = 4e-06 Identities = 31/112 (27%), Positives = 56/112 (50%) Frame = -1 Query: 390 
NKVCFSCTSITDCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXXXXK 211 ++ C C + +C KC+S+ C C + ++ N+ SCL+ C F D + Sbjct: 666 SRACQKC--MNNCDKCSSSNSCDQCVQNFYLLNN-NSCLSECPQKYFKDSQKNICVLCFE 722 Query: 210 IHADCVACGSTTACTKCGSSKYVSTDSKSCVTDCGSGETKDAGTMMCKKSTT 55 +CV C +T++C KCG+ ++ D + CV +C G +D +C+ +T Sbjct: 723 ---NCVKCSNTSSCLKCGNGLHL-LDGQQCVQNCPDGYFEDYSLGICQICST 770 >ref|XP_004260469.1| hypothetical protein EIN_432100, partial [Entamoeba invadens IP1] gi|440301291|gb|ELP93698.1| hypothetical protein EIN_432100, partial [Entamoeba invadens IP1] Length = 1308 Score = 57.8 bits (138), Expect = 5e-06 Identities = 40/123 (32%), Positives = 53/123 (43%), Gaps = 27/123 (21%) Frame = -1 Query: 381 CFSCTSI-TDCTKCASNTQCATCASSKF-----------VSNDMTSC-----LTACLGTD 253 C +C SI T+C KC SNT C+TC + F V +D +C T C+ Sbjct: 46 CGTCQSIITNCQKCYSNTTCSTCLTKYFPNSGQCALCSSVISDCETCNSGTECTKCINNK 105 Query: 252 F-SDXXXXXXXXXXKIHADCVACGSTTACTKCGSSKYVSTDSKSCV---------TDCGS 103 + D +I A C AC S T CT C Y++++SK V T C S Sbjct: 106 YLKDTDRTKCFACGEIMAGCTACTSGTVCTTCSDGYYINSNSKCGVCSTTLTTNCTKCSS 165 Query: 102 GET 94 G T Sbjct: 166 GTT 168 >gb|EAR90505.2| zinc finger lsd1 subclass family protein [Tetrahymena thermophila SB210] Length = 1184 Score = 57.4 bits (137), Expect = 6e-06 Identities = 55/263 (20%), Positives = 89/263 (33%), Gaps = 24/263 (9%) Frame = -1 Query: 780 TLQTCTAGTPNSPCQTCQSGTNNGLCATCVSAGGPYY-------------------LNAG 658 T TC+ N+ C TC++ T +C+S Y ++ G Sbjct: 510 TCLTCSTPQSNTSCLTCKTNTYLNPNKSCLSNCPSKYWTDQTNWKCQVCDPTCYNCISPG 569 Query: 657 ATDCVTACGAAEYLNSAGTQCTATCGTDEYGAT-TPSRACKKXXXXXXXXXXXXXXXXXX 481 + T+C YL S QC TC T+ + T T + C+ Sbjct: 570 DQNSCTSCSGTLYLFS--NQCINTCPTNTFYLTQTNNNICQPCHNSCKTCDGPNNNNCQS 627 Query: 480 XXXXSKYLAADKXXXXXXXXXXXXXXXXXTNKVCFSCTSITDCTKCA--SNTQCATCASS 307 S + + K N +C SC C C+ +N C TC S Sbjct: 628 CLALSLFQQSSKTCVSQCNPNQYQNNSDPNNLICSSCDP--SCATCSGPNNNNCVTCTGS 685 Query: 306 KFVSNDMTSCLTACLGTDFSDXXXXXXXXXXKIHADCVACG--STTACTKCGSSKYVSTD 133 F+ + C++ C +++ + C C +T CT C Y Sbjct: 686 
LFLYQNQ--CISNCPKKYYNNTQNNQCTQCD---SSCYTCNGIATNNCTSCQLPLYFEPT 740 Query: 132 SKSCVTDCGSGETKDAGTMMCKK 64 S C+ +C S + D ++ CK+ Sbjct: 741 SNQCLQNCNSNQYPDTNSISCKQ 763 >ref|XP_004257140.1| hypothetical protein EIN_014980 [Entamoeba invadens IP1] gi|440297728|gb|ELP90369.1| hypothetical protein EIN_014980 [Entamoeba invadens IP1] Length = 574 Score = 57.0 bits (136), Expect = 8e-06 Identities = 34/106 (32%), Positives = 44/106 (41%), Gaps = 3/106 (2%) Frame = -1 Query: 357 DCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXXXXKIHADCVACGST 178 +C CA + +C C K++ D+ C C F D K DC C Sbjct: 211 NCKVCAVD-KCYQCKDDKYLKEDLFECFDKCDDGYFKDTINENNHTCGKCQPDCKTCTDG 269 Query: 177 TACTKCGSSKYVSTDSKSCV--TDCGSGETKD-AGTMMCKKSTTGS 49 T CT C + + D+ CV T+C SG KD CKK TGS Sbjct: 270 TKCTSCPENALLLEDTGKCVTATECPSGYYKDKTAAATCKKCKTGS 315 >ref|XP_004035100.1| zinc finger lsd1 subclass family protein, putative [Ichthyophthirius multifiliis] gi|340505265|gb|EGR31614.1| zinc finger lsd1 subclass family protein, putative [Ichthyophthirius multifiliis] Length = 1266 Score = 57.0 bits (136), Expect = 8e-06 Identities = 34/117 (29%), Positives = 53/117 (45%), Gaps = 5/117 (4%) Frame = -1 Query: 390 NKVCFSCTSITDCTKCAS--NTQCATCASSKFVSNDMTSCLTACLGTDF-SDXXXXXXXX 220 N +C +C S CT+C T+C C+ S F+ D T C+++C + + Sbjct: 593 NNICDNCNS--KCTECIGPLETECTKCSESLFL--DQTQCISSCPERKYKKNGPGNINNI 648 Query: 219 XXKIHADCVACG--STTACTKCGSSKYVSTDSKSCVTDCGSGETKDAGTMMCKKSTT 55 H C C S CTKC SKY + C+TDCG+ + + T C+ ++ Sbjct: 649 CDDCHNTCYLCNGPSDNECTKCSGSKYFK--ANKCLTDCGNHQYGNPITQNCENCSS 703