BLASTX nr result

ID: Mentha22_contig00049855 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00049855
         (816 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004261352.1| hypothetical protein EIN_243570 [Entamoeba i...    75   3e-11
ref|XP_004255482.1| hypothetical protein EIN_530810 [Entamoeba i...    70   7e-10
ref|XP_004258468.1| cysteine protease, putative [Entamoeba invad...    70   9e-10
ref|XP_004259050.1| hypothetical protein EIN_119130 [Entamoeba i...    69   2e-09
gb|EAS00754.2| zinc finger lsd1 subclass family protein [Tetrahy...    68   4e-09
ref|XP_001020999.1| zinc finger domain, LSD1 subclass family pro...    68   4e-09
ref|XP_004185383.1| cysteine protease, putative [Entamoeba invad...    65   3e-08
ref|XP_004255486.1| cysteine protease, putative [Entamoeba invad...    64   7e-08
ref|XP_004185892.1| hypothetical protein EIN_161400 [Entamoeba i...    64   7e-08
ref|XP_004257097.1| hypothetical protein EIN_189460, partial [En...    62   3e-07
ref|XP_004261273.1| hypothetical protein EIN_208910 [Entamoeba i...    61   6e-07
ref|XP_004183248.1| hypothetical protein EIN_440760 [Entamoeba i...    60   7e-07
gb|ESU44484.1| Variant-specific surface protein [Giardia intesti...    59   2e-06
ref|XP_004256302.1| hypothetical protein EIN_218430 [Entamoeba i...    59   3e-06
gb|EWS73791.1| hypothetical protein TTHERM_000344108 [Tetrahymen...    58   4e-06
ref|XP_001018455.1| Neurohypophysial hormone, N-terminal Domain ...    58   4e-06
ref|XP_004260469.1| hypothetical protein EIN_432100, partial [En...    58   5e-06
gb|EAR90505.2| zinc finger lsd1 subclass family protein [Tetrahy...    57   6e-06
ref|XP_004257140.1| hypothetical protein EIN_014980 [Entamoeba i...    57   8e-06
ref|XP_004035100.1| zinc finger lsd1 subclass family protein, pu...    57   8e-06

>ref|XP_004261352.1| hypothetical protein EIN_243570 [Entamoeba invadens IP1]
           gi|440302258|gb|ELP94581.1| hypothetical protein
           EIN_243570 [Entamoeba invadens IP1]
          Length = 450

 Score = 75.1 bits (183), Expect = 3e-11
 Identities = 75/262 (28%), Positives = 96/262 (36%), Gaps = 17/262 (6%)
 Frame = -1

Query: 789 ITGTLQTCTAGTPNSPCQTCQSGTNNGLCATCVSAGGPYYLNAGATDCVTACGAAEYLNS 610
           +  T QTCTA      C+TC  G  + +C  C      +YLN GA  CV  C     LN 
Sbjct: 69  VANTPQTCTAEN----CETCAEGKTD-VCDKCAET---FYLNQGA--CVATCPEGTTLNE 118

Query: 609 AGTQCTAT---------CGTDEYGATTPSRACKKXXXXXXXXXXXXXXXXXXXXXXSKYL 457
              +C A          C T   G T     CK+                        YL
Sbjct: 119 ETKECVANTPQTCTAENCETCAEGKTDVCVKCKETY----------------------YL 156

Query: 456 -AADKXXXXXXXXXXXXXXXXXTNKVCFSCTSITDCTKCASNTQ--CATCASSKFVSNDM 286
              DK                       +CT + +C  CA      CA CA++ ++S   
Sbjct: 157 DLTDKCGLDCPEGSKKDETNMKCVADTQTCT-VVNCETCADEKTDICAKCAATYYLSEG- 214

Query: 285 TSCLTACL-GTDFSDXXXXXXXXXXKI----HADCVACGSTTACTKCGSSKYVSTDSKSC 121
            +C+T C  GT  +D          +I    + D    G T  C KC S+ Y+  D KSC
Sbjct: 215 -TCVTQCPEGTVKNDEKMECVADTPQICNVDNCDTCVEGKTDICNKCKSNYYLQVDQKSC 273

Query: 120 VTDCGSGETKDAGTMMCKKSTT 55
            TDC SG  KD   M C K TT
Sbjct: 274 ATDCASG-YKDTTNMKCVKCTT 294


>ref|XP_004255482.1| hypothetical protein EIN_530810 [Entamoeba invadens IP1]
           gi|440295848|gb|ELP88711.1| hypothetical protein
           EIN_530810 [Entamoeba invadens IP1]
           gi|511091173|dbj|BAN41370.1| hypothetical protein
           [Entamoeba invadens]
          Length = 504

 Score = 70.5 bits (171), Expect = 7e-10
 Identities = 75/264 (28%), Positives = 98/264 (37%), Gaps = 21/264 (7%)
 Frame = -1

Query: 783 GTLQTCTAGTPNSPCQT-----CQSGTNNGLCATCVSAGGPYYLNAGATDCVTACGAAEY 619
           G  +T    T    C+T     C + T  G   TC       YLN GA  CV+AC  +  
Sbjct: 112 GKCETGMKVTETKKCETVTIENCDASTKIGETETCNKCSENNYLNQGA--CVSACPESTT 169

Query: 618 LNSAGTQCTA----TCGTD--EYGATTPSRACKKXXXXXXXXXXXXXXXXXXXXXXSKYL 457
           LN   T+C A    TC  +  E  A   +  C K                        YL
Sbjct: 170 LNDDKTECVANTPQTCTVENCETCAENNNEVCVKCKETY-------------------YL 210

Query: 456 AADKXXXXXXXXXXXXXXXXXTNKVCFSCT---SITDCTKCASNTQ--CATCASSKFVSN 292
                                TN  C + T   ++ +C  CA      CA CA++ ++S 
Sbjct: 211 ---DLTGKCGLDCPEGSKKDETNMKCVADTQTCTVENCETCAEGKTDVCAKCAATYYLSE 267

Query: 291 DMTSCLTACL-GTDFSDXXXXXXXXXXKI----HADCVACGSTTACTKCGSSKYVSTDSK 127
              +C+T C  GT  +D          +I    + D    G T  C KC S+ Y+  D K
Sbjct: 268 G--TCVTQCPEGTVKNDEKMECVADTPQICNVDNCDTCVEGKTDICNKCKSNYYLQVDQK 325

Query: 126 SCVTDCGSGETKDAGTMMCKKSTT 55
           SC TDC SG  KD   M C K TT
Sbjct: 326 SCATDCASG-YKDTTNMKCVKCTT 348


>ref|XP_004258468.1| cysteine protease, putative [Entamoeba invadens IP1]
            gi|440299089|gb|ELP91697.1| cysteine protease, putative
            [Entamoeba invadens IP1]
          Length = 1041

 Score = 70.1 bits (170), Expect = 9e-10
 Identities = 60/250 (24%), Positives = 85/250 (34%), Gaps = 13/250 (5%)
 Frame = -1

Query: 765  TAGTPNSPCQTCQSGTNNGLCATCVSAGG--------PYYLNAGATDCVTA-----CGAA 625
            T  T N  C  C++GTN      CV+ G         P    +G + C  +     C   
Sbjct: 430  TCNTTNLLCTECKAGTNKDRRGVCVAPGQYDFQPVNPPIKCISGCSSCTDSTTCNVCNTG 489

Query: 624  EYLNSAGTQCTATCGTDEYGATTPSRACKKXXXXXXXXXXXXXXXXXXXXXXSKYLAADK 445
            + L      C   C + +Y     ++ C                        +KYLA DK
Sbjct: 490  KVLQKDKKACLDKCPSGQY--PNSNKLCTSCGVSSCETCVTSPTNKCDTCPTNKYLAVDK 547

Query: 444  XXXXXXXXXXXXXXXXXTNKVCFSCTSITDCTKCASNTQCATCASSKFVSNDMTSCLTAC 265
                              NK C  CT+  +C  C+++  C TC       N   +    C
Sbjct: 548  TSCLASCPNGQYPNA---NKQCTVCTT-ANCATCSASNVCTTCKP-----NYTKTATNQC 598

Query: 264  LGTDFSDXXXXXXXXXXKIHADCVACGSTTACTKCGSSKYVSTDSKSCVTDCGSGETKDA 85
              T   D             A+C  C + T CT C  +  +S + K+C   C  G T  +
Sbjct: 599  TATPCGDGKFGTQPNCGNCLANCKTCTNATICTACTGTYKLSENKKTCYATCPKG-TYTS 657

Query: 84   GTMMCKKSTT 55
            GT  CKK TT
Sbjct: 658  GT-NCKKCTT 666


>ref|XP_004259050.1| hypothetical protein EIN_119130 [Entamoeba invadens IP1]
           gi|440299731|gb|ELP92279.1| hypothetical protein
           EIN_119130 [Entamoeba invadens IP1]
          Length = 504

 Score = 68.9 bits (167), Expect = 2e-09
 Identities = 74/264 (28%), Positives = 96/264 (36%), Gaps = 21/264 (7%)
 Frame = -1

Query: 783 GTLQTCTAGTPNSPCQT-----CQSGTNNGLCATCVSAGGPYYLNAGATDCVTACGAAEY 619
           G  +T    T    C+T     C + T  G   TC       YLN GA  CV+AC     
Sbjct: 112 GKCETGMKVTETKKCETVNIENCDASTKTGETETCNKCSENNYLNQGA--CVSACPEGTI 169

Query: 618 LNSAGTQCTA----TCGTD--EYGATTPSRACKKXXXXXXXXXXXXXXXXXXXXXXSKYL 457
           LN   T+C A    TC  +  E  A   +  C K                        YL
Sbjct: 170 LNEEKTECVAIPPQTCTVENCETCAENNNEVCVKCKETY-------------------YL 210

Query: 456 AADKXXXXXXXXXXXXXXXXXTNKVCFSCT---SITDCTKCASNTQ--CATCASSKFVSN 292
                                TN  C + T   ++ +C  CA      CA CA++ ++S 
Sbjct: 211 ---DLTGKCGLDCPEGSKKDETNMKCVADTQTCTVENCETCAEGKTDVCAKCAATYYLSE 267

Query: 291 DMTSCLTACL-GTDFSDXXXXXXXXXXKI----HADCVACGSTTACTKCGSSKYVSTDSK 127
              +C+T C  GT  +D          +I    + D      T  C KC S+ Y+  D K
Sbjct: 268 G--TCVTQCPEGTVKNDEKMECVADTPQICNVDNCDTCVEDRTDICNKCKSNYYLQVDQK 325

Query: 126 SCVTDCGSGETKDAGTMMCKKSTT 55
           SC TDC SG  KD   M C K TT
Sbjct: 326 SCATDCASG-YKDTTNMKCVKCTT 348


>gb|EAS00754.2| zinc finger lsd1 subclass family protein [Tetrahymena thermophila
            SB210]
          Length = 2540

 Score = 68.2 bits (165), Expect = 4e-09
 Identities = 61/232 (26%), Positives = 79/232 (34%), Gaps = 5/232 (2%)
 Frame = -1

Query: 750  NSPCQTCQSGTNNGLCATCVSAGGPYYLNAGATDCVTACGAAEY--LNSAG-TQCTATCG 580
            +S CQTC SG N   C +C+    P Y     T CVT C  + Y   N+A    C ATC 
Sbjct: 824  DSSCQTC-SGPNANQCLSCIL---PNYFQPDTTQCVTTCKTSYYPVQNTATCAPCNATCY 879

Query: 579  TDEYGATTPSRACKKXXXXXXXXXXXXXXXXXXXXXXSKYLAADKXXXXXXXXXXXXXXX 400
                       +C                          YL   +               
Sbjct: 880  QCSASTANDCTSCTGNL----------------------YL---QNNTCSSTCQNGTYPD 914

Query: 399  XXTNKVCFSCTSITDCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXX 220
              TNK C  C S        +NT C TC+   ++  D  SCLT C   ++ D        
Sbjct: 915  KTTNK-CTQCDSTCLTCSAGTNTDCLTCSPPNYLQTDKNSCLTTCKSNEYQDNSSNKCVA 973

Query: 219  XXKIHADCVACGSTTACT-KCGSSKYVSTDS-KSCVTDCGSGETKDAGTMMC 70
               + A C    ST   T + G   Y S D+ K+CV  C  G   D    +C
Sbjct: 974  CNVLCATCSGPASTQCLTCQAGQILYTSPDNKKTCVNSCPDGYYSDTKNNVC 1025


>ref|XP_001020999.1| zinc finger domain, LSD1 subclass family protein [Tetrahymena
            thermophila]
          Length = 2495

 Score = 68.2 bits (165), Expect = 4e-09
 Identities = 61/232 (26%), Positives = 79/232 (34%), Gaps = 5/232 (2%)
 Frame = -1

Query: 750  NSPCQTCQSGTNNGLCATCVSAGGPYYLNAGATDCVTACGAAEY--LNSAG-TQCTATCG 580
            +S CQTC SG N   C +C+    P Y     T CVT C  + Y   N+A    C ATC 
Sbjct: 824  DSSCQTC-SGPNANQCLSCIL---PNYFQPDTTQCVTTCKTSYYPVQNTATCAPCNATCY 879

Query: 579  TDEYGATTPSRACKKXXXXXXXXXXXXXXXXXXXXXXSKYLAADKXXXXXXXXXXXXXXX 400
                       +C                          YL   +               
Sbjct: 880  QCSASTANDCTSCTGNL----------------------YL---QNNTCSSTCQNGTYPD 914

Query: 399  XXTNKVCFSCTSITDCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXX 220
              TNK C  C S        +NT C TC+   ++  D  SCLT C   ++ D        
Sbjct: 915  KTTNK-CTQCDSTCLTCSAGTNTDCLTCSPPNYLQTDKNSCLTTCKSNEYQDNSSNKCVA 973

Query: 219  XXKIHADCVACGSTTACT-KCGSSKYVSTDS-KSCVTDCGSGETKDAGTMMC 70
               + A C    ST   T + G   Y S D+ K+CV  C  G   D    +C
Sbjct: 974  CNVLCATCSGPASTQCLTCQAGQILYTSPDNKKTCVNSCPDGYYSDTKNNVC 1025


>ref|XP_004185383.1| cysteine protease, putative [Entamoeba invadens IP1]
            gi|440292860|gb|ELP86037.1| cysteine protease, putative
            [Entamoeba invadens IP1]
          Length = 1005

 Score = 65.1 bits (157), Expect = 3e-08
 Identities = 59/253 (23%), Positives = 87/253 (34%), Gaps = 13/253 (5%)
 Frame = -1

Query: 774  QTCTAGTPNSPCQTCQSGTNNGLCATCVSAGG--------PYYLNAGATDCV-----TAC 634
            +TC A   +  C  C++GT+      CV+ G         P    +G + C        C
Sbjct: 393  KTCNA--TDLLCTECKAGTDKDKRGVCVAPGQYDFQPVNPPITCISGCSSCTDTTTCNVC 450

Query: 633  GAAEYLNSAGTQCTATCGTDEYGATTPSRACKKXXXXXXXXXXXXXXXXXXXXXXSKYLA 454
               + L      C   C + +Y  +  ++ C                        +KYLA
Sbjct: 451  NTGKVLQKDKKACLDKCPSGQYADS--NKLCNFCGVSTCETCAISPANKCDTCPTNKYLA 508

Query: 453  ADKXXXXXXXXXXXXXXXXXTNKVCFSCTSITDCTKCASNTQCATCASSKFVSNDMTSCL 274
             DK                  NK C +CT+  +C  C ++  C  C +     N   +  
Sbjct: 509  VDKTSCLASCPNGQYPNA---NKQCTACTT-ANCATCDASNVCTACKT-----NFTKTST 559

Query: 273  TACLGTDFSDXXXXXXXXXXKIHADCVACGSTTACTKCGSSKYVSTDSKSCVTDCGSGET 94
              C  T   D             A+C  C + T CT C  +  +S D K+C   C  G T
Sbjct: 560  NQCTATPCGDGKFGTLPNCGNCLANCKTCTNATICTVCTGTNKLSEDKKTCYATCPKG-T 618

Query: 93   KDAGTMMCKKSTT 55
              +GT  CKK TT
Sbjct: 619  YTSGT-NCKKCTT 630


>ref|XP_004255486.1| cysteine protease, putative [Entamoeba invadens IP1]
           gi|440295852|gb|ELP88715.1| cysteine protease, putative
           [Entamoeba invadens IP1]
          Length = 762

 Score = 63.9 bits (154), Expect = 7e-08
 Identities = 36/113 (31%), Positives = 52/113 (46%), Gaps = 6/113 (5%)
 Frame = -1

Query: 369 TSITDCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXXXXKIHADCVA 190
           T ++ C KC + T C TCA  K++ ND  +CLTAC    + +             A+C  
Sbjct: 432 TCLSGCVKCTNATICETCAVGKYLKNDRKACLTACPNGQYPNNNKVCVACST---ANCAT 488

Query: 189 CGSTTACTKCGSSKYVSTDSKSC-VTDCGSGETKDAGTMM-----CKKSTTGS 49
           CG+   CT C  S Y  T S +C +  CG G+   +         CK  T+G+
Sbjct: 489 CGTNNVCTAC-KSGYQKTASNTCQLIPCGDGKFGTSPNCQNCLANCKTCTSGA 540


>ref|XP_004185892.1| hypothetical protein EIN_161400 [Entamoeba invadens IP1]
            gi|440293429|gb|ELP86546.1| hypothetical protein
            EIN_161400 [Entamoeba invadens IP1]
          Length = 642

 Score = 63.9 bits (154), Expect = 7e-08
 Identities = 62/262 (23%), Positives = 89/262 (33%), Gaps = 23/262 (8%)
 Frame = -1

Query: 771  TCTAGTPNSPCQTCQS-----------GT--------NNGLCATCVSAGGPYYLNAGATD 649
            TCT  T  + C +C++           GT        NNG+C+ C             T 
Sbjct: 252  TCTKETAATKCDSCKNSLKLSSDKTACGTTCPNGEIDNNGICSKCSVKNCITCTTDPTTK 311

Query: 648  CVTACGAAEYLNSAGTQCTATCGTDEYGATTPSRACKKXXXXXXXXXXXXXXXXXXXXXX 469
            C  +C     L    T+C   C   EY  TT    C K                      
Sbjct: 312  C-DSCNTGYNLYYNKTKCGTKCPDGEYSGTT--NICNKCTVSNCKTCDTNNTKCDTCIDN 368

Query: 468  SKYLAADKXXXXXXXXXXXXXXXXXTNKVCFSCTSITDCTKCASN--TQCATCASSKFVS 295
            +K L+ADK                  N +C +C  + +C  C S+   +C +C  +  +S
Sbjct: 369  NK-LSADK---TKCSTSCAAGEYENGNNMCTTC-GVANCGSCTSSEPNKCISCTGTNKLS 423

Query: 294  NDMTSCLTAC-LGTDFSDXXXXXXXXXXKIHADCVAC-GSTTACTKCGSSKYVSTDSKSC 121
             D T C + C  G  F +               C  C   +T C KC ++  V  D  +C
Sbjct: 424  VDKTKCSSTCPSGQTFINNNTCISCSVSL----CSVCDADSTKCEKCSATNVVQIDQLAC 479

Query: 120  VTDCGSGETKDAGTMMCKKSTT 55
            +  C +GE        C K TT
Sbjct: 480  IEKCPNGEYAKGNNKQCTKCTT 501


>ref|XP_004257097.1| hypothetical protein EIN_189460, partial [Entamoeba invadens IP1]
           gi|440297678|gb|ELP90326.1| hypothetical protein
           EIN_189460, partial [Entamoeba invadens IP1]
          Length = 415

 Score = 61.6 bits (148), Expect = 3e-07
 Identities = 36/105 (34%), Positives = 46/105 (43%), Gaps = 2/105 (1%)
 Frame = -1

Query: 357 DCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXXXXKIHADCVACGST 178
           +C  CA + +C  C   K++  D+  C  AC    F D          K   DC  C   
Sbjct: 69  NCKVCAVD-KCYQCKDDKYLKEDLFECFDACDDGYFKDTINENNHTCGKCQPDCKTCTDG 127

Query: 177 TACTKCGSSKYVSTDSKSCV--TDCGSGETKDAGTMMCKKSTTGS 49
           T CT C  +  +  D+  CV  T+C SG  KD  T   KKS TGS
Sbjct: 128 TKCTSCPENALLLEDTGKCVTATECPSGYYKDTVTATRKKSKTGS 172


>ref|XP_004261273.1| hypothetical protein EIN_208910 [Entamoeba invadens IP1]
           gi|440302157|gb|ELP94502.1| hypothetical protein
           EIN_208910 [Entamoeba invadens IP1]
          Length = 650

 Score = 60.8 bits (146), Expect = 6e-07
 Identities = 34/111 (30%), Positives = 44/111 (39%), Gaps = 2/111 (1%)
 Frame = -1

Query: 390 NKVCFSCTSITDCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXXXXK 211
           N  C  C SITDC +C S T+C  C     +  D T C   C      D           
Sbjct: 367 NTNCMKC-SITDCGECTSKTECTKCTKGNLI-KDKTKCEANC-----PDKFYPVNNVCTD 419

Query: 210 IHADCVACGSTTACTKCGSSKYVSTDSKSCVT--DCGSGETKDAGTMMCKK 64
               C  C     CT+C  + ++  D+K C T  +C +G TKD     C K
Sbjct: 420 CDTTCKKCTEAGKCTECPDNTFLIEDTKKCTTNSECPAGYTKDTANKKCFK 470


>ref|XP_004183248.1| hypothetical protein EIN_440760 [Entamoeba invadens IP1]
           gi|440290500|gb|ELP83902.1| hypothetical protein
           EIN_440760 [Entamoeba invadens IP1]
          Length = 783

 Score = 60.5 bits (145), Expect = 7e-07
 Identities = 33/110 (30%), Positives = 46/110 (41%)
 Frame = -1

Query: 381 CFSCTSITDCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXXXXKIHA 202
           C SC+SI  C+KC+S+  C  C +   +  D  SCL  C    +S               
Sbjct: 492 CISCSSIQGCSKCSSSDICTECTTDN-LQPDNKSCLPTCPTGYYSSNKVCMKCSD----- 545

Query: 201 DCVACGSTTACTKCGSSKYVSTDSKSCVTDCGSGETKDAGTMMCKKSTTG 52
           +C +C     C KC +   ++ D+K CV  C  G  K     MCK    G
Sbjct: 546 NCDSCKDGKQCDKCKTDYLLTEDTKVCVVTCSDGYFKSTQEKMCKTCQEG 595


>gb|ESU44484.1| Variant-specific surface protein [Giardia intestinalis]
          Length = 742

 Score = 59.3 bits (142), Expect = 2e-06
 Identities = 64/257 (24%), Positives = 83/257 (32%), Gaps = 17/257 (6%)
 Frame = -1

Query: 741 CQTCQSGTNNGL---------CATCVSAGGPYYLNAGATDCVTACGAAEYLNSAGTQCTA 589
           CQ+C SG N  L         CA C       Y +A  T   T C   +YL S GT   +
Sbjct: 269 CQSC-SGANTDLSPAGAGVAGCAACT------YDSAKVT--CTKCETGKYLKSDGTCADS 319

Query: 588 TCGTDEYGATTPSRACKKXXXXXXXXXXXXXXXXXXXXXXSKYLAADKXXXXXXXXXXXX 409
                E+    P+   K                        K L                
Sbjct: 320 CTANTEFVKNDPTNGNKCVSCGDQTDGIADCKTCSKTGDTLKCLTCGDSKKPNAD----- 374

Query: 408 XXXXXTNKVCFSCTSITDCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXX 229
                    C +CT ITDC  C     CA C S+K +S    +CL  C    + D     
Sbjct: 375 ------GTACVACT-ITDCASCDKENVCAACTSNKKLSPLKDACLDGCPAGTYDD----- 422

Query: 228 XXXXXKIHADCVACGS---TTACTKC--GSSKYVSTDSK---SCVTDCGSGETKDAGTMM 73
                  H  C  C +    T+CT C  G     +TDS     C+ +C     ++    M
Sbjct: 423 NNVCTPCHTSCAECNNNAEATSCTACYPGHVLNRTTDSSPAGMCIPECTGRYVENCEAGM 482

Query: 72  CKKSTTGSFGALKYAMG 22
           C     GS    KYA+G
Sbjct: 483 CTAVLGGSKYCSKYAVG 499


>ref|XP_004256302.1| hypothetical protein EIN_218430 [Entamoeba invadens IP1]
           gi|440296765|gb|ELP89531.1| hypothetical protein
           EIN_218430 [Entamoeba invadens IP1]
          Length = 731

 Score = 58.5 bits (140), Expect = 3e-06
 Identities = 35/105 (33%), Positives = 48/105 (45%), Gaps = 3/105 (2%)
 Frame = -1

Query: 354 CTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXXXXKIHADCVACGSTT 175
           C  CA + +C  C  +K++  D+  C   C    +SD          K  A+C  C    
Sbjct: 369 CKVCAVD-KCYQCQENKYLKEDLFECFDKCDDGYYSDDSTATSFKCKKCLAECQNCTDGI 427

Query: 174 ACTKCGSSKYVSTDSKSCV--TDCGSGETKD-AGTMMCKKSTTGS 49
            CT C  +K +  D+  CV  T+C SG  KD   T +CKK  TGS
Sbjct: 428 KCTSCPENKLLLEDTGKCVTATECPSGYYKDTTATAICKKCKTGS 472


>gb|EWS73791.1| hypothetical protein TTHERM_000344108 [Tetrahymena thermophila SB210]
          Length = 1302

 Score = 58.2 bits (139), Expect = 4e-06
 Identities = 31/112 (27%), Positives = 56/112 (50%)
 Frame = -1

Query: 390  NKVCFSCTSITDCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXXXXK 211
            ++ C  C  + +C KC+S+  C  C  + ++ N+  SCL+ C    F D          +
Sbjct: 860  SRACQKC--MNNCDKCSSSNSCDQCVQNFYLLNN-NSCLSECPQKYFKDSQKNICVLCFE 916

Query: 210  IHADCVACGSTTACTKCGSSKYVSTDSKSCVTDCGSGETKDAGTMMCKKSTT 55
               +CV C +T++C KCG+  ++  D + CV +C  G  +D    +C+  +T
Sbjct: 917  ---NCVKCSNTSSCLKCGNGLHL-LDGQQCVQNCPDGYFEDYSLGICQICST 964


>ref|XP_001018455.1| Neurohypophysial hormone, N-terminal Domain containing protein
           [Tetrahymena thermophila]
          Length = 1410

 Score = 58.2 bits (139), Expect = 4e-06
 Identities = 31/112 (27%), Positives = 56/112 (50%)
 Frame = -1

Query: 390 NKVCFSCTSITDCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXXXXK 211
           ++ C  C  + +C KC+S+  C  C  + ++ N+  SCL+ C    F D          +
Sbjct: 666 SRACQKC--MNNCDKCSSSNSCDQCVQNFYLLNN-NSCLSECPQKYFKDSQKNICVLCFE 722

Query: 210 IHADCVACGSTTACTKCGSSKYVSTDSKSCVTDCGSGETKDAGTMMCKKSTT 55
              +CV C +T++C KCG+  ++  D + CV +C  G  +D    +C+  +T
Sbjct: 723 ---NCVKCSNTSSCLKCGNGLHL-LDGQQCVQNCPDGYFEDYSLGICQICST 770


>ref|XP_004260469.1| hypothetical protein EIN_432100, partial [Entamoeba invadens IP1]
           gi|440301291|gb|ELP93698.1| hypothetical protein
           EIN_432100, partial [Entamoeba invadens IP1]
          Length = 1308

 Score = 57.8 bits (138), Expect = 5e-06
 Identities = 40/123 (32%), Positives = 53/123 (43%), Gaps = 27/123 (21%)
 Frame = -1

Query: 381 CFSCTSI-TDCTKCASNTQCATCASSKF-----------VSNDMTSC-----LTACLGTD 253
           C +C SI T+C KC SNT C+TC +  F           V +D  +C      T C+   
Sbjct: 46  CGTCQSIITNCQKCYSNTTCSTCLTKYFPNSGQCALCSSVISDCETCNSGTECTKCINNK 105

Query: 252 F-SDXXXXXXXXXXKIHADCVACGSTTACTKCGSSKYVSTDSKSCV---------TDCGS 103
           +  D          +I A C AC S T CT C    Y++++SK  V         T C S
Sbjct: 106 YLKDTDRTKCFACGEIMAGCTACTSGTVCTTCSDGYYINSNSKCGVCSTTLTTNCTKCSS 165

Query: 102 GET 94
           G T
Sbjct: 166 GTT 168


>gb|EAR90505.2| zinc finger lsd1 subclass family protein [Tetrahymena thermophila
            SB210]
          Length = 1184

 Score = 57.4 bits (137), Expect = 6e-06
 Identities = 55/263 (20%), Positives = 89/263 (33%), Gaps = 24/263 (9%)
 Frame = -1

Query: 780  TLQTCTAGTPNSPCQTCQSGTNNGLCATCVSAGGPYY-------------------LNAG 658
            T  TC+    N+ C TC++ T      +C+S     Y                   ++ G
Sbjct: 510  TCLTCSTPQSNTSCLTCKTNTYLNPNKSCLSNCPSKYWTDQTNWKCQVCDPTCYNCISPG 569

Query: 657  ATDCVTACGAAEYLNSAGTQCTATCGTDEYGAT-TPSRACKKXXXXXXXXXXXXXXXXXX 481
              +  T+C    YL S   QC  TC T+ +  T T +  C+                   
Sbjct: 570  DQNSCTSCSGTLYLFS--NQCINTCPTNTFYLTQTNNNICQPCHNSCKTCDGPNNNNCQS 627

Query: 480  XXXXSKYLAADKXXXXXXXXXXXXXXXXXTNKVCFSCTSITDCTKCA--SNTQCATCASS 307
                S +  + K                  N +C SC     C  C+  +N  C TC  S
Sbjct: 628  CLALSLFQQSSKTCVSQCNPNQYQNNSDPNNLICSSCDP--SCATCSGPNNNNCVTCTGS 685

Query: 306  KFVSNDMTSCLTACLGTDFSDXXXXXXXXXXKIHADCVACG--STTACTKCGSSKYVSTD 133
             F+  +   C++ C    +++             + C  C   +T  CT C    Y    
Sbjct: 686  LFLYQNQ--CISNCPKKYYNNTQNNQCTQCD---SSCYTCNGIATNNCTSCQLPLYFEPT 740

Query: 132  SKSCVTDCGSGETKDAGTMMCKK 64
            S  C+ +C S +  D  ++ CK+
Sbjct: 741  SNQCLQNCNSNQYPDTNSISCKQ 763


>ref|XP_004257140.1| hypothetical protein EIN_014980 [Entamoeba invadens IP1]
           gi|440297728|gb|ELP90369.1| hypothetical protein
           EIN_014980 [Entamoeba invadens IP1]
          Length = 574

 Score = 57.0 bits (136), Expect = 8e-06
 Identities = 34/106 (32%), Positives = 44/106 (41%), Gaps = 3/106 (2%)
 Frame = -1

Query: 357 DCTKCASNTQCATCASSKFVSNDMTSCLTACLGTDFSDXXXXXXXXXXKIHADCVACGST 178
           +C  CA + +C  C   K++  D+  C   C    F D          K   DC  C   
Sbjct: 211 NCKVCAVD-KCYQCKDDKYLKEDLFECFDKCDDGYFKDTINENNHTCGKCQPDCKTCTDG 269

Query: 177 TACTKCGSSKYVSTDSKSCV--TDCGSGETKD-AGTMMCKKSTTGS 49
           T CT C  +  +  D+  CV  T+C SG  KD      CKK  TGS
Sbjct: 270 TKCTSCPENALLLEDTGKCVTATECPSGYYKDKTAAATCKKCKTGS 315


>ref|XP_004035100.1| zinc finger lsd1 subclass family protein, putative
           [Ichthyophthirius multifiliis]
           gi|340505265|gb|EGR31614.1| zinc finger lsd1 subclass
           family protein, putative [Ichthyophthirius multifiliis]
          Length = 1266

 Score = 57.0 bits (136), Expect = 8e-06
 Identities = 34/117 (29%), Positives = 53/117 (45%), Gaps = 5/117 (4%)
 Frame = -1

Query: 390 NKVCFSCTSITDCTKCAS--NTQCATCASSKFVSNDMTSCLTACLGTDF-SDXXXXXXXX 220
           N +C +C S   CT+C     T+C  C+ S F+  D T C+++C    +  +        
Sbjct: 593 NNICDNCNS--KCTECIGPLETECTKCSESLFL--DQTQCISSCPERKYKKNGPGNINNI 648

Query: 219 XXKIHADCVACG--STTACTKCGSSKYVSTDSKSCVTDCGSGETKDAGTMMCKKSTT 55
               H  C  C   S   CTKC  SKY    +  C+TDCG+ +  +  T  C+  ++
Sbjct: 649 CDDCHNTCYLCNGPSDNECTKCSGSKYFK--ANKCLTDCGNHQYGNPITQNCENCSS 703


Top