BLASTX nr result
ID: Ophiopogon26_contig00037190
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Ophiopogon26_contig00037190 (2602 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EXX72996.1| hypothetical protein RirG_064170 [Rhizophagus irr... 1434 0.0 gb|POG59933.1| hypothetical protein GLOIN_2v814911 [Rhizophagus ... 885 0.0 gb|PKK77274.1| hypothetical protein RhiirC2_732395, partial [Rhi... 376 e-122 gb|PKC72120.1| hypothetical protein RhiirA1_412246, partial [Rhi... 376 e-122 gb|PKC13420.1| hypothetical protein RhiirA5_351663, partial [Rhi... 376 e-121 gb|PKY45445.1| hypothetical protein RhiirA4_401225, partial [Rhi... 372 e-120 gb|KFH71756.1| hypothetical protein MVEG_02051 [Mortierella vert... 194 6e-47 gb|POG59934.1| hypothetical protein GLOIN_2v815234 [Rhizophagus ... 176 4e-42 ref|XP_003290038.1| hypothetical protein DICPUDRAFT_36750 [Dicty... 166 3e-39 ref|XP_629657.1| hypothetical protein DDB_G0292494 [Dictyosteliu... 135 7e-29 ref|XP_004359171.1| hypothetical protein DFA_01202 [Cavenderia f... 130 1e-27 emb|CCC91195.1| conserved hypothetical protein [Trypanosoma cong... 129 2e-27 ref|XP_012756146.1| hypothetical protein SAMD00019534_041870 [Ac... 119 2e-24 ref|XP_001684979.1| conserved hypothetical protein [Leishmania m... 117 1e-23 gb|KYQ93255.1| hypothetical protein DLAC_05909 [Tieghemostelium ... 115 5e-23 ref|XP_002674212.1| predicted protein [Naegleria gruberi] >gi|28... 106 4e-20 gb|EPY40371.1| tetratricopeptidedomain 39C [Angomonas deanei] 103 4e-19 ref|XP_009307371.1| tetratricopeptide repeat domain 39B [Trypano... 101 2e-18 gb|ORC90684.1| tetratricopeptide repeat domain 39B [Trypanosoma ... 100 7e-18 ref|XP_002682030.1| predicted protein [Naegleria gruberi] >gi|28... 98 3e-17 >gb|EXX72996.1| hypothetical protein RirG_064170 [Rhizophagus irregularis DAOM 197198w] dbj|GBC41094.1| tetratricopeptide repeat domain 39b [Rhizophagus irregularis DAOM 181602] Length = 1578 Score = 1434 bits (3711), Expect = 0.0 Identities = 736/827 (88%), Positives = 739/827 (89%) Frame = -1 Query: 2554 QTIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNEN 2375 QTIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNEN Sbjct: 726 QTIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNEN 785 Query: 2374 PELSNALMEAEKLAIKVCENKDDFETSFSLFRTDIWKAYIRPNNSPSEDEAGFASLRANF 2195 PELSNALMEAEKLAIKVCENKDDFETSFSLFRTDIWKAYIRPNNSPSEDEAGFASLRANF Sbjct: 786 PELSNALMEAEKLAIKVCENKDDFETSFSLFRTDIWKAYIRPNNSPSEDEAGFASLRANF 845 Query: 2194 RWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKH 2015 RWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKH Sbjct: 846 RWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKH 905 Query: 2014 ELXXXXXXXXXXSAILGRGSISNMVGFGGNKNQNQDGSSATLDGINNNVEIYSDIEDCLE 1835 EL SAILGRGSISNMVGFGGNKNQ DGSSATLDGINNNVEIYSDIEDCLE Sbjct: 906 ELSSSSNRWSLGSAILGRGSISNMVGFGGNKNQ--DGSSATLDGINNNVEIYSDIEDCLE 963 Query: 1834 FGIGVFYFILSIVPGSFQSILKAIGFNAERDEGIQMLENCYLRDGVRAPFAAFFLLVNYL 1655 FGIGVFYFILSIVPGSFQSILKAIGFNAERDEGIQMLENCYLRDGVRAP AAFFLLVNYL Sbjct: 964 FGIGVFYFILSIVPGSFQSILKAIGFNAERDEGIQMLENCYLRDGVRAPSAAFFLLVNYL 1023 Query: 1654 FLSRGLADPTLSLNKAGAIVQECVKKYPKSSPFLFMACQQARKTGQIKEALNHITNGIYS 1475 FLSRGLADPTLSLNKAG+IVQECVKKYPKSSPFLFMACQQARKTGQIKEALNHITNGIYS Sbjct: 1024 FLSRGLADPTLSLNKAGSIVQECVKKYPKSSPFLFMACQQARKTGQIKEALNHITNGIYS 1083 Query: 1474 CEMIGVTSTNYRFEKGMTYLINLDFTAAKDIFELLFYGNTIVFTGKNGSIRXXXXXXXXX 1295 CEMIGVTSTNYRFEKGMTYLINLDFTAAKDIFELLFYGNTIVFTGKNGSIR Sbjct: 1084 CEMIGVTSTNYRFEKGMTYLINLDFTAAKDIFELLFYGNTIVFTGKNGSIRLHGSIRGSI 1143 Query: 1294 XXXXXLLSKDGSTKKDNSLLKFFEFELRPFCGLCLAGCYLILKSSQIAMKEALDVLKQTK 1115 LLSKDGSTKKDNSLLKFFEFELRPFCGLCLAGCYLILKSSQIAMKEALDVLKQTK Sbjct: 1144 HGSSRLLSKDGSTKKDNSLLKFFEFELRPFCGLCLAGCYLILKSSQIAMKEALDVLKQTK 1203 Query: 1114 AMXXXXXXXXXXXXXIGLLGTSASXXXXXXXXXXXNSDKKEPKTNRYNKFAGRHSAKDVE 935 AM IGLLGTSAS NSDKKEPKTNRYNKFAGRHSAKDVE Sbjct: 1204 AMTNQNNENNSITNSIGLLGTSASGLVGFGVGGNNNSDKKEPKTNRYNKFAGRHSAKDVE 1263 Query: 934 NNSVTPFLIFIILYLRRDIFYMPLELKKRWANLLESTWKNYDKSIDLDTNAVYLLIRGVF 755 NNSVTPFLIFIILYLRRDIFYMPLELKKRWANLLESTWKNYDKSID DTNAVYLLIRG+F Sbjct: 1264 NNSVTPFLIFIILYLRRDIFYMPLELKKRWANLLESTWKNYDKSIDPDTNAVYLLIRGIF 1323 Query: 754 EKFLNQDDPTIAQITLCECLSLETGIVSETWVIPHCRYELGELFYKQFGNQEAATEQFRW 575 EKFLNQDDPTIAQ TLCECLSLETGIVSETWVIPHCRYELGELFYKQFGNQEAATEQFRW Sbjct: 1324 EKFLNQDDPTIAQKTLCECLSLETGIVSETWVIPHCRYELGELFYKQFGNQEAATEQFRW 1383 Query: 574 ILKGPRPISRGGGLXXXXXXXXXXXXXXXXXXXXXSYGNSNPDRFKKYEFSKVLKNRCTV 395 ILKGPRPISRGGGL SYGNSNPDRFKKYEFSKVLKNRCTV Sbjct: 1384 ILKGPRPISRGGGLISRRDSIASIASRASIDSNSSSYGNSNPDRFKKYEFSKVLKNRCTV 1443 Query: 394 ATDQIRANIIQPTNILNNNSKFEQPPSSIPLYHXXXXXXXXXXXXKEVQEQSLLQKHHRR 215 ATDQIRANIIQPTNIL NNSKFEQPPSSIPLYH KE+QEQSLLQKHHRR Sbjct: 1444 ATDQIRANIIQPTNIL-NNSKFEQPPSSIPLYHSRKKSGSSSSLLKEIQEQSLLQKHHRR 1502 Query: 214 QSSTDIKVGTPTNTENKKSGKNDTVLSSLFVKRHKPRSLSLPINEDR 74 QSSTDIKVGTPTNTENKKSGKNDTVLSSLFVKRHKPRSLSLP NEDR Sbjct: 1503 QSSTDIKVGTPTNTENKKSGKNDTVLSSLFVKRHKPRSLSLPTNEDR 1549 >gb|POG59933.1| hypothetical protein GLOIN_2v814911 [Rhizophagus irregularis DAOM 181602=DAOM 197198] Length = 525 Score = 885 bits (2288), Expect = 0.0 Identities = 457/513 (89%), Positives = 460/513 (89%) Frame = -1 Query: 2173 MADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKHELXXXXX 1994 MADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKHEL Sbjct: 1 MADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKHELSSSSN 60 Query: 1993 XXXXXSAILGRGSISNMVGFGGNKNQNQDGSSATLDGINNNVEIYSDIEDCLEFGIGVFY 1814 SAILGRGSISNMVGFGGNKNQ DGSSATLDGINNNVEIYSDIEDCLEFGIGVFY Sbjct: 61 RWSLGSAILGRGSISNMVGFGGNKNQ--DGSSATLDGINNNVEIYSDIEDCLEFGIGVFY 118 Query: 1813 FILSIVPGSFQSILKAIGFNAERDEGIQMLENCYLRDGVRAPFAAFFLLVNYLFLSRGLA 1634 FILSIVPGSFQSILKAIGFNAERDEGIQMLENCYLRDGVRAP AAFFLLVNYLFLSRGLA Sbjct: 119 FILSIVPGSFQSILKAIGFNAERDEGIQMLENCYLRDGVRAPSAAFFLLVNYLFLSRGLA 178 Query: 1633 DPTLSLNKAGAIVQECVKKYPKSSPFLFMACQQARKTGQIKEALNHITNGIYSCEMIGVT 1454 DPTLSLNKAG+IVQECVKKYPKSSPFLFMACQQARKTGQIKEALNHITNGIYSCEMIGVT Sbjct: 179 DPTLSLNKAGSIVQECVKKYPKSSPFLFMACQQARKTGQIKEALNHITNGIYSCEMIGVT 238 Query: 1453 STNYRFEKGMTYLINLDFTAAKDIFELLFYGNTIVFTGKNGSIRXXXXXXXXXXXXXXLL 1274 STNYRFEKGMTYLINLDFTAAKDIFELLFYGNTIVFTGKNGSIR LL Sbjct: 239 STNYRFEKGMTYLINLDFTAAKDIFELLFYGNTIVFTGKNGSIRLHGSIRGSIHGSSRLL 298 Query: 1273 SKDGSTKKDNSLLKFFEFELRPFCGLCLAGCYLILKSSQIAMKEALDVLKQTKAMXXXXX 1094 SKDGSTKKDNSLLKFFEFELRPFCGLCLAGCYLILKSSQIAMKEALDVLKQTKAM Sbjct: 299 SKDGSTKKDNSLLKFFEFELRPFCGLCLAGCYLILKSSQIAMKEALDVLKQTKAMTNQNN 358 Query: 1093 XXXXXXXXIGLLGTSASXXXXXXXXXXXNSDKKEPKTNRYNKFAGRHSAKDVENNSVTPF 914 IGLLGTSAS NSDKKEPKTNRYNKFAGRHSAKDVENNSVTPF Sbjct: 359 ENNSITNSIGLLGTSASGLVGFGVGGNNNSDKKEPKTNRYNKFAGRHSAKDVENNSVTPF 418 Query: 913 LIFIILYLRRDIFYMPLELKKRWANLLESTWKNYDKSIDLDTNAVYLLIRGVFEKFLNQD 734 LIFIILYLRRDIFYMPLELKKRWANLLESTWKNYDKSID DTNAVYLLIRG+FEKFLNQD Sbjct: 419 LIFIILYLRRDIFYMPLELKKRWANLLESTWKNYDKSIDPDTNAVYLLIRGIFEKFLNQD 478 Query: 733 DPTIAQITLCECLSLETGIVSETWVIPHCRYEL 635 DPTIAQ TLCECLSLETGIVSETWVIPHCRYE+ Sbjct: 479 DPTIAQKTLCECLSLETGIVSETWVIPHCRYEV 511 >gb|PKK77274.1| hypothetical protein RhiirC2_732395, partial [Rhizophagus irregularis] Length = 217 Score = 376 bits (965), Expect = e-122 Identities = 182/182 (100%), Positives = 182/182 (100%) Frame = -1 Query: 2554 QTIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNEN 2375 QTIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNEN Sbjct: 36 QTIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNEN 95 Query: 2374 PELSNALMEAEKLAIKVCENKDDFETSFSLFRTDIWKAYIRPNNSPSEDEAGFASLRANF 2195 PELSNALMEAEKLAIKVCENKDDFETSFSLFRTDIWKAYIRPNNSPSEDEAGFASLRANF Sbjct: 96 PELSNALMEAEKLAIKVCENKDDFETSFSLFRTDIWKAYIRPNNSPSEDEAGFASLRANF 155 Query: 2194 RWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKH 2015 RWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKH Sbjct: 156 RWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKH 215 Query: 2014 EL 2009 EL Sbjct: 216 EL 217 >gb|PKC72120.1| hypothetical protein RhiirA1_412246, partial [Rhizophagus irregularis] gb|PKY19663.1| hypothetical protein RhiirB3_407183, partial [Rhizophagus irregularis] Length = 217 Score = 376 bits (965), Expect = e-122 Identities = 182/182 (100%), Positives = 182/182 (100%) Frame = -1 Query: 2554 QTIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNEN 2375 QTIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNEN Sbjct: 36 QTIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNEN 95 Query: 2374 PELSNALMEAEKLAIKVCENKDDFETSFSLFRTDIWKAYIRPNNSPSEDEAGFASLRANF 2195 PELSNALMEAEKLAIKVCENKDDFETSFSLFRTDIWKAYIRPNNSPSEDEAGFASLRANF Sbjct: 96 PELSNALMEAEKLAIKVCENKDDFETSFSLFRTDIWKAYIRPNNSPSEDEAGFASLRANF 155 Query: 2194 RWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKH 2015 RWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKH Sbjct: 156 RWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKH 215 Query: 2014 EL 2009 EL Sbjct: 216 EL 217 >gb|PKC13420.1| hypothetical protein RhiirA5_351663, partial [Rhizophagus irregularis] Length = 218 Score = 376 bits (965), Expect = e-121 Identities = 182/182 (100%), Positives = 182/182 (100%) Frame = -1 Query: 2554 QTIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNEN 2375 QTIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNEN Sbjct: 37 QTIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNEN 96 Query: 2374 PELSNALMEAEKLAIKVCENKDDFETSFSLFRTDIWKAYIRPNNSPSEDEAGFASLRANF 2195 PELSNALMEAEKLAIKVCENKDDFETSFSLFRTDIWKAYIRPNNSPSEDEAGFASLRANF Sbjct: 97 PELSNALMEAEKLAIKVCENKDDFETSFSLFRTDIWKAYIRPNNSPSEDEAGFASLRANF 156 Query: 2194 RWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKH 2015 RWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKH Sbjct: 157 RWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKH 216 Query: 2014 EL 2009 EL Sbjct: 217 EL 218 >gb|PKY45445.1| hypothetical protein RhiirA4_401225, partial [Rhizophagus irregularis] Length = 215 Score = 372 bits (956), Expect = e-120 Identities = 180/180 (100%), Positives = 180/180 (100%) Frame = -1 Query: 2554 QTIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNEN 2375 QTIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNEN Sbjct: 36 QTIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNEN 95 Query: 2374 PELSNALMEAEKLAIKVCENKDDFETSFSLFRTDIWKAYIRPNNSPSEDEAGFASLRANF 2195 PELSNALMEAEKLAIKVCENKDDFETSFSLFRTDIWKAYIRPNNSPSEDEAGFASLRANF Sbjct: 96 PELSNALMEAEKLAIKVCENKDDFETSFSLFRTDIWKAYIRPNNSPSEDEAGFASLRANF 155 Query: 2194 RWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKH 2015 RWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKH Sbjct: 156 RWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKH 215 >gb|KFH71756.1| hypothetical protein MVEG_02051 [Mortierella verticillata NRRL 6337] Length = 2845 Score = 194 bits (492), Expect = 6e-47 Identities = 107/247 (43%), Positives = 145/247 (58%), Gaps = 9/247 (3%) Frame = -1 Query: 1870 VEIYSDIEDCLEFGIGVFYFILSIVPGSFQSILKAIGFNAERDEGIQMLENCYLRDGVRA 1691 V + D ED L++GIG+FYFI+SIVP S L+ IG + ++GI+ E+ R RA Sbjct: 2011 VNVLEDTEDYLQYGIGLFYFIVSIVPKSLVPALRTIGLQSNPEQGIKNFEDVLSRKNGRA 2070 Query: 1690 PFAAFFLLVNYLFLSRGLADPTLSLNKAGAIVQECVKKYPKSSPFLFMACQQARKTG-QI 1514 PFAA FLL+NYLFL RG+ADP++SL +AG I+ E +++ P S +L MAC ARKTG I Sbjct: 2071 PFAALFLLINYLFLPRGVADPSISLGRAGVILNEALRRCPNGSSYLLMACHHARKTGNMI 2130 Query: 1513 KEALNHITNGIYSCEMIGVTSTNYRFEKGMTYLINLDFTAAKDIFELLFYGNTIVFTGKN 1334 ALNHIT GI +CE G+ S NYRFE G+T+ I+ +F A DIFE+L+ V +N Sbjct: 2131 PSALNHITRGIQTCEAAGIPSINYRFELGLTFFIHQEFGKAADIFEILWRKYISVPDRQN 2190 Query: 1333 --------GSIRXXXXXXXXXXXXXXLLSKDGSTKKDNSLLKFFEFELRPFCGLCLAGCY 1178 G R LS G+ ++D +FEL PFCGLCL Sbjct: 2191 QDNILRGYGGRRKGRSQSLGQPVQASALSVSGTVEEDEE----DDFELAPFCGLCLIASK 2246 Query: 1177 LILKSSQ 1157 ++++ Q Sbjct: 2247 VVVRLGQ 2253 Score = 146 bits (369), Expect = 6e-32 Identities = 70/143 (48%), Positives = 92/143 (64%) Frame = -1 Query: 988 KTNRYNKFAGRHSAKDVENNSVTPFLIFIILYLRRDIFYMPLELKKRWANLLESTWKNYD 809 K NR+NKFA K ++ ++PFL +ILYLRRD+ YM L +++ LLES WK Sbjct: 2380 KLNRFNKFAWNQCQKSIQKGRISPFLPLVILYLRRDLAYMKPVLLRKYRTLLESIWKTVS 2439 Query: 808 KSIDLDTNAVYLLIRGVFEKFLNQDDPTIAQITLCECLSLETGIVSETWVIPHCRYELGE 629 + +D DT A+YLL+ V + L DD T A L +CL LE+ I SE WV+P+C YELGE Sbjct: 2440 QPVDADTQAIYLLLSAVVHRQLLPDDATFAYTALTDCLLLESIIESEMWVVPYCHYELGE 2499 Query: 628 LFYKQFGNQEAATEQFRWILKGP 560 L +K+ AA EQF+WILKGP Sbjct: 2500 LLFKKLQLPHAAIEQFQWILKGP 2522 Score = 100 bits (248), Expect = 2e-17 Identities = 57/176 (32%), Positives = 94/176 (53%), Gaps = 11/176 (6%) Frame = -1 Query: 2539 NSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNENPELSN 2360 ++ N + WNNEF A IF PRW + AE+ +V+ L+ GQ L + +L++ Sbjct: 1716 SALNEGIWACWNNEFGAALEIFKEHAATYPRWCLAAAEVHIVRQLISGQ-LSEPDLDLTD 1774 Query: 2359 ALMEAEKLAIKVCENKDDFETSF----SLFRTDIWKAYIRPNNSPSEDEAGFASLRANFR 2192 AL +EK++ +V + K +F++++ SL D + N +LR N++ Sbjct: 1775 ALQLSEKVSSRVLDKKQEFDSNYMGYRSLCAADASLVTVNDN-----------TLRQNYK 1823 Query: 2191 WDCELAMADILLFHAILQVVGGSEIKGAF-------NLRKAWKTYSKVRDEIDRIK 2045 WDCE+A D+LL+ ILQ+ S+ KG F LR+AWK Y +++ E++ K Sbjct: 1824 WDCEMAFYDMLLYRGILQLTSASDTKGTFTDIKGGLQLRRAWKGYMRIKQEMELAK 1879 >gb|POG59934.1| hypothetical protein GLOIN_2v815234 [Rhizophagus irregularis DAOM 181602=DAOM 197198] Length = 821 Score = 176 bits (447), Expect = 4e-42 Identities = 84/84 (100%), Positives = 84/84 (100%) Frame = -1 Query: 2554 QTIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNEN 2375 QTIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNEN Sbjct: 726 QTIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNEN 785 Query: 2374 PELSNALMEAEKLAIKVCENKDDF 2303 PELSNALMEAEKLAIKVCENKDDF Sbjct: 786 PELSNALMEAEKLAIKVCENKDDF 809 >ref|XP_003290038.1| hypothetical protein DICPUDRAFT_36750 [Dictyostelium purpureum] gb|EGC33419.1| hypothetical protein DICPUDRAFT_36750, partial [Dictyostelium purpureum] Length = 676 Score = 166 bits (421), Expect = 3e-39 Identities = 163/699 (23%), Positives = 259/699 (37%), Gaps = 47/699 (6%) Frame = -1 Query: 2551 TIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNENP 2372 TI N A+ WNN+F EAE IF+ + PR+++ AE +K + L+ EN Sbjct: 4 TIINNEIKNAIDLIWNNKFKEAEEIFSSKSSSTPRYSLHNAETAFLKSFITANELETENA 63 Query: 2371 ELSNALMEAEKLAIKVCENKDDFETSFSLFRTD-IWKAYIRPNNSPSEDEAGFASLRANF 2195 L K+ + K+ E S +F ++ + + Y PN ED N Sbjct: 64 LL------------KLKQTKELAEQSIRIFESNKVPQNYFPPNIKTKEDFK-------NH 104 Query: 2194 RWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKH 2015 DC++ + D L A+LQ+ + KG FNLRK+WK + D + ++K Sbjct: 105 ILDCKIVLGDSLYMLAVLQITRDHKFKGCFNLRKSWKIFE---DTLRQVK---------- 151 Query: 2014 ELXXXXXXXXXXSAILGRGSISNMVGFGGNKNQNQDGSSATLDGINNNVEIYSDIEDCLE 1835 D N + SD+ + L Sbjct: 152 ------------------------------------------DDEENGFQYSSDLTELLR 169 Query: 1834 FGIGVFYFILSIVPGSFQSILKAIGFNAERDEGIQMLENCYLRDGVRAPFAAFFLLVNYL 1655 FG G FYF +SI+P + +++ +GF A+RD G+ L++C + G+R PFA +L N L Sbjct: 170 FGCGFFYFAVSIIPQKYLKLVELVGFKADRDMGLMYLKDCLSKGGIRTPFATMVILFNNL 229 Query: 1654 FLSRGLADPTLSLNKAGAIVQECVKKYPKSSPFLFMACQQARKTGQIKEALNHITNGIYS 1475 L RGL +PT LN+A ++++ ++KYP S F M RK +I L + I + Sbjct: 230 LLPRGLYNPTHQLNEAQQLIEKNLEKYPYGSLFQVMGSHCYRKQCKIDMGLQCMEVAISN 289 Query: 1474 CEMIGVTSTNYRFEKGMTYLINLDFTAAKDIFELLFYGNTIVFTGKNGSIRXXXXXXXXX 1295 C + Y++E Y I L + A +FE Sbjct: 290 CNSLSKAPLIYKYELANCYCIKLQWDKAIQVFE--------------------------- 322 Query: 1294 XXXXXLLSKDGSTKKDNSLLKFFEFELRPFCGLCLAGCYLILKSSQIAMKEALDVLKQTK 1115 L+K +F++R C L L CY+I + ++ ++ K Sbjct: 323 -----------------ELVKEEKFQIRALCSLQLGSCYIINGQKEKGIQSFNNIKNFCK 365 Query: 1114 AMXXXXXXXXXXXXXIGLLGTSASXXXXXXXXXXXNSDKKEPKTNRYNKFAGRHSAKDVE 935 K NK A R+ Sbjct: 366 --------------------------------------KSSSVDIVINKQAQRYI----- 382 Query: 934 NNSVTPFLIFIILYLRRDIFYMPLELKKRWANLLESTWK--NYDKSIDLDT--------- 788 N+ F + +LYLRRD+ M L + ++L S K DK I T Sbjct: 383 -NNGGSFSVLEVLYLRRDMAKMEKSLANKTMDILNSIGKKLGVDKPIPQQTVQPISNNTT 441 Query: 787 -----------------------------------NAVYLLIRGVFEKFLNQDDPTIAQI 713 A YLL++G K ++ D ++ Sbjct: 442 INNISLSTSSFLSRTFSNLTKKKEEIQQNDILIHDRAAYLLLKGAIYKSIDLYDQSMECF 501 Query: 712 TLCECLSLETGIVSETWVIPHCRYELGELFYKQFGNQEA 596 E LS+ + +++E + IP+ YEL E +Y + N +A Sbjct: 502 E--ELLSM-SHLLTEKFYIPYSYYELSEGYYHKKNNAKA 537 >ref|XP_629657.1| hypothetical protein DDB_G0292494 [Dictyostelium discoideum AX4] gb|EAL61226.1| hypothetical protein DDB_G0292494 [Dictyostelium discoideum AX4] Length = 795 Score = 135 bits (340), Expect = 7e-29 Identities = 105/419 (25%), Positives = 168/419 (40%), Gaps = 25/419 (5%) Frame = -1 Query: 2551 TIFTNSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNENP 2372 + F N N A++ WNN F EAE IF + PR+++ AE +K + NEN Sbjct: 14 SFFPNEINNALELLWNNNFKEAEKIFEKHSKTTPRYSLHNAETSFLKSFITA----NEN- 68 Query: 2371 ELSNALMEAEKLAIKVCENKDDFETSFSLFRTDIWKAYIRPNNSPSEDEAGFASLRA--- 2201 E E K+ E K+ E F N P F + Sbjct: 69 -------ETELAIKKLKETKELSEKCIKQFEQ---------NKIPQNYNYQFKIVNQKND 112 Query: 2200 --NFRWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKN 2027 N DC++ + D L AILQ+ + KG NLR +WK + + +I + Sbjct: 113 FKNHILDCKIILGDSLYMLAILQISSDQKFKGCLNLRSSWKVFEETLKQIKQ-------- 164 Query: 2026 DHKHELXXXXXXXXXXSAILGRGSISNMVGFGGNKNQNQDGSSATLDGINNNVEIYSDIE 1847 + Q++ SS + ++ + D+ Sbjct: 165 ----------------------------------EEQSESSSSPS----PSSFKYSKDLL 186 Query: 1846 DCLEFGIGVFYFILSIVPGSFQSILKAIGFNAERDEGIQMLENCYLRDGVRAPFAAFFLL 1667 +CL FG+G FY+ SI+P + +++ +GF +R++G+ + +C G+R+ FA +L Sbjct: 187 ECLRFGVGFFYYATSIIPSKYLKLIELVGFKCDREQGLIYIRDCLNSGGIRSQFATMLIL 246 Query: 1666 VNYLFLSRGLADPTLSLNKAGAIVQECVKKYPKSSPFLFMACQQARKTGQIKEALNHITN 1487 N L L RGL +P LN+A +++ ++KYP SS F RK +I + + Sbjct: 247 FNNLILPRGLYNPVHQLNEAQQLIENNLEKYPNSSLFQVFGSHCYRKQNKIDLGIRCLEF 306 Query: 1486 GI----YSCEMIGVTSTN----------------YRFEKGMTYLINLDFTAAKDIFELL 1370 I YS N +++E Y I + + A DIFELL Sbjct: 307 AISFYNYSNNNYNDNDDNNNHQQQQTPQDKQPLIFKYELACCYCIKMQWNKAIDIFELL 365 >ref|XP_004359171.1| hypothetical protein DFA_01202 [Cavenderia fasciculata] gb|EGG21321.1| hypothetical protein DFA_01202 [Cavenderia fasciculata] Length = 595 Score = 130 bits (326), Expect = 1e-27 Identities = 112/465 (24%), Positives = 192/465 (41%), Gaps = 42/465 (9%) Frame = -1 Query: 1879 NNNVEIYSDIEDCLEFGIGVFYFILSIVPGSFQSILKAIGFNAERDEGIQMLENCYLRDG 1700 ++ ++ D+ +CL FG G FYF +SI+P + ++ IGF ++RD G+Q + +C + G Sbjct: 150 SSTIKYEEDLLECLHFGAGFFYFAMSIIPSNVIKFVELIGFKSDRDLGLQYIRDCSEKAG 209 Query: 1699 VRAPFAAFFLLVNYLFLSRGLADPTLSLNKAGAIVQECVKKYPKSSPFLFMACQQARKTG 1520 VR+ FA LL N L L RGL +PT L +A ++ + +K+YP+ S F MA RK Sbjct: 210 VRSAFATMVLLFNNLLLPRGLYNPTKHLKEAEVLIDDNLKRYPQGSLFQVMASHCYRKQC 269 Query: 1519 QIKEALNHITNGIYSCEMIGVTSTNYRFEKGMTYLINLDFTAAKDIFELLFYGNTIVFTG 1340 ++ L + I +C + Y +E YLI LD+++A IFE Sbjct: 270 RVDLGLECMQRAIENCSALPKPPLIYSYELANCYLIKLDWSSAIGIFE------------ 317 Query: 1339 KNGSIRXXXXXXXXXXXXXXLLSKDGSTKKDNSLLKFFEFELRPFCGLCLAGCYLIL--- 1169 SL+K F++R CGL LAGCY+++ Sbjct: 318 --------------------------------SLVKEENFQIRALCGLQLAGCYVMMGEP 345 Query: 1168 KSSQIAMKEALDVLKQTKAMXXXXXXXXXXXXXIGLLGTSASXXXXXXXXXXXNSDKKEP 989 K +Q A + D +K Sbjct: 346 KKAQDAFTKIKDYVK--------------------------------------------- 360 Query: 988 KTNRYNKFAGRHSAKDVENNSVTPFLIFIILYLRRDIFYMP--------LELK------- 854 K++ + R S + + NN F F ++Y+RRD+ M LEL Sbjct: 361 KSSSVDPIILRQSQRYIANNG--HFSAFEVMYIRRDMAKMERISAEKTLLELTKCAQQAG 418 Query: 853 ----------------------KRWANLLESTWKNYDKSIDLDTNAVYLLIRGVFEKFLN 740 K +++L +S + + ++ A YLL++G K + Sbjct: 419 VEKPLACQANIGKNQPSTNSFFKSFSSLTKSKKDDLNTDNQIEDRASYLLLKGSVLKGIE 478 Query: 739 QDDPTIAQITLCECLSLETGIVSETWVIPHCRYELGELFY--KQF 611 + + +++ E +S++ + + + +P+C YE+ E ++ KQF Sbjct: 479 KYEESMSCFE--ELMSIQHLLQDKNFYVPYCLYEMSESYFHRKQF 521 Score = 63.5 bits (153), Expect = 2e-06 Identities = 48/169 (28%), Positives = 74/169 (43%), Gaps = 4/169 (2%) Frame = -1 Query: 2539 NSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNENPELSN 2360 N A+ WNN+F EAE + PR+++ +AE+ ++ + D E+ Sbjct: 5 NDIKDAISLVWNNKFQEAENLLKDKSTISPRYSLHFAEIIFLRSFITADVKDTESA--LK 62 Query: 2359 ALMEAEKLAIKVCENKDDFETSFSLFRTDIWKAYIRPNNSPSEDEAGFASLRANFRWDCE 2180 L E +LA K + +T Y + S E + N DC+ Sbjct: 63 RLKETRELAEKYISLLESNKTP---------PTYNQELKSKEEFK--------NNLLDCK 105 Query: 2179 LAMADILLFHAILQVVGGSEIKGAFNLRKAWKTY----SKVRDEIDRIK 2045 L + D L A+LQ+ +IKG FNLRK+WKT+ +V+D IK Sbjct: 106 LVLGDSLYMLAVLQLTRDHKIKGCFNLRKSWKTFEECLKQVKDTSSTIK 154 >emb|CCC91195.1| conserved hypothetical protein [Trypanosoma congolense IL3000] Length = 566 Score = 129 bits (323), Expect = 2e-27 Identities = 110/400 (27%), Positives = 174/400 (43%), Gaps = 10/400 (2%) Frame = -1 Query: 2539 NSFNIAVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNENPELSN 2360 +S AV WNNE+ EA+ + K PR+++ YA + L+K LM NE+ E+ N Sbjct: 16 HSVEEAVHMTWNNEYEEAKERLSLHKGSNPRFSLEYANVFLIKTLMDST---NESREMIN 72 Query: 2359 ALM-EAEKLAIKVCENKDDFETSFSLFRTD----IWKAYIRPNNSPSEDEAGFASLRAN- 2198 L+ EA+ LA F + S D I KA + N E A Sbjct: 73 DLLHEADSLARSSRHCDPMFIPALSTSSDDSHKHIRKADRQKNKKEYEKRRKSAQRSGEA 132 Query: 2197 ----FRWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGK 2030 ++ +C++ A+ L A+ Q++ + +GA NLRK+W Y K+ E++ A Sbjct: 133 FDNTWKLECDVIYAEALFVRAVGQLMMNAYFRGAINLRKSWGLYYKLVREME-----ADT 187 Query: 2029 NDHKHELXXXXXXXXXXSAILGRGSISNMVGFGGNKNQNQDGSSATLDGINNNVEIYSDI 1850 +DH I S++ Sbjct: 188 DDH----------------------------------------------------IPSEL 195 Query: 1849 EDCLEFGIGVFYFILSIVPGSFQSILKAIGFNAERDEGIQMLENCYLRDGVRAPFAAFFL 1670 + C+++G G FY L++VP S +L IGF ++R+ G Q L + DG+R+PFAA L Sbjct: 196 KMCIKYGAGTFYVFLALVPSSLMKVLNVIGFVSDRELGEQYLTEVFESDGIRSPFAALAL 255 Query: 1669 LVNYLFLSRGLADPTLSLNKAGAIVQECVKKYPKSSPFLFMACQQARKTGQIKEALNHIT 1490 YLFL GL +L KA I+ + +Y ++ F + RK G+I +AL I Sbjct: 256 CTLYLFLPTGLGKVEETLAKAERILNKVNARYEGNTYFNGYSNFYHRKRGEIDKALETIR 315 Query: 1489 NGIYSCEMIGVTSTNYRFEKGMTYLINLDFTAAKDIFELL 1370 + E +G+ R+ T ++L F+ AK+ +E L Sbjct: 316 RAEANAERVGLVPILIRYLGADTLFMDLRFSEAKERYEAL 355 >ref|XP_012756146.1| hypothetical protein SAMD00019534_041870 [Acytostelium subglobosum LB1] dbj|GAM21012.1| hypothetical protein SAMD00019534_041870 [Acytostelium subglobosum LB1] Length = 566 Score = 119 bits (299), Expect = 2e-24 Identities = 64/181 (35%), Positives = 102/181 (56%) Frame = -1 Query: 1912 QDGSSATLDGINNNVEIYSDIEDCLEFGIGVFYFILSIVPGSFQSILKAIGFNAERDEGI 1733 +D S DG N ++ +CL FG+G F + +SI+P F +++ +GF A+R++G+ Sbjct: 128 EDALSQVKDGHKYNEQLL----ECLHFGVGFFLYAMSIIPQKFLRLVEFVGFKADREQGM 183 Query: 1732 QMLENCYLRDGVRAPFAAFFLLVNYLFLSRGLADPTLSLNKAGAIVQECVKKYPKSSPFL 1553 Q +++C G+RAPFA LL N L L RGL +PT L +A ++Q+ +K+YP+ S F Sbjct: 184 QYIKHCGTNGGIRAPFANMVLLFNNLLLPRGLYNPTKHLREAELVIQDNMKRYPEGSLFQ 243 Query: 1552 FMACQQARKTGQIKEALNHITNGIYSCEMIGVTSTNYRFEKGMTYLINLDFTAAKDIFEL 1373 MA RK +I+E L + + +C Y +E Y+I LD+ A ++FE Sbjct: 244 VMASHCYRKQCRIEEGLACMVKALDNCATFQRPPLIYSYELANCYVIMLDWPKAIEVFER 303 Query: 1372 L 1370 L Sbjct: 304 L 304 >ref|XP_001684979.1| conserved hypothetical protein [Leishmania major strain Friedlin] emb|CAJ06903.1| conserved hypothetical protein [Leishmania major strain Friedlin] Length = 581 Score = 117 bits (293), Expect = 1e-23 Identities = 98/389 (25%), Positives = 168/389 (43%), Gaps = 11/389 (2%) Frame = -1 Query: 2524 AVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNENPELSNALMEA 2345 AV WNNE+SEA + K+ PR+ + +A + LVK LM + D E L + A Sbjct: 32 AVYMMWNNEYSEALELLRAKKDKNPRYALEWANVSLVKTLMSSTNEDRER--LLDLFKAA 89 Query: 2344 EKLAIKVCENK---------DDFETSFSLFRTDIWKAYIRPNNSPSEDEA--GFASLRAN 2198 + L+ N DD E + + + K + + +A G A Sbjct: 90 DSLSTSSKYNDPMFSEDDEDDDREEDKTRKQLKMEKKKNKKVFKARKRDATKGGAYFDQT 149 Query: 2197 FRWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHK 2018 ++ +C++ AD LL +I Q++ S +KG NLRKAW Y + I ++++D Sbjct: 150 WKLECDVIYADALLIRSIGQLMMNSYLKGGINLRKAWGCYHSL---IQQVEQD------- 199 Query: 2017 HELXXXXXXXXXXSAILGRGSISNMVGFGGNKNQNQDGSSATLDGINNNVEIYSDIEDCL 1838 I N + + +++ C+ Sbjct: 200 ---------------------IENRIPY--------------------------ELKMCI 212 Query: 1837 EFGIGVFYFILSIVPGSFQSILKAIGFNAERDEGIQMLENCYLRDGVRAPFAAFFLLVNY 1658 ++G G FY L++VP + +L IGF ++RD G Q L + + +R+PFAA L Y Sbjct: 213 KYGTGTFYAFLALVPANLMKVLSIIGFISDRDLGEQYLTEVFESNTIRSPFAALVLCTLY 272 Query: 1657 LFLSRGLADPTLSLNKAGAIVQECVKKYPKSSPFLFMACQQARKTGQIKEALNHITNGIY 1478 LFL GL + ++L++A +++ +YP ++ F A RK G+++ A+ IT Sbjct: 273 LFLPTGLGNVDVTLSRARRVLETMNARYPNNTYFNGYANFYFRKKGEVEPAVRSITLAAE 332 Query: 1477 SCEMIGVTSTNYRFEKGMTYLINLDFTAA 1391 + E G+ ++ T +N + A Sbjct: 333 NAEKAGLVPLLIKYLYADTLFMNQQWAEA 361 >gb|KYQ93255.1| hypothetical protein DLAC_05909 [Tieghemostelium lacteum] Length = 599 Score = 115 bits (289), Expect = 5e-23 Identities = 105/456 (23%), Positives = 180/456 (39%), Gaps = 34/456 (7%) Frame = -1 Query: 1879 NNNVEIYSDIEDCLEFGIGVFYFILSIVPGSFQSILKAIGFNAERDEGIQMLENCYLRDG 1700 + N+ + +CL FG G F+F +SI+P F I++ +GF A+RD G+ ++ C + G Sbjct: 151 DKNIHYDEGLLECLHFGAGFFFFAISIIPQKFLKIVEFVGFKADRDLGLNYIKECSQKGG 210 Query: 1699 VRAPFAAFFLLVNYLFLSRGLADPTLSLNKAGAIVQECVKKYPKSSPFLFMACQQARKTG 1520 +RAPFA +L N L L RGL +P L +A ++ ++KYP S F MA RK Sbjct: 211 IRAPFATMVILFNNLLLPRGLYNPVHQLKEAEELIVTNLQKYPNGSLFQVMASHCYRKQC 270 Query: 1519 QIKEALNHITNGIYSCEMIGVTSTNYRFEKGMTYLINLDFTAAKDIFELLFYGNTIVFTG 1340 +I L + I +C+ + Y++E Y + L++ A IFE Sbjct: 271 KIDLGLECMLKAISNCQQLSRAPLIYKYELANCYCMLLNWDEAIKIFE------------ 318 Query: 1339 KNGSIRXXXXXXXXXXXXXXLLSKDGSTKKDNSLLKFFEFELRPFCGLCLAGCYLILKSS 1160 L+ F++R C L LA CY+ + Sbjct: 319 --------------------------------ELVAEESFQIRALCALQLASCYVQVGKK 346 Query: 1159 QIAMKEALDVLKQTKAMXXXXXXXXXXXXXIGLLGTSASXXXXXXXXXXXNSDKKEPKTN 980 KEA+++ + K K + Sbjct: 347 ----KEAMEMFSKIKIY--------------------------------------SKKAS 364 Query: 979 RYNKFAGRHSAKDVENNSVTPFLIFIILYLRRDIFYMPLELKKR---------------- 848 + R S + + N + F F I+Y+RRD+ M + ++ Sbjct: 365 SIDPIIQRQSMRYL--NGSSDFSAFEIMYIRRDLAKMERGMAEKSLASLDEIAYRLSVGE 422 Query: 847 -----------WANLLESTWKNYDKSID-------LDTNAVYLLIRGVFEKFLNQDDPTI 722 +NL +S K D+S ++ + YLLI+G K +++ D + Sbjct: 423 KLNLTNSSSNSASNLFKSLLKKKDESQHEKNMEQLVNDRSAYLLIKGSILKGIDRIDEAL 482 Query: 721 AQITLCECLSLETGIVSETWVIPHCRYELGELFYKQ 614 L E + ++ I +E + +P+C YEL E ++++ Sbjct: 483 H--CLDEIVQMQNQI-TEKFYLPYCYYELSECYFQK 515 >ref|XP_002674212.1| predicted protein [Naegleria gruberi] gb|EFC41468.1| predicted protein [Naegleria gruberi] Length = 554 Score = 106 bits (265), Expect = 4e-20 Identities = 73/280 (26%), Positives = 128/280 (45%), Gaps = 2/280 (0%) Frame = -1 Query: 2203 ANFRWDCELAMADILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKND 2024 +NF D +L A+ L +LQ + GS ++G +N R ++ + +V +I+ +K D K Sbjct: 95 SNFELDLKLVDAEAHLIRGLLQFMDGSYLRGFYNFRNSYLKFKEVYHKIETLK-DEPKTS 153 Query: 2023 HKHELXXXXXXXXXXSAILGRGSISNMVGFGGNKNQNQDGSSATLDGINNNVEIYSDIED 1844 K ++SD+ Sbjct: 154 SKF--------------------------------------------------VHSDVIF 163 Query: 1843 CLEFGIGVFYFILSIVPGSFQSILKAIGFNAERDEGIQMLENCYLRDGVRAPFAAFFLLV 1664 FG+GVF F++S++P + SIL +GF+A+R+ G++++ G A+F L + Sbjct: 164 PSYFGMGVFNFLISVLPPTLASILSVLGFDADRELGLKLMTQAQEYGGRGLGNASFMLCI 223 Query: 1663 NYLFLSRGLADPTLSLNKAGAIVQECVKKYPKSSPFLFMACQQARKTGQIKEALNHITNG 1484 NYLF+ R L D ++LN I+ + K+YPK F ++ K G + ++ HI Sbjct: 224 NYLFVPRALQDRQVNLNLVKPILDKIYKQYPKGGFFKWVLSHYELKIGDLTNSVIHIKEA 283 Query: 1483 IYSC-EMIGVTSTNYRFEKGMTYLINLDFTAAKDI-FELL 1370 + E +G + NY FE G T ++ + F A++I +EL+ Sbjct: 284 VSLLKEAMGTSPNNYLFETGFTLVVAMQFEEAQEILYELI 323 >gb|EPY40371.1| tetratricopeptidedomain 39C [Angomonas deanei] Length = 552 Score = 103 bits (257), Expect = 4e-19 Identities = 96/364 (26%), Positives = 156/364 (42%), Gaps = 6/364 (1%) Frame = -1 Query: 2509 WNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNENPELSNALMEAEKLAI 2330 WNNEF EA I + K PR+ + +A MQ++K LM Q+ E A +E K+A Sbjct: 38 WNNEFDEAARILSGKKTTHPRYALEFAHMQIIKELMSSQNDKRE------ATLEFYKIAD 91 Query: 2329 KVCENKDDFETSFSLFRTDIWKAYIRPNNSPSEDEAGFASLRANFRWDCELAMADILLFH 2150 + E F + EDE + + + ELA Sbjct: 92 TLASATKYNEPMFEV---------------DDEDEVPDSPEKDALKSKRELA-------- 128 Query: 2149 AILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKHELXXXXXXXXXXSAI 1970 K +KA+K+ K ++ + D K + A+ Sbjct: 129 -----------KEKKKKKKAFKSNLKAAEKSGQSFNDIWKLE---------CDVIYADAL 168 Query: 1969 LGRGSISNMVGF---GGNKNQNQDGSSATL-DGINNNVE--IYSDIEDCLEFGIGVFYFI 1808 L R I M+ GG + G TL D + + E I ++E C++FG G+FY Sbjct: 169 LVRAIIQLMMNSYLKGGINLRKAWGCYHTLIDIVEKDTEKRIPRELEMCIKFGAGLFYTF 228 Query: 1807 LSIVPGSFQSILKAIGFNAERDEGIQMLENCYLRDGVRAPFAAFFLLVNYLFLSRGLADP 1628 L++VP + +L IGF +++D G Q L + + +R+P AA L YLFL GL D Sbjct: 229 LALVPANLMKLLSIIGFISDKDLGEQYLTEVFESNSIRSPHAALILCTLYLFLPTGLGDV 288 Query: 1627 TLSLNKAGAIVQECVKKYPKSSPFLFMACQQARKTGQIKEALNHITNGIYSCEMIGVTST 1448 ++L++A +++ ++YP+++ F + RK G A+N IT + E +G+ Sbjct: 289 NVALDRAKRVLETMNERYPQNTYFFGYSNFYYRKRGDTAPAVNCITVAADNAERVGLVPL 348 Query: 1447 NYRF 1436 R+ Sbjct: 349 LIRY 352 >ref|XP_009307371.1| tetratricopeptide repeat domain 39B [Trypanosoma grayi] gb|KEG14386.1| tetratricopeptide repeat domain 39B [Trypanosoma grayi] Length = 571 Score = 101 bits (252), Expect = 2e-18 Identities = 106/428 (24%), Positives = 173/428 (40%) Frame = -1 Query: 1864 IYSDIEDCLEFGIGVFYFILSIVPGSFQSILKAIGFNAERDEGIQMLENCYLRDGVRAPF 1685 I +++ C+++G G FY L++VP + +L IGF ++++ G + L + +G+R+PF Sbjct: 196 IPKELKMCIKYGTGTFYAFLALVPANLMKLLNVIGFISDKELGEEYLTEVFESNGIRSPF 255 Query: 1684 AAFFLLVNYLFLSRGLADPTLSLNKAGAIVQECVKKYPKSSPFLFMACQQARKTGQIKEA 1505 AA L YLFL G+ +L KA ++ K+Y ++ F + RK G+ EA Sbjct: 256 AALVLCTLYLFLPTGIGRVEETLQKAKHVLDTMNKRYEHNTYFYGYSNFYHRKKGETVEA 315 Query: 1504 LNHITNGIYSCEMIGVTSTNYRFEKGMTYLINLDFTAAKDIFELLFYGNTIVFTGKNGSI 1325 L I + E G+ R+ T ++L F AK+ + L Sbjct: 316 LQAIERAAANAERAGLVPLLIRYLHADTLFMDLRFAEAKEKYAAL--------------- 360 Query: 1324 RXXXXXXXXXXXXXXLLSKDGSTKKDNSLLKFFEFELRPFCGLCLAGCYLILKSSQIAMK 1145 LS +TK+ F L LA CY++L AM+ Sbjct: 361 ----------------LSHLSATKE--------TFAYTGQVVLSLAACYVMLGDDAKAME 396 Query: 1144 EALDVLKQTKAMXXXXXXXXXXXXXIGLLGTSASXXXXXXXXXXXNSDKKEPKTNRYNKF 965 V G + S S +D PK F Sbjct: 397 WLRKV---------------------GSMYNSRS-----------KNDANSPK------F 418 Query: 964 AGRHSAKDVENNSVTPFLIFIILYLRRDIFYMPLELKKRWANLLESTWKNYDKSIDLDTN 785 A R A N + P +LY+ RD+ +M +E +R N L + D S + Sbjct: 419 AARVIA----NQRLLPLCGVYMLYINRDLAHMKVEQAERVLNELHRVTEGRDLS-GPEAE 473 Query: 784 AVYLLIRGVFEKFLNQDDPTIAQITLCECLSLETGIVSETWVIPHCRYELGELFYKQFGN 605 +Y L GV +K ++ + + + + + E I S++ ++P+ YE GEL Y++ G Sbjct: 474 NMYTLFVGVIQKGCDRTEEALK--CMEKIFANEKRIPSDSMILPYTYYETGELEYRR-GK 530 Query: 604 QEAATEQF 581 E A E F Sbjct: 531 LERAKELF 538 >gb|ORC90684.1| tetratricopeptide repeat domain 39B [Trypanosoma theileri] Length = 585 Score = 99.8 bits (247), Expect = 7e-18 Identities = 101/391 (25%), Positives = 164/391 (41%), Gaps = 6/391 (1%) Frame = -1 Query: 2524 AVKKFWNNEFSEAEGIFNRFKNCIPRWNVTYAEMQLVKHLMKGQSLDNENPELSNALMEA 2345 AV W N + EAE + + KN PR+ + YA LV+ LM N E AL++ Sbjct: 22 AVHMMWTNRYPEAEALLSVHKNIHPRYALEYANCFLVQTLM------NSTNESREALLDL 75 Query: 2344 EKLAIKVCENKDDFETSFSLFRTDIWKAYIRPNNSPSEDEAGFASLRANFRWDCELAMAD 2165 KLA E F + + S+D + A R ++ Sbjct: 76 FKLADSRATAAKYSEPMFFI-------------DDDSDDIGNGDNSSAGGR-------SN 115 Query: 2164 ILLFHAILQVVGGSEIKGAFNLRKAWKTYSKVRDEIDRIKRDAGKNDHKHELXXXXXXXX 1985 L ++ + GG+E K +KA+K K ++ +H E Sbjct: 116 ELSSNSSITHTGGAERK---KKKKAFKGRRKAAEKA---------GEHFDESWQLECDVV 163 Query: 1984 XXSAILGRGSISNMVGF---GGNKNQNQDGSSATLDGI---NNNVEIYSDIEDCLEFGIG 1823 A+L R M+ GG + G L I + I ++I+ C+++G G Sbjct: 164 YADALLMRSVCQLMMNSYLKGGINLRRTWGIYHRLIQIVEADTANRIPNEIKMCIKYGTG 223 Query: 1822 VFYFILSIVPGSFQSILKAIGFNAERDEGIQMLENCYLRDGVRAPFAAFFLLVNYLFLSR 1643 FY L++VP + +L IGF ++++ G Q L + +G+R+PFAA L YLFL Sbjct: 224 TFYAFLALVPANLMKLLNVIGFISDKELGEQYLTEVFENNGIRSPFAALVLCTLYLFLPT 283 Query: 1642 GLADPTLSLNKAGAIVQECVKKYPKSSPFLFMACQQARKTGQIKEALNHITNGIYSCEMI 1463 G+ +L KA I+ ++YP+++ F + RK G+ EAL I + E Sbjct: 284 GIGRVEETLQKAKHILNTMNQRYPENTYFHGYSNFYHRKRGETAEALIAIEKAARNAERA 343 Query: 1462 GVTSTNYRFEKGMTYLINLDFTAAKDIFELL 1370 G+ R+ T ++L + AK+ + L Sbjct: 344 GLVPLLIRYLHADTLFMDLRYAEAKEKYAAL 374 >ref|XP_002682030.1| predicted protein [Naegleria gruberi] gb|EFC49286.1| predicted protein [Naegleria gruberi] Length = 587 Score = 97.8 bits (242), Expect = 3e-17 Identities = 57/171 (33%), Positives = 90/171 (52%), Gaps = 1/171 (0%) Frame = -1 Query: 1879 NNNVEIYSDIEDCLEFGIGVFYFILSIVPGSFQSILKAIGFNAERDEGIQMLENCYLRDG 1700 ++N I+SD+ + FG+G+F F+LSI+P SIL IGF+A+RD G+++++ + G Sbjct: 182 SSNEFIHSDVLHNVYFGMGIFNFMLSILPPMLTSILSMIGFDADRDHGLELMKFDHEYGG 241 Query: 1699 VRAPFAAFFLLVNYLFLSRGLADPTLSLNKAGAIVQECVKKYPKSSPFLFMACQQARKTG 1520 + A+F L +NYLF+ R L D L AG ++++ K +P+S F M RK G Sbjct: 242 RQFGVASFMLSMNYLFIPRALEDRETKLAIAGGLLEKSEKIFPRSGGFKMMKSHYERKKG 301 Query: 1519 QIKEALNHITNGIYSC-EMIGVTSTNYRFEKGMTYLINLDFTAAKDIFELL 1370 I A+ ++ I C E +G E Y + DF A +L Sbjct: 302 DISNAITSLSEAITICEESMGFMPNILVHELSWCYFLTEDFEKASHYLNIL 352