We systematically generated large-scale data sets to improve genome annotation for the nematode Caenorhabditis elegans, a key model organism. These data sets include transcriptome profiling across a developmental time course, genome-wide identification of transcription factor–binding sites, and maps of chromatin organization. From this, we created more complete and accurate gene models, including alternative splice forms and candidate noncoding RNAs. We constructed hierarchical networks of transcription factor–binding and microRNA interactions and discovered chromosomal locations bound by an unusually large number of transcription factors. Different patterns of chromatin composition and histone modification were revealed between chromosome arms and centers, with similarly prominent differences between autosomes and the X chromosome. Integrating data types, we built statistical models relating chromatin, transcription factor binding, and gene expression. Overall, our analyses ascribed putative functions to most of the conserved genome.
Introductory paragraphAmong organisms with chromosome-based mechanisms of sex determination, failure to equalize expression of X-linked genes between the sexes is typically lethal. In C. elegans, XX hermaphrodites halve transcription from each X chromosome to match the output of XO males 1 . Here, we mapped the binding location of the condensin homolog DPY-27 and the zinc finger protein SDC-3, two components of the C. elegans dosage compensation complex (DCC) 2,3 . Strong foci of DCC binding were observed on X, around which broader regions of localization were centered. Binding foci, but not adjacent regions of localization, were distinguished by clusters of a stereotypic 10-bp DNA sequence, suggesting a recruitment-andspreading mechanism for X recognition. In contrast to the Drosophila DCC, the C. elegans DCC was bound preferentially upstream of genes, suggesting modulation of transcriptional initiation and polymerase-coupled spreading. A mechanism for tuning DCC activity at specific loci was indicated by stronger DCC binding upstream of genes with high transcriptional activity. These data provide a basis for understanding how proteins involved in higher-order chromosome dynamics can regulate transcription at individual loci. Main TextTo compensate for differences in X-linked gene dosage between XY males and XX females, mammals inactivate most genes on one of the two female X chromosomes 4 . In contrast, C. elegans XX hermaphrodites dosage compensate by reducing transcription from each X chromosome by a factor of two to match the expression of XO males 1 . This mechanism is remarkable in that the subtle two-fold downregulation is imposed upon X-linked genes expressed over a large dynamic range 5,6 . The dosage compensation complex (DCC) required for C. elegans X-repression is composed of proteins encoded by the genes sdc-1, sdc-2, sdc-3, dpy-21, dpy-26, dpy-27, dpy-28, dpy-30 and mix-1 7 . DPY-26, DPY-27, DPY-28 and MIX-1 are homologous to members of the condensin complex, which is required for chromosome condensation and segregation in organisms ranging from bacteria to humans 8 . While the *Correspondence should be addressed to: Jason D. Lieb jlieb@bio.unc.edu (919) 843-3228. AUTHOR CONTRIBUTIONS This study was designed by SE and JDL. SE conducted the experiments. SE, PGG, CMW, and JDL conducted the data analysis. XZ and RDG designed, manufactured, and hybridized the DNA microarrays. SE and JDL wrote the paper. COMPETING INTERESTS STATEMENTThe authors declare that they have no competing financial interests. NIH Public Access NIH-PA Author ManuscriptNIH-PA Author Manuscript NIH-PA Author Manuscript molecular parallels to mitotic chromosome condensation broadly suggest a mechanism for reducing X-linked transcription 9,10 , the features of X that control DCC binding have proven more difficult to investigate. Four loci on X (rex-1-4) sufficient for DCC recruitment were identified recently by using confocal microscopy to detect DCC binding to multiple-copy extrachromosomal transgenic DNA 11,12 . Here, we take an ...
In all eukaryotes, histone variants are incorporated into a subset of nucleosomes to create functionally specialized regions of chromatin. One such variant, H2A.Z, replaces histone H2A and is required for development and viability in all animals tested to date. However, the function of H2A.Z in development remains unclear. Here, we use ChIP-chip, genetic mutation, RNAi, and immunofluorescence microscopy to interrogate the function of H2A.Z (HTZ-1) during embryogenesis in Caenorhabditis elegans, a key model of metazoan development. We find that HTZ-1 is expressed in every cell of the developing embryo and is essential for normal development. The sites of HTZ-1 incorporation during embryogenesis reveal a genome wrought by developmental processes. HTZ-1 is incorporated upstream of 23% of C. elegans genes. While these genes tend to be required for development and occupied by RNA polymerase II, HTZ-1 incorporation does not specify a stereotypic transcription program. The data also provide evidence for unexpectedly widespread independent regulation of genes within operons during development; in 37% of operons, HTZ-1 is incorporated upstream of internally encoded genes. Fewer sites of HTZ-1 incorporation occur on the X chromosome relative to autosomes, which our data suggest is due to a paucity of developmentally important genes on X, rather than a direct function for HTZ-1 in dosage compensation. Our experiments indicate that HTZ-1 functions in establishing or maintaining an essential chromatin state at promoters regulated dynamically during C. elegans embryogenesis.
SummaryTwo members of the mitogen-activated protein kinase (MAPK) family have been previously characterized in Plasmodium falciparum , but in vitro attempts at identifying MAP kinase kinase (MAPKK) homologues have failed. Here we report the characterization of a novel plasmodial protein kinase, PfPK7, whose top scores in BLASTP analysis belong to the MAPKK3/6 subgroup of MAPKKs. However, homology to MAPKKs is restricted to regions of the C-terminal lobe of the kinase domain, whereas the N-terminal region is closer to fungal protein kinase A enzymes (PKA, members of the AGC group of protein kinases). Hence, PfPK7 is a 'composite' enzyme displaying regions of similarity to more than one protein kinase family, similar to a few other plasmodial protein kinases. PfPK7 is expressed in several developmental stages of the parasite, both in the mosquito vector and in the human host. Recombinant PfPK7 displayed kinase activity towards a variety of substrates, but was unable to phosphorylate the two P. falciparum MAPK homologues in vitro , and was insensitive to PKA and MEK inhibitors. Together with the absence of a typical MAPKK activation site in its T-loop, this suggests that PfPK7 is not a MAPKK orthologue, despite the fact that this enzyme is the most 'MAPKKlike' enzyme encoded in the P. falciparum genome. This is consistent with recent observations that the plasmodial MAPKs are not true orthologues of the ERK1/2, p38 or JNK MAPKs, and strengthens the evidence that classical three-component moduledependent MAPK signalling pathways do not operate in malaria parasites, a feature that has not been described in any other eukaryote.
The molecular mechanisms regulating cell proliferation and development during the life cycle of malaria parasites remain to be elucidated. The peculiarities of the cell cycle organization during Plasmodium falciparum schizogony suggest that the modalities of cell cycle control in this organism may differ from those in other eukaryotes. Indeed, existing data concerning Plasmodium cell cycle regulators such as cyclin-dependent kinases reveal structural and functional properties that are divergent from those of their homologues in other systems. The work presented here lies in the context of the exploitation of the recently available P. falciparum genome sequence toward the characterization of putative cell cycle regulators. We describe the in silico identification of three open reading frames encoding proteins with maximal homology to various members of the cyclin family and demonstrate that the corresponding polypeptides are expressed in the erythrocytic stages of the infection. We present evidence that these proteins possess cyclin activity by demonstrating either their association with histone H1 kinase activity in parasite extracts or their ability to activate PfPK5, a P. falciparum cyclin-dependent kinase homologue, in vitro. Furthermore, we show that RINGO, a protein with no sequence homology to cyclins but that is nevertheless a strong activator of mammalian CDK1/2, is also a strong activator of PfPK5 in vitro. This raises the possibility that "cryptic" cell cycle regulators may be found among the 50% of the open reading frames in the P. falciparum genome that display no homology to any known proteins.
The conserved nuclear factor I (NFI) family of transcription factors is unique to animals and essential for mammalian development. The Caenorhabditis elegans genome encodes a single NFI family member, whereas vertebrate genomes encode 4 distinct NFI protein subtypes (A, B, C, and X). NFI-1-deficient worms exhibit abnormalities, including reduced lifespan, defects in movement and pharyngeal pumping, and delayed egg-laying. To explore the functional basis of these phenotypes, we sought to comprehensively identify NFI-1-bound loci in C. elegans . We first established NFI-1 DNA-binding specificity using an in vitro DNA-selection strategy. Analysis yielded a consensus motif of TTGGCA(N) 3 TGCCAA, which occurs 586 times in the genome, a 100-fold higher frequency than expected. We next asked which sites were occupied by NFI-1 in vivo by performing chromatin immunoprecipitation of NFI-1 followed by microarray hybridization. Only 55 genomic locations were identified, an unexpectedly small target set. In vivo NFI-1 binding sites tend to be upstream of genes involved in core cellular processes, such as chromatin remodeling, mRNA splicing, and translation. Remarkably, 59 out of 70 (84%) of the C. briggsae orthologs of the identified targets contain conserved NFI binding sites in their promoters. These experiments provide a foundation for understanding how NFI-1 is recruited to unexpectedly few in vivo sites to perform its developmental functions, despite a vast over-representation of its binding motif.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.