We describe the draft genome of the microcrustacean Daphnia pulex, which is only 200 Mb and contains at least 30,907 genes. The high gene count is a consequence of an elevated rate of gene duplication resulting in tandem gene clusters. More than 1/3 of Daphnia’s genes have no detectable homologs in any other available proteome, and the most amplified gene families are specific to the Daphnia lineage. The co-expansion of gene families interacting within metabolic pathways suggests that the maintenance of duplicated genes is not random, and the analysis of gene expression under different environmental conditions reveals that numerous paralogs acquire divergent expression patterns soon after duplication. Daphnia-specific genes – including many additional loci within sequenced regions that are otherwise devoid of annotations – are the most responsive genes to ecological challenges.
We systematically generated large-scale data sets to improve genome annotation for the nematode Caenorhabditis elegans, a key model organism. These data sets include transcriptome profiling across a developmental time course, genome-wide identification of transcription factor–binding sites, and maps of chromatin organization. From this, we created more complete and accurate gene models, including alternative splice forms and candidate noncoding RNAs. We constructed hierarchical networks of transcription factor–binding and microRNA interactions and discovered chromosomal locations bound by an unusually large number of transcription factors. Different patterns of chromatin composition and histone modification were revealed between chromosome arms and centers, with similarly prominent differences between autosomes and the X chromosome. Integrating data types, we built statistical models relating chromatin, transcription factor binding, and gene expression. Overall, our analyses ascribed putative functions to most of the conserved genome.
We report here genome sequences and comparative analyses of three closely related parasitoid wasps: Nasonia vitripennis, N. giraulti, and N. longicornis. Parasitoids are important regulators of arthropod populations, including major agricultural pests and disease vectors, and Nasonia is an emerging genetic model, particularly for evolutionary and developmental genetics. Key findings include the identification of a functional DNA methylation tool kit; hymenopteran-specific genes including diverse venoms; lateral gene transfers among Pox viruses, Wolbachia, and Nasonia; and the rapid evolution of genes involved in nuclear-mitochondrial interactions that are implicated in speciation. Newly developed genome resources advance Nasonia for genetic research, accelerate mapping and cloning of quantitative trait loci, and will ultimately provide tools and knowledge for further increasing the utility of parasitoids as pest insect-control agents.
This chapter describes gene expression analysis by Singular Value Decomposition (SVD), emphasizing initial characterization of the data. We describe SVD methods for visualization of gene expression data, representation of the data using a smaller number of variables, and detection of patterns in noisy gene expression data. In addition, we describe the precise relation between SVD analysis and Principal Component Analysis (PCA) when PCA is calculated using the covariance matrix, enabling our descriptions to apply equally well to either method. Our aim is to provide definitions, interpretations, examples, and references that will serve as resources for understanding and extending the application of SVD and PCA to gene expression analysis.
Chromatin immunoprecipitation identifies specific interactions between genomic DNA and proteins, advancing our understanding of gene-level and chromosome-level regulation. Based on chromatin immunoprecipitation experiments using validated antibodies, we define the genome-wide distributions of 19 histone modifications, one histone variant, and eight chromatin-associated proteins in Caenorhabditis elegans embryos and L3 larvae. Cluster analysis identified five groups of chromatin marks with shared features: Two groups correlate with gene repression, two with gene activation, and one with the X chromosome. The X chromosome displays numerous unique properties, including enrichment of monomethylated H4K20 and H3K27, which correlate with the different repressive mechanisms that operate in somatic tissues and germ cells, respectively. The data also revealed striking differences in chromatin composition between the autosomes and between chromosome arms and centers. Chromosomes I and III are globally enriched for marks of active genes, consistent with containing more highly expressed genes, compared to chromosomes II, IV, and especially V. Consistent with the absence of cytological heterochromatin and the holocentric nature of C. elegans chromosomes, markers of heterochromatin such as H3K9 methylation are not concentrated at a single region on each chromosome. Instead, H3K9 methylation is enriched on chromosome arms, coincident with zones of elevated meiotic recombination. Active genes in chromosome arms and centers have very similar histone mark distributions, suggesting that active domains in the arms are interspersed with heterochromatin-like structure. These data, which confirm and extend previous studies, allow for in-depth analysis of the organization and deployment of the C. elegans genome during development.
We report testing of the specificity and utility of over 200 antibodies raised against 57 different histone modifications, in Drosophila melanogaster, Caenorhabditis elegans and human cells. While most antibodies performed well, over 25% failed specificity tests by dot blot or western blot. Among specific antibodies, over 20% failed in chromatin immunoprecipitation experiments. We advise rigorous testing of histone-modification antibodies before use and provide a website for posting new test results.
Methylation of histone H3K36 in higher eukaryotes is mediated by multiple methyltransferases. Set2-related H3K36 methyltransferases are targeted to genes by association with RNA Polymerase II and are involved in preventing aberrant transcription initiation within the body of genes. The targeting and roles of the NSD family of mammalian H3K36 methyltransferases, known to be involved in human developmental disorders and oncogenesis, are not known. We used genome-wide chromatin immunoprecipitation (ChIP) to investigate the targeting and roles of the Caenorhabditis elegans NSD homolog MES-4, which is maternally provided to progeny and is required for the survival of nascent germ cells. ChIP analysis in early C. elegans embryos revealed that, consistent with immunostaining results, MES-4 binding sites are concentrated on the autosomes and the leftmost ∼2% (300 kb) of the X chromosome. MES-4 overlies the coding regions of approximately 5,000 genes, with a modest elevation in the 5′ regions of gene bodies. Although MES-4 is generally found over Pol II-bound genes, analysis of gene sets with different temporal-spatial patterns of expression revealed that Pol II association with genes is neither necessary nor sufficient to recruit MES-4. In early embryos, MES-4 associates with genes that were previously expressed in the maternal germ line, an interaction that does not require continued association of Pol II with those loci. Conversely, Pol II association with genes newly expressed in embryos does not lead to recruitment of MES-4 to those genes. These and other findings suggest that MES-4, and perhaps the related mammalian NSD proteins, provide an epigenetic function for H3K36 methylation that is novel and likely to be unrelated to ongoing transcription. We propose that MES-4 transmits the memory of gene expression in the parental germ line to offspring and that this memory role is critical for the PGCs to execute a proper germline program.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.