In addition to protein coding sequence, the human genome contains a significant amount of regulatory DNA, the identification of which is proving somewhat recalcitrant to both in silico and functional methods. An approach that has been used with some success is comparative sequence analysis, whereby equivalent genomic regions from different organisms are compared in order to identify both similarities and differences. In general, similarities in sequence between highly divergent organisms imply functional constraint. We have used a whole-genome comparison between humans and the pufferfish, Fugu rubripes, to identify nearly 1,400 highly conserved non-coding sequences. Given the evolutionary divergence between these species, it is likely that these sequences are found in, and furthermore are essential to, all vertebrates. Most, and possibly all, of these sequences are located in and around genes that act as developmental regulators. Some of these sequences are over 90% identical across more than 500 bases, being more highly conserved than coding sequence between these two species. Despite this, we cannot find any similar sequences in invertebrate genomes. In order to begin to functionally test this set of sequences, we have used a rapid in vivo assay system using zebrafish embryos that allows tissue-specific enhancer activity to be identified. Functional data are presented for highly conserved non-coding sequences associated with four unrelated developmental regulators (SOX21, PAX6, HLXB9, and SHH), in order to demonstrate the suitability of this screen to a wide range of genes and expression patterns. Of 25 sequence elements tested around these four genes, 23 show significant enhancer activity in one or more tissues. We have identified a set of non-coding sequences that are highly conserved throughout vertebrates. They are found in clusters across the human genome, principally around genes that are implicated in the regulation of development, including many transcription factors. These highly conserved non-coding sequences are likely to form part of the genomic circuitry that uniquely defines vertebrate development.
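The conservation criterion quoted above (over 90% identity across more than 500 bases) can be illustrated with a minimal sketch. This is not the authors' pipeline: it assumes the two sequences have already been aligned to equal length by some external tool, with `-` marking gaps, and the function names `percent_identity` and `conserved_windows` are hypothetical, introduced only for this example.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of aligned columns where both sequences carry the same base.

    Assumes a and b are pre-aligned strings of equal length ('-' = gap).
    Gap columns count as mismatches.
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    if not a:
        return 0.0
    matches = sum(1 for x, y in zip(a, b) if x == y and x != "-")
    return 100.0 * matches / len(a)


def conserved_windows(a: str, b: str, window: int = 500,
                      min_identity: float = 90.0) -> list[tuple[int, float]]:
    """Return (start, identity) for every window meeting the threshold.

    The defaults mirror the figures quoted in the abstract; they are
    illustrative values, not parameters taken from the study's methods.
    """
    hits = []
    for start in range(len(a) - window + 1):
        pid = percent_identity(a[start:start + window],
                               b[start:start + window])
        if pid >= min_identity:
            hits.append((start, pid))
    return hits
```

For real data the windows would be scanned over a whole-genome alignment rather than two short strings, and overlapping hits would normally be merged into single elements.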
Establishment of a proper chromatin landscape is central to genome function. Here, we explain H3 variant distribution by specific targeting and dynamics of deposition involving the CAF-1 and HIRA histone chaperones. Impairing replicative H3.1 incorporation via CAF-1 enables an alternative H3.3 deposition at replication sites via HIRA. Conversely, the H3.3 incorporation throughout the cell cycle via HIRA cannot be replaced by H3.1. ChIP-seq analyses reveal correlation between HIRA-dependent H3.3 accumulation and RNA pol II at transcription sites and specific regulatory elements, further supported by their biochemical association. The HIRA complex shows unique DNA binding properties, and depletion of HIRA increases DNA sensitivity to nucleases. We propose that protective nucleosome gap filling of naked DNA by HIRA leads to a broad distribution of H3.3, and HIRA association with Pol II ensures local H3.3 enrichment at specific sites. We discuss the importance of this H3.3 deposition as a salvage pathway to maintain chromatin integrity.
Centromeres are essential for ensuring proper chromosome segregation in eukaryotes. Their definition relies on the presence of a centromere-specific H3 histone variant CenH3, known as CENP-A in mammals. Its overexpression in aggressive cancers raises questions concerning its effect on chromatin dynamics and contribution to tumorigenesis. We find that CenH3 overexpression in human cells leads to ectopic enrichment at sites of active histone turnover involving a heterotypic tetramer containing CenH3-H4 with H3.3-H4. Ectopic localization of this particle depends on the H3.3 chaperone DAXX rather than the dedicated CenH3 chaperone HJURP. This aberrant nucleosome occludes CTCF binding and has a minor effect on gene expression. Cells overexpressing CenH3 are more tolerant of DNA damage. Both the survival advantage and CTCF occlusion in these cells are dependent on DAXX. Our findings illustrate how changes in histone variant levels can disrupt chromatin dynamics and suggest a possible mechanism for cell resistance to anticancer treatments.
A comparative analysis of SNPs and their exonic and intronic environments identifies the features predictive of splice-affecting variants.
Fish-mammal genomic comparisons have proved powerful in identifying conserved noncoding elements likely to be cis-regulatory in nature, and the majority of those tested in vivo have been shown to act as tissue-specific enhancers associated with genes involved in transcriptional regulation of development. Although most of these elements share little sequence identity with each other, a small number are remarkably similar and appear to be the product of duplication events. Here, we searched for duplicated conserved noncoding elements in the human genome, using comparisons with Fugu to select putative cis-regulatory sequences. We identified 124 families of duplicated elements, each containing between two and five members, that are highly conserved within and between vertebrate genomes. In 74% of cases, we were able to assign a specific set of paralogous genes with annotation relating to transcriptional regulation and/or development to each family, thus removing much of the ambiguity in identifying associated genes. We find that duplicate elements have the potential to up-regulate reporter gene expression in a tissue-specific manner and that expression domains often overlap, but are not necessarily identical, between family members. Over two-thirds of the families are conserved in duplicate in fish and appear to predate the large-scale duplication events thought to have occurred at the origin of vertebrates. We propose a model whereby gene duplication and the evolution of cis-regulatory elements can be considered in the context of increased morphological diversity and the emergence of the modern vertebrate body plan.
Comparisons between diverse vertebrate genomes have uncovered thousands of highly conserved non-coding sequences, an increasing number of which have been shown to function as enhancers during early development. Despite their extreme conservation over 500 million years from humans to cartilaginous fish, these elements appear to be largely absent in invertebrates, and, to date, there has been little understanding of their mode of action or the evolutionary processes that have modelled them. We have now exploited emerging genomic sequence data for the sea lamprey, Petromyzon marinus, to explore the depth of conservation of this type of element in the earliest diverging extant vertebrate lineage, the jawless fish (agnathans). We searched for conserved non-coding elements (CNEs) at 13 human gene loci and identified lamprey elements associated with all but two of these gene regions. Although markedly shorter and less well conserved than within jawed vertebrates, identified lamprey CNEs are able to drive specific patterns of expression in zebrafish embryos, which are almost identical to those driven by the equivalent human elements. These CNEs are therefore a unique and defining characteristic of all vertebrates. Furthermore, alignment of lamprey and other vertebrate CNEs should permit the identification of persistent sequence signatures that are responsible for common patterns of expression and contribute to the elucidation of the regulatory language in CNEs. Identifying the core regulatory code for development, common to all vertebrates, provides a foundation upon which regulatory networks can be constructed and might also illuminate how large conserved regulatory sequence blocks evolve and become fixed in genomic DNA.