We describe Hi-C, a method that probes the three-dimensional architecture of whole genomes by coupling proximity-based ligation with massively parallel sequencing. We constructed spatial proximity maps of the human genome with Hi-C at a resolution of 1Mb. These maps confirm the presence of chromosome territories and the spatial proximity of small, gene rich chromosomes. We identified an additional level of genome organization that is characterized by the spatial segregation of open and closed chromatin to form two genome-wide compartments. At the megabase scale, the chromatin conformation is consistent with a fractal globule, a knot-free conformation that enables maximally dense packing while preserving the ability to easily fold and
DNaseI hypersensitive sites (DHSs) are markers of regulatory DNA and have underpinned the discovery of all classes of cis-regulatory elements including enhancers, promoters, insulators, silencers, and locus control regions. Here we present the first extensive map of human DHSs identified through genome-wide profiling in 125 diverse cell and tissue types. We identify ~2.9 million DHSs that encompass virtually all known experimentally-validated cis-regulatory sequences and expose a vast trove of novel elements, most with highly cell-selective regulation. Annotating these elements using ENCODE data reveals novel relationships between chromatin accessibility, transcription, DNA methylation, and regulatory factor occupancy patterns. We connect ~580,000 distal DHSs with their target promoters, revealing systematic pairing of different classes of distal DHSs and specific promoter types. Patterning of chromatin accessibility at many regulatory regions is choreographed with dozens to hundreds of co-activated elements, and the trans-cellular DNaseI sensitivity pattern at a given region can predict cell type-specific functional behaviors. The DHS landscape shows signatures of recent functional evolutionary constraint. However, the DHS compartment in pluripotent and immortalized cells exhibits higher mutation rates than that in highly differentiated cells, exposing an unexpected link between chromatin accessibility, proliferative potential and patterns of human variation.
We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.
Faithful transmission of genetic material to daughter cells involves a characteristic temporal order of DNA replication, which may play a significant role in the inheritance of epigenetic states. We developed a genome-scale approach-Repli Seq-to map temporally ordered replicating DNA using massively parallel sequencing and applied it to study regional variation in human DNA replication time across multiple human cell types. The method requires as few as 8,000 cytometryfractionated cells for a single analysis, and provides high-resolution DNA replication patterns with respect to both cell-cycle time and genomic position. We find that different cell types exhibit characteristic replication signatures that reveal striking plasticity in regional replication time patterns covering at least 50% of the human genome. We also identified autosomal regions with marked biphasic replication timing that include known regions of monoallelic expression as well as many previously uncharacterized domains. Comparison with highresolution genome-wide profiles of DNaseI sensitivity revealed that DNA replication typically initiates within foci of accessible chromatin comprising clustered DNaseI hypersensitive sites, and that replication time is better correlated with chromatin accessibility than with gene expression. The data collectively provide a unique, genomewide picture of the epigenetic compartmentalization of the human genome and suggest that cell-lineage specification involves extensive reprogramming of replication timing patterns.chromatin structure | gene expression | tissue specificity
The incorporation of genomics into medicine is stimulating interest on the return of incidental findings (IFs) from exome and genome sequencing. However, no large-scale study has yet estimated the number of expected actionable findings per individual; therefore, we classified actionable pathogenic single-nucleotide variants in 500 European- and 500 African-descent participants randomly selected from the National Heart, Lung, and Blood Institute Exome Sequencing Project. The 1,000 individuals were screened for variants in 114 genes selected by an expert panel for their association with medically actionable genetic conditions possibly undiagnosed in adults. Among the 1,000 participants, 585 instances of 239 unique variants were identified as disease causing in the Human Gene Mutation Database (HGMD). The primary literature supporting the variants' pathogenicity was reviewed. Of the identified IFs, only 16 unique autosomal-dominant variants in 17 individuals were assessed to be pathogenic or likely pathogenic, and one participant had two pathogenic variants for an autosomal-recessive disease. Furthermore, one pathogenic and four likely pathogenic variants not listed as disease causing in HGMD were identified. These data can provide an estimate of the frequency (∼3.4% for European descent and ∼1.2% for African descent) of the high-penetrance actionable pathogenic or likely pathogenic variants in adults. The 23 participants with pathogenic or likely pathogenic variants were disproportionately of European (17) versus African (6) descent. The process of classifying these variants underscores the need for a more comprehensive and diverse centralized resource to provide curated information on pathogenicity for clinical use to minimize health disparities in genomic medicine.
Evaluating the pathogenicity of a variant is challenging given the plethora of types of genetic evidence that laboratories consider. Deciding how to weigh each type of evidence is difficult, and standards have been needed. In 2015, the American College of Medical Genetics and Genomics (ACMG) and the Association for Molecular Pathology (AMP) published guidelines for the assessment of variants in genes associated with Mendelian diseases. Nine molecular diagnostic laboratories involved in the Clinical Sequencing Exploratory Research (CSER) consortium piloted these guidelines on 99 variants spanning all categories (pathogenic, likely pathogenic, uncertain significance, likely benign, and benign). Nine variants were distributed to all laboratories, and the remaining 90 were evaluated by three laboratories. The laboratories classified each variant by using both the laboratory's own method and the ACMG-AMP criteria. The agreement between the two methods used within laboratories was high (K-alpha = 0.91) with 79% concordance. However, there was only 34% concordance for either classification system across laboratories. After consensus discussions and detailed review of the ACMG-AMP criteria, concordance increased to 71%. Causes of initial discordance in ACMG-AMP classifications were identified, and recommendations on clarification and increased specification of the ACMG-AMP criteria were made. In summary, although an initial pilot of the ACMG-AMP guidelines did not lead to increased concordance in variant interpretation, comparing variant interpretations to identify differences and having a common framework to facilitate resolution of those differences were beneficial for improving agreement, allowing iterative movement toward increased reporting consistency for variants in genes associated with monogenic disease.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.