Higher-order genome organization and its variation in different cellular conditions remain poorly understood. Recent high-coverage genome-wide chromatin interaction mapping using Hi-C has revealed spatial segregation of chromosomes in the human genome into distinct subcompartments. However, subcompartment annotation, which requires Hi-C data with high sequencing coverage, is currently only available in the GM12878 cell line, making it impractical to compare subcompartment patterns across cell types. Here we develop a computational approach, SNIPER (Subcompartment iNference using Imputed Probabilistic ExpRessions), based on denoising autoencoder and multilayer perceptron classifier to infer subcompartments using typical Hi-C datasets with moderate coverage. SNIPER accurately reveals subcompartments using moderate coverage Hi-C datasets and outperforms an existing method that uses epigenomic features in GM12878. We apply SNIPER to eight additional cell lines and find that chromosomal regions with conserved and cell-type specific subcompartment annotations have different patterns of functional genomic features. SNIPER enables the identification of subcompartments without high-coverage Hi-C data and provides insights into the function and mechanisms of spatial genome organization variation across cell types.
The higher-order genome organization and its variation in different cellular conditions remains poorly understood. Recent high-resolution genome-wide mapping of chromatin interactions using Hi-C has revealed that chromosomes in the human genome are spatially segregated into distinct subcompartments. However, due to the requirement on sequencing coverage of the Hi-C data to define subcompartments, to date subcompartment annotation is only available in the GM12878 cell line, making it impractical to compare Hi-C subcompartment patterns across multiple cell types. Here we develop a new computational approach, named SNIPER, based on an autoencoder and multilayer perceptron classifier to infer subcompartments using typical Hi-C datasets with moderate coverage. We demonstrated that SNIPER can accurately reveal subcompartments based on Hi-C datasets with moderate coverage and can significantly outperform an existing method that uses numerous epigenomic datasets as input features in GM12878. We applied SNIPER to eight additional cell lines to identify the variation of Hi-C subcompartments across different cell types. SNIPER revealed that chromosomal regions with conserved and more dynamic subcompartment annotations across cell types have different patterns of functional genomic features. This work demonstrates that SNIPER is effective in identifying subcompartments without the need of high-coverage Hi-C data and has the potential to provide new insights into the spatial genome organization variation across different cell types.
New single-cell Hi-C (scHi-C) technologies enable probing of the genome-wide cell-to-cell variability in 3D genome organization from individual cells. Several computational methods have been developed to reveal single-cell 3D genome features based on scHi-C data, including A/B compartments, topologically-associating domains, and chromatin loops. However, no scHi-C analysis method currently exists for annotating single-cell subcompartments, which are crucial for providing a more refined view of large-scale chromosome spatial localization in single cells. Here, we present scGHOST, a single-cell subcompartment annotation method based on graph embedding with constrained random walk sampling. Applications of scGHOST to scHi-C data and single-cell 3D genome imaging data demonstrate the reliable identification of single-cell subcompartments and offer new insights into cell-to-cell variability of nuclear subcompartments. Using scHi-C data from the human prefrontal cortex, scGHOST identifies cell type-specific subcompartments that are strongly connected to cell type-specific gene expression, suggesting the functional implications of single-cell subcompartments. Overall, scGHOST is an effective new method for single-cell 3D genome subcompartment annotation based on scHi-C data for a broad range of biological contexts.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.