Assessment of computational methods for the analysis of single-cell ATAC-seq data

Chen, Huidong; Lareau, Caleb A.; Andreani, Tommaso; Vinyard, Michael; Garcia, Sara P.; Clement, Kendell; Andrade‐Navarro, Miguel A.; Buenrostro, Jason D.; Pinello, Luca

doi:10.1186/s13059-019-1854-5

Cited by 273 publications

(324 citation statements)

References 36 publications

Supporting

Mentioning

293

Contrasting

Order By: Relevance

“…Despite the analyses listed for bulk ATAC-seq, another important analysis for single-cell is clustering. A recent benchmarking study from Chen et al about clustering methods in scATAC-seq showed that SnapATAC, Cusanovich2018 and cisTopic outperformed other methods [23,[173][174][175]. These three methods are featured by workflows combining window-based genome binning, binarization of the accessibility, coverage bias correction, and dimension reduction using principle component analysis, which specifically handle the sparse scATAC-seq data [175].…”

Section: Single-cell Atac-seqmentioning

confidence: 99%

From reads to insight: a hitchhiker’s guide to ATAC-seq data analysis

et al. 2020

View full text Add to dashboard Cite

Assay of Transposase Accessible Chromatin sequencing (ATAC-seq) is widely used in studying chromatin biology, but a comprehensive review of the analysis tools has not been completed yet. Here, we discuss the major steps in ATAC-seq data analysis, including pre-analysis (quality check and alignment), core analysis (peak calling), and advanced analysis (peak differential analysis and annotation, motif enrichment, footprinting, and nucleosome position analysis). We also review the reconstruction of transcriptional regulatory networks with multiomics data and highlight the current challenges of each step. Finally, we describe the potential of single-cell ATAC-seq and highlight the necessity of developing ATAC-seq specific analysis tools to obtain biologically meaningful insights.

show abstract

Section: Single-cell Atac-seqmentioning

confidence: 99%

From reads to insight: a hitchhiker’s guide to ATAC-seq data analysis

et al. 2020

View full text Add to dashboard Cite

show abstract

“…type identification as a downstream task of DR. Both measures are frequently used to assess the performance of cell embedding techniques [18,11,20,23,10]. Another standard measure to assess the ability of a clustering algorithm to recover known classes is the adjusted rand index (ARI) [11,20,23,10]; however, we found that AMI and ARI are extremely correlated (Additional file 1: Fig.…”

Section: Plates Basedmentioning

confidence: 93%

“…For Louvain, the nearest neighbors graph construction was done with scanpy (version 1.3.8) using default parameters, and the clustering was also run with default parameters using the 'taynaud' flavor. Note that since we had to automatically run Louvain on all embeddings, as done in [23], we could not properly tune the resolution nor the size of the neighborhood and thus Louvain could either overcluster or undercluster.…”

Section: Clustering Algorithmsmentioning

confidence: 99%

Tuning parameters of dimensionality reduction methods for single-cell RNA-seq analysis

Raimundo¹,

Vallot

Vert³

2020

Preprint

View full text Add to dashboard Cite

Many computational methods have been developed recently to analyze single-cell RNA-seq (scRNA-seq) data. Several benchmark studies have compared these methods on their ability for dimensionality reduction, clustering or differential analysis, often relying on default parameters. Yet given the biological diversity of scRNA-seq datasets, parameter tuning might be essential for the optimal usage of methods, and determining how to tune parameters remains an unmet need. Here, we propose a benchmark to assess the performance of five methods, systematically varying their tunable parameters, for dimension reduction (DR) of scRNA-seq data, a common first step to many downstream applications such as cell type identification or trajectory inference. We run a total of ∼1.5 million experiments to assess the influence of parameter changes on the performance of each method, and propose two strategies to automatically tune parameters for methods that need it. We find that principal component analysis (PCA)-based methods like scran and Seurat are competitive with default parameters but do not benefit much from parameter tuning, while more complex models like ZinbWave, DCA and scVI can reach better performance but after parameter tuning. We propose and evaluate two strategies to tune parameters automatically.

show abstract

“…We generated simulated data with different levels of noise from the bulk ATAC-seq data of 13 primary human blood cell types [8] using the same strategy as that in [47]. We started with the bulk peak-by-cell count matrix and generated count for peak i in cell type t using a binomial distribution binomð2; p t i Þ, where p t i ¼ ð1−qÞr t i =2 þ qn=2k, r t i is the percentage of all reads overlapping with peak i in cell type t, k is the total number of peaks in the bulk data, n is the number of simulated fragments, and q is a parameter specifying the level of noise; q = 0 indicates no noise while q = 1 indicates the highest level of noise.…”

Section: Scratmentioning

confidence: 99%

scATAC-pro: a comprehensive workbench for single-cell chromatin accessibility sequencing data

Uzun

Zhu

et al. 2020

Genome Biol

View full text Add to dashboard Cite

Single-cell chromatin accessibility sequencing has become a powerful technology for understanding epigenetic heterogeneity of complex tissues. However, there is a lack of open-source software for comprehensive processing, analysis, and visualization of such data generated using all existing experimental protocols. Here, we present scATAC-pro for quality assessment, analysis, and visualization of single-cell chromatin accessibility sequencing data. scATAC-pro computes a range of quality control metrics for several key steps of experimental protocols, with a flexible choice of methods. It generates summary reports for both quality assessment and downstream analysis. scATAC-pro is available at https://github.com/tanlabcode/scATAC-pro.

show abstract

Assessment of computational methods for the analysis of single-cell ATAC-seq data

Cited by 273 publications

References 36 publications

From reads to insight: a hitchhiker’s guide to ATAC-seq data analysis

From reads to insight: a hitchhiker’s guide to ATAC-seq data analysis

Tuning parameters of dimensionality reduction methods for single-cell RNA-seq analysis

scATAC-pro: a comprehensive workbench for single-cell chromatin accessibility sequencing data

Contact Info

Product

Resources

About