GranatumX: A Community-engaging, Modularized, and Flexible Webtool for Single-cell Data Analysis

Garmire, David; Zhu, Xun; Mantravadi, Aravind; Huang, Qizhao; Yunits, Breck; Yu, Li; Wolfgruber, Thomas; Poirion, Olivier; Zhao, Tianying; Arisdakessian, Cédric; Stanojevic, Stefan; Garmire, Lana X.

doi:10.1101/385591

Cited by 5 publications

(2 citation statements)

References 20 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…However, these SV markers represent a different category from those HV gene markers detected by quantitative variabilities conventionally [36][37][38]. Questions remained: (1) if SV gene based clustering can be improved by integrating additional SV genes, which are normally used in single cell RNA-Seq analysis for clustering; (2) if integration of SV and HV genes can improve clustering results in spatial transcriptomics data, which computational method(s) to use.…”

Section: Discussionmentioning

confidence: 99%

Benchmarking Computational Integration Methods for Spatial Transcriptomics Data

Stanojevic

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

The increasing popularity of spatial transcriptomics has allowed researchers to analyze transcriptome data in its tissue sample's spatial context. Various methods have been developed for detecting SV (spatially variable) genes, with distinct spatial expression patterns. However, the accuracy of using such SV genes in clustering cell types has not been thoroughly studied. On the other hand, in single cell resolution sequencing data, clustering analysis is usually done on highly variable (HV) genes. Here we investigate if integrating SV genes and HV genes from spatial transcriptomics data can improve clustering performance beyond using SV genes alone. We evaluated six methods that integrate different features measured from the same samples including MOFA+, scVI, Seurat v4 , CIMLR, SNF, and the straightforward concatenation approach. We applied these methods on 19 real datasets from three different spatial transcriptomics technologies (merFISH, SeqFISH+, and Visium) as well as 20 simulated datasets of varying spatial expression conditions. Our evaluations show that the performances of these integration methods are largely dependent on spatial transcriptomics platforms. Despite the variations among the results, in general MOFA+ and simple concatenation have good performances across different types of spatial transcriptomics platforms. This work shows that integrating quantitative and spatial marker genes in the spatial transcriptomics data can improve clustering. It also provides practical guides on the choices of computational methods to accomplish this goal.

show abstract

Section: Discussionmentioning

confidence: 99%

Benchmarking Computational Integration Methods for Spatial Transcriptomics Data

Stanojevic

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…As mentioned earlier, the constantly increasing accessibility of NGS methods makes the generation of omics data easier and less expensive. Today's single cell NGS START [107] ASAP [108], SINCERA [109] Limma [110] Canonical correlation analysis [111] Salmon [112] Docker [113] DEBrowser [114] FastGenomics [115] Seurat [111] ComBat [116] Sailfish [117] Singularity [118] iDEP [119] Granatum [120] and Grana-tumX [121] Scanpy [122] SVA [123] Mutual nearest neighbors [124] Shiny-Seq [125] Monocle [126] technologies can generate an enormous amount of data, where noise, e.g., from amplification and dropout is a common problem. Kharchenko et al proposed a noise tolerant Bayesian approach, which allows the identification of differential gene expression and subpopulations in single-cell data using a probabilistic model of expression-magnitude distortions [40].…”

Section: Experimental Advances and Challenges: Dealing With Big Data mentioning

confidence: 99%

Modeling population heterogeneity from microbial communities to immune response in cells

Pecht

Aschenbrenner

Ulas

et al. 2019

Cell. Mol. Life Sci.

View full text Add to dashboard Cite

Heterogeneity is universally observed in all natural systems and across multiple scales. Understanding population heterogeneity is an intriguing and attractive topic of research in different disciplines, including microbiology and immunology. Microbes and mammalian immune cells present obviously rather different system-specific biological features. Nevertheless, as typically occurs in science, similar methods can be used to study both types of cells. This is particularly true for mathematical modeling, in which key features of a system are translated into algorithms to challenge our mechanistic understanding of the underlying biology. In this review, we first present a broad overview of the experimental developments that allowed observing heterogeneity at the single cell level. We then highlight how this "data revolution" requires the parallel advancement of algorithms and computing infrastructure for data processing and analysis, and finally present representative examples of computational models of population heterogeneity, from microbial communities to immune response in cells.Publisher's Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

show abstract

Evaluation of Cell Type Annotation R Packages on Single Cell RNA-seq Data

Huang

Liu

et al. 2019

Preprint

Self Cite

View full text Add to dashboard Cite

24Annotating cell types is a critical step in single cell RNA-Seq (scRNA-Seq) data analysis. Some 25 deconvolution methods have recently emerged to enable automated cell type identification. 26 However, comprehensive evaluations of these methods are lacking to provide practical guidelines. 27 Moreover, it is not clear whether some deconvolution methods originally designed for analyzing 28 other omics data are adaptable to scRNA-Seq analysis. In this study, we evaluated ten cell-type 29 deconvolution methods publicly available as R packages. Eight of them are popular methods 30 developed specifically for single cell research (Seurat, scmap, SingleR, CHETAH, SingleCellNet, 31 scID, Garnett, SCINA). The other two methods are repurposed from deconvoluting DNA 32 methylation data: Linear Constrained Projection (CP) and Robust Partial Correlations (RPC). We 33 conducted systematic comparisons on a wide variety of public scRNA-seq datasets as well as 34 simulation data. We assessed the accuracy through intra-dataset and inter-dataset predictions, the 35 robustness over practical challenges such as gene filtering and high similarity among cell types, as 36 well as the capabilities on rare and unknown cell-type detection. Overall, methods such as Seurat, 37SingleR, CP, RPC and SingleCellNet performed well, with Seurat being the best at annotating 38 major cell types. Also, Seurat, SingleR and CP are more robust against down-sampling. However, 39Seurat does have a major drawback at predicting rare cell populations, and it is suboptimal at 40 differentiating cell types that are highly similar to each other, while SingleR and CP are much 41 better in these aspects. 42 43 45 46 47 48 49 50 51 52 Single cell RNA sequencing (scRNA-seq) has emerged as a powerful tool to enable the 53 characterization of cell types and states in complex tissues and organisms at the single-cell level 54 [1-5]. Annotating cell types amongst the cell clusters is a critical step before other downstream 55 analyses, such as differential gene expression and pseudo time analysis [6][7][8][9].56Conventionally, a set of priorly known cell-type specific markers are used to label the cell types 57 of the clusters manually. This process is laborious and often is a rate-limiting step for scRNA-seq 58 analysis. This approach is also prone to bias and errors. The marker may not be specific enough to 59 differentiate the cell subpopulations in the same dataset, or it may not be generic enough to be 60 applied from one study to another. Automating the cell type labeling is critical to enhance 61 reproducibility and consistency among single cell studies. 62Recently some deconvolution methods have emerged to systematically assign cell types in the 63 new scRNA-seq dataset, based on existing annotations from another dataset. Instead of using only 64 top differentiating markers, most methods project or correlate the new cells onto similar cells in 65 the well-annotated reference datasets, by leveraging the whole transcriptome profiles. These 66 decon...

show abstract

GranatumX: A Community-engaging, Modularized, and Flexible Webtool for Single-cell Data Analysis

Cited by 5 publications

References 20 publications

Benchmarking Computational Integration Methods for Spatial Transcriptomics Data

Benchmarking Computational Integration Methods for Spatial Transcriptomics Data

Modeling population heterogeneity from microbial communities to immune response in cells

Evaluation of Cell Type Annotation R Packages on Single Cell RNA-seq Data

Contact Info

Product

Resources

About