2020
DOI: 10.1038/s41467-020-17900-3
|View full text |Cite
|
Sign up to set email alerts
|

A clustering-independent method for finding differentially expressed genes in single-cell transcriptome data

Abstract: A common analysis of single-cell sequencing data includes clustering of cells and identifying differentially expressed genes (DEGs). How cell clusters are defined has important consequences for downstream analyses and the interpretation of results, but is often not straightforward. To address this difficulty, we present singleCellHaystack, a method that enables the prediction of DEGs without relying on explicit clustering of cells. Our method uses Kullback-Leibler divergence to find genes that are expressed in… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

4
109
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 68 publications
(113 citation statements)
references
References 33 publications
4
109
0
Order By: Relevance
“…As another check on the quality of the cell type assignments, we ran the cell and cluster independent haystack gene search and pairwise differential expression tests between the predicted cell types (see methods for further details). 47 We show the five most differentially expressed genes for each of the major retina cell types are consistent with known retinal cell markers (Figure 3a). As a simple metric to identify known and unknown genes relating to the cell type specific expression we search PubMed for the number of publications with two searches per gene.…”
Section: Resultssupporting
confidence: 64%
See 1 more Smart Citation
“…As another check on the quality of the cell type assignments, we ran the cell and cluster independent haystack gene search and pairwise differential expression tests between the predicted cell types (see methods for further details). 47 We show the five most differentially expressed genes for each of the major retina cell types are consistent with known retinal cell markers (Figure 3a). As a simple metric to identify known and unknown genes relating to the cell type specific expression we search PubMed for the number of publications with two searches per gene.…”
Section: Resultssupporting
confidence: 64%
“…To identify marker genes across the CellType (predict) and cluster groups, we used the scran findmarkers (wilcox test) along with the singleCellHaystack algorithm. 47 The scran findmarkers test runs a wilcox test in a pairwise manner (e.g. Rods vs all other cell types).…”
Section: Methodsmentioning
confidence: 99%
“…Gene selection methods abound, and those tailored to spatial transcriptomic data attempt to identify genes with high variance and whose expression is not random across the tissue. Genes can be scored according to their spatial autocorrelation (using Moran's I or Geary's C) 125 , neighbor enrichment (for example, in BinSpect) 111 or entropy (for example, in Haystack) 126 . Trendsceek 127 uses a marked point processes approach 128 and is able to identify hotspots, streaks, and gradients of expression.…”
Section: Selectmentioning
confidence: 99%
“…As the differential expression analysis in downstream depends on the clustering result, if a clustering algorithm fails to properly cluster a group of cells, the informative genes of that cluster will be missed. In this sense, the need for a method that can find DEGs independent of clustering was widely discussed 5,13,14 . We demonstrated this situation as a proof-of-concept using realistic simulation datasets generated by Symsim 19 , a simulator of single-cell RNA-seq experiments.…”
Section: Resultsmentioning
confidence: 99%
“…In our analysis, we show that they do not assign higher ranks to DEGs that are obviously differentially expressed among cell types with sufficient precision compared to the standard DEG analysis with clustering. Another more advanced approach is singleCellHaystack, a recently developed method that extracts the list of candidates for DEGs by examining non-random expression pattern before clustering 7 . However, although the method may sort out DEGs more accurately than the HVG methods, it does not tell which groups of cells ‘differentially’ express the identified genes.…”
Section: Introductionmentioning
confidence: 99%