2021
DOI: 10.1101/2021.02.09.430550
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

scPNMF: sparse gene encoding of single cells to facilitate gene selection for targeted gene profiling

Abstract: Single-cell RNA sequencing (scRNA-seq) captures whole transcriptome information of indi- vidual cells. While scRNA-seq measures thousands of genes, researchers are often interested in only dozens to hundreds of genes for a closer study. Then a question is how to select those informative genes from scRNA-seq data. Moreover, single-cell targeted gene profiling technologies are gaining popularity for their low costs, high sensitivity, and extra (e.g., spatial) information; however, they typically can only measure… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
6
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 55 publications
0
6
0
Order By: Relevance
“…When benchmarking for pre-clustering, the top 1000 variable genes identified by SeuratVST [35], scPNMF (v1.0) [37], and Scater (v1.20.1) [38] were used for dimension reduction, respectively. Rand index (ARI) values were calculated as described previously [37].…”
Section: Artificial Contamination Of Simulated Pbmc Single-cell Datasetmentioning
confidence: 99%
“…When benchmarking for pre-clustering, the top 1000 variable genes identified by SeuratVST [35], scPNMF (v1.0) [37], and Scater (v1.20.1) [38] were used for dimension reduction, respectively. Rand index (ARI) values were calculated as described previously [37].…”
Section: Artificial Contamination Of Simulated Pbmc Single-cell Datasetmentioning
confidence: 99%
“…Several of these approaches have also been applied to probe set selection 3,4,6 . Dedicated probe set selection approaches that use a reference scRNA-seq dataset to optimize for cell type classification [20][21][22][23][24][25][26][27][28][29][30] , or the capture of transcriptional variation [31][32][33][34] have also been proposed. However, few approaches account for both cell-type and gene variation, and none of the above methods include technical constraints in their selection procedure.…”
Section: Introductionmentioning
confidence: 99%
“…The number of selected genes is typically dependent on a user-defined threshold, but ordinarily is on the order of one to a few thousand genes [ 2 , 5 ]. A recently developed approach, scPNMF, further addresses the gene complexity problem by leveraging a Non-Negative Matrix Factorization (NMF) representation of scRNA-seq, with selected features being suggested to represent interesting biological variability in the data [ 6 ]. scPNMF relies on the chosen dimension for the NMF representation and also does not directly compare informativeness between different factors, thus impeding the ability to compare the importance (i.e., gene weights) between different factors.…”
Section: Introductionmentioning
confidence: 99%