2020
DOI: 10.1186/s13059-020-02054-8
|View full text |Cite
|
Sign up to set email alerts
|

CB2 improves power of cell detection in droplet-based single-cell RNA sequencing data

Abstract: An important challenge in pre-processing data from droplet-based single-cell RNA sequencing protocols is distinguishing barcodes associated with real cells from those binding background reads. Existing methods test barcodes individually and consequently do not leverage the strong cell-to-cell correlation present in most datasets. To improve cell detection, we introduce CB2, a cluster-based approach for distinguishing real cells from background barcodes. As demonstrated in simulated and case study datasets, CB2… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
19
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 18 publications
(19 citation statements)
references
References 29 publications
0
19
0
Order By: Relevance
“…The pipeline then performs the following initial cell filtering steps: 1) true cells are distinguished from empty droplets based on the cumulative distribution of total molecule counts; 2) cells with a high fraction of mitochondrial molecules are filtered (> 20%); and 3) cells with low library complexity are filtered (cells that express very few unique genes). In addition, we perform additional filtering of empty droplets using the CB2 package with parameter ''lower'' set at 100 to estimate the background distribution of ambient RNA and an FDR threshold of 0.01 for calling real cells (Ni et al, 2020). Putative doublets were removed using the DoubletDetection package (https://doi.org/10.5281/zenodo.2658729).…”
Section: Statistical Analysis Of In Vitro and In Vivo Experimentsmentioning
confidence: 99%
“…The pipeline then performs the following initial cell filtering steps: 1) true cells are distinguished from empty droplets based on the cumulative distribution of total molecule counts; 2) cells with a high fraction of mitochondrial molecules are filtered (> 20%); and 3) cells with low library complexity are filtered (cells that express very few unique genes). In addition, we perform additional filtering of empty droplets using the CB2 package with parameter ''lower'' set at 100 to estimate the background distribution of ambient RNA and an FDR threshold of 0.01 for calling real cells (Ni et al, 2020). Putative doublets were removed using the DoubletDetection package (https://doi.org/10.5281/zenodo.2658729).…”
Section: Statistical Analysis Of In Vitro and In Vivo Experimentsmentioning
confidence: 99%
“…Often, these cells are excluded by setting a minimum threshold on the number of UMIs per cell and/or genes detected. (2) Another aspect unique to droplet-based microfluidic devices is that the majority of the droplets (>90%) will not contain an actual cell 7 , 8 . Despite the absence of a cell, these “empty droplets” may contain low levels of background ambient RNA that was present in the cell solution 9 .…”
Section: Introductionmentioning
confidence: 99%
“…The raw Cell Ranger barcode matrix output for each scRNA-seq dataset was filtered to remove empty droplets using the cluster-based R package scCB2 (Ni et al, 2020). Using EmptyDrops (ED) as a scaffold, scCB2 increases the power to distinguish real cells from background empty droplets as well as low-expressing, small cells by pooling low-count barcodes with similar gene expression patterns.…”
Section: Methodsmentioning
confidence: 99%