2020
DOI: 10.1101/2020.10.08.332288
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Automated quality control and cell identification of droplet-based single-cell data using dropkick

Abstract: A major challenge for droplet-based single-cell sequencing technologies is distinguishing true cells from uninformative barcodes in datasets with disparate library sizes confounded by high technical noise (i.e. batch-specific ambient RNA). We present dropkick, a fully automated software tool for quality control and filtering of single-cell RNA sequencing (scRNA-seq) data with a focus on excluding ambient barcodes and recovering real cells bordering the quality threshold. By automatically determining dataset-sp… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
16
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3
2

Relationship

4
1

Authors

Journals

citations
Cited by 8 publications
(16 citation statements)
references
References 28 publications
0
16
0
Order By: Relevance
“…Single-cell analysis identifies neoplastic cells that arose from subtype-specific tumorigenic processes We generated scRNA-seq data on 70,691 (DIS dataset) and 71,374 cells (VAL dataset) (total: 142,065), after filtering for high-quality barcodes using dropkick (Heiser et al, 2021), and See also Figure S1 and Table S1. Article cells from specimens with unconfirmed histology (labeled UNC) were transcriptomically classified (Table S3).…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Single-cell analysis identifies neoplastic cells that arose from subtype-specific tumorigenic processes We generated scRNA-seq data on 70,691 (DIS dataset) and 71,374 cells (VAL dataset) (total: 142,065), after filtering for high-quality barcodes using dropkick (Heiser et al, 2021), and See also Figure S1 and Table S1. Article cells from specimens with unconfirmed histology (labeled UNC) were transcriptomically classified (Table S3).…”
Section: Resultsmentioning
confidence: 99%
“…COLON MAP scRNA-seq, droplet matrix quality control We identified high-quality, cell-containing droplets and their respective barcodes through the joint application of cumulative sum inflection point thresholding, our dropkick QC algorithm (Heiser et al, 2021), and prior-knowledge gene expression profiling. This droplet matrix was processed as an AnnData object using our preprocessing pipeline which utilizes the Scanpy toolkit (Wolf et al, 2018).…”
Section: Colon Map Whole Exome Sequencing and Alignmentmentioning
confidence: 99%
“…This variant serves the same function as the heuristic droplet filtering section. For automated droplet filtering in Python, dropkick is a machine learning tool that builds a probabilistic model of single-cell barcode transcriptome quality and returns a score for all barcodes in the input scRNA-seq droplet matrix (see step 7 for generating .h5ad from DropEst files) ( Heiser et al., 2020 ). dropkick can be run from the command line or interactively in a Jupyter Notebook.…”
Section: Step-by-step Methods Detailsmentioning
confidence: 99%
“…Step 21: Automated droplet filtering with the dropkick Python package provides quality control metrics and cell probabilities for each barcode that can be used to subset an unfiltered counts matrix using a gene-based logistic regression model ( Heiser, et al. 2020 ).…”
Section: Expected Outcomesmentioning
confidence: 99%
See 1 more Smart Citation