Our system is currently under heavy load due to increased usage. We're actively working on upgrades to improve performance. Thank you for your patience.
2022
DOI: 10.1101/2022.05.26.493607
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Accelerating single-cell genomic analysis with GPUs

Abstract: Single-cell genomic technologies are rapidly improving our understanding of cellular heterogeneity in biological systems. In recent years, technological and computational improvements have continuously increased the scale of single-cell experiments, and now allow for millions of cells to be analyzed in a single experiment. However, existing software tools for single-cell analysis do not scale well to such large datasets. RAPIDS is an open-source suite of Python libraries that use GPU computing to accelerate da… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
13
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(13 citation statements)
references
References 22 publications
0
13
0
Order By: Relevance
“…We then constructed a kNN graph (k=30) in the RSS space and clustered the data set using the Leiden algorithm38 (resolution=80). For both steps we used the GPU-accelerated RAPIDS implementation which is provided through scanpy 51,53 . For all cell type marker genes on a given level in the hierarchy, we computed the area under the receiver operating characteristic curve (AUROC) as well as the detection rate across clusters.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…We then constructed a kNN graph (k=30) in the RSS space and clustered the data set using the Leiden algorithm38 (resolution=80). For both steps we used the GPU-accelerated RAPIDS implementation which is provided through scanpy 51,53 . For all cell type marker genes on a given level in the hierarchy, we computed the area under the receiver operating characteristic curve (AUROC) as well as the detection rate across clusters.…”
Section: Methodsmentioning
confidence: 99%
“…We then constructed a kNN graph (k=30) in the RSS space and clustered the data set using the Leiden algorithm38 (resolution=80). For both steps we used the GPU-accelerated RAPIDS implementation which is provided through scanpy 51,53 .…”
Section: Methodsmentioning
confidence: 99%
“…Batch correction was then performed using Harmony ( 16 ) based on 17 lineage markers (CD163, CD68, CD19, CD20, CD21, CD3e, CD8, CD4, CD45RO, ICOS, FOXP3, E-CAD, Pan-Cytokeratin, Beta-Catenin, CD31, CD34, SMA) in order to minimize batch effects between the tissue slides. The feature matrix returned by Harmony was then used for unsupervised clustering using Leiden algorithm ( 17 ) and the GPU-accelerated package, Rapids ( 18 ). Leiden resolution equals to 1-6 were tested and the resolution of 4 was chosen for manual cluster annotation as it presents a sufficient number of clusters that separate the cells into smaller groups (N = 70) differentiated by protein expression variability over the technical variability.…”
Section: Methodsmentioning
confidence: 99%
“…For example, the GPU-accelerated tool RAPIDS [60] builds on top of the Dask framework in order to scale its data preparation and model training steps across multiple GPUs and machines [23]. In this assisting capacity, Dask has made it possible to scale single-cell analysis pipelines to upwards of millions of cells [61]. In large-scale distributed environments, i.e.…”
Section: Scaling Computational Biology With Daskmentioning
confidence: 99%