2018
DOI: 10.1101/397042
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Fast Batch Alignment of Single Cell Transcriptomes Unifies Multiple Mouse Cell Atlases into an Integrated Landscape

Abstract: Increasing numbers of large scale single cell RNA-Seq projects are leading to a data explosion, which can only be fully exploited through data integration.Therefore, efficient computational tools for combining diverse datasets are crucial for biology in the single cell genomics era. A number of methods have been developed to assist data integration by removing technical batch effects, but most are computationally intensive. To overcome the challenge of enormous datasets, we have developed BBKNN, an extremely f… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
47
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 36 publications
(47 citation statements)
references
References 40 publications
0
47
0
Order By: Relevance
“…To achieve good clustering by cell types, number of counts, mitochondrial percentage and donor effects were regressed out. PCA was carried out on highly variable genes, the donor effect was reduced by BBKNN tool (43) . Leiden clustering (44) and B_cell (B_follicular, B_Hypermutation, B_mantle) in Spleen.…”
Section: Clustering and Annotation Of Cell Typesmentioning
confidence: 99%
“…To achieve good clustering by cell types, number of counts, mitochondrial percentage and donor effects were regressed out. PCA was carried out on highly variable genes, the donor effect was reduced by BBKNN tool (43) . Leiden clustering (44) and B_cell (B_follicular, B_Hypermutation, B_mantle) in Spleen.…”
Section: Clustering and Annotation Of Cell Typesmentioning
confidence: 99%
“…This approach will confound the batch effect with biological differences between cell types or states that are not shared among datasets. Data integration methods such as Canonical Correlation Analysis (CCA; Butler et al , ), Mutual Nearest Neighbours (MNN; Haghverdi et al , ), Scanorama (preprint: Hie et al , ), RISC (preprint: Liu et al , ), scGen (preprint: Lotfollahi et al , ), LIGER (preprint: Welch et al , ), BBKNN (preprint: Park et al , ), and Harmony (preprint: Korsunsky et al , ) have been developed to overcome this issue. While data integration methods can also be applied to simple batch correction problems, we recommend to be wary of over‐correction given the increased degrees of freedom of non‐linear data integration approaches.…”
Section: Introductionmentioning
confidence: 99%
“…In contrast, when analyzing and integrating samples of multiple conditions (i.e., disease vs. health), batch effect removal must be performed with caution. Several approaches have been proposed to reduce batch effect including mutual nearest neighbors (MNN) correct, seurat3, ResNet, Harmony, Scanorama, BBKNN, scGen, and so on. MNN Correct is used to identify the connections shared by two datasets.…”
Section: Limitation and Challenges Of Scrna‐seq Technologymentioning
confidence: 99%
“…The second is how to remove batch effect 90 ResNet, 92 Harmony, 93 Scanorama, 94 BBKNN, 95 scGen, 96 and so on. MNN Correct is used to identify the connections shared by two datasets.…”
Section: Limitation and Challenges Of Scrna-seq Technologymentioning
confidence: 99%