2019
DOI: 10.1186/s13059-019-1917-7
|View full text |Cite
|
Sign up to set email alerts
|

A comparison framework and guideline of clustering methods for mass cytometry data

Abstract: BackgroundWith the expanding applications of mass cytometry in medical research, a wide variety of clustering methods, both semi-supervised and unsupervised, have been developed for data analysis. Selecting the optimal clustering method can accelerate the identification of meaningful cell populations.ResultTo address this issue, we compared three classes of performance measures, “precision” as external evaluation, “coherence” as internal evaluation, and stability, of nine methods based on six independent bench… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

2
110
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 99 publications
(113 citation statements)
references
References 38 publications
2
110
0
Order By: Relevance
“…Hypercluster allows comprehensive evaluation of multiple hyperparameters and clustering algorithms simultaneously, reducing the allure of biased or arbitrary parameter selection. It also aids computational biologists who are testing and benchmarking new clustering algorithms, evaluation metrics and pre-or post-processing steps [10]. Future iterations of hypercluster could include further cutting-edge clustering techniques, including those designed for larger data sets [31,32] or account for multiple types of data [48].…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Hypercluster allows comprehensive evaluation of multiple hyperparameters and clustering algorithms simultaneously, reducing the allure of biased or arbitrary parameter selection. It also aids computational biologists who are testing and benchmarking new clustering algorithms, evaluation metrics and pre-or post-processing steps [10]. Future iterations of hypercluster could include further cutting-edge clustering techniques, including those designed for larger data sets [31,32] or account for multiple types of data [48].…”
Section: Discussionmentioning
confidence: 99%
“…Clustering is a particularly central challenge in the analysis of single-cell measurement data (e.g. single cell RNA-seq) due to its high dimensionality [ 8 10 ]. Clustering is also increasingly being used for disease subtype classification and risk stratification [ 11 19 ].…”
Section: Introductionmentioning
confidence: 99%
“…For instance, random sub-sampling of cells from each sample is routinely carried out to reduce computational complexity and clustering time 15 . Therefore, to overcome some of the limitations associated with unsupervised clustering, semi-automated approaches use "prior" knowledge or "ground truth" about the marker expression in each of the given cell types to annotated every cell of the unlabeled dataset 16 . Currently, only a few semi-automated approaches for cell label predictions are available, viz.…”
Section: Introductionmentioning
confidence: 99%
“…Both ACDC and SCINA, uses the list of pre-defined markers for a given cell type to annotate the unsupervised cell cluster(s) that expresses these signature markers. However, these methods depend upon the unsupervised clustering of cells and assume that the expression of target marker as binary (expressed or not expressed), which restricts their ability to classify highly similar cell sub-types, especially non-canonical cell types, that cannot be separated linearly 10,16 . Instead of defined marker lists, DeepCyTOF and LDA use manually gated cell types in form of a marker expression matrix, as training and test set, to build a Machine Learning (ML) model for the cell type prediction.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation