“…Analysis algorithms typically rely on aggregate statistics for groups of cells, but the process of grouping the cells works best with larger, established populations ( Diggins et al, 2015 ; Irish et al, 2006 ; Saeys et al, 2016 ) or may include pre-filtering of cells by human experts ( Greenplate et al, 2016a ; Greenplate et al, 2019 ). Cytometry tools like SPADE ( Bendall et al, 2011 ; Qiu et al, 2011 ), FlowSOM ( Van Gassen et al, 2015 ), Phenograph ( Levine et al, 2015 ), Citrus ( Bruggner et al, 2014 ), and RAPID ( Leelatian et al, 2020 ) generally work best to characterize cell subsets representing >1% of the sample and are less capable of capturing extremely rare cells or subsets distinguished by only a fraction of measured features. Tools like t-SNE ( Amir el et al, 2013 ; Krijthe et al, 2015 ), opt-SNE ( Belkina et al, 2019 ), and UMAP ( Becht et al, 2018 ; McInnes et al, 2018 ) embed cells or learn a manifold and represent these transformations as algorithmically-generated axes.…”