2019
DOI: 10.1101/810234
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Unifying single-cell annotations based on the Cell Ontology

Abstract: Single cell technologies have rapidly generated an unprecedented amount of data that enables us to understand biological systems at single-cell resolution. However, analyzing datasets generated by independent labs remains challenging due to a lack of consistent terminology to describe cell types. Here, we present OnClass, an algorithm and accompanying software for automatically classifying cells into cell types represented by a controlled vocabulary derived from the Cell Ontology. Cell type similarity is infer… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
4

Relationship

1
9

Authors

Journals

citations
Cited by 22 publications
(20 citation statements)
references
References 48 publications
(50 reference statements)
0
20
0
Order By: Relevance
“…In order to combine the two matrices into a single aggregated query, we "humanized" the mouse expression matrix by mapping mouse genes to their orthologous human genes. This mapping was computed using the biomaRt R package 47 , mapping mgi_symbol from the mmusculus_gene_ensembl database to hgnc_symbol from the hsapien_gene_ensembl database. We represented this map as a matrix, with mouse genes as rows, human genes as columns, and values in {0,1} assigned to denote whether a mouse gene maps to a human gene.…”
Section: Constructing the Pancreas Query With Mouse And Humanmentioning
confidence: 99%
“…In order to combine the two matrices into a single aggregated query, we "humanized" the mouse expression matrix by mapping mouse genes to their orthologous human genes. This mapping was computed using the biomaRt R package 47 , mapping mgi_symbol from the mmusculus_gene_ensembl database to hgnc_symbol from the hsapien_gene_ensembl database. We represented this map as a matrix, with mouse genes as rows, human genes as columns, and values in {0,1} assigned to denote whether a mouse gene maps to a human gene.…”
Section: Constructing the Pancreas Query With Mouse And Humanmentioning
confidence: 99%
“…Such methods often tend to use class embeddings [46] and try to leverage prior knowledge on similarities between the classes. A similar approach has been applied in predicting novel cell types from gene expression data using Cell Ontology [22] embeddings [47]. Such approaches can additionally be useful for describing new terms that are occasionally added to the ontology, even before they accumulate many annotations.…”
Section: Prediction Methodsmentioning
confidence: 99%
“…Generalized cell type prediction within an ontology adjusts for annotation coarseness A core difficulty for deploying predictive models for cell type labels based on single-cell RNA-seq is that the changing universe of cell types is redefined and new types discovered as part of cellular atlas efforts 12 . We address this issue by defining versioned cell type universes, which are sets of leave nodes of a cell ontology that correspond to the most fine grained cell types that are characterized within a given tissue.…”
Section: Sfaira Versions Decentralized Parametric Models To Allow Repmentioning
confidence: 99%