2019
DOI: 10.1101/818393
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Machine Learning Approaches Identify Genes Containing Spatial Information from Single-Cell Transcriptomics Data

Abstract: 16Motivation: We participated in the DREAM Single Cell Transcriptomics Challenge. The 17 challenge's focus was two-fold; a) to identify the top 60, 40 and 20 genes that contain the most 18 spatial information, and b) to reconstruct the 3-D arrangement of the D. melanogaster embryo 19 using information from those genes. 20Results: We developed two independent approaches, leveraging machine learning models from 21 Lasso and Deep Neural Networks, that we successfully apply to high-dimensional single-cell 22 seque… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
2
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
3

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 17 publications
(26 reference statements)
0
2
0
Order By: Relevance
“…For the latter, the most used one was unsupervised or supervised feature importance estimation and ranking. For example, in a supervised feature importance estimation approach, a Random Forest (BCBU, OmicsEngineering) or a neural network (DeepCMC ( 16 Preprint ), NAD) were trained to predict the coordinates of each cell, given the transcriptomics data as input and using either all genes or the genes with available in situ hybridization measurements. There were examples of unsupervised feature importance estimation and ranking by expression-based clustering (NAD, Christoph Hafemeister, MLB), or a greedy feature selection based on predictability of expression from other genes (WhatATeam).…”
Section: Resultsmentioning
confidence: 99%
“…For the latter, the most used one was unsupervised or supervised feature importance estimation and ranking. For example, in a supervised feature importance estimation approach, a Random Forest (BCBU, OmicsEngineering) or a neural network (DeepCMC ( 16 Preprint ), NAD) were trained to predict the coordinates of each cell, given the transcriptomics data as input and using either all genes or the genes with available in situ hybridization measurements. There were examples of unsupervised feature importance estimation and ranking by expression-based clustering (NAD, Christoph Hafemeister, MLB), or a greedy feature selection based on predictability of expression from other genes (WhatATeam).…”
Section: Resultsmentioning
confidence: 99%
“…Authors would like to acknowledge Professor Isidore Rigoutsos for providing access to computational resources and reviewing their manuscript. This manuscript has been released as a pre-print at bioRxiv ( Loher and Karathanasis, 2019 ).…”
mentioning
confidence: 99%