2017
DOI: 10.1186/s12859-017-1565-4
|View full text |Cite
|
Sign up to set email alerts
|

Sparse Proteomics Analysis – a compressed sensing-based approach for feature selection and classification of high-dimensional proteomics mass spectrometry data

Abstract: BackgroundHigh-throughput proteomics techniques, such as mass spectrometry (MS)-based approaches, produce very high-dimensional data-sets. In a clinical setting one is often interested in how mass spectra differ between patients of different classes, for example spectra from healthy patients vs. spectra from patients having a particular disease. Machine learning algorithms are needed to (a) identify these discriminating features and (b) classify unknown spectra based on this feature set. Since the acquired dat… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
30
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 21 publications
(30 citation statements)
references
References 57 publications
0
30
0
Order By: Relevance
“…In this work we introduce a feature selection algorithm that takes steps to mitigate this issue. This method, which we call R ANK C ORR , is inspired by the method in [13] and relies on the ranking the scRNA-seq counts before marker selection. Below, we provide some intuition as to why ranking scRNA-seq data is a useful strategy for understanding scRNA-seq counts data.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…In this work we introduce a feature selection algorithm that takes steps to mitigate this issue. This method, which we call R ANK C ORR , is inspired by the method in [13] and relies on the ranking the scRNA-seq counts before marker selection. Below, we provide some intuition as to why ranking scRNA-seq data is a useful strategy for understanding scRNA-seq counts data.…”
Section: Introductionmentioning
confidence: 99%
“…Below, we provide some intuition as to why ranking scRNA-seq data is a useful strategy for understanding scRNA-seq counts data. The method in [13] is based on convex optimization (which would generally be quite slow on large data sets); it is possible to find a quick deterministic solution to this optimization, however, so that R ANK C ORR runs quickly. R ANK C ORR is able to handle over one million cells in an amount of time that is competitive with simple statistical methods.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Data reduction algorithms will be of critical importance in medicine going forward, having extensive application in the areas of disease risk assessment and personalized medicine. Major efforts are focused on improving classification while reducing computation [1]. Many algorithms have been proposed to classify MS data.…”
Section: Introductionmentioning
confidence: 99%
“…al [5] propose a Recursive Feature Elimination (SVMRFE) algorithm that selects important genes/biomarkers for the classification of noisy data. The sparse proteomics analysis (SPA) is another way to complete feature selection based on the compressive sensing concept [6]. Sparse features are a small subset of features that can be used to accurately predict unknown proteomic data.…”
Section: Introductionmentioning
confidence: 99%