2007
DOI: 10.1093/bioinformatics/btm344
|View full text |Cite
|
Sign up to set email alerts
|

A review of feature selection techniques in bioinformatics

Abstract: Feature selection techniques have become an apparent need in many bioinformatics applications. In addition to the large pool of techniques that have already been developed in the machine learning and data mining fields, specific applications in bioinformatics have led to a wealth of newly proposed techniques. In this article, we make the interested reader aware of the possibilities of feature selection, providing a basic taxonomy of feature selection techniques, and discussing their use, variety and potential … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

8
2,770
0
48

Year Published

2008
2008
2020
2020

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 4,347 publications
(2,961 citation statements)
references
References 128 publications
8
2,770
0
48
Order By: Relevance
“…With the increased proliferation of powerful computational facilities, these algorithms have also been coupled with feature selection techniques that allow the decomposition of an optimal set of variables (spectral features) from the spectral data which maximize the predictive capacity and speed of implementation of the modelling algorithms, together with simplifying them and rendering them more interpretable scientifically [74]. All of the algorithms seek to calibrate a model that relates the spectral dataset (X-matrix) to a target of interest (Y-matrix, eg.…”
Section: Quantitative Multivariate Analytical Methodsologiesmentioning
confidence: 99%
“…With the increased proliferation of powerful computational facilities, these algorithms have also been coupled with feature selection techniques that allow the decomposition of an optimal set of variables (spectral features) from the spectral data which maximize the predictive capacity and speed of implementation of the modelling algorithms, together with simplifying them and rendering them more interpretable scientifically [74]. All of the algorithms seek to calibrate a model that relates the spectral dataset (X-matrix) to a target of interest (Y-matrix, eg.…”
Section: Quantitative Multivariate Analytical Methodsologiesmentioning
confidence: 99%
“…The large gap between optimal and estimated correlation is at least in part due to the inaccuracy of the cross-validation type error estimators with small sample size; see, e.g., [9]. …”
Section: Resultsmentioning
confidence: 99%
“…Significance was assessed by student t test. All statistical tests were done employing SPSS program version 16 (SPSS Software, SPSS Inc., Chicago, USA) and the differences were considered significant at p B 0.05 [19].…”
Section: Resultsmentioning
confidence: 99%