2018
DOI: 10.1016/j.jbi.2018.07.015
|View full text |Cite
|
Sign up to set email alerts
|

Benchmarking relief-based feature selection methods for bioinformatics data mining

Abstract: Modern biomedical data mining requires feature selection methods that can (1) be applied to large scale feature spaces (e.g. 'omics' data), (2) function in noisy problems, (3) detect complex patterns of association (e.g. gene-gene interactions), (4) be flexibly adapted to various problem domains and data types (e.g. genetic variants, gene expression, and clinical data) and (5) are computationally tractable. To that end, this work examines a set of filter-style feature selection algorithms inspired by the 'Reli… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
208
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 196 publications
(211 citation statements)
references
References 50 publications
(142 reference statements)
3
208
0
Order By: Relevance
“…Spatially uniform ReliefF (SURF) (Greene, Penrod, Kiralis, & Moore, ) is an extension of ReliefF that, instead of choosing a constant number of nearest neighbors, selects all neighbors within a fixed distance T of the individual. Furthermore, it has been recommended to apply an iterative Relief approach such as tuned ReliefF (TURF) in large feature space with more than 10,000 features (Urbanowicz et al, ). TURF algorithm (Moore & White, ) improves the performance of ReliefF by running it several times.…”
Section: Methodsmentioning
confidence: 99%
“…Spatially uniform ReliefF (SURF) (Greene, Penrod, Kiralis, & Moore, ) is an extension of ReliefF that, instead of choosing a constant number of nearest neighbors, selects all neighbors within a fixed distance T of the individual. Furthermore, it has been recommended to apply an iterative Relief approach such as tuned ReliefF (TURF) in large feature space with more than 10,000 features (Urbanowicz et al, ). TURF algorithm (Moore & White, ) improves the performance of ReliefF by running it several times.…”
Section: Methodsmentioning
confidence: 99%
“…Based on the iid assumption for X ia and X ja , it follows from Thm. 2.1 that the joint density function g (1) of X ia and Z q a is given by…”
Section: Distribution Of |Dmentioning
confidence: 99%
“…Let F Z q a denote the distribution function of the random variable Z q a . Furthermore, we define the events E (1) and E (2) as…”
Section: Distribution Of |Dmentioning
confidence: 99%
See 2 more Smart Citations