2017
DOI: 10.1007/s10994-017-5648-2
|View full text |Cite
|
Sign up to set email alerts
|

Simple strategies for semi-supervised feature selection

Abstract: What is the simplest thing you can do to solve a problem? In the context of semisupervised feature selection, we tackle exactly this-how much we can gain from two simple classifier-independent strategies. If we have some binary labelled data and some unlabelled, we could assume the unlabelled data are all positives, or assume them all negatives. These minimalist, seemingly naive, approaches have not previously been studied in depth. However, with theoretical and empirical studies, we show they provide powerful… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
8
2

Relationship

4
6

Authors

Journals

citations
Cited by 38 publications
(18 citation statements)
references
References 38 publications
0
16
0
Order By: Relevance
“…For feature selection, one is interested in ranking the features in order of mutual information between the features and the label. Interestingly, this order remains the same when the unlabeled examples are considered as negative [89].…”
Section: Hypothesis Testingmentioning
confidence: 77%
“…For feature selection, one is interested in ranking the features in order of mutual information between the features and the label. Interestingly, this order remains the same when the unlabeled examples are considered as negative [89].…”
Section: Hypothesis Testingmentioning
confidence: 77%
“…Semi -JMI is a method of using a semisupervised dataset as a training set for JMI. More details can be seen from Reference [36]. In this paper, the missingness mechanism is class-prior-change semisupervised scenario (MAR-C) [37].…”
Section: Methodsmentioning
confidence: 99%
“…We expect that this tool will prove beneficial in visualizing and interpreting biomarker investigations for clinical trials. Finally, by formalizing the problem of predictive biomarker discovery in information theoretic terms, we can potentially extend this work to other challenging scenarios, such as misclassification bias ( Sechidis et al , 2017 ) or partially labelled data ( Sechidis and Brown, 2018 ).…”
Section: Discussionmentioning
confidence: 99%