IEEE SoutheastCon 2014
DOI: 10.1109/secon.2014.6950649

Evaluation of semi-supervised learning for classification of protein crystallization imagery

Abstract: In this paper, we investigate the performance of two wrapper methods for semi-supervised learning algorithms for classification of protein crystallization images with limited labeled images. Firstly, we evaluate the performance of a semi-supervised approach using self-training with naïve Bayesian (NB) and sequential minimal optimization (SMO) as the base classifiers. The confidence values returned by these classifiers are used to select high-confidence predictions for self-training. Secondly, we analyz…
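As a concrete illustration of the wrapper scheme the abstract describes, below is a minimal self-training sketch in Python. It is not the paper's implementation: scikit-learn's GaussianNB stands in for the NB base classifier, and the confidence threshold and iteration cap are illustrative assumptions.

# Minimal self-training sketch (not the paper's code): GaussianNB stands in
# for the NB base classifier; threshold and max_iter are illustrative.
import numpy as np
from sklearn.naive_bayes import GaussianNB

def self_train(X_labeled, y_labeled, X_unlabeled, threshold=0.85, max_iter=10):
    """Iteratively absorb high-confidence predictions into the training set."""
    X_train, y_train = X_labeled.copy(), y_labeled.copy()
    pool = X_unlabeled.copy()
    clf = GaussianNB()
    for _ in range(max_iter):
        if len(pool) == 0:
            break
        clf.fit(X_train, y_train)
        proba = clf.predict_proba(pool)   # per-class confidence values
        conf = proba.max(axis=1)          # confidence of the predicted class
        keep = conf >= threshold
        if not keep.any():
            break                         # nothing confident enough; stop early
        # Adopt the classifier's own labels for the confident samples.
        X_train = np.vstack([X_train, pool[keep]])
        y_train = np.concatenate(
            [y_train, clf.classes_[proba[keep].argmax(axis=1)]])
        pool = pool[~keep]
    return clf.fit(X_train, y_train)

Selecting only predictions above the threshold mirrors the abstract's use of classifier confidence values to decide which self-labeled images join the next training round.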

Cited by 10 publications (10 citation statements, spanning 2017–2023). References 9 publications.
“…More analytically, we evaluated the performance of self-training and Yet Another Two Stage Idea (YATSI) approaches with classic classification methods. Self-training and YATSI constitute two of the most efficient and frequently utilized semi-supervised algorithms and have been successfully used in a variety of real-world applications (Catal & Diri, 2009; Driessens, Reutemann, Pfahringer, & Leschi, 2006; Levatic, Dzeroski, Supek, & Smuc, 2013; Roli & Marcialis, 2006; Rosenberg, Hebert, & Schneiderman, 2005; Sigdel et al., 2014), providing promising classification results. Our preliminary numerical experiments illustrate that classification accuracy can be significantly improved by using a few labeled and many unlabeled data to develop reliable prediction models.…”
Section: Introduction (mentioning; confidence: 99%)
“…In the literature, a variety of self-labeled methods have been proposed, each with a different philosophy and methodology for exploiting the information hidden in the unlabeled data. In this work, we focus our attention on self-training, co-training and tri-training, which constitute the most useful and commonly used self-labeled methods [12,16,20,21]. Notice that the crucial difference between them is the mechanism used to label unlabeled data.…”
Section: A Review of Semi-supervised Self-labeled Learning (mentioning; confidence: 99%)
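To make the difference in labeling mechanism concrete, the sketch below contrasts co-training with the self-training loop shown earlier: two learners trained on disjoint feature "views" label their most confident samples for each other. The two-view split, the per-round batch size, and the GaussianNB learners are illustrative assumptions, not details from the quoted work.

# Minimal co-training sketch (illustrative assumptions throughout).
import numpy as np
from sklearn.naive_bayes import GaussianNB

def co_train(X_lab, y_lab, X_unlab, view1, view2, rounds=10, per_round=5):
    """Two learners on disjoint feature views label confident samples for each other."""
    X1, X2 = X_lab[:, view1], X_lab[:, view2]
    y1, y2 = y_lab.copy(), y_lab.copy()
    pool = X_unlab.copy()
    c1, c2 = GaussianNB(), GaussianNB()
    for _ in range(rounds):
        if len(pool) == 0:
            break
        c1.fit(X1, y1)
        c2.fit(X2, y2)
        p1 = c1.predict_proba(pool[:, view1])
        p2 = c2.predict_proba(pool[:, view2])
        # Each learner nominates the unlabeled samples it is most sure about.
        top1 = np.argsort(p1.max(axis=1))[-per_round:]
        top2 = np.argsort(p2.max(axis=1))[-per_round:]
        # c1's confident picks, with c1's labels, extend c2's training set
        # (and vice versa); this cross-teaching is what distinguishes co-training.
        X2 = np.vstack([X2, pool[top1][:, view2]])
        y2 = np.concatenate([y2, c1.classes_[p1[top1].argmax(axis=1)]])
        X1 = np.vstack([X1, pool[top2][:, view1]])
        y1 = np.concatenate([y1, c2.classes_[p2[top2].argmax(axis=1)]])
        pool = np.delete(pool, np.unique(np.concatenate([top1, top2])), axis=0)
    return c1.fit(X1, y1), c2.fit(X2, y2)

Tri-training follows the same pattern with three learners, where an unlabeled sample is labeled for one learner when the other two agree on its class.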
“…It has been established as a very popular algorithm due to its simplicity, and it is often found to be more accurate than other semi-supervised algorithms [16,20,23]. In the self-training framework, an arbitrary classifier is initially trained with a small amount of labeled data, which comprises its training set, and is then used to classify the unlabeled points.…”
Section: Self-training (mentioning; confidence: 99%)
“…Hence, these methods have the advantage of reducing the supervision effort to a minimum while still preserving competitive recognition performance. Nowadays, these algorithms attract great interest both in theory and in practice and have become a topic of significant research as an alternative to traditional machine learning methods, since they require less human effort and frequently achieve higher accuracy [4][5][6][7][8][9][10]. The main issue in semi-supervised learning is how to efficiently exploit the information hidden in the unlabeled data.…”
Section: Introduction (mentioning; confidence: 99%)
“…Self-training constitutes perhaps the most popular and frequently used SSL algorithm due to its simplicity and classification accuracy [4,5,9]. This algorithm wraps around a base learner and uses its own predictions to assign labels to unlabeled data.…”
Section: Introduction (mentioning; confidence: 99%)
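As a hypothetical usage of the self_train sketch defined above, the snippet below runs it on synthetic data; the feature dimensionality, class count, and values are placeholders, not the paper's protein crystallization features.

# Synthetic usage of the self_train sketch above (all shapes illustrative).
import numpy as np

rng = np.random.default_rng(0)
X_lab = rng.normal(size=(20, 4))       # a few labeled feature vectors
y_lab = rng.integers(0, 3, size=20)    # e.g., three image classes
X_unlab = rng.normal(size=(500, 4))    # many unlabeled feature vectors

model = self_train(X_lab, y_lab, X_unlab, threshold=0.9)
print(model.predict(X_unlab[:5]))      # predicted labels for unseen samples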