2023
DOI: 10.1007/s10462-023-10393-8
|View full text |Cite
|
Sign up to set email alerts
|

A review of semi-supervised learning for text classification

Abstract: A huge amount of data is generated daily leading to big data challenges. One of them is related to text mining, especially text classification. To perform this task we usually need a large set of labeled data that can be expensive, time-consuming, or difficult to be obtained. Considering this scenario semi-supervised learning (SSL), the branch of machine learning concerned with using labeled and unlabeled data has expanded in volume and scope. Since no recent survey exists to overview how SSL has been used in … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
2
2

Relationship

0
10

Authors

Journals

citations
Cited by 23 publications
(10 citation statements)
references
References 181 publications
1
5
0
Order By: Relevance
“…Semi-supervised learning is a learning approach that combines supervised and unsupervised learning ( 30 ). In the presence of a small amount of labeled data, semi-supervised models infer the structure and features of unlabeled data to perform classification and prediction tasks, thereby enhancing model performance with limited labeled data ( 31 ).…”
Section: Methodsmentioning
confidence: 99%
“…Semi-supervised learning is a learning approach that combines supervised and unsupervised learning ( 30 ). In the presence of a small amount of labeled data, semi-supervised models infer the structure and features of unlabeled data to perform classification and prediction tasks, thereby enhancing model performance with limited labeled data ( 31 ).…”
Section: Methodsmentioning
confidence: 99%
“…In Att-BILSTM, the attention mechanism can assign a weight to each input position, allowing the model to better focus on key parts and use softmax classifiers to predict labels from the discrete set of category Y in sentence S . The classifier takes hidden state h * as input ( Duarte & Berton, 2023 ):…”
Section: Fundamentals Of Research Theorymentioning
confidence: 99%
“…Machine learning-based methods use traditional machine learning algorithms to extract features from corpus samples with sentiment labels (Duarte and Berton, 2023). The current approaches can be classified into two categories: supervised (Naz et al, 2018;Rathor et al, 2018;Wang and Zhao, 2020) and semi-supervised (Fu et al, 2019;Wu et al, 2021;Palanivinayagam et al, 2023;Duan et al, 2020).…”
Section: Text Emotion Classificationmentioning
confidence: 99%