2022
DOI: 10.31577/cai_2022_1_98
|View full text |Cite
|
Sign up to set email alerts
|

Reducing the Effect of Imbalance in Text Classification Using SVD and GloVe with Ensemble and Deep Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 0 publications
0
1
0
Order By: Relevance
“…GloVe is used to represent words using an embedding matrix containing many words. Each of these words corresponds to several numerical values, representing the vectors embedding this word, which are then employed as the input layer for neural networks of deep learning classifiers [13], [14]. Recurrent neural network (RNN) is one type of deep learning classifier based on keeping the output of a certain layer and feeding it back to the input to predict the layer's output, but it suffers from the problem of vanishing and exploding gradients.…”
Section: Preliminariesmentioning
confidence: 99%
“…GloVe is used to represent words using an embedding matrix containing many words. Each of these words corresponds to several numerical values, representing the vectors embedding this word, which are then employed as the input layer for neural networks of deep learning classifiers [13], [14]. Recurrent neural network (RNN) is one type of deep learning classifier based on keeping the output of a certain layer and feeding it back to the input to predict the layer's output, but it suffers from the problem of vanishing and exploding gradients.…”
Section: Preliminariesmentioning
confidence: 99%
“…The relevance of the terms in the corpus texts is assessed using TF-IDF. The TF-IDF equation is shown below [14,44]. Here tf i,j = the total number of occurrences of i in j, df i = the total number of documents containing i, and N = the total number of documents.…”
Section: Feature Extractionmentioning
confidence: 99%
“…It is worth highlighting that the majority of systems for diagnosing thyroid disease relied on attribute selection whereas, model training was carried out using an imbalanced dataset. Numerous research demonstrated that skewed results are produced by imbalanced data [ 32 , 33 ]. Nevertheless, since they lack sufficient prior knowledge, they may even provide overfitted or under-fitted predictions [ 34 ].…”
Section: Introductionmentioning
confidence: 99%