2023
DOI: 10.1109/access.2023.3309697
|View full text |Cite
|
Sign up to set email alerts
|

MCNN-LSTM: Combining CNN and LSTM to Classify Multi-Class Text in Imbalanced News Data

Khan Md Hasib,
Sami Azam,
Asif Karim
et al.

Abstract: Searching, retrieving, and arranging text in ever-larger document collections necessitate more efficient information processing algorithms. Document categorization is a crucial component of various information processing systems for supervised learning. As the quantity of documents grows, the performance of classic supervised classifiers has deteriorated because of the number of document categories. Assigning documents to a predetermined set of classes is called text classification. It is utilized extensively … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 23 publications
(4 citation statements)
references
References 52 publications
(49 reference statements)
0
4
0
Order By: Relevance
“…The methods of handling imbalanced data can be divided into algorithmic-level and data-level methods [41]. Algorithmic-level methods focus on designing new classification algorithms or enhancing existing ones (for example, [42][43][44]), while data-level methods attempt to balance the data by reducing the majority class or expanding the minority class.…”
Section: Handling Class Imbalancementioning
confidence: 99%
“…The methods of handling imbalanced data can be divided into algorithmic-level and data-level methods [41]. Algorithmic-level methods focus on designing new classification algorithms or enhancing existing ones (for example, [42][43][44]), while data-level methods attempt to balance the data by reducing the majority class or expanding the minority class.…”
Section: Handling Class Imbalancementioning
confidence: 99%
“…Data balancing is an important task to reduce model skewness and as such several works have made use of oversampling approaches to reduce model overfitting and increase performance [7][8][9]. Sarakit et al [10] employed the SMOTE method to detect emotion in unbalanced YouTube datasets using three machine learning classifiers.…”
Section: Related Workmentioning
confidence: 99%
“…Their approach utilizes a range of machine learning and deep learning models, with BERT reaching a maximum accuracy of 99.04% in balanced datasets and 72.23% in imbalanced datasets. Another noteworthy contribution by (Hasib et al, 2023a) introduces MCNN-LSTM, a novel fusion of CNN and LSTM for news text classification. After balancing the dataset using the Tomek-Link algorithm, their model attains remarkable performance, achieving a 98% F1-score and 99.71% accuracy compared to prior research.…”
Section: Handling Class Imbalancementioning
confidence: 99%