2020
DOI: 10.11591/ijeecs.v17.i3.pp1524-1530
|View full text |Cite
|
Sign up to set email alerts
|

Medical documents classification using topic modeling

Abstract: The number of digital medical documents is increasing continuously; several medical websites share a lot of unclassified articles. These articles have very long texts that should be read to determine the topic of each document. The classification of these documents is important so researchers can use these documents easily and the effort and time in reading and searching for a specific topic will be reduced. Therefore, an automatic way to extract latent topics from these text documents is needed. Topic modelin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 20 publications
(15 reference statements)
0
2
0
Order By: Relevance
“…Ahn et al [11] implemented topic modelling using LDA to find similar and different topics between certain organisations regarding Ridgecrest earthquake tweets to predict public engagement and encourage more effective communication during natural disasters. Other than that, in this study, LDA got a resulting accuracy rate of 71.4% for documents correctly classified, which involves developing topic modeling for medical documents with constructing a document term matrix to capture word occurrences in each document [12].…”
Section: Introductionmentioning
confidence: 92%
“…Ahn et al [11] implemented topic modelling using LDA to find similar and different topics between certain organisations regarding Ridgecrest earthquake tweets to predict public engagement and encourage more effective communication during natural disasters. Other than that, in this study, LDA got a resulting accuracy rate of 71.4% for documents correctly classified, which involves developing topic modeling for medical documents with constructing a document term matrix to capture word occurrences in each document [12].…”
Section: Introductionmentioning
confidence: 92%
“…To date, most of the text classification methods generally used to assign multiple topics to documents [6], grouping of documents into a fixed number of predefined classes [7], sentiment analysis to determine the viewpoint/polarity of a writer with respect to some topic [8], spam filtering of emails [9], automatic hate speech detection [10]. In the era of big data, the increasing number of complex documents makes traditional machine learning methods difficult to implement because conventional learning processes are not designed for big data and will not work properly with high data volumes.…”
Section: Introductionmentioning
confidence: 99%