“…Below is an array containing 30 sample data entries that represent the overall outcome of preprocessing:

['accelerating', 'academiaedu', 'analysing', 'apa', 'citation', 'cite', 'chicago', 'downloaded', 'formation', 'get', 'health', 'international', 'mla', 'patterns', 'public', 'pulmonary', 'rainfall', 'research', 'related', 'spatial', 'visual', 'hendra', 'rohman', 'science', 'paper', 'styles', 'tuberculosis', 'papers', 'world']

C. Exploratory Data Analysis (EDA) [36]: Calculating Coherence Score

In this phase, the number of topics is selected based on the coherence score, which indicates the model's capacity to present data in a comprehensible manner for humans. Words are weighted using the Term Frequency-Inverse Document Frequency (TF-IDF) technique to reduce unnecessary vocabulary and eliminate noise [22].…”
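As a rough illustration of TF-IDF weighting and coherence-based selection of the number of topics, the following minimal sketch uses only the Python standard library. Several things here are assumptions rather than details from the source: the toy corpus `docs`, the candidate topic word lists in `candidates`, and the choice of a UMass-style coherence measure (the excerpt does not say which coherence measure or library was used). The helper names `tfidf` and `umass_coherence` are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations

# Hypothetical toy corpus (NOT the paper's data); tokens are borrowed
# from the sample array of preprocessed words.
docs = [
    ["pulmonary", "tuberculosis", "spatial", "patterns", "rainfall"],
    ["pulmonary", "tuberculosis", "public", "health", "research"],
    ["citation", "styles", "apa", "mla", "chicago"],
    ["citation", "styles", "paper", "research"],
]


def tfidf(docs):
    """Return one {term: weight} dict per document, weight = tf * log(N / df)."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append(
            {t: (c / len(doc)) * math.log(n_docs / df[t]) for t, c in counts.items()}
        )
    return weights


def umass_coherence(top_words, docs):
    """UMass-style coherence: mean of log((D(wi, wj) + 1) / D(wj)) over word pairs.

    Assumes every word in top_words occurs in at least one document,
    so that D(wj) is never zero.
    """
    doc_sets = [set(d) for d in docs]

    def doc_freq(*words):
        # Number of documents containing all of the given words.
        return sum(all(w in s for w in words) for s in doc_sets)

    scores = [
        math.log((doc_freq(wi, wj) + 1) / doc_freq(wj))
        for wj, wi in combinations(top_words, 2)  # wj is the higher-ranked word
    ]
    return sum(scores) / len(scores)


# Hypothetical top words per topic for two candidate topic counts;
# in practice these would come from a fitted topic model.
candidates = {
    2: [["pulmonary", "tuberculosis", "health"], ["citation", "styles", "research"]],
    3: [["pulmonary", "citation"], ["tuberculosis", "styles"], ["health", "apa"]],
}

# Average the per-topic coherence for each candidate and keep the best one.
mean_coherence = {
    k: sum(umass_coherence(topic, docs) for topic in topics) / len(topics)
    for k, topics in candidates.items()
}
best_k = max(mean_coherence, key=mean_coherence.get)
weights = tfidf(docs)
```

In this toy setup, terms that occur in fewer documents (such as 'rainfall') receive higher TF-IDF weights than terms shared across documents (such as 'research'), and the candidate whose topics group co-occurring words obtains the higher mean coherence.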