A Systematic Literature Review on Identifying Patterns Using Unsupervised Clustering Algorithms: A Data Mining Perspective

Chaudhry, Mahnoor; Shafi, Imran; Mahnoor, Mahnoor; Vargas, Debora Libertad Ramírez; Thompson, Ernesto Bautista; Ashraf, Imran

doi:10.3390/sym15091679

Cited by 9 publications

(4 citation statements)

References 139 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Afterward, unnecessary labels or information that could disrupt the data mining process are removed [21]. 2) Data Integration, the next stage involves data integration, where data that is separated into multiple tables is merged into one [22]. 3) Data Selection, the subsequent stage is data selection, where data or attributes needed for analysis and data mining are chosen.…”

Section: Data Processing Methodsmentioning

confidence: 99%

Application of Clustering-Based Data Mining for the Assessment of Nutritional Status in Toddlers at Community Health Centers

Fianty,

Johan,

Aulia

et al. 2023

J. Inf. Syst. Informatics

View full text Add to dashboard Cite

Nutritional status is a crucial foundation for human health and development. Global facts indicate serious challenges in ensuring adequate nutrition, and the situation is no different in Indonesia. This research collected data from the Kelapa Dua Tangerang community health center and utilized data mining techniques with the k-means clustering algorithm to delve deeper into the nutritional status of toddlers. The research findings revealed that nearly 37.3% of toddlers experience issues with abnormal height or weight, as well as poor nutritional conditions, highlighting the importance of careful and timely intervention. With regular health monitoring by community health centers and active parental involvement, actions can be taken to support the optimal growth and development of these children. The results of this research provide a strong understanding to address malnutrition issues, which will ultimately support the formation of a healthier and more promising future generation in Indonesia.

show abstract

Section: Data Processing Methodsmentioning

confidence: 99%

Application of Clustering-Based Data Mining for the Assessment of Nutritional Status in Toddlers at Community Health Centers

Fianty,

Johan,

Aulia

et al. 2023

J. Inf. Syst. Informatics

View full text Add to dashboard Cite

show abstract

“…Unsupervised methods, such as clustering algorithms (e.g., k-means or DBSCAN), can identify patterns and group similar data points without prior labels. These groups can then be used as input for supervised learning models like neural networks or decision trees, which further refine the detection process by learning from labeled data [78,79]. This combination allows the system to benefit from the exploratory nature of unsupervised learning while harnessing the accuracy of supervised learning [80].…”

Section: Hybrid Approachesmentioning

confidence: 99%

Combating the Challenges of False Positives in AI-Driven Anomaly Detection Systems and Enhancing Data Security in the Cloud

Olateju,

Okon,

Igwenagu

et al. 2024

Asian J. Res. Com. Sci.

View full text Add to dashboard Cite

Anomaly detection is critical for network security, fraud detection, and system health monitoring applications. Traditional methods like statistical approaches and distance-based techniques often struggle with high-dimensional and complex data, leading to high false positive rates. This study addresses the challenge by investigating advanced AI-driven techniques to reduce false positives and enhance data security within cloud computing environments. This study employs deep learning models, integrates contextual data, and incorporates comprehensive security measures to enhance anomaly detection performance. Data from synthetic sources, such as the NSL-KDD dataset and real-world cloud environments, were utilized to capture user behavior logs, system states, and network traffic. Over 50 academic journals were reviewed, and 21 were selected based on inclusion criteria, such as relevance to AI-driven anomaly detection, empirical performance metrics, and the focus on cloud environments, and exclusion criteria that filtered out studies lacking empirical data or not specific to cloud-based systems. Methodologically, the research involves a comparative analysis of different AI techniques and their impact on false positive rates, accuracy, precision, and recall. The findings demonstrate that deep learning techniques significantly outperform traditional methods, achieving a lower false positive rate and higher accuracy. The results underscore the importance of contextual data and robust security protocols in reliable anomaly detection. This research fills a gap by thoroughly evaluating advanced AI techniques for reducing false positives in cloud environments. The study's significance lies in guiding the development of more effective anomaly detection systems, thereby enhancing security and reliability across various applications. Additionally, organizations should invest in continuously developing and integrating AI-driven anomaly detection systems with comprehensive security measures to improve their effectiveness the study suggests that further study be conducted with large datasets to evaluate the effectiveness of Hybrid anomaly detection systems in detecting and addressing false positives.

show abstract

“…Ezugwu et al [7] and Saxena et al [8] reported that clustering techniques can be divided into two major categories, namely, hierarchical clustering algorithms and partition clustering algorithms. More clustering categories, including grid clustering, density clustering, and model clustering, were proposed by Chaudhry et al [9] and Oyewole and Thopil [10]. K-means and hierarchical clustering techniques are the most widely used algorithms in the literature.…”

Section: Clustering Techniques and Applications For Medical Data Anal...mentioning

confidence: 99%

“…It can be observed that most studies dealt with a single disease, and K-means was commonly used as a popular clustering technique for analyzing medical data. The clustering approaches can be generally classified into categories: hierarchical clustering algorithms and partition clustering algorithms [7][8][9][10]. This study employed four clustering methods, K-means (KM), hierarchical clustering (HC), the K-means autoencoder (AEKM), and the K-means self-organizing map (SOMKM), to analyze medical data.…”

Section: Clustering Techniques and Applications For Medical Data Anal...mentioning

confidence: 99%

Using Medical Data and Clustering Techniques for a Smart Healthcare System

Yang,

Lai,

Liu

et al. 2023

Electronics

View full text Add to dashboard Cite

With the rapid advancement of information technology, both hardware and software, smart healthcare has become increasingly achievable. The integration of medical data and machine-learning technology is the key to realizing this potential. The quality of medical data influences the results of a smart healthcare system to a great extent. This study aimed to design a smart healthcare system based on clustering techniques and medical data (SHCM) to analyze potential risks and trends in patients in a given time frame. Evidence-based medicine was also employed to explore the results generated by the proposed SHCM system. Thus, similar and different discoveries examined by applying evidence-based medicine could be investigated and integrated into the SHCM to provide personalized smart medical services. In addition, the presented SHCM system analyzes the relationship between health conditions and patients in terms of the clustering results. The findings of this study show the similarities and differences in the clusters obtained between indigenous patients and non-indigenous patients in terms of diseases, time, and numbers. Therefore, the analyzed potential health risks could be further employed in hospital management, such as personalized health education control, personal healthcare, improvement in the utilization of medical resources, and the evaluation of medical expenses.

show abstract

A Systematic Literature Review on Identifying Patterns Using Unsupervised Clustering Algorithms: A Data Mining Perspective

Cited by 9 publications

References 139 publications

Application of Clustering-Based Data Mining for the Assessment of Nutritional Status in Toddlers at Community Health Centers

Application of Clustering-Based Data Mining for the Assessment of Nutritional Status in Toddlers at Community Health Centers

Combating the Challenges of False Positives in AI-Driven Anomaly Detection Systems and Enhancing Data Security in the Cloud

Using Medical Data and Clustering Techniques for a Smart Healthcare System

Contact Info

Product

Resources

About