2020
DOI: 10.1186/s40537-020-00325-6
|View full text |Cite
|
Sign up to set email alerts
|

OsamorSoft: clustering index for comparison and quality validation in high throughput dataset

Abstract: The existence of some differences in the results obtained from varying clustering k-means algorithms necessitated the need for a simplified approach in validation of cluster quality obtained. This is partly because of differences in the way the algorithms select their first seed or centroid either randomly, sequentially or some other principles influences which tend to influence the final result outcome. Popular external cluster quality validation and comparison models require the computation of varying cluste… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 49 publications
0
5
0
Order By: Relevance
“…In most previous studies 4 , 14 , 24 , single machine learning algorithms have been used for classification, and in cases where more than one algorithm is used, the emphasis is on comparison to determine which of the algorithm had a better performance. However, in this work, we introduced the use of the ensemble technique to combine more than one machine learning algorithms to produce an optimal result that shows performance is better using an ensemble for classification than using a single classifier.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…In most previous studies 4 , 14 , 24 , single machine learning algorithms have been used for classification, and in cases where more than one algorithm is used, the emphasis is on comparison to determine which of the algorithm had a better performance. However, in this work, we introduced the use of the ensemble technique to combine more than one machine learning algorithms to produce an optimal result that shows performance is better using an ensemble for classification than using a single classifier.…”
Section: Resultsmentioning
confidence: 99%
“…This gives more credence to the model been developed to solve a particular classification issue. It is worthy to note that efforts have been made to develop medical diagnosis tools 24 especially for TB infections 25 this enhanced Weighed Voting Ensembled will further contribute to the machine learning solutions in TB diagnosis.…”
Section: Discussionmentioning
confidence: 99%
“…Two other measures: Matthews correlation coefficient (MCC) [44] and Fowlkes-Mallows index (FMI) [45] are defined as: .…”
Section: Algorithm 3 Pseudocode Of Csomentioning
confidence: 99%
“…In the literature on the clustering validation topics, one can find many alternative ways of building clustering validity indices, defining the combination strategy and estimation algorithm and justifying the final "true" number of clusters (Osamor & Osamor, 2020;Nerurkar et al, 2019;Kumar et al, 2019;Granichin et al, 2015;Santos & Embrechts, 2014). All these lead to a myriad of alternative simple and sophisticated clustering validation solutions.…”
Section: Cluster Analysismentioning
confidence: 99%