Qualitative Data Clustering to Detect Outliers

Nowak-Brzezińska, Agnieszka; Łazarz, Weronika

doi:10.3390/e23070869

Cited by 6 publications

(3 citation statements)

References 29 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“… 7 Like other centroid-based clustering algorithms, K-means is sensitive to outliers; the source datasets for this study were subjected to outlier imputation, defining observations in the top and bottom 1% of each continuous variable’s distribution as outliers, as previously described. 2 , 3 , 8 The optimal number of clusters was determined by calculating the within-cluster variance for a range of 1–9 clusters and identifying the inflection point at which a greater number of clusters (and attendant decrease in cluster sizes) would not substantially tighten the clusters (decrease the within-cluster sum of squares). Value of care, the primary outcome, was calculated as inverted observed-to-expected mortality ratios divided by median total costs and multiplied by a constant, as previously described.…”

Section: Methodsmentioning

confidence: 99%

Association of Sociodemographic Factors With Overtriage, Undertriage, and Value of Care After Major Surgery

Loftus,

Ruppert,

Shickel

et al. 2024

Annals of Surgery Open

View full text Add to dashboard Cite

Objective: To determine whether certain patients are vulnerable to errant triage decisions immediately after major surgery and whether there are unique sociodemographic phenotypes within overtriaged and undertriaged cohorts. Background: In a fair system, overtriage of low-acuity patients to intensive care units (ICUs) and undertriage of high-acuity patients to general wards would affect all sociodemographic subgroups equally. Methods: This multicenter, longitudinal cohort study of hospital admissions immediately after major surgery compared hospital mortality and value of care (risk-adjusted mortality/total costs) across 4 cohorts: overtriage (N = 660), risk-matched overtriage controls admitted to general wards (N = 3077), undertriage (N = 2335), and risk-matched undertriage controls admitted to ICUs (N = 4774). K-means clustering identified sociodemographic phenotypes within overtriage and undertriage cohorts. Results: Compared with controls, overtriaged admissions had a predominance of male patients (56.2% vs 43.1%, P < 0.001) and commercial insurance (6.4% vs 2.5%, P < 0.001); undertriaged admissions had a predominance of Black patients (28.4% vs 24.4%, P < 0.001) and greater socioeconomic deprivation. Overtriage was associated with increased total direct costs [$16.2K ($11.4K–$23.5K) vs $14.1K ($9.1K–$20.7K), P < 0.001] and low value of care; undertriage was associated with increased hospital mortality (1.5% vs 0.7%, P = 0.002) and hospice care (2.2% vs 0.6%, P < 0.001) and low value of care. Unique sociodemographic phenotypes within both overtriage and undertriage cohorts had similar outcomes and value of care, suggesting that triage decisions, rather than patient characteristics, drive outcomes and value of care. Conclusions: Postoperative triage decisions should ensure equality across sociodemographic groups by anchoring triage decisions to objective patient acuity assessments, circumventing cognitive shortcuts and mitigating bias.

show abstract

Section: Methodsmentioning

confidence: 99%

Association of Sociodemographic Factors With Overtriage, Undertriage, and Value of Care After Major Surgery

Loftus,

Ruppert,

Shickel

et al. 2024

Annals of Surgery Open

View full text Add to dashboard Cite

show abstract

“…Data quality is very important, as it is affected by the number of variables and the amount of data acquired, which can lead to information sparsity, especially in cases where the quality of the data appears to be poor [21]. In addition, process analysis allows for the observation of unusual activities and behaviors, which can lead to the detection of "outliers", alarm objects, and calls for intervention [22]. Given the power of the method, LA can therefore be a major feedback tool for educators and instructional designers to improve the learning experience [23].…”

Section: Related Workmentioning

confidence: 99%

Acquiring, Analyzing and Interpreting Knowledge Data for Sustainable Engineering Education: An Experimental Study Using YouTube

et al. 2022

View full text Add to dashboard Cite

With the immersion of a plethora of technological tools in the early post-COVID-19 era in university education, instructors around the world have been at the forefront of implementing hybrid learning spaces for knowledge delivery. The purpose of this experimental study is not only to divert the primary use of a YouTube channel into a tool to support asynchronous teaching; it also aims to provide feedback to instructors and suggest steps and actions to implement in their teaching modules to ensure students’ access to new knowledge while promoting their engagement and satisfaction, regardless of the learning environment, i.e., face-to-face, distance and hybrid. Learners’ viewing habits were analyzed in depth from the channel’s 37 instructional videos, all of which were related to the completion of a computer-aided mechanical design course. By analyzing and interpreting data directly from YouTube channel reports, six variables were identified and tested to quantify the lack of statistically significant changes in learners’ viewing habits. Two time periods were specifically studied: 2020–2021, when instruction was delivered exclusively via distance education, and 2021–2022, in a hybrid learning mode. The results of both parametric and non-parametric statistical tests showed that “Number of views” and “Number of unique viewers” are the two variables that behave the same regardless of the two time periods studied, demonstrating the relevance of the proposed concept for asynchronous instructional support regardless of the learning environment. Finally, a forthcoming instructor’s manual for learning CAD has been developed, integrating the proposed methodology into a sustainable academic educational process.

show abstract

“…The presence of artifacts in the signals, as well as the estimation of parameters without a priori signal processing, can lead to the generation of false alarms or isolation of false outliers, which is becoming a relevant topic in database processing [ 16 ]. Recording by a capacitive electrode in clinical conditions, while the subjects were sitting still, was analyzed to determine the possibility of clinical application [ 17 ].…”

Section: Introductionmentioning

confidence: 99%

Reduction of Artifacts in Capacitive Electrocardiogram Signals of Driving Subjects

Škorić

2021

Entropy

View full text Add to dashboard Cite

The development of smart cars with e-health services allows monitoring of the health condition of the driver. Driver comfort is preserved by the use of capacitive electrodes, but the recorded signal is characterized by large artifacts. This paper proposes a method for reducing artifacts from the ECG signal recorded by capacitive electrodes (cECG) in moving subjects. Two dominant artifact types are coarse and slow-changing artifacts. Slow-changing artifacts removal by classical filtering is not feasible as the spectral bands of artifacts and cECG overlap, mostly in the band from 0.5 to 15 Hz. We developed a method for artifact removal, based on estimating the fluctuation around linear trend, for both artifact types, including a condition for determining the presence of coarse artifacts. The method was validated on cECG recorded while driving, with the artifacts predominantly due to the movements, as well as on cECG recorded while lying, where the movements were performed according to a predefined protocol. The proposed method eliminates 96% to 100% of the coarse artifacts, while the slow-changing artifacts are completely reduced for the recorded cECG signals larger than 0.3 V. The obtained results are in accordance with the opinion of medical experts. The method is intended for reliable extraction of cardiovascular parameters to monitor driver fatigue status.

show abstract

Qualitative Data Clustering to Detect Outliers

Cited by 6 publications

References 29 publications

Association of Sociodemographic Factors With Overtriage, Undertriage, and Value of Care After Major Surgery

Association of Sociodemographic Factors With Overtriage, Undertriage, and Value of Care After Major Surgery

Acquiring, Analyzing and Interpreting Knowledge Data for Sustainable Engineering Education: An Experimental Study Using YouTube

Reduction of Artifacts in Capacitive Electrocardiogram Signals of Driving Subjects

Contact Info

Product

Resources

About