Abstract. From medical charts to national census, healthcare has traditionally operated under a paper-based paradigm. However, the past decade has marked a long and arduous transformation bringing healthcare into the digital age. Ranging from electronic health records, to digitized imaging and laboratory reports, to public health datasets, today, healthcare now generates an incredible amount of digital information. Such a wealth of data presents an exciting opportunity for integrated machine learning solutions to address problems across multiple facets of healthcare practice and administration. Unfortunately, the ability to derive accurate and informative insights requires more than the ability to execute machine learning models. Rather, a deeper understanding of the data on which the models are run is imperative for their success. While a significant effort has been undertaken to develop models able to process the volume of data obtained during the analysis of millions of digitalized patient records, it is important to remember that volume represents only one aspect of the data. In fact, drawing on data from an increasingly diverse set of sources, healthcare data presents an incredibly complex set of attributes that must be accounted for throughout the machine learning pipeline. This chapter focuses on highlighting such challenges, and is broken down into three distinct components, each representing a phase of the pipeline. We begin with attributes of the data accounted for during preprocessing, then move to considerations during model building, and end with challenges to the interpretation of model output. For each component, we present a discussion around data as it relates to the healthcare domain and offer insight into the challenges each may impose on the efficiency of machine learning techniques.
Social networks influence health-related behavior, such as obesity and smoking. While researchers have studied social networks as a driver for diffusion of influences and behavior, it is less understood how the structure or topology of the network, in itself, impacts an individual’s health behavior and wellness state. In this paper, we investigate whether the structure or topology of a social network offers additional insight and predictability on an individual’s health and wellness. We develop a method called the Network-Driven health predictor (NetCARE) that leverages features representative of social network structure. Using a large longitudinal data set of students enrolled in the NetHealth study at the University of Notre Dame, we show that the NetCARE method improves the overall prediction performance over the baseline models—that use demographics and physical attributes—by 38%, 65%, 55%, and 54% for the wellness states—stress, happiness, positive attitude, and self-assessed health—considered in this paper.
Despite proper sleep hygiene being critical to our health, guidelines for improving sleep habits often focus on only a single component, namely, sleep duration. Recent works, however, have brought to light the importance of another aspect of sleep: bedtime regularity, given its ties to cognitive and metabolic health outcomes. To further our understanding of this often-neglected component of sleep, the objective of this work was to investigate the association between bedtime regularity and resting heart rate (RHR): an important biomarker for cardiovascular health. Utilizing Fitbit Charge HRs to measure bedtimes, sleep and RHR, 255,736 nights of data were collected from a cohort of 557 college students. We observed that going to bed even 30 minutes later than one's normal bedtime was associated with a significantly higher RHR throughout sleep (Coeff +0.18; 95% CI: +0.11, +0.26 bpm), persisting into the following day and converging with one's normal RHR in the early evening. Bedtimes of at least 1 hour earlier were also associated with significantly higher RHRs throughout sleep; however, they converged with one's normal rate by the end of the sleep session, not extending into the following day. These observations stress the importance of maintaining proper sleep habits, beyond sleep duration, as high variability in bedtimes may be detrimental to one's cardiovascular health.
Background
Known colloquially as the “weekend effect,” the association between weekend admissions and increased mortality within hospital settings has become a highly contested topic over the last two decades. Drawing interest from practitioners and researchers alike, a sundry of works have emerged arguing for and against the presence of the effect across various patient cohorts. However, it has become evident that simply studying population characteristics is insufficient for understanding how the effect manifests. Rather, to truly understand the effect, investigations into its underlying factors must be considered. As such, the work presented in this manuscript serves to address this consideration by moving beyond identification of patient cohorts to examining the role of ICU performance.
Methods
Employing a comprehensive, publicly available database of electronic medical records (EMR), we began by utilizing multiple logistic regression to identify and isolate a specific cohort in which the weekend effect was present. Next, we leveraged the highly detailed nature of the EMR to evaluate ICU performance using well-established ICU quality scorecards to assess differences in clinical factors among patients admitted to an ICU on the weekend versus weekday.
Results
Our results demonstrate the weekend effect to be most prevalent among emergency surgery patients (OR 1.53; 95% CI 1.19, 1.96), specifically those diagnosed with circulatory diseases (
P
<.001). Differences between weekday and weekend admissions for this cohort included a variety of clinical factors such as ventilatory support and night-time discharges.
Conclusions
This work reinforces the importance of accounting for differences in clinical factors as well as patient cohorts in studies investigating the weekend effect.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.