BackgroundDiabetes case finding based on structured medical records does not fully identify diabetic patients whose medical histories related to diabetes are available in the form of free text. Manual chart reviews have been used but involve high labor costs and long latency.ObjectiveThis study developed and tested a Web-based diabetes case finding algorithm using both structured and unstructured electronic medical records (EMRs).MethodsThis study was based on the health information exchange (HIE) EMR database that covers almost all health facilities in the state of Maine, United States. Using narrative clinical notes, a Web-based natural language processing (NLP) case finding algorithm was retrospectively (July 1, 2012, to June 30, 2013) developed with a random subset of HIE-associated facilities, which was then blind tested with the remaining facilities. The NLP-based algorithm was subsequently integrated into the HIE database and validated prospectively (July 1, 2013, to June 30, 2014).ResultsOf the 935,891 patients in the prospective cohort, 64,168 diabetes cases were identified using diagnosis codes alone. Our NLP-based case finding algorithm prospectively found an additional 5756 uncodified cases (5756/64,168, 8.97% increase) with a positive predictive value of .90. Of the 21,720 diabetic patients identified by both methods, 6616 patients (6616/21,720, 30.46%) were identified by the NLP-based algorithm before a diabetes diagnosis was noted in the structured EMR (mean time difference = 48 days).ConclusionsThe online NLP algorithm was effective in identifying uncodified diabetes cases in real time, leading to a significant improvement in diabetes case finding. The successful integration of the NLP-based case finding algorithm into the Maine HIE database indicates a strong potential for application of this novel method to achieve a more complete ascertainment of diagnoses of diabetes mellitus.
IMPORTANCE How to appropriately care for patients who become PCR-negative for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is still not known.Patients who have recovered from coronavirus disease 2019 (COVID-19) could profoundly impact the health care system if a subset were to be PCR-positive again with reactivated SARS-CoV-2.OBJECTIVE To characterize a single center COVID-19 cohort with and without recurrence of PCR positivity, and develop an algorithm to identify patients at high risk of retest positivity after discharge to inform health care policy and case management decision-making.
DESIGN, SETTING, AND PARTICIPANTSA cohort of 414 patients with confirmed SARS-CoV-2 infection, at The Second Affiliated EXPOSURES Polymerase chain reaction (PCR) and IgM-IgG antibody confirmed SARS-CoV-2 infection. MAIN OUTCOMES AND MEASURES Univariable and multivariable statistical analysis of the clinical, laboratory, radiologic image, medical treatment, and clinical course of admission/quarantine/readmission data to develop an algorithm to predict patients at risk of recurrence of PCR positivity.RESULTS 16.7% (95CI: 13.0%-20.3%) patients retest PCR positive 1 to 3 times after discharge, despite being in strict quarantine. The driving factors in the recurrence prediction model included: age, BMI; lowest levels of the blood laboratory tests
Suicide is the tenth leading cause of death in the United States (US). An early-warning system (EWS) for suicide attempt could prove valuable for identifying those at risk of suicide attempts, and analyzing the contribution of repeated attempts to the risk of eventual death by suicide. In this study we sought to develop an EWS for high-risk suicide attempt patients through the development of a population-based risk stratification surveillance system. Advanced machine-learning algorithms and deep neural networks were utilized to build models with the data from electronic health records (EHRs). A final risk score was calculated for each individual and calibrated to indicate the probability of a suicide attempt in the following 1-year time period. Risk scores were subjected to individual-level analysis in order to aid in the interpretation of the results for health-care providers managing the at-risk cohorts. The 1-year suicide attempt risk model attained an area under the curve (AUC ROC) of 0.792 and 0.769 in the retrospective and prospective cohorts, respectively. The suicide attempt rate in the "very high risk" category was 60 times greater than the population baseline when tested in the prospective cohorts. Mental health disorders including depression, bipolar disorders and anxiety, along with substance abuse, impulse control disorders, clinical utilization indicators, and socioeconomic determinants were recognized as significant features associated with incident suicide attempt.
Aims: This study was undertaken to determine the retention of Salmonella through Alphitobius diaperinus metamorphosis and its contribution, through defecation, to external contamination.
Methods and Results: Insects were exposed to a tagged Salmonella enterica and evaluated for external elimination. (i) Each day for 3 weeks, a filter collected frass from a restrained insect for analysis. (ii) Exposed larvae in a closed container were followed through pupation, and newly emerged adults were examined for their retention of marker bacteria.
Conclusions: Exposed adults and larvae produced Salmonella‐positive frass for an average of 8 days, ranging from 6 to 11 days and 6 to 12 days, respectively. Nineteen per cent of the larvae carried Salmonella through metamorphosis and eclosion, with 5% of the pupal exuviae being positive as well.
Significance and Impact of the Study: Many sources of foodborne pathogens within the poultry production facilities, including reservoir populations, currently go unrecognized. This diminishes the ability of producers to mitigate the transfer of pathogens between animals, humans and the environment. Poultry management standards accept the reutilization of litter. Alphitobius diaperinus survive between flock rotations on the reutilized litter, and it was demonstrated in this study that the Salmonella they carry can survive with them.
Background
Lung cancer is the leading cause of cancer death worldwide. Early detection of individuals at risk of lung cancer is critical to reduce the mortality rate.
Objective
The aim of this study was to develop and validate a prospective risk prediction model to identify patients at risk of new incident lung cancer within the next 1 year in the general population.
Methods
Data from individual patient electronic health records (EHRs) were extracted from the Maine Health Information Exchange network. The study population consisted of patients with at least one EHR between April 1, 2016, and March 31, 2018, who had no history of lung cancer. A retrospective cohort (N=873,598) and a prospective cohort (N=836,659) were formed for model construction and validation. An Extreme Gradient Boosting (XGBoost) algorithm was adopted to build the model. It assigned a score to each individual to quantify the probability of a new incident lung cancer diagnosis from October 1, 2016, to September 31, 2017. The model was trained with the clinical profile in the retrospective cohort from the preceding 6 months and validated with the prospective cohort to predict the risk of incident lung cancer from April 1, 2017, to March 31, 2018.
Results
The model had an area under the curve (AUC) of 0.881 (95% CI 0.873-0.889) in the prospective cohort. Two thresholds of 0.0045 and 0.01 were applied to the predictive scores to stratify the population into low-, medium-, and high-risk categories. The incidence of lung cancer in the high-risk category (579/53,922, 1.07%) was 7.7 times higher than that in the overall cohort (1167/836,659, 0.14%). Age, a history of pulmonary diseases and other chronic diseases, medications for mental disorders, and social disparities were found to be associated with new incident lung cancer.
Conclusions
We retrospectively developed and prospectively validated an accurate risk prediction model of new incident lung cancer occurring in the next 1 year. Through statistical learning from the statewide EHR data in the preceding 6 months, our model was able to identify statewide high-risk patients, which will benefit the population health through establishment of preventive interventions or more intensive surveillance.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.