SummaryBackgroundTo effectively prevent, detect, and treat health conditions that affect people during their lifecourse, health-care professionals and researchers need to know which sections of the population are susceptible to which health conditions and at which ages. Hence, we aimed to map the course of human health by identifying the 50 most common health conditions in each decade of life and estimating the median age at first diagnosis.MethodsWe developed phenotyping algorithms and codelists for physical and mental health conditions that involve intensive use of health-care resources. Individuals older than 1 year were included in the study if their primary-care and hospital-admission records met research standards set by the Clinical Practice Research Datalink and they had been registered in a general practice in England contributing up-to-standard data for at least 1 year during the study period. We used linked records of individuals from the CALIBER platform to calculate the sex-standardised cumulative incidence for these conditions by 10-year age groups between April 1, 2010, and March 31, 2015. We also derived the median age at diagnosis and prevalence estimates stratified by age, sex, and ethnicity (black, white, south Asian) over the study period from the primary-care and secondary-care records of patients.FindingsWe developed case definitions for 308 disease phenotypes. We used records of 2 784 138 patients for the calculation of cumulative incidence and of 3 872 451 patients for the calculation of period prevalence and median age at diagnosis of these conditions. Conditions that first gained prominence at key stages of life were: atopic conditions and infections that led to hospital admission in children (<10 years); acne and menstrual disorders in the teenage years (10–19 years); mental health conditions, obesity, and migraine in individuals aged 20–29 years; soft-tissue disorders and gastro-oesophageal reflux disease in individuals aged 30–39 years; dyslipidaemia, hypertension, and erectile dysfunction in individuals aged 40–59 years; cancer, osteoarthritis, benign prostatic hyperplasia, cataract, diverticular disease, type 2 diabetes, and deafness in individuals aged 60–79 years; and atrial fibrillation, dementia, acute and chronic kidney disease, heart failure, ischaemic heart disease, anaemia, and osteoporosis in individuals aged 80 years or older. Black or south-Asian individuals were diagnosed earlier than white individuals for 258 (84%) of the 308 conditions. Bone fractures and atopic conditions were recorded earlier in male individuals, whereas female individuals were diagnosed at younger ages with nutritional anaemias, tubulointerstitial nephritis, and urinary disorders.InterpretationWe have produced the first chronological map of human health with cumulative-incidence and period-prevalence estimates for multiple morbidities in parallel from birth to advanced age. This can guide clinicians, policy makers, and researchers on how to formulate differential diagnoses, allocate resources, and ta...
Objective Electronic health records (EHRs) are a rich source of information on human diseases, but the information is variably structured, fragmented, curated using different coding systems, and collected for purposes other than medical research. We describe an approach for developing, validating, and sharing reproducible phenotypes from national structured EHR in the United Kingdom with applications for translational research. Materials and Methods We implemented a rule-based phenotyping framework, with up to 6 approaches of validation. We applied our framework to a sample of 15 million individuals in a national EHR data source (population-based primary care, all ages) linked to hospitalization and death records in England. Data comprised continuous measurements (for example, blood pressure; medication information; coded diagnoses, symptoms, procedures, and referrals), recorded using 5 controlled clinical terminologies: (1) read (primary care, subset of SNOMED-CT [Systematized Nomenclature of Medicine Clinical Terms]), (2) International Classification of Diseases–Ninth Revision and Tenth Revision (secondary care diagnoses and cause of mortality), (3) Office of Population Censuses and Surveys Classification of Surgical Operations and Procedures, Fourth Revision (hospital surgical procedures), and (4) DM+D prescription codes. Results Using the CALIBER phenotyping framework, we created algorithms for 51 diseases, syndromes, biomarkers, and lifestyle risk factors and provide up to 6 validation approaches. The EHR phenotypes are curated in the open-access CALIBER Portal (https://www.caliberresearch.org/portal) and have been used by 40 national and international research groups in 60 peer-reviewed publications. Conclusions We describe a UK EHR phenomics approach within the CALIBER EHR data platform with initial evidence of validity and use, as an important step toward international use of UK EHR data for health research.
Lack of efficacy in the intended disease indication is the major cause of clinical phase drug development failure. Explanations could include the poor external validity of pre-clinical (cell, tissue, and animal) models of human disease and the high false discovery rate (FDR) in preclinical science. FDR is related to the proportion of true relationships available for discovery (γ), and the type 1 (false-positive) and type 2 (false negative) error rates of the experiments designed to uncover them. We estimated the FDR in preclinical science, its effect on drug development success rates, and improvements expected from use of human genomics rather than preclinical studies as the primary source of evidence for drug target identification. Calculations were based on a sample space defined by all human diseases – the ‘disease-ome’ – represented as columns; and all protein coding genes – ‘the protein-coding genome’– represented as rows, producing a matrix of unique gene- (or protein-) disease pairings. We parameterised the space based on 10,000 diseases, 20,000 protein-coding genes, 100 causal genes per disease and 4000 genes encoding druggable targets, examining the effect of varying the parameters and a range of underlying assumptions, on the inferences drawn. We estimated γ, defined mathematical relationships between preclinical FDR and drug development success rates, and estimated improvements in success rates based on human genomics (rather than orthodox preclinical studies). Around one in every 200 protein-disease pairings was estimated to be causal (γ = 0.005) giving an FDR in preclinical research of 92.6%, which likely makes a major contribution to the reported drug development failure rate of 96%. Observed success rate was only slightly greater than expected for a random pick from the sample space. Values for γ back-calculated from reported preclinical and clinical drug development success rates were also close to the a priori estimates. Substituting genome wide (or druggable genome wide) association studies for preclinical studies as the major information source for drug target identification was estimated to reverse the probability of late stage failure because of the more stringent type 1 error rate employed and the ability to interrogate every potential druggable target in the same experiment. Genetic studies conducted at much larger scale, with greater resolution of disease end-points, e.g. by connecting genomics and electronic health record data within healthcare systems has the potential to produce radical improvement in drug development success rate.
BackgroundVitamin D is essential for a wide range of physiological processes including immune function and calcium homeostasis. Recent investigations have identified candidate genes which are strongly linked to concentrations of 25-hydroxyvitamin D. Since there is insufficient UVB radiation to induce year-round cutaneous synthesis of vitamin D at latitudes distant from the equator it is likely that these genes were subject to forces of natural selection. We used the fixation index (FST) to measure differences in allele frequencies in 993 individuals from ten populations to identify the presence of evolutionary selection in genes in the vitamin D pathway. We then explored the length of haplotypes in chromosomes to confirm recent positive selection.ResultsWe find evidence of positive selection for DHCR7, which governs availability of 7-dehydrocholesterol for conversion to vitamin D3 by the action of sunlight on the skin. We show that extended haplotypes related to vitamin D status are highly prevalent at Northern latitudes (Europe 0.72, Northeast Asia 0.41). The common DHCR7 haplotype underwent a recent selective sweep in Northeast Asia, with relative extended haplotype homozygosity of 5.03 (99th percentile). In contrast, CYP2R1, which 25-hydroxylates vitamin D, is under balancing selection and we found no evidence of recent selection pressure on GC, which is responsible for vitamin D transport.ConclusionsOur results suggest that genetic variation in DHCR7 is the major adaptation affecting vitamin D metabolism in recent evolutionary history which helped early humans to avoid severe vitamin D deficiency and enabled them to inhabit areas further from the equator.
Severe preeclampsia is a common cause of maternal and perinatal morbidity worldwide. The disease clusters in families; however, individual genetic studies have produced inconsistent results. We conducted a review to examine relationships between maternal genotype and severe preeclampsia. We searched the MEDLINE and Embase databases for prospective and retrospective cohort and case-control studies reporting associations between genes and severe preeclampsia. Four reviewers independently undertook study selection, quality assessment, and data extraction. We performed random-effects meta-analyses by genotype and predefined functional gene group (thrombophilic, vasoactive, metabolic, immune, and cell signalling). Fifty-seven studies evaluated 50 genotypes in 5,049 cases and 16,989 controls. Meta-analysis showed a higher risk of severe preeclampsia with coagulation factor V gene (proaccelerin, labile factor) (F5) polymorphism rs6025 (odds ratio = 1.90, 95% confidence interval: 1.42, 2.54; 23 studies, I(2) = 29%), coagulation factor II (thrombin) gene (F2) mutation G20210A (rs1799963) (odds ratio = 2.01, 95% confidence interval: 1.14, 3.55, 9 studies, I(2) = 0%), leptin receptor gene (LEPR) polymorphism rs1137100 (odds ratio = 1.75, 95% confidence interval: 1.15, 2.65; 2 studies, I(2) = 0%), and the thrombophilic gene group (odds ratio = 1.87, 95% confidence interval: 1.43, 2.45, I(2) = 27%). There were no associations with other gene groups. There was moderate heterogeneity between studies and potential for bias from poor-quality genotyping and inconsistent definition of phenotype. Further studies with robust methods should investigate genetic factors that might potentially be used to stratify pregnancies according to risk of complications.
ObjectivesTo understand views of pharmacy advisers about smoker recruitment and retention in the National Health Service community pharmacy stop smoking programme.DesignThematic framework analysis of semistructured, in-depth interviews applying the Theoretical Domains Framework and COM-B behaviour change model. We aimed to identify aspects of adviser behaviour that might be modified to increase numbers joining and completing the programme.Participants25 stop smoking advisers (13 pharmacists and 12 support staff).Setting29 community pharmacies in 3 inner east London boroughs.ResultsAdvisers had preconceived ideas about smokers likely to join or drop out and made judgements about smokers' readiness to quit. Actively recruiting smokers was accorded low priority due in part to perceived insufficient remuneration to the pharmacy and anticipated challenging interactions with smokers. Suggestions to improve smoker recruitment and retention included developing a more holistic and supportive approach using patient-centred communication. Training counter assistants were seen to be important as was flexibility to extend the programme duration to fit better with smokers’ needs.ConclusionsCessation advisers feel they lack the interpersonal skills necessary to engage well with smokers and help them to quit. Addressing advisers' behaviours about active engagement and follow-up of clients, together with regular skills training including staff not formally trained as cessation advisers, could potentially boost numbers recruited and retained in the stop smoking programme. Adjustments to the pharmacy remuneration structure to incentivise recruitment and to allow personalisation of the programme for individual smokers should also be considered.
Reducing the burden of late-life morbidity requires an understanding of the mechanisms of ageing-related diseases (ARDs), defined as diseases that accumulate with increasing age. This has been hampered by the lack of formal criteria to identify ARDs. Here, we present a framework to identify ARDs using two complementary methods consisting of unsupervised machine learning and actuarial techniques, which we applied to electronic health records (EHRs) from 3,009,048 individuals in England using primary care data from the Clinical Practice Research Datalink (CPRD) linked to the Hospital Episode Statistics admitted patient care dataset between 1 April 2010 and 31 March 2015 (mean age 49.7 years (s.d. 18.6), 51% female, 70% white ethnicity). We grouped 278 high-burden diseases into nine main clusters according to their patterns of disease onset, using a hierarchical agglomerative clustering algorithm. Four of these clusters, encompassing 207 diseases spanning diverse organ systems and clinical specialties, had rates of disease onset that clearly increased with chronological age. However, the ages of onset for these four clusters were strikingly different, with median age of onset 82 years (IQR 82–83) for Cluster 1, 77 years (IQR 75–77) for Cluster 2, 69 years (IQR 66–71) for Cluster 3 and 57 years (IQR 54–59) for Cluster 4. Fitting to ageing-related actuarial models confirmed that the vast majority of these 207 diseases had a high probability of being ageing-related. Cardiovascular diseases and cancers were highly represented, while benign neoplastic, skin and psychiatric conditions were largely absent from the four ageing-related clusters. Our framework identifies and clusters ARDs and can form the basis for fundamental and translational research into ageing pathways.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.