The Million Veteran Program (MVP) was established in 2011 as a national research initiative to determine how genetic variation influences the health of U.S. military veterans. We genotyped 312,571 MVP participants using a custom biobank array and linked the genetic data to laboratory and clinical phenotypes extracted from electronic health records covering a median of 10.0 years of follow-up. Among 297,626 veterans with at least 1 blood lipid measurement including 57,332 blacks and 24,743 Hispanics, we tested up to ~32 million variants for association with lipid levels and identified 118 novel genome-wide significant loci after meta-analysis with data from the Global Lipids Genetics Consortium (total N > 600,000). Through a focus on mutations predicted to result in a loss of gene function and a phenome-wide association study, we propose novel indications for pharmaceutical inhibitors targeting PCSK9 (abdominal aortic aneurysm), ANGPTL4 (type 2 diabetes), and PDE3B (triglycerides and coronary disease).
Large-scale multi-ethnic cohorts offer unprecedented opportunities to elucidate the genetic factors influencing complex traits related to health and disease among minority populations. At the same time, the genetic diversity in these cohorts presents new challenges for analysis and interpretation. We consider the utility of race and/or ethnicity categories in genome-wide association studies (GWASs) of multi-ethnic cohorts. We demonstrate that race/ethnicity information enhances the ability to understand population-specific genetic architecture. To address the practical issue that self-identified racial/ethnic information may be incomplete, we propose a machine learning algorithm that produces a surrogate variable, termed HARE. We use height as a model trait to demonstrate the utility of HARE and ethnicity-specific GWASs.
IMPORTANCE Data are limited regarding statin therapy for primary prevention of atherosclerotic cardiovascular disease (ASCVD) in adults 75 years and older.OBJECTIVE To evaluate the role of statin use for mortality and primary prevention of ASCVD in veterans 75 years and older. DESIGN, SETTING, AND PARTICIPANTSRetrospective cohort study that used Veterans Health Administration (VHA) data on adults 75 years and older, free of ASCVD, and with a clinical visit in 2002-2012. Follow-up continued through December 31, 2016. All data were linked to Medicare and Medicaid claims and pharmaceutical data. A new-user design was used, excluding those with any prior statin use. Cox proportional hazards models were fit to evaluate the association of statin use with outcomes. Analyses were conducted using propensity score overlap weighting to balance baseline characteristics.EXPOSURES Any new statin prescription. MAIN OUTCOMES AND MEASURESThe primary outcomes were all-cause and cardiovascular mortality. Secondary outcomes included a composite of ASCVD events (myocardial infarction, ischemic stroke, and revascularization with coronary artery bypass graft surgery or percutaneous coronary intervention). RESULTSOf 326 981 eligible veterans (mean [SD] age, 81.1 [4.1] years; 97% men; 91% white), 57 178 (17.5%) newly initiated statins during the study period. During a mean follow-up of 6.8 (SD, 3.9) years, a total 206 902 deaths occurred including 53 296 cardiovascular deaths, with 78.7 and 98.2 total deaths/1000 person-years among statin users and nonusers, respectively (weighted incidence rate difference [IRD]/1000 person-years, -19.5 [95% CI, -20.4 to -18.5]). There were 22.6 and 25.7 cardiovascular deaths per 1000 person-years among statin users and nonusers, respectively (weighted IRD/1000 person-years, -3.1 [95 CI, -3.6 to -2.6]). For the composite ASCVD outcome there were 123 379 events, with 66.3 and 70.4 events/1000 person-years among statin users and nonusers, respectively (weighted IRD/1000 person-years, -4.1 [95% CI, -5.1 to -3.0]). After propensity score overlap weighting was applied, the hazard ratio was 0.75 (95% CI, 0.74-0.76) for all-cause mortality, 0.80 (95% CI, 0.78-0.81) for cardiovascular mortality, and 0.92 (95% CI, 0.91-0.94) for a composite of ASCVD events when comparing statin users with nonusers.CONCLUSIONS AND RELEVANCE Among US veterans 75 years and older and free of ASCVD at baseline, new statin use was significantly associated with a lower risk of all-cause and cardiovascular mortality. Further research, including from randomized clinical trials, is needed to more definitively determine the role of statin therapy in older adults for primary prevention of ASCVD.
In this proof-of-concept study, we demonstrated application of the PheWAS using large EHR biobanks to inform drug effects. The findings of an association of the IL6R SNP with reduced risk for aortic aneurysms correspond with the newest indication for IL6R blockade, giant cell arteritis, of which a major complication is aortic aneurysm.
Phenotypes are the foundation for clinical and genetic studies of disease risk and outcomes. The growth of biobanks linked to electronic medical record (EMR) data has both facilitated and increased the demand for efficient, accurate, and robust approaches for phenotyping millions of patients. Challenges to phenotyping using EMR data include variation in the accuracy of codes, as well as the high level of manual input required to identify features for the algorithm and to obtain gold standard labels. To address these challenges, we developed PheCAP, a high-throughput semisupervised phenotyping pipeline. PheCAP begins with data from the EMR, including structured data and information extracted from the narrative notes using natural language processing (NLP). The standardized steps integrate automated procedures reducing the level of manual input, and machine learning approaches for algorithm training. PheCAP itself can be executed in 1-2 days if all data are available; however, the timing is largely dependent on the chart review stage which typically requires at least 2 weeks. The final products of PheCAP include a phenotype algorithm, the probability of the phenotype for all patients, and a phenotype classification (yes or no).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.