Abstract:We have designed a Biobank Portal that lets researchers request Biobank samples and genotypic data, query associated electronic health records, and design and download datasets containing de-identified attributes about consented Biobank subjects. This do-it-yourself functionality puts a wide variety and volume of data at the fingertips of investigators, allowing them to create custom datasets for their clinical and genomic research from complex phenotypic data and quickly obtain corresponding samples and genom… Show more
“…We conduct simulation studies using the UK Biobank genetic data 13,14 , and demonstrate that PRS-CS dramatically improves the predictive performance of PRS over existing methods across a wide range of genetic architectures, especially when the training sample size is large. We apply PRS-CS to predict six curated common complex diseases (breast cancer, coronary artery disease, depression, inflammatory bowel disease, rheumatoid arthritis, and type 2 diabetes mellitus) and six quantitative traits (height, body mass index, high-density lipoproteins, low-density lipoproteins, cholesterol, and triglycerides) in the Partners HealthCare Biobank 15 , and further demonstrate the potential of PRS-CS for the clinical translation of polygenic prediction.…”
Polygenic prediction has shown promise in identifying individuals at high risk for complex diseases, and may become clinically useful as the predictive performance of polygenic risk scores (PRS) improves. Here, we present PRS-CS, a novel polygenic prediction method that infers posterior SNP effect sizes using GWAS summary statistics and an external linkage disequilibrium (LD) reference panel. PRS-CS utilizes a highdimensional Bayesian regression framework, and is distinct from previous work by placing a continuous shrinkage (CS) prior on SNP effect sizes, which is robust to varying genetic architectures, provides substantial computational advantages, and enables multivariate modeling of local LD patterns. Simulation studies using data from the UK Biobank show that PRS-CS outperforms existing methods across a wide range of effect size distributions, especially when the training sample size is large. We apply PRS-CS to predict six complex diseases and six quantitative traits in the Partners HealthCare Biobank, and further demonstrate the improvement of PRS-CS in prediction accuracy over alternative methods.
“…We conduct simulation studies using the UK Biobank genetic data 13,14 , and demonstrate that PRS-CS dramatically improves the predictive performance of PRS over existing methods across a wide range of genetic architectures, especially when the training sample size is large. We apply PRS-CS to predict six curated common complex diseases (breast cancer, coronary artery disease, depression, inflammatory bowel disease, rheumatoid arthritis, and type 2 diabetes mellitus) and six quantitative traits (height, body mass index, high-density lipoproteins, low-density lipoproteins, cholesterol, and triglycerides) in the Partners HealthCare Biobank 15 , and further demonstrate the potential of PRS-CS for the clinical translation of polygenic prediction.…”
Polygenic prediction has shown promise in identifying individuals at high risk for complex diseases, and may become clinically useful as the predictive performance of polygenic risk scores (PRS) improves. Here, we present PRS-CS, a novel polygenic prediction method that infers posterior SNP effect sizes using GWAS summary statistics and an external linkage disequilibrium (LD) reference panel. PRS-CS utilizes a highdimensional Bayesian regression framework, and is distinct from previous work by placing a continuous shrinkage (CS) prior on SNP effect sizes, which is robust to varying genetic architectures, provides substantial computational advantages, and enables multivariate modeling of local LD patterns. Simulation studies using data from the UK Biobank show that PRS-CS outperforms existing methods across a wide range of effect size distributions, especially when the training sample size is large. We apply PRS-CS to predict six complex diseases and six quantitative traits in the Partners HealthCare Biobank, and further demonstrate the improvement of PRS-CS in prediction accuracy over alternative methods.
“…We drew on three cohorts of patients seen in the Brigham and Women's Hospital network and the Massachusetts General Hospital network, representing the first 15,064 individuals genotyped as part of the Partners HealthCare Biobank initiative (10). These individuals provided informed consent for their electronic health records (EHRs) to be examined in investigations approved by the Partners Institutional Review Board, and provided blood samples for DNA extraction.…”
Section: Cohort Derivation and Genotypingmentioning
“…We drew on three waves of participants in the Partners Biobank from the Brigham and Women's Hospital network as well as the Massachusetts General Hospital network, representing the first ∼15,000 individuals genotyped as part of the Partners HealthCare Biobank initiative . Structured data, including sociodemographic data as well as inpatient and outpatient diagnostic codes (International Classification of Diseases, Ninth Revision [ICD9]), were extracted from the longitudinal electronic health record (EHR) of these two academic medical centers . We included individuals age 18 years or older with biobank data, and at least one of the following ICD9 codes representing nonunion (733.81 or 733.82) (identifying cases), or fracture of upper (810–819) or lower limb (820–829) in the absence of nonunion (identifying controls).…”
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.