UK Biobank has released the whole-exome sequencing (WES) data for 200,000 participants, but the best practices remain unclear for rare variant tests, and an existing approach, SAIGE-GENE, can have inflated type I error rates with high computation cost. Here, we propose SAIGE-GENE+ with greatly improved type I error control and computational efficiency compared to SAIGE-GENE. In the analysis of UKBB WES data of 30 quantitative and 141 binary traits, SAIGE-GENE+ identified 551 gene-phenotype associations. In addition, we showed that incorporating multiple MAF cutoffs and functional annotations can help identify novel gene-phenotype associations and SAIGE-GENE+ can facilitate this.
There are over 15,000 known variants that cause human inherited disease by disrupting RNA splicing. While several in silico methods such as CADD, EIGEN and LINSIGHT are commonly used to predict the pathogenicity of noncoding variants, we introduce S-CAP, a tool developed specially for splicing which is better able to effectively distinguish pathogenic splicing-relevant variants from benign variants. S-CAP is a novel splicing pathogenicity predictor that reduces the number of splicing-relevant variants of uncertain significance in patient exomes by 41%, a nearly 3-fold improvement over existing noncoding pathogenicity measures while correctly classifying known pathogenic splicing-relevant variants with a clinical-grade 95% sensitivity.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.