As part of a broader collaborative network of exome sequencing studies, we developed a jointly called data set of 5,685 Ashkenazi Jewish exomes. We make publicly available a resource of site and allele frequencies, which should serve as a reference for medical genetics in the Ashkenazim (hosted in part at https://ibd.broadinstitute.org, also available in gnomAD at http://gnomad.broadinstitute.org). We estimate that 34% of protein-coding alleles present in the Ashkenazi Jewish population at frequencies greater than 0.2% are significantly more frequent (mean 15-fold) than their maximum frequency observed in other reference populations. Arising via a well-described founder effect approximately 30 generations ago, this catalog of enriched alleles can contribute to differences in genetic risk and overall prevalence of diseases between populations. As validation we document 148 AJ enriched protein-altering alleles that overlap with "pathogenic" ClinVar alleles (table available at https://github.com/macarthur-lab/clinvar/blob/master/output/clinvar.tsv), including those that account for 10–100 fold differences in prevalence between AJ and non-AJ populations of some rare diseases, especially recessive conditions, including Gaucher disease (GBA, p.Asn409Ser, 8-fold enrichment); Canavan disease (ASPA, p.Glu285Ala, 12-fold enrichment); and Tay-Sachs disease (HEXA, c.1421+1G>C, 27-fold enrichment; p.Tyr427IlefsTer5, 12-fold enrichment). We next sought to use this catalog, of well-established relevance to Mendelian disease, to explore Crohn's disease, a common disease with an estimated two to four-fold excess prevalence in AJ. We specifically attempt to evaluate whether strong acting rare alleles, particularly protein-truncating or otherwise large effect-size alleles, enriched by the same founder-effect, contribute excess genetic risk to Crohn's disease in AJ, and find that ten rare genetic risk factors in NOD2 and LRRK2 are enriched in AJ (p < 0.005), including several novel contributing alleles, show evidence of association to CD. Independently, we find that genomewide common variant risk defined by GWAS shows a strong difference between AJ and non-AJ European control population samples (0.97 s.d. higher, p<10−16). Taken together, the results suggest coordinated selection in AJ population for higher CD risk alleles in general. The results and approach illustrate the value of exome sequencing data in case-control studies along with reference data sets like ExAC (sites VCF available via FTP at ftp.broadinstitute.org/pub/ExAC_release/release0.3/) to pinpoint genetic variation that contributes to variable disease predisposition across populations.
BackgroundLarge databases permit quantitative description of genes in terms of intolerance to loss of function (‘haploinsufficiency’) and prevalence of missense variants. We explored these parameters in inherited retinal disease (IRD) genes.MethodsIRD genes (from the ‘RetNet’ resource) were classified by probability of loss of function intolerance (pLI) using online Genome Aggregation Database (gnomAD) and DatabasE of genomiC varIation and Phenotype in Humans using Ensembl Resources (DECIPHER) databases. Genes were identified having pLI ≥0.9 together with one or both of the following: upper bound of CI <0.35 for observed to expected (o/e) ratio of loss of function variants in the gnomAD resource; haploinsufficiency score <10 in the DECIPHER resource. IRD genes in which missense variants appeared under-represented or over-represented (Z score for o/e ratio of <−2.99 or >2.99, respectively) were also identified. The genes were evaluated in the gene ontology Protein Analysis THrough Evolutionary Relationships (PANTHER) resource.ResultsOf 280 analysed genes, 39 (13.9%) were predicted loss of function intolerant. A greater proportion of X-linked than autosomal IRD genes fulfilled these criteria, as expected. Most autosomal genes were associated with dominant disease. PANTHER analysis showed >100 fold enrichment of spliceosome tri-snRNP complex assembly. Most encoded proteins were longer than the median length in the UniProt database. Fourteen genes (11 of which were in the ‘haploinsufficient’ group) showed under-representation of missense variants. Six genes (SAMD11, ALMS1, WFS1, RP1L1, KCNV2, ADAMTS18) showed over-representation of missense variants.ConclusionA minority of IRD-associated genes appear to be ‘haploinsufficient’. Over-representation of spliceosome pathways was observed. When interpreting genetic tests, variants found in genes with over-representation of missense variants should be interpreted with caution.
As part of a broader collaborative network of exome sequencing studies, we developed a jointly called data set of 5,685 Ashkenazi Jewish exomes. We make publicly available a resource of site and allele frequencies, which should serve as a reference for medical genetics in the Ashkenazim. We estimate that 30% of proteincoding alleles present in the Ashkenazi Jewish population at frequencies greater than 0.2% are significantly more frequent (mean 7.6fold) than their maximum frequency observed in other reference populations. Arising via a welldescribed founder effect, this catalog of enriched alleles can contribute to differences in genetic risk and overall prevalence of diseases between populations. As validation we document 151 AJ enriched proteinaltering alleles that overlap with "pathogenic
Purpose: To identify, using genome sequencing (GS), likely pathogenic non-coding variants in inherited retinal dystrophy (IRD) genes Methods: Patients with IRD were recruited to the study and underwent comprehensive ophthalmological evaluation and GS. The results of GS were investigated through virtual gene panel analysis and plausible pathogenic variants and clinical phenotype evaluated by multi-disciplinary team (MDT) discussion. For unsolved patients in whom a specific gene was suspected to harbour a missed pathogenic variant, targeted re-analysis of non-coding regions was performed on GS data. Candidate variants were functionally tested including by mRNA analysis, minigene and luciferase reporter assays. Results: Previously unreported, likely pathogenic, non-coding variants, in 7 genes (PRPF31, NDP, IFT140, CRB1, USH2A, BBS10, and GUCY2D), were identified in 11 patients. These were shown to lead to mis-splicing (PRPF31, IFT140, CRB1, USH2A) or altered transcription levels (BBS10, GUCY2D). Conclusion: MDT-led, phenotype driven, non-coding variant re-analysis of GS is effective in identifying missing causative alleles.
Purpose Autosomal dominant cone rod dystrophy 7 (CORD7) was initially linked to the gene RIMS1 and reported in a 4-generation British family in 1998. The purpose of this study was to investigate the legitimacy of this association, and to correctly characterize the genetic cause of this condition. Methods The allele frequency of RIMS1 c.2459G>A, p.Arg820His, was investigated in the Genomes Aggregation Dataset (gnomAD) datasets and whole genome sequencing (WGS) was performed for 4 members of the CORD7 family with filtering of rare pathogenic variants in a virtual gene panel comprising all genes known to be associated with inherited retinal dystrophy (IRD). Cytogenetic analysis was performed to rule out interchromosomal translocation. Results RIMS1 p.Arg820His has a maximal carrier frequency of >1:5000 in Europeans. A previously well-characterized PROM1 variant: c.1118C>T, p.Arg373Cys, was detected in 9 affected members of the CORD7 family who underwent WGS or direct sequencing. One affected family member is now known to have macular dystrophy in the absence of RIMS1 p.Arg820His. Clinical analysis of affected family members and 27 individuals with retinopathy associated with the same – PROM1 – variant showed consistent phenotypes. Conclusions The case for pathogenicity of RIMS1 p.Arg820His is not strong based on its presence on 10 alleles in the gnomAD dataset and absence from additional CORD affected individuals. The finding of a known pathogenic variant in PROM1 correlates well with the phenotypic characteristics of the affected individuals, and is likely to account for the condition. Clear evidence of association between RIMS1 and a retinal dystrophy is yet to be described.
Background Family studies support a genetic predisposition to inflammatory bowel diseases (IBD), but known genetic variants only partially explain the disease heritability. Families with multiple affected individuals potentially harbour rare and high-impact causal variants. Long regions of homozygosity due to recent inbreeding may increase the risk of individuals bearing homozygous loss-of-function variants. This study aimed to identify rare and homozygous genetic variants contributing to IBD. Methods Four families with known consanguinity and multiple cases of IBD were recruited. In a family-specific analysis, we utilised homozygosity mapping complemented by whole-exome sequencing. Results We detected a single region of homozygosity shared by Crohn's disease cases from a family of Druze ancestry, spanning 2.6 Mb containing the NOD2 gene. Whole-exome sequencing did not identify any potentially damaging variants within the region, suggesting that non-coding variation may be involved. In addition, affected individuals in the families harboured several rare and potentially damaging homozygous variants in genes with a role in autophagy and innate immunity including LRRK1, WHAMM, DENND3, and C5. Conclusion This study examined the potential contribution of rare, high-impact homozygous variants in consanguineous families with IBD. While the analysis was not designed to achieve statistical significance, our findings highlight genes or loci that warrant further research. Non-coding variants affecting NOD2 may be of importance in Druze patients with Crohn's disease.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.