Deciphering the contribution of genetic instability in somatic cells is critical to our understanding of many human disorders. Myotonic dystrophy type 1 (DM1) is one such disorder that is caused by the expansion of a CTG repeat that shows extremely high levels of somatic instability. This somatic instability has compromised attempts to measure intergenerational repeat dynamics and infer genotype-phenotype relationships. Using single-molecule PCR, we have characterized more than 17 000 de novo somatic mutations from a large cohort of DM1 patients. These data reveal that the estimated progenitor allele length is the major modifier of age of onset. We find no evidence for a threshold above which repeat length does not contribute toward age at onset, suggesting pathogenesis is not constrained to a simple molecular switch such as nuclear retention of the DMPK transcript or haploinsufficiency for DMPK and/or SIX5. Importantly, we also show that age at onset is further modified by the level of somatic instability; patients in whom the repeat expands more rapidly, develop the symptoms earlier. These data establish a primary role for somatic instability in DM1 severity, further highlighting it as a therapeutic target. In addition, we show that the level of instability is highly heritable, implying a role for individual-specific trans-acting genetic modifiers. Identifying these trans-acting genetic modifiers will facilitate the formulation of novel therapies that curtail the accumulation of somatic expansions and may provide clues to the role these factors play in the development of cancer, aging and inherited disease in the general population.
The mismatch repair gene MSH3 has been implicated as a genetic modifier of the CAG·CTG repeat expansion disorders Huntington’s disease and myotonic dystrophy type 1. A recent Huntington’s disease genome-wide association study found rs557874766, an imputed single nucleotide polymorphism located within a polymorphic 9 bp tandem repeat in MSH3/DHFR, as the variant most significantly associated with progression in Huntington’s disease. Using Illumina sequencing in Huntington’s disease and myotonic dystrophy type 1 subjects, we show that rs557874766 is an alignment artefact, the minor allele for which corresponds to a three-repeat allele in MSH3 exon 1 that is associated with a reduced rate of somatic CAG·CTG expansion (P = 0.004) and delayed disease onset (P = 0.003) in both Huntington’s disease and myotonic dystrophy type 1, and slower progression (P = 3.86 × 10−7) in Huntington’s disease. RNA-Seq of whole blood in the Huntington’s disease subjects found that repeat variants are associated with MSH3 and DHFR expression. A transcriptome-wide association study in the Huntington’s disease cohort found increased MSH3 and DHFR expression are associated with disease progression. These results suggest that variation in the MSH3 exon 1 repeat region influences somatic expansion and disease phenotype in Huntington’s disease and myotonic dystrophy type 1, and suggests a common DNA repair mechanism operates in both repeat expansion diseases.
Several human genetic diseases are associated with inheriting an abnormally large unstable DNA simple sequence repeat. These sequences mutate, by changing the number of repeats, many times during the lifetime of those affected, with a bias towards expansion. These somatic changes lead not only to the presence of cells with different numbers of repeats in the same tissue, but also produce increasingly longer repeats, contributing towards the progressive nature of the symptoms. Modelling the progression of repeat length throughout the lifetime of individuals has potential for improving prognostic information as well as providing a deeper understanding of the underlying biological process. A large data set comprising blood DNA samples from individuals with one such disease, myotonic dystrophy type 1, provides an opportunity to parameterize a mathematical model for repeat length evolution that we can use to infer biological parameters of interest. We developed new mathematical models by modifying a proposed stochastic birth process to incorporate possible contraction. A hierarchical Bayesian approach was used as the basis for inference, and we estimated the distribution of mutation rates in the population. We used model comparison analysis to reveal, for the first time, that the expansion bias observed in the distributions of repeat lengths is likely to be the cumulative effect of many expansion and contraction events. We predict that mutation events can occur as frequently as every other day, which matches the timing of regular cell activities such as DNA repair and transcription but not DNA replication.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.