Mutations generate sequence diversity and provide a substrate for selection. The rate of de novo mutations is therefore of major importance to evolution. We conducted a study of genome-wide mutation rate by sequencing the entire genomes of 78 Icelandic parent-offspring trios at high coverage. Here we show that in our samples, with an average father’s age of 29.7 years, the average de novo mutation rate is 1.20 × 10⁻⁸ per nucleotide per generation. Most strikingly, the diversity in mutation rate of single-nucleotide polymorphisms (SNPs) is dominated by the age of the father at conception of the child. The effect is an increase of about 2 mutations per year of the father’s age. After accounting for random Poisson variation, father’s age is estimated to explain nearly all of the remaining variation in the de novo mutation counts. These observations shed light on the importance of the father’s age on the risk of diseases such as schizophrenia and autism.
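The two quantitative statements in this abstract (a roughly linear paternal age effect, and a variance share computed after removing Poisson noise) can be illustrated with a small sketch. This is not the authors' analysis: the function names are hypothetical, the slope of 2 mutations per year is the rounded figure quoted above, and the variance inputs in the usage note are made-up numbers chosen only to show the arithmetic.

```python
def extra_mutations(younger_age, older_age, per_year=2.0):
    """Expected difference in de novo mutation count between two paternal ages.

    Assumes the linear effect quoted in the abstract (about 2 per year).
    """
    return per_year * (older_age - younger_age)


def share_of_non_poisson_variance(total_var, mean_count, var_explained_by_age):
    """Fraction of the variance left after removing Poisson noise that age explains.

    For a Poisson count the sampling variance equals the mean, so the
    variance that any covariate could explain is total_var - mean_count.
    All three inputs are supplied by the caller; none come from the paper.
    """
    remaining = total_var - mean_count
    if remaining <= 0:
        raise ValueError("no variance left after removing the Poisson component")
    return var_explained_by_age / remaining


if __name__ == "__main__":
    # A 40-year-old father versus a 20-year-old father.
    print(extra_mutations(20, 40))
    # Illustrative (invented) variance figures, not data from the study.
    print(share_of_non_poisson_variance(100.0, 60.0, 38.0))
```

With the invented inputs in the usage lines, age would account for 38 of the 40 units of variance that remain once the Poisson component (equal to the mean count) is subtracted, which is the sense in which a covariate can explain "nearly all of the remaining variation".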
Determination of recombination rates across the human genome has been constrained by the limited resolution and accuracy of existing genetic maps and the draft genome sequence. We have genotyped 5,136 microsatellite markers for 146 families, with a total of 1,257 meiotic events, to build a high-resolution genetic map meant to: (i) improve the genetic order of polymorphic markers; (ii) improve the precision of estimates of genetic distances; (iii) correct portions of the sequence assembly and SNP map of the human genome; and (iv) build a map of recombination rates. Recombination rates are significantly correlated with both cytogenetic structures (staining intensity of G bands) and sequence (GC content, CpG motifs and poly(A)/poly(T) stretches). Maternal and paternal chromosomes show many differences in locations of recombination maxima. We detected systematic differences in recombination rates between mothers and between gametes from the same mother, suggesting that there is some underlying component determined by both genetic and environmental factors that affects maternal recombination rates.
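As a rough illustration of what a map of recombination rates contains, the sketch below converts a count of observed crossovers in an interval into a rate in centimorgans per megabase. It is a textbook approximation rather than the method used for this map: the function name is hypothetical, the crossover count and interval length in the usage line are invented, and only the 1,257 meioses come from the abstract.

```python
def recombination_rate_cm_per_mb(crossovers, meioses, interval_bp):
    """Approximate recombination rate of an interval in cM/Mb.

    Uses the small-interval approximation that genetic distance in
    centimorgans is 100 * (crossovers / meioses), which ignores double
    crossovers within the interval.
    """
    if meioses <= 0 or interval_bp <= 0:
        raise ValueError("meioses and interval_bp must be positive")
    genetic_cm = 100.0 * crossovers / meioses
    physical_mb = interval_bp / 1_000_000
    return genetic_cm / physical_mb


if __name__ == "__main__":
    # Invented example: 13 crossovers seen in a 1 Mb interval across 1,257 meioses.
    print(round(recombination_rate_cm_per_mb(13, 1257, 1_000_000), 3))
```

Computing the same quantity separately for maternal and paternal meioses is one simple way to expose the sex differences the abstract describes.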
The characterization of mutational processes that generate sequence diversity in the human genome is of paramount importance both to medical genetics and to evolutionary studies. To understand how the age and sex of transmitting parents affect de novo mutations, here we sequence 1,548 Icelanders, their parents, and, for a subset of 225, at least one child, to 35× genome-wide coverage. We find 108,778 de novo mutations, both single nucleotide polymorphisms and indels, and determine the parent of origin of 42,961. The number of de novo mutations from mothers increases by 0.37 per year of age (95% CI 0.32-0.43), a quarter of the 1.51 per year from fathers (95% CI 1.45-1.57). The number of clustered mutations increases faster with the mother's age than with the father's, and the genomic span of maternal de novo mutation clusters is greater than that of paternal ones. The types of de novo mutation from mothers change substantially with age, with a 0.26% (95% CI 0.19-0.33%) decrease in cytosine-phosphate-guanine to thymine-phosphate-guanine (CpG>TpG) de novo mutations and a 0.33% (95% CI 0.28-0.38%) increase in C>G de novo mutations per year, respectively. Remarkably, these age-related changes are not distributed uniformly across the genome. A striking example is a 20 megabase region on chromosome 8p, with a maternal C>G mutation rate that is up to 50-fold greater than the rest of the genome. The age-related accumulation of maternal non-crossover gene conversions also mostly occurs within these regions. Increased sequence diversity and linkage disequilibrium of C>G variants within regions affected by excess maternal mutations indicate that the underlying mutational process has persisted in humans for thousands of years. Moreover, the regional excess of C>G variation in humans is largely shared by chimpanzees, less by gorillas, and is almost absent from orangutans. This demonstrates that sequence diversity in humans results from evolving interactions between age, sex, mutation type, and genomic location.
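The per-year figures quoted in this abstract can be checked with a few lines of arithmetic. The sketch below only uses the two point estimates from the text (0.37 maternal, 1.51 paternal); the constant and function names are hypothetical, and it assumes the two effects are linear and simply add, which is an illustrative simplification rather than the paper's model.

```python
MATERNAL_PER_YEAR = 0.37  # point estimate quoted in the abstract
PATERNAL_PER_YEAR = 1.51  # point estimate quoted in the abstract


def maternal_to_paternal_ratio():
    """Ratio of the maternal to the paternal per-year increase."""
    return MATERNAL_PER_YEAR / PATERNAL_PER_YEAR


def extra_mutations(mother_years, father_years):
    """Expected additional de novo mutations when each parent is older by
    the given number of years, assuming linear and additive effects."""
    return MATERNAL_PER_YEAR * mother_years + PATERNAL_PER_YEAR * father_years


if __name__ == "__main__":
    # The ratio is close to 0.25, matching "a quarter" in the text.
    print(round(maternal_to_paternal_ratio(), 3))
    # Both parents ten years older.
    print(round(extra_mutations(10, 10), 1))
```

Under these assumptions, ten additional years for both parents would correspond to roughly 19 more de novo mutations, about four fifths of them paternal.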
Meiotic recombinations contribute to genetic diversity by yielding new combinations of alleles. Recently, high-resolution recombination maps were inferred from high-density single-nucleotide polymorphism (SNP) data using linkage disequilibrium (LD) patterns that capture historical recombination events. The use of these maps has been demonstrated by the identification of recombination hotspots and associated motifs, and the discovery that the PRDM9 gene affects the proportion of recombinations occurring at hotspots. However, these maps provide no information about individual or sex differences. Moreover, locus-specific demographic factors like natural selection can bias LD-based estimates of recombination rate. Existing genetic maps based on family data avoid these shortcomings, but their resolution is limited by relatively few meioses and a low density of markers. Here we used genome-wide SNP data from 15,257 parent-offspring pairs to construct the first recombination maps based on directly observed recombinations with a resolution that is effective down to 10 kilobases (kb). Comparing male and female maps reveals that about 15% of hotspots in one sex are specific to that sex. Although male recombinations result in more shuffling of exons within genes, female recombinations generate more new combinations of nearby genes. We discover novel associations between recombination characteristics of individuals and variants in the PRDM9 gene and we identify new recombination hotspots. Comparisons of our maps with two LD-based maps inferred from data of HapMap populations of Utah residents with ancestry from northern and western Europe (CEU) and Yoruba in Ibadan, Nigeria (YRI) reveal population differences previously masked by noise and map differences at regions previously described as targets of natural selection.
Key Points: Whole-genome sequencing of 11,262 Icelanders reveals that clonal hematopoiesis is very common in the elderly. Somatic mutation of some genes is strongly associated with clonal hematopoiesis, but in most cases, no driver mutations were evident.
Here we describe the insights gained from sequencing the whole genomes of 2,636 Icelanders to a median depth of 20×. We found 20 million SNPs and 1.5 million insertions-deletions (indels). We describe the density and frequency spectra of sequence variants in relation to their functional annotation, gene position, pathway and conservation score. We demonstrate an excess of homozygosity and rare protein-coding variants in Iceland. We imputed these variants into 104,220 individuals down to a minor allele frequency of 0.1% and found a recessive frameshift mutation in MYL4 that causes early-onset atrial fibrillation, several mutations in ABCB4 that increase risk of liver diseases and an intronic variant in GNAS associating with increased thyroid-stimulating hormone levels when maternally inherited. These data provide a study design that can be used to determine how variation in the sequence of the human genome gives rise to human diversity.
Familial clustering studies indicate that breast cancer risk has a substantial genetic component. To identify new breast cancer risk variants, we genotyped approximately 300,000 SNPs in 1,600 Icelandic individuals with breast cancer and 11,563 controls using the Illumina Hap300 platform. We then tested selected SNPs in five replication sample sets. Overall, we studied 4,554 affected individuals and 17,577 controls. Two SNPs consistently associated with breast cancer: approximately 25% of individuals of European descent are homozygous for allele A of rs13387042 on chromosome 2q35 and have an estimated 1.44-fold greater risk than noncarriers, and for allele T of rs3803662 on 16q12, about 7% are homozygous and have a 1.64-fold greater risk. Risk from both alleles was confined to estrogen receptor-positive tumors. At present, no genes have been identified in the linkage disequilibrium block containing rs13387042. rs3803662 is near the 5' end of TNRC9, a high mobility group chromatin-associated protein whose expression is implicated in breast cancer metastasis to bone.
The effects of susceptibility variants may depend on which parent they are inherited from. While many associations between sequence variants and human traits have been discovered through genome-wide association studies, the impact of parental origin has largely been ignored. Combining genealogy with long-range phasing, we demonstrate that for 38,167 Icelanders genotyped using SNP chips, the parental origin of most alleles can be determined. We then focused on SNPs that associate with diseases and are within 500 kb of known imprinted genes. Seven independent SNP associations were examined. Five, one each with breast cancer and basal cell carcinoma, and three with type 2 diabetes (T2D), exhibit parental-origin-specific associations. These variants are located in two genomic regions, 11p15 and 7q32, each harbouring a cluster of imprinted genes. Furthermore, a novel variant rs2334499 at 11p15 was seen to associate with T2D where the allele that confers risk when paternally inherited is protective when maternally transmitted. We identified a differentially methylated CTCF binding site at 11p15 and demonstrated correlation of rs2334499 with decreased methylation of that site.