A main assumption of molecular population genetics is that genomic mutation rate does not depend on sequence function. Challenging this assumption, a recent study has found a reduction in the mutation rate in exons compared to introns in somatic cells. This reduction was ascribed to an enhanced exonic mismatch repair system activity. If this reduction happens also in the germline, it can compromise studies of population genomics, including the detection of the footprint of selection when using introns as proxies of neutrality. Here we compiled and analyzed published germline de novo mutation (DNM) data to test if the exonic mutation rate is also reduced in germ cells. We detected ascertainment bias in studies using DNM data from diseased probands and investigated the impact of extended nucleotide context on de novo mutation rate. After controlling for these factors, we found no reduction in the mutation rate in exons compared to introns in the germline genome, in contrast to what has been previously described in somatic cells. Therefore, there is no evidence of an enhanced mismatch repair system activity in exons with respect to adjacent introns in germline cells.
1One of the most general and widely accepted predictions of the neutral theory of molecular evolution is that "the more sequence conservation, the more functional (selective) constraint on the sequence" [1]. This principle explains why different functional regions in the genome have different levels of polymorphism and divergence, such as the lower variation at nonsynonymous vs synonymous sites in protein-coding genes or in exonic vs intronic sequences [2]. This relationship between constraint and variation constitutes one of the most powerful approaches in the current search for functional regions in the genome and the detection of natural selection at the molecular level. An integral part of estimating constraint, or purifying selection, on functional genomic regions is the comparison of the observed number of mutations to the expectation under neutral evolution. In genes, this neutral expectation is usually estimated from putatively non-functional regions or sites, including intronic sequence [3,4]. A main requirement for the validation of this assumption is that mutation rate on exons and introns does not correlate with that sequence function.Mutation rates can vary strongly across the human genome, with positional differences up to 3-fold in the germline [5] and up to 5-fold in tumor cells [6]. It is influenced by several factors, including replication time, chromatin state, and expression level [7,8,6]. A priori, none of these factors are expected to correlate directly with genic sequence function (exonic vs intronic). Another important determinant of mutation rate is DNA sequence composition. Recent studies have addressed mutational processes and their associated sequence-dependent "signatures", both in the soma and the germline [9,10,11,12]. Germline and many cancer tumor signatures exhibit a higher relative rate of C>T transitions for single nu...