The nearly-neutral theory predicts specific relations between effective population size (N e ), and patterns of divergence and polymorphism, which depend on the shape of the distribution of fitness effects (DFE) of new mutations. However, testing these relations is not straightforward since N e is difficult to estimate in practice. For that reason, indirect proxies for N e have often been used to test the nearly-neutral theory, although with mixed results. Here, we introduce an integrative comparative framework allowing for an explicit reconstruction of the phylogenetic history of N e , thus leading to a quantitative test of the nearly-neutral theory and an independent estimation of the shape parameter of the DFE. We applied our method to primates, for which the nearlyneutral predictions were mostly verified. Estimates of the shape parameter were compatible with independent measures based on site frequency spectra. The reconstructed history of N e in primates seems consistent with current knowledge and shows a clear phylogenetic structure at the superfamily level. Altogether, our integrative framework provides a quantitative assessment of the role of N e in modulating patterns of genetic variation, while giving a synthetic picture of the long-term trends in N e variation across a group of species. diversification models (Fitzjohn, 2010), so as to examine the role of N e in speciation and extinction patterns. Finally, the underlying codon model could be further elaborated (Rodrigue et al., 2010;De Maio et al., 2013), so as to achieve a more complete integration of polymorphism and divergence in model-based molecular evolutionary studies (Hernandez et al., 2011).
Materials & MethodsCoding sequence data, phylogenetic tree and fossil calibration The coding sequences were taken from Perelman et al. (2011) and modified. It consists in a modified subset, codon compliant, based on 54 nuclear autosomal genes in 61 species of primates, and of a total length 15.9 kb. We used the tree topology published by Perelman et al. (itself based on a maximum likelihood analysis), as well as the eight fossil calibrations that were used in this previous study to estimate divergence times. These calibrations were encoded as hard constraints on the molecular dating analysis.