Surveys of variability of homologous microsatellite loci among species reveal an ascertainment bias for microsatellite length where microsatellite loci isolated in one species tend to be longer than homologous loci in related species. Here, we take advantage of the availability of aligned human and chimpanzee genome sequences to compare length difference of homologous microsatellites for loci identified in humans to length difference for loci identified in chimpanzees. We are able to quantify ascertainment bias for a range of motifs and microsatellite lengths. Because ascertainment bias should not exist if a microsatellite selected in one species is as likely to be longer as it is to be shorter than its homologue, we propose that the nature of ascertainment bias can provide evidence for understanding how microsatellites evolve. We show that bias is greater for longer microsatellites but also that many long microsatellites have short homologues. These results are consistent with the notion that growth of long microsatellites is constrained by an upper length boundary that, when reached, sometimes results in large deletions. By evaluating ascertainment bias separately for interrupted and uninterrupted repeats we also show that long microsatellites tend to become interrupted, thereby contributing a second component of ascertainment bias. Having accounted for ascertainment bias, in agreement with results published elsewhere, we find that microsatellites in humans are longer on average than those in chimpanzees. This length difference is similar among repeat motifs but surprisingly comprises two roughly equal components, one associated with the repeats themselves and one with the flanking sequences. The differences we find can only be explained if microsatellites are both evolving directionally under a biased mutation process and are doing so at different rates in different closely related species.
Microsatellites are a major component of the human genome, and their evolution has been much studied. However, the evolution of microsatellite flanking sequences has received less attention, with reports of both high and low mutation rates and of a tendency for microsatellites to cluster. From the human genome we generated a database of many thousands of (AC)n flanking sequences within which we searched for common characteristics. Sequences flanking microsatellites of similar length show remarkable levels of convergent evolution, indicating shared mutational biases. These biases extend 25–50 bases either side of the microsatellite and may therefore affect more than 30% of the entire genome. To explore the extent and absolute strength of these effects, we quantified the observed convergence. We also compared homologous human and chimpanzee loci to look for evidence of changes in mutation rate around microsatellites. Most models of DNA sequence evolution assume that mutations are independent and occur randomly. Allowances may be made for sites mutating at different rates and for general mutation biases such as the faster rate of transitions over transversions. Our analysis suggests that these models may be inadequate, in that proximity to even very short microsatellites may alter the rate and distribution of mutations that occur. The elevated local mutation rate combined with sequence convergence, both of which we find evidence for, also provide a possible resolution for the apparently contradictory inferences of mutation rates in microsatellite flanking sequences.
When homologous microsatellites are compared between species, significant differences in mean length are often noted. A dominant cause of these length differences is ascertainment bias due to selection for maximum repeat number and repeat purity when the markers are being developed. However, even after ascertainment bias has been allowed for through reciprocal comparisons, significant length differences remain, suggesting that the average microsatellite mutation rate differs between species. Two classes of mechanism have been proposed: rapid evolution of enzymes involved in the generation and repair of slippage products (enzyme evolution model) and heterozygote instability, whereby interchromosomal events at heterozygous sites offer extra opportunities for mutations to occur (heterozygote instability model). To examine which of these hypotheses is most likely, we compared ascertainment bias and species length differences between humans and chimpanzees in autosomal and Y chromosomal microsatellites. We find that levels of ascertainment bias are indistinguishable, but that interspecies length differences are significantly greater for autosomal loci compared with haploid Y chromosomal loci. Such a pattern is consistent with predictions from the heterozygote instability model and is not expected under models of microsatellite evolution that do not include interchromosomal events such as the enzyme evolution model.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.